LOCUS AB000381 35863 bp DNA PRI 14-MAY-1997 DEFINITION Human DNA for GPI-anchored molecule-like protein, complete cds. ACCESSION AB000381 NID g2102633 KEYWORDS GPI-anchored molecule-like protein; GML. SOURCE Homo sapiens DNA, clone_lib:cosmid clone:169. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 35863) AUTHORS Tokino,T. TITLE Direct Submission JOURNAL Submitted (09-JAN-1997) to the DDBJ/EMBL/GenBank databases. Takashi Tokino, Institute of Medical Science, University of Tokyo, Laboratory of Molecular Medicine; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:tokinta@ims.u-tokyo.ac.jp, Tel:+81-35449-5375, Fax:+81-35449-5433) REFERENCE 2 (sites) AUTHORS Kimura,Y., Furuhata,T., Urano,T., Hirata,K., Nakamura,Y. and Tokino,T. TITLE Genomic structure and chromosomal localization of GML (GPI-anchored molecule-like protein), a gene induced by p53 JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Furuhata,T., Tokino,T., Urano,T. and Nakamura,Y. TITLE Isolation of a novel GPI-anchored gene specifically regulated by p53; correlation between its expression and anti-cancer drug sensitivity JOURNAL Oncogene 13 (9), 1965-1970 (1996) MEDLINE 97088635 REFERENCE 4 (sites) AUTHORS Kimura,Y., Furuhata,T., Urano,T., Hirata,K., Nakamura,Y. and Tokino,T.
TITLE Genomic structure and chromosomal localization of GML (GPI-anchored molecule-like protein), a gene induced by p53 JOURNAL Genomics 41 (3), 477-480 (1997) MEDLINE 97312709 FEATURES Location/Qualifiers source 1..35863 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /clone="169" /clone_lib="cosmid" /map="8q24.3" enhancer 3460..3489 /note="p53 response element" exon 22563..22629 /number=1 intron 22630..28176 /number=1 exon 28177..28271 /number=2 CDS join(28199..28271,28881..28988,34291..34586) /note="GML" /codon_start=1 /product="GPI-anchored molecule-like protein" /db_xref="PID:d1020751" /db_xref="PID:g2102634" /translation="MLLFALLLAMELPLVAASATMRAQWTYSLRCHDCAVINDFNCPN IRVCPYHIRRCMTISIRINSRELLVYKNCTNNCTFVYAAEQPPEAPGKIFKTNSFYWV CCCNSMVCNAGGPTNLERDMLPDEVTEEELPEGTVRLGVSKLLLSFASIIVSNILP" intron 28272..28880 /number=2 exon 28881..28988 /number=3 intron 28989..34290 /number=3 exon 34291..34742 /number=4 BASE COUNT 8804 a 8796 c 9013 g 9229 t 21 others ORIGIN 1 gcggccggaa ttaaccctca ctaaagggat ccctcgatca tacactatgt ggcctctgtg 61 tctggcttct gtccctgagc accaagttct caaggctcac tgccttgttg cctatgtcag 121 tgcttgattc cacggaatgt actccatgtg gatagagcac atttgtctat cggttcatca 181 gttgaaggac atttgggttg tggaagacca tgtggggcca gaaaagaaga aaatcctaaa 241 aaaaagaccc aaaatgatgc ggactgtgaa ggaatagatc agacagctga aaacccttcc 301 aatggccaga gctggaacaa tttgagcagc aaaataaata acagtattgg attatactcc 361 tgtgcataaa ataaatgttt acaagtccat aatgatatca ataaatgatt gaagaaagaa 421 ggatattgga gagaaaagac caatcttctg cataagaatt caaataattt acatagatac 481 tttgcttgaa aaggtggagc ttaaattgcc actcctcaga tgtggctgca cttagtgact 541 tccttccaac catgacagca tggaaatagg gtgggggaat gtatgagtgt gttttcacgc 601 tgcttagaag gcataccaga gactgggcaa tttacaaaag aaagaagttt aatggactta 661 cagttccacg tggctgggaa gggctcacaa tcatggcaga aggcaaggag gagcaagtca 721 tgtcttacag ggctggcagc aggcaaagcg aaagaccttg cgcagggaaa ctcccatttt 781 taaaaccacc agatttcatg agacttattc actctcatga gagcagcatg ggaaagacct 841 gcccccatga ttcaatcacc tccctccagg 
cacctgcgac aacatgtggg aattcaagat 901 gtgatttggg tgggggacac agtgaaaccg tatcggggaa aaaacattac caggagagaa 961 gcctggcaga tacaacacca tggaagtggt caaggtcagc atcaacagtg gtgaatccag 1021 tgtcagtgcc tactgttgat gtggtgacag aacagtgatt cacttctgct tcagtcttct 1081 caaaacgacc atcccaggct aaccaccagg aaaacaccag gcaaattgag agacacgcca 1141 caaaatattc aaccagtacc tctgaaacct gtcgaggtca cgaaaaacaa ggcaagtctg 1201 agaaactgtc accgcccaga ggagcctctg taaacatgag tatgtcccac acaggctctt 1261 ggggggtgct agatgagaaa aggacattag gggagaatga agccaatctg aatcaagtgt 1321 ggattcagct cataataatg ggtcaatatt ggttcattaa ttgtgacaaa tggaccacag 1381 gaatataaga tgttgaaata gggatttcgg cctgggtgag gtggctcacg cccattatcc 1441 cagcactttg ggtggctgag gtgggaagat tacttgagcc caggagtttg agacaagcct 1501 gggcaacata gtgagatccc atctctacaa aaaaatggca aattagccag gcatgggggt 1561 gcatgcctgt agtcccaacc acttggcaag ctgaggcggg aggatcactt gagcccagga 1621 ggttgaggct gcagtgagct gtgattgcac cactgcactc cagcctgggc aacagagtga 1681 gaccttgtcg aggaagaaga aaaagaagga gaaggagaag atagcaataa ggctctctgg 1741 gtgtggggaa tataggaact gtctgtacta tcttgggcaa cttngctgta agtctaaaac 1801 aattataaaa gtaataagct cctctaaaaa atgtggaaat gcctatagaa gtgaggggct 1861 cgatggtggg gttcgtgtgg ctggcagagc tctgcaggga aatcgccaca ctgcctaggc 1921 cacatgctta gagaagttgt tctcctggtc cccagaagac cacatcttgg gtcttgttcg 1981 ccatgcaggc agtaacattc tgtctttggt aacctgaaga agactgcgaa tagcttgtct 2041 caggagataa aatgccaaga taatgaagct aaggccacag gtcacgccct tcagtcatca 2101 ttgcacagga gacgttctca caggcatggc ttgcacctag accctggtca caccctgagc 2161 cctgccccta gacacacact taggtctgac cctggctaca cactgagccc tgatcgtgct 2221 tccactctgt ggggcctctg gacatacatt gagccctgat cctggatata tattgagccc 2281 cgctcctggt cacacactga gccctgatct gacactgagc ctggattctg atcacatact 2341 gagacccatc cccacacttg gggtcatgct gtcacactat ttttgactct ggacaaacat 2401 ggatctctga ttgtggtgca caggctgagt cctgantgtg accccacact gtgacctgan 2461 tctggacata aattaatgct gatcttgctg tgaaattaag taattcgaaa tcaaagctgt 2521 tggaacttta aattattttg agacttagag gagtgtgact 
ataagacctg aatcatgtaa 2581 acaggcagct gtaacctttg ttcttctcat tatagattag cctcttcttt acctgcatca 2641 ttgtgcagta tgttgtaaaa gacaacaggg tgccaaagaa gaccactttc cctcttcact 2701 gttgatctct ttatacactg acttcccttt tacttctccc agacaaagac ttcacgacta 2761 tcacgttgtc taagatggaa tgttaaatat acttttaaac tggaaaaggt aaacaagttg 2821 taatcaaatt gctgtatctc ataaactggc cttgtgcgaa aaaagaaaca taagcctgtt 2881 aaatgtcttt gttttatctg tctagaaaag caagacctta actttccagc ttcggagcac 2941 tgacctcatt cttctagggt ctgtgttacc tggatggtca ttcttagctt tgcacttgaa 3001 taaatttaga ctggattctg atcctttcta ttatttcggg ttgccatggc tcacattctc 3061 aaccctggcc ctggtcatag tgattggaag atgcagtaag agctaggaac agagtttggg 3121 ggaagcaaag gaatgccagt ttgctcagag agagagagca cttcacgaag ggatcagaga 3181 acagctgggg gctcaggctg atggggctct taaaggggca tgagccggga ggcaccggga 3241 aaaagctggg cattgcaagg tgccccagca cctgctcttg gccatcctgc ttctttccct 3301 ctttccttcc cacccagaac cttgccctgc aacccctttt ctccctgctg cctctggcct 3361 cctggagcag ggagtacacc agcttttgga aacccatttt gaaggatccc agcctctggg 3421 ctccaggctc actgagtgtg gcacacagct gcaccatgaa tgcttgccca ggcatgtcca 3481 ggctggtccc tccctgagct gtgctggctc ctttgggagc agagactggg gcggggaggt 3541 ggaggacaca gatgattcac tctgggttcc caggcaggct gggtgcacat gtttccagct 3601 gatctggcta gccaggagcc aggatgcaag gatgcctgtg ttcctgagcg tcgggtgtgg 3661 ctggtttttg ccaggagcaa ggaggtggca aactggctaa acaagggtgt ctaggcttct 3721 gcctcagctc ccagcaggct gcagggctca cccggcacag ctcagctcgc ctgccctgca 3781 gactggggct gagccatggt ttgctcctgc agggcctaca gctgggggca ctggaggtgg 3841 cccggcaagg gtcattagga gctcaccagc caggcaacac ccgcaagggc ttccttctcc 3901 gaacacactg tcttgaagtt ctctgcccgc gaagtggggt ggaaattcag cacccccccg 3961 agcagagcct ggccgcgaga tagatgactt gccctgttta aagcggtgcc cagggacagc 4021 ctccacgtca ccatcactca ggctccctct tgcagcccct gctggcaaag tggaaatgtg 4081 gcttgcagag ttgcgtctga agccaagcag agggtgggaa cggagccaaa ctcataactg 4141 aaccaccagc acccccgttg ggcagccccg gaagtgcacc ctcggggtcc tgcagcggag 4201 aggagctggc cccacctgcc agccccatgg tgctcccagc cagtgatgag 
gcatggcggg 4261 ggctgctcct ctgtggcctc ctcttaggga agcttcggct gggatgtctc tgaggccagc 4321 cgacatctca aaatggcccc gccatccagg aggccccctc cttccgcaac tgtcagagct 4381 atgcggagtc ccacagctct cgcctcccac tcctgttccc tctcatcctt cacaggcttt 4441 tctggcaaaa tgttcttgca catagagtcc tgccttggca tctggtgtgg acaacaggat 4501 ccagtacacc caaagcccct ctacaagaca gcccccatct gcttgcagac gccccggcct 4561 cagccctgca cctttcctgt ctcacacaca tatacgatac acacacactc acacacacac 4621 aaaatacact cacacacaca aaatacactc acatacacaa gacacacaga cgtgtactca 4681 cacacaagca cagggacaca cgtacacaca tacccacaca tgttacacgc acacattcac 4741 accttataca cacagacaca cattcacaca catagacgtt acttacacaa gcacatgcac 4801 acatgtacac acactcccac acttgcacac tnacacacag acatgcactc acaggcgtat 4861 gcgtacatgt gcacacgtat gcacacttac tgatgtgcag tcacacgcta tacaccgaca 4921 cacacacgca gtcacacacg acacacacac ggcctgcact cacatgcaca ttcacacaca 4981 ttcacgatac agacatacac atgcacacag gtacattccc gcagacgcac actcacacag 5041 tgcattcata cagtacactc acactcggac aagcactcac acaagcatat tcatacgtac 5101 atctactcac acacgtacac acacaagcac actcacatca gtgacacaca ctggctcaca 5161 tgtgcactca cacataacac atcctctttc acacatgcac tgacattcac tctcaaacag 5221 ttgcacacac acacatttac atttactcac acacacaatt acagttactc acaaacagtc 5281 acaatcacac tcacatttgc atgtggacac acctgtaagc acacaaacac ttgtgcacac 5341 acctgcacac tgtttggact gcactcacac accaacccca gagtgcacca ggcttggtcc 5401 agccccccgg cgcttccccc gcgactgacc gaggctggtg taggagggag ggaggaatct 5461 gtcccccctc cggaggggag atgagggcca ttccagagct tagctgctcc gctgcctctc 5521 atcaagagca gggccaggga ggcccaacgg cttcggggag tcagagtcgg tggaccaggc 5581 aggaggaggg taggacctgt ggctcccctt ccacgaccac actgtgtcct gaggtgcctg 5641 gctctgggct ctggcagacc aggtttttca gtgtgggctg catggccatc tccttcctct 5701 ccttaacatt ctttttccat gaagtgggaa tcatgatccc tccagcccta agttgtcccc 5761 agaattaaag gggcatgtgt gtgtctaaca gagtcctggg accaaggaaa accccagaga 5821 aacaggggga cggtgctgtg aaaaggaaga gttggctgtg agggcctgat ccaagtcacc 5881 taggaagatg ggggcaagag ggacccgggc ccagggagcc gcggagccgg ggaggagtgg 
5941 ggcgaggtct gtggccggct tcaccgggcc tcctctgctg cctgcccact gctccccaat 6001 gctgggcagg acctcgagtt tcctcacctg caggacctcg actttcctca cctgcaggat 6061 gggtcctccc ctccggcctc tgctcgcctc tccttctcct tcccaaactg cgcagctctc 6121 agcaccctct tcgggccact cccagcaccc agcaaacaca tctaaaacag tgtagcacag 6181 ctcacaggcc ctggcacgct gtgggggctg tgacggggcc cagcagacca cggggacgac 6241 gagcatcatc agagagtaac ctgcaacctg gccatgctgg gtcctcctgg ggacctgact 6301 agcaggaggc acccttgagg tctcggtggt tctggggccc acaggcttca gatctggccc 6361 gtctcccatc ttccactctt ctccacggct cactttgcct gaaaaatagg ctaatgctgc 6421 ctgctctact ggccagctgg ggaggctctg ttggtgaaat gagagcccat aggctggaaa 6481 gtgtttgaaa catttgaagg cccaagtctg gaggggaggt cccaaggtcc actcagatcc 6541 tgagttcatt ggctcagcct tggggcgggg ctctcgctcc cggaagggga tctgcctggt 6601 gaagcccagg ggagtggatt caagtctaag tccgccgctg tggatgtggg ggctgagcaa 6661 accactcagc cctctcagag ggcctcatga gggtcccaat ttctggcctg ccagggcatg 6721 acctccctgg agacctcact ccagacgcct agaagtcagt gaccctccag caagagggag 6781 aggagcccgc agccagggct gctctgatgg acgccacagg gccgttccga cccatgtccc 6841 agactccttt taacacaggc ctcctctaga agtgttttcc aagtgctggg ggtggtaaga 6901 agggggttcc aagaactgcc cagtttggga acgagagggg gtgggagaac tgagaggggc 6961 tgggtcccca gggccagttg tactgaggag gaggcttccg acagccctgg gcccccaacc 7021 ctctgctggg cttctgggaa gcaaacgcct aagtgtcctg ttccctgtag cccagctaag 7081 aaagaccctt gagggaagca ggcagccaca gaggctgtac agtcttggga gccgtggcac 7141 agcccagggt gggccaggac agtccccgtc cccctcacag ccctggggag ggtaaggcag 7201 agcccatcag gctgtcccca gagacgccag ctacctcctg ggccaggaga caggagtcgt 7261 agtccttgtt tagaggttgc ctaggggaag ccctggggga gatggacagg aggaggacgt 7321 ggctaagagc tggggtgggt gttgcggcct ctccaggagt attctggaga gaatttccta 7381 aacagaacag gctctgtgag gccacccaca cttgctgcct gctggcgacc cctggagtgg 7441 catcccccct ccccacttct gtgggaccct cagacacagc cgtcgccatt gctgaatgac 7501 cgtcgctgcc acacggccag ctggaccatg cccacaactc accaaagggc cagaggcacg 7561 actcaccgaa ggggtagagg tgggatcctg cccactggtg ggaggagagt acagagatgg 7621 
actctaccca gttgcgacaa attcaggagt ttcccgaagg gactgcgacc tctgtaggca 7681 agcccaaaac atctagggga ggccgggcgc acaccccctg ggaccaaggg tgtgagccca 7741 gggctggcga gtgggtacct gggtgggtct gccaggtggg gagccggctc tgtctgccaa 7801 gggccctgag ggcaggatgt ggtctgccgg acagcacgtc tgctcacaga gggaccacct 7861 caggagggct ttcttctgag cagttgtgcg tcctgtgctg agcaccttgg ggaaactgag 7921 acaggaggta gtgtggagcg gtgagtatcg tggcaggggt gcctcttcac cagctnacgc 7981 cgcaaagcca gcctacccca gcctccagcc ctcatgggcc ttggaacccc tccgggcatt 8041 tccacagccc actcccagga aagcctggct cacaggtaac cagagcacac tcctatcaca 8101 gtgagtccca cagcacccat gcctggatgc tggccaagct caggcgcctg ccactgccac 8161 tgccactggc tgtgccctgc ctgcaggtct ccctcattct accctctcct cagattctct 8221 cctctgcctc ttccaggaag cctcccgtac tccacccaga gccctcttgt cctgaattcc 8281 tggggttctt tgggcttcgg ggataagtgg gtgtgcagct gtcgtgtccc taggtgaaag 8341 ctggaccagt gcccagcctc cccgatggcc ttcttagggc ccaccctgca aacctctgct 8401 ttatcttcca gatacaacct cacccatgaa gggaaggcca cgcctgcatc ccttagggtc 8461 agaagccaag acgccctgca ggctctgcac ctgactctgg agcctaagat ccctagaagc 8521 agggcagtgc tgcctcccta cagcttagtc ctgcccatgg tgggtccttg gttcagacat 8581 tcattcattc cctcccctcc cccctgcccc atgttataga caggctggct gggccctatg 8641 ccgggggtgg gccaaatcct gcatgtgcag aaggatcccg tgagagcctg cccacagcaa 8701 ggtcggtcag aagggatggc tgaggcacaa ggggcggggt ttacacccca ggtgggggca 8761 gcaccgtggg gtgcaggagc ccacagggag ggtgcagggc catccagcaa ggccagcctg 8821 cacaaggccg ttcctgccca gtccaggtgc agagccttac taggacctgg gagagcaaag 8881 ggacaggcaa ggtcatacct gcgggggtgt ggggctcctc tgctgggcct agacgttgag 8941 ttcatacccc tttgccaggt tgggagaggg catcggctgg ggacaaagcg ggtcaagcag 9001 gtatgagatt acccttggat ctcttcctcc tgcagcttgg aggatgccag caggggccgg 9061 cgttgaggag gccctcgggt ggcatcacca gcatggctct gagggtctct atttccagag 9121 ctctgcatct gtgagctcag gcaccccacc cagtcagcga gggagaaggc aggccgctcc 9181 tcagtggaca catgccagtg gcctcaggtc ccattgctcc acctctcgca gtggaaggcc 9241 acccccaatg tgctgagtcc actcagcaga agggaagcgc tttccctgag cttgcaggac 9301 tcggcccaga 
gcccggtgct tagagggctc tgtgagcctt tgggcccagg gggcctgtag 9361 acgcctttgg aggctggaat gtccgtggaa agcaccacct tgctgattct gccgcagagg 9421 ctgggctggc atgtgggctg gggcttgtcc taacactggg accctggcag gtctctgggg 9481 ttgcaggcag gaagcaggta aggcttccac actacagcct gtatgaggtt gtataaggta 9541 aagcattcat gtattcattt gctcggcaag tattcattta cttcctacat gctggacact 9601 gtgctgggta tttgccacag tggccttgac tggcatccac agggccagca gcttaagggc 9661 acacagcctg ggggccatgg gggtggtaag aatgtgggcc tgagatggtg ggaaatgcct 9721 cctgcacgga aaacatgctt cccgtagtcc caggcagctt agaagaccta ctgatgagca 9781 tcgaaacaga atcctgctcc taagcctggc tgtgctctcc cagtgtgagg tctttggttg 9841 ccgtccatgt cattgtcctc ctccttacca caggggccct gggcagccaa ccttggcaat 9901 attcatgcag atctaatgaa accatgccca cagacaccgt ctgtgcccct gcctggatac 9961 ctcatgcctg tgcctggttg gaggctgcac atctttggct ggggtttgct cttggcgtgg 10021 attccatttt ctcccagcaa ggtaggtgtg tagagggggt gtttttctag caagcagcca 10081 gtggttcgca tgttttttca tacatggcat gtggcattag caatttaaat ctccaccttt 10141 ggctttgatt tttagcatta aaaggaggaa ggggtaagtc caggttgaaa tttaggtcta 10201 aagagacatg tgggggttna gggaagtccc tagccccgtg cttcaggggg ctgattcgat 10261 gcaagttagc tgagcaagag ctcgagtaag gggatttgtt attatccctt aggccagatc 10321 accacagaag ccagccagcc tgcttcattc cccctgagag atttcatctt cattttaagg 10381 ggttttgggg caaaggtctc ttcttctgga gctacttcct gctgaggcga gtgtcattcc 10441 tatatccttt acctttacct gagaggaatg gaactccttt cagcttaagg ggacaatctg 10501 ctagggctga gggcttctgg aacacactgg aagcactgca cggtcagtaa cattgttgca 10561 atctagaaga caaaagtttt cccaggaagt taaacaagca tggtccaaaa atcaaaagaa 10621 ggatagccgc tgatggtcct aaggaaggaa gaaaccaact tgtagagaga gaggtgatga 10681 ggcagcattc ggaactgact cctctgctgg gctcccttta ccagagagtg tggccacatt 10741 gtctgattgt agattgcttg gatgttcacc tctgcttgcc ctgtagtatt tatctaagca 10801 tggcaggagg tgttggtgac tatgcataca cctccttgtt ctgccgttag gtaatcaagg 10861 gccaatggtt acccaacacc acgttatcga aagaactgag ggaaggcttc acaccttcta 10921 gagcctgccc agtctgatcc actacatgac cgatctgcca tgtcaggcct ccccaagggg 10981 
ctggtaaacc aatactcagg cctattccca ctggcgccat gccaaggcct taccttgctc 11041 acacagtggc ctgatgttgg gaggtgcagt tgtggttact ccgtttatcc ccatcgatcc 11101 caaggcacat tcacctgtga atcacacatt ggctctgcag gaataggcct gagccaacag 11161 aggaaaagta taaggaaagc aggcagtgga ccacaagagg ttattagggt ggcctcgagg 11221 gataacaatt ccaagaggag cccaaattcg aatgtccctg gtgtcattga gaatggagag 11281 gttcaggtta tgaattgtgc agctgggggt atcaagccag ggtcctgatg atggtctgca 11341 tatcccctgg tgtcattgaa aatggagagt tcacgttatg aaatgtgcag tcgggggtat 11401 caagccaggg tcctgaagat agtctgcata tcccctggtg tcattgaaaa tggagagttc 11461 aggttatgaa atgtgcagtt gggggtatca agccagggtc ctgaaaatag tctgcatatc 11521 ccctggtata tatcatttat ataaggatgt tacttaacta ttgctgtctt ccagaggcag 11581 cttcttgaac aaatacttga ggtccgctat aggtttatgg gagtagactg ttttctccaa 11641 gcaggcctgt cgtccatcag gaaaagcaaa aggttttatt ctaaaaacaa ttccttccaa 11701 ctgtacagct gatggagtgg ccacggaaac ctggtatgga cccgctcatt tctcagtgaa 11761 ctgagcctct gggcttcctt ttccgccatg tcttgagtag gacctgtgca cctgggctag 11821 cagggcaaat gatctgaaac tccatgaccc ggctgaggtc acatcttatt atcatattgg 11881 aagatggcct gctgaacttg tcccaagtta atgatgtatt ttacagcctg atttagctct 11941 tcgtctagaa gaaagtggct aggaagaggc tagcaataag aggagtgggt tgggatcctg 12001 tcggcatcct atgtcacgct tagacaacga tgctcgccta gaaactcagg ttagaggctc 12061 ggcccacaag gttatatgcc ctggggtata tctggaccaa tggaatctct ttgcaagatg 12121 tttgtccgtg tgtgatctga actaatccac cttgtgccct ctgctgggga tgggaaagct 12181 ctacttctca tcatttccca gctcattgct ttgctgtaca gcagccttgg ggcagccccg 12241 aatggctggt ccagaggttt ccaccttcca gtggtggatt ctaatgttct ttatgtgttt 12301 ctctctgttc ttctctgcct gctttggatc ggctgttact aagctgctgc cgagtggcaa 12361 ctctgcttat ctcacggtgt tgaggtgaga ctcactgctt ttggtattac tgattcaagg 12421 ctacttagag attgtctttt cttgtacagt tcagccagtt ctggctaaaa tgtatacatt 12481 aaaaactcat ttaaaactga agaaaaaagt gatgaaaagg tttttttttt tttcttaaaa 12541 ccaaacttcc ttaaagacta ctttaccaaa actttggtcc acagctttcc ttggattgct 12601 tgtatgtctg ggtcaattaa gaaaagttta tgctctggga aacatgtttc 
tagaattgcg 12661 ggtcttatct ataaagtgct aacatctggc aaacagtgta ggatctcttg cttcctcggt 12721 tttcagactc acagatgaat gaatgccctt aactgtccct tcccatccct tcccatccct 12781 tcccgctccg gtttcctgta tgcaatgggt tcagccttcc gagccctggg agaagcaggg 12841 attctcatgc ttctttatgt tcccctaggg actaggggtt agcagtcctc accccaatgt 12901 gtgaggtcct ttgggcttta tttgctgggg agcctggagc cagggatcct ccaggtcagc 12961 cactacccgg ttgcatgggg cttttcggcc tttaggggcc cagggttgca gtgacacccc 13021 aatggagagc ctgtttgagt ctcagaggga tatttgccag cacatactgc taggtccctc 13081 ttgtcccctt tgtcagctgt gtatttgctg tctgtgcctc attagtgaca tctgtcatcc 13141 tgagcttgca tgctctgaag tttacacctg ccattttcct tctggtcctg gtgattttca 13201 ttctttgggc tatttaattt ggcatctctc tgggcaatgt tggaggccaa aagcctccca 13261 ggaactagtc ttattgagaa gaaaaataat tttgtctaat tcagaagtta tctaaaggtc 13321 aattcaaatg atgaacttga aaacagttac ttacaaaaca agttagaaaa gagccagtaa 13381 gtaggggagc gagagatgtg aataaaagtt atgaatatga agatgtattt ttggtaagga 13441 aggttatcaa gaaaagagaa taacgttgta tgaaaaagga tcttgtatgg taaatttttg 13501 tcctaaaata aaatggctga ttatttagga aagagggaaa gtttgggcaa ggcagaaagt 13561 ccaagcatgt cttcaatgat ctgtgtatgt cagggtcaag ttcatgaagg ggaatttatg 13621 aaagaaattt cgtatgtaat taagttggtt attattaaaa ggaagccact tataatagtc 13681 tttctaaaaa ttggtctcct atgttaaaac aaaatgttct taaggcactg atttgctctt 13741 aatgacatta caagaaattt tactttcact ttttttttgt ttgttttgtt tttttgagat 13801 ggagtcttgc tctgtcgccc aggctggagt gcagtggcgc aatctcggct cactgcaagc 13861 tccgcctccc gggttcacac cattctcccg cctcagcctc acgagtagct gggactgcag 13921 gaacccgcca ccacacctgg ctaatttttt gtatttttag tagagacagc gtttcaccat 13981 gttaaccagg atggtctcaa tctcctgact tcctgatcca cctgcctcgg cctcccaaag 14041 tgctggaatt acaggcatga gccaccacat ctggccaact ttcactttta taaactgttt 14101 ctctcgaaaa cttctctttt tttttaaatt tttttagtat ttattgatca ttcttgggtg 14161 tttctcggag agggggattt ggcagggtca taggacaata gtggagagaa ggtcagcaga 14221 taaacatgtg aacaaaggtc tctggttttc ctaggcagag gaccctgcgg ccttccgcag 14281 tgtttgtgtc cctgggtact tgagattagg 
gagtggtgat gactcttaac gagcatgctg 14341 ccttcaagca tctgtttaac aaagcacatc ttgcaccgcc cttaatccat ttaaccctga 14401 gtggacacag cacatgtttc acaaagcacg gggttagggg taaggttata gattagcagc 14461 atcccaaggc agaaggattt ttcttagtac agaacaaaat ggagtctcct atgtctactt 14521 ctttctacac agacacagta acaatctgat ctctctttct tttccccaca tttccccctt 14581 ttctattcga caaaaccgcc atcgtcatca tggcccgttc tcaatgagct gttgggtaca 14641 cctcccagat ttctggggat ggtggccggg cagatggggc ccctcacttc ccaaaagtgg 14701 cggccgggca gaggggcccc ccacctccca gacggggtgg ctgccgggcg ggggcgcccc 14761 ccacctccca gacggggtgg ccgggcagaa acgctcctca cttcccagac ggggcagctg 14821 ccgggaggag gggctcctcg cttctcagac ggggcagccg ggcagaagga gctcctcagt 14881 tcccagacgg ggtggtggcc aggtagaaga cgctcttcac ttcccagaca gggcggccgg 14941 gcaaaggcgc tcctcacttc ccaaacgggg cagctgggca aaggtgctcc ccacatccca 15001 gaagatgggt ggccaggcaa agatgctcct cacttcctag atgggacgac ggcagggaca 15061 gaggcgntcc tcacttccca gactgggcgg ccgggcagag gggctcctca catcccagac 15121 gatgggcggc caggcagaga cgctcctcac ttcctagacg gggtggcggc cgggcagagg 15181 ctgcaatctc agcactttgg gagaccaagg caggtggctg ggaggtggag gttgtagcga 15241 gccgagatca cgcnctgaca ctcccagccc tgggcaacat tgagcactga gtgagcgaga 15301 cttctgtctg caatccccgg caccttcggg aggctgaggc aggcagatca actcgagttc 15361 aggagctgga gaccagcccg gccaataggg cgaaaccccg tttccaccaa aaaaatacaa 15421 aaaccattca ggcgtggtgg cggcgcgcct gcaatcccca ggcattcggc aggctgaggc 15481 aggagaatca ggcagggagg ttgcagtgag ccgagattgc ggcagtacag tccagccttg 15541 gcaacagagg gagaccgttg aaagcgggag acggagacga gggagagggg gagaccgtgg 15601 aaagcgggag atggagacga cggagaggga gagggagaag gagaggcaga ggctcaaaaa 15661 ttttcagatc catatttcag tatttcagac attccatttt ttctgtttca ctgctttcag 15721 ctttttcccc cttgagcagg cctgagatga tgaccttctc cttcagcatt tttttgtcgg 15781 ctcctgcacc ccttttttcc tccagtttta actgttggaa tgacctgatg ctaatgtttt 15841 agtttgaagg tttagaaaag caatgttttc ttttagtata agttgatttt atatttcttg 15901 gcttttcttg ctgtgtctaa attttcagtg taatcagaaa acttctcatg ctgttcctaa 15961 gggtcacata 
tgctcataga ccttaaacac tgttcctgtg tctgattgca ttcaagcact 16021 tctttcctca agtttgtctt ccaggttatc taaatgggct tcctgtaaga agcagcggtc 16081 tcactgcaga aggttttctc tgccttttta tcagctggct ttactttttt tttttttttt 16141 ttgcaactgg gtctcacttt gttgcctagg ctggagtgca gtgacacaat ctcagctcac 16201 tgcaacctct gcctcccggg ttcaagccat tctcttgtct cagcctcccc agtagctggg 16261 actacaggca cctaccacca tgctcagcta atttttgtat ttttttggta gaaatggggt 16321 ttcaccatgt tggccagtct ggtctttaac tccctgacct caagtgatct gcccacctcg 16381 gcctcccaaa tttctgggat tacaggtgta agccaccaca cccagcctat cagctggctt 16441 tttttaaaaa tgcaagattt tatattttat caagataatt tcttatgctg tctttattag 16501 gttttctatt gaaaaaaaaa ccctgaaatt taaaaggctt aaggttttta catccatatg 16561 actttctgta ttgctcgtaa tgtcttcttt attattactc ttggttaaat gtgtggggaa 16621 aagaaagaga gatcagactg ttactgtgtc tatgtagaaa gaagtagaca taagaagctc 16681 cattttgttc tgtactaaga aatattcttc tgccttgaga tgctgttaat ctgcaaccct 16741 acccccaacc ctgtgctcac agagacatgt gctgtatcga ctcaaggttt aatggattta 16801 gggtgcagga tgtgctttgt taaacaagtg cttgaaggca gtttgcttgt taaaagtcat 16861 caccactctc taatctcaag tacccaggga cacaatacac tgcggaaggc cgcagggacc 16921 tctgccgagg caaaccaggt attgttcagg gtttttcccc acgtgataat ttgagagatt 16981 gccttgtggg aagggaaaga cctgaccgtc ccccagcccg acacccataa agggtttgtg 17041 ctgaggagga ttagtaaaag aggaaggcct ttttgcagct gagataagag gaaggcatct 17101 gtttcctgct cgtccctggg caatggaagg tttcggtgta aaacccgatt gtatgttcca 17161 cctactgaga tcggagaaaa ccgccttaag gcaggaggtg agacatgctg gcagcaatac 17221 tgctctttaa tgcactgaga tgtttatgta tgtgcacatc aaagaacagc acctattctt 17281 aaacttgttt atgacacaga gacatttgtt cacatgtttt cctgctgacc ctctccccat 17341 tatcgcccta ttgtcctgcc acatccccct ctccgagatg gtagagataa tggtcaataa 17401 atactgaggg aagtcagaga ccgcaggtgc gggtcctcct tatgctaagc gccggtcccc 17461 tgggcccatt attgtttctc tatactttgt ctctgtgtct ctttcttttc tcagtctctc 17521 atcccatgtg acgagaaaca cccacaggtg tggaggggca ggccacccct tcaaaatgaa 17581 taactgtcac aatgacctgt gattctgttt tgatcaaatg ttttgagcct tttaacatct 
17641 ttaacaaata ttctcaaaat caaaatccta aattaagtct ccgatttagt gtcctcgctg 17701 gggcttatta aacctataaa aattaatctc tgtgaggtta taaaatcttt gtctagcttc 17761 caggtagacc atgaactcca gtatcaccac ctccagcctg ataattatat attggaagaa 17821 aattagttaa aggattccct tcagccccct tgaaactgcc cttatcagga gttgttagct 17881 aatccttgtg ctcagttaat aacctgttgc aggcccttga ctcctgggca cacatgtcta 17941 aagaaggcac tgacttccac cagtatctga caaaaactca agttaactga aggctcgtct 18001 ttagacctgg gcaaagggga caacgtaaac tgctttcatg agacacaagg acaggcctgt 18061 attccaaaac attatgattc atttcataat tttgccttta tctgaaataa tataatttgt 18121 tctatgcctt tatactaaat aatttaaatg tttaacttat ttcctggcat tctcaaaatt 18181 agatagggct tatgaccttt tctttaaaat gttgttaatt ctttatgttt tcttttacct 18241 ccaaaattta ctcccaccag gcccagggag tgtcataagt gttttcaggg ccagttttga 18301 ggaataaaat gagtgcaaac cttccaaata aaggacaggc acacaaatgc ctaaatggct 18361 gaacaaagtg cttgtgtttt gtgtagctaa ttgccgcaaa ccatgatgac agtagctcat 18421 acacagaatg tataaagaag ccaattttgt aaccttgcct tttggctttt gatttttggt 18481 tctcatgttg ctgaaggggg tttaagatta atgagtgcct gcccacctac attcccatct 18541 ggcctagagt acttcattgg ctctaagtct tttgactcaa agtcccttgg ccacaggaat 18601 tcccaccaag gggcatgatg gacccggggc aggctgccca ccactccgac tccagtgtcg 18661 acatgggccg caatagaagc atggctgttg acactgccgc tggcatacgt tgaccacaaa 18721 aggaggaaaa taaacgtaaa taaataaata aaatcctaag cacccccatc cccagcagaa 18781 tggggccccc ttaggccaca gaacctgaga aactagttta ggccagagag gggggaagtg 18841 gagctggtag tcagagggcg gggaaggcat ggacaagcct cactatacct ttcccccacc 18901 ttggaattca ggcacaactg accagcatta aaattaaaag aggtcataag actgacaaaa 18961 cagactggat cgaaggccct gccaggcagg attgaagttt gcaccccagg gtagacagag 19021 gtgtaagtta caggaacgtt ttaaaatatg tttgcgtcag gaccaccttc atgaatattc 19081 atagctcctc tgttcaacca gaagacttca tcttattaaa aacctggaaa gaaggatcct 19141 ctgaggatca gttacagccc agatggaggg gctcatcagg tattgttaag cacccccatt 19201 gccgttaaac ttcaaggact cactagctgg atacacctgt ctaagattaa gcccgtttct 19261 tatgagtccc cacaggcacc tgagaaagac actatggcca 
acacctgtga actcttagaa 19321 tacctaaagc tgttgttttg gaaataaaca gataactagc atcttagtat ctccctccta 19381 ctcagcctcc ttaacaactc tcatcccata caggtttgca cccccagtgg atacctatgg 19441 tcatgtggtc acctattcaa tcagcatata aatagcttat ctctcatttg ggaccatgaa 19501 ctcaatggtt accagcatat tgacaacatt tggtcagggg tcaatgtaca acaccctata 19561 atgtacaatt atataccatc acttacctga gagctaagtg ggcaattccc cttatattgg 19621 ctggggttga gcctgccatt ggcttaccca ccccatgggg aaaatttgcc tattatgagg 19681 caactcttca aaactttact tcttcattct acgcactgtc tcataaaaca ggagacactt 19741 tagaaacatt aaaactctcc cttgattcct ttgcagatat aataattgat gacagattag 19801 cacgagagta ttatacttat gggttgaaca aggagtatgc actataagta ataaaacctg 19861 ctgcacttac gtaaacacct caggtcaaat agaagaagat gaacgtaaaa tatatgagcg 19921 agctgcccag tgcataaata caaccaaggc tgtgatccta actatagctg gtcaacaata 19981 aaaagcaccc tcctgagtct cacttggttt ttgcccttcc taggaccttt gatatttatc 20041 tcattattag taatctttcg cctttgcttg tttaacttaa tggtaaagtt tgtgtcttct 20101 agattacaac aattccacat aaagattatt ctggcaaccc atcccgtctt ctgaccttgg 20161 aaataaagac atcctgcctc aggcccctta gatcaggcat ccagagattt ttacacctcc 20221 aatactaggc aaagcataag cccatagaat catcagtcag gctggagtta gaacgtatat 20281 gactatgtga tcgtaaaccc cagacacaaa aaccacttgc aagtggtcat gagtcatcct 20341 caccctggga agcgaacaga aaccatcact ggagggggac ataccctcca ctcagacccc 20401 cagcctctgg tttactaatg acaaatacag atatgccaca gataaccaga ggcacagaag 20461 tgattaagcc cccaggagca aaggcagaag aaacaaagtc tccaagttag actcccaagg 20521 aattcccata ggaatgattg gaggcaaatg taaacataca gccttgaaaa gcaggaaaaa 20581 atcaggtcac ttccacttct acatgggatg cagtaggttg tggctgctca gaatgtgaca 20641 accaggcaaa tcaatccaca gagttaaaag tcacatttga aaggcccttg gacagatgtg 20701 gaagcaccaa ggcctgcatg aactgaaatt ccgcagaggg cgagcccttc cagggtacct 20761 ttgtcattgc aaatacattt gtcactttgg gcaccagctg aaagccaggc tggctacaga 20821 cagggactcc agtagggaat gcaaaccagt agtgccgtct tggctcacgg ggaaccaaag 20881 tggcccagaa tctccaggta cctaaagatc aagacagttt tcccatggaa gagcctccaa 20941 gtgctgtggt gggagggacc 
ctggggagtc agagaaggcc aagtagaatc ccccccagtg 21001 tggcagatcc caggagacag agtgccccta gaaggagcct gccacagtca taccactggt 21061 ctccatcata agccagtggc cccgatggaa caggtgtgtg ggatgaggtt caaggggaaa 21121 gtcaaagcct agacagggaa acctgtccga gctgaggatt cggtgtgaaa ggtgagcata 21181 caaattgggt cacccttgtc acacccaact aaggaagagt gccaggtagg gagaccatcc 21241 agaactcagg atagaaaaca tggttccaag aatgtactgc tctgaggcct gattgttgaa 21301 actgcctggg gcaacctgaa accagtttta tctgatggtt actgagacca cctgctgaaa 21361 ctctaagcct cggttcaccc actgttgtca ctgattcact catcacccag cgttcgccag 21421 ctccccaaat gcctactagt gccaatgaaa ctcagacagg aatatgtact atttctcttc 21481 tttataaaaa agagacatag ttcctttttt tttgagacag agatcttgat cttgtcgggc 21541 gggtgggagc acaatggcag gatctcggct cactacaacc tccacctcct gggattcaag 21601 ggattctccc tgccgtcagc cgtccctgag tagctgggat actggcacac accaccacac 21661 ccagctaatt tttgtatttt tagtagagac ggggtttcac gatgttgatc cactctggtc 21721 ttgaactcct gacctcaagt gatctgccgg cctcagcctc ccaaaatgct gggattacag 21781 gtgtaagcca ccgcgaattt aaaaaaagcc ttttctttgt tctttggaca tactgaaggc 21841 cagccagtct gcccccgtgc ccctaattgc agttatttcc tcccaaataa aacatttcag 21901 aaattcacct attttatatg acttcgacaa gagccaagcc acactgtgta ggagatgctc 21961 gggtgaagcc agctgtccct ggccacacac agcttgaagg caaaatggcg ggagctcaag 22021 cgaagccagg caggtctccc cctgaaacct ctgcacaggt gggcgctggc ggggctgaag 22081 ggcaccgctc aagctggctt tggtcagggt tagggacagg gccggctcca gagcgacggg 22141 gtcgcaccct gagcggaggg gctcccgggg tccccagggg ttatggtgat ggcgggaccc 22201 cctagaggga gggtccagga acgaggcagg tggactggga ccttgagggc agggtgggcc 22261 tgggcggtgc caacaggctg ccgagggtgg ctgctgtccc gtttctccag cagccagggc 22321 cgtgcgcttc tagaacattc cccgtggctt tcctaaaggg agcgtggatg gagccagggt 22381 gtgcagattt ctgtgagaaa tattttcacc tgggagcccc aagggggacc ctgcccacct 22441 ttgaacactg gcagatgggc tggaggcctg gagcctgggg acggggcttt gagggtccgg 22501 aaggagggcg gggctgtgcg gccggcggag gcccccgaaa gttccagaaa gcgggtgggg 22561 aggtcggcgc gggaggctca aatggctgga ggcggcgggc acgcacactg cgggctccga 22621 
ggggcacagg tcagtttcct gaggcgggtc gggacccagg cgtgagactg gagtctgccc 22681 aggggcccag ctgagccagc ctcctcgtca gctgcttggg ccgccaggac gccgccgggg 22741 gtgcgccgcg cttccctgga tggggtgccc ccactcccct cggagcccca gggagacccc 22801 ccgaactcag ctcctctcag gggtgccagg gggacccctc aaactccact ccccgcaggt 22861 tcctggggag acgccccctg ctcgattccc ctcagggtcc cagggagacc ccctaattca 22921 gctcctctca ggggtactgg gggacctctc gagctccact cccatcaggg tcccagggag 22981 accccccaac tatgctcagg ggtcccaggg agatgccagc accccaactc cgcttccctg 23041 gggcccccct ccccttacag ctcaacttcc ctcgagagtc tggggctggg gctccgttca 23101 gttcttgagt ccccttccct cggggtgtcc cggggccgcc cacccccaca ctgtctgtga 23161 ttccccaagg cgcgggtctc gggccgcagc ctgttccacg ttctgctgct cgttcttttc 23221 tggctccttg ctttcgaagg agagaaggag gccttcgttt ccagtctttt tgccttttct 23281 aatggagccc tgcttttcct tccgtgtccc ttcaggctac ttctgccagg tttctatttt 23341 tcattcttta ttatgacttc gcccaaaata ttcttgactt ctattgagaa ggattcgggg 23401 gtctatttct tattcggagg cgtgtgctta agttccaaac agatgaggat tttccagtta 23461 atccttctgg ggtgacttat tgcttaatgc caccatagcc agaaaatgga ctctcagtgt 23521 ccgaaactgc attcggctct gaagtgtctg tccttgtcac ctcttgcaat gtttcgcggc 23581 gggaagcctg cactcgccga cgctgacgta actgtttctg tctttcaggt ctacagcctc 23641 ctgtgggtgg gcgatattga catatacttt atttctatat atgttatgaa ctcaatattt 23701 cttgcagcgg gtctgctgat aataagatat gcctactctg cgagtctgga agccatctta 23761 agcttaccct gtatgtgccc catgcatctc ttccgttaca cggctcctga gttgacacct 23821 gtgtgataaa ctggtaatag caagtaaact gttttcttgt gctctgtaag ctgctctagc 23881 aaattatcta ggaggaggtg gtcttggaaa cccctgattt ataagcgggc agtcagcagt 23941 acacgtggcc cagaatcgtg attggcattt gaagtggggg cagtagggtg ggactgagcc 24001 cttcacctgt ggggtctgcc ctgctcaagg cagtgtcaga attgaagtga aatgttggac 24061 ggtcggtgtc cagagagttg gagaactggt ttgtgtgtaa aaactnacat atttagggtc 24121 agaagtatgg tggtgtagaa ataattcgta gtatagtccc ttagttctgc ctcttacaca 24181 tcctgttatt tttnaaattt tttcatcttt aagaaatcaa tttagtaatc tttttatctt 24241 taatgctctc attagaattt ttaaagtact tgtatgctaa tgtcaacatt 
tgagtgaact 24301 gtgggtttat ttctattgtt gttttattga ttagcaataa tttattccta attcttacat 24361 atcctgtgat gtttttattg aacatcttgc gttgcgtaca aaatagctgg aaagcttgga 24421 ggtggttgta tgttactact ggggatgtgt gccctctcct ctctccagga gcctgtggga 24481 aggcctggac tactggcttg ctcaaaggtg gaggccaagt gggatgcttg cagcttcaac 24541 ctgaccctgg ccgcctctgg agccggccct gtctctaacc ctaactgaag ccatcttgag 24601 cggtgccctt ccccttctcc tacagagtgt ggcttgtctc gtgttgaagc caaataaaaa 24661 agatgacttt ctgaaattga aatcttttct ttcggaggaa ataattgtaa ttaggggcat 24721 gggggcagac gggctggcat tcagtatgtc caaagagcaa agagaaggat ttttgaaatt 24781 tgcagtggct cacacctgta atcccagcac tttgggaggc tgaggcaggc agatcacttg 24841 aggtcaggag ttcaagacca gcgtggccaa catagtgaaa ccccgtctct actaaaaagc 24901 caaaaattag ctgggcgtag tggcgtgcac ccatagtccc agctactcag gaggctgagg 24961 caggagaact tgcttgaacc ctggaggtgg aggttgtagt gagccgagat cacgccattg 25021 cactccaact ccaacctgca ggacagagcg agactctgtg ttgccagaaa agaaactatg 25081 tctctttttt ataaagaaga gaaatagtac atattcctgt ctgagtttca ttggcactag 25141 taggcatttg gggagctgcc gaacgctggg tgatgagtga atcagtgaca acagtgggtg 25201 aaccgaggct tagagtttca gcaggtggtc tcagtaacca tcagataaaa ctggtttcag 25261 gttgccccag gcagtttcaa caatcaggcc tcagagcagt acattcttgg aaccatgttt 25321 tctatcctga gttctggatg gtctccctac ctggcactct tccttagttg ggtgtgacaa 25381 gggtgaccca atttgtatgc tcacctttca caccgaatcc tcagctcgga caggtttccc 25441 tgtctaggct ttgactttcc ccttgaacct catcccacac acctgttcca tcggggccac 25501 tggcttatga tggagaccag tggtatgact gtggcaggct ccttctaggg gcactctgtc 25561 tcctgggatc tgccacactg ggggggattc tacttggcct tctctgactc cccagggtcc 25621 ctcccaccac agcacttgga ggctcttcca tgggaaaact gtcttgatct ttaggtacct 25681 ggagattctg ggccactttg gttccccatg agccaagacg gcacttctaa tttgcattcc 25741 ctaccggagt ccctgtctgt agccagcctg gctttcagct ggtgcccaaa gtgacaaatg 25801 tatctgcaat gacaaaggta ccctggaagg gctcgccctc tgcggaattt cagttcatgc 25861 aggccttggt gcttccacat ctgtccaagg gcctttcaaa tgtgactttt aactctgtgg 25921 attgatttgc ccggttgtca cattctgagc 
agccacaacc tactgcatcc catgtagaag 25981 tggaagtgac ctgatttttt cctgcttttc aaggctgtat gtttacattt gcctccaatc 26041 attcctatgg gaattccttg ggagtctaac ttggagattt tgtttcttct gcctttgctc 26101 ctgggggctt aatcacttct gtgcctctgg ttatctgtgg cacatttgta tttgtcatta 26161 gtcaaccgga gactcggggt ctgagtggag ggtatgtccc cctccagtga tggtttctgt 26221 tggcttccca gggtgaggat gactcatgac cacttgcaag tggtttttgt gtctggggtt 26281 tatgatcaca cagtcataca cgttctaact ccagactgac tgttgagaaa gcctctgggt 26341 aagggaattc ctgggaaaca cactgttttc atgcatcctc tggaagatga ggcctgaagt 26401 taccagggtc tctgtttgct gatgctgatg atccacattt tctagcccac tctgcttctc 26461 tgacaccttt agtcttgagg atccatgntc tgtgaaggaa tccaagctct catttcgcac 26521 tcaccttggc cctggctctg tctccaggac ctcttctact acaaaatcct aaagctctgg 26581 gagctgggtg tcaacctgtg cccgaggaaa tcatacagtt actgtggact ttccagtttg 26641 ctgtcttcta gtattccatt gtagctcttg ggtattttcc catccacccc aagatccagc 26701 tggaaatcag tgaacacact tgatgggagt tttcctgcat gtgctctggg cattgacagt 26761 agaagggtgt tcagaatgtc tgctgtgccc tcatggagga agagngctca gtgtacatgc 26821 tctgggtcag taggtgccct tgagcccagc tttgggagca atgttggatg agtgaaggag 26881 ggatccaggg caaagcaggc acgacagagt ggagacggcg ctgctggctc tcaggggaat 26941 gggcatggag tgggtaggag atccacctaa ggaggctggc tggctggacg agtcaggagc 27001 cccttccaag ggtggacact gacaggcccc cagtcttggt ctcctgcatg ccagaggtac 27061 cagcccatct tttttcctaa acttgatgac ctagggctag gggcatgttg aatctcagcc 27121 tcgcccactg gcgctggact tggtacacag ggtggggcaa agtgggtact ggatcctgat 27181 catccctatc cctggggtgt ggcttcttgc tgcacagtca gcttctagtt ctgtagcccc 27241 agctgctcct gcggtggagg gagctacaca tcaggctctg accccctcca ggtggggcct 27301 tcgcgtgagg ggagtcagca cgcatcagca gctgggccca gggagttgcc ccactgagca 27361 ctgcgggctg acctgctccc aaccagggag atggagcttc ccccttgagt cgggctgctg 27421 aaggggggta ggggatggaa acagtgcgtt tgcaggagta agggtgcagt tgggtccctg 27481 cgagaaaatg tctcagttgt ggcaactgat tggtgacctg gggggcgttt ctgagcccac 27541 agtgctggca tcaggactca ggtgtgaggt gccccagacc ctccccttgc cagtaattag 27601 ctgatggctc 
ggtgatgccc agggtgaagg aagacttgat tttgggaggg gagttctctc 27661 gtaatgacac tgaggatgcc ttcaagttgg gcttctggca tgttctgccc tcgctcccct 27721 tctgtagtca ccttggccct cgtgttgctg agctgtgtgt gggagcggga agcgcgtcag 27781 tgggcggagg gagcgggaag cgcgtcagtg ggcggagtat ttgagaacat ttcacaagcc 27841 gctgttgagg ttcagaatca accagcagat acagaaacat atttcggagc gtggggaccc 27901 ttgggtgagc tgccacatga agcagcccca ggacctccct ggctcaagga gtgacagcga 27961 gtttgtctga ggtgagggca caggcctggc gaagcctcgt gtgtgggtga gacctgcccg 28021 accccagtgc cttacccgag gagctactgg cccagtgggg gaggcattca ggtgggcaga 28081 gtcagggaga ctcatgaggc cgttgaggcc aggggcatag agctggccaa ggagccatgg 28141 ctcactaacg tgttgtatgg ggctccttcc cttcaggtcc aggctcctgc gtgaagtgat 28201 gctcctcttt gccttactcc tagccatgga gctcccattg gtggcagcca gtgccaccat 28261 gcgcgctcag tgtaagtatc attccctctc actgtcctgg agaggacgag aattccacct 28321 ggggtgctgg gggtcactgg gatgattggc tgcaacgtgg agcaagcctc cgttagctgg 28381 ggcctgcatt gtctgtgtaa tcaggggtgg gcactagggc agtccaggag tagtcatgag 28441 caaggagagg gttaggatga aggagcagct gaccagggac caagggggaa ccttgatgtg 28501 gcccttcccc atcagcgcca ggcaggaggg gctctgtcca gggaaaccca ggaggatggc 28561 ggacccctgt gagtatccag tcttccttgg cgaggtgagc caggtctgca gagcatagca 28621 atcccgtatg tgaccaccaa gtggcgctct ctggagcctg cgttggagag cagggaaagc 28681 tctccttgtg cctggcctcc ctcccaggag ctagcctggg ccagactcag actgcataga 28741 gagctgagct gtgcaggcta ggagaagtcc ttggaagcag aggggaaggg ctggccgctg 28801 aagaagggtg gagtgagctg gtaatgggtg gaaaaggcgt agtggagcag aagcctgaag 28861 cctgctttct cccctctcag ggacttacag tttgagatgc catgactgtg cggtcataaa 28921 tgacttcaac tgtcccaaca ttagagtatg tccgtatcat attaggcgct gtatgacaat 28981 ctccattcgt aagtacctct tggtcatttg gacacattgt agattagtcc cctacctggg 29041 tagtttctgg ggccagggcc agtctgcttt cttctctgaa cccagctctg tttccccttc 29101 cctcatgtcc tcccatcctg agtgcgtttc tgcacgcttg ggtctcagcc tcatgatagg 29161 ccagcatgca tcatcttgtg gagccaggta ctctgcaaag tagtacagtc tgtccacaca 29221 tctgcagcgt ctccaggggg tgggagcatt gttggaccgc agagcctcta ctgtccctgg 
29281 ttgtgtgtgt agacacccca tgctgcctgg gtgagtcctc actggccatt caatttctgg 29341 tagtttagag agtgcttcca gttcggaagg tcaaagaagt ggttgggcca tggcactccc 29401 cccagggaaa aacctaccac acacagttgg gagacaagca tgaggtcagg gcacagcctg 29461 cacttccagg atgctcttgc tttttcctca gagccctctg gtgccctctc tccccagggc 29521 cctaacctgc agaagatgta tggccagagg ccagtgacca atggagcaag cagggagggt 29581 gcagccagtg tatgccctgg cgcacaggtg gagcctcgtc tggggctctg ctcagggcct 29641 ttcctgtagc ccttttgtcc ccggagtctg tggttgtgct tgcaacttca catgtcattc 29701 tgtgttctaa gctttgtgca gaaataatcc cctgattacc acctcggtgt ggttgggtta 29761 tgtccccacc cagatctcat cttgaattgt aactcccata atccccgtgt gtcatgggag 29821 ggacctggtg ggaggtaatt gattcatctg ggtggttacc ctcatgatgt gctcgtgata 29881 gcgagtgccc acaagggcga aacctgccac acacagatgg gccggacaag catgcggtgc 29941 agggcacaga ctgcacttcc aggatgctct tgctttttcc tcagagcccg ctggtgccct 30001 ctctccccag ggccctaacc tgcagaagat ctgatggttt tctaaggagg gttttccccg 30061 gaacttctcc tgcctgccat cattcgaaga atgtgttggt tacccctgct gttgtcatcg 30121 taagtttcct gaggcctccc cagccacgcg gaactgtgag tcagttaaac cgcttgcctt 30181 tataaattgc ccagtctcgg gcagttctta tagcagcgtg agaagagatc atacacacct 30241 cttcttcaga ccaactgctg atttgggaca tgggcaggag gctgagaatc tgggctttgt 30301 ctccacagtg gccactgctg gggaccttga tggcatggtt ttgtcatgtt gggattactg 30361 cctctaaatg aggctgagaa ctatctttcc cataactccc tcgtggggta atagttggca 30421 aaaagaggaa cttgagtgag attcgcaagg tatgggtgaa gcagagatca ctcttctgga 30481 aggtcacgat ggtttagtga agtgaaggac actggtggac tccaatctgt cctccttcat 30541 gtccacccca tcttctccac tttgtgagcc tttgactgaa agcaattcta agactccagc 30601 aggcatttgg ccactggcct gcctaggtga tgggttctcc agtggcgcag gttaaatgtc 30661 tctgcaagag actcctctgc acagccttac ttgagaggat aacggtgcat ggcctttgat 30721 attgccctgc agagttgggc tgtgccagcc tgccccagtg taagtgacct agctctggac 30781 tgttgctcct ctgatcttca gccttacctg actgacttcc tctcttggct cccccatggt 30841 catggcagct gcaacacttt tatttaataa cttagaccta gactgtttta gaggctccat 30901 tttcctgaat gaaccctgat ggatattagg acaaagaaat 
tgggaatgct ggaatgctgg 30961 gacatttttc ctttcaggag atttgctgat ttctggggct gtgcaggatg gtaagcaaaa 31021 gacctctatg gaaagaaacg caggcagtgt cctgagacaa ggggcctggg tatgaggcat 31081 ataggaactc tgcactatct ctgcaacttt tctgaaaacc taaaactatt tttaaaataa 31141 aaggttcata cacacacaca cacacacaca cacacacaca caccataacc ccactacctt 31201 gtggattcaa taacagaatt gcaatgtggt ttacaatttg aaacccatca gtttaattca 31261 ccatattaac aaagaaaagg gcagagatga ttgtagatgc tgaaatgttt tgataaaatc 31321 caacaccctt tccaaatctc tgagggaaat gtctaccatc accacttcta ttcccaatgg 31381 aactggagtt gttaggtagt tcagttaggc gagggggagt attaagtctc atggagcgga 31441 ggaagcaaga atttaacctg cttactattc atnagagaca accaacttcg aataacaaat 31501 atctttaggc aagctcctgg agacaagata agtattccga aatcagttta ccaaatcagt 31561 tctccttttg tacatcaaga aacagattgg aaatgaaatc cgaggcatgg tgctctttaa 31621 agtatngcca aatcaagtac cacaaatcta gctggagttg tgcaagattt caacactcaa 31681 cctgcaagaa cactattgaa gacttaaaac accattaaag acttaaacac atggaattgt 31741 acacgatact tatggatcat gagaatcnaa cagagtaagg atatctcttc tccaattaac 31801 ctacaaatcc aacagaattt cagtcaaagt ctcagcaggg gtatgtttgt gttttgacaa 31861 gttggtgccg aatgcttttt gaaaatatga agggctagga ataaccaaag cagtcttggg 31921 gaaaaagaaa aaagctcgag gacttgtgct ctgaactata aaaccggttt ccttaatcaa 31981 aacggtttgg tgttggatcc tgtggtcagg attgaaaaat aacaacatta gaagagaaat 32041 aattcagaaa aagccacata cttctggttt cttgatctct gacaaagttg tccctgcagt 32101 gcagcaggta aagacacttc tctgcgctga gcagtagtgg atatgttaga tatttataca 32161 caatatctgc accatataaa agaagcagtt tcaggtggtt tgtagatctc aatgtaaata 32221 ttaaaccaat aaagttttta gagaaaaatg taagttgctg tctttatgac ttgtaggaat 32281 taagaattta aaaaatcgaa caaaatagtg ctagtcacat aagaatgaaa ttgataggtt 32341 gaattacctc taacttcaaa acttctgttc ctaagaagat acctgtgaaa tagtataacc 32401 gcaagccacg gatgaggaga aagcatttca aaacatgtat ttactaaagt acttgtgttc 32461 ataatataat gaacttattc ctatgaatta ataaaaaact gatcaatgga aatttaaaga 32521 aaaggtttaa agggaattgg ccagaaaatg gtgtctggat gtgtaacaga cataagcgca 32581 gttgttccct gtcgttcttc 
ttgaaaggaa tgcagcttaa aaccacaaca caataacgcc 32641 acaggaaatg actcagtatc ctaaagtgat cataccaagt gttgtcaggg ttctgaactg 32701 tgcacaatcc ctcagactat agataagttc aacccagtgt agaaaactct tcgatgttgt 32761 cggctaaagc tgcgtatcca acactgagac tcagcgcttc tactcctagg tacatagcta 32821 attataaaaa agcttataca ggctcacctt aaggcgaata caagacaaga gtattcattg 32881 cagcactatt tggagtagac aaaaaccaga aacaacctca atgtccatca acaataaatt 32941 gtggtatgtt caccaatgta atgctatgca attttgagaa tgaacaatat acaacagtgt 33001 gcaacaatag ggataaattt cataaacaca gattagagca aaagaagcca tccacaaaag 33061 cagatgtaat gtataatttc atttcataaa gttcaaaacc aaacagaact aacctatagg 33121 tttagaagcc catacatttc ctgtctttgg cctgggctgc tcacagcagt gtctgcatgg 33181 aggaacagag tgaagaatcc tggggctctc tcttgcagct ctgtttctga tgctgcatgc 33241 tgagtactgc ctgtgggcac ttggtgacaa tccattgagt tgtggacgta tggtcaggca 33301 catttctgta gaatggtgaa ccccaataaa ctttttaaat ggaaaataac agaaaacaag 33361 tgactggttc gtattcatca tttgtttctg gtgttataga ggagacaaga cccagaccca 33421 tgaatagata aatcaattat ggtaagttcc agaataaagt atttgtgtaa acactggata 33481 gacttcaagg ggaacagaag tagtgatgag ggccaaggta tttaaccctg ggattcttga 33541 aggaggtctn agatggaact ttggtcaggt gccattggtc aactgaggcc cctggacaag 33601 ccctctagtg aagattattg atcacattag tcgtgatcac attagttgcg atcgataacc 33661 ttcaccagag ggcttgtcca tcggataaca cgtacctagt tggaaaggtg catcaagtct 33721 gtactgggga ggaggttggc accaagggtc aagagcattt atatgttggt ccaggagagc 33781 tgaaacacca ggctgagggt cccaggaagg agtttacccg agaggggtct tgccaactgc 33841 agaaaatcta gttaaggcta cattaatagg gacacctctg ttaaaggtcc atgtgatctc 33901 tagaccataa gtgaaacatg atttctggtc tgcatttaat ttctcgggct ctagaattct 33961 taccataaga aatgaacagg acatttcatt gcgtgtagct acatgancaa ggtaccatga 34021 taccattttg ggagggccag agncaannag aatgggcagc tctagnatag aaaactatct 34081 ccagaagaag ttgttctggg agtgatgaga gcccgctagt ggacttggat gttctctctt 34141 agtcaagtgt tctagacaat tttatcacag cgtgggagtg tagaatgtgt acatggagct 34201 aattatggtt gaatgtgagg tgtatgtgcc tcaatattta caagcagaaa atgtgaaatc 34261 
aattattttc attgctgctt ctttttttag gcataaattc tcgtgaacta cttgtttata 34321 agaactgtac aaacaactgc acatttgtat atgcagctga acagcctcct gaagccccag 34381 gaaaaatctt caaaactaat agcttctact gggtttgttg ttgtaatagc atggtttgca 34441 atgcaggagg acctactaat cttgaaaggg acatgttacc cgatgaagta actgaggagg 34501 agcttccaga aggaactgtg aggctggggg tatcaaaact gttgctgagt tttgcctcta 34561 tcatagtcag caatatattg ccatgaggac cccaccttgg agggtctgac catcttcacc 34621 tgttccgcag agaaatgttg ctctccatta ttcccttcta agccagagac ccttatccac 34681 tgctcctcta ggtggcccat ttatggtttg ttgtaagaga aaaattaaaa aaatattgtt 34741 tagtggagat gttcatgagc atttgtttgc ctgcatgtac ttgagcactc atagtggagt 34801 ctctattcat atctgttctt ccctcccttg ctctgtattc agggattact ttgtggtttt 34861 attataatca gttgctgtag ctattaattg attttggtag ttttgtcttt gtgcttcata 34921 ctaaagttat gagtgggttg cacactgcag ttatagcatt gcaatatttt ggtttgtctg 34981 tgtacttaat tttaccctag gttttatact ttcagatgtt ttctttttat acgtgagtgt 35041 ttttttcttt cagtttgaag agctcccttt agcatttctt gtaagacaga tgtggtggtg 35101 gtgaattctc tcagcttttg tttgggagag atttatttct ccttcatatt tgaagttcag 35161 ttttgctgga tacattattc ttggatgaca gttgaggatt ttttttttct gttcaacact 35221 ttcaaaatgt cataccactc actcctggcc tgttgagttt cccttaagaa ttctgttgcc 35281 agatgaactg gagctgcttt acatgtctgc ctattttctc ttgccacttt tagggtcctc 35341 tttttgtcct tgacctttaa gagttttgat tattatatgt gtcagggtag tcctatttgg 35401 gtcagatctc tttggtgttc tctaactttc ctgtatctga atatttatct cttcttcaag 35461 ttttggaaat ttttctgtta ttattttttg aacagcttcc tacccctggt gcttgcttaa 35521 ctccctctta aacaccagta attcttagat ttggtctttt gaggtaattt tctatatctt 35581 ataggttatc gtcattcctt ttcattcttc tctctttttt ctcctttgtg tactttcaaa 35641 tagctgtctt caggttcatt gattcttacc tctgctttat ttattttgct gttgaaagcc 35701 tgtaatgaat ttttcagttc agcaaatata tctctcattt ccaaggtttc tatttgactt 35761 tttagattat ttcaatgtct tcgttaatta tttctcctaa atttctgaat tgcttttctt 35821 tgttatcttg gagatcactg agtttcctta aaactactat ttt // LOCUS AB002059 28984 bp DNA PRI 29-AUG-1997 DEFINITION Homo sapiens DNA for 
Human P2XM, complete cds. ACCESSION AB002059 NID g2350848 KEYWORDS Human P2XM. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 28984) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (22-MAR-1997) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, The Inst. of Medical Science, The University of Tokyo, Lab. of Molecular Medicine, Human Genome Center; 4-6-1, Shirokanedai Minato-ku, Tokyo, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) REFERENCE 2 (sites) AUTHORS Urano,T., Nishimori,H., Han,H., Furuhata,T., Kimura,Y., Nakamura,Y. and Tokino,T. TITLE Cloning of P2XM, a novel human P2X receptor gene regulated by p53 JOURNAL Cancer Res. 57 (15), 3281-3287 (1997) MEDLINE 97384966 FEATURES Location/Qualifiers source 1..28984 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" gene 9106..20520 /gene="HP2XM" exon 9106..9239 /gene="HP2XM" /number=1 CDS join(9106..9239,9843..9993,11889..11960,16575..16650, 16841..16934,17017..17097,17174..17315,17408..17517, 19708..19801,19914..19979,20155..20232,20323..20520) /gene="HP2XM" /codon_start=1 /product="Human P2XM" /db_xref="PID:d1022906" /db_xref="PID:g2350849" /translation="MGSPGATTGWGLLDYKTEKYVMTRNWRVGALQRLLQFGIVVYVV GWALLAKKGYQERDLEPQFSIITKLKGVSVTQIKELGNRLWDVADFVKPPQGENVFFL VTNFLVTPAQVQGRCPEHPSVPLANCWVDEDCPEGEGGTHSHGVKTGQCVVFNGTHRT CEIWSWCPVESGVVPSRPLLAQAQNFTLFIKNTVTFSKFNFSKSNALETWDPTYFKHC RYEPQFSPYCPVFRIGDLVAKAGGTFEDLALLGGSVGIRVHWDCDLDTGDSGCWPHYS FQLQEKSYNFRTATHWWEQPGVEARTLLKLYGIRFDILVTGQAGKFGLIPTAVTLGTG AAWLGVVTFFCDLLLLYVDREAHFYWRTKYEEAKAPKATANSVWRELALASQARLAEC LRRSSAPAPTATAAGSQTQTPGWPCPSSDTHLPTHSGSL" exon 9843..9993 /gene="HP2XM" /number=2 exon 11889..11960 /gene="HP2XM" /number=3 exon 16575..16650 /gene="HP2XM" /number=4 exon 16841..16934 /gene="HP2XM" /number=5 exon 17017..17097 /gene="HP2XM" /number=6 exon 
17174..17315 /gene="HP2XM" /number=7 exon 17408..17517 /gene="HP2XM" /number=8 exon 19708..19801 /gene="HP2XM" /number=9 exon 19914..19979 /gene="HP2XM" /number=10 exon 20155..20232 /gene="HP2XM" /number=11 exon 20323..>20520 /gene="HP2XM" /number=12 BASE COUNT 6346 a 8142 c 7938 g 6558 t ORIGIN 1 ggtgaaacct catctctact aaaaatacaa aaaattggtc aggcgtggtg gctcatgcct 61 gtaatcccag cactttggga ggccgaggcg ggcggatcac taggtcagga gatcaagacc 121 atcctggcta acacggtgaa accccggctc tactaaaaaa ttaaaaaaaa aaaaaaaacg 181 ttagccaggc gcgatggcgg gcacctgtag tcccagctac tcgggaggct gaagcaggag 241 aatggcatga accccggagg cggagcttgc agtgagctga gatcgtgcca ctgcactcca 301 gcctgggcga cagagcaaga ctctgtctca aaaaaaaaaa aaaaaatagc caggcatggt 361 ggcgggcgcc tgtaatccca gctactcagg aggctgagtc aggagaattg cttgaaccgg 421 ggaggcagag gctgcagtga gccgagatca catcactgca ctccagcctg agtgacaaga 481 gtgaaactcc gtctcaaaac aaacaaacaa acaagaatgt atggtaaatc acactaagac 541 tgacaaaaag gaacttaaga gcaaagagaa ttaattcaac aaggtccatc taaggctgga 601 ctatcttctg gggggaggag aggctgtgac tttatctccc cacagactag cctccctact 661 gtctcacaag acttagttac aataggaaac ctctgcacag catagcatct cccacagtgc 721 ctgccagcta aggatgtctt caccttccaa atgtaacact aattgaaaga aaaattcctc 781 tactcaagtt gagaggctta ataattagac acagaatgga gggtttcacc tcctacctgt 841 aacatctaac atctaaggtc tgtaatcctc ctatgtaaca tgtaatgaac attccaggtt 901 ggtgtaactg ttgtagattt aacccttctt tttcctcttt tttttttttt tttttttttt 961 ttgagacagt cacgctctgt tgcccaggtt ggagtacagt ggtgatcttg gctcactgca 1021 atctctgcct ccctgtctca agcgatcctc ccacctccac ctccccatta gctatgactg 1081 caggcatgca caaccatacc tggataattt ttatatcttt tgtagagaca gggtcgcact 1141 atgttgccca ggctggtctt gaactcctgg gttcaagcaa tcttctcacc tgggcttctc 1201 aaagtgctgg gattacaggc atgtgtcact gagcctggcc gattttaccc tgttctatga 1261 atcactttaa atctgtcaac ttaaagatgg tttgcatcaa cgtgagatga aaaagagaga 1321 tgatctgtgt caaggtgttg taaagtggct gagagagggc cagacagatt caagaagttg 1381 tgtgtgttca gaggcaagtt atggccattc cacatgggaa aaagaagaca catacctgtg 1441 actttctcat 
atcatttcaa tctggaatga tcctggaact tgcaactcct caaacacctt 1501 cagaactcat tgagacttat aaatgtggga tctatagctt tgcctaatgt gggaaaccac 1561 tgtagacagg ggctcttaac taaccatagg gcacaaagtt gaccaacaca ggtgtggtga 1621 ggcatttagc ctagtcatca caacatatct tcgagcctga cattctgtta tttggcttcc 1681 aaatacatgg tacttttctt ttggtagaca actgactgta agcatagcta aaggagcacc 1741 aaatcatcag tctgattgtg tgtgaggcag aggtctctgc ttggtttgga gaaagggcaa 1801 agacctgcct ttggtcttca cagtaaacac tgcacaatat attttataat ctttacagaa 1861 atattagtgt tgcccaaaat tcagtgacaa caggtgacca agatgatgta aagtgtaaaa 1921 accactattt ataacacaaa aaatgataac atagaacctg acaaattact aaagtgagca 1981 gtggagtaat gtggaggaca gaccaggacc tgcagtcaga taatctctga tttctctctg 2041 catcacttgc tgcctgtgag gctccagaca cctaaagcta tctccgccct ggttttcttt 2101 tcttttttct ttttctttct tttttttttt ttggagaggt agtctcgctc tgttgcccag 2161 gctggagtgc agtggtgcaa tctcgactca ctgcaagctc tgcctcccgg gttcacgcca 2221 tcctcctgcc tcagcctccc gaatagctgg gaccacaggt gtctgccacc acacccggct 2281 aattttttgt attttttagt agagccgggg tttcaccgtg ttagccagga ccgtctcaat 2341 ctcctgacct cgtgatctgc ctgccttggc ctcccaaagt gctgggatta caggcatgag 2401 ccaccgtgcc cagcctcagc ctctgttttc taaagtgtaa gatactggta atattgcccc 2461 atgaacttta aagggtagtg gttaaggtat gagggaacat aaagtatttg gtaaagtata 2521 aggcggtagg aaataaatta ctaaaattat tatccttgca acctagctga cgtgaaaagg 2581 ggttgtctcc ataaaggatg cgggccttca ggggcagcat tattctgcag gtctcctgct 2641 tcctgatggc acaaggactc cacatcaccc agtcctacct taaatccaat cggacaccag 2701 tccacaaact ggttggtgtg cttgctcttg atggtggcga cagccgcact gacatctttc 2761 aggaccacat cccctctgta caacatgcag cagcccatgt acttgccatg gtgagggtca 2821 cactagacca tctgattggc tggctcgaag cgggcactgg cgatcttggc cacggacagc 2881 tgctcatggt aggccttctt ggctgagatg accagggcgt aggtggccag ggggaggtgg 2941 atgtgggggt acggcaccaa gttggtttgg aattctgtca agtacacatt cagggcccca 3001 tcaaatcaca gggaggccgt gatggaggac acgatctgcc cagacgactg aggttggtgt 3061 acgtgggaca ctcgatgtcc aggttgtgct gacatgtcat aggtggcttt gctgttgact 3121 atgaaggcac agtcagaatg 
ttccagggtc gtgtaggtgg tcaggatgga gttgtagggc 3181 tccgtcatgg ccatggagac ctggggggct gggcaaatgg caaactccag cttggacttc 3241 ttcccgtagt ccaccgagag ccgcttcatg agcagagaca cgaacccaga gccagtgccg 3301 cccccaaagc tgtggaagat gaggaagccc tgcagtccca tgcacagatc tgcccgagag 3361 aaaccagaca acgtaagcca atgcccgtgg gagccacacc accaacctcc accaagccgg 3421 ggccacttct ctttgcctct gagagttgat acggattcag accatccata atgaacacgc 3481 atcacacttt gaagcagaga aaagcagaca atacagatct atgcacaaat caccacgcac 3541 acttcacagt aagacgttgc agaggtctag gaaggctggt ttgaagaaat acacagaaga 3601 aaaacggatg gactaaccag agcgctcagc cagcctgctg gactggcacc cgtggctaca 3661 gacactcttg tttagcacca tggagagctc cctgcacaga agcaaggact cagggcattg 3721 ccccctctct acctgagaac tagactcagc gcccagcact ggatttgata atacaggtac 3781 tcaaatacat gtgctttcct ctttactcag acacttctag ggctggcagg gtcagacagt 3841 aatcccccac tttgtgggga ggccctgttt ttgctgaaca gctcaggggc aggaagattc 3901 tttcttccac acacccctgc aactgccaat acttgttctg tgtgaccctt gcagccatcc 3961 tatgcggtag gtgctaccat tgccatggat caagtgggga cgcacagatc caagctcctt 4021 cacctctcaa gggagtgggg ccctcccaca gatctcttgg aggaaaaaaa tggccttgct 4081 gggcctgtcg atttccagtc cccacccacc tccctcatgt acatgtgtct gtttgtgaaa 4141 ccttcatagc accttagctt ccaggccctc catcccggct gcctctctgg cacttgccct 4201 tttactgtgc gggtccaaag agacacctat gttcagtttg tcaaagtctg aaataagcct 4261 gtcttttaat ttcactgact cccagtagta atttttttag aagacaaact gcttctctgg 4321 cagttacact ccagggccaa ttctcaaagg ccaagtctgg tgccacccca ctccctctaa 4381 gctgcttgag ccagatgaga gcttctgaaa tgctgcagaa aagagctctc ctggggcatc 4441 cattaaaatg cagattcacc ggcattcccc taagagtatg gttcagcagg tccagggtca 4501 acatctgcat gcctcaccca gggtcacagg tgattgttac tattattgga ccatttgagg 4561 agaggcaggt agaagtgcaa attcctggca aagcaaacac tcaggggcat gagagtgatg 4621 agaccaagga atcccaacca tgtcttctac ctcacctgga acaaaatcta tacccttttt 4681 gccacggcct acagccctgc agccctctgc cctctccacc ccacagcctg gccacgcttc 4741 cctggctccc tagcctaggc acactgctgt ctgccagtct gggaaacagg cccgcatgct 4801 cactgttcct ctgcttgaaa tgctctccct 
ctgatcttgc catggctgac tccttcccac 4861 cctgcacatt tcagcttaaa tgtcacctca cagaggcctc cccagccacc tctttaaagc 4921 agggggtccc cattcttctc tagaagcata aactatttgt ttcttttgca gtgccaataa 4981 ttactcagaa tgatgttgtt aacattgtcc cttgaggtca ggaacttggt tttgttcaag 5041 gtggatccct ggtggctaag tcagtgccca gcacatacag cagcgcgcaa tgagatattt 5101 gtggaaggaa tgaagagttc cgtgtggctg gagtggaaga ggagtggggc aaggtggggt 5161 gtggggagca ggaaccagag cacatgctcc ctggcgatgt gcagggcggt cctgccaacc 5221 cccacccatc cctgccccag gaagctgagc tttcatggcc atggtgccct gcgcaggcgt 5281 tacctgcttg gtcattcagc tggttgtagc tcaggtccag ggacttcagg tctgtgtggg 5341 ccagcaggag ttcatcaagg tgctgggccg cctgctcctc caggccattc cctgacagct 5401 gcatcttccg catggcctgg ttcactgtga gggcggcaca gagggcctgg gctcctgcca 5461 ctcccagctg gttctccaac aggtccacat ctgggggtcg gagccactcc tcagccagtg 5521 ggatgcaaag ccaggtacca cccccctccc ccctcttctg ccatccccca aacctccagg 5581 tctgaggcag aggaaaagga cttacaggtt aactgtcctg gggttgcatg gtggctctgg 5641 ctcccaatgg ctgggtcccc tctgcccctg acctccccct ccctgtgtgc ccaccagctt 5701 ctgtgtcaag cggttaaaac cgcctacctc ctggtactgt agagactgct gtggtggggg 5761 gcagccaccc catcgatgat tatgtctggc agtgatatca ccacccactt cagaagccct 5821 cactgcccca ttgccaggat tggcaataat agcccaggtg actgtgcatg gagcacttac 5881 atgggtcaca cactctgctg atcacttcag gagtgcattt cacttcatgc tcaccacagc 5941 cccaagccta tgaggtcttg tgggggccat ttcacagatt ggaaactgag ctgctttcat 6001 aggtcacccc gttttctcag tggcagagac agcatttgga ttcagcttca atgaactccc 6061 accttgccca cccctttctt gaacttcccc gcccccaact ctctgtgtgt gcaggacttt 6121 ccactctgca ttcaggcact ggggagtctc ccatgtgcag ggcagggcag gagaatgggt 6181 gccagccaac aaggaaggag agtggtgggg tagggtggat gaggaaggga cttgtatcca 6241 gcatctgctg cacagcagag ggtctaactc caccccacat accccctcct gtgatgaggc 6301 ccctcctcta tccaccagca acaccagcag gacccaggtg acatggccca gctttcaggg 6361 gtggaggcac atccctgagg aaagggagga ctcacgcctg cccacctgcc ccagacccag 6421 cacctaccgc agatgttgct gcttttgctc agggcacctg ccagggcctc tgtacctgcc 6481 ccatagagcc cactgtccca aaagttcagc cacttgacat 
atggattgga gctcaatgag 6541 gaagccagag accggcgccc tgggacagac agagcaaaca cactggccac tatcctggct 6601 catgctccag ggctcagccc gctccacgtc ccccacctgg aaagtacccc cctttccacc 6661 tgcttaaggg acacatgctt ctccttttag gccccaccta ggtgtccccc acccccacct 6721 ctcccagagc ccccagcggg ttagaagcct gaaggctccc caacccttca tgccatctgt 6781 catagcatca atcccatctg tctctcagat gtcgagggta gggcatgcct ggcacaggca 6841 gaagctgggg agtacagggg acagactcag gggcactccc atggctgttg tctcccgggg 6901 cctctccctg taagcctgga cccagtgtcc ttccagtggt acctggggcc ccaggccacc 6961 gtgcagcagg ttcagctctg gggcactccc ttggcacaga aagcagaaga tgggcacaat 7021 gctatggatc cagcaagacc tcaggtacag ggtgtccccg accagttctc caagtccatg 7081 ggtgcctggt aggagacagg gggatgaatg tgaacccctg catggctata gccacctgcc 7141 tcctcccctg ccctgcatca ctacctggcc tattttttgc ctctagaagc actgcttcct 7201 atgctcctta ggaccactgc ccgcatatga cagataagaa catcgaggct aaggcaacgc 7261 aaatcttttc cttaaagtca tacagctgtc aaaagaaagc tggacaacct gggcaacata 7321 gcgagataaa aaattattta aattagccag atgtggtagc cccctgtagt ctcagcgact 7381 caggaggctg aggcaggagg ctcaccagag tgcagagttc aaggatgcag tgagctatga 7441 tcctgccact gcactgaaag ctgggtgaca gagcaagacc ctggctctaa taaatgaata 7501 cataaagtct cacagctagt ggtagctaat cctgccagag tcaggcctct acctgtctga 7561 tgacaaatgg cacactatgt cttttaacct gattgcagac cacaaatgtt ttgtgaatat 7621 tttccccagg gaaaaaaccg gaagtagttc taaattctat acatccatta tattagtttt 7681 acctgtggat tgggaaaacc cagctctgat tgcatttcag ggcgggacag cctttggtgc 7741 actgtctggc gggattttcc attttaacct ccttctagaa gcgccttctc atggtaaagt 7801 tcctgatgcc gccaggagcg ccgaggagag ggcagggggc tggagacgcc ccgcagaggg 7861 ctacgtgccc tgctggacag aggtctcctg cctcctcggc ggcgccagcc cacctcccac 7921 aacccctgcg ggagaagccc ccaaggggag gagacgggcc tggcccctgc cccgagcacc 7981 ttccgtctct aggtcggagt ctgaatcggc cttgggaccc tgcttggctt cggggacccc 8041 tgcaagacgt ccacaggccg ccgtcgcctc ttcctcctgc tttttatcct ccccagacct 8101 ctggcaggaa ccgctcatcg ttacgcccct ttcgcagcct cagaccctga ggcggagacc 8161 gcttggcgcc tcacttagag cgcgacccgg ggatgtgggc ggagtctgcg 
gctgcgctga 8221 ccaatcgagt gtggcgtcca tcgactggcg tctgccacgg caattagcga cgcgctcccc 8281 cgcggcggtc gccccggcaa cccagtgctg taggttgccg tagaaaccgt ggctctcctg 8341 cgctgaggct cctcgcctga gaggataaac tgcacgcgcc acgggctatg cactgggctg 8401 ggcgccttgt gggcatcctc cctgccttcc tagggggttc cagcatcgcc cccctttcgt 8461 ggactgggaa acacgcctga ctccaggact tgtgttgtcc tcactgcact ggggaaggtg 8521 gcgggggcag cttttcagga gggcctgggg aacttcgcag agccaggtca ccctctcact 8581 ctgtgcctct tagttatctt gcatgctctg gtctttgcat acgctgctcc ctgcaccagg 8641 aacctccatc cccatctttg tctgcttgtc gaacttcaga aatctgcaag ggtcagctta 8701 gaggtcactt cttccggaag ctttcctcaa caccctcccc gccctgctgc tgctgccctc 8761 aggccctcct ctcacagcac tgataacagc tgtccgtctc caccctccca ccacctccac 8821 tcccacccca ggaagtgagg ccagagggca gggacagagc tgctgctgtt ctctgtgtgc 8881 cagggcccag caaagggaat gtagggaggg tgggaggtgc agggcagctg ggattagggg 8941 ttgagggctg ggtgttggag gctggatctg gatcctgctt tagtggaagt gtccctttaa 9001 cagcaactgg cctggcctgg ctcgggccct gctttgcctc ctgttcagct gcggctgcag 9061 ctgccatgct gactcatgtg cccgcagcta gcaggagctg gcagcatggg ctccccaggg 9121 gctacgacag gctgggggct tctggattat aagacggaga agtatgtgat gaccaggaac 9181 tggcgggtgg gcgccctgca gaggctgctg cagtttggga tcgtggtcta tgtggtaggg 9241 taagagagaa gagcttttgg ccaggctgga ggggcaaggg aagaggtggg gggtggggct 9301 tggtcctgct gggttgaagt tgagggttgg gctgtttagg ggctggagtg gaagggggca 9361 gattgggacg gggttgggga gagctaggcg atacaagaca ggagagcaag aacaagctgt 9421 gtgtttgtcc tgtgtgtcca cttgcctcct tcccaggccc ccacccaggc cccacccagg 9481 gggcacatga catagtcctt aacatctgtg agagctggag cactaggccc ccagagagac 9541 caccagctgt atctcgggtc aggagagtct gtaaggggga agctggatct agtcaggctg 9601 ggggtgggtg ctggctagtg aaggtgattg tctgagagca ttggctctct gatgcatggc 9661 tggagcttct gtctcattca gggggtctgg agtgggaagt ggggccagag aggaggtggg 9721 gccttcgatg ttgggccggg agcctgtagg gtgtgggggg agaactgagc atgtagggct 9781 cagctccgcc cctgtcacta cacgctgggg acacaccaca ctgcccgact tctcctcccc 9841 aggtgggcgc tcctcgccaa aaaaggctac caggagcggg acctggaacc ccagttttcc 
9901 atcatcacca aactcaaagg ggtttccgtc actcagatca aggagcttgg aaaccggctg 9961 tgggatgtgg ccgacttcgt gaagccacct caggtggggg ccctgatgtt gctgacgggg 10021 gcgcaagtcc tttccccact gacagcctga acacccgcca tgcagccagt gtgtgcgaga 10081 gagaagcatg tgatgccaga gacggctgcg ggttctcagg aagggcttca cagaggagtg 10141 gcacctggac aggactttca gggatgtgta ggaggttttg gggtggaaaa aggggccact 10201 caagaagcca ggccagggtt ggacgtgctg gctcacgcct gtaatcccag cactttggga 10261 ggccgaggca ggtggatcac gagattgaga gtatcctggc taacacggtg aaaccccatc 10321 tctattaaaa atacaaaaaa ttagccgggc atggtggtgg gcgcctgtag tcccagctac 10381 tcgggaggct ggggcaggag aatggcatga acccgggagg tggagcttgc agtgagccga 10441 gattgcacca ctgcactcca gcctgggtgg caaagcgaga ctctgtctca aaaaaaaaaa 10501 aaaaaagcca ggccagagaa actgcatttc caaagactgc caacagaaaa gaagggagtg 10561 tccaggacta atggcttgag cttgagagtg gtgtgaggtg ctggggcatg gaacttccct 10621 gtagccctgc tccctgacct ggggcactac ggtcaggtgc tgctcctccc ctcttctcgg 10681 ctgcgttttc ctcctccctc cacccagctc atccccagcc tcaactgcca cttctgctcc 10741 tctgatgccc agggtgtatt cccagtgatc acctgcccag agcacagctg tcttctaggt 10801 gcacacccac atgtccaaag atcaattatt ttcctctcct ggcatggcct ctgtgacgcc 10861 cactagtcat ggtggctgtg acatccacta gtgcctcagc cagacccgtg actcaccctg 10921 gaccccttcc tgtcccttcc aagatttttc accactaccc atgccatgcc atgcatgaga 10981 ctatggcctc ctagagggtc cctagatgcc cctctcgcct cctcccttaa tgctcggtgc 11041 acaccacgca gcagccaagg tgaactttca caccaggcat catgagagcc tgcagcgcct 11101 ggttctaccc tcaggaattc ccccaaccct gcccatgacg gtgtccacac tttcctccca 11161 atcctaatgg ctgccactcc cagcaccatc tggccagccc tcaccttccc ttcctgggca 11221 tacattcccc aaattcacag tgctctcacg agcagcactg gagggtcagc ctttctttcc 11281 aatgtcctcg gccacccgtt gaccacagac acagctttcc ctcttctccc ttggcccctg 11341 ccatgccagt gctgctgtgt gtgagatggg agactcacct cgtctccatc ctgagcaggt 11401 gctgggccca gctctccctt ggatcttcag tactagaagc agcaggctgt tggaatattc 11461 tggttggagc caggcatggt agctggagcc tgtagtccca gctacttggg aggctgaggc 11521 aggaggacct cttgagtcca ggagttagag gttgcagtga 
gcactgatca caacactaca 11581 ctccagcctg ggtgacgaag tgtaatcctg tctctaaata cacacataca catgcacaca 11641 cacacacaaa ttttggttga gacaagagac ttgtctcaag agatggacat gggcacaagg 11701 cttcctggtc tcaaaaatgg ccagaaccac tgccagcctc ccatctctgg ttcagtctgc 11761 cttacagggg gacagggtta atgacttgat ggggccaaca tcccttccct cataaaccag 11821 gctgccggct tccggccttt ccagtcaaca cgagcccagc caggccaacc ttgagacttg 11881 cctcctaggg agagaacgtg ttcttcttgg tgaccaactt ccttgtgacg ccagcccaag 11941 ttcagggcag atgcccagag gtgagtttac ccaggatcct cccagcgggt cccttgttcc 12001 tccatcagcc ccaggtggcc acccgtgttt ccctttcccc ttcccaggtg ggtgaaggct 12061 cagcctgtgt tcggtgtgcc ccaggcactg gggtacatct tttcctgaat cattatgttc 12121 agtcttcaca tatcccctgc ctggtaggaa gtcctgtgat ccccatttca gaggagaaga 12181 ctgaggctca gtgaggttga gtcactttct taaggccttc aggcctgtgg gtgacaggac 12241 cccgagcttt gggcagcagc agttcccatg aggtgttcag gccctcccat tctggtcctg 12301 cctttgggta ctcttcaggt tggtagtgtg acacccagag ctgcgcacat gctcagggag 12361 gttttaatag caagagccaa gctggaatat cacctcccct tgtctgtgcc cagcctctat 12421 taatatgtcc tgaggcagct ttcatctttg tgggccaaca cagcacactc ttgctcatgg 12481 tgaattcagg attgcttatg atttctggat agtttttttt gttttatttt tgagacggag 12541 tttcactctg tcacccacgc tggagtgcag tggcagatat cagctcactg caagctctgc 12601 ctctcaggtt cacgccattc tcctgcctca gcctccggag tagctggtac tacaggcgcc 12661 tgccaccacg cccagctaat tttttttttt ttgtattttt agtggagacg gggtttcacg 12721 gtgttagcca ggatggtctc catctcctga cctcatgatc cacctgcctc ggcctcccaa 12781 agtgctggga ttacaggcgt gagccaccac gcccggcctg atttctggat agtttttaca 12841 tcaaccgtgg tcaagccaga gtcccccacc ttgttcttct tcatttctga tccagaaatg 12901 ctgattctcc ccctgacatt tcaccttttc cccttgcctg gggatgtccc tgggatcctg 12961 catctgtcac agagcatgct cattctctcc agctgtgaat tttgtttgaa ctattgggac 13021 tcaggacata gtcctgaaag tttacctcca cagtgacatc tttaggcaag tccaacattt 13081 acgtgcctcc tgggctggag ggtcgttgtg cagacagctg tcccctgagc cctggtggct 13141 ggtcctagca cagttgctgg agacatccca tgtccgtagt tggaaatatg cacaaaggat 13201 tgcttactct ttttgtttgt 
ttgttttttt gagatggagt cttgctcttg tccccaaggc 13261 tggagttcaa tggcacgatc tcggctcact gcaacctccg cctcctgggt tcaagcagtt 13321 ctcctgctca ccccctgagt agctgggatt acaggtgccc gccactgtgc ccagctaatt 13381 tttgtatttt aagtagagac ggggtttcac catgttggcc aggctggtct cgaactcctg 13441 gcctcaggtg acccaccagc ctcggcctct caaagtgctg ggattacagg cgtgagcctg 13501 ccgagagctt ggtcggggag acctgaaccc agcggtgcta aaggaattaa agacaaacac 13561 acataaatat agaggtgtgg agtgggaaat caggggtatc acagccttca gagctgacag 13621 cctcgaacag atttacccac atatttattg acagcaagcc agtgataagc attgtttcta 13681 cagattatag attaactaaa agtattcctt atgggaaaca aagggatggg ctctggttgg 13741 ttatctgcag caggagcatg tccttaaatc acagatcgct catgctattg tttgtggttt 13801 aagaacgcct ttaagcggtt ttccgccctg ggtgggccag gttttccttg ccctcattcc 13861 ggtaaaccca caaacttcca gtgtgggtgt cgtggctatc acaaacatgt cacagtgctg 13921 cagagatttt gtttatggcc agattttggg ggcctcttcc caacatgagc cactgtgcct 13981 ggcaggatgt gcttactctt ggtgaaccca cacaatgtcc ttctctttct taatgctcag 14041 atgtgcattt agtgttcagt ttgtagaccg ttctgaaatt tggctggatc tgtgggtctg 14101 tgtttttcag aatctgtgca attcctcttt gtctgcaacc acacttctgg ctcttcccat 14161 gaaacgtcag ggctgggtcg taattatcag atctgacaac ctggctttcc cggaagacca 14221 gagttctgcc agctcctcta gggatcctgg tgcctgatcc ctcccttaca tgcaccatgc 14281 tctttatagt gtcacctccc tcagcagaca ccgctgagcc tccccgctgg gccagggggc 14341 tagctaggct aaattcacaa aactccatct cccatacttc aaagaccacc cacatggaca 14401 gcccagccca ggtggcaggt ccgatgatgg gacagaggct gtaggtgggg gacctagggc 14461 tgcacttgag cagaatcttt tttttttttt tctttttttt ttttttgaga cagagtctcg 14521 ctctgtcacc caggctggag tgcagtggcg tgatctcggc tcactgcaca cctccacctc 14581 cttggttcca agcgattctc ctgcctcagc ctcccaagta ggtgggacta caggcacaca 14641 ccaccacact cggctaattt ttgtattttt aatagagaca gggttttgct gtgtcggcca 14701 ggctggtctc gaactcctga cctcaggtaa tccgcccacc ttggcttctc aaagtgttgg 14761 gattacaggt gtgccaggcc aagcagaatc ttaaaaaaag gtggggagaa gctggtgagc 14821 aggtggattt ggttgaagca ggatgtcgac acagaggggg cttggtgggt aaaggccctg 14881 
agctgtgtga ggtgaggtgc ctttagggct acctgccact gggtggagct gaagtgaaga 14941 tttggactgg ggtgggaaga aggtagttca ggatttcagg ggcccctgta agccccacta 15001 aggagctaaa ctgtttttgc ttggttgttt tctttttctc ttttcttttt tttcccgaag 15061 caatgaggtc ttgctttgtt gcccaggctg gtctcgaact cctgagctca ggcaatccgc 15121 ctactttgga ctctcaaagt gctaggatta caggcgtgag ccactgtgcc tggcaggagc 15181 taaacttgat tagaggaaca gaagagagcc acacgtgggc tcagaggcag ggtgctcagt 15241 ttcctgcaca ttgggatgca ccacttgggc tgctgggcat aggtggatga gggtatggga 15301 agacgtgggg gccccactgg tggtcactgt ggggtctagt tggaggagac ggtagcccag 15361 ctggggtgaa gaggagaggc agacacagga cataggtagg gacaaagaag cagagcatgt 15421 ggctctgctc cgacctccac ccaatcagga cggccctgtc tttcagaaag tcccaccgcc 15481 tcattctggc ttctcagagg ccctcagcct tccttgcgcc cctggtgctg gtgttcttcc 15541 tgctgcccct gagctgagtg ccctgggcag cagtgtccat cctcagttgg ggcaggacca 15601 tgcctgggag agtgcccgat gctcaagggt gccttcgtct ctggggtctg ggaccccaga 15661 aagctcacct gtcctcccct tctgccagag ccccatagtc ccatgcctct gtgcaggcat 15721 taatgtcccc aggttacaga agagcgagca ggaaggagta gcctgtggtc cctcagcaag 15781 ggtgtggggt cctgcttcaa tacccaagcc cctgactcta gggccctgat ctttgtcagc 15841 tatgtcccca tgccgggcat caaaaactca ccctcccaag gtatcttaac cttccctgat 15901 ctgtcatcca aattggacca gaggagctag acctggaaga atcacttccg catccaccag 15961 ggacagaact gtcaggaggg aaggggcagg gtgcgttgtc tcacccctgt aatcccagca 16021 ctctgggagg gtgagacaga aggattgctt gaggccagga gttaaaaacc agcctggtca 16081 acatagcaag actccatctc tacaaaaaaa aaatattaaa aaatcagcca ggcacagtgg 16141 tgtgtgtctg tagtcccagc tactgggaat actgaggtga gaggattgct taagcccggg 16201 agggcgaggc tgtagtgagc catgatcata ccactgcact agagcctgga caacagagtg 16261 agaccgaatc actaaaaata aattttttga aaaaggagga aaggggtctc cctttgtctt 16321 tgaaatacag tactgtacct tcatctggcc agggcattgc tccgctccct cctctgacca 16381 cctcctttta tttgcaccct ccagctttcc tgtgtggccc cacactcagg gtactctggc 16441 ggcggggtgg tgaggttgtt taaggtggga agggggcctg tccttcccac cttgaacctc 16501 cctgcctttg agactgggct gtggagggga gacatcccct gtgccattgg 
tgactgctct 16561 ctctcccacc tcagcacccg tccgtcccac tggctaactg ctgggtcgac gaggactgcc 16621 ccgaagggga gggaggcaca cacagccacg gtaactgtgg gctctgtctt ccagtgcccc 16681 cagcagggtg ggggccgggc tgggatcctg ggtggctcct gagtgcaggc cctgctcgcc 16741 tctgtccctg catctctctt tctgccaaca accccctggc tgaaggcctc cccaggcctg 16801 cagagatttg aaggtctgga gttcatcttt tgttttctag gtgtaaaaac aggccagtgt 16861 gtggtgttca atgggaccca caggacctgt gagatctgga gttggtgccc cgtggagagt 16921 ggcgttgtgc cctcgtaagt gtccccacaa tcccctaccc caactggcgc agggccccag 16981 gcctggcaga ggctgtcacc tcccttccac ctgcaggagg cccctgctgg cccaggccca 17041 gaacttcaca ctgttcatca aaaacacagt caccttcagc aagttcaact tctctaagta 17101 agcagagtgg gtctcatctg ccccaagacc ctccttgtcc cctacctcat ctgacctttc 17161 ccactcctcc caggtccaat gccttggaga cctgggaccc cacctatttt aagcactgcc 17221 gctatgaacc acaattcagc ccctactgtc ccgtgttccg cattggggac ctcgtggcca 17281 aggctggagg gaccttcgag gacctggcgt tgctggtggg tcccaagttg ggggcagggt 17341 tcctagaggg ctctgggaga gggtcccggg cccacccacc ggtggaaaag ctatgtgcta 17401 tgtgcagggt ggctctgtag gcatcagagt tcactgggat tgtgacctgg acaccgggga 17461 ctctggctgc tggcctcact actccttcca gctgcaggag aagagctaca acttcaggtg 17521 aggccccaca gctcccagtg cccagatgct gggcccatcg ccctctccct gtggcggcca 17581 ggacagacca cacacaggcc caggcctcta gatattccac tacgtgtgtc aagggggtcc 17641 caggagcagg agagagctgt tctcaacccc acatcctcca gcacaggctc cgtcctgctg 17701 ccccaagtcc tgaaccctcc accccatctg tcccaggccc cggcccatct caggctcctc 17761 actgccagcc cttcctccac cccacctcgc ttctagtatc tcccctccac agcaatgggg 17821 tgtttcattt ttactttccc cttctcccct tcagctttgt tttttttttt tttaagacag 17881 aatctcattc tgtcacccag gctggagtgc agtggcccga cctcggctca ctgtaacctc 17941 tgcttcctgg gttcaaccga ttctccttcc tcagcctcct gagtagctgg aattacaggt 18001 gctcgccact actcccagct aatttttata ttttggtaga gatgggtttt cacaatgttg 18061 gccaggctgg tctcaaaccc ctgacctcag gtgatccacc cacctcagcc tcccgaaggg 18121 ctaggattac agacgtaaac caccatgtct ggcctccctt ccgcttttac ctaaactttt 18181 tttttttttt tgagatggag tctcactctg 
tcgcccaggc tggagtacag tggcgggatc 18241 tcagctcact gcaagttccg cttcccgggt tcacgccatt ctcctgcctc agcctcccaa 18301 gtagctggga ctacgggtgc acgcctccac gcccggctaa tttttgcatt tttagtagag 18361 acagggtttc accatgttgg ccaggatggt ctcgatctct tgacctcgtg atccacctgc 18421 ctcagcctcc catagtgctg ggattacagg cgtgagccac cacgcccgac cttttttttt 18481 gaaacggagt tttcactttc ttgtggtcca ggctggaatg caatggcgtg gtcttggctc 18541 actgcaacct ctgcctcctg ggttcaggtg attttccagc ctctgcctcc agagtagctg 18601 ggatgacagg tgtgcaccac cacacccaac taatttttgt atttttagta gagatggtgt 18661 tttgccatgt tggccaggct ggtctcgaac ttctgacctc aggtgatctg ccgacttcag 18721 cctcccaaag tgctgggatt acaggcatga gccaccaagc cggttttttt gggggggttt 18781 tttttttttt tttttttgag atgaagtttt gctcttgttg cccagactgg agtgcagtgg 18841 cccgatctcg gctcactgca atcctttgcc tctcgggttc aagcaattct cctgcctcag 18901 cctcctgagt agctgtgatt acaggtgcac accaccacac ccagctaata ttttgtgttt 18961 ttactagaga tggggtttca ccatattggt caggctggtc tcgcactcct gacctcaggt 19021 gatccacctg cctcagcctc ccaaagtgct gggattacag gtgtgagcca ctgtgcctgg 19081 cctcaagttt cataaattgc atttattatc atgtctttga gtcttctaag cagatctatt 19141 ggatccttct gccaccgagc gtcacctcgt catgcaggca ggcacacacg accaccaggc 19201 ctggggatga tgcccctcaa catagctcac tgcaccccgt ctgatctggc ttccccaacc 19261 tccccagccc ttcgaaacca cgtggggctg gctcccaccc acatcctgtt cccctgacct 19321 ctgtgttggc aaaccacctg tgtgcatgtt ccttcaggcc cagcctcatg tcccctccag 19381 gaagtctacc ccagttccca gggaagagtg agttcccatc tctggaatcc ctcagccctg 19441 agcctgcccc ttcacatccc ccgctgctgg gtctgtttag ggactcctct gtcccccgtc 19501 ctctcagcag gcagggaact tctgagggac aggtcttcgt ttgctttttc tgttttctca 19561 ccaattacat agggctgaga cccaggactc aggcttgggc tgggggttta tagagtcaat 19621 tgaccaagtt ggacagaggt ctggcagggc cagccccacc tgggggtggg caaagcaggt 19681 caccagagcc ttctttcctg cccacaggac agccactcac tggtgggagc aaccgggtgt 19741 ggaggcccgc accctgctca agctctatgg aatccgcttc gacatcctcg tcaccgggca 19801 ggtaggcaca ggtaggggtc aggccgggga tgggatgggg caggcagaca gggctggagg 19861 aggcatgagg 
ctgacagtcg tgggctgaga ggttcagctc agatctctct caggcaggga 19921 agttcgggct catccccacg gccgtcacac tgggcaccgg ggcagcttgg ctgggcgtgg 19981 tgagtgcgaa cactgtgggc acctgcaggc tgcagtgagt gctgctgacc agggtgtgtc 20041 caatgcatgc tggagcctcc ggtgcctgca cattgagtct cggggtgcag gctggggagg 20101 tggcaggaaa gcaggctcgg gggctggaac atgggttggc cctgcctctc ccaggtcacc 20161 tttttctgtg acctgctact gctgtatgtg gatagagaag cccatttcta ctggaggaca 20221 aagtatgagg aggtgagctg aggtcgctct gcttggaccc tgggttctgc cacacttagg 20281 aagatgttgg ctggatccct gacctgctgt cctcatctgc aggccaaggc cccgaaagca 20341 accgccaact ctgtgtggag ggagctggcc cttgcatccc aagcccgact ggccgagtgc 20401 ctcagacgga gctcagcacc tgcacccacg gccactgctg ctgggagtca gacacagaca 20461 ccaggatggc cctgtccaag ttctgacacc cacttgccaa cccattccgg gagcctgtag 20521 ccgttccctg ctggttgaga gttgggggct gggaagggcg gggccctgcc tggggatttc 20581 aaggatgagg ccccagcatg gaggattggg ggtagaattc cacccttgaa ccccagcaaa 20641 cagtccctcc cctgactccc accttggtag ggtgctgcct cagggagcca taaaagtcgg 20701 ctgtgttttg agacggcgac agaacctgac ccgtggagac tgggagagcc cagcaggcac 20761 ctgtattgca gggctccgac tgcatgtggc aggggctcct gctgcgtctg ggcctgaagg 20821 tctctctccc agtgctctgt ccccagtgtt cctagcagag gtatgcttac cagctgtcag 20881 cacaaaccct cctgctgcct gggtcctggc cctcctcccc catctgcacc cccatcatag 20941 gtagagaccc caccctccca tcggtcctac atggggctgt gcagctggag ccaaaaaggc 21001 aaggtagaaa gaggagtgat gggggagggg gattgtttca gcttctctgg tgctgtgatg 21061 ccccaggaga gtcctaatct agggaatggg gtggagtagg cagataatcc acctccctat 21121 cccccaggca agggcggagc atgtgtcttg ggcccacacc tgcttagttt atgaggaccg 21181 gctgctttcc agtggtagcc cttttgccat ggaggtctgg gagagagagc agagggcggc 21241 agggctaagt tggtgatcac tgggttcttc aggaccttct atatccctcc tcggtgaccc 21301 cccagcccaa ccccttggaa tctttcctcc aggcttcctg agagccctgg gggtgggagg 21361 ctgtgggagg ctgtacatct gaaattcact tcagtccaag tcatacctag gaagctgtct 21421 gggcagctgc tcgagggagg ccctggctct gatcccaggc tggatggagt ggctggaagg 21481 aatggttcca aacaacacca ccgagatctc cctcaggctg gccaggtttt gcagctggaa 
21541 ttctcctctt ggtcccaggg cggggcaggg aattctaagt gtccacccca gggaggcaag 21601 gggctgcttt ccactgtggg tacctggtga tcagggcaag ctgtggaggg ccaggggtgg 21661 ggctgagact gggctgacat ctagaatcac ctgccacctg gagcctcagt aaaatgcctg 21721 gggtccctgc tgcctctcaa tctccagagc catgtccatg gggaggtggg ctctgaaggt 21781 gaaggtggga gagcaggccc ctgaggcctg ggtatccaac gaagggcacg tgcacctgat 21841 tctccttggg gcccagagga agctgatgtc atggctggac aaagtcacgg agtaaagcca 21901 gcaaagccac cctcttcctg tgtagtcctt acaggcatga ctggaaagtt ggggggcatc 21961 tatggtagac atggcacagc catgaagaga ccagtggggt ggtgcagggt ggacttgggg 22021 accctacccc tgaagactga ggccctgcag ctaccaggtg ggctagaagg taactggaac 22081 aggcctgggc acttgtgcac ccatgtagga gcatgagggc cacactcttt tcacctcaaa 22141 gcccttgaag agtgggcaaa gacagcaaga gagctgcagc ctgggcccga gctcagaaac 22201 agctgtcgcc tcagtctgcg cacaagcatg caccccaggg tagtgcctgc agggatgcat 22261 gtgtccccgt gggggtgcct gtgccaggca ggcctcaggt gcatgccatg ctcagaaccc 22321 tgctgccctt tctaggcagc ctccttgggg cccaagctct gctccctgga tctgccacct 22381 agcagacgtg gggagcctga ccccatgcct gtcatggaac cctccttgcc tggtgtgtgt 22441 ggctcccctc ttcactgggc acctggatcc aggcccacct gtgtccctga ctcagggtgg 22501 tcccaggcct ggcacctact ctttagagag ccccagcatc tttgatgtgg attggagaca 22561 attgcctggt tccctggggc aggtgaagac ttggtgccac aaagaatgcc acagtggata 22621 cgccagcagg ccacatggct ggccaagcaa ttattattat ggatcccttg ggctgtgggc 22681 cttcccatcc accccaccac aactgcccag gtagctggag ctgatcataa acaagaaggc 22741 tctgggcaga gtccatggca ccagcaccag ccaaggccca ctcctgaaga cccgaagccc 22801 agcccctgga tgaaggtcct aaggtcctga ggactcccca gcctgtgcag gcctgcaaac 22861 ccaggctgcc cacaacagaa ggggctctcg gcttgtctgg cctctctggc ctcccaagca 22921 ggtgtgggag ggcggggcaa gtgtgggctg atcagctact ccatatggcc agggtcctgt 22981 gctggtgcct ggctgggggg ctgcatagcc tgcactgtct cctccaggct gcccctgggg 23041 aataccacgt agtgtgtgga gttcagccct ggcagctccc gctggttctc cttgctatgc 23101 cggatgccat agccgaaata cactgcaagt cctagacagg gcaggaggca gggcatgagc 23161 ctgaggtaca ggttccagcc cttcctgtcc tctttgccct 
cctcctgacc ccggtcccag 23221 cctggccccc actcacccat cagcagccag atggagaagc gcacccaggt cagatagcta 23281 agtttcagca tgaggcagat gttgaggacg atgctcaggg ctggaatcag gggaaccatg 23341 gggatctgag gaggcagagg cagggcaggg ctgggccggg ctgcaggaaa gatctgccag 23401 cccagggctc actttctcgg gaatccatag agcctttgtt cctcacggga gattgtggag 23461 acatgtgctc actcaccatg cagaaagggg tgcgggatgg gtgtgtggtc ctccccagcc 23521 ccgtgagact tgttcattct gggatggtag tgggggaagg ggaggcacct gcctgggccg 23581 agtgtgagtg ggtgttggct aggggaggta cctgaaataa gtcttcccga tactgttgct 23641 ggtgagcccc caggacaagg aggctgagca gaaacatgac actggtgagc aggagcagca 23701 ggatgtaacc ccagtgtggg aggtgcaggg tcgagttccc aaagacaagc acgcagccta 23761 tggtgatggc tgaggccaac ataacgccaa gcgcccaagt caccactgct ccagggctgt 23821 acccatccaa gaagcccagg tagggcctca gggctggctt cagctccccc tggctcaggg 23881 acggaggcgt gtacagtgcc caccagctgt aggtggtctg agaaggagct ctgctgcttg 23941 gtcagggggc cagggctggc tgggcctggg gagctgggcg gggaagactt ctggaagcgc 24001 agcacaatga tactggtggc cacgaatgtg taggccagga gtgtgccaag ggacaggaac 24061 tgaaccagcg actccaggtc cagcagcagt gccaggaagg ccgtgaggag cccgaacgcc 24121 agggtgcccg ccacaggcac ctgtgtccgg gggtgcacat gggcaaacac ctggaagaag 24181 agcccatcgg cggccatggc atagacaatg cgtggcaggg agaagaggag gctgagcagg 24241 acggtgttca tggctgtagc agagagagtg gggagggtca gcatggcgga gaagcccacc 24301 aggggcctct gctgggggtc ggtgcatggg tggctccttg ggaacaaggg catgagcttg 24361 tctgggctct cagagtaagc atgagtttgt gtaggagtaa tagattgtca gcgctttgcc 24421 caagaacagg acatctgggt taggtcacag gcccactgga gcatcggatg agggctgagt 24481 ccctctgcct aggaaataat ctataaggac atgcccagaa ctaatatgct atggaccatg 24541 tgtggggcgg tcccttccaa agtttctgaa gaaacttttt ttgaagtgaa gagtttctaa 24601 taattagcac tttccttgcc ccattatgca actagtaaaa cagccccagt tgaaggtgca 24661 ttccccaaga atctgaggac aaggtccccc ttgctctttt ccctgccacc ctgcccagga 24721 agagtctctc accgcagatg gagccagctg ccacgatgaa gccagcccac ctgtagcccc 24781 gctggtagaa ggcatctgca agcgctgagt cggggtccag gctgtgccag ggcaccatga 24841 gggttagcac ggtggagaca 
aggatgtagg caccagctgc aatggcaagc gagatggcga 24901 tggccagagg cacagaccgc cgtgggttct gggcctcctc actggaggcg gcaatgacgt 24961 cgaagcccac gaaagcatag aagcaggagg cagtgccggc catgacgccg gagaagccga 25021 agggtgcaaa gccgccttcg tcagcgctcc agttgtgagg ctgggccagg atgaagccca 25081 ggatgacaat gaagagaatg acaagcaggc tgatggccga gaaggtgtga ttgagccagg 25141 aggacacgcg ggctccacag gagacaaagg cagaggccag gaggatgatg ccagcagcca 25201 ggaagtccgg gtagtggccc aggaggggca cctgccaaga acccacgtgg gtctcagtga 25261 agttgcggat gctgtggctg aacatagagt ccaggtagcc actccaggca cgggccacgg 25321 cggcgccacc gatgatgttt tcgaggagaa cattccagcc gatgaggaag gcccacagct 25381 cgcccatgga tacgtaggtg aacaggtagg cagagcccgt gcgtggcaca cgtgccccaa 25441 attctgcata gcatagggct gccagcaggg aggccacacc ggccacaccg aaggacaaga 25501 gcacagcagg gccagccacc tccttggcca cggcacctgt gagcacgtag agacccgagc 25561 ccaccatgcc acccacgccc agaagagtca ggtccagcgt ggacaggcag cgccgcagtg 25621 acgtctccat ggtggagtcc tccagcggct tcaggcggtt cagcttctgg cataagcgtg 25681 ccaggctagc aatggtgggc agcccccggg ccatggcagg tggccgagaa gagcaccgag 25741 ccagctactg gaacctgcta gggccagagg ggaatgacag gatgcgtagc cggggcctga 25801 ccagccttcc atccctcctg gtctgaccac ccccagtcct ctctctgtcc ctgacctggg 25861 gacctggaca ccctcaccag gggcctgctg gagagctgga atggggcagg tgtcagaacc 25921 tgtgggcagg aggtcacagg aagcctggag gccctgtctg catcccaggc agcaaaggag 25981 gcctggtgag gccccacctg ctcacccgct cggtggccct gaggggtggg gagggagcag 26041 ctccaggggg agcagggtga gaggctggga acggctcagt ggtggggtcc cagagccagg 26101 tgaaaccgag gcttccccag agtcgagccg gacccggaac aggagccgcg gctctgcgtc 26161 ggggcgggtg tggcccaggc agccacctgg ccacagacag acagtggcgg ctacatacac 26221 ccccgcccct gcacccgtgg ccaggctggg gtatccgcgc gccaggcgga atgcaagggc 26281 gccctgggcc cgcacatctc cccatgtccg acacgccgct gctgctagag cctgctccgc 26341 cgcgcccccg tccccggaca cctgcgcctt cgcctgttct tgggcctgag cccgtcccag 26401 tccccgtccc cgcaggatct gctgcaaccc acctgctgct gccgcagccg ctgcctccgc 26461 tctgagcact gagcccgccc agttcgccgg cgccgagcgc agggcccgcc cccatcccag 26521 
gccccccgcg cggccccacc cgctgccaca cacccagata tttcaccctg cccactctgc 26581 ccgacctcgc aaggtcgtta ccggagctca ccggggtcta gggaagccct gtgactgtga 26641 ctcagtcaca gatggagaag gtcagaccgc cggctaactg actagcctag gtcagttcgg 26701 ggaggttaag ggccacgtct gaaagtccgc agcccacctt ctttctactg cagcggacac 26761 ccttcttgat ccctcacccc agtacatgaa ccccacccag ctgggcccca gacaggcccc 26821 acagcaccgt catgtgagct tcaggcacag tgagttagat gaagacatgg gtctggcaga 26881 gtggcctggg gcggtccctg acctctactc ttacaatgga agagactgca gccttttgag 26941 agagttctgc gaattgaggg aatgcattct ggctgttttg agcctgcagc taccacgtcc 27001 tgcaaggaaa caagcagctt ctgggaccag cctgcagccc tcctgtgggg gagtagagct 27061 cctcagaccc cagtggcagc ccttcctagc ctaagcacct ctggcaggcc gggagggccc 27121 tatgggaccc caataccaaa tccccacttg tgtttatcag cgacagccac tgtgtcctga 27181 gtactgcatt ccagtgcccc acgccccagg caggcagcta tgcaacaata cacctggcta 27241 attttttaat tttaattttt ttagagactg ggtcacacta tgctgcccgg gctggtctcg 27301 agctcttggc ctcccaaaag gctgtgatta ctttcatgaa ccatggctcc tgccctcccc 27361 agaacgctat actagacatg tgtccaggct tacatgtggt tggtggaatg gaatcttagg 27421 agcatctgca gcctatccag cctattgggc tggtgacagg atgatggact cccagacaag 27481 atggtgccaa tttggaacat tctgcctcgg ccagaatgcc aataccttct aatatttggc 27541 tggtattgtc tggaggaact tctcagatgg ctcccacttc cggtgggatg gtagcttcct 27601 taccccatcc ctgtgttgcc tgagcagacg tcccttgtcc tgaaatgtca ccacaaaagt 27661 ctaggcactt caggccagag ggttaggcag atttgtaatg ggagctcccc aaaggcctta 27721 ccaatgagtg ccttattgct ggtgtccagt gttttggagt gttgactttt tatctacctg 27781 gagaaggtgt gacaccacac cctctgccac aggcaccaac atacaacgga tatgtgagac 27841 taagcagggc tgatgcagta aattgtacat ttccagagat atttccctca ctactgagtg 27901 agacagctac acagagcaac cccacagtga tgcagtgaag acaaaccacc aggaaacaaa 27961 aactttctaa ctcattctgc aaggccagga ttaccctgac actaaactca acaaagacat 28021 cacaagaaaa gaaaaccaca gaccaaaatc ccttatgaat atagatgtag aaatccccag 28081 caaaacacag tggactcttg aaccacacag gtttactaat acatagatat ttttcaacca 28141 aatgtcgatc aaaaatacag tatttggcta ggcccagtgg ctcacaccgg 
taatcccatc 28201 actttgggag gctgaggcga gaggatcact tgaacccagg agtctgaggc tgcagtgagc 28261 tatgatcctg tcactgcact tcagcctagg tggcaaagca agaccctgtc tccaaaacta 28321 ctaaataaat aaattaaaat aagtggtggg tgcctgtaat cgcagctact tgggaggctg 28381 aggcaggaga atcgcttgaa cccgggaggc agaggttgca atgagctgag atggcgccat 28441 tgcactccag cgtgggtgac agagcaagac tctgtctcaa aaaaaaataa aaaacaaaaa 28501 acaaaaaatt catatggaaa tgcaagggat ccagaatagc caaaatactt gggtattttg 28561 cctgggtaac atagggagac ctcacctcta cggaaaggta aaaaaattat atgggcatgg 28621 tggcatgtgc ctgtggtctc agctactggg agtcgggggt ggtgttgagg tggaaggatt 28681 tcttgagcct gggaggttga ggttgcagtg aggcatgttt gtgccactgc actccagctt 28741 gggtgatagt cagaccctgt ctcaaaaaaa aaagaaaaaa gtaaagaaag aaattaaagg 28801 caacataaat aaatggaaag acatcccatg ttcatagact gtaagacttt aaatataaaa 28861 tggcagtatt ccccaaactg gtctacagat tcaggggttg catagattga gtgcaattcc 28921 tattgaaatc ccaactacct gttttgcaaa aatcatcaaa attcatgtgg aaatgcaagg 28981 gatc // LOCUS AB005803 15499 bp DNA PRI 07-AUG-1997 DEFINITION Homo sapiens DNA for histidine-rich glycoprotein, complete cds. ACCESSION AB005803 NID g2280513 KEYWORDS histidine-rich glycoprotein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15499) AUTHORS Wakabayashi,S., Takahashi,K., Tokunaga,F. and Koide,T. TITLE Direct Submission JOURNAL Submitted (16-JUL-1997) to the DDBJ/EMBL/GenBank databases. Sadao Wakabayashi, Faculty of Science Himeji Institute of Technology, Department of Life Science; 1479-1 Kanaji Kamigori-cho, Akou-gun, Hyogo 678-12, Japan (E-mail:wakabaya@sci.himeji-tech.ac.jp, Tel:07915-8-0210, Fax:07915-8-0210) REFERENCE 2 (bases 1881 to 2300) AUTHORS Wakabayashi,S., Takahashi,K., Tokunaga,F. and Koide,T. 
TITLE Structural analysis of human histidine-rich glycoprotein gene JOURNAL Unpublished (1997) REFERENCE 3 (bases 1 to 15499) AUTHORS Wakabayashi,S., Takahashi,K., Tokunaga,F. and Koide,T. TITLE Complete nucleotide sequence of human histidine-rich glycoprotein gene JOURNAL Unpublished (1997) REFERENCE 4 (sites) AUTHORS Koide,T., Foster,D., Yoshitake,S. and Davie,E.W. TITLE Amino acid sequence of human histidine-rich glycoprotein derived from the nucleotide sequence of its cDNA JOURNAL Biochemistry 25 (8), 2220-2225 (1986) MEDLINE 86216149 FEATURES Location/Qualifiers source 1..15499 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q28-29" repeat_region complement(1359..1638) /rpt_family="Alu" exon 2278..2483 CDS join(2301..2483,5205..5321,6208..6298,7892..8058, 9055..9135,11357..11458,13312..14148) /codon_start=1 /product="histidine-rich glycoprotein" /db_xref="PID:d1022463" /db_xref="PID:g2280514" /translation="MKALIAALLLITLQYSCAVSPTDCSAVEPEAEKALDLINKRRRD GYLFQLLRIADAHLDRVENTTVYYLVLDVQESDCSVLSRKYWNDCEPPDSRRPSEIVI GQCKVIATRHSHESQDLRVIDFNCTTSSVSSALANTKDSPVLIDFFEDTERYRKQANK ALEKYKEENDDFASFRVDRIERVARVRGGEGTGYFVDFSVRNCPRHHFPRHPNVFGFC RADLFYDVEALDLESPKNLVINCEVFDPQEHENINGVPPHLGHPFHWGGHERSSTTKP PFKPHGSRDHHHPHKPHEHGPPPPPDERDHSHGPPLPQGPPPLLPMSCSSCQHATFGT NGAQRHSHNNNSSDLHPHKHHSHEQHPHGHHPHAHHPHEHDTHRQHPHGHHPHGHHPH GHHPHGHHPHGHHPHCHDFQDYGPCDPPPHNQGHCCHGHGPPPGHLRRRGPGKGPRPF HCRQIGSVYRLPPLRKGEVLPLPEANFPSFPLPHHKHPLKPDNQPFPQSVSESCPGKF KSGFPQVSMFFTHTFPK" mat_peptide join(2355..2483,5205..5321,6208..6298,7892..8058, 9055..9135,11357..11458,13312..14145) /product="histidine-rich glycoprotein" old_sequence 2417 /citation=[4] /replace="c" repeat_region 2901..3181 /rpt_family="Alu" exon 5205..5321 exon 6208..6298 repeat_region complement(6835..7116) /rpt_family="Alu" repeat_region 7342..7478 /note="partial" /rpt_family="Alu" exon 7892..8058 exon 9055..9135 repeat_region complement(9389..9680) /rpt_family="Alu" exon 11357..11458 repeat_region complement(12152..12433) 
/rpt_family="Alu" repeat_region 12802..13077 /rpt_family="Alu" exon 13312..14500 BASE COUNT 4561 a 3384 c 3277 g 4277 t ORIGIN 1 catctgaggc cactctctag ttcccataat gctaatagga accatcccca aagccaaata 61 tcacttaccg gatcagagtt ctcaggacag aatgtctctc ttactaccat tcagcttcct 121 cccaaccatt accatcccag cacccttaat gtgctctctc ccctgaagct gctccagttg 181 ctagaatggc atcattagta ctgtattgct ctttaagatt cagtaaagca acaggaagag 241 gatgcatgac tccctgaggt actctgtagt ctcagcagaa cagcactgta ggaaaattat 301 attcctgttc ttgatgcctc ttcccatccc ctcaacattc ccattccttt ccctacacca 361 gcatttggtt cacccaggac accccattta gggtgcagcc aatatagcca taatagaaaa 421 acttcattaa ggagctcaac tcataataac aaaaactcaa aatagttttt gttttaatca 481 atagctttcc ccttccccac tgggttggag cctcttaaca ccttgaccag attccataca 541 ttcctgtcat tttctgttgc actggactgg tcccattggg tatttaaaag ccagggttac 601 tgacattaaa agttcaattg caaacattat tcaaaacctg ttgcactcat taaccctgtg 661 acttgtggaa atgagatcta cttttcatac acctcccttt tctttctgtc ttcaaatcct 721 ttccctctct tcagttcact ctagaatccc agtaattctc aaccttgact gcacacactg 781 gattacctag gagtttaaaa actaccaata ccaaagtcca ctcccagaga ttctgattaa 841 attgtcctgg aggagtcctt gacatcagga ctttttaaag ctcctggggt gattctaaca 901 tgaagccgag tctgagagac atgactgcag actctcacag acactcactg cggatagaac 961 agggaggaat ggagatgctg cctaactggc tgtttcttgg cagtgaggac ggaacctagg 1021 ctgattccct ctggtccctt ctgatgctgc tggctctgct ggtctgaggg ttgcatatag 1081 gtatacgggt ttcttctgaa gtgtgaaggt agggggtcag cccagtcccc ccctgctgat 1141 cccttctcac agtgggatgg gcatccaact atttgactct tactctcacc accatcctct 1201 tccacgtgaa tcaattcttt ttgcagggaa ttggggtggc atgaggtgtc cccaacttct 1261 ttgcaggcac actaataata aatagcatca cttattcagc atctactcta tgccaagaat 1321 tatactggtt gctttctttt tttctttctt tcttttttag atggaatctc acactgtcgc 1381 ccgagctgga gtgcagtggc gcgatctcag ctcactgcaa tctccgcctc ccgggttcaa 1441 gtgattctcc tgcctcagcc tcccaagtag ctgcgattac aggcatgtgc caccatgccc 1501 agctaatttt ttgtattttt agtagagaca gggtttcacc atgttggcca gactggtctc 1561 aatctcctga ccttgtgatc agcctgcctt 
ggcctcccaa agtgctggga ttacaggcat 1621 gagccaccgc acccagcctg atttcttatt taattatcac acaagcctgc caagaggtat 1681 caatccatat ttcgtaaatg aggaaactat cagagaagtt ttgttgctta cccagcatta 1741 cacagtaaga aagtaataga gccaggtgag aaattcagat ttgacttcaa aacgacagcc 1801 tcttcattgt tctattttac ctgaagtcat tccttttata atggtacctg tttaacatgt 1861 tcccccgctt agataatgca gctgccttct actgaggagt tttagtgcca tgatccatca 1921 gtccactgac tgatgatgga gatttccagc tgagacaggc aggcagatgg aggaaaatta 1981 cccctagccc catcccctac accaacatag gcactgctta tattcaaaga cgtactttag 2041 atctgctatc tactagaaaa atccagattt gtgcagatca cagttccacc cactcatcag 2101 catttctcag ctttctgccc ccaggaccca ctaatcattt acagcttgtt gatggttatt 2161 tgaacccagg gtcaaggtga acaccctcag ttacacaaca aaaatgtatc aaggttttgt 2221 ttaatcaatt gcgtgtgttt cagaaagtct gtataaaatt ctctgcagtg gcagatcata 2281 gcaagggatg gtttaacaaa atgaaggcac tcattgcagc actgcttttg atcacattgc 2341 agtattcgtg tgccgtgagt cccactgact gcagtgctgt tgagccggag gctgagaaag 2401 ctctagacct gatcaataaa aggcgacggg atggctacct tttccaattg ctgcggattg 2461 ctgatgccca cttggacaga gtggtgagga attgccaatg gcagagctga gttgggagat 2521 tactcacggg ggcaaatgtg actctacccc ctaggcttac tgctttgcac aatgaatggc 2581 tttggtcaga gttttttgtt tggcttctgc ggctgctcta aattgttctt ttttccttca 2641 aactaatatt tgttattagt gtacccgcaa ctatgattcg atttgtgggt acatataagc 2701 atttatgctg agcccagtta catgctaagg actgccagag aaacattaga ataagctatg 2761 ggccttggcc tccaggaaat aataatacac tggcacatac aggcaaacag tgtaagttac 2821 taggataaac tagagagaag gaaaagtttt ctcaaccctc ctaaggctcc taactgagcc 2881 tgaaataaaa ctggtagaca aggccaggcg cggtggctca cgcctgtaat cccagcactt 2941 tgggaggcca aggcggacgg atcatgaggt caggagttca agaccagcct ggccaacatg 3001 gtgaaaccct gtctctacta aaaatacaaa aactagctgg gcatggtggt gcacgcctgt 3061 aatcccagct actcaggggg ttgaggcaga agaatcgctt gaactcggga ggcagaggtt 3121 gcagtgagtc gagatcatgc cactgcactc cagcctgggc aacaaagcta gactccatct 3181 caaaaaaaaa aaagaagaag aagaaaacat acaaatttat ttaatccaag gtttacctga 3241 cactggagtc ttcataaatg aagactcaga tatccaaaga 
aactggccac ttgcatactt 3301 agaattcaca aaggctgtgc agaatgcaga agtgtgatgg gacaaaaggc tgtaatctaa 3361 gagtcataga ctgaaagaag gacccagcaa ggcctctctg tttgagttct tcatggcctc 3421 tctgtgtggc atttcttgct cccaagtatg ggacaggatc cctttggaat gcggatctga 3481 ggacctacta tcaaacagag taggtcagag aatgtcttta tggccagctc ctacaaagaa 3541 agacacaggg ggaaggttag attcatattt taggatttat ggatgctttc ggggagaggg 3601 attctaccat ctgtgactgt cctggagaag agaaatttta gtttctatgg cttgccttgg 3661 gggagaaagc agagtaggag aacaaagggc agaaggtcag acagatctgg cttctgaggc 3721 ccttcccatt ttcttcagtt caaaggactc agcacgccaa aagtaccata ctttgagata 3781 ccatgttctg agccccaaca atagccaagt gaccaaagga gtggtaaatg tactggagca 3841 atttaatgag gggagatcta gtaagactgg atgggaggag gcttctgctg tgtaatgatg 3901 catgggtagg atttgaacag gtctgggtaa atgtgttcct ggtgaattcc tggcaccacc 3961 acttctaggt gggtatggaa tcaggaagaa gggtgttagg gagccattga atataagcag 4021 gtgggttcca gaagttccat gtgggctctg gagcaggcat caggtgggag ccaagcctgt 4081 gggcttgtct tgaggtgaaa caagcttact gtcttcactt agattacacg cctacctaaa 4141 ctggactaga ggagccgata gacattatag agagacatga agcctgcctc taagctcccc 4201 aaataaccta ctctccctct gatggaaaaa ctgaaattag ggaggccaga caaggagtac 4261 tacagaggac aatttaccct tctgaaagta tcacagaggg aggcactgtg ggttgcagag 4321 aacaggtgtg caggagttta ggattcaacc agtaattcag tgaccattag tcccatgcca 4381 ggctcatcaa cacatgttag ggcataggat gcatcagcac agagctgcct ctgagagcta 4441 gcatagagag cgggtggatg gggaggggtc ctgatgtgcc tggggaaact gaggcaagag 4501 acagggacat gaagtagaaa gctatttatg tattcaggaa tgaaagaaca agggtcagac 4561 ctgggttagt gggagtcagg gcaatgagac tgaggaatag acatgagaga cgttgccaag 4621 aaagaccatt cagggcctgg tactggatag caaaagcaag gacaaggaaa acatctcgta 4681 ggtttacatg ttttacccca gagtctaagg caatatgttg ttagtggtgc cactgacaaa 4741 gacgagggaa gtggagaggg cagctagttt gtaagagaag agagaggatt tagctttggg 4801 caaagcaaca tcaaagcaga aaatcccatg atcagtaata gagctgggca agaggacaca 4861 actacaagga cagattggag gccctgatga ggtcgtcatg acttatactt ccaggcatgc 4921 gtaggtgttg atgcctcctc taaaccacca ctgccgttta gtgatagctt 
cttcaagttt 4981 atgggcagaa gacaaatcag gaccacttaa aaagatttgg aaataactca ttgaagctat 5041 gtattggaat caatagatca aatgacgcaa aggtagagca tctgcaaaga cagtgagagg 5101 aacaggaatt ctgtttaagt aataaatata tagctttaat acataaaaaa aaaacccgct 5161 aggtttccat gtgctactca catgttgctt ttggcattcc tcaggaaaat acaactgtat 5221 attacttagt cttagatgtg caagaatcgg actgttcggt cctatccagg aaatactgga 5281 atgactgtga gccacctgat tccagacgtc catctgaaat agtaagtaaa gagggcacct 5341 tcactctgct cttttcattc ttattttcca tctactctgt tcactcagcc aatgcctggg 5401 ctaacacagt agaagagaaa gggcagcttt tgccttgaga ttcatgaaga tgatgttaac 5461 cttgagcaat actcttgaga gtctttcctg ggaacccagt tcaagacaac tcagcagggt 5521 gtgtggtccc atcaaagagc ataaccagct gtgggtcctt gggtaaatca tctaataaca 5581 tccctccact ttggtgtttt ttgactgtaa aataagaagt gggtttagga attcatggac 5641 tctaagggcc ttccactctt gacagtctac aggtatcaga atctatgaac ttgtaattgt 5701 gtcctatgaa caatggtcct attttcagga atggccaaag taattacttg cagtgaggaa 5761 ttgggtcttg ctcagcacct aggatataca ctgaggaata agatctggct ttgcatgtgg 5821 gaggtgggta ttgacatgcc aggctctctg acacaggtgg agaacatgaa gatgaataag 5881 cagagattca agatttcatt tccagaaaat gagaggttca cactcctgga actttcacca 5941 ctttttccct tgaaaatgtt tggggtatgg ccatgatgct ttaaatagtt tctcgctggt 6001 tctcctgagg attttgagtg ccaacaggat gcagcacaca gaaactgact aagaaaatgc 6061 agggtttgag acaagctttg atgaaagaaa tgcattttta aatgaacagt aggaggatgt 6121 tgcatcacct ttttgctctt tcatatctga cctgaggaag gcaggatgca atgactcagc 6181 aagtcctcaa gagcctcttt cttacaggtg atcggacaat gtaaggtaat agctacaaga 6241 cattcccatg aatctcagga cctcagagtg attgacttta actgcaccac aagttctggt 6301 atggtaatct gttaaattgc taattaattc ttgattaata caaattcaat catacagtgg 6361 tatttgtctc atctatgtac atatgtattc atatatattc agatgtattc atgtatatct 6421 atgatgtaca tgaaagaaaa gccttcatag actccattat tattcattct ccagagaata 6481 gaattgagac aaatttccct ggtactggca caccttcttt taaagaaatt caacacataa 6541 aaatagagat ttaggagagg taggttagcc tcattataga tataggcttt ggagtcaggc 6601 ttgctttagc ttgagtcatc atctgccagt tgtatagtct tgggaggaag taaagtcttg 
6661 ggactatgaa gtaaaggtta tgacactttc tatttaagac aaaataattg atgctcaaag 6721 aagttaaatg acttggcctc aagaacaaag ccaagatatt gtgaagatgt gacacaatcc 6781 caggagtatt aactctaaat tccacagctt ttttatttat ttatttttat ttttgagaca 6841 gagtgtcgct cttgctaccc aggctggagt gcaatagtgt gatctcggct cactgcaacc 6901 tccacctccc gggttcaagc aattctcctg tctcagcctc ccaagtagct aggattacag 6961 gcatatgcca ccacatccaa caaattttgt atttttagta gagatggagt ttctccatgt 7021 tggtcaggct ggcctcaaac tcttgacctc aggtgatctg cccaccttga cctcccaaag 7081 tgctgggatt acaggtgtga gccaccacgc ccagcctcca taccctttta atttacattt 7141 tcaaatagaa ttatcacagg attccttttt aaaatagcaa atccaaatat gatattatat 7201 acaaaaatgc aagatactgc cacaggagtt tgcattgaag aaaggatgat ctgtatcttc 7261 tcctctctct gggcaaaact ctatgttaat gctctgtctt cccccttggc aacaacagga 7321 agaacttatt aaaatattag tggcctggtg cggtggctca catctataat cccagcactt 7381 tgggaggttg aggccagagg gttgcttgag cccaggagtt tgagactagc ctgggcaatg 7441 tgactcaacc ccatctctat tatttaatat atatgttaat atatatatca tattaatacc 7501 taaatatatg atatataata taaaaacata tgatatatta tataatatat aaatatataa 7561 tagatattat ataatagata atatatattt atatattaac gaatgtcata gttgtcatat 7621 ttatatatat atgttagcac tttatcagtg gtactggtga aggcatcatg agtaggggtc 7681 acaacataga catggggatc agaaagactc gaaggaagaa tcagcagttt agttcggttg 7741 ccaaatggct gattgatctt gaagcagtta ctcagctctt cccaccctca atttgcccat 7801 ccttaaaatg aaagttattg tcattcttgc caccaagcca ccattaacat ttccagccct 7861 ttactgtgac actgctatct tctttctcca gtctcttcag cactggccaa taccaaagat 7921 agtccggtcc tcatagattt ctttgaggat actgagcgct acagaaaaca agccaacaaa 7981 gcccttgaga agtacaaaga ggagaatgat gactttgcct ctttcagagt ggaccgaatc 8041 gagagagttg caagagtggt gagtctccac taaggttcgg ttggagtctg aaggcccagg 8101 ctccaacctg gcagcaggaa ctgaatggca taccctctcc acctgcctca attgactagc 8161 tgccttttaa tttgtcactt ttgaggccat aaaagacagg cagcaggtat ttaaaaaata 8221 caaaaaaatt gctttcctca aacacttgtt ttttttttga aggatgggta ggggagtaga 8281 catctgacag agtagggaga aaatctaaag aggaaacagg gaaagacaag aacatatgag 8341 
ggttgtgtca ttgcaaatca tccaatacta acattcttgt actatgatgg aaaactgaag 8401 ttcagagaaa ttacctcacc acacccagta atgtgcctat agttagatgg ttccctgctc 8461 tctctacaag gacccaagta ctttcaccca gagctgtctt gtagagcaac tggttgacat 8521 ttgctatgca ttcagaaaat gcagcgttaa atagattgtg ttgcatacaa gtaaggaaat 8581 gagtaccatg cagttgctca aagaatgttt ggacccatgc gcactggagt ttgaaaacat 8641 attggtatag gaaaatgcag gttatagaac agtacttaga ctatggaccc atttatatga 8701 aacacttaga tctccctgtg tatgtgtaac acatctagga aaacactcga agaggaaaca 8761 catcaaataa atgagaatga aaactctggg aagaagttgg agtgcaggag gggtgatttt 8821 tcacactcta ctttatcata tatttattat ttgaaccttt tcagtgaact ggtattcacc 8881 ttatcactta ggtgatacat ttttaaaagg cagaaaccat tttaaaaaat gacacatact 8941 ggtgattttg aattctcata ctgccttaaa aaatattgaa tggtagttat ctactgcttt 9001 tttttccttt ttctttgtct gttcttgaaa ctattttgat cccatctgtt ctagagagga 9061 ggggaaggaa ctggttactt cgtggacttc tctgtgcgga actgccccag acaccatttc 9121 cccagacacc ccaatgtgag tataagaaat gtctgtgatc gttgactaga gtcacagaga 9181 aaaaagaaag agaagaagga gaaggagaag gagaaggaga ggaaggagaa gaaggagaag 9241 aggaatgaga aggagaaaat gaggaggagg agaaggagga agaaatgtct gtgaaataca 9301 aagtatcatc tgggaagcac ttagtacctg ccaggcactc ttctaagtga tttacacatc 9361 agataactta ttgttttgtt ttgtttttga gatggagttt cgctcttgtt gcccaggctg 9421 gagtgcaatg gcacagtgtc ggctcactgc aacctccgcc tcccgggttc aggcgattct 9481 cctgcctcag cctccggagt aactgggatt acaggcatgc accaccacgc ccagctaatt 9541 tttgtatttt tagtagagat ggggtttcac catgttggtc aggttggtct caaactcctt 9601 atctcaggtg tccacccact tcggctggga ttacaaatca aagtgctggg gttacaggcg 9661 caagccacca tgcccagcct atgataactt attaaccctc acactaccat atgagataga 9721 taccattttt aatcccaaat tagcagatag gggaactgag gcacagagaa cataagcgaa 9781 ccataagaac atgattttat aatatacatg atatatacct aaacttaaaa ctaagtttct 9841 aactgattag gattaaaggc ttagttagaa ctatagcagt tcgtactaag accctacgcc 9901 aagtggtttt attaattcca catcatcaca ctcacaggaa tcccacttga caaatgagga 9961 ggtaactgaa aatcagcaaa ggaaaggtct tgccgatgtt cacaaccaag ctgagattaa 10021 aacttagtta 
ttctcataca aagtccagtg ggatttttcc tctaatcatt tacaaataga 10081 ttcccatgta cctgccagaa gacattatta ataccacctg gcatttgcat ggtgacctac 10141 agttaaagaa acacctccac atctgttatt gccaggagag cctcatgaaa atcctgctga 10201 caaggaaaag caagtatctg cctacatttc ctgtgacaag gggacctgaa attaaaacca 10261 cttacctaaa ggcacttagc tagagagtgt taggaaatgg aattttaatt atgatgccaa 10321 gtccaggatt tttctgaggt ttaacaacac attctccctg gtcaaaaggg agaaaccatc 10381 cattagtacc tttgttagct cttctaatgt ccacactgaa ccaggtctag aaaccttctt 10441 agcgcccacc agaaaagaca cctccctccc ctactcctat ggccaacaaa cccttcatga 10501 tcttggtgtg gataatgtaa acccagagtt gaagcaccag agctgaagag gaggctcagg 10561 ggtaagtgtt ttacagcatc ctcttgttta attaatccct cccacaacct gtgagtacat 10621 agtattatcc ccattttaca atggctgact ttcagaaatc ttttgtaact ttcccaaact 10681 cagacaacta ggaaatggag aagccaggat taacataggt ctctctgatt cccaaactgt 10741 actcagaacc tcaatgctta gcaatttctt ctagcagaca gccgtcctct tgtcctagtc 10801 gatctttcac aggtgggttt aggagcatct ggagcacagg agtggcagcc agttgtagcc 10861 ctggcacttg ggtggagtca gcagacactt gccttgacaa tgtccatgaa aacccatctt 10921 tatgctatac agatgccctc ctgtccaaca ctttcctcca agggaccagc tgaagttgta 10981 aaaatccctc tttgacacca ttcctcattg tcctaggcac aggcaactga ggtctccagt 11041 gcagaagggt tggcctaaag gcaaggcagc aatgatgtgg ttttcctcat catttccatc 11101 acccatgggg aagcccagtg ctggtgcctg ctcatgaaaa aaaccacagc aactttagtg 11161 atacccttca ccacctgcac tttccccaaa gtgttaacaa ggaaagattg ctgaacgact 11221 ggtgggaata tctgatcata ttaccctgat ttttctaaag cacttcttcc ttttctgaat 11281 gtctggaagc taactctggc tggtcatctt cacaccacct ggacacacac actaacagct 11341 cctcattcct ttgtaggtct ttggattctg cagagcagat ttgttctatg atgtagaagc 11401 cttggacttg gaaagcccga aaaaccttgt cataaactgt gaagtcttcg accctcaggt 11461 gggttgtcta agcagacttt gtcatggcag tgccagatta agtgacatac gtacacaaat 11521 agtgttgttg cttcctaaag ctctatgagt gggtgtgtgt gtgtgtgtgt gtgagagaga 11581 gagagagaga gagagagaca gagacagaga cagagagagg gagacaggga gagagagaaa 11641 gagagagaca gacagacatg caagaaaaaa gatacagaga gtatcccaca actggggaaa 
11701 ggagtgcaat tgccagatta cacaaaaagt atgaaaactt ttccaagaat tgaagagggg 11761 taacaattgg cagaaaatta aaatcatgtc caattcaacc aaacttgaag gaatttgaaa 11821 atgctctttg tagtcgggac tctagtgctg agacttagtc attcctaagc ttatattatt 11881 gggcatagaa tctaagagga atatatcagc ctcaaattta ttatgtaaat gtatataaaa 11941 taagatgcaa aaccatattt tgcatctgtc tcagaaatct ttggtctatt ttttacatga 12001 gtacaaagat ttgcttcacg aaatttttca tatcagaaac aaaacttaaa ataatccaaa 12061 catccaatcc caagttaaaa aaaagaaaaa ggaacaaagg gcaggattgt caccatgaac 12121 atggctctgt cttttttttt tttttttttt tgagaaggag ttttgctctt attgcccagg 12181 ctggaataca atggtgccat ctcggctcac cacaacctct gcctcctggg ttcaagcgat 12241 tctcctgcct cagcctcctg agtagctagg attacaggca tgcgccacca tacctggcta 12301 attttgcatt tttagtagag aaggggtttc tccatgttgg tcaggttggt ctcaaactcc 12361 tgaccttagg tgatccacct gcctcagcct cccaaagttc tgggattaca gacatgagcc 12421 accgtgcccg gcctctgtag ctatttttta aaaaattaaa ctctgaacaa ggaagagaga 12481 gaagaggtca agtgttcagc ctttaggagt aagggatgtt gataccatca agcctcagtt 12541 aagaatatca gttataaggc actagttttc cagtgttggt ctacagatgg gtgccaggtc 12601 ataatattgt ttcaccactc tgtgtagtaa aaatgaaaaa aaaataagga taatgaagtt 12661 ttttatatct aacatttatc catctaaagg acaatccttt actctgatat tgtgtttctg 12721 gctctttctt ttttttttaa atcaaaattt cattccaggc tgggcatggt ggctcatgcc 12781 tgtaatccca gcactttggg aagctgagat gggtggatta cttgaaacaa ggagttcaac 12841 atggtgaaac ctgtctctac taaaaataca aaaattagga gtctgggcaa cagagtaaga 12901 ctatgtctca aaaaaagaaa aaaaaaaatt aggtgggcat ggtggcttgg gcgtgtagtc 12961 ccagctactt gggaggctga ggcaggagaa ttgcttgaac ctgagaggtg gatgttgcaa 13021 tgagcagaga tcgcaccact gcactccaga ctgggtgaca gagcgagact cagtctcaga 13081 aaaaaaaaat agtttctttt gtgaaacaat gatggtggta aatgatagtt attgttaata 13141 ttttaaatat caactgggct agataacaat ttgagaatct tttttcgtaa tactttttaa 13201 attgatttgt ggaatctatg atctgggagc cattggtgta aatcaagtca agtatctaac 13261 atcttctttt atgttacatg atgataggca cttttctgtg accttttcca ggaacatgag 13321 aacatcaatg gtgtaccgcc tcatttggga catcccttcc 
actggggtgg gcatgagcgt 13381 tcttctacca ccaagcctcc attcaagccc catggatcta gagatcatca tcatccccac 13441 aagccacacg aacatggacc cccacctcct ccagatgaaa gagatcactc acatggaccc 13501 ccacttccac aaggccctcc tccactattg cccatgtcct gctcaagttg tcaacatgcc 13561 acttttggca caaatggggc ccaaagacat tctcataata ataattccag tgacctccat 13621 ccccataagc atcattccca tgaacagcat ccccacggac accatcccca tgcacaccat 13681 cctcatgaac atgataccca tagacagcat ccccatggac accaccccca tggacaccat 13741 cctcatggac accaccccca tggacaccat ccccatggac accatcccca ctgccatgat 13801 ttccaagact atggaccttg tgacccacca ccccataacc aaggtcactg ttgccatggc 13861 cacggcccac cacctgggca cttaagaagg cgaggcccag gtaaaggacc ccgtcccttc 13921 cattgcagac aaattggatc tgtgtaccga ctccctcctc taagaaaagg tgaggtgctg 13981 ccacttcctg aggccaattt tcccagcttc ccattgccgc accacaaaca tcctctaaag 14041 ccagacaatc agccctttcc tcaatcagtc tctgaatcat gtccagggaa gttcaagagt 14101 gggtttccac aagtttccat gttttttaca catacatttc caaaataaaa tgtgattcct 14161 ttgaagagga aaatgaataa tacattgaat tagaaacata aataaaatga ccagtaattg 14221 tgaaaattac agttcttttc aacctacttt catactgaag atgcagcaaa atgtgaatgg 14281 gaaaagagat ggcctgagaa gagagatcaa atggaaagga gaggaaagaa ctcagtgctg 14341 cctattagta gttaattctg tcactcacca ctacatcact tgagacaaat ctatgccact 14401 cagaatctcc ttctttcctg gacttaactc taattctaga gtctctgtta ctgcttgggc 14461 tatacctggg catactaata aagtatggta ttgaaactat tgttatcatt tatttcattt 14521 tattttctaa gttccagggc acatgtgcag ggtgtgcagg tttgttatat aggtaaaagc 14581 atgccatggt gctttgctgc acctatcaac ccatcaccta ggtattaagc ccagcatgct 14641 ctagttcttt ttccgaatgc tctcctgctt cccaccctcc ccctatttat tattattaaa 14701 gtgacctaaa ttctctcctc gtgattatta ctaggcatct aaattttctt ttggaaaaat 14761 aagttctaaa ttcagtattg aagcaaacaa acctctggct ataaattgca aagaagcttg 14821 ctaacatgaa tcatgaccat aaattcaggt gcataagaaa tgtcacttta tgctcagaac 14881 aaattaattt ctatttgagg atgccatcct actggcttag tggaaattta accccagtcc 14941 ttcttggagt cccctttccc tgccctgcta ctttccacac aaagttgggc cacagttcta 15001 ggatctcaca ccatgccaag 
ctgtccaggt ggtaggaagc aggatagtga ctatccctca 15061 gccagggtga ctgttggacc aatgatcatc atcatgagtg atagccatcc tgctcaggga 15121 aagtcacaac cctgttcaca tcactctttc agtctgattt atctgctact tctgtagaaa 15181 actgggggtg caggaatgtc tgtacactag ccatggacat gactgccaag gtatggcata 15241 tggctgctct ctccctggga tcttgctgtc ttcccagctg catgcccaga aagcagggca 15301 gagatctcct ctgcctcaga gaccccttac ctcactcttc tccctatacc ttctccctca 15361 gctaacgact ctcttgctag tctcaggtaa gttcacttta tttattttta gggccagaag 15421 actttttcca cactatactc ctctttcttt ttcctcctca aatcaggact caatcatttg 15481 aaacaaacta aaagaattc // LOCUS AB005990 4706 bp DNA PRI 31-JUL-1997 DEFINITION Homo sapiens DNA for 25-hydroxyvitamin D3 1alpha-hydroxylase, complete cds. ACCESSION AB005990 NID g2516245 KEYWORDS 25-hydroxyvitamin D3 1alpha-hydroxylase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Murayama,A., Kitanaka,S., Takeyama,K. and Kato,S. TITLE Human 25-hydroxyvitamin D3 1alpha-hydroxylase gene JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 4706) AUTHORS Takeyama,K. TITLE Direct Submission JOURNAL Submitted (25-JUL-1997) to the DDBJ/EMBL/GenBank databases. 
Ken-ichi Takeyama, The University of Tokyo, Institute of Molecular and Cellular Biosciences; Yayoi 1-1-1, Bunkyo-ku, Tokyo, Bunkyo-ku, Tokyo 113, Japan (E-mail:ktake@m.u-tokyo.ac.jp, Tel:+81-3-5802-8632, Fax:+81-3-5684-8342) FEATURES Location/Qualifiers source 1..4706 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13.3" CDS join(1..195,845..1035,1543..1745,1831..2031,2116..2288, 2492..2664,2866..2944,3234..3431,3787..3900) /codon_start=1 /product="25-hydroxyvitamin D3 1alpha-hydroxylase" /db_xref="PID:d1023525" /db_xref="PID:g2516246" /translation="MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPS TPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEG PRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNN VVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDT ETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAA MRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELS RHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPD KDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGF GKRSCMGRRLAELELQMALAQILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR " BASE COUNT 1017 a 1353 c 1250 g 1086 t ORIGIN 1 atgacccaga ccctcaagta cgcctccaga gtgttccatc gcgtccgctg ggcgcccgag 61 ttgggcgcct ccctaggcta ccgagagtac cactcagcac gccggagctt ggcagacatc 121 ccaggcccct ctacgcccag ctttctggcc gaacttttct gcaagggggg gctgtcgagg 181 ctacacgagc tgcaggtagg aagggacgcc tttcccgaga cagagtgctg gggaaactgg 241 ttttgacagc gtcagaaagg actgactagt gcagagcaaa tgtgggacag ccagagagaa 301 cggatgccca tgaaataagg aaaaggcgag ttgaggctgg gggcggtgtg gctacactcg 361 ggcagagccc gtcccgactc ttagcagagg gcgctgcgaa agcgccttct cgctgtcctg 421 aggtgtggag atcctgcaga taaagtacaa gtgcgcggga gggggaggcc ggagtgggca 481 gtaccctccg cctgcttgcg gcttaaagct acatgggttc cttttcattc actgaggact 541 cgtcctgaga tggacagtcc agacatggaa cttttagaga tttctcctcg accgagatga 601 tcagagaggt cctgaatgtc tgccttgcac aaagttccgg ttttgcccat cacactcaaa 661 ctttctaggg atgggaaatg agagaggagg agtttgcagt atcaaaggca tacactagtt 721 gggcacccga 
gaaagagcag aagctcccta ttcccaagcc cagtcaacgc ccttggcgtg 781 ggcacaggtc aagctgaaag tccccgccca ggtatccaag tgtccgctgt gtccgctccc 841 ccaggtgcag ggcgccgcgc acttcgggcc ggtgtggcta gccagctttg ggacagtgcg 901 caccgtgtac gtggctgccc ctgcactcgt cgaggagctg ctgcgacagg agggaccccg 961 gcccgagcgc tgcagcttct cgccctggac ggagcaccgc cgctgccgcc agcgggcttg 1021 cggactgctc actgcgtgag tcttctctgc ccccagaagc ccagacgccc tgcagcggcc 1081 ctctcctgct gcgggtcccg aaactatcaa tctgggggca tggtgggagg tcggctgtcc 1141 cactctacca ttgggcatct tggggttccc atcccagtct gcttcaccca cactcaggtt 1201 gagccccaac ctcactcccc gcctacctcc ttatcctttg aggcccgaat cctgggaact 1261 gggggaaccg gaggaattgg gctagggtac aaggatctgg atggaatgaa tgtaaggttc 1321 tgtgactcaa gttacccggg aaccccggcc cctgagggca ggaaatgagt aaagaggaag 1381 ggggctggaa gacaggcgca ggccggtgca ggtttccgta ccccaagggg gcagaggcaa 1441 tccctctctt ggccaccgca gccgaacgcg ctccttcatt gcagccagtc ccgtcgagcc 1501 gtccccacct tcccgatgcg cactctctcc tcaaccctgc agggaaggcg aagaatggca 1561 aaggctccgc agtctcctgg ccccgctcct cctccggcct caagcggccg cccgctacgc 1621 cggaaccctg aacaacgtag tctgcgacct tgtgcggcgt ctgaggcgcc agcggggacg 1681 tggcacgggg ccgcccgccc tggttcggga cgtggcgggg gaattttaca agttcggact 1741 ggaaggtgag tcccaggaca gagctgggca ggcgtcgggg gcgccctacc agagcctccc 1801 ggaaccctga cggcgccccc tcccgacaag gcatcgccgc ggttctgctc ggctcgcgct 1861 tgggctgcct ggaggctcaa gtgccacccg acacggagac cttcatccgc gctgtgggct 1921 cggtgtttgt gtccacgctg ttgaccatgg cgatgcccca ctggctgcgc caccttgtgc 1981 ctgggccctg gggccgcctc tgccgagact gggaccagat gtttgcattt ggtaaggcac 2041 aggtcgaggt ggaaatgggg gaatgtaaag ctgtccaggg gtagcgaggt attcacgtgc 2101 cttctaccca cgcagctcag aggcacgtgg agcggcgaga ggcagaggca gccatgagga 2161 acggaggaca gcccgagaag gacctggagt ctggggcgca cctgacccac ttcctgttcc 2221 gggaagagtt gcctgcccag tccatcctgg gaaatgtgac agagttgcta ttggcgggag 2281 tggacacggt gaggttctcc ctccgtgctg tgagccggtt ccagggctta gcctccgcag 2341 actccggctc catttttctg ttgcagggga tccattatgg ccacgtagac cagcttggct 2401 tagcaccctg tagccccaga 
ctcttccata atctgcaccc tctgctgggt tctcacaccc 2461 aacacctctc ttgctttcac atgtttttca ggtgtccaac acgctctctt gggctctgta 2521 tgagctctcc cggcaccccg aagtccagac agcactccac tcagagatca cagctgccct 2581 gagccctggc tccagtgcct acccctcagc cactgttctg tcccagctgc ccctgctgaa 2641 ggcggtggtc aaggaagtgc taaggtgagg gggaaggaga ggaggaacaa gaggaaatgc 2701 caaggaaggg ctggggaagc aactagtgga tggaagcagg gagatagcag agaaaaatgg 2761 ccctctactc ctggccaaaa agggtttgga agttggaaac aatgagaagg gggctgcagc 2821 ccctagcctc atcttgctgt ctccattttg tgctttgcaa cctagactgt accctgtggt 2881 acctggaaat tctcgtgtcc cagacaaaga cattcatgtg ggtgactata ttatccccaa 2941 aaatgtgagt aaagcctatg caccctttct actgagccat tcctcactat cttgggcccc 3001 aaccctgttc tcaaacactc tcctaaaagc ccttcaaaca ctccccttga cccctctcta 3061 ggatatgcct ttctaggatg taatggaaca gtggggagat ccttccaaaa gtagccccag 3121 aaaacttccc catcacccac caggacctgc ctttgtccct gtctgaatat cctcttcttc 3181 atgcctgccc tattctgagc ccaaactgac aagtttgttt tcctttgcac cagacgctgg 3241 tcactctgtg tcactatgcc acttcaaggg accctgccca gttcccagag ccaaattctt 3301 ttcgtccagc tcgctggctg ggggagggtc ccacccccca cccatttgca tctcttccct 3361 ttggctttgg caagcgcagc tgtatgggga gacgcctggc agagcttgaa ttgcaaatgg 3421 ctttggccca ggtgagtgct ctagatttta taccttcccc agactggaga gaccctaacc 3481 ctctaaagtt gtgagctctt tcccctgaca agcataggaa atcatataag acctggtaga 3541 atgaatcttc tgaaatatga taagcccatt ataggcctgg agtgtaagtg agggtattca 3601 aactattttt tccctaccat aatccctcac ccttattaac caagaagtcc cctactggcc 3661 acaggtgcca cccaatcatt gaccattcta acactaataa tgcatgccct ttaccaattg 3721 gattaccaat gaatcccccc attatagata tctttcatag taatgctcac cttcttccct 3781 ttccagatcc taacacattt tgaggtgcag cctgagccag gtgcggcccc agttagaccc 3841 aagacccgga ctgtcctggt acctgaaagg agcatcaacc tacagttttt ggacagatag 3901 tcccatggaa agagactgtc atcatcaccc tttcattcat catagggata agattttttg 3961 taggcacaag accaaggtat acatcttccc ctaatgccta tctgaccaaa ctggatagaa 4021 ccaccatagt gaagtgtgag gcggccctga ccaatgtgtg aagtatgcac ttggcctgac 4081 tcaggaagcc aggtgagaaa accatggtct 
ctctgcttgc ttggcccttc tgatcatgta 4141 tgcatccccc aaggatgaaa tcagatttta actaataatg ctggatggcc tgaggaaaga 4201 ttcaactgcc tctctttttg ggctttcata gtgttcattg atgctgctgg ctaagcattt 4261 atcaaagcat aagctcagta actgtgcatc tggtctgtac ctggttggtc cttcgtcttt 4321 gcatgtaagc tctttgagag gaagggtgaa gccttatttg ttttttatgt cccctgccag 4381 ggcctgtctc tgactaggtg tcaccataca cattcttaga ttgaatctga accatgtggc 4441 agaagggata agcagcttac ttagtaggct ctgtctaccc ccttccttct ttgtcttgcc 4501 cctaggaagg tgaatctgcc ctagcctggt ttacggtttc ttataactct cctttgctct 4561 ctggccacta ttaagtgggt ttgccccatc acttagttct caggcagaga catctttggg 4621 cctgtccctg cccaggcctc tggcttttta tattgaaaat ttttaaatat tcacaaattt 4681 tagaataaat caaatattcc attctt // LOCUS AB009589 12414 bp DNA PRI 12-DEC-1997 DEFINITION Homo sapiens gene for Osteomodulin, complete cds. ACCESSION AB009589 NID g2696501 KEYWORDS Osteomodulin. SOURCE Homo sapiens DNA, clone_lib:lambda FIX II STRATAGENE. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12414) AUTHORS Ohno,I., Matsubara,K. and Okubo,K. TITLE Human osteomodulin gene: intron-exon junctions and chromosomal localization JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 12414) AUTHORS Ohno,I., Matsubara,K. and Okubo,K. TITLE Direct Submission JOURNAL Submitted (05-DEC-1997) to the DDBJ/EMBL/GenBank databases. 
Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..12414 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /clone_lib="lambda FIX II STRATAGENE" /map="9q21" promoter 1..1815 5'UTR join(1816..1892,8524..8539) exon 1816..1892 /number=1 mRNA join(1816..1892,8524..9479,10624..11846) intron 1893..8523 /number=1 exon 8524..9479 /number=2 gene 8540..10949 /gene="osteomodulin" CDS join(8540..9479,10624..10949) /gene="osteomodulin" /codon_start=1 /product="Osteomodulin" /db_xref="PID:d1024887" /db_xref="PID:g2696502" /translation="MGFLSPIYVIFFFFGVKVHCQYETYQWDEDYDQEPDDDYQTGFP FRQNVDYGVPFHQYTLGCVSECFCPTNFPSSMYCDNRKLKTIPNIPMHIQQLYLQFNE IEAVTANSFINATHLKEINLSHNKIKSQKIDYGVFAKLPNLLQLHLEHNNLEEFPFPL PKSLERLLLGYNEISKLQTNAMDGLVNLTMLDLCYNYLHDSLLKDKIFAKMEKLMQLN LCSNRLESMPPGLPSSLMYLSLENNSISSIPEKYFDKLPKLHTLRMSHNKLQDIPYNI FNLPNIVELSVGHNKLKQAFYIPRNLEHLYLQNNEIEKMNLTVMCPSIDPLHYHHLTY IRVDQNKLKEPISSYIFFCFPHIHTIYYGEQRSTNGQTIQLKTQVFRRFPDDDDESED HDDPDNAHESPEQEGAEGHFDLHYYENQE" intron 9480..10623 /gene="osteomodulin" /number=2 exon 10624..11846 /number=3 3'UTR 10950..11846 polyA_signal 11830..11835 polyA_site 11847 BASE COUNT 4337 a 2171 c 2288 g 3618 t ORIGIN 1 ccaatcagtt taaattttct agacattttt attctgtctg ctctagttat ctactgattt 61 ttctttcaca cttgttaaaa acatgaataa cataatcaat atcataaatc caaatcacag 121 agttaatgag aagttatata aacattaggc tgttgatttt cagttaatct tccaaaagga 181 ctgtgaaggc ctataaaaca attttttgaa aaagtacaca gatttgctct ctcttcctta 241 gtgaaagcct gagttttaag aacaaagaca ctaacagaaa acatatggaa aagatcagta 301 acaagatgga atcttggcta aagtgtggtc tgccacgttt ttcataccaa tgtctcaaat 361 aatagactag atttttacaa ctgccaaagc atcacttgaa acactgtata cttttcttaa 421 tgcaaagttt aacaaaataa atcttgtgag ttctttgtaa aaagctaatg gaaaacactt 481 ttttgaggaa acaaaatgaa ctccaggatt cactttatct tattagtttg taaaagctta 541 aaagaattcc 
cccccaaaat gaactccgaa tctactgcca aaagacacaa gtcaactctc 601 tgtttagaat atgagaacag tgtaaaaatt aaacattaaa aacagttaac tgttactatt 661 caggagagat agctctatga aatgaagaaa ttttctaaat ctacaaaata aaggcaatag 721 ctattgagta acataagctt ttcatcccat tttattcaat actacagaat tattttgtgc 781 aatagtaaaa acaggcaaat tgaagataca cctttggagt gcaaccaact gactaatttt 841 atctgaaagg atattagaag tcattctagt gggtacatct gccaaaaaga ggagaaatga 901 gatagctgat aaaaatgatg ataaaaatga agatgtaagc ctttgtatag aaaagtcata 961 tagttaatct gtgtaagcaa tgacaatcaa gtgcaattta aaaataaaca atgcgaaatg 1021 ttatgtttct aatcctgatt tcttgtttta acaaatagaa gtactgatat gaacctaggt 1081 gttgaataac caagggactc aagaaatttt gagttccagg ccaggcgtgg tggctcatgc 1141 ctgtaatccc agcaatttgg gaggtggagg cgggaggatc acctgaggtc aggagttcaa 1201 gaccagcctg gccaacatgg tgaaacccca tctctactaa aaatacaaga aattagccag 1261 gcatggtggc tcatgcctgt aatcccagct acttgggagg ctgagacagg agaattgctt 1321 gaacccggga ggtggaggtt gtggtgagcc gaggttgcac cactatactc tggcctgggc 1381 agcaaagcaa gactccatct caaaaaatta aaaaagaaat tttgagttcc accttaagct 1441 gctgagactc tggagttccc tgtcacagtg ggccttaatt tgaagaattc tttggcatga 1501 aaatgaaaat aaaaacagta atcagtacat gctgtttcat ttagttttgt ttgcctttca 1561 cattagagga aattcttgcc acacctcaaa ccaaaattaa ggggaaaaga aagaagtaag 1621 ccacataaat ctgaccaaga acacaagcat cttaatttga atccacaaag tttcatgtaa 1681 tgaaaagaaa tacataattt taattcaacc cgagtgtttt ccaagaagat tgtatttgct 1741 taaattgcta cagtaattca agagacagcc ctgtctggac acagagttac tgtggatttt 1801 taagagactc agttaaagaa tttaggaatt tctgattcat ttaaaggatt tacaaattca 1861 tcaacccctg aaaactaaag caaattgaac aggtatgata ataaaataat tttcaatttt 1921 tttaagaatg tattcctact gttaaaatta taaaagtata ttggattttt taatattaaa 1981 atacactgca tctattacaa aagcacaatt tttggaagca aaatctggta aaaaagttat 2041 gttaaacttt tgattaggca aaatgttcaa ataaagtagt ataatgagtt taactttctc 2101 atttaagatt tacccaataa aaaatctttg tttagatttt tttttacttt gaatgaaagg 2161 aaagattctc aggttaaagg aatcaatctc tttctcttcc ttaggtctct ttttttctat 2221 gaattataag taacaaagaa 
ccagctaatg cctttgaggc atacagctga gcaatgctag 2281 tgattccggt tggggggttg atgggcattt gtacaaattt cactcaagat tttctccaaa 2341 tttctctact gactaaatcc ctgatataga atgtttttct tttcatttac ttctttctca 2401 aagggaaaat acagatgctc tatctagtct attgaaggat ctagttactt tgggttttag 2461 ttaattacga tttaaatgta ttatggctaa aaataattga ggtaattgta tttttaatag 2521 caaaaaaatt ttataaaatt aaaatttaga acttttttgg aaaaatctaa attatatgct 2581 aatatttgct ttcttatatg ctagtttcta tgtcatacaa ataaaaacaa ttatgtcttt 2641 aacataataa aatgttgagg attttacttt aaacttttcc tgaacaagta aattatacct 2701 cattagtaat ttcaattgaa acttttcttg atgaacacga gtgcttttct ccaagtaaga 2761 gaaatattat gacacagatg gttttattgc tataatatga aaaattcaca ttgaaaactg 2821 gtaatatata tatattactt tgctccaaac attctttttc acatttgttc ttttcctgct 2881 acacgatgta ctaggaaagt gaggaataca gtatattgtg atgaagaaat attcaataga 2941 aaaaggaaca gtcaagctct catgaatatg ttcatttgaa gaaattatca tagtgaaata 3001 aatgcctagg aataatcaac attagtagag agaagcattt gctagcaagc cttaacctat 3061 aacttattaa cttaatcatt cattcaacac acctttgagc acttataaca ctgcattgta 3121 ttagattctg ttaaaagata ttaggaaatg agatgatcaa caatgaatta tacaatattg 3181 ctgtgagtta tcctatagag aacaaggcat acattgtaga gtatgaaaag cagaagtaac 3241 aaaaatatgc ttagcaaatg actcagaaga gattagttgg ggaatgaaaa atttggataa 3301 aataagagat tgttgtaaca ggggaaatta tcctggggaa agagcaagag gcatagaaaa 3361 taatgagcat atagatgaaa acaggatgga tttcaaaaca gaaaactgca cttaatcaaa 3421 gggagtggga caagattgta ctacatggtt aggaacgtgg agggaggagg acaaggatgt 3481 ttaagataag ttcaggttta gaagagaaat tgttcctcaa ggaaagactt tcttcttagg 3541 tgctcacaaa ggcggaaagc agacaatatt aaaaacctct ttattttcag agacgcccca 3601 gtcatgtctt gattcaaggc cttcttcttt tttcccaaca atttttcata gttgttgaac 3661 aacagactct aattctcaaa tgtaaggttg ataactgatg attagctaat gttctcaaca 3721 aagggataaa attcaagctg agttttcagc caggcaggtg gctcacacct gtaatctcag 3781 cactttggga ggctgaggtg ggcggatcac ctgaggtcag gagttcgaga ccaggctggc 3841 caacatggca aaatcccgtc ttaactaaaa atacaaaaaa atgtagccag tcgtggtggt 3901 gcacgcctgt aatcccagct actcaggagg 
ctgaggcaca agaatcactt gagcctggga 3961 ggcggaggtt gcagtgagcc gagattgtgc cactgcactc cagcctgggt gacagcacga 4021 gactcttgtc tccaaaaaaa aaaaagttaa tatgttatac tataataagt acaataaaat 4081 attataaaat tatacataaa atatcatcaa tgagtataaa atgtatagaa tattctatat 4141 ccattcctat taatctccca aagtctgggt caaatgatgt ctacaaaaga ctttcctgtc 4201 tccgtaaggc tgaggtaagc cactggcccc acagcattgc ctatctctgc tataaaggat 4261 attaccctct gccttagatt agggttcact ttgttgtatc tctctcctcc actaagctgt 4321 ttgctcttta cagaagggac tatggcttca cgtcttcttc atttctgtat tctccaccaa 4381 cttagcatag tgcctcatca ctagaaataa ttcaggacaa cggtaaactg atgtaaccca 4441 atcaaactaa ctactaatat ttgtaaagtg cttcactaag catgtttaca tacagtaaag 4501 tagtagagta cactaggtga ataagcattt ccaagcagac ccactgggag cttccagccc 4561 agcaaggaga gcctgctctg aacactgccc tttggtatga taccataact ccttaaatct 4621 acctccttcc aagattctct actttttcac ccacgaacac tgactgtagt gtgaaatcac 4681 agcccaaata tgttaattca ttaaattagg cattcattcc cagagtgcaa attcctcagg 4741 tcagaattta ggccctccaa ggccgggggt ggtggctcat gcctgtaatc ccagcacttt 4801 gggaggctga ggggggcaga ctggttgagc ccagaagttc atgaccagcc tgagcaacat 4861 ggcaaaaccc tgtctttaca aaaaattaaa aaattagcca ggtgtggtgg cacgcatctg 4921 taatcccagc tactcaggag gctgaagtaa gagagtcact tgagcccagg agatggaggg 4981 tgagccaaga gtgcaccact gcgctccagc ctgggtgaca gagtgagact gtctcagaaa 5041 aaaaaagtta ggctaatccc ttcagatctg gatttaatag tcttattttg ctccttcaag 5101 agttaaccaa ccacatttcc caaacacatc accatctgtt gcttctatgt ttttgctcat 5161 ctcttctagc cagagatctt cttcctcact tccctctatg agaacctcaa cactccttca 5221 aagtctaggt caaatgccac atcttccacg aatctcacag aaaagaaaga aaaatggatg 5281 taatatagag ttgcacaaag aaacatcatg aatttcaaaa gcacatattg ttaaaaaaaa 5341 aacaaaaaca aaacccaaac cattgctatt ttggcatttt cttgtaggta gggatacata 5401 taagaaaata gaaataaatg atatcttgat cctattttct tgacttaaaa aaaagaagca 5461 aggtcatcac ccaaaaacaa taatggggaa taaggtatta gaggcttcag agaggaaagt 5521 tacaaaacag tagcgtagaa gagagggaag gaggatggtc taaggaaatg cagcaggaca 5581 gcacttgagg ttagtggtct tgaatcgaga gtgagaccag 
tcaccatgct cctgtttttc 5641 tttagttaca ttcagctgtg acagtcaggc tgtgagtagg tagagaattg gagtaaccag 5701 agttggagtt tggcagattg aaggagccag aaagtttagg gcatgtgcaa aagagcaacg 5761 atagtgatca actatggggt ctaagttggt aagaaggaac tgacaatgtt gggaggtatg 5821 gatgagggac agtgaaaaac agtagaatca atatagctga ggttccacgg aaattggaga 5881 attattgcag ctggagttac tacatagaga tagtggtctg agtgcggaat actcaaaatg 5941 aagagtatga agggttccag taatagaaaa gacaaaaccc aggatgcggg tgcatgggag 6001 caaatggtga ggaagaatga aggaaagtat cactgcagga agagagctca agggtctctg 6061 actctacctg aataactact tcaaattcca aatacacaca atagttacat ttataatttc 6121 tgacaagaaa agaagaagaa attgagatgt ttcttaacct ttctcaaatg gggttctgtc 6181 aaagaaaata acttaagtaa ctattctctc aattatacca atgatgaaaa atagctagaa 6241 ccattctaga tgaacaggag ataagtcatt tcattatatg caatgatttg tttggaaact 6301 cattctctca tgaaaccctg ggtgggaaaa gtagaaaggt ctcagaataa aagaggtagg 6361 attttttaaa acctttttcc taaatgatac attcatttca tctggatagc agttttcaac 6421 gtgcaaagta cttgtacatg catagctcat acaatgctaa caacgaccag gcggataagc 6481 aggcaagata tgacacaagg aggctggatg cagtggctca cgcccgtaat cccagcactt 6541 tgggagccca aggtgggtgg atcatgaggt caggagttcg agagcagcct ggtcaacatg 6601 gtgaaactcc atctctacta aaaatacaaa aattagccgg gtgtggtgga gtgcacctgt 6661 aatcccagct actggggagg ctgaggcagg agaattgctt aaacccagga ggtggtggct 6721 gcagtgagcc gagatcgcac cactgcactc cagcctgggc aacagagcaa gactccatct 6781 caaaaaaaaa aaaaaaaaaa gacacaaggc aacatacaag ataagcagcc aaggtataat 6841 tcagctataa ttgcaattct tattttattg atgaggcaag aggcagatat gttaagtgac 6901 agaaatgaaa tataggtttc aggcttctca tctgggacat aaatgaccct tccctagaac 6961 ttcaagtttc agcactgagt gttcattgtg aagtgtgccc ttttgcaatt cttttccaca 7021 caatatcctc ctcctgcatt ctactgcacg tgtatttgat tctccacaag actttaaaca 7081 cctgggagac aaggacagtg tgtttttaat aatatttttg tatctaaaac ttaaatatat 7141 taatggaaaa aaggaagcta ggaagaaaaa tgtaaagaca cttgaacttt gtagaatgta 7201 aacactgtat gtctttagga tttaaagtag ttattcaagc agagtcggag actaaataat 7261 tggattctgg aaactgtttc atgaggaaaa tttgatggtg ctgttagttt 
tggacagaaa 7321 atattaaaag gaaacatgaa tactatttct aaatatctga agttatcaca caaaagggag 7381 catttcacgt tcttctgaag ggcagaaaaa ggtatcagca aatagaagtt gtagaatctc 7441 ccaggcaaaa gctctctaac taaaataaag gtctttcctc ccccaggcag tgaccatcat 7501 taaaggatca ggtaattcca aagaacacac agcatttaca gtggagggat cagactgtag 7561 ttgactgatg tcatgctatt atcatagatc tataccctag taagcaagag aaactgggac 7621 tcttaaaaga tggttactta gtagaaagtc tagtatacca gacttttgac tcaagttcag 7681 tgtaatatgt gacttaaaaa aaaaaaaatc tcggccaggc gcagtggctc acgcctgtaa 7741 tcccagcact ttgagaggct gaggtgggtg gatcacctga ggtcaggagt tagagaccag 7801 cctggccaag atggagaaac cctgtctcta ctaacaatac aaaaattagc caagcgtggt 7861 ggcacgcgcc tgtagtccca gatacttggg aggctgaggc aggagaatca cttgaaccgg 7921 ggaggtggag gctgcagtga gccaagcctg caccactaca ctccaccctg ggtgacaagg 7981 caagactcca tctcaaaaat aaaaataaaa agaaaaagaa aaaagaaaaa aaactcactt 8041 catttgtact taaacattaa ttttggtttg aggccaggcg tggtggctca cacctgtaat 8101 cccagcactt tgggaggccg aggccagagg atcccttgag gtcaggagtt tgagaccagc 8161 ttgggcaaca caacgaaacc ttgtctctac taaaaaatac aaaaataagc tgggcatagt 8221 ggcatgcacc tgtagtccca gctacttggg aggctgaggc atgagaattg cttgaacctg 8281 ggagcctggg aggcagaagt tgcagtgagc cgagattgtg ccactgcact ccagcctggg 8341 tgacagagga gacactgtca caaaaaaaga aaaaaatttt ggtttgagca ctgcccattc 8401 tttctggata ttcttttata tatagacata ggtataatac atttcatata actgagaatt 8461 tataccaagc gtattctgag tttactcact ttctccacaa tgtttccttt tttccttgaa 8521 caggaaaaaa aaaaagaaga tgggtttttt aagtccaata tatgttattt tcttcttttt 8581 tggagtcaaa gtacattgcc aatatgaaac ttatcagtgg gatgaagact atgaccaaga 8641 gccagatgat gattaccaaa caggattccc atttcgtcaa aatgtagact acggagttcc 8701 ttttcatcag tatactttag gctgtgtcag tgaatgcttc tgtccaacta actttccatc 8761 atcaatgtac tgtgataatc gcaaactcaa gactatccca aatattccga tgcacattca 8821 gcaactctac cttcagttca atgaaattga ggctgtgact gcaaattcat tcatcaatgc 8881 aactcatctt aaagaaatta acctcagcca caacaaaatt aaatctcaaa agattgatta 8941 tggtgtgttt gctaagcttc caaatctact acaacttcat ctagagcata ataatttaga 
9001 agaatttcca tttcctcttc ctaaatctct ggaaagactc cttcttggtt acaatgaaat 9061 ctccaaactg cagacaaatg ctatggatgg gctagtaaac ttgaccatgc ttgatctctg 9121 ttataattat cttcatgatt ctctgctaaa agacaaaatc tttgccaaaa tggaaaaact 9181 aatgcagctc aacctctgca gtaacagatt agaatcaatg cctcctggtt tgccttcttc 9241 acttatgtat ctgtctttag aaaataattc aatttcttct atacccgaaa aatacttcga 9301 caaacttcca aaacttcata ctctaagaat gtcacacaac aaactacaag acatcccata 9361 taatattttt aatcttccca acattgtaga actcagtgtt ggacacaaca aattgaagca 9421 agcattctat attccaagaa atttggaaca cctataccta caaaataatg aaatagaaag 9481 tatgtagact tttattgtgt tttgaagaga gaaagaatgg aatcctgatc ttttaaagaa 9541 aaaaatattc aataagctat gctttagaca ttgctgaaaa tagttgtttt gaagttgctt 9601 aagggaatgg cagaaatctg ccccagagaa atgtgtgcta ttcagtttat agtccttttt 9661 attaaatttt attgccttaa atattgagaa ataggctggg cgtggtggct catgcctgta 9721 atcagagcac tttgggaggc taaggcgggt ggggtggatc acttgaggcc aggagttcga 9781 gatcagcctg gtcaatatag tgaagtccca tctctaccaa aaatagaaaa attagccagg 9841 catggtggca tgtgcctgta ctcccagcta cttaagaggc cgaggcatga gagttgcttg 9901 aacccaggaa tcggaggttg cagtgagctg agatcatgcc actgctttct ggcctggatg 9961 acagagcaaa actctgtctt aaaaaaaaat aaaataaaat aaataaataa ataaataaat 10021 aaataaatat atatatatat acacacatat ataatatatt tatttatata taaaaaatat 10081 atattctctt ttatatatat aactgataat ttatacatat atatataact gacaatttat 10141 acagagcgta ttctgagttt aatcactttc tccacaatgt ttccttttct ccttgcaacc 10201 gggaaaaaaa aaagaaaaaa agaagaaata tataatttta tatatatata ttttctttat 10261 atatatgtat aaattgtcag ttttatatat gtatatatat aaaagagaga aataatagag 10321 tacatgaaga aatatgagta tcctaagtag tgggattgta gataattttt ttttgagatt 10381 gttctactag gaatttatta atcagaaaag ccaatgaagc catatatatt ttgttaatat 10441 aaaactcata atttaatttt ctttttatta tttgattaat ataaaaagga aacctaattc 10501 aatttataaa tattttaatt aacagaatta tatcccatac aatataaaat ctaacataga 10561 attatggcta taactatgga ttatactata atacaagatt aatttttaat ttatttcttt 10621 cagagatgaa tcttacagtg atgtgtcctt ctattgaccc actacattac caccatttaa 
10681 catacattcg tgtggaccaa aataaactaa aagaaccaat aagctcatac atcttcttct 10741 gcttccctca tatacacact atttattatg gtgaacaacg aagcactaat ggtcaaacaa 10801 tacaactaaa gacacaagtt ttcaggagat ttccagatga tgatgatgaa agtgaagatc 10861 acgatgatcc tgacaatgct catgagagcc cagaacaaga aggagcagaa gggcactttg 10921 accttcatta ttatgaaaat caagaatagc aagaaactat ataggtatac acttacgact 10981 tcacaaaacc tatacttaat atagtaaatc taagtaaaca tgtattactc aaagtaatat 11041 atttagaatt atgtattagt ataagatcag aattgaattt aagttgttgg tgacatctgc 11101 atcatttcat aggattagaa cttactcaaa ataatgtaaa tctttaaaaa tataaattag 11161 aatgacaagt gggaatcata aattaaacgt taatggtttc ttatgctctt tttaaatata 11221 gaaatatcat gttaaagaaa gtgagtgtat catttctatt aacagtaatt tttctaaaaa 11281 tgaggaagga agtagcatta gcagtaaaga cccacaggcc acgaccctcc tgatcactct 11341 gaggacaata tttaaatgac aggaaggtat attaatgtaa caagcattca tttaaggaat 11401 agaccatttt ctctgacctc ttcttcagga aagctttcac actggtattt gtgatctcac 11461 cattatgaca tccatccccc tagctcacca catagcacat agaatgatat ttttgatttg 11521 taagaggcca tccaggtact aaggacccaa ggcatacaga ttcacaaaat taactaatct 11581 ttttgcctca gaataccaaa acaacaaaat tataaagctg ttatttggac aactaaaaaa 11641 caccaaacta tctcattgca atttgtattt tagcagattt cagcaactat cctaaacaat 11701 gttattgtgt tccattttaa ctgggataaa tgtttttgta aaaatacaac catagaaagg 11761 cctctttgtt acaaaatgat ttgcaaagaa ataactgctt tgtttgcaag attaaattag 11821 tgttggcaaa ataaagttct aaagataata ctaaacactt tatgcttatc agccaatatc 11881 tacataagct aaaattgaat acagttcctt tataatttga atgacaggtt ttaacatctt 11941 ctttaaaacc ttaaaaatca ctttttcgtt aatgtattat ttttaagtat tgtaagtgtg 12001 ccttacaatg atgattatct aactgctgtg atgaactctc ttgatatacc atgctaatgt 12061 ttcatagaat ttctcagata agggacaaat accaatctag ggctgcctat acaaatgata 12121 acaattctga caatatgcaa cagacagaca agtcaaacta tacttcagaa atagttttca 12181 agcttgcaag tctttttttc tggaaatacc agtttaaagt gaagtcttta atgctcttta 12241 catctttgag gaattattag cattatcaga accagaacgc ccacctgttc ccagggacaa 12301 cctcaagaat ccaggatgaa tatgaagaaa actccattca 
ttcgtctaat caacattcat 12361 caagtccttt cttatatagg gcttggcaat aagggatcga gtcgacgccc tata // LOCUS AC002132 39336 bp DNA PRI 29-MAY-1997 DEFINITION Human DNA from chromosome 19 cosmid F24108 containing MAG, genomic sequence, complete sequence. ACCESSION AC002132 NID g2160193 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 39336) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Kyle,A., Ramirez,M., Stilwagen,S.A., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1 Mb region in 19q13.1 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 39336) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Kyle,A., Ramirez,M., Stilwagen,S.A., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) Human Genome Center, Biology and Biotechnology Research program, Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..39336 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1 between D19S208 and CAPNS" /map="oriented centromere to telomere" /map="overlaps with CH19R30879 to the left and CH19F16632 to the right" /clone="F24108" /cell_line="UV5HL9-5B" /clone_lib="LL19NC02 F2 chromosome 19-specific cosmid library" /cell_type="fibroblast" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid UV5HL9-5B, which carries chromosome 19 as its only human chromosome." 
repeat_region 14..386 /rpt_family="ALU" repeat_region 827..1081 /rpt_family="ALU" repeat_region complement(1320..1587) /rpt_family="ALU" repeat_region complement(3975..4184) /rpt_family="ALU" repeat_region 5730..5853 /rpt_family="MIR" repeat_region 5853..6220 /rpt_family="ALU" repeat_region 6259..6484 /rpt_family="ALU" mRNA join(6653..6718,6877..7245,10818..11114,11411..11668, 13712..13972,21138..21425,21811..21907,23182..23281, 24554..25068) /gene="MAG" /note="BLASTN similarity to human ESTs: R17484, F11360, F09020, R42831, R45975, H23809, R89057, F04235, H23810." /product="MAG" gene 6653..25068 /note="myelin-associated glycoprotein" /gene="MAG" /map="19q13.1" CDS join(6673..6718,6877..7245,10818..11114,11411..11668, 13712..13972,21138..21425,21811..21907,23182..23281, 24554..24718) /gene="MAG" /note="myelin-associated glycoprotein precursor" /codon_start=1 /product="MAG" /db_xref="PID:g2160194" /translation="MIFLTALPLFWIMISASRGGHWGAWMPSSISAFEGTCVSIPCRF DFPDELRPAVVHGVWYFNSPYPKNYPPVVFKSRTQVVHESFQGRSRLLGDLGLRNCTL LLSNVSPELGGKYYFRGDLGGYNQYTFSEHSVLDIVNTPNIVVPPEVVAGTEVEVSCM VPDNCPELRPELSWLGHEGLGEPAVLGRLREDEGTWVQVSLLHFVPTREANGHRLGCQ ASFPNTTLQFEGYASMDVKYPPVIVEMNSSVEAIEGSHVSLLCGADSNPPPLLTWMRD GTVLREAVAESLLLELEEVTPAEDGVYACLAENAYGQDNRTVGLSVMYAPWKPTVNGT MVAVEGETVSILCSTQSNPDPILTIFKEKQILSTVIYESELQLELPAVSPEDDGEYWC VAENQYGQRATAFNLSVEFAPVLLLESHCAAARDTVQCLCVVKSNPEPSVAFELPSRN VTVNESEREFVYSERSGLVLTSILTLRGQAQAPPRVICTARNLYGAKSLELPFQGAHR LMWAKIGPVGAVVAFAILIAIVCYITQTRRKKNVTESPSFSAGDNPPVLFSSDFRISG APEKYESERRLGSERRLLGLRGEPPELDLSYSHSDLGKRPTKDSYTLTEELAEYAEIR VK" repeat_region 7563..7681 /rpt_family="MIR" repeat_region 8011..10351 /rpt_family="VNTR" repeat_region complement(12004..12424) /rpt_family="ALU" repeat_region complement(12696..13119) /rpt_family="ALU" repeat_region complement(13410..13478) /rpt_family="MIR" repeat_region complement(14181..14481) /rpt_family="ALU" repeat_region complement(14597..14786) /rpt_family="MIR" repeat_region complement(15813..16094) /rpt_family="ALU" 
repeat_region 16941..17205 /rpt_family="ALU" repeat_region complement(17706..18001) /rpt_family="ALU" repeat_region 18794..18953 /rpt_family="ALU" repeat_region 19739..19847 /rpt_family="ALU" repeat_region 20080..20252 /rpt_family="ALU" repeat_region complement(20701..20843) /rpt_family="ALU" repeat_region 22302..22702 /rpt_family="ALU" repeat_region 24031..24280 /rpt_family="ALU" repeat_region complement(25836..25999) /rpt_family="ALU" repeat_region 26167..26344 /rpt_family="MER5" repeat_region complement(26167..26344) /rpt_family="MER5" repeat_region complement(26601..26881) /rpt_family="ALU" repeat_region 27235..27712 /rpt_family="MER1" repeat_region 28366..28646 /rpt_family="ALU" repeat_region complement(29327..29903) /rpt_family="ALU" repeat_region 30774..31305 /rpt_family="ALU" repeat_region 32468..32716 /rpt_family="ALU" repeat_region 33080..33362 /rpt_family="ALU" repeat_region complement(33585..33713) /rpt_family="ALU" repeat_region 33731..33853 /rpt_family="LTR10" repeat_region complement(34305..34513) /rpt_family="ALU" repeat_region complement(35009..35266) /rpt_family="ALU" repeat_region 35752..36030 /rpt_family="ALU" repeat_region complement(36032..36192) /rpt_family="L1" repeat_region complement(36308..36379) /note="BLASTX similarity to (1139..1162); match: 0.45, score: 8.5e-14; database searched: nr; hypothetical protein (L1H 3' region) - human" /rpt_family="L1" repeat_region complement(36432..36587) /note="BLASTX similarity to (1066..1117);match: 0.4, score: 8.5e-14; database searched: nr; hypothetical protein (L1H 3' region) - human" /rpt_family="L1" repeat_region complement(36896..37047) /rpt_family="ALU" repeat_region complement(38803..39045) /rpt_family="ALU" misc_feature complement(39066..39239) /standard_name="Xgrail prediction" /note="Xgrail 1.3c prediction, quality= excellent, score= 79." 
BASE COUNT 9654 a 10413 c 9983 g 9286 t ORIGIN 1 gatcacttga agccaggagt tcgagaccag cctggccaac atggtgaaag cccgtctcta 61 ctaaaaatac aaaaattagc tgggtgtggt ggtgcacacg tgtagtccca gctactcggg 121 aggttgaggc acgagaattg cttgaacctg ggaggcagaa gctgaggtga gctgagacct 181 tgccactgca ctctagcctg ggtgacagaa tgagactctg tctcaaaaaa aattaaataa 241 ataaaaataa aatgcagatg aggctgggcg tggtggagca tgcctataat cccagtggtt 301 tacgaggctg agacgggagg attacttgag gccaggaatt caaggctgca gtgacctatg 361 atggcaccac tgcactgcag cctgggtgac acagtgagac gatgtctgta aaacaaataa 421 acaaataaac tgcagatagc tgggcctcca gaatttctga ttccacaggt gcaaggcggg 481 gcccttgacc ttgcattttg agcaagctgc agtcctgtgg aggactggtt gtgaagctga 541 caccattgtc tgagggcagc aaaggctctt ggatggattg agcctcagct cccatctatg 601 agagctttaa gctttaagtg agagctcccc atggccgaga gcttcattct gaattcctgc 661 tgctaggaat tcaaaactca cattgaagtc ctaacgccta gtagctcaga atgtgacctt 721 agttggaggc agggtcttca cagaggtaat caagttaaaa tgagacggga gaatcctaat 781 ccaatataac tggtttccta aatctgacag ggtacagtga ctcacacctg taatcccagc 841 agtttgggag gccaacacag gaggactgct tgagatcagg aattcaagac cagcctgggc 901 aatatagtga gaccctgtct ctagcaaaaa aaaaaaaaaa aaaaaaagtg ccagtagtcc 961 cagttactat tgagaggctg aagcaagaga gtcgcttgaa tctggagggt ggaggtggca 1021 gtaaggtatg attgcaccac tgcattccag cctaaatgac aaagcaatac tatgtctcaa 1081 acataaataa ataaataaat aaataaataa ataaataaat aaacggcaaa tctgggcagg 1141 ggtgaataca gggggaaggg aatgtgaaga tgaagatgcc atacagagaa aggcccggaa 1201 cagatcctca cacccctcag aaggaaccaa cactgccagc acttgatctt gttgatcttg 1261 gaattcaagc tcaattctgg cagggttttt tgtttgtttg tttgtttgtt tgtttgagtc 1321 ggagtcttgc tctgttgccc agcctggagt gcagtggcgc aatctccgct cactgccagc 1381 cccgcctccc gggttcacgc cattctcctg cctcagcctc ctgagtagct gggactacag 1441 gcgcctgcca ccatgcccgg ctaatttttt gtatttttag tagagacagg gtttcaccgt 1501 gttagccatg atggtctcga tctcctgacc tcgtgatctg cccgccttgg cctcccaaag 1561 cactgggagt acaggtgtga gccaccgtgc atggcctcaa ttccggtttt taagccaccc 1621 agtttgtgaa actttgttgt aacagccctg gaaaactgat 
acacctgctc tctcgtaaaa 1681 gatcagctga tacggcagca ccggccactg gatctgcggg tcagttgtcc ccttagacag 1741 ggcacctgcc cgccagtcac aggtcctgtc acccagcccc ttcctctcat gggctccagc 1801 cgcctcgctc ccacaggtct cctggagacc cgagttgctg atgtcagcga ctattgcgag 1861 tgctcctgtg ggacactgtc ctctccccag caacagcctt cagttcacac ccagggagga 1921 gatggagccc cacactgcac cagcacacga ctccagctcc aactaggtgg gtgttgatgc 1981 tgaaggctgt gacatgcaaa taccccccag gatctccttg tcaccccgac caccaccacc 2041 aaagcccagg caccaggccc ctctcttctc caggacccag gcattcaagc cccgcccgac 2101 tctcaatggt cctttgtccc ggactgggaa aggcacgggc agagtctgac atcatgaagc 2161 ctggggtggg ggtgggactc cccctccccc tccaactccc ccccagcccc acaacgcttc 2221 tctccctggt ctgagggatt tccccgcccc tcaccccctg ggttctccct tgagagcgag 2281 gaccccccct ccccactgct ctggcctgaa ggaggacagg agatgtccca gtgaggctga 2341 acagctcatg gtgaaatcac tgctgcactt tgtcacgcca cctccagctg ccccgagccc 2401 ccacccacct tgcaaaagtc atgaccaagg ggtggaatct gcactcagag ggtccttgcc 2461 caacttggtc agcacagtcc tatgtgcacc aagcactgcc ccggggacag aagggtgcac 2521 atgacagaca gagaggcaga taggtttctg gtgagcccct gcctctgaga gctcctgggt 2581 tagtgggagg cagaggcggg cagacctagg gtgcacacgg gccagcctgc caaggctaca 2641 gaaggaaagg acagcattgc aatggagggg cctggctgat gggcgaagat gggaaagctt 2701 cccagggaag cggcacccag gctgctgagc ctacaggatg ggtgaacggc agccagtgcc 2761 agagtgggga gtgctatggg gtggggactg ctttccagga agaatgagca gcaggtgcac 2821 aggccttgag tctgaaggtg ccctggggct ctggggaaca cattcaacca cccacactgt 2881 ttattaagct caactgcctg ccaggagctt gtctacgagc aaggatggat tgagaatatc 2941 caagacgaag cttctggcct ggtggaggtc acgttctcat gaaaggagtt ggtgaacacg 3001 gaaggtcatt tcagatggca agaggaggtg acactgacgt tgagggtgga aaggacctgt 3061 ctcctcctgc tgcccaccta atgttgccac caagattccc tgaggacaat gcctctgtcc 3121 ctcagcctgg gacttccaaa gagattagac tgccccccca cccccgggac ccagaatcag 3181 ggtgtacagg gatggaagat ggggagagag caaggcagcc actcccgacc cctccaggaa 3241 agagctagga gaacccaggc gtcaggctgc ccagtccgag ccccgtggtg acaagggccc 3301 ctttgtgccc ccctcccccg ggggtaggga ggagctgggc cctggaggca 
ggcggcccct 3361 ggcacccagg gggaggggag gggctggcaa gtgggggcct agaccctgga aggcagggga 3421 ctgcgagctg ggctggcgga gcagaggtgc agaagcaact gagtccaagt gagtatggtg 3481 gcagggaggc cagggcaagg ggagcagggc accggggcag tggcacccag acgtgagcag 3541 ggtcaggtgc tctgggtgag ggagcatcgc tgtgcaccgc tactgattta ggatgcagga 3601 gtgtgtggct tggggctgct ttggggatgg gctcggacca tcgctggggc agcgatgggg 3661 ctctgggaag aaaccacaga gagaaaaaac aaagggcccc ccctcagcac cccttgtggg 3721 tgaagatttt gcaggagagg tttccttggg gttcccgagt gcggggcaga tggggtggag 3781 gaggtctgca tctggccgaa gaatggaagg atgtccaggt gatcagacac ctgcaggctg 3841 caatctggga ggtggatggg ggcagtgggc tgttagtgac tgcattctag cacctgggca 3901 catagcaggt gtgtgtgtgt gtgtgtgtgc gtgtgcgtgc acacacatgg ctgcccatgc 3961 accttcctct gtaagcctca gtcgactttt cttcattaat ccccactaag gacacttttt 4021 atttttcgtt actttttttt ttttaagaga cgggactctc actatattgc ccaggctggt 4081 ctcaaacttc tgggcacaag agattctccc aacttggcct tccaaagtgc tgggtttaca 4141 ggcgtgagcc actgcgcccg gccccaacta cagaaacttt ttagatgcat ttgtcctcat 4201 cccatccttc ccttggcatt ttaatccaac agatctgctg tacatatgtt tatggactgt 4261 ggccacacac actgtaatag ctgaggttga tttggccccc atcccgagaa cttgttttcc 4321 ccgctgttaa gagtgcatgg tctacctgct tgtttcatgt ctgcccgggc acgtgctcat 4381 cctcatctgc atctctctcc tgtaccttcc tgtcacaggt gcaagggcag cgtcggccag 4441 gtgtccatcg gcgcgtgtct gagtggacgc gtctatagcc atcctcagtg tgtgtgtgca 4501 gttccttgca tctttgttag cctggcatgc atgctgggct gtgtgtgtgt gtgtgtgtgt 4561 gtgctgaatg ccagggctgt gtctgtccag gtgtgtgtgg acagcagcag aggcattgca 4621 tgcacgcctt gggaactatg tgagtgccag cccatccctg tgggcacagg ggtccctgtg 4681 cctgtctgtg tgttcagggt cctgtctggg tgtctgtatg agtctccgct gcgtgtgcct 4741 gccccttctg gctggggcta catctgtcac atttcatccc cgcgattgga tgtgtctgtg 4801 catgtgtctc tgtccgtggg aagagggcgg tgaaggctgg gaaaagcaat tgcttggaaa 4861 ccctggctgc aaccccaagg caacccatgg agcagcaggg tgccaggggc tcgctagccc 4921 ccaccctggg ccttcctggc cctgctcaaa ggccagccag gagtgcctga cccctttctt 4981 actccccact taccctgaaa gccccccgca ggctaaaaca ccctctcccc gagggaacag 
5041 gactcttttg taccggggac cagcttctgg tgagggtgcg gacaccaggg gccgtctgtg 5101 gggtgggaga gggcggcagg aattcacgcg gcatgtcggg aatgctgata ctgtgaaacc 5161 cagtcattga tggctctggg tccctgcccg ccacccccag gccccggccg tgggacatct 5221 gctccctcac tccactcgcc acacccctca gtctcacccc cgtaccccga tgcggtcgtt 5281 cccccttccc tccccacagt tctctgacac cccagcttgg agggagttgt tgccagagag 5341 tggcttcgag gctgtgtggg agggtcccag gcacaatggg agggccacat gggcagcctg 5401 acaacaggtc acaacccatg caggagatac tgaagcgggg gtgggttggg tgcctcaatc 5461 ccgccccctt gcacctgagt gactcttgtt gcatgcaggt tgtctggcgg cttcaggtgg 5521 acccagaaga cgtccccaac tcagggagat tcaggtgagg ggcagggtac caactctctt 5581 cccttgcttc agctttgcaa tgctggccct gccttggctc tctggttctt actttctgga 5641 gatctccagg ggcatcacgg tgacaaagat aatgatcagc acatcatcac tgcatagaga 5701 caggacatgc agcaaagact gcagaccaac tgtgtggcct tgggcgagtc acttaacctc 5761 tctggacctc agctataaaa tgaggataat gcatcttcct caaagagatc caatgagtta 5821 agacagcaga ataagcagtg agggctggca catggccaat gccatagaag ggtttataat 5881 tataagtaat agcagcaggc tgggtgaggt ggctcatgcc tgtaatccca acactttggg 5941 aggctgaggc aggaggatcg cttgagccaa ggagttcaag accagtctgg gcaacacaga 6001 acatggcaag actcccatct ctaaaaaaaa taataataat aaccaaataa ataaactaaa 6061 ataaaattag ccaggcatgg tggtgtgtac ctgaggtccc agttactctg aaggatcgct 6121 taagcctggc agattgaagc tgcagttgag tgagccacga ttgcaccact gcattccagg 6181 ctgggtgaca gggctgagac cccatctcaa aaagtaaaaa gttaaaaaat atcagcagac 6241 caggcatggt gatttacacc tgtaatcctt tgggaagcca aggcaggaat attgctggag 6301 gccaggagtt gaagaccagc ctgggcaaca tagtaagacc ccacctccac aaccatttcc 6361 acaaaaaaga agaaaaatta actgggtgtg gggtggcgca taccagaagc tgagacaaga 6421 ggataacacg agctcaggat tttgaggcta tagtgagctg ttattgtgtc actgcattcc 6481 agcccgggca atgaaacaag accctgtctc agaaaaataa gtaaataaat gcataaataa 6541 ataataatag cagcagcagc taacatatga atgggcccct tcctgaagcc cagatggaac 6601 accccctttc acttccccca gcctttaacc ctctcctctc cctttccagc gatcactcac 6661 tcgctgtaca gaatgatatt cctcacggca ctgcctctgt tctggattat gatttcaggt 6721 
aacggctgac aggtgctggg gacctaaagg ctttggcccc tgagcaggtt ggaggtgggt 6781 ccccgcagcc ccccggcatg tggttgggga tgggagccgg agggggtgat cgggtaggac 6841 gtgtccctga gcctcagctc tcctgcttgc ccgcagcctc ccgagggggt cactggggtg 6901 cctggatgcc ctcgtccatc tcggccttcg aaggcacgtg cgtctccatc ccctgccgct 6961 ttgacttccc ggatgagctg cggcccgctg tggtgcatgg tgtctggtac ttcaatagcc 7021 cctaccccaa gaactacccc ccggtggtct tcaagtcgcg cacccaagta gtccacgaga 7081 gcttccaggg ccgcagccgc ctcctggggg acctgggcct gcgaaactgc accctcctgc 7141 tcagcaacgt cagccccgag ctgggcggga agtactactt ccgtggggac ctgggcggct 7201 acaaccagta caccttctca gagcacagcg tcctggatat cgtcagtgag tccccagcgg 7261 ttgtgcaggc accgggagct ggggcagcgg ggcgggaagg agtgtggccg gaaggcctcc 7321 ccgcaccttc cccagcaggc ctggatggca gatctgggag gtgctgattt ggctgggggt 7381 gcaaacctca aggcccacgc agaccatgca gatgacagac atgaacaaag cagggcccca 7441 aaatagtctt gctgccctca gggggaaaca ggtgctgcta ctcttttgga aacacctgga 7501 gaaaggcagt ggggccaggg ggtaggggca aggctgctgg gcccccactt tagcgtggtg 7561 atcttgggca agtgacttaa cttccttggg ccttggtttt cttttccgta aaatggggat 7621 aagaacagca cttaaggcaa tgagtgaaaa gccatcagat gcccagagca gcacctggca 7681 cgtggccccc agctctggcc aacagaaagc agggcggtgg ctgtcaggtg ttctgaattt 7741 tcaagataaa ccagaatttt ggattctttg gtgtaatctc ctaattctta aattttggtt 7801 gctgattcag atttcttttt ttaaaatcct ttaagccaaa tagaatacag ctatggccag 7861 acatggcaat cagaataggg cacactgcct gatttcaggg atccccctgc ctcatttctg 7921 agtggcgctg ctgcttggag tcctcagcgc tgagtcccta tgagagagcc gggttcaccc 7981 tggtgcagaa agagactgca caggtgcagt acacacacac accactctac agaaatcaca 8041 cactacacac tgcacactct acctacacac tacccacaca ccacacacta cacatacatc 8101 acacactaca caaaccacac accacacact gctcctgcta cctacacact gcccacacac 8161 caaacataca catcacacca aacacaccac acacacacac tacacaaacc acacaccaca 8221 cactgcatac actacacaca cacacactgc ttggcgctga ggactccaag cagcagcgcc 8281 actcagaaat gaggcagggg gatccctgaa atcaggcagt gtgccctatt ctgattgcca 8341 tgtctggcca tagctgtatt ctatttggct taaaggattt taaaaccaca caccacacac 8401 acaccacacc 
aaacacacac cacacacaca caccacatta caaaaactca ccacacacta 8461 cacacacacc acacacacca caccaagcac acaccacaca catcacacac tacacaaacc 8521 acacaccaca cacttcatac actacacaca ctacccacac accacacaca caccacacca 8581 aacacacaca ccacacacac accacactac aaaaaccaca caccacacac tgcatacgct 8641 acacacacac tacacacaca ccacacaaac accacaccaa acacacacca cacatccacc 8701 acgctacaaa aactacacac cacacactgc acatgccaca cacacactac ccacacacca 8761 caccaaacac acaccacaca caccacacac accacacact acataaacca cacatcacac 8821 actgcacata ctacccacgt accacacaca caccacacac ctcataaact acacaccaca 8881 cactgcacac actacccaca caccacacac actacacaca ctacacacac cacacacata 8941 caccacacac tgcacatgct acccacacac caaacacaca ctacacacta cacactgcac 9001 aaaccacaca cacactacac cctatataca ctgtacacat acatgcgcca cccccacatc 9061 acaacacaaa ccacacaccc cacacacaac aaacatgcac aacacacgct acacaaacca 9121 cacaccacac acacaacaca ccaaacacac accacacaca cataccacac tacaaaaacc 9181 acacatcaca cactgcacat gctacacaca cacaccacac caaacacaca ccacacacta 9241 cacaaaccac acaccacaca ctgcacatac tacccacgca ccacacacac accacacacc 9301 acataaacta cacacacagt gcacacacta cccacatgcc acacactaca cacaccacac 9361 actacacaca ccacatacat acaccacaca ctgcacatgc tacccacaca tcaaacacac 9421 actacacact gcacaaacca cgcacacact acaccctata cacactacac atacatgcac 9481 tacccacaca tcacaacaca aaccacacac tccacacaca acaaacatgc acaacacaca 9541 ctacaaaaac cacacactgc acacacacca cacagtaccc aaaccacacg ccacacacca 9601 cacactctac ctacacacta cccacacacc aaatacacac cacacaccaa acacacacca 9661 cacaaaccag acaccatgca cacaccacac acacaaacca cacactatac ccaccacaca 9721 ctgcacacac acaccacata ctacagaaac cacacaacac acactgcaga cactacctct 9781 acaccaaaca cacaccatgc gctacacaca aattacatgc tacacaaacc acacacacta 9841 caccctacac accctgcaca catacatgca ctacccacac accacacaca taccacacac 9901 tacacacagc acacacacac cacacatgat acaaaccaca caccgcatac acaccacata 9961 ctacactcac accacacaaa ccacaccaca cacacaccac acaccacaca caccacgtac 10021 tacgcaaacc acataccaca cacgacccaa actacacaac accccacaca cactgcacac 10081 atacatgtac 
tacccacaca tcacacacac accacacatg acacagacca cacaccacat 10141 acaccacgct ctacacccac accacacaaa ccacacatca cacacacaac acacactgca 10201 cacataccat acacaaaaac cacacactac acacatacat cacaaacaca ccacatacta 10261 cacaaaccac acacatcaca ctacacacat gcaccacaca cccacaccac acacacacac 10321 acacacatac ctacacacac acacactcac atgcccggcg gagggtaaac caagcaggca 10381 tgggagagga gagccgcttg ggaagtgatg aggtgtgggg agcccacgct gacgccagag 10441 ccacagggtt tgtagccagt gttacctggt gccagctgca gcctcagttt cttcatctgc 10501 aaaatgagca aaattacagt cctcaccctc tcgctttgcg ggagggactg gcaagttaaa 10561 cgccgtcaag tgctctggcc acgacactgt gattttcatt acgatggcgc tctggtggct 10621 ccgtggaggg ctccggggtg accaagagga ggaagggtaa agggaattct gaatctggag 10681 gcagggagcg gaggcgatac cgttgaggag taccattgcc agcaatggag gtggacaagg 10741 aggagggtgg gtggggtggg accgggaccg tagcccagtg ctgctttctc agccctccct 10801 ttccccgcct cgtatagaca cccccaacat cgtggtgccc ccagaggtgg tggcaggcac 10861 ggaggtggag gtcagctgca tggtgccgga caactgccca gagctgcgcc ctgagctgag 10921 ctggctgggc cacgaggggc tgggggagcc cgctgtgctg ggccggctgc gggaggacga 10981 gggcacctgg gtgcaggtgt cactgctgca cttcgtgccc acgagggagg ccaacggcca 11041 caggctgggc tgccaggcct ccttccccaa caccaccctg cagttcgagg gctacgccag 11101 catggacgtc aagtgtgagc ctgggtgcgg gcggggcggg gtggggcggg gtggggcggg 11161 gtccggggag ggggtggacc tggggatgcg gccggaggcg gggccgggcc gtgatggggg 11221 cggggccatg cccagggcca ggcagggatt ggggggttgg gtgggacggg ggcggaacca 11281 ggcagtcctg gggcggggcc aaggctgagg gcggggccgg acagtgttgg gggcggggcc 11341 gggctgggag agggcactgg gccggttccc cagcacctgc tcactaacct cgctgtgtcg 11401 cgggccttag accccccggt gattgtggag atgaactcct cggtggaggc catcgagggc 11461 tcccacgtga gcctgctctg tggggctgac agcaaccccc cgccgctgct gacctggatg 11521 cgggacggga cagtcctccg ggaggcggtg gccgagagcc tgctcctgga gctggaggag 11581 gtgacccccg ccgaagacgg cgtctatgcc tgcctggccg agaatgccta tggccaggac 11641 aaccgcaccg tggggctcag tgtcatgtgt gagtggccca ctctgtgcgt ccacacgccc 11701 acctgcagcc gagagataaa gggaaagggg cctcatccag ggcgagcatg ggctgggtcc 
11761 cgaggggacc ggccataaac agtcgaggcc aaggtagaca ggggggttgc aagtcaaggt 11821 gtaccttcat tctttccaca agaatctggg gaacacctgc tctgcctcat cctccttcca 11881 ggacaagacc caggcctgcc tcctagggaa gcctgttaac tgcagcttcc tgggggcagt 11941 tcctggttgt tctgggactt gatgtgggtc tggtcccagg aatctgcgtt ttactttata 12001 aaattttttt ggagacaagg ccttgctctg ttgcccagtc tggaatgcag tggcaccatg 12061 atagcaccac ttgccctcaa actcctgggc tcaagcaagc ctactacccc agcctcctga 12121 gcagctgggc ctacaggtgt gcaccaccac actccaggct aattttcaga ttttttttgt 12181 aaaggtggga ttttgctatg ttgcccaggc tggtatcaaa ctcttgggct caagcaagcc 12241 tcccacctca gcctgctgcg ctgctgggcc cacaggcagg tgccaccaca cccagccaat 12301 ttttctattt ttttatagag acagagaatc gatatgttcc ctgggctagt cttaaacttc 12361 tgggctcaag tgatcctccc gcctcagcct cccaaagtgc tgggattaga ggcatgagcc 12421 accgtgcctg tccaggaatc tgcattttta aatgaattcc attgtagttc taatgagagg 12481 gttgagggga cttcaggcct catccagccc cttgtatgca catttatatt actgggccag 12541 caagaaggga gctggtcaga agaggattct tagctgagaa ccaaagagat accattcatt 12601 cattcacaaa tgtgtactga ggcaaagtgc ccagcactgg ggatattaat aatgacagcc 12661 gcagccaaca tttactccac agctcttgag tgcttttttt tttttttttt ttttgagaca 12721 gagtctggct ctgtcgccag gctggaaggc aatggagcaa tcttggctca ctgcaacctc 12781 ccacttccct gttcaagcaa ttctcctgcc tcagcctccc gagtagctag aattataggc 12841 gcacaccacc acacctggct aatttttgta tttgtagtag agacagggtt tcaccatgtt 12901 ggccagaata gtctcaaatt cctgacctcg tgatctgccc acctcggcct cccaaagtgc 12961 tgggattaca ggcgtgagcc accacactca gcctaatttt tgtattttta gtagagacag 13021 ggtttcacca tgttggtcag gctgctctcg atctcctgac ctcaggtgac ctgcctacct 13081 cagcctccca aagtgctggg attacaggtg tgagccacca cgctcggact cctgcatgct 13141 tttgacatgc aatcttcccc ccggcattcc ccacggctgg ctctgcccac ctcctaggct 13201 cgctgaagcc ttcccccatc atcctatccc aaactgtaac taccctgatt ttccataccc 13261 tcagcccctt tatttttctc tacacacttc ttcccatctt acaaattaca tattttattt 13321 aattacatcg gctgtttcca catgggcctt gttcaagcta gagcaatgcc tgtcactcag 13381 caggttctca gtgaatgttg gatgatcacc cctatgaggt 
tggaactatt atgaagccca 13441 ttttccagat gagaaaaatg aggcccagag aggttaagat gtgggtccac acacaactgg 13501 acaggggtag agtagggatc tgaagacaag catgctggct accaagctgc tgttcttgga 13561 aatgagatag ccaggactgg ggtcacctga gccttcctga accggtgtgc taggtttggg 13621 gtgggatctg gggtcatatc tggggatggt agttggctgg cagaagaagc acctcctggg 13681 ttctgaccat cagtcccgtc ctaccccgca gatgcaccct ggaagccaac agtgaacggg 13741 acaatggtgg ccgtagaggg ggagacggtc tctatcttgt gctccacaca gagcaacccg 13801 gaccctattc tcaccatctt caaggagaag cagatcctgt ccacggtcat ctacgagagc 13861 gagctgcagc tggagctgcc ggccgtgtca cccgaggatg atggagagta ctggtgtgtg 13921 gctgagaacc agtatggcca gagggccacc gccttcaacc tgtctgtgga gtgtgagtac 13981 cttccgctcc cctatgctgg ggatggacgg ttccgtgggg gacaccaggg ttactgtggg 14041 tgcccacgca tcccaatcag tattggcttt gcctgtcctc cgcagagaca aggaagggag 14101 ggcagggaag ctgaattcac agcaggaaaa taaaatccca tctgagaata ttgtgttccc 14161 acatccggga gtgggctttt tttttttttt ttttttttga gacagagtat tgagtcttgc 14221 tctgtcacct aggctggagt gcagtggcag gatctcggct cactgcaatt gctgcctcct 14281 gggttcaagc aattctcctg ccccagcctc ctgagtagct gggattacag gcgctcgcca 14341 ccacacctgg caaatttttg tatttttagt agagacgggg tttcgccatg ttggcctggc 14401 ttgtctcaaa cccctgactt cagatgatct gcctgcctca gtctcccaaa gtcctaggat 14461 tacaggcgtg agccaccacg cgcagcctgg ggtgggcttt tgaccagtga aatttctgga 14521 tcctgtggcc actgcccctc tgcactaacc gaaacaatga ccatgatctc taacctgtgt 14581 agagtgtcgc tatgtaccag gcactgttgg aagcgctgta cagcaaccca ctcatttaat 14641 cctcatgaca tcttcatgag ggaggttcta ttactgtttc cctgtttacg gagaaggaaa 14701 ctgagaccca gagaggttat gccacttgcc taagttccca cagccaggaa atggcagaat 14761 ttgaggtttg aatccaggca gtctggttcc attcttcttc cctgcccaag gctgacctca 14821 tcctatgtta tgacagcaga atacattgat ggggtaattg ctatacacaa aaggctgtgc 14881 ggctctgtga gaggtggagg atagggctga gaagacacca cgcttatgaa caggtacaga 14941 atagctggag agatgacaca gatgcacata aaaagctact gtcttggaaa tgaaactcac 15001 cattgcaccg ctggccagcg cctctcatct cctcactatg gactgtgcca ccatctccca 15061 ccacccctac ccccacagcc 
atcactccag gttgtagcag ccaagtcaca gttttgactc 15121 cccttcctta agcaattccc tctggaatat cgccttcctg tttccatttc ccctgcttcc 15181 tcccaagttc ctgttggctt ctagcactcc tccccaaatt tcacctctct ggtcttcacg 15241 atgcccacaa tggcccccac cacacaagtt tgccagaagg ttcgatgaca ttcaacatgc 15301 aaagaattca aagcagtgtc tgctacacag tacatgctct gtgctgtgta catggcgaca 15361 atcagtacat ggcagataca ggttagtcct cgggggcaac ctctcgtggc aaagccgtcc 15421 ctctcctgct gacctataat cttcaggcac tttccctcct ggttctgttc cctagcgggg 15481 ctgatgtgat cctggcgcgg ggctgcgtgg tcccctgctc cctccaactg tgcgtgctat 15541 cctgggccag cttccccctc tggttgtccc ctgatccctg ttacacccca caccagtgat 15601 tcttcagttt aagagtcaga gagaggagcc gatgggagta cttgaacaag cctgcagcac 15661 caactcacat acattttccg gtggggtgca gatctctcct ttatagtaag agcagcaggc 15721 agcaaatgcg cgttaggggc ctgccctggg ccagaccttg aatcctctac agctgttttc 15781 tggttcagtt cttattttta ttattattat tatttttttt ttgagactga gtctcgctct 15841 gtcacccagg ccagagtgca gtggcacgat cttggctcac tgcaacctct gcctcccaga 15901 ttcaagcgat tctcctgcct cagcctccca actagctggg attacaggca tgcgtcacca 15961 tgcccagcta attttttatt tttagtagag acggggtttc tccatgttgc ccaggctgtt 16021 ctccaactcc tagtctcaag tgatccaccc gcctcggcct cccaaagtgc tgggattaca 16081 ggtgtgagcc accatgccca gcctatctgg ctcattttca attctagggt caccctgtga 16141 aacaggggtg tgttataatt atccccttta ccaaggagaa aactcaaggg cggaattccc 16201 tgaggccagg cagctataaa agtgggtccg agcccgataa ttccagactc ctgtctgccc 16261 cagctgttaa acctctggct gcctcacctc cctcacagct cttagcactc taggactcaa 16321 tccggcttct cttgaaaacg aacacgcagg cagatgtgga gtgtgcaggc ctcacctgct 16381 agacttgagg ggagcagtgg ccctgtcccc tgccccacct gccagctccc tactcttcac 16441 ccaggcctgt gtcaaagtgc ttgcccttgg agagtgcgac cctcaggccc cctccctctc 16501 cacacagtca atgttccctg gcaccaggtc tccaatgttc actgacagtg gtggtgaccc 16561 ttgagcaggg ccttctgtct tacctgccaa ccccagagcc ctgaacggca cggggcatgc 16621 aggaagtgtg caaccttacc gagtgcctga gatgtctagg gaactagctt gagtacgttg 16681 tacgcattga cctattttgt tcttacaacc atgtgattga ggggagggta ccattcttca 16741 
acccagttta atacaaggaa acagaagttt gactggcatc cagtcccaac agggctgaac 16801 agacctgggc tagaatccca gctctgtccc catcctcctg cctgagctca ggcgagtgac 16861 ttcagctttg caacccttag caagggttca taacagtatc ttaccacaaa gatggctgtg 16921 agaactaaga gaattaggct gggcgtggtg gcccacacct gtagtcccag cactttggga 16981 ggctgagaca gttggatcac ttgaggccag gagttcgaga ccagcctgac caacctgaca 17041 aaaccccatc tctactaaaa atacaaaaat tagccaggcg tgatggcgca cacctgcagt 17101 tccagctact caggaggctg aggcaggaga atcgcttgag cccggggggt ggaggttgca 17161 gtgagctgag atggagccag tgcactccag cctggatgac agagcaagac tgtcccagcc 17221 ccccactcca gcctggatga cagagcaaga ctgtcccacc cccccactcc agcctggatg 17281 acagagcaag actgtcccat ccccccactc cagcctggat gacagagcaa gactgtccca 17341 cccccctact ccagcctgga tgacagagca agactgtccc atccccccac tccagcctgg 17401 atgacagagc aagactgtcc cacccccaca ctccagcctg gatgacagag caagactgtc 17461 ccaccccccc actcccccac aaaaaagaag aactaagaga acaatactga gtgttcatag 17521 tgctacatac tctgtggata ttggatgtca tcatctttct cactgatttt tcctttggac 17581 tccaggttaa atatttgttg tgtgttttgc tgctgaggtt ttgctcatgt gacctctctg 17641 cgggaatggt tctgtctatc cacccatgag tacttgggtg ggatgttacg tatttttaat 17701 tttaattttt tatttttttg agacagggtc ttgctctgtt gcccaggctg gaatgcagtg 17761 gtttgatcat cgctcactgc agcctcaacc tcctgggctc aactgattct cacacttcag 17821 cctcctgagt agctgggact acaggcacac atcaccatgc ccagttaatg ttttaatttt 17881 ttgtagagat ggggtcttgc tctgttgccc aggctggtct caaactcctg ggctcaagtg 17941 atcctcctgc cttggcctcc caaagtgctg gaatgacagg tgtgagccac tgcacccggc 18001 tgggatgtta agtgtttaca acacacttgc gcgcctatct tcggagtccc gtagcccctt 18061 gtgagttact gactgtaacc ccctggtttg cacagatgag aactccatcc tagaggccca 18121 ggagcctgag tggcctcacc cagccagtag atggcagagc tgggattcaa gccaaatcca 18181 tcccaccaca aactcctcat tctttgtacc tttgtcttac aggtttgtct agagtcccat 18241 aacagtttcc gcagcttcac aggttgtctg aaaagatact aagcacgtgc accggcaggg 18301 tgtcggcgtt ccgccgacat ccaggtcgaa cactttgtca ccagagcacc agtggggctg 18361 tcttgagtgg aagtgctttg aaaaagtttt atgtctgtcg gttcaggttc 
gctgttatta 18421 atgtactccg tggcgtgtta gctggttaac gctatagcaa gcatattgca gctctgcccg 18481 aacaggactt tttattcagt agacttagag atgtttgcac agaggttccc attcctgccg 18541 tgaatccgtg tgcccgtgtg actcccaggt ccaccaccgt taccggcaca ttctttttgc 18601 tagcaattga aaataggtgg ggtaggtgag ccaagggtct cttgcctgtg tgtaagtggg 18661 ttttggtgag aactcactga gaaccagtgc tcagtctaga gcaagccaag gagggtgctg 18721 gactctcagg ccaagaaaga gaacacagtt taaaaagcag cattctgtgc tgtaatacaa 18781 ttgcgtattt tggtacaaaa attagccagg catggtggca cgcacgtgta gtcccagcta 18841 ctcaagaggc tgaggcagga caatcacttg aacctgggag gcagaggttg cagtgagcag 18901 agatcgtgcc actgcactcc agcctgggcg acagagagac tccgtctcaa aaatcatagt 18961 aataacaata ataataataa taatttcctg ttttggaaaa tgaaagaaaa aacctaccag 19021 tactattttt gtaatccagt ggcagcaaag caaagctcaa ccaagtgact ttgggtggag 19081 ggtggtgttt ggtgctagtg gcaggaggga atctcatggg tgaggaaggg ccggggttca 19141 aggcggcttt aggggaggaa gtggggggca tctgcagcgc agactgaaag gaaggagcac 19201 agcctggaag ccagaaagga ggggatagtc taagcctgtg tgagccccaa gggtctgggg 19261 aggccggggg ctgggagagt tccatggttc ctcccttcta cgtgtggggc ccagagtgga 19321 gaaggagcaa gcccctgtct tgatggaaac aggaggcccc ttatgtaagt gtgtgatgca 19381 gggagggtga gtgctgggga ggagaactga gtgaggcagg gagtgaatga tacgtggagc 19441 gaggtgcagc ttccctgggt ggtcagagaa ggagacgccg catgggtaga ggccagtttg 19501 gagcgaggga gggcgttcca caggcaactg atggaagtgt tccaagttga gggagaaacg 19561 tacagaccct ggagtgggag aaagctccac gattttcaga cacagcaagg aaccggtgtg 19621 gctggaaagg actaagccag ggggagagaa ggagagagtg tgtggtcagg cagggcctga 19681 ggggccttgt agatgaaaga ctttgtattt tgggctgggc acagtggctt gcgccttata 19741 atcccagcac tttgggaggc caaggcagga ggattgcttg agctcaggag ttcaagacca 19801 gcatggggaa catagcaaga ccctgtctct acaaaaacaa acaaaaaccc tccagtaact 19861 atcatgagta tgacagattt accgtcctct gcaattctga acataaagct ttaaaatcaa 19921 aaggcttttt gcaaatttgg caccaccctg atttgacagc aaaatctcgc ctaaagaaca 19981 caaggctgtt tgtaatctat actgatttca tttagggtga atattcgtac tttttgctgt 20041 atacattaca ttgaactatg tgaaattttt 
tctgcatgtc aaaaatggtc aaatgtggct 20101 gggcgcagtg gcttatgcct gtaatcccag cactttggga ggctgaggcg ggaggatcac 20161 tagaggccaa gagtgcaaat gagctatagt actgccactg cacttcagcc tgggcgacag 20221 ggccagaccc tgtctctaaa cgataaaaaa taaaggattt gtattttagg tgaaccacgg 20281 agcaacagaa ggagggtttt gagcagagag tgccatgatc tgatttatgt attacaaggt 20341 gaatgaggtt agagagagtg taagagccag ggcccaggag aaacggcccg tccctgtgac 20401 atagtgctct gtccaggctg aagctgaggc tagagtcgcc tcaaaagtgg caccagaaga 20461 attgccccaa gagagcaccc caaaccaccc cgaaatgagg ccacttggga ttccaaagag 20521 agaggcacta aaccaccagg tgatcatcag tccaaagcat ttgggagggg aacttgcaca 20581 gtgctacggc gagagaaagg gagggtctac ccagacacgt ccgtgagggt cagggtatgg 20641 gtttacatga gggtttaagg aatttggccc agggctgggg ctggtttctt tcattctttc 20701 atttttgttg ttgttgttgt ttggggtttt tttgagacag cgtctcattc tgtaacctct 20761 cgtactcctg acctcaagtg atccactcac tcttcggcct cgcaaagtgc taggattaca 20821 gcagtgagcc accatgcccg gcctctttca gtgttttgag caacaaccta aacaccttta 20881 tcagtgcctg ggaatgttca aggccccagg cgtgggttag tgaagcttgc agtgaaaaca 20941 ggcagccggc ggggtcgtag tgaggtcaag gcgctctcag tcaggacaga gaaaaaagga 21001 ggggaaattg gagggcccta caggaagcta ccgcccccat ccttggccac actggccttg 21061 ccttgagcca aggagtctcc ggggccgggc ctcgcgttgg ctccgggcca ccctcagacc 21121 tgattttgcc cctgcagtcg cccctgtgct cctcctggag tcccactgcg cggcagcccg 21181 agacacggtg cagtgcctgt gcgtggtgaa gtccaacccg gagccgtccg tggcctttga 21241 gctgccatcg cgcaatgtga ccgtgaacga gagcgagcgg gagttcgtgt actcggagcg 21301 cagcggcctc gtgctcacca gcatcctcac gctgcggggg caggcccagg ccccgccccg 21361 cgtcatctgc accgcgagga acctctatgg cgccaagagc ctggagctgc ccttccaggg 21421 agcccgtgag tggcgtggac ttggggtggg agccacagga gggagcgggc acgtcttaga 21481 tccaagctcc caaggctgac caacagccac agagcatggg ctatgcagat tgacagaaag 21541 gtcaaattct ccctttccgg ttggagagga gaagccgccc tgagtttgcc ccgagtttgc 21601 taggaccttt gaggcatgag cccctcccca gggccctcta gcatgaaagg gattgccctg 21661 gcacgaggag gctccgtgcc aatcagtcag tgcaaggatg ccccggctgt gtgactacat 21721 tgactgtcat 
cccagcatgt ctccaaagtt ctcaggtgtc gtcaccacca cccatagccc 21781 taagggcgcc tgggtctttt ctgtcctcag atcgactgat gtgggccaag atcgggcctg 21841 tgggcgccgt ggtcgccttt gccatcctga ttgccatcgt ctgctacatt acccagacac 21901 gcaggaagtg agtgccagct ggggctgatc tggggatggg agtctccaaa aaggggacct 21961 ggtgggagat ggaggcccag agtgggtggg ggaaggcagc accataagtg tagagaaggc 22021 agattctgtt ctgggaatgt tcagctggac agaggagaga gggttcctgt taattccagc 22081 ccctgctggt caggggcaca ccagggcctg ggctgagccc tttacacaca tcatttgaga 22141 cctgtttgcc cacattctat tttgacagcc tccctatctc taggaactct cccctacaca 22201 ctccagtgtg gtggggaaag cccaggtacc ccactgcagc cacagcagca ggtccacttg 22261 acgccagaag ccccaaagat tacacacatc tggggcaaaa aaaaaatagg ccttggccgg 22321 gcgcgttgac tcacgcctgt tatcccgcac tttaggaggc caaggaggaa ggattgcttg 22381 aggccaggaa tctgacacca gcctgggcaa catagcaaga ccacatctct aaaaaaaatt 22441 tttttttaat tagccaggca tggtggcaca cactgtagtc ccagctactt gggaggctga 22501 ggtggaacga ttcttgagct caggaattag aggccagcct gggcaacata gggagacccc 22561 atctacaaaa aaacaaaaaa attagctact agtggcacgt gcctgttgtc ccagctactc 22621 gggaggctga ggagggaaga tcgcttgagc ctgggaggtt gaggctgtag tgagctatga 22681 tttcaccact gtactccagc ctaggtgaca aagagagacc ctgtctctaa aaacaaacaa 22741 acagacagac agacagacag aaagcctctc cagccccaaa ccatacccac ctggcccaac 22801 tcagctgcag ccgtgtttct ttcttgtggc acacacacat cagctgtcct gagaaataaa 22861 ttaacccctc tgctccacat cctggtggca agaggggcag aaagacccag agcctatgag 22921 gggaacctgg agaatccagg tgggggtcct gaaaaacggc agagctcagg ggccaggtgg 22981 ctcgtcccca gccgggcagg ggccacagag gtgggaagga tgggctgtgt gggtaggaga 23041 ggctggtggt tctgaaaggg ctgcttgcca catcttagag atgggtggga gaactctgag 23101 gtcccctgag ccaaactcct aaaacacgtg ggggtgcaga aagggagggc agagagaggt 23161 ctgacgagtc ctgccctgca gaaagaacgt gacagagagc cccagcttct cggcagggga 23221 caaccctccc gtcctgttca gcagcgactt ccgcatctct ggggcaccag agaagtacga 23281 ggtaaggacc aggctccagg ctggcctggg aacctggatg ccagggacac tgaaggcctg 23341 ggaaaggcct gtgaggccaa ggggcagggg agtgggagcg gcctctcaca ggaggagagg 
23401 aaggacattc cagggctggg ttggacctgg aggctgggga ggaggtgccc aggccgaagc 23461 tgtgtagggg gcgatgggac gtggggccta agggccccct cccctccctg cctgtgtctg 23521 tgactgcatt tccttctcca atagtccaaa gaggtttcta ccctggaatc tcactgagtg 23581 ccccaggagg taggttccgg gggcctcagt gtgccctcct ctgggccctc cttgggccct 23641 ggggttggcg tgcatgtccg tgtgtccctg tcaggcgtca aggtggtctc tggtgatgtc 23701 cgggcctgcc cggcagtgct ggccttgtct agtctgtcct gttgcagctc ccttctgtct 23761 gtggccaccg tcctgtgtgt ccaaattcct gtggctaggg ggtggaggga ggggtgggaa 23821 gcgtgggggg aacccacatg tcacctgcag gacctcccag attgagggcg gacccatgca 23881 ggacccatgg gggggcacgt accaaaccag gtatttccag gaatttggaa agatggggtc 23941 ctgactaggt ggttttcacc caaaaaggta ttaggggtta cattgtgtgc cctcaaaact 24001 cctatgttga aagcttcctg gacgggggca gtggctcaca cctgtaatcc cacaactttg 24061 ggaatctgag gcggatggat cacttgaggt caggagtttg agaccagcct ggccaacatg 24121 gtgaaaccct gtctctacta aaaatacaaa aaattagcca ggcgtgttgg caggtgcctg 24181 taatcccagc tacttgggag gctgaggcag gagaattgct ggaacccagg aagtggaggc 24241 tgcagtgagc aaagatcgag ccattgcaat ccagcctggg tctcaaagaa aaagaagaag 24301 aagaagcagc ttcctggaga aactgaattt gtaggcaaat ttcaagttca gcaatgatta 24361 acactggcag aagctccaga gtgggacctg gtgaccgcag ggctcgctgg gcctgcggtc 24421 cttgagaaag gaggttctcg cagggattta tttgggactg aactcaggag ctctgagggt 24481 gcaaacgctc tgtcctctgc attgggaaaa ggcagggagc aggaccctgc taatgggcgg 24541 tttcccctct tagagcgaga ggcgcctggg atctgagagg aggctgctgg gccttcgggg 24601 tgagccccca gagctggacc tgagctattc tcactcggac ctggggaaac ggcccaccaa 24661 ggacagctac acgctgacgg aggagctagc tgagtatgct gaaatccggg tcaagtgaag 24721 gagctggggg cagcctgcgt ggctgacccc cctcaggacc ctcgctggcc cccactggct 24781 gtgggctccc ttcctcccaa aagtatcggg ggctggggca ggaggggagt gaggcaggtg 24841 acagtgaggt cctgggggcc tgacctcccc ctccttccca gctgcccctc cctgccagca 24901 cccccacgcc ctcattacgg ctcctctcta acctccttta ccctcatctg tctggagggg 24961 agctctgtct gtccgtgtta tttattgcta cttcctgcct ggtctcctgc ccccacacct 25021 ggccctgggg cctgtacaaa agggacatga aataaatgcc 
ccaaagccaa atgccagtct 25081 agatcctgat gctttctgac ccgcctttgg gcagcctgcc atcccaccgt ctacagaacg 25141 atgacagaat ttctccttgc cctcggctga gcctggtatg caggcaaggg cccagggatt 25201 ttagccttga acccagggcc agtggccact aagttgctcc cactctgagc tggtagctgc 25261 gagctgggtc tggtttgact tatggatgat ttcacataaa aatcctcatg ctggtttttg 25321 ccgcagtcag atcagctgac agcaccaggc ttgcacttgt acgtggacat gacttgctga 25381 aagggagggt cctgtgcact ggccgcagtc cccaccactc cctattgtcc cacacctggc 25441 cggcttcctg cactgagatt acctcctggt ccctgcagct tggagtgcga gccacagatc 25501 taggtctcac cagcccgtgt tggtttcacc accatctctc acattcatgc tgcaatttgc 25561 agcgtcagcc tttatacatc actacattca gttccagaaa cattaattaa gcaccagctg 25621 tatactgagc cctttctaga ggctcaggct gtcctgagaa gggtctccta ggacagctct 25681 gtgacgatga gaacactgag acttagagaa gttaggtgat tttcccaggg tcccagagct 25741 cggaaacggt gtgtccaggc catggggttg atgcttctga ttcaaatgca ctctttcttt 25801 tcttttccct tcctccctct ctctctctct ctctcttttt tttgatggag tctgtctctg 25861 tcacccaggc tggagtgcag tggtgcaatc tcagctcact gcaacctccg cctcctgggt 25921 tcaagtgatt ctcctgcctc agcctcccaa gtagctgaga tcacaggcac tcgccaccac 25981 gcccggctaa tttttgtatc aaatgcactt tttctataat acctgtcagg aagttaaaaa 26041 gggccagagt gtctggtata aatgggattg ataactaaca gttgttgaac gctttctatg 26101 attctaccca gtttgcatta attaatctcc attaacctca tttaccttcc ctaataacta 26161 tgacaatggt tctcaaagtg tggtccctag gcagcagcat cagcagcacc caggaacttg 26221 ctaaaaaggc aaattttcag cccccaccct agacttcagg aatcaaaaac cctgaggtgg 26281 ggcccaggaa tctgtgtttt gagaagtcct tggtggattc tgatgcacgt gcacgtttga 26341 gaacaagcag ctgaagcatg gagaagttac atcagttctg tggtccccaa ttgttagcag 26401 tggagcccag atttgagccc cgaccactgg cttcggaggc cacactctga cgctgtgcta 26461 cacggccatc ttggtgagaa ccaacagctg ctcacagagc ccgggtgctg gaggagcagt 26521 aaggttaagc tgtgggctgc gggcagtgag ggcagggttt cctagacggc agctctccct 26581 gatttttttt tgttttttgt tttttttgtt tattttttga gatggagtct cactctgtca 26641 tccagtctgg aatgcagtgg cacgatcttg gctcactgca acctctgctt cccaggttca 26701 agcacttctt ctgcctcagc 
ctcccaagta gctgggacta caggcgtgtg ccaccacacc 26761 cagctaattt ttgtattttt agtagagacg gggtttcacc attgttggcc aggctggtct 26821 cgaactcctg acctcaggtg atgcgcccaa cccaccctcc caaagtgctg ggattacagg 26881 catgagcccc tgtgcctggt ctctccctgc attttgaaga aggcagagga cacagcacag 26941 agaccaaaat aaaggcctag aggcaggaat aaacaaacca ggtcgggagg ggaggtgaca 27001 tggggctgct tggctgggat gggaagtcca catgcaggag gaaaggagac gtgtgggggg 27061 ccgggctgtg ggaagggaaa ccctctgtgg ggcctgtcca agctgggagg gtgcatgccc 27121 tttcttcacg gagctcacag tcttgcaggg acaaggacat gaaagaatct catgcacaat 27181 atcaaactgc aactgtgaca ccggcttcca aggagaagct cagagttttt acagcagggg 27241 tccccagtgc tatactgcac acccagatac tggtctgtgg cctgttaggg actgggctgc 27301 acagcaggag gcgagcagcc agcgagcaag catgaccacc tgagtcctgc ctcctgtcag 27361 atcagcagcg gctttagatt ctcatagcag cacgattctt atcgtgaact gtgcacgcaa 27421 aggatctagg ttgcacactc acagatggtg cctgatgaac tgtcactgtc tcccatcacc 27481 cccggatggg aacgtctagt tgcagggaaa caagctcagg gctcccactg attctacatg 27541 atggtgagtt gtatacttac ttcattatat atcacaatgt aataataata gaaataaagt 27601 gcacaataaa tgtaatgtgc ttgaatcatc cccagctccc tccccactct caggcccatg 27661 gaaaaattgt cttccatgaa accaatccct ggtgccaaaa aggttgggga ccgctgtgtg 27721 acttaaggtc acagggggca ggggaggctt ccctatggaa ggaaactcac acatgtgtga 27781 aggccccaag tgggaacatg tttagcaaaa ttccaggagc tgaagcaagg ccagagaggc 27841 atcggatggg gctggggata cacaggggcc tctcagcagc cgggagggaa gcctcttcca 27901 gatcccctgc atacatgctc accagccctg ccctgacaca cctgtcccct gacgacagac 27961 tctggctcca gcaggtctgg cctgggagga cttcacccag aggagcaaaa cattgtcaat 28021 aaccagggat cggggagtaa ctcacgctgc ccaaagtgca gcttgagggc cctgggagcc 28081 actgtgacgc tactgtgtat gcttcattta gtatgcaccg gaacaacagg aaaccagaat 28141 gccgcaccag cttcggatga ggctaagtca aaataaataa ataagaaaga aatcaatcaa 28201 ctgaactgat tttaaaaatg tattaaataa acatgttgca ggtggctcct gaaggttgta 28261 ggcaaacctc taaagtttgg gaaacaatga gggaatttga cacaaagacc atccttagga 28321 gggaccggca ctcagccatg gcgaacgtgt aggaccgagc caggtgcggt ggctcatgcc 28381 
tgtaatccca gcactttggg aggctgaggc tggtggatgg cttcagccca ggagtttgag 28441 accagcctca gcaacatagc gagaccctgt ctctacaaaa aatacaaaaa ttaggcaggt 28501 atggtggcat gcgcctgtag tcctacctac tcgggaggct gaggtgggag ggatagtttg 28561 agcccaggaa atggagtttg cagtgagccg agattgaacc attgcactcc agcctcagca 28621 acagagtgag accctgtctc aaaaaacgaa caaacaaacc agtaggacct gaactcaagt 28681 tcactctctc agactttccc cacccagctc ctggacccca ccagaattca acacatgctc 28741 actgagcacc tctgtgtgcc aggcctatgc tgggccccag ggtcacagca gtgcccaaaa 28801 agaatcactc tgtgacctcc cagagctgac attctgcaga agagacagac aacgatcaga 28861 aagacaagca aacacatgac cgcacatctg acagtgaagg aactgcagag gggtgcaagg 28921 tcaggctgtg ctgggtcagg ctgccatttc agaaagagaa gtcatttgac ttgagaccag 28981 aaggaagtaa gggagggagc tctgtgggtt cctggggaat gtgggtccag ggaaggccaa 29041 aatcgagtgc aaaggcccct ggcgagaatg tacttgttac cttcatggaa caagaaggaa 29101 agaagcatgc ttgagtggag tgatggagcc agagtggttg cccaaggaag ctcgattaca 29161 cagggccttg taggccacag gcaggactgg ggcttgtttg tttgtttctt tgtttgtttg 29221 ttttgaggca gagtatcgct ctgttgccca aggaagctcg attacacagg gccttgtagg 29281 ccacaggcag gactggggct tgtttgtttg tttctttgtt tgtttgtttt gaggcagagt 29341 ctcactctgt tgcccatgct ggagtgcagt ggcacaatct cagctcactg cagcctctgc 29401 ctcccgggtt caagcaattt tcctgcctca gcctcctgag tagctgggat tacaagcatg 29461 taccaccacg cctgtctaat ttttatattt ttagtagaga cggggtttca ccatgttgac 29521 caggttgctc ttaaacttct gacctcaggt gatccacccg cctcagcctc ccaaagtgct 29581 gggaatacag gcgtgagcca tcccgcccag caatggaatt taatcctagg tttttttttt 29641 tttttttttt ttgagatgga gtctcgctct gtcaccaagg ctagagtgca gtggcatgat 29701 ctcggctcat tacaacctcc gcctcccgag ttcaggtgat tctcctgcct cagcctccca 29761 agtagctggg attacaggca cgcaccacca tgcccaacta atttttaata gacacggagt 29821 ttcaccatgt tggccgggct ggtctcgaac tgctgacctc aagttatctg cctgcctctg 29881 cctgccaaag tgctgggatt acaagtgtga accactgtgc ccagccagga cttgggcttt 29941 caaagagcgc gttgagaagc cgctggagaa gtgagttcgg aggggtgtga tggcaggctt 30001 gaggcttcag aggcccactg gcagctgcgt ggacagtgag cacaggctat 
cagtggaggc 30061 agggacaggg cttgggaggc cctggccatt gtccaagaga gaggaagtgg agatggggag 30121 aggggtgggg tcaggaccca ttttggagga ggaagtggtg agactcactt gaagagtcaa 30181 aggtgactcc agcagcttcc ttaagaggtt gaagaaagca ggaggggcag gttcagtgga 30241 gaacgcaaga gtgatctact ccggccgagt tcaggcccag ggaggccctc agggacaggg 30301 aaggcagatg gcaccccctc cctgggaaat gccaagccga ggaagagtca gaacacatga 30361 cacagtgact cagagagatg ggggcagacg ctgagatgag ggttggccag gcaggaagga 30421 aggtgagaag gtctcatggc ctcacgtgcc agtagggctg ggccaccacg tgtgggaggg 30481 aggacaccca ggcaccaggc catggtcatg acacgggtct ctttgtccct cccggtctgc 30541 caacctcctg agtcactcct gacggagatg actgatccat ccctggcccg cctcccaccc 30601 tgtccctggg acagtgcctg gggcaggcag ctccattatc ctccttgatg aaagagctcc 30661 cagccagcac tggccactcc agagcctcct ccccaggtgg gtgatcatgc agggctgggc 30721 tcttgcgact ggccgggggt agagatgcac cgcactgctg agaagtgctt tctaggagtt 30781 caagaccagc ctgggcaaca cagcagacat tgtatgtaga aaaaaaaatg tttaaaaata 30841 agctgggcac agtagtgcat gcctgtcatc ccagctactc aagaggctga agcaggagga 30901 ccacttgagc ccaggagttt gaatctgcag tgagctatga gtgcaccact gcatgccagc 30961 ctgggcaaca gagtaagact cagtcactaa aaaagaaaaa aaaaaagatc gaatgtggtg 31021 ctcatgcctg taatcccagc actttgggag gccaaggcgg gcagatcatc taaggtcagg 31081 agtttaagac cagcctggcc agcatggcaa aaccccatct ctactaaaaa tacaaaaatt 31141 agccaggcgt ggtggtacgc gcctgtagtc ccagctactc agaaggctaa ggcaggagaa 31201 ttgcttgaac ctgggaggcg gaggctgcag tgagctgaga tcgcaccatt gcactccagc 31261 ctggacaaca gagtgagact tggtctcaga aaaaaaaaaa aaaaaaaaaa aaagaaagaa 31321 aagaaaaagg gggagaaaaa aaggagaagt aggcgagcct ctttctttcc ccactcttct 31381 cgctctcaac tccaggctgc tccttaaaca ggaggtgaca acctcaacct cgcaaattaa 31441 aaaaaaaaat cttttaattt tgaaataatt atcacaggaa gttgcaaaga tagaacagag 31501 aagtctggca gcagctgagc caaaggcccc ttcccaccct agagctcgat ctgcctgggg 31561 agcgggaagg aaatgacagt ttagtcccaa agggcctctc tgtgaagctc aggctagggc 31621 tgagtagcca gaggtgggtg aattaatgtc tgcagggagt gctgggtcag cagcgtaagt 31681 gacaatgtct ggaataatga cagctcactt 
ttacagagag cccgttccat agcagccacc 31741 atgcaaagtg ctgtaggggc tggggacctg tcacagcata gccctgaagt cataacagca 31801 atcttattat caccctggga gggaaactga ggtgatgtca caacaaacac gtggcagacc 31861 tgggactgga acccgggtct ttttaccaca tgcttgtcat gggatgtgcg agttgctgag 31921 tgtgtcctgt gtgtgctgtc accagcccgg tgaggaaggc actatcattg ttcccgtctt 31981 accatcctgg gcagaggttg gggggtgagg gggacatagc caggcaggga gtagccaccc 32041 agggctcaca gggactctgt tagttgcctg ctttcactgc ttccctggta aaggggtgac 32101 agtcaagtca ccaaaaaagg agaccaacga cctgtttgaa ggaacaggga ggaagagtga 32161 aagcgccgag gctgcgttct caggagtcct gctcctcctc ccttgccttc tccacgcagc 32221 ccttggcctc cagaaatcaa gacacgtcca caagcccaat ctcctcctgt tagaagacta 32281 tgagccacac ctggtgctgt ggcctgcaga aacctcagct actccttcta ggtgaggagc 32341 aggaattcaa gccccaaagg gtaccctccc tgagaccttt gggacctggc cacctggctt 32401 ggagtccccc attgagattc ctgggattaa aaaaataaaa atatacggcc aggtgcggtg 32461 actggctcct gtaatcccag catcttggaa ggctgaggaa gtggattgct tgaggacagg 32521 agttggagac cagaatggga aatgtgagac cccgactcaa taaaaaataa aaaataaatt 32581 aacggggtat ggtggctggt acctgtagtt ctagttactc aggtggctga ggattacttg 32641 agcccacctc aagttcaagg ctgcagtgag ctatgatcac accacttcac tccttcctgg 32701 gtgacagagc gagactgtct taaatttttt tttttttaat aagtatagat tcacagaagt 32761 tgcaaagata gaatagagaa gtctggtgta ctcttcaccc agcatcccct gatggttact 32821 tcttaagtca ttcaggacaa tatcaaaacc aggaatttga cattggtata atgtgtgtgt 32881 acagttctat gtcattttat cacatctgga tgtatataac aacaactgca gtccagatac 32941 agaattattc tgttactaca aagacctccc ttgtgctact cccttcatag tcagacactt 33001 tcccctacca tccctaaccc ctggcaactt ttcatcttct ccacctctat cattttgtca 33061 tttcaagaat gttatgcctg ggcgtggtgg ctcacacctg tcatcccagc actttgggag 33121 gctaaggcag gtggatcact tgaggtcagg aatttgagac cagcctgacc aacatggtga 33181 aaccccatct ctactaaaat acaaaaatta gccaggcatg gtgttaggca ccttaatccc 33241 agctactcgg gaggcttagg tagaagaatc acttaaatcc tagaggcaga ggttgcagtg 33301 agccgagatc gagccactgc actccaacct gggggacaag ggcaaaactc cgtctcaaaa 33361 aacaaacaaa 
caaaaagaat gttgtaacac atgtgatcct tggagactgg cttttttcac 33421 tctctgtacc actcttcaga atcgtccaag ttgttgcctg tatcctcttc atatctcact 33481 cctttttatt gctgcatagt actccatggc atggatgtac cgtaatctgt taagccactc 33541 acctagtgaa ggacgtattg gttgtttcca gtttttggtt ttgctttttt ttttcttgag 33601 atggaatctt gctcttgttg cccaggctgg agtgcaatgg cgtgatctcg gcttaccgca 33661 acctctgcct cccaggttca agcaattctc ctgcctcagc ctcccgagta gcttgtgtta 33721 gaaatgcttg atttttggtg ctgtaaagaa atagcacttt aatgtaaatt taatttcttc 33781 agcaaggcta gttttacttt ctgcagaaag ggtatactcg ctagcagttt tgctacaaaa 33841 gtacactgaa caaaggagac agggtcattt ataacttgat gcatctactt tattgctgtg 33901 tctggtttct attggctgaa acgggacctc acattctgta tttgtctgga ttggctagca 33961 acttagaact ttttaaaaga ggcaaaggca gaggagaaca aaggaaggag aaagtaacgt 34021 ggaatactga gaaaagtaaa aacatttcta aataaggaag aggaacaggc tttgacttaa 34081 tgctttcttg gactagtata agcatgctag ggcaaatctt taggctaaat tgtaggatct 34141 aagaacataa agtacattga tttttttatt atggctagta gatatttaag aatgttagca 34201 caggtctttg aataaatttt gcttctaaga gaagttacta tttattctta attagatggg 34261 gaggacagtc tctttgaaaa ggaacttcta ctttactatt tacactggga ttaccagcat 34321 gtaccgccac gaccagctca ttttgtattt ttaatagaga caggatttca caatgttggc 34381 caggctggtc tcgaactcct gacctcaggt gattgccctc cttggcctcg caaaatgctg 34441 agattactcc tgacctcagg tgatcgccca cctcggcctc ccaaagtgct gggattacag 34501 gcgtgagcca ccgtgcctgg cctgtttcca gtttttggct attacaaaca aagctgctgt 34561 gaataattgt gtgcaggttt ttatgtggac atacgttttc atttcttagg gataaaatgc 34621 ccaggagtgc aattgctggg tcatctggta ggtatatttt gtttagtttt tcaggaaacc 34681 ggcaaactgt tttccagagt agctggacca ctttgcatcc ccaccagtcg tgtgtgaatg 34741 tttcaagttc acagcatcct caccagtgtt tggtgttgtc attattggtt atttaatttt 34801 agctgttcta aagcggtatc ttgtcatggg tttcaatttg catttcacta atggctggtg 34861 atgtgagaca tcttttcatg tgcttatttg ccttctatat accttcttca gggaaaggtg 34921 tcatcatgtc ttttgcctat tttctaactg gactgttttt tatttattta tttatttatt 34981 tatttattta tttatttatt tagagacaga gtctcactct gtcgcccagg ctggagtgca 
35041 gtggtgggat ctcagctcac tgcaacctcc gcctcctggg ctcaggtgac tctcctgcct 35101 cagcttcctg agcagctggg attataggcg cacaccacca tgcctgctaa tttttgtatt 35161 tttagtagag atggggtttt gccatgttgg ccaggctggt cttgaactcc tgacctcagg 35221 tgatccactt tgcttcccaa agtgctggga ttacaggtgt gagccatcgt gcccagcctg 35281 atttttattt tatatatata tatatgtaca cacacacaca caatatacat acacacacat 35341 atatgcacac atatgtgtgt taaagctgta tgtaaagctg ttatatatta catatatact 35401 acatatataa cagctttgct gagatataat tcacatacca taaaacacac ccatgtaaaa 35461 tgtccaattg aatggttttg ggtctattca ctgaattatg caaccatcac cacaaccagt 35521 tttagaacat ttcatcaccc caaaataatc tcacatccat tagcagtcac tcttcatttc 35581 ctcttacctt tcccacccca ccccagccct acgcaaccgc tacatctact tctgtctcta 35641 cagatttgcc ttttctgaat atttcacgtg aatggaatca tacaatatgt gacattttgt 35701 gattggcttc tttcacttaa tgttttcaaa gttcatccat atccaggcgc cgtggctcac 35761 gcctgtaatc ccagcacttt gggaggccaa ggcgggtgga tcacctgagg tcaggagttc 35821 aggaccagtc tggccaacat ggtaaaaccc cgtctctact aacaatatga ataattagct 35881 gggcgtggtg gcggatgcct gtaatcccag tcactcagaa ggttgaggca ggagaatcgc 35941 ttgaacctgg gaggcagagg ttgcagtgag ccgagatggc accactgcac tccagcctgg 36001 gcaacaagtg aaactcggtc tcgaaaaaaa gttcatccat gttgtagtat ctatcagtac 36061 ttcattcctt tttattatca aataatattt tattgtttag gtctcccata ttttctttat 36121 tcattcatct cttcatggac atgtgggttg tttctgcttt ttggctttca tgaataatgc 36181 tgctgtaaac atttgtgttc aagtttctgg gtgaacatgt tcttatttct cttggttata 36241 tatgtaagag tggagatgca ggagtgtatg gtaattctat gcttaatctt tggaggaact 36301 gttttgagac ttttttccag agtggctgga caattttata ttcacactag caatatataa 36361 atgatttagt ttccatacat ctttccagta tttgatacta tcttttattt tagctgttct 36421 tatagatgtt agtagtgata tctcatcatg gtcttaacct gtctaaaggc tagtgatgtt 36481 gaatatcttt ttagataatt attttctatc catatatccc cttgggtgaa atttctcttt 36541 atatcttttg ccagtcttct aattggatga tttgattttt ttatgtagtg tttctatgtt 36601 gaatttgaga gttctttata tatttgttct agataggagt cctttgtcag atatgtggtt 36661 tgcaaatatt ttctcttagt ctgtaatttg tcttttcatt 
ctcttaacta ggtcttttac 36721 agagcaaaaa aaaattaatt ttgatgaagt tggattgatt tttttaggga tcatgccttt 36781 ggtgtcatgt ctaagaacac ttcatcaagc tctaagtctc acagattttc tcctatgtta 36841 tcttccagag gttttttgtt tttattttat tttacttttt aattaagaca gagtttcgct 36901 ctgttgccca ggctggagtg cagtggcagg atctcagctc actgcaacct ccgcctccca 36961 ggttcaagtg atcctcctgc cttggcctcc tgagtagctg ggactacagg cacgcaccac 37021 ccaactggct aattttttgc atttttatgt ggttgctttc gagaattttc tctgcctctt 37081 gtttttagaa gtttgactat gatgtgtctt ggcatgaatt tctttgggtt tatcctgttt 37141 gtgatgcgct cagcttcttg aatctgtaga catatgttgt gcgtgtgtgt tgtcaaatga 37201 agataatttg cagccattac ttcttggaat atttttttaa ggcccaccct ccttttcctc 37261 cccttctggg acttcaatga caaaatgtta gatcttcttt tattatagtt actcatgtcc 37321 ttgaggcttt gttccatttt ctttcatcta ttttctttcc attgttcaga atgggtaatt 37381 tctattgtta tacgttttgt ttcactgctt ctttcttcta tactctccat tctgctattg 37441 agcccatcca ttatggtttt tatttcagct cttgtatttt ttagttgtat tttttagttc 37501 taaagtttcc atttggttct tctttatatt ttctatttct ttgctgatac tttttattat 37561 tttatttgtt tcaagcacat ttataatttc tcagtgaaac acttttatga tggctgctcc 37621 aaaatctttg tcagataatt ctaacatctg tgtcgtcttg atgtttgtgt ctgttaatta 37681 tcttttgtta ttcagtttga gaatgcctga ttcctagtat gacaagtgat ttgccattga 37741 agtcaggata ttctgggcac tgtgttataa gattctggat cttctttaaa tattttgttt 37801 tagtgggctt ccctttatac tgctccagcg aaataaagta gggcactgcc tgattgcttc 37861 cagacaggga taaaagtcca ggttctccat ccagcctttc ctgaacctgg ggggttggga 37921 gagatgaggc ctccatgtta ttgtttggta agggtagaag ttcaggttcc ccactaggaa 37981 tccatgacaa ctccctggct gggagtggga gggatgcttc attactgctc cctatgtggc 38041 tttcactgac actgtgtgtg gcagagagag gggggtgctt gttaccactg agtggtggcg 38101 aaagtcccaa ctctccacca ggcccccttt gacactactc caatggggcc gggtggaatg 38161 cctcatttct gcattggggt ggcagtggat gtccaggctg cccacatggt ttccaccaac 38221 accacatcgg gggagggtga tcccagtggg gaggaatgtc ccagctccaa ctctagcatc 38281 actctggctg agtatgggag tggcctaggg ttctttgtta cagcctggtg aaggtgaatg 38341 tctaggtttt ccatccggcc 
tttgctgaca agggtgatga tggaacttgg atggagcgga 38401 gcaggtatgg actaaaagag ctctgtcttt ctaggatgcc cctttcctcg ttctttggct 38461 agaaaagcat agttttgcta gaacttgttc tgttgggact ttttcttgtt tgtgcccctt 38521 ggcattttca gttgttggct tctccagtat tcagcctgag atatatgagg caaagagaaa 38581 tcccagggaa ttcaccacca ttccttaagt cccgaggttc ctaaccagtc tgcttctctc 38641 catctttcag tcttcttatg cttgtcttgt cacgttcaag gttttcagtt atacttggca 38701 gaataaataa atacatctac tccatctttc cagaacatca ctgctttttg ttcattttat 38761 ttatttatta tttactgaga caaggtctta ctttgttgcc tgggctggag tgcagtggtg 38821 ctatcatggc tgactgaagc cttgaactcc caggctcaag cgatcctcct gcctcagcct 38881 cccaagtagt ttggaccaca ggcatgggcc accatgccca gctacttttc attttttgct 38941 tttgttgact agccccacag agttttgctg tgttgttcag gttggtctca aacttctggc 39001 cttaagcaat cctcttgcct tgctctccca aagcgtatta caggcataag tcaccacacc 39061 cagccctatc ggtgcttttg gaagaaggtg ttcttccctc tctctgtttc cctccctatg 39121 gccagctgcc catagagggc agtggtgctg gagtgaagac ctctcagcgc atccctcttc 39181 atctgtagga tgtgcatggc ttccaatgct gtctgttctg ctcctgagac actcgctggc 39241 tacaggatta taaaggtgat gaggtttcag cctaaagact ctccattcaa taacacttcc 39301 tacaagatat cttttttgtt tttttgagac aggatc // LOCUS AC002366 259202 bp DNA PRI 02-JAN-1998 DEFINITION Human Xp22 BAC CT-285I15 (from CalTech/Research Genetics) , PAC RPCI1-27C22 (from Roswell Park Cancer Center), and Cosmid U35B5 (from Lawrence Livermore), complete sequence. ACCESSION AC002366 U79549 U70036 NID g2739349 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 259202) AUTHORS Muzny,D., Ansari-Lari,M.A., Timms,K.M., Yu,W., Dugan,S., Lu,J., Shen,Y., Rowland,K., Liu,W., Perez,L., Ding,Y., Haywood,M., Jain,A., Leal,B., Logan,O., Nguyen,V., Savage,L., Shen,H., Worley,K., Chen,E., Forcum,J., Arenson,A.D., Chiu,M.W., Gorrell,J.H., Brundage,E., Di,W., Chinault,C., Nelson,D. and Gibbs,R.A. 
TITLE Direct Submission JOURNAL Unpublished REFERENCE 2 (bases 1 to 259202) AUTHORS Chiu,M.W. TITLE Direct Submission JOURNAL Submitted (23-JUL-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 3 (bases 1 to 259202) AUTHORS Chiu,M.W. TITLE Direct Submission JOURNAL Submitted (02-JAN-1998) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA COMMENT Sequencing is completed to a minimum standard of double strand coverage with a minimum of 2 clones and 2 reads with no ambiguities or 2 chemistries with a minimum of 2 clones and 3 reads with no ambiguities. If the sequence quality does not meet this standard, it will be indicated in the annotation. The repeat regions shown were identified using RepeatMasker by Adrian Smit. Sequence similarities were identified using Powerblast by Jinghui Zhang. Exon/Intron boundaries of identified genes were chosen if there were canonical splice junctions that maintained sequence continuity across the splice junctions. 
FEATURES Location/Qualifiers source 1..259202 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="BAC CT-285I15, RPTCI1-27C22, U35B5" /chromosome="X" /map="Xp22" repeat_region 35..127 /rpt_family="MIR" repeat_region complement(1087..1502) /rpt_family="L1" repeat_region complement(5292..5579) /rpt_family="Alu" repeat_region 5302..5553 /rpt_family="SVA" repeat_region complement(6009..6063) /rpt_family="L1MC2" repeat_region complement(7092..7382) /rpt_family="L1MA5" repeat_region 7423..7721 /rpt_family="Alu" repeat_region 9771..10072 /rpt_family="L1MC2" repeat_region complement(10072..11146) /rpt_family="THR" repeat_region complement(11156..11528) /rpt_family="THE1b" repeat_region complement(12944..13151) /rpt_family="MER20" repeat_region 13862..14010 /rpt_family="MLT1f" repeat_region 14063..14352 /rpt_family="MER42a" repeat_region 15847..16170 /rpt_family="Alu" repeat_region complement(18743..19117) /rpt_family="MER7" repeat_region 23507..23614 /rpt_family="L1" repeat_region 23615..24425 /rpt_family="L1PA11" repeat_region complement(25475..25968) /rpt_family="MLT1d" STS 27234..27683 /note="similar to EST with GenBank Accession Number H48827" /db_xref="dbSTS:G28372" repeat_region complement(28186..28366) /rpt_family="MER20" repeat_region 28715..28995 /rpt_family="Alu" repeat_region complement(30217..30408) /rpt_family="L1MA5" repeat_region 31199..31364 /rpt_family="Alu" repeat_region complement(33939..34460) /rpt_family="L1MA2" repeat_region 34952..35384 /rpt_family="MLT1d" repeat_region 36035..36126 /rpt_family="MIR2" repeat_region 36850..37035 /rpt_family="MER20" STS 38042..38250 /db_xref="dbSTS:L47508" repeat_region 38122..38510 /rpt_family="MSTa" repeat_region 38797..40103 /rpt_family="MIR" repeat_region complement(42465..42539) /rpt_family="MIR" repeat_region 47139..47335 /rpt_family="MER3" repeat_region complement(48993..49166) /rpt_family="MIR" repeat_region complement(49718..49918) /rpt_family="L1PA11" repeat_region complement(51943..52505) 
/rpt_family="L1MB3" repeat_region 53226..53677 /rpt_family="MLT1d" repeat_region complement(54846..54966) /rpt_family="MIR" repeat_region 55040..55226 /rpt_family="MER20" repeat_region 55691..55837 /rpt_family="Alu" repeat_region 56051..56486 /rpt_family="L1ME3a" repeat_region 58557..59030 /rpt_family="MLT1d" repeat_region complement(59270..59428) /rpt_family="MER5" repeat_region complement(59847..60030) /rpt_family="MER20" repeat_region complement(61290..61384) /rpt_family="L1PB1" repeat_region 61683..61811 /rpt_family="MER20" repeat_region 61907..62113 /rpt_family="L1MB7" repeat_region complement(63588..63969) /rpt_family="L1MA10" repeat_region 64800..65073 /rpt_family="Alu" repeat_region complement(67157..67209) /rpt_family="MIR" gene 67637..75845 /gene="HAMELX" promoter 67637..68496 /gene="HAMELX" /note="similar to bovine amelogenin promoter" mRNA join(68495..68552,69861..69926,71863..71910,73320..73361, 73632..74057,75691..75845) /gene="HAMELX" /product="amelogenin" CDS join(69873..69926,71863..71910,73320..73361,73632..74057, 75691..75696) /gene="HAMELX" /note="similar to amelogenin encoded by GenBank Accession Number M86932" /codon_start=1 /product="amelogenin" /db_xref="PID:g2739350" /translation="MGTWILFACLLGAAFAMPLPPHPGHPGYINFSYEVLTPLKWYQS IRPPYPSYGYEPMGGWLHHQIIPVLSQQHPPTHTLQPHHHIPVVPAQQPVIPQQPMMP VPGQHSMTPIQHHQPNLPPPAQQPYQPQPVQPQPHQPMQPQPPVHPMQPLPPQPPLPP MFPMQPLPPMLPDLTLEAWPSTDKTKREEVD" repeat_region complement(77161..77331) /rpt_family="MER3" repeat_region complement(79184..79473) /rpt_family="Alu" repeat_region complement(79881..80169) /rpt_family="Alu" repeat_region 80196..80253 /rpt_family="MER20" repeat_region 80384..80459 /rpt_family="L1PB3" repeat_region complement(81544..81633) /rpt_family="MIR" repeat_region 84789..84957 /rpt_family="L1MA2" repeat_region 84978..85252 /rpt_family="Alu" repeat_region 85294..85692 /rpt_family="L1MB3" repeat_region complement(88147..88282) /rpt_family="MIR" repeat_region complement(89652..89741) /rpt_family="L1PA11" 
repeat_region 89745..90029 /rpt_family="L1PA11" repeat_region complement(90283..90596) /rpt_family="MER1A" repeat_region 90849..91266 /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(91514..92000) /note="L1MA8" /rpt_family="LINE/L1" repeat_region complement(92292..92491) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(93351..93401) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 93694..93772 /note="MADE1" /rpt_family="DNA/Mariner" repeat_region complement(94209..94346) /note="(TAGA)n" /rpt_family="Simple_repeat" repeat_region complement(94400..94585) /note="MER5A" /rpt_family="DNA/MER1_type" repeat_region 94756..94794 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 95851..96003 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region complement(96505..96535) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 96897..96930 /note="L1MA3" /rpt_family="LINE/L1" repeat_region complement(97232..97438) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(97447..97704) /note="AluSq" /rpt_family="SINE/Alu" repeat_region 98322..98665 /note="HSMAR2" /rpt_family="DNA/Mariner" repeat_region 98666..98969 /note="AluSq" /rpt_family="SINE/Alu" repeat_region 98976..99906 /note="HSMAR2" /rpt_family="DNA/Mariner" repeat_region complement(99946..100229) /note="AluYb8" /rpt_family="SINE/Alu" repeat_region complement(100724..100765) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 100856..101071 /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(101209..101509) /note="AluSp" /rpt_family="SINE/Alu" repeat_region 101946..101985 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 103509..103601 /note="MIR" /rpt_family="SINE/MIR" repeat_region 104700..104734 /note="MER7B" /rpt_family="DNA/MER2_type" repeat_region 104827..105087 /note="MER7A" /rpt_family="DNA/MER2_type" repeat_region 105120..105163 /note="(CAA)n" /rpt_family="Simple_repeat" repeat_region 105231..105514 /note="L1MA7" 
/rpt_family="LINE/L1" repeat_region complement(105660..105725) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(105675..105843) /note="MIR" /rpt_family="SINE/MIR" repeat_region 105962..105989 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(106006..106641) /note="MER39b_j" /rpt_family="Other/MER21_gro" repeat_region complement(107669..108074) /note="L1MD2" /rpt_family="LINE/L1" repeat_region 108257..108628 /note="THE1C" /rpt_family="LTR/MaLR" repeat_region complement(110316..110743) /note="L1PA7" /rpt_family="LINE/L1" repeat_region 111211..111254 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 112256..112370 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region complement(112841..113286) /note="MSTB" /rpt_family="LTR/MaLR" repeat_region 113753..114088 /note="LINE2" /rpt_family="LINE/L2" repeat_region 114139..114262 /note="LINE2" /rpt_family="LINE/L2" repeat_region 114657..114761 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 114762..115193 /note="LINE2" /rpt_family="LINE/L2" repeat_region 115380..115525 /note="LINE2" /rpt_family="LINE/L2" repeat_region 115531..115825 /note="AluY" /rpt_family="SINE/Alu" repeat_region 115945..116029 /note="LINE2" /rpt_family="LINE/L2" repeat_region 116066..116280 /note="L1ME1" /rpt_family="LINE/L1" repeat_region 118654..118839 /note="MER5A" /rpt_family="DNA/MER1_type" repeat_region 119090..119110 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(119211..119412) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(119427..119457) /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region 119474..119781 /note="AluSq" /rpt_family="SINE/Alu" repeat_region 119782..119814 /note="(GAAAA)n" /rpt_family="Simple_repeat" repeat_region 119823..119924 /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(121052..121181) /note="FLAM_A" /rpt_family="SINE/Alu" repeat_region complement(121281..121499) /note="MER58A" /rpt_family="DNA/MER1_type" repeat_region 
121541..121811 /note="AluJb" /rpt_family="SINE/Alu" repeat_region 121812..121889 /note="(GAAA)n" /rpt_family="Simple_repeat" repeat_region 122006..122041 /note="(CAAA)n" /rpt_family="Simple_repeat" repeat_region complement(123732..123994) /note="AluJo" /rpt_family="SINE/Alu" repeat_region 124412..124618 /note="LINE2" /rpt_family="LINE/L2" repeat_region 124979..125108 /note="MIR" /rpt_family="SINE/MIR" repeat_region 125190..125236 /note="MIR" /rpt_family="SINE/MIR" repeat_region 126132..126184 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region complement(126340..126385) /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(127902..128024) /note="(TA)n" /rpt_family="Simple_repeat" repeat_region complement(128634..128697) /note="(CA)n" /rpt_family="Simple_repeat" repeat_region 129269..129348 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(129351..129412) /note="(GA)n" /rpt_family="Simple_repeat" repeat_region 129413..129442 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region 129443..129515 /note="(CAT)n" /rpt_family="Simple_repeat" repeat_region complement(130012..130126) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 131259..131515 /note="L1PA8" /rpt_family="LINE/L1" repeat_region 131721..131761 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 131814..131964 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(132968..133512) /note="L1MB7" /rpt_family="LINE/L1" repeat_region complement(133517..133561) /note="(GAAA)n" /rpt_family="Simple_repeat" repeat_region complement(133562..133865) /note="AluSp" /rpt_family="SINE/Alu" repeat_region 134686..137741 /note="L1PA2" /rpt_family="LINE/L1" repeat_region 138837..139447 /note="L1PA11" /rpt_family="LINE/L1" repeat_region 139470..139531 /note="MIR" /rpt_family="SINE/MIR" repeat_region 140816..140909 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(140934..141272) /note="MLT1A2" /rpt_family="LTR/MaLR" repeat_region complement(141412..141452) 
/note="(TGAA)n" /rpt_family="Simple_repeat" repeat_region complement(141979..142359) /note="LINE2" /rpt_family="LINE/L2" repeat_region 143821..143907 /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(144248..144282) /note="(CAAA)n" /rpt_family="Simple_repeat" repeat_region complement(144454..144491) /note="MER5A" /rpt_family="DNA/MER1_type" repeat_region 144545..144788 /note="MIR" /rpt_family="SINE/MIR" repeat_region 145978..146008 /note="(CAA)n" /rpt_family="Simple_repeat" repeat_region complement(146335..146533) /note="MER58A" /rpt_family="DNA/MER1_type" repeat_region 147580..147875 /note="AluY" /rpt_family="SINE/Alu" repeat_region 148121..148163 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 148164..148218 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region complement(149495..149552) /note="MER3" /rpt_family="DNA/MER1_type" repeat_region 149919..150186 /note="LINE2" /rpt_family="LINE/L2" repeat_region 151054..151196 /note="(GAAAA)n" /rpt_family="Simple_repeat" repeat_region 151265..151759 /note="MLT1D" /rpt_family="LTR/MaLR" repeat_region 152354..152665 /note="L1MB8" /rpt_family="LINE/L1" repeat_region complement(153308..153440) /note="L1MB6" /rpt_family="LINE/L1" repeat_region complement(155604..155789) /note="MER5A" /rpt_family="DNA/MER1_type" repeat_region complement(155802..156000) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(157114..157210) /note="(CAT)n" /rpt_family="Simple_repeat" repeat_region 157404..157474 /note="(TAAA)n" /rpt_family="Simple_repeat" repeat_region 159283..159510 /note="L1MC4" /rpt_family="LINE/L1" repeat_region 159739..159908 /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(159919..160024) /note="MER81" /rpt_family="DNA/Other" repeat_region 161837..161910 /note="(GAAAA)n" /rpt_family="Simple_repeat" repeat_region 162284..162452 /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(162646..163856) /note="L1PA2" /rpt_family="LINE/L1" repeat_region 
164098..164309 /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(165085..165455) /note="L1PA2" /rpt_family="LINE/L1" repeat_region 165457..166551 /note="L1PA2" /rpt_family="LINE/L1" repeat_region 166553..166585 /note="(GAAAA)n" /rpt_family="Simple_repeat" repeat_region 166761..166824 /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(166859..166955) /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(167005..167050) /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region 167310..167420 /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(167698..167840) /note="MIR" /rpt_family="SINE/MIR" repeat_region 168162..168340 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(168456..168500) /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(168673..168889) /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(168897..168942) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 169301..169635 /note="L1MB8" /rpt_family="LINE/L1" repeat_region 169951..170112 /note="MER45" /rpt_family="DNA/MER1_type" repeat_region 170492..170691 /note="MER63B" /rpt_family="DNA/MER1_type?" 
repeat_region 171023..171056 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(171600..171653) /note="LINE2" /rpt_family="LINE/L2" repeat_region 172019..172056 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region 172460..172571 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region 172682..172989 /note="AluJb" /rpt_family="SINE/Alu" repeat_region 173181..173345 /note="MIR" /rpt_family="SINE/MIR" repeat_region 173347..173371 /note="POLY_A" /rpt_family="Simple_repeat" repeat_region 173732..173789 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 174439..174461 /note="POLY_A" /rpt_family="Simple_repeat" repeat_region 174645..174700 /note="MADE1" /rpt_family="DNA/Mariner" repeat_region 174701..175324 /note="L1MB7" /rpt_family="LINE/L1" repeat_region complement(176489..176726) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(177175..177245) /note="MIR" /rpt_family="SINE/MIR" repeat_region 178413..178710 /note="MER33" /rpt_family="DNA/MER1_type" repeat_region 179791..179811 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 180855..180995 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(181002..181282) /note="AluJb" /rpt_family="SINE/Alu" repeat_region 181556..181610 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region complement(182057..182872) /note="L1MB8" /rpt_family="LINE/L1" repeat_region 183976..184044 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region 184076..184263 /note="LINE2" /rpt_family="LINE/L2" repeat_region 184844..184910 /note="L1MB5" /rpt_family="LINE/L1" repeat_region 184911..185029 /note="FLAM_A" /rpt_family="SINE/Alu" repeat_region 185037..185734 /note="L1MB5" /rpt_family="LINE/L1" repeat_region 186268..186397 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region complement(186433..186556) /note="L1MB6" /rpt_family="LINE/L1" repeat_region complement(186886..187092) /note="L1MB6" /rpt_family="LINE/L1" repeat_region 187200..187226 /note="AT_rich" 
/rpt_family="Low_complexity" repeat_region 189229..189567 /note="MER58B" /rpt_family="DNA/MER1_type" repeat_region 189973..190247 /note="L1MA7" /rpt_family="LINE/L1" repeat_region 190946..191077 /note="MLT1F" /rpt_family="LTR/MaLR" repeat_region 192129..192209 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region 192375..192772 /note="L1MD2" /rpt_family="LINE/L1" repeat_region 192823..193069 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(193614..193688) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 195443..195468 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(196367..196545) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(197054..197637) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(198716..198839) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(199820..199894) /note="LINE2" /rpt_family="LINE/L2" repeat_region 200112..200161 /note="LINE2" /rpt_family="LINE/L2" repeat_region 200804..200851 /note="AT_rich" /rpt_family="Low_complexity" repeat_region 201434..201459 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(201796..201833) /note="(GAAA)n" /rpt_family="Simple_repeat" repeat_region 201888..202005 /note="LINE2" /rpt_family="LINE/L2" repeat_region 202867..202959 /note="(GGA)n" /rpt_family="Simple_repeat" repeat_region 203990..204417 /note="L1MB7" /rpt_family="LINE/L1" repeat_region 204438..204501 /note="MER3" /rpt_family="DNA/MER1_type" repeat_region 204587..204622 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(205834..206018) /note="L1PB3" /rpt_family="LINE/L1" repeat_region complement(206019..206060) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(206227..206267) /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(206985..207207) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(207244..207624) /note="L1ME3A" /rpt_family="LINE/L1" repeat_region 209092..209307 /note="MER20" 
/rpt_family="DNA/MER1_type" repeat_region 210089..210314 /note="MER20" /rpt_family="DNA/MER1_type" repeat_region 210535..210631 /note="MER5A" /rpt_family="DNA/MER1_type" repeat_region complement(210735..210809) /note="MIR" /rpt_family="SINE/MIR" repeat_region 211595..212137 /note="L1MB7" /rpt_family="LINE/L1" repeat_region 212143..212201 /note="(GGAA)n" /rpt_family="Simple_repeat" repeat_region complement(212542..212603) /note="LTR16B" /rpt_family="LTR/Retroviral" repeat_region 213561..213617 /note="MIR" /rpt_family="SINE/MIR" repeat_region 214210..214240 /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(214328..214585) /note="MER58B" /rpt_family="DNA/MER1_type" repeat_region complement(216998..217060) /note="(TA)n" /rpt_family="Simple_repeat" repeat_region complement(218059..218730) /note="L1ME3" /rpt_family="LINE/L1" repeat_region 218733..219031 /note="AluY" /rpt_family="SINE/Alu" repeat_region complement(219054..219171) /note="L1ME3" /rpt_family="LINE/L1" repeat_region 219176..219378 /note="MER58A" /rpt_family="DNA/MER1_type" repeat_region 220729..220936 /note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(222501..222718) /note="MER58A" /rpt_family="DNA/MER1_type" repeat_region complement(223728..223814) /note="MER5B" /rpt_family="DNA/MER1_type" repeat_region complement(225288..225327) /note="(CA)n" /rpt_family="Simple_repeat" repeat_region complement(225788..225994) /note="MER20" /rpt_family="DNA/MER1_type" repeat_region 226806..226955 /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(227160..227209) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(227230..227531) /note="AluSx" /rpt_family="SINE/Alu" repeat_region 228279..228418 /note="(TA)n" /rpt_family="Simple_repeat" repeat_region complement(229036..229343) /note="AluY" /rpt_family="SINE/Alu" repeat_region 230871..230949 /note="MIR" /rpt_family="SINE/MIR" repeat_region 231020..231411 /note="L1" /rpt_family="LINE/L1" repeat_region 233616..233750 
/note="MIR" /rpt_family="SINE/MIR" repeat_region complement(233770..234163) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(234266..234517) /note="MIR" /rpt_family="SINE/MIR" repeat_region 234597..234742 /note="L1ME1" /rpt_family="LINE/L1" repeat_region 235181..235640 /note="MLT1E" /rpt_family="LTR/MaLR" repeat_region complement(235673..235714) /note="LINE2" /rpt_family="LINE/L2" repeat_region 235821..235958 /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(236317..236477) /note="LINE2" /rpt_family="LINE/L2" repeat_region 236560..236649 /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(238422..238724) /note="AluJo" /rpt_family="SINE/Alu" repeat_region complement(239338..239363) /note="AT_rich" /rpt_family="Low_complexity" repeat_region complement(240362..240412) /note="MIR" /rpt_family="SINE/MIR" repeat_region complement(240452..240669) /note="MER33" /rpt_family="DNA/MER1_type" repeat_region complement(240812..240939) /note="MER3" /rpt_family="DNA/MER1_type" repeat_region complement(241009..241078) /note="(GGA)n" /rpt_family="Simple_repeat" repeat_region complement(241589..241883) /note="AluJo" /rpt_family="SINE/Alu" repeat_region 242242..242471 /note="AluJb" /rpt_family="SINE/Alu" repeat_region 242568..242781 /note="L1ME3A" /rpt_family="LINE/L1" repeat_region 242787..243082 /note="AluSx" /rpt_family="SINE/Alu" repeat_region 243083..243111 /note="(TAAA)n" /rpt_family="Simple_repeat" repeat_region 243112..243350 /note="L1ME3A" /rpt_family="LINE/L1" repeat_region 243891..244250 /note="MLT1A1" /rpt_family="LTR/MaLR" repeat_region complement(245838..245997) /note="MER45" /rpt_family="DNA/MER1_type" repeat_region 246164..246279 /note="L1MD3" /rpt_family="LINE/L1" repeat_region complement(246467..246545) /note="AT_rich" /rpt_family="Low_complexity" repeat_region 246646..246759 /note="L1" /rpt_family="LINE/L1" repeat_region complement(247016..247140) /note="LINE2" /rpt_family="LINE/L2" repeat_region complement(247188..247398) 
/note="MER20" /rpt_family="DNA/MER1_type" repeat_region complement(247401..247652) /note="LINE2" /rpt_family="LINE/L2" repeat_region 248395..248859 /note="MLT1C" /rpt_family="LTR/MaLR" repeat_region 249743..250196 /note="MSTB" /rpt_family="LTR/MaLR" repeat_region 250645..250695 /note="(CA)n" /rpt_family="Simple_repeat" repeat_region 251103..252208 /note="L1ME2" /rpt_family="LINE/L1" repeat_region 252248..253085 /note="L1ME2" /rpt_family="LINE/L1" repeat_region 253104..253636 /note="L1ME2" /rpt_family="LINE/L1" repeat_region 253638..253670 /note="MER3" /rpt_family="DNA/MER1_type" repeat_region complement(254623..254791) /note="MER45" /rpt_family="DNA/MER1_type" repeat_region 255210..255714 /note="L1MB7" /rpt_family="LINE/L1" repeat_region complement(256688..256868) /note="MER53" /rpt_family="DNA/Other" repeat_region 257125..257339 /note="MER20" /rpt_family="DNA/MER1_type" repeat_region 259043..259142 /note="MIR" /rpt_family="SINE/MIR" BASE COUNT 81699 a 51421 c 49221 g 76861 t ORIGIN 1 caactccagt ttgaccattg ctagcttgga aatatgggca aattatttat ctctgtgcct 61 cagttgcctt atctatgaaa ggagacatgg taagagtacc tacctcagag ttgtttgaag 121 gattaaaggg ttcttatgta atgggtttat ttaaaatttt caggatgcga tctcacccat 181 ggttgctctt tatcgttgct atcattgcta ttattattat tacgggagat gttgcagcca 241 ttgggaatga caaaatctag gggatgactg tgtgagtgag tgccacatag aggagaggag 301 aacactgttg ttggagatta ggtcaaggtg cagaaaggca atcattgttg gaaggttcat 361 ctatgtgcat actgaaatga ccaagtgttt aaaatcttag taaaaaagaa tgtttcttag 421 gtaagaaaga tctgtattag aaattaacta tctgtggtca acaaattgta ggaagggtca 481 tcaaatactc ttcatttgaa accaagaatt aggacataca tgttgaatat atataagaaa 541 gaatattctt tactccatga ctgatgagaa agattatgtt gttttagcat atgagtggtg 601 aacacaactg tttaaaattg gtcataaatg ttgtctaata gtgttcagaa gtattgattt 661 aaagagtatc catttaaagc taatgcagaa ttatgttttg taagtattaa aatttatccg 721 attttaatca ctttttagtt ttgtgcttct agagaatatt tcaagagcaa caaaagagac 781 ggcaacaaga tgggaaagga actgaaagag attgagccgc cacatgagca attcacaaag 841 gtctgttcta 
tttggtaaca caatacgttc ttataaaata caactcatga acaaaaatag 901 aagacagtct gcttgtttct gaatgtatgg aaagaccttt ttttttgatc agtttgactt 961 tattaatatt tgttccacag agctttacag gtatccttta actgacttct aggatttata 1021 gtttttactt ttttcaccaa acaacaaatt catttttata aaattcagtt ggccccgaga 1081 gtttcttttt ttttttaatt atcctttaag ttctagggta tatatgcaca aagtgcaggt 1141 ttgttacatt gtatacatgt gccatgttag tgtgctgcac ccgttaactc gtcatttaca 1201 ttaggtatat ctcccaatgc tatccctccc ccgaccccac aacagtcccc agagtgtgat 1261 gttccccttc ctgtgtccat gtgatctcat tgttcaattc ccacctatga gtgagaacat 1321 gcggtgtttg gttttcgtcc ttgcgatagt ttgctgagaa tgatggtttc cagcttcatc 1381 catgtcccta caaaggacat gaactcatcc ttttttttgg ctgcatggta ttccatggtg 1441 tatatatgcc acattttctt aatccagtct atcattggtg gacatttggg ttggttccaa 1501 gtggacagac cttttggtta tacattataa attacttacc attttcactt tgtattgttc 1561 cagatttaac ataattggta agtgtatttt taggattcat gaatagtttt gtgggagtta 1621 tttaaaacag gaaaaggtaa attggaaatt ttacctggaa atataaatct tgtgtataaa 1681 attgatactt tatgaataca ttaatgaaga tttcatattc agtttttagg agatcagcga 1741 cagatcttta gtggaatgca ggaagtccat aaatccttac tggatgtctt ttttacttgt 1801 cttgtttaaa cagttttagt taaataggaa gtaggcaagt gttgactact gagtatctct 1861 tttatttcct gctggaatgt aggagtgttt tatggcatat tccctttcaa aactattagt 1921 gactagtaat ggttttacat gaaaatttgt aatcataaga gatggttcag atgtataaca 1981 gaatgtgaaa gccagcagat tattaacact ttgtgaattg tgaattatct gctgtcatat 2041 agcatattag ctaatcgtta ttggacaatg acctcttctt catgtagatg tatattattt 2101 aggaaacaat gtaattttct tttgggaggt attcttttta ataaccttgt gaaaagacat 2161 gctcatatgt cactcctcta cactaactga gatgtgttaa ataaactagg aaaagcgttt 2221 gaaaataaaa ttttcaagca acttaaggat gatctgtaca caaacagtta tttacataga 2281 agtcacgaga agagaacatg ttgctataat gttcctcaga aactatttat taattggtat 2341 gaacattaaa ctagaaaagt tatcctgtca atgttaatgc tacatgttac cccttagggt 2401 agaaactttg gtatcatctt ggactcctct gagtccatta tccttcccat gcagttattg 2461 cgaggcacgt ttaccgctgt aatggtcctt taatttcttt ctttaaatcc ctactgaggc 2521 tgccctagtt caagacctca 
ttcttccttg cttgtattat taagctcttt tctgagttat 2581 ctcttctctt ctgagccatc cttcactgat gggcaatact atcttcctaa cactctcccc 2641 tttctaaaat gcttcctagc ccttccctct tcccagcctg tcagtggtgc ccatgtctca 2701 gttcttagat attcagagac tctccccttt acatgacccc ctcttaggct tgttttgcca 2761 ccctctgtct ttcctcccaa ctctccagat tgacagcctt ccctgaagga actcaccgtt 2821 actcccatgc tttgctcatg cccttccctc caaatgcttt taaaatcttt ctttttcttt 2881 tttctttttt tttttttgac tgattgaaat cttgtgccaa ctaagtcgac atcagtctcc 2941 atttttggct acatgtccga gtgagttgct cttccatagc agtaacacaa gttttgtatt 3001 actttttttt ttattcgtgc atttgctcat gtcacaggaa tacggaagag attgtgtact 3061 cattttttac gtcttcctca gtattaacat acaacatttt agggactatg aactgctgaa 3121 ctatttgttg agggactagt atatgtttca tacgtagtta tgttttcccc aacttgatta 3181 ggtcatatca cagaccattt tatcccttca atgaaaacca cgttatgtgg actattagtg 3241 ataacaaatg gtcactacct ggtgaaaaaa tatggtattc catgatataa gactgacagt 3301 ctttttctgt ctttggtcca tcttgtctgt aacatgttca tcaagccatt ttattcaagc 3361 aaactacatg actgttggac ctgctataat aatatttcac tgaaatctgg aattaaaaaa 3421 tttaagtatt tggagctcca tgtattgatt acagttttta aatttgaaat aaataaccag 3481 catcctttga atattaaaag ctgtttcctt tgacatctaa tttaccctgt ttaaaacctg 3541 agatgtaaag tgatggtttg tcagtgaact agactttttg ttttagagta ttgaggactt 3601 ggaagagtat aatgaaagcc ctctttttgg tgaagaaagt ggtgaagaaa gtcaacttaa 3661 aaaatacatc accgtgttcc aaaataaaat tgtgtcggag accataagta ctgcatattt 3721 aatcttaaaa actcaggatt gatgtaagca ctggaatatt ttatttgtag aaatgtatat 3781 tgcaattact taaatttctt tttttgagac aactgcagta agagttggca cttattcaga 3841 tgtctcttta accactgtta ttctgatggc tgatgaaaaa acttgaacct atgttatatg 3901 atacaagcat aacttgagct acactaaacc acatgacagt atttagttaa ttctcgtata 3961 aattagtgta tctcctctat cattagtttg tttgttaaat gccagacctc gttactgtga 4021 tctgttgaat gaaccctaga cagttctgtg agaaagcagc aattgcacag ttatatacat 4081 ggtcagttaa attttaacat taaagataat ttagttttag gtttggcctc tgtggatctg 4141 tatccgttcc aaaactgtag aaagcacagg catggaacag tacatttttc tctaataaaa 4201 gaagacaaaa cttattttat ctgtgcattc 
tttgtattta ttcattaggt taaagagaca 4261 attccaatga aagtatgact tcaaaaaata gatatactct atggcctttc aaaattatta 4321 aacttacctt aacatctgtg ttcttgaaaa aaattaaact actataaatg attgagacaa 4381 ctgagcagga tatataaatg tcagtgctga gccattggct tgtgggtatg tcatttccaa 4441 agacatcaaa atgccattta gtttgttttg gtccaccatt caatgatacc acccctcagc 4501 attgattcaa agtattttat taacatttat gtatgcttaa caattacttc acatctatgg 4561 caaggtaaat tatgtagggt gctttgtttt ttaagtttag tatttgtgga gtatgtggaa 4621 gttagtttaa gtatgcatgc atctaaaatc ttgtatgatg aactctctta cctttccttt 4681 ttaatgtgca gtttttgcac ataacataca aaatgttcat ttggaaacac tttagtaaat 4741 atgaaattca gaatatattt ggaataagtc agttaaatgt attcttttgc tgagtttctg 4801 aaattccatt catgaattgt tcagaacaca ataccctcat tgggaggaga ggagagactt 4861 actctcattt tcccttgttc ttttcatttt aattggcccc tatataccac tcttatcttt 4921 tcaaggtttt ccaagttaaa gaaaaaaaaa gttcattcag attctgtttt tctgcctaca 4981 gtcacttata caggaataag tgaatcctta tggggagcat ttaattctta aagcttgcat 5041 tttcataaag gattttcctt ttacccaaag catgtgttga tgtgtaagtc taagtactac 5101 ttcaagtttc taagaagatc cttatttgag atctctttca gtagttacag aggcaaaata 5161 agcactgcag aggtccatgt gagagttcta ttatacatac tgggctatct gactattcta 5221 cagtggaagg atgagaaaga caccacaagg tatttagtga tggaacagtt gatttatttt 5281 tatttttatt ctttttgaga aggagtcttt ctcactctgt caccaggctg gagtgcagtg 5341 gcatgatctt ggctcactgc aacctctgtc tcccgagttc aagcgattct cctccctcag 5401 cctcccgagt agctgggact acaggcacac gccaccacac ccagctaatt tttgtatttt 5461 tagtagagac ggggtttcag catgttggcc aggatggtct caatctcttg acctcgtgat 5521 ctgcctgcct tggcgtccca aagtgctggg attacaggca tgagccaccg cgcccggcca 5581 aaagttgata tttatgtcac cccttagtac ctcaaatgag taactgtgta cttgaagtca 5641 ctctggcaga tattctgaag ggaagaaagt ttttgacctt aaaagttatt aggagtgtgt 5701 gtgtgtgtgt gtgtgtgtgt gtgtataaat gtgtgtatat acgtatatat gtgtgtgtgt 5761 gtatgtatat atgcatatac atgtatgtat atagataata tgcatatata tatgcatata 5821 cgtatatgtg aatatatgca cacacacaca catatatatg actttgtttt ttagaattgt 5881 ttttagttca caacaaattg atctgaaggt ctagagattt 
ctcatttacc ccctgttctg 5941 ccaccatagc attccccact atccatgtac cccaccagag tggtgccttt gttgcactcc 6001 atgaaattac attgacacat cattatcacc ccaagtccat agattacttc agggttcact 6061 cttggttact ttttgtgtgt gtgtgactat tttcctcaca tctcttgtgc tatagcccct 6121 gttttatttt ttcattaaga taacgtttaa tttatatatt tgtgaccatg aaattttagg 6181 ttaggatatc agaaacaagt cgcactaaaa caaacgtaaa gatacacttt tctcaagaaa 6241 atggtaagac aggtaacttt aatttacata catatgcttg aaaaattgca gttagtgctt 6301 ttaaattcta gactgaagca cgtattttct tttattccat aataaacacc cgctttgaag 6361 aatggctgat atcaagagca tcatgtctcc taatattgtc tcactcagca gctgaatgtt 6421 aaaatgcaaa gtaaaagcac taaatatatc cttagagcag tgatgtactg aacaaaattc 6481 agaggtcata tgcaaagaca gtgcccaaga aattttgccc acagaccagt ggcaagtcct 6541 ttatttcagc aactgaaact gaagagaaac agtcttttgc ctccctgcct cttggagagg 6601 gtttctcctg atagcagagg gaaaggacca tctcatttcc ctgatgttaa aatttcatct 6661 tggagtccta ggcttatatc tttgtgaaga tatctaaatt aaggtgatcc agtcaaatca 6721 caaaatactt gactcactcc ttaggaggag cttaagtctg ccctgatttc tgtgcataca 6781 cctgatttat tgcactcaga gtacaaaagc tctgagatgt caaattgagc caggccgcac 6841 attgcattct tatccaggtg aatcagggct ctctcagtgc tgctcaccca aagcacattg 6901 aaataatttt tcttatttat ttgtaaaaca gtgatgatgg tatctgtgtc agagggttac 6961 tgtgtcacag gaaacaatga tggtgagaag gttagacata tggtacatgt tgaataaatg 7021 ttagctatta gttaccaata tcttcatgat ttttattgat atataatagt tgtaaatatg 7081 tatggagtaa atgtgatatt ttgattcatg aatatgtaaa gatcaaatca gagtagttgg 7141 gatattcatc acctcaaact tttatcaatt ttttgtgttg agaacatcgc aaatgtcttc 7201 taggtatttt gaaatacatg ataactacag ttattcgagt gtgctattga accctagaac 7261 ttatttcttc tctctaactg tatgtttgta tccattgtcc agcatttctt tatccccgtc 7321 aaccctgcta ctgtttccac cctctggtaa ccaccgttct actcactacc tccctgagat 7381 cagtgttgtc caggaatgaa aagcaagtaa atggtaaaga caggccgggt gtggtggctt 7441 acacctgtaa tcccagcact ttgggaggct gaggccggca gatcacctga ggccaagagt 7501 tcgagaccag cctggccaac atggtgaaac ctcatctcta ctaaaaatac acacaaaaaa 7561 aaaaaattag ctgggcttgg tggcgtgtgc ctgtagtccc agtgactcag 
ggaggctgag 7621 gcaggagaat cgcttgaacc caggaggtag aggttacagt gagccaagat catgccactg 7681 cactccagcc tgggcgacag agggagactc catctcaaaa aggaacaaaa aaataaaaaa 7741 taaaaaagat gaactagttg atagttaata atcatgactg gtctatttgt ctgctacact 7801 ttctttcctc tatgtgtggc aagtgcctac ttactctttt aagacttgtg tcaaagacca 7861 cctagttctt caccttttct ggaccctgtt agttctcaaa caaaattgtg ccccctttcc 7921 tctcttttag tggttctgta cctctcaaac cctgtattgt ggctgttatt tactgaactg 7981 ttaccctaga gtgtaaaaac tttatcatag tcctaatttt atttttggaa actggtatgt 8041 tcccagccat acaggaggtg cacaatacac acgtactaaa taaagtaatg gtcatttgaa 8101 agagtaaatg gtgaatacac agcaattctc caatctttgt gtttatgcaa acacaagaat 8161 taactttatt ggcctaattc aaacgttatc ttgaaattta ttcaaagttc ccattttcca 8221 tgaattaacc agacattttt tgttactaga ttttcactaa ttaaccagtt aaatctcttt 8281 tgtcttatat cattttaaaa gagcatgtgg aacaaattca aacaacttag gtagttcacg 8341 tctgtttagc tcactatttt aaggagtgaa gttcaccccg ccatttgcct acttgcctca 8401 gtactcacac aacaattgca ctgatgtcct gctaatgcgc attaaatcag atatccatac 8461 ctggctttga agtgtgccct attcaacaaa cagctcactt ggttgttatc ttccatctag 8521 tataaagata aaatatgtgc cagagaaaga tgtcttatag agattttcgt gatgtactag 8581 ttacccataa acaattaatt actggggcag acattcctac cttttagaag gagaaggctt 8641 gttcatagga gttgagagtg aagaaaatta attcttgttg aactgataaa tgtcccttct 8701 agatagttta cattgccatt aaagcataac gttgccattt tttctttatc tctcagctac 8761 ttatttcttg catgagctat tctggctttt aaaatcatat tttttggtca tttgccaagt 8821 gcttggctgt tgggtacaca tggaataaat tatccagtaa aaaccatctc agatctttcc 8881 agctattttt gatcaggtac acctgtaaat cttaaaaagc ctataaagaa atacctaact 8941 agtacattgg atacaagaca cacaaatctc acataggaca atggtaactg aagttctgaa 9001 tgttgactca acttaggaaa ttgcagtcct tagggggaaa atatccttaa acaaaggctt 9061 aaaattctac taggtggtca ccaccgttta tacacatatg tatttccctc tttttatttg 9121 ggaaactgat ggtcgtattt ccttttcagg gtggaagaag gaattctcca ttaagagaag 9181 ggaaggaaaa tataggaaaa ataaccactc tttgaaagca tctattgtcc aaatcatgag 9241 gaagagcttg tagttctgag tttcacaaac gtgatttagt tgtttttctg taggatcctg 
9301 cgtggggtca cctgtttcac gatagctgtg tcctcagacc cagtgtagga gaagggtaaa 9361 aaatgtgtta cttcactagt tcatacctct ctgagcctct cttgaaactg ctgcagggaa 9421 aatttctcca tctttgttct gtcacttgca tatttggagc ctttcctact gcccttgtta 9481 tgctgctttg ctttgtttta cagcatgcat tttttccatt gccaatgttt gtttgtattt 9541 ccagggtcta gcacatgact gtgctcaaat atcaagtgaa tgaatgaatg tatgaatgaa 9601 tgaacatgac tttcagatgt cctttatggc ttttctcatt gaagtatatg cttgttcttt 9661 acctcttcaa aagaggtgca actcattgag aatagggttg ccaaatgttg caaaaactaa 9721 gcatagtctt acaaaactaa acatagctga gccatacaat ctaacagttg cactccttgg 9781 taactaccca aatgaactga aaacttacgt ccacacaaaa acttgtatgc cagtgtttaa 9841 ccagctttat ctatcattgc aaaaatttgg aagcaactaa tatatccttc aatgggtgag 9901 tgcataaata aattatggta cattcataca atggaatact attcagaaat tataagaaat 9961 gatctagcaa gccacaaaat gatcaagagg aacttgaaat acatattgct aagtgaaaga 10021 agccaatctg aagaggctgc agcctgtatg attcctgcta tatagatgac atctggggtc 10081 tggaggactg tggccctctt ctcacagctc ctctagacag tgccccagtg gggactctct 10141 gtggggactt caaccccaca tttcccttcc acactgccct agcagaggtt tccatgaggg 10201 ctatgcccct gcagcagact tctgcctgga catccaggtg tttccataca ttctctaaaa 10261 tctaggtgga ggttcccaaa ctgcaattct tgacttctgt gtacccacgg gaccaacacc 10321 acatgtaagc tgccaaggct tggggcttgc accaagcaac agcctgagct ataccttggc 10381 cccttgtagc cacagctgga gctgaagcag ctggaactca gggcaccatg tcctaaggct 10441 gcatagagcc ggggggccct gggccaggcc caggaaacca tttttccctc ctaggccttt 10501 gggcctgtga tgggaggggc tgccgtgcag gtctctgaca tgccctggag acattttccc 10561 cattgtcttg gtggttagca tttggctcct tgttacttat gcaaatttct gcagccagct 10621 tgaatttctc cccagaaaat gggtttttct tttctattgc atcatcaggc tgcaaatttt 10681 cgaaactctt atgctttgct tcctcttgaa ggttttgctg cttagaagtt tcttccacca 10741 gataccctaa atcatctttc tcaagttcaa agtcccacag atctctaggg caggggcaaa 10801 atgctaccag tctgtttgct aaagcacagc aagagtcaac tttatttcag ttcccaacaa 10861 attcctcatc tccatctaag accacctcag cctaacttca ctgtccatat cattatcagc 10921 attttggtca aagccattca acaagtctct aggaagtttc aaactttcaa 
actttctaag 10981 tttcaaagtt tctgagccct ctgtctctag gaagttccaa actttcccag attttcctgt 11041 cttcttctga gccctccaaa tggctccaac ctttgcctat tactcagttc caaagttgct 11101 tccgcatttt cgggtatcct tatagcagca cccaactcta ctggtatcaa tttattgtat 11161 tagtgcattc tcaccctgct ataaggacat aactgagact gggtaattta taaatgaaag 11221 aggtttaatt gactcagttt cgcagggctg gggaggcctc aggaaactta caatcattgt 11281 ggaaggggaa gcaaacatgt cattcttcac atggcagcag caaggagaat tgcagagcga 11341 agtgggggat gccccttata aaaccatcag ctcgtgtgcg aactcactat tacgagaaca 11401 gcatgggggt aacagtcccc atgattcaat tacctcccac tgggtccctc ccaccacaca 11461 tgggtattat attcccataa ctacaattca agatgagatt tgggtaagga cacagccaaa 11521 ccatatcatg ctctaacctc taccttacgc tgtagtggag gaaatgtggt gcctatttca 11581 gggatgtgac agtaagagcc agatacatga actggcatgc ccaaggtaac atagagttag 11641 agagggaggg gaatgaaaaa ctaacaattt ccaagctcta aatatgcact ggcactggaa 11701 gaagtacttt tttttaatga tatgaaatat ttatatcata tacctaatac gtgtcaacac 11761 acgatccatt gggatagcat tttgatttat tcattttaca attgagagaa ctgaaaatca 11821 gagaattaac gcaaattctc tcaaaatcac agaactggta ggtggcaggg ctgacattca 11881 atgccaaatt caagttcttt ccaataagcg cagttctgtc ttgagaggtg tggctaagag 11941 cagaacccag atgcctatga ggagaagctg gaagtccttc tttaaggtca ctttgccttt 12001 gcaacatgag agatggttca gggctctcct atccaggagg tgtttgtaaa acagcggaca 12061 tccatttgtg caatgcattt agactctgcc ctcccagaag tctggatgat gggcaagaag 12121 cgctaaggct catttctcat ccaggcaatc atggggaatg agtgtaaaca gcctcaattc 12181 acaggtgcac ctgaatgtgg gatatggcat caatgtttaa gcccttttaa atttcttcac 12241 ttgcagagct caataaagat tacattactg gaaaggaaag gaaatcacct gcactgaaat 12301 atgacatgag ggtggaaaag aatctaaatg tctgctgggg acaaagacct cctctggtgc 12361 tgctacacac acaggcaaat ttccagcctc atgggtgcaa aatggccata tccagtcccc 12421 tactattcag gtggttttta gcccacacaa agttttataa aaagttaagt cctaatgcaa 12481 aatcttggaa agttcacata catatctaga gttccagcct cttgaggaaa gtggaagagc 12541 tggcaacact gggtcccaca ttcctcacat gtgccccaag gcagcatagg cacccaaccc 12601 tgggccactg tacccactta cattacctgg 
tatgcctcag agtttgtgaa cctaccctag 12661 cctaaaagga aggtcctagc ttattcaaag ctttgaaaaa aatcctgctg agaaaagact 12721 gaaggaatgc tttttggcac catctcctgt ttcattagcc acagctgcct tactatgtat 12781 ttggaccaat aaaattgact ctcggagaaa gcaaaccaat gaaattcata ggcaatgagg 12841 agagctcaat gaaaaagtga ggggaaaatg atttaaaaag atacacttgg gtggcagagt 12901 agggaagtgg ccctttagaa actgcatata atctgtgcca ggatttctca gcctaggcac 12961 tagcgatatt tggggccaga taacttttca gtggagaata gtcctgtgct tcgtaggatg 13021 tttagcagca ttccctggcc tttatctact agatgccaga gcaccccacc actgttgtga 13081 ccacacacga tgtctctgga catcaccaaa tgtcctccag ggactaaact tactccagtt 13141 gagaaccact ggtctatgta taactcaact tagagagcag ctatgaaccc aaatcatgta 13201 aatgtgaaca aaacattcac attttcccac ttgtcacaat gctcaagtta tactttatct 13261 cagtttctgt tttaaacaga acaatgtaaa agtgtttaca aattatgaag cctttatgta 13321 tcacactgac tttgggacac aatagcaatt ttctttaatt tacgaagaaa tgggagtctg 13381 catataatta tctttctacc aaatttctag gactacagat gggggaataa gccaagtttc 13441 ttgaggaaat agccttgtat ttctttatat gaactgtttt tttctccttg ggtggggcag 13501 gatcaggtat ggaacaccct ctgatagtca acagttgtat gaatggatgt ttagcattta 13561 gagaagactc aggagacatt gtggttcttt ccttttattt agaggactaa acatggtgga 13621 tgaagagact tactggccag ccctccatgg gccagaatga aaattcagga agatatcata 13681 ggaagacagt ctagccagca aatggaacca gtggaatgag ctgcctcaaa acaaagcatc 13741 attcattgtg gcttccattt ggttgtattc tgtcttgcac ccctccactt tgtgagaaac 13801 caggtcagga gcagccctat gcaaaaagct aaggggtaag tctccagctg aaagctatac 13861 tagtgagcct gggaactgct cctccagccc caatcaagcc ctaagatgac tgcagcccca 13921 tctttgcttg gctgcaactt cacaggagcc cctgaaccag aacacccaag ctaagctatt 13981 cacagaagct gttttacaag ataaatgttt gttatttgaa aaataaaaat aaaaagataa 14041 gaaaagatgc ccagttaaat gcaatgtggt atcctgaata ggattctgga agagacaaat 14101 aacattagtg tggaaactgg tgaaatctga gtgaagtcta tagttttgtt gatagcaatg 14161 caccaatgtt ggtttcttag ctgtgataaa tgtaccaagg taatacattt atggtattaa 14221 tgttaccata atagcattga ggttatgtta ttatgttaac agggcaaact gggtaagggg 14281 cataagggaa 
ctctatacta tcttcataac tctctgtaat tctaaaatta ttccaaaatt 14341 aaaaaaaaat aagtcctttc tcccaggcac cagatgtatt caagcaaagg ttggaaactc 14401 acttatcaag catgtgagaa aatggaggcc tgtattgctc actgaagtgc tccacagaac 14461 tgaaagctca gtttagtgac aagaagggag gtgactcaca agccaggctg aaactagtga 14521 aataatctcc tatttgcatc tccttgatct gaaagaaaac atgtgcctaa gctgacatga 14581 aacatctatt atgtttgcca ttctggtgga gcctgtttgc tggaaaggga gtggacagct 14641 aacaagatgg accagatttg tttttatggg gctggtgatt ttttcatagg tggcttatta 14701 ggttgttgta agattctaaa tcttaggttc tctaaatcct ttttctgctt aagttatcca 14761 gacagtattt ttcatttgca ttcatgaaca ccgattattt gattagtaca gaaatctatt 14821 tttaagctcc aatgcctatt ctgtaaacca aaatattgtt tttaaaataa aatataacat 14881 ctctttacat atatatgtac atataaatgt gaattgagtt atatgatctg tggatatcct 14941 gtaaaaaggt ttagtaatac catattttgt cctttattaa aactaaaatt cattatttca 15001 tctcattatt tctcactttg agggcttttc tttgtggctt tcaagccaca gacctcccaa 15061 aaaataaggc aagagaaaga ctgaaccaga tattttattt ccagtttgga acaggaggtt 15121 tgttaaagaa gtaacaatat tattaatgac ccctggcact tgggagccat tcattgtgcc 15181 tgtactccat gtttggtgct ttctaatgat taatcccagc aacataagtt aaattggccc 15241 agtaagttta cataggaaaa caggtgttac ctgaggacaa gtcaggctgg acaaaggatc 15301 cacttactgc cattgccacc ccgagtggaa tgaattcaaa attgctcctt ctactttgag 15361 ttgcctgctt catattcaaa gtctatccca agaaatggaa tggatagcag ctctgataga 15421 aaagaggaat agtaatctag gaaagcgaaa ccagtgggac caggaaaggg aaaacagtgg 15481 gaccaatgca atgctctggt ttaagagtta acctgggatg aatcccagag atttaggaag 15541 ggagccaaaa cagggatctc ttccctattc ttaagcaagt attctctgcc cagggatgtt 15601 gtaccataga acatgatgac tattcttggc ttgggaatat tatttacaat ataatagtta 15661 tctgtttaaa tgtcatcttc catacactga tctatccatc cacccatcca ccctttgatt 15721 gatcaatagg tgagattctg agctaggtgc tatgaaccca gatgaatgaa tgctatctct 15781 gcctttgaaa aattcaaaat tcagtgctag agatagacat gtggaaacct aattaaagca 15841 aaatgaggcc gggcatggtg gctcgtatct gtaatcccag cactttggga caccgaggca 15901 ggtggatccc ttgagctaag aagtttgaga ccatcctggg caacgtgctg aaatcctgtc 
15961 tgtactaaaa atacaaaaat tagctgggca tggtggcacg tgcctgtagt cccagctact 16021 gggtggggtg gggtggggtg gggggctgag gcagggggat cgcctgagcc ttgggaggtt 16081 gaggctgcag tgagctatga tcgtgtcact gcactccagc cggagcgaca gagtagtgtc 16141 actgcactcc agcctgggcg acaaaaaaaa aaaaaaaagc aatatgaaga aatgcttcag 16201 tgaggagaag acatttacta ggtgccagga caaaagagag aagactgctt ccagttctgc 16261 ttgcaaaggc taatgtttga gttcaacctt aagggatgaa gttatccctc tctggggtct 16321 ttgaattttg gtgggggttt ccaccagctt agtttaggac atccttcctt tctgattacc 16381 ctatcccttt taatgtttga ttccttataa tccaaccctt taccaactaa gcattctatt 16441 ggtgaaaaag atgagaaaat tgcaaattca ggagtggatt gctagtttct taagtgggaa 16501 ccagggttca agagagggga aaataataat tgtggaagaa aacaatctat atggaaaata 16561 atggaaaacg tattcaagcc caaactggtc ccatcattca ggcaatgctg gatacgtgtg 16621 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgcg cgtgtgtcat cttcaagaaa 16681 tctttcatac cctagattgt tggttattta accaaactca tgttatcaaa gtttttgctt 16741 attaacttat acttaacagt tcttccacta actttaattg aaatagacaa aggcagatag 16801 gaacagggtg attggttcac ttttgttggt ataagtccca gtgaattaaa cactttcttt 16861 ttcctgctag cactttgtgg gctaggattt ccaaaatctc tttccaaagt gtctatgttt 16921 ccctggagaa gaaaatgttt tatggtattc ttaagcaata gcgatgcctc tccttagggc 16981 tgaggctccc caaagtactt gtgaaatgta ctaatcccca cagtttacat gttgaagtgg 17041 tggtgaaatt atcaaaggtg tggtcagaaa ataagaacat tctgtgagtc tctcactgct 17101 gtagagaaag tcgactcttg gaggtcacca tgccttcagt taaacactaa gaagaaacct 17161 attctgtttt gtgtaaatta ttctggcagg gtgattagtt ctaaaatttt tctatatcat 17221 cttcaagagg agaaaaaaaa cttctggatc ctcgaacctc aaagcaagtg gttttctatt 17281 acagaaaaat cattaagata tttgatttgt tgaacacgtc tccttgttcc gtcaaaacat 17341 gaggtaggga agctgatctc atggtttaga tgccatagct ttaaaggctc aactatggtt 17401 tttccttaac taggtgttct ttgtaatcac tgcttgctgt aagaactgct aatgatcaaa 17461 ttatgcccta gaaggatctt ttacttgctt tcatttttcc acagtcaagt tatatgattt 17521 tttttaaaga agattttttt ttcaaagctt tcagcacatg tctcaaaggg gcagctctgc 17581 tgctgacttt tcatgatcta ggagctagta tgtccttcca 
ggctcctgaa tttgtatctt 17641 tcattgaaca gtccttaaat ggaggacaat acttttagtt tagtctaggt aggcactgta 17701 tttgtagaga caggagtttc caaatatcgg ggcagggaaa atggcttttg ccaagtcatt 17761 tttttaagct acatttcatt cctggagtgt aaatgaaagg aagtaggcag gagaacaaaa 17821 atacaaaaat catttttccc taataaatat tctaggtggt taactcaaga aaattaaaac 17881 ccaaattttg ggagtcttat cattacttgt taaagtcttg gaaatgccta gaatttcctg 17941 ccagttgctc ccatgaaaac agctctctct gattcagtac ctgaaaagat gatgcaatta 18001 aatgaaacta ttatactacc tatccttttc ttatccattc catgtttttg attatccttt 18061 gagccattca aaacaattcc atatgctatg gaaacactat atacataaat gtgctatgat 18121 tgaattttta atgtatctca tttaattttt caaatattta ttgtgctgta ggtatgtata 18181 aggcactgtg ctagacgcta aagcaaataa aataagaatg gtttcctctc tatcacttgt 18241 ctgcaatggc agaagatgac aatgttcttc cgtataccag tagtttttaa tctaaatcca 18301 tctctatcct gctcccctcc acctactcac tgcaattctt ccttgctcca ttttgtcagt 18361 tgctcatcct ccatgcaact catcctggga tgagttatca ccccaaactc gggaaatgaa 18421 cccttagttc aagccaatcc cagtgattta tttctacttc ccagtattga tttgggcact 18481 ggcaggtgtt gcaattctag gacatgagaa gggaatattg ctaacttctc agtcatcaaa 18541 caagatagta atagtgaaag taataacttg taatgaagat tgacttatta aagttaccca 18601 tttaatgcat aactcatgtg tctatagact gcatggtggc atcagttgtt taaaccagtc 18661 ataaaaatat tgacttttat tttttgctta atgccaccaa aggaactcat ctcaccttga 18721 aaaaatattt ggtcacatat gcagtcatgt gccacataac aatgctttgg tcaacaggcc 18781 atatataaga cagtggtctg aaaagattat aatggaggtg cttcatacag ctatactatt 18841 ttttatcttt tatgcaatat ttctactatg ccttttctat gtttagatac atagatacct 18901 accattgcat tccaattacc tacagtgttc agtacaggaa catgctgtat aggtttatag 18961 cctaggagca gtaggctcta ccatatcgcc taggtgtgta gtaggctatg ccatctaggt 19021 ttgtgtaagt atactctatt aagttttgca cactgtcaaa atcatctaat gatgcatttc 19081 tcagtatgaa tctgtcatta agcaatgcat gactgtaata taagcactaa aacatctaag 19141 aataacaaag gtctgatcct taaaaagcac acaattacta catcatgaaa ttaaaataat 19201 gaagaaattg aaatgaacag gaatagtata gttgaacaac tgcataaaaa ataacaatga 19261 attactataa tattaatttc 
aaattagggg ttggcaagct ttttttctgt aaagatccag 19321 agaataaata cttttgatct tgtgggccat aaggtctctg tcaaaactac tcaactctgc 19381 cactgtagca ctaaagcagt cacaggcaat acagaaagaa atggacatgg ctgggtttta 19441 ataaaaatat aaaaacaggc aatggactgg ctatggccca tgagcttagt ttgcccacat 19501 ctatcttaag tgaaaacctg ttaaactact cttccagagc tttccatatg taaagctcta 19561 gtaacaccca ctagttgaat ttttaagtaa tcttgagggc ttgattccaa atgacctaaa 19621 aaggctgatt ggttaaaaaa tacactcagc attaaatttc aacgtatacg tatttgcttt 19681 tattttgccc atggggaaac caagaaaaaa ggcaaaatgt tcatgtttga aggatgggat 19741 attcttcata attatcaagt ttattttggg ctatgatccc aaaagagaat atagtgcagt 19801 cttgaaatct ttgaaaggtg ccagaatatt tcagaaactg cagtgacaga agttttggtc 19861 aagatgccaa agtcttgtgc aatagtggtt gctgctccaa caagacatac gaatatgtat 19921 aaatatgacc agttgagaca agaccttgga tagaaaacga tattgtctga gaacataaag 19981 gaagtaattt gtgaatacca tggtacaaag cacatggctt aaagaatccc acacctgctt 20041 ggcaccaaca tagcatcata tagtggcagc agacacttga gaactttaca tggtaaaaac 20101 cctggggtgc taggaagagc atccaagaag aaaattacaa aataacctgc aggataatgt 20161 ctttgtcatt cttggctttt accagagtgg gtggatgagg cagcttctat ctaaagggtc 20221 accaaagtca acttggacag atcatctcca ctgctctatc ttttaatata attgatgcat 20281 ctgtgttttc tctgtctatt gactaggaac tactcgatga taggaactcc ttttgacata 20341 gtgttgaatc ctctatcgca gctagacctt aaagccccac atatgtatct ccttcaggtt 20401 gcataaccct tggtattatg aggaaagagc tgtgaggcct tgtaggaagc cctctgggtg 20461 tcactccctc agtttggctc tagagagaga aggcataaga tactcaggta tggctctagc 20521 cttaggttca tcaccatggc cctatggaaa aagaagttcc cgtccacctt atcaaggccc 20581 tgagagccac agcaaagatc aactaatttt tccatactga tatttttgtt gcttcctttg 20641 tgattccctg ttctctattc catatacctt ccaagactgt gttattgaat agtggtttat 20701 aacctattaa taaatcaaga aaattcagtg gatcatgaaa acatattttg aaaatgaaag 20761 agaatcagat agaagaggat aggagagaat attataaaga atatccaaat gtatctctga 20821 taaagataag ctgtttggct tcatgcattt cctgggctga gatgtaaaat gtacttctca 20881 ctgcagatag gttgtggaca aaaaatctgc aacctgctgt tctaagacaa gtaaacaaat 20941 
aaaagtctgt ggtgcattct gagaacaggc tgcaaactaa atggatttct tcttttatat 21001 gaaatagata gtatctcata taaaatggct ttgatagtaa gatcagtaga caatagggag 21061 ccttgggatt tttgtttgct ttttgttttt ccaacaaagg agtagccact gtggttgaag 21121 tttcactctg gtgattttac gtgtggtaga tagaatagag gagaagccct gaattgagga 21181 aagtaggtag gggttctatg actataatta aaatcaagca taatgatgca gtgacatagg 21241 gttgagggga gatacaggac aaagtgatat ggtgatggca aaaatgacaa tgatgatgat 21301 gatgatggtg atggataagt tttatggaga gtatataaga tgtcaggaag tgatttaagc 21361 tcattatggt tattatttca tttaaatctc acatcaaccc tatgagacca tcaatgaaag 21421 ctggaaactg agcatgaagt gaaggagtaa ggaagtgttg aggttgtttt aaaatcatta 21481 ctggagacac tggggacagt gggagaaagg tgatcgatgc tctttagaga tatggggaag 21541 agtgagagaa agagcttaat ttaaaagctt attattaaat aatatgcctg cacgcatcca 21601 ttcacatctc aaagatttgc tgagggccac ttgtgtatta ggcagtacag atgatgttcc 21661 agatcaaaga ccaataagac gtagccctac tccatgggaa gtcaccatcc tttgtgtgaa 21721 gatcacagaa aaggtgattg tggacacaca gaaggtacaa tcagctccac agttcacaga 21781 atagagacct ttgttgaggg cctctgggca cagtggtctc attattccgt gatcatccct 21841 atttggtggt gtttgcttcc cattggcatt tctacctaca gtggattttg tggattctgc 21901 accaggtttc aaccttattt ttacttacac aaaaggcttt ggcagacagt tctagcacat 21961 cacctcttca actgggcatt gccagaaaga ataacagaaa taggacagtt aaatttttct 22021 gtcaagggtt gttatataaa cacccaatct caatttcctt gcctaccagg ttacttttat 22081 tttctaaatc aacatttaaa tcacaaactg aagacagctg cagtgttggc ctagcaagaa 22141 tgtaattttc cttatttcat tgagttgtgt acttggattt gacattgcct aagggataat 22201 ggcgagggat cattttataa ggaagccata taattgagca attttgaggc attggggtac 22261 tcttttatga gactcttctc catattttct taaaccatca gctattcctc attccatgat 22321 actgttttca aaatgcataa acaaagcctt tctcaattgt gatatcagaa atgaactgat 22381 acaagaagaa aacagctcta aagaaaacac aaatatgcat ttctgtctta tagaaccatt 22441 caactgtaaa tattggttta caatataata aggttaagcc ataatgccta cactaacgtg 22501 ctatagtgat tgtggcagga ataagttatt ttctttcatt ttaccttaga tcaggcatca 22561 gcaaattttt cctgtaaagg gccagatagt aaacttctga gacaatgtca 
caactactca 22621 actctgacat tgagtgaaaa caggcataga cgaattgtaa atgaatgagc atggctgtgt 22681 ttcagtaaga ctttatttac aaagacagat ggtaagcttg gtttggccat ggttcagttg 22741 ctaacccctg attttgacca tttgtattaa aatgttacca aaataatatt aattggacct 22801 aacatactac ttgatataca aaaagtgctc tttggatgtg ttttaaataa gtgaaaagtt 22861 tgtgtagacc attaggtttt gcttaatttc ttgactattt tttacatttg tttcctggtt 22921 ctgtgactaa gagttggcca tttacaaact agatagtctg atccaagagc ctatgccaag 22981 agtagccatt tgaataatca gatagtccat gcttgcctcc atggagtaat tttgggctca 23041 cagggaaaat gattggttcc tgatttcaac cactaaaaat gtctccagat tctcaaaagc 23101 attttcaaaa gtctcttttt gaaccatgtt aagtttagaa ggggtaacaa attaagctca 23161 tcagctattt tatcttatta gttattcagt gtgtacatct ccaggctaaa tggatataca 23221 ggagggagga gccacaagat gcaagaagcc tgggtccctg aataacggcc tggaagagag 23281 tttgcccttc tcaccccatt gccaacctgc attagatagt gcttcagtaa gaaatatact 23341 tttgtagtgg tgttaaggca ctgagtggtt agcctacccc aacaaataca agtatgtatc 23401 tgtgagatac agtcttagaa gtagaatggt tgggccattt aaaatttaga aaaatgcaag 23461 aacccaaaag caaatgcaat aaaaagaaag ataaatagct gggacttaat taaaccaaag 23521 agcttttgca caccaaaagg aaaagtcagc agaataaaca gacaacccac agagtgggac 23581 aaaatcttca caatctatac aattgacaaa gtacgaatat acagaatcta caacaaactc 23641 aaacacatta gcaagaaaaa aacaaacaat cccatcaaaa agtgggctaa ggacacgaat 23701 agaaaattct caaaagcaga tatacaaatg gacaacaaac atataaaaaa tgctcaacat 23761 cactaatgat cagggaaatg caaatcaaaa tcacaatgtg ataccacctt actcctgcaa 23821 gaatggccat aatcaaaaaa tcaaaaaata atagatgttg gtgtggatgc ggtgaacagg 23881 gaacacttct acactgctgg tgggaatgta aactagtaca gctgctatag aaaacagggt 23941 ggagagtcct taaagaacta aaagtagaac ttccatttga cccagtaatc tcgctactgg 24001 gcatttactc agaggaaaag aagtcattat acagaaaaga tacttgcaca cacatgttta 24061 cagccacaca attcgcaact gcaaaaatgt ggaaccaacc caaatgccca tcactcaatg 24121 agtggataaa gaaactgtgg tatatatata agtatgatgg aatactactc agccataaga 24181 aggaatgaat taatggcatt aacagcgacc tggatgggat tggagactat tattctaagt 24241 gaagtaactc aggaatggaa aactagacat 
tgtatgttct cactaatagg tgggagctaa 24301 gctatgagga tgcaaaggca taagaatgac acaatagact tcagggactc agggagaaag 24361 ggtgggaaag gggtgaggga taaaaaacta tgaattgtgt tcagtgtata ctgcttgggt 24421 gatggctgca ccaaaatctc acagatcacc actaaagaac ttattcgtgt aaccaaatac 24481 cacctgttcc taaaaaacca tggaaataat aaataaataa atacaattta gacaaatgtt 24541 gccaaattgc cttgcaccct gttgcaacaa tttacacctc taccaacagt gtatgagagt 24601 gtttgtttcc ccacgaagtg tatcatccaa cttttggagc catgccactc tgagagaaaa 24661 aaaaaatctc cttggtgttt aggttgcatt tctttaatga atgtgaggga gagcatcttt 24721 tcaaatctat ttataccact tgtatttttc tgtgagccac ctgtaatttg tctatatttt 24781 agattttctt tttatattaa tttgtaataa agttttatgc aatagagaag ttagccttta 24841 gtttgtcaaa tgtgctataa agtttagtta tttgcttatc tgtcagagat aattctattt 24901 ttttaaaatg tagtgaaact tgttcctctc tttgtttatg ttttctgggt aggccttccc 24961 tagtgtaaaa aaatatatat ctacctatgt tttctgctgg tgcatcattt atagtttcat 25021 tttttaatct ttaaatcttt cttccatatc aatttattat tttcttaatg atgattgtct 25081 cattatctca aatacaataa taacccatct ttttcccttt acaaaaatac cgtatttatc 25141 acgtaataaa ttttcatatg tattttggtt catttttaat tcttcagttt tgtgacacat 25201 tttaataaat ggacagggtt aatctccatt ttatctcttt atcagatttc ccctgactac 25261 cttcacttaa ttattttccc agatgatgtt tagattaatt tagtaaaatc atcaaaaaga 25321 aaaattctgt tggtacttta ttttggaatc acattcagct tatagataat taatgacaaa 25381 atttacagct ttatattatt gaattacttc ctaatttcct ttcttatcca gttgagactc 25441 tgtgttgtgg gattagaatc attcctttgg atactgtatt agcttcctag tgctgctata 25501 actaatgaca agagacttag tgtcttaaac aactcaaatt tatcatctta cagttctgga 25561 ggccagaaat ctgaaatggg tctcgctggg ctaaaattga ggtgtcagca gagctttgtt 25621 cctttctgga ggccccttgg gaagaatttg tccctttgac ttttctagct tgtagaggcc 25681 atctgcattc cttggctcat ggcccctttc tccatctttg aagccggcaa cattgcatct 25741 ctctgaccat gcttccatag tcacatatct ccctgattac agctgggaga atttctgcac 25801 tttaaagtac ccatgtgatt agtttgagcc cacctagaca atccaggata atctccctat 25861 ctcaaggacc ttaactcatc acgtttgcaa agttcctttt ggcctgtgag gtattatatt 25921 cacaagttcc 
agggatgtgg atgtggccat cattgggggt ccattatttt gcctactaca 25981 gatatattct caactgccat gtccctcttc accttcattg aattgaacta gaaaagtcct 26041 aactcaaaag aggatatttg gattaactaa ggctggggaa ttaaaagtcc gatatggtgt 26101 atttagctta acagtgttta gataccagga tgtttatttt tcatttaaaa ttcccagcta 26161 ctaacctcaa acggaccttt tggcctgtcc tgcaatcctg tgacattgct ggcccgttag 26221 ctgccctctc ttccactcct ctcctctttc ctcaaacctc caggacctct tcgccatcct 26281 cacgcctagt ggattaaatt gcttcctatt ttgttggacg ctgagacgca atctggagta 26341 aaatcccacg tgccctccca ccatcttcta actcacttgc atcagtgtcc atataactaa 26401 tgcccatcta tcattatagt gtaaaagaag gtcaaaccct ccacttgtgc tctggatccc 26461 acccttgtgg tctactctag gaaatcaccc agtaatttcc caactccccc atcctcaatt 26521 ctttcctctc tatggtaaat tctggcagct gatgatcatg ttataatagc attcatttga 26581 accctggaag caccagactc acaggagagg gttctacaaa tagataacca caaataaatc 26641 cctatgatca tcctaatgtg acagagaaca gaaatgtgag cttgactatg tgtccatttt 26701 cattgtatca tgattttcat tatatcctat taggtttggc atcatttctt taggctttag 26761 gtcctggcta cagtatttat tttcctgtga aaattattca cacagtgaat ctgaccctgt 26821 tgagaacctc catttgaagg tttgcatggg gatcaagtgg ccacatttag accaaagcta 26881 ctgtcataga gcttgccacc agccacacat tccaaggatc tcagtaaata tttgatgtct 26941 atattcatat gcccaattca gtagcttcag aaaccctcat gcgatgcctg tttctttcac 27001 cagccacaca attccctctc tcctctataa atgagaaaat tatttgtaaa tcattagttc 27061 ctattttgac agaaaatatg aaaataactg ctgtggcctg ggaatatctt caaacatttc 27121 gatagcacaa atctttctct ggatgtgcgg ataaccacac agcctcagaa ataattaatg 27181 gagacagagc ctctagtaga caagttttat aaaattgttt ttggtaaagg ttatttgttt 27241 agtggactct ttaacaagca ggaacaaaga aaaaaatcaa ctggatttaa aaacaaagcc 27301 acagagagta tctgttccct gatatcttat ctgatgtgca ttcaaaccaa gtttcaaaaa 27361 cttttcaaag tcatcattct cttcactaca aattaagttt gctaacaata ttgccatttt 27421 tttttcctta ggaatgaaaa gaatgtctgt gcttattaaa gggagacaaa actcttggcc 27481 cgggccccac atgggccact ttgtgagtta tgttggaggg agaacaggcc tcatctccac 27541 cctacaacca aaccaatacc aggtaaggca ggagggcgtt tctcaaggat aagagcgaca 
27601 cggcctgaca gtcactagta ttcatttttc acaattgcct aagtgatttt cttcttattt 27661 gagctgaggg tctctatgtg tgagattagc atgaaaaaat aaaacagggt ggggatttcc 27721 tcttggaaac aacatgctgt ggttgtggta gtgaacaaac cgtgacaaga aagccaaagg 27781 tgcaaggcat atgccagtga aatatgcatg tcttgtcagc ccattggcct tgtctatctc 27841 tctcttctga actctgtcat aacctccaat ggagtctcag gttctgcttc aagaaaaaac 27901 gtggccctaa gactcactat ccaaagaaaa actctggtca gtgtttttgt gctactcttt 27961 ctggtcaaag aactgagcct cctaccgcag aggcaagtcg cccaagctca caggtgtgag 28021 gagtgacagg atgagttcac aagtcctagt ctttagacct ccaagccagc attctttctg 28081 ttagccacag ctgagccaag agaactgcac cactggcact gtgggtgaca gaatgtgaat 28141 gtatgtgtgt acagcaaggt aggtcaactt cagcacatgg ggcctgataa tttttgttgc 28201 agggggctgt cctatttatt atagggtgtt ggtcagcatc ctcagccttt acccactaga 28261 tgctaggagc atccatcccc taagctgcaa aaaccaaaag tatatccaga cattgccaga 28321 tgtccccagg gcatggggag caaaatcaac cctgcttgag aaccacctat ttgtgtgtat 28381 atgtgtatac atatatatgc acatatgttg tgtatatata tatgaactat ttatgaataa 28441 atattcatga agcatcccct cgacctcctc caaaagaaaa caaagtagaa gagaagtttt 28501 gcagatacca aaatcctccg tggaaatgtt ctagtttcta gaagcttcct atccctcttc 28561 tgctttcctt taagtttcag gcaccctgcc caatgttgct tttcatggca gatttacttt 28621 atgtgtataa catcatagtt tttcattctg cctcaaagta gctgtctgct acatactaat 28681 acaaattaaa aagtgccttt ataggccagg tgcagtggct cacgcctgta atcccagcac 28741 agcactttgg gaggctgagg caggcagatc acaaggtcaa gggattgagg ctatcctggc 28801 caacatggtg aaaccttgtc tctactataa acacaaaaat tagctgggcg tggtggtgtg 28861 cacctataat cccagctact tgggaggctg aggtaggaga atcacttgaa cccaggaggt 28921 ggaggttgca gtgagctgag gttgtgccac tgcactccag cctggcaaca gagcgagact 28981 ccatctcaaa aaaaaaaaaa aagtgccttt atagacaaca gtaaacatac agagacaaag 29041 ttttatggat tcaaggaatt caagtacatg tgtctcatct ttactggaat caggatctgt 29101 ttaaaaaaca aacatcagcc aatcaggtgt gcactggctc cagtgcaaca tacaacacac 29161 cagctgaaaa actggctaaa aaggagttag gttcggtaaa atccctgatt ctgagttggc 29221 atagggtcaa acgacaagtt taacagagtg ttaaggcatt 
cttttcaaat gggttatttt 29281 gaaaccaaac tgagatagac acagcttttc ttctgaccct ttaagatgac cttcctgaaa 29341 attattttta attggttgtt cagaatccat atagactgtt gatatacaca cagctttcct 29401 ttgtaatgct ccaagttgac cttcccttta aataaatgta ttacctgaca tgcaaattgt 29461 ccctcttgaa catgtccttc aaatggacac taatgccagc caacaaccaa gaagctgtag 29521 tttccttcct gtgtgaatat taggtgtgag aagagttcac agctcacggc tttggcagag 29581 ccaatatttg acttgcagtg ttccccggca gaaaggcaga agaggcctta cctttgggaa 29641 tggtgatctg acagctcagg tcacagtcct gttgcaactg ataaaaagcc acttcctgca 29701 gccgggccct ctccagctct gagagactct ggatggggac tgacctcagc cgtacactgc 29761 ggcctgacat gctgttccag gtgaaatcac cctgtaggcc aaaaaaaaaa aaaaaaaaaa 29821 aatcaagagt tacagagttc acttacctct cacaaatctc atcctctctt agtaataaag 29881 tcaagcaaac cacctttgtg ttccaaaatt gggtctcaat tcccaaagtc aaatcctctt 29941 ttagttcact gtggattttc tcagttagtt gcaaatgtca aaatctcaag tatctgctgg 30001 gtcacatgat cttatctcat tatgcatata ggcctgagga aactatgtag aaagatactg 30061 tgagggtcct acttcctcct ttttatggga acacatggga acttctaact tttgtgaaac 30121 actgtgctgc tgctaattgc catcacacac atcagatata gggggatgag tgtgtatgga 30181 gagcctacca tgtgccagcg gctgcaccaa gggcattttt taaaattggc aaataaaaat 30241 cgtacatatt catggggcac atggtgatat ttctatacat atggttatta gatatatatt 30301 tctaataatt agtaactaat ctcaaacact tattatttct ttggttgcta acattcaata 30361 tcctccttct agctatttga gatcatatat tactgttaac catagtcaag ggctttttag 30421 ttaatccctc caacaacctt atgaagtggt taacattttt tacctatctc atgtaaaatt 30481 taataatata atctagtata atatatgaga atatattata atagtaaagt ttatacttat 30541 gacatattat atgaatagta tagtaacata taccaatata atatagatga tgaaactgag 30601 tcttgaggag gttaagtaat tctccccaaa tcataaaact tataagattt ggaataagga 30661 ctcaagctaa gaatgtaggg ctttaaactc tatgctaata ttgtaaccac tgaggttcta 30721 tttgtagcac tctcatgagt tattgtgaac acaataagct tagagttatg atcaacagga 30781 agatgttcag tatttataga gacatttgat gctaaaatac tccacatcaa agagtcaaag 30841 caaagatttc cacaatccta aaagtgtact catgtgtaca catgcacacc cttgcaggcc 30901 actgagggta aatggctcca 
ccctcaatgc agacagtaag aaagtgacgt ccaaagaaaa 30961 agtactttac aatcctctga aacccagtgt ctccttacta aagacttctt gtgaatctca 31021 tgtatcaggc atgctgcaac agatttgagg aggcagctac tgttgctcaa tgatcaagaa 31081 agaaaagttc tatcagggaa ggagaggtac tggcccacgc agggactcat tgcaataaat 31141 ggaacactta acagaatccc ataagacttc accatcttaa agaagtcact cgcctggacg 31201 tggtggctca cacctgtaat cccagcactt tgggaggctg aggtgggtgg atcacctgag 31261 gtcaggagtt caagaccagc ctggccaaca tggcaaaagc ccgtctctac taaaaataca 31321 aaattagcca ggcatggtgg cgcatgcctg tagtcccagc tacttgggag gccgaggcag 31381 gagaatcgct tgaacctggg agggggaggt tgcagtgagc caagatcaca ccactgcatt 31441 ccagcctggg caacaagagc aaaactctgt cttaaaaaca taaaaaagaa gtcactcacc 31501 ttaaagaagg gagacagaaa gactggtccc agcaacaaag tgccatagca ctttgtttca 31561 cactggtgac aaattcctgt ctagtgtatt cattccaaaa gtgaatggca tgatgttaat 31621 tctttgatat tttccctctt tatctgggat ttcattcact catttaatac atgaatatat 31681 tatactctgg taggggcaga aaatgaacgg ggtaagtaaa aggaatggtg tgtgaaataa 31741 tgataactgc taggaaggag aaaaaggcaa agaaggggaa gtagaagaaa ggaatttgca 31801 gttttagaga cagcagccag gaaagacctc actgagaagg caaaatttga gcaagtgtta 31861 aagaagttat ggcttttagt ttctttgcct tgacatgtat gaattatatc cagtggataa 31921 tgatgaagac tagcaggaac ccaggaggag ctcagggagg gtgatgcagg aggggatgtg 31981 gggagaggag tgagtgagca aatctagtca aagtgaggag gctgcagtca cattcagtga 32041 acagaagaaa attcttctgg ggccagtgta ttttgaattt tttctttctt ttatgttttt 32101 ggtataacca gactaagaaa aaataggaca tatagacatt atggttgggg actcaggcct 32161 tgggtgtacc cctcactcac accctctaag cttagagaac tacaaggaac actattgggg 32221 aatttttttt attattattt tctcccagat agaaagtccc cagatgagtt tataaatttt 32281 agatatcaag cttaggtacc tacgggagct gactgttatt tgaagtgtca aaacagaatt 32341 acatacctat tctttaaaag caatcaaaga tgtgacctcc aatgggctta attatctgta 32401 gtaggggttt gacaatcttt ttctgtaaag accacataga aaatatatca ggctttgtga 32461 gccatgtgat ctctgttaca actactcaac tctggcttgg tgggatgaaa gcagccacag 32521 ataatttata aatcagcgag tgtggccata tgccaataaa actttattta caaaaacagc 32581 
aggctggttt tggctggcgg gctagagttt cccaacctgt gaactatagc ttatgactga 32641 aataaagtaa ttcctagaag ggacattttg ttcttattta ctaagtcctc ctggttaatt 32701 aattcaaaca atatttaata agcacagatg agatgctaag ctctgtggta ggcttgggac 32761 tattaagatg acaacacagt cctcgtcctc agagttaacc ttcaggtagg aagaacacaa 32821 acaatccata aaacacagag attgtggtac tagaagtaag tactaaatgc aatgaatgcc 32881 taggagacaa ggctttgtta tctatctgga ggatcaagga aagctttata gagaaggtag 32941 tgtttcaatg gaggcttgaa agttgagtaa gagggggcag ggttcgccca gtagcaatgc 33001 aactttagca gcaggtgcaa aggcaaatga gtttgagaaa acaggcttgt tcagggaatt 33061 acaaggctgg aaggaggaaa agggcaagag gtgaagccaa gcagaagtca gggccagatt 33121 ctgtagggct gtctctgttg gcaatgcaaa ccagattaga ttttcctcca tagacgatgg 33181 ctagtcattc aggaactttg agctagagag gaacatgagc agatttgaaa tatagggaag 33241 atgactggaa acaagtttcc taaggaatta aaggggagag caagtcaaag atgtggagat 33301 gagaaaagac tccgtgctaa gtgggatgct gattccaatt tgaacagaca cgggagaaac 33361 ttagaaccac atagatgcta gagaataaac tggatgcagt gaggggaaga gaaggggtaa 33421 tctaggatga ctcttgagtt tctagcttca gcaactgggc agacagtaac actaatcacc 33481 aagatgggga atggcagagg gaattatgtt tgcaggggga gaaggaagat cacccagttt 33541 tggacatggt gaaaatggta ttttcttttg aaaaaacaga attaaaacat tactaagtct 33601 ggcaaaacag aacctggcca atttatattt ttcttgagct ttgattttgt actctcaatt 33661 aaaacagttc caacacaatc aaataatata tgttttttca aactgtttac aaatgactac 33721 agttagcata ataaaaagta aagtgcgatg cttcaagttt actttagtcc attatatatt 33781 gcagcccacg gttcttaact tcttggattt ttaaacttct ttgttatatt tggaattgtt 33841 aaccaaaaga gagatctgag gttattttgt tatggcagac ccagtgacca agacaatatc 33901 tgtatttttt gtttccttat ttttactttt taaatttctg tgggttcata gtaggtttat 33961 atatttatgg ggtacatgag atgttttgat acaggcaggc aatgcataat aatcatcatg 34021 gagaatgggg tatctgtccc ctcaagcatt tatcctttga attacaaacc atccaattcc 34081 actctttaag ttattttaaa aatgtacagt taagttattg ttgactatag tcaccctgtt 34141 gtgctatcaa atggtaggtc ttattcatta attctaatta ttttttgtag ccattaacta 34201 tccccacctc ccctgatccc ccacactgcc cttccaagcc tgtggtaacc 
gtccttctac 34261 tctctatcca tgagttcaat tgctttgatt tttagaaccc acaaataggt aagaacatgc 34321 gatgtttgtc tttctgtgcc tggcttactt cacttagcat gacaatctcc agtttcatcc 34381 atgttgttgc aaatgacagg atctcattct tttttatggc tgaataatac tccattgtgt 34441 atatgtacca cattttcttt gtccattttt tttcctgaag ttccctaatg attctcccat 34501 gcagccaggg ttgagaacca ctgagctcaa ggcttgaata gattctagtt tgagtgtgca 34561 tccacattgt tcagttaggc tataataagt gctatgtagt aagtctagaa aatccaagaa 34621 gtcctccatt gtcaaagaaa gatacaaatc taccaaggga aagaaagcta tttctatgac 34681 agtcataata gtaaggacgt tactttctga aactgagagt aagcatgacc ctgctttctt 34741 ttattaaaca aatgaagtga tgacataatc cttatttttt cactcaacaa aaaaattaac 34801 ttgaacctga cactctcaga agccactctt agccactaca atattctggc tggttgttgc 34861 aagcttgtga gttgcccctc tgcccacatt ttgttctgtg gaggtagaga gaagtacagt 34921 aaggcgtaat caggtggaag gtgtggttgg tagaataatg tctcccctaa agatgtccat 34981 acccgaatcc caggaacctg tgagtatgtt attttacatg gtaaaaggga ctttgcagat 35041 gctattaatt tcagattatt gagatgggga gattattctg aattatccag gcgagtgcaa 35101 tgtcatcaca aggagagagg tagaggcaga agcgagacaa aaatgtgata atggaagcag 35161 aggtgggaga gagaaagatt tgaaagagct tggctgctag ctttacagat ggaaaaagga 35221 gccacaagac aagaaatgca cacagcttct agtgcaggaa aagccaagga ttctctcctg 35281 gagcctccag aaggaacatg gcctttgatt ttagcccagt gaggcccaca tgagacttat 35341 gtaagtttgt gttgtaagtt tgtgataatt tgttatggca gcaataggaa atgaattcaa 35401 gagtaagggg gagcattaga aaaggaagtt ggtggggtga agggcatgct cgcagacact 35461 taaaaaagta ggttttgacc ttccatcaga ttcttatctc cctgctacct gatgctaagg 35521 tgctgcttca tactacaggt accatcacca atgatttgtg ctggagaggt cccaagactg 35581 gaattctgaa atcacttttc ccataaaata gaattataga aaattacata tagtaaatac 35641 tttacgctct gtgggtctta tggtctctgc cacagttact caacttttgc cattgtaggg 35701 caaaaacagc catagaccct acataaacaa atattccaaa aattttatct atgacatgaa 35761 aagtcaaggc tcatataatt ttatgtgtac aaaatatcct tccttcaaat ttttttcaac 35821 tatttagaaa tgtaaacata attcttagct tgcaagtcat acaaaaacgg gcatcaggct 35881 gcttttggcc cactggctgt tgtttgccaa 
cccctgatat ggagattagg gtcatgtagt 35941 gccctacaca tatttgtgta gcatctacag ccatattcca gcaacccagt cctagggcct 36001 tgcattatca gccatataca acagctagat tcaggagggc aagacccttg tctggcttat 36061 tcattttgat gttctcggta tctaaggcag tacctggcac ttaacagcca ctgaataaat 36121 atttgtcaat ctagggacta atgaaccagc cccatcatga gttaaataaa ttggaaacct 36181 aaccaaccca tacaatgttg tatatatgtg ggaaactgta ttctggattc gaataaacca 36241 ttaagagaca gtcttttgaa atagcctgtt gataagccaa gaaaaaccca tgccagtaca 36301 tgaggggaga aaataaatgt gataataggt gacgcgggga agagttaatc cgtgatacct 36361 atacccaccc atcagacttc ataggagcag taagacatga acacaaaggt catagtctcc 36421 ctttgactct catggcatgg accataaagt gataaacaag cacccaggaa gggcaccaaa 36481 gcctaccagc acagtctaga atatgtttct ggtggtttct caaggctcac atgcatggtc 36541 cagaccagca tttttctttg gtgactgttt atcttataaa tgaatgactc atttgaaggc 36601 tttctgaatc tagtcttttg agtgatcatc atggaatttt tttaaggatt gaaaatggat 36661 gatttcaaat ttgtttagat tttataaatg tagatgaaat tgaaattcat ccaatgtctc 36721 tggatagact aaagaccttg gcagttgtaa aaacatcttt ctattctagg gtactatgtg 36781 acagcattaa taaagggcag caatatgata aaatgatata ggataatggt ccttggtggg 36841 gtaatgttgc ccccagtgag catttagcaa tgtctgaaga catttttggt tgttataact 36901 ggggcagcgt ttgatactgg cagctagaag atagaggcta gggatgctgc taagcacctt 36961 ataatgcaca ggactactgc ccaaaatcaa caactatctg gatcaacttg tctacagtgc 37021 agaagttgag aaaccttggt ttagcataaa taagtctgct gttaagccca gattacatat 37081 aatctcctta aaatgatggt gctggaaata ttaacaatta tttatatatt ttgtttatca 37141 taaaaatatg tactatatat tcgtttatat atcatgcatt actatgtgta ttttacatgt 37201 atttttctac gtattataca cataaacaga atcatagggc ttattttaag ctagtattaa 37261 taagtacaaa cactacaaac ctaattaaaa gaagcaaacc aaaaaaagtt tgaaaacaat 37321 agagtcttga ttaactggag aacaactcac caaaagtgtt aactaaagag atttccttcc 37381 caccagaagt atatgtctct atgtttagaa aaaaagaagc aaaagacaaa ctaaaaccca 37441 aacaatctaa tatctatgct ttattcagaa agtcattctt ttgagctaag ttgcaccttt 37501 acagacttga ttttcccatg aaataaaatt gattgaaata attttggttt tgttctacac 37561 cacattgttt 
tattacatcc ttggggctag tctctaactg cagcatctct ttaatctcat 37621 ccaactgtaa actccccagg gccaaacaag gggtgaggca gggctgttca aaatgtggtt 37681 tgtggaccca aggcatcgga atcacgtggg cagaatgatc attgttaaac acgtaggttc 37741 ctttggctct aaggtagatt tagagaatta gagtcacagg gaataggtcc caagcacctc 37801 cattataaag cagcatcccc agatgagtct tactttttgc ataataaaag ttgagtctca 37861 ctggcatggt ggtgaagact tggattcaga tagatggcag ttctttggta agattcagaa 37921 aatctctgat catcggtttt ctcattatta agatggagaa aagaacaaga cttaactctc 37981 agtttgtagg gataatatca gacaatacaa atgaatttct tgtggaaggc ttataagctc 38041 ccacaagacc tgaataaatg gaaatttaag ttataattaa tattaatcat caccactcta 38101 aatttgcatc accatctgat gtggtttgga tgtttgtccc ctccaaatct catgttgaaa 38161 tacaatcact aatgttggag gtggggcctg gtgggaggtg tttgtgtcat gagggtggat 38221 ccctcatgaa tatcttggtg ctgtccttgc aatagtgaat tcttgctctg agttcatgtg 38281 agatatggtt ttttaaaagg gcatggcact tcttccctct ctctcttgct tctgctctta 38341 ccatgtgaca ggctggttcc ctgttgcctt ccatcctgag tacaagttac ccaagggcat 38401 tactagaagc tgagcagatg acaccaccat gcttcctgta tagcctgcag aactgtgaac 38461 caattaaacc tcttttcctt tctttataaa ttacgcagtc tcaggtattt ttatggtgat 38521 acaagaagga cctaacacag catcttaact tggaagtatt agagacacca aaaactctta 38581 agaaatgtgt gctgaattac acagcaagtt ttacaaggtg ataagagata gagaaggagg 38641 tagcacaggg tgaggggaat aggagtgctg catgagctac caggacagcc ttacagagaa 38701 catgattttt gcaccaaccc ttaaggaggt gagggaatga gccaggcagg ttttgggagc 38761 ccagcagccc aggcagagac tggttgacat aggtatagca tagcctggtg gctaaaagca 38821 aggggtggca agctgggctg cctgagtttg agtcccagtt ttgccactta ctagccagcc 38881 atgtggcttt ttagcaagct acttacattc cccatgcccc attttcctca tctatgaaat 38941 ggggatgatg acagggattt tgctgataca agctgactag ccggggagcc aagtactgcc 39001 tactgattgt tttcataaat caagttttat tgatttatga aaaccacagc cacactcatt 39061 cattgttatt ttctgtggca gcctacaggc cacaattgca gagttgagtt gttgcaacag 39121 agactgcaca gtccatgaag ctgcccttcc gtaaaggtaa ccatctggcc ctttatggaa 39181 aaagtttgct gacccctgga ctggactaaa agtaaagaca catctatgat taggatctgc 
39241 cttttgccat caccattttc ctcctctcct aagattttgt ctctgagaaa catcccagag 39301 agcctgtaaa aaaaaaaatc aaacaacata atttagcata ttagaaaata tgcaaatgct 39361 agcatgaaat agaagtggtt atctgcaaat ggcctgttca cctgggttac cgataaacta 39421 aagaagtaga aatgaacatt gtcaaaccca gggctgggct aacaccatct aaaaaatcaa 39481 atgtcactta catcttgttt ccagtggaaa gaacctttcc agtggttagc atagagggga 39541 catactggaa gtcctcaagg gcaaaaacgt cagagcaaaa ggtcaatttc atttctgtgt 39601 agtcccagtt gaattgtgga cttttctaat ggccatcaat ttaattctca gttctaagct 39661 tctctcgagt cttggatggg aggaagaagg taagatggcc tgcacaaatc tcagggttgc 39721 aaaaactgga aacactggca tgaattagaa atttcttaat gacagggcca gcagctgatg 39781 gttctacttg tctttctcta attaaattta gttcactgaa attagaacga cctccggaga 39841 tagattagac cttgaagata ctgctcacat ttatagatgg tccgctctct gctagttact 39901 gttctaagcc ttttatttat attatctcat ttaattctta caacaaccta aggactatgt 39961 taccatcatt attcccatat ttcagatgaa gaaactgagg cacagaactt aagtaacttg 40021 ccaagagcac acaccaatta agtggcagag ttaggatttg aagccaggct ttaggttcta 40081 gaatttagac tcataaccac tatatgtatt tcataacatt tcatgataaa ccaatccaaa 40141 cctctaactt tacaaatgag caaagcagct gcagtggcat aatgggatct gggacatcag 40201 ggtcatgttc ccctctttcc accaccaagc cctttcatta tgagacatac ccttataata 40261 atgaaagcca gacagtgttt aaaaattcag aggcttaaag tttcctcttt tgtaaaagac 40321 agcacaagca catccgatgt attattagcc ttatttaata tccaaactgt tcccctaaac 40381 ccatatttat cacagccaca gcaactaaaa tgttgcaccc atattcaaga agttaagagg 40441 gaaagataaa ccactttacc ctttcagact tcaaaacagc ctgtgcccga atcttttaca 40501 gaaattagct caattatgct gttctgttaa tcatgacaca atcctttcct gcttgagttt 40561 gctcatttca agctgccaca tcattaatct aaaaataagg ttctcataaa tccatattga 40621 cacagacagt ttttagctgt cacacatctc agacactgat cattccagtt ttcttcctaa 40681 atcacaagat ggatgtgaaa gaatcacaaa actttatcac aattattttc tgaaaaagta 40741 gtgtaagtat agatatttat attgcttacc cataaattat aatcagccgc gtctttgctg 40801 tcctctctaa tctaactgaa ctaagcttgc agtaggaagt tgagtctgtt ttaaaaaaat 40861 tatttttttg gtaaaaaatg catcctgaaa tggcagatca 
tattgaatgt ctgagaaatg 40921 gcccagctgg catagctgtg actcccacga taccttttgt tgccagagac ctccgctact 40981 gtgaatcttg ggaccaaaaa tatgcaatct tttcatcgga tcattgagaa gttcctgtcc 41041 tttcagaaca gaggcaggct gtgtccctga gtaagtgcac aaatgagtga gtgcgtgtgt 41101 ttgtatgcgt gtgtgcctgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgaga 41161 gagagagaga gagagagagg gtgagagaga gagagagttg gggaggagga aagaggggag 41221 ggaagggaca aaaagaatga gagtattttg aagaaatata ctcgagttag aaaaggattc 41281 ctctaaaaaa aatcctcttt tccacaggga tctacaaacg gtaattggat tgtaactgtg 41341 caaactgtcc ttattaggaa tcaaccaaag aatgttggaa gccactggtg aaacccctga 41401 gagcaagata aagcacatag gatgccttct ttgtttagga caaattctca gttcttaaat 41461 tatattttaa aactactgac tccatttttc taacactagc cacacacact aaaaagcaaa 41521 agttttcatt ctctttcaca ctgaagcata acacatgctc tgtattttat cctctttgtg 41581 ggacagcagt ccttgagaag ttgttcattt cctgaacaga gggtgtggag gtcagagtga 41641 atatacccac atggccatat ctaagagggt tcaggttggg gtgggggttg ggggtggcat 41701 ccatgaagag cagagagaaa agggaagatt aattccaatc gcgaattacc ttgatctcca 41761 tgcattccac atggcactga accaaataca actcgaaaag acatctttgt ttcccagtca 41821 ataggaagtt atagttctct catgggtctc aagtgaattc caacaaaaga atcactgtga 41881 ttataattta caaaccggga aatgatttca tggaagatct tactgattca aaattaagga 41941 ctcggtatgc ttttactagg attctgtgaa actaatcatg taagccacac atagcctctg 42001 ctgctgctgc tgctaaggaa ggaattttaa agcagtaaaa tattccaggt ggttttcttt 42061 actagaatga ttaaacttaa acagagattc agagtaaaat ctacctgtta ctgcacatac 42121 ctccaccacc actatcctaa aaactgatgt tgaaaatacg cattattcca gtagacagtc 42181 aacatggctc atactggata ctcatcaatg gaagttacat cctcatttaa agatatctaa 42241 tatcttaata tcaatatatc taatatctta atatattatt tcacatgaaa ttgataaata 42301 tgtaattcat tttttattat taggcaaatg atcctttagt ttataaatag atgacagcaa 42361 catgtagttc tgactgagca cttactaagt gtcaggcatt gttttaggca ttttatgggc 42421 actgacttat ctaaccatca aaataattct tagcagtagg ttacattatt atcctcatat 42481 tacagatgag gaaacagaag cgcagaggag ttaagtgact tgctcaaaga ttcacagctc 42541 ataagtaatg cggctgtcat 
cccaaccact atgagcttac aaattaactg gaagacaaaa 42601 ccacagttca gagtcagtac tagtaggagc catgttataa gtatctaata aggaagagac 42661 agtgatgaaa taagaatgtt tacctaacac tagagtggtc tgttaaatca attatttgat 42721 ggagacatac actgtcaggt caataaatgt ttcgtaagaa ggtgactaag tgaataatct 42781 caaataatcc cgagagacaa catcttgtga aattcctagt ctgttttaag agtggaagca 42841 aggcaggcct taagaggaag tgcagaaata atggagaagg tgtgtatatt tggtattctt 42901 tggaaaatgc attcaaacca gaaaaacaaa acataaagat aaggaatcct ttttaagatt 42961 taaataacaa ctctttaagt aaaaccaata agagaccaaa gagtcactga aagagccttg 43021 gaaataaata tcaggaagca actaaaaggt tccatctgtg gaagagtttc gctaatcaca 43081 aaaagtgctg agagcacaat ggtcattttt ttcccattta accctccata tctaagcata 43141 ccaagaaaca atttatagcc taagaatcta tgccattaag cattcattcc accttcttct 43201 tggctcttat tctggattat ttctgtttat atcactggtg atattttgca catttaaggc 43261 aaaaatatca tagaatcact gtcaaacttt ttgagtgaat acatttctga ttacagtact 43321 cttttgccta aaaccctttg atgtctcctc cttagctcca ggacaaagtc caaattcctt 43381 agcctggtat ccagtgcggt gtgcttttac ttagatggaa taaaatcaaa tagccactac 43441 ataatagtga atgataacag tagaagttta ctgagcatgt attaggtgct aagcacttca 43501 tgtgtgccat gtattcactg cctccaacaa tttcatttta tagaggaagt gatggaggca 43561 cagaaatgga tggcgcacag tgaggaaaac actgactgct catgaagccc gtagcccctt 43621 tttcgtgggc attcagctag tctacatttc ccagcctctc ttgcagttaa tagtggtcat 43681 atgactggag tctagctaat ggaatgtcag aggaagtgac aagcttcatt tctagcccca 43741 gtccataaaa acctggagac agcatccagg caacctgcat agccatgtgt ggtagtccca 43801 ggagtggacc acatttttca gtgttctttg cctagtgtag tcccctgccc ttcaagcccg 43861 cctggctcta tgagttactt taaccaactg aatgtggcag aaggggatgc caagcccatt 43921 ctggacctga gacataagaa ggcatggaaa cttctgcttt tgcattcata ggacccttga 43981 gcccccatga cataggtgta gccactatgt tgaagagacc acgtggagag gcaatatagc 44041 cagcgtgaaa ggtttagttg ccccaatgtc tcaacctaaa tccagcccac cctacctcca 44101 gctgaatgaa aggaagaagc agcccaggtt cagaatcatt gcttggagca cagctgtccc 44161 accaattgtg gccaatctaa cagtatgaat gacaaataga cttatgttgt gaaatttggg 44221 
gatgtatctg atatagcagc tgacctcact ttcctgaact tcccatatga acacagataa 44281 taaatctaag cagcctgatc tggagccctt gcgcttgaaa ctcctcctat acttctcccc 44341 actgaacgac atcatgactc atttcgtccc ttcctacatt ctcacttgga aacagagtag 44401 ttgatcaact ggatggtaaa gtaaaaattg ctctttatat attgttttgc tgagcatgta 44461 gaaagtcttt aatataaaag aagacaaata tatttctaat gacatccttc agattttgaa 44521 taatatatag ccaatggtta gaaataggta atgcttttat ctttcaaaat catgactgac 44581 tcataaaatg gcacagtgaa atttcattta gattatattc attactgata aaccttccat 44641 tggtcctggt aacctttttt gagtgtagca tagagttcgc atgtaaggtc ttaatatttt 44701 ttaagactat atataaaaat gcaaattgtg agagacattt tagttcatct gctgccttac 44761 agtgctcagt gaaaatgtgg taccatcagt ttaaccatca gactctatct ttgtattgct 44821 cagaacttcc aggggctagg tgataagcca atgtatattc tctcattgct gccatttgct 44881 tccttaactg tagctgtccg tcctttcccc attcctagca ccctgacctt ggtctaattg 44941 ctagcgtcct tctgatgcta ctgtgaccta tttgcacgga gagaatgttt tggatccttc 45001 agatctaaac tctaccatca aagcaaatct agtaactgga aagctattta agaccatgaa 45061 tctgaaacag tcaaaaccac aataaagaac aactaaaatt tactgagtac ttattgtctc 45121 catgaatacg ctatcctcta ggaggcaggc actagaatta tctctatttt gcagataaga 45181 aagttgaaac aaagagagcg taaagcgtaa agagcttgcc agaagtcaca caactggcaa 45241 gtaaagagtt ggcactcaaa cccaggcagt ctgttttcaa atgcaataat tttaacccct 45301 aggcaatatc acagcatgcc agcaaccttt tctttattca cttccttgga agacctataa 45361 acaaagtaag tatgacacag agcatccaac agagactgga aggcctctct gcaactggga 45421 gaattttggt aaatacatag atgacagctc agggctcaaa gccctgtgga attgagacac 45481 agaaatatgc agaaacagag gcagggcacc aggatgaaac agagaaaaga aagccaatag 45541 atggggcaat gacctccagg aataaggaga gggacaggta tttcatttaa agaaaaaagg 45601 aacagccatg ttagaaaacc tcctggatgc cagatccttt acaagtagaa ctagccactc 45661 tcagttttag gtaggggtgt aacaccatga aagagatgtt ttaggaaaga tgaatgaggc 45721 cgagtggtgc agaactgata ggaaggggat aaagagagga agcaaactga gtctacaatg 45781 taaccggtaa acggaggctc tacctgtgga atgaactgat aagaagcgag tctaggcagc 45841 aataaaacaa gctgactcta ggaattcttt ctttatgaga gtccctctct 
ttttccaata 45901 aatatggatc cccagaaaat taagcccaag attatagctg gagaaatacc tttgagtaca 45961 gagtgtgaaa tttgcatctc attttaggtg cccaggaggg ttgacctaga gaaaagcatt 46021 ttatcttgat tcttatggag aaatctgtat atcgctggtt ctaccacact gcagagaatg 46081 gtacttattt tgtagctgct ccttctactc atgtagtgag gaacgctgag ttataaggtt 46141 ggatgcttca ggtttcttga aataagaagg aaactcagat aagagggcac tcattctagg 46201 taagatgtca tgtgaactct ttgtagggca ggtcatttat tctgttatgg gtatattcac 46261 tgctcattaa acatggaatg gatttcagag caagattacc taaaagatac agggatgctg 46321 aagtactgtc acctcaaact gaagaaccca atcccaagtc agatatttac aaatcagcag 46381 gatataaaac aagccttaag gatagaagaa tatcgtattt aggagcttga aaactcttag 46441 gccaaacaaa ctaggcaaaa taaaaaatgc caaagtgaat tcctttccac tttctcattt 46501 caatgctata tttggtgaca ttgtgtttca tcaattgcac tttggcactt tgcaaaagtc 46561 tcttctcagt ggggttcttc tctaatgaga aaggaaaaca atttcctcaa aattacatgc 46621 gacttcataa acatattaag ttaataaatt aattttggat atcatcaaat atatagacgt 46681 tgcttctaat ctgttgccta aaatgtgagc tgtggacaat gagcctctcg gttttactgt 46741 aagataattt atcctattaa actcttttaa taagtaattc aaaatctaca tgaattgttg 46801 taccacttca ttctaagtgg ggaaaagcaa gttattcatt tttagttgca gtctggtagt 46861 taactcatct aaatcttgta gagtaaatga gcaagcaagc tgctttaaat aaaaatatat 46921 gcggctacat ttctaggctg cagaaaaagt ctagcattaa tgatattttc agaatcatct 46981 tccaaagcaa aatgacctca gatacaagtg gtgatacatt ttagaacaaa tgtattctcc 47041 aaggaccaac agttctctaa gttcattgca cattataatt acttgctgta gttttcaaaa 47101 atttttatga ccatgtccct ctccccaaga acagcatggt ccaatagact tttctataac 47161 aatggacatt tttgttaaca gtactaattg ctaaggtggc tactagccac ccgtggctaa 47221 tgagcacttg agatatggcc agagaaactg aatattgaat tggtaattgt atttaatgtt 47281 aattgattca aatttaaatt accacaggtg gatagtggct actgtattgg acagcacagg 47341 tatagaccaa gtcaattaga ctgtctgggg ttggggctca gacaatgata tatttataaa 47401 gctttcccca aaattctaag gtgcagtcca ggttgagaac cattgtaatg gccagagctt 47461 aatgaaacta tcagggaaga caaattcctt gaactttttc caggatccat aacttttctc 47521 caaattccac taagaatttg tcatgtacct 
ttgcttccct ccctcctgcc tccactcctc 47581 atttcttgca cccttccaca tactcatgtg atgactgcac tctccatata ttccttcatt 47641 gactgattct gccagatttc tcccacaccg agagcataac gtgccctcat ctcaaagctc 47701 agattctttt tttcctttct cttccattga cagcagaaag aaaacttaac aatgctacca 47761 attaagagat taagtctcca agcacataca tccataattt aaaattgttt ttcacgtgca 47821 caggcttgat ccagatgtca ctttccatga gctctgccat agatgtgatg gggaatattt 47881 tgaacagtag cacagttaag cagctttcat cttatctgtt cattttgcca ctgtttcatt 47941 caaatgggat gcagtgttat agcacagtgg ttacagggag gactctcctt ctggctgcct 48001 gtgtttgaat tcaagttcag catataacta gccatgtgag tatccaacgc ttttaagcct 48061 caattacctg ctctgtaaag tggagataaa atagtcctgt agggaggaga gagctatgca 48121 tgtaacactt atgataaatg aacgtgtcac atgcgctgat agacccaggt tttgattgct 48181 ttttccctac acccctcccc aaacctcacc cttttttttt ggtaaataaa gcaaatagcg 48241 gggccaactt ttgaaggaga aagtcatgaa atccctaaaa gcatgttcca agtatttgat 48301 aaattaacgt ggcaatatga agaccatttt gaaaacagtt acaaaaaaat caattatatg 48361 gtaatctgat ttttaaaaat tattaaaaat ttttatttta atgagtcatt tttcttgact 48421 tccatgccac cagttaatat taataggatg gacaaggaag ggtatgctgt aaacatagct 48481 acattttatt tcatttgtaa aaaaaaaaaa ccagtagact cagaattcca gaattcagtg 48541 atatgaacat aaagaacatg atgtccaatg ttcaagtaca taatcgcttt gttgaccttt 48601 tggtaatttt aactgttgta ccctaacaac atgttcccat gtcattttca ctcatgggca 48661 ttactgattt gtgcacgttc acattttcta cacagagttt tgctgcttct ttcttgttaa 48721 gtatccatct atgggtagat ttttttttta acctcagaaa ctacaaaatt acaacccatt 48781 agcaacataa aaatttgtag tcaccacaac tactaattag tggtgcacag taaggattat 48841 tcatcagtga ctcacagcac agataaaata agctgagagc tatgtcctgg atctaaatcc 48901 aaaagattta gatatgtggg gaatgactat catctcttgt acatatagtg tgtacccatt 48961 attattccag tcccttatgg tgtactcatg gttttaatca ttaaacaatc ttaaaatgtg 49021 aggtttatta atatgcccat tttatagatg aagaaatcaa agcacagggc agtcaagtaa 49081 cttgtcccag gacacatatt gcaagtggca tggctgaaat tcaaactcag ggaccttagt 49141 cctgagtttg tggtcttaac cactatccta tgctactttt ctgcaccttg ctctcctact 49201 caagaaaatt 
catcaaaata actgtatgag gtggttctag gctcatctct aattctctct 49261 catttagttt tcttacaaat attctaaagt ttgtaactta attattatag gtaataaatt 49321 acttctaaca cattgactga ggcatgcctt gaccttgagg tggcaactgt tgacttaaag 49381 cagggtcagc aaaggacagc ctaaggccta tttttgtaat caaattttat tggaaaacaa 49441 ccacacccac tcatttacct attgtctgtg gctgcttttc cgatacaaca gcaaagttga 49501 gtagttgcaa cacagacagt atgatccgca acacctaaaa tgttttctat ctgattcttt 49561 acagaaaaag tttgccgacc ccagatctgg acccactgaa aagaaatcac cagatatatg 49621 acatagggga tgacaatgtt cacatttcat ggtggagcat ttcactattt caaacagcaa 49681 aggaaccaca ttttttttct gtttcttttt ttaattaatt ttaagttcca gggtacatgt 49741 gcaggatgtg caggtttgtt acataggtaa acatgtgaca tggtggtttg ctgtacctat 49801 caacccatca cctaggtatt aagcccagca tgtattagct atttatcctg atgctctccc 49861 tcccccaacc ccaccaccgg acaggaccca gtgtgtgttg ctccccttcc tgtgtccagg 49921 aaccacatgt ttttaaaact atgtttgtga catctctgta tgtatgagca gatgggtcca 49981 cagagatcac tcaatgtcta attttctttt acgcacataa tgagaaaaag ctcagggaga 50041 aaaagtgaaa tcagtagcat ggactttagt atttggtatg agctacttat taggaatcag 50101 catttcttcc aagtattctc gaaagaaaag acattcatac tagagtgatg cctatcaaat 50161 ccactatggc cggcctgaga aacttgtaat taaaaactac aaacattata gatttgtctc 50221 catattaaat agatttaact atctctacat ctatgtgtat ctatatgtat ctcttgacta 50281 ttatatattt attatatgat gaattatata tgtgacatat ttctatctta tatagacaaa 50341 tatttataat aggtatgtga taatacacag tgggtaaagg gtacgtatgc aaatgcatct 50401 tgctttaagt aaaccaatta tttgaaaata cacacaataa aattaggggc aatgattctg 50461 tggttcaaat acatcagtat gggttccctg cagtgaccca ggaaaactga agcaaaatca 50521 tttttaacat tgcttcctat gtctctccta ctcacactca gaaagtcacc agtttctgtg 50581 gatcccacct cattaaaaac tgtcctttcc tttaacactt cctctctggg aatgtccacc 50641 agcacttcca cactcttgta ggctgtctct ccctacaaag caaaactcat cttgtccctt 50701 cccttttgaa agccatcatt gaggctccta gctttccaga tgaagttctc aactctgagg 50761 tgaaaacaca agatcgctca agatctggcc actgcttagc tcttgggctt caccaagcct 50821 cacttacttc acatccctta atttcagcca catggaagag cccagaggac ttcatggtta 
50881 gggtgagggg tgctctcagc ccaaatggct tccccatgcc ttcctactca ccctgtaatg 50941 cttactgagc atcacctcct ctgagaagcc ctgtcgtccc cctacaggga cattacaaat 51001 gaatcatcac attaacaaga aaatcatacc atctaaaaca caaacaagtt tactatggac 51061 ctataaggaa caccaaaaca tgaccaaagg aaattcacct tatgtttgtt aaagaaggcg 51121 aatctgcttc tttcaaggtt ccaaatgacc tagacacttg agatgtctag acaatcactt 51181 gtgatgtctc attgagggag gaaagaggag gggtgggaat gtttaaatgt cagctatttt 51241 tccagaaaac tctaacgtca ccgatacgtc cctcctatgt aaccatcaca ctcatctccc 51301 cccgaccctg ccccatctca gctattttct tgactcttat ttaaacattg gaaagctatt 51361 tgattatcta tatttgtgcc ttttctccct ggctagactg taggttctgg aagaatagaa 51421 acagcactaa cattttgtgc ttttgtatat ttgctggttg aaagaaaatt tgaaggagag 51481 tagcaattct gtctatacag aaaatcattt ctgtctatta gcatggaaac aagaccagca 51541 gggaatattt atatttagga catttatttt aaagtagcta aaatcacttc cagtaaaaat 51601 gcaaaaaaaa gaaaatcttg cttcctgact tttagtatta agagacaaac atgtaaaagg 51661 gaaaattgaa aagggagaaa ccaacatgaa gtatgacacc aaatgcaggt atgacgtgaa 51721 aacaagccat ctgcacttac tggttcacat ctgattttct agctgccact tttcagattt 51781 atttttcaaa aatgttttct cctcaatata cgacttttaa aatgataaat tacacagtat 51841 ttcacagacc tgtgtaaata cttttgctct gatgtataga aaatcatgct ttctctgagc 51901 aatgaagcat attgaaattt acattttcaa aaataaatta tagtaaaata atagaagagc 51961 ttggacataa aaatcatatg ccataaaagt taccacttta aagtgtatag ttctgtgttt 52021 ctttttagta tactcataga tagtgtacaa ccatcatcac tgattccaaa acattttcat 52081 cacaccaaaa ggaaagccca tacccagtca ctccccattc ccctctcacc ccagcccctg 52141 ggaaccacta atctactttc tgtctctatg gttttgcatg tttttgacat ttaatataaa 52201 agtagaatca tataatatgt gggctttttg tgtctggttt ctttcactta gcatgttttt 52261 ggagttcatc cacgttgtag catgtgtcat tacttcattc attcttatgg caggataata 52321 ttcactgtat agatatacta aattttgctt gtccattctt cagttgatgg gcatttgggt 52381 tctttccaca ttttgactgg tatgaataaa gctgctgtga atattcatac acactttgtg 52441 tgaacgtagg ttttcaatga ttttgggtat ataactagaa gtggaattgc tggatcatat 52501 ggtaactatg tagccaaaaa caaaaaaaaa aaacgtagac 
aacatgactc caaatagcaa 52561 gattcaaaca cttattttag tgcaaaactt gttttccgcc tcattccatt cacatgagat 52621 atcaacaatt acaagaaaac tcatgcaatg tttccacttc atatagatca gccatcagag 52681 ctttttccaa gctgtgataa ggaacacaat ccttcagttt cttttgcttt ttacagtctg 52741 atgaatatga gtgtcttgaa aaatttagag gtttctagta ttcttttttt ttttaaaaaa 52801 aaattagccc tgtaatggct ataccacaaa gagtagatta actaatttaa tcatttatgt 52861 ttatatttta actctaaaga ctaattgttg atatccagca tggcaatttt acttgttaaa 52921 gtacagttca gagtttagtt cctttcaggg agatattctt gcctatatca gaagagaagc 52981 taaaaagaat gctacacaaa atagatcaat aggcttgctt taattactct tgtcctcttt 53041 gattttattg cttactgggt gaggaaaagc cagaagaatt taatacccct tacaagatag 53101 tcattgcaca tttcatgagt tgctagggtc aaaagcagat gaacaaatgc tgaacttctt 53161 cctggctcta cagaacacga ccccacaccc tccagggttg agaaatggaa gaagccacta 53221 ggataggtag tgcagaataa tatctctcac ccctaaggtt tccacaccct gatttctgga 53281 acctgggaat gtgttacctt acatggcaaa agggactttg cccatatgat taagttaata 53341 accttgagag ggggacagaa gtctagatta tccaggtggg cccaacgtaa ccacacaggc 53401 ccttaaaata ggacaacccc tttcagcaga attcagagtt tgaggaagat gtgactatag 53461 aataatgatc taagagatgt aatattgctg gccttgaagc tggaggaagg gggccatgag 53521 ctaaggagtg tgggtggttt tagagcctcc agaagaaatg cagctctgtc aacaccttga 53581 tttttgtcca gtgcaacctg tgtcagattt ctaacctaca gcactttaag ataataaatg 53641 tgtattgttt catggcacta aatttatggt aatttgtcac agccatgatg acagattaat 53701 tcaaaagaaa tcagaatatt tttatgcctg caagggacac atacattatt taatgcagga 53761 ttttccaagc tgttccataa tcaaatatat ttgaaaaagt tgtgcatggt agcttcaact 53821 taagtagtca caataaacaa taagaataat agagaagtcc tgcagagggg aaatcataac 53881 tcggagtacc tcctaaatac acttgctata atctcacctc cttctatttt ttattttttt 53941 ttaattttca gaccatctct ccatatcact cgaattcccc agaatatagt tgggacatgc 54001 taaactaatt ccatctgctc agaacctgag gcacaatggg aagaagtgac aagccaaggt 54061 gcagcttggc aacacaaaga cataaccctg gactcatact ctccatctgg agtcccgtcc 54121 aggatgccac tctgtcttgc tgtatctgca tcacagataa tggcaataac aaagggatgc 54181 agccacagtg tgtttcttaa 
ggttcagacc tagcttaaca tgatgacttg tttttctatt 54241 gccagaattt atgaaagtta gatctttggt gggagaagag ggtatcaggt gaaaatcaac 54301 ccaacaaagc tcctaacaga taaatttttc tacttaataa tatcaaaaac aatacagctg 54361 ctttagcact gaatatattt tattatttaa gataaaagca ttcaacacac tattgaaagt 54421 ttaataactt catattaaca agagtaatac aataaaatat ttccccacag tgatttatcc 54481 tgttcctaaa aatgcctaat tctgctgcca aagtctattt tccacaaatt ctagtttcca 54541 aatgttaact tgagaaggaa cagagagaga caaccaaatg ccattgtctt ccatcatagc 54601 acttaatgtt ttctccacat ttccatttcc cattgcctgt gttccaagag ttatgagccc 54661 cttgtccttt gatgtttttt cagatatgca ttatcctcat cctttttcca ggatacacaa 54721 tgggacccct tcttccaccc ctccattttt ttttaataat aataaagatt atgactaata 54781 aacctaacaa ctttcgacac ttattttttt gccaaagact atgccaagtg catcacatta 54841 ataatctcat ttaatctcaa caacaaccaa ttgaggtaaa tattactatt aatatctata 54901 tttaacagat gagccaaatg aggcagagaa cttaggtact ttgtcctagg tcacataaaa 54961 agtggctgag ccagaatctg aacataatct gatttagagc tggggtcttc tataagcact 55021 agcttagtag tcctctactg gggtgattct gcccctcaag ggacatttgc cattgcctgg 55081 agacattttt ggttgtcaca actgaggagg aaagtgccag ttgtgatggg tggagtccag 55141 ggatgcttct caacatccta cagtgcacag aacggccccc acagcaaaga atgatcccac 55201 cctaaatgtc aacagttcca aggttgggaa acactgctct aacctaatgt ctcccaatgt 55261 cagctcattc tgaatgggac attttttggt aagacatgca aatggctctc acctccaggt 55321 tagaatcttc tttctctact tgggaaagga ggttatctca atctggagtg gatggtactt 55381 agagccacag agtccatttg gtgtatatta tcatagcatc aatgtactca aaagtttaag 55441 ctggcatcct gggacactag gtcattttat ctttttgtcc ctcagtggat acatgaaaga 55501 ggcactagct cactagagcg tgactgaaaa gagccagaag attctggttt ttgttagtct 55561 tgccttgtag tcaggaagac acaatagcag cgtctgtgtc acagataatg gtaatagcaa 55621 aggggtgtat ccacaatagg ttacttgagg tccagaccta gcttaaaagt caaaatattt 55681 aggccaggca tggtggctca catctggaat cccagcactt tgggaggctg agatgggagg 55741 actgcttgag tccaggattt cgaggctgca gtcaagacat gattgtacca ctgcactcca 55801 gcctgggcga cagagcaaga ccttgtctca aaaaaaaaaa aaaaaaaaat caaaattttt 55861 
ttcggagcag ttctgtgaat accatttgac ccagcaattc taatatctac tttatagcaa 55921 tacttgcaaa atgtgcaaga tacatttgca aagacattca cttcagtaca atatctgata 55981 gcaaaaaaac ccagtcacca ataaaaacaa cggaaatgtg agcatatgtt caccaaaata 56041 tgtgtgctag aatgttcata gcagcattat tcaaaacagc cccgaactgt aaattatcca 56101 aatgttcgtg aaagtagact gaataaattg tggtatatca cataatggaa aactatgcag 56161 taatgggaat gaaaaaacaa actccaacca cctacaacaa tacagatgaa tcccagagac 56221 atcacgttga gcgaaagaag tctgacataa aaaataacat gctgtaggat accagtaatg 56281 taatctataa aaccagataa aaccaatata tgctacaaaa atacagaata ggggttacct 56341 ttggggagaa gtgactggaa gggagtccaa gggatccctg tgaggtgctg gtcagggatg 56401 gttctccatc tggcacaggt gtgttcactt tgcagaaatt cattaagttg taaacttatg 56461 tgcacttctc tacatgtgta ttataatcta ccaaaatata tatattaata cttatatata 56521 attaatttta ttatatatta gttatatgca tatatgttta aagaaacata aaaaacctaa 56581 gtgtccatca gtgagcaact ggctaactaa actacggggt atctattcag tggaattctg 56641 ggcagtcatg aaaaaaagtg gagtagttct ctctgaataa atttggaaag atatccacta 56701 tatgtaaagt taaacaaaga agttgcagaa cacaagttat ggacataaat gtttattata 56761 atttaacagg gagatctggg gttatatatt ctcaaatggt tattagtggc catcactgga 56821 ggatgggatt tgtgggactt tgacactgcc taatgaattt ttatagtgtt ggaatttatt 56881 tacatgtatt atgtttgtac tgaggggaaa aaaggacatt tagaaaatta agatgaacaa 56941 aatatatctt aagtttgaga ttgttattat ttcatcatcc atgaagatta gggtttctga 57001 tagatgaagg aaacgataat ggaataaaca gagtgtcttg ccttcagggt tgctaactga 57061 acccattgat ggaggtcact ttggtttgtt gctggcaatg ttaacaattg cttcaaatta 57121 gcacttcttt gctgggttgc tctacattca aatccagttg tggtgtggcc accacaaaca 57181 agattcttgg ccacacaaag aatgtgtcta caaaatggta aaacaactga atttgtttaa 57241 attaacaaaa ccatatactc agaggaagga aaaatcaaaa cagctacacc caaagtaatt 57301 cactcggatg ggtggttcat gaaatgcacc cattcattaa atcaacatgt tagcagcctc 57361 aagtggactc tgtgagagac tcaactgcaa tgcaatattc ctgaataaat caagagacca 57421 taaaagacac ttttttttgg cccatcatta gcaaacgttt tctgagtgct cactgtggtt 57481 ctgccatcct gctaaaactc tggggatgtg ataccacctt tataattgca 
aatgtcatta 57541 tgttgagtgt tcttaattcc accaatgaca ttgaatggaa attagaaaaa catccactga 57601 ttcaggcaaa gttgataaaa ctacaacttc tcatatacag taaaggtctg gcagatgtaa 57661 caatttataa ggtagaataa gttgtgttca atggatctca accatggcta cacattatat 57721 ttgatcgaaa cagaatttct ggaggtggca cctggccatc gatatttata aaagcttccc 57781 caggggattc aaaaacatag caagagttgg agccaatgct tagaccaagt aaatcagaaa 57841 ctaggtagaa tggaacccag gcaaaaatat tatttaaaag ctacacgaat ggttcaaaaa 57901 tatagtctat gttgagaaac actgcttcta aattgatgtt ataaatgaag ccagaggcga 57961 cacttcagta agcattatgt aaagagactc aataccacct gtccttatct tgtgcagttt 58021 tgttagcaaa ttatcacaag gtagtattct accatgttat ggtttcagac gtttcatgta 58081 aattatatct atatctccat gtgctctatt tacaacattt catttgattc caacaacaat 58141 cctatacaga atgtaatatt atccccattt tacacaggag ggtaataagg cttgacagag 58201 attcagcatc aatccctcag tttcacaaag tcaagattcc acagtctata tagtcatatc 58261 gtgctctatg tgcctgaaag cagtgtacca atcaaagaat gttttctaag tatctattat 58321 gagctaaatt ctcttctagg cactgaggag atggagaata actataagat agggtcttct 58381 gtaggataca gcttacagtc tagctgagaa gctagtatca catggccaag tttggaggat 58441 tttgcaggaa gatttctgaa ttggtaggag attggtcctg actggattcc ttccaagatt 58501 tttctctatg acacaaattt tgctggagaa taaaggtaaa attatttgtg taggaagcag 58561 aataatgccc cccaaaatat gtccacatcc aaatctcaag aatatgtgaa tatgttatgc 58621 tacctggcaa agaggagtta aagttgcaga tgggattaat gttgctaatc atatgctgtt 58681 aaaataagga gattgtcctg aattatgtat atgggactaa tatcatcaca ggagcaactt 58741 aaatgtggaa cagggaggca gaagggtcag tgtcagagtg atgtgatatg agaaagactc 58801 accaggccat ttctggcttt gaagatggag gaaggaaaca tgagccaagg aatgtgggca 58861 gcctctagaa aatggaaaag tcaagaaaaa tgattctccc ctacagtcac caggtaccat 58921 gattttagcc cagtgagacc catttcagac ttctgacctc tggaagtgta agataataaa 58981 tttgtgttgt tttaagcgac taagtttgtg attatttatt atagcagcaa cagggaacca 59041 atacaatgtc ttcccaattt ttgaagaggg aaatagaagg ctcaaaaagg agaaattttg 59101 tagtttttac ccaagctatc atatctgtag tcccaaatct tgatcccttt ctcaggccta 59161 ctaaatgaga atctgccttt caacaaggtc 
ctggcctagt catgtgattt ctagcaagct 59221 cccaggtgct ggtggtcccc aacagttctc cgatttaagc attaaagttc atcagaatca 59281 tggggaagct tgttaaaata acacagattg ctgggcccta ccccaaaagt ttcagattca 59341 gtaggtgtgg gagggggctg acaatgtgca tttctaacaa gttcccaggt gataccaatg 59401 ctgctggtcc agccaccaca ttttgagagc cattgcttta gatgacacta ataaacatgt 59461 aagaggaaag aagtggcact tatcattcaa ctaatatcca tgataggacc tcattattaa 59521 ctgacatttt atagcaggtg agtgatctgt ttaacagtct gtactttatg tacttgctct 59581 gtgaaaaagg caccacatag ccccagggtt ttgggcaaca cttgtgggca aaaatagcca 59641 catctttgga acaccaaagt atcaaatcca tagatgtatt tccccaaatc ccaaaaacac 59701 attttcatgt gtatctatat tttctccttg cagcattcca tatggcacaa ctggcaactg 59761 tggatattga catgctccac atctcataag acaacctgag gcactgggaa tttccttgaa 59821 caagatttcc caacttcaat gctactgaca tttggggttg gataattctt tatttggggg 59881 gctttcctgt gcattgtaga atatttaaca gcatctctgg cctctaccca ctagatgcca 59941 ggaacatccg tatcctccaa ttgtgacaac caaaaatgct tccaaatatt ataaaatgta 60001 ccctgaaggg cacaatcatt cctggttgag gcccactagt ttatctgatt ctcatgcata 60061 tttttatgta gatataatag atctggaaag agattattga gtcccacagc acaccttgga 60121 caggacataa gattcctgag ttgctgtttg gtcagataat gaattcgagc tgatccatat 60181 catgttgagc cttctgtaag ctactgagtg gggaacagat ctttccagat gaacaaccac 60241 acagattgtt tttttttttt ttttccaaac attttgctct tactcaattt tacagccgat 60301 ctgtgagacc caaactccct cagatgttct gtctattccc catcctaaat gtcaggatgg 60361 gagaacaggt taaaggggaa aatcttaaaa atatgcatat ttaaaatttt aaaaacatct 60421 ttggaacatg tattttaaaa taaatgtaag tcagcttgag tttaagggcc tcagtaatat 60481 ccctgataaa aacaagagct acattctgta attgctatag tccatgataa ataaaaagga 60541 aatatacatt ttgtacttac agagaagcca ctgtataatg acaacctact tgtaaacaca 60601 atctgggttt actaacttgc atctgataac aatgccatac acagggcatc aagtcctaga 60661 gggtcaaggc tgcgcagtct aaggcagttt gagagaacac ttcctttaaa ctgagaacaa 60721 caaagatacc agttattacc tcaatcactt tgtgattccc actctgagca ggacagatcc 60781 taagaactag aaaggaccac agtcatgatt tctttagccc gtttatttac aaatggattt 60841 accagggagg 
atgacagctt ccttcactta cacaattctg aaaagaaaag aggtcacact 60901 tatattttct gaaatgtaca tatttctaaa gtccacaagc ttcctctttt tagtccattc 60961 ctgagactaa attagattag tagggtatga tcaaattgtg attgcccact tgttttcgtt 61021 agctacggta taaatcaaat ttcatgcttc cttttgctgc tcaaggggtg tttatagtct 61081 gcagagtaat gttaaagctg acttaagtaa ggctctttga gaaccattaa ttcttactgt 61141 tgtaacataa aacagcccaa atccaaattt agacactttt tgaattattg acaccatgat 61201 tccagtcctt tataaggaag gtaaagacca ctggcatata ctattgaatt ttattttaaa 61261 ataacaattt ctatctttta aaatgtcttt ttatttccat aggtttttgg ggaacaggtg 61321 atatttggtt acatgaataa gttctttagt ggtgatttgt gagattttgg tgcacccatc 61381 accctctatc ttttttagtg accgaaaaaa tgagtgtttt tccccaccag ctattgtttg 61441 aacaataact tcatctaaat aacagttgtg cctcttgccc ctagtcatct ctttcctatt 61501 tgctaaaatg aatcacagag ttttggaact tggggtacaa actttggacc ttcacagacc 61561 ctgaaatagt tgtctttcca caacagcacc ctgggtccag gcacccacct cactttatac 61621 acccctacac ccagctcagt tcttaggtaa ctgtaaatct gtgggtttca actaggggcg 61681 aatttgcccc ctcactcccc catcaaacat atgcgatgtc cggagacact tttggttagt 61741 cacaacttaa gagcgagggt tgttgctgct aacacctaga aggtagaggc cagggatgct 61801 gctaaacatc cctcagcaaa taattatctg gccccaaatg ccaagagtgc taacaccgac 61861 aaaccttttt gtagaggttc tcagagacac attttccaac catatgttgt atgactctat 61921 ttctatgaaa tgtaccaaat aagcaaatcc atagacacag gaaatagatt ggtagttgtc 61981 ttgtgctaag gggaggagaa tgggaattca ctgttactgg gtatgggatt tcttttcagg 62041 tgatgaaaat gttctagaac tagactatgg tgatggtggc aaactttgta aatacattaa 62101 aaatatacta aaacccactc aagtgtaccc tttcaacaca tgaactttat agcatataat 62161 tatgtattaa gttattaaat ccgtttttat agacctttca ttaaatagct tttatttctt 62221 actaaacaag agcttatatt gtagagcagt tttgaatagc tgtaggtttc tctggagaat 62281 attctctggt aacagttttt gaatgcaacc ctatatttca caggaaaaac attatgttct 62341 gtatttttaa agacttatct atttccagtt gttcttataa tgttatcatt tgatctgggg 62401 ccccaaggta tgcctcattc atctgtgagc aagtgggtga agagtgtctg agccagggcc 62461 agagcccagg ggtcttgcgt ctgctctgac tctcaatgtg atgtgaagac agccaagtca 
62521 tctgattgct ttgtgtttct accttctcat tcttgacaca agtctgagct tccagacgaa 62581 ctttcttgta gaaaccctag caagcaagga cagtggatac aaatgctcag tgctttgcca 62641 ctgcagaggg gacacaagtg gatttccctg tccaacattc agggtctctc tgttcccaac 62701 agcccatctg ccattgccct ttttctacct aaaacctctg ctgctgccta gctggtcttt 62761 ttctgcacac caagcatgct ctgcccattc ccgtttctct ttcttccaat gactcagccc 62821 tgtgaggaaa tagcctctat gtctatccaa atccttcaca tctccaaagc ctgaagcccc 62881 tggcacacaa tgaacaatag aattatttta ggcaaggaat tctacattta attgctaaga 62941 tcatggaagc atgagaagat gtcacctttt ggccttcagc tcctcttatg ccctccacct 63001 tgcaccacac cagcagtgct gggttcctgc cagctagcta acacactggg ccacttccat 63061 gctggtgcct ttgtgctgat cccagagatg ctcaggcgaa cacctcacct tcttcaagaa 63121 attgtttaaa actgaccacc ctattgaaac gtgcaactca gcacgtctaa tccttcttag 63181 ctgctctact ttttcctttt ccataggctt tctcttctgc tgtccttatt tattatatcc 63241 attgttcata gttgctttct gtgagactgt gagtaggagt ctttgtgtgt tttaatcact 63301 gatgtaaaac ataaacccag aatatacccg gcatgcggga ggtgaccaat aaacatttaa 63361 acatgttaaa tgcttagaac catggctggc aataaaggtt agtttcatct tcatcatagt 63421 cgtcatcatc aactttatca tcgtggggca acacttacac tattgttctg taccattaca 63481 ccgtgtttag ccttttaatc tatattttac ataatgtata aaggctgtac attttttgag 63541 ggccgatatc ataccatata taatatgtgt ccttttaaaa attaagatat aattcacata 63601 ccataagatt caccacttta aagtgtacaa ttgagtggct tttagtatat tcacaagatt 63661 gtgcaaccat caccactgtc taactccaga acatttctat catcacaaaa agaaacctgg 63721 ggctcattag cagtaactcc ccatttcccc cttccctcaa cccctggcaa ccatgaatct 63781 actttctgtt tctatggatt ttgcctattc ttggcatttc ctatacagtc atacagtata 63841 tggtatttta tgactggctt ccttcactga atgtaatgtt ttcaaggttc atccatgttg 63901 ctgtatggat cagtgcttca ttccatttta tggctgaaca atattttaaa aaactactaa 63961 tctatgtatg tctccactga acctcagcag gaaagttgaa cttactttgg tatattttac 64021 ccaataaaca ggaataaaaa tgcttgtttt tacattactg tcctacctct tttaacacag 64081 atgtaccatt ttcactcctg acagttacca gttaattata atccaaacaa cacactcttc 64141 aactgcatat aatttccctt gttgaaaatt gttccacctg 
atatttagca aatgcctacg 64201 gacttttcaa attacaagct gtatgaactc ttgctgtgaa tgtttatatt attaaaatat 64261 ttcctactca tgatttaaaa gggctctctc actttaaaaa atcgtgtgtt tttttttgtc 64321 ttggatttta tgttgaacct tttttccccc taagaatgaa cattggatat acccagggcc 64381 tatgcaagta gcactatcac ttactcatct ctcttgctga acatgagtga gaaatatcat 64441 actccaaact tctcgtattt cccctgcatt tgttgcattc acagattatt taaaagaaat 64501 aaatttaaac ctcggctacc cagagatttt caaggagtcc ctagataact actgtgtttt 64561 ggcgaatatc tccccaaatg gagagaaggg agagcaggga gggagggaga aaaaaattag 64621 agaaggggac tttaaaaaat ctatttttcc acttaacaat gtatcaccaa catcttttta 64681 tgttgacaaa cgcagagtga catcctcatc ttgaatgcta tttgtgccat aatcgatttg 64741 aaccagcccc gctattggta gacattttgt tttaaatgtt acttcatagg ccaggcgcag 64801 tggctcatgc ctgtaatccc agcactttgg gaggccgagg caggcagatc atgaggtcag 64861 gagatggaga ccatcctgga caacatggtg aaaccccgtc tctactaaaa atataaaaat 64921 aagctgggca tggtggcatg tgcctgtaat cccagctact tgggaggctg aggcaggaga 64981 gtcacttaaa cccgggaggc agagattgca gcgagccgag actgtaccac ttcactccag 65041 cctgggcgac agagcgagac tctgtctcaa aaacaacaac aaaaaattat ctatcatagt 65101 gtatttgttt ggagtatatt ataatttgaa ctaagaggtc agagccaaat gtaatgagac 65161 tgttaggata atagattcta gcgcttttac agcatctccc ttttcagata cttatcagag 65221 gtcatttgca cagaagtcca caggagcagt tgagagcaat gctgaattat ccatcttagc 65281 tattgaaaag gtccatctaa atcactgtag gatttctttc agaatccaca ttttctatgt 65341 ttacaatgtc atggaaaact tttctacatt ttcattaatt catcaacaca tttatctagc 65401 acctactttg tataagagct ttttctaggt accaaggagc tcccaagaaa gtataagaca 65461 caatacctga tttctaaggg ttgcggtggg gtcaaagtta cgatgtgtat cggaattcat 65521 cggtggtttt gtgcctaaga tctgaagatc atacaactcc aactctaccc cagcaaaaaa 65581 aaaaaaaaaa aagctatcct agagaatgat gttttcttgt aagtaaacat ctacagagaa 65641 aatatttaaa agccagcttt ggctccagct acttaagaag actcactgtt gaaattattc 65701 cagaatattg gataaccctg gggatgcttt aaaagaaggc tcaagagttg aactcttaca 65761 gatgatagga aaggctttgc aaactcataa tacctgactt atccaaaggc caccctcata 65821 ccccgagtgc tgttgaaacc 
caggcagtaa gccagtggag aggaacccca aaacaaacca 65881 aacaccctcc cagcagtcaa aaataagtgt cacaaaaccg aaattcattt gcttttctca 65941 caggcgaggc cctcccaggc cagccccact tagagtgttc ctttactttc agttcacgca 66001 tacttttaac aagcttgggc atatgatttt ctagcatact gtaaattaga agcagcaaat 66061 tagaagtggg gaagttgcta acataagcat tgtgagaaca gcaaaacgta ctgggctgca 66121 agtcaagaaa aagaatacct actacaaaaa aatgagcaaa agaactattt tatgcagtca 66181 atttattcat aaataaaata attgcttata ggaaaaatat ttcaattgaa atttcatttt 66241 cctcaaaaca tttaccaatt tttaagctgt gtaagtgggg cctaaattag tggaagagat 66301 ttcaaacaga gatccatatt ttaatgtcct atgattcagg atatgcagta ggggaaaatc 66361 aattatttct atggaatcgg ccatcaaagc ttgttcaact gtccatggtg aattagcggt 66421 aactcactat ccagtggtcc agaggaatta atgccacatt aaggcagcat cctgttgatc 66481 attatggcca aatatatcat cctctcattc ttagaaggtg agaaaaagta taagatcaat 66541 ctttctaaaa ctacaaggct actagtaatt catcattgac ttcattagta gagattaccc 66601 acaactttgt agaaaaaaaa aatatcctga atcctttcag acactccaat atatgtaatt 66661 atataaaatg ggggaaaaag attgaaagtg aataaaatgt gatccacatt ttccttcctc 66721 tcccttacca ccttgctgta cacccctgcc aatcctccta aggtccattt tccaacttcc 66781 ttcaaaggga atccttcttg tctctgctat aaaattcacc aataaatata ttcataaata 66841 aagccaaatc ctggggacat ttggctttta gaatcatatt ctatgtttct tttcatgcat 66901 acattttacc tgttataatc ttctacttcc ttctttgcaa tctctctctc cctgattgga 66961 aggtcttacc tttctttgca atattcatcc ctactgattt atccccaaag atcatcttgg 67021 aatcttgttt attagcattc tttaaacatt tactgtatgc cagctgacag ggttggggag 67081 agtaataaat agacttccat ttccccagag atgtcactca tctacttttc acatgaattc 67141 cctgagctag gttctgttat tatttccatt ctacatatca ggaaattgag gctcagacag 67201 gctaagtaat ttgtatatat acgtataaac atatgtatat atacaaactt tataagtatt 67261 acttgtgtat tcaaagagat ccaagtataa catcaactgt atctgggtag atattcagta 67321 tctcctttat tgatctaata ttgatggacg actcgtcagt tagctcctgc cactaattac 67381 aactttgctc aattagatca atctaatcaa attccagaat gaaagaaaaa gcccatgtgt 67441 tataccaagg tagaggggta aaacacacat gaatttaata ggtagatgat cagaggttct 67501 
tttagaaagt gaatagcttt ggtcattgct tagggaaagg ggaggctgct gagaacatgg 67561 aaacaaagta gctagccaga cagttaagag actctgaaaa atagagagga atttgggttt 67621 gcctgtatga gaaagtggga ttgggccaga aggatatggg caaagaaggg ccgccatact 67681 gagaggccaa gctgctctgt gagatgggag gcaaaggaca acatcccctg tgacatgagt 67741 ttgcttgtaa aaaacaaaat ttactttaaa atgtgctgtc tggtgtgcac tccttgcctt 67801 gtagtataga tcatcagtaa aagctttgct actgcaagaa ccttaagttc ctgattgttt 67861 ttacggtaga tgggtttcag catggggccg agaaggggct ttgtcccctg ttttggtgga 67921 caccagaggg gtgataaagg agtagccaag gggaacagag aaaaaatacc ttcaatggaa 67981 gcagataata gagcagtggt gatcccaaaa ttataattcc tcagagagaa actagagaga 68041 gctttggggt tttacccaaa tgggaatttg aatcaatttc tcctgatctg aaagggctgg 68101 aaaaatcctg gaggatatcc agattacact agagaatctc aaccacctac cttgattttt 68161 gtaaagaaca acgttgtttg ctagaactga ggaaaaggcg atggcagaga agctttttaa 68221 aaataaatct gaatatatca acatgtacag tcaactaatt tgctgacttc aaaatagcgt 68281 gatgaaactc tttgcctgtt aaacctatta ttgcccataa tgaaagattc catatgcact 68341 aatcacaaca tacagcctta tgtctgatca tagcttctaa aaaatcatga tacagggatt 68401 tttagttata ctcaacacat ttttccttta gaaactggat tggttgttac agatgccatg 68461 aatgatataa attgagctta tagttggaag aaatctaaag gatcaagcat ccctgagttt 68521 caaacagaaa cttgcactga atacattcaa aggtatgtgg attttattta taatttgata 68581 tcctttttat tgtgcttatt tctttctctg catttctttt aaaagatata aatgtattct 68641 acatttctaa tactatatgc catcaatagt tactttttag ttggcccata attaaattag 68701 actctgatta ctgaaatgat ttttaacttt ataatgcaac aaagtgcttg tctaattcca 68761 aaggtctttc ttggccacat ttcacagtaa aatatttatt ttaattaatt atcgattaaa 68821 gtaaaaataa tacattcaca ggctgtgatc tactactttg aaaagctttt tatttaatta 68881 cataaaatag aaccatccat gaagtgtttt ttctcattaa cccaaacaaa caaacaaaca 68941 aaaaacaaat ttaggagtta gaaaacttta gtgtactttt catagacagc attatcacat 69001 gtaatattca cttcttatac aaaatttaag gtctttcttc tgacagtttt cccagtaatt 69061 tgtgccacaa atcaatgtgc acattttagg aggacctgtt ttgcttaatg tcccaagggt 69121 cttatttata gaataattac agtgcaacat caaacaaaat gaagattcca 
ttgaaagaaa 69181 atgaaacctc ttagggagaa agtattcaaa attcattgaa gaatcacata aaattcgtac 69241 catcctctcc tgttggttca gccacatctt ttgctccaca atctggagct cagagatgtt 69301 actggagggt cttgcctgaa gttctgtaga taggaatttc tggaactaaa atctatcttg 69361 gcaaattaat atattttctc aaatgattag tccttgtttt taacagtagc aatgtccagg 69421 tagactgtgc acgagacatt tctgtgcctc cttgttctaa aattataaaa tactgtgcac 69481 tttgatgatc ttcccaacct ttagagccaa ctttattcct ttaatcatta attataaata 69541 caatttagct tctatgacta agggctgtat aggcatagga aaatggcatg caaagtcata 69601 tgaagataga tcattcatgt ccctgtaact tgtttagccc tcaacctaaa ttaaatgcat 69661 ctttatcact actgtgactc cagaaacaat gaaataggct gagagctgga aatcacatat 69721 gactgaagta aattcacaaa caatggctcc atccttctta ctagcaatag actaatgtag 69781 attatgtgtg ttttatggag cattcattac atccatgttt cagaagagat aagaaaagtg 69841 gatgttgact tacatttcag aaccatcaag aaatggggac ctggatttta tttgcctgcc 69901 tcctgggagc agcttttgcc atgcctgtga gtaaaacacc ccttgcataa gtcagtgtcc 69961 aatttcacaa acttggacat aaaaatctgc tcatagttgg tgaaattagg gtttaaaaca 70021 gtatgagatc agatgtcttc atatgtctct gggttgaaga aacacttcag gagcttgttt 70081 taaaaaggta tattctcaaa tgccgctacc aaaaattctg atttggtaca gctggggcgg 70141 ggcccaggac tctgcatttt tataagcacc ccaggagatt ctgttggaac tgttagcttg 70201 taaatatcac cacccatctc tagatggagg aagcttttgg aagggaccct tgaaaggtct 70261 ccagagaaag tgcttaacca gctttggaca aatattacag agatgccagt tttgtctaaa 70321 acccaattcc tctcaagatt ccaaatctct tcctgccctc cacatattgc tgctcttacc 70381 cctcaggggg taagattttt gtgttaggaa tccacttttt gagccacatt cctgttctca 70441 gaggccccac ccctttgggt acatctgatg agtacgtgac ttttttgtgg ttgcccggga 70501 tcctgcctac acccaataca tagtgatata ttttatattc aaatgtatta aagatcagga 70561 cattagaccc actctgtctg gacacaggtg cataagggca gagaacttag atcacatgca 70621 tgccaagcag gctgcatctg ctttttggtg gaataaagag cagttgatac aaccagaagc 70681 cagcaagctt gcacccttgc ctcctctctt cctctctcac ccacaaaacc aagattctgt 70741 gctctgactt cctgaaatcc tgcattgcag cgattcctag tttcctcagg ggtccctgcc 70801 ttacaaattc agcccttttc ccactctcta 
agatggtatt gttcaagagg gtataggacc 70861 tgactaaaac cattcacatc cagatgagag agaataaacc ttcccatgaa atgtgagcat 70921 ttttaaagaa tataatagag ccaatcctaa atgaggccaa tgtctgaggg agattttctg 70981 catgaccgtc tctcctccct ggaccccagg cacaaccaga gttccacaat acaggcacca 71041 gacacttggg ccccactttt gagaggacag acagtcctga ttcagatcat cattcaggtg 71101 gccttttatt tttaacttga aaatttttct gccataaaac cgttggcaga ataataaaaa 71161 aagatcggcc taaacgtcat aaatcaactc tttcattttc catgggattt ataatgtttc 71221 cactttgctc cacccatctt gttctggcat tcatcaataa ttcccacctt gtggtttctt 71281 ttgtggcttc ttttgttgta tggaagtgtt gtttactttg catgtcaatt atgagaaatc 71341 cctttgatac ttaaaagtac ttgaccacct cctgatctac aagggaacat tcgtgagaca 71401 attctctcct tgttgatatt tctttattgt gaaagagaaa cacacttctc tccccattat 71461 ggggtgagtt ttatcagcaa gtaaactgtt gggcaggata gggtaggatt cattgaaaac 71521 actgggcgtg ggggtgggat gctgtccaaa gatcactgga gaataaatgt gtgctggttt 71581 ctgcttccag gtctccagcc ataaggctat aaccccagga acagtcattt agctgttctg 71641 aacctcttta aaataaaagg tctcctcttc tatacagcac atttgttcaa actaaaaaca 71701 gacctcaagt atattctgca ctatatagat ttttttaaag tagcttcagt ctcctttaat 71761 gtgaacaatt gcatactgac ttaatctctt cctctctctt ctcttccttc actctctccc 71821 ttcctctctc tttctattct cctcccctcc tccctgtaaa agctaccacc tcatcctggg 71881 caccctggtt atatcaactt cagctatgag gtaatttttc tctttactaa ttttgaccat 71941 tgtttgcgtt aacaatgccc tgggctctgt aaagaatagt gtgttgattc tttatcccag 72001 atgtttctca agtggtcctg attttacagt tcctaccacc agcttcccag tttaagctct 72061 gatggttggc ctcaagcctg tgtcgtccca gcagcctccc gcctggccac tctgactcag 72121 tctgtcctcc taaatatggc cgtaagctta cccatcatga accactactc agggaggctc 72181 catgataggg caaaaagtaa actctgacca gcttggttct aacccagcta gtaaaatgta 72241 aggattaggt aagatgttat ttaaaactct ttccagctca aaaaactcct gattctaaga 72301 tagtcacact ctatgtgtgt ctcttgcttg cctctgctga aatattagtg actaagtggt 72361 ataggagaga ctccgcagaa cagcggaatg catgagtttt ggacgtcggg tttgaggttc 72421 tcctcaacct cttactaact ttgtgatttt gggcaaatca tttcctcttt ctggaaccct 72481 ggtttcctca 
tctggagaaa ggaaataatt ataataacca tatttcaaaa tattgtttgg 72541 agagtaatat agttaatgaa tatgaaaagt gctttgtcaa gtataatatg agcaaggtta 72601 ctgattattt tttgtatcga ttaaatgccg tattactata tgaagaatcc tcaaacctaa 72661 ggctaaccaa gtatatatac tgttcagaaa ggaataagat tcttacttct ctcacaggtt 72721 caggtaacaa tctatgagtt tatttactta taaaagctga agacaaatgt tagtaagatt 72781 ttgaggcaag attttctgtt gaaccgaaaa gattgacaca tctgatcagt caatctgtgt 72841 ttctaggatg agggacagtg tttgcacctc tctttttccc attgtgacat caaagaaaaa 72901 aatgaaatta acatcatgtc atattattat gtcataattt tgtgtttgtt ttgctcttac 72961 aatgaaaaga aggaactatg gaattaaaca gatttactcc ctgtgtaacc tcagtcaagt 73021 taatgaatct ctttaactcc ccataacctt atctaaaaag tgagagtaat aatacttgcc 73081 tcctagcata taagaaaaga tgaagaatgt gtgtgatgga tgtaaacaca gtgcctgtca 73141 cacaggaagc acccaacaaa tttttacctt cttctttctt ttgtagaact cacattctca 73201 ggctatcaat gttgacagga ctgcattagt gagtctatat ttcctactgc atcagtgagt 73261 ttctatattg gatgaaagta aattaaatca aatgggttct aatatctttt tctcttaagg 73321 tgcttacccc tttgaagtgg taccagagca taaggccacc ggtatgtaga cattttgttc 73381 cttattccct gaaaatatta ggcatgcatt aaaattccca tattaagtga aatatcatgt 73441 ctactccaca tgcagacatt aatgggaaat ttagtttgta aaaaatcata tctgtgtaca 73501 cagttacaaa tttttgcaaa ggaaaaatga ataaaatatt cctatagcca taatggcaaa 73561 gaaaacactg ctgcttctct ggttggagtc acctgagcca atggtaaacc tgcctctctg 73621 tttctcacca gtacccttcc tatggttacg agcccatggg tggatggctg caccaccaaa 73681 tcatccccgt gctgtcccaa cagcaccccc cgactcacac cctgcagcct catcaccaca 73741 tcccagtggt gccagctcag cagcccgtga tcccccagca accaatgatg cccgttcctg 73801 gccaacactc catgactcca atccaacacc accagccaaa cctccctccg cccgcccagc 73861 agccctacca gccccagcct gttcagccac agcctcacca gcccatgcag ccccagccac 73921 ctgtgcaccc catgcagccc ctgccgccac agccacctct gcctccgatg ttccccatgc 73981 agcccctgcc tcccatgctt cctgatctga ctctggaagc ttggccatca acagacaaga 74041 ccaagcggga ggaagtggtg agtatatttt gaagccacta caatgcaaat cctgtgaaaa 74101 tggtgcagca aaataggccc cagagttcta aggtctccga caaccaagga tctagagttg 
74161 tagtagttac aggtctatga ttctattagt ccaagcaata tgctatacct ttatgttaaa 74221 gacaaattcc tctaaatggc ttggtaatta agaccacagt ttttatggta ggtttcaatt 74281 ttactattac tgaatttcta ccagaatatg tattaccaaa acccattaat agaaatatat 74341 attactaaac cccatgaatt ttaagggcaa cagtataagg gaatatcagt tctccttata 74401 tttcaaaggt ttgactagca agaataggct agagttgcac tgaaggctta agacaagagg 74461 gagcggataa ttttgagagt gcaaatatct gaacaggcta caaaaggtag acgggaaatc 74521 tcttcaaaaa cctacaggaa gattccccat ttccagtagt tttcaatcta acttggaggc 74581 ggctaaacta aacatactgt tagattcctt ttctgtactg gggttctata gatgattaag 74641 cttttagcaa gaagttactg caatttagca ctaaatcttc cattacaggt agctcttaca 74701 aatgaatggg aatagtcaac aaaacaaact taatctacat ccataaagtc ttacttctat 74761 gtatacgaga ttatgtgatc ctatcatgta tatgtatcca actgtaattc caatttatac 74821 atgttattga tgatttgcta ctgagaagaa gagagaagtg aggtggaaat gaccaggata 74881 agaagccagg acacaaaggt tccaattctg gctttgccct caaagacgag gcagtattgt 74941 aaaagttact tcaaatgtat cggtgtttta ttttctttta aatgggggaa aatgacgaaa 75001 tcagattatt ttcaagtctc tgtccaatta taaataccat agtttctgaa tttaaaaaaa 75061 tcataatata tgtcataaat ggcttcataa ttgtgagcat gtttacggaa aatatggggc 75121 agaatttttt gaaaattgat tgaatcccaa gtaatcggtg cctatcattg gctattctag 75181 tccaaggcac atgttctctc tgtacataga aaatgcattt acttctttat gaataattaa 75241 taccatgaac tttataatgt gctcacatct tgacaaagct atttatggaa aggtgacttt 75301 gggcagatag tttgaactct ttaaaactca gtttcttttt gtgtaaaatt tgagtataaa 75361 cattgatagt ttcttagagt tgttttatgg aaaacaaaat agcgtgataa gcttggagcc 75421 tgacatgcaa gacgtacccc ccaaaaaggt agcaattgtt attttattat aaaataatag 75481 gctttaagtg tcctgaaggt ggaagcagtc ctcatggaca cctaatatct aatgacaacg 75541 aaataacaaa gaaacttcag aaattataga gttcaactta aatggttgta cattgttttg 75601 acaaaactga agccagacat gttattgtaa atggtactca ctaggaacat ttgtaaatta 75661 ttttaactgt tcttttgcaa tttttttcag gattaaaaga tcagaagatg agaggggaat 75721 gaatacttca gatgctttca ggagtgacac aagaacacaa tgatttttgc ttataatcac 75781 tttacttagc aaattctgta actaaaaaag taccattagc 
agacaataaa atgcattaaa 75841 aatcattcat gtctttgtgt tgaatgaaac ttaaattttc ctatcatttg agatgatata 75901 ttatgaatga gtattcttga atcaggtagg aactgctcat ttagtcatac aaattcacca 75961 tttggcaagc aaataaggaa tatatttctt tttagagaaa atttaaatta tactgtctca 76021 aatctttttt aaaaaacaaa gtttatacat cagttaaaac agttgataaa atgaaaattt 76081 atgaatatgt acaataacta aaacctgaac caaatgaagg gaagtttata tgaaaaattc 76141 tttacattaa gaaagataaa cttttgaaga atgtttatat tatgaaacaa aataggacat 76201 ggaacactgg ctttacactt cacaccacac tggcaatcaa aatctggtct tttgatctca 76261 ttctatggaa aaaagatttt taaatgaaat tatgtgatat gcataataat gtactgaatt 76321 taaagttatt caagtaattt gttttcctca aaaaatgtta tcaatggagc ttcaaataga 76381 gaattgcagt gattttaaac acaagtcatt tctttcacct cataacttta agataaattg 76441 tactgtatag tcttcaaaat gttaacaaat gcataaaatg gactcaaaac aagttttctt 76501 tctcactcat atttatattt gaatcctttt ttcttcctac acccacacag aattgtaaac 76561 agtctttgca gggtttatat tacacaaacc catggtaact ttattccagt acctttatga 76621 ttaaggcccc ttaatcattt tctggaatga ataaacataa ctttattggg agggtgtcaa 76681 tattcaaaca aataacttga ttttaaagaa tttagattca attttttaaa ggtagaaaag 76741 cttttaatca attggcatgg accaattcta atcaattgag tcaatattta ttaagcacac 76801 tctagggcaa ggcactattc catgtgctgg tgaaaagagc aaggccccat ccttaaggaa 76861 tctagaatct accaaagggg taaaataagc ccacaaataa caattaaaaa ggccaaatgg 76921 tattaatact atgacagaag tacagtcaaa acactctggc aaatatggca aatatccttg 76981 gctagctgtc caacagcttt cctcaccact cctttcctgt tggcagagct ggaatcaatc 77041 ccagcattca gccacggttt ccaaatcaaa attcagatac gtagtgtgat aatacagagg 77101 aatgtttcaa accagagaca tacattaact ggtaattata ctctaatact tgcactgtca 77161 aatatggtag tcctgtccta tgtggctatt taaatgtaaa taaattaaaa ttaaataaaa 77221 cttaaaattc aattcctccg tcatagtaga cacattttaa gcgctaaata gccacatggg 77281 gctagtggct accattctgg acaacacaga tacagaatat ttccatcacc attcagtttt 77341 atcagacagc attgctctat gtactacttc ttgaagttga aaaataattc ttattaattc 77401 aaatatttta aacaaaagat gaaacaaaac agatttgctg gttgaaactg aaaatctcag 77461 ctacccacag caagttgttc 
cccaagcatg ccagtaatcc tgatcctcta tgagaatttt 77521 ctaagaactg tagaagtgcc attctcttcc tcaggccaaa ctttacatac taaaatggtt 77581 tcccctctag ctttcattta gaacagataa atccagagtt cagaagatat ttacattctt 77641 ttctattttg tggttcaaac taatagtgca acataacata gagggaggca cacattcacg 77701 ttacacacac tcacacaccc ataaccacgc atgctttctg ggtttacctg tagtaaaaag 77761 acacatgtgg agacatgtgc catttgagta acttgaaatt ttaaataaaa cttctttcca 77821 ttacaatcat ttagagaatg attatgtggt tatggtattc attatcacca ccaagacata 77881 ctacagttct tccaatacaa aaatcatatc aagaaaaaaa gatcaagctg tgcatttcat 77941 ttccaagcaa tgtcaagttt accatcccct ttctcaactc cacacaaaat aaaaatacag 78001 acttttctct aaatccgaag tttttccttt ggtagtaaac aatagctgaa aataccaaga 78061 ctgagtttaa ggccaagact tttaaaaaga aaataaatta gtagacaact gtattaattg 78121 ggaaaaaata acccaagaga tcctagtatg aacagcttaa catgagacta acaagagaaa 78181 aacgacttca accaatttcc tacatttcca gaacatagta gcatttagta gtactccaca 78241 ttgctttaat aaaatagtca tcagcttatg tctgttgtga aattcttgtt agttggagtt 78301 tttacttatt tttgtgattt ggctttcaat tccttttgcc cctatttgat ctggagcctt 78361 aacagcaagg gccataggcc gacatttaga tattaaaaaa tacctccctg ttgggaaaag 78421 tccatgctgg cctcatttga aacacctgac tcttccagac gataaataat tattgaggaa 78481 taagattggg ctgggagtgg tgaggatttg aagagagaag aaaggagtgg tcatttgggt 78541 tcgaaacatt tacctcaata aatcacactt gaatcctcta tcacaaaatc taaattaata 78601 aagcccattt gcaaatatca ttgtcctatg gtcaacaaaa aacaccacag gacccagaat 78661 tcactttact attaatttta aaaatgaacg atattaaaat ctcaattcct ccaaacagcc 78721 aaattttgtt tgcctctcca aataattttc catgcttcat atttggccat ggaatgtgca 78781 ggctctgggg gtgtcagtta agtgtccaca gaagcacaaa gccggctttc cagtgtgatg 78841 gtagggggtg cagcctccat aaccaaaccg tttgattcac aatcatgacc attgcctatc 78901 tgctgtggaa acttggacaa tttttttggc ctctctatgg ctcagtacac tcgtctataa 78961 aatgtactaa tatcatgctc tgctctgagt cttttcatgt taatgagccc caagctcaat 79021 aaatgttagc tacagttctt gtaatcagaa agagagcttt gtcctgcctc tgtagactcc 79081 caaagagaac actacccagt tggaagcagg agtgatacaa aattaggata tgacatggaa 79141 
gtttttgatg ccacaggcac tgaagtacat cacgtttgtc tttttttttt tgagacagag 79201 tcttgctctg tcacccaggc tggagtgcag tggcatgatc ttggctcact gcaacttccg 79261 cctcctgggt tcaagtgatc ctccagcctc agcctcccaa gtagagggga ttacaagagt 79321 gagccaccac acccagctaa tttttgtatt tttagtagaa acggggtttc accatgttgg 79381 ccaggctggt cttgaactcc tgacctcaag tgatctgccc acctcagcct cccaaagtgc 79441 taggattaca ggtgtgagcc accatgcccg gcccatgtct gtcatttatc agtgcatcac 79501 tctgacagtg tagccacctg aacacctccc ttactgattg tgaccctaaa tgatctttgc 79561 cttctcctac ctgatatccc cagacactgt tctttgctgc ctccaagaca agccttcatc 79621 ctcaattcac cactctgagg gtaacaggag agcaacacca ggcaagtgtc atgggggcgc 79681 tccactaagt ggaggctaga acttgagacc acctctctct gcatgtcttc cttttacctt 79741 ctttctaggg ctgagccttt gtgtacagtg actcccaact cagacccttc ccttgtcact 79801 ctggcattgg cacaattcag ctaattccac attataagga tttcattttt ctttctttta 79861 cttttttttt tttttttttt tttttttttg agatggagtc tcgctctgtc gccaggctgg 79921 agtgcagtgg ctcaatctcg gttcactgca atcaccacct cccaaattca agcgattccc 79981 ctgcctcagc ctcctgagta gctgggacta cagccaggca ccaccatgcc tggctaattt 80041 tttttgtatt ttagtagaga cagggtttta ccatgttggc caggattgct caatctcctg 80101 acctcctgat ctgcctgcct cagccaccca aagtgctggg atcacaggcg cgagccactg 80161 cacccggcct ttaccaaacc gttttcctct agatgagtgg ttcttaagca ggtacagttg 80221 tgtctcctag gggacctttg gcaatgtctg gagtcacaac agggatctaa acaaatcaaa 80281 acagcttggg gagtggtagg ggtgttaagg agaatttgca aaataaagaa agtgctgtcc 80341 cgtgattaag tcagctaatg tctctaaatc aaaactatcc tgttaaaagc ccagatttca 80401 ccactacaca atatatccat gtaagaaagc atgtccaccc tctaaatcta taaaataaaa 80461 aaagtaaatt aaaacaaaat ttttaaaaac cgtccgacag aagttagaat agttatttgc 80521 ttcttcagaa cttacttaga agtgagtatt tgtttcttgg gaggttacaa gaagcttgtc 80581 tcctgtgagt taattgttcc tgcttgtttg accttccaaa gatgtgctaa ggagctaatt 80641 gtaaatcaga ctgaagaacc tctgaaagtg attgtgtcta cccgagtgat aaagaggaga 80701 aaataataaa tttagccatg agtgtaacta ctgaattcaa ggttattcaa ttagtctggg 80761 ataagttctg ttctgagatt tgcttggatc tggcagaata atgagaaatg 
agtggttctg 80821 tccataaagt gggcagtaat ttaagatgcc ttaattctca tgattttcag aaccaccact 80881 ctccaaaaga atgattcttt tgattaagaa gtttttgtta caaaattgtt acagaataaa 80941 acccctacag ccatggagaa tataaagcag tgtattgtaa ggagagaaga tgattaagca 81001 ctgaccttgg gcagctgaga atatctgaca gacccaggcc atcgaacacc ctatctgtcc 81061 aaataaccaa ctcaaggcac ttctgaagtg aggatctccc tgcctccttg ctgctgtatc 81121 ccaaacatcc gggagcagcc atcagtgacc ctcacctcaa tgccacctct tctagaaatc 81181 ttctctcact gctctacgtt ggctggcact cctggaagtg aggttcccct gtctccttgc 81241 ccccatatcc acacatctgg gagcagccat caatgaccaa cacctcaatg tcacctcttc 81301 tagaaacctc tctcgctagt ccacagggct ggagtgcaca ggctcagagg tttcccagga 81361 tatgttgacc atgcctctat ttcaaacaac tgcttctttc tacttttgtc tcccccttag 81421 ctgtcaagct ccttgaggca gatttggaaa aatacctgga aactggtcag tatgcttaat 81481 atctgctcaa tgagccatca ttagcattaa acacccagga tgctgtcata taaaagtaat 81541 gataataata acactgaaca tttactgggc ttattatatt tcaggcacta ttctaagcac 81601 atgacatgta ttagctcatt taattctcac aacgttatag aattgtctct atcaccaatt 81661 tacaaatgag ggtttgaact ttaaatttta acaagaagtt aggctctttt aaaacacaca 81721 cacactgccc cattttttat gagtaccatt tgaaagataa actaattcgt catccgcctc 81781 taggattcta attaaacaat ctatagctat ttttaatggc aaatgtaaat tcaactccat 81841 tagagtcatg aagcttgtct cctgtgactt aatttttcct gcccatttgt cctcccaaaa 81901 atatactaga gagctaactg taactcaaaa tgaagaatcc ctaactcatt aaatgctctg 81961 tagttgcatg agggagttgc ctcttgtgct aaactgccaa agatggggag gagacagcta 82021 caaacatagg caagtatccc ctacgctagg acaacattgc catggcaacc agggtaaggg 82081 caaccagctt tgagtgtgaa cctgctcatg gatggtcctc atgccttctc tggttccaaa 82141 ggatggatgg aaaacaaaga aacgcaagtc cagtgctcgt agaaaccaca aggccctgtc 82201 tttctctcac tgattagttc aatgaaataa tcatgtgttt taggagaccc aggccttggc 82261 cctccgctat cttccttccc agacagccca ctgaacacac actcaacgca tacagaaagc 82321 ccatttcaag cagcattctg aaattcaccg agaatgacaa gaataataag gctactgata 82381 tatccctacc atgacattga agcagaacta attggcttgc atactctctc tacctggcat 82441 ctaatgccca gattaaaata gatgacaaag 
ttcaggtacc ccaatagctt cctgcactgc 82501 agcctctaca aatgttagct atcaaggcag cagtccaaac tggctccttc catagggtcc 82561 tgactaggga aggaaatgtt tgcaaatttg cacagctgca tgcaataaca aattcaggct 82621 ctgcaggttt acaaaatccc atcttctccc tgtgattagg aacttgggat caacttgttt 82681 tccacaaatg atttttctta atctggaagt aggtacaaca gaggaacttc aaaatctgtt 82741 tcttttgaga caattctcaa tttttaaagt acttattaaa cattgctaaa atgtgcagtt 82801 cttatcagca ggatttacaa tgacacttgg aaagcaatgg gatttactgc ttttgtggct 82861 tttttttccc cgttctggta gtgaaaaaat gttcccgata aacagtaata gatatccaaa 82921 aatctgcaat caccaccaca ttagttgatt tgatagttgc tattactgta aataatcaga 82981 atcacaataa aaagtctatg cctacaaaat gaaatcactt aacaaactgt aaataatcat 83041 aatctaagag cagcaaattt tcttcaaaag caacttggct tttaacaaac taaataattc 83101 gtatatattt taattacttt atacctatgg tggcatgtgc acatgataga gagaggaaga 83161 aatttgcaaa tgaattgatg tggcctgatc atattgttta aggtaataga tcaccaatct 83221 gtagaaaagc caatgatttc actttcttgg tgttaacaca caattcacca aataaaccaa 83281 tataagttta ggaaggattc aaagcagctt ttacttgtag ctatatgaaa ccagcaagta 83341 tttaattagg ctctgtgaac acttcggtaa caatctaaat taatattctg aaaagatgag 83401 ttttaaggtt ctttctatga aatcattaaa gaaaaaaccc tggaatcatt aaaaaaaata 83461 aaatttgcac cagatgctac ctttattgta tgcaaacaac aaaaccaaag ttttctgaac 83521 cagccatctg tctttgctgc atggagtggc cacaggacag agctgaacca cattttgtgg 83581 ttccacatta gtaccctaat cagttatcga tgttatagaa tcttgatcct tcatcatact 83641 aaggaaaagg gcaacctatg cccttgtccc cagtcgtcag tcattatata gtacccaatg 83701 attgagaccc aaagagacac tccatagcaa tgatgttttg gctcatcctg gattagtctg 83761 gtgatcctgg caatgcctga ttcactctga acttcagttt ccttacctct agcacaaaaa 83821 cactggactg gaacagtaac attaagaccc atgcaacagt aacattataa gccaatgtgc 83881 atgttaaagt caaattccat tgattttcaa atactgagaa aatagccaaa aggaatggtg 83941 cagaagaaat ggtgggtcct tactatgatt tgctctgaac tttcaataaa aatatacacg 84001 tcttggcaat ggtatgtaat attccaataa gcctggtttg aagactctga atttactctt 84061 aaagcaacag gcagaggcag tgtgactgtc atttgaacca cctgagcttt gatgttaaaa 84121 gagactcaag 
cgccaggaca cattgagcaa atccctccaa cgctattact tccctccttc 84181 tttcctactt atgttcacct tttccagatg ctgctctggg attattaccc catggaatct 84241 ttcacgtctt cctggcttgg gtcctacctt gcttctagca gtaggcacga aacacagagc 84301 ctgaaataca cacaccgtat cttcttacct ctccagggtt cttacacctc agcaatagca 84361 gcagctccag gcagtagcga gcccccatac tccccacgca gagcccctgt gcagtgtggg 84421 aacagagaag gagacaggag gcagggaggg acactgctca aagtgctaga ggagccggcc 84481 aggcagtctc cctgtctgga atgcacaact agaggatctc ggagggagag agccttgggt 84541 cactgtgatc ctctgagtca gacaccctgg ggagctctta gttctgaaag tcctccccag 84601 gcacagaagg ctgaaagccc ccaagcaccc actaggaaga aaagtgtaaa agttaaaaag 84661 ttaaaaagtt gagataaatg aggtagacat atggcagttt acagtagatg gtagaaatgc 84721 agtatgattg ctggagggaa aaaatcattt gaaaaaaatg ctcactcaca tatctgatga 84781 gagatttgta tccagaatct gaagaactct tacaattcaa taataaaaag acaacctaat 84841 tgaaaaataa gaaaaaaaat ctgaataaat atttctctga agaagatata caaatggctg 84901 ataagcatat gaaaagatgc ttgatatcat tagccatcag ggaaatgcaa atcaaaatca 84961 cagtgaggct catgccggta atcccagcat tttgagaggc caaggtgggc ggatcacttg 85021 agctcaggag ttcgagatca acctgagcaa catggcaaaa cctcgtctgt acaaaaatta 85081 aaataaaata aaaaacagcc tggtgtggtg gtgcgtgccc atagttccag ctacttggga 85141 ggctgaagtg ggaggatcac ctgagcccag ggaggtcgag gctgaagtga actgtgattg 85201 tgccactgca ttccaaccta agtgacagag tgagactcta tctcaaaaaa aaaaaaaaaa 85261 aaaattatca catcacgccc actaggatag ctttaaaaaa aaaaaaaaac tgttggtaag 85321 gatgtggaac ccttacatag tgctggtgag aatgtgtctc caaatgttaa acatacattt 85381 gccattatac tcagcaattc aacttttagg tatataccca agaaatatgg acatgaaaaa 85441 catgtaagtc tgcacaaaaa cttgcatatg aatgttccta acagcattat tcttaatagt 85501 caaacagtca aaacaaccca aataatccat caacagatga aggaataaac aaaatgtgtc 85561 atatccctac aatagaatat tattcagcca caaaaaggga tgaagtactg ataaatacta 85621 caacatggat gaaccctggg gacattatac tgagtgaaaa atgccagaca caaaagacca 85681 aatgaaatac tggctcttat tcccttgcaa caatcattcc ccaattacac caggctcttc 85741 cgaggtcacc attactcctg caaatttacc tatgtgtttt ggcagaagct acagtgagag 
85801 gaggcttcca tctcagtgtg gagctacaac cactggtccc cagcacattc agacccttct 85861 tctcagcttt gcttctaggc cactgacaaa aattctgttc tggcttcatg gttcagcccc 85921 actcctcccc aactccttca tttctgagta gtggctccct tctctccatg tctatggttc 85981 ctatctgtga gagttttttt ggaacttaca gttgctcttg tatatcaggt agttggcatc 86041 tccaaagaga ttaaatgcaa cttaagcaca atggactctc acagttcagt gggtttttca 86101 tagctgaaat tcagtaatta attgtgaccc tctgcctttt tcgcagggaa aggatatact 86161 tttttgagca cttaaacaaa aaaattaggg tattgcctcc aggggccatt gtgcattttc 86221 actatcataa gtaaaaatga aattagaagg cgatgccaat gaaggaagca gtccaatgaa 86281 gagactgcca ctatggtcat tgggttaaag aaaaggtcga cattcttatg taattttatc 86341 atccactaag agaaatacat tacagttact gacattttaa ttattaattt taattcttct 86401 ttttttgctt gtttgtttcc atatccttca tctttccaaa ctattcctgg gtacaatgga 86461 ccaatgccct tgaacattat tttcttaaca ataactagtt ctatcctgtg tcatgctcaa 86521 ggtttagttc gatcagtgca ttctgctaac aggacagaag tctcaaaaga aagatttgct 86581 agtggcccca gttagctaca acacatggtt tctggagtca catttttggc caatatgtaa 86641 cttttgccta ttctgggttt gtcttaaaaa tctcaacctt taaacccttg aaagtatcca 86701 tttttaaagg ttgatgccat tgccttctga aaaatgtcga tcaaagtgtg gaggagaagg 86761 cttcctgtct aaggataaca aagctgcaaa tgaacgttcc ccaaagcgcc ggctgtgttc 86821 attttgtact gaaatgtcca tacaggctga ttctgaaaca cactcgagtg ttgtgggtaa 86881 tagatgtcaa aaaataaaac tcaggcacac gctatgttac tttttaaggg aaaaaagtgg 86941 ggggaaatca aagggtttaa caattttatt cattggctta cattcagcga ctctgaaaat 87001 ttggccacta ctcaggtttg caaatttcta tggaaaatga actgtgtttg gcaagctcaa 87061 cgaaatatga ggctgataaa aaaaaaatgt ttcatgcttt ctcaagacct tatgccacta 87121 actgtgtggg attggaaaaa ctgtttaaat gtcaatttaa atactgtaaa taaatgattt 87181 tctatatcca tatgaattaa tataattctg ttgatttcag gccataactt gaactaaagt 87241 ttctttgaat tttgatgagg aaatgtttca gacacacaca acacaaactt cagattaagt 87301 agtttagttc agtagagttg tttcagagtc agaaatgtta atgcaatact ttattttaca 87361 ctatggagaa ctgactgaaa atttgattga cactaaggtt tttacaaagc aggtcttgaa 87421 ctaaaaagaa ttagaccaaa aaagacaaaa atattatcgt 
aacagcccaa gccattatac 87481 aaaaaaacat ggtggtaaat cccattgcaa agttcaggac tctcaaatac tatctccaaa 87541 ccaatcatct ttgtgtttat ggagctgact atccagctgc cttcttgacc cttctacctg 87601 gtgtctcagg gtcatctcaa acttccctta tccactaatg aactcttgac tttctcttct 87661 ggcaaatgtt ttggtatcct agcctcccac atctttagag gtgtttacaa agcatttatc 87721 acaatgcctg cctggtatgc agcactcgct taatcagtgt tgttgtggat gatgatgatg 87781 attattatta ttattaaata atgattattt ttatccaccc agttagttga gccaggaact 87841 aagtcatttt ctttattctc ctcttcactg catgttctac ttttaatcta ttaccaactt 87901 ctgtcaactc tgcatccaaa ataaatcagg aatctgaccc ctcctctcca tctttacttt 87961 catcactcca attccagccc ccatcacatt tctcccaaac aaatttagca gccttccaga 88021 caacctagcc atctgctctt gtccttactc aatttgttca tgggggaaat aacagtaatc 88081 ttcttaaaga caattcttca ttttgagaca tacatagagc atatcacata tttattaaat 88141 ctgctaaaca accttaagag gtaggtactg ttattacttc cattttacaa tgaggaagtg 88201 agtgaagaga ggttagaaca tgctcaactt tactctgcct ataagaggca aagccagaat 88261 tcctgcctgg gcagtctggc tcgagttatg tgttcaactg ctacacaaag taccacacca 88321 tatccaacct taaaaaaata acaacaaaca agcaagcaaa gatcatcaac gggcttccat 88381 tgcacttaga ataaaatcca tccacataga acttgcctat gtcaaagggt ctggaccaac 88441 ttccttagcc tcacctttca tgcctcctct ctttcctcct gttcttcaaa cactcatgga 88501 caaggcaagc tgggtcctga ttaggggttt ctgtctatgc tgtttccttt gcttgaactg 88561 ctttcctaag gcgtcagtca cagggctggc tcctcctttt tctggtctct gcttaaatat 88621 catctgctca aaaagccttc cttgacttcc ccaatctgat attaatgttt cctggcttct 88681 tgatctcaac acactgttga ttttcttctt agcacttatc tcaacctata attgatttta 88741 cttgtttgtg catttgtcta ttggctgtct ctcttactgg gctggaggct tcacaattag 88801 aaagattgtt tatcactgta tttttataat ctagcatcat gctaagatgc taggtttagg 88861 tgctaagaat tttgataaat gattgaagga agacaagtac gctttgatga tatgaagcat 88921 ccccacacag accttagatt ctccaagtgt gggcaaaaga tcaaatacag aatcacctgg 88981 ggttactttt tgttaaaagt gagttccagg agctctgagc tattttaagc acttactttt 89041 gaggttcatc tccttgtatt atcaagaatt tttacaattt tgattttgac atgatttggc 89101 aataagtaat tcatcggatt 
ctcgtttggc tgtgattttt taagtaatta ttgtgcgtgt 89161 tcaggaatac ttgttaagag aacaaaatgt acattacgaa ttgtttttac atgtgatgag 89221 attgttaaaa atactagttg acaatgaaat tggcacttgc tatgttttta gatatatagg 89281 tgagtattta catcattact ttgggagtca ataatttcct tttcagcttg cagaaattta 89341 tatgagctcc tgaacagctt gtggatcctc tgccctgtaa tcagataatg gtgtgggatg 89401 gatgaaatgg accagctgga gtccagtccc aaattgatca taacagaact tctgatgtca 89461 aggaatctgc attttaagaa actttccaag aaactcttat gcaaaataaa aattgaaagc 89521 tgtttttctc atagctagat ttatgtatgt tattttagaa aaatttgcaa tagatatcca 89581 gtgttcactc tttaaatttg tggttgattt ctgagaaatt caagataatg tcccacaaaa 89641 agggaaaaca ttttgctaag gataatggcc tccacctcca tccatgtccc tgcaaaagat 89701 atgatctcat tcttttttat ggctgcatgc tattccatgg tatatatcct tagcaaacta 89761 acacaggaac agaaaactaa atactgcatg ttctcactta taagtgggag ctaaatgatg 89821 agaacacatg gacacatgga ggggaacaac acacactggg gacctctcag aaggtagagg 89881 gtgggaggag ggagaggaac aggaaaaata actaatgggt actaggatta atacccgggt 89941 gaggaaataa tctgtacaac aaaaccccat gatacaagtt tatccatgta acaaacctgc 90001 acatgtaccc ttgaacttaa aagctaaaaa agagaaaaca taaaaatatt gataagatca 90061 ccaagtgtcc ataaatgcaa attatcaata atcgttctgg tgaaataaat catctctgaa 90121 aaatatcttt tgttagcctg catatacaga ttgagagaaa gggacaggca tttctcaccc 90181 tggaggggca ggatgaacca tttgcccaga tggaaaaaca aaagcaaggt aagttcctta 90241 gcccatagct tcgctatgtt ttagactagc tgaccccagc catttttggc accagggacc 90301 ggttttgtgg aagacaactt ttccacagac agtgcggcgg ggatggtttc agcatgaaac 90361 tgttccactt cagataatca ggcattagtt agattctcat aaggagcttg caaccaagat 90421 ccctcacctg cgcagttcac aatagagtta gcgctcttat gagactctaa tgaatgccac 90481 cactgatctg acgggcggcg gagcacaggc agtaatgctc actcctgctg tgcagtccca 90541 ttcttaatag gccatggact ggtactggtc tgcggcccgg gggtcgggaa cccctgtttt 90601 agatgataag tggatgttaa gaaaattaat aattgacggc aggaagaggc ttcctggctt 90661 gctcaaaaga cgaaggctgt tttttctctt ttagggaaag atacccaaat tccagaattt 90721 gaaagagctt aagttaaaaa caaaaacaaa acaaccagcc atgtttttga cgtcaattaa 90781 
caaatgtgag aataagcttt gtggatgcca cactcttttg agtgtctatc acctctctga 90841 atgtaacata atgttggaat tcaccatggc tctgttcttg aatctccttt atttttcatc 90901 tgcaattcct ctttggtgat tttaaccaac cttgtatctt taaatgacat ctatgcactg 90961 atagttacct agtttatatc tccaactagg acattttttt cttgaattcc ctgagttctt 91021 caacttcttc tcaatattct tctctatagt gttccccatc taagtcaata gcaactcaat 91081 ttttccagta gatcaggcca aaacttccac tctctatcac caatgggtcc atttcacaat 91141 ccatagtcaa tctgtcagta aatcttgttg gctgtacctt caaaatatgt cagaatctga 91201 cctcttctca taatttcaat ggttccatct tggtccaagc tatcatcatc tctcacttgg 91261 attgctgaat gactcagtaa agacataggt tctgcaaagc ccttctttaa agtagtcatc 91321 tctcctggtt gaaggtaatt tctcttggga aaatccagat tgtcaataag ccacttccag 91381 tcatatcctt ttctagaaag atattaatgc agataaccca gaaaaaaata caggtgaaga 91441 aacacttccc caaatcccac agaaaggttt ggagagtatg tttattcttt agcagttcac 91501 aggattttta aaaaattaac aaataaaaat tgtacatatt tattgtgtac aagatgatgt 91561 cctgacatac atatacaatt gtgaaatgtc ttcattaagc caattaacat atgcattacc 91621 tacatagctg tcactttttg tggtgagaat acttaaaatc tactctgtta gtgattttca 91681 aaataaaata cattgctgtt aattatagta accatgttgt acaatagatc tcttgaattt 91741 attccttttg tctaagtaaa atattgcatc ctttgaccaa catctctcca accctcagcc 91801 ccacccccag gttctggaaa gcaccatttt agtctctgca tctatgagtt caactttttt 91861 agattccaca tgtaagtgag aaagtgcagc atttgtcttt gctttctgtg cctggcttat 91921 ttcattgaac acagtgtcct ccagattcat ccatgttgtt gcaaataaca ggaatttctt 91981 cttttttagg gctgaatagt aatgatgcct gaattcccct ctcttgctaa taacaaaata 92041 gtactattat atgacgggta tttattgttc tggttgacag tgtatagcat ggacatgcca 92101 catgcatttt ggggctcagt gagctgatct tggtaatact gttccccacc cacgattgat 92161 ctcagaaccc aggcctaaag ccttttgtac tataggcacc agcatgtgag ggaaatgtgg 92221 ctggaagcct ttggaaatgt ttcctagaat gtaagtgagt tataggcaga ggaaaccttt 92281 cttcttctat gccagagtta ctcagcacta ttgacatttt gggttggatc attctctctg 92341 gtggggctgt cctgcacacc gaagatgttt agaagcatcc ctggctttta ctcacttgat 92401 gccagtagca cctctccact tgtgacaacc aaaaatgttt cccgactttg 
ccagatgtct 92461 cctggggtgg tcacccacag ttgagaaaca ctagcttaga catcgtcata tttatcccca 92521 aactgccaca ggttctgttg ctatcagcct agaggtgaca cttagaggga cataaaatga 92581 agagcactgc aggaaaaagg tcaagtcact ggatgaaacc ctacatactt atggattttg 92641 gtgacatgag cctctaaaat tcctaattat ttaaggtggt ttaaatggat tttgctttta 92701 ttataagcca agggcatcaa agtgatccaa ggtccagaat cacaggccac tgtacattgc 92761 tttcttccag ggcatcatac tgcttggaga aatgacaccg gccccacctt agagcaggat 92821 tccatagtac agctttacat gaaggatggc aagtagaatg gctcattttt ccatttcata 92881 attattgctt atacactgtt gttatggaat tatagaattt tagagcagga aggaatttta 92941 gggatggaga attaagctaa aaccaatgtc tatgtttatc ttcaaaggaa ggccttcaaa 93001 agaggatcaa gcccggtgcc tccaaatatt agggaaagtg ccattatcca cttcaaaaca 93061 catcctgagc atgacaagtc atgttacatc ttgtgttaaa agcaaaagga aatattttct 93121 tatagccaca tacccttaaa tattcccttc tccataaatg gaaattctat ttggctttaa 93181 cgtctacatc aattatctat ttacagcaga cttcctgtca tttccaaaag cagagttgtc 93241 atttaattct actatccagc aatattgctt ttaatgtgaa aaatgatgaa tagatacagc 93301 tggcataatt gctggacttt aagttcatag catagtgggg aaaggaagaa gataaaaaat 93361 acaatcaatt tattttactg ttttatatat atacatataa atctgcatta tctttgctta 93421 taacagcaat ttgagcttac aactaatgta atcataaata taaaaataag caaaatgagt 93481 aaaattaaaa tcatccataa tccaacaatc attgattatg tcttaatgaa tattatcccc 93541 tatatttttt gcatattgaa attttcttct caaaaagtat catattattt tgcaatctac 93601 tatatttaaa acatcccctt ttatcagata tttagctgtt acttattcat ccccattgta 93661 aacagtgctg catatgaata gttggctgaa attattaggt tggtacaaaa gtaattacgg 93721 tttttgccat tactttccat ggcaaaaccg caattacttc tgcaccaatc taaataaatt 93781 atatatgctt agaaagaaaa atggaaggat aaaacacaaa tgaagaaatt ctgaaaaatg 93841 gaccaatatg taggatattg ttggcttcaa aatcctgtga ggattcaact gttgatccag 93901 tccccaaagc aatcagatct accaatggat aagaaccagg gcagacagca gctgccaaga 93961 agacagaggg gagggaagag caaaccagtt ttgcagtgaa ggaaaccacc caggtggcag 94021 acttgatgga acaaacactt taagtgaacc taatggaaaa tggttcaaca aatggaccac 94081 cagtagattt tctcagttta taaatgatag 
acccattgcc tagagaaaag aacagctttt 94141 attcaatcat ataataagta aaaggaaata atttgctcct ctaaaaaagt cctgtgatgg 94201 taactccaca tttctatatc tatatctata tctatatcta tgtctatcta tatatctctc 94261 tctcaaaata tctgtattta tatctgtatt tgtattcata tttgtattta tacccatacc 94321 tatacttgta cttacatatg tatgtgtcat tgtacccaca tatcttgcca gccatccaac 94381 catccatcca tctacataga agtgcttctt aaactggagt atgcatcaga atcctctgga 94441 gggcttgtta aaacacagat tactgggccc catccccaga gtttctgatt cagtaggtct 94501 tcaacagggc ctgagaattt gcatttctaa tgtttcaggg tgatgcagct gctgatggtt 94561 ccgggattac actttgagaa ctgctgctta cacctgtacc cctctcattc tctctttcta 94621 tctcattacc aaactctaat ttaatgaaca tacagttacc cgcacatatt ttgattcatt 94681 atcaattgtc aatatacatc acagtgttaa atactacaca aaatagatat tatgctgatt 94741 tcaaatttct ctccccttaa tttaataata aaattataag atacaaaaat tattaccatg 94801 tatatttttc ctcccaggag aatacttaaa tcgtggtaca tttcttgctt ttacttggtt 94861 ttcactgcct ttggcttcta acttgatgta tttacatctt aatggttcag atttattttc 94921 caaacagtga gatgaaacag ggacaactca aaggcagctt cattttgaaa ttggccaggc 94981 ttttgtaaac aaacccttga cagagcaatt tctaattctc taaacagctg ggggggaaaa 95041 tatcacacag aaaagcagat gttattcgcg taacttcttt ttttcaggtc tacaatcatc 95101 atctacagca ccggatccac catatcactt caagtaaatt ttgactttgc taggtttcaa 95161 ccacattact cacattggag atgtgagtca aagaataaag cttctcagag acagctctgt 95221 ggatgcctct tcatgttagg ttgacagctt ttgatttaag cgttctgtta gctcatgact 95281 gcagcatgca ataggaacac agttcctcag atgccaaccc taacctaaat aaaagctaga 95341 ccggcatctc ctcagggtca agttaaagaa tatgggctgt gctgtctcag acctcagcgg 95401 ctaccaatta gtgggagaca agacatgtat aaaaggaaat ttaattggta attttaatgg 95461 ataaacaact gcagggttga gagaaagagg tcacttgtgc tggtggagtt ggggaaagtt 95521 tcatattcca tgcattcaga tcttgagaaa caaaggttat atctccagaa agcaaaaggc 95581 ggacattgaa aagggaagca aatctttaca gatgaagtca cagcaaagac aatctggagc 95641 caggaattaa cgtgtgacaa gagagggaat aatcccaggg atgatgaatt ttactactga 95701 aatgattata aattacagcc acctacaacc tattgtgaat ggcttttatg gtatcagaac 95761 agaatagtta 
aaggaatagg aaaaaatctt gtgccctaat gaagaaaagg gagatgtgtg 95821 ttttgctcaa ttcaaagaaa atatctcttt acacacacac acacacacac acacacacac 95881 acacacacac acacaggtgt ataatataca cacacacaca tatataaacg caaacacata 95941 tatgtgtata tatatacaca tacacctaca tatacacaca tgcatacaca catgcataca 96001 cacactcaga ccttatggaa taattctaat tatatataag gtctccttgt tcccattaaa 96061 catccaaaat gcccctagtt gcaaatactt cagcaaatag tccctgaata aaagatatag 96121 tcagccaaca ttgaaataaa gctgttacta caaagaaaaa tattttgcaa aagagacagc 96181 tcatagactg ctacaataca aaagtctgca ttttaagagc cataactgta tatggatgta 96241 agcggtgttt ttaatgttaa atacatgttc aatatttatt actttttgta aattgtgcca 96301 tgaaggaaat ttactggaag caacaaaata gaagaaatgg taacaatttc taagtgtttt 96361 ctaatgtaac aatttggcct gtagttctaa tttatttaaa aatttgttca attgggcaca 96421 aatttagttg cacagtggta ttttgatgta aaagttagta taaaaagcag gaaaagtgaa 96481 catttttcaa aataagtacc ttttctaaaa atagtaatat tttgaatata aaaatagtgc 96541 aataaatgct actcacaagt aacttctcaa taaaatgaat gattgaataa aataatacat 96601 ctaaatttgc tggctctcag atctgagctc ccagaacttc aacagccaca cacatatacg 96661 cacacatttt caaattcaat gttatcctat taaaaataag cctgtgttaa aagaagttgt 96721 aactgtccat cacaccaaaa atgcatagtt tagccacagc cattatagca attttcatat 96781 tattcaaaaa gaaaagattc taaagagtac aaataaagca caaagcagat gttcaaaaca 96841 agtctgcatc aaaaattcaa aaacatttgc cacaggaatg gcgcactttc atcattaata 96901 tacacctact atgtacccac aaacattaaa aattaaaaca aattaaaaat aaaaaagaaa 96961 cggatcataa atattacagc tgccagtttc ctggtgagaa atctttgtga agaaattgca 97021 gttaaagtgg cgctctggaa aataagataa aatgacaatt aagagattat cttttctttc 97081 tgctgtgacg tgacatatgt gagtaagact gttggcatac ccacagatac agtgatgcaa 97141 gttaacattt gcaaaatgtc ccaaatctga agaacatctc aagaaaatca ctggagaata 97201 aagaaaacat tttattgatc atctctacag tacagagttt ctcaatcttg gcattcttgc 97261 tatttgggcg agatcacttg tttttgtggt gggtgttgct gtcctctaaa ctgtgggatg 97321 tttagcagca tccttggcca gtagcaaatc acccctccca aatgtgacca caaaagtctc 97381 cagacattgg caaatgtgtc ctggggggag gtacaaaatt tccccagtga aaaacactga 
97441 tagcatattt ttcttttttg agacagagtt tcactcttgt tgtccaggct ggagtgcaat 97501 ggcatgacct cagctcactg caacctctgc ctcccaggtt caagcaattc tcctgcctca 97561 gcctcccaag tagctgggat tataggcatg caccaccatg cctggctaat tttttatatt 97621 tttagtagag acggggtttc accatgttgg ccaggctggt tttgaactcc tgacctcagg 97681 tgatccacct gccttggcct tccaagatag catacattca cataaactta tttgatcaat 97741 tgatgtggta aataggataa aattgtccac aatgttatat tcatacagaa ctcattctgt 97801 gaattttgcc taagtgttgt atgtacacat ttttcatatt taagcacaat agattgacaa 97861 atctggagtg actatttctt aggtataaac atttttaaac tctactagaa tgcttctgct 97921 gattatatta aaatttgaag gattctcacc cagaagttgt gatttggggt ttaacccaaa 97981 atgttatcac tgctttacat gatgtaactt atattatatt ttggataaag gaaaacaatc 98041 tgtttctgct gtttgaatag aagctccaag taatattgct aagcatagta taccctgaat 98101 tctgaaggtt ttgatattag ttatcagaaa taccttagat gaatgtgaat ggtttgtaac 98161 aattattcag gaatgagctg gaaaaacttc tccctggaca gaactggcaa taatgtggaa 98221 tattacagaa atacttaatt ataaaaacca tacaagtcat acttttgtcc ttaaaactta 98281 tcttagtcaa acacgctagt aaaatgaaca aagagaaagt atgaggggac ttcaaaaagt 98341 tcatagaaaa tatgcattat gaaagaacta tacaaaaatt tcaaatgtat attacaccaa 98401 aataaattct tactaacttg ttataacctg ccttaacaag atctagtttg aggcactaag 98461 aaagataaga catcagtttg aaaagagccc ctgtcaggga aacatgaatt ctgctaaaat 98521 tgaagcaaga acaaacatca aatttatggt gaagattggg tggaagaatg gtgaaatcac 98581 tgatgcttta ttaaaagttt atgaggacag tcccccaaag atatcaacag tttgcaaatg 98641 aaaaactcat tttaagaagt gatgagggct gagtgtggtg gctcacgcct gtaatcccag 98701 cactttggga tgctgaggtg cacagatcac ctgaggtcag aagtttgagg ccagcctggc 98761 caacatggtg aaaccctgtc tctactaaaa atacaaaaat tagctgggct tggtggtggg 98821 cagctgtaat cccagctact caggaggctg aggcaggaga attgcttgaa cccgggaggc 98881 agaggttgca gtgagccgag atcgcaccat tgccctccag tctggtgaca agagtgaaaa 98941 cccccatttc aaaaaaaaaa aaaaaaaaaa agaaaaaaga agaagggatg atattgaaga 99001 tgaagcccac agcagcaggc catccatatc aatttgtgag gaaaaaaatt catcttgttc 99061 atgctctaat cgaagaggac caatgattaa cagcagaaac 
aatagccaac accatagaca 99121 tctcaattgg ttcagcttac acaattctga ctgaaaaatt aaagtcaagc aaactttacg 99181 ccctaaagtt accaaaacca ttgcacttag atcagctgca gacaagagca gagcttttga 99241 aggaaatttt tcacaagtgg gaccaagatc ctgaagcatt tcttcaaaga attgtaacag 99301 gaaataaaac atggctttcc cagtaccatc ctgaagacaa agcacaatca aagcagtggc 99361 taccaagagg aagtggtcca gtcaaagtaa aagcagactg gtcaagagca aaggtcatgg 99421 caacagcgtt ttgggatgct caaggcattt tgtttgttga ctttttagag ggccaaaact 99481 caattaacat ctgtttacta tgagagtatt ttgagaaagt tagccaaagc tttagcagaa 99541 aaatgactac aacaatgctc ctggatcact cctcttatca aacaagggca attttgtgag 99601 agtctctatg ggaaattact aagcactcac cttccagtcc tgatttggct ccttctgact 99661 tctttttgct tactaatctt aaaaaaaaaa aaaatctaaa gggcacccat ttgtcttcag 99721 ttaataaaat aaaaaagatt gcaatgacat ggtaaaattc ccaggaccct tagttcttta 99781 gagatggact aaaatggcgg gtatcattga tcaaaaaagt gtcttgagct tgatgctgag 99841 aaagtttcta tttattattt cttatctttt catttaattt gtcccatgaa ctttttgaag 99901 tgctctcata tatgcagcac aaagtagtgg ttggtttcac ttaaaattct tttgagacgg 99961 agtcttgctc tgtcgcccag gccggactgc ggactgcagt ggcgcaatct cgactcactg 100021 caagctccgc ttcccgggtt cacgccattc tcctgcctca gcctcccgag tagctgggac 100081 tacaggcgcc cgccaccgcg cccggctaat tttttgtatt tttagtagag acggggtttc 100141 accttgttag ccaggatggt ctcgatctcc tgacctcatg atccacccgc ctcggcctcc 100201 caaagtgctg ggattacagg cgtgagccac acttaaaatt ctttagtctg ataaatgttc 100261 caaataagaa aaataagaaa atgtttctac taattctgtt taaagtgtgt gactgcattg 100321 tttaatgata ataaatctag agatcaaaat gcttatcaat cactattaaa attataatgt 100381 attcactgac aatacaggaa ttactcaaat ttaccaagaa atgtgggaaa tttttaagaa 100441 taataagcat atcaagtggg tgcaacatgc ttacttcaat aatacaggac aaaatatgaa 100501 agctattatt tttttctgaa ttgttcattt ctttgtataa ttttctaatg aagattattc 100561 atttaccact aaatataatc aaattattgt atcataatta aatgcacaaa tgagaaaaaa 100621 cacctttgtt agtctctaca tacaaatttt tctttcagaa aatgttgtca ctgttattca 100681 cattcctaca cagataacat ctgaaacctg tattttggtt tctgtaatta ttatttatta 100741 ttgaagtgaa 
aattatctaa aatatacctc atttgaaaga gaaaaacatc ctagaccaaa 100801 cttattttta aaagtagtta tgctattttg ctagaaatat gctaatttgc tagattagta 100861 gctctcaact gaggatgact tcacacccaa taagacattt agggatgtct ggaaacattt 100921 ttggttgtct caactgaggg gaaggtgcta ctggcatcta gcggctagaa atcaggcatc 100981 ctgctcaaca ttctgcaatg cacaggacag cccctcacaa aaaagaatga tctgccccca 101041 aatgtcaata gagctgaggt ttagaaaccc tgctacacat aaaattgttt cccagtttaa 101101 tctgtgcaac aaggcaagaa gcacgcaatt caaaatggtg tgaagagaat ccctgtttag 101161 agagtatata atgttttacc aaaggaattt ttctttttct ttctttcttt ctttcttttt 101221 tttttttttg agatggagtt tcgcttttgt tgcccaggct ggagtgcaat ggcatgatct 101281 cggctcactg caacctccgc ctcctgggtt caaccaattc tcctgcctca gcctcccaag 101341 tagctgggat tacaggcatg caccaccacg cccagctaat tttgtagttt tagtagaaaa 101401 aggatttctc catgttggtc aggctggtct tgaactcccg atctcaggtg atccacccgc 101461 cttggcctcc caaagtgctg ggattacagg cgtgagccat cgcgcccagc tgggattttt 101521 tttttttcct atggaaagcg gaatttaaat tatcccttct tgctattgtt ggcacctcac 101581 aaagcaggaa acacagttga aggttgaggg gatcatgtac tttgatattg actcaaagta 101641 atgccttcac tataaaacag gaatcacttg acatgctaag aaaaacccat cgtagaaatc 101701 aatataacta aagcttgaat gaatctgagt aactcttttt ccctcgatgt caaaagtaag 101761 tggtagcctg tattgcttag gaggcactat taaaaaacaa acttttaaca gccttaagta 101821 aacgttttca ttttccataa aactaaatga tggcacagta atatgattgc ttacagctgc 101881 aaatgcctca tcatgaacta aacacttgtt aaactttgag taaaattttt cttcaatggc 101941 ttacagtaaa ataaaagcat ttttaaaaat tatttcatta aataatgcca tggcagaccg 102001 cttagttaaa taatgtcata gacttgaccc aatcatccac tgatttcatg ataagtacac 102061 attaaatcta aaaaaaaccc ctacatccct aaatacaaat gactttataa ttttacatta 102121 gggagtagca ctgtataagt tactaatgtt ttgtcccacc caaaggaagg aaaaatagaa 102181 aacttcatgc taggaaggaa aaaaaagttt cctatgtgtt taattttcct aaatgtccaa 102241 gatattcttt agattacatg tcatcaacac ttcaggagaa aggtgtcaag tttctgaatg 102301 gggtagtaaa gagaagttga ggtaagaaaa ggacatctga cattcattcc tgctaagaaa 102361 gaaaagaaaa tcttgctcac acatttcgtt 
ttaaagactt gtaagggaaa agtttgtcta 102421 aaatttcaaa taatacaaaa caaaacatat tcgtcattca ctttttagta tgcattactt 102481 caatcagatt gttttgttct aaggccaaac taaattcagc cctttaaaga ttggaaaata 102541 tttgatgttg ctggatgttt tgcaaagtat caaataattt tggaatgttg aacacctatg 102601 ggaagggcat actaagtgag taccataatc cagagggatg ttgtcaaaga taggtgagta 102661 aagaggcatt tgaataccag caataagcca aaaaatgtca ctcctcctaa acacagatat 102721 tatgggttca tttagtgact attaccatgt cctctattgg tgccagggat gtaggggctc 102781 aggtaaaaag atatcaaaaa gtcaggtctg agcctgactt cacaatggta cctaggaatt 102841 tgggggcaat cataacctgc ttttgagtat aatcgtgtgc tacagaggaa tgtagattag 102901 atttatcaaa acaaaagtcc tggttattca tttacaacag gctttgtgac cacagaagtt 102961 atcccaagtc tttgattatt ctacaccatt acaagatact ttacagggtt ttaaaggtac 103021 tttacaggta ctttacaggt actttacagg ccaaaaagca tcaacaaaga cattgctgat 103081 ttagtcagca tctggcctgc ttctctgaca tgcattaata tggaaccctt tcaaaccata 103141 ttcagtgata gttcatattg aatccttcac attacaagat aatcactttt gtggtaaatt 103201 attttgtttt ggtattagat tggtgttaat ttttctcatt tcatcgtgaa ggaaggtaaa 103261 atgttgtcta aagtcatttc tgaactttgc acatacatct ccaaacatct gagcattatt 103321 attctgtaca ggagatacag ttatggtata ggaaggtaca ccagtgacta gtggaccagc 103381 atagggctgg agagggtagg gaacattaat gaagtcatgc tgaagtccag atgggggagc 103441 cagcctccga tagattctaa ttcagatgca gcatttattg agtaatttgg gagcgagtta 103501 cctacatggc tcacttttaa aatggggata aaaatctcac tctactagga tttttgtgaa 103561 gttgacaaag attctatgtg aatgcatgat gagttcttag aaaccagtaa cagttaactg 103621 aattcagtgc ctgaccacca ccactctttt ctgtcacgtc tcagggtact ctgccacact 103681 ggctaaaatc caaaggtttc ctaccaattc tgtcaaggat taatgatgta caagtgtctg 103741 cccttgcaga acttctagtc tatcctcata gttgttgaga atttctggga aaatctttgt 103801 ttctgcagct tgtttgaggt attttatttt tttcaccacc aaaaactttt cagtaaaacc 103861 tatttctgag tgtggagata tcttcgtctc ccaaagttgc taaagggcac ctgaagcctt 103921 gatgtacaaa aattctaatt tcaatttaat tttcaacaca ttcaatccaa agactattaa 103981 ctcatgtagc tagatcactc ttcagacata ataactgaat gttctaatca 
aaatctccca 104041 gagttacaaa aaaatctaat tctgcaaaag acaatcaatg taataattac agaggtatta 104101 tggtgactca ttagaaaagg caagtgttag aaaggagctt tttctgtaaa tagcagccct 104161 atcatctttg caattcgact aatcccatta cacctacaca agaacgaatt tcaaacttta 104221 ataataaatt tcatgaaggt taccgagccc aggcaatgtt ggcaccagag ttatgagatc 104281 attatgggaa gttcgtgaac ctggtgctag agagatgatt catcagctct ctgtaagatg 104341 gtaaagattt tatggatcaa catgggagta gtcatttttc aatatttcaa aagcctggct 104401 tgggtattca acagtttgac atacttagct gtagcagcag ccacagtttg tttagtcagg 104461 ggcaacagga aagagtgata aattagaaaa cgtcaagcca ttctttccct cacctgcctg 104521 actttaagta attaatgaca gctgttttaa tgaattgatt agtttaataa ttgaaaaatg 104581 caagacacta cctttatttc tgttatttct cccatctcct tctaacaaga tgttcaagaa 104641 ttgtatctct gttaaagtac aatcacgtgt caatgatgag gatatatcct gagaaaactg 104701 tcattgtgaa catcatagag tgtatttata caaacaccta agcattatca tacagtgtgg 104761 cagacataca aagagttaca tacaaaaagg tacagaaagg cttactagtg acctagtgga 104821 ccacgaagtg aacatcatag catgtactta cacaaaccta ctacacacct aggctagatg 104881 gtacagccta ttgctcctgt aatacagacc tgtacagcat gttactatac tgaatactgt 104941 aggtaagtgt aacacaatag tatttgtgta tctaaacata tctaaacata gaaaggtaga 105001 gtaacaatat ggtattataa tcttatgaga ccactgttgt ctatgtggtc cattattgac 105061 agaaacatcc tcatttgtaa catgactgca tattcgcaag tggattccag ccaccatttt 105121 acaataacaa caacaactac atgcacaaca gcaaaagcta acaatttttt agcactgact 105181 acatcccagc tactgttata agcacatctc aaggtgttcc accaagctgg tgaggctggg 105241 ggatgtcagc aatggggtta aagggtgcaa agcctcagaa acgaggaaga catttgagtt 105301 ttttttgaga tctgctgcac agcatgttga atatagctaa taactgagta ccatacattt 105361 cataattaca gagagtaaat ttcaaatgtt ctaatcacaa aaagattaaa tatttgggat 105421 gtatatgtta atcagctcga tttaatcatt ccacattgta ttcaaaattc ataacatcac 105481 ttttcaccaa ataaatacaa ttataattta ttaatatata attttaaaaa ttctgagagc 105541 ccactagatt tctcaagggt taacgagatg catgtgggtc ttagaacgcc aggaagcata 105601 gcaactatct ccttggcgtc tctgaggagg atggtcttga aaaccagata aacacaatct 105661 tttacacatc 
ccgcaaatat taactgaggg cttcgtttgt gccaggcata gagctctatg 105721 ctctagatgt gtttactcat ctaatcatca caacaccacc attgctccca tttcacagat 105781 atttaataga acagtggaga aacttgctca tagacccacg gtaagaaggt ggcagagtca 105841 aaatgcagag aatagattta gcaaatcaat tggagtcacc cagtgaatgg tgtcatcatc 105901 tcagcaccca actggaaagc tgcaggatcc ctctcttcct ttcctctcta ttctcatgga 105961 tgttagaata gtgcctggca catgttcagc atgttagaag aattttgttg gggctcagaa 106021 aacagtatct caaaatgaag actgctgctg agcagcttca gaagcaaaag tttctcctga 106081 ccttctcctg tcctcctctc agtcccattt tgccctaagg atagccatgg aaactagaat 106141 cccttttccc caagatagat catacaaatc agaacccctt ttcctgaaag tcagtcataa 106201 aacctaaaaa tattctacgt aaaaactggc catgaagaaa tgatctgacc taactggttt 106261 gactgtagat cctaagactc ctgttccaga cagggtcctc tctcacatca agaaggaaga 106321 aatgcttctc agagaggcca agaataatct agatggacag gccttactgg gtttccccta 106381 ctcagtccat tagcatttaa atcagaccct ttctgtccaa tcacatttct acatggctgt 106441 ccacatattg ttgaacctaa gcataaaaat ggacaactgt ccattttccc ctgtatgtta 106501 ggtattcatt ccgaaggttt ctgtgtatac aagttaaata aacgtgtatg ccttttcttc 106561 tgttaatcaa tctgcctcat gccagtatat tttcagtgac cctttagagc gcaaagggga 106621 agtttcttct tgactccctt agagttagta aatgaaatag gtaaaagaca tgagccaact 106681 gctcccacta aatgagtaaa accatgctta agataggcag aataaacaca agcccctgta 106741 agctgcaaag tgaaggcatt tagtcaggcc cactaaacaa attctctatg tgaattcaaa 106801 tgctatagat atgttactga tttactgtgt tattcttctt tattggtttt ggaggtctga 106861 gaccatcaca aagaggctct tgtggtctat tcttggcatc tcacaacatc tcacaatgtt 106921 gtcatgggta agataactga taactgtttg ttgattagac tttcctgggc aaatataact 106981 gaatacagag aaatcataga cataactggc tatgcttact ttaaaactgt gaagctctgt 107041 ataaacaact aacaaaatgg ggcatattgc taaatcaatg agagcaaagt acccaagtcc 107101 aaggctattt tcaaatgata cgtttacctc ctcacactaa tctattcatt ttcattcttt 107161 gattggattt taaaggttta gctcaagtgt ggttctcaaa aaaactagac atattacttc 107221 cacctagtga ccctcttgcc gctatatttt tcccaccatt ttcatgatca agctactctt 107281 agatccatcc cctagaaata tgctttctct 
gaataatcaa attattcaat ccttttttgt 107341 agattctgct cagggcttcc ctatgacgtc tggcttgctc tggtattttg ggtgctgatg 107401 aatcttagag atatacacat atattctcat taatttgcaa cttttttctt gatgcaaaca 107461 aacacaaagc aatcaaaaca aaaatgttct agaggacaca tacattccct tatccacagt 107521 gcagggccac agggtacaga cagcagaaag tcatgtgttt tggcttgctg gtactaccag 107581 agatcttcat ccctatgctt tgtgaagtga atacatcaaa gtgcctctct tccaagatgg 107641 tttgactcaa gttcatgttg atgaatgcat tttagactaa acttttttat tttgagataa 107701 ttttagattc acattcattt gtaagaaata atatagagcc atctcctata ctcttctccc 107761 agcttccccc agttgtaaca tctttcatga ctatagtata atgtcacagc caggaaactg 107821 acattgattc aactatctta tttagaattc acaagtttta catgttatca tttgtgtgtg 107881 aatgtgtgta tgtgtgtgta tttagttata tgcaatttta tcacatatgt atctttgtgt 107941 gacaaccaca atagtcaggg cacagaacag tttcattgcc acaaggattc ctcttgctac 108001 ccctttacag ctatttgcac ctccctctca actcatcatc ttcttaaatc tagcaaccac 108061 taatctgctt tccatattca atgaagggag gatgtccaac tgggcagagg caacatccag 108121 caggagacaa ttacacaatc atcgaagaag atgctcagat aaattgcagt taccttctcc 108181 actgagaaca gtaaggctat ggcgtcatta gaaaattatg ttctttattc ttaaataaat 108241 ccaagtagga tggggcttga tatactttgg ctgtgttccc acccaaatct cagcttgaat 108301 tatagttccc ataatcccca catgtcatgg gagggacccg gtgggaggta actgagtcaa 108361 tgggatggtt accctcatgt tgttctcgtg atagtgagtt ctcacgagat ctgatggttt 108421 tataaggggc ttttgcccct tttgctcagc aagtcttctt gctcctgcca tgtgaagaag 108481 gacttgttta cttccccttc caccatgatt gtaagtttcc tgaggcctcc ccaaccatgc 108541 tgaaccatga gtcaattaaa cctctttcct ttataaatta cccagtctcc agtatgcctt 108601 tattagcagc acgaaaatgg actaatacag ggtagatggt caatccagca gtagaatggg 108661 atgaaaatct aggatagtta aaaatctttg tgcataagga aaagaatagc atgaaaaagg 108721 agaagagata gccttttctt tttcttaaat gagaggaaaa agcacctttt attttccttg 108781 ctgcagtggc agcaaactca ttggttttca taaaacacaa cttctgcgat atggtctgtt 108841 gataaggttg ctccatcatg ggaggacaaa cttgaggctg actgttaatg acagaactag 108901 gtcttctctc aaagcctcta atgtgtgcta tttctcatga agaacccctg 
acaccgtaag 108961 atgcttcagt taaagcttct gaaatattgc agtggcataa ttgacaagtt ttttggtgct 109021 atgagtttct cacacatgta aaccctttcg gaaaagttgg agataaaaag atgggggaga 109081 gtttggttta gagtcacaaa gttccagata ccagccacag atgcctctgc ccaatgtcgc 109141 tacatgttct tttctgagaa atgcattcaa tcactcagta ttacgcactg tggttaaaaa 109201 aaaaaaaaaa gatctactca ccaccatcta ccctccctta tcctccatgt gaagctcacg 109261 gtagcagcag ctgaagacag aaaagttgta cctgttccct cacaccaggg acattacttg 109321 gtgacctggt gaatggcata ggcaagatta cccgtgttgc tggaggtgat ccctgccaca 109381 gggatgcagt catcctttgt catacagatg gaaaacccct tggttagctt ctccacctgt 109441 tcaggcttta gccttgtgaa acaaagcatg ccaatttagt cagtcatgtg ttgtcagatg 109501 tgtgaggaac tctccttact gcagttggag atcagctgag tctgcatgct agtaatgcag 109561 ttggccatga ttttcacttc ttgcaaccat tattttcaca aacccaggct gttcagaatg 109621 gtagaggcaa tacaggcccc actgagagaa gggttggaat acatgggatg gatcaggatc 109681 ttcaactgac tcccctcttt tggtttcatt cactgtcttt gcagatcaca gtgaaggctc 109741 tcatacactg accatataaa gcccatgttc ttggcacatg attgacagag acaaacatta 109801 atgctctgtt caatgaagtt gcacacagcc caggcatcct tgttaccatc accactggca 109861 aagccttggt aggccatatc aaagaacgca gtttcttttt ctttgccact attattattt 109921 cctttcactg ctttgcatga gggtcccagg catgcaggag aagagcactt tgctctggta 109981 tttttgaaat gtcctccata gcacccgtga agtcaaaact gcaagtcttg gggttataga 110041 attgataatc ttgtagttgc atgccagcta ccctgaagat aggtgtgtga tttccccagg 110101 gtggtttgga cagagacatc tctgtcttca aaaactttaa aaatctttgc acaaaactgg 110161 ctccaaccct taaaccccag ttccagaaat gatccacaca gtgacagact ggccactttt 110221 caacactttg ctcttctcac ccaaggctgg ttctgcagat gccttggaaa attcagcttg 110281 tccccaaatg ggcaggtatt ccttgtctaa atttttttat tatattttaa gttctggggt 110341 acatgtgcaa aatgtgcagt tttgttacat ggatatacac atgccatagt ggttttctgt 110401 acccatcaac tcgtcaccta cattaggtat ttctcctaat gttttctctc ccctagcccc 110461 caaccccgtg acaggccccg gtgtgtaata ttcccctccc tatgtccatg tgttctcatt 110521 gttaaactcc cacttatgag tgagaacatg cgctgtttgg ttttctgttc ttgtgatagt 110581 ttgctgagaa 
tgatggtttc cagcttcatc catgtccctg caaaggacat gaactcaccc 110641 ttttttatgg ctgcatagta tcccatggtg tatatgtgcc acattttctt tatccagtct 110701 attattgatg gacatctggg ttggttccaa gtctttgctg ttgtctaaat tttttatggt 110761 aatctaactg ggtctcttat tgtcccggta ggcaccaact cccagattca tctttttgct 110821 gttgatttcc ctcttaaagg ctttggtggt tcccaggatg ggatctggag gtctcatttc 110881 cacatgggcc cacaaggagc tggctctggc agaggccatg gtggggaggc ctaggtaaaa 110941 ggtggcagcg atcccctaga ggacacggct ggagtggagc agggccgggt catggtggac 111001 cttgagagtg cagtgggtag gcacaggaca aagtagaagg caagaaggtg cagaaccagg 111061 gggcatggac tcctgctgaa ggtcttgttt tgtattagaa gacctccaca gtggagtgaa 111121 aaatgaagtc acttctctgt atattttgaa ttaaagcttg cacaggtgtt tcagcctgct 111181 aaatagttca acctggacta tttcagctct gaattattca aataattatt tttaaaatat 111241 tttaaattta atttagcaca cagagaatgc ctctgtgtga agtaagtggt ggcaatttaa 111301 aataactaat gatcataatt attggaatct agtctcctgg gcattttcta cttcataatt 111361 tacagggagc ctatgagagt tcaaacacac tgtattttcg ccaaaggagt ctcatgcttt 111421 gtctgcattt tatggcttcc tttgcacctg gagcagctat tttaggactg tctagacaat 111481 caccccagct ttaaaggtct tttcctttct tttcacatag tcactgagca tgaaggataa 111541 taaatagatg acgctggcac tctctcacca aagatactat tcaaatagag acaggaacaa 111601 gaaggccttt gctctggctc acaaaatcaa gatgccttaa tatacagccc cagatgtcta 111661 ccaattggcc aaacttgaaa ccaacaagaa ggccttccaa tgtccacatc tctgtagtag 111721 ctggatcata ctttttggtt gagctgcgtg aaatggttac tatttgacca tcttggacat 111781 ataaaaatgg caatttaaga tgtttcgatc caagagattt gggaaagaat ttgggagtgc 111841 tctgtttcct tttttcctct ccaattccca agcaacattc atctgggttc tagcaaggac 111901 ccagaatgca cagcttctgc aggccacttt tgccactaca gggaaataat gcagccaagc 111961 atgattcagg aacatttcct atatacgtgt cctaggtaag acctaaaagt tagtaagtaa 112021 tatatcccag ggagacgtac accgggtcag tttacttcac ccctcacagt catttctttc 112081 aagaggtaag agacaccttt taagctctcc acaggactcc tacagcccac ctgtccccag 112141 gctctttaat gggttccgag attgcatccc aatctcagac cctcctggga atattcctac 112201 ttggtactct ctgatggtga aagtatttca 
ttttgtagga aaaaaataca caagggtaga 112261 tatctataga tatatagatg tacattgcat ataatataca atataatatt caattacatt 112321 atatacaaca tacactatat atgtattttt acattatata ctatatatat aatacgtcac 112381 ctagcaatga tcagttttat ttgtttcctt ataatgttag agtatttctc agtacaacaa 112441 gatgtgcaca tattaatttt ctgctaacat cataccattt ctactgtcgt gttcaagagc 112501 tatctccacc ctttactctg tcccactttc tccactgatg atatgtgtaa catgaatcag 112561 tttggaggtc tgaagtccaa agcatttctc ctgagtataa actggagctt caaaaacaag 112621 agcagcccaa gaaggctggg ccaagggctg agaaatttca tatcagtgcc actcacaact 112681 gcaaactttg tgccaggcac tgggctaaat gcaatggtaa gtatggggat acaaagtaga 112741 tgaatactca gtctccactt aagcagatac aattcttata ggggaataaa attacaacac 112801 caagtggagc aaacacagca tagaaccaag tcaacagatg ttgtcttact tcattttgtg 112861 ttgctataaa ggaatagcta aggctgggta gttaataaac aacagaggtt tattttggtt 112921 caaagttctg ctggctttac aagatagatg gcactggcat ctgcttggat ttcagcagag 112981 cttttgtgtt ccattaaaat atgtctgaga aggtcaaaga ggaagcagac atgtgcaaag 113041 agggaccaaa cccaggggtg tcctggcttt ctagcaaccc actcttgcag gaactaatca 113101 attcccctga gaaccaatct agtgtcaaga gtgagaatgc cctcactgcc atgagaacag 113161 caccaagcca tttatgaggg atccatccct gtaacccaga catctcccac tatgccccac 113221 ctccgatatc accatgctga ccgtcaaatg tcaacatgag atttgatggg gacaaacaaa 113281 ccatgtccaa accacagaaa atatggattc aaattctccg agggccatgg cttatgagca 113341 gcgttattag ggcaatttgc ttaacttacc ctatttcctc ccttgagtat cagttaataa 113401 taatttgaat cttacttggc atttaataag ataattactg aaaatttatg tcaggtagag 113461 tgagactggc aaacttaaaa tactataaat gcaagctctt atttttagtg agcatccact 113521 gcatgcatat actaccccat ttcatcctca gcagtctcaa tcttttaatg tattactact 113581 ttcttccttt tagatagaca agttgggcaa aaataagtta attaatctcc ccactaacag 113641 aagactcttt cctgcaggaa aataatgaag tcccgaactg aaggcaaact tctctaaagt 113701 catcagcaat ttttcacttc tcaattgcca atatcttctc actttgtcag ttttcacttc 113761 ccagacactc cttcaagcaa ctcaatctgt tttctacttc cactattcag ataaaactgg 113821 tttggtaaaa attcccaatg ctgccatgtt ataaagctga ggggcgacta 
ttctgctttt 113881 attctacctg atccctcagc accactaaac ctagctggcc attcattacc tttgccatct 113941 ggacctcccc cagcttttgc tatacgcacc ccttttctct ctttttcctt cactagccac 114001 ttgctctagg ttctttgctg ctgcttctgc tcctttagag ctctaaatgc tggcactcct 114061 catggctctg ttgcaggcca acatctctta gtcaatcctt ctgaagatgt gtctggaatt 114121 tgtgaacctc ctgaatcagc tccaccagaa agaaacaagc ttcatgaggg aaagaacttt 114181 tgcacataaa ttgctgaatt cccaacacct gaaagaatct caggcacact ccaggtgctc 114241 aacaaatatt tgctgaagga atgtgaattg cctgaagtgc ttgagatgtt cctgggatgc 114301 ccttctggac atctaaattg gaatcacagt gcatctgaga accaaaatct ctggtcttct 114361 accacaccga cttttatcca tgttctttcc ttaatctgat atcctctccc acttcctcat 114421 tctttaatgc gcagggaagc ctttcctgac cctcacctcc acccagattt agtcgaatct 114481 ccctctcgtg ccctttcata gaaccctctc ctttgtggta ctgaggtccc ttaaatttta 114541 cacatatctg tgtgactact gccagcctgt ctcagatcat acaggcttgc actcaccatc 114601 atacctccat ttcctcacta agtctccctc atggtgggcc ctcaataaat atcagtgaat 114661 aaattgttaa taatacagat tttatattaa cgttaatata tcaaaatata ataatatatt 114721 taaattgtat atttatgata taatgaataa ttaaaagtta aatgatgaca caaatttctc 114781 tctcctcagc tctgagctct cctctgaatt ctacatgcat atccccatct aggggcatca 114841 catctccaaa cggatgtcta atgggtttct ctatttcagc ctgtccaaag ccccagcttt 114901 taaattttcc aatacaaact tgtttctctc tcaatatttt actcatgtta gtaaatagca 114961 tcacccaccc agttgctcaa gcaagctctt gagagtcaca tctgattccc ccttttcctt 115021 caccagcccc ttacccatat ctaattaaac acttagtcct gttgcttata cttctaaaat 115081 atatttcaaa tccatccctt ctccatctcc acagctgctg ccaagcccaa tcttccatta 115141 acagtcaact ggatggtggc tgcaaatcaa ctggtcccct gtatccattc ttgctggtct 115201 accacagatt ttctacacag agctcaaaac atcaatcaaa ccatccccat tagagcccca 115261 tcctaattca atggcttctc actgcaatgc aatgaagtac atacacctta gtgaagctct 115321 gcctatctat ctacagcacc tcatggcatt gctcccttca ctcactagac tcccaactct 115381 cttttgtaca tactgttccc tccgcctgga atgattaccc catgtgtccc cacccacacc 115441 tcacacagct ggcttcttct cctcctccag tcctcagatt acatgctgcc ttctcagagt 115501 aaccttctta 
acagtgtcta aaatagattt cggccgggtg cagtggctcg cgcctgtaat 115561 cccagcagtt tgggaggccg aggcgggtgg atcatgaggt caggagatgg agaccatcct 115621 ggctaacacg gtgaaacccc gtctctacta aaaatataaa aaattagccg ggcgtagtgg 115681 cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 115741 aagcggagct tgcagtgagc cgagatcgcg ccactgcact ccagcctggg agaaagagtg 115801 agactccgtc taaaaaaaaa aaaagatttc gccttcagat atctgttgtc acagaaatat 115861 ttctgttctt tacacagcat ttgccacaat ttggaattac tgatttacta ctgttcatta 115921 tcttgtttcc agatagaaac ttctatgtca gtcttgttca ccatttcatc cctagcacct 115981 aaaatggagc tcagcataga ttatcaataa acggttgttg catgaatgga cagagtggag 116041 aaaaacggtc agattaatac agttccaaca gatcaatggt tgccagagcc tgagggtgaa 116101 aagaggggtt gactacgaaa aggcagaagg aagagatttg gtggtggggg tgggggtggg 116161 gggtggtata tatagaattc ttctgtatca caattgtgat gttggctaca caactttatg 116221 tatttgtcaa aactgataga accataccct ccagaaggtc aattttactc tatgtaaatt 116281 aaaaaataaa ttaataaaat tatgattcca aaacccatgt caacccagaa caaatctaag 116341 attccaagat tcacaggttt agagtttttt tgttgttgtt gttctggtgg tggagggttc 116401 agttaacttc ctgtgttcat ggtcaaagta ctgactcaaa acatacttcc ctatttttca 116461 tttatcagtt taaagaaagg aaaccggaaa caaaataaag atgtgaaaga taaataatac 116521 aagaaaaagc acagtagtaa aataatattt aaactaatca tgtgataaaa tgtcagcatt 116581 tattcatgac atttggctct gtttgtattg gatcacccca gcagggcagt ggctgtgggt 116641 tgatttcata atagtgcctc ccacatgtac gcccacgcct gacctctgct ttgtgcagtt 116701 gataatgact aaacttcaca tattatgctg acaaaatcta ctccttccct ggagggctgt 116761 tacacacagg gaattggagt ttcacagctg aacaatgttt ctttctcaaa ttctgggatg 116821 atccacttcc attagaacaa agttatacgg gctgataaat tgctgtcttt tgccaaagtg 116881 ttactcatca tcgtatattc acccaacaga tgccactgag aaaccccatc ctatcagggg 116941 gaaagaagga tgcagagata tgaatagact gtttagacac aagttgcatt ccctaaagta 117001 agcacatgac atacgaattt caccttatga catttccccc acccccaccc ccactgactt 117061 tcatccatta gaacgtgttg atggatatac catttcaaaa tgaaatattt ggatatctga 117121 ctctgatttg gggaaagggg aaatggtgag 
ctaatttcct cttttatcaa gctgaatcac 117181 taagttcgct ctgttgccat gcacgcataa gtaatgccat agcaacctct cagtttgctc 117241 ttaaacacac aacagcagaa gtatgcgaat aggtacattt tttaatcctt agattttctt 117301 agcagcttaa gttttataag acattttatg gcaccggatt atttagttca cctttggtag 117361 tcttattgct gactgacaca tatatcatca ataccagcac aacagaaagg cttagttctt 117421 ggaaaatttt ccaataactt tcctttattc tcaatctcta gctcacaatt aaaaatttgc 117481 acattggacg ctatagtctg gatcctcttc aggattttct ttataaagat cctaacttta 117541 aatgggactg aaatatataa tttgaaattc atgaaatata gaaatacatg gacctctcat 117601 atagctattt aagtaaattc aatgaaagtg tcattatcac ttggtatgta ctaggaatta 117661 tgtcaggttc tagaaagaaa gaaaatcttt actatcaagg gattcagatg aactcctgta 117721 gaaaataagg gatgggtact cttgaattct gccactttga ggtttagatt tttatgccta 117781 caaaatatat ggtaacacct aactttatac tagtttaaga aaagtccatc taaattgaga 117841 aaatgtggat ttttaaatag ttttattacc tcatgcattt cttcacaaat gtgtacactt 117901 ctttctactg gatgacaaca aaatgagagc actagtacta tggaaatgcc ttttacaaaa 117961 caccatgtca ctgctctctg tcaagtgtct ttggctgata gactggatga ataattgact 118021 tgttcttgaa atacctgaaa tgggcttcag attccatact tgaatctaac attatttagc 118081 cacttctgcc aaatatttta attacttaag gagttaaggt tttgttattt tatagagagg 118141 caatacactt caataaaatt ttagatatcc catccaccct tccatgagca tatgttaatg 118201 tatccaaagc taagggaatt tggatcaagt cattttcact taagcactta tgtataattt 118261 gaaggataca catcaggaaa cagaaaatat caactttctc attgaccaga tgcgtttaaa 118321 aatgtctcag tgaactggca tcccttcatt tccgatcagg tagagagctg tgcattcata 118381 aaactactag gcttgttctt aagcaaagct tgctgttgct agctcttgct gctcttagaa 118441 gtgggagcaa aaggtttgat tctatagggc tgtcctgatt gtaactatag aattaggaag 118501 ccctctatat ttcttgatga atcatatttt aagttatttg tatactatga taaaatgatt 118561 tttaactttt aggaatatta acttgaggcc ttaatagatg gtccccaggg ttaatattca 118621 tttgaaatgc taatatagag aaattcttta cattaatgat tgtcaaagta tagtccctag 118681 gagagaaatg tcaatcttag ctggaaactt attcaaaatg caactgtttg aaccccaccc 118741 cggacctact gactaggcac tctggagatg aggccggcaa tcagtatttt 
aacaagccct 118801 ctaggtgact tcaatgcact ctcaagtttg aaatccactg ctttatataa tacacattgc 118861 tcatctggga taatacatac agagcaattt atgttgagaa atagctcttt tcttcttttt 118921 gatggtgtgt tgttttttgt gtgtttgctt gcttgttttg tattctgata aaagtaattg 118981 aaaatgttta atataaggat ttttgcctat cccagtatgg ttatgacaca agatcacttg 119041 gttcctgata aactaaaaag tcattgtaaa tgttgttttg aagaacaacc taattaaaat 119101 atattttatt agcagcaata ataagaggat gactaatttg aataatctag caacttgaga 119161 ctggggaaac aatatggctt agagtcagag ggactttagt ccagcttaaa tcagggttac 119221 tcaatataag cattattgac ctttggggcc agataattct atgttgtagg ggtgctatag 119281 gatatttggc agcatccctg gtctctaccc actagatgcc aggggatgcc ccttctcagt 119341 tgtgataata aaaaatgtcc cattgtcaga tgtccccttg cgggggcaaa tagtccctac 119401 ttgagaacca ctggtcggtc cattgggggt tctcaacctc agcagtacag tagaatcaat 119461 taatactgat gcccggccag gcgcagtggc tcacgcctgt aatcacaaca ctttgggagg 119521 ccgaggtggg tagatcacct caggtcagga gttcaagacc agcctggcta acatggtgaa 119581 accccatctc tactaaaaaa atacaaaaat tagctgggca tggtggtggc aggtgcctgt 119641 aatcccagct actcaggagg ctgaggcagg agaatcactt gaacccagga ggcggaggtt 119701 gcagtgagcc gagactgtac cactgcactc cagtctgggc aacaagagtg aaactccatc 119761 acaaaaaaaa aaaaaaaaaa aaaaaaaaga aaagaaagaa aagaaaagaa aagaatactg 119821 ttgctcagac tggttggatt gaacctctgt ggataggctc agaatcagta tcttttcaaa 119881 gctcggtagg tgatacaaag gtccagccaa gactgccaaa cactgtacag acatcccagt 119941 gcatccatac ccttacctgg gagagagctg taataccatc tattgtaact atcacacaag 120001 atcagatgag agagcattta tgaaagcatt tataccacaa ggcaattagt atgttaaagg 120061 cagcagagaa aaataatgta cttttcaaag ttgccattga aattgaaaaa tatatccagc 120121 tctctattat gaattaggat ggtttatatt tactgtcttt tagattcata tcccattccc 120181 aataggaaag agatgtgatg cattctctat gtctctcttg ggagtagatt catgatatta 120241 ttcagtcttc gatgggcaga aaaccataca aattggatca aatcgttcat actatgcacc 120301 agtgatgctc tgaatatata ctataaaatc aatattcttt ctattctttt gcctctcaat 120361 aaccaatttt aataagtgga aaataattca gattcctctg acaatagcct ctttagaaag 120421 aatggggaaa 
attcttgggt tacaattgtg tcaatgaaag ggattcttta ttttaaaaaa 120481 gataaaaata tccaaagcaa ataagaacaa atgttaacag ttactaaatc tgagtaacaa 120541 atacatagat acttgttttc gtttttctat atttttcttt gaagtatctg attcaaactc 120601 aaagaggatg ctaaaatcag tttgaacaaa gcaccagtga atacttcaaa gaaaaataaa 120661 gaaaaaagca cgtttttctt ggtgccaggt gactatgtag accccatggg gatggggagg 120721 ccagctgggg gccaaggggc tgcatcccac agtgtctacc tcagtttata ttgccacctt 120781 acgcaggcag ggcacgttag cctttgatat tcaagagcag ttctaactga gcagttcagc 120841 tagggtgtaa ggaaatctta gtagttgaat ttttttttca tgtgctctaa cagcttttaa 120901 aacaaggaaa ttattatcaa tatctgcttc tactctccct aatgagttgc aaaagtgtgt 120961 aacccctgcc tttctctcta atagaaccaa tatggtagca atggggacat tttacacaaa 121021 agtcagagtg aaaagatgga cttttaggca catttttttt ttttaagaga tgagggtttc 121081 tctatgctgc ccaggctgga cacaaactcc tggggcacaa gtgatcctcc cacctcagtc 121141 tctcaagtag ctgggattac aggcgtgcac caccatgctc agtttaggca tatattataa 121201 aagaagcaat ttgcctgaag cactacactc ccctgtagac tgtggacaga taagtaagca 121261 aaacaatttt tctcccagag taggagttgg caaactacag cccctgggac aaatccagtc 121321 cactgtctgt ttctgtgaat acagtattaa tttaacacag cccagtccat tcgtttgcat 121381 attatctatg gcttctttca gctacaacat cagagttgag tagttgtgac aaaatctgta 121441 tagcctcaaa agcctgaaat acttaccatc tgtcccatta cagaaaagtt ccctaactcc 121501 catcctaggg tattaacata tgaatttgtt ccatagagcc actgtaatcc cagcactttg 121561 ggaggcggag aggggaggat ggctggagcc caggagtttg agaccagcct gggtaacatg 121621 gcaaaactct gtctctacaa aaattagcca ggcatgctgg tgggtgccta tagtgccagc 121681 tactggggag gctgaggtgg gaggatcact tgaacccagg aggtggaggt ttcagtaagc 121741 tgagatcatg ccactgcact gtagcttggg tgacacagtg agaccctgtc tcaagaagaa 121801 agaaaagaaa gaaagaaaga aagaaagaaa gaaagaaaga gagaaagaaa agaaaagaaa 121861 gatttaataa caagtggaca aaataaaaaa taatatttct caatgacttt tatagtcaca 121921 cagtagtata catttatact acatttttgc tttaatacag aaaaaacaat tataaagacc 121981 caaatattgt caccttttct cttgaaaaac aaacaaacaa acaaacaaac aaacaaacaa 122041 actagtgtgc tctacaaagg aaacctaaaa 
gaggatgtgt tgataattaa taatgtaaca 122101 cattgttaaa atatcactgt ttgatagtct ctcaacctag ccatttccta gagtaagact 122161 attttactta tatacctatt tgcagttgaa taactgtaat acaggtaaaa atactaatga 122221 aaatttatag gcaaaaatat ttccctttta tagaaaaata gaagtcacag ctgtacgcta 122281 gagaaaaata gaaaagtaat cttttctttc agtccttaca aaataattac ttctataggg 122341 agtaaagatg atatttgtgt tttgacataa gtaaaagctt atttttagtt catttcatat 122401 gcctggactt tccaatgttt ggggaaattt taagaaatgg agacgttttt gtagctatat 122461 aagactctat tatttccaga agcctcataa attgaaaatg agaccttact gacattcttt 122521 cttttgtttc ttctcaaaag gattcccaaa taggaaaaga gtatgtaata aaacagagag 122581 tgcagagcta tggttgtcaa ggcaactgta ataatcttgc ccttgaactg tggtagagca 122641 tccactgcag ttcaatctaa cctcatgttt cctcagaatg cttatgtatg ttcgaaatcc 122701 ttattatata gcaatatcat tccaatcaat caggtgatga taggaatgta gaacacgaac 122761 tgaaatgtcc atggcattcc cattatggaa ggcagacagg aaaatgacac ttgctagcat 122821 acagaatagc taagagaaca tgtccttcca ctccacgcca ttcaacatgc atttgggaca 122881 ctatactggc tactaccatt aatagaaaca taagtccagc ccacaaagta gtgaatccag 122941 attaaatgag ctacaatcct gtgataggtc cccatagccc taggaactat gacaaactac 123001 agaatggctg gtatacacac tcacaaattg ccactttgtg gaaaaaatgg tcactattta 123061 tgatcatcag gttttcttca tagtgaagag aagaaaactt tccccataca atttggaaga 123121 aactgttact acttatttgt atctcatctg atgcttcctt caggactgca gaggattgtg 123181 ttttctctag gagatagtca gtatcttaga ggctgggcag ctagagtttg cctcactgaa 123241 cactaaatga tctttgctta tattttccaa ctcagacact cctcaattct gaaagtctgt 123301 taagatagct ggatatttca aagtccacat gcattttccc atagaaataa agatacagtg 123361 gttttggttc gtaggtcagc tcaagtagtc tcatttaacc tatgacatac gtgaagtaca 123421 gagtgggaag tacaaatgta aaagggaacc ccattctcac cgttgtgaga ccctggaaaa 123481 cagcaaagag tccccatcat gggatagggg aaggagcact gggttgggat cagaagtctt 123541 tgtctgtatc agttctttct ctgtgcctcc aggccacaga ccatctcctc tgaagtcagt 123601 gctgaggccc ctggatccag ctcaatcact tagctcacca tgggaactca gcacatcatt 123661 tactggaaat aaaataataa gcaccttgcc atttgatctg aggtcgtcat 
ttagtcattt 123721 gaacatttat ttttctgttt tgttttgttt tgagacgggg actccctctg ttacccaggc 123781 tggaatgcag caatgtgatc ttggctcact gaagcctcga cctcctaggc ttaagcgatt 123841 ctcccgcctc agcctcccaa actgctagga ctccaggtgg gcaccacacc tactaattaa 123901 aaaaagtttt ttttaagaga cagggtctca ctatgttgcc caggctggtc tcaaactcct 123961 gggctcaagc aactctcctg ccccagcctc ccaacatttc aatatttctg agattcaaga 124021 ttctcctcca taaattagat gtttgggaaa aataggccag gcactaagct tctcttactc 124081 tcaaattgta ttacccttag atatgtggat aactcctgtt attcctgttt tttttttttt 124141 ttggtataat agtgataaca atgggttcca taaagggtgg atgggaagat atcataagga 124201 tagagacaga gttaggataa gcattttact attggtgggg gattgaaaac ttatattttt 124261 ggaaaggaat ttgctgaaca aaatagacac tggcattttt tgctgaattg acaatgtgat 124321 aagaagcact ccccaatttc ccctctcagt tatttttgag taagtcctac cccatcttcc 124381 tggtgtctcc ttccactcat taaacaaatt tctttgtttt ttgtttctag tgtatttatc 124441 accatctcac ttaccttatt ttttaccgtg tatgtattgg ccatcctttg gccctctaga 124501 aataaagtac catcagggca ggggtttgtg tctgttctgt ccagaactgt atctccaaag 124561 cctagaatag catctgcaca tagtaggcat tcaaaacata cttactgaat aaataaatga 124621 atgaatcact gaatgagagc aaaaaaaaag aatactcttc ttcagatcta aataaaatag 124681 acatttcaaa gtatatgttg aggcatccac tgaagacaaa ttttggaata ttatgaaatc 124741 cagtaggagg aagattattt tttgtcagaa aaactaaggt aaggagaccc aataccccag 124801 actactatcc aaaagattga gtaaccaggg tgattgctta gatacagatg ctcttgatat 124861 attagaagta aacatgtata aagaaggaat gcttcttgta tttctactta ggcctgggaa 124921 atgtctggta caatggaaaa caccctgagg cagaagccac caggcatcca gaatcagtgg 124981 attcttgctc tgcctcttag tacctgttga ctttgagcag ctagaggact tcccaaactt 125041 cagatttctc acttataaaa ctggaatatg gccccctgcc ttacaaggtg gttgtgaggg 125101 taaaataaaa cgtaaggaaa cccagaaaaa gaagatcttc agtaaatgat ggccactcct 125161 actattgcca tcaccaacag tagcctgctc aagtcactta acccctctgg gctccagttc 125221 ccttttctag aaaatgaaaa cattggacga gaatgatttc aaatggacct gccactccaa 125281 aaatcctgta tgctcagttt cactccctca aatatgtaac tatggggctg ggtgaaaaaa 125341 caagatcttt 
taaagagaac ctagacttag ctatagaggc ttattaaatt atctgtgtta 125401 taattctaca ttcataaccc taaaatataa ttttccctta aaaatattcg gaaaaaaagt 125461 ttcccatgtc ttggcaaacc aaactccaga gagttttcca gtgtttggaa ggggctgaag 125521 ggccccaata ttttaaagat gaatataatt tggtgctgaa atcttccttt tgtgtttatg 125581 actatgtccc aaaacagaat cctagaagag agaagtgtag cttagaaata tcaagagact 125641 gcctaagaac ccagcagggt ttcatataca ttttagttga tttacacttt aaaattgtta 125701 ctcacttaca tgtttccaca tttctactgt cccaaacaga atgaatttga caatttgtga 125761 atcatgaaaa aagaaagaaa atcaacttaa taaacaagtt attttccctt tgggtcatat 125821 gtaccccata agataacttt aaagcagcac ttacattaca gagacattta cttgggatta 125881 gcagaaaata attttcaggt tgctggtcag tctttaaaaa tctttttaaa gaaattatgt 125941 attcaaagag ctcccggggg aaaaaaatct gttaactata atacttctga ccaacaaaaa 126001 aacctaaatg gtctgcaata cttctaagtt attagtgaaa aagacttccc tttagtacaa 126061 cacaattttt aattccttga agtatgtctt ggggtaacaa acacccccat cctttaggtc 126121 atattcaaat tacacacaca cacacacaca cacacacaca cacacacaca cgcatgcata 126181 cacagtgcac cacccacttc tctcatttcc tcctgtaacc acaaaacgga atgttttact 126241 acaatcatgc acacccattt tatttacata aaaatgatgc ctttaaactg gtaaagaagc 126301 aggagatcat aatcagaact gaaattagtc aaaatgaaga aaaatctttt aaaaataaaa 126361 atgaacttta cattctaata tatttgtgcc tggatcaaga gctactatat taaaataaac 126421 attcatatca tataactaaa tacacgtgac ctagaaacac ccaacttaca aatatcctgt 126481 acatgagaac agtcgaccca agtgagccag gccaggaaca gggcaggctc tccccagact 126541 tgaaggctag agaaggattc ctctaggatt cccctccatc tcgtcctgct ggatagtggc 126601 gttggaagtg attagagtag aaaggggaag tgagaagcca tgggccttgt accgcctcag 126661 tactgtcttc ccaccattct caacgaggac tccatttccc ataaatgtta caaagagtgc 126721 cttggttccc aggccggacc gtcagaagtc tgctggaccc acagtgcagc tggaactgtg 126781 ctattaatac taggctgact tcccacttgg aagccaactc cagtcctgac agtgctgtca 126841 gctctgttat ggttaaatgg atttctgggc ggtcctggga gcctgccagt gagaacatcc 126901 tttctgtggg aaaattccta ccgaggccca aacagctcac tgtcaaacag ggttgtagac 126961 cacaagttgt gagagcctta ggcattgacc 
aatcacttct cacctttccc ttctggaaat 127021 acttgagtga tcattggagt caactcttac acagagaaac ataaggcaag tccaacactc 127081 agcatgtgat ctgttccatg gcaagtaatg tgtatatttt actgttgtca ttaataaaac 127141 gactacaaaa ataaattata cctcagtaac accatgttct aatcaactga gagacatgat 127201 cacttaatag tcctcttgaa taagtggaac ttacgtgtag aaaggttact gttctcatca 127261 atagcaaaga ggtagtttgg taaaagttga tgaaatacaa gatcaaatag tttacccaca 127321 caacacacac acaaagacag ctaacaatgc aacagagatg gggagttatt atagtagcag 127381 attgtaatct gctaggagat gaaagatatt cactgcctgg gagccttaag tacacactgt 127441 taactggtcc aaataaggca atggtcatgc tgaagcaggt acctttggta tatcttcagt 127501 ttatttacca gtaagaagta gggcctccct tatctactta aagacagcac agaaaccact 127561 gaggaccaaa ttagggcatg aaaaagtaga ggatctccca tatctgaggc cttaagtcag 127621 ttgatcaaag ttcttaacca aaggacaagg ggacagacaa gggccaactt cagtgtagtg 127681 acttgtcctc catgtagggg gtcagaagag agatttcggc acatctgaat aattctcaat 127741 tctccagtac tgagtcacta atgataagtc agctttggtg ttgtatctct cttacttaat 127801 catccccaag gataaaatat tccagtaccc atggggaatc tgaaatgagg taagagcagg 127861 tttcttaaat gcctgagaat gatattttag ttggaatata atatatggat acacacatat 127921 gtagactata tattatgcat atacactctg ttcacctgga ttgatgtata catactatgt 127981 ataggtatat atactacata tatatgcata tgcacatata tataatcttt cttttttata 128041 caatatatgt caactgacct ttagaaatgc cttcaaatta tatgattcag ttcaagcata 128101 cttagtaccg tatccatctg tgggtagcca attagaaatt catgcctcac taacatgttt 128161 ccagactgaa ggtaaattgt aagcatgcac atctgtgttg gttgtgaaaa acaacatcct 128221 ttgcactatt gaagaaatct taatgccaac ctaccctcca gagaagcctc agatccttgg 128281 aaaattttca gtttgatcaa cacacagagg taagccccag caaaagttct ttaacaggta 128341 tttggaaaaa aggcaacaaa gaatttccct ctctaagtta ggctgactag tgtggtgctg 128401 tggagacaga aggggatatt aattccacat agtgctctct cccaggcaaa gcattttaaa 128461 acaaaaacag caaaaaaagt tctaatttca agtcaccctg agaactaaac ttcttccttt 128521 caggacattt actgaacttg gccagtgctt agaagtcttt agtattgagc tatttgctag 128581 ggtgtcctta tttaatcaga aacaacctgt ctcaggcaac agtattgctt 
atagtgtgtg 128641 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tatgcatggt agtatgtcaa 128701 tctgagccaa cttgtcccaa gggccacacc tatttcttga gagttaaaca aacccgattt 128761 aggataagga gtttgaaatt agagatgtgg caagtagccc aaatgaattt ttgtctgacc 128821 cgtcacttct attttagttc tctcttccct aattgttctg cctcctctct gtactttctg 128881 atcattttcc ttgacctcat cattggaaag ggtgtactca tagagtgggt gaagccatat 128941 tctagaaaag tcactgccta ggatttttaa tgagctaaca atcgtttgtg actacatcag 129001 gcaaactttt catgcaaaga aatgcttaga aagcccaaaa agacagataa attgattatc 129061 tccaagctac caaaacaggc tggatttttg gtgcctgctg gctattaaat ggttatgtag 129121 aagaatttgc tctttcatac aacctggtag tgacatggta gtacagcagt taacttttac 129181 cacttgtcac ttctccagtg tttccattat ctgccaaaga gaacctacca gttgtgaatg 129241 tctaaactag ttagggtcag tgagcaacac ctagcctaca agtttgttgc aatgattaca 129301 tgagataaaa agtataaagc acttatgtag tacctggcac atggaaagag ctctctctct 129361 ctctctctct ctctctcttt ctctctctct ctctctctct ctctctctct ctatatatat 129421 atatatatat atatatatat gttcatcagc atcatcataa tcattataac catgatcacc 129481 accaccacta tcatcaccac catcattgtc atcatttatt gtgaaatgag gactactgat 129541 tataaaattt tagttaggac aggagtacca tccatgatag aataaactga aaggtaaaca 129601 caataataca tagttctcta tgtagaagga tcccctgaga gttcatcagt ctccacatat 129661 tttccccaca aaccatatat ccacttttct tccattacca tggaatctga ctccaaaaga 129721 tttagataaa ttactgattt cagcagcatg ctggacaaat ttttccggaa actttgtgga 129781 tcagacagta aaatgtagca aatacagctg tacatacaga tggactgaag aactatgcat 129841 atcaggtact aattaacaaa gtaggcaaat ctaccaccac aggtacctgg tcccatcttg 129901 tttaacatct tcatcagaaa cttacataaa agcatgggca gtgattatca aatttttact 129961 agacaactaa gctaggaaga atgcataata taatagaaga gagaattggg gttttaaaat 130021 acaattgtaa tttatatata ataatttaat tagattaaaa ggttaggatg atactttaaa 130081 aattcaagaa attagaaatt tgcatttaaa ttgtaacaat taattagaca gagagaggca 130141 acttgagttt aattctgtga aaaggaagag gtaggatttc taatgagcca acaattgtcc 130201 gataagctca aatgaactcc aagtctgaca aaagcattac accaggcaat ggaaacttga 130261 gctgcatgaa 
aaaaactttg gaatccaact caagaaaaga gaaaagtgac cacgcaatgg 130321 attccacccc agccagagca cacatggagc actaaactct ggtctggagg atacgccttt 130381 gaaagacatt gaggcatcca gcaagttctg aagagggtag tgaggctaga gaggggaaga 130441 tggtgaaagc ttggtcattt gacaaacctt tgaaatccat gaaagaaact gatgttgaga 130501 ctgaagttgc aaaggctgtg cagtaataga aatgctgacc taaaatattt gaatggtgaa 130561 gtagtagatc tggcctatct ttcaagagag cagatttcaa atgacagaat gagaatttca 130621 ggaagtaggc ctgggcctcc ttcaaggaga ctatatctaa agaaaacatt cacccaagga 130681 taagtgactt aagggtaagg ggattcttaa tacttaaagg ttcatgcaga taaccatctg 130741 attatccatt ctgaatgtgg taaaaaggaa tcctgttttg ggtgagtggc caaacatatg 130801 accgttaggg tgccacaact gtaattcagt tctcctgtgt aaagtgtcaa tgagtgatat 130861 gaggaagagg gagggcttcg ttagaggaga aggggatggc tggaaatggg gaaagagcag 130921 tttacggctg gctttgggta gctgaagagt gtagcaggta gcatagttcc catgataaat 130981 gaattcatac taattaatta ataaaggctc aaaccaaact ttggaatttt actcaatgct 131041 agtcctattc taatgacaac tgagtcaaag atagctgccc aagtgcccta tggtctaatc 131101 cattccctcc ccatgctgga atgatgtttt attctttttc acagtcctca tgacacctta 131161 ggcactaggc agcagcttta ttaactttct acttttgctc cagggagttc aggctaaaga 131221 atgaggaatg attagtcaac agtagaaact agtttgacac accacatgtt ctcactcata 131281 agtgggagtt gaacaaagag aacacatgga cacagggagg ggaacatcac acaccggggc 131341 ctgttggggg tgggggacaa ggggagggag agcattagga caaataccta atgcacgtgg 131401 ggcttaaaac ctagatgaca ggttggtaga tgcagcaaac caccatggca catgtgtacc 131461 tatgtaacaa acctgcacat tctgcacatg tatcccagaa cttaaagcaa aataacaaac 131521 acacaaacaa aacaaaaaca aaaagaaact tgtttggcaa gatgactatt tccttgacta 131581 ctttcactag gatatatgac ctcaacctaa gttgaagtac tttgacatca aggtgatata 131641 aacatttttg ctgttttttt gagaaaaata tcacagtagt atgtcaatgt gggttttttt 131701 cttaaatctc ttgatcactg ttaattttta aaatttgtct ttttaaaata tactttttaa 131761 tgtctgatgg gacattaaag aatattgtta gactgtgctt caacactata gtgatttaac 131821 atattttgat ataatataaa atatatagca tacattatgt tttgatatat gtaccttaat 131881 atctaacaaa actttctaga agaattacat 
ccatttatgt attaaattat actaaagaaa 131941 attgagaatt tatttcatat ttaacacaaa atatccatat tgcctggacc cagaaatgag 132001 gaaaatgatg ctacagtgtt aaagcaaggt tagaatatta ttgtagagca taattaatgg 132061 agggagtgca ttttcaaata tcctctttac ctccttgttg ctatgttggc tacacacata 132121 atactacccg agcctgtacc tctccttaat ccattgaaac cttttcttgc tgaggaattt 132181 catcacagcc atattacctg ccagtaagtg atagttcagg tcaatttgaa atgccacggc 132241 ttttgatcct gttgtctctg agtatttcat tttagatttt aattacaatg gtattgtgct 132301 tatggaccga ccccaaactg cattgaattg ggaagtcatt ttcctttatt taccttttat 132361 aatataattt caaaatgaca ggctcataca caaagcataa ttctttatag acactggctc 132421 atcattggac ttaatcactt ccaaagcaat gaggatggaa gaaatccagg gggtcctgat 132481 tcatggagat tcctggcttt tttcccactc agttgatgga aaatgttttc tctttttaag 132541 ttagatgagg gatggggtta gggaaactta catcctacac tcttatgatt tggctcatgg 132601 ctttattgtt ctccaaattt tatgtgtttg ggtcagttaa tcagactcct aactctatcc 132661 attgtattca aacagatgta gtactttttt ctcctttctt cccccaaagc tgcattactt 132721 tttggatatg gggtctctct ggccagctgc aaagattctg cctttgtgag agatgacatc 132781 acccccatgc taataagtct tttaaaaccc agaaattatc ttgggtatcc tacaagtcat 132841 gaacagcagt acactaggga tatagcgatt tacttaataa atatctgttg tgtggataga 132901 aaagctagaa ctactagtca ggtctgcttt ttctcgaact tgagtcttaa ttatatcttt 132961 ctttaaaatt gagatataat tcacatacta taaaatttac cactttaaag tgtacaatcc 133021 agtaagtttt agtatattca gagtcatgca accattacca caatcaactt tggagcattt 133081 tcatcacctt aaaaagaaat ccatacccat taacagttac tcctcacatc cctctccctc 133141 aacccctagc aaccatgaat ctactttctg cctctgtgga tttgcctgtt ctggacattt 133201 catatatatc cagtcataca atatgtagcc ttttgggtct ggattcttta cttagcataa 133261 cattttcgag gtttgcatgt gttgtcgcag gaatcatccc ttttgtggct aaatatattc 133321 ttctgtattg atatatcaca tttgtttatt tgttgatgaa catttggatt gttttcaatt 133381 cttagctatt ataaataaca ctgctctgaa cattcatgta catgtttcca tgtagacata 133441 tatgttcatt tctcttaggt agataaccaa gagtggagtt tctgcgtcat atggtagttc 133501 tacgttttaa ctgtatcttt ctttctttct ttctttcttt ctttctttct 
ttctttcttt 133561 ctttctttct tttttttttt ttgacagagt tttactcttg ttgcctaggc tggaatgcga 133621 tggcgtgatc tcagctcacc gctacctctg cctcctgggt tcaagtgatt ctcctgcctc 133681 agcctcccga gtagctggga ttacaggcat acgccaccac acccagctaa ttttttgtat 133741 tttttagtag aggtggggtt tctccatgtt ggtcaggaca gtctcgaact cccaacttca 133801 ggtgatccgt ccaccttggc cacccaaagt gctgggatta caggcgtgag ccaccgtgcc 133861 cggccttaac tatatcttaa ttgttttcct caggtagcta tgagtggtct caattcttct 133921 gtaaattctt ttatgattta gtgacttcct gcctttttat aatagaaaaa ttggtttggt 133981 atattttatt ccattttatc ttcataaagg tcctccaccc atgtctagaa acattttaat 134041 gcaaagtcct aaagaaagtt aaatcctaag cacttatctc cttaaaaata aattcagctt 134101 gatttagaag aattctgtat gtaaaaaaat gggtaaattc cccattcatt aataactctt 134161 tgttataata ttatctgtta tggcatctcc ttataaagca aacacctcag gtactaggta 134221 ttacaaataa tttcttcctt gtgtgaatgt caagtcctct gagagctgat ggtatgaatc 134281 tgtacagaaa cattttctgc tcactgtaaa atctggtagt gccaaaccca tcttatatga 134341 gacaaacgga tcaatctgaa ttagtgacac caaaaacaaa caagtttttc catgtaaaca 134401 tttttatatt caggaacaca caattattca gattttcagg ataagaaact ttgttagata 134461 ataagaggac attatgttta ataaacaaat taattctgta agtggcactt aaaggaagat 134521 taaaggtaac caaacagtct tgctcatatg tattatcaat acaacttcta aatcttattt 134581 agcctataat aaaaggcttg ctaaaaatgt gtatcagtac caaacctcct atcaccatat 134641 tccaatgtat tcaaaattcc taatcatctt tgcacaagca ttattaagcc agagcaaaca 134701 cattcaaaag ctagcagaag gcaagaaata actaaaatca gagcagaact gaaggaaata 134761 gagacacaaa aaacccttca aaaaattaat gaatccagga gctggttttt cgaaaggatc 134821 aacaaaattg atagaccgct agcaagacta ataaagaaaa aaagagagaa gaatcaaata 134881 gacgcaataa aaaatgataa aggggatatc accaccgatc ccacagaaat acaaactacc 134941 atcagagaat actacaaaca catctacgca aataaactgg aaaatctaga agaaatcgat 135001 aaattcctca acacatacac cctcccaaga ctaaaccagg aagaagttga atctctgaat 135061 agaccaataa caggctctga aattgtggca ataatcaata gcttaccaac caaaaagagt 135121 ccaggaccag atggattcac agccgaattc taccagaggt acaaggagga accagtacca 135181 ttccttctga 
aactattcca atcaatacaa aaagagggaa tcctccctaa ctcattttat 135241 gaggccagca tcatcctgat accaaagccg ggcagagaca caaccaacaa agagaatttt 135301 agaccaatat ccctgatgaa cattgatgca aaaatcctca ataaaatact ggcaaaccga 135361 atctagcagc acatcaaaaa gcttatccac catgatcaag tgggcttcat tcctgggatg 135421 caaggctggt tcaatatatg caaatcaata aatgtaatcc agcatataaa cagaaccaaa 135481 gacaaaaacc acatgattat ctcaatagat gcagaaaagg cctttgacaa aattcaacaa 135541 cccctcatgc caaaaactct caataaatta ggtattgatg ggacgtatct caaaataata 135601 agagctatct atgacaaacc cacagccaat atcatactaa atgggcaaaa actggaagca 135661 ttccctttga aaactggcac aagacaggga tgccctctct caccactcct attcaacata 135721 gtgttggaag ttctggccag ggcaattagg caggagaagg aaataaaggg tattcaatta 135781 ggaaaagagg aagtcaaatt gtccctgttt gcagacgaca cgattgtata tctagaaaac 135841 cccattgtct cagcccaaaa tctccttaag ctgataagca acctcagcaa agtctcagga 135901 tacaaaatga atgtacaaaa atcacaagca ttcttataca ccaataacag agagagagcc 135961 aaatcatgag tgaactccca ttcacaactg cttcaaagag aataaaatac ttaggaatcc 136021 aacttacaag ggacgtggag gacctcttca aggagaacta caaaccactg ctcaatgaaa 136081 taaaagagga tacaaagaaa tggaagaaca ttccatgctc atgggtagga agaatcaata 136141 ccgtgaaaat ggccatactg cccaaggtaa tttatagatt caatgccatc cccatcaagc 136201 taccaatgac tttcttcaca gaattggaaa aaactacttt aaagttcata tggaaccaaa 136261 aaagagcccg catcgccaag tcaatcctaa gccaaaagaa caaagctgga ggcatcatgc 136321 tacctgactt caaactatac tacaaggcta cagtaaccaa aacagcatgg tactggtacc 136381 aaaacagaga tatagatcaa tggaatagaa cagagccctc agaaataacg ccgcatatct 136441 acaactatct gatctttgac aaacctgaga aaaacaagca atggggaaag gagtccctat 136501 ttaataaatg gtgctgggaa aactggctag ccatatgtag aaagctgaaa ctgcatccct 136561 tccttacacc ttatacaaaa attaattcaa gatggattaa agacttaaat gttagaccta 136621 aaaccataaa aaccctagaa gaaaacctag gcaataccat tcaggacata ggcatgggca 136681 aggacttcat gtctaaaaca ccaaaagcaa tggcaacaaa agccaaaatt gacaaatggg 136741 atctaattaa actaaagagc ttctgcacag caaaagaaac taccatcaga gtgaataggc 136801 aacctacaaa ataggagaaa atttttgcaa 
cctactcatc tcacaaaggg ctaatatcca 136861 gaatctacaa tgaacacaaa caaatttaca agaaaaaaac gaccaacccc atcaaaaagt 136921 gggcgaagga tatgaacaga cacttctcaa aagaagacat ttatgcagcc aaaagacaca 136981 cgacaaaatg ctcatcatca ctggccatca gagaaatgca aatcaaaacc acaatgagat 137041 accatctcac accagttaca atggcaatca ttaaaaagtc aggaaacaac aggtgctgga 137101 gaggatgtgg agaaatagga acacttttac actgttggtg ggactgcaaa ctagttcaac 137161 ccttgtggaa gtcagtgtgg cgattcctca gggatctaga actagaaaca ccatttgacc 137221 cagccatccc attactgggt atatacccaa aggactataa atcatgctgt tataaagaca 137281 catgcacacg tatgtttatt gcagcactat tcacaatagc aaagtcttgg aaccaaccca 137341 aatgtccaac aaggatagac tggattaaga aaatgtggca catatacacc atggaatact 137401 atgcagccat aaaaaatgat gagttcatgt cctttgtagg gacatggatg aaattggaaa 137461 tcatcattct cagtaaacta tcgcaagaac aaaaaaccaa acaccgcaag ttctcactca 137521 tagatgggaa ctgaacaacg aaaacacatg gacacaggaa ggggaacatc acactctggg 137581 gactgttgtg gggtggcgga aggggggagg gttagcatta ggagatatac ctaatgctaa 137641 atgacgagtt aatgggtgca gcacaccagc atggcacatg tatacatatg taactaacct 137701 gcacattgtg catatgtacc ctaaaactta aagtataata ataataataa taataataaa 137761 cactgtaatt ctatattgtg atttgcttcc catttgaaaa aaagtatttc ctcccttgtc 137821 ttctaagaag gtactccttg agggatttta tagagttcag atttgctgtc tttctcctgc 137881 ctccttcggc accagtcaaa tcattttgta aaattcactg ggattaacag gcattagtaa 137941 tttgtttatc ggtcacatag ataaaatttg cctttgaatt ttatatgaat actgtccaca 138001 aacacagaat gttttttcta tcttttcagt tgactaagaa aaaaactaca cacatgcgca 138061 tataaaaagc actgtttcct tctgatactt catttgaaag cccctcaaaa gcagaagaaa 138121 aaaaataaaa ccatatgaat taaagttaag gtagaggatt tttccaacag catcatttct 138181 gctggttact aagttccaca aatagctaat gaggccaacc acctttgaaa tcaaagtgct 138241 gtgccccatc tcctaatgga gccacagtat gctaagggaa gtgaaacttc tagagtcagc 138301 acccagatgg tgttgacaaa gaaacccacg tgcatggtgc agtgcagact attttttaaa 138361 ttggcctcaa actctaaaaa gcacacaaac acacatttgg atcacctctg ttccaccaca 138421 caaaacagtc ccagtttgcg tgactcccca ctgcctggtt cacatgctga 
gtaactggat 138481 gctatttcaa tatttagctc aggccaacaa cattagcaaa gcagattttt ctcaccttat 138541 tgcccatggc caaatatata actttgaggt ggtctgcgcg gtctgcttct ttccatatga 138601 gtataacatt gacctgtatg catatgattc tcagcaatgt agaggtaact gctcacattt 138661 gtatcttagt aagagggaat tgatgaagca ataaatgttt aacagctagg tgtcatctcc 138721 cagctgtgta tgaatttctg ttataatatc aatgacaaat cattgactaa gcttatctta 138781 tagacagaag tccactgaac ttgcttttag acaacctaag tcgaaattac ttttctgaat 138841 gtaaattaat tcagccactg taaaaagcag attggcaatt tctcaaagca tttaaaacag 138901 aactaccatt cagcccaaca atcccattat tgggtatata ctcaaaggaa tataaattgt 138961 tctatcataa agacacatgc acatgtatgt ttatcacagc actatccata atattccaaa 139021 gacatggaat caacctaaat gcacatcaac agtagactgg ataaagaaaa tgtggtacat 139081 atattccatg gaatactatg cagccataaa aaggaacaag atcacattct ttgcagcaac 139141 acggatggaa ctagaggcca ttatcataag tggaggaaca taggaacaga aaaccatata 139201 ctgcatgttc tcacttgtaa gtgggagtta aacatttagt acacatggat acaaagaagg 139261 gaacaacaga caccagggcc tacttgaggg tggagggtgg gaggagagtg aggatcgaaa 139321 aactacctat tgagtactat gtttactaca tggttgatga aataatctct acaccacagc 139381 cccacaacat gcaatttacc tatataacaa acctgcacat gtacctttga aactaaaaat 139441 gttaaaaaaa aaaaaaaaga aattactttt ccaccaataa ttagccatgt gaccttggaa 139501 aacttaatta actttattag cagcctcagt tagttgccta cccaacagtg atttcccctt 139561 ccctccttct tccaaagaga ggtctaatgt tatttggcta ttcaaccttc cgccacgtga 139621 ccaagggtag atgacccatc ccggctccaa agatagatcc tcattatgtt tggtcccttg 139681 tggttacact attccatttg ccaggaatgt tttaagaagg gaaatgcata tcagattaag 139741 tttggttgca taactacaaa aaagagcaag aaataaaaat aacagtggtc ttaaataaga 139801 tgcaaatata attccctggg gaattatatt ttatatactg gagaaaagaa gacaggtgtt 139861 aggcaacagg gggcaccctg ttcaggagta gatagatgca tcagggatga ggctctcaac 139921 tagcttactt ttctgccatt ttgctgtgct gtctcaggtc cggcactgct gctaggagcc 139981 ctggacaact tcagccaaca ggaagccctt actgcagggg acttagtcac atgatgacag 140041 ttagctatac aggaggtagt gaaaggaagg tgggggaaat gcaaggagaa ggggagacaa 140101 ctagtagtca 
ttgtcacagc atgtggccca attcccgcct atgaaatatg aagcttccgg 140161 gaaagtctcc tgctatgaaa gagaaacaaa aagagataaa ctgtacttct ccctctgatc 140221 agtgtcacat ctcaatgtga tgtctgaaac tgatgcagcc atcttggtac cagcctgaga 140281 gcaaagccaa caatacacag agcaggacag agcctgcaga gagaaggacc caacatcctg 140341 acattctgta tggtacccag atcgtaggcc atgctggact tcttgtcatc ttagatggaa 140401 atttcattac tgtttatttc caatggaggg ggagtcttct gttacttgca gacaaatgtc 140461 ttcaaattca ttcgacctat ccagtaggcc ttacaaattt aaagacagtc atttctccct 140521 cttggtttgt attttcacaa gctaaaaatt ctagagacat agaagatcaa tcaattctta 140581 ttttaatcac tattatcaaa atatatcttg attaatgcct tagcaatatt aagtaacgta 140641 tatgcctata gtgatgtgtt attgtcatgt agaccaaaat agaaaagcag aacaaaaaga 140701 taaaattcat taaatacgtc atattcacgt tcattaaaaa ataattgttc atttatcttt 140761 tggcttattg ttttataaaa tgaatgcaga aaaccataaa ataattcagg tattggggtt 140821 ccaattcctg gccttaccac ttgcattgtg atctaaggaa agcaatttca tctgtgagag 140881 ccatagtttt ttatacatta aaaaatggga gaggaatacc actggtttta gtcagccttg 140941 gctgccaaaa caaaatatca tttgcttttc acagttctgg aggctggaca tctgagatca 141001 tggtgccagc agagttcagt tctgatgagg gatctcctcc tggcttgcag acagccacct 141061 tctcactgtg ttctcacatg gcagggaaag agcaaggtct ggtctcttcc tcttcttata 141121 agggcactaa tcccatcatg ggagccccac tctcatgaat tcagctaaac ctaatcactc 141181 cccaaaggcc ctacctgcaa atagcatcac actggctgta gggctaggct tcaacatagg 141241 aattttggtg ggacgcaaac attcagtcca taaaaccatc tcttcttaag aatgtgtgaa 141301 ggttgaagaa atgtgtaagt ggcataatgg cagttatata gtactcatta actggcaact 141361 atgacatcat gggcagccag tgaggataca aagacaaagt tagtagatct gattcaatat 141421 ttcattcatt cgtttataca tacatacatt cagattccac ctagttccag aaggcattga 141481 aagcagaagg aaagcaaata taaaaattat tgcaatgttt taagttatag aataaattgt 141541 atctatcaag aaattatggg tagatagaga gatagcagaa accaagagac atcagatgtc 141601 atgaaggagc agaagccatg agggaaagag aagaaaagtc aacgaaaggt atttgagagt 141661 agactagagc gacgtgggta agtgagggag gtggggagaa agagagaaag ctagttagca 141721 tcaagtctac agcctggtaa atgtcgaatg 
ttactagagt catactttgc acagggatga 141781 gtttcctttg agtcttgcat ggggtggtgt tcagttttca caagggctaa gatgtttttc 141841 cccttttaag ttctttcctg tgtaacttta caatggatat ctcctacctg aaataaccaa 141901 gtgtgtctct gttccttgca acatggaaga acataaaaca aagtagtagc ctaagccaca 141961 gaattgataa aagggtgaaa gaaatgtgtc tacagtgaga ggagaagagg cctaagatgg 142021 aagtatgttt aaggagcagg tgaggaataa catgccttca aaggagagca agacatatta 142081 ctcccagagg taggaggaga aataatagag ttatggaaaa gggaagaaaa gacgttgtca 142141 ggaaaaaaca gctggattca gaggttccag aggtaacatg caataaggac tcactctcct 142201 gaattcgcag caaagttgcc tataaccctg aagacagaag actcaaagga atgatggggt 142261 agaagccaga tagcagagag ttcaggagta gatagatgag catgtgaact agctttgagt 142321 gtaaactatt cttatgagag atttagttgc aaaaaagaga tggctttcag aagaggtttg 142381 ctcaaggaag ggttttgaca tgacaagatg actgagagtg tagaagttga agctggggaa 142441 gagttaacat gatgctatct ggagtgaccc tgatgaagca gaactgcatg ggatttggac 142501 aggatgacag atgcctttgc ctctgagatg ggaggaaatg gggcctgtgc tgcccatact 142561 tgaatttcat ttgtgatgaa tctttgacta caaggacaaa ttgacccagt actttcatgt 142621 ggaactagct ataacaaagg atatttaaag ttttaagtga gcttgtttta acctcacaaa 142681 gtattatata taaataatat ggtcatttaa aatttcccaa ttattgttat ttgttctgga 142741 tcttaagctc atatcataag aacattgatg gggctaagtt tctggaagtg acagatcttg 142801 tgcttgtggc tggctaatga atcacccagg aagtttgaga gaccaggatg ttacctccag 142861 attgtgggac ttccaggctt ccgccctatc aggcaggcac atctgcttct ccagatgaag 142921 gaaacaaaac cagaatagaa gagtcagttg gcaaagcaac aaggtcaatt cttgccctct 142981 ccttcctggt ttcctgcact gacctggaac tatctacgtg aactgccttc aaccaattct 143041 gcactctgca aaaagaatta aagcaggaag gactacactg ttctcctctt tcagcatccc 143101 tgaaatgccg tctgactcat ttactaaacc cacaagtttt gactgatcta tactgtgcat 143161 tagactctgt cctaggcact ggagaaagca tggtgatcgt aaaatagttt tcctatgtag 143221 cacagattgc cgtgggtctt gcatgaggac taattgttaa taaggattct tgttgtttat 143281 ctaaacggat gcatctgtaa atcttgcttg agcatattgc attactgttt acaaaagggt 143341 aaggttgctg tttatttttc ggcatggaag atgcacacat tgtacttttc 
atttacaaga 143401 tacagctcag ggtgaacaaa gccctaggat aaatactctt ttctaaaaca cacaaaagat 143461 gataaactcc aaagcttaca tgtgttctaa ttttcattat ggcccatttg gcccagggtt 143521 ccatatctgt cagaagatcc ccgtgacata ttagggaggg catggttttg gctatcatct 143581 aaaatagcat tcttattcct cctccaattc taacttccca acctgcctta attatcacat 143641 ttggattcat cttccatgaa ttaaaatatt agactatatc acttcttaag atacaggata 143701 gaaatagaac caggatgctt gcaatttatg acttccatga agttaactct aattttatgt 143761 ttcctaatat tgctgttgtc aagcttaaag ggtattacct agtcctgtta agttctattg 143821 tgtcttgtcc atctcatatc tctgtagctt ataaagtacc tgacaaagaa ggaaagagct 143881 tgataaatat ttgtggaata aaataattta atcaataact aattgtcacc cttttaaatt 143941 tgtagacttt aaatttcctg tctctctggt ttcatgtttt acagtctaaa aaatcctgat 144001 attttggtag attctcacca agtaagactt aggtgcccat atgtcccagt ttgcttagtg 144061 cagttccagt tcatgcctaa tgtgccaggg taattattcc tagtgatctt ttccctttca 144121 aaagtatccc agtttgaatg accctctaat cactattctc ccgggtccct tgagcccctc 144181 tgtaaaattc tacatgtgca tttcagaagc tggagaaagc tttcttttta caggacattg 144241 tggtaggttt tttgtttgtt tgtttgtttg tttgtttttt tgcctactcc ccttcctgat 144301 gataaagtat ctctacggac tttcagacta ctgggcaggg gttgtctagg aaaagcctag 144361 ggccatgctt ttcaaatttt agtgtgcttt aatgtatcac tgggggcttc tgtgaaaatg 144421 tagatcctat tcagcaggtc tggaatgggg ctcctgatac tgttgggctg aggacagcac 144481 tttgagtaac aagggtctcc aagttttcct agttagccat gacttagagc ccagctatta 144541 tgacataaca ttgcttagtg ctcaacagga tactctttag aagccacaga gctgaggtca 144601 aaccttgccc tcatcactca tacctttgtg accttacaca atgtattgat ccctctaaaa 144661 ctcagttttc ttatatatga gatacaaaga ataacaatta tctcagagga atgatgtgaa 144721 gattaaatga aataataaat ataaaacagt actatcttcc ctcattgtaa atgttcaaaa 144781 ctggtggtag ctgaaattgt catcatcaca gttatcaagt gtagtttgaa ttcctccccc 144841 gttttctctg cccagttctt ccctgccatt tcccccgttt tactcaaagc ttaacatcag 144901 aagttacttg tttagataaa tatcaaaggg taatgagcag agatatattt tggatagtgc 144961 cttaaaaatg gacactggga tatgctaaac tatcaagttg tcagtacatg caggaattac 145021 aatcaaggtt 
tggacagatc tacaggaagg gccaaaagtt ggcaattaat gccaattcat 145081 agatgcttaa ttatcacttg actggattaa attccctggg ctgaggctgc ctcctttagt 145141 tagcatgcgt caatagcagc agtcagaagc aattcaattg tatatactct tctaaaaccc 145201 agaattacca gaagaaagag aatgaatatt agaaaaatag aaaagtttaa tagaccaact 145261 catcaaccta gaccttgttt taaactttgc cttctctttt tcctaattgg gtatcaggta 145321 gcagaattcc aagtattcca gtttcactta ctttatttta gcctttcaaa agtcacataa 145381 tctaaacctt cgttttaact catggaaagt gataggactg aaacagcatt ttgtaaataa 145441 caacttacta tttaaaccta tagcgtcaac atcattgttg cagtcatcaa aattgttatt 145501 ctcttcctca tcttcaaggt cccttctggc tataaacatt tctctttcct ctgatgtcag 145561 tcctaatgta ggacccatgt tctttctctc tgggtagagt caccagtgaa agaataaaag 145621 tgatatcaga aacttggata attttcttac tagaaaatct ctacttagtc attgtaaaac 145681 tggatgttga gtttgaagtg acaatgtgct ataggtttac ctcctacaat tcaaagctca 145741 ctatagacta taattttcca ttgattaatt cattgactta ttcattgtac acttgcttag 145801 accatagtac ctatggcatg aagaacataa taaaatgtct ctccctagtg agaactcaca 145861 gaactaccag ggcatgcttc ctgaggccca tcacactcta caggtcaatc atgattccat 145921 cttgagttaa ctattcttga cacttacatg gcaggtcatt ttcaaagtat gcttacaaac 145981 aacaacaaca acaacaacaa caacaacatg ttcttaacaa agattggccc aactatcaca 146041 atgactggag ggcctactta tagaaatagt aggaataact aatttatatt ttccaccagg 146101 gttgcactga gcactgatag gtctcacttg atggataatt ctcaaaatcc cttaagaaga 146161 tttccatgca gaaagaagag aaatcaccag catattttca ttatctgact ggagggatgg 146221 ctatttccta gagaacagca accaggatat gtgcaacatt aaaattaaca tggtccctta 146281 agtgtttccc tcaccagatt ggttctacaa gcacacagct atttatcctc taagcaaggg 146341 tcagcaaact ataatccatg gaccaaatcc agcccatcac ctgttattgt aaataaagtt 146401 ttattggaac cagccatgca cattcatttg tgtattgtct atgactgttt ttgtactcaa 146461 atggcagagt tgtgactgag atcttatggt cagaaaagcc taaaatattt actatctggt 146521 cttaactaaa agaccatgct actttaccaa aaacaacaac aacaaaacca ttaagcttgc 146581 attgacattt cagtgattta gtttgaagaa ttcatagata tctttaataa ttttaaatta 146641 ctataatcct taatagaggt aattaaaagc 
tacaatggag agatttggaa taaaaatttc 146701 tatgcaaata tccctttggt atatggttaa tcactaacaa attaatcaaa tattatttat 146761 gagaaagata atgcctagtt ttatatattg gaaacttaat gaaaaagata acaagtaaac 146821 atattcaaca gctctgattt ctgccagtat atcataaaat aaatactagt acttgagctc 146881 atgctcatat cctaatatct acataataat taggaataga tgagttgatc cacgactctg 146941 ctgtaagtac tacaggaaat gtctctgtaa aagtatatta tcattttaat ctagatttaa 147001 tgtatctcat aagcttttaa caatgtatat gtgtaatcat ttgagaatct agaacccaaa 147061 tgaaaaaata tatatttaaa tggagaagag ggttatttct gtgtcctgct ttagttaatt 147121 ttaagacata aatatctaaa gttgataaaa caaaagggga aacaagccca tctgcagaaa 147181 taattttgca gtttatatcc aagctacttt tgaaatatag aacctgcttt ctcctttcaa 147241 agaccctcaa gtatttgctg tagctttatt attaaaatat ctagatgtgt cattataatc 147301 agatggaaaa ttcagcaaaa ttgtgttttc ttagtaaata aagtatatac tgactatatg 147361 tttatatatt aactttttcc cacatgaaac cagtatttaa agctaaacat ttttctcact 147421 acagatactt cttgactaat gagtcagtga ttaaatatat accaagtgaa tattaataaa 147481 gaatggattt tttaaaaatc atgttcactt tgggatattg atctcatatt cctcatgcta 147541 tgtgaattgt acagaatggt taagaaatta ttgttgactg gcacggtggc tcacgcctgt 147601 aatcccagca ctttgggagg ccgaggaggg tggattacga ggtcaggaga tggagaccat 147661 cctggctaac acggtgaaac cctgcctcta ctaaaaatat agaaaattag ccaggcatgg 147721 tggcacgcgc ctgtagtccc agctactcgg gaagctgagg caggagaatg gcctgaacct 147781 aggaggcgga gcttgcagtg agccaagatc gcgccactgt actgcagcct gggtgacaga 147841 gcgagactcc atctcaaaaa aaaaaaaaaa aaaaaaaaaa gaaaagaaaa gaaacgaaat 147901 tattgttaac ttaattggga caatgattta caattctact gatgctgaaa ggttaaggaa 147961 agactttgga aattcaaaag ccaaccacaa ttaatattgc tttattcttt tctttcttat 148021 caggtacaaa aaatttggaa tgtttataga atgcttgcca tatgtaaata ttgaaataaa 148081 aattattcat acaaatccct aaccatgaaa ctaaagattc taataattct aaatttttaa 148141 atatgaatat tttctatttt tttcacacac acacacaaac acacacacac acacacacac 148201 acacatatat atatatactt ttaaaattag gatgctaaat ctctgcccat ctgtgtcagc 148261 cttttaaaga gcttgtgatt cccaatcttc aatgccagga tacacagatg 
ttagtaggtg 148321 agacaacaga ccgacaaatt cctgcttggt aagtaaaata aaaagtaaaa tgtaaaacag 148381 gagatcttgt tttctctact ttggggaaaa taaatcaaaa caaaacaaaa tcatcaccag 148441 ccctctcaag ttcataaaaa agacattgtt taaaaaaaaa agagagagag aaaaaagcaa 148501 atgatagtaa tatttatcct gaagcaaggt ggctaaaatt tgacccaagt acttcctaac 148561 tactgaaaat taaatcattg cttttgtgga tttgaaaaac agaaggtctg catttcaagc 148621 atataaataa catgaaataa ttgtgtcttc actttgaaag ggctatttac caaaagaatg 148681 tgacattctg aggggcacac tttggggatg aaatttaagc aaaggtcaca tctcaaattc 148741 aaccagcctt tttcacctgg gctgcaatgg aatggaatga ttgaacaagt gtgaactcct 148801 aggcagggtg ggcataggtg gagttagtgg gtaaatttga tttgtttgct aaggttagtg 148861 aaataaattt atctcctttg gagtgggatt ccaggtgcta aaaaataaaa catggtgttt 148921 gtagccacat ttctttcctt cattttcaaa atttatccaa atgagattaa gttgattggt 148981 actctacaga gatagcagcc acctaccatt ctgattaata tttcagtgag gaggcaatgt 149041 cttattattg agaccaagca attatcagcc ttttgactat ataacataaa ataaaataat 149101 caagaccaat atcccatgga aattatctga agatacaaac acaagcctga ggagacatcc 149161 ctatagctaa gcctgttgag gtaaaggact ggagtttgaa ggggctcttc ctccttcaca 149221 gaagcactat ggcctagcta ccctctccgt ggtagtaata tggagtgggc ccaagagaga 149281 gcagctttaa aattcaaaac aagatcctgg aggactgtgg ttgacttggc caggtactta 149341 ggagccacag ttttgctcgt aaagaggctt tgagtagtca ctccctctca ggtgggctct 149401 agggtggaga ttccatcttt gtggtccttg ggacatctca aacaacagag atgtggaagc 149461 tataaagaaa acctctgagg aagactattc actgggacag ggcagatatt acatatttcc 149521 agcatcacaa aattttctat tggacaatgc tgtatatcta ggtgctataa catttaaagt 149581 gttagctttt taaaatttat ctgcaaccat ttcttctctt aattcctcag aagtaggcaa 149641 tctaatacca cctctggagc tactatttca atgaataatc atatgtttgc ttagccatat 149701 tggaaaatca aataaatctg aaagagtatg ttgtccctgg aatgtgctaa tttctagtta 149761 ctcactgcta tgcgtaagaa gttaaactaa aacctgttct tcacattaac agacaaaaag 149821 ggctttcttt gtctgactat ctaaattgac ttcccacaca ggaacctaag tgatagcaag 149881 ccgttaacaa tggttgactc atcctttaaa tatcaggtct cctttatgtc atatttctga 149941 ttgtcacctt 
ttgagaaagc ttccctacca tcctactgac tcccctccat cattctatcc 150001 ccttatcttc tgtattacca cctggcatat tatatactta ttggcttgtc agtctcccta 150061 ttctacagca taatctcttt aaaagaaagt attgtatttt attcactgat gtattcctaa 150121 tgcctagaac agtatctctg tttttgttca ctttatatat gcttagtaaa tacttataga 150181 atgaatttat gaatgaaacc tccagaaagc agatacaaaa ttttcttcat atacattcct 150241 ctggagagca aatgtataac ttagatctta ttctgaacag atttgttctg gggcttaaaa 150301 agactaagaa cctgagtttt acacacatta tgccatttca ttatcagcaa catcattctc 150361 tgcaatgtcc atacatgaaa gttctgcaac tgatcaaatg cttactttat tagaatcata 150421 caaaataaat ataagttgaa aaagcaatga catgaaacat ttatatttga agggcttgtt 150481 gttgttgttg ctgtaattgt aagaaaattt acaccaaaaa gaacttcatc tatcctaggt 150541 aatttagatc actccaaatt ctattctgat ctttcttatt aaacatacct ctatatttac 150601 gcatgtgtaa tcagtaatca tgtaactgtt gtctttgaga tttccttaag agattttagc 150661 aggagctcca aaaggttcaa gtttggctgc agaacatggg gcatttttat gtagatgtgg 150721 attgtgattc tcagtaccaa tggcttctcc ctgctatgtc tgctttcccg cctcccattt 150781 gtcaagatat aaactgcaca gtcagggctg aaggagctac aaaagttaaa caatgtgtgt 150841 aaatgaagat aagaattctg catgagagca agggttagaa gaggaagcta agaaggacag 150901 aaaaagagat tgactcaatg ttgtgtaaaa atagagacca agagggctca aaggaaggtg 150961 ataagaaaat taaattagaa aaagccagga agataaagga agttcaaaat atccataaat 151021 agccaaatga ataagaaata caaagaaaat actaagaagc agtaaaggat agtaacaata 151081 agggaaaaaa gaaatgtgta gcaagggaaa gaaatagtaa aaaaggataa acagtgatac 151141 agcatgaaaa gagaaaatga aaaaagactg taaatacaga agagagaaaa taacaagatg 151201 atcatgtgca catacaaaca cagtgagatg cgataaggag agaaataaca gaaactcatc 151261 tattggcaga aagatggcct cccaaagatg tccacacacc catccctgga acctgtgaat 151321 attttgactg acatggtaat aggaacactg cagatatgat taagtgtata gaccttgaga 151381 aggagaaatt atcctggatt atccaggtaa aaccaatcta atcccatgaa cctttaaaag 151441 tgagactctt ccagttggag tctgtcaaag ggaaatggaa gaaaagcaca aaggcacagg 151501 gagatgcaac actgttgact ttcaagatgg aagaagggca caaaagccaa tgaatgaggg 151561 cagcttctat taataggagc tcaaaaggca 
gggatctgcc cctggagcct cccaaaggaa 151621 ccctgctgac accttgattt tagcctagtg agatctgtgt cagaattcta gtctacagaa 151681 ctagaatata atacttttgt agtattttaa gacactaagt ttgtggtcat ttattattgc 151741 atcaatagaa gactaatgct tatgagatca caaattatta cagcaaaatg cttattttgt 151801 gcagcacaat tgttaaaatc atgcttggtt gtcattttac agcatacaaa gccctttccc 151861 atatattgtc aggtctgatc attctaagac ctgtggaata taagcaaaga ccatcaattc 151921 tacccctcct gatgagacaa ggggggccca gagaggtacc atgtccaagg tcttggtact 151981 aaatgttaca attaaggaca gaatgagctc tccaagacct cagtttgctt ccatagtaaa 152041 accatgattg ccagcaagac attcacatct aataaactca agagcacaat taaagtaatt 152101 tgcctggttc aatactctgg tgctttgatg gcctgcagag caaaattctt ccagtgtgag 152161 tcccatacct catgcttcaa cttgcacact ccaaatgatc aatccctctg ttaaatggaa 152221 tgggattcat tgtttccttg ttgttctgct ttcccagaaa ggtaagctct ttttccaacc 152281 agtgcccagc atagtgcttg gcacaaaatg gatcccaaag ttttgtgtta atttaattga 152341 atcaagaaca cagaaacgtg atgctaaatg aaagaagcca gtcacaaagg accacatctt 152401 gtatgatttc attcctatga aatgcctaga ataggcaaat ctatagaggc agaaagcaga 152461 ttagtgtttg caagggctgg tgggtgatgg ggaaaatagg gtgtgactgc taatgggtat 152521 ggagtttcct ttggagtgat gaaaatattt taaaatttat tgtggtaatg gatgcacaac 152581 tagaggaatg tactaaaagc cgttgagttg tacactttaa atcagttaat tgtatggtat 152641 gtgaatcaga gctcaataat gctattaaat tttttaaaaa gattatatca atttcctcat 152701 ttgccccata ttgtgcttaa aatccagaag acttatttca atgtcacctt tgattagtcc 152761 cctcgtgagc gggattttag tagtagaatt tgctaccatt cattcgttgc caacattcaa 152821 atgaggctga gggtttacga cagtgagaga gataggtcca ggacagaaaa agtaagaagg 152881 atggcaactg gaagggacag ttgagaagtg gagccaagga actgatctgt gggcccaaac 152941 acaggacata gatctcagat ctaggaaaag agggtttctg gccaagatga aagggactgt 153001 tttcatgatg gatacaagca gcacgtcatt gttactcact gacagtctgg gccaggatgg 153061 ggaagaaggg ctcttccctg catctacatg gagtaccact ggcatggaca cagtggtggc 153121 tctaggctgt gtggcacatg tgggcacccc aatggttcgc tgcgtatagt gtcatctgag 153181 aaagaagagg aggatgctac ctgtacttga caggatgagt ttgggtattc 
aaatgaaaac 153241 agccccctcc agggactatg gaacataact ggtggggatt ttcattgttt ctttgtttaa 153301 cagctttatt gatatatatt tcacatatca taaaattcaa caatttcaag catgcaattc 153361 aatgacattt agaaaattta ccaagtggtg ctaccatcac cataaaccag ttttagaaca 153421 ctttcttcat cccaataaga tccttcatgg acatggttac ttttaataag aactctctca 153481 ttaaatatca agtgtgatca ggagagaaac atgggtttca tcaatagata actggaatat 153541 tttttctttc tcaaaaattc ttccataact atcctttcca ttcccgaaga ccatcctctg 153601 cttcagagct tcatgactcc agttccataa tactacaaaa gtctcacagg taatatgtct 153661 agggacagcc aaaccattcc ttctaactca tggttgtatt taccttccta aaatccctgt 153721 tccatcattc cattatctct tcctattgca tcacttacaa acacttaaga cttgggtacc 153781 attattgttt ggccactaat cactttgtct tagttgaaga cctggcacca gattatttct 153841 atggcacttt cacacaatac tatgattctt ttggctggct tttaaactcc tctataattt 153901 gagtcagcac tctctatcca gttttatttt tgattatttc tcaataagcg tgttctactt 153961 cactcaagcc aactttctcg tgatccaaca tgttgataga cccttctctt cacggtttta 154021 tcaatactat tatctttact tggaaaattc ttttcttctc tactcatgtg tatcagtcag 154081 ctattgctgt gtaataacct actcggaagc tggagagtgg ttgatcctga atggactagc 154141 tcatgcatct gcagggggct gtgctctagg ctgggcttgg gtggagttgc ttagctatgc 154201 agctctacgc cacatgtctc tcattcctct ccagcaatcc ggctcagaca tgtccttctc 154261 atggccatgg cagaagcaca agtggacaag tgtgaacaag aaaggtctct tgaggtctag 154321 gattacaaca ggtatgctat cacttctcca tgtccctatt ggccaaagcc agtcaaaagg 154381 ctgactccag agtcagagtt agagggcact gtcatgttac ctaaagagca tgtatacagg 154441 gatgagtgga acactgggtc catcaataca tgctaccaca ctatccaaat ccttcagatg 154501 gatttcttgg tcactgttgt atttggcata ggaaaagaaa gtattccaga aactactgga 154561 acctttcaaa gatgacaatg agggatctac aaacaaatta ggttttgaag agctggaata 154621 ctatgaactc acataaaaat aagtatcatc aggtttaaca aacaaaatcc tttaataaca 154681 aatagcaata tttatatctt cagtccctga aaaagtatga tagaaatggc tccacttatt 154741 catgtgtctg ttgtttaggg cttgaactgt ccagaacata atctttaatc tggaagtaac 154801 cccaaagctt tctcatgaaa tttcttcttg agatagggga gctctgagat catcctctgg 154861 ccagagttga 
ttaaggtact taatgaactt gattgcctgg tttcctgtgt cagccttaaa 154921 ttctactctt agttatagcc aagcagtagg aatctatgag gtgggatgga ggacagtgac 154981 gtacaaacta cctaaacaca gatgttttat ctgacatttt tatcctttac aatacatcct 155041 tgcctataat gtatgcccca caaaaatcac ataaaatacc aagtgatgag caattattga 155101 agagagaccc atataaaaat aattatttca ctggctctca gaagacaatt tgtgattaga 155161 aaaatgcaca aatagagaag attacttatt tgggccagta aatacgaaac tcaatgtaat 155221 ttctaattta gaattctgtg gctctccaaa atgggtttta cctaattcaa tttaaggttt 155281 aattttagat aatcaaaccc caaaccaagc ctgtgtatat gttgacatct tttgaaccta 155341 gactaagtct ccgagacttc ccttcttaag tagagaaaat aagagaaaaa gaaccttcct 155401 agattttcag ctgggtgtgg ttcctggatt tgtaattgca ctgatttccc catgagtcac 155461 atttgttcct agtgtacgcc agactttccc cttttagatg cttgaggaga tacttctatg 155521 cgacaacgga acactgaaag tggtgaggaa accatacgat tgtgtctagc tcaatagaca 155581 cagaatgatc tgttgtttta gagcagagct tctctccctt gagtgtgcaa gaagaacttg 155641 ggggcttgta gaaacacacg ttcctgagct ccaaccatgg agtttctggt tcactgggtc 155701 tgggacaggg ctcgtgattc tgcatttcta acaagctccc acgtgttgca gatgctgctg 155761 ttccgtggaa tacattttga gtagcactgg ttaggagtac agtttctcag ctttggcact 155821 actgacatct ggaccaagta attctctgtt gtgggggctg cactgtgcat tacaggatgt 155881 tgaggagcat ccttggcctc taccgactag tagcacctct ccaggtatga caaccaaaaa 155941 tgtttccaga cattgccaag tgtcctctgg gagtcaaatc acccccattg agaactactg 156001 gcttaaagga tgggttctga ataaaagaga catgagctga aatctgccct aaccatttga 156061 ctacgcatgt ggttcagggg agcagtgtcc acatggctaa agcttcggta tgagtatgcc 156121 tccctcacag agttgttgta gggacaggtt aagtgaggat gtatttaatc aagtggcatg 156181 gggcctggcc ccaaagcagc agtccatagc ggttcctgag tattgtatca aaaaatcttt 156241 ctatgaggca atatcatcca aactctgttt ggaaggtaag cagagttacc tacctagaag 156301 ggtaagtttc cgtctactta gttcaagtgt gtgcgttata tttagaattc tagagtcttt 156361 tctcagttca gaggcaataa caataaaata aacccaatat ctatctctat gatcatgaag 156421 cttaaaatct agttccatgg ctaatacacc gagcatattt ttatgatcct gaaataatct 156481 ttatttgaaa catagatgtg tttttcaagt 
ggtttctgaa gcaaaatgct ttgtcagttt 156541 gtcaaatccc tttcatccca acgaccatag aagatataaa ccagttgttg tccaaagaac 156601 aaaagtaacc aagggggctg agataaaaat ctcaggcaac actggtgaaa agggtaaagc 156661 tgtatttaca tatctcgtac cctatcaata cactcccata aggagcacta gtgttagttt 156721 ttccttttgc caccaggggg agaaccaaac tttctcacag agtcacacat atttgtcaga 156781 cgtgcgttac agaggctggc agaccagggt tgaacaaagc ttttagttac ttcctaccag 156841 aataaaaata tctgctggca tgaattcacg agagaaggca gtacgaacca cttttgtata 156901 caaggtcttt taagcttttt tattccaaac actgcagcaa caactatttt tgatcctggg 156961 agtcaagaga cgacaaagtt aactatctct gggtcttctt actttcttta catttaagat 157021 gaggcattag gaataagaca agtaacatta atgttacata agacataaat gtacatttgg 157081 ctccctttag cttcctcaag cctagtatta aaggatgatg acaatggtga tgatgacagt 157141 ggtggtgatt gtaatgttgt tggttgtggt agtgatggca gtaatgatga caaagatgat 157201 aatggtgatg gaaattttag ctaaacataa tgaacaccta ccaccatagg atagatcctg 157261 ggctaagcaa cttatattag tagctaatta aactctagga ggaaaacatg gagctttctt 157321 catttactta ttaaaatttt aaagttagag gaagtttggg catcctgatc actcccagaa 157381 acctttgaac cccccaccac aggaaaagat aaataagtaa atagatagat ataaaggtta 157441 ttggtaattg ttatattaat aataattaaa taaagagtgt ttttatgctc agatacaatt 157501 ccacagagct gtatttgtga aggctggctc ttcatgtgat aaactattgg cctattttca 157561 ttgttgtccc aataaaataa ggcaagaata cacatagtat atggatcctg ccatgtactc 157621 agaatttttt cttattacaa tccactcttt ataattttgt ctatcgttgc aactttttta 157681 caatgagtgc atgttacata taaaataata atgcagcttc gtcatcttag aataaacaaa 157741 aggtgtagat cccccaaaat tacccagaat aaatatcacc tgcatgtctg agtctgaaac 157801 atgggcactt gtgttgatta aactgtctcc taaatcacta cagattcact gagtttttgc 157861 ccaaacaaat caaagtacca aggaaaccaa aaaacaacaa caaaattaca aatgaaaatt 157921 catctgtaat gaaagtcaga aacaacgagg tcaaagcagc ttatacggac atcaaatttc 157981 acttggcttt gtctgttaga aatttacctc aatggagaag tggatgtggc atctctagtc 158041 acatttttag tgatgataat tattatttgg tggatgaata cacaaactcc acattattta 158101 ttagatacgt tacttcctct tggctaaaat cactttgggg cttaagccaa 
agccttccct 158161 acataaatga catcaatcag attggcattt accagaacat tccaaaatca ataaggcgtg 158221 tatttttaag agcctttttt gctatcttta gatcggcaaa tgaaggagtt ctgaagaacc 158281 aggagtggga gttccacaaa taaacagcat gcttcatggt tcattgtgga gaaaaatctg 158341 gaaactctca aggggatttt attgctgatc tgaagtgact tgacgaaact caggatccca 158401 aatgagggaa atcctttgat agtaagagtt tgcctaatat cttctcagag tgaattctaa 158461 aattaagagt taggtctaag attaagtcag aaaacactgc aaaagggact gccttcttaa 158521 gatatgcccc cttctctctc ccatcacttt ccctacaaaa caacccagga tgagggcaaa 158581 actcactaaa ataacattta tgtgggtttt atttttgagc cttggcctgt ggtctctgcc 158641 attggtggct gtactgttct gactcaattt ctgccccata tcctaaatct atagtctcag 158701 gtacaaatta tcagcccata tgaggactga gagggtgacc tgctcgtaca tttctacctc 158761 tttttaaaag gcaacaaaat acctacttcc tctcatttca gaggaaaatc cactttccac 158821 ttcatgtata aaactcagtt ggctataact tatgaagaaa taagctaagg agtgatgact 158881 cttcagaaaa tatgagtact ggagaacgaa gaaattatca aactataaaa ggaaatagag 158941 tttgtgcctg gaaaagacaa cagggtgctt ccccttgcct ctcagaataa actagaaaat 159001 aaatgaagtc tatacacaca gttttcctga ttctcagttg aaatttgaaa ggaaagtaaa 159061 taaaagagtg acagaatttt aatgcttgaa aaaaacacaa agaacaccat attgatttat 159121 agaaaaaaag aataaggata aaaaagtatc tacaggatgg acaatgggga tgcacaaaaa 159181 ccatgtgaga gaattaagaa tttcaatttc agataagatc atggatccaa gatctaccta 159241 ctatgaatgc tcttgtaaaa caacacgatg tgcccccggg tagccaagat tttggtctct 159301 aaaactttca actaaaaagt gaaagacaca gggatccttg aaaagtagtt gattccatgt 159361 cttgtggcag gaaaaaccag gatgagtctg gaacactttg ttgttaccaa gaagcttgaa 159421 tactttcaag ggtggtgcta actggacaaa tggaccagct taatgcagtt gtcactaatg 159481 aaagcatgac agtttaatca acaacaaaaa actctgtgaa aagtcatgag aacacatata 159541 ctcaaaaaca acaaataaca taatctgctc aagaaagatg tgcatgactt cccccaggaa 159601 tatttactgg aaatacttca cattgatgaa cccaaacaca tgcttgtcaa attcaagttt 159661 tgctaagttc caggaggtca aagtagacaa tacaatgttg catagggtgt ttcaaattgg 159721 gtagagaatg cagaatgatg gttctcaacc ttggctacac aggaaaatca cttgagaaaa 159781 tttaaaaaat 
atcaatgact gggcactatc cccagacaaa ttaaatcaga atctctggag 159841 gtggggcttg gggcctggta tgtttgaaaa gttccccaga ccattctagt gggggtggag 159901 aaccactggc ctagaatcta ggataaccag tcatcctgga cactttcctg gttttagcgc 159961 tgaaaagttc ccatgtccca gcaaaccctc cagtcctgga aaaacaagga tgattggtca 160021 ccctctccaa tcaggcaata aaaagttgtc acagaaattt tcaataaaaa ttacataaga 160081 aaacactgta agaaaattaa gtataaatga ccatgattca ccagactaca agaaaaggtt 160141 gaagcttaga aattcttact actgtataaa tgactatagc acaataaagg cccagtggga 160201 ttgaggaatc agaatggcaa accctagttc ttggtgtctt ggtccaaaat ctctgactga 160261 aaacccccaa tttataaatt tgaagtcaca ttctcaacca ataactgtat ctatcatttc 160321 tgttttcctt atttgggtca gtaaccttgc aactctgaga agtatactaa tactaatata 160381 attctaagac caaggctaat tctatgctaa tagtgctggt gctggtacta ataatgatga 160441 tagctctagc attaacacta ttatgaatac taatgataat gctgctacca gtattaatac 160501 taatcaggta ctagtagtaa tattgataaa atacatgggg caaatgaaga aattaatcaa 160561 actttggtat taatacaatt gagttgagtg taaaaataac ttctggaatc aacatgttag 160621 gatatgcaag ttttatttgt tctttgaggt gtttgaattg ttttggagaa tctgacatat 160681 cgaacttatg attttaacta aaatacttgg gacttaatga tttaagtttg caatctctgt 160741 ttaagtgttt gaggaaatct gatgagttaa atctagaagt ttaaattttg taaggaatat 160801 ttagatgctc agtaaaaaat taagtgctag atttgaaagt ctcactagct ggtatttaag 160861 ggatcaaaag aaaggtgttg tgttttgttt tgtttttaaa cattaatgca aaaatgataa 160921 aaaaatgaga tatataaatt gggcttaaag gcttaaaaaa ggtgttttta aaagactagc 160981 ttgatatact agtattacac aaagtgagag tcaacgttaa tattcttgta caagtgaacc 161041 cacaagcaaa agctaaggct ttaaaaaaat acaagtggtg tgatgtttct ggaacagctg 161101 tggaagtgtg atctcagaga acacaagaag acagcaaaga caattcctga gaaattagaa 161161 gggaaaggac agagcaggca cagcaaatct agcctcaagt ctcagtgaaa aggcccaaag 161221 gggccaaaca ctttgctgag ttgtgggaag ctcccctgcc aaatctctgg aacgaacctg 161281 agatgccttt ttgcaatgac tcagatacta gaattgaaag aatgtgacgt aagttttagc 161341 aatggggcaa tttatgattt attttagaag aactaactga ggaaacaaga tgaaagttgc 161401 tggtagggct tagcagaaat gctctcaagc 
aagccaagga taaagatctt tgcaatatta 161461 cttagaagtc caaaggttgc tagagaccac ggagttcagg gagaggccag cagctacagg 161521 gctggaaggc aggtgagcct gtcttgtggg aaggggagag gtgaatctag acgtgacgag 161581 atgcctgagg cctaggatgg tttgcctggc ccagatccac cggccataat aggaacctgg 161641 gagcccaaca gacatgaggt gtcacattta aacgcttagg tggggcaaac agggccataa 161701 gaacatacat ttttaaataa gacacaaacc aaatacacgg gtctgaagtt ataaagcaca 161761 gaaaacttac tttcccagca tcttgaaaca cacctaattc acaaaacgtg agcatgtcat 161821 ctaaggatgg aaacatgaat aaaaaagagc acaaagaaaa gaaagaaaat taaactataa 161881 atagcagtaa ggaaaaagaa atagaggcaa gtgatggtgg taaacataga caggaactgc 161941 gtataatgat ttgctggcaa aggaagaaat acccgcttaa aaaccatata tattttttaa 162001 ggtggactga ggttaacagt tgaaataaag gcagcagttg aaaggaaggc agcagctgaa 162061 agaaggtcat ggattaagta gactgatcca gcagtttcca cgccaagtcc aagtctgaac 162121 aaaggaaaaa cacttggaaa aaatcatttg gttaaaaata ataacaaggg ataggcagaa 162181 gagtttgacc aggatgttgc agacaggaag aaagcaggct acagggaaat gtgggaaagt 162241 gatcaatgaa ggagaaaggc agcaaagccc tatctaccta catcagtggg tgtcaaccct 162301 gactgcactc tagaattacc tggggagctc tacaaaatac ctatgcacca ccctcgacca 162361 ctgagctgga atctctgggg cttggggatt gggtatgaag tttctaaagc tcccaggtga 162421 tctcaatttg cagctagggc tgagacccac tgctttgcat gcagagagag taaatgtgag 162481 gtcccatatc tctaaattat caggcccaaa cagccacttg cacctgaagc actccaataa 162541 ttgtagccaa agtcatttcc ttatttgaac ttcgcaggta aatgactaga tttttcactt 162601 gacttaagga gctttaaaat tcagaaggaa gaaaggaaga aattttttta ttatccttta 162661 agttttagag tacatgtgcg caacgtgcaa gtttgttaca tatgtataca tgtgccatgt 162721 tggtgtgctg cccccattaa ctagtcattt aacattaggt atatctccta atgctatccc 162781 tccccgcttc ccccacccca caacaggccc cggtgtgtga tgttcccctt cctgtgtcca 162841 tgtgttctta ttgttcaatt cccacctatg agtgagaaca tgcggtgttt ggttttttgt 162901 ccttgcgata gtttgctcag aatgatggtt tccagcttca tccatgtccc tacaaaggac 162961 atgaactcat tgttttttat ggctgcatag tatttcatgg tgtatatgtg ccacattttc 163021 ttaatccagt ctatcattgt tggacatttg ggttggttcc aagtctttgc 
tattgtgaat 163081 agtgctgcaa taaacatacg tgtgcatgtg tctttataac agcatgattt ataatccttt 163141 gggtatatac ccagtaatgc aatggctggg tcaaatggta tttcaagttc tagatccctg 163201 aggaatcaac acactgactt ccacaatggt tgaactagtt tacagtccca ccaacagtgt 163261 aaaagtgttc ctatttctcc acagcctctc cagcacctgt tgtttcctga ctttttaatg 163321 atcaccgttc taactggtgt gagatggtat ctcattgtgg ttttgatttg catttctctg 163381 atggccagtg atgatgagca ttttttcatg tgtcttttgg ctgcataaat gtcttctttt 163441 gagaagtgtc tgttcatatc cttcacccac ttgttgatgg ggttgtttgt ttttttcttg 163501 taaatttgtt tgtgttcatt gtagattctg gatattagcc ctttgtcaga tgagtagatt 163561 gcaaaaattt tctcccgttt tgtaggttgc ctgttcactc tgatggtatt ttctcttgct 163621 gtgcagaagc tctttagttt aattagatcc catttgtcaa ttttggcttt tgttgccatt 163681 gcttttggtg ttttagacat gaagtccttg cccatgccta tgtcctgaat ggtaatgcct 163741 aggttttctt ctagggtttt tatggtttta ggtctaacgt ttaagtcttt aacccatctt 163801 gaattgattt ttgtataagg tgtaaggaaa ggatccagtt tcagctttct acatataaaa 163861 ttttctcaac taaactcaga aaaggaagtg gggaaatatg tttgggtggc ttgtaaacta 163921 tcaattattt tacatatttg aaaccatctt taaatagaaa tccagtgtac cattggtctt 163981 cacactgaaa acaattcact cagccacaaa ataaaagcta gcaacacgga gataagaatg 164041 tcttatttag gtcacaaaat gcctacagat tctttttctg ttttcatctc tactgatcag 164101 tgattctcaa ctgggggcag ttttgcaccc ttaggggaca tttggcaata tttcagacat 164161 tttttattgt catgattggc agcttgttat tggaatctac cggggatcga ccagggacac 164221 tgctaaacac tctacaacgc atgacaaatc ccttaaagta aagaattatc cagtccaaaa 164281 tatcaatgtg ctgaggcaga aagatcctgc tctagatagt aaggttaaca tttctgcaaa 164341 taataactaa tactctctat aataactctc tattgatact ctctactaat aactgccttt 164401 caaccgaaac acaaataagt ttaaacatgc accaattata tgctgaatta aagaaaaaca 164461 aaaaagcctc ttacagaaaa tcaaagcact aataaaatta tcctgtgggt ttaccacaat 164521 gagagatcaa ggactggaga attaaaacat tggccaacca agtgaagttg ctaaagcaaa 164581 aaccagatat caaattattt tgattgttgg aattgagttt acactcagac aaattgagag 164641 tcctctaaat gaagacttct gagcatgcgt tatcatcaat atttagtaaa ttatgcttaa 164701 aatatctgtg 
tttgctaaat tttacatgtc ctaaataata catttgtagg ggaaaaaaag 164761 gcttttcaag ctatacaatt atataaattt cttccaaatc acttggtaaa tttgctcact 164821 tcatggaaaa caaaacagag gtgggatagg aagatatttt tcagaattgc aatgcaaata 164881 cagtgagaaa taatatcacc cccaaacatg agaaagtcgg tgtggttaga gctgaaggaa 164941 gacaaggaaa acttatcatg catcccagtc tctgatctgt gataacagta atgttaacca 165001 ctagtctagt tctaatggga actagaagtc tattctagag aaagatttac aaattattaa 165061 gagtattcat taaaagccac aaaacatgcc tatgtcctga atggtattgc ctagcttttc 165121 ttctagggtt tttatggttt taggtctaac atttaagtct ttaatccatc ttgaattaat 165181 ttttgtataa ggtataagga aggaaaccag tttcagcttt ctacatatgg ctagccaatt 165241 ttcccagcac catttattaa atagggaata ctttccccat ttcttgtttt tgtcaggttt 165301 gtcaaagatc agatggttgt agatgtgtgg tattatttct gagggctctg ttctgttcca 165361 ttggtctata tctctgtttt ggtaccagta ccatgctgtt ttggttactg tagccttgta 165421 gtatagtttg aagtcaggta gcatgatgcc tccaggtagg caataccatt caggacatag 165481 gcatgggcaa ggacttcatg tctaaaacac caaaagcaat ggcaacaaaa gccaaaattg 165541 acaaatggga tctaattaaa ctaaagagct tctgcacagc aaaagaaact accatcagag 165601 tgaacaggca acctacaaaa tgggagaaaa tttttgcaat ctactcatct gacaaagggc 165661 taatatccag aatctacaat gaactcaaac aaatttacaa gaaaaaaaca aacaacccca 165721 tcaaaaagtg ggcaaaggat atgaacaggc acttctcaaa agaagacatt tatgcagcca 165781 atacacacat gacaaaatgc tcatcatcac tggccatcag agaaatgcaa atcaaaacca 165841 cagtgagata ccatctcaca ccagttacaa tggcgatcat tcaaatgtca ggaaacaaca 165901 ggtgctggag aggctgtgga gaaataggaa cacttttaca ctgttggtgg gactgtaaac 165961 tagttcaacc attgtggaag acagtgtggt gattcctcag ggatctagaa ctagaaatac 166021 catttgaccc agacatcgca ttactgggta tatacccaaa ggattctaaa tcatgctgct 166081 atgaagacac atgcacacgt atgtttattg tggcactatt cacaatagca aagacttgga 166141 accaacccaa atgtccaaca atgatagact ggattaagaa aatgtggcac atatacacca 166201 tgaaatacta tgcagccatg aaaaacgatg agttcatgtc ctttgtaggg acatggatga 166261 agccggaaac catcattgtg agcaaactat tgcaaggaca aaaaaccaaa caccacatgt 166321 tctcactcat aggtgggaat tgaacaatga 
gaacacatgg acacaggaag gggaacatca 166381 cacaccgggg cctgttgtgg ggtgggcaga ggagggaggg atagcattag gatatatacc 166441 taatgtaaat aactagttaa tggggtcagc acagaaacat ggcacatgta tacatatgta 166501 acaaacctgc acgttgtgca catgtaccct agaacttaaa gtatgattaa aaaaagaaaa 166561 gaaaagaaaa aaaaaagcca gaaaacttta aaacataact ggtactctca aacaagcaat 166621 ttaaataatt gatgaatccc atttccacgt gcttgatagt catcagtgat tgagggcctg 166681 tgagtgagaa caatcaagtc agtggtcttc aagttttgct gtacgttctc atcagttgga 166741 gattacaagc tttctttttc ttttttaaaa tgatgccaag gcaggcttct tcccagacca 166801 aataaatcag aagctctagg agtgaagctt gatctttggc acagtattgt acagcacatc 166861 tcaaacttta actttcatgt agattgccta gggatcttcg taaaatgcag attctaattg 166921 agtgggtctg gggtgtgcct gagattctgc attttgcaat gtcaacagtg gtgttctaag 166981 gaacacactt ggaatagtga ggtgtcaaaa gcttcctgtt tgattctaat gtgcagccag 167041 cattgagaac tgtctatgac taagctccta gttgtaagct gggtgctatg tgggcacaga 167101 agtaaaatag aaaaagaagc attcctcaaa taaggaattg ttccatccta aatcactgtc 167161 atgcaattca catatctttg gcttatcttt tgcaagaaaa aagggtcaaa ttttctaatg 167221 ggaagaaatt ccattgtaga ccaaaggaag ctcagttact cctctggaag gacagcccat 167281 ttgatgttat cttatattaa tacaggtaac ttgccccagc cttagctttt gcgtttcctt 167341 ttctggaagg ctctttcagc agagttctgc atggctgttt cctgttcatc attccagcct 167401 cctcctatgt accattccca accccacccc atcactctca tttccttatc ctgggttttt 167461 caccagctgt ttatcacgag ttaattaatc aataacaata aatggccagt gagctcaaag 167521 aaacccacaa ccattcccat gccctgctga aacatgaatg aagcctgaat attggttagt 167581 actattggtc ttgggagaga gaaatcctaa aaggaagaaa aataatgacc ttattaaaaa 167641 atcacactgg ctagcatttt ggaggaggct ttacatttat agcttaatct caaattgatt 167701 gtctcatttg atgcctatga aaaactcata gggtggggat tttccccatt ctcaaggtag 167761 gaaaactgag tttttgagat gccaaagagc acaaaatcac agaaggaaga atcaaaggca 167821 ggcatctgaa ttcaaagtct aagttctttc aacttctcct agctgacttc tttatttttt 167881 caacacactc ttaagaatca acaaccaagt cctaagactg tacatacatt tgaacttgat 167941 attaagtaaa tttaacccac cagtatctaa aatgggaaat aagagataca 
aggcaagtca 168001 cgttggtgtg tttgtcccaa ccctacaccc tactttctcc atgtgatttt taatgggggc 168061 cctttctatt tactgaatta caagaaggat atcctgaaaa agacactatc aaattcagga 168121 aatagtcagt tagtttgtat tactgagtat ttataaagca gtattatttt cttatttaaa 168181 gaaaatggaa gttttaatta ttatttattc caaagattat aaaaggtgaa taataagtga 168241 tttaagagag tatatgataa tgattacaaa tctattcaag agcaatttaa gttttttagt 168301 ctttaatttt tcctatttaa aaaagtaata cttttttaaa gaaaacaaac aaaagcaaaa 168361 accaaaaaca gcagaaatat actgtgaaaa gaagaaatct cctatagttc ctccccctga 168421 agaaagccat tgtaagtaat aatgataata ataatataac taacatttgt tgagtgtagt 168481 aaaggccaca cactgtgcta ttaatttgtt aaaattttct atggatataa caaaatatat 168541 gcatagacat tatgttatta atataacttg catcatatta tatgatgtgt tctattctgt 168601 tgtgatacac tagcaaatgg tagtacatat ctgtacataa cagaatcatc actcatcatt 168661 gtagaaatgt ttacatttag taacatttgt tgtactaagc actgtgttca atgcattacc 168721 tgacttattt aattctgtca atgattctct gaggtagtgt gtagtattat ttcataatct 168781 aggaaatcaa gtattggaga gatttaatag cttattttag gacatggctt tgagaggcag 168841 agctaggatt tccactgaga gctgtctcag cccagagcct gaattcttat caatggtatt 168901 tttaacttgt tttaatactg agaaaaatat tttttaaata ttgatgaaca ttaactcaaa 168961 aaagcgtctt tagaactgct acaatatgtt tgcaatttag tagtaattcc agaaagtgtg 169021 cttggtagta attgaactca aatattaagt tcacttacac cctcataaat agcttaaagt 169081 ttctcagaag gtagggtgag tagttgatcc agaaggtgcc aaaatgataa gcgttataag 169141 ccaaaaatgg acttcctgta gaatgcaatg gggctacctt tccattctca atgaaaattt 169201 cagcaaaaat aaatgtgcac tcacagataa gtagataagc attatttgtc ctacaccaac 169261 ctcacatatc ctataaatct tagcgtaaag ttgtgcattg acctgctgca acatgaatga 169321 gcctcaaaga cataatactg agtgaaagaa gtcaatcaca aaagaccaca tgtcatgttg 169381 tatggttcca ttcatatgaa atgtccatag aaggcaaagc tgttccaata gaaagtagat 169441 tagtgggtac ctggggctgg gggtgggaac agggaatgac tgcaaatggg gatgaacaat 169501 ttcattggga tgatggaaat attctaaaca gattgtggga atgttgcaca actctgtaaa 169561 tttattaaga aaccactgaa ttgcacactt aaactgagta aattttatga tgtaaaatta 169621 cttcaataaa 
gttattttaa aaatatatct caataagaga accagaaatt acttgcaagt 169681 gaaggattaa aattaatctt atacattcaa aataagtgag taacattata atgtatcatt 169741 ttccttccta ggacacccat ttagaaaaat cacaatataa tgaatgaaaa taaacatcat 169801 ttttttgtta ctacttattt agtatttata gtgatctgac caagcatcca tctgtggtta 169861 atatagaaac agcccactga tctacacacc agacttctaa gaaattcagc tttctaaaat 169921 cagaagcaaa aaaggcatct gaacaactat caggattggc tacataattt atggagccct 169981 gatcaaaatg aaaatgcagg acctcttgct cagattaata attttaaaat gatgacagca 170041 gaacattagt catgttgtcc ttctgagcat gggccctgtg cgaccacaca ggtaggatac 170101 tcataaagct ggtgttggca atcaaactag ttgtcacaaa aattctggac taaggccatg 170161 cattttattc aagaggttta ttctgtaaca gaagtcctct aactctcatt ttattctaca 170221 tgcacattta tggacttggt tatctcattt gatggactaa gggcaatcaa aatcaacaca 170281 taagaactga gtggaagatt tcagaaggaa gaagccctct tctgaaggga aaagtgaaga 170341 aacactaaga agctaaccca gcaatatgaa gtataataat gggaccagga aatagtaaat 170401 ttttctatta caccctgttt gcaaagacag caacatctaa tacattttag aaaatattca 170461 tcaaaaaaca aaaaggaagg actcaatggg gcagcagtgt gctgataaaa catttaacaa 170521 ctggttctct ggaggaaagc tctgatttgc gccatttgtt aatttccata atataaatac 170581 tcattgtggc ggattccagg gtaccaaggt gaggacacag gacatgcagt taggaagaga 170641 tgcacacaac tggttctcca gccaacgcaa gctggctcca gcatactgct gaaatgggga 170701 aaacattagg tactaccggc agtgaaaata taagacaggc ccggggcaca ggaggagaaa 170761 taaaattcag ttctgttccc ttataaaaca ttttgttcca agggttagca aagagagaag 170821 gtgactgttg ctatttgcta aaataagata ctaattctaa tggcaaaatc atcttggata 170881 atcaaaaatt gactacataa ctatttccct tgtcactttc cttccaaaaa taattatctc 170941 tgaataattt caaggttaaa gattgtaagc ctaaagagca accgactgta aaaacaactg 171001 aaaactatgc cagtgatatg gcttttaaaa aaatagtaat atttgttttt aaaaaagaga 171061 gcatggccaa gtaaaaatat tttctctttt ctgttccaat ggtttaacct atgaatcacc 171121 ctctaggagt actatcatac cacagatagg caactcttac tctttacagc aaaagcataa 171181 gtcttcacta tcagttttag ttgcttccaa tagaaaggag agtatatgtg atattaaaat 171241 gtaaacattt caaatcacca taggctacct 
tgagaatccc tgattcaaaa gacctctctc 171301 accctcagga acactttaaa atttcttcag agtcaagcag tcaacggcat tctggctcag 171361 ggcatgatac cacagaagac acttccctga tggtctctta ggtggcagcc atattactca 171421 cccagcatgc ctggtgtttc ctaaatccct tttcaattgc ccttcttaca cccagaagca 171481 tcaaagagag gtctggggat gactgctccc tcttactgta gcaactctct cctccagaga 171541 gggcaggggt taaagttatc ttttttaaat tatctaataa aaccgtagtt gatcaacata 171601 ttttttccaa aagtatttac ggagtgctta ttacatccaa gcactgttct aggactcata 171661 attttgatat ttttgctagg taccagtcca attctgacag tacttagaag gcaaggtaac 171721 atactgtgca catgcaagcc tgtttaaaaa cagttctgga atgaatgcca agatgagaga 171781 cgctgactta cattagggaa gcctatacaa ctgattatcc tcattacctc tgtttaacat 171841 aagctcagac ttttttttaa aaaaatctca gaacctatat ggacattatc agttttaatc 171901 tagtgtcttt tttgccttta taactttcta gtatgagcgc atagaatatc ctacttaatg 171961 agagatcctc taggcatgga ggcctagaga tttggtaaca ctaattattt tggttaaaac 172021 atatacacac acaaacacac acatgtgcac acacacccta aatgggatcc atctttggga 172081 ggccaagtag atgagtatag gtttcccatt cttgggagta gatggtatca ttactagcaa 172141 ctgttgtgag atgcccctgc ctgtaagaac tgcgacaaga aactcctttt gatggaagtt 172201 gcccccaaag cagaaaggtc agaatcaggc tgcaaagctg tggacatgtc agggtcttca 172261 gatctctctc ctacaaggca tccttactat ctggagatag atggaaatta ttgatctttc 172321 aacatcctct tcaccaagaa gaaagaataa taacctccaa gaatattagt agcactgtac 172381 ttcctgaaag tctgagaaga attattaccc aagacatgga agcacatggg aaagggtagc 172441 ctgcaagggg ctaagcattt atgtgtgtgt ataacatatg catatatatg tgtatgtata 172501 tcaaagacag catgactata cataatatga agatgtaata taatatatat tatctataat 172561 gtataatgta tggtatagat aatccattat atataccatc aaacaggtag gtgacaatgt 172621 atattttatc tctcacattt aatttcattg ctctcctaag ctgtatgaaa atatcctgca 172681 agctgaacat ggtgactcac gcctgttttc ctagaacttt gggaggccaa agtgggcgga 172741 tcacttgcgc ccaggatgag accaacccag gcaacatggt gagaccccat ctctacaaaa 172801 aatacaaaca caaaaaatta gccaggtgta gtggcacaca catgtagttc taggcactca 172861 ggaggctgag gtgggaggat caactgagcc cagggaggtc gaagctgcag 
tgagccaagg 172921 tcacaccact gcactccagc ctgggcaaca gagcaagact ctgtctcaaa acaaaaatga 172981 aaacaaaaac gaaaacaaaa acaaaacctg cagccaagtt gcctctgaaa cctagaatga 173041 tgtcagacca acagattccc cacattgata gtaactaatt aaatcactgc tttctttgag 173101 ataaagacta gaacaggcac tctttactct ggggcaggcc tactccactt tggaaaaggt 173161 gccaaatctc atgggcttac actacctggt gaccttacac aagttactga aactctcagg 173221 ttccttatct gtaaaatggg gataataatt gtatatgttt cctagggttg ttatgaagat 173281 taaagaagta tttgtaaaaa ttctaaacca gtgcctgtta aatagtaagg actgcgtaag 173341 tgctataaaa aaaaaaaaaa aaaaaaagaa accgtgggag tatcttattg ttttctaatg 173401 tttgcagagg gagtaagagt tttactcatc tatatataat atgagcacat aactcattct 173461 cagtttgatt tttccttaac acaaacaagc ctcttttcag agtttaaaca aatcacaaaa 173521 gaatttttaa gtcattgtta aaattgacat gatttataag ctggaatgaa agtgccattc 173581 ttgtccctta aatgcatgag gtgatcaata cgttttttcc tagatataag aaattccttt 173641 aggtacctgc tttggcatgg ctggcataga catatttata ttaggtaatt aagaactcag 173701 atgaaaacaa tttgatacta ttgttaagca caatatgttt taattattat agtcataatt 173761 atataattat acctttatat atagtataac aatgcattct tttctacagt ttgcagtttg 173821 attcctttta tccttgatgt catttaggaa ttctaaatgt cctaatttct ccaaaacact 173881 gaatgagagt tgcagaaaga caaacataaa accagattaa ctgcaggtga aaatggcatg 173941 tggataattt atttctgaat ccagaggaat tacatttttg atgctttgtc tctccttagc 174001 ctagagtctc aagaagcttt tatacataca agtcaacaga ttaacattcg tgatgattca 174061 ctgtttcaaa ataacttaca tggtacatcc aaagttactg ctagaaatat tgctggcatt 174121 gatttcttta agcacattta aacacaaaag cccaacagaa aaatcagagc ttctacattt 174181 ggactaaaaa gaggacacac acagagatct gttcctctct aatagaatgt attaaatagt 174241 agacttgagt ggtgtttgta agtatgacat tgtgggaggg aaactatatg aagcaaaggc 174301 tataccttga ataggctgtt attactgcac ccatcacccc aaaacaggcc agtgtgattt 174361 tatgtgcctc ccccacaaaa tagagcttct taattaaagc cttagcactt accagttcct 174421 gccacacaat aagcaatcaa aaaaaaaaaa aaaaaaaaaa accactggct taaataaaga 174481 ccaaataaat ggatcattgt cctaagtagc ttacagtcta atttgggagg tgagactaat 174541 ttgggatgtg 
ggataatcaa tcctgaggac aaatagtgac atcaacagac ttgtgcaggg 174601 taaaatgtca aagaggatat ggagaaatct gaatccctgt attaagattg gtgcaaaagt 174661 aattgcagtt tttgccattg cttttaatca tcactgttac atcactggtg ggaatgtaaa 174721 atggagcagc cactttggaa aactgtctgg cagctcttca aaagtttaaa catagagcta 174781 ctatgtgacc cagcaattcc actcctagct atatttcatt tcaagagaaa caaaaacaca 174841 tgtacacaca aaaacttgca cataaatgtt cacagcatta ttcataatag ctaaaaagtg 174901 gaaataaccc aaatgtccat cagctgatga atgaataaac aaaatgtggc gtacccatac 174961 aacgaaagat tattcagcca caaaatgaaa tcctgataca tgtacaacat agatgagcct 175021 tgaaaacatt atgctaagtg aaaggagaca gttacaaaag gccatatatt atatgattct 175081 atttttatga aatatccata aaaggcaaat ctatggggac agaaagtaat tagtagttgc 175141 cagaggtgtt aaagggaaag ggaggggtgg ttggctgcta atgaatgcag caattcttta 175201 tgaggtgatc aggatgttct agaacatgtg gtgataaatg cacaattctg tgaatatact 175261 aaaaactact aaattgtaca cttttaaaaa ggggaatttt atggtatgtg aactagacct 175321 caatttctta aaacatgcaa aagatgcatc actgcaggag ttcagtggag aagtattaat 175381 aaaatttgga agaccttaga ggcttccagg gggaggggat aattgagcta ggtcttcagg 175441 gatgcattgg atttaggttt aaggagggaa ggcatttcat agaagagaaa gaaggtgacg 175501 aagtgaagct ggtcattgtt ccagtcatga aaaaaaaaag aaccaacata ccttgcaata 175561 atattgtcaa tgaacgtgcc cccccgcacc ccgtcccccc ctggctaaaa gcactgtcgt 175621 ttacctctgc ttcaggtgga atttacagtt caaggcagcc aatatgcaaa ccacctaagc 175681 ctccattttc acagctgtgg cttaccaatt tctctttaaa acaccaactt ttagatttct 175741 gtcccactca attacattga aagaaatcct agatatccta tctttcatcc agaaatattt 175801 cagcattata ctcttaatgt ttgacattaa cctagcatga actatcactg tatgatactc 175861 atgctgtgac tgtgccgtat tgtatttgac tttcccaata tttgggggtc tgagtttcaa 175921 gaaggacatg aatgtcaact tctagtcatg taaatagaat taaaaactcc tagtgctgtt 175981 tatcttgata aggtttctat agtttactat taaatattag agttcaagcc tccacttgga 176041 atcagatcca ttcaggagac tgtatcatgg tgttgcatag cagaattgct atttaagaat 176101 aacattcttg tcacattcat ccacattcca aaaatactct gcttggcctt gatggcagga 176161 tctgagaaga agctggtcat ggtgaacaca 
aagtttggga gagattcttg ctgaaatata 176221 gctttgtgtg acaaaagagg aaaagcacgg gattcctggg attggaagag catccaaaag 176281 cagctaatcc agcctcccgc ccaatgaaca aattccctcc gacgatgcta ttaggtaagt 176341 atcaccttga gttcaagtag tcacagcttt atgtccaacc agctcatgac tggacaattt 176401 ttatctagac agtcctcata gcatggccaa agtcacaaat gatgaaagac ataataaact 176461 ctagaaaaat acataatatc aagaccatgt ttcttaacct aggcactgta gacatttggg 176521 tctggataat tctttgctgt aggggctgtt ctgtgccctg taggatattt agtagcatct 176581 ctgacctcta cttgcttata cgccaatagc accctcccca agatgtggca agcaaataca 176641 caaaaatgtc tccagacatt gccaaatgtc ccttggtggg ggagtagggg tgggggaggg 176701 caaaattgcc cccagttgag agccacaggg ctagaggcat tacatgttaa gagctacagg 176761 agggaagctg tgacctaacg gccattcata tgtacaggat attaggatac ttacgacctg 176821 aacaaaaata tacccttcag tatccaacag taagttattt gacattacca aaactcagca 176881 aatgtttgtt agcttaaata aaaccatatg attgtatata tttcctgttt tagaataaga 176941 ataattcagt aaaaaggcta gtgcattatg ttttcattgc atacatatat acatataaat 177001 aattgaagga gatctgataa catgttaaat tttaaaattc agtggtgtta ttatttagtc 177061 agactttgtc ccatcactgt ccaataactt tatgtacatc attctgctta attctgactc 177121 catctctgaa agataggtct gctgactcaa ttatgatata atactaaggc tcacaaataa 177181 cttgctggaa gtcatccaat tagtagatac aagagttggg atttgaaacc atgcctgatt 177241 ccaaacacaa acccaaagta caacattgca ttgagattta ggatctgatt cccaagttct 177301 tggccacaaa agtatgctgt acgtttgtct gtctcgtttt aaagaattac aaatatgaac 177361 aaatgagaac aattcaaata tgtgtcaagt ttttttcacc cattttggtt ccaccttctt 177421 tgaacatttt caaaacataa aagactgtga gtcagacttt gagcctatgc ctaatttaag 177481 caagacttta tatactagat taattaacaa aatagaagat tgttctgctc tcttgatctc 177541 acattgcaac aacccaggct ttattgtaca ggggtcacct cccttctaga atggctttat 177601 ccacaactat gcttgtatag agaagtctac aagtcactag ctctctgtga gtatcagcta 177661 caagacattg ctctggcaac accaaggcat gcttgcaatt taacaaactc tgtaagtgga 177721 aggaaaaaag agccttccac ctaaatgcca aatctatatg tttcacatag gagtgatgtt 177781 tgaaatatct atttcatttt tgcctgccac tcccctattc tctcagagct 
ctgttacata 177841 aagcattagg tgttctgggg aataattgag catcagttgt ggcagagcgt aggaggatgc 177901 ctaggttacc tccattatca cacactctat ttaggaggtg ggcaaatcct atttatgtaa 177961 ctccagatga agatacaccc tagcaatttg ggagggtgat caccaaaata aataggtaat 178021 ttcacttgaa caccttgtac taagcacaaa tctgcttact gccttcatct tcttcgaaag 178081 gaacaggtgt cattgaaaac aatagtttgc gttggctgct cagtatggga aagccaaggg 178141 gttgaaattc aaagctgagt agaatacggg cagagaacag cagctgaatt ttactacgaa 178201 atgtgaaaac acagctctgt atcctcccca cctgtgaaga taagacatcc tgactatttc 178261 agtgtgagaa aagttcagca atcttggggt gctcagggct gggtgtaggt atgggggaga 178321 gttagagata gttgatgtga aaacttcccc agtaggaaac tgagtcaaat cagtcatctt 178381 ttggttgctc attcagttct ctaaagccct ggtgtcaaat acgacagcca ctagccccaa 178441 gtgaccaatg agtactggaa atgtggctag tcagaattaa aatgtgctgt aagtatgaaa 178501 taaataatag tgcaaagact tggcatgaca aatgtgtata tctcgttagg aacttttcta 178561 ttgtttgcat attaaaataa tatttcagat atattaggtt aaataaaata tattactgtt 178621 tctttttatt tttaaactta tctaaatgtg gctattataa aatttaaaat tgcatatgtg 178681 gcttggattt catttctatt ggacaacact aatcaggctg aaagttactt taaggcatga 178741 tttatgctga gctcaaagga aggcgcttcc ataggaaagt gttgcttgac tgagcttggc 178801 tccctatctc aaaatgttaa cttcaaatat acttcaaaac tgaaatttca ttcactgcct 178861 tcttggcaaa atctgttaga taaacaaatc ttctactctt tgagacatga aaaaggatgg 178921 ttaacaatgt aactaaactt tcttcctttg cagacatact taaaccacgt taaagcagag 178981 ataaaacagg ctctaacatt gaaatgggct cataattagg tgctaatgtt gaatatcctg 179041 taatgtttca aaaagacaca gatgcagaaa ggccaggtct gcacaaggca agagagaaaa 179101 cgttgcattt atgaatgcaa atatgttaca gatatccaag ttcaccagtc tatgccttac 179161 tgatacttct tgctttttgg catctcctct ttaatttcca ccacttaact gattcacacc 179221 tgggtcagag gcaaagccgt cttcctcatc tctctgatga gcctctccac tctttgcatc 179281 tcccattgct aatctgtctt gtccccctac cattgcaaat catctcaact tttggcttaa 179341 gaacctatga gagtagcagg cagcccagga tcgaggtcat ccttgcatca aagccttaaa 179401 atccgattct acataatcca gccaagcaag agattccaaa ggcaagcacg agaatcatta 179461 ataatgaagt 
agtcttccaa tctattcaag tcaatcctca gtctttttaa tgcaacattt 179521 acattaaaac ataaaaataa tccctttgtt ttgcatttgt ttaattcatt tacaaattat 179581 tccatataat tagaactcaa agaaagtatc actgtctgcc aacaggaacc aaattttctt 179641 ttcagttcat aagtttttta ccctccaaaa cctcatctta aaaagttttg tgtttgtggg 179701 tggttataaa tgtttttcta atctagagct caaagaaaat gtggccaaag gatactgctt 179761 tgggtaacct caggaaggaa gaatatctcc aaaaatatta aaaataaaaa agcagatcat 179821 agctcttaag gcccagtatg ctaaatcctc agcgagcctg gatcatgtag gaacgagtaa 179881 ttctagataa gcacacatcc tggatagctc agggttaaca ttctaactat cttattgtaa 179941 gagcttctat ttctctaatg gagacatttt catgtctttt aaaaccattg aacagtttgt 180001 cttataagca taattatagg tgaaaataca gattaaataa agagctgatt gaatctgtca 180061 tgatattacc acatgaagca aatgtctgca ggcatccaca ctgacaatcc attatgactg 180121 ttatatcttt tcaatatggc acccagcgtc ctggtaggcc taatataagg ggctgctggc 180181 tgcccctcca gcaaacgtta gagcagttcc tgactctact tcttcattgc tgctctcttt 180241 tccactgtca acaggaaggg caaactgctg tccctctgtt gattctcact tctaccacca 180301 catggccatg ggtgcccacc ttgaggtccc aggtgacccc agtactgttg ccactgtgat 180361 ctctaccact ctgctcctct ttgctttctc catccttagt tctcaaaatg agcttttcat 180421 gcacctatca ccactggggt ggaatgtctt tttccttcta taagacaaat ccatgacgga 180481 agacactgct gggtaaaaac acatctggat cccaattcta cttttaagga gaatttcttt 180541 tttctttctt tttaaaaaaa ttactagtca ttcctgcagg ttggtatcct acaaatcaag 180601 ggtttactgt atgattatta catatccata agaaaaaaat gacaatttaa aaaattttaa 180661 tggcaagctc tggcatatta attggaaatc cttggttctt agtctaaaca ttgaattgtt 180721 gtgtaatcaa tttctagtca aataatcaat ttctacttca gtttcattaa taattatgat 180781 taagtcagta gcaaaaaagc atgttatttc tcctcgggaa gatgtaagac agcttaactt 180841 aagggaccct aaatatggtc tttgtaatca gtggacttga gtttacaacc caacttcacc 180901 acttattttt gtgactttgg aaaagttact tagcctctac aagccccagt ttcctcatct 180961 ataaaatgat ataaagaaac ctattagaga gttgtaagga attttttttt agataggatc 181021 tcactctgtt gctcaggctg gagtgcagtg gcatgatcat cactgcagcc acagcctccc 181081 tcgctcaagt gatcctcccg cctcagcttc 
ctgagtagct gggactgtaa gtgtgccacc 181141 atgcccagct aatattttat tttattttta gagacagggt cttgccatat tgcccaggct 181201 ggtctctaac tcctgggctt aaatgatcct cctttctctc ccaaagtgct gggattacag 181261 gcatgagcta ccatgcccag ccgagaattg taaagattat aaagtactcg cataagtcat 181321 caaggaaaat gtatgacaca gagtgggtac tgggatgggg gcggagattt tgcccattgc 181381 tgatcccctg gcaacagact aaagaagact atactgagcc tcagctagga aaagcaatcc 181441 tcaaatactt gacagcatgt aagaatgcac ttttaaagat aatgatcttt tttccttttt 181501 agagattcat ggtactgttt ggcttacagc attccgcctg agcttgctgt aaactataca 181561 cacacacacc ccacacacac actcacacac aatgtaaaca cacacacata tgccactgcc 181621 accatgacta tcagccagtt aaacagatta tggtacattc acccacaaat gtccatataa 181681 aaagcatcct agtatcctgt ggttggccat cgtggttact cccagcatgt tggccctgat 181741 taatgtgtgt gggatgtgtg cgtagggggt attgaaatat actgtatccc agtgtccagc 181801 gaaaggccag accgtgttca cgtaaacagt cctgacatat tctgtttcag tttcccatga 181861 cataaaataa actgtcacct acaaactgca agtaaactga gaagattgtt aagcacatat 181921 ataacaaaga aacatctaaa tagataaatg gggagttttt ttttttttaa taagtaaatg 181981 tgcctagaca gactattacc ttacagtgga atagaccatg catgtctagc attgcttggc 182041 tttcttggtt cattttttat aactacatta ttgagatata attcacatac tttacaactc 182101 acacatttca ggtgtttgat taaatggttt tcagtatatt cacagagttg tgcaaccatc 182161 accacaacca attttagaat atttttatca cctcaaaaag aaaccccata cccattatta 182221 gtctctctgc attttatccc acccccagcc ctagggaacc accaatctat attccatctc 182281 catggatttg cctattctgg atatttcatg taagtggaat cacaagagaa tatgtgatct 182341 cttgtgtcca gctcaagaga gacttagcac catgttttca aggttcatcc atgtgggagc 182401 gtgtgtcagt acttcattct ttttcctatt gaataatatt ctcttgtatg gttgaccata 182461 ttttgtttat ctgttaatca gttaggatgg attcttgggt tttttccact tttttgttgt 182521 tatgaataat gcatgtgcat tcatgtataa gttttttgtg tacacacatt tcatttctct 182581 tgggtagata actaggagtg gaattgctga gtcatatggt aactctatgc ttaacttttt 182641 gaggaactga caggctgttt ttcaaagtgg ctgcagcatt ttacattccc accagtagtt 182701 tatgagggtt ctaatttccc tacatcctca ccaacactta ttctttgtat 
ttttttcatc 182761 attactatcc tggtgggtgt aaagtggtat ttccttgtgg ttttgatttc catatccctg 182821 actgctaaag attttgagca tgttttcatg ttcttattgg ctaactgtat gtatgttcca 182881 tgtgttcatt ggccatttgg agttgaacgg ggaggaggtg tgggatgata ttcaaaggaa 182941 tagtgtagga atcagataca ggaatgatat cctatagtcc agtatttttt tctttctgct 183001 ggcacaatga atgttctaag gaatttatga gtagcttttt aatcttaagc ctgggatttg 183061 tgagctaact ataaatcact taaccaaagc atgatgaaat gctattcaac aaaatgctga 183121 aggaagcagt aagattaata actaaagaca acgtatgact tcaacactat aacttaaaaa 183181 aaatcaaaac aacaaaaata gaactataag tcttctgtcc attagtcaaa ggtgttatca 183241 ctggctccta ggtcacagat tgagacctca gcctggttag tctgtcaagc agagttcact 183301 gaagtaaacg tgtgtaagga ggcagggagg gggaagcgga atggaagagt ttataacaaa 183361 aatcgcaacc agattctcaa tgcatacagt gtggcaaacg ttgtataagc ctcacgggtg 183421 aactacaata ttggttattc ttgccacaag tctgccagga tttcttaaaa agttgttgct 183481 ctcattggtg gtttgcttgt ctgtaacaga gaagggcaac aaagttcctt ccatgtacac 183541 agtttcctct tgctctgttt aatagccaca agcagcttat aagcataaaa ctctatccag 183601 gtacaacgga agcaccctta gccatagcta taataggccc agggagagga ggtgaccgat 183661 gcagcagtta cccagtcaat aggggtctat gaggaagcat gatggaagag ttttcccagt 183721 atggacgctg ctgcccaaat tgagagttgt agaaattgcc gttcagtggg tgccgggaac 183781 agacagaagc agatttaaag gcagtaggac gacaaagtag tcatccatga gttcttccta 183841 ccagagttta caagatatct gagggtggca atgctatggg tatcctggca ataaaagacc 183901 tcctattctg agtgtccaaa aggctgtaac acaatttcat cacatctctt ttttctaatt 183961 gaggaaattg atcatcacac acacacacac atacacacac acacacacac gcacacacac 184021 acacgcatcc tccaaccctc acactgctta actgtttcta taacttcccc ttactccaat 184081 atccttactc tgtggcctgt aagtccctgc catttttctt gaccaccgca cccatctctc 184141 caaagctctg ttctgctact gttcccctta ctccttcact cagtctgcta ctttcttcac 184201 tcatttgctt gaacttgcta tgccactttg ctcctcagca tctttgcaca aactattcct 184261 tcttttctat agtgttttcc gtatgttaat ccacctaata atcatcagcc tgtaggtctt 184321 ggcctagtct tcacttttaa agcataacag atataataac tagggccacc gacacgtaga 184381 tccacagcct 
gcaagaaatg gggtgggggg ttgtctctgg cttgcacccc atgattcact 184441 ggatatctat gtcaagtgtt cactcaggag cacaagcagg ggcctgtggc cacatgtcct 184501 ctggcctctc caagtcttgc tgccaagtgc cagggctgga aaatggttga agtggatgac 184561 cccctgccac ttcctccaag gtgacactgt gaaggtggag caggacatcc atactcaggg 184621 gacacagatt ctcaagcaag gagaggtgga gggagccttt tgtgtacact ttactttgat 184681 aatggatact gctggaatat gggcagactg cagtggacta tttttaaaga aaaggatctg 184741 gttttaaatt catatgtggt tttgtgattt taaaaggagg actacaaaaa atactgttgt 184801 gaagcaaatg atcaccatct aatgtgcctc ttgtttgtta tacaaataaa agttataact 184861 aaaataaaac acttcacact ggctaggatg gctactatca aatagacaaa gctggatgtg 184921 gtggcacatg cccatactcc cagatgctca aaagattgag gttggaggat tcctcaaacc 184981 caggagttca agaccagtct gggcaacata gcaagatgct gtctctaaac attactacta 185041 ataataataa caacaacaag tactggtaag gatgtggaga aactggaacc ctcaaacacc 185101 attgtgggaa tgtaaaatga tgcaatcact ttggagaaca ctctggcggc tcctcaaata 185161 gttaaacata gagttcctat aatgaccaag caattctact cctagttata tacccacgag 185221 aagtgaaagc atacatccat acaaaacata tacacagaag tccatagcag cattattcat 185281 aaaagcagaa aaactgaaaa caccacaaat gtccatcact tgatgaatga ataagcaaag 185341 tatcatgttt ccatatgatg aaatattata tggtgataaa aaggaatgaa gtactaatac 185401 atgctgcaac atgggtgaac ctcgacaaca cactcagtga aaggagacag tcacaaagga 185461 ccacgtattg tatgattcca tttatggaaa atctcaagaa caggcaaatc tagagacaaa 185521 aagcaaatta atggttgtca ggggctcaga gtagcattgg tggaggagat aaaaagagag 185581 tgactgccag tggcatggag tttcttttgg ggaaacaaaa atattctaaa attagctagt 185641 ggtgatggtt gtacaactcc gtgaatgtac taaaacccac tgaactgtac actttaaatg 185701 ggtgaattgt atcacatgtg aattacatct caatagagag ggtttaaaca aatgatgcta 185761 taccttgagt ttacagagca aaatgtccct acataacttt gtgtttccat gtttgaccac 185821 ttagcaggaa ggcctctcaa gttaatgaac acatttccgg ctctcgcaag acagtcttac 185881 tgattgctat gagactaaaa ttccacatga acagaaccat agtgtctgga tgaccttgat 185941 gagcaagaga attataaata caagttagaa aattttcaaa aaagaaggaa ggaaaagaag 186001 ttaaagtggg aatagggtgt taccactgtg 
ctagaaaaaa taaaaagaga gaaacaaaca 186061 aacaaaaaaa tgggtgaaaa gttgttttta gtagaacctc tttaaacaag gggaatgctt 186121 tcaacagtcc caagcaaatt tcagtgtcac atggttaagg ccaaaatatc ccaaaataag 186181 atgaataaga taagattatg gccctgttgc tatgacttca ttatgtggaa tatagatatt 186241 gggcctgtgt cactggcaga cattatttat atatatatat atatatatat atatatatat 186301 atatataaaa tatacaaata tatatgaaat atataaacat ttatttatat gtttacgtta 186361 gatttatata tgcatatata ataaacataa atatataaac aaatgaacat atatttatag 186421 gaagataact ttattttggt ataattgata cagcataaaa tttatgcatt ttatgtgtac 186481 aattcaatga tattttagta aatgtaccaa attgtgtaac cataactatt aattcagttt 186541 tgcaacattt ttatcaaatt atatagacag tcctggtggc tgctcagata tgtttttgct 186601 tttgatttta gaaagtcgag tttcttagaa gtctggtgtc tagttcagtg ggatgtttat 186661 gtcttaacta caaatgggtg cttggtcaga tcactgaatc ctgaacaact agcaaaaaat 186721 aaaaaggtgt ttcccctcat ttattagatt ttgaactttc taagaggtgg cctagaaagg 186781 aaaaaatata tatataccat tactaacgta tttcaagttt gattgtataa tattgatttt 186841 aatccttcat ttgtaaataa gtactttttg attctcttat taactcatac tctaaaacaa 186901 cttcattgag atgtaattta cattctataa acttcacctg taaacttact gcgttggata 186961 accatcacca caatccaatt tgagaacatt tttgtcactt caataacatc cctggggctc 187021 actgacagtc actctccttt cccaaccaca gccccaggaa accactaatc tattttctgt 187081 tcctatggat ttctggaagg cattacactt tcacacacag atagaaaaag atatatcaaa 187141 aatatatctc tcctttaaga aaatcctatc tctaccccag aatttcagtg tttgtgatgt 187201 ttaaaatttt taagtaataa ataaaagttg ccacaaatgc tatgaaaaac agtcccccaa 187261 atgagaagtt tctccaccat aatatacatt ttaaagttta taataaccaa ttaatagaca 187321 gcactttaac accatgtgat attttcctga ttaaagccaa aagctactaa ttctaccctg 187381 ctccatgact aaccagcctc ccaccaaaaa caaaaatcca aaacacttta gaacacatat 187441 tgttcatttt atttgcagac aaaactcctg gactactggt ctccttcaaa tttggatcat 187501 aagctgaatt aagcaaggga aacttcccaa aatgacatcc tattctgcag ctttttagag 187561 ctacagaaac ttttctagac attaaattct ctttaaagat aatccggaat tatgttagac 187621 tagtttaaaa accatcactt ttcatagcaa catgattata ttgtgggcat 
ttccaagtta 187681 attatagatc tgtcatttta aggaataatc attaattgag atcttttcat tgaattaatg 187741 aagggatatt gtacagctgc aacaaaataa cccttttccc cttaaccatc aatggagaga 187801 attcgtacca tggtaatttg gaagctacca ccatcatgta tgcatatatc atagttctca 187861 agaaaatatt gttcccgaat tgtcttttcc tactaaagaa ataacccttg gtcaatgtga 187921 caaatatatg acattgagtc atttccacat aactctccag cagaccacat ctgtgaagct 187981 agataaatcc tgtttcaatt tggattgcca taaaacacaa cactcttttc aggccagtcc 188041 atggacttct gatagcagcc ttcacaggcc acaggttgcc actttccaag tctaggatgc 188101 tgtggctctc acaagggtgc tgacccggtt tctattttca actggaattt cctcctctgg 188161 ggtcatttga tttccaccct cagagacatt ttcctggttc tctgttctgt gccccgacct 188221 actcgttgct tcaaaaccaa aaagaattcc aggttcatcc tcctgcacag tccagtgaga 188281 caaagactcg tggacattat gctttcaatc tggtggaagc catggttttt tgaacctcag 188341 tgagaccctc ttagaagaca ccaaggtgac atttactcag gaagctgctg agctctaaga 188401 caagacctgc aggaaatgat agcgggaggc catggtgtag agctaaagct gttaagacgt 188461 cttcattcaa agggtcacaa gaatgtggaa gagtcagcca aaagaaacat cacaaagtgt 188521 aagacagggc tacttccttt ggcacaacac tggctgcatt ggcaggaccc agagatggct 188581 cacccccagc tcttgtggct gagcgctcag caagccacac ggagtttaaa gaacaggaga 188641 ctggtgtttt tagagtcagc tcctggaggc cctttatgtt cttgcctctg agggagttct 188701 ggatacagta acataaaaat ctcagataca tccacaagaa gaactagcat tttaaaccca 188761 tgagtaaatg acacttttct gagtttaagt ctccctttcg cactggtaag aatggcaatg 188821 tttatgaaaa ttattctgcc tagaataatt gcaaaataaa ccatgaaaca tatactattt 188881 ttaacttaaa atttttaaag cattttccag tgatgtagag attggacaag gcaaatagtt 188941 ttcagacact atacttgcag atgcatcttc ttattcaaga ggaagcctta gagtaaaaat 189001 tggagactaa ccttttaaaa aaatgtttcc cttagcgcta tacacatacc cagttctgca 189061 catatactgg tgaatcaatg aatgttcact gacattttgc ccttatctga aatataccca 189121 ccactcccct ccataccaac aatatgtgtc cctttctttt gccttcatac tgatgtaacg 189181 tagacctatc tacagtgtct gtggtaattt gagacatagc attagactag agataggcaa 189241 actttttctg taaatggcca aagagtaaac attttaggct ttgcaagcca tgcagttcat 189301 tgcaaccaac 
cactgaactc tgctatcata gtgtgaaagc agccaaagac aatatgtcaa 189361 taaatgcaca tggctgtgtt ctaataaaac tttatttatg aacactaaaa tctgtatttc 189421 ataaaatgtt cacatggcat aagatattct tcttttggta tttttttcaa ccatctaaaa 189481 atgtaaaact gattcttagc tcataggaga cacaaaagca ggcagttgtc tggatttggc 189541 ctgtgggtca tagtttgttg acccctgatt taagaactga gctaccgtaa tatggcagac 189601 agtctactag atatgatcac gtacagagca aactggaaaa aaaaacaggc cttggcaact 189661 gttcaatcaa gttgatataa tagaggaaga tatggtgaag gtgttcaaac ttcaatgaga 189721 caagattctg tgtgtgtact ttaagtgcat tgcttttaat gtagcttatc catttacatg 189781 tcaaaattga tcaaattaat aatttaattg ccaaacgttg cctctggtgt gtttggccta 189841 aaggattaat atatttttct ttcattcata aatcacctca tccttaattt cttctaggtt 189901 ttgaaacagt agctaaattc agaaatcttt taaaagtgat actagtaaaa agaagcagat 189961 aaaaaatctc cagggagatg ttggtcaagg atacaaaatt ttagttaggc aggaggagta 190021 agttcatatt ccttgttcaa catggtgact atagttaata acaatgtatt atatacttca 190081 aaactgctaa gacaatagat tttaagtgtt ctcaccacaa aaaataagta tgtgaagtaa 190141 tgcatatgtt gattatctca atttagccat tccacaatgc atacatattt caaaacatat 190201 tgtacatcat aaatatattt aatttttaat tgtcaattaa aaataaattt taaaaagtat 190261 ccagggaggt gattctgaat ggtttctttt tgatagtttg aggaaccaga gctgctacac 190321 acaaggaaac agggacgagt gtcgcaattc tgtttaaatg ttaatatcat tcagtctatt 190381 tctatattcc cttaatgtct actcagtctt tctccaatca ctgtggattt atgggaactg 190441 atatcagtga gagaacctca aaaccctgct tttgtcattt aattgggtca tgatgacata 190501 gatattgaca tccatgacag aaagaactca ctgacgaaac agaaaaagaa aatcaggata 190561 gatttgtacc ttcttaaaag tattggcatt ctcaggataa actgattaaa cactgaggta 190621 aagtgaatca attcccccat ctcttcatcc ctcgttgctt cacatcctta ccacagcccc 190681 atcatggaag gagtgcattt ccccacacac tgaagttagg cttttacatg tgacttgctg 190741 tggccagtag caggtgggtg gagacaatag tgtgtcaaat tcctgcctac gtctcaagag 190801 tcttcagttg atcccacttg ccatcttgtg cctttgtcat tgctgaactt ctttgggtag 190861 atgctgccag cctgggcccc aggatagata cacatgggac agaattgaca gaagagcctt 190921 gacactttgg cctctggtca aacagctgag 
ctctgtctag ctaagccaac cctaaccgac 190981 ccacagatcc actaggatcc gtgatggttt ttttatgcca ctgaatgtca gagtagcttg 191041 tcacacagca ttagtgtggt gatagctaac tgatacaaac acacttctac attttagaca 191101 acacagctat aaggtaaagt acgattttgt gtgtctatgt gcatatatgt atgcatgcta 191161 ggagttctat atttcattcc tttctaatga cgccttaaat aaaaatgtgc cctcccccac 191221 ctgcggcagc acagaaatat catggaaggc ctcattctca aaccaagttt cctcttagac 191281 tcactacatg agcctttgat cctttacaga tgtttgcaat gggtgataca aagtatctgc 191341 gtggtaatgg tgacattacc tgtctaattt taatgcaatt aaattgcatt aaaagcaaga 191401 gttgggctca gtatgactac atttaaaaaa tgttcttcct ttctaataaa tatcaagtca 191461 taatcatatg gtaaaactga aatatgtgaa gtatctagta agtttgccta tgtttgcaga 191521 atcagacagc atcagcagtt tggacacctt ttttaaaata gtgagcacca aataaatgca 191581 ggaagtcttt ctaactgtgt cccgatgata aggtttcact cttcagcaat cctgaataaa 191641 aacagctatt aaggattttt tgaaccagtt catcatgata aggttcaaac ctttccccaa 191701 gagaggtcta cgaacggcca ctccctttga aatggatgaa taaaaatgca tgtgtgccat 191761 gaatgccaag gtttacgtaa ctgttgtgct gcccagagcc cagctcggtc atggctacct 191821 caaatccatg caggactcgt gtttatcctt gttcgtaaat acttttggaa aaccctgaca 191881 agggcagagt gccagctctc atgtaccttt tccaggctgc tgcaagttca gtcacgcggt 191941 tgtcggtcca gaaatggatg tgccgttaga acagccatgc aaagagggag tgtgccgcat 192001 ttacgcagga gataaactct gccgtgtttc aatcaaaagc agtttgctag tgaaaacaca 192061 ggtgttggaa ttataaaatt accacaaccc tcaaaccctc aactaaaaat aacaaacagg 192121 atgttcttta tatgggtgtg tacatatata tatatatata tatatatata tatatatata 192181 tatatatata cacatacata tatatgcatt gctgacaaca aggacacacc aaaatcctct 192241 ctgatctccc tatcaattca ccatactttc ttctgttatg tttatggatg ctttgggtgt 192301 aaataatcaa tggagacagg aatgtttcca ccgtgcagca agaacaggag gaagaattta 192361 aaatgtgtca tcttatggag acagttcagt ggtagcccag attcaggatg agggcaggag 192421 tagggtggat gtgaatacaa aggggtagca tggggggagt gcttttgcgg tgatggacca 192481 gctgtgtatt ttgaatgcgg tagtggatat gccagtctgt acaggtaata aaattgcaca 192541 gaactgcacg catatataca tacataaaag aatgcatgta aaaacctggt 
aaaatgagaa 192601 tctgtagtct agttaatagg attgtcccaa tgtcagcttc atggttttga tattgtgtaa 192661 ataagatgct accattgaga gaagctagat aaggggtaca tgggacctct cagtactatt 192721 tttgcaagtt cttataagtc ttcaattatt acaaaatgaa aggctttaaa aagtattacc 192781 tcaaaggtat caccttaagc caatggcatg tcaacaatca cccactgcac agtgattcaa 192841 gcatgcagac ttggcaatca aattaagaga caaggatttg agtcccagcc ccactacgta 192901 ctagctgtgt ggccctgaaa cagacataag gtcagagctt tggtttcctc atcttttaaa 192961 tgaggataat aggacctatc tcacagggtt attttaagga ttaagtgaga taatatatgt 193021 agaataagca ggactcatag agagcattca ataaatattg gctattatta ttactctaag 193081 tcagactaaa tatttctgag agctttaaat cactgattaa tttcattgac actcaagtac 193141 atactaagca tcaggcacat gctataaatg gccaagcatt gagtcacaca caatatagtc 193201 cctggttctg gtctgcgtcc ttttggattt tctggttctg ttgtgaatat ggtctccaaa 193261 gaagcaggat gaatatttat gaaagtgggt gcacagaaga ctcaaggagg aactaactta 193321 agaataagag gtactttctg cagtccaggt tgaaatgaga gttttctctc tgctaaaatt 193381 tactggggta cagggcggtt taactgcaat acttagctgc aaccattcct tgaccagcag 193441 agataaatga aaattgcctt tcagtcaagc acaggttaga ccatttgtgt tgatgtgcaa 193501 ttttaagagt taaaaccctt tacccatcaa ccagaaaaaa agccaaacca aatcttcctt 193561 tttctttgtt tggcaaggaa attacattag actatcctaa tgggcttaga gggaaaaaaa 193621 atctaattgt agtctatatt aatacattta catattttga attatattaa taaatactaa 193681 taaatttaga ccagggcact ttaaagtaat gaactttcca atattagatt aatgacaaaa 193741 tatacgtaga atatatcact ctctactaaa tgctttaata ggtttgtgat ataaaaatcc 193801 ttaaaagcaa gcacaccacg tatggatgga gctggcatag cacctgagta ctttaaatgt 193861 ggtagtcaaa ataatctaca gtgtggaaga aagagacatt tgaagcaatt tgggagactt 193921 ctatatactt aggaatttaa ggtctttcat aaatgtaagc aaaccccaaa caagactgaa 193981 aatgcttgta atgatatttt atgtttgtca acaccgaccc cccctcccgc cagccctcaa 194041 caaaaatagg aaataaaatg ttctgccctt ctctgtgctg gtctcactgg attggctggg 194101 cttgggccca gaaggagatg gagaacaggg agaccaagaa gtcagagtat ttactgttcc 194161 agtctatccc tgcttctctg tattttgccc ttgctggagt cttctctgga cacattgcca 194221 gggtcccttc 
cagccaaggc tgcaagtcag tgggctctgg taacgctggt ccctccctgg 194281 taacagcttc tcagtgttgc taattgctgg gtgcttcacc actcaccgtt ggttccttta 194341 aacctgtcaa cacattggcg tagttcattc ctcaaattaa gccttttgag tgatttatct 194401 gtttcctgtt tggatcctga tcgacacaaa gcccttgact gtgaatcagt agctaatgtg 194461 ggaggatgcc ttaacttttg agataaaagt taagaccgtg ttttgcattt tgctcagacc 194521 atcgaaatgg catcttaaca tgaatggaac tgttgaaagt catgtgcata gatacccaca 194581 gaaataatac tatgtgttat gttaccattt tcaaatcatt ctagcaaact caatgttcca 194641 tacccttgct aatctattat tcatctgtgg gtctaattaa catggcatca ccaagctagg 194701 aacctagagc aatgtccagg gatttgtgcc catgtggaga ggcacccacc tggcacagag 194761 atgcttggca aatatggagc ccacgtgctc cacaaagtgc tgagatgctt ttagcctctg 194821 tattgcactt ggactggctt atacatttct ttaccagata tttcagtcaa ctgcagaaag 194881 gggtcattca aaaatatgtt aaaagtttat tcatctaaac tacttctgtt gctgttgcca 194941 tggcaacttt tctctaaagc cacaactcca aattcttcta aaattgttat ttcagaaatc 195001 taatttagct gacatcccaa acactcaaat tttactataa gatttttttt tatgttagaa 195061 aagttctaag aaaatgacaa gtatccgaag agaagaggag aacaagcctg acaggttttc 195121 atttgaaaga gtcatggcat ctttcaagga aattggaagt ggaagagtct tcagtaacag 195181 taactgtatt tgaatgaaaa aatttggctt atattttgta tatgtttctt gtattgaaaa 195241 gtatcccata ttcacatcca agtagtcacc acaaagatag tgaaagcagt accacataat 195301 gactggtggg agtccttgtt gcatcactgg atctaccttt acgtttttaa gtattcaggg 195361 tatatttaag aataaaataa atcagaaaaa ttggtctttt gaaaataagg ttagaatgag 195421 aaaacctgag aacagaaagt acattaaaag atattaaatt aattaattgt gtttgtaaac 195481 agtggtctca gtccaaaatg tggcaagcac ccaagaccgt taaatctatc cccaccctcc 195541 ccatcatctc gtaactcaca atctaaaaca gaccatgcta atggacatgc agtgacatgg 195601 ccaggagtta agaacgggct ctgcaattca atgggagaac aggtaggcca caaacccgcc 195661 tgccaccctg ggctgctcag gagattttga accctgatgt gattgagaga gaaacgctgg 195721 gagccactgt ttccctccct ctttcctttg gcttttcttt cagcaactct tcccttttcc 195781 ccagagcagg ccggtctgtg tcattatcct caaggagtag ggacagaagc aggcactcta 195841 gtttctcact ttcatacaga tgcagatccc 
acatccacac accaaggctg ctgagcactc 195901 cacacatcct catttggaga ttataacttc taattttagg cccacgatta acacatttgt 195961 taaacgggta ttctggctaa atcagaagag aggtggcaaa ttaggcaagg ctgttagacg 196021 gtcataaagg aagaagttag ctaatgtctg acatgagaga gaagacagta aacatttttt 196081 taacccaaga ttaggaaggg ctttggcatt gaggagtgaa gaaacaaaaa gcccaagatg 196141 ctgaggcttc tggaaggacc accaacaccc tctccagagt aatccccaca cagtatgaaa 196201 cacactgttc ataactgtga ctcacagcca gggaagctca ctcacatttt ctcacctcta 196261 tctctgaaaa cccaagaaac tacgaataat gtattttaga aagtgccata caaaattgtt 196321 ctcaaaaaca tttaagattt tgatatctac tgagatacaa ttaagtcagg gtttctctgt 196381 ctcagcacta ttgacatttg aggctggaca actccttgtt gggttgtgtt gtgctttgta 196441 ggatgttcag caacatgtca ggtctctact taatagatgc ctgtggcaac ccctgccagt 196501 tatgataacc aaaagtgcct ccagacattg cccagggtcc tctggttcag aaccactggt 196561 cgattttacc tagaatatgg ataccgaggg acatattgcc agatgacatt atatagagga 196621 agtattccta taccttttgt atcattagag agtttggctg aaatgaacac tgggtggtaa 196681 aacaaaggac tgctctgaaa acttgtacta gattacctca gagtcagctg aggtgaagta 196741 ttgtattgca ctttctgaat ctactttgct gctgaatcct acattttgta ggggctgctt 196801 gagtagtaag tggttcaccg tgtaaattaa tcagtgtttg ccaaacccag ctgatattgg 196861 aaccaccccc aaagcttttt caaaggattg attccaagtc tggtccttag agattctcgt 196921 ttagtaagaa ttcagtgaaa ctggaaagct aatttttagt ctcgtctctc cctctgacct 196981 caatcatcaa ctaataaagg gttaagtttg gaaagtccta gcgtaaccta gggaaaaata 197041 cacaggcagc cagattcatg ttctcataaa tatttattgt atgcctaccc tgtgccagga 197101 cctgctctaa ggcagtggaa agacagtggg aaaggatcat tgggtccttg ccctgaataa 197161 acttacacac tagtcagagg agacagacaa acaatgaata tataatataa taccaagtgg 197221 tgatgtgaac aatgagggaa gtggctagtg tgatatgatc cagggtaact gagtgggctc 197281 agaaacactt cagagaaagc agtcaggaaa ggcgacattt ttcctgcaat ttgaaggatg 197341 taaaaaaaaa aagcctatca cacagaggac attccaggca gaagggacag caagtacaaa 197401 ggcttggggc agaaaagaac tggtcgcatt tgaggaacag aaagggcaca gatgtggcca 197461 ggcaatgata agcttgtgga gtggggtgtg gtatgaggca gagaagacca 
tgtaaggagc 197521 acttggaagg tcaaggtcat ggatgaagct attattctaa gtggaatgga aaccaaccaa 197581 agagcttaag cactgaaatg tcactatttc attacctttt aaaaagattg cactggccat 197641 agggacagcc aaccactcac tgagatggga aagtctgaga atgaaacagg tgacagggaa 197701 gtgaggaatc aagaacccaa tgaagacttt gtatgactgg acaataattt cacaaccaag 197761 ttgagagtca tgtgtgtgtg gtccagggtt atcacaatgt gggaacagga aaagtaaata 197821 ataaaagttc atcaccagaa aggactaggc aaagggtggt aattgttttc aaatgattgg 197881 cagagatgtc tgctatgcaa catccaacag gaatctgatt ttttatcaat tagtagtggt 197941 acttaatgag tggtttgaag gaggggagat gtagcttcct aagatgatgg ggcttccaca 198001 ccaaagaggc ctcatccagg aaaatttctg catgcagcag aaaagtaagc atgccaggtt 198061 gggggtggga gtagaaaagg tatctacaat tagtagaggt gtcaattaag aggattcaat 198121 aatctctcac atatttgtaa gttgaaaaaa ttttgctttt atgaggagac aaatgacaac 198181 cagagtaata taaaaatggt tttataaaag taagttctgg gccatgccac aggtcagtat 198241 aagttctgct gagtttccta agatgtctac actgttagca ccgatggatg ataactgatt 198301 catatcagca taggcctgca acggcgccca gttctgggca tcttcagaat taccagagag 198361 tatcctgtat tcaataccac tcaattatta agcaaactta atgacttcaa agtctttgaa 198421 aattgaatgt tcatagagaa caaacgtgga aactttcata gtttccattc tatgtgaaaa 198481 caggttctct gtcagggtaa gcagcttggc acttctggtg agttccttag gaaggggtta 198541 actggagcaa gggcagagtc agcatctgat tacaaggctg tggtcaaaag cacagatggt 198601 gcagtgacca caccaaacag gaagtgacat cagcagcaga ggagctcaga ctagaccagt 198661 acaaggaagg agaatgagca gcttaggacc tcaaaaacta ggtctcaact actagattca 198721 tgaagtcatt caacacattt ctactgagca cctactgtgc ttccagcact ctgctaggtg 198781 tccaggtgac agcagggaca agacacacat agtactactc ccagcaagtt ttcaacctgc 198841 cagcgagaga gaaatgaatg atgtcagtcc agggattcct gatggtgact gtataatgga 198901 cactgatttg cttttctctg ccccacttga gggtaaagac aggttggggc accaggagag 198961 aagagcttga gcctcctaag gtccccagta agtcacaagg ccaagctttt aatccagatt 199021 ttttttagtg tctttcagaa agacataaag atattgcatt gaacttaatg gatacttaaa 199081 tttttagttt ttgtctacac tagtcaaatt taaaagctat ttggccatgc aatttgacaa 199141 tacagatagt 
cacattaagt cactgaaggt ccaacatttg gttgttggtt tattactgtg 199201 gcaaagtgga tcgcagaggg atgattaggt tcgactatcc tcattgtcta aatgactcag 199261 aagagcactg caggatcagg gcacctggat aaagctctca gttctgtcag cagcgacaca 199321 tgtgctgcct tgaattctac tgttctgagc ttagatttct tcaatcagga aacagaaaac 199381 ctgcataagg cagcaattgg atttcagact gaaattgacc tagactgtgt tcaaatattt 199441 tgcaatctgt ataagacagt aacaaggagg gaaatatcat tatgactatt tgcttattat 199501 ataaacttca agagcttctc tagaaatgct gccagaaaat cattgattca ttcatgtcat 199561 cctcatcctt catactctat cacctcttta tcccatccaa actaagtctg acaggttcta 199621 tgttctaagc atctctccaa tatgccaact gctctcaacc tcctacccag ctgccaccct 199681 agcatcagtc atcttctcga atctctccca ggcccaaata ccaggatgca ctctagctcc 199741 tcctcatcct cactccatac tgggatcatt tcaaaatgta aatctgattc agtcaccaaa 199801 aggcatcaag gacaacaccc agctggaagg acggagtcac cataatctga acgtgggaaa 199861 actgtaggag gaaatttttt gaggagaaca ttagctgcta acacacatta ttttctgtgt 199921 gtgatgataa agcaagaaga cagttgtatt atactttgaa aatgtctgtt tatgtaaatg 199981 gaggatgttc ttgtctagtt aattgcattt tgctctctgt gaccattatg accatttaaa 200041 aagatgagtg agatgtccat agcaactcct aacattttca ctatctaact gtctctttgg 200101 aaaattgacc aaatacctgg aacacaacag gcattcaaca aatgtttatt gaatgaatca 200161 aatgtgagcc actgacctct tatatgcagt ctcaccttgg acaaacgaaa aagcactgaa 200221 ggaagagcat tacatactta tccccgaatt atatttcaga ggaggcagga ggtattaagg 200281 acacatggag ggtcattcat gtcaaggaag acagacccaa aaaaaaccca agatcacagg 200341 aaaatgcaca gtgggacaaa accctgttga gcaaagtaag tggaatattt gtgctaaacg 200401 gcaaaacaag ctctggaatt atcaaaaccc aaatgaggga agtaaacaaa agtgcagcgt 200461 tagaatagat attataccac caagggttat actaatatga aaaacaatgg gtcgcactta 200521 ataaagaaca ttctctggac acgtgactga ttcagagctg ctgctttctc atcctaaagt 200581 atgcatagga aacttcagag agcccgggtg acttattcca acctggaatc ccaacaaatt 200641 ccctctggaa tcttcctttt tatagtctta aaccagaata tacatacatg ataatctgta 200701 ctagaataaa gggctatttc atcagcagtc ttcagagccc caagagggga cattagcatc 200761 ggcttgactc ttcagtttat gattttagaa 
gaatcagaca gcctaattaa atgacaattt 200821 taaaatattt ttgtaatata tttacataat tgaggcaata accattcatt gaatgaatag 200881 atttttttaa aattaaggag cccgttctca cctttcctcc cccacccgca ctcttgcaaa 200941 tggcaaagac ctcagatcat gaatcaaagg gtaccagcac acagttcaaa aaggtggtgg 201001 ggtggagatg ttgaatgggc acaatgagca cagactctac ttccttcagt gtaaagcaac 201061 ctctgttgaa aagtggttac cctttaaact actttaaaag gcaaatcact gagccaccgg 201121 tgtctccaag acagatgaac gagcattagt aacttacaag tatctcattt ggtgtttcaa 201181 gacaagtaac tatagaaaaa ccagagaaag agctgttaca ggttcctgga acgtttgcct 201241 gggaaggtat cttaacaatt tcctaattca ggtcttcatt tttccagtgg aggaaagcga 201301 agtgtacgag gtgtgcaatt tgcccaatgg ccctgcaatt aatccaggag ccccgaggtg 201361 acaccaggaa ccaactggag cagctccggg atgcctgagt ccaccacccc tgtccacctc 201421 tactcctttc ttcaaaaaaa caattttttt tttttttatg ttggggactg gattttggaa 201481 gacacgaatt ccagacagtt aactgtcttt gcctgattat aaatattaat gattcttgtc 201541 aactctctgc ctgcttctct ccaattttta aagttggcag tcagaacgtc cagtctcaac 201601 agtgcaaatc aattcaagga aaactcagta aattagcaga tgacagaaca actgcagtgt 201661 ctcatctcaa aacgacacat ctcacacgtg gaaaatcaat gtctgaagac tggaataagg 201721 taaaccttct ccctgccccc ttcctggagg gggcatttga aatcttcaga agtagtagtc 201781 attcctcaga aacaactttc cttttttcct tctttccttt cttttctctc tctcaaagaa 201841 tatactcatc tagtatctgg atttggtcgg tcgttaacct taaaatcttg actccttctt 201901 caaagaaggc aaatcccatg agctttctct ctgttgtgtt tcctgcagta cctgcagcac 201961 ctagcgcagt ccctggcaca cagtaggtcc ccagtaaatc tttgtatcaa cagggagaac 202021 agaagccacc ataagggagg gggaaggtac gacagaagag cccctggtat tctcttgtat 202081 tagcgtgcaa cattaatgat aatgtgcaac attctgtccc gccacaaagt aggagggaaa 202141 tgcactccct ttctaaggag tgaaacggct gcaccccaac gctcccggca gcatttcttc 202201 agcaaatcca ctcgccgcac actgagatca tgaagcggcc ctgcctcctg tccagtctta 202261 ttctcgtcct ttcaacatat ctgactgaag gctgaatgag cgacctaggc ctaaccccag 202321 gaaccccatt tgtcagcctt tgtcagcgag gtggtggcgg agtccagccc cgtcccagat 202381 gcagtgacca cagcgttggg gcaaagaaac actgaattca gggttggggc 
tcccctgtac 202441 cttctaaaag atttcgggaa gcgccactga gttagtgagc aggtggcact tcggggtgac 202501 gtacccggcc acccgccacc ccctcagccg tggacctgtg gggagccccg actaccatcg 202561 cccctcgcag tccccacccg gagcggccca gtggcgcccc ctgtgtcctc tgcacgcgtc 202621 aggacactca ccgcgtagtg cagctggggc ttctcgccgc ggtacaccgg gtgccccatg 202681 tcgcctccgg gaggagtggc gcgccttggg ctcagcctgg ggtcacggtg tctctctgag 202741 ttccccgcac ttcactgcca tcctggggca gcccctgggc gcgagccctg ctccgtgcgt 202801 gccgagtgct gtgcaaagcg cccctggcac agcgtcccgc tgccgctgcg gagccggtgg 202861 ggacctggag aaggcagcgg cgcgaagggg gagggggaag aggaggagga ggaggaggag 202921 gaggaggagg aggaggagga ggaggaggga gcggccgagt ctgatgcgat gggcgggcgt 202981 ttaattcctt ccgctcctgg cgggtccctc cggcggccgc cgccgctgac agctgttggg 203041 ggctgcggag gccgaacgcg cccacgacag gtgtcaggct ggccggcacg cctgcctcct 203101 gccaccccac cctccccgga cccgtgggac cccggctgct gaggcgggga ctgccaaagc 203161 ctttaatgcc agcgagaagg cgaacttggg tttccaagaa agagagactt tcactaactt 203221 tgaaggaggt gtgggcctga cagtggaagg agattcagag ccttctcctg ggagggcaga 203281 gtttcccttt ctagaaagcc ttcttctgtg agagtctcaa attgtgggga cccttgggga 203341 gggcaggact gagaggaacc aagtcatgct acccaggaga actgctgcgg ggacaaggct 203401 gcccaggatg ctggtcactt agggtgaccc gagttccagg ggggcttgca gggcacattc 203461 tgtaggttca caataaccct taaaactggc ctctgctctg aaatcccctc atggaaacct 203521 gcactttaaa acaccttgga gaagatcagc gagttccctc cacagccttg tccttctgta 203581 atttacgtcc agcctcccac aagttgagcc cagaggaagg ccaggcctgg ggccagagta 203641 gagactccag actcatgaat caatcaatgc aagccccaga ggggtggcta caaagtggaa 203701 gatgtgcttg ggggtagctt gtgccagttt gtcagggtcc cactgtactg aaccggctga 203761 atgactctcc acttgctctg tgtaagagga gaggggagga gaggggagga aagttggatg 203821 tggccccgga atttcagccc ttacaattta gttgaatgaa ccatccactg gcatgtaata 203881 ttttaaaaca atcacagcat aattataatt agcagtggga atgtgaagcc aggggtgcaa 203941 gtggcctgag gaatctacca ttagatagtc ccatgacagt ttatccacct ggaaaactgg 204001 atggttatat ctgctaaagt caatcatagg cctaccttcc aatccagcaa attccacccc 204061 tagatataca 
gagtacaata atctcatcat ataccagaat gctcatactg tgttgaggca 204121 taattgcccc aaataaaagc gatccaaatg tacaccaact gatgcatgac aaaccaactg 204181 tagtctgtgc acatgatgga atactattca gcaataaaaa ggaaaatgct actgatgcac 204241 agaacaaatg gattaatctc acaaacatta tgttgggaga aagaaatcag acacaagtgt 204301 acatactgta agatttcact tatttattct gcaactgaaa ttcaaatgtt taagtgcaat 204361 aggcaaatgg aatccatggt aataggtgtc ataacagtgg ttaccaaagt gggaaggtgg 204421 tattgactag gaaggtggta caagggaacc ttctagggtg ctggaaatga tctatatctt 204481 gatctgtatg gcagctacta gagtacaccc atatgtaaac attcactgtg cagtacagtt 204541 cagactagtg taatttacac ccttcactac atgtatttta tactgcaaaa ataaaaaata 204601 aaaataaaaa taaatttaaa atcaatccct tgccaaaccc aggaacttcc aaacagtgtt 204661 gcctgtctta tgagaaagtc aattgaacta tttgcccaag tctttcattc tcccacagcc 204721 tgatctaccc aagtctcctt tcttctgctt tgtaggagaa atgagcatgc ccattgacct 204781 tggaggcctt ccttgctctg tgacatgtcc aggcagagct cgaaaaatga cacactaaag 204841 ttccctcttt actctcactc tgttctgctg ctaaaagcaa cctttagagc aaagtcatca 204901 aagatcctca gaataagaca gagaaccagc ttgagcttca ccatctggca tctttcagag 204961 aattgcagcc agggcagcct gatgggggca cgtgtgtaca gatgtaattc aatggcctag 205021 acaagcccca gtttagggac tttcacttca ctgtggtcta ttcataaagg tgaacactcc 205081 agccttagga ctttttcttc catcacgctt gcctttaagc ccctgaagct aaaaatccct 205141 ttccctacaa atactattta attatgaata tatttaaaca acacaaggtg gatccctggg 205201 tcttaaaacc ttaaccatag ccttttgatc attacccctg agtaagtcac ctcttaaaac 205261 tgggtcatcc tggatttggt ttgtattctt tctttcaaaa atgtagtaag tcactaaatg 205321 acattcttgt aagccaaaaa tactccatca gccccacact gtgtacttac attttaggaa 205381 tgacttctgc cttctagcat tatttattgc taaagtgtca tagctaagtt ggagtaaaat 205441 gttcaagtgc aataggattg ttttttctct gtgtttgaag agtttgtgag attgggcaaa 205501 gcacctaaag aaatgatcag ttcatgctgc aagactctta aatcaccttg gaatgaatca 205561 tccacctcct tcaattctta gtccaattgt gtaaggctca aagctccaga aaggatctag 205621 aagatcatgc agaactctaa gaagtctgtt caatcttaaa caatttgaat ttcacacatt 205681 attcattatt tcatttaaat atatttcagg 
gtatatttaa aacaccactc taatcaaaat 205741 ttcttatttg catatctcta cttccaagta tccttcccat gttatttgtt cacttggtca 205801 ttattcaaca aatattattt taaattttaa attttaattt gtataaatgt atggggtaac 205861 agtgtaattt cattacatgg atagattgtg tggtgatgaa gtcagggctt ttagggtatc 205921 catcaccaga agaatgtaca ttgtgcccat taagtaattc accatcatcc accccactgc 205981 ccttctgagt gtccattgtc tatcattcca cactttataa atatttttaa gtgcatacta 206041 tgcacaggta ttgttctagg tcaactaaac tgtatattga catggcatgt ggttttaaaa 206101 tgcacatttg atcctttggg gtaatggaga gagataaaaa ggcagcaaga agcacataaa 206161 acctcacttc agaggaggct gaagttcatc caatttgttg gttaagagcc tgctaggcta 206221 atgttctttt ttgtttgttt ttctttttta aaaaataata aatttatctg atgtttatca 206281 ttcaccccag agaggactta aaaataatct gtatgagtta ctctactagc atcaccttta 206341 agagaaaacg tgccctgcta actgtggaat ttccatgtct tcctgtccct aaacattaca 206401 gcatttccag aactgaatgc tgccctttct gccagagcat ttaaattaaa atacttgttt 206461 gcatttattc tgcctgccta taggctgagt taaaattttc cacagttaat gccaatgcaa 206521 gggtgattac ttttttcaga aaaactttac ccaaatctgt ggaagcaaat tcaacttcag 206581 agtgtaggaa gagctgaggg cccatctgag tactaaaaat cacttcagat ggcaggacag 206641 gcccttcttg aaggcccagc cccaccattg ttaacagagt tactatgaga ccctctaaga 206701 tggggaatcc cctcccccaa acccagtgcc cacactatgc catcctattg tcctgttaat 206761 ctagcccctg gagtattttt aacattttga aatggaaaaa tcaatagcat ataatgtgca 206821 gcttaatgaa ttgttacaaa acattcacct gtctaacctt cactgaggtc aaaaattaaa 206881 atattgctag taccctagac ccccatgtcc cttggctgta tttccctcct ccatcccagt 206941 ggtagacact gtcatgactt cacttcttca cttccctttt acagcagggt ttctcaaccc 207001 cagcactctt gacatttggg aaccaatcat tctttatggt agaggctgga cattgcagaa 207061 tgttttgcat catccccagc ctctacccac tacatgccag tattattccc cctacgccaa 207121 ttgtggcaac caaagatgac tctagatatt gcagatgtcc cccagggagt ggggaaaggc 207181 ataatcaccc ctggttgaaa tccactgcct tttaggaagt caggcatcca taaatattag 207241 agcttagtgt tgcctgtttc ctaactttat gtaaacagat tcataagtat gtttttatta 207301 tagctgactt cttttgctca acattccgtt tctgaggttc atccatgttt 
gctgtatgta 207361 gctgttgttt gttcattcca attgctgtct agtactccat tatatgtgtt tattactatt 207421 tatctttgta actgttgatg gacatttatg tccttttcaa tcttggacca ctgaaaacca 207481 cactgcttga acatcattat acatatctcc ttgtgcatat gtgcatgcat ttttataggg 207541 tccaaaccca ggaatataat tgctgggtca taggcatgca tatcttcaac tttactagac 207601 aatgcctaac agttttccaa actgtattaa ttagaatctt tatttacact attgtataaa 207661 cagactgaac acagtttata tgaagactga acacagttta gagttgtctc ttttttaagt 207721 ctttttcatg gggctacaaa gatgaaatta aggaaatcaa actccaggat cagaaaggag 207781 agctggagta gcagcttaag gccctccttg gctgaaatca tgtgtctcct gtaagtgaag 207841 gacctaccag ccctgctaag gccctggcat tggcagtgat aaggccgact tcacccacac 207901 gcacagacag gagcaatgga tacactgaat gtgcttttga gggaggacat ctcattaggc 207961 agaatttcag caaagtgggg gtaaaatctc agcctgataa ctgtaaacat atatgccatt 208021 tcaggaatta tcattttact ctatagccat aatttcttcc caccctctta acccttccag 208081 ggatatggag atagaagaaa tctaaggaag aatgaaggtt ctcattcctt tttccaggta 208141 agtttaagga gacaaccccc agcctgaaga cagctgggct gaagaccagt gaaggaggaa 208201 ttcctgcagc tggagctgca gtcacatagg agagatggag aagagagagg agaacatcct 208261 ggcacagcat agtaccattg tctttggcct caaaccctag tacaaagaat atcataaagt 208321 tatgcatcaa ttggagtcga gaatatgata atccgagaat tcagggaaag aaaacaagtg 208381 aggaagggtg aggaaataaa tgaccattac agtaggttct atttggtatt attggtttat 208441 cttgtttttt gccgttttga tattttcact gaagaatttc tactccatgc cacagagcca 208501 gggagggtat ccttaaaaag tgcctaaaaa gagacatttc taatatatat gttctattca 208561 agggtcatta ggcaaaaagg agaggccgca atttctccag ctacgacttt ccaaataacc 208621 atttgggaat taagaaatga tttttcattt atatccataa tctaatgtta gtctattgcc 208681 ttcattcctc tgatcataat ctcctttctt ctgcacagtt gcacagtgtt tcagttgtct 208741 aaagcctgca gaattcccat gacaacgcca acaccgcagg tgctagaatt attgtttgaa 208801 atggtaacaa ttccactatt cattaagcac ctacatcaca ggctcaacta gatgctttgc 208861 aaacatctgg atcctcagaa caactgggaa ggaaagtatg actatcttaa ttttatttat 208921 gaggttaatt aacctgtttt gcatacctaa taaatggtgg agttgagatg taaatggtag 208981 ttgaccccaa 
tctctttgca ggtcattcca gtatgtttca aggtggagat tctttttgac 209041 aaatcaggcc cattcattag attgctggtg tcccatccac acctcctagg ccagtgattc 209101 tcaacagggg aaattttgcc actgaggaga aacttggcaa tatctgaagc tatctttggc 209161 tgccacaact gcaggggaag ggtgtcactt gcatccagtg ggtggagacc agagatgctg 209221 ctaaacatcc tgcactgcac aggacggcct tcacaacaaa gaaagttctg gtcccaaatg 209281 tctagagtgc caaggcggag aaatctttca cagacataag gaattcatcc tttaataatt 209341 tccacttagc agtacctctc acaccaacaa cacattttct ttcagtgaag taggtcttgc 209401 accacccaag aaaacaaaga gaaagttctc cagagcagaa gagtagagct gagccctggg 209461 tctccaccca tcgttgaaga cccatacaca agtgggtctc cacgcaacac tgtgtgttgt 209521 taaaaggcta acagactggg taaaaattgg ccatctgtgt gatcaaaact gaccactctg 209581 gagttctttg gagtgcttga ctagtcaccc ccctaaaaaa accactttta taacctcaag 209641 aaacccctgg cttcagaggc tcaggagtag gatgaatgat gacacctggc attactctgg 209701 tgcctgcagg tttcaaagca attttgcata tattggtttc cctcccctcc ccccgccaac 209761 taatctatct tgtgataatg atagggagca gaacatataa ggcaaatgtt aagaattcgt 209821 tgtcaggagt gtaataaaat gaggagaata aaacagaatg tccttctcct gccaagtcac 209881 ttaaactgtc tgtatcccat taatccagca aatagagagg atgagccagg gagttccaga 209941 tcccccagga atgcttggta cataaaggtc aagaaccaac agtttagcag ggccttcttt 210001 gccttgctgt gcagtaagaa ccaaccgaag aaaactgctc caagcctaat ttcttgatta 210061 ttttctttgg ggaccatgcc ttatgaaggt ggttctcagc caggggcaga tttgcccctt 210121 agggtgcatt tgacaatatc tggaggcata tctggttgct gtaacttgga gggaggggga 210181 gatgctactg gtatctacta ggtagaggcc agatatgctg ctcaacttcc tacaatgcac 210241 aggacagcct cccacacaca aggcaaagaa tgatctggtc ccaaatgtca atagtaccaa 210301 ggttaagaaa ccctacttta tggttaaaat tgttatggtc aaaataattc taataaatca 210361 gaatctaagc atcagactgg gacttttttc attatttcaa ctttagttgt cttccttcat 210421 gtccttgcac acagcacaga tgaaaagata ttacctggaa agataataag atttcttctc 210481 cctttcagtg gaatacagaa caagaggaag agggaccaga aggaaatggc tagtcagtgt 210541 ttttgaaggt atggtcttca ggaccatggg catcagcatc acttgggtgg gtttttaaaa 210601 atagattttg aggacctccc cagagcctac 
ttgcacaatt ccttgcagca ccacaccatt 210661 gacattttta gtgagattct ttggttgtta tcccgtgcaa cacagtttga aagccactga 210721 tgtagtggta gtatgatgca gaaactgaaa ctcataaagg ccgggtgatt tgtccaacgt 210781 catgcagata gcatgagaca aagtcagaag ccatcctgag ctctggattt aatgttaaaa 210841 gaaggtggaa aatagtccta agtgtatgac taacaagcaa ggattagatc cattcatagc 210901 tacagaagaa gaaatgacca taacatttgg cccagttcag cacaacccag agcctgctaa 210961 gtaacttgct acattttcct gtggttttgt caagttttct ccccattgtc ttcatgacta 211021 agaataaaaa gaatgctaca cgtgatttct tttaaagctc ttaagataat tgatttccat 211081 ccaattttat ttttcagaag cacaaaattt tttctcacat taacatgtca agaaatgaca 211141 aatgtataca taaactacac aaatcattag tcctcttgat tttccaaagg gaaaacccat 211201 gcagagtcta tcgaaaagcc ttcttaccca gaaaataatt tgttttctct tatcttagca 211261 aaataaactt agccacttgt tccctgttgt tgatgggaaa gtctgagatt tggaacaaaa 211321 gaaagctgtt ttgtacatta taacagataa tatgacaacc tcaaagctat cagtttgcat 211381 ttctgtaagt cccatttttg cagtggaatg attgaaagaa cagaagatat tcagtctttg 211441 aaagttgaag gaggacataa agatgatgaa taatttgagg ggatgtcata cataagaaag 211501 tcttgtccat atttcttcag aaagctggat aagtaccaag tgtggaagtt aaggagactc 211561 agattttgtc tgaatataag aaagaacttt ctgtatctga cttagcaatt ctactcctag 211621 gtatatagcc aagagaaatg aaaacatatg tccacaaaaa cttgtataca tatgtttatg 211681 gtagaattat tcataatggc caaaaggtac aaacaaacca aatttccatc aactgatgaa 211741 tgaataaaca aaatatgatc tatccataca atgaaagatt acttggccat taaaaggaat 211801 gaaatactga taaaacgtag atgaacctcg aaaattgtaa gctaattgaa agaagtcagt 211861 cacaaaagac cacatattat attatttcat ttatacaaaa tgtcaagaat aagcaaatcc 211921 acagagtaga tgtcgggttg cctaggactg cagagcttgg gggttgagga gttgaggggg 211981 ggatagctaa agggtacagt gtttctttac agggtgatga aaatgttctg aaatttatgt 212041 taatagtggt tacccaactt tgaatatgtt aaagaccaag gaattgttca ctttaaatga 212101 atgacttgta tggcatgtga attttacctc cataaagtag aaggaaggaa ataaggaagg 212161 aagaaaggga gggagggaga aagggaaaga ggaaggaagg actgacttta taacaaatct 212221 agatggaagg caacagtccc aggaggtggt agtatcttct tccctggcca 
tatgtaagct 212281 gaacctagat taccatcagc cagagaggct gcactatgtg ggcagtctca ctatacgccc 212341 ttcaaactct aaaatgtaac agtcactgtt tcaatatttt atcctttttc tgtttcccta 212401 actgcaacat gagtagtgca aagctggctt ctatcaattc attcagtcca ttctttcatt 212461 caatatttca ttgctgactt aaggtaaagc cttacgatat ttctctgtct taataatata 212521 tgtgcttgtt gctttaggta gagttcctca gaagcagata ctaagataaa gactgatgtg 212581 caattgattt attaagaacc tgttcccagg agaaactggt aaggggtgag cataacaggg 212641 tagggaaggg aataaagcca agtaagggtg tgatttctag caaagtccca gtctcagtct 212701 ggtcctgagg gaagctctgg ggtattaatt acatctaaac tttgtcccgc cttcgccaaa 212761 gaatttggac tttaatattt tcacacacat cagtcagtca tgggctgtgg gacgccccgc 212821 ccccagggtg atgtaaactc ccaggcactt cctgctcttt gtgaatatag ggcaactgct 212881 ccagaacccc acagagtgct cacagatgtt ggtcattaga agcaaaacta cacagaagct 212941 ggtgaatggg acaggcacac agaactggta aaagatctga gatattccgg gcagagcaca 213001 gacagagtcc actgtgcttt ttggttgaat acatgttaag ctctaaaggc tatgtatgac 213061 cttgttccta cgtccagagg tcttgaacag ggtagaagct tgcaacctcc agatttccac 213121 ctgggtgagg tatgagtcgg tcttttcgta ctcctcctta ctgataaatg ctttgactga 213181 catctttgcc tacattcgat ctgtttcatt tcatacaccg aaaatcttct gaaatgggta 213241 atctgcccac aagcaccaca gggaattcct caacagcttt agcatccgta atattactcc 213301 aaaatagcat tgctgttctg cctcttactt taaggttggg cagatgccaa aagctgcaaa 213361 cccatggaac catccatgac cgggtacact taccagattt gtttccttct cttaaattaa 213421 cacaaatagg cctacgatgc taattccgga gccacgttca aatttcactg tttcatcaat 213481 catttcgtga gctcattagt tgattaacac ttttttggtg gtgattaaat tagtgtacat 213541 gcacactcat gtgatcgtgt aatcacttaa aacacagcca ggcacataga agctctgtat 213601 caatgtttgc tattattaca caagtacaga gcttctctca ggttgtatga tgggatcttt 213661 gtcctcaaaa tcattcttcc tgccaaaggg cttacaacag tttttctgac ctatttcaat 213721 taaaaagaaa ttgtttctat cttgccttta cttctactac ttcttactag cactttcttc 213781 tttcaagtcc tctgaggtta tcacataaga tggggttcct ggcaaaattc ctcaacacct 213841 agcagaaaga cagcaaatat tccatagact gaaaaatgag aaattctttt taaaaagggc 213901 agaaaataaa 
agaggtccca gaacctcttg acgcatattt tggtggtgaa gagagagcca 213961 gagggctttg ctccagagta tggttttgtt tgtatctctg aaagtgcacg agttgaaatt 214021 ccataaatag aggagcttct gtaccacttt ctattgctaa tattttcttt tcctctgacg 214081 tctgtcctca gcatttcagt tcatggtgag tgcttaataa atgcatgatg acaacagatg 214141 ttagaaagtg ttacgagagc acagagcatt ctagaaaaac ctaggaaaac accagtgggg 214201 gaaaacatca ttatttaata aatttatgaa aactttaaaa ggaaagcatg ttgcttttca 214261 accaaggact ttctaacaga acaaacctgt ggggagataa ttgtctcaga tctagcacag 214321 ggtgagtaaa attgttttta catttttaag tggttggaaa aaattaaagg aataacattt 214381 tatgaaatga aaattacatg aaatacaaat ttcaacatcc ataaataaag tcttcttgga 214441 acatagtcac acccattcgt ttacttactt gtctatgact gctttctcac tacaccagca 214501 gaggtgagca gttgccagag accaactgca tgttcactaa acctaaaata tctggccttt 214561 tgcagaaaat gttcacccac ccctgattta gaaaactaaa attgcaattt gtttcctttg 214621 aatctttttc tacttttggc tttcatcttt tccctcctgg agttttataa gcccaaaaga 214681 aaaaaaaatt atcttccata tattcatctc tattaacata gtgattattt tgtttcattt 214741 atagtctgga aacaggcaac ttaccaaaga gaaaccagac tgcttagatc aagaggactt 214801 ccaaattgga tgctactgct ttatgtatct gagcttttaa agacacattt gtattatttc 214861 tagaagcctt aggtaaatgt atttgggatt cattttgtct gtagaaagca cctgcagccc 214921 catttcttct agaacaatat aaattgtaac taaaaacaat gttcccttgt tgcatttgta 214981 tttactattc tcaaaccatt tgacaaagta tttccctctc actaattagt gtttcctaag 215041 tgttattgga agccactcgc ttgccttgca agtttaatcc tacatttgga cacatgcttg 215101 ttaattggtt tccctccaaa gttgaatggg gcttgcaccc atctgtcctg gattcaatta 215161 caccaaatgc cacatgctct gagatcagaa aagcaaagga ttctaaaaca tccatgtcta 215221 tatttggccc ttgacatcct attattcttc cttgtcttaa aaaccctagg ccactcacag 215281 gaagtgaact cgaattgtta tttttatggc ataaactttt caacaggctg taacttctat 215341 gtcacgtcat actatataga taaatatgtg tgtacataaa atttttaaaa atccaaccca 215401 ttgttttgca cttactacag tacacatagc atcctctcca cactttgaac attccaaaat 215461 gcagccatcc ttctactcaa tttcttgcca gaggagatgc ccactgaaat tctaggtcac 215521 acattgaatg tgattccttt gttttcaagg 
agaaatttcc ctccacctat aatgctctac 215581 ttcctctgtc acctcccttc catttctcct ttgcctgtgt aactcccatt agcctctctg 215641 gattttctgg agggacattc cccgattcac ctcaccaccc tccaatagga gctttgctag 215701 cttcctgttc gtacctctgt taggatgctc atcttgtctg tttcctcctg aaacacttgt 215761 ctcctcaaag acagggctgc catactcatt ttttaacccc agagcctggt cccatataca 215821 ataaccactt gagtaatatt tgttgaatgc tgttactaaa tacaatttca ggagatgaga 215881 acaatcagca tcagtggtta cagcaccagc aaatcaagaa acagtctcat caggaatcaa 215941 ttacaatggc aatgactgct gcctggagga acctggtgat ttatataaca taatcattca 216001 tcgggcccag atttgattgc catgcccaag tgctgaatgc tgcagatggc tcaccattct 216061 cattaggaca ctgtgatgta gtatttcaaa agcaaaactt ttcagaatca taattccgac 216121 ccttaggaaa gaacaaacca tatctccaga aaaggacagt ccacaggaaa tcaaagggat 216181 gctctccaat ctcagggctg ccatttttgg ttacagaata aatatatcaa aggccctcct 216241 acgaaggaat tcagtatgac caagaggtaa atgcgactcc cagagcaggg gtgcttccca 216301 agaacacatt agtctatgta gagtttagga ggcagaactc tctaatttgg tttctcacca 216361 aaacaagtgc aatctactca aaacccttta aaaaggctgt gctttgataa ttgggtcttg 216421 tctttgcata ctctcccctg atcaactctg ttatcagtga gatctatctc cttgatgttt 216481 ccacactgaa gggacatcac ggggcagaat aatttgcttt tcctaagaca agactctcca 216541 aacgttttgg ttcacaaagt gagaagccca actagcagcc aaccacagga gcctggcaag 216601 gacacacttg ttccaatgcc cctagtcccc caactcaact gtcctcctgg gactgccaag 216661 taaaaaagag aaggaacagg gaagttattt ccctaaggag ccccttggga agcagcggta 216721 acccaagcac aggggccatg ccactttgct ttctactgtc agattttaat tttcacaaaa 216781 tcgagagtca tgagattttc tgtgttttct cctaaactaa tatttggatt tctttttaaa 216841 aaatgatcct tttaggtgct acacgagacc tgggaggctg cattgagatc acctaaatct 216901 tctaagattc ttgtacacag ctccaattcc ttctcatcac ttgccaggtt tttttttttt 216961 cctttggagg atggtaactt aaaaaaaaat ctcatgaatt tctacacata tgtatttgaa 217021 atatatatac acacatatac gtaacatata cacatataca gtctaagaaa cataaaatga 217081 tatttttaaa tgaaagattc tggataatta tgaaacaata actagtacat cttatgtata 217141 taactattgg attgaatatc tagctttaca gttatttaaa tttcatttac 
agcagtgtct 217201 ttaaaaaatt gtcatcaatt gccctaattc ttggattatt acatcccttc tattcatttt 217261 tagcatttaa tatttttctt tcctttataa ctccccaccg gcatcaatga gaggggacaa 217321 atttaactga atatgtcagc caaaaaatat gggtgatttt taaaaaagac ctggctgttt 217381 cttctgaccc tcttcttaaa aaaaaaaaaa actgcctcaa ttcttccttt gggagtaata 217441 atatttgact ccttttgtaa aatctaaaat tgattttttt tagaagtata cacattttat 217501 tcagcttccc acataagact gggcagccag tcaaaattca ggagcacaac tgaggcagac 217561 aacactttgc atacaaacag ctcccccagt ctccagcact tctttacaat ttcacgaagt 217621 taagactaaa cagaaacggg ggaaaaaatc cttccaaatt gccttcttag ggtccataaa 217681 ttccacattc agcttctttg catcctttgc tctctccttt ggcaagaaag cccaggagca 217741 cgcagtctgc tcagtgggtt aatccactgc agtttagaca caaccttttt atttttgcca 217801 tggggtgtgt ggagctgaga ccacctagtc agtcacactg tcagggtaga cgtagtcaaa 217861 tgcaaaaatc tgaaaacaga aaatttattc catttgtgaa taagactaga atccaaaggg 217921 ccagggctaa ttatgtggtt gtttcatgag ggacatgtca taggcatcat tttcaataga 217981 aaactggctg agagcaatgc tgatacattt ctatcattaa agagatgtgc attctttttt 218041 ctgtttttga aactctttat tgaaatacaa tatataagca gaaaaattct gaactcataa 218101 gtgtacaact ctgcattttg ccaaagtgaa ccccctgtat aacgaccacc caaatcatga 218161 agagtcctct cctgatctct acccctaaaa ggaaccatat tctgacttca aacaccattg 218221 atctgtttgc ctgtttctga actttaaggc cacagaatca gtcttttttg tctggtttct 218281 cttgtcaata ttatatgttg tagcaggtag cgagactcct cttcatcgct aaaagtattc 218341 catcatatga ataaaacaca gttttccctt ttctattgtt gatagacatt ttggttgtta 218401 ccaactttgg attattaata aatagtgcca acgtgaacat tctggtccat cccttttggt 218461 gaatatacgt aggcatttct gttgggcgta tacctgagag tggaatggct ccatcacaga 218521 gggggacata tgttctactt tagtagacac tgcccaacag cttctcaaag tctttatacc 218581 aacccacagt gccaccagca atgtacaaga gttccagttg tcccatatta atgcaatgct 218641 tgatattgtc agctctctct ttatttcttg attatagcca ttctgatggg aaagcagtgc 218701 catcccactg tggtttaaat taacatttcc taccaggtgc ggtggctcac gcctgtaatc 218761 ccagtacttt gggaggccga agtgggtgga tcacgaggtc aggagatcga gaccatcctg 218821 gctaacatgg 
tgaaacctca tctctactaa aaatacaaaa aattagccgg tcatggtggc 218881 aggcacctgc agtctcagct actcgggagg ctgaggcagg agaatggcat gaacctggga 218941 ggcagaactt gcagtgagct gagattgtgc caccgcactc cagcctgggt gacagagcga 219001 gactccgtct caaacaaaac aaaacaaaaa agaaaaccaa aaaaaaaaaa atatttcctg 219061 gtgactaatg agttggaaca cctgttcatg tgcttattgg tctttggata tcctctttgt 219121 aaagtgtctg ttcatgtctt ctgcccattt ttttcattga gttatctgtt tagaacagag 219181 gttggcaaca tttccctata aagggccaga gagaccacat tttagacttc ggaggtcata 219241 tatacagtct gtgtcacaac tattctgttt gacagcacaa aagcagccac agaccatatg 219301 taaatgaagg ggtgtggttg tgtcccaata aaactttatt taccaaaaca gtcagcgggc 219361 tgaatttagt ccacaggcag cgaatgttga ccactggttt agaagatttg tgtggttttg 219421 acttttaaac attagcaaaa gttgggctat ataccagagc ttgctaattg tctacccaat 219481 atccattctc ctgtcattca ttggtaagat aatactgaag gaggtggtaa aaatacctgg 219541 ctgaaaaaac tatcgatatt ggagagcaag tgttaggttg agggcttcag taattttggg 219601 aagctgtctc agcaaggatg cccttttgag ctcccctctt ccttattctt tctgtctgat 219661 acatgggcat gatggctgga aatatagcat tcatcttgtg agcatgagac aaccttttga 219721 atggtgtggt cccacatgat tatcttacct tcattactag ccaaacatct tgtcttcaca 219781 tttccaggtg agagaccgat tctgattaca gggtcaaggg catattacta cctggagact 219841 tttgcatact tattttagca aaggtacaca ccagagaacc acacaatgca aatgtccact 219901 cgaggcttta acaagcactc ttgtctacat gtgacaggat ttaataaggg ggaacaatct 219961 tttttcccct atctttaaca ccaaaggaat gactgctttc aaaccacttc gggcttcttc 220021 aactctcttg tttcatagcc cagcctgttc taaaatcatt ctcttctgtc tgaagcattg 220081 caaagcactt ttccaagcag gaagcctgag cccagcaata ctgtgattgt aaaagcaaat 220141 atgaaagcca ttataaactc agcagtaact ccctacttag aactttagaa tttattttta 220201 ttgagaagaa cattctttta aaagcactgt tcttttcctt gttgtctgag ctcactttgc 220261 ctacttggac aaaaactgaa gctgttaaga gttaaggctg aagacttgaa tatgtggtcc 220321 atatgggcat gaaatagttg aattcccagg ctcttaaaat aggacaggaa ggcatcaaaa 220381 acctctgcaa ggtaaaagtt ttctgacatt gtaagaactc ttaaaactct tccccatttt 220441 ccaagaccac atgggggcca gataattttc 
actcttactg tgtcactttt caaggtggtg 220501 ctacttaata attacatcca gatgacagac aaaaactgga ttatcctagg caaacctaga 220561 tgtgtggaca tcctacacac attgcccctt ccctccaaga ctcaatgcaa agttgagatc 220621 aaatatgtca agagaagcct agggcagcag catgcagatg tagccaatgc tttttagaat 220681 atcagttcaa tgatatcaaa tctggaactt tgcaacatct tattagggca gtggtttcaa 220741 ccagggtgat tctgcctccc agaggacatt gggcaatgtg tggggacatt ttggttgtcg 220801 tgacgagggt caggggttgg gcactactgg catctagtgg gtagatgcca aggatgctgt 220861 taaacatcct acagtgcaca ggatggcctg tcacatgtgg cccaaatatc aacagtgctg 220921 aggttgagaa actctgaatt aaagccagca gaagctagag aaatagatag gatgggccat 220981 gcacattcca aagcacatca gcatgaacct tatgccctgt ggttggttca gaagaaagac 221041 ccagtctcac aggttgagta aagttaactt cccatcacac cacagtcaga gttccccagt 221101 tgcttcaagc agagggtaca gaagttagtg tcagactgtt gttattatat atgttcaaat 221161 tacatttctc aataaactaa cttattgaaa agtaatttat tacatctgag aggttaatta 221221 ttgttaatga tttgtttggg cttttgttct acaacctgaa ttaaaggatc ccttcaggta 221281 ttattccttg aaatcacctc aactgaccac tgtgatgaga atttccatgt ccagagtgag 221341 aaaggcagat catgacttct ggtatctttt tttaagaatg gagcctgaat aagttccatc 221401 ttatctgggc atgttccaag tgaaaaattc agtaattggg ctttagaaag aaaagaaatt 221461 gataaccagc catgatactc agtattcatt ttccaatatg gaccaacatg gatgtaagga 221521 actgtgtttt ttctccttaa ttgcttttca ctccttcttt tgatgccctc ctcagataaa 221581 ttctccatgt gactcacact gaaaagaaaa aaaaatgaga aattctcttg tagccagaag 221641 agttgttggc agaaagctga cattccgttc cttttctgtc ctaccaacgt gcacacattt 221701 aattccaaag aatgactgat gaaaaaaaat taaaaaaacc tctgggccct gactgaaaca 221761 tcaaaccacc tcagagggtg gcaaccttaa taataaatat ttaataggcc ttagttgcaa 221821 atatgtaaat actttacaga gacaatgaaa ccacataact tcgattgaca ttttgaggaa 221881 tatctatttt atttgtaggg aagtatatct cacttcagcc tatcttctaa atcataagaa 221941 cgacattgag tctcacatct tgtcagagat tttttttggc caggccatat tttactatta 222001 aaaaaagtgc ttattaatga ggagtctcta aatgctagga acagatttgc aatctaatgt 222061 tagttttccc agaagacact gaatttccat aaggaaatag tttagaatac 
cgtcattggt 222121 ctagtcaaca ttgaattaca ccatttcact tgtgcctatg gattacatgg ttgaaggaaa 222181 tgacactgta gcgtcagaca cgcactttct ccaagatgag ggaggaggta cttgagtctt 222241 caagggagta agttaagatg acaagcctga tgactgtcta catcagaata tcgtgaccta 222301 ttagttttcc acaaacccat ctgtagttag aaccttccat gtgtcatcat ttacaagtta 222361 cttcatttct atagcaccat aacaaactat gatttaaatg gttttattga agccaaaact 222421 gaaaagtcaa atcaattaaa atatgaactt ggaaaccttc ccacaaaaat aatccactag 222481 agtcctttat tatcttagat caggggtcaa caaactacag ctctcagacc aaatccagcc 222541 cactgcctat ttttgtaaat aaagttttat tggaacacag ccacaactca ttcatgtatt 222601 gtatgtttgc tttaaggtta caaaagcaga gttgagtagt tgtggcagag attgtctagt 222661 ctgcaaagct gaaaatattt actctcggct ttttacagga acagtttgct ggcccctgcc 222721 ttagattcct gggtctcttc caagtgaata cttttacaca ggggtacttg tatacataat 222781 taataatctt caagaaagaa tgttatctac agacttagca ataaaacatg caaacagtca 222841 caaagcaaga acacatggtt ctagcaagaa aggtgtttct aagggatgct ggctccacaa 222901 atttttcact taaggcattg caattttcat aaataattgt caattaatat aaagaatttg 222961 gttgacatgt atatttattg atgcaagggt gggtgtttta aaataacaga tataataata 223021 acccgttcta tagaagacag ctgtgttaaa acacttgcag tcaccattgg acagccttta 223081 tgttaaggaa gttgggaata accaaggcag acttctagaa caaaggagta caagaagact 223141 aaacaaatac ttctccacac ttatgctttc cacctcaatt ggaacagctt tcgcctcaaa 223201 gcatattcag ccactctctg tcatgcatcc actataattg taccacttgg aaatcttaac 223261 ctcaaatatt ccagtctttg agcacagcct ccaatccttc ctccttactt gattttcttc 223321 tggatcatca gtccctcttg acttttgtgc cttccttatc cagcctggat tccaaagtca 223381 gctgcagaaa cccttcttgc ccttccgttg tctcagacca taccaccata ccctccttcc 223441 tcaagttgga tggtaagtac actggggcaa aatgagtgca caaatatgca gactagacac 223501 agatccacaa ttcttatccc agatggggtt cagagcactg gttggccatc agtccattgc 223561 attccttcct cttccaggac ttgcagcttt tagcctgtac tcactccact tcttttctca 223621 agcccctgca tgaactggca ccacttaaag ttacctcttc tctcacagag aaccagcacg 223681 atttctggga actattcctc catcccttaa aatatgctgc tgtatagttg tgatataatt 223741 ggtctggggg 
tgggcctgga cattgggatt ctgaaaggcc ccatagatga tgcactcagg 223801 gctgggaacc actggcctag tctttaattc ctcccaaatg agtgcgttca tcttagctct 223861 ttctctgagc tcctaccaaa aagctccctg gacacccctg catgtacatt ccacaggcat 223921 ctcaaattta tcatgtgaaa taccaaacat tatccacact tctctcctac cataaagtat 223981 gtttcattga aatgtctcac cttcatccaa ttttctaagc caggggccag tgatgcctcc 224041 agggttttat atgcagcaat acagtttggt gcgggaggct gggagtaagt tttgttgact 224101 tggtttctat acttctctgg ggcagctggg aacaagatgt tgtgaaagct atgagaagcg 224161 tattcttcac tagccttcct ttgtcactgc cagctgagac accacctttg gcccctaacc 224221 caacaagcca ccaaggctag tgggccccat gtcagaattc tctcttgaat ctatacgttt 224281 tatttccttt gtgtcctcac caccaatggt ttattcattg gtctccctgg ctttggcctg 224341 ccatccccac cttcgcaagc cccaatccat atagcttact ctccactgcc aaaactgtct 224401 ttctcaagtg ccagttcaat tacatctgat tagcaacaga aatctgacat ccaggcagcc 224461 attgtgtgtt gtttgggagt ccctggaaga cctcacattg gtttgtgctc ccaggaatgc 224521 atcccagctt ttctctcccg cctgggcaca ttctcactag gcttctaggc tcgagctcta 224581 aagtcacacc tccactagtg gagcctcctc tacacccaca ggccaatgca cccctacctc 224641 tttgggctac gctgcccctg tacatgcctt ctcttggatt aaaagtcatc atttattttc 224701 tacatgtgcc ttctttccct taccaagttc ccctgaaagc atgagccaca acttcctgat 224761 ttttagatct gcatccttca gcacaggggc tggcacactg ctggagatgt gctaacaaag 224821 ccttagtaga tggatgcatc aaggtccagg gatttggaag gattcctgga tgatttgtag 224881 ctgactggca tcagtgatca taggcgcctt aggcaaaact gggagtaaac atacgtggac 224941 tgacatgtca gaaacatctt ggctaggaag aaaaatgtca ggctgtgaag ctccacaatg 225001 ctgctgggga acactctaag acacacaggt atgatatcct ttccctgtga caggtgagat 225061 attcaccctg acagaaatgg ctgatgagtg ggcacattcc acgaccatgg acactcacag 225121 ttgtcaccag aacagcccct agataccaac aacaaggcag tttgttcttg ctcctttaag 225181 ggagagcttc ccagaaaaat aagaagtaag ctgacaggtc gttgatattt gcatagagcg 225241 tgataagtga tttatgcttg aaatcccctt tgtagaacaa ataataagtg tgtgtgtgtg 225301 tgtgtgtgtg tgtgtgtgtg tgtgtgttct tttgctatct attgtatgtt tgatcccagg 225361 ttgatgaaag ggtgatagct gaaatcctat 
aaaaaaacag cttgttgggc ttgcagcccc 225421 gtcaacatat ggtgttcaaa ctaaatacaa caaacgtgtc atttcaacgt gctcagctac 225481 ctacgtgaaa ttctaaacaa gtaatacaat atgaggagaa gagaaatgtt tttctggaac 225541 tgagcacctg agcctgctgg gcttctgagt catgctagtg tgagatgagc ccttcactgc 225601 agggctctgg tgagcctgcc ctttgtaatg cacgttcttc caaggctgaa catctccgac 225661 atcctctaag aagtcacaga aaactcaccc cgaagaaatc cgttttacag catcagattc 225721 atacactggt tctgctagta cttcctgaat gccagccatt ccaaaagatg aaaatcatgg 225781 tccaaagcag ggtttctaat cttcagcact atgaaccggt gtgggcagat aactccatgt 225841 ggtgctagct gtcttgtgca ttgaagaaca tttagcagta ccctgggcct ctgtccacta 225901 aatgtcattt gcacacctcc ctccattgtt gtgacaatca aaatgtctcc aggcattatc 225961 aaatgtcccc tgggtggcaa aatcctcccc catttcccca ccgctgacct ccgttgcaaa 226021 ccactggtcc aaaggatggc atcttgctgg agaattacca agagggctgt taatatgagt 226081 gcaggtgaga gaccccagca ttcccattcg tggatctggc tgtgcagcat agcttgccca 226141 gggatgctta gcctccctgt aacctgcaca cattggaagt gttcataaag ccaatgtcac 226201 atggatcctc agggaagagt tatcacaata gctgcccctt acaagaattc attccaatac 226261 ccttaacagt tgcaggaata aaatgcagct gttagacgca ggggcaaaca ccaagatgtg 226321 gcttttcttt ctctctcctt tatttttcat aattacccaa agagaaactg ttatcaccat 226381 ggcctataat ggaactaaaa cgcaggagaa ccacttgtta ccatagcaac aaagccggag 226441 tatattttct gtaccccttg agttcacagt gcagctttct ggtgccatct gacttgattt 226501 gctaattgct ttcacagaac aatcagtcca ctcaagcatt acagttcaag aagtaaaacc 226561 ttagttacaa ctgatgataa atattcaaac gtggttcctc aagttgaaag agggtttata 226621 tgaaagcgaa tctagcaata caagtaacag ttctgaagaa caacattctg gagactcaca 226681 acccccactc atctatactt tcagctttgc ttttgctgct cactctgtct acagtacctt 226741 ctgcaccccg cacaggtgtc actcccctgc tccccagagt aagacaatgc cctttctcca 226801 tgatgcatag caccttggta acaatctgcc ccactgtaat accactcctt tacatgtttg 226861 aaattactgc tagactggaa actccttgaa ggtagagatg atgtcttccc tggcatttaa 226921 taagggttta ataaatgtct gttgaattga aatgactctt tatgtttgac acttaagatg 226981 aaaatcaaca gataacaaaa gaaactcatg caaaaattag gaaaacaaga 
aaaaccacaa 227041 ttacgtatca acccctgacc ccagacttct ggcaggagat ataatattag tatggatgca 227101 atctgaaatc ctaaatattc catcaatggg ttcagagtaa ccacattgtt cattcattca 227161 ttaaatcatt caaggaatat ctattagatg tctattatgg actaggtaca tgataaatga 227221 gaataataat ttttttaatt tatttttttg agactgagtc tcaccctgtc gcccagtatg 227281 gagtgcagtg gcacgatctc ggcttactgc aagctccgcc tcccaggttc aagtgattct 227341 cctgcctcag cctcctgagt agctgagatt acaggcatgc accatcatgc ccagctactt 227401 tttgtatttt tagtagagac ggggtttcac catgttggcc aggctggtct caaactcctg 227461 acctcaggtg atccgcccac ctcagcctcc caaagttcta ggattacagg catgagccac 227521 cgcaccaggc catgttttat atttctacaa tgttttacag tccatgtaac agtttcatcc 227581 acagaaccta atcagaggaa tatccggaaa cacattgagt ggttgcattt cctaagatca 227641 tacaaaagtg agaaatctgg ctcttctgac cgatgctatg cttactgttc tttttacaaa 227701 aacttccatt cagaatgata cctctgaaga attgcatgct tctcaacaag caaataattg 227761 aatttttcaa ccttcaaaac ttcaatattc tgctcacata ccatgacagc taatgaattt 227821 tactctacag agattttttc atagaagcag gcatttgcac acaagaatct ctctgtgtga 227881 ctggtcccct gggctttgtc caagggcaga aagctcactt agttattaag ataaatagca 227941 aagggaagcc aaacagaagc taaatgaatt ctacaatttt tctcagtctt gcagagttgt 228001 ggaaagtcaa gttttccaaa tctactcctc actaatatat gctgcaagac tgacttagag 228061 gcatccttac atgggcattt attaaaagca tttattaaag gcatttatta aaagccccta 228121 ttcaaaataa ggaaacacta gtgtagattt ggaaaaccca gaaataatta gctacacttt 228181 gtcaactcag aacttagaag tggtcaagct aaaatttgat ttgggattta ttacatttat 228241 tttctgcagt tttatgtgga attggctctc tctctctcta tatatataca taatatatat 228301 atatgctata tatatacata atatatatac ataatatata tatagtatat atagcatata 228361 tatactatat atatacataa tatatatata tagtatatat atagcatata tatatataat 228421 tcttaggaat gttcctaaat actcccttct ccagaatgtt aaagatcttc tggcttgtcc 228481 agcttcagca tgtacctgcc ctggcctcca acggggaccc ttgcaaaact tgtacttgag 228541 atttttgagc acttgaaaga ggtacaaatc tggttatagt gtttaaggat ggtgatggga 228601 gaggaaagaa gacattgtaa gcaatatttc tgggtattaa atatttagtt gtttccaaga 228661 agttcaagta 
aatgacttgt tttaaaatta cttctggttg taccaatagt attctcttga 228721 ctcttctaaa actaagccac acatgagtta catgatacat ccccagccct ctgaaggtat 228781 attgttgctc agcaaggttc gtaaaaaagc ctatcaatcc acttgctttt gttttgttta 228841 gtatttccaa aagaagagta agcaaagtag cctgcaaaca tacatgtcaa ctctcagtct 228901 tggtaatgat aacattttaa gatgcattaa taatgtatct tcctgagaaa tctcttgtgg 228961 ttttgcactt cagctgtttt ctcatcttct ttccttggct ttgtctcatg ctgtcccaga 229021 aaccaaaaaa agctgtttct tttttgagat ggagtctcgc tctgttgccc aggctggagt 229081 gcagtggcac gatctcggct cactgcaagc tccacctccc gggttcatgc cattctcttg 229141 ccttagcttc ccgagtagct gggactatag gctcctgcca ccacgcacgg ctaatttttt 229201 tttttttttt tttttttgta tttttagtag agacagggtt tcaccgtgtt agccgggatg 229261 gtctcgatct cctgacctca tgacacgccc gcctcggcct cccaaagtgc tgggattaca 229321 ggcgggagcc agggcaccca gccaaaagct gtttcttaaa ggatcatttc aaacattagt 229381 gggtttgcat ggaaccttcc acgatgaagc tccttttcta taaaatcaca ctctaaaata 229441 taagctctaa aatactttcc ttgaagcatt cagagtaaag aaaaagagaa aggacaccta 229501 attttaagcc ccttaaataa aactgagggt gtctttgtta aagaaaataa ttacctgaat 229561 aaaagagatt tcttttttct taaggaaggt ttccaacaat ctggcacttg aattcattgg 229621 agtattttgt atagtaagtt tccatatgga gttgtattgg caagaatcaa ggagtgtggt 229681 tttcaggtct tggaggtgat taggtttata tcatttcatt tactgattgg acagccagtc 229741 tagctctgcc tcatgtgagt gacaaccaga agtcctgctt gaaaatccgt aattcccact 229801 tccatatgaa acaaatggtt caacaaggga atttaagagg ttcttggtgt gcactgggca 229861 ttttagatag catttgggat gagacggatc tcatgtcata ggacatgtat ggtgtaaaat 229921 atttgttatt cagtatatac acctcaatcc agtttggttt cataattgga atcactgcag 229981 actatgaacc aatattgaac tacctgggaa taggagctgt ttggctttat aacaaaacat 230041 atttttttca agcacaactt ggggagatgc cagatcttta ttataaacat ggaaaccaac 230101 acttaattgc tcttgagttt tcctagactc tatttcctta ttatccaatg ggtagattcc 230161 aatatatcta cttgacattg gagagaaaac atattctcca aggcagtgga agaaattcct 230221 atatttacta aatggaaatt cctctatttg ctttccaggc ctcttttgaa aataaccaca 230281 ggaagaaatg ggatatactc ttactgaatc 
caaactctaa agtacttttg cagattttgc 230341 agagtgaaga caaaagaaga aatggtgacc ttctttttgc aataatggtt ctgatgacat 230401 aaacatagac tctttggtga catctttgtc attctacaaa gcattatgag aggcagaata 230461 gttactgata ctagttccta ccacttatga ggccaccatt acaactgcag tagctggatc 230521 ttgccacagt tgtgtaattg cagccaagga cttcattact tcagtagcat tgggacaaaa 230581 ctgaaatatt tacttttata gacataccaa taatattaaa ccttgcaaac gggaaaaaat 230641 aaaaaggagt ctgtaagtcc ctatggccaa taaatcaaga cttagccaat tcatgatatt 230701 ttgatgttca catggtaagt gaaattattt tcctcttttc catctacccc cactatccgt 230761 ggcttctctc caattgtagt tttcatctcc ttttcaaggt gagtgatttt aggaggcaca 230821 gtgttggatt gaaataagat gacatgagac tagattcttc tacctcaata acttggtacc 230881 tggtacctat ggcaggtcac ttaaaatctt ttgatctatt ttctcacttc tataatggag 230941 atgataatac ttgctctggg aaaatcagat ggcataatgt atgtgaaagt gttttaaaat 231001 ttctaacctg ttgttataat ctgcagatga tataacttcc tgcatagata actcaagaga 231061 atccacagat aaattattag aattaacaag agttcagtaa ggtggctgca ataatttcaa 231121 tatacaaaag caaagtgcat ttctataaac tttcagccag cagttgtaaa ctgtaatatt 231181 aaaaactgtg ccatttataa ttgcaaaata atatataata taccatacaa aatcaaaata 231241 tgcaatattt tatgtagaaa attacacatt ttaactatcg taagccacta aagaagcctg 231301 aagtaaaatg agagaaataa catgttcatg aatatgaaga caaaatatca ataatgatgt 231361 taattcttcc ccatttttta cagatttttt acaatttcaa tcaaaatcca aactaattat 231421 tttcctttga agttgctaag gaaagggtcc aagtgtccaa gattttcttc gaggaaaaca 231481 aaaaacaggt agagagacct gccctgccca atgtcattaa aagaatgtga caagggtgaa 231541 agaacaaaga aaactttgaa atgggacaga atttagctac agatccagat catattggtg 231601 aaattgtgat atatgacaga cctggcctca tagaataaag aacttcaatt acccaactta 231661 accagtggca aggggtggag gtggggcaag cccactgatg cctccttggg cccagcagaa 231721 gacaaccctt ctctgcatac cacagtacgt gagagaactc cttgaaaatg gaaacaacat 231781 gaaacctcca ttacaaaaga caatcctagc aattttctaa tgtgttcaac ataattaatt 231841 cagaaaagaa taattacaat gttcactaat cagtgaatct attctattcc aaatattaca 231901 aaatagaaaa cgtgtcttgc cttgaaaaaa tgaaatatat agtgagtatt 
caataggcaa 231961 tatacttcta cacactcata ggttttgccc ctaatattat ttttatcctt aaaagagggc 232021 tggtctactt agaggaacac aatgcttagg tctgaggctc aatcaaacat gcaggacata 232081 acagagattt tgtactatcc tgttaatact gctatcaata gagcgaagca tagcatttgg 232141 tagtcacagc tggaaccctg tgcatatccc ctagatccac catggaggtc cccaaaaact 232201 gtgataggtc tgaaatttta ctctactttc aagcgaatga gtcattctgc cagtttcatg 232261 gatgctgtta gaagacatga gacacctgag tccaagatca aggacaatat attattcaca 232321 acaatagtaa tagccagagc atcaccattt gctccagtcc ctccaagccc cagtccctac 232381 aaggtgactt atgggagtca ggtgacacct gctcacataa tggactgcat tacaggacat 232441 catctctgag cttagagaac cgaaatcttt tgtgatggat ggtaagtttg cctaaccttt 232501 gtccctaagg aagacattct cttccttagg gacagaaaac aaacctgccc ttctctctag 232561 agggagacat gacctctatc cctcttgaga ctgttcacta aacaaatatt tttgaaaaga 232621 tattccaaaa gtgtgagtgc ctctgttcac aagatgttag aaatgccagc gacacatgga 232681 gaactgcctt tcaatagagg tcagctgcag acagctcccc taaattcaaa ctgctccagg 232741 gcgatctccg gcaatggaag tctactaggt cacctgcact ggtcaggctg gaagtgcagg 232801 ggaggtaagg gcccagaaaa ggttacagat gatggcataa tacccctgca tcatcacccc 232861 tcactgggac atttctgaat tgtgttctac acaggccctc agctgcattt agccccactt 232921 tcccacagca gtaaccaact cattaacata gcttctatta gctttcctcc tttccttgtc 232981 ttccttactt tctcactgtg ttcctgcagt tacctcccaa ataaactacc tgaactcaat 233041 tcctggtccc agggtctagt tctggaaaat ccataccaag acagatagag caaagaaata 233101 aacccacatc agttagtgtc cataactgac atctttactt tgtgatgtgt agacacagcc 233161 taaaaaaatg gaagggagtc atactttaaa gatctttcct gtattaagat ggtccctgat 233221 tcagagttat atgatggtcc aaatcttcat tattctgcag tcttagctgt attaaagcaa 233281 gataatccta aatttgatgc agctgaacct cctgccatag attcaataga tttactgaag 233341 actttagtaa ggctgcgggc cagggaatga gctcatcctt caaaaatatg cctcggcctc 233401 ttgccactcc agtactttca aaagcagcag ggcaggaata aggaattggc ttggcataag 233461 ccagtggatc caaaatccag tgcatgtaaa cacacaaatt gcttgtgggt tactagagct 233521 gggcaacaat gtctgactcg tcacatccac cctagggttc ctccaagctc ctccatatgg 233581 tatactacag 
ccaaaaagag tatgtcttgg agtagcacca actaaactgt gtatattgtg 233641 acaagctaca tctctccaag atttattttc ttcatttatc aaaaggtagt gataatatct 233701 acatctcatg gttttgtttt aagagttata tgtgataata taaataaagc ttaggccaca 233761 ggttccaatt tcttttattt agcatgtatt tcctgagcgc tgatcatgtg ctaggcattg 233821 ttctaagagc ctggggtaca tcagcaaaca aaacaggcag agatttctgt cctcatggag 233881 cttacattct aataaggaga gatagtaata aaaagtaaat ataatatata agtaaattat 233941 ttaggatgtt aatggtgata aattctaagg agaagtaaaa actatagcag ggtagggaaa 234001 catcaggaaa gaggaggtgg gatggggtgg ggtgaggaag cttgcagtat tagataatgt 234061 ggtcagggta ggcttcaatc agagtttgag attagagaaa atattcaaag aagaggaggg 234121 agttggagta cttatatctg ggggaagagt gctccaggaa aagacataaa tatattttaa 234181 tgaacattaa cttctactag ccatctttat agtggctttt taattgaact gaacaatatt 234241 gacaataaca ataaaaaaca cggtaaacat ttatttagtg ctcatgatgt gccaagaact 234301 acttaggctt ctcatctatc aactacatcg attctcatag gaattcttcg agatatgact 234361 atctttcaga gaagcactat cttatttttt agataaaact gaagcaggca gaaattagta 234421 aattgcttaa ggtcacacag caagaaagta gggaaaggaa gatccaaaca acacattctg 234481 actcaagtat tgagtgccaa accactgtac catactgcct ttatttaatt aaaaggacag 234541 ttaaaggggg ctggaaaggc tacctggtgg gagttggggt tgtagagaaa tatctggact 234601 atcaaggggc atcagatata tttttggggt gatggaaatg tttagctatt gttttgtggt 234661 ggtcgtttca caacaattat caaaattcat ccatttgtac acttgaaatg gatgcagttt 234721 attaaactat tgctcaataa aatgaacttt gaaaaggggt tctggaaaga ttagcttgca 234781 tatactgaaa tcaccagttt cattcacccc aattacccta cagaacatct gtctttaatt 234841 actcaagttt ctaattcatg ggccaagtgt gagttctgta ttttgtctta actcccttac 234901 ttttccttga ggtgtgactt acatggtcta tttaatatgt gaccattcac agacttcagc 234961 cctaggttgt gcatttaata gtgagtagcc aatgaattag tgttaaaaaa aaagtgttat 235021 agttgatcct tgttcactgc ttttgatagg cagaattaaa agatggcctg caagattccc 235081 atcccctggt atatacacct ggtataaccc cttcccatgt gagtgtaggc aggacctggg 235141 aatatgatgg aacagtctgc cccttgatta ggttaccttc tgtggcaaag gtgatgggat 235201 gttgctccta tgattatatt acagtgtgta 
agactccata gtagcctaca ggagagagat 235261 gctcctgctg gcctttgaga agtaagttgc cagattgtga gagggcctgt gagatggtca 235321 catggcaagg aacttcaagg ggactctagg agcacagtgc aacccctggc tgacagctgg 235381 caagaaaagg gtatcttatt tctgcaatgg taagaagcta gattttgcca tcaacatgag 235441 cttggaagag gatcctgagc tgaagatgag acagcagccc tggctgatac gttcacttag 235501 ggtcatagga cccttgtatg accctgtatg atcctgagtg caggatccag ttaggtgttc 235561 ctggcctcct gacccacaga aactgtggga taatgtatgt gtgctatttt aagccactag 235621 gtggtactct cttatgcagc caaaacaatt gaatatactg ctccttaaaa agatccagtc 235681 attcatttaa catttattga gatcctgctg tgtgtaccac tccagaatag cttcctcagg 235741 ctggaatttt ccaccgtact tcccttggtt ggacctaatt atatttaagt gcctcctcct 235801 gcattgtttg tgcagtttac cacagtccta ggagctagac tacctgggtt caaatcccag 235861 ttctactgtt tactagctgt gtgactacag gtagattatt acttctgtgt gcctgtttcc 235921 tcatttgtaa agtgagaata ataatggtgt ctactccact caaatgagct tatttattag 235981 tcaggtaagg ttatgctggg gtaacaaatt aaccatggag cctcactggc ttaatgcaat 236041 ggaaacatat ttttcagtga tgctgtgggt tttcgggggc tctacttctt gtagttacta 236101 atggactcca gctgagcgag ggtcctccaa actgagatgc caccatctta acagattgct 236161 tagacagagc tggagggtca tttactttct tggccagaaa gtaatacatg ctacttccat 236221 tcacagtcca tgctgcaaaa ccagtcaagt gatctcacgt aactcaagct ggttgggaag 236281 agacaaacaa agaaggagcc caccctgtgg aaatcttttc atatacagag aagagccagt 236341 atacaggctg tggggaagga attactgaca tgttcacgaa agagtaagaa gctcagtgtg 236401 acagttgtgt ggtgactaag gacaagagtt atagaagaaa tcattggaga gatagggagg 236461 ggcaagatca cacagagtga gcaaaagcaa gagagtgaga gagacagaaa gtgctgtctg 236521 ggtcacctct tagactcgta gccacacaaa ggggggagct atctgtgttt gtcacttcta 236581 tcccccaagt gcctaacaca cttgctggac actcagtaga acctcagaca ctttgtttac 236641 tgaatgaata catgaatgag gtggtggggt tgacatactt gtctctctga atttcattcc 236701 ccgggcctct ctccaaggta cggactgctt cttgaccatc acaacgaagc tttggaaaca 236761 tatttcaggg atgatgaagc aaacattatc ttgtgtacaa gacagttcag cagaacatta 236821 tcaaggaaac cacattccta tcttgtaggc taaggatgga gctgaaagat 
gtgactggac 236881 tttagacttc ctgcaggcag caactcttcc ccactgcagc ctctgctggc tgagaaacag 236941 gggcctctct caagaagaag catatcagat ctgaccagag actggccaga aagaacaccc 237001 tgcagactgc ctgaagccca gggctaggag ggaggctgaa ggagggacaa tagcatctct 237061 ggaagtcaaa atattttctt ttacccagct gccaacttgc agagaaggaa attaaagcaa 237121 ggctgctttc ctaatttaga gcgaggatat tttccatatt tttgtagaga tattttattt 237181 atgagacaca aaaggaacca ggggatacag atagtaagag tgtctactgg ccttttaatg 237241 cacggtaggc tgcacatttc cttcttgtgc tttagaaata ggttcaataa aagaaagtcc 237301 aacttgtgaa atcacacaca tctaattctc tttgagattt aaggccacaa gttgtgacct 237361 gagctgagtc acagctgggt cgaagggaac tgcctcatct cttctccacc cacgacaatg 237421 ccctctccat caaaaaatat ctaactgtgt ttgtcccaga ttaacagcaa gccttgggat 237481 gatgtgggaa agcgaccttt gcattgtttg tacagtgtag taatggcgtc cttgtgtttc 237541 ctccatccca actgtctgaa aacactttcc aaatttattt cagctgtggt taaggaataa 237601 cagaatagaa acatagcagc gattttgtag tttaatgact caatcaagaa agaaaacatt 237661 accatcgtgc tgtgtttgac aattcactcc aaatcgatta atttgatttt tataatgaga 237721 tttgggataa cattctgatt ctccaaatct tagaaagcag ccatttcatc cctcaaatcc 237781 aatataaagg ctttacctgc atgggattca aatttataat gttgtgtggt aaggattcta 237841 tgatgcattt gcttccaact tttaaactca aatacctatg gtgacaccat caaagctaaa 237901 aagtgtgagc aaaaactgga cctctgccag cactttgcta gactgtcaca ggcgtttcca 237961 ctaactgcat ggctgactga aatggctttt aaaaggtagt ggcgagaccc ttgaaagtac 238021 ttgagtgtat tcttgtatct ctcctcttgc agtctgcccc atggctgaga aatgcagtct 238081 gtcccagcca gaaatctggg tatccatctt tttactttat gattcctttt aggtgtctcc 238141 cattagcatt ggtggatttc tctctttctg catccagtac agaactgcag gatccaaaga 238201 tcacctgagc agagtggaag gtggtgttgg aggagtaggg agacctcact atacagtgaa 238261 gttacgtaac gtgcaaatat atctaccttg aattcaacta tgggtgttct ggaaaaggtg 238321 tgggtggggc gtgtggagag ggaggatgga gggaaaaatg ggagaatgcc tatgaaaaaa 238381 cataattcaa ctattgtggt tttctgttgt tgttgttgtt ctttcatttg tttgtttgtt 238441 gatacagggt ctcactctgt cacccaggct ggagtgcagt ggcacgatca tggctcactg 238501 tagcctcaaa 
cttctgggct caagtgatcc tcctgcctca gccttccaag tagctgggac 238561 tacaggcact aaccaccata cctggctaac tttttgattt tttgtagaga tggggtctca 238621 ttatcttacc caggctggtg tcatgaactc ctggactcaa gaaatccctt tgccttggcc 238681 tctcaaagtg ctgggattac aggtgtgagc cactgaaccc agcctgtaat tcaaacattt 238741 gattacatgc tacattgcac tcatgagtgt gtccccctgg gagtgcacat aactttgaaa 238801 atgtgattcc attaaagtca tcacaactgc ctgttcagtc ggtgtctgtc tttgcgtgct 238861 tcccacagac agcacctcac taagctaaac tatgaaccgc atttagtgtt cgaagtcata 238921 cattctttaa atacaacatg gccttcaatt ataagtcaac tgcttcgtag ttggctgcca 238981 aagcaacagc catcttaaac catgaaatgc ccttcagaag aaagacatgt gctaggaaga 239041 tggaggaggg agcttgaaga aacctgggtc ccaggtaaat gtgaagtcac tattccagtc 239101 tcatctaaat cagaaaataa taacttccta tctttttcag ctatggttac tttgagtttt 239161 tggatagatg cggctaaacc tcatcctcat ctacacattg tttacatgca aaatgaattc 239221 caccttgaga tataatataa gaagcatgcc ctctttgtgg ctgaatataa accacagtgg 239281 aactttctcc aaaaaaggaa tatttacaaa gctattccct tacctccagg taagcagaaa 239341 ttatttaata ttttatttta aaagaaatca ctgcattcaa tacaatgtat cttgctgagc 239401 ttttcttgat cggcatttta agctcatatg attaattttt ctgactttct gaagcacata 239461 gacatgaaaa ctgaaactaa gtcacagata aacaaggagt acacacaagg atgtatcaaa 239521 aggcaaagaa aaaagtatcc taacactcaa acaaatattc ataaactgta attacagggg 239581 aaaaggcaca ggatgttcct gcctgtttat aaagattaac aaaccttggc caaacccttc 239641 ttcttttcct tatctccagt aattgccaag caagttcaac tgcaaagcta aagaatgtgc 239701 caatctcttc tttcaagact ttattcctct ttgttttcac tgctcatttt cttcaatacg 239761 tcatcacaag ttccttagac ctacctttag ataaaatgct tctgtttcct taactataat 239821 gagcttcatg taaggagcaa caccaacatt tatgaaaaat ttgtcttcca tttcttctac 239881 caagttaaat acctggccct aaagtaatct gggtggccac ctggtctggc catgtcattc 239941 ttcagatgct acaaagagac atagtaagtg agctggcttc cccatctatc cacggcaaag 240001 tcagaaccgg cttccaagcc tagaacaatc ttctgactgc accaccttca tccctcacgg 240061 ttcatctgtt atctttcagc tttcttaatc ttgtttgcca ggtcgccacg catcactcat 240121 tactcactga tcttcgggtc tatgatcctt 
ataactgccc atgccattac atactttata 240181 aatcagatac tctcactgtc ccttcccttt tctacagaac ttacttgtct acctttttat 240241 ccctttaaat aaaagaataa tgatacccaa ttattgagcc cgttgttact gtgttaacat 240301 gggctttgcg tgcaactcta attttgccct ctcaatactt gtatgaagca aaaaccacag 240361 gagataagga aacttaggct cagagaagat aagtaacttg ctaaagttcc caagacactc 240421 tcattttatg cttcaacctt ctctgtagag tagcactgtc caatagaaat gtgatacaag 240481 ccatatatgt agtttaaaat gttctggtag tcagataaaa aatgtaaaaa taagtagttg 240541 aaactgactt tttaactata tatttcattt aacccaaata tcccaaatcg tatttcaaca 240601 tggaattgat gtaaataaat aattattaat aagatatttt acattgttac agcacatctc 240661 aatttggaca gtggattttc attagaaata cttgatctgg acttagactt cataaaatat 240721 atgattgcaa gaatagattc acatacccaa gttgttccaa acgtacttaa aacttttcca 240781 aaaactgaac acagcatcac ttttttaaaa gtttaaattt cacttcattg caaattatat 240841 taaattaaaa tgaattcatc agtcacatga gccatatgtc aggtgctcaa aagccacatg 240901 gtagtcagtg gctaccacgt tggatagcac agttctaaat ctatgaatgc attcttccag 240961 acaaactcaa ctatattttc ctacctgaac tcatgcagac gagtatgtct cctcctcctc 241021 ctcctcttcc tcctcctcct cttccttctc ctcctcctct tcctcttcct cctcctccat 241081 gagaacagtt aaccctgccc atcttccgag gcccagttca gatgtctttt cctacacaga 241141 gactttcctg atccactcaa aaggaagtaa acttctcctt tgaacttgca gagcactttt 241201 tctgggcctc ttgaatgtca atgatcatat ttctctttgt agagtacaca tttcctttca 241261 ctcctgactc ctgtactacc atctttggaa acccataaag gcttacctca aaacaatgta 241321 aaaagtagct cctaaaaata ttttgtgcat gaatcactca aaatgatttg tgagggaaat 241381 attaaaatga cacaagtcta agcctttgaa ataaactatt tactctccac aaacaaaacc 241441 agaagaggac aacttcggta aaaaattata acctctttgg tatccttata taggttgaga 241501 aagaatatga aaaaaaaaaa aactattacc acacctcact tgttttaaat tttataatag 241561 atacattggt ttcacaaatc caggggcctt tttgtttctc tctatagaca caaagtcttg 241621 ttctgtcacc caggctggag tgcagtggca caattatagc tcattgtaac cccaaactcc 241681 tggactcaag ctgtcctccc acctcagcct cctgagtagc tgggactatg ggcacatacc 241741 accacaccag gctaattttt tgtttcttgt agagacgagg tctctctatg 
ttgcccacac 241801 tgatcttgaa tccctggcct caagcaatcc tcctgccaca acctccccca gttctgatat 241861 tacaggcatg agccaccaca cccaaacaca aggggcattt ttaaagtaca tgtctatata 241921 tgtttcttac agtaaggcag aatgaatacc cacaccagcc ttccacagaa aacaactagg 241981 gatgttggat aatgtatttt aaagccctat ttaaaatatg catgagtgga caaaaaagta 242041 aagaacactt atgtctcaaa ataatcagga agaacattgc agccctacag ccagtgcttg 242101 ccttgaagtc atttgttgaa tgtggtgaat ctaaattctg ttttcctgac ctcacaggat 242161 tcagtggaca aaatgcgaat tctaggcttg cccatggtaa caagttgaat gggacagtcc 242221 ttcccataaa gctgggactc agccaggcat ggtggctcat gcctctaacc ccaacacttt 242281 gggaggccta ggcgggagga ttgcttgatc ccaggagttc gagaccagcc tgggcaacat 242341 gatgagaccc catttccacg aaatatttaa aaattagctg gatgtggtag cactcaccac 242401 aatcatggct cactgcactc cagcctgggc aacagagcaa gaccctgtct caaaacaaac 242461 aaacaaaaaa aaacactctg ggtctctaaa aaggatgtat ctttatggta agggagaaca 242521 gccctctctc aaccaaaggc aggaaaattt cctaccttaa actttgcaag tgaaaacaac 242581 tcttaacaca aatcacagta gaatggatat attatggtat atacatagaa tgaaatgcca 242641 caaagcaaca aggagcaatg gactacagat gaatgcaaca ctgaggctga aactcacaaa 242701 tataacgata agcaaagaat cagacacaaa agggtacact gtagatatcc atttatataa 242761 agtacaaaaa tggccaaagc tggtgagaca cagtggctca tgcctgtaat ccaagcaatt 242821 tgggaagcca agatgggtga atcgcttgag cccaggaatt tgagaccagc ctggccaaca 242881 tggcaaaacc ccatctctac taaaaataca aaaattcgcc aggcttggta gcatgcacct 242941 gtagttccag ctactcagga ggctgaggca ggagaatcac ttgagcccga gaggcagagg 243001 ttgcagtgag ctaagattgt gccactgcac tccagcctgg gcaacagaac aagactctgt 243061 cttaaatgaa tgaatgaata aataaataaa taaataaata aataaataaa taaataaata 243121 aatggccaaa gctaactgat ggtggtagag gtcagaatat tagcaacctt tggagatggt 243181 gggttggaga tcatgattgg gcacaaggga ggtttgtagg attccaatga tgttctattt 243241 cttgatcagg gtgctagtta tatgactgtt attacttggt gcaaattcat gaagctgtac 243301 atttataatt tgtatacttt tgtgtgtata catttcaata taataagaaa tgaaacaaaa 243361 atgaaactca acgaacagtt tcaaaggtag attaagcata gctgaatcag taacctggat 243421 gatacatttg 
aagaaattat gctgaatgca gctcagcccc tgccccttca tttcacagta 243481 atctttctaa agccttctgc cttctctctt tcatcggctt tttctcttcc acctctgttc 243541 cttaaatcct ggtaccttct agtgttctag ccatcacgtc cttcttttct taaccacaaa 243601 ttttattttg aatctactca atagttaatg aaaagtaata tttttaaaat gtaaagaaca 243661 tttatctgag atggacattg ttggcaaaaa tattaatcaa gtggtatctg tttagtagca 243721 gaagtactta aaatatctga aatgacttct agaaaacatt ttctaatctg ttttttgctt 243781 tccagtttga gaaaatttct tccaacaaag tccgaaacac agaattgatg ctaagtggca 243841 atttctaaat gacaaaaagg gaaagccaca aaaatgtagt tatgtcaagg tactatgcaa 243901 tgaatgtttg tatccctttg aagtggatat attgaaggcc taatacctaa tgtgacagta 243961 tttgagggtg gaacctttga gaagtaatta ggtatgaggg tggagccttc atgaatggaa 244021 ttcatgtctt ataaagacac taatgtagaa gaaacatgag agagatgatc atgatgatat 244081 gagtgaaggc aggcatctgc aaatcaggaa gaaggtcctt agtagacaca gaatctgcca 244141 gcaccctgat cttggacttc ccagcctcca gaactgtgag aaataaatgt ttactgttca 244201 aaccacccag tctatggcat ttgctagagc agcctgaact ggctaagaca caacaaaaat 244261 taagtctttt agaatatgat aatgcttttt aagaaactat gtcagaattc taggaagata 244321 atctgcacat aaatcaggcc tttcttagaa aacaaatgtg atgatgttat taaaaataaa 244381 tgattcaaag caatctttaa ggcaattcta tttctaattc tagaaactta attagttact 244441 tttctatttt gctctttgaa ctgacgtatt ctgtactgtt ttaaacaaat ttgatttttt 244501 aaaaacacaa gtaccatact atactcaaat gtgtgcatac ctacatagat atgtgcatgc 244561 atatatgccc atagaatgct gacatagtct cctgaggagt cttttacttg aaatctcttg 244621 cagacttcta acaactctaa tccactaaaa tacagcccaa aactataagt gacaattacc 244681 tctttgcacc ccaggaacta aggtccaacc tgggacctct ttgcaaccca ggaactaagg 244741 tccaaggtcc aagggtggga ggatgtaaag tattagattt tcacgctaat tgcacacaat 244801 gcctacatac aaggaaggga ataatagcag aagaaaagcc tatcccaatg tcattctcat 244861 aggtcacctg tccatttgct ctcagtggga gttaattccc agaaactagg ggggcacact 244921 tgtgagtcaa aagaggcagg agagagaggt gggtacatct gttctatcta atgggattcg 244981 gttccacagt agagccagtg cccatgttgc ataaacagga ggaagcagcc tcagatccag 245041 gatacctttt tttcaccatg atgaatagaa 
catcagtgca gaagggatgc agaagggaag 245101 atgggcctaa ctggaattat gtagtacata tggtgtgtat gttgtaagtt gtaggtatgt 245161 tgctatctgg ctagcacagt taaccaaata ctgtaagtgg acattgtacc ctcaaatttc 245221 aagatacaat gttgttgaac attgataaat aaatgatgaa gctaaaccaa tttccagcaa 245281 tgtatttccc agatactttc aaaatgtcag tttaccaaga cctctcgtat ttcatttcct 245341 cacaggtcct tttggtattt catcagaaat gactggctga tgggttccct tgtgaaagtg 245401 ccccatttgg acaagttaaa tgagtaaacc tggttaaaaa aatatattca ccaagagcct 245461 ctgggcatag gtacagtttc caagagccca agcattcttg gctacttgta agtattattg 245521 gaaggcactc aaaacaaaca ctccatttgc ctgatcttca gcaaagaaaa tatcagcctg 245581 tgtgagctaa tgtggatgtt attttagaat tagcttcact gccatttcag ggtagctaat 245641 actattgtcc tcaactcaaa gccttttgaa cttagattta gaaggcaggc aagctgcaaa 245701 gcttgtgact ttgagtgttc tagaaaaact caaatatgag tagatttagc ttggtcgtgc 245761 aggacacttt cccttaactt actgaggtat ttgtatcatg gaagaagttg aaagattact 245821 ttcaactgta ggaacagcag gggtagcttc acaggcatga aaccaggcag tggctcaggg 245881 cttcacagta caaagggctc tgtgctgagt taaatgctct gctgctgcca tcttgaaatt 245941 cttaatattt cttgaacaag gggctctgca ttctcatttt gaactgggcc ttgcaaacct 246001 ggaaacagca atgtggcttt acttctacct ccttcctaga caactttcct tcttgtttgc 246061 atatacaggt ttctctattc atattaaata taaaatgtat cttttagtag ctgagtctct 246121 gaaaggtttg gcccaccaca aggataaaaa acaggatgag agaacagtaa agtaaaatga 246181 taacattaga agaaatggaa actgggtaca gggtatacag gaattatctg tacaatcttt 246241 gcaacttttc tgtaaatcta aaattattct aaaatataaa gaccagctca tcatcaattt 246301 aaaaagaatg tagtaattat aaagagacaa ttgtaaaata accagaaaca acaacagtaa 246361 caaaagcctt tcacacttga aagttaaaaa taaaaaagaa acaacagtat tcttcttaaa 246421 aaatcacttg atcaaagaaa attaaaaatg gaaattgtaa acttggaaat atattactat 246481 tattttcata accattaaag gatttaatta gaattgttta aatatagcaa aaaatatgtt 246541 taaaactcaa cagtctactt ttggaccaca acaataggaa aaatgaaaat ccctttattc 246601 agaaatgtgc attaacgata tgtgcatgtg tttgtgtatg tgaataaatt ataaaatttt 246661 gctaaatgat actaaagaag actgaagtaa atgaatcata tcctattgct 
ggatggaaat 246721 aattaatttc ataaaaatat cagacttgtc taaagtaata catttttaaa attctttctg 246781 taatctgaaa aagataatga aaaatctctg tacccctggg ggatgcaggg aatcatcctt 246841 gggggaagga caatgtaata gtctgatcct gagacccagg gatacagggc ctacctcaaa 246901 ctgaggctga accaggtcag caatggctta attcacttaa aaacatgtcc ccaggctcaa 246961 aaagctagga aaggacagat ccagggcttg agaccattca ttcagtcatt aacgcattca 247021 ttaatccatc aattattttt tgatcaccaa ctctgaccag ggcattgttc tgggggagag 247081 acagagcaac gaacaagaca cataccagtt tgctttcctg ctttcttggg atatacattc 247141 aaaagggtta agcacagatt tgagaataca aaagcaacaa ataaataggt ttctccacct 247201 tggcactact gacactgggg gaagaatgat tccctgttgt gagggtggtt ctgtgcattg 247261 taggatgtta gcagcatccc tggtctccac ccactagatg ccggtagcac cctccacccc 247321 agttgtaaga acaaaaaaat ctccagatcg ccaaatttgc cctggggaga caaaatccct 247381 ccatgttaag aaccactgac ataaatgaat tcaataattt cacatagtgg taagcaatat 247441 gaagaaaata aaattggatg atggacagag atggagaggc accactttag actggagggt 247501 cacagaaggt tgtttcaagg ggagaaggcc aagaatcagg cattgacata tgagggcaga 247561 gagttctagg cagtgggaac agcgggtgaa aagttcctga ggtggtaagg agcttggtgt 247621 gttaaaggaa cataaagaaa acagagtgac tgaggtgtga tgacacagaa tgcgtgagca 247681 ggggccagat catacagcaa cttagggttt atgcttgaaa agatgattca gttgctcttt 247741 ggagaacaat ttgaggtagc ctgaatagaa ggggaaattg gtagtctagg agaaagactg 247801 tgggagcaga aatagggttt tagtgaagga aatgggagtc actgaacaga tctgtcacat 247861 ggtgatgtgg aggtaagagg ccttgctaat agactggaca tggacaacaa gttccccttc 247921 tcctttaggt gtctgccaag atgggtccca ttgtctatat tctgctggag atccacccct 247981 gcctccaact aggccagatg tgctgcattc ctgtttgggc tcaatctcta attcttggcc 248041 ccactctgta ccctcatctt ctaataggtc acccctgacc gctgccctag tcagctccat 248101 ggccagaaac ctcatcttcc cttagcctcc ctaccatttt ctcacatccg gtttccctcc 248161 aaacattgcc agcacttgaa gtctctcggg agctaaagct gaacattctg tttctatgat 248221 tatagaaacc caagaattat cttcatcatg aattccagag aagagcctag ggggtttctc 248281 agagtcatct tatgaacaaa agacatggta catacagggg tcaaaaatgc aagaaagagg 248341 aatgtacagt 
acaacatagt aaccccaaaa gggatattta aacagtgatt gaaatcacgg 248401 gttgaattgt gtctcccaaa aagatatggt caagtcctaa ccccatgtta cctgtgtatg 248461 tgaccttatt tagaaatggg atcttcgcag atgtaattag ttaagatggt cacattggat 248521 tcgggagggc cctaaatttg taagggtgtc cttacagaaa aaggagggga cacatacaga 248581 cacaagcaca cacaggaaag aagtccatgt gacaatggag gcagagactg gagtcatgta 248641 actgcatgtt aaggaacatc aaggattgcc aggagccact ggaaactagg aagaggcaag 248701 gaaggaatct cccctagcac cttcacaggg atcatggcac tgctgacact ttgattttgg 248761 acttctggcc tccagaactg acagagagaa tacatttctc ttgttttaaa acacccagtt 248821 tgtagtcatt tgttttggca gccctaggaa actaacacaa tgcacattta gacagtttgg 248881 gccaaaaaaa gtacatttta agtacatgtg aatttcctgt tgaaaagcta tagcacagaa 248941 tatttgaatg atgaaggtta atcagatgaa gaactccagc agcttgacct tctgaattca 249001 gttgatattg aacacatatg ccctgttaaa gctgccttct caacactcct tttggctgct 249061 caactgctcc actagaccca agctcctggg gtccaggctc taatgctggg ttctttgcaa 249121 agtgaaattc aatatgtacc ctctgggccc tgaagcctga gtaccatgcc tgagggagcc 249181 tcagtaaagg ttgattgagc ctttgcatgt tctgattacc ctttcccaca agctgtaaac 249241 ttgacaagca cacactactc caaagagacc tggacagtct tctaaaggtg tcaccctcac 249301 accacaccac agctgagtca gcaaaattcc agggagtaca caacagtttc atccccaagt 249361 ctgtctacag cgttatctat tggtgcttct ttgggtccac ataccacctt ttctagatct 249421 aatcaatgct tcttgtttac atccacctcc atatccagct aatgaaaatt taaatggatg 249481 tataaatctc accaaattat aagtggacca aggcattgta actcagaaga tggaatgcta 249541 gacagcaaga gaaatgtttt ggatggtggc acttctctgg catgctttca actctttggc 249601 aggggacaag aagccctttt tcttagtcat acagaagaat aaaaatatgc aatcaaaggg 249661 acaaattaaa attataaagc agtgaacaaa tggcaataat aatttgcctt tcattttaaa 249721 gtgtggcatt ctatggcatg gctatggttt ggatacagtt tgtttgccct cactaaatct 249781 catgttgatc tctgatcctc agtgtggtgg tgttgggagg tggggcttag tgggaaatgt 249841 ttgagtcata ggggcagatc cctcatgaag gacttggtgc catttttgca gtagtgagtt 249901 ctagctcttg tgaaatggaa ttagttctca agggaatgaa ttatttttcc caaaaatgag 249961 ttgttataaa tccaggatac tcctgggttt 
ggtccctctt tgcatgtgcc cacttccctt 250021 ttgacctttt ctgccatgtt ttgacacagc acagaagccc ataccacaag ccaagcagat 250081 gccagtacca tagtactttt atagcctgaa gaaccataat ctaaataaat ctcttttctt 250141 tgtaaattac atagcatcac atattccttt atcacaacac aaaacagaca aagacaggca 250201 ttcaatattc tgcagtatta tcttcagtgg tttaacagta ttccataata gcgctatatg 250261 atcatttaat caatccacta ttatagactg tttccattct tcttgtgccc tatttttaaa 250321 aaatactgtg ataacacatc cttgaagcta aattttagca cacacccagg gttatttcta 250381 caggataaat gtcttgatat gaacctatag agtttcagga attaacatct ttaaggcttt 250441 tggttcctac agcttttgat tcttattact gaattttttc ctacccatgc caaaaaagtt 250501 gtatgcattt gcacctccag cagcatatct tgagtgtttc tttcccaatc caattaccag 250561 cattatatat tatacaattt tcatctttgc caaagttacc cataaaaatg gtaaaaagca 250621 tttgagtgta atattaaaga aactacacac acacacacac acacacacac acacacacac 250681 aaacacacac acacagagct ttcaagaaaa taatcattga tctaaaaact taaaagcttc 250741 cagacaaaaa cacaggagaa tatcttttcg actttgaggt aggaaaacat ttcctagaat 250801 ataaaatttg atacacttca tcaaattaaa aacaaaaaca actcttgctc atgaaaagac 250861 atcattgctg aagtaaaaaa aaaaaaagca ccacagacta ggagaaattt ttaaaaacac 250921 atgtctaatg aaggacttgt atttagattt ttaaaattcc tacaaataat gcaaataaac 250981 aaaacaatag gaatgggaaa aaagacgtga tcgagaaatt ctcaaaatac tcaaactgtc 251041 aataaatatg aaaaggtgct cagcaaggag atgcatatta aagccacaga gagattgaat 251101 aagatatctg agacagtgtt acaaggcaat caaaagatat aaaggacata cacattgaaa 251161 acaaaggaga aatactaagc tttttttttt cccagaaaag acaatcatgt attaattatg 251221 taggaagaaa atccagaaat ctactttaaa aagccactag agctataagt agattgagca 251281 tggcctcagg atgcaaggtt aatatccaaa accaactgaa tttctatttg taatcaatga 251341 aggattgaaa gatgaaacca gaaaaacaat acaacgtata atttaatcaa agcacacaaa 251401 ccactcagga ataaatttag ccagatatac acaacaacct aaaaactgaa aatttaaaaa 251461 tattactgag gaaagaaaaa agacaaataa atcaaaccat aatttcagtg taatctcaat 251521 caaaatccca tcatcagttc ttgcagacat tataaaattg tattgaaata ccaagaaccc 251581 agattagcca atacattttg aataggaata accaaattat aggacttaca 
gtgtctggtt 251641 tcaagattta ctataaagct acaaaaatca agacattgtg tacagtgaaa gacaggatga 251701 taattgcatg aagaaagaca tatagaacaa tgggacagaa tagagaatcc acaaacagac 251761 tcataatata gtcaactgat ttcaacaaag gtgtcaaaag taatttaatg gggaaaggag 251821 agagtttcaa aaaatgatgg tataacaatg ttttcttaaa aaattaaact gtccctctac 251881 ttcacaccat atataaaaat gaattcaaaa tgaatcatag acctaaatgt aattattaaa 251941 actataaaat gttttcaaga taatatagaa gaaaacctct gcaatcttgt ggtatacaaa 252001 gatttcttaa gatacaagaa aaacccaaca agaacagatg aaaatattaa aaattggact 252061 ccatcaaaat ttaaaattca tgctctttga aagatatcat taaaaaatga aaaggcaaga 252121 catagactag gaaaaaaata tccacaatac ttatatcaga caagtgactt gtttccagag 252181 tatacagaga gatcttacaa ctcaaaaatt gacaattttt cagttatgag aacttcaaat 252241 atgataaaaa aaaaaaaaca atttttcaag ctggtaaaag atttggaaac ttaacaaaat 252301 aagataataa atggctgata ggcacataag atatgatatt tatcagcact attcatcaag 252361 aaaatgtgaa ttaaaacaag agataccact aaacttgcta gaatttaaaa ctatactgta 252421 tgattactgt atgaatgaca aatgttggta agaatatgga acaattggaa ctctcatgcc 252481 ttgctgatgg aagcataaaa tgtgacaacc ctttagagaa ctgtatggta atttcttata 252541 aagttaaaca tacgcttacc atatatcccg gccattttac ttctaggtat ttactcaaga 252601 gaaaggaaaa catatgtccc caaaagaatt gtacacaaat gcttattgca gctgttttca 252661 taatagcaca aaacagaaaa caactcaaat attcatccac agatgaacaa gtaaacatat 252721 tgtgatatgt ctgtacagtg gaatatgagt caacaataaa aagaaattac tgaaatacac 252781 aaggacatgc atgaatctca aaaatatgtt gagataaaga agctagacca aaaaatacac 252841 atacaatatg attccaatgc taagaaattc tagacaaaca aaactaatct gtagggtcag 252901 aaaatagatc agtggttgcc tgctaccagg cactgactgg gaaggggcat gaagagtttc 252961 ttttatggtg atggaaatat ttgatatctt gatgtggtgg ttacatggat atatatacct 253021 gccaaactta gtcaacctac acacttaaga tggcatattt tattgtatgc taatcatact 253081 tcaataaaag tgattttttt taaaaagaga tataaatgca tacccttcag tatgactaaa 253141 attataaaga ctgcaagacc aagtgttggc aaagatataa ataattagaa ctcctgtaca 253201 ttgatggaaa acttctttgg cagttaaagt taaataaaca gcttccctat gaaccagcaa 253261 ttccactcct 
agctttatac ccaagagaaa tgagtatgta tgtccacaaa aaggcttgta 253321 aaagaatgtt cacagaagct ttatttgtaa tagccaaagg ctggaaactc cccaaatatc 253381 tatcaacagg agagtagata aacaaattgt ggtgtgttca tacaatggaa aactacctgg 253441 caataaagga acaaactaat gatgtccaca acaacttata gtctcaaaag cattgcgtac 253501 agtgaaagac acattaactc atacagtatt attccattta tatggatttc aaaaacaggc 253561 agcactaatc tatgtggata gaaattagga tggcagttgc ctctgagcag gggaactaac 253621 tggaagaggc ataaggaaac ttcctggggt gatggagatg ttttatatct taatctgggt 253681 aatggccatg tgcatttctc aaatttcttc aagacgtaca ctttaatatt gctgtgtttt 253741 actaaattgt gccttgcaaa gcatgaaaaa aaacccataa aaataaaatt tgaagccaga 253801 aagattgggg ggttgagagt tcagcagagc ttgtgttcat ttgtgcttaa tttctcattt 253861 tcatagagat ttcaaagatg ctaactatca agaatattct tcaccaacaa caaaattctt 253921 cctagagcaa agtgacacag agctctagtt tcatgagggc acaaagaaac aaaaccagga 253981 gactgctgag cagtgtaagc ccaaactctt tgaactcata gaagggattc taaattggga 254041 atccaaacat tttctgtttc ctagcagaaa tactcatgca tgcaaaagct ccaggcaggc 254101 aatggggcta ttgaaatggc aatctggcca tgacagatgt tttaaaatgc atttgaatcc 254161 actcagtctc cactttaacg aaaggcggct ttctcccaga gttactcata acaactcatg 254221 tagtctgtgg tcacagttaa tctgacttac agggtcctga gtttgaaggg ctgtgtcttg 254281 gaaggtttct cacaggccat gagacaaaca gtgcctgccg cacaagaagc aacacatctt 254341 tcatggtgtt ttattgttgc tatcatgaaa tgtgaagtgg tatgactgta caaccagcaa 254401 gggcaacatc ccggttcagc tcatcagaac acagctccaa agagatttaa tgttggagac 254461 acctagtaac acattcattt ctggagggta gcatccctct gatcaacagg tcattgtata 254521 cattcagctg cttaggaagt atctgcaaat gtgacattct gattacaggc tggtactcta 254581 tggcagatgg aatctagata ttttcaaact tagaactctt ttcagggctg gcttcatggg 254641 ttcatgacct gtgcacacag ccccatgctc agaagggctc tgcacttggt ttaaggctca 254701 tctctcgcca tcttgaaatt ctaataattt tttcaaaagg agcctcgtgt tttcattttg 254761 tactgggttc tgccaattat gtagccagtc ccagctgatg ctacagaagc tcatctctcc 254821 tggaccttga gactaaaagt gaaccttggc cccactgctc ttaatcaaga gctaatctat 254881 atattaaatt tggttcttaa attcactggg 
ggtcgggggt ataccgtgga gggagaatct 254941 atgtaccagt ctgcctttac atttgcaaat ctggccaggt gatccccctt caatttggct 255001 ccacaaaaag ccttaccttg aaatacacat gcttagggtg ggagcaggat actgtgaaaa 255061 tgaaaggaag gctgggcaag gattaagtca gggtaataat gaagggaagg aaaggaaagg 255121 gaaatgtacg acagttgagc tcttaattgt tctcacttgg atttctgcag tttattcttt 255181 ctctagtttg tttccctgcc cgttttgcct atccacacaa aaacttgtgt gtgaatcttc 255241 atagcagcag tattcataat agcccaaaac tgaaaacaca aatgactttc aactgatgaa 255301 tgtaaaatgt gatatacaca atggtatgtt attcggcaac aaaaagaata aagtatagat 255361 atatgatacg acatggatga accttaaaaa cattatgcta aatgaaaaaa gccagaaacc 255421 aaaaaaaatc acatactgta taattccatt tatatgaaat atgcagaata ggcaagtcca 255481 cagagacagc aagtagatta gtggttgcca ggggctgagg atgagtaggt atggagtggg 255541 agaaatgggg agtgatggct ggctaatgga tatggagttt ctttgtggtg tgataaaaat 255601 gttctaattg attgtggtta tggttgtaca cctctgtaaa tatactaaaa gccatttaat 255661 tatgcagttc aaatgcgtga attgtataag tgaattacat ttcagtaaag ccatttttcc 255721 aaaatgtaat gttaagtaat cattacagtt ttagcatgtt agctgctttt cctcccccca 255781 gtctttttta gttgaaataa ccaagacaca cagaaagaac tcagcaatac acacactgta 255841 actcaagcca aatttcccct gggttactaa ataaacaatc atcagagtta ttgagtctct 255901 cagatgaatt ctggttccca tgatgctgca tcctgtacct ctgctgtagt tctcttatct 255961 agcaagagct cattgctgcc ctttctgcca caatgaatgt aaggaagatg acaagcaaca 256021 tttatttccc ttatttttta gtttgtttta ttcaaataac tcacaaatca gctgatcttt 256081 taggcaatga tagtattaat aaatctttta gaaaaaaatc atcctaaaat atgtacaata 256141 tatgaaaggt taactgcatc tggtcttaaa aatgcaaatt actaggatct cccctggagt 256201 gctactattt acagaggatt ttttttcctc tccctctaat attttcatgg tgatactaac 256261 tagaggcttt cagaataggg gctactgtgg aatgctggag aattccttcc ccaggatcct 256321 ccgtggctgt attactcaat gattcgattc ccatgagctt ctcttaggga agctaagagt 256381 aatttttttt cacatgcacg cacacccaca tataatgact gcaggcctta atccaactcc 256441 atataatcaa tttgcagaaa gacacaaaag ctaaaagaag ggacagggca ctatggaaag 256501 tagcagcaat gctgaaagtg ctgtgataga aacagcccaa gttgtggtgc 
cacggtgagt 256561 ggagtacaag ctaaccaaca ggcagagtgg ggactccatg attgaaggat acgatgagtg 256621 caccagaata cagtaatgca ggcatttcct agtggagtcc ttagagaaag aatgtgtgcc 256681 aagtacagtg gccaaataaa ctacaagaca cccagttaaa cctggttttg gataaagaac 256741 acatactttt gtagtataac tatattccaa atattgcaaa ggatatattt tactaaaaca 256801 ttgtttgttt tctgagattc caatttaact ggatgtcctg tatttttatt tgcttaatct 256861 ggcaacccta attctaagga aagaaagcct cagctcacta atggcaggta aatgtgagaa 256921 gggagaggaa attctgaata caggtccagt attctttatt cccctagcac agaggaaaca 256981 agacacaaat gacaaattaa ctcactaaga ataagagtca cactaaatct ggcaaactcc 257041 catgaagttc atagagttcc acagcttaga gggcatttcc cccaatttag caggtgaaac 257101 ctggaagcta cagttgcagg agcccagtgt tctaagccgg tggttctcca ccaggagtgg 257161 ttttaacaag gtctagggga tatctgtcac aactttggtg ggggaatgct ggtggaattg 257221 gcttcaagtg gagaggggcc agagatgcta ctaaatattc tacagtgcgc aggatagtcc 257281 cccacagtga ggaataattc agcctcaaat atcaatagta cttggggagg agaaactcta 257341 ctctcagcta aaaagtaaga aaagcctaaa tggggaaaag aggaatggca ctagaagtca 257401 tcactgacag ctggcgtacc cccagcagct gcagataaag agaatccagc agattccaca 257461 cagtacctgg gcagccaaaa ttcccgttct cacgtaagac ctctcctgga ggcaccctaa 257521 tgagtttggg gaatgtgcag agggctgatg agttggtgtc ctgagatgtt gacctctcag 257581 catttttctt tccatagggg gccaccttcc ttgtgctcat actgtcctgg ttcacttggc 257641 aaccatcaag caatgcccac ccacaccgtg gctctggggt gcagacttga tgctggtctg 257701 gactgtcatg gaccctgtgg cagatgtcct tggggctcag ctcatatccc ctccaagtcc 257761 ccagtgttat gcttagtggc tctaacaggc tcttactcca aacggccagc cttgtgtctc 257821 ctggttagag aactacccac agacttccag aatctctttt gcctgcacat gtgaagaacc 257881 aggagtgccc agaaatttat atcccctccc cacagtgttg gtggggtaat cccgatgggg 257941 aacacagcac tgagtggggt ctacactgtc cccagtactt ctctgtggag ttgcacttaa 258001 gttatcctct ggggggtgaa ttttatgaga ccttgaactc tgtggagcct ccctctcttc 258061 ttcatcccac ttcctctacc attttttccc aagaatattt cctgaccaat ccttttcctg 258121 ttaatcctac tctcagagac tgcaccagag gaacacagtc gaaggcagaa ctcttcgtgc 258181 tgccctcagc 
agatatgagg gatagacata tgacccaaaa tgggctgcag cagatattga 258241 ccactgcctt accaacaacc atctgtacac acaccctctc ctgccatgca cacagaactt 258301 gcaatgtgtt caagaggcag gaacctttgg tttcaggaaa agctgtccct tcacttagcc 258361 tcagatgccg agccatgatt gttgtaaacc actcatgtta actccgtcgc ttcttgccaa 258421 taattggtct tggagtggac acgtgcccca cttctggcca atgggatata aagagaagtc 258481 cactgggggt tttagagtag agatcttcta tcccccaaaa taagagtgtt gtcagaggaa 258541 tgacctcttt tctatccttt cttcctcctt atgatgctga ggtgaagaag tgatgcttgg 258601 agctgcaatc atgagcatga actcaagagg gttacaagac actgctcagc tcctgctatc 258661 actgagtcac tgactcaacc caagagcccc ttacctccaa acttctagct ctgtgagata 258721 atcaaatgtt tttttttttt cttttctttt cacttaagcc actgctgatt actcttctgc 258781 ttaccaatga aaatatcctg agatgaaagg caatcatagc cctcccatga aactggttaa 258841 taaattggat gttgtaatag aaatggtctc acttcctttg ggatcatgag ctgtagggat 258901 ggatgtctct gttggattgt gattatcttt tatgactata gggagaaaga catatttagt 258961 ggaagaagtt aaagccagca gggagggaaa cagagctgag agactgctct tatactaatg 259021 ttaattctta tagtgacacc gtcttcaact ctcttatctg taaagtggga ataataattc 259081 ctattatgca gggctgctat gagaataatt tgaaataaca tttcaaattg taaagtgcct 259141 ggttccactt ctcttaaatt taaaaaccta gagaataagg tattccaggc ttatctgcaa 259201 ag // LOCUS AC002482 107815 bp DNA PRI 21-AUG-1997 DEFINITION Human BAC clone RG208O03, complete sequence. ACCESSION AC002482 NID g2340090 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 107815) AUTHORS Dante,M and Gibson,A. TITLE The sequence of H. sapiens BAC clone RG208O03 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 107815) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. 
Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. SOURCE INFORMATION: This clone is from a release of the human BAC library. The library contains cloned DNA from human sperm. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); Kim et al., Genomics 34:213-8 (1996). The clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of RG208O03; actual end is at 107815 of RG208O03. The orientation of this clone is unknown. The location of this clone is unknown. 
FEATURES Location/Qualifiers source 1..107815 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RG208O03" /clone_lib="CITB-HS-A" repeat_region 4242..5282 /rpt_family="L1" repeat_region complement(6039..6639) /rpt_family="L1" repeat_region 6134..6552 /rpt_family="L1" repeat_region complement(7185..7411) /rpt_family="L1" repeat_region complement(7559..7809) /rpt_family="ALU" repeat_region complement(7870..7968) /rpt_family="L1" misc_feature 8968..9080 /note="match to human EST AA426245 (NID:g2107586) zv83e03.r1" misc_feature 8968..9080 /note="match to human EST AA403048 (NID:g2055610) zv63d09.r1" misc_feature 8987..9080 /note="match to human EST T64262 (NID:g668127) yc09a06.r1" gene 9005..10609 /gene="MCP-4" CDS join(9005..9080,9953..10067,10504..10609) /gene="MCP-4" /note="RG208O03.1; match to mRNA U46767 (NID:g1732122) and protein U46767 (PID:g1732123)" /codon_start=1 /product="monocyte chemotactic protein" /db_xref="PID:g2340091" /translation="MKVSAVLLCLLLMTAAFNPQGLAQPDALNVPSTCCFTFSSKKIS LQRLKSYVITTSRCPQKAVIFRTKLGKEICADPKEKWVQNYMKHLGRKAHTLKT" repeat_region complement(9161..9452) /rpt_family="ALU" misc_feature 9951..10067 /gene="MCP-4" /note="match to human EST AA426245 (NID:g2107586) zv83e03.r1" misc_feature 9951..10067 /gene="MCP-4" /note="match to human EST AA403048 (NID:g2055610) zv63d09.r1" misc_feature 9951..10067 /gene="MCP-4" /note="match to human EST T64262 (NID:g668127) yc09a06.r1" misc_feature 10504..10595 /gene="MCP-4" /note="match to human EST T64262 (NID:g668127) yc09a06.r1" misc_feature 10504..10726 /note="match to human EST AA403048 (NID:g2055610) zv63d09.r1" misc_feature 10504..10611 /note="match to human EST AA426245 (NID:g2107586) zv83e03.r1" misc_feature complement(10559..11088) /note="match to human EST AA404346 (NID:g2059071) zv63d09.s1" misc_feature complement(10676..11088) /note="match to human EST T64134 (NID:g667999) yc09a06.s1" misc_feature 10817..11088 /note="match to human EST C00820 (NID:g1433050)" repeat_region 
complement(11388..11410) /rpt_family="L1" misc_feature complement(12859..12929) /note="match to human EST D21008 (NID:g504828)" repeat_region 14553..14656 /rpt_family="MIR" repeat_region complement(16487..16594) /rpt_family="ALU" repeat_region complement(16612..16901) /rpt_family="ALU" repeat_region complement(18162..18384) /rpt_family="L1" repeat_region 19534..19647 /rpt_family="ALU" repeat_region 19648..19686 /rpt_family="L1" repeat_region complement(26321..26348) /rpt_family="L1" repeat_region 27743..28035 /rpt_family="ALU" repeat_region 28366..28392 /rpt_family="L1" repeat_region 28490..28512 /rpt_family="L1" repeat_region complement(28807..28827) /rpt_family="L1" repeat_region complement(29912..29993) /rpt_family="L1" repeat_region complement(30384..30672) /rpt_family="ALU" repeat_region 31292..31317 /rpt_family="L1" repeat_region 33022..33045 /rpt_family="L1" repeat_region 33825..34152 /rpt_family="L1" repeat_region complement(34242..34369) /rpt_family="L1" repeat_region complement(34578..34859) /rpt_family="ALU" repeat_region complement(36578..36610) /rpt_family="L1" repeat_region complement(37312..37342) /rpt_family="L1" repeat_region complement(37683..37838) /rpt_family="ALU" repeat_region complement(37852..37959) /rpt_family="ALU" repeat_region 40754..41037 /rpt_family="ALU" repeat_region 41055..41338 /rpt_family="ALU" repeat_region 41343..41363 /rpt_family="L1" repeat_region 43586..43607 /rpt_family="L1" repeat_region complement(46738..47032) /rpt_family="ALU" repeat_region complement(57803..57835) /rpt_family="L1" repeat_region 57962..58250 /rpt_family="ALU" repeat_region 58574..58843 /rpt_family="ALU" repeat_region 59889..60178 /rpt_family="ALU" repeat_region 60179..60213 /rpt_family="L1" repeat_region 60382..60421 /rpt_family="L1" repeat_region complement(61580..61868) /rpt_family="ALU" repeat_region 62573..63819 /rpt_family="L1" repeat_region complement(68474..68765) /rpt_family="ALU" repeat_region 70746..70784 /rpt_family="L1" repeat_region 
complement(76188..76493) /rpt_family="L1" repeat_region 76478..76656 /rpt_family="L1" repeat_region complement(81212..81508) /rpt_family="ALU" repeat_region complement(81522..81619) /rpt_family="L1" repeat_region complement(81636..82262) /rpt_family="L1" repeat_region 84233..84492 /rpt_family="ALU" repeat_region complement(85230..85263) /rpt_family="L1" repeat_region complement(85264..85565) /rpt_family="ALU" repeat_region 85895..85925 /rpt_family="L1" repeat_region complement(86136..86206) /rpt_family="L1" repeat_region complement(92598..92887) /rpt_family="ALU" repeat_region 93762..93780 /rpt_family="L1" repeat_region complement(103505..103626) /rpt_family="L1" repeat_region complement(104724..104749) /rpt_family="L1" BASE COUNT 29663 a 25319 c 24180 g 28653 t ORIGIN 1 aagcttgccc tgggaggtga ccaacaattt acttcagatt tggcctcctt tgacatctgt 61 gtagtttgag aagtcttcat aaatgagaca tgatttatac caggtctcaa atgattcctg 121 tttgggaagg aattacaggt tgatggaata ttataagcaa tgcatatagt gatggatata 181 agaactgcag gtggtatgaa ggaagaagta aggaagtggt cttgtaatgt tttcatgggc 241 aatgcatggg tttttctccc ccagcatacc aatggatgtc tcctcttctg caggcttgcc 301 tttctggaga gattcagccc tggaggctga acatattgat tagtgctggc ttcctcccct 361 tttcagtagc cactccagca gttagacatg tttcctataa aatgaaaaat ttgcaccatc 421 gtggtgctcc tcagagcctt aacaagtgaa atgagataaa cagggaaaag taaaggcaca 481 aggtgcccca gctctgcttc cctgagcagt tttggtttta atctgttacg tcagaactta 541 ccaataagtt actccttgag aatagacttt agaaggtaaa caaatatctg aaaaccacaa 601 cattacatac atttgccaac actattatgt tactctctaa gattgtgggc actaagattg 661 cctagtacta tagtgggcaa gatggcagga tgcttgggtt ctaagattct acagcaacat 721 aatgattcta aaataacagg attctataga tcttggagac ttctgttata taaagattat 781 tgggttggga aacagctggc tgaacttcta ctctagacat aactgcatta ctaacatagg 841 accttcgaaa gttctcactc tgattggtga accagggtag attttcaggt cttcatagga 901 catcatccat tcagaaccat cagcaaataa tcccggaagg tctctatttc tttccccagt 961 acagagtctc cctgatttat ggcagcaggt tcttctggaa aaggcatgga agaagattta 1021 tagtattaga cctatgagtt 
gccagattaa gcaaaaatat tgcaagaggt atatctatgc 1081 ttaaaaagaa aaatcactcc ctgtttattt gcaagtcaca tttaactaac tgccctgtat 1141 ttcatctggc aatccagcat tgcagggccc ttcctcaaaa agacctggcc tacccatcca 1201 tcaggagact tcaggtctat aaactggctt agatctgata actgggtaag agatatgaac 1261 atactagctt tctgcccact agaattttct accttaaagt caagaagcac cttagtaagc 1321 atctcatcta tttcatttct agggattcca tagtcttagc tacaaatcct tgtcaatcaa 1381 agcactcaga ttccaactct ggacctgtgg cttcttgggg taattgtgtc caccctgtct 1441 ccgctgttta ataaacattg ctgcctggct tttgttgctc ctggaaccca tttctctcac 1501 tgagatcagc aaccccagtc tcattgcagc agtgattctc caagagtggt cctcagacaa 1561 gccttatcgg tatcacctga gaacttgtta gaaatacaca attttggcac tgttctaaga 1621 actctccatg cattatctta ttgactctcc caacatcgta atgagatagc gattaccaat 1681 cagtttcctc atggaggttc agagagggaa gtaatttgct ctgaatcaca taaccagtaa 1741 acgatggata taaaacttga acccatgctt aactgacccc agttcgcagg ctcacattct 1801 cctatgtcgt attaagtcct gaatcacagg aaaggacaga caggagattt tacagctggg 1861 aagatttgaa tgtcctctca ctcatccagt ttataggaaa gtttggggca tatgtgtgtg 1921 agtcaatttt aagagtgtta gactgagaaa aataacatat agggttcaat cgcccaaatt 1981 cagaatttaa tgtactggca caagtacctg gaagggatat taatggtcag ctagattggc 2041 taattgaagt ctagacttaa aaagaagtct aagagaagga agcagagcaa gatggctgaa 2101 taaaagtctc cactgatcat cctccctgca ggaacaccaa attggacacc tatctacaca 2161 caaaaagcac cttcataaga accaaactta ggtaagtaat cacagtacct ggctgtaact 2221 tcgtatcact gaaagaggca ctgaagagag ttggaaagac aatcttaaat tgccaacacc 2281 acccctcccc tatctcttgg cagtggctgc atggagcgga aagaaaatgt gggcgggggg 2341 agggaaagtg cagtgattgg gtgacttgca ctggaactct gtgctgccaa cactgggcag 2401 aactcaacca gcgcccacaa agggagcatt tagacctgcc ctggccagag aagaattatc 2461 tgtgccagga gttggaatgt taattctggc aagccttgcc accatgggct aaattgctct 2521 ggctaaattg ttctaaataa acttggaagg cagtctaggc cacaaagact gcacttacta 2581 ggcaagttct agtgctgtga actggtgggc gacacgcaac ctagtgagac atcagctggg 2641 gcagccaagg gagtgcttat gttacgcttc ccccaacccc atgtgtcaca gcttgcagct 2701 ctgaaagaga ccccttcttt tcacttgaaa 
agagaggtat taaaaaaaaa aaaaaacttt 2761 gtcttacaag ttggatgcta aatagctcag ccacagtagg aagccacagt aaggcactgg 2821 ggagggttgt gaggctccca ttccagggcg tagctccagg atatttccag atacaccctg 2881 gagctgaaag gaatccacta ccttgaaggg aaggacccag tcctggcagg attcattacc 2941 tgttgactaa agagccccgg ggagctgaat aaccagcagc cataatcagg tagcacatgc 3001 tataggcctt gggtgagact ctgagacata ctggcttcag gtgtgacccg gcacattccc 3061 agatatgata gctacaagaa caggccccct tgcttgagaa aaacagagag aggagtaaag 3121 aggactttgt cttgaactta gcagctcaac tacagtagag aagagcacca aatgggcatt 3181 ggggtccccc gttccaggcc ttggctcttg tacagcatct ctggacctac cttgggccag 3241 aggggagccc actgccctaa agagtgagtc ccaggcctgg cagcattcac cacaagctga 3301 ccaaagagcc cttaggcctt aagtgaacat tggaggtacc ctggcagtac tccttatgga 3361 cctgtggtgg tggtggtgaa catgaggaga gactcctcta catagggaag gggaaggaaa 3421 ggatggaaaa aacttcattt tgtggcttgg gtgccagctc agccacagtg gaacagagca 3481 acaggtatat tactaaggtt tctgactcca gaccctggct cctggacagt atctctggac 3541 ctgcctaagg cctggaagaa tttgctgccc ttaagggaag gacataagcc tggttggttt 3601 taccacatgc taattgtaga gccctagggc cttgagtgaa catagccagt agtcagctag 3661 tggttccagt tggtgctgct tttaggtcca acccagttca gtcccagtag tggtggccac 3721 gggggtagtt gtgtaacctc tcttccagct ccaggcatct cagcacagag agagataatc 3781 tgtttgtttg ggacaaagta agaaaaaaga acaagagttt ctgcctggta gtccagagaa 3841 ttcttctgga ttttatccaa gaccaccaaa gtggtacaca gaaaaccaca gaataacatt 3901 gtaattgtgg tgtgtaaact atgcataagt tacgtagaaa gacaaaaaga tgaactgatc 3961 caaatatctg caacaacttt tgaagacata gatagtacaa taaaatataa acagaaacaa 4021 cagaaagtta aaaagtgggg agatgaagtt aaaatgtaga agttttatca gttttctctt 4081 tgtttatgta atcagcatta aattgtaatc aattttttta aaatgggtta taagacacta 4141 tctgcaaccc tcatggtaac ctcaaatcta aaaacacaca acagatacac aaaaaataaa 4201 aagcaagaaa ttaaatcata ccactagaga aaataatctt caccaaaagg aagataggaa 4261 ggaaggaagg aaagaaagaa gagaagacca caaaacaacc agaaaacaaa taacaaaatg 4321 gcaggagtaa gtccttactt attaataata acattaaatg taaacagact aaagtctcca 4381 atcaaaagac gtaaaacgtg gctaaaagga tgaaggagca 
agacctgatg atccgttgcc 4441 tacaagaaac acacttcacc tataaaggca cacaaagact gaaaacaaaa gggatgcaaa 4501 gagatactcc aagcaaatga aaaccaaaaa agagcaggag tggctatact aatatcagac 4561 aaaataaatt tcaagacaaa aaccataaaa aaagagacag agggagttat tatatagtga 4621 taaaggggtc aatttaacaa gaagatataa caattataaa tatataggca cccaacacta 4681 gagcacccag atatataaag ccaatatcat tagagctaaa gagagagata gactccaaca 4741 ggataacagc tggataattt aacaccccac tttcagaccc tgaaacagat catccagaca 4801 gaaaatcaac ataggaacat cagacttaat ctgcactata gaccaattgg atctactaga 4861 tatttacaaa acatttaatc caacagctgc aaaatacaca tttttctcct cagcacatag 4921 atcattctca aggatatatc agatgttaag ccacaaaaca agtcttaaaa cattcaaaaa 4981 attgaaatag tatcaagtat attctctgaa cacaatggaa taaaactaga agtcaataac 5041 aagaggaatt ttggaaactc cacaaacaca tggaatgtaa acaatatgct cctgaacaac 5101 tggttggtca atgaagagat taaaagggaa actgaaaaat ttcttgaaac aaattatgat 5161 ggaaacacaa tataccaaaa gctatgggtt acagcaaaag cagttctaag aggttaattt 5221 atatagctct aagtgcctac atcaaaaaag aagcaaacct tcaaataaac aacctgaaaa 5281 taaacgcttg aagcaatgga tatcccattt actttgatgt agttattaca cattacatga 5341 ctgcatcaaa atatctcatg taacccataa aaatacacac ctattatgta ctcacaaaaa 5401 ttaaaaatta aaaaataaga agcctacagt taatgaagtt gaaatgccag aaattctctg 5461 gaatgacagg aaggagtcca gagactcagg gaagtggatc tgttggaatg gatctagtct 5521 atctgttgga tctgttggaa tggatctagt ctatgtgcaa gctgctggga ggcccagggg 5581 gacactccct tcaatccagg tattaggaaa tgttttgatg agttgtggga ggagggcagg 5641 atgcttaaga gctctgtagt ggctattctt tatagatcag tgatgacagt gggacatgat 5701 ataatcccat agttctcaaa cttttgtgta tttcaggatc acctaatggg cttgttaaag 5761 cagtctgcca tgctactact ggtggtaatt aaagtaatca tgatatgttg acagatccta 5821 ttcctgtgcc cctttccatg gtgcacctaa aacatacgag caatccaatc ccagaatccc 5881 atttggaagt tctattttat ggggccaatt caatgtcagt aagtgtccca gcaagaaata 5941 gataacactt aaaatggaag acttggagaa aggtttagta aagaagctat ttacaaagat 6001 gtaagtgggt gaaaagagac cacaggggtg tttggtacca tgaggctatt aagaacatgg 6061 ttagactgca aaaaatttct ccggttctgt aggatatctg ttcactctga 
tgatagtttc 6121 ttttgctgta cagaagctct ttagtttaat tatatcccat ttgccaattt ttgcttttgt 6181 tgcaattgct tttgacattt tcatcatgaa atcttttcct gtgcctatgt tttgaatggt 6241 attgcctaga gtttcttcta gggtttttat agttttgggt tttacattta agtctttaat 6301 ccatcttgat ttcatttttg tatgaggtgt aaggaagggg tccagtttca gtgttctgca 6361 tacggctagc cagttttccc agcaccactt attaaatagg gaatcctttc tccattgctt 6421 atttttgtca ggtttgtcaa agatcagatg gttgtagatg tgtagtctta tttctgagat 6481 ctctattctg ttccattggt ctgtgtgtct gtttttgtac cagtaccata ctgttttggt 6541 tactgtagcc ttgtagtata gtttgaagtc aggtagcatg atacctccag ctgtgttctt 6601 tttgtttaag attgtcttgg ctattccact gcggcctctc agggagtgag ggaaagggag 6661 agcgtcagga taaatagcta atgtggatgg agcttaatac ttgggtgagg ggttgatagg 6721 tgcagcaaac caccatggca catgtttacc tatgtaacag acctgcacat cctgcgcatg 6781 tatcccaaaa tttaaaataa aataaaaatt ttttaaaaag aacatggtta tattcctaaa 6841 tctaaaaagg aaaaacaggg gatagctcag caaacctgga taggagagtc atggagaaac 6901 caccttgaaa gaagttaaaa ccattagttg agggatgtat ccaggttgag gaacacaggg 6961 agacaacaga ggagaagaac ctcaacttaa ctcttctccc tgctctaatg tcctcagaag 7021 ctcccttgtg cctaaaccca gtcagaagct agaaggtcag caacctcact gatgtaaccc 7081 accaagattg gcttcctagg gcagagaata gaatggaaaa gaatggaatg tgtgtctgaa 7141 gggacaagct tataatatcc ttttctcagg tgagtcttga cacatttttg tgtcaaagtt 7201 aggtaggtct caaaatgaat tgagtggtgt acaatctttt gatattctct ggaagagatt 7261 gcacaagatt gaaatgacca cttccttcag tgcttggagg aattagctgt ttgtgccctt 7321 tgagcccaga gaatgttttg tgggaagatt ttcaattcct gattccattt ctttaatagc 7381 tttagaatga ctttctattt tttatttatt tgtgcaccct ggtcagttag atttttaaga 7441 aattcatcca tttcctctaa actaaatttt tggcataagg ttcttatcat tttctttttg 7501 gtgtctgcag gctctgcaac tggttatatg tacctttttc cttttttctt ttcttttttt 7561 ttttttctgg gacagagtcc tactctgtca ctcagactgg ggtgcagtgg tgcaatcttg 7621 gctcactgta acctcggcct cccaggttca agcaattctc ctgcctcagc ctcccaagta 7681 gctgggatta caggcatgtg ccaccatgcc tgggtaattt ttgtattttt tttagtaaag 7741 acagggtttt accatgttgt ccaggctgct ctagaactct tgacctcaag tgatcccccc 
7801 gcctcagccc cttttttctt ttttttttca ttttcagttc aggagagttt tagcttaatt 7861 ataggctaca gaaccagctt tgggcttcat ctatcctttc taatatttac tgtttcctat 7921 ttctctaatc ctagctcttt atttcttccc tttactttca ctgggcttat tttgctatag 7981 tggaatgcag agggatgagt attccaggaa ggcacaaaaa ctgtgccaag tcttggagct 8041 agggatgagt gggaaaggga catgttcaac cattttaagc cattccctcc ccacctccca 8101 gctcccagat atgtgcccct cgcaggagga gcccaggaat gggccaaaca cctcacttct 8161 ttgctctgag ggccacccca gccctcccat caacagctct agaaacccaa tggtccttcc 8221 tggaaacacg gggcctgcat caatcagagg tgtttgaacc atgtccctct gggcctgagg 8281 ggcagaaggg gacacaatat gtaatgtaag gagcccctgt catcagaaat ctgacttaat 8341 ctgtttcaga tattagactt ccacataaaa gttgacttgg aaaaagactt ctgctgctaa 8401 acaaaagttg aaactgcctt ggtgataaaa tataagcaga ccagctttct cttctagctt 8461 tccctctcat ttcccataag attttggtca agttatttaa tctctctgca tccgtttcct 8521 cttctatgaa atgggcatga taataatggt atgtacctcc tcaaactaaa tatataacgt 8581 gaacagagtc cttagcacag cactctttct ctacaggagt taattttcat tgtttttctc 8641 tttcctgttg gagaaagtaa gaagaaaaca gctcctttat ggcttcccat ggtgaatggc 8701 tggggcgcgt ctgtgtccct ttctcctctc tggctccttg tggcctgaac agccagaagg 8761 aagccatgcc atgctgtttc agccctcagc ttccctcttg catttcctag aaaagtcttt 8821 ggtgcccagc tccagctcag cagattcagg atcccccttc atcatgactt ggtcaacgcc 8881 ctgctcaggc caaggtcctc tgagagttcc aagcttctcc actccctata aaaggccggc 8941 ggaacagcca gaggagcaga gaggcaaaga aacattgtga aatctccaac tcttaacctt 9001 caacatgaaa gtctctgcag tgcttctgtg cctgctgctc atgacagcag ctttcaaccc 9061 ccagggactt gctcagccag gtaagtcacc tcccttcgac tctccctctc tttccctctg 9121 tttctctatt caaggaagac ctaagcccga gtgctcctcc actttttttt tagattgagt 9181 ctcattatgt tgcccaggct gaagtgcagg ggtgcgatct tggctcattg caaccttcac 9241 ctcccaggtt caagcgattc tcttgcctca gccttctgag tagctgtgat tacaggcacc 9301 cgccatcacg tgcagctaat ttttgtattt ttagtagaga aggggtttca ctatgttggc 9361 caggctggtc tcaaactctt gacctcaagt gatcctcccg tctcggcctc ccaaagtgct 9421 gggattacag gcgtgagcca ccaggcccag ccaagtgccc cacttctaag cccaccagaa 9481 
tagtaaggct cctcagaggt tcactttaac atctaatttt aaagatagaa agctgaagcc 9541 catgttggag gcagaaggga ccctagccat ccacctccag gttattgcag agcaagaatg 9601 aaacctaagc ttctgactcc agatttaggg ccttttcttt gacctcatct gatcgtccca 9661 aactctgcag atctgggacc acacccagga cctttcccac tggccttgcc cgtggcctcc 9721 cctagatggc tgtgacatgt ctccaccatg cagctgagcc tttgagatcc tgaggcacat 9781 gtcacaggtc ccacctcacc tcagggtcta gggtgggagt gctgggcttg ggggtgagta 9841 agatctactt cttcctcttt gctttgcatc ccatacagat gctccctgct gtattcaagc 9901 tgagaaaagc ctaacacatc ctcaaagtct ttttctttgt aactatttct agatgcactc 9961 aacgtcccat ctacttgctg cttcacattt agcagtaaga agatctcctt gcagaggctg 10021 aagagctatg tgatcaccac cagcaggtgt ccccagaagg ctgtcatgtg ggtagaaaaa 10081 tccctgctca cctggctcct ccccactccc acattcccca atccaaagtt ctgccccagg 10141 agacagacgt cagactgact tgagatctta ggatgagatc tagccagact gtgtgatgca 10201 aatcctccaa ttttggctgc acaacaggtc caaagaggac ctataatttc ccacaccttg 10261 tttcctggat gggcaccagc ccacaccctt tagcagatgc caggatcagt ttcccagggg 10321 cagcaagagc agtggctgcc tccagagacc ccttctgtcc acacacctcc tacttcctgt 10381 cctggagggg tgccccttca cctgtagtag gtggaccagg caggtttaga acccagtgtg 10441 tcatctcctg ggtaaaccct caaagggttc catctaactg tgccagatct ccttcctcca 10501 cagcttcaga accaaactgg gcaaggagat ctgtgctgac ccaaaggaga agtgggtcca 10561 gaattatatg aaacacctgg gccggaaagc tcacaccctg aagacttgaa ctctgctacc 10621 cctactgaaa tcaagctgga gtacgtgaaa tgacttttcc attctcctct ggcctcctct 10681 tctatgcttt ggaatacttc taccataatt ttcaaatagg atgcattcgg ttttgtgatt 10741 caaaatgtac tatgtgttaa gtaatattgg ctattatttg acttgttgct ggtttggagt 10801 ttatttgagt attgctgatc ttttctaaag caaggccttg agcaagtagg ttgctgtctc 10861 taagccccct tcccttccac tatgagctgc tggcagtggg tttgtattcg gttcccaggg 10921 gttgagagca tgcctgtggg agtcatggac atgaagggat gctgcaatgt aggaaggaga 10981 gctctttgtg aatgtgaggt gttgctaaat atgttattgt ggaaagatga atgcaatagt 11041 aggactgctg acattttgca gaaaatacat tttatttaaa atctcctaca cagtggtgtt 11101 ttcttcagga gtaactgcca accagtaggg gctctcagag gtgtgggtgg 
atggcatgcc 11161 agtggaaggc agatgtacag aggccctggc catggggcta gcctggctca tctgagtttc 11221 aaggagggcc tgcctcacca gcccaggctg ttctggaaca atggctccag tccccagcat 11281 gggcgctcct ggctcgcaac tcatccgttt catattcagt tcatttcttt ctttaaaacc 11341 acactcctgc caaaagtaaa cccatccctc cccctctagt ctcccacccc aatctatgcc 11401 tccccactcc aagcagcaag tgattcagac ttcatgtgca cctgacactc agagggaaac 11461 cagatgacca agtcctagtg agcctcagga ttggtaggga agagggtgct tgggcctaat 11521 ggactttaga ggatggggaa gtgtggtcat agcaaagatt cttttagggg aggtccaaga 11581 tagcaatgtg gggagagggc ctggggggaa tatccccaaa cggtcagata tgggcttcca 11641 gaatgtaaaa gttggagaca tgtagaaagg aaagttgagg gcaggttcta gaggaattag 11701 aatatcaggg tggggagttt agactcagtt ctgaaggcag gaggaaggtg caaaggtttt 11761 tgagtcatcc agagggaaga atggatgcca ggaagactag ctattggctg tggataagat 11821 ggagttgaga agggagggcg gaaaacagcc tactgcaata gtccaagcaa gagaaaatag 11881 gggctgaaga agggcagtca tgggggtgga ggggaggagg cagattaaga agactgggcc 11941 agatgtgtga catgttgaat ggggaatgag ggaaacacag cgtgagacct ttgccctcta 12001 aaagtgatct ttaaaggcgt gccaacccca agacctagaa atacttcctt ccttggctat 12061 tcttccttga atcacacgtt gtacaaattg aagcttatga ttgtcccttt taaagcgttg 12121 gccctcctct tcctatatgt ccaccgaggg tttatcctgc tcttcccaag acagcctcac 12181 tcctccagat aagctgggtg ggcatctcaa tcagacccag aacctaccca tgcctggaca 12241 cccactgaga tcttcctcaa gaagagatgg gcactgctga caacagcagg gcctgtcccc 12301 cagatctggt accacactgc atcaggcagt acacttggca atgtgggaca cctggaggag 12361 ttgtcctagg aggcaacagc tctaggctct aatgccagat ctgcccctga attgctgcat 12421 ctctctttag ccctcagtca cctcatctat aaaatgggtt tagcaacatt ttacaggcag 12481 acaacctcac atgatcccct ccagctaggg cagttgtgat ttggtgataa catcaccccc 12541 caagccctct gggcagggca gggagaagaa gagtcctgca cccaaatgtg gatccagctg 12601 tgaaccccag ctttctgttc cactgtacca ctcctggcag ccaaatctgg aggcagagaa 12661 tctgcaccac caccctgact caaagcttac accccaggac aaaggcagtc actgagagct 12721 agtgagtcac ttattagccc agttcccctg gcaccaggtg caacatgtct ccctcctcct 12781 gacccatcct tcagcccctg gctcccacct 
ctcaatgcca atcaagtttt cactctgaaa 12841 tccaagagac ccagagggtt gggggttgat gattgtataa tttaaatgtt taaagtgcaa 12901 caacatagct tatcaagacc aagcagatcc tctgtgacct agcaaaagca gggcagaagg 12961 aatggtgtag ggctggtagt ttcggggaca ggtgaagcca tgtggtttcc agagcccaca 13021 atggaaagaa atctgctcat tttctttttg acgggcagtg cctcagcatt tttctgtgcc 13081 tctgaaccca tccaactgtg tccaaggcgc aggcctcttt gcctctcttc agcttgaatc 13141 tgtgaacaga gaataagaga gaaggctgag ccacccaagc caccactggg tgggcaacgc 13201 acaagccccg cctcctccgg ctgccaggtc ccctaaactg ctcttgctag gccttgggct 13261 cccctctgta aatgaagttg ctgcacagga tgtgtggttt gcacgctggg gaacagccaa 13321 agcaccagga gcttatgaat tgaatttcct ggacccctct cctggtttca atttggtgta 13381 tcagggtggg gctgaacagc ctgaaaacat tctgacgatt aggtgagatg agaaattccc 13441 atcctggaat agctctttgc accaacttta atgtttctgt gtccatctgg ggccttactt 13501 gttgcgttgg actcatgtgt ttgtgtgtgt gtgtgtgcgc gcgccttgtg tgtgtgtggg 13561 tgtgtgtatt tttctcttct gatctgtcta cttcctaaac tggactattg tccaatctat 13621 agaccaactc tatctcttac tctctggttg atcttagaca aatatctcta ccttcctgaa 13681 cctcactttt catgataaca aaaatatgtg ccccaagagg atcgacagag agaatataca 13741 tgaaaaagcc ttaaaatact gactggtttg tgaaagctac tccaattaag tttgtaaaga 13801 ccctgtgatg cagctatgtt tatgaccctt cacagataag gaatacagaa ggatacacag 13861 acaattcagt acaaaggagt tacctgtcca cggtcaccca agactatggg ttccaaatcc 13921 aggggcagga actcatcctg gttcccttgc tctttccagt gcacaataac ccctcatgct 13981 tttgttaaac ctctgagctc ataatgctgg gatacaatac ttggcacctt ggagtttcag 14041 gaaacacaag ccactgcagg agcacagatg catccttcct cacacctgct aggaagctca 14101 gcgctccctc tagcgcccaa gtgtggcttg cacccaggtg ctggcagctc tgggtttaga 14161 aaccttattc cggtaaaata ctgtcatcta gtgtctactc ttgtaattcc aggctggcgg 14221 ggttctgttc tctagggaga gattgagcag gtgatcactt acattaagcc ctcattggag 14281 cagatggagc tggtatttct gtaacacagg attgccctca ggggaatctc ttgctccgca 14341 aatgagaagc aacatctgga gaagggtacc tgcactagaa gaggaacaca gacgatggtt 14401 tgcatccatt tctcaacctc tcacctgtag cacaattttt aaaaaacaaa caaaaaaacg 14461 cctcccagga 
caagccctgg cttggggacc acatccagac ggtgccggca tagctctgcc 14521 tcatcagcca ccttgttctg gggctaggta aagagtcaga agattggggt tcagatctga 14581 gctccccgca ccccagctga gtggccctga ggaagatgtg ccttcaccct ggccccagac 14641 ttcccatctg taaaataaaa ctttttggag cctggtttct aaaggctcat ccagttttgg 14701 tatttcacca catgtgcatg gttctgcccc agtattcatg tccaagggct ctggacccca 14761 caccctgggg acctggttag tctgatgcag gacacagtgg tccacagttc acagacctga 14821 tctggggaag attaggaatt gttcgaggag tccaaactga tgagatgaca agggaagagc 14881 tcccaggaac tagctagctg ccctactcag tctcttgaat cccctcctgc cctggtcctt 14941 agctctttcc cagcccccaa cccccactct gtccatttgt gcattctcct cccacaccac 15001 atcccgcaag agacctgcat tccaatccca gcttcttcct cctagttttt ctgtttgcac 15061 cccctttcat gcctcttact aggttggtat tctccaaaca gtcttggagg gcagacatca 15121 tcaactccat ttcactgata gagaaagtag agtttgagag ctaaagtgac ttgcccaaga 15181 gtgaccaatt gttagagccc ggtcttgcct tctgactcta cttggtgtct gtcacactgt 15241 agaagctaca cccaggctgt cccctggacc ttgggcctcc cagctttacc cgagtaacct 15301 cacaggttcc cttccttggg tcttgtactg aagctgtcag ttcagttctc tgtgtttgcc 15361 ccaactcatg gttctcctac tagccctcac cccagcctgt ccaagttcac cattcctcct 15421 ctttaggtag gaagagggga gatagagttg gtgagttcaa ggagaagcct gctccaaccc 15481 cagagtggct gaccacgttt ctgccctccc cagagagaag caaaatgatg cctgccacac 15541 tcactgctct tgctgtccac atcttccggc cacatcccag ctagcagcaa gcacaccagg 15601 gctgtggtga tgatctgcat gtcttctggt ctggcttggg ccttcctgga gcagctttga 15661 tgagcctggt gaagctaaga gctcaccact gtgccgtcct gcaatgctga gggcttttat 15721 ttgaaaaact ccacccaaaa ctccaggcca tggtgggagc cacaagtggg aagccatagg 15781 gacagggaac agttgctcac acccagcgac catgtggtaa actcctttcc ccacctctcc 15841 tggctggtat tcgtggtggt gatggattga taggttgcat tgggaacttc cccaagtgtg 15901 ggagcagata agaatcttga aaaagtgcgt gggaactgtc ctggttgagt gggaagcaaa 15961 gaaaagtaag tgagaggaag tggcctctgt gtggcttgtc caaccctacg tcattccatc 16021 caagtactac tgcttcaggg cagaagagtc tggacctgaa cagagataca cctgcatttg 16081 aatcccagtt ctgcagagga gggattacat aaacttaccc aaatgcatat tcacacaaaa 
16141 tgcaaatagc aatacatgtc tcaaggacgt gcgataagaa ttaaataaca ttctctatgt 16201 ccagtgactg gcctagatat ttgccatttc tattttttct atttttttca atcctcaaca 16261 attttcaata gcttattctg ttctaggtcc ttggttagcc ccagagaagg cagttaccaa 16321 tgaggagcac tgcctgacct caagcaactt ctcttcaatt cacagataaa caagtgaaag 16381 ccacccagtg taagcacgga gatagatgtg ttatgtttgg gttacagaga ggactgagag 16441 ctactcagca gcatctgaga gtcctgccag gggaattttc cctttttgtt tttttttttc 16501 tagagtctca ctctgtcgcc caggctggag tgcagtggca cacactcact gcagcctcaa 16561 actcctgggc tcaagcaatc ctccagcctc agccagggaa tttttccttt ttgttttttc 16621 ttttctaggg tctcattctg ttgcccaggc tggagtgcca tggtgcaaac tcactgcagc 16681 ctcaaactcc tgggctcaag cgatcctccc gcctcagcct cacagtgggt ggaactatag 16741 gtgtgcacta ccttgcccag ataatttttt tattttttat ttgcgtagag atggagtctc 16801 actgtgttcc ccaggctggt cttgaactcc taggttcaag caatccttct gcctcggcct 16861 cccaaagtgc tgggattacc agtgtgagcc accaccccag ctaggagaat gttctttaag 16921 caatatctcc tgtgtgctga tgtggagagg ggaaagctaa gtagaggcag ggtcttctat 16981 gaggacaaca aggcataagg gagcacactt catttcaaat catcaggaaa agcagaagtg 17041 tagggtccaa ggagagtgac aggagtgaga gagtggttgc agagagatgt ggctgtacag 17101 ctacagagaa gactgggtca taaagatgta tgcagtactc ggaataaaga gcacatagta 17161 ggtgctcaag aaaaagtagc tgagccaaat ctaggcattg aaatcatgaa gcctgtgtgc 17221 attatataga gtgatggcta gactgatcta ctctgattgt gcagtcttta gctcaaaaac 17281 ctgcagtggc tccacaccat caattgggtg gagcaccgag ctactcacct ggatgtttgt 17341 ggttcttcat gtcatgcctc agtggaggcc agcacattcg ggtaattgca attcctgatt 17401 gcaagtaact gccacttgct gtactatctc tgggattttg ctcctgctgt aacctccacc 17461 tgcagagccc acctctctgt ctcactctgt gaacatccta cccattgcaa atgtcattgt 17521 ctccacaaag gcacttctgc catatctagc agataggatc tcagcctctt gtgagaactc 17581 tcatgcctgg attgccatac aattcctgca acactttcct ttagtccgct ctccaaacat 17641 ggaaatgtcc aagttgaaag ggctcttcca gactatctac aacagtggct cttgaagtgt 17701 tttgggagag cttttgtaaa aatctgaata tcttagtccc atttcaggat gagaatcaaa 17761 acacatgtgc atgtacatgt tggaaagaaa aaattttcag 
gtaattttct cagacaccag 17821 aagataagaa tcacatatct cgtataatca cttcatatta tgatgaaatg agaggaaagt 17881 aacttgtcca aggtcatgtt gacctcaggg acaatggaac ttgagccctg gtcttttgac 17941 cttcatgtgc agctgcctct gagaacacac tttttgcaca gtccacaact aacttcctct 18001 gctcagccca accaagtacc atttatcctt tcaactgctt cctgccacct gaaaagtggt 18061 tgaaaccaca aactgctgtt gtttctatcc tattctcttt gtccttagag atggataaat 18121 actatgtgct attcatccta ctatgcgcta tgcatccatc tttttctttc aacttttatt 18181 ttaagttcca gggtacatgt gcagggtgta ccattttctt tatccagtct atcattgatg 18241 ggcatttggg ttaattccac gtctttgcta ttgtgaagag tgctgcagtg aaggttcgtg 18301 tgcctgtatc tttataatag aatgttttat attcctttgg gtatataccc agtagtggga 18361 ttgctgggtc aaatggcatt tctacaagag atgaatactt ttttattcct ttaccatagc 18421 gtgtcaggag agagagaaga tgcaagtggg tggtcagtca gccaaatgta actgtaggtc 18481 ctttagatgt aggaaacaga cagggggaaa tgtggccacc ttggaaattc tttgttcttc 18541 ccctcctcct ggggttgctg cccagcccaa tcaagaaggg aaaccgggct gcccactcca 18601 ccaccccctt cccctccccc cacctccctg tggccacagt cagaccagca caggcaaaaa 18661 gagaacagca agatttagtc atgcttttcc tttccaggct ccctcagcac atggaaggaa 18721 ctgactggag gctcttatca cacaggaact taatgtttct catgggtatc tcttccaccc 18781 caatgtgact gtcaaggaca cagacttctc taggctctag cacaaaggct aggaccaagt 18841 agtaactcaa ttaaacagtt gtgtaactga acaagctgga acagatggca taggatggac 18901 tgatgtataa gagatgctct cagaataaac actgaaccag ttattcatat gaacacattt 18961 cttcgttaga aatttgaaaa tcagaactgc ttggagatag gttttcttag tcatctttta 19021 tgaaataagt actgattaca gaggacaggt agagaccaga tagagttcag cccatgcagc 19081 tgcctcttaa gacatctcct ccagtggcgc ccagtggtgt ctgagggtgt ccaggagtgt 19141 tccagccctc tttcctcctc ctggatggcc cgagtggcct ggggcttgac cctgcctgct 19201 tgccttgcag ttagagttca gcctgatcct tgcagatcag ggcatagaaa ctgacaaggg 19261 agaagtcatt tgcccagcaa catacaacca cagaaatact ggtcttgctt tgtccagctc 19321 tggggtcaag ccttctgagt tccctcactt agggaggaag caagctatag aggaaatgca 19381 ggatagtcag acacaactgg gtcagatctc agctccaccg ctagagccta ggcaagtaat 19441 ttacactttc tgagtctttg 
ctctctcatt gtaagctcca caactagctg gacttggcac 19501 aactctcctc attgttttta agataaactc cccaggctga ggcaggagaa tcgcttgaac 19561 ccgggagaca gaggttgcag tgagccaaga tcgcaccact gcactccagc ctgggtgaca 19621 gaacaagact ccagtctcaa aaaaataaag taaaataaaa taaaaatcaa taaatcaata 19681 aataaactcc ccaaacctta aacacattta tttgaggtta tccaaaggat caagtgagat 19741 ttaagaggag ggaggggaga gaaaaagaaa aagttgaatc attcaaaatc attgtggagc 19801 accgaatatg taccagacac tggggatacc gcagcgggac aagacaaaca tgggccctgg 19861 tattgggaat tgtggtaagt caagattagt gcttcgagga gacggacaat gtcctgtgag 19921 ctgcagagtg tgcggaaatg ctcaccacgg ggccggacac atggtagatg ctcaatgaaa 19981 ctgtgtcctt ttcccctcga tcctggacca ttctcagcat cagcacatga acccaggcag 20041 taccaccctc tcccctactg gggcccattt cctggtctct ttcccttctc atctggtgct 20101 gcttgaaacc cacaccaagc cttgtttcat tgcccacatg agcagggtgt cctcttgggt 20161 cctcaagtct gcaccagacc ccctacctac tgcaagcaca aattcaaatc cacagcaagg 20221 acccgagcac caaaaaacac ctcgggattc aaagcccttt ttcatgtttt ccagcacctc 20281 cttatactct gcccacattc cctttccccc tatgttttcc gctgacattt agcacttttt 20341 ttttaaggag gcctctcgta tttacatatc agaaacaatg tgaaggaggc atggagaacc 20401 cctcatggca acaggaaaac tgaggggtgc cccaaaggat gtcacaggca cccagctttt 20461 caatgtgctg ctgattctga cacaacatgg cccaccaagc tcagggtgac atgcaactga 20521 cttggttggt ctcacagggt cctgcacagc acagtcggca gacttgcagc cacacaatcc 20581 cgcttgttgt ctgcacatac actgggattt actgcccctt ctgcctggaa cactctttcc 20641 tgctttatgt gcctaacttc ttgcaggctg tccaactcag agctcccaaa gtgaacccag 20701 cccactcctc acagcgggta tctcacacta gtgtaatttc gtggtggttc acactccttc 20761 ctgtctagaa gctcccagag agcagaagtc cctctgctat ggtcagcatt tcatgctgca 20821 gcgtttaaca ctgcactgat gtgcagcaac tattagctgg aaaagagaga aggagagatt 20881 ttaacatcac tatggctgag ctcatgtcac ggaggagaaa tgaagaccct ggaggttagc 20941 agacagacag gagttcacac agattgcctg gatcagaatc aggataacga gctctggcca 21001 aaatcctgtg cttggttata ctgcactgtg cagtagaaga ctgcaatttc catgtgtgct 21061 aaatgggcac catttgacac gtaagattcc agtgacagat tgcagttcaa cagtgtcttc 21121 
attcattcat tcattcattc attcagcaaa cacctttcag ctgggcctgc aaggatgaag 21181 gcatgattta tcaggagagg agaaggaaca ggggctagca tggtgagcag tggttatgag 21241 ggtgcggtct gaagccagaa tgccctggtg cccagtccca gcttccctga acgctatttg 21301 cgtaacttct ttgagttact taaccactga gcctcatcta taaacaggga taataatagc 21361 atctgcttat aggcttgttg tgaaaattaa atataaagta cttagaatag tgcctgacat 21421 atttctccgg gaaaatggga acagtgacat cactgtatga aagtgtgtag gctgctgtct 21481 tcagagcttg ggggtgtgtg aggctgcatg gcaaggggtg gctagaaatg aagggtgagt 21541 agggctgggt tcagttgtgc aagatcctgt ttactgagct ttgccttgac cctgttggca 21601 atagagggcc tgagctaacc ttttgctcag gacagaggat ggcccgatat gtggtgaggg 21661 ctgagggctg gtgccaaagg ctaggacaga ttgaaaggag gcagccaggt tggggaggct 21721 gggaaagagc tccagtgaga gatgaggagg gttgaggttg tgtatgggag catgtgaaaa 21781 tcaggatatg agctggctct gccacctgct gtctgtagga tttgggcaag ttcctctctc 21841 taagcctcaa tttccttacc caaaaagtgg gaaggttggc ctaaatcagc gactcttgcc 21901 cttgaaatag gattcttata tggatcattc aggaagcttt ggtcaaaagg acacatgctt 21961 gaatcccgct ctagatcccc atattctaaa tgcaggtcag ccttttccaa atgaaaaggc 22021 ttctcatata atccaatccc accatttctc taccctgtca aaagccccta acctggatgg 22081 ccactgggac actctctagc ttggcattta ttgactctca ggatctccaa gcaagggcag 22141 actcctgccc accagccatg ctcaaagcac cacacttgcc cattcctgct gggaccggca 22201 gtcggatctg gcctgttaac agcaaacact catgcggagg gggcctgctg gggaggttac 22261 cagaaacgcc ccttcccctt ctgatgaggc tcaaaggcag tgaaacctaa tgggactgat 22321 cctggccttt gaggtattta ggtccctcac ccactctgag aaccacactt gcaatccttc 22381 catctgaaac ttagcaagca ccttgaactg ggaatgtgtt gtgaactctg cactgtggtt 22441 cttgagctgt ttgatgctgt atcaaaaata ttcaccttgc aacatctctg tgagtatgca 22501 aggggtccct ggggtcaggc atacgccata tacagcttgg tgcccaggag tacaccggat 22561 gaaagtgtcc tttcatcttc tccctcctgg cccccactcc cgtagagcag gaagagccct 22621 taggaagacc tggggtagac cgggtcatat tacagatgag tagcctgagg ctgagaaaga 22681 aaaagtgaac tgtcatgtgg ggacttggta acaggaccct ggagacagcc ttcctcttaa 22741 ttcccaggcc aggctctttg tcttgtgatt tgctggccat gtgctgatat 
ctggagggct 22801 tgctatggac agagagaagg tgaggatgct gaagttctgc aggattccca cagcccacct 22861 ggaggccagg acacagtgac tttgcaggga gtggtggggg gaaagcccct acattttcac 22921 cccacagagc aaagatgcca ggggcaggaa gtggcaccat tgacttcaaa atatgcccac 22981 taaaaccaac aaggccaccc atcattgagg tgacacaccc acaggctact gggaaagtcc 23041 cccaaggagt ccttcccttt ctggggtggc aggtggaaca gcctgggaca ctccatggct 23101 cacagcccat ttcccaatgc atcgtaggcc cacagtcctg ggattccatc tgtccatgtc 23161 tctcaggatt ggcccacttc gtaagaagcc cccgtcacct ggtctggcct cctgggctct 23221 ggagtgaact aaactgacca cagatcccgc agtaagctca aactgctcaa cctcagcaac 23281 tgaagctgag cctcagacac accatcctgc acctggcctc ctggccttct tctccatcct 23341 ctgatggcgt gctatccaga catccatatt ttaatccagg ttgccaccca ttagtacctg 23401 gtcaattctg cttatcaccc taaacctctg agtcatcatc tgttaaactg ggtttacaat 23461 gcctctgtct caggaatgtc gtaaggacta aatatggaac tacatgcaaa ggctttctag 23521 cacagagcct ggctcagagg ttgtgatttc tctgtgttct gtgtttcctc cctgcctgcg 23581 tggtttgcct gaggcatctt tataaatgct ctcaagagct gattctaagt gagtcaggtc 23641 ctatttctca gctgtggctg gctagtattg cttccttaga agagtgtgag tttcaggaac 23701 caagtttaca agggagcacc tgaagtcagg actgctgcct gccagatgta actagaccct 23761 gcagtctgtg ttccacccac cctgtgaccc tgaccaagac aggtgacata atttacatgg 23821 ttcagtgcaa aacgaaaatg atgactctct tcttcaaaaa ctaagaattt caagacagca 23881 caagttaaat gccatgaaac caggcattac tactgcgtta ctctctggag tcacctctgc 23941 cctggcctgc aaggtaactt tccagaatcc tactgttatg gttttattgg cctgttaaag 24001 tccttgacat aaagatacat cctaaatgag tcagagagat gcatttcact tccccctaaa 24061 atagctccaa gtggcacatc aaccctcaag tcatacagca cccccagcta ctctcccctg 24121 ctccccagcg gaatcaaagg ccagagctgc tgtcagcaaa tcagatctaa ggtggggcag 24181 aagggattgt ggagaaggga cagatgtgaa aaacatgaag gcattcattc tgctgatctg 24241 gattccttct cggccactgt ttccagccag agctaatctc aactaaagac tccttgtacc 24301 acccagatat ggcatgattc agcaatgggg tgtgttgggt gcccagcaca gtgtcttgca 24361 taataaataa gggcctcaat taaaggaatc tgtcaccagt gtcctctgca acctcttcta 24421 gccaccttca atggatgggt cctctgatat 
ccagctgtga caaagcaggc agatggctct 24481 gaacacccca caccccgtct cctaggccct cttgtacatg gccataaaca gcactgggga 24541 aagaaaaagg aaatagggaa gctagctttt gtactggctc tcccaccctt ggcctctgtc 24601 tatatcagtg tcctcatcca taagaggtgg attttagtct ttttctagct taggtgacca 24661 gaggttgtga gcctcacaga agacaatgac tggatggaag tggtgtttgg ctgagcagaa 24721 ggttatcaac actaccatcc ttctaccatg actgcctctc actcaccttc cttcccagcc 24781 aagagattga aataattcat gccactctct ccagtttctc acctctcact ttctcaacat 24841 gctgcaaact ggcctctgcc ccctgctaca tcaccaaaac tgcaatggca tcaccagtca 24901 actgcctgtt gcaaatccaa ggcccccctg gctgtcatct cgtctgacct ctcagcaccc 24961 ttgatctgtt gaccattcct tcctgaaacg tcccttgcca tggttttcat ggccctcctg 25021 gttttccttc aacctgtatg gctgttactc tgagactggt taggggcttc tcgtcctctg 25081 tccactcctt aatcttgatg ttcccctatc cctctgtagc attaagatag ttaatagcta 25141 atatttacta aatgcttcct gtgtgctagg tagttctcta agcacttcac atgcgttaat 25201 gtgtcaaatc ttctaaagcc ctggaagtag atgttatcat tgtcccccac tttgcagatg 25261 agaaaactga ggcccagggt tgtctgagat tacttagcta gtcatcaggg aggtggtatt 25321 taaacacaag ctgtctgcct ccagagttct gaacccgtcc tccacaccgc ctctcagcag 25381 acttgttcct gaatgctgtc caaacatttt agcttatcct gcaagcttcc ttctaatctg 25441 gccacaacct acaccaccta taacatcata tacagaggat gtcccctagg ccgaatttca 25501 caccgttcca cctgacacac cttgctcttt gcctgtgctc ctcctctctc tctggggaat 25561 aaacttctct ttgttttatc agctaatgag cttcactctt ctgacaaccc ttgttcaaaa 25621 ttgcctcctc tgtcacagag gccttctcca cctcttccca actctccctc ttctctgagt 25681 gccaggcact ttgctcttct ctctgtggtt atgcatattg ccttataatg taattcattc 25741 attcccatgc ctgccatctt ggccagactc tgagctcctt gcctaacaca atgcatggat 25801 gtaataagca ttggcgaatg ctgaatgaac atgtaaatga acgaatgagc tggcgcacat 25861 ctacatagca cctgagactt tcacctggcc tactccgtgt tttaacaaac tgagccaaaa 25921 aacaaaacac aaaaacccta agcctttaca cccatatcta tacaggctat cacatgcatg 25981 cagcgacaca tgcatgcaca cattccctcc tgagccctca ttcacttcct ggctcagctc 26041 ccttaggcta tcttagcctt cgggccattc tccccatctg cacaggcttc caggcatctg 26101 aaaatcctgc 
ctttcctaca atgtgtcatc tcaattctgc aaggactgtt ttccattctt 26161 ccataatatc cattgcattt catggggcct aacagttaca atctcaccac aaagtcatgt 26221 accatgtcag acaagacagt ctccaagcag aaatcagtgg ctaccagttg tcataacagt 26281 cctttatatg ttcacaacga cttgtggctt ccttagcccc ttagacccat ttgtaacttt 26341 gagcctttca accaggtatg gaatagttaa tataaatcta ctttataaat aaggaaatcg 26401 aggctcatac agattaatat atctctctca cacacacaca cccctccacc caaatgccct 26461 gggcaaagta ttgggctgag tgtttaatag gtagctagtg atggttcact gtgtcttctc 26521 tttccccaat gggtctaaag tctcctattg gagaacccat cagactccac atgattctgt 26581 taatatatgg tcttatctcc ccaaccacag cccctcaagc ccgaaagact gcttgttcca 26641 cattgaagta ggtccatgcc aggtgaacat gcgatgggtc aaagccatgt aataagatag 26701 gtctcctcca aaggcctgca aaggagcatt tctttttagc acccacaaaa gtatttccca 26761 gggcaagcct ctgccaatgt ctcccctgtg taaacaagta agcaggtgaa tgcattctgg 26821 ctttactaat tcaggtacat aaacagaaag gcaagatgaa tttaagatta ttattaaata 26881 aatgtggttc tcttgctttc aaccatatat aattcttcta acagtattat cgggttgaag 26941 gtacttagaa gatatgcttt aatggaggtt tttatcagga tttgtaatta gccgagcaat 27001 tgtgagcact ttaaagtact tggaagcaga ttaaacccat taaagagaat caaccactca 27061 aaaacacaaa agcagtatcc atttacatgc cggcaattgc tgtgccaatt ggaaatgcat 27121 atgctcaagc agctactgta aatcttactc ctgatgcagt atttgacatc cctcccagca 27181 agaaacactc attcattaag taaacaaata tttaaaggat gcctactccg tactgggcac 27241 tgttctatgg gctggtaact gaaacacaca agacacttgt ttttgtagag ctgacacctt 27301 gcgagaaagg catcgttttc atgcaagcta gtatgtgaac aagataattt caaacaggat 27361 aagttgatga aggaaacata acaaaaggat ataactagag aaaggattac agagtttgca 27421 tttgagctga gacctgaatg acaagaagat gtgaaaaatg aagaggctga atattccagg 27481 cagagagaat tgcaaacatg aggcctaatt catctttgct tgtgctacct tctgaataag 27541 acaaagattt aggtaaaggt agttgactga ggaggtgcag ggaaaattag taagagagta 27601 ggggagcaaa acagggaagg gacacactta gtaaaggctg tcttatcaag tcagccccat 27661 tgaggttgaa gtctgagagg aaggccaggg accaactcac gtaaagcctg gtaggccatg 27721 aagtattaga aagtcagtct gtgctgggca tggtggctca tgcctgtaat tccaacactt 
27781 tgggaggccg aggcaggtgg atggcttgag gtcaggagtt caagacgagc tggccaacat 27841 ggcaaaaccc tgtctctact aaaaatacaa aaattagcta gacatggtgg cgtacgcctg 27901 tagtcccagc tattctggag actgaggcag gagaatcact tgaaaccagg aggcggaagt 27961 ttcagtgagc agagatggca ccactgcact ccaatcttgt tgacgacaga gcaagactcc 28021 gcctcaaaaa taaaaaaaaa aaaaagaaag aaaaacgaaa gtcagttttt gtgtattaga 28081 aagccaatta tgaaatgcag ccagggatag gaactgactt ctatttttta gtaagacttc 28141 tctctcagct gtatggagaa cagattttca ggacaataag aaccagtcaa gaggcaatag 28201 caagaagcca ggcaagggag ggcaatggct ccggctggcc cggggacagc gcaaagtgga 28261 tggatccagg tatgtcataa aggtagagca ggactgactg gtgagttaga tgtgagggac 28321 caggggaaag aaggatgcag ctttggctca ggtgtgccag tacagagtaa gaggaagaca 28381 agagaaacaa gtttcatctc gccagtctcc aggatgggct ctgccactct cattgattgt 28441 gtaatcgggc cagccactct cccaaactgg gcttcagtgt tctcatttgg aaaataagag 28501 agaagaacta atcaacttct taggaccctt tcaacgttaa tatgctatac ttctaaaaaa 28561 aaatcagcct ttgaagtttt agttattttt aataaatcca taaacccatg taaaccacca 28621 ctcaaaataa aagcaaggat cttgacaata agatacatcc accatatttt ccctctatac 28681 tgctttccct gactccccca acctaaggaa accaagacct caatctccct tactttctct 28741 tttccatata tttgaccaat gtgtgatgtg tgtttctaaa atgtatatat atatttttta 28801 ccttagttat ttttacttta taaaaagggt atcatactgt ttatttatca cactgtttat 28861 agggacttat ttgttgaata gtattttgtt aatatctagg acattatcat tatcatgtat 28921 cattatcgct gcagttcatt tgttttaaat gccatattat attccattgg ataaacattc 28981 tattgtttat atagtcactc tcttttgatg ggcatttgag tcttttccag aattttgtca 29041 ttgtaaacag ttctgaacat tctgacacac gaacattctt atatatgcaa gacttccatt 29101 aggtatatat gaaaagcaga atttctgggt cagaatattc actgtcatga gaaagttata 29161 tctatcagct ctaagtgtcc cctcctctgt gaaatatcta ttcctagttt ttcccatttt 29221 tcttttgggt gtttgtattt ttctcatcaa cctctttggc ctgcttcctg cctgtagaat 29281 actatcgctc aagatccttt catgtcctta aaaatcagtc ttatttaacc taggctggca 29341 gcactttcag caatgtaact ctctcgaaaa cccttttcga aatgtcagta gccaaatcca 29401 ctattcattt tgagacatgt cttgctctcc ctctatatgt 
agtttcaagt actctgacat 29461 catcaggctt taggagacct ttccatgaga cattctgtct aatggacagg attccctggg 29521 agtgccgtat tcttatcaga agtcttaaca aaggcagtga cagttgtgcc ctggatttgg 29581 tcttggtctc caggctaagt gttcatagcc atgcctttgt tttgttttgt gccttaaagc 29641 tatgtattat tttgagaatt tttttaacca cagaggttgg aagtgagaga ttattgcatc 29701 tttcaacact cttagcccta gcatttctag actttctata ttccctttca gttccacttg 29761 caaatgagcc tgccctttgc tgacatcatt tctttcttgt agtacctcat gaaatgtagc 29821 tagtaaaagc taactcacac taaccaattc tgtctcaaaa cctcttctcc caaagctata 29881 aattcagtag gtttattttc tgccctccga gttatcatat gtgacagttt taccaaatgt 29941 ttcactacag cattccatag gttgccattt ctctagcccc tataataatt tctctgctgt 30001 ctgccaccca gctgcaaagc caaatctgta catcttatcc tttggttaga gcaatttctc 30061 tcttctaggt attaatttct gtatcggcca gcttttgctg cagcaacaag caaccccaaa 30121 atatctgtgg ctgataacaa gaaacatttc tcttttttgc tcttaagtct gcgggccaac 30181 tgcagccctt cattaagctg tgggtcatgt cagtctttgt cactagtctt cccaattcca 30241 gggtccagct aaaggaacag ctcttgctca ggacaggaca ttctactggc agagggtcca 30301 agagaaagag gattatcaga aacttgcaat gctttcaaag cttctgctgg gatctggcat 30361 gtcatatctg ctaacattct tttttttttt tctgagacag tctcactgtg tcacccaggc 30421 tggagtgcag tggcgcgatc tcagctcact gcaagctcca cctcccgggt tcacaccatt 30481 ctcctgcctc agcctcctga gtaactggga ctacaggcac ccgccacaac acccggctaa 30541 ttttttgtat ttttagtaga gacaggtttt caccgtgtta gccaggatgg tctcgatctc 30601 ctgacctcat gatccgcccg cctcggcctc ccaaagtgct gggattacag gcctgagcca 30661 ccgtgcctgg cccatatctg ctaacagtct attggctcaa gggaggtaca tggccaacct 30721 caaagtcaac ggtgtgaaga aatacattct aaaccctttt aaatgagaat agagcctcct 30781 gcagatttga aggagctctt gatatgttct tatattaatc cttttggtat gtgtattgca 30841 acataatctc ccatttgtaa cttgttttta agttgtctcc ttaaaaagta tgttaatatt 30901 tttattgaaa agatgtatag gtttaacaaa ttgtagtagc tccactatcc agacaaccat 30961 tttcattttt attgaccttc tagtccctgt acacattaaa caaactccac cccagagtta 31021 catactataa tgagtaactc aatgcaatta tcaatttcta caacatgcaa tgctacccta 31081 agaactgccg caaattttaa 
gtccagaaga caaagaaaca gaaacttagc aaaagataaa 31141 tatttacaaa tacagtgtag aaaactgcaa aatatttcaa cagttttaaa aaatgctttc 31201 tttcaggcag actccaggga caacagtgag cagggaaccg cggacagtct ccagatctca 31261 gaaagaggaa cttgagcaaa aacgaaacag aaaaaaaaaa aaaaaaaaca agaaaaacta 31321 ttctttcgaa accttccagc cacctcaaag tgagacccgt aagcaaacag gtatacttca 31381 gaagtctcta attctcatta gttgactttc agtaatcgat gttggcccca tgcagatgtt 31441 ttaaatcact aacacaaaca ttgctgtctt tggtattctt gcagtgtttt catttatgag 31501 aaaacctgca gaagaaactg cttgaaccaa agcaaagtag caaggaaaca ggttactata 31561 taatcagaag gccctaagaa aggtagcagc aggtcatata attacctcct tgtggataaa 31621 atccctgtcc tcaaactacg caacctacta cctgtacttc ttccttagga accaattttt 31681 tcttgccagt cttctacttg gcagatgccc tttaaagaaa cctgtgaaat taagccacct 31741 tttttgattt ttttttccaa attcaaacca attagaattg tttcacagat gcttttaaaa 31801 agttattggc catatgatgt tcatcaaaac cacaaatgca tttgcttaaa accaaacttc 31861 aaataagtta ggaaggggcg aaagtaattc aattacatta gaagatgcaa accagtagtc 31921 taagcctcag cccaccatat attgttggag cacatatttt attgaaagcg tttaatcttc 31981 aaaattcaaa catcaaatct cataaaatct ccggccttcc ttgatcaatt agaaaagtag 32041 atgacacaat gtaactcctg cttgctgcct atcacctgga gcaaattaac accctgcctg 32101 tcccctttag atgccatgcg ttttctagtc atagtcagct cctcctggga ctctgaaggt 32161 atttcagctt ttcagttttt gacccctgaa ttatagtaaa tggaagcaaa ttctgatcat 32221 gatttttgtc aacattaaaa acaagctttt tgaaactgtc actctctccg caatggtact 32281 tttaagtatt tttacaaaat gtacccattc tccataattt ggtccgctta ctccgcctgg 32341 ctattaaggc tgtaaaatgg acccttatgt aattaatgaa acatgtaaat tcatttttct 32401 ccaggcctta cttagttaat tagttagaaa aagataagtt tccaactgca gagtagtagt 32461 aatatctaca gtggagacac atgtcagaaa accacaaaat tcctgtgaat gaaagaatga 32521 ctgtacttgc ctcatgttct gcaggcaagt tatgcatgtt gtcagtttaa ctaagattat 32581 gccaaagaga atttcgtgaa cattctcact tttctttggc atacaccgaa aggcaacaaa 32641 tgtaagtttg tttttttttt ccccaatgtt tacgcaattt ggaacaaaga ccaattatca 32701 atgtaggtgc cctaccatac gtctgggaat acataaaaag cacaaatgtg gtccaggata 32761 
catcactttt tttgggccca cccaagggag agtttgatgt ttcctcttac gtaaagttac 32821 tgttgtcaat ggcactgcac tagatattaa cattcaagat aaatgaaaat atttgtgagg 32881 tggtaacctt ttcctcacct tcacactttg ccccttttcc ctcttgtccc aaaataaaat 32941 aaacagcaaa cacttaaaac agggagcttt tagccttgct tggctataag gcgcaaggta 33001 gaatatggtt tggctattgg aagaaagctg aaacatggac ctttcacagg tattaaaatt 33061 caaggtaacc tttttagcat tttgcttcta ttttatttaa agcaacaaat tatagatgtc 33121 tcatctgaca ttttcaagta attccccctt tgtgaataaa ggaccacttc aagtttaggg 33181 agcaaagttc agagcttagc gacttaccgg ggagcaaggg tagagtgcag agcccatgtc 33241 tccttttcca ggctggtggc gaaactgcct aaatcagcac agcaatgcgg cagtgctcac 33301 agcattctcc aatgccatcc aaaaccaaac tgggaatgaa tcataacaga catgactttc 33361 tttcagctta gagagtgctc ttcaatttgt atttaccttt accactcaga acctcacctt 33421 ggacacagat atcaggatgt aaggcagtga acccactgac aattccaaag ggacctagtc 33481 ccctattaag atttctcgag gagtccttac tttctccact ctttgagatt gttgcttaat 33541 ttcaaatttc tgaaatatac catatgtgaa aattgtaaat gttaaatcac tgcatctatt 33601 gtactacagt tgaacagaga aaattaacaa ctgataaact gtaaatacaa aagaatgtca 33661 aaatttatta ttctgagttt taatatcatc tagataaaaa caaacaaaat aatgtagcaa 33721 gcctactcct cagtcaagtt gtcgttttat gaaagaaaaa tcaattttaa tacacatgta 33781 tcaatctctt attttatagt caaaactcct ctgtcttaag acacaaggtt aaattgtcac 33841 ccttagcaaa ctaacacagg aacagaaaac caaacaccgc atgttctcac tcataagtgg 33901 gagttgaaca atgagaacac atggacacgg ggaggggaac atcacacaca ggggtcgggg 33961 gaaaggggag gaagagagca ttaggacaaa tatctaatgc atacggggat taaaacctag 34021 atgatggatt gataggggca gcaaaccacc atggcacatg tatacctatg taacaaacct 34081 gcacattctg cacaagtatc tcagaactta aagtataaat aaataaataa ataaataaac 34141 acatatatat atttaaaagt tttaaaacta aaaaaaaccc ctattctcca agatttaaaa 34201 aatatttcta ttaatatttt ctattaaaaa ttttaaagac tttttaaaca tttaaccatg 34261 tggagttgat ttttatatat catgtaatgt aaggatccaa tttaatttat ttgataggga 34321 ttaccatttt tccaggccta atttttgaat ggtctctcct ttccccatta atctgacaca 34381 caaaccatat catataacat agctttatac atacgtgctt ttgttcctgg 
acttttgtat 34441 atcattttat tggtcagctt gtgtatcgct gcattgacgt acaactcata acaagttgtg 34501 gtatctggta acgtaaatcc cctctgggat tttttttgtt ttaactggaa gtgtctgtct 34561 atttttaccc ttttgttctt tttttttgag acggagtttt gctcttattc cccaagctgg 34621 agtgcagtgg tgcgatcttg gctcactgca acctctgcct tccggtttca agcgattctc 34681 ctgcctcggc ctcccaagta gccagggtta caggcgcctg caaccatgcc taatttttgt 34741 atttttagta gaggtttcac catgttggcc aggctggtct cgaactcatg acctcgtgat 34801 ctgcccgcct ctacctccca aagtactggg attacaggca tgagccacca cacctggccc 34861 cttttattct tctttaaaat ttttttagcc atttcatcaa gttccaggaa acactctgtt 34921 gggcaccaag ctatgctgtt actatatact ggcttattga accccaacag ttacccagtg 34981 aaggtcaata ttattctccc catactatag atgaggtgca cagaatagac tggctccaca 35041 gcttcaaagt agaagctcca agattcaaac ccaggcatgt ctgacttgag agcccctgtc 35101 atgcccacat cacacagttt gggttggcat gtgcccagct aggagatgct gagtcaatcc 35161 tgccctctgg gacccacaga aaggggaggt tgtcaaaggt ctcatggaac agctggtgtt 35221 tcctgtcatt gttctgcatg cctccctgtc cccagttcag ccatgcagcg ccgggggtgg 35281 gagtgggacg cctgagccca cagaacctgg taggtctgca gagtcactag cggtgtatgt 35341 ctgtcacttt gctgaggatt cctagcgaca gcagaaacca tgcaggctgc gtttgaggca 35401 gctcattttc gctaattaag gaaaacaaca gcaaacagaa gcctgacatc tggaggtgat 35461 tcatagtcta agaaaagggc agttagttgg tctccacagt agaggagtct ctctctcttt 35521 ctctttctct ctctctctct ctatctctgt ctctctcttt cctcctcacc ccctgcacct 35581 taggactggg gagatgagta catgtgagta cctacagttc ttgaaagaga agaatatcat 35641 cttattactc cccttacata caatgaaaca aagatggtta aataactggc caagtgattc 35701 catttctcta tccccttagt agaggatgtc atattaaggc agctcatttg caaagttgtc 35761 atgagcaatg agagcttgta gttagagcag tggctctcag tactgtgcca gcagcttccg 35821 cattagctgg gaacttgtta gaaaggccaa ttcttgggcc ctactgagcc ttaaagtttt 35881 gggatgaggc ccagtaatct ctttaacaag tcctccaccc cgagtgattc tgatgcttct 35941 cacatttgag agccactgat agaggggatc ttccacagca cctgacagag cgaattctca 36001 atcaatggta actatttggg ccagagctca aaatgctcat tcatgaaaag ccaaagtctc 36061 agcctctggc tctgctgact ccagacaact 
gtttccatca agtgtcgtct taattcctag 36121 gagtaagacc tctttctggg aagatttgtc ccaaggtcag acatcggtgt attggtagag 36181 attacctgga gatattccac tttagggcat gaaagaataa caacagggct gaggttcagg 36241 catgtgaaac actgtggttc ctgtgtaatc ttatttaaga cccttggagg ttgagtctgg 36301 gactctcacc aagtcaaatg tgcatgtgac attattatat atgcacacac ctatacatac 36361 agcttctcat tcatacatag gggttgttaa gtctccatta tccacaagta ggttagatct 36421 ctactgaagt taacaggttg gtttaggttt gacttaactt ttatttgaca gatagccttt 36481 ggaattgaaa atcattgtga taaaacatta gagtacatgc aaaaagatta atagtcctat 36541 caatttcaaa gaactacttg tttctcactt tccattattt agctttttct acataaatct 36601 tcatatcatc tcaatcatca tgaatgccaa attttgtgtc ttcttttttc actgaatatc 36661 ataaacattt tctgtgatgc cccttagttt tcataactgc cattgtgatg acagaaaaat 36721 atttcatcaa gctgaagtta cacgtctcaa tatatagata tttagctggt tatttaggtt 36781 tttgatatgt agatatttat tatagatatt tagcttgtta tttataaata acacttctgt 36841 gactactttt ctgcaagtcc cattttctac ttacaatagt tttctttaga ttagcttcct 36901 agaagtggaa ttacaggaag agagggcatg acacaactct gactacagga acatacagaa 36961 aaaaaacatt agttaactct gttggtagca gtattaatac acctgattta caaaatgtag 37021 tctttatccc tgagtggtat tatttctccc cctttttgac aattaaatga gaattcaatt 37081 gtatcctaag ttattggttt aatttttcct tttgtaaatc ccagctcttc ttttttccta 37141 aagacacagc tggtcaaatt gggtttcaaa gaaaatttat ttcagttagt ttgtacctat 37201 cttcacatgc atgtgtttct ggatacatgt gtgtgtctgg gtatatgacc atctctccgc 37261 atgtgtacct atatccatgt gtatggaaga acacatatgt tgtagactgt atttgcctga 37321 ggttccactg cagatgctgt ttctgagcat gcattggttg gaatatgttt atgtgtatgt 37381 gcaaatatct acatctaagt gagactgtgg atacaataca actaatgaaa tcaaacacag 37441 actacttgtg gcttaacacg gttcttccga ctgcactttc ttgttgactg tcaccttctt 37501 gcatagctcc actaacatag ggtaaaaact aatgaattcc aagtaacagc gaagcatcca 37561 gcggaattga agtcatttga tcagcaaaag cagacagatc actacaagga ggaaactggc 37621 ttccttcttt ccttccttcc taccttcctt ccttcctttt tttccccttt tttccttatt 37681 tctttctttt gtgagacagg gtctctctct gcccaggctg gagtgcattg gcacaatcac 37741 acctcactgc 
agccttgacc tcctggactc aagcaatcct cccacctcag cctgtcgagt 37801 agctgagacc acaggtatgc caccatgacc aagtaattaa aaaacttttg tggtctcact 37861 atattgccag actagtctca aattcctggg cttaagcgat cctcccgctt cggtctctct 37921 gaatgttgag attatggtca cgagccacta cacccagtcc tggctcttct tttcatgcag 37981 agaatagata tgtcctcagt ggacttggta tccaaaaacc acttgtcagg ttaatgacac 38041 tgggcaagcc actttacatt tccccaaacc ccagtcctga ccctgtgttc tctgagtcta 38101 aactagtggt cagaaggaag gatgctcatg aaagctaaat aagatagtgc ccatgcccac 38161 agctccttgt ccgcagaagg tgctcagaag ccacggcctc ctataggaaa gggagccttt 38221 cttaagtccc tgaaagagac aatgaaggct cataatctta agggttgcca aaagtctttg 38281 caaaattctg ccttgaccag cgctaaagcc tcacacacca ttgctgagta cgtgcagtgt 38341 gagggtagac caaactctct atgccatggg gaccttgggg agctagtcaa aaagtccaca 38401 tgggaagtgg gagggagtta gaaatcactg ggacaaatct gtacactcaa gcttttaaaa 38461 atccaacaga actgctctcc tccttacacc tttatccaaa caactttttc agggaacccc 38521 aaactatgac acagacaagg ggcagggact gaggatgggt gggctgccta ctcaccccct 38581 ccctgtcctc tgacaggggc ttctagggca atgccacact ctgaaagccc cctagctgaa 38641 agccatgcaa ctagatggcc cctgagtctc tttcagcatt gccagccatt tgaaggaaag 38701 cattctaagt catggtctcc aagcagtagc aacaaggcca actaacactt attcagcgtt 38761 ttctacatcc cgggttccac gcttcacttt ttagatcctt tctctcactt tatcctcgtg 38821 tattccccag gaggaagacc ctgatagtat tttgacagca ccgcggctgc tgtccttttt 38881 ccatcatggc cctagttacc cagccctgct aatgcctgtc tcagggctac ctcccccact 38941 aacgaggagc ccttggaaac agagcgctgg tgcactgatg agcctctgtg cagctttgcc 39001 tgtggaatga acaccccaac ctagacctcc aaggagctga acaaccctcc ctagcactgc 39061 tgggctccag gcatttgcaa agggccctca gttctgagaa cccctcaggc ctctgaagac 39121 tcattcttcc tccctcccac atatggtagc ccttctccat ctctgggaaa tcccagggca 39181 gcacctattg ggtgcaaagt gggtgttcag tggacaaatt tcttcccccg ccaaaactgc 39241 atgggggaag ggacattcaa actctcttaa ctgtaaggac tcccctatct gtccctccct 39301 ctgctttaga acttaatatc acataggagg gtttgcttaa gagcctgggc actggagtca 39361 gctggacata aatacaaact tgctgatgat gtgatttgga gcaaattctt taacctctct 
39421 gaacctctaa gagaaaacag ttactaactc attatgacca catagtgacg tgagagtcaa 39481 ataagataat gcacggaaag tgcttcgaac agtgtctggc tcacagtaaa tgctcagaga 39541 gagcttggaa agaaatcctg ccctgcctca gaggctcaga gcagcaagca agaaagagcc 39601 aggtggtcag agtggaacag ggagcagcta aacttgatgg gagacaggag caggaagact 39661 gcccaattca tgagaagccc aatgcccagt cagtgggagc aggggagggg acagtgactc 39721 accacagtga cttactcaaa taatctgccc caccaccacc accaccggca gacaaaaaag 39781 gaaaatctac ccaaatatta ttagccacag gaagcagcag agtttaggca gaaaggcctt 39841 ttggtggtag aaacacatgg acaatgtgct agcaaagcag aaaaagagtg ccaggcaggc 39901 aggaaggaga gaagagatta aaagtcaggc tggctgcaga acagagagga agtcattggt 39961 aacagcttct ccaggcccca cagctccaga gagcaaggtc aaggaagcta aaaagatgaa 40021 gcccctaggg acccaggaag gtgcctgctt cactccagca aaatgtcccc tttcctttaa 40081 tccaagatct ttttcccatg gtctgaggcc tggaaagaga agagggacat ttagctggac 40141 taagccccta ctctgtgccc aggaccaggc tgcaataagc tttacctcct gctatctctc 40201 ttgcctccaa gaacccaaca gtgcaggggg tagcaccccc caggcacagc tgagaagcct 40261 gtcgctagat acaggccaga gtggggattt aaacacagac taatggcatg cttagctcca 40321 cgcttttcct ttcaggcagg taaagcatca tgctgcggca ctggggttgc cagtagccac 40381 ttccatcaat tatgtgtgac cttgggcaac caactcgtct ctttggggct cagtttccat 40441 ggctataaac catcacctac agcatatggt gttgagagga ttgaaggagc tagagctgag 40501 cctggctcag agctggcact cagtttgtgc cacctgtgga gcttgtattc atctcaagaa 40561 aagccatgcc tcaggagaac agaagaaaac atgccgttcc cagcccacat tggtggtagt 40621 ggagagggca ctgggcttgg agttcagaga cctgaatggg agcccaggtc aacaccttcc 40681 tagggctgtg atcttggatg agtcactaga ctcctccaaa cctcagtgtc ctcatctctg 40741 cagtgaagat aatggctggg aataataccg gtattcctaa agctttggaa ggctgaggca 40801 gcaggattgc ttgaggccaa gagtttgaga tcagcctggg caacaagcaa gactctatct 40861 ctacaaaaaa atttaaaaat tagctgagtg tagtggcaca tgcctgtagt cctagctact 40921 caggagactg aggcaggaga atcacttgag cccgagaatt caaggctgga gtgagctaga 40981 atcacaccat tgcactgaag cctggtatca cagcaagatc ctgtctcaaa ataaaataaa 41041 ataaaataaa ataaggacgg gccgggctca tgcctgtaat 
cccggcactt tgggagacca 41101 aagcggaggc atcagttgag gccaggagtt tgagaccagc ctggccaaca tggagaaacc 41161 ctgtctctac aaaaaataca aaaattagcc aggcgtggtg cacacctgta atcccagcta 41221 ctcgggagcc tgcggtacga gaatcgcttg aacccaagag gcagaggttg cagtgagtca 41281 agattgcacc actacactcc agcctgggca agagggagac tccatctcac aaataaatac 41341 ataaataaaa taaaacaaaa taaggataat gactctcctg actgcacagg atcatggtgg 41401 agggacatcg tggagatgac agcacctagc agggatcggg gggagcagag accaggaggc 41461 acggactggc ctttccccgt ggtggcaggg actgatccac tggtggccag tgacagggag 41521 aggggaagga gcatgggaga agatgcagat acccagaaag atccaaacct ctaccgcaaa 41581 gtcaaggtag acatttcccc agccccgggg agtgagcctg atatgcatgg agcaaaggta 41641 aatggggagg ctatcaggct ggggtctctg tccttttctt tgtcaccctc tgctcaaggc 41701 agccactagg gagctgactg gccacaggga ctttcttgcc atgtaatcct ggggaaagct 41761 gctcacccac tttgttcctt ctctttttgt tgtccacact aggagacagt gattctcgcc 41821 aagctcctgt gtgggcagga tgagaacgag gaggaataaa ccactcagaa ggaaagagat 41881 ggaagatcac ataaataacc acccagtggg gcattcaaca gtatctccct ggctgagggg 41941 tgggggtaag ggcacatcac aggaaggtgt gacagaggag gaccaaacca cagagacgaa 42001 aattagcaac aaggctggga agggaaaccc ttgtgggtaa aactttgcct ccacctcagc 42061 agcatacatg gggaggcaga gctgctgtag ggcaaagtgg gtgtgggtgg cggttcctca 42121 gccccctgct ctctcccagc ttccatctgg tctgcgtgaa accagaatgg aattggaggg 42181 catgaaagat ttgggataat agaccatttg tgtctttcct agctgtatgg ctataagcaa 42241 gacctcacct tcttgagctt tcattttctc atctctgaga tggaaaggtg atatctacct 42301 tactgagtag ttgggaggag aaaatacagc tgtaagaata gtaactactg ttacatgctg 42361 gggtgcatgc aatttcatta aactttcacg acaactctat ccaatgaatg ttttcatatc 42421 catttacaga gaaggagaca gactcagaga agttaagtga ttttcctatg acaaacagct 42481 gataaacagc aggattaatg tcaggcccaa gggttgtcag acttcatcca cacgttggag 42541 cctgacataa cgtaactgat aaataatgat tcattcaccc ctccaaaatc ccttccctgg 42601 ttcctactcc attcccagtg gctgcctgtg gctgggctca ctcctcggtc ttggtccagg 42661 actgtggttt ctgtgcaggc tgccagtgaa gaagtctagg cctgtgatcc acttactcct 42721 gatatcacct ctccctggaa 
gtggctcctc aatttgaccc cacacgctga tgacaaatac 42781 ttagggaggt gatctagcag gaggaggtct agtgatacag gatgcattca tatctgtgat 42841 cttgagccca ggctgccttc atgtagcttc ttttgtctgc tgctgtacag aggtgggaga 42901 tcatgaaccc aaaaaggaga cagttgaagg tgtggaaccc aatgctgcat caggagagtt 42961 cattgcaaag atcactcgtg acctaatcct cccctgaaga aaacaagcag ccactggtct 43021 ttacccttga agtctctctg ctttccctga caggatattg caagaatgct ttgggatgga 43081 gggacgaagg gtgatgggct ttcaccaagc ccctgctgtc tgccaggggt tacaatagga 43141 acttgacgtg agtctcttca ttgaggtgcc tcagcagtcc tgaaggacag gcatggtata 43201 ttcttaaaag aaagcactca ggctcaagga agtttgtcaa agtgccaaaa gccacctagc 43261 tggtaagggt catcaaggcc cgtctgactt catgtccacg ctctttcctt tccaccaggg 43321 ccccagtgga tccgcctggt taccatggga gaccagtgct tcagaccagc catgccaagg 43381 taggtggacc ttgcttggtg atgttcttca ttgtagacct tcctgcagat ttcagtctcc 43441 ttttcatctt gagccccaac atgactgtat ccacagatca cctatctgac atgtttcctg 43501 agcactcact cattacgccc aaggccctgt tctagaagct gggcaaggtt ataacatgaa 43561 gactgagaat ctgtcttcag agaatttagc agagaagaaa gatctaagga aaatgggacc 43621 atagtgccag gggggtacga gtgaaatgcc ttgagcaatt gaggggacag acagatcgcc 43681 acctctctgg tgggaaggga atagccaaag aggtacaggt caagccaagc atctgattga 43741 tagcttagtg ttcagagttg gaagggatct caaaggtttt ctgacccaaa cctctcattt 43801 tctacatgag aaaacggagg cccagagggg agggtcattt gcataaagcc acatgacata 43861 tggtcagcat caaagccagg tgaggaccag gttccctgac tcagcactac ctttgttcct 43921 catatgcccc caagcaaggc tgacaggtaa tctagcaagt ggagcccggt gcaaatccac 43981 tgccataact gtaagaactc agcatgctta acatgccagc cttctgtatc aactaaaatt 44041 ggattcagct gcacaaagaa acccaaaaca atcctagctt aagcatgaga ggagttttta 44101 atctctcaga taaaagtcca caggcctgag gtccaggctg gttctgcaaa gttgttaagg 44161 accaggctcc ttctagcagc ttctctctgc cccagtgtat ggcccccatc caacaaccca 44221 agctgagggc acctgcactc caggcagcag gggcaagatg aggaagaggg gcgatggggg 44281 cataccctgc ctttctacta gggttgctga aggccatcac aggacacatc cctttccagc 44341 ctagaagtta gagccttgct ggatgggagg ctagaaaatg tggtttttat tcggggtggc 44401 
catgatccca gtcaaaattt ctgttacaat cgaagaaagg aagaatggct atttgccttc 44461 ggttttcact tctaagtttg ctttctgtat cctgcctctt tcaacctctc cttccccagg 44521 tctagatacc ctgggtagcc cctgccttgg tattcattgc cttatcaaca gtttgaggtg 44581 tggtcacctc cctgccctac tcctgcacct tattcagccc agcaacagtt gcattctctc 44641 gttggccaca gaggccactt ggaatgggca gggccgctgc tgggcttggg gacaacgagc 44701 ctgagctcta attgacatag ctcatcactc actctgtgcc cttcacctaa ttacaccctc 44761 tctctgagcc aggagagaga gagagtgtgt gtacatttca tatccacaga aagacaaaat 44821 gaaatgaaag ggaaaacatc ccaaaatgta tcctacagat aggcccgtaa gtaagatata 44881 ctgtagcatc tttgaatact caagaaaact tttataatga caaaatattg gaaacaagct 44941 aaatagccat taactgaaaa ctagttactt atatgttagc tctatataat agaatatttt 45001 gcaactattt agaaaaataa agccacttta tgtgtgctta tatggaacaa tattcaaaat 45061 acataaagta ctaatggaaa aggatggaac tgcatgcatg gtatataacc tttaggtgaa 45121 atcatattca ctcaaatgtg tgtgtgagtg tgtgtgtgtg tgtgtgtgtg tgtgtgtaaa 45181 cttcctctgg aaggatacgt gagaaagtgt ctctaagaag agggactggt gtcgagggac 45241 aaagagagac ttgcttttca gtacatgaaa tttttgaatt gtacaccagg agacatctaa 45301 aataaaatat aagaaaggaa agggaaacat cgagtaataa ttatttaatc tttgacccag 45361 agacaatttt cttctagacc ttacatgtcc ctttagaaaa agcaagacac tgggtaaacc 45421 agctgaaccc ccatctggtt cagctctatc caaggtcatt gtcaccactg ccttccctgg 45481 agtctatgac ccctctcctt ctgacatggc ccttggagga agagattaca gtgctgagaa 45541 agagaaaaga accagacaag gaggtttgga gtcttgctat gttgagcttc tctcagggaa 45601 cccctgcctg tccagcccac tgttcctctt atggggtgaa ggggattgag caggatgctt 45661 gtattgcaag ggacaaaaaa tttctaagtt gacctagcct aaaccactga cctagttcat 45721 cagcccatat agcccaattt attgcctcaa tgcaatagaa aactcaagaa aggagtagaa 45781 ctgaatctgg ggggcactgg aacagggacc tcacagcctg tcagagctgc cctgccccca 45841 taggtctaat gtcctcttca tggggccttg attctttcct ccaatggcag ccttcacacc 45901 agcagctctg ggctcatggc caaaagctca ggcacttgag aaaaaaagag tcactttctc 45961 ccatctccag ttagaaaacc cccagggaat gacagtcact ggcctggcag gagtcagata 46021 cccacttctg catcaatcag tgtgactaga ggagcaggct tctatttcag 
gtccagcaag 46081 cgcacatctc cccatccatc ccagcttaca ggaagtgtca cttcctccga gaagccttcc 46141 caaatgcccc cgattaaacc ctcattgcac tgttttctcc ctcactgcac ttgtctgagc 46201 tccagtttca ttcatgtgtc tgtgattatt tatttgatta gtatctgtct cctcccctaa 46261 atacaagctc cgtgagagca aggaccagtc tggatttgcc cttcttgaat acatagccct 46321 ttgcatgggt ctggcacacg gtggcagaaa gtggaaatag ggggtttgaa aaacataatg 46381 acaaatatgc tgtggctgct ttaggcctgg ctgcatttcc ctgttgactg tggcagccct 46441 tattttctcc ttagaaaaca agccccctga agttcttctg gcaatatcca tcctcccaca 46501 aacaggcttg ggcctggtac ctagttattt gttcaacagg tatggattga acaccactat 46561 gtgccttatg cttttctggg gtgggatatg tcaatgaggg acatagagag aaattccttc 46621 ctggtttggg gagactgaaa agagacaaat gaattcagga aggttggaga gagggatgca 46681 atttggaagt ggatagtcag ggaaggcctc accagaaagt gacatgtaag caattctttt 46741 ttgttttcag acagagtctc actctgttgc ccaggtgggt ggagtgcagt ggtgccatct 46801 cagttcactg caacctctgt ctcccgggtt caagcaattc tcctgcctca gcctcccaag 46861 tagctgggat tacaggcagg caccaccatg ctcagctgat ttttgtattt ttaggagaaa 46921 cagggtttct ccatgttggc caggctggtc tcaaactcct gacctcaagt gatctactcg 46981 cctcagcctc ccaaagtact gggattacag gcttgagcca ccacacccag ccaacacata 47041 agcagttttg aaggcaataa gggagttaac cgtgcagcca tccagggaag aagaggtcca 47101 ggagaaaaaa cagccaatgc aaagcccctg tgcagatgca ctcctaaaag aacaacaagg 47161 aggccagtgt gcctggaggg aaggagagga agagtccagg gttaacatca aggaggaagt 47221 aacgatcagt tatctgcaga gactaagatg ccattgtaaa gactctggct ttgactctaa 47281 gtcagttaac ccctggagaa gttttcaaca aaggactgac atgatgtaag ttgtgtctta 47341 acaggagtca tctggctgtt ggggggtaaa ggacagaagc aaggagatca gttaggagga 47401 gatggcataa gccaaatgag agatgataat gcctcaagcc aacatgtgag agggttggat 47461 atactgtgat gtcagagtcc agaggatttg caggtgccct ggagttggca atagagaaat 47521 agaagatcca agggtaactc caaggattag tttttagctg aacaactcaa aaaatgaact 47581 taccatgaac tgaaataagc aagaatacag gaagaacaga tttgatggag ggtgaggaga 47641 atggcaaggg gttatcagtt tggttttggc catgctatgc tcataattcc tcttagacac 47701 ccaagggaaa atattaagct tgatatacag 
gtagatgagg gtatctggag tttgtgagca 47761 ataatgtttg agagccatta ggtttaacca catgaaaata tcaacatcca accattttga 47821 cctacaaaat ggtaatctct tatggtttaa cctacagatg gttattaaag ccataccact 47881 gggcctgatc aacagggctc ggacataggt gaagaagaca taaggtctat tgactgagac 47941 ctggcatact ccattgttaa aatgtcagga agatgaggag aaaccaagaa gacactgaga 48001 aggaaccatg gtggggaggt ggtggtggca tgatagaaaa accaggagaa tatggtgtcc 48061 tggaagtcaa gggaagaaaa tatttcagaa aggaaggagt gatcaacata gtcaaatgtt 48121 ttgcataact caaggagaat gaaaattgag aattgaccac tggttttagc attggtccct 48181 gggggactgg gtaaaaatta tctgagtgga ttcaagagaa aatggaagaa aaaaattgga 48241 ggcagcaaat atagacaacc gtttacagct tacttttttt ttaatacctg aaaggactgt 48301 tatatttgag tctttgaaga taggtaaagg actgagaata gagtatttgc aatcacacac 48361 tttagagtaa aattctaata agtaactccc aattcctcct ccccagcaac tagtgaatct 48421 cttagctaag gaaattattt ctatgtgttg agcccattca ggacctgctg aaacttgact 48481 caagctattc tgatcccata ctagtctgag atgagaagat tcacttccac atgtcaatat 48541 gtttaatatt aattacaaag aactatcaga ctctcagtgt gcaattgttc agttttaatt 48601 ttaagcagag tcttactccc tgtgtttctc aagagggttg ctctctccag gatgcctgca 48661 aaacaccagc ctctggttgc tgttatctag aaaggcttca ctctctggcc ctatggcttc 48721 ctctgggagc ttattaatga caccaagaca tgaacccagg tcttcagact tcaagcacca 48781 tggcctttcc acggtgcaac actggctttt gatgctttat ttgagatctt tggggagata 48841 gcacctgcaa tttaattgat ttttttatac agttctctgt gtaccaaccc tgactcccct 48901 ccaaagtgta cagataaatc atagcagata gaaggtcaaa atgaaatttt tacaaccgta 48961 aaaaatggga attatttaaa ccatcttatc tcagaagaac ccaggactga atcgccccaa 49021 acacacgtgg tccacaagaa tctgttagat cctaagatct agtaccaaaa atactgacaa 49081 tttgggcaaa taaatttata ttgtcagcgg tagaatgtcc aggctgataa agtacacatg 49141 tcttcagtat cagagctatt tgttggcttg agttttttta aaggtcattg ttgaaagggt 49201 agccccctgt gggaaagaat gcatgagaat tgtgttgaat ttttaaagcc tgtggcattt 49261 aggccttaaa tgatatttga aagtactctc tctctgtctg tctctgtatt tgtccttttc 49321 tctcttctct ttctctcttt ccctccccca ggtaatttat aggaaaaaaa agagagtatt 49381 taaataatat 
atgttttctt attttgcaag atattaaact ttttaaatta cacataacct 49441 tgagatttaa aaaattcact ttgtagctgt ccttgaaggg tttgttgctt tgtgatccag 49501 ggtgagagac agttctaact caacccgttt acagagatta aacaatttgc ctcccttgag 49561 acctgccatg gatttccctc tgggctttag ctctcctcca ttccttaggt gcaggctcct 49621 tccctgcact cccacctcca atgatattaa ccatatggag gctgaagtgc atctgccaac 49681 atgaacaacc ttgaattgat cttcattagc atgattatca gcatcccctt aacacaataa 49741 tcactgaaca ctaattgttt tccaggcatg taaagttcag tgattacatg cattgtctca 49801 ttaaatccta tgacaaacac aagttgtagt tatgattata attctcccca ttttacagag 49861 aagggcactg tggctcagaa agtaagttaa ttaagatcat ctactcagga aatggcaagg 49921 tgaaaatttg agcttagatt tgttggcccc ccagagtcca tttttgtaat tactatttag 49981 tgcagcttta aaagttaaga tcatcaccaa attactcaat cttaaccaag acatttttaa 50041 tttatctccc atgggataac acattatgag catttacttt ctgatctcag tagaatgcag 50101 agaaggttaa actattccct ccctgagaca gcataatgca gtggaaagat ctcagacaag 50161 gatgtatttc ctaggtccac cctgacttct taagggctct tagtttgggc ttccatgtac 50221 ccccttggta cccatattca tatctgtaaa ctaaacataa aaatacctgg tgtcttagct 50281 aagatacttt gggttatgag taacagaaac agaagctgat tcaagttttc acttagcaca 50341 gtgtttagca aatagtcgat gcaccataaa taagtggtga atgaatgaat gagtgtctta 50401 agaaagtagg aaaggtatca gaagaatcct ggggcgtcat gtggtatcta aaagaaacct 50461 ggaaatcaaa gcttcagaaa gatcagaaat cataggcagc tctggtgatc tcagcaggag 50521 tttgtgacca acgagtacag agaccaggat caccagttcc tcctgccaca cctctgcccc 50581 agccctggca atgccggaga agtagcctgg tgttcatgtt tgtgcatgtg actgagtgtc 50641 tttgacaggt tgagtcccca tagaatgtgt tacatgagat gctaaagaga tggaaaatag 50701 aggttgcttt tcaggttcca tttccagtca agattcataa caagcacgga tgcttaacct 50761 gccccccaag atcaactcca gaggaacaca aaggaaggga aaactgaaaa ccaggaggaa 50821 ccatgggggt gtggtcctat tttgaattgg acggtagggc ggaaggtgat ggtgtggcct 50881 gggcaggacg tatcccagca cagcagcaga agtgtaaaag agagggggaa atattaggaa 50941 ttctgttctt tcttctaccc agtaacagcc ttgggaaaaa cctgtggaga ctaaatctta 51001 tgccctcatt attgtcagga gtaaaatcag agctgtacag aagtcctgtg ctaaagggag 
51061 gctgagcaaa gcaggcagag tgactgactt gggccgttac ttcctctccc gtcctatctg 51121 aacttctggg gctgtaaaat tactgtctaa agagtggagc ctctctatgg gtggcaaagg 51181 gttaagattg agctgagagg tgtcttggta agtgttttct catgccaggg gttctgcacc 51241 ttaaggctca cctctcccta cttcctgagt tcactggaat tgaagtagat agaatctggc 51301 atctcaccac caacactgcc aaaaggagta accaatactc tcagggaaaa atgagctcac 51361 aagtcaaaac aacaggacaa ttgaggagca caaacatttc aatagaaaat agttatgttt 51421 ctgggcccag aggccaggct gggttgggga ccaggatccc aaccctgtga caatgggaga 51481 aaaggggact ggaactgttc ctgagcaaat acctcatggc aagtaccatg tgggcttctc 51541 tgtacaccat cttgcttaat ctcacctctg tcctttgaga taggacttgt cactaccacg 51601 tcacaagtca gaagaagctg aggctcggag cagtcacatg gcttcttaaa ggagacccaa 51661 tttacaaata gcatgacaga ttttaaccta gacctaagct ctggagtcag accatctaga 51721 tttaagtcac ttcctaactg catgaacttg accaaatcat ttaacttctc tgacctcaac 51781 ttcctcttct gcaaaacagg gctaaaaatt gtacctaact cattggattg ttgtgaggat 51841 gcaatgaatt tagaatagct cctgatagtt atgtggtact cagtaaacat tggttgttat 51901 tttaattatt tccagctgtg tacctgtgcc tttaacattc agcctttctg gtaaaatgta 51961 ataaatctcc tctcttcttg tttaagtagc aaccaattgg aattgttaca catttgcttc 52021 ttaaatagag tccacctaca tggtctagaa aggaagaaac agggtgagtt ctgagatcaa 52081 atcctaattt ttaacagaga gctcaaaatg agagagtatg tcctgctttg gaaaagagat 52141 ttctctaagc acccagaaag gacacaggta ttaccagaga aaggattctg ttccaagatc 52201 taaacttatc cagaaggaat caaaagcaaa ggctaaaata ttaaacatta gggaatgctt 52261 ccttaactaa ttaaacacca atgcaacgga atattatgaa actctcaaaa cttatctgta 52321 cacataaggc attaaacaca ctcacacaca tacttgtgaa acagttaaat gcaaaaaaca 52381 cagcacaaaa cagtgggtag actatgattc agttctggaa aaatacagct atctgtgtag 52441 acacagagag aaaattacaa gggaagtgtg ctaacatgca gggaatgggg gtaacttttt 52501 tgagactttc cttgaagtta tagtcctttc aaaagaaatg caaaatattg aggaatagaa 52561 ctatgttgta catttatgtt ttaaaaaact cacaccaatg caaatctaat ttatttttct 52621 tttgcaggaa gttaatctga aacatatatg gtcttctatt gatgtattct aactatataa 52681 aaatacacaa agtcacctac acacaaatcc acacccatag 
aagcttgcag aaggacaagc 52741 gattcataag catcactgga atgggaagct gtttccaatc aaagacccag aaaggattga 52801 tgggagagag gacctacgat gtgcctagaa ctttacagca agtttcctat aggaatttcc 52861 atattaaccc ttcaaagtac ttagcattat ccccatttgc agatgaggaa gtgaggctcg 52921 gagatgtgaa gtgacttgcc aggcatcaca cagctcgtgg gtgtggcttc cacctcacca 52981 cactctgata ggcaccagtg tcaggcacaa tcagatacaa agtcagtgag tttggagctt 53041 ttctcaattg gcctcaacac ctcctaactg tctctttgca aagcagccca ctcttaccag 53101 aacagcagct tcttcctctg ccctccccca gtcgaaccct gcactttctc atctttgtat 53161 cccaggagct ggcaccggcc ttgtcatggg gagacaccta gggaaagatg ggcaaagaga 53221 caagaatttc caagtttcct ctgactgctc ccctgcaaca cggagagcct gagctagcac 53281 aaaggtcccc ttccacacgc tggccatcac acatctgttg gggagatgtc attgtggcca 53341 caaactccgt cgtctgtgat cgataacgga tgactgatgt gagtggaaat tttgaataaa 53401 ggaacatgat cgtgccagag aagatatgag aaataaacat ggcgaagcgg gtcctcttct 53461 gcaggagcag gctcttgcct gtgccaggac ctggcacata acatggtcac attaagcatt 53521 gaataaacat ttgttaaatg gatggatgga ttatgattat cctgtgaatc cacacaaatt 53581 tggtagtacc acaggtattt caaaattgat gtgccacaat ctgaattcat catctccgca 53641 acattctctt ttctctcatg gtccattggt tatccaacca ggacatagct gccagccttg 53701 actcctccct gtcccccact ccgccccatc ctccctgtca ccaaagcctg gcaagtctgt 53761 acacgttaag tctcaaatgt cgttaaaccc ctgtctttcc cccaccctgg aggccaccct 53821 cttggtcagg agtgattgag atcacaagct caggagtagg catgcctgaa caagcatgtg 53881 ccagctttac gccctttggt aaagcactca acctctggaa acctcaggtt tctcttctat 53941 aacatggggc aaataatggt tcctacctca gtagaggata attatgagag cacaatgtga 54001 caagtagatc atcttccacc tgacttaacc caaaatccta ttactgagtc tccctgtttc 54061 aggtcccgac cccttctccc caggcagtca gagggcccag tctgaaattc aagtctgttc 54121 tctcactcct ggtgtctagt gaaaggagcc tgtcttttgg agtcataaag tgtagattca 54181 tgtgtgccta ggaagcccag ttatttggct ttctcatgcc ctccatttat gcatctcaaa 54241 gcatcaccag gcattgctgt atgctgaggg ttacatctga cagcacaagt gagagcctta 54301 gcccaggacc tggcacccat tgggcactca gtaaagctta gctctttttc ctctcccttc 54361 tttcttgtca ggagagtgta 
ggaggaagtt gggatcaaac cctggaggga gaggtcccaa 54421 agggtggaga aagtgaaggg gaagagtacc gtgaaagtca ggctggcagc ccagcctgtt 54481 tcatcaggac acccccagag agctgcccct tgctgagcag aggtgtccag tgagccatcc 54541 atcagaagca ctggcagtgc tggtaaaatc acagcttcca tagccccatc acactctatt 54601 ctctaaatcc ccatgcagta ggcctgggca tgacctggga gtcagcatta taacatatat 54661 cccaggaggc cccactgctt agctgggttt gggaagccct gctcagtgaa tttgcctttg 54721 ataagcagcc cagaagcctc cttaaactga agctattcct tggccatctt tttggtagat 54781 aaaaagtggc agcaacaggg aatttgattt catatggatg gatttacatt aattggagaa 54841 aacaaagtct agacattttg aaaaaccaag ggacaaggtg gccaagaatg ctaacattgt 54901 actgtggcct ttgggctctc agcacctgat tagagaccag gacaaacctg ctcaggcctg 54961 gcctgaggcc gggatggttc taggttgact tcgaccagca ggtgatcttc tctcgagcct 55021 agaataacat cttttgtcag ctgtctgatg ccactcttct gtcttggtac caggttaaat 55081 ggcatttaga gaaacaattt ggcacctgaa ctctaatatt ggtgcttagc agctgtaaaa 55141 ggatggtggg ggggcatcat ctagagaccc caaggcaata attcttcaga tcacatggag 55201 ccttagaggc taccataact caatactaaa gaaggccaca cagttgcaga aagggtacta 55261 agactttcca aattctgacc aggtatcagg ccctgtgcta ggccatgtgt atgtgtactt 55321 acataaatga atgcataaat atatatatat atatatatat atatatatat atatatatac 55381 acacacacac acacaagcat gtacacacat atacgtgttt atgtgtgtat atatgcacat 55441 atttattttc tatatatgaa tatgtatgtg aatgttcatg tgcatgttta tattcatata 55501 tgtattcata tacatatgca ggtgtctacg tgcgtatatg tgtgtatata taggatatac 55561 tgtatatatc ttcacaatag ccctcattat caggcttatt ttagagttaa gaaaactgag 55621 gtgtagagaa attaagtaac ttcccaatgt cacaaagctg agtttgactc caaatgctta 55681 tgaaaccagt gtcctttcta cactgcttcc tctcctaaaa ataaaacata aagaggcaca 55741 cagcagttct agctctctgg aatatgtgtt gtaaatcctg ggtacaatga ctataagaat 55801 ctccaaaacg tgccttacta tatctaggag taggggagac aagctctggg catagccagg 55861 tagcccaggt gtgccaggcc aaaagatcag agacccagtt cccaagtaag caaactgatg 55921 aactcactct ttctccttcc caacctttct ctgcctgccc tatgttatag gaaccatgta 55981 atctataata gaacttttag actcccatgc atgcgttggc ttaggagaag gtgtttggat 56041 
ttaagagtct ggagcaataa tttcctttta gtttccattg taactttgaa tggttacaaa 56101 ttgcctcatt gtagacttta aataaataat gcatgtctct ccaataatta ttcaagagaa 56161 gatttgtttc ttacctaccc tgtggtctat tacttgcctc tggtatttta ataatcttaa 56221 tcatcatctg tttagcatct ccaggctgct gtgctcactg caaggtagct tataccttaa 56281 ccctgacatg cacaataatc tggccagtca agttgcttgt cccactttct tgtctgtgtg 56341 tcccatggga ctgtaagttc catgcagata gagttggcat agaaatggtt cagatgcatg 56401 gggccccagg acctagcaca gtgcttggca catggtaagc attccacatt tactgaagga 56461 aaggatgggt gaatggatgg atgagtgctg ggaggctgta ttgtaaataa caaaaatggg 56521 atctgaagga aggtatggga ctctgaagtc tgcattcctt ttaccttttg ctatattttt 56581 ctgtctcact attatatttt gttttctctg ggcctttttc ttcagtcaca cttttccaag 56641 ttcatgtgtg ctatttcttg gactggttct atttatttag ctggaggcag atttgcattt 56701 ttaaagttca tgtgtttgtt ttcgagccaa ggtcgttttt atgtgtgtca tttctctggg 56761 ccttcacttt tcacgtttta tgaccatatc cagaatttcc cacttctcca agaagtaact 56821 gggtggttgg gaggggggac agtcaagggt attggtccaa tgggcaatga gatgtgggtg 56881 agattggggc cagttcctgg gaagcccctc tgaggccccc acacctccat gcctccatgg 56941 cctccatggg cacggatggc aggcccagca cataggcctc tctgccatca tcccacccca 57001 acccccactt ggagaataga ggtcaggtct gctgtatcaa ctgtcctatc ttgaatgctg 57061 gtagttgtaa tagagtgaac gagagctgta tagggtctta taccagcaat agatgcttgg 57121 cccagaaata acacatcact tccactcata tttcactgtt cagctccatc catggggtcc 57181 tgtccaacca cggggagccg ggaattgtag tcccaccaag ggcccagaaa aaaagagaga 57241 cagaagtaat taatgaatag caataataac tgcacaacct ccaagacaac ttcccatctt 57301 aaaaccctcc tgcttttctc tcaccaccaa gagtcccaga tcagagcccc tgaggtcatg 57361 agaatgcagc ctttgccacc ctgccagcct catttctagg cctccatctc cagtaaccca 57421 gctgattaag ttcccttggc aggccatgca tggcccttcc acatggtctt cctgctgtcc 57481 gatctgccca ggaactcatt cagggtgact cggggcaaat ggcttctcct ccaagctagc 57541 ctccaccaac atctccaatt agagctgagt ccccgtctct gcaatcctta gtaacttgtt 57601 cactgtcctt ggcattttac cctctctgtt tattcatgag catgtatctt ccaccagcct 57661 gacccaagag acagtgacta agttctactt gtctcacaat gctcagacct 
cacagcaccc 57721 ccagtgtgag atgttcagcc tacacctctc aaatatttag tgctgactac tttgacaaca 57781 cttattattt aggcatggct tatcaatttt taaaaatgtt attaatatct gttttacagt 57841 ttattattta ttccgtaact ttccaaagga tgggccatct tgatcttgct gctgcacttt 57901 atgcctatcc tcatggctac tgggctacac agtctaattt gactttaaga ttcaggaggg 57961 tggccaggca tagctcacgc ctgtaatccc agcgctttag gaggccgagg caggcagatc 58021 acctgaggtc aggagtttga aaccggcctg gccaatgtgg caaaacctct tctctactaa 58081 aaatataaaa attagcaggg tgtggtggct catgcctgta atcgcagcta ctcgggaggc 58141 taaggcagaa ggatcacttg aacccaggag gcaaagtttg cagtgagctg agatcaggcc 58201 actgaactcc agcccggacg gcagagagag attccctctc aaaaaaaaaa aaaaaaaaga 58261 ttccaggaag gtatgttgcc gaaaagaaag tagagtgtct tatgacagac ctgtatcaca 58321 cccagaccag gaagcatcaa tgtcatcaaa gcatggatgg cgtgactatt gtcactatct 58381 ccttcctatg ctcctggaag gatgaatgtg tgggtggtga tcctgggagc ctgtgcctgg 58441 agaggaggga gccaagtaag gctgtgaatt agccaggact cttttggtac atggagcagg 58501 aaccctgttc aagccagctt acacatgcag ggaaaatgat acctgagaac atctcttaga 58561 aaccaaggac ataggtctgg cacggtggct cacacgtgta atcccagcat tttgggaggc 58621 cgaagtgggt ggatcaacgg aggtcaggag tttgagacca gcctggccaa catggtgaaa 58681 ccccatctct actaaaaatg gtgcgtgcct gtaatcctag ctactcagga ggctgaggca 58741 ggagaatcgc ttgaacccgg gaggtggagg ttgcagtgag ctgagatcac attactgcac 58801 tccagcctgg gtgacagact gggaatctgt cttaaaacaa aagaaaccaa ggacgtaaaa 58861 gcatctgagt cataagagca aaaggaagca gagacttgca tgacgctggg attcttctcc 58921 ctgcctctcc acttctctct atatgatgcc ttcaaccttc tctctctgac gatcagcttc 58981 ctcctctttt ccactccaca ggctgggaaa tacggtgtca gggagcccac tgcttcacat 59041 ataacaatca gggctatgaa agggaggctg cacagctctc ttggttccaa atctgaaagt 59101 cccaggaaag aggcacttgg gcccagttag agtcaagtgt ccactcttgg actaaccaac 59161 taggtccaaa ggccagagtg ccattgtgac ctacctggtc ccactatatg gatggagaag 59221 agaataaagg tcaccaacag tgacagtggc caagaacccc taaagcttca caggagagaa 59281 tccgtgtagg cctggctccc aacaccagaa ggacagaaag atggctgagc ttgctgcata 59341 acaaccacga ggggaacttg tcagaaatac 
agatcccagg ttccaggtcc caggtcccag 59401 tgagtctcac tcagtaagtc tttgctcctg cctaggaatc tgtggttcaa gcaagcatcc 59461 caggtaatta caatatattg tcagatttgg gaggcattgg tttagcttgt gctccaaatt 59521 taattttatg tctgttttat tcctttctgc tttggggttt attattcttt ccactttcct 59581 ttcactaaat aatttgtacc ccttctaggt tatttaagta gagacttaat tcccagtttt 59641 tcttctttct tactaataga tgccatcatg agcaacagtt ttttctttaa gtaccagcgt 59701 gacagcatcc tataaattta tatgatatgc tgtgtttaca tccttgttct ttaaatactc 59761 tgttctgaat atcttccctt tccatggcat tatttaggag tgtttttaac ttctaagtga 59821 ctggataacc ccttcttgtt tatttctaac ttcattgaac gaactgtagt gttagaatat 59881 ataaagcagg ccgggcgcgg tggctcacgc ctgtaatccc agcactttgg gaggccgagg 59941 cgggcggatc acgaggtcag gagatcgaga ccatcctggc taacacggtg aaaccccgtc 60001 tctactaaaa atacaaaaaa ttagccgggc gaggtggcgg gcgcctgtag tcccagcttc 60061 tcgggaggct gaggcaggag aatggcgtga accccagggg gcggagcctg cagtgagccg 60121 agattgcgcc actgcactcc agcctgggcg acagcgagac tccgtctcaa aaaaaaaaaa 60181 aaaaaaaaaa aaaaaaaaaa aaaaaaagaa tatataaagc agaatgtttc tgtcttttct 60241 tattgaataa atgtctttgc atcatagtaa gtaaaaaatc tatccataaa gatttcctga 60301 aaaatctatc tataaaaact cactggtcaa gtacaagatt tgggtgttgg gtgttgtgtg 60361 ttgctgctct accccagccc tcatcaaaaa caaacaaaca gacagacaaa caaatcaccc 60421 ttaacaaagg ccataaagag caggtctttg gagttgaacc tggctgggct tctaacttgc 60481 tctgtgcagc cttagacaaa caattcacct cctcggagtc cattcttttg cctagggaaa 60541 tcatgagtga agacaggcat gtactggaca cttgatctta gtagcatcct ctaggcagtg 60601 tttgccctgc aggaccagct gggttattta gaggagtgaa gacacagaga ggtttagaag 60661 gaggcacagt cctcagtatc ccaggtccct ggtgtgcatc acaatctttc aataaaatgg 60721 agctgatcat gggtatgaaa attgttggtg gaaacagaca taagggttgt gcctacttag 60781 ctgctatttg gcgcatagaa gagtctgagc atctttcccc tctttctttc cctatccagt 60841 tcttctttat tattttaaaa actcagctca gagtcaattc ctccaggaag tatctgccca 60901 ttctctcccc agcccctatt ccacagcatt gactcctgta cacacagatc agcttgtgtt 60961 atttccatcg tcattacctg atgcctcatc cctctccaca gctgtactca gcttcttaat 61021 ggcaagtact 
gagagttatt tagaggagag agaacaagag agagaaggat acccaatatc 61081 ccgcaccaac tgttgagtag ggcaattaat tgaagaacat ggagatagta tgcatattgt 61141 caaaaatatg tgatactttt ccagcaaata tttatggtgt atctgctatg tgccaagcag 61201 tggacataca ggagtgaaca acagggtcat gattcctctg tgggaaatat ggatttccca 61261 agaaggtgat ggaggcccag tcatgggtga aatacagggt gctggagagc acttaggtct 61321 aagcacctct tctcccattg aaagggtgga ccagctgtcc aggatatggc ttttagcatg 61381 agaccgcaaa gataaggagt tagcaatgcc tggaaaggga caggaagaag actgttccaa 61441 gcagagagaa cagcaggtgg aacaccagag aacatgtgag tgcaagaaaa ggagtgatgt 61501 ggctggatgt taggcagtga aggagcacaa agagaaacaa gcagcaccag accagacaga 61561 aacttggttt tgttttgttt ttttgttttg agacaggatc ttgctctgtt gcccaggctg 61621 gatgcagtgg cagcatcagg gcttattgaa gcctcaaact gggctcaggt gatccttctg 61681 cctccagcct cccgagtggc taaggctgca ggtgcatacc accacgcctg gctaattttt 61741 gtattttgtt gtagagacag agtctcacta tgttgtccag cctggtctcg aactcctgag 61801 ctcaagcaat cctgccacct cagcctccca aagtgttggg actacagata tgagccacca 61861 tgccaggcaa ataggaactt gtgaaccatg ccaaggggct tggagtttaa gaacaagagg 61921 gtcttgccaa agagttcgtg cagtaaacct aaacgagaaa tggtggtggc cagagtaagg 61981 gtgggggctg ggggtgtggg taggaagggg tgaagagaga tgtcgaggag gtagatacat 62041 ggacttggtg attgattggt caggggtgaa gagagaagga cagtccaggt ggtccaggac 62101 ttcagtggca tccttcacca aagggaaagg aagacattcc tttgatgatg gttgcaagga 62161 tgggtggaga tggaggcgaa ctagattcag gctagtcaca ttccatatgt ctttgggaca 62221 tgtgaaggag acaacattca acaggcagtt gactgtgtaa cagtaataac aaaaactcaa 62281 ctaagccctc acaataacct caggagaaat gaaactaagg tacagagaag ttgagtaact 62341 tgctcaaggt cacacagcac atagtggcaa aggcagttcc atacacagac attcaggttc 62401 cagagctcat gtgagggctc tgaaactcag gagagagcca gccgggtcca catgtggcta 62461 actcaccacc gaagccacaa gaacaagaat gatgagtgag gggaacgaag gggctaggca 62521 gaacttgagg aacaccaatg ctagaggcct gggctgggac aggaagccaa aaaaagagct 62581 acaatcatct gatcttcgac aaagctgaca aaaacagcaa tggggaaagg attccctagt 62641 caataaacgg tgctggggta actggttagc catatgcaga agactgaaac tggacttctt 
62701 ccttacacca tacacaaaaa tcaattcaag atggattaaa gactcaaatg ttaaacccca 62761 aactataaaa actctggaag acaacctagg caataccatc ctggacatag gaactggcaa 62821 agatttcatg gcaaaaatgc caaaaacaat cgcaacaaaa gcaaacattg acaagtggga 62881 tctaattaaa ctaaagcgct tctgcacagc aaaagaaacg atcaacagag taaacaactt 62941 acagaatgaa agaaaatttt tgcaaactat gcatctaaga aaggtctaat atccagcgtc 63001 tgtaaggtac ttaaataaat ttacaagaaa ataaaccaaa caaccccttt aagaagtgga 63061 caaaggacat taacagaccc ttttcaaaag gagatataca tgtggctaac aagcatatga 63121 aagaaaaatc tcagcatcac tgatcattag agaaatgcaa ataaaaacca cagtgaaata 63181 ccatctcaaa ccagtcagaa tggctgtttt ttaaacatca ataaaataag atgctggcaa 63241 ggttgtggag aaaaggggat ggttatatac tgttggtgtg agtgtaaata agttcaacca 63301 ttgtagaaag atgtgtggcg attcctcaag gagctaaaag cagaactacc attcaactca 63361 gcaatcccat tactgggtat atattcaaag gaatataaat tgttctacca taaagatacc 63421 tgcatgtgta tgttcactgc agcactattc acaatagcaa agacatggaa tcaacctaaa 63481 tgcccattaa tggcagacct ttgcaggaac atggatggag ctggaggtca ttatccttgg 63541 caaactaaca caggaatgga aaacctctgc acattctcac ttataagtgg gagctaaatg 63601 atgagaacac gtggacataa agaggggaac aacagacacc aggacctact tgagggtgga 63661 gattgggagg agggagacga ccagaaaaaa taactattgg gtactaggat tagtacctgg 63721 gtgatgaaat aaatctatgt aacaaacccc tgtgacatgt acacagttta cctatataac 63781 aaactgcaca tataccccta aagctaaaat aaaagtttat tttttaaaaa acgaaagtaa 63841 gtatgtagtc agagaaaagg gtttctatta aatagaaaaa aagcttaatt agaaatttaa 63901 aaacaatcat ctctgtggag tgatggggtg aaatccttaa ggcaacgagt tgagaagtca 63961 gttgagattg aaggaaccag acagcaagga tagtccagag ttagtgaagg ggagatgaaa 64021 gaaggcagca gctagaagta gctgcagggt taaaagagag ttttataata acgaagacag 64081 tagattatat taagtattga tgggaaggag ccagtagaga tgagtttgag gctccaggac 64141 agagcctgta taatcaggag agtgaggttc cttcctgact aggctccagg aggaatagat 64201 ccacacatgt gcaaggatcc cagtgtgacc gggaggagaa acgaaagaca gggttgggaa 64261 gagaatgggg atggtggaag caccagagga tccccagcct cactgagctt ggacgctgca 64321 cactttctgc agcagcagtt catgcggctg tgtgatcttc 
ctttgagagt gctgagcagc 64381 ccaggtgtga ctgaggcggt gctggacatt tggatagatt ctgatctggg gctttgctag 64441 gagagcaaaa ccaaaggaca gtgagccaag ggaattgaaa atactggcac tgaagtggtg 64501 gaaagcatgt tctgtgagct ccagactgac tgaaggagct gtgaatcttc tattgtcatc 64561 ctctgcccaa cccatcaccc tcccaccaac atatttacca cgggactttt agagtacagt 64621 caagattccc ttttggtttt ctctctctct ggatatcgaa accaatttcc tgggatgggg 64681 gtgggggaag gctggagaaa aatgacatta cagagaagga tcagccttgg gacattcccc 64741 aggccagact catttgaaga aaacttgagt ttttcccttg tctaaagctc tgaggtcttt 64801 ggggagcttt ccagctctgg tctaattgta atccctaagt acatatgcta tgttccaaag 64861 tcaatatata gagttagtgc aataccaatg taaattcttc tgagtccttt aatagaattt 64921 gacagctctg agtgtaggat gcttattctt ccaggtaaag aaccatcaaa gcatttgtct 64981 tttaatggca gtgatagctg atccctcctg agtgattaag atgtgaattg tcaatcactg 65041 aaaccatcct gtactgagtt aaaaaccaca aaggtcacca gaagagagat tcctgagatg 65101 gatttgaatt ctggcaaaac caacgttagg ccagccaagg agcatattgg agaaagagat 65161 tggtatttaa taagcatcgc ggggacagtt ggctaacagg ccaggggaga gttaatttaa 65221 attcatacag catacctgtc aagttcagct atatcgtact gctaatctgt gacaaacctg 65281 gagacaagga actggcattg ttacccctgc tgctgataag gagataaatt cctggctatt 65341 gaagaaaaag ataccatcag ggctctgata gatgggttca aatatgtgag aataaaaatg 65401 tttgcattgc aatgtaataa aaataaatac caaaaggaaa gccacggcat ccgttacgtc 65461 tgtctcattc tccctcccag gcctgcttac tgcccaaaac agataatgga taggccgtta 65521 tgtgattctt cctgataagt ggtcaaagga gatgaacaaa attaaaccag gaaatggaga 65581 tagtaaatca ccacgtggaa ataggcgcgg tcttgccagc tataaaatac aatttagatc 65641 caggcaactt tcaggtcctg ttacatgcct actaaagcca caaaaataag tatgataaaa 65701 cccaggatag gcagccagtg gaaacacatc cttcagcatt gccttttaga aagactggac 65761 aaatttccat aagcaatcag gcaacatgaa tgagagccac actgctgttg agaggctctg 65821 acccggtaat tccttctttg gttaaaaaaa aatcctacag tgttagcccc aaagaaagaa 65881 aaacacatct atagcagcat tatttctata ataactataa attgccaaca acacaaaggc 65941 caaacaagaa ggggctgacc aggcaaaggg aaatgtgtta acacaataga ataacactgt 66001 gctactatac atgactaata 
cggggagcgt ggtaggttct ggaaacagtt ttatgaaata 66061 tgcatagtca tcattagtgt gcatatccaa ggccttcctg gatattgata aagaagagac 66121 tttggatggg gcttaattca cttaatgttc attaaggttt catgataggc tggatgctaa 66181 ttttagtcat tcattcaaca ttcattgagt attgaccaca caccagatgc taaggatatt 66241 aagatgagaa agaaatgctt cctactgcgg agggctctgt tgtcaaatga gaagatagac 66301 agacagacag acacacacac acatacgcaa aatgatcaag tgtgggaaga ggtatagata 66361 gggtgtcata tagaaagggg agggggccta gacgtcacta cagatcttca tatgcaggat 66421 taggataagt cacaggtggg gcaaacatgg aggagaacat tgttctaggg agagggaatg 66481 gcattgcaca ggctgtgacc gtatagatct ggggtgtgtt tgtgtgtgtg tgtgcgtatg 66541 tgtgtgtgtg tgtgtgtgtg tgtgtgtttg tgttctccat cccaatagat ctgcagtcta 66601 tagcctagga tacttctttc atgggaccca gaacccagtg gctgccttcc cagaaccaac 66661 tcacccctga catttccctt cactaccacc ctcccacaaa ccgctcctcc cccgttcctc 66721 tgccacgcgc agcctcactt ccttgctgaa atcatgcaag gccgtttgcc ctgcgtaaaa 66781 atgctcatgt tgtgccatat gctgtgtgaa tagaattatt ctgaaggctg gtttaaactg 66841 gaaaggattc ttgttatttt atatagttca tatcttattc attgcagagt gttctttcct 66901 gaaagaacac tttcatatca gaaaggagac gtagagcctt aggtgaaatt tccctgttac 66961 tggtttcctg aaagtgctgg atatggaaac tgttgttaat aaattgagta aagtggggaa 67021 agtaaatgca ggaatgcaag cccccagctc aagttcatat taaccaatag gtggtgtgcg 67081 gagtgtcgtg agtaagaaat catggcagca ggtagccaca agttgccatt tgaatgttga 67141 tgttttgaaa gctgtgggtg ctattatagc ttaacatata gtttgatttg acgatgtgta 67201 gcgtagcatt tagaacgtag tgtatgcaag agaggaaaac tggaagcttc ggaaagggaa 67261 cgaggacatg aatcaagagg ttctcaggta agaatgacaa aggaagaggc ttagtggaat 67321 ggtgtgatta ccagcacatc agcatgaact catgactttg aaaacaaatc cctctctctg 67381 ctgagatgac ctaagctgcc gatagcattg cctagcgcat aaccaggaag caagcaaagc 67441 aatccaaatg aaagcagccc tccctcactg ccattctcag aagaccagga atagattcag 67501 gggagaaatc ggagtcagaa gcctttcgtg gttggagaga tataatgttt tatgagacca 67561 ataatgcttt ggggggccat gttgcacaga agtgggggtg agaggtgccc acctaagaag 67621 ctagccccca cttgtggaag ttgtcacgca gcaacaagta gaacttcacc ttgagcaaca 67681 
aggcaataat tattgccagc tgaatgggct aagagtctgt gaaagatact tgaatctgag 67741 atgatccttt ggaaatgaac gttaatgtga agtgagctgc caagattgag taacaacagc 67801 tgaagtgttg ttaaccgaag tcagaaaatg ctatacattg atcacactgt gttatcagaa 67861 atccatcctc cagtttcagt ctgtctctga gtgacaaatc acactttctc tcaccctaca 67921 agcctcagac ctcctgagca gttgggttga aatttcataa tgtatccaga tgattctatc 67981 atgatgtctg caggttatgt gtagtgtttg tgtgtgagga gcaaaaagtg gggtgcactg 68041 atccaaaaat tcttctccag ggcttatgcc ccttgccttt gccttgggag tgattcttgg 68101 taaccagggg agggccttgt acccggactt gagggacggc tgaagaggag tctcaatggg 68161 aggcccaccc agccaaccat gagagaccct ggtacacatg ttatcagtat tctctctaat 68221 atgtcagccc agttgcaatg gcagatgatc aatggattgc ttggctgcct ggccaccctc 68281 tagcccacac ctttcttctc tggacatgga ctcatacttt cccttctccc cttggaatgg 68341 tcccttttca ttcaggcttg gaatttggat tgagattgtg cgacaaagac tgctctctgc 68401 tcaccaaaat ctgtgctccc tacttctttc cagctgcata gctagactgt tttcccagcc 68461 tctctctctc tttttttatt tttgagatgg agttttgctc tgttgcccag gctagagtgc 68521 agtggcgtga tctgggctca ctgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 68581 tcagcctccc atgtagctgg gattacaggt gcacgccacc atgcccgact aatttttgta 68641 tttttagtag acatggggtg ttaccatgtt ggccaggctg gtcatgaact cctgatctta 68701 agtgatcctc ctgcctcagc ctcccaaagt gctgggatta caggcgtgag ccactgcacc 68761 cggcctgtat tcccagtctc tcttgaagtc acatgtagcc ttgtgactaa gttctgccct 68821 gacccatggg agaggagcag aaatcagtgc cacttccagg actgccctat acacaacccc 68881 cacttgactc ctctttgttc gtattcctct ctgggctgcc tgaaataaag acaaccttgg 68941 ggtcaccttg gaagccctta gctaaaggtg gcaaagcagc cagcagcctg ggacccagag 69001 acaacttgaa ggacagttcc ctgctgacct gaggcaagaa acaagcatct atagcagaat 69061 aaatcctctc acataggcaa gaaacatggt tctgtagcat ctgagcctct acattgggtt 69121 ctatttgtta cagcagcaag aattacccta actcactcag atggcaaaca gtttaaactg 69181 tggccatcag ctttgactct gactgaatcc ttctttgctg tgaccattag aaacatgatg 69241 ctattataca cttgggtctc gccagccagg gagaattggt ggctgtttga atggcagcct 69301 cctcgggcta cagaaagaaa aacatggtgt cacacaaaat gctgtaggaa 
ctacaagaca 69361 agaagatcca tccatgtgct acctgggcta tgatctgcta ggcccatatg agctgcacgc 69421 tgtcccaaag gtcaagacca gctctgagtg actgggcagc agagacaggg aaggcagctt 69481 gctgggaggc ccacccagcc agctcaacag caggtgccat tttgcttgtg gctggcgtat 69541 agagttctct tcttagaaag ggaccatttc agttaaatgt aataaatatc ctcttaagta 69601 acacatagtt gaaattctta ttttgctttc agggcctgaa attggttaac agaacaattc 69661 acataagtca caaagtagcg gataactcag tacctggact ggagccaaat tcccctggag 69721 ctgtgcagca ttttagagag gagacctggg tagcctttat ggggacttct ctttgggata 69781 ggagggggat gtgtttgcag gactcttcag gtccaaaatg ctgtgcttct gagacaagag 69841 tctttcccca aatagagcgg cagctgttgt tttgtgtttc cttctccctc caggatccat 69901 tcccttttct tgtaactaat gccccacaca cgctttgcag gaatcacctc tcctaaccta 69961 tgggctctgg gacaaatgag ctaaccccag ctccacgcac aaagtatttt atccaagctc 70021 tctgagctta aggtgaggcc atgtctctgg tgattgatta tttcaggaga gagtgagcaa 70081 tctaagttag tccaatcgga gtgaattctg gggcttatgt aggaggtacc agagagaaaa 70141 ctttgctctt ttctgcaggc ctggaaacag gcacgctgaa actgatctac aaccattttg 70201 ccagtatgta gggcttgagg attgagccaa cagaaccagg ctgagaggca gtataaatcc 70261 aggccctgat cacattactt aagcacatgg atccagttac acctgaagct agagactttt 70321 caggttattg aaccaataca tttcattttt aatcaatata attttggagg taattttttt 70381 ttcactcact ttcagtggaa agagtgtcct ggctaatggg acaaatttta atggaattgt 70441 cttgggaaag aactcatggg ttctaaggga gtaagtgaga ataagaaatc atcatagact 70501 tcaggaaagg ttgcagaaag aggaaagggg atctgttagc tctctgccta ggacccccac 70561 aagcctgggt caaccatggc agagccaacc ataagagacc ctggcacaca tgttaccact 70621 attctctcta atatgccagc ctgtagacat gttaaaagta ttctctctaa tatgccagcc 70681 caggaaaatt atcctaaaca ttcaaagcca tcttacatga agctgttctt tgaagcattg 70741 cgtgtaacag tgaaaaattg aaaacagtct aaatgtccaa caatcagggg tcggttcagt 70801 gagcttgttg ccttcagaac agcatttatg aagtacgtgt aataacatgg aaaacactta 70861 tgatagtgtg agtgaaatca ggatatggag aatttgtaca gtcagtatga actcaactat 70921 gtgaaaaagg gggaaaaaag aattacaagc aggcaaaaca aagaaaagta aatgtaccca 70981 aaagttaatg cttggcaagc gggataattt 
tttctttcta ctttgcttta ttttctaaac 71041 ttttgtaagt gttattttta caatggaata aaattctgcg taatgttagc tgttggcgat 71101 gttgtacaca gacagtgtaa tgtacgtgga tggatagttt ttgaagtaaa ggaggaatga 71161 gggtgtatga ctgctcctct gccaggcatc caatcgggtt ccaggtgttt cactgtagct 71221 ggatgacctg tggcaaatcc atttgtctct ccagacctca gcactctcct ctgtcaatga 71281 aaaggcttca gaatagatgt cccagaaaca gacccaaata atatcataat gtccaggttc 71341 tgcccccgca tactcagcct tgtacttgcc caaggaattc acctctctga acttcagtcc 71401 cattcatggg gaccatctcc ccactttgcc ctgcctgtgg gtacacactc tttaagctgt 71461 aaatcactat acatagctgg gagtttgtct tgttagccat gctgctcccg ctcaggcctg 71521 cctcagccca tctgctgtca tttctgccca ctcgagttaa gccatctgca aatagcaaga 71581 gagtgttctc tctgatggct cttggggctg cgccctttct gctttttctg tggcaaagcc 71641 tgtttgccct ctctccagaa acacggctgg gcagacagga gccctgaaca gctccctcag 71701 acatgtacca ggctctctca tgcttggctc tgccatggtt gacccagcct tgtggggggt 71761 cccaggcaga gggctaacag ctccttggga gtccacagct gtatggtcca atacagtcca 71821 tagatttcag cagtggtctt gatgacccat gccccgaagc actgccttct ctaagtggat 71881 agacttgttc tcatttcatt aaagacccag gtcctgggaa gagaaagtgg cccctactag 71941 ctgatcactg aggcactgag tcatgacaat ctggctgcag aatccactcc tcaagggggt 72001 tcaggattag aatgtgcaag aacataggag gtgagagcct gagaggcctt tggagacaat 72061 ctggtccacc tgctcctggg acagatggag aaaccaaagc cagaaagagc tgtgatgtgc 72121 cctaggtcac atagctgtca gcagcagagt tcagacagaa gcccatcctc ctgcctgacc 72181 cctacccctg gcttgtgcac acaacccaat cattacctct gtctgagagg gtaggaggag 72241 gaaggcccag cagaggccca gagaggccag atcatctctc caggcctccg tgccttggaa 72301 gggagagttc tcgtgtctcc ttatccctac ccatagccct tggccctatt gtctccagtg 72361 cttcctgaag actgggatta tgggaggcaa agcactgggc gggttgctct agacatctgc 72421 ccctgggtct ttatggcagc ctctcctctg gcctctctga ggcaagacta ttcttcctca 72481 tcctgtttgc ccggagcctg ccgggcacaa ggtggccata gattctatct tccaaactgg 72541 gacacattga agcccagagg ggactcaatg aatagttatg cgggaacacc ttgtgtaaac 72601 tgaggctgcc ccagatacac ctggacctgt gcttttcctg gccaagccag gttcttttgt 72661 gacatgattt 
gagtaatgtt tccatcatct attttttcat ccctcttctt tctacctggc 72721 taatctctac tcgtccttca agacaagttg aaacctcaca gtttctaaga attatagcct 72781 tccttacggc ccaaaaacaa gtttaattcc ttttgcctct gcatccaaac ggcttggcta 72841 atacctttat tatagcacct gacacctttt cttggatcac agcttctcaa ccctggcact 72901 tattgacatt ttgggccaga taattccttg ttgcaggaag atgtcctgcg catcacagaa 72961 tcttttgtaa cattcctggc ctctacccac cagatgccag tagcaaccca ccctaccccc 73021 atctgattgt aacaactgaa aaaatgtctc cagatactaa ccaaatcatc cccagctgag 73081 agccactgtc taaatgttgt gtttgtgcgt ctttttttac actatgtggt gatttctgag 73141 agagtatttt atcatcttct gtgcttgtca gttcccagca catagtaggt gcttagtaga 73201 aactgtctgc tgaatgaatg agtgacttga acaagaagaa acgcagaaag gtggctaggg 73261 tgccagctcg gttggatgca ttctcagtca tgaccactag agggcagaaa aggcctgcca 73321 aaagcacaca acggagtcca ggaaagagga gatcccaatg ccagagccag gggtccttgc 73381 ctggctcttg gggctgcggt gacagggcca cagtctctcc tgctagttcc ttgtcatctc 73441 cccgatctat aaatattgct gttcccaggg ctcagtgctc agacctattc tctatttcct 73501 atctattctc cttgtgtgtt ctcatctagg atcatgattt taaaaatcat ctacatgctg 73561 ataactccca aatttatata tctagcccac acatctcctt aaatgccata catgtatttc 73621 aatcacctac ctgaatctcc atgtgaagat ctaattaata ggcctctgaa gcttaacaca 73681 gccaaacttg aactcctggg atccttccct tccctcctcc ccaaactcac tctatcccag 73741 tccttaccat ctcagcaaat ggtgactcca tccttccaat ggtccaggcc taaaaccttg 73801 gcctcacttt tcctctcact aacctatatt cagtctgtca gcagatactc caggctctac 73861 ctctaacccg catgagcatc ttatctgcgc taacaccttg gtgcaggcca ccatcatctg 73921 tcacctgggc tatctatggc agcagcctcc aaactgatcc cctgatttca cctcttgccc 73981 tttgtctatt ctcacggcag ctgaaggaat cctaagccat gaggactctg gtcctaccac 74041 ttctctagtg aagagtctgc taatggcttc catctctctc agacaatgac caaaaatctt 74101 acaaggccaa cagggcccac catgctccac tccctccatc atcgcctctc tgacatctcc 74161 aacaaccccc agcccacacc ccagcatgct ggttccctct tcctccagac acgccaggca 74221 caagcccgct tcatgtttat tcctgatgtt ccatggacct tccccagctt cccctcccag 74281 ccctgtttct tggcttattt tcttccatag cagttatggc aaactgatct actagatgtt 
74341 gtagtttctt atctgttgcc tttctatgtc caattgtggt ctcactaggg aaagggattt 74401 ttttttttag tctaatttgt ccactgctgt ctctccaggg cacggcacac agtaagtgat 74461 caataagtag ttatggagtg atgaatgagt gaatgaagag agtgagttcc gacaaatcct 74521 gagtttacta gggagaaatc cagtctcagc ccagttggct ttcctaagaa gatcttccaa 74581 gaaagagttg actttgtcca aaacagaaca gagaatttcc ctggagtttg agaaaagtgg 74641 agggagggag actgacagtt attggacacc tattagggtt aagaggatgg actctggagc 74701 ccaacacatt gggcttaaat cccagtttca ccgcttactt cctgtgggac tgcagttaag 74761 tccgttaata tcttcatgcc tcagtttcac aactggaaat tgaggatagt aacaatacta 74821 gcctggtagg gctgttatga ggattaagtg agttgatata tgcaaagcac ccagaataat 74881 gattgacact tagtaagagc tctctaagga tcagttacca ctgttgtgtg ctttacatat 74941 cccatttgat gcttataata atgctgttgg gtaaataatg ttactagccc attttgcagg 75001 tgagaatgtt gacactcaaa ggcaaacagt tgcttatccg agacctcaca gctagtgaga 75061 ggtgaccgca gtgcagctct ttctgacttc attcccatca aaaagggtgt gggtgtcccc 75121 tgagcttctc cttcctctca tccctcactc tgttttctgc tgctgccacc accatcaagg 75181 tggcctgtgc ttaacagccc ctgcttttta actcctgccc aagggagaac acaaactctt 75241 tccgactcta ctagcaccag caaagcaggc tgcatcttaa agagaactgt gataaacatc 75301 aaagcagcca gattcctcag gtgggggatt ctgatcctaa gctgatgccc ccctcatccc 75361 tctgtcccca ggaagaagag accaccagcc cacaggcaga cagcacctgt ccctgctgca 75421 cttggtaccc atccccttcc caggcactcc ccggccaact gagattcagt gggtccaaga 75481 gggagctgga atcagtattt tagcaggctc ccaaggcaat ttttgtcttg gggaagtttg 75541 aaaaagactg gcatagacct taagggcatg gactctggaa tgaaactgcc tagagttcaa 75601 actctagctc catcctttcc tgtgtgacca tggacaggtt tctcgacctc tctgaatttc 75661 tattttcttc tatgtgagat gtgggtaata aaactgtcta gcttatagag ttgagagagg 75721 attaaatgaa atcatgcatg taaagtgctt aacacaatag ccagcactca acaaacatta 75781 ttatgatcaa tattgttgtt tttagtatta tccaaagtga agcagcctag attttcagat 75841 tcccttggac ccaaagttca gtgcaggtgc cttcacacct ttggaggggg gggagctgac 75901 acacaccgtc actctggtca tgtctccctg gacctcacag accccacata accagaactg 75961 cacatgcagt agggtgaggg gccagctctg acactccagc 
ccctgcactg aggtgggaag 76021 ttacacagta gggtgaggcc agcactgggc ccagagcctg gcacaaagcc caggtgcctg 76081 ggagggcctg aatagcacct ctttgcctta gggagccggg gctagatgct gctgactggc 76141 tccaaatgca gcctctttac ctggagtcac acagcccttc agaaaatcag aggtgctccc 76201 cttcctgtgt ccatgtgttc tcattgttca attcccacct atgagtgaga acatgcggtg 76261 tttggttttt tgtccttgcg atagtttgct gagaatgatg gtttccagct tcatccatgt 76321 ccctacaaag gacatgaact catcattttt tacggctgca tagtattcca tggtgtatat 76381 gtgccacatt ttcttaatcc agtctatcat tgttggacat ttgggtcggt tccaagtctt 76441 tgctattgtg aatagtgccg caataaacat acgtgtgcat gtgtctgttg tggggtgggg 76501 ggagcgggga gggatagcat taggagatat acctaatgtt aaatgacaag ttgatgggtg 76561 cagcacacca acatggcaca tgtatacata tgtaactaac ctgcatgttg tgcacacgta 76621 ccctaaaact taaagtataa taaaaaaaat cagaggtgct gaggctcaga aggtgctgac 76681 ccccgctttg ggactgcgtg acactcggaa agaaaacaga gaggaactga cttcctgacg 76741 ggcatggagg ccacctcctg ctgcctccct gtcagagcag ggtggtgtgc tctgactccg 76801 cccctccttc aggaagggca gaggtgacca cttattgggg tgagacagtg cagggcaggc 76861 agccaggtgg agcaaaaaga gtctcagaca gggagtcaga gtggagacgg agggatggag 76921 gatcagagag tgcgagaggg agaggagggg aggaggagga agcagccgac gcccagcccc 76981 agcccggcat gcaacactgc ccctgcagcc tgcccgcttc tctcctctgc tcctgagaac 77041 gcctgccatt cgctgctggg ccatcctctc cctctagtcc aggcggttcg cttatgctgt 77101 tcccttaaca ggaacccgtc tcctcgcaaa ccctctttta ctgacttggc taattcctgc 77161 tcgtctctcg gatttggcct gacatcacgc tctggggagc ctccctgagt gctgcctccc 77221 ccggtctggt tgagtctagt gggcgacccc agcaagctgt acaccagaca acccctgccc 77281 tgcccccata acagtgtttc ctgaggacca gaaccacctc tttgcggggc acctcctcac 77341 tccgcggaga agagatgtgc tgtttcccgg cctcatccca gcacctccca gagcctggcg 77401 ctcctagcta ctaatggaag ttcccagaag gaagccctcg ccggatgcca ctcacctggc 77461 ttctcagtag cacctgagcc taagcccttc ctacaccgca ggccacattc cctctgctgc 77521 tcgtcccacc aagcccccaa acccaaagct gcaacccctc gaattctaca caagtgagct 77581 aaaggcgggg gcgggggggg gggtgtccca aggtggggga agaagtgacc taagatgcaa 77641 ggagagatac ctgagaccca 
gggaggcccc agggagcagc tgggaacatc cgggtgagag 77701 aaagagggtg ggggaggaga agggcaggag cgcaaagtga ccgccagctg aaagagccga 77761 tccgcagagc tgcagccagc agggcctgga gacatcagac ctggcaacta aaatgcccca 77821 tgaatattta acagccgtca atatttaatt agtgcagaca gattaactct gaaatcacac 77881 atacttgggc tgggctcccc aactcctgga gggggagtca gaaggagagg gatgggccca 77941 agggagaggc tctgttcatc accatatggc ccctagcctg ctcaccttcc cctccctacc 78001 acctccccct gcaaagagcg ggcaacactc acaaaagtca gaggggcagc gacaccctga 78061 ggagccaagg ggaaggtctg catctccaga gggttggggg tacggtgggg tgagggcagt 78121 gccgggcaga ggggccaggt cagcagctgg gcgggaggag gtggcacagc agccctttcc 78181 accgcagctc caccagctca gccgagggtg ggctccacat cagcagcccc gcacagagcc 78241 tgccagaccc cagcctgact tgcaggagca ggtgtcatgg tgtaattact tctgtgcctg 78301 tgccatgcct ggattatgct gtttattgca tcagcagtgg gaccaagagt ctttttattc 78361 tcttcttgtt tctttgaggt ttatcacatc ttgtgccact tttttattta atggagtatc 78421 cctggacttc atcatctggt cataaattgc cacagttgat gtcctttttt ttttttttta 78481 taaaaaacat ttggcccagg acttgtaaaa tgtcctagcc aggcctgggc tctgcccgcc 78541 tctccatgct ggagtttaca ttccagcaca aagatggtct taggttccac aaacaaccca 78601 gcttgcttta tgaccttgtg cccttgtgca tcctgtgtgt tccaccttga gcaccctacc 78661 ccttcctcct actttaaggt ttgactcaga ggcctttttc ctccaggata ctttcattga 78721 tgtcctatct cataacaatt tgtttgcttt cctgactccc ccaggatgag cgcctcaaag 78781 acagaaatca aaacctgtac atacatcctg cctaaccttg gaaagggcac ctagtgggtg 78841 ctcagtaatc ttcttttaac tttaatgaat taaagtaaga agagatttca gatagggact 78901 ggagaactgt caaacacatt gtgtagacgg cttgtgtatc cgttagctat ggccgtgtaa 78961 cagccacaaa agtctcagtg gtgcatataa attagccttc attctcaaag gtattcaagt 79021 cagctatggc ttggtgatct aagttcagcc aggcaattct gcttcaagcc acagccctgc 79081 ttctcgctgc aagatggagg gttgactgga aaagcctcct ccaagcatct gtcacccacc 79141 ctgggctggt gggctagctg ggatatgctc attttatggc agtgggagag gtacaagagg 79201 agatgcccaa acacataaaa gcttctgtca tattactaac atctccttga ccaaagcaag 79261 tcacatggtc aaacccaaag tcaggaagca aggaagtaaa caggacatga gaagacatca 79321 
tgaaaagggt taggggaggg ggtgtaaaga aatgggccag tatttcaact gatcacaggt 79381 gggtccccat caccccatgc acaggatccc agaatggtag gaccccagtg ggacaggaaa 79441 gagaggaatg aatgggctca cacttgtagt ctcatgagct tctccttctc acctcctccc 79501 cacccttttt tccccgggca tctgggttaa caaaacagca acagagacaa aataagttta 79561 tagaagacac taatgtgagc cacaggttat ggtaccaaaa gcagagagaa ggggacagag 79621 ccgatgctga gaatgcaagc cgaagacaga cactggatgg tggaagatag aagccaggga 79681 ccaggacaaa tgatgacctt tagccacaca gtgcagggaa caaagcctta gccctcaagt 79741 cagcaggcct ggctatggct gtttcccacc ggtatgataa ggactttgga ctctaagaaa 79801 ctgaaaaccc cgatgaccct gatgtcatga ttcctcaacc tctgcctcgt ttctttctcc 79861 tcaacaggca cattctgaaa tcttctctaa acccctcaca gtgaaaataa gaccaccctc 79921 tgtctatgtc attagactcc caaggcttca aggcaaaaaa ggattgtcag aagtaggtca 79981 ttggcaggtg ggtggttgtc tttccttctg agatgtggaa tccaatattt agagtcagaa 80041 taagcagctg gtagatggaa aagaatcctc ttcggaatca gaaaaatacg ggttcaagtt 80101 ctggcaccac catctgctag ttgagtcatc ttggagagtc acataaactc ttgagcctca 80161 gcttcactgc ctgtcaaatg ggctgatcat agtaaccact cggagctgca gaaggggtgg 80221 ggattaaatg atagaaggtg tgtaacacac tcagcctggt gcctggccca acaaaggagc 80281 tcaacctata tcatttcctc tcccttttct atccccagct cctgaatctt cattgattct 80341 gcctcccctg cccccaaatc tatgaaatta aaattcctgc caacagttta ataatttcct 80401 cagctaattc ccttcagacc ccaaggccta tatccggaca tggtgatttg gtccaggtag 80461 gcgtcatcca aatgttctaa gtattctctt tcctaccttt aatcaactat attgacaagc 80521 cttctgtccc tttctaattg cccatcccta ggattttttt catttccttg ttaaagacca 80581 atattacaaa gcacttcagc agctcagcca tttttatcat catcattaat tttatttccc 80641 ttctcattta ttaatgggcc aggcggcttt caagtatctc ttttcccctg atatatttgt 80701 aaaacgccct tcttattccc tttggaggcc ccaaggctct ctgcagcttt tatttcttcc 80761 cttggctgcc cttctgtttc tctgaagcct ttgctgactc cagatcttcc ttttgtgccc 80821 cccgcccacc ttcctcagcc ctccacctct cctccctccc tccctccctc cctcctcaga 80881 cttggtcgga ggcctggtct ctaagcctgt ctctgtgttg aggcccacat ggctctgaga 80941 gcaccaagga tttgccaagc tcctgcccca tccctattct gctcacactc 
tgtgagcaga 81001 gaggagcttt tatccatgta gctgatcagg aggaaaggac cagaaaagcc tatgtccccc 81061 ttttattccc gcccacccca cccatcccct tacctcccag gactcctgat taagggctct 81121 gtacacaagg ggcatccatt cattcattta cttaaaacaa catcattgag atataattta 81181 catacaatac aattcaccat ttttctttgt ttgtttgttt tgagacagag tctcgctctg 81241 ttgcccaggc tggagtgcat tggcgcgatc tcagctcact gcaaactcca cctcccaggt 81301 tcacgccatt ctcctgcctc agcctcagcc tcccaagtag ctgggactac aggtgtccac 81361 caccacacct ggctaatttt ttgcattttt agtagagacg gggtttcact gtgttagcca 81421 ggatgttgtc gatctcctga ccttgcgatc tgcctgcctc ggcctcccaa agtgctggga 81481 ttacaggtgt gagccattgc acccggccac aattcaccca tttttaagca tttaattatt 81541 aaatacaatg attttttagg aaatctacca agttatgcaa ccatcaccaa ccatccccct 81601 ttagaacatt tttatcacct tcatttacag ttaattcaac ttctcacctc catccctaga 81661 tgacccctaa tctgctttct gtctctctag atttgccttt tccaggacat tttatatgag 81721 tggagtctta caatatatgt ttgtgtgtgt gtgtctgcct actttcattt agcatgtgtt 81781 tgagattcat ccatgttgta gcatgcatca gtatttcatt ccttttgatt gttgaacaag 81841 atcccacggt atggatatac cacatcttgt ttaaccattc tcctgttgat ggacgtttga 81901 atagtttcca ttttgggcaa ttatgaataa agctggtggg agtatttgca tacaagtctt 81961 tgtgtggata aaaggcttca tttctcttag gtatatacct aggagtggaa ttgctgggtg 82021 gtatagtaaa tttatgttta attttttaag aaactgccaa actgtttttg gaaattggct 82081 gcaccatttt acaatccagc cagcaatgcg tgaggatccc atttcgtccc atcctttcta 82141 gcacttgtca ttgtatgttt ttttgataat gaccactgga gtactatgaa gtggtgtctt 82201 gctgtggttt cgatttgcat ttcttgatga ctaacggtgt caagcagctt ttggtgtgct 82261 taattcattc attgtctcaa cgaatatttg tggggcacct catatgtgcc agaactgttc 82321 tagggaccag tgagcaaata gataaaattt tacagggcac tcatattcct gttggagaga 82381 cagacgatac acaagacaat aaatacatac agtatcagat aaggacaact ggcaggccag 82441 ggagaaaaaa gattaagcag tcaaggggta tataaaatgt caaaggttga aatttcaggg 82501 aaaggagtca caaagacctc actgcatcaa gacctgaaga aagtgaggga ggtcttcatg 82561 ggagcatctg ggagaggagt gattcaggga aacagaacac attccaaagg ccccaaggta 82621 gaagcaggac cttcaaggac tagccaggag 
accatgcgtg agaaaaaaaa aaggaagaaa 82681 gtaccacgcg ataaagtcgg agaggaaaga gggagccaga ccttgtggag tctgtaagtc 82741 atggtaaggt gtgcatgtgt gcatgcattt gcaagtgtag aacagagaaa gaagggaagt 82801 cactgaaagg tttggagtga gcaatacaat ctggcctgtg tgttaacaga gttgctctga 82861 tggagaatgg accatgatgc tgggtgcagg gacagagtag aaactcggac ggtggggaga 82921 tcagttaaga ggctcttcgg tgacccacag cacagagatg atggtggctc agaccagagt 82981 ggggtagcag gaatggtggg aggtgcttgg attctggaaa tattttagat gcagagctgg 83041 caaaattagc tggctgatta tagcttggga gaaaaagaga agtcatgaat gaccccaaag 83101 tgttcagcct gagccccccc aaaaatggac ttgccattca ctgagacagg gacaatgcag 83161 gaaaggcagg tctggaaagg cagggtcagc agctaagctt agggcacgtg gcatgagacg 83221 cccagaaggt ggcctaaaga gatgctgagt agaccgatgg atatacagat cccgagctca 83281 ggcgcacatc ctggaatcat cagcgtatag ctggcattta aagccatgaa acacagcagg 83341 ctgggaatca aagaattatt tcattagtat ctgaggcagt gctcaataac ccccatggag 83401 gtgtttctgg gataactgag ctggagagca tggctgagcc tggaggtcta accagacttt 83461 gatttcctct tctgattctg aaagcaggcc tgaattattc acctgacatg gggtctagag 83521 gatgaggcga ggccttcctc ttggggacct gccttgtctt cttgccaact ttatatggaa 83581 ctcagaaata gggcctgcct tgggatgtgt ccatgaggat tgctatgggc tgaactgtgt 83641 cctcagtcca aattcatatg ctgaagccct aatcaccgat ggaactgtat tcggagatag 83701 ggcccatagg gaggcaatta aggttaaatg agaggattag gatggggcct gataatatag 83761 gattcttgtt cttataagaa tagacaccag agctcattcg ctctctctct ctctctccca 83821 cccttcccat gccccccacc ccccacccct caccatgtga gctcacaggt agaagactgc 83881 cgtctgcaag ccaggaacca aatcagccag gaacttgacc ttggttcccc agcctctgga 83941 actgtgagac atgaatttct aaggtttaaa ccactcagtc tttggcactt agtgacaaca 84001 gtctgagcag tttaagacaa agacacatgc agccccacag ggaatgttac acctaggtgg 84061 cagcatatct gcctactttc taatgttccc aggagagtcc agaatttgac ctgtcacttt 84121 acaactcctc ttcctttgtg gtggaaggag ggaaaggaaa agtggagagt gagaagcaag 84181 gttgcagttc tttccccagc aaaactctct caacagaaag aaggaaatag ggggctgggc 84241 gcggtggctc acgcctgtaa tctcagcact ttgggagacc aaggtgggaa gatcacaagg 84301 tcaggagatg 
gagaccatcc tggctaacac ggtgaaaccc tgtctctact aaaaatacaa 84361 aaaattagcc aggcatggtg gcgggcacct gtagtcccag ctactcggga ggctgaggca 84421 ggagaatggc gtgaacccag gaggcggagc tggcagtaag ctgagatcgc accactgcac 84481 tctagcctgg gctcccatcg tatgtaacat agcagtgtta ctaatgttac taggtcatat 84541 ccctaactac tattgttact agtgttacta ggtcatagcc cctaccctgc ccaggagcac 84601 atgcagcctc tgcacccagg cttctgacct tgcaccatcc accagtaatg taggaaggcc 84661 atagcggatg ggagcccctg cactcctcac tctccattaa cactctttca tggatccctt 84721 tagcatccac catgactgat ctctctaaca catcctcctt accttgcttt ttgaacttgg 84781 ctctcatcat tgaccttccc agccctagac atttggcagc ctcttgccca tgattttgct 84841 tttcccgtat ctgagctgtc tcaggatacc ttgggaattc atcatctttc tcttccatcc 84901 ttctaccctt ccactctttc accagccatg cactgaatat ttgtgcccat ccgccaactc 84961 acccccttca acttatgttg aagccctaat ccctgtgtga tggtatttgg aggtggggcc 85021 tttgggaggt aattaggtca tgagggtgga gccctcatga atgggattag tgccttgata 85081 ggaagaggcc aaagagctca ctctccttcc accatgtgag gacacagcaa gacgatggct 85141 gtctataaac aaggataagg cgctcatcaa gaacccagcc ctgcggcact ctagctagga 85201 cttccaatct ccaaactgtg aaaaataaat gtttggggtt tttttttgtt tatttgtttg 85261 tttatttgtt tttgagacag agtctcactc ttttttgccc aggctggagt gcagtggcat 85321 ggtctcggcc cactgcaacc tctgcctcct gggttcaagc aattctgctg cctcagcctc 85381 ccaagtagag tagctgggat tacaggcacc caccaccacg ccgagctaat tttgtatttt 85441 ttttagtaga gacgggattt tgccatgttg gtcagactgg tcttaagctc ctgacctcag 85501 gtgatctggc ccgccttggc atcccaaagt gctgggatta caggcaggag ccaccacacc 85561 cggccaaatg tttggtgttt aagccaccca ggatacagaa gtttgttatg gcagcctgag 85621 ctaagacacc cccctcctct tcacacccac ctccccttca cactcatcac ctaatttgag 85681 cctatggggt tgtgtatgga gtctttatag gactgaacat gcttcaaacc aagtcatcat 85741 acagacacag aaaagcaaaa agatatgatc ttgtaaaaga aatgtctcct tctcccaaat 85801 taccaggtat gtggaatata atatatgtta ccttatgaac attttttaaa gattattaaa 85861 tttaaacaca gattgaggta tttgaatttc aaagactatc tgtacaaagc aaacaaaaaa 85921 aaaaaagtcc caagattggg aaaggagttg atataaatga taactgacaa agtacagccc 
85981 cgtggttctc agactagagg attctgtagt tcagcaattt tttttttttg taacagcaaa 86041 aaaggaggtt gacagataca tcattggtat gaatggagaa ggaagaaaat aaaagtcata 86101 agtgtgaatg agatgacagt taacaagcta aggagtttgt tgtggtttat gtatatactt 86161 ttaagttctg ggatacatgt gcagaacatg caggtttgtt actatacatt gatgggtgca 86221 gcaaaatctt taaagagaaa attggaggat tcacatagca ttaccagctt tatgcagttc 86281 taagtgtaga tacccgagaa atcatctcaa gtctataatc actatctttt cacgggtgac 86341 aggatatttt caaaccaaga aagcaggaaa gatataactc cagggaaagg acaatcctgt 86401 tgataatgaa taaatttaac tttatgaaag actcactcat gtttttctca tttctccaca 86461 agctgatgaa aactttggca cgaagcactg gaatgccgaa gttaaaccca tattaattcc 86521 tacaaaatat gcagtctcgt aagtcttcga gcttcaccca gatgctgtgt tgccttttag 86581 aatctgcatg catgttggaa aggtcccttg atgacaggga gagggaaggg agccctacct 86641 tgccctgggg gtcttctttc ctttaagaac tgattccctt tggatttttt catgggtcat 86701 tattcttggt cttcccaagt tgcaaacaag ctatttttcc atgacaattt tcttttcttt 86761 cacaggtttc tgataagacc caacattcta agtcactggt tgctagcttt ccaagaataa 86821 ctacttcttg gcaaacatta gttaagtttt ggattaaggt taagctgctt tctgttgggt 86881 tccgcatgtg tgtcttagct ttagacaggc aaacacctag ttctccattc ccacgattct 86941 tattgatcta tgctattttt caccatgatc caggttgctg ttcataagcg tggtcagcat 87001 accttcctgg agaaaagcac acagtgggag ccaatggtat gaggaaggaa gttgccttag 87061 agtttacaaa agtccagctg tcacagaatg atctgattgc tgagtcagaa gggatgtgaa 87121 actctctagg tctttctggg cccatgaaaa agaagcttct gaactgagaa agtggacccg 87181 gtgtgctcta gaagaggtca ggcttctgtg tgctccatcc cttgctccac tggccgcagg 87241 agtccagcaa gtcacttaac ttgagtaacc ttggtgttag cacctctaac aaaacacaat 87301 accacccgcc tcttgggcca tgtgagcatg caatggagtt gcacacaggc tcactgacac 87361 tggcctggtc ccagcatggg ctcctttctt tccttccctg ttctcttgat tccaggggag 87421 gctgcccagg cagagcaacc ttggagaaga gagacttgtt aatggctcaa ggccaacatc 87481 tagacaaaga tggggaagag tcaagagtat catgagtggc ttgtccctgg gtctggctcg 87541 cttgtcacca ggctctgtag cactgagttg gttgcacata cctgcctcct atgctgaggt 87601 tctcctgtga tgcagggagg tccctgcact aactgcatga 
acttgggtca ggatggtgcc 87661 ctcctgctta cacacagggg ctcctgcagg gcagatgtta ccatcatcca cactgcatac 87721 agagaggagc agagaggtct ggaattggcc caaagtcacc agctgacacc tgagagactc 87781 cacattcaaa accagggttg tccttatccc atgcacttcc tgtgtcacca ccacctagga 87841 aaggagaatc tacgagagag aaatgacctg gtggggagag ccctgtcccc aaggttgcag 87901 gtcaaggaaa gaaacctctg aagtgttcaa tgtgtcccca ggttgaatgt ggggacaggg 87961 tgggggagga gttccttggg gagggtgcta tttaggtgta gaatcagaca taagaggagg 88021 ttgagatgtc agggaaatga tgaagggtga agaatggaag tcagaatctt tatcgccacc 88081 actttctctc ctccctctct ccacttcctc tccctgtttc tgtgtctccc cttcttcctc 88141 cttccatctc ctacctcctt tccctcttcc ccccttgccc ttcctgctcc ctctcttccc 88201 acaagcaacc cacacacgct cctccacagc tcttcctcca ccctcctctg catggcctcc 88261 tcctcctgct caccccttgg ccccagagag cattctgcca ccttccattc cttctgtcat 88321 cttatctctt ctaataattg cacatttcct tgagacaaat tcgaaggcca aggagagata 88381 aaatgacttc tcagtcagta accccccgag caagtttctc taaagtatgc tttcttgaca 88441 aactagcagg ccttgtgttc tttgtgattt caaaagaaag taatgccact tgtcctcagt 88501 ggttacagaa ttaatccatt agtcatcact gagtgccaat tatgtgcttg acactgagca 88561 gggcacttta cctctacagt ggcatttaat ccttccagta atccaggaag cctccctccc 88621 ctcattatac aggagaaaaa acgaggtgca gagaagttaa gtgatgtgcc cagcgtcaca 88681 aaacaagcaa gtaatagaga caaacataga agcagttaat taaaaagaag tttctacacg 88741 ctatcattga gctgcctaca aggcatccag ccaggacctt ggatatgagc aacagaagca 88801 ttctagttca cttaagcaaa aggaaacttc ttgataatct atctaaggct caaaagaatc 88861 tgcaggaagc ctggagagtc cagcctggac atgggtggga accagggcca cttggaaggc 88921 caggtatgaa aacagggaaa tcaagacctc tttgtcctgg gccctgtgct gtggggacat 88981 gagctccctg tatttcagaa gcttgatgcc tccttcctaa ctccagaaga cagctatgct 89041 tgttcacttg ctgtcctttt ggtgatagaa tcacaggctc caggttatca tcctatcaga 89101 ttgtagctga tggggaagaa gtaactcccc aacaaatttc taccagaagg gtgacacaga 89161 gaagagagca tcaagtagct tggggcaggg agacacagat gacagcaggc tttacagaac 89221 agcaagtctt aaaccaaggc tgcaaagggg agccatggaa tcctaatgac agctcactgg 89281 gagggcacac acattctaaa 
aaagctgagc tgagactgcc ccacgcttta gcacattccc 89341 tcctcccagt aacctgttga gacagggact gttagcaccc cattgtagag ggaaggaaac 89401 tgaggtgcag aaagctgaag tttcttaccc aagaccccac aggtataagt ggctgagctg 89461 gaattcagtc caggcctggc tgactccaga atctaagcac ttaactgctc tgctagcaaa 89521 tattgcctca ctataacaga gcattaaaca tgtcaaagtg ctctaagatc cattatctca 89581 gttcacactg caaccctgca aagtagatag ggcaggtgtg atgatgcaca tttcccagcc 89641 gagcaaatgg ggcccagaga tgctaagcga cctacagaag gtcacacagc tggtgaatga 89701 gagagcctgc cctgccacct cctggctcag caccctcctt gctgctaata tactccagag 89761 gactctgagt gcccagttga agagaggatc agagtcagga caagaaagag acattcattt 89821 ctagattctg gtatttaatt cccaggcagg ccacctgacc cttcgccctg agaaggcacc 89881 caagtgccat caagcgcctg gtgccctgcc gcctcaaggt ctgggcacga ttaaacaaaa 89941 acgagggatt gttgcctaac aaattagcaa caggtaaaag caatcaagcc atctgttgcc 90001 tcccgccgct gctgcctaat cccctttctc ggcggtgacc cccgctctgc gctcaaacac 90061 aaattaataa actggccttt gcatatgtat ttgtggcttc aggtgtggag agagccacat 90121 gagcaggtgg tgggaatgca gttgctgaag atgctctgct catgcaaatg tccttaggta 90181 gggagggaag tctgctgaag ggcaatggga cagggtcagc tgacagagag gatgagccct 90241 gagtggccag tgaaggtggg gcttacgggt tagctctctg ttccacgtgg gaagtgtctc 90301 tatgccttgc tcatccctct gaggtcttgc atcgctggtc ccattctcca tattaaaaac 90361 actgaggcca agagaggcta tatgatcatc ccagggacag cggaaggcat ggacaaaggc 90421 agaagtgagc cccaggcccc ccatgatggc cagctttcct aggagctggt cttaggccac 90481 ctcccagagt ctcagtttcc tcactgcaag tttctaagat atgtaagtgc atcagcaatg 90541 cttaattgga cttgaccaga attaatatta atcgtgacta gtatttattt caccttctca 90601 ccatccttag aaagtagatg ttactatgcc cactttacaa atgcagaaac tgagattcca 90661 gagctgttaa ttgcaattat ttgttaattg gtacttactg tggtctaggc tgggctcagg 90721 ctaacagtag gcacttggat gcaaagatgt acagggaagg ccttattcct gtcctcacag 90781 cactgacaaa gtcagttaca agtgctttgg attaaattaa ataatggggg gggatgggca 90841 cagataggtg ggtcagggct atgcagcctc tctgaggccc aaaggatgaa aaggaaccaa 90901 gcagacaaga agagggcaag ggcattccag gaagaggaaa cagcatgtca gaggccctga 90961 
gaagggaaag gatatggtgt ctggggtcag tgtgctggtg cccagagcaa gaaaaaggtt 91021 gagctcgtgg ttagacccag atcagacagg gcctggccag ccctagtgga agctctggat 91081 tttctcctgt gtgtagcagg aggagggcca cctgatcagt tagacaccaa agacaggctc 91141 ccttcactgc gccacgctgt cccctcacag ggctgctcct gcaggagtaa aggaaactag 91201 tggtcacctc cttggaaagc cttctcatag ccccagattg gaccagagtg aggaccctgg 91261 aagttcctag actagggttt tcttctcctg gtcacacagg agccacacag agctaggaca 91321 ttgtctttcc cattttaaaa tcccatgtta ctccggataa ggacagcaaa ggaggaaagg 91381 aaccttttct gggccaccag aaggatgagc ttgggcttgg gagacacatc tctaagcatc 91441 cagaaatgtg tctgggagtg gggatggcag agctcccatg ctgtgatctc acagggggac 91501 agggctggga agtccctttt aacttctgac atcccagtcc aggtagtggc aaagacctgg 91561 gctctggagt accccccatt ccttgctgtg tgacctgggg cctttccaaa ctctgtgtgc 91621 ctttgtttcc ccacttgtaa aatggacagt ctccctcaaa gagcgttggg gaatagaaaa 91681 cagctgcagg caccgagccg gatgtctgca ctgagtaggt gctgagcgac attagcatta 91741 acctccatct gcttccaggc tgcccctctt ttagccagtc agcatcaaac ctacaactcc 91801 tattgtccca gagccttgct gcatctgaag tttcaaatcc cttgttaaaa ataagcttca 91861 aaacaagcca gaagaggagg tgggaaagga aaataagaga aggagagagg tggaaggagc 91921 tttgctcacc tgaagccaga cctctaattc tgctggtggc cagggacagc cctgccgcca 91981 gtttctctca caaatcaaag ggctgtggag taatcaactg caaagtgctg agttctttga 92041 gggtagaagc cagaaaggat gtgcttgtcc ttctcagctc acaggagtgc cttgaggaga 92101 tgggctgggc gggcctgtaa ctggagttgg aggggagagg agggtggtca agctggagaa 92161 agggaggaga gttgagacca acactggctg tgcagtcaga cctgatcctc ctcctgtgat 92221 tctggccagt tacccacttc cttaaggctg tgttgtctta tctgtgaaat ggatcgatag 92281 cagctgcctg gcaggcttgt tgggaggagg gtattgaata ccagattcag agccctgtac 92341 cctgggtgtg gcacacagta ggtgcagcac acaagaggtg ctccgtgagt gttgcttctc 92401 attcattcac ccctctctgc agggaaaggg gaggactgga gtgggctcaa gtgtccaaat 92461 gccaggcttg ccatctcccc accccattga gctgtcttcc ctcatgtgtt ccttagaagc 92521 atcttcctag tcttagggtg tgaagttgca tgtctaggct cctacgccat tattcccctc 92581 ctttctctct cttttttttt tttttttgag acagagtctc gctctgtcac 
ccaagctgga 92641 gtgcagtggc gcgatctcag ctcactgcaa gctctgcctc ccaggttcac gccattctcc 92701 tgcctcagca tcctgagtag ctgggactac aggcacctgc caccacgcct ggctaatttt 92761 ttgtatttgt agtagagaca gggtttcacc atgttagcca ggatggtctc gatctcctga 92821 gcttgtgatc cgcccacctc ggcctcccaa agtgctggga ttacaggcgt gagccaccgc 92881 gccatgctcc ctttctcttt ttattctgtc cccacggccc tccccactgc ctttctcctg 92941 agctgaccca gccccctcct aataggcttc actgccccca gcctctgagc tgcccccagg 93001 cccactgagc acacgactgc agatccacca ccctgaggcc caggtctgct ttcccctggc 93061 ctcggcacca ggctcaggca cactccccat cacaatggct ccacttggca gccaaagcca 93121 gccactctct gggtgctgct ggccttatgg aatgaccatc acagcacctg gcaccagctg 93181 gcacccaacc aaccgtcaac agccctcacc gcacaacccc ctctacccgc tctcagcact 93241 gccattccca acgctcacca tacccgatcc tgcctcttta tttttgctca tgcaaaaacc 93301 caacatcctt cagcaatcag ctaaaatgct ctctccctcc agaatccccc cagaatattc 93361 tctccttcac actgtgcaga atttgggctc cttattctag cactcatcaa agtctactta 93421 gaatagggcc atttgcatag atgggtaatt tgctaagtct gcatttaagt ttgtactaca 93481 tggaacagga tgttagctct gaggtgccaa tgagatagat acctggaaac aaagtgagtg 93541 tgagttgggg ctactccagg gcagtcaggg gatggggtaa catgaattgt caagggtgtg 93601 gtgatgctgt gggacagaag gaagtgtgtg ggtatgtgca caccagtctg catatgtgca 93661 ctggggactt cacacacaca gagctccttg cagtcctaag tgagcattca ggcctcattc 93721 tctctctctc tctcttctct ctctctctca cacacacaca aacacacaca cacacacaca 93781 ctgcacatat aatgcaaaga catcttgtag atcttcacat tcttaaacct catgattggt 93841 acaaattaaa caaatattga aagaaactcc agcagagctc tccaagcctc agtcctgctg 93901 ggtataccaa aacacctcca gcagctgaac agcacaatga taaggaatga caatatgttt 93961 taccctttct ggagtgctgt tcacaagcag ctcttagatc cttgcgatct gctcccatct 94021 ttagggagga ggcctgaggg cacagtctgt ttgaggcaga gctggtgttc aaagtccctc 94081 ttgggattct gactcccaag ctctacgggc agaagctgtg gaaggcttta tatcatcagg 94141 tactgcatct ccttgggact ctctccacaa gcagagttcc tccaagcctc acggatctac 94201 gcctccttct tctatgagac aagaaactgg aattttccat gtgagctaag taatgggggc 94261 atgagcatgc acatcctccc ttgaggttca 
gcttagacac aacttccccc aggaaatctc 94321 tcctacaact ctcccctcat ccccaggctc cagtatgttc ccatagcacc aaatcttggc 94381 aggtagtata gtggcctgtt tgcaggtccg tcatctggat cctgcagcag cctttgagcc 94441 ctaaccacag ggacagggtc tcattcacct atgggtccag gtcctggcat atcttagact 94501 ttaacaaagc cttactgaac atatggttgg atggatgact ggatggacag atggatggag 94561 ggatggatgg atggatggat ggatggatgg atggacagat ggacagatgg atggatggat 94621 gacccaggat gcaaatgaac tctatgcaca gctacagcta tagaagcaag gctcaatcac 94681 ttgcgggtgt atggggagag gaggaggagg gagagggaac gagagccggc ctttgtcctc 94741 atctcttcca cctggcaatt ttcccaagga agcatgagtt agtcttctaa ggaaagtaga 94801 ggggatgtga atagagaagg aaaccccact accttcagtg gggtccaaga aggcagctca 94861 gccctcaatc ctagatgcct cctgagccca gagatcagtc tttagataag aaagtgagtg 94921 ttctctgtca ctgcccctgc cagggggaga gaggcccctc ctatacctgc tcctagtaca 94981 ctctgcaccc attaccagtt tccctggctg aaaatctcaa gagatttttc ctcaagtact 95041 tgtatctaca ggagtttaag aaatgatcat gcagagaatt gtcatccccg actcctctgc 95101 aaaccctaca gaggcttgga aaagttcatc ttccccagac tcccaacatt ctcttttctg 95161 tggccagtag ccaaccatga gcgagttcat taatattaat gttaataatc tcctacattt 95221 gcttcacgct ttcctgctta caaagcactt tcagatccat catctcagac aatcctcaac 95281 accagcctga gaggtcagcg gggcagtgat taccatcccc atttcactga ggagggaaca 95341 aaggtccggg aagagaaggg atcagcccaa ggtcacagag cagccaggac tggagagagc 95401 tcagggcaga gtccaggtgt cctgaccccc aggccagcgc tgggttactc cacctccatg 95461 ctgacaaccc ctttccttct ctgcctcatc aatgtcacca ttggcgtcat ctcctggttc 95521 tcagtcatta aaaagatagg cttaaaataa atacaagagg caggctttag ccatctgcta 95581 aaatgggtaa aacgacctgc cagcaatttg ctggatgacc ttgggctaga cagtttccta 95641 tctgggcctg caattccttg tgtttcaagt gacagtgctg gcctcttctc ctccggccct 95701 acccactgtg acgggaggct gctggaacct aagggtgcca cccttggtgc cagactgggg 95761 ctggcagcaa tcatgcctct gcgatcccct cctcagcact ctggagtggg agggatagac 95821 acctgtcatg agcctgtgac agccaaagga tggagctata aagcaacaga ggactggcaa 95881 gaaccctcct gagcgtgggg gtggagctaa tcacaattgc tacagtgtag ctactctatg 95941 acaagtgctt 
tatgcatgtt attgacatgc ctcaaaatag ttctaccaga cagcaattag 96001 caattatttt cattttaaag atgagacact ccaagctcag agaggtttct tgacacgccc 96061 aaactcacac agcaaacagc agaggcaagt tccaaaccca gatctgctct gatttctcag 96121 tcccaaatgc tagaaggtgg agtgggaggg gtgcacgtgg aaccccctaa gttcaccctt 96181 cctccatttt ctttctccca acctctctcc cctcccttct cctctcttcc tccctccctt 96241 tccctggagt cacagtccca ggctgggtct cctctgctgc accagctcgc cctgtcatct 96301 cagcacaggc ggagatttat ggatctcact tcaaggattt ttctttaata ataacattta 96361 taatcatttt agaaattaag aactgaggat gcaaagcatt ttaattaaaa taatccgcag 96421 ttgataatta taggactctc ttagctggaa ccatcttctt tcaatctgtc acactcccta 96481 aggaaggcag gagataatcg tgaggcagtg agctggtgag ccagccctgg tacctgtgtg 96541 tgataccgtg tgtcccaggg gtcattctca gaggctgcct gctagggagg tcggcttcgg 96601 ggtgagctgg gctggcatca tctttagagt tggtcccctg tgagccatgc aaggctggct 96661 tttcacagga ggataagccc ttccagagcc tcctccccac ccctacgaca cagtcttctg 96721 ctggttaccc ctgtgggctc agagggtctg agacccaaga cacagaggaa cgtagaaccc 96781 tcgctcggcc aatgccctcc tgctaatgcc cccaaattcc ctgcctctct gccccagccc 96841 caacctcacc ctcaccccag gtgcagttct gccctctgtg gtcaggagag ggcagccaag 96901 gccaccaaat gcagacagag tccctgaaga atgggcctgc ttattggact tagagaagga 96961 caagagagga gccccggctt ggggaggtgg gatatgaaca agccaaagtg accccactcg 97021 atgcctacca ctgagacacc tctgctcagc tctggcccca ccaacagaga atgacacaga 97081 tatctcttcc cttcgaaggt cacattccag cagagacaat aaacatgtaa attaatggta 97141 acagaaagtg attcatgccc aaaatgaagc tgcagcagtg agggtcccaa gagtcttcag 97201 gtctgtgtgt ggggtcatca gggaaggctt cactgaggag gcaatgcctt ggcagggttg 97261 tgaaggatga ataggagccc accccggctc agtgacaatc ttgaccacta ggatttcttg 97321 actgtttagc atgtgccaca cactgtgttg gtctgttgtc acatctcatt caagcttcac 97381 aacatgcctg aaagatagac tttattatca gcacctttgt ttttctctag ataaagagac 97441 agacttggag aggttggatc atgcactcca ggtcacatag taaataagag gcagaggtag 97501 gcattgaacc ccaacccatc agtgccaacc cctgtgctta tcctgtgcga cgtggggccc 97561 tccaggataa cagatttgtc taggaagact tatcccccac agaggaaatg tgtacaaggc 
97621 tgcagggccc aaagcgagtg acagcagctg gtaagagagg aggcaggggt gacagctagg 97681 tgtgtcctgc cagaagacca gtttggggaa agcttaactg agtgactgtc cctttctggg 97741 tcttagtttc cccatcaata agtgtagggt ggggctgtct ccagggactc tgattgtaat 97801 tgataacaaa taatggcatt tggattggtt gggactgctt tagagagctc agagggtctt 97861 cagatttctg ggcccactgt agcatcccaa gctctgggca gagtgcctct cccctgcccc 97921 aacactcatc ctgggagaga gggggtgcac agggaaggtg ctcctcacca gggctccacc 97981 ccaaatccat taagggcaaa aatgtaatcg ggaaggagat tctgacagtg gcttgaaagt 98041 gacctcactc gggctgatga atgcctccgt ttcctcgcgg aaaacatgcg ggacatgagc 98101 cggccagcca ggagaggatt ggacacctcc agcaggaagt ggcagctgtc ccgccactga 98161 ggccagacct gtccctccac cctctgcagt ccaagcagct tccgtcccca cccccactgg 98221 agcctcccca agcagtggac aagaccggaa aatgattata taggctcact catggataaa 98281 attggttctt ggtctgagaa gccacaaagg ggagatccaa gatgagcacc atggaggtcc 98341 tggggctggc acgctggcag cgaggcagca ggagtggaag tctgtgccac gctggcccag 98401 gacacaccat cagggccacc cttctatccc agtaatcaca gcttaggcag cagccccaat 98461 cccctgatca gcaccttctc tgcaccctca agagggcaga aattggctgg gaatactgag 98521 aagaaatttg ggcctcccag acccagggtt tctggggaac agctctgcca gggttgagca 98581 atctagggag gatatgctag ttttgcgagg ctccaggctg cagacaaggc tggattaagt 98641 gagggcttga aggaggggcg tactcaatga gcccaccttc taaacactgc ccagcaaagc 98701 acaaccttcc cactgcttcc taaatagtgg cttctctgga tgctagctgg cctcttcaga 98761 tgaggaattt ccctcccatt ctgggccctc cccttcagag aacactggag cccaaggtgc 98821 tttcagtaac aagtttcatt gcagctaaaa ataatggagc gacccatatc tcattttcct 98881 gcattctgta acgacaagtg ccccaaaagg ggcagtggct gggaattagc aaaaattcaa 98941 ggcaactgga aaatatttag atagaataaa atatgtgcat tcaaaagaaa gctaagaaaa 99001 ggcaggattc agaatgatgg ttaccttggg tggggaaagg cagggagatg ggacaggcag 99061 gaggaagcca cacaattaag tgtcaatcat tgtcaaggac tctcctttta tgttaggagg 99121 tgagttgggg gcacttattg cattgttatc atgtgcaaat ggatgaatga aagaaaaata 99181 ggaccaaaca tggcccaatg aggagagtgt aacaaagatg ataattactc aaatttggtg 99241 catctgagct tccaaaaatt ggggaaaata tcaaagaaaa 
gtgtaaattc tatgtggacc 99301 acacatccct ccccttcccc tgtttggctc ccattactga gcctttctgg ctttccaaga 99361 agagtcttag ggcccagatt atgcagaaaa tttcctctta aagcagaact caacctgcta 99421 ctttggtgtc ctggacatta aaaaggaaag gaataatttt aaattaaaaa aaattatcct 99481 aagatgtgca ggccaaacgg tgtctgctcc ccctgggacc caaggggttt caccccaggg 99541 gaggctggtg tgccaaggac attggctgtg atttgggaag ggtgagaatc aagagagcca 99601 gagggtcagc ttctctccct tccccacttt gagtgttcct gccctgagaa tcttgtcctg 99661 acttgggcct caggtggcag ggtttcctga acatggctac cccactctag gcccaggaca 99721 aggggacctg tcctactctg ccaacctctg tctctctctc ccactcccat tcccctccct 99781 ggcctttccc taaggaagag atcctatggg acttcccaca ttctcctgtc ttagggtgct 99841 cagcttttcc tgagcagatg ggatggatgc tcttgagggg ctcctctact tcacaaagga 99901 gcaagaggtc aggatcccta ccattccagc cagacagaag ccccggcccc ttgcctccaa 99961 attcccagga aaacaggata cggaagagac ccttatctcc ctgtccttgt tcaagctgat 100021 ccatcattat gtctcatttc atagctcttg aatccaaata aactaccatg gagcatgtta 100081 gtgaggcttc ccatgctgtg gttggtctgc tctggttagt atcccatggg gaaggccaaa 100141 aagagcagct gtttttatag aaatggagag attaggggaa gcaagctcag gctaaagccc 100201 aggaagagat gaggtgaatg tctggaccaa gcctctcctc caagctttat gatccttatg 100261 cacagccctt tgaggtctgc acagtccttt attgtatgtc atctcacctg atagtacttt 100321 gagagttgtc agccccattt tctggatggg gccagtggct caggagtttg cagttcctgc 100381 tttgctggtg tggccctagg aagacaggtg acaggtgcaa gttcacagga ctcataagtg 100441 acagacactg gcctcgagcc catttctgct ggtttctggc ctggcactct cgagccactg 100501 cttccatgcc agcctttcca actgcttccc tcagctggtg gcctagggag gccaccccct 100561 tccactgtgg caaggttgcc ccagggaaag ggaatagaca gggaaagggg tgagggccaa 100621 tgactgcaga acgattggcc tcagaggaga ctttgattca gtctaaggaa gcgcttggta 100681 acaagactta aggaaagaaa ttgtggagag atgacaggga ggaatcttgg ctctgttcat 100741 tcttgctgtg atactcaggc atggccactt accttctctg ggtttcaggg tcccccttga 100801 caaaaggaca ttttattact catcttcctg cctcacagac cagctgggag ttaaatgtga 100861 tatgatttgt tatggaagag cttcctcaat gacctaagct attcactgag agccaaccct 100921 tggctctttg 
tactgtccaa cagaggaaca ttctagctgg tgagagtgct cagcctaggc 100981 cttatgttcc ctccaggaca gtacaattcc acaaatattc actgggcaca tactatgtgc 101041 cagggcactg tgctaggtgc cagggacaga gaggctaagt gatcaacctg gtccctgact 101101 ccttagtgct cagactctgg tctagtgatt ctcagctacg attgcacctt aggatccctg 101161 gagggaagtg gcaaaagcat cagtacccaa gccccattcc aggccaacga atctgaatct 101221 atatattttt caaagctcca aggtgattct gaatgcagga tgagacaaga aagtaagcaa 101281 ttacagtatt ttcttatcct tggtctgctc aggaaagtac agggagacct tggaaaggtg 101341 cctaagtcag attttagtgg tggtggtgga aggagggtta gggaaggcta ctggaagaaa 101401 tgacaagtga ctccatgcaa tgtgatggca aagttggagg ccctgaaggc agttctgctg 101461 gactgactca gaaggtaagg ggaaggcaac atgtttgagg ctggaagagg cagtcagggg 101521 ccagattgca gagggcaggt ttgcccaatt agggagtttg cacagacccc agaggcaatg 101581 gggtacatgg agaggagtta agcaaagaaa tgccatgatc agatgtctgc tggttgctta 101641 ggggacttac gttgttcatc tgggtagcaa aaggctccag caagttgggg tcaggacctc 101701 ttccactacc cccatgcatc tggagagggc agaagctttc cccagggctg ctgggcgcta 101761 ctagtttaat ctgagttcca tggtgcatta gcaatgagct ctcaggatca gccaagtccc 101821 tggtactaat cagccgctga ggcaggatcg atggcaacta cattgatctt gctttttggc 101881 tttggctcct tatagagctt attgactgga caatctaaga gggcggggga tgtgcaccca 101941 agtctcactg aactgtggta gaaaagcgag gagaagggtc cctcacaggt cctgcacctc 102001 actctagtct cacactcatc ctgaatgtgg cccccactgc caaatcctac accctacaca 102061 aaatcatggt gttcaaggct tcagcctcag ctgttgccac tcactctccg tgttcaggtt 102121 tgaacccaca gaaatgcctt tccagctcct ctgcctctag atacgttctt ctctctaaca 102181 aggatgtgac tgactcctac ctcaatttac ccagtctgat ctcttccttc ctccagatcc 102241 agctcaaata tcgcctcctc taggaagcct tctgtggcct aacctcaccc ccaaagcacc 102301 aataaagtct tctgtatgtc cacagcattt gatatgtatc tatatgtatg acactgatac 102361 acttttacaa ctaattactg tgtctgtctc cccagaatgt aagttattta aggacctagc 102421 atacagcaga tgctcaatga agggttctgg agtgaatgag tgaacaaatg aatgaagaaa 102481 tgctttaaaa tatcatacag atttcagatt tctttcttta atcctgaccc cgcacagccc 102541 tctaagggta cagaatagca gagactctgc 
ctaatctaat atggatggca ctggttcaga 102601 tttatacctg aaatattaga actgggagga aacttatgcc ttattgttat tgagttcaat 102661 ccctaatttt atagttgggg ccacaggtcc agggacaata actgactcac tcaaatctca 102721 caaaagcaga tggcaggctg agaaccctcc aggtgctatt ttatgtggac atgaccatga 102781 aagtggggga tagcaaagcc cactgaacac ctccaggtat tcccacccat cactctgact 102841 tcctgtaaca gcctttgtct cttgaggtgc atctggctga acaggtacct attagctggg 102901 ccacaggatg gaggctccca gaagacaaaa gttctttctt acagtgactc actatatgac 102961 aaaagggctt tatttcatat ctacaagcct cagtttccca atcattaaaa tgggggaaaa 103021 gattatctgc cttttctctt ctggtcttag gagaggctac tccttgagca aagcctgcca 103081 ttctttaaaa aatatcttca caggtcatgc catctggtat cgatgagaga aggcaagaga 103141 acatggggtc taggaagtca agctgagaac atagcctgtt tttgtttgtt ggtttacttt 103201 ttattgaaat acaacataca gaaaaatgaa cttatcataa gtgcacagcc catacatttt 103261 tcaaaaactg aaccaatgtg tacataatat gtagatcaag aaatagagcc ttcccgaagc 103321 tgcccttgga ctcccttccc atcactactg cctaaatcca ccatactgat ctcaagcacc 103381 atagattagc tctgtctctt tttgcacttc atagaaacaa aatcttataa ctgctcctcc 103441 tttgtgtata cctccttttg tacaacatca tgcttccata aggttgtgtg tcattgtaga 103501 gtggccattc taattgctag atattattct attgtataat tatattatga tttgcttatc 103561 cattctacta ttgatggaca tttgggtagt ttccagtttg gtgctattac agatagtgtt 103621 aaaatagtct gtttttaata ttttaaacaa tcacttaaaa agacagatac tttaaaatat 103681 ccaggaagaa aataattgta tattcatgtt tccatttaat aaatatttgt taagcaccaa 103741 ctgtgtcctg gaatggtctt ctcccttcct ctcctaagct tagcctttac agaacaggtg 103801 acatcccacc tcttctacaa agcatttttg cttgccccca accgtgacat tctcccaacc 103861 cttaagcctc aaggtaggtc ttgccaagaa cacacagtag accatgagcc caccctgcta 103921 gccttgttcc tgctccttct ctgcttattt tcaggtgggc acttctgaaa agcaggtatc 103981 atctttctct gccttccaca ataccatgca caacactggg cacattatca aactcacaaa 104041 atatacattc aatttaacta gatctcagcc ataagaaggt gatgtaaatg catgaagacc 104101 cccactctct gtctctaccc caaggataat atttttgcaa aacatggaaa agatctatgc 104161 caaacatggg caaaacccac ggacagcgat ggatggagaa atgcattgca 
gtagccttga 104221 tagccaaaat gaattggttc cagagcctga gctctatgtt gtggtttata atcacttctc 104281 tctccactcc ttggtgcaaa taatcagtag acactgaagt tttggataga ggacaaattt 104341 cagcaccatg ggtagatcat cctgttggct gttctcaagg agccacagca gacttaacag 104401 caaggtattt tagtgagaca aactcatctt agttgaattt cagaaatgct catatcctat 104461 tttcactttc ccacagactt ctgctaagct ctaaattcca tataatgagg ttgaaaaagt 104521 ataggtttta aagctagata gacttgaaat tcaaattcct gctgtattgc ttgtcagctc 104581 gactttagga aaattactca atctttatga acctgaattt attttatttg taaatcatgc 104641 ttataatatt tctttatttc agatgttata tttccaagtc taaaatttgc atttggttct 104701 cttctgtggt ttcttttcat ttcttcttct tcttcttctt tttttttttt tttttttttt 104761 tttttttttt ttttgctaac attggtagtt cagtctttac attcattgta gacatacttt 104821 tctttatatc cttgagcata gttataagag cagctttaaa atatttgtct gctaattcca 104881 aaatctgggt cattttaggg ccagtctcaa atgttttcta tgggtaaagt tttcctattt 104941 cttcctatgg ctagtaattt tgaattacat actggataaa gtaaatagta taaagtgttt 105001 agattctgta tattgctata aaaagtattg tttgttgttt gcttgtttaa ttaagcagtt 105061 agtttgacta aattctcact gcaaaactgt cacccctttg gtggatggca cctgaaattt 105121 cagttccatt agccttaatt gggcaccttg gtgtctgtcc tgcatatgca ttggttcagg 105181 ggtaagccag atatttgata gaggttgtac acagagtctg aggttcccca ccctggtttg 105241 cctctttcaa ggtttcctcc tttacttttc agctaccata gttgccttga aatctattcc 105301 ctggctcttc aaatcagtat gacgaggttt cgacccatct aagacaccta gcatgaggag 105361 tactggggtt tgccctccgg ctaaacttat taagaaagtg aaatttactc agtgcccttc 105421 ccttctccca agtgccaata aacacccacg cgaacccttc tggaatctgc ccactttaag 105481 ttactctcct tctgagaacg attttttaaa tattttgttc agagttcaca gctattatct 105541 gtggcaggtg tggtctgtta agagtttctc agctggaatc agaagcaaaa actcccatag 105601 tacctctctg aagagctgtg tggaaggtgg tatgagacaa gtctataaag tactcatcgt 105661 agatgctcaa taaatgttgc ctttccatac tgccatcccc aggcctcctt tttttaagcc 105721 tgaaataatc cttccataaa tagtgttttg ttcttcttga agccctttca gctatattgc 105781 atcatccaac cctcacaaaa ttcctgagga atctgcaggg taaacatcag tccccctctt 105841 gacaagtgag 
aaaaccaagg ttccaaggaa ttatgagact tgaccaaagc cacatagtta 105901 accagtagca atggcagccc cctgcttgct gacatctccc tccacaaaca tctgttggcc 105961 atgaggatga gcaaatagaa cgctctagca aatccaggca ggttggggga gggcttcacc 106021 ctcacacttc cagactcttt gctatagagg aatcctgagg gtacatatgg aatatgcctt 106081 cctgtgtcta gcttcacccc agcagtgaaa gtcctgcaga ctatagggtc cagcactgat 106141 gggtctgaac acaccaatga gtttatgtta agtgcctcga gggtccctgc atttatcaca 106201 aaactgggca gggagcagtg tgttctcaca acactggcac tgccacccag gtccagtaag 106261 ctgtgtgcct tgcagtagta aacacaattt cccaacctgt taaatggcaa aaatatctac 106321 atggccagct ttacatgttt ctagcaggca cggagataat gtgaagatca taggagtact 106381 ttgtaaataa aacagcaagt gactgcaaaa cagtgctgct ctgatctcca gtccctcaga 106441 gagacacagg acccctgtct tgctgcacta caatgatctg tgtccctggg tatgagtgag 106501 cccacacaag acgaggctta gcagaggaag gctaggtggc tgacgatact ggctttaaaa 106561 ctctactgct ttacttctta actgtgcacc ttggggcaag ttacttcact tctctgaacc 106621 ttggtttctt tatcttaaaa atgagaataa taacaataat gatagaaata gtaaataaat 106681 catgaggtta ttatgaggag taaatgtgat tatgtatatt aagtgctttg cacagtgtct 106741 agcatgcagc aagtattcaa taaataatag ctagtaataa aaatggcaag tgttacctgt 106801 gagtgcacag aggttctgag caggctgccc attgccatgg aggcacatgc agctgcctgg 106861 acccttgtgc ctggtgctgc ccatgacctg ctgtgcaagc tctgactctg gggaacttgc 106921 cctctgcctc tgaaccttca ttcctggtct caaaaatgag ggggctggat tagagcctgt 106981 cttccaggtt ccacattctg ctgagctgct tctagacccc cctagacaaa gcaagagatt 107041 tggggaagaa ggagcaattg gtatactcag ctccaggcag ttgcctgcag ggacgggaag 107101 gcttccagac tgtgctgagc ccaggaaatg tggcctggcg ggtgtgggct gcagaaaagc 107161 ccagtgaggg ggatagattt cccacacaga agggagagag ggcaccaccc aaggaacatg 107221 aagcaaactc cagccaggct gaaggacctg agacccccca gagtggaagc tgggatttca 107281 gccacaaatt aaacctgccc ggcctccacc caaagtccct ccaccacctc ctctgatggg 107341 agtaggtttc aattccatct tgaagtgagc tcattcctgg gcctggccca gcaagtcctg 107401 cctagcacac cccattccca tcccctccct ctctctctgc acttctcagt tagatagagg 107461 tggtgagagg aagttgagac cctggctttg 
ccactctcta ttcttgagga aagcacctct 107521 aaacctcagc ttccaaatac atgaacgagg tctaatgatc ctaccctcat aaggtttcac 107581 tgaacaatgt atggaggagt gcatgagctg gacgatggtc cttgggccct ggcgctgatg 107641 ccccctccct gccactcagg gtgcaggggt tgttgcaggc tgtctgagaa tgtcctctca 107701 ctgcagtctt tttgcattcc ttcatttgct caaaaaacag tgaatgggca tctactttgc 107761 ccatggctag atcagagctt ttcaaagttt aatatgattc aaccggagaa agctt // LOCUS AC004022 116552 bp DNA PRI 20-JAN-1998 DEFINITION Human BAC clone GS155M11 from 7q21-q22, complete sequence. ACCESSION AC004022 NID g2795822 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 116552) AUTHORS Hinds,K., Tin-Wollam,A., Becker,M. and Strowmatt,C. TITLE The sequence of H. sapiens BAC clone GS155M11 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 116552) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (20-JAN-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. 
MAPPING INFORMATION: The sequence of this clone was established as part of a mapping and sequencing collaboration between the NHGRI Chromosome 7 Mapping Project (Eric D. Green, Director), John D. McPherson in the Department of Genetics (Washington University), and the Washington University Genome Sequencing Center. For additional information about the map position of this sequence, see http://www.nhgri.nih.gov/DIR/GTB/CHR7 or send mailto:egreen@nhgri.nih.gov SOURCE INFORMATION: This clone is from the first BAC library from Genome Systems, Inc. (http://www.genomesystems.com). Cell line: lymphoblastoid Haplotypes: two VECTOR: pBeloBAC Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of GS155M11; the actual end is at base position 116552 of GS155M11. The orientation of this clone is unknown. This clone contains STS sWSS2562 (NID:g1113513). FEATURES Location/Qualifiers source 1..116552 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="GS155M11" /clone_lib="GSBAC1" /map="7q21-q22" repeat_region 1..989 /rpt_family="L1" repeat_region 1333..1941 /rpt_family="L2" repeat_region 2000..2292 /rpt_family="Alu" repeat_region 2450..2601 /rpt_family="L2" repeat_region 4884..5107 /rpt_family="MIR" repeat_region 5464..5581 /rpt_family="MIR" repeat_region 5753..6177 /rpt_family="MaLR" repeat_region 6302..6386 /rpt_family="Retroviral" misc_feature 7851..7934 /note="match to EST AA256366 (NID:g1891905) zr79h12.r1" misc_feature 8940..9032 /note="match to EST AA256366 (NID:g1891905) zr79h12.r1" misc_feature complement(8940..9032) /note="match to EST H80516 (NID:g1058605) yu76b06.s1" misc_feature complement(8942..9032) /note="match to EST H49508 (NID:g989349) yq20h12.s1" misc_feature complement(9293..9438) /note="match to EST H49508 (NID:g989349) yq20h12.s1" misc_feature 9293..9438 /note="match to EST AA256366 (NID:g1891905) zr79h12.r1" misc_feature complement(9293..9438) /note="match to EST H80516 
(NID:g1058605) yu76b06.s1" repeat_region 10073..10366 /rpt_family="L2" misc_feature complement(11651..11823) /note="match to EST H49508 (NID:g989349) yq20h12.s1" repeat_region 12453..12787 /rpt_family="MER2_type" repeat_region 15031..15133 /rpt_family="L1" repeat_region 15140..15434 /rpt_family="Alu" repeat_region 15690..15983 /rpt_family="Alu" repeat_region 17416..17700 /rpt_family="Alu" repeat_region 18269..19080 /rpt_family="L1" repeat_region 19090..19791 /rpt_family="MER50" repeat_region 20062..20706 /rpt_family="Retroviral" repeat_region 22243..22678 /rpt_family="Retroviral" repeat_region 23388..25206 /rpt_family="Retroviral" repeat_region 25323..25531 /rpt_family="Retroviral" repeat_region 25899..26228 /rpt_family="Retroviral" repeat_region 27005..27146 /rpt_family="Retroviral" repeat_region 27814..27976 /rpt_family="L1" repeat_region 29198..29901 /rpt_family="MER50" repeat_region 29917..30084 /rpt_family="MaLR" repeat_region 30165..30416 /rpt_family="L1" repeat_region 32285..32347 /rpt_family="(GA)n" repeat_region 32335..32625 /rpt_family="Alu" repeat_region 32962..33104 /rpt_family="MIR" repeat_region 33580..33915 /rpt_family="MaLR" repeat_region 34034..34316 /rpt_family="Alu" repeat_region 34317..34358 /rpt_family="MaLR" repeat_region 34525..34758 /rpt_family="L1" repeat_region 35069..35816 /rpt_family="L1" repeat_region 37505..38491 /rpt_family="L1" repeat_region 38496..38791 /rpt_family="Alu" repeat_region 38794..40367 /rpt_family="L1" repeat_region 40358..40794 /rpt_family="L1" repeat_region 40882..40932 /rpt_family="L1" repeat_region 42055..42140 /rpt_family="L2" repeat_region 42479..42661 /rpt_family="MaLR" repeat_region 42721..42906 /rpt_family="L1" repeat_region 43323..43444 /rpt_family="L2" repeat_region 43833..44387 /rpt_family="L2" repeat_region 44441..44845 /rpt_family="MaLR" repeat_region 44936..45386 /rpt_family="MaLR" repeat_region 45390..45603 /rpt_family="L2" repeat_region 45697..45899 /rpt_family="L2" repeat_region 46176..46355 
/rpt_family="L2" repeat_region 46420..46743 /rpt_family="L2" gene 47311..72842 /gene="H_GS115M11.1" CDS join(47311..47384,53393..53463,54997..55052,56296..56464, 60209..60335,63575..63775,65420..65501,69453..69581, 72684..72842) /gene="H_GS115M11.1" /note="match to e235828 (PID:g1262347)" /codon_start=1 /product="aryldialkylphosphatase" /db_xref="PID:g2795823" /translation="MAKLIALTLLGMGLALFRNHQSSYQTRLNALREVQPVELPNCNL VKGIETGSEDLEILPNGLAFISSGLKYPGIKSFNPNSPGKILLMDLNEEDPTVLELGI TGSKFDVSSFNPHGISTFTDEDNAMYLLVVNHPDAKSTVELFKFQEEEKSLLHLKTIR HKLLPNLNDIVAVGPEHFYGTNDHYFLDPYLQSWEMYLGLAWSYVVYYSPSEVRVVAE GFDFANGINISPDGKYVYIAELLAHKIHVYEKHANWTLTPLKSLDFNTLVDNISVDPE TGDLWVGCHPNGMKIFFYDSENPPASEVLRIQNILTEEPKVTQVYAENGTVLQGSTVA SVYKGKLLIGTVFHKALYCEL" repeat_region 47928..48046 /rpt_family="MIR" repeat_region 48614..48700 /rpt_family="MER1_type" repeat_region 50115..50302 /rpt_family="MIR" repeat_region 50474..50887 /rpt_family="MaLR" repeat_region 51165..51298 /rpt_family="MIR" repeat_region 57042..57210 /rpt_family="MER1_type" repeat_region 57414..57708 /rpt_family="Alu" repeat_region 59042..59309 /rpt_family="L2" repeat_region 59429..59590 /rpt_family="Alu" repeat_region 61183..61282 /rpt_family="MIR" repeat_region 62005..62092 /rpt_family="MIR" repeat_region 62618..62720 /rpt_family="MIR" repeat_region 63012..63102 /rpt_family="MIR" repeat_region 63255..63310 /rpt_family="(CA)n" repeat_region 64435..64726 /rpt_family="Alu" repeat_region 66056..66374 /rpt_family="L2" repeat_region 67114..67153 /rpt_family="L1" repeat_region 67156..67452 /rpt_family="Alu" repeat_region 67454..67822 /rpt_family="L1" repeat_region 68106..68228 /rpt_family="L2" repeat_region 69674..69979 /rpt_family="Alu" repeat_region 71923..72358 /rpt_family="Retroviral" repeat_region 73979..74275 /rpt_family="Alu" repeat_region 77354..77505 /rpt_family="Alu" misc_feature 80865..81313 /note="match to EST AA470846 (NID:g2198155) ne21a10.s1" misc_feature complement(83678..83794) /note="match to EST AA215910 
(NID:g1815857)" repeat_region 83800..84094 /rpt_family="Alu" repeat_region 84101..84166 /rpt_family="MIR" repeat_region 88388..88489 /rpt_family="MIR" repeat_region 89239..89320 /rpt_family="MER4-group" repeat_region 89442..89601 /rpt_family="Alu" repeat_region 89648..89705 /rpt_family="Alu" repeat_region 92871..92963 /rpt_family="MIR" repeat_region 93275..93532 /rpt_family="MER1_type" repeat_region 94061..94353 /rpt_family="Alu" repeat_region 95113..95198 /rpt_family="U13" repeat_region 96367..96655 /rpt_family="Alu" repeat_region 96775..96997 /rpt_family="MIR" repeat_region 101570..101762 /rpt_family="MER1_type" repeat_region 105101..105221 /rpt_family="MIR" repeat_region 106378..106476 /rpt_family="MIR" repeat_region 107765..107921 /rpt_family="MIR" repeat_region 108073..108346 /rpt_family="Alu" repeat_region 111094..111388 /rpt_family="Alu" repeat_region 113018..113290 /rpt_family="L1" repeat_region 113309..114133 /rpt_family="L1" repeat_region 114286..115551 /rpt_family="L1" repeat_region 115582..115626 /rpt_family="Alu" BASE COUNT 35288 a 22586 c 22851 g 35827 t ORIGIN 1 aagcttttgt gcttcaaaag acaccatcaa gaaagtgaaa aacaacccaa agaatgagag 61 atagtatttg taaatcatat attagatgag agacttgtat tcagacaata taaataattc 121 ttacaactta acaaaacaca aataatgtaa tttaaaaatg gacaaagtat ctacctagat 181 agacatttct cctaagatac acaaaggcct gtaagcacat gaaagatgtt tatcatcctt 241 agacattagg aaactacaaa tcaaaaccag aatgagatac catttcacac ccactaggat 301 ggtgataatt aaaaagacag acaataacaa atgttggcaa ggatgtagaa aaattggaac 361 cctcatactt tgccagtggg attgtgaaat aatgtaacca ctttgggaag gagttttgca 421 tttcctcaac atgttaaaca tagagttacc acacaatcca gcaactctgc accttggtgt 481 atgctcaaga gaattgaaaa cctatccatg caaaaaccca tatacaaatg ttcatagcag 541 cattactcct aatacaaaag gtggaaataa cccaaatgtc tatcaactga tgcattcata 601 aacaaaatgt gataggtatt tatacaatag aacactattt ggcaataaaa agaaatgaac 661 tactcatgca tgctataata tggatgaagc ttgtaaatat tatgccaact gaaggaattt 721 agtcactaaa agccacacat tatgtgattc catttatatg aagtgtccag aataggtgaa 
781 ttcacagaca gaaaaattgt tggcaggagc caggggagtg gggaaggagg aatgacagct 841 aatgagtagg tgatttcttt ttgggtgatg aaaatgttct ggaactaaat agtggttatg 901 gttgcacaat tctgtgaata tactaaaacc cactgaagtt tacattttaa aagtctgaat 961 tcagtggtgt gtgaattata tctcaaaaat atattataaa aacataagtt atcaatatac 1021 catatatgct ttaaaaacaa tagtttccaa tattattttt gacttgttca cagttcataa 1081 tttcaaacat ttaattttaa aatatttttg ggtctataca gcatgaagaa aattaagaac 1141 agggaaactt cactaacagt aaccacatga aaggaggacc aaattgaagt gctagaattc 1201 catgaaatct tatatacact tttatttgct tagcaagctc tgctttaaaa gataattgaa 1261 tctaacaatt tttcattggt ttatagtata caaactattt tcatatataa tggtgtcttt 1321 acttctcgaa ataaaatcct gggcccaacc ttgacacttc ctctccctgg cccttttgat 1381 caactcatta taatgttttg tttaatgtat ctctccaata tctctccagt ctgaaatatc 1441 tgtccatctc tccccatcct cactgctacc agcagttcaa ggccccagca tgccttccct 1501 gattctgcat gggcccccaa ctgtaccttc cttctcatac cacctcaatt cattgtcata 1561 gtaaggattt ctttttatgc aaatctgatt atgtccctct tttcatcaaa tttttcatat 1621 tgttctgtca ctctgaggat aaaccccaaa tctccagaac ggcctgggtt cctgggtgct 1681 tcccctgtac cttctggcac ctgtatgctc tccagtttca ctggaaacca ttctcctcca 1741 gactctctgg accagccata ctggatttct tttcattcag tgaagtgtct catgcctcag 1801 gatctttaca tctgctgttc cctctgcctg gaatgtccgt ctgccacttc tggctcactc 1861 tttgtcatca ttgaagactc agcttaaata tcatcttctc agggaagcct gcccaccatc 1921 ttgttccaat ttgtgtacct ttctatattt cttctcaatg tacctctttt ttaaaatcat 1981 tcattacggt ggtaactaat ttttttttta tgagatgaag ttttgctcgt ctcccaagct 2041 ggagtgcagt ggcacaatct aagctcactg caacccctgc ctcccgggtt caagcaattc 2101 tcatgcctca gcctcccaag tagctgggat tacaggcatg cgccaccatg cccggctaat 2161 tttttgtatt tttagtagag acagggtttc accatgttgg ccaggctggt ctcgaacttc 2221 tgacctcagg tgatccacct gccttggcct cccaaagtgt taggattaca ggtgtaagcc 2281 accactccca gcctacagta gtaacttttt tttttaaatg ttaaaggata ggaattttct 2341 ttccctctcc tgctcttgct ctatagcaag ttcctaaact taactgttac tcttcaaatg 2401 aatgacactc tatgaagaac gtgtctcccc actctagtat tctttgtttt aatttattta 2461 ataatttttc 
ctccttgttt tcctctctca aatataatca ccataaaagc atattgattt 2521 gtccattgct ggattctgag agtctagaac agtatctgat gcacagtagg tactctatga 2581 acacatatca aatgaataaa taaacgaacc ctataggaca ggcagagcac atagtaatat 2641 tcaactttat agatggatac aatgaagttt agaggaatgc agtgaagtgt gacttcactt 2701 ctgcaattag taaatggctg atccagaact cagacagagg aggtctcact tctaatcctg 2761 tgtcttttac taaaatgaga ttaggactct atttaaataa ttttaaaact gcaactgttt 2821 tgcctcacac tgtacaagac ataagattct gcattggaga tggtataata agtgattaga 2881 tttacctgaa actatttcat tgcatgggat catattagag gtagaggtgg aggtgggttt 2941 ggggcttttt ttctttcttt tttttttttt tgtttgtttt tgtttttgtt tttgagatgg 3001 gacatgttta tacaataaca gtcatataat caggcttcca taagttgagt atattatgat 3061 tcacagctta tcttatttgt ttgtcatctc ccttaattat gtatttaatt cataaggcca 3121 aaatattaca tatagaaata gagatggaat gaggagatgg agaagaaaag tactctcact 3181 aagtatatct tatttggaaa gagggattca gtagatgcac tagaagaaat gaaatcaggc 3241 tagcacaagt gattgttcac tattgtgtct agcaaaagac tgaaggtccg tgttggggct 3301 gaaatccaac ccatgttctg tttgccacgc cctgagcctt ggaacagagc tggatcagac 3361 cagaaggaag atttcctctg tgagcaagca cctttcctta atttgcatca aagtgcccag 3421 tgtgtttcag aggtcctggt tatgaatttt gcatctcagg gtttgggctt gttccatcac 3481 gtgcttaaaa actacatttg ggaaaggaat agggaagatg aatacagatc aagtcctaca 3541 gtgggaggat gggaccagac cctttaaaac tccagagctc atcaaaccta gaactcctga 3601 taagtcagaa gacagcattc tcagaggcca aattactacc ctcaatcttg ttcacagcag 3661 ttcacattcc aattgaaatg ggtaggtgac atttctgagg tgggtactac ctagaattag 3721 ttagtttgtt ttatctcgca ttcatattta catgccatgt ccattgagct cactaaatat 3781 tgttatgttg gcttgtgtca gaaattcatt gtctagctgg gaaatgttgc ttattttaag 3841 gtgtttacac tgccctgaca ttcactgagc ttctgttgca atgtagataa ttattgtaca 3901 cacacaacat ctcagcaaag ttttgtttgc ttgcactcac ttctcttgct gaagggccat 3961 ctggggaaac cacatccctg tttatgcatc attatagttc tgtaggctcc tgttagacag 4021 gttatgcata taagggtgaa gtgactatgg caaaacatct tggctctgcc actttgtggt 4081 gatgaaattg ctgatggatc attcgcctcc tgagcttcaa tttcctcaac taaatctgga 4141 ttctggcatt tgtctgactt 
acctatcatg tagactgtga gggtaaaatg caagatgtaa 4201 gaagaaaact tttggaagtg accaagtttt aaaacaagtc taatattagt tgtgataggg 4261 gtaactttct tggaaattat cccttttttc catgtagaca atactgtgta tctttatgtt 4321 gtgaatcatc cccacatgaa gtccactgtg gagatattta aatttgagga acaacaacgt 4381 tctctggtat acctgaaaac tataaaacat gaacttctca aaaggtatgg aatgaaagta 4441 ttgtgcggtt ctatttactt tgcacctctg ctgtgtgctt tgtatttttt ggttcagcag 4501 gctttcttaa acttttgttt tgaaattgtt ttgatttttt tctaaagctt tctaatgatt 4561 cccatggtga ctcttcccag cgcctcacta gaaattgcca tgttaggtat tgaatatgcc 4621 tatattttcg agcattttga aagcatttta atcctataaa ggtgctcaat taaattccca 4681 gctctaattc cattctcttt accacggcaa ctcaatacct actatcggtc tctgtttaaa 4741 cacctacacc cttttttggt tgtcctagta ggttgtgttc tgtgagcctc acccactggg 4801 tgttataacc aacctccaga tgagccactt ctttgtttaa ttacagcttt aaagtagaat 4861 tttttaaaag tccttgggca cttataaaaa ttacatttgt taagcattga ctatgtgcca 4921 ggcactatgc cagaatactt aatttaactt aattctcaca aagcctttaa taataggcat 4981 gggtaaacta agacttagag aaggtttggg ttagataact tgcccaagtt acacaatttg 5041 taagtggcag aaacagggta taaacccaga tccatctgaa ttcaaagctg agctctcaat 5101 tgcaatgaac atctttctta ctgcattcat taaaataaca ggcttttcag tggagaggct 5161 gcctgtttaa aggctgtagc aaacaataaa gaattcatct tttatcaaat gggcttaaca 5221 ttaaaaaact ggctccaaaa cagtaaattg tttcatggga tagaagaata ggaaaagaaa 5281 agaaagttca tcttctttcc tttgctctct ttctcaacat catgtattaa tttctcccca 5341 cagaatatgg gggaaacgag tgcagagcat agaagtaaga gaagttgaag ttattgaact 5401 ctgctcctaa tgcaccgtat ggggttctga ggccatagcg gagaacagga cttccacagg 5461 gttgctggga aactgtggac aagtcactta attggtgcct caatttcttg agcataatgg 5521 ggattctcat ttgcctgact tacctaacta gtggattgcg aggatcaact gctttagcat 5581 aggatctcac aggaagagtc tcccagaaag tgcagcctga gggcacacct gatatttggc 5641 cacttcatta gggagtacaa tcccaggaag cagacttgag aaaccagtgt gaggcaagga 5701 agggagaagg ttatgggttg aattttgtgt ccccctgccc tccccacccc actccccctg 5761 ccaaaaaata tgttaaaatc ccaaccacca cttcctcaga aggcgacact ttggaaattg 5821 agtcattgca gatgtaatta gtaaagataa 
aatcatgctg aaatagagta cccttaatgc 5881 agtagaactg gtgtccttgc aagaagacag gcagatatac agggagaatg ctatgtgaag 5941 atgaatactg acattggagt gatatatcta caagccaagg aatgcctggg actactggaa 6001 tgccttgctg gaagaggcaa ggaaggatcc ttccctagaa ccttcagaga taacatggcc 6061 ctaccaataa cacggttttg gattcctagc ctccaaaact gcgaggcaat acatttctgg 6121 tgtttaagcc actgagtttg tggttctttg ctgtgccagc cctaggaaac acatgcagtg 6181 agggaggtac agcaaaggcg tcattaagcc aaccgctgct aactgactgc ttgaaatcat 6241 ggacttatcc tccagtaatc agcgtaagct gcatctcagg acaatctggt caaggagaaa 6301 aagggagaag ggtttatcca ccagccccat ctcctattag tcaaaggttt gcctcaagag 6361 gccttcattt tcccacattc ttggttgttg tgcatgtgca agtctagttt tgtggtatcc 6421 tatgactcaa tgtcaacatg aaatctggaa gtgggaggca agaggcccac agtaggcatg 6481 agctgagaca ctgctaggct gcccctctgg gaagtacatc agacctgtgc agaactggtt 6541 gaaatagcag cagctggaac aaggtaggtg aggctatgag gatctaaaga ggagcctgag 6601 ggtacactgg cacaggcatt gataagcact tacaaggggg cataacagag agctacagaa 6661 acaaattgtg aggcagttta acctaattta tgatgtcagg caatactttt tggagtaaag 6721 aatgcttaag ctgagataca aaaggtgagt agccgagtga ccttgggaaa gtcattatat 6781 gtcaaaactt caaagtattt cctgattcac ccaaattttc ctccctaaaa tttaatctaa 6841 aaatgtttaa gcattatttt ttcaagtcac tttttttttc ttttcaaaaa taaaaatgac 6901 tttatttttc ttacggattg taagaataag acttgctcta ttttaaaatg tcaatgatat 6961 tgaagaacgt aagaaagaat atcacccata attccatcta ctcagagaaa atataaaggt 7021 ccagatgaac tgggttatgc tatgataaca ggacagcaaa accctaacat cttggtggct 7081 caaaactgcc aaggtttgtt atgttttttc atgcatgcta tgtatccagt gtaggtgatt 7141 tagggactca gtttcatatt atcctcatta agggccctag gaggaccaca gtctctatca 7201 tcataaaagt tgctcattac tgtggaaaca tgagggaaac atggtgaagc aagcagtggt 7261 tcttaaaatc ttccatccac aagccactca cctcatttca tttcatttca ttggccaaag 7321 caagtcacat ggctgtgcct aactgcaagg ggatggggaa gtgtaattcc tgaaggagga 7381 taaccatagc tatcagtgaa ctgcactaca actgtcacag ataaccatga tcaacattta 7441 caagtcaaca gattttgggg tttgagattt atttgtgaat ttatttcttt tcactgattt 7501 ctgtgaaaga gaagagtaag aaaataacaa atggctaaag 
aaaacccatc agggcaatgc 7561 agaatcatga ttttctgtgg aattccttaa tccatactag ttcatgtatc ctttttcaaa 7621 ttgatgaacg tattagtcta tgtagagatg agtgattttg acaaggataa aggatattta 7681 aatacctcat gggtctcttt ttccacctcc ctttcacttt agtgtgaatg acattgtggt 7741 tcttggacca gaacagttct atgccaccag agaccactat tttaccaact ccctcctgtc 7801 attttttgag atgatcttgg atcttcgctg gacttatgtt cttttctaca gcccaaggga 7861 ggttaaagtg gtggccaaag gattttgtag tgccaatggg atcacagtct cagcagacca 7921 gaagtaagct cttttgtgta tttttctatc ctttgccctt gacattttcg tggactcctt 7981 tccttacgag gaagttactt ccttacttag gcttacgtag tgaatatatg tgtggaggga 8041 tcagttgtgg ccagggttca cactaaatct gtcatcctaa gcaatccatt tgtttccttg 8101 gactttaggt tcttcccagt taaaggaggg atgtggactt aaggatataa ggacctcatt 8161 ccaaattttt acataactct tgaaacacac tttcctaaca cactacaaag gaagataaat 8221 gagcaatacc cctgcccctt atgaaagtac aaagtatttg accacttatg ctaagttcag 8281 ttacacaaaa gctcttagga gagagggaag gctaccttag attgggcata gggattgtat 8341 aagtttcaga cctactgact tacaacagca gaccattcaa tgagaaaacc ctattttaaa 8401 cagcatgaaa ttttatgaga aaacactttt aaaaagtgtt taaaaattgg aggttttttt 8461 ataaacagaa gtaccttgaa tattaacatg tatatttgag ataatgagca aagatgatcc 8521 ctgtagaaaa cacctctcag tgtccttcct cttttaaaat attgtgattg agctttatgg 8581 tcatttgata ctttttttca agcactagtt tttagaagta tatacacttt tggccttgta 8641 agagatccac ttaccttcat tcaatttaag gaaaaagttg tgacttcgaa gtaataagat 8701 tgagcctata tatgtcatct agaaacacca gtttcattat agtcataact catattcaaa 8761 gcctgtttaa agcacttcaa gaatattaaa aatactaccc cattatcttc ttttactgtg 8821 gtttagaaaa ggattgcagt ggtactttca aggagtgtgc atttttatac ctcttctcca 8881 aaataaaagt gagtgttgtc tctcattacc acttgcttaa gtgtaaatgt catttacttc 8941 aaaggtatgt ctatgtagct gatgtagcag ctaagaacat tcacataatg gaaaaacatg 9001 ataactggga tttaactcaa ctgaaggtaa gagtaccctg tctggcccac acaatctgac 9061 ttcttacctt agtcccatac ttgacccagg aatgtgcccc ttgaatgagg aaaacatggc 9121 tatttgagag caatgccagt gagattatgc atgcgtttct aaaatacaga agcaaagccc 9181 cctaaaaaca ccaactttag gtgccagaag aagctgtagg ggttgtgggg 
atgaataggg 9241 aagaaagcga ggaatcattg agacaatgtc ttcactactc cctctttgtt gtaaggtgat 9301 acagttgggc accttagtgg ataacctgac tgtcgatcct gccacaggag acattttggc 9361 aggatgccat cctaatccta tgaagctact gaactataac cctgaggacc ctccaggatc 9421 agaagtaagt tcctatacct ctctgagctc acaggcaaag ctgcaagaac aaatttgttg 9481 gggtgacttg gaagaattta agccagtcac atttggtgtt atagaactaa agaaatttta 9541 cgtatgagaa aagtaaagtt tttgacaggt agggagataa ggttcgttaa ctcttttagg 9601 ccaccattaa cttttcaata actcttgctg ttgggaataa tggaaaaaca taagctcgat 9661 tattcatatt gacccagatt gccatatcgt atttccaatc aggagatagt ctctgagcag 9721 gtaggtcttg cagttagaca aagagagagt tgtatatggt accatcttct ttcacactgt 9781 ccaacatgaa gctaagatta aaggatttaa ataactgtcg cctcaaaata aaatgagttt 9841 ttgcctgtaa gttctaacat atacgatggt gttgaaaatt gtcttatttt caagatatca 9901 tttgtgttat tttaaaatat ctcttctaaa acaatagcat aatccattga attttctaca 9961 ttaaaaacaa cattataaga caactgcatt ttctatggca ttaaggttaa caatttaaaa 10021 atctttagtg aatttatata tagctataaa ttctaggcac acatactcag agatttattc 10081 acaaaaattt attgagtacc tgtaatgtgt cagacattgt acttatgttt aggatatatc 10141 agaaaaaata tacagagatc cctgatctta agaaaggggg acaaacaatg aacaataagc 10201 ataagaaata agtaaattac atagtaactt tgaagatggc aaattttatg ggaaaaaagg 10261 ctcagcagag taaaggggat caggattact atatgcagca ggatgagcag gtcctactac 10321 taaagtcaga gatcagagta ggcctcattg agaaagtaca atttgaaaaa aattaaaatg 10381 tgtgaatcca catttggtga tttgagaaac tgaaattcaa attcaaaccc agtaattggc 10441 ctagtgggta aattaggaca tacagttatt tctacttcca gtattgctgt ctttcagtta 10501 aaaatatgat tatttctttt tttattaatg tattagtcac agcttcttaa aaagaacaca 10561 agaaatgtaa aaagaaacaa aagtccaacc taagcctaga accagaattg ttaccctaaa 10621 ctaggaagca atccttgcct cagatatgta cccctaagag tgatagacaa aactttcctt 10681 taaaagccca accacaatgt ctgtcttaag gaaatgaaat atgatacata ggattattgg 10741 agaattgttt aggatctttt tcgtatctct attttcaagt tgaattattt ttaaaatttt 10801 agataagact ttttgtcaca attactgaca ccctttaatt ttggttgaat ctcatgtttt 10861 gtacatgaaa tcaaagctta agtatttttg tctaacaaac 
ctgagatgaa ggatctccct 10921 taagataaaa aaaagaacaa actctgttct aataagagat atcaatatag actcgtaact 10981 cataaatttc taattgtttt cctttaattc caaaataaaa attttatttt attttgtttg 11041 agtgaacttc atccagccta tttggaagcc tgagtgctta aatatacact gattagcaaa 11101 gttaagtgta ttaatcctgg gatcaataac agtactaggc agtgatgtgt caaatgaaat 11161 tccaaagtag tatccattat ttatgctttc tagcaacttt gcagatgtga aggggctcca 11221 gctgtgatga ctgccatccc atcagttacc atgttgggtt ggctggataa ctgggtgttt 11281 cttcattatt ttgtcttgtt tctctcccca tactgtaatc tcaaaggcaa gattcttccc 11341 acagagggag gaccaccatc cacttgagaa ctaaataaga gcatcttaac tccttctgta 11401 agggatgcct gcactaaaaa aattgtgtaa gtcatccctg aggaatttgg ctaatgtcag 11461 aaagccagtt cactttggtg catagatatc tatgacatta gtactttcat cagaagaaga 11521 acaatttatt gaagatatac agattttagc ttctgatttc tgaaaaatag atttatccta 11581 gaatgtttgg gaaggacatc atgaaacagg tgtttactca cataatatac taataaacga 11641 tcttttctcg acaggtactt cgcatccaga atgttttgtc tgagaagccc agggtgagca 11701 ccgtgtatgc caacaatggc tctgtgcttc agggcacctc tgtggcttct gtgtaccatg 11761 ggaaaattct cataggcacc gtatttcaca aaactctgta ctgtgagctc tagactctag 11821 atagtaaaaa aaaaaaaaaa aagtctacat attttgtaaa agtaaactga taattgtatg 11881 ataagtggca ctgtaagtaa atagcaaaca ccaaccagtg agtgtggctt ttcttatgga 11941 tagaagtaaa ggagcagaca gagattcctt gatagccatc aaattgcaag tcaggttaat 12001 gacagtccaa caagaagcca aactttacgg atctttgtta gcagccccaa tgttctttct 12061 acaataaaac atgtggacta tgtccagtga ggtctccgac tcacctagag gtccttggta 12121 actgggtcca agtttcctcc tacagttttg acgttctggg tctcctacag tccagaccaa 12181 gagaaaagct aatgttcaag tatttacaac ccctgttctc tccccgcagt tgtgctgggc 12241 acagacagtt aattaagtgt accagtctgc agataaagaa ttagcatcat gtgattgccc 12301 aaagccctta gaacttgcat tctcccaggg taccgacaca taggaaattg ggttctgaga 12361 aatctgacag ttgggcctca gaggctacct tagggcaggg agtgcttatc tctaataggc 12421 ccttatttgt attgatgctt gtatctgaaa tacagttggc cctctgtaac tgtgagttct 12481 acatctatgg attcaaccaa ccagagattg aaaatatttg gaaaaaaatg gatggttgca 12541 tccgtactga acatgtacag 
actttttttg gtcattaatc cctaaacaac acagtataac 12601 acatatttac ataatattta gatgatatta ggtattataa gtgatctaga gagtacctac 12661 ataggggaat gaacataggt tatatgcaaa tactatgcca ttttatatca gggacttgag 12721 catccgtgga tgttggtatc catcagagtc caggaatcaa tcctccatgg ttattgaggg 12781 acaactgttt tgctaaccta ctgggctctg aatcttagat gaagccttaa ggcagttatc 12841 tttgactgcc tttatcttac ccttctggtt tattcaataa acagttatta agcaacctcc 12901 agtaaatttg gccctaagga atctgaaatg agtagaatgg ttctggccta atgtcctatc 12961 ttgaagacca accttaaact taatttgatg agaatgaata gaagtgtcct tttttggtcc 13021 caaaccagta caacctttag caaacaaacc attaattaaa gtcagagcac ccaagcacct 13081 cccttccctt tttgtgactt ttattcggga gtggttggtt gatgcttttt ctcttttggg 13141 gagaccattt tatatgcaaa agaacattgc tcagttttat gtaggtttgc tgaaaagttt 13201 tgcagggtaa aattctcctt ttttccagtg taacccattg tgattttgac tataagactt 13261 tgtagttttc attgcttcaa agaggaataa acattcagca gaatgctata gtcagttatt 13321 agattttaaa tttcaaaagc ctagctatca agagaacaca gaataaaaaa gacattaaag 13381 acataattta agaaaataag tagagtcagt tttgaagaat ataatagtct ttgttttagt 13441 ccctagccct catacttact atattataat gtttaccaga ctaggggaga ataaagagag 13501 ataggaaaag acaaagataa tttagatgct ggtaagttcc tgagaattga tcctgctctg 13561 ccttctcctc aggccaagtt agaaaggctg ggggataaaa ctttcctcag gaaatttggg 13621 tgatctgatt gctgagggac taatgtgtcc tttccctggg agggcctggg aaagtcacaa 13681 agccttaggg attctctgtg ctcagcctga tgaaaaaaca gagaagaaat gctagtgtga 13741 tttactgggg atggcatcac acctacactt gaacttggca cagaagcaaa ttcccacgtg 13801 cccagagggg tatgggggtt agcagagaaa gaactgatag acttaaaaag gtcttgctgt 13861 caggaaaaag aaaataagta gtcatctgat tcaaatgaag accgaagttt agacggtgac 13921 atcaaagacc agctgtctcc catgatgctt tagctctccc tgctggattt tagagaaaac 13981 tcccaggatt gggagggggc tttggaattg ccttatatat tgatttgtgc ttccaaacaa 14041 tgtgtgggga gtcaaaaaca aaattacatg aggctgtgga aaaaataagt tctgttacac 14101 ttcatggttt tagctttttc agaattggaa caggatagag tggaaagaac ttagtctttt 14161 gctaatggac atgtgattag tatcttttat tccttgttct gtggctaaga ccaatggact 14221 
attaagttgg ttttctcctg gaaatgccag tggaagatcc agcataatct cttttacact 14281 tcacccctgc tcttgctcca agtaggtcag tctcaacctt cagctactgg gatcccctca 14341 ctcagtgtgt cctctctttg ggagaggcct tggcaaagtg gttgaagctt ggattcctcc 14401 tggagtaggc aatttaccca ttctttccaa aggatagggt tgttgattgt cccattatca 14461 tcctctcttt gccttgtcat ctcccttcaa taagtttttt ctccccttca tagggaggca 14521 caggtttacc aacaggcctg cagcctagac agttatctca accaattttc cctttgcata 14581 agcaccccca atctttatgt gattatcttg agcctcttca ctcacttggc ttgggaagga 14641 gaagaatcgc tcctcacaat aaatgctgca ttttgataat tcattctctt ctcttcccta 14701 gtctttctgc ttaattatca gagacatgaa ttgggaatac aagggtgagg gttagaggta 14761 gatgaactct tccaggagcc acaagcaagc tttggttgtg atatttgact tcaagtattt 14821 tcagcaattt tcagaatgtg gggtacttgg tgctttgttg tttttatctt tgagcccttc 14881 gatgctgatt ctataatcac agaaaaattc tcattctgta ttcctttggc atcaaacagg 14941 ttttagtttc aggtagcaga aaccactcta tctagttttt ggggaaagtg atcaaatgca 15001 ggaactaatg cttcacaaaa tgctggggta tttcctttgc gtttggaaat aagacaaaga 15061 ttctctattt taccatgact gttgtttaac ataggattgg aagtcctgac caatgctata 15121 agataagaaa aagaaatagg ctgggcacag tggctgacac ttgtaatccc agcactttgg 15181 gaggctaaag agggcagatt gcttaagtct aggagttcca caccagtctg ggcaacatgg 15241 caaaatccca tctctactaa aaatacaaaa aattaaccag catggtggca cattcttgta 15301 gtcccagcta cttgggaggc tgaggtgtga ggatcacttg atcccagaag gtcaaggctg 15361 cagtgagctg tgatcacacc actgcactcc agtctgggca acagagagag accctcactc 15421 aaaaaacaaa agaaaaataa attagaactg taagaattgt aagggtagaa atagaactat 15481 tattgtttgt agattacata atcatctact tgaaaataaa acaatagaac caagagatat 15541 accattagga tactcaacag tgttgcctga tacaagctaa ataataacat ttaatagcaa 15601 tgacaattat gaatttgaat catagagaag atagcactca caatagcaag aaaatctata 15661 cagcatctaa gaaataaata tgttaactgg ccgggcacgg tggctcacgc ctgtaatccc 15721 aacactttgg gaggccgagg cgggtggatc acgaggtcag gagatcgaga ccatcctggc 15781 taacacggtg aaaccccgtc tctactaaat atacaaaaaa ttagctgggc gtagtggcag 15841 gcgactgtat tcccagctac tcgggaggct gaggcaggag aatggcggga 
acccaggggg 15901 cagagcttgc agtgagccaa gatcgcgcca ttgcactcca gccggggtga cagagcgaga 15961 ctccgtctca aaaaaaaaaa aaagttaacc aagagtacac aagacctctg tggagaaaat 16021 gttcaaactc taataaaagg tctagaaaat attttgaata tgtagataca tattctaggc 16081 tcctggataa ggtgaactta taaggatata aattctctcc aaattaatca gtaaattcaa 16141 agtctagtag aattctttta aataaatgac ataaacttat ttatatctta aagggacaac 16201 tttgatttct agcaatacaa caggctaaat aattcagaac aaaacagcct aatataaaca 16261 acaaaagata atagagaaaa tatttttgaa agcatcaaat atcacacaag ataatataga 16321 atcagcaaat cataatttaa aggaaagcag aaatccagaa aggtaagcag gattgaattc 16381 ggtcttgtcc tgagggcatc ctaactgatt gatcttgtgc tttgggtttt aatgtctcag 16441 agggcaaggg gaacagaagc caaatcacag ttcatcctaa aatgaggagt ctaataaggg 16501 accaacacct ataaaaccaa aaccctctga ccccaaaggg ctaaatattc ctctaaggtt 16561 gaaataaata taaatctgtt caatatttcg actcccagaa acctaaaaag aaaattgtct 16621 tgggactgag aaaagtaaaa taaatacatt taatattgct gattaattta atattgatga 16681 aataaattca agattgttgt tgatatccat ctgcttatca tcactcagat tttgagccaa 16741 attcccactg attgaacgga aaaaaaaaaa aaactcaagc tgtaaattta gtataaagtc 16801 ataccatatt tgtagtatat ttactcacct ggcagaaaca aacacatatc ctctctggat 16861 tttatcctag atcttaagaa acctaacaaa taattatcta agcaatgagc aacaaatcat 16921 agtaaataaa taaataaata aataaccaaa gaaacagagc aggagaaaca aaagccagca 16981 gaaacaagtg atggcagaaa cagacttgca gagaccccaa atacagaaat gatcaaatag 17041 aggctcaata actatactta cagagtttaa agaaataaaa gacaatttta aacccagggt 17101 acagtaaatt gtataaaaaa attgcaacag atttgaaaaa tataaaattg aactttcaga 17161 aatgaaaatc tagtaataga aatttaaaac ttaatgagta gccaggaaac attttattat 17221 tgaggtatag cagcttaaaa attgttcaca tatacattta taaagggtct cacaatctga 17281 gggaaacatt ttccactttt taatttcaaa gcacacaaaa agtgttactc cattggaaag 17341 gctatgagga tgacttctaa tcttcgggac tcaaatgcct cgggatagtc actcaaagcc 17401 tttttttttt tttttttttt tttttttttt tgagacagga tttctctctg tcacccaggc 17461 tggagtgcaa tggcatgatc atggctcact gcagctttga cctcctgggc ttaagcaatc 17521 cccctacctg agcagctggg actgcaggcg 
tgcaccacca cacccagtta attttttttt 17581 tttaatagat agggtttccc cacattgccc aggctagtct caaacccctg ggttcaagca 17641 atctacttgc ctcagcctcc caaagtgctg ggattacagg catcagccac tgcacctggc 17701 cccacacaaa gcctctaaat acaccctctt aaggtagatg actggtgagg tcatgtaggt 17761 tgcaaataaa gacaggccaa aacatgctca ggctcagata aagttatggg ttatgtgata 17821 catgtggcta attattggaa agggggaaag ggacctgaat tcattcttaa gtttttgagg 17881 tcagtttttc ggaaacagtg tggggttgtg tcaaggacat tagggagcaa aaaaaaaaaa 17941 gaaaagaaaa tgagagacag aaacaaaaac ctttaatggg tgaatttaac aggaaatgta 18001 gcagatctga agagataatt aatgaaatga aaaatagttc atgggagatt atattcagta 18061 taacacggaa aaactgagag aggaaactct aaaaaagaca tggaggctaa aagtgagaag 18121 atctagcata ggacaaattg gatccccaga aggaaaggaa aaaaagaaaa gagttgggta 18181 gagaaaatat tttaatcatg ctgagactta gaaattcata cctagccata tcttggtgaa 18241 actgcagagc atcaaacaca aagactttaa aaggagccaa gggtgggaag aactatattg 18301 ttcaatcaaa atagacttta aaagtaaaag tattaccatg gatagaaggg accacccaga 18361 agatagagtg attttaagtg tacatgttcc ttctaataca accttaaaat atacacagca 18421 aaaatttgct gaactactac aagaaaaata caaacagtcg tactgtcatc atgattgatt 18481 ttaacttggc attcattgtt tattggtaga actagcaaaa tttttttaaa aagattagtg 18541 agaatagaga gtacctaaaa cacacaattt gcaaacttaa cctaatagga ttatatagaa 18601 ccttgtgcca agtaattgca gatacacatt atttcaagct cacaaggagc atctacaaaa 18661 attgactata tacctggctg tcaagcgagt cttgataata gtatatattg tctgactaca 18721 atgcaatcaa gcttagactc aattttttaa aaacctagaa aatttgcatg ttgaaaaatt 18781 atgtaacatt tctaaataat ctgcgagtca aaaaagaaaa agaagtaaaa tggaaataag 18841 aaagtgattt taattgaata ataaaaattt tccaacagat aaaatcttat aggaaacagc 18901 taaaattaaa tttaagtaaa atttatactc ttaaatgagt acaatagaaa agaggaaaaa 18961 ctgatattaa agaggtaata tcatcctttt gttgatagaa gaaaacagca aagtaagacc 19021 aaagaaagtt caagaaggga aatagtaaac atcagagcaa aaactaacat agtagatagc 19081 tctaggattg ttagcagtgg tggatccata tgggtctgca gcaacttgat tcttgcctcc 19141 tcaaaggaaa taatttacct aaggggtata aggtagagtg agagaccaag gcaagttttt 19201 gaacaggaat 
gagagcagga gtgaaaggaa agaaagtata cttggaagag ggccaagcgg 19261 gcaacttaag agatccaagt gcctcatcca acccttcact tggggttttt atacattggc 19321 ctggttctga ggtttgcatc tctccttcct tgatttttcc ttaaggtggg atttccacat 19381 gtgcagtggc ctgccagcac ttgggagggg ctgcatatgc aatgtgttta ttgaagtttt 19441 gtgtgtgctc acttgaggca tttttccctt accagttgag cattcctaga ggaaggtcat 19501 ataccagtta aactccacca ttttgcctct tagtatgcat gctcgggccc actcacccaa 19561 ctcctgagac cttatcggga agctgctgat cttgtttcag gtgttttcta tctgttggga 19621 gcctgcctgt ccctggcatc agctgcaacc aattactatt ttagtgagac agcttaacaa 19681 tcacctggcc atcacctgat catctgatgg ttgcctgaga ttcatggaag cagcggcggg 19741 gtgggtgcct cttttgccct gttcatgtct gccaaactac ctactctaac attcccgacc 19801 caattctttg ggaaaatgga caaaggtcag tcttctgtaa ctgcttcctg ctgacagaag 19861 gacggtgatg attgttctgt gggtcttggc gtcttgctgt cagggcaggg tggctttgtg 19921 ggttggtgaa agtggtatcc agcaaggtcc aagggagaca ggggcaggat ttcgcttctt 19981 tcatgtctca ctgatgggca gtctagtggt cctctgtaga agtgtgactc ttgaatattg 20041 agacgatggt atctctcact gaggatcatc tgaagcttga tggcctgaag gtgtgaggag 20101 acaaatcggg ttgctagatt tagaagacat ggcccaaaac ggagcaaaag taggagacta 20161 acaagtgggc ctaaaagggg aataactcag gagaagcatt ttcagcttcc ttcccaatct 20221 aaccaaccct gagaggcttg ttccagtaaa ctggaggctc aatttaggag ttgtctgata 20281 ttgtactttt cctgattgat tgacccataa gcaaaatttt tcacctaagg ctaaacaagt 20341 ttctctctgt gctgctgtta acttgtctaa ctcctgataa ttttggagga ctatagctgc 20401 caaagagtca atttgttctt gcacagttgt taaggtttta gccgtagcat caatgttctt 20461 ggctatttcc cttgagggct ggctatgggt caaggaggct tttgtgattc tagcaattct 20521 ggttcccata ctggctataa taccaagtcc tgtgagaagg agaattaatt ggatagccct 20581 cctcatcctg gaaaagatgg aatgcctgta gattggtgct ggaagagaga gattgccagg 20641 gactatgaag atgtctaggg atacatagcc tatggtacaa gttccagtcc ggttagtggg 20701 gaggcactgg tgaactggct ggccacaaat atagaaggct ccttgggttt taagataagt 20761 agagatgtaa aaatgaaaca agggggtgag gacagctcca agaaatgcca aggctgccaa 20821 cacacccagg tagctggtgg ctatagtcat gcctgctaag acttatgtgc atgggctttg 
20881 gttctgttta gttcccttgg ttttattttc ctaaaaaaga gaattttgag tctggtccaa 20941 tagaacccat tcttctgtag aaatgagatt ggcaatttgc aggcattggt tgtatcagca 21001 ggccggaacc agaaattttg aatactgcag gagcttgggt gtccctggca aaactgggtg 21061 gatttatcca gcaaagttcc cagaggaaag gtaatatttg aaggtttggg aaatcttgtg 21121 tgatggtaga attagttgtg ggtgtacgtt tggcaagttc gttggttagg atctacagtg 21181 agagtaacat tgctaactgt actggaaaga gccccaacat ccattccatt tttcttattg 21241 gccgtaatgc aaataggggc tagtcccatt aaagtggtat tagaaaagat ggatccaaaa 21301 ttgggaggtt ctttgcagat atcagggaaa tcttgcttta cttttgcaag aaggctattt 21361 gcaggcccaa acatctccca ttagcctccc attgatagaa aatatgcagt tctgccttta 21421 tacttgtcca atctcttggt gaggctgaat aagctctccc tgccgtttta gcagaagagc 21481 tggtacataa ccagcaattg gtggagtaag gggatcctgt gctttggagc agttactgag 21541 ctttttccaa taatgggaaa tcaggatggt agtgtcctgc tagtaaaggg gtgagattga 21601 gaaccaactg gagcaggccc atagcaacct aaggtgactg tgaagagaga aactggtcac 21661 aaagaaagct gccactggaa ggcctagtgg tgtacaagtt agggttaaaa tagtgataat 21721 aatcagagca attgcaagta agatttctaa tattaagatc atcttactct gaagagaaag 21781 gatgttgagg ggcattactt atcttttcac ttaaagagga atcacagatt ttccagtgcc 21841 tctcaagtgt gttctggagc tgagggcatt gcttccagat ctggtgtttt ggagcctttc 21901 catggcttca ctcaggtgtg atgtatccaa ctgacaatct ctgatacttt aatggctcta 21961 agggtgaaga ggccccttcg gactggagtt agttgggaat tcagagttcc atctctccaa 22021 gctttaataa gaactagtga gcctggagga tacagaggct ggtatttttc cccttctgat 22081 tttggctttc tttgtaacca aaattcctgg agagcctgtt ggaaacctgc taaagaagag 22141 acacagtagg tgattttggc agttttttca tctagcatta agtctgaata taggaatggc 22201 ttaccatata aggcctcaaa ggggattaat tgcaagggag tctgaggggc aatatggatc 22261 caaagtgggg ctaagagtaa gaggtccacc catggctgtg ctgttttctg acagagttta 22321 ttgagtatgt gttcgagggt ttctttagtt ctttccactt ttcctgaagg ttgtgatcac 22381 caggcaaagt gaaggtacat ttatgcttac taacggccct actaacctgt tgagttactt 22441 aggaaataaa ggatggacca ttgtcacttt gcaatgacct gggtagctca aaccaggaaa 22501 tgcattcctt taggaggaat ttagctacct cctgtgcctt 
ctcagtcttt gtggagcagg 22561 cttctaccca ttttgtgaag gtatctacac agaccaagag gtacttttac ctgaagcctg 22621 cagggagatg agtgaagtcc atttgccagt cctccccagg gtatgtactt ctcctctgga 22681 ttggacttat taatggaggg ggcttccctc tctgggaatt atttatggta catagggtat 22741 aggcttgaca aacctgttga atggttttgt ttagtccctt cccactgaac acttatttac 22801 aaatttgtcc tagactgtcc tttgcaaggt ggaagaagtt gtaaaaactc ttaatgactt 22861 tccattggga ggttccattt ttggacccat cttttctaat atcactttaa tgggaatagc 22921 ccctatttgc attatggctg agaagaaaaa tggaatagat ataggggctc ttcccagtac 22981 ggtttgcagt gtgactctca ctgtagaccc taaccaacaa acttaccaaa catacaccca 23041 caaccaattc tgccatcaac caagattccc cacacttttc tccatctacc tgccaatttc 23101 ttcatctctc ctccaatttc tccatctacc tcccaatgga aagcttttcc ctgggaggct 23161 ttccattggg aggtagatgg aaaaattctt ccaatctgta ccatccactg gtttcttctt 23221 tatacccatg ttggctggac caatctattt cttccttagt gtattggaga aagggcagtt 23281 cactggggag agagaggaac aagtctccca tgaaggtatc atttgacatg gctgcctctt 23341 cagcattttt gtcagctaac ctgtttcccc atgtagttta atagaagccc ctgtggtgtc 23401 cttcacaatg ctctactgcc acttcctggg gcaaatgagc agcctccagt agctctaaga 23461 cttgagaccc atacttaata ggagcatcct gagctgttag gtaccttctt tccttccaga 23521 tagccatgtg ggcatgaagc accaagaagg catatttgga atctgtatag atggttattc 23581 tttttctttc ccctagtttt agggctctgg ttagtgtgat gaattcggct agttgcactg 23641 aggtccctgg ggacagagct ttagtctcta caacttggtg atgggactct gtagcatacc 23701 ctgcatatct agttccatcc ctgacgaagc tactgccatc tgaaaaccat attttgtctg 23761 gattatctat gggctgatcc tttaagtctt cccagctggc ataaattttg ttaagaatct 23821 cacagcatga atatgttgag ttatcatctc ctggtagtgg cagtaagagg gctgggttaa 23881 tggtggaaca ctgctcaact gttacctgtg ggttttccaa caacaagact tggtacttta 23941 tcaatgtgtt gtcagtgagc cattatggtc cttttaaatt caataaaggg tctatttggt 24001 gggacattag gagccagatt gattgtccca ttgtgatatt gaagtcctcc tcgagaataa 24061 gggctgctgt acccactacc ttgaggcaat gtggccacca ttgtgctgct gggtctaggt 24121 tctttgaaaa ataacctaca gcatgtttga ctggcccaaa ggtttcagtt agaactccta 24181 gtgctatacc ttgccttttc 
agtgatgaaa aactgaaatg gtttggttag tatgggtagc 24241 tcaagggctg gggcttgtga gagggcagtt tttaactacc taaaggcttg ctcctgattc 24301 ttttcccaga gattgtgtcc ctattagtcc cttcttttaa agcctgatat aaggatttaa 24361 taattccacc acattccagt atccaaagtc tgcagtatct tgtgatcccc agaaaagctc 24421 aaaattgctt ttgggtggtt ggtgtagtat tctaaggatt gcctgtatct attctgcaga 24481 aagcttgtgt tctttgaggc tcatgattat ccccaggtat tggattgctt gtcttagtag 24541 ctgtgtcttg gccttggaca ctttgtacct tctgtctgca aggaaattta gatgttggat 24601 tcctaactct cttgttgggg agcagattaa caggtcatcc atgtactgta ggagatgccc 24661 tcccatataa agacttaggt cctgcaaata tctagccaag gcttgcccaa acaaatgggg 24721 actatctctg aaaccttgtg gaagcactgt ccaggtgagc tgttaacttc ttcctttttc 24781 attctccaac tccaatacaa atataaattg ggaggatggg tctaggggga tacagaagaa 24841 atttttgaag tctaagaccg aaaaccactg agcatctgga aatcttctag aattacataa 24901 aggttgggga ccattgggtg tattcggact actgcctcat ttatgactca cagattctgg 24961 actaacctgt attctccgtt agtcttttta acagttaaga taggggtgtt atagggtgag 25021 ttacaggata ttaatccatg tgttagaaat ttttaaatca ggggctgaag tccctctttg 25081 gcctccaact gaagaggaaa ctgccctctg caaggataac tagaaggatc cttgagctga 25141 attgctatag gcatggccat tacagccttt cctggtattc ctggggccca gacctctgga 25201 ttaattggaa ggtctgtggg cagttcatcc ttgtgtgtgg tttccttgag gaataagatt 25261 gcatttggaa aagggggtgt tggccctgga ggaaaggtta actgaactcc tagtctaaac 25321 aaaatgtctc tacccaaaag aggcatagga cattctggca tcactaaaaa tgaatgagaa 25381 aataccattt atccccacag gcagcaccaa gaaggagcaa acctctatat tatgggcacc 25441 ccatttaccc ccattgcctg gcaggatttg aggataattg cccagaaaaa gaggtaagca 25501 ctgagcaggc agctcctttg tccaaaagaa accttacagt catacctgcc gtgtccaaag 25561 cagcccttag cttcatcccc tcaatagtga tgttcaatcc aggagctggc cagaccatag 25621 ggccccttca gctcaaggtc atcaggggtt gggattctgt cccaggggcc ctttggccct 25681 cagagaagtc tcagatccag tggctgagct tgtggcagag gggcaagcca tgcggggctt 25741 tttcccatct gctccactgg ggctgtttgc cttccagtgg cctgacttct cacaccaatg 25801 tcagttatct gggagggttg tctgagggca acctggaggg ggctaatgga cccgtagagc 25861 
agccaatagt tggagctccc tcttttcctt tcacttctcc ttttcctgaa cactttcctc 25921 ttcctcctgg tcctggttgt aaaagactga ggaagctatt ttgagggtgt caggcatagg 25981 ggtgttggga cccagtgcta gtttctggag tttcttccaa atcatctggg gctgcctgag 26041 ttaggaaatg gtcctttagg accagttgcc cttctggtgt ctctgggtct aggttagtat 26101 atctcattag ggcctcttgt agcctcccta ggaaagcaga ggaggtttca agaaggcctt 26161 gatctattaa ggccaattta ttgtagttca caggctttgt tctgctagct ttcattcctt 26221 ctaccagata gaggaatatg tggttccttg tccacatgcc ccactaggta ttgtaatccc 26281 aaaagaggtc cacctgagaa actgttgtgc cccctacccg cccccacccc cagagtaggc 26341 accagggtca gtcatatgca ttgcatttgc aaattattgg gccacttcct taatggagtc 26401 acgttctccc ttagacagag tttgccctag tatgactgag aggtccctcc aggtgagttt 26461 aaatgttaag cctaccttgt ggaacccttt catgtatttg tcagggtctt ccataaactt 26521 acctagttcc attttaattt ggctcaggta ttgcatagtg aatggagttt ggaccctagt 26581 ggctctactg gggctgctaa cctcttgaag cggcataact taagggggtg gtgtccagag 26641 gatggccctg gttagggaag gaaatgagcc agggctggga gacttggata taatagggta 26701 gaaggggagt ttgagctaag ctctgccctt ggtttctctg gggcttgagc cctggataaa 26761 cacaataatg gtctcccaaa ccaaccagag ggggctgcct gactgtggct gctatcattg 26821 ctggatccat tttacaagcc tgacaaaggt cagggttgtt tcttaggact atgaaggcct 26881 gtacatagga gatttttgcc caatttccct gctgccagca aaacaaatcc agtcaataga 26941 tcatattaaa atttatgatc tcattttctg gccaagattc ctggtccctg agcttggatt 27001 caacccaggc tatattacag aaaaatgtta gccttttctt ttttagagtt tgggggtcaa 27061 acttttccca gttcttgagg atgcatctag gggtgtgtcc tgtcatatgg agaggcaatt 27121 acccatctgt gaagagagaa cagaggagaa aaaaagaaaa aaagaatgca ttccctctta 27181 ttcccctatc atcctttcct aaacattgca tcccccaatt cgtccttaca gttccaaaat 27241 aaaccagtct caccaagtac ccctaacctt ggtcccatct catcatgatt acccacttga 27301 gaacaaagga gatactggag taaacagggg gccccttctt catccttggg gtccagaatg 27361 aactggtctt attgggaacc cctaaccttg ctttccttca tccctattct aatggtaatc 27421 tgtgcctata gcttgggacc agccttcatc tctgtcctat gagtactttt gtctgttgtg 27481 cctgaagcct tgggctggct catatctgtc tctatgacct tatggtgacc 
tcacttggag 27541 cattctagca acaaaatgat tatttctttt ctgatattcc tattttgggg ggcataaaat 27601 ttgaataaac tttaatgaaa aaggcaaatt aaatgtaaaa acattgaaat tattcttcag 27661 gctttcaaag aagttgttgg gttttttaaa attatacttt aagttctggg gtacatgtac 27721 agaatgttca ggtttgttac ataggtatac atgtgccatg gtggtttggt gcacccatca 27781 acccatcatc tacattaggt atttctccta atgctatccc tcccctagtc tccaccccct 27841 gacaggcccc ggtgtgtgat gttccccgcc ctgtgtccat gtgttctcat tgttcagctc 27901 ccacttatga gtgagaatat gcagtgtttg attttctgtt cctgtgttag tttgctgaga 27961 atgatgtgct ttaagtagac gggaagactg gttctcagcc aacggctgca aggggttgga 28021 ctttcctccc ttcaaatatg gccttgaagg tccttatacc tattgagaag aatctggaag 28081 tgatcagaag aatggggaaa attttgtatt tggggttcct tatcctcccg gggtaacatg 28141 gagaacaaat gcttcaacag gcaagataat ccctctgcta caggaggaaa tgggaagaag 28201 caagaagaat tcccacggaa agccttcatg tgcttgcaaa accagcagcc cttggattca 28261 aaagggcagc attttttgcc ttcttaacat aaaggaggaa tctccagagg acttggggct 28321 tggaagaata agggtttgca aatggcaaag gaagaatttt ccttcctccc taaggattgc 28381 ttactcaaaa aaaaaaaaaa aaaaaggtgg gggggcagtg gtagtaggtg gggtccttaa 28441 aaagccacag agtgagatcc tatgcagtgg ataaacagca tcaaaagaca ccaaaaaact 28501 tggccctgga gcatagcata aacgaaatgc atatggtaag tcataaggag ctggcagagt 28561 cagggttctg gtaagtgtct gtcctagaaa tgagccaaca gacatccaca aggaagccca 28621 tttctttgct gctatgcaaa cagcaagagc ttcaggcata taaataacaa actgggagtg 28681 tgtgtttaag gcagagtagg aagtcgtgtg gcatgcaaag tgaaagcggg gaaaaggcag 28741 acttgccccc tggtgggtac gcaagaccat ttcagaattc acagagagaa aacagaatag 28801 gtggtgttgg ttttaggaaa aaagccaatt ttagtaggaa aaagagagga aaccccagac 28861 attgcacagt cttaggcttt agccctaccc ctctcatgag cctcctatcc aggagagctg 28921 ttattgtctc aggtctactt ggtgcagtct ccaaggtttt attcacccct gtgagccacc 28981 atcagggcca gctgagagat cagctatggg gaccagagct actgtgactg agaggaacca 29041 ttttgggggt tagttaataa gcaggagaat gaaaggggag aaggaaacca tgtatgggta 29101 tgggggttgg atgcctccag cctaagaagg caaggcatag agatgtctta ccagtagaga 29161 atatatccaa gtcatggtag caaagtattt 
ttgcactggc aaatctgtac agctctacaa 29221 cagctcaatt cttgcctcct caaaggaaag aattcatctt acttgtataa ggcagagtga 29281 gagactgagg caagtttttt agcaggagtg agagtttatt taagttttag agcaggaatg 29341 aaaggaagta aagtacactt ggaagagggt caagtgggtg acttaagaga tccaaatgcc 29401 tcatgtgacc tttgacttgg ggttttaata cattggtatg attctggggt ttgcatttct 29461 tctcccctga ttcttccctt ggggtaggct gtccacatgc acagtggcct gccagcactt 29521 gggaggggcc gcatgtgcag tgtgtttact gaagttgttc acatgctcac ttgaggtgtt 29581 tttcccttac caatcgagtg gcaggttata taccagttaa actctgccat tttacctctt 29641 agtgcgcatg cttggaccca ctcacccaag ccaagatcct gagatcttat caggaagctg 29701 ctgatcacca gcttcaggta gtttctatct attgggagac tggctttctc tggtgccagc 29761 tgcatccaat tattatttta gtgagacagt ttaataacca cctgaccatc acctcatggt 29821 cacctgacat tcctggtggt gggcagtggg agggggaggc tctccttccc tgcacatgtc 29881 tgcctaacta cctactctaa ccccaatacc ggaatcccaa catgaacccc aacatcctca 29941 cttcctggta ttcatgccct tgcatatccc cctacaacat caaatagggc tgacctgttt 30001 aaccaatagg atactctaga aatgatgatg tgtgattgtc aaggtcagat catacaataa 30061 cgctaccatt tccaccttgc tcccagaaag aatgtgttat gataaacatt tctgattatt 30121 tttagctgct aagttccaaa gtaatttgct acgataacca gtacaattaa tgaaatagaa 30181 aatgaacata taatggagtg attcaacaaa ccttaaaagt tcattcttta aaaagattga 30241 taaaaactaa tacccctgtg atgagacaat gaagaaaaat aatggctcaa atatccaatt 30301 ttagaaatga aaactaaaac atgtagatgc tatagacaga gagggaaaga ggagtcaatg 30361 aatcatttta tgccaacaaa tttgaaaatt taatgaaatg gacaaacttc tagaaaaaat 30421 atatcatact ctattaatta gggtaagacg aaattactat aacacataga acagaaatat 30481 agtagcttaa aaatcatgga tgtttatttc tctctcatct aacagttctt gtgagagtta 30541 ttagtagggg agtttctctt gagggggtca ttcagagacc ccactcatca aatccttcag 30601 gaaagagaag gcacactcag aacggctatt gagggaagtt ctttaagggg actattgatg 30661 taaggatggg aaaactaagg gaaattaaca agacacaata caatactcca gtgctagcaa 30721 tgggatagaa gaggtggggt gaggggaagg ccaggagctc ttaccatctc taggtttgaa 30781 ggaataagga gtggaagcag atactggcac aagaagaggg tagacaaagt ggagactctc 30841 ccagtagatg 
ctgtggccat tggtagaagg atgaagccaa cctgtgacag ctcagcacag 30901 agggcatgga gggaatagat atcctgactt aacttattcc accctttgaa ctagaagcca 30961 gaaggcaagg gagctattgg tggagttcat aatgcctcca gggacggagc attgaggggg 31021 ctgcatagag agtgaatctt gagcagtgag tggctctttc atattgctaa ccattcctag 31081 gtgactgcgg ttatctgcat ggttgaatct aggtcactac tgtgtctgtg ttctgtcaaa 31141 gtggaagaag aaagactaga gggaatgtaa agaagtaccc aatgtcttaa ggcccaggac 31201 cagcaatggt actcagcact tcaaattccc ttctaatggt aaaatctagc cacctggcca 31261 tacccaacta attgcaaaaa agaaatgggt aatgtcaaat gactggagag acatgttcaa 31321 ggaaaaagaa aataaatttt agtgagcaat tagcaatctg tgcctgccat ttggccacca 31381 aatagccatc tgcacccttc tctcacacat tgactccctc atccctcacc aagggataaa 31441 accccaaccc ccatccaatt tctgcaccca cttcaaagta cagtatctcc aggtgatgtt 31501 ctgacctcca cgttaggtcc actagggctg catgaggaag gctagaaaac aaaattatta 31561 tccaccaaca catctagatt acaatctaga gtgtgatcag agtaactata ataagcagtc 31621 ttaggaaaag aaaggaagaa gggaatacac ttaatagtca ctgaaaagtt gtttcttttc 31681 ttttcacatt gcatagcaat gagattatta ttccaccaga gaggaagtga gatggtcccc 31741 tgtcctgacc atggtgtact tccccatgtt ctttatgatt tcaaattttg ccctttggaa 31801 attcttccta gtctgttatg ctttaaggcc acctctgaag tgggtcttga aaagaccatg 31861 taattttagt atccatttcc tacaggtata ggtctgagag tctgaggaca ttttcaggcc 31921 atgcttctgg ttgcttttga aatatcaata ccttagaaac ttgataggct tctgggtctg 31981 tttggttcca tgtgccagca atgtcaccca gatttcattt cagagcatgg tgttaaagcc 32041 aactttactt cctggttacc atgacccatt tccctctctc atttaaatgc tgtctctttt 32101 gaggccatca gaatatatgg gcaagaagaa gcaacacctt taatataatc tcccatgact 32161 agttttccta atttggaggc attgggaagg gctgtgagcc ataacaatct tacctcctgg 32221 tgttaaggac atagcaccat atttctagac attctctgta gctttttcct ttatacctgc 32281 tagttctctc tctctctctc tctctctctc tctctctctc tctctctctc tctctctttt 32341 ttttttttga gacagagtct cactctgtca ccctaggtgg agtgcagtgg cgtgatctca 32401 gctcactgca gcctccgcct cttgggttca agcaattctt ctgcctcagc ctcctgagta 32461 gctgggatta caagcacaca ccatcacacc tggataattt ttgtattttt agtagagaca 
32521 aggtttcacc atgttggcca ggctggtctc aaactcctgg catcagatga ccagcccaca 32581 ttggcctccc aaagtactgg gattacaggc atgagccacc atgcctattc tactagttct 32641 catttgaaaa cacaattttt cttttactga cctctgctaa aagcatcaag gaatggccaa 32701 cacacatcaa cattctggtt gctctctggg taaatgatca atctttcaag ttactgccag 32761 agagagtttt acaaaatatt ttgtgatgac agtacaagag tccccatttt tgcagcctgt 32821 gtgtgaaatc tgtttcctca gttctgctct aacgacgtat ttatctaggt tcaagtttct 32881 atatttttaa ggtcatgcta agctgttgta atgaagactc caaatatagt ggcttacagg 32941 acgtagacat tgctggaagg tagtgcatca cagtgggtaa aagcacagat ttgggagtcc 33001 aattttctgg atttgaatcc cagttcagtc acttactaac tgtgcaaatc tggccttctc 33061 tgagcctcag tttaaaaaaa aaattaggta ataatagtta ctactacctt cagctggtaa 33121 taaatcttta aaattgaggt aataatatta cctagcattg gtaagaatat taaaagagtt 33181 aatacatata acatttagag ttaatgtatg ttatatgaca atgccataga agtatctgct 33241 aattttttta gaacatttat attctctttc acataagaga tctcagataa ttattccaaa 33301 tcagtgtgaa aattctgata cttcagggga ctaagagctt tccattgagt tgcttacact 33361 ctatggggca ttgtcattag cttcctgatc aaaattcgct caccttcaca tcagattcca 33421 gccatggaaa gaagaaagag aggatgagaa aacacgtcaa atatctttgg gcccaatttg 33481 gaaatggcac aaactacctc cactgacatt ctattgataa gagaatagtt atacaattat 33541 agctaattgg aaggaagctg gaaaatttag tttagctggt ggtatggttt agctgtgtcc 33601 ccacccaaat ctcatcttga attgtaatcc ctataattcc caagtgtcaa ggaaaggatc 33661 agtgggaggt gattggctca tgggggcagg ttcccccatg ctgttcttgt gatagtgtgt 33721 gagttctcat gagatcttat ggttttataa ggcagttttc cctgctcttg ttcacactct 33781 ctcctgctgc cttgtgaaga aggtgcctgc ttccccctct gccataattg taagtttcct 33841 gaggcttccc cagccatgca gaactgtgag tcaattaaac ctcttccctt tataaattac 33901 ccagtcttgg gtctttcttt cttttctttc ttttctttct tttctttcct ttctttccct 33961 tctttccttt ctttcgcttt cttttccttc cttccttcct ttcattcatt cattcattca 34021 ttcattttga gacagagaca gagtctcact ctgtccccca gactagagtg cactggtgtg 34081 atctcagctc actgcaacct ccgcctcccg ggttcaagca attctcctgc ctcagcctcc 34141 cgagtagctg ggattacagg cgcacaccac cacactgggc 
taatttttgt atatttagta 34201 gagacagggt tttaccactt ttggccaggc tagtcttgaa ctcctgactt cagatgatcc 34261 acccaccttg gcttcccaaa gtgttgggat tacaaacgta agccacggca cccagcctca 34321 ggtatgtctt tatagcagtg tgagaatgga ctaatacagc tgggtagcct cttgccaaga 34381 acggaaggat tttggtaggc agttagtaca ttctactaca gttccttaaa ctgactcgaa 34441 aagagcttgg aatgtattat aattctataa tgattgagga aattaagtta gtgaattagt 34501 ataaaatatc tttatattaa aagaaactaa aaagtcaaaa caatgaaagg tctataaaca 34561 ttgaaatgga taaataaatt atagtagatt catacaatga aatatggaca gcaatttatc 34621 ccacaactag agatacatgc taacaacatg tatgcatttt acaaataatg ttgagctaaa 34681 gaaaacaaaa gcaattcatg tagtatgatt ttgttgtggg ggaattccaa aataaaaatc 34741 aaacaatgtt gtttgaagat gcatacctag gaggttaaac tagagagaac agaaaggtga 34801 tgattatcat aagagttagg acatttgttg cttcagggaa agctgtcaag ggaaggtaca 34861 ggggaggctt ctgggttgtt agtaatatta tattttaacc tggatggtga tttacattgg 34921 tgtttgtttt ataactgtcc tatagaagga tatatgtatt ttatacactc tcaggtatat 34981 atgatgtatg tcacaatttt ctaatggata tgagagagca aagtcagctt caaaaaataa 35041 agggcaaggg taggcctaat aataaattag tttagtattt gtgcaagaaa aaacaaatag 35101 accaattgaa cagaagagag atctcagaaa acaacatatt tatgtatgca gaattaatat 35161 aagataaagt tagcagcacc agtcaatagg gaaaggatgt tttgtttagg agatggcatt 35221 gggagaatgg tctcactcta aggaaacaaa actatatctt atccaacatc tctagataac 35281 tacctaagaa aggtggactc tggatgggtt aaagatctta aggtgaagga taactcttag 35341 ttaatagaag gcaataaagg agagtatctt tatgacctaa aggtagggaa gaacttctga 35401 aacaaaattt aaaaagcatt aaccctaaga ctaaaaaacc tgattgaatt tgaatcatca 35461 aaattaagga tttccactcc ttgaaaaaca ccatggcaaa gataaactga caaggtatta 35521 gtaagtagac tatattagga actcttaaaa ataacgaagt ggggataata accccaagga 35581 ataacagaca aatggtgtga agaagcaatt tacagaagag ggggaaaaat gacacacatg 35641 aaagatgctc agaatcctag taatcagaga cgtgccaatt gaagcagcac ttttaaatta 35701 ctttatacct attagactag caaagaatta aaaggcttga tattgccaag tgttgatgag 35761 gaggcgggga tgtaggggac cttatgcatc attggtggca gagtagatag ttgtacccgt 35821 gacagttaag atccagagct 
caaggaacca caagaactgt aggctccaga gccactcctg 35881 cagccacagg cctccctgca aaacagatgc ttgcaatatt gcctctttcc tcagttattt 35941 ctaaatttaa ttttgctcaa gtcctggcta acttagatac aaaattacat cctatactct 36001 ggcttcaagg cagttgaagg aatatagcct ttagtttcta tctctgcagg gcaggtatgc 36061 acactagaag agtgaggtcc aaccccccag ttggggttca ggaactggta ttttataaaa 36121 actccataca ttattctaat gtatgagccg atatttaaca acacttcacc aaaaggatgc 36181 tggaaaagag cttgaaggtc ctgatttagt gtcctcacta cacatcacag aagtaagacc 36241 taggtgcaca ctaagtatta ttgatgtgtt catgtttcat tgtcacaata cagcagaaat 36301 ttgtctacag caagaaaaca aatcatttac tttttgccaa agttcatttt tattgtgcaa 36361 tgagattttc tagaagagtg attcactaga aatgtaaatg accatattag cacagccaag 36421 ggcaagtcat ccaactgagg gaaaccaaga ggaatgttct taagcagaag cagagaatgt 36481 aatgaaaatt tttaacatat tcagtttcat attctgtcta acagatgtat cagaaaaagc 36541 tcagtccttt agggatgact ttgggaaatg gtgaatgaat aggtatcaca tacttgtgtt 36601 catcaagctc caggctttgc ataccagagg agttctgtgg tttctgacag ctaactggtg 36661 aatcacataa accatagaat gtaattgatt ctatcaagaa gtaaaatcaa atgaagtata 36721 gatggagcat tctgtggact gctatcaatt ttctctaaag aaaaaaagaa ttggcttctg 36781 ccatgaatct gaagattgtc tgtgggtaaa atgcaaatgg aaagtaaatt gttcatgaat 36841 gaattctaga aaagcccact ggtattcata gagagagaaa gtaaatatct atatagaaat 36901 tttgggaata ataaagtcag atgtgcccta ctattaatag gtttaaacat tttgatcatt 36961 tccctatctt tatcttgctt tactcactac tttccaggac ttttcctacc aggcactgtc 37021 aataaatcat gtgcttctga attcctctgt ctcccattct agggaaccag acagacatca 37081 aatatgtatc taagttgccc aagacaggag ccagatactc actttatcag cctttgtggt 37141 acactggaca actgttttcc atatattcca tttctcttct tgctgtgcca ctcttagtgg 37201 gctgagtata catctctgtg ttattaaact caggaatggc catgtgactt agtttggcca 37261 atgaaatact ggtagaagta acacagatca ctttcagata gaaattttaa gagccactgt 37321 agttcactgt gtctcttttc tctttgcccc cagactggta atgttgtaga tatagcttgc 37381 tctgccagac tggaccctgg attagtgata atcagaagca gagccagagc taacttgcaa 37441 tggacaagta gcatgagcaa taaatgttaa acataaacag taaaccattg ctgttaccag 37501 
cctctgttgc agcatgctgc agtagttcat tctttttcat ttgctgtgtg tggtatttca 37561 ttgtatgaat atttcacaat ttatccattc cactattgat agacacttat atttatccaa 37621 gtttttgtac attatgaata aactgcaatg aattttcttg taaaatgtat tttaatgcat 37681 gtatgtaaac atttcttttg actatataga attggaatta ctgattaata aatcacatat 37741 atgtgcagct tttgcaagta ctgtcaagca gttttccaaa gtggttgtag cagcatactc 37801 cttcacgaat aatgttggag agttctagtt gcttcatttc tttgccagca cttaataata 37861 ttagttttcc ttctttaact tttaactatt tggattgtta tgtaagaata tctcattgtg 37921 gttttatttg catttcttag ataactaatg gagttgaaca ctttgtctta tggttgttgg 37981 ccagttggaa tccttgtttg tgcagtgaat atttaacatt ttgttcattt aagaaattag 38041 gttgacatta tttttcttat tgatttgtag gtatgaatcc tttgtgcata catgtattga 38101 aaatatattc tattttttag agtctttttc ttaatcactt ctgttgatga acagaaagtc 38161 tgcaatttta atgaaatcta acatcaatat ttttgtttat gaatgttgct ttgaatatcc 38221 tcttttaaaa atctttatgt accccaaaat tatgaaatat gcttccttgt gatcttataa 38281 aatatttata gttatttttc acatttagat ttttcatcca cccagaatat acttttggtt 38341 gtcctgtgag taggaatcaa gattcatttt ggccatatgt atatccaata tatacagcat 38401 cattttattg aaaagaaaat cttttcccca ctcattgcag tggcacttta tcataaataa 38461 ggtaaccatt tatgtgttgg tctacatctg gacctttttt tttttttttt tgagatggag 38521 tttcactctt gttgcacagg ctggagtgca gtggcacgat ctcggctcag tgcaacctct 38581 gccttctggt ttcaggcgat tctctggcct cagcctcctg agtagctgag attacaggcg 38641 cccaccacca cgcccagcta atttttgtat ttttagtaga gatggggttt caccatgttg 38701 gccaggctgg tctctaactc ctgacctcgt gatccacccg ccttagcttc ccaaagtgct 38761 gggattacag gcgtaagcca ccgcgcctgg cctatatctg gactttttgt tctgttccgt 38821 tggattattt gtcagtcatt caacaaaatc acagcgtttt aattataata gatttataat 38881 aaaccttgat ttttggtatt aaaatccccc aaatttgttc ttcctcaaaa ttttttatgg 38941 atattcttaa ctcttcgtat tttcacttaa atttagagtc agattgccaa tttgtacaaa 39001 acctttctag gatttgtgtt ggaattttat tgagctcctt atagatcaat gcggggagaa 39061 tgtacatcat taaattgcca cctttccaag atctgaagtt ggtatatgtg tctctttaaa 39121 tgtttagttt ctgttagtga agttgtagat tttcgtgtag agattttgaa 
catcttctgt 39181 tacatcctaa gagtttagca gttctgatgc tcttgtagat agtatcttct taaaatttca 39241 ttttctaatt gtttgttact acacaagaac aaatgattct gtatactgac tctatagcta 39301 gctaccttag taaatttatg tttttatttt aaagtttatc tgtagagtct ttcgtatcat 39361 ctacatacca agcacattgt ctgtaaataa tgagtgttct agatcttctt ttccagcctt 39421 gacatctcct ctttcttttt cttaccttta gtttaatatt gggcataagg gataatcaca 39481 aatgtctttg ttcctttctc aatcaccagg gaaagctttc aaattctcat cgagtgtgaa 39541 aattgctgta actttttctc tttggcaaaa tattcttcac ctgattaaga aatttgcatt 39601 ctctttctag tttactaaaa atgtttttaa ttgtaaatga gtgttggatt tttaaaatac 39661 tttttctgca tctattgaga tcatcatttt tcccctttat tctgttaatt atatgttgat 39721 ttaaaacttc aagccaaact cccatatggt gtataaacct tacttgatta tgatattatt 39781 atgtttttat aattgtggac ttacttgact aatattttaa gatattaatg tccatgttca 39841 taataaagat tggcctgtaa ttttaatttc ccttttttgt aattaattgt caggttttag 39901 tatcaaggtt atattcacct ggtgaaagac ttgagagtta tttcttcttt ttttattctc 39961 caaaagttag tgtaaaattt agcatttttt cttccttaag tgtttggaat aattcactta 40021 tgaagctacc tgggactaaa gatttcttta agggggattt taaagtatag attacattta 40081 tttaaagaat atagaaatat tacaattaaa tgtttctgct tgtgtatgct ttgataaatt 40141 tagcaattca ttcttttcat ttaaatttgc agatttagtg gaataaagta gttcataata 40201 tccttttgtt acatatttag gatctgcggt aatatcctct ttatttctca catttagaat 40261 ttgttttctt tctcattttc tttctgtcag tcttaccaca gttttatatc tttcattagt 40321 cttttcaata atcaactttt gaatgttttg tcctaagttt ttattgtagt aaatttcaca 40381 tagtataaaa ttcatcattt aaccattcaa tgacatacaa ttcaatggca ttaagtatgt 40441 tcacaatatt gttcaaccat caccatgatc ttgcgccaga atatttttca ccatctcaga 40501 aggaaaccta tgtgcattaa gtagttattc cccatcctcc tctcccctgc agttcctgac 40561 aactactaat ctgctttctg tctttatgga tttgcctttt ctggctattt catgtaaatg 40621 gaatcctata atatgctgcc ttttgtgttt ggcttctttc acttagcata atgatttcaa 40681 ggtgcatcta tgttgtggca tgtattagga ctttattctt tttttataac tgaatgacac 40741 ttcatacaat ggatagtctc tcttcaagtt tgttttttat ttcattgatt gctgatttta 40801 tcttcatttc catccttctc ctttctttgg 
atttttaagc tttttgagat gtgtgcttag 40861 acaattgaag tgaagtgaca tagttctgtt agctttccct tttcgtactt tcagtttctg 40921 tttttaggtg cacgtcagtt ttggtttgcc catatctaga catacaaact tagggctcct 40981 acacttccta attggatgac tttgtatgaa tggggataat cgtattgatc ctatctcctt 41041 gtccagggtc ttgtgagaga atgtaaggta aaagcatttt gtaaactaac ttccagactt 41101 ggatagtacc ttgtactttg cactgccatt gctttatatc tgcatagtgc tttcacaggt 41161 ttttaaaaat ttaatacagc aaacccacag gaattatctc cattttacag atgttatttc 41221 tgctactata gaattttcag gaagtgacca tgaatttagt atttaaagcc cagggttttt 41281 gggttttttt ttcttccagc tttactatag ctgcctcatg cagaatggta ttattggcat 41341 tcttagaaca ttggaatttc acactttcag aaacttgaga acattgcaac aactgccaac 41401 agggctcagc atcctctgca gatgaacaac accaagtggt atagttaccc aatcaatcct 41461 tttgcacttc tacaaaatta agcttggcaa acttggctct ggctttctac caattcattg 41521 acagttttta ctgactactc cgtgaattag gctttttgtg ggaaaataaa agaatgggtt 41581 gatgtgattg ttaccaaata tatttaggaa gaagtatttg aacatgtgca aataaatact 41641 tagcacatga ggatcaaggg aggaaaggaa agggattcta atagtcaagg tgagatctga 41701 gtttttaaag aagtttgctt tttaaatgtt tggttataca tttctggtga taaaaaaatt 41761 ctaagtttaa aacaccttca taacttctca taacatttaa aatcttcaca tagcatggac 41821 tcatactctt gtctgcacat tttaacatgt gtgaaatagg aaattattgt ataattaata 41881 gtttcttaga attgtgttgt agattaatta tagtttttct tttttagtag tacataaaat 41941 aaaagaataa tagatagtta tgtatttaac actgatggca ttacagtaaa taaaatatgg 42001 aacattaaag tattccagag gattctgtaa taacttttaa acattcattc attcattcat 42061 gtattcaaca atatttattg agcacctaga ttttgttagg tgctgtacca ggcgttagag 42121 ctataacaac aagcaaggca tttttataca ggcttctggt ctagtgagaa aaacagtacc 42181 gtgataaagt attaacaata cagagtggca ggaataataa agaagcagca tggtgcaatg 42241 agacatatga gaatggtatc caagatttag acatcaggaa aaactagtag aaataaattg 42301 caaacaggtg cctgaaggag caggtaggat ttgatcatgt taagaatggg gactagtatt 42361 cataccctcg tgtaattcct tcaccttgag agaacagaat gaatacaaca tggaaaaaat 42421 gatatgtcac tttgaaaatt aggttacaaa gagatggtga cttccatctg gcttgttaaa 42481 atgagatcac 
agtcctggcc aatgcctggg ctacgacctt gggaaagacc ctgaggcaga 42541 ggcaccaggt aagttgtagt catactcctg gcccacagaa agtgagaggt gataagtgtt 42601 tgttgttata agctgacaca tttcaggctt gtttgttaca tgatggtaga taactaatac 42661 atctgtaggg aagtggggtg gacgtaggta tgagagggat tgcaaatagg catgaagaaa 42721 tttggaggca aaggatatat ttattacctt gatgcctgtg atgttttcat gggtgaatat 42781 atatgccaaa acttattgat ttatatataa tgacatatat tgaattataa acacatataa 42841 ttatatttat atattataaa tgtgtgcagt ttattgtata taaattatat ctcaataaag 42901 acgtttgtat tttaaaatgg ggaataaaaa gcttatggaa agacagtgtt tcaggcaaaa 42961 gagcaaatga aaacaaaagc ccagagcaag agctccagaa aaagcagacc atttcaaagt 43021 gtgcattgac atgggaaacc tttgctcttg gatttgaaag agataagtag gttaagggaa 43081 ggaaggagga caagattggt tgtgaaaaga ggtaaaagag aacaaagaga acaaatagtc 43141 agtgacttca aacaaatacc atggattttt gtgcaagaaa gaccacttgg gttctttaag 43201 gcccatagga caactgggtc agaaaaggag agagtggcat ctctagtcta ctcccaaacc 43261 atctttcaat ccctgctcct cctgcctcct ttatccccaa ccccttgcca ttcatttgta 43321 ctttccttcc ctttacaaat atttgagcat ttactatgtg ccagaaactg ttctaggagt 43381 tagagagaag gtagttagca agttaaggtc tgtgacctct aaaagtttac atgctagtag 43441 gggaaggcag gtggcattaa gtaggaagat gatttcctat ggagtgctgt gaagacagtg 43501 gtgatagagt ccttaacacc aggtatggca cagcctcaat cagtcagttc agtcccagct 43561 gaacaaaaca ctatttggga gaagaatcct agagacccta ctgtgccagg aaataatacc 43621 ttggttgcta gatcatggta aagtgaggga ggattccagg gaagagagaa tggtcagtat 43681 tgtggaggac ctcttggagg gtctgaggaa gtggttattt aacaaccact gtagaaatat 43741 gtcattattc taataaccag taattctgta ctggtgctaa ccatttgata tcagcactga 43801 ggtcttagta actggtgcag agagagggtt caggagaaga gggtgtggag gaagtctact 43861 ctagatggta atattgggaa aggcctctca gaagaagtga catttaagct atgacctgaa 43921 tgaagagaaa gagaaatgca aggggtagtt gggaagttaa gtcaggggga tgggaagagg 43981 attctagcca aagaatagca aagacaatga tggtgaagtg caataaactt ggtgtgtgta 44041 ccgaatggta caaaagctaa tgttgctgca acgttatgct gagtgaatta taagaaatta 44101 ggtcagatga taagcacagc ccagatcttg tggggccttt caggccacaa taagaaaata 
44161 aaatgagaag ccacaagagg gaggatttta agcatggaaa tgatataatc tgatttacat 44221 ttttaaagga ttacgtgggc cttggtaaga agaatggatt tgtggggagg gtgattaggg 44281 agagaacaat ggagaatctg ttctagcctg gagatgatag cagcctgaac cagagtaatg 44341 gcagtggaaa tggggaaaat tgggttgatt caagattatt tgaaagttgt agtggtttca 44401 aagagctatc tatacattct ttgatatgtg ggctaggcat ttttgcactg ctataaacac 44461 ctgaggctgg gtaatttata taaaaaaagt tttaattggt tcatggttct gtaggctgta 44521 caggaagcat ggtgtcaaca tctgctcagg ttctggtgag gacctcagga agcttaaaat 44581 catgggggaa ggtgatgggc atcactagca tgtcacatgg caagagaggg agtgggggtg 44641 gggggaggtg ctatagtctt ttaaacaacc aaatctcacg tgaactcaga gcaagagaac 44701 tcactcatta ccatgaggat ggcaccaaga caattcagag ggatctgcca ccatgaccca 44761 acacctccta ccctgtccca cctccaccat tggggatcac atttcagcat gagatttgga 44821 ggggacaaac atccaaacca tatcaatatg cctcaaggca tatgaagggg agcctcaaaa 44881 ggtgaagttt aattcttctc accttgaatg taaactggag ttagcagttt gcatctaaca 44941 aacagaatat gacagaagtt aggggatgtc atttcagaag ttaggttaat aaaagattgt 45001 ggcatccatt ttggaggcat gctatctcac tctttcagat tgctcattct agtgaaagtc 45061 agctaccttg ttctgttgta gttctgtaga gaggcccacg tatcaaggga ctgaagtatc 45121 aagggactga agtctctaaa gagctgtgtg agtaatcttg gaggctgatc tcacctcctt 45181 cacccactcc agttgagcct tcagataaga tggcaattca cactgacagc tcagctgcta 45241 actcatgaga gaccttgagc cagaggtatc tagctgtgcc atgcctgaat tcctgaccca 45301 cagaaatgat gagataataa atgtttgttg tttttaatgg tgagacttta gtataatttg 45361 ttatgcagca tacgataact aatacaaaat agagcctata ggactgactg atctgtttgg 45421 tgcaagaggg tgtgaggaaa agagatggat gaaagaagat atgtagattt tggacttgag 45481 caactggatg gtagtgatgc catttattga agtgtgagtt tgggcagggg ggattaagag 45541 ttttcctttg gacacatttt gaaaagactg ttaaacaccc cagtgggcat gtcaaccagg 45601 tagcctgctg ggccctgccc aaattcctta tgcactgttt atgagcataa gaaaatgatt 45661 gtttggttta gcagctaacc ttgtgaaata tctccaatct tccattctca ccttaaccca 45721 ttctaatatg gtttgtgttc ttttgactcg tgtagaattt gttcttatta aaccacctat 45781 tgtatccatg ttatctaatt cagtagttac ttctctgcca 
taagcatact caactcatag 45841 ccacattgga cacagatcat cattccccct cttttcgaat actcttctct tttggattca 45901 ttggttctac ttcttaggta gattcttctg ctccacctgg tgttaaccag ggtggcttag 45961 ccagtggtat tcagttggtg gccaggctgt ggtggctggg cttgtctaat agcccacttc 46021 attcatttct gcttccttgg tgccgatggt taagaggttg gcttcagctg accactttcc 46081 ctctccatat gttcatggcc tttcatgtga atgctccagt ggagtggtca ggttttttac 46141 atagtagctc aaggcttaag agcaagtgtt cagaaggagc agagagagag ggcagtagtt 46201 acaatgtgag gccaaagaag cttcccccca gaaagctaaa ggtgataagt aaagcatgtt 46261 ggtattggct ggcaatattc cacaagagat gaaaggacag atattgcaga agagagaagg 46321 tataactggg accaaaagcc ttgagaagga aagagacatg gagcaaatca ttcacagtaa 46381 cagcagacag cagagaagag agacatggtt gtacagaggc acctcctttg ggtctttact 46441 caaatgcccc attatcagtg agaacttctc tgactgctat tcttcagcag agggtattcc 46501 ttatcccctt tcttgcttta tgtgttttct ccataacata tgtgcatatc cataacacac 46561 acatgcatca cctagagcat tatatatgcc acagtgacat gttttgctga tttctcaatt 46621 gactcccccc attggaatga acgtaagctt gaggaagacg ttttgtcctg ttctgtagca 46681 tctagaacag cgcctggcac atagtaggta ctcaataaat gccagctgca tgaggaaatg 46741 aatgagctgt gtgggggatg tacttgagtg aactctaaag tcagagtggt gttgagagaa 46801 aaatgcttga aatccagatg ttggaaggtg acacagagta gtagcctggt gagaacagtt 46861 agatcttagg ggttcctact acagccctcc cttccgcacc tttttggctg tcaccatgat 46921 caagctactg aatctctctg agacgcaagg accgggatgg cacaaagtga gtgctcacca 46981 aagcttgact gtcctttccc atggcaattt acttcagctt gtttgatttc ccctccccga 47041 ctggactagg cacctattct ctgtcttctc tctttacagt tggaaggagc aaaatgggac 47101 ttttggctga aagtgctgag ctcctgcggt gggggctgac cgcaagccac gccttctgtg 47161 cacctggtcg gcccagctag ctgccgaccc ggcggggagg ggcggggcgg gccaatcggc 47221 gctgccccag cagggctgcg gctgcaggca ggcagagcct cctagcccgt cggtgtctgc 47281 gcccatcgat ccctttgtct atccccgacc atggcgaagc tgattgcgct caccctcttg 47341 gggatgggac tggcactctt caggaaccac cagtcttctt accagtaagt tgggttgtgg 47401 tgagggctgc agccttcgtc ctcctcactc cctgccagct ccacgagccc ccagaaagtt 47461 gggtccaggc tgttaacaga 
acatttacaa gcaggtgcta gatgcacact ctgcgggggt 47521 ttcataaatg tttgtccagc cgtcatgtgt cacttgaaaa caaactcttg ctcatggcct 47581 gagacatggg ataaggaggc aattgccagg ggtgtgagtg cccaggttgt cagcatacac 47641 gaattaattt ttcatgaaga tttgtgtgcc ctggctacag attcacttct caacctggaa 47701 agaaacaaac taacctggct gggggttctc ctgatatttt gtgtattaga gaaaggagcc 47761 cctgtcgctt ctgcgccttg actgcctggc catggcaaac cttccaggct tgaagattct 47821 caaattgagg ctactgaggg ccacagcaaa actgcgtgtg gacccgagca ggtggagcta 47881 acgtgtgctg tgatggttcg cttcagtggg ggcaggcggg gaagagggaa ccctggagcc 47941 agcatctgat gtcctggggt ttagctctgc cccatgagta tgcgatttgg gcaaatcact 48001 ttacatctct aaccttagct gcttcagtaa actggaaata gtaatacaca ctgcctgcga 48061 cagagtagct attcaataaa tgttagttcc ctgcttccct ttaaagtatt taataaaggg 48121 ttaagtgcta tacaaggggg gtggggatat tttaagaatg tatcaaaagt ttcataactt 48181 ttaacaggag ttcaaagtct acagatcatc ctagtctcta aggcttagtc aacaatgcct 48241 tatttacttt aaagtaaaac ctaataaaaa ttaaatataa aaaaattata tgagcttatg 48301 gactaattag aagcttacat tcagaatggg tttttactat ttattgtgga tgtttttcta 48361 atttttttaa gttacagagt gcagaaaaaa agtaagaagc agttggtatt cttggctttt 48421 aaatcatgta aaaattacag tttgtgagtt tttattaatc acagtgctgg tggctgccaa 48481 aggagattat aggtatagat aaaaatatat agataaagga tgggttgctg agcctcctgg 48541 agtgacccag acagaggaag gaagggaaat tttacccact gttggaaacc cctaaaacct 48601 agctactcaa ctgggtctgg ggaccagatg cctgagcatc acctaggggc tccctgtaca 48661 ggcagaatct caagccctac cccacactat tgaatcagaa ccagcgtcat ggaacaaact 48721 ccctggggga ttagagatca tgaatctgcc ccagggagag aaggcaagag atgatatcat 48781 gaggcctgca tcaggagcaa gccaaatccc caggatagat gcatcctttt ttcttcagac 48841 ttggagtgta tgtgtgtgcg gctcacagga agctatatat gcctgggctg gagggaactt 48901 aaggaagcac acaagggatg cctcaagaga tttctgagtt tcccaggtgg gagcagtaca 48961 gagaaaagaa ctgagagagt tcccctagac cacgagtctc ccacccacga gtctcccacc 49021 tttgtagtcg gctgtatgtg ccgcacaagt ccagggaatg ccttggatga tggtagaaac 49081 agctcccctc ccctacccca acttgggcta tggaatgttg cagcatctgt atggcaactt 49141 
ccttgtccct acattgctaa ttgcaatacg tgagtccact gagtttccct tctcagaagg 49201 aaaccacaaa gctaacgttg ggcctctcga tgcatttctg ccattccatg cttactgtag 49261 gcaacacaca gttgtgggaa gagagtagtt gtggattcct gatgacctgg gttagatatc 49321 atttctgcaa ctttcaagtt attttaccac tttattccag aaaagttcca gccagtttct 49381 ccacccgaaa attaggggat aataatacat attacagcat atatatatat atatacacac 49441 acacacacac acacatacat atatttatat ttatataaaa catataaata aaactaagta 49501 tatatatata tataaaaatc aggcatatag cagatgtcct gtaaacgtta gcttttatag 49561 ctgtcatatg tgatctcaaa aatatagagt gatacttgaa tatgtattat aatcatactt 49621 cattgaaagt ctgagataag gacaaataat cctccaaccc tccagtctcc cctatgaatg 49681 caaccaacac catcttctca gtctgatgta cttgcctagt atctccacaa aacacttcag 49741 tggacatctg cagagtacct tctctttgca aacgttgtgt ttctgctgga ggttatagga 49801 atgaataggt tgcagtattc actcaccaat tttaaagtct actgggagca aaagaacaaa 49861 aactgacagc tgtgaaactg acaactatga caactatggt aagaaagttg acagatgtat 49921 atgtgaaggt tcctgagggt aaactaagga gttattaatc ccaatgcaga agactggaga 49981 agctacaaag catcatgagt aggggtcttt ccagtgggta atgggctaaa gagcattcta 50041 ggtgaaggga aaagtgtgag caaataaaaa gaggtatgaa agaaatacag ggaagagtcc 50101 ttagtatcat gacacttgat ttcaaatccc aactctaccc tcttaaataa gcaaccacat 50161 gcaatttact cagcctctca gtgcctccat ttccttacct acaaaacagg gtaaatattc 50221 atctttttct caaaggtggc taaaaagatt aaatgaggaa gtatgtgtga gagcccttat 50281 tggagcttgg cacatagtag gccatgaaaa gcaatttgaa tgaatgaggc gtgtgtagta 50341 gcagaaaata gactgattag agtgaatttg gccccaagaa gagtttgaga atcgatacgg 50401 atttcttaat gcctaggcaa agatctctgc ttttttctgc gagcagccag catgtatttt 50461 caccgaatac ttctgccttg gtttggatgt tcgtcctctc caaacctcgt atttgatctc 50521 taatgttgga ggtggggccc aatgggaggt gtttgggtca tgaagatgga tccctcatga 50581 atagattaat gacctccttg ggagtgagtg aattctcact ctattcattt cgagagagct 50641 ggttgattga aaagagcctg gtacctctcc cctctcttac attctctctc atcccatgat 50701 ctctgtgagt actggcttct cttcatcttc tgcaatgagt ggaagcagcc agagcccctc 50761 accagatgca gatacccaat cctgaacttt ctaaccatcc agaatcataa 
gccaaataaa 50821 ccttttttct ttatcagttg cttagcctca gatgttcctt tagagtgaca caaagcagac 50881 tcagacaagt tcacagcaaa ggctgtatgg tttctgcgca gagttcagag tctagtgaat 50941 cagagatata tctcagacat tgacccttat ctcttaggcc cagaacattt ggacttaagc 51001 ctttgacaag atatttccta ataaattgaa aatgaggtag aaaaaatatg agttggatga 51061 taataacaca ggtgggtcaa ttttcaggtg catcttccag cttttaagca gtgttctctg 51121 tagattttgc ctggtcagag agctaaatgt gagctttgtg ttcatacttg aattccaatc 51181 cctgctcagc cactttataa ctctgtggcc ttagaaagcc acttaatctt ggtcaactat 51241 aacttaggga aaataacacc tcacggaggt gtttgaggat taaacgcggg catatacggt 51301 acctgaacta tagtagtaat gattatcctg ttgagatatg acatgtttgg gccacaaatt 51361 ggaacccaaa tcaacaagtg ctgacttaat gcccccttgc ccaccctggg ctccctttaa 51421 ctgtgattat tctatgtttt ccaggctaga gatcatcttc tatgagagag aagcctgaat 51481 tagaatagga gctgaccagt tctgcctttc tccatctttt gtgaacccat atgtttagtg 51541 acccttgtga gtcaggcagc atgagtattc ttcccattgg ttttgagtct aagaaaagtg 51601 aatttgacta tccttggtgt ttttcacaag tcttggccca ttcagtgatt ttatgtcctt 51661 gaatctctcc ttccatgcca gtctttgtgt tgtcctcggt tatgaccatc acccttaaca 51721 tcttgtacct gccctttggt tccttgaggg catgctacaa atattcacta cactttccct 51781 agaccagagt tttttcaaat ggtgggtaac aaagtcaatt tagtaaataa ttaccaacat 51841 ttaaaaataa ataaaacaga attgaacaga aaagttatga atgtatgaca cagattagga 51901 tgagtattgt tttgtgaagc ttgattcaga tgtgtatgtg tatgtgtgtg catgtgctgg 51961 gtgttcgcac catataaaat acttttccta ctgtgagtct ctgtctgaaa agtttgtaag 52021 ccactgcttt atcacttccc cttttatttc cttattgtca atatttgcaa ttataatgtc 52081 agattttcat tttgaaaata cctttccctt tcttaggtta tgggacagat tccaactttc 52141 ctttgaagcc tgctttctaa gttcacatct atcaatcagg aaaccacagt ctcctcacag 52201 tccaagcctt catcatctat tcattcatct ctgtatttct agtacctaga ctggtggctc 52261 tttactctgg ttatacgttg ggacaaactg tggcactttt acaaagcaga ttattaagag 52321 gcactccttg cctcttacgt cagatttttt tagacagcga aaatgatagc cagtggcctg 52381 atggtaggaa tgtactcatt tttttctgaa tgaataaatg aaagatcaat cacccccaaa 52441 ttcctatcat tttgtaccac gtcggtaaga 
attttatcta gcgtgtctgt tccgcctgta 52501 gcccatccgc ctcctgtgag ttcaagtcat caggaggaca agtaaatttc ttagcctatc 52561 tgaagccaaa taagatttct agggatattt ggataattgt cactacttgc tatgacaata 52621 ctgtatcact agttctctaa aaaaaaaaaa aaaagtttat tgtttaaatc tcttggttgt 52681 gctaacgtcg tctgtttctt tcaaatgcag catcagtagc ttcgctctgc tcaggcagcc 52741 ggctggcact tccttctcac atctgttcat ttttcctttg tcctttactc agacattttc 52801 tatggtgttt atgcactcat atctaataag tcgctcctgt gtatgtgtgt gagtgcacct 52861 gtgtgtgtga aatgcctttt tacatgcagg ccttgtgtat gattttctga agttacaaat 52921 cctttcaacg tacctgtatt tttcaagcac tacatcacaa agcagatgcc cagtccaggc 52981 tgcggagtca gattgggcag atgagtagga ggctgcccat ctgtcagcaa ctctaaaaca 53041 tcaatggaat aaactggaac catgttgcca tttacccgga ctttccacct aggagtcctt 53101 acacactgat gaaattgctc ctgttcctgc caaagtacac agagccctag agaggatatt 53161 atgtatctct tatgtgtacc tatccacctc tgctaaataa aaactaaata ttgtacccag 53221 agttccccac ttgtcttgcc ctggccttgc ctggaaaaat tgcaggaaaa actctggagt 53281 tgaaactcag gcaaagtaca gtgaaggaga acttttgtgg actgcctacc tttgcaactt 53341 attataattt ctgaatagtt ttgtgtgtgt tttatttctt tctcccccac agaacacgac 53401 ttaatgctct ccgagaggta caacccgtag aacttcctaa ctgtaattta gttaaaggaa 53461 tcggtatgta tgtgagagga agaagggtct catttgaact tccggtattt ttttgctctt 53521 tgtgtgatta atgtgcctgt tcaattctgt ccaaaaataa aattttgaat caggagagtc 53581 agtggtagtt aatgataaat accatgttgt ttgacttttg acttcaccaa catactgcca 53641 aggtgttggg agttttggtt gtttgttttt gtttgatcaa tagagcaaaa aaaaaaaaaa 53701 gaaaaaaaga aaaaagaaaa atagaataat gtttgattcc atggcaatct aacctgagga 53761 gatggcagaa gaggggaggg agagccagat aaagcattga ataaaggtaa ctttttttta 53821 gaaaggctct gacaggactt tgcaattacc acaaggcttc tccttatgca tctctctttt 53881 gatattggac aggaaaattt tttccagaag ctctcagcag atttcccctt agatcttatt 53941 tgtcagagct ggaggacagg ctggtgccct agttgcaaag gatgctggga aagtaattat 54001 caagcctcca atgtgggagg caggccctgt cagcaaagat aaaacggtag gagaatagca 54061 gctgggaggc aaccaacaag gagagtcccc gggctcaact ctccagctgg ttccaaccaa 54121 ccagctttgt 
tatttgaaga caattggttt gagatcatgc ttttgtgtgc tttgggtgtt 54181 tctttttctt tctttttttt tttttttttt ttgttctttt agaattattc atctcagttg 54241 attaaggctc aggaggctag atcagatgac ttagtaccac taaatttcac tagatatctt 54301 atggtgcgat tgttttctag aaaaatatga tattgatatc aaatatcact gggtttttga 54361 aaagcaaact ttccactaag tgtagaaata cctatcaaat agttcatttt tctctagata 54421 aattgaaatg ttaaaaattt acatgtactt aaagtataac taacttgaag acagtaatat 54481 ttaaccgatg tttataatta gtagtaaaaa ttcatattct tgatcaccta aaatgtatcc 54541 ccattttaac tcaactattt ggaatttgct tatatcttca atatggcttt ccaagggttt 54601 aattgatttg gaagtattaa gtatcaacat atatttgtaa tttaccttgt atctttagaa 54661 aacaagttct tttcaaataa tcatggcttt tgtacgtttt gtgaatttcc atgaacttta 54721 agaggattca gtctttgagg aaaaagctct agtccatcaa tttaaaacaa atatactaag 54781 tgtaaatttt aattaaactt gttaatgaat taatatattt ttaatttcca tataatcgca 54841 ttcatcaatt tgaataaatg aaagaatgaa ttattctgaa cctattaaag aagagtgatg 54901 tatagcccca gtttcaagtg aggtgtgata aagaaatgga tccacatcct gcaataatat 54961 gaaacaacct gtactttctg ttctcttttc tggcagaaac tggctctgaa gacttggaga 55021 tactgcctaa tggactggct ttcattagct ctgtgagtgt tttctttcac ttttctgtgt 55081 tctagaacgt tttctaggac tggcagttta agtctttcac ttaggctttc tgtataccca 55141 tgcccacttt caaataaaat gaacagtcat ggtttaaaag acaaattgtg tatgcattta 55201 cacttgtgct ataattatga tcatgcgtgt tctatgcata tttgtgttct tcggtctaca 55261 ttgaaccata ttacgttagc ttctattgca gcaatttttt tgtgagttca gtgtattcaa 55321 gaattgtttt cctctcaatt gtctattccc taagcactta aaattgccat attcaaatat 55381 ttgctaaatt aaatatacat ttaatgtata ctacttcaca ggtcatttca actttcacat 55441 ttttaaacaa atttcagaat aagcccttta atttaaaatt gttgataggt cccaattccc 55501 aattcccaag ctttgaagac agaatccacc atatttgctg aaactgtcct gcatatagat 55561 cgctatggta gctgatatag tgcaaaacaa aagaagtcca ccatctggcc tagggatcat 55621 aacataacag agttgatagg aaacgacaga caatcatcca gccatttaaa aacctttgct 55681 accccttgta atttacttag tccagttttc tcttacaaag atgatagact agtcatcctg 55741 caaagcaatg tcttgcaaac tttcttgagc aacataaggt gtatttcaag caaaatctta 
55801 tatgggaaca caatatataa aacaacagct ttattgtttt gattgaaggc taagaaagga 55861 gtagttcctc tggcccacca aaactctgtc tttgagatgc ttccaaggac ctaccaggga 55921 acagtttaaa agccataagc aatattctct ctgtattcct tgcttatatc attgaagact 55981 attattaatt gcctttttgg tctgctcttc catttttgat ctttctcatg ggctggaagc 56041 tgtagtgtcg tgcacatatg actttgctgc aggggatgaa agggtttgtg tggggtggcc 56101 tagtggagca tgtggcttcc agaagacttc tcctggggac ccattacaga agctaactaa 56161 gaggcttcct atgtggccat ctggctagcc ctgtggtttg gagcagtcaa catgaaccca 56221 tgggccatgc agcgcctcta ttagcatttg aagtactggt attgtgcttg atttttctcc 56281 tccatggttt ataagggatt aaagtatcct ggaataaaga gcttcaaccc caacagtcct 56341 ggaaaaatac ttctgatgga cctgaatgaa gaagatccaa cagtgttgga attggggatc 56401 actggaagta aatttgatgt atcttcattt aaccctcatg ggattagcac attcacagat 56461 gaaggtaaca atatttacta tttattaaaa ccaaaaccaa acttaaaaca aaaacagaaa 56521 aacatgccaa taaatgaatt attttttctc attcaaacaa agcatagtca tgaataatgt 56581 tcttactctg ggttttaaac ttaccctcta tgttaattga actttatagt aaaaccactt 56641 aaatacttga gaatgctaaa atatttaaga tttgtaaagc atctaatttt gtaaaatact 56701 aacattcaca ttatccctat aattaccata ataataatat ccagccctga tcattatagc 56761 ttttccattt taaacgattt tcaagactcg ggcatttaca ggaacagatt tttctaaccc 56821 aaggcatctg taagctatag gtccagtgac agaaagtcac catggtcagc tttctgccgg 56881 tcccattatc ttcaatgcct cctgtccgct ggttcctctg ttctgtgttc aaaatgctcc 56941 tctgcatgtt ggtccctttc aacataactt ggtgtcatcc tttatgtctt tactcccttt 57001 ctccctttcc atttgctgtc acttccttgt ctctgtatta attttcagcc ccagttgcac 57061 aatggtagca cctgaagagt ttaaaaagac tcgatgccca gcccccagct tgacagattc 57121 caattccgtt agtctgaggg aggcctggat cacagtagtt gttaaaagtt tttcagttga 57181 taaaaatgtc aggctagggt taagaaccac gtttataccc attacctcca tttcatcatc 57241 agacgatccc tcattcagtt cctcttctgg actttcccca ctgtgggtgg ctaaaattgc 57301 taccttgaga atcatcagtg atgttcctac taacaaatct cctctcttcc cttcctccct 57361 ctccttcctt ccctcttccc ttccttcctt ccttcacctc cttttgtttt ttgttttgtt 57421 ttgtttttga gatggagtct tgctctctcc caggctggag 
tgcagtggcg caatcttggc 57481 tcactgaaaa cttcacctcc cgggttcaag caattctcct gcctcagcct cctgagtagc 57541 ggggaccaca ggtgcatgcc gccatgcctg gctgattttt cgtattttag tagagacggg 57601 ggtttcacca tgttgcccag gctggtctcg aactcctgag ctcaggcaat cctcccatct 57661 cggcctccca aagtgctggg attacagttg tgagccaccg cgcccagcct acttcacctg 57721 cttttatgct gtttaccatt ttctcattca caaaacttgc tcttccctga tactgcagtg 57781 cacctagctg ttccagttct catcctacta caggttttct ccccaattca ttgttttgtc 57841 actcattgct acttccctga gcactacccc agaggtttga tcttttactt cctctatatt 57901 ttctctattt tagaactttc aaaccagtgt cacactgcct gctttctagg aggacatccc 57961 caaatagggg tattattatt cttcttcttt tcattagccc ttcactttta atgccgcagg 58021 tgcttactgg acacttctat ttggattcct ttcctataga cactttaaac tcaaaatgtt 58081 gaaagtgatt ttccatcctt ttatctttcc ttctcttccc catgtgagac acacacacac 58141 acacacacac acacacacac acacacatat acatatatat ccctttcaat ggactttcca 58201 ctcatccacc cattcatcac cattatgttt tattttaaaa atatttttaa gtacaaagaa 58261 atatagtgtt tgatgtgtgg atacttttta aaataaaaat gaggtaatat tacacaaccc 58321 cttctataaa tttcttctgc attaaacact acatcatgga ctcctttata tatagcacat 58381 ttgcagttgc tttgccaagg cagttcggga ttcagttgtg tgggagcact gtaatatagc 58441 tcatctgcta acaaatattt aggttgttcc cacctttatg ctcattgcaa acaatgccaa 58501 aataaacttt ctcgcacatg tattttctga gactatatcc ttaggataca tttctagaag 58561 tagaatccac tgcacaaaat gtgaattcag gttttaatcc tcacttgaat ttgatttcca 58621 acaccctcct tcctcagctg tattcagggc ctttggattc ttcctgtcaa atgcctctca 58681 tgacctttcc tgccttttat tgccagtggc atcctcttgg tccagtcctt tctctccaca 58741 ttgctggatt actgctgtag cctcagctca tgttactcct tggttcaaat accccagtga 58801 gcaaagcttg gatcttcatc ttaagcccta acaagactga ttctggctcc ctgtggtttc 58861 agccctctca ctctcccact ctctcttgcc ttcactctgc tcccagcaca acctcacaat 58921 taagtgaata tgtccaggaa gttctgcccc ggggcctttg cacactccca gcacaacctc 58981 actattaagt gaatatgtcc cggaagttct gccccggggc ctttgcacac tcccagcaca 59041 acctcactat taagtgaata tgtccaggaa gttctgcccc agggcctttg cacactttct 59101 cccctcaacc tggaatgctc 
tttccctaga tatccacatt attcgttctc ttatttcact 59161 taggtttttg ctctaacatc ccctcatcag agagaacatg cctgactccc atgaagataa 59221 tacttcgtta ctctctattc ttttatcctg ttgtattttt cctttgtagc atttaacgcc 59281 aactaaaata aattaaacac ttgtttgctg aatggggtaa tgaaaatcat tttagcagct 59341 ttctctgttc aaatcttctc aattcaacct gccaaatttt gcatcttaac atggtgaaac 59401 cccatctcta ctaaaaacac aaaaattagc caggcatggt ggtgggcacc tgtaatccca 59461 gctacttggg aggctgaggc aggagaatcg cttgaacctg ggaggcggag gttgcagtga 59521 gccgagatgg cgccattgca ctccagcctg ggcaacagga gtgaaactcc gtctcaaaaa 59581 caaaacaaaa caaaacaaaa acgccaattt gatcatgtca gaggtttcaa aagacctact 59641 ctttcttaag gcaaagacca taatctaact tttacttgcc agctctgatc acacattctt 59701 gaccgcttag actggtcacc tttgtgttct taaagtccca cttatccaag gaagctgctt 59761 ctgtttgttt gctactattt ctcagatgtt cttctatgga atgtatcatc tcacactgtg 59821 taaattacac ctgatcacac cttctcatat tgggagttcc catggttttt atccctttca 59881 tcgcaactgt gcatttcgtg gagaccattt gaactggaaa acttaaagga ttctttggag 59941 ttttagtgtc agtgatactc attcttgaga attttttttt ctcctgtgga gttttgagtt 60001 ttcagagact gtcactggtt cttccttgag ccacttttgt ttggaacagg agcaatcacc 60061 aatgcttctc tctacaaaga gatgaaaata ttccatctgc ctgacatcct aactactagc 60121 tggaggtggg ttgaaattgg tttttggaga gagggctgac agtgagagct tagttaatgt 60181 ttcatttttg tttttaatct cctctcagat aatgccatgt acctcctggt ggtgaaccat 60241 ccagatgcca agtccacagt ggagttgttt aaatttcaag aagaagaaaa atcgcttttg 60301 catctaaaaa ccatcagaca taaacttctg cctaagtact gagatatatc acatttctat 60361 tttcctttag ctgtagcgga tagttaatcc acttttagtt tttccttttg gccagagtag 60421 gtatggctgt tcaactctca tttctaatga tcacaaactc ctccacatgc tcttctgaag 60481 aaccttctga ggactcaagc ccctcagaga cggcccattc ttttcctctc ttgtttctcg 60541 agctgattgg atgctggctc ctggcctcag gatctccatt caagttatta ccaggaaaaa 60601 aatcctcttc ttcagaacca gacttagtat ttcaaaccca aaccggcttc atagataaat 60661 ttgacatcat gagcttattt atttatatta aaaagtagac tcttagatgt ttccatgaat 60721 tagaaacaca gacccaagag gagaccctga atcaagtgca tctaagacct ccttatactg 60781 
cctccgctgc tgtttgattg cacaaaattt atgcaatagc agatctgtgg taacactcaa 60841 ttccccctag gcaccttgtt catgtaacat atggtgaaat aaaacactca agaatctcca 60901 atctcagtca acttcctcac atgccctgtg aagatttggc ttgtaaggcc agtgatggat 60961 ttaggagaac aatccgactc tatcttctga aacatttacc tccaactgat tacgattttc 61021 tagaatccca acctattttt atataattta tgtacagcat ttgactacct ccatatctgt 61081 aaatctagat ttgtactctt tgcaggtacc tagctaaggg cctggtatgt agtgtgcact 61141 ctgtgttacc aggacccctt cctcttcttt cttatctggt ggtttgagga tgaggaaaat 61201 gagattggag agaaattgag ttactggatg aaggccaatc aactcactag tggccaagta 61261 ggaaccccag ctcaggtcat cttctgcccc atcaagatga ataaaagttc agagaacagc 61321 ttttggtgga agagtttatt cgtgcattta tttaacaaat atattttgga gagagaataa 61381 ggtcactgag tggagaatgt gaggaacacc aagatttaaa gtagagaaca gtatgtagaa 61441 tggagaagct gggcaaatga taacactaat aaatgagctt tctctgtaag tcaggtgcta 61501 cactaactaa ttcaaacaga ttattttatt gatgttttac aaataccttc tgaattaggt 61561 gttattatta tatccatttt ggatttttaa aaagttgagg cctggactaa ttgttaagtt 61621 tccccagatg acatcatcag taaatggtgc agctgtgatt ctaatcctgg taacatgagt 61681 tcagagctgg tgctttaaac ctctacaaaa tagtgaaaaa aattatgtgt tttaacattt 61741 aatcttggaa atctcagatt ggtctacgtt accggattgg agtatttttt tcttgagcca 61801 gttgcaaaaa tattggtctt aggcagcatt ggagactgaa aataaccacg gtgccgtaat 61861 taactagagt tctgctctct caatccaagg attagctgtg tgtattttct tctttgtatt 61921 ttatggtttt ttataacatt aatattccaa taactaaatc tattataatt taaacttgtt 61981 attattcata tatcacaact gttattattg gtcatttact gtatgccagg catggtgtga 62041 aggcctttgc attcattatt tcattgtatt cttactctag cctttcaagg tatacagaga 62101 aaacagaggt tcagaatagt ggctcaaaat ttagtgacaa tgctgtggcc ccacattata 62161 aggatagcaa gcagtgtcag gatgcaaatg aagtccatca gatttcaaag ctattactct 62221 attacccttt aaaaatttgt tatttattta cttattttac tacacctggg ggtagtaaat 62281 tttcagcaat aaaaagctat tgctctaaac cattccactc tgcccctttc tgtgaaaaac 62341 agacaaatga tagtccagca tactaattta agaaatttac ttttaaaaaa gagagagtaa 62401 aatcagattg tagatttatg tctggagccc atccagacta atcctaaatt 
ccgagacctt 62461 tttttttttt tcatgttgta atagtggcta gcattgattg agggtttatt gaatgtcatg 62521 ctccctgcaa agcacacaac aggcattgtc tcattgtaat tcccactgca actccacaaa 62581 ttaagatcct gacccttagt tttctactaa tgagctgtat gattttgaga atatcattta 62641 acctctcttt acagatgaga aaactgaggc tctgcatgcc tgtctaactt gcctgagagc 62701 tcagagctgc gtcagagctg atgcgctggc catgacgcct ctgcaataga gcagagataa 62761 ttttctccca gatcatgccc tggggaactt ccatcacctc aatcagcaaa gaatgactca 62821 cttatgtgtt ccctattgta tccattaaag cagaaaaata gactcatata tcttacattt 62881 tcagtaacag agaatcctaa tcagaaagga aaaaaaaaaa gccatagcat tctatttctg 62941 agcattcttg ttcatggaca tttcattcag tttagaatca cgaaatctta aggctggaag 63001 atcaaccagt ctatcatcct gctttcacag ataagaaaac tgaggtcctg acatgtgaaa 63061 agattgattt ttcttagatc acacagctaa tagtagcaga gcttaaacta aaatgcaagt 63121 atccaggctg ctggttcagc attttttcaa tttcttatgg gagttagttc tatgtgagcc 63181 cagatatgtg agcacatttt tgccctgaca atgatttctg gcttaattta atttctactg 63241 aatttagtgt atttacacac agacacacac agatacacac acacacacac acacacacac 63301 acacacacac gacgtaacac tattttaaga tacatataat aggattttta tgttacagtg 63361 ctataatcac ctcctcagat ttcttagacc aatgattgtc taaggattgt atcggcagga 63421 ctaatttcat attaattgtg ttccattata gctagcacga aggctccatc ccacatcttg 63481 attttaggaa tagacagtga ggaatgccag ttaataatcc tgtaatgttc aataccttca 63541 ccttatatat tatgtgtgta tgttttaatt gcagtttgaa tgatattgtt gctgtgggac 63601 ctgagcactt ttatggcaca aatgatcact attttcttga cccctactta caatcctggg 63661 agatgtattt gggtttagcg tggtcgtatg ttgtctacta tagtccaagt gaagttcgag 63721 tggtggcaga aggatttgat tttgctaatg gaatcaacat ttcacccgat ggcaagtatg 63781 tgaactctct gaaatgtagt ggatttactc agattctcag gagatatctt ggaatatatt 63841 ctttttttaa gtaatatgat acattttaac atgttaccct agtgagataa aattaggaaa 63901 aatgtaaaat gtcttaattt ttcagggggt gaaatttatc aaatcaaaaa tttcattgtg 63961 aaggcatact ggttttattt cctttttgga aagaaagtac attttaagag ggggaactgt 64021 gatgaagatt tgatatgtct gaatagcttt gttttgagta aagatgtgag tgtataatga 64081 gttgaatagt atgcaggtat agatacgtca 
gtagtgcagg tttatgacaa gatgaggaat 64141 ttgtaccaaa atgtaaatca aggcaatact tccttcatca agaaagtatt actgtgaaaa 64201 acaagtcagt ttaactaagc agtgtgctta gtgatgtatt tgatttataa tgttatctca 64261 tggtgagcta taacaaacaa tttgcaaacc aacattggtc tggagactgc attttgggta 64321 gcattggtat atacatgaaa tatctatatc catccacata tatgcccttt ctcatcttgt 64381 gtcatcatca aggattttag agtccgtaac atatgcttag agaaagcaac aggggccagg 64441 tgtggtggtt catgcctgta atcctagcac ttcgggaggc caaggtgggt ggatcacctg 64501 aggtcataga ccagcctggc catcatggcg aaaccccatc tctactaaaa atacaaaaat 64561 tagccaggcg tggtggcgca ctcctgtaat cccagctacc agggaggcca ggcaggagaa 64621 ttgcttgaac ctgggaggtg gaggttgcag tgagctgaga ttgcaccact gcactccagc 64681 gtgggtgaca gagcaagact ctgttttttt ttaaaaaaaa aaaaaagaaa gaaagaaaga 64741 aaagaaaaag caagcaagca agcaagtgag agcaacaggg acagggacag cttttaatta 64801 ttcatatcat agaccagttt tcaaaaaata gaagataatt gagcagtaat ttcttttttt 64861 tgttgggggg atgcacagtg cttagactaa ccccagatct ttgtggggta tgactaaagc 64921 cacagggtca ggaccacact cagggcagaa gaggaggaag acttctcttg atccaagcca 64981 agcagctttt cttctctcat attttctctg gattaaaaca aggggctgtt gtccaagggc 65041 tttcctgcaa ctttcccgag agagaactca ctagcccctg aaatgcagaa cctgggcctt 65101 gacactaaat tgtcaaacca aaagagaaac gctgtaatgc attttactcc cagatactaa 65161 aagttagtgt tagggctaag ttcctcccat actgataagg acaactatgt cccctccttc 65221 agaaacaact tttcagagat taaacccctc tgtcactgta aactcactga gttctttcag 65281 aatttggtgt gtttgaggtg ggttccattt tccattcctt aagcaaagtt cttccctatg 65341 aataggtatt acaacttcat cttaatctct caagttgtgt tacttctagt actttgatgt 65401 agctctttct tctttgaagg tatgtctata tagctgagtt gctggctcat aagattcatg 65461 tgtatgaaaa gcatgctaat tggactttaa ctccattgaa ggtaagacgt attaacactg 65521 taagtctgca gagtgataat tagattctaa ttgatttgtg aaattaaatg tgaagaaaaa 65581 aggactgctt aattggggtg ggtgagaacc aatttttcac tcctttatta agagtcatta 65641 gcttggttat gtgtttgttg ttcattttta ctcaactgca gagtggcaaa cagcaaaccc 65701 cacctcttac ccgtgaacct tgcattcttt ctgctctagc aaattttaag cttggcattg 65761 tgttcccaga 
tcctccttcc atccctccct tcctgccttt cttctgatct ccctctcctc 65821 caccttcctt ccctctcctc ccttttcttt ctcttttttc ctctatgttg atggaacagc 65881 caagaaatag cctcaaattg tactttatgc agggctgctc tccacaatgt ttatgaagtc 65941 cacgtgctca cgatggctgc taggtgacct ggctgcccct cacctctctg cctttacctc 66001 caaccattgt gtccctgccc cacgtctcct gccactgtct cagaacgggg caccattccc 66061 tctgctagga ggagaaacct tcctccagat ctccatgacc ccctccttca tctcctttca 66121 tttttactaa aacgtcacct cgatcttcaa aacctggaac tgctccctcc cgctacgtat 66181 tttccctcgc ctttcctgct ttatttttcc attcagcacg tattgctaac atactatata 66241 ttttattcat accctttatt acctgattcc ccaactaggg tgtaagctcc acaaggacag 66301 gctgtttttc tatttgcttc tggtacaatt cacattgact agttggcact taacaaatac 66361 ttgttaaatt actgcggtgg ggctgggcat gcttcaggga ttcacctgct gcccccacca 66421 attctggtca tctgaaggtt ggagtcaaaa tgggtgaaaa cacaaaggag aggaaaaaga 66481 gaccaagagc aaactagagc cctaagaaag cagccctcta cctccgaaaa acagcaagac 66541 gttgctttcc tgccataaac gcccactgca ggaagcactg tcagcgccaa ttgtatgact 66601 gttcctgtca gcacatctga gagactgtca tgaggggcct cactcagaga aaattacctt 66661 aaagcaacat caaatgtaac atcgtgtttg acattaaaaa tcactggcag ggctagtttg 66721 gtattagctc aagtaaaatg gaatttcaca cgctcatact ggctaaggga ataaaaaaca 66781 agctgcggtt ttaacaacag gtatttctgt aaaacaaaag ttaaagaatt ttatgagaat 66841 agaacccaat gtaacagaca ttcaccacag aatgtcaagg ctctgctctc accagggtga 66901 gactgagggc aaggccagtt tcagcaacag accaggcctg tctcccatca tgaaaacccc 66961 aaaaaggttc tccatcaatt gtcacattat tctttctttt gaaagaccgt attaattaat 67021 atcagttaat taatgcccga gagtgccaag taatttgata aactagctga ggctattctt 67081 aaatttgtga tgaggtgtgg atgggtttat gtccaattgg atattgttca gccttaaaaa 67141 ggaatgaagt tcttggccgg gtgcggtggc tcacacctgt aatcccagca ctgtgggagg 67201 ctgaggtggg tggatcacga ggtcaagagt tcaagaccag cctggccaat atggtgaaac 67261 cccgtctcta ctaaaggtac aaaaaattag ccagacatgg tggcacatgc ctataatccc 67321 agctacctgg gagactgagg caggagaatc acttgaacct gggaggcgga ggttgcagtg 67381 agctgagatc acaccattgt actccagtct gggcaacaag agcaaaacgc catctaaaaa 
67441 aaaaaaaaaa aaaaaaaagg aatgaagttc tgatacctgc tacaacatgg ataagccttg 67501 aaaaacatga tgctaaatga cataagccag caacaaaagg ataaatattg tacgatttac 67561 taataggaag tacctagaac aggcaaattc atggagacag aaaatagaat agaatggtga 67621 ttgctaggaa cttgggggag agaagaatag ggagctattg tttaatgggt acaggatttc 67681 tatttggggc tgattaaaag ttctggagag ggatggtgat gatggttgca caacattgtg 67741 aatatactta atgccactga gttttacact taaaaatggt taaaatgtca aatttttata 67801 tattttacca cgatagaaat aattctctcc tctctccaaa aaaagattcc aggacttagt 67861 caatgaatta gtcacacttc aatgaacctc aatgcgcaaa aacctggaat atttgttaac 67921 agagaatact ctgtttttcc caaagagaat aatttattat gttttcttga ccaaggaatg 67981 cgaatttttc gagttgcctt tgtgggagtg tgagatgtga tacgcagggg tcttgcatga 68041 gttagcatgg agaggaatta tttatttgca aggcaaagag ttcattgctg gactcagaac 68101 aggctccctg ctagactgtc agcacttcca tggcatgagc tgtgtctgct gggtacctgg 68161 tgctctccag gtgttgcgtt ccttgcagag ctcatgtagg agctcagtaa ttgtgtcttg 68221 aatgaatggg accccaggtt tctgtgtcct tctcccactc ccactctcat ctcaggtttc 68281 atctgccgtg gaattcacca attggctgta ctgcttccaa agataagagc aaagccctga 68341 aacaaacatt ttcctcattc aaagacatta tggtatcaga gtggtgtgct gacactggga 68401 tggacagggg tttagggatg gagaaagcaa gcatggagcc gcctgtcttg cctgtctctg 68461 gggtaagaaa gactgatggc tctgcatggt ggcgagactg aggctctgcc atgatgcttg 68521 agccctctaa tccctccttg ggcatgtgag gaacagctct cattttgagt aggaggcaca 68581 gggaaacgtt ggttttctgc agctgtctgc ttatgccctg gctttttcaa tgagtcatga 68641 ccagttattt acaggttctc tgggcactga ggtatgctgg cctctgaggt acatggtaga 68701 tattatttct ccaactgaga attagaggta aacaaataga tgggatttct ccccagtaga 68761 aggcagccaa gatgatgaat ggtggaatta agaatcattg gtattattta tgatccttgt 68821 acaaagcctc gacagcaatc tgtgcagtca tcaaatgctt gcaacttata tttagagaat 68881 aaacagcaga agaacgggga gttgaacaac atcaggaaac taaagaacaa acttttcctc 68941 aactacaagg caaaaatgag accctgggtg atcatttctt ctctggctgc acagcctatg 69001 attagccatt aaagagtcca catttcattg atattcttct gagaaagaat ctacatttcc 69061 tggttgtttt ctgaatgctt ataaaaatat cattaccaca 
tgccatttta gaaggagatg 69121 ttttatgatg caccgaaaac ttcttttttt tttcagtttt aggttcatgc acagatgttg 69181 gtacaaatga ttttgtctaa cttttaacag tctttcttgg ctttctgtat cttcacccta 69241 cccaattctg ccctggaaac tgacaaaata ctagcataga tgaaaggtca gagaaatttt 69301 gtgagtctta ttgaaatggg gcagaattgt atccttggaa gcagcagcgt ggtgaccaag 69361 gacaagaggc aacttttctc tgatatattt tgctccatca caattgtgtt ttactttatg 69421 tcttgttctt ttcttgtttt taatctttta agtcccttga ctttaatacc ctcgtggata 69481 acatatctgt ggatcctgag acaggagacc tttgggttgg atgccatccc aatggcatga 69541 aaatcttctt ctatgactca gagaatcctc ctgcatcaga ggtaagtttt aaaaatgaac 69601 cagataagtg actttattcc cagctactgt tttttcagtt gacaacataa ttctcaattc 69661 ttggtttttt tttttttttt tttttttttg tgacagagta tcgctctgtc gcccaggctg 69721 gagtgcagtg gcgcaatctt ggctcactgc aagctccgcc tcccgggttc acaccattct 69781 cctgcctcag cctcccaagt agttgggacc acaggcgcat gccaccatgc ctggctaatt 69841 ttttgtattt tttttttttt ttagtagaga cagggtttca tcctgttggc caggatggtc 69901 ttgatctgct gaccttgtga tccacccgcc tcggcctccc aaagtgccgg gattacaggc 69961 gtgagccacc gtgcccggcc aattctcaat tctatgtgca aataaaaggg gggacatgga 70021 tagatgatcc aaatgatcag aaatgaaggc ttttggaggt gctttagtca gagtctaggc 70081 aagaaacaca tggcacactt aaattggata atttgagggg agtttaatca acggactata 70141 cacaaaatgt gtggacacag tgtagggtaa tcaacagagt tataatgcag taccccaggg 70201 ctagcaaggg agggaggaaa tgaataccag aagccagaac aggtagacgg tggtgggtgg 70261 agagggttgc ctgacaggaa cttggcctct ggttaaggga tcctcaaaga acataatgat 70321 gccatactgc tatggtttaa tcattcatta cctgccttcc tattcaactc ccccttcctg 70381 tgtggcggta tacgcacagg tccatgcaat ggaccagtcc tggagcacac tggccagcct 70441 ctctagatct tcctgcaggt ggcaagataa atagctggaa ttattctggc ttagaagatc 70501 tcatcttcca ccttagttac agtactcaaa ggtcccattt tttgtccacg gtctttattc 70561 agtgaactat tttgggtaca agtaagaatt tgcaacctac ttcaatatag gtatttgcac 70621 atttactgag ttttcagtat ctactaagga tacttattac ttacatcata tatatgatat 70681 catcttgaca tatttgtcta ctgctgtgta ttgttaaagc tcagttttaa gagtccattt 70741 cacctgccca acaggaaaaa 
aaaaagtgtt aaagcccgta tctcttgggt taaaaaaaaa 70801 agaaacaaat gtgtatttcc atttacatta tggctctctt accttttatc tctaggtttg 70861 caggacaatt gtgatcatgt ttaatggcaa ccctttgtgc cttttgctag tttttagcta 70921 ttagggttgc tgaatggtat gtttaataga attgaagaaa tgtatgctgt ggagtgttca 70981 gtgctcagaa ctgagcagaa tgttagattc taatttgttc ttgcttgtta gctcatttag 71041 taaccttggg tcatttgcta tagcatggac ttggtttgtt gttctggaga aaatacttca 71101 tttctaatga ttgatctgca aaggagtagc aaaaagtagt aacaattcct tatccaatgc 71161 actgccccta gatgagtgat ttagttttat taacaagaag cttagctaag ccaaggtgta 71221 ccatcaacta tttccctgta actactacat gttaatgcta tggcaacttc ctaagacatc 71281 tgtctttggt gccagctgtt tttaagaggt cattttgaac gttcccacag acccttttat 71341 atccacgcag tggccttcac cgtctgccac atttgtgatg tagatatcct ttcttcaagt 71401 cattatatat tatcataaag gcttaataat agaaagctag agctgaaggg gaatttagag 71461 ataatctagt cccacccatt tcattctaca gatgagaaac aaatgtgagt ggaagatcca 71521 agacatgacc caggtctccc acctcagtgt agtgcttttc atactaagtt gaatcacacg 71581 ctattgctat tttttacagt cagaatgatc agatattggc tatttcatat ggttggctta 71641 acatgatact acactgttat tattgataca atcacatttc tcctcaatta agtgaaggaa 71701 actgaactta caaaatctta tacatttaga gacaggaggt taagttcaga acacatgagt 71761 atggtacaca gatatggttt tttgagggcc acattttttt tcttgtctgc attttaaaaa 71821 ccaggaatta agtgtaaaca tccagttagt tccctggctt acctgaaaat acatgaaaac 71881 ggaccttaat tactgcatgg caacagtttg gctattgctg agtgtattag tcaaggttcc 71941 ctagagggac agaactaata ggatatatat atatgtaaag ggggtttatt aagtattaac 72001 ttacctgatc acaagatccc acagtaggcg gtctgcaagc ctgaggagca aggagagcca 72061 gtctgagtct caaaactgaa gaatttggag tctgatgttt gagggcagga agcatcagca 72121 tgggagaaag atgtaggctg ggagggtagg ccagtctctc ctcttcatgt ttttctgcct 72181 gctttatatt cgctggcagc tgattagatt gtgcccaccc agattaaggg tgggtctgcc 72241 ttttgcagcc cactgactca aatgttaatc tcctttggca acacgctcac agacacccag 72301 gatcaatact ttgcatcctt caatccaatc aagttgactc tcagtattaa ctatcatact 72361 gagtaacagc ttcctccctc cacatttact gaaatcccca ctgctcccta atgtcccaca 72421 
tccaggatgc gctgctcatt tgcatttcct gcctcatccc tgtaggtcta gatactctcc 72481 acctctgcta aagatcactt gagatcagta cccacctgtt cccaaatgaa gaaacaggtg 72541 tgttaagtgt aattttggaa caaaatagga tttatctcat tgtataagat tatatggctt 72601 cttatggagg atgaccctat tatgcataag gttgttttaa gtgacttaac aaaaatatta 72661 acccaatgtt atttcttcca caggtgcttc gaatccagaa cattctaaca gaagaaccta 72721 aagtgacaca ggtttatgca gaaaatggca cagtgttgca aggcagtaca gttgcctctg 72781 tgtacaaagg gaaactgctg attggcacag tgtttcacaa agctctttac tgtgagctct 72841 aacagaccga tttgcaccca tgccatagaa actgaggcca ttatttcaac cgcttgccat 72901 attccgagga cccagtgttc ttagctgaac aatgaatgct gaccctaaat gtggacatca 72961 tgaagcatca aagcactgtt taactgggag tgatatgatg tgtagggctt ttttttgaga 73021 atacactatc aaatcagtct tggaatactt gaaaacctca tttaccataa aaatccttct 73081 cactaaaatg gataaatcag ttatgtcaat tgtcagatat taaataacag tgtgtgaccc 73141 caaaagtact taccctaaaa catgtgttgc ctggaagcac atgtgtgtat cgctgccttg 73201 ccatgtcttg ttcagaagac acaggggagc agggttagct cacgtgtctt tagaactcca 73261 gtactcaccc agggactcca gttcacaggc cagaaaacat atgcattatg aagttcccct 73321 ctactccatg cacatagtaa gtctgactat ggcagtcaga cttacttact cccattttcc 73381 cttcgatata tgactttttc tcagtaaata ttaacctgaa ttattccaac tccccttgta 73441 ctcttgcttt ttcaattctc ctgttgcaat gacacatagg aaaatcttaa aattcttggg 73501 agtgttgtca cacctgaaaa ttatgagtct ctatgatctt ggcacaaatt gtacatttga 73561 gtgtctttga cttggttaaa ggaagtttgt tcacttcgat gactggatac agaatgaatc 73621 ccataattga catgggcgaa gctaaaagtg tccccaaaga ctacactgtt gttgaggtgg 73681 tggtagtgct ggtgggtttt tgtttaatat ttaaacttct tgttgtggag gctgaaaaga 73741 aaaaaaataa tagaaaggta aacaaacaaa taaatagaaa agatcaacaa cccctttggc 73801 tatctactga gacatgacta ggaagaaaac atgactttat cattttgtta tagaaactga 73861 tatataaagg ttacacattt tcatttattt gtttttctga tttgaaggta taaccttcat 73921 gatgaattac ttcttcaggg tgttaaggca gtgactttag aaacaaattt ttttcttgtt 73981 tttgttttgt ttttgagacc gaatctcact ctgttgccca ggctggagtg cagtggtgcg 74041 atcttggctc actgcaactt ctacctccga ggttcaagag attcttgtgc 
ctcagcctcc 74101 cggatagctg ggactacagg cacacaccac catgcccggc taatttttgt atttttagta 74161 gagacggggt ttcaccatgt tggccagcct ggtctcgaac acctgacctc aggtgatcca 74221 cccgccttgg ccttccaaag tgctgggatt acaggtgcga gccaccacac ccagccagaa 74281 acgaagtttt tctaaggcac ttgttatgat gatagtctgg catttccggg aaaatgtttt 74341 gccaataaat tgagaaccac caacagttgt ttaagaatca ctgacttgaa gtaataaaca 74401 ggaacatttt gtttgggtca aaaaaccact ttattcacag caagattgga gttactcatg 74461 attccagtga cataatatcc cgtgccctac atgggctcaa tttttctgag atgctgctca 74521 cagcccagga tgctgggttc tctgaaagac tgtcttgaaa ttatcaatga gtgtgatgct 74581 tagggaacac ccccaacata tttcattatt tagaaagaaa gaatttcctg agtagttagg 74641 ctccctaaca tcttcaataa aatgaccaca ttctgttctt tcttgttctt gatgatagat 74701 aagcaaattc cactgaagaa acaaaagcct atccaatttg acacagcaca tactcaagat 74761 cactcaaaaa gggacatggc ttggttagaa atttcataac atacattgtg aaaaagagga 74821 aaacagctta attcatgtat tctgtgattc ccagatctaa attttcgcac tgacataggc 74881 ttagtcacat ataaattata gtttgcgtgt gtaagaatga atgaaattaa tagccctcac 74941 ttatttttaa ggctttagaa agaatattgt ttaaaataac cccatactct acgatggtcc 75001 tgattcttcc ttagcttgcc cctcgctatg tgagactgac tacttaatgg atttgtcaca 75061 gaggttttcc tggctgagag atatcagaga ctttgatggt gaagcagcga caggggaaca 75121 aaatgacggg cgaggaagct tcgagtgcag gctttgctct tttaacttta tgcccagttc 75181 cataattgtg ctttcaaatg taatttctag ctgccaattt acactttacc acagttaaga 75241 cttagtttca tttccatgaa aaatgtgcac tttgctcttg attgacagat atatgctttg 75301 acaattgata caggtttacc cagtcaaggc ttgttttagt tgaaggtgaa aatggaagag 75361 ggaaaaaaaa ttgcttcaac agacctcagt atttcttttt attgcattcg ttgtctaaca 75421 acaataatac tcattggcaa gaaatttatt tacactaaca aattaaattt aatcacaggt 75481 attgttagat tggtcagaaa acaaaagacc actgtgatat tttgcttgaa tgcctctgaa 75541 aaccaaagag taaggaatgt catttgaaac tttttgaaca atctagagaa gtacatatta 75601 accaagcaca agagaacaat tcagacgtgg cactattgtc tcttccatgc aaatgatcag 75661 acatcaaaat gtacaaatta aaagcttagg taacatgcta tccatactta tcagacttaa 75721 aattattttc aaacaattcg tgttctttac 
tgatatacaa agaaagctga aaagaaatca 75781 acttcttaca gtggaataga atttattctc ttttattttc agctgcccag ttttgtcagt 75841 aaggtgtagc aggtgtacaa gtggttgcaa caccattaaa aaatataaaa gcagtagtta 75901 tattaaataa tgttgaagaa aacatataca tatatattta aggaatttca ctaagcacta 75961 actaaatttc atgttgttgg gaggtgttat ctggctggag tgcctcttaa ttctttttaa 76021 ggtaatttaa ttatttcttg ccaattgttc tttcttgaat gaatgattac atccttaaca 76081 gaaaacctgg gtgtataaat aatacagctg aaagaaacag aagtgtacac cttccaacaa 76141 catacactcc caaccacttg tgttgttttt ggcaacaacc aaaatgccat tgcttgcaca 76201 aaaagtaaat aaatgatcat atgatggcaa cattcatcat aagagagcca gtaggcagtg 76261 actgtaagca tcacagccac caattatgtc attttagaat tgtgagcaaa ttcacaagaa 76321 cacacagtgc aatttttagc tgcatacatt tcttctgagc tccaactcag caatatgcaa 76381 tcttgacatc aaagcattag ccaaatattt gaggaaattt tcttactaaa ataccaagta 76441 tctgataaat caattttacc aaaatatata aacatataga aggataagtg catttagtag 76501 aaatacttac aataaaaaaa tctcttaaaa tctatgagcc attcaatgta tacacaacgc 76561 aagaagtggc aataaggcta tgattacagc tccaggtcct gactgacctc gccaagttcc 76621 atacacaata tattttttaa tgccatgtga ttgcagtgtg tgtgcgtgtg taaggtggga 76681 gtgatattca cacatatgta gttctgaaca tctacacaaa cggcatgcac attgttgaga 76741 agttatctgt tctcttattg catacatctt actcactgtt gaatccctag cagttttggg 76801 gttgggttag ggtcaagcat aggaaatcat catatttctt tccttctcca ctcttgagga 76861 tctggaatca ggtcacagac tgtgctgggt ccacacgctt acatatgtga accttcaagt 76921 aactaccatt ccctcaatag gcccacatag acaccacgga caagcaaata aggaccctct 76981 gtctaccata aatagaactt gatgctttca ctttccagga gaagctgatt ctgtcggctt 77041 tctctttgcc cctccaactt cttagttggg cctattctat gctgctactt caagttactc 77101 aaatgtgttt tgtctggtgt tatttttttc tccagtgttt tggaccaaga ctagatttcc 77161 tgagacttta aatacgcaag acaaatgcaa acaaaatctg ccaacatacc caggctgctt 77221 ttcaggagtg aatcactgaa gcgcatttgg atttcctata acactacttt cttttgggaa 77281 tgtacagccc tcccccagct ccccatcccc catcataatg taaaccattc taatagaaaa 77341 tatggatgtt ttggctgggg gtggtggcac atgcctgtag tcccagctac ttgggaagct 77401 gaagcaggag 
gaccacttga gcccaggagt ttgaggctgc agtaagacat gatggcacca 77461 ctgcagtcca gtctgggtga cagatgacag aaaaaaaaaa aaaaaaagaa aagaaaagaa 77521 aatatggaca ttttatttct ggactttatc cttaaatgct tagctttaga atagtgcaca 77581 ggtacatcct taataaaaag taaatttgaa aagacaggaa aggactgtgc ttttctcttc 77641 cattttacct tttcattctg agaccactaa ggtctttgca ttaagtaggt gacgtgaacc 77701 ctagtgacca ctgcccacca aaatgcccct catctccagc tctgtcagga gtacctttag 77761 ttactcaaat ggaaatatct cagtgtaatg atggggctac aacatatcca aagtatagat 77821 ggttctctca gggctgctgg ggcagcttcc tttagaatcc tctagaacac tattgttaaa 77881 cactggatct cctcttgaag tttagtctgc ataaccttta acacttaaac atgaatcaga 77941 gtatctatat aaatgcaaat tttcatttcc cttttataaa atgcaatttt caagcacatg 78001 ctaattctgg tgcacaatga acccagacaa atttgtcttc agcacaaact ctaacggccg 78061 aggtgctaga tcaaaccacc tgtgaccatt tttgtagctt gggactggga tattttaaga 78121 tttcaggcta gtgatgatta cagggtcaca gatagtggca aaggtaaatc tggttcctca 78181 ttttccctct tgccactttg tttccaacaa caatggagca aaaggtgaca gtcccatttt 78241 gtgccagttg aaagctgagt ttggctttag tacctatcat actgcaatca tgaactgctg 78301 tgggttaaaa aaagcttgtg atgttgaggg tgtgaaatca attgtcaatc ccagctaaag 78361 gatgatagcc acagaacagc tttattgtca tgaaactaaa gggggacttc atcaggagtg 78421 ttagaatgga tgaaatgagg tttcttccct aacttaacat tctgcagaaa attcctgatt 78481 caggtaaaaa aagaatgaga ttagtattac ataaactgtg agttacttga agggagatgg 78541 tcttaaacaa agtaatctgt tataaaacta acatccttta atgggaaacc ctgacccaga 78601 tacagctcat acaatgcttt ctgtctgcag cttggcctcc tcaccacaca ttccctccca 78661 acaaagcata acctataccc caaaacactt ttggctattg ttagaatcat actgagctct 78721 cctttaggga atcagttcat agcaaccaga tattcccttt ttgttcattt tctgataata 78781 aaaactgcag tttttaaggt tgatgaaata acaattctgt gttaaattgt aaggacactg 78841 aaaaccacta gcattaaagt aggaattgtg caagacattg gagccttgat taggtatcat 78901 gcactacagt caataaaaga aataatagga aaacaaaatg tgcacaagat ctgaggtctt 78961 tcactttgct ttcataccct taatcagttt acatcttatg ctttgtgggg caatcaaacc 79021 aaatgacttt tacaatttca tccgttttaa gtggtaatgt taaaataact cacttcggcc 
79081 taaaattact ataatttaac ttaaaaaatc aaaatactta taatactaaa tgattttagg 79141 aataagaatt tgatttataa cattgtcagt tggacattta tctaaacatt caatatacaa 79201 agttataaag aaaaatactt ctaagatatt taagaaaatg caatataata gatcataaat 79261 taggtgcaaa aacaagacac ttacagacat ttccagtata tacatccatt ttcaaatatt 79321 accaaagtgt aaacataact taagataatc atccttctat ttctactctt gaatacagtg 79381 aaacttccag tgtgatccag gtcacttttt tcccaaagat aagcaaaaat ggcaaaaatt 79441 tacaatagca agcactaata ttaccagatt tttactaatt tagtgtttta gtatttgaaa 79501 agttttgcca aattgtatca ctcattaata gtggcttctt atttctcatg atgtgaaaaa 79561 ataacaaagc accagacatc tcaagacatt aatagagtat ccaatacggc agtttcatta 79621 tcatacccct aatgaagaca gacagaaaaa cataatggtg actttgataa ctgtaaaaca 79681 caatgttact tcatgtcaac gtttctgagg attgaatgaa gggtctgtac ctaatgacaa 79741 aagagctgac gcataaccga caatactctg gcaatgaaaa gaaaacaggc cttgggggta 79801 aaacagaatt aaaaatgtcc agtttagata atttctgaat tcagattttc agataaaaaa 79861 aaatcacact gttattaaaa tacaagatgg ggctacaagg gagttccctt aatctgttac 79921 actcaaaagc cttgccgagg ctctaaatat ggaatacttc tagtggaaga tacgagatgc 79981 tgggaacaga gcagagattc ccataaagta aaagaagtaa aatactgtcc aagaattcat 80041 gtggcctcaa gtcttattct gaaacttgtc ctttggaaag tggataacct catttttaga 80101 gtcaacgcag caaagagaac cagatacatg agcataaaaa acaaaaaacc ctcatcacat 80161 gaaatttctc aaaacgagtt tattttctcc ttcttctgcc ccagtctcct gcacttgact 80221 tagtttggtg aaacatggaa atccttaatg ctggcttaaa attagtataa gttcctccac 80281 tactaaagta tacctatgca ctgccttaaa caaacttttt ctgtaaaatt gagaatcttt 80341 gaagtatgga aaggaatggt tcggaaaaac cattcaaaat ctctctaaaa tgttaactga 80401 cagtttcctc taactggtat ttctattttt cttaaaagta catttgttaa aatactaact 80461 gagcccacct gggaaatctt gagtagaaaa tatgtgatga gttcctgatt gtgacatagg 80521 caggctgaag accagccaga atgttcccca ggagaggaaa cctgaaaatg gtggtttcct 80581 taaaaatcaa tttcctgttc aatttcttgg gttaggtagg gggaagagga actgtgttgt 80641 gtttgccagt gttgctgaaa tgctgaacac acagcagtag caaggtcaac agatcccaat 80701 atcccatggc tgcattaatg tctgattttt ttctttcttg 
ttcagccgta tctagaggat 80761 aaatattacc ctcagaaccc caaggtaata cccacatcca cattcttaag gcttttgctt 80821 ttttcagtgg ttaacaaata gctataattt attttctttc tttttgatgc ctagcaagca 80881 gaatgacttg cttgcctttt cccttagtag gtaactacag agcagagcac ccatgtttcc 80941 acagcggggt ggtctcactg tccagactga gaattaagag caaatctttc ccacacattg 81001 gagtaggtgt tgcagtccta gaagttaagt actggtctca gatctcatgg cagggtgctc 81061 tctgccatgc agacatgtgc aattgccatt tcaggatccc acactgaaaa ttcacggcgc 81121 tccattaaca acaccagggt gtgtcatgct cttgggtggg gtcacactag ttctccaagt 81181 gacaaaacac acccaacaca acacaagatg ttggtaagtt agagaagagc aaaacacagc 81241 tacttaaatg aaatctctgg tttgttaatg aaccaaaggt ggcttgggat tattttagta 81301 gttaaaatta agagtgcatt ctattgcaga aaatgcagtt attcaacaca cagttcctca 81361 gccggcctag agccgcatta cccttatcat ttagtttctt tttcatcctc tggaggagag 81421 cagggcagga agaggtgggg acttctcttg gagcatctcc atcatctgta agagggtatg 81481 tgttactgct caccagcacc ctcggctgta gttgacgtca tcttttctgt ctttttggac 81541 ttcctctgca tttgctcttg ctcctttctc cttagctttt ctctttgttt ttccattttc 81601 tcttgggcct tccgagcctt ctctagagac atcttcattt ccttgagttt ctttttgacc 81661 actgctcggt cctgggatgc tgtcattcca agagcctaag aaagaaacac acacacaaac 81721 tgaatttgaa aaccaatgag ccctctgatg gtgttaacat accatatatc caagttcgta 81781 aactaattca aacattgaag aggtaagaac agaggaagaa ctagttacat ttgctaaaaa 81841 ccaacacact gggggctacc ctttattttt tcttttgttc tttgaagtcc cttgcaaagt 81901 ataccctaaa gctcattaat taaatgcttt ccaaatgagt aattaatggg tgaatgaatg 81961 gaaaaagggc acaattctac ttctccaacc tttgggctgt aggcatttac gggtaaaggt 82021 attgctaaac tcatcaatat tttcaaagca taggagtgag cacctgctgt cacagagggc 82081 tccaagaagg gcaagaagaa ttgaaatggc ttcctttctg ccaatgtttt cattacaatg 82141 agaaccaaaa atgtctttgg ggccagaatg ctcgtgtcag catgactagc cattctttcc 82201 ctgtaggcag acctataaag gaagttgatt ataatgagga ggttgctcga tgaacttaga 82261 attcagaata aggggagttg cttatgtact ttaaatgttt cgtgaccttc agaatcaaca 82321 gcaggatgca tctatctatc tccccaacag ggacagcctt gtcaatgagg ccttcatatt 82381 cacaagctat tgattgatct 
ctaacctttt caagattgtc aagagatact cgtctgcccg 82441 tcagcccagc agggggctga ggcgggggtg gggcttgggg gtacatggag ttggctgtaa 82501 gcaccttttt ccctttccca aagggatgga tgccaccttg gtctctaaga gagatggact 82561 ggacaagtgg aaatgaaagt gccatggagg ggagggaaga tgtgtgtaca tatttgggga 82621 aaaaacttct ggggagaaca tcatcccagt ttgggagtgg tgtgtaagga agagtggagg 82681 tgaaggagaa atgaacaagg aaaagaagac tcgtggatct gggtcttaat tctaatgtga 82741 tgaaaaagtc cacagtggcc attattaaaa ctcatactac atcagatgta aggagagtta 82801 atatatggac catgtgttgc caggtggaag caatggagtg ggtccctcac tctcctaatc 82861 agcagcctac catttgattt ggaacccctg atggactttc tgagacttca tttactactc 82921 agaaagaatt caacatcctt tctgctgaat attgatgaga attacctctc agatatgcag 82981 aatgctctta cctaatgaaa acattttagg tgatatgagt aatacaaatt atacctggta 83041 acctaagaca gacatataat tctttacctt aagtttattt ccatccaact gcaggagctg 83101 ttctccagtg atgttttggg cactgaattc agatacatac tgctccagat ttaggctcat 83161 taaccagtga gaaacctgct gcacactcca ttcctgaacg gcccgattct gacactgact 83221 gtgtttggga gactgtccat catcaaggat ctaagatagg aatgttacgt agtgttgtta 83281 aaggaccaag atactatggc tcatattatg ctatcaaaag gaatttctta acctatgaga 83341 gccacattta tgtttaaacc aaaatatgat aagctaatgg ttctagtggt gttagaaaaa 83401 atacttatta cctatatgtt tttatgtttt tagaagttta gaaattcccc ataataaaaa 83461 agcaccatta ttaatgtagg aagtttaata ttgaagaaaa tatattaatg ttgcaagctt 83521 ttacatcaaa acagtctgga cagattataa gatgtagaaa ataaatattt cataaatttc 83581 tatatgcaat gcatcactta tcctctaata agacattgga attttgtaaa aatatattaa 83641 caaatactga ggtatctttc acttgcaaga ggagagcatt aagctctcaa gccttcttag 83701 gggaagtttc acaaccatta tttcacttga gcagcaagca aactgtgagg aaggtagggc 83761 atataatatc catcattact ttcaagaaat gaggaaatgg ctgggcatgg tggctcacat 83821 ctgtaatccc agcacttcgg gaggctgaag tgggtggatc acttgaggcc aggagttcaa 83881 gaccagcctg gccaacatgt tgaaatcccc tctctactaa aaatacaaaa attagccagt 83941 gtggtggtgg cgcatgcctg taatcccagt tattcgagag gctgaggcag gagaatcgct 84001 tgaatccagg aggtggaggt tgcagtgaac caagattgtg ccactgcact ccagcctggg 84061 
caacagagtg agactgtctt aaaaaacaaa aaaagaaaca aggaaactga ggctgagaaa 84121 gtctatgtga tctgcccaag atcacacagt tagtaagtgg tagagctcaa tagttaggag 84181 tttattctaa atttctggct ctgtccactg cagaatattc cttttaatag tgcaaaggga 84241 caggttggtt cctagaccca ctaagctgag aggagatgaa ggtgatgata ggtcttgaag 84301 gaaagacagg agtcaccttg gtgacccaca cagggaacac agaagaaggg atagggtaag 84361 ctgagatgaa ggcaaaggag aggaagaaga atggaccatg tcaatgtaat ggtgatggaa 84421 gagaaatcga ggaactctgg aggtgcaatg cggagatcct cttaagttag gaacatcttt 84481 taaactaaga tatgacaaag gcagagagta aagctattta acacaaggaa acccttcact 84541 acagaggaag atattcttgt attgtctgat ttttcagtca tctccaaaga gcgaaaaaac 84601 acattcaaac aatagcacac aagccaccac acacagccca ctgatagctc acctcgtcat 84661 ctatcatatc caggctctga gtgagggagc aataaaagaa tcaagaagaa aagtgttaca 84721 gaaacaggag acacaacata tatagtacat aattttaata aaattaaaat tgcaccaagc 84781 ttgaagaata cgggcagaaa agcttacaag aaaaaaaaaa gttttgtgaa taaaaaatca 84841 tgttagaaag aaatagtaag aacaaatatt tagagcaaaa atatctttat ttatggtgcc 84901 ttctgtcctc cctccatttt tcaaggaact aaatcactaa cctcatttcc tgactctctt 84961 gtacacattt atctcctgat aatttgtagc aatactaatt ttgacccttg tttttaaata 85021 aatggttctc atgctatttt ccctctttat caaaactatt gattctcaat cagagacctc 85081 accctaatca gaggcagtat cagaatcttg gagtgagtgt ggttgtcagg ggaggcaagg 85141 aaaaaaagaa aggagtaatt aggaaacttc tatgtgattt ctgagctgtt cccccatgtt 85201 taactctgca ctgcccagcc tccccatcat gtgtgtttac atatacgtgt aaaaggggcc 85261 cctggaaagc ttaagaacaa gactttgctt ttctgttgag aaagggaggg agagaaatca 85321 tgaagacttc aaattaaaca taaaccacag tcctgatgat ttctatgcaa tatctaccct 85381 aaaaacattt tcctggtgat ggttcatggg taaaacgtca gatttgacaa agcagctctg 85441 tcatagtgcc atggaattac ctcatctgat gacagtgcta aggactgaga gagccctggt 85501 gttttaggtt ctgctcctaa gccgctgagg tctgctgaac tggtactgct gggactgaag 85561 tcatcattga aggtaaaatt ctgtaaaaat aaatgaagct cagtgttaaa tgagtgcagt 85621 gagtgagcaa gaggtaggtg tgtctacaag ctctaaaagg gctcttttaa acaatcttag 85681 cagaaattca tgtatgattt ggtgatgacc ttcaatgggg cagcggttga 
gaatgctgca 85741 ccttgtctgg ttgagggaat tctgggtgct ttctccatgt gatgaaagga tagatgtttt 85801 agagagacaa ttcttcaatt gctcttcaag ggtacagaat tcattatcct ctgcaactgc 85861 tcagaggaat aatattaata agcctatctt ctttgaccct aatcgtgatg agtaacagtg 85921 taagtaaaaa tacaagatgg tgtggagagg ctgagttatt aggaacagat ggcaattcat 85981 attagaaagg tagctagagc acctcctggg aaggatcgct tattgttggc tgtgataaat 86041 actatgagaa atggcagatc ctctcctgga tgtacccttc aagatgatga gttctggggg 86101 aggtcacagc ctgacaatac actagtagtt acatgctgaa atacaactct ctcactgcac 86161 ccgcagagca tcagagaggc tgccacacag cacagacaca ggtttccaaa taatcacctg 86221 ggggtggggg aggtgggagc gtgtaaataa aacttcccca gaacttcatt aaaatattca 86281 gtggatatga aaatcaaaca acaatgacta cgtagctgca cttgtaaaaa catttttaac 86341 caaatctgga gtctctacag tgcaagataa tggcagaatt cttcgaattg catttagatg 86401 cctcagatcc aatactgaaa atgtttcaag accaaagtga tgtgctgcta tcaaaacttc 86461 ttttgctagt aattctaaaa tggatggatt tccaggaact ctaagcactt gaccaacacc 86521 atcttgtttc tattgcattt ccttaggaat tggactgtta gatttccact aggttccctt 86581 atactgggtt tagttggtta tcaaaaataa ataccatgcc agaggctgga gaattcagta 86641 ccgaacaaac tccaaagttt gctaaaactt cagaaaagaa gtctaggtta gagcacatgg 86701 agctaggcca gcagaacatt taagcagcag gacatttaac atgaacatta ccatttggta 86761 ccccagtcaa aaccctttga gctgttgaaa agatttaaat tgaaactgaa aagaattaaa 86821 ttcaagtaac attaatcacg tttctgtttg agaaagaaaa gcgaacacag ataaaaggag 86881 accaaacaac gtgtgaaagt tagccaccat atgtgtttca ataaaatgtt agataaggtt 86941 tgcagtgtca tgctggaaat aaccatcatg catttaaaaa caagtaacac ccatgcatat 87001 atttcctaat tctgttgaag aactttggtg atagaattat acagcagttg ttcaacacag 87061 aagtttgaca ttaaaagaag aaataaataa ggatgctatg tcatcattgg tttgttttaa 87121 agagtttcac ttagatgtac tgaactcgta ttcagtccca aatttatttg cttttcagta 87181 aatcttgaga agcacagtga aatagtttag taatgtaatt taaagtagaa tactgggtaa 87241 acagatctat cattctgttc tatgattcaa atgcatcttt cccagcatgg cttaactgtc 87301 aaactgagat taaatagcaa tatgtgaaaa gataggcaaa tcaaagagta atgtacaatc 87361 tacaatgaaa cagcattaaa gaaggtggta 
gttaaatgag gctttctttt ccccacataa 87421 gaaataccat gcattgttta ttgcagtgtt attacattgg tttcttggat ccagactaca 87481 gacataagtt gagaatgcct tccaaagatt gaagaagagg agggatttgg gtttctcttg 87541 tttgcttttg taatccatgt gttttctacc ttggacccct ttttatctga aattagagtc 87601 tcaggagaag gctgaaggga acttgtaggc gcaggcaggt tcctgaagga ataggatcct 87661 ttccggctgt cattaaacca tgagaaaggc atgcaagggg aacgagtgga ggtctgagct 87721 gttgaacatg gcttgggtgt ccagttttcc agtctgcctc tgaagatcct gttttgtgaa 87781 aagaatgggt gtgttagata aagaaagggc ggataaaaga cctactgaac ccaaggactc 87841 tgaatatagt ttgaagtaac aatttgagaa actgttctat tcaacagagt aaaaagtttg 87901 cacttttgac tacaacacta atcttaatag ttatcttttt gcctctaaat ggattctgtg 87961 acaatgatca acattttcgt attgctattg ccatggtgaa tttttataaa atgtgcctaa 88021 taagtgtacg tagatgaatt ataatcaggg aggggcacag tcttttagga actgctatta 88081 ggttaatcat cactgcattc cctttatctg ggtgacatgc ctcacagcta gggacattag 88141 ctcataagca gcattctttc ctgtcttcct ctggacttgt gttccgaagg gtctgagact 88201 taagaaaatg atatatttaa ataattttga aatttcaggg ttcaaaatac tttaggtgga 88261 acacgtagtt tattggtttt tcaccagaat cttagtgcct tttcacagga ggcgtaggag 88321 atctgaagtc ctcatctccc acttggaggg ccagggttta ccacctccta agtacatggt 88381 ctcaggcctt ctctcggctt ttagttctcc atctgtaaaa agtggatatt tttatgtatc 88441 tcatatgtgt ttgttaagaa ttaaatgaga tatgtgaaag tgcttacaaa ttgtcaatgg 88501 tctatgttta ttgtttaaca tttaaaggat tcaaaggcat ttgaagaacg cagtttgaaa 88561 acagattttt ttgtcttaat gtaggttctt tcatatctgt aattatcttt tggttaagag 88621 aaaaaataac tatcagaaac caaagtagga acaggcagca gtttatgtgc atcttggaag 88681 aggtcaccct tgccgtttga tggaatgcct attatttgat agctgctgat accacgataa 88741 cacagagatt ccctttcaat taaaagtgcc tggacaccac agccagacaa cttctgtccc 88801 accacatgta tactgaattt cagtcttctt acatctcatt tcaaatatat tttgaaatat 88861 atttccagtt tatataaata gtataacacc taacaattcc catttatttg caaatattat 88921 tgtgaggatc gactgggaca taaatgtaaa agcatcttgt aaactgtaaa gcaagataca 88981 aatgtaaagg gattataagg ctgataactg aagagaatga agaagtcagt gaggttctga 89041 taaaatattg 
agatcagcat gctgggggaa ccagatgtca tcctacacat ggtctacact 89101 ctaggtgagg aactccaaaa acatgaatct ggttgattat ataggagagg gaaagctgtc 89161 ccctctgcag cctctgggga gggtggaggg agaaatctta tctccttaat gtaaattcaa 89221 cttgggaagg atcttgggct ctgacccttt gggatgtaaa catgtgtctc tagggaaaga 89281 tatgactagt ggctccagct ttacaactct aggaatgtct ccatggagat ggtctcatcc 89341 ccctcttcca cttagcctcc ttaaaatgta accacattac ctctgcagag gtcaatctct 89401 ctggagttct cactatctat gcacccacaa ttcaacttct gttttctttt ttttgagaca 89461 gggtctcgct ctgtaaccca ggctagagtg cagtggcaca gtcatggctc cctgcagcct 89521 ggacctcctg ggctcaagca atcttcacac ctcagcttcc agagtagcta ggacttatag 89581 gcgtgtgcca ccgtgcctgg ctaacttttt tattttttgt agagataggg gtttattatg 89641 ctattgacct cctgccttgg cctcccaaag tgctgcgatt acaggtgtga atcacaacac 89701 tcagcctcat aatttatttt ctgaggcttg ttttttagct taggttccac attctagtaa 89761 aaatgcctag aaagctctag ctgtgtagaa acatataaat attttctctg cagtcccata 89821 ctcttataat tgggtatctc tgaggaaaaa ctggaccgga tgatcattat ctcctctggt 89881 tatagatttc tgaatcaagg gaatttttaa atgagttcat ccacaatttt caataaaata 89941 tagtagtgct taaatatatg gatattgttt taaagtttat aaataggcca tgattaatga 90001 cagcatgaca atcttgtcct actcccagtg aacagccttt gtaacaatgc ttttagaaca 90061 atcactatta cctttaaaag caaataggga caaaataaga attgttttct acttgtggtc 90121 attccattgc tttagttgaa aaggactttt ggtagtccaa atcctgtcca agcaacattt 90181 gaaaagaaaa aaatctgaag gtcacatcta attaaactct ctcacaacat tttcactgtc 90241 agaaaaatat tcaagactaa taattccaca tggacctgtg catccagcac tgaggagggt 90301 cttgaggcca tgtggttctt acatgtgcgg tgaaggagat cacagaagct gggcctctgg 90361 tgatacccag aggcttcctt tccctagctt tgcacaccaa gtaatagaca ggaaggagga 90421 agagccaggc agaaggcaat gagaggccac ttccgagaag acacctacgt ctctacagtg 90481 gcaaatcacc agattcaggg atattttgaa gaatattttt cagtttccca tgaggcttgg 90541 ctgtgcccct gctgaagttg tatcctgtat ctctcattcc tctgtgagta tctgtgattt 90601 taaaacccca tgaactgatg taacccctgg gtacttgcta gtatccttaa agaactgaac 90661 agtgagtggt atgctttcat ttctcggacc tagagaacat tcatttaaaa aacgaaattt 
90721 ttgctttaaa attacagacc tatcagctta gcagacatgc tttctaaaat aaatgtcaat 90781 ttccttctga ataaattgaa ctcttgggcc cacaatagaa gcagaaaggg tttggcagca 90841 gttattccat gttgaccact cttatagtct atttcacttt gtggagaaat ccattaggaa 90901 taacataaaa tccttctttc ctttcttatt tgcttcctcc ctccctttct tcttctcctc 90961 ctcgttcctt ccttttcttt ccttttcacc ttcctttctc cccaaaatcg agattggtag 91021 aaatgaactc ttgatgaatc tgaaaataat ttttccagtg gatcaaaata ggtagatagt 91081 ctatattccc ttcaccttct tttttagcta taatttactg gcttgctatc atttttggaa 91141 aaaagagaat cataagtcac tcctaggaac tgggagaaag aatggcttca aattcgtcct 91201 cataatccaa agataccatt tcagaggtag ctgtttcctg gcatttaaat ggtttatgta 91261 atatctctta taagtaatga actcatttac atatatcttg aatccagtgt atcctggggc 91321 tggcatccca aaattaaatt gccagctaaa cattccaagg tgatctttgt tgttgtcatt 91381 ctaactttga atacaggtgt gtgaagtatt cagcatctaa ctggatgtca tcaaaatatc 91441 attaagagtt cagaaactgc cagaacagat gaagtataaa agaaatacag aatttaagcc 91501 caggagtaaa agaagttaat tgaagaaaat tagttgcaac cagaggaaga ttgcagactt 91561 gtatttacct aaatcgaatg aagtaattct ataaggttct aacatccacc aaggagtttg 91621 aggttgtaaa gctagcctca tctaaatggg atctagagac cctaatggat aacagctgtg 91681 agaaacatgt ccatctaaag ctcagggcca acccccgccc cctactcaga catatcctgc 91741 tcccagatcc tgagttctga tctaatcaat aatagttcat ctgagatggt cactgtagtg 91801 ttcaactctg aatcattcga gatggtgaaa ctattccctt aatgagtctt ccccatactg 91861 gaggggttta aaatgaaata gaagaggatc acgtaaataa aatccatact atcaaaatct 91921 aaatgctaag tttaagaata acagatatat tgcttaggtg tttaagaaaa tttgtatgaa 91981 tagaaatctg aaagagaaat gtgtccactg attctacctt aaaatgttaa actcatttta 92041 atgatctaaa ctcattttaa atttgagttt ttaacaaaca cacaatatat taagcacttt 92101 gataactata ttgggacgtt tagagtcact acagttaaac agtaacttat tagattttca 92161 agagcttttt taagtatgga aaagtgtttg cctgttaaga atgtaagttt gtcaggcttc 92221 aggcatttga aatactgtac ttaaaaagaa aactgaatat taaattgata aatgcagcta 92281 aaaatgggct agacagttat ggggaataag tccgatctac tagggctaag ctgacattat 92341 tcaaaggtct ccatgaaaat gactacattc actgggccag 
gcttaggcag agatggggtg 92401 tttttctact cacacacatg tgctcatctc catcaattct tctaaaaggt gtgtttccag 92461 ccagctcaga tgattcttaa gcaggggttt tgctcttact cagcaacagg ggcagagtgg 92521 accctccagg tacttgtcat aggatataat ttcaaactca ctgtcatgcc atgatttatc 92581 taaagcctct ttgagagaaa tagcctatga agtacagcct gaagtataat ccatatagcc 92641 cattcattct tcaagatata acaaaagtca tccttggatg aacaagactg ccagcaaaat 92701 acgcaagaag aatgtcattt tgtatccttc ccatcaaacc tgctttaata aacaatgata 92761 acaatagaca tttttattta aggatttgaa atgtacttca atgtgaccat cttaatcctc 92821 ctaataacat gaagaagtat gtgaggggcg ggtgtgtgtg tgtgtgtcta tgtctccatt 92881 ttacagatga gaaaactgag gtacagaaag gttaagaaac ctgtccaaga agtaagtggc 92941 agagttggga tttcaaccca ggcttttaac catattgaca ttgtgttttt ggctttcact 93001 ggaaggtcac tcttatactt actatgaaga tgttattgaa gatgttgtat tttaagtaac 93061 aattttactt actttttaat ctaacttttc cctattcaat ttacttttta ataaattata 93121 ttgttgtagc atatgtactt ttgtatgtaa acaaattatt ttggaataac gaggagtaga 93181 atataaataa gaaaataaac aataaatgaa tagaccaact gaataatggg gtagaaatta 93241 aaggaagatt tgtttttcat ggaatcctac agatcaacac tatccaaaaa gaaatataaa 93301 tgaaccacat tgtaaactaa aattttctag tagcaccatg ggaaaaaaag aggtaaaata 93361 attttaacaa taatacattt tatttaccct ccaaacccaa aatattattt tgacatgtaa 93421 tcaatattat aatattatga agatatttta tgttcttttt gttcatacta agtctttgag 93481 tccagcattc attattctcc tacacatatc ccaatataga ctagccttat ctgactagag 93541 gcaaccatac tggacagtgt acgcagaggt gaatgggaag tcagagccat gcagtgtcag 93601 cccaatccac ttattttttg gtaggaaacc aaggtcccag gaattcagag acttgcctaa 93661 tggaaaagta aggactacta ccttggattc cagctcagtg ttgtttctgt tgttcacact 93721 ttctacttga atgcaaattc atttccatag aagactgcca tctcctttcc ttatcatttt 93781 ctcaacatac tatcacctag ccccatctga gatacacaca tctccccatc tgagatacac 93841 acatctcaca gagttttcca ttccgtgact acactggaaa cttatctaca gagctccaaa 93901 gtagatgctc cactgacatc agtggctact tactccagaa tacagaatag gattttattt 93961 taccctttgg tatacatttt aaggggtgaa caagctctga caactaaatg tttcctgctt 94021 ttgttatctg ctatgctgcc 
tgtttaaaat cagaatgttg gctgggtgta gtggcttgtg 94081 cctataatcc cagcactttg ggaagctgag gtgggagaac tgcttgagcc caggagttca 94141 acaacagcct gggcaacata gcaagacccc acctctacaa tttttttttt aagctggaca 94201 tggtggcaca cacctgtagt cccaggtact cagaaggctg agctgagagg atcacttgag 94261 cccaggaggt caaggctgca atgagctgtg attgtgccat agtcctccag cctgggcaac 94321 agagtgagac actgtctcaa acaaacaaac aaacaaacaa aaagacaatg ccactgagca 94381 catatttgaa aagggatgga accaagtcat atttctttgt tgggtttatt tgtttattat 94441 tattagcctc atccattttc tctagttgtc tcatgtcgcg catgtatctg agtcattatg 94501 tatccacaga aacacaacaa ggtaaactct ttatagagag gaaaaaacta aaaccaagac 94561 caaatagtga aagtcttgcc tctttaaaat ctaacagcca taggttatag ctgaccaatg 94621 atcttatcag aatcttagcc taataagaat tggatgaaat gcttcaaaaa gataacatta 94681 ctcagtccaa ggatccttcc ctcctttttc tgaggaagtc tcccagggtt ggtgagtgta 94741 ttcccagttg cccactacaa tcatagtgcg accctgcagc tgatggtgct tacagcccac 94801 agactttcag gaccctgaga atccaactca attgcattaa gctcctactg ggcagcaatt 94861 ctgttcttcc tgggataggt tttataaggt tctgtaccca ttatcaatct atacagaaga 94921 atatagctgt ccacagattt tattagtgtt aaccaattac tatagcagct caaaattttt 94981 tttcttgcaa gagaaaagtg tccttcttct ttccttgttc ttctattcca attatatgaa 95041 ttcctttcca tcctgccaaa attctactat aattactaat aattataaaa ctatgatttt 95101 attttttttg ggtatgccaa tgtcataaca aggtttgagg gaggccatct cacacatgcc 95161 tgtgaacacc caatcatcac acttaggaac tacaaaagat caaaatcatg attctaatta 95221 gaaactagtt tgtagtgtgg gtatgtgtac aataatttct gactgctgaa tcagggggac 95281 tccagagttt taaaaataca gtggtggcga ggcagccaat ctagaagtct agctgtgggt 95341 tcaaagcagg catcgactgt ggtgaagcag tgtgctgatg gccacctgcc cttgacaagc 95401 tgtggttttg cctatagtga ggtgggcagg cactcaccaa cacaatcaag attggcttga 95461 ctttcatttg tgatttatac aaggaggtga atcatctctg gctactatgg caaaattcca 95521 tcaaacaccc tccccctttc tggctgcagg tgctcacatc cctctcaggt ggtgcctcaa 95581 agatgagagg gagtgactgt cttcaaagcc cagagaccaa aggaaggaca gtgtcatttt 95641 cagtatctga gtttctctcc catcatgtat ttatgcatct ctttctcccc taaacggttt 95701 
cattatttgt ttttttccat gagtatgttt gattttattt ttctattctt aattttaaca 95761 cacaaagtgg atcattctca ctcccgaacc acacaaagtt aaactacttt cgagatgcat 95821 atagcctcag atttaccttt attttcctga gattgttgtc ctcattctga gagcattcta 95881 cccttttcca atgtacagta taaaggaaag ccacactaac aaagataatt aatttgggaa 95941 ttctctctcc tcatctcccg cccattatgc tcctttaatg ctcctttaat tccaagaaga 96001 aggtacaatt tctcaccaag gtatatgtca gaaaagagaa aaaagaaaag taaacagcag 96061 tggacaaccc atgaagtgga gaagtggtcc ttgcatgatt ttatagtatt ttatttttag 96121 tgtgtttctt ccttagtcta tgatattgag attatttcca aattttataa agaaatttaa 96181 gcatcattag atgcataagt aactgaaggt ggcatagtat gttggaaaaa catctacaca 96241 ggaggagcaa gccccaggtt gacattaact gcctgctgca taaccctggg caaatcaatt 96301 aaattctatg aaggtaaagt tcttcatcaa taaagtatgg tattattatc tactctccca 96361 atcttttttt tttttttttt ttgagacgga gtctcgctct atcgcccagg agtgcagtgg 96421 tgcgatcttg gctcactcaa acctccgcct cccgggttca agcaattctc ctgcctcagc 96481 ctcccgagta gctgggatta caggtgtgcg ccaccacgcc tggctaattt ttgtattttt 96541 ggtagagacg gggtttcgcc atgttggccg ggctggtctc aaactcctga cctcaggtga 96601 tccacccgcc ttggcctccc aaagtgctgg gattacaggt gtgggccacc aagccttgcc 96661 atcctctcaa tcctataagg atcaaatgaa atgataaata taaaactact ttgtaacttg 96721 caaataataa taaggtaaca gtaatggcaa tagcagcaaa ttgttatctt tattattctt 96781 tgaatacctc ctacatgcca agtatcattt taaatgtttt acatgaaaac tcattgaatc 96841 cttacactga ccttatgtaa aaggtactgt tattactctc attttacatg aggaaatgaa 96901 ggcacagagg gtaactcact tgcccaagat ttaaaagtta taagtaggag agccatgatt 96961 caaatccagg cagctgggct tcagagcttg atgctttact gactccctat acaaatctca 97021 gcttaaaatt atataatctc agaaaacaac tgggaaaatt tgataacatg ctgttcaaag 97081 ctcgctcttt atgggaaggt ctttactgaa gactaagcat agttgacagt gcccacaaaa 97141 tcaattaaat agccaagtgc agcaaaagat ggtggtcact cctacataca tgtggacaag 97201 acaatcttga atgtcaacaa ggtacttaaa tgcaaaatga ttatcctttt agacttaata 97261 caatgcacta gcattgtggg tagccaggca aaaaatctga agtatcacat aaatatattt 97321 gatatttctg aagtatcaca tagatatatt tgtgctgaat ttagataaac 
acttattgaa 97381 atggttcatg tggccataat tctgcaatta agaaccaatg gatatgccca aggctagatg 97441 gaggctataa ttggagtaag accaatataa actaattgtg caatttttct tccatctcaa 97501 ttcttggcat atcactttag ctcaaagata ctgaccactg attttggtta acaatagaag 97561 agggggccag aaagtttagg tatagaaata cacgtgttct taatcacaca accgtacctg 97621 ctacctgcag aaaacctact ggcttctttt tctttttctt tccacttctt tcccttgctg 97681 gaattccttc gcaaaggcgc cctgtaatga tagtaagcag tcagaaaggg gaagctgaaa 97741 ttttctcttt caatcctcct tttacagagg gctttctttt tacatcgtga agtgcttacc 97801 ccagatccac aaacttccgc ttaatttttc ctccttgcac cgccaatgaa ctggatgccc 97861 ttagtgattt gggatctttg gcatcatcta ggatatacac atttgtgaaa aagatcaatt 97921 attattattt ttaaagtcac aattcaaaag tcaatgcatc tctgaatcta agggctttgc 97981 ctaaattcct aattgtaaaa taaataatca atgccttctt atacgaccta ggagacacag 98041 ttacattgag gaactcgcag ttaagcaggg gaggtcctct ataggacaga ggctcaacat 98101 tcatcacaaa gagttttagc ttcaactgca tgggaatgaa caagggagaa taaatactct 98161 caaatcatcc tctgcattcg aagtagggac caacatgact tttgttcctt ctctggctcc 98221 aaattcttgg cttcaagtgc ctgggctctg tcctccattc ctttcttgcc ttggggttgt 98281 agtgccttct aacaggaaca tagcccactt ctcccgtatc tccaaagccc agcgcagaat 98341 atcacctctg agagtctggg agatttaggg ggaaagtcag agtcaaagtt ttcttctctg 98401 gctctccttt gctaatggcc tgtggccggt cctggaacag aggaggggag gacatggggg 98461 agactcctta aacaggggct cagctacaaa acatttatat ctttgaaagg gccttcttaa 98521 tagttaggtt acagcaaaac ttggctggtt ccccttaatg tggaagatgc caaaatctcc 98581 tcagtgttca gagcagactg atctttgcct ctaggactct caaaggtagc agctgctggg 98641 gaatggggac cctcgaaatg ataccaaaag ccaataggaa gacaggcttt gcaaatggag 98701 ttcaatgtca aggtgggaac caattccact ggtggcactt gttccatctc ggacaggctc 98761 tacccaaact attcagcaga gggtactgat cggcatcacg tttcagcata tccaattagc 98821 tcagatgatt aacaaatgct gttttaaatt tgatgttttt tatgtgtgaa actcttcctc 98881 cccagttctt ttttggtatc tgtttttctg tctcttatct tttaaaatgt gttttaaatt 98941 tctaggaact ttctagtatg gaaggcaaag agaatggccc tcttaaaaaa gtgttcaatt 99001 aactttgcag gttaatgttg ctcatgattt 
ttgtttactt caagagaaat ttctcaacac 99061 acatcgacat acacctttaa aaaacatgaa gtacacacac agttcatgga tagcagtcta 99121 ggttcaaagc tctcaagcag cacaggacac tgacaaatac tccaaatgtt tttagcaatg 99181 agcttttcat atttttagtt aatgatactt agattacaat aagcaactca ctgaaatgta 99241 cataatgaaa cacaagagtc agaaatatgt ctgtgtcaaa aatccacttc aagcctcaaa 99301 tatttattat gcatatttgc atcattactc tttgggtaac tgaattgtca cagatgtgac 99361 acaatcatat aaaaaaatcc atagacatag acatggcaca aacaccttct aaggtgattt 99421 tactaatgga ttttattttt aaaagcaaga tttaaatttt cttaatagaa gctgaagaaa 99481 agaaagtgcc aaattcttcc agatcaatcc taaagcaact cttaatggct aggactatgc 99541 aaagttaaaa ggagactgcc taattatttc ctgaatgaga ggtagctggg agatgttaca 99601 atgagcgctg agaagaccag ctgaggcaat gtacacgaaa acactctgta acccatgagt 99661 gttgcagtta tttggacatc atgcgactaa aagcctgatt ctgcctatca cagagcttgg 99721 tgctaggcaa ccttacatct ttatctcaca catttgccaa agtgcgttcc caggaacact 99781 gaaatgaatg cttccatagg gctgaagatg ttaagaataa ggaataatgt actttcccat 99841 ttaaaatcga ttaaatccat ctttaatgca cactagaact tctagtctgc ataaaatcca 99901 acattagatt ggtatgattt caggaaaaaa gaaacataca ttttgtttat ttagctctac 99961 gtaagacttc tgttaggctt tttaaaatct ctccaatact acctttgtga ccaattatac 100021 atctgagccc ccagattggg tgcctcaact ttaagctgtg gcaacctggt tttcattttc 100081 aaataataaa tgatctcctg atggccaaat cctaaggtct tgtactggac tgcatctcct 100141 taatctcact gtaataagat acttgactat ctcttccttt ataatgggga ctcccaaatc 100201 tacctggcag tcctgacctc tctcttgcgc tcatgacctc ctctgttaaa aaaatgagtt 100261 aatcctattt cccccatcat gttgctgatt tcctattttc tgttaatgac aagaccatat 100321 gttctgaggc gtgaagccca tgatagtttt gcctcacttt cttacctgtt acctagtcaa 100381 ggtatgtctc aagggtcctt gaatcattct ttcctgtcac tgtctactgc tcctctttag 100441 agggcaccac ctcatagggt ctcctaactc agcttctcca tacctggtgg tgtcttttcc 100501 tttttattcc tctggtcaaa cttcttgtgg tttcctagta atatggcaag ttatctaaga 100561 tgtttaactt ctaaagctat atacttaacc tactggctca cttttgtgtg taactactcc 100621 accctgcata agcaaactgc tacagccagg ctgggatagc cttgtacctt taaatcatat 100681 
gatatgtgcc tgtctctgtt aatgacatca ctttacccgc tattccctcc ttacctaccc 100741 tttttttccc ttaagtgctt tgatggctat tctaattgta cttccttgga taggttttgt 100801 ttaccaacct cagctggacg caatctctct ctcttctgag cgttgatata attgtactct 100861 atctatactt gcagaggccc aatgctttta ataagcaata aatcctgtaa agctttaaaa 100921 aagccagttt ctatttacta gacattaatt caatgaatat ttttgaacac atcttctggg 100981 tcaaataatg tgttattaac cacacaaaaa attagaataa taatctctag taattgtgag 101041 tagaaaattg atagtgttca atttacttta tttaatcttg actgaacaaa ggcccagtgc 101101 tatgtaagcc tcgggaccaa aactttagta ttaggttgtt tatataacag gaacgaggaa 101161 aatattttca tgccttctac tggcatataa cctgttttac aaatctctag aaaaaaatct 101221 tttacttcct ctaatttgat attccgttag atgcttatca tttagccagt tataatggga 101281 cttctccatc ctaatttcca ctgatgtccc tcagatatga aaaaaaatga tttgtaaatg 101341 tggtattgac acaattcaga aaatcgccaa ctcctgttag ggtaaccaga gccttatcct 101401 caagcctgca atcaaacatg gctgtaatta atcatctttg tggtaacttg atagaccaaa 101461 agtaaaggta taacactttt taaaaattct agataaatgg caaccaatga tgaagccatt 101521 ttcagagttt atccagcaat tttgagtata agaaaatata cttctagacc aggcatcagg 101581 aagctttctt tacctttcag gggccacaga gtaaaatatt ttaggtttgt gagatacaca 101641 gtctctgtca caacgactca acactgctac tgtagcatga aaatagatag tatgtaagta 101701 aaaaagtgtg actgtactcc cataagactt catttacaaa agcaggtgga gactacattc 101761 ggttgattat ccctcttcta gaagatctgc actggttacg tggagaggta tattagcaat 101821 tcatccaaga acattattaa gtatcaatta tatgcctgtg ttactttgat cttcgtcttt 101881 ttttcctata tgactacgtt agtgtttact gaaactctgg gctgctattc tacaaaatcg 101941 ctctatattt ttctatatga ctatgtagtg tttttttctt tatgactgtg ttagtgtttg 102001 ttgaaactct gggctgcccc actcccctga agaactccct tgagtagtaa acttatagag 102061 agctgaagag ataaaaatac tgaaccagat aggaaaagat aaaactaaat taaagtaaaa 102121 tacatggagg ggttgtttct cttattctat aatataaaaa gaaattaata ttaattatta 102181 gctatgagaa aacaaatgac ttatgattat gagaaaaagt catttttata ctgtacaagc 102241 tgaggtgtta tgaagtcaga aaaagaaatt aacaatacct ttttctcgta atattttctt 102301 gttggtggtt tggtgatggc 
aaggagattc ttccacatca tgatcagact ttgaagtaga 102361 aaacagtgaa gtgtctcccc acatagagga gagaggcccc atttctgggt ccagtggttc 102421 ttcttgaaat tcagctatgt ggtcagacga gaagggcaca ttgctatcca ccggggtgag 102481 gggtggaaca ccagaatctg attctggaga agacgttctc aaacccttag gtggaagtaa 102541 tttggtaata tgcatgtggt tatagaaact gtttgatggc tgttggtata agagattata 102601 aggaaggtta tgccactgag tatcagttct atgcaatact tactttgaaa taagaccaac 102661 cttttattag tgaatcccca tgaaaaccag tttttgttga tttacagatg agaatgcatg 102721 gttcatacca attcaccata ttaacatcgt cactaataca tatacctaag acctacagaa 102781 acaattactt gtacgttcat tatttcccac taagtaaaac caatagctga atatcaaaag 102841 caccatcatg aaatatgaca atacagataa tgctcttggt tagttcactg aatgaataga 102901 caaaatattt tgaattacaa caatataaag aattcaggag acaaagacgt gaactattct 102961 agctgcttac caactaggac atagactctg tttgcagaaa aatcataatc ttcaataagc 103021 tgtaggaaca atgatatact ccaacagtac aaatcaggaa acggtcaagg gtgctcacct 103081 ttctctctag actgtcctcc ccatctgtgg aactaacact atcatacagt cttgtcctag 103141 agggtctctg gcgtctgttc ttcacagaga gctgggctcg agttttcagt gcttttgaat 103201 ccaggcgctc tgtctctggg actgcttcat tcaagtctaa gaatattgaa agataatcag 103261 tgagagaaac ctttggagag aactgatgaa gagaccacag acactgataa aatctcaagg 103321 taaccagtaa catcaggtaa ggatcaggtg accaccatga cagcctgtaa ccaagaaata 103381 accaagaaat cagtggggct taggagaaat atacaaagtc aaaggaagaa accaacacta 103441 aagatgatga aggcgctata aaaatccaac atccgaaaag catacaacct tcctgcaata 103501 taattataaa aatcactaat ttgtatttac tatttatgtg tatgcatgtt gggaagttag 103561 atgaccaaag atatagagga gggtatatca aaagttgaca gatcttagaa accaaggaca 103621 aaaaacctac aaagtcaggt ttccagttca actacaggca acagaaggta actatcataa 103681 atttactgta gcaaaattta ctagcagtct tcaggtacta tatcattttc attttaatat 103741 ataattttca acccaaaaca gtttctgaaa atctaccttg ttcctttttg aaacctgtag 103801 aaagaccagt gaggaattta cctaatcagc aaattgacga tagctgggga aaagaaacaa 103861 ggtctggcta cacaaatgaa gaaacaaaca gttaaaaaat cgtaagtact tagtttacta 103921 gtattgtgga tgaggtagga taaacatttc cttttctttc 
attaacatct aactttgtat 103981 tacaagttat ttgtatgtgc ctctgactca tctctgtgaa taccttctct ccccaccaga 104041 aacgacactg attttcgaag ggtaaatact taagtcttta aataaaaagg ctaaaagaaa 104101 aagagagtaa agtgggacac aaagaaaatg aactattaac agcaaaggcc acagagagaa 104161 ttcaattagt cctctttagc ctggctctta ccacctttaa ttttgctacc tagattttaa 104221 gttaggaccc cgaagttaac attctaaagg aaaggtaccc attaacatac ctggatattg 104281 gttaaggaaa agtactctct ggtggaactt atttgaaatg ctactactcc tttcatttgc 104341 atatgtaatc actccttaga aataaaggag tttgatcatt aatcctattt tatctcttca 104401 gagtggcagt cattagctga agtcatgttc aaggagttaa taattgagtg caaataactg 104461 agttgcagaa aattatgtac ttgtattaac agaataaaca atttcttata atattattag 104521 ctcaggcaat tcctttttac agcattattc tagatgtgct ctaaatatgc tatactacaa 104581 aaccaaagca aggaagcacc tcagtttgtg cctcttgtct atgaaaccta gcacttagta 104641 tatggctcta acaaacaaaa ttaactacat ttttgcccag gagatacatc cattgaatga 104701 caatatttag agaaggtttt gacttttcat ttcctatatc tgaatgaaga atgaatactc 104761 agaaaattca gaccattcag gtacttataa agaacagcta aatattttct aattagcagt 104821 ctctttactt tttacatgtc aggggagaaa aaataattcc ttgatcactt taatttcctc 104881 cttccgataa aatctatatt tgtagctgcc cattaagagt tcattagttg tgataactgg 104941 gagagagtca atcttataag ctcttgctta aatataaaat gtggaacacc acgtagatga 105001 aagctgtaat ataattagaa acactgtatg tagtacttgc acgttttatg ccttatctag 105061 ctcatatttg gattgtatgc tctataatac aaatcatgcc acatactcta tctcatttgt 105121 tccttacaac agccccatga ggtaccaatc aagtattttt tttctctcct gaagctcaga 105181 gaacttgagt agcatgccct catttacata actaattaat gcgctaccat gattactcta 105241 gtgtcttgat ttccaggcta ctaggctatc cactatgatt tagctagtgt tgataaacct 105301 cccattgcaa gattttaatg ccattagagt ttggtgtttt gggttataaa atcatatatg 105361 acgtgactag tctaggaaac tcatttctcc aataggctcc atctaaaata ttataattaa 105421 tcttccctat atgttagata acatgataaa aaaaaaggat gacatgttga aaagtataga 105481 ataaattact tcctggatga ttttttattt tggtctggtt ggtttaataa cagctttggc 105541 tactaagtga caaaaagacc caaggaagaa tctcactcca cttttgtaaa catggttcct 105601 
tttgcagtag aatgctggcc ctgaggatct ggtatggaac ccaatatcca ggactacaat 105661 aggtactcca tcaatggaag ctgaacaaat ataagtgaga gaaaatcaga actacttatt 105721 agcaggtttt aatagtgcag aaaatttact ctgggactcc ttgctctatt actattcaat 105781 aatttgcctg gtgcttgtgt cagacaaccc aaaaatggca atctggtgta tctgctttaa 105841 cattagctga ctaggtgcaa actgcttgag cacacacata aaattatatc tatttatgat 105901 gctgttagct ggaatctaaa aagttgtgcc ttatatccgg tacctaagta gccaacaaca 105961 cattgtatgc agttaacaat ttttgaaggt ctgaaagcaa atatcacaga gctactttat 106021 tgcagtgatt catggattac aaatcagtaa tttatattcc attctaaaaa taatttttta 106081 aaagaaccag aaaaatagga tgaagttgaa aactttcgcc ttaatagcat ttacaccaga 106141 aggaagctgg aaggaatact gtccacaaga agcacctacc aaagacacta catgcaatat 106201 gtctccctta acagtattag aggacacgtg tgtacacgga aggaaaatgt gtataagatt 106261 tatgctgctg ggtaactaga agggtgactg accatcacat gctatctttt tccatcatgt 106321 aatacaaaaa tcctaatatt acaactcaaa ataaccttgg gattctggtc ccagcttatt 106381 tctcatataa gaaaactgag gcttaaagaa ggtcaatttg tgtattaact tgcaccacac 106441 agttaatctg tgacagagag tgaatagagc tcaggcttca gaagcctatt ctccttctgg 106501 aaggtcaatc tcaagtctgg aagtaaactt cacattcttc tgccttcttt gagtcctctt 106561 tttggtctta ctttatgttc ccaacaaaag cacatgtcaa gggctaactt tgcaaacttg 106621 cagaagcata tagcatcaga ggaaacaaaa acccccaaca ctgagggatt ttttttggct 106681 ttacatatat ccaagaataa aattcttaca aagaaaaata gctgtcagtt ttgtggatgg 106741 cagtgtggca ggggaaatcc aactggctca gtgtttctaa gttatgtgct gctagatgta 106801 aaggcttaag tggccatttc tctcactgtc attcgttctc tacagatatt ttagtcaata 106861 atattacact gctttcataa gcccaagacc tcagttcttt cttttcctcg tttctccagt 106921 gtgccttgtt ctgttgtcca taagggtcag cggaaaattt ccttttcctc ctctgctttc 106981 ttatcttcca tttttctcag accctccagt ctctgaaata gcctacctct gacgcaatgc 107041 agctagaaat acagaacagg agaagactga ggtgaatcaa taaatacaga cagataaata 107101 cacagaagat atgaagtagc taatgacaga gcagttttga gtgagagaga ctccacctct 107161 atacctgcat tttaagacta ttttcaatac tgctacaaat tggatccttt aaaagaattt 107221 aatggatttg atgattttag 
cagctgttat gaaaaatgct ctaagaaatt aggatgctat 107281 atatcatatt acacagagac cagttcccta cagtaatact gtgagtcata cagttttttt 107341 cactagattg acgttatttt tgacgtaagt ggaatgctgc ccacagatca gatacaggca 107401 gcgagggaac taggggaatg gagcagcata acaatggggt ctgctccgca gttacgtgca 107461 ctaacccact cagaaacagc catacttcat gccatcacac tgcctcgcct gtgctattac 107521 caatggaggt gctgagcatc atgaggcact tggtggcatt ttaaaacttg aacagtatct 107581 gcatatcatt tgcagccact gcaagttcat aaaaggtggg cccactgggg tttgggttaa 107641 tagataattg ccttcaggag gtaagtttat gtatctcaga gaggcagtgt aaaacagtta 107701 ccatgagagt agacctggag ttgttccttg aatagaattc agaacaagcg gctgtcccat 107761 taaattgggc aatttatcta acctctgtcc ctcaggatcc tcaattgcaa aacagtcatg 107821 ataacaaaac ataataactg tattgctatg aagattatat atgcattaat agatataaaa 107881 gtgcttagaa gagtggtatg taataagcac tcaataaata tctttataaa gagattagca 107941 agatgttaag tgggtaaagc aagttgcaga ataacatgta cactcattta aaatatttag 108001 aaaacattaa cacaaaatac attttctagt ggtaaacata tgcatatagt tgcatagaaa 108061 aactggaaag aggccgggta cagtggctca tgccatgcct gtaatctcag cactttggga 108121 ggccgaggtg ggcagatcat gaggtcagga gttcaacaac atggtgaaac cctgtctcta 108181 ctaaaaatac aaaaattagc caggtgtgat ggcgtgcacc tgtaatccca gctactcagg 108241 caggtgaatc acttgaactc aggaggcagg ttgcagtgag ccaagatcgg gccactgcac 108301 tccagcatgg gtgacagagc aagactccgt ctcaaaacaa caacaacaac aacaacagca 108361 acaacaacaa caaactggaa agatatactt cctactgata tagtgcttac ttacaaagaa 108421 ggtattagat taggagatgt tggtcaaagg gtactttaat cttatttgca atgtttaaaa 108481 atttggtatt atattttgta ttatgtaatt gtttacattt atgtattact tgcgtagtta 108541 aaatgaataa aaatagagaa aaacatatat ttagtatatt ttctcttaaa ggcaccttaa 108601 ttctatttta atttaattag atctggtgtc ctaggactca gtatctccac atgttgatat 108661 aagcccaggg aactttgtca aaatctgcat gactatcagt cttccccatg agagaaggct 108721 gctctttcat tacatctgtg gtgattgtct gcctggcttg ctaccttact aactgccaga 108781 taatttggcc taaggtctat ggcttagctt tcagttatta cttcatttta gtattgtatg 108841 gggaagatgt actgaattag acacggttat ggaggaaggg 
tatctttcaa cagttccaga 108901 acaagcagaa gattattgct attttctaat cttctaacta ggtctgaact tttgttgtat 108961 gagctataca tttaatttgc taatcattga aagaatatta agaaatccct ttttcagttt 109021 ttttgtcaat acatgcaatc tcaaaactat ctgtatagcc ttcaaaatgt aagttggtta 109081 tttgttagtg tagtgcttct ctggtgcaat ctgatatcta atataccctt cttcagttta 109141 tgatcagcct acactcctgt tctcagtgga cagagaggct taaattctgt tatccatatc 109201 tcagaaacaa ctcttcagta ataccgctag actacctgga aaggcagtga ctaccagtca 109261 gaaaaaacta taggaaatgt cctaactgct atcaaatatc atttgaaaca cttgtaagag 109321 ttgcttgatt ctagactatg aaaggatatg gaatttagat ttagaccaca gggaaaggca 109381 tttaaatatc tcagggttct ttcagtgtca gatgtcttta gattctgatg gaaaagagat 109441 ctgcataacc taaaagccag ggaaatacaa aaatgtaatc aaatgacaaa ctagttgggg 109501 actgcaggca tctatgagta gaactatgac atcatgtgga taagctgggt catgggagct 109561 aggatatgtg aatgtctttt tcttgacatt gatattgtag tgtctttgac atggcaacaa 109621 taagcaagat tctgtgtttt cactcagggt tgcctacaag aatctaatca gtttctcaat 109681 caaaatctat ggatattagg taaccccaag catgctaagt agacaccctc ccattacata 109741 ttgtgataca atagcatcct tgattagtac aggaagaatt cagattgtag gcctcttaca 109801 gaaagtaaac tcagttacct gagatgatag ttatttccct aaaaaggaaa gcatctttca 109861 atcatgggaa gattttgtta tgtaaagcat tagaaaatgt aagaaaactg aaggttcaca 109921 atccttggaa gtgtaatata atagatgtat aagaattttt taagtacttg caaagactgt 109981 cagtctccta gtaatagtac ttaatagtac tttaatagaa aatggaatat atcaaatttg 110041 taaatgtatc gttaaatttt tataggtgtt tcattagtta agcttggaaa tagtttgtaa 110101 taaagtttgt aaataaataa taaaaatccc tcaacagtga ctttcaacaa attaaaagat 110161 ttaaagaatt aggtttttga taatgtcact tgagtaaatc aaacattaac ttaaaaccaa 110221 aaattaatac ctgaatgacc ttttcaatgt tcaaaaaaaa acccttagtg tcatcaaaaa 110281 ctcacaagca acagtgtttt ttggtatacc tatgttctaa tgtttcttta gtttccttgt 110341 aagtggattt attccttata ggtcttagac atgattactg tatagtatta aaacagaatt 110401 atttcatgtc tacttaaagg aactcagtca taacctgatg gacctgatac tctaattgct 110461 gaaataatat ggttggcaca attttcagaa ccatacacag agcagaattt tctggtacta 110521 
aataactgtt catctgttca gttttaactt tacagatgct tcaacctctg tgactggtca 110581 tgtttccctc ttggctatta gatatcttac gtgaacactg tgacccacct tgaaaacaat 110641 gtaggagccc agggcaacac ttcatataca aaccacactg ggtagcatct tctccttcat 110701 acttttagat ccctctaccc tgatatccta gcctagtcaa ttttatctta gcatagctat 110761 atggttatta tttaatctat aattagaatt tcatggtgaa ccaggtgaac cagaccataa 110821 tatgtatata tcatggtgaa tcagaccata tatcaatata agaaataatt tggaatacag 110881 tagtcttctg aagtttcctt gaaaagaaat tactcacatt tgggttaaaa tgatctttac 110941 ttataaaaga tcataatgta tcttgttcta ttaaaagcaa gtaaaattca aaagccaaaa 111001 caaagtagcc cacttttcct aaaatgaaca gaaaattact aagttaaagc tcattctgac 111061 aaagtaacat tgtgtttcct cttgaagcaa cttttttttt tttttttttg agtcagagtc 111121 ttgctctgtc ccaggctgga gtgtaggggc actatcttgg ttcactgcaa cctccgcctc 111181 ccgggttcag gtgattctcc catctcagcc tcccgagtag ctgggattac aggtgcccac 111241 caccaggccc agctaatttt tgtattttta gtagagacag ggtttcccca tgttggccag 111301 gctggtcttg aactcctgac ctcaggtgat ccacccgcct tggcctccca aagtgctggg 111361 aagacaggca tgagccaccg tgcctggcct agatatctcc cagtactatt ttataactca 111421 gtatgactga agggtctggt ggattctacg aacttatgaa gatgcagtga gaaatatgaa 111481 tgaagcagag ctgtcaattt gcaactgaat acctccttta gaatgtctca ctgatttggc 111541 aacaaaagtt ttaatagttt acaagtagtc agattcaaaa cttgcacaaa actagacagg 111601 caatttaggt gctttgcttt ttaagtaaga ggacagattt tggccttaat gatacaccag 111661 aaaataatca ataaatattg atttctcatt cacttggctt gttcattaaa atttcccata 111721 gtgatttacc aggaatgttc accctttaat tattagtgca attcaatatt tcatggtcaa 111781 aattatttat tggggattaa acaaaattga aaataagttt tcaggaaaaa tgaaaattgt 111841 gaaagtttac attatatctg tatgcaaaaa atatatatat atttttaaaa aggctgaatc 111901 tactctagaa ttccctaggc tccaatttag aactaaacag tgtgacagca tgaaataatt 111961 acctcattgc tctgtatcta atgctcttgt atgagactct cctgataatg cagtgcaagc 112021 attcactttt cttttaatga ctttccatct aataaattct ggctttaatc atgcctttaa 112081 ccttatttta aaattctagg ttcctatgct aaaagagaat gcttaatgta ctctcaaact 112141 ctatcaggaa ataaatcaga 
tatggtggta ggggtctggc ctggcctcaa acccacaggt 112201 atctgaaaag agctgctgat tttgagggct caagagaaag gcaaaaagga caaaggaaat 112261 gatttcttgg aggagtccaa tgaccttagg agcaccctta tggatataaa tgtctggctg 112321 atgctatcct tttttatttt cttactaaac cttaaactaa tatatatttt ttagcttaaa 112381 aactcttttg tggttttctt attcttacct tttaattatt ttaaagtttc tagtctaaaa 112441 catttatata aaaatgtata ccatatatgt aactcatatg aagaaagaaa aaaaaaaaga 112501 caaccatgta cctccatcaa gcctaaacta taaaattatt acttttgaag ccctctctgt 112561 tctctatttc tagttgtaac ccctgtattt tcttctagtt attccataaa tttatcccta 112621 aagaatatat tgattagtct ttcttgcttt taaacttcag atgcgtggta gtgaattgta 112681 tataatattc tgcaaattgt tttttagtta gctggtttgt ttaacattat gttgaaaggt 112741 gtaactaaaa ttaattaact ttttactgca gctatacaaa caatgctaca tggaatgttt 112801 ttgaatatgc cttttggtat acactgacaa gagtttctgg aggatatata tacacatcga 112861 agtaaatttg tcgggccata ggattttcac atcttctatt ttaatggata atgccaaatt 112921 attttaatga aacagtttta tgaattttcc tgtttttcca tgtgctggct atcacatttt 112981 actgtgacac tttatctttg ccaattgttt gatttttgcc agtgaagttg aatgtctatg 113041 tttactaatg tttaggttcc tgcttttgtg aacttcttat gcgttttgcc catttttcta 113101 ctatgttggt tgtgtttttc ttatgattta tgtgagttct taatttattg tggacaacaa 113161 tatttttttc ttagttatac gtgctacaaa taccctcttc aactttcggt ttctctttcc 113221 atttacttta taatgtcttt taatgatcag attttcttaa tattactgtg gtcaaattca 113281 tccaaccttt ttacgatttg gaatttttgt gaacaggcaa cctacagaat ggaagaaaat 113341 ttttgcaatc tactcatctg acaaagggct aatatccaga atctacaatg aactcaaaca 113401 aatttacaag aaaaaaacaa acaacctcat caaaaagtgg gcgaaggata tgaacagaca 113461 cttctcaaaa gaagacattt atgcagccaa aagacacatg aaaaaatgct catcatcact 113521 ggccatcaga gaaatgcaaa tcaaaaccac aatgaggtac catctcacac cagttagaat 113581 ggtgatcatt aaaaagtcag gaaacaacag gtgctggaga ggatgtggag aaataggaac 113641 acttttacac tgttggtggg actgtaaact agttcaacca ttgtggaagt cagtgtggcg 113701 attcctcagg gatctagaac tagaaatacc atttgaccca gccatcccat tactgggtat 113761 atacccaaag gattataaat catgctgcta ttaagacaca 
tgcacacata tgtttattgc 113821 agcactattc acaatagcaa agacttggaa ccaacccaaa tgtccaacaa tgatagagtg 113881 gattaagaaa atgtggcaca tatacaccag ggaatactat gcagccataa aaaatgatga 113941 gttcatgtcc tttgcaggga catggatgaa gctggaaacc atcattctca gcaaactatg 114001 gcaaggacaa aaaaccaaac accgcatgtt ctcactcata ggtgagaatt gaacagtgag 114061 aacacatgga cacaggaagg ggaacatcac acataggggc ctgttgtggg gtggggggag 114121 cagggaggga tagcattagg agatatacct aatgttaaat gacgagttaa tgggtgcagc 114181 acaccaacat ggcacatgta tacatatgta acaaacctgc acattgtgca catgtaccct 114241 aaaacttaaa gcataataat aaaaaaaaaa agaaatcata tgaaatccca cattttcttc 114301 taagagtgtt aagtatcgtc tttcaaattt gagactgtaa tccatttgga acttatttca 114361 tgaataatgt gaggtagaat ccaattcatt ttttccatat ggatggtcaa ttttcctagc 114421 accatttatt agtccatcct ttacccacag aactgcagtg tctgctctgt attattctca 114481 aactgacata aatgtgtcag tctgcttctg gactctgtat tctgcatatt agaccacttg 114541 acaattgctg ggactatatc acgtggtctt aatcactttt acttcataat gaatctcagt 114601 atctagaaga acaagttcct caccttgttc ttcctcagga gaacctggcc ctattctggg 114661 ccaagcaatt caccttggcc ctattctggg ccaagcaatt catccatata tattttagaa 114721 acaacttacc aaattctaaa gaagttaaaa gccttgtttg aattctaatt aaaactgcat 114781 gggatatata gattaatttt gcataaaaag catttttata atattgaggc ttcctttctc 114841 tttcaattta tgtaagtctt ctttcatgtc tttccatggg tcttgagatc ttttgttaaa 114901 ttcatcactg actaccttat acatagtttt tgctattttt atggcatatt ttagaaaatt 114961 acattttgtt tgttggaaat gattgatgtc tgatgcttct taaatccatc taccctgtta 115021 attcttataa catctaataa tcagaagact cgagtttttt aatgtattca attttattat 115081 gagtgaataa tgaaaatttt gattattttc actccttaaa tttttgattt cttttgctta 115141 tctggtagta cttactagga cctctaggac aacactgaat agaagcagta gtaatacact 115201 tagtgatgaa atattaaaag cattcttaat tttaaaggaa attcttggag tataacattt 115261 ttccacttca tataatgtgt gacagagact tttgttagcc tcacaatatc aagttatgga 115321 agttcctcca tatttttagt ttgataaaac ttttaaactt gaaaaggtat taaacttcat 115381 taaatgtttg tcttcattca ttgaaataat tatatgtatt tttcctctag ttaatgtgtt 115441 
gattttatat gagtaggttt tctaatgtta aaacaatttt gtgttgttta catagactca 115501 agttagtcat atatatttta attacatttc tggattcagg ttgctaatat taacaggata 115561 tttaatagct ttcaatgtca cttttttttt ttttgagaca ggatcttgct ctatcactca 115621 ggctggagta tatttttttc ttgctcagtt ctcattagag ctaaggaaca gtaagtaact 115681 aagggaacaa agtataaaat gccttatata accctccttg ttcacacttt tgggaagata 115741 acatgcagta ttacctaact ggtggtccac tcactgagta ggtaggtgtg aaaatgaaat 115801 gaagcagcac ttggtagcaa tattcaaatt tcacagaaat ccttgaatca catttacaaa 115861 cctacaagtg actaagttcc tgttatgctt caaaactgtg aatgggtcac aatgcttaca 115921 agattatgta caaacccttc tgcagtcaaa gcccacttct actccaagaa cacactatgt 115981 catttcaaat ggctctacct tttcaacata aagaccatga atcagaaaaa aaattatgct 116041 aatgttattc aattttctct tctttgaatt ccctctcatc ttttgagatg cttacatagt 116101 gccttaaatt ctatttattt gtctcctacc aaacttttcc tctttcattc ttgatcccac 116161 aaatcatata ccttgcagta taatcggcag gacctagtac tttctttctg cctagtagcc 116221 actttctctt acaagtcagt gtaccaccta aggcctgcct ggaagcaggt tttcggtgtt 116281 caaaataatt ttttaaatga tatcacactt actcctgatc tctcacggta gaatagtcag 116341 aaatatgcct tcataggtaa cttccaccta ctggccctag tctttctttc agtatctcta 116401 tgggatatgc acaatccttt agcaaaccag ccttcaaaat atttgaagca gcaaatggta 116461 tcaacataag gatcctggaa tatctagaag ctcagaatgt attgcttctg tgacttttct 116521 aggaaatcca ataaaactaa tattacaagc tt // LOCUS AC004130 116215 bp DNA PRI 06-FEB-1998 DEFINITION Homo sapiens BAC clone RG293F17 from 7p15-p21, complete sequence. ACCESSION AC004130 NID g2842785 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 116215) AUTHORS Tin-Wollam,A., Sutterer,C. and Fronick,B. TITLE The sequence of Homo sapiens BAC clone RG293F17 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 116215) AUTHORS Waterston,R. 
TITLE Direct Submission JOURNAL Submitted (06-FEB-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: WUGSC Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: The sequence of this clone was established as part of a mapping and sequencing collaboration between the NHGRI Chromosome 7 Mapping Project (Eric D. Green, Director), John D. McPherson in the Department of Genetics (Washington University), and the Washington University Genome Sequencing Center. For additional information about the map position of this sequence, see http://www.nhgri.nih.gov/DIR/GTB/CHR7 or send mailto:egreen@nhgri.nih.gov SOURCE INFORMATION: Clone RG293F17 is from a release of the human BAC library CITB-HS-A. The library contains cloned DNA from human sperm. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); U-J. Kim et al., Genomics 34:213-8 (1996). The clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the right is RG326G04, 200 bp overlap. Actual start of this clone is at base position 1 of RG293F17; actual end is at 116215 of RG293F17. This clone contains STS sWSS2601 (NID:g1222836). 
This clone RG293F17 contains a bacterial transposon from base position 4508-5845, which has been cut from the submitted sequence. FEATURES Location/Qualifiers source 1..116215 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="RG293F17" /clone_lib="CITB-HS-A" /map="7p15-p21" repeat_region 1078..1302 /rpt_family="MIR" repeat_region 1939..2240 /rpt_family="ALU" repeat_region 4266..4475 /rpt_family="L1" repeat_region 6166..6460 /rpt_family="ALU" repeat_region 8190..9034 /rpt_family="L1" repeat_region 9046..9128 /rpt_family="MIR" repeat_region 9629..9780 /rpt_family="RETROVIRAL" repeat_region 10256..10552 /rpt_family="ALU" repeat_region 10693..10742 /rpt_family="MIR" repeat_region 11122..11270 /rpt_family="MER1_TYPE" repeat_region 11568..11677 /rpt_family="ALU" misc_feature 14096..15596 /note="CpG_island (%GC=67.1, o/e=0.70, #CpGs=138)" gene 15522..93712 /gene="ITB8" CDS join(15522..15648,47352..47437,50727..50901,62766..63012, 64381..64546,65442..65600,75118..75213,78611..78700, 82575..82709,85436..85841,88343..88568,89777..89886, 93330..93493,93590..93712) /gene="ITB8" /note="match to P26012 (PID:g124975); H_RG293F17.1" /codon_start=1 /product="INTEGRIN BETA-8 SUBUNIT PRECURSOR" /db_xref="PID:g2842786" /translation="MCGSALAFFTAAFVCLQNDRRGPASFLWAAWVFSLVLGLGQGED NRCASSNAASCARCLALGPECGWCVQEDFISGGSRSERCDIVSNLISKGCSVDSIEYP SVHVIIPTENEINTQVTPGEVSIQLRPGAEANFMLKVHPLKKYPVDLYYLVDVSASMH NNIEKLNSVGNDLSRKMAFFSRDFRLGFGSYVDKTVSPYISIHPERIHNQCSDYNLDC MPPHGYIHVLSLTENITEFEKAVHRQKISGNIDTPEGGFDAMLQAAVCESHIGWRKEA KRLLLVMTDQTSHLALDSKLAGIVVPNDGNCHLKNNVYVKSTTMEHPSLGQLSEKLID NNINVIFAVQGKQFHWYKDLLPLLPGTIAGEIESKAANLNNLVVEAYQKLISEVKVQV ENQVQGIYFNITAICPDGSRKPGMEGCRNVTSNDEVLFNVTVTMKKCDVTGGKNYAII KPIGFNETAKIHIHRNCSCQCEDNRGPKGKCVDETFLDSKCFQCDENKCHFDEDQFSS ESCKSHKDQPVCSGRGVCVCGKCSCHKIKLGKVYGKYCEKDDFSCPYHHGNLCAGHGE CEAGRCQCFSGWEGDRCQCPSAAAQHCVNSKGQVCSGRGTCVCGRCECTDPRSIGRFC EHCPTCYTACKENWNCMQCLHPHNLSQAILDQCKTSCALMEQQHYVDQTSECFSSPSY 
LRIFFIIFIVTFLIGLLKVLIIRQVILQWNSNKIKSSSDYRVSASKKDKLILQSVCTR AVTYRREKPEEIKMDISKLNAHETFRCNF" repeat_region 16367..16411 /rpt_family="MIR" repeat_region 19318..19695 /rpt_family="L2" repeat_region 19719..20019 /rpt_family="ALU" repeat_region 20171..20303 /rpt_family="MIR" repeat_region 21192..21371 /rpt_family="MIR" repeat_region 22060..22362 /rpt_family="ALU" repeat_region 22627..22923 /rpt_family="ALU" repeat_region 23461..23764 /rpt_family="ALU" repeat_region 25495..25561 /rpt_family="MARINER" repeat_region 28545..28641 /rpt_family="L2" repeat_region 28642..28695 /rpt_family="RETROVIRAL" repeat_region 28981..29310 /rpt_family="L2" repeat_region 29362..29706 /rpt_family="L2" repeat_region 30602..30960 /rpt_family="MALR" repeat_region 33045..33124 /rpt_family="MARINER" repeat_region 33574..33727 /rpt_family="L1" repeat_region 33861..34162 /rpt_family="ALU" repeat_region 35013..35240 /rpt_family="MIR" repeat_region 37367..37666 /rpt_family="ALU" repeat_region 38876..39009 /rpt_family="RETROVIRAL" repeat_region 41291..41433 /rpt_family="MIR" repeat_region 41679..41977 /rpt_family="ALU" repeat_region 42244..42417 /rpt_family="ALU" repeat_region 42430..42876 /rpt_family="L1" repeat_region 44091..44913 /rpt_family="L1" repeat_region 45045..45375 /rpt_family="MALR" repeat_region 49090..49284 /rpt_family="MIR" repeat_region 50198..50495 /rpt_family="ALU" repeat_region 51157..51274 /rpt_family="L2" repeat_region 52969..53361 /rpt_family="MALR" repeat_region 53486..54129 /rpt_family="L1" repeat_region 54325..54628 /rpt_family="ALU" repeat_region 54702..54921 /rpt_family="L1" repeat_region 56264..56393 /rpt_family="L2" misc_feature complement(62953..63034) /gene="ITB8" /note="match to EST AA644567 (NID:g2569785) ab63h07.s1" repeat_region 64883..65176 /rpt_family="ALU" misc_feature complement(65432..65781) /gene="ITB8" /note="match to EST AA644567 (NID:g2569785) ab63h07.s1" repeat_region 68145..68326 /rpt_family="MIR" repeat_region 69959..70272 /rpt_family="ALU" repeat_region 
71081..71262 /rpt_family="MIR" repeat_region 71269..71485 /rpt_family="MER1_TYPE" repeat_region 72363..73091 /rpt_family="L1" repeat_region 76240..76507 /rpt_family="L1" repeat_region 76645..76810 /rpt_family="MIR" repeat_region 77190..77564 /rpt_family="L2" repeat_region 78837..78911 /rpt_family="MIR" repeat_region 79399..79567 /rpt_family="MIR" repeat_region 79872..80162 /rpt_family="ALU" repeat_region 80938..81240 /rpt_family="ALU" repeat_region 81820..81869 /rpt_family="L2" repeat_region 84688..84993 /rpt_family="ALU" repeat_region 88817..89012 /rpt_family="MIR" repeat_region 89043..89332 /rpt_family="ALU" repeat_region 89984..90280 /rpt_family="ALU" repeat_region 90379..90477 /rpt_family="MIR" repeat_region 90993..91064 /rpt_family="L1" repeat_region 94867..95168 /rpt_family="ALU" misc_feature 98971..99377 /note="match to EST H00880 (NID:g863813) yj27b12.r1" misc_feature complement(99176..99474) /note="match to EST R24249 (NID:g779137) yh30b08.s1" misc_feature complement(99211..99443) /note="match to EST H00881 (NID:g863814) yj27b12.s1" repeat_region 99930..100412 /rpt_family="MER9" repeat_region 100510..100848 /rpt_family="MALR" repeat_region 101361..101514 /rpt_family="MIR" repeat_region 102988..103279 /rpt_family="ALU" repeat_region 104046..104167 /rpt_family="MER1_TYPE" repeat_region 104205..104268 /rpt_family="MER1_TYPE" repeat_region 104365..104588 /rpt_family="ALU" repeat_region 104589..104657 /rpt_family="(TA)N" repeat_region 106532..106833 /rpt_family="ALU" repeat_region 106874..107195 /rpt_family="MER2_TYPE" repeat_region 107819..107869 /rpt_family="(CA)N" repeat_region 107941..108082 /rpt_family="L1" repeat_region 108092..108538 /rpt_family="RETROVIRAL" repeat_region 108747..109044 /rpt_family="ALU" repeat_region 109070..109716 /rpt_family="L1" repeat_region 109753..110126 /rpt_family="MALR" repeat_region 110127..110225 /rpt_family="L1" repeat_region 110227..110741 /rpt_family="MALR" repeat_region 110746..111478 /rpt_family="L1" repeat_region 
111485..112770 /rpt_family="L1" repeat_region 112758..112897 /rpt_family="L1" repeat_region 113032..113356 /rpt_family="L1" BASE COUNT 35883 a 21756 c 21914 g 36662 t ORIGIN 1 aagcttctta gtttatttta atgagtagct aagcatgaga actgaggcat tggccacttc 61 tcacctggag actgtgactc ctaacccgtt tcaggttagg tcacaaaggg gagattgggg 121 aaatctagaa aacaaaattg aaaaatccca gtgggtaatt ctgttggctc cattccccta 181 ctgaaaacct ctgccttaca tagtcctctg ctcttagtct gccatattct ggcaagaaaa 241 agacctgagc tggactcttc cttcattcta gcacacttat ttgggatgga agcaaacctt 301 aagaattaat tgtttcttgt caagccacta agtagtacca ttttttagta aatgctaaac 361 aattctgtca taataagctc agtctgccgt gcacctcttc accaataaaa taagctctgt 421 ttactagatt ttccattaat tacctaagat ggatataaaa aacattactc aatattctta 481 gaattttact actggagtgt caagctaaag caaataattt atatatttac atagttcaag 541 atgatgacaa tagtttactg atagtcggtt attctgctga ggtttatctc cctctctgca 601 taggacttaa cctcgctccc caaccaaaaa aaaaaaacgg cagttaggca actgtcctgc 661 catctgcctc agatgctaag cagataaaag tcggctcatt aaaaacaaaa tgacttgggt 721 ggagaggaaa aaatgtgaaa tagcctgtta ttataataat tacctcagta aaatgtgtta 781 ttttatcatg acaatgagat tttctgaaca tgttattaca tttgtggtaa aaagaaaaaa 841 atcaaaacta tgtttttgtt ttcaaaatcc agtcatcgaa ggaaaacacc aatttaataa 901 atttctcaat aattatcaaa ttgtaacttt ttaactcttt cccattaaag ccaaagccgg 961 aaagaattct taatgtcata ttaatgctca ttatctacaa cagagaaagc tgaatatcta 1021 taatagtttt gagtgctaag tatacagagt aagtgcccat gtgtgctgga aagcagtcat 1081 aattgtgagc ataggttgtt gaatcgcatt gcctggcttt aaatcctaag cccaccattt 1141 acaatgtgat cttgtgcata taattttacc cttgtaactc agtttcctcc tctaaagaac 1201 agtgataata atagtactgg gatcataatt acctttgagg actaaaagat acaatgaata 1261 gaaaatgctt tgcctaggcc ttggcacata agatgtactt aaagcatgag catattttca 1321 ctctcaaatc tataatgaca gctcatactt ccctttttaa gtttcagaac tatatttcca 1381 attgccttct ggatgtctcc atctgaattt cctcaggcca cttctaaatt atcttttcca 1441 gtatataata gtttttctcc aatactcctt aacttaccac taaaactatt ttaccaagac 1501 agagtaaggt gtttggaatg gtgttttgag ctctggtatc cttccaaagg atggctatat 
1561 ggtaattatg cttccagttg ttatttatct cccttttcta gagatacatt gtttgctgtc 1621 aaacagaata tctaaaataa ttttaactga ccccttaagc gcttcaaaat attgttatct 1681 tataatcttt ctaatattct aaacagcact tttaatagtt tcttaaaggt actaagccta 1741 atgtcttact gaacactttc tataaacaac agaacttgat tttgcccagt tgtaaataca 1801 tcatttcttg agcacctctc tcatactgcc ctcagtgtcc atgatgaaaa gtcatattct 1861 tacttagttt tgagaccaaa aaccaactgg gcaagatcta ggactgtcgg acttttcttg 1921 ataaaagttg aggcaatagg cccagcgcgg tggctcatgc ctgtaatccc atcactttgg 1981 gaggccgagg caggtggatc acctgaggtc aggaattcta gaccagcctg gccaacatgg 2041 tgaaacccca tctctgctaa aagtacaaaa ttagccgggt gtggtggtgc gtgcctgtat 2101 tcccagctac tcaggaggct gaggcaggag aatcgcttga acccgggagg cagaggttgc 2161 agtgagccaa gattgtgcca ttgcattcca gcctgggcaa caagagcaaa actccatctc 2221 aaaaaaaaaa aaaaaagaaa aaaagaaaaa aaaaagttga ggcaataatg agcatcagga 2281 agatattgct aggctcggcc caagtgggta agttaaaaac aggtgtcata acaggattca 2341 gtgaaattta ttaactagaa gaaatgcaag tttttctctc aagagaacca gtaattcaat 2401 aaataacacc tgaaaatcag aaaatgtcag tctcatagag ctacagcaga gtatatttgg 2461 aagattcaaa atcaattgaa catcaagaaa aagagagttc tcaaaagata ggaaaatttc 2521 agttttataa aagccaatct aatgtaacag gagagaagct attataactt gaaacttctt 2581 cctgtaggct gaataaaata cagagttaag aggtacgatg gaaactgata agagctagaa 2641 cacagatcag ataacaggca atcactattt ggctaaagga aatacagaga ggttgttaga 2701 aattttcgtt atgccacatt tcattgacaa tagtataatt cctgctgaat cgatgctaaa 2761 tgttgagttg atctaaggaa ataaagaatc agaggctggg cactgctccc tttgtccgtg 2821 agccactagt gcagggatca cagacttaaa tgactgtagg acttgggcgt gtaacataca 2881 ccagtgaact cagcctgggt acatgaaaag cagtaacatg atcacaaaaa aaatggttac 2941 tgactcttga gaagacaaca gagagtggtg agggttgtag caaaccggag agctcggacc 3001 ccttcttaag tgggtggaag ataaacagct ccagctcatt gccttgaaga aatgcaggcc 3061 cagttgtgct caatatcctg attaagagat gctgaaacta ggactttatg aaacatcctg 3121 atatttaagt tttacacaaa atttatcaat ttttaaaccg ctctgtggaa tgatgctatg 3181 agagccaaat aaagatctac tgggcacatt tggtccgtgg actgtcagtt tgctagtctc 3241 
cacaacaggg ttttagaact acgcaatggc ctttgtttct ctttcttcac tttcattctt 3301 tgtatttgat tctgggactt gcgttttcta gtcaatcaag aagatccaaa gagtccaggt 3361 gaaaaattag gagtatattt gggagcaaac agaccttagt tagcaaaaaa tacaaacact 3421 taatgatgaa ttgaatatgt aaaatttaga ctagcaattt ttgtatttct tttgctaacc 3481 ttgccaagat agattatggg taccttaata aagctaattt cttttccctt ggccatctga 3541 tatcctttaa acctaaatct gttggatagt ttccatttgc ccatccagtt tccttctcta 3601 cccactctgt gcccccagga agctggctgg tatagaatgc gttaaaggac tactttacct 3661 ctagcttcca ttgagtttgg cccatctggg agcactggca ggagtttgga gggtggagag 3721 aaatgcaagt cagggcattt atttgcccca ctttccccct gctggtccat tgatttaatc 3781 ccttcttcct atttggagtt cctccttaca cagcccccca tcccccatgt gtgagtgtat 3841 gtgtgtgtcc cttgaaatat tgtcttttac tgttcctcca agccagggat gttaaggttc 3901 tccactatta caagtcccag aggaatatac caccggttgt tccttttctt aaaaccttcc 3961 ctacagtcct caactactca gtttgtctgc tttctgtttc ctgctggaag gctaacttaa 4021 aaagccattt caaagtacac aatccatccc cagcattggg atcttaactg gacaaagcgt 4081 ctcatcaagt cctttaattc ttatttcttt gggtcttttg gctggcccta tcttcttcta 4141 actccagact agcaactgtc ctcccaacca cacccaccct ttctttctta ctcgttgtca 4201 gaaagctttt cctgagggtt acattcccca gggtttaaaa tataattaaa aagtgctttc 4261 gtggttttct ggttggagga tgaatttgta caatctcttg gaaaagcaaa gctatttggc 4321 agtaagtgtt aggagccttc agaatgtatc taacctttga cccagtaatt ttaccttggg 4381 aacttttcct aagaaaagaa tcctaaatat taaaaaattt ggataaagat gcatgcaaat 4441 atgtttatta ataaaatggc caaaaactat aaacagaaca cactccaaca ttgaggtatg 4501 tcaagtgtct tctgtttgcc ccaccagctt ctctctccaa ttgccctccg ctataagaga 4561 ccagccttcc taagactctg actgtggctc cactaccttc tggctttggt atgaacttta 4621 ctaatggtac gtcttgataa aattgaagat gagaagaaga atgaagtagg agtagttatt 4681 ttccctgtcc cctttccgca gatagcctca ggctggccac cattcttgaa caaaggtaaa 4741 acggtaaaac agcctttcac gtggccctct tcacataact caatctccct aaggttcagt 4801 aatatatgct tctctcgtct tttgaagcct aaggatagtg atgcattctg ttctgcgccc 4861 tgcccccatg cacactgttt ctaactcagg gatactgtgc tatcactaac accctatccc 4921 acggttatca 
gaaaattctc ctcaaattgc ccaatttgag tacactaaat atatttcact 4981 gccaagaatc tggtgcaatc acacagcata tttaagtata ctaggcgcca cagccactat 5041 ggaataatat gtaacataga aagataatca caaaaagtac aacaaaaaaa gataaatcta 5101 gtatgaacac aggtacataa aaataaagca aataagattc atcacgaaaa tgtaatgttt 5161 tctataataa aaacatatta cttttttaat gagaaaatgt ttaatgactt aaaaggaaag 5221 acattgtaaa tcatgcccag aggaatttga aaagaccttg cagaagtggc aatagaaatg 5281 atgcttgaca atcaagtagg acaaatgagg aaaaaaaaaa agtggaggaa aaaacatgtc 5341 tggccaagaa tgaagcctat gaaaaggtaa gtgaaagtgc ttatagaaaa ccaggattct 5401 cttctgaaga gcctagggta aacacagttg agaaagggca gaggatttca tagaaaaggg 5461 tactgggaat gtgatttggg gccagaatgt ccaaggtttc atagctgatg ctaaaggtct 5521 cttcatggcc taggagaaag gttggagagt ggcattaaac gaaccataga agggtttaag 5581 aagaggacaa aaagcttaga tttctgtctc caaaggaaaa ctctcctact actgtggaga 5641 aaggagggct aagagttccc gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agcctgaact 5701 atgaagtttt actaaggagt aaagagggta aagggaatag aaaagatttc agatgcttca 5761 ctaagacacg attaatataa tttgtaaact aatgaaagac aggggttgtc tagcatacat 5821 ctgtaaaagg gaacaatgat acctgtggta cacttaaaat cttgatagac ttccctgcta 5881 ctttaggttg aaaagggttc gaatggccaa aaatattgtt ttcattttat gttgcaattg 5941 tatatcatgc tgaaaagcag atctttataa ataaatttct tatatttctg aaccttcaga 6001 ggactaaatt cccagatatg ggaattgtat ttcaaatatt ttaaagattt tattacataa 6061 tgccaactga ttacaaaaaa agttagggcc aatttacaat ttcactataa atgagttcct 6121 attttaacct agctaccact aaatactgtc atttttttaa gtcttttttt tttttttttt 6181 tttttgagac ggagtctctc actcttgccc aggctggagt gcagtggcgc catctcggct 6241 cgctgcaagc tccgcctccc gggttcactc cattctccta cctcagcctc ctgagtagct 6301 gggactagag gcgcccgcca ccacgcccgg ctactttttt gtatttttag tagagacggg 6361 gtttcatcgt gttagccagg atggtctcaa tctcctgacc tcgtgatctg cccacctcgg 6421 cctcccaaag tgctgggatt acaggcgtga gccaccgtgc tttttaaagt ctttgagtat 6481 ttgatgggta ccaatagtat ctcatttctt aaattagtat ttttttggta actagtaaag 6541 ctgggctttt tccatttctt taagcatctg tattccctat cctctttaca catttatcta 6601 ttgtaatgtt ttcttatacg 
tttgtatgag catattctat ggtatgtgtt ttcacccatt 6661 tctgcaatag ttataaatac ctaaggttta aaaaaagtat catcaaatgt gtccatttgt 6721 tttctttaaa ttggataaat tatttttaca tgtagaaaga actttcattg ccaacgccag 6781 aaaatactga cctatattgt gtcctagttt tacactaagt tctttagttt atcatgaata 6841 tattttgata taaaggaagt ttctagatta agtttttgtc caagtagcta ataattgtcc 6901 caacaccatg tattatttat cccgtcttca ttgatttttg aagcatgact tacgttatat 6961 taaattgcta ttaatattgg gttctatttc tgagctatct tacttttaac aatttaccaa 7021 atgagtcatc ttttttatta ttattacata ttgtttccaa atcagattat agtatctttc 7081 aaaaaaggga ttgggttcta ttgatgcttg tgttttccat agggcccatc actttacaca 7141 tagtaaatgt tcaataaata tttacatatt tattaaatga atgtttaaat tgaatatcct 7201 gtgcctaaaa tgcaaagaat ttcctcacta tgttaaaagg tcacattctt caaggtgaaa 7261 catgaagtaa ttttgaaatt ggagaagggg gctgagaatg gcagtattcc aagtataaaa 7321 attccatgtg aatttgatta gaataattta aattttgtag tgacttcact ctccagaggc 7381 tatgcgatca tgagaaaatg ctgaaaggga gaattttagt tgtgagctcc ctctgagtta 7441 ttgaagttca gaaaggcctt tagctgtgta atgatgtgca aatcagtcca cattccacgg 7501 tggctcaaca gatgaactat gtcaatggtg aaatgttgtc tttataatgt ctgctgatac 7561 ctctatttga gagaaccagc tgttggtcct ttatactgct gtccatttag aaatctgagc 7621 aataaagcaa tacttaatac ttattgcata ttcatttacc cccctagcta catattagtg 7681 taaacatgtt aaattttaaa taaaaggcag tattaagcgt atgtgtctct ggcacataca 7741 agtaagtgaa gctttcatgc agttgtatat caaactgtac agagatgact tcagactaaa 7801 aagattgtgg aacaattttg tttgattaag gagacataaa actagattgt cataaagaat 7861 gctaggatta cagacttcta taaaaagaca tgctcactac atctattcat gtgagtaaag 7921 aatagtatag ttaccagaaa aggaaaagtt atgtattcag caaaaccaaa gatatttgat 7981 gttattttta tgacaatatg caagtcttga aataagtatt catgtatttt caaggttgtt 8041 tatattatga aattaaaata gtcctaatta ccattgcaaa aatccttggg ataatatatg 8101 atgaattcta acatgtttcg ctcatgacct cttttgattt tttacaatta ttttacaggg 8161 tagttttgtg tggtgtgtgt ttgtatttta cagactttaa tttttagagc acttttaggt 8221 tcacagcaaa attaaggggt aagtacagag ttccgatata ctccctgccg ccacatgggc 8281 tcagccacta tcaacatcac acatctgtta 
caatcgatca atcaacattg gcatatgatt 8341 atcacccaga gtccatagtt tacattaggg ttcactcagt attgtgcatt acatgggttt 8401 tgacaaatgc ataatgccat gattccacca ctgcagtacc atacagagta gtttcattgc 8461 cctgcaaatc ttctacgctc cacctattca tctctccttc tccaagtccc aggtaaacac 8521 tatcttttta ctgtctccat cagattgcct ttccaaaatg tctcagggtt ggaatcaaac 8581 agcatttaga ctttttggtt ggtgtccttc acttagtaat atgcatttaa gcttccttca 8641 agtcatccca tggcttgata attcatttat ttttagtcct gagtaatatt ctattgtctg 8701 gatgtaccac aatttattta tctactcact tactgaggac atcttggtta cttccaagtt 8761 tgggcaatta tgaataaaat tggtacaata ttcatgtgca ggctttgggg tagacatagc 8821 tttcaactcc tttgggtaaa taccaaggag cacaattgct ggatcattta gttttgtgag 8881 aaactgccaa accatcttcc aaagtggcat tcccaccagc agtgatgatg gttcggattg 8941 ccctatgtcc ttgtcagcat ttgatattac cagtgtttgg gattttggct gttctaatag 9001 gagtgtagtg atacttcatt ttaattggta attttgatag ttgttttaat atagtcatta 9061 cataaatgag ttgaggctgc agaaaattaa ataagttacc tatgattcta tagctagtgg 9121 gtagcagatt caaatttact tcataagact ctaaagccca tgtgcttttt tgcactgtgg 9181 ttgtcttata gccagacaag aaaagaattt aaattcatac aggcaggaca ccagcggtta 9241 gtttccagtc acatttattg cttgtcttaa tgtagttttc tccagaagag gaccctgaca 9301 ccaggaaata ttgggggtca cagggaaatg aaggaaggaa agacaaccaa cgaaggatct 9361 atttgaatgc aattatcact gtgaccaagt atagcttact cctacagggg aactttggaa 9421 aatggtgcta aatattactc aaaatgtaac gtgtggcagt accagtcctt ctactgtttg 9481 agactggccc aggacaaaga atgtataaaa attaaaagta ctaagaaact tttatagcaa 9541 tttgacattt tagtgatacc catttttcca ataattcatt tttactttat gttatttaag 9601 tatcaatcca tgatagattg aaaattaaaa actagctcaa aaatagtgta gaacacaagg 9661 cctaaagtga cttcatacaa gaaacagggg agctggggta tttatacctc acttcccatc 9721 tggttggtgc acagttaaga cagcacccag gaatgttgtt cattccctag cacttctagc 9781 acacagactt ccagttgtat gtgatgaagt ggtgagccag ggacatcatc tttctgacga 9841 tgagttaaag actaaacaag tattatttta ctgcatgcta taaaaggata acactatgca 9901 cagggtctgt gcacaggatt agactatgca cggataaaat gtttcattca aaagtaccca 9961 tattttctac caagaaagct gaagatgaga gccccttctc 
tccttttccc acacccagaa 10021 agccagtgta ctaacatatg atgaaaattc tgaaaactag atattcccaa caaggatttt 10081 gaatcttgag caagtgatcc aaagcaaagg gaggattcaa agttgttctt aagagttaga 10141 cacttcctgc tactgcttat gcctccgtgt tttctggcct tggttagtcg tagtctccca 10201 gcttcctgct gattctgtga ttctgtgggc tcccaatttt ctttcaagaa tttccttttt 10261 tttttttttt ttgagactgt gtgtcattct gttacccagg ctggagtaca gtgatgtaat 10321 ttcagctcat cgcaacctcc acctccaggg tttaagcaat tctcatgcct cagcctctca 10381 catagctggg gctacatgtg tgtgccacca tgccaggcta attttacatt tttagtagag 10441 acgggttttg ccatgttggc caggctggtc tcgaactcct gacctcaggt gattctcctg 10501 ccttggcctc ccaaagtgct gggattacag gtgtgagcca ccacacccag ccaatagttt 10561 ccttttttac ttagaacagc cataatctga ctctttactt gaaactagaa gtcatcacaa 10621 tatgagataa ttgaggagaa tcagagctct gatttctgcc tgctcatgaa ttcccttgaa 10681 aaagtgaggt tggtttcttt gccagtcttt gtctcagttt ccacatttgt aaaatggggc 10741 tagctctgta gcatgcttcg aggaaaatta atctgtgaga tgttctctca tacaggtttc 10801 ttcctttgga gaatctaatg gcatgctgct ctattgagtt tccggaaatt cctctagtaa 10861 gggatttaat ttgactttaa ttcagcattt cccaagctta atcatggaag tcttgtttca 10921 gaaaacatta atatctcaca gattgagtca tccataaact atcttggcaa gagacactgg 10981 ctccagcagt tcatttttct ttgaatttgc acctgacact gatctctgtt attaaacact 11041 catgaggcct ttaaggagtc tctgtattaa cccattaaag taggaagttc ttaaacttca 11101 gagcatataa aaaccatttg aggaccttgg tataatgcag attgctgagc accatacaag 11161 atctactaat tcagaatctc ctggaataga gacaaggaat tagaactcag aattactcta 11221 tgggggattg tgttgtatgc agtcttcagt ctacagtatg agaaacactg tcttatgtaa 11281 caattcagtg aaaaatacct tcaagcaatt tatgtacttt atcattagaa gaagctagag 11341 gtcatagtca caataatata tctcactggt ctgaccatta atcatcttct gagatgtctc 11401 aggcaagctt atagataact tgtgaatacc cacccaaaac gtactgaatg gctctgatcc 11461 caacttcaaa aagatgtaac attaaaattc aaatgtgtct gtaggtgcca ctatgagatc 11521 aataatttgg gcacatctag ttaaagtgtg aatagccaca gctgtatgtg gtgtcttaca 11581 cctataatcc cagctacaca gaaggctgag gtgggaggat agcttgagtt caaacttgaa 11641 ccaggagttc aaggccagtt 
tgggcaacat agtgagatag caagattcca tttcttaaaa 11701 aacagtctga gtgtctccct cactcctgct acctatctat agcttattca ttctcaccat 11761 aaactcactt agtgagaaat caaggttaaa ggagtaggga agaaatttaa gtaatttaat 11821 tttatgactt tttagtaact cagctcaaag actctaatga taggaagcct ggaatgaata 11881 actaaaatgt tatttttaga ctatctgtga atttcatttc ctctcattac cacaaatgat 11941 gaaaaattag ctgtcagaaa tttcaaagca atatagcaga acatgatgga aaaacaaatt 12001 aatcaagtca ccatcttccg aaacattttt cctttattag aattcttcag gtaatatgac 12061 gaagataaca atactgcatt tctttaaaat atgctgtaca tttcctcata ggaaaaaggg 12121 tgagttatgg tgaccagtga cattaactgg aaacaagcaa acttttggtt cctgggccac 12181 tagttgtatt tactttaaga aaaccctaaa gatgatttgc agtctttcag agagtttact 12241 tttatgttta cagcccaaaa tgtgcagata acttgtttgt aaataactgg caggggtcac 12301 agtgtcacat ctaggagatt tcattttatt ttatttcgac aaccagagtt acagcctctt 12361 tacaactgag aagtggtaat aaatggtatt gtgcttattc ttcaacaact ccatagcact 12421 ggcacctgac atatatttgc atgaatttca tgtaacaaaa tgtagaaatc tttctttcta 12481 aagggcttca aaatttcccg aaacagactt ctgaacttca ttttattcct cttgctacat 12541 ggttgttctg gattagtata tatatcaaca cctaaaaaca gagttacaaa aattagtgca 12601 tacctcatca atttcataaa gttacatcct ttcttgtttg catacattta tattttattt 12661 attactgtaa ctgccactta caacccagca acaaagagtt gtattttaga atattatgga 12721 atggtgatat gatagacact ttaaagtgta ttgacatgtc cttatgggaa ttagggattt 12781 aatttttttt ttttttgatt gaagcactgc atgtctcaag ggtggctgca ttcagtaggt 12841 ctgtgacata acccctggaa aatacccagg ctgcccaatg gaagagtcac taatcagata 12901 ttgaatatga tatttttgtg ttcaaatttt caagtcctca cacccatcct aactaatttt 12961 taggcctttc ctttaagaca accaaaaaac aaaactataa ctgttcaaac ttggaactaa 13021 ggaaaggaaa taaatattaa agcggagctg gatttatttc tatttattta gagtacatca 13081 cctccagtag aattatgttg ctaaggttgg gatcacggct ttggtctcgc gaagggacct 13141 gctttgcaga tccagggctc gtttaatttt ccagctgggt tcctgctttg cttaactaat 13201 cctgtgcacc ccgttgtgcg gtccacacct gaaaggggga gtttcgatgt tgtggtttat 13261 taaccaggta ctttccattc actcagtcaa cagaaagaaa gagagagacc gcgcctccta 13321 
cttttttttt tttttttttt ttaactacaa ataaggagac ggaaactatt aaataagaag 13381 gggggaaact cccgcatatt tcaccttgac ctggctggac tggtctgctg cctctaagag 13441 gcgggtgttg tattaaaatg agaggtcaaa ctcctcgaat gggaatcgct cctgccccta 13501 gagggaaaag agaaccatca ccgtgttaaa ctggaaacac tgccttgttt ccaatgcggg 13561 ggactgagtt ccctccctcg acttagcgtg gtaatgacag gacccgagga ggcgcgctgc 13621 ttccgggcag ttgcccagca taacacccgc accctacttg cacctaaatt tctgagctgc 13681 aagcaggcag aagtgccttc cactttctct ccttagctcc cctctccccg ctttgctgct 13741 gtctgtaact ttttgttccg tttactggga aggtgacgct taagatcaaa agacccactg 13801 taacttgcaa aagcccagag ccaccaccag acgctgggga aaaacagggc tgtccagtcc 13861 agggttaagg ttttgagcag agaaggagca gcgagaaatc tcggccagtc gctccgggaa 13921 cagcccctgc agactggtcc aactctccaa aggctaggga caccggggca gccttgtggt 13981 ggggtagagg acgagctcct aactgcaagc aaagtgctga gtagagggcg ttgggggcag 14041 gggaggtccc ttgactccac tttgtatgca gcagctgcgg ggcagtgagt atctccgaac 14101 aaagcggcga gcagcgcggg ggtgagagat tggggtttgc tcgcagctgc cgctggcgct 14161 gagcgagtct cccaccaccc ccgcgagcgt tggggtgcaa ggattcgggg gaggagggag 14221 aagggaacga ggcgtgccat tcctgttgcg cggctacgct tcaggagttc ttgtgaacga 14281 tgggaatggc aactatcaga gccaaggggg aaaagaaaga gctggagtga gcaaaacaaa 14341 gagcgagtgt gggccgcggt gacttcatgc ctcaccaatg tcccgcccac gctgctccga 14401 gctgttactg ctgctcctgc ccgggctgca gccgcgcctg cgggcagccg aggcgaggcg 14461 ctccggtagc gtggggaggt ttcctagcaa ctctctgccc tgagtgcacc cggctacccg 14521 caggtctgga gccccccggg gggccagagc aggaccaccc ctctcccgcc catcgccggc 14581 ctcgcctctc tcgcatccta gccgtgtgca ccccagagcg cgccagcaac tcgggctctg 14641 gactgcggga cgcctgagcc gcgcactgag gcgaaaagga caagggcacg cagcccccgc 14701 cccgcgaagc cgggctccgg cacctcgcgg aacctcccct cctgcgctcg cagccgcagg 14761 ccccactccc cgccagggag ggagcggcga tgcctcgctc tgtgcctgcg ggtgtcggcg 14821 ggtgcttcta gggcgctccc agagccgcct ccccctgttg ctggcatccc gagcttcctc 14881 ccttgccagc caggacgctg ccgacttgtc tttgcccgct gctccgcaga cggggctgca 14941 aagctgcaac taatggtgtt ggcctccctg cccacctgtg gaagcaactg 
cgctgattga 15001 tgcgccacag acttttttcc cctcgacctc gccggcgtcc cctcccacag atccagcatc 15061 acccagtgaa tgtacattag ggtggtttcc cccccagctt cgggctttgt ttgggtttga 15121 ttgtgtttgg ctcttcgcta agctgattta tgcagcagaa gccccaccgg ctggagagaa 15181 acaaaagctc ttttctttgt cccggagcag gctgcggagc ccttgcagag ccctctctcc 15241 agtcgccgcc ggggcccttg gccgtcgaag gaggtgcttc tcgcggagac cgcgggaccc 15301 gccgtgccga gccgggaggg ccgcaggggc cctgagatgc cgagcggtgc ccgggcccgc 15361 ttacctgcac cgcttgctcc gagccgcggg gtccgcctgc taggcctgcg gaaaacgtcc 15421 tagcgacact cggcccgcgg gccccgaggt gcgcccggga ggcgcgagcc cgcgtccgga 15481 aggcagtcag gcggcgggcg cggggcgggc tgttttgcat tatgtgcggc tcggccctgg 15541 ctttttttac cgctgcattt gtctgcctgc aaaacgaccg gcgaggtccc gcctcgttcc 15601 tctgggcagc ctgggtgttt tcacttgttc ttggactggg ccaaggtggt aagttgtttt 15661 gttttgtttt gttttcttct cttccccaaa ggtcttgggc tggcacgaag agctgcccgg 15721 ccagagcatc ccggccaggt aagttgcggt catcagagaa cttcaagggg gcgggtgcgg 15781 ggcagcgtcc tccagcacag atggcctaga ggccaggtgt caagcatttc tgccctggag 15841 cgacagcaag gactaattat cagaggagac actctgtgtt actcctaggt gttgaggaag 15901 ccctaataga aaggaatttt ggaaaagagg gtgtcctgtt tacagaagtc atttaggggc 15961 cctggtccat tcttaagggc atcgtctgta cacaatgttc tccctatctt tctaacattt 16021 gggtccacta aggactaact tgcctttaac tcgaggctgt agataagttg gttataatgg 16081 gagtagtctt caatttttga gtccagaaaa agacagcata caagtttctg ctatcattag 16141 tgctcaatct gaatgaatta cccacactgg agggatatta atgctttaag tcatcatcct 16201 ttcggcttaa aaaaaaaaaa tactatctaa ttgatccctc actgagatca atctctgaga 16261 cagagaagca gaaagaactc tgaaccagga ttaacagctt tctgtccagg caaaccattt 16321 gtgtaagagc taccttaatg tggagaggtt aactctttac gaaatgcatt gtcctcatct 16381 gtaaaacgag gtttattaat aatacctgcc ttctctaaat cacaaaggaa agttcaagtc 16441 aaatagcata tgagaaggtg ttttaggact tttgatttat gaaacaaaat caaatgtaag 16501 gtggtatacc agattggatt gttggaaaat tacacaacta agtaatagga cgaatcatga 16561 gtttaggatg agttcttaat ggttttcaat ataggcccaa catttatttt tgctaatgca 16621 gtgactcttg tcatactttc tttttttttt 
tttttaattt cactttttct ctgtttggtc 16681 accaagtcct taaattatat tagagccttt ctctgacctc ttgggctcaa ggagagttta 16741 attaatagtt gtaattgttt tgtaggttaa aatttactcc tatactttaa agtgtcaagg 16801 ttttgcctga agctaattaa aacagtcaaa tcataacctt ataagtaggt tttatatata 16861 tagatataga gatatatgga aacatccagc tatctttagg aaattgttgc atatacagtt 16921 ttctaatatt tgtttcaggg agatttgctc agttatctga aatggagaaa ttagagtaaa 16981 tctctttatc ttgtggttat tttttggctt gtttgttgat ttcaaaatag tgaaaaggca 17041 aaggaaaaag taacattcat tatcagttat ctcttaaatg gaagaaacat gcatttgttg 17101 cagcaaattt tccgctaagc ataatttcat cctgtttcaa taagtaataa agtgcttttc 17161 tagatttagt aaacacacat acacactagt aaaatgacca aaatttgtgt taacacagta 17221 tattttgttc ccaacgtaca ggcaaaggaa aaagtcctac ttgcctctaa aatttactat 17281 ttaaagtaca tttttaaaaa ccattaactg tgcttataaa cattagtcag gctcttgtga 17341 aaaagcattg catttctacc cagtgtctaa aactgatttg aaggaaatat tatgaatggg 17401 cctacttgca cctgcagacc tgaaaaggga atgcttatct gaaatcagaa caaggcatcc 17461 ttttcccagt ctctgtcaat atacaaaggt ttctgatgta tactgatttg catggcagtt 17521 aaaaacatgc aaaaaatgta aattatagcc agtataactc agaaaattag agatcagcat 17581 aaactgctgt acttttgtga tatagctaca agtctcaaat tatatgtctt tacagtgcaa 17641 attagatgct taaagcagaa tagaaattta ggtatcagtt acaatggaaa tcttgtttcc 17701 ttgctcagta gtttacacca ggagaatgtt gagggcaata aagccataca aaagagacca 17761 gaagctgcat aagaccagca taactttggg tttaattttt aaagcagaat ttataggtct 17821 cgaacatata gtggactttt aataacaaat atatagtgag cagtggggga tgttattaat 17881 caaatccaaa gatttctgat gtagcaaaaa ttatagcagt ttgtttttta actgtctctg 17941 gtgaagatta cccacagcta gagaaataat agcactgtaa aatggtcaat aaataaccag 18001 aaaaacataa agtatatcta gtggttaata tttgaagagt taatgtagca acattgtttc 18061 cacatttact tattttaaaa gattaagaac tttcattacc taaattttta tatagcaata 18121 taagtgtaac aaatttcaga tatcacagtt tcttttaaaa aagcggtaac tattattgca 18181 atcaaatttc accttctgtg tattgcaatg aaatttcacc ttctgtttat tcaagtatca 18241 attatttatt attttcaaca acaatcaact agattaacat tattttgtga ccatatcatt 18301 acttcccctc 
aaagcgttgc atagcataag tgttcccaga attatttatg cttacctaaa 18361 gtgatcggat atttaaaaga accaaggtat cattaacaaa aaattcagcc ttctaaatta 18421 tttgcataat aagctcccaa attaattcta tcttgaagca atggagacac tttctgatca 18481 attaaaaatc atacatgttt gccctctaaa ctgacagcaa agtagctgat atgctttgga 18541 aaattcctgt gccccaaaat aacttcttct cttgttacca aattatctga aatctcacta 18601 cttaatggat tctttttggt taaccatggt attaacccta tgatttttta aagtttttaa 18661 catggtataa tggaaatttt caaatgcata cagaattaga gagaatggta taagggacca 18721 ctatatactc atcacccagc ttcattaacc atcaactcaa agccaacttc tttcatctgt 18781 gctcctgctg gtacccaatt cacccccttc tcccaactaa atattttgaa agaaattttt 18841 catatcattt tattttaaaa tatttcagta aataaaaata ctactggagg aggaacacac 18901 tgtaactaat cttaaagcat gaattttttt aaaaactgaa acttagggca acaagaatgt 18961 tcacaatttt taagggaagc aattgcaaaa gagaaggaca cgaggaatgg gaaagacaca 19021 gtttcaacca tttacttttc aatagttcta ttatagttcc attaaaaata tttttatttt 19081 gaaagtatgg atatgtaata gctttttgat acgcatgtat atgaacagag aaaaacaaca 19141 tgagtaaatc tgcttcccgt gacaaatgtt aattgtttta ttccaatgga atcacagcag 19201 gtataccttg taaagtttca gtttcaagga tctgcctgcc cttgcatgat tagtttttct 19261 cctatctact gtatcattta aagcaatatc catatatgtg gcaaaacctt tttatctcct 19321 attttttctc tccttttgct ataagtcctt gaaagagttg tccgtaactc ctctctccga 19381 ttcttctcct caccttctct ttgaatccat tgtgagcagg ctttctttat cacctctgca 19441 gtccactctc ttaagatcag ggtaacctcc acattgccag atctgatggc taattcttaa 19501 tctccatcct gcttgatgta tcatctgttg gtgacttgac actgttgacc ttgccctcct 19561 tggaaacagt tttgttggct tcttggaccc acactctctt ggtttaagct ccttctccat 19621 ctctgatgtt gattcttcct catttcttaa cctctaaatg tagtagtggc tcaggaccca 19681 ctcctaagga cttctctgat ccagtcttgg ttttcttttt tttttttttt ttttttttga 19741 gacggagtct tgctctgtcg cccaggctgg agtgcagtgg tgcgatctcg gcccactgca 19801 agcttcgcct cctgggttca caccattctc ctgcctcagc ctactgagta gctgagacta 19861 caggcacccg ccaccacgcc cggctaattt tttgtatttt tagtagagag ggcgtttcac 19921 tgtgttagcc aggatcgtct tgatctcctg acctcgtgat ctgctcgcct cggcctccca 
19981 aagtgctggg cttacaggcg tgagccacca cacccggccc agtcttgtgt cttaaatacc 20041 atttccacat tgacacctcc agaatattta tctctaaacc tgacctccag aaggtgcagc 20101 aaattgtgta aaccatgtct ccactacacg cctaaaggca aagaccctaa cgcattcaaa 20161 accaaactcc attttcttgt ccagtcttca taatcacact gtgagagagg tccctgtttt 20221 tgaaacaaat aaacaggagc agaggagata agtaacttgc ccaatgtcac ctatctagta 20281 agtaggagag atgggatttg aacatcgtag agaaagcttg aattatgctt cataatatga 20341 acttcctaat cagtcttcat catttcaaac ctttgtccac atgttccttt ctactcagaa 20401 ctttccccta gctccccatt gtcttcaggc tcctcataca acttcctccc agctttgcta 20461 tggacacacc cacctgtgac ctccttgagg gaagaagctg cctgtatttt tgtgtgtcca 20521 atgcttggta cacagtaggt cttcacaaag tatctggagt aaatgacccc atagataact 20581 tgaggataat agtgagcaat ctaaaattgg gcagctaagc aaattctccc taatcatttc 20641 taagcctcac tagaatttga acttagagaa ataaaatttt cttttaaaag tgggttacac 20701 cataaaggac acttgtttct gttttcccag accataccag aaagcagtct cttgaccttc 20761 tggacaaatc attagcaaag attaaaagct gagacttagt actctttaat cttatttttc 20821 aaagtctgtc ttgttttcaa actttctgtg acaatgattt ctattgttga taataaaata 20881 taaaaagtaa tcctaattct ttacccttgg tatcaacttt aaaattgatt ttacaatttt 20941 acaaaattgt aattagatta caaaaactgg aatttgtagt cattttcaac taaatacttc 21001 aaaaattaca aaaactggaa tttatagtca ttttcaacta aatacttaaa aaattaataa 21061 aaaatccaag ctattctaac tcctttaatg actattcatt cagactttct taaagcacat 21121 acagctttat gacaggcact atactgagtt ctggagttaa aaaataacaa cgattaagtt 21181 ttgtgtatta tttactgtag gccaggcacc ctcctggaca cattatttga aaatataata 21241 tatcctttaa tgcttacaac cctgttagtt atatactgtt attcccattg tataggtaag 21301 caaactaatt caagaggcat tgtataactt cttccagatc tcttaactaa taaataatgg 21361 tcccagaatt ccgatctgtg tggtttgagt cagcattctg aatcataatt atagacatat 21421 agattgcttt tcaaaggtga gaggagtccc taccttgaac cttggaaggt tcagctctag 21481 ctgtcctcag gccattgacc tcaaccatgg gcagggagtc catggggctt gctcaagtaa 21541 agtctgtcac cccctctaga tcatctccag aattcttagc ctaaacagcc tcagactact 21601 tttatggcct gccttctggg ggtaaactga gtcacccctg 
tgccaaatcc taggtgaaga 21661 tggccaagga cactgatgca ggagaggtaa acagagcttg gacatgcaag ctgaagtgtc 21721 catgcaggta aacaacaggc tacctgtgct gcagaataaa tttagaggtg ggagagaaga 21781 ggggaagata atggctggag aggcaacgct tcacacactg cccagttcct aactagaaat 21841 ctggaggtga atattccaaa ttcagaccta gccttccagg tcattatgca tatatattta 21901 tcaaggaagg aagatagaac ttattttacc tatgggtttc tttgcctggt ttgtagctgt 21961 taaatatgta gtctgttatg ctcactgcta tttgtgttct tgctgttggc cctgaaacag 22021 ttaggggcag acttgtctac tagtttaaat attactgggg gccggttgca gtggctaaca 22081 actgtaatcc catcactttg ggaggctgag gtgggaggaa cacttgatgt caggagttcg 22141 agaccagcct ggccaacatg gagtaaagac cagcccgtct ttactaaaaa tacaaaaatt 22201 agcggggcat ggcagtgcgc acctgtagtc ccagctactt gacaggctga ggcacaagaa 22261 ttgctagatc tcaggaggtt gcagtgagcc aagatcatgc cacagcactc caggcctggg 22321 caacagagca agattgcatc tcaaaaatag ataaataaat aataaacaaa taaatattat 22381 tgagccgggg agacaaagaa agcaccagag accattgcat aaggtagtct ggtagtcttg 22441 actacctcat gtcaaagtgt ttttatattt taataactaa atcccttttg gtaatctttg 22501 agatgtgtac agattataca aaattttact ctcttgcaaa tcatgttaca atgagagtca 22561 aatgtatgca tagtcttaat ttagttttac ttatttccct ctattatttt aaaaaatatc 22621 ttagaaggct ggccacagtg gctcacacct gtaatcccag cacttttgga ggcagaggtg 22681 ggtggatcac gaggtcagga gttcgagacc agcctggtga acagggtgaa accctgtctc 22741 tactaaaaat acaaaaatta gctggatgtg gttgcacaca cctacagtcc cagctactca 22801 ggaggctggg gcaagagaat catttgaacc catgagatgg aggttgcagt gagccgagat 22861 cgtgccattg ctctccagcc tggcgacaga gcgagactcc atctcgaaaa aaaaaaaaaa 22921 aaacaactta ggaaagaaca ttttactgac atgaatgacc taatttggtc aaattattca 22981 gttggcatat atttgatatt taatatctat aagtgaatga ctatatagaa aaagcagtgc 23041 catactatat tcatggagta aaaggactat ttttcgactt aaagtgaatc ttaatgtgac 23101 cttatttcct ctttcgcagt aagtaaatta tttgatttac acaccaaaaa agtgtattga 23161 aagtcagaat gatttccatt gggattttat aatttaatta aaacagaccc accttttaac 23221 aaaagtgcta agatcaattc ttgctaggaa actactaata ttaaacagat atatttaaat 23281 tatcttcctg cagttattga 
aaaaagtcct taactaatat tagtttagga aacaaaaata 23341 agattattaa cttaaaattg cattgttaaa gtgttcttat cagatatata cttttaagat 23401 gatttttaaa gatgcttcta ttttgagtgc tattatccaa agtactaaaa gttcatttta 23461 ggccaggcgc ggtggctcac atctgtaatc tcagcacttt gggaggccaa ggtgggtgga 23521 tcacctgaag tcaggagttc aagaccagcc tggccaatat ggtgaaaccc tgtctctact 23581 aaaaatacaa aaattagccg ggcatggtgg cagacacctg taatcccagc tacttgggag 23641 gctgaggcag gagaatcgcc tgaacccggg aggcggaggt tgcagtgagc cgagattgtg 23701 ccactgcact ccagtctggg agagaaaaag cgagactttg tctaaaataa aataaaatat 23761 aaaagtttgt tttataaaca gcatataggg tcttataaag tggcctttca tgctgacctc 23821 atcaatgagc caaccatcac tgctataatg gttttagacg tgcgctatct ctgcctggtg 23881 ttttattgat gttttgcttt gttgtaatca aataatactg ttacatttgt taaggacaga 23941 atctgttgct attccagata aatattggta tcgtttgttt attctagagg attgccaaaa 24001 aattcacagt ctcgtgatag tacagggtaa gaaagcagag tagctgtcaa gggctagttc 24061 aagtcaaagc ctatggaaga gaccgagttt ggaagtggag agaagggctg gtgatgaaga 24121 caggaggatg aaaaactaga aatattccag ctctgaagaa ctgtaccaaa tatgtcatat 24181 taggtatgtg agatgtgtgg tggatttggg ccaggaatgt tttggtgttt tccatggttt 24241 tggactcaac ttccttttag gataaagcgg gttgaaattt caagatctgc cgatagtaaa 24301 tgaagccaca gttaaaagta ctgctgtttc atgtgtacat gtggtatcag ggatctttcc 24361 ttggaattga accacaaaca gcttcccagg taaaggcatt tttttttcct gattttagta 24421 aggaccatat atttaggatt atttcttagt aactgttaca ggtactaggt agggaagggg 24481 gctgggtgaa aagcttcatc cttatggaat gagggattac cactatgctg taaaggatca 24541 ttagaatatt atgaggtcaa acaaagaaaa taattaaaag gaaaattcaa aattctacaa 24601 ggactggatt gacaggattt atattaagga gtaccaaata gggaatgtct tttctaattg 24661 cctgctggag aggtgcaaag aggccaggct ggataggctt agtcatgcaa ggtattcatt 24721 gactgcaata gaacaaagga tgcaaagaag ttttggggtg gcaaggactt gctgtagaac 24781 tggctgttct agttggaaac aaagacacta ttgtagcaga actgcattgt tggcagcaaa 24841 atgactggct ggtctctttc ttagtacaaa tcatacctcc caaggtattg ctccattgtg 24901 tttttgtgca tttggtttgg atttttatgg ggaattgaag acaagtggat cataaagtgc 24961 
aaaataaaat gctctagaaa tgacagatgg ggcacaattt ccaagaaaat tcatctagac 25021 agtggcaaca ctgagaaaaa aaagaaacat tcaagaaggg agttacagtt tcagacatcc 25081 tcagttttgg ttaaaaaggt cccttttgat gagtgtgtca gtcactacca gctagaacaa 25141 gggccctaaa gtgagggcca acgtatataa tcagagagcc tgacgtcagg ttgatactga 25201 aaatatgaga actaaatggc attactacca gtctttagtt tagaagatat agcatgtttg 25261 atgccttgta aagaaggtat ccaaggccac atgatacttt gtaacatctg atgggactgt 25321 ggtcacatgg gagaagggaa gggggctctt gagcacgtgc ctgtggccca ctgaagacag 25381 catagccagc tggagattgg gcccacccac ccttttaggg ctgtccgggg ctgacagttg 25441 gctacttggg ggtcagcata gcactgatgc cagaactcac ttgcctgagg ttggattatc 25501 tggttataga cttgtctggg cagagattga cttttgggca ttatgaaaag tcagacatca 25561 gggctttttg gacaaattct gcctcttagg tatatgtttg agttaaaaaa aaaaaaaaag 25621 atgtgttact tctgactgct ttctactggt ggggtagaaa agttgggtca gccagaacga 25681 tcattggatg ttacactcct ccttctcaca gggcaaactt ctcagatact gcccaggtgg 25741 tatgtgatat tttctcctat actgcgtaat tatcaattcc tttgggatta atcttaatga 25801 aaaaaagccc acatgcagaa ccagagtatt cttggccctt tgaacaaact agatgcttct 25861 agcatgatat gaaagaacat caagctccag aggacgtaag aattggcctc caagttgtat 25921 aatataaagc tgccctagtg acagaggacc cactaactgg tcagtccact tgaggatctt 25981 tgttagggct ctgtattgac aatctaggaa ctaaataccc tgaaagtccc aagaaatata 26041 gaattacctc aacgttttgg agcttacctc tccagcctaa gtggcacttt tacacatgaa 26101 tttcctaagg aggaatttgg attcccaaaa acattatcct taaaagtgca aactcacctt 26161 tctaggcctc taaggatgct gtacagtaag tggcccagga tggtctaaat gagttggtta 26221 cacaagacca ggttgttgga tgttgcctag atatacagta aactcttcca tccaaaagat 26281 ctcaagaggt gccacataaa cactgggaga ctggctggaa attacagaat ctgagagggt 26341 tttgccagta acttctatac ccatggcaca tttggcaaaa gatttttcat tcattttttt 26401 ttttttggaa ttattggtgt gcaaaagtga aaggtggtgg gaatctgggc actttaggtt 26461 gttcaagacc aaaggaatca atggtcacag atattcacac tgcatttgag agcacagaaa 26521 cctagaaaca caaagtttgg agtttggttg tcatttgttc gcaagtacgg cagagccttc 26581 caggccagaa gaaacagatt aacttagctg tctctgcata atcaatggat 
atgcagtgtg 26641 ggtctgacaa ggtgaagaga catcctattg cctcagtcat ggtgagggtg taattcctgt 26701 ttttcccatt caaatataaa ttggttgtat attttgaaag gcagcttgca gatcttcagt 26761 agacacggca gagttgaacc tgaaaggggt taaatgttag aagacaaaag agatctgagg 26821 tcccaggaaa agctaagtag gaaacagagg ggctcagatt tcaaagttta gaaatgggat 26881 gttggatatt ggcagacaat ttcatgtctc tgaacagctt tgctgccgtt tggccagttt 26941 ttattgatcc ggctgcctgg gctgactctc agtagtaacc taacaatact gtgtggtttt 27001 taacagataa tgtatatttg ggaaccagac agaataaaga tgcaatcatg taataaaatg 27061 gcccaggaaa ttccttaagg aagttttctc tggccctccc actccccctt gaggtgacat 27121 cgtcctgtta tctgctgcct gctgtaccct gcctgttcca tttcatagca ctcttgttaa 27181 gttctgtctt atgtcaggtt ttgctgttat tctacaacag gaatacaagc aggtgcccta 27241 tccctaaaac cctgcacaat gccatggctt aattgcaagt attaagtaat tacaaaaaaa 27301 aattcaggat ggagaagcca tcaaaattat tacaatctca aaattcctta agaagcggag 27361 ttttctgctc tccactccca gatgataatg tcctgtctct tttgactaaa tattcaccca 27421 ttctcttcct ctaatcaagt agcttttgat aatactaaca gagctaccta tgagcaggta 27481 aaaaatagtt ataggagaat cctggtcacc tagagtgcct atgaaaagtg atgttttaat 27541 taattaaatt ctatggccat ttatattctg actcttgctt ttgaacatct ttttaatatg 27601 tgtgtgtaca ctttccttgc ttggtgagta catcatttaa actttctttg gtatttgaaa 27661 aaggccttgc agttatcctt tgatcacatt tgcaaatttt caccattact atttaatcaa 27721 taaatgaatt cattgcttga gcacctgctc agtgcatttc acattggact agttgctatg 27781 gggacagaga ggaagtaagg tatcaaggag atggggtggc tgagcctgag gaactgtgtg 27841 ggcatacaat acgagatagt gctgtggggc agagatgggt ggtttgggcc attagtgctg 27901 tagagcttta ggcttctgat tcagacagga aatatgtcaa atgtgttctg atgcaagaca 27961 gaacattgcc gtcttttcta aagattgata cttagtttta atttttctca tttttttagg 28021 agggaattaa aatttagagc cctgattaac tgaaggttcc agctatttag attttcaata 28081 catatggtcc accttttcta ggcatagtgt attttgcgag atggtcttaa aatgcatcag 28141 cataaagcaa aatccaggct gtgtctccaa tagtacatat aattagttgc attgttctta 28201 gctattagaa gaaactttta aatgtgtgcg taaagactgg aagaacagaa gtgtcagtat 28261 taatttacta aatgagtaga gtgaggacat 
cacaatggtg aaattcaggg acaacctgta 28321 ggaaacaata gtgctttgtg tgtgtgcagg gagcagtgca gaatggccag aggagcaagt 28381 gcagtagcag gacaaagctt ccccagggag acgcaccctg agctgagttc gagagttgac 28441 caagagaagg gacagcattg ggagaaagtt acctgaatgt cacagaagta aacagaaaga 28501 gagagagcac atacgatgtt ttgggcaatg caggggagac acccattcat cctctccttc 28561 agtgaataca tactgaactc ttacactgtg cctggtattg ctgtaagcct cggggatatg 28621 gcagggaacc aaacaaaaag atctgctctc atggagcttt cgttctaatg gagagagaca 28681 agacatttga caaattaaag aaacagaatg tatagcacac cagacagcac taagtatcta 28741 gcaccttctg tagagagaaa gaaacccaga tgggtagtca agaatactgg agaaggaggt 28801 atgatcttaa gtaggataat taggaaggtc tcactgagaa ggtgctgagc caattcccag 28861 ggcctcaggg agggaaatat gaacaatata tcatctgcct caatgatata taactttcta 28921 ggaggatatt gacattgtaa tccaataaat tctataattt taaaatctaa gatgtgctaa 28981 ttatttaata aatatttttc aacaagtgtg tggaaagtac cattcaaagt atgaaaacat 29041 taggaaacta aaggacaaaa gttcttgtcc tcataagtct tatattttgg aggaggacag 29101 acaaaataaa aaaaaacatg gttatacata attatctagt gtgttcaaag gtggtaagtg 29161 ctatggggaa aagtggagca gggttggggg aagacagagt tgccatttca agtagattag 29221 atggggtaag cctcatgaaa aaagaccagt gtttgagcca agccctcaag acagtaagag 29281 agttagcctt gtggataact ggagaaaggg tagagagatg agaggctatg aagcaatgtg 29341 agaaattttt aatgcatagc aggatcgaag gactggtagt ggcagattgt tggtatctgg 29401 agcccaggca gagtgaatca gcttgtaaga caagaggtgg tggtcagaca gggtgatatg 29461 tgaaactgga attgtagaga agttgcagtt actggtaatt acaaggtcta aatttcttaa 29521 catggaagca aagtagctaa atcaaggtgg aggacaaaat aattagagga gaagagaata 29581 aggagctgat gcatcttgta ttgcaaagat gatgtgtgta tatacaaaaa ttgtcaggaa 29641 ttgacatagg agtattgttg gagaaaatta tatcaagcaa ggaggcaaca ttttctaaaa 29701 acgagaaagc atctccctgt ggttggtaaa tgtcagtgaa tgcctgcaac aattcaggat 29761 ggtgggtgac ataatgtaag ttacaagtgt caaagctggg ggtgggatga ttaggggatg 29821 agagagggag aagtgcctgt gagtagcagt gaggactcag gaagacactt atacatctcc 29881 aagctcaagg ctgttaggat tctgggagaa acatcagcga ttactcatga ggtcagggga 29941 agacaagttc 
tcaggggaga gccaggtttc tgttcaagca agacaagaga acattcagat 30001 aagggactgg gatatgggca gtccttctgg aaatggacta tgaatttcag agggcacagg 30061 ggatgggttt ctggagttgt aaaggaatag gagataagga gaaaagaggg ctgtgcagag 30121 ccttctaggg attagaatgt gttgggggaa taagggtgca tctgggagtc ttgggcttct 30181 tgtagaaact ggtgcaataa acagggataa gtgagattag tctttgggat ctcaaggtag 30241 atagtggtga tgatactgtc agtggctgag gggcaagggg aggttctgag acttgctctt 30301 aattcttagg gaaggaagag acaaggtctg aggaagcctg acttagttct gaagtgaggc 30361 gttgactcta ccccgtttcc tgccaaaagc aacgtggact ttctctgggt gtcttaagac 30421 actcttgact ccagatgatt aaagtgattg cctgagaagt atataacacc tgacgattat 30481 acctgaaggg ggctccacag gcagagagga ggcatgtaca ttgaccatga cataggactt 30541 taggagaaaa gcaaaggaga gcaccttgga gttaatggca ggtagatttg ttagaacttt 30601 atgctatggt ctaaatgttt ttgcttctcc aaaattcatg tgttgaaatc taatcatcaa 30661 tgtaatggca ttaaaggtgg ggcctttgga aaatgactag gtcatgaaga taggaccctg 30721 gtgaattgta ttagtgcact tatgaaaaag gcctgtggga actcgtctgt cccttctgcc 30781 atatgaagac acagcaagaa ggtgccacct atgaggaagg ctccctcgcc ataccaccac 30841 atctgttgct gcctttgtct tggacttgcc ggattccaaa actgcgagaa ataaattctg 30901 ttatttgtaa attacccaat ctaaggtatt ttgttatggc agcctgaatg gactgagata 30961 ctttactact caggtggtgg aaataattcc aggcgttttg ttttgttttg acttgattgt 31021 ttagtagggt agtaatgtga tcactgtatc agcaaaagaa actggcaaca atgtatgtgg 31081 tgatttttag tggtaaatgg tgtttgtaag agattctgag aacgttgtta cattaaagca 31141 aatgatcgat aaggaaggcc tcaactagga tggatccatg ggaatggaaa tagggagtga 31201 gaaatactaa agacagtaaa ttaaaagcaa aaagtcttgg tgactgacag gttgttgagg 31261 ctgtggtaga gagaggagtc aaggttgacc acaaagattt aaatcagaat agatgggaaa 31321 ataatgtaat taatggaata catcattcaa gtcaggaaga ggaacataat gtttgagact 31381 cgaagatttt gaagttgtgg caggaatctg agtgggttgc caagcagatt gttgaaagtt 31441 caagacagaa gcttgggtga gagtttggaa ctgaaggtac caatttggtg gtcatgttca 31501 aggaagtgag gcttggaatc ttgttagttc atagtcatca agctgtagag ttgagagagc 31561 aatgtgaaaa tagccaaaga cataacaagg ggagttgctt gagaatgaga gaagtgatag 
31621 agtgcccact cagtgttgaa agacaggaca tttaaacatt gctaagtgct agttgctaaa 31681 tgtaattaag aaatattatt tgcctttgag tgagcaatta actagaaagc tagaggaaat 31741 tatcttgata atgaggagag gcatttttct aataagagaa caaagacaat tttgacttga 31801 aaagaaggaa tattatttag atcttctcta taatataatg gtaaagttag ggtcatatct 31861 taagtgagag gacttggtat gagttcttat ggatttgata agaaagaatt aaaacagtga 31921 aagtgaaaac atatgatgga aagtcaacaa gaaattaatg aaagcagggt ctaggaacta 31981 gatgaggtga gaaagaaatt tggagggaac ccaattctta ataggtttct ggcagtgccc 32041 cataaggcag cctcagataa tcagatgttg cttaggatgc atcaccccat ttgtgcctga 32101 cctcctgttt actttcaact ccatgtttta gagttcttga atcagtagcc tgccttctgt 32161 ctcagcctct ggctcctctt gaatttgtta tttattctgc ttctctttgg tttttcatat 32221 ctaagaactg ttttattgtt ccgttatcca gtgactcctc tatagctcat gagaatttcc 32281 tgctgcagct cctggcccca gagattccta atggctctgt ccatggtgga ggtagtgagg 32341 tgttgggggt gacttgggta aggcagaata accttgagca aggtgatgaa tacattgcaa 32401 atttagccgt ttctggtgct gataacaaag atgggggagc atataaataa gagctgtgac 32461 ctttgaaaga atagagattg ataagtgttg caagatggaa gctggggaga atctcataag 32521 tagccagtgg gaacaaggag aatatgtttg caaagcactg ttggaacatt tccccaagga 32581 atgatgaact ctgcataaaa ctaacttttg atgaaggaga aacagtagct ggaccatttg 32641 gggaaatatg gagaagatgg aataagaatt ttatagggca atatggacta tagttcataa 32701 cacagtacca atataaaatt ttctgaattt accacttcta ctgtgatata ttttttcaaa 32761 tttctaccga tgatattaaa ctgaacgaat gttaattgaa aattataaga tattaaagta 32821 ttacggaaga ttatctatcc tataagatac agttaacaca ttttataata attcagagtt 32881 cctttttgtc tgagtctcca aataaagtga attttagaaa agacaaatat ttagtgtcat 32941 atttggcatt gaggatagaa gaggatttac tttagagaac aggatataaa gtataatatt 33001 gatgtatagt tcaagctata ttagtattac tattgatcat aatattaggt tggtgctgaa 33061 gtaattgcgg tttttgacat tactttcaat ggccaaaacc gcaattactt tagcaccaac 33121 ctaatagtac taattactaa gatacttaaa cttacattgg tatcattacc tactccattc 33181 aatttcttag gctgccgaga gttggcaaga aattgcaagc agaatataat aaataagttt 33241 tgaatttgaa aagatttttt attatctcac aaacaagttc 
aatccataca aaaaagtatt 33301 aaaataaatt aatctgaata taaaaggata tgtaaatcaa atgccgtata tgccgtatag 33361 ggataaacaa ctttaaccac tatctaatct gtattaaaca acaacaacaa caacaaaata 33421 aaaacataag atgtcactgg attttcttct aaaaatcata gtttaatgtt ttgctataca 33481 ctggcgggaa agaaatatat tccttcattg tgcactggtt gtctttttct aacaaatatt 33541 ccatctcagt taatttacag tgctaactgg aaataaactt tatcgatata atgtacatca 33601 ataaactgta tttatttaaa atgtcagtgg atgagttttg ccagatgtat agacctctga 33661 agctacgtac cactacagtc aacgtacaga acccttccac ccaccaaaga tttgctcatg 33721 cctctttgct gacagtgcca aaacacaaaa cagggaataa tcagtctgca ttattggtaa 33781 ctgttttatt gaatgtatag ggattttggt gccccttttt taaaaaacaa aaacaaaaac 33841 aattccctgt attgaagttg ttgttgttgt tgttgttttg agacggagtt tcactcttgt 33901 cgcccaggct ggagtgcaat ggcatgatct cggctcactg caacctccac ctctcaggtt 33961 gaagcgattc tcctgcctca gcctcctgag tagctgagat tacaggtgcc cgccaccatg 34021 tccggctaat ttttgtattt ttagtagaga cggggttttg ccatgttggt caggctggtc 34081 ttgaactcct gacctcaggt gatctgcccg cctctgcctc ccaaagtgct gggattacag 34141 gcatgagcca ctgtgcccag cctttggtgg ccttttaaaa taaatctttc ctattataaa 34201 ctgtatctta aaacagtccc ataaactcac agaactgtac atgctataaa gcaaatgttc 34261 tccaccaatc atgtcattgt aagttgcact ctacttttca aactgttttc acagctcagt 34321 ggaggaacac agcacacaat gaggggaaca tttttcctac atcatcaacg aggaagcaaa 34381 gcctcagcag ttagtttatg aagccgaggc cagcgaatag gcagggttag gacaagaaca 34441 agttgccttc tcccctgttc cagatttggc tttcaagaca gagctagaaa ttcacaaatg 34501 gaaacagcaa ctccgaacac cacttcctat gctactgcct aattagcatg cagctcagag 34561 ctgtcccctg gcaaaagcct ggagaaacat ggttgtaaaa ggacagctct cccctctgct 34621 gaacaagtgc cagccagtct gctttatgca ctcagtgagg ttttcaggct ggggctttca 34681 cgcgaaaact cagtttccct gtgcatagaa aggaaatgaa ttctgaaaag ggtaccaaag 34741 tatttaggag agtctctgac tccaaggatg gaaaataatc ttactaggga gaatctaaca 34801 ttcttatctt tagagtggga aaataggatt accttagcac tgatgataag aaaagtaatg 34861 acatactact tgtattaaag aattttctgg aggtactagg gattattgct gagtcatcat 34921 ttaagaacta ataatagtac 
taattagcca ggagttttca gtcatacttt ggaataattt 34981 tcttttttat gttattctct attccaccta agacagtgtt tacaagaaaa ggttctgtca 35041 tcagaactgg gttcaaactt ggtctctgta tttaccacct ctgtgacttt aggtgagact 35101 tagcctccct aagctgtagt tttccatctg taaaatgggc atagcagaac tttcctcatg 35161 ggagctttat gaagcttcag gtagatcatt tatataaagt gcttggcata gggccaacac 35221 agtgtgatca tgtaatagat acttgatttt ttaaattttc tctaatttta ttgtcctgta 35281 tgtatagatt taattttcaa gtgcagagtc aaaagatatt aaatatattt ttgaattttg 35341 aaatgtgatt aaaggaaaac caaatcctaa gtgacctaaa attcatattg ggttttaatt 35401 taccggtgtt tgttttttac ttagcactaa atctcttaca atatagagtg ttgctttgct 35461 actgtaatca agttcacacc tgcaacgtga ctttaagttt aaaagataca atgtagaaaa 35521 tgcattctta ctatcctagg gcattttttt taacctcctt aaccatggca tttgaagttt 35581 ggacatgaat aacagtgctt ttttttttaa gttggagaaa ggaaacctat acctgccaag 35641 agttggaaat gaaaatagga agagcccctg gaactctagt gtgttagatt gaaaacatta 35701 tttagaggaa cagactggga aaccagtgaa tattttggct tttatagaaa gggagtgtgc 35761 ctcataaaca ggacagttgg aatgtatgga aaatttccag tgaaaacaag aaacaaagca 35821 catgactaat actgcttttc caagttagac aaatgcagct ttaaccagaa tttgccactg 35881 aggggccttg ggggccctcc ttgaatctgg atactttgat ccctttttca agttctaagt 35941 ctaatgtagg aaaagctgta cagatagatg atatggtttt gacccccatc tttcttcctg 36001 tttctgccgt tcttgaaaat tcccagcagg tgagtgtcag aagacaaatt gaagtattca 36061 gctaaataag caactgcaaa actgaattta ttcctcttca agaactactc ttcttgtatg 36121 ttgagtgacc atttggctgc caagagaaac tgctttgcct aggaaataat atggactgaa 36181 aaatcagttt caaacacaaa tgaaaaatat tccttatgtt tggtcatctt ctttccacat 36241 caaatataca tgtaaagatg tatataagat ataatctgaa gataatcaaa cagctattaa 36301 ctggatttct gcatttcgcc tatccttcaa aaccagcctc aagtctacct ttatgttgag 36361 agagtgtatc tttaactcat ggtgttctct ttctcctttc acatttcctg taaactcatc 36421 atgaattgcc ttgtttattg ttttgatttc tcattcaaca acaccagaat actctgtgtt 36481 tagggtaacc ttagacttca gttttcctga ggagtttcag tttacacgtc ttgcaccggt 36541 acaattaaca gtgccccttt caggttaaaa agtgtctcag tgtagttcat aaattatata 36601 
gtcaccccat gtgtgcagca atcagagctt tttcaatgca aagttgatta tgatgggcat 36661 aaatgaatca taataaaaag ttctagtatg atcccatctg actgggcttc ttgtgatggc 36721 ccatttcaac accatgggtg tcattagtgg cttcacaatg gttcttcttg taaattccct 36781 catgacctag agttttgatt tatgccatcc tagcccactt taagctctga ctcatgctgg 36841 gattgcccta aacagggcag ggcagactaa tggaatgcat tgcattgtcc taacatttct 36901 taacctgccc aaggttgaac atgcttgttt gtttttcttt tgctctccaa aaatctttat 36961 gctagcgtaa aggcctgttg taatcacata aattactcac tatgtaacag gagaagttga 37021 aatttgtccc tgaaacataa agattatata taataaaagc cttaaaaacc tacctattga 37081 gaaccaaaga tatttagtgc tgtttcctaa ctggcattac aagaaaattc tgctcagtca 37141 ggggatactc acctttcaag ggacaaggaa catgtaatga atttaaattt ggtttagatc 37201 tgtgtaataa ggaaaagtaa gtgaatttgg tgatcttgta cctgaggatg gcggcattaa 37261 aaaaagacta ttttaactta atatcatttc attaacttct ttaaaaataa gctagttcag 37321 ctgaactaca caatatcaga agaccaagtg taagaaaacg gtaatcggcc gggcgcggtg 37381 gctcacgcct gtaatcccag cactttggga ggccgaggcg ggcggatcac gaggtcagga 37441 gatcgagacc atcctggcta acacggtgaa accccgtctc tactaaaaat acaaaaatta 37501 gccaggcgtg gtggcgcgcg cctgtagtcc cagctacacg ggaggctgag gcaggagaat 37561 ggcgtgaacc cgggaggcga agcttgcagt gagccgagat cgcgccactg cactccagcc 37621 tgggcgacag agcgagactc cgtctcaaaa aaaaaaaaaa aaaaaaaaaa agaaaacggt 37681 aatctgtgtt tgtggaaaca tagagatgaa ttcatctagt tatgtatcat tatgtaaaca 37741 tagttacatt gccttttgtg ataaataggt tttggaaatg aagattactt aagaatatat 37801 gtagtcaaga cttataaaaa ctcaggaatt aaattgtcaa attaacctct acttggaata 37861 ttatttatta attacacatt tactgagtgt attctagtat attgagtgta tgtacaaggc 37921 tcatatactg ggtatataaa gaaaatgtat ttaagcacct ttcctcccag gaactggtag 37981 tctagtattc ttacttttcc aaggaaaaac aaacaaacaa acatgaagaa atatatctct 38041 ttagtatgga tgatctactt cacataaagt aattttatgc tggacatttc ttttaaaact 38101 ttacatttat tctgttagag tctcttattt atctttcaaa actcagatga aaattcatca 38161 gttaggaatt gtgtttatct gcaagtaaga gggtctttga aaacaggacg aagtaagatt 38221 cgagtaagct ttcctttcac atcagaccca gttctgagag taaacgagca 
gatacaggga 38281 ctccttggta ttctgagagg cctgggctcc ttttctccgg ttgtgtttag cctcggcttc 38341 tatgagacaa gttccctcaa gtctgaatat ggctgctgca ggtctggaca ttacagccat 38401 gttccaggca gaaagaggca ggatgggcat aaggggaata aaaaggaaaa caagaaaact 38461 tttttgaaaa ctttcacaga agccccattc agaaacttct gcttccattt aaatcatcat 38521 cgttagttat gcaatcatca tcattacaag agaaggtggg aattgtggca tttcagttcc 38581 tctcaatgcc accttcaaca aatttaggat tccgttagtt aagaacaggc aggcaaacag 38641 caatctgtgt gacaatacac tctaatcatt tatcaccctc tgggggagat tattgtatag 38701 acattgctag tgccccaccc atgctctttt tatagggctg gtgcacccat ccccagctgc 38761 tctgcattct ggccgtttat ggctcacagt tgtcactttc tcaaaggaat tgccctccac 38821 caaaaggagc cacattgctt agagatgcct ggaacatcaa cccgctccaa ccccactcct 38881 ccccgtacca ggctgcctgc agccagtgac tgtctgatat ggggtagaaa acaggcccca 38941 ggctcccagg tgggacaact ttgtgttgtc gtttgggctc aaagctatgg gaggggtcag 39001 gttgaggcca cactcccatt gcaccaacgt atatgcttgc tgccatgcct cactgcctgt 39061 cccagtaaat cacttgctaa caatctctac tctagggtct ttctcagggg aattaaaaca 39121 attgtttttt cttgtaatgt cctcattccc ccagcttcct cctgagctac tatgtaagaa 39181 ttgaacatta ttgtactacc tccttctctt ccaaatacac gttcagacta tcaaacctaa 39241 tcactggacc agcaaacaga gaaggcaaga tagagaggaa tccaaactgg ttttcactaa 39301 acataaatac tattgtaggt ttttctaagt gaaagtagag gcaaaattat ttagagaaaa 39361 tagtttgccc cataaagata cttattttca tttccatggt ctgaatgctt gtttgctaca 39421 agtttgaaac aaattaaaca ctgtgctgta ggcaaaggag aggaatgtgc cagcctaggc 39481 tgggcgacac tgcaagagag tagtatttcc tgaaagcgta attcaggagc caactacaaa 39541 tattttgaat ttcatgaata ttgttcaatt aattctatat gagcacagtt taaataagca 39601 ttggtggtac ttagtaacta aagtttttcc acagatgtgt tatcattatt tgaatgacat 39661 ctgcttcttt gtctatttca acaaatagat ccttgctccc cctgttttga gggatcatct 39721 cagacactaa cttcccggta atgtatcttc tctactactt cagaacctcc ctactcagat 39781 ccaccttttt cagagagctt tctatgtttc ctccttttct ctcacgccca tttacttaat 39841 ccctcgtaac atttaactga cttttgtttt gtttcctgta ccttattttc aaagaagtag 39901 gcctaagcac ctactccttc cctttatttt 
tgtaggggcc tttaggaaac ctcaaatcaa 39961 aacttggaat aactaggtca tggcaagtga ttagaatgcc aatgtattta tttgttttaa 40021 tttttatatt aaataatagc atatatttct ctgcaaaaga gagattcagt aatcaagtaa 40081 tgtattttaa ggtatattta catgactatg atatgtatat ctagaaggag gaaaacccag 40141 ttatactcta aactttactc ttaagcatgc tcaattcgtc tgattttgac tgttatcaaa 40201 gcagaacatg tattaattac aacagtgttt acttgtttta ttttttaagg aagtagcttt 40261 atacggttat tcaaatttaa tttaatttgt atagtaaatg caattgggaa agggatattg 40321 cctggaaaaa aatgcatttc tcacaaatca ttttgtttag gagtaacaga agaatccaga 40381 atgattgatt ctaaatatca agtactacag tctttgtcag ttgtaattgt tctttttgcc 40441 agataaccat gaacatcctt taaatgttgt tgaaatttta ttgactttac tacagaaaaa 40501 aactagaaat gtctctaaaa acttgtactt gatccaatca cttgatttat ggaaagatcc 40561 cattctaaac actgaaactt ctactgtcct ttttagtttt ggggagtttg ctggtccatt 40621 ttttcccctg tcacttctat cagctaaata ttttcaaaat aatatgaatg aatggcatca 40681 tcaaaaacaa agaataccta agtgggtgtc atgaatatgt tttttcctga cttttgtgta 40741 ctctatcctg ttgatgatgg atatttatta tactttatga cagatagcta taggggtcat 40801 ttcacatatg tcataaccat ttaattctgg gaaataattt aaaaattgtt cagcttaaat 40861 caatgttcat gttgcaaaat ctgaataaaa tgcaggtgtt ctaagagatc ttcagtaagc 40921 cagtttgtag tgatgccaaa attgtatttg taatagattg gatggatgtg tatatattca 40981 actttatggg cttttaagaa tctttttttt actaaataaa tgcagtactt tttacttaat 41041 agccattttt atatattagc tgtattttta aaactttatc acccaaggga ctttcatttt 41101 aaagacagca tgttacattt acgacaagtt tgaggacaca taacatcttt ctttgttttt 41161 ttgaacatag agattaatat acacatgaac atacacgtat atatgtgtct atatatatgt 41221 gtgtgtatgt gtatatagat aatataccca acatagataa tattaattat aatgacatta 41281 caacagtttc tttatcgaat gcctactatg ccgtgagcac tctgttggga tgctttacct 41341 tacttcatcc tcacatcagc aatttgatat ggttatagtt agccctttgt tactgcggag 41401 gagatggaga cttggagagt ttaggtaact agcttagtaa gtagctgtgc tatggaacat 41461 tggaagtttc ctcccagtaa ctatttagca atactacctc atcacacctg ttgtcaacag 41521 ttgtaagaca atgagaaaat ataagacaca cagcacatgc taacaaatcc agaaatagaa 41581 ttacttagaa 
tatacacact tattgacaat atgcaattaa atctttttct gatttctgtc 41641 aaggcaggtt ttttttgtta tcgttgttgt tttgtgtttt gttttgtttg tttgttttga 41701 gatggagtct cgctctgttg ctcaggctgg agtgcaatgg catgatcttg gctcactgca 41761 acctccgcct cccgagttca agcaattctc ctgcctcagc ctcccgagta gctgggacta 41821 caggtgtgtg ccaccacacc cagctaattt tttgtatctt tagtagagac ggggtttcac 41881 tgtgataccc acatggctcg atctcctgat ctcatgatcc gcccacctca gcctctcaaa 41941 gtgctgagac tacaggtgtg agccaccatt cctggcctac ctggttactt tctctagatg 42001 ggagatagca atgtgacata aaactgcccc aggggctcca gaactgctgg tacagttata 42061 cttctaaagc actaggcccc aaaccaattt ttctcagatt tccagatgat aaaaatatta 42121 gcatcatttt tttctttttt tttttgtgag tcacaatgga tgtcaacttt ctaacccttt 42181 ttatttctat tttcttcctg ctctttcctg aaaattttca cacagctata gcaaggcaaa 42241 gaaagtagct gggactacag gcatgtgcca ccacacctgg gtagtttttg tatttttagt 42301 agagacaggg tttcaccata ttggtcaggc tggtctcgaa ctcctgaact caggtgatcc 42361 acccacctcg gcctcccaaa gtgctgggat tacaggcatg agccactgcg cccagccagc 42421 agtatttttt ttaaactttt attttagata cagagtgtac atgtgcaggt ttgttacctg 42481 gtatattgca tgatgctgag gtttagggta ctaatgatcc gatctcccag atactaagca 42541 tagtacctga cagtttttca acccttgctc ccctccctac tttcaccctc tagtagttgt 42601 cagtgtctac tattcccatg tttatgtcca catgtaccca atgtttagct ctcacttata 42661 agtggtaaca tgtgttgttc ggttttctgt ttctgagtta atttggttag gataatgacc 42721 tccagctaca tccactgtgc tgcaaaggac atgattttgc tctttcttat ggctgtgttg 42781 tgttccatgt tgtacatgta ccacattttc tttatccagc ccaccattga agggcaccta 42841 ggttgattcc atgtctttgc tatcgtggat agcactaagg cagtacattt tttttttaat 42901 aacatgggct ttatgaggca ctgagaacta aaacaaatga gcacaagagg aggatctgag 42961 gatctgtgct ttagcacatt ataatacttc ttccattggc ttaagaattt ccattgggat 43021 caataaaaaa taatccttat taaaatgcag tgttcgaata taaagaataa ggtgcatatt 43081 gcaacatcac taggcaaata gctgaacaag cagcaactgc aatcataaag tgggaaaacc 43141 cactcaggca caacctctaa tgaggtgtca gtgcagtgct gagggctgct agggaaccat 43201 tgctggaaag tagttttctt gtaatgttgg atgggagggc aaatgaacta ggtagaccca 
43261 gaagctgaag acagactgtc cgtgtctgta aacagcagtc atgctttgtt gtatatgcag 43321 tgtagagatt tttactttca cagaagggtg tgtttgtatg tgtgttgtgg gaggggaggg 43381 aaggagagag agtgtgtgtg tgtgtgggag agagagaaga aattcctagc acacactgac 43441 taatagcatt cagtatactt tgaacattca gtgaaaagaa ttcctgtaat tgttgatatt 43501 ggtatatttg gtattttact tacttgcata tttttaaatt agaaagaaaa ttcttaaaat 43561 gctgcatgcc aaagtgtatc tggcatgtct tctgtgaagt ataaccatga cactctccag 43621 aaataatgtt tgacactctt cccatgcagt tcaatttatg accttcctct agtcctgcag 43681 aaccatgtgg ccacctcctc cagatccatt tatgggatct tttttcctgt agctatgtag 43741 aacctgagaa atgtaagccc tggaacctgg gaagtattga caggaccaac atttctctct 43801 ctctgggtca tcatcgttag gtgtctgaaa tatgagatta gagatgggaa ctagaacttt 43861 ttaaagaaaa gaatcagaat acaaagaaaa cagattttca cttgtgaatt tgtttttgcc 43921 atgtgtacta tggaagtagg agccttatct gagcttgaaa ctaggtacta ggttaactta 43981 gaacagagtc agacaattgt tagaaacaaa gactggagtc ctagattcca cagcttccct 44041 tccatgagac tacagtcctt taattttttt ttttttactt gagtagcgtt ggggtacaag 44101 tggtttttga ttacgtggat gaattgtatg gtgatgaagt ctgggctctt attgtttcta 44161 tcagccaaat tgtgtacatt acacaatagg taatttttca tccctcagcc ctctcccaaa 44221 cccttcttct gagtttccaa tgtccattaa accattttgt atgtctttgg gtacctatag 44281 cttagctctc acttgtaagt aacacgtact gtttggtttt tcattcctgg gctactttgc 44341 ctaggataat ggcctccaat tccatccaag ttgctgcaaa ggacattatt tttttacggc 44401 tgagtggtat tccatggtgt gtgtatatat atatatacca gattttcttt gtgcactctt 44461 ccactgatgg gcacttagat tgattccata tgtttgccat tgtgaattgt actgtgataa 44521 acatgtaaac acaggtgtct ttttgatatg atgacttctt ttcctttgag tagataccca 44581 atggtaggac tgctgaatca aatggtagat ctacttttgc aactgcagtc tttttttttt 44641 tttttttttt ggagtttttt caattatatt attatatata atcacttgga aaaaacgttc 44701 cttttttttt cacaaccttg ccagcatctg ttgtttcttg actttttaat aattgccatt 44761 ctgactgcca taagatggta tctccttgtg gttttgattt gcatttctct aatgatcggt 44821 gatgttgagc tttttttcat atgtttatta gccacatgaa tgtcttctct tgagaagtgt 44881 ctgtccgtgt tctttatcca ctttttgaga ttgcagtctt 
aactgccaac acaaaagtct 44941 tacctgttga gtgaaactct tggtcaagtc tgaactgaaa tagacttaat ttcatgtttc 45001 agttgagtca aggctacaag ttggcgaaag tcaaattagg tctgtgtctt agttcatctg 45061 tactgctaaa acaaaatatc tgtgactggg taatttacaa acaatacaaa tttccttctt 45121 acagctctgg aggctgggga gtccaggatt aaggtgccaa catccagtgt cttgtgaggg 45181 acttcttgct gtgtccttac atgatggaag cgcagaagca caaagagagc taaaagcttt 45241 ggagccttaa tcccattcag aagggtgggg ccctcatgat ctaatcacct cctaaaggcc 45301 ccagctctta atactattgc attagaggct aaggttcaac atatgaattt ggggggacac 45361 actcagacca cagcagtttg caaatgtggt taatgtctac acctaaattg gttttctttc 45421 tggaaacaaa actaaaaagg tcactcactt ttctgaacat tttagtagta agcagatttt 45481 acagtttggc agcctacaac agatggaaaa tttttaggct aaggaatttt catactctgg 45541 gtagccaagc tggtggtaaa ggaagcacaa aatattagag ttcactttag ttctttgagt 45601 tccaggccag gtctagaaag atttgtgaag tgaagaagtt ttggacaatg tatacacaaa 45661 gttgatataa agtgaaggac aaggcataga aggagataca tagaggagat aatagaacat 45721 taaaggtatt agggaaagat gtttcctgct tactccattg agtgtctggg gaaatacagg 45781 gaagggtgct ccatactacc ccagttagca cttctaactg caaacacagg agtaaacaca 45841 gttcctccct ctctactctt ctcctactac cttcccctct catccccttc cacctccagg 45901 tcagaactgc ttttatcttg tatttccaga atgttgggga aagttgagag agaaagaaaa 45961 aaagagaaaa cggttggaat atggctagct agagattctg tggtctctat tccctttaaa 46021 attaccatgt ccatgtccat gtgtgattgg gcacatacac acatgtgcat gagtgtgcac 46081 atgcacacag ttttctttta tatgtaatac attttttaaa gggatgaact tggattagaa 46141 gaaagaaaac gagatggcaa agagagatac taaatgtgac tgcctctggg gagcaatcct 46201 taaatatgcc ctcttcctcc acccctactc atccctctct aaagcagagt gagaggcacc 46261 ttttccctgt tcttctagca ctttggtgaa atctctctgt gcatcttata ctatgtgata 46321 ctgtttgctt atatatgtct cttctcttct tgtctgtgat ccacctatat ccgggatgtc 46381 ccttttgttt actttgagtg agatggacag atcacaccgt gacttccagt gtccttagaa 46441 catcttatgt taaagaatag actgtaggaa gtctgagatt gttctactaa aaactcatga 46501 taaaatgatg tatgactcat tctgtatcat agtttgtgtt ttaacctata ctaatgagcc 46561 ttattttaat tgtttcagca 
gttagtggat ggagggataa aggtaaataa tttctaggca 46621 ctactatgtt attaaaatat tatggacatg tgcaggaatc tatggaaatg tgaattggtc 46681 atgatcttag atcttaagtc ctccaatcta agaaattaaa ttaactaaaa caactcattc 46741 agtttaacca ctttaattct aaaatactaa atcatgccag tatgtctggt tctggtattc 46801 ctcaagccaa aatcatcaat attaacacaa ttagaattcc caagttaatt actttatatg 46861 cccatcaaga ttgtttctca gaaaattcat tcactgtgat ttaatgtgaa tttatatgtt 46921 gagatttatg tctataaaca ataatttttg taggaaaaac tctagaatgc aaagcaccta 46981 tttgtttcag agaataggaa atacagtgaa acttactaaa ctttctaaaa aatgtggaag 47041 gtccttgtcc tctactattg cagcagaatt gaagccagac ccccaaacct gaccaaaatc 47101 agatttttgt cactcttctt tctcccaaat tcataactaa atcataatta tccaggtatc 47161 tattaattac aatgagccgg tcgactcttt ccaataaatt atatatcttt gttgtcagat 47221 ttttgtaata aaaggtcctc ttgtttcagt ttttcatttg tttcaattgt tcatttgttt 47281 cattttagtc tagaattata tacgctttct attttgagag taaattataa ctgttttctc 47341 cttcattgca gaagacaata gatgtgcatc ttcaaatgca gcatcctgtg ccaggtgcct 47401 tgcgctgggt ccagaatgtg gatggtgtgt tcaagaggtg tgccattttt ttttttcttt 47461 ttctcatggt tgacttgagc tatagcatga cttcattttc ctaatattct gtgccttgaa 47521 ggtgcatcag gaaaaataaa gatactgctg aacataagga aaattttatt tggttttatg 47581 cttggaaaca gcatatctgg ctgaatataa aataaatcat aacattttta ttaggaaaat 47641 tatggtaatt ttgaaatgaa gtgcaaacca ttttatgttt tccatgtgtt ctatagagag 47701 aattggcaac cacagctagt aacttttgat agttaagtag gctttggtgt ggaattcttg 47761 cttcagcttg aggcattagg ttatgtatcc tgttgttcaa aataagtgtt tttctgttct 47821 accaagagaa acgaatccta ataacaaaat tcttagtaaa acaattacaa tgcattccat 47881 ctttcttggc tctagggttg ttttgtgttt tcttcttttg tcagtaatct ggagttagaa 47941 ctgcctgtat ctagtaaatt gcactgaaga agcgatctgc taatctgggg tttcttcttg 48001 tatgagattg cctaaataat tttacattta cttatgtagc tcctgggaat tggaacaaac 48061 aaacaggtct ttaatggctt tttgcttgac atggagataa gtctctagct tttgtctcct 48121 ctgacttaac aactgtgcct actgttatcc ttccaacctc ctttattact tcttggagaa 48181 aaaagaaagt tataaggtag caacagatgg tggatatcac tttcttctag gaagcaagtc 48241 
agaatagagg caggggaggg gttcaagaaa cagtaggcat gtctgaaagt aagtctccct 48301 actgatagcc taaaggcttc ccagatggac tctgggctcc accagtgaga aaaccagatg 48361 tcattcatga actagaaagt ggttcatcag aacccgtgaa gccaggcatg tcatagagag 48421 gacaacttca gagtttctgg gtccgatccc atcactgcct gtgagccaaa tgaccaaaag 48481 aattcacttt attctagcat ctccctttcc tcgtctctat aacctcaaat ttggtgtctg 48541 cttccaggtt tttctctgtc aagaatgttt catggcttgt tatttcctga ccccaaacca 48601 gtacttcatt ttagattttt actgcttcac taaggaaaat ccttagagcc ctgaagtttg 48661 tgccaacttg ctgtcaactg ctgaatgctt tagtgagggt agtttggaag agccttcaaa 48721 agatagtaca gaagcccagt tgtcagccac atgtaacatg ttagccatag aacatcaggc 48781 ttcagaggga ccccttctcc ctgtcacacc tttgactgga agaaaataat ctggagagtc 48841 acagctctgg gaatagctgt cagtcctgcc atagaaggag ggattttgag tattcggagg 48901 aatcagttgt ctgatagaaa gagctaagtc aaggagaaag gaggtggctg ctggagggga 48961 gaaggaaaac ctgtcactgt tggcaacatt cttcagatgg gatcttaggg aaaacggtca 49021 gaaactcact tgggtgctca tgttcccaaa tattctatat agcgcagtgc agcctagttt 49081 atgtattgaa agagcaggct ctgaagaaac actgagttgg attcaaatcc tagttccacc 49141 agtttctaat gttataaact tgcacatatt acctaatcac tttgaacctc agatttccat 49201 acctctaaaa tggaaataca aattgaaccc gcctcacagt cttgtgagga tgaaatgaag 49261 caatgtggga agcactgggc acagaggcat ggcttaagaa atgggattta ggactatata 49321 tagaactttg acaaggttct ggaacaactc attcacggtt aggaaactag gaggattgcc 49381 tgcattaata acataacctc aggggaagca gccagtttta ggactcagcc actatccatg 49441 ccattaacag gagctgtcca ctccctgcag gctgacctga cgtcattgag aggaagaatg 49501 tattatttaa ttttggaaga tccctcaccc tttaggtgcg gatgtgactg atgccaatga 49561 ttcctcaacc agcaaggctg gaaggagagg caagggagag gcaggctttt tttttttttt 49621 tctccccatg attaggttag ataaatggta acaacgtaaa cctggcttga ataattttca 49681 aggctaatca aaagtaaaac gtctctagaa aatttagaca ggatgtatcc agtttgcttg 49741 cactattatg tagcatgtta atctgccatt ccatatactt tgctttctct ttccttctac 49801 actacataag taattggaaa ataattagga tagaaacagc atcatcaata ttacctagtt 49861 taatgcccca tgttcctttc taacatctat ggaagacatg atatgtcttg 
aagaattccc 49921 ttcctctcct acctgttgtg tctgatgcag agtgtcatgc gtagtgagag ttcagagcgt 49981 aggctcggag catagttata ctttcacagc atatatgata ggaaaagttt cactgtcaaa 50041 gggaatatga tagaattctc tcacttctgt gcaaattgct atgtttggaa gtcaaaaaga 50101 cccaggcgtg ggtaggaaaa aaaaagtttt catatatata tatccccaga atactatatt 50161 aaattgatga aaataacaac caataaaaag ccaaccaggt caggcgcagt ggctcacacc 50221 tgtaattcta gcactttggg aggccgaggc aggcagatca cctaaggtca ggagtttgag 50281 accagcctgg ccaacatggt aaaaccccat ctctacaaaa gatacaatag ttagctgggc 50341 gtggtggtgt gtgcctgtag tcccagctac tcgggaggct gaggcaagag aatttttaga 50401 acctgggaga tggaggtggc agtgagccga gatcgtgcca ctgcactcca gcctggatca 50461 ggagggagac tctgtctccg acacagaaaa aaaaagccaa ccagagaaac ttcagtctga 50521 gcttttcctt ctcaaatagg ttttagatgc agaaagttgg agtaggattt ttgcttgtgt 50581 ttttcttgta gaaatagtaa aaacataagg attagaagaa attaaaatga tataaaatac 50641 caaaaatgca atatttgcta tgtcattttc tttggatgta attgatatac actaaaaaca 50701 tcagtgattt tctttctttt aaacaggatt tcatttcagg tggatcaaga agtgaacgtt 50761 gtgatattgt ttccaattta ataagcaaag gctgctcagt tgattcaata gaatacccat 50821 ctgtgcatgt tataataccc actgaaaatg aaattaatac ccaggtgaca ccaggagaag 50881 tgtctatcca gctgcgtcca ggtttggtca ttttcaaata aatctataat gattcttaac 50941 ttgaaattgc aatttacttt aatttgctca tttttaatat tatgtggcag gcagtattgt 51001 tcttgtagaa accacagtgg actcagaatc aagaaattag aattctagtc tttatagtgc 51061 ctgcagccag ctgtgtgtct tcggacactg caaacctgac cacactatat tctaattatg 51121 tgttctgttc acataatctg atcaggttgt gatgtaattg tgtttttcta ctctactgcg 51181 agccctagga gcaaggggac atgtttattc cactctcttt ttccaatgcc tagcagagca 51241 ctagcatgta gtaagtactc tgtaagtatt tgttagcttg attgatcctt tttggaaggg 51301 atcttccaaa attaaatcaa tcaagcaatc aatatttttg tgatttaatc acaaccaact 51361 tgtgattaaa agtttattac aaagttctta gagactttgt aatagacttt taatcacaaa 51421 gctgcaattc atgattatca taattacttt gggcagtttt gtgatggccc ctgagattag 51481 tatacacctt ttctcctata cctcttttgt acttcccctt ccttcacagt gtatttcaca 51541 gtttggccta tgttgtaggc cactggacat 
tattcgcttt cactttcctt cagtgctaca 51601 gaatcctggt gtaaaaacta catgtagtaa gataaatcaa ggtacacaga tgcgcaccac 51661 tcaattggat gcatgtagtt taacgcagtt cacaattgta tccaatcctt catcatttca 51721 ttgtgggtgg tcctgtcctg tcattaattt ttaaataacc attcttgaca cactcccttc 51781 cccattccag aactgtgcta tttcttccag actctgatta tcagaaaaac aaaatccatt 51841 actttcttgg ccctgggcat attcccagtt tccaattaaa actgtgctct ttcccatatt 51901 aaataatgac acagcccagt atggacattg tgagcaacag ttggaaagga atatacaaaa 51961 atgaaagtaa ttagattggt aagattatag ctcttcccac cttccattgc aaaaaattat 52021 ttagtgtcgt aaataaattc ttttttagaa aaagagagac agggtgaaac aaaaattgta 52081 cgttctcgta cacctaattc cagttgaata catccactaa actctgccct tagatattcc 52141 tataattaca catgcctctc tcttgcctgt gttttctttt gcctttatct tagtactaga 52201 aagtttgaac attttccgta agctgttaac catctttatc tctctttttt gtgtaatttg 52261 tgtctccata caaatatgcc tatttttcta attgaatgcc aattttcagc agcttatata 52321 taaagcttaa cctatatcat ttgcctctaa taaatatgac tacattctta aaaatgtctt 52381 actctttttt aaattcttga gatttgaaaa ttataagctg cattttattt atttattttt 52441 gttatgctcc ttcgtgcaca ctgaatataa gctgtatttt aacgttatag ttgcccaacc 52501 atctttaatg atttttttta attttactgt tatcctacta ggtttgtaat cccagttttt 52561 ctgatctata tgagcctgag attatgccta acatccccca tttcttatgc atttccatat 52621 ccctagcaaa actataatct agggcctttc ccattgttaa tctacctgat tcttcaggat 52681 cttaatccag gccatcgccc ctttcagcac acatgtaatt caattatatt tcagctttct 52741 gactcaaatg agtctgaaaa tacttcccag ttcacacctg aatttaagag taagttgtta 52801 atatcatctt tcctgcaaat ctttcagaat tctcaattcc tgcctcttca ctcttcattt 52861 tccttcaaca ggattttcaa aacagcagaa ctgaaaaacc cacaccttaa tatttgcaca 52921 tgaagtaaat aggaattata taatagaata gaaaaataat aaacagtctc ttagtctgtt 52981 tggctgctat aacaaagtac gttgattggg tggcatacac aacagaaatt tattttctca 53041 tggttctagg ggatggaaag tccaagatca agaggctagt agatatggtt cctggtgaga 53101 ggtctcatcc tggcttgcta cgtccacaca gggcagagag aaagtgatca agagagagaa 53161 cacacatgca aaagagtgag agagtgaaag ctctccagta tctcttctta gaagtgcact 53221 aatcccatca 
tgtcggctcc atcctcatga cctcatctaa acctaatcat atctctcaga 53281 gaccccctat tcaaatactg tcacattgga ggttagggct ccaacatatg aatttttcgg 53341 aacacaattc agtcgatagc atttgataca taaataatat tttcaaaagc ccatggaata 53401 tttttagaaa tgaattacac tttaggcagg aaaacacaat tctaaaaata aaactgttct 53461 tatatgccaa tatctacaag catggtgtaa taaaatttta agtcaataca aaataataaa 53521 taaatattaa ataacagcaa gtttcagaga agcctctcct aaataattct atctccaaga 53581 ataaattgaa actctagtcc tataagccaa taatagcagt aacaaacact tcaaaatata 53641 caggttgtta ttgatgatgt tgttacaaaa attcacaact ttaagtgctt ttattattaa 53701 agaaaaatga atccagatga actaagcatc caaagatata agaagaagat aaacaaaaaa 53761 aacataagaa aaacagagaa aagagttcct aaatttaaaa acaaagcaaa attaatgaaa 53821 tgcaaaatag gaaaaaaaga ataaaatgga tgtataaatc cagagaacca gagagctggt 53881 tctctgtaaa gtctaataaa atagataaat tgttggcaaa tctaattaag aaaagagggc 53941 aaaaaatcaa ataacctaat tatgtaaacc ataattatag aaaaaaattc agaaagaagc 54001 ctttaagaaa tattaagcac aaaaatctat aataaactgt ttaagtcttg agataatttt 54061 ctaggaaaat atgtctaaaa ttgactcaag atagagaacc ctaagaaacc taatatacaa 54121 gaagtttaaa aagttataaa tgtttataat tataaaaatt ataattacct cccttaataa 54181 gaaatactcc ctaaaaacat taggtctaaa tgctttcatg agcagaaatg tttcatgttc 54241 attcaaatat tgaaggaata catcatctct gtggtatata aacttttaca ctactttatt 54301 tatttattta cttatttatt attattatta ttattattat tttgagacgg agtctccctc 54361 tgtcacccag gctggaatac agtggtgcga tctcagctcg ctgcaacctc tgcctcctgg 54421 gttcaagtaa ttctcctgcc tcagccccct gaatagctga gattacagga gcctgccacc 54481 atgcctggca aatttgtttt gtatttttag tagagacagg gtttcaccat gttggccaga 54541 ctggtctcga actcctcgac tcaggtgatc caccagcctc ggcttcccaa agtgctggga 54601 tttcaggagt gagccatcga gcccggccta aactcttaca tcactttagg ggatgcctag 54661 aagctactta attattacaa ggttagtata accctattat caaattctta ataaaatttt 54721 agcaaatcaa attaactgta tgtgataaga agaacaatat caccaaacaa ataggcttaa 54781 gctagaaaac caaagatagt tgatttttgg aattgatgta tttttatgat gttagtttgc 54841 caaaaagaga taaaacatat tgttatctca atagataata aaaaataatt caataaaatt 
54901 tagtgcccat ttccggcaaa accaaaatcg aagaatactc ccttaggata tgttaaaatc 54961 cttctcaaaa gtatacataa atatgattac atttttaaaa atgaaataag agttataaac 55021 attttgaaaa gataaagcaa aattatcatg ctatggaaaa tccaagcaaa tcaaatgaaa 55081 ctaactattg aatactttta aattttactc tggcaggttg ttttttaaaa tattcaaaaa 55141 tcagtgaatt tcctttgtaa tattaaatat aatgaagaaa aatgatcaat tgtgtgatta 55201 tctcataaaa gcaacctaga atgtaaactg tttaggaata aacttttaaa aaagtataag 55261 atccattgtg ctacctaata gaaaatacga ataaatggag aagaatttta tgctcctgaa 55321 tataatatca taacaatgac agttctctat atagtctata agtataacaa agtccaagtg 55381 aatcaataaa tgtaagacaa tcaatcccag tcaaatttga atgcagtttt ttaaaacaca 55441 tcttcactga aacttattat gcatgtatct ttgtatataa ccgcatattt atttagcatg 55501 cattcctggc agtagatctg ctaggtcaaa tgtatgacaa tgtatttact ttttaatgta 55561 gattgttaca ttattcacac aaaaaattcg atgttcagtt tagcctttca ttagtcaacc 55621 atgcctgctt tcattctttt taatgtctgc tatttgttta gtccattagc tgttttcttt 55681 tgcctttttc ctagaactag aaagtttgaa catttttcct aaggtattaa tcatcttcat 55741 ctctcttttt ttgtgtgtga tttctgtctc catacaaata tgcctatttt tctaattgaa 55801 tgctaatttt cagaggttta ttatatatta tttgcctcta ataactataa ctatattctt 55861 ttaaatgtct tgctcttttt taaatatata agtttgaaaa ttataagttg cattttttct 55921 actggtatgt tacaaataat ttatacactt caatgagttt agcttgttct tttttataga 55981 tataatgatg atttggtcct tctgagacta taaatgaaca gttcaaaagt tggaaaattg 56041 tttcagtgct gtagacaagc aatgatagtg gtaatgacac aggagaaaac taggtaaact 56101 tgagagagaa tggagaggtg gaatccacag gagttggtga ttgataggcc atgtgggact 56161 cccaggatct ggcaagcaac ttagtgtggt agggaaatct gtatgaatga agtggtgttg 56221 atttaaactg ggatgtcttg agttgagaga gagagactta agctatatga ggctgaagct 56281 caggggagag ataggggttg gagatagaag agaaacagga gacatcaaca ggagatggta 56341 agtgaacgca gtggggaaga gatcatgggg agtaggagag tgaaaaaaga gaaataattt 56401 tatgtcagtc attgtggagc ttgattttat taaatttccc attactattt gaatttctat 56461 gtgcaaatta gcataattgt ctaacgtgtg taagtcatct ttccaacata gttgtgagct 56521 ccatgaggca gaactagttt ataaccccca tatatagcaa 
atatttatta aattaaactg 56581 aaaatatgtt cttaaaaaat gaatttgaca tcaaagacat atccgtctta gggcataata 56641 tataaagcta aatgttggca accccagata tatagaaaca tactccatga tgtgaaagtc 56701 cttttcacca tgagatgaat cagaatcaag ggcttgatat aatgcatacc taagtggaaa 56761 atgcattgag ctcttaatat taattttgat ttatcacttt attcttctct gataaaaatt 56821 tatgacttcc aatgtagcat ctgttaagaa tttttcatga atatatttat tacatcatgt 56881 actgactagg gtttcttaaa aaatgttagt aggactatat tttaattgga tgttttaaat 56941 gacatttaaa aatgacattt tctttacttt tcacttctta taatatactt tttcttattt 57001 tcacttatga atcttatgga aaatgataac agaaatagac tattgatcct agttgatgtt 57061 ctctaaggaa actggatatt aactctttag gaaaatagga cagagaaggg aattctctaa 57121 ttctaattca gaaataatac aatgtatatg ataaggatta gtttaaccta tatttcatta 57181 acttaagaaa atgtttcaaa cttggttcgt tttaataatg tgcaaaaaga atcttctaat 57241 ttagtgtaga tttagtaaga ttttccagat atgcttcacc tgtttcttaa tttcaaaatt 57301 tagaaaacaa aacagaactt gcacattttc tctccacccc ctggcacaca ttcctcactt 57361 tatttcccac agaaattagg aaatgtactg gagtctgtac cttgccatta taaactggtc 57421 ccatttgggg gcccttctcc ttcccctgct gaagtataat gaatagaaat tggcttatca 57481 ttcttccact tacagctttc cacattttct cttggtttta aaatttcatc ttaagcagat 57541 cattttttta cttatctttt cctagtgtgc agtcatgttc gttgtgccca gtacatcaga 57601 gatgctcaca ccattctttg agtagtttaa aaactcattt taaccacttt ttattctttg 57661 tattcaaacc aatcactggc aatagctcta agtaggtcat caactctcct ccatgtcttc 57721 tttctaattc tgccacagac tcacttcttc ccgttaaaat taatggaagg aaatgagtgt 57781 ctgagttctt agaatctcaa aaggcatgag gataaagctt tcctggagat aatataagtg 57841 gtggcaggaa gatttgggag ccagatgata ctcttttcct cttagagaaa ctctgtggaa 57901 gctctgccta tactgtggga aataaattct agacgctggc ttctttctgt agtaaacatg 57961 tgggcccttt aaaatgttga accaaaatgt gcttcaaata tagtttagtt ataaaacatt 58021 tatgggggac tatgtatgtg ccaactacag aggcttcaga gatgaagaaa cagttcttac 58081 cctagtgttg cttagaatct agtagtagta agtaataatt actaacatat gcatttacta 58141 tataggcaat actagggtaa atattttaca tagattacct tatttagtag ctcttagctg 58201 ctaaaaaaaa aaaaaaagat 
aaagatgtcc agtctagagt ctcataattg tatggtaaac 58261 actaaaatgg tggtatgatc cagttgccat ggaaacacag gggcggggcc ctcagctcag 58321 tttaggaagg agcagattac tgagtggtgt ctttaactgg taattacatg aagaagaaga 58381 ggttgggcca ccacaggcag aggaaggacc acaggcaggg cacaggagta tgcatttgta 58441 ggctgtatat ggggaagtaa agcagatttg tcttgctaga gctaaagatg tagacaagag 58501 taagtattat gtaggaactt cgatctgaca tgtttagcca ttgggcattg gccatctggt 58561 ggagagctgt tgacaaaaaa taagtaattt ggattttaaa aggatcgact cggtggtata 58621 gataatgatg gctggaaaat gagacgatag aggtagacat atgaagaggt tatggaaata 58681 atccaggcaa aaagacaatg aagggcaagg atggatagga gtatatccca gtattctggc 58741 atgctagagt tcaattgaaa tgtttaactt gatgaataat aggtaaggaa agtgagtgaa 58801 aactaaacta ctagattctt aactcttaaa cccttcactg ggtcaaaagt aatgagtttg 58861 aacatagtgt cgaaattaac ctgattttac cttgtgtttg aaattctaca ttcataaagg 58921 aaaaaaaaaa ggaaatgtta tggtcggtcc caaattatag tcaattataa ctattttcct 58981 caaagttggt cttacctatg tttagtggta agaattgatt gtttaatata ggaaaaaaga 59041 cctaaaattg gtgagataat gtgattcaaa ttccaaaaaa aaaaggaaaa ttagatagct 59101 ctgcattcag aagaattgaa tgatatttta aagacattca cataaacaca cacccaaata 59161 gattcttcac atttacttgt attccaaaaa cagaaaacta ttgaaaagaa tgaaggatgg 59221 taacaaagta catgaacaaa aaaacaactg aagcatcacc agaattttgt tttctgttgt 59281 agaaacttac ttaggttact ttctgagact ggcaaagaaa gataccagaa atttaaagta 59341 atcattaccc tttatcttca acacccaaac cccctaaaat gacaagaaca gctataatta 59401 atttaatgtg aaattaaaaa gggaacactg acatatcatc tattaatatt caatacaaat 59461 attaacccat ggtgccttgc aactaaaaat atctggagag cacatgccag gttaagttta 59521 atctttatat aacaaattca gttacatata actgtttata atacctgata gttggaatag 59581 cccagatagt ggtaaagtat taatagaaat atattcaaca tggcttgtgc aaaattaata 59641 aaattcttta ggtaaaatgg ctttgaaaat catatattat gcttgaggag gagtgaatgt 59701 gggagggggt tattgtcatg gggcttgtcc tacttgggag gaaggaggaa aagtgaggag 59761 caatcattcc ccttcccttc cttcccctcc ctaacatttc caggctctgt ttaaaaccaa 59821 tggatattgc cattaagcac gtgatgaaac tgagggtgct gtccctgtca gggcagggat 59881 
ccatggtcta tcaggacatt tctgctgtcc ttttcatgta gccacgtccc catgccaact 59941 agagttcctt tggtactaac tctgaaacca ctgatgggaa gtggagattt tcctggaaac 60001 cctgaaatgg gtcatagtac ttcattaatt aaatcatttc cagggtcact ctaagcccat 60061 gttttgtacc ctattccatt tctcctccaa aattatgccc aggcatcttg tccaaaatag 60121 tttataagtg gctctatgat gagatcacgt acaaaggtta agtgggggtg ttaactaggg 60181 acacaatatc cacagtgctg caaaaaggta tttctatgtc ctcttactgt ggaatactga 60241 tgcctaggac acctgccccc aagagtcttc tcctttattt ataattatat gatccaaagc 60301 caacttgaga gcatattatg aggactatct caaactaatg ggcatgacat ggtaatccaa 60361 gcttgacccc tttggtttcc tccttatcaa gcagagcacg atttggggtc agcctagcta 60421 ggtctattta ccttgaattg actacttaaa gctagaccct cacagcaagc tcaagggtct 60481 tgcagtgtaa aaatatcaat tattattctt gctagcgatt actaaaaggc ctgtgtatgg 60541 aaatactctt ttcctcttta ttagaaactt ctctaccaag gcctcagctg actccggtat 60601 acgtgctaag gccagaattt atagaatcca aaatcgagcc acagcagccc acactctcac 60661 accccaacca tttggcagtg ctttctcaca gcctctcttc tttgctgagg ccagcgagac 60721 cggaatttgc ttgctaggct atgagcagct ggtttctgct gagggccctt gtgtggctag 60781 catttggttg gagggccctg aatagcttac ctccaaggac tgggcttttg ctgatgctgt 60841 acaggccctg gttgtttgct gaggtcacca acctatacct ttgctgggtc cctggcacaa 60901 tatgctgacc atctattaca tctagcagta cctctgaaaa cagccattta tctttagaca 60961 gagccaccat aaactgcaac gattcaccaa gataaaaata ccatgatacc atgtggccgg 61021 tttacaaaaa caatcttttt taaatctaca aagatagcaa cacttttcat caacagcagt 61081 tcacctagag agagtcgaca ctttttgtct gagtgaaaaa caaaaccaaa aaaacgctga 61141 tccacacatc aaggttcaag tccctgaaca tgaaccgctc tcccagtagt gttgaatttc 61201 tacttctcat cgatctggcc cagtgtacca gtgtctaaca ttcccctgaa tttaatttct 61261 ccactcaaat tgcagaacac tgccacagga acagttcttt ctgtcaacag aagcccaaaa 61321 gcagagcaga atataggata tgctctcagg aggcagccta gcttctgaag gcaagggatg 61381 agagaatgga ctgggctggg ccatgtgtgc cctatttctc ctttcaaata tgggccaaaa 61441 cctaggagca ttacactaaa ggggttgcct gcacagaaaa agcaaattag gaatgagtag 61501 tccctttctc cataattatt attatatacc agcaaaacct tattattttg 
aatacagtag 61561 gtaatttggc tgaaaaacat ctgatttgtg aaataatcga aaagaaaata ttgctatttt 61621 atgctaacac caaaaatata atttgataaa ttagaaggtt ctttaaatag gtaaattatg 61681 tatacaaatg aatgtttatt ttaaaaatgt tttccattcg gaatattctt atggaaaaat 61741 aatgacttac aaagcattta cttaatagtt taaaaaagta aaatttacaa attacaaatt 61801 gcttacattt aaatagatgc cttttcacca tatacttgaa attaaaagta gaggtaccaa 61861 ctctatagag ggttaggggg gacaacccaa gtccttgtat caaatagtaa tatgacagag 61921 aactaaataa aattgtagtt ccctgcgggg actaccgaca tttgtccttg tggcagcccc 61981 ctctcctcat ccagttgttt tgcccaaggt gggtgcctcg ctcatgggcc ccctcctcag 62041 catgtgttca ttgaacacct accgtgtgtt ggacaaatgc tgacccctac cttccccagg 62101 tcctccatgt ggcactgcag ggcaccattc tttagcccac tgtctcattc cttcacttgg 62161 ctaactgatg ggaagaatct taccaactaa ctagtgatga ggtactaatt attttgaata 62221 gcttgctgta ctacacagta ccagagtttt ctcatctcct gtgtaaatac actttaaccc 62281 aaaggaataa atttatgcct ttatccaata ggagatcacc tttttttaaa gataccttcc 62341 cagttttcct caagtcattc cacacgtaca tccccctcac tgtcttgaga tcgaatgccg 62401 cttgctggtt tgcactgtgt ttttctagct actaaggcaa tggaggttca tataatgata 62461 caataccctg ttacatttat atggagctgg cccatttgtt attatcccta ttcagcctat 62521 agaattgtgg gaaatgaact gtaggatttg gagaccaata aattggtatg gttttcaaat 62581 tgccttataa ttatactaga ataataatta tactataatt attataatta cactagcata 62641 ttccaaattt tatgtgcaaa tttgtgatga ttgtgctagg tgctagaatg tgactaattt 62701 attaagagct tatcctgaaa agtttaaaat gaagactaca tgatgttttt cttctattga 62761 actaggagcc gaagctaatt ttatgctgaa agttcatcct ctgaagaaat atcctgtgga 62821 tctttattat cttgttgatg tctcagcatc aatgcacaat aatatagaaa aattaaattc 62881 cgttggaaac gatttatcta gaaaaatggc atttttctcc cgtgactttc gtcttggatt 62941 tggctcatac gttgataaaa cagtttcacc atacattagc atccaccccg aaaggattca 63001 taatcaatgc aggtatctag ggtttgatgt ggatatgcta aattaatttt ttgcttataa 63061 aatctcactt tgtttttaat gtaaagaact taccaaattt tatattttta tcttataaaa 63121 taagaataaa aatatttggt gaggtgggga ggtttcttca tatctgaaaa cttctgagat 63181 gaactgtaga ttgggctttt tattttatat 
ttctttaaat atttagtcag tttttatcac 63241 caaaaaaaac aaaaaaacaa acaaaaaacc ctacctaatg ttagtaactt tgaatgaagc 63301 ttcaatagtt aggctgccat gtatataaat acctctttgg aaattctgta tttttatttt 63361 ttgattgctc caaaattgct gaattatgat ttgacacatt ttaaagagtt tgtgccaaca 63421 cgtcatctat ctctccatta ctgtactgtg actggggata atgatgttgt ctccgatgtg 63481 actacaaaga aatctcagtt atcacaaaga aagtgggtca aaagatgaga atttgattgc 63541 ttagaaagag ccttttttct tatcatttta gatttctgat tctaaataac atatgtccca 63601 ctatgaaaga agaattagcc ttaaaaatac aagtgtctca cacaatattg ccattagcct 63661 gtggaatttt cagaaaaata aatctgtatg gatctttttt cacaaaagac tttatctttc 63721 ttgcttctac aaattgctta gattacttta caatacacat tccaatttac cgaaacagct 63781 ctgtcatctg ttgtaccgga acaggattga tgcatccatc aatggagttt cataagcctt 63841 cttgaggtga atgtacagga ggaatcattc ttactttctg ggtatcatag attctcaaaa 63901 taattttaat gaatattgat tgttcatcaa taattccacc aagcatcact ttaaaaatta 63961 gtgcagtatt aaaaaaaact aatatcatat ttcaactaaa ttcctcaaaa gcattttcca 64021 gaagtgtttc tcagctgttc aatgataaga aatcactgca cattcttata ttcaccttaa 64081 aaattcacag cgcagattaa atattgaaag cctttaactg tcctatatta aaaaacatgt 64141 ttaatcatct ttattttttt gaataccaaa tgcttatagt cttatttcct atttgcaaag 64201 aaattgtcat cagcattgag cttatctcta attttataag catagatttt cgtcgtataa 64261 agtttccctg gcacaaaaga agtactccct tttcacacaa ctgtacaatt ttactgttaa 64321 gtattccaga atgcttaaga agagcaaaat aaactttaca cttcttcttt tcccacacag 64381 tgactacaat ttagactgca tgcctcccca tggatacatc catgtgctgt ctttgacaga 64441 gaacatcact gagtttgaga aagcagttca tagacagaag atctctggaa acatagatac 64501 accagaagga ggttttgacg ccatgcttca ggcagctgtc tgtgaagtaa gacgtttcac 64561 atgatcgagt gtttgctaag atgccaaact ttcaaagaat ttagacgttt tattttctct 64621 ttgatttgtg gagtgaaaac gtagtgtaga agaaaacata tcttactatc tcttactcct 64681 caccaaggtg attcatttcg gtatgtgttt gggacagagg ggctaaattc agggagggat 64741 ttaaatagga aaagacaaaa gggaaagaat aggttatgaa cgcgaaaatt ctttccaaaa 64801 gggctgcttt tctcatgcgc cacactccaa ggaggaacta gaagtagact gttgccctcc 64861 tcacttcagc 
ttgttctcct actttttttt tttttgagac ggagtctcgc tctgtcgccc 64921 aggctggagt gcagtggcgc catctcggct cactgcaagc tccgcctccc gggttcacgc 64981 cattctcctg cctcagcctc ccgagcagct gggactacag gcgcccgcca ccacacccgg 65041 ctagtttttt gtatttttag tagagacggg gtttcaccgt gttagccagg atggtctcga 65101 tttcctgacc tcgtgatccg cccgcctcgg cctcccaagg tgctgggatt acaggcatga 65161 gccaccgtgc ctggcctctc ctacttttta atctatattg ttcttcctcc agtggaacac 65221 tagaaatctc tagccaggaa ctagtgtggt cacatggctg tgggaaaata gaggatgtac 65281 aaataaaagt taaagctatt tacgcagcag cgttgtaccc acgcacatcg ttctagcctg 65341 acttactgtt cgtcaactca aacgtaatgt ttacttgagg taaaaaccac agtacacatc 65401 tgtgttattg tacaaagtct aattacatgt ttatttttaa gagtcatatc ggatggcgaa 65461 aagaggctaa aagattgctg ctggtgatga cagatcagac gtctcatctc gctcttgata 65521 gcaaattggc aggcatagtg gtgcccaatg acggaaactg tcatctgaaa aacaacgtct 65581 atgtcaaatc gacaaccatg gtaatgcagc agtaaccgct tagtagatat gactcttttg 65641 gttaaagtac aattttaaat tggcaactca actatttttg taccaggaaa ttacctacaa 65701 aagaaagata cagaacaagg ataagccatg agagagaatt tatggaacag tgcaatctat 65761 aaatactgat caataaaata tatgaataaa acagtttgac agtttctagt agctgactct 65821 ttgatgaatg acaaagcact gtataatttt tgaaaataaa aatgatacaa gattttactt 65881 tttatgtaat tctaagacag gggaaaaaag tatacctttg ctttatacaa atagaccagg 65941 tctacggggt cgatatctac aaacatttta aggaggaaac ctcacagtac aataggtaga 66001 aactaaatag aaaccaggga aaaaagacaa gagagaaaga gagtcaaaca aagcaggctt 66061 tctctgtttt gtactttacc tctctgcatc ttatctattt aaggtctctt tttaaatcag 66121 tgactatctg gcttgcttcc tgagggttag tataagctct gaatcatata tttaatataa 66181 tcacctagaa ccatatcttc cctggcatta tctaattcta gttgtgaaga atcaacagtc 66241 caaagcatca tttacttctg atgttgacac ttatttatat attcttgcat atgttaaaac 66301 agaacctctc caactttctc agttcatgag accctagtct ctcagtgatt ttattccatg 66361 acacccgtgg gtcaaaagaa atacctaata gttctgtttt tttaagtggt taagtccaaa 66421 taactcaact atttatgtct taacaactta gtatctcctt aaaaataata caaatgtatc 66481 aagtgtagag atacatgttt cagttttagt aagtaacttc agaaattttc aaatcctgaa 
66541 atcagattgg tcatcactac cctgatttct tcttccgtgt tgattttcaa gttgaccagc 66601 ttggtaaaga ttaatgtcat ttaaaagaat agaacgaaat gtaatcttaa aactatgaac 66661 tccctccaac tagttcttca ttggtacctg atacatgctg aatgtcactg tatttctctt 66721 gaagatttaa attatttcct gccacccttg ggagtgtgct gcagtgcacc aagacacctc 66781 acacacagtt tgggaatcac agtatcagaa gcatgaaaat tatgctgaaa tttgtgtggt 66841 tcccatgggt aaattttaag tagatcacca gttcatctgc aataccctac cagatctgtt 66901 gaaattctag agtggcaatt gtattaccat atgaaacagg ttcaacactt ggtcatttta 66961 gtttcttgaa agtagtatca aaatataaag attttaaaag gtaactagag attgaattca 67021 aaaattcaag aattggagtt attccaatag tgtctttcca taattctctt ctactcagcc 67081 ttatttcact ttgacagacc cacctacttc tacttagtca tttcttatct atctttcctt 67141 ctatttaacc tcttttgatt gatggtagag ttctaattta tttattgggg agtaggagag 67201 ggtacaggaa gaattattct attgtgagac aaagagaagg tttttatttt tttaagaaga 67261 aaataacaat gacattattc tcaaattgta ttattctttt gaagttttgc caagtcactg 67321 aagtcatctc tccataattg tatttaaaag gatactattg ctcatctagg gaggatttcc 67381 taggcaaaga aatgctgtca taattttagg catataccat gttatagaac cttatggaaa 67441 taagttataa gtccagtgga attttagata gggcttggat tcttatgaca tcaggaaatt 67501 ggatgattac atcaagacct tttaccttct tggtcattgt tcccagtatt taaaggagtg 67561 acccttccac atgtcactgc tgtagtatat tgagtaatcc aaggaataat gtgtttgtta 67621 acaccttaga aggatacttt tattttattt accccaggtc tttgttttaa aggattatct 67681 tcctaaaaat tctcattccc ttcatttgta gggtcactgc ctacatatta ggtaaaagaa 67741 gctgaagcat tttgattgat cccttggcca tgaatttata cattcaagat gtgctcctgc 67801 tttggaatgt tctatattga aagctacctt gactttgact ctaggcatat ttatttgtgc 67861 atttcaaaat gacctctatc ctcccctatt tggccattat ggaggaattt taaattttgt 67921 ttaagacaag taatcaaaca ctttgcttct tggctttatc tccagcagcc aatgccaata 67981 atgttgaatg gtatctgagt ttcttgattc aagagtcttg gggtttgtat ttataaaagg 68041 aaacaactga ttaaaaaacc ctttatatgt ctaaattaca tgggacagag ttctgttgct 68101 tatacatata gctatttcct gaaggtcaat gtgaacccta gactaataac aactgccatt 68161 ttattgagta tgtattcaac tgaaagcact gtagcaatca 
ctctgcatac agcatcaaat 68221 ttatctttac aacaactcta taaggtagtc ataattagcc tccttttaca aacagcagga 68281 ctgagggtca gaaaagctaa gtaagttgcc tgagaataaa cagctaaaac attaataaat 68341 tgcagagttg gccctagtct gtctcaatct tatattaaaa tctgcattct ttactttgcc 68401 acagtctgtc gactgaaatg cagaagaaaa agaaaaataa tgttcttgtt gtcatttctt 68461 gtctttctgg ttgcagtctt atgaaagtat aaccagatga tttgacttgc ataagttgta 68521 aaacaacata taaaataatt tatatttaca cattgtgata tttgtatatg gttgctggca 68581 aatgatgtta cactattgaa taaattacat tttccactgc tttttcatca agatcaagtg 68641 ccagaataaa aaactgtagc aatagtatag ttttatgatt tatccagtga taatgacaat 68701 ttgaaatgtg acattctttc taaaatggca gatgttaata caaaaattat taatgatgta 68761 gtagactatt aatactgagg gtttggaatt tcaccctgtc tccccaccca gcttctccag 68821 ggttatttct acccttagtc agagccctcc cagaaactgt tcacccaccc cttggtggag 68881 gccaagaata gcatgaaggt cttcagtaac aatgaaagaa gaccaaggaa caagtatgtg 68941 gaggtcatcc aatgaacctt atcaatggga ttttcagatg ttgatgtgac tcacagtttt 69001 aaatagcgag attcaaattg gcaagatcct accacattcc tattgtatgt actagtgcat 69061 accatatatt cactagtatt tactggttta ggcttggtgt ttttccatgc agccttttat 69121 atgcctcatc acattatggc cagtgattct ttatccctcc tatcatatac ctcctcagaa 69181 tccccttact gctggcttca tagttcattc cttttcctgc aggctcattt ggataggcta 69241 gctgctgagt cgatcagcat cttatagcat ggaggcagaa ttgcagaagg cctatcaggc 69301 attaaacagt tttgctttct cactggcaaa aatagtcaac actgtgcctc tcctcatgca 69361 actttaacat gtacttgact tttaagtatg ccatttagga agctttcttt ttactatgaa 69421 tcttatctac ttggaattat acttaatttt tccaggactg cagatgaata aagagagctt 69481 aagaagctac tttgctaatg taatccataa ctttgctacc tgagttaaca aacgttcata 69541 acactattac atttattctt tctgaaaggg atatttctga cattttcaaa gtctggaaaa 69601 catttataac taatgttgag atcagacaga attccatgtg aggagataat tgtgatcttt 69661 taataaagag atctgttaat tttgtttgct gtttcccgac agagacgtgg ccttttctac 69721 ttcagaaaat gttaaatctt tttctgcttc tcaggtttgt ctcagatgca tactaataaa 69781 aatgcaaatt catggaacac ttattttgtg ccaggtatgg aatacaacta gaaagtttta 69841 agtccatgtg cagcttttaa 
aaacgattgt tccatctttt ggggcatcga aagactacaa 69901 agtacctgat ttcagcttgc tgaatgatct gcttcaaatt gtaaacacta tgagaacatt 69961 tttttttttt tgagacggaa tctcactgtg tcgcccaggc tggagtacag tggcaccatc 70021 ttggctcact gtaacctccg cctcccaggt tcaagcgatt ctcctgcctc agcctcccga 70081 gtagctggga ctacaggcgc gtgccaccac acctggctaa tttttttttt tttttttttt 70141 tttttggatt ttttagtaga gacggggttt cactgtgtta gccaggctgg tctaaatctc 70201 ctgacctcat gatccacctg ccttggcctc ccaaagtgct gggattacag gcgtgagcca 70261 ctgcgcctgg cctgagaaca ttttaatagt aattttatag gagaactaat ttgaatgctt 70321 ctcatattct caaaaagagt ggcctggaaa ggatttctgc aaacaatatt tagtaggaat 70381 tgttttaatt gaaagcaaaa ttgcatgtag aaatcacctg tgccagactc aaagtagcct 70441 ggctctgctt gtgtaatgcc tttcaaatgg ggtaaaatta tggtttgcaa aagtgacaca 70501 ataaaattca actttgataa aaatttttat agaatggaat attctcatta aattagatgt 70561 tgtaggtaag tcatacttaa gcagaaagct ttttgttgcc attattattg aaaaacttac 70621 aaaaatcaat gcataccctt cttcagtaaa tagggtatcg tgattacaac tgatgtcata 70681 tttgccagga gccacaaact caaacatctg cagggccagg taagaaagat aagcaagtga 70741 cataggtcaa ggaaagcaca tccaagagca tatcttgcct ttaggaaagg cctccaaatc 70801 tctcttatta agcagtcatt tgttaaaaac tgtcttttct tgagcagacc tatggtgttt 70861 gtctctttat gttcagttaa cagcattcaa cattccagtg aaatgtagat attgctttac 70921 ttctacttta taaatgagga gggaaaaagg tgctaaagag atcagcatct tgtctggaat 70981 cacacaccct tatatagcag tgtccatagt agactttaag ctgtttgaca tgaaccgcta 71041 ctcatcattg gtcttaataa ttacagaaca gtaagtgaag gctttacgaa gttaacctat 71101 ttaatctcca ccatctgagt ggtagatact agtgtttata cctgctttac acctatggaa 71161 actgaatccc agcaaggttc agacagctgc ccagtgtcac atagctagta gtcgcagagc 71221 cagaagttga agcccagatg atctgttttc agagcccctg ctggtggtgt ggttctcaac 71281 tggagtgatt ttaccctcca gaggacaatt tcatcaagtt atgacatttt gactgtcata 71341 actgtggaaa gagggtgcta ctggcacctg ttgtgttgag accagggtgc tgttaaacat 71401 cctacaatgc acaggacagc ccctttgtca aataaaaaat atatccagcc caaactgtca 71461 atagcactac tcttgagaaa ccctggtcta tgcagttatc acagtgcaac tatgtagata 71521 
caaaattaac tatcaggaca gatgttgact gacacatcat tgaaagtttc atctctaaat 71581 aaatgtctgc tatctgatta cttctatgct atttgcctct ttcctctaaa aaatcacgca 71641 gataaaaggt ttttatttaa gttgtttggt ttttatatct tattacaata ttccatacta 71701 taaattctat attctattat aaaaatttat attatggatc tcacttgtat aaagagaatc 71761 ttaaatgctt ttttatccac gtagatatac acattattaa aattctccca tacatctgta 71821 cacagttatc tctttaccca aaattgcccc caatgaatct ttcttacctc ttctttcctt 71881 ctgacattta taagttactt cccacagcct taattactta gcagctgtct tggcacattg 71941 ctttccttct cccttttgcc cctgaagtgc ttgttatctc tattaattat tttcagagtt 72001 tacagaaaca atatgtaatg ggttagacag agagggagtt ggagttcagg agcagtgatg 72061 tcaccagaac aggagagttg gctgagaggg agcgtgtctc ctgctagtga aataaaagat 72121 aaaatacttt tttgaagata ggctgcttca tgaaggaaag gggaaggggc ttaggcaatg 72181 gtcattaatc cacaggtctt ctgtcaacag tggtttattt aaacatctgt tatacatttg 72241 taatttttaa atatctgcct tgaggtactt ggaattttct tttaaaagag atttctctct 72301 agggctcaaa aaatctgttt ttgtaggctt gaaaattcga ctaatctttt tttttctttt 72361 ttttattata ctttaagttc tgggatacat atgcagaatg tgcaggtttg ttacctatgt 72421 atacacgtgc catggtggtt tgctgtaacc atcaacccat catctacatt aggtatttct 72481 cctaatgtta tccctcctct agccccccaa cccctgacag gccctggtgt atgatgttcc 72541 cctccctgtg tccatgtgtt ctcattgttc atctcccact tatgagtgag aacatgcggt 72601 gtttgttttt ctgttttcct ttgttagttt gctgagaatg atggtttcca gctttatcca 72661 tgtccctgca aaggacatga actcaccctt ttttatggct tcatagtatt ccgtggtgta 72721 tatgtgccac attttcttta tccagtctat cattgatggg catttgggtt ggttccaagt 72781 ctttgctatt gtgaacagta ttgaaatgca tgtgtcttta tagtagaatg atatataatc 72841 ctttgggtat atacccagta atgggattgc tgggtccaat gatatttctg gttctagatc 72901 cttgaggaat caccacactg tcttccacaa tggttggact aatttacact cccaccaaca 72961 gtgtaaaagc attcctattt ttccaaatcc tctccagtat ctgttgtttc ctgacttttt 73021 aatgatcgcc attctaactg gcctgagatg gtatctcatt gtggttttga tttgcatttc 73081 tctaatgacc acgactgatc ttcttaataa gcttctgacc aaactagaac tttctagatt 73141 taagttttat gtacatttgg tcaaagtgac ttcctgtttt tggaaagggg 
aagagattaa 73201 gaagtcttca aaagctgctt acattttaga gtgccactta aatattgatt ttatgtcagt 73261 taagtaaaag ctggttgaaa acatgtatta aattgtcctt aaaatacaaa tgtttccaaa 73321 aattaagaag ggtgaatgct tagcaagctg tgactatctt gtcaaagcaa aaatacttga 73381 aaagcagttg ttgtcattaa aaaatatttc caatacagac agatttacct tgcctctgct 73441 tgttctcttt attcactagc attatgcaaa ctgtcatcaa tctattttaa gattatgcct 73501 ttcattagga acaagtaata agaatttagc tggcagaatt tgtgtttttg gagttttttt 73561 ttttccattg ctttacaatt atattcttaa gaattgctct gagaaaaact tttcctatgc 73621 agacctacgt gttaatactt tttaagcttc ttcattagtt ggtttcatat catgtgtaaa 73681 ggcaactcat gaaaatttga tttgtttcta tcacaacaat tatagatgac agagaaataa 73741 agtgttattt gttacttaga acttacgtta taagtgaaac tagttagtct agccctttta 73801 ccaagttcac atagttgtca caatacagaa atgtttacca agttggttta acaggatttt 73861 tccttaaaga aaatacagtt aacattggag agtgcagtgg gaagcaaaaa ctatggagta 73921 atttgaactt aaatgttatg atactaattg ggctcagtta catactaatc tttggcttat 73981 gccattaggc accagtgaat gaggagagac tactgccagt gaactttaac atcatctctc 74041 tgataagtat ctagctatga acttgttatt cgagacaatc attttaattt ctagaagagt 74101 atttatagaa aaaatattac caataatata cctaactata taactctaga tagtcattta 74161 tgtaatcatc agatttctca tgttctctaa actaaatttt atttcaattt tcctactctt 74221 ctctaacttt ctcattacca caaagagtgc cacagttggc acaacataaa ttttatttat 74281 caaagtaaag caggtataaa atatatcata caacaaaaat attgagcata atgataagtt 74341 gagaaatctg taatctatac cttgtataag cagttgaaat aagaatcaga attgctatta 74401 gttttgatca cttactgcat gcaagaaaat gcactaaata gttcacatac tattattatc 74461 tttttaatct tgccagcaat catggaaggc aggtatcatt cacatattac agaagatgga 74521 attgagaact tttgttgtaa ggtgacaaac ctaggacttg agtagagcat ttcctgcttc 74581 taaagagcgt tactatatca tgttatatat atatgttata taactatatc atgttatata 74641 tcaaatgtgt atatatagta tatgtggatg tttgaatgtg tatgtgtata tatatgttta 74701 taggcatata catatacatg catacatatt tgcagttcaa tacttgttac tgaatttcac 74761 cttagaatgt gattgccact aacgtgtctc tatttctgaa tattaaatta tctgtcagtt 74821 acaccatttg gagtccagtg ttaatctaaa 
gaggtgctgt tcactctttg ttagttagtt 74881 gaattcagct ctcttcatct ttttaagtta caggttttta tgggactttt gtctataaaa 74941 tattttaata catggatatt gatgttatgt tttcaaaaat ttaaatgtct tcagtaaaac 75001 catagtcctt cccaggtaca tttttatgta agtcatacag ttttcttcat attagatgtt 75061 tgaataggat ggagctatga aatttcattc ccttatttat tttattattc attacaggaa 75121 cacccctcac taggccaact ttcagagaaa ttaatagaca acaacattaa tgtcatcttt 75181 gcagttcaag gaaaacaatt tcattggtat aaggtatgtt aactctgaaa atgttaaaat 75241 taaatttttt tcattggtaa ttgaaattaa gacatttctt tagaaccaca gataaggatt 75301 ctggaattat tatactttgt acaatctaaa ttctatattg tattactaaa gtatccttta 75361 caatgtctta acttagcatt taagcataaa ttctatttag gtttgtaaca tctagaacca 75421 aatctgagaa ctgagaattg tattggatac acttgagtcc agcctgatgc ttagggtggt 75481 ctcaaatgtg acaatggtgg agaagactta tcaggcaaaa agtagttcac aatccgtctt 75541 tgcatcttgc atgtctagag tgcttcttct tattctggta catgagcata tcctcataat 75601 aggttaaaat tagagaatat cagcttaatt ctaaatagca ctgacacacc tagctttggt 75661 ctaggtcatt caataacgga ccaaaagggg ccatcgaagt gtctgagcca gtcaaagcct 75721 tcacacataa gaataatctt ggtggtgtct ggagtctttt gggccaggtg agaccctcaa 75781 gcaaaataga cagcaactgt gtcacactgg aggtgccatg gaccccccag catagaactg 75841 ttcctctgca accctgttat agctttcttt tcacaatctt taaggctctt tcttcctaga 75901 aactctcttg ttctcatttt catgaagcta tgacacccta attttcattg tgcatctctg 75961 tttacttggt ctccctcatc tcctttgcta tctttgtttc tcccatctgc tccctaatta 76021 tacatttccc ctcagtggcc agtccccagt gccctctact tatttttaaa aactgcagag 76081 aaattacatt ttcttggctt caataaatac ttgaacccat ttccatttgc cacaagctgg 76141 tgttccccct accagtttcc ctattaccta cctaccttat ctgaaagcac tactttaatc 76201 gtaacacttc ttcctttaga aatctatggt gaagctttat ttatttattt agggtaaatg 76261 tatggagtac aagtgcaagt ttgttacatg tgtagattgt gtagtggtga agtcagggct 76321 ttcagggtgt ctgtcactcg aatagcacac attgtaccca ttaagtgatt tctcatcatc 76381 caccccactc ccaggccctc acccttccaa gtctccatta tctatcattc tacatcatgt 76441 atacacatta tttagctccc acttatgggt gagaacatgt gctatttttc tttctgtgtc 76501 tgacttagtg 
aaactttatt taattccttg tttacccaaa tgtatatacc atggaaccac 76561 tttttaaggt aacaacaaac taacatctgc agaagtagga ctctggccaa aattaaatgg 76621 cccaaaccac tttattagca atcgtaacag cttccactta ttgtacactt aactattgtc 76681 aggcattgag tttgtaaata cttattaatc taatcctctt agcaactcat aaaaatgagg 76741 acagtcatct taatatgccc attttaaaga tgaggaatca tattttggga ggctaaatat 76801 ttgcccaagg gtttctaaca acccaaatcc tctgctctta acctctaagc ttaatagttt 76861 actttgaaat accatgcagc tttttatagc ttttgcttcc tctttgctgt ttctctaaca 76921 ttatttcttg gcagcattat tggctctagc agtttcttcc tattccttta aaaatttgcc 76981 attaactgat aaaccataaa accaggttta ttattaacta aattatgcac aaagtcagaa 77041 tcaactggct ccaaagcctg ggaagctccg gtgtgagccc catgcatgcc cctgtagtca 77101 cctcctttga ctgcctagag atctttttta tcaatcacct ttcacctggg ttctaacaac 77161 ctcatggctg ataatccatc ctccatctgg ctgccagagt gatctattga aatcacaagt 77221 gtgaccacag tactcccttg agtaaaagtt tcagcggtgc cctctgaatt agaggaccaa 77281 atctctaagg tggcacatga ggccatgtct ctgcttgcct ttctggcctc attgccccac 77341 tgttccttgc cttgcaggga tgcttctcat ccctccagcc atgcttctaa tcctcctgtg 77401 cacaaacact cagccaggct gattctctcc tctgtgcctt tgttcttgct gcccaccctc 77461 gcttctcatg tctggaagcc cttctccctc attactttat taactcttac tcatacttta 77521 aggcatggct tataagtcat ctcctccaga aaacagtcct tggcaacctt ccccagctct 77581 caaatgctgg ccaggtatcc ctttcattag ggtctgtgca catctctatt gtttcacgtg 77641 ttcacatgta cagtgtttag tgaagaaatt gtcatttata tgcttgtgag atccttaagg 77701 gtggaaacta tgtgacctag tgatttggca tctccaaagt gtataaagta ggagctcaat 77761 aaaagtctaa taaatgaatt tttaaaatct ataaatgata catttttcct tatgacttga 77821 gatgccaagt tgttagtgtc taagtaatgg ctggcatcca gcaggaactt aaggatattt 77881 gtgaatagga taaaatgcaa gaataaatgg atttcaaaac tttttaccat ctgatttcct 77941 ataagtacgt gccctttagt cagattggct tttttctatt tttgcttcat cttcgattca 78001 tgcagacctt taaagccttt gctcattctc cctaaaaata cctacggtct tctccataat 78061 aagcagcatg catcaagaac acatacccca tgaagtttcc acttgtgact atgatttttc 78121 ctaactaaag aagttttggg aaatgctgca ttaaataatg ttaaatcatt tctagattgc 
78181 aggcctactc tgaaactttt aatttgcaaa tgtgcatttt atgtttctaa gtggaagaaa 78241 ccatatgcag aacttcccaa gctcatttgc cctcaaaact tttgtatgtt taaaatctct 78301 tgggctttac attccatgga agaaatttgg gagatgctca tctgcatttc gattggattt 78361 ttaggtcatc aggtacagtg attccggatg cacacacagt catgcacatg tacagacaca 78421 cacaaccact acatgggtat atgttctggc tgctctacta ggttgcttat atttttcttg 78481 tgtcttatag gttttcattc atttaatggt tcaccttcaa ctgataacta caaaaatatt 78541 tactatttgt tggttgctaa cccattacat actattgtca ttgtttaaat gctactgata 78601 tgttttatag gatcttctac ccctcttgcc aggcaccatt gctggtgaaa tagaatcaaa 78661 ggctgcaaac ctcaataatt tggtagtgga agcctatcag gtatgtatat attagataca 78721 attttttaaa taaaattttt acctcaattt ctgtgacaag gtacaaagta gtgccatgtc 78781 atctcgagag gaggagtaga aagcatatta gattaggaat taggaatctg aagttcatcc 78841 ctggccctac cactaactag ctttttgctt tgggaaagtt atttctgctt tctgtgcctc 78901 acttttctca ttgacagtca acagagattg ggctagaggc tatctaaatt tccttctaca 78961 tccagtgctc agtgaatcat caagtcatag ctatagaatg tacattatta ccatgaaatt 79021 gcgtcatttc ctatgccgta atgataaagt agaagtccac agttacgtaa ggcaataatc 79081 tgccagggca gttaaactat ctgggatata gcaattatta agaggagtcg gttgggatct 79141 ctatgatcaa ccacttgatt gttcaacaat tattttagca tttttatcaa aagagtttga 79201 ttaaaagaat caccttagta gatactctgt tttcttgagt tgatggttgt atgctgtcat 79261 cagtcctatg aagagtgaat gcctcaatgc aaagaattta aagaaacatc tgttctttca 79321 acctgcgtca ttcctgaata gaaccacctc tgttaagatg ttgtgaaaat agtgactgct 79381 acttttttac cactaagatc cagtcacttt acatacatga cctcattcag tcctcacacc 79441 tgacctataa agcaggtggt tagtacccac attgtatatt agtagaaacc aagactcagg 79501 aaagtgaaac aaacttatca aaggttgttc aactaataag cagaagaaag tgggatcaga 79561 atccaggtgt tcctttactt caagcctcag gtgcttcctc ctctgcgata ccacctgttt 79621 ttcattgttt tacagtgtgt caaaagatat gctttatgct accttgaatt tatgggccaa 79681 ctatagagtc tctttcccaa attctctgac tcccttccta acacgcagag tttaatagtt 79741 aaattatagc caaagcattg tggaaaaacc tctcgtttct cttctcgttt tctccagaca 79801 gcaccttcct ttagcttggg cacaaaagag aaaggcagga 
agtagacaca atatatgaaa 79861 tgaagaactt aggccgggcg cggtggttca tgcctgtaat cccagcactt tgggaggccg 79921 aggcaggtgg atcacgaggt caggagattg acaccatcct ggctaacacg gtgaaaccct 79981 gtctctacta aaaaaaaatt agcagggcat ggtggcgggt gcctgtagtc ccagctactc 80041 gggaggctga ggcaggagaa tggcttgaac ccaggaggcg gagcttgcaa tgagccgaga 80101 ttgcgccact gcactccagc ctgggcgaca gagcaagact ccatctcaaa aaaaaaaaaa 80161 aattaagaac ttgtttaaac tttcatcttc taaaaggaaa ataaatcttg cttgatttgt 80221 tttcactgag gatagttttc tatgtctgaa aagtacgttg agtaaaataa ggaagagcgt 80281 cttaaattac atctttctca ctgttacaaa aaagaacagt gagactgaag agggacttat 80341 cacagaaaat caaatgtgtc tacaattcat tcccatcagt agtctgttcc caagaataat 80401 caaaataaga caagagctta tgctgtaaga aaccactcta aattctttag tatgaaccca 80461 tttattactc caaaaaaagt atataaattt acagattttt acctgtgtgt cagactgccc 80521 cagtggcaaa ggagacagca acatttggtt attttgccta gcataggtat gcaatgcctt 80581 agtccagtgt agttctcatc aaccgtaaac agtggaaaaa taatattaaa tccaacaaaa 80641 acttactggg gctaactact ctccttaagt tggaattctg ttattatttt atatatgttt 80701 atattatcaa aaatcttcta aactttcatt ttgtcttcac agtccaacag aatcaattca 80761 caatccattg aagaattata tagaatctgc ctctgtacct attagatact aaaagggaag 80821 ctcaagatta tgtctggggt tggatttctg aatccccaaa ctgggcaaaa ttttaaggat 80881 ttgtcttcac atcaggcttg aaaaacattg gcctaactag ctaagatctt ttccttttct 80941 ttcttttttt ttttttttag atggaatttt actctgtctt gcccaggctg gagtgcagtg 81001 gcgagatctt gactcactgc aacctccacc tcccaggttc aagtgattgt cctgcctcaa 81061 gctccagagt agctgggatt acaggcgtat cacgcacacc cggctaattt ttgtattttt 81121 agtagatgca gggtttcacc atgttagcca cgctggtctg gaactgctga tctcaagtga 81181 gccacctgcc tcagcctccc aaagagctgg gattacaggc gtgagccacc acaccaggcc 81241 aactaagatc ttaagagaag cagaatttta gaagggtaaa ctgaggatgc tcattgcgtt 81301 aggcatttcc ttctgatctg ctggttttcc acatatgaac atctttgtat accactggaa 81361 gtttatattg atttaccaag attatacaat taattgttgc agtgtgtatg agctctctgg 81421 aagacatagt cttacctact gtaaataata tttactttga gaaattagag gaaagtctct 81481 tgttctaaag atacgctacc 
agaattccat gatcaattat ttgtcagtaa gcacgtgttt 81541 ctttgttgat tataccattg aagaagagaa atcactcaag taaagaatta gaagccactg 81601 gaaattattt ttcttttcat aacccgctgc tcactctctt tctttaggca catacttcgt 81661 ttttcctttt ttttgtttgt tttttttttg gcaagtgaag gatttttgca tgtggtagac 81721 tcttagttgt atggaataaa atgcacacag atctgaagta tacattaaag gtagctggat 81781 cctttggaat ttgaagattg taatgaaaga aattagtgtg tgcctgccac acaataggca 81841 ttaaatagat atttattgaa tgagtgtata catgggtgaa taaatgaatg aatggttttc 81901 tgcccagtaa gaggacctat tttggtatta aggattgcag caagcttgac tgacacaaac 81961 aagtcccaag gacttgggaa aaattctaag ccattcttgg agagccttat agaactatat 82021 ctgtccagta attcttgctc cttcagtgta gaaatggcct ccctaaaatg tacttgtcaa 82081 ttctgtgtca ataagtcgtt ttagctatca tgatagtctc aattgtttga aacaaaaata 82141 tttctttagc agctttatat ggcagaagca ttttccacat tttaaggtgt caagatttac 82201 cctttgtgat gatatatgtg ttctcaaggt ttaattacac tttatcataa aatatagtta 82261 ttactggatt aaaaatgact gttttgttaa cagactgata aatatctttg aagggctcaa 82321 aatgcagtca tggaaatatg taaaaaaaat agaacaattt gaatgcagat aatctaatcc 82381 ttcagtattt cttatgtatt tcaaatataa ttgggtcatt atgtggaaac actgttttta 82441 ttcatccttt tatagaaaca gactctttga tctagttaaa attaaatcta taatgattta 82501 ataagcttga ctctgctttg ttgctgacat ttgaaactac tctcgtatgt aatttaaaaa 82561 taatgtctct gcagaagctc atttcagaag tgaaagttca ggtggaaaac caggtacaag 82621 gcatctattt taacattacc gccatctgtc cagatgggtc cagaaagcca ggcatggaag 82681 gatgcagaaa cgtgacgagc aatgatgaag tatgtgggtg tgcatttttc ccttttaaaa 82741 taaactaagt tgtttggcgt actttcttac agaaaaagca aaccattctg aaaaatttta 82801 cttactactg actctaagaa tcttatttat acttcaagga aactttcatt gtctctcctc 82861 aaagttccta cttgttttag ctcagctgat tgttctctcc aactctcagg ttttaatatg 82921 tatatgtgtg tgcgtgtgtg tgtaatatga aacaatcatg tatcaatagt aactttaaac 82981 ttaggcgtca tgaccggtat aaattaggaa atcacatatt ctcaaataag tagtgtctgt 83041 acttataata atgataatgc ctttttgatt tcatattcac aaactctcaa agtaaatgta 83101 aaattgaaag tcacatatca cagaaaagaa tgggaatgat agaacagttg ataaaggcta 83161 
agaatttcaa aatctttcac tttgtgattg ccccagtaca ctcataatta tctaaattat 83221 ctagaaggag gtagggaaat ttattaagaa caatacaaaa tgggaagagt catcatttac 83281 caaaaaaaaa aaaatgggat gtaaaggaga gagatccaga ttgtaacagt atcaggaaac 83341 agcaccagag gagatagaga ttctccaagg attggcacaa cggagagagt aacacaaaga 83401 aacatactct ttaagggagt ctttgctggg tgaagtgatt ttgattgagt catttacgtc 83461 atgaggcatt gtctcatgtt taaatgaatg acttggacta gtagatttcc aacccctttt 83521 aatctaaagt gcaatgatga ttaattatgt aatgtcatgc tatagaaaaa aagaaaaagt 83581 tattgaaagt gatagaagat tctaaaaatg aataaaagag actcttctct ttagttagaa 83641 tatgtttttc tatcttagga attttagagc tgtgtatttt aaattaatta ttgcaagttt 83701 ctaattctgt ggtttcttct atgatcactc cgtggatttt gttcaaagga catcatatac 83761 cagtttgcaa aatgcacttt ggcttttcat taaaactaca gagaaaactt tttactagca 83821 aatatagcac taagttactt tatgaaaaaa cgcttactgt cttagaaata gttatttcct 83881 tcaaaatgtc attttggaag aatggttttt cttagctttt acattgtttt ttttcttttc 83941 atttgcttac atgccatctt tccttcattt tcaattgagt agtataaata gtttaactct 84001 caggaatagt catatgacct cttaattata gaaaccccag agtatgatct actgttagag 84061 atggcagtgg ccacattaac ttatgtatct gttaagcttc agcattctca aagacaaaag 84121 cattttaagc ataacaccaa ctgtattgat tttaatatta tggaacaaat tctaatgttt 84181 catgtatata taagccaaat ttagatactt tatattatta tatcgtagat atatattcca 84241 accaaaaagc tcagcgcaaa ttaattttaa attgtaatgg atctcaatcc aacgtgaaag 84301 aaatatacct tcttatttac caaagtacct aaagatggta aggtgagcca acaaatataa 84361 atagacatat taaatcaatt gtgttgtgca aaaagaagaa atagagaaaa agaaaacatt 84421 tgaccaattc ctgccagcac agctccttag ttctgggcta tcttcttcca tggagacaaa 84481 tagataacct gtgttgaggt cttaagattt gcaaatgaga gtcctaaaca tttttctttc 84541 gagtaacatc agagaaggcc acaggttttc ctccactgaa accctgtata aactaaaatt 84601 taactccgaa actttacttg ggctcctaga aaataggaac aaaattaaaa ctgttcattt 84661 gaacatctta tctcagggtt ttgattttta tttatttatt tatttttgag atagagtctc 84721 gctgtgtcac tcaagctgta gtgcagtggt gcaatctcgg ctcactgcaa cctctgtctt 84781 ccgggttcaa gcaattctcc tgcctcagcc tccctcccaa gtagctgaga 
ctacaggtgc 84841 gcgccaccat gctcagctaa tttttgaatt tttagtagag atagggtttc accatgttgg 84901 ctaggctgat ctcgaactcc tgacctcaag tgatccaacc gcctaagcct cccaaagtgc 84961 tgggattaca ggtgtgagcc accacgccag gcctcatctc agggttttaa accagtagag 85021 cagaagccct ttactttggt ttatgtgatc tatctctagt gtcaaatatt caatacccca 85081 cattttaatg gctgagttta gaaaatggcc tattctttct catgtttgct aagaaattaa 85141 tcatcagtga tggggtagat gcttcagata ggcttgcact caagtctatt taatggtgca 85201 atatagggat tagtagttat tatattcttt aaccctcaag ttcattcatt ggaaaataaa 85261 tgactttaaa aagagatgtt gcatatatat gaaagagatt acatgtgttt cttttacaaa 85321 tatcgttaat ttctctaaga tggtttctta tcaattcagt tattaacaga tataatattg 85381 gtaataaaaa taattaaggt atataaatat atttcctttt tcctcctaaa tttaggttct 85441 tttcaatgta acagttacaa tgaaaaaatg tgatgtcaca ggaggaaaaa actatgcaat 85501 aatcaaacct attggtttta atgaaaccgc taaaattcat atacacagaa actgcagctg 85561 tcagtgtgag gacaacagag gacctaaagg aaagtgtgta gatgaaactt ttctagattc 85621 caagtgtttc cagtgtgatg agaataaatg tcattttgat gaagatcagt tttcttctga 85681 gagttgcaag tcacacaagg atcagcctgt ttgcagtggt cgaggagttt gtgtttgtgg 85741 gaaatgttca tgtcacaaaa ttaagcttgg aaaagtgtat ggaaaatact gtgaaaagga 85801 tgacttttct tgtccatatc accatggaaa tctgtgtgct ggtgagtata aatatataca 85861 ggcagctttt acttgtcact attacaccag catagatgtt aaagtgtctt taaatcacat 85921 gtttataaat tagcaaaact gaatctgaat ttgttattca aagtttcaaa gtccatgaaa 85981 ttagggcatg gcaacagatt tatagtacgt gattttgata tttatatctt aacctatttt 86041 atacaggaaa tgcttcaaaa taaaaaccta aaattaattg ttattctcat gctttccttg 86101 atcttggaaa gaaatatacc agaaaaggaa atgctgatga tatctgacac attttccaga 86161 agaatcagag tattcttcta gaaggatctg attcattctg gtaaatggtc actccctcca 86221 acttgctcag aggattttat ttctgtcagg atgatgggga gtgagctata agagtatact 86281 tttcattcat atgaaaatcc ttaatgcatt attataaatg gtattctaag ctggcagtaa 86341 tttaagcctc tccagtttat tttgggccac caccagcaac aaaactgcca aaggacagca 86401 ttcttcaaga aaatactgcc ttactaagta atctaaaaga ttcgtccttg aggccacaca 86461 cagaccaaat tttaaacatg aaataaatta 
aaataaaaat acgaagatca ctcttgggcc 86521 attcaattta ctgccaccta caggttagat tccagaaaat ctaatcaaca ttagcaaact 86581 aaaacaaaat actgttttta aatgaattaa ataaactacc tgtcatctga gttctgtgat 86641 aattttcctg tgacaatgtg ctaaatttac tcattgagaa aaacatcttt ttagtatttg 86701 cattttagta aataaatatt tgttaaatta ttgacgaatg gaaacgtcat cataagcata 86761 tgggacgtaa caaacctata aaaaataaat gctataatgt agtctcagct gaaaatttgg 86821 cctgtatcac ctaaatgtgc aatggaaaat tgtcttttta ttctgtaaca tacatagaat 86881 atcttaatat acaaacattt attcaactgt aactgacatc atgggtagga tggattatag 86941 atgactttag aatgaagcag cagttgtcta ctgtatttta tagggaggaa aaatattatt 87001 ttacagtgaa gagaaagaat tttctgaagt cacgtcccac agctgtcaat ctctcaagag 87061 ggtttcaaat acatttgagt atttgtccca gtctctgttt gcagtgaaat atcgaatgta 87121 ctagcttttc tatatgagta tgatactact gacccacaag cagttgagaa tctgctttga 87181 attgtttctg tgtaagtgca gactgtaact gacatgtact ttctccacta gtctcctttg 87241 ggaataaaaa cacatgttca caggcctgct ggcggggagt cagcataagc ctgcacagca 87301 cagagggctt ggccgacatt tgtctggcct ggacctcatt agttttacac ttagcaaatt 87361 gagctggtcc tacagtagaa tctgtggaac aattaccctg tgaggtgctc agcaataaaa 87421 gaagagagag agggagctaa gaaaggaaag gaagggagag agatagaagg aaggaaggga 87481 gggaaggaag gaggaaggaa gggagaggga gaaaagagga aggttggttt gtggctagat 87541 aagcttcaga attcctatat catcatgtga aatgtacccc cttttgacga tttctagtgc 87601 acattggcat attaagtttt ggacactttt actgtctaaa atccatttaa ctatgtgtaa 87661 ctatgttttc ttcaccccac atgcataatg ttcccccacc acacacacca tctttctttc 87721 tttttttcat ttaccatcta agaactgctc acagtataga ggaggatctg ctctgctaac 87781 ttactaaata gtgcactgag gatgtgaatg caattgttca acttctcaag agacctgatc 87841 tttaaacatg gcactagcag tactgaagca ctagaaaagt ttgcctcaca gtcctgcaaa 87901 gttatcctag gccttcagca acatcagaag actgagagaa taggtgatgg aagccatatc 87961 aaatacctgt atcttttcta atttcaagtc aggcagttaa cattactttg gcctttgcag 88021 tttcttgata tgtgtgtttt taatgtaaaa cggaatatgc aatgacattc aagtagtctg 88081 gcagccactg aagcaggggt gcccttcgtg cttgttagcg gagtgcatta gcatggggag 88141 attatctaca 
gtagaacagc tttgaaaaca gaccctccta aatccagcac tgtttgtgtc 88201 agaccaatga aaaagtgatg ctgttcccat tcattggact gtcagtgcgt ttccatggag 88261 atgatacagg aatccatggt tgaaaactct ttacaaagtc agtgatccag ataacatagg 88321 tcctgcggtg tcttcctcac agggcatgga gagtgtgaag caggcagatg ccaatgcttc 88381 agtggctggg aaggtgatcg atgccagtgc ccttcagcag cagcccagca ctgtgtcaat 88441 tcaaagggcc aagtgtgcag tggaagaggc acgtgtgtgt gtggaaggtg tgagtgcacc 88501 gatcccagga gcatcggccg cttctgtgaa cactgcccca cctgttatac agcctgcaag 88561 gaaaactggt atgatttctt tgactccaaa catacacaaa gaatagctac ttttgtcctt 88621 tttcgttctg acttccttaa tcttaaacgt tgccagatac attgtgacca ccatgctaaa 88681 aaagaattaa atgagaaatg ctatgaatta gaagacttga attcaaatct ttgacccctg 88741 gatagctgtg gaagcatcag agtatatctg gacactcact accttagctt tttttgaaat 88801 tagaggttgc ctgctaccca gcttactatt tagtaagcgt gtgatctcgg acaagttgtt 88861 ataattttct gtgcctcagt ttcctgatat caaaagtgag gatagtaata ataccttgct 88921 gtatatctgt gaaagtatca aatgattcat ataaaaaaac tcacctttgt acttgatgca 88981 tggcaatcat agaataaatg ttagctgtta ctgctatgtt ttatatttaa tatatatata 89041 tatttttttt ttgtgacaga gtctcgctct gtcacccggc tggagtgcag tgacatgtca 89101 ctgcaagctc cgcctcccgg gttcacgcca ttctcttgcc tcagcctccc gagtagctgg 89161 gactacaggt gcccgccacc acgccgggct aatttttttt ttttttgtat ttttagtaga 89221 gacggggttt caccgtgtta gccaggatgg tctcaatctc ctgacctcgt gatccaccca 89281 cctcggcctc ccaaagtgct gggattacag gcgtgagcca ctgcgcccgg ccttattttt 89341 aatattttaa tgataaagaa caatggtgtc tgtgacacac cctagaaata agtgatatat 89401 aaatgaaagg ttatatatta aatgaaggtg acattactgt tttgtgccaa taaatttttc 89461 agtaacaatt tgagatctgc acaaagactt ctaatctaaa cactgtaact tccagtttgt 89521 tttattcaac attttaaacc attgactttc tcagattagt agctcaagta agttttttag 89581 aattatgcat ggtaatttat ttaaatcatt tggaaatgca atccttcgat gtaatgatgg 89641 agttagttct aatgaacata atcaaaattt gaattgtttc tgcattgtta ttttgtcttt 89701 ttttttcttg cagaaaatat tatagtccac caccttttta aagaataaat attttcactt 89761 ctgtttcccc ttgcaggaat tgtatgcaat gccttcaccc tcacaatttg tctcaggcta 
89821 tacttgatca gtgcaaaacc tcatgtgctc tcatggaaca acagcattat gtcgaccaaa 89881 cttcaggtag gccaaagctt aataatcaaa gcacagaaga gtgtctgtag aggatgatgt 89941 tcccccaaaa gacccataat taaccatgct aaagaaagga ctgggccggg tgtggtggct 90001 cacacctgta atcccagcat tttgggaggc caaggcgggt ggatcacctg aggtcaggag 90061 ttccagatca gcctggccaa cgtggtgaaa ccccgtctct actaaaaata caaaattggc 90121 caggcgcagt ggcgggcacc tgtagtccca cctactcagg aggcataggc aggagaatcc 90181 cttgaatccg ggaggcggag gttgcaatga gctgagatca tgccattgca ttgcagcctg 90241 ggcaacaaga gaggaacccc atttaaaaaa aaaaaaaaga ggactgaggg ctttattttt 90301 aaattatgaa acattgcttt attctttata acaataataa atgtgtgttt ttgtgtcatc 90361 ttctacctta gatacttcct tcatgttcac tggctcactt aatgtcatac aactctttga 90421 gattggtata atctccatgt tatagatggg gaaactgagg ctcagggaaa ttttgtatga 90481 gggctgctgt ctcttaacta gtaatgttaa gatacatgtc ttagatcaag atcttcaaac 90541 taaaagttcc atacactttt cattatgtca catttataat cattcatcct taagctatat 90601 ttttatcaaa atgaaaagta ctaggaagga atggcccttt tgaagttaca tattttgtaa 90661 aagcagatca tcattactta ggattttttt ggcttcacct gaaacatgaa agcaaatatt 90721 gattagacat gaggaagttc agaactagaa gcaagtttgg attatgaagg aaaattatgg 90781 ctccacagat tttatctgtt actgaaatga gtttacatgg ggaactcata tttctggttc 90841 actgaggcct tgacagccaa tttatttgga tgcctttcca tggaagtacg ttgcatggga 90901 attcagctgt gtagatgaga ggggaaaact aaccactcag ccacaccata gcaagcatac 90961 tcatctctgg gagatttaaa aaactgaatc gcaaactcat tatattgtat acattaatat 91021 gtgcagtttt tgtatatcaa tgatacttaa taaagctgtt taaaaaaaaa aaacaacaaa 91081 agcaaaacaa aacaaaagtt atttagaacc cagaacctta ctttgcacag gggtaaggtg 91141 tggaactcca cgggcattag aaactaacct tgaaattata tctttattaa acattcccat 91201 tgcaactatt atttcaagtc tctcccacct gtaagaaaaa tcccagctct tagagagact 91261 gcacaaaaga gtaaactttc tccagagttt tgtttccttt ttccctcaca tccacactga 91321 tcttacccca taatgaactt ggctctgaaa aaagccccat gtgctgccag ctggagctgc 91381 ggcaatcact ccagaaagcc aagtgccttt ctaataaata gggccttact atcaacatgt 91441 catttattat gttaagtaaa cactactttg acttaagtgt 
tgttacaaaa agtattctct 91501 taaagctcct taggaaaagc cccggcctga cctaaactaa ggaaagcatg ggctaggcat 91561 gaatgtttga ccactcatag catttaaata tgcatgcaga tttgagctct gaaaatgccc 91621 tctgttctaa cctattagaa atatttctaa aatgtgtcag tagctagggg ctgcccctcc 91681 cttcacaact attgtgcaag gagagggaaa gggcatttgt taggcacctc tattatctat 91741 ccactcagtt gcacttaaag gctgttcttt cagtctctct ggctgagctc ccaccaggcg 91801 agaagccagt aagtgggatc aagccacgtg ccccctctga tgatctgcct tagctgattg 91861 agtagtccta aatcaaccag aggaaatgac agtctgtcag aacaaaaggc atgtatgtta 91921 tgagcccgaa gtcactgtgg aggcctcaaa agacaccaat tttgagcaga atattttgga 91981 ttctctctct gtgagacctg tagcaacaga gacctgcact ggcatttggt gtttccattc 92041 cagttgtagc actgggatag cctttgacct ttctttcaaa gtaccttatc acaaaaaaaa 92101 aaaaaaaaaa ataggcttaa ttaagtcttc ccacttgaga ggaacccttt cccttatcac 92161 tgagcttggc cttattcaaa aacatactta gtcctcactg gaaattctga cccaaatcac 92221 ccaggattgg ttgggcggtc cactatgcta agccttggca ggtcacctcc gtagtggccc 92281 tctcagggac accttggaac tttggatgag gttcccaaac tctggagagt agggagggga 92341 acacatgcca ctttagccac aaagtaattt tatgaatctg gggcagctgg tgagttccat 92401 ggactacttg cctattgtgc aaccatcttc ctgttagatt agggtaatgc ctctggcaac 92461 aacagagcct gtagactttc aagcaccaga tctttggctg tgagggtgtg ccaaagatct 92521 ggtgctaaaa tgagatccag catgcccctc ctcttaacac aacatttctt tttccagcta 92581 gtaattattt gtcctaagca ttatttattc ttaaaataca cagtgcatgt ttagaggcaa 92641 atggaatagg tgtagttaac agactagtaa aaagtaaatt aatgaaaatg aagaatgata 92701 ttttcatgtt gtcccaaatt ggacactcct atttcctgga gaagccgtgt ataataagaa 92761 tgaaaattta cacttcaatt tctgaagaaa tatttattca cctattggag tagttttgat 92821 tttagcttct gcagatgttt actttatttt ccattactgg catggaacaa atgaagacca 92881 agcataacaa gagaaacata tcctaaggcc cagaactgct caataatttt ttaaaatcct 92941 cttttccatt tagtgatcaa cagtacgtac cacaaaaact agtcctcttt ctgtcttgga 93001 tcaaattatg cttggttatg ggacatggac atattccagt taatacaatt aaagaatcaa 93061 gctgcaatgg aataatgtga ttgaatacct taatcatggc tatttcccaa aggagggttt 93121 tctattgaaa atattttcat 
agtaaacgcg caagaaaaat tagagagctt tttgttttga 93181 atcaaaggta atttcaatcc cgatttgaaa atggagattt gcaaatatct ttggcactat 93241 taattgaagt gtgaacatta tttacttgat agactctttg atttacttat tccaattttt 93301 aatccattta tttgtttgct tcattacaga atgtttctcc agcccaagct acttgagaat 93361 atttttcatc attttcatag ttacattctt gattgggttg cttaaagtcc tgatcattag 93421 acaggtgata ctacaatgga atagtaataa aattaagtcc tcatcagatt acagagtgtc 93481 agcctcaaaa aaggtcagtg aattctaaaa aagaactgct aacagtatgt tattgcctag 93541 ttacaatatt taggccctta gttaataata atatttcttc tctattaagg ataagttgat 93601 tctgcaaagt gtttgcacaa gagcagtcac ctaccgacgt gagaagcctg aagaaataaa 93661 aatggatatc agcaaattaa atgctcatga aactttcagg tgcaacttct aaaaaaagat 93721 ttttaaacac ttaatgggaa actggaattg ttaataattg ctcctaaaga ttataatttt 93781 aaaagtcaca ggaggagaca aattgctcac ggtcatgcca gttgctggtt gtacactcga 93841 acgaagactg acaagtatcc tcatcatgat gtgactcaca tagctgctga ctttttcaga 93901 gaaaaatgtg tcttactact gtttgagact agtgtcgttg tagcacttta ctgtaatata 93961 taacttattt agatcagcat agaatgtaga tcctctgaag agcactgatt acactttaca 94021 ggtacctgtt atccctacgc ttcccagaga gaacaatgct gtgagagagt ttagcattgt 94081 gtcactacaa gggtacagta atccctgcac tggacatgtg aggaaaaaaa taatctggca 94141 agtatattct aaggttgcca aacacttcaa cagttggtgg ttgaatagac aagaacagct 94201 agatgaataa atgattcgtg tttcactctt tcaagaggtg aacagataca accttaatct 94261 taaaagatta ttgcttttta aagtgtgtag ttttatgcat gtgtgtttat ggtttgctta 94321 tttttgcaag atggatacta attccagcat tctctcctct ttgcctttat gttttgtttt 94381 cttttttaca ggataagttt atgtatgtca cagatgactg gattaattaa gtgctaagtt 94441 actactgcca taaaaaacta ataatacaat gtcactttat cagaatacta gttttaaaag 94501 ctgaatgtta ataggggaca ctgtaaagta tcatcaaaac ctgaatagct tcattgtgca 94561 caagtgtgga gttttgtatc ctcttacctg gtaaactgaa gggattgttt ggccatttca 94621 tttatcttat cattaattca caagatagtt agaaattctg cctcaagcaa agtaccacat 94681 tttgaatgtt ttcttagatt ttgattgcaa gtagatatca gcatttttta aatgaaaagc 94741 tatattatct tctcccttca aggcagccta aggatgttct ttcccagaat cactccaacc 94801 
cttcttgcca gaattcataa aagtacaaaa ttggagaata gatgatatct tagaaataag 94861 cttttttttt tttttttttt ttttttgaga cggagtttca ctcttgtcac ccaggctgaa 94921 gtgcaatggc gcaattaggg ttcactgcaa cctctgcctc ccgggttcaa gcagttctcc 94981 tgcctcagcc tcctgagtag ctgggattac aggcatccac caccgtgccc agctaatttt 95041 tgtattttta gtagagacgg ggttttgcca tgttggacag gttgatctca aactcctgac 95101 ctcaggtgat ctaccctcct cggcctccca gagtgttggg attacaggca tgagccacca 95161 tgccaggctg ctaattctcc tttttagtga gttagggaac tgagcctcag aaaacttaaa 95221 cgatttctca gaaaacactc aagtgataaa gtggccacat tggaaaggag tttttatctt 95281 ctcattgtca ggccagtgtt cattgcacaa tatcatgcta cctcttgaat ctttaaaata 95341 ttcaattggc aaatgttttt caatgtgatt tactcatgtc ttaagtgtat gaggaaagtt 95401 caaagcaaaa tagaaaggaa taattcaaac tgaattgtcc ataatcagct tccagtcttt 95461 catgctaatc agcttcttaa gagactgaag tatggcatac ctacagggga attccttcgc 95521 accatagcct gtatgaacag tgttccctgg agttctccag tgctcagctt gagaccttga 95581 tacacgggcc atgagccctg tcttccccaa tggaaattta tttacactta ccttatccct 95641 atggacttag tctgatttta ttggctagga gtctaacagt cctgtgtgga tatacagttt 95701 tgcccatgac aacaaaggaa tctatccgaa atatcttttt ttttataata aacttccaag 95761 atttgctgtc ttccagcact tgagttaaag tactagatac tgcattttga tgaagactaa 95821 ccccatctca tattctaccc taaagagaac tgaaaaacct ataataagtt gttctggagc 95881 caataaacac agcagctctg ttagatgtcc tctacagcca agcactttca atgctaactt 95941 gaactgcatt tccttcctca aatgagagat tgacataatt cagtactgtg agtcacttgt 96001 ataagaaacc tttgatcact aaaaataatg taaaaattgg gtttagtagc ctaatacaca 96061 taacgttctt cttaaaaagg aaaatggatg gatgcctgac aaccctccaa aagaaaaaag 96121 tgtaagatag ccattaagat gatgacaatt tttgaaatga acattatgat atttatgaac 96181 aataaacaaa tttccgtatg gaatgaatta tccaaaaaga gtataacaaa atgaaatcct 96241 taaaaatcca gagtttatat tttttttata ccctcacttg tttgcactaa ctttatagtg 96301 gaccaaggct gttaccatag gaagggacaa acttccttgt aggcaactca gtgttagacg 96361 atgattgtgg ttatgcttgc aaagtcttgt gcttatcttt tttgttttta cttaaaaagc 96421 taatttttaa agattgtagg gcttgtattt tacttgaata attgatatct 
tcctgtgtaa 96481 tgatttgtga gatgagaatt aatatttgac tagttagaat taattaaatg gtaagggaac 96541 acagggtact cttaggttaa ataatgtatg caaatagagt ctattttcaa ctaatatggc 96601 cacaggagcc ttttgagatt cattgatatt aaacacaatt aatgaaattt taaattgtta 96661 acagaattga gaacttgaac aacactttta gtactgcagc atttttgtgc cctaaagtat 96721 gtaatgattt ataaatgtgc catacataca ctacaacata acatttgctt tgttatgcat 96781 tttatttctc tggggacacc attgcactgc agtgcacacg tatttataaa catttgttat 96841 atttttggaa acttgctaat atttattaag tcatagactt ttctggagga cttaaaaatt 96901 cactaaaaat ctgattatgt cttaaatgtt cagtttatct ttggtttatt aaaataaaaa 96961 aaaaatctaa gattaaacac agtagatatc tctggaggca attttccaaa actcaacatt 97021 aaaatttgtg gatgcatgag atgcaatcct tcaaagaatg aatctgaaat atatttttaa 97081 tatttactta atatccactg aagatatctt tatgcaagac aagagtcagc catcagacac 97141 tgaaatatat tatgatagat tatgaagaat tttctctgta gaattatatt cttcctggaa 97201 cctggtagag tagattagac tcaaaggctt tttcttcctt ttcttactcc tgttttttcc 97261 actcactctt cccaagagat ttcctaaagc ttcaagctta ataagcctaa tagtgaaaaa 97321 taactgaatt taatggtata atgaagttct tcatttccag acatctttaa ttgatcttaa 97381 agctcatttg agtctttgcc cctgaacaaa gacagaccca ttaaaatcta agaattctaa 97441 attttcacaa ctgtttgagc ttcttttcat tttgaaggat ttggaatata tatgttttca 97501 taaaagtatc aagtgaaata tagttacatg ggagctcaat catgtgcaga ttgcattctg 97561 ttatgttgac tcaatattta atttacaact atccttattt atattgacct caagaactcc 97621 attttatgca atgcagacca ctgagatata gctaacattc tttcaaataa ttttcctttt 97681 cttttataat tcctctatag caaattttta tgtataactg attatacata tccatattta 97741 tatttcattg attccaagac atcacttttt caatttaaca tctctgaaat tgtgacattt 97801 cttgcaactg ttggcacttc agatgcagtg tttaaaatta tgcttgaata aatattacac 97861 taatccaact ttacctaaat gtttatgcat ctaggcaaat tttgttttct tataaagatt 97921 tgagagccca tttatgacaa aatatgaagg cgaaatttaa ggacaactga gtcacgcaca 97981 actcaacatg gagcctaact gattatcagc tcagatcccg catatcttga gtttacaaaa 98041 gctctttcag gtccccattt atactttacg tgagtgcgaa tgatttcagc aaaccctaac 98101 ttaactaaca agaatgggta ggtatgtcta 
cgtttcatta acaaattttt attattttta 98161 ttctattata tgagatcctt ttatattatc atctcacttt taaacaaaat taactggaaa 98221 aatattacat ggaactgtca tagttaggtt ttgcagcatc ttacatgtct tgtatcaatg 98281 gcaggagaaa aatatgataa aaacaatcag tgctgtgaaa aacaactttc ttctagagtc 98341 ctcttacttt ttattcttct ttatcatttg tgggtttttc ccccttggct ctgatcactt 98401 taacttcaag cttatgtaac gactgttata aaactgcata tttaaattat ttgaattata 98461 tgaaataatt gttcagctat ctgggcagct gttaatgtaa acctgagagt aataacacta 98521 ctcttttatc tacctggaat acttttctgc ataaaattta tctttgtaag ctaactctat 98581 taatcaggtt tcttctagcc tctgcaacct acttcagtta gaattgtcta atactgctct 98641 attaatcagg tttctagcct ctacaaccta cttcagttaa aattgtctaa tacagcaata 98701 tttaaaaaaa aaacactgca attgtcaagg atggaaaatg tgtgatttgt gtaaacaatt 98761 tttaccaact ttacattttc ctacagataa atgtgaaatt ttgataagaa gtctacgcaa 98821 tgacaagtat ggtacataaa ttttattaag aatattgagt ataaagtact ttaattctaa 98881 attataagaa aatatacatt tgcacatatt aatatagaaa ttcattttgt gtatatttaa 98941 catagctttt aaactatttt acattagcta cttcattatg gtttcttgaa cttctgaaaa 99001 aaattagaaa tgtattaaac ttatcagtaa cataaaaact tattttgttt cacctaacga 99061 atactgcgtt tgtaaaaata aatttaatat agaatatatt tttaaattaa atatttgaat 99121 ataaaatagc tctaagaaag aagcaaatta tcactgaaca tatttcttat tatttctggc 99181 tttgaattat acgtaactta aattgtctta aatgatacag aatattggag aatatgatac 99241 tttcacataa tatactatga acctgttcat ataactctga ttgactacta acttctgttt 99301 tatgtattta ttaaagagct gacactgtag tttgtggtga gatgtttatt tttctaacag 99361 agcttataac agttaggaca aggcatttaa ttaatgcatc attctgttta gtagtaggtg 99421 ttaatcaata tgaaattctc tgttttaaaa taaaaatgta aaaatctaag aatatgcaaa 99481 acgcttcctt atttactttg tgttacatta atctcaaagc ttccagacaa cagtactttg 99541 tcccccaagg aaatgtaata gcatctatac actgtcatta gcagaggaca ttacataatt 99601 ccaaaaagat attactccta gaaaattgta gggccaatga taaacagggg aggccatcct 99661 agttgttctt ttcctttttg atagttattt gcagtagaaa cagaagaaat attctaatga 99721 caaaaacctg aaaaaaatag ctaaaagtta aaacaggcat tttccatgtg ccagagactt 99781 ctaccaccac 
cttaggtgtg acacattcat aaccttcaat tgcacattct caagagtcag 99841 gcttaaatta attgaatttt attgaatacc catgtattgt gatttttctt accctagaga 99901 agagaataaa gcagaatgtg gtagatactg agatcagctc agtcgtggag accctaaccc 99961 agcggcgcca gaggaattaa agacactcac acagaaatac agagtgtgga gtgggaaatc 100021 gggtctcaca gccttcagag ctgagagccc caaacagaga tttacccaca tatttattga 100081 cagcaagcca gtcataagat ttactaaaag cattccttac gggaaataaa gggatgggca 100141 gaaataaagg gatgggtctg gctagttatc tgtagcatga acatgtcctt aaggcacaga 100201 tcgctcatgc tattgtttgc agtttaagaa cgcgtttaag cagttttcca ccctgggtgg 100261 gccaggtgtt ccttgccctc attctggtaa actgacaacc ttccagcatg ggcatcaagg 100321 ccatcaagag catgtcacag tgctgcagaa attttgttta tggccagttt tggggccagt 100381 ttatggccag attttggggc ctgctcccaa cagatagata tattaaaaat gaaataagac 100441 atttagacag attagctgtt cttaaggagc ttttaatagt ttgaaggaaa agataaagtg 100501 taatgcacct ggtatggttt ggctgtgtct ccacccaaat ctcatcttga attcccatat 100561 gttgtgggag ggacccgtgg aaggtaattg aatcatgggg gcagatcttt ccagtgctat 100621 tcttgtgata gtgaataagt ctcatgaaat ttgaaggttt gaaaaagagg agtttccctg 100681 cacaaactct tttcttgtct gccaccacgt gaagtgtgtc tttcaccttc tgctatgatt 100741 gtgaggtctc cccagttatg tggaacttgt aaatccaata aacctcttta tttttgtaaa 100801 ttgcccagtc ttggatatgg gtgtatcagc agtgtgaaaa ctaatacagt acttttcgat 100861 agaaggtctg tgctaaagag accagatgga cacttactgg caggtttcac aaggccttta 100921 gttttgagaa ctcatctcaa ttgtgtttga ctgcagtctt tcaagtttag aatcctgatt 100981 tttttattga gtgttatttg acccatttct tcctttcttc tttagtaata gaacctaact 101041 cctgagttgc agcagagcac acagttgaga cccccagact cctttgtagc tgaatgcagc 101101 tacagaactt aatgttggcc agtgggatgg aggaggaagt gatatgtgca gccctggggc 101161 tcacttcctc ttcctttctg ttgtctggaa caactggagc tgcagtcctg gcaaaggaag 101221 ccttgtgcta aaattcttag agcttcccct ttacccattg actgttcacc gcaagacatc 101281 tctggtagga aaacaatctt acctaaacca ttgcatttga tattatagca gataatctct 101341 actgtaacca atgaagttgt ctttatataa ttgtctcact gaatcctcag gagcaaaaag 101401 taccattttc cttaattttt caaataggga 
aaataagtca cagaaaggtt aaggatcttg 101461 tatgaaagca tgcagccagt aagtggctgc acttactgca ggattcatac ccagatctgc 101521 tttactttcc ataagtaact tatctattgt gagcacagat tttactagct caacaatcag 101581 aagttaacct ttgtccccac accccataaa acatatgcaa caaataaagc tgaggagctg 101641 atatatgtgt acaaatacta cacagggaaa caacatcctt taagatgaca aaattaaaaa 101701 aacctttaag atggtgagag gtactcggat tccagagtta ggagagccag gagaataaga 101761 gaaagcccat ggaagacagc atttggcctc gccttgcaag aaaggtgagt atgcacatga 101821 agaaagggga ggtaacctgg gagattttac aaatgataaa gtataaccaa acgcaggaaa 101881 gagtgtggac tccatccaag aacattgaaa ttgtattgag atacagcatt agataacaaa 101941 aggcagtaac ctacaggtcc taagttatgg agagcttgat tcccaggcta agaattttga 102001 gctttggtta ctggggaaga gcaaactgta aaggttttgc acagaaaagt ggtattattg 102061 gagttcaact tggttgacat ataattacat aaactaaata taatttgaag cattaatttt 102121 tatactcaaa tactaaatcc aaatagtagt taacactatt aagtattcaa tgctattaaa 102181 tgttatctaa tactaaattt aaatacaatt taatactagt gtaaaataat ttgaagtaaa 102241 gtattcgaaa gaaaaaatat tgaaatccaa cctcctagtg gcaatcactt ttacatgtag 102301 gcatatttcc attttatttt tctatgcact ttataactgc tattatacgg ataatgcata 102361 tttactgtac aatgtattta tatgcattat acagatatgc atatttacca taatatctag 102421 gttaaaaata cctagaaatt aaaatttaaa aaatattact ctaatcccac cacccagagt 102481 tggtcattgc catatatttt tcaggaaaaa agtacatata taatatatgt aacaatatat 102541 gtaattacat atatattaaa tacatataaa acatatatat aacaaatgta tatattaata 102601 tattggtgac taagaaatat actatctata gtccggtata aagaggtttt ttttccaaaa 102661 acatatcaaa catttatcta tgttaagatt tatagctgtt ttaataggtg aatattgtcc 102721 cataaatgtg ccacgatcat attcactgag agcccttata ttcgctgaga aaagaatgag 102781 cggatatgat acaaaatatc aaacaatatt caacaaggag ggactagtta aattattttt 102841 gactggggga gacttgtaca tttgtaatgg agaggagaag gagccaggtt agaaagaaat 102901 gtttatgttg ttatgttggg aaggtttgga tgcatctgat gtacggtagg aagacatttc 102961 aaattgtgac aaagaaaagt cagcgacgcc aggcacagtg gctcactcct gtaatcccag 103021 cactttgtgg gtggatcatc acttgagctc agtagttcga gaccagtctg 
ggcaaaatgg 103081 caaaaccctg tctctactaa caatacaaaa actagctggg cctggtggtg tacacctgta 103141 gtcccagcta cttgggaggc tgaggcagga gaatcacttg aacccaggag gtggaggttg 103201 cagtgagccg aaatagcacc actgcactcc agcctggcga cacagtgaga cgccgtctcc 103261 aaataaataa ataaataaat aaataaataa ataaataaat aaaatttaaa aataaagaaa 103321 ataaagaaag aaaaagaaaa aagttagtga aatgagcata aaggcttttc atgaagaagt 103381 aaagtagaaa ggcaggaaat atgagctact gctaaaagga agatttgaaa gaacttgggg 103441 actggctaga tgtaggtgat aaagaaagga gttagttgaa gcaccaactc caaaagacac 103501 tacaggcttc tatccaaact ggataaaact cccaatctgg tctcacaaat cacattggtc 103561 ttgccaatcc atgtctttta gactaaattc taactttatc aaaatattgg accagacaaa 103621 gctgggaaat tttaaaccat gtcaatgatc atgctttcaa atattatcaa aataaaccta 103681 attgaaagat aatatcactt tcgatgtaat tatttatata tgaacttttt catcataatt 103741 agaagcagtt gtggtctgta caagaaaaac aagtgatatc ttgtctttca acgtggtaat 103801 cctttagcac tgatgtgggc tggtgcatta tttcatactc agggcctagt cactcagact 103861 agactggtga gagggaaact cctgctttat tgggctttgg catgctcttg ctgttttact 103921 ttgtcatgat tacgaaattc attctaccat cattaaagtc aagagagata attattcact 103981 taaaaaatct taatttacca ttaacatttt tcccattaaa tacaagataa tctataaatt 104041 cacagtttca aagggtagtt cccagtccaa gagcatcacc atcacctaga agtttgtaga 104101 aacgcagatt ttttggtctc catcaagacc ttctgaatca gaaactctgg gactgtggcc 104161 cagcgatgtt cttcccccct cccaacccgg aagcttatta gaaaccagca gtgtgtttta 104221 acatgctctt taagtgaccc taatgcccac tgatgcctga gaaccacttg caaagagtga 104281 aaactaacat cagatgacca aaaaggtaca gattatgctg accccagaca aaactttatc 104341 tgatcttgag gggaagaaaa tgttctagga aacatagcaa gactgtctct tacaaaaaaa 104401 aaaaaaaaaa ttcttttaat tagccaggcc tagtgtcaaa tgcctgtaat cccagccact 104461 tggtaggcta aggcaggagg atcacttgag cctaggggat caaagcttcg gtgggctatg 104521 atcactctgc tgtacttcag cctgactgac acagtgagat tctgtctcag gaaaaaaaaa 104581 aaaaaaaaat atatatatat atatatatat atatataaat atatacattt atacacacat 104641 atatatatat ttctacaagc acgcatccaa attgtctcta tttcactaat tttgctctta 104701 ttctggtagt 
aagtctgctt taatgagggc ttggaaaagg gaaaatgacc ctctgagtgc 104761 caatgactag caaaagggaa acaaaacata gagacatacg tgtaagggtc agtcggaccc 104821 aaagaataag tgaaacatca ggggaaaaaa atatatgtgt ctcattattg gaaaaataaa 104881 gccaggacaa aagatgagag atcatcagtc aataattgtg gggaggagtc agaggggact 104941 tggattaatg gaaagtctaa atctagaaag tttaattttg tctaacataa ttttccatga 105001 tactccaact agacgataag tatctaggca cctattacgt taggcttctg tgagctatga 105061 tgtctgtatc tgttccattt ttctagtaac atactaaatg gaaataggcc aaagcataat 105121 ttaggatata aattgttttg atgaagggtt ccatcttctc ttacaaatgt ttactatttt 105181 acctacagag agtaactagc tctcatttca gacagcagtc caggtgggcc tactagtaag 105241 agggtgctat aaagttagct ccaggacctc acctcaaaga atgtgactga gaaaagattt 105301 acagactatg atcacagtgc tcttctaaga actgttcatt aaataagcca cagatattaa 105361 gcagtaggtg ttttctgttc ttcctgaaga ttattaggtt ttaccaggga taatagaata 105421 atactcgctg gaaattaaaa ttcctccttt gtttatttat ttcaatacca tattcatttg 105481 ttaacttgct tggtctctgt ttagaatatt agttagcaag gaataaatat tttgtttaca 105541 gtttcccaaa acgttttctt ttttcaagaa aacacaattt cttatgtatt tttggtgggg 105601 catttaacag gcctgagaaa gcttaccagg gtagaaattt gtcatgaaaa tttactttta 105661 atccctgcca gatccagtga tcaggtgggc accaagggca tgggatgcta gaatgggcct 105721 ccacctacat cactttcaac tggggcacct actaattgtt atgctaagat atcaaggtga 105781 ggtgtgccct ccctcccaca gtttcagatt ttcaaaggaa atggtactga gaacctgcta 105841 tagaattctg gacccctcaa gccatggagg atataaagag ggaagagata ggaggagagg 105901 aagatcgttt aaacattttt tgagacctct ctgctgtgag cctccactcc acttctcttt 105961 ggggacagtc tgaagatgtt ccacctaacc ttctgcttgg tggggcttca agggaaccat 106021 gaggagcttt gaaacgtgtt ccttactttt aagagacaga gaagtctgat gcttggcttg 106081 tgccagaaag tcttcatcgt agattattgc taataacaca taagactgtg gcctgggtcc 106141 tatctcctcc caggaaaact aaagggaata aatgtctagt actagttttg gtatgtccat 106201 tctaactcta attaagtgtc tattctcact ccacaggtta gagtttaatt tgaaacagaa 106261 tatataagta ccattttctt tcctttcaaa tagcaaatca ataatatcag actcttttca 106321 ggaatgattt tggatataac taacaactcc 
ctctcccccg accaaagaat ctgaaaagaa 106381 ccaaaaatgt ctataaaata ttttcaatta agctgtttaa catgttttct taaagtaaga 106441 atttatattt tgatacattt gaaatttatg agattacttt gcataaagta gattaccttt 106501 tttttttttt tttagaaaaa agtagattac tggctgggca caatagctca tgcctgtaat 106561 cccagcactt tgggagaccg aggcaggtgg atcacctgag gtcaggagtt cgagaccagc 106621 ctggctgaca tagtgaaacc ccgtctctac taaaaataca aaaaattagc cgggcgtggt 106681 ggcaggagcc tgtaatctca gctactcagg aggctgaggt aggagaatcg cttgaaccca 106741 ggaggcggag attgcagtga gctgagatcg cgccattgca ctccagccta ggtgacaaga 106801 gcaagactcc atctcaaaaa aaaaaaaaaa aaatcagtgg attaccttaa tctacagatt 106861 aaattttcaa atacaatttt ctcttggtat ctgtgggaga ttggtaccag gacccctgag 106921 gaaaccaaaa tacaaggatg ctaaggcccc ttatataaaa tggtgtagta tttgcatata 106981 acctatgcac atcctcctgt atactgtaaa tcatctctag attatttata atgccttaga 107041 cactgtaatg ctatgtacat atttattata ctgtattgct ttttatttgt gcttttttgt 107101 tgtattgtta tttttaattg tttgtaaaaa atattttcaa tctgccattg gttggagctg 107161 ctgatgtgga acagcaggat ctggaaggcc aactgtattg tttacttctc acttagcaat 107221 ttcctaggat tccttttgtt tgattaagaa cttctatttt taaaaaaata gaaatacaga 107281 atgttttcca gtgtttaaaa caatcaccat gttttccttt tgaccttaat aaattgtaaa 107341 aaagatcaat tgctaagttt tgaagtttca tttccttgtc ctttgatgac attaactcta 107401 tttgggaaca aggattatgt aaaactcatt gagtcaaaat tgaatgacta gaataccagc 107461 tatatatgcc atcttttcaa tttttcttat taataatgga agaatatgca atttaaggtt 107521 cttcctagat tctcagtcat ttagtaagaa taagcataca tattatatct attctggtaa 107581 tggggagata atggtctaat tagtcagcaa tcacttgaaa aaaaaagggc agaataaaag 107641 aagaaatgca cacacttcca aagaattgaa gagacagtaa atgcggtctt aaatatgttg 107701 atgggtgaat gtgtcaatga gtttcacaat cggtgtgggt ggttcaagaa taaggaagag 107761 gcaaatggag aaaaatgtta caggagggaa actctccatc acttatcttc catccttaac 107821 acacacacac acacacacac acacacacac acacacacat acaaacacat tgtatgagat 107881 atcatatcag tgatgacata gctgcccagg accacttttg aaattttttt cattatattg 107941 aattcacata ccatataaca caccaatctt aagtttacag cttgaggagt 
tttgacaaat 108001 gtatacatcc acataacccc accactatat cttaacatat ctgtcaatat agaaggctgt 108061 ttatgcccct tccagtcaac caattccatc ctgtgatgct taattttatg tgtcaacttg 108121 actgggccac agcacccaga tatgtgtgtg gtcaagcatt attttggatg ttttagtgag 108181 ggtgtttttt ggatgagatt tacgtttgaa ttgtggactt tgagtaaatc agagtgccct 108241 ccatgctgtg agtaggcctc atctaataag ttgaaggcct gaggagaaca aaaagctgag 108301 cttgcagagc aacagggaat tttccagcga ctacctttgg acttcatttg cctcggctgt 108361 tcttatttcg ccagcagact gcctttggac tcaaactgca actctcctgg gtttccgcct 108421 gtcagctgcc tccatcaaaa tttggacttg ccaagactcg agaatcacat agctaattcc 108481 ctaaaataga tctctttctc tggaaataca catcttgttt gttctgtttc tctgaaaata 108541 cacatttgtt tgttctgttt ctctgaagaa ccgtaacata cctccctttc catgatatca 108601 ctgatttctc ttaaacagcc aacagctttc ttactttaat tccccttaga aaaaaatagc 108661 ctttttctta atattgaccc aaacctaatc ttttcataga aacattaaaa tatggaagct 108721 agcataatat ttaaattaca tttcctggcc gggcacagtg gctcacacct gtaatcccag 108781 cactttggga ggccaaggcg ggtggatcac ctgaggtcgg gagtttgaga ccagcctgac 108841 caacatggag aaaccccatc tctactaaaa atagaaaatt agccaggcat ggtggcgcat 108901 gcctgtaatc ccagctactt gggaagctga ggcaggaaga tcgcttgaac ctgggaggca 108961 gatgttgcgg tgagccgaga ttgtgccact gcactgcagc ctgggcaaca agagtggaaa 109021 ctccatctca aataaataaa taaattacat ttcctttttt gaagtcagat ttactgtagt 109081 ataatttact ctatagaaaa attcactttt taggagtata tttagataga tattgacaaa 109141 tgtgcctata tagttgtgaa accactacca caatcaagat acagaatatt tccatcaata 109201 ttccaatacc cctttgtaat caatcctctt tccctactct cagctcctgg caacccttga 109261 tctgatttct ctccctatag tttctctatt ccagaatgtc acataaatgg aattatctca 109321 tatgtattct tttgtgtatg gcttctttca tttttcataa tgctattgac tcaaccatgt 109381 ttgcatgtac aagtagttat ttttgttgct gagttgtaag attccactgt atcaatgtgc 109441 tgcaatttga ttatccactc actggttggt gactatttgc agttttccaa tatttggtga 109501 tgatgagtaa aggctccgta agcattctca gatagctctt tgtacagaca tgtgtttata 109561 tttctgtcat ttaaatacct aggagtagga ttattaggtt gaatggtaag tgtatggtaa 109621 ctttatcaga 
aaccaccaaa ttgttccgta aagtgcctgt gccatttcat gttcccagct 109681 gtaatgtatg agtgttctag ttgctcagca tccttgatag gcatttgtag agtcaggttt 109741 taaacagcca tctgatatgg tttgattgtg tccccaccca aatcttatct tgaattgtaa 109801 ttcccataat ctccatgtgt cataggaagg accacatggg gggtaattga atcatagggg 109861 tgcttacccc catgctgctg ttctcctgat agtaagttct catgagacct gatggtttta 109921 taaggggatt ttccccgttt tgctcggcac ttctccttcc tgccatcatg tgaagaagga 109981 tgtgtttgct tccccttctt ccatgactat aagtttcctg aggcctcccc agccatgcag 110041 aactgcgagt caatcaaacc tctttccttt atatattacc cagtcttggg cagtccttta 110101 cagcagcatg agaatggact agtacaccat cctagtaggt gcgtactact gaatctttgt 110161 ggttttaatt tgtgcttccc taatgactaa aggtgctgag cttcttttac gtgctttttt 110221 gttatctgca gcagccagcc tccaaaatga accctcaaat gatccctaga tactgatatt 110281 cacagtccta tgtagttttt tcctaaattt agtaggctgt gtaacaaaca gaatatgcag 110341 gaatgatggt gtgtcacttc tgaggtcagt cataaaagcc attggtgctt cctccttgat 110401 ctctcttggc tcacccacct tgttggaagc cagctgccat gttttgagga tgttcaaaaa 110461 ctctatagag agatccatgt gacaaagcac acaggccacc tgacaatagc cagcacccag 110521 ctgcaagtga gccattttga aagcagatcc tcaagcccca gttaaggtac aaatgacttc 110581 ctgccctggc caacatgttg actacaacgt caatgaaaat cagaacaacc agcaaagctg 110641 ctcctctttt actgagctgc aaaaactgtg agataaaaaa tgtttatcat aattttatga 110701 caccaagcct tgcagtaaat tgttatataa taatagataa cacagcattc atatcttctc 110761 tttggaaaaa tatctgttca aacctttagt ctgaatttta aatttggtta tttgtttttt 110821 gttacttcag ataaaagagt catttgtgta ttttggattt aagtccttca taagataagt 110881 ttttataaat attttctccc ggtttttgac attttaaaaa attttcttaa tcacatcttt 110941 ctagagaagt atcatatttt gatgaaatcc aatttgtcta ttttttcttc ccttggctca 111001 tgcttttggt gtcatattta agaaatcttt gcagagtaaa aggtcatcag ctattttcct 111061 acatttttct ccagaagttt tataaattta ggttttacat tgagacatgt gatccatttt 111121 gagttgtttt ttaatatggt ttgagttcct ttttcaaatg tgtaattgtt ccagcatcat 111181 ttcttaggaa cagtattctt tcttcgtgaa ataactttga tgcctttgcc aaaaatcaat 111241 taactacatg tgcatgtttt tatatctagt 
ctctcttcta ttccattgtt ctatatgtct 111301 acccttctac aaaaattcat tggctttatt actccagttt tatggtaaat cttgaaatca 111361 ggtaatatga atcatctaac tttgttttgc tttttcaaaa tttcttttgc tattctaggt 111421 aatatgcttt tccatataca tttttaaata ctttttttca gtttctattt taaaagcctg 111481 ctgagccaca catctacaac catctgatct ttgacaaggt tgacaataac aagcaatggg 111541 gaaaggactc cctattcaat aaatgatgct ggaataactg gctagtcata tgcagaagat 111601 tgaaactgga cctctccctg acacaaaaat caactcaaga tagattagac ttcaatgtat 111661 aaactaaaac tataaaaact ctagaaggaa acctaggaaa aagcaattgc aacaaaaact 111721 gacaaatagg acctagttaa actgaagaac ttctgaacag aaacaaatac accctcaaac 111781 tatcaccaga gtaaatagac aacctacaga atgcgagaaa atatttgtaa actatgcacc 111841 tgacaaaggt ctaatatcca aaatctataa ggaacttaaa tcaataagca gaaaacaaac 111901 aaccccatta aaatatgggc aaaggacatg aacagacact tcttaaaaga acgcatacac 111961 gtggccaaca agcacatgaa aaaaatgctc aacatcacaa atcattagag aaatgcaaat 112021 caaaaccaca atgaaataca gtgtcacacc agtcaggatg gcttatagat gactaaatgt 112081 ttttactaaa tgtcaaaaaa taatagatgc tagtgagatt gtggagaaaa aggaaccatt 112141 atacactgct ggtgggaccg taaattagtt cagccattgt ggaaaacagt ttggagattt 112201 ctcaaagaac tgaaaaaact actattttga tcctcaccct cctcacagca atcctattcc 112261 tgggtatata tacccaaagg aatataaatc attctaccat aataacacag gcacatatat 112321 gttcatcgca gcactactca caataacaaa gacatggaat cagcctagat gcccatcaac 112381 agtggactgg ataaagagaa tgtggtacat atacaccatg aaatactatg cagccataaa 112441 aagagaatga agtcctgtcc tttgcagcaa catgggtgca gctgaagacc attatcctaa 112501 gtaaattaac acaggaacag aaaaccaagt accacgtgtt ctcacttata taagtgggag 112561 ctaaacactg agtatatatg gatacaaaaa gaggaacaat agacaccagg gcctacttga 112621 gagtggaggg tgggaggagg atggggattt taaaaactac ctattaggta ctatgctcat 112681 tacctgagtg acaaaattat ttgtatacca aattccagca acacataatt tgtccacgta 112741 acaaacctgc acatgtactc cctgaaccta ttattactaa ttttacatag ctttcaacat 112801 acatatctta cacatacttt gttagaatta tccctaagta tttgatattt tagtgctatt 112861 ttaatggtac tgttttatta attttatttt ttattgtcac tatagtgaat 
tatgattttc 112921 atgcatggac ctttcatcct gtgaccttac taaactcatt gattaatttt agtagctttt 112981 ttgtagagtt ttggatgatt gtataaatca tctatcctag ccatcatctg gggctaggac 113041 ctccagtaca atgttgaata ggaatgatga aagcagacat ctttcatctt tgcattgtct 113101 tcaattgtga gatgaaagta ttcagacttt cactattaag ttaccagctc caggcttttt 113161 atatattccc tttattatca agttggggaa attatcttct gtttttaatt tgctgagagt 113221 tgtttcataa tggatgctaa actttgtcaa atgcctttac tgcctctatt gacatgatca 113281 tatggcttgt gtccttcttt tagcttgttg atatggtgct ttatatttat tgattatttt 113341 gaattctgat ccaacccaac tcctttaaaa accctccatg gtataacctc actctaccta 113401 tgcatccata ttttgcagga ggtgcaatca caacccctaa gctgtggttt ctcactatgc 113461 tgcttaacct ttgacatgca atgctctttt tccttcacct atcaaatccc catctcttta 113521 attccacctc ctccatgaag ctatctctaa ttacttaaac tctgtctcta cgggccaatc 113581 attcggacat gaacactctt ccaatccatt tttcattcgc catgtcaaga agtgggagaa 113641 attgctaggg taagggaggc ctggagtgag gacaagagtc agtgggggcg agtacttagg 113701 aggcagggta gaatgatatt tgaggatcac agttggtatt tttctcccag gacaacaaca 113761 cagggaattg ctttagttct ggggagattc cagtaatgta taatcacaga taatcctcct 113821 ccctccatta atgttcttgg aacaattctg gaagactgtg gtcctttttt cccttgtgtg 113881 ttactgcaga gaaaaagaga acctcttgtt tcacccaaaa ttctttgcaa aaacatggat 113941 aaaataataa ggtaaaatgt tacatttatt cattcaacaa atgcacagta gcaccttgta 114001 tgagccactt tgctaagctt ttttactata tgggatacaa aaataaaacc tatttgacat 114061 ttctgaattt aactatatat attatcttgt atatacttat tctttgcatt gaaatcttca 114121 ctgctctgta atttcccacc ccagaaacac tcagcttcct ttcactcaag gaaaagtgct 114181 attcggtaca caaattctaa aatacatgct cacaaaaaca cattaacatc aatttagcac 114241 tattttgcat cactttacac aaccagtttc attatcaaat attcataaaa ataaacttta 114301 atttttaaaa cctttgaata tttcctcata atcttcctcg taatttctcc aaatagaaaa 114361 atttatcgaa gatatagaaa actggttccc attaaaaatt aagaataaaa tacaccttgg 114421 ttaaaaccaa cttttgggat taaaacagta tgtaactcgg aacagtgcag cttttggcag 114481 tcaggtttcc tatgatgaaa ctggcatctg ttttcatgga agtattttgt tacttatgcg 114541 tctggacatc 
ttcagctttt gaccttaata acagtcttca gatgattatt aagttacctt 114601 gataagaaaa ttcaaattgc aaaatattta tttttaaaat gtatttgtct tagtgtggca 114661 gacatactac agaggaactg cctggtaaaa tagaaattac caaatatatt ggattagcac 114721 ttacccaccc cctcggatca gaatcttaag acacaacagt aaaactttgg tgatttccat 114781 gatccaagat gaaaaacaaa gaagaaagtc ttatactgta agtaatagtg gggatttttc 114841 tctcttatat tagaacttag ctaaagttca gctggaattt acagggctga atggtcagaa 114901 tcaaatgtat ggcttgaaca gaaaacacaa atgagttgtc ataggaatga caggctagga 114961 cacttgatga attccagttt tatgtcttac tcacaatcaa aatactgtgt ccaaattgta 115021 ccttgaacaa tttcggagtc atctgcctgg gaaagtctgt attccccttt cttcaatact 115081 tggacactct gatttttggt acgtttttct agttaagcct tgtggatgca aagtaaataa 115141 tttattagtt ctagggaaat atgtaatggt acaggaatga acagcataac acagcttatg 115201 aaaggactgt ttttaaagta tacagctatg tagagagtat tcttcagtgc ctggctataa 115261 atcatccatg aaaaccaata gagacccagt ccagaatgtt agaactgaga gggaatatag 115321 gaaggaccca gagggaagag gaataccttg cccattccct gtcagctccc agagggcaat 115381 ccacattgtg gaactaattt ccacaaacac acagaagtcc accctctgca catccccaag 115441 ctcttcttct ctgttcatac cctcctcctt gtaaggggtg tctgctgtgt tatttaacaa 115501 tattcaattc ttttgtaaaa gacaagaaaa gcaaatagag ttgatacaac tcttaagaaa 115561 tactcaatat aaatgtaagt ataaaaggag cttgtgagac acagaagctt atagtaggta 115621 ttaagcatag atagcaagtt aatgacctgt gaactatgtc agacataggg aaagaaagaa 115681 gcacatcagt gtgttaaaca aaggtgagga tggaaagccc aaaggcagca aaaattcatg 115741 gtctgttaaa tgttgatgag ttaagagaga ggccaaatat aatattcatg ggacactgaa 115801 taggcacgag agtttaaatt gcctcaactc actggtgaca cccattctgg ccaggtatcc 115861 tctgcctgag atgtgcagat agaatccaca gctgctcagt aagttaatgt ggaagctact 115921 gctgagacca agccgatttg cccctgtgtg tcttctacag tgcaagtgtt gactccaggc 115981 actgtccatc tgctgagtcc cttgctcaca atcaaacctg gtccttgtaa tcatgcagac 116041 aatggagaga gctgataagg tggaaagaga gataggacct agaaacattg cgtgatgcag 116101 aattcacata gcatggttct tttaaccctc taccctctcc ggtgtcttca cctacaccat 116161 atcttttttc ttgcatctca ctctgtttta 
cgttgattca attaaccaca agctt // LOCUS AF000573 55795 bp DNA PRI 31-JUL-1997 DEFINITION Homo sapiens homogentisate 1,2-dioxygenase gene, complete cds. ACCESSION AF000573 NID g2130646 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 55795) AUTHORS Granadino,B., Beltran-Valero de Bernabe,D., Fernandez-Canon,J.M., Penalva,M.A. and Rodriguez de Cordoba,S. TITLE The human homogentisate 1,2-dioxygenase (HGO) gene JOURNAL Genomics 43 (2), 115-122 (1997) MEDLINE 97386578 REFERENCE 2 (bases 1 to 55795) AUTHORS Granadino,B., Beltran-Valero,D., Fernandez-Canon,J.M., Penalva,M.A. and Rodriguez de Cordoba,S. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) Immunology and Molecular Microbiology, Centro de Investigaciones Biologicas CSIC, Velazquez 144, Madrid 28006, Spain FEATURES Location/Qualifiers source 1..55795 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 1..1074 /evidence=experimental exon 1075..1256 /number=1 mRNA join(1075..1256,7876..7947,8752..8840,13210..13315, 31008..31067,32794..32885,35750..35784,36603..36682, 37289..37388,39212..39336,41968..42072,45085..45211, 50295..50476,55090..55449) /product="homogentisate 1,2-dioxygenase" CDS join(1242..1256,7876..7947,8752..8840,13210..13315, 31008..31067,32794..32885,35750..35784,36603..36682, 37289..37388,39212..39336,41968..42072,45085..45211, 50295..50476,55090..55239) /note="This genomic sequence codes for the cDNA described in Fernandez-Canon et al.: The molecular basis of alkaptonuria, Nature Genetics 14: 19-24 (1996); deposited in GenBank Accession Number U63008" /codon_start=1 /product="homogentisate 1,2-dioxygenase" /db_xref="PID:g2130647" /translation="MAELKYISGFGNECSSEDPRCPGSLPEGQNNPQVCPYNLYAEQL SGSAFTCPRSTNKRSWLYRILPSVSHKPFESIDEGHVTHNWDEVDPDPNQLRWKPFEI PKASQKKVDFVSGLHTLCGAGDIKSNNGLAIHIFLCNTSMENRCFYNSDGDFLIVPQK 
GNLLIYTEFGKMLVQPNEICVIQRGMRFSIDVFEETRGYILEVYGVHFELPDLGPIGA NGLANPRDFLIPIAWYEDRQVPGGYTVINKYQGKLFAAKQDVSPFNVVAWHGNYTPYK YNLKNFMVINSVAFDHADPSIFTVLTAKSVRPGVAIADFVIFPPRWGVADKTFRPPYY HRNCMSEFMGLIRGHYEAKQGGFLPGGGSLHSTMTPHGPDADCFEKASKVKLAPERIA DGTMAFMFESSLSLAVTKWGLKASRCLDENYHKCWEPLKSHFTPNSRNPAEPN" exon 7876..7947 /number=2 exon 8752..8840 /number=3 exon 13210..13315 /number=4 exon 31008..31067 /number=5 exon 32794..32885 /number=6 exon 35750..35784 /number=7 exon 36603..36682 /number=8 exon 37289..37388 /number=9 exon 39212..39336 /number=10 exon 41968..42072 /number=11 exon 45085..45211 /number=12 exon 50295..50476 /number=13 exon 55090..55449 /number=14 BASE COUNT 15929 a 11683 c 11114 g 16963 t 106 others ORIGIN 1 gggccctgtc ctaaggcaca tgcatgtttc aaactctgtg tcatgtttgc tactagccaa 61 agaaaaagac cctacctctt gatgggagga acagcaaagt cacattcaaa ggggcacaca 121 tacaaggata ggaggaattg gagcattttt tgccatctac cacagggtgg tttctctgtc 181 tctctcactc taattaccaa tggttaaaaa ttggagcttc atcttatctt gaaaaatcag 241 ctagcctact ttattcactt acacttcctc cctcatctgt tggcatttga gttgtgataa 301 tgcctaattt acaagattgt ttttgtgaag ttagatatgg aaaaatttga caaaagcaga 361 aactcaatat atgttttctc ctttctgaac accactagtc atgaagctgc tacctgccaa 421 tgaccaagga gagcggccaa gatgaagaac aagggagtca tagaggaggt ggctggagaa 481 taatcccttg gtctgtttta gaagtataga ggcattagga aacaaaactc ttaaagaatt 541 tataagaaaa ctaaacgtac tttaagaaat taaaaggagc agtaggaaac atatcagttt 601 gctctaagtg aaatatctca ggctctgaat ggttcaccca agaacctgag ctcattcctg 661 ggaggcacct gcagaaaact ggttctggtc ctggtccctg tcttcaggtg gctgtgtgta 721 aacatcttac cccagtgtac acatctgcag ctggtaaatg cctgggggtt ttctcactgc 781 tctccaggag aatcagcact gccctgttcc ttcctagcct gagtcaggac tgagcagaga 841 atcctccacc acacacagca ggcactatct gccacagttc ctttccccga tagcttcaaa 901 ttttctgcct tttgaaataa gcctactttt aactggaata aataattggt caattttcac 961 ctcaggtgaa gaggaaccaa gcctctggaa acacttagga acaaactgta aaaaccaaag 1021 gcaattgtgt aaccggttaa ataagcttgc tggactttgt ccctgtgtat gagttagaca 1081 attctttcag 
ctagtttgag tgacgcactg accagtgaag cgcagtgaag cagtgggaac 1141 cggaatatcc aaagagtggt ttgaaggaga aagaagcatt gtggctttat atcctctggg 1201 cctgggtttc ctgaagtcac cacacataga ggagagagaa aatggctgag ttaaaggtaa 1261 gaaaccatct gacaagtttg ctatggcttc ttcagccaaa atttatggaa gtttctgaga 1321 acttcagggt atcagagttg gaaaagactt agagaggccc tggttcaaac ccctgtgttc 1381 agaggcttcc ctgtggtgtc tgtgggcaca gctggagctg aagaacaagg gaaaggtgat 1441 aaaggaaaga aatgaagctc ttattggtca tcccttgcct ttgttcattt caattattgt 1501 cttggaaatc tgtagacatc ctaagagctc ccctccacta cggacatgca cgtttctctc 1561 tatgtctttt tcccaacccc tgctctgaca ccttcccatt gctgcagcct catggaccac 1621 ttccggtgac cagttagata tgcttgatct atttagttaa ttaaggctct gaatcagccc 1681 ttggccagaa gaagcacctc ctctaggaaa tcattccttc tttttagtaa taccaattga 1741 aaagtatgtg tgggttatgt gctgttaggt gctaagcatt ttaactatga attaactcat 1801 aatcctcaaa ataaatctgg ggtatagaga gaggttcttg tttcaattag aaaagtaatg 1861 cttggagaaa ctatgatttg cccaaggtca ctgccaatag atgacttgat tagaagcagg 1921 gtatgtttga atgcgaagtc caggctcaaa aaggggcctt agtggctcct actaaactgg 1981 tatcttaagg caaggaaaac agaatatgtc cctaatatct aaaaccatgc ctagtgccca 2041 ggccaggctt gaatcaatgc tcaatgcatg catgtgtaan tgaacagaac tgcccagaac 2101 cttggctcag gccaagtagg ccagaccacc ccacaacaga catcttccct gaatgcaggg 2161 tttgagaagt gccctaatgc cctctaactg atgctgagtt tgaagaatcc atcagttaag 2221 gggaggtgtt tctgagtttg aagaatcaaa caccttccta attggtagga atcctgaaaa 2281 cactccacac cggagtaaaa gactttggtt ttttatcttt atgcctgact aaaagttaac 2341 tctcattctt tcattcagca aatattttgt gctagtgcat gacactgcag attaatatcg 2401 tagatgaact gagtctgggc taggtagagt caaaaaaaaa tttcctaaag gagcctgtac 2461 ctaaactgag tcttcagaga gaagttggaa ctggcaaagt gaggaaggac ggggtctgga 2521 gggaagagtg agtgcaaagg cacagcacat acagaatgcc tgcatgctct agaatccaaa 2581 actgtgtcag ccaaatttga gttgatggag ggacagccaa actgagatac ccaggcggtg 2641 gttggattta tggatctgga tttcagggga gtgggcaggg ctaggggaat aaaactgagg 2701 gtcatgatgt cagtggtagg agttgaagtc ttggaagaaa gagtagatag aaactttgtt 2761 ttctgatgaa cattctatga 
agaagttcag cagaagttgg aaacatcttt accatttata 2821 aataaatttt atgtaaatcc tatcctgctg tcatgtttaa gcaacttatt atgcccagca 2881 ccaggacatc aatgttaatt gcatgccagt ctttgctctc agtttcaaag gcattcttcc 2941 ctgcctgctg tctaatcttt acaaagaaag gttaactcca gaggtcagga atctgaacac 3001 agaaaaccag attgcatcat attcattcta atcaataaac attatataat ttttaaaact 3061 gtttctcatc ttttatttct tcattttaat gaatatgtca acacaaagcc gtaaatgaaa 3121 tcacatttac acaaaagtac aactggggta agaaaattca aaagataaag taataaaatt 3181 aaatttaaaa tgaaataccc tatgtatttg agtaagttgc ttagcttctc ccaaccttgg 3241 tttcctcatt tctaaaacga agaacttgct taatattcag gctctgggcc ccaccaccaa 3301 atccttgatt cactggctct gggtctgtat gtaacaagct tctggagtga ttctaatgat 3361 tagccagaga caggacctac cattctacgt aactgatttc ctcttgctgc tgtaaaaaat 3421 taccacaaat ttattggctt aaaacaacac aaatgtatta tcttgaagtt gtggaggtca 3481 gaagtctgaa atcggtttca ctgggctaaa gtcgaggtgt cagtgtggct gtttcttctg 3541 gagactctat gagagattgt gttttcctgc cttttccatg gattctgttt cctcttctag 3601 aggctaccca cattccttgg tttgtagcct cttcctcgat cttcaaaaaa atgtatctct 3661 gcttttgtaa tcacgtcaca ttttctcctc ttataagtca aatctccctt tgcctccctc 3721 ttataatgac acttgtgatt acatttagga tacacccaaa taattaggat atctccccat 3781 cccaagatcc ttaattatat atgcaaaagt ctgtttggct agaaaaggta atactcacag 3841 gtttcagggg ttaagccata catatctttc agagtgagcc attagccatt attcagccta 3901 ctgcgataat gtgcaagttc cctttacaat ctatgttcta tgttcccatc aaggttaatt 3961 agaaatatta gactcttttt ccacagaaaa aaaaaatgta gcagtagttt catctctacc 4021 ctgacttctc tctctttcct tcatgcccag taagagaagt gaggtgcacc atctgttcat 4081 ctcatcaggt gttacctaga atcagagtca ggagaaagca cctcataaaa ctactctgac 4141 aagtaaacca tatcctgggg gaccaagtga naattaaaaa gctgtatact cataagctgc 4201 caaagagctg atggtactga agtcacatgg agctcaggaa tttattcttg catttagaaa 4261 gtatgcatat aattctctcg aaaagaaatg aaatcatatt tttctgtcgg atcaaattat 4321 tcagaaatcc gaactgtaga aaacatacaa acacacttgc ttaatgttag ctttttaatc 4381 ttatttttgt acacagtata cttttacata ggcactgaac cagctggaga tgaatgcttg 4441 gattttcaca gctgataaaa gcagagttca 
gattctgtcc ttgagctcaa tgtctggaga 4501 aagcagctgt gaatactgct tccttaattc agatgccttt ggccgtgagg ggcatggcat 4561 tttaaagctg caataaactc tgtacttttt ctctcagggg attagaaaaa ccaattgatt 4621 tgatagtgac tcagcagttt tcaccttcct ctttagccaa tgccctgcag ctaagattga 4681 gaaacggatt cagaactcac acctcttggt gcttggtggc tcccttccaa accatcagct 4741 gttctttcta ctttctttcc tattttttaa agcacagtat actttaacat aggcactgaa 4801 ccagctggag atgaatgctt ggattttcac agctgataaa agcagagttc agattctgtc 4861 cttgagctca atgtttggag aaagcagctg tgaatactgc ttccttaatt cagatgcctt 4921 tggccgtgag gggcatggca ttttaaagct gcaataaant ntgtactttt tctctcaggg 4981 gattagaaaa accaattgat ttgatagtga ctcagcagtt ttcaccttcc tctttagcca 5041 atgccctgca gctaagattg agaaacggat tcagaactca cacctcttgg tgcttggtgg 5101 ctcccttcca aaccatcagc tgttctttct actttctttc ctatttttta aagcacagtn 5161 tgaatcaagt gcttcagaac catttatact aactcgtgtt ccaaacacat aaccctgaac 5221 cctcttctaa ggccaccagt tacaaagaat ggggaaggtt aatgagtaac tccaccagtg 5281 cctgccagaa agctcagctg tactccagcc ttccttcctt ttttgcagag agaaacaatg 5341 gcggcagttg aacaaacgga aggcactgta atgcccagtg gcttcagttc acagggtggt 5401 tctaaaattc cattttcaca gaattgaaga taatttgggg ttattaccaa ttctcagcag 5461 ctcaaattca atcccgttga tggtgagcag ggggttcatt ttgagagtag aaaattggat 5521 gcagtggaga atcagaatgt tggatggtct ctgccatttc cccaaagata tcatcctaac 5581 tgcatgctgc tgcttttaac aaatcaggat acaacttagt attttttgtg ttgacaaatt 5641 ttcaaatata caaaatattg agtctgaagg caaaaagaat aatttttcac ttccattgct 5701 tcctaggtca taacattcac atctgatatt tcttccaaat acagactgtt gttggcctga 5761 tcatcactca cacatccagg cttgttacat ctccacgcca ctgagtatca gggaacctgc 5821 cccaatattc acgtaggttc ttttctattt tccctaagcg tcggccaact ttagaaataa 5881 agggacagag tacaaaagag agaaatttta aagccgggca tccgggggan gcatcacatt 5941 tcggtaggtt ccgtgatgcc ccacaagcca caaaaaccag caagtttgta ttagggattt 6001 tcaaatgggg aggcagtgtg caaataggtg tgggtcacag acatcaagta ctttacaagg 6061 taatagaata tcacaaggca agtggaggca gggtgagatc acaggacccc aggaccgagg 6121 cgaaattaaa attgctaatg aagtttcggg caccattgtc 
attgataaca tcttatcagg 6181 agacagggtt tttaggatca actggtctga ccaaaattta ttaggcggga atttcctctt 6241 cctaataagc ctgggagcgc tgtgggagac tggggtctat ttcacccctg cagtctcgac 6301 cataagagac aggcgcacct ggaggggggc tgtttataag cctatacctc ctggntcgta 6361 ttctctttct cagggatgtt ccatgctgag aaaaagaatt cagcgatatt tctcccattt 6421 gcttttgaaa gaagagaaat atggctctgt tctgcctggc tcaccagcag tcagagttta 6481 aggttatctc tcttattccc tgaacaattg ctgttatcct gttctttttt caaggtgtcc 6541 acatttcatg ttgctcaaac acacatgctg tacaatttgt gcagttaatg caattattac 6601 agggtcctga ggcaatatac atcctcctca gctgacagga ttaagagatt aaagtaaaga 6661 caggcataaa tcacaaggat attgactggg gaagtgataa gtgtccatga aatctttaca 6721 atttatgttt agagattgca gtaaagacag gcataagaaa ttacaaaagt attaatttgg 6781 ggaactaata aatgtccatg aaatcttcac aatccacgaa cttctgccat agcttcagct 6841 gatccctccg tttggagtcc cagacttccc gcaacaactg aaaactctgg gaaggcctac 6901 ccttcaccca cttccaccct tcttatatct cttaaggagc tgtgttcaga tggttacatt 6961 tcgctcgctg agcaagactg cattttcatt atcctgggct aaaatcaatt ggtatttaag 7021 ttagagcaaa aagggttcta accttattta cagcagcaca catacactta cattagaatt 7081 ggctttaagg gaaaaattca atcaatcgac aaatttccac tgattacctt ttatgagcca 7141 agactgccat agtctcagcc ttcatgcagc ttataggcta gcaaaaaagc cagacagtgc 7201 acaagaactc acaagtgtgt tgagtgttgc aagagtaagt ttcaggggct ctccatgact 7261 tacctggttg agaaaccctt taactttttt tttttctggg tgtaaagtga agagatggca 7321 atcagagtag gttccattgt gtcattgtga aaagatgctg ggctggcgtg tctcctgaag 7381 gccacccata tntttaaact ttattttcat gcattcgcac aaatatagat gtgttccata 7441 gatgaggcag atagcagagg ctttntgcct acagcaaggc tagtttggag agggttttgc 7501 tgtcagggtt ggacccacac accttttcat agatgctttt tttttttttt cccagagttt 7561 gaaagggaaa tggagtttca agttcaggga ggttacagtg tagacccaca aatcagctaa 7621 gtacagctaa gggtggagna gtttatgcaa tatccagcac tcttctgatt aattagaagt 7681 ctttaaaatt caccagaaac catattgtct tccctaaatg attctatatg tcaagacaca 7741 gtaaagatat tatatgtgga acatcataca caaacatagg tccatataga aattcttacc 7801 ttagtaaact gccaattttc tgaaatgctc tttaaatcct gctaatggtg 
gcatttgtgt 7861 ctttgtcacc tacagtacat ttctggattt gggaatgagt gttcttcaga ggatcctcgc 7921 tgcccaggtt ccctgccaga aggacaggta tgagcaaata aagtgcttag aagagctctg 7981 attcctattt cctggatgac tagctttcag gccaaagtgc ctccacccac cgtggttctt 8041 ccaatctagc ctgtgaactg tctagaggtt tcccaagtca taggggcaat gaagatacag 8101 agcaatgaca acacaaagat taaaattgga ttggtggcac agcagccaat ccaaactagt 8161 aaaattaact gtagggtttc cagtaggatc cccaaaaatc cagggttaaa gagaatccct 8221 catattcccc ttgaacacca tggatgtgag cattatgata aaataacaag gtgctttcac 8281 atgtatgcac aaagatgttc atactagtgc tggttagaaa agcaacaaca cgggaaacaa 8341 aaatattttc aaaacttttt gcaggccgta tcatagcata aaaaaatcct catttcttaa 8401 aaaaaaaagg aaaaaatgcc tgattattac agggttacag gtatgatagg gtcacaattt 8461 tggaggataa gaatgcaagg aaaaatacca acatgtgaat gcttggttat ttctggttgt 8521 gagataatgg gtgatatcat atagtttata cattttctac aataaatata tattactttt 8581 ataatcgggg ggncaagtca catcaaaagt tgttacaaga aacatatcgg gttgccagat 8641 ggtcatgtgc agtcgcccag agcatcccca tccccctact gagttggttg gtgggaaggt 8701 gggatgcttt tcggatggga gtaatattgt ttattgcatc ctttgtttca gaataatcct 8761 caggtctgcc cctacaatct ctatgctgag cagctctcag gatcggcttt cacttgtcca 8821 cggagcacca ataagagaag gtacaaggat tagatgaatt ctgacctgca gactgtgggt 8881 actatgacag ggactttgtg ctgcctcctt ccccctgggc cacagcctcc cagagctgcc 8941 caagatcctg cttcttatag gctatggtct gggtcatggt cagaagagaa tgaacttcct 9001 gccagctagc accaagtgac tggtggaaga aggtgaagga aagctgcaca gagctgggaa 9061 aatacacatg tccagttcca tttacaaatt acctcctata ttcagaagga aatattccta 9121 atagtttggt cttcttctaa agactgtcct gttgagtttt ggcttggggt ttacctgttt 9181 tgttttgttt ttataatgac taagtcacaa ttatggaaga ttcagagcaa tatctttctc 9241 agctgactag aggtatgttc tttgcaagaa aatgttgggt aaacgaagtt tggatagtta 9301 aaaantcaca tattatggcc cactgtggtt tgaattaata aggtttttac tctggtattc 9361 gctttgctag aaaccagcat attcatttac atactgtgcc atgttccatg tttgctctaa 9421 tcntacatcc aggtcaatga acgtaatccc ttgattaact catatcaggc aggttggatg 9481 cagtatgtgt tatagttttg ggaggcagta agtgttatga tttaaacaaa tagtgtctaa 
9541 ttataaaaat tttaggctga ggtatttgtg aagaggaatg ctgctggtat tagagactcg 9601 gggagaaggg catgtggtct tgtactgctt agtagggcag cctgggttta tctccacagg 9661 tccttctgag aagcaaatga gatagtaagt aaaaaactat aatggaccac acaaatgata 9721 aagattgcaa ggtttaatca tggccaccct ggaggcaggc cccctaagta taaatcctgg 9781 ctccttttcc agctgtgtgg ccttgggtaa atcatctgcc ctcttggatc ctctagttct 9841 tcattcgtaa aacaaagata attatggtac ctatttcata gctgtggtga ggattaaaag 9901 aaaggatgtg tgtaaagcac attagcttag cttagcacat tggcacaggc acatggttag 9961 tgctcagtta atgctagtta ttgtcatgat aactatcatt aagtattgcc atcatagcag 10021 agtttctgcc tcatttccag ggtatctcta ggaggctgtc aagagaactg ggcatgatat 10081 gtggcaagtg ctatgttaca atgtcagtat caacagtact tttctctcaa tagatatatg 10141 cgtgggaaga ggaagagata gagaaagatt tctatacatg tcaaaaatca ggggataatt 10201 ccttcactat ttaatcatct ctctctctct ctctgtaagc ccaaataggg taagataagc 10261 acgcagataa ttctgtccat tctaatttat gacaacctgt cctctatgac tatgttcttg 10321 actttatggc ccattcccta acaggccacc tggtctccgg attctgcagc ctgtccccat 10381 tattcancaa tatcttcagc cagcaggagc tgagctttgt tccagcaggc tgaaaaggag 10441 ctgttggcat ttcattaaga agctggcact gttccctgaa tcagaattaa atctctgttc 10501 agggcaatgt agtactgcct ggggagtttt ataaactcag ctaaaaaata agggacacta 10561 ggtatggctc acccatctat ttagaagaac tctccaaaga tatccacaag aattttgtgg 10621 aaatgcttcc cacccaaaat ctccagtggg aaaatgggac cagtcttatc cataagtagg 10681 ccagcctgaa attcttagtt actactcatc tgaaccatag catacagggc caccacaaaa 10741 ctattaaaga gtctatctac ctacctacgt acctacctac ctacctatca gggattaatt 10801 tttatcagat tagctagctg attgatccct tgggtcttga aaaggcaaaa gtctggagag 10861 ataatttgga aaatgtggac tgtgataaaa attcagaaga ttaagagaag aaaggcaatt 10921 taacaaacac atgataagcc agcagctact gtatgccagg catttttggt acacaaactc 10981 tctgagttct cacaacaatc ttaggtgaca aggatgaaga acataaggtt caggaagtct 11041 agtaactctc ccaggatcac acaggtggtt ccaagacata cacacatgct ctgaacttca 11101 tatactggga nctggagaag cttagtaaga ccaaggtctg aagtcncgtc ctcnagaaat 11161 ggagaaatca aatggaaact aaacagatct agcccttttt cccccagatg 
gctcaatgtt 11221 gggccaggcc aactcaactc actgaggtaa actgtccttg gtcttgaaag ggccaaagcc 11281 tgttgattca ggtctggtct tccgttgtcc ttctgtattt tgtcagggca gtaccctctg 11341 aaaggacttg ggtccctctt agtgancttc aatgtccctt ccagcttctg gaaggtctta 11401 tcctggtgtc ctttcaaatt cctttatcaa agtcttcttc ccttcctgaa gcctgaaatc 11461 tggtcctggt agagaggaca ccctttaaaa agaaaagaaa gaaagacaaa ttaaggcata 11521 aatgtactac aataaaacat ggacatacac tcttataatt ttagagtaca gaagtgttaa 11581 tttattttta ttttatttta agttccaggg tataggtgca ggatgcacag gtttgttaca 11641 taggtaaatg tntgccatgg tggtttgctg cacttatcaa cctatcacct aagtattaag 11701 cccagcatac attagccatt tttttaatgc tctccctccc ccaatcccac cccccaacag 11761 gccccagtgt gtgttantcc cctcactgtg tccaagtgtt ctcattgttc agctcccaat 11821 tatgagtgag acaagcagtg tttggttttc tgttcctgct ttagttcgct gagaataatt 11881 ggcttccagt tccatccatg tccctgcaaa ggtcaagatt tcattccttt ttatggctgc 11941 atagtattcc atggggtata tgtaccanat tttctttatc cagtttatca ttgatgggca 12001 tttgggttga ttttatgttc ttgcattgtg aatagtgctg caataaaaaa aagtgtgcat 12061 gtattttcat aacagattga tttatattcc tttgggtata cacccagtaa tgggattgct 12121 gggtcaaatg gtatttctca ttgtaggttt ttgaggaatc accacactgc cttccacaat 12181 ggttgaaata atttacattc ccaccaacag tgtaaaggca tttttatttt tccacagcct 12241 ttccagaatt tttttttctt ataaattaat atcaaaaaga caaacccaat agaaaatgat 12301 aaaggatttt taacaagtaa ttcaacagaa aaactataaa gagccaataa acaaatagac 12361 ttgttttttg gaaatatttt gtattttcta attcataata aaggaaatac aaaataaaat 12421 gagatactat ttttcaccta aaacactaaa atttttaata atactccaga tttgtgaaga 12481 tgtgggtgac caggcactct cacacgttgt tagtatgata ataaagagtt aaaatgatct 12541 gataattttg tttagaaatg cactggacgt atctgaaaga atatataaga aatatataag 12601 aaaatcacta agctctgatg ctctgaggaa tagccctgaa gaaactggag gacgaaggtg 12661 ccatttacct ttcaggactt tttaattact tgaattattt ttacctttgg catgtattac 12721 ttttgtaatt ttaaaggaga aaattgattt actatttcaa aaataacaaa gataccaccc 12781 atattctctc aagctgcgta gaactcattg ttcttgcctt ccagagcaag gactagcatc 12841 ctatgataca cacataagct aatgtttttc 
tatggatttt cctacacaga tatccctcaa 12901 ggtacttccc cttcccttat agtttttaga aaagacaatt cttcattttt tggcagcatg 12961 gaaataacca tgagtcagag tccatttgaa gcatgatggg tggaatgaaa aagtgagaag 13021 aggccagggc tgtgggcagc agatggccat gaaaagtgat gctgtatcat tgcttcactg 13081 cttcacacat tagagcctac aggtcgttgt tgacttgagt gtcttcctga tggtactgcc 13141 tgagccattc tgtgtatcac tcagaacatt actctaaggc ttgtatatct tgtatgtttt 13201 tccctctagc tggctgtata ggattctacc ttcagtttct cacaagccct ttgaatccat 13261 tgacgaaggc catgtcactc acaactggga tgaagttgat cctgatccta accaggtaac 13321 ctggtcctgt gaattgaagc ttatcatata cccaaagcct tgactagaaa tacctaaata 13381 gaataccatg gtcattggaa aaaataaata gttttttaaa gtattatttt aataaatgct 13441 aaagtgtgtc tgttttctgc tcaaacttgg gtggtgtatt agtccattct catgctgcta 13501 tgaaaaaata accaaaactg ggtaatttat aaaggacata tttttaattg actcacagtt 13561 ctgcatggct gaggaggcct caagaaactt acaatcatgg cggaaaggga agcaaacaca 13621 tccttcttca tatggcggca ggagagagaa gtgttaagtg aaagggggaa aatcccctta 13681 taaaactatc atatcttgtg agaactcact atcatgaggg taaccatccc catgattcaa 13741 ttacctccca ccatgaatgt ccctcccaca aacgtgagga ttatgggaac tataattcaa 13801 gatgagattt gggtgagaca cagacaaatg atatgaggtg ggaattttgg aaatgatgga 13861 ttgattatgt atatacatat tgttaaaaga aaaacctttg acaaatctaa tttaacagag 13921 tttaattggg caaaggacaa ttcaccaatt attctacctg tagccaggat aggttctgag 13981 agattccagt gcagacacta gtagaagacg atttatggac agaaaaagga aagtgatgtn 14041 gagaaaacag aaatgaggta cagaaatagc cagattggtt gcagctcggc atatgactta 14101 cttgaacaca gttttaatag tgtgtcacca tgattggcac aagagtaggt tacacatcca 14161 attaggttat ggttcactat gtatggagaa acctttaggc agaacttaaa atntgtgagg 14221 aggcaacttt aggacaaact tgattaacaa tctgcgtgtg tgtgtgtgtg tgtgtgtgtg 14281 tgtgtgtgtg tgtgtgcaca gcgacacata agcttgtatt tctctnccct aanggcatat 14341 aganngacac nagagtagnn cngacttact aaaacngcnc nggtgaatng gatgagcgga 14401 aattttatgg gtgggtcaaa atattttata ttttatnggt gggtcaaaat atctacacgc 14461 tcaggggttt tttttgtttt taatctaaac ttaccacaaa atatagaaaa tacatttaat 14521 acagacatct 
ttgaagnnag gcctttgaaa tctggcatag gggtgagagt cttgctatct 14581 gggctgaatt atgcaagtct attgtggtca cagatcacta tcacagtctc ttcagaccta 14641 ggacttccat tctttttcta ggagtttcct ccttttggtc tantaatagt gggcgttttc 14701 tctaattatc gttattactt acgtactctg cttgcagagt atataagata aaatgagaca 14761 gaaccagtac tatagcaatg agatagaccc agtgctataa acgactgagc tcttttttta 14821 accttctgaa aaccccagtg tattgccttt gtccctgaat ttaagtacat gttgctgata 14881 ataggtatgt agggcaggaa gaaaaacaga cctgggtttg catccctgcc catctactta 14941 ctagctgtgt cttaagctat gatgtttcaa ttgtaaaatg ggatgataac atttgcctca 15001 cagtgctatt gtgaagatta tgagagaata ggtgtgaacc tgcttcatac aatgcctggc 15061 atgtgatagg caaaaaccat gattctgata cagctattta tatttacctg aatttacatt 15121 attcaggtaa ctttatttct tcattttata ttatttttta ctctgaaggc ttcttctgcc 15181 ctctttccct ccctcctact ctctctctcc tccctctctc cccaccccgc cactttccac 15241 aacccaagga aatctcctgc ttcccctttg tttgccatga cattttgtct atacctgcca 15301 atctccctcc cctaagtgct taaacttttt ataaaacaaa agtcaaaata atttataaac 15361 acctttggtt tcctgcccag atctctctct tgtcttaatt atggaaatct cagcgaaata 15421 agcatctgct tgaatgttac caccctctct accagggaca ggaagggagg gaggcattta 15481 gggtggaaaa atgtgaattg gtatttaaaa tactatctac ctcatagcca ctttgaaact 15541 aagatatttt gtttaaataa ttgaccctta aacttgtatg aagtttaatg cataaagcag 15601 acatctccct ggtccccagc tggcctatat attgccttta agcacattgc aaaatgcagc 15661 ccctatttta acccccaacc ccttagactc agtcccccac acaaagggac acagacgagg 15721 cattccgctc ctccctttcc ccttcagtgg gacaaggccc ttgcacaggg cacagcttgc 15781 acaaccaacc tggggaccct gacttcacct cattcctgag tttcatgtgg attcaagtaa 15841 tatgggtata gtaattatgt atatgaagtt aactaaatga gggattataa gaaaagtcca 15901 tgggcagatt tggataaatt taaattattg tttatttata atgaaaaata attatttatt 15961 attattttgn ttcaagatag caatcaaatc agtagtggtt aacagggtag gctggggagt 16021 tccagggctc ctgcctgctc atgtcgatca caacaatttt ctaagcctct caaccttaat 16081 aaatcttcat agatctgttt atagttgtgt ggacaacagt ggagccatac ttcttttaca 16141 tagttttatt cggggtggga ggttaggttt taaacctaac ctttaaagaa aatatgtgat 
16201 gtggctntac tttgagtgtg atgtgtaata tgctcagtaa tataccttta ttatatatta 16261 ttatttcctt gactatcatt gctacatttt atttaacatg aatacaccct ttcataggaa 16321 ctattttgga gttcgtgtga catatccatg gaagacaatc ttgttctaat agaatttttc 16381 tagctttgtc tttttttttt ttttttttga gacaagagtc tcaccctgtt gctcaggcgg 16441 gagtacagtg gtgtgatcac agctcactgc agccttgact tcccaggctc agatgatcct 16501 cctacctcag tcccctgagt agctgggaat acaggtgtgc accacaatgg ccagctaatt 16561 ttttgtattt ttggtagaga aagggtttta ccatattgct ccaggctggt ctcgaactgc 16621 aagggctcaa gcaatgtgcc tgccngagcc tcccaaagtg ctggaattac aggcatgcac 16681 cactgcaccc agcttagctt ggtctgattg tatgagtaaa atatacttat aaaaacccaa 16741 aagtattgta gaaattaata aaatattact ggacgtctcc catcattaag tgagagtcaa 16801 gtatataagt tgtacatgtt aattcattat ttcataaata gttgacatac attttgacca 16861 ggaagtttga agcaactgga ttaaaaagct tgagttaata cagaggcatc aaaatagctg 16921 tatcagggtg ggcatggtgg ttaagcctgt aatcccagaa attggggagg ccaaggcagg 16981 cagagacctg agactgggag tttgagacca gcctaacctg aggtcaggag attcgagaca 17041 gcttgaccaa catggagaac accatctcta ctaaaaatac aaaattagcc aggtgtggtg 17101 gtgcatgctt gaattcccaa ctactcggga ggctgaggca ggaaaattgc ttgaacccag 17161 gaggcggagg ttgtggtgag ccaagattgc accattggac tacagcctgg gcaacaagag 17221 caaaactcca tttcaaaata ataatgacaa taataataat agctgcattg gaagtttttg 17281 cttgtttttt agcagaccct cagcccatat ttgttcccct tctccccgct tcccatgtct 17341 cccattctaa cctatggtta gaatggcttt gcttttgtcc ttgaacttgt tgcctttttt 17401 ctgttttgaa ttccctgaac actttctata tggtagccac caaatgaatt tcttctccaa 17461 tatcttccct gctcctttga gaccttgtct tcacacacac tgggagagat gtgcctcaga 17521 attcaatagc gtctcagtgt ctctgtgaac accggctcca accaggtagc agtttattat 17581 ctttctgact tgtgacctct gtcacctgcc aactacttct ctctccatga taaaccctgg 17641 ctgatcttgc aaaatctacc cttttgtctt ttcttttaca ccttctttac aagttaatat 17701 ctctttccaa ttttatactt cattatttaa ccacagcatt ttgcattcat cactttctcc 17761 ctcataacct cctccaaata tgtactccag actctcagca agaaaggctg agaaaattat 17821 atttaggcac cctctgacag tccttcccag ctgaagcctg 
tgtgataggt atgtcttcac 17881 ctgtcataga ccaaatccaa tacttcatca gcttcaccaa ctatccacat tagtcatctc 17941 tgggaatttg aagacctgag gactgcctga tgattcctcc agtgtaacat ttaatcccaa 18001 aggcagcatc ttgcaatatg tgatcatcac tgtcatcctt gataccctct caggaagact 18061 gggggctgct ccccctaaag ccttcatttt ctttagtgta tatttaatca gttatccaag 18121 aattaatgca acctcttttt gagtgtattt aaattcagcc tgaggacata aattcaatta 18181 attaatacct cacataagaa gtatacctaa ttttatgtgt ctgcctctaa tttctgaaat 18241 ctcagaattt aatgaactaa tctacatcca gtttaaccat gattttacag atttcagcta 18301 gaattcctga attagcactt ctctttaaga gtcctttaaa atttattttt cttattattt 18361 cattctaacc tcttgattgt tgaaaaatca gattcttgac tcagttaaga ctcagctgan 18421 aaaattacct cttggttctt aaccagtata atttggatca agtgagaaat acctcaaagc 18481 taattaaaga acatatcata aaatgccttc tgtctctagt acatgcaaat tataaaaaaa 18541 agtttcaaaa caatataccc ttacagggta tttgctgtct aatttgtcaa actttccagg 18601 ttttgccatc ctgaaatccc cttccctcca caaatgtctt ggatttttct gcctctcttg 18661 cccaagtctg tcctagtcac agccagactc cttgagaaat aatctaaacc cactttatca 18721 ccctcattca ctctttaacc ctatgcaatt cagcctgccc agatgactct attgaaaata 18781 tttataccct ctttacacaa tantgcatca ctctctaatt aatacatatt tttatttaaa 18841 acttacatta aagtgtcaat gcccagtaat tccactaaca cagagttaac aacttntaat 18901 gttttgggtt ctttccagtc ctttcttttt gaacatatat gtaaataaat aattcctagg 18961 gtcccatgta gccagtaacc aagagtaatt taaaataata tntggccagg cacagtggct 19021 catgcctgta atcccagcan tttgggagnc caagatgaga ggttcanttg agtccaggag 19081 tttgagacca gcctggcaac atagcaagac ctggctctac acaaatgtga aaaaaaatta 19141 gctgagcatg gtggcaagca ccggtagtcc caggcactca ggaggctgag gtgggaagac 19201 tgctcgagcc tgggaggtca gggctgtggt gagccctgtt catgccactg cactccagcc 19261 tgggcaacag tgtgaaaccc tatctcagaa aaaaaaaaaa aaaaaaagga agaaaaaaga 19321 aagaaggaaa gaaagaaaga aaagaaagag aaaagaaaag ttaatcttta cttgtttttc 19381 ctgaagacat taaacagtat ccaatatatg acccacagca acacttttta aaatcccctt 19441 tttccagttt ttcccagact ccaaccctct cacatctgtc ttctctcacc tcacctctca 19501 ccttccccaa gctcttcatg 
ccttgtcacc attgattgtt ggcttctctn cttcctccct 19561 gtacattctg cctctagaaa atgatacctg cactgtagat cccctacccc atttggctgg 19621 gatcacacat ggtgttatgt gggataacac acacataaca gtgacctcca tggcaatact 19681 ggagttcaaa ggactccagt gagaataggc aaatctgaat tttccgcaga gatcagggaa 19741 gggaagagga agaggctatg aatacactta taatagtgct tttaaaaaaa aatcagttac 19801 atatatattt actataatta agctgatggt gtaatgcagt cttagattat ttttatttaa 19861 atatcatatt ttgcgccttt ttccatgtct ttttaaactt cttgagtgta gatatttttg 19921 atattatatt ataatccaat aaatgtttat aaaacaattc ctaatattgt gagatatttg 19981 aattttttcc atttttgcta ttatgtngta aatccacgct aaatataatc cttttctcct 20041 ctacatccca gcctggaaca acagacctcc ttcttttctt cctgaatccc cttttttccc 20101 tgggttccat gacattgcat gattcccaaa cctgcaaaat aagtattctt caaggtcctg 20161 aactaaaatc ttgatatagt ttgactgtgt ccccacccaa atctcatctt gaattcccac 20221 gtgttacggg agggacctgg tgggaggtag tctatctgca caagctcttt ctttgcctgc 20281 tgccacccaa atganatgtg actttctcct ccttgccttc tgctatcatt gtgaggcttt 20341 cctagccacg tggaactgta agtccaatta aacctttttc ctttataaat tactccttct 20401 cgggaatgtc tttatcagca gcataaaaat ggactaacac aactctcacc tcccccttgt 20461 catcatctcc cctctgagag aaggggaagt acttactctc tctcttcccc tcctcttcct 20521 tactgtcttt cttatcctcc tcttttcttc tccttctctt caccccaacc ttccccttcc 20581 cctctgtatt gttctgctaa agctatcata acaaaaaaat actgggtggc ttaagcaaaa 20641 aaaatttact ttctaacagt tctggattcc aaaattgagg tatcggcagg gttgctttct 20701 tctgaggcct ctctcattga cttgtatctt ctccttggct cctcacatca tcttccctct 20761 gtntctatag ccaaatttcc tcttcttata aggacaacaa tcatatcana ttagggtcca 20821 ctctagtaac ctcatttaag tttaattatt cctgtaaaaa cccatctcca aatatagtca 20881 cattctgaaa tacagggggt taggagttca acatatgaat ttggaggagg aacacaatct 20941 agcccataat tctctccttc tgcttctttt tccctctaaa tatctactaa ggattcccaa 21001 tcttttggat tctctttgcc ttanagatga agggatagtt gttgataaga aataatcaag 21061 tgggacaaac taacaggagg aactccttga gcaatagagg attttcctag ctcactacac 21121 tgaatcaggt gccaagtcca tagaatggtg atgcctgagt atacataaga gttaccaggg 21181 
gacttgttga aaatgtaaat tcctcagtct tgctccccaa agatcccagt tcagtaggtc 21241 tgaggtggta ctcaggagtc tccccagatg actcccatgc aggttggtgc tggactgcac 21301 tttatgaaat tctgctgtaa agtgtgcttt caggatctct ctggttcctt ttttcctttt 21361 cctacctcca cctggttcag acctcatttc ttcttaccta gactatgtta atgaccttct 21421 acctgattcc tatgtcttcc cctccctatc ccacaccgtg ctgccagaat cagcgtgcta 21481 aattatcatc tcaaaaatct ttaaatttaa ccatcacccg ccccctccac tgcccagcca 21541 atccagtgca ggcttccaga taagagatca agagctttga ttcactatca cagcctttca 21601 gactctccac tcacttccat gtcccctacc tatcagtcac actagaacca tcgtcatttt 21661 atctacctgg gatttcctgt tcttccatca caaatactca cccatctaaa tctcactcat 21721 gatttatgat tcattttaaa taatactttt ttcattatgc cctttcaata tttctcagag 21781 tatttaaggt tctatgtctc tatcatcttc atctgatctg cagtatccag tacatgcctt 21841 cttccaaata gaatcgcaag tatttgttga atttactccc tgatagttca gatcctatca 21901 ttctataaat aaattagatt gcatttctct aaatgcattc agtggtgtgt gccaacatta 21961 cagtgcacat tcccctgctc agcccttttg gacagcttgc tagctctttt tacatttcat 22021 ctccatcaac ttacatttca cttcttgcaa agatattgga gctgtctgca aaccagatga 22081 tttcattgct tatatctctt attttccata tataagaatg ttaaataggc cctgtgccag 22141 gacttattcc tgagagaaca tttctattta catttctctt ccaaagttat ttgcactgat 22201 ccacttcctt gtttttctat catgaggttt tcacatttcc aggatcttct gtnatggatt 22261 acctgaggga aaaaacaatg tcntgttcac ctccagcatc tagcataaaa tctggcacac 22321 aganaggttc aataaatact gtttgaaata attaatataa ggggattaga caagaaacnt 22381 taaccctgaa ggacctaagc agggttngtg tcccaggaat canactggcc agggtgattc 22441 agcactggag gaacagtnaa caaaggtcaa tttggaaaac caggttcaag accnttnaaa 22501 ggagggaaag cttgggttta ggagccaagg agatcaggaa taatttggga aacaggtcaa 22561 gacaggctaa gatcaaaggg ctcagggtgt aggttagtaa nagggtctga aaaagtgttg 22621 gcatcagttc ttagtaaaaa aaatttaact gttctaaacc atcagttttt tcacttctaa 22681 aatagagatg ataataataa tcatagggag cctcagagag ttgttgtgag aattaaatgt 22741 catccaacaa atatttattg agtgactatt atgttctata cttcgtgcta ggtgctaggt 22801 aatggattta aagctcttag ctttacattc aaggacaagg tcaaatgaga 
atccaagttt 22861 tcatccttga tggctcatct tcatttgttg ctcatgaaag acttttgtta tgttttgttt 22921 tgttttttga gtacaggctg gagtacagtg ttgtgatatc agctcactgc aatctctctc 22981 ttgggttcaa gcgattctcc tgcctcagcc tcccaagtag ctgggattac aggtgcccac 23041 caccatgccc ggctaatttt tgtattttta gtagagacat ggtttcacca tgttggccag 23101 gctggtctcg aactcctcat ctcaggtgat ctggccgcct cgggctccca aagtgctggg 23161 attacaggcg tgaaccacct cgcttggcct gaaagacttt tagaaatcta aattacagtg 23221 acttggatcc tcattatcta tattcttgcc tatcacctca aagaactcca acggatcaac 23281 caaagcctga tttctctcca cataattcat gctacctttt tttcctcaaa cttccaaagt 23341 ttctgaagaa ttcaagtgag ccactgtttt attttaatcc ttcctatctt tcttagtctg 23401 gactgaaact caccaccaca aattaccctg gatctctctg ctaacaacta tttatacatt 23461 gtattagtcc attcttgcac tgctataaag aaattcctga gactgggtaa cttacaaaga 23521 aaggaggctc atagttccac agggtgtaca ggaagcatag tggcttctgc ttctgggagg 23581 gcctcaggat gcttccaatc atggtggaag gcaaaggggg agcaaggtat ctcagatggc 23641 agggagcaga gcaagagaga gaagaaggaa gtgctacacg cttttaaaga ccagatctca 23701 tgagaactca ctgtcatgag aacagcacca agaaggtaat gctaagccat tcctgagaaa 23761 tccaccccca ttatgcaacc accccccccc tcttgctcct gctcccatca tatgaaatgc 23821 tcccatcata tctttgcctt ctgccatgaa ttgaaaactt tctgaacctc tccagaagca 23881 gaagctgcta tgtttcctgt gcagcctgca gaactgtgag ccaatcaaac ctcttttctt 23941 tataaattac tcaatcttag gtctttgttt atagcagtgc aagaacgaac taacaaagaa 24001 aattggtacc aaggagtggg acattgctat aaagataacc tgaaaatgtg gaagtgactt 24061 tggaactgga taactggcag aggttggaag agtgtggagg gctcagaaga caggaagatg 24121 agggatagtt tcaaacttcc tagagacttg ttaaattgtt gtgacaaaaa tgccaatagt 24181 gacatgaaca ataaagtcca ggctgaggag atctcaaatg aaaatgagga acttactggg 24241 aactagatca aaggtcaaac ttgttctgcc ttagcaaaga acttgactgc attgtgttcc 24301 tgccctaggg atctgtggaa ttctgaactt gagaatgatg atttaaggtg tctggtacaa 24361 gaaattttta agcagcaaag cattcaacat gtgaccaggc tgttctaacc gcttatgccc 24421 acatgcatga gcaaataaat gacctgaagt tggaacttaa atttaaaaat aagtttagat 24481 ttaaacctat atttaaattt taacttgtat 
ttaaagggaa gtagagtgta aaagttagga 24541 aattttgcag cctggccatg tggtagaaaa gaaaagccca ttttcaggag aagaattcaa 24601 acaggctgca gaaatttgca taaataaaaa ggagccaaat gctgatagcc aagacaatgt 24661 gggaaaggcc tcaaaggcat ttcacaggcc tttggggcag ttcatcccat cacaggccca 24721 gaggcntagg agggaagaat gattttgtgg gccaggccca gggccttgct accctgcaca 24781 gcctcaggac actgttttct gcatcccagc tgctccagct cctcctgtgg ctcaaaggga 24841 tccaagcacc gcttcaaagg gtgcaagcca taagccttgg tagcttccat gtggtgttaa 24901 gactgcaggt gcacaggata caagagttaa ggcctgggag cccccaccta gattttagag 24961 gatgtataga aaacttaaat gttcaggcag aagccagctg caggggcgga gtgctcacag 25021 agaacctcta ctagggcagt atggaggggg aaatgtgggg ttggagctcc cacacagatc 25081 cccgctgggg cattgcctag tggagctatg agaagaaggc taccgttctc tagactccag 25141 aaaggtagat ccatccacag attgcaccgt gcacctgaaa aagccacagg cactcaatgg 25201 cagcccatga gagcagccgc aggtgctgaa tcctgcaaag ccacctgagt ggggctgacc 25261 aagaccttgg tagcccaccc ctcacaccag tgtaccctgg atgtgggaca ttaggtcaaa 25321 gagatttttt tggagcttta aaatttaatg acttccctgc tatgtttcaa acatgcatgg 25381 ggcctataaa ccctttgtcc tggccaattt ctcccttttg aaatggaaat atttacctaa 25441 tgtctatacc cccattgtat cttggaagta actaacttat tttatntttt acaggctcat 25501 aggtggaagg gacttaactt gtctcagatg anactttgga catttgagtt aatactanaa 25561 tgagttaana cttttgggca ctgctgggaa ggcatggtta tattttgaca tgtgagaagg 25621 acaggagatt tggaaggagc caagggcaga ataatatggt ttggatctgt gtccctaccc 25681 aaatttcata tcaaattata atccctaatg ttggaggtgg ggcctggcag gaggtgatag 25741 gatctggggg acagtttctc atgaatggtt tagcaccatc cctgttggta ctgtcctcat 25801 gatggtgagt gagttctcat ganatctggt cattgaaaag tgtgtggcac ctcccacctc 25861 tctctctctc tttctttctc tctctctttc tctctctctc tctctctcac tcctgttcct 25921 gcaatgtaag ccatgccttc tcctgcttca ccttctgtca tgattgtaca tttcttgacg 25981 ctcctcagaa actgagcaga tgccagcatc atgcttccct gtacagtctg aagaactgtg 26041 agccagttaa acttcttttc tttataaatn acccagtatc aggtaattct ttatagcaat 26101 gtgagaacag actaatatgn acactatgat gactatttca ccgagaaatt acttcaaaat 26161 tcttggatag 
gtggtactga cttctaatga natgtctgtc tatcctcagc cagtgttgtt 26221 gagagtactc aagtcatcca atctagttga gtccatggga aatttggagg gaatctcagt 26281 aaattaaaga atttcagttt tgaaagggan acttggatca tctagttcaa ctcccagtat 26341 tctaactgga aatatttagg ccaattatct aatattttct tttcatctct gagcatgaat 26401 tttgaatctt gataatcaaa tgttgatacc aattctgtct tcctcctttg gtaatcttaa 26461 gtaattttta attctaactt tggacaccct taaagatgct ttcaaattat ttttgaccac 26521 ctcatttcta gcaagtaaaa gtttatcttc caatagactc atagaataca gttgtaaaat 26581 atgtgaagca ataatacaac taaaaggata aatagacaaa tccacaattt tagttggaga 26641 cttcaacacc cttctctcaa cagttgatag aacaactgaa cagaaagtca gaaaagatac 26701 agaagtgatc tacagaagga gcaggattgg aaaaaaagtt ttttaaaaaa gaaaggatat 26761 ggaagtgacg attttacacg tgtgaagaac tgacatttta cctttggact tttagaaata 26821 tttgattttc tagacatcgc ctgaatgtta ttttgacttc caaaagatct cacggtccag 26881 gcacagtggc tcatgcctgt aatcccagca ctttgggagg gcgaggcaag tggatcacct 26941 gaggtcagaa gttcgagacc agcctggcca acatggtgaa acctcatntn tactaaaaat 27001 acaaaaacta cccgggcatg ggggcatgca cctgcagtcc cagctactca ggaggctgag 27061 gcagtagaat cgcttgaacc caggaggtgg aggttgcagt gagccaatat cgcgccactg 27121 cactccagcc tgggtaacag agcgcgactc caaccccccc cccaaaaaaa agatctcaaa 27181 aatttacttt attcttgttc ttagcaatgt tgttagtgcc ttagatttat tcatctattc 27241 agtcattcaa caaatattaa tgcctactat agtcaagcca ttgcattggg cattacaatt 27301 gattttatta tttttgaaat gtttaataag taaattctat agagaagaag ctatttaaca 27361 atagcgtata gtataaataa aataaactat ttttaaaagt tgtaggacta tattagtctg 27421 atcaagctgc cataacaaaa taccacagac tgggtagttt aaacaacaga aatgtatctt 27481 cttacagttc tgaaaactga gagtccaagt ccgaggtgtt agcagggttg atgtcttgtg 27541 aggctcctct ttggtttaaa gacagtcctt cttactgtat cctcacatag cccgtcctct 27601 gagtgcaagc agagagagag atctcttctt cttataaggc catcaatcct atcagattag 27661 ggccccacat ttatgatctc ttttaacttt aaaacccatc tctaaacata gtcacattga 27721 aggttagggc ttcaacacac aaatttggaa gnggacacaa tttagtgcat atcaaggact 27781 aaatcagtca gcaagcatcc tttacttctt tcagagtttc tcctctggag agaattatgg 
27841 aattatagtt ctcaccagca cccattgtta tatgtaagaa aaaaacaaaa tttgggagag 27901 aaaatggctt tcctgaagat cacagagtaa acgggaacag agacagtagc aaaaaccaag 27961 tcttcggtgc actctccttt tggtctggcc ttggctcaga gagttactgc tccacagcat 28021 acaataaagc tgcagctggc tttagtagat gtttgtacca tgcactccat ggactcttcc 28081 tattaatcct ctgctcaaat ctccctcctc acacttatca aaatatcaca tcagcgctcc 28141 tgaccaaaat ctctgctttc tcttgtgagt actgagttta tcagaaccac cttggggatt 28201 ttcttatcct tcatgataca ggcccagtat gatctgaata tagactctgt ataatcttct 28261 gccatcaggg ccaacagata aatgactcct ctccctatac tatctctgtc atttggtggg 28321 gatcattttt tgcattagag aaggttcgtc aaggtcgttg ctncttccag attgtgggcc 28381 ctattgaagg tatttcctct ccactataaa atgtatttgc atttcctccc accacctcag 28441 gctctctgtc ctcataactt gagggctaat ttgctacaaa aaaccttgaa aattaaccag 28501 ctaacctggt caacccacca ttctctctgg ctcaaccttt cccacccaaa aaaaacaaac 28561 cccttctcta agggccagga ctttattacc actcacagca aggtttttgt ttgtttgttt 28621 gttttgttgt tgttgttttt gttgtttttt gttttttaaa cagagtctca ctctgtcgcc 28681 caagctggag tgcaatggtg aggtcttggc tcactgcaac ctccacctcc caggttcaaa 28741 aaattcttct gcctcagctt cctgagtagc tgggactaca ggtgcatgcc accacaccca 28801 gctaattttt gtatttttag taaaaatggg gtttcactat gttggccagg cttgtctcga 28861 actcctgacc tcatgatcca ccccccttgg aaaaaaaagg tgctggaatt acaggcgtga 28921 gccaccgccc ctggccttac cacaggtatt ttaacatcag gcagatgtac aaagatattg 28981 tcccagtaca acaaaccagg cagaaaaatc ccctgctact gggatttctc cctctctctc 29041 ttctttcttt ccccttgctc tttcttcttt tttcttgatc actttctttg ccaccgtatg 29101 gcccaagatg ccagaaaatt agttgattga tcagttggtg ggttggttgg tttgttgact 29161 ggatggactt ttctgagtat gatgcttggt tcatcaatac aagaatcaca gaacctttgg 29221 attgggtttt taaaccctag ctgcacatta gaagaacttt tataaaatac cgatgtttgg 29281 gccctacctt cacagatttt gatttaaatt ggtcttggtt tcaaaggctt tccatgtgat 29341 tttcatgagt ggccaggatt gagaattttt ggatagagct aaaagattac gcaggcaacc 29401 attgatttga aaattaaatc ccttttactg ggaaatccag aaagaagact ctttctggat 29461 tttctttccg aaactaagat tttaggtagt attgttttcc 
taagaatttt cctgggtata 29521 aataatttgc tttgttacta tagaaggaag cagcaggtaa tgtttagtgg tagaccagaa 29581 attcccctaa aaaatccagc agtggttagc tgtggcaatt cctgaagtta gggaaatatc 29641 ctcccaggtc agtggttttc aaatgggaga aaatttcccc caagtgaagt ttggcaaatt 29701 ttggagaaat ttttggttgt tacaggttgg tggaatgtca atggaacctg gtggaggcag 29761 gccatggatg atgctaagca tcccacaatg cacagaacac cttcccacaa caaagaatta 29821 ttttgtgtaa actgtcaata gtgccaagat tgagaaacca attcctagat ccttgccctc 29881 aggtccccct aataaacccc taagagatac catccttaat gaaattagtc ctaaagcatt 29941 tccttttttt tggagcatga tttcagcaat gtttcccaaa ttaaaatttt aaatcctttt 30001 taagtatttt ttaaagtgac catcctgctt tggttttaag aaaagaaggc tgctgcctga 30061 ctgtgtgtgt cagagaggca gaataatccc aaggctgcct tttccaaatg gtggttcctt 30121 tcccatttgg ctgcctccct gaaaaacaga atgagatgag ccctatgacc tgatggaggc 30181 cctgccaagc ccaccttgac caggtgctgc agccagtgga gctagttttc agggtgcggg 30241 tcgctccatg aaaagtggca gctgggctta tcagctgcca gcctggataa gtcaggcctg 30301 ctgccaagtt gattaataag cgctgggctg acaattcaag aagggcctaa acctgctgat 30361 tttcctcccc ttttccttag atcatttcat tattaaatta aagattgttg tgatttaaat 30421 tgaactgtca ggcagcttag ggacaaacgt ggggtttggt gaagcagctg aacctggtac 30481 tgcatttacc aaatcagatt tcagctgggt gtcttctttc tcttaaaaaa acaatttaga 30541 antttaaggg cttgaagtag aagtagaacc ttcccctgca gtgtcaggag cccacacagg 30601 gaggntggca tgccaggaaa gggacctgat tataaatagc atgaaaagca gcatcagtca 30661 aaantcactg atgtgcctgg gcaaaatgtt aaaagaaaca aatagntttt aaaaatggaa 30721 gcaatggttt tatttttata atttaaagtg ntaaggaaca aaagcagttt atattcacat 30781 agttttaaac acacaaacat gtgtgtgcac gtgcacaaat aacacacact aacagagtcc 30841 ctttaaggaa atgatggata atcattttaa tataacaaaa tcacaagtag agcctcacag 30901 agggggtagc tgtcctttnt ttttctatgt aattgatgct atttatttag aaggtctatg 30961 gtttaccctt gtgaagtaat ttttctgtat gtgtattttt tttccagctt agatggaaac 31021 catttgagat tccaaaagca tctcagaaga aagtagactt tgtgagtgta agtcagtccc 31081 atccacaccc ttcctactgc tctcctctct tctgtggtga gtgagccaac catttgcagc 31141 ctgaatgcca agcaaaagca 
gccctatcag gaggcagcac acacaaaacc acctgcgtat 31201 gctttggggg ctggcggagg gatgtcttac ttacagaaac catatggccc ctgacaccag 31261 agggtattca tgaactattt atatctcaga aaggccaatg acaattgaac attttcttac 31321 ttgccttcct ttcttttggc tagaacgggg gcttttaacc tttttctcac tttgatgtct 31381 gatgtctgat gatgtctttg tttataaata tataaaataa attatataaa attacaaaga 31441 agctaatgat attgaaatat aacgataaaa atactattaa aaacaaattt gatttggttt 31501 atagtcattc atgtgctttc ttgttaatgt gctaaatcat aatatctatg ggaaaacatc 31561 taataattct ggtatgagca taaatatact ttgaggtatc tataacaact ataatgtgat 31621 atccatgatt tttatcattg ataaagttac aggcacctct aatactgctg tggttttttg 31681 cctacattca tattagttga aagaaatgat aaatttcact cagagattag caaaagtaaa 31741 tataaaatat ttcccatcca agttcataaa tctccctgaa gtagatgctt gggctagaat 31801 ggcgtcaatt caacatggca actctagtta gtacaatgct gattggaagc tctgtttggt 31861 agttataggt cccaggagta ctggaataac tgcacagagc aaggagtatc tgggggagag 31921 gagaaaggag ctctaaggaa gtggggctca aagagaccag tggagggttc tgcagaccag 31981 cctcagctca tggcttacgt agggacactc caagtgccga gccataggat aaattgttgg 32041 aaacatcttg aagttctcat ctttattaag tgctccaaac tctttgtaga ctcacttaaa 32101 tataccctaa acttacaaat tgagttaaaa tacaaaatga tatccccatc tcctttcttg 32161 tgtgaattgt agatatctgg aggcgcagag ctctctcctt ttggctctcc atctgcaggt 32221 ttggggctga gggctatttt gagaagaccc atgtgttaag gtttcttgca ctatggatca 32281 tctggtcaca cagcaganag aattacagtc catctgaata cagtcctctg ccctgttcca 32341 ganagattag gctgacccta tccataagca gagggctgcc catagtggag tgtacttgag 32401 aggcattaac tggtccctga tgctgcaacc ttcctcccca agcctcctag atgcctgatg 32461 gtacctccaa cagaatctct tcttccagtt gatgtcaagc atcaaattga taaaatacca 32521 agtaatgaac tgatactctg taagttcttc tgatatagcc actaagacaa cccagattta 32581 ggccatggct ggagactaag gtaaaccaca aactaccaaa catgagtcag taaattcagg 32641 ctccttagag gctgcctgga ttccaaacgt cccaccggtc ccaaagggaa gacctagtag 32701 aaatgtgcct catgaacctc agggaacagt ggaaaggctg cctcttgggc accgttcaca 32761 ttaacctttc cctcttctgt gggtctctcc cagggcctgc ataccttgtg tggagctgga 32821 
gacataaagt ctaacaatgg gcttgctatc cacattttcc tctgcaatac ctccatggag 32881 aacaggtaag tggctcagtt ttagggaagg acatgcccat cttctaaggg attttggcca 32941 gaagtcaaac tccaggtttc actcttgtga agtcacatat gcacatacat agttgttttt 33001 gttattggtt ttctcctcaa ggcaatattc atatatattt attgacaatg ataatgaatg 33061 agacaaaagg catttatctt gactgaacat cagtgatata aacaagcaca tcacataaca 33121 atgtnaacag aaaagggagg atggagccct ggaatgcagg agtagatcna aaggcatttt 33181 gactttaagt ctgagtccaa atatgctttt gttgttgttt gatatttcaa ggatttttct 33241 gagccaaata tgagtgacca atggcctgcg acacagccat caggagatcc tgagaacatg 33301 tgcccaaggt gattgggcta cagcttggtt ttatacattt tagggaggca taagacatca 33361 gtcaacacat ataaggtgta cattagatcc aaactgtggt ttttatctca gtagaaaagt 33421 aacagcagat ttaaagcagg cagaaaagaa aacagagaaa tagagaactt agaaactctg 33481 tagttgcagg ttgacctttg ggctctgaat gatacaattt tcccattggt ttaaaatgtg 33541 cacaacagac tgtaatatgt aaccagctgg agtactagaa actctggcat acccttgaac 33601 ttttccattt tacacaaaca cttgcaagta gaggcacctt tctccttgtc tttcctcatt 33661 cttagattat ttgtttccca cgtttttttt cttaaaagga ggaactgagc tgtgacctag 33721 gggttttgtg ggtggtggat tggtgtactg aatgtaggca ggactccaca gtgtttcacc 33781 accgagtcgt ttccaccctc ttacctgtct cagtttctct ctccagagat ctagcacctc 33841 tgagaggcct caaaatgcca agtgatcagc tcttatatgt atttccggga caaaactatt 33901 tttggggggg ttccctgtag ggccactgca catcacaggg gatgaatccc tcagacactg 33961 caactcagcc cctagtcacc cagggtgcct ttcagttggg aagaacaaaa tgccctttct 34021 cttcagagct gaaggagctc agtctctcat ttatgcacaa aaatgacagt cacacgaatg 34081 cgcagcaaag ccaactggag ctaaaatttg gggaggaaaa accaatggga ggagaccaat 34141 ttagaataca cctccaaact tggaaaccca aacaggtacc caaaaaggga gtcattcttg 34201 ttgtctttag aaaaagacaa cggaggccgg gcacaagtgg atcacgcctg taatcccagt 34261 actttgagag gccgaggtgg gcggatcacg aggtcaggag ctcgagacca gcctggccaa 34321 catagtgaaa ccccatctct actaaaaaat acaaaaaatt agccgggcgt ggtggtgggc 34381 gcctgtaatc ccagctactc aggaggctga aaaaggagaa ttgcttgaac ccgggaggcg 34441 gaggttgcag tgaaccgaga tcatgccatt gcactccagc ccggacaacc 
gtgtgagact 34501 ccatctcaaa aaaaaaaaaa aaaaaagaaa aagaagaaaa agacaatgga gaaatccttt 34561 agaatggacc tgtgaactag aattaggaac ctaaacaaga gcttcctagg agggaaaaat 34621 caagaactgt caaccaaaca gggctcagga ggacttaaca gttccatcag agganaagcc 34681 caaagttgaa ggcactttca ataggtccct gctgatacct tagctctgag ttcaggcaac 34741 tccttcaggg ttctgagtct tctctgaggc ccaatgtgtc caggtgccaa attattgttg 34801 acaaaaatag tcaaactatt aatatgtaaa atatttgaat tgatgtattc tgagccaaat 34861 atgagtgacc agtgacccat gacccagccc tcaggagatc ctgagaacat gtgcctttga 34921 atatccttaa tttctaaggt tccccagggc tgttcttgag tcccaggtca cagtgtgtga 34981 gtttccatta ggaccaccca ttagtagcag gtaacccgac tttggttcag ctaacagctc 35041 tgtgtaagct taaaattttt aagtagagga gcttatatac tggctttgca aaacaaagta 35101 tgcaaatggc attcctgcct tattatacac tacctctgga actggaccat ttgtgtccaa 35161 agcccaactt tacatttcct ggctgagcga ccttgtgagc catgaatcag ggacaagatc 35221 acagccctat gacaaggctc tggaggcctc tctttgggtc aaatctcata tcaggccact 35281 ccctctctcc acttgagccg cattggcctt cattcacttc tttcaagctg ccgcgtgccc 35341 tctttaagca cttgcacttg ctgttccttc tgcctggaac attctttcct ttctccaaac 35401 cctcacctag ataacacctt cagatatcag gtcaaatgtg tccnttcttt gaagaacctt 35461 ttgctgattc ttcaaactaa ccaagtcgct attctttcat tccctcaaac ttgattttca 35521 cagggtcatt acagtctgca aagatattgt tctgtaatta tttgattgct ctctgactcc 35581 ccactagact ctaagttcca caagggcaga gcctgggtca ttttgctccc cattttatac 35641 ccagctctga gcttgcnaca taggcactaa acaatgattt gtgttatgac gtaatttgtc 35701 tagacaatgt taaaactgaa aatcacaaga attactgtat atgtttcaga tgcttttaca 35761 attcagatgg ggacttcttg attggtgagt tctgaagact tcagaccccc tcactgataa 35821 ttgaccccta accgcaagct gctttctaat tttatacaca ttattttcct tcattcttca 35881 atctgtctta tttcatttag tgcaatccag agactgaggt ctagaggcca caggtggtct 35941 ggaaaggttc tagtgtcctc aacgccccta ctctattcct ggctctctgt ctttgcccat 36001 ctcttctgga cctttctgat gtgggagcct gggcagctga aaagaaatga agactaatac 36061 taatgggatt aacttattac ggtaactgta tttgccttga aactctctca gtaaaagccc 36121 atttatcatt agtaactgca atttaaattc 
aaattttcat gagaccatct aaaatgtaaa 36181 gtctatcaga gattgttctg gtctgcctat tatacttgat ggttttttag tatcaatatt 36241 ttggaatcaa cattttcgtg tatatgtcac acaagcaaat gtcagccttt cactggaact 36301 atcagaaatg ggtgattttc catcctattt ctgctacccc tttttgacca tgtcacaaaa 36361 ttgtgcatat acaagttcct tgcctggtga ccaaaaaagc agagtcattg ttttattcac 36421 attcttctag ataggaaaat agctaatcta agttgtcaga atttatcttt ttccccaaga 36481 gactcaataa agctttttgc atttggaatg gaaaagcaaa tgcagcctta agcctttcct 36541 gttcatgaaa tggttatcac accaaaagaa ttgctcacca ctgccctgtc tttttttgtc 36601 agttccgcag aaagggaacc ttctcattta caccgagttt ggcaagatgc ttgtacagcc 36661 caatgagatc tgcgtcattc aggttggtga tgtgtcccct tctcttccct gcttctcact 36721 aatttgggag cagtcagatg tttcagctgc tgctatttcc aactctaaaa ttcgtctgtc 36781 atgtctggac aacgaggagg gaatctgagt tatgcagaag gttccatgtg ctgaagagta 36841 agcaagcttg ggtgcctgct ttcttttaat atagcacata attctaaagg aaagaccact 36901 tgtcattgca ctttcctaaa tatttacata tactttggat aaggccctgt tcccactaat 36961 ggcagtagag gcagtgaagg tggacttttt ttttttttag tgtggctaaa tcaggcttac 37021 cattatctct ggacccaatg agagggtaga ctagaattgg cttgggtcac atgtgtgatt 37081 aggaattcag tttttctttt taaactgaat tttaaaaatt tctttttaaa aaaattcagt 37141 ttttcatgtt tgctctggtc acctcccaag cagctcaaca aacaagagct ttacttttgc 37201 atgtgggatg ggctatggct ctgggcagct tctttataac tacagctgtg atttcagagg 37261 gcacaccagt gtctgctttc cattccagag aggaatgcgg ttcagcatag atgtctttga 37321 ggagaccagg ggctacatct tggaggtcta tggtgtccac tttgagttac ctgaccttgg 37381 accaattggt aaatcttcat tacaagcctc taagcctcgc ttgagatgta ggactcagat 37441 aattgcatga aagttaaaca tccttctccc ttcgctgtct cactcaaact tccacttttc 37501 tctctagaca aattcttctc tagaccaagt gtcttcccag aaaatatatg caaaacagaa 37561 agcaacccaa accaatccat tataaatctc tctagaatag tatacagaca ttggagataa 37621 agacggcatg taatagattt ctggttgcat agtgtgtaca ggaaagaata tagaagtttg 37681 ggttactgtg ggcttgggtc accagaaaca gctccacaga agagatgaga atttgactaa 37741 ttctttatga aaagttgagg agagtaaaga ggaaagggaa cagctttgca aaagcatata 37801 agtgagaata 
agctttgtga ttaggaggaa gatatttttg ttaagtcagt gagatttata 37861 atggattggt ttggattgct ttttgttttg catattgtat aaaatgggaa ggccataaaa 37921 atgccctggg ctgtattcct atggaaagga gggatggact gggtgacaca tagtctatga 37981 acctctgtgt acaccccacc aaatccacac acttagcagt caaaaaaaca catcactcag 38041 tgctatgaga gatctagaaa ccacacccca ggaggcagat taaaacattc caaagtgaac 38101 caaggtctcc acagccccca ttcttctggc aagctggagg aagtgaaaat gttaggaggg 38161 gttgttctca aactgtctct tggattgaaa gaaaaggctg ggaaaattgg cccttggttt 38221 actctttggt cacatctccc aattctccct tccttgtcaa cagcttttta gagcaaacag 38281 ggacaggcca gtgtgtgcct tctggtcagc tcctctgagg aatccaaata tgcagaggaa 38341 ggtgggaggc agcaagaggc agctggagcc gtggtgggcc gtctgtcatc ctgttcacct 38401 cccactcttc ccatcctgct ctggcttcct gctgctctat agtcagcata gcctccaggt 38461 tccgtggaac atgtaccctc tggaaaatat ctataatttg ctaaccagcc ggggacttat 38521 ttttttaagt cccaggagct ggaaatgata aaacactcct cagctcgctc ctctcagagt 38581 taaactgttt ttgcacttaa gtttttcatg tttggtcttt tgatttattt cctttctctg 38641 gaggaaacac tcactcctaa gcaaaggtaa gcaggcaggt cctaaatact tcttcctgag 38701 tggcagacag ggtttttcag gctggagaac cgttttcaga ctggaatctg agttctgtct 38761 agggagcaga ggcatgcctt aggcaacagc gtgtaaccct ttctatctga gatcaaggac 38821 tgtcaatgac ttctagaata ttcaagggtt ccttcaaaag aaaaagttct ggagttctat 38881 cagatcttcc cttttcctct tttcagttct ctttctctct tcccttcccc tcacctgtag 38941 tgataagata atgcatataa gattgttttg ttaaccatag tcactccaca accttagaaa 39001 gtaatgctgg tttttgtgaa tttttttctg accagattcc ttgggtcttc ctaaaaggta 39061 ccagactgga aatgcagtga atctttgagc aatgaaagca atttgtggag tagctcttga 39121 catgaaagaa agcctttctc ttcatgcaac catgggcatc tttcctatgt tttggaagtt 39181 tctaaaagac ttttgggtta ctgttttcta ggggccaatg gcttggccaa tcctcgtgat 39241 ttcttgatac ccattgcctg gtatgaggat cgccaagtac caggtggtta cacggtcatt 39301 aataaatacc agggcaagct gtttgctgcc aaacaggtaa agtaaaaggg ttggcaggca 39361 agggaggaaa tgaagagtga ttgagcactt actgattgcc aaggcgagtg ttacaaactt 39421 tacatatatc ctttcgttgt tatcatacca ctacggcact acaaataaag caaatagttc 
39481 tagtagtcaa ttatactttc atagtcatgg aaaaggcagt aatgataaat aatttaccaa 39541 tttcacagta agtggcagaa ccagaatttg aacccatatc tgtctgactc caaaggttgt 39601 gccctttcta ctcaccaagc aatgaaaata ttagcaatat gtaggtttct ataagtggta 39661 tggaaaacat aatatatggg cagcataata tgttctgaaa agtaaatgag ttaacagatc 39721 agccaagatt tatagtctcc aataccaccc tatttcaaat acaagcaatc ctacaaaaca 39781 agtagaattc agttaattag cttggcttgg tttgtaccac caggctggca attcaggaca 39841 caagcaaagg cacagatatt aataaataat aataacaaca atgtaatatg gactctataa 39901 cataatatat tactaaatgt actgaatata tattagaata aaataatcat ataataagaa 39961 gaacccaaag atctcttctg tttagcatta tttgtttgct tcattttgtc tatttgaatt 40021 attccttata ttttccaagg tatgttattc ttcttaggca gcatctagag ataatgccaa 40081 ccagccgagg ccaaccagtt tggcacttaa aattaacccc tgaagcagac atattgttat 40141 tctcttattc tctgggtcaa ggtctctcaa aattcggggg gtatttgcat cagaaatcct 40201 caaggaggct tgtttaaaat tcagatccta ggctgggcat ggtggctcat gcctgtaatc 40261 ccagcagttt gggaggccaa ggcaggcgga tcacctgagg tcaggaattc aacaccagcc 40321 tggccaacat ggcaaaaccc cgtctctacc caaaatacaa aaattagcca gatgtggtgg 40381 tgcacacctg taatcctagc tacttgggag gctgaggcag gagaatcgct tgaacccagg 40441 aggtggaggt tgcagcganc caagatcatg ccattgcact ccagcccggg cgacagagca 40501 agactccctc tcaaaaaaaa tgcagatcct tagatactac tctagccttg caaaaaatga 40561 taagctttgg agatgggaca tggaaaccta cattcatttt aagcatgttc cccagatcat 40621 tgtttacata caattgttaa agaaccacaa ttgtagaagc agccagtagc tcctccgaaa 40681 ggtttgcact ttccaagtgg accctgtgct ggttattttc tgtgtgagtc ccactgatcc 40741 attctccctc tttgccatga tctgtgccct ggaagttgac tgtatcaccc gggctcccta 40801 accactggtt tctgttgggt tcaacccaca gggaaactac aggcaggaag aaagacaaat 40861 tgggatattt cttcccctgc ttcctcgctg tttaagaatg tgtttctgac agtagctgca 40921 tcccccatat gacctcagtt cctgttgggt aaccctagtg gtgggctgga gctgattcat 40981 agtgacttgc aaaagctgat atgcacgctt ctgtccaact gtgtatttgt tgatgtcata 41041 ctggtagcta aaaaatcatc atggtgagac tgtatacacc ttagaaatca gtaaacacta 41101 tatgtcaatg ctttatgtat ttttttcaga gtgctagttg 
tgaaatattt accagcacac 41161 cactgggtaa cacccttctt tataattcta gctcttgcta aagtcctata acaatatttc 41221 cctcctcatt ccacatgccc aggaatagca gtgtcttccc gctattgcta ggctcaaata 41281 cctcaacatt tctagtcatt ccagttaacc ctaaccacgc ctctatgagt aacgccttca 41341 ttaaaatctt tagcactttt cggaccatag atacgtttgg acctcctttc ctatagcagg 41401 acactgattc tcttctaccc caggacaggg accagggaag tattatgtta gggattgctt 41461 ctgaaatctc aagtcttggg agttgggaac ttttaggagg ccttcttccc ttcttcctgg 41521 actcaccaac ccaggcactg tgtctattgc aagctcctct aggctgaaat tacctggaga 41581 agcatcacag caggtactct ctgttccagg aaacagaagg tagcccagat tgtttgagac 41641 ctgctgacaa tccacagatc ctcacagtta ggttaaaaat tgttgaaaag gcactacttg 41701 gagaatgtgc tgtgtttcct gctgcatgtg tgatcagaag gcagctgcag cttatgaacc 41761 agaaaaatag ggttagtggc taagtagagg gtgtgtggag caaccacaaa atgattcagg 41821 gttatacttc tcccaaagga cggtaaaatt aaatcgagaa caaaatcaaa tcaaaagaag 41881 agagaggtat actcagtaca tattgtggga tcacagctac aaaagcatta gatggtaaaa 41941 ttatcctttc ctgctttttg tgtttaggat gtctccccgt tcaatgttgt ggcctggcac 42001 gggaattata caccctacaa gtacaacctg aagaatttca tggttatcaa ctcagtggcc 42061 tttgaccatg cagtaagtct gaaccctgta ggggccttgg gatgaggcaa cgcttgggtg 42121 ggaggggtcc acagtagacc cctgacattt acagatctaa catttccaat gtcaccgttg 42181 ggaagcccta aatatagtca tttgtccttt ggtgagggag taaataacac aactagtagg 42241 gaagcaaagc cagatactca acattcccct gtgaaacata gcttcattct ctacagagca 42301 ttggtgatac tctgacttgt atttgccaaa ctaagtggat aaacacttag tttctttgtg 42361 gaaaatttat cccccaaaac aaccaagtgt taatgtcggt ggtagaatga ggagaaagga 42421 actttatgaa tttggagaag tccctgaccc cacaactctc aagagcctcc tctatgatct 42481 ctgacttctc accccagtgc cttcattcct ggctgtattt ggcagtgaca gagtgaacag 42541 cccacttcta aagttactca taactttcag tgcttgagtt taaaccaaag aaattctaga 42601 gcatgaactt ggtcatattt ttcactcagt tcgcaggcca caaatggtct ctggtctaca 42661 gtgcaggctc ccaagaccct acattcttaa taaagtggtg gttctgagac cccgggggga 42721 agttcatttt gcagctcact tcataagaga tgaacctaac ccacaagctg gaaggataat 42781 tagagcagaa gtcagagact 
cagaggcaaa aggattattg tgagccagtt cagtttatct 42841 ctggggggac aaacctacct ctccaaacag cagcaactct taataaaagc aaaggggaaa 42901 agaaaagtga gacatcagac cccaaacaaa cagcacgaaa aaggatcctg agacactgat 42961 aaagctttat ttgtgattca ctacctttgc tttaatagtt tcagatcatg tcccatgaca 43021 gttgaagaaa tgtcaattct cacagcaata agggttgggt ttatttttat ttgaatttac 43081 acagatgcga tcttctctgt atttgctgca atagttattt ttccaaaatt tagttctgat 43141 catgccacca ctgtctccat tagaaagcct ttgcaggttc ccatgggctt aggattaagt 43201 ctgcactctg aaccatggtg tttgatgctc ttcaccatct gtcccaccct ggaatcattt 43261 tccagtatca cctcttgccc ctttcccttc tgtcaaagca ttggggtttt agccttctct 43321 cctcttaaca aaatttttgt gcactttgat gtgttttcag tgattaaaan gtcatttccc 43381 ttccttccca nttccttagt cctccctccc tccctccctc cctttttccc ccatccccgg 43441 ccacgtcctt cctttctttt ttttttttta acttttattt taggttcaag ggtacatatg 43501 ctccttcctt ccttccttct ttccttcctt ccttcctttc tctctctctt tttctttctt 43561 tctgttcatt cttgaagacc tacctcaaat gtcacctcct ctttgaagcc ttccttgacc 43621 tttcaggtag aattagctgc ccaaccccca ccccacaatt aatgctaaag aagcagggaa 43681 tcgatcccca tgatctttga atctccagac tcttacatgg catctgaaag ggaacatgat 43741 tgttgatttc aattaaattg aatctcaacc ttctcactaa agaattctga agagttttaa 43801 aaagtataat ttcgtccctc aacctatcct atctctacac taccactatg aagactacgg 43861 atgtaaatgc tgtgttttag acctgaaagt caaacaaagg taggaaagca atgaaccact 43921 tctcagactt ttctgccaat ggatacaatc tggaatctcc gtattctccc aaagaggagc 43981 acatctacac caaaacttgg attttgcttt atttctattt ctactttgct ctgagcatcc 44041 cttctgctgc acatctcagc tctagtcaac ttgatctgag gttgagaagc aggcaccgtt 44101 ggttctcata cccacggggt aataagttgc atgtaatgtg ggctcagcag ggtgaaaaag 44161 cagttcagaa tccttttagc tggtgttaat tcattttgcc tcaagcacag caagaggaaa 44221 aagtcatatg agtgattatt gcaccttgat gacaagccca gacttctgtt gtttgatcag 44281 aaattaggct gtccgtttgc agcaacatgg atgcagctgg aagccataat cctaagcaaa 44341 ttaatgcagg aatagaaaac caacttctgc atcttttcac ttataagtgg gagctaaaca 44401 ttgagcaccc atggacataa acatgagaaa aatagacact ggagactact agaaggggga 44461 
gggtgaaaaa actacctatt ggatactatg ctcgctacct gggtgatggg atcctgtatt 44521 agtccatttt catactgctg tgaagaaata cccgagactg ggtactttat aaagaaaaag 44581 gggtttaaca tatccacagt tccacatggc tggggaggcc tcacaatcat ggtggaaggc 44641 gaaggaggag caaaagcatg tcttacatgg aggcaatcaa gagagcatgt gttggggaac 44701 tgccctttat aaaaccatca gatcttgtga gacttattca ctatcatgag aacagcatgg 44761 gaaaacctgc ccccatgatt cagttacctc ccaccgggtc cttctcacaa catgtggggg 44821 attatggaag ctacaattca ggatgagatt tgggtgggga cacaaccaac catatcagat 44881 cctaacccca aacttcagta tcacacaata tacccatgta acaaaactgc atatgtatcc 44941 ttctatctaa attaaaagtt gaataaaaat aaataaaaaa attagaatgt cacttgcaaa 45001 cacttcatgt atgcatgaaa tgtgtgtcat tgtccagtca cttcccttgc atcatgcgtt 45061 cagtctctcc ttgtgtgttc acaggaccca tccattttca cagtattgac tgctaagtct 45121 gtccgccctg gagtggccat tgctgatttt gtcatcttcc cacctcgatg gggggttgct 45181 gataagacct tcaggcctcc ttattaccat agtaagtctc tcttttacca agccacattt 45241 gcaagccaaa gagacaacaa agcagtgagc gctacacatt gggaataatg cctgggttat 45301 ttctggacac ccaccatggc atgaattcag gatcctgctg cacaaggcct agtgctgatc 45361 cccaagcttt aatctcttta agggatggta gttgtgttag tccattcttc gttgctataa 45421 aggaatacct gaggctactt aatttataat gaaaagaggt tcatttggct cgtggttatg 45481 caggctgtac aagcagcatg gtgctggcat ctgtttctaa tgaagcctca ggaaatacaa 45541 tcatggtggc aggcaagagg gaagctggca tatcacatga tgagagcaag caagagaaag 45601 gaagaaatcc ccagcttcct tttgaacagc caaaccccat gtgaactcat agagtgagaa 45661 actcactcat aaaccgtcat gatggcacca agccatttat gagggatctg cctccattac 45721 ccaacacctc aanccaggcc ccacctccaa cactggggat cacatttcaa catgagagtt 45781 ggagggaaca tacatccaaa ctacatcaac agtgatgcat tgattctgag cctagttccc 45841 agtttctctg tccctgtaag aggcagaaag gctggtaaag tttagaagcc ttgttagaaa 45901 ggcttctaaa caagtttcct cacattttag atgtttcctt ttggggaata ctggggaggt 45961 tgttagttag tctgttgttt tgggaaagtt catcacctaa gtctggttca gagaaccaga 46021 gaaataagaa gagagagaca gagagagaca gagagagaga cagagagaca gaaatgtgcc 46081 ttctccaatt attttggcag ggcaaagcag actgcagaac atttgttgcc 
tagtacaaga 46141 actaacaaat ctactatttt tattttatta atattattat tcctacccct tataattatt 46201 ggccaatttc tatggcctgt tccaagtttt ttaaaaactt actcatgaag tgtttataat 46261 cttgtaagag acaggtagta ttattgtccc ccccgctttt tttttttttt tttcagaaaa 46321 ttaaactgac acaagaggga ttaagtaact ctctcaagga ctcacagcaa gttaagtgta 46381 gagtcaggat ttaattgcag gctgtctgat accagagcaa acactcatga ccactaactg 46441 gtatggctag catagtgcct gatatatagc atgcttttaa taaatattta tttaataagt 46501 gaatgcatca acaagaataa ataaagtact tcacggctgt acttaatgag gttgacattt 46561 ttaattgtgc aatagctagc tgaaaatcca atcaaaggtc aatgtgcaat tctacttcct 46621 ggagtgtgca tttttttgtg catagagttc tggaaagact ctaagaggca ttccacctgg 46681 aacattctag aaaacaaatt cctttcttta gtaaactcca gggccttctg tttataatag 46741 gtccctctct atatatacca gttcccaggt gtcagggctc aacagaacac aacagtgcac 46801 tgaatggcca atttagctat tttatagcat caggaatgaa ggcccttagg atggacgtag 46861 gccactccat ggtcaaatgc cagcatcccc cttttcagat ggcaccatca tggtagagaa 46921 aagaaaacaa tcagttacca tggcaaagac ggcagctatt ttgtgaatag catcataagt 46981 cgtttctaaa aactgtgaag cctactaaag actctaactt ttctcctcct ggaagttagt 47041 gacctagagg acctgttgtc aaggagagaa aattgccctg gcttatagtc cgtacttcgc 47101 atgttactcc ttaataattc accatttcac tcttgttctt gtattgaggt tactctgtgt 47161 gcctgtagtg gttcttacat tttaccatgt gtcacagtca gctggaaggt ttattaaaac 47221 acagattact gggctccatc ctcagaattt ctgcttcagt aggtctggga taggacctaa 47281 gaatttgcat ttttaacaag ttcccacaag tgatgctgaa tctgcctttt tttttttttt 47341 gagacagggt ctcactctgt cacccaggct ggagtgcagg tggcacaatc tcagctcact 47401 gcaacctcca ccccccaggc tcaagtgatc ctccccgctc agtctcctga gtagcttgga 47461 ccacaggtgt gcaccaccac acctggcttt tttttttttt ttttgtattg ttagtacaga 47521 cggggtcttg ccatgttgcc caggctggtc ttgaactcct aaactcaagc aatccacctg 47581 ccttgacttc ccaaagtgct gggattacag gcatgagcca ccatggctgg ccagaatcta 47641 catttttcac tggcaagtca ggggattctg atgaaggaag tttacagact gcttgttgaa 47701 aaaatactgg cctaggtgtg tcccatccct aggtggaaga ataatagagg tccatctgat 47761 ttagacaagt tgctactgtg ctatgggtct 
caacccatcc ttaattttta tttgtagttc 47821 agactcgctg acacaaataa tcttgagaga tgcctcctat agttttttaa tgactcncaa 47881 agtaaatttg gccccagggt agatcttata aaggtatggg taccctgagt tcgttgggaa 47941 aaaaccacag aaccccaggt tagaaatgag ttttccagag atattaaaaa tattgccttg 48001 gcaatacnca aaatctttgc tgctttgatg tgactttatt ataagagaaa atggagaatt 48061 atttctatag aaatattgcc tccagtcatc ttaatgtgcc ttagtcaact agattcagct 48121 tttttttttt tttttttttt ttggagacgg aatctcatct gtctcccagg ctggagtgca 48181 gtggcgccat cttggctcan tgcaagtttg cttcccgggt tcatgtgttt tcctgcttca 48241 gcttcctgag tagntgggat acaggcgccc gctaccacgc ccggctaatt ttttgtattt 48301 ttagtagaga cgaggtttca ccatgttggc caggatggtc tcgatctctt gacctcgtga 48361 tccacccgcc tcggcctccc aaagtgctgg gattacaggc gtgagccact gagcccggcc 48421 tagattcagc tttaatcaga aaggcaaggt ctatcctccc actgtccttt gaaatataga 48481 aaatgatata gaaatgtcat caccaagctc actgctatct aaattcatat tcagatcaaa 48541 gctgaattct gttttgctct cttcttaaaa ttaagccatg aaggtgaccc tgaggacaca 48601 tactagagca aggagcttga caccaaattt aagtcatgag ctgaactgtt tttaaaacat 48661 agacaattta tgcttacatt tgcttctttt tctgtatttc cacttcatag agaaacaagt 48721 agaactatga tctgggtctc acctcttcag caggagctca cacactctca ctcactttgt 48781 aggggcctgc agtagctttc tgtttctgag cagggtgggt gtggctgctt agtcaattat 48841 aaaagcattt ggcacttggt tatacttcac atgtaagcaa gaagggcagg catgtatcat 48901 gtccagtgtt tgattttgta agggttgcca ctgccacatc tctctattct gtccttggtt 48961 tgccctctca gtactctcac aggacagact atgagtagct ttaaaggcta tgttgttgtt 49021 gttgttgtta attattcttt tttaagtagc tttaaaagct atttcttatt catctctgtc 49081 tcatgttagc cagcacatat ctaattatat gtgctggcta actggatggg cagatgggtg 49141 aagtgggaaa aagacagatg aatgaatggg taagtagatg tgtggtttga gatgcaaagg 49201 cacttaaatt tgcccaggtg ttatatttct ttaattcctg ctctattgta ttgctactct 49261 aataaagagc tttatttcaa tagaagcata agattggaag agtggaagca tggtgatttt 49321 tgttaaattt tgtgtgactc tagaaagcac aactaggacc aatgaattta actcatataa 49381 aagcaaattt ccatttaata taagaacttt cttacacaca gagctcttca ataatatgat 49441 atgctttgta 
gagtggagag cttccattca catccaggca gagcccggga ggtaatcagt 49501 caggaaagat atagaaattt cttctacatt gagtagaagt aaggatacct tccaatttca 49561 agatttagga ctctatgaat agggtatcaa tggaactaga aaaaataaaa acaaatccaa 49621 aagacattct gaagaacaga aaaacaagag gataacagat tcatctatag aagataatat 49681 ttgttataat aaatagcatt tatggtctgt tttatacaca taccaagatt ctttcacaca 49741 gattatcatt tgataatgga ctttaagtct ggggttcttt tcattacatt atcatctatc 49801 atccatcaaa accacacacc ctcaatctaa gccttctaga ttgggaatta gaaatatgct 49861 gttgctgtgt cctacagtgg agctacttgg gaaagggtta tgatttggga gataaagata 49921 acttttgata acagggttcg taaagcttta ggatgataga gaacatttcg gttttctatt 49981 atgactagca ttttaattat tattattttt tttggcccct gatttttttt agatttagcc 50041 tgattcctgt cagagagtgg aactcgagac ctctgataat ccacatggcc atggagtaga 50101 gctgtgcctg ccaagaatgc caatatgaat gttttatgat tgttcttctg tcaatccata 50161 ttctttatta atcctataca atttactagt ttctcctgtg ttcttggaat tctgcagaaa 50221 taatgtttaa ggggctttgt gttttactgg tcttgccttg gataataaaa ataatactgc 50281 ttcaaccttc ccagggaact gcatgagtga gttcatggga ctcatccgag gtcactatga 50341 ggcaaagcaa ggtgggttcc tgccaggggg agggagtcta cacagcacaa tgacccccca 50401 tggacctgat gctgactgct ttgagaaggc cagcaaggtc aagctggcac ctgagaggat 50461 tgccgatggc accatggtaa gtatgttaac tgccacattc cagcggcctc tgaactcccc 50521 acccttactg agaagggttg ccgtttttca tttctccaaa gtcacagagg aagaatcaaa 50581 agaaggaaag ttaatgactg cataagtatg tttacatgta tatatactca catccttgtg 50641 tatatactag acttctgttg tggtggtgta ggaaagatca ttgggacttg ggagtcagac 50701 ctggttctgg agtttaagtc ccatctctta gttcctacag gatcttgaac aactcaactt 50761 agcttctctg tgcttcattt tccccatcta taaaatgggt atagtatcca acccagcaga 50821 tggctatgag ttaacctaac agactgtcca acactattca aatgtaaggg aacattatta 50881 atactatgtg aagcctcttc ctgtatcctt atgcagccct gggcatatgc tatgtctcag 50941 agtcttctgc gttttgaagt aatcctccag ggccacagta gtagcggagg caggatctca 51001 ccttcagcac gtgtttcaag ccaaggaggg tcaaaaagcc cttgcctcac tatagtcagg 51061 gcttgtgaga agcaattctc agctgttttt tgcattttcc tttgctctgc tttggcatgg 
51121 atgcaactga tgggatttaa aacctgtttt gtattcttcc tgtgtcaggc tgtaaacatt 51181 gaggctgaac acaattgcag ctggctgcgg tgcttgattt tattttattt ttgcattctt 51241 cgaggtaaag tttatggtcc acagggagta attttatgct aaatcacatg caaatttgct 51301 ttacttttga aaataaatgc cttcaatgct acctggttta aaatctatag caaaaacctg 51361 cctgcttcat gcttgcaaac tacctaaaca gtgagcatta tctgtccgtc tgtccactgc 51421 agtttgcttt cagtcccaac cctggagcca gagttggtgg actcctttgc aaaaggaggt 51481 gttcagagtc ccatttgctt gctttggctc tatttcccct ttcaagggcc tctatggagt 51541 ttctgtaaaa cccctacatc tcaaagagat gcagtctggc agtctcagtt aaactattac 51601 ctctgaaatc tcctggaagt gcaatcctgg aagctattct agagttgcca ctcaataacc 51661 tttgttcccc caatttggct ttgggtagtc aatgttatga gggagctgtg ttttacaaca 51721 gtgaggcgta ggacagggct gtttctatga ggagcaatac tttgaattac tagtgagaaa 51781 gagctctcct attaggatgg gagaccctgg gtatcaagga gccagaatac ctgatggcgg 51841 agggtccaac atcttaccta gacctaactt tcagtcgtcc tcaccatttc ccattatata 51901 attcagaccc aggtctatct acatgcctta tccttattct tgtttggagt gattaacttg 51961 gtacaatagc atcttcaaag tcagctggga tggggctttt gttaatccag ccaaatggag 52021 aggcttttaa gaagcagtca gcaaagagac aatagcaatg gccataatga tccttgtatg 52081 tgaatagtgc tttttaaaaa tctttattac ttattattta ttatttttat ggatacataa 52141 taagtatgta tatttatggg gtacatgaga tatgatgata caggcaatat gatacgtaat 52201 aatcacatca agataaatgg ggtatccatt acctcaagca tttatccttt gtgttacaaa 52261 caatccaatt atattctttc agttatttta aaatgtgcaa ttaaattttt attgactatg 52321 gtcaccctgt tatgctatca aatgctaggt cttattcatt ttttttcttt tatttcttta 52381 tttattttta acttttattt gaggttcagg ggtacatgtg aagatttgtt atataggtaa 52441 actgatgtca caggggttta ttgtacagat tatttcatca ctcgggtact aaacagtacc 52501 caatagttat tttttgtgat tctctccctc ctcccaccct ctattctcaa gtaggcatca 52561 gtgtctgttg ctcccctctt tgtgtccaca agttctcatc atttagctcc catttagaag 52621 tgagaacata caatatttgg ttttctgttc ctcttagttt gccaaggata atggcctcca 52681 gctccgtcgt gttcctgaga aacacatgat cttgttcttt tttatggctg catagtattc 52741 cattatctgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 
gtgtatgtgt gtgtgtatgt 52801 gtatatatat atatatatat atatataaag ctcttaagtt taattagatc catttgtcaa 52861 tttttctttt gttgccaatt gcttttgatg tttttgtcat gaaatctttg cccatgctga 52921 tgtcctgaat ggtattgcct aggtcatctt ccaaagtttt tatagtttgg ggcttaacat 52981 ttaagtcctt aatccatcat gagttggttt ttctatatgg tgtatggaag cggtccagtt 53041 taaatcttct gcttatgact agccagttgt cccagcatga tttattgaat agatagtcct 53101 ttccccattg cttgtttttg tcagcttcat tgaaaatcgg atggttatag gtgtgcggcc 53161 ttgtttctgg gctctctatt ctgttccatt ggtctatgtg cctgtttctg tatcattacc 53221 acactgtttt ggttactgta gccttgtagt atactttgaa gttgggtagc atgatgcctc 53281 ctgctttgtt ctttttgctt aggatagcct tgggtatttg ggctcttttt tggttccata 53341 tgaattttaa aatagtcttt cctagttctg tgaagaatgt cgttggtatt ttgatatgaa 53401 cagcgttgaa tctttacatt gctttggata gtatcgtcat cttaatgatg ttgattcttc 53461 ctacccatga gcatggaaag tttttccatt tgtttgtgtc ttctctgatt tctttgagca 53521 gtgttttgta attctcatta tagagacctt tcacctctct ggttaggggt attcctaggt 53581 attttattct ttttgtggca gttgtgaatg ggattgcatt cctgatttgg ctttcagctt 53641 gactgttgtt ggtgtatagg aatgctagtg atttttgtac attgattttg tatcatgaac 53701 tttgctgaaa gttgtttatt ggttgaagga gcttttgggc tgagactatg gtgttttcta 53761 gatatagaat caggtcatct gcaaacaggg ataggttgac ttcttctctt ctggatgcac 53821 tttatttctt tctattgcct gattactctt tgccacgact tctaagacta tgttgaatag 53881 gagtggtgag agagagcatc cttgtcttat gctggttttc aaagggaatg cttccagctt 53941 ttccccattc agtatgatgt tggctgtggg tttgtcatag gtggctctta ttattctgtg 54001 atgtgttcct tcaatgccta gtttattgga gagtttttaa cttggaagat atactgaatt 54061 ttattgaaag ccttttctgg catcttggaa tagcactggt ataacttgca tagcacttcc 54121 atatgatctc atccatcaag cctgtcacag gaaatcaggc agcgatactc atatccattg 54181 ttttagacaa aaataattta atgtatatta gagcccaaac attaatgtgg gtgctttggg 54241 ttgtagcaca agataattct attattttcc ttttgcctgt ctgtattata caaatttcca 54301 aatgtgagca tataagaatt gtagggtttg gtttttgggt tttgttttgt ttttaaagcc 54361 acagtggata agaaaagtca gcaattgatg aagctgtaat gtctaagatg cacagagtat 54421 ttacatattc tttctgttgt 
ggacaaaaac ctcctgacag tgcaacatcc accattcaga 54481 cacttctgag atatgacttt taaaagtgct acagccattc ctgcaatttc cttcacctcc 54541 ctctcctcac ccactccctt tttccctttt cccagcccat ctcttggtca gagtcctgaa 54601 aagacaggct caaatcccag ttctgcgatg acttgtcgcc ttaagcaagt cgtttcccag 54661 ctctcagcct taattcctcc tctgcaaaat gggaagtatg actttcctca caaggacata 54721 gccacagtaa gtgaagccct cagaagatgg atattcccag gattctccca tgtctgccct 54781 tggatctagc agacttagtt ccgtaatacc atgaagataa tttgaggtta agggtgaatt 54841 ttattttaca gaatgaaant tgaggccctg agaagttaag gtcttgtacc aggtctcacc 54901 agcttacaaa tggtaggacc agagccacaa ctcagggctt gagcctgctc tcctcaccac 54961 tctagcacac aaatctttga tttgttattg gaaaatttca cctacccata ccttctgttg 55021 acatctaatc aaatgtgttt attatctatc tcacctcttt tttctcatcc ccatcgtcac 55081 ttcctccagg catttatgtt tgaatcatct ttaagtctgg cggtcacaaa gtggggactc 55141 aaggcctcca ggtgtttgga tgagaactac cacaagtgct gggagccact caagagccac 55201 ttcactccca actccaggaa cccagcagaa cctaattgag actggaacat tgctaccata 55261 attaagagta gatttgtgaa gattcttctt cagaatctca tgctttctgg tagtattgga 55321 ggagggggtt ggttaaaatg aaaattcact tttcatagtc aagtaactca gaacttttat 55381 ggaaacgcat ttgcaaagtt ctatggctgt caccttaatt actcaataaa cttgctggtg 55441 ttctgtggac gtagtgattg gtcataanca agtcttgatg agacaaacct ggcaggtcag 55501 gtcagtcttg ctaagattga accaaggccc agggcccttt gccttgtggt tggcctgaat 55561 gccttcattc taggagcaca agtccaggca tatactaggt atcacaaaac caggccataa 55621 ggccaaacgt taanaccatg aactgaaact gggaggtntg gcaaacctct taggatccaa 55681 tccatacatg gggaaattta ggggatcatg aatgccccaa atatccacct ttcaatcgat 55741 naagaaccgg gccaataggc tgtctaatcc tggtagtcgc ttnccaccaa tatga // LOCUS AF001689 4111 bp DNA PRI 03-JAN-1998 DEFINITION Homo sapiens ribosomal protein L23A (RPL23A) gene, complete cds. ACCESSION AF001689 NID g2739451 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 4111) AUTHORS Fan,W., Christensen,M., Eichler,E., Zhang,X. and Lennon,G. TITLE Cloning, sequencing, gene organization, and localization of the human ribosomal protein RPL23A gene JOURNAL Genomics 46 (2), 234-239 (1997) MEDLINE 98086480 REFERENCE 2 (bases 1 to 4111) AUTHORS Fan,W. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) Human Genome Center, LLNL, 7000 East Avenue, Livermore, CA 94551, USA FEATURES Location/Qualifiers source 1..4111 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q11" gene 1..4111 /gene="RPL23A" mRNA join(674..719,1393..1576,2437..2613,3288..3357, 3582..>3596) /gene="RPL23A" /product="ribosomal protein 23A" CDS join(695..719,1393..1576,2437..2613,3288..3357,3582..3596) /gene="RPL23A" /codon_start=1 /product="ribosomal protein L23A" /db_xref="PID:g2739452" /translation="MAPKAKKEAPAPPKAEAKAKALKAKKAVLKGVHSHKKKKIRTSP TFRRPKTLRLRRQPKYPRKSAPRRNKLDHYAIIKFPLTTESAMKKIEDNNTLVFIVDV KANKHQIKQAVKKLYDIDVAKVNTLIRPDGEKKAYVRLAPDYDALDVANKIGII" BASE COUNT 1044 a 1010 c 1016 g 1041 t ORIGIN 1 cgcgccggcc cctgaattga tttttttttt gccaccaaat cactgtgtta ttgataaacc 61 acattccttc tttggacctc agttcgaccg agcaaatcct gctccagcac tttagaactt 121 agctgtatct aactcccgag tcaatccaaa tgtgttcttt cctcttggca gcgttcctgc 181 ctcctggcat ggcaatcctc ttccctgaga ctggcagttg ctcaagatag gaagctcccg 241 ggaccagacc cgagcgccag ccggatacgc cgtccgctgg cctgagcaaa taaacgcgtg 301 ttttcaaaaa actacaatcc ccatggtggc acgcgcctgt agtcccagct actctggtgg 361 ctgaggcagg agaatggcgt aaacccggga ggcggagctt acagtgagcc gagatcgcgc 421 cactgcactc cagcctgggc gacagagcgc tcaaaaaaaa aacaaaaaac aaaaaacaaa 481 aaactacaac ccccatgaag ctttagtgcc tttgagggga ggagtggcgc atgtttgttc 541 acgcacgaaa aagaaattaa acctaactac cgttcccaga gggcgccgct ctgcaaatta 601 cccaatcagc tctaagtaca aagcatcgcg agtctttagt gctctttggc gctataagcc 661 cgtgggaacg agcattggag acccttttca caagatggcg ccgaaagcga agaaggaagg 721 tgtgtgttgg tgatgggccg cagctggttt accggggatt ccgcgccgca gagcgaacga 781 attgggaaca cggctgctgg 
gctaagccct gagggtttgc tccggggctg ctccctgcgt 841 ttcggacacg ctgcagtata cgtgggccgc gtgggcccag cctcgtgggc tgagttccgg 901 tagagggagt tggggggggg caacgcggca ggcatcatcc gccagggagg gccagacatt 961 cggttctggg aagctacatg catccactgg ttggagctcc atgtccccgg gcctgtaagg 1021 aattagtgcc ttcagcttta accattttcc ttctggttca tgttcaaccg ggatacatta 1081 cccgcccctc tctccgcagc gtggttttgc gatggggtat gtgtaaaatt cgtaggacat 1141 tttctggaaa gtatcaagcg ttcattcagt gctactttgt ttcatagtcg tatccctgga 1201 gttacttaga gttggtcgct ttcgcctctg gtatcgtgca tatgatggaa aagttttaat 1261 ctcctgacac ttgtgatgtc ttcaaaggaa ccactgatgc acctgtggcc agggtggccc 1321 actgcagttc ttggggccgg aagtgaccga tttctaaatc ccgcacccac gttttctttc 1381 cttttctccc agctcctgcc cctcctaaag ctgaagccaa agcgaaggct ttaaaggcca 1441 agaaggcagt gttgaaaggt gtccacagcc acaaaaagaa gaagatccgc acgtcaccca 1501 ccttccggcg gccgaagaca ctgcgactcc ggagacagcc caaatatcct cggaagagcg 1561 ctcccaggag aaacaagtca gtactgcccc ctgtacccat gaaaagattt gggtattctc 1621 cattggtaat ttggaaattc actcactctg cgtgatggtt tctcaaacgc aaattgtgtc 1681 cagtgtgctt ctctaattgg aagtatgagg agattgtttc tgctgcattt acaaaactgg 1741 caggatcagc ccagaggccg ggcgcggtgg ctcacgtctg taatcccagc actttgggag 1801 gccgaggggg ggcggatcac ttgaggtcag gggttcgaga ccagcctgag tgacatggaa 1861 gaaaccccgc gtctattaaa aatagaaaat tcgccgggca tggtggcgca tgtctgtaat 1921 cccagctact cgcgggaggc tgaggcagga aaatcgcttg aacccgggag gtggaggttg 1981 cggtgagcgg aaatcgtacc attgcattcc agcctgggca ataagaatga aactctgtct 2041 ccaccaccac gaccagttaa tttttgtatt tttagtgaga gggagtttca ccatgttggc 2101 caggatggtc acgatttctt gacttggtga tcagcctgcc tcagcctccc aaagtgctgg 2161 gaatacagcc atgagccagc acacctggcc cagagatgat tctttaatca cttgaggttg 2221 taagagccct tcaagagaaa gaaaatgcac aacgttctgc ccctgggatt ggggcttcag 2281 gcccttagat ttcagggtgc agatgatgac actgtaaagc gaccaaagtc tgaacaaagt 2341 gattggtacc tcgttgtctg atgcacctag gctctcctgg ctctgggctc caaaagaatg 2401 ggcccaggcc aggtgacccc attttgcctc tcccaggctt gaccactatg ctatcatcaa 2461 gtttccgctg accactgagt ctgccatgaa 
gaagatagaa gacaacaaca cacttgtgtt 2521 cattgtggat gttaaagcca acaagcacca gattaaacag gctgtgaaga agctgtatga 2581 cattgatgtg gccaaggtca acaccctgat tcggtgtaga acaggctgca agggatgggg 2641 agcaggctgg accagcagtc tggagccaaa aaaacctgca ttccatgaag ctttttgatg 2701 tttaggtagt cctggtaatg caggactaca cgtgtagtcc gagctatgca ggaggatctc 2761 ttgaggccag gaatcaccca tgattgcccc attgtactcc agcctgagca accaagcaag 2821 gcccttttta aaaaaaaaaa aaaaaaaaat cccctggacc tttcaccagg ggaaaaaggt 2881 cccttaactt aaacttggtt taggtttacc atttttttta acccttgagg caggaattac 2941 aggctgcttt atttacaaac ctttttttta acctaaggct ttctaggact taattggggg 3001 tgggtcccca tttccggggt ccccactcct attgggaaaa ggcaactaaa ttgcaaccct 3061 tggcagatta ccttggttta agttttaatg tggcctgtgg tgtttcccat aagagaattg 3121 gctttgtggc ttcatggtgt cctgtggggt aatgatggaa aaatcattat tggaaaagaa 3181 tgacatgaac aaaggaacca ctgaagtgcc ggaggactgg aggaggaagg gggagggtgt 3241 gggggcagtg agggtggcag ggactaaggc ttcctttttt accctaggcc tgatggagag 3301 aagaaggcat atgttcgact ggctcctgat tacgatgctt tggatgttgc caacaaagta 3361 agtttccttc ctacaaaccc cttaatgctc accccttggg tgcaaatgat gcatatgtta 3421 gcgaccaaag cctgatcttt gctgattagt cataattaac tgactgcacc cctatccttg 3481 acaaacctta tcctcacatt cctcattttg ctttctaaaa ataacattcc agcttatgtt 3541 catatttcaa ctccagtaac gaggctccct ttttttttca gattgggatc atctaaactg 3601 agtccagctg cctaattctg aatatatata tatatatatc ttttcaccat atacatgcct 3661 gtctgtcaat ttctggttgg gctgggaggc cacacacaca cactgacatg acagggcttg 3721 ggcaaaactc ctgttctact tatccttttg aaatacctca ccctgccact ccaccatgtt 3781 atgatcattc caaaaatctt tgtgactaaa gttagtgtcc taggaaaaac cagaactccg 3841 aacttgcctc catggttgaa gtaacaagct gttcaaaaaa ccccttttat ccctgggaaa 3901 aagctgtgtt ataaaaacca atgaccccag gggtttgaaa gggtgttaac atccatttca 3961 ggggaatttt ggattggcat gggctctctg ggtagcaatt attgtcctca caccacccgt 4021 cttactattt ccaagcgggg tctgttttgc ttccctcgcc ccttgcccca attaaaggga 4081 acaaggaact tccgaaagaa atactttcct t // LOCUS AF005058 5161 bp DNA PRI 01-JAN-1998 DEFINITION Homo sapiens chemokine 
receptor (CXCR-4) gene, complete cds. ACCESSION AF005058 NID g2735718 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5161) AUTHORS Wegner,S.A., Ehrenberg,P.K., Chang,G., Dayhoff,D.E. and Michael,N.L. TITLE Genomic organization and characterization of the promoter for the HIV-1 entry co-receptor CXCR-4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 5161) AUTHORS Wegner,S.A., Ehrenberg,P.K., Chang,G., Dayhoff,D.E. and Michael,N.L. TITLE Direct Submission JOURNAL Submitted (21-MAY-1997) Division of Retrovirology, Walter Reed Army Institute of Research, 13 Taft Court, Suite 200, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..5161 /organism="Homo sapiens" /db_xref="taxon:9606" gene 2..5161 /gene="CXCR-4" TATA_signal 919..923 /gene="CXCR-4" exon 950..1052 /gene="CXCR-4" /number=1 5'UTR 950..1037 /gene="CXCR-4" mRNA join(950..1052,3185..4760) /gene="CXCR-4" /product="chemokine receptor" CDS join(1038..1052,3185..4228) /gene="CXCR-4" /note="seven-transmembrane G-coupled receptor for the CXC chemokine SDF-1; entry co-receptor for HIV-1 strains with the syncytium-inducing phenotype" /codon_start=1 /product="chemokine receptor" /db_xref="PID:g2735719" /translation="MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTI YSIIFLTGIVGNGLVILVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWAVDAVANW YFGNFLCKAVHVIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKVVYVGV WIPALLLTIPDFIFANVSEADDRYICDRFYPNDLWVVVFQFQHIMVGLILPGIVILSC YCIIISKLSHSKGHQKRKALKTTVILILAFFACWLPYYIGISIDSFILLEIIKQGCEF ENTVHKWISITEALAFFHCCLNPILYAFLGAKFKTSAQHALTSVSRGSSLKILSKGKR GGHSSVSTESESSSFHSS" intron 1053..3184 /gene="CXCR-4" /number=1 exon 3185..4760 /gene="CXCR-4" /number=2 3'UTR 4229..4760 /gene="CXCR-4" polyA_signal 4726..4731 /gene="CXCR-4" BASE COUNT 1160 a 1338 c 1287 g 1376 t ORIGIN 1 tttcatctct ccgggcttat ttgctggttt ctccgaatgc gggccttgtc tggttcacgc 61 tggatcccca acgcctagaa cagtgcgtgg cacgcagttc gtccttctat 
aaatatcgga 121 ctaaatgcat ctctgtgatg gtaataccca cacggtgttg tgagaatgaa tgagtgattc 181 tgtgcaagtt cctagtgatc tgttacaaaa agtactggtc gctaaattac tcttataata 241 aagcatactt ttaggataat aaagcactat tcgcgaattg gttaccgcta ttatgaaatt 301 actgagcaat acatatctac atctgatcag tctccagaat tatgccaaat cctaccttct 361 tctgaaagta tctcctaatt atctgcacct gaccctagtg atgctgtgaa tgtgcaagta 421 tagctacatc ctccgaagga aggatcttta ctccttttac ctcctgaatg ggctgcgtct 481 gctgaaagcg cgggggaatg ggcggttgga agcttggccc tacttccagc attgccgcct 541 actggttggg ttactccagc aagtcactcc ccttccctgg gcctcagtgt ctctactgta 601 gcattcccag gtctggaatt ccatccactt tagcaaggat ggacgcgcca cagagagacg 661 cgttcctagc ccgcgcttcc cacctgtctt caggcgcatc ccgcttccct caaacttagg 721 aaatgcctct gggaggtcct gtccggctcc ggactcacta ccgaccaccc gcaaacagca 781 gggtcccctg ggcttcccaa gccgcgcacc tctccgcccc gcccctgcgc cctccttcct 841 cgcgtctgcc cctctccccc accccgcctt ctccctcccc gccccagcgg cgcatgcgcc 901 gcgctcggag cgtgttttta taaaagtccg gccgcggcca gaaacttcag tttgttggct 961 gcggcagcag gtagcaaagt gacgccgagg gcctgagtgc tccagtagcc accgcatctg 1021 gagaaccagc ggttaccatg gaggggatca gtgtaagtcc agtttcaacc tgctttgtca 1081 taaatgtaca aacgtttgaa cttagagcgc agcccctctc cgagcgggca gaagcggcca 1141 ggacattgga ggtacccgta ctccaaaaaa gggtcaccga aaggagtttt cttgaccatg 1201 cctatatagt gcgggtgggt ggggggggag caggattgga atctttttct ctgtgagtcg 1261 aggagaaacg actggaaaga gcgttccagt ggctgcatgt gtctccccct tgagtcccgc 1321 cgcgcgcggc ggcttgcacg ctgtttgcaa acgtaagaac attctgtgca caagtgcaga 1381 gaaggcgtgc gcgctgcctc gggactcaga ccaccggtct cttccttggg gaagcgggga 1441 tgtcttggag cgagttacat tgtctgaatt tagaggcgga gggcggcgtg cctgggctga 1501 cttcccagga ggagattgcg cccgctttaa cttcggggtt aagcgcctgg tgactgttct 1561 tgacactggg tgcgtgtttg ttaaactctg tgcggccgac ggagctgtgc cagtctccca 1621 gcacagtagg cagagggcgg gagaggcggg tggacccacc gcgccgatcc tctgagggga 1681 tcgagtggtg gcagcagcta ggagttgatc cgcccgcgcg ctttgggttt gagggggaaa 1741 ccttcccgcc gtccgaagcg cgcctcttcc ccacggccgc gagtgggtcc tgcagttcga 1801 gagtttgggg 
tcgtgcagag gtcagcggag tggtttgacc tcccctttga caccgcgcag 1861 ctgccagccc tgagatttgc gctccgggga taggagcggg tacggggtga ggggcggggg 1921 cggttaagac cgcacctggg ctgccaggtc gccgccgcga agactggcag gtgcaagtgg 1981 ggaaaccgtt tggctctctc cgagtccagt tgtgatgttt aaccgtcggt ggtttccaga 2041 aaccttttga aaccctcttg ctagggagtt tttggtttcc tgcagcggcg cgcaattcaa 2101 agacgctcgc ggcggagccg cccagtcgct ccccagcacc ctgtgggaca gagcctggcg 2161 tgtcgcccag cggagcccct gcagcgctgc ttgcgggcgg ttggcgtggg tgtagtgggc 2221 agccgcggcg gcccggggct ggacgacccg gccccccgcg tgcccaccgc ctggaggctt 2281 ccagctgccc acctccggcc gggttaactg gatcagtggc ggggtaatgg gaagccaccc 2341 gggagagtga ggaaatgaaa cttggggcga ggaccacggg tgcagacccc gttaccttct 2401 ccacccagga aaatgccccg ctccctaacg tcccaaacgc gccaagtgat aaacacgagg 2461 atggcaagag acccacacac cggaggagcg cccgcttggg ggaggaggtg ccgtttgttc 2521 attttctgac actcccgccc aatatacccc aagcaccgaa gggccttcgt tttaagaccg 2581 cattctcttt acccactaca agttgcttga agcccagaat ggtttgtatt taggcaggcg 2641 tgggaaaatt aagtttttgc gctttaggag aatgagtctt tgcaacgccc ccgccctccc 2701 cccgtgatcc tcccttctcc cctcttccct ccctgggcga aaaacttctt acaaaaagtt 2761 aatcactgcc cctcctagca gcacccaccc caccccccac gccgcctggg agtggcctct 2821 ttgtgtgtat tttttttttc ctcctaagga aggttttttt tcttccctct agtgggcggg 2881 gcagaggagt tagccaagat gtgactttga aaccctcagc gtctcagtgc ccttttgttc 2941 taaacaaaga attttgtaat tggttctacc aaagaaggat ataatgaagt cactatggga 3001 aaagatgggg aggagagttg taggattcta cattaattct cttgtgccct tagcccacta 3061 cttcagaatt tcctgaagaa agcaagcctg aattggtttt ttaaattgct ttaaaaattt 3121 tttttaactg ggttaatgct tgctgaattg gaagtgaatg tccattcctt tgcctctttt 3181 gcagatatac acttcagata actacaccga ggaaatgggc tcaggggact atgactccat 3241 gaaggaaccc tgtttccgtg aagaaaatgc taatttcaat aaaatcttcc tgcccaccat 3301 ctactccatc atcttcttaa ctggcattgt gggcaatgga ttggtcatcc tggtcatggg 3361 ttaccagaag aaactgagaa gcatgacgga caagtacagg ctgcacctgt cagtggccga 3421 cctcctcttt gtcatcacgc ttcccttctg ggcagttgat gccgtggcaa actggtactt 3481 tgggaacttc ctatgcaagg 
cagtccatgt catctacaca gtcaacctct acagcagtgt 3541 cctcatcctg gccttcatca gtctggaccg ctacctggcc atcgtccacg ccaccaacag 3601 tcagaggcca aggaagctgt tggctgaaaa ggtggtctat gttggcgtct ggatccctgc 3661 cctcctgctg actattcccg acttcatctt tgccaacgtc agtgaggcag atgacagata 3721 tatctgtgac cgcttctacc ccaatgactt gtgggtggtt gtgttccagt ttcagcacat 3781 catggttggc cttatcctgc ctggtattgt catcctgtcc tgctattgca ttatcatctc 3841 caagctgtca cactccaagg gccaccagaa gcgcaaggcc ctcaagacca cagtcatcct 3901 catcctggct ttcttcgcct gttggctgcc ttactacatt gggatcagca tcgactcctt 3961 catcctcctg gaaatcatca agcaagggtg tgagtttgag aacactgtgc acaagtggat 4021 ttccatcacc gaggccctag ctttcttcca ctgttgtctg aaccccatcc tctatgcttt 4081 ccttggagcc aaatttaaaa cctctgccca gcacgcactc acctctgtga gcagagggtc 4141 cagcctcaag atcctctcca aaggaaagcg aggtggacat tcatctgttt ccactgagtc 4201 tgagtcttca agttttcact ccagctaaca cagatgtaaa agactttttt ttatacgata 4261 aataactttt ttttaagtta cacatttttc agatataaaa gactgaccaa tattgtacag 4321 tttttattgc ttgttggatt tttgtcttgt gtttctttag tttttgtgaa gtttaattga 4381 cttatttata taaatttttt ttgtttcata ttgatgtgtg tctaggcagg acctgtggcc 4441 aagttcttag ttgctgtatg tctcgtggta ggactgtaga aaagggaact gaacattcca 4501 gagcgtgtag tgaatcacgt aaagctagaa atgatcccca gctgtttatg catagataat 4561 ctctccattc ccgtggaacg tttttcctgt tcttaagacg tgattttgct gtagaagatg 4621 gcacttataa ccaaagccca aagtggtata gaaatgctgg tttttcagtt ttcaggagtg 4681 ggttgatttc agcacctaca gtgtacagtc ttgtattaag ttgttaataa aagtacatgt 4741 taaacttact tagtgttatg ttctgatttc tgttgacatt cttttggcta gtagaagaca 4801 aaagtaatac atttatggta tgcaaagcac tatcctaggt atttcattgt aatattttac 4861 ttacccctta tcacaactct gatagattct gcttctgtta ctaattacat tttatagaag 4921 aggaaacgga ggcacagaaa gcctaagtaa cttggttaaa ggcatgtagt aagtatcaaa 4981 tcctgtattt taaaccaggt aacatgactt aacgaatctg aagccttcac cactttaaat 5041 tcaaatggaa gtttagaaat ggccagccag cacctatttg tatgaaaggt catctttcag 5101 aggataagca tgtataaaga agaaaaggta tgcagtcgtg tttggatttt actccaccat 5161 c // LOCUS AF005260 4449 bp DNA 
PRI 13-JAN-1998 DEFINITION Homo sapiens homeodomain-containing protein BAPX1 gene, complete cds. ACCESSION AF005260 NID g2766599 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4449) AUTHORS Tribioli,C., Frasch,M. and Lufkin,T. TITLE Bapx1: an evolutionary conserved homologue of the Drosophila bagpipe homeobox gene is expressed in splanchnic mesoderm and the embryonic skeleton JOURNAL Mech. Dev. 65 (1-2), 145-162 (1997) MEDLINE 97398454 REFERENCE 2 (bases 1 to 4449) AUTHORS Tribioli,C. and Lufkin,T. TITLE Molecular cloning, chromosomal mapping and developmental expression of BAPX1, a novel human homeobox-containing gene homologous to Drosophila bagpipe JOURNAL Gene 203 (2), 225-233 (1997) MEDLINE 98086223 REFERENCE 3 (bases 1 to 4449) AUTHORS Tribioli,C. and Lufkin,T. TITLE Direct Submission JOURNAL Submitted (23-MAY-1997) Brookdale Center for Developmental and Molecular Biology, Mount Sinai School of Medicine, One Gustave Levy Place, New York, NY 10029-6574, USA FEATURES Location/Qualifiers source 1..4449 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p16.1" mRNA join(<861..1326,2751..>3286) /product="homeodomain-containing protein BAPX1" CDS join(861..1326,2751..3286) /note="Drosophila bagpipe gene homolog" /codon_start=1 /product="homeodomain-containing protein BAPX1" /db_xref="PID:g2766600" /translation="MAVRGANTLTSFSIQAILNKKEERGGLAAPEGRPAPGGTAASVA AAPAVCCWRLFGERDAGALGGAEDSLLASPAGTRTAAGRTAESPEGWDSDSALSEENE SRRRCADARGASGAGLAGGSLSLGQPVCELAASKDLEEEAAGRSDSEMSASVSGDRSP RTEDDGVGPRGAHVSALCSGAGGGGGSGPAGVAEEEEEPAAPKPRKKRSRAAFSHAQV FELERRFNHQRYLSGPERADLAASLKLTETQVKIWFQNRRYKTKRRQMAADLLASAPA AKKVAVKVLVRDDQRQYLPGEVLRPPSLLPLQPSYYYPYYCLPGWALSTCAAAAGTQ" BASE COUNT 820 a 1325 c 1470 g 834 t ORIGIN 1 tcggatgagc ctccttgttg ctggagggtg cttcgaggac caggcttagg agagaagggg 61 ttaagtcaga ccctgtcttt agcggcgcaa cgtgcccagt tttgtctgtt aaagtcaacg 
121 tcgcccaact ggagccagcc aatggacggc cgggagcgcc cgggacagcc cagtggaggg 181 gtgtggcctc aggctgtcac tcagcagtga ggaatgacac ccgggcggga ggggggtggt 241 acagtggaac gcaagggaag gaaagagaaa agaaacagat aaaaccaggg cagggggtaa 301 aaaagttccc cgcgcctctc ggaatccagc ggccgcggcc gcggctaccg ccgccgccgc 361 cgggggtggg ctccggaaga gcctgcccag cagggaaggg gaggacgggc caggagcgaa 421 atcctgcagg cgaggaaggg acagacgcgc aaagaaaaag tgactcgggc tgtgcgcccc 481 ggcggcgctc gggaggggac tgctgccacg gaggggcccc cgccaatcct ccttcacgtc 541 ccccgagggg gcagacccca gcccgcccct aagggcgaga agcgatgaga taatccagct 601 gagctgcgaa aggagagcgt ctggctctgg attcgggggg agggggtctc cgggcggggt 661 gggcctctaa ggtgggggcg agggtcggcg ctgatcctgg gacggagtgg ggcgggagct 721 gctgggccgc gcagcagtgg gcggcggcgg cctcgcacac ccagctcact cgcgctgcgg 781 ccgccggcgc tctctgtccc gctcggagct gctcggcgcc cagctgcccg ccccgccggc 841 cgctcctgcc cgcggcgcag atggctgtgc gcggcgccaa caccttgacg tccttctcca 901 tccaggcgat cctcaacaag aaagaggagc gcggcgggct ggccgcgcca gaggggcgcc 961 cggcgcccgg gggcacagcg gcatcggtgg ccgcggctcc cgctgtctgc tgttggcggc 1021 tctttgggga gagggacgcg ggcgcgttgg ggggcgccga ggactctctg ctggcgtctc 1081 ctgccggtac cagaacagct gcggggcgga ctgcggagag cccggaaggc tgggactcgg 1141 actccgcgct cagcgaggag aacgagagca ggcggcgctg cgcggacgcg cggggggcca 1201 gcggggccgg ccttgcgggg ggatccttga gcctcggcca gccggtctgt gagctggccg 1261 cttccaaaga cctagaggag gaagccgcgg gccggagcga cagcgagatg tccgccagcg 1321 tctcaggtct gctgcggctt cgcggggatg ggggccgggc tgagggatgg acgcggaggg 1381 caaccacgcc gagtggggtg gtcggcttga gcaccctgcg ccctggggct cggaggctgg 1441 ccaaatcccg aagttctgcg ctctggcgcg agctgctcta tcacttcggg aactgagccc 1501 gcgagagacc ggagtagctc ctcagagccc tacgtcctgg ggtgaggcct gagacatcct 1561 gggccaatgt gcattagact gggttgggag aaccgctgcc ccttgcgttc gggagttttt 1621 gagttcgtcg ccagtgcttt ctgccgtgtc tgtccccaaa tccccaaaat cagggaagcg 1681 gaaaagtgag gggttcagag acctcaggta gccagtgacc tagctctgga ccagagacct 1741 gccagttcaa ggtcccaggg ccgggcccca cagtaggtgg ataggtggtg aagaacacct 1801 ccgagtgggc cagagctgag 
aatctgcaga actagagctt tggggaaagg gctggtttgg 1861 ctttctcaga ccccaggtct gggagagcag ccctaggccg aagatcccgg attgggtctt 1921 tagttcatcc tggaggcagg agagaggcct cctcagccgt ggcctgtcct gtcctgtggt 1981 ctggaagacc cgggtgcctg ccctagttgg gtacaagctt ctgtaggcct caagcacagg 2041 ggaccgctgg acctttctgc tggagttctt cggaagggcg ggtacgagac cgagaagctg 2101 gtgtgggatc agggtccaaa aactcgatgc aggggcaacc ctccctcttt actctccagt 2161 ccatcctctc atctaaggcc tcggctcagc ctgtttccat gtctctcaca gcaatcccag 2221 gcggctgagg gtgaaagatt ctaaatacct ttggccccag gatccgcctc caccttccct 2281 tacctgcacc agacaagtgt gtgctgcgag gacgcgcgcg tgtgtgcgtg tgaactaagt 2341 gtggagaggc gcgtgaaccc aagcagagag acggaaatgc agtctagaat cccaatgctc 2401 aacgctgaga gccgggaggg ctttcgcgga tgctcgggtc gagtgcccca agttactgag 2461 gggagcctgg ggcccagaga gggtggggag ctggtctcga caacacagcg ttgacacagc 2521 ggctgagtct gccgccctcc ccagaagagg cctagatggg atacccagtg gggcaagagt 2581 ctaagactgg tctcccgggc ccagggcctc ccagacctcc gctccgctcc gcttgagtgg 2641 gatcaaagtg tatttccgcc ccactcccgt gacccctgca gttgaggccc ggcctcacct 2701 cgtctgtggc gctgacagtc tcttgtttct gttcccaccg gggtccgcag gcgaccgcag 2761 cccaaggacc gaggacgacg gtgttggccc cagaggtgca cacgtgtccg cgctgtgcag 2821 cggggccggc ggcgggggcg gcagcgggcc ggcaggcgtc gcggaggagg aggaggagcc 2881 ggcggcgccc aagccacgca agaagcgctc gcgggccgct ttctcccacg cgcaggtctt 2941 cgagctggag cgccgcttta accaccagcg ctacctgtcc gggcccgagc gcgcagacct 3001 ggccgcgtcg ctgaagctca ccgagacgca ggtgaaaatc tggttccaga accgtcgcta 3061 caagacaaag cgccggcaga tggcagccga cctgctggcc tcggcgcccg ccgccaagaa 3121 ggtggccgta aaggtgctgg tgcgcgacga ccagagacaa tacctgcccg gcgaagtgct 3181 gcggccaccc tcgcttctgc cactgcagcc ctcctactat tacccgtact actgcctccc 3241 aggctgggcg ctctccacct gcgcagctgc cgcaggcacc cagtgaaccc gcttgggctg 3301 aggcagcgag tgattcccgc gctccggctc cggaccggcg ctgacagctg taggctgtag 3361 cctgcacggg gcgccccgcc aaggaggcac ctggaggtga aacccagctc cagctcccgt 3421 tagccaggac ttgtcccctg gcagctgggc tgagtctgcc ctgagggggc gcctttttct 3481 aatttgaaca gaggcaccct atggcctagg 
ggccctgatc gcccacctgc ctggaagccc 3541 ctgggctcta tttattatca tgacaatgtt ggaattaaat tttgattcga atatgtctgc 3601 ctgggggtgg ggttttccct gagcggcaac tcctggagac cacatagcct gaatcctcag 3661 aatttcaggc ctgctgggag ctttctgcac taggccacac tagttcatgg tatccatgct 3721 accaatctat gtgtatctac atatctttta tttttggaaa ttgcatttgt aaccaagggg 3781 tgcgaaaccc tggcagtccc aggcagcacc aggccagggg ttgatttgaa acgtgaagga 3841 ttgggttttc aggccctctg ctccacccct cctgtgtgtc agagctaggg tgggggtgcc 3901 cgattcgggt gctgaatgta aggaggggag cctccaagtg tggtgcaagc cgggggtctc 3961 cacatcttcc ttctctgaag tccaggtacc tgcacaagca ggaagcgcct gggagccccg 4021 gaaggaggag agcgcacacc caggcagccc tctgcggaaa ctttccttgg tttcttttta 4081 tttgtgtaaa ggaggttaag acgtgtcgca cttttcagtt gtttgtattc aaatgacgat 4141 tatttttcta ctcaatgtga atatccctgg ccagcctttc cacggcgccc accgcagtgc 4201 cgctgcctgg ccctcagtgt ctaccttccg ccctctgcga ctccagtgct ctggcccggg 4261 actcccctat ccgcccctca cttaccctta aacaggtgat cccacctgtc ttgtcaacct 4321 cgccgctttt cgcctcctta atggcactgt gcactcaact agagtattaa ctgtaaaaag 4381 atttgtgaag tttggaagct ctattcgctg tattttttct ttaatttata aacttttagt 4441 ttaacatgc // LOCUS AF006501 40107 bp DNA PRI 08-JUL-1997 DEFINITION Homo sapiens chromosome 22 cosmid clone c1155, RNA polymerase II subunit 14.4 kDa (POLRF) gene, complete cds. ACCESSION AF006501 L48815 NID g2262072 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40107) AUTHORS Pusch,C., Wang,Z., Roe,B. and Blin,N. TITLE Genomic structure of the RNA polymerase II small subunit (hRPB14.4) locus (POLRF) and mapping to 22q13.1 by sequence identity JOURNAL Genomics 34 (3), 440-442 (1996) MEDLINE 96374842 REFERENCE 2 (bases 1 to 40107) AUTHORS Pusch,C., Mullenbach,R., Gott,P., Schmitt,H., Wang,Z., Roe,B. and Blin,N. 
TITLE Cosmid-derived transcripts and sequence tags mapped to three subregions of human chromosome 22 JOURNAL Gene 183 (1-2), 29-33 (1996) MEDLINE 97149275 REFERENCE 3 (bases 1 to 40107) AUTHORS Wang,Z. and Roe,B.A. TITLE Clone c1155 with RNA polymerase II subunit 14.4kDa from human chromosome 22, 40107 bp JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 40107) AUTHORS Roe,B.A. TITLE Direct Submission JOURNAL Submitted (01-JUN-1997) Department Of Chemistry And Biochemistry, The University Of Oklahoma, 620 Parrington Oval, Room 208, Norman, OK 73019, USA FEATURES Location/Qualifiers source 1..40107 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone="c1155" mRNA join(7031..7128,10095..10164,12650..12780,20414..20485, 20940..21114) /gene="POLRF" /product="RNA polymerase II subunit 14.4 kDa" gene 7031..21114 /gene="POLRF" CDS join(7109..7128,10095..10164,12650..12780,20414..20485, 20940..21030) /gene="POLRF" /codon_start=1 /product="RNA polymerase II subunit 14.4 kDa" /db_xref="PID:g2262073" /translation="MSDNEDNFDGDDFDDVEEDEGLDDLENAEEEGQENVEILPSGER PQANQKRITTPYMTKYERARVLGTRALQIAMCAPVMVELEGETDPLLIAMKELKARKI PIIIRRYLPDGSYEDWGVDELIITD" BASE COUNT 8760 a 10368 c 10941 g 10017 t 21 others ORIGIN 1 gatcacttga ggccaggagt tcaagaccag cctggccaac atggtgaatc cccatctcta 61 ctaaaaaaat taactggaca tggtggtgga cacttgtaat cccagctact caggaggctg 121 acacatgaga attgcttgaa cccgggaggc ggaggttaca gtgagccgag atagcaccac 181 tgcactccaa cctgggcaca gagtaaggct ctgtctttaa aaaaaaaaaa aaaaaaaaag 241 ttttgtcaaa acttagcaga gcaaggctaa gaaagctcag gaacctgggg cctcggcgat 301 gcctgtagtg gcaacttatg tataggggct gcagggagag gttaggctgt ggcattgagc 361 cctgatagcc cctatcatca ggaagtggta gatttcagtt aatcctgagt tctggaaaaa 421 taaaaaaaaa aggaccatct gactggtttc ctctgcgccc tgaatacccc taagccaaca 481 cgtgctgagg ctcaagagca ttggtacaag tgcatgggga ttgattagct accttcccac 541 ctgccctgtg ggctgcttgc gtccctctcc cttgtcctcc cccagggaga tggagaaagg 601 ccctagctga gaccctcact tactggtggc ttgaggcttg aactgctccc ggctgtaggc 661 
cccattggct tgacacatgt tggcaggccg gaggtgggga cgggctgcga ggatgggagg 721 caggtagatg ggcgaggcta tttgcttgga aggtaagact ctctggctgg atgttgggct 781 gcactgtagg ggcaaagcat ctcctcctgg ggaacgaaga gttggggtgg gaggacccac 841 atggcttttt tttttttttt tttttttttt tgagtctgag tcttgctctg tcactcaggc 901 tggagtgcag cggcgcgatc tcggcttact gcaacctctg cctcttaggt tcaagcgatt 961 ctcctgcctc agtctcccaa gtaactggga ttacaggcat gcgccaccat gcttggctaa 1021 tttttgtatt tttagtagag acaggccatg ttggccaggc tggtcttgga ctcctgacct 1081 caagtgatcc acccacctcg gcctcccaaa gtgctaggat tataggcatg agctattgca 1141 cctggccact ttttaaaaat aaaagttggc cgggcatggt ggctcatgcc tgtaatccca 1201 gcactttggg aggccgagat gggtaatcac ttgaggtcag gaattcgaga ccaccctggc 1261 taacacggtg taaccctgtc tgtactaaaa atacaaaaaa ttagctgggc gtggtggcgg 1321 gtgcctatag tccctactgc tcgggaggct gaggccggag aatggcatga acctgggagg 1381 tgggacttgc agtgagctga gattgcacca ctgcactcca gcctgggcga cagagcgaga 1441 cttcgtctca aaaaaaaatt aatagaataa aaataaaggt aatacctacc tgttaaaaga 1501 agtcaaaagg ccaggtgcag tggctcacgc ctgtaatccc aacactttgg aggctgagtg 1561 gcagatcact ggaggtcagg agtttgagac cagcctggcc aacatggtga accccatgtc 1621 taaagaaaaa caaaaaaaat tagccaggca ttatgacggg tgcctgtaat cctagctact 1681 tgggaggctg aggcgggaga atcacttgaa cccaggaggt ggaggttgca gtaagccaag 1741 attgcaccac tgcattccag cctgggtgac tgagcaagac tgtctcaaaa aaacccccaa 1801 aaaacaaaaa ttagctgggc gtggtgatgc aggcctgtaa tcctagctac tgggaaggct 1861 gaggcaggag aatctcttga acccaggagg tggaggttgc agtgagctga gattgagcca 1921 ctgcactcca gcctgggcaa cagagtgaga ctctgtctca aaaaacaaaa caaaatgaaa 1981 caaaacaaaa caagtcaaac agtacaagga gtcttctctt cttccttcca gtcccacccc 2041 tctataatga atgagtgata atggtaacag tttggtggtc tccttccttt ttgatgctta 2101 tacaaacatg ttcaaagata tatgtgcata taagattctt tcaaacactg gatcgtactg 2161 taatatatta aaataacaga ctcttttcac ttaacatttt ttcatggaca tctgtccaga 2221 tcaatccatt tatatataac ttatttttta tagctatata ctagtatcct tagaatggat 2281 acattattta ttcagctcac ctccttgttt cagaaattta ggttatttct aggtttgttt 2341 tttactacct 
caaacaatgc tgcaataaac attcttgtac atcaaatctt tgtaccgatg 2401 ctttaatttc tatggatggg ttcctaaaac tttctgtgtg gtgcatggac aagcagggac 2461 agtttggatg acagcaggta cctatccctc ctgctgtcca tccccccgct tggccatccc 2521 tggggcttct cccccaggcg atccaagtgt gtcacctgtc tggtcagttg accccacctc 2581 tcaactgcca taaagcccag cttagagcag gtcccatcca ttgttttttt tttttttttt 2641 gagacagagt ctccctctgt tgcccaggct ggagtgcagt ggcatgatct tggctctctg 2701 caacctacac ctcctgggta caagcgattc ttgtgcctca gtctcctgag tagctgggat 2761 tatggtgtgt gccaccacgc cgggccaatt tttttttttt tttttttttt ttgagacaga 2821 gttttgttct tgtcacccag gctagagtgc aatggtgtga tctcagctca ctgcaacctc 2881 cgcctcctgg gttcaagcta ttctcccgct tcagcttccc aagtagctgg gactacaggc 2941 ccgggtcacc acacctggct aatttttgta tttttagtag gggtggagtt caccatgttg 3001 gccagggtgg tcttgaactc ctgactttgg gtgatctgcc tgtctcggcc tctgaatgtg 3061 ctgggattac aggcgtgagc caccgcacct ggctcatcca ctgttttgat ctgggtgcct 3121 gtctaccttt tctgcccatc cagcctctac aaggtcaatt ccccaggagc gagcggtcag 3181 aagtattaat ttttttggtc acacacccaa aaattttgct gtagagagag gtttcactgt 3241 gttgcccagg ctggccttga actcctgggc tcatgtgatt ctcctgcctc agcttcctga 3301 gtagctggga ctacaggtac accccagtat gtatgtgttt tgccaggctg gtcttgaact 3361 cttgggctca agcaatcctc ctgccttagc ctcccaaagt gataggatta taggtataag 3421 ccaccacagc tgacctgttt ttttttattt tttatttttt gtttttgaga cgatgtcttg 3481 ctctgtcacc cacactggag tgcagtggta tggtctcggc tcgctgcacc ctctgcctgc 3541 tgggttcaag tgattttcct gcctcagcct cctgagtagc tggtattaca ggcatccgcc 3601 atcatgccca gctaattttt gtattggcat agagacaggg ttttaccatg ttggccaggc 3661 tggtcttgaa ctcttgacct caggtgatct gcccgcctca gcctcccaaa gtgctgggat 3721 tacaggtgtg agctcccaca cccggcctga cctgttttta ttttaattta attttatttt 3781 attttaatta ttttgagatt aggctccact cggtcgccca ggctggagtg cagtggtgca 3841 atcacagctc attgcagcct tgacctccca gggtcaagtg atcctcccac ctcaacctcc 3901 tgagtagctg ggactacagg ttggcaccac cacacctggc taattttaaa aaattttggg 3961 ggctgggtgc agtggctcac gcctggaatc ccagcacttt gggaggccga ggcgggcaga 4021 tcacgaggtc aggagttcga 
gaccagcctg gccaacatgg tgaaacccag tctctactaa 4081 aaatacaaaa attaagccgg gcatggtggc gcacgcctgt aatcctagct agtcaggatg 4141 ctgaggcagg agaattgcct gaacccagga ggcggaggtt gtagtgagcc gagatcatgc 4201 cactgcactc cagcctgggc gacagagaga gagactcatc tcaaaacaaa acaaaacaaa 4261 aaagttggcc aggcgtggtg gctcactcct gtagtcccag cactttggga ggccgaggtg 4321 ggcggatcat gaggtcagga gatcgagacc atcctggcta acacgatgaa acgccgtctc 4381 tactaaaaat acaaaaaatt agccaggcgt ggtggtgggc acctatagtc tcagctactt 4441 gggaggctga ggcaggagaa tggcatgaac ccgggaggca gaagttgcag tgagccgaga 4501 ttgcgccact gcactccagc ctgggcgaca gagtgagact ctgtctcaaa aaaaaaaaag 4561 gttttgtgga gatgaggtct tgtggcttgt ggcgttgccc gggctggtct tgaactcctg 4621 ggcttaagcc atcctcccac ctcagcctcc cagtgttgga attacaggca tgagccacgt 4681 gcccggctgt ctttgttttt tttatatgag gtcctctctc ccttaacatg gagaagcatt 4741 aaaagtgtgg ccccagatcc ctctgtccag gaagcctctg tggactgccc cttacttttc 4801 atgatgtcca tgatgtggcg ctgctggatg ttcgtcagtt tggattcctt catcatcact 4861 gcacgacaaa gcattctatg aggcctcttg ggctgcaggc ggatgccttc acctgacacc 4921 ctactgaaat ttaaaaccct acatcctagc cccattctgc actgagcatg ccttctctat 4981 ctttcctgct tcgttttttc tatagcactt acgaccacct aaaattctat atattttatt 5041 ttattgttat ccttctatat tagctgtatg ataagaattt ttgctggttt aataaaattt 5101 ctgccctctg ttaaatattt tttcctcttt ctcttttttt tttttttttt tgagatggag 5161 cttccatctt gttgcccagg ctggagtgca gtggcgcgac ctcagctcac tgcaatctcc 5221 acctcctggg ttcaaggaat tctcctgcct cagcctccca agtagctggg attacagaca 5281 tgcgccacca cgcccggcta attttgtact tttagtagag actgggtttc tccatgttgg 5341 tcaggctggt cgtgaactcc taacctcagg tgatctgccc accttggcct cccaaagtgc 5401 tgggattacc ggtttgagcc acctcatttt ctttcactgc tgtatcccca gcacctaaaa 5461 cagagtctaa ggcacatggt agacttaaaa atgttgttgt tggaaaaaaa caaaaaagtt 5521 gttgagtaaa agaacacatg aatacttgtt catcatgttt caggggctgg agctacaatt 5581 gtgggccgaa acaggcacgg ttcctgcctg natgaaattc atgatctgga gggatagatg 5641 ggacaccaaa taactgacta cagcttgtgc taagcccagt gaaggaaaat gtgtgagggc 5701 acataacagt tgatcctact gtggtggtga 
ggagcagaag tcttcctcgg gaaagttgaa 5761 agtgtgatct aaaggatgca taggagttaa ctactctaaa aattttagtt ctttgacttt 5821 tggcaaggtg tagcaccttg aaccttcccc tcccccccgc cccgccccgc ccccgactaa 5881 ggcatatgca gtagaatttt gaagctttct gaaccccagg ctccttatct gtaaaggaaa 5941 atagtgctaa gccacaggtc aggtacgggg gaactgagtt acagagggta actaccacca 6001 ttcttctctc atcctttaac tcaggggtcc ccaaccctcc gtccgtggac tggtaccagt 6061 ccctggcctg ttaggaaccg ggccgcacag caggaggtga gcggcaggct agcgagcctc 6121 accgcctcct ttcaggtcag tgacggcctt agattctcat aggagcgcga accctattgt 6181 gaactgcgca cgcgagggat ctaggttgca cgctccttat gagaatctaa ctaatacctg 6241 atgatctgag gtggaacagt ttcatcccga aaccattccc cacacctccg tcagtggaaa 6301 aattgtcttc cacgaaaccg gtccctggtg ccaaaaaggc tgggatcgct gctttaacgg 6361 actgacctct gagcagctcg caggtccccg gggtgtaagt gatggtcttg gggcggcgcc 6421 ggaacccagt tcctttggtc actacctcca tctgcttctg tgaagccatg ggaggactct 6481 gaggagggaa agacgcaatg acacaccccc tgtctcctgt ccctaatctc tgttccccga 6541 cgcccaacac ccccagaagt ctggaagcgg ggatcgagcc agagaatgat gccgccagtt 6601 aatattgcga cagtgcccaa acctcccatc cagacacaca gatacacaca cacagtcctc 6661 attctgaccc agcgcctgtg gcagggaagt ctcctcggga gggagtgtta acacagccat 6721 ggtcacaaca gtacccactg agtgatatgc caaaattgaa atttcaccct aagaccctct 6781 ctgggtgctc ccctctccac atagaagtgg gctcccggag gcggtgacca ctgctttaca 6841 gccgcatccg caccggtgcc acgcagaaat actgcgcgat ctgggctgct ggagctggca 6901 gctagcgcct ctcgctacta tagaaacgcg cacacaccag tcagcgcatg cgcactttcc 6961 ccgctctgtc ctgcctcccg gaagtgattt cctctgggtt acggcgcagg cgcaagataa 7021 gctaggagcc gcgcgagtcg tagtgtcgct gtttgcgggt ctccgcggcg ggaccggggc 7081 gcagcggggt cgctgaggcg agggtgtcat gtcagacaac gaggacaagt gagtgcggga 7141 gcggagtggc ctttgcggca accttggaag gggcggatga ggcaaggctt gggtcggctg 7201 aggagcctgg cgtctagggg caatgtctga ggggactggg gtcctgaggg ggccggcgga 7261 gccgaggaag ggaannncgg gcgacccagg agcacagccc cggctggagg cacaggaggg 7321 cccaggctcg agggtccaga ggggatccgt agaaaaaggt gtccttcagc aaatgttcca 7381 gcgagccggg ctctgtatca tgcagtggga atattgaggt 
gaatgaacaa tcgtccctgc 7441 ctctccacgt gctcactttc tagaaaaggg gtcacaggcc cataagtaag caagcaagca 7501 ggtaacggtg gtcgcgtgag cttccccagt tctagacgta tgtagtatag gagtactcca 7561 gaggagggtc tggggcagct ccaccgggtg gcattaaata aaagtctggg acggtccttt 7621 gcagtaattt tttttttttt ttgaaacaga gtcttgctct gtcgcccagg ctggagtgca 7681 gtggcgcgat ctcggctcac tgcagcctcc acctccgggg ttcaagggat tctcctgcct 7741 cagcctcccg actagctggg attatgggcg tgcaccacca cgccttgcta attgcttgta 7801 gttttagtag agacgggttt caccatgttg gccagactgg cctcgaactc ctgagcttgt 7861 gatccacccg cctcggcctc ccaaagtgct gggatnacag cgtgagccac cgtgcccggc 7921 ctgcttgtag taatctagat gagagaagtc agtagcttat agtagggtgg taggagtaga 7981 ggtggtgaga actggttttg aattctggat atattactat tttaaagata gagcccaaaa 8041 attacctgtg ggatatgtgg gtgcaaagag agctagtctg aagataaaaa taagaaacgt 8101 gtgggattgc taccccattc ccagtgagtg tcagtagaga agaggagatg acccgtggct 8161 gagcattagg gcatttggac attgagtgct tgggggaaaa gagaacccgg ggaggtgggg 8221 gaacaccaag agggtgaggt gtccttggag tcaagtgagg agagcaagtg gcctaatgtg 8281 ccagctgata gatcaggcta aatgagaagg agagttggcc attggattta gcaacgtgga 8341 gatcattaat gagcttggaa agtggttttg atggagctgt tggggtgaaa ggctttttgg 8401 agtgagatta tgacaaaatg ggaagagagg aagatagtga gggtggacag ctctttcaca 8461 catttttgct tcaaaggaca ccatggaatt agggctatag ggggagagag aagagaaata 8521 agggtttttt tttttttttt tttagatgag gaaaagcaca tgttgaaatg ctgatggggc 8581 cgggtgctgt ggcttacacc tctaatccca gcactttggg aggccgagga gggtggatca 8641 cctgaggcca ggagttcgag accagcctgg tcaacatggt gaaaccccat ctttactaaa 8701 aatacaaaaa ttagctgggc atggtggcgt gtgcctataa tcccagctac tcgggaggct 8761 gagacaggag aattgcttga acccaggagg cagaggttgc agtgagccaa gatcacgcca 8821 ttgcactcca tcctggacaa cagagcgaga ctttgtctca aaaaaaaaaa aaaatgctga 8881 tgggaatgat ccagtagaga gggaaacagt gatgatgaat gagtactagc aatatttggg 8941 ggcagatggg gaaaaattga aggggtaaaa tggaacaaaa gtcggattta cttcttcttc 9001 ttattattat tttttatttt ttgagaagga gtctcgctgt gtcacccagg ctggagtgca 9061 gtggggtgac cgtggctcag tgcaccctcc gcctctcggg ttcaagcaat 
tcttgtgcct 9121 cagcctcctg agtagctggg attacaggcg cccaccacca cacccggcta atttttttgt 9181 atttttagca gagacgagat ttctccatgt tggccaggct ggtcttggac tcctgacctc 9241 aagtgatgca cccacctcgg cctctcaaag tgctgggatt acagacgtga gccactgcac 9301 ccagctgatt tacttattaa gtaaatattt attgagtgct tcctacatac tgggcgagac 9361 acagatctct gccctctttg attttttttt tgtttgtttt ttgttttgcg gtggtggggg 9421 ggggttttga gacggagttt cactcttgtt gcccaggctg gagtgcaatg gcgtgatctc 9481 agctcaccac aacctccgcc tcctgggttc aagtgattct cctgcctcag cctcccaagt 9541 agttgggatt acaggcatgt gccaccacca ctcccggcta attttgtatt tttagtagag 9601 acggggtttc atcatgttgg tcaggctggc cttgaactcc acccgccttg gcctcccaaa 9661 gtgctgggat tacaggctta atcccagagc cacaatacaa agccaccatg cccggcctct 9721 gaattttttt tttttttaat ttttcttatt tattttgaga caggttcttg ttctgtcacc 9781 catgctggag tacagcggtg caatcttggc tcactgcaac ctccacctcc tgggttcaag 9841 caattcttgt gactcagccc cccaagtagc tggtattaca gacatgtgcc accatgcccg 9901 gctaattttt gtatttttag tagagatggg gttttgcctt gttggccagg ctggtcttga 9961 attcctggcc tcaagtgatc cgcccacttc aatctccgaa agtaccagga ttacaagtgt 10021 gatccaccgt gcctggcccc tctttgattc ttttaagtga tgctctaagt aactgatctc 10081 ttcttcctgt acagttttga tggcgacgac tttgatgatg tggaggagga tgaagggcta 10141 gatgacttgg agaatgccga agaggtcagt attcagcctc aggctcccac ctctgcagcc 10201 caagctgcca aatcgtctga caggtgatga ttagctgaat gtgtctgcct tctatttgtg 10261 gtcaaggccc ctgccataag cattcccagt tagagtgagc tgtgttccag tcttgcatag 10321 cagaacagcc ctgcagggtg ggcctgaaag cttttgtcca tgtggtccac agcctgtggc 10381 ttctggggca gttagttcac tgacaagtta aacggtgcct tccttcattt ggcaaacatg 10441 tgttgaatgt ttattatatg ccaggtgctt tctgctgggt gggactgctt aaggagcaaa 10501 gggaagaggt aagcagggct cagatgaggc agtattttgt agcctaaagg caatgggaag 10561 ctactgacaa actttgagcc agggcacagt ccaggttaca ttttagaaag tacattcctg 10621 cctcgagtaa ggagtgaact ggggaagggc aagagtgaaa ggtgagaggc ctgtggcttg 10681 gacataggtg ctggtagtgc aggtgatgag aaggcagata aaagagttat ttcggtgggt 10741 tggttatggg gctgagaaag tgagtggtga cgaggacgac 
tctcaggttt ctggcttgag 10801 caactaggtg gataggttgc cagcctcaag aggaaagtag ggctttctgt cttgtgtgac 10861 tgatgaggga aaacctgcag ttatgttctg gagagacatc nnctctctgc atggcacttt 10921 cagatacctt ttcacttgag tcagatgcgt ccttctccat ccttctggtc cccttcccat 10981 ccctgtcatg tagattgtgc ctcctagatg tacttcagcc ctgtctgcat ttttctactc 11041 ctggtgcctt aacctggctt agatcttatt gcttttcagc cacattcctg caggaattgt 11101 gtggctggtt ttcctgcttc agtctcactc cagactctgc tctgtgtgct cttctccttt 11161 cagcaccctt tcttacttgc ccacatgctc agttactact ggctttccta aatggcctct 11221 ctgcttctgc cactgccgcc tgcagaatat tcgcaacaga gtatccagag tgatcctttc 11281 tttctgtttt tgttgttgtt gttgttgttt gtttgcttgt ttaaattaag aagtggtgtc 11341 ttgctacgtt gtctaggcca gagtacaatg gctattttca tgtgtggtca tagcgcacta 11401 cagcctnaac tcctgggctc aagtgatcct ccagttcagc ctcctgagta gctgggactg 11461 caggttctgc accaccatgc ccagctcaga gtcaactttt tttttttttt tgagacgggg 11521 tctgctctgt cgcccaggct ggagcgcagt ggggcgatgt tgactcactg caagctctgc 11581 ttcccgggtt catgccattc tcctgcctca gccttctgag tagctgtgac tacaggcgcc 11641 caccaccacg cccggctaat tttttttttt tttgtagttt tagtagtgac ggggttcacc 11701 gtgttagcca ggatggtctc gatctcctgc cttgtgatcc gcccgcctcg gcctcccaaa 11761 gtgctgggat tacaggtgtg agccactgtg cctggcctgt ttacttaatt ctttatatac 11821 cacagaattt attctcttgc ctacaatcct ctacgcacct tttgttctaa ttattgtcca 11881 caactagttg tccaaacctc tttctcctct atgcctttgc atgttgcctc cttcctcatt 11941 ggtttgacag ctcctgctca tccttctagg tgagctcaag tctcatccct ggagttttcc 12001 taacccccag tgtgagtgag gtgtccccac ctcctcggga actctgttgc acccttcata 12061 gcacctgctc tgtttactct tgtctcccct ttagaccatt aattgagagc agaaattcct 12121 agctttcttt ctcagcatct gttaacaata tctggctcag gacaaatgct tagtaagtaa 12181 ttgttaaata ttcctgcctt caccttgact tttctagcca cttggcccat tggctcagct 12241 cccctatctg aaagatggga agactattcc tagcttcctg cttagggtaa tttttttgtg 12301 ccaggacctc ttttacagga ccaacattct gataccaccc catctccagg ccctctgctc 12361 tcttctggat gaggcagtta gctctggaat ctcagacctc cattaccagg tctgaggctt 12421 ggtcaggtta gctgactttc 
caagcctctg aaatgagaac tgtaatccct gcctcataag 12481 gttattgtaa ggattacatg agttgttagg aagcagttat gtggtcccct ggcatgcagt 12541 aagcattaac ggtggctgta gcgagaattg tgattgttgt cattatcact ctccctgtaa 12601 cccaaaagtg ccacctgagt ctttccctct tttttgtctt ggtgtccagg aaggccagga 12661 gaatgtcgag atcctcccct ctggggagcg accgcaggcc aaccagaagc gaatcaccac 12721 accatacatg accaagtacg agcgagcccg cgtgctgggc acccgagcgc tccagattgc 12781 gtgagtgatt gccccttcac tgtcttctcc atctgtcagg ccacagtaag ctcttgtacc 12841 gctgatcttt cccagcctgg cacctgaaaa cagactctct gctctcacat tccaaaagtg 12901 gtgccgtccc tgcgtaccag gcattgagct agtcccctgc cctcagggag cttgtgatct 12961 aataggagaa ctaggacatg aacatctgag aagtcaagta acaatatggg agaaaaacga 13021 taaaaaaaaa aaacagaact agatggttgc gtgtcagatt agtgcagcaa aaaaggtgaa 13081 ttcagagcca atgtgtacca gtgtagtttt tttttttttt ttttgagata gagtctcgct 13141 gagtcaccta ggctggagtg caatggcaca atctcagctc actgcaacct ctgcctccca 13201 ggttcaagtg attctcgtgc ctcagcctcc caggtagctg gaattacagg catgtgccac 13261 catgcctaac taattgtgtt tttaatagag acggggtctc gccatgttgc ctaggctggt 13321 ctcgaacgcc tagcttcaag tgatccacct gcctctgcct cccaaagtgt tgggattgca 13381 ggtgagagcc accgcgcccg gccatggtgg ttctttgttg ttttattttc cgagtcaaga 13441 tctcaatctg ttgcccatgc tggagcacag gggcctgatc atggcccatt gcagcctcaa 13501 actcctgggc tcaagcaatt ttcttttctt tttttttttt tttgacagag tcttgctctg 13561 tctcccatgc tggagtgcac tggcccaatc tcagctcagt acaacccctg cctcccgggt 13621 tcaagcgatt gtcctgcctc agcctcccgc gtagctggga ctacaggcgt ccaccaccac 13681 acctggctaa tttttgtatt ttttcttttt tctttgagat ggagtctcgc tctatcaccc 13741 agcctggagt gcagtggcgc gttctccgct cactgcaagt gctgcctcct ggcttcacgc 13801 ccttctcctg cctcagcctc cagagtagct gggactacag gcgcccgcca ccgtgcccag 13861 ctagcttttt gtatttttag tggagacggg gtttcaccat gttagccagg gtggttggtc 13921 tctttctcct gacctcatga tccgcccgcc tccggcctcc caaagtgttg ggattacaga 13981 cgtgagccac cgtgcccggc tatttttgta tttttagtag agacggggtt tcaccatatt 14041 ggccaggctg gtctcaaact cctgaccttg tgatccaccc gccttggcct cccaaaatgt 14101 
tgggattaca ggcgtgagcc accgtgcccg gccggtgcaa ttttcttttt tctttttttt 14161 ttttttttga gatggagtct cgctctgtcg cccaagctgg agtgccgtgg cgtgatctcg 14221 gttcactgca atatccacct cctaggatca cgccattctc ctgcctcagc ctcccaagca 14281 gctgggacta caggtgccca ccacaacgcc tggctaattt ttttgtattt ttagtggaga 14341 tggggtttca cggtgttagt caggatggtc tccatctcct gaccttgtga tccacctgcc 14401 ttggcctccc aaagtactgg gattacaggc atgagccacc gcgcctggcc ggctcaagca 14461 attttcttgt ctcaacctcc caagtagctt caagctggga ttacaggtat gtgccactat 14521 gcctggcttt gtattttttg tggagatagg ggtcttgcta tgttgcccag gctggtcttg 14581 aactcagcct caagtgattc tcctgcctca gctcccaaag tgctggagct gggattacaa 14641 gcatgagcaa ccattcctgg ctttaaggtg gttcttgatg tgaggtagac tatggccagt 14701 ccagcagagt ggggggccct ccagcttgaa agcacgctag agagatgaaa gcacagagca 14761 gtggtggggt tttggggatt tgaaaatcag caaacttccc tcctctccca catttgggag 14821 aataaagatt caggagatag agtagacagg tggagatgcg agtgtccaat gagtgcagat 14881 gtgttagcca taggcagtgc aggccttcag atgcctgcta aggtttgggg cagactggag 14941 aacagactgg aagcaccctc tgccaacaga tggctttgag aaagcaggga ctttcaaaga 15001 ggctgggcca cagatggcct gctttttttc cccccaggct cacctgtcag aatagttcct 15061 cctcctttct tttgaggagt aggtgagggg acagagcggg gtactctaat ggtactgtgg 15121 ggaagcatga gggaagcatg agactgagaa caggagcccc taccccaaac tccctataga 15181 ctgatgtggg aaatggtggg tatctggagg agcgggggta gttcaagtag ggaataatgg 15241 aaattagcct agaaagaaaa gcagctgggt gccgtggctc acaactgtaa tcccagcact 15301 ttgggaggac gaggcaggtg gattgcttga gcccagaagt tcaaaatcag cctgggcaac 15361 atggtgaaac cccatctcta caaaaaatac aaaaattagc tgagtgtggt ggcgtgcgcc 15421 tgtagtccca gctactcggg aggctgacgt gggaggatca cctaaggcca ggaggttgag 15481 gctgcagtga actgtgatcg cactactgca caccagcctg agtgacagag actccatctc 15541 aaaacaaaca aacaaacaaa caaaaaaaaa acaaaggaaa gctggggcca gatttgtgga 15601 ctctgtgtgc ttacaggggc agcaaatgct gtctgtacat tctcctcctt agtgttcaag 15661 cagcactggg ctgttgcctc ctctagtcct gtaggggtaa agatgaccat catgtctact 15721 tcaataaagg gttttatggg tggacaacat tgttctcccc cattcctcag 
tagcggacct 15781 cttctcagcc tggttgcact ccttgctctt atgcatggca ttagagccac acccagtgtg 15841 acagccatac ccaatatgcc accaacgtca ggatcctctg ggcctgcacc atcaactatg 15901 gtagccatga actgtatgtg tccatttaca ttaagtacaa ttaaataaaa ttagaaattc 15961 agtttctttc tcactagcca gccacacttc tttttctttt atttatttat ttaatttttt 16021 ttgagacgga gtcttgctct gtcacccagg ctggagtgca gtggcacagt cttggctcac 16081 tgcaagctct gcctcacggg ttcacgccat tctcctgcct cagccttccc agtagctggg 16141 accacaggca cctgccacca agcccggcta attttttgta gttttagtag agatggtgtt 16201 tcaccgtgtt agcgaggatg gtctcgatct cctgaccttg tgatctgcct gcctcggcct 16261 cccaaagtgc tgggattaca ggcgtgagcc actgcgccca gcctatttat ttttttttga 16321 gaaggagtct cactctgtca cccaggctgg agtgcagtgg tgtgatcttg gctcactgca 16381 tcctccgcct cccgggttca agtgattctc caacctgagc ttcccgggta gctgggatta 16441 caggcacccg ccaccacgcc tggctaattt ttgtattttt agtagggacg gggtttcacc 16501 atgttcgttg gccaggctgg tcttgaactc ctgatctcag gtgatcctcc tgcctcagac 16561 tcccaaagta ctgggattac aggcatgagc caccacgccc agccttttct tttatttttt 16621 tagagacagt ttggctctat cacccgggct ggagtgcagt ggtgctatca tagctcactg 16681 cagcctcaaa acttctaggc ttaagtgatc ttcccacttc agcctcccag gtagctagga 16741 ctacaggcta atgccaccac acgtggctaa tgtcttaatt ttttgtagag acagggtctc 16801 acaaaaagac tcctggcttc cagtgatccc cctgcttcag cctccaaagt gctgggatta 16861 caactagcca catttcaagt gtgttcagta gccacatgtg gctactggtt accataaagg 16921 atagcacagg gggataggac atttctgtca ttgcagagca ttctatgggg caccctaata 16981 taggctaaaa gtggcccagt agaaagaaga attcataagt tatttttaaa acaaatagac 17041 tgcttatttg caagttaaac agtctttcag cactgaagaa agatactgca ttagcacttg 17101 ataggatgct gagatgatta aagatgtagt cccagccggg cgcggtggct cacgcctgga 17161 atccgagcac tgtgggaggc caaggtgggc ggatcacctg aggtcaggag ttcgagagca 17221 gcctggccaa catggtgaaa ccccgtctct actaaaaata caaaaattag ccgggtgtgg 17281 tggcgcgcgc ctataatccc agctactcgg gaggctgagg caagagaatc gcttgaacct 17341 gggaggcgga ggttgcagtg agccaagatc gtgccactaa tccagcctgg gtgacagagt 17401 gagactctgt taccaaaaat aattaaaaaa 
aaaaaaaaag acataatccc cagtcctcca 17461 ggtgttttct ggtgggggag atagacctgt gcacaggtac ccacaaccca aggcagatgc 17521 gagtggtgct ttatgaaagg tacaaagaga gttattgcag aggtgactgg gaaagccttg 17581 tgaagggatt agaattctag tggggacagg taaagagagt tggttttgtg ggtactccag 17641 gtgccagaaa cagcccaagg aaaggtccag gtggaggcag gcagttgtgg atttgtgatg 17701 agggatcata agcagcttgg ggcataaggc tggagaagat gacactggtc ctgtgagtga 17761 caggttcaga agattagagt tcatttggtg ggcagtgtgc agcacttaaa ggttttgagg 17821 ccaacatcca gcatccaggc cactcataaa tccaagtgtc ctgtttttct tttctttctt 17881 tctttctttt tttttttttt tttgagatgg agttttgctc ttgttgccca ggctggagtg 17941 caatggcatg atcttggctg attgcaacct ctgcctcctg ggttcaagga attctccagc 18001 ctcagcctcc tgagtagctg ggattacagg cacacgccac catgcctggc taatattttg 18061 tatttttagt agagacggtg tttcaccatg ttggtcgggc tggtcttgaa ctcgtgacct 18121 caggtgattc acctgccttg gcctcccgaa gtgctgggat tacaggcgtg agccactgtg 18181 cccagccaag tgtcctgttt ttcttgggga gcaaagcttg atccagctca ttccagtctt 18241 tagtgctctt tccacttaag ggctatgaca ccatctggtg gctccttgcc tgctgtaagc 18301 tcttgtacag ccataatggg actctgcact tggtaggtgc aggtggagtc tcactctgtc 18361 acctaggctg gagtgcagtg gtgccatctc ggctcactgc aaccttcacc tcctggttca 18421 agcgattctc gtgcctcagc ctctcgagta tctgggacta caggcatgtg ccaccacggc 18481 cggctaattt tttgtatttt tagtagagac ggggtttcac catgttggcc aggctggtct 18541 caaactcctg acctcaggtg attcacctgt ctcagcctcc caaagtgctg tgattacagg 18601 cgtgagccac tgcacctggc cagaacaact gtttgttggg taggttttgg atggggtttc 18661 agtctggttc attagcacca cctggtggct gatgatgttt gtcacatcta cttcttaact 18721 ggatggttga tgtgtttctc tgtagaagcc atggtgcaag aaaaattaaa ttacttaagg 18781 acacattctt gcacattcca tgtgcataat aggtgcggaa tgagtgaatg cccttgaatt 18841 atttatattt tgaaagaaag caggcatatg cagatcaggt attttgccac attctttaaa 18901 gctttatatt ttgggaaagt ttaaatactt attaagagag gatataggga aagttaaata 18961 cttcatctct gcccttggga aattttcagg cttttgggag taatataaat cgaaatattt 19021 agtaataaaa ttagtaagct ggtagctctc agactagctg attatcagaa tcatctgggc 19081 agttttgcaa 
atgcaggctc tcaggccgct tcactagaga ttctattccg ggggtctggg 19141 gaaggaccca ggaacctgta ttttttagca tgctctcctg ttaattctgg atttgttggt 19201 gtgaacctga gctggaatca tcaggggtgg cttcctagag gagatgggcc tagaattggc 19261 atccctttgc ttggtggtag agtccccact gtgggccaag tgttatgctg ggggctgggc 19321 ctccgtggtg aatcacatgt ggttctgggc cactaggagc tcatggtcta atgtggaaag 19381 tttccaaaca agtagttaca gttcaatgag attggtgcca agaaaagtat atgcaaaatg 19441 agtccaggta ctgaagagtc agcacacgat ttccagggtg gagacatctg aacctctcgc 19501 aggtttttag ggagaggaag agcttgtaag agcacacaaa agggaaacaa catggcgcat 19561 ctagggaacc agaaatagtt tggtatggct ggaatgtgga gtaaatacag gaaagtagca 19621 cgggactgtg ctaggaaggc aaacagggac cagactgcac tctggcctga tagccaaggg 19681 gagtcatgac agtgttgtaa gcaggtgagc cccctgactg ctggcctcag gtgaagggtg 19741 gggagggctg ggcaggaagg gcagggcctc tactgaggtg ttgtgtctat agagaatgga 19801 aatgatgccg ggcatggtgg ctcatgcctt tcatcccagc aatctggaag gctgaggcag 19861 gcggattgct tgagcccagg agttcgagac cagcctgggc aacatggcga aatgttaatt 19921 tcacccatct atacaaaaaa taaaaaaatt agcatagctg gtggtgtatg cctgtagtcc 19981 caagcaggag aattgcttga gcctgggagg tggaggttgc agtgagccac gatctcccca 20041 ttgcactcca gtctgggcaa tgggagtgaa accttgtctc caaaaagtag agagagaaaa 20101 tagaaattga aatgaggtgc actgggcact gaggcaaggc tctggggttg gagcagttag 20161 aatggtgtgc ctttgcacat gtctcccttt ccaccaggac attctgctgt ggccgctgga 20221 ctccggagtc tgtgataggc ctctgccagg gaagatctct ggcagtgccc ctacacacca 20281 ctccctactg tgcctgccta cctagaagat gacgtttcta ggtggggacc aggatgccgt 20341 tccaccctca ctgcctgttg ggcccttctc cctgggtttg tagtctccct aacacctgtc 20401 cctatcccta caggatgtgt gcccctgtga tggtggagct ggagggggag acagatcctc 20461 tgctcattgc catgaaggaa ctcaagtaag tcactcaaat catggttcac ttctccaaat 20521 ctctgggagg catctgttgg gctctctgtt gcaccctttg ctcccacttg cctggctgga 20581 gaagttgatg cttattccta cagatcctta ggaacagctc accaggaagt aggaggaaga 20641 gagaagagga ggcagggaaa gtgggagggc actgatgaga catggacgtc ttctggctgt 20701 tggaatttcc ctgttcctgc tgcaggaggc cttaccaata ttctgtcatt cttccccctc 
20761 ttgtctgata taaaaacttt ctttttcttg taattgcttt cttacaaagt acactcatca 20821 gccccaagag gctgctccta agacttctct ttgcttgtgt tttgcacaca atctgttcct 20881 tctgcggcag agctggggtc accagccctc atgtacttgt gacttctccc ctgccacagg 20941 gcccgaaaga tccccatcat cattcgccgt tacctgccag atgggagcta tgaagactgg 21001 ggggtggacg agctcatcat caccgactga gctggagtca tcttcctgcc cttgccccat 21061 gcccaatttt cattctcact ttatatgtgt aaataataaa atattcaact ttccaacccc 21121 cttcccctct gcttatctgc aatgtcacca cctgttgctt ccccgttacc gccatgctgc 21181 gtggagcatg cacctattcc agtggccctg tgactgtcag ctccttaaga agcaccaggg 21241 gcccttagcc cctttggatc ccccacatcc ttccctccat ctccctgttc cccagagcaa 21301 aggctgctgc aggggagaca cctcagctgc cttccaagca gacagacaag tctttgtgcc 21361 cagggagctg gttgccacgg aaacccccaa tttcctttcc agtggggact ggctgcaggg 21421 gcttctccct tctcaggagt atcacagagc aggtctcatc aagccaccca ttgtttccta 21481 aggacctgct tcgagcctct tatcgtgggc tcggatcccc tttcaggagc agtgccccag 21541 caggaagcgt gggggtgtgc tgatctccca ccctccccag gcagagccct gctgggcaag 21601 tcagcagctg gagcaaggac cgagcacctg cctacccctg cccccatggc tctgtcccca 21661 ctcctcctca ggactctgcc cacacgctgt cctctgtggc ccacctcttt ggtgtgccac 21721 acccatgcct tcttcctagc ctctataaga tatcctttcc ctcctatttg gggctggtga 21781 tcccctgagg ccttggggac atggtgctgg ggtggtggtg ctcctgggtt ccaggtgtta 21841 cattaagtca gagctggagc tcccccttcc tctgcctgag gttctaatgg ggtcctcttc 21901 ctttctccac cctgcttacc caacctgagg taagaccagt cacactggct cctccctcct 21961 agagggggtc agggggaggg tgtatattga catgaacagg gatagagggt aaactggctc 22021 cctgaatatg ccagccttaa cctccattcc actgccagct ccccttcaaa gaggaggagc 22081 tgggcttccc taacctctgc aggaggcagg gcctccaggc ctaggtgcag cctggccctg 22141 ggatgggatg tggggagtga atggtgagga tctgcattgg tgggaggggt gtccgctgcc 22201 ctggagaagg gttaattcag ggagcagtgg acttcacacc cccatccacc ctcctccaag 22261 cctgtggaat cctttaatca agttgggtgc tgaaatttca gctctgaaat gccgcctttg 22321 tgctggcatc caggcagccg ccccagagtg cgggggtcac tttctcggcc ccatttctct 22381 aaatggtctc tttgttccct gctgggctgc tcagtatcag 
atgtgattaa agggagatgg 22441 ggtttgggct ggggaggagg ggaattgggg gttaaccctt aggggataga gtgcaaggga 22501 aatgggacag aaggggtgtt tttcctcctg tcttcctctt cccattctcc tcttttgggg 22561 agtcccctgc tattcagctg tctgggcctg gcttttgact tctcttgaat aaatgtcccg 22621 gtcaccacta ttggttctgt tttcttaggg tttggcatag tctcttgagt catttccaga 22681 aacacctgac ccttttttaa caagtggagc tcctgccttc ctgaggcctg tgcacacttg 22741 gaacctttgg tctaagggat gcacacaacc caggtcttct gacttccttg gggtcatttc 22801 ccctccttcc acatcctcag tgatttccca ctcactctca tgagatgtgg ggtcttggag 22861 gtgatagtgt catcttccgg atgtgccccc tgctccccac tgtgccagct ccatatccct 22921 catcccacgg cgagcaggag tagtgaacag accacaaatg gacaaaagtc ctgcctttca 22981 atgaagggga aagggtgggt tgtaaagatt ataaacacat aaggttagta ctgattgatc 23041 catgacaaaa ttctagaata ggtccctcca tgtttaaaag tgcttatata ataaagagga 23101 gatcgggcac tagcataggg tgtctaagaa ccaatattgc caaataagtt tttttttcga 23161 tggttactgg tccagtaggt tgagagaatg ccagagatca gtgaatcttg attttgacaa 23221 ggcatcagta gtctccatgc cactgtgaca gagaagtgtg tgcttcatgg tgatgtaatt 23281 agacagtttt agaactgatt agacagcctc aagagtggag ataatagctg ggcgtggtgg 23341 ctcacacctg taatcccagc actttgggag gctgaggttg gcagatcacg aggtcaggag 23401 atcaagacca tcctggctaa cacagtgaaa ccccgtctgt actaaaaaat acaaaaaatt 23461 agccaggtgt ggtggcgggc gcctgtaatc ccagctactc gggaggctga ggcgggagaa 23521 tggcgtgaac ccgggaggcg gagcttgcag tgagccgaga tcccgctact gcactccagc 23581 ctgggcgaca gagagagact ccgtctcaaa aaaaaaaaaa agagtggaga taataggcca 23641 ggtgcggtgg cttacgcctg taatcccagc actttaggag gccgaggcaa gcggatcatg 23701 aggtcaggag attgagacca tcctagacaa catggtgaaa ccctgtctct acttaaaaaa 23761 aaaaaaaaaa aaattagctg ggcgtggtgg tgcacacctg tagtcccagc tactcgggag 23821 gctgaggcag gggaagcact tgaacccagg agacagaagt tgcagggagc caagattgca 23881 acattgcact ccagcctggc aacagagcaa gactccatct aaaaaaaaaa aaaaaaaaaa 23941 aaaaaagagt ggagataata ggactgacag tggagagagn cgtttaatgc tgtacacagg 24001 tgtccctctt catttcgacc tttattagtg acttgaatga ggctattcaa gtttgcagat 24061 acataaagct ctaaaggaga 
gctaagttct aaaaagttta atacaggtgg ggaaatgggt 24121 caaacccaac aaggtgagat ctgttaaaga tcagtgtaaa gtcctgcttt tatctaaaac 24181 aaaacaaaaa cagcatacat acaatttgga tatgactttg caatggatat tgtctaaaag 24241 atcggggttt cggttaccat ccagcttcac gggaaccagc agttttgggg aggttgcttg 24301 aagagcttcc atgatgcagg tcctagatgg agggattttg gccaccctgg ggactgggct 24361 acttcttggt gccctgcctc tcccctggtc ctgactaggg gctggagacc gcagatggaa 24421 ggaagggagc tgaggatgcc cagtctggag cagtggaaga tgcattgctc cagatacaaa 24481 gcagaagcag gggctgcaac cacaggcagg tggaggggat gatggttaaa aatgcaagcc 24541 ttggagtcag agctgggttc agattctggc accagcactt cttagcatca gcctcgcaaa 24601 cttcagtttc cccatctgga aaattgagat aacgatcgga cctacccctt cactggcctg 24661 aatgacataa aacccttggt ggagcactta gtaaatggca gctggatata agctgttatg 24721 agctgggtat gaggaagaac ttccaaccat cggagttgtt catcagtgga gcaagatgcc 24781 aaagagtctt tggagttgtg tgcttgtgag gtggagtggt cctggcctct gggtgcagtt 24841 tggacttctg ctgaggaaag agctcttagg gagtagacgc ttaaggagta gactgtctga 24901 catccttggt ctggacttgg acatgggcac ctgctgtgaa ggcttggcta gaacctgacg 24961 tgggctgctg tgtgtgtgct gggggaggtg gcaagtgggg ggatgcagca acacagggcc 25021 tatccacaca tcctgaagga caggcctctg actgcaggca gcacggtgtc accctcagcc 25081 cacctcctcc tcttcagata cctgtctctt gaattctttg caaactcccg tctgctgagt 25141 gctgggtgag cagaggggct gcttgcaagg cagtgagttc tagctgagcc agttaagagt 25201 gagtggcccc cctgcagtgt ggggcatagc cgccgggctg gggagaagtt cccatctgag 25261 gcttcagtcc ctgagctcag atggacccca ctgggatgcc agctgtccta acactctcca 25321 ttgttcccac tttcttagca gggagagaga cagaaagaga gagaggagag tgggaggggg 25381 gagagagaga gaaaggagag ggggaggggg gagagagaga gagaaagaga ggagagnngg 25441 gaggggggag agagagagga ggggggaggg gaagggggtg gggaaagagg gagggaggtg 25501 ggagagagag acctttgatg atctatcctt tgcattctac tcctagctac ttccagaaag 25561 catttgagga agcaccagta cagggataag agatgaagag acaggccagg tcaggctcac 25621 caagcaggta accggaacct ttaattttat tatgtggaat gcttaatgca gagttaatag 25681 gggctagagt ggctaggaga ggggactact gagataaata acaggagaca gtaatgagtt 25741 
acatgtggat ttggggggct gcagaacagg aaaatagggg cagatatagc tacatacttt 25801 attcaaatac cactgggagg caaggggtaa gaggcaaggg cagaatgtac aggtcaggat 25861 tctagtgtct tcagcctgcc gacagcaacc tgggtgctgg gccctgtgtc tcctggagcc 25921 aagggttagg gcctgagcag ctgtcactct ctgggaagcc cagggagtgg ggctgactga 25981 gcaggggcca tagagcctag taagggaaga gggagctggc aaggagccca gggagggagg 26041 ctctgtgaat tgtctctgat tcctttcctg gagcccctgc ctcgtcacct cctgggatgc 26101 gtctcaaggt catggaggtt gtagtggagg aggactgggg gctgtttctc agacaaagaa 26161 tgaggttatt ggcacagaat tggatcaggg ggcctctgtg ccaactcctt cctgccttca 26221 gaaagggagg gggcattgaa gggatgagaa ctccactaag tccctcgaac cccccactcc 26281 ccaatgaggc tcctcaaagc tactctcagc ccctgagtgg gagacacggc cagctctggg 26341 cacagtctct gggtggccta gcccttaggc actgtggcat aggtggcatg ggcatgtcag 26401 accctcacta tctgtgagag agaaaacagg agtcatagga gtggtaaggc ctccgatgcc 26461 accaaccctg gtgcttcccc tccccgagga gggtgagcag gcccttctca ggtcctggga 26521 tagagggtca ttcctggggg aaggtgcagc ccctcatctt tcagtgtggg tgcaacagtc 26581 aacctccttc tcctctgtcc agcctgttct cctggggctt tgctgctgga gcctggatgg 26641 ggcgggtggg tcatcagggc agtgagccag acagaaagcc ccccgacctg tcagcctctt 26701 cagcctcctc agcctcctcc actgccacca ccaggcctga ggtgggcaag gaacagggca 26761 cacaggctgg gggcaggggc tgggcggggg gtggtggcga cagggccccc tttagggccg 26821 ggacagtgtc gtatatactg gctgctccca gtgtgtgggg ctgtgggact ggggccctga 26881 ggggctgggg tcagagatgg ccgtgtagag gggccgctgc gagggcccca tataggagaa 26941 gcccgagtag agnccagaga cctggcccga gtgcccataa tagggtcctg agggctgatg 27001 gtcagagtag tcaaactggg ggcgggagat ggaggggaag gctgagccat agtggggcag 27061 gctgagggag gtgtaggcga tctgtgaggt ggatggctgg tcggtgtagt gtgggggccc 27121 ctggggcccc gcggtctctg tcttcacctg ggctttggca tccacaccag gtggtgagac 27181 cgtgggcaga gccacgcctg gtggcttgga gatccaggcg gagtgtccac tggccacggc 27241 cagggcactg cccagcccat agccggctgc tgagtagctg ctcacgtggc ctgggtgccc 27301 attgggcggc aggtactggt ccaactcagc cacatcaaag gtctccatgt tggacattac 27361 ctcgtggctg atctcaccaa tgtccacgtt gccgaagtcg atgtgaggct 
tcccgccctc 27421 ccccatggag cgcccgtccc gcttcgggtc tgccttgccc gactgcagct ctgtcttcgg 27481 ggtggttgga ggggtgggtg ggccatggct ctggcctggg tagaagggag acagagagag 27541 agagcgcaag ggggaagcag gttagaggca ggtgggcgca cgtgaacttc catggttcac 27601 cttcaggcag cgggtgcctc tgggaaaacc ttgtaagttt cacattttgg cagcatgagt 27661 cgggtttttt tttgtttttt ttttttgaga tggagtttcg ctcttgttgc ccagactgga 27721 gtgcagtggc gccatctcgg ctcactgcaa cctctgcctc ctgggttcaa atgattctcc 27781 tgcctcagcc tcctgagtag ctggaattac aagcgcccac cacaatgccc agctaatttg 27841 tgtattttta gtagagacgg ggtttcacca tgttggccag ggttgtcttg aactcctgac 27901 ctcaggtgat ccacctgcct cggcctccca aagtgctggg attaccatca tgagccactg 27961 cgccccacag catttgttta aatactggtg tcttagattg atctctagtt gtttatgaaa 28021 ttctcaaatc ttagggatgg gtcctgggga ggcctgcctc attctgaaaa aacctttcct 28081 gtggaaagtt cttcctctca tcagcttctc tgctgtccag gtggtctggc ccagcctttg 28141 cgctctgcca gccccgcttc cacacccatg cctactgtct tccttgctga gcctcgctcg 28201 cccaggctcc tgagtgctct gtttcacctt tgctggttgc ctgggcctgc tccctgtcct 28261 ggcctcatgt tgtgtccttt ccaacccttc cctcctcctg agcccaccct gcttcccagc 28321 ccactggcaa gcccttagcc ccgtcatatg ctggcctgtg ttgccctgtt gttctttgag 28381 ctgccttcct ccatgggagg aacttagaaa actggtcctt gttcctttgt ctgttcactt 28441 tcctattcac atattaattt agcaaactct tagggagcat tacaaaatgc caggtactgt 28501 gccaggtgct ggggatacta atgtaaataa aatggatccc tgttcccaaa agctcatagt 28561 ctagtcatag atccatagat ccacatgcac ccagagttgg aagtgctgtg gtgacaatat 28621 gcaaggactt gggagatcag aggaggagtc cacccgctgc ctcagggatt tggggaaggt 28681 ttgatcaggg aaggatgtct gagtcgggtg gggccttgaa gggtgagaaa aagtctgtgt 28741 ccagtttcat tctgaggtgg ttgctctgta tgtggtccag ggtctcctga tgtgtgtgtg 28801 tgtgtccctg aatctttgaa ggtgaagagt atgtacacgt gtgtgggagg gatgactgcc 28861 tggcatccca atggagacag aacatgtcca tggtcttaag agactaggag ctccccaagg 28921 acaggcctgt gcctccttca tcagaccaga ggcagcgcca agtcctgcat tggccaaggg 28981 ctctgtactt caagagacct gcctaggctt ttgaatcctg gttctgtgtc tggacaaatc 29041 cccagtcttc tcttaactag tttccttatc 
agtaaaatag agattgtacc aagcagccca 29101 ggcattgcct ctttctggaa gccttccttg acaagagtct ggtgggcagt ggggtgttct 29161 tggagacctg actggatggg atagacggcc atgccacctc tggatctcac acagacttct 29221 tgtgcacttg tttctctctc tatatatata ctatatatat agtaagagat aggaggctgg 29281 gcgtgctggg tcatgcctgt aatcccagca ctttgggagg ctgaggtcag gaattcgaga 29341 ccagcctggc caacacggtg aaaaaccatc tctactaaaa atacaataat tagctgggtg 29401 tggtggcgtg catctgtaat cccagctact taggaggctg aggcacaaga atcgcttaaa 29461 cccaggaggc agacgttgca gtgaaccaag atggcaccgc tgcactgcag cctgggcaac 29521 agagcaagac tccatctcaa acaaaaacaa aaacaaaaaa aagacagagt cttactacat 29581 tgcccatgct ggtctcaaac tcatgggctc aggtgatcct cccacctcag cctcctgtgt 29641 agctgggact gcgggcgcat tccaccatgc ctagctttgt ttcttcctgt acttaattca 29701 ctcgttttca tccctgctcc tctctctaga ctatgggccc ctgaggccag ggactgtagg 29761 cttctctggt tagtacccct ggtgtccact ccagtgtttg acacaaagta gggctcagag 29821 cgtgtcctct gaaggcagag ctctgctcac tggagcactt cccatgtggc agacctgtgt 29881 gtgtcacttt ctgtacatta ggaacacagg caaggcagct gtggtgccaa gcagatgccc 29941 gggagccaaa ctggggaagt tcctgtttgt ttctcatcag taaaatgaga gtaacttgcc 30001 acagctgaga tctgagccca ggcagtttgg gtacctcatc acattgtttg caaggattga 30061 gttagtatac ataaggcaac tagcacctag cacagtgcct ggaacagagt agctgctcaa 30121 ttaaggttag ctgttattat cattaatgaa aattaaacaa gatagcttat gtctggccca 30181 tccatggcca gacacggtat cacatgccat gtctcctctg tcagaatggc tggtgggatt 30241 tgggaggccg aggcgggcag attgcctgag ctcaggagtt tgagactggg cctgggcaat 30301 atagtgaaac cctgtctctg ctaaaataca caaaatgagc cagacgtggc ggtgtgcacc 30361 tgtagtccca gctacttggg aggctgagac agcagaattg cttgaacccg ggaggaaaag 30421 gttgcagtgc accgagatca tgccactgca gtctagcctg ggcgacagag tgagactctg 30481 tctccagaaa aaaaaaaaaa agaatggctg gtgggaaaga ggcagtgatt tcctctcctc 30541 catttgctct tccatttctc aaaaccatcc tgtcttctcc ccctttactt gtaactgccc 30601 cagccacctc tcactccatg acctccacag agtctttttt tttttctttc tctttgagac 30661 agagtctcgc tctgtcgccc aggctggagt gcagtggcgc gatcccggct cactgcaagc 30721 tccgcctcct 
gggttcacgc catcctcctg cctcagcctc ctgagtagct gggactacag 30781 gtgctcgcca ccacgcccag ctaatttttt ttatttttag tagagatggg atttcaccat 30841 gttagccagg atggtctcga tctcctgacc tcgtgatctg ccctcctcgg cctcccaaag 30901 tgctgggatt acaggtgtga gccaccacac ccggcggcct ccacagtctt taactggccc 30961 tctttgccca gtaggatcag ccctggccag agccatctca cccagccact tcccctggca 31021 ccggaacccc catggagctc ccctctgggc ccctgcagct ctgctggggc aactgcacag 31081 cctggcacgc tctccctaga gtccagggtc tcattgccat ccagccatct cctgtctcca 31141 ctgactggcc ttgccccacc ctcagctctg tcatcagcac tcacctgagg ggtgctcggg 31201 gttcccatct gacatggggg agccctctcc tgggtgccgg tggtccaagt gggcgctctt 31261 gtagtgggcc tggatggcgg cggtcccacc ttgctcgggc tccccaccgg ggcactccgc 31321 ctcgccctgg gcgggcttcc cgttcttccg ccgcctgggc tggtacttgt agtccgggtg 31381 gtctttcttg tgctgcatac ggagccgctc agcctcctcg atgaaggggc gcttgtcact 31441 ttcgttcagc agcctggggt gtggtgggag gcggagagga cagcagaggg gctggcgtga 31501 atgccagagc actccaggtt ggcctccctc tgagtgtcca tcttggaaga tgtgaggccc 31561 tgggatgggg cacccagagg acaggacccg gggtgggggc tgtgccctat gatttgtgga 31621 ctgagagatg tgaggcccaa ggaataacag cctcagaggg ctgcccccaa atctttcatg 31681 ggctggaggg tgagagaggg gtcagcccgc tgctcctggc agcccctgag gaggagttag 31741 aagtaggagt agaggtttca aggggcagat gccagttcaa ggaaaggaaa aactttctca 31801 tcatcaaagc tgtctgcaag tggaattggc tacttcagtg ggggtgagct ccctgggact 31861 gagggtgggc aatagaagca gcatggctgg gggagactca gtctctgcta cttactagct 31921 gtgtgtgtga ccttgagcaa ttacttctct gagcctgagt ttcttcacct ataaaaggat 31981 gataacattg gcttaacttg ggtgtgggat tgcccagaat aatggatgtg agatgccttg 32041 caccagggcc tgggaagcgg gctgagcacc ttgaggaaga ggtgcttttc tttctttctt 32101 ttttttttct gtgagggaat ctcactctgt cacccatgct agagctcagt ggtatgatct 32161 cagctcactg caacctctgc ctcccaggtt caagtgattc ttgtgcctca gcctcccaag 32221 tagctgggat tacaagcatg cgccaccacg cccggttaat ttttgtattt ttagtagaga 32281 tggggtttca ccatgttgac caggctggtc tccaactcct gaccttagtt gatccgcctg 32341 ccttggcctt ccaaagtgct gggattacag gcatgagcca ttgtgcccag cccaagaggt 
32401 acttttctat ttttaatttt ttatttattt agagacagag tctcactctg tcgcccaggc 32461 tggagtgccg tgatgcggtc tcagctcact gcaatctcgg cttcctgagt tcaagcgatt 32521 ctcctgcctc agcctcccgg gtagctggga ctacaggcac gcaccaccat gcctggctaa 32581 tttttgtatt tttagtagag acgaggtttc accatattgg ccaggctggt ctccaactcc 32641 tgacctcagg tgatccaccc gccttggcct cccaaagtgc tgagattata ggcatgagcc 32701 accgtgcccg gccccaagag gtgcttttct aatgctgctt gcgggggtgg ttggacagga 32761 ggatctaggg ttggactctt aactctgggt cctgattgtg tgaattccag tgtggtggga 32821 ctgagggggc agatgtggct cctctttacc atgggtctgg gtggggggta ctgcctcctg 32881 tgggggatcc tatatccttg tccagaaaat agcagagagt agaggaagct agcctctcct 32941 cagaccctgg aacagtccct gaaaccaaca catggagaga agaagccctg aagccagcgc 33001 ccttgccacc gactccagcc ttaggccagg gttggagggt ggagggcatt tagggtttgg 33061 ctgccccagg gtgaggaggg caggatgggg taccttacag agggggtcga ggggaaagtc 33121 cagattctag aatcccagac cttggtactg agaatccatg agtctttgtg ggagcagcct 33181 tgtctgtcag tccagagggc ttgtgtcttc agctgctcca cccctcacac cccagtgagt 33241 gccctcttcc cccagtcagc agaaggccga gccctccctc cccaggctgg gcgggggtgg 33301 tcatgccagg cacctcctag ccggcagcag cattcctctg cggcttagcc tccctgtcta 33361 ctccccccac cccggccctg agcccagccc tagccccagc tttctcagag ggtccctcca 33421 gccgagactc tgtcagagtc agtgtacaca cttaggacaa agatgccgga gcagggcctt 33481 gagagtcagg gagtgggatg tgggcaccct ctctgacggc tgggaccagg ggagggggtg 33541 gagcaggcct tgagggtgtg actggggaga acccacagag gagagagctg ctccgccagc 33601 agtggacccc aacagagggg cttctgggat ggctggggac atgggacaca gagctcactc 33661 tttcacctat ccttcccttg ccccctgccc agctggcagc cagcatcagc aagaaagtga 33721 caggggccag aagagagaga ggattccaac atcggattcc gctgctacct cctaagctag 33781 ttcccagcat tagcctaacc ctgacagaaa tctaagcctg gggccactgc catccccgac 33841 cccagctccc tcctccacca gctgtgccgg acagctgatt ccacctccac cccaacccca 33901 agcccagcct gcccaccatg taaacccaat ctccagcacc agtccccact caacattacc 33961 agccccaaac ccccaacttc gcctccagct ctaactccaa acccagtcct catctgcacc 34021 ttcctgtcag ccctctttct ccctggtctc ccaggtttgg 
aatgggggca agaagaggta 34081 ggctgattcc tcacccaaac tggaaacccc aaccggggac ctcccacagt ggggcactga 34141 ttgctgctac ctgtgtcctg ctaggccagg ggcataagaa gactacttgg ggaaggggcc 34201 tgtggcactg ggaacagagg ctgggtgacc cccaccacac aggagggact cttgcctgtg 34261 tccgcctttc cagtttcttg gctctggtct ttccaggcag atcagcaacc cacaggcacc 34321 accctgtgat cagcctcaac tagcccctcc ccagatgggc atcaaaggtt agaatgactg 34381 ttggggctga cacggcccct taggacctgg gctccagtgt gtctgatccc acactccctt 34441 ggggcactcc acacagtctg tgtgtatctt tctctttttg gttttaagac acctgttttc 34501 cagctctgcc tacctaacca ccccacttct tgacatccca ggagtcacag cttggggagg 34561 tttcaggaga agcccctgga gcaattggcc agcccagccc aaatccaaat gtctctcagg 34621 acccccctcc agggtccagg gagccaccct ctgttggcat tgcctttgat ccttgccatg 34681 gttctggccc catcctccca ttgggattgg gtcctcagca agtagtgcca gttttcctga 34741 ctgccacacc tcaggctggc ccctgtgcac acagcttgag tgatggccaa gaataagtcc 34801 tcctgaaggg agtccaagcc cagggttctg ggctggggga acctggggac agggacaatg 34861 gaggagccaa tcttaaaccc ttggcaatgg gagatggagg gaaaatgagg tccgtccttg 34921 ctttgactct cgttgggctc gtactgtggt cttctcatgg tccccagagt ctgccctatc 34981 tgtatgtctc tcctggttgt ctgtttctca cttgctcttt gtcccctggc actaaagata 35041 ctggcctggg ccaagcccca cccagatctg gatggacctt gggtggagga gctagctgtg 35101 acctctgctc ttgggcagaa ctgggctgtc tgggtccaca cagcgggctg gtcgctgagc 35161 cctgggcctg ccctgggtga gcagggcact gagggctgag aaggggcagg ggaaggggca 35221 ccaggtcttc agcaaacaca tggcaagaac aggccagggg gcgggatgtg aatcttttgg 35281 tctcttggag atcctcctgc accttctacc actctaagcc tctagaccca atctctaccc 35341 aactgtggct ggcaaggggg agggcagcgg aagagagccg ggctcctctg acaaagccct 35401 aattttctgg aacccagatg ctcctcactc ttgttctcat cttcttgctt ccccccaacc 35461 tcctgaccac tcctcaaagt gtaggtgatc caccactgtc accagggtca ggggaagtcc 35521 tctggacaga gagtatggga gtcctgggtt ctggccctgg ctctgctacc aacttctatg 35581 tgacctttgt gattcctttg cctttctggg cctgtctctt ggtcagtcta gcgagaatca 35641 ggtgactgct accggctcac atctcctgga gtccctggtc tctttcagag ctgctctggc 35701 actcccgggt gggagtgggg 
gactgtcttg ttccacacat ccccaggagg cctggctgaa 35761 ccccagacct gagtcgaggg tgggataatt gctccctgcg ggggatgttt ttcctagagg 35821 tgtgggctgg tccagggaac ctggactctg agggcccctg ggcaggaagg aggccaaggt 35881 ttccttctgg ccagagtgct cagctccagc tctggggcag tggggaagag gttgctcaac 35941 gttggggtgg ggccctggca gggaagatgg cgccgaactg ggagagggag gaaagagatc 36001 agcttggtga ctctggggca gggcaggagg gggagcccct tactgtgtgt cctctcccta 36061 ggaggatgaa gaagccccca actcaaagga gggtggcgtc agaaagtccc agggaagagc 36121 cctaggatag gatgaggtcc cgcagcagca gatttgtccc gcagttcttc atccctcact 36181 cttccaaaga ctctcatctg aattccgttt tctttaaacc tcccctgagt tctccatctg 36241 actgcctcag tttccctgaa tctccagcca accctttggt tctttccaag cgtttccctc 36301 gtatcctcct gaccacagca gctggtggca tagtgtgcgt atctgtgggc aggagttagt 36361 gaaggcctcc ttccctccct ccaaagcccc aggccccacg gtctgccacc cgcaaaattc 36421 caaccacgag ggggcgcctt cctgcccgcc gccccgtagg gagacccggc ctaggccgct 36481 ggagttccga gttccagggt ccttgtgatg gaagggcggg cgcggccccc acacctggtc 36541 ttccagccct atccaaggag gactgccaga cagtcccgct ctgaggtgca ggaggccggg 36601 ccgcctcggc taccctgaat ccacccgaag ctagagggcc cgagcccggg gggcggtcgg 36661 gtgctcacct ccagagcttg cccagcgtct tgctgagctc agcgttgtgc aggtgcgggt 36721 actggtccgc gagcttcctg cgcgctgcct gagcccacac catgaaggcg ttcatgggcc 36781 gcttgacgtg cggcttgctt ttgctggcgc cgttgacgcg cacgggcatg ggcaccagcg 36841 tccagtcgta gccgctgagc acctggctga cggcctcgcg gatgcacacg gggaacttgt 36901 catcgtccgc ctcgccgtcc tgctgctcct tcttgacctt gcccagctcg cctggccccg 36961 ggctggctcg caggcccgat ccgccgccgc cgccgtcggg ccctagcgag ggcgcgctcc 37021 ccggggacag gcagcggggc tcctccgagc ccacggggct cagctccacc tccgataggt 37081 cctgctctcc gccatgtcgc ccccggccgc cgccgccgcc gcctcggccg cctcccccgg 37141 gccagccgcc ggggtcctcg caaagagtcc aacgcccacc tggatggaag gagggcgcga 37201 tggagcggcc gcgcgcgcag ccccgagggc ggcccgagac aggacgtggg cacagccccg 37261 aggtggcggc ccttcctgct ccagctaaac ccatctggcc cctggggccc acgcacatgc 37321 cagactctag gtgggtgcgt cccgcctctg tgtcttccca ccacctggtc ccaaccgtct 37381 
cttggtgtgg gtggcttttt ttttagcctc cttttttggg tggaggtttg ttgatgataa 37441 ggaagaaaaa aacggagcct ttattttgct ctaatcccgg cccctggaat ttcccacctt 37501 ttctctcacc tttccttctt ccctcagcct gccccaacgg ggtttagagg agagccaggt 37561 ggggggcaag ggcggggtgt cgcctctctc cacctcacag cagggtccca ggcctgggcg 37621 ggccagggag gagcaggctg cgcactcacc tcctcggacc tctcctccag tctggggctc 37681 gtccttagga agtggaaaac cgtgtcccaa ggtgtgcggt ccagctcggg gctgggaggt 37741 gacgctggtg ggctgggagg gaggggcggn gggtcctgag cctcaggtca cggnctcgtt 37801 aggacggagc ctgaatcctt atcaccaacc accctggccg gacaggccga ggctgactga 37861 gcgactgagc ctganccnct ncggaggnct ggngaaggaa ggagggggag aggggggagg 37921 ggagaggagg gagggaaggg cagaaggtgg agcctccagg ctttccttca tccccaacac 37981 tcagggctcc tccacccgga gcccaggccc ccccacccag ctccctgccc tactgcccag 38041 gcccatcttg ctgaggctta tcttgggctt agtctgggag gtggagggga ggttaggccg 38101 aggaaggaag agatgtaggc tggagggtca gaacctgttg gatggaagaa aattgggccc 38161 aaccaggact cctaacgtct gcctcaggtg gcaggatggg cacttggcag gggactaggg 38221 agagacgtga aggcggtgag ggtcctgggg gcccgtaagg tggctggccc tagctgcttc 38281 gtctgcgaaa tcctgttgag tgcctagtgt gtgccaggca ctgatgccca gctcctcacc 38341 ctgcctccta ccttttccca ggcagccctt tctgccccat cctgcacagt aagagagact 38401 tctcagggat ccctctgaga agtgcagagc cccagcttga cccctggtcc cttctctgca 38461 gacacccagc acacgcgcgt tctggtctgg ttcaaggtgg aaacacaaga ctaaagctgc 38521 caggaggcca aggtgggaca ggttgggggc cctctctcag taggtcttag aggcattttg 38581 ggttgctacc tactgggacc cagctccttc cacactgggc tcaccctgtc ttgggagagg 38641 gtaggggagt cctagtcctg attgtcacga agtcctctac tcccccaccc caacaagggg 38701 tggggacttg ccctgcctct tccctcctct gcccacctct ccctccctcc ccttttcttt 38761 cctgcccatc ctccgttgac acctttttct ctctctttcc ttggcctctc ctccaaaggt 38821 cgcagtgctg ttctcttaac tgagggcagc cccgccgggg tgttgaggcc gaggtggggt 38881 gtatgtgggt tatgtaatgg agctggggtc gcattggcaa ccatgtgacg ttctgagctc 38941 ctcaacagca gggcaaagcc caagaccacc ccaaggggtc ctgtctggag tgggctgcaa 39001 agggtctggg gaggcaaggc ctcatgcttg ggaaagtcca aaggggtgta 
gggggtggga 39061 aatgggacac aatgaggtag tcacacgagg cagaggcatg ttgggggtgg ggggagcaga 39121 cactggaaca cacacacaca cacacacaca cacacatatg ctcacacaca caaccagcca 39181 gacaatcggg ccagtgtgtc tggggcctcg gagcccagtg aattaggagt ttgataaggg 39241 ttttctctnn ctctcgcttg ctggccttgg acaatctccc cccagttttc ctcgacggca 39301 cccccggcgc tgccaaccct cctccccccg cctcccttct cttccctgtg cccagactct 39361 tgttcggggc cttgaaacag tgagctgtct ttgttcgaaa tacaggtgaa cggcaatcat 39421 gtgattaaca cacacacaca taaaaagctt tttaacagcg cccgctcggc ctcagaaccg 39481 cccggagagg agccgccctg gatggacaga gggacgaggg acgagcatct gccgtcgtgt 39541 cccggctgcc ctgcagtcgc ctccaacacc cgctgctgcc cgcccgctgc ctgcctgcct 39601 ggcatctctc tctcccggtg ggtccccact catccttcca cccctcagtt cctcctcttt 39661 gagggaagga cctccgccgc gctctctgct gtgtctgtga ctggcctgag acccacctgg 39721 gacaggaggg gtggttgtgg aaggaaagag cccagagtta ggaggtggaa tgtggggtcc 39781 cacagctgcc ccctctctaa ctccctgctt cctcctttgc cctccttggt ccagctgtgc 39841 caggatgggg gtgggggtag cagtgggcac gctctgtctg caggaagcca cgctagacag 39901 aaggggccac tccctctctc tctcccttgt ccccgcacag acactgcctt cctctcaggg 39961 ctgtgctggg ctttttgctg agcagtgttg ggtgaggcag gtgggcctgc agggcgggtg 40021 ggaagggctg tcagatgacc agtgctgtgt gtgtgtggtt gggggccctg ctgcgaacac 40081 atccctgccc accatgctgg caactgg // LOCUS AF008303 4002 bp DNA PRI 13-JAN-1998 DEFINITION Homo sapiens prepro placental TGF-beta gene, complete cds. ACCESSION AF008303 NID g2580619 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4002) AUTHORS Lawton,L.N., de Fatima Bonaldo,M., Jelenc,P.C., Qiu,L., Baumes,S.A., Marcelino,R.A., de Jesus,G.M., Wellington,S., Knowles,J.A., Warburton,D., Brown,S. and Soares,M.B. TITLE Identification of a novel member of the TGF-beta superfamily highly expressed in human placenta JOURNAL Gene 203 (1), 17-26 (1997) MEDLINE 98085971 REFERENCE 2 (bases 1 to 4002) AUTHORS Lawton,L.N. and Soares,M.B. 
TITLE Direct Submission JOURNAL Submitted (16-JUN-1997) Psychiatry, Columbia University, 722 West 168th St., Unit 41, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..4002 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region complement(32..207) /rpt_type=dispersed /rpt_family="Alu" GC_signal 847..852 GC_signal 897..902 GC_signal 911..916 TATA_signal 938..943 mRNA join(967..1275,3096..3986) /product="prepro placental TGF-beta" exon 967..1275 /number=1 CDS join(1038..1275,3096..3745) /note="PTGFB" /codon_start=1 /product="prepro placental TGF-beta" /db_xref="PID:g2580620" /translation="MLLVLLVLSWLPHGGALSLAEASRASFPGPSELHSEDSRFRELR KRYEDLLTRLRANQSWEDSNTDLVPAPAVRILTPEVRLGSGGHLHLRISRAALPEGLP EASRLHRALFRLSPTASRSWDVTRPLRRQLSLARPQAPALHLRLSPPPSQSDQLLAES SSARPQLELHLRPQAARGRRRARARNGDHCPLGPGRCCRLHTVRASLEDLGWADWVLS PREVQVTMCIGACPSQFRAANMHAQIKTSLHRLKPDTVPAPCCVPASYNPMVLIQKTD TGVSLQTYDDLLAKDCHCI" intron 1276..3095 /number=1 repeat_region 2077..2316 /rpt_type=dispersed /rpt_family="Alu" repeat_region complement(2632..2941) /rpt_type=dispersed /rpt_family="Alu" exon 3096..3986 /number=2 mat_peptide 3407..3742 /note="putative" /product="placental TGB-beta" polyA_signal 3964..3969 BASE COUNT 891 a 1107 c 1114 g 890 t ORIGIN 1 tctagaactc ttgacgtcag atgatccacg tgcctcggcc tcccaaagtg ctgggattac 61 aggcgagagc caccgtgccc ggcgcagaat tctttttttt agagatgagg tattgccatc 121 ttgcccagac ttgtctcgaa ctcctgggct caaacaatcc acccacctcg gcctcccaaa 181 gtgctgagat tactgacata agccaccatg cctggccccc agaattatga atcctgtgag 241 gatggcttca aggtgagcgc tgagccagac aaaaggatgg ggtttgggag caccctgctt 301 agactggaaa gataatgttg gagaagactt cctggaagag gggctttttg cgtagagttt 361 tgaagaatga gtaggagttc tccagaggag gatgagtaac tgcaataaca cccagtttat 421 caagtgcctc ctatgtgtct ggccctgtgc tttacccctc atttgaccac ctctccagtg 481 agagtctcag tccttttttt cctggtgagg aaacaggcat ggcagagagg catgacacat 541 caaggttgcc cttcctggct ccatctagcc cgttctcctc tgcttccttt gtttttcacc 601 atctttagcc tttgacccca accaaaaaga gaagagagga 
aatcccatgg gcatagacag 661 ccacctctta aactcttgtc tggaattttt cacatagtaa caatgtcttt ttttcctcca 721 aaaagactcc caggctggaa tggtgtcctc atatcgagga agaggatact gaggcccaga 781 aatgtgccct agctttacta ggagcgcccc cacctaaaga tcctccccct aaatacaccc 841 ccagaccccg cccagctgtg gtcattggag tgtttactct gcaggcaggg ggaggagggc 901 gggactgagc aggcggagac ggacaaagtc cggggactat aaaggccggt ccggcagcat 961 ctggtcagtc ccagctcaga gccgcaacct gcacagccat gcccgggcaa gaactcagga 1021 cggtgaatgg ctctcagatg ctcctggtgt tgctggtgct ctcgtggctg ccgcatgggg 1081 gcgccctgtc tctggccgag gcgagccgcg caagtttccc gggaccctca gagttgcact 1141 ccgaagactc cagattccga gagttgcgga aacgctacga ggacctgcta accaggctgc 1201 gggccaacca gagctgggaa gattcgaaca ccgacctcgt cccggcccct gcagtccgga 1261 tactcacgcc agaaggtaag tgaaatctta gagatcccct cccacccccc aagcagcccc 1321 catatctaat cagggattcc tcatcttgaa aagcccagac ctacctttga gcctcagttg 1381 ccccatctgt gccctgggta ggaatatcct ggatcccctt gggtctgatg gggtagccga 1441 tgcctgattt gcacccacaa cgtgggaggt tataacctgt ccccaagttg caaatgggga 1501 aactgaggta cagggatgcc agggaccctc ctgaatgtgc acagcacctg ggaaacccgg 1561 ggacggggtc gggcgtgcgg tgtgggcatg gtccctgacc tgggctgggg ccaggatcca 1621 agagaaacaa tgggggtgtg gtgcatggat agaagtttgt gatgggcaga gccagctgga 1681 gctggttctg aatctgatga aatcgcttgt caaggcactg ctaaagtcac atgcaacgag 1741 cgacttgtta ggggaggagg agagtatttt ccccatctct gtgccctcca tggaggctcc 1801 aggggtgcag ggcactggat gaggtctggg ctgtggtggg acccaggagg gggcaggcag 1861 aggggcacac tcttagcccg ccctaaaccc atctgtgccg tatgactccg gcaaacctct 1921 cacccttgct aagcctagca actaggttaa ggcaccctca aaccccagtc tccccagtct 1981 ccctaacact tgtaccagcc tgaggggctg tgattcctag aggctcagta gtctggaatg 2041 tgatgggttt ggcactcttc atttctcggg attacacctg taatcccagc actttaggag 2101 gccaaggtgg gagcatcgct ttaggccaga acattgcagc ctgggcagta tagcgagaat 2161 ccatctctac caaaaatgtt taaaaaaaat tagccagaca tgatggtgtg tgcctgtagt 2221 ctcagttatt caggaggctg aggcgggagg agcacttgaa tccaggagat tgaggctgca 2281 ctgagccatg agtgccccac tgcactccag cctgggtaac agcaagacca 
taataaataa 2341 ataaataagt gaagcaagta agtttttgac tgccagaaga aaagcagagc catggttgga 2401 atctcaggct gcctgaatat cagatataca ttaattcctt caagtattta ctaagcacct 2461 attgtgtgcc aggcacagtg tcaaccaagc taagattcta agtagggaat atgaatgata 2521 cacaaggaaa taaaataggg tgcttcccta ttttaggcag gctggaagga agaggaggag 2581 ccagccatac tgagattcag gagattcaag gggagaaatg cccttttttt tttttttttt 2641 tttttttttg agagggagtc ttcctctgtt gcccaggctg gagtgcagtg gcacgatctt 2701 ggctcactgc aacctccacc gccccgattc aagccattct tctgcctcag cctcccgagt 2761 agctgggatc acaggcgcca gccaccatgc ccagctaatt tttggtagtt ttagtagaag 2821 cgggcgtttc accatattgg ccacatatcg gccaggctgg tcttgaactc ctgacctcaa 2881 gtgatccacc tgcatcggcc tcccaaagtg ctgggattac aggcgtgagc caccgcgccc 2941 gcacggagaa atgctcttgg taggggaaat aacttgtgca aaggcgccgc cgccgtggtg 3001 agatccagct tggcgtgtta ggggaaatgg gcaccctcgt tcctggaaaa cggtaggcct 3061 gtgggtgacc agcttcccta tctgtcttcc cacagtgcgg ctgggatccg gcggccacct 3121 gcacctgcgt atctctcggg ccgcccttcc cgaggggctc cccgaggcct cccgccttca 3181 ccgggctctg ttccggctgt ccccgacggc gtcaaggtcg tgggacgtga cacgaccgct 3241 gcggcgtcag ctcagccttg caagacccca ggcgcccgcg ctgcacctgc gactgtcgcc 3301 gccgccgtcg cagtcggacc aactgctggc agaatcttcg tccgcacggc cccagctgga 3361 gttgcacttg cggccgcaag ccgccagggg gcgccgcaga gcgcgtgcgc gcaacgggga 3421 ccactgtccg ctcgggcccg ggcgttgctg ccgtctgcac acggtccgcg cgtcgctgga 3481 agacctgggc tgggccgatt gggtgctgtc gccacgggag gtgcaagtga ccatgtgcat 3541 cggcgcgtgc ccgagccagt tccgggcggc aaacatgcac gcgcagatca agacgagcct 3601 gcaccgcctg aagcccgaca cggtgccagc gccctgctgc gtgcccgcca gctacaatcc 3661 catggtgctc attcaaaaga ccgacaccgg ggtgtcgctc cagacctatg atgacttgtt 3721 agccaaagac tgccactgca tatgagcagt cctggtcctt ccactgtgca cctgcgcggg 3781 ggacgcgacc tcagttgtcc tgccctgtgg aatgcgctca aggttcctga gacacccgat 3841 tcctgcccaa acagctgcat ttatataagt ctgttattta ttattaattt attggggtga 3901 ccttcttggg gactcggggg ctggtctgat ggaactgtgt atttatttaa aactctggtg 3961 ataaaaataa agctgtctga actgttccct gtggtgtcta ga // LOCUS AF009356 
3653 bp DNA PRI 03-FEB-1998 DEFINITION Homo sapiens regulator of G-protein signaling-16 (RGS16) gene, complete cds. ACCESSION AF009356 NID g2828564 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3653) AUTHORS Snow,B.E., Antonio,L., Amgen EST Program and Siderovski,D.P. TITLE Cloning of a retinally abundant regulator of G-protein signaling (RGS-r/RGS16), genomic structure, and chromosomal localization of the human gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 3653) AUTHORS Snow,B.E., Antonio,L., Amgen EST Program and Siderovski,D.P. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) Quantitative Biology, Amgen Research Institute, 620 University Avenue, Toronto, ON M5G 2C1, Canada FEATURES Location/Qualifiers source 1..3653 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q25-q31" exon <1..91 /gene="RGS16" /number=1 mRNA join(<1..91,967..1077,1822..1886,2174..2340,3394..>3653) /gene="RGS16" gene <1..>3653 /gene="RGS16" CDS join(48..91,967..1077,1822..1886,2174..2340,3394..3615) /gene="RGS16" /function="GTPase-activating protein (GAP) for G-alpha proteins" /note="RGS-r/RGS16" /codon_start=1 /product="regulator of G-protein signaling-16" /db_xref="PID:g2828565" /translation="MCRTLAAFPTTCLERAKEFKTRLGIFLHKSELGCDTGSTGKFEW GSKHSKENRNFSEDVLGWRESFDLLLSSKNGVAAFHAFLKTEFSEENLEFWLACEEFK KIRSATKLASRAHQIFEEFICSEAPKEVNIDHETRELTRMNLQTATATCFDAAQGKTR TLMEKDSYPRFLKSPAYRDLAAQASAASATLSSCSLDEPSHT" exon 967..1077 /gene="RGS16" /number=2 exon 1822..1886 /gene="RGS16" /number=3 exon 2174..2340 /gene="RGS16" /number=4 exon 3394..>3653 /gene="RGS16" /number=5 BASE COUNT 877 a 876 c 1010 g 887 t 3 others ORIGIN 1 agcctgccac catcctgcct actacgtgct gccctgcgcc cgcagccatg tgccgcaccc 61 tggccgcctt ccccaccacc tgcctggaga ggtaagggga ggttgtccaa ctgcttgggg 121 gaagtcccac caactcttag ggagtggagc ggcaggcctg gggacctggg tgggcgcggg 
181 gatggggcgc tagaggcttg agtagagagg tagaggcgca cttgatgcca cgtgctcaga 241 ttccaggagc agggagaaag caaggggaag ggaagcaagg gcccgcggct tgctgccatc 301 cccatcctct gacaaactgc agtcgtgctc aggtgacctg ctgaagaccc ggctgcggga 361 aggccacgtt gacatttcca ggcagcactg agcctgccca tttcttaagc atttgttttt 421 cattcctcag accagagcta ggggcattgc ctcctctgcc cctggaagac agggagattt 481 tttttttttg gtggggtgtg tgggggtggg ggctcccact gttgagggtg tgtccatatt 541 ccagagagct actcaacagc ttccagtcct tactggctgg gaaggaaaaa gtaaagatga 601 aggcaggaat cagtttccca cacagttcag aaatacttct gaaaagaatg ctgagttctt 661 ttctgaatca acaggcacga agagagatca gtgtgacttt agggaaaagc ccatatatgc 721 agcgtgtgta ttacccagaa agtgtaagac ctaccacagc atagctcaaa cattgctttt 781 gttgctttgg ggttgatgat gctgataaga gaagcactct tggcctgggg acagccaaga 841 gtttccagga actcttgagg ctaaagtttg ggtggcacca cccagagctc ttctgggacc 901 cataaacctc tccacactgg ctgagcaaat gagagaatgc tttcatgttc ctgatttgtt 961 tttcagagcc aaagagttca agacacgtct ggggatcttt cttcacaaat cagagctggg 1021 ctgcgatact gggagtactg gcaagttcga gtggggcagt aaacacagca aagagaagta 1081 agtgttgctg accagctcag ggctccgaga gggaagggag tgggaaggga gtcatgagga 1141 gggaaataca gggagaagcc aggacacgct aagtggacct tggcaacacc tccctagaga 1201 ttttagtggg acttgttgct ggtctgttat gactacttgg ctcatgggtg agtgggtagg 1261 tggctccagg gctccaagtc catccacgga tgagggcaca cagggcatga atgcctagtt 1321 cccagactct cccacctcgg gattcattgc acgtgcatcc agggctagga cagaatccaa 1381 gaccacttga agttttctag tatgcacctt gcctagaaac tctttcatat tggctatcaa 1441 ctgtaccaaa ctctggagtt atgtgtagat gtaatctcta tttatgagga ccttatttcc 1501 ttgggaaggt gtacaaaaca aatctttgag agtcagggga attgaaataa tttttggaca 1561 gagacgagac aggcttagag atattgactg aggactaaag agtccctggc tttgatttct 1621 ggttgctgaa ttgtgctcta gaaagcaata ctgcttaccc taagtccaac taagagtttg 1681 aggtgtcagc ttcagcaatc accaagcaag cctttctgtg aattttatgc agttttccat 1741 cagagttctg gctagaattt gtactatgct ttggtttgtt gatttttttt cttactcttt 1801 ttttttcttt ttgtctttta gtagaaactt ctcagaagat gtgctggggt ggagagagtc 1861 gttcgacctg 
ctgctgagca gtaaaagtga gtaggagcct tttgtgtctg tctctgggtg 1921 gcacatgtgt actcatgtgc tttggagcca gccccaggac tgctgagagc ctgggcagaa 1981 taatagtagt agtaataata ataataatga tcccctagtg tgccagcctc tattcaacat 2041 ataaaggcag gtcattgttc aagaaaatta tgttctaaaa atagagccat taattgcaat 2101 tgctttcctt tcttcttgta ttccctgctg tccctccagc tgacctattt cctcttcttt 2161 cccacaaccc cagatggagt ggctgccttc cacgctttcc tgaagacaga gttcagtgag 2221 gagaacctgg agttctggct ggcctgtgag gagttcaaga agatccgatc agctaccaag 2281 ctggcctcca gggcacacca gatctttgag gagttcattt gcagtgaggc ccctaaagag 2341 gttagagccc ccatatgccc agtgaatcct cacgaccctg cctgctcctt cccctctgag 2401 aatcagggct agaggggagg aagggctagg tgtagagata gaagggcctt acttggcaca 2461 ggggtgactg ggcttctgtg accagagact aggaaagcca ggcccagtgc cactcctggg 2521 tgcttaaccc taagactcat gatggggcag gatgtgttgg caggtttttc ggcttctgtc 2581 catacccacc tttataaacc catccgtttt tagaggctca aaacacttag caactgggga 2641 gacaaggtca gcacagggac cttatgtaac ctggctcagt ccccaggctg acctgcctga 2701 ggcctgctgg tctgtggtct gtgatttttt tgtctacctt ctctcctggc cgctaaacgc 2761 atggcagctg catttgctaa agaacacagc gccacctggt ggaannntca gagcctgcct 2821 gtcacagtca cggatgagca aatggacctg tgtagagcag ggcaagagca ggagatgagg 2881 tcaggcatga ggaggatggc caggtgtggg cttggggagc ctcgtgggcc cagcaagccc 2941 tttggctctt cttctgaaag acatgaggag acaacagagg agctaccaaa ctgagttggg 3001 tttttaaagg ctcactgtag cgactgggta gagaacaaac cacaggggtg gggggtaagg 3061 tagcagagag accaggtagg aggccttggt gcaaatccca ggagagctga tagtgacttg 3121 aagcagggtt gctgtagaag aaagggtgaa aggggattgc gtttgtatac attcttgata 3181 gatcctcatg ttaattctga gagattggag gacagacact ggatcagatg tgggtacaag 3241 cccagagctg aaaagctagt gatcaactca ggagccccca cttcctgatg cagcatccga 3301 tgctctttcc aatgcccagc ctgccgttgg ccctccctca gccatgccca ccctcttccc 3361 caactcaccc tgtgtgtgtc ctccttcccg caggtcaaca ttgaccatga gacccgcgag 3421 ctgacgagga tgaacctgca gactgccaca gccacatgct ttgatgcggc tcaggggaag 3481 acacgtaccc tgatggagaa ggactcctac ccacgcttcc tgaagtcgcc tgcttaccgg 3541 gacctggctg cccaagcctc 
agccgcctct gccactctgt ccagctgcag cctggacgag 3601 ccctcacaca cctgagtctc cacggcagtg aggaagccag ccgggaagag agg // LOCUS AF010238 14543 bp DNA PRI 29-JUL-1997 DEFINITION Homo sapiens von Hippel-Lindau tumor suppressor (VHL) gene, complete cds. ACCESSION AF010238 U19763 U68055 U68176 U49746 NID g2282063 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14543) AUTHORS Latif,F., Tory,K., Gnarra,J., Yao,M., Duh,F.-M., Orcutt,M.L., Stackhouse,T., Kuzmin,I., Modi,W., Geil,L., Schmidt,L., Zhou,F., Li,H., Wei,M.-H., Chen,F., Glenn,G., Choyke,P., Walther,M.M., Weng,Y., Duan,D.-S.R., Dean,M., Glavac,D.,., Richards,F.M.,., Crossey,P.A., Ferguson-Smith,M.A., Le Paslier,D., Chumakov,I., Cohen,D., Chinault,A.C.R., Maher,E.R., Linehan,W.M., Zbar,B. and Lerman,M.I. TITLE Identification of the von Hippel-Lindau disease tumor suppressor gene JOURNAL Science 260 (5112), 1317-1320 (1993) MEDLINE 93262488 REFERENCE 2 (bases 1 to 892) AUTHORS Kuzmin,I., Duh,F.M., Latif,F., Geil,L., Zbar,B. and Lerman,M.I. TITLE Identification of the promoter of the human von Hippel-Lindau disease tumor suppressor gene JOURNAL Oncogene 10 (11), 2185-2194 (1995) MEDLINE 95303481 REFERENCE 3 (bases 8843 to 14543) AUTHORS Renbaum,P., Duh,F.M., Latif,F., Zbar,B., Lerman,M.I. and Kuzmin,I. TITLE Isolation and characterization of the full-length 3' untranslated region of the human von Hippel-Lindau tumor suppressor gene JOURNAL Hum. Genet. 98 (6), 666-671 (1996) MEDLINE 97085563 REFERENCE 4 (bases 1 to 14543) AUTHORS Duh,F.-M., Zbar,B. and Lerman,M.I. TITLE Genomic sequence and structure of the human von Hippel-Lindau disease tumor suppressor gene JOURNAL Unpublished REFERENCE 5 (bases 1 to 14543) AUTHORS Duh,F.-M. 
TITLE Direct Submission JOURNAL Submitted (25-JUN-1997) IRSP, SAIC Frederick, NCI-FCRDC, P.O.Box B, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..14543 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p25-26" repeat_region 141..438 /rpt_family="Alu" promoter 565..670 /gene="VHL" gene 565..12993 /gene="VHL" 5'UTR 643..714 /gene="VHL" /note="5'UTR" mRNA join(643..1054,5392..5514,8667..12993) /gene="VHL" exon 643..1054 /gene="VHL" /number=1 CDS join(715..1054,5392..5514,8667..8845) /gene="VHL" /note="elogin binding protein; VHL protein; pVHL" /codon_start=1 /product="von Hippel-Lindau tumor suppressor" /db_xref="PID:g2282064" /translation="MPRRAENWDEAEVGAEEAGVEEYGPEEDGGEESGAEESGPEESG PEELGAEEEMEAGRPRPVLRSVNSREPSQVIFCNRSPRVVLPVWLNFDGEPQPYPTLP PGTGRRIHSYRGHLWLFRDAGTHDGLLVNQTELFVPSLNVDGQPIFANITLPVYTLKE RCLQVVRSLVKPENYRRLDIVRSLYEDLEDHPNVQKDLERLTQERIAHQRMGD" intron 1055..5391 /gene="VHL" /number=1 repeat_region 1227..1440 /rpt_family="Alu" repeat_region 2038..2185 /rpt_family="Alu" repeat_region 3851..4074 /rpt_family="Alu" repeat_region 4299..4588 /rpt_family="Alu" repeat_region 5062..5351 /rpt_family="Alu" exon 5392..5514 /gene="VHL" /number=2 intron 5515..8666 /gene="VHL" /number=1 repeat_region 5901..6187 /rpt_family="Alu" repeat_region 6254..6558 /rpt_family="Alu" repeat_region 6824..6981 /rpt_family="Alu" repeat_region 7205..7493 /rpt_family="Alu" repeat_region 7569..7620 /rpt_family="Alu" repeat_region 7678..7867 /rpt_family="Alu" repeat_region 8003..8292 /rpt_family="Alu" exon 8667..12993 /gene="VHL" /number=3 3'UTR 8846..12993 /gene="VHL" /note="3'UTR" polyA_signal 12354..12359 /gene="VHL" polyA_signal 12408..12413 /gene="VHL" repeat_region 12864..13154 /rpt_family="Alu" polyA_signal 12988..12993 /gene="VHL" BASE COUNT 3570 a 3314 c 3560 g 4098 t 1 others ORIGIN 1 gaattcagtt agttgacttt ttgtacttta taagcgtgat gattgggtgt tcccgtgtga 61 gatgcgccac cctcgaacct tgttacgacg tcggcacatt gcgcgtctga catgaagaaa 121 aaaaaaattc agttagtcca 
ccaggcacag tggctaaggc ctgtaatccc tgcactttga 181 gaggccaagg caggaggatc acttgaaccc aggagttcga gaccagccta ggcaacatag 241 cgagactccg tttcaaacaa caaataaaaa taattagtcg ggcatggtgg tgcgcgccta 301 cagtaccaac tactcgggag gctgaggcga gacgatcgct tgagccaggg aggtcaaggc 361 tgcagtgagc caagctcgcg ccactgcact ccagcccggg cgacagagtg agaccctgtc 421 tccaaaaaaa aaaaaaaaca ccaaacctta gaggggtgaa aaaaaatttt atagtggaaa 481 tacagtaacg agttggccta gcctcgcctc cgttacaaca gcctacggtg ctggaggatc 541 cttctgcgca cgcgcacagc ctccggccgg ctatttccgc gagcgcgttc catcctctac 601 cgagcgcgcg cgaagactac ggaggtcgac tcgggagcgc gcacgcagct ccgccccgcg 661 tccgacccgc ggatcccgcg gcgtccggcc cgggtggtct ggatcgcgga gggaatgccc 721 cggagggcgg agaactggga cgaggccgag gtaggcgcgg aggaggcagg cgtcgaagag 781 tacggccctg aagaagacgg cggggaggag tcgggcgccg aggagtccgg cccggaagag 841 tccggcccgg aggaactggg cgccgaggag gagatggagg ccgggcggcc gcggcccgtg 901 ctgcgctcgg tgaactcgcg cgagccctcc caggtcatct tctgcaatcg cagtccgcgc 961 gtcgtgctgc ccgtatggct caacttcgac ggcgagccgc agccctaccc aacgctgccg 1021 cctggcacgg gccgccgcat ccacagctac cgaggtacgg gcccggcgct taggcccgac 1081 ccagcaggga cgatagcacg gtctgaagcc cctctaccgc cccggggtcc attttgcaga 1141 cggggaactg aggccccttg aggcaggaca catccagggt gacgctgctc gtaagcgtca 1201 gagcattctt tttttttttt tttttttttt tctgagacgg agtctcgctc tgtcgcccag 1261 gctggagtgc agtggcgcga tctcgactca ctgcagcctc cgcctcccgg gttcaagcga 1321 ttctcctgcc tcagcctcct gagtagctgg gattacaggc gtgcgccacc gcgcccggct 1381 gatttttata tttttagtag agacggggtt tcaccatgtt ggtcaggctg gtctcgaact 1441 aactgacctc gtgatccgcc cgcctcggcc ttcccaaagt gctgggctta tgggcatgag 1501 cctccgcgcc cggcccagag cattctttat aaggccgaat agtttgcatt tgaaggtggc 1561 tcccccccag tcccccaccc cacgtgtatt ttcccctcaa agaaaagctg catccttaac 1621 accccatctg ttcagtcctc atgactccag tgggccagtt ctgcgtagtc cctgccctcg 1681 tggagaacac attcctcctg gggagactga cagatgcaaa gacaggaaca agccagggtc 1741 atgttggcgc cggaagagcc gaccgtgtgt ggcgtgggaa attgacttac ctgcctgctg 1801 ggagatggag gggttgcggt tgtgtggttt cagttaagga 
gcacttcccg gagaaggaag 1861 agagcaggat ggagtaggaa ctagccaacc ctaggtaaga ggttctagac atgcgtgcgt 1921 tgagacctgg agtcttggga gaggatgctt aaaaggtgat tttaccccta ggaatatggg 1981 ggcactgaaa tttttttttt tttttgagac gggagtcttg ctctgcaagc tggagtgcag 2041 tggcccacgc tagaatgcag tggcgcgatt gcggctcatt gcaacatctg ccacctgggg 2101 tcaagtggtt ctcttgcctc agcctcccga ggagcgggga ttactggcgt gcgccaccac 2161 tcctggctaa tttttttttt agtagagacg ggggtttcgt cattttggct aggctggtct 2221 cgaactcctg acctcagatg atccacccgc cttggcctcc caaagtgctg agattacagg 2281 tgtaagccac tgcgcccagc cctttgaaag tttttcagta tttatgtata tatatttttg 2341 agttggagtc tggatctgtc gccagactgg agtgctgttg cacaatcttg gctcactgca 2401 atctccgact ccctggttca agcgattctc ctgcctcagc ctcccaagta gctgggatta 2461 caggcacgca ctaccattcc cagctaattt tttgtattct tagtagagac agggtttcac 2521 catgttggcc aggatggtct ccatctcctg cgctcgtgat ctgcctgctt cggcctccca 2581 aagtgctggg attacaggcg tgctgggatt tcggccacaa cgtccgaccg aaagttttta 2641 agcagggaca tgacattgtc agatttatat actgaaaagc tcacccaggt tgccaagtgg 2701 tttggagggg aaagactgct gtcgaggaag cagttaggta gttgtgaaaa cccaggtgag 2761 gaataactag gccttaccta aggtgcaggc agtaatcttg ccatggcctt taagcagaga 2821 agtagtccta gtgtcactta atctttacaa aggatttttg caaggatccc gatctttctt 2881 ccttgagggt ggtgtactta atacactttt acaccagact tctaatgtta gatgaagaac 2941 acagtatttc cagggatcaa catttctgta ggctcctatt ttatatagga aattgtatga 3001 attttgtatt ttactccaaa atttttctgt gcccgattta atataaaaat ttactgagcc 3061 tgggtgcagt ggctcatgcc tgtgatctca gcactttggg aggctgaggc aggaggattt 3121 cttgagccca ggagctggag accagcctgt gcaacatagt gagaccctgt ctgtatttaa 3181 aaaaaaaaaa aaattcttga aaaattagca gggcacattc ctgcctttag tcccagctac 3241 ttgggaagct gaggcaggaa gatcacccga acccaggagt tggaggctac agtgaactat 3301 gatggtgcct ctgaatagtt gctgtactct agtctggtaa cacagcaaga ccctgtctct 3361 ctatcttgtc tttttttttt tttttttttg agacaggatg tcctgctgtt gcctgggctg 3421 gagtgtggag gctggagttt ggtggcatga tcacggctca ttgcaccctt aacctgggct 3481 caagcagtcc tcccagagct tcagcttccc aaagtagctg ggactatagg 
catgctccac 3541 tatgtctggc taatttcttt ttttattttt atttttagta gagatgaggt cttgctatgt 3601 tgcccaggct gagacctcat ctctttttta ttttttttaa attttttatt atactttaag 3661 ttctagggta catgtgcaca acgtgcaggt ttgttacata tgtaaacatg tgccatgttg 3721 gtgtgctgca cccatagaga cctcgtctta aaaaaagaaa ataacattac ttttgaaggt 3781 acttaatgca ctgaattgta catttaaaaa tggttaaaat ggtaaatgtt tgaggcaggt 3841 agatccacct gaggtcagga gttcaagacc agcctgacca atatggtgga accctgtctc 3901 tgctaaaaat acaaaagtta gctgcatgtg gtggcatgcg cctgtttagt cccagctact 3961 cgggaggctg aggcaggaga attgcttgaa cctgggaggc ggaggtggca gtgagccaag 4021 atcacaccac tacactccag cctgggcaac agagcaagac tccatctcta aataataaat 4081 aaaatggtaa cttttatgta tattttacca aaatttaaaa aattacaagt ttacatttct 4141 taaaatttcc catcaaatct gtaagtaaat ttatgccccg aggaacaagt gctatattta 4201 ttctgagaca acctcctcct tccttaaaca gaatcttagg gctggaggat tgcttcctgc 4261 cctcttttgt ttgtgatgta tgcattttga aaattctggg ccgggcgcag tggctcactc 4321 ctgtaatccc agcactttgg gaggccgagg cgggcggatc acaagatcag gagattgaga 4381 ccatcctggc taatacggtg aaaccctgtc tctactgaaa ataacaaaaa attagccggg 4441 cgtggtggcg ggcacctgtg gtcccagcta tttgggaggc tgtggcagga gaatggcata 4501 aacctgggag gcggagcttg cagtgagccg agatcgtgcc actgcactcc agcctgggcg 4561 acagagcgag actgcatctc aaaaaaaaaa aaaaagaaaa agaaaagaaa attctggtat 4621 aatttacata cagtaaaatg cacagatctt agggtttgat gagttttctc tcgacatgtt 4681 tttgcacttc cttgtttttg agaagcactg atttgagaag tcagtggctt tttctcttta 4741 gtttgcaggg tttgctgtga tttgtaatca cgtacttgac ctaggcttcc cttttccacc 4801 atggtagcag aaagggcatg ggatttagag ctttaagtac gcgctctttg cttactgtct 4861 tataccttga gcatgtcact tctcctctca gacttgtttt ctcatctgta aatggatctg 4921 ttgtgaggac tgactgagat aatgttacta gaagggcttt gtataatatt taagcagagt 4981 gagaggtaag ctttttgtgt aggtcagggg aaatggagaa aataggtgcc ctgactcaga 5041 ccagtctggc tctttttttt tttttttttg gagacggagt cttgctctgt cacccaggct 5101 ggagtgcagt ggcgcgatct cggctcacgg caagctccac ctcctgggtt cacaccattc 5161 tcctgcctca gcctcccgag tagctgggac tacaggcgct cgccacacac ctggctaatt 
5221 tttttgtatt tttagtagag acgaggtttc accacgttag ccaggacggt cttgatctcc 5281 tgacctcatg atccgcctgc ctcggcctcc caaagtgctg ggattacagg tgtgggccac 5341 cgtgcccagc caccggtgtg gctctttaac aacctttgct tgtcccgata ggtcaccttt 5401 ggctcttcag agatgcaggg acacacgatg ggcttctggt taaccaaact gaattatttg 5461 tgccatctct caatgttgac ggacagccta tttttgccaa tatcacactg ccaggtactg 5521 acgttttact ttttaaaaag ataaggttgt tgtggtaagt acaggataga ccacttgaaa 5581 aattaagccc agttctcaat ttttgcctga tgtcaggcac ggtatccaat ctttttgtat 5641 cctattctct accataaata aaatggaagt gatgtatttg tacgttatgt gttaaaggtg 5701 ttatggtgtc tcaaaagcac tttgggctct taagagacaa gcgaaattaa agtatcatat 5761 cataggttag ttttgtagaa ttgtagaatt acgaatgcct tttgtttccc tggccaaatt 5821 gtgccctgga gttccaggag aacaatgtgt agagcatgag atattttggc ttatttgttg 5881 ctgacttcta atttttttta tttttttgag acagaatctc gctgtgttag ctaggctgga 5941 gtgcagtggc gcaatctcgg ctcactgcaa cctccgccta ctgggttcca gcgattctct 6001 tgtctcagcc tcccgagtag ctgggactac aggcgtgtgc cacccactct gataattttt 6061 tgtattttta gtagagacgg ggtttcaccg tgttagccag gatggtctcc atctcctgac 6121 ctcatgatct gcccgcctac gcctcccaaa gtgctgggat tacaggcatc agccacagca 6181 cctggcctat gtattttcaa tttaacacaa tcaagctcac agtgccaatc agaggtgttt 6241 tttttttttt taatttttat ttttagagag tctcacagtg tcatccaggc tggagtgcag 6301 tggtgcgatt tcagctcact gcaacctctg catcctgggt tcaagtgatt ctcctgcctc 6361 agcctcctgg gtagctgggg ttataggtgc ctgtcaccac acctggctaa tttttgtatt 6421 tttagtagag atgaggtttc accatgttgg ccaggctgat cttgaactcc tgacctcagg 6481 tgatctgccc acctcagcct cccaaagtgc tgggattaca ggcgtgagcc actgcgtcca 6541 gcctgttttt tttttttttt aaatcattga agattggtat aatacttcac tatttgtttg 6601 aagctcaaat gattttatca gggtaaaccc taataaactg atgttcctgt gggtaaaaaa 6661 aacctcacta aagaccagca gtgtgtggtg gctcctgcct gtaatcatgc ctgtaattcc 6721 agcacttagg gaggctatgg cgggagggtc gcttgagacc aggagttctt gaccagcctg 6781 gacaacaaag tgagacccca gctccacaaa aaaatttttt tttaattacc tgggcatctt 6841 agcatatgcc tgtggtcaca gctatttggg aggcttaggt gggaggatcc cttgagccca 6901 
ggagtttgag gctgcagtga gccatgatca taccactgca ctccagccca ggtgacagag 6961 tgagatcctg tctcaaaaaa agaaaaaaaa aactcaaaaa ccccccaaat acatgggttt 7021 cataggatcc aaactactat gtgtgtatag atcctgtttt aaggaagtag atatataaaa 7081 atgagcattg ctaagttaaa tttggtaaat ttgccttata gaacaccctc gagtacgttt 7141 ccagtgagtg taaaatagga attgggatac ccaattcagt tgtactaaat tttctttttt 7201 tttttttttt ttgagacgga gtctcgctct gtcgcccagg ctggagtgca gtggcgggat 7261 ctcggctcac tgcaagctcc gcctcccggg ttcacgccat tctcctgcct cagcctccca 7321 agtagctggg actacaggcg cccgccacta cgcccggcta attttttgta tttttagtag 7381 agacggggtt tcaccgtttt agccgggatg gtctcgatct cctgacctcg tgatccgccc 7441 gcctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcctagattt 7501 tctaagtaca cattgttttg gttatgtgtt ttgtgactac caccccaaaa ctaataacca 7561 cctttttttt ttttttgaga cagagtctca ctgtgtcacc caggctggag tgcagtggcg 7621 tgtgatcttg gctcactgca acctctgcct ctcgggttca agtgattctc ctgcttcagc 7681 agctgggact acaggtgtgc accaccaagc ctggctaatt ttttgcattt tagtggagac 7741 gggggtttca ccatgttgac caggctgatc ttaaactctt gagctcaggc agtctgcctg 7801 cctcagcctc ctaaagtgct aggattacag gcgtgagcca ctgcgcccag cccaccgttt 7861 tatttgttca taattctgta gtccaggctg ggctcagcta ggcagttact ctgctggtgg 7921 tagtcgttgg tgtggctgcc ttttgctggc agctgggggc tgggcctgtc cctctttttt 7981 ttttttctct tttctttttt cttttttttc aagatagggt ctcactctgt cacccaggtt 8041 ggagtgcagt ggcatgatct tagctcactg caacctctgc ctccagggct caagtgatcc 8101 tcccacctca gcctccccag tcgctgggac cacaggcatg tgccaccatg cctggctaat 8161 tttttgtgta ttttgtagag acggggtttc gccatgttgc caggctggtc tcgaactcct 8221 gagctcaggc gatctactga cgttggcctc ccaaagtgtt gggatcacag gcatgaacca 8281 ccatgcctgg ccagggcctg ttcctcttta tgtggtctct ctagcagggt agctcagggc 8341 tttcaaaagt ataaaagcag aagtcagcag gcctttttaa ggcttcggcc tagaattgcc 8401 agtgtcgctt catcgacatt cagttagtta aagcaatcac aagcccagcc catttcaagg 8461 tgaaattact acagaggcat gaacaccatg aggtgtccat agggggccat cagcataaca 8521 cactgccaca tacatgcact cacttttttt ctttaaccta aagtgagatc catcagtagt 8581 acaggtagtt 
gttggcaaag cctcttgttc gttccttgta ctgagaccct agtctgtcac 8641 tgaggatttg gtttttgccc ttccagtgta tactctgaaa gagcgatgcc tccaggttgt 8701 ccggagccta gtcaagcctg agaattacag gagactggac atcgtcaggt cgctctacga 8761 agatctggaa gaccacccaa atgtgcagaa agacctggag cggctgacac aggagcgcat 8821 tgcacatcaa cggatgggag attgaagatt tctgttgaaa cttacactgt ttcatctcag 8881 cttttgatgg tactgatgag tcttgatcta gatacaggac tggttccttc cttagtttca 8941 aagtgtctca ttctcagagt aaaataggca ccattgctta aaagaaagtt aactgacttc 9001 actaggcatt gtgatgttta ggggcaaaca tcacaaaatg taatttaatg cctgcccatt 9061 agagaagtat ttatcaggag aaggtggtgg catttttgct tcctagtaag tcaggacagc 9121 ttgtatgtaa ggaggtttat ataagtaatt cagtgggaat tgcagcatat cgtttaattt 9181 taagaaggca ttggcatctg cttttaatgg atgtataata catccattct acatccgtag 9241 cggttggtga cttgtctgcc tcctgctttg ggaagactga ggcatccgtg aggcagggac 9301 aagtctttct cctctttgag accccagtgc ctgcacatca tgagccttca gtcagggttt 9361 gtcagaggaa caaaccaggg gacactttgt tagaaagtgc ttagaggttc tgcctctatt 9421 tttgttgggg ggtgggagag gggaccttaa aatgtgtaca gtgaacaaat gtcttaaagg 9481 gaatcatttt tgtaggaagc attttttata attttctaag tcgtgcactt tctcggtcca 9541 ctcttgttga agtgctgttt tattactgtt tctaaactag gattgacatt ctacagttgt 9601 gataatagca tttttgtaac ttgccatccg cacagaaaat acgagaaaat ctgcatgttt 9661 gattatagta ttaatggaca aataagtttt tgctaaatgt gagtatttct gttccttttt 9721 gtaaatatgt gacattcctg attgatttgg gtttttttgt tgttgttgtt ttgttttgtt 9781 ttgttttttt gggatggagk ctcactcttg tcacccaggc tggagtgcag tggcgccatc 9841 tcggctcact gcaacctctg cctcctgagt tcacgtaatc ctcctgagta gctgggatta 9901 caggtgcctg ccaccacgct ggccaatttt tgtactttta gtagagacag tgtttcgcca 9961 tgttggccag gctggtttca aactcctgac ctcaggtgat ccgcccacct cagcctccca 10021 aaatggtggg attacaggtg tgtgggccac cgtgcctggc tgattcagca ttttttatca 10081 ggcaggacca ggtggacttc cacctccagc ctctggtcct accaatggat tcatggagta 10141 gcctggactg tttcatagtt ttctaaatgt acaaattctt ataggctaga cttagattca 10201 ttaactcaaa ttcaatgctt ctatcagact cagttttttg taactaatag attttttttt 10261 ccacttttgt 
tctactcctt ccctaatagc tttttaaaaa aatctcccca gtagagaaac 10321 atttggaaaa gacagaaaac taaaaaggaa gaaaaaagat ccctattaga tacacttctt 10381 aaatacaatc acattaacat tttgagctat ttccttccag cctttttagg gcagattttg 10441 gttggttttt acatagttga gattgtactg ttcatacagt tttataccct ttttcattta 10501 actttataac ttaaatattg ctctatgtta gtataagctt ttcacaaaca ttagtatagt 10561 ctccctttta taattaatgt ttgtgggtat ttcttggcat gcatctttaa ttccttatcc 10621 tagcctttgg gcacaattcc tgtgctcaaa aatgagagtg acggctggca tggtggctcc 10681 cgcctgtaat cccagtactt tgggaagcca aggtaagagg attgcttgag cccagaactt 10741 caagatgagc ctgggctcat agtgagaacc cgtctataca aaaaattttt aaaaattagc 10801 atggcggcac acatctgtaa tcctagctac ttggcaggct gaggtgagaa gatcattgga 10861 gtttaggaat tggaggcggc agtgagtcat gagtatgccg ctgcactcca gcctggggga 10921 cagagcaaga ccctgcctca aaaaaaaaaa aaaaaaaaat tcaggccggg aatggtggtt 10981 cacgcctgta atcccagcac tttggggggt cgaggtgggc agatcacctg aggtcaggag 11041 ttcgagacca gcctggccaa catggtaaaa ccccatttct actaaaaaat acaagaatta 11101 gctgggtgtg gtggcgcatg cctgtaatcc tagctactca ggaggctgag gcaggagaat 11161 cacttgaccc caggaggcga agattgcagt gagctgatat cgcaccattg tactccagcc 11221 tgtgtgacag agcaatactc ttgtcccaaa aaaaaaaaaa attcaaatca gagtgaagtg 11281 aatgagacac tccagttttc cttctactcc gaattttagc tcctcctttc aacattcaac 11341 aaatagtctt tttttttttt tttttttttt ggggatggag tctccctctg ttgcccaggc 11401 tggagtgcag aggtgcgatc tctgctcact acaagctctg cctcccgagt tcaagtgatt 11461 ctcctggctc accctcctga gctgggatta caggcgcctg ccaccatgcc tggctaattt 11521 tgtgttttta gtggagacgg ggtttcacca tgttgtccag gatggtcttg atctcctgac 11581 cttgtgatcc acccacctca gcctcccaaa gtggtgggat tacaggtgtg agccaccgcg 11641 tccagccagc tttattattt tttttaagct gtctttgtgt caaaatgata gttcatgctc 11701 ctcttgttaa aacctgcagg ccgagcacag tggctcatgc ctgtaatccc agcattttgg 11761 gagaccaagg cggatggatc acctgaggtc aggagctcaa gaccagcctg gctaacatgg 11821 tgaaacctca tctccactta aaatacaaaa attgccggcc gcggcggctc atgcctgtaa 11881 tcccagcact ttgggaggcc taggcgggtg gatcacgacg tcaggaaatc gagaccatcc 
11941 tggctaacac gggtgaaacc ccgtctctat taaaaaatag aaaaaattag gcgggcgtgg 12001 tggtgagcgc ctgtagtccc agctactcga gagcctgagg caggagaatg gcatgaacct 12061 ggaaggtgga gcttgcagtg agctgagatg gtgccactgc actctaacct gggcgacaga 12121 gtgagactcc gtctcaaaaa aaaaaacaaa aaccaaaact tatccaggtg tggcggtggg 12181 cgcctgtgag gcaggcgaat ctcttgaacc cgggaggcgg aggttgcagt gagccaagat 12241 cacaccattg cactccagcc tgggaaacaa gagtgaaatt ccatctcaaa accaaatttt 12301 caaaaaaaaa acatgccgct tgagtactgt gtttttggtg ttgtccaagg aaaattaaaa 12361 cctgtagcat gaataatgtt tgttttcatt tcgaatcttg tgaatgtatt aaatatatcg 12421 ctcttaagag acggtgaagt tcctatttca agtttttttt gttttgtttt gtttttaagc 12481 tgttttttaa tacattaaat ggtgctgagt aaaggaaata ggcagggtgt gttgtgtggt 12541 gttttaacta ggcgcttctc tctcagagag ttttgaaacc tgtttacata aaggcccaag 12601 atgggaagga gatccaaaca taagccacca gcctcattcc aagtctcttc tctttccaac 12661 cctggatttt ttttttttat ttaacattgt ttcttttagc tttatttttc ttataaaaga 12721 aatgtatcac tataaaaaat tacacactac agaaaaatat taagaagaaa aacattcaca 12781 tcggaaacaa agttttttcc catgaaaaca gaacccaaaa gggtaagtgg ttagtatttc 12841 accagcaatt atgttgagaa taaggccagg cgaggtggct cacgcctgta atctcagcac 12901 tttgggaggc cagggcaggc agatcatctg aggtcaggag tttgagacca gcctggccaa 12961 catggtgaaa ccctatctct actaaaaatt aaaaaattag ctgggtgtgg tggcatgtac 13021 ctgtaatccc agctattcag gaggctgagg caggagaatt gcttgaacct gggaggcgga 13081 ggttgcagtg agctgagatt gcaccattgc actctagcct gggcaacgag tgaaactccg 13141 tctcaaaaga aaaaaatata tatatataga gagagagaga gagagaatac cacagtgagg 13201 gcatgggcta gaaatcagtg cactaaggat atgaaataga tgtcaatgtg aacttttcgg 13261 atactttgac cctgggtctt tgtatcctct tcttagcacc tcagtcccac gctctgctag 13321 tcattggctt cctgataccc cttcaataca gactgagtat ccctaatcca aaaatttcaa 13381 atccaaagca ctccaaaatc caagagtcca acgtgacgcc acaagtggaa agttccacat 13441 gcgagtactt aacacaaact tgtttcacgt gcaaaactgg ggaaaatatt gcttacaatt 13501 acctacagcc tgtgtctata aggtgtttat gaaactggtg ttatgatgta tatgttttct 13561 ttttttgttc ctggctcata actcccatag cccttgttac 
ggatgtgagc caccttgcct 13621 ggctgatttt taagtttttt gtagagatgg ggtctcgctg tgttgccctg gctggtttta 13681 actcctgggc tcaagcgatc ctcccacctt ggcctcccaa agccctggga ttacaggtga 13741 gattacaacc ctcatttcag agaaggtcct accccatacc ctgggggaag gaatggtgac 13801 atcataaagc ctcgttaaaa cccatgagga cagtggagag tgtcaggata gctgaactac 13861 gtgtagaggt tcctggaggg tggtgcgccc agggagggga cagaagctct gcgcccctta 13921 tcccatacct tggtgtacgc atctcttcat ctgtatcctt cgtaatatcc tttatgataa 13981 accaggtagg ccgggcgtgg tggctcacac atataatccc agcactttgg gaggctgagg 14041 taggaggatt gcttcagcct gggagttcaa gataacatca tagtgagatc ctgtctctac 14101 tagaaaaaaa aagaacaacc aggagtggtg gcgcatgctt gcagtcccag ctgttcagtt 14161 tgcactccag cctgggagac agagcaagac ctgctgtctc aaaaaaaaaa gactggtaaa 14221 catttttcac tgagttctgt tagccactcc agcaaattaa acccaaagcg aaggtggtgg 14281 gaaccccaac ttgaagctgg ttggtcagaa gttctggagc cctaaacttg ctactggtgt 14341 gtgggtgggg gcagtcttgg ggactgaggc ctcaacctgc aggatctgat attatttcca 14401 gaaagatggt gttggaagtg aattagagga tacctaattg gtgttcactg cagaattgat 14461 tgcttgctcg ctctcgggaa gaaatctaca catttggaca cgaaagtgtt ctgggttggt 14521 attgtgttag tgtggaatct aga // LOCUS AF015812 7834 bp DNA PRI 11-NOV-1997 DEFINITION Homo sapiens RNA helicase p68 (HUMP68) gene, complete cds. ACCESSION AF015812 NID g2599359 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7834) AUTHORS Hloch,P., Roessler,O.G., Weitzenegger,T. and Stahl,H. TITLE Genomic Organization and Expression of Human p68 RNA Helicase JOURNAL Unpublished REFERENCE 2 (bases 1 to 7834) AUTHORS Hloch,P. and Stahl,H. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Universitaet des Saarlandes, Medizinische Fakultaet, FR 3.3 Medizinische Biochemie, Geb. 
45, 66421 Homburg, Germany FEATURES Location/Qualifiers source 1..7834 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q23-q25" mRNA join(938..1151,2419..2584,2933..3029,3135..3268, 3383..3448,3677..3818,3900..4060,4149..4321,4704..4814, 5028..5089,5138..5197,6441..6665,6882..7586) /gene="HUMP68" /product="RNA helicase p68" gene 938..7586 /gene="HUMP68" 5'UTR 938..1107 /gene="HUMP68" CDS join(1108..1151,2419..2584,2933..3029,3135..3268, 3383..3448,3677..3818,3900..4060,4149..4321,4704..4814, 5028..5089,5138..5197,6441..6665,6882..7285) /gene="HUMP68" /note="nuclear protein; similar to DEAD box protein encoded by GenBank Accession Number X52104" /codon_start=1 /product="RNA helicase p68" /db_xref="PID:g2599360" /translation="MSGYSSDRDRGRDRGFGAPRFGGSRAGPLSGKKFGNPGEKLVKK KWNLDELPKFEKNFYQEHPDLARRTAQEVETYRRSKEITVRGHNCPKPVLNFYEANFP ANVMDVIARQNFTEPTAIQAQGWPVALSGLDMVGVAQTGSGKTLSYLLPAIVHINHQP FLERGDGPICLVLAPTRELAQQVQQVAAEYCRACRLKSTCIYGGAPKGPQIRDLERGV EICIATPGRLIDFLECGKTNLRRTTYLVLDEADRMLDMGFEPQIRKIVDQIRPDRQTL MWSATWPKEVRQLAEDFLKDYIHINIGALELSANHNILQIVDVCHDVEKDEKLIRLME EIMSEKENKTIVFVETKRRCDELTRKMRRDGWPAMGIHGDKSQQERDWVLNEFKHGKA PILIATDVASRGLDVEDVKFVINYDYPNSSEDYIHRIGRTARSTKTGTAYTFFTPNNI KQVSDLISVLREANQAINPKLLQLVEDRGSGRSRGRGGMKDDRRDRYSAGKRGGFNTF RDRENYDRGYSSLLKRDFGAKTQNGVYSAANYTNGSFGSNFVSAGIQTSFRTGNPTGT YQNGYDSTQQYGSNVPNMHNGMNQQAYAYPATAAAPMIGYPMPTGYSQ" 3'UTR 7286..7586 /gene="HUMP68" BASE COUNT 2059 a 1513 c 1990 g 2272 t ORIGIN 1 cgtctgaagg tcacgagccc cgccgacagc ccagacccag tccgggctag cccgaggcct 61 ccctggaggt ggacggtttc agtccacaca tactgggacc ccagggagac actcaccagc 121 atccgagcct gccatgtttc agaggcaggt cgccgccgga ctccgacgcg gccgggaagg 181 cgacggtgtc ctggaaggac cgatccacgc agaccgacac tgggcgcgga cgcacgaacc 241 aaagcgcggg aaggaggcgt gaagaaggac ggacgttaaa gagcttctcg ccgctgattg 301 gtcatcagag gagcacttcc ttcacaggac gtgaaacggg ggcggtttgg gaagtttaga 361 gaccattctc cgccgaccaa aacccgtcaa aggattatca gacacgcggg tcggacggtc 421 cacatcagcc ggcagcccgg gcgggtcccg gggtgcgagc 
agcgcacttc cggtgagcta 481 tttcgttttg tatccctccg ccgacgtcaa cgggaaagta gtgcggaccg ctctctcggt 541 ggtccggggt ggtacagcca cgtgacaacg ccaggccccg ccttccccct cttttggtta 601 cagacgtgag ggctctttgg agacgtaaac atctccgagt ggcgagggtg ggcggggcta 661 gggcttggga aagggcgggg tggcttgctt gaggtgtgga aagaccagaa gaaggtgagg 721 tcaagagagt gcgaatgagg cattccaatg gtgggtgggc cctgacctga gagagtggcg 781 cggggagggg tgaaagcgcg gcgatcctgg aacgccagcg ggcgttgcgg cctatgcgcg 841 aggggcgggg cgattaggtc atagagcggc tcccagcgtt ccctgcggcg taggaggcgg 901 tccagactac aaaagcggct gccggaaagc ggccggcacc tcattcattt ctaccggtct 961 ctagtagtgc agcttcggct ggtgtcatcg gtgtccttcc tccgctgccg cccccgcaag 1021 gcttcgccgt catcgaggcc atttccagcg acttgtcgca cgcttttcta tatacttcgt 1081 tccccgccaa ccgcaaccat tgacgccatg tcgggttatt cgagtgaccg agaccgcggc 1141 cgggaccgag ggttattcga gtgaccgaga ccgcggccac cgagggtgag tttgggagcc 1201 gagctgtcag gccaggcggg tggggggatg ggagggcggg tcagggtggc ggccggcggg 1261 ggctttgcgg cttggacttg gcctttccgg gctatcttgg gacttccttt cccgaacgtt 1321 gcgccatttt gatattcacg tcacagtgat tggaagagat ttgacggtgt agtgtcttca 1381 agcttgcttt ttgtgtgggg atttggggag ctgtcggggc ggctgccatt tggtagctgt 1441 tgagggagtt gagagggagc gtattgtgcg gatgaaagcg gacgcttcga ggcatgacga 1501 aggaacatct gttaggtgcg gcgtttcggt aggtgttttt ggggtggccg ggcattctgt 1561 gggagcgagg ggaccacttc caaagccctg gtgctgttgg ggtaggaggg cggccggcat 1621 cagccatgtg gctgagtcgc gagtacaaaa tgccggcctc ggacatggcg gcggcgcctt 1681 tgttaccccg cccggcggag gagctcaaaa tggcagcgtc gagaaaatgt ggcgcagaga 1741 gaaatgcgag acaaaggggg aagcgccgcc ccagcgggaa cgccgcccgg ccgactccgc 1801 ccgggccggg actcctcccc cggtagtcgc cggctcctcc ttttcttttt tcctgcgtta 1861 tataattttg attcgttgat ccggagctct accgcggcgt tcccccagct gggtttgcta 1921 gcagaagtgt ttctgagaaa acccttgttc tgttatcgct gactgtactg tttaggttct 1981 taccatcaaa gctgtttggt tccaaaacgg ccatatgagt aacatcgtcg tgatgctctt 2041 cggttcatgt agccttgtta ttgctgatag tgaattgcta ggctggtggg gaagattaca 2101 gtaaccacaa gaagtggtgt gtgccagaat cccaaattct ggcatgtggg 
tgacaagttt 2161 ccgacatgat aaatccccgg cttccgacat gataaatccc aggctgttta catgacctaa 2221 gtaatgtgta cttgggacta cgggaaatgt taactgtggc tgttgagaga gagagagatt 2281 ttcacgaagg acagtgctag gtttacctct cgaagtctgt tttcagtggt ttttagcttg 2341 tgccaatgga tgacaaatct atacagaaac ctgggtatag cctaaagaaa atgtgaataa 2401 cgtttttttt cattccaggt ttggtgcacc tcgatttgga ggaagtaggg cagggccctt 2461 atctggaaag aagtttggaa accctgggga gaaattagtt aaaaagaagt ggaatcttga 2521 tgagctgcct aaatttgaga agaattttta tcaagagcac cctgatttgg ctaggcgcac 2581 agcagtgagt aaattcatgt ggcttcatca ggctgtaact cgatcgtgga ttctagtaaa 2641 tgaaattctg acaggtgttt tgcaaataac tcaattttgg tagagttaca tgttctgact 2701 tcataattgg gaaaggtgtg actcactttt ggaatatagg tggctttggg atttttactt 2761 aaattaggtt gagtataaca agaaattttt ttttcataat agggtgttca taggtgggtc 2821 agattaaaat gaaggctact ttaactagtt actaaattat gaagttaggg gcttatcaat 2881 tacgtattta cgtagggtgg tgtcatgaat ttagactgta tattgtttgc agcaagaggt 2941 ggaaacatac agaagaagca aggaaattac agttagaggt cacaactgcc cgaagccagt 3001 tctaaatttt tatgaagcca atttccctgg taagtgctac ttttcagttc tacctacccg 3061 tgtttttgtt tccacctacc ccctcttttt cttggcatca ctaattttta ctaaatatct 3121 gttactaatt atagcaaatg tcatggatgt tattgcaaga cagaatttca ctgaacccac 3181 tgctattcaa gctcagggat ggccagttgc tctaagtgga ttggatatgg ttggagtggc 3241 acagactgga tctgggaaaa cattgtctgt aagtttggga gaactcttga gttgatctga 3301 tatatgcaag aaaatgtaat ggtaatttaa aaacgagtat tttaatgtga tttctgtttg 3361 tccccacttt caccctaaat agtatttgct tcctgccatt gtccacatca atcatcagcc 3421 attcctagag agaggcgatg ggcctattgt aagtatatat tttactttta ttagaagcat 3481 aatgtgtaga ttttagacta catagctaaa gatgtaatca tttgtggtgg ttttatatag 3541 aggttagctc atcctattca gctggagctg ttttgggtat tggacaacac atgaagaaag 3601 gatctgctag tataataagt tagcagttta aaactagtac caggtttgtg ctgaaagctg 3661 tttctctttt ccttagtgtt tggtgctggc accaactcgg gaactggccc aacaggtgca 3721 gcaagtagct gctgaatatt gtagagcatg tcgcttgaag tctacttgta tctacggtgg 3781 tgctcctaag ggaccacaaa tacgtgattt ggagagaggt atgtaatgaa aagggtttta 
3841 tttgtcattg gtgctaaata tcctaggtat tgtagttaca cttacgtatt taattaaagg 3901 tgtggaaatc tgtattgcaa cacctggaag actgattgac tttttagagt gtggaaaaac 3961 caatctgaga agaacaacct accttgtcct tgatgaagca gatagaatgc ttgatatggg 4021 ctttgaaccc caaataagga agattgtgga tcaaataaga gtaagtgtcc tttgaaatat 4081 gtgatcaaac tgaattgtgt ttcactctta agagtctgat actaattttt ccccccaaaa 4141 tccattagcc tgataggcaa actctaatgt ggagtgcgac ttggccaaaa gaagtaagac 4201 agcttgctga agatttcctg aaagactata ttcatataaa cattggtgca cttgaactga 4261 gtgcaaacca caacattctt cagattgtgg atgtgtgtca tgacgtagaa aaggatgaaa 4321 agtaagtttt attaactctg ttatatttgc ttcctaacaa ctttgctgta aaattgagga 4381 tcattgtttg gtgagttgtt ttaggttatt tcagttggtg tgatttcatt tagttagcct 4441 actaatcctg aaaatttctt gaatcttcaa ataatggccg tcaccattta tagctttcca 4501 tatgaagaat tgaattcatg tctccctggt tgacttaagg accaagggtc gaactgctcg 4561 ataagtggat tagcaggcgt cttccttcct tttgaccttt ccagccatgt aaattgaact 4621 taatgttttg ctgaccataa atgtgtggcc ctagcaatgg tcttttaaaa ctcaggattt 4681 tcctttctct ctcctattat tagacttatt cgtctaatgg aagagatcat gagtgagaag 4741 gagaataaaa ccattgtttt tgtggaaacc aaaagaagat gtgatgagct taccagaaaa 4801 atgaggagag atgggtatgt gtgagctcct ccttgaagca gattgattaa aacagcttag 4861 gaagggcaaa cttggatcac gagcagtgga tttttttcat atctgatagt gaatttaact 4921 ttttcatttc tggcgaaatt aaagagatct gtgaccaaaa gtggtcaagc actggagtct 4981 gaggttttca atgtgagttt aataacacaa cttgtctttt aacttaggtg gcctgccatg 5041 ggtatccatg gtgacaagag tcaacaagag cgtgactggg ttctaaatgg taaatatttc 5101 aaatgaagta tttttccccc ttacttaacc tagctagaat tcaaacatgg aaaagctcct 5161 attctgattg ctacagatgt ggcctccaga gggctaggtt agtacaaact cgcattcatg 5221 gcttggtttc ccagaagatc tccatttaac ttttttaaag aaagtttatt gctttcttta 5281 acctgcattt tttctaagtt ttttttcaca taaaggtgct gtctttgtgg caaggcctag 5341 gcatgacaat cggaggactc gagggggatg gaggactagt gatcggctgg ctgcttccag 5401 tcgattagag aggtgaaaag ctgaacgtgt gccagtaatc ttcaaaaggc agaacatatc 5461 acctctgccc cgtaaactgt tctctccgag ggaaaaaatg gaagttatct cacagttcac 5521 
tgccgtggta tttcttctgt cccatgcttt gcatgactgc catggtacag ccttgtttca 5581 aactgttcac tgtgatctgt gggtctttga gtttcagtga gtttgctgaa atgtcgaaga 5641 agtagttcca aacttcaatg ttcaatgaaa tttttgttca agtttgaaat ggagagagca 5701 gctttaaaag gtactaagcc ttttacaaat tggtgagtta ctggcacatg agatctagag 5761 caggagcaac ttctacacac tatgagtaag tgggaaaaga aagtgctttg aaagttcctc 5821 cctcacctac acagtagtcg tcatgtcgag acctgccaga gagagacaca ttctcaagtg 5881 aatcctggct tcttggaagc gcttgcctag acgagacaca gtgcataaaa acaacttttg 5941 ggggacaggt atgttttctt gcagctgcgg ttgtaaggtc ttggcaagac aagcagtgtg 6001 gccagaattt tgaacttctg atgaatgtgt aatgcaaagg accttgtaca tttttttgtt 6061 tcaaggtcct caaaatgagc acatgaagag gttgctgtga aactttaagt ggccctactg 6121 cgcagaagca ttcagatgtc acttgatgat ctgtaaggga acttgctgat ttgggaatgt 6181 gcttatttaa cacacattcc ttttgacagg gtctgtcact ggggtggggg tgatgaatta 6241 tacagatgac atgtgctttt tttttctttt ttcaacctca atggtattcc tacaggaaat 6301 ggataaccat tttaactgta tttttttgca gcccgtacct tcttgggaat acaattgtct 6361 aactttttat ttttggtctg gctgttgtgg tgtgcaaaac tccgtacatt gctattttgc 6421 cacactgcaa caccttacag atgtggaaga tgtgaaattt gtcatcaatt atgactaccc 6481 taactcctca gaggattata ttcatcgaat tggaagaact gctcgcagta ccaaaacagg 6541 cacagcatac actttcttta cacctaataa cataaagcaa gtgagcgacc ttatctctgt 6601 gcttcgtgaa gctaatcaag caattaatcc caagttgctt cagttggtcg aagacagagg 6661 ttcaggtaag gatgactgat aggaaatgtt ggtagttacg gtcactacgt atacaaatcc 6721 atttaaatgg tattggaggg tgagtaaaac cttgaagtga aaacttaagc tgaaaaattg 6781 taaaaacatt tcacgcctac catgaataga tctgtttctt ctgtccacaa tgatttgtgt 6841 catagacata attgatcaat ttgcaattgt tttcttgaca ggtcgttcca ggggtagagg 6901 aggcatgaag gatgaccgtc gggacagata ctctgcgggc aaaaggggtg gatttaatac 6961 ctttagagac agggaaaatt atgacagagg ttactctagc ctgcttaaaa gagattttgg 7021 ggcaaaaact cagaatggtg tttacagtgc tgcaaattac accaatggga gctttggaag 7081 taattttgtg tctgctggta tacagaccag ttttaggact ggtaatccaa cagggactta 7141 ccagaatggt tatgatagca ctcagcaata cggaagtaat gttccaaata tgcacaatgg 7201 tatgaaccaa 
caggcatatg catatcctgc tactgcagct gcacctatga ttggttatcc 7261 aatgccaaca ggatattccc aataagactt tagaagtata tgtaaatgtc tgtttttcat 7321 aattgctctt tatattgtgt gttatctgac aagatagtta tttaagaaac atgggaattg 7381 cagaaatgac tgcagtgcag cagtaattat ggtgcacttt ttcgctattt aagttggata 7441 tttctctaca ttcctgaaac aatttttagg ttttttttgt actagaaaat gcaggcagtg 7501 ttttcacaaa agtaaatgta cagtgatttg aaatacaata aatgaaggca atgcatggcc 7561 ttccaataaa aaatatttga agactgaatt aagtggaaat tgtactttat ttatataatg 7621 tcatgtaaaa ctttgcttaa gatggtctgg tttttttttt gtttttgttt ggtttttttt 7681 ttccatgaaa acaaatgact gttccttttt atttaatttg ggaggcaggg ggaatcagaa 7741 ggcccttctt tataatgagc tattcatatt gcaggagtca gaatgaattg atacaggtga 7801 atttttagtt acaggctaaa ttgcataaaa gctt // LOCUS AF015954 4031 bp DNA PRI 02-NOV-1997 DEFINITION Homo sapiens lymphopain gene, complete cds. ACCESSION AF015954 NID g2582180 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4031) AUTHORS Brown,J. and Enver,T. 
TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Leukaemia Research Fund Centre, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, England FEATURES Location/Qualifiers source 1..4031 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" /clone="Genome Systems clone address DMPC-HFF#1-0258-E" /note="human genomic P1 clone" mRNA join(<135..221,482..566,1686..1799,2455..2609,2710..2806, 2889..2969,3059..3184,3349..3413,3495..3704,3788..>3898) /note="C1 peptidase family member" /product="lymphopain" CDS join(135..221,482..566,1686..1799,2455..2609,2710..2806, 2889..2969,3059..3184,3349..3413,3495..3704,3788..3898) /note="C1 peptidase family member" /codon_start=1 /product="lymphopain" /db_xref="PID:g2582181" /translation="MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKL FQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQL YGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAG NIETLWRISFWDFVDVSVHELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQ GKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGV IKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWG AQWGEKGYFRLHRGSNTCGITKFPLTARVQKPDMKPRVSCPP" BASE COUNT 814 a 1239 c 1143 g 835 t ORIGIN 1 ttgtgggttc taggggtcca ccctccaccc accaagcctg tcttcccctt cgtggtttct 61 ccagccccgc cctcacaccg tcctcctcga tctgcgcggc ttcctgcctc catgccactc 121 cagactgcac cggcatggca ctgactgccc acccctcctg cctcctggcc ctgttggtgg 181 caggcctagc ccaaggcatc agaggccccc ttagggccca ggtaagtgct cccaaaccct 241 tacctatgcc agtcccacgc ctggggtcac cccttcagaa acagctggcc tgtcattctc 301 atttttcaga gggagttgct gaggctggag agggggctga cgtgcccaag ggcacatggg 361 aagacagaaa aggctaactg gctgacccct agcccagtcc ttgcacggct gggaactaag 421 gtggggcccc agcaatgatt cctctgccca ctgcccctct ccacccttgg ttccttgcca 481 ggacctaggt ccccagccgc tagagctgaa agaggccttc aagttgttcc agatccagtt 541 caaccggagt tacctgagcc cagaaggtat cacagggcac atacatctcc agtccaagcc 601 cccacttttt tttttttgag acggagtttc actcttgttg accaggctgg agtgcaatgg 661 tgtcatcttg gctcactgca 
acctccacct cccgggttca agcgattctc ctgcttcagc 721 ctctgagtag ctgggattac aggcatgcgc caccacactt ggctaacctt tacattttta 781 gtagagacgg ggtttgacca tgttggccag gctggtcttg aactcctgac atcaggtgat 841 ccgcctgctt cggcttccca aagggctggg attacaggtg taagccacgg agcccagcca 901 tcccgctttg taatagatga agaaatcgaa gctcagggtg ggaaacaggc atggccaagg 961 ccactcagca ggacaggcca gggctggggg tgtgttccct ggtttcccga aggcatgagt 1021 gggtccagga tcattctttg ctatggatct ggagggcagt ggcaggaggt ggggtcctgc 1081 cctagtccta cgtgatcccc tggctccaca gccacttccc tgcgattcta ggcagtggac 1141 agggggccac cacctagtca tttgctcatt cagcaaacac ccaaggacac ccattttgtc 1201 cccacattgg ggacttcttc tctagcccat gatctaaccc ctgcccctgc cctctcgctg 1261 tgctgatctc tctagggggc tgatctgtgt gtgcctttta gcccagactg cctgtgcctt 1321 gtcatctcct ctccctgcca tgttccagct ctcctcctgt ggtcagaggt gtaaaggacg 1381 tggcctggca ggaagctggt ctccagccac acatccttga gtcaccacca ccggacccct 1441 tgcctggctt cccacctagg gtggccagag atggtcctct ggctgcctct ggggaaaacc 1501 caaccctcgg tgggctgggt aaccacttcc tgggcctggg tatctgcagt ggggccttgt 1561 gggggctaca ggaaaggagc cagtgctcaa ctgggtgggt gtgggtggaa gggtgcccag 1621 ccagcggcta cttcctgccc ctaagggctt gggagtcagc ctaggacaac tcccttttgg 1681 tccagagcat gctcaccgcc tggacatctt tgcccacaac ctggcccagg ctcagaggct 1741 gcaggaggag gacttgggca cagctgagtt tggggtgact ccattcagtg acctcacagg 1801 taccattaac ccttctggct cgtaggtggt ggttgggaag cgatctcccc agggactctg 1861 gtagctggac ccagcggacc ttcaattttt acttcccagg gaaggaaatt catttccctg 1921 aggaccaggg gcttgggtgc tggtatcaaa tcacccagtg caggggcaga atctctgcag 1981 tcctgtgtcc taatccagcc taggggccag cgcagcagct gcagggaagc acccagcagg 2041 gagcccagcc tctctaggct tggcaccctt gcctttaggg agacagggtc ttgctctgtt 2101 gtccaggctg cagtacaata gtgcaatcac agctcactca gcctccaccc tcctagactc 2161 aaagcaattc ccccgacccc tcaagccttc ttaagtagct ggggctacaa gccgcacaac 2221 cacgcccagc taattttcta cttttgtgga gatgggtctc ggactcctga cctcaagtga 2281 tccacccacc ttggcctccc aaagtgctgg gattataggt gtgagccacc acgcccggcc 2341 tatccttgcc tttttgcccc tcagggaatt 
atgggaccaa cagctggagt gggagaggca 2401 gaacagggag aaggggtcaa aatggagacc tcagtggccc tgtctgttcc ccagaggagg 2461 agtttggcca gctctatggc tatcggaggg cagctggagg ggtccccagc atgggcagag 2521 aaataaggtc tgaagagcca gaggagtcag tacctttcag ctgtgactgg cggaaggtgg 2581 ccggcgccat ctcacccatc aaggaccagg tatctgccgc tacccagctg gctctaattc 2641 agctaagtgg tgggagggag agggtcacgg cccgaacacc tagccccgcc ccaccctctc 2701 gccctccaga aaaactgcaa ctgctgctgg gccatggcag cggcaggcaa catagagacc 2761 ctgtggcgca tcagtttctg ggattttgtg gacgtctccg tgcatggtag ggttgggaga 2821 gggtgcgcgt gtgacagggg agggaggcct aggggcctgg tcacccacac tgtcccttct 2881 tgcaccagaa ctgctggact gtggccgctg tggggatggc tgccacggtg gcttcgtctg 2941 ggacgcgttc ataactgtcc tcaacaacag tgagtggact gccccgccac tctggacatc 3001 tgcaggggga cagggtgggc aggagcggaa cctcctccct tgtcttgctt atctgcaggc 3061 ggcctggcca gtgaaaagga ctacccgttc cagggcaaag tcagagccca caggtgccac 3121 cccaagaagt accagaaggt ggcctggatc caggacttca tcatgctgca gaacaacgag 3181 cacagtgcgg gcagggcagg gacacgggcg gatggcaggg acagacaccg gggcagaggc 3241 agacacgctg ggctactggg ctagaagaca agagggggtg gggggagcga aggagcgaga 3301 gacccacaca cccacatgcc aagggtctga tgatgttcct gtccccagga attgcgcagt 3361 acctggccac ttatggcccc atcaccgtga ccatcaacat gaagcccctt caggtgagat 3421 gggggagctg atggggaagg ggcatacagg agacttggtc ccacactcag cccctcggcc 3481 cccaccccct gcagctatac cggaaaggtg tgatcaaggc cacacccacc acctgtgacc 3541 cccagcttgt ggaccactct gtcctgctgg tgggttttgg cagcgtcaag tcagaggagg 3601 ggatatgggc agagacagtc tcatcgcagt ctcagcctca gcctccacac cccaccccat 3661 actggatcct gaagaactcc tggggggccc aatggggaga gaaggtgagt gtgatctatt 3721 gggggagggg gcaaggcaga acaggcctct tcccaccttc ccgcccctat gtcccctaac 3781 ctcctagggc tatttccggc tgcaccgagg gagcaatacc tgtggcatca ccaagttccc 3841 gctcactgcc cgtgtgcaga aaccggatat gaagccccga gtctcctgcc ctccctgaac 3901 ccacctggcc ccctcagctc tgtcctgtta ggccaactgc ctccttgcca gccccacccc 3961 caggtttttg cccatcctcc caatctcaat acagcctgaa taaaccaaga caagacctct 4021 ggcttgtgaa a // LOCUS AF016052 9700 bp DNA 
PRI 16-SEP-1997 DEFINITION Homo sapiens zinc finger protein ZNF191 (ZNF191) gene, complete cds. ACCESSION AF016052 NID g2394173 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9700) AUTHORS Shi,S., Yu,J., Liu,M., Wu,G., Zhou,Y., Chen,S., Zhao,S., Chen,Z., Tan,J. and Yu,L. TITLE Cloning and Characterization of a Novel Human Zinc Finger Gene, ZNF191 JOURNAL Unpublished REFERENCE 2 (bases 1 to 9700) AUTHORS Yu,L. TITLE Direct Submission JOURNAL Submitted (25-JUL-1997) Institute of Genetics, Fudan University, Handan Road, Shanghai 200433, People's Republic of China FEATURES Location/Qualifiers source 1..9700 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="G16-1" mRNA join(787..868,4494..4996,5251..5398,7458..9700) /gene="ZNF191" /product="zinc finger protein ZNF191" exon 787..868 /gene="ZNF191" gene 787..9700 /gene="ZNF191" exon 4494..4996 /gene="ZNF191" CDS join(4577..4996,5251..5398,7458..7996) /gene="ZNF191" /codon_start=1 /product="zinc finger protein ZNF191" /db_xref="PID:g2394174" /translation="MSAQSVEEDSILIIPTPDEEEKILRVKLEEDPDGEEGSSIPWNH LPDPEIFRQRFRQFGYQDSPGPREAVSQLRELCRLWLRPETHTKEQILELVVLEQFVA ILPKELQTWVRDHHPENGEEAVTVLEDLESELDDPGQPVSLRRRKREVLVEDMVSQEE AQGLPSSELDAVENQLKWASWELHSLRHCDDDGRTENGALAPKQELPSALESHEVPGT LSMGVPQIFKYGETCFPKGRFERKRNPSRKKQHICDECGKHFSQGSALILHQRIHSGE KPYGCVECGKAFSRSSILVQHQRVHTGEKPYKCLECGKAFSQNSGLINHQRIHTGEKP YECVQCGKSYSQSSNLFRHQRRHNAEKLLNVVKV" exon 5251..5398 /gene="ZNF191" exon 7458..9700 /gene="ZNF191" BASE COUNT 2653 a 1892 c 2031 g 3124 t ORIGIN 1 ttacaggccc attataagga gtttttatgt ccataaatta ttatgacata aattggaggg 61 ctttaaatta ggggagtgaa gtggtctgat ttgtaatttt aaatgatcgc gtcagcagtt 121 gttttatggt ataaaatgcc cagctttaag atgaccatgc attctacaaa ttcaccgtga 181 catcactgcc aaaactctta acctgtctta atctagtttt cctgacagat aaataggtcg 241 gaataaagct ccatgacata actagtggga gccaaagatt tgaatcacgt cttggtttca 301 aaacttgcta 
cccacaaatg agttgtgagc tcttgtcagg ttatttactg tatttgtgcc 361 tcaactatct cagctgtgaa atggggataa tgaaagttcc tacatcattt atttgtggaa 421 aaaaaaatga catgtaaagc acgacgagag cgcctggcac aatgtagtaa gctttcacaa 481 acactcactt tcattccttc ttgcagggga ttactgagag gatccacaag ataatgcacg 541 tgaaagggct tgcgaaattg taaggactat cttaatggta gagtcctctt tatgttttct 601 tttaaacggt ccagaggacg ctgggctgtt cgaggagagg cgcgaggcca agcgcagcct 661 caatgcggtc attgtcgtgg ggccgcccct cccccgcccg gggcgcttct ccatggcaac 721 gtgggcttta cctctccccg cctcctcagg cccgcagacc ggaagcagcc cgcgccgggg 781 gtttctggga aaaggcttgt gaacggcgtt tctgcgtctg ccgtggacag cgaagctgct 841 gcggttcctg agccggaggt ttgcgccggt gtgcggggaa cgttatgaga gggagggggg 901 acaccgggct agggtctgcg gacctcttag caggagccgg cggggaagcc tgtccgggac 961 cttcctcagc ttcccgggag gcgcggggaa caggccgcgg tctccgcgcc tagcagttgt 1021 ggggcgggga ggggtaggcg gcaggcctgc ttgggtcttc ggcatttgca gagagctctc 1081 taaccccctg tgtcctccga gaccctcacc ttccccctcc cccaccggcc ggttttgttc 1141 tatttcgagg ccctggcgtc tcgcggaggg ttctgggagt cgtgatggga gggggtggta 1201 gtgggacgag acttgcgcgg ggcttctggg atttgtagtc tcaccggccg caggcggaag 1261 tgccgtggtt ccgacgctgt tgtcacaggg tggcctttgg tcatctgaaa tccgcggtcc 1321 tgagtgctgg gaagaggggc tagaggacgt cgggccgggg ccggggctgg ggccgggata 1381 cacagcaggt ctatgcagca gtaaggtggg cgatccacct gggacgaggt accctgccca 1441 aacattgtcg ggaccgtggg ccttccctcc atttcccaca agcggtatgc tctacagcat 1501 aaaaggaaga gaaaattatg tagaaatact tgcaccgttg ataagcaggg tcttggaata 1561 ggaacgaatg aatgttggct agaccttatg tataaggcca ccacactcca gggtatttaa 1621 ttaaaccagt atgtattact tgtctgataa ggccttttct gattctagtc ttcatcttgc 1681 ttgggttaca ataagttttt ttaagttgtt attgttcaga ttttgggccc tgaataaata 1741 aataggctga atggctcatt gctagtatgt gtcgtaaaca acagcacggg catgatcgtg 1801 caaataataa aggtccagaa gcagtataca gtttccctga aggtagaagc tggggaaact 1861 gtgagtcctc cagtgtaagg acaagacact ggaccactgc aggaagtcaa ggtgcttatg 1921 cgagtgcctt ttcttaatag aagtttctgt gattaatgcc tatcattata aatataaatg 1981 tacgtatggt gtggtggaag actgaatatt 
tacgtattat gtgcaagata atttttcaaa 2041 attatgtgat gtttttcaca tatgatcttt tcaatattat gtgatctgct actcagcaag 2101 tgaacatttt ttcacatata agtccaagga gttcttaaac accttaaaaa tattttttaa 2161 atggtaaaga aaattttttt aagcacataa taataaagcg aaccttcctt ttgagttgac 2221 ttgtgtgtgt caggaaacac caatcctgaa ttggaaataa tgcgttccag ggtcaggcac 2281 atttagaaag cactgaagta gtctggttct tacttggatt cttttgatta ttgacatagg 2341 gtttgtcgtc ttagtaggtc atgattggca aactttttct gtaagcggcc atatagtaaa 2401 taaggtcatg cataactcaa cgatagtgct atggtctgag aaatgtgtta ggctatttca 2461 tccttgtgtg aacgtcatag aatgtacttg cacaaaccta aatggcgtag cctactacac 2521 acttaggcta tctggtatag cctattgcgc ctaggctaca aaccggtacg gcatgttgct 2581 gtactgaata ctgtagagaa ttgtaacaca atgctaacta tttgtgtatc aaaacataga 2641 aaagataaga gtaaaatatg atataaaatg tttgaaaaaa aaaaaagata tgcctgtaca 2701 gggcacttac catgaatgga cctcgctgga ttggaagttg ctctaagaag tcagtgagtg 2761 agtggtgaat ttggaggcct gggacattac tgtacattcc tatagaccat ataaacactg 2821 aacacttagg ctacactaaa tttattaaga aaacattttt cttcaataat taaccttagc 2881 ttactggaac tttttcactt acaaactttt acattttttt ttaacttttt tactcttttg 2941 tgataacact tcacttaaaa tacaaacaca ttgtacagct gtacaaatat atttatatcc 3001 ttattctgta agctttttgc tttttttttt ttatagacag agtctcgctc tgtcgccagg 3061 ctagagtgca gtggcgcgat cttggctcac tgcagcctcc acctcgcggg ttcaagagat 3121 tctcctgcct caatctcccc agtagctggg actacaggcg caatccacca cgcccagcta 3181 atttttgtat ttttagtaga gacagggttt caccatgttg gccaggatgg tcttgatctc 3241 ttgacctcgt gatctgcctg cctcagcctc ccaaagtgct gggattacag gcctgagcca 3301 ccgtgcccgg cacagctttt tgctattttt aaattcattt tttagagaca gggtcatact 3361 ctgttgccca ggctggaatg cagtggtgtg atcatagctc agtggagcct tgaactcctg 3421 gtctttagtg atcctcctgc ctcagcctcc tatagtgctg ggattacagg tgtgaaccat 3481 tgcacctggc caattttttt tttcttttgg aactgttttt gttaaaagct aagataaaca 3541 cattagctta ggcttacaca gggtctggat tatcagtatc attgccttct acctccatat 3601 cttgtcccac tggaaggttt tcaggagcag taacatacat ggagctgtca tctccataat 3661 aacagtgcct tctgtaatac ctcctgaagg acctgcctga 
ggccattttg cagttaactt 3721 ttttatttag taagtagaag gagcacactc taaaataata tagtataaaa agtatagtaa 3781 atacataaac cagtaacata gtcgtttatt attgttatca agttttatgt accatacata 3841 actgtatgtg ttaaactggc aacacagcag gtttgtttac accagcatca ccacaaacat 3901 gtgagtaatg tattgacctt acaacagcta cagtgtcact aggtgaatct tgaatttttc 3961 agctccatta taatcttatg ggagcactgt catctatgtg gtccttcatt gacccaaaca 4021 tcattttgta gtgcatgact gtgttgggtt tttttaggcc atctagtctg ttgaaactac 4081 tcaactctgc tattgtagcg tgaagtatga gtaagtacaa ttatttatgg acacagtcat 4141 gttaatttca tatcattttc acgtgtcatg aggtattcat ttgattttgg ggagctccat 4201 tccctaaata tcattaggta ttcatttgat tttagggatc tccgctccgt aaggcactgt 4261 gatgaagacc acagccattt aaaaatgtga aaattattgt tggtgcaagt gttgtacaga 4321 aatgggtggc aggctagatt tggccgtgaa gctgtagttt gccaattcct ggtgtaggtt 4381 attgattgga atttttaact caaagaacaa gtatctttta cttttaccag tataagttca 4441 aattcttagg atatttttgc ttatttatat atcttttatt tcttatgcct caggagtgcc 4501 tgtgaagaaa acggggtatt gccctgaggc ttatattctg cttcagttgt cttttcttga 4561 aatattataa atcagaatgt ctgcacagtc agtggaagaa gattcaatac ttatcatccc 4621 aactccagat gaagaggaaa aaattctgag agtgaagttg gaggaggatc ctgatggcga 4681 agagggatca agtatcccct ggaaccatct cccagaccca gagattttcc gacagcgatt 4741 caggcagttt ggataccagg attcacctgg gccccgtgag gctgtgagcc agctccgaga 4801 actttgccgt ctgtggctca ggccagagac gcacacaaaa gaacaaatct tggagctggt 4861 agtgctggag cagtttgttg ccatcctacc caaagagcta cagacttggg ttcgagatca 4921 tcatccagag aatggagagg aggcagtgac agtgctggag gatttggaga gtgaacttga 4981 tgaccctgga caaccggtga gcctgcttgt gtcatttcct gtgtcagaag cactggcata 5041 ggtagtaggc aaagctgctg aattaattcc actcaactaa gttagaatta tgtttatgcc 5101 ttttccccca ttcccctacc actccttgtt atctgtgccg ctttgccatc taggcttctg 5161 cagtatttgt ttcatggggt ttctcttacg tgtttttagt tctgattctc attacagttc 5221 cttctctgaa tcaaatcttt ctgtttccag gtttctctcc gtcgacgaaa acgggaagta 5281 ctagtagaag acatggtatc tcaagaagaa gctcagggat taccaagttc tgagcttgat 5341 gctgtggaga accagctcaa gtgggcatcc tgggagctcc attccctaag 
gcactgtggt 5401 gaggaccaga actctgtgtg ggctagaggg tgaaagagag tataggcctt ttgttcagaa 5461 ctttgcttgt tctcactggt ggggggatca tatcactctt aatctaatat ctcagttttt 5521 ttctgattaa gttgattttg ctggcctcag accttttatt cctagacatt aacatcttca 5581 gccttttctt ccctttcagt ctccttccta aaaagagttg cccatgtcag gctagcattt 5641 gccttacttt gagccagcac ccacccatgg accaagtccc tctattttgc attcagtatt 5701 acttgtcatc tgttttacat ccttttactt tcaatctcct gcctaattcc attggtacag 5761 taagcttacc tgaaaacctc ttctaatctc tttgttgtat atcactgaaa tctttctaaa 5821 tatttatttc tgatttcctt caacgtctac ctagagtcat tttcttctaa gaagcttctt 5881 ttgtagaaat atttggagta atttttttcc tttttctaag tatttatgca aaaaaagcct 5941 ccttagctat aacatgcttt aaatgcagaa aaagcttttt taaaagctca aagttgtatt 6001 aattcacgag accagtcttt cccaatatat tagccattaa caccatatac ttatgtgtat 6061 cctttgatat ggcccagtat attcctaaaa ctgctagatt gctttctcat gtttatgtga 6121 actaaaatga aattttgttt gactctgctt tagtaatcat tccttgaagt agttggagat 6181 ggtgctggta tgccactgaa tgaggtctga gcaggttttc ttcacatctg aggggacagt 6241 gccagccagt caacttttgg ggtggggctg aagtctgctg aaaatctgca gttttacatg 6301 tttcatggga cattcttctg tgcaataaag tttgagaagt atcattcttg atatatgggg 6361 ctgatatttt caacttctca tcatctgaat tttagagtga cttttcacct gcgttcctta 6421 tttcagctgt ctagtctctt ccacacttgg gaatttctgt ttcctgtgtg ccgtttgtct 6481 ccacttaatc ttcacgctcc tctgtttttc aagtctccaa gcctcactat cctcctctaa 6541 cctttcagtc tgtctctctt agccttatct cctgacttga tctttttgtt aattatttac 6601 tgttttcctc ttcgctgaag gcagttgcta accctatatg aactggatct ttactcctta 6661 aggcatgaag ttttagaaaa gtcttaaatg acacttgctg ttatctctgc ctgcgtgtga 6721 gtgatgctaa cccatgttgg taattcaggc ttttctttca ttggacacgt ttcactgagg 6781 cttagaaata accatttttc taagtaccat gacaaatacc acatgggaga agctgctttt 6841 attggaaaac tttcatttat taggggcata ctgaattttt atgtcatgtc tttggttaac 6901 acaggaggcc cccagcttat cttcagcccc ttcttgtcaa gactcctgtg ttctgctttt 6961 aggctaacat tggctgtgac ctgacttggc ttgttcattt cagaaatagc cactttctga 7021 catgtttttc taatttacat ctttgtattt tgatgtaatt tctccctttc tcaaagtaaa 
7081 agaacattat aatgatttcc tagtggccaa ttttatttta tttttattct tacaaagttt 7141 catccatacc tagccacatc attttgacta tctccctaac catcttagaa tctaccatct 7201 ttctgacctt tgttttctga ctaccatttg taaaccactt gcctatgttt tattcagata 7261 tcccttatat catgatgtta tactgttatt cagagtaatt tcattttttc ttttttgaga 7321 ataggcctta acaccagtgt cagagtggtt tatagatgtg tgctagctat tttagttcat 7381 taaaaaaaaa atttttttta ggtttctttt tcccttaata gataatgaat gatatttaca 7441 ttttctgttt atttcagatg atgatggtag gactgaaaat ggagcactag ctccaaagca 7501 ggagcttcct tcagcattag aatcccatga agttcctggc actctcagta tgggtgttcc 7561 tcaaattttt aaatatggag aaacctgttt ccccaagggc aggtttgaaa gaaagagaaa 7621 tccctctcga aagaaacaac atatatgtga tgaatgtgga aaacacttca gtcagggctc 7681 agcccttatt cttcatcaaa gaattcacag tggggagaaa ccttatggat gtgttgagtg 7741 tgggaaagca ttcagccgaa gttccattct tgtgcaacac cagagagtcc acactggaga 7801 aaaaccttac aaatgtcttg aatgtgggaa agcctttagc cagaattcgg ggcttattaa 7861 tcatcagaga atccatactg gggagaaacc ttatgaatgc gttcagtgtg ggaaatcgta 7921 tagtcaaagc tcaaatcttt ttagacatca gagaagacac aatgcagaaa aacttctgaa 7981 tgttgtgaaa gtttaagaaa ttgaaaaaaa aaaaagaatc agcactcagg tctttttctt 8041 cagaaatgaa gacaaaattt aaaatatgaa atgatgcaga atagtttttt ccctattgac 8101 tgtcagaaaa tccactggga aatgtaaaaa tcttcactca ccattatgat atttatcttg 8161 aaagaaatgg tgtcatacct gcctagaaac tgaaatttta aacttaattc aggtcttaat 8221 gcctaaattt tccatgtgat gtttatagtc tgtattactt ctccaaataa tgaactacct 8281 gattcattgt ccctttcttg aaagtttctt ttttaagaca aatacattat ttctgcattg 8341 atcattgaaa tgttctttat atggatacat tcccttatat attaaaaggc aacaggaatt 8401 acaaagtctg aaaaccattt taaaccatct tttaaaaatt tacccttatt tccttttacc 8461 taatttgaat atgcatttga gaaaataaga ggataaagga tggctaagag cctcaaaatg 8521 aaccataaga tctcagataa gaagtgatgg tgataaaaca tcaaaaagtg aatggaacac 8581 cttgattggg gaagataagc aaatactttg atctaagaat tgaaatgtat caagaattta 8641 tattttgcct gcaggaaaat tcaaaagcta ctcatcctct ctataatttg gagtcatctt 8701 actaatgaaa ataatgtttt cccatatatt attaaaaagc atacagtcta aataataaac 8761 
agttgtaaaa taatgaaggt agaaattata acactaggga aaaatttgta gcggatggca 8821 gtgttgaagg caaatgtaaa cataagggta atggtctgtc atggctttta gaaaaagatg 8881 atagagttca tcattatttt gccttcatct ttgttaagga cagaaaattc cctgacaggt 8941 gggcaagtat caggttacct attttttatt cctttggtac aaaagggttg aacgtcaggc 9001 taaaaaagca gccatgcatt tattattaaa cattttctac cgacaaggca ctgtgctagg 9061 tactgtaatc ctaccataag taggtaggta tttcttccac tgtaaatcat aggggtttgc 9121 tgttttatgt gagttagcct cttccccttg tctgagcatt cctcagggga ggtcacctgt 9181 gaggttccca gaactgtagt tttttttacc agggtgttgt atttggaggg ggaggaggac 9241 tcggctcaaa agagctagct ggctctccag tgttcagagg tgagtccacg atactcttac 9301 cacaatttgg aagtttgtga atctttttaa agaactaatc aatctctaat agcattgagg 9361 ttgtacctac atattaagtt gaatggactg ttctatttaa aaaataaaca actagacaat 9421 taactagttt attaacctat cacaattgaa ttttttttta attttcagtc ttaacacatt 9481 ttttaaaatg tattaaagta atacattgta gtagtaggat tatatactcc ttggctgaga 9541 attccaagta ctgtggttct actgttgagt ggaaaactct ggaagttaaa atatagaata 9601 tgagaggagg cttttttata atgggcatca ttgtgtggaa aatgacccat gtgaatacaa 9661 atatttccta gttcagagat tttggttata tctggtgctt // LOCUS AF016898 17004 bp DNA PRI 10-DEC-1997 DEFINITION Homo sapiens B-ATF gene, complete cds. ACCESSION AF016898 U94771 NID g2668606 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 6536 to 17004) AUTHORS Dorsey,M.J., Tae,H.-J., Sollenberger,K.G., Mascarenhas,N.T., Johansen,L.M. and Taparowsky,E.J. TITLE B-ATF: a novel human bZIP protein that associates with members of the AP-1 transcription factor family JOURNAL Oncogene 11 (11), 2255-2265 (1995) MEDLINE 96112626 REFERENCE 2 (bases 1 to 17004) AUTHORS Taparowsky,E.J., Meyer,N., Tae,H.-J., Budde,P. and Johansen,L.M. 
TITLE Direct Submission JOURNAL Submitted (01-AUG-1997) Biological Sciences, Purdue University, Lilly Hall, West Lafayette, IN 47907, USA FEATURES Location/Qualifiers source 1..17004 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q24" /note="fragment B" exon 2704..3007 /number=1 mRNA join(2704..3007,5339..5443,16180..>16695) /product="B-ATF" CDS join(2945..3007,5339..5443,16180..16389) /codon_start=1 /product="B-ATF" /db_xref="PID:g2565253" /translation="MPHSSDSSDSSFSRSPPPGKQDSSDDVRRVQRREKNRIAAQKSR QRQTQKADTLHLESEDLEKQNAALRKEIKQLTEELKYFTSVLNSHEPLCSVLAASTPS PPEVVYSAHAFHQPHVSSPRFQP" exon 5339..5443 /number=2 exon 16180..>16695 /number=3 polyA_signal 16690..16695 BASE COUNT 4746 a 3880 c 3804 g 4545 t 29 others ORIGIN 1 aataggtgat aaataaatgt taatttaaac cgaatgaaac ataaatatga ctattattgt 61 cctgaaaggn gctttcaata cactaggata atgcagctca taagtgccat aaatgaagag 121 caaagtgaat tcacttatgc attcattcgt ttgaaaacca tttactcggc atctgttagg 181 tactgagcat atgactataa acaaggagaa acacagactt tggtttttac agggcccact 241 tctcaccact ttcatcactg tctaactggc ctatgctgcc agcatctttt gtttggatgg 301 ttgcagtaat ccaatactga aatcctgact catctccctg cttccacttt gctcccctat 361 cgtaaaatct caactgtgtc cagagagatc ctctaaaaac ctaagtcaga tcatgtcact 421 cctctgctct gacacctcca atggcttctc atctgtagga gattgtagaa attgccacaa 481 ttagttaaat ttttcctgtc gagaagtgaa gtctgtttct caactcattg agtctgaact 541 ggccttgtga ctttcctcgg ccaatagaat gtggcaggtt ctgagttctg tgctagctct 601 gagcctgggt tccatggggc ttcgaatgct tctgctccta ctcttggaat cctgccacta 661 tcaaataaac atgctccgac cagcctgttg gtggatgaga gaccatgtgg aggagaaatg 721 ccaagcaagc caggccagcc agtcaattat cagccaacct accagctgtc tggagacaca 781 tgaccgagcc tagccgagat cagcctagcc cagcctatcc ccaaagaacc tttcagatga 841 cccacagact catgagccaa ataaatggct tgagtcatcg aagtttctgg tagttttttg 901 ggcagcgaac actgatagac tatctcattt agacaaaaag tcaagcttct ttttactaac 961 tgtatcctca gtacttagtg cctggcacat ggtagatgca ccatgaacgt ttgttgaata 1021 tattaataaa ggtgtaaatg gttttaacat tctattgagg gaaacaggaa 
aaagataaat 1081 cagtaaacga ataaaataag tataaaattg ggtaactgct atgggagaaa tgaatagtag 1141 ctgaggtaga ggataagaat gaaggtggaa ggggaaaccc tccatgggct cactgtgtct 1201 ttcagaatcg aaagctaaaa tcctgatgtt ggcctctaag gcccgttccc tgaccgggcc 1261 attacctcac tgacctcatc gcctgccact tttgctgcct ctggtatcca actacctggc 1321 ccccctgctc tttttccaat actcaagcac acttccacct caggtcctgg aatgtccttc 1381 cctgagacac tcaccattca ctccctctct ttcttcgaat ctctgctcag gtatcccctc 1441 tttgggcaag gcccttctga ccacccagaa ttccctaacc tccaaccctg ctttattttc 1501 ctccatagga aaggaatttt gtcttactta atttaacctt agggcaatat tgtcttagtc 1561 tagaaattca gctcatggat acaaaaatcc tctaggagcc gtaacttctg tcaatcctaa 1621 aatttaataa ggcaaatgaa aggcttcgag ccatactggg aataaggtaa ggccttttga 1681 aaaagccacg atgcccaggt gagggttgaa ttgatcactg agaaccctac aaatgcgctc 1741 atggcaaggc tgggaatacc ctacacttac cctctcggtc atttacttca gagtctgcgg 1801 ggcattctaa tccgcgttgt catttgcttt gtgccacatt cctaagaggt aggcagggca 1861 ggttttatca tcctctgttt acagatgaag ataatgaagg tcacatagct caagtgagct 1921 acccagggcc acagacatac aaagtggtag aaccagaatt taaatcttgg ttggtctgac 1981 tccaagtgta gtatttgttt cactttttct ttctttcttt ctttctttct tttttttttt 2041 aactgtttct ccattctggg ttctgaccag gtgaccttga atcataaaga ttcagggcct 2101 gtcttgagct gctcaatatg gaaggcatct tttctctaaa gagtgtgagc tccctgccaa 2161 gcagcacagg cgcttaatag gctttcagag cagcaattct agaacctccc ttgtggcttt 2221 ctgatctccc acagcccttg tgtttgatgg tcaggaaagg agaataccct tggatcgtct 2281 tatctccaga ccaccccgag gctggctgga cttaaggggt gagaaaagac aagatgggtg 2341 agcagctgct ttcggctgaa aaccagaaag aaatcccagt gtttttgctt tattttggtt 2401 taaaaatcaa gtcgtatgac aatggaaaca ttttcctggt tgtcctctcc tctctgccca 2461 tgttgacaga agacggactt gctaattctg gctggggttt ttccctgcct gggctgtttt 2521 tctcttttat cttcatcaat tgaagagggc tgctgtgtct ttcaaactgt cggaaggtat 2581 ctgtattttt ttttctttca cataattctg taggtctttt cccagatatg tatttttctc 2641 tctaacttct aactcgctga agtttccgcc catgtgactt ccgacgtgag ttaccagaaa 2701 ccacaagaga gagagagagg tgcaagcccc aaagcgagcg acatgtccct ttggggagca 
2761 gtccctctgc accccagagt gaggaggacg caggggtcag aggtggctac agggcaggca 2821 gaggaggcac ctgtaggggg tggtgggctg gtggcccagg agaagtcagg aagggagccc 2881 agctggtgac aagagagccc agaggtgcct ggggctgagt gtgagagccc ggaagatttc 2941 agccatgcct cacagctccg acagcagtga ctccagcttc agccgctctc ctccccctgg 3001 caaacaggta gagtcctcct ttttctctct cctaccttct gattctcctg ggggatggaa 3061 agagagccag gcttccttgt cctgcccagg gagctgagga tggaggaagt ggctcgttgc 3121 acgggcactc tgttagactt aggacatgga atttgctact aagctgtgca tattggcaga 3181 gatcctcatc cttccaccca ttcttccaaa gccccttttt ctctccattt tccaaggctg 3241 ctatcacctc tgcctcactg gggttgccac cctaaaaagc tttctaggaa gcaaagagga 3301 ggatgaacat caaagaatgc agagaaaaga gtctactgtt ctccaaggct gtagaaaagt 3361 gagaggagtt ttcaagtggg ccgctggcat gcacagctca aatcctagat atgcagtttc 3421 ccaggagaca gccaataggg aaggggagaa aaggggttag ggaaaagtta aaaaaaaaaa 3481 aaagaagaag aagaaggaga ggaagaagga ggaggaggag gagaaaccca ggntagcata 3541 gaaggatctc tctactgggg cactgcctct gttgctagca cagccagcaa ggggagatgc 3601 tagaggtcca gccaatatct ctttgnttgt ggnttctgga taggagcaag gaaatgaatg 3661 aggaggtggc tctggaaatt tgttggaaga catttgctgg tagttcatct tagctctggg 3721 gaaaccagag agactgagat agggattgtg ggagcatttg gagtctagat tgtgacttcc 3781 aagtccaaag gttttagctt gaggattacc aaaaaaggag tgccctgccc atctcacagc 3841 ttctagggtt ttgttctgca ccacactctt gacatccttg ctgtaggaag ccctcctctt 3901 tctcatccga catgtatccg tgggttaaaa ttttctcaac cttcatagtt taaaaacttc 3961 attctttcct tagcacagat atccacaagg gttttcaaaa tgctgcagcc aaaataaaag 4021 tatataaata ttgaacaggg cgtatatcct aagcctctaa taacggagca gcaggctggg 4081 atgagtctgg gtggagacct ctagaaggca cagaatccag tctgctgctc aagatggtgt 4141 cagagtggct ctgactgcct ggaagtttgt acttttctta ggaaggaaat tactcatggc 4201 ttccatggta acctggtgcc ccatccccct tagaggaaga gcacttcctt cctatctatg 4261 ccaatgacct cctaccacaa aggcctgaaa gtattggaca tgttggtgag cccagtactt 4321 agtgatccca gagctttgcc cttggatctg tgcccctcta gcctgtggga acgtggaagg 4381 aacagggatg agttgccaga ctcctacacc tacactgcag catctgggag agaactggag 4441 
ctaacctctc tgcccaggca caggacatgc ctctctctgg tgtaattaga gaacacccaa 4501 agatcccctt atcctaccaa gcacccatct actgcccacc acagccaggt cacctccacc 4561 attaacgttc tttttggcat tgatgtgcca ggcattgtgc tggggccatg gggaatgtca 4621 cagtggctga gcacaacgga aggaaagagg aggcagatct gggatgcacc ctcagctcag 4681 gaatgcagca gaagaccaca tggggtgcag agtagctagc ggggaagggt cttggaaagc 4741 cattgggctc agcaccagaa gatgtagcac tctcagatga ccaggagtcc ttgggcacac 4801 tcttagagcc tcagtttcct cacatataaa atggaaatga tgatatcaac ttcacagcat 4861 tgggaattaa acaagagcac ctctgaaaga acacttgccc agaggttggc atatagtagg 4921 cactcaacaa agtctgaaaa aaagaaaaca gctaaaagaa cagcaagaag agcctaaagg 4981 ccaaaaagtg agagggcgag ggtggagcag ccctacagag gagaggtgat aaaacacact 5041 tcatcgnntt aaccattttt tttttttttg gccaaataca gctctattga gtcctcaagt 5101 tctcttttct ttttcctgta gctattggtc aaaaaagtca gagacctggg aatgtgtaag 5161 caggcatgga gaagagctct cttcgggtgc aggcttttgg taaatagagg gatggagggt 5221 tcatgcagag acgaaggcat ggaaggatgg ctggacggat accacacacc accagagcct 5281 gctttcactc agaagggaga tgcagaccca tgtaatcctc cccgtctctg cttcccagga 5341 ctcatctgat gatgtgagaa gagttcagag gagggagaaa aatcgtattg ccgcccagaa 5401 gagccgacag aggcagacac agaaggccga caccctgcac ctggtaagtg ttcagatcaa 5461 tcaacatcca cgtgagcttt aggctttgcc ctccgccatc tgggaaccct tggaccatag 5521 ctttgattct gcttgtgtgg ggatgtgtat gtggggtggg ttagtggagg gtgagggccg 5581 tggggtgagg tagggaaggg aagctgtcgt gagagtgctt taagaaggct cagtagtccg 5641 gcgtggtggc tcacacctgt aatcccagca ttttgggagg ccaaggcggg tgggtcacct 5701 gaggtcagga gttcgagacc agcctggcca acatggtgaa acctcgtctc tactaaaaat 5761 acaaaaaatt agccaggcat ggtggtgcat gcctgtagtc ccagctactt ggaggctgag 5821 gcaggagaat cacttgaaca ggcagaggtt gcagtgagcc gagatcgcgc cattgcattc 5881 aaacctgggt gataggagtg aaacttcatc tcaaaaaaaa aaaaaaaaaa tagaaggctc 5941 tgtagcaggg aatgggccag gaggctaggc agagaaatgg gcctttctac cccaggaggg 6001 ctcaaggggt gggagaaggc agagctgtat tgggaaagaa tttagaaggt atcaagtgct 6061 ttggggacgg cccattgtct cgcagtgcaa ggaccacaca tagtcgcatc ccttggcaga 6121 gacccaaatg 
cccctgcatt tgacacaagg aaattcacag ggngctgtac cagattcctc 6181 ttttcagagc attattaaag tcttattttg aaatttctgt ctttaatggt atcccaaggg 6241 gtctatgaaa gatgcaattt caagtgccct tatctgggat gatcaccatc ctagttgagt 6301 tgcctctagt cccaaatttt tctagagaaa gcttttacct ctgtcttgag tggtccaggg 6361 gccttgagca gtgctgaact gcagttccgc cagtggccaa ggcccnagat ggtgtttcta 6421 ggttctggct gtgcccctta taaagtactg cttcctcccc attgctcata ccttaccacc 6481 ttacctctct tcactctgcc cccagtatgg ttcttgttca accaactctc actagttcct 6541 ttctctccat cccctcatcc ccttccctag caactctacc ttcctgccct tgctccagaa 6601 acattccctt acctctaata gcctagcaat tcctttccat ggaaaaagtg caattggcac 6661 caaaaatcaa taagcttgta tttatctgga agattagcat gataatcagc cagttagaat 6721 ggctgtgtta accgtataag aataagtaag taagatgtag ttttagggga aataatgcat 6781 ctttcagctt tacaaggaaa cagagcagac tgcaccaaca gctctgattg ggtgcttggg 6841 ttttatttga aatgtattca gcttacactc cattgcacac taacattaaa ttcacagcta 6901 tttgttaact ttttggaaag aatgaatgaa cagaaagtct gtactaactg tgctcaacag 6961 tctgaatgtc aagtttttgg tggccctgct gcttgtatgt tctccatgat ttcactgtgt 7021 ccccacattc tggtttacca ccttaagctt gaacatctat gatccttacc tctttctcta 7081 tcagagtctg attctgattc agtattcttt agggcaagca aacagaccaa tcaaatttaa 7141 aaggataaaa tcttccttgt gtacaagtag acaatgtcct cagactggca aactgctgta 7201 agctttttaa gtgtgtacct gttttcttaa gtggaatgta agtatagttc ggctgctcag 7261 cagtcagaac agggattgaa tcccagccct gctgcctact atctgtgtga cagagagctt 7321 cttggcctcc cttgggcttg gtttccttat ctagaaaatg gaaaatgaca ataactaccc 7381 tatacagatc gtacatttaa agagcccaac acatgcctgt cattcataaa tactcaaaaa 7441 tttaagataa tcacaaatta tacacacatt ttatatatat atgaaaacat atatacattt 7501 tgtgtatatt taatcacaga agctcagggt tgaaaggttt cttaaaggct agaaagttta 7561 gacataagtt tgatgcttca gacttctctg caacacctcc agccaatggt cccattgtat 7621 gcctggtcac ttctagggct ggagatcttc ctctctccca atttatgaac agaagagtct 7681 ttcctatgtt aagctgcacc cctgtctccc tgtaaattca actcactggt cttcatccaa 7741 acccttgggg aacgtattaa tgagggtgac acagatgtct ctttgctgcg gcaaggcttt 7801 cagcactcaa tcagctctct 
ccctctccaa gacaaatgtc agagcccaaa gagtcagctc 7861 atctaattat acaagtcatg tgacattagg gtaatctcta gagcttgagg gtccacagag 7921 ccccaagtca tatagtcaac atatccatag ccacacccca gcctcagttt ctagaatgtt 7981 ccaatcttcc ccacatctga acacagattc ctcctccact gctttctttt gagcaccaca 8041 gcatcgtctt taccttttgc ccaatgagta ggtcgcagcg gagttgatgc agcaagcctg 8101 cggactgctg ccaatgcaac ttatgtttct tggcttcagg tagaagcttt tggcagcaac 8161 gtgttcaatc tgccctacat caaattagct gtggtttgac atcttccttt cctccgagat 8221 gacaatcacc tatcttctcc atccattatc ctcttaatta tgtctgagat tcaggtgaaa 8281 agtcttctgg tttaccacca gtgtcagggc tcagtgtgtg gtttgccctt gactcctttc 8341 catgcttggc tccacatgct ggggttgttc tctggccctc agacatggcc ataggctcgc 8401 ctaccagtgc tttacagatg agggtttggg agctactgag agtaccatat caaaaaagga 8461 ccacattcct attagggttc tagcctcata tatactggta tgttttctta gtttcaactg 8521 ctgtcactat tatcaacctt aaaatactga aacaattatc ttgttcctct agtgagagtg 8581 tgggatgctt actattcgac attttcagca tctatcctat aaatggctgg caccaagaca 8641 ctgtacatcg attgctttaa aaattgacag agtagggaga gagcctcttt ttatgattgt 8701 gttataaatc cctggccttg actatctctg acataaaatg gatcggtgat tcctttcgta 8761 attgtgacat caacaaacaa tctaattctg ttttaattct ccttttgttg agcatctcac 8821 acaagcaccc tgcacactta gttccttttc tcctgcagac aactgtaggg ttcattgtat 8881 atagtacaga tgttgattaa aaaagagaaa ggcataactt gaccagttag tttctaaaag 8941 atatcaatag ctaataagtg gacctaagat aatcagctga aaaattattt gaaataataa 9001 gagaatgcag tcagcttcta aaattaaaac aaaatgtaat aacgttccta tatgcaaaac 9061 ataaccaatt aggaaattaa tggaaggaaa gatcctgttt ataataccaa tgacaacagc 9121 aaatggaagg aaggaataat ttaacaaaaa ttatgaagag tgtatcctaa atgattttta 9181 aaagattcaa ataattggaa agatatacct tatgtaaagt tagcaatttt ccttaagtta 9241 attgataaat ttaatgagat cccaataaaa aacccaatag ggttttctgg gatttgggca 9301 agaactagat tagctattct aagtttccag tgaaaaaaaa tcatatgaca attgccagga 9361 aaattctaaa gaagaagtat aatgaaaaaa gattaactga ccaacatatt aaagtgaaac 9421 acaaatacag aaatcaaaac aatgtggaaa agtcagaaca ttgaacagga tagaatatgg 9481 tttcagaccc aaaaacaaat tcagatttta 
agacccaaat acatatttca actcctagga 9541 cacaagtata aatacaaaat gggctgggca cggtggctca cgcctgtaat cctagcactt 9601 tgggaggcca aggtgggcag atcacttgaa gtcaggagtt cgagaccagc ctggccaaca 9661 tggtaaatgt taaatttttt aaaaaaagga agctaaagaa acttgaaaag acaaataacc 9721 attaaaaatt tatcttggaa tgaggaatta ttttctaaat aaaacataaa accctgaagc 9781 cataaaggaa aagattgata tacattactt ttattcataa aaagtaaaag tttctttata 9841 gatcaaaata ctgtatgcaa aatcaaaggc aaatgacaag ctgaggaaaa atatctgtaa 9901 tacatatgac caaaaaaggt atgtttactt actatatgag agagctctta tgaataggag 9961 atgattaaca acccaaaaga aaaatgaaca atgggccggg cgcggcggct cacgccttaa 10021 tcccagcaat ttgggaggcc gaggcgggcg gatcacgagg tcagcagatc gagaccatcc 10081 tgtctaacac ggtgaaaccc cgtctctact aaaaatacaa aaaattagcc aggcgtggtg 10141 gcgggtgcct gtagtcccag ctacttgaga ggctgaggca ggagaatggc gtgaacccgg 10201 gaggcggagc ttgcagtgaa ccaagatagc gccactgcac tccagcctgg gagacagagc 10261 gagactccgt caaaaaaaaa aaaaaaaaaa gtaaagaaaa atgaacaatg actatgaata 10321 ggcaatttac agaaaagaga aataagtngc caataaatgt gtgaaaagat ggtagacctt 10381 atttagttag gaaaatgcca attaaagcag cagtgaggct ggcaaagatg aaacatgttg 10441 ttgatacacg atgttgccaa ggtaatggag aagcaggcac catcacacac agctagtagt 10501 agagtctgca tggccctggt agaaggcaat ttgcccacgt ccattcaaat tataggtgca 10561 cataccctgc gactcagcaa tttgacatcc aggagtgtgt tcacaaacaa atccacacat 10621 catacattct gtacatgtaa agataatcat tgctgcattg tttcttatag gaaagactga 10681 acataagcta aaaaaatcca ccaataaggg attcaataaa ttattgtcca ttctgtggaa 10741 tattgcagat atgttaaaat aatgagttac atctgaatgt gctaatctgg aaggattatc 10801 acttctgtac tttaagttaa aaaggcataa atatagtcca tagtttgcta ccattagttt 10861 tttaatttaa tgttttatct atttatttat tttttatttt tagagacagg gtctcactat 10921 gttgcccagg ctggtctcga actcctgagc tcaagtgatc tgccctgccc accttggcct 10981 cccaaagtgc tgggattaca ggtgtgagca acagcaccca gccccttctg tgtttttaat 11041 aaaaagaaaa agttacatat atgcataagt ataaatgcaa actttttcaa atagattata 11101 aaaaattgtt aatgatattt atttctgaaa agtaggacta aagatctaaa gcaagaggga 11161 gatacttggc attctttacc 
ttttatacag tttgggattt tttatcacat gcttttatct 11221 ttataatata gatatcaaaa ttctcagcat tggatcatta ccacacatgc attaggagga 11281 tttcttttct aattttgaca tcaaacattg aataatcttc catcacgaca tggaagagta 11341 ttcaatcttg acgttatctt ttgaacatca tacataaata tctcagccct tgggagaacc 11401 aactgtaagt tgggatttga tgactctaag gtccacagta ctatatgata aaagatggca 11461 aatccgaaac caaagtgtag ccatcttggt tggttggtca tggctatctg gtgccgtaca 11521 gctggaagga tttcaagtca aagaccagga tcagtgnaaa agagagctga ggtcaattaa 11581 gaagctttat tcatgaaaag gctaaaatca gtggagttgc cgggagcagt ggcaaatagc 11641 agtgttagcc ctgagccgag ctgttgacaa gtcatatctg gcctggtgga tggagctgaa 11701 agcaagtgag agtaacagcc aagtttgaaa gcttgagatt taggtactgt cttagatgga 11761 agtgggccag gtgcttgcct ggggctgcct cctgaaggag tctcacacca gtataagaaa 11821 aaagagaggt ttacaggctc cagggatgtg ctttgtcata tatgacagaa aagtcatgag 11881 tataaattat gcattaaatg aantttgagc taacactgtg agaaatacgg tatttacaat 11941 tggagnaaaa ttgagtaatg gaggaaaaga tgtttgggta gcactattgt tcttggcctt 12001 ctgaggaggc agagaaggat gcttacggtc ctatttcact cctggaattc tgaataggag 12061 aaagatgggt acaggatgcc caaatgcagg ctcaccccca cccctctcag tagcagaaga 12121 catgcatatg ggagaagctg catttcctcc cagcctcact gtgctctagg tgtccagccc 12181 tcattgttag gggatgtcac tgttgtacct ctggataaac agctgctctc atgccacctt 12241 tgccagttgt cccttcagct gggcgagccc tttccagggc atgactgccc caaggagggc 12301 cagaggggac tgggcaatag cagggacatg gagggacgag gccccagatc agccacttcc 12361 tcctgcaagg gaagtcaaac cctttctctc ataagtgtca ggctggctgc acggtccctg 12421 ggctggcagg gcaggtttct tgggggccct agcaggaagc actcttgctt cccaaggtaa 12481 agacaagttt cttatcactg gcggtggttt tctggttaga ggcagcaggc agcggttaca 12541 aactttgcct agagctctgg gatctgcaag ccagggaggg ggaggcagac gacagagcta 12601 gagtgagctc agcagagctt gcttagggag gaaggacccg ctgctacttc ccgaaagctg 12661 ggccagcctc taggcgggga gaaggtcaga gtggccctcc cttctgcctc ggggtttgcc 12721 caggctcttc agagcacaga atttacagta aatgttgtca ccagtggaaa ctctcagagg 12781 gggcgaagaa aagggctaga ggaaggagcc aggactcccg gcccccagtt cccgtcttac 12841 
caacaccccc tagtctcatc gagttgcttg atcttacctt tcggtaaaga gggatgccct 12901 ccttctggga tgtgccagga aatacccaaa tggtaaaatg aatgagccgg ccacaccttg 12961 cagaaggcag ggctgtgact ccatcatcgt ggctcttgat aactcctctg ccccgtttta 13021 gaggggcttc ctttgtactg accttggcac actttctagc tcagagcccc attcaagaaa 13081 tgccaggttg aggctgagca tgcttgtgtc tgtccttcac agatgatgga acctggcttg 13141 atgagcaagg ccatgttggt atatgcttct ctgccattta gaaagacccc ccccaggatg 13201 tggggtttaa gatgattgac agtataaccg attaacctca ttagcctaca gctcatgcac 13261 ctgatggatt cgttagggca aattttattt tattatttta ttttattttt tgagagagga 13321 cctcactcca tcagtgaggc tggagtgcag tggcatgatc atggctcact gcagcctcca 13381 acccctggtg caccatagct aattttcttt tatttttatt attgtagata ctgggtctca 13441 ataatgtgca caggctggtc ttaaactcct gggctctcag gaggtagagg ttgcagtgag 13501 ctgagatcgc accattgtac tccagcctgg gagacagagc aagactgtct caaaaaaaaa 13561 aacccaaaaa accaaaaaac tgctgggctc aagcgatcct ctcacctcgg cctcccaaag 13621 tgctgggatt ataggtgtga gccactgcac cagcctattt agggcnnntt ttaaatgtca 13681 agttatctta catctgctca ccacccaccc tatttttgag ttaccttctt cattccagca 13741 gatgatgtca taagtctgtc agtcaacaaa catctgagtt catactatgt gccaggcatt 13801 gagccaggca ttaacaaata ggcatttaat aagtgcttga ataattaaaa caaatttaaa 13861 ataatattta actagtgctt tccaacttgc ctgatcatta gaatgacctg tggcactagt 13921 taaaaatcca aatttcactc cctagtagtc tagtggctag gatttaaaaa aaaaattatt 13981 attatttttt taattcagct ctgcagccag gcatggtggc tcacgcctgt aatcccagca 14041 ctttgggaga ccaaggtggg cagatcccct gaggtcagga gttcaagacc agcctggcca 14101 acatggtgaa accccgtctc tactaaaaat acaaaaacta gccaggcgtg gtggtgcctg 14161 cctgtaatcc cagcaactcg ggaggttgag gcaggagaat tgcttgaacc tgggaggcag 14221 aggttgcagt gagccaagat cacactactg cactccagcc tggaggacag agtgagactg 14281 tctcaggaaa aaaaaaaaaa aaaaaaaaaa attcagatct gcagacttca ctccgcaaat 14341 cccaaaacca ctgattcaca ttcctagatt ttaggatggg cctgaaattc tattcttaac 14401 aagccttcaa attaattctc atgatatggc aagtttggga aatactgacc taatccaaat 14461 agatttagaa aggtaagacc tggtctccaa atatcaggtt gccttttgaa 
ctgaaatttt 14521 tttttttttt gcatctccaa tgatatttgc ctttcctctg tggttaccaa tgcttagcag 14581 agaaagtctg tttggttgag tcgaataatg taaaccttaa aattataccg gaattgcgat 14641 ctcctgggct ttctcgagga tcgggattgc ttttccataa ttgctaatta ccagattgca 14701 gttgttgatt aattaatgat gagatcagaa atcagcactt cttagcaggc atttttctga 14761 ttagcatgta tttgatggaa cccccaagca acagggaaaa gagtttccag aaaaaagaga 14821 agctcacagg caagggacac ccccaaagta ccgtcagaat gctttctggt gggccggaca 14881 gggtggaaca aacgttgtag cagactttga ccatttagta tttatttatt tcatttttat 14941 ttttattttt atttttgaga tggagtcttg ctttgtcacc caggctggag tgcagtggca 15001 tgatctcggc tcactgcagc ctctgcctcc tgggtttcag caattctctt ccccgagcct 15061 cccaggtagc tgggattaca ggtgtgcacc accacgcctg gttaattttt tttttttttt 15121 tttttttgta tttttagaga cagggtttca ccatgttggc caggctggtc tcgaactcct 15181 gacttcaggt gatccaccca cctccgcctc ccaaaatgct aggattatag gcatgagcca 15241 ccggcacctg gccccattta ctatttataa agaccctgag cattctagaa tctgggtgca 15301 gtagacaagt atccagtgac tttatttgta agacgaaatg cagacaccgg gagcttttgg 15361 aaaacacgtg gcataacaca gtggaaaaga gtgaaactgg tgtaagacgt tctcacaacg 15421 accttccatt tccttaggct ctggcctgcc agctgctttc ccttgctgta ttcccgccac 15481 ctcccaccca cacaccctca cagagacccc tacatgcata cacgcagagg aacacactca 15541 cacacacccc tctatcaggt tgaaccatat gaaattgctt ttttgccggg aggagagatg 15601 tagtcttgct gtgttgccca ggctggagtg cagtggctca atctcggctc actgcaacct 15661 ctgcctccct ggttcaagta atgcttctgt ttcagcctcc tgagtagctg ggactacagg 15721 cgcactgcca cacccagcta atttctgttt ttgtatttta gtagagacag ggttttgcct 15781 cgttgcccag gctggtctca aactccccac ctcaggcaat ccgcccgcct cggcnnnnna 15841 aagggctagg attannnnnn taagncncng cgcccggcct gaaattgcta ttcttatagg 15901 tcacaacaga caaatatcag cagcttcacg tggttcagcc tcatatctat gtgtatacat 15961 gcacacccac ccatgtgctt acgtaaacgc gcccatacac acctacaaac atatatacac 16021 ttgtgcacac attcacacgt gcaggaaaat tctctggagc ccttctccat ggctgtgcta 16081 gtgaacaact aatctggccc ctcctgtgtc cctccgcacc ccacccacct tgcctccttc 16141 ctagacacta acctccggtg ctgatcccca 
cccctacagg agagcgaaga cctggagaaa 16201 cagaacgcgg ctctacgcaa ggagatcaag cagctcacag aggaactgaa gtacttcacg 16261 tcggtgctga acagccacga gcccctgtgc tcggtgctgg ccgccagcac gccctcgccc 16321 cccgaggtgg tgtacagcgc ccacgcattc caccaacctc atgtcagctc cccgcgcttc 16381 cagccctgag cttccgatgc ggggagagca gagcctcggg aggggcacac agactgtggc 16441 agagctgcgc ccatcccgca gaggcccctg tccacctgga gacccggaga cagaggcctg 16501 gacaaggagt gaacacggga actgtcacga ctggaagggc gtgaggcctc ccagcagtgc 16561 cgcagcgttt cgaggggcgt gtgctggacc ccaccactgt gggttgcagg cccaatgcag 16621 aagagtatta agaaagatgc tcaagtccca tggcacagag caaggcggca gggaacggtt 16681 atttttctaa ataaatgctt taaaagaaac cagtctgaca aggcagtttc tctctttcac 16741 cctgtccctt gacactctaa attctgcgat gttgggacct tctttgtttt ttggactcta 16801 gtaccctcaa agagctctag ggactccact aacagctcct cttttccatg gtttattgat 16861 gtacttggga agctttcaat tccttctcta ggtgtctcct tctccttctc cttctcctcc 16921 tcctcctcct cctttctcct cctcctcctc ttctccccct tctcctcctc ctcctccttc 16981 ttcttgacag aatttctcag atct // LOCUS AF017178 18609 bp DNA PRI 01-JAN-1998 DEFINITION Homo sapiens pro alpha 1(I) collagen (COL1A1) gene, complete cds. ACCESSION AF017178 NID g2736080 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Chu,M.L., de Wet,W., Bernard,M. and Ramirez,F. TITLE Fine structural analysis of the human pro-alpha 1 (I) collagen gene. Promoter structure, AluI repeats, and polymorphic transcripts JOURNAL J. Biol. Chem. 260 (4), 2315-2320 (1985) MEDLINE 85130970 REFERENCE 2 (sites) AUTHORS D'Alessio,M., Bernard,M., Pretorius,P.J., de Wet,W. and Ramirez,F. 
TITLE Complete nucleotide sequence of the region encompassing the first twenty-five exons of the human pro alpha 1(I) collagen gene JOURNAL Gene 67 (1), 105-115 (1988) MEDLINE 88329734 REMARK Erratum:[[published erratum appears in Gene 1988 Nov 30;71(2):501]] REFERENCE 3 (sites) AUTHORS Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T., Jaenisch,R. and Prockop,D.J. TITLE Structure of a full-length cDNA clone for the prepro alpha 1(I) chain of human type I procollagen JOURNAL Biochem. J. 253 (3), 919-922 (1988) MEDLINE 89025644 REFERENCE 4 (sites) AUTHORS Maatta,A., Bornstein,P. and Penttinen,R.P. TITLE Highly conserved sequences in the 3'-untranslated region of the COL1A1 gene bind cell-specific nuclear proteins JOURNAL FEBS Lett. 279 (1), 9-13 (1991) MEDLINE 91138770 REFERENCE 5 (sites) AUTHORS Westerhausen,A., Constantinou,C.D., Pack,M., Peng,M.Z., Hanning,C., Olsen,A.S. and Prockop,D.J. TITLE Completion of the last half of the structure of the human gene for the Pro alpha 1 (I) chain of type I procollagen (COL1A1) JOURNAL Matrix 11 (6), 375-379 (1991) MEDLINE 92157916 REFERENCE 6 (bases 1 to 18609) AUTHORS Korkko,J.M., Earley,J.J., Nuytinck,L., DePaepe,A., Prockop,D.J. and Ala-Kokko,L. TITLE Analysis of the COL1A1 and COL1A2 Genes by CSGE and DNA Sequencing in 12 Patients with mild OI (Type I). Identification of Common Sequences for Null Allele Mutations JOURNAL Unpublished REFERENCE 7 (bases 1 to 18609) AUTHORS Korkko,J.M., Earley,J.J., Nuytinck,L., DePaepe,A., Prockop,D.J. and Ala-Kokko,L. 
TITLE Direct Submission JOURNAL Submitted (04-AUG-1997) Center for Gene Therapy, Allegheny University of the Health Sciences, 245 North 15 Street, Mail Stop 421, Philadelphia, PA 19102, USA FEATURES Location/Qualifiers source 1..18609 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 273..279 /gene="COL1A1" gene 273..17776 /gene="COL1A1" mRNA join(<420..522,1986..2180,2322..2356,2468..2494, 2585..2686,3409..3480,3708..3752,3911..3964,4128..4181, 4680..4733,4850..4903,5234..5287,5376..5420,5537..5590, 5705..5749,5928..5981,6239..6337,6426..6470,6574..6672, 6804..6857,7076..7183,7278..7331,7457..7555,7721..7774, 7863..7961,8858..8911,9055..9108,9212..9265,9377..9430, 9882..9926,10020..10118,10416..10523,10982..11089, 11310..11363,11527..11580,11799..11906,11995..12048, 12175..12228,12369..12530,12632..12739,12897..13004, 13112..13165,13270..13377,13758..13811,13924..14031, 14370..14423,14784..14891,14983..15265,15398..15588, 15877..16119,16249..17776) /gene="COL1A1" /product="pro alpha 1(I) collagen" CDS join(420..522,1986..2180,2322..2356,2468..2494,2585..2686, 3409..3480,3708..3752,3911..3964,4128..4181,4680..4733, 4850..4903,5234..5287,5376..5420,5537..5590,5705..5749, 5928..5981,6239..6337,6426..6470,6574..6672,6804..6857, 7076..7183,7278..7331,7457..7555,7721..7774,7863..7961, 8858..8911,9055..9108,9212..9265,9377..9430,9882..9926, 10020..10118,10416..10523,10982..11089,11310..11363, 11527..11580,11799..11906,11995..12048,12175..12228, 12369..12530,12632..12739,12897..13004,13112..13165, 13270..13377,13758..13811,13924..14031,14370..14423, 14784..14891,14983..15265,15398..15588,15877..16119, 16249..16395) /gene="COL1A1" /codon_start=1 /product="pro alpha 1(I) collagen" /db_xref="PID:g2736081" /translation="MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNG LRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESP TDQETTGVEGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQL SYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRG 
PPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGD AGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPT GPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPG AKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGP VGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAGER GSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGP AGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGER GVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKG DRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGAPGD RGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAP GAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAG RPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGL PGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAK GDRGETGPAGPPGAPGAPVAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQG PRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGA PGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQE KAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMC HSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDKRH VWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQT GNLKKALLLKGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPI IDVAPLDVGAPDQEFGFDVGPVCFL" exon 420..522 /gene="COL1A1" /number=1 intron 523..1985 /gene="COL1A1" /number=1 exon 1986..2180 /gene="COL1A1" /number=2 intron 2181..2321 /gene="COL1A1" /number=2 exon 2322..2356 /gene="COL1A1" /number=3 intron 2357..2467 /gene="COL1A1" /number=3 exon 2468..2494 /gene="COL1A1" /number=4 intron 2495..2584 /gene="COL1A1" /number=4 exon 2585..2686 /gene="COL1A1" /number=5 intron 2687..3408 /gene="COL1A1" /number=5 exon 3409..3480 /gene="COL1A1" /number=6 intron 3481..3707 /gene="COL1A1" /number=6 exon 3708..3752 /gene="COL1A1" /number=7 intron 3753..3910 /gene="COL1A1" /number=7 exon 3911..3964 /gene="COL1A1" /number=8 intron 3965..4127 /gene="COL1A1" /number=8 exon 4128..4181 /gene="COL1A1" /number=9 intron 4182..4679 
/gene="COL1A1" /number=9 exon 4680..4733 /gene="COL1A1" /number=10 intron 4734..4849 /gene="COL1A1" /number=10 exon 4850..4903 /gene="COL1A1" /number=11 intron 4904..5233 /gene="COL1A1" /number=11 exon 5234..5287 /gene="COL1A1" /number=12 intron 5288..5375 /gene="COL1A1" /number=12 exon 5376..5420 /gene="COL1A1" /number=13 intron 5421..5536 /gene="COL1A1" /number=13 exon 5537..5590 /gene="COL1A1" /number=14 intron 5591..5704 /gene="COL1A1" /number=14 exon 5705..5749 /gene="COL1A1" /number=15 intron 5750..5927 /gene="COL1A1" /number=15 exon 5928..5981 /gene="COL1A1" /number=16 intron 5982..6238 /gene="COL1A1" /number=16 exon 6239..6337 /gene="COL1A1" /number=17 intron 6338..6425 /gene="COL1A1" /number=17 exon 6426..6470 /gene="COL1A1" /number=18 intron 6471..6573 /gene="COL1A1" /number=18 exon 6574..6672 /gene="COL1A1" /number=19 intron 6673..6803 /gene="COL1A1" /number=19 exon 6804..6857 /gene="COL1A1" /number=20 intron 6858..7075 /gene="COL1A1" /number=20 exon 7076..7183 /gene="COL1A1" /number=21 intron 7184..7277 /gene="COL1A1" /number=21 exon 7278..7331 /gene="COL1A1" /number=22 intron 7332..7456 /gene="COL1A1" /number=22 exon 7457..7555 /gene="COL1A1" /number=23 intron 7556..7720 /gene="COL1A1" /number=23 exon 7721..7774 /gene="COL1A1" /number=24 intron 7775..7862 /gene="COL1A1" /number=24 exon 7863..7961 /gene="COL1A1" /number=25 intron 7962..8857 /gene="COL1A1" /number=25 exon 8858..8911 /gene="COL1A1" /number=26 intron 8912..9054 /gene="COL1A1" /number=26 exon 9055..9108 /gene="COL1A1" /number=27 intron 9109..9211 /gene="COL1A1" /number=27 exon 9212..9265 /gene="COL1A1" /number=28 intron 9266..9376 /gene="COL1A1" /number=28 exon 9377..9430 /gene="COL1A1" /number=29 intron 9431..9881 /gene="COL1A1" /number=29 exon 9882..9926 /gene="COL1A1" /number=30 intron 9927..10019 /gene="COL1A1" /number=30 exon 10020..10118 /gene="COL1A1" /number=31 intron 10119..10415 /gene="COL1A1" /number=31 exon 10416..10523 /gene="COL1A1" /number=32 intron 10524..10981 
/gene="COL1A1" /number=32 exon 10982..11089 /gene="COL1A1" /note="exons 33-34" intron 11090..11309 /gene="COL1A1" /note="exons 33-34" exon 11310..11363 /gene="COL1A1" /number=35 intron 11364..11526 /gene="COL1A1" /number=35 exon 11527..11580 /gene="COL1A1" /number=36 intron 11581..11798 /gene="COL1A1" /number=36 exon 11799..11906 /gene="COL1A1" /number=37 intron 11907..11994 /gene="COL1A1" /number=37 exon 11995..12048 /gene="COL1A1" /number=38 intron 12049..12174 /gene="COL1A1" /number=38 exon 12175..12228 /gene="COL1A1" /number=39 intron 12229..12368 /gene="COL1A1" /number=39 exon 12369..12530 /gene="COL1A1" /number=40 intron 12531..12631 /gene="COL1A1" /number=40 exon 12632..12739 /gene="COL1A1" /number=41 intron 12740..12896 /gene="COL1A1" /number=41 exon 12897..13004 /gene="COL1A1" /number=42 intron 13005..13111 /gene="COL1A1" /number=42 exon 13112..13165 /gene="COL1A1" /number=43 intron 13166..13269 /gene="COL1A1" /number=43 exon 13270..13377 /gene="COL1A1" /number=44 intron 13378..13757 /gene="COL1A1" /number=44 exon 13758..13811 /gene="COL1A1" /number=45 intron 13812..13923 /gene="COL1A1" /number=45 exon 13924..14031 /gene="COL1A1" /number=46 intron 14032..14369 /gene="COL1A1" /number=46 exon 14370..14423 /gene="COL1A1" /number=47 intron 14424..14783 /gene="COL1A1" /number=47 exon 14784..14891 /gene="COL1A1" /number=48 intron 14892..14982 /gene="COL1A1" /number=48 exon 14983..15265 /gene="COL1A1" /number=49 intron 15266..15397 /gene="COL1A1" /number=49 exon 15398..15588 /gene="COL1A1" /number=50 intron 15589..15876 /gene="COL1A1" /number=50 exon 15877..16119 /gene="COL1A1" /number=51 intron 16120..16248 /gene="COL1A1" /number=51 exon 16249..16395 /gene="COL1A1" /number=52 polyA_signal 16644..16656 /gene="COL1A1" polyA_signal 17772..17776 /gene="COL1A1" BASE COUNT 3508 a 5727 c 5160 g 4214 t ORIGIN 1 ggggcacccc tacccactgg ttagcccacg ccatcctgag gacccagctg cacccctacc 61 acagcacctc gggcctaggc tgggcggggg gctggggagg cagagctgcg aagaggggag 121 atgtggggtg gactcccttc 
cctcctcctc cccctctcca ttccaactcc caaattgggg 181 gccgggccag gcagctctga ttggctgggg cacgggcggc cggctccccc tctccgaggg 241 gcagggttcc tccctgctct ccatcaggac agtataaaag gggcccgggc cagtcgtcgg 301 agcagacggg agtttctcct cggggtcgga gcaggaggca cgcggagtgt gaggccacgc 361 atgagcggac gctaaccccc tccccagcca caaagagtct acatgtctag ggtctagaca 421 tgttcagctt tgtggacctc cggctcctgc tcctcttagc ggccaccgcc ctcctgacgc 481 acggccaaga ggaaggccaa gtcgagggcc aagacgaaga cagtaagtcc caaacttttg 541 ggagtgcaag gatactctat atcgcgcctt gcgcttggtc ccgggggccg cggcttaaaa 601 cgagacgtgg atgatccgga gactcgggaa tggaagggag atgatgaggg ctcttcctcg 661 gcgccctgag acaggaggga gctcaccctg gggcgaggtt ggggttgaac gcgccccggg 721 agcgggaggt gagggtggag cgccccgtga gttggtgcaa gagagaatcc cgagagcgca 781 accggggaag tggggatcag ggtgcagagt gaggaaagta cgtcgaagat gggatggggg 841 cgccgagcgg ggcatttgaa gcccaagatg tagaagcaat caggaaggcc gtgggatgat 901 tcataaggaa agattgccct ctctgcgggc tagagtgttg ctgggccgtg ggggtgctgg 961 gcagccgcgg gaagggggtg cggagcgtgg gcgggtggag gatgagaaac tttggcgcgg 1021 actcggcggg gcggggtcct tgcgccccct gctgaccgat gctgagcact gcgtctcccg 1081 gtccaacgct tactggggca ggagccggag cgggaagacc cgggttattg ctgggtgcgg 1141 acccccacct ctagatctgg aaagtaaagc cagggatggg gcagcccaag cctcttaaag 1201 aggtagtcgg gccggtgagg tcggccccgc cccggcccca ttgcttagcg ttgcccgaca 1261 cctagtggcc gtctggggag ccgctagcgc ggtgggagtg gttagctaac ttctggacta 1321 tttgcggact ttttggttct ttggctaaaa gtgacctgga ggcattggct ggctttgggg 1381 gactggggat ggccccgaga gcgggctttt aagatgtcta ggtgctggag gttagggtgt 1441 ctcctaattt tgaggtacat ttcaagtctt gggggggcgt cccttccaat cagccgctcc 1501 cattctctta gccccgcccc cgccacccca catgcccagg gaatgggggc gggatgaggg 1561 atggacctcc cttctctcct ccctcgccct cctcctgtct ctaccacgca agccactccc 1621 cacgagcctg ccctcccgat ggggcccctc ctattctccc cccgccctcc ccctctcacc 1681 ctgtggtttt atttcacttg gcttcagcgc caatgggctg aggttggagt tggaagccac 1741 cgcggactaa agctttgttt aaattcctga gaactggaaa gagttacagc ctccctggcc 1801 aggcgcctcg gcgctgtcac ccgcgctgat gaggagcagg 
cgagctttta aggatttgag 1861 gaaagaagaa cggggggagg ggcgggaagt gaaaaatcca agtgtgcctc ttagacccgg 1921 gggaaaggtg gttaagctgg gggttgcagt cactactgac aacgcccctc ttccgcctgt 1981 cccagtccca ccaatcacct gcgtacagaa cggcctcagg taccatgacc gagacgtgtg 2041 gaaacccgag ccctgccgga tctgcgtctg cgacaacggc aaggtgttgt gcgatgacgt 2101 gatctgtgac gagaccaaga actgccccgg cgccgaagtc cccgagggcg agtgctgtcc 2161 cgtctgcccc gacggctcag gtgcggctgc gctcggggcc tggggcctgg ggctggggct 2221 gggggtggtc ggcgctcgct ggccctccgt gctggaggcc tctgccgacg ggagcagcat 2281 tagcaaacct tggctctaac gggcgtctct tcgtccccta gagtcaccca ccgaccaaga 2341 aaccaccggc gtcgaagtaa tctcctgccc tcgaattttg cccctgcgcg gcccgtgact 2401 cctcacagtc ctcccttctc taacctggcc tcttgtttct tctcccccaa tcccacaggg 2461 acccaaggga gacactggcc cccgaggccc aagggtaagc gttgcactct gggctgtggg 2521 gggctgcagg tgggcatggc tctcggcccc acgctcaccc cggccccgcc ctctccccct 2581 gcagggaccc gcaggccccc ctggccgaga tggcatccct ggacagcctg gacttcccgg 2641 accccccgga ccccccggac ctcccggacc ccctggcctc ggaggagtaa gtggagaggc 2701 cttgtgtgtc cactctcccc tgttttgttt ttgttttttg gcagatgaca taattttata 2761 ctttgaaata atttcaaact tacagaaaag ttgcaagaat cctacaggaa actctcacat 2821 acccttcaca gtttgtgaca tgtgctttat tagtctctgt ttatgtatat gtatcttttt 2881 ttttctgaac tgtttgagca agttgctaac atcaggctct tttgcgccta aatacttagg 2941 tgtgtttttc ctaaaaacaa gagcattctc ttaactgacc tacacaatga ttaaattcac 3001 tctctaatgt gcagtccgta ctcaaagttc accgatgtcc cgataatgtc ctttatagat 3061 tccacccccc accaccccaa tctgggatcc agtccaggat tatgtattgc atttaatcat 3121 catgtctcta gtttccacaa atgtagaacg ttcctcagac tttctttgtc tttagtggca 3181 ctgggagttt tgatgagtcc agttgttttg cagactgtcc ctcaatttgg gattgtctca 3241 ttagattaga tgcagggatg catctttggc aggaatgtct taaaagcaat gttattcttc 3301 tcagcacatc acaccaggaa gtgcatgatg tcagtttctt ccatcctcag tgccgtcttc 3361 tgcctttcaa ttcactgtcc tcactctgac ttctcttgtt tgttctagaa ctttgctccc 3421 cagctgtctt atggctatga tgagaaatca accggaggaa tttccgtgcc tggccccatg 3481 gtgagccagc agggggagca tggatgacag aagagagaat gggtatccag 
aggatgtggg 3541 catacgcggc tggtatacac agcttgggag gtccatatca cctttgggac ctcagagtcc 3601 agaaaggatg caagacgact gggtggtccc aacaggcatg aatgactaca tccacatgct 3661 ttcctacaga gggatcacca tgacccccct ttcttctccc tctatagggt ccctctggtc 3721 ctcgtggtct ccctggcccc cctggtgcac ctgtgagtat ccaggacgtc ttcatatgcc 3781 tccttgggct ttggtctttt ggagggaaga ctgggatgag ggcaggagag atgctcagag 3841 atctcttggt aagattggag aaggttgaca gggacttgtc ttctaaccca tctttttcct 3901 tcttctcaag ggtccccaag gcttccaagg tccccctggt gagcctggcg agcctggagc 3961 ttcagtaagc actctctata cagattcata ctccttctac aaacacacag actctcctat 4021 agaagaactc ccaggcctgg ggtcttcctt acctcttccc ttcaatccca gccttcccct 4081 tctttttttc ttatccatat tctaaccacc tcttctatct tttctagggt cccatgggtc 4141 cccgaggtcc cccaggtccc cctggaaaga atggagatga tgtaagtatc cccagcaaga 4201 agataccatc tgaccccatg gcctccatgg gttgggtcct gcaatttcca ctccaccaca 4261 tttgggaacg atactcagag gaaggagggc aagtcctctc tgatgcacgg actgccctgg 4321 aacaatgatc ttttcgctta gtgagatgat tccatgtccc caacaaagtg actgttctcc 4381 tcaccccagc caccttagag caatccccaa ccccatccct ttggggaaat tggtgcgcag 4441 atggtgaaat taaaatgctg gtgacagaag tagacagaaa ttcctttaga ggcactcaga 4501 tttcaccaaa cgaaggtttc actgtagatt taaactgagc tctagattca aagataagat 4561 tctgggcccc caaacctgac ctgcaacaat ccaaagaaga ctgagacctt ctccactttt 4621 ccagccccta ggcggtggtg gggaggcaga ggcatgatgg tcttttctct ccctctcagg 4681 gggaagctgg aaaacctggt cgtcctggtg agcgtgggcc tcctgggcct caggtgagca 4741 gggggctgtg gctgaacctg ggcttcactg cacttgggct tcatttagga gctgggtcca 4801 cagtgatgtg ttctaatggc ccttccttgt cttcttcatc tctctccagg gtgctcgagg 4861 attgcccgga acagctggcc tccctggaat gaagggacac agagtgagtc acctttgagt 4921 catttaagct ccccaagtcc ctagcatacc cccatccagt cccagcctct tccccaaaag 4981 atcctgagtt gcatcatggt gggtggcagc tacagaagtc ccaagggcca gagagtggac 5041 atccaaaagc actcctcatg gaatcccgat taccgattgg gtgagatctt agagccattt 5101 ggggtttagt ctagctcaga aacaaaggga tggcggtgat gacctcccaa ggctctttct 5161 cagatctagg tggatgtcaa ggctgttcca ccccctccac aggttcttac cttctacctc 
5221 tttcctgctt tagggtttca gtggtttgga tggtgccaag ggagatgctg gtcctgctgg 5281 tcctaaggta agaggctgtc tgaacatcat ggtcctccac atccccagag tcccaccatg 5341 aatgaatttc tcactcatta ttctctgatc tacagggtga gcctggcagc cctggtgaaa 5401 atggagctcc tggtcagatg gtgagtgtgc ccagttccag agggcaggga tggggcagga 5461 ggcaggggca agatggaggc ctgggggaac aaggctgtct cccatctcat ctgacttctc 5521 ttggtttggt tgtcagggcc cccgtggcct gcctggtgag agaggtcgcc ctggagcccc 5581 tggccctgct gtaagtactc ctggcccctt gggggatccc tgagctctgg aaggggctcc 5641 ccaggaactc tagggactgg ccagtgctca gtggacttaa cggggcttcc cctctctcct 5701 gcagggtgct cgtggaaatg atggtgctac tggtgctgcc gggccccctg tgagtgtggc 5761 ctgtaggcct cagggcctgg gagtggggag gggtctcagt gtctgctctt ggggctgaca 5821 atgggggcag gttatgttgg tctgaacccc aggacttcct ctgtcccagg gtgtgacttg 5881 cagctgccat ctcttccttc tcgctgacat ctccatttca ttcacagggt cccaccggcc 5941 ccgctggtcc tcctggcttc cctggtgctg ttggtgctaa ggtgagaccc cccactctcc 6001 tctaagcatg accctcatgg gccaaggggt tcatgtctcc ctgttcccca aaccaaaggg 6061 acccagagtg gcaagagagc agcccgttca ctaacacctt tgtcctgggg tctccgtctc 6121 tgatcttaga gtcctgatca ttgctctcct gtccctgtct ccccttcctc ctgccatccc 6181 gagaggcaag gttgggtttc ccagggtggc ttctgatatg tcctttcttc tgattcaggg 6241 tgaagctggt ccccaagggc cccgaggctc tgaaggtccc cagggtgtgc gtggtgagcc 6301 tggcccccct ggccctgctg gtgctgctgg ccctgctgta agtgtccccg actcagtgtc 6361 ccctttgcca ctttctaacc tcagagtcct tgcctgttgc tgacactcct ttctctgtgc 6421 cacagggaaa ccctggtgct gatggacagc ctggtgctaa aggtgccaat gtaagtatcc 6481 tgccaggctt cagtcccact cctgccgcct gcagcctgcc tgcccctttc cctctgctcc 6541 taggctcacg ccctggctgt ctgcctccca cagggtgctc ctggtattgc tggtgctcct 6601 ggcttccctg gtgcccgagg cccctctgga ccccagggcc ccggcggccc tcctggtccc 6661 aagggtaaca gcgtgagtac caaactctcc cttctgccca ccccatgcac tggctccagt 6721 gcggctctca tctggggagc aggaagacgc aggccaactg agcgcccccg actctcagct 6781 catcctcttc tccccccttg cagggtgaac ctggtgctcc tggcagcaaa ggagacactg 6841 gtgctaaggg agagcctgta agtctccccg ccatccttct tgcagcccag cccaccctgc 6901 
cctaggagcc ccctgaggga aatccagaaa ggaagaggag cccctagtct tctgggggag 6961 tccctgccac acccccagga acccctgaca ctggaggccc agcctcagcc ggctctgagg 7021 ctggcacagg atggcccctc accacaggcc gcctcctcct ctcggccctc tccagggccc 7081 tgttggtgtt caaggacccc ctggccctgc tggagaggaa ggaaagcgag gagctcgagg 7141 tgaacccgga cccactggcc tgcccggacc ccctggcgag cgtgtaagtg tccctgcccg 7201 ccccctccca ctccaccctc attgcctggc tggtgcctgt gtgtcgcgga gttcactggc 7261 ctcctctcct cctgcagggt ggacctggta gccgtggttt ccctggcgca gatggtgttg 7321 ctggtcccaa ggtaacctct ccttgcggcc ggggggctga ccctgccgct ccctgggcat 7381 cttcttcctc ttttggcccg tggcaaagag ccacaaactt gagaccctaa ctgttcctgt 7441 gacttccccc aaccagggtc ccgctggtga acgtggttct cctggccccg ctggccccaa 7501 aggatctcct ggtgaagctg gtcgtcccgg tgaagctggt ctgcctggtg ccaaggtgag 7561 gccccaggct ttcagcctgg cttggccagg ccctgaccat cccgtgtagg gtctgggatg 7621 aggcgttctg gatcaggccc aagggtctgc cctctggagt cctcccccac ctccatcatg 7681 cttctcccca agtcccactc atacctctct gcctccctag ggtctgactg gaagccctgg 7741 cagccctggt cctgatggca aaactggccc ccctgtaagt atcactcccc ctgaaccccc 7801 tgccattgtc ctgtctgcct ccctgctgtc ctcactgctg ctttcgtgcc tcccatcctt 7861 agggtcccgc cggtcaagat ggtcgccccg gacccccagg cccacctggt gcccgtggtc 7921 aggctggtgt gatgggattc cctggaccta aaggtgctgc tgtgagtatt aagtgaggat 7981 ccatgaagag ccagggacaa acacacctga gacttgaagg agtcctgggc tctgggctca 8041 gctgtgccgc tgacctgccg tgtggccact cactctcact ttctggacct cagcctccct 8101 atctgtaaaa tgaaagactt ctcggcgggg cacggtggct catgcctgta atcccagcac 8161 tttgggaggc caaggcgggc agaccatgag gtcaggagtt tgagaccagt cgggccaaca 8221 tagtgaaacc acgtctctac taaaaataca aaagattagc tgggtgtggt ggtgtgcacc 8281 tgtaacccca gctagtcagg aggctgaggc aggagaattg catgaacccg ggaggtggag 8341 gttgcagtga gctgagatca cgccattgca ctccagcctg ggcaacagtg cgagattcca 8401 tctcaaaaaa aaaaaaaaaa agaagaaaga aagaaagaaa aaatgaaaca cttctccagg 8461 ctccatgacc actgctctgt cctgaaataa gtgttgttgg tggccctcca ccccgacacg 8521 tggggatagg acaggccttt gatatgatag gcacccccag tcttggtgga ttctttgagg 8581 tccaaaagga 
gatagcagag aagatgaaag ccctttgcag tgcaggccac agcgggcatc 8641 taacagggaa aaggcagagg agcctggaag ggcatcttgg gaggagtggg ctcagaaagg 8701 gcccagcaag aagcacctgc aggggcattc cccgggggcc aaacagtctt ttgaaaagaa 8761 agtcccttaa aaagtcccac tcagagtaaa tgagaggccc caggaggccc tggcttctca 8821 cttcagcccc ctcaacccta actccctttc tccacaggga gagcccggca aggctggaga 8881 gcgaggtgtt cccggacccc ctggcgctgt cgtaagtatc tcctttccat ccctacctcc 8941 ttcccattgc tgccccggca ctttctcctc cctgcaggag gggtgctaga ggccacggtc 9001 ctcagctgct cggggcctcc taaccctgag ttcccctttg ctctctccct gcagggtcct 9061 gctggcaaag atggagaggc tggagctcag ggaccccctg gccctgctgt gagtgtccct 9121 gatggggaga tctggggagc agaaaagggg agacaccctc agcccctcgt ctcctcggcc 9181 tccccgtgac tgtagtgttc tctctgtgca gggtcccgct ggcgagagag gtgaacaagg 9241 ccctgctggc tcccccggat tccaggtgag gcctcatggc tgtcaggatg ctgggaggta 9301 ggggtaggaa acacctcttt ggtctcttcc agattctaaa ccttccctcc cttcttcccc 9361 catttcccac ctacagggtc tccctggtcc tgctggtcct ccaggtgaag caggcaaacc 9421 tggtgaacag gtaagaggga gcagccggcc agaggggtgg gagatgcagg gaatccagag 9481 ggacaggccc ccgcctccta gctaatcaga cagccatcaa ctagagggat tgaggttaga 9541 caccggaaag aacttcctcc catgaaggga gcagcacaga gggaagtggg ggctgcatga 9601 ttgctagtct gggtgacttc ttttaagagc tgctggaata tgctgtgact ttccctcaac 9661 ccttgtattg ataaatcttg gtccatagtt tggggagggg ggaagccttt gacacatccc 9721 taggaggaag agaggggctg tttgggataa tctcaattca gtgctgagaa ggggttcctc 9781 tctaatcacg gccagacccc aggaggaagg accgtgcttt ccagcagagt ggccccaggt 9841 aggttttgct cactgtctgt tcctctctcc ctccccctca gggtgttcct ggagaccttg 9901 gcgcccctgg cccctctgga gcaagagtaa gtaggcctct ctcgctgcat ccgtcaaggt 9961 gcgttgtact tggccctatc tccagagcag ccttcacatg ccctgtcctt cccttctagg 10021 gcgagagagg tttccctggc gagcgtggtg tgcaaggtcc ccctggtcct gctggtcccc 10081 gaggggccaa cggtgctccc ggcaacgatg gtgctaaggt gagggcagcg tggaaggggc 10141 tctggcaagt ggcccaggga ccaggtctca cccctcctgc agcaggggat ggcgggccat 10201 gaccaaagcc atggagatag ggtgtggggt ggggggaaaa gaccagggca ggggcccaca 10261 cacagcctgg 
agtctgggct gtgagtcttt tcatcttttc tcaaggcttg tcgttggcct 10321 tggaaacaag cctgggagat accaagcggg gcttagggct gtgacccact cttggggccc 10381 caggcctcac tccagtcttc ttggttgtca catagggtga tgctggtgcc cctggagctc 10441 ccggtagcca gggcgcccct ggccttcagg gaatgcctgg tgaacgtggt gcagctggtc 10501 ttccagggcc taagggtgac agagtaagtt caaccttccc cctcccctga gccctacatg 10561 gctcccatct ctgcctgctt tgaatctctc agcatctctc cttctctctg ggatctgtcc 10621 ctcttctcgc taatcctccc ctcttcccct ttcccctctg gcctttttgc tgatgaatcc 10681 tctccctgtg gtccaggccc atctatcccc atgggttacc atggtgatga gaggtggggg 10741 catctccttg gtggaggctc ccttattcat cccgctacac aagtcagggg cctcttaacc 10801 tcagttccac ctgagtctcc aggcaggaac cctttttcct gaaagaatct ttgagtcctt 10861 ggcccaggtg gaggcagggc agagctgcag agggcctctc aggaaaccca gacacaagca 10921 gaacactata ggtcacctcc ttgccccaca ctggaaatct caagcttatc catgtcttta 10981 gggtgatgct ggtcccaaag gtgctgatgg ctctcctggc aaagatggcg tccgtggtct 11041 gaccggcccc attggtcctc ctggccctgc tggtgcccct ggtgacaagg tgaggtggcc 11101 gcctccccac cttctgccct aacacatagc ctcctcagca ggcctgggca cggttccgtg 11161 gggttgcgtt gggagagcag gtcctgccaa actgagctgt caacctggga acctggaggg 11221 accagaagga ggggaggctc tcctggggtc atctactagg agtattcagg ggaggccctg 11281 accctgagcc tcttgtccct tgctctcagg gtgaaagtgg tcccagcggc cctgctggtc 11341 ccactggagc tcgtggtgcc cccgtaagta cagaagacct gttaagaccc catacttggc 11401 ccttccctcc cttcacacag cacccctggc cctgtctgtg ccttcacccc ttgcctctcc 11461 cctcaccgca tccccgcctt ccctcctgtc agacgcatct ctccaatctg actccttttc 11521 ttctagggag accgtggtga gcctggtccc cccggccctg ctggctttgc tggcccccct 11581 gtgagtacca agacccccat catttttcat caccgactgg gacctgggac ctcgagggac 11641 ggaatgagga caaaggcgtc agccatcctc aggggagaag ggtggagacg ggattgtttc 11701 ccacccaagc atcttcctgc ctccattact gctcctcccc caggtagtgg aaactcctgc 11761 ctccttccct ccattcaccg ccctgcttcc tcccccaggg tgctgacggc caacctggtg 11821 ctaaaggcga acctggtgat gctggtgcta aaggcgatgc tggtccccct ggccctgccg 11881 gacccgctgg accccctggc cccattgtga gtggcttggc cctctgtgcc cacgaggctg 
11941 gtgggctggg acccaggacg ggtccaggct tgatgcgtct gtgctctcct acagggtaat 12001 gttggtgctc ctggagccaa aggtgctcgc ggcagcgctg gtccccctgt gagtatcacc 12061 cgcctctctg ttgagcctct cccctctccc caggcagcgg tggcaggtga gggcagctgg 12121 gtcggatgag ttggctgttc tccctctgac tgttcctatg ttctctcctt ccagggtgct 12181 actggtttcc ctggtgctgc tggccgagtc ggtcctcctg gcccctctgt aagtctctgc 12241 agcagagtcc actgctctag gttgggggtg ctgggtgggg gctgccagaa ggatggtggg 12301 gctgactgag gacccaatga tgcaccagag ccccctggag tctgacagcc cctcctatcc 12361 tcatccaggg aaatgctgga ccccctggcc ctcctggtcc tgctggcaaa gaaggcggca 12421 aaggtccccg tggtgagact ggccctgctg gacgtcctgg tgaagttggt ccccctggtc 12481 cccctggccc tgctggcgag aaaggatccc ctggtgctga tggtcctgct gtaagtgcca 12541 gctcagatct ctgcagctcc ggaggtgtgc agagctgggg aggggtccct gtgctgctgt 12601 ctggcacctc acccctgttt gcctcccaaa gggtgctcct ggtactcccg ggcctcaagg 12661 tattgctgga cagcgtggtg tggtcggcct gcctggtcag agaggagaga gaggcttccc 12721 tggtcttcct ggcccctctg taagtgcccc cctcaccttg gggggccctg agaaaaacca 12781 tcacaggact tggagtgggg cggagccaag gagaacagat ttggtagaga tgactccagc 12841 ggactcaagg gtcctcccag accctatctc tggcctgact ctttcttctc ccttagggtg 12901 aacctggcaa acaaggtccc tctggagcaa gtggtgaacg tggtccccct ggtcccatgg 12961 gcccccctgg attggctgga ccccctggtg aatctggacg tgaggtgagc agtccccagc 13021 ccccatgcca gtaccctcag catggccatt gtggccttgc ctaagccctc ttccccggct 13081 gactctcact tctctctctc tctctctgca gggggctcct ggtgccgaag gttcccctgg 13141 acgagacggt tctcctggcg ccaaggtaag atggcaacac tccatgacca cagccttgtc 13201 tgctgcttcc ctgccccatc ctggcccttc acccggggct gacccatatt cccctgctct 13261 ccccgccagg gtgaccgtgg tgagaccggc cccgctggac cccctggtgc tcctggtgct 13321 cctgttgccc ctggccccgt tggccctgct ggcaagagtg gtgatcgtgg tgagactgta 13381 agtagctggg ctccagttcc ctgtacctgg tcaggccagg gactcttcag gcctccttag 13441 aggcctgggg atgggtgtcg gacttcaccc aggcaggggg aggaaaggag atcctgcaag 13501 atgtcagggc cttaatccaa aaaactgagt taaagctcag ccctaagtcc cctctcccag 13561 acaggaccgc ctctcccatg agttggcccc agctcccgtg 
aagattgcag tggggaggtt 13621 tccctgggag ttgggagaga tggccacagt gggaagcagc tgaggagaga gagatccagc 13681 agaggggagg cctcatcctg cagccccagc ctcagccttc cctggccaag agctcatgct 13741 ttccttgctc tccccagggt cctgctggtc ccgccggtcc tgtcggccct gttggcgccc 13801 gtggccccgc cgtaagtacc ctgctgtgtc ccccatgcct tcagaactct acagatgcag 13861 acagtgcccc actcgatgcc aatggaactt ccgcctgaca gtttgtccct ttctctcttc 13921 tagggacccc aaggcccccg tggtgacaag ggtgagacag gcgaacaggg cgacagaggc 13981 ataaagggtc accgtggctt ctctggcctc cagggtcccc ctggccctcc tgtaagtatg 14041 ctcagcccct ccccagtccc catgctgtgc tgtgggatag gagggggagc ttcgcctcag 14101 tttccccctc tggatagtca ttctttcccc tccctagtgg ggactggggt ctgaagattt 14161 gtgggcatgt ccaagtagct tctgagaggg tgaggggtac acagagaggg attatgggag 14221 aggtctctgc ctatggacac cctcgggcta gatttccaga ataatgaagg ggcatgggtt 14281 gcccacactg cccttgtctc tccagccagg ccctcaggct acatttgacg ctcactgggc 14341 ctgaactgcc ttttttatct gtccttcagg gctctcctgg tgaacaaggt ccctctggag 14401 cctctggtcc tgctggtccc cgagtaagtc atgccttctc tctcctcttc ctgagcccca 14461 agcccaggct cacctcgggg acccttgcca ggacccaggc accctttgcc tctctggaga 14521 agggttcagg gacagggagt gggcaaagaa aggaagaatc ctgaacaaac aatctgatct 14581 agctttggcc tctctgctcc ccaatccgtc ctcccctggc tcagcggctg ggaggagcta 14641 tggcatgtcc tatggaaaga ggctgaggct ggctctatga gccgtggggc cagagccagc 14701 agggagggtg gtgggcctct cctccagagc tggggttgtt cgggcttctg gcagcctttc 14761 tcaaaccatt tcccccactc cagggtcccc ctggctctgc tggtgctcct ggcaaagatg 14821 gactcaacgg tctccctggc cccattgggc cccctggtcc tcgcggtcgc actggtgatg 14881 ctggtcctgt tgtatgtagc ccctcatccc ctctgctcat ggccctccag cccccatagc 14941 acttggatgc cggaatcccc actctcttcc ctctctgtgc agggtccccc cggccctcct 15001 ggacctcctg gtccccctgg tcctcccagc gctggtttcg acttcagctt cctgccccag 15061 ccacctcaag agaaggctca cgatggtggc cgctactacc gggctgatga tgccaatgtg 15121 gttcgtgacc gtgacctcga ggtggacacc accctcaaga gcctgagcca gcagatcgag 15181 aacatccgga gcccagaggg aagccgcaag aaccccgccc gcacctgccg tgacctcaag 15241 atgtgccact ctgactggaa 
gagtggtgtg ggcctgccct agcctctccc tccctcctac 15301 tcctgccatg ccagggtccc catgcccata tgtgccccta ccatatggtg ctggctgctc 15361 cctttccctg actccatctt gccctgccct accacaggag agtactggat tgaccccaac 15421 caaggctgca acctggatgc catcaaagtc ttctgcaaca tggagactgg tgagacctgc 15481 gtgtacccca ctcagcccag tgtggcccag aagaactggt acatcagcaa gaaccccaag 15541 gacaagaggc atgtctggtt cggcgagagc atgaccgatg gattccaggt gcgtgagctg 15601 gacctcagag ccagtgttag gagatgggct agcccagtgc tcagaaggga catgaagtcc 15661 tggagtaggt ctctgctaag ggtgatggac agagctgggc tgggaggcag gggtctcagg 15721 tccctgctag tggttcagac acaggctgcc gatgggcagg tggtgcccct ctgatataac 15781 ggtgcattgg gcagctctct gaggaccctg gacaggaggc cagcaggact agaggttccc 15841 gcatagctca ctcttccctc tctctcctcc ctgcagttcg agtatggcgg ccagggctcc 15901 gaccctgccg atgtggccat ccagctgacc ttcctgcgcc tgatgtccac cgaggcctcc 15961 cagaacatca cctaccactg caagaacagc gtggcctaca tggaccagca gactggcaac 16021 ctcaagaagg ccctgctcct caagggctcc aacgagatcg agatccgcgc cgagggcaac 16081 agccgcttca cctacagcgt cactgtcgat ggctgcacgg tgagtgccca gaatccccag 16141 gcagggcccc acctctccgg ccttgggcat tttggccagg ccatagtgcc ctctctccat 16201 cactcccacg tggtaatgcc ccctcccgtt gtctccgccc caccccagag tcacaccgga 16261 gcctggggca agacagtgat tgaatacaaa accaccaaga cctcccgcct gcccatcatc 16321 gatgtggccc ccttggacgt tggtgcccca gaccaggaat tcggcttcga cgttggccct 16381 gtctgcttcc tgtaaactcc ctccatccca acctggctcc ctcccaccca accaactttc 16441 cccccaaccc ggaaacagac aagcaaccca aactgaaccc ccccaaaagc caaaaaatgg 16501 gagacaattt cacatggact ttggaaaata tttttttcct ttgcattcat ctctcaaact 16561 tagtttttat ctttgaccaa ccgaacatga ccaaaaacca aaagtgcatt caaccttacc 16621 aaaaaaaaaa aaaaaaaaaa aagaataaat aaataacttt ttaaaaaagg aagcttggtc 16681 cacttgcttg aagacccatg cgggggtaag tccctttctg cccgttgggt tatgaaaccc 16741 caatgctgcc ctttctgctc ctttctccac accccccttg gcctcccctc cactccttcc 16801 caaatctgtc tccccagaag acacaggaaa caatgtattg tctgcccagc aatcaaaggc 16861 aatgctcaaa cacccaagtg gcccccaccc tcagcccgct cctgcccgcc cagcaccccc 16921 
aggccctggg gacctggggt tctcagactg ccaaagaagc cttgccatct ggcgctccca 16981 tggctcttgc aacatctccc cttcgttttt gagggggtca tgccggggga gccaccagcc 17041 cctcactggg ttcggaggag agtcaggaag ggccacgaca aagcagaaac atcggatttg 17101 gggaacgcgt gtcatccctt gtgccgcagg ctgggcggga gagactgttc tgttctgttc 17161 cttgtgtaac tgtgttgctg aaagactacc tcgttcttgt cttgatgtgt caccggggca 17221 actgcctggg ggcggggatg ggggcagggt ggaagcggct ccccattttt ataccaaagg 17281 tgctacatct atgtgatggg tggggtgggg agggaatcac tggtgctata gaaattgaga 17341 tgccccccca ggccagcaaa tgttcctttt tgttcaaagt ctatttttat tccttgatat 17401 tttttctttc tttttttttt tttttgtgga tggggacttg tgaatttttc taaaggtgct 17461 atttaacatg ggaggagagc gtgtgcgctc cagcccagcc cgctgctcac tttccaccct 17521 ctctccacct gcctctggct tctcaggcct ctgctctccg acctctctcc tctgaaaccc 17581 tcctccacag ctgcagccca tcctcccggc tccctcctag tctgtcctgc gtcctctgtc 17641 cccgggtttc agagacaact tcccaaagca caaagcagtt tttccctagg ggtgggagga 17701 agcaaaagac tctgtaccta ttttgtatgt gtataataat ttgagatgtt tttaattatt 17761 ttgattgctg gaataaagca tgtggaaatg acccaaacat aatccgcagt ggcctcctaa 17821 tttccttctt tggagttggg ggaggggtag acatggggaa ggggccttgg ggtgatgggc 17881 ttgccttcca ttcctgccct ttccctcccc actattctct tctagatccc tccataaccc 17941 cactcccctt tctctcaccc ttcttatacc gcaaaccttt ctacttcctc tttcattttc 18001 tattcttgca atttccttgc accttttcca aatcctcttc tcccctgcaa taccatacag 18061 gcaatccacg tgcacaacac acacacacac tcttcacatc tggggttgtc caaacctcat 18121 acccactccc cttcaagccc atccactctc caccccctgg atgccctgca cttggtggcg 18181 gtgggatgct catggatact gggagggtga ggggagtgga acccgtgagg aggacctggg 18241 ggcctctcct tgaactgaca tgaagggtca tctggcctct gctcccttct cacccacgct 18301 gacctcctgc cgaaggagca acgcaacagg agaggggtct gctgagcctg gcgagggtct 18361 gggagggacc aggaggaagg cgtgctccct gctcgctgtc ctggccctgg gggagtgagg 18421 gagacagaca cctgggagag ctgtggggaa ggcactcgca ccgtgctctt gggaaggaag 18481 gagacctggc cctgctcacc acggactggg tgcctcgacc tcctgaatcc ccagaacaca 18541 acccccctgg gctggggtgg tctggggaac catcgtgccc ccgcctcccg 
cctactcctt 18601 tttaagctt // LOCUS AF017257 101569 bp DNA PRI 01-JAN-1998 DEFINITION Homo sapiens chromosome 21 derived BAC containing erythroblastosis virus oncogene homolog 2 protein (ets-2) gene, complete cds, complete sequence. ACCESSION AF017257 NID g2736086 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 101569) AUTHORS Zimmermann,W.W.K., Korenberg,J., Rosenthal,A. and Schattevoy,R. JOURNAL Unpublished REFERENCE 2 (bases 1 to 101569) AUTHORS Zimmermann,W.W.K. and Schattevoy,R. TITLE Direct Submission JOURNAL Submitted (05-AUG-1997) Genome Analysis, Institute of Molecular Biotechnology, Beutenbergstrasse 11, Jena 07745, Germany FEATURES Location/Qualifiers source 1..101569 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" source 1..97350 /organism="Homo sapiens" /clone="1151N9" /chromosome="21" repeat_region 586..727 /rpt_family="MLT1D" repeat_region 728..1130 /rpt_family="MSTA" repeat_region 1131..1497 /rpt_family="MLT1D" repeat_region complement(1675..1978) /rpt_family="AluSp" exon 2689..2786 /note="MZEF_prediction, 0.681, 0.453, 0.484, 0.567, 221, 0.513, 0.550, 0.548" /evidence=not_experimental repeat_region 2921..3216 /rpt_family="AluY" repeat_region complement(3218..3638) /rpt_family="L1MB5" repeat_region complement(3643..3945) /rpt_family="AluJo" repeat_region complement(3949..4028) /rpt_family="L1MB5" repeat_region complement(4998..5280) /rpt_family="AluY" repeat_region complement(5361..5484) /rpt_family="MLT1G" repeat_region complement(5734..5864) /rpt_family="MLT1G" repeat_region 5865..6230 /rpt_family="MLT1B" repeat_region complement(6274..6573) /rpt_family="AluSg" repeat_region 7075..7372 /rpt_family="AluJb" repeat_region complement(7952..8070) /rpt_family="MIR" exon complement(8119..8174) /note="MZEF_prediction, 0.515, 0.485, 0.482, 0.460, 212, 0.534, 0.526, 0.615" /evidence=not_experimental repeat_region 
complement(8207..8230) /rpt_family="AT_rich" exon 8684..8753 /note="GENSCAN_prediction, 3.14, Internal_exon, 70_bp, frame:2, phase:1" /evidence=not_experimental repeat_region complement(10878..11376) /rpt_family="MER64B" repeat_region 11435..11646 /rpt_family="LINE2" repeat_region complement(11895..12152) /rpt_family="HERVL" exon 12025..12178 /note="MZEF_prediction, 0.648, 0.590, 0.545, 0.604, 122, 0.535, 0.622, 0.559" /evidence=not_experimental repeat_region complement(12847..15206) /rpt_family="TIGGER1" exon complement(13255..13314) /note="MZEF_prediction, 0.606, 0.627, 0.504, 0.332, 122, 0.497, 0.572, 0.574" /evidence=not_experimental exon complement(14279..14366) /note="MZEF_prediction, 0.922, 0.554, 0.355, 0.685, 221, 0.500, 0.619, 0.545" /evidence=not_experimental exon complement(15060..15246) /note="MZEF_prediction, 0.664, 0.459, 0.478, 0.522, 221, 0.522, 0.513, 0.552" /evidence=not_experimental repeat_region complement(15853..16155) /rpt_family="AluSx" exon 16859..16959 /note="GENSCAN_prediction, -0.55, Internal_exon, 101_bp, frame:1, phase:2" /evidence=not_experimental repeat_region 16989..17244 /rpt_family="AluSx" exon 17410..17521 /note="GENSCAN_prediction, 3.64, Internal_exon, 112_bp, frame:1, phase:1" /evidence=not_experimental repeat_region complement(17455..17929) /rpt_family="LINE2" repeat_region complement(18474..18678) /rpt_family="AluJb" exon complement(19127..19253) /note="MZEF_prediction, 0.630, 0.657, 0.609, 0.600, 112, 0.494, 0.643, 0.575" /evidence=not_experimental exon 19214..19256 /note="GENSCAN_prediction, 0.74, Internal_exon, 43_bp, frame:1, phase:1" /evidence=not_experimental repeat_region complement(19234..19591) /rpt_family="MLT1D" repeat_region complement(19592..19866) /rpt_family="AluSq" repeat_region complement(20561..20901) /rpt_family="LINE2" repeat_region complement(21014..21123) /rpt_family="LINE2" exon 21276..21392 /note="GENSCAN_prediction, 1.86, Internal_exon, 117_bp, frame:1, phase:0" /evidence=not_experimental 
repeat_region 21682..21995 /rpt_family="L1MB8" repeat_region complement(22008..22296) /rpt_family="AluSx" repeat_region 22343..22387 /rpt_family="L1MB8" repeat_region 22388..22727 /rpt_family="MLT1A1" repeat_region 22784..22825 /rpt_family="(CATA)n" repeat_region 23077..23183 /rpt_family="MER45" repeat_region 24091..24190 /rpt_family="MIR" repeat_region complement(24439..24493) /rpt_family="LINE2" repeat_region complement(24517..24590) /rpt_family="MIR" misc_feature 27418..29082 /note="0.84, GC=, 70.00%, #163" /note="Region: CpG Island" exon complement(27719..27771) /note="Xpound_prediction, 2%, 62%" /evidence=not_experimental repeat_region complement(27976..28074) /rpt_family="(GGGA)n" repeat_region complement(28076..28226) /rpt_family="(GGGA)n" mRNA join(28170..28459,32374..32445,35340..35451,36610..36729, 37118..37318,39306..39389,40723..40944,41801..42064, 43884..44002,44972..47247) /gene="ets-2" /evidence=experimental gene 28170..47247 /gene="ets-2" /evidence=experimental repeat_region complement(28362..28412) /rpt_family="GC_rich" repeat_region complement(29741..29781) /rpt_family="POLY_A" repeat_region complement(30921..30967) /rpt_family="(CA)n" repeat_region complement(31416..31641) /rpt_family="MER33" CDS join(32374..32445,35340..35451,36610..36729,37118..37318, 39306..39389,40723..40944,41801..42064,43884..44002, 44972..45187) /gene="ets-2" /codon_start=1 /evidence=experimental /product="erythroblastosis virus oncogene homolog 2 protein" /db_xref="PID:g2736087" /translation="MNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLN EEQTLQEVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKN PWLWSEQQVCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILW EHLEQMIKENQEKTEDQYEENSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLL DSMCPASTPSVLSSEQEFQMFPKSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDH DSPENGADSFESSDSLLQSWNSQSSLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDY IQERSDPVEQGKPVIPAAVLAGFTGSGPIQLWQFLLELLSDKSCQSFISWTGDGWEFK LADPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFVCDLQNLL GFTPEELHAILGVQPDTED" 
repeat_region 32555..32630 /rpt_family="MIR" repeat_region complement(34187..34353) /rpt_family="FRAM" repeat_region 34838..35135 /rpt_family="AluSx" repeat_region complement(37506..37544) /rpt_family="(CA)n" repeat_region 37988..38292 /rpt_family="AluSx" repeat_region 38293..38412 /rpt_family="(GGGA)n" repeat_region 38905..39021 /rpt_family="LINE2" repeat_region 39504..39803 /rpt_family="MER33" repeat_region complement(41096..41163) /rpt_family="LINE2" repeat_region complement(41220..41611) /rpt_family="L1ME3" repeat_region complement(42961..42995) /rpt_family="MIR" repeat_region complement(45706..45733) /rpt_family="AT_rich" polyA_signal 45721..45726 /gene="ets-2" /note="Pub" /evidence=experimental repeat_region complement(45908..45967) /rpt_family="(CAAA)n" polyA_signal 46061..46066 /gene="ets-2" /note="Pub" /evidence=experimental exon complement(46193..46238) /gene="ets-2" /note="MZEF_prediction, 0.891, 0.588, 0.727, 0.667, 111, 0.549, 0.687, 0.545" /evidence=not_experimental exon 46994..47169 /gene="ets-2" /note="MZEF_prediction, 0.700, 0.540, 0.423, 0.500, 122, 0.521, 0.538, 0.560" /evidence=not_experimental polyA_signal 47226..47231 /gene="ets-2" /note="est, N99244, &, Pub" /evidence=experimental repeat_region complement(47747..47940) /rpt_family="MER63A" repeat_region 47941..48619 /rpt_family="MER64" repeat_region 48899..49190 /rpt_family="MER64" repeat_region 49191..49460 /rpt_family="AluSx" repeat_region 49461..49737 /rpt_family="MER64" repeat_region complement(50065..50354) /rpt_family="AluJo" repeat_region complement(50591..50622) /rpt_family="(TAAA)n" repeat_region complement(50623..51925) /rpt_family="L1PA2" repeat_region 51923..53073 /rpt_family="L1PA2" exon 52656..52807 /note="MZEF_prediction, 0.537, 0.609, 0.427, 0.627, 121, 0.516, 0.595, 0.617" /evidence=not_experimental repeat_region 53320..53395 /rpt_family="MIR" exon 53805..53939 /note="MZEF_prediction, 0.670, 0.503, 0.519, 0.561, 221, 0.548, 0.558, 0.571" /evidence=not_experimental exon 
complement(53834..53918) /note="GENSCAN_prediction, -0.37, Terminal_exon, 85_bp, frame:0, phase:1" /evidence=not_experimental repeat_region complement(54557..54667) /rpt_family="MIR" repeat_region 55435..55565 /rpt_family="(CA)n" repeat_region complement(55642..56064) /rpt_family="MLT1B" repeat_region complement(56086..56177) /rpt_family="L1M1" repeat_region complement(56181..57097) /rpt_family="L1M1" repeat_region 58243..58523 /rpt_family="AluSx" repeat_region 58778..59081 /rpt_family="AluY" repeat_region 60006..60060 /rpt_family="(CA)n" repeat_region 61176..61443 /rpt_family="LINE2" exon complement(61336..61535) /note="GENSCAN_prediction, 2.75, Internal_exon, 200_bp, frame:2, phase:2" /evidence=not_experimental repeat_region complement(62232..62347) /rpt_family="MER5B" repeat_region complement(62631..62935) /rpt_family="AluSx" exon 63049..63140 /note="MZEF_prediction, 0.570, 0.500, 0.388, 0.615, 221, 0.472, 0.550, 0.525" /evidence=not_experimental repeat_region complement(63118..63568) /rpt_family="MLT1D" repeat_region 64089..64302 /rpt_family="MLT1D" repeat_region 64435..64703 /rpt_family="MLT1D" repeat_region 64893..65303 /rpt_family="MLT1B" repeat_region complement(65507..65576) /rpt_family="L1ME2" repeat_region complement(65577..65965) /rpt_family="MSTA" exon 65721..65744 /note="Xpound_prediction, 62%, 1%" /evidence=not_experimental repeat_region complement(65966..66732) /rpt_family="L1ME2" repeat_region 66752..66836 /rpt_family="MER47B" repeat_region complement(66878..66961) /rpt_family="L1ME2" repeat_region 66962..67310 /rpt_family="THE1C" repeat_region complement(67404..67705) /rpt_family="L1ME2" repeat_region complement(67711..68019) /rpt_family="THE1B" repeat_region complement(68022..68410) /rpt_family="L1ME2" repeat_region complement(68411..68461) /rpt_family="AT_rich" repeat_region complement(69079..69100) /rpt_family="AT_rich" repeat_region complement(69319..69435) /rpt_family="MLT1A1" exon complement(70026..70276) /note="MZEF_prediction, 0.593, 
0.471, 0.460, 0.541, 221, 0.541, 0.536, 0.547" /evidence=not_experimental repeat_region complement(70891..71126) /rpt_family="L1MA9" repeat_region complement(71190..71483) /rpt_family="AluJb" repeat_region 71718..71912 /rpt_family="MLT1D" repeat_region 71912..71963 /rpt_family="MLT1D" repeat_region 71963..72316 /rpt_family="MLT1D" exon complement(72342..72422) /note="MZEF_prediction, 0.869, 0.388, 0.572, 0.643, 221, 0.512, 0.599, 0.535" /evidence=not_experimental repeat_region 74350..74653 /rpt_family="AluSq" repeat_region 74763..75302 /rpt_family="LINE2" repeat_region complement(75506..75644) /rpt_family="MER5A" repeat_region complement(76088..76310) /rpt_family="MIR" repeat_region 76446..76681 /rpt_family="L1M4" repeat_region 76696..78063 /rpt_family="L1M4" repeat_region 78059..78379 /rpt_family="L1MC2" repeat_region 78405..78704 /rpt_family="AluJo" repeat_region 78705..78739 /rpt_family="(GGAA)n" exon complement(80067..80230) /note="MZEF_prediction, 0.574, 0.531, 0.477, 0.469, 122, 0.544, 0.523, 0.563" /evidence=not_experimental repeat_region complement(81126..81291) /rpt_family="LINE2" repeat_region complement(82263..82341) /rpt_family="LINE2" repeat_region complement(82349..82410) /rpt_family="MER5A" repeat_region 82505..82569 /rpt_family="(GGAA)n" repeat_region 83102..83206 /rpt_family="(TA)n" repeat_region complement(83343..83453) /rpt_family="LINE2" exon complement(83820..83934) /note="Xpound_prediction, 0%, 69%" /evidence=not_experimental repeat_region 83885..83940 /rpt_family="MIR" repeat_region 84191..84284 /rpt_family="AluJ/FLAM" repeat_region complement(84306..84451) /rpt_family="MIR" exon complement(86402..86522) /note="GENSCAN_prediction, 2.00, Internal_exon, 121_bp, frame:1, phase:1" /evidence=not_experimental repeat_region complement(86897..87013) /rpt_family="(GAAA)n" repeat_region 87017..87381 /rpt_family="MSTD" repeat_region 87999..88731 /rpt_family="L1MA9" exon complement(88299..88340) /note="MZEF_prediction, 0.617, 0.493, 0.510, 0.334, 112, 
0.598, 0.515, 0.59" /evidence=not_experimental repeat_region 88732..89031 /rpt_family="AluSg" repeat_region 89032..89632 /rpt_family="L1MA9" repeat_region 89636..89933 /rpt_family="AluSg" repeat_region 89947..90604 /rpt_family="L1MA9" repeat_region complement(90635..90755) /rpt_family="MSTC" repeat_region 90790..91247 /rpt_family="L1MA9" repeat_region 91784..91959 /rpt_family="MIR" repeat_region complement(92028..93539) /rpt_family="L1MA10" repeat_region 93544..93904 /rpt_family="MER80" repeat_region complement(94419..94555) /rpt_family="AluJ" repeat_region 94558..94911 /rpt_family="MSTA" repeat_region complement(94930..95087) /rpt_family="AluJ" repeat_region 96615..96912 /rpt_family="AluY" repeat_region complement(96913..96936) /rpt_family="AT_rich" repeat_region 97811..98197 /rpt_family="MLT1C" repeat_region complement(99600..100335) /rpt_family="L1ME3" repeat_region 100368..100576 /rpt_family="MER58A" repeat_region complement(100584..100675) /rpt_family="L1ME3" repeat_region complement(101281..101421) /rpt_family="MIR" BASE COUNT 27935 a 22428 c 22623 g 28582 t 1 others ORIGIN 1 aagcttggtc gaaagccttc tatggcctca gtctatcact caagcagaaa ccaccaacac 61 ccacatgctt tcctgtctgc gttggcagaa cttcactatt aagttgccct ctctaatatg 121 actttactat tctaagaaat cgttgcccag gaaataaaag ttactaattg tattgtttca 181 ggaacttccc agaaaaacta atttaagtga ttatcaaact actcaactct tcaataacta 241 gtatcaagga ttatttactc tccaatacac ttctgagaac cctgccttaa aaccctcacc 301 ttgctgtctt caccaatcct gaaccattat atcatgatcc ttaatcctat tcaaacttct 361 ttccccattg aaagacctgc caaaccccct aaaccttaca aatagcccaa ctttgtcata 421 ctcacagaaa atactatgca gactttgttg agatagttgt gttgcttacc atgggaagca 481 gtaaactcag ctttgcctga ccagcaggtt gttggggtga tgtttttggg gagtcagtac 541 tggacaccta agtaaacttg ttacttcatg cttccctctt ctagaaggca gaataatggc 601 tgtccaaaga tgtccacatc ttaatcttag gaacctgtaa atatatttcc ttacatagca 661 agggggaatt aaggtagcag atagatttaa tgttgctaat cggctgactt tgaaatgatg 721 agatcattga tacagcttga atatatgtcc ctgtcaaatc tcatgttgaa 
ttgtaatccc 781 cagtgttgga aatggggcct ggtgggaggt gattgggtca tgggggcgga tccttcatgg 841 cttggtgctg ttgtgataat gagtgaattc tcaggagatc tggtatttaa gtgtgtggca 901 ccttccccct acttgctctc tcttgccctc attctcacca tatgagatgc ctgctccatc 961 ttcttccacc ctgattggaa gcttcctgag gcctccccag aagccaacca gatacagtca 1021 ctatgcttcc tacatagcct gcagaaccgt gagccaaata aacctctttt ctttataaat 1081 tacccaggct taggtatttc tttagagcaa tgcaagaatg gcctaataca attatcctag 1141 attctccagg tgggaaccca tgtaaacaca aggatcttta catgtggaag ggggaaactg 1201 aaagagtcag tgtcaaatcg atgagatgca ggaaagactc aaccaaccgt tgctggcatt 1261 gaagatggag gatggggcca caagccaagg aatgggcagc cttagaagct gaaaaaggca 1321 agaaatgcat ttctacctgg agcctccaga aaggaatgca accctgccaa cacctagatt 1381 ttagcctggc aagatctgta ttagacctct gacctccaga actgtaagat aatcttgtgt 1441 tgttttaacc agcaaatgtc tgttcatttg ttttggcagc agtagaaaac cagcatatct 1501 ctccttgctg atgctccttt ctccatctga aatctcaatc tgatctcact cttctgcctg 1561 ttgaaatcct gcctaccctt tcatttcaat ggcagatcca cctcctccaa gatacacttc 1621 cctgaacacc ccttttctga attctatcag catctggttt gttctccttc attatttatt 1681 atttttattt ttttcagatg gagttttgct cttgttgccc aggctggagt gcaatggtgc 1741 gatatcggct cactgcaatg tctgcctccc aggttcaagc gattctcctg cctcagcctc 1801 tcgagtagct gggattacag gcatgcacta ccatgcccgg ctaattttgt atttttttag 1861 tagagacagg gtttctccat tttggtcagg atagactcaa gctcctgacc ccaggtgatc 1921 tgcccacctt ggcctcccaa agtgctggga ttataggcgt gagccaccat gcccagcctg 1981 ttctccttta ttttatacga tcacatttac tagggtatgt agcgtatatc ttctatgtct 2041 tataaataac ttttgcacag ctgaatggta aagatttgtc tattctttca atgtaaccat 2101 accattatcc ctttagctag ctaaataaca tccatggtgg gtacagaagt atcctgacag 2161 aagtcatgta tttgaggaat gaaatgtaag ttcttcatta attaaggctg ggagaatctt 2221 cctcttttgg aaactaatat atcaacttga aagtcaattc taataagaag gttgaaagaa 2281 tgatatggaa aggaatttgt gaataataga aaagaaatga aaagaacact ttctgtaagg 2341 ctagcttctc cactgggcat cagggctgtt gctttgagcc tatagctttt ctagcgccta 2401 tgaaaatatt tgaaagccta gacaagaaag aaacatcagg aagttaaaat aggttcatca 2461 
aatgtctaca aaatgtaacc ttatctcaac tcattaactg aattttgcat acatacaaat 2521 tccattacat ttgcaaaaaa tgtgtaagtt tgtttttctc acttcatcag aattcagaat 2581 tctcaagcat gcaccaggaa cgaaaataac tacttttcgc caactaattt tgaaagcaat 2641 gttttctctt aattgtttca aattacttta catttgaaaa tatttaagat gagaagtaga 2701 tgcattttaa catatgtgat ggaatgtatc atggagcctc caaaattgag agaccctaca 2761 gcctacccag agcttaaagc agccccgtta gttactgggc agaaaggaat ggggtgtggc 2821 tgacaggtaa tgtggatgat ggtggtaata acagtgactc atattccttg cacacttacc 2881 atgtgtcctt tatttcttac atcatttttt aaaaattgag ggctgggcgt ggtggctcac 2941 tcctgtaatc ccaggccttt gggaggccga ggcgggcaga tcatgaagtc aggagatcaa 3001 ggccaccctg gctaacacag tgaaaccccg tctctactaa aaatacaaaa aattagccgg 3061 gtgtgatggc aggcgcctgt agtcccagct acttgggagg ctaaggcagg agaatcgttt 3121 gaacctaggg ggcagaggtt gcagtgagtc aagatcacac ccctgcactc cagcctgggc 3181 aacagagcaa gactccatct caaaaaaaaa aaaaaatttg aggtaaaact cacgtaacat 3241 acaagtaagt attttaaatg cacaatttag tgcattcaca gtgttgggca gctaccacct 3301 atctctagtt tccagtcttt ttcatcaccc cagaaaaatt ccccataact gtcaaacaag 3361 aactctccac tccttcctcc cttcagcccc tggaaaccac caatctgctt tctgtttcta 3421 tagatgtgcc tattctggac atttcatata aacagaatca tacaagtgac tttctgtttc 3481 tgacttcttt cactttgcat gttttcaagg ttcatccatg tcgtaacata tgtcagttct 3541 tccctcattt tttggctgaa taatattttg ttgtgtgtat tttgggctac tatgaataat 3601 tctgctgtga acatttgtat acatgtttct gtatgggctt tttttttttt tttttttttt 3661 ttgaaacagg gtctcactct gtcacccagg ctctggagtg cagtggtgca atcatggctc 3721 actgcagcct tgaactcctg ggctcaagtg atcctcccac cttggcctcc caagctgctg 3781 ggactacagg catgtgccac catacccagc taatttttgt attttttgta gagacaaggt 3841 tttgccacgt tgtccaggct ggttttaaac ccctgagcca agagatccac ctgcctcagc 3901 ttcccaaaga gctgggatta cagcctcgag ccactactcc cagccacatt ttgatttctc 3961 ttaagtatac acactgggag tgagactgct ggatcatatg ataagtctgt gcttaacttt 4021 ttgaggaatg attcctttca tcttttcaac agcaatatgg gaattggaaa tcagtgtgat 4081 catgtgccca aggtgacact cccataagag gtggagtgga atttacagtc aagccattct 4141 gattcagagc 
ccctgtcctt gcactcagtc aagatgtctc gttagtgtac atgaaataga 4201 gacagtggtg ggagctgagg gctgcacgca gggcacaaca agaggtagag gctagcaagc 4261 gtaggatgtg ctcaggtgca ttcagaatcg aggactatga gggtggacag gggccagatt 4321 cttgagaatc ctgaaggaca gcagagtagg aagccatggc tccaaagtca ggcagccaga 4381 agtagatggg ctttgaggac caatggtgcc agggtgtgtc attgggacct tctaggaact 4441 caggtaaaaa atgatggcca agagtaggca atgttagaga gttcatcctc aggcacagac 4501 agggcaaagt gtgcagcacc gtggactgcc tttgtgactt ttatgattag attgagagaa 4561 tcaccacctc ttagcagaga gagcaacaaa caatgcaaac aatgcaaaca cagacacaca 4621 aagaaggaga aacaatagcc ctcatcacca cagccctagt ggtggcagca actaaggaaa 4681 ttatatagag aagttggagg agcctgtgca accagcaaac agaaatgttg tgaacccatt 4741 gagatgtatg gactttccct gacatgctac caaaagtttg tcattttagg agccctgaag 4801 acttcttgaa gatctgatgt gcaggcctag atctcaagct tatttatttt tctgttcttc 4861 ttcttgcata ttaagatgga tgcttttctt attaattttc agccttgccc tttttcccca 4921 atatgtataa acaatttcaa gctataaatg tgcttctaag ttctgcttta gctgcacgtc 4981 ctctttcttt ctttctttct tttttttttt tttttgtgag atggaggcag gagtgcagtg 5041 gcgtgatctt ggctcactgc aacctctgcc tcccaggttc atgccattct cctgcctcag 5101 cctcccaagt agctgggact acaggtgccc gccaccacac ctggctaatt ttttgtattt 5161 tttagtagag atgggtttca ccatgttagc caggatggtc tcgatctcct gacctcatga 5221 tccacccacc tcggcctccc aaagtgctgg gattacaggc gtgagccacc gtgctcagcc 5281 tcctttttca tatttagtat tttcctaatc attcagttgt aaatatttta atgtccatta 5341 tgatttcttc ttccaccttt tgttttgatt atctattgct gcattccaaa aaaaaaaaat 5401 tagcttaaaa tgacaatcat ttcattctat ctcatgatat tgtgggtcat gaattgggac 5461 aggtttcagc tggatgattc tcctgctctt ctgaacatta actaagggaa ctagtgatct 5521 tcagctggta gatgggctgg tctggagggt ccaaggactt tcactcacat gttggcatct 5581 gggtgcagat ggctggatgg catgctcagt gagactgctg gacagtggct acctctggcc 5641 tctgcagctt ggcaggctca ggatggttgg accaactcaa agcagctcag atccccagag 5701 agagaggaac tggccagtct gggaattgaa tctagaaatt agcaaagcac aattccacca 5761 tagtctgttg cttgaagcaa cacagaaagg tctcccagat cattggaagg cacacagctc 5821 ctgctgctga atgggaatag 
tggaaaaata tttctgatca ttctttttgg agtgaattgt 5881 gtcccccaaa aattcatatg ttgaaatccc ccagaacctc ataacatgac tttttgcagg 5941 tagggccttt atagaggtaa ttaaattaaa atggggtggt tagggtgagc tttagtccag 6001 catgactgga ttagaagagg agattctgac acagacacac aggcacaaag cccatgtgag 6061 gacagtgaga aggcagccat ctgcaagcca gagggaggcc tcgcaaggaa tcaagcctgt 6121 cgtcacctca atcttagacc tcttgcctct agaactgtga gaaaataagt ctaatgttta 6181 agtcacccag tctgtggcgt tagttatggc agccctagca aactaataca ttatctttaa 6241 tccatacatt aatgaattct tcataattac ctttcttttt ctttttcttt ttcgagatgg 6301 agtctcactc tgtcgtccag gttggagtac agcggcgcga tctcggctca ctacaatctc 6361 tgcctcctgg gttcaaatga ttctcctgcc tcagcctcct gaggagctag gattacaggt 6421 gtgcaccacc acgcctggct aatttttgta tttttagtag agacaaagtt tccccatgtt 6481 ggtcaggctg gtctcgaact cctgacctca tgatcagtcc acctcggcct cccaaagtgt 6541 tgggattaaa agcataagcc accacgcctg gcccataatt atctttgtta aacttcaatc 6601 acatgggttt ctaatttaat agcatggcga ctacagaatg tggtctgtca tgtatcaagc 6661 atttgaaatt ggttgagcct tccgttttag cctttatgta tgtgtaagat attggaaaat 6721 attcttggta aaattgacca gaagatgtat tctacagtca ctgtttaaga aacaagtgaa 6781 aatggtggtt aaaataacca aaacaactaa acgtgtgtgg aaataaacaa agactaccca 6841 aaggagagca ggcagaggcg atccattcag agcttgctgt agcataggag tcagacactg 6901 tcacttgtgt ttggcagaaa ctcaaatcag gctgttgatc aagtttttat catgagggca 6961 ggtttcaaga ctgccattca tttttcactg atagtttact ttgggcttac caggaaaatc 7021 agtcactgtt aaccaatcat tggtttctat gtaaatcaaa atattaacaa ttgcccaagt 7081 gcagtagctc acaccttcaa ccccagcact ttgggagcct gaggcaaaca aattgcttga 7141 gcccgggagt tcaagagcag cctgggcaac atggcaaacc ccacctctaa aaaaatacaa 7201 aaattagcct ggcatagtgg catgcacctg tagtcccagc ttctcgggag gttgaggtaa 7261 gaggatggct tgagcctggg aggtcaaggc tgtagtgagc catgttcata cgactgaact 7321 ccagcctggg tggcaaaggg agaccctgtc ttaaaacaaa caacaacaaa aaacagtctc 7381 tatttacata acttaaagaa gccctttgga gaaggtagaa acagactttg ctgccctctc 7441 tgagcaaaca ttgtctcatt aatcctcccc acaacccaac aaggaggttt gacctactga 7501 gttccatttt aggagataag aagccgctgt 
atgcctcagt tggctaccca gtctaggggc 7561 agtaagcccc cacttgctat aataggttgt gattgtctga tgtattaaac tgaactttaa 7621 aaagtcattt ctatttcgct acacagcagt ggggcagcgc agaaacaagg aaattatatc 7681 ttccgcttcg actgtttttc atttttactt tgtgtttcta gattctcaaa cttccatcac 7741 tctaacagat tctcaaatta gaggagaaaa caatcttact ttacataaaa acaccaaatt 7801 gatcctttat ggatgcagtg tggtcaatga cagcatttgt tttcccagca tagattgtgc 7861 tgatgcattg tgtatgtggc ctggtctgta gcctgtgtga ttgacagatg attgagagac 7921 agagagagag agagattgtc aggaggctta atgatcctca caacacaact ggggtaggtg 7981 ctgtggcccc cacattacag ccgaggccca gagagagtca gtagctcacc caaggtcaca 8041 cagcttgtga acagcagagc cataatttga tcaatgctgc taggctgcct gtcaaaatac 8101 acccaaaaca aaacttactt tttgtatagc atgagctttt aattctacgg gttgctgttg 8161 cattgaatta tgccctagtg taacacaaac taaaatatcc agttgcaaaa ttaaataata 8221 aataaataat gggtctgcgt gctgtttatc agtccaaatg tggctgcatt ttttggccaa 8281 ctatgtcttt taagattaca tcctaacatt aagtttttgt tggtttctaa tcccagcttt 8341 ggcagttttg gacaaaagct atgggcataa tatcctcttg gccctccagg caggggcgct 8401 gcagattctg ggctgggggt cagtgggggc agcttacaaa atgtttttct gtgagggcct 8461 cagctgaaat agaggggaac actgctgagc tctggcctct gtgtctgtat cttgactgac 8521 tttttaaatt tttttttctg aaatttctgt gtggcctgac aagtccgaga ctaccacaaa 8581 gtggccacaa aatttgttta aatgttgtct gtcaagttgg aaattttatt ttaaaattca 8641 tggtgcaatg gcattataat aaaccttatt tttccttctg cagtctgttt ctggatgctg 8701 gcagtcattc agaaacattt gctgagcacc tcgtacctgc tgaatgaacc cacgtgaagg 8761 ggagccagga gtgcggatca ttaaaaagcc tgtactttct cagtccccat ttgacaccac 8821 agagagaaaa tgcatgctcc tgatttgggt agatctagaa tgttcttatg aatatggtag 8881 gtggagcaag ggaaccaggt tcttatgata catgataaga aaggccctag cttggtcaga 8941 cacaaataaa agccaagctt ttcatctaaa ctcccttgca ccatcacccc cttaccaacc 9001 agtacagaga gcttaacatc tctataccag atagtgaatg tttctaataa gaatggcaaa 9061 ttcttattgt gagagagata agggcattca agtgttaatt caagagaaat atgtatatgg 9121 ttctttaaat gatttattta ctaacagcat taagtatggt gtaaaggaat ttacataccc 9181 taattctcgt ctatatattt ttttggattt taaaatgaca 
acttgtctaa gttcgcatta 9241 gtattaagcc ctaacatgag acagtattta tagcataaga tttttgccac ctatttttaa 9301 ttgtagcgct gttttatcaa taagatcaat taaagagttg aacttttgaa aagaagaaac 9361 atgaaaatga atttctaact gtacagtatt gaatacatga aagttctctt aaacttagtc 9421 cagagtctca ataaacatgg taagtatgta ataattcaca tttacaaccg ccataaaaac 9481 tctatcaacc aaattatttt atgacgtcaa aattatctga aaaaaattga ggaactaaaa 9541 tttttaaaaa taggttctca gttagagctg tctattaaaa ggataaattt taaactctcc 9601 ggagctggcc ttcctggttc caggagcagc ctgttggcct tctctcgcaa agctcagttc 9661 ctttcctcca ttccagaaga ccctcaagcc tgtcccatgc acctggtttt acatttttaa 9721 atgttaaaag gaagaccacc gtgaaagatt ggcatgttaa tagtaggcta aggcgacagg 9781 tgctggggct ccagcttaga gatccccaca gattccatgc tcccagggct ggctgcagcc 9841 atctgccccc acccaccccc acccaccccc acccggccca gccctcccct cctggaggtg 9901 gaatcgtggc ttgcacaatc ccctctgggc ccctggggga ggcgcaccat attgattttt 9961 tggacttcgt ttctgtgtgt ctcaccagtc tttgcccggc acagtgactc actggcgtcc 10021 ctttcaggac tttctcccgc ctgataacat ctaaatgact catgcctaca ccacctggag 10081 agctcaggga aggcacggtt tgttcaggcc accctcactc caagggccag cccttttttg 10141 gaatagaaac taaaagttgg tagcttccat ttgtagttcc atctaacaga agagccgacc 10201 cactcacatt agctggctca cgactaaggc aaccaatgcc tcccagaaca tagactcaac 10261 tggcaacccc ggcactgaag gtcaaggtga gtcaaagaca ctcatgcaga agggtggatg 10321 ctcttccaag tgtgactggc tggcctgcac ggctcactgc ctcccctcct cccgtcaaga 10381 tgcagcttca gtttaaacca ccttcgttcc attcatcaaa gaggaatatt tactgttatt 10441 gtaatgactc ttctcacagt acacaaggct acacctgaaa cacacaggcc tttcagaaag 10501 cacaaaggag cattgtgcaa gcccctgccc agtgagatgg atgggggcgt agacacgcca 10561 gttaacagcc gggctttgca ggccagcaag acttatctgt ggccgcacca tggccactga 10621 gcttgtacct ttcttctttt aaaaattgct ccttcattat aagcagagtt aaatacctgc 10681 acatgtgatt tgtgttgaaa taataaagca acaaacacac atcaagctga ggagagaatc 10741 ttggctaagg taggaagtcg tgggtctggt ctcagtctgt gctgctgacc aactgtaaga 10801 actcagaact cactgtccac tcaggaccac agtgtctcat ctgaaaggac gaggagactg 10861 gaacacacca gagccatggt tctcaatgtg 
tggtccagag gccctcgggg gtccctgagc 10921 ctctttctgg ggatctgtga ggtcaaaact actttcacaa gtatggtttg cctttttcac 10981 tctcactctc tcatgcttga acaatggaat cttctaaagg cttcctgaca cgtgatgcta 11041 caatggatgg aatccagagg tggataggag aagccagctg ccttctatta aaccagaaat 11101 gaagaacatt tgcagaaatg tacaacaatg ccatgcttcc cattaattat tttgttcttg 11161 ctttagaata tgtcactttt cattgaaaaa tgttactcat gattttatgt aaaaggttta 11221 ttgttgttac ttttaaatat atattttaag tgttttttag ttttgatttc taatatggca 11281 aaaattggta aaatcacata cacaagggct tttggggatc ctcaaaactt tttaagagtg 11341 taaagggatc cagagaccaa aagtttgata accactaacc aaaaactttg acaaccatca 11401 tcttgaaggg tccttggact ttctggaatt tggatctccc ttgccttgtt cagttttctc 11461 tctttttagc catctcgtac ttacagccat ttcagttcct atgtacctga cttattaatt 11521 atgcttattg tttaccatgt gtgttcccct cccatataaa ctccatgagg actgagatat 11581 ttgtccatga tgttcatcag tgcatcccag gctgcctaga actgggcctg gcccttagca 11641 ggcacttgtt tttggatggg agggaggaga agggagggga gagggcaagt tctttcagtg 11701 tctgtataat ggacagagca aggtgtacca cctcggtgtg atagaagaga tggagacagg 11761 agaaaggaag ctctgcacag gatcacgtgg taggtggcaa agtcagggct ccatcacagc 11821 ttctgttgtc acatgactct gttatctgac cctttaaggc agccccttct gctccaacag 11881 agcttcaggg taactgagtg ccccgcaatt ggtgttagca gagaatctgt gtccagttga 11941 tccctaaaag tttgaggaac actcattccc cagtgcataa gttcctgata aatgacctca 12001 taaataaatg acctcccttt gtaggaggtt ggctggaagg ctcagatgca catggctgct 12061 ggaagtgagg tcctcccaca agagaaccaa cacgtccttc aagggtctcg gggtctgtga 12121 acagacatgg acatcaggat tggttgaggg gcttccaggt tcaatcctgg tggcacaggt 12181 gaggcccatt caccagacca gagtgtcgcc agccatccat agcaggtgtg ctctttgtgg 12241 ctgcctatgg cttttagtcc tgaaaagctc aggtttaagt agccaggagc acaacccagg 12301 ccttgctgct gcttctggac atctctggga gtgaaaaggg agttctatgc tcagccacct 12361 gagtttgtgg agctcgtttc aatctcagcc tctcccatca gcctcctggt gactcctcta 12421 agaggtctca gtgcttcctt caccctcaat actcaaggcc ttggcgaaag ggaattcttc 12481 tgagacctga agtgtagaat cacgagggag aaacagtcaa gcaaaattga ccgttatcgc 12541 ccaggagata 
agacccagga gtcagttcag agaacgcaga catgaccgta gaattgtgta 12601 catagcaggc cttgccctca gtggatatcc aataaattgc aatggacata tgttttcttt 12661 tctgcacact ctcttgcctt tcccttggct ggatcttggg gcttctggga gagtgatgca 12721 gaggggaata ttggaagaag aggctgagag cacagatgaa agtcatccca tctgtttgga 12781 tttcatttta agagcagtgc agaggcattg agtgttttta ggcagagaat aacattaagg 12841 aacgtacagg tgtacttcgg agatattgca ggctcagttc cacactgcag caataaagca 12901 attatcgcaa taaagtcata taaatatttc aggtttctcg gttcatataa aagttatatt 12961 cacactatat tgtagtctat ctagtgtgca atagcattat gtctaaaaaa caatgtacat 13021 accttaattt aaaaatactt tattgctaga aagtgctaat aatcatctaa gtctttgtca 13081 agtcataatc ttcttgttgg tgggtggtct ttctggatgt ggatggctgc tgaccaatca 13141 gggtggctgt ggcaatttct tgaactgaga caacagtgaa gtttgctgca ttcattggct 13201 tttcccttca tgaaagattt ctctagcatg caatactgtt tgatggcatt ttacctacag 13261 aacgtctttc aaaattggag tcaatcttct caaactctgc cactgcttta tcaactattt 13321 atgcaatatt ctaaatcctt tgttgtcgtt tcaacaaagt tcagagcatc ttcagcagga 13381 gtagattcca tctcaagaaa ccactctctt tgcttattca taataaacaa ctccttgtgt 13441 gtgaaagttt tatcatgaga ttgcagcaat tcagtcacgt cttccggctc cacttctaat 13501 tctagttttg ttgctttttt ttttttacca catctgcagt tatttcctcc actgaagtct 13561 tgagcctctc aaagtcatcc ctgagggtta gaatccactt cttccaaact cctgttattg 13621 ttgatatttt cacttcctac catgaatcat gaatgttctt aacgacatct agaatgagaa 13681 tcttttccag aaagttttca atttactttg cccagttcca tcagaggaac tactctctac 13741 ggcagctata gccttataaa atgtatttcc taaataagac ttgaaagtca aaattatccc 13801 ctctatctgt ggactgcaga atggatgtta tattagcata catgaaaaca gcatttatct 13861 ccttgtaaat ctccatcaga gctcttgggt aactaagtgc attgttaata ttgaacgata 13921 atattttgaa aggaatagtt ttttctaagc agtagttttc aacagggggt ttaagatatt 13981 tagtaaacca tactataaac agatgtgctg tcagcgaggc cttgttccat ttctagagaa 14041 caggcagaat ggatttagca taattcttaa gggcccttaa gattttcaaa atggtcagtg 14101 agcattggct tcaacttagt taccagcagc accagcccca ccaagagagt aagcctgtcc 14161 tttaaatttt tgaagtcaga cattgacttc tcctctctag ctatgaaaat cttaggtagt 
14221 atcttctaat acaaggttgt ttcatctaca ctgaaaatct gtgatttagt gtagtcacct 14281 tcatcaatga tcttagctag atcttcagga taacttgccg cagcttctcc atcagcattt 14341 gctgcttcac cctgcacttt tacattctgg agatgacttc tttttttaaa tgtcatgaac 14401 caatctctgc ttatgcagct tcctcatacc tctcaggcat cacagaattg aagagagtta 14461 gggccttgct ctggattagg ctctggctta agggaatgtt gtggctggtt tggtgttcta 14521 tccagaccac tcaaactctc cgcgtcagca taaggctgtt tcactttctt accattcatg 14581 tgtttgctgg agtagcattt ttaatttact tgaagaactt tctctttgca tccacaactt 14641 ggctatttgg cacaagaggc ctggcttttg gcctatcttg gttttgacat gtcttcctca 14701 ctcagcttaa tcatttctag ttttttattt aaagtgggat acgtgtgacc ctttctttca 14761 cttgaacact tggaggccac tgcaggctta ttaactggcc taatttcaat attgttgtgc 14821 ctcaaggaat aaggaggtcc aaggagaggg agagagacag ggaaatgccc agccagtgga 14881 gcagtcagaa tacatacaca tttattgatt ccgttcactg ttttctgtgg gtacagtttg 14941 tggttcccca aaacgattat aataataata tcaaggacca ctgaccacag accaccacaa 15001 cagatagaat aataataaat ttgtgcaaat aaataataaa aaaacattgt gaaaattacc 15061 aaaatgggac acagagatag gaagtgagcc catgcctttg gagacatggc accagtggac 15121 ctgctgtata ccacaacctt cagtttataa agaacatggt atctgtaaag tacgataagg 15181 taaagcacag taaaacgagg tgtgccagtg ttggggagca catctggctg cctggcatag 15241 tattgactga aagaaagata acagagggag ggtatctgaa aagtaactgc cagatgctat 15301 ggtgtctgct cctttgcagt cagggagaag agaaagaaga gactgattat caaagaaatc 15361 ccaaagctag aatccgcaga cccccattcc agattcatta aaagtggaca agagataggg 15421 gagcctgagg attgtccctg gttgactgga aggtcaccca ggaggaaggg gtgtacctca 15481 gagatggaca gggatgggcc ttctgctcgt cctacaggca cccattctca gcacaggctc 15541 atccagccaa agactaaatt tcccagcctc ctttgcaact agatgatatc acgtgacaaa 15601 atccagacca atgggcctgc catgtgcaat tctgattagt gcccttagga agatgagcct 15661 ctgcttctct ttcctccttt cctagggagg aatgcaaatg tggtagggag acatcttcaa 15721 ccaggagaat gaggacacag caaggtggga ggatcctaag tcccaaaact accacaacat 15781 caccaagcca acaaaccagc ctagggtcac cttccccaag tgttatggga gtgaataata 15841 aacctctatt tctttttttt ttttttcctt cgagacaggg 
tcttgctctg taacccaggc 15901 tggagtgcag tggcacaatc ttggctcact gcaacatcca cctcccagct tcaagtaatt 15961 ctcatgcctc agcctcccaa gttgctggga ttataggtgc ccgccaccac atcttgctaa 16021 attttgtaat tttagtagag ccagggtttt gccaggttgc ccaggctggt ctcaaactcc 16081 tgacctcaag taatgcatct gccctggctt cccaaagtgc tgggattaca caggagtgag 16141 ccactgcacc tggccaaacc tctatttcat ttaagcccct gtatgtgtgg cctcctgtta 16201 ctgatgttat tttcatgtgt tattctacgc gaggggcaga acaacttgct tttgttgtat 16261 ttcaagtgac tgtatcgctg aaccaaccac tctcggagcc acacttcctc tgcctacagg 16321 ctctgtgttg acttgtggca tgaagaacac ctgtgaggta tccacagcat acaccagccc 16381 agggacggag gccacatggg cctgagcctg cagctggcag tggcggacac agatgctcca 16441 gcaggagtga ggggaaacac tccaccaaat gaaggcctca gggtcattcg gacctcgtgg 16501 aaacaagttg tatcaggttt gctggcgaag cagctaggag gctgtggcag ctgttattta 16561 ttcttcagta ggacatttgg cgcaatgtct tacatcttca tgggtaatta gacaaaaaga 16621 gctcaccacc cctaggtgat gtcaatgcag agggagagtc ctagtgacat actacgggct 16681 tggcctcagt cactgtttgt tcaatgactt gcttatcagg gttatatgtg ggccaaagca 16741 gttactaata gaagagacct tggataacaa actcgagggt cataaagatc tggagaggct 16801 ggaatgatgc actgaaccta cacagactca ttcattcctc ccccatctgt tcattcagtg 16861 aaagcatatt catcaacacc cagaggacgc caggcactca ggaaggtgct gggacaccac 16921 aggtgaaaag ggtccggttc tcactgaccc tgcgctttcg tatgtggaga aagacaatga 16981 aatacaacag accgaggcag gtggatcacc tgaggtcagg agttcaagac cagcctggcc 17041 aacaacgcaa aaccccgtct ctactaaaaa tacaaaaatt agctgggcat ggtggcacgt 17101 gcctgtaatc ctagctactt gggaggctga ggcaggagaa tcgcttgaac ctgggaggcg 17161 gaggttgcag tgagccgaga ttatactact gccctccaac ctggggagga gagaaagact 17221 gtctcaaaaa aacaaaaaca aaaacaaaag acaaaaaaaa aacagataaa gcaacaaaat 17281 catttttgga gagataatgc tgggagcgca ttattggcag gatgtgatag tgacatgaaa 17341 gtaaggctct ttccgcttgg gatcggcaag aactgccctg agccagtgac atttgcccat 17401 ttgcctcaga tgatgaaaag gagccaggct tgagagccag gtggaaagaa cgccagggag 17461 cagcgagtgc aaaggcccca gggggcagca acgggagttt tccaggaaca gacccaggca 17521 ggtacactgg agtgtgacca 
gcaaagagga gaatgaggaa gggggtgggg accaggtcac 17581 agagggcatg gtgaggagtt ttggattctc ttctctgcag ggtgagaaac cttctgggaa 17641 ttttgaggag agaggtggcg tcataggatt aacttctcca aaggctacat tggctgctgt 17701 gtgcagggtg gattctagga ggtagaacag aggagagctg ggaggctctt gcatggtggt 17761 gatgaccagg gcactagaga agagactgtg cttgagatgt gttctggaag cagagtccac 17821 ggcatttgct gatggtttat gggagtaaag gaagaagaga aatccaggac aactccaggt 17881 cttttgctga gcaaccggtg ggtggtggta ccggttcctg ggacgagaag cccttttctc 17941 catctctgct aataactcgc tggcctgagc cagcctcgtc acttgccttt aatgctcctg 18001 tggtcacatg gctggctgcc ctgctgccac cacgcctctt ccccatggtc tgggattgat 18061 catgtgcagg gcagcacctg aggtccacat taacactccc acacaccaac agatgagctt 18121 ggaaggaagc ccctgtgtgg ttcacacaca ggaaataggc gggtggtcac tgtgtgggac 18181 tgtcttggat ggcaggtcaa tggcacatct ctggtcgagg aggggaataa ggacaccagg 18241 caggggagat gcctgagttg aaggagtgag accaaggcag aagcccaccc agaggatgag 18301 ggctgaggga aagagaaggt tctcatgcac agtgaacagg aagcaggggg ctctccggac 18361 ccggaagtga ccctgggccg gccaggggac agaccagtct ctactatacc atggggctgg 18421 ctctgggggt attttaggtc tccaggttga tgtctgcaag gccagctttt tttttttttt 18481 tttttttttt ttggagacaa ggtcttgctc tgttgcccag gctgcagtgc agtggctcaa 18541 tgcaacctcc acctcccagg ctcaagcaat tctcccacct cagcttcctg agtagctagg 18601 actacaggca cgcaccacca tgcctggcta attcttgcat tttttgtaaa aatggggtct 18661 caccatgttg ttcgggctaa ggccagcttt ttcttgctga tgcaggaaga gggactggat 18721 cagactcaaa tctgagaggc cctggaggac ccagctatga gaaactagca atgaactaga 18781 gcaggagggc caaaggcagc ctgctggctt cttctcattt ttccagcctc tgacacctca 18841 caccattatg aacatgtaat tgacctgaat ttgtaaattc ttattctttt ttttaatgat 18901 ttctttcctg catgaatggc agagggtgac tcatccccag ggacggggct caacctttca 18961 acatgaactc accaccatgt aacgcgtgcg gcccccactg gctgaagcca tttgataaat 19021 ccctttttct catttttaca gagagaaggt ctaagatcca agctacagcc taattcagcc 19081 atttgccctt tacccgagtt tttcaatata ctccacactt tcttactcct gctgagagcc 19141 ccactttcct tgaatgcatt cctctcttgg ctccctccct gccgccggcc ttctccctct 19201 
ctctcccttc caggtcactt ctcgtgatga gattggactt gtttcctggg gctctggtaa 19261 caagttagct taaaacagag aaatggaccc tttcacagtt ctggaggcca gaagtctgaa 19321 atcaaggtgt tggaagggcc actctccctc tgaagcccca gtagaggatc cttcgtggcc 19381 tcttccagct ccggggtccc tgacaccccc cggcttatgc ccgcttctct cccctcactg 19441 cctccgtcct cacacggcca tcttctcatc tccttgtatc tcttctctgt gtgtctctta 19501 taaggacact tgtcatttga ctcaggaccc agcaggataa cccaggataa tctcgtcttg 19561 agatccatca cttactacaa ctgcaaagtc ctttttttag atggaatttc gtgcttgtca 19621 cccaggctgg agtgcaatgg cacgatctag gctcgctgca acctctgcat ctgggttcaa 19681 gtgattctcc tgcctcagcc tcctgagtag cagggattaa aggcacctgt caccataccc 19741 gtctaatttt tgtatgttta gtagagacgg ggtttcacca tgttggccag tctggtctca 19801 aactcctgag ctcagatgat ctacccacct ttgcctccca aagtgcgtga gccaccgcac 19861 ccagccaggg ctgctcttaa atgatgaaac caatgggctc tacagcctca aatgctgagg 19921 gtcaagctgt gtcctcttct gccaagaaca aggaagtggt atgtctgagt gatatgtcat 19981 tgctcattgt ctgccataag gttgggacaa agagcagagt ctgggcctgg ctctcagctt 20041 agccagccag gcccccgggc caggctgagt cactgtgtcc tcactgctta acccagggta 20101 gcagtgctca ctctgcttat cgcagggcaa ggggcgatgg agctgctatc tgggaaacag 20161 tttgaaaact gtgaagggct ccacaggcaa agtgccattc tgtttgtttt agtggggaat 20221 cactctagga aaaaattagg aggattctgg tcctgaatcc tctagaagtg tgctctgcga 20281 ctttgggcaa atcatttaaa ctgaagatag taatgctact caaaatcagt agtgtcacag 20341 acgtgtccac ctaggaaagg tataaagtac tatataaatg tgaggttcta tcattattat 20401 agtgtttacc atttcagcaa agtattgtaa aagacattac ataatgcatt tgcatctttt 20461 tactgataaa acttctacta taagcattag tatttactaa ccagcatttt acattttcct 20521 ttgaaagggc gaaattcctt ccgcagaatt aaatcgtgta atttttccaa cagatattta 20581 ttgagtgctg tgcatcagga tttattctaa gcattgcagg aaagtagtga acaaagcaga 20641 tataattcct tccctaatgg ggtggaattt gtgctgtgga gacagagaat tattaaggta 20701 aaaaagcaaa agataaagaa cgtaacgata agggcttaag aaaaaaaaac tcattcggtg 20761 tcagaaagga gttgttgaaa ttttaggtgg ggtggccagg gaaggcttca ccaaaggtga 20821 ctatttagta aagatctggc agcggtgaca gtaggagtca cacggactcc 
tgggaagacc 20881 cttctaggga gagggaacag ctcaggaggg agttggcctg gtgcccagag caggaactga 20941 acatggaggc agagactcag tcatgtaact gctggaggct gagtctcagg gcctccaggg 21001 gcgtggcggg cttgggctca ggcctttctt ctgagtcaca caagagctgt tggagggttt 21061 tacctgagga cggacaggct gactgccgtt taaacaggat ccctctggct gctgtgttga 21121 gaatgggtat cacgtttaga agccttttga aataagtatt tgaatgcaca agtcatactg 21181 aaagtcattt tttaaatgtc aaatgcttgc cttttcccct aaactagagc aggtcattct 21241 tgcagtgaaa tagtccactc acttttctat agtaggatga aaagaaagaa acgagaaggc 21301 acgacttgct gtatctagtc aaacattctt gctaccaaga aacattcacg ctccgagtgg 21361 agaaggggct gcctgcatct cgtgccctct gggtgagccc ggctatgtct tccttcacca 21421 ccctctgctg ggagggtggg agctgagggg acaaggacac ctgagagagg cctgtgatgg 21481 gctgcccagt gccacccaag cccctactgc acatcttccg ccccagccca ggttggtcca 21541 gaaaaagaac atcaagtgtt gttacccaat tctggtgtca aagacaccct aagagtcctc 21601 cattgctact gaccagcagt ttcacatgta cctacaaaca gaaatggggt aattgggttt 21661 aataaaaaaa aaaatacatt ggccataaaa aggaatgaag ttcatccatg cagtacatgc 21721 tacagcacaa aagaaccttg agaatattat gttcagtgaa agaaggcagt cacgaatgac 21781 cacatattgt atgattccat tcttatgaag tgtccagaac acttcataaa tcataaggca 21841 aatccatagt atcggaaggt agatccgtgg ttgcttaggg ttgggggtcg gagggataat 21901 ggggtgatag ctaatcaaaa cagctttgtt tttgaggtga tgaaaatgtt ctaaaaggga 21961 cttcgataat ggttgtactt acctgtgaat attctttttt ttaaatattc tttttttttt 22021 tttgagacat agtctaactc actctgtcac ctaggctgga gggcagtggc acgatcttgg 22081 ctcactgcaa cccctgcctc ccgggctaaa gcgattctcc tgcctcagcc tcttgagttg 22141 ctgggattac aggcatgcac taccacaccc agctaatttt tgtattttta gtagagatgg 22201 gatttcatca tgttggccag gctggtctca aactcctgac ctcaagtgat ctgcccgcct 22261 cagcctccca aagtgctagg atcaatggtg tgagcccaaa ggtgtgagcc caaaggtgta 22321 agccaccgtg cctggtcctt acctatgaat attctaaaaa ccgttggatt gtatagtcta 22381 aatgggttgc tatggtctga atgcatgtgt cccctcaaaa ttcctatgtt gaaacctaac 22441 ccccgaggtg atggtgttga ggtgggacct ttaggagata acagatcatg atggtggggt 22501 cctcatgaat gggattcgtg cccttatgaa 
tcaggcctga ggagcccttt atcccttctg 22561 ccatgtgagg acacagcaag aaacagagag cctcaccaga cactgaatct gctggtgcct 22621 tgatcttgga cctcccagcc tcccaaacta tgagaaacaa atttctgttg tttacaaatt 22681 ccccagtcaa atgtattttg tcatagcagg agaagtggac taagacatgg atcaatttta 22741 tggtgagtta tacgtcagta aaggtgtttc tataggtgtc tgtcacacac atatgcatac 22801 atagatgcac aaacacacaa atacaatgct cttttgtgac ttctaaaaaa gtttaatcac 22861 aaatggaagt atttacactc agccaggaga atctgtaaac atgccctttc aattccattg 22921 tcaaacaaga aagaatggtt tcaaggcact gagancagag tgagtacgac atggaaaccc 22981 aaccctacct tgggagcagg ggctgacaac ccaccactag gatgtaccct gtgtatctgt 23041 agtgctggct ccctgcacac agggcaaggt cccgtccagg accatctgca gaatttgcaa 23101 ggcccagtgc aaaatgaaaa tgggcctctt atgcaaaact tattacaaat ttcaagatgg 23161 tgaccacaaa gaattaagcc aagtgtttct ttacgctttt aagaggtatt tccttgctag 23221 aactggggca actcccatgg cgttcagtga gaatcttcca gaagatgagg gctggtgtcc 23281 tatgccttaa gagctaagag tggaggtgct ggccctcctc tggttcttca acctgcacca 23341 aaggggcatg tcccaggtgg tgaccactgg aatttagagg taaaggcagc atttgcctgg 23401 agcagagaac acgaggtgag gcgggatgac tctacaccac agccatagat aagaggtact 23461 aatagctcag tcggtctgaa agagacacac ttaaagtcat aaatgggaag acgtgggtgg 23521 aaagtcttgc agcaattgga aagccattcc accaaagacg tgttataact taaaaatatt 23581 ccaacttctg aatgatttct ttatacacct gcttaagtcg gtgacattat tcagccttaa 23641 cactttaatc caagttttat tttgtcacca aaatgtgagt taagctaact taatcatttg 23701 ctagactgac tggcacagga atgactaact cttggtttgt gtgaagacta gtggttcagt 23761 cttctagtta gcaaaataca taagccgatg cactgaggat tattcagttc catgcatact 23821 gttggaatca ttttgtgatt cagacaaaaa cacacaacta taaaatcata tatctgtaca 23881 cgtctgactc agttcagaca catttattta gggattatca gttttacaga atacaacggg 23941 tcgttattac tattttggtt gttcagactt gcctaaagca cgtcactgtg cagtttgata 24001 tcatggtgga ttttcaagcg gtttataccc tcccagagta ctgccttata agggaaaatg 24061 actgtgtcag ggtggcagtg tgctggacga gggggcagga ggcctggagt gtggccccag 24121 cttcgccagg aaataggcat ttgaccctgg gcaaggaatt gatcctccct gggtctcagt 24181 ttctgcatct 
tacaaaccag agttggtctg agccatcagc tctctgaagt tttttccagt 24241 tttaaaatcc aatgtttcat tttctcgttt ctgaccacca aaggcaagaa aaacatagtc 24301 aaagacacca tggttcccat tgacagcact gggattgatt ttataccttg ttgtcaacca 24361 aaagtgccag tgcagggcag cagtgatcag cgatgcccgc aggagaccaa gacttccagg 24421 gccaggggac ctgctactta tttatcacac acttaatgac acgccagaca ctgttctaag 24481 tgcttgggag acattaacag agggaggccc cgaggggccc acaggtggcc tgctagggaa 24541 tggctgggcg ggatttgacc ccaggcagtc tgggcccaca ggctgccctt cagaggtgaa 24601 gacggtagcc ctagctgggt aaccatgtct tcatgcccac agggagtgca agggacaaag 24661 gaggctgaca aggagcagaa ggaattagaa aggagcatta gcaacgtttt tgaccaccag 24721 cagaataccc tatctgccat tccaaggaat ctcacagaaa agactagaca ctggccaact 24781 cttgcataat ctgggtccac cctgatggag caggaggcag ctccacggcc ccacatctgc 24841 ctaggattct tggtagctgg gccagcctgc aaatgggcaa ggttggttgg attcggaggt 24901 gggcggaagt gtgggttaag cttatcaagt ccttgacccc aacatatatc agtgaaatag 24961 ccaccactgc cagacacccc agcaggccag atgcaggcgg cagcttacac atgaggaaat 25021 gatttctatc tctaggttca ttcttttctt cctccactcc ttaccaaagt ctcactgtgc 25081 atgtgatttt aactgaactg tgcacttcac ttttacttct tgagggcggg aactaactaa 25141 gccttaatca tatttgtctt ctcagcatct gactctgtgt ttgctaagtg aatgaatgat 25201 gcacttcgga ccaaatgacg tctgtttaac ttgcaactgg aaaccaccac aaactgcctg 25261 ttgactactg atttggattt taaggtaaat acagacatgt ttctaaggag acgtttttgt 25321 gtgtgttttt tttcaaagga tgacttcgta agctagctaa tcatttctca taattatgtc 25381 tggattctaa agaattcttt ttctgcattt tactctctag atatttcatg ttcgtaacac 25441 atatattcta ctgtaatatc ccttttcctc ctctcttctg tgaatgtccc tctttcttaa 25501 ggggaaagtg ttaacattgc tcataactca gtctgccata gcgcatctga tcactccacc 25561 ctgtgcacct tctgactttt aatttccttc caacaccctc gacttttgat ctccaagcac 25621 aagggctata cttttattta ttaggtttaa tttactgtaa agctcattgt tccctgtgaa 25681 tcgcacgata acagtcaatg aataatcact ggcttcagcc agactgcttt ttaattctgc 25741 tgagtaactg gcaagccaac tagtattaag gacaaaaatg taaggaaccc tgatattgcc 25801 ccagaactag gataagccat tttattatga acatgtgact gccctcgtgc agaggtggat 
25861 cagctacagg gcaggaccca gggcaagggg gctgcttggg gcagcctcag actctttgga 25921 aggggaaata gggaggacca gggacctggg tgggaggcag gagcttaggc ttacgatgac 25981 cccagcacac aggcccagct tcctcttccc acagcctccc tgccgtctta gctctgccat 26041 ttggtcttcc ggccgggatc cctttcccaa ggctttcaca catttaaacc cgcgaggggc 26101 aggacttttt tttttttttt cgtgtatatg atagtttttt caaaatgttt tctataaaat 26161 cctccctgag gagagcttgg aaaaagattc ttaatagcta gatcattttg agtccttctc 26221 taccatgtac tcaacactct aatatattct gcacatttgg ttctatgtca tttgccatag 26281 tcttaaggac gtaggttcca taatcatccc ttattctgta gaggaggccc aaggttagtg 26341 agcagaggaa ggttccactg tcgggaaccg gtggtcttca aactcctgtg ctcctctttt 26401 gatatctttc tgctcatata tgttggaaac agcccagcat gagcaaagca ccacctccta 26461 gcccccagcc tgtgccagcc ctgcaaagag gaaatatggc tccactcagg gagcccatca 26521 gcgactcagg gagccctgga gcacctggca ctcagcaccc atgaacacga gggcaggcat 26581 ggtgggacaa ggcagtctgc ttggggcaag tgagcttcgg gaagtagctg agctttgggg 26641 agcagctgtc tatacccagc ttctcagggc tgtgcaagta gcctgaggga gaaagaccct 26701 atattagctg aaactgcagc ccagatgagt cagtattttc tgctacggtt ttttaaaaaa 26761 atagtaacat actcatgtct ccatggtaac ttactatgga tctgtattcc atttatggaa 26821 ctctttcttg acctatctgt gttctctaat ctgggcctga atatacactg agaagccatc 26881 agaatcaatt cagtctccaa gaaataatca cttcccctat acgtgattgt ttcagccgta 26941 tagattctga ctgtgactca tgccacggtg atacccggag accttcgggg accagaaaca 27001 ttaaggtgca ccgatatctc aattatccgg ataactctag acctagagtc accatggaaa 27061 cctagaaccc tgagtcacag agaagctggg ccattcctct cccttcccgg tggtgccgca 27121 gttcttgaga tgtgagcagc tttcagcccc gtggcctcgc ctaagctggg gtttcctatt 27181 tcatccacaa cttcaggaat ctaaactttc ctgcaaaata ttgagcatgt gacctgagtc 27241 caaacagcca tggactcaag caccttctga taaaaacgga aaccacagtg tagggggtgg 27301 ttttgaccag ggtttcggag tcagataaac tcaggtttcc accttgaatt tctcatcaat 27361 ctgacgtggc aggttattca tcctaaggtt cagttcccac ctgtgtaaag tgggagccgc 27421 aagtcctccg agagtgacga tgatgtgcgt ggagtgccca gcccagattg aagcgcggcc 27481 aaggcgggtc gctatctggg caccgctcag ctccagaggg 
cgccactccc gcggagcctg 27541 cgggatcggg gcttcccggg agcagcgcga tcagcaccac gactcgggga cacagccagg 27601 gcccggtttc tacaggaagc gcctcatttg gagccttttt gtgatagaat gatcattagt 27661 cctaagccca ttcagaggtt caagaatggg gtcggctcaa tttcagggcc ttattaccca 27721 agcccggctg cccttcggtg ccaccagcac cactgctccg tcgctgcgga attccaaagg 27781 caggtttggc gttagggcct tggccccaga gaggacgccg agcgctccac ggaaagtctc 27841 cgcccggctc ccagggcgca cactcgcgcg cacgtggggc cgaggccctg ctcccggggc 27901 ctcagggcca gccggcgagg gacccagccg agtgacagca ggaggcggag ggaaggttgg 27961 gccggaaggt gtcagccccg ccccgcgctc cctcgccgcc tccgccctcc tcttctctcc 28021 cctcgtgcgt tccctctcct ctccctccgc tcccccaacc ctcctgctgc ccccttccct 28081 ctcctcccgc ttctccccat cctgcctacc tcccttcccc tcctcttctc tctcctcccc 28141 ttccctcccc tctctcttct ctcctccctc gtttcctccc ctcccctcca ctcggccgtc 28201 cctccttcct cctccctcct ccctcctcct cccgctcctg aagagcgcgc cgcgtggggg 28261 acggcccggt tacttcctcc agagactgac gagtgcggtg tcgctccagc tcagagctcc 28321 cggagccgcc cggccagcgt ccggcctccc tgatcgtctc tggccggcgc cctcgccctc 28381 gcccggcgcg caccgagcag ccgcgggcgc cgagcagcca ccgtcccgac caagcgccgg 28441 ccctgcccgc agcggcaggg taagagctgg gcccgcagag agcgcccggc gcgcggctcc 28501 agtcccatgg agggtcaccc ggggcctggg cgggggtcgc ggggggcact gacacgcaga 28561 tctcggggcg ctgccggggg tgcaggtggg ggtggcggct gctgcgagga ctctaggggc 28621 gcgcgtctga gttccgcgcc ggctcgtttt ccggttatgg agtggcctcc ggggctggcg 28681 gggtcggccg gggggttcct gcgtgctagg gccgctgtct tcggggtcgc ctagcggcgg 28741 gcgcggccag ggcgcgctgg cttgtttcgc tcgcttttgt ttttaaaagg aaacgcaggc 28801 ctggtagggg gtcctgccca gtggatgtcc cggcgaacat gatttcgcga acgggagtgg 28861 gggcacagga gagcgtgtcc gaggtggcct ggcgccccgg ctttgagggt gacttcctgg 28921 agcggcgccg ggcccggagg atctggggcg cccaagacac ctgaaggctg cggcaccgcg 28981 ggaacctgcg gggcgcgggg tgccatggtc acctgctcgc cgcgtccagg gcccgggctg 29041 gggacccctc ggtcgtgcga ggagagcgtg gggaacctgt cggaaatgag atctggttgc 29101 gctgggctgc ctttattttc tcgttcctac agcatttgaa tgaagaggta actgagtgtt 29161 tgcttgtgtg tgtttgggtt 
gcgtgtgtgt tatcctattt tattttttac ggcaggagac 29221 cttttatgtt agcctgtaca caatttcagg gtagcctaaa acagtagttg acgcgctagt 29281 tatttagaaa gtaaagaaat gagcagtccc ctttggagac aggagtaatt tattttaatt 29341 cgccagtaag agattatatt tgttccatac aatggaagcg cctgtgcgtt atctgcctgc 29401 ccacaccacc accaatcaga gagataaatc tccacgttaa gctttattaa aattctgagt 29461 agtagcgcag cacagtaaat gttagcaccg accatccgca gggaaatgct cagatactca 29521 tttctgggat tagtcattca tctctctaaa ctgatcgctt tcaaagcgaa aaaagcaaac 29581 taccttcaaa tgtgtgttac aaggttgtgt tgtttctttt ttttgtttgt tttcaaattt 29641 atctaaataa tcttacataa gtgattactt gttccaataa gggtactgtt gtcaggcaaa 29701 ttctccttgt ttttaaatgt aaccattctc agctgttaca tttctgctct ttatcctttt 29761 tttttttttt ttttttggtt taaaagcagc gatccatcag caacaccaaa cttgaaattg 29821 atttatgtgg aaaaacttgg cttgtctgcc atctaacaag ccctgttgag taaaataagc 29881 aagcttaaat ttgattagtt gtgtgtctgc ctgaaaccat ttagtagagc actttaattc 29941 tgcatggttt ttacaaaact cattaaaagc tggacaacaa aagaattcta ttctgtagct 30001 agtaattaca gctctttatg tgggagtggt aggctgcctt ttctcctggt atttgtacac 30061 aaaagctggg gagagctttt ccactgcctt ccacattcat tctctgtcca tctcagagaa 30121 gctgaaaaac agaatcaagt cagcaagcag tctgcaagcc taaggaagaa aaagagctga 30181 cccatccggc tctggatgga taactggcaa cacccaagat gtctatgatt ttggccttct 30241 gtttgaggct tgtttatttc tttaatcagc agctaccagg acttatttca tgcagagaaa 30301 tgacataaag ctctagagga aggagcattt cacattggaa ggagtagtga gaagggcctg 30361 atctgataat atcaggatgg gtgggaaggg tccattcctc atcattcatg ttctcccggc 30421 acagaggact gtgtggagga ctcaggattt aggaccatac tcaactgggg tttgatctcc 30481 actttgccct gcggtggccc ctctggtagc tatgtggatc ctgggcagat tgtccatgtt 30541 gctgggcctc cgttgcttca ctgtatgacg tctgcctctc agggatgcag gaagattcaa 30601 ttaggtagga gtggatggtc ccccagcacg gggcttagct cttagtaact aggcattctg 30661 gcaatggcag ccatcaatag gagagactat tgattgttga gtgaatataa gtggcataat 30721 ccaccaagtg gagattctga gtgaatgtgg ccactttgga cccaagactg ctgcccctct 30781 ctgtcctctc ataggactaa tatttattgc cagtgctgac cagcctttgt tggctaccat 30841 
gatcattacc tgcactccga gtgttttaac agaattttct gatgcctctg gcctcacctg 30901 ctgcctttta gccaagaggg gtgtgtgtgt atgtgtgtgt gtgtgtgtgt gtgtgttggg 30961 tgtgtgttgg gtagcagggt tcttggaagg tctggtttag ccataatgtt gatcataaag 31021 agaaaacaag aaaataaatg caaattaaat agttcggtga gcctttgtta tttcgccaca 31081 tccagtaagt tctatcagtt gaatgttgtc tgaattactt aaaaagttct cagttgtaaa 31141 caggtaagag cacccagtgg aggccatgga gctaggctta gagctttgcc accagccttc 31201 agttccatag ccatgaacct gggagggcaa ggaatcaccc tagaggtgaa ggccattcat 31261 cggtgctaac ccaaggtctc tgaaaatgag ttttagttgc tctgttatct aagtatcttt 31321 cttactgata tgtaacttac ctggctgaac aattacattt tagagctcca aataaatttt 31381 cagcctatta tgtcattgtc caattacaag cctcattgta atagccatgt taaaaacaat 31441 aaagccggtg aaattgattt cagtgtattt caaattaacc taatatattc gaaatattat 31501 gtatgtataa aaattattga gcttttttct tttttccact aagtctttga gagctagtgt 31561 gtatgtgtgg atttgtacta gccactggtt gagagctctg tagccacaca tggcaagtgg 31621 gtaccatatt gcttagccca ggcctagtag aagattaaga ctcagtcaga ggaagtaagg 31681 cagatccgtg cttggaatca gttcctcttg aagaagtgca cagccaagca acacacaaag 31741 cagaaatatt ttgagctctg tattcaaagc aggaagatct gggaccatgt aagggggttt 31801 ggtcttcagt tctccctttg atcccttccc tgggtgtgtg ctttcttaca gcctggagag 31861 tggaagggaa gagagggggc ttcctggact cctttcctgt ctctagaaat gacccctgca 31921 gctgtcctga agctctcagg ttaagctgat tgtgtccaga gcagacagga ggcggcccac 31981 tgccccactg tacccaccca agcactcacc taagctggcc atggaaatgt gaatgaaaag 32041 cctgacccag gccctcttgc cctgctaacc cagcaaggag gctctgacca gccagccagc 32101 aagtgacttt ttctgtgaca tctaagtgga attcagtgag gaatcagcat tattattcaa 32161 tcactgaaac agcatatccc gttgtgatac tgagaaatgc tttcctcttg agacctttgg 32221 gaggacagcc atttaaacaa gaattccctg cattcacttt tgaattacct gttagctgta 32281 gaactcagcc aggatcaagt cttgatgcca ccaatgatga gtttagtgta cttaatcttt 32341 tgcctctttg actttttttt tttctttttt aagatgaatg atttcggaat caagaatatg 32401 gaccaggtag cccctgtggc taacagttac agagggacac tcaaggtgag tgggcaagtc 32461 ttaatttttt tttttaattg gaaaactcga tctctaggag gaaagaaaaa 
aaaggcctgg 32521 gtcccagaaa actggttgta gccctagctt cttaatttat tggccatgtg actatagact 32581 atttgcttac ttgctctggt cctcagtttt ctggtctgca gaatgggtat cttgatgtgt 32641 gttgttcttc tcctccacga tatgaaacca aaggccttta tagcttgttt tctgaggctc 32701 cttccaggtg aaaatcctat aactcccaat aaaacagctg gagaggctgt ctctactgcc 32761 ttttctgtct acccaggagt ttccttggtt gatgggaagc tgtttaagaa ctgaaatgag 32821 agagaagagg cagacagacc caaccgctac agctcctatc agaaagggca gcgcccgtgg 32881 aattgtaggt agcctgttac tggctcagag aacaagcagg cctacttatg ttaaaccaaa 32941 aaccagatga aaaatgaaag cgccctctct gctgcataaa cattttatgt agcttctatc 33001 atcattcctt ttgggaacat tctttagtag cacaaactgg taatctccca cttgtcttct 33061 ccaaatatga cagcaagatt tgattgtttt cagtattatg tagtaaacat gtttcagaag 33121 catgatttta aaattggcct cctcaaagtt tagcgtcttg cataatgatg atgtacgtct 33181 ctggcatatt acattttcct ttgtatatca ttattgaggt tatttgtctg atatgaccca 33241 aagaggcaaa actcagcaca gtcctttctg cagtattcta aaggtcatca aacttcagcc 33301 tagtgagtct gcttgtttga tttggccgga cattttaagc atggcagaag tggtacaaga 33361 aatcatggta ttaagttgaa accacacccc ttagaaaaat ccttctatta attcaaataa 33421 tttgacgatg cttatgcggt ttctgaaaag aagcagtcgt tgctgaaatt gatgtgttga 33481 aataggaagc acagatttgt attgtctttt ggcttcctgg ctttaaaaaa aaaaaaaaag 33541 aatttacggt taggaaaggc atttcttagt cacctgggca atgcttgtgg gaactttaaa 33601 agtttattgg aggtttcacc aaaattagct ctgctctaaa gtagtttgta aatccagatg 33661 agtaatttgc acttgcctga ttgatcggtt ttcctcgtag atctcaggca gttggtcatc 33721 attgacttgg atttttttgg gctgggctcc agcctctgcc tgccctcttt tgtaatgcgt 33781 cattcattgt tctgcatttc taaaaagaaa gcgggtaacc gattttattt gtaaatgtgt 33841 tacacatttt cttattgatg actcaacaga tgggtcattg tattacacat aacatgggat 33901 ttccagctta tgttggaaaa atatagtctt tcttcacgtg ttcctcaaca atttaggcga 33961 tgatgaatcc agatattatc ggttactttc aaatgagtct tacattaaaa ggtatttctt 34021 tcataaatgt tatgtaagat gctcagaaat cgagttagtg taaggtttta tctttctgtg 34081 tgtctcttct tttctgcaga tagtttagtt taactttaaa cctatttcct ttggaaatct 34141 ttccccttag taggtgattt gggttttttt 
tgttttgttt tatttgtttt ttcgtttttg 34201 gttttaagca acagggtcct tctctgttgc ccaggcggga gtacagtggc ttgatcgtag 34261 ttcaccgcag acttgaactc ctgggctcaa gtgatcctct ggcctcaacc tcataaagca 34321 ctggggttac aggcataagc caccccgcct ggcctggggg cggggagggg gttcattttt 34381 aagacacttt acttgttata aaatgaaacg ttaaggaatt ttgtcatgta ttactatttt 34441 ccaaaccatt ggaattgatg tggaaaagca tctctagttt cagttaaatt gtttcatggc 34501 aaacatttca aaataaatgc ttccaaagat aaagtagtgt atttcattgt ttgatttttc 34561 agtgtttatc gggtcatttg ccttttaact gcattgtgct cagacggcct tgtcaagggt 34621 catttttctt acacacacag cctagttgaa tttttctcaa tgtaacagga gggtcaggtg 34681 tttggtaaag gggtctctga ggctcctctc gacaccattg taagattctg ttattttcag 34741 atttgtgtat ttaatatatt aacttaagtg gattaggcca ctaaaacaaa ttcccagtca 34801 ctccgactct gtgaggttac ttaaataaat ccacagtggc cgggcaccgt ggctcacgcc 34861 tgtaatccca gcactttggg aggccaaggc gggtggatta cacgaggtca ggagtttgag 34921 accagcctgg ccaacatggt gaaaccctat ctgtactaaa aatgcaaaaa ttatccaggc 34981 gtggtggtgg gcgcctgtag tcccagctac tcgggaggct gaggcaggag aatcgcttga 35041 acccaggagg gggaggttgc agtgagccaa gattgcacca ctgcactctg gcctgtgtga 35101 cagagtaaga ccctgtctca aaataataat aataatccac agtaaaatac aatcatttca 35161 gtcactttga ccattttacg agtctgtttt aaggactgac tggttgattc catgtttgct 35221 tcaaaattgc cagtttaatc agggactatt ggtaggattg cccacagacc acttttacaa 35281 caggtaattc aatcagttta aatattgtat ttccattttt tttccatctg tttttgcagc 35341 gccagccagc ctttgacacc tttgatgggt ccctgtttgc tgtttttcct tctctaaatg 35401 aagagcaaac actgcaagaa gtgccaacag gcttggattc catttctcat ggtaattggt 35461 tcctcagact tgacaaattg tgcatgattt tcctaagtag ttcagttaat aaagagatga 35521 cagttcctaa gtgagaaatg ttggtatctg gaatctttga tatgaatagc cagaagttct 35581 ggacctggga gaaggcagtt tttcatttgt atcaacttgg tgattctgat gtggattctg 35641 atgtcttgag tgccagacgc cccccagtga gccgctgtta ggaaactgca ggggaacccc 35701 tgagctgccc ctcctggatg ttttcaacat tggcaagcag gtcctcatct ccctgctctg 35761 ccctggaaaa cagacagaag agtagacatt aaaacacacc caaacttgta atttgggttt 35821 gcatcattag 
ttaaatcatc tcccctgccc tgagttttgg ggcaacaaat tgttctttgt 35881 aatccttacc catcggtagt tctttctgta gttcttcgtg tcattgttta ttcttcttaa 35941 aggaatagca cgtcagataa tgttcataca attactgtga cagagctatt agttggaaac 36001 atgccagaga tgactgtgtc atgtgaagta ggtgctgaga atacaggcca gaaaattaat 36061 taaaacgcat acactggcgt aagcataacc ctgcatggca gccatttatc attcttgaaa 36121 ttgctgaggg tgtctggaga agattgtttt tcaataaaaa tggaaggtgt ggttccatta 36181 ccgtgctaag tgcggcacat tcatatcaca ctcacttctc agctctacgc ctagtttaat 36241 ggaagtgact tccaaatgat acagtccctt cagccttttg accagaaacc atgcaactgt 36301 tttcattgcc tacaatttta aaatagactt agcagttttc cctacgcaca actgctttgt 36361 gacgctttgc aaaatagaat caaatctgag ttttatttta gattttttta ataagtatag 36421 ttctgagata tatattttct caattatagt cagaaaagtt ttttgttaat agttacactg 36481 ttttaaggaa tcatgccaag gtttgagatc aaaattgttc ttttccaaaa actaagatgt 36541 ctctcctaaa tctccacctg atatcaccaa cttgaagtcc taatgtcccc atggggggtt 36601 tccttccaga ctccgccaac tgtgaattgc ctttgttaac cccgtgcagc aaggctgtga 36661 tgagtcaagc cttaaaagct accttcagtg gcttcaaaaa ggaacagcgg cgcctgggca 36721 ttccaaagag taagtactgc tttcctgagc ctgcactggg tgagaagaac caacttcagt 36781 gcagttgttt gatcttgact tgtttattaa gcttttgctt ggggtattct gcaaagagta 36841 gcatggatgt cgttaacctg agccagtttc tgtttcaccc caatcaaccc aagacttgat 36901 ccagaactca ttaaatatgt agtagaagga cgcattactg gtgtcttgaa atgtgcctgg 36961 gtcgtgtcag aatggtgacg tgtcatcatg gtatcttgct cattcgtggg ttctggtgta 37021 tgtcggtact tggtgcataa attagggatg acagtggtct cactaccttt ccactgtttt 37081 tacctcatgt gttccatttt ttcttctctc ctggtagacc cctggctgtg gagtgagcaa 37141 caggtatgcc agtggcttct ctgggccacc aatgagttca gtctggtgaa cgtgaatctg 37201 cagaggttcg gcatgaatgg ccagatgctg tgtaaccttg gcaaggaacg ctttctggag 37261 ctggcacctg actttgtggg tgacattctc tgggaacatc tggagcaaat gatcaaaggt 37321 accagctgaa cgtcttactt ctccttgtcc aggatgagct gtggccggga agactgattg 37381 ggaaagtcac gtgggtgttc ttcaacctta gggttgccac ttgaaatgac atagagtacc 37441 ttgcctcaaa atgccactca agtgagtcag atatatggca gtgattatag atttttatcc 
37501 cactttatgt gagtgtgtgt gtgtgtgtgt gtgtgtgtgt gtatagcatc aagtatagcc 37561 acaaggtagt agccttagtc actaaattgt ttgctattgc tggctgtatt catcacggga 37621 gtccatgttg actcagatat gtaggacagc aagaattccc acgtcttctg agctcacctt 37681 acagagctaa gagatatggc agccttaatg aaagggggga gcctttactc acataaaccc 37741 tgtgaattct actgtaaatt ttcctacatc caaatggaat gtattttgga tgttgaggat 37801 tgttgagggg gcggggtctg tttttaaaca agtgagaaat tgatatatat ttacacgcta 37861 tcatattgta cagaaactgg ccattgatca gtagaaatct caccaaatac agagaaagtc 37921 ttgattcaaa acttaaaata gcagggagac caatgtttaa aacatgttga gttaaaaaag 37981 aaaagagggc caggtgcggt ggctcacacc tgtaatccca gcactttggg aggcggaggc 38041 gggcagatta cttgaggtta ggagttcaag accagcctag ccaatatggt gaaaccctgt 38101 ctctactaaa aatacaaaaa ataaaaatat tagctgggca tagtggcaca tgcctgtaat 38161 cccagatact cggaagcctg aggcaggaga atcccttgag cccgggaggc agaggttgca 38221 gtgagtcgag atcacaccac tgcactccag cctgtgggag tgagactctg ttgaaaggaa 38281 ggaaggaagg aaggaaggaa ggaaggaagg aaggaaggaa ggaaggaagg agggagggag 38341 ggagggaggg agggagggag ggagggaggg agggagggga gggagggagg gagggaggga 38401 gggagggaaa gaaataaaat acagtaggga ttatttaggt taaaaaaata taagaaaata 38461 agcatgttta tagaattcag ggaactagtc ctcacctgac tgattttaaa cacactgaat 38521 ttcagcatca actgtagact cagaatctcg aaaacaccta ggattcaaca gctaactttt 38581 tttagttacg gcgtctgagg gccagggaaa ctcattttta aagaccctca tttcttctga 38641 acttggacag ggcttttcta ccaagtttct ggtgatctgt ttattggcaa attgggcctg 38701 ttttaccatt cctcgggcct gtgagtttgt gtgtatttgg gtttccgggt ggacgtgcac 38761 tttgtaatat attcgtagta agtatgcata cacagtgcac aaaccttttc cctttcaaag 38821 ttcttgggca catgtgttcc ccatcccaga ctcccacagc cctggagact tcccacaccc 38881 tggggaagcg tgcaggtttg gaagcagtgg aactgtaagc ttcacggggg cagacaccat 38941 ccattttatt aatttcctgt atgcagcctc tggtacagat ggcatttcat aaatacttaa 39001 ctgtttcttg aatgcatgaa tgaattcaat acaccatctg gaccttttgc gatagaaaat 39061 ttagtgttgc caatgacatc attcattttg gaaaatttat tttagtaatt ggaaaaatgg 39121 gttgctttct ctcgtttttt gaaaaagaca ggcagggacg 
tggagcacca tgagattaaa 39181 tgtgaagcac tccgtgcatc tcatcattct aaaattatca tctcacagga attcagaaag 39241 tcaatttgtt ctgtgtagtt ttcacctaca tctgccatct tacgtctcct gtgtctgctt 39301 ttcagaaaac caagaaaaga cagaagatca atatgaagaa aattcacacc tcacctccgt 39361 tcctcattgg attaacagca atacattagg tcagtccgat tgattctgcc cttaagaact 39421 ttgtcttcag tcttcccagt agacttggaa tctctctact gtaggctcct aaggggccac 39481 ctagacctgt gtgtctctta tggtggccac acgtgggtgt tgagctataa gccattgaaa 39541 tgtgactggt ccacatcgag atgtgtagga agagtatagt catgcacacc agatttggaa 39601 gtcttagtat gaaacagaga atataaaatg tctccctgat aatttttaat gtattgatta 39661 catattgaaa tgataattgc tatatgaggc caagtaaaat aagaaaatta acttcacctg 39721 tttctttttg cttttttaat atggccacta gaactttcgc agttgcttaa gtggctcaca 39781 cgatcttcca taggtcagca ctgttagtgc tagtccaggc aattcttaga gcaaggagca 39841 gaacgaagtg agctgattac cgcagtcacc agtaatgttg gtttagtgta gcaaacaaga 39901 ggcggcaggc acagagtgcg tgaaaagcaa tttgtaaact ggctctgaac ttgggagctt 39961 aaagagctgt catggaatct gggtcctttc ttctggttgc tgatactggg gagggcacag 40021 ggtctggcca ggccagcgag ggaggggtcc tcggaacagc aggcttgggt ggtgttggtc 40081 cggcccagtg ttttctctct ggccccagct ccaccattca gacccggtga agcccacctc 40141 tctgccccac tgggggcttc ctgctcagga ggtctttctg aacatgacag tgggacaccc 40201 tgtgcctgcc tgaacctgaa tcagagtcag agagcttcct tccctctcca gaaaataaat 40261 tattctgcaa tgaacagaga caggagcggt gtgatgaagg agcagaaagc agagtgggct 40321 gtgcctcagc acacagggtt catggcctgg ttctgccctt atgaatcaca cagtacatga 40381 cacgtcactt ggtctgtctg agccttactc tcctcggaag ctaaaagagc cagtgagccc 40441 tgtcttcctg aaggtcccat gaaagcacta gggttgtaga aataggaaca ttgagtcctg 40501 cctgcagagg ctgcagtcat tttggcagag ggctgctggt tgagaagctg aatggtttga 40561 gaagaaggcc tggtgacaga cccccaccaa agcccgggtc tccactgagg ttctgcagag 40621 ggctggagtg tgcggagtgc tcacctgtca tttgctcgta ggggttagtt actggggtaa 40681 cactgacttt aagagctctg ccgtccgatt gttctgttcc aggttttggc acagagcagg 40741 cgccctatgg aatgcagaca cagaattacc ccaaaggcgg cctcctggac agcatgtgtc 40801 cggcctccac acccagcgta 
ctcagctctg agcaggagtt tcagatgttc cccaagtctc 40861 ggctcagctc cgtcagcgtc acctactgct ctgtcagtca ggacttccca ggcagcaact 40921 tgaatttgct caccaacaat tctggtaaga ttggaagcat ctttcaacaa ggctgttgct 40981 ttgattctga gaaccccaga gccataatga acctcttaat aaatacttcc tggattcagc 41041 cattagagaa gggggtcaaa gcccatgttc taagtggatt tccaacaagc atacccctaa 41101 tatgtttcag gcactggccc aggtactgca ggcattgagt aaacaaaaca gtccctgctc 41161 ttacagccag atttctccac actttgggat atttatggag gcgatgcagg aatgtgacat 41221 ttcttcttat ggagatgtaa cataaataca gtgcacaagt ctcaggtcat gtggctggtt 41281 ggatttgtgt atgtatgtat acccatgtaa taaccatcca gatggagata gagaagtttc 41341 cagtagcctg gaagattcct tcgtacccct tcccatcaat aatgtccccc cgacagaagg 41401 gaacgattct gacttttatc attacggatt aggttttgat tttcttgaac tttacataga 41461 tggaatcata aggcatgttt tcttttgtgt ttgtctttta ctcatcatgt ctgggattca 41521 tgcatgttgt tggatgtagc agccatgtgt tctttttcat cgctgtatag tattccatta 41581 taagaatttt attcattcta ctgttgatgg aagataggat ttgtgtagta tagttaccct 41641 aaggaataag cacctgagta gagagcattt tgagaagtct gagtggttct acaccagcag 41701 gtgcatgatt gcactggtag tggttggtga tgctatggtc gtgcccaaga ccaactgtgt 41761 cctggaggaa tcaaagtatg tgtttggttg tctttgccag ggacgcccaa agaccacgac 41821 tcccctgaga acggtgcgga cagcttcgag agctcagact ccctcctcca gtcctggaac 41881 agccagtcgt ccttgctgga tgtgcaacgg gttccttcct tcgagagctt cgaagatgac 41941 tgcagccagt ctctctgcct caataagcca accatgtctt tcaaggatta catccaagag 42001 aggagtgacc cggtggagca aggcaaacca gttatacctg cagctgtgct ggccggcttc 42061 acaggtgtgt gtggaactcc gagagcctgg ccgcccagtc tcctgggtcc tgtcccttgc 42121 ttcctttcga gccacagtac cacattcacc gagggtgttt ctaagctagg tacaccagtg 42181 ctctacactc catgttttat gcgtggcttg ctgtatctct gaattcgaca aaagccacac 42241 ttggaggaat tttcatgtac acagagctcc atggaatgag tgaaaatcga caggccactc 42301 cctccacctg ggacgttctg aggaatagtt ggagcagggg acatggcctg tgcagtcttc 42361 agccctgact cacccactcc aagaatctca aagaatgtaa actggcttag ggaggggcag 42421 cagagccagt ctcaaataat atgggcactg tttacctttt cttgcttcta aaatctatct 42481 
gctcttattt ttgaggtttg ggtgagagct attctaggca gtggacaaat ataaagctgt 42541 ttttagaaac aggtatttcc agtactttag cgaaaatact ggggctttgt tttcactgct 42601 tcgtgtatcc ttaaatgcgt ttttgttttt gtttttttaa ccatatggag ccactgtgga 42661 ttgaaattac ctaacttaaa aaaatctgat aagtttaaaa taaggatact gaggcattca 42721 ttctttttct gtcatctaag aacaatcagc cacgtaaacg tattaaaata cacggaaaga 42781 gtgtgtgaaa gccattatcg gtttcattat tgggagttta gggcctgaga agaataaaca 42841 gccataacgt tagtgtggtt cattttaata gtaacataag tttctgttta ttgactgcca 42901 actctgtagt taagtaggca tacagtctgt aggacagcct ggcatggggg cggtatgagg 42961 ccattttaca gatggggaaa ttggctcaca ggcagacagc tatcggtgct cacacagctc 43021 tggttctaga atgcctgtcc cgtgctttac agcaaagctt tattgataac ggccattgcg 43081 tttctgagga agagctattt ggtagaaaga ggttgggctt agggtcggaa gatctaggct 43141 cttctcctgt ttgctgtgat gggcgtgtga tccaggacaa ggcacctggc ctctgagccg 43201 catcttatcc ctcctcttta gaacagaacc acactgacac ctgctttgca aggtggtagt 43261 tctaaatgtc aatcgaggac aggtggacga gaagctcttg gcaaagcatg cttcaccaga 43321 cagaggctgt gaggcaacgg gtgtgacccc gcgtggctat tgagtaaacc gggctcacat 43381 taagcaattt acacatggtc atttggcagg aaggtggcat gggatcccac gtatgacacg 43441 cctcattctg tgatgaaaca ccccctcagc cacccccatc tctgccccac tgggtgtttc 43501 tagaaaaaaa ccttgaaggt ttttacataa ggaacaatta aaccagatgg ggaattttaa 43561 tgccaagagt tggtggccct aaactttctt ccagaaagaa caaacctagc aatcaagggg 43621 aagagtgtct cgcctagtga ccttattttc taggagtagg cagtgtggag caggaactca 43681 catttggtgc cccgccccat cccgttaaag cacttagtac tgtcaccaac accttgagtg 43741 ccaccctgag tgtaacatcg gaaccccatt cagagagttg ggtctgcatt cctaaatcag 43801 catgtacaat taggatggtt aaagacttgc tgtattttac atctgtgaaa gggtatgatc 43861 cgtctccctc cctctccccg caggaagtgg acctattcag ctgtggcagt ttctcctgga 43921 gctgctatca gacaaatcct gccagtcatt catcagctgg actggagacg gatgggagtt 43981 taagctcgcc gaccccgatg aggtatggcc agagccctgg gaaatctctg ggcttgaaaa 44041 cctgatttcc tgcttgcatt caaaaactca gttctttggg cacaaaaaag ggttcaccag 44101 tactgctgag aatctttcca cgtgaggcat ccttggctgt tgggaaaatg 
gaagtggagt 44161 cattgctttg ttgataaacg tgtacagtgt tttctgggta tgcttcatac aagggcgttg 44221 cacagatttc actgtcttga tcagttgtct gatcaagagg cccaagctgc aaactgaggt 44281 tctctgccga ccatctgaaa atgccttagg aaggctgccc ttgtccatcg ggggaacaag 44341 cctcatgtgg ccccgagcca gccttctgtc cactccttag ctgagcccag cagtgttgac 44401 cccagccctg cagagttaga accaggagcc ctgtctaaag gagaaggcca agggcaggtg 44461 ggataccaac cccgccagaa aagcgttaga gagaagccct tggtctctgg cactggggag 44521 cgtaatctgt caccattccc accccgcctc cagttccctt gggagacaag cattgaggaa 44581 tgagtatatc acacagctca gaatcccact tggcacctgt taacttcagt agttggaata 44641 tccagcctgg agggtgggtg acaccacctt tccctggtct caggaccctc ctggctctga 44701 acccttggct cccaggaggt ttcactgagc tggggccagg gagcagggac ctcattcccc 44761 agtggttctg ccccttgggg acacagtgcc cctaccatag gtactcaaag gtactcagag 44821 gtactcaaaa ggtcctcctg cggaccttgt gtaggtgtca agttctctct agagtgaaca 44881 tgcctcagaa tcataatcag ggaggaatgt cattcacttt ttcttcattg acaaattgag 44941 tttaactctt ttccatccat gttcaccaaa ggtggcccgc cggtggggaa agaggaaaaa 45001 taagcccaag atgaactacg agaagctgag ccggggctta cgctactatt acgacaagaa 45061 catcatccac aagacgtcgg ggaagcgcta cgtgtaccgc ttcgtgtgcg acctccagaa 45121 cttgctgggg ttcacgcccg aggaactgca cgccatcctg ggcgtccagc ccgacacgga 45181 ggactgaggt cgccgggacc accctgagcc ggccccaggc tcgtggactg agtgggaagc 45241 ccatcctgac cagctgctcc gaggacccag gaaaggcagg attgaaaatg tccaggaaag 45301 tggccaagaa gcagtggcct tattgcatcc caaaccacgc ctcttgacca ggctgcctcc 45361 cttgtggcag caacggcaca gctaattcta ctcacagtgc ttttaagtga aaatggtcga 45421 gaaagaggca ccaggaagcc gtcctggcgc ctggcagtcc gtgggacggg atggttctgg 45481 ctgtttgaga ttctcaaagg agcgagcatg tcgtggacac acacagacta tttttagatt 45541 ttcttttgcc ttttgcaacc aggaacagca aatgcaaaaa ctctttgaga gggtaggagg 45601 gtgggaagga aacaaccatg tcatttcaga agttagtttg tatatattat tataatctta 45661 taattgttct cagaatccct taacagttgt atttaacaga aattgtatat tgtaatttaa 45721 aataattata taactgtatt tgaaataaga attcagacat ctgaggtttt atttcatttt 45781 tcaatagcac atatggaatt ttgcaaagat 
ttaatctgcc aagggccgac taagagaagt 45841 tgtaaagtat gtattattta catttaatag acttacaggg ataaggcctg tggggggtaa 45901 tccctgcttt ttgtgttttt ttgtttgttt gtttgttttt ggggggtttt cttgccttgg 45961 ttgtctggca aggactttgt acatttggga gtttttatga gaaacttaaa tgttattatc 46021 tgggcttata tctggcctct gctttctcct ttaattgtaa agtaaaagct ataaagcagt 46081 atttttcttg acaaatggca tatgttttcc acttctttgc atgcgtttaa gtcagtttat 46141 acacaaaatg gattttattt tttagtttaa ctgtgtttct ccgacagctc acctctctct 46201 gaccacccag ccatttcctt cctgtgctcc acgttcttct gtgtgattaa aataagaata 46261 ttatttttgg aaatatgcaa ctccttttca gagatcagga gggatttatg tagcagctat 46321 ttttactgca aaagtaattc actggaaaaa aaatgtaatt tgtaagaaag ctttattttt 46381 atctcagctc tatgtaaagt taaagttact gtacagagct gaaggacggg gggcggtagg 46441 ggtcttgatg aaacctcttg aacgaagcac agtttgtccc atctttgttc actcgtgtgt 46501 ctcaaccatc ttaatagcat gctgctcctt tttgctcagt gtccacagca agatgacgtg 46561 attcttattt tcttggacac agactattct gaggcacaga gcggggactt aagatgggaa 46621 agagaaagca tcggagccat tcattcggag aaaacgtttt gatcaaaatg gagacttttg 46681 tagtcgtttc aaaagagcac ctgagtcatg tgtattcccg gcctttataa atgacccggt 46741 caagttggtt tcaaagtccg acaggcttgt ctgtttacta gctgcgtggc cttggacggg 46801 tggctgacat ctgtaaagaa tcctcctgtg atgaaactga ggaatcgggt ggccgggcaa 46861 gctgggaaga gcaaagccag agctgcgctg cctcaatacc cacaaaagac cattcccagt 46921 atacataagc acaggatgtt tttctcaaga gggatgtatt tatcacttgg acatctgttt 46981 ataatataaa cagacatgtg actgggaaca tcttgctgcc aaaagaatcc taggcagtgg 47041 ctcattgtat gtgaggttga accacgtgaa attgccaata ttaggctggc ttttatctac 47101 aaagaaggag tttcatgggg ttcagcctaa cagttatgga aactacagtc cttataaacc 47161 attggcatgg taataaacag atcttaagta taaaaatttt gtaattgggc ctttactctc 47221 tcaataataa agtattttgt ttatataaat tctttgtgat agtcctcgtt cttcctctcc 47281 acacccagca tgaaggagtt ggaggaagga tgttaacccc agatccattc tctactcaaa 47341 acattccatc atcaagcggc aagtctctgt ttaactggtt tatacacaag tcacttagaa 47401 accacaaccc aaattggaat caactttgag cccttcctaa aagaattccc aaaaagtgct 47461 ctctttcaaa 
acaaaaaatt cttttaagaa agtgttataa tagaaagatt caaatgtctt 47521 tctttgccag aagcttggca gagataacag aggagagatt ctggaatgtt tatattaagt 47581 acattagaat ttgaggttta agtacttttg gaactgaggc cacaacactc tgtcccctca 47641 gtggagtctg acattggtaa ggtgatggtg gtcttaggac acctgttttc aaatttgttc 47701 tctttacatg ctccaaaatt agtgagtatg ccaaaaagct tatattcatt ggtgcgctag 47761 agccagctca ctgtgttctc ccgagagcta catgtccctt cccatcatcg catcagtaat 47821 ggtagcttga gatggggcgt gatgagagta tttacaccat gggaatcagc agacactaca 47881 aatcagggcc ttctctcccc cagagtgctg attgttaaac atttaccaac actctactgg 47941 ttatatctat caataatttc catattagaa attaaagtaa attttaaaag aattcattta 48001 aaaataattt tacttattgg cattaacagc atttttttct tgcaaaaagt aattattttt 48061 taaaaagtta gcatggcatt gtttcacatt tgcaaatctc tttaatgtct ggtttagtat 48121 tagagctgga ttcttctgtt tctgcattca atctgtttgc atatattttg atagagtatg 48181 tgaagaaaat ccggcctaac agatatgtaa cagaaacaag gaggagcatt taagactatt 48241 tcagataatt gtagatattc tttctttact atgccaaaac ttaacaaagt ataattcctt 48301 agttgcaatt caggtatgaa accatattaa taaactttcc taaggttaca ttaaattcca 48361 ttggcctacg ttgcactttg aacagatctt ttagccgtgt gtgattttgt aacatcctgc 48421 attggtcact tggaaaatat tgcttcaaaa tgttaacaca tttcaaaata taatattcaa 48481 caattacatt acttaatatc accacaaatc acatcagaaa attactggga aactatcaaa 48541 ctcacagtgg tggatacaca ctttttaaaa ttcaaatttt cacttgaaac ctcaaatttt 48601 ataatcagca acaaataccc ctaagtattt gaggagagat tctcggatgt ttatactaac 48661 tacattagaa tttgaggttt aagtactttt ggaactgagg ccacaacact ctgtcccctc 48721 agtggactct gacgttgtta aggtgatggt ggtcttagga cacctgtttt caaatttgtt 48781 ctttttacat gcttcaaaat tagtgagtac gccaaaaagc ttatattcat tggtgtgcta 48841 gagccagctc actgcgttct cccaagagct acgtgtctct tcccaccatt ttggggggga 48901 agcgacaggc tcactttatt tagtttttga ggaaggacca gccaaatacc caaatcagaa 48961 taatttgcca gttattctct cagataaaag tgatattcat gaatagagtg gacagtttag 49021 cttgcaactc aaatacacaa ttgcatctcc ttacaacaat caccatattt cagtatgtgt 49081 gcagaccagt tctttaagtg tactttccat tttattacac agaaattaaa aagatgtgtt 
49141 ttccattagt aaaaaataag tttttactgc ttcatcaaga gcattcttga ggccaggtgt 49201 ggtggctcac acctgtaatc ccaacacctg gggaggctgg ggtgggcgga tcacctgagg 49261 ccaggagttc gagaccagcc tagccaacgt gacaaaacca catctctact aaaaatacaa 49321 aaatcagcca gatgcggtga cacatgcctg tagtcccagc tactttggag gctgaggttg 49381 gagaatcact tggacctggg aggtggaggt tgcagccagc ccgggtgaca gagcgagact 49441 ctgtcttaaa aaaaaaaaaa ttcttgagtg aaactcttat tttttctttt ctttttttct 49501 gcaagtgcat ggtggtggtg gagagtgcag tgacacagtg actgccaact gttccggttc 49561 acgtgattct ggttcagcac catagtgcaa aaccacctgt gcttttatac catcatagca 49621 aaggtcaaaa agtcaagcgt ggatttgtat ccttgtaaaa atagtcttaa ccttgaggat 49681 ccttctgaaa gggtctcgga gacctcagag tctgtggggc acattttgac aatcgttcta 49741 gaaagaagag aggctcactc agagcaaaag tggtggatct ctggaggaca aagcgaaggt 49801 agggaaaatt agcctttggc ctcccaaatc cagtgtccgt gcttccagtc ttctgacctt 49861 ttcatagcca ggtaaggagg aagcatttct tgtttaaatc aagaagaatt gtctagtctg 49921 catgataaag aaaagttcct tcacacaggg tgaaatggat actctcaaag cccttttcct 49981 gccatccttc tatctttgtt cttttccatg ctccagtcca attccaaatc tgtttcagga 50041 aacaaaaaat gtatattttg ccactttttt ttgagacagg gtcttgcttt gttgccaagg 50101 ctggagttta gtggcatgat cacagcccac tgtaacctca acctcctgga ctcaagtgat 50161 ccacccatct cagcctcctg agtagctggg actacaggca tgcaccacca gacccatctg 50221 atttttaaat ttcttgtaga gatggggtct ccctatgttg tctaggctgg ttttaaactc 50281 ctaaaatcaa gggatccttc tgctttggcc tcttaaagtg ctgggattat aggcatgagc 50341 cactgcacct ggtcttgcca cagtttttct gaaacttaga agaaaagagg aaagggaagg 50401 gaaataagta tttagagaaa tcatttttcg gaacatcttc aaaactggga ttgtgcagag 50461 atgaagtcat ggggcctgcc tgcccgcctt ttccatggcg gggaagaagc agaattacat 50521 ttccgtcagt tctgccccac atcctggctc tcccccatgt cctggctctc cccaacatca 50581 ctcaaccagc tttatttatt tatttgttta tttatttatt tatttattat actttaagtt 50641 ttagggtaca tgtgcacatt gtgcaggtta gttacatacg tatacatgtg ccatgctggt 50701 gtgctgcacc cactaactcg tcatctagca ttaggtatat ctcccaatgc tattcctccc 50761 ccctcccccc accccacaac agtccccgga gtgtgatgtt 
ccccttcctg tgtccatgtg 50821 ttctcattgt tcaattccca cctgtgagtg agaatatgcg gtgtttggtt ttttgttctt 50881 gtgatagttt actgagaatg atgatttcca atttcatcca tgtccctaca aaggacatga 50941 actcatcatt ttttatggct gcatagtatt ccatggtgta tatgtgccac attttcttaa 51001 tccagtctat cattgttgga catttgggtt ggttccaagt ctttgctatt gtgaataatg 51061 ccgcaataaa catacatgtg catgtgtctt tatagcagca tgatttagag tcctttgggt 51121 atatacccag taatgggatg gctgggtcaa atggtatttc tagttctaga tccctgagga 51181 atcgccacac tgacttccac aatggttgaa ctagtttaca gtcccaccaa cagtgtaaaa 51241 gtgttcctat ttctccacat cctctccagc acctgttgtt tcctgacttt ttaatgatgg 51301 ccattctaac tggtgtgaga tggtatctca ttgtggtttt gatttgcatt tctctgatgg 51361 ccagtgatgg tgagcatttt ttcatgtgtt ttttggctgc ataaatgtct tcttttgaga 51421 agtgtctgtt catgtccttc gcccactttt tgatggggtt gtttgttttt ttcttgtaaa 51481 tttgtttgag ttcattgtag attctggata ttagcccttt gtcagatgag taggttgcga 51541 aaattttctc ccattctgta ggttgcctgt tcactctgat ggtagtttct tttgctgtgc 51601 agaagctctt tagtttaatt agatcccatt tgtcaatttt gtcttctgtt gccattgctt 51661 ttggtgtttt agacatgaag tccttgccca tgcctatgtc ctgaatggta atgcctaggt 51721 tttcttctag ggtttttatg gttttaggtc taacgtttaa gtctaatcca tcttgaattg 51781 atttttgtat aaggtgtaag gaagggatcc agtttcagct ttctacatat ggctagccag 51841 ttttcccagc actatttatt aaatagggaa tcctttcccc attgcttgtt tttctcaggt 51901 ttgtcaaaga tcaggtagcc gtaaaaaaga gaattttaga ccaatatcct tgatgaacat 51961 tgatgcaaaa atcctcagta aaatactggc aaaccaaatc cagcagcaca tcaaaaacct 52021 tatccaccat gatcaagtgg gcttcatccc tgggatgcaa ggctggttca atatacgcaa 52081 atcaataaat gtaatccagc atataaacag agccaaagac aaaaaccaca tgattatctc 52141 aatagatgca gaaaaggcct ttgacaaaat tcaacaaccc ttcatgctaa aaactctcaa 52201 taaattaggt atcgatggaa catatttcaa aataataaga gctatctatg acaaacccac 52261 agctaatatc atactgaatg ggcaaaaact ggaagcattc cctttgaaaa ctggcacaag 52321 acagggatgc cctctctcac cactcctatt caacatagtg ttggaagttc tggccagggc 52381 aattaggcag gagaaggaaa taaagggtat tcaattagga aaagaggaag tcaaattgtc 52441 cctgtttgca gatgacatga 
ctgtatatcc agaaaacccc attgtctcag cccaaaatct 52501 ccttaagctg agaagcaact tcagcaaagt ctcaggatac aaaatcaatg tgcaaaaatc 52561 acaagcattc ctatacacca acaacagaca aacagagagc caaatcatga gtgaactccc 52621 attcacaatt gcttcaaaga gaataaaata cctaggaatc caacttataa gggatgtgaa 52681 ggacctcttc aaggagaact acaaaccgct gctcaaggaa ataaaagagg atacaaacaa 52741 atggaagaac attccatgct catgggtagg aagaatcaat atcgtgaaaa tggccatact 52801 gcccaaggta atttacagat tcaatgccat ccccatcaag ctaccaatgc ctttcttcac 52861 agaattggaa aaaactactt taaagttcat atggaaccaa aaaagagccc gcatcaccaa 52921 gtcaatccta agccaaaaga acaaagctgg aggcatcaca ctacctgact tcaaactaca 52981 ctacaaggct acagtaacca aaacagcatg atactggtac caaaacagag atatagatca 53041 atggaacaga acagagccct cagaaataac accctaccag ctttctttcc tgatagggct 53101 ccagaactcc acatggtgga cacttgtcac tgggaatgat tcaggcttta atggggctgc 53161 tgggcgggag cacagggctt gagggccaga ggaagtggag caggaactgt ggcaagaggt 53221 agagggagca agctgcccca tcacctctcc tgaaatatca agcaagttaa caggaggatt 53281 tgttgtgatt tggagatgga ccttggatta tcactgtcac tgggtctgtc aggcactatt 53341 tgtgaccttg ggtatgcaat ttaatcactc tctgcctcag tttcccctcc ataaagccag 53401 gtcatttgtg gctgttgggg ggattgatct attgggggta taagattgtg taaagtcagt 53461 aacctgggat gggggtgcta tttgaacaga gcacgccaag gaaggtgtgt ttgagccagg 53521 ggatctgcaa agaaaatatg gatcccagac aatcatgtct ccattttcat cagaagaaat 53581 ccaggggcag agttcagtga tgaaacaagt acttaactga accaagactc aaaccctggt 53641 cttctgtttt ctggtgttgt cctctggcca aaataaccct gtcatgtttg gttgaagtca 53701 ttataaatgc tgccccagat gctacctagg gctccttcaa gtttctagtt cctccttcca 53761 caaaccagag gccacagccc ttgatttgaa ttttaaatta tcaggaacct cctacccaaa 53821 gaacacacag ccctcattta cagcagctgc tggcagctag ctgagttcga acagatgcaa 53881 gaatctcata atctgggcca gatattaatc tgtcttttct gtaacagcaa aacagaaggg 53941 tgagtcttga agcagtgcag ggggaaaccc cttattgtgc aactcttttg taactcctgg 54001 gcggagtctg gacatgtgac ttctgcttga aatttaacat taggggaaag accatttttg 54061 gtcatagctg aggtgatgac atcttggcaa ctggaatgga catgacctct gttccatgta 54121 
ctatgagtaa ctgccccctc cccatttctc aaagtcatgt tcagatggca aaactgcctg 54181 ggattaccac atctgactct tgctagctgt ttctgtgtct atggcttcat actgtccaga 54241 catagacctg gggctcagca cacatgtatg tttgagtggc ataaggtcag aagcacagat 54301 ccagcaaaga ctgtggcccg gaggaaggtg ggttggttgg aggaaagaaa gtggaggtgg 54361 gagcaaacga agataagtaa acaaaaaaat gagctcttaa gcaaaaaaga ttgcctgtct 54421 gctgtagcct gtaaagctct gatgagaaac acttttatga tttgctggta aactccaccc 54481 aatttggttc tggaagaggt tatttttcat tgcagtgtaa gcatgtaatt agttaattgg 54541 gaaatatggc gccattttca aagctcactt tcatctgtca tctcacttac acctccctgt 54601 gaccctggga gattggtttt attatgccca ttttacagat gtggaaactg aagctcggag 54661 aggtgaaccc cctaaaaaga aattggagag gcttatcctg actcatccgt ggtccctgat 54721 tccttgtgcc tgagatgcag actgtggctt gggaataggg gagggctcag agggctggag 54781 ctcagctgag tggccggatg ttccctcaca gggtaccctc ctttggcttc ctgttgaccc 54841 tagtagttgt caatagtgct gcaggtgctc cacgccttcc caccctctgg atgaagctca 54901 tcctaaggaa ttagcatctc atttctaatt ctcaggtcct caggaatcag acttgtattt 54961 tgggtcaaat ttgcatacct tggggcttct gttttctcaa gctatgtatg gaccccagag 55021 caaatcccaa ccatggttca atatcagcct ctgggaatct gcagatcact ggaagccagg 55081 ctgcccacgc tttctggaag ccacttccta gtgaaggagc cctggaggaa agaggcaatg 55141 gtataggagt ctccctaaag tgagaagtgt ggagcccagc acttcaacta tgtctgtggg 55201 tggagccaaa ataaccagtt gctcctgatc gctgcatgtc agcaattgta tcacaacaag 55261 aggccacagg gcatcaccct acccaacact gtgatggggg ccggggtcac atgtgcttaa 55321 gtgtctctgt aaatcagccc ttcagttggg attaagacag ttcaggatcc caagaacaca 55381 catgcacaca cacaacccac ttgcacatac atgtgtgcag tcacacttcc agagacacac 55441 atagacatac atatgtatac acacatgtgc gtgtgtgcac acaggtattc acatgcacac 55501 acacacgcac acttatacac taacacacat gtgcatatgt gcacgcatgc acacacacac 55561 acacacccca ctccttttgc ttgcttcacc ttttctgtct ctgctcctcc tggtggtcat 55621 cacctcccat gccctgtctc ctgtatcaat gccccaggac tacctactat aacaaagtgt 55681 cacaaagtgg gtggtttaaa gcaacagaaa tgtatccttt cacagttctg gaggctagaa 55741 atccaagatc atggtgtcta taaggttggt tccttttaag tgcagtgagg 
aaagagctgg 55801 ctccagccct ctctcccatt ggctggtaga tgatcatctt ctccctgtgt ctcttcatgt 55861 cactttccct ccttgcccct ctctgtgtcc aaattccccc tttttataag gataccagtc 55921 atattgtatt agggccagat ctaattattt cattttaact caattacctc tgtaagtaac 55981 ctacctccaa agacagtgtc attttgaggt tctgggggtt aggacttcaa catatcaatt 56041 tgagggcaca caatacagcc catatacacc tcaacactaa ataagttaca tggtagttct 56101 atttttactt ctttgagaaa tcttcatcct gtttttcgta gtggctgcac tgatttttac 56161 ttccatcaac cgtgtgtccc cgtcttttga tatgtgtgta tgtcatatgt cttcttttga 56221 taagtgtgta ttcatgtttt gaccactttt taatgggatt attattattt gttgttgttt 56281 tgactgttga ctttgttgag gtttttgtat attctggata ttagttcctt gtccaaatga 56341 atagtttgca gatattttct cccatccaaa agggcatctc ttccctttgt agattgtttc 56401 ctttgctgtg cagaagcatt ttagtttaat atagtctcat ttctctattt ctgtttttgt 56461 tgcctgtgtt tttgaggtct tagccataaa ttgttcgcct agcgcaatgt cctgaagtgt 56521 ttttgctatg ttttcttcta ccagtttatg ttttcaggtc ttatgctcaa gtttttaatt 56581 caccttgggt tgatttgtgt acatggtaag agataaaggt tcagtttcat gcatctatgt 56641 gtgagtatct aattttccca gcatcattta ttgaagaggg tgtctttttc ccattgcatg 56701 ctcttcattc ttttatggaa aatcaattga ctgcaagtat gtggatttat ttttggtttg 56761 tctattccat tggactatgt gtctattttt atgccaatac catgctattt gggttaccaa 56821 tagccttgta gtatatttta aagtcagata gtgtaatgcc tttttctttt ttctcagaat 56881 tgctttggcc tttcagactc ttttatactg ttttttctat ttctgtggga aaaaatgatg 56941 ttggtatttt gatagggatt tcattgaatc tgtagattgc tttgggcagt atggtcattt 57001 tgattatatt aattctttca atccatgagc atgggatgtc tttccatttg tttgtgttct 57061 ctttcatttt ttttttttca ttagtatatt attatttgtg cagtgagcag aagcaaagtc 57121 agggacactg tttgtagaac ctggctgcct cccctcccgc agcccagctg attgtcctgg 57181 ttgccaggcg cttgtttgca ttagagacag aacactctgc agggttctga atggttcatt 57241 gtggccctta ggccactggc aagcagtggc agagccacgg aggggctgct ggtccccaaa 57301 tgtctctcca gccactctgg ggctggtggg ggctgccctg ctggtgatgg cttgagagaa 57361 gacctggggg tccctgggca tcctggggag gtcctggttt gtgctgctgt ggtgatctta 57421 atcatcaaga gatgacacac tttatgacac 
acgaaaagaa ctgagccaaa cactgtttcc 57481 atcaagaggt ccatgtagaa ggcagccaga acgtgtagaa ggatggaagt caaggcacag 57541 gcccaggaag ccttcaagtc acccctgatg gaggccagac cttgaacatt ataatccctc 57601 tgcttgtcaa ccattaatac aggtcttccc tgcaggacat aggaggcttt ctgggaaaag 57661 catggaaaaa acaagcctac taaatgttct caagtgactg atgtaggaca catcttaaaa 57721 ggatcaaagg aggatgaaga aaaacttcag aagaaaataa caggtggttt gaatgaaaac 57781 actcattttg cagaaagtcc agaagtgctc aaccaagaaa ttacaagatg gaagcaaaga 57841 gtacagaaaa aggaggcgaa agaaaagctg gaacttttga atgcacaaat aaaaaaggtc 57901 atgaaagata ccatgagcca attaaaatct atggctgaaa acctgctgga caccatgtag 57961 ttgcgagatt tccctagaga ccacggtgac gagacctaga aaacagagct aaagagaaaa 58021 gcagaaaatg aagaccgcct cccccagccc cagccctccc aactgtaagt cagaagagtt 58081 ttgctgggtg atggtgactt aaatatgtcc tgtgaaagtc aagaaaaatc aagaaatagc 58141 caggcaattg tgggaagaaa atgcaatgaa tgagagctca caggaaaaat caaatgcctc 58201 caaactgaga aagcatcttt gcaatttgaa aattcctagc tgggccaggt gtggtggctc 58261 acacctgtaa tcccagcact ttgggaggcg gaggcaggct gattacttaa ggtcgggagt 58321 tcgagaccag cctgggcaac atggtgaaac cctgtctcta ctaaaaatac aaaaaattag 58381 ctgggtatgg tggtgtgcac ctgtaatccc agctacttag gaggctgagg caggagaact 58441 gcttgaacct gggaagtcga agctgcagtg agccgagatt gtgcctggca acagagcaag 58501 actccatcta aaaaaaaaaa aaatcctggc tcacaggtgg gatccagaac ctgcagctga 58561 agcttcaaat cctgcctgaa tcacatcaag aacatatagt gcaacttcag agaagatcac 58621 ctgaggtgga aacacacagc ttagagatga agaaacttcc cagcatgtgt agaaacgtga 58681 acacacacag cggatttgaa gcctctacaa ggagatggcc aaagacatca gcaaagaatt 58741 ggagaggacc acttccttct gtcagaagaa gattatcggc cgggcgcggt ggctcatgtc 58801 tgtaatccca gcacgctggg aggccgaggt gggcggacca cgaggtcagg agatggagac 58861 catcctggct aacacagtga aaccccatct ctactgaaaa atacaaaaac tgagccgggc 58921 gcggtggcgg gcgcctgtag tcccagatac tcgggaggct gaggcaggag aatggcgtga 58981 acctgggagg cggagcttgc agtgagccga gatcgcgcca ctgcactcac tccagcctgg 59041 gcgacagcga gactctgtct aaaaaaaaaa aaaaaaaaaa aaaagaagaa gaagattatc 59101 atctgtgagg 
aaagagctca ggaaagctgg agggcagttc tgtccactga gagaaagctc 59161 caggagctaa gaaaagaaga tgattgtagt aggccaatcc tggctgacgt ggacttcaag 59221 ttccaggctt tccctggggg atcctttgct cctgcttctc caggcacagc ccacagaggc 59281 ccagaagtag caggggtccc ctgggtcatc aggtccccag caaggaggag ggaggtgctg 59341 tgaggctcag ggacccaggc caccaggttt cactcaactc tcccagaagt agggccctga 59401 atgcgcggga ccacagaggg acacagcagc ctgggctgtc tctttaaagt gatctggatt 59461 tatctctttt tagtttagct attgttactt gggatgccac gattttgatt tcttgaagtt 59521 tggtatagca ttataatttt acgttagtat tttaaaatat agacatcaat gtaattctca 59581 taaatataat tcctcatcac tggacccttt aaattacatg gtatgagtat tgcattttta 59641 tttaagttca ccttttatgt ttgagatggg tggagatgtt taaaacgata aagttcattt 59701 ctgaatgcca caagctgtta ctgtacatct atacaagtaa cagtatttac ctgaataaat 59761 tttaattgtt tttaaaaaca atgcaaaaaa acttaagaac aatataacat tcaaactatc 59821 taatatttgt attagtttag tcctaaattt attgactaac ctgggagagt tgatatttta 59881 taactatttg tttcttgttt tatatttttc ttaaccattg atgttttaag acaatgttgt 59941 ctaaacattt ctaaactata taatcaacaa aaagatttcc attgatggca aacccattct 60001 ctgttacaca catacacaca cacacacaca cacacacaca cacacacaca cacaaaacac 60061 ccgaatgttg agagaaggga ctttcagttg ttatgtcagt acctgaagaa aatgtagaat 60121 taagagagat acaaagtgtt tttttaatat ttacataaat aaaggaaaaa ctttgtatgt 60181 acacaatttg tacacacaga aaagacaact aaattatcta tacactaaag gagagttgcc 60241 tgatgaatca gacaagattt caacaagaaa cagttcattc caaatggtgc agatgaagag 60301 actttgataa aggaaagttt cctgggctgt cacagtttcg tatgacagtt tcctgggcgg 60361 agataaaggc ttggacaaga gatggtgaga ctcccagggg ctagcactag agaaaaggta 60421 gtcttacctc tggacctaaa aggaggagga tgtggtgtca tctgagccca gtgaagaact 60481 gtcttgggac ctgaagaaac tagagctaga ggtgcctact aaggtggcca cggcgcattg 60541 ctggagctgt gccccatgct ccaactgtct ctgcagccgg ctggggaggg tgggaacatc 60601 aaggaaggct ctctggagga ggtggtccct gagcttaaca ctgatgaatg agtaggagct 60661 acctccaatg cttttcacac atgaaggggt agatgaatca cttggagatc tttcactaag 60721 caggttcaga ttatcaggtc ttgcagcagg agctgagacc ccgggttgct cccaggtgat 
60781 gcctacagca ggatgagatg tggaacagca agaatccttc gataaataca gggcaggatc 60841 cggggaagtt gaagggacag gagcatttcc tgcagacagt acagcccagg caaagacaca 60901 agggcacaga cagcctggtc tgcgtgagct gcaccatgtg actggggttc tgcagcccaa 60961 catcaggcag gaaggacagg accatgagaa acagcgtgca acatatgcag ggtgtcttcc 61021 caattatcct cagaaacact catctgcgtg ggtgagtctt gtcctgagga taccacctgt 61081 cccagaaaag tgtgaggaac cgtaggctga agctccccct tggagaagcc accatggagg 61141 gtatgcttgg cctccagaat cccctgtgtg ggtcagcagc tccctgctct cccggttgct 61201 cagcccagac atctggaaat ccctcctgac tcctctgtat ccatccatca ggtactccca 61261 taaattccgc cctgcaaagc ccctctactc ttttctgtcc atctcgacta ccaccatcct 61321 gtaagccacc attacctctt cccaagtgct ctgttccttg ggtaaccacc tcccttacta 61381 ggcccccact tcactattac ttcctccaat tcatgtctgc acacagccag agtggacctt 61441 ttaccaaaga cctgagcatg cttcatatga agccagacct cggcctgcat gcggtgagtg 61501 tgcagcagcc cttgaccttg gtccttgtct cctacctcgg gtcatgaaca gcaaaggtga 61561 gcacacatcc ctgagtgccc tggacgggcc tggctggggc cagcgtatca tgctgtgccc 61621 agtatattat tcatagctcc ctttcactct taaaattgct tcagttacga caataaatta 61681 tatgatcaca gaatgaatgg gtcacttgca ttcaaacttt tcaaagagaa ctgcaacatt 61741 cctaaaatat agaagctcat gagaaaatag acaacatgag tcaaatcaaa ctctaaatac 61801 agccgtggac agtcaaagaa acgccactgc tttgccaagt tagcccatgt gggaacatca 61861 cgtttttggt taagacaatt ctatattctt aaaggccaaa tgtaagccac ctcagttttg 61921 ggaaggtccc aataaaagct gaataatgtg taagcatagt ttaaagttaa ggcccctcta 61981 atttattgtg gacctgaaga tattatgctt gaaaatcatt tcacaggagt ccattctaaa 62041 gtttgtatga tgcaacgttc ttgtgaatta tagacaaatg attcatctgt ttgactcaga 62101 ttcaaggaca ttttttgtgg tgaagtaagg cactgagtct caagatataa tgtaaagata 62161 tggagttgct atgccaggaa aagaactata gaaagggcag ctaggaagtc aatgcaatag 62221 tcagcacacc tctggggccc tccctgggag cctctgattc cattactctt gggtgggccc 62281 tggacatcta tattttctcc aagttccatt catgattcta atgctcagca aagctgggga 62341 accactgcaa taaggcatga tcttgagagt tatggcgcag agacatgcat ctgcctctga 62401 ctcagcacaa ggggcacccc gcttctcacg tctcatgact 
gcatggtgta cagtgtggtc 62461 agttcagcct ttatgtctaa cctctgtcct tctccaccct gtggggtgcc gctgacctgt 62521 ggggtgttga ccacaaacag attcctaccc catgtttctt gggggttcat cccatgggga 62581 tttccagaag gagactggag tttcactctc ctgcctcccc tgatccctgc tttttttttt 62641 tttttttgag acagggtctc actctaacac tcaggctgga gtgccgtggt atgatctcgg 62701 ctgactagca acctccacct cctgggttca agcgattctt ctgcttcagc ttcccgagct 62761 gctgaactaa aggcacgtgc caccacgcct ggctaatttt tttattttta gtatatttta 62821 gtagagacgg aatttcaccg tgttggccag cctggtcttg aactcctgac ctcaagtgat 62881 ctgcccgcct tggcctccta aagtgctaag attccagacg tgagccaccg ggctctcctg 62941 cctccctttt gaccagctgg ccttgagctg gctgtgtccg tgggcctgag gctatgctcc 63001 tctcaaaggc tgactctcca tggctctcct tgtggttcca ctaactagtg cctacccttg 63061 tccctcgagg cttagaaggt gctgaagctg aactgcaggg agccccaggt tactgcctgt 63121 attcattacc tgtgtctgct gtaagaaatt tccacacact cattcactta aaacaacata 63181 tatttattat cttacatttc tagaggacaa gagtccagaa atagtcccac tgggctaacg 63241 tcaagggcta gtgccctctg gagctcctag agtcagcctg cattcccagg cttatggccc 63301 cttccaccat cctgagagcc agcagtgtgg caccttcaac tctctctctg cccaccctcc 63361 gctttcatca tcactcatct ctgactgtga ggatcctgct tccctctttc gaggaaccct 63421 gtgactacat gggtgatcca tgacagtctg cccatctcca gatcctcagt caccctgcaa 63481 agtctctttt gccgagtaag gcagcatgtt cacaggttct ggggagcagg aggtggacct 63541 cttgggggtc attattgcgt gtaccgcacc ttgctttcta ctttgcagtt tcctacgccc 63601 acacctttgt gaacagcccc gtggtgtatg aaccctcctc aaagtcacct gtttccagca 63661 tcattttttc cctaactctg tgcaccatct ctagcactta ttttttaaat gtattgaaaa 63721 gatagactca attagcactt atggaacagc tttccgagga aatgaataga tttgagcagt 63781 ttgctgatag aggtgcccag tggaaatatc aaagtacaaa tgagaatgtt ggtggtttcc 63841 atgaagtgaa gctgtctgac cagacaataa catgagaatc atgtctccac agagattacc 63901 cacttttgaa tcactctgtc aaggtgttgg cacaatgctg ccctcattat cgaaaggcga 63961 tgactacaac agtcttgttc attcaagagc tgtactccag tcaaggaact gagaacctcc 64021 cagggggcac agccttgtgc ttggttgtgc aaccaacatg ccttatttct aatttcacac 64081 acaatgtgtg tgatcagcaa 
aacaagggcc ccccaaatat gtctatgccc taatctctga 64141 ggcctgtgaa tatgctgtgt tacatggtca gagagacttt gcagatgtaa ttaaatttag 64201 ggactttata atagtgagat tgtcctggat tctctgggtg ggccaaatta atttcatgag 64261 gccttcacag tagggcactc tcaggctggg agcagaagag gggtgcagca ggaagagaag 64321 tcagtggagt ctgatcctga gatggatggc atgcacgctc actggccctg agatgcaggg 64381 gccacatgtg aggaccagag ggagatctct aggggctctg acggctcctc ccagctgaca 64441 gccagcaagg aaacagggac ctcagcccca caaccatgga aactactagc agtatagatg 64501 aacctggctg ctgatttttc ccagggcctc ttagtaagaa cccagctggt tgacatcttg 64561 gtttcaggct tacagaaccc aaagcagaga aagcagttga gctgtcccag acttccgacc 64621 tacagaactg tcaggcaata aaccgttatt ggtttcagcc cctagctttg tgactgtttg 64681 tcatggcagc aacagaaaac taaacgaagc catttcagtg agagcaggaa aagtgctgta 64741 aaatagtatt ctctaaaaga gaaaattttc attgtttatt tctgatgata aaaataatac 64801 actctcattc ttggaaaata gaacaacatg aaaaaaaaag aagaaaatta aaattactta 64861 taatctcgcc aaccagagat cattatcatt attgcaatgg actatatcat gttccctcaa 64921 aattcatatg ttgaaattcc caaccaccaa tgggactgta ttaggaggtg gggtctttat 64981 ggaggtaatt aaggtaactg aggccatgaa agtagggctc tgatcgaata ggattggtgt 65041 cctcataaaa agataattta ataattatat atattgtata tacacaatct ctccctctct 65101 ctctctttct ctgctacgtg aggacacagc aaggggagag agcagctatc tgcaagccgg 65161 gaagaaagcc cgcatcagaa acccaatagg ctggcacctt gatcttggac ttccagcctc 65221 cagaactgag agaaataaat gtccgttgtt taggccactc agtctatagt atttacttat 65281 ggcagcccaa gcatactaat acaattactg atcgcatttt cctgtatttc ttaccaatat 65341 tttaaagccc acatttttct tagaagtaag aacatgtcat atataatcat atagcttctt 65401 aataagtaac attgttgcca ttaatatgtt actagaaaaa tttaaaaatc tgtttttcaa 65461 tgactacata atagtttttc aagctccata atggcaaggt ctttcctttt ttaaaaaagt 65521 gaggtttaat ttacatgcag taaaatgaac agattgtaag ggttcagttt gatcagtgta 65581 ttagttcatt gctgcactgc tgtaaagaaa tacctgagac tgggtaattt gtaaagaaaa 65641 gaggtttaat tggctcatgg ttctgcaggc tgtatagaaa gcacgacagt ctctggggag 65701 gcctcaggaa acttacaatc atgtcggaag gcaaagaggc agcagatgca tcttatgtgg 65761 
ctggagcaca aggaatagag agagaggaga gaggtgttga acacttttaa acaaccagat 65821 cttgtgagaa ctccatcaca agaagagcat caaaggaaga aactgctccc atgatccaat 65881 cacctcccac caggcccccg ctccaacatt ggggattaca attcgacatg agacttgggc 65941 aggggcacag atccaaacca tatcaatcag ttctgacaaa tgcattcagc tatgtaactc 66001 acatcactat caatatatta gatgttttcg atcactctag aaaattcctt ctgtgccact 66061 tcctgatcaa ttcccaccat tccagaaaca acttctattc tcatttctat ccccatgaat 66121 tagttttcct tgagaaggtc ataaatgtta tcatacacta tgtacttttg tgtctgcttc 66181 tctcactcgg aataatgctt ttgagattca tgcatgttgc tgtgtattgg tgctttggtc 66241 ctgtctgttg ctgagtcatg tatgggtaca ctgcaatttt gttattggtt caccagttca 66301 tgaacgcttg gcatatttct gttttggagc tagtatgaat aaagctgcta tgaacatttt 66361 tgtgcaagga tttttgtggg catatgtttt catatctcta ggaactgaat tgtgagtcat 66421 aggatagata tatatttaaa attataagaa acttccaaac catttttcaa aatggttata 66481 tcattttatg tgctcaccag ctacatgtaa gagttccagt agttcggctt cctctgctat 66541 acttggaatg gtcaatcttt ttcattttag ccatgctaga gggctgaagt cctatctcac 66601 ttttaatatg catttccctg atgactaacg gtgttcatgt gcttattggt cacatatcct 66661 atttggtaaa gtctcttttt agtcttttac ctaattttaa ctagggttgt tttttttatt 66721 attaaattct agcagatttt aaaaatatat gcagatgctc cctgatttgc aatggagata 66781 catcctgata aacctgttgt aagtcaaaac tatcatgagt ttgcatttaa cactccaata 66841 aagccatcat aaagtcaaaa aatcataaga atggacgttt tctcccagcc tgcggcttgc 66901 ctatttcata tcttaataat gtcttttgat gggcagaagt ttttaatttt ggtgaattct 66961 cataatgctt ggctgtgggt tctcatccaa atctcatctt gaattataat ttgaattgta 67021 atccccacat gttgagggaa ggacctagtg ggacacgatt ggatcatggg ggcagtttcc 67081 cccatgctgt tcccgtgata gtgagttctc acaagatctg gttgtttaat aagggtctgg 67141 tgctaccccc ttctttctct ttctctcctg ccaccatgta agatgtgcct tgcttcccct 67201 tcgcctttgt gaggcctcta cagccatgca gaactgtgag tcagttaaac ccctttcttt 67261 acccagtccc aggtagtatc tttatagcag tgtgaaaatg gactaataca aattccaatt 67321 ccaattttag tgaaatccaa ttccaaattt tggtgaattc caaaaatttt tttctatgac 67381 tctgctaaca aatgtgtcca tccccaaact cataaagata tttttaaagt 
tttcttctag 67441 aggttttaca ggtttagtgt tatgataagg cctatggtcc accttggaat aaattttttg 67501 tatggtatga ggtagggatt gaggttcttc cctctcctca tatggatttc acttatttct 67561 gcatctgttt ttggaaattt cctttttttc tcgttgaatt gctttggcat ctttgttgaa 67621 aataccattt gaccatatag ttgggaactt atttctagaa tttctatttt gcttgatcgt 67681 tttttgtctg ttctttcagc agtacagtcc tgtattagtc tgttctcatt ctgctaataa 67741 agacatatgc aagactgggt aatttataaa ggaaagaagt ttaattgact cacagttcca 67801 catggctggg gaggcctcac aatcattgtg gaaggcaaga gagcgtatgt aggagaactc 67861 aaaaccatca gatctcatga gacttattca ctaccatgag aacagtatgg ggaaaccacc 67921 cccatggttc aattatttcc acctagctct gcctttgaca cacgcagatt attacaattc 67981 aaggtgcgat ttgggtgggg acacagccaa actatatcaa gtactgtttt gattactaca 68041 gctttaggtt aaaatttgaa gtcaggtagt ataagtcctc caaatttgtt ctttttcaaa 68101 attatttttg tcattctatg tgctttgaat tttcatataa attttagaat ctgctcattg 68161 atttctattt ttaaaaacca tctgggattt ggattgggat tgccttgcat ctatacataa 68221 atttggagat aaatgacata ataagaaaat tgtctcccaa tccataaaca ttgtatatat 68281 ttccatttat ttaggtctta ttttatttct ccacaacatc ttatagtttt tattacagag 68341 gtattacata ccttttgtta aatttatctc taagtatttc atattttggg gtgctattgt 68401 aaagagtatt ttaaaacttt tatttttaaa tgctatataa aaatatatgt gtatataaaa 68461 acacatacat gtgggtgtat ttgtatatct atcactacag ctatatgttt atattaatgt 68521 ctatatctat gtctatcttt gagtctctat tttcagactt gggcaaactc tactcctcaa 68581 tttgtggaaa aatctcagta ttgatctatg catggtaggt gcatttcata catatgtatg 68641 tatttttaac aatgtaagaa gcaccctcta tttcaccttc caaacaaaaa cttaactgac 68701 agtaatctat atctaatcac atggtttccg ccacttaagg cctatcccct ggtctccata 68761 gtcaagtatt atcatcccaa ataaatctct gaatttattg aaccttttcc ctgttcacgg 68821 attttgtggt gatgtttcca ggtttttgct gtcacaacta attctgtaat aagtattttt 68881 gtgtggtata tttggtccaa acactgatta tattcttggg aggtgttttt ataattggaa 68941 ttcccatgct aaagtacatc aaagggcatg aacattttaa gattcttgat atagtttagc 69001 aaaagatttc cagagtgctt gtatcaacta actctttcac cagcaagaca agaatggcca 69061 cttcagagta cctttgccaa aatttattaa 
atttttttaa cttcgttaaa tataatgagt 69121 gacactttta acttttgcat tattcacatt ttattagaca gtggtttttc ctactctgca 69181 aattttctat tttgattttg aaatatttgt atttaatgat atttaatgta ttgtggagag 69241 taactagctg cacatcacat ttatagaaag tatttttccc ttttaatttt gtacatgatt 69301 ataccaagtt ttatacatgc tccatcctca tgacctaaac acctcccaaa aggcccaaca 69361 ccccaacaaa cagtgttaca gtggggatta agcttccaat acatgaattc tgggggacac 69421 attcagacaa tagcaaatat taaaactata aattgtataa aattgtataa acacaattta 69481 cctattatct gttaggaagt gtcattgttc taaggtcatc ctagtcctgg tcagccagtc 69541 tcacatgaaa taaaatccac ctggctgtga agccacctcc ttcagcacag tgccatcatc 69601 cctgccctct acttagtgaa aaacaatggg actgttaaag cgttgcattg gaaagcatta 69661 cctactcttc cagcctgatc tggaatagcc tcagacgccc ccagcccctc tcagaatgag 69721 gggacttttt ccagctccgc aggactgtgc agggcccctt cccagtgcag ccttgactgt 69781 ggggtgcggt ccctctgcat gccactcaca ggacaagttt aatcaagccc agtttaagga 69841 tcacaaccat aatgtctaca aaaccatgat gaactacaac catttgtagt tcattagttc 69901 agaaaatcat tactgaactt ctgtgaacct ccacagtgca agagcccttt ttgaaatgca 69961 aatttggata gattcccggt tcaggcttac ttatatgcca agataaatga aaaggggatt 70021 cttacatgtc agtcaaagat gatttctcat ctagcacaca ccctgagctt ctggaagtcg 70081 cccagtcact cactacgtta tttgccacac aggtattaca caaagaccac gagaaaaagt 70141 catcgtccaa accagagaaa ggaaattgac cggtgacctg tcctgggcgc atagccgcat 70201 gtgatgacag gcaggaccct ggcagggcag agaaaccagc tctgttcaga ggtttccaga 70261 ctgtaaattt aaagccctgt acaatttcaa caacaatgaa aatttaagga tcaagtgtac 70321 tggcctttca gccttccact ttgccaaata agcaaattga aaaaatagtt acactcattt 70381 ttgctatcat tttataaaag gatattttta acagaaaaaa agggtaaaga aagttatctt 70441 agaataaaag acagtttctc aattaagttc atttgaagta tgacaacttg ctatacattc 70501 cttatttttc tactttcgtt tgcagaccag tgcaaaggct tttatatgga ccaaccccag 70561 tcagaaagca ggcacttgga aacctcgtgt ctgtgaagaa caaggggtgg aacctctgaa 70621 atcctcccct ttgagatttc ccatgcattc acacatcctt ctgtgtgcct tgaaaatgta 70681 agaaacagat gtgggcaaga ggcagccaga agactccggc acaggccctg caacctatct 70741 gctcacatcc 
tgcttgggag acatgtcagc gataaactcc aggagtagtc acgagggctg 70801 acaggggttc ctgcccggct ctgcatccaa cagcctggtg gagcctaggc taagtctaca 70861 aaaatacgga gctcagagct cctacctaga tttctttgct ttattcaggc agaattgaca 70921 aattaaaagg tatgtattta aggtgtacta cttgattatt tgaaatacat gtacacaatt 70981 atgctaatta acatatccat cgcctcacag ttaccttttt ggtatgtgat gagaacactc 71041 aagatctgac ctcttggcaa atttcgtgta caacacagta ttgttcattg cagtcaccac 71101 gcaggacact agatctccag aatttaagtt acgttttgat tgattaggta tgagttgggc 71161 ctaacgatag cattttactc tttttattct tttttctttt ttttttttga gacagggtct 71221 taactgtcac ccaggctgag tgcagtggca caatcatggt tcactgcagt ctcaactacc 71281 tgggctcaag caatctccta cctcagcccc acaagtcggt gggactatag gcacaccatc 71341 acgcttggct catcttctat ttctcataga gacagggtct aactatgttg ccgaggctgg 71401 ccaggaactc ctgtacacaa gtggttctcc cgccttgggc tcccaaagtg gtgggattac 71461 aggcttgagc cactgcacct gactggacat tttactcttt atcacagcat tagatacaca 71521 tggtctaaaa tatcacaagg cttataatgg aaacaactag cccctggctc tgcctacccc 71581 acgctgctca tagccatgag attacgacca ccagtgcact tagtgatgct tttgggaatt 71641 cacttctatg cttgttaata acatgctcat agggctagat cctgatcctc cattgagcac 71701 attaaccttg gggatctgca aaaccatggc ttctccccac taaagatgtc catgtcctaa 71761 tcctgggaac atgggatcct gtgaacatgt cactttccat gccagaaggg actttgcagg 71821 tgtgattcaa gtaagcatct tgagatgggg agattttcct ggattatcca gcctggtcca 71881 atatcatcat aagggtccac atgaaaggga gggattatcc agcctggtcc aatataatca 71941 taaggatcca tatgaaaggg agggattatt cagcctggtc caatatcatc ataagcgtcc 72001 atatgaaagg gagggatgag agtggagtca gaaaaaagag aggtcgccgt ggaagcagag 72061 gttggaacaa ttcactgtga agacagagga aggggccaaa gccgaggaat gtggtggcct 72121 ctggaagctg gaaagggcaa ggaagattct cctcagagct tccagaaaga caacagccct 72181 gccgacgcct ttgattatag cacagtgaag actcatttca gacttctgaa ctccagaact 72241 gcaaggtcac agttgtgttg tcttaagcca ctgaatgtcc ggtaatgagt tacagaagca 72301 acaggaagcc aatacagctc tttaatactc ccccaaatta ccaccctgtt gccccaacaa 72361 acattcgctt ctctttcttc atcttcccaa tgttgtttat ctgggtcttc agaaagtcaa 
72421 tactatgatt atgtaaatat tttccactga gccacatcaa gaaataagat tgcatttctg 72481 tctcgtacat gtgtatgttt tcaagcttat aattactttg gattttcccc gatgcacctc 72541 tggctaattt tcaacagcca caatttactt gtaaaacgct ttctcaattc tattttccgc 72601 acagcttata tatcaaaaaa aaaattcagt tcatgttttt cctggaggaa tgcctccggg 72661 aaaccctggt ctcccagctt ccacctggac tgatggctcc ctgggtcccc tgcatggctg 72721 ccagcctgac aatcccttct gtcctgctct gcttcactgg tttcgctgct gagactcctc 72781 tttcttggtt ttctcttatc ttggacgagc atgtgcccag taactgcctg agaaagggtg 72841 cagacaagaa acgtttcttg agatcttgag tgcttgaaaa tgcctgtatt tcattttcaa 72901 aattgatcag tagtttcata agccaagaat tccacattag aaatttgttg cccttccaga 72961 attttgaagg cccttgtctt ctagctccct gtgttgctgt tgagaattct aagggcctcc 73021 tgattgctga tatttgctaa gtggcctctg tgttcctcta gaagcttcta aggccttctc 73081 tttatctctg ctgcagtgat attccatgat gatttgcctt ggcctgagca tctatcagga 73141 gatttatgcc acttagctct agggaatttt cttgcatttt aaaaaataac ttattctcta 73201 tatatttttt tccttttctc tcttttggaa ttcctgtttg tcaggagttc aactttcttc 73261 tcttgtcttt tcatctctcc tctcctattg ttggttcttc agccttttgc ggtatttttt 73321 ttggcaaaaa ttctcattgt tatcttctaa tcctcctgta gaattttgtt tcagctctgt 73381 cctctttcct gcccctaatc ctggagctgt gaatgttcct gctaggagcc tgatcttgaa 73441 tcacaagtat acagctcctc ttaagtccta gaaggtttta attgtgggtt tgtccagaaa 73501 gtttccctct gctccctgaa ttgcctgttt tctctgagat cctttctgtc ttgtctttag 73561 tttctgtttg tctgtgttag agggcttcct gtctggtgat ccttggctat ctattcatag 73621 ttaagagtga gacactaaaa tatttatcag aaactggtgt gctggggcag gatttgcctg 73681 tgcagtgaga ggcgaagtac atgttcagag gagcccacat tgttagcatc tggagcctgt 73741 cctctgggtg atgttgtcct cagaggtcat ctgttttcct gcttgctggt tataggtttg 73801 gctatggctg ccctggagca gggatgggga aggggccaga agtccaccat tcagtgtgca 73861 caattcatat ccccacctca ccctgcactg gagtgccagc tgcctcgatt tcccagagga 73921 tcaccctcga gtctctcgtg ggatcaagga gctagtgtct gcctgagcag cagatctaac 73981 aacccttgga cagcattcaa taacccacct cttgccaacc ccatctcatc ccaacctttc 74041 agagggcctg gagtctgatt ccggagcctc tcgggtggat 
cctcaaggtc ttctgcctgc 74101 acctctgact gcagacactc ggtagccagc ttccttccct ctgctaagtc accgcccacc 74161 catggctgtg tctgctttcc tcttcagaca taactcacat tgtcattcat cttatttaat 74221 ccccagtcct gggctcctca tcctgtaagg ttccctgttt tcatgctttt actgacattt 74281 tagggggggt gggcagaaaa gtaatgaaag tacatgtacc accggccatg tttaagcctc 74341 ttctgcctgg gctgggcatg gttgttcacg cctgtaatcc cagcactttg ggagtccgag 74401 gtgagtggac cacctgaggt caggagttca agaccagcct ggccaacatg gtgaaacccc 74461 gtctctacta aaaaatacaa aaattagctg ggagtggtgt tgggcgcctt taatcccagc 74521 tacttgggag cctgaggcat gaaaatcact tgaacccggg aggcagaggt tgcagtgagc 74581 cgagatcagg ccactgtgct ccagcctggg tgacaagagt gaaattccat caccaaacaa 74641 acaaacaaac aaacaaacaa aaaagccccc caaaaaacaa agactcttct gcccatgcct 74701 gtcagatact gaaggaaaaa cgcaaagcac cttccaggaa agctgagtca cagaggtgca 74761 cagagccaac ccaggagctt ctctttctcc cacagcatgc acccatccct agcaagtcct 74821 gccagctctg ctctaaacag gaccaggctc aaagctcatc tcttcacctc cttcactacc 74881 ttccctacca ttcgtctggg attcctgcag gcacctctga ctagactgcc ctgctgtgtc 74941 ctcttcagtc tactctcaga aattagactc agctggagag attcctgtta aaacaaaagt 75001 cagataatat gaccccttca ttctaaacct tccgacagct tcccatttca ctcatgacaa 75061 tatcctaagg ccttaccatg aactgcagag tcaacacaat ctttccatat cacctcctac 75121 tcctggcctc tttgcttaat ctgcatcagg cacactgaac cccttgccat ttctcacaca 75181 cactgagtgt gttccagccc cagggccttt gcacttgctg ctcgatcccc agatactcct 75241 atgatcatgt cttcacttac ttccatgctc agctgaaatg ttaccttatc agagagacct 75301 ccattgtttt atcaatccat ttgcagcctg aaaaagtctg attgtgattc ctgggttcca 75361 aacaccttga tccttcagct gctgatatta caatcagaaa ttaatactgg ttattggtat 75421 tctgacttct atttggaatc catgaagatc cctaggccaa cacttctcaa gctttaacgt 75481 gcatgtgaac cagaaagatg ctagccagtg cttcttagac ttttacatac aaaggaatca 75541 ctttgagagc ttgccaaatg cagattaaga ctcagcaagt ttgggactag accagaggtt 75601 tgcatctgta acacgcgccc aggtgatgtt gagcctgctg ctccccagct taccttttga 75661 acgacaaagt ctagaccaag gtgaggatac tgtccctctc atcccagaca cacaagagag 75721 gctgggcctt catccctccc 
atccccacct gtgggcaagt tactccatca gccagaagat 75781 ggtcagctct tacagcctca gaggtctaat gatgcaacag ctgcagcacc tgctccctgg 75841 gtgcccgcct gtggggtgag tcagcggctg gactgagatg acagctcatg agcatgtctc 75901 ctgggactgt gcctgtggcc tgggcttggg ccagactccg tcagcaacta aatgcacctg 75961 gaaggctctc ctcagctctg caaatcccac cagctgcctg acctgctttc cccaagccct 76021 accactcctc tcgcctactc ttaactgcct tcgtcacccc agttaggtca aatggcattc 76081 tcttcagaat gacctctacc atctgttgag actctgttat ctgccaggca gtgttgcagg 76141 catttccctt gcatcatttg atttaactct cataagtgtg ctatgggatg actgttatta 76201 ccttctttta agatgaggaa atgacactgc agtgactata actaatttgc tcaggatcac 76261 aaagctggga agtgatggag gtgggatccc aggctggctc tctgattcca tcttgggcag 76321 gggacatcac atctgtgcca ccaccctact tgtggtctac caggctccac cacctccatt 76381 atcttagtgg acaggtggag aagcaccacc tcatctttgt ggagggaaat gctgtacttc 76441 agcttccttc atcctaacac ccagggaaaa ggaaacatcc cagtcccatc tcagctagct 76501 ttcctgtatt acctaatata agatgggata ataactgaga ctcccaggtg aaggtcacag 76561 cccagggaca caggctcact aaaaacctga aacctaatca ttggactcta tatagaatga 76621 ttcgcctccc ccatttctgc ccagcacatc cgcagtgctc ttttgtagta actggaacac 76681 aatataagct atatgtcagt gagcttgaag atatatgaat agaaacttct caaactgaaa 76741 gacaaaggga agaaagactg aaaaaaaaat ggagcaccat gtttaaaaat ggcaggacaa 76801 ttacaaaagg tgtaacatat atgtaatggg aataccaaaa aagaagaaag ggagaaagga 76861 acagaaaaaa atatttgaag taataatgac tgagaatttt ccaaaattaa tgagacacca 76921 aaccacagat tcaggaaact aaagagaaca ccaagcagga tggatatttt aaaacgtata 76981 cctaggtata tagtatatat tcaaattgca gaaaatcaaa gatgcagaga aaataatgaa 77041 agaattcaga ggtggaaaaa aaacctacct tacgtatgga ggatcaagca taagactaca 77101 tcagacttct cctcgaaaac catgcaagta aaaagagagt gtggagtgaa atatttaaag 77161 ggctgaaaga aagaaaacac cactagctat agatagaatt ctgtatccag tgaaattatc 77221 cttcaaaagt gaatggcaaa tacttttgca gagaaagaaa aacggaaggg gtttgctgcc 77281 agtaggactt ccttgaaaaa gaagggaaaa tgacataggt gaaaaactca ggtctactaa 77341 aaggaatgaa gagcattcaa gaagaaagaa acgaaggcta aaatgaataa tttctttttc 77401 
atattttatg ctaatctaac agtttatcat tttcttcaag ataataatag taaacatgta 77461 ttgggtgatt atagtttata aattagtgaa atgaatgaca aaaatgttat aagagatgag 77521 gaaaagaatt gtgaatactc tgtgataaca tattctcact acacatgaag tggtattgtg 77581 ttatttgaaa atggacttgg atcagtttta agtgtatatt gcaaaatcac aggcaactac 77641 taaaaaaatt gtaaagaagt ataactgata tataatgaaa gaggaaataa aatggaatca 77701 tataaaattc tcaaggaaaa tcagaaaagg agaaaaaagt gtaagaggaa aaagaaacag 77761 aacaagggaa acacatagaa acagtaataa atatggttga tattatatca aatatattaa 77821 taattagttt atatgtcaat gttctaaata tgtgaattac aagacagacg ctagctgggt 77881 ggattaaaaa aaccagacct aactgtatgt tgtctacaag aaactcactt taactataaa 77941 gacacaatta gaatacaagt aaagagatag agatagatat accatgttaa tgctggaata 78001 tctgtattta tatcagacaa tgtggactac agaaaaagta tcagggcaaa ggggcaagaa 78061 gggcggagtg agcaagtgga gccaggagga attttagggc agtggaactc ttctatgctt 78121 tactgtaatc gtgcgtgcct gacattatgc atttgtcaaa actgctagaa ctgaacaaca 78181 catggagtga accctcacgt aaactatgga ctttcatgga taataatgta ccagtacttt 78241 catccatttt agcaaacgta ctgcaacaat gcaaaatgct gataataata ggggacactg 78301 tgggaggagt caagggtcta tgtaatatct ctgcacaatt tttctttaaa cctatcactg 78361 ctctaaaaaa taaagcccag ttttgttttg tttttaaaaa aagaggccgc gtgcagtgcc 78421 tcatgcctgt aatcccaacg ctttgggagg ccaaggcagg aggattgctt gaggccagga 78481 gtttcagact agccttggca acatggcgag atcctatctc taggaaaaaa aaaaaatagc 78541 ttggcatggt ggtgcatgcc tgtggtccca gctactgggg aggctgaagt agggggattg 78601 cttgagccca ggcatttgag gctccaatga gatatgataa tgccactcca ctccagcctg 78661 ggcaacagaa caagactctg tctcaaaaaa aagagagaaa gaaagaaaga gagagagaag 78721 gaaggaagga aagaaggaaa ttatcacaac tatcgcttca ggcgttggct gacctagacc 78781 tccaccccgg ccctcaccac ctggccagtc ggagccttgg tcctctacat ccatggggcc 78841 cctactgcct gagctaggca atctctcaag accccagggc tggccttggc ggacagttct 78901 ctaatttcat tctgggtaaa gggaccatga gcgagttgct catggcaaag tcccagagga 78961 tgaacaagag gctgtgagtc tcggtctgcc agagggattc taggagagag aagtagctta 79021 ataagcaaaa ggtggcaaat atgagccgaa agaaggcccc atccccttgt 
agtagacatg 79081 cacttctgaa gacggctcag tggattggag actgttccca gggccacctt gggaggcatg 79141 agggaaggag cttgggtcaa cctgtcatac ccaaagcaga gtcaccctca ctcaggacac 79201 ttgctgaaca gggacaaacc agcttcatag aaaatccatg aaattgtaga ctccaccaga 79261 aattttagac tccatctact tatttctcct ggattgttac attgaaaatc caggtctctc 79321 agtttagacc tttggggaga ctcttcagtc ctctccccat ctgctaacct tgaccaaaca 79381 caaaggagag aggggcttca gaggagggtc cctcggacct tggtgggcag gacccacttt 79441 ccctgtgcag aaggaaactt gaaatcagga gactgagcct cacccaggga caggccccac 79501 cgctgtgccc tgacaggccc ttgtggacag aaagggacag ggcgtagcct cacttcgacc 79561 ctctgggttt gctgtggggc cagctgggcc tggtgtggct gagctccaga gtgctgtcag 79621 ccactcccag ttccctcatg acctgggcag ggaatataat atcacctcta tctgtgcagt 79681 aagcacagca atgtcaggga cactgttttt agaacctggc tggctcccct cccccagccc 79741 aactgattat ctgtcctggt tgccaggcac ctgtttgcat tagatacaga acactccgca 79801 gggtcccgag taggcagttg tggcccacag gccactggca aacagaggca gagccacaga 79861 ggggctgctc gtccccaaat gtctctccag ccgctctgaa gctgctgcgg gctgctgtgc 79921 tggtgatggc tgagatcatg agatgagccc cgggggtccc tgggcatccc tgggaggtcc 79981 tggtttgtgt tgctgtggtg gtcttagtca ttaagagatg acactttata aagaggaaaa 80041 gagcttagta aaagtgttgt tctcaccagc aggtccatgg agaaggtgga acactggact 80101 aaagagacag gctgaggaag ccctcaagtc acccctgatg gaggccagac cttgaatgtt 80161 acaatccctc tgcttatcga cccttaataa aaatctgagc aggtcttctc tgcaggacac 80221 aggaggcttt ctggaaaaaa aaacatggaa aagtgagtct cctaagcgtt cccaagtgat 80281 tgatgcagga cacatcttaa aaggatgaaa ggaggatgga ggaaaacttc agaaggaaat 80341 aacacagttt gaatgaaaac acctattttg cagaaagtcc ggaagtgctc gaccaagaga 80401 tcgtgagatg ggagcagaga gtacaggaaa ggaaggagga gaaagaaagg ctggaaacct 80461 ttgaatgcac aaaaaaaaca ggtcatgaaa gataccacga gccaactaaa atctctggct 80521 gaaaatgtgc tggacaccat gcagttggaa gatttcacta gggaccacag tgatgagggg 80581 aacttggaag cagagctaaa gaggaaagca gaaaatgaag aaaccacccc cttaaaaatt 80641 aagtaaattt ttttaaagtt atgtgcatac attctgtata tacagaaaag atatctaaat 80701 tgtatgcacg cagaaggagc attgccccca 
caaggaacag ttcatcctaa atggtgcaga 80761 tgaagagact ttgacaaagg aagtttcctg agctgtgggc agggatgaga gctcagacaa 80821 atgatgggaa gcctcccaga tgccaaggtt tcccaccccc aggtttcaag ggaggagaga 80881 ggaagtgatg tcatctgagc ccagtgaggc cctgtcaagg gaccagggaa aactgccccc 80941 actttttgct cccatcctct cattgactgc aggtgcacag gcagctgcct gaaagagggc 81001 ctgcgaagtg caagccagag gagttggcct ccatggcaca gagcaggcag aaaacaaatt 81061 tggagaatga ccagcactct tgggtttatt tgactgaaag tgcttgtttt attgcattca 81121 ctcacatcca tttatttaat aaatatccat ggagtgtcta tcatgtcttc tgcactgttc 81181 taggggcttg ggatacagag atgaaaaatg ccaacccgat gaggcctgca gaccactaga 81241 acttacattt tagtgatggg caaggtcagg taataaaaga caagcaagtg atgggaatat 81301 gtgctctgca gacatttcca atccagtgat gggagaacag gggctatggg agagcaccca 81361 ggaggcaaca tttaacctga gatctaaaag taagaaggag gaccctagga gggtcagggg 81421 cagaatattg caggtagaag gtacaggtag gataagacca ggagaactct gcctctgggt 81481 ccaggggaag caaagtaaag tggggttgtg gggagtgtgc agagccctgt gcacagatgt 81541 gacatccagt gcagtgtgat cacccaggga agggtcttac ccctcgagag gtggggttgc 81601 agggaagggg gcatggagga atgtgtatgc caccttgacc tttttggttg tggagagaac 81661 ggaataagaa ttgggggaga ggcaggattt cagcgagggg tgtcacaggg gtctgctgtt 81721 ggggtggtca tcacaagatg aatcaaggac agatcgggct gtgtttggat ggttgtgagg 81781 atcagagagg aggaatgaaa gacacccttg ggtgctttct tataaccctg tgatgtttgt 81841 cactacttta agaaagcctt agtaaaatct ctgcaggaga gcccctgaca aaaagtcaga 81901 aaggctgtag tgtattaatt accccaggga aagcctgctt ttccctagca tagagtgtgg 81961 tgtgccgcat gtctctcttg aacttgcatt tggttcctgc actactccca caacaaggac 82021 ggtaggggcg ggtacaggtg gggctccagt gaagggtggg ggttctcagg tatgatgcgt 82081 ttacagcctg cattgaggct atatgatgcc cctcgtttgg agattccaag aggaatttta 82141 agtttattcc accagagtgc accaccatta aataagaaag actcatgagc tataagtgct 82201 catgaatgtg gtgggtgtac atgctggggc tgtgcctcgc tctttaattg tcccagcagc 82261 cggctgtggt gggtaggaag atcagggcgg gctctctgga ggaggcggct cctgagctga 82321 acactgaagg atgagtagga gctatctcca gtgcttttca aaattgaagg ggcagatgaa 82381 tcacttggag 
agcttgttag taagcagatt ctgattcatc aggcaggggc ctgagccttg 82441 ctttgctccc aggtgatgcc tatgctgatg agatacgaaa gagggaggct ctttagatca 82501 ttatagggca ggagtcaagg aaagatgaag ggacagggaa ggaaagatga agggacaggg 82561 aaggaaagat ttctggccag gacaaggcac aggcaaagac acgagggtgc acacagcttg 82621 gtctagggga acaggagagt ttgtgttctg gatccaggga ctcaggaggg agggcaggac 82681 ctggcagcag aacagatctc caacccacac agatgggctc ttcctgctca tcctctccaa 82741 tgcccctatc taccatgggt gatctttagc ctcagaacac tttatttttc tgaggagtac 82801 aaaggatcac agactggaag tcacactggg gaagcagcca tggagaacgc actctgccta 82861 aacacagctt gtacaaggaa tagcctgaag acaaggtcaa aggcttcaaa tcatgaacca 82921 actcacgagc tccggacaag ctcatttgtc ccatgagcag gatgtctgag ggctcagtga 82981 atcttagtga acacatcaaa tacaaataca ccttatgtct aagaatctga tgacaaatga 83041 gtatttttat cctctttggg gtaaaataat gtgtattctg ctgtgttctt gactgaagag 83101 gtatacatat acatataatt atatatacat atgtatacat acacacatat atataattgt 83161 gtgtgtgtat aggcacatta cacaatgcaa cacaacacct atatgctttt ttatttcatt 83221 ctacaaatat cctcagtgaa ctccaaccac cacacgaggt acaaggttca agaaaagatg 83281 ctacataggt tgcatgtact ttttcaaggg ttcgaacacg ctacttagac ttctattttt 83341 tcatttattc attcaacaga gatttattaa gccccttcta tgtgccaggc accaggcaga 83401 tttgggagat actactgtga acaagacaga caaggtcctg gcttcaaaga gctgaacttg 83461 actgagaggc gggtggtgac tgaggcctgg cggatgcagc ggctccctgc caagatccct 83521 ctccagttca gtttcccagg agccagggag ggagctcgcc tgctgcacag catcgtcagg 83581 gaaacttgta attacagcat ttattaggac agattagaga gccaaaagaa tcacattgtt 83641 aaaggaacaa tgaaattaaa aaagaggtta agcaaaaacc acaggagaaa aacccatgca 83701 ttcgccagct attacaggct cacaagatag cagttacatg attgaaatct cctccttagc 83761 ctctgatctg cttaggcatt tttagctgga agagaccaat catatctcta gtcttgagcc 83821 tggggccgtc ttctccggct gtgggtacag ttgggaggca gctttgcaga ggccaagggc 83881 cgtggtttga ccttgggcaa atgacttaat ctccaggagc ctcagtttac ttatctgtaa 83941 taattcctgc cttgcaggct ggccatacgg ccatacgact gagaaatcat aagtgtgttg 84001 cgaccagcac atacattctc aaagagaggt aactgccagt cagtggtaga ccaagccatg 
84061 ttagagcctg cagcaaaagg caagttcaga aatacagatc ctctttattt aaaatgttgt 84121 tattttgtca tcaaggaatt tattgcatta attttgatta tatatatagc attaaaacag 84181 tatctatctt ggctgggcga ggtggctcag gcctttaatc ccaacacttt gggaagccaa 84241 cgcaggagga ttgcttgcag ccgggaattc aagaccaaca taggggctct catagtcccc 84301 tcctccccca tgagatagtc atcatcatcc tcattttaca ggtgaggaca tggaggcacc 84361 gagacattaa gtaacttgaa gaccacacag ctaagcagcc agagttccaa tccaggctac 84421 agaaggtcag agcctgtgtt cttaaccacc aggaaacaca tggctttaga cttttaccac 84481 agacacgtgc gcccaatatg caaaccagcc gacccacaaa aactgctttg catttttaag 84541 cagttgcatt gaagcttaca gagtgttcca gtgcgctagg agccccactg acccatcccg 84601 ggctggccag gtgggctctt ctcataactg tgtgtctgcg gcatctgacc ctgcttgtat 84661 cttctcctgg aggactttga gataagactt catctgggga cactgagagt ccaatgaatg 84721 gagacactag gacacccccg actggaaaag acagacggat gtgcacactg tcaccctcta 84781 cctccttcca ccacagaccc aaggtggcac acaggaaggt gcacactgca ccaaaaaaca 84841 gaaaactgga aatgtgaatg cacaaggggt cggggacaag aggacccagg aggaggctgg 84901 gccaagaggg ggttagaaag taccctgtgg gccataggac ttcaacagtt cacacagttg 84961 agtccctaac tgaggccaag ctgtatgggc caagaacaag ggaaacaaga tcagggctga 85021 agcatgcccc ttccttggga gaaacacatg tgcttagggc ttggctttaa aagagatttt 85081 ccttgtgaaa tccactcagg ggcccctgag gaatgtcatg gactgggtct tgatgtgact 85141 ctttcttatg ccactgcata tgacaagcac tgcagcttcc gaggccccca tggggaaagc 85201 cctgagagag ggatggaggc tcccccatgg ctcccagcac atggacccta cctcctggga 85261 cacatttctg gtctgtctaa tgtgatggtg tcgcctctct gtgctgttca cctgctccac 85321 cccaagggta ggccctggtt cctgtgcatt ttgcactcct cccagcaaca caaaccagtg 85381 ccacaaaggc cctctgttga acttgcaggg tgcaaccggg ctcctcagtg gttgggaaga 85441 ccctgcactg ctgctctgac ccccctctgt ctacatggag ggacatgttg gggtccatgg 85501 agaagtcacc aggtgatcaa gatctgacct cccccatccc atgttcttcc gtgggaggat 85561 taatcactgc ctctctctag ctcccctttc ttagctaatc tggagtcata aaacatctta 85621 gtattaggcc acatccacac aagcagcagg taagttccct tttacatata gcctctccca 85681 ggtattctga cgtgggcgtc tcccccaccc cagcctactc 
acagatttca gcatccaccc 85741 tgagggtgtc tgtgtgaaga gcctgtctcc tccttggcca tccagggcct agctgctgtc 85801 cagttattgc tggaaccctc atcagtctgg ttgagtcaca aggaaaagat gttctgcccc 85861 aggcactggt ctggggacca cacattctcc ttctaaggtt gtaacatggc agactcttca 85921 ccttccaccc caaccagtgt gtcaatcaaa ggacagggcc acacaggcaa ggtagatcac 85981 agcagccagt tattaggaaa gaggactgag gttataatct cacaggccac tcccacctgg 86041 aagatggtgg ccttgttttc agggtcaggc atagtttacc aaaagacatt tggaaaaggg 86101 ccttgttatt tatagttttt aagctgtgca agtgaagaca ccaggaagag agcgctaggg 86161 tcagagtggg ttgagctggg gttttcgctg ttgttgttgt tgttgccatt actagaaaag 86221 agacaatgca aaaacccaag gcattgcctc agagcccagg ggtagggatt ctggactccc 86281 tccacaatgt tcaggtcatt ctaaagatgc tgagagtcca cgaggaatga gatgaaatga 86341 cagggccatg aatcaattcc cgtggtgttg aaagcttcag ggaggaaagc tcctggctca 86401 cagctgggtt ttcttgggtc ctcacagtac cgttgcgctg gtctctgttg ctacagcttc 86461 ttggtggtgg tgattgcagc tccttggtgg tggcagctgt gctctctgat cccaaaggac 86521 cactatggtt ccagaggaaa gagtggttgg agaacaagag cagttttcag cagaagcact 86581 tgtgactgag ataagacacg gaggagagct ccatcctcta agcccaagtg agtagcggct 86641 cttggtgaca gcaatgccag gccctacacc acttgcattt ctattcactg agtgtctatg 86701 gagggccagg cctggctctg tgctctacaa gcccctcttg gcattcctta attctctggg 86761 gtttgtgtgt gatccctagt ctatggttga gaagtaaatc agattcagaa agctggtgag 86821 gctagaacag ggcttgaact catgtttaca gagaagcgaa ccccagaccc ctttccctct 86881 tctctaattt gttcaatctt tgtctctgtt tgcctcccct ctatcattct ttatctttcc 86941 tatttctctc tcccttttct ttccctcctc tgtccctctt ttctcttttt tctccttttt 87001 ttctcatttt tctccttgct gttgtttgaa tgtgtcctcc aaagttcaag ttttggaaac 87061 tgaatcccca gggtgcaagc gttaagaggt ggggccttaa taagtggtga tcaggtcatg 87121 aaaccctgcc ctcatgaatg gattaatgtc attattgagg gagcaggtta gttattgtga 87181 gcgtggaatt gttataaaag caagttcagc cctcacgctc tggctctccc aagctctctt 87241 ctgccttctg ccatgggatg agtagcaaga agccctcacc agatgcagcc ccttgacctt 87301 ggacttccaa gcctccagaa ccataagaaa taaaattttt ttctttataa atgacccagt 87361 ctgtgatatt ctattataac 
acaagatgct cttcccacct caatcaactt tcactgccca 87421 acccagttgc acatagactg agtatcagtt gcacatagac tgagtacgtt gctgggtcct 87481 ggtcacatgg tgatgcctga atattctccc tgacctgaag cctttgcaaa tctgcttact 87541 aaactccatg aaaactgtgc tcagggtttt tcattgaaaa attcaggtga ggtgaatatt 87601 ttactcatct cttttcggtt ggttttaact tttatgtaac atttaattaa ttctttcaaa 87661 tgaactctgt gaatacctaa ttcagggaga gaagcttcaa aggagatagg aacatgctgg 87721 gagacagagg atgtactggg ggtagggagg aaagggaaag gaaacggaaa caaaggaaaa 87781 gggaagggaa gaaaatttag cattccccag catgactcaa ggttttcctg gtagttccta 87841 taatctttca ggaaagtaga gccttcttca gtgagaaatc tttccctaat attgaagtca 87901 actttgaatt taaatccttt ctgagccact gactatttac tgacagtcct tctgaccttt 87961 aactgtgagc taagattttc catttgaaag actaatgagg gatttattcc taggatacaa 88021 agatggtttc acataattaa atcaataaat gtgatacacc acattaaaag aaagacagac 88081 aaaaaccatg tgatcatctc aagagataaa gaaaagtcac cggacaaatt tccacatttt 88141 tcatggtaaa aagtcccaat aggtgagaga tagaaggagt gtacctcaat acaataaagg 88201 ctacatataa aaagctcata actaacgtca cactctacag tgaaaagttg ggatcttttc 88261 ctctaagatc agaaaaagac aaggatgcct actcttacca cttctattaa acatagttct 88321 ggatgttcta gccagagcaa ctagaagaga aataaataaa agtcatttga attggaaagg 88381 aaaaagttac attgtctcta tttgcagaca acatgatctc atatataaaa tatcctaaag 88441 actctaccaa agaactgtta gaactaatac atttaataat gttacaggat acaaaatcaa 88501 tatgcaaaaa tcagtagcat ttctatataa gaacaattaa ctatctgaaa aagaaatcaa 88561 gaaaacaatc ccatttacag tggcgcctcc ccccaaaatt acttagatgt aaatttaacc 88621 aagttaatga aagatctgta gactgaaaac tataaaacat gaatgaaaga agttaaaaaa 88681 agacacaaat aaatagaaag atattctgtg tttaagaatt ggaagattta aggccaggca 88741 tgacggctca ggcctataat cccagcactt tgggaagcca aggcaggtgt tacacgaggt 88801 caggagttca agaccagcct ggccaacatg gtgaaaccca gtctttacta aaaatacaaa 88861 aaattagcct gctgtggtgg caggcgcctg taatcccagg tactcaggag gctgaggcag 88921 agaatttgct tgaacccagg aggaggaggt tgcagtgagc caagatcatg ccactacact 88981 ccaaccttgg cgacagagtg agactccgac tcaaaaaaaa aaaaaaaaga attggaagat 89041 
ttaatattgt taaaatgtcc atactgtcca aagtgatcca agagtcaaat acaatcctat 89101 caaaattcca atgacttttt ttagcagaaa tggaaaaaac aatccataaa tttggttgga 89161 accacaaaag acccatgaat ggccaaataa atcttgagca aaaagaacaa agctagaggc 89221 atcacactac ctgattttaa aatctattac aaagctatag taatctaaac agcataaaac 89281 tgccacaaaa cagacacaga caagtggagc tgaataaaca gcccagaaat taattcatga 89341 atttatggtc aattgagctt tgacaaagat tccaagaaca tacaataggg aaaggacagt 89401 tcttcagtaa atggtgctga gaacattgga tatccagatg caaataaata aaattagacc 89461 ctcgtctcac accacacaca aaaatccact caaagtagat aagcgactta aatataagac 89521 ctagaacctt ataaatactg gaagaaaaca tggagaaaaa gctctaggac attggtcttg 89581 gcaatgaatt tgtggatatg accctaaaag cacaagcaac aaaagtaaaa atgcagccgg 89641 gcgcggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcggg aggatcacga 89701 ggtcaggagc tcaagaccag cctgacctac atggtgaaac tccatctcta ctaaaaatac 89761 aaaaattagc tgggcatgct ggtgggcacc tgtaatccca gctactcagg aggctgaagc 89821 aggagaatcc ttgaacccgg aagtgggagt ttgcagtgag ccaagatcgt gccactgcac 89881 tccagcctgg gtgacagagc aagacaccat ctcaaaataa ataaataaat aaataaataa 89941 attcgcacaa atagaattca atcaaaccaa aaaagctccc acacggcaaa ggaaatgacc 90001 atagagtgaa gaaacaacat atagtatggg agaatatatt tacaaatcac acatctgata 90061 agaagttaat atcaaaaata tgcaggaaac tcaaacaact caatagcaaa aacaaaaaca 90121 agcaaaaaca aatataaatc agttaaaatg ggcaaagaat ctgaataaat atttctcaaa 90181 aaagacatac gagtggccaa tgggtatatg aaaagatatt caacatcact aatcagagaa 90241 atgcaattaa tatcacaata agatattaat atcacttcac atctgttaaa atgactagta 90301 tcaaaaagat gaaagataac agatgttggc aaggctgtgt agaaaatgga acccttgtac 90361 actgttggtg gggatgtaga ttaatacagc cattatggaa aacagtatgg agatttctta 90421 aggaattaaa aatagaacta ccatatgatc cagcaattta atttctgggc atatatttaa 90481 aggaaataaa accggtagct tgaagagata tctatacttt catgttcatt gcagcattat 90541 tcacaacagc caagatatgg aatcaactta agtatccatt aatgcatgaa tggataaaga 90601 aaatactatt caacctttaa attagccctc atgctaattt atttagaaaa atttatttct 90661 tatggttctg gaggctggga agtccaaggt gaaggggctg catctggtga 
gggccttttt 90721 gctacgcatc ccatggcaga aggcagaagg acaaggtagc ttgagagagc aagagtgtga 90781 gggctcttta aaagaagtaa attctgtcat ttgtgacaac atggctgaag ctggagggca 90841 ttaggctaag tgaaataagc cagacacaga aggataaata ctgcataatc tcacttatat 90901 gtggaatcta aaaaagtagg gctcatagaa acaaggcata gaatggtggc taccagaggg 90961 tagaggttgg aggattgagg agatggtggt caacggacac aaaattccag ttaaacagga 91021 ggaataagtt caagagatct attatacata attagactac agttaataat aatatactat 91081 acattggaaa atttctagaa aggcagtttt aagtgttctc actccaaaaa taagtatatg 91141 aggtaatgca tatgttaaaa agcttgattt agccattcca aaatgtacac aaatatcaaa 91201 acatcatgtt gtacaccata aatatttaca atttttactt ttcaatttga aaaaaaaaga 91261 aaaatgaaga aagaaaactt cttgttggat tgagttctct atcatcaaca gggatggaga 91321 ggttttaggg aaagagaaaa tatcacctct ctcactgtaa gaaacagatt ttattattac 91381 taacctagaa cactgcattt tctctttgtt gttgggaaat ttctgatgac acctttaaaa 91441 aaacctgaga tttgtctctg ggtaagttaa atgttcatta agtgatcaca gaaagtaaaa 91501 gtttgggaaa ctttaccagg agcgtttggg ggaagcaaac cctgcgcaga tttcctttgt 91561 tcttgcaata gcagttccct cttgagtgtg acttggggcc cctcagttgg ccctgggaca 91621 tctgctcgct ttgggactct ttgtgaacat tacttgggga aggaagcact gggggaagga 91681 agcactgggc ttgggacagg gttgggcgct gcctcttcac tggaccatga caaggttgtt 91741 acctcaccaa ggagaggtgc aaaaagctta ggggcttgga tttctagatt tcagtgccaa 91801 ctatgccact tactggcttt atccttgggg aatttatcta ctctgtgacc ctcagttttt 91861 ttatcttaat tattaataca tacctcataa tgtgactgtg aggattcact taataatata 91921 tggaaaacca tagaatagtg cccagcatct aggaagtgcc acagccccct tcagcagcta 91981 gtgaaacctg cagaccactt ttcagagtga tattattatt tttttctagg tttactgagt 92041 tataattgaa aaaataaaaa tggaatatag atgtacaaca tgaagctctg atgcatatat 92101 ccattgtgaa atgatgacca caatcaagct aattaatgtt atctatcact tctcatagtt 92161 caaccttttt ttgtggtgag agtactgaag atctactctc ttagcaattt tcaaatctaa 92221 aatacattat tattaacaca gtcactgtgc cgtacgttag ctctgaggac cttattcatt 92281 ttatacctaa aagtctgtat cctttaacca acatctccta atttcccact gtcatctcta 92341 ctgccacctc tggtaaccag ccttctgctc 
tgtttctgag tccaaccttc ttagattcca 92401 catatgagtg agatcatgct gtgcagtgtt tgtttttctg tgtctggctt gctttcactt 92461 agcataatgt cctccaggtc cacccatgtt gttgcaaatg gcagaatctt cttcttgtta 92521 aagactgagt aatatccctg tgtgtgcgtg catgtgtgtg tgtgtttgtg tgtgtgtgtg 92581 tgtatcacat tttcttcatc cattcatcca tcaatggaca ctaagcacta aggttgattc 92641 cgtatcttgg ctattgtgaa taatgctgca ataaacatat gagtccagat acctcttcaa 92701 gatactgatt tcatttcctt taaatatatg cccagaagtg ggattgctgg atcatatggt 92761 agttctatat ttagtatctt gaggaatttc catactgttt ttcataatga ttgcagcaat 92821 ctatattccc atcaacagtg tacaagggtt ccattttcta catggcctta ccaacgtttg 92881 ttatcactta tctttttgat aatagatatt ctagcaggtg tgaggtggta tctcattgtg 92941 gttttaattt gcattttcct gatgattagt ggtgtagagc atcttttcat atacccattg 93001 gtaattcgta tatcttcctt tgagaaatat ttattcagat cttttgccca ttgttagctg 93061 agttatatgt gagttggttt tggtttgttg ttgttttttg tttttgctat tgagctgagt 93121 tccttgtata ttttggatat taaatccttc tcagctgtat ggttgacaga tacattcttg 93181 cattctgtaa gttgcatctg taggttgcaa cagagtctct ttactctgtt gattgcttgc 93241 tttactgtgt gaaagctttt ttagcttgat gtaattgtgt ttgtctattt ttgcttttgt 93301 tgcttgtact tttagtgtca tatccaaaaa gttattgccc agaccagtgt catcccctat 93361 gttttcttct agtaatttta aagtttcagg tcttatgttt atgtctttaa tccattttga 93421 gttaattttt gtgtagggtt taagataaga atccaatttt atttttattt tttgtatatg 93481 gatatccaat ttccccaaca ccatttattg aaaattctat cctttctttg ttgtgtatta 93541 acatcagaat aatattttta aatacataaa attcagaaga tgacaaagga aaccaattac 93601 attgaaatgc atacagagtt ataattctga aagagcaata tatgtgcctc tttgtaaaca 93661 catcatatat caaactgcag tgaccgttct aacaactatt gcaatttcaa aatcatgttg 93721 agtaggagga gtactttgag attctgaaac aacgttcttg tgctatgaaa tatccatgat 93781 tttgattggt gatggtatcc caggttttgt taatgctgct gtaatctgtt gcttccattc 93841 catagttgaa taaaatgctt gatatctgtt ggaaattagt aaaaataaaa acgtattttt 93901 ttccatccaa gttcattctc agaccctgaa gagtcacttc tctggattct gcagcaaagt 93961 tcccagctgg ggcagcaaga tttaggcaat tgaaaagaac atacaccttg ttctcagtgg 94021 caaaccacat 
ggaaagcttt aaatgtcaga gaagaattct gccattttgc tgactttttt 94081 gtagttctcc taataaacaa gtgttaagtg acaagctttt cagaggagat aattttcttc 94141 caagtggtct ttttatttac ctgaatatta agttcaaaga tactctagtc tctcaaagga 94201 aacctgtggg tggagtgttt tttaaaatgt caaattccaa acgaagctga agggcacagc 94261 ctcaagaagg ctctgtgttc accagtgcac agggacgtgt cagcccttta ttgccattgt 94321 ctacataaca tttttcaagt tctctgagtt gtgaggctgt cagtatgaag actcctattt 94381 tttctgctat taaatttttt tttcctccaa ctttttcttt ttctttttct tttcttttga 94441 gatacaatct cgctctgtca cccaggatac tggaatgcaa tgcagccttg aactcccagg 94501 ctcaaacaat cctcctgcat cagcctccaa agtagctggg actacaggca tgcactgtgg 94561 tgtggtttgt ctctgtgtcc ccacccaaat ctcatgttgc attgtaattc ccagtgttag 94621 gggaagggcc tggtgggagg tgattggatc atgggggcag atttctccgt tgctattttc 94681 gagatagtga gtgagttctc atgagatctg atggtttaaa agtgtgtggc acttcccccc 94741 accccctctc tcctgccacc ttgtgaggaa tgtccttgct tctccttcac cttctgtcat 94801 gattgtaagt ttcctgaagc ctctcagtca ttctttcctg tacagcctat ggaactgtga 94861 gttaattaaa cctcttttct tcataaatta cccagtctca ggaattcttc atagcagtga 94921 tgatgaatac acaccatcac acccagctaa tttttcttta tttttttagt agagatgtgg 94981 tctcactatg ttgcccaggc tgttctcaaa ctcctgggct caagtgaccc gcctacctcg 95041 gcctcccaaa gtgctggtat tacaggcgtg agctgccgca cccagccctc ctccaactgc 95101 tgctggttgc tatcataggt tctcactggg tgtccaccta acatgtccct tcaggaaggc 95161 tggtgtcctg acattcggta gcggaggcct ggtgttgagc tcacctgcat ttctgccccc 95221 ttctctggct ttctggggat gccctcctca acctgaaaat gacttccaac tcctcagcac 95281 agaggtgggc acgtgaccca agcaggtcca ggattccata agatcccaaa cacaaaggat 95341 cctgtaatcc tggggctcca gtggccgtct ttttcctttt ggagaactaa cctattgcag 95401 agaaagtagt agagctaaaa gaaagagaaa gaatcctgat agcatctgtg actctggatc 95461 cagctatcca taaagcgcca tatgctaatg cgctcagtta cccaagtcaa tgagtcccct 95521 tagaagctta agccagtttg atctgagttt ctgccacttt caggaggaag aatcttattg 95581 gtatatttta taataatccg gctgatatgc aaactgtctt caatagaatg ctctgtccag 95641 agtcacagtg gagtccatcg caacaaataa ggtcagacag ctcacagcta caactgggct 
95701 catgccttcc aagtatcaaa gtcataacaa acttcctctg tgtgagtcca caaggcattc 95761 ttcctctggg ggaatcactc cctttccatt gtcctgatac tctctctctg tggggctgta 95821 gaggagtgca tggcttttgc cctacttcca agccttctca gggaacacag cagggacaaa 95881 gagctgaagg ctcagccttg gccatggccc atcaggccca ctgcaggctt ctcctgggag 95941 aagatccaga gcctcgggct ctgggctccc atatgcaggg agaacctccg accacagaaa 96001 ttggaaggag ctaccactcc ctaggagaca tcttcccttg agctttttca agggctgaac 96061 aaacagagcc cacacttgtt cacggaatcc atgggcctca gcacatcaga gagaggagaa 96121 gaaattgacc cagagtgtaa cccaccccca acttcgggcc ccaccaagaa ttgtcagtga 96181 tgccagagga agaggcacct aattcaacag atcaatggag actcacatca cccacaagca 96241 ccacttatct gctatttcat cccacaccag tacctgatca tcaaccgttg caaacactca 96301 ttgctgtaat tgtatcatta gtttcaatca cagccaattt ggggctggca tgtgaatatc 96361 cttttctgtt cccactgctg atgaaatttc agaattataa ggtgaatgaa acctagtaga 96421 ttaaagtaat tgtgaaactg atgctaaaga acagtcctgt tccccatgcc gcctaaacat 96481 gtaatttcag acatcaaaca gaatctttaa gctgaaataa aatctctact tagtctagca 96541 ccattgtatt ctctgtatgg gatgaaggca gctcttaccc catctattaa gatttattat 96601 taaaattgcc tgggggccgg gcgcggtggc tcacgcctgt aatcccagca ctttgggagg 96661 ccgaggcggg cggatcacga ggtcaggaga tcgagaccat cctggctaac acggtgaaac 96721 cccgtctcta ctaaaaatac aaaaaattag ccgggcgagg tggcgggcgc ctgtagtccc 96781 agctactcgg gaggctgagg caggagaatg gcgtgaaccc cagggggcgg agcctgcagt 96841 gagccgagat cgcgccactg cactccaacc tgggcaacag cgagactccg tctcaaaaaa 96901 aaaataaaaa aataaaaaat aaaataaaat aaaattgcct gcgaattaaa aaaatacatt 96961 ttttcccttg gcaaagtcag cacctagcta tctcagttaa ctctctggag gagaaaaagg 97021 ggaacagtgg ttgtgaaatt tgagacttgc aagagtgccc aggactgcct aggcatagac 97081 tagactgttc cacctaaata ctcccataga aaatgcccta ggtgaagtct ccaggtgtag 97141 ttttcagagc aaggccctca cccagttctt caggttccat aatacattag tgaccacact 97201 aaatcttaaa gtccttgcca tcagttgaca taaaagtcca gtgagaactc tggccttacc 97261 agtgaagact aggccccaag aaagctgtgt gagctgttgt atgttgagtc agcctccagc 97321 cagtcttcta cccattgcta gaataacaaa tcaagtctcc 
tccttcaact gggactcatg 97381 tgatcgtcac cctccttgaa agacagccct gtcctaaccc cctccagagc tgccacatcc 97441 ctggtcccca aacaatcaag tgaggaggga aggctactgg ccacccacaa ggccctgaaa 97501 gtacagcgtg ttgactcagg aataatccaa gcctatggca ggaggaacag gactttatgg 97561 aagctgggag actgaagaag gagtcaccat ggcaaccagc tctccagctt tcattctcct 97621 ccccatcact tgatccagac attctccttc cagatcataa ggcctgacat ccatccactt 97681 cctatcatgt ggtcacaggc agatctctct acccactggc aaaaagtggg ctgatggaaa 97741 tgagcattgg gtgcatggtg cttggactgg gtttcatagt tatcgctcca aaatccatgg 97801 cctcttcgga acctcagaat gggaccttat ttggaaaaag ggtctttaca gatgtaatga 97861 gttagtataa catcattctg aatcaaagtg ggccctaaat ccaatgactg gtgtcctcta 97921 taaggaaaga aaacagagac acagagacac acagggaaga agccatgtga aggccaacgc 97981 agaggttgga gggatggagc aacaagccga ggaatgccaa ggactgcagc aaccaccagg 98041 agcttggaag aggcatagaa gctactagaa ggagccaatc ctgccaactc ctggatttca 98101 catttctgat ctccagaacc gtgagagaat acatttctgg tgttttaagc caagctgttt 98161 gtggtgctta ctaagccagc cacaggaaac acacacaggg ccacatgaac acttcaaaat 98221 acagccattg caaaacctca aacctggaat ctaattagcc tgaaccagcc attaaaagaa 98281 aaagtgatgt atgcctgata accattctcc atatttttat aaggagacag aaccagtaag 98341 aaaaagaaca tagataaatt aattcccccg ttttacaaga tatttatttt acaagtaacc 98401 ttccagggaa ttccctggta tgaacaaatg catgcattat gaacccatcc aatgatattg 98461 agttcactgc tggaatctgg gcaaaacact gcaagttgct cggtgaaaac aggtccaatt 98521 gtgtttggtt tgctctaagt cagctgcagg gaggcactgc tctgccacaa gacaggaggg 98581 aggaaagctg gggaaatcag ataggtttcc ttgttgcagg tgtgttcaaa ggagggttgg 98641 agcatcttcc actgacttca cctctatgaa gctcctctcc tgtcttcgta gtgagaacag 98701 agatggctgt aagtcaaaca cgcccggaaa aggcattcgt agaccagacc atccatgcac 98761 tgtgttaact gaccacttag ggcgagcagc attacaggcc tgtagttgtg gaggcaggag 98821 ctacagggac atttctgatg tggctccagc cctccacgat ggtatgatcc agttgaaccc 98881 agaatcccag cacatgtgga ccagtcagga agcaactgac tgagagcggt agaggaccca 98941 aatttcttcg gaggtattta cttcgttgag aaaacccaca gggcagctcc aatcccacat 99001 ccctggcatc ctgtggctta 
tgggcgtggg gccacttgcc tgttggaggc gtgatgcatg 99061 atccccagta tttgctctgc aaggccaggc atgggctcac cgagaaacaa cctcgccttt 99121 gtgttgtcct cctcctggac aatccatggg acaggccagc cctttggagg gcacttcctg 99181 ggctgtccat ccctcactgg ggaagcaaga aggctctgga ttgcctgccc cccagccacc 99241 tgatatgagt tgggcacagg gagtgtcatt agcctggggc attaatcccc tctggtagat 99301 gtcccaacaa acatgcacca atcagagaca ggggcagggc accaggggaa actgtagtct 99361 cctttgcttt gaatcagtta ttatggagga ctttatcaca aacacagaat tacattccca 99421 aatattcatt cttttcaaca caaagcttaa aaggtattgc agtttgaatt gtagttgaga 99481 catgactctg tagtacccac tactgcataa ggttcaagtc caaaatgatc cacagtcttt 99541 ttcttcttta atattttctc tttttcactg agatgtaaca ctgtaccttc tattgtaggt 99601 gctccaatct caggtatacc atccaataat gtcttacata tgtctacagc catgcaactg 99661 cctcccagag caacatctca gacatttcca ggctccagca gacttcctga tgttccttct 99721 ctttcaatac aaacctccag tcctacaggt aactgctatt acaacttcta gtaccatgtg 99781 ttattgagct tcatatagat cggaattgta caatgtgggc tccttgagat ctgctttctt 99841 ctgcttaacg ttatgtctgt aaaacatctt tgcatggggc aacagtttat tttttcattg 99901 ctatgtacta ttccattgaa tgcatatgcc ccaatttatt tagttcatcc taacgtggat 99961 gggtatgtgt tgttttctgt ttgggactat gatatataaa ggtgctagga cctttctcac 100021 acatgtctct tggtggccac agccctcgtt tcttttgaga atatgcccag gagtttctgg 100081 ggcatggaaa tatgtatgtt taactgtagt agacacctcc aaactctttt ccaaaactga 100141 accaatttac acttcaacca tcagtgtatg agagctcttg ctgttaccca ccattgtcaa 100201 cacttgttgt tgtcagtcct tttaatttta gcctttctaa tggatgcata atggtatctt 100261 actgtgggtg taatttgcat tttcctcttg gctgtaatac tgggcatctt cttatgtgct 100321 attggccact tgtattttcc tctattatgg gggtctccgt gaatcactct ataaagggcc 100381 tgacagttaa tatctctggc tttggaggtc atatggtctt gtccctacta ctcaactatg 100441 ctgttgtcaa gcaaaagcag ccatgggcaa tgtgtgaatg aatgggattg gctgtgttcc 100501 aacacagctt atttataaaa acaggctggg atgggaggtg gattttgctg gtgaggcttt 100561 aatttggtga cccctgctat attgtaatgt gcctgcccaa gacttctgcc cattttaaaa 100621 ttaggttgtc tttttaattt attcatacaa gttctctgta agttctaaat tccagtttta 
100681 tattcttaat catttatcat gttttgaaaa ttaaaaaaaa aaaacagtta ttatgttgga 100741 ccattcctat agcttttgcc ctgtgtttct accatagtcc cctaagagat gagatcactc 100801 ttgtgaaaag agcttcctgc tgttacccaa gagagctgtc tgctgccatc tttgccccag 100861 catgacagtg tggcagccca gcccagagca ctgtggctct gcttgtagcc ttggggagtc 100921 ccactgccta gaccaggctg ggtgaaaata gcctgacttc ttggaatgga cacagttttc 100981 aaaaggagca taaatttagt ctgccaattc atgtatatgc catgatggtc aaaattgagg 101041 gatctgggca aagattcctt ttccattctt agccaaatgc tttctaggtt gggcatcagt 101101 tgatggtgca attggaggaa attgcccaat tattttgttg cttcactaac aaaatgctta 101161 ctgtctttgc attgtaaaat ctttttttca tttataagct ccccaaggga gcaaaagcca 101221 tgcacagtgg tgtgccagga gccctcatga ttgttttgcc tgttatctta tttgagcaaa 101281 tattagtatc tcattttgca gcaaggaaac tgagggtcag agtagttaca tgaattgatc 101341 gggatcaaga gattaaataa ttgaccaagc cagaaatgtt ttactccaaa gtccatgatc 101401 ttccccatgg gcccatactg tggtttccct ttcatataga gatggggctg gaagggccat 101461 tattgaatat ctggtcaagt tccctcactt gatggatgaa cgctgagatc tggaaggagt 101521 caaccaactc atcacccaag attcccccaa caagctagca gcaaagctt // LOCUS AF027807 19924 bp DNA PRI 18-DEC-1997 DEFINITION Homo sapiens beta-casein (CSN2) gene, complete cds. ACCESSION AF027807 L10615 NID g2695660 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 9438 to 19924) AUTHORS Hansson,L., Edlund,A., Johansson,T., Hernell,O., Stromqvist,M., Lindquist,S., Lonnerdal,B. and Bergstrom,S. TITLE Structure of the human beta-casein encoding gene JOURNAL Gene 139 (2), 193-199 (1994) MEDLINE 94156198 REFERENCE 2 (bases 9438 to 19924) AUTHORS Kwiatkowski,D.J. TITLE A high resolution linkage map of human 9q34.1 JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 9437) AUTHORS Edlund,A., Johansson,T., Leidvik,B. and Hansson,L. 
TITLE Nucleotide sequence of the 5'-flanking region of the human beta-casein gene JOURNAL Unpublished (1997) REFERENCE 4 (bases 1 to 19924) AUTHORS Edlund,A. TITLE Direct Submission JOURNAL Submitted (01-OCT-1997) Molecular Biology, Astra Hassle, Tvistevagen 48, Umea S-907 36, Sweden REMARK bases 9438-19924 submitted previously FEATURES Location/Qualifiers source 1..19924 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 1..19855 /gene="CSN2" promoter 1..9389 /gene="CSN2" TATA_signal 9360..9367 /gene="CSN2" exon 9390..9437 /gene="CSN2" /number=1 mRNA join(9390..9437,14108..14170,15036..15062,16042..16062, 16158..16202,17307..17837,18756..18797,19519..19855) /gene="CSN2" /product="beta-casein" intron 9438..14107 /gene="CSN2" /number=1 exon 14108..14170 /gene="CSN2" /number=2 CDS join(14120..14170,15036..15062,16042..16062,16158..16202, 17307..17837,18756..18761) /gene="CSN2" /codon_start=1 /product="beta-casein" /db_xref="PID:g2695661" /translation="MKVLILACLVALALARETIESLSSSEESITEYKQKVEKVKHEDQ QQGEDEHQDKIYPSFQPQPLIYPFVEPIPYGFLPQNILPLAQPAVVLPVPQPEIMEVP KAKDTVYTKGRVMPVLKSPTIPFFDPQIPKLTDLENLHLPLPLLQPLMQQVPQPIPQT LALPPQPLWSVPQPKVLPIPQQVVPYPQRAVPVQALLLNQELLLNPTHQIYPVTQPLA PVHNPISV" intron 14171..15035 /gene="CSN2" /number=2 exon 15036..15062 /gene="CSN2" /number=3 intron 15063..16041 /gene="CSN2" /number=3 exon 16042..16062 /gene="CSN2" /number=4 intron 16063..16157 /gene="CSN2" /number=4 exon 16158..16202 /gene="CSN2" /number=5 intron 16203..17306 /gene="CSN2" /number=5 exon 17307..17837 /gene="CSN2" /number=6 intron 17838..18755 /gene="CSN2" /number=6 exon 18756..18797 /gene="CSN2" /number=7 intron 18798..19518 /gene="CSN2" /number=7 exon 19519..19855 /gene="CSN2" /number=8 BASE COUNT 6388 a 3593 c 3125 g 6818 t ORIGIN 1 gatccaaata aacacaatca ggaatgacaa aggcgacatt gccaccaacc ccatagaaat 61 attaaaaaaa accctcagaa gcttttatga acacttgtaa gtacacaaat tagaaacatt 121 agaagaaatg gagagattcc tagaaaaata tattctccca taattgagcc aggaagaaat 181 tgaaaccccg 
aaagaacaat aatgagttct gaaattgagt aagtcataaa aaatcttcca 241 accagaaaaa gccctgaagc agatgcattt acaactgaat tctgcaagac atataaagaa 301 gagctggcac cagtcctact gaaagtattc caaaaagtta aaaaagaggg acttgctcat 361 aacttattct atgaagccag catcattttc atccaaaacc aggcagagac acacataaaa 421 aagaaaactt tatgccaata cccctgatga acatagatgc aaaagtcctc cataaaatac 481 tagcaaagtg aatacagtag cacatcaaaa agctaatcca ccatgatcaa gtaggcttta 541 ttattgggat gcaagtttgg ttcaacataa acaaatcaat aaatgtgatt cttcacataa 601 acagaaataa aaacataacc catataatca tttcaataga ctcataaaaa gtgttagata 661 aaattcaata tcctttcatt taaaaaaccc ttaacaaact aggtatcaaa ggtacatatc 721 tcaaaatgat aagagcaatg tctgacaaac ccacagccaa catcataatg aatggccaaa 781 aactggaagc attctccttg cgaaccagaa caagacaagg agatttactc ttgtcactcc 841 tattcaacat agtattggta gtcatatcca gagcaagcag acaaaagaaa tataaggcat 901 ccaaatagga agaaaggagt caaattatct ttcttctcag atgttatgat tctatactta 961 gaaaaacccc atagtctctg cccaaagact tacagatatg ataaacaact tcggcaaatt 1021 ttgtttgagt tagtttcaga atacaaactc aatgtgtaaa tctcagtagc atttatgtac 1081 accaataaca tccaatctga gagccaaatc aaggatgcaa ttccattcac agtcgccaca 1141 aaatgaatat aataccgaga aatacagcta gccaaggagg taaaagatct ctacaatgag 1201 aattacaaaa cattgctgaa agaactcagc ggctacacaa acaaatggaa aaacattcca 1261 cgctcatagg tagaaagaat caatattgtt aaaatggtca tatggcccag agcaatttac 1321 agatttgatg ctattcctat aaaactacta atggtatttt tcacagaatt agaaaaaaaa 1381 ttctaaaatt catttggaat gataaggaac ctgaataacc aaagcaattt taagccaaaa 1441 gaacaaagct gaaggcatca cactgtgtga cttcaaacta tactacaagg ctaaagtaat 1501 caacacatca tggtactggt acaaaaacag acatgtagac caattgaata ggttagagaa 1561 gctggaaata aagccacaca cctacaacga ccagatcttt gataaagctg acagtaagaa 1621 gcaatgagga aaggactttc tggttgatca atggactttc tagtcactaa atgataactg 1681 gctaacaata tgcagaagac tgaaactgga ccctactttt caccatgtat agaaattaac 1741 tcatgttgga ttaaagactt aaatgtaaaa tctaaatcta aaactttaaa aaccccagaa 1801 gaaaacatag gaaacaccac tctggacatt ggcttggcaa agactccaaa aaatctccca 1861 aaacaattgc aaaaaacaaa gaaacaaaca 
aaaaacattg ataagtggga cttaatgaaa 1921 ctaaaccgct tctgcacaca gaagaaacta ttgagtaaac agacaaccta cagaatggga 1981 taaaatattt acaaactttg tacttaacaa aggtctaata tctagaatct ataagaaact 2041 taaacaatca acatgaagaa atgaacgatc ccattaaaaa tgggcaaagg acatgaacag 2101 atacttctca aaagaagaca cacgtgtggc cagcaatcat atgaaacaat gctcaacatt 2161 actaatcatt agggaaatgc aaatcaaaac cacaataagt tatcatctca caccagtcag 2221 aatggctatt gttaaaaagc caaaaaataa ccgatgctgg tgaggttgta gagaaaatgg 2281 aatgcttata ctctgctggt aggaatgtaa attagttcag accctgtgga aagcattttg 2341 gagagttctt aaataactta aaatatagct ccatttaatc cggtagtctc attactggct 2401 atattcccaa agcaatacaa atcactctgc cataaagaca tatgcatatg tatgttcatt 2461 gcagcaatat taataatagc aaagacatgg aatcaaccta gaagcccatc agtggtggac 2521 tggataaaga aaatattgtg cagatacacc atataatatt aatactatgc agctataaag 2581 aagaaatgat atcatgtcct tagcagcaac atggatggaa ctggaggcta ccctgagcaa 2641 attaatgcag taacagaaaa ccaaatacga cttgttctca cttttttaag tgggagctaa 2701 acatcagcaa cacatggaca caaagaatgg aaaaatagac attatggttt acttgaggct 2761 agaggttggg aggagagtga ggattgaaaa acttcctaat gggtattatg ctgattacct 2821 tggtgaccaa attatctgta tacaaatcct ccacaacatg caatttgccc acataacaaa 2881 cttgtgcatg tatcccttga aacttacaca aaagatgaaa aaaaaagaaa aagaaaataa 2941 ataaaaatca aggaaaaaaa tttacaggca tacctcacag acattgcagg ttcacttcca 3001 gagcaccaca atgaagcaaa tatttcaata aatataaata aagaattaac ctatctaaaa 3061 tgtcaatagt gccaaggtta agaaactctg gtttaaagta tagtacgtgc ttcataaatg 3121 tatcctgagc ttgaatgaaa ttttatgatt atatggaaaa taaaagcagt tagtatcaaa 3181 tgagaagtgt gtgctatgaa ttagggttat gggctttacg ctttcatgat acctgcttca 3241 tgaacgtatt ctctaagctc atatttctta ttccatgatt cttgattacc tttattaatt 3301 aattttagta gccatatttc agagaagatg tcattgatta tattcaaata atctcattga 3361 aggaatgaaa acactgtatc tgtataagaa atgctacact atttctcttt atataatgat 3421 ttgttaatta catgtcatat ttttcataaa tttgaatcat tatataaatc cccatgattt 3481 atttatttta ttttcctttt tccttctata ttttgatgaa gtttttaaat agtattccaa 3541 atattagaag aaagaggttt ccagtataaa gtctctgtaa 
attagagata aaatcttgac 3601 caagttatgg aactttgcag gtctaaagtt tttccttctc taaaataacc ttggaatgtt 3661 acatagacaa tatgtagatc ctcttccagt tattattcct tgaacatttg atgtcctttt 3721 attttaccct ttatcctgcc ttcatatctg cttgctttct tcaaggcttt atttacatgg 3781 ctgcagttat tatctttaca cttcacccac atctcatgct tgcgtatctc atattcgtca 3841 cccaatatag aaaagagcat tgggtgaaca aattctgttg ctaacaaata gtactagaac 3901 tctttgcatt gaatatatct gcttctcgtt cagcgctgtc ttatcaaatt tcagtgggtg 3961 caattctttt ttttattata ctttaagttc tagggtacat gtgcacaacg tgcaggtttg 4021 ttacatatgt atacatctgc cctgttggtg tgctgcaccc attaactcgt catttacatt 4081 aggtatgtct cctaatgcta ttcctccccc ctcctaattt tacctctgaa aataacaatt 4141 ctcttttttt tttttttgtg acagagtctc gctctgtcgc tcaggtggcg cgatctctgc 4201 tcactgcaag ctccgcctcc agggtcactc cattctcctg cctcagcctc ccaagcagct 4261 gggactacag gcgcccacca ccacacccgg ctaatttttt gtattttaag tagagacggg 4321 gtttcaccgt gttagccagg atggtctcga tctcctgacc ttgtgatccg cccgcctcgg 4381 tctcccaaag tgctgggatt acaggcgtga gccacagcgc ccagcagaaa ataacaatta 4441 ttaaattgtt cattgtgaaa gcctgggttg ccccacatga cattttgctt atttgtcatt 4501 gggttctgat gcttactggg ctcttagcac ctggttttat cttaaacctc tggcttttca 4561 attttttctt aggcaagcct cttattctag tgctttcctt aatcatagtt ttgcccactt 4621 ttctgggcta atcaagtctg taattatact gattagtgtg actaaaaatt gatgagcatt 4681 acaaggatat gttatattta ttgcacaatg cataaagcat ttttctaaga aaaaacaaat 4741 gatatataga atggacaagg catgcttgtt tttgtattac tttcttagaa accttaactt 4801 tgttgcctgt ggccttggca actcagagct aaaccacaaa gccacattgg gtagctaggg 4861 cctttcctat ttggacttag gacgctggag taatttcatc atttcttttc ctggcttcct 4921 cctggctcag tatgaacata aaatgaatat cagctgaaca attcaaatgt atagtagaca 4981 ttgacctggg ttttctgaga aatatgaaat atatgtgaca atcaggagtg tccccctcat 5041 tcatagatct aggatgatat taatgcaagg tatattataa agaaaaatag ttatgtaggt 5101 ctatactcag cttatttaat ttagaagggc atctatttgt gcttaagtcc tgaatatatc 5161 tgtcataggc agtaaataga aaaaataaaa attataagaa tgagtaacat ccaacacagt 5221 cagccaaaca cttatgaata gcaatcacta aagaggcaag gaaatttgta 
gacttatttt 5281 gtaaatgggc acaactcaag tattttgaag cctacctcag tctctacttt cgtactttga 5341 ccaagaccct gctcttttct gggattcagt attctttttt gcctcaattt tctgtacttt 5401 cataaacatt agcaatcaat atttacctac tggaaaacca aaacaaaaac aaagaacgtg 5461 agctcaaact tggtcagttt cattttaata aatgtgtacc tgtatcattt gttctctaaa 5521 aatagccatt ctctaaatgt tactgtccat cttaaattct tcaatagatc cctccatctg 5581 cagaataaaa ttcacattcc taaggagcaa agcatgcttt gtccatcgtc gtctgattgc 5641 tttctctttt tgtgttcaca ttttcatcct ctctcttcct tgaacttcat tatccaggtt 5701 cactacttga tttagttcta gttacttgtt tctcatatcc aagactacag tctttaacta 5761 tttttagcta gaattctcta tcctcctcga ttaactgctt tatctggctt caagattctg 5821 ctcaggactc atggcctcat ggagattcta ttttcctctc ccatgaatca actaaaatga 5881 actaagtatt ctccttctgt caattgcacc ttcttactgc tttattagca tattccttac 5941 atgtcgtaac aagtagattt ttatgtctgt tttcccacga aggttttgac ctccttgagg 6001 tcaaatgtat cttttatttt tcatttcatt ctggataatt taggaaatgc ctgcaaacat 6061 agatatttaa tatgaaatac attttcctct aacaaacttt atgttcacct attttcagca 6121 ttctgaatgt atggctattc cttggactag tcatcataca gaaattctct tctttctaat 6181 gctgaaattc ccaaatgaaa aaatctctgt ttttgtactt ctcaaaattg gtctcttctt 6241 aggccaggtc tttgcctttc ttgccatctc tagtcattca agccctcttt ttagtatgtt 6301 agcttaaaca atacctgatg gtttgccatt tttaacacaa acactaataa tcccaaccct 6361 atcctaacaa ttctcttaac ctattgatac taaatctcca ccatcttgcc atacaactac 6421 ttagataaaa aaatctgata tataatctca tataagtcat aattattgct cgtggatttg 6481 ttattcattt tcattttcac attaaccaaa aatttctgct ctttcaaaat tacatatatt 6541 taagatgtac aacatgatgt tttgatactt atatagtgaa atgattactg tagtcaagca 6601 aattaatatg tccatcacct cacataggtt actcttttta tcttgtggta agaacatcta 6661 aaatctactt tcagcaaatt tttattatac aataaaatat taatgattat catcctcaca 6721 ctgtacatta catgtctaaa cttattcatc ctacataact gcagctttgt ttcctttgac 6781 ctacatctcc ttatctatct tccaccctga acccctgata cccatccttc tacctctttg 6841 tttctatata ttcaattttt aaaacaagat tccacatata aataacattg tgtaatattc 6901 gtccttctgt atctggctta gttgacttaa agattcaact atgtataacc atgtccttca 
6961 atttcagttg agaacgcccc ctttcacctc actagaacat taagaccatt taacttcaac 7021 ttctctacct tttatctcaa acatgtatta tatctactta tcttcttatt cctttttgtt 7081 ttcttaggaa aactggtaca ttttttatag gtcttattct tttctccatg actgtacagg 7141 aacttgttca ttactttact cccacttctt tggtatgttt caactctttc cttattggcc 7201 tctttcttgt aagctacaag tcttccttac cctaaaaata ccttcccgtg gttctgatta 7261 tccctgtaca cctagtcaca atcttacagt tctccttatc tcctcttcta agtttctcag 7321 aagggcagta cacatgttac ttttaattcc tcacaaatta cttacttctt aaccatttgg 7381 agtgtatctc aatcagctgt gtagcatttc actgttgacc tcacttgcct tcctctagca 7441 atgcaggata tcaaggagag gaagtggatt ctgtactctc ttattcttga tatcgcaggt 7501 tttttttttc ctatttcctt ttgctatacc agcccctgta aaagaaagga tacttgaatt 7561 ttgttttgta caatttcttt tctcacttga tactctttcc ctagagacct tcacttactc 7621 ctttgaaggg agaaattacc tttaaaaatt atactcatga ttatttcttg agtatcacat 7681 ttaatttaaa acatcttctg gacaccttca aatttaccac gtacaaggtt ttactcagtg 7741 tttcatttct aatttcagcc tggtttcata ttttgattta taacatggct aacttttcag 7801 tcactcaggc ataatatatt aattcatttc cccaattttt cacatttctc atattttaat 7861 gagaagtttg tctttttatt gcctaagatg cccaggacca tttccctact tgagatcatc 7921 ataatgtctg tcttaggcta ctgcaataac ataacataac aacttccttt actaggttac 7981 ctgatttgtt gtcataaagt catgaaatcc aagaaatttt acaatccatt tttctttaaa 8041 tctctaatat atttgcttcc actgaattat gtcacagtgc atgcctgaca ttcacagtca 8101 tccatgatct gattctatca aacctctgca attgtaactt gcaatgtatg ccctctgaat 8161 atcctcaaac tggtatgttg ctctttcagt tcctgtgttt tcttcagcca aaatatctat 8221 ccccatttaa tcatctcgca aaattctact cattctctaa ggctcaacta aaagagcagt 8281 tccttaatgg caagctcccc tgccacacac agcacagagg ccctactttt ccaattcagg 8341 cataattaaa ttgtacatat ttcaaccaaa tgctacattg atagataaat tttataaaaa 8401 attatcagaa ggttaaaaca tgggctctga gactcacatg tgatcaaaca gaatcacatg 8461 tagtcagaca gatctgagtc caactggtat ctttgccatt aacaatatgg ttttgagcaa 8521 gttacttaac cttttggcat ctctgttttc tcaacctaaa tatgaagagg agagctacct 8581 catgatgttc ttgtgcagat tatacaagat aatatctttg cttagtgctt ggcaaagggt 8641 
tagaactaag caaaaaaacc catcattaat attatcataa ttatgcttga cctgaaaaga 8701 gataatcaat attaaaaggt aaataaaact ccgtgatact ttgcaaaatt tttattacag 8761 atttactgag gattttgcct atacaagcca cccagattcc ctcttcaagt tattccatta 8821 cagaggaaag tttatttatc ttgtgtacaa aatatttgtt ttaactaaat tatttttttt 8881 tccaaatcaa gtggccctgg gagaaacagt ttgcctcaca taactaagct accatcatgg 8941 acttgttatc ttgtcagttt cttcatttca gcataaagca ctaacaactg cttaggttca 9001 catgaaactg tggcctttgt gaagccctgt gtatatctag cgttcacata caattttccc 9061 acaactgaaa tctcaagccc cactaggccc taaagagacc taattatgtc cataataact 9121 tgggacaggg ccagagtatc actgtcctcc agtcattgtc ttttgttatt gaataattgc 9181 atttcctgga caggtttctt ttatcaactt gtgaattgtt cccttgttat aatgatgccc 9241 agaatttctg gggatatgta ataggcagaa atcattttct aatcatgtgg acttcttgga 9301 attaaagctc gctattgatc ttatttcaaa tcacaaaatt agtgtgtcat taaatatagt 9361 atataaacag tcacagaagc tgatggacca tcatccatcc agcactacct tcacttctct 9421 tcaatcttgg aaaaacggta agaattttag atacaaattt gttgtatctt ctcctaacca 9481 tgtatttcaa agtagcttga gctatagaag gaacgtagtt acttaaaata tagtgtttga 9541 aataaaagta tttaaggtag tactgagatt atttttaaca cttaggctaa aaaagataga 9601 agtgcccaga agaatttggt aaaaaaaatt gcaatgacat atagccttca tgagtttaag 9661 aaaatataat agctaatttt atgcatatat atatatatat atatatatag tatgaggcta 9721 gttcatagtt aatattaaaa ggagattcat ttttttaatt aatcttccaa tttgatttag 9781 catatcacag ttataaaaag aattatactt caattatagt atgggtagat aaataattac 9841 tggaaaagct ttgtccagca ggtagtttac agcagtagca tactgagttg agaaatggag 9901 ggaatggtat gggggcattt actactttaa atgtttataa acaatgtttt tagaagagtt 9961 tactcatcaa acttgtctag agcctcaaac ttcccagttt attatctagt ttgtaatatt 10021 tcaaaacaat attaaaatga gacagaaata cagaattaag taaaaaagaa acaatgtggt 10081 ttactttgta agaaaattct taatggggta gaaaagactt ggaaaccata aacaaagata 10141 atagatgata taaaagaaaa agtagcttag agtgtaccag ttgataatga agctctggct 10201 taaaaattag tatataaaag tatactagta tgtagtatat ttatagtata gaaatgagta 10261 taaaaactag tataattagt attagtatat attagtatat tagtatataa attagcattg 10321 
taattataca ttcgtatata aaattattat atatattagt agattagtaa taaaactagt 10381 aaagtttata ttacttataa aatatagaaa atgctgtggg atcttcccta tgtgtaatta 10441 tatataatag agtgttatat tctgtttctg aaccctgaca taaaaatttg ggaacttata 10501 gtggtcttag gcaaagtaaa attaataaaa acttaattaa cattcataaa aaatctagaa 10561 aaaatatttg ttttctgtaa tagaattatg ttcaaccttc aaatattgaa aagaacatct 10621 ctcaaataaa ttaaagactt ttttgatgaa gtattttagt tcaaaaattt agttcaaaaa 10681 ttgagtttca atatctgaga atgaagattt gagtggcgag taatattgtg gcacaaacat 10741 tattttgaaa taaattacaa aaaaatgtaa gaacacataa caaggagatg atttagtata 10801 ttttggtcaa aaatattaac atatatttca caagaagagg tagtcccaag cttagcagtg 10861 ggcaagaggc tctgacccct tggcggatca tcaagagaat cgtgtgtaca tttcaataaa 10921 gagaagagaa gaagcctagt gtacaatatc taaagtcatg tggcataaag gagaacagac 10981 attattagct atgtggggaa gatgaatata aacggagaag aaacaaactc aatagtccaa 11041 taaagtctct ggatagtgac acaaataagg aaagtgttaa aatgaaaacc tcagtcttat 11101 tggaaatgag gagaaataaa ctaaaatagt catggtaacc gtttagtgaa aagaaaaaag 11161 gtaaaaataa aatgtgactt ttttgtacac attttcttga tcagtctctt ccagtagaac 11221 tgaggctcca tgagggagta atatcaggac tgtaatattt tgttcattgc tgcggtcctg 11281 gtccctagac agtgtctggc aagaacagat tctaaagaaa tatttttaag ttaatgaata 11341 aatctttttt taaattttat ttgtcttttt aaattattat tattatactt taagttttag 11401 ggtacatgtg cacaaagtgc aggtttgtta catatgtata catgtgccat gttggtgtgc 11461 tgcacccatt aacacgtcat ttaacattag gtatatctcc taatgctatc cctccccact 11521 ccccccaccc cacaacagcc ccggtgtgtg atgttcccct tcctgtgtcc gtgtgttccc 11581 attgttcaat tcccacctat gagtgagaac atgtggtgtt tggttttttt gtccttgcga 11641 tagtttgctg agaatgatgg tttccagctt catccatgtc cctacaaagg acatgaactc 11701 atcatttttt atggctgcat agtattccat ggtgtatatg tgccacattt tcttaatcca 11761 gtctatcatt gttggacatt tgggttggtt ccaagtcttt gctattgtga atagtgccgc 11821 aataaacata catgtgtatg tgtctttata gcagcatgat ttataatcct ttgggtatat 11881 acccagtaat ggatggctag gtcaaatggt atcctagctc tagatccctg aggaatcgcc 11941 acactgactt ccacaatggt tgaactagtt tacagtccca ccaacagtgt 
aaaagtgttc 12001 ctatttctcc acatcctctc cagcacctgt tgtttcctga ctttttaatg atcgccattc 12061 taactggtgt gagatggtat ctcattgtgg ttttgatttg catttctctg atggccagtg 12121 atgatgagca ttttttcatg tgatttttgg ctgtataaat gtcttctttt gagaagtgtt 12181 tgttcatatc ctttgcccac tttttgatgg ggttgtttgt tttttcttgt aaatttgttt 12241 gagttcattg tagattctgg atattagccc tttgtcagat gagtagattg caaaaatttt 12301 ctctcattct gcaggttgcc tgttcactct gatggtagtt tcttttgctg tgcagaagtt 12361 ctttagttta attagatccc atttgtcaat tttggctttt gttgccattg cttttggtct 12421 tttacacatg aagtccttgc ccatgcctat gtcctgaatg gtattgccta ggttttcttc 12481 tagggttttt atggttttag gtctgacatg taagtcttta atccaccttg aattaatttt 12541 tgtataaggt gtaaggaagg gacccggttt cagctttcta catatggcta gccagttttc 12601 ccagcaccat ttattaaata gggaatcctt tccccattgc ttgtttttgt cagctttgtc 12661 aaagatcaga tagttgtaga catgcggcat tatttctgag ggctctttcc tgttccattg 12721 gtcttggtat cagttttggt accaagtacc atgctgtttt ggttactgta gccttgtagt 12781 atagtttgaa gtcaggtagc atgatgcctc cagctttgtt cttttggctt aggatttact 12841 tggcaatgtg gcctcttttt tggttccata tgaactttaa agtagttttt ttcaattctg 12901 tgaagaaagt cattggtagc ttaatgggga tggcattgaa tctataaatt accttgggca 12961 gtatggccat tttcatgata ttgattcttc ctacccatga gcatggaatg ttcttccatt 13021 tgtttgtatc ctcttttatt tcattgagca gtggtttgta gttctccttg gagaggtcct 13081 tctcatccct tgtaagttgg attcctaggg attttattct ctttgaagca attttgaatg 13141 ggagttcact catgatttgg ctctctgttt gtctgttatt ggtgtatatg aatgcttgtg 13201 atttttgcac attgattttg tatcctgaga ctttgctgaa gttgcttatc agcttaagga 13261 gactttgggc tgagacgatg gggttttcta gatacacagt catgtcatct gcaaacaggg 13321 acaatttgac ttcctctttt cctaattgaa tgacctttat ttccttctcc tgcctaattg 13381 ccctggccag aacttccaac actatgttga ataggagtgg tgagagaggg catccctgtc 13441 ttgtgccaat ttttaacggg aatgcttcca gtttttgtcc attcagtatg atattggctg 13501 tgggtttgtc atagatagct cttattattt tgagatatgt cccatcaata cctaaattat 13561 tgagagtttt tagcatgaag tgttgttgaa ttttgtcaaa ggccttttct gcatctattg 13621 agataatcat gtggtttttg tctttggttt 
tgtttatatg ctggattacg tttattgatt 13681 ttcgtatgtt gaaccagcct tgcatcccag ggatgaagcc cacttgatca tggtggataa 13741 gctttttgat gtgttgcata aatctttcca tacatattta taacttcttt atgccttttg 13801 aaaaattcaa tactgtaaat gggacttttt taaaagtggg gatagagttg ttagctgaaa 13861 aatctgaata gctggcaatg aagtttggaa tttgaaaaat gagaatcgca agccagaatg 13921 gattttgacc tccttcatgt gatataactt ctatttagta tttattctat ttattttcta 13981 aatgcagata tttttgttat atattatctc tctttttttt ttgttttata aaaagtaacc 14041 ttacctacat aagaaagtat atccaattga ccaatcttcc accattccat tttttctaca 14101 ttcacaggac ttagtagcca tgaaggtcct catcctcgcc tgcctggtgg ctcttgctct 14161 tgcaagggag gtatgtgcac aagaaaaaat tcctaaacaa tcaataaata gtggactata 14221 tgcttatttg tagagaataa catcaccaac attttttact gtataaataa tgaagaattt 14281 catgagaatt ctcttggctt ctatcaaaat catttatatt tacccactgt ctcaacagtt 14341 tcctatagtg ccccaaatgc ttcctgcacc aatgtgctgc tagtcactaa agaaagagca 14401 aacaaatcaa taagtaataa aaaatcataa aaatggcaac gaaatataat attgcataaa 14461 tacaactcca aagattctca agctagataa atatatctta ttccagtgat aaaatgtata 14521 tataccttac agcctagggc actgggtcaa atcctgtgtc tgtctgtaca aagacatcca 14581 tgggatgaag tacagagaca atcataatca tgatcataaa tatattaata ataatataat 14641 aaaaatattt aatacaaatt aaagtgactc ttcttttacc cataaaaaac tctgtcttta 14701 ataaatgtaa gataaaaata tattaataga ttactaaata taaaagacat taaagtaatt 14761 accttttaaa cctcaaaagt catataacat tttttatttc tcaaatttgt gaaagagata 14821 gctctgcata agtgatgtaa aaattaagta ggatgcatgt ttaacaatga gttagctata 14881 gaagttgaat ttttaaacac cttttcagaa ggaacaatcc aatgcatcct ctgaggtgag 14941 attatttttt tagagaaaat ttatgaacca taaaatagta aaattctcta atgatctaga 15001 agatttagct ggttgtcaat ttttttttcc cacagaccat agaaagcctt tcaagcagtg 15061 aggtaagtta acattctacc caattttaga acagtaaaat cctgtgctat ttttctatgg 15121 tgttacatca tggcagttaa gctaatgcag ctatgttaat gacatataag ttctagtaca 15181 tatttctttt atgtgtgttt aaggacaata agtatctgga taatacaaca ttctagaact 15241 ttgtaaattg gcttgctact tgaaattgtg ttattttgct tctttttttt ttaatgaacc 15301 atattgtcct 
gttttctgcc ttgaactctc tttactctga acttattcac cttaagcata 15361 tactgacggc cttctatggg tcagacattg actaggagct gtggtacaaa ctgaaaagtt 15421 agagatgcca gtgatagtta agagcttgct aagtagtagg taggcagacg tgcaaaaagg 15481 acacatttta taattacaat agccacttta gtaaataagc gagatttagt gtagtcctaa 15541 atttacctga gagagtcagg tagatattta ttcttacacc actcttcttg ggtgtgtgat 15601 cacttgataa gtaaatggta catcaataga ttctgtctct tcagaaaggt tatatgatcc 15661 ttaagggaag tgtttaactc ttatgattca tttgaagtga tcataatgta tcaattgtta 15721 ttagaatata ttttcaatga gtatttcttt ttttaaaaat cagatggggt tttagatatc 15781 atttgttgtc agggaatgag catggtggta gagagaaaat tagcatctat taattgcttt 15841 tttttgtacc atgtccttct tgactgcatt catctgactt tcttttacat aaccagctca 15901 tatagaacag tttaatacat aagtttaatg attatacaaa gttattgatg atattatcaa 15961 agaattaatg cctttatttc ctgatttata ttgcaaagta agaattgtta aaactaaacc 16021 taattttata tttttctgta ggaatctatt acagaataca aggtaaattt tcacatttaa 16081 aatgtacaca ttttcaaaat ttcctccttt taatctttac agatggtaac atatctttat 16141 atatgatata tctttagcag aaagttgaga aggttaaaca tgaggaccag cagcaaggag 16201 aggtaatttg ttaatgataa gtatatgttt aaaattatta taaagtataa tacatacaaa 16261 aatatttata atgtgtatgt tgattctaaa gaatgataat aaaataaatg ccatataccc 16321 accaaccact ttaaaaatta aacattgata aagtcaatat attttctagg atatgcatgc 16381 atttttaaac tcataattaa tttctagaat acagactaaa tacataatga tattactaat 16441 ggtatatctg ttttcaacaa gtacacattg gtgggaacat ttccaggttg ggaactatga 16501 tcctcttatt tccaaggtgg atatggtaat gaaaaggtgt atacggctgg taaaaaaacc 16561 tatgtaaaat ttgtcccata ttgcctttat tcccatatgg acaacacaaa atcctgtatt 16621 tcattaaatt acatttatgg tgatatgttt acccaattat aattttcaat tgctttgtgt 16681 accaatgaat tgtttggagt cacataaaat atttctaaaa tcaaatttaa acaaaaaaat 16741 tgatgtctct tgaaagactg aagagactat cttctccaag ggaaggaatg taagaatttg 16801 gccggtgtta aattacttcc tgagattagc cacaaattaa gactgatttt ctttttcatc 16861 aaaccaaaac aaaacaaaaa taaaccttgg aattgtccct ctaagtcttt ttaaactgag 16921 tctcttttca ctgtatgaaa acaatttggt ctactattgg cacaactgtg ttgaaaagta 
16981 aattctctct gaagaccaaa gtgtacagct acagctatcc aggcaattca ggaaaatgtt 17041 aaaaaaagtg tttctgaaat atccaaacat tgatttcact ttggcctgtg gagttaccca 17101 tgaagtgagt ggattaaaat ttttcaacaa acagtttact catttttctt cgcatgactc 17161 aagatacttt cttaaccaaa ataagtgaaa tattttctcc tctcttttta ctcatttatc 17221 attgtctaaa agagagaaat gaattcatta tgaatggcaa ttatagctta atcaaggact 17281 caaagattct ttttccttct ttccaggatg aacaccagga taaaatctac ccctctttcc 17341 agccacagcc tctgatctat ccattcgttg aacctatccc ctatggtttt cttccacaaa 17401 acattctgcc tcttgctcag cctgctgtgg tgctgcctgt ccctcagcct gaaataatgg 17461 aagtccctaa agctaaagac actgtctaca ctaagggcag agtgatgcct gtccttaaat 17521 ctccaacgat accctttttt gaccctcaaa tcccaaaact cactgatctt gaaaatctgc 17581 atcttcctct gcctctgctc cagcccttga tgcagcaggt ccctcagcct attcctcaga 17641 ctcttgcact tccccctcag cccctgtggt ctgttcctca gcccaaagtc ctgcctatcc 17701 cccagcaagt ggtgccctac cctcagagag ctgtgcctgt tcaagccctt ctgctcaacc 17761 aagaacttct acttaacccc acccaccaga tctaccctgt gactcagcca cttgccccag 17821 ttcataaccc cattagtgta agtccaaatt tactggcttt gctgtttcat tcaagatgtg 17881 tatgtgatgg tagaataaaa gaataaatgt agagtaaatg aattaaaaaa acagtttaga 17941 taagtgattc ttttattatt atactttaag ttttagggta catgtgcaca acatgcaggt 18001 tagttacata tgtatacatg tgccatgttg gtgtgctgca cccattaact cgtcatttaa 18061 cattcggtat atctcctaat gctatcccat cccccatccc ccaccccacg acaggtccag 18121 gtgtgtgatg ttccccttcc tgtgtccatg tgttctcata gataagtgat tcttaatgct 18181 tacctataca atagaatcac ctggagaact tttccccaca aatcccaatg cctaagctta 18241 cccagagatt ctgatataat tgttctagtt ttttgtgtag aggaaactga gtgttgagaa 18301 aaaaactatt gcatgaattc tggtttaatt agtctgttga gaattctgat ttagataaag 18361 taattaaggc ttacaaaagc cggaattaaa tttaataata tgattgaatt tggaaaaaaa 18421 agctaaaaaa tgttctgtca ttttccttgt gcacatctct tttacacaag ccttacttca 18481 catcttgttt ttgctataag tatatatgaa ggcaaaagac tgagatgctt atttcactac 18541 ttacaacatt cttaaggcaa gttttcttac taagaggtta tttatttatt tatttattta 18601 tttattttac acaagcctta cttcacatct ggtttttgct 
gtaaatatat atgaaggcaa 18661 aagacggaga tgcttatttc actacttaca acattcttaa ggcaagtttt cttactaaga 18721 ggttatttat ttatttgtat ctgtttattt ttaaggtcta agaagatttc aaagttaatt 18781 ttccctcctt atttttggta agttttggga gtttggagat ttaattgatc atttttatac 18841 atgatgtctt tttacattta attctcctag agaagtccaa tacagtgaaa atttcataca 18901 tacaagaact ttttttatta attatcaatt taatggttga ctatcattta ctgacctgaa 18961 actatctatc ttttgcattt caaataactt taattttatt tatgtactat tgacagattt 19021 gactggcttg ctttcaaggg cctatatact tacatttgat tatcactatt tttaggaaag 19081 acagaatata tacttatttt acttttatgg aaatatattt gagcttttgt caaaagccta 19141 tttgcatttt tatttctaac ctagccttca taaaatttgt atttacttta cttaaaatta 19201 tcttttaatt catgagttaa aattactcca agtgtaaagg ttaaaaagag gagagaacag 19261 cattgcaatt ctaagatata aagccttttg ggattatgaa ataccagaca tttcactgaa 19321 acaatttcaa gttcactaat atttgatgaa ctttggtgaa gtttggtgaa caaactttac 19381 atgcctccaa accgcaacag aatgcatttg caatacaatt tcttttgtga attagtcaca 19441 ccaaagttaa aagtgaagag agttgaatag ttacgtgtta taacataact aattatatat 19501 ttgctctcta ttccacagaa ttgactgaga ctggaaatat gatgcctttt ccgtctttgt 19561 atcacgttac cccaaattaa gtatgtttga atgagtttat atggaaaaaa tgaactttgt 19621 ccctttattt attttatata ttatgtcatt catttaattt gaaatttgac tcatgaacta 19681 tttacatttt ccaaatctta attcaactag taccacagaa gttcaatact catttggaaa 19741 tgctacaaac atatcaaaca tatgtataca aattgtttct ggaattgtgc ttatttttat 19801 ttccttaaga atctatttcc tttccagtca tttcaataaa ttattcttaa gcatatttca 19861 gttcttctgt ctttttttca aacctaatcg gcctctttaa tgttaacttt gatttattat 19921 tgat // LOCUS AF032455 22214 bp DNA PRI 15-DEC-1997 DEFINITION Homo sapiens aldose reductase gene, complete cds. ACCESSION AF032455 L14440 NID g2687577 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 2801 to 5398; 7766 to 20048) AUTHORS Graham,A., Brown,L., Hedge,P.J., Gammack,A.J. and Markham,A.F. 
TITLE Structure of the human aldose reductase gene JOURNAL J. Biol. Chem. 266 (11), 6872-6877 (1991) MEDLINE 91201333 REFERENCE 2 (bases 2659 to 3308) AUTHORS Wang,K., Bohren,K.M. and Gabbay,K.H. TITLE Characterization of the human aldose reductase gene promoter JOURNAL J. Biol. Chem. 268 (21), 16052-16058 (1993) MEDLINE 93340218 REFERENCE 3 (bases 532 to 3886) AUTHORS Ko,B.C.B., Ruepp,B., Bohren,K.M., Gabbay,K.H. and Chung,S.S. TITLE Identification and characterization of multiple osmotic response sequences in the human aldose reductase gene JOURNAL J. Biol. Chem. 272 (26), 16431-16437 (1997) MEDLINE 97341182 REFERENCE 4 (bases 1 to 22214) AUTHORS Bohren,K.M. and Gabbay,K.H. TITLE Direct Submission JOURNAL Submitted (28-OCT-1997) Pediatrics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..22214 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region 1177..1224 /rpt_type=tandem /rpt_unit=gt enhancer 2037..2047 /note="pseudo osmotic response element" enhancer 2110..2121 /note="functional osmotic response element" promoter 2659..3267 enhancer 2872..2886 /note="pseudo androgen response element" promoter 3001..3267 /function="basic promoter" conflict 3008..3010 /citation=[1] /replace="ga" conflict 3056..3058 /citation=[1] /replace="ag" protein_bind 3083..3089 /note="GA2; GA binding protein alpha has higher affinity for GA1 than GA2; GA binding protein beta 1 binds to GA2 only in the presence of GA binding protein alpha" /bound_moiety="GA binding protein alpha; GA binding protein beta 1" protein_bind 3113..3118 /note="GA1; GA binding protein alpha has higher affinity for GA1 than GA2; GA binding protein beta 1 binds to GA1 only in the presence of GA binding protein alpha" /bound_moiety="GA binding protein alpha; GA binding protein beta 1" conflict 3157..3159 /citation=[1] /replace="ac" CAAT_signal 3171..3174 conflict 3193..3195 /citation=[1] /replace="ac" GC_signal 3194..3203 TATA_signal 3238..3243 mRNA 
join(3268..3371,10614..10781,11467..11583,12571..12648, 13239..13361,13864..13970,14295..14376,14971..15054, 17022..17104,19572..>19614) /note="transcription start site determined by primer extension using placental mRNA; minor transcription start sites found at bases 3269, 3270, 3271, and 3272" /product="aldose reductase" protein_bind 3278..3283 /bound_moiety="SP1" /evidence=not_experimental CDS join(3306..3371,10614..10781,11467..11583,12571..12648, 13239..13361,13864..13970,14295..14376,14971..15054, 17022..17104,19572..19614) /codon_start=1 /product="aldose reductase" /db_xref="PID:g2687578" /translation="MASRLLLNNGAKMPILGLGTWKSPPGQVTEAVKVAIDVGYRHID CAHVYQNENEVGVAIQEKLREQVVKREELFIVSKLWCTYHEKGLVKGACQKTLSDLKL DYLDLYLIHWPTGFKPGKEFFPLDESGNVVPSDTNILDTWAAMEELVDEGLVKAIGIS NFNHLQVEMILNKPGLKYKPAVNQIECHPYLTQEKLIQYCQSKGIVVTAYSPLGSPDR PWAKPEDPSLLEDPRIKAIAAKHNKTTAQVLIRFPMQRNLVVIPKSVTPERIAENFKV FDFELSSQDMTTLLSYNRNWRVCALLSCTSHKDYPFHEEF" repeat_region complement(4402..4614) /rpt_type=dispersed /rpt_family="Alu" repeat_region complement(5221..5431) /rpt_type=dispersed /rpt_family="Alu" repeat_region 5875..6177 /rpt_type=dispersed /rpt_family="Alu" repeat_region 6512..6769 /rpt_type=dispersed /rpt_family="Alu" repeat_region 11731..11774 /rpt_type=dispersed /rpt_family="MER5" repeat_region complement(12769..13054) /rpt_type=dispersed /rpt_family="Alu" repeat_region 15344..15538 /rpt_type=dispersed /rpt_family="MIR" repeat_region complement(18920..19188) /rpt_type=dispersed /rpt_family="Alu" BASE COUNT 5292 a 5326 c 5436 g 6160 t ORIGIN 1 tctctcgatc tcagtttctt cgacgtgtaa catggattga tcatagtatc catgaaaggg 61 gtggttatga agattaaatg atttaatgta cataactcaa tagatatata agcagcagca 121 gcagcacctt catataacat atgccaagtg ctgaaccata atacagtgtc caaaccataa 181 taaagactca gtagtggttt gtcgttgtgg aagagacttc tgttgctcac catacccatt 241 cacttctgtt ttctggcata caggaggatg acactcccat ccatgaagat aggctggctg 301 ggactctacc acaagatctg gttaaagata taggtatgaa gtgatgtgtg ccacttccag 361 gactgtgcaa ttgattgctc 
atttgagact cttagttctc ttttgccttc ctaagccaac 421 cctgaaacct tgggttggac agcaggggaa acaatgtggt ggacctccat agcttgagtc 481 cctgagtgac aatgtgaaca gagcctactg taccctcagc tccaccagct tgtgttggac 541 atgtagcata taacctttgt ttgtttatag aagctttatt gatgtataat ttacgtacca 601 tatgatttac ctttttaaat atgcaattgg gtggtttttt tgtatattca aacttgtacc 661 agatctcgca gggccttgaa tgccaggctc aagagtttag gttttatcct atgggaattg 721 gacagttcat aagaagttgt cttttaattc aaaatgtgtt cattaaggaa ataatctgac 781 aaagagtatt ccccatttcc ttctgtcttc ttcctaaatt tcctaaatgt tataggttgt 841 attttccttc ttagcatcat tccatagcta tatgactttg ttttctactt tcaccttaac 901 attacctcat atacattttt ctctgtgtct ttgtcatctt tacttttgat ggctacacct 961 tatctcttgc tattgttgga tatttacagt tttactttaa gtatacagca ttttaaaaat 1021 acatacgcat atgtgtatat ctgtgtgcag gcctgggact aagggaaggc aagtgccaag 1081 gacataaatg taaaaaagct cctaccttct ggcatgaaaa tttgtacctg tgtacctcat 1141 ttgtcttacc ttggtcccag cccagcccta tacctagtgt gtgtgtgtgt gtgtgtgtgt 1201 gtgtgtgtgt gtgtgtgtgt gtgttttcct tttaaattat ttccttagga aaaattccca 1261 tgatgggaga ttactggttc agagcatgtt aagattccaa ttactagagt gtttctcagg 1321 aggattccat tgaaggtttt tttccctttt tcaaacaaaa ttttgatatg tatacttatc 1381 aaaagatggg agatcaaaaa tcttgagatc aaaaattttg gtatgtgtac attttacaaa 1441 gattacagag gcaatggggg atgttagggg gctatcacaa tagcatgtcc caactagagt 1501 gctattgttg ctatagaaat tccactgata caacttttca tgtctgcatg cccagccaaa 1561 atattctggg tttatttatt tattttacag acagggtctt gctatgttgt ccaggctggt 1621 ctctaactcc tgggctcaag cagtcatccc acctcagcct cctgagaagc tgagactaca 1681 ggtatcactc cagctttaga cacagttata acttcatgca atcaagctgt atactcaatt 1741 attatcaaac caacaacaaa gcctctgata ttgagtccat gtccctaaca gacgaccctt 1801 ctgtgtgtac caacctggcc actcaaggtg tatcattttc acaagagatt agctctttgg 1861 atttggaccc aggaggttca attggcagag caacctccaa gtcaagagga ctctcatgga 1921 ggaaggaact tgctgtttta cacaaagcaa gttcccagct ttaggctcaa gttcaaattc 1981 tattacttgg ggacaaatga atttgcccaa tagcaacaat ttttacaaaa gttacatgga 2041 aaaatatctg ggctagtctg ttctgtataa atttttccag 
gagggagcac ttttaaagaa 2101 agcaccaaat ggaaaatcac cggcatggag tttagagaga cctggtgctt gagtcactac 2161 caggcagatg gagttcccaa tcttgcataa ttaggggaaa gatcggaggg tgatggagca 2221 gaaagagcag ttggggtgag gtagagtatc atgaagagag gttgttggaa aagaaaccaa 2281 aattgtcttc aaggatcttc aagtgacttt tcttaggcta tcttggaaaa atgtgtgttt 2341 ctacctgcca attagggtga tcaatttact attttgtacc ttctaatttg aaatactgga 2401 agctacaata ttagagtcaa ctttttcttc tgtggttttc tgtctgttca cttttcttcc 2461 tctagagagt tttgataagt tactgtactg gaacttactg ctttgataga gtctcacttt 2521 ccaaagactg actgaaattt gtcttactga ttctgaatgg acagccttac tttccaaatt 2581 cttctgagag ttctggatca agcccatcaa ggctttctaa cctcaagaac atcaaaggat 2641 gctggctttt ggtggaaggc ttgctgaacc acacctagta tgcacagaca cagctaacac 2701 ctagagtggg gtgcaaagag agatctagaa ggctacaaaa agtctgggca aacccatatt 2761 ttgtgaaggt gctttgcaaa aggattgcaa gccatttttc tggcatgcac ctttgtctaa 2821 gaacaaagtg cggtaaacta tatgtgtagt cagtatgatt cacatagatt aaggacaagg 2881 tattcgtcag attcccaaag agcatcaaac tggggttgga caacaaaggc attcttctgg 2941 gctcttaatg aaaccaaaac ttatttctag tctatgtatt tatgggatgc ctgttatttt 3001 cgctaaagca ttcgctttcc caccagatac agcagctgag gaactccttt ctgccacgcg 3061 gggcgcgggc gagcgttggg ggcggaaaga atccgctgcc actaggacca ggcggaagaa 3121 gcatccccgc cgacccttgg ggaaggccgc cgcggcaccc ccagcgcaac caatcagaag 3181 gctccttcgc gcagcggcgc gccaaccgca ggcgcccttt ctgccgacct cacgggctat 3241 ttaaaggtac gcgccgcggc caaggccgca ccgtactggg cgggggtctg gggagcgcag 3301 cagccatggc aagccgtctc ctgctcaaca acggcgccaa gatgcccatc ctggggttgg 3361 gtacctggaa ggtaggtgcc tcgtgggggc cgcgggccgg ggctcgcctc acactctccg 3421 cgcggcctgt attggcgagg gaccccgagt gaccctgagc agctcgcccc cgcgggacgc 3481 ccggcgtgct gggagccacg cgcgggcttg cagggtcccc agcggctggg gtcggccttg 3541 cagagaccgg gggccttggc tccccgggtt ggccctgggc gtcagggcta gcatcctgcg 3601 tagtggggtt tgggagcagc tcacggggag cccccgccct accgctgcaa cccttgatgg 3661 gtctggccca ccagtcgcat tttgggtcta gcgggcgccc caagcggcac aacgcgagga 3721 gggaggcggg gaaaagtggc tcacagaccg gtgacctcgg gcgcagacag 
gacgtggacc 3781 gtcggcaagg tgtggggagc gcagacccag cctcctttcc ctcgaggcac ctgtagcccg 3841 gttgcctcac ttgtaaatag ttggtctcaa tcgtttcgat tcttgccctc cctgaggaat 3901 aagcatctca agccggtaga gggcagagaa ttaggtggcg cgagtttgcc cggacccagc 3961 tgttaacaag gcgcagtgcg agctcctcgg ccaggcgccg cctcggggtg ccctgggtct 4021 ggaactcggg agaaaaatct gggcaccccg tagctagttg ataattcagg aggcgccaag 4081 gttagcattt tccctgtaaa acctcaacac cacacagcgt cccgagaagt gcctagccgg 4141 atgagaatga caataggaca tgaaacatca taggatatga aaaatgataa tatgatatga 4201 aaaaccaaaa ataatagcct gagcctgatt atctgaaaat tgagcaacaa aggtggtggt 4261 atgggagctg aggcaaaacc tagaattttt aaccttctaa gtattgcata ttatattcca 4321 cagccataac ctgggtttct cagtttctgt ttttgttgtt tgcttgtttg tttgtgacag 4381 ggtctcgttg cccaagctgg ggtgcagtgg cacaatcatg gttcattgca gcctggaact 4441 cctgggctca agtgatcctc ctgcctcagc actgcgggta gctgggacta caggcgctca 4501 ccaccacacc tggctaatct tttttcttcc ccgcagagac agtctccgct tgttgcttag 4561 ggtgatctcc aattcctggg ctcaaagcgc tcctcccgac ctcgcctccc aaagctacag 4621 gtgtgaacca tctggcaggc ccacgtccca ggtttttctc ttaaaatttg aatatgggcc 4681 tgggcgttgg atatttacac acagacaagg gaagaaaaca acagaaagaa ctgagtgaac 4741 tgtgagatgt ttgaaatagg acagaagagt ttacatgtac cctaaaacct gctgttgaaa 4801 attagtaaat ataaggtaag aaaaaggtca tttgcatcag ggagagtgtt tcttggccag 4861 aatgatcagg gtacgtgatt tgtttcctcg aaaaactcca gtggtgtctc taaggttgta 4921 attgaattac aaccatgaca ccaagctaat ttccatctgt tgcatgttat gctcttggaa 4981 tggatgcaga aggatgcttt cttggcaata gcaaatactt tctaagcatg atgcaacgca 5041 cagatatata catgtaactg gaataaaaat tacttaacag tacttaacct atgtaggctg 5101 catttcaata tttttttcta ttccattttt ccaaaaatta ttgtcatacc tccagaagct 5161 agttttgtga cccatcattt gaaaatcact actctagggt aaataaggag taaaagttta 5221 tttttttttt ttttgagaca ggggctcagt gtcacccagg ctggagtgca gtggcatgat 5281 catgattcac tgtagcctta acctcttggg ctcaagtggt cctcatacct cagcctccca 5341 agtagctggg accacaggtg tgtgccacca cactcgggta gtttgttttt aattttttgt 5401 agagacaggt ttccctatgt tgtccaggct gaggaggtaa aagtttatga aagagggttt 
5461 cagattattc acttgagcac tcaatgcccc aaaaagctat gtatttgttt aagataacat 5521 cacatcataa catctgtaca tttaggttat gcattccagt agacagagtt cattgagaaa 5581 ttggataatg tctgcttcat tcattactat ttctagcact tctcacagtg cctggcaaat 5641 ataggagtag gtatccaaaa gttatgtgtt gaattcatat gaagaaggta aaacgacttt 5701 agagatagct gtatggtgaa aacgcccact tgtattcctg tggatgagga gatcaggttg 5761 gtgtagaatt gtacggtggg ggtagcgggg aatggtgggt caaagtaatg gtataaggat 5821 ggacttggaa tcaacttcct tttttttttt aatgagctta aaacctttcc ctggggccgg 5881 gcgcggtggc tcacacctgt aaccccagca ctttgggagg cagaggcggg cagatcatga 5941 ggtcagaaga tagagaccgt cctggctaac atggtgaaac cccgtctcta ctaaaaatac 6001 aaaaaattag cccggcatcg tggcacgcac ctgtagtccc agctacttgg gaggctgagg 6061 caggaaaatc gcttgaaccc cagatgtgga ggttgcagtg agccgaggtc ccgccactgc 6121 actccagcct gggcgatgga gcaagactcc atctaaaaaa acaaccaaac aaaaaaacct 6181 ttcccttgta tgtgtgtatg tattctttca cagatgctcg acagaagatg tgatcctttg 6241 cctgtctttg tgacagctgg aggaggccct gcaggcaggt gcagttcctt attctcctgg 6301 gtcttcccat gccgtagcgg ctgtgttgta cctttcagac cctcagcagg gctctggtag 6361 tctctactct tgaaggtcct caaactccat tctcttctca ggtgagggtt tcaggcagat 6421 ggggagacca tctctgcccc agacccatgt ggcttggaca gggtttcttt ctggcagtct 6481 tgccttctac attggatagt aaacaattac acctgtaatc ccagcacttt ggaaggccaa 6541 ggcgggtgga tcacctgagg tcaggagttc gaggaccagc tggccaacat ggtgaaaccc 6601 cctctctact aaaaatacaa aaattagccc ggcatggtgg tgggtgcctg taatcccagc 6661 tgcttgggag gctgaggcag gagaatcact tgaatccaga aggtagaggt tgccatgatc 6721 tgagattgtg ccactgcact ccagcctagg cgacaagagc aaaaaaaaac tcggtctcag 6781 aaacaacaac aacaacaaaa cccatcacag cattgtaaag ccaggaagct ttttaaaaac 6841 agagatgatg tctggtgccc accagtgtct tgttactgat agtgttttta atctagtgaa 6901 aaattgttgg aggtaaaagt cattgtaagt ctcctatgta tgctcgcaga atactcctag 6961 ggctgaaggt acaactatgt ttctccatct tttgccatga tcactacctc cctcccacct 7021 aggagccttg tgagactttt ttttccccta atttgcccca aaataaaatt gtaatatcag 7081 agataatatt gtacttctgc ttatggccag tagccttcta gagggtcaca agctgttcta 7141 
taagctgagg ggggtttttt tgccccagag aactaatttt tgtctctttg gggacaatag 7201 cacccacttt aagaatgcat gaggtaaaag aaaatgtgcc aggaaagact tgctccacct 7261 tccccaagtg gggtaagctt gtgatggccc aagatccttg gatggggtga gggttgaggc 7321 gctgtgggca tggaggtaac acaggagcca gatttactgt tcacttccct ggtgtctcta 7381 aagcccccat ctcttcctct tcaagaccaa agacatttaa ataagccctg cccctacagg 7441 gtcagtcctg gtccatgcac atctcaacta gttggaaaca gcagtgtctt agaatttggg 7501 acaaattgat taaaacagaa caggaggaag ttaatttctg ctgccaaacc tgattctgag 7561 tgatcgtttc ctgttccttg ctcaggatta tcagcacagc attctcatgc ttctactagg 7621 accgtttctt tgcccaaagt gaagtttgct ttgggccctg gcccttatgc tttgatttca 7681 gtagtaacac ctgctaacta gagaacattt caaaaatgaa gagttccctg ttcccagaga 7741 tccccgtggt taatattgca aaatatccaa tctctgagat actttggttt taccataaac 7801 actgtcctgt gaccgtcttc ctgtttagca ctgtggcttg ctttttgact tgcagaggga 7861 agagagggag gggtatcttg aagtaactag caagacctgt aagcaaaagg gaagaagact 7921 aggttttaaa aggagtgttg tatctgtgta ggagggagac actgatttta gttatcagtg 7981 tctcagagtt ggtattagcc agccggtgtt atcaaaggag gaactaggac cgctgtcttg 8041 aaggaggtgg tggaggactc aagggcagag gcttctcact gccatcactt ggtgccatca 8101 gcccagagca tttcaagtca ctgcccaacg tagagctttg cagagccaga agttctggga 8161 ggaatgggca tgcatggttt tgcccatcaa gggcacacag cttagtggca ggatcgggtt 8221 agaaacatgt aggatttgta tgattgagcg taattggagt gtttcaaacg cattctacag 8281 ctctcatcag taatgctgtg cctggatgcc acccagcatg ttttcattag tttacctctt 8341 tttagaagtc aatctggacc cctccttctg tttgcatttt ctctttgttg acatggaatg 8401 gggagatgtt gtcatcctgt gttggcattc ccaagtttag tgaatggttt ctcaccatag 8461 cttaagttgg ctgcttcatt tttcataagg caaattagtt tctctcaggg aagggctgta 8521 cattgagatc attaacatag aatgtggttg tggatcctcg tacattttca ttgcagaggt 8581 gactttttcc ctccagcacc tttgatcagc ctcccagcac acatgtctcc agcgaatggc 8641 acatatccaa ggaatgatca gcagcttctt cgtgacctct tgcagtcatt gctgtctttt 8701 acttttctag aaaaacctct cacaactttt ttttcatgtc gtgccaagcc aagcttgatt 8761 atcctagtct gtgcctcttt tggttccaaa gaactgaaaa accactttaa tcttaagcag 8821 gaaaaaaaaa 
agtgtgtttg gggtggggat tgattgactt ccccaaacaa cagccgctga 8881 cattaggtgt ttgagattca gggttgtaaa tgttgtcacc aggatttggc ctccctccac 8941 tttcctgcag cagctccagg ctgacattct cccagctcaa gtgaccccag cagaaccggg 9001 gctggactca ggggttccaa ttcaagtgca aagattgagt cttatggcct ggattgagtc 9061 atgttcctgt ccccgcacca gttgcatggc aggcagtgcg tggctaggct gagcctgcgg 9121 agtgtgccct cagcagctcc tcagagaaaa atcaatcaag gcgccatcaa gcctggagaa 9181 agttcagtgt gtcctgggca gtttcctgcc tgggccccag ttgttctttc tggggtaggg 9241 atgtcctctg ctctccctgc tgtcatcagt ggtggttgtg cccggacacc actcagggcc 9301 tgtggtaagg cattctttct ggaaattgag attctctgtt ccttcaaccg tgcagagtgg 9361 ggaaggctgc aagagccagt tctgaggtcc tccagtcaaa taagaggtcg ccaggtcctc 9421 tgagggtcgt cccctcttaa cccacacctt cgatgagcat ccccaccctg cctcgctgct 9481 ccctctgagg tggagagtga gctccatgct aacaggctct ggtaacgggc ttgttcaact 9541 gggccccttt cttccccaaa gcctgttgaa aatagctgat gtcgaaaaca cagatgtctg 9601 cagctgctat ggcctggggg gtttttcttt tcatttgttt ctaccccagc tctccaaagg 9661 aatgtgctct gttcattcaa cagcccgctg cacatgttta ttgctcagct ctgtgatcat 9721 agcaggatag tgaaagttgg gggatacgtg tccatgcagc ctacctgggt gccagtgaag 9781 cgcgatgcca tgtaggcact tcctcctcct cttcagtagc caggctcccc acagctgttc 9841 tggttactca gatggtcagg tggcttggcc atggacaccc attcagtcat tttgccaggt 9901 cagccactaa ccatttggtg atcttccatg atccacattc cccctcgttt ccctttaact 9961 tatttctctc tgctgatacc tctgaggggc catctcccgt gtctccaccc cttgaagcag 10021 atcctcatac agtgagaatt gacacatggg cctttgaata aacatcagtg atgtgcttaa 10081 cttcatgata atctcccaga accgcttgaa aacaagcgac tgccggtatc agaagccatg 10141 catgaaaaga gagcgtaagg ggcaggcatt cactggctga actggggagc cacatgggaa 10201 ttggggtcca tggacatcta tttttgaaat attttgagat atttgaagga aatggtttcc 10261 cttactatct cagcattcag agtgtgggga aatgttttta atacagtggt ttgcaaatca 10321 gtggatgtga agaagcaaga aaatgagatg gcttcagcat atcttttaaa agtgatctca 10381 cctacactta agtggctgtc agattctcct acagggcaat gctgtgtggc ctttcacttc 10441 catcccccgc gtctgaggct gatagtccag aatgctgctg aggaagtgca cgtggtctgt 10501 ctttgctgaa 
caaagtggca ggctggtcag cagggatcgg tggctgccac tgcccctcct 10561 ccttgctgtg cagactctgc attctgcggg gctcacgttc tttctccttt cagtcccctc 10621 cagggcaggt gactgaggcc gtgaaggtgg ccattgacgt cgggtaccgc cacatcgact 10681 gtgcccatgt gtaccagaat gagaatgagg tgggggtggc cattcaggag aagctcaggg 10741 agcaggtggt gaagcgtgag gagctcttca tcgtcagcaa ggtatcgttc cgcggtgggg 10801 ctggaagggg ctctcggtcc catcatctgt tggtgcatca gcccaaagcc aagtccacac 10861 tcaatacagc tacacttttc actgatgggc gattcttcca ggggaaacgt atcacccacg 10921 gccatacagg gctccagagc tcgcccatcc ttggctctct gttgaatggt cattccatct 10981 gggccccatt gcttagagtc ccgtgctgat gaggctgagg ttgtggattg gccctccctt 11041 tccagggagt cgctttcaag gatggaactt ggttcccatc tgcaaactga atctccagtg 11101 ctagccactg tctttgacct gagatactga tgcacaaagg acatgaggca ggaggatggg 11161 gcctggctcc acaaacacca gcccgggggg aaagtttcga gagttcttgc ttaggcctag 11221 ggcaggctag ccaatctgtc cattccattt tgtgtcctct ggtccttgta agaggtctct 11281 gtgacctggt catcctgcct gcctttggaa agttaggatc cagtacgcag tggaaatgcc 11341 ctcacaacca ttcacagcat catggtaaaa caaatgggtt ttggggagag gagactgctc 11401 tgtctttcca gccaggaagc tgccttgtgg ctggcaatga tgacatgtgc cctctcgctg 11461 gcttagctgt ggtgcacgta ccatgagaag ggcctggtga aaggagcctg ccagaagaca 11521 ctcagcgacc tgaagctgga ctacctggac ctctacctta ttcactggcc gactggcttt 11581 aaggtatggg atgccttggt gcagaccagg ctcccttggg gctcaggtcg aacctgtccg 11641 gttcgtgttt tggagctccc ttagcagatt acgtgtgggt gactttctgt tcttgctgct 11701 caaggtatag ccagtggacc atcagcactg gcatcacctg ggagcgtctt agaaatcaca 11761 gtctcgggcc ccacgcaggc catttgaatc cgcaccagca gtgactcata ccacacttct 11821 tcaaagtttg aggagcactc gtctttgcca gttggtagat gtgcaattta aaactgtcag 11881 caaacacagc tgctgtctgt agtgcacaca gccatgaggc acggtcagcc ctcccctcca 11941 ggcacgtgca cacaccaacc cccatacaca tgagtcccta cccacgttct acatgcatac 12001 cctgtcctgt ggaggcccgg taactatttg cagtctagtt gtttgcacaa gaagaacttg 12061 agagttactt tgtcttttgc cttttttcta gccactgcct cagttttctg ctgtgagcca 12121 ggttcaaagt gtgggtttag ctttatcccg tagccctgtt ttattgcacc acagcatatc 
12181 agggctgaag ggacttgact tttatctttt gattttaggg gtaagctggg gaggctgggc 12241 caggtggtag ttttataggc cagtgacccg ccagccttgg gccctgtgag tcctggtctt 12301 gagtccttcc tgtatccaca cattgcagtc agtggcgggg ctagtctctg tgaccaagta 12361 gtgtccatgt ctccatctga cagtctcagc tccagcccgt ttgataaaga agtgttggct 12421 atctgcctat ttccttttca tagtggcaca cagattatcc ctagccagct gtggtggttt 12481 agaattgttt tagatgccag cttcccccag caggtctgtg aaggactatt actgattgtt 12541 ttgtttgttt gttttttatt gtaatgacag cctgggaagg aatttttccc attggatgag 12601 tcgggcaatg tggttcccag tgacaccaac attctggaca cgtgggcggt aagacagccc 12661 ctggttggac cttgttttcg tgactggttc cctcttcttc ttagaagcaa tctgcctgcc 12721 atttctcaca ttgcattttg ttattctgtc cctgttgttt gctctctctt tttttttttt 12781 gagttggggt ctccctctgt tgcccaggct ggagtgcagt ggcacaatct cggctcactg 12841 caagctccac ctcctgggtc acgccattct cctgcctcag cctcctgagt agctgggact 12901 acaggtgccc gccaccacgc cggctaattt ttgattttag tagagacggg tttcaccatg 12961 ttagccagat ggtctcgatc tcctgacctc gtgatccgcc gcctcagcct cccaaagtga 13021 ctgggattac aggcgtgagc caccgcgccc ggcctgttgc tcttcttatc tcattaaagc 13081 cagcctggct ttcgtctcac tgtctcttag cttctgaaag gtctaattca ctggcactga 13141 agagtgaggg caggacccgt taaactggtg caaacgacac ttctgatcct gtcctttctg 13201 ggttcgtttc tgcaccacac agacactttt ctctgtaggc catggaagag ctggtggatg 13261 aagggctggt gaaagctatt ggcatctcca acttcaacca tctccaggtg gagatgatct 13321 taaacaaacc tggcttgaag tataagcctg cagttaacca ggtaaacatc ccccaacgac 13381 actggcattg caacgttgct gttttactgt ctgatgctgg tagggatttc tcgtccacct 13441 cacccagcaa gtggccagga ggcagctcct ggtgttggca gagtcctgag ttatgattca 13501 gcatgacaca gcacatctgg gagaggaaac ttacaggaaa gaaagagttt gagcctggct 13561 tgtgacgatt tggggtctga caaagaggca gccatgagat cctcagaact gtgatagttg 13621 ataagtggct aaagccaaga aaagcctcta gggcccatga gaacatctat aataagataa 13681 taataacacc aaggatacta cctgggaact gtaggaagtt acagcttatc tgggaaatag 13741 ggtgtttgtt atacatagtt ttctgtatgg ggtagggagg gactttagca taggaagctt 13801 catctgtgtc ccatcagtgt gtagccacag tgctcccatc 
agctcttgtt tttgcatttg 13861 cagattgagt gccacccata tctcactcag gagaagttaa tccagtactg ccagtccaaa 13921 ggcatcgtgg tgaccgccta cagccccctc ggctctcctg acaggccctg gtgagcttcc 13981 cacaggctca tgctcctgtg tcacttggtg aagattagaa aagactaaaa agaacaattc 14041 agcctcaagg aagatgtttc tgggggtgat ctgcaaaact tcccctgagc ctcttgctgg 14101 ttttcctgga agggttggag tttctagtag caagacacct actctttttc caagggttgg 14161 gggttcttaa cccagctctg aggaggagac tggtggctgc cctctgaggt cgatgggagg 14221 attagggttg gggtttctgg ctgctgtgag tggtggccat gatggtgtga cctggactgt 14281 ctttctatcc tcagggccaa gcccgaggac ccttctctcc tggaggatcc caggatcaag 14341 gcgatcgcag ccaagcacaa taaaactaca gcccaggtac agccacttca ggtgttgctg 14401 accgtccaca actgcctgca ttcctgacag tcctgttagc caagaggagg aagtgactga 14461 gcctgttaca ccctcacagg aagtatggtt aggggtcctc aagtacagag tggaaagggc 14521 acagatcggg gttttagaag actctggcat gggctcttag attaatagtg tcctgcgtcc 14581 ccactactgg caagggtgac tgcccagcca cgcttgttca ttcatgtgga acctcatctg 14641 tacaaatgta agagctctta gccgcgcagg gaatgttctt tctcctgagt gtagtgtgca 14701 ttctagccag tgaggcctca tgtggtctca tgatatgcct gagacactga agcgtggtgg 14761 cacagtggct agcgcaggac tctggagtca gatctggacc tgaatgcgtc gcctacctgt 14821 tgctagctgt gacctgacat cttggagccc ctctctgatc acctgtggag ttctagcacg 14881 tccttctgca ggttgtgtgt gtgagagact gagatgatgg gtgcgagtgc ctggcatgta 14941 tacacactca ctgtctcctt gggctcacag gtcctgatcc ggttccccat gcagaggaac 15001 ttggtggtga tccccaagtc tgtgacacca gaacgcattg ctgagaactt taaggtaaga 15061 tcttggctgg tcaggcctgg ccctcctcca tggagtgggg gatgggggag gcctctcatc 15121 ctgtctctgg agtgtcatct gtgggatccc caccatcctc tcttctgagg ccagggagct 15181 gtggcgagca agccaagact gagactgaca cctcaccagt ggagccgtgt gccaggggca 15241 ggccttgggt ccagggccgt gctgtggcaa tacacctaca cctttgctca ggcccttcag 15301 cacaccgaga ggttacccgg ggagaatctc gctcttgagc ttcactgcct ggacctgccc 15361 tgcactggag ctctgtcagc taccagctgc ataacctggg caagtcgctt aactgctcga 15421 tccgtcagtt ccctatctat aaaataatat ccattttgag ggttgttacg aagattaagt 15481 gatgcatcag agcacttcac 
atggtgctta ataccttagt aagtgatcga gaaatgttgc 15541 tgtgctgata agtccaggga ctgcaaagga ccctctgggc catttagtcc actcaccagc 15601 tctgcaagga gttcgtgtga tgatgggcag agctagacgt gtgtcaggac ctctggcgaa 15661 cgcctcagtg tggcagctgg agcagctcag gctacttggc gccacctccc tccctgcctg 15721 cccttgcagc attggtggcg ggggtgtggg ggaggggggg cacgcccaaa ggagagcagc 15781 tcaggggagt cctgatggcc caagagcctc tgttctcact gagcaaagtg gctctgccag 15841 ggatgctctg tctttatcca agatctgtgc caagcgcagc acagccatgg gcctggggct 15901 cttggcaggc cccagcttag gtgagcagct gattttgccc aattccactg cccaaagact 15961 gtggccatag tgcctgccac ctgtcacata tgagacagga tgagagtgat ccctgaagag 16021 ggaagtggag ctcattgttt aggagaggag acgtttttct tcttacgtgc aggtagatgt 16081 tctaagtaca caagataaaa tggcaaaaca gtggccactg agggaaaggc atatcccacc 16141 agagatgcat cgtttgtttc gtatttcatt ttttaaggaa ttaggaaaga aaaaataaat 16201 gactaatctc aaaatgctaa cctaattatt ttatttttgc atcttactgt atagcttttg 16261 cacatttttc tgtgatcata accacagagc ctgtaggaag actctctttc tgcacattct 16321 gtgatcatct gaagctaaga gccttcggct ctgggagtgt atgtcagcct cctgtgggtc 16381 tcaccagtgg tgcctgcctg cctccctccc tcccagcctc cctgccagct gagttctgta 16441 ccgtctgctg ccttcatgca ggtggttctc ttctcccagt atacctttct tgccctctct 16501 gtcaaaatct tatacttctt aggccatgtc aaataccatc tcctcttcgg ctgtggttgt 16561 tcagtttccc ggctggaatc cctttggaat acctctgccc ttcctgtgct gttgagtgtc 16621 ttatagagtg tgtgattagt tatttgtatt acagttgtag aaatacctct gccctttgtt 16681 tgcttcataa aatagaggca aagacacatc ttacccataa ttatattcct cactgcaact 16741 tatcacaata gaagtgctca gtagtgtttt ttgaaatgaa ataaaatgta tcatttccct 16801 gggaggttct ggaaaggaag gactgtttcc atcgttccat atattatcta gcccaacatg 16861 acacatttcc tgaatactaa tatatcagga ctttttccta agttcagttt atcttatcgc 16921 tcctctctgc tcaagccatg gaagacatct gggtcctgca cagtattttg tcttttattt 16981 ttatttgcag agcttactgg atttttttgt ttgtttttta ggtctttgac tttgaactga 17041 gcagccagga tatgaccacc ttactcagct acaacaggaa ctggagggtc tgtgccttgt 17101 tgaggtgagt gagccccagt gtgaggagca gttcccagga gatattcaga tgctgcacag 17161 
caagagagtg atccctgcag gcctcagcgc caatgtgctc tcgtcacagc tgttcttgct 17221 ttgcacatta ccttgctttc tcagccacgt gacttgcatc agctgaactg catggagcca 17281 agatgccaag agatccgctc ttgatatctt cttccagccc tggggcattg ttggcctggc 17341 ctgaagtaac tacatatttg gctttccatt caatttttaa atcctagcag ttcatctgaa 17401 actaggtgat cccaggtgac cctggatcaa actctggaag aatttgcctg tgactttttg 17461 tcttaatgac cctaatgaaa ggggtttggt cggggagccc ctggggctga ggtttgagac 17521 agtcccggct ctgacctggt aaggccacat ggctccaaaa caggccctcc ctctgagcag 17581 caggcacctc agttcctcca gccacacatc ctctggttct gtttcctctg catggtagtt 17641 tggaaggagc tgcctaggaa ggttttctgt ggggttttct ctttcctacc attacaataa 17701 ctaggtgacg catgtaactc actttctgct ttcaagaagg aacttctcat tttgcagtgg 17761 agactggaga ccctgtttct ctctcactta cccttattct tgtctgagct ctgtgtttca 17821 cactctctcc tttagtactt cctcttctaa ccaaaatgtg aacattccct aagatggggt 17881 cgggctacca gctttccctt tccagcgtct gtgccatctc cctccccgga gtttgcgctg 17941 ctggtggctt cagccagccc caggcagatg tttgcccagt ctgcgtcctg gacaataacg 18001 ctggcctgct tcaggcatat gcgtgcctgg caagcctgct gctcgacact gtggctcaca 18061 ctcaaaatag gccgcaaaca gagcatgtca caggcacccc agggcagtgc cctttcagac 18121 atctctgtta atgctactta gccctagaca ttaaaatcat ttttggatct agtggacaag 18181 gaggagagga gtgaatcagt caccaagcta cattcgttct cctttcagaa tctctcccat 18241 tcatccgtct tccttttccc ctcccacttg agacagagtg gattgaaggc tttcagggct 18301 gaggaaacac agtggttttc tcattgctcc ctttacagtt aggctcctct gccctccagt 18361 attgtgctag aaacttctgt tcgttatcct gttgcgtggc ggtgagtggg aggaagaggc 18421 tagccctgac atgctcaggt ctcctaggga gttttcgcta actacacaca caccacccct 18481 ctggcagata atgctttact gagcctgtcc tgctgatgtc agtgtcactg tgggtataca 18541 aaaaaggcag accatggaga agcttttgaa taagagttac agactgtaag cacccagctc 18601 agattacata actctggcct tgcgtgagct gcccagctat ctgtccagtg ttcctgcctt 18661 tatgttccta gcacagccat tggccctact agatggacag tgagtgagcc ttgacaaaca 18721 cgtctgcacc tgtgctccct gccatgtgct ccctgccctg tgctccctgc cctgttttcc 18781 aatataaaat cgccgaccgt tgattggggc tcgacttcag cccccactga 
tacaaggagg 18841 ctccctctgt cctcagaggc acttgccgtt gcttctctta tcctttgaga ttcagtcaca 18901 tacagacttt taacatccct tttttttttt gaaatgggct cctactctgt cacccaggct 18961 ggaatgtaat ggcatgatca cagctcactg cagccttgac ctcctgggct caaatgatcc 19021 tcccacctca gcctcccaag tagctggggc catgggcatg caccaccatg cctggttaat 19081 ttaaaaagtt tttttattaa gatggaaatc ttgctatgtt gcccaggcta gtcgcaaacc 19141 cctagcctca agtgatcctc tcgcttcagc ctgggattac aggcgtgacg actgtgtctg 19201 gcctgccttc tcatattatt acacaagatg tttaagacca caaaatactg tctggtcctg 19261 ccagctagac tgagttactg gaaggcagaa atcgtgttgc cgtcctgcac ggcccaggca 19321 cagtgctggg cacatgggga gggtggtgtc cctcacacat gtgctgcttg atgacagtag 19381 tggtagaggg gtctggaggg tgagttgctt cccccagctg cagaaccatc atctgcacac 19441 tcagaactag cttgttgcct ctgaggttct attttatttc ttcagtgaga cacatcattt 19501 ctctgttggg agttgggtat agtcaaaatc acctcttcaa aaagcaactg ttttctcctt 19561 cctcctgaca gctgtacctc ccacaaggat taccccttcc atgaagagtt ttgaagctgt 19621 ggttgcctgc tcgtccccaa gtgacctata cctgtgtttc ttgcctcatt tttttccttg 19681 caaatgtagt atggcctgtg tcactcagca gtgggacagc aacctgtaga gtggccagcg 19741 agggcgtgtc tagcttgatg ttggatctca agagccctgt cagtagagta gaagtctctt 19801 ccagtttgct ttgcccttct ttctaccctg ctggggaaag tacaacctga ataccctttt 19861 ctgaccaaag agaagcaaaa tctaccaggt caaaatagtg ccactaacgg ttgagttttg 19921 actgcttgga actggaatcc tttcagcaag acttctcttt gcctcaaata aaaagtgctt 19981 ttgtgagctt ggttttgtga gctttggttt tttaaaacaa tagcaacctt ctatctcact 20041 tcaaccatca ctctccacat acccataagc aggccagtca gtgatgcaaa aggagtgagg 20101 acttgacttt tctacaagtc ctcaactgca agttgggggc ccctcacccc agtactttac 20161 ccatttagct ttagggttgt gggtctcatc ctggctgcac accaggtcac ctttgggggc 20221 tttaaggctg gtgatgtctg gacctcgccc tggaggctga tctgattggg ctgtgatcag 20281 gacaggagga gtgtacaggc tctccaggtg attccagggt gcagccaagg ctgagccaca 20341 gcttttaggt aaacccatgt tcgattccac agattccttg gttgctcctt ctacgtatga 20401 aggccgaggt cggtgtttac taagcagcac aacctgagag aaattgcttt tctgaagcct 20461 gttgtctaga caagaggcaa agaaaaggca 
taaataaatg gcttccaatt ggataaagat 20521 gaaatagggc tacaaaataa agattttgag gcggaatgct gttgcttctt tttgttaccg 20581 tttctttatc cagagttaag ttaacaaagg ccaagaaagg aatggaatag acaagtgccg 20641 acttgaaagt tcatcaagtt aggttgtttt gtggctgccc ttgtttattg gataattaga 20701 cccccatgat caagtcacat taactgcttt gcaggtgatc tgtgtttaag atgcacctct 20761 aaaggtatag ttaattgggc agtggtggtt cctggccctc acccagcgct ggaggggcac 20821 acctagcccc tctgcggcta gggttgagcc cggtggagaa cccagagccc cactgcacag 20881 gctgccctta ctacctggca gccttgggcc tgggtccctc gtggcgtctg ctgatcacag 20941 tggtccagtt tcccctgccg gagtaggcag gtcacacctg ttaaactgca gaccttatgt 21001 tgccctctga tgattaagag tagattttag ttttgttaca aaagttacgt caaatcatct 21061 gacctgatcc ccactctgtt tctccagttg gcttcaacac tctttggaaa aactttaagg 21121 tagtcactgg taacagtttt taaatcggtt acctgccttt tagatctccg cttaggatag 21181 accattgtag catgacaatt tgagactcag cccaggagca taaacaagat aaaaggcaag 21241 ctgtcgaacc tcccctgccc tagcgtttgg ccagcagaac cctcagattt tttctaagga 21301 ctgccctttg aggagccttg ccggcagtgg aagaggagga agagaattaa agcatgccac 21361 gaatgacagg gagaacagct ggttctgcat tttgaagctc caatcgtatt ttcagtagtc 21421 agcggcttgg gctcaggtaa ggcccgttgc ttgtatgctg aaaagtgtgg cttaaggagc 21481 agaagctgag gctgtggagc tcagtggaac cagcttcatt ccactaacaa ttcatccaag 21541 gagtggatgg atgagagaag gtccgtggat caagacaggt gctgcttcat tcaccaatgg 21601 agagctccat tcttccttcc ccttgtggaa tcaaaaccgc tacaccttcc tgggcagact 21661 ccagcccagc ctcaggtctc tggtggactc atcacgatga atgaagtcac ccttcagagg 21721 gcaaagtcag tggcagttat agaacaatgg tgctggcccc atctcgggac atttgcccct 21781 gtgaagttgg gggcacaaca tacattcatg aagcagtggg gttttaccaa aataatgttc 21841 cagccaagga gcagaattgt taagatattg agaccaatgg aaatacagaa caaaatatta 21901 aatcacacgg gcaagtaaga tctgcattgg aaaatctctg aacaatgctc cggagacgat 21961 ttcaaacagc atcatttttc aggatcttga tttcaaagaa caaggtttct caacactatg 22021 tcctttatgt tgataaaatt gcacaaactg ctgtagtgca atgtccttgc tgctctctgt 22081 cacggcttat cacggggctc attggcagga ccctgggagg gtgcccagcc gggataagca 22141 cagtcagtat 
tggattcaac ccaggaagat ttccctggag gggcgttctg ggaattcatg 22201 gctctctagg atcc // LOCUS AF034632 2040 bp DNA PRI 01-DEC-1997 DEFINITION Homo sapiens orphan G protein-coupled receptor (GPR38) gene, complete cds. ACCESSION AF034632 NID g2654158 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2040) AUTHORS McKee,K.K., Tan,C.P., Palyha,O.C., Liu,J., Feighner,S.D., Hreniuk,D.L., Smith,R.G., Van Der Ploeg,L.H.T. and Howard,A.D. TITLE Cloning and Characterization of Two Human G protein-coupled Receptor Genes Related to the Growth Hormone Secretagogue and Neurotensin Receptors JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 2040) AUTHORS McKee,K.K., Tan,C.P., Palyha,O.C., Liu,J., Feighner,S.D., Hreniuk,D.L., Smith,R.G., Van Der Ploeg,L.H.T. and Howard,A.D. TITLE Direct Submission JOURNAL Submitted (17-NOV-1997) Biochemistry and Physiology, Merck and Co., Inc., PO Box 2000, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..2040 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q14-21" mRNA join(<1..901,1703..>2040) /gene="GPR38" gene <1..>2040 /gene="GPR38" CDS join(1..901,1703..2040) /gene="GPR38" /note="orphan G protein-coupled receptor related to growth hormone secretagogue and neurotensin receptors" /codon_start=1 /product="GPR38" /db_xref="PID:g2654159" /translation="MGSPWNGSDGPEGAREPPWPALPPCDERRCSPFPLGALVPVTAV CLCLFVVGVSGNVVTVMLIGRYRDMRTTTNLYLGSMAVSDLLILLGLPFDLYRLWRSR PWVFGPLLCRLSLYVGEGCTYATLLHMTALSVERYLAICRPLRARVLVTRRRVRALIA VLWAVALLSAGPFLFLVGVEQDPGISVVPGLNGTARIASSPLASSPPLWLSRAPPPSP PSGPETAEAAALFSRECRPSPAQLGALRVMLWVTTAYFFLPFLCLSILYGLIGRELWS SRRPLRGPAASGRERGHRQTVRVLLVVVLAFIICWLPFHVGRIIYINTEDSRMMYFSQ YFNIVALQLFYLSASINPILYNLISKKYRAAAFKLLLARKSRPRGFHRSRDTAGEVAG DTGGDTVGYTETSANVKTMG" BASE COUNT 347 a 631 c 598 g 464 t ORIGIN 1 atgggcagcc cctggaacgg cagcgacggc cccgaggggg cgcgggagcc gccgtggccc 61 gcgctgccgc 
cttgcgacga gcgccgctgc tcgccctttc ccctgggggc gctggtgccg 121 gtgaccgctg tgtgcctgtg cctgttcgtc gtcggggtga gcggcaacgt ggtgaccgtg 181 atgctgatcg ggcgctaccg ggacatgcgg accaccacca acttgtacct gggcagcatg 241 gccgtgtccg acctactcat cctgctcggg ctgccgttcg acctgtaccg cctctggcgc 301 tcgcggccct gggtgttcgg gccgctgctc tgccgcctgt ccctctacgt gggcgagggc 361 tgcacctacg ccacgctgct gcacatgacc gcgctcagcg tcgagcgcta cctggccatc 421 tgccgcccgc tccgcgcccg cgtcttggtc acccggcgcc gcgtccgcgc gctcatcgct 481 gtgctctggg ccgtggcgct gctctctgcc ggtcccttct tgttcctggt gggcgtcgag 541 caggaccccg gcatctccgt agtcccgggc ctcaatggca ccgcgcggat cgcctcctcg 601 cctctcgcct cgtcgccgcc tctctggctc tcgcgggcgc caccgccgtc cccgccgtcg 661 gggcccgaga ccgcggaggc cgcggcgctg ttcagccgcg aatgccggcc gagccccgcg 721 cagctgggcg cgctgcgtgt catgctgtgg gtcaccaccg cctacttctt cctgcccttt 781 ctgtgcctca gcatcctcta cgggctcatc gggcgggagc tgtggagcag ccggcggccg 841 ctgcgaggcc cggccgcctc ggggcgggag agaggccacc ggcagaccgt ccgcgtcctg 901 cgtaagtgga gccgccgtgg ttccaaagac gcctgcctgc agtccgcccc gccggggacc 961 gcgcaaacgc tgggtcccct tcccctgctc gcccagctct gggcgccgct tccagctccc 1021 tttcctattt cgattccagc ctccacccgc cggtacttcc catcccccga gaaaaccatg 1081 tcctgtcccc caggagctct gggggacccc agggcgcttt gagggtggga tccccggatc 1141 cgattcagta accagcagtg cttttccaga gcctctgaga ccagaaagga gagttggtaa 1201 ttcttaatcc aaccacctgt tagatgccac aaatgaggag tcctcacagt gctcttgaga 1261 agacgaggga gatttcatta agctaaaatt ttttatttaa tgttaagtga tgctgaaggc 1321 taaagtaaac cttgctcgta tcaaaaagta aagattgtgc agacctgttg tagaattctt 1381 ttcaacagag aacagaaaac ttgtctccga agtgggtttg tggaaggaag cctgccaagg 1441 cggcttgttc agagaaattg ctccttctgg tttatgtcca gccttgataa cacatatggg 1501 agcctactat gcagttttaa agcaagtatc catgcagcct gcagcctggt cattttttct 1561 ggggtgagga tctgcctagg tagaagtttt ctctaattta ttttgctgtt acttgttatt 1621 gcagatggtt ccttgtcggg gtggggggtt tatttgcttc ccaatgcttt tgttaatccc 1681 ggtgctgtgt cttatgttgc agtggtggtg gttctggcat ttataatttg ctggttgccc 1741 ttccacgttg gcagaatcat ttacataaac 
acggaagatt cgcggatgat gtacttctct 1801 cagtacttta acatcgtcgc tctgcaactt ttctatctga gcgcatctat caacccaatc 1861 ctctacaacc tcatttcaaa gaagtacaga gcggcggcct ttaaactgct gctcgcaagg 1921 aagtccaggc cgagaggctt ccacagaagc agggacactg cgggggaagt tgcaggggac 1981 actggaggag acacggtggg ctacaccgag acaagcgcta acgtgaagac gatgggataa // LOCUS AF036329 4498 bp DNA PRI 06-FEB-1998 DEFINITION Homo sapiens gonadotropin-releasing hormone precursor, second form (GnRH-II) gene, complete cds. ACCESSION AF036329 NID g2833652 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4498) AUTHORS White,R.B., Eisen,J.A., Kasten,T.L. and Fernald,R.D. TITLE Second gene for gonadotropin-releasing hormone in humans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 95 (1), 305-309 (1998) MEDLINE 98081869 REFERENCE 2 (bases 1 to 4498) AUTHORS White,R.B., Kasten,T.L. and Fernald,R.D. TITLE Direct Submission JOURNAL Submitted (02-DEC-1997) Neuroscience, Stanford University, Jordan Hall, Bldg 420, Stanford, CA 94305-2130, USA FEATURES Location/Qualifiers source 1..4498 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20p13" /clone_lib="PAC library from Genome Systems (St. 
Louis, MO, USA) constructed by Pieter Dejong" /note="PAC clone location: plate 184, well B21" mRNA join(1312..1355,2098..2258,2369..2526,3372..3431) /gene="GnRH-II" exon 1312..1355 /gene="GnRH-II" /number=1 /evidence=not_experimental gene 1312..3431 /gene="GnRH-II" exon 2098..2258 /gene="GnRH-II" /number=2 CDS join(2105..2258,2369..2526,3372..3422) /gene="GnRH-II" /note="GnRH-II precursor; gonadoliberin, second form; LHRH, second form; similar to chicken GnRH-II" /codon_start=1 /product="gonadotropin-releasing hormone precursor, second form" /db_xref="PID:g2833653" /translation="MASSRRGLLLLLLLTAHLGPSEAQHWSHGWYPGGKRALSSAQDP QNALRPPGRALDTAAGSPVQTAHGLPSDALAPLDDSMPWEGRTTAQWSLHRKRHLART LLTAAREPRPAPPSSNKV" sig_peptide 2105..2173 /gene="GnRH-II" mat_peptide 2174..2203 /gene="GnRH-II" /product="gonadotropin-releasing hormone, second form" misc_feature 2204..2212 /gene="GnRH-II" /note="cleavage site" mat_peptide join(2213..2258,2369..2526,3372..3419) /gene="GnRH-II" /note="GnRH-II-associated peptide" /product="gonadotropin-releasing hormone associated peptide, second form" exon 2369..2526 /gene="GnRH-II" /number=3 exon 3372..3431 /gene="GnRH-II" /number=4 polyA_signal 3411..3416 /gene="GnRH-II" BASE COUNT 949 a 1309 c 1400 g 840 t ORIGIN 1 aagcttggct ctggtttaga ttttccagga aaagccggaa tcaaattaca gaataaataa 61 aggcaaccat ccccctttta aaggtacacc agccttggtg catctttgaa gaaagcattc 121 tgtaaacccc aaccagaact aaactagtac gtcgaactca gattcatttt cactaaacca 181 caagcaaatg tttccctaaa aatcacccag ttaacaaagt ccgcatattt aagccaaaac 241 aatttaactg aacaaatggg ccacacgttg atttccggtc cctgctaata agtcagtctg 301 gaagttcaca ggtgtgccca tcctgccttg gctgctgaag tccaggtgtc tagggctgac 361 tgatgcccat tatgcctccc ctcccccatc tttgtcacag gatttgacgc accagctctc 421 caaatgaccc tggccctccc atttgctgtt cagcccaagt gcggagattg gctatgaacc 481 ctgtaaacag gcctctgacc cccagaggct gatggctggc caaggaaagc tgagctgctg 541 acgcagactg ggaagcaaga gcccacttcc agcagcccag gctagctgtg tccaaatcca 601 tgactgggga ggggttagag ccttgaggga caaaattatt 
ctacctacct agggagactg 661 cactggccca acagctgggc cccatctcat gggcccgctt cttcgccagg agagaagcca 721 ctccggggta ggtactgccc cacccaaacc cagccatctg gagtgaccca gccctggttc 781 ccaggtgtgt ggatgtgaat tgtcccaccc aacccactct acagtgagca aacggaagcc 841 ctctgggaga gtggtcacag cctcccctgt acctttgaac agcctgccag gctccctact 901 cttaggcttc cactgtccac cagggaaagc cctgagctgg gagttgggga gcccccaggc 961 attgcccctg cccaggacac aattctcttt tgggatcagg gaaggctgtg agggctttct 1021 aggtctcaag atcaggagct tgaagatgca gcctgggaag tgggaaggtg agaccaggac 1081 ataggccagc ctaaagcaag agtcctgggc ctgaaggctc ctgggaaggt ggtggggagg 1141 gagcatgtgt cggtggcctc agggcagcag ctgcctggtg aatgttcatg gactggatgc 1201 tctgggaagc gggttgggtg gtgagcttct ctcttcccct ctgaagacgt cactggagtc 1261 tgggggtgga gctgcctggt ctataaatcc tggggccatc aggctggggt cctgcagctg 1321 cctgaaggag ccatctcatc cacagctctt ccttggtgag tggggagcct tccctaaggg 1381 ctaggacacc tgaaccaatt ttcatcctgg gcgtatggtg tgctgctcct cttccccatt 1441 cccaggtgcc tccacccctg aaccatgcca gagaagtccc cttttcctct cctctcccca 1501 acagctctac catctattct tgtgcttgtt gcccctggca tgggagggat aaggggtaga 1561 agcacttgcc cccatcaata ccactcatcc attccacatc cccaactact atggaagaga 1621 tacagcaggc cacggagaaa agggcagaag gcctgcaact ctggttccct agcactggtg 1681 ctccaaacac gcctacattg agaactcccc tgaccatcca tctatcctcc catccattgg 1741 cctgaattca ggtctctgtt cccctccaac tttcttccac ttctggaaac tccttgaagg 1801 aaagatggat ggacctggac aagtgggagg gccctcagag ctggcaaggc aggtagcctc 1861 tgtgccccag gctcagggag aaggctcgtc ccctggagca tcatcccctg ctgggccagg 1921 atcccccagg atctggaccc ctgtatgctt gggatgagga gcggtggcag agagggaagg 1981 gcataaggag ataccaaagc tgcccctgag atgccagttt tccaaagtgg ccctggagga 2041 agtaggggga tgtgggggtg aggtaagtct ccttgaatgc tgtaccctgt ccattagagc 2101 agccatggcc agctccaggc gaggcctcct gctcctgctg ctgctgactg cccaccttgg 2161 accctcagag gctcagcact ggtcccatgg ctggtaccct ggaggaaagc gagccctcag 2221 ctcagcccag gatccccaga atgcccttag gcccccaggt gggtgtctcc cagcctcatg 2281 gggaggaaga aagtgatggc cgggggctcc cccaccctcc tggagcctga 
ggtcggggta 2341 gggaggacag catcagttcc cttctaagga agggccctgg acactgcagc aggcagccca 2401 gtccagactg cccatggcct cccaagtgat gccctggctc ccctggacga cagcatgccc 2461 tgggagggca ggaccacggc ccagtggtcc cttcacagga agcgacacct ggcacggaca 2521 ctgctggtga gtagggtgag aggtccccag catcaagacc agccactggt catcagaggc 2581 cattgtggct tagggttggg tgctgggagg gtggggagaa tgaaacacca ctgagatgcc 2641 ccctgccaca gcacccccag ccatttctca gtgcccctac tgcacacagc agggtgctgt 2701 ctgctatcct tcctatttcc caggaggatt ctagacaatt tacaaagcac ttgggttaaa 2761 gaccaaagtc actagtagac tagaaggaga taattgttct ataagacagt ggtggccatg 2821 ggatcccaca ggcatcctga caagccaatg actgtcttga ggtggacaga ccccaggcca 2881 gtggaaagag gtgagggatg caacctcact cagacaacag ggccaagagg accaggtggt 2941 gactgacatg tgcactagga acatctcagg gactgcagag ctccccaaga ccatagcaga 3001 agacaggcgt ggggaaatgg tttgctactg ttttgcaaat caaacattta cagtgcatca 3061 ggagagcccg gtaactaaag aagaaagtgg ttagttccta tgaggcaaag tcttaccgcc 3121 tgatttgtgt gtatgtgctg aggtttctat gcgtcaggct tgtttagggt ggacaagagg 3181 gcatgcccaa gggagctgga gatccccaca ctagctggat cctcaggctt ctacgggagg 3241 cggggggcgt cctgctgtgg gaggccacat ggggactggg ggggacgaga ggggagagaa 3301 ccaggaagat ggcagctcgg cggttacgag accagtgtcc tgagacatga ccgccacctc 3361 tccctccgca gaccgcagcc cgagagcccc gccccgcccc gccatcctcc aataaagtgt 3421 gaggttctcc gaagctgttg cgtcgagttc tgtccttcgt cccctccctg tcttccccgc 3481 tgagaccctt ccctgcgtgg gggctggagg gacgcgggtc cggccccgcg ggcgggagta 3541 actaagggat ggccccgggc cctggcggga aggccgggcc agagcctggg ggcgggatgc 3601 ggacgtccgc agggtcgccg cttcggttcc agaggccaca cggccgggcg gggcgtgagg 3661 gacagcccga ggactacagg tcccaaggtt ccccgcgccg cttccggggc acggtggcgt 3721 cccggcaccg cggccgcagt gaggagactc ggccatgcta cgcgcgctga gccgcctggg 3781 cgcggggacc ccgtgcaggc cccgggcccc tctggtgctg ccagcgcgcg gccgcaagac 3841 ccgccacgac ccgctggcca aatccaagat cgagcgagtg aacatgccgc ccgcggtgga 3901 ccctgcggag ttcttcgtgc tgatggagcg ttaccagcac taccgccaga ccgtgcgcgc 3961 cctcaggtgt gcggccgggg ggaggtggcc gcccgcgcgc gctggtgacg gtgggagtgg 
4021 gcggagaggg tgctgattcc tggcgcgtct gcacccagga tggagttcgt gtccgaggtg 4081 cagaggaagg tgcacgaggc ccgagccggg gttctggcgg agcgcaaggc cctgaaggac 4141 gccgccgagc accgcgagct gatggcctgg aaccaggcgg agaaccggcg gctgcacgag 4201 ctgcggtgcg tggggcggga ggcggggcgg ggcggcgcgg cctggccggc ctgggagaag 4261 cccgggcccc gctcagcctc ggccctttga ccctcacagg atagcgaggc tgcggcagga 4321 ggagcgggag caggagcagc ggcaggcgtt ggagcaggcc cgcaaggccg aagaggtgca 4381 ggcctgggcg cagcgcaagg agcgggaagt gctgcagctg caggtgggca acgtttccgg 4441 agggtgggac tccagcgggg acgcggcttg cggggcactg gaattagata tcaagctt // LOCUS AF037372 2331 bp DNA PRI 25-DEC-1997 DEFINITION Homo sapiens cytochrome oxidase subunit VIIa-H precursor (COX7AH) gene, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF037372 NID g2708819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2331) AUTHORS Jaradat,S.A. and Grossman,L.I. TITLE Human cytochrome oxidase subunit VIIa heart isoform gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2331) AUTHORS Jaradat,S.A. and Grossman,L.I. TITLE Direct Submission JOURNAL Submitted (09-DEC-1997) Center for Molecular Medicine and Genetics, Wayne State University, 540 E. 
Canfield, Detroit, MI 48201, USA FEATURES Location/Qualifiers source 1..2331 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" /tissue_type="heart" mRNA join(741..826,1464..1550,1673..1757,2185..2297) /gene="COX7AH" /product="cytochrome oxidase subunit VIIa-H" gene 741..2297 /gene="COX7AH" CDS join(812..826,1464..1550,1673..1757,2185..2237) /gene="COX7AH" /codon_start=1 /product="cytochrome oxidase subunit VIIa-H precursor" /db_xref="PID:g2708820" /translation="MQALRVSQALIRSFSSTARNRFQNRVREKQKLFQEDNDIPLYLK GGIVDNILYRVTMTLCLGGTVYSLYSLGWASFPRN" transit_peptide join(812..826,1464..1511) /gene="COX7AH" mat_peptide join(1512..1550,1673..1757,2185..2234) /gene="COX7AH" /product="cytochrome oxidase subunit VIIa-H" BASE COUNT 507 a 710 c 688 g 426 t ORIGIN 1 ggatcgctga acccaggagg tcgaggctgc agtgagctat gaccgcacca ctgcactcca 61 gcctgtgcaa cagagcaaaa ccctgtctaa aaaaaaaaga aaaaaaagaa aagaaaaaga 121 aaaccttgta gactaggcca ggatctttcc tggttgctaa aaccattagg acaactgtgc 181 tgcaccccgt tgactgccag aattagagag ctataaatag atatgatgtg tttcttaggg 241 taacataaaa taccgtttta ctcccaaaga ctttagcttg tgtttcctaa aaacaaggtc 301 atttaatgtc tcaacaaatc tctgagaaag atcctgcctt agccctcacg ttacagatac 361 ggaaacaggc tcggaggtgc aatctctggc tgtgtagaca gaacagggag ttggaacacg 421 gccttatgcg tttcctcact ctcacatacc agcgggcagg gagagatccg gggaggagtt 481 gcacttgccc ccggcggaaa tctccaaaaa tctgcaaaaa tgtattccct ggtaccgctt 541 tgggctccgg tgggaccaag cgaaattccg agaccgaaac ggatgcgcgc tgcggcccag 601 ggtgcgggtc tggaccgcct cctccgtggt ggacgaatcc gaggagcagg actccccgca 661 acccagcccc agccccagcc ccagccccgc cgcgtcccca gctgtcacgc tgcgcgcagc 721 gggtggggcc tggggttcct ggacagagga ggactacgcg tgtccttggg cggagaaggg 781 aggtgactcc ggcggaagag gacaaggcag aatgcaggcc cttcgggtga ggcccccatg 841 ggctgggccg gagcccccca cccagaaaac cccgccctaa tacgcgactc tccaagtgcc 901 cccgcatcta gggccttgga acgtgagcca ccttcagggc ctgggactcc ccgctaaaca 961 gcctgggtcc cttcccccac aatccactct tcggagacaa ctggggaccc ccaccccggc 1021 ccaaagcccc 
gggtttgaac ttgacatctc gggaaccctt aacactggcc tcacccgggc 1081 cttctcaggc agcctggacg ctcccgggga cctccattcc ggtccccacg ttcccgggac 1141 ctggtttcct ccctttaatc caggaaaccc tcccccaaca aaaatctcag gacttaggcc 1201 ccccagaaca aagtccttct gggtcctctc ccttccagga ccctctaggc agcccggtca 1261 cccccaggcc agactccagg gcgagtcccc tcccaaccca gctggcagtt cgcgggtaac 1321 agaatccaag ctattaccct cctcgccttt gggacgctcg ctagggatgg ggctgtcccc 1381 acgtccaggc ctcagagagc ggggtcccac ttcaggacac ccctccaggt gcctccaata 1441 cccgcccacc tcccaccccg caggtgtccc aggcgctgat ccgctccttc agctccaccg 1501 cccggaaccg ctttcagaac cgagtgcgcg agaaacagaa gctcttccag gtggggggtc 1561 ggggggggtg gggatccgac gcgccagccg ggagcgcgcc gagccggggc aggcggggcg 1621 cgctctaagg agcagccagc acccctttct catcagacac ccccacatcc aggaggacaa 1681 tgacatcccg ttgtacctga agggcggcat cgttgacaac atcctgtacc gagtgacaat 1741 gacgctgtgt ctgggcggtg agcgcagggc ccgtctgggc tgcgggggag gcggggctgg 1801 acccagagta agaggtggct ggtttctggg caggactgac caggatctgg gttgggggtt 1861 ggcgtttagg aagggggtcg agtactgagc tggagtcagg cctgcaggga ggggagtaac 1921 tagggtttgg gctcttgtcc caggacgggg gctgggttag tggtggggct gaggcccagg 1981 gaggagtcgg ggagaaagct cccagcccgg tctagatcaa agccaggggg ctttacaata 2041 ggagtccaat ctgggaaggg agctggagtc cgaactagag ctggggtcca ggctgggaac 2101 cagggagcct gaaggcctta aaagcctggg atgtccaggg aggggattag gtctcattct 2161 gtctccaacg cttccccttt ctaggcactg tctacagctt gtactccctt ggctgggcct 2221 ccttccccag gaattaagac caagaagcct ggggggcctg agagacttga acaagtgtca 2281 ataaacgctg gcctctgtgt gtgtctgtgt ttgtgtccac ccctccccac c // LOCUS AF039307 4322 bp DNA PRI 04-JAN-1998 DEFINITION Homo sapiens homeobox A11 (HOXA11) gene, complete cds. ACCESSION AF039307 NID g2745850 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4322) AUTHORS Mi,X., Winters,J.L., Stevens,D.B. and Fleischman,R.A. 
TITLE Direct Submission JOURNAL Unpublished REFERENCE 2 (bases 1 to 4322) AUTHORS Fleischman,R.A. TITLE Direct Submission JOURNAL Submitted (19-DEC-1997) Internal Medicine, University of Kentucky and VA Medical Centers, 800 Rose Street, Lexington, KY 40536-0093, USA FEATURES Location/Qualifiers source 1..4322 /organism="Homo sapiens" /strain="ATCC65272" /db_xref="taxon:9606" /db_xref="ATCC:65272" /chromosome="7" /map="7p15-p14" /clone="c2A/4.3 (GDB:180620)" mRNA join(<1799..2507,3920..>4152) /gene="HOXA11" gene <1799..>4152 /gene="HOXA11" CDS join(1799..2507,3920..4152) /gene="HOXA11" /codon_start=1 /product="homeobox A11" /db_xref="PID:g2745851" /translation="MDFDERGPCSSNMYLPSCTYYVSGPDFSSLPSFLPQTPSSRPMT YSYSSNLPQVQPVREVTFREYAIEPATKWHPRGNLAHCYSAEELVHRDCLQAPSAAGV PGDVLAKSSANVYHHPTPAVSSNFYSTVGRNGVLPQAFDQFFETAYGTPENLASSDYP GDKSAEKGPPAATATSAAAAAAATGAPATSSSDSGGGGGCRETAAAAEEKERRRRPES SSSPESSSGHTEDKAGGSSGQRTRKKRCPYTKYQIRELEREFFFSVYINKEKRLQLSR MLNLTDRQVKIWFQNRRMKEKKINRDRLQYYSANPLL" BASE COUNT 956 a 1230 c 1207 g 929 t ORIGIN 1 gatcttagag tttatactct aaatctcccc aagatatgta aataactttg gctatttcct 61 ggagagggaa aaacaaaagg ttatcttttt acatattttt ttattttcct tcagcaacat 121 cccagatcct tccaagaaga gagttgttgg gaggcctcag gtctgggccc ttctcagctc 181 ctggctctgc ctggctgctc tgtgctctgt gtcctctcct ttctttcgct tcctccaaac 241 attgctcctt caatcctgca ggatggggag catattttgc cttcttaatt tatttttttt 301 cctcttctca agaaagctag actcagagta ttgctatggc ctctctctat ccttagcaca 361 aacctagctt tttagagaca tccctgtttc cccaggtgca gggagttcgg gaagcacctc 421 tcctttctct ggtattgtat tcctcctgtg gaatgagcag taggaaaggc acagagctct 481 ctgagttttt gccctgcaca tcccttgctt tcactctcac acattgcaag gaaggagagt 541 aggagagtag gtgggttacc cctttctcag ccacctctcc ttggccctca gcccgtcctt 601 tccacctcca ttctccccac acccctggag ctctgtaagc agcctgatgg gccccccacg 661 aagatgcagc atacccagga gaagtctcct cggatgtcag cgcctctaaa gcagcccaag 721 gcttgcctca attgcatggt ttcccgagtc ctcagctcca gaagaccagg cagatgggtg 781 gaccggtgag cagcagggca gcccctgtgc ctctgtctct gccgagtcac 
tccgaagccc 841 ggcaggcagc gaggaggagg gagtttctcc aaggacagaa ggtgggatga agaggtaggc 901 agggaagatg aggggagagg tggatcccgg gtaagacgaa ggcccttccg ggccctgcgg 961 atcagtgaca aaccgcgggg agaagccgtt ctggctgttg gcggtttagg gacggaaggc 1021 actaaagcgc ttcggaagtg accatgaatg agagagtgta atcaagtcac cgtgcaaatc 1081 ggacaagcca ccaggcaggc acatccacgg cttcaaactc tggccccgaa ggggttccgg 1141 ctagggtcgg aggcagaggc gcttcccaga gcaagtctat gggaggggga ctgccaagaa 1201 gggggtgcaa atgcgagact ccaggagaac agactccgag accacaggcc acacagcgaa 1261 cggactccca cctggctatc cccagtccag ggcatccctc acccacccgg ggagctgcgg 1321 gtgggaggtg gggacgagag ttgagctctc accgccctct gcacactcga gaacgaggac 1381 cctgcaattg agcacaagca tgctgcatgg gggcgcaccc cagcctctcc gcgcgcgccg 1441 ggaggccccc cagccaacat gagttacacc ggcgattacg tgctttcggt gagaacaccg 1501 agtgacgatc tgttgcttcc cctgaggtgg ctacaaagaa aggaagccgg gagggagggg 1561 agaggaggaa aaaaaaaaaa ggaaaggggg ggggggggga aaggcccgga ctagctagca 1621 gcttgtcaat ttcaacatcg ggtcacatga ccagcacctc cctgctaagg atggggatag 1681 atttccacgt cagcttacgt ctccaaattt ctacttcacg gatccgcttc aaagaggcag 1741 ctgcagtgga gaatcatgtt aagctcggct actgcggaga gcccaaggta gcccaataat 1801 ggattttgat gagcgtggtc cctgctcctc taacatgtat ttgccaagtt gtacttacta 1861 cgtctcgggt ccagatttct ccagcctccc ttcttttctg ccccagaccc cgtcttcgcg 1921 cccaatgaca tactcctact cctccaacct gccccaggtc caacccgtgc gcgaagtgac 1981 cttcagagag tacgccattg agcccgccac taaatggcac ccccgcggca atctggccca 2041 ctgctactcc gcggaggagc tcgtgcacag agactgcctg caggcgccca gcgcggccgg 2101 cgtgcctggc gacgtgctgg ccaagagctc ggccaacgtc taccaccacc ccacccccgc 2161 agtctcgtcc aatttctata gcaccgtggg caggaacggc gtcctgccac aggctttcga 2221 ccagtttttc gagacagcct acggcacccc ggaaaacctc gcctcctccg actaccccgg 2281 ggacaagagc gccgagaagg ggcccccggc ggccacggcg acctccgcgg cggcggcggc 2341 ggctgcaacg ggcgcgccgg caacttcaag ttcggacagc ggcggcggcg gcggctgccg 2401 ggagacggcg gcggcagcag aggagaaaga gcggcggcgg cgccccgaga gcagcagcag 2461 ccccgagtcg tcttccggcc acactgagga caaggccggc ggctccagta cgtataaagg 
2521 cagcgccctg cgctttatga aaggggagtt ggaaatctag cgcagcaccg ctgttaaatt 2581 tttatatgct gttttataag catataaggt ttatgaaggg cctttaggtt tcccccaaca 2641 aaactttgtt ataaaaggcg ggatcacgtg gcgagtgctg aaattggccg tgaaataccc 2701 accggccaag gcatgcctgc aaaaaaaaac aactcccctt ttcccttttt tcccttctca 2761 gctcccaagc ccagcagctc gaccccgctc ggggccagct ccaggctggg gagccggagg 2821 tgggcgcctg taaattcccc ctgcaagggc ttcgggagct tttgatagcg aggtaatccg 2881 gagattttta taaaagccat gtgttgcgcc gggactccag tcccggccgg gattcaaagg 2941 cgtcgctgtt taacgacggc gctaggagcg cgtttaacag ataaagtagc ccgcttagct 3001 ctagcaatat attaaaagaa caaattaacc caaagttggc ctttttgtcc aaaggaagcc 3061 attttcctct ggcgtggtgg caattacgtg cctccttctg cccaactgca tgttattcac 3121 aattgttatt caatctgtca aaagcgattt tttttttgtc ttgtgggtct tttttttttt 3181 tttttttttt tttttttttt aaagggggag caagtagagc ctgcccgcag cttcgggaag 3241 ccccctttcc cttcacccta cctaagatag agccgctgct ctgccgtgtg caggatggcc 3301 gtgtggccgt ggtgctcagc gaaggcctag ggtagggggc ttccgccgcc acctccccac 3361 ctcggcggag ccggaaggct ggccggggag gcagaagctg gtggccagga ctcaggctga 3421 caaagacccg gcagccgccc cagatgttaa ggactgccca gtcttggccg cagccgggaa 3481 agacagggac tggagccgcc tttactgcct tgggaagggg cagcccaggt gctggaaggt 3541 gaaatgtccg caggcggctg tttgtttagt ctggcctgcg agaggcagcc ggcaaccccc 3601 acttggctgc tttcaaagag gcttaggcag cgcaaactcg cctgttgctg tgcgaagctc 3661 tcctttctgt tctgggggca gcgggagcag cttccccggg ggcctattta gggagctggc 3721 ttttatctgt ggctgagcct ccaactgcgg ccgttccaag catgcagtcg gagcgttaaa 3781 ggcaggggcc tgggcagact ttgacggggc ttgggctgga ccaccagtcc ccatggcagc 3841 ccttctcctc agctatgggc tctggggtgg attgcagccc ccacacctgg ctcaccccat 3901 gccttttctc cctccacagg tggccaacgc acccgcaaaa agcgctgccc ctataccaag 3961 taccagatcc gagagctgga acgggagttc ttcttcagcg tctacattaa caaagagaag 4021 cgcctgcaac tgtcccgcat gctcaacctc actgatcgtc aagtcaaaat ctggtttcag 4081 aacaggagaa tgaaggaaaa aaaaattaac agagaccgtt tacagtacta ctcagcaaat 4141 ccactcctct aagactccag cggctggaat tgggtggggg gcttcataca catgagataa 4201 
tatgcagatt ttgcccttga caaagtcaag ccacatggtg acttttgaaa agaggtgtgc 4261 aagagaggga tgcatggaga tagccccaca ggaggtggtc tgggactctc ttgattaaga 4321 tc // LOCUS AF041427 45859 bp DNA PRI 19-JAN-1998 DEFINITION Homo sapiens ribosomal protein s4 Y isoform gene, complete cds. ACCESSION AF041427 NID g2791858 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 45859) AUTHORS Zuo,L., Baybayan,P., Kuang,W.-J., Brown,L., Page,D. and Chen,E. JOURNAL Unpublished REFERENCE 2 (bases 1 to 45859) AUTHORS Zuo,L., Baybayan,P., Kuang,W.-J., Brown,L., Page,D. and Chen,E. TITLE Direct Submission JOURNAL Submitted (08-JAN-1998) Advanced Center for Genetic Technology, Applied Biosystems Division of Perkin Elmer Corp., 850 Lincoln Centre Drive, Foster City, CA 94404, USA FEATURES Location/Qualifiers source 1..45859 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Yp, deletion interval 1A1B" repeat_region complement(9..647) /rpt_family="L1" repeat_region complement(651..957) /rpt_family="Alu" repeat_region 1234..1434 /rpt_family="MLT1" repeat_region complement(1716..1959) /rpt_family="Alu" repeat_region 3230..3499 /rpt_family="Alu" repeat_region complement(4615..4663) /rpt_family="Alu" repeat_region complement(5447..5709) /rpt_family="Alu" repeat_region complement(5806..5945) /rpt_family="Alu" repeat_region complement(6416..6701) /rpt_family="Alu" repeat_region 7913..9759 /rpt_family="L1" repeat_region 8762..8841 /rpt_family="MER25" repeat_region 9782..9890 /rpt_family="L1" repeat_region 11220..11514 /rpt_family="Alu" repeat_region complement(14178..14425) /rpt_family="Alu" repeat_region complement(14430..14746) /rpt_family="MER4" repeat_region complement(15641..16192) /rpt_family="PAB" repeat_region 16442..16568 /rpt_family="MER4" repeat_region 16570..16880 /rpt_family="Alu" repeat_region complement(16940..17180) /rpt_family="Alu" repeat_region 17784..17966 
/rpt_family="MER4" repeat_region complement(18000..18232) /rpt_family="Alu" repeat_region 18253..18414 /rpt_family="MER4" repeat_region complement(18663..18955) /rpt_family="Alu" misc_feature 18711..20301 /note="CpG island" mRNA join(<19693..19695,20233..20310,22145..22325,23714..23811, 32667..32838,43156..43313,44861..>44962) /product="ribosomal protein s4 Y isoform" CDS join(19693..19695,20233..20310,22145..22325,23714..23811, 32667..32838,43156..43313,44861..44962) /codon_start=1 /product="ribosomal protein s4 Y isoform" /db_xref="PID:g2791859" /translation="MARGPKKHLKRVAAPKHWMLDKLTGVFAPRPSTGPHKLRECLPL IVFLRNRLKYALTGDEVKKICMQRFIKIDGKVRVDVTYPAGFMDVISIEKTGEHFRLV YDTKGRFAVHRITVEEAKYKLCKVRKITVGVKGIPHLVTHDARTIRYPDPVIKVNDTV QIDLGTGKIINFIKFDTGNLCMVIGGANLGRVGVITNRERHPGSFDVVHVKDANGNSF ATRLSNIFVIGNGNKPWISLPRGKGIRLTVAEERDKRLATKQSSG" repeat_region complement(28910..29242) /rpt_family="Alu" repeat_region complement(31903..32189) /rpt_family="Alu" repeat_region complement(34206..34493) /rpt_family="Alu" repeat_region complement(34694..34887) /rpt_family="Alu" repeat_region complement(35493..35773) /rpt_family="Alu" repeat_region complement(35827..36758) /rpt_family="L1" repeat_region 38571..38670 /rpt_family="MER4" repeat_region complement(39681..40276) /rpt_family="Alu" repeat_region complement(43962..44227) /rpt_family="Alu" repeat_region complement(45347..45448) /rpt_family="MLT1" BASE COUNT 12437 a 9045 c 9912 g 14465 t ORIGIN 1 ggatcttaaa gtcttcaata cattttgatt tgatttttgt ttatggtgag aaacaggagc 61 ctagtttcat ttttatgcat atgaatatcc agttttccct gcacttttat caaagagact 121 ttcttttccc caatgtatgt tcttaacacc tttgtcaaaa atgagttcat tatagacgta 181 tggatttatt tctggttgtc tattctgtta cattggtcta tgtgtctgtt tttatgccag 241 taccatgttg tctgatttct atagctctgc tgaataactg caaatcaagt tatgtgattt 301 ctccagtttt gttctttttg ctcagaatgg ctttggctat tctgggtctt ttgtggtttc 361 atacaaattt aagaaatatt tttgttctat tgttgtgaag tgtcattagt attttgatag 421 ggatttcact gaatctgtgg attgctttgg gtagtatgga tattttaata 
agatggattc 481 ttctaatcca tcaacaggga atgcctttcc atttgtgtgt gtgtgtgccc gtgtgtatgt 541 gtgtgtctat ctgtgtgtgt tctttaattt cctgcatcaa tgtttcataa ttctcattat 601 agaagtgttt tacttctctg agtaagttta ttcctaggta ttttatttta ttttttattt 661 ttttgagatg gaatctcact ctgtcaccca ggctggagtg cagtggcaca gtctcagctc 721 agtacaacct ccacctcctg ggttcaagtg gttctcctga ctcagcctcc tagtagctgg 781 gaatataggc acctgccacc atgcctaggt aatttttgta tttttagtag agatggggtt 841 tcaccatgtt ggccaggctg gtctcaaact cccaatctca tgtgatccac ccacctcagc 901 ctcccaaagt gctgggatta caggtgtgag ccactgtgcc ctgccaacat ttttagtctt 961 aacattaaaa ttttaaagta aaatactggt taccagaggc ttattggtgg ggaggactgg 1021 agaaatgcca atcaaagagc acataatttt taagttaagc aggaggaata agtttaagag 1081 atctattgta caacatggtg actatagtta ataaccaatt tcatgcttga aaattgtgaa 1141 gagaaaacct aactcaacaa cacatttaaa agattgttca tcatgtccaa gtataattta 1201 tctaagggat gcaagggtgg agtatgcagc cacagccaag gaatggcagc aagcacaaga 1261 tgctaaagag aaatggaata gattctctcc tagagcctct gaaaggagca tagtcctgcc 1321 aacaccttta tttccaccct gggatactga tgtgtgaatt ttgtatcatg tggaactgtg 1381 agaaaataca ttttatgttg tgtgccacca agtttgtggt aacttattac agcaaacaca 1441 agaatctata catcctccta agtattttcc tgtatgtaac tccttgtttt ttttcacaga 1501 agcaccaggc actgatgatc catacataaa tgaataggtc ctgcatgttt ccctgctcct 1561 ttatagaaga gtccaacact tggattctaa acatatggaa tagtgccaca gtgatccaag 1621 tactaaggga tgagaggaga gtttttgttt tttgttttgt tttgttttgt ttttgttttt 1681 gtttttgttt ttatctgttt tgatcttgat ctgttgccca ggctggagta cagtggtgtg 1741 ctcactgcaa catctgcctc ccaggttcaa gcaattctcc aggctcagcc tcccaagtag 1801 ctgggattac aggcatgcac taccaggcct ggctaatttt tgtattttta gtagagacgg 1861 ggtttcaccg tcttgggcag gctggtatca aactcctgac ttcagatgat ccgcctgcct 1921 tggcctccca aagtgctgag aattacaggt atgagccact gtgcctggtc aagtttttgc 1981 ttttaagtca acctcaggga gtttgggaaa tgccactaaa agataatcac tcatatttga 2041 gcagaggaag aagccatatt atattccagg agtgataaca atcagatggt tatagttccc 2101 tgtgattgag aatcacttta tttcccaata taaatttgga ggacacagat tatgtaattt 2161 
ctgcaagctt cacattttgt aaataaaata tcaattaaca tttcacactg tatttattga 2221 tggaacttaa gcaagaaaac aaattttcag aagtaagttc ttctgtgtta ttttaattgg 2281 cccttagaca aacaaaatcc ccatccacgt ttattataac aagtcctaca atacacagtt 2341 gcattttaga aagtaataag tttaatctta tctctttcag tacaaaaacc tctaagagta 2401 accaagtttg tattctacca ggttgtgtta acaaggtgaa taggcaaaat ttacttatgg 2461 ttttgtcaca tgttgttaat atcagaggtc agtgttatgc atatcactct actgcttctt 2521 ttgcttatgg atttattagg cacaccagta aaagttagaa gataaaggat atcattttaa 2581 gttgtgacaa agtagcagat gtaagtagcc aggcttgctt attctgcttg cctgcagaag 2641 tttacgaagc ccctcatatc gtgatggaat gcagccctcc ggaagaatgc cctgaaaatg 2701 ataagcagga cagatcatgg gtttccatgc ctgttgcatg aatcactgca tttttagaaa 2761 aggtaagtct aatgaccctg acccttgcct cttcctgtac ataagataca ggaattcatg 2821 actacacctc tttaatctat aaccagatgt acccttgcac ccaaactttt gatgatattt 2881 tgctttagtg gtaactttta taaaaaaatt agataaaatt tctgagcaca cacagatgca 2941 ccaccacctg ggtatggtga tcgctcttgg ttgaatcacc gtgttggagt agtctaacaa 3001 aaactttcca aaagactttc ctggggtgta attctcacta agactatgaa taacattaac 3061 tttaattatt tgaaagcttg attttcttct ttttagttaa aagttatcct gctatgtgat 3121 tctctcaaaa cattcattat gttttgcagg agaaaaatag ccaaaatatt aagaaatagt 3181 aaataataag catcacttat catataattg caaaagatat tataaaagtc ctgtaattcc 3241 agcacttcgg gaggccaagg caggcagatc acctgaggtc gggagtttga gaccagcctg 3301 accaacatgg agaaacccca tgtctactaa aactacaaaa ttagccagtc atggtggcgc 3361 atgctgtaat cccagctact caggaggctg aggcaggaga atcacttgaa cccggaaagc 3421 aaggttgcag tgagccaaga tcatgccatt cactccagcc tgggcaacaa gagtgaaact 3481 ccgcctcaaa aaaataaaat aaaataaaat aaaagacttt aatattttct agaatatttt 3541 atatgctagc attactagac tgggtttgat gggaatggat ttcttcgatg tttggacatg 3601 tattggacat taacttttga actttgtgaa tgcattatat tttcaaatga gtttattttc 3661 cagttaattt ttaacttaaa taacatatgc atgttttttg atacttcgaa ttatgcacaa 3721 tggaatcttt cctggggaaa caagaaaatg tgtgcaaaga tgtaatatgc tcacagagtt 3781 gttcttaata gaaacagggg aagtaaacct aaatatttat aagtagagat aatgttaatt 3841 atatctaaat 
ccatataatt tacataaaat ttatgcaaaa attaatgaag agacagttat 3901 actacacctc atgtaatctc taaatacatt tgcagattga tgtcttttca agacagggat 3961 gtcaccattt cactttaaaa tgagaaaaga gataagtaaa tagagtttca gaccagcagg 4021 ctaactttgg tataagaaaa aagaataatg gaagatagac aagagggtta gaagtaagga 4081 atcatcaaat aaacagttat gaaagatacc cagaattaga gtttatatac tgagaatgat 4141 ttctctaaca agaaatgtgg gtagaaatgc tccatcaaaa tgagattgta agccttaaca 4201 gagctattat aatgcaggat tctggattca atttggaaaa cagttgaagt aggtttaaca 4261 tgcataggat ggacgtgcag agggtgttaa taggatttcc ttccaaaaga ttacattgaa 4321 gtccccatgt gaggctatac aaaaaataca gttacatgtc taaatagtta gggctatctg 4381 aacaaatttt acagagaact ggccttggtc aaagaacaag ccttgacaca gggaaaggaa 4441 gcacacacgt gcacacacac acacacacac acgcacataa tgctgtctac tgaatacagt 4501 ttatagtcat aatcatataa aaactgatat ttttatatgg tctgtttttt aaaaggtttt 4561 gttatttctt tttctagttc tttactttcc tgtgtaattt tgtatttgca attgtttttt 4621 gagatggagt cttgctctgt ttcccaggct agagtgcagt ggcacaatct cagcaatctt 4681 aaaagattcc tgaataagta gttatgacta tttcacctct tcgaaggagt gtttttacat 4741 ttcagaaggc agatggagct taagactgtg gagacattcc aggagtttta agactcagat 4801 gccccttttt tccatagaga cagtccccaa aaagaaacta cttattttgg agagtataaa 4861 gctttctctc cttctcaaag gagattaatg ggcagagtgt atgtatacat aagtcagttt 4921 caaaatttct gggttcctca tctacagcac agaccatggt ttgtgtagag tgacatggaa 4981 ctagctttcc tcacttctct tcagaaaagg agagaattgg ggtaaacggg atctggaaca 5041 gttactgcag caagttggtg atagttaata cattctcttc tccaggaacc tggggtctgc 5101 tttcaggata atatgaataa atgtagatat taaaaatgta tcacccatat tctggtatgt 5161 ttcaatatac tgaaagaata tgtgtgtatg aacctcagag cttggtaatg tatgtgtaca 5221 tggtgctaaa cccaccaaaa acacaggaaa aggaaacaca caaacacaca cattcataca 5281 cacacacaca cacacacaca cgtaatgttg tctagcagtg aatacagttt acgtagtcat 5341 aatcatgtaa aaactgagat ttcatatgtc tgctttttaa aaggtatttt actgctattt 5401 ctttttctag ctgttaactt gccctgtaat tttgtatttg taattgtttt ttgagatgga 5461 gttttgctct gttgcccagg ctggagtgca gtggcacaat ctcagctcac tgcaacctct 5521 acctcccggg ttcaagtgat 
tctcctgact cagccacctg agtagctggg attataggca 5581 catgccataa tgcctggcta atttttatac ttctagtaga gatggtgttt caccacgttg 5641 gccaggctgg tctcaaactc ctagtctcat gtgatccacc attgcctccc aaagtgctgg 5701 gattacagga ataagccact tcacctggct ttgaaattct tgtattttcc caaatagacc 5761 ctttgcttga aaagtcatct ctttatattt tatttatgtt ttatcttttt ttattttatt 5821 ttaaaaacag gatgttcctc tgtcacccag gctggagtgc aatggcaggt ccatagctca 5881 ctgcaacttc aaactcctgg gctgaagcaa tcctcttgcc ttagtctttc aaagcactgg 5941 gattataggt gtgagccatc atgcctgccc atctctttac attttcattt catgttaatt 6001 ttatgctatc aaataacata aaatggctcc atagatgaag ccacttttga tctttcactc 6061 tttgtggcta agactatgca taggttttcc atagcagcct tgtcaattag gaccacctcc 6121 tttatgatcc tggttcttgc tgtcatcttt gagactgaga agttgcaaat ggaacaacag 6181 caacaatggc agctgcagca gacacttctc cagcattata gtgctgtaag gaggaatgtg 6241 ttctgtcagg ggctctacat ttactttctg gaaagaccta gactgtgact gctgtttttt 6301 gggttgccat ggtgagacaa tggccttagt gccactctga tttaccatca ttttgggttt 6361 cccatctcag cacatcattc ataggttatg ttgcagattt ctttttcttt ctttcttttt 6421 tttttttgaa acgaagtctc gctcttgtcg cccaggcagg agtgtagtgg tgtgatttcg 6481 gctcattgca atatctacct ccccagttca agcgattctc ctgccttagt ctcctgagta 6541 gctgggatta catgtgcgtg ccacacgccc agctaatttt ttttgtattt ttagtagaga 6601 cggggtttca caatgttggt catgctggtc ttgaactcct gacctcatga tctacccacc 6661 atggcctccc aaagtgctgg gattagacgt gtgagccacc gtgcctggcc tggttgcaga 6721 tttctttcag ctcttgtcat tccagttgaa gagagaccat tctagagatg gctgcaggca 6781 agcatttaaa acctttgaga gaacaacagc acatcaggga gagtattatc atgactattg 6841 ggaggataat accaagaagt tggagtatgc tctttaccta aggtaccaca aaccagaact 6901 taagattaaa tagatcaaag gataagctag ataaagagtc tacatactta actaagcagt 6961 cttttcatta atcttctacc agtgaatttt tccagtaggc catgagtacc agcaactgga 7021 gagatacttc tctgttctgc caattctgtg tgaactccaa aagagaattt aaagtttgtt 7081 gcatccttta cagtggaatt tgctatacag cctatcatga gcatacattt gtaatcattg 7141 cttcttctat tttaaaccat ggaagaaaaa cctaacaaat gatgcctttc aggaagagtg 7201 aaggcctcct ggtaatgttc tctttaaccc 
atgatgtggg ttaagaggaa tgaaccaata 7261 ttttgtttct gattgattat gaggcaatgt atgtaccatt aaagtttctt actgacattg 7321 ggcctttatc ttgtatccac caaagtataa agttatccat gtataaggct ggctgtaaac 7381 acattcacaa ataaaagtac accccataag tgcatataag agacctcctt tccatttcta 7441 ttgttcatag agccatagca agataaaatt ttcaaagata agtttcatga tagtagaagt 7501 cttaatctgt gaactttaga aaagttgctt acatcaaaga cagcatgctc ttctggggag 7561 aaacctccct ggttagtttt accttaaggg ttccaatgga tgtacagttc caagagtgtg 7621 gagggaccct tctcagctgt gagatgatta aacctaaagt tcaaggtcct taagttttgc 7681 tgtagtgtgg atggcaaggt cagtttttct ttgatgttct cagaagaacc aaactgtaaa 7741 aagctttcac agcaaggctg ctgtggtcag actgcctctc tagattcctt ctctctgggc 7801 gtggcatctc taaaaaaaaa aggcaagagc cccagtcagg ggcttataga ccaaaccccc 7861 atcaccctgg gacagaatac ctgggggaag gggtagctgt gggcagcttc tacagactta 7921 aatgttcctg cctgtcagct ctgaagagag cagcggatct cctagcacag cgtttgagct 7981 ctgctaaggg tcagactact gcctcaagtg ggtccctgac ccctgtgtat cctgactagg 8041 agacttctcc cagtaggagc tgacagacgc ctcatacagg agagctctgg ttggcatctc 8101 ctgtacgatg gtgggtatcc ccctagaatg aagcttccag aggaaagaac aggcagcaac 8161 cattgctgtt ctgcagcctc tgctggtgac acccaggcaa acagggtctg gagtggacct 8221 ccagcaaact ccagcaggcc tgcagcagag gggctgactg ttagaaggaa aactaacaaa 8281 cagaaaggaa tagcacatcc attcagagac cccatcagaa ggtcaccaac atcaaagacc 8341 gaaggtagat aaatccataa agatggggag aaaccagcgc aaaaaggctg aaaattccga 8401 aaagcagaat gcctcttctc ctccaaagga acacaactcc tcatcctcaa gggaagaaaa 8461 atgggtggag aatgagtttg atgaattgac agaagtaggt ttcagaaggt gggtaataac 8521 aaactcctct gaactaaagg agcatgttct aacccaatgc aaggaaacta aaaacctcga 8581 aaaaaggtta gaggaattgc tagctagaat aaccagttta gagaagaaca taaatgaccc 8641 gatggagctg aaaaacacag cccaagaact ctgtgaagca tacacaagta tcactagctg 8701 aatcaatcaa gaggaagaaa ggatatcaga catggaagat caacttaatg aaataaagat 8761 aaaagacacg attatagaaa aaagaataaa aaggaatgaa taaagcctct aagaaatatg 8821 ggactatgtg aaaagaccaa atctacattt gactggtgta cctgaaagtg atgggggaga 8881 atggaaccaa gttgcaaaac actcttcagg atatgatcca 
gaagaacttc cccaacctag 8941 caaggcaggc caacattcaa attcaggaaa tacagggaac accacaaaga tattccttga 9001 gacgaacaac cccaagacat gtaattgtca gattcaccaa ggttgaaatg aaggaaaaca 9061 tgttaagggc agccaaagag aaaggtcggg ttatccacaa agggaagccc atcagactaa 9121 cagcagatct ctctgcagaa atcctacaag ccagaagaga gtgggagcca atattcaaca 9181 ttctttaaaa aaaaagaatt ttcaacccag aatttcatat ccagccaaag taagcttcat 9241 aaatgtagga gaaacaaaat cctttacaga caagcaaatg ctgagagatt tttgtcacca 9301 ttagaactgc cttacaagag ctcctaaagg aagcactaaa catggaaagg aacaaccagt 9361 accagcccct gcaaaaacat gccaattgta aagaccatca acactatgaa gaaactgcat 9421 caactaatgg acaaaataac cagctagcat cataatgcca ggatcaaatt tacacataac 9481 aatattaaac ttaaatgtaa atgggctaaa tgtcccaatt aaaagacaca gactggcgaa 9541 ttggataaag actcaagacc cgtcgatgtg ctgtattcat gagacacatc tcatatgcaa 9601 agacacacgt aggctcaaaa taaagggatg gaggaacata caccaagcaa atggaaagca 9661 aaaaaagcag tttgcaatcc tagtctctga gaaaacagac tttaaaccaa caaagatcaa 9721 aagagaaaaa gaacggcatt acataacggg aaagggatca gtgcacgcta gtcacttcca 9781 ctcaacattg tgctagaggt tctagttagg ggaactggga aaggaaaaga aataaaagtc 9841 atatatactg gaaagaaaga agtagaccta tctctatttg cagatgacat catctacata 9901 agaagtccta aataatccct taaaaaaaac ccacaactac taacaaacta attaagcaag 9961 tttatgttat acaagacaaa tataaagaaa tcagttgtat ttctatacat tagcaatgaa 10021 catccaaaaa gtgacattaa gaaaacaatg ctatctgcaa tagaataaaa aagaataaaa 10081 tactttggaa aattttaaca aaagaaatgt aaaaattata cactgaaaac tctaaataat 10141 tgttaaaata agttgaaaaa tatctagata agtgaaagac accccatgtt catggataat 10201 attgattatt atcttaaaat taaagactta ttattaagcc tttaagatgc caatgcccca 10261 ccaattatca tgcagattca acacaatctc tatcaaaatc cttgctgcct tttttttttt 10321 ggcagaaatc gacaagctga acttcaaatt caaaaatgca agggactcag aatggccgaa 10381 acacttttca aaaagaacaa agttggaggg ctcacacttc ccggtttcaa aacttactac 10441 agagtcatta agatagtgca atactggctc acatgcatat gtatatgcat gcaatactgg 10501 ctcacagatg catattgata aaacaaaatt gtgaagccag aaaaaaaaaa acttccctta 10561 cctggtgaaa atgccctgta gcataataat ttactgttat 
aacatcaacc ctcttgcatg 10621 ctaaagcttt tatataacca gaaaacatgc actgaaaatt acaattgaaa tcccttaaaa 10681 gtgtttaaat ggcccaccag ctgaccaagt gtacctaaaa ttttgttttc ccaggaatat 10741 gggaccaaac attggataca agatatttta gtaatttgtt aagttaccac acaaatgtat 10801 tcaatttgga ttattttatc ttttccatga tgagtcatgg aatgcagaat ttttaataat 10861 aaaggcttta aagacttggg aaggacaagg tggccatcct gctggttctc catgagtcca 10921 tgcctaatta atattactgc ctcttgaata ccagttgatt ttccaaatta ggtgcatagc 10981 actgataaca aatgggttta tcattagcac tgataactaa tgggttatca taggtaacgc 11041 aacttgagac tgtggagttt attcaaattg tatatttaaa caatttcagt atcagctggt 11101 ttaacatgaa aatctgacaa ggtagtttct tggtatttaa ttttttgttc ttgggttagt 11161 aggtttatat aagaaaattt ggttattctg tgatttaaaa taacttaagg ctgggtgcag 11221 tggctcatgc ctgtaatcca agcactttgg gagactgagg tgggtggatc acctgaggtc 11281 agaagtttga gaccagcctg gccaacatga tgaaagcccg tctctactaa aattagaaaa 11341 attagctggg catggtggca ggcacctgta atcccagcta tgaaggaggc tgaagcagga 11401 gaattgcttg aacccaggaa gtggaggttg cagtgagccc aggctgtgcc actgcactcc 11461 agcctgggca acagagcgag actcaaaatt caaaaacaaa acaaaacaaa aaaaccggaa 11521 taaccataat tataatggat agcatatatt tagacattag aattttagaa atcccataca 11581 attttggaat gtatattagt cttattcacc aaaatataaa ataaagattg aacatcattt 11641 tgatgatccc aagtaactaa gcatgtcaag taatcctgtt aatctctctt ctggatgctt 11701 tcaggggcct ctctctgatc catccaaaaa gccaggcatt aggaaagaca ttggaactga 11761 agttagattt ttgaatttca gattaccata aattatgtat tttgccaaaa tcatgactcg 11821 gaaattttaa agaagtgaaa acctttcata taacctttta tttaaaaaaa aattatactg 11881 ttcttacaca ccttgcatgt aaaactgttt ctagcagtct taattgcatg ttacaatgtt 11941 gattcttagc aatttaattt taacataaaa cctgttatgt taattatgtg ctaggtgttg 12001 ataagatctg attgtttcca gcatagctag gggtgtggcc aactccacat gtccccagga 12061 cttacctagc tggaaagcag gcaagttaaa caattctcaa aagacaaaga agcagcttat 12121 aaacttaaag catttagcaa acctaatatt tggacataat ttagaccaca tgtttacatt 12181 ttgaagacat ttgtatttta ccaataacct ttaaaactgt ttataagatt agtaaagtta 12241 tgtgaactaa aaggcattac 
actttttact tacttaaaac aaaatatttg atttaagttc 12301 attattaaac gaattaatta aagctctttt tagtaattaa aacattacag aggagataaa 12361 caatgacttt ggtttgcaca gagagaaaga gaccagaact gtcaaacctg actgggtaag 12421 aaattcttac ttcaaattct atttacaaaa gtacattcaa catgcttaaa gtatcaagaa 12481 gcctaaaacc caaaaaatta gcttaaggtt aaaaggctag tgtgcctcca ttaactcctg 12541 taagccctga ccaagatagc ttaggaattc cagataaacg gtacaaaggg tgacttgata 12601 gaaatgcata ggaaacaaaa ataactattc acagaaccaa ataagagcct tccactagaa 12661 cctaaaagaa aatcatggtt atatatgtat aaatatgcat gcataagcaa aatctgaagg 12721 aaacagcaaa caaatgaaaa ctagaagcaa aaacaaataa acagaaaatc aacctgaagt 12781 ttttgtattt aatcttctct ggaggctaca gtgttaccca gaaccccctc aaaaaactat 12841 gcataatatt ttattcctga tacacaattc aatgtcctta agtttattaa gatcatcatc 12901 catcctgtgc aattcagaaa ttcactgtaa acacattttt aagagttaaa aaaaaaggga 12961 agaaacatga aatgcaattt aacagtgaaa gacagttttt ttttttgttt tttttttaga 13021 caaaacttga gaggggcttc tagctaattt cagtcaggag cactttctct tacaaactgt 13081 atgagtatat attggtttta ggctgaggga gtttattaca agtttggaat gtttctatat 13141 gggggagaag tttatggaag gtttggaata tctctgggag gaggggaggt tatcttgggg 13201 ctgacatctt tctggcagta gggggttatc tctgggctag tgtgtctctg gatggggagg 13261 agtttgtaat gtttcttgtt ggagatgtta tttgtggttt agggtcatgc tgacttttgc 13321 cattaggctg ctgtcctttg gatttaggct gttttttgat taagatgaat tttaaaatga 13381 ggtgcttgtt catgatggca atactcctgt tctgtcaaca tgaccaataa ggactccagt 13441 actggcacta tccacataaa atagtaaaca tagtgtgaag caatgcaagc atgtatgttg 13501 aaacttggct ccacactaaa tccagcttca ggcttaactg tatgtaaaaa aaaaaaagaa 13561 ctgtcaaatt gccagtgcat tttttacaat acttcttact ttgctttaat caagatgaag 13621 agctttagct atgaaaatgt taattagcca aatgtgtcca attctttatc aggttttaaa 13681 gaatatttta taatataatt tttccacatt tttctcacct actgaatggt tcattaccac 13741 attgtttcaa gaataacctt ttcaaatctg taatctgaac taccttttgg ataactactg 13801 aaatagacaa aatttttttg tttgtttttg tctctttgga gtctggggag gtactaataa 13861 cataatccta ataacataac ctttttccgg cacatttttg tatacagaat tacatgttaa 13921 
ctagaattct tatgaaaccc taaaatgtaa gaaattctga acctatcaga tatggacatt 13981 tacagataag aacaattcca caattttaga acatatttcc ctacactaca accctttctt 14041 aattggaaat gacgcagata tgaaatgagc atcggaaata acttttaaga ttttaattta 14101 cagaaagggt ttacctaaaa catttattcc attcattgta ctcaattgtt acactttatt 14161 aattattatt attattattt ttgagacagt ctcgctgtgt ctcccaggct gcagttcagt 14221 ggtgtaatct cggttcacta caacttctgc ctcctaggtt caaggtgatt cccccgcctc 14281 agccacccaa gtagctgtga ttacaggtgt ctgccaccac accaggctaa ttttttgtat 14341 tttcagtaga gatgggattt caccttatta gacaagatgg tctccatctc ctgaccttgt 14401 gacctgccca cattggcctc ccaaatgtga cattttgaac aagagagaca tgatacatca 14461 attaacacac gtaaaatgaa attggttcca tgtggaaagg tagggcaact caaagtgggg 14521 aggggggttg ggaacttcca ggtcacgcgt aggtgggaaa caaacggtag catttttttg 14581 agtttctgat aagcctttct gaggtaggca gtcggatatg catttatctt aatgagcaga 14641 ggggtgagtt tgaatacaat agaaggcagg ttggccataa gcattcccag cttgaatttt 14701 ctcttcagct tagtgatttt ggggagcgca aaatattttc ctttcataag ttactctcat 14761 tcatcaacag aaagcatgca aaccaagata attttgtttt ggctgagttt atagttttat 14821 aaccttctgt gccaaacact gacatgttaa aatagctagc aaagacaaac ataaaattca 14881 gacaaaatgt atgctgacaa ttcagaaggc atttctattt ttattctaaa ataattttta 14941 aaattagttt aaataatttt aaagccacct tgttcagtaa agttatattt gagtcacatg 15001 aacttgaaaa ttgcttatac ttatttactt aatttatgag cactcttagt tataagccaa 15061 tttggtagac acaccatatg acaataagtg tacatataaa ttaacacatt agacctgtat 15121 acacacacat aaacaaagat ccaatagctt agaacatcag ccatgagata gcaacataag 15181 cttgctgatt ttagcctgtt tgcctgaaca gataattcag tgaagtctgt gaaccaaaat 15241 ttctgctgaa ggagtctcca tggcagattt ttaaaggcca aacctccccc gactccaaag 15301 ggtactgggg ccaaacagca ctaaatgaga gtgtcacaca ttaaccaggc cccttgctta 15361 agacagcagt acaaaaacct ggatacatgc aatgtcatcc cactttccca ttcaactgca 15421 aactccaggt tccaaacaat attcgggaca agcagtatgg caattcagag aaaattctaa 15481 ggagggctta gtactagacc tcagaacccc tgccaagggc ctctcctttg gagagggtga 15541 ggtctgaaga attcccctac ctgtgcatcc cccaatctta gagtcagatg 
tctctgacct 15601 taggagtgta caagtgccag ttgcatgttt tccctccaga ggggagggag ggagggtttt 15661 tatttctaca agcggttaca gggagaaggc ctggaaatta ttgccaggcc gactcaaaat 15721 tacaaagttt tccagagctt ctatagctcc taagctctat atcctacatg taagtgtgca 15781 ttcatctaaa gacatatgtg aatacttctt ttaatctaca agcaaggtct gagtcctgaa 15841 gactttcctc tggagtctcc gtaaatttgc ttaatctcag tgggtgcagg tgctgggatg 15901 attatacttg tcacctgcta aatcataaag gtttgagaag ttccttcaga ccccctaaga 15961 aacttttttg tgaaggcctg aagagtttct tcagatcccc agtaaaactt gtttaatcat 16021 gcttttaagg ttcaggaaag gcctggggaa aactcttggt ggacttttgt tgcattccag 16081 cctttgtata aggacactgg ctttttcagc ttttaatatt taacttaacc catcagtcag 16141 tgctgaaaca gttgttatag aggcctgtgt tagtgagacc tggcctgcca cattaccaca 16201 tcccttagcc tgtttggcta aggctctttc aacacactta ggccatgtga aaagtgaaaa 16261 cgcatgcctg gcacagaata agtgctcagt aaacacatca atttgcttac tgtttggaag 16321 gaagctttat cacccaatcc ctgatttaag gtaggagtgt atattgccaa gtagtttgtt 16381 acattttctt acaccttgat tattttgaaa ttatgcatat ggcactactc aactaagaga 16441 gtgtgaaagg aaaaataaat gttggaaccc ccaaatcact acgaggccaa agagaaaagt 16501 caagctggga actgtggcag gcaaacctga gatgagaata agacaaagct aaaaaaaaaa 16561 gctacatagg gccgggagcg gtggctcatg cctgtaatcc cagcactttc ggaggctgag 16621 acgggtgcat cacgaggtca ggtgatcaag accatcctgg ctaacgtggt gaacccgtct 16681 ctactaaaaa tacaaaaaaa tatatatata tatattagcc aggcatggtg gtgggcgcct 16741 gtagtaacag caacttggga ggcggaggca ggagaatggc gtgaacctgg gaggcgagct 16801 tgcagtgagc tgagatcgtg ccactgcact ccagcctggg tgacagagaa agactctgtc 16861 tcaggaaaaa aaaaaaaaaa agctacatag attcctcaca attgccccaa aagagaaatt 16921 cattgtggac ctcaaggtct tttttttgag acagagtctg gctctgctac ccaggctgga 16981 gtgcagtgtc gcgatctcgg ctcactgcaa cctctgcctc acacattcaa gcgattctcc 17041 tgcctcagcc tcccgagtag ctgggactac aggcacgtgc caccatgcct ggctactttt 17101 ttgtattttc agtagagaca gcgtttcacc gtgttagcca ggatcgtctt gatctcctga 17161 gctcgtgatc tgcccgcctc tactatgagc tttcattttt gtcactgcag gaaaggcttg 17221 actgctggca tccttataat ttgataaggc 
catgatttcc catgcttccc attccatgaa 17281 cgttaatgat aggaactgga agctgggttc gttttttgtt cttagccagt tgagtaggat 17341 aacgggagaa taagaaaaga aggtttacat ctcctgcaac atttgcaagt ctactctgag 17401 ctgcatcaca cataggaatc agggaccaca cctggaaccg attaaaaaaa aaaaaaaggt 17461 ccttccccct tctggacagg acaattattc ctattcactc cttggcattc aggtaacaca 17521 agagagtgat cctagctaat tgccctcaat ttccaaggaa ctacagggaa acagccactg 17581 aaagaccaaa aaagaaaaga aaaaaaaaat tataaaagac ccagatacct taaacaaact 17641 gagcagtggt gatttggtgc ctccacacgg aaactcctag attcactgcc cacagccaga 17701 aacctgcagt tgcctctgtg tttaggcact gcccaccaag ggtctcaaat tggaaagcgc 17761 agaggttcta cgcccagcca tgcacctcaa gatctttact ccaaagcagt tctgtttaat 17821 ttcaccctgg cgacctacat tggtagctca tcttcacagg tgcaagacag aaagtcctcc 17881 atgtgcacat ctgcaacaaa tgcctatctc attgtttcct ctgctacact gtttatgtaa 17941 aaatacagac tcactgagcc agactatgaa agtaattgac tattcctcta cccttcctgt 18001 tttttttttt tttttttgac agagtttcca aagcgattct cctgcctcag ccctcctgag 18061 tagctgggat tacaggcatg tgccaacacg cccagataat ttttgcattt ttagtagaaa 18121 cagggtttct ccatgttggt caggctggtc ttgaactccc aacctcaggt gatccatctg 18181 cctcagcttc caaaagtgct gggattacag gtgtgagctg ccacacccgg cctctacctt 18241 tcctgttaca gttaaattgt gtatttagtg aaaggctgat cagagaccca aaataatgca 18301 accatttgtc tcttacgaca tcaaactcta ccttcccact gccccagccc cccaccaagc 18361 caccccaatg tcctgcacca tgtacatctt acatcttgcc ttcctaaaat gtatgaaacc 18421 aacctgtgca ctggccgaga gagagagaga gagagagaga gagagagaga gagagagaga 18481 gagaataacc agaaaggaac acaaggaaac ttttggaggt gatggagatg ttcattacct 18541 cgattgtgat ggtggtatcg tggatttacg cctatgtcca tactcatcaa attgtacaca 18601 ttaaggaagt gcagcttcat ttgactggtt gtttttttgt tttaatttaa tttaatttag 18661 tttattttga gacggagtct tgctctgtca cccaggctgg agtgcagtgg cgcgatctcg 18721 gctcactgca agctccgcct cccgggttca cgccattctc ctgcctcagc ctcccgagta 18781 gctgggacta caggcgcccg ccaccacgcc cggctaattt tttgtatttt tatttttagt 18841 agagacgagg tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc 18901 ccacctcggc 
ctcccaaaga gctgggatta cagaagtgag ccaccgtgcc cggcctggct 18961 ggttgttttt tgtatatcaa ttctgtcagg cggacgcgcg tgaaaaattc caatggaagc 19021 tgtgaccaca cacacgcctg cgcagtacaa cttacatagc ggacttcact ctcttaaaag 19081 tgaattgttt tgcaggtttt acgacatgga cagagctgtg cagccatcat cacgagctaa 19141 tccaaagcaa ggtattcgac ttaaaattca aatgccatga aacctatcct ttcaatgttt 19201 acggtacaga cgattttagt actttcccaa acctatgaaa ttggccccaa tcctgaacaa 19261 ggaatgattt gtctctgaaa caaatcatga ttttgcaatc acagcatgtc acagtcacaa 19321 ggaatcatat aggttatttc tcacgattgg catttttttt taaggctcct aaatcgtgct 19381 gtttcaggac ttttgctaaa tattttgaag gtcggcgaaa gataagttta agagtgtaaa 19441 ctgcgtcctc ccagctcagc caacttgacg gggggtaggc acaacaaaaa gtcctgcggc 19501 ataactgaca cctgtaaatc agtatcagaa actgaaaatg ctaagaaatt cagttccagg 19561 atatgaactc tacagcggaa gaataagaaa ccggaactaa acttctcact catctgactt 19621 cctgcggttt acgacaacag actttggttg caccggaaaa gaacagattc tcttccgtcg 19681 cagagtttcg ccatggtaag accgatagtt cctaactcct agtatatctg cctccatcat 19741 ttgaaaaagg gccgttctac cttggcgatg tttcactttt cggtcgcccc tacttggtgg 19801 gtgggaatgc tgctggtgct gcttcaggaa acccgatttg gagaggggag gtctgagaag 19861 agtgtgtctc gttgtgctcg gtctgggggc cgtcacgtga gtccatcctt gacgcagccc 19921 ggcagtggct ttttacccgg attagcttat tttcacacgt tgaagcagcc tcagattcgc 19981 ttggagagca ctgtggacgc tctcttgacg gatgctaggg agacccgcca agttttcaaa 20041 ggtctgtgct ttgcgtggtc tcttctgggg tctggcagga agtaatattt gttcaaaacg 20101 gggcttgagt atgggctggg cgatcttttc tccttgagtc aggtattacc cgttagcagt 20161 tcctacagct gaagatgagg gcatttcact gtcagcggaa actaatttct ctgagtttcg 20221 tgtccttttc aggcccgggg ccccaagaag cacttaaagc gtgttgcagc gccgaagcat 20281 tggatgcttg acaaactaac gggtgtattt gtgagtataa ctttgttttt tgtggttttt 20341 ttttgttttt tttagtcaag agtgcatgct aaatacctgt ggctcggtat ttcttggggg 20401 cggcgtttgc ttgtttgtta tgtgggattt tcgttggtaa agacttcccg caccctgaac 20461 ttgtccttgc acttttctgt gcaacaaagg gccaggatgt cacgtcccga tggtttcctt 20521 tgctctttta gctcgccact tcagagggca cttcctgagc ccagagctgg gtgggttaat 
20581 gaatgtgtcc aattccacat aagaaaggag ttcagtgaca gtccgctgag tcccccatta 20641 ataatcaaat gcgtttaatt taaagttacc ccttgtttta gagggtggag tagctgcttt 20701 ttacgttgag cccgagttac tggtttaata gtaaaggcta caggagccca cgaaacaagg 20761 gcaagaagtt gtgtgagaaa gtctggaggc tcttgaaccc aaccttgaag acctggcagg 20821 atgagggcag tatactcaga ggcacagtag gcaagcatct cacaaggaga ctaggcaatt 20881 tccacattga tggtgtggtc acactaggag agagtggtta aaactgtaaa gtcttggtct 20941 tctgagacag agaggagcca gtgtgaagaa gacagaatcg aggcgcttaa gatgccaagg 21001 gttttttgag caggaataaa tgtggtaaga gactatattt agctctgagg gtctccattt 21061 aggaggcact tagtaactaa cttgatttgg cctcttcttg gtacagtttg aactctgaac 21121 actgagtagt agaaacttgt cttccttgtt ctgatatttt cagagagaac ataggcagtg 21181 tgttggctgg ggacctggat ttgcaacctg tccctgtcag tttctgcttc caagtgctac 21241 ctctgctact gtcacctact atttgacctc cctggatggg ccattgtttt ctctattgtt 21301 aagtggatga gcaaggcact ctttaagaag acttgctggg aacttggtcc cttagtcttg 21361 tgggttcctc tccttttccc tgtgacattg atgaaagtgt gtggttaatg atcgcgagtc 21421 cgttttaaga gagacaaagc tagagaggga tctgtagaag cagcttgaac ggtctccatt 21481 ggaagagatt cttgtgtgat ccaaacttaa aaggtaattg gggagtggtg ttcaagacag 21541 accttgagac ttttatatgt actggtagca gatatcctaa ggacagtata acaccaaaac 21601 tgcgagagta taggttacag gatttgttgc ggtgtgaatt tgatgacact taagccggca 21661 agcagccaag agaggaggaa cgcacattgg gtattacagg gtggtcctgc aacatcggca 21721 gggcttgtgt ggaactttga agcctggcat ggtgagagag tagtagaata tgtcttcata 21781 aggttttggg taaagcgtaa gctacacttg agatacatgg gatgtagtgt tgtgttcaag 21841 ggaggtttga agaaggagta tgagtggcag tggtgcttgg ggcagggcaa aatggggaga 21901 tggttgtgag gctagatttg gggaaggaag gggcacaaaa tgagatttcc atggttgggg 21961 atatgacctt cctaacaaga cagacctcta aactaataaa ataattgccc tcttgtattc 22021 ctcttcccca caacaagtta agttcagctg aacttatgca gaactaaagt acgtcatctg 22081 ctcaatctgg ttacataagg agtggatctt cttagattag gttacctttc cttgtcttct 22141 acaggcacct cgtccatcga caggtcccca caagctgagg gaatgtcttc ctctgatcgt 22201 cttcctcagg aatagactca agtatgcgtt gactggagat 
gaggtaaaga agatatgtat 22261 gcaacgtttc atcaaaattg atggcaaggt tcgagtggat gtcacatacc ctgctggatt 22321 catgggtaag gaaagagttc tttgttttgt tttgaagaag gaacaggttg agagaaggac 22381 aagatgtagg gctctcatgt tgccagtgat aaaactggac actgaacttc agatttctgc 22441 agaagttact gctggtcctt tcagaaggag aatgagggat agggagagga gtggcaaggc 22501 tcaagtgaaa agctggaaag gggttgataa tgtaaatggg atgaggccag cagtggctgc 22561 atgctggcgg attatcaaag ttaggctcct accttcccgg cagagactgg gagacagagg 22621 agctgtcctt aggtgttggc tggaacaaac agtaaattct tttggcagcc ttgagttttc 22681 tcaggcaggc attttgaggc agactgggct catcttgagc ttttagaagc tgtgttagtg 22741 ttgaggtctt cataggtcaa ggttgaagct tagccaagaa gagggctcag aggaaactgg 22801 ctaacgattg gtcaaggaga gaatctttgt cagtgaacag ggttttggct ctgtcttccc 22861 acttcgttga tgacttaact attctggatc tccaaggatg gatttgcagc tttttttggc 22921 ggggagggtg gaggaggggc agaagggtca cacacctcta gggaaaaagt gtaccaagag 22981 ggaaggtagg gaaaggtaag gaattctgac ctagtgcttt gttcaccctt ttcatggtga 23041 aaggctcagt acagcttttt agcctagcat tgtccagaaa gcagggggat gattactttg 23101 tcaggtgtgg atgctaccga gtggatgggg ttccaggccg gacattccta ggatccagcc 23161 actgagttag aaaagacaga ccttcctttt cttactttcc ttccttcctc tttttttttt 23221 ttagcatttt gctttgtttc tcaaaacccc aaatcctgag tgaggcagga ctcctgataa 23281 gcgaagtcat ggagggcgta aatggggcca ggacattgat tgggcagcct gctgagcctg 23341 gagccttgct tgcttaaatg tgaggtctgc ttgttcattt ggtttcctgg tatttgtggg 23401 tgtgctcatg tatgttttct cctcaggatc ctgagtgcgt taacccagca cacagtctca 23461 agcagctatt gcacaagaac ataggaagca atagagcagc cataaatgta ttcactttct 23521 tttgttggat gacatgtatt gggagaggag caggggttgg ttggtttcat tcaagcacct 23581 ccgccaagcc ctggggactc ctaggctgct aaaacgttct caggttcaac ctggcataga 23641 gtgttcctgc tgtgcaggca aaagaaactt tgtgtttgtg ttaacgatgt tccttatacc 23701 cttttgtgat cagatgtcat cagcatcgag aagacaggtg aacatttccg cctggtctat 23761 gacaccaagg gccgttttgc tgttcaccgc atcacagtgg aagaggcaaa ggtatgttgc 23821 acaagtcaag tgagctttgt aatttccaga cagataggtc ggttgctaag tttgtcagag 23881 ggtgtttaag ccagattccg 
ggaatatttc ctatctctac ttgtttgtct ggttgtttaa 23941 tccaggcatg taggtgcaca ctgatagaaa gtaatggatt ttccacacaa gactgtttta 24001 gaaacgggat tacaagtaag gagttctctg aaggacaaga tatttctgta caggaaagat 24061 ctgtgatatt gaccaggaag gcctgggtag aaatccagat ttcttgtcat tgattaattg 24121 tgtaactgca ggcaacagta catagaacct tttttgagtc tttgttttcc ttgtaaatgg 24181 tgataatatg ctttcctgcc aggtttctgt gactatgtta gtagtagtgc tgccttgcac 24241 ttgctagaca tagaagatga ttgactgcct gatgtctgcc tctgccctta aaactccctc 24301 ttccctcccc agatttaatg gctgaagttt caaggaaact ttcaagggag ttgctttctg 24361 tgtggacagc accttccatt tttctgtgta gggaagacct ctttggaaac caggatcacc 24421 cttcagtcat ccattcctct tttccctgga tttagggtca tggggtgact ttgtttttca 24481 catgacccaa tcacttacag tgaccagatg ccatctgaat atgagatgca ttctgtaagt 24541 gaatcacaac gtttcattgg atgttgtttc tacttagttt aaccagcttg cctttttctc 24601 tgttatttct ctgggggaaa cacattagag tccctaaagg aattgacaaa tatcggttgt 24661 ttggttgttt ctgacttatt tggtcgtgtc ccttgatgca gggtccattc tgtttttgtg 24721 gattcgccca caggaggtac tttgttttag atgataggaa ctaggccttg aatcatgcag 24781 tgtgttcagt cgcgtagggt gttggattct tttgtaccta aaaggtagaa agcacagagg 24841 aaatcaaatc tgatgactgg tttctgcttt tgtatggaat ttggatctca agagacaatg 24901 cagattggta gtccctgtga atttttaagt agggggagat ggagtgggac aaggagatac 24961 tggagtgttg ttaataccta gtactggccc taccagattg gccagccagg gtgtaaaaac 25021 cttgtgtcca ggagactatc gcatagtagc gagaagtgcc aaatggttgc cctgggttac 25081 ggtattggcc ttatccttgt gtaacaggtg agagacttgg gaagttttga ttgagcttat 25141 ttgtttgctg gttgtttctt tgggcagctg gaagggagca ttccctgtca caaaccccac 25201 ccagtttagc ttggcttgag ccaaggacac agcgagttca gagtggagaa tcagaggctc 25261 ctgggacgtc agttatgtca gggacacacc agggagcctt ttcatagttt attaatccac 25321 caggtagcat aggttagtgg agagacagaa catccaaaca gccctccacc cttgaaaata 25381 agactagaaa agtggtattt tgtgggtctt gggaaaacag cctgatggat ttcagacata 25441 atatttcccc gtaacaactt ggtattagtt gttgaagacc aaggaaatgg aagtcactgg 25501 atttaagaat gggagattag gtctgttttt gtgggtgatt gccattacag tggatgggag 25561 
aaaaagaaag agtttgggtg agcccatgat gaacttgcac acgagtaatg ggaactaagg 25621 cagtgcctgt tcccattgtg agtgggggta gagagcacaa gtatcacctg gaagatggca 25681 aagaagacag gggtttagaa tatacatccc cagctgtggt gtggaggaag cacatgtgga 25741 acagggtgat tgaatatttg gtggtaggat aaggagatgg tagctgcaga tcttggtgaa 25801 atacatggat gcttgagaga gttgtggatg gaaggacagt accttaatgc tttgggttca 25861 gcttgccttg caacttgcaa atagatgata agacagaagt aagtgctgac tgaaatagag 25921 gactgaaaca ctgctcaaaa gagttacagt agatgctgtc tgaatagttg gcacagaaac 25981 ctcattgata ctgggcagct tgtgttcagt gataggtgtg tatggggaga gggagatttc 26041 atattgcaga aggtttatga acagggtcta ggtggtgcag gttctgtttc ctgaaaagta 26101 gaaaatactg gttaaagaaa tcaaaaagag aaaatcagag taaccaacca aacagagact 26161 tgggtttatg gattaccatt cttgaacgtt gagaagactg aatctcataa aggtgacagt 26221 tattctcaaa gtagaagata cgggacagtg actagaaaga ggcagaaaga gggattttct 26281 tttgagatag tgggtgatgt ttgcccattt gttttagaga attggagact ggtgcaaaag 26341 gagatgatca agaccatgag tggggagact agtgcctttt tggtgggcaa ggaagaagac 26401 aggtggaggg aataggcagt tgaagtttaa gtttcagaaa agatggaacc tgggcagtta 26461 gatctgtttc tgagggaggc cagctgggaa agatttcata ctagtttgtg taggtgttgt 26521 acaaagggca gtggaaactt gaagcgaaga gtttatctgt ggtttgagag ggacagcgta 26581 gaatgctctc aggggagcct cacatttgac attcattttg gagaatgaca ggaggcttca 26641 ccttgcagtc aagattgtat ggaaagggtg tcaatggggc aggcatgttg tagtaggtgg 26701 tgatgtaatt acaaattatg actggtcaag actgctgagc agacagatgg tagagttgag 26761 gttttcttgt ggtagagtgc tattcaacag ttaagaataa actttttctc tgtagttctt 26821 ctattaaaat aaatctgaaa aaattcagta attttttttg gatacaggtg ttttttgggt 26881 tacatggatg aattatccag tggtgagttc tgagattgta gtgcacccat caccttagta 26941 gtgtacattg aacataatgt gtagttattt ttatccctgg ccccctccca acctccctca 27001 tctggtctct ggcccctccc acctccctcg tctgagtctc tgaagcccgt tatatcactc 27061 tgtatgcctt tgcatactca tagcaagata ggggttttgg ttttccactc ctgtgttatt 27121 tcactttgaa taatggcttc caggtccatc catgttgttg caaaaggcat tattttgttc 27181 ctttcaatgg ctgtttttct ccaaacttct gtctctgaga gtgctcagaa 
actgacacat 27241 tctggaggct accctgtcta agcattttgc ttagatatgc gtggcttaca tatacctagg 27301 ctcagctacc ttttatttcc gtggctccca accctctcct tttctctcag acatttcctt 27361 ttgtctcaga cattttatta taagcagttg gtttggcatc cttcagctca tactgttttg 27421 ttttcatcca ttgtatcctt gcctttgtgg ctttgagcct gcctacttgc aggactcaat 27481 ccgaactctg attcctttac cacatagagg ctgagcccca tactaggaac taaggattga 27541 actcagagca gtctcctgtc tgctgtcatt gttgtgcctt tctcacttgc cagtggtgac 27601 tcagagggcc ctacagtctt tgggagtctt ggggctggag gacagaaggt gatggggtgt 27661 cacatcttta aaatctgaag acagtctttt cagctgtggg acagaggaaa aataattggt 27721 tttctgcaga tgttgaggct acccctgtgg taagaggttt agctcaactg gaagttcccc 27781 tcatgatatg gccagtctta cccttagtga acctctagtt tcctggcagg atgggcctct 27841 actaagcgag aggacccaag atctaaatcc tgagctgttt tggccacatg aaggtgcata 27901 ggtaataaaa gtcgtctgag gtggatgtga ggaaagcctg ctaaagttgg cttttgcagg 27961 agcactccca ctaataaaag tgttcaaaag tttttagcct gccttagtcc tggtcttgaa 28021 gcactggaga cctggtcagt cagctctgta tagaggttag acacagaact gttacccgtt 28081 aatgtgctag catgcttaca ggcatttcac tcacttatag atagttgcta agtgtaatct 28141 gtatttcaca gtcatctgtg gatgcttaaa ggaccgacac gcatcgtgaa gcttgggtta 28201 acagttccag acggaactat ctcacaggat ggagggaggc cacttggaac tggaggcacc 28261 tttaaagaaa gaggccatgt cttttttcag agggtgtttc acatgtgcag ttaaaggccg 28321 aggacctttg ggcacaaggc ttcaagtttg atgttgtgat atagcagtga actgcatggt 28381 gctgttgcta tccatgcaga aactgttggt tagtgaaagg atctgtggag gatataggag 28441 aagaaaaagt ttttacttgt tttgtttttg ctgtctttct cctgactgtc agcagtacct 28501 gctaactatt ttgactccct tagcactgtt ctcagctcag ctgagagagt acactggaca 28561 ggatgtacat gaaaagtgga gggcagcagg taggctttga ggtgtgaata ggccttctat 28621 gagttcacag tcatagaact tgcttggacc tgtcttacct gggctatttg gagctaatga 28681 gccagggtgg aaaagggaca agattaactt tcactggaaa tgttttagaa tcacccaagt 28741 aggttcagga tgtcacatac aaaaaatgtg gaggacagtt gggtgatagg aatgcattca 28801 gcaggccgtg atgcaaagac ttgcttcatg aagttggagg ggcagaggta gagagcttgc 28861 atttcagagt tctttctctg gttatcaggg 
acagagggtt tctttctttt tttttttttt 28921 ttttttttga gacggagtct cgctctgtcg cccaggctgg agtgcagtgg cgggatctcg 28981 gctcactgca agctccgcct cccaggttca cgccattctc ctgcctcagc ctcccaagta 29041 gctgggacta caggcgcccg ccactacgcc cggctaattt tttgtatttt tagtagagac 29101 ggggtttcac cgttttagcc gggatggtct cgatctcctg acctcgtgat ccgcccgcct 29161 cggcctccca aagtgctggg attacaggcg tgagccaccg cgcccggccc agagggtttc 29221 ttatatctag cgtggagtag agctggagag attgaaaatg cccactagta gccctaggtt 29281 ggttaaagta acactgcatt agtaagtggt cactggattg gtactgatga ttggatagca 29341 cattttaaag gtaaagtgca agttactctt ttaaatgagg atggaagagt gaagttgaca 29401 gattgttttt aagtctggct ccactggttg cagaatgtat actgagtggt gtaatctttc 29461 ttatggtttg gtcccagggc ttcgggaggt ataaggggaa aatgtgtttt ctctgtctgc 29521 ttactcagca cctctagcct tacttaccgt agactcaaac aaacctggaa tcttagccca 29581 ctgctttggt gcttatggac tatagcttcg tacttttcca ctttagatta gagggcaaag 29641 agaagaatgg gctttgatag gcatattata ccccatccac atgtattcac cctgaggttg 29701 ggcaggttag tatccctctg tgcccttgtt atatcattta ttgtctgagg cagttctgct 29761 tggctgctta gtgaacctcc tttcagtacc ctggacactc tgtgtctgtg tccatattct 29821 ccctctgcct catctcacgt aagtattttg cctactgtag caccacagct tgatgatttt 29881 ctgttctggt acataacctt atctgctgaa tttggttgca tattggatct gtcttctctt 29941 atggggatgt ttatacctga ttggaaggaa tgtgccaagt tgtattcagt gtcttaatga 30001 cagcaccccc taccctgcct ctgcacacat tcacattacc ttcattacac ttgtttatgg 30061 atggccaagt acttacatcg ctgggtggtg gggccattcc tatagtcacc ttggtttgca 30121 ctgtgtttgg tttgctctcc ttctgccacc attcactccc tccaagttcc atccaaactt 30181 tcaaagggct tctgtcctgc tgtgccttac tatgacaggc agtagaacaa aagttttaca 30241 cactgtggcc tttttgtttg tatgcaagca ggtttctgat gtattgtcct ttagggccaa 30301 gttctggacc caaaggcatt cctaaagtta atttgggggt ggtacacaca gtggcagggc 30361 agccggaaag gaggatgtgg ttttcctttt ccttttggaa gaaaagggct cagtcagcct 30421 aattgtccct tgaagtgctt ccaacttttc ttttaagagc tttaattggt atgtttgcat 30481 gcagcatctg ggagtttctt caatttctac atttatgttt gacttctgaa ttatagtctc 30541 tgaagttttc 
agttggttgg tccagtaggc agttctgatc agggaactta cgcagctttc 30601 aattctttgc ttaactcact ggccaggaaa gctggaaatg atcttgagtt tctattagta 30661 gtggtggtaa atatgaggtg gtggtggcaa acaaactgaa cactcattgg agaatgcctg 30721 gaggggtgtg tcaactgtga agttttgatt tgaagagcgt gctctcatta gattgactgc 30781 ataggatcca ggccttaaca tggataagtg ggcttggata tgccaggctg atgaattcca 30841 gatttacctt aactggggag ttaatgtggc ctaagttcca catgtggaat aggagccttt 30901 gtcacctttt tggactgctg gcttatatta aatggcctca catctccata gcaccttgaa 30961 tggtgttaaa tctatagtag cagcagaaat aatattagct gacatttact gctttgccag 31021 gtcctttatg ttccattatc tcataaattt ataagatgta gtagtgtgca ataattattc 31081 ccatcttgca cctctgaagc ttaaggggac agtatttcaa cttcggcctt ttttatagga 31141 gaatgagcca agctctggtt actttactta ctgtgtctct ttccacacct cctaggcagg 31201 ttaaaagcag tgataagaga gaagctggca aacctagtta gtcctgagag cccaaagata 31261 cgggagagag gaaatgcagc agctgattaa aaaacaaaaa agtggctcct cgagttcatt 31321 tattgtacag tttctggagt atttctgtca tcttttcagg gagtttttct gtcatctggc 31381 actctgtttg tccttgtcat tttcatattt acccagaatt tgtcctgaga cttcagagga 31441 ctgtactcag tttaaatgtg gagtttttaa tatcttttaa cttttataag ctttttctcc 31501 caacaaagct gaaagaaaac ccactttgtc acttctgagc ggtgacagtc catggggagc 31561 ctgtggtatg taggatgggc attggcaacc ataaatggat tatgtttgtt tgagattaca 31621 tagtgtacca tttgaatctc ctgaaaattg cttctggaac aaaatggggc ttggataact 31681 cttgggctga tacatgtact tggtatgaga cattttgaac tcctcctgct gatagatgtc 31741 acgagcaaca ccagagtggt aagtgttaaa caaggaataa agcaagtagt taatgggttg 31801 tgccatttaa aaatgagatg aggaagagca gcattgtgta cagaaagcat gtagctggcc 31861 gagacagctg ttttacctaa cctggtcata aaagcctcat gatttttttt ttttttgaga 31921 cagagtttca ctcttgttgc ccaggctgga gtgcagtggt gcaatctcgg ttcactgcaa 31981 cctgcacctt ccgggttcaa gcgactttcc tgcctcagcc tcccgagtag gtgggattac 32041 aggcatgtgc caccacccgg gctaattttg tatttttggt agagacgggg tttctccatg 32101 ttggtcaggc tggtcttgaa ctcccgacct caggtgatct gcctgctttg gcctcccaaa 32161 gtattgagat tacaggcatg agccaccgca cccggcaaaa gccacatgtt cttagcagtt 
32221 cttagtagtg gtagagttaa gtttgtctct ggggctcttt gttaccaccg ctccagagtt 32281 gcccattgag gactatcaat tggaaatatg tccctgagga aaaaatgata taccagttga 32341 ttgtgtgtta ccactcagat acacaggcaa caggctttta gcctgtgata aaggaccaga 32401 ggttagttca gctgagcact cttccagcag ccaccagagg gttctgaaac agattccccc 32461 ttggcaaggt ttcccaaatg tctaaattac taaatcagta tgggagatca tcagctctgt 32521 gactggttaa tatagttttt cccaagagca ctccttctct gaccgaggaa atgctggaaa 32581 tcgatgggat aagtaatgtc tgggtgactg ctttaggctg gctttgtgtt tacatattga 32641 attgattttc cttgctgtgt ttatagtaca agttgtgcaa agtgaggaag attactgtgg 32701 gagtgaaggg aatccctcac ctggtgactc atgatgctcg aaccatccgc tacccagatc 32761 ctgtcatcaa ggtgaacgat actgtgcaga ttgatttagg gactggcaag ataatcaact 32821 ttatcaaatt tgatacaggt aagttttttt tttcccttgt gttgtctgcc acctccctct 32881 ttgtcttttc tttttctttc tattttattc ttctgttttg tctgtacacc cattatgatt 32941 agagagaact aaaacacatt tggttttctc taagagcata ttgtgtcatg gtctcttggg 33001 tttgcagtgg gttcttgatg ttgagtagag gtgtgatgtc cctttgggct atctccagta 33061 gcccagcctc attcaagaat gtaaaccata gctatgtaaa ctctgtgtcc attctgtttg 33121 tcagaagagg ggaactttgt tttaaatgcg ataaagttta gctgtgttcc ttgaggtgaa 33181 gggctgaagt tgtccagata atcttaagct agtgatgaga aaaagatgtg tctacagctt 33241 tgtttctcct ccctacacct caggctctga ctcctgacag gtttataccc aagtgcttgc 33301 taatacctga tcagctaagg aagtaaaata ttttgcctga cttgttttaa cagaaatttt 33361 tagacatgca gaaacactga aagaataggt cagggacaac aaacaatggc aactcctttt 33421 gtgccattct tcagtgactt tttttctgtt gctttttttg attttgttct gttgcttatc 33481 tgccctccct gtttacctgc ttatggcacc cccgcagtcc tcacttggac tctgatttaa 33541 gacctagcca aagtcctcag ttgccttggt gtttgaacct aggctggctg tcttgagttg 33601 agtactccac aggctagggt tgtctgattg tttgaccact atgaccttgt acctgtgctg 33661 cctgcacttc tgttatttac tgttgacctg aaaattttag ccaagaatgt ttgtttcttc 33721 gctgtgaaag ttgagagggc ttcccattag gggaaaatat tgagagggga tattaaacag 33781 gatgtgtcat gcttgtgata cctaaacctg ttgtttgatt tcagtgtgac cacagagata 33841 accagatatc atgcacctcc tagtaatagc aggaagtaat 
agcaggaata cttacattca 33901 ggaagtattc ttgtcaaaga atgaagacaa acttgggtaa agggacaaag gaaaaaatgt 33961 atatattgga acacacttaa gaaacatttg cggagttact tagattttcg agtggcttgt 34021 gagaatattt caggttctaa tgatttggta ctggccattt gtttctaagt gatccagcgc 34081 aggactgcat aggtaaagat gaatcaaagc tggccaggag tttaaaaatg ttacgaagct 34141 ggagagtggg taggtaggag catcttatgc ttgtctctca aacgtttgca tacctttgag 34201 aataattttt tttttaattt tttgagatag agtcttgctc tgtcgcccag gctggagtgc 34261 agtggcacga tcttggctca ctgcaagctc cacctcacag gtttgcgccg ttttccttcc 34321 tcagcctccc aggtagctgg gactgcaggt gcctgccacc atgcccggct aatttttgta 34381 tttttagtaa agacggggtt tcaccatgtt agccaggatg gtcttgatct cttgaccttg 34441 tgatccgccc accttggcgc cccaaagtgc tggaattaca ggcgtgagca ccgtgcccag 34501 ccgagaataa tttttaaaat ggatgtggac ttcttatctc caatgaaatg atatgttgta 34561 tctcatttgt ttattaaatc aaattcccag ttaaaattca acagattttt ttttctcccc 34621 cacctaactt gtttgttttt gttttgtttt gtttttgggg gagttttgtt tcattttgct 34681 ttcagacagt cttgctctgt tgaccaggct ggtgtgcagt ggtacagtct cggctcactg 34741 caacctccgc ttcctgggtt caagtgattc tactgcctca gcctcccaag tagagacagg 34801 gtttcaccat gttggtcagg ctggtttcaa acttctgacc tcaggtgatc cacctgcatc 34861 ggcctcccac agtgctggga ttacaggtgt gagcgaccgc cctggcccta actttatttg 34921 aaatatatat ccctttgtct tacacaaaca tgcttggtat taattttttt gcttatttgt 34981 ttattactta ctagtttact tagtttagtc ctgggagtgt acataaaata atctcagaat 35041 gacaaagcca atattcctag taacaagaaa cccactgagt gagggcaaag gtttttttgt 35101 agttcttggt ttgtttggtt ttaaatgctt aaataatttt taattgtggt ttcaaaaaaa 35161 aaagatacgt cagagattta ccaccttaac catttttaag tgtacagttc gttagtgttg 35221 ttaactatat tcacattctt atgtggcaga tctccagaac tctttcatct tgttaaactg 35281 aaactatgtg ctttaaccaa ctttgccact ttctcctttc tacaacctct gacaaccacc 35341 ataccactct gcttctgtga atttgactct tttgggtggt gtcttaataa gtggaattga 35401 tagtgtttgt ccttttcgac tagttaactt tgcttggcgt catgtcctca gggttcatcc 35461 atgttgtggc atgtgaaagt atttctttcc tctttttttg aaagagtttc actcttgtcg 35521 cccaggctgg agtgcagtga 
ctttatctcc gctcactgca acctccatct cctggtttcg 35581 agtgattctc ctacctcagc ctcctgagta gctaagattt ctggcaccca tcaccacact 35641 cagctgattt ttgtattttt agtagagatg gggtttcgcc atgttggcca ggctggagtc 35701 aaaacccttg acctcaggtg atccaccctc tttggcctcc caaagtgctg ggattacagg 35761 cgtgagccac cgcacttggc cagtatttcc ttcctctttc taaggcagag taatactcag 35821 ttgggtgtgt ataccacatt ttgtttatcc attcctctgt ggatggacat ttgggttgtc 35881 tccacctctt ggctactgtg agtaatgctg ctggggacac agcggtgcag acaactcttc 35941 cagatcctgc tggatgtttt tgtatacaca cttagaagtg gcattgctgg atgagatagt 36001 tttgaaattt ttgagaaacc attgtgcggt ttttttatag cagccttgcc attttacatt 36061 cttaccaaca gtgtagaaga atttcagttt cttcacattc cttcccacac tttgctgttg 36121 tttgcttttt atatattttc tttttttttt tctcgtggct atcctggtag gtatgaggtc 36181 atatctcatt attatttttt taaatttgga gttctgtgat gagtagagat gttgaacatc 36241 tttttgagtg tttgtatgtc tttggagaaa ggttttattc atgtcctttg tccagtattt 36301 agctaggttg tttctggtat tgtggagttg taggacatct ttagatagta tcagatgtaa 36361 gatttgtgga tgttttttcc cattccatag gttgcctttt cacccttact gattgtttct 36421 tttggttggc aggcattttt aaatttgtca ggttttgctt ttggtgtcat gtccaggaaa 36481 ttatatctct tatgtcatga agcttgtggt ctctgtgttt gtctttggag ctttacagtt 36541 caaggcctta cgtttaatct gtttagagct ggtttttgta tatggtgtaa tgtaaggtat 36601 tctcatgttg tttctttaca tggggacgta gatttaccag caccacttgt tgaagtgact 36661 gtcttttctc cattgagagg tctgggcacc cttgttgaag atcatttggc catatatgct 36721 tgggtttatt tctgggctct attttcttcc tttggtctgt ttgcctgtct tttatgccag 36781 tatcacactg ttgaattact gtcattttgt attttgtttt caaattagga agtgtgttat 36841 ccaagtgagt caaccatgct caacttcggt tttgtcagtt tttgttaatc tgttttaaaa 36901 tatcaaccct tagttttttt tcttttttct tactgttttt ttcttttatg tctgctctgg 36961 tcttatttaa ttttcttcct tctgccaacc ttgggtttat tctgtttttc tagttccttg 37021 taatgtagtt aggctattca tttgaggtct ttcttctgtt gtaacataag catttttgag 37081 ctgccttgtc ttgtgtgaca atttttgact tatagtctat tttgtctgac ccaagtttgg 37141 tcacccctgc tctcttttgc ttactatttg cctggaacat caacttcagc acttcacttt 37201 
cagcctgtgt gtgtcgttaa atctgaaacc ggtcttttgt taacagcata tagttgtaac 37261 ttgactggtt ggttggctgg ttttattcat ttagctagtc tgtgtcctta gattggggtg 37321 aggagttaat ctgttttaca gtcaaaatag ttgctgttaa ggaaggcatt agttagtata 37381 gctctgttgt taattatttt cagtatgttc tgtagctatt ttatacctat tttttctccc 37441 ttattacctg cctttttgtg ctctgctgct ctttactttt tatagtgaca aaaaataaaa 37501 gaacactttg aatcctttct gattttgtgt gtcttccatg gttattttct ttgtggttac 37561 caaggggatt acataaatta tcttaaagtg aaaacagtct gttttcaact gttaccttca 37621 gttgcatctc catatcccct cagacaaatt ttgttattga tggcacacat gaatgttttt 37681 ctattgtgta tccttttacc attggtttat ggttattgat gtgcttttgt gttttaagtt 37741 ctataccaga cttaaaagtg atttatacac ctttgtgatc taggattctc tatttgtctg 37801 tatatttacc tttatcagaa agctttatat tttctttctt tctttctttt tttcatatgg 37861 ttttgtgtct gacatccttt tgtttcatcc tgaagtgctt ttcttcagca tttctcgtaa 37921 ggccagccta gtggcttttg cttaccttgg aaggtcttat tttttctttc attttgaaag 37981 ttatggtatt cttgtgaggg actaaaccct gatgattatc ttacccaaat ccctatctaa 38041 gggatctgga gagttacgcc ctacaaatga gcatgactca tcatcagatg ggttttattt 38101 aaccctatat ttcatgactt actttccagt ctgactctgg gataacatta cgagacaatg 38161 aagaaaataa aaatatttta ccccaaaaca tgtttgccat attttgaaat ggacttgcaa 38221 agctgctctt cgtgggagtg gaggagaaat ctgcatctgt aaagaatctc tattaaaata 38281 gataaatctt tttcttccag accctcccaa tcctaaagag ataaactaaa atctgaataa 38341 gaaacattgg tcatttattg tctctaaggg caaccacttc cctaagactt cagaagaact 38401 tcagtctcca tcatctttta tcttaacctg agcattccct ttcccttctt accaatccca 38461 ggcaaactca accgtcaacc agaaaatgtt taaattcacc aatagactgg acctgataac 38521 cccccaaccc tgccctttct tgaccaaacc agtgtatttc ttaaatgtat ttgattgatg 38581 cctcatgcct ctctaaaatg tataaaacca agctgtgccc tgaccacctt ggttacatgt 38641 tctcagggcc tcctcttgag ggttgtgtca tgggccatgg tcgctcatat ttggctcttt 38701 tatagttaga ctcttcgttg tcacttggtt taagcacttt gaatatgtca tcccactgtc 38761 ttctggattg cagagttata gcttatctag caagagctcc cttgtaaatg atgagtcact 38821 ttttacttgc tgcttccaag attctttctt tgtgactctt gagagtttga 
ttatattgtc 38881 tcttgatgct ttgggcttat cctagttgga gttcagcttc ttgaatttgt atgtccatct 38941 tctttcacac aactggggag tttgggccat tattctcttc tagtaggctt tttgcccctc 39001 cctgtctttc ccccaaatcc tgttctcatt ttctctgttt ctgtcttttc tccttctagg 39061 actcttgtag tttgtatatt gttcctcttg atggtgtctg ctactccctc tcttaggctc 39121 acctttattt ttcttttttc ctttttgcgc ctctgacttg ttaatttcag tggcctgtct 39181 tcaagtttgc agattctttc tggacctgtg atttttaaat tctgttcaat tgtattcagt 39241 tccagagttt gatccttttt tgtgggtttc ctttacattg ttgataattc tcatgttgtt 39301 tatgggttgt ttttctgatt ttgtttgtag ctgtttcctt agtttgatat gcatctttaa 39361 gatggttagc ttgaattttt tgtctggtaa ctcatacatc atcgtttctt taggcttatt 39421 tttctaaaga aaaacagcca tctcttccag tcttccctat gtatcttcat acagggagta 39481 tgaccttcac cagtcagcca ggctagaggt tttgggaatc tcatctcagg gcttgcatgt 39541 gtactttctt ggctgtgctt ctctgaagcc cctaacctct tgctttgcct gatgcctgtc 39601 tgtggtacta tggcctctct gctgttgtaa caactgtctc atataccctg ctccaccccc 39661 atccctgtat gtcttttttt tttttttttt ttttttttga gctggagttt cattcttgtt 39721 gcctaggctg gactgcagtg gtgacatctc tgctcactgc aacctctgcc ttttggattg 39781 aagcgatttt cctgcctcat cctccgagta gctggggtta caggcatgtg ccaccatgcc 39841 ctggctaatt ttgtactttt agtagagacg ggatttctcc atgttggtca ggctggtctt 39901 gaactcccaa cctcaggtga tcctcccacc tcagcctccc aaagtgatgg gattacaggc 39961 ataagccaca gtgcctggca atttttgtaa tttttcgtag agacagggtt gtctcatcat 40021 gttagccagg ctagtgttga attcctgacc tcaggtgatc tgcctgcctt ggcctccaaa 40081 aatgcaagga ttactggcat gagccaccgt gcctgggtat tttttttttt ttttttccaa 40141 gacaagagtc ttgctttgtc acccaggctg gagtgcagtg gcatgatttc ggctcacggc 40201 aacccctgtc tctgaggttt aagcattctc ttgcttcagc ctcccaagta gctgggatta 40261 taggtgtgtg ccaccatgcc tagctgtgat ccactgcagt agaaaatttt gttttcacat 40321 actgctttgt ccataacctt cctctcactt cagtttttgg gacttggatt taagttgtat 40381 tagagctttt cacagtagac ctacttgtcc agtcttctcc atttttcatc cttttgtctt 40441 tccatggtgc atcatgagca gttactttca gcaccttcta tttttctgat tctctcttca 40501 actaactata gacatgtttc attaaaccca 
ttcctttcca ttcttaattt tggtaaccgc 40561 atttttcatt atttgtttga ttttcaaatc cgctaggtta ctctgcaaag gtgggcttct 40621 tgttgattta cattagtttt gttatgtttt gtgttttctt ttatccttgt acaccctttt 40681 aaggcagttc caagttctgc atttggtttg tgctatgttg agggtggtgg tctttatcat 40741 ggtgtcttag ttccttccat gctagacact acatgtaaag tactgttagt aagaatacta 40801 tgcgatcaag atgattctgc cttccttctg tctggcatct gggctgtctt agcagcgtga 40861 gatcaccttg atcaagactt gagattccct tgctggctgt ccagagaggg tgaatttcag 40921 attttaagtt attaggaaga aggattgttt tgggctcagg tgtttgtgag ggttttgcct 40981 ttggggtcca gctaattatg ggaagcgtct gctgttttca cctggagact gaacccggag 41041 ctttgatttc tattctattg tccctgtgag accatcaaaa agaaaggagt gtcccacaac 41101 aaaagtcaac attagtttgc actcagtctg ttttcctggg agttgtctga tagacgtaaa 41161 gaatagagct gacacattca agaattataa aatagtgatt catgggaagg gtctccgctt 41221 tcaacctcca tttactaatg tgcccgtagt aggcaccatt ctaaatgtca ggggcttttt 41281 actggaactt aacacatctt gaaagaatca atgttttcat caatttgtag cctttcttca 41341 cttttagaag cagtgacaaa caagtgaagt gaaggaacta ttattaacaa tttgaatttt 41401 accataatta gctctgtttt tttccctatg catactaagt ccatagtgtt cccacatgag 41461 ggttctggag aatgaactaa aaaaaaactt ggtgcacctt ttaattgaca taagcattgc 41521 aaggcatttc tgctactctt gcctaagtac tgattttctc ttaaaggagg ctctgcaatt 41581 cagtcttgag caaaatagta agttttcatt gacatttcat cttaccgctt tttcgggatt 41641 atcttctgct taccagctag ctcaagccct gcattttgtt actgagcagg tgcttctatc 41701 caaaccttcc ttaactccaa acctaactta actttaatgt aaagtatagt ataagaaagt 41761 tctttgtaac tgaggttgtt aggggctgct cgtttactgt cttagtagtt gccacaagga 41821 ccctaaagga ctggccaagt acactgactt aagaaacaaa atcttgggcc ttttatgtga 41881 ttttgttatc tttcagatac aggtctgtgg gtactttctt gtgactttag tgggtatcca 41941 taccatctta cacaaaatta ctttcaaatc atttctctaa ctctccagta cagttgctaa 42001 tgttgaggca aaacctttgg tccttttaat tcctatggct tctcaccatg tcaaaacaca 42061 tataagcagc tggccttggg tttctgttaa aaatctgcat gttggatcaa ccaatgtatt 42121 tgtggtagaa agggtcaaac tgcagaaatc tgtttgatca ttgtcagacc aagcaatttt 42181 taccttagag 
ataattgtaa ttttcttgca tcctagagcc tcatctttaa gatgctcaga 42241 ttgagttggg ttcttctgtt tcccatcagt ttttgagaga tcaataagtg taatatgttt 42301 gtgccattga ttcataggtt ataatttcaa atttatcctg gagtttttta ggctaggtta 42361 cttgtttgtg ttttgttttg ttagtattaa gacagggctt tattctctgg ctttggtggt 42421 gtgtcttctt taaaaaaaga ggatgtgtcc tttttccttt tttttttaaa tgtgtcttca 42481 cataaaaaag aggacttgtg gctgagtctt tattagcagt ggtttgaggg aaagtttttc 42541 ttaaaacatg tttattctgt gataggtaga acttccttga caaggtagcc tgtgtcctag 42601 atagtatact tatattagat ggttatgtcc cgcaaaaata actctagatg tctgaaacaa 42661 atgtattcct ccaggggtcc cctgagagcc cttaaaagat tccccattga aagactggat 42721 tagctagcca tccagttaaa ggcctttctt tattgtctag tgtctttctc atagcctgaa 42781 tttctccagt tctccctccc tctcagtcat cctccttatc tctctaccca tccaccctac 42841 ccagtgctgt tccatgccat ctttgtcaac cccaaaatga ttattcaacc tttgaagaat 42901 tattgttgga caaacaatgc agggaaagct tatgatttga aaaatcatca ggggcctgcc 42961 tttaaatgct gactgcaatc taattcacaa atagcattcg tttacagtga caagagttct 43021 tgaaattaca attttgggat gatgctccat tttttttttt cctggtgagc ccgtgtgtgt 43081 taatttgttt taaaggcaac aggcccgtct ggttttaaaa tgactcctct cctgtttatt 43141 ttcctttccc tctaggcaat ttgtgtatgg tgattggtgg agccaacctc ggtcgtgttg 43201 gtgtgatcac caacagggaa agacatcctg gttcttttga tgtggtgcat gtgaaggatg 43261 ccaatggcaa cagctttgcc acgaggcttt ccaacatttt tgtcattggc aatgtaagac 43321 ttacactctc tttacttctt tctaagaaga ctcatcctaa aaagcataaa gacctgcatc 43381 cactactggt gtttgtctgt ttgttctttt tccagtgtta aggttcacag agtgcagcaa 43441 gctctctatt tagagcctct gagaagaaac aagattagat ctagagcctt acacctgcct 43501 cttcaggtgg tctgtgtgag atgccagcag tgtcattcag taaattaagt taaactcagc 43561 cttgcaactt ttgcagggct acatcactgg tatgggctgg attaaggtca ctggctccaa 43621 tcctggtagt gctgtgcccc cttaattttg taagtgagtc tgctaaaccc ctctgagact 43681 ccatcttaca gttcatcagt gaacatggag ttgatggaga ttgatgcaag ggggctatga 43741 gtgagttgct aatttgagaa taagggcatt tacaaggcac gtaacacttg ttagtttact 43801 tgttcaagat gatcttgctc tgggatcttt tgagaccttt tcagtgaagg agcacctctg 
43861 aaatcatgca gtatccttac atgggagtca cttgccccaa gacttaagga aagggcaacg 43921 gacataattg aaaaggcttt ttatttttat ttatttattt attttttgag acggaatttt 43981 gctctgttgc ccaggctgga gtgcagtggc acgatctcag ctcactgcaa gctctgcctc 44041 ccgggttcac gccattttcc tgcctcagcc tcccgagtag ctgggactaa ggcgcccgcc 44101 accagacctg gctaatttct ttttgtattt ttagtagaga cggggtttca ccgtgttagc 44161 caggatggtt tcgatctcct gacctggtga tcacccaact tggcctccca aagtgcaggg 44221 attacagatg tgggccacga cgcctggcca actgaaaagc ttttcatcaa gctattcagc 44281 cccgggagca gcatctttca cttgtaccct ggcagtcctt aggatatttg agagaagtgg 44341 tacagaggcg aagcctgctg ttgttttttt ggctgtgcca ctcagaagct ctatagttgg 44401 gccattcttc ttctcagtat tgaattatga aacctcagtt tatacctttg aaagcagggg 44461 atcagggaag tgagtaaatg catgtaaaag ttctgtaaag ctgatgatag ggtgctgaca 44521 gtacacaact tcaaaactat gagaacctga agttagtggc cttgccttta aagatgctta 44581 tgaaaaactt gttggctgat tgcttcacat aaaatagaag caggaagacc tttatttagc 44641 agctcagcag tgaggagtgg gagggactgt gagaggcaga taacctctca ggagacatgt 44701 ataaggaagt gtttattttg ttttagattt tgttgggtgg tggtctgtta gctggctatt 44761 aatgtgaagg aaggtatgag gcatgtgtgt tttggtggga tgttgttttt ctctcctccc 44821 tttctttctg tacttacttt tatctcctct tctattgcag ggcaataaac cttggatttc 44881 cctgcccagg ggaaagggca ttcgacttac tgttgctgaa gagagagata agaggctggc 44941 caccaaacag agcagtggct aaattgcagt agcagcatat ctttttttct ttgcacaaat 45001 aaacagtgaa ttctcgtttc ttaatgtgtt ttttccccct tgtggataat agtagggttt 45061 gaatttgctc atgattttgg cactgtctta agatctctag gaataccacc tatacttcct 45121 ctgaccctcc agtaaataaa cctttgcttc cctcatacat gttacccaca ccatctctgt 45181 gggcagagtg atggagtagc attccaaaac gaagtacaaa aaacgtatga catttcagaa 45241 aggacggcat tcttttgtgg aagctatggg agatggtctt agttttcttg cttgttcaca 45301 ttattggtag aaattaaatc ctgtggctgt aggaaagagg tccctttttc cttgctgact 45361 gacaatctga gcgcttttct cagcttctgg aggctatgtg cattctttgc ctaaatgtcc 45421 catttctcca tcttgttcaa agccagcagt ggcaggtcag gtccttcaca tacagagtct 45481 ttggtctttc tttgacccat tcagcaatgg ttctccactt 
aaggggggcc cattagagta 45541 acctcttatt ctcatggatt gtcactttaa tgtcgtattt ataggatctt tggatattgg 45601 ttttcttaca gaaacttcta tggctggtgg cgcctttgcc tgagttttgc ttggcctgct 45661 gggctcgttc tgcccacttg gcatcgcagg ctgtgcttgg cgtataatac tggcctggat 45721 cccatgcctg ccaagggcaa accaggcgtg gagcagtgag gaggggtatg tgaacaagca 45781 tgaggtctgg ccactgcagt cacacatgtt ggctgctgcc gctgggcagg cagctccagg 45841 tgccaggatg ggtgctggc // LOCUS AF042001 4034 bp DNA PRI 05-FEB-1998 DEFINITION Homo sapiens zinc finger protein slug (SLUG) gene, complete cds. ACCESSION AF042001 NID g2832265 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4034) AUTHORS Cohen,M.E., Yin,M., Paznekas,W.A., Schertzer,M., Wood,S. and Jabs,E.W. TITLE Human SLUG Gene Organization, Expression, and Chromosome Map Location on 8q JOURNAL Unpublished REFERENCE 2 (bases 1 to 4034) AUTHORS Yin,M., Cohen,M.E., Paznekas,W.A. and Jabs,E.W. 
TITLE Direct Submission JOURNAL Submitted (07-JAN-1998) Center for Medical Genetics, Departments of Pediatrics, Medicine, and Surgery, Johns Hopkins University, CMSC 1004, 600 North Wolfe Street, Baltimore, MD 21287-3914, USA FEATURES Location/Qualifiers source 1..4034 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q" mRNA join(<447..525,1271..1816,2724..>2905) /gene="SLUG" /product="zinc finger protein slug" gene <447..>2905 /gene="SLUG" CDS join(447..525,1271..1816,2724..2905) /gene="SLUG" /note="similar to mouse SLUG protein encoded by GenBank Accession Number U79550; entire SLUG gene sequence is contained on cosmid 185D8; cDNA encoded by IMAGE clone 29339" /codon_start=1 /product="zinc finger protein slug" /db_xref="PID:g2832266" /translation="MPRSFLVKKHFNASKKPNYSELDTHTVIISPYLYESYSMPVIPQ PEILSSGAYSPITVWTTAAPFHAQLPNGLSPLSGYSSSLGRVSPPPPSDTSSKDHSGS ESPISDEEERLQSKLSDPHAIEAEKFQCNLCNKTYSTFSGLAKHKQLHCDAQSRKSFS CKYCDKEYVSLGALKMHIRTHTLPCVCKICGKAFSRPWLLQGHIRTHTGEKPFSCPHC NRAFADRSNLRAHLQTHSDVKKYQCKNCSKTFSRMSLLHKHEESGCCVAH" BASE COUNT 1184 a 863 c 777 g 1210 t ORIGIN 1 cccggctcct gcgcccctcc tagctcccag agagcgtgga tcgcgggcgg ggctcaccga 61 gcgaggttac ctctcttgaa aatacttaaa cacttttttt cctctccact gaaatctcaa 121 aaaacagccc attttgaacc agaataattt agtctgacaa cagattcttc ctctgttcac 181 agctgtccca gagggaggag ctgaaatctg aacctctcag ctgtgattgg atctttcttg 241 caaaagagag gaaaaaaaaa ccctcccagc caaaacgggc tcagttcgta aaggagccgg 301 gtgacttcag aggcgccggc cagtccgtct gccgcacctg agcacggccc ctgcccgagc 361 ctggcccgcc gcgatgctgt agggaccgcc gtgtcctccc gccggaccgt tatccgcgcc 421 gggcgcccgc cagacccgct ggcaagatgc cgcgctcctt cctggtcaag aagcatttca 481 acgcctccaa aaagccaaac tacagcgaac tggacacaca tacaggtaaa aagagaaaaa 541 tatatctaga actacgtatc tagagctttg caaatatgaa tctacgtatt acccttcaaa 601 taccattaca aaagagccaa agattcaact gttttccttt ttaacagtgt cctgaaaggg 661 agcttacatt cttttgatca tttagggttc gttttaattc aaaactttta aaattcttga 721 ctaacaagga acatttcatt caaatctcgc agcttcgttt gctaaagcct 
gtgtatacat 781 gcataatgag tatgtgaaag atgagcaaca taaggattga attactgcgt ttttcatgca 841 ctaagtctcc tttcctttag aaggcggctt gatgcaggca tatctttgta ctcacggtcg 901 taatacacct aaatagccca tacgtaacat ggaaaacaac ttttctgtat gtttaaaaat 961 aatgttgagg ctctccttcc tcaatgggga cggcgaaggc actggcttgg ctgcactgtc 1021 tgacttcaga gcacctatag ctccgtgcag ctgtcagcct atgtgtgcat taagtacagt 1081 aactccgtag gtgccagggt ggaaatgaaa cattttacca gtgtgtatgc cctcctaaat 1141 gggtctatct tcttcctaag aaaatcacgt gtgtgttgct aaatatcttt ccagtgtgtg 1201 tatacttgcg tgtggattta tttttaatta tttttgcctt aatcttctct ccctcccttt 1261 tctttcccag tgattatttc cccgtatctc tatgagagtt actccatgcc tgtcatacca 1321 caaccagaga tcctcagctc aggagcatac agccccatca ctgtgtggac taccgctgct 1381 ccattccacg cccagctacc caatggcctc tctcctcttt ccggatactc ctcatctttg 1441 gggcgagtga gtccccctcc tccatctgac acctcctcca aggaccacag tggctcagaa 1501 agccccatta gtgatgaaga ggaaagacta cagtccaagc tttcagaccc ccatgccatt 1561 gaagctgaaa agtttcagtg caatttatgc aataagacct attcaacttt ttctgggctg 1621 gccaaacata agcagctgca ctgcgatgcc cagtctagaa aatctttcag ctgtaaatac 1681 tgtgacaagg aatatgtgag cctgggcgcc ctgaagatgc atattcggac ccacacatta 1741 ccttgtgttt gcaagatctg cggcaaggcg ttttccagac cctggttgct tcaaggacac 1801 attagaactc acacgggtaa gagaaaaacc ataggcagga atgttactct gagatacaat 1861 gaagccctgg gctggctgtt ggatttgcat gaagttgtga catagtgctt ttaatgatgg 1921 acagtcattg atagctactt cctgactact caaaaagttg ttagggcaca aagtgctcca 1981 tttcttgcta acaacctctg ctccagggac tgcaaacagc atgtagactt tggaaagtag 2041 attcagaagg tttggtctca aagatcagtt agaaaatcag gaaagatgtg cttgaatgtg 2101 tgttctgtga aactgccccc agtaatcagc aaatacaaat gttgctatct taatcaattg 2161 atttggttaa tagatgaagg cttttcctac agttttctgt tagttacatt cttgctctga 2221 tatttgtcac ccgtgggtga cttggatagt tgaaactaaa tttactttgc ctctcttgct 2281 ttgtttgggg attttttgct ccttaatttc ttattcgtat cacagaatta ttctgcaggg 2341 tatcatggca cttagaccac cttaatcata gtacagaaag agggaaaagg ggaaatattg 2401 ggaaatgttg ggaggtgctc aacattagct gcatctgcta aagcagaaag ctttaagaag 2461 
tgaaagtcca agaactacgt aaattatttc tgtatgattg gcagcagtaa taatattcgt 2521 gtatgatttt taaatgcaat attcaagttt atcttaaaaa ttcaggtatt tttaaaagtg 2581 tctattatga gggattttaa tcaggttttg ctgcttctca ttatttaagc aaaatgtgat 2641 tgacttttgt taacatgatt ttactttatg tttcctctaa aaacactgac tgtctttctt 2701 tttttctccc actccacccc caggggagaa gcctttttct tgccctcact gcaacagagc 2761 atttgcagac aggtcaaatc tgagggctca tctgcagacc cattctgatg taaagaaata 2821 ccagtgcaaa aactgctcca aaaccttctc cagaatgtct ctcctgcaca aacatgagga 2881 atctggctgc tgtgtagcac actgagtgac gcaatcaatg tttactcgaa cagaatgcat 2941 ttcttcactc cgaagccaaa tgacaaataa agtccaaagg cattttctcc tgtgctgacc 3001 aaccaaataa tatgtataga cacacacaca tatgcacaca cacacacaca cacccacaga 3061 gagagagctg caagagcatg gaattcatgt gtttaaagat aatcctttcc atgtgaagtt 3121 taaaattact atatatttgc tgatggctag attgagagaa taaaagacag taacctttct 3181 cttcaaagat aaaatgaaaa gcacattgca tcttttcttc ctaaaaaaat gcaaagattt 3241 acattgctgc caaatcattt caactgaaaa gaacagtatt gctttgtaat agagtctgta 3301 ataggatttc ccataggaag agatctgcca gacgcgaact caggtgcctt aaaaagtatt 3361 ccaagtttac tccattacat gtcggttgtc tggttgccat tgttgaacta aagccttttt 3421 ttgattacct gtagtgcttt aaagtatatt tttaaaaggg aggaaaaaaa taacaagaac 3481 aaaacacagg agaatgtatt aaaagtattt ttgttttgtt ttgtttttgc caattaacag 3541 tatgtgcctt gggggaggag ggaaagatta gctttgaaca ttcctggcgc atgctccatt 3601 gtcttactat tttaaaacat tttaataatt tttgaaaatt aattaaagat gggaataagt 3661 gcaaaagagg attcttacaa attcattaat gtacttaaac tatttcaaat gcataccaca 3721 aatgcaataa tacaataccc cttccaagtg cctttttaaa ttgtatagtt gatgagtcaa 3781 tgtaaatttg tgtttatttt tatatgattg aatgagttct gtatgaaact gagatgttgt 3841 ctatagctat gtctataaac aacctgaaga cttgtgaaat caatgtttct tttttaaaaa 3901 acaattttca agtttttttt acaataaaca gttttgattt aaaatctcgt ttgtatacta 3961 ttttcagaga ctttacttgc ttcatgatta gtaccaaacc actgtacaaa gaattgtttg 4021 ttaacaagaa aaaa // LOCUS AF042084 6500 bp DNA PRI 20-JAN-1998 DEFINITION Homo sapiens heparan glucosaminyl N-deacetylase/N-sulfotransferase-2 gene, 
complete cds. ACCESSION AF042084 NID g2792517 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6500) AUTHORS Humphries,D.E., Lanciotti,J. and Karlinsky,J.B. TITLE cDNA cloning, genomic organization and chromosomal localization of human heparan glucosaminyl N-deacetylase/N-sulfotransferase-2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 6500) AUTHORS Humphries,D.E., Lanciotti,J. and Karlinsky,J.B. TITLE Direct Submission JOURNAL Submitted (09-JAN-1998) Research-151, VA Medical Center, 150 S. Huntington Ave, Boston, MA 02130, USA FEATURES Location/Qualifiers source 1..6500 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q22" mRNA join(1..1032,1219..1306,1603..1757,1940..2125,2386..2514, 2645..2827,3544..3640,4401..4524,4669..4843,5058..5228, 5336..5445,5640..5742,5840..>6500) /product="heparan glucosaminyl N-deacetylase/N-sulfotransferase-2" CDS join(28..1032,1219..1306,1603..1757,1940..2125,2386..2514, 2645..2827,3544..3640,4401..4524,4669..4843,5058..5228, 5336..5445,5640..5742,5840..5965) /codon_start=1 /product="heparan glucosaminyl N-deacetylase/N-sulfotransferase-2" /db_xref="PID:g2792518" /translation="MLQLWKVVRPARQLELHRLILLLIAFSLGSMGFLAYYVSTSPKA KEPLPLPLGDCSSGGAAGPGPARPPVPPRPPRPPETARTEPVVLVFVESAYSQLGQEI VAILESSRFRYSTELAPGRGDMPTLTDNTHGRYVLVIYENLLKYVNLDAWSRELLDRY CVEYGVGIIGFFRAHEHSLLSAQLKGFPLFLHSNLGLRDYQVNPSAPLLHLTRPSRLE PGPLPGDDWTIFQSNHSTYEPVLLASLRPAEPAVPGPVLRRARLPTVVQDLGLHDGIQ RVLFGHGLSFWLHKLIFVDAVAYLTGKRLCLDLDRYILVDIDDIFVGKEGTRMKVADV EALLTTQNKLRTLVPNFTFNLGFSGKFYHTGTEEEDAGDDMLLKHRKEFWWFPHMWSH MQPHLFHNRSVLADQMRLNKQFALEHGIPTDLGYAVAPHHSGVYPIHTQLYEAWKSVW GIQVTSTEEYPHLRPARYRRGFIHNGIMVLPRQTCGLFTHTIFYNEYPGGSRELDRSI RGGELFLTVLLNPISIFMTHLSNYGNDRLGLYTFESLVRFLQCWTRLRLQTLPPVPLA QKYFELFPQERSPLWQNPCDDKRHKDIWSKEKTCDRLPKFLIVGPQKTGTTAIHFFLS LHPAVTSSFPSPSTFEEIQFFNSPNYHKGIDWYMDFFPVPSNASTDFLFEKSATYFDS 
EVVPRRGAALLPRAKIITVLTNPADRAYSWYQHQRAHGDPVALNYTFYQVISASSQTP LALRSLQNRCLVPGYYSTHLQRWLTYYPSGQLLIVDGQELRTNPAASMESIQKFLGIT PFLNYTRTLRFDDDKGFWCQGLEGGKTRCLGRSKGRRYPDMDTESRLFLTDFFRNHNL ELSKLLSRLGQPVPSWLREELQHSSLG" repeat_region 2892..3182 /rpt_family="Alu" /rpt_type=dispersed repeat_region 3213..3505 /rpt_family="Alu" /rpt_type=dispersed repeat_region 3766..4056 /rpt_family="Alu" /rpt_type=dispersed 3'UTR 5966..>6500 BASE COUNT 1421 a 1790 c 1586 g 1703 t ORIGIN 1 attcctccct cccttcctcc ccccgccatg ctccagttgt ggaaggtggt acgcccagct 61 cggcagctgg aactgcaccg cctcatactg ctgctgatcg ctttcagcct gggctccatg 121 ggcttcctgg cttattatgt gtccaccagc cctaaggcca aggaaccctt gcccctgccc 181 ttgggagact gcagcagcgg tggggcagct ggtcctggcc ctgcacggcc tccagttcca 241 cctcggcccc ccaggcctcc agagacagct cgaactgaac ccgtggtcct tgtgtttgtg 301 gagagtgcat actcacagct ggggcaggaa attgtggcca tcctggagtc tagtcgtttt 361 cgttatagca ctgagttggc acctggccga ggggacatgc ccacattgac tgataatacc 421 catggccgct atgtcttggt catttatgag aacctgctca agtatgtcaa cctggatgcc 481 tggagtcggg aactgctaga ccggtactgc gtggagtatg gtgtgggcat cattggcttt 541 ttccgagccc acgagcacag cctactgagc gcccagctca agggctttcc ccttttttta 601 cactcaaact tggggctccg ggactaccaa gtgaatcctt ctgccccgct actgcatctc 661 acacgcccca gccgcctaga accagggcca ctgcctggtg atgactggac catcttccaa 721 tccaatcata gtacatatga accagtgctt cttgccagcc ttcggccagc tgagcccgca 781 gtgccaggac cagttcttcg tcgggcccgg cttcccactg tggtacagga cctggggctt 841 catgatggca tccagcgggt gctctttgga catggccttt ccttctggct ccacaaactt 901 atcttcgttg atgctgttgc atacctcact ggcaagcgcc tctgcctgga ccttgaccgc 961 tacatcttgg tagacatcga tgacatcttt gtgggcaagg aagggacccg catgaaggtg 1021 gctgatgttg aggtcagtct tgttacttaa atttttttct tagaagctat tcacactctg 1081 gcctgtcaga ctcccttcct tgattctgag cttgtcatag gtacactcct gctttccagg 1141 caggaagaat gctacttctc atgtatttcc tgttcagtct gctctcctga tttctcttgt 1201 ccctgctact cttcccaggc tctgttgacc acccagaaca aactcaggac cttagttccc 1261 aacttcacct tcaacttggg cttctcgggc aagttctatc atactggtga 
gcttataccc 1321 ctactccttt ggcatatcat agtttgacct gtcccttcac tgtccacctc cctgggcaga 1381 caagttgaag aaaagggtca aggtttgggg tcagactgct gtgaggagct agtggtgggc 1441 atgaatttag gttcgcatgg gtggctggat ggcaagttgg cagtggggca aatagaaagt 1501 cttaggcaag gatcagggat ggaaggtggt cagtgttgtt acaaaataaa ggagggaaca 1561 ggggctgact ggctgtggtc agtggtcagc gtgtggtcat agggacagag gaggaggatg 1621 caggggacga catgctgctg aagcaccgca aagagttctg gtggttcccc cacatgtgga 1681 gccacatgca gccacacctg ttccacaatc gctccgtgct ggctgaccag atgaggctca 1741 acaaacagtt tgctctggtg agacccttgg atctcttccc cataatttct cccagccatg 1801 cttccttttc taccttcccc taatgggcta cccttttcct atcctttgcg tttttcttta 1861 tttggaggct tccccaccct ctttaccctc tactccccaa acctcaccag gtcctggtga 1921 gagtctatgc cattcccagg agcatgggat tcccacggac ctggggtatg ctgtggcccc 1981 ccaccactcg ggtgtgtacc ccatccacac gcagctctat gaggcctgga aatccgtgtg 2041 gggcatccag gtgaccagca ctgaggagta tccccatctc cgccctgccc gctaccgccg 2101 tggcttcatt cacaatggca ttatggtgag ggactcccaa tacaggcttg aatcaaactc 2161 taacatttag acagctttca acatgcattg acctatgttc tctatcaact gccagtccaa 2221 gtatgggggt acataattgg gggtagaaag gtgaaaaaat tttaacaccc tcagtaagag 2281 ctttattgct ataacttcag actcagtcct ctataaatcc cagtgtcctc catctacttc 2341 tcattctatc tttcaaatct catctgtttt ttcttttccc tccaggtgct gccccggcag 2401 acatgtggcc tcttcactca cacaatcttc tataatgagt atcctggagg ctctcgtgaa 2461 ctagaccgga gcatccgagg tggagagctc tttctgacag tgctgcttaa tccggtaagg 2521 tgctgagagg aacggctaga agattggaga gccaaggtgt ttgcaggctg ggacctttat 2581 tcaccaaggc taattcagag gtggggtagg ctctccaaat ctgacatgca ggtacccctt 2641 acagatcagc atctttatga cccatctgtc caattatgga aatgaccggc tgggcctata 2701 cacctttgag agcttggtgc gcttcctcca gtgttggaca cggctgcgcc tacagaccct 2761 tcctcctgtc ccacttgcac agaagtactt tgaacttttc cctcaggagc gaagccccct 2821 ttggcaggta aaggtggagc tcagtctgat gagagacatg ataggggtgg ggtgtttttt 2881 gttgttgttg ttgttgtttt gagacagaat ctcgctctgt cacccaggct agagtacaat 2941 ggcacaatct ctgctcattg caacctccac ctcccaggtt caagcgattc tcctgcctca 
3001 gcctccctag tagctgggat tacacgcacc cgccaccacg ccggctaatt ttttgtattt 3061 ttagtagaga cggggtttca ccatgttggt caggctggtc tcgaactcct gactgcaggt 3121 gatccgcccg cctcagcctc ccaaagtgct gggattacat atgtaagcca ctgcgcccag 3181 ccggggtggg ggtttaagag aagactaggc caggccgggc acggtggctc atgcctgtaa 3241 tcccagcact ttgggaggcc gaggcaggca gatcacctga ggtcaggagt ctgagactaa 3301 cctgaccaac ttggagaaac cctgtctcta ctaaaaacac aaaattagcc ggacatggtg 3361 gcacattgcc tataatccca gctactcaga aggctgaggc aagagaattg cttgaaccca 3421 ggaggcggag gttgcagtga gccgagatca tgcattgcac tccagcctgg gcaacaagaa 3481 cgaaactccg tctcaaaaaa aaaaaaaaaa taggccttat ctgattcttg gccctggctt 3541 cagaatccct gtgatgacaa gaggcacaaa gatatctggt ccaaggagaa aacctgtgat 3601 cgtctcccga agttcctcat tgtgggaccc cagaaaacag gtgaagtatc cttagctaac 3661 ttctcttgcc ctgccttaaa cccaaaggat ccttttgtca aggaactctc ttctctctcc 3721 aggtaatcac aggagtgaaa aaagtatttt taattttaat aatttttttt tttttttgag 3781 ataagagtct cacttctttg cccaggctgg agcgcagtgg cacgatctca gctcactaca 3841 acctccacct cccaggttca ggcaattctg catcagcttc cctagtagct gggattacag 3901 gcaggagcac cacacctggc taatttttgt attttttagt agagacgggg attcaccatg 3961 ttggccaggc tggtctcgaa ctccttgacc tcaagtgatc cgcctgcctt ggcctcccaa 4021 agtgctggga ttacaggcat gagccaccgc acccggcctt aatttgaatt tttaagagag 4081 aaaaaaagga actcctttta tacccacccc tcattgatcc agtctcttga ggcagtgatt 4141 cctaaccctg gttgtatatc agaatcatct gtggcgtttt tacaagatca caagtcattc 4201 tggtgcacag ttaaagcaga gaaccactgt tttagccctt ccctgggcct ctctcagact 4261 tgcctcctat cattgtagtt tggccatagg ggattttcct acccagagtg gttatttttt 4321 cacttccttc aagcttatgc tgttgagctg gcttccctcc cacaatggct gcctgaagct 4381 ggtctttctc tcccatttag ggactacagc tattcacttc ttcctgagcc tgcacccagc 4441 tgtaactagc agcttcccta gccccagcac atttgaggag attcagttct tcaacagccc 4501 taattaccac aagggtattg actggtgaga ctgggcccca tttgaatgtt tctcagagga 4561 acactccctt gccccaaaat accccattcc agcttccctc agcacagact tctggccact 4621 tcactctgca tatttttctg cttctctctc ttctgtgcta cttcctaggt acatggattt 4681 
cttccctgtt ccttccaatg ccagcactga tttcctattt gaaaaaagtg ccacctactt 4741 tgactctgaa gttgtaccac ggcggggggc tgccctcctg ccacgagcca agatcatcac 4801 agtgctcacc aaccctgctg acagggccta ctcctggtac caggtctgcc tgaagctagg 4861 gacacacagg ctaaaggttg gggggagggg ggaggggaca gtgactgcta gagagggaga 4921 aaggagtgag gtttgagaaa cttaactagt gccttgggga cgggtacccg tgctgtgctg 4981 tctgtctcca gaccaagcgt aagggcagaa tactgtcaaa tcaccagtcc ttcaccttcc 5041 ttctctttgt ccttcagcat cagcgagccc atggagaccc agttgctctg aactatacct 5101 tctatcaggt gatttcagcc tcctcccaga cccctctggc actacgctcc ctgcagaacc 5161 gctgtcttgt ccctggctac tattctaccc atctacaacg ctggctgact tactacccct 5221 ctggacaggt ataatcccca caagagccct cagtggggtt ccccttcccc tcttcttccc 5281 ttcagcccag aggcttctta ggaatataga gggaaccaca tccctgtcct tttagttgct 5341 gattgtggat gggcaagagc tgcgtaccaa cccagcagcc tcaatggaga gcatccagaa 5401 gttcctgggt atcacaccct ttctgaacta cacacggacc ctcaggtggg agagcatgac 5461 ccaggggagg agtactgtat gggaaatggg gaagcaggtt gaccatgtct gttagagaca 5521 tagatttact ctgtctcaca aggagagaca tgtccctgtg gcagggaagg gcattgtgca 5581 tttatggaac ttgagtttct cttttccctg ccacctcatg gcctcttcct gttgagcagg 5641 tttgatgatg ataagggatt ttggtgccag ggacttgaag gtggtaagac tcgctgtcta 5701 ggccggagca aaggccggag gtatccagat atggacactg aggtaagtaa agccaggggc 5761 aggggaatcc ctctccactt ccaggatctg attctgtgtc ccacttctta ccacttctgc 5821 cttcattttc tcctgtcagt cccgtctttt ccttacggat tttttccgga accataattt 5881 ggagttgtcg aagctgctga gccggcttgg acagccagtg ccctcgtggc ttcgggaaga 5941 actgcagcat tccagtctgg gctgatgtcc cagcctccca taccagcaaa atgccccctg 6001 cttccctaag ggtcaggtcc agagcagggc ccacaagggg gattagagtg gcctggcccc 6061 tccccctcta cctcagtagc ccccaggcct gagatggctg agaagggaag ggtatccttt 6121 tcccacagtt ctgggacaaa taaaggggct tcctttggta ccccacataa tagtgctagg 6181 tacctttgac ccatcatctt gggaggtggg gaggaatgag agggtccagg cagggtgtag 6241 gggaatgtat tagtccaatg agatttccct cttcatccgc agcagtgtat ctattctata 6301 cctggctatg ggagagaccc cttgcatggg agggacccct tgctatggcc cctttagcca 6361 ggcagtggga 
tctacctgtg gcccggcctc cctaatgtca ttcacattga atggggatga 6421 ggtcggacag tggctcatag agccgagtat gagccctagc tgtgggctag aaatgtcctt 6481 aataaacatc cttatttttc // LOCUS D49493 17286 bp DNA PRI 24-OCT-1996 DEFINITION Human gene for human bone morphogenetic protein-3b. ACCESSION D49493 NID g902930 KEYWORDS human bone morphogenetic protein-3b; BMP-3b. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hino,J., Takao,M., Takeshita,N., Konno,Y., Nishizawa,T., Matsuo,H. and Kangawa,K. TITLE cDNA cloning and genomic structure of human bone morphogenetic protein-3B (BMP-3b) JOURNAL Biochem. Biophys. Res. Commun. 223 (2), 304-310 (1996) MEDLINE 96264636 REFERENCE 2 (bases 1 to 17286) AUTHORS Kangawa,K. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 17286) AUTHORS Kangawa,K. TITLE Direct Submission JOURNAL Submitted (04-MAR-1995) to the DDBJ/EMBL/GenBank databases. 
Kenji Kangawa, National Cardiovascular Center Research Institute, Department of Biochemistry; 5-7-1 Fujishirodai, Suita, Osaka 565, Japan (Tel:81-6-833-5012(ex.2478), Fax:81-6-872-7485) FEATURES Location/Qualifiers source 1..17286 /organism="Homo sapiens" /db_xref="taxon:9606" exon <3490..4235 /number=1 5'UTR <3490..3916 CDS join(3917..4235,13061..13986,15866..16057) /codon_start=1 /product="human bone morphogenetic protein-3b" /db_xref="PID:d1009064" /db_xref="PID:g699606" /translation="MAHVPARTSPGPGPQLLLLLLPLFLLLLRDVAGSHRAPAWSALP AAADGLQGDRDLQRHPGDAAATLGPSAQDMVAVHMHRLYEKYSRQGARPGGGNTVRSF RARLEVVDQKAVYFFNLTSMQDSEMILTATFHFYSEPPRWPRALEVLCKPRAKNASGR PLPLGPPTRQHLLFRSLSQNTATQGLLRGAMALAPPPRGLWQAKDISPIVKAARRDGE LLLSAQLDSEERDPGVPRPSPYAPYILVYANDLAISEPNSVAVTLQRYDPFPAGDPEP RAAPNNSADPRVRRAAQATGPLQDNELPGLDERPPRAHAQHFHKHQLWPSPFRALKPR PGRKDRRKKGQEVFMAASQVLDFDEKTMQKARRKQWDEPRVCSRRYLKVDFADIGWNE WIISPKSFDAYYCAGACEFPMPKIVRPSNHATIQSIVRAVGIIPGIPEPCCVPDKMNS LGVLFLDENRNVVLKVYPNMSVDTCACR" intron 4236..13060 /number=1 exon 13061..13986 /number=2 intron 13987..15865 /number=2 exon 15866..>16827 /number=3 3'UTR 16058..>16827 polyA_signal 16810..16815 BASE COUNT 4249 a 4374 c 4540 g 4123 t ORIGIN 1 tctagatgaa gagctgtgaa tcttcctccc aatctttgga caagaacccg cagacacaga 61 caataacagc ataacagttc cttggtagag gtctgtgact tcctcatcaa ggaaacattc 121 cttttctctt tccttttttt tttttttttt tagttgttgg cactgtgcac taagaaacga 181 attttctctg cagagtaagg aacagccagg cttgaaactc tcaccaaatc tgccagtctg 241 ggtctacttt aaatgtgcta caacttcttt caaacacaca ataaatgaaa agaccaatct 301 gaatgatcag tgcaatttag ttagtatcat gtcactttaa aatctggcca agaatcctca 361 tactatcctg ctcttgtccc catcttagag ttgaaaaaac tgaggtccag agaaggcaag 421 taacttgctt aacatcacac agctaagtag cagtgaaaat agggacatga atctagtccc 481 agttgactcc acagggtcat gggagcaccc tcacttccaa gaacatgttg acatggggcc 541 acagaaccaa cgggagagta caggagctac agcagagcct ggaggccaga cttctcactt 601 aaagaagagg ctcctaggat tccagagagg caatggggct gccccagggt tacagagcta 661 gttaaaatca cacactagct ctgctacaaa 
ttaactggac gatatcatga aacaatcgct 721 tgggaacatt tatttgggga gaggttcttg atgaaaccaa atgagcttgc ttttcaaaac 781 tacccagggt ttgttttact tccttacttg ggtgcagagc ttcataacta tggagtctct 841 tcactccttg ctcatgactt tctgagctga agaaaaccct gctatcccag agtgcacagc 901 aaggaggcct agaggactga gggcccccaa actacttctc acctgtctgg agagaacact 961 gttcactcca atcagcaagt tcaagacctc agcacccctg gcaattgcag tgtacggaac 1021 tcccagaacc atgataattt tgtaggaagc tgtcagaaat cacagatcca aggacttgtc 1081 tccacaatct gccatctcaa ctcttattct gagctgcggc ctgagctatt aaaaacaaaa 1141 cattctcgtg ggcaagctcc acaattagcc tcatttccac actagaaaaa ctgttctgca 1201 tgagctgaaa ttgcacagag attgtttctt gtcatagtga tgcttttcaa gggccgtgct 1261 gagagccagc gaggatgtga cccagtggta atctgaattg cttcaccaaa cacattttta 1321 tcccatcatc tcggctacat gaaagggggc cacagtgctc gtgttcaagg ctttacagtc 1381 ccgctgttca gcaataaaat gaggcaagaa atattagcca tccctggctg atgccttaat 1441 tctttcccaa caatgtcaga gaatctgtac cctacaggtt ttagtgctgg ctaaagcctg 1501 accctcaggc tggcttgaat ggagcgggaa tgagttaagg agagccagaa ggctttaatg 1561 gacacggatt ccaaatagcc cagcttgtgc gggcctcctg cggactcacc ctccttttca 1621 acaggcctcg attgtgtatg gccccaacag acacaaaact acccatgagc agaggttacc 1681 cccagccggt gcaggtcaga ggcattctgg gggggaaggg gagaggggga cagaggggaa 1741 gtgaggggca catggagaag gagggaaatg ggggtgggag tgggggaagg gaccactgaa 1801 agatggtctg gatcatttat aaggagagat tcactaagtc ccaggctgag acccgatcat 1861 ctcacttcca ggcaaaccat ttttgaaaac tggttcagaa caaatgggag ctatcctaat 1921 gggaagttga acacactgtg atctaatttt caaaacaaat aaaatgcttt tctctggttt 1981 tccaaaagct acaagtctga ggttcagaag ctcttaatag gatggcctct aatttgacac 2041 tgcaagattt aaaaaaatta caaatgggtc atgttttaca caccaacaga agccttttcc 2101 agcaattttt caattattgc attgtgagcc ctccaagaaa agtaactatt ttggctcaca 2161 gagcccaaag ttaccaagta aagggaacag cctgttcaca agtttgcaaa tagtcctttg 2221 aaatgtagca tagactgtat ctgggagctt tctggaagaa catgggcttg acgtggtcct 2281 ttccttccct ctcaagggtc cttaaacttt ttaggtgggt cttgggcccc cttctaagtc 2341 tagggaagcc agtggacccc ttcttaaaag aatgttttta 
aatgcataaa ataaaatatg 2401 cagagttaca aaggaagaca attacgttag aatagagtta tcacaaatat ttaaaaacaa 2461 ttttgtaaca tagtaactat gttccttttt gctaactaac tagcagtagg cctaactgaa 2521 gtggtaatga gcaaacatga tactttgatg aatccgcaac cactctcatg tgatatgaaa 2581 atatctgtgg atggtgccta ttggtgacaa agtcactaat accactgtgg ttggttgctt 2641 gcatttataa cattagttaa tgctaagttt cagttagagt taatgaaaat aaagatgtat 2701 ttttttcccc atcaagttca taagagccct tgaattctat caatggagaa gaaatcctgc 2761 tagaggcaag acaaagatca ggaccttctg ccccagggtt aagctgaggg gcagtgggcc 2821 agggactcac tacaccctca cttcactggg aatccttctt aggaaagcca ctgctttctc 2881 tcttcctgct ctgatgcttt ggaaatgttt ttaaaggcaa attgaaagca tgtcttattc 2941 ctgcaatcct gaaggctaca tgtccaaaga ttaataagga cctttgatgt gtttcttcaa 3001 caactgaaat tactctcgtc aaataaaggg ctggttaatg agctaattct gagttttaat 3061 cacgatcttg ttccagttcc cttgggtggc atgagctttg gggtgagggc ttttctttac 3121 ggagtcagcc cagagaggat ctctgttccc cacactctca cattttggac atacgctgac 3181 tccgaggcaa ctgggcgcac acatatcccc tctctggtcc tcacagggga gctgtcatct 3241 tgcaatcaca tagccatcgc atgccattgc aggggcaacc cagcaggcag gcccccctcc 3301 aactgccgcc cagcccccct cgcagccagg gcaggctgcg gcgccgacac acaggagccg 3361 gctgcggggc tgggtctcgg gctctcggct ggggagcggg gagcgcgctc gcccacgccc 3421 cccacactcg cgggcgcaca ccccggcgcg cgcacgctgc cacacacggg cgcacgcaca 3481 cggcagccgg gccagggacg accctgtcag ctgcagcccc agaggtccgg ggcgcgcagc 3541 cgggtcccct cgagggcgca gccggccgcc ccgccccgcc cctcgaagca gccgggccgg 3601 gcgcgcagtg ggctacaaac tttcgcagcg cgagtccgcc aaggcagcgc gccgactcgg 3661 gctcggctcg gctctgcgct gctccggacg gctgtgaccg ctggccgggg gctcgggcca 3721 ccggtaccca cggaccgcgc gcccgggtgc ctgctccgct aagcccctcg ccccgcgcgg 3781 acctcggtat ccagcgccct gctgcccggg ctctccccgc gcgccctact gccgcgaggt 3841 cagtccgcag cctccggtgc gccagcgctc gccttcctcc tcctggactt cgaccctttg 3901 ccgccctcac cacgccatgg ctcatgtccc cgctcggacc agcccgggac ccgggcccca 3961 gctgctgctg ctgctgctgc cgttgtttct gctgttgctc cgggatgtgg ccggcagcca 4021 cagggccccc gcctggtccg cactgcccgc ggccgccgac ggcctgcagg 
gggacaggga 4081 tctccagcgg caccctgggg acgcggccgc cacgttgggc cccagcgccc aggacatggt 4141 cgctgtccac atgcacaggc tctatgagaa gtacagccgg cagggcgcgc ggccgggagg 4201 gggcaacacg gtccgcagct tcagggccag gctgggtaag tagagggtgc cccaggaccc 4261 cttctcctca ttctccacct tcctcacttt tctgtctccc cacccgtgca cctctacttt 4321 tcctcttcta gccaacgggt gagtggttca ttcattcttt cactaatgat tcactgatct 4381 ctccatgcca ggccctgaac ctccagtggg gcaccaagag tggcagcagg tggcacagag 4441 gtcctgcccc tggggtgctc ccacattgct ggacatacaa agactacccc atacacaagg 4501 ctgtaagcac agagagtgga agaggggaca taggccaggg gagtatccct gtcatagggt 4561 caattatgtg ggcattttac aaatgagaag ggcacagcca gtgaacatag gcagccctgc 4621 agaagctcca ggcactgggc cctgtcagga agcaggagcc cctagctggc gtcagctatg 4681 gttcctgtgt ccagggaagc agagtggact gagaagggcc ttggctgccc gggattgagg 4741 acttccacgg actggcatga agcgttcaga ccacaggggt ttaaggaaag gcccaggggc 4801 ttaggcagtg ggaacactct gcctatctcc ggcctctgtc ccacctttcc acttcctcag 4861 ccccttggat ggagctgggc agagggcact tgcaaaggcc catgctgccc tgttctgaga 4921 gcacttccag gctcctgcct cagcagcctg cccacctttg caggccctgg gtagagagga 4981 ggtgcagcca gggccacagg ggtggtgaga agaccctggt tgtcatggtg attactcctg 5041 gttgcagaat aatgttccag aaaatgtggg tgggggtctc ctgcagctct gcagaagatg 5101 aggccattca acaggagaaa aatctgcagt ctacaggctc ccctcctgag ggtgagggaa 5161 acaagcctag ctgtggagct agacagcctg gggtcagatc tgtcactggc tgtgtgactc 5221 aaggtaacat ctgtggcctc tcataaccat ggattgggga gcacctgcca caaagaggcc 5281 ccatgccctc ctcctccact tagcagactc accaggtcag tggccctggg ctccaggcca 5341 attggatcta actccatcct cactcataac tttcttgcac catgtggggt ccatttcttg 5401 aggaacgtta ttaaattaaa ccatttaaga atcatctggc atgaacatga ccttgggtaa 5461 atcactaaag ctaggtggac ttgagtgtat tcatcgataa aatgggcaag gcttgccaat 5521 atctcccctg tgcagttgaa gggaagatga aatgaggtca tatgtgcaag ctcagagccc 5581 agcactggcc agattaagtg ttgatgtacg ggattcccct gggagcacaa ataatcgtgc 5641 taagtattgt gtattcctgt gggtttaaac ttgaccttgc ttctctatta actcaatcct 5701 caaagccttg tagttgagga agcaaggcca gcagggtcac tgctgcgaca cgggtgagcc 
5761 tgcccactga gcccatctac ttgctgcagc caggcatgga gatgccttct tgtgtcattc 5821 tgttttcaca gaaagggggt agtacaagta gcagtggagt tcaggatatt aatctcaaaa 5881 aggacctgaa gacagtatct caaatccagc actcccactt taaagataag gaaactgaga 5941 ctcagtggtt gttcacagtg gcaggggttg gcaactccca tctgccaggt cattgctgtt 6001 ctcactgtca ctccccacac cctgtcatct ccctcagagc tatgaccagg gtctgggagg 6061 ggtagggtga aacagacaag cctgggaaat caggattctg gtagtctgga gtgtctgaca 6121 aatccctctt ctttggaagc ctgaggcaca tttgggtgtg aatcccagct gtaccagcta 6181 ccaggtgtgt gcagtggcac aggttagttt acctctctga gcatcagttt ccttgtagta 6241 cagtgaagaa gtgaggatta aatgggctaa gcaggttttg tgctttgcat acagtggggg 6301 cccaggcagg ttgcttgatc ctctcctccc ttgattcatc tgagtgtttc atgccaagcc 6361 tcttgcctgc cctcagatgt gcctgcttcc tgatccagtg acctgtctga ggaggtatga 6421 gtttgggcag ggacccttct ctcatggatg atgccaccca tgtgggaggc ttcagctctg 6481 gtccgagaac atgccagtct cagcctgctc cccagtggtg ggtttctgct gcctcagtgg 6541 ggcagcacct cttcctgaac aacaaatgcc tccatcccca cagcctgcga tacgtaacta 6601 tacctccccc gccaccgggc agtgcagctc agctgccaag tgctggctgt gagctggcct 6661 cgtggggtcc aaatcccaat tctgcgatct gggcaaggga ctgaacttgt ccatgactca 6721 gtttttttca cttctaaaat ggagctaata gtgctcccct ccttatagga ttttttttaa 6781 tggaaactga gatatgtcaa gtgcttacac cattcctgac acaggtaagt gtcaatggtg 6841 tattagctct tgctgttgat ataccagcca ccccttcctt ccttccctca ttgctccctg 6901 aagtgaattt cgttccttat taaatctgca tcggtgggga aagcctatga acctgaccct 6961 aagggttaag gttagaccac ccctaattcc cactgaatga cgtgtgttta aatgtaagct 7021 ttgtcaccta ccaatgtgat actgagttgg ttttcttcat ctctaaattg aaaataatac 7081 tacatactca cctgtcaaaa atgccaaaat tcaagttgat ttcagttccc caagtattcc 7141 ctaagcactt cgtttggtag cagatgctgt aagctcaggg agcacaaaga tgcccagaaa 7201 agctcataaa ctagcagggc agatcaaagt aagcagcaga atagggcagc aggcctgggg 7261 ggtggcatat aagaaggagg gcaggggtgc tgcgggagcc caaaggcgca ctgacctggc 7321 aggggcagag cctctgggtg gggccaggaa agccttccca gagcaggcca ggccccagcc 7381 actttgaagg gcaggtgggc cttagccagg tgcggaagtg aagctgtcca ctgtgttcag 7441 
agactgccag gagtttggtt ttaatgataa cgcaaggaga gaaaggcagg agatggagac 7501 ggacaaatag gttgcaagca ggccatggag agccatgagt attatgaatt tgttccccca 7561 taggcaatgg ggcaatatac aacgtgcatg cgtaagggac agaaggaagg aaggagggag 7621 agagagaaga ctagagggga gagagagaaa ggagagaggg agagagaggg aggaagagga 7681 gagactgcag ggaaaaagag ggataaagga gagagagtga ttataaataa atgcatgctt 7741 attggggaaa acctggaaat tatagaagat cacagtaaag aacataagtc atccctaatc 7801 acatagccag atataaccgg aattgttagt atgatatttt ggtagaatca ttccagccat 7861 atctatctat ctatctatgt ctccactatc tatatctcca ttgttttacc cttatatgta 7921 agatatatat ggatatatgg atatatttga gagatatata tgtgagatat atgtggatac 7981 atcttttatg gattatattt tcagcatgca ggtaaaactt gcaggtattt ccatttgaaa 8041 ttactttttt tcattactgg tcatcttgaa cttttcccca tgcctgctgg ctatttgcaa 8101 ttcttcctgt ggaaactgcc tccttgtatg ggagtccctc ctttgtccta ctgatatgca 8161 aaggttcttt acacagtaag gatcttaacc tttcacttct aatagacaat ttgcttttga 8221 gacatgagtg cgatgtaaga gggaacaaag ttctgggacc actgttgtgg ttgagtctgg 8281 agccctgagc gcagggtgga aagatgctgt gttcaggatg gttgtgtaga aacaggaagc 8341 agagcaaaca gggcctggag attggttaaa agcaggaatg ggagaggcaa aagagttctc 8401 agtctcctgc tggaagggct ggaggagaag tgaggcctcc cactgggcag gaggtcaggg 8461 agcaagggtg tagtgccctg agggtggcaa tagttcctga ggccataact gttctgagcc 8521 cttgctgggt gccaggcaca gtgctgctag tgcgctctgc agagctgatc tcacaataac 8581 ttttggaggt gcaaatactc tatccagttt atgaatgagg aaactgaggc acaaagtggc 8641 tccatgactt gcctgagtcc ccacagctag taagggatgc cagcaggcgt tgaacctcaa 8701 ccctagagcc tgcatggaaa cgggcactaa gtactaaatc tgtgttagtt ccctgctctt 8761 ctctgcagca ccgtctggcc caccgcccct tcctccagct tctgttggtg catgcatcta 8821 agcccagcgt tctcactgat tgccctagtg gctgtttgag gggcagtggg tgaagggaca 8881 gatctccaaa ccgtcttgga atctttttat atagggcaga gcatctaagg ggctgctctt 8941 ggcctgagag ttgaccagca gtggggctgg acaggccttt ggatgaggaa gtcccttccc 9001 ccagtaccat ccccactgcc ctcaggaagg aggctgagtg cccagcaaag tggacctggt 9061 cacagttctg agaattccca agagccactg agatcagctt gtctaaataa agtcagtctt 9121 tctcttctta 
gggagtcaga actcaaaaat atcaaaataa tgacctctgt ttgcaaaacg 9181 tgctgcagtt cacagcacat gatctttgtc cattagctct taggcagatc atgaacatat 9241 ccagaacaca tggaggagca aaaggcatgg gcttggggtc agacagagct gggtacaagc 9301 cccgactcca ccccttgcca actgtgtgac tttaggcaaa ttgcacaccc tctctgaagc 9361 ttcagctctt tgtccacaga atgaatgacc gctccctact ctcttcatgg tgaaaatgat 9421 gaatcaagca tgcagaacac tcagcatagg accaagcaca tacacttgga gctcagaatg 9481 tcataattaa taaacaattt tcttccaatt taaagatgag aacactgagg ctcaagggga 9541 agtctggcta aacacctggg tgcccactgt ctccatttgg atttgcatca tcagctgcat 9601 ctttgtaaag aatgaggatt ccttgacttc ttaaggctgt ggttactgca ctctagggaa 9661 gcatgccagg aacccccagt gttaactgaa gtaacagacc acacttctct tgtgcccact 9721 tgggcaggat tttcacagac aatctcattc aacatgcatc tctgagctcc aactgtcacc 9781 taagccttat ggtgaggtct agcggtgcag tagtgaataa aactggcagt gtccctgtcc 9841 cgttagagtt tatgttttag gaagataaga gcagtaaatc agtatacagc aggatgtcag 9901 gaagtgctct gaaggccaat gaagcagggg cagacagagt gatgaggctg ttctccagcc 9961 acgggagccc aggggaccca ttgtgggagg ttccatttga gcagatacct gaagaggttg 10021 agtggtactg gatggggaac catgaaggtg gaaggaagcg tgaggcacag tcctgagaat 10081 aaagccacat gttggaagga gaagaacgcc agtgtgacgg gggcagaggg gtggggttgg 10141 gctgtggata ggagataaag ttgcagaggt aggcaggcca gatcaagtaa ggccttggta 10201 aggggttgag ttttatttgg ggcaatggag ggctccgagc ttggggggtg acctgtgatg 10261 gaatgcaccc agcacaccgg gggcctcgct tggacagtgc tctttaaaac tgtgacatga 10321 ggagacactg aagacgggtg tcattgccat taccagtctg tctcccctct aggctgaact 10381 cctcctgggc aactttgtta tctgtctccc ctctaggctg aactcctcct gggcaacttt 10441 gttatgaccc tggtcagccc aacttagcct ggagtaggag ttttgcaaat ctttatcagg 10501 aaaacaaaga gaagaagagg aagaaagaaa ggaaaagagg gtggatagaa aaggggaaag 10561 aggggaagaa aagagaggag ggaggaaggc agagaagaaa gaagagagaa aagacatgaa 10621 tcacagagca ccagcacgca tcagaggaaa taatgacgtt ctcttcaatg cttctctgtt 10681 atggcttggg ctcggctaaa ctgaaagatt tcaggttgag gtcccttctc tgagccacag 10741 cctggagccc cctgcactgc gtacatgcag aaacatccag agagaggacc agcctgtggt 10801 
ccctcatgca ctggggctgg gggtgaccca tcttgtcccc agcagtcccc agcagggagg 10861 gtcttcccag agcaggctgc ccagtgaggg ctgagctccc tctggcagga ggcattcaag 10921 ccacatggag gtagaactta tacccttttg cacaacctca aaggttcctc cccactctga 10981 atctgttagc agaacaatgg aggagttgag acttgaactg gatcaaagaa aaacaatgtg 11041 ctgtgttggc tgggttgcag ctcagatatt tggatcagtg aatgcaaggt accccagttg 11101 atggacggga gcttgctctg ccagccttgg tcctttagat tcaagaggca gaaagcacat 11161 ttgggggcac ctcctttgta cctggcactt cactagggtg tttctgtgat cttccagaca 11221 atcctctgtg tgagatggag gctgttgtcc ctgtgaggca gatggggaaa gcaaggttca 11281 gggcgtggcg ccactgagtc ctgggagctt ggttcctagg gcattttcta gcttggggat 11341 gggggggtgc aacattattg gtgaactttc accccatcca tcggtcaacc ccagatgctg 11401 cagatacacc accgctacca ccggagtgag ctatcagctg ccactgactt cccaaacttc 11461 cctgatggcc agagtcccct ggagtgctga ttaaaatgtg gaagtcattg ttatgccgtc 11521 aaggccacat gcttggggaa gccaggttca agtgtgcatt gagaccctgg gctgctgctt 11581 atgtgtggag catgttgagg ggctgtgaca ctgtcctggc aggtttgttg tgaggattaa 11641 ataaggtaat ggcttcagtg ccacctgagc ctttactggc atttatttat caggccttcc 11701 atagatgtgc ttctctgttt cttcccgctc atagtccctt aagagtatgt gaacaatccc 11761 agttgaatta tgccttcttc ccaacagaaa caaagaagag aaagtggaat atttctgtta 11821 aaaaaaaaaa tccctgcaga tgctgtaggt acaggggtca gagaggtgtg gccctgaccc 11881 atgcccctag agagcccagt gtcccccaca cacacttccc gggcttggga gccctgctcc 11941 tctccagacg ttccccagca gaaggacact tcgttgtcag gaaatgaagc agtggaaagg 12001 agaaagccag cctggcctct ctggtgtcct gttagtgtgc ggcatcctag ggaggcacca 12061 ggaaagactc cccactcctc agaagcctag agcggacttg gccatttgag aacgctcact 12121 gatatataat aatgtttagg cgattcaaac ctgacacaat actcatgctg ctgccaaggt 12181 tggggcagct tcctgggact gctgacatgg ctgtgctgcc ctggtgccct ggggaacgcc 12241 tgcttccacc aggcaaatag gaaggagtgg ccggcagccc cagggaagcc ggagctggag 12301 cattcttgaa ccaagttgca gatgtaatgg ttgagacgtc ccggtcttta cctggacctt 12361 ggagatttag tcaggttgcc ttgtgtaatc ctgatgctgc tcttttcttg ttaaaacaat 12421 cttaaaagtc ctcaggcaga acaaagagca gatgcctaaa tggctgtact 
tttgatcctg 12481 gtttccctta ggaggtgctc aaaaagcatg agcgtttatt tatttcgtct cttttggaag 12541 aatgtttggg tttaatgtga agtgtattta atgattgcat ttaaaaactg attgcataaa 12601 tagagggctg tgcaaataca ggaggggctg ccttgatgca gggcttggac ccaacccaag 12661 ccatcctgag cacttaatga aggtgtcagt gacatttgtt catttgtgca tgcagaggca 12721 gtagagggca gggggagcaa acaggagctt tggagtccag caggcctgaa tgcctccctg 12781 tctgccccta tttctttgct gttgcttttt gttgttgttg ttgtttgcta cgggatattg 12841 acaatctcaa ccttaattcc tcacctgtgc actgaggatt gcaaagggcg ccctcatcag 12901 tttcatgtga ggattacacg aaaggaggga agcaaagcgg gggaggagga tggtcctgaa 12961 agtgtggatc ggtgggtgtt tgctcctcca gccctgggtg ggagaccctc aggagcacga 13021 gcgagaactt cacagcctgt ggtctctcct tccctcacag aagtggtcga ccagaaggcc 13081 gtgtatttct tcaacctgac ttccatgcaa gactcggaaa tgatccttac ggccactttc 13141 cacttctact cagagccgcc tcggtggcct cgagcgctcg aggtgctatg caagccgcgg 13201 gccaagaacg cttcaggccg cccgctgccc ctgggcccgc ccacacgcca gcacctgctc 13261 ttccgcagcc tctcgcagaa cacggccaca caggggctac tccgcggggc catggccctg 13321 gcgcccccac cgcgcggcct gtggcaggcc aaggacatct cccccatcgt caaggcggcc 13381 cgccgggatg gcgagctgct cctctccgcc cagctggatt ctgaggagag ggacccgggg 13441 gtgccccggc ccagccccta tgcgccctac atcctagtct atgccaacga tctggccatc 13501 tcggagccca acagcgtggc agtgacgctg cagagatacg accccttccc tgccggagac 13561 cccgagcccc gcgcagcccc caacaactca gcggaccccc gcgtgcgccg agccgcgcag 13621 gccactgggc ccctccagga caacgagctg ccggggctgg atgagaggcc gccgcgcgcc 13681 cacgcacagc acttccacaa gcaccagctg tggcccagcc ccttccgggc gctgaaaccc 13741 cggccagggc gcaaagaccg caggaagaag ggccaggagg tgttcatggc cgcctcgcag 13801 gtgctggact ttgacgagaa gacgatgcag aaagcccgga ggaagcagtg ggatgagccg 13861 agggtgtgct cccggaggta cctgaaggtg gacttcgcag acatcggctg gaatgaatgg 13921 ataatctcac cgaaatcttt tgatgcctac tactgcgcgg gagcatgtga gttccccatg 13981 cctaaggtag ggtttcttcc gccttttgcc aaattctaag gctcagctct gccgctaccg 14041 tcaagttcct cagcctgcag gacttctgtt tccccatctg caaaatggga ataacagtac 14101 ttcctatcta ttccaggcag gaaataggta 
gacataggtc accaagtggc agcccgtagg 14161 gtagttgccg cccacatatg tgtgagtttg gctgtgtttt ttgaagtatt ggcttggttg 14221 aattgggact ttaaaatgag aacattcttt tgaaaagcag gagaatccat gtctttagaa 14281 acgcatcccc acatagcaac tatctgcaga ggttgagtag atgctgcttc ctgtacacac 14341 ggcaggtccc agaagctcct tcctgaccaa ggctctcatt tctgttccca gcctggaccc 14401 cagagtgtgg caatctgtga ctagcacagt gctagcctca gcagctctcc cactgtagat 14461 tccctcctcc ttgcagctca gggcaggaat ccacagggaa taggccttca ctgtgcacag 14521 ggtctgcagg acacacagca ctgggctgtt ctgtctgttg caccagccta aggatttcca 14581 gttctcacct tcgctccaga ctgggctgtg gctgagtcca tccctctccc tgcagcctgc 14641 tctgcagcag cctggctgct tgtgatgtca gccctcggca ctcagcacac agtttcttcc 14701 tgcttagtta ctctgcccag ggcaggtcat gcctttccct ccaagaaaag aaagtcagtg 14761 cataaatcca atttgaactt aaataatcag ccagctgggg aagtccagac ccagacatgg 14821 ggcagcagta acgtgtttgg gaattaaatg gcttctcctc aaaacttctc ctcaaaaccg 14881 tgcatgtgca agagagaggg aggtgcctga catagagacg gagcctgcta aaagggccta 14941 gcaaagaccc aggcagatgc aagcccggaa ccctgaaggc ttgcagctgg ttggccatgc 15001 ggttagctta tcaccctggc ttgctgcctc cccaccttcc gggacaccag ccagaacgct 15061 gcaaaggcag gtgtccctgc cctgctccct tcccaccaca cccacacaag agggcttcct 15121 cctcaacagc actacatata caaatctctg tttggctggt gatgcactga agtggatgac 15181 ccggttggtt catcttgtca gaggtgggca cagttttcca agccaggtca ctttctaact 15241 aggggtcttg gtcaagttac tcagtcattt taaacctcgg tgtcttccca gtaaagtcca 15301 ggtaataccc accccataaa attgctaagc tgacaaaaaa aagggggtga tgcagggaga 15361 gcagccccta ggaaatgtca atgtccagcc tgtcacactg ccacgcacat gccaggcctt 15421 cccttcagtg acaagagcac aacctcgtga gaagcatgcc tgtcctcctc agactgcgag 15481 accagggctg tgctggcatc tcgctgggtt ttccctctgg gcagcttctg cctgcttcct 15541 ggtgctccgc ggggatgtgt gacactcagg cgcccccatg tggccctgcc gtgttgggga 15601 cacagcgtgc ctcagggaag ccaaagggca ggaaggcctg ggggctcttg gagacctcag 15661 cccggaactc agcgtatgtc tgattttaga ggaaccgtgt gccccgcctt gggatctcgt 15721 ccattcctct aggtggatgc ctattctgtg gcctcagccg gggaacaaca gcagagcagt 15781 atggcctggg 
actgtgagga tggcatgggt gcgtgggtgg gtgagggcag agctgaggta 15841 accctactcc ttttctgcct tgcagatcgt tcgtccatcc aaccatgcca ccatccagag 15901 cattgtcagg gctgtgggca tcatccctgg catcccagag ccctgctgtg ttcccgataa 15961 gatgaactcc cttggggtcc tcttcctgga tgagaatcgg aatgtggttc tgaaggtgta 16021 ccccaacatg tccgtggaca cctgtgcctg ccggtgagac cactccaggg tggaaagaag 16081 ccacgcccag cagagctgcc ttctcggagc cttctgcaac caggacttgt ggtgcagctg 16141 cagacacaga gcacagctca tgggcaacat cactggggcc cagagagagc tgtccgccag 16201 tgcatcatta gggggtcttt cattgctagt gactagcccc ttaaatgcca gcctgagtac 16261 ctgaaggaat ctgggaatta gccctggcct gaaagtggcc catcattcat acccactgtt 16321 ctgaaggctt gaaaacaaaa catatccaca acattggctt gatgtgatca tcatctcata 16381 actgagcaag aagactatgc aaatcttagg gcgctcgctc cctgcacacg gaaagaactc 16441 tgtttaaatg ctcagttcag aacactttgg gccacatagt gattttggaa aacaggataa 16501 tcgtggtgta aatgagtgtt tcctttcaaa gtccactgca gagcttttat ccatatggta 16561 tgcacatgta gccaatattg gtttcttttt cttaatatat atattttatt ttaaaacaac 16621 aaaaagggag ggcgttgaca ccattcccca cagagatagt catgctgagt gtgggttgtt 16681 taaacatgca tattgaaata acacatatag taacgtggga atactaaaaa ataaccaaga 16741 ttttatattt ttgtaaatta tactttctat actgtagatt gtgtatgtta tgtgttttta 16801 tggaaagcta ataaattaaa ggtacagtgg tatcttgaaa aactgaatgt cacccatttc 16861 aaaatctcac tggctcccca tctgtaagta caccatctgg tggttgctgg ggactcagcc 16921 tcttttaggg tcgctggagt cccctaagcc atctccatac tccctggtgt ctgtcccagg 16981 ctccagaaac ccaggctttc ccatgctttc caggggcctg tcccactact ggttcctgca 17041 aacccaagct aggctgatgt tgagaatgga catggctcca gacagggaca gtggcctctc 17101 ggggtttctc ctcatcccct agcctcaggg acatggattg attgggaggt gcctggggtg 17161 ttttaaaaaa catctatggt tgcctggtca gagatctttt acttctggta attcatggag 17221 caagagattc aacataagta ggggtctctc ctgaggtcca aggactcatg aggaagttgg 17281 acgggc // LOCUS D63789 5669 bp DNA PRI 27-DEC-1996 DEFINITION Human DNA for SCM-1beta precursor, complete cds. ACCESSION D63789 NID g1754608 KEYWORDS SCM-1beta; SCM-1beta precursor. 
SOURCE Homo sapiens placenta DNA, clone:hg44. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yoshida,T., Imai,T., Kakizaki,M., Nishimura,M. and Yoshie,O. TITLE Molecular cloning of a novel C or gamma type chemokine, SCM-1 JOURNAL FEBS Lett. 360 (2), 155-159 (1995) MEDLINE 95180438 REFERENCE 2 (sites) AUTHORS Yoshida,T., Imai,T., Takagi,S., Nishimura,M., Ishikawa,I., Yaoi,T. and Yoshie,O. TITLE Structure and expression of two highly related genes encoding SCM-1/human lymphotactin JOURNAL FEBS Lett. 395 (1), 82-88 (1996) MEDLINE 97002294 REFERENCE 3 (bases 1 to 5669) AUTHORS Yoshida,T. JOURNAL Unpublished (1995) REFERENCE 4 (bases 1 to 5669) AUTHORS Yoshida,T. TITLE Direct Submission JOURNAL Submitted (07-AUG-1995) to the DDBJ/EMBL/GenBank databases. Tetsuya Yoshida, Shionogi Institute for Medical Science; 2-5-1, Mishima, Settsu, Osaka 566, Japan (E-mail:teyoshid@fl.lab.shionogi.co.jp, Tel:06-382-2612, Fax:06-382-2598) FEATURES Location/Qualifiers source 1..5669 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="hg44" /map="1q23" /tissue_type="placenta" TATA_signal 2154..2158 exon 2197..2278 /number=1 prim_transcript 2197..5349 gene 2218..5230 /gene="SCM-1beta" CDS join(2218..2278,4075..4189,5062..5230) /gene="SCM-1beta" /codon_start=1 /product="SCM-1beta precursor" /db_xref="PID:d1010504" /db_xref="PID:g1754609" /translation="MRLLILALLGICSLTAYIVEGVGSEVSHRRTCVSLTTQRLPVSR IKTYTITEGSLRAVIFITKRGLKVCADPQATWVRDVVRSMDRKSNTRNNMIQTKPTGT QQSTNTAVTLTG" intron 2279..4074 /gene="SCM-1beta" /number=1 exon 4075..4189 /gene="SCM-1beta" /number=2 mat_peptide join(4077..4189,5062..5227) /gene="SCM-1beta" intron 4190..5061 /gene="SCM-1beta" /number=2 exon 5062..5349 /number=3 BASE COUNT 1702 a 1058 c 1248 g 1661 t ORIGIN 1 ggatccagga ggataacaag ggaatctcct actctcaaag agtctgccat ctagtgggag 61 acgcaggaat gtaattgagt aggagaacac aatgagattc 
gttgcagaac agccatgaga 121 acagaacaaa gttctaagag agcataaagg ggtggcacaa cttaatttta tcaaaaaaat 181 tcaggaaaac ttatacagag aggaggagtt tacaagtaac tatgtaggga gctgtcatgg 241 gtattccagt taaaggaaac atgtgaggag cataaaagag gctggcccat tgggttggct 301 gcacatgtat gtgtttgtta aggtttggga gtgtgtgagt gaatggtgga aggtgagtct 361 gaaaggaaag cagtactaga tcttgagcat tcttatatat cacaatgaaa gatttgaaat 421 acatcctgta ggcattggaa gttagcaaaa gaggttctca gtagggaaat ggcatgatta 481 gattgaggct ttacagtgat taccctggca aagctgcaga gaacagactg agaggaggcc 541 ctggtctggg taaccagtta gtccactgta atttgcctaa catttgagca gtgtggggag 601 aaaggaggac acatctcaaa ggactaccta gaaggtatac ttagtccagt ttggttgaga 661 atgacatgta ggtggatagc aatgagtcta agatgatccc tatatatcag tatttggaaa 721 ttagatggaa gagaacacat tgctccatgc taaggactaa tatgaggaga agcagtttga 781 atagaagatg tgtccagtgt tcaaggagtg attgtgcagt aaggtagaga tcattaaaga 841 gccagtttga agtttagaat gaagtctggg tgaaaaatca aatgcgatta gtgggaagtc 901 tcttagaggt taacctatac tttttatgaa aacataggaa tttattttca tattccctaa 961 tagcaaagga cctttaacta cacatatatt taataaatac attttataga tacctatgta 1021 aaataaagaa caaccaccac acaacaaact cagtggcaga aaagttccag tgcaatcagt 1081 atatttaaat tctactgggg gtattgcaaa tcaaatttac attttgggag actatatggc 1141 aatatataat aagagctata aagatgatca tatcattttt cccagtaatc ctactcctgg 1201 ttattaaagg gaagtaattc agtgtatatt aatgaaatcg accttatagt tctgatcttt 1261 ataataagtt ataagaaaat ggtttaactt gtatgtgtat atatttactc gaagtagaaa 1321 tatgaaggct aaaaaaatgg gaagatattt aaattggtgt taaaacagca tttaaaatta 1381 ccacaattat gagaacacat gtatgccaat gcagattcac tggaaaaata ttgaaaatga 1441 aaactgtcag atggtaagat tataatttta tttctttttt aatttgaaat aaattggtaa 1501 cagcacagct tttcaaaagc ttctataaat gtgtatgtta agttgtaata aagcaaacac 1561 atgcatgtaa gacatgctta aacagttatt taattgtttc ttgggtacct ggggagatgg 1621 ggtgaagaaa ggggggtgac ttgaatgaag gtggaggaga aaaatgagaa ccaagaaagc 1681 aaaggatcga gaagctcagt gtggcagcag cctctcttcc cctcctgaga gagtcaaagg 1741 gtggcatcag ggactcatga tccatggttg tggaagcctc atgtcacact ggatgtcaca 1801 
tgaggtggga tggaacacag tgaccacccc acctcatttc ctttacagct tccgtggggg 1861 ccatggcagt gaacagcctt caggcatgtc tacggtggaa gatctgaatt caggctggtg 1921 gcaggagaca acacaaccac gttttctttt atgcatgcat ttggtttaat tgacacatta 1981 accacagaca aaggggtaaa ggccacaagg cgataggtta gtatgaacag ggaaagggac 2041 attttttttt tttaagaaaa ataaaagcat cagtattgca aagactttcc atgatcctat 2101 acccacctcg aaagccccct ctcaccacag gaagtgcact gaccattgga ggcataaaag 2161 agatcctcaa agagcccgat cctcactctc cctgcacagc tcagcgggac ctcagccatg 2221 agacttctca tcctggccct ccttggcatc tgctctctca ctgcatacat tgtggaaggt 2281 aagtggagaa gctgtctgtg agataaagaa tagggaggca aggcaggtgg gcacacattt 2341 tgggtttgac tcgggttttg actggactaa actgctgtct ccaggggagc cttaaacttc 2401 ccatgtgcaa gaaaggaatg atgattttga ctgtagaggg cttcgtaaac ttccaaaaca 2461 gggagaattt gattagtatc tgggctccta cttttcctaa ttgggtaatt tcaggtaaat 2521 tccttaacca ctcagggcct gtgcttattt atgtataaac tgaatagaat aagagacatg 2581 atcacctgag attaagatta aataaatatt atggtttatt taataacatc agatttcctt 2641 acaagcagta attttttgat taatgttagc tatggattag aggtgatgat tataaatgca 2701 tttgtaggtt ttgcccattt aatatatagt ttgataaatt atcaaaatct tagagagttc 2761 agttacaata tggggatgca ccagaggatg tatgttctgg agcaaatcaa tgttttcaat 2821 acaaaacctg tgtgaaggcg acagtagtgc ttgctgtgga ctggatgtcc cagtcttgcc 2881 ttccttcccc ttgataatgc aataagggac ccccatttta ggacgcagga caggcagaaa 2941 gataaccagc ttgatggggt ccacaccatg tgcaatcact accagctgag acttcttgtt 3001 ttccagcaag gtggtgatga tgttaacccc tgctcaaaga acaggtgatt tcctagtggg 3061 gacaacccct ttgctagcag ctttcttctc agcctgggcc aacagtctct gcttcttctc 3121 ttgctttgtc tctggtcagt acttgtggat cagcttaagt ggctgagtag ctgtttgggg 3181 gtctaaggct tgggtgaact ggttaatggc agaaggcatt ttcagctgct tatagaggat 3241 agctctttgc agctggaacc agatatagcg gggccatttc acaaagcagt ggaggtctct 3301 tttgggctgg atgtcctgtc caatgcctgc ctaagaaaac tcttaggcct tttctcacac 3361 agcggtttca tcactttctt agcctcctgc ttcctcacga cggcagggac tgggccacct 3421 tctttccttt ggccttcttt cttttcagca tcttaggcag ctgacagaga gggaaatttg 3481 accatttaaa 
aaggggaaca cctttattta ctcagtcaaa agcatgcttc cttccctcac 3541 tgaatgttgc cttgcctaga gtactcttca cgcattactc tgtcatctca cttatggtac 3601 tgtaacatgt tgcactattt gaaatgatct tttctgtttg cctgtctgct gcctggctcc 3661 ctcatgagaa gatatgctct atgaaaacag ggataatgtc tgtcttaata aaacatgtgg 3721 gacacaacag gcaccattgt ataaatgaat gaatgcgtgt cactggggca tttgctagcc 3781 gtcccaaatg tctaagtgaa aatatacaca gagacgggat aacatcttgt tattttctct 3841 cagcatgaaa ttcctgaaac aattctgttg attgagtttt taaattagtc aaatatttac 3901 taagaatctg tgacgggcaa gagattcggg atgcctatca gtcctctctt cccccaaaaa 3961 gcaaatggcc ttatattctc acaacattct cagagtaatt taacagacga ttgttcctgt 4021 gatctgggta attgctttat ttttaattgt ctgttgtttt tttttcctca tcaggtgtag 4081 ggagtgaagt ctcacatagg aggacctgtg tgagcctcac tacccagcga ctgccagtta 4141 gcagaatcaa gacctacacc atcacggaag gctccttgag agcagtaatg tgagtctgcc 4201 tcctcagaag ttgggctggg tgggtaccta gaggtataga aatacactct atagaaatgc 4261 tgccatcctc aggaaaagta ggtcagcata gaggaacacc tcaacttaac caaaaacctc 4321 tttagttttc cttatcaacc atgtctttct gcagcccaac cgaatagcga ttattgcaga 4381 aattgggctg ccaaagaaag aatagaagtc ctcctctatt tgtcttagtg gaagagtctg 4441 ttgaatactg tgcacagctc tgagatctgg gtttagagat ggctggctca tgtcagggtt 4501 tccctgcaag cctcactgga gttgggggat cttagggttg agttaggcag agtcccatac 4561 tttatcagtt gccatatttc aagaaaatga gtcaatgcac aacctacatg gtccctttct 4621 tctaccagaa tctcattttt agaagtaata actcttccca atacatattg caagctttgc 4681 tctaaagaat gaaaatgtaa aaatcacctt tttaaaaaaa ataagatgag tattttcaaa 4741 tttgaaaagg aagaggttat ataataatgg aactagatgg cctcaaatgt ctttttgtta 4801 caacatttgg tgacatggat gagaaaagga gcctgtgaat tatggtgaac aaaggggctg 4861 gatactactt gcagatattt ctcctttatg ttaaaataga tggcagaaga agggtgctca 4921 tttatgatct catggctctg aaagactatt tcttgcagta atttctgcac aagatctctt 4981 catgtctgcc ctgatcttaa ctcctgaccc tgaggctttg agaacgtggc taacttcatc 5041 tgtcttttcc ttgcgttaca gttttattac caaacgtggc ctaaaagtct gtgctgatcc 5101 acaagccacg tgggtgagag acgtggtcag gagcatggac aggaaatcca acaccagaaa 5161 taacatgatc cagaccaagc 
caacaggaac ccagcaatcg accaatacag ctgtgaccct 5221 gactggctag tagtctctgg caccctgtcc gtctccagcc agccagctca tttcacttta 5281 caccctcatg gactgagatt atactcacct tttatgaaag cactgcatga ataaaattat 5341 tcctttgtat ttttactttt aaatgtcttc tgtattcact tatatgttct aattaataaa 5401 ttatttatta ttaagaatag ttccctagtc tattcattat atttagggaa aggtagtgta 5461 tcattgttgt ttgatttctg accttgtacc tctctttgat ggtaaccata atggaagaga 5521 ttctggctag tgtctatcag aggtgaaagc tatatcgatc actcttagag tccagcttgt 5581 aatggttctt tacacatcag tcacaagtta cagctgtgac aatggcaaca atttgagatc 5641 tatttcaact tgtctctata atagaattc // LOCUS D63861 17133 bp DNA PRI 06-MAR-1997 DEFINITION Human DNA for cyclophilin 40, complete cds. ACCESSION D63861 NID g1769811 KEYWORDS hCyP40; cyclophilin 40. SOURCE Homo sapiens female placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoi,H., Shimizu,Y., Anazawa,H., Lefebvre,C.A., Korneluk,R.G. and Ikeda,J.E. TITLE The structure and complete nucleotide sequence of the human cyclophilin 40 (PPID) gene JOURNAL Genomics 35 (3), 448-455 (1996) MEDLINE 97001145 REFERENCE 2 (bases 1 to 17133) AUTHORS Yokoi,H., Shimizu,Y., Anazawa,H., Lefebvre,C.A., Korneluk,R.G. and Ikeda,J. TITLE The structure and complete nucleotide sequence of the human cyclophilin 40 gene that is highly conserved during evolution JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 17133) AUTHORS Yokoi,H. TITLE Direct Submission JOURNAL Submitted (22-AUG-1995) to the DDBJ/EMBL/GenBank databases. Haruhiko Yokoi, Kyowa Hakko Kogyo, Co. 
Ltd., Tokyo Research Laboratories; 3-6-6 Asahi-cho, Machida, Tokyo 194, Japan (Tel:0427-25-2555, Fax:0427-26-8330) FEATURES Location/Qualifiers source 1..17133 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /sex="female" /tissue_type="placenta" repeat_region 849..890 /rpt_family="(GT)n" /rpt_unit=849..850 repeat_unit 951..1266 /note="negative orientation" /rpt_family="Alu" protein_bind 1447..1453 /bound_moiety="Ap1" misc_feature 1681..1688 /note="E box for transcription factor binding site" protein_bind 1752..1757 /bound_moiety="Sp1" protein_bind 1759..1767 /bound_moiety="EF1A" protein_bind 1792..1800 /bound_moiety="EF1A" prim_transcript 1829..16065 /note="cyclophilin 40 precursor mRNA" protein_bind 1851..1856 /bound_moiety="Sp1" protein_bind 1860..1865 /bound_moiety="Sp1" exon 1913..1997 /gene="hCyP40" /number=1 gene 1913..15456 /gene="hCyP40" CDS join(1913..1997,3718..3858,5910..6016,7996..8184, 9511..9633,9792..9898,11934..12075,14346..14432, 14524..14566,15368..15456) /gene="hCyP40" /codon_start=1 /product="cyclophilin 40" /db_xref="PID:d1010571" /db_xref="PID:g1769812" /translation="MSHPSPQAKPSNPSNPRVFFDVDIGGERVGRIVLELFADIVPKT AENFRALCTGEKGIGHTTGKPLHFKGCPFHRIIKKFMIQGGDFSNQNGTGGESIYGEK FEDENFHYKHDREGLLSMANAGRNTNGSQFFITTVPTPHLDGKHVVFGQVIKGIGVAR ILENVEVKGEKPAKLCVIAECGELKEGDDGGIFPKDGSGDSHPDFPEDADIDLKDVDK ILLITEDLKNIGNTFFKSQNWEMAIKKYAEVLRYVDSSKAVIETADRAKLQPIALSCV LNIGACKLKMSNWQGAIDSCLEALELDPSNTKALYRRAQGWQGLKEYDQALADLKKAQ GIAPEDKAIQAELLKVKQKIKAQKDKEKAVYAKMFA" intron 1998..3717 /gene="hCyP40" /number=1 protein_bind 2004..2023 /gene="hCyP40" /bound_moiety="HSF" protein_bind 2039..2047 /gene="hCyP40" /bound_moiety="NF-kappaB" protein_bind 2057..2065 /gene="hCyP40" /bound_moiety="NF-kappaB" protein_bind 2068..2073 /gene="hCyP40" /bound_moiety="Sp1" protein_bind 2388..2397 /gene="hCyP40" /bound_moiety="NF-muE3" exon 3718..3858 /gene="hCyP40" /number=2 intron 3859..5909 /gene="hCyP40" /number=2 repeat_unit 4188..4499 /gene="hCyP40" /note="positive
orientation" /rpt_family="Alu" exon 5910..6016 /gene="hCyP40" /number=3 intron 6017..7995 /gene="hCyP40" /number=3 exon 7996..8184 /gene="hCyP40" /number=4 intron 8185..9510 /gene="hCyP40" /number=4 repeat_unit 8233..8518 /gene="hCyP40" /note="negative orientation" /rpt_family="Alu" repeat_unit 8971..9108 /gene="hCyP40" /note="positive orientation" /rpt_family="Alu" exon 9511..9633 /gene="hCyP40" /number=5 intron 9634..9791 /gene="hCyP40" /number=5 exon 9792..9898 /gene="hCyP40" /number=6 intron 9899..11933 /gene="hCyP40" /number=6 repeat_unit 10305..10602 /gene="hCyP40" /note="positive orientation" /rpt_family="Alu" repeat_unit 11162..11273 /gene="hCyP40" /note="negative orientation" /rpt_family="Alu" exon 11934..12075 /gene="hCyP40" /number=7 intron 12076..14345 /gene="hCyP40" /number=7 repeat_unit 12415..12709 /gene="hCyP40" /note="negative orientation" /rpt_family="Alu" repeat_unit 13242..13510 /gene="hCyP40" /note="negative orientation" /rpt_family="Alu" repeat_unit 13668..13970 /gene="hCyP40" /note="positive orientation" /rpt_family="Alu" exon 14346..14432 /gene="hCyP40" /number=8 intron 14433..14523 /gene="hCyP40" /number=8 exon 14524..14566 /gene="hCyP40" /number=9 intron 14567..15367 /gene="hCyP40" /number=9 exon 15368..16065 /number=10 polyA_signal 15612..15617 repeat_unit 15645..15946 /note="negative orientation" /rpt_family="Alu" polyA_signal 16038..16043 BASE COUNT 5025 a 3083 c 3658 g 5367 t ORIGIN 1 attatgaaaa aatttaaatg tgcaaaataa agaagagtag tataataaac tcccatgcac 61 caatcatctg cctggccctt atgtttcatc cacaacccaa cccacttacc tgactccagg 121 ttattttaaa acaacttcag gcactatagc atctaatttt cagtaattta aaaaagagtg 181 tataagaatg ctttagagat tacaaggcaa atgcttacac attatctttt tccattctca 241 taccctgttc tttggcactt taccgaaaag gaaatgagaa gggttcagag aaattaaatg 301 acttttcttt ggtaacatgc ctagtataca tgtagtcaga acttgaagct tttgcttttt 361 ttttttcttt tttaacatca actttcgtgc ttagtccagt gtataacatc acaggcaaat 421 tcaaattgag aataggaatc agttgttctt taaccgtaac ccttctaact tttacttatc 481
ctaagaccgg gtagcaagta gggcaggagc cttcatctag agggatggtg acagctctga 541 taggggaaaa ttagcactgg tctttccttc caaaaagcca ctctggctgc tgctgttgac 601 acaaggctcc tggcttttga gacattattt gagtgctgga ttccaatact ttttactttc 661 tcataagaaa aaaaaatgtg aagcatctaa ggatatctcc tgccttattg ataaagtaag 721 cctcaggctg aatccaaatt gctgagctcc aatagaacta cagggcaaag catctacaca 781 gggaattgcc actcttctcc aaaattaggc tttattaaga ggcaaggcta tgttctcctc 841 atatatatgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt aaaaagtaaa 901 tgttttacac agatccccat aagggaaaag tgggctatag taatacagta tttttttgtt 961 tttctttttt ttgggggggg gggacagttt cgctctgtca cccagtctgg agtacagtgg 1021 cgcgatctcg gttcactgca accgccgcct cccgggttca agcgattctc ctgcctcagc 1081 ctcccgagta gctgggacta caggcaagca ccaccacgcc cgggtgattt tttttttttt 1141 tatttttagt agagacgggt ttcatcatgt tggtcaggct ggtctcgaac tcctgacctc 1201 aaatgatccg cccgcctcgg cctcccaatg tgctgggact acaggcgtga gccaccgcgc 1261 cccgccaaat acagttttat caggaatcaa aactgtgctc tcttctggac tctcagcttt 1321 agctgtttgg agcgtttagt ctaagcctta ccacttacat tgtaatgtga agatcaaatt 1381 gtaacgacat ggcaataata aggctcctat tggttgggta ccaccagttt atgccaggca 1441 ttgtactgat tcatgtcttt aatcctcaaa acgaccttcc gaggtcagcg tttttattct 1501 ttttccagag gagcaaactg agtttctgag aagtcacacc caacttggtg gcagagctga 1561 cgctggcatg gggttgactc caaagctctc tacgccacga gggtgggaaa gacaacgaaa 1621 cacaagcgag aagaataatg agaactggga tcgcaacttc attaaatgag aagccaggaa 1681 caggtggcgg agggcacgcg gttcgctgca cggagtcgct caggctcttt gaaacaccgc 1741 gcgggggagc ggggcggtgc cggaagtcag gccatggctt ccggttcttg cgtcggaagt 1801 ggccggtcag cgtcgctgcc ggtctccggc ggagacggac tctggagttt gggcggcccg 1861 ggcggccact aggtactctg atattccgta ctaaacacgt ctgcaagtca agatgtcgca 1921 cccgtccccc caagccaagc cctccaaccc cagtaaccct cgagtcttct ttgacgtgga 1981 catcggaggg gagcgaggtg agcggagtct gacgctcgcg gaaaagcccc gaggaggcgg 2041 gggttcccgg gatttggggg attccaaggg cggaatgcgg ctccggccgg tgactgctca 2101 gtttcgcgga ggggatggct gcctaggctg gtccgccgga gcaggcccgg agatctaggg 2161 ctagatcgat 
ccagcgccct tgtatcaatg ccatcggtgg tgcggaagtt tctggaaatt 2221 tctagttcgg gaattgtggc gtcctttagg tttgcctcct gatttggtgg cacttccacg 2281 ccacctttgc actaccaagt tagcatgaaa tattttatgt ttgcaatggg acagttcgga 2341 aagcatatcc tgggatatgc gtctgctggg ccccaggaaa ctatatttgc cacatgagat 2401 aatatatcag cacgtgctgt gttatttata cgttaaaggg acttgtttgc tcattgcttt 2461 tgccttatta ctgattctta atttagaggt ccgttcagat gagcaaacat tttctcttaa 2521 tttgccattg ggaaaatgcg ctcaacgtag tgatggtgtt gcctaggtta catagccgct 2581 cagaagttcc taaggcctgg tgaattaggg agcgtgttgc tttgtaactg gattggcctt 2641 gttctggaac acgttggaat tgctggccag gcaatgtgtt tgaatgcatg tacaggacac 2701 acacacgttt acacgtacat tcagtttgtt gcaactagct agggctaggt ttaatctcac 2761 ctgtcttagc tgctatctag aggaggaatt tttacagctt ttgcttcaga ggttgcatga 2821 ggcagtttgc tcagagcaca aaggctgcag gtatacgtaa cacgtggcaa gtccgcccac 2881 ccaaagactg tttaattttc tcccgttggt tgcttggtga ttggaaattc tcagtacctc 2941 ttgaagaaat tactcaggat ttaaaactat ttgtatttaa tgatatctct tacatggtgc 3001 taatcttttg gctactataa ttggttatga tccttagtgt aatgtttcaa atctgtttat 3061 atacatacat gacagtaggt agactaaatt tgaatattag aataaaataa gattcactaa 3121 ttctagtgag tatattcaaa ggaaatgaag tttgtatgtc aaagaggtat ctgcactccc 3181 atgttcattg cagcattatt catgatagcc aagataatgg aatcaaacta ggtttccatc 3241 tacagatgga ttttttttaa aatttgatat atgtgcacag tggagtacta ttcacccata 3301 aaaaataatg aaatcttgtc acttctgaca acatgaatta atatggagga cattatgtta 3361 catgaaataa gctgggcaca ggaagacaag gcatgatctc acttacatgt ggtttctaaa 3421 aagttgacct tgaagtagag aatggaatgg tggatttcag gggttgggta gatgttgatc 3481 acgagagacc taatttcagt tagaaagggg gaataaattt aagagatcta ttgtataaca 3541 taatgactat agttagtagc aatgtgttgt gttcttgaaa ttgcaaaaaa gaaaaattac 3601 agtaaagggg aacttggtaa ttcatgattg atctgctacc ataattatac caaccttctt 3661 tttatcattt gtctatttgg ttgttttgca tactgacaag attttctttt cttttagttg 3721 gtcgaattgt cttagaattg tttgcagata tcgtacccaa aactgcggaa aattttcgtg 3781 cactgtgtac aggagaaaaa ggcattggac acacgactgg gaaacctctc catttcaaag 3841 gatgcccttt tcatcgaagt 
atgtgtaaat tttcttaatc ttgttattct tcatacagac 3901 aagtcctgtt ctttaggatt tggggaagaa aacactaagt aggaagtgaa gcttaaaaag 3961 tactattagc gtccatgtct gaggcagtaa cgcagtttcg taaaggttag gctgtgcgag 4021 gtggaattac ttatggtact aaggtttatc tatagagaaa tggactcccc tagctttacc 4081 catcctctca ttctccttca ctctaagatg atttgattga gaaaggaagg ccgtagtcaa 4141 ccaaccttag gtagcttcat tgcctggtga caaaaaagtt atgcatagac cgggcgcagt 4201 ggctcacgcc tgtaatccca gcactttggg aggctgaggc gggtgaatca cgaggtcagg 4261 agatcgagac catcttggct aacgcggtga aaccccgtct ctactaaaaa tacaaaaaat 4321 aagccgggtg tggtggtggg cgcctgtagt cccagctact cgggaggctg aggcagtaga 4381 atggcatgaa ctcagaaggt ggagcttgca gtgagccgag atcacgccac tgcactccat 4441 cctgggcgac agagtgagac tctgcctcaa acaaacaaac aaacaaacaa acaaaaaaat 4501 gcataatttt tgggcctgag tgcttgggaa ggcagcagat ttgaagattc agtgttttgg 4561 gcacatgggt gatagtgtgg tcaaaatatg ccatttcata ctgcttatta cactttgtaa 4621 tgaatcatac ttttgtttag tgataatggt tctcccaata gactgcaggc tctatgagtt 4681 caagaactgt atgttttcac cattttgttt ttggcatcta atacagttat tgagtaagaa 4741 ctcagtaagt atttgtttaa taaataggca aataagcaat aatcacaaga gcaacatttt 4801 ttaaacgatt tattaagaaa aaattaactt tttaaaaata tttaaaatgt ttactttcat 4861 gtttaataaa aacttgaaat atgctaataa actatatgca gaaaatattt aacatagcaa 4921 gtgtgagact gctatcttta gaaaggtctg cctgcaagag tagcacttgt ttggagtctg 4981 ggaacttgac acttgtacca tttcctaaat agtaatggtg atttactatg cctagcctat 5041 ttgtacaaac atgaatgtgt gctctctttt ggggagtctg gaattttggt acatgctagg 5101 tagagtgtgc ccaataaata ccttgggtgc tgagtctcta attggctctc ctgggcagaa 5161 atactgtgta catgttgctg catttttgtt gctgggggaa gtgtgtgttc tgtagcactt 5221 catgggacgg agaaagcata aggaagcctg cacatggatt tttccagact ctgcctgtgt 5281 ctttcccgtc atctcgctgt gtatccttac cactgtgatc caaccacata ctgatttgtg 5341 tgagctctag ggttctctga acatgaaagc aatcttggta acctttgaca caaagtgata 5401 tatttgctga aaattttcca ttagggtaga acaaatctac tgggaatcat tggtctgtat 5461 tcagcagaca ttattgagta actaatatgt actatgtgta catgcacata gttataagga 5521 tgtaaaacaa tacagttggt gtacttaagg 
agatcattgc tggaaggggc agatcataat 5581 gccatatgtt atgtaggcag agtcaagata ctatgggaga aaaattagga catacagctt 5641 agggatttca gtgacagttt tataggggag gtatcatttg aaccgatttt ctgaatgtta 5701 gcagggatca ggcagtgaaa ataggttgga ttcagatggt gaaaagcctg aagcgtttgg 5761 agttgcggag acgggtgggg agggacaaac aggagagaga catgactaga gtcatgcact 5821 acaaagttgt cttacgtacg tatccattag taagcattgc taaaggcatc catgtctcag 5881 tcactaatag ccattttcct tttttgtagt tattaagaaa tttatgattc agggtggaga 5941 cttctcaaat cagaatggga caggtggaga aagtatttat ggtgaaaaat ttgaagatga 6001 aaatttccat tacaaggtaa agtagagaaa atgcatgtta ttgctgataa aaagattgtt 6061 atatatctgg aatacttgtt ggattaagac accaaaaaat gtttcaagtg tgcttagatt 6121 ctccatttct tgtaatagga atcacttttg taaataatgg ttctttaata tggaaaagtt 6181 ttcattcagt aatgattaat tcagcattat ttcattgtga tccagttttt atatgcttca 6241 gttaagccag tgagttttta aatgcgacca gcatctggca aaattgtttc caggaaaaat 6301 gtttccattg ttggaaggat ggtaacttgt gcatccttat ccttgaatgt gaagagcaac 6361 aaaaccatca ctaatatcca ggatgtcagg ggatttgcac actagcagat atagttaaat 6421 gtccaagctc ctaaaggact tcagtggtgc ctgtgccttc ttgtcttcgt tgtcccaaga 6481 tgtaattggg gtatattgca tcattcaagt tataagaatg gacatgtaag tacaaaagta 6541 aagtgatatt tttggaaata aggaatatga gcaagggtat taggataatt tgaaaaaatg 6601 tcgtcagtct tccagtcagt gtttattgag ccactatcac tggcaccaaa gggtttataa 6661 aaatgaaagg ccttgatgac aaggaattta taatcctacc tcagagacaa gattaagata 6721 catgggacta ctggagaata atagtatata aataaatgct aaattgattg aatatccttg 6781 cttgtataac attgctaata ctgtctatat actgcaggag aaagatgtga tttatgcagt 6841 ctgaattatt atggaagtca ttgaggagga atggctactt gggctgtgca gactgggtct 6901 tgcaaggcgg caacatgtct gagatttggt taggcaggaa ctgtagtatt gatgcagaat 6961 ttaaaaaggg acgtcggtga ggaaagacaa aggcaagaga cgttgaggaa ataaagttct 7021 agttactgtt gtaccaaagg aagctaccag aaggatgaaa actaaaggtg atgtgatgat 7081 cacattgaca tagagaaaag gggcctatag acaagacaca tgattcttgc tttctataaa 7141 cgtggtggcc ttaaaatctg aatttgattt agactgaggg gagtaaaagg cattttggtg 7201 gggtgggggg tgtcatgaaa cattatataa catgagagtt 
acggtgcatg caccaacaag 7261 gaatacattt agaaagccta ggagataaag tgaaaaggtt aagaaattgg gtttcataga 7321 tcctttgaca ggattatgcc aaggtagaat tttgatttca agatgtatgc agcagtttct 7381 tttgagcact ctccgaagag gtgattcatc tgggatgaaa acctaatcca aatcctgtat 7441 gatttggatt cttttgagat taggaaagta ccaaactgaa tgtcttgggt aaattcagta 7501 tcccattatt gttccagttt attgatgttt tgtaaaggaa gacttatact gtaggcatat 7561 ttatgaattt ggaagaccgt tcaggtcttc ccagaaaaaa agagcattca tcataaagag 7621 ggcaaaatat attacaagag tttcataatg ctgcatcttc attttaaagg tgaagtgtaa 7681 gtattttctc atggccaatt tagaaataac cctccccccg actattgttt ctgttaccta 7741 tttactttga taaggttttt taaagtttaa agtgattaag tgccccatgt tttgtgaatt 7801 aaaggttgca gtttataaaa aaataaaaag taattaaaaa attttataag tgaaaagtaa 7861 aaaaaagtag tttgatctaa atggtagtat agtgcacccc agttttatgt tcagtaaaag 7921 aacagttcag actaacacac aattcagaaa agaacacatc taaccttggt tttcttttaa 7981 tttttatttc tacagcatga tcgggagggt ttactgagca tggcaaatgc aggccgcaac 8041 acaaacggtt ctcagttttt tatcacaaca gttccaactc ctcatttgga tgggaaacat 8101 gtggtgtttg gccaagtaat taaaggaata ggagtggcaa ggatattgga aaatgtggaa 8161 gtgaaaggtg aaaaacctgc taaagtaagt aaaaagttac agtgaaatac acttattatc 8221 ttagttgcta tctttttgag acggagtctc gatctgttcc aggctggagt gcagtggtgc 8281 ggtctcggct cactgcagcc tctgtctccc aggttcaagc tattctcctg cctcagcctt 8341 ctgagtagca gggattacag gcgcgcgcta caacacccgg ctaatttttt ctgtattttt 8401 aatagagacg gggtttcacc atgttggcca ggctggtctc aaactcctga ccttgtgatc 8461 cgcccacctc ggcctcccaa agtgctggga ttacaggcat gagccaccgc acccagcctg 8521 ttagttgcta tcttaaagaa atataagtaa aaggatcagg tgattattag ctggaaaaat 8581 gttataataa ttttgggtat aggagccagg atctgccact tttttttttt ttttttaact 8641 tttctacaat gggaaagaca attagggtag ttttaaagga gatttttatt atttcaactt 8701 tcatttggtg atctttgcta gtttacgtta agtcttccta tacctgcaaa actgttcagt 8761 tggtatgtat agtagtacat agtggaatga ggataataag cccaaaatct tgtctggatt 8821 gtagcattaa ctagaggctt ggccttgact ttgttactta acttttccac acgtgttttc 8881 tcctgtctaa attacttatg ggtttagact agattatctc tgaagaccct 
tgtgtactta 8941 atgtctgtga atcaaaatga ccttaatatt ggctgggcat ggtagctcat gcttgtaatc 9001 ccagcacttt gggaggctga ggtgggagga tcacttgaga ccaggagttg gaaaccagct 9061 ttggcaacat agtgagatgc tgtctctatt aaaaaaaaaa gaaaaaagaa aaacccctaa 9121 acgaccataa tactacccag cctaagtgac agggttgtga gtgggagttg gaagctcagt 9181 tacttgtagg aggtaggctg tcactcagtg tatgacaaga atagtagtaa accggaagct 9241 catgccccac ctaatgccac ttacatagac aatgataaaa ttgccaactt attctgtgtt 9301 tattgagctc ctgaatagca ataaaatgct tatgtaaata ctaacattgt cgtattagtt 9361 ggaatgtata aaaacagcat tatagtacat aaatgagcct tctcacataa atctgtgaat 9421 tggacagtgc tgcacatgac ttcagctgta tagcttttat ctctcattat tacgatagtg 9481 cacttctaca atttagggtt atttttacag ttgtgcgtta ttgcagaatg tggagaattg 9541 aaggaaggag atgacggggg aatattccca aaagatggct ctggcgacag tcatccagat 9601 ttccctgagg atgcggatat agatttaaaa gatgtgagta ctttcagatt agtcaaactt 9661 taattaattt agtaagtgaa aatatttagt actaaaagtt tgagcttttg tttcagtctc 9721 tcatatgata gaatctagca ttaaaccaca ttacttacta aaatccaaca tatttgtatt 9781 tttttataca ggtagataaa attttattaa taacagaaga cttaaaaaac attggaaata 9841 cttttttcaa atcccagaac tgggagatgg ctattaaaaa atatgcagaa gttttaaggt 9901 aatattattt ctaacaaatt atttttaaga aattataaat tgaattataa actaaaatta 9961 agaattgctc attatatttt tgcaaattac ttgtggaata accaggtgtg ttgaaatagt 10021 tgcatttcct tttttttccc tcagcttgtt ttgagaaatt tcaaaagaca gaagttgaac 10081 gagtagtaca taattactca taagttttac ttacattcac caatttttaa cttttgctgt 10141 ttacgttttt acagaactat tttgatagga agttgttgat actgtgatac tttaccctaa 10201 gtatttcagc acgtatttcc taagaaacaa agacattctg cataagttca ataccgttaa 10261 aacaccccaa agtaatattg atagataata tcaatccata tttaggctgg gcatagtggc 10321 tcatgcctat aatcccagca ctttgggagg ccaagctggg cggatcacct gaggtcggga 10381 gttcgagacc agcctggcca atatggtgaa accccatctc tactaaaaat acaaaaatta 10441 gctgggctgg tggcgcacgc ctgtaatccc agctacttgg gaggctgagg cacaagaatt 10501 acttgaaccc aggaggtgga ggttgcagca agccgagatt atgccactgc actccagcct 10561 gggcgacaga gtgagattct gtctcaaaaa aaaaaaaaaa agtaatccat 
attccaactt 10621 ccccagttgt ttcaatttct tagaaaaatc aagggtccaa actaagatta tgtgttatat 10681 ttagtggtag tatctcttac gtttcctacc tcccctggtt ttttattcat ttgcattttt 10741 gtggagtcca agtcagttgt ttttatagac ttccatctgt atgtgtttgt ttccttaaga 10801 ttagtttgag gtgaaatatt ttggcaagaa gacttaggtg ttgttttgta ttacctatga 10861 tatcacatcc tgaagacaca tttcgctttg gcctgtttga gttcacacac atggttaaag 10921 tagtgtcagc cacaagcatt tgttagtctg aggtggtatc gtgaaattgt atgaatttct 10981 agtccccaac aacttttcac acagtggatc ttagcatcat tgaaaatcca tgcctgaatc 11041 tgttatactg aacttctaag tttatcatcg tacagttatt aactagcatt cttctgtttg 11101 cgagagcctc ccgcatattt tttaagtatc actatagacc tgtggatatt tttcttttct 11161 ttttttttcc ttttatagaa acaaggcctc ctttatattg cccaggctgg tctggaactt 11221 ttgggcccaa gcaatcctcc ctcctcacct tcccaagtag cagggactag aggatatctt 11281 atgggtagtt ttcaattccg tttgttagta cgctattatt gtcattcttt ttcatgccca 11341 aagggttcca aatttgttca gtggaaggcc cctttccttt ggcatgttcc tattagtctg 11401 cgagcacttg cttgctctaa gctcatcttg ccttactttc agatctgata tggcagtttc 11461 tccaaggacc tctgatttct tttagagaaa gaaaacttta gaaaccaaat tctgggggcc 11521 aggtgtgctc agttttgggg cccttttcag ggacaatgca aaatcaacac aatagagttc 11581 tccttcatct ttcctctact gaattccctt ctcctattat gagaatcctg gttccaaaca 11641 taattatgtt ggaaagtaca tagtggggac ctgagtttta aaaatagaaa agtgttacta 11701 gtgcataatt agatttggga attgctctta aattggcctc tttgaacact ttggggaaca 11761 aaaacaattt tttttaagcc ttcttaaaaa ttttaaaaag tgtgatcatt aagattatta 11821 atagcatttt tagagctttt tatatcagaa atgacataac acttctggac atagagaaga 11881 gtaattcctg tttatgtggt ctttcatact gcttttatta atgcattctg cagatacgtg 11941 gacagttcaa aggctgttat tgagacagca gatagagcca agctgcaacc tatagcttta 12001 agctgtgtac tgaatattgg tgcttgtaaa ctgaagatgt caaattggca gggagcaatt 12061 gacagttgtt tagaggtaag tctgtttgat tttgaacttt ttgaataagt tggattaaga 12121 cttagtttga atagtagcaa cttttatata cattaatata tcttattgca taggacctaa 12181 tgaggcttaa gaaaacattt tgtttctaaa tattttggac cctcagttct ttctctacat 12241 tcattttgct tataactctg tgtcacttaa 
cagctatgaa gatccttgcc tttttttctc 12301 aaagcacata tatcagctca tgtattcact taagagcacc tattgtgtgt taggcattag 12361 aagacaaagt cagaaatgat taagatgtgg gttatggagt cagtcaggca tggctttttt 12421 cttcttcttt tttttttgag acggagtctt actctgttgc ccaggctgga gtgcagtggc 12481 gcaatctcgg cttactgcaa cctctgcctc ccgggttcaa gtgattctcc tgcctcatcc 12541 tcccaagtag ctgggattac aggcatgtgc caccatgccc agataatttt tgtattttta 12601 gtagagacag ggtttcacca tgttggccag gctggtctca aactcctgat ctcaggtgat 12661 ccacccgcct tggcctccca aagtgctagg attacaggtg tgagccaccg tgaagtagtt 12721 atagagagtt atataacaca atgtaaggca tttagtgtag tgcctggcac attttaagca 12781 cctaacagag acgagttatt ttaattaaaa gatccatacc ctcaagaagc ttaataagtg 12841 attcaggata ggagggggtc agtttcaatt taggttcatc aggacttgtg tatggagaat 12901 gggttctatt ggaacaggtc atttatcttt aggaggtcat ccaggtatgc taactgataa 12961 agctagaaag gtggcagttt taccttcctc aaatagagca attatgactc tcttccctgt 13021 tgtatagttt gaaaagtttt ccaaatgaaa ctttaagaat tttacatatt aagccaggca 13081 cctagactag ctatgctcag agcaagtaga actaaaatga gatcctaatt ttgtagggta 13141 gatggaataa gggataggaa tatatttaga agggatcata caacagagga aaaaacttat 13201 caaaagaaaa gatgggtctt cactctgggt ttttttttgt tctctgtcac ccatgctgga 13261 gtgcagtgat gcaatcaaag ctcactgcag cctcaacctc ctgggctcgg gtgatcctcc 13321 tacctcagcc tcctaagtag ctgggactac agacccttgc caccataccc agctgatttt 13381 taaatttttt gtagagatgg gtttttgacg tgttgcccag gctggtctct taactcctgg 13441 gctcaagctg tccacccgcc ttggcctcac aaagtgctgg tattatagtt aagagcctct 13501 ctgcccagcc tcgatctctt tttgacctgt gaggcaggta aaatctgagt attggccatt 13561 aaaagatgtt acataaacaa aagataattt taaaatgttt gttctagggg agagacctga 13621 tgcataaagt ctcttaggtt aaaagacctg ttgtaggtaa gaacctaggt tgggtatggt 13681 ggttcatgcc tgtaattcca gcacttaggg aggccacggc ggcgggcaga tcgcttaatt 13741 ccaggagttt gaaaccagcc tgggcaacat ggtgaaaccc tgtctctaaa aaaaaaaaaa 13801 aattagtctg gtgtggtggc acatacctgt agtcccagct acttggaggc tgaggtggga 13861 ggagcacttg agcctggggg gttgaggcag cagtgagctg tgatcaggcc actgcactcc 13921 agcctgggcc 
acagagtgag accttgtctt aaaaaaacaa aaaacaaaaa cttaagagtt 13981 tataacttaa gagttattaa gttattaatg ttcctaaaaa acttataatt caagtaaaaa 14041 gtgtatacaa atgtttcttc aaaatgattt tatcttaaaa gtgttagttc aggcccattc 14101 gggagattga aaccacagtg gggagcatat tggaaactag agaaatgtct ttctcctcca 14161 tgtccattca gtgccctctg ctgacaaaac ttagtgccca ctggcagagg agagctattt 14221 gcatgtttca gctccatttt cacagagcag tgaaggatgg gtttggagtt gataggcaat 14281 gaattgaaat ctggcatata agcttaatac tttgatataa atactagctt tatacttttt 14341 tgtaggctct tgaactagac ccatcaaata ccaaagcatt gtaccgcaga gctcaaggat 14401 ggcaaggatt aaaagaatat gatcaagcat tggtaaattt tgttccaaat gtttaatttt 14461 ttaaaataga caactacctt tataaatcat acacctaact taaatgtttt tttccaatta 14521 aaggctgatc ttaagaaagc tcaggggata gcaccagaag ataaaggtaa gttggcagct 14581 tttgtagtga aagttaattt tgttatttaa atacttatcc tcaggaacca ttgttcactt 14641 tgccagattt tagatgtttg ttcaacagac actacagaat gcctgctgtt gggccaggca 14701 ttatcatata gaatgaacaa gacagtcaaa gtccctgccc tcaaagagct tacattctac 14761 tcccattcaa gaatatagta gtttttcacg ttatttattc ttcaagttat catctctgta 14821 gttttttata tgtctaaata ctgtttgtct tatggtatcc tttgagactg tgttcctatt 14881 ctggttagtc tagtcgagag ctcagcattg taagacacag tattcatggt aaatttaaca 14941 tttgggacca ttcaagttga atgtggctat aaccttagta ttgatgtact ggttatatgt 15001 tttaggaaac tatgacaaaa ataagcttgt ttttcaaata aaccaactat gaatagtgac 15061 actagatctg ggtattacaa aatgtctttg gtctcttatt actaaaatga aacaagggaa 15121 gcaattactg ttctctaatg tagaggcaga aggcaaatct tgatgtactt gagttctctc 15181 agttgacatc ctctaaaatt acgtttcatt tctgtgtttg tgtgattgct cagattttct 15241 gttactaggg cttagttact ttgtagagaa gttttttcaa ggggtagtta acattaaaat 15301 agtttgttag tcaccataat ttttggatat tttgagaaat aactcacatt gcatatttat 15361 tttacagcta tccaggcaga attgctgaaa gtcaaacaaa agataaaggc acagaaagat 15421 aaagagaagg cagtatatgc aaaaatgttt gcttagaaag gattcagttt tgcttattgt 15481 gtgttgattg tataaatgca ataagaaaat gtaaaggttt ttgtctatga atatgatccc 15541 taatgtgttt cttttgacac cttagttcct tactgtttac agtttaggag tactgatagg 
15601 ggttcatgct taataaacat gtcacaatac agtaagtaaa gtggttttgt ttgtttcttt 15661 gagatggagt cttgctctgt cacccaggct ggagtgcggt ggcgcaatct cggctcactg 15721 catcctctgc ctcccgggtt caagcaattc tcctgcctca gcttcccaag tagctgggat 15781 tacaggcacg tgccaccacg cccagctaat ttttgtattt ttagtagaga tggggtttca 15841 ccatattggt cacgtcacgt tggtcttgaa ctcctgacct tgtgatccac cccgccttgg 15901 cctcccaaag tgctgggatt acaggtgtga gccaccgtgc ccggccaagt aaaatgtttt 15961 ttaaaatggt tatgtgcatt attcataaaa aataatggtg tccagtcttt ttaaacttgt 16021 aaagacacat cttattgaat aaagagatga gagcttaagt ttgtatattg tttccttcga 16081 ttctttgatg tatccatggt ggatgatgct aggttttcat acttatagat aatacacaca 16141 cacacacaca cacaatcttg ttcagggatg ggcatacttc ttctgcaagg gccagatcat 16201 aaagttttta ggctttgtag accatgtaat gtcacaccca ctcacctgca cagttgtaat 16261 tcaaaattag ccatagacaa tgtgtaacca atgggtattg ctgtgtttca ataaaacttt 16321 agtctagaat atatattcat ttttatattt gattacttaa ctttggcaag tcaaatgtaa 16381 agtccagtag tgaattttgc tcttggtact gaatctctga agaactgaag aggtaggaag 16441 atgctgcatt cttataaagg ttatttaaaa gcaatacccg taatttcttt atgggacaat 16501 cttacatata gtataaaatt tttatttgat aatcattttg tgactattct gttcagtaat 16561 atgcagaaag acattctgaa atctgttaat ctctaaacat tttaacgtta gcttgccata 16621 cttgaaagaa actggctagc tgcagtttac attccattgt aagcaggtcc tcctccacct 16681 tcaggtacca cccagttaat attctgactt ggatctttaa tatcacatgt tttacaatgt 16741 acacagttct gagcatttat ctgtaaccga aatccatcac cttgttccac aggtacaaat 16801 tcataaactc ctgaaaagta aaaataaata atattgaagt gcctaacttt aaaaaatttt 16861 taaattcaaa agtaaaaatt aaggaaagag tagccacaga aaccttaaag gttgctttct 16921 gaatgcttac ccatagagat tatgaagacc ttggtgcatt gctattgatg tgagtgtaaa 16981 ttacatgtac ataataaaag gaggtacagc ggtattcaac tgagttgtct cactatcaag 17041 ggtactcgac actgagtgtt tgagttccac acaacctcga agtcctccct tattcacagt 17101 catcctctca ttcacagggg acaagttcca aga // LOCUS D67013 8748 bp DNA PRI 09-OCT-1997 DEFINITION Homo sapiens DNA for alpha2-HS glycoprotein (AHSG), complete cds. 
ACCESSION D67013 NID g2521982 KEYWORDS alpha2-HS glycoprotein; AHSG. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Osawa,M., Umetsu,K., Sato,M., Ohki,T., Yukawa,N., Suzuki,T. and Takeichi,S. TITLE Structure of the gene encoding human alpha2-HS glycoprotein JOURNAL Gene 196 (1-2), 121-125 (1997) MEDLINE 97464058 REFERENCE 2 (bases 1 to 8748) AUTHORS Osawa,M. TITLE Direct Submission JOURNAL Submitted (21-SEP-1995) to the DDBJ/EMBL/GenBank databases. Motoki Osawa, Tokai University School of Medicine, Forensic Medicine; Boseidai, Isehara, Kanagawa 259-11, Japan (Tel:0463-93-1121(ex.2630), Fax:0463-92-0284) COMMENT Sequence updated (21-Aug-96) by : Motoki Osawa. FEATURES Location/Qualifiers source 1..8748 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 1..361 /note="AHSG promoter region" CAAT_signal 269..273 TATA_signal 296..303 exon 362..622 /number=1 5'UTR <362..409 CDS join(410..622,2952..3062,3710..3794,4454..4617,5805..5906, 7126..7209,7854..8198) /note="AHSG" /codon_start=1 /product="alpha2-HS glycoprotein" /db_xref="PID:d1023520" /db_xref="PID:g2521983" /translation="MKSLVLLLCLAQLWGWHSAPHGPGLIYRQPNCDDPETEEAALVA IDYINQNLPWGYKHTLNQIDEVKVWPQQPSGELFEIEIDTLETTCHVLDPTPVARCSV RQLKEHAVEGDCDFQLLKLDGKFSVVYAKCDSSPDSAEDVRKVCQDCPLLAPLNDTRV VHAAKAALAAFNAQNNGSNFQLEEISRAQLVPLPPSTYVEFTVCGTDCVAKEATEAAK CNLLAEKQYGFCKATLSEKLGGAEVAVTCTVFQTQPVTSQPQPEGANEAVPTPVVDPD APPSPPLGAPGLPPAGSPPDSHVLLAAPPGHQLHRAHYDLRHTFMGVVSLGSPSGEVS HPRKTRTVVQPSVGAAAGPVVPPCPGRIRHFKV" intron 623..2951 /number=1 exon 2952..3062 /number=2 intron 3063..3709 /number=2 exon 3710..3794 /number=3 intron 3795..4453 /number=3 exon 4454..4617 /number=4 intron 4618..5804 /number=4 exon 5805..5906 /number=5 intron 5907..7125 /number=5 exon 7126..7209 /number=6 intron 7210..7853 /number=6 exon 7854..8584 /number=7 3'UTR 8199..8584 polyA_signal 8564..8569 3'clip 8585..8748 BASE COUNT 2424 a 
2070 c 2114 g 2140 t ORIGIN 1 gatcacagta gaagacattt cctctgctgc caaacccatg gcactctgag gctgactgtg 61 tccacctcat tccctcagct gtcttctctt tgctgctatt accatgttcc aagcagactt 121 tggagcatct cccccacagc agcatggact ttggcagatt tcttggggac cagcgatgtc 181 ctaacctgtt tgcttttcca gggctgatgt ttgcagggtg tttttttttt tcttttgaac 241 caaagcagaa atcatcctgt atccttatgc aattcttccg gcaggctcca acagataaat 301 aaagcccacc accctccatg ggtctacctt tcccagcaga gcacctgggt tggtcccgaa 361 gcctccaacc acctgcacgc ctgcctgcca gggcctctct ggggcagcca tgaagtccct 421 cgtcctgctc ctttgtcttg ctcagctctg gggctggcac tcagccccac atggcccagg 481 gctgatttat agacaaccga actgcgatga tccagaaact gaggaagcag ctctggtggc 541 tatagactac atcaatcaaa accttccttg gggatacaaa cacaccttga accagattga 601 tgaagtaaag gtgtggcctc aggtaagtgg acctgctgtc tatgagctga aataatgtgt 661 acatggagct caatcaggtg cctcaaaaaa tcaccatcca cccagtgcaa atgaaaccac 721 agaggagtaa attctctgat ttcttcccag gagtgaggga aggggcaggc agagggcagg 781 agaggagaca ttctgtatgg cagtcatggg tgtcaggagg gagctgggtg gggtgtgagg 841 tggtgtgcag gagaaaaagt gcttcaaatg gtagtgtgca gatcacagac agaaagtgta 901 acttgctgga aaaactagga cccaagagac cagctcctag ttgccaagtt accactggct 961 gaaaatcacg tatctgtctt tggtttggtt tctctctaac aaagactgag aatgaataaa 1021 actagcatct ggcagatgcc tactatatgc caggcccatt cacatagatt atctcattta 1081 ctctttcccg gtcctgcctc ctggtgctgt gtggtacata tattgttctt gtcttaccca 1141 agaggagacc aaggctctct tgtgtgtgtg tgtgcagttt tttggttttt ggttttgttt 1201 tttttttttt ggtccaaaat catataatta ctaagtcttc aggctgggat ttgattccat 1261 atctgtgttc caacttctac acaaactgcc tcccaaagag agttacccac atcccagaga 1321 gaagtcttgg cataaacaca attcacctcc tcacacacta gacaggaaac caacgcagct 1381 tgaagccagt gacaagaaaa atcaagctgg aaatatgcct cggggatcag tcaagagatt 1441 tggagaggtg gaaagaagct gtctgcctac tgcctgtttt gaaattagat ttatttctga 1501 ttaaggacaa ttctttcagc aaatatgtat tacaagcctt ccttggacaa gaaccagaga 1561 tattaggttg aaccatataa aactgccatt tttctatatc aaaagcaacc aaatattggc 1621 cgttttaatg gttcaaccta atacagtggt gaaaaaggca caatatgtgc ccacaagagc 
1681 ttacaatcta ggttggaaaa taaggttcaa caacaggaag cctggaccga ctgacgactg 1741 ccatccgtct cacaaagaga caaaatattt gaaatcagga ttgctccgga tggattttaa 1801 gagtgctgca gccatattaa agcacagtgg tggttaggag gaaacgctga tcaagtcagg 1861 ggaaatgaac acgcaacacg cacatctgag ggaaaaggta atcatgaatg ggcattgtga 1921 cttttactaa aggcagagct tcagagttgg ttcccttgag aaacccaggt gtacccggtt 1981 cctgttcgcc agagctgtga acgctttcag gcagtcactc tgggcacacc tggacatcat 2041 aaaatgcgga acttctccca ggggagggga tgctgaggct tcaggtacta gtgaatcagg 2101 cagaaccaat gagaggcaaa cagagctggg ctgagaggag aaaaggcata cttgtacctt 2161 ctggtttttc aggttcgaag acaagataca gaaacaggtg aactcacaag aatatctcca 2221 aggattgttg caagctccct cgtgtctaca ctagtgacat ccagtttcct gtcagaggga 2281 gacatgccct tccccattat cgccagcagg gggaagtaga gagcagcatc gttgcatgcc 2341 ggcacctgct gcacaagcca agacaaagga aaaaccaagg acaacagcag caaaaacctc 2401 taggagggaa aagaaaacgg aggaaggaag gaaagcaaat aatgaaaagg aagaaagaaa 2461 gaaggagagg gagggataga ggagattaaa aggccacagt aagatattac cctacaccac 2521 ctattttgca gcttgtctga gaaaaatcca aacttgcatt ttcccaaagc actgcttgcc 2581 gagtgaaatc ttaaaaaata aaataaataa taaatacaaa taagtgttaa cacccatttg 2641 tagttttcaa atagagcgca gagtgagggc tgtggctcca tcgacttgtt caagcccagg 2701 accccgtctg ctttgcgagc atcatctggt gcttccttaa tcaacagacg aagaccagac 2761 aagccctggt cattgtcctg cccacaggcc agttcagagc tagacggagt tgcagactga 2821 cagtaagaat gacatttccc tcacctctcc aaaagcgggg tgctctcaag cccaatgagg 2881 gcgcataccg tggaccgcac cacaggatca ggggaatagg ttgctcgcgg cttcactctt 2941 tgtctccaca gcagccctcc ggagagctgt ttgagattga aatagacacc ctggaaacca 3001 cctgccatgt gctggacccc acccctgtgg caagatgcag cgtgaggcag ctgaaggagc 3061 atgtgagtac ccttcttagg atgactgtag gtggcccttc ggccagctcc accgattcac 3121 ccagcgtctc agcctgcctt cttggctagc cagggtgcag tttctaaaat tgccatttgt 3181 ggccgagcgc agtggctcat gcctataatc tcagcacttt gggaggctga ggcaagtgga 3241 tcgcctgagg tcaggagttc aagaccagcc tggccagtat ggtgaaaccc catctctact 3301 aaaaatacaa aaattagctg gacgtggtga cgggcacctg taaatcccag ctcctcggga 3361 
ggctgaggca ggagaatcgc ttgaacccgg gaggtggagg ttgcagtgag ccaagatcct 3421 gccattgcac tccagcctgg gcaacaacag tgaatctcta tctcgaaata ataataataa 3481 tcatcatcat cataaataaa attgccattt gatgccactt gccctggggc tgagttttac 3541 aagcgtttaa ctatatcgtt gtatccctga aagctgagag tgccatgttt cagtattacc 3601 cagcaaaggc gattttgcaa gggtcacctt tgacagccgt gcctggaggg agcctgcccg 3661 gggtgcgaag gggaagggca gccatcctca cgtgggtttc tttctccagg ctgtcgaagg 3721 agactgtgat ttccagctgt tgaaactaga tggcaagttt tccgtggtat acgcaaaatg 3781 tgattccagt ccaggtacag atgactattc ttattctcat tttttccttg tagagaaagt 3841 ggggaaggga tctgaataat tttcaactta agtagttcta gcagctttgt cggtgaggaa 3901 aaggagaagc caaatttcct gggttctggg atttttaaaa ttgtgtttta agaagctact 3961 cttggcctgg tgcggtggct cacgcctgta atccacccac ccgaggcagg tggatcacct 4021 gaagtcagaa gttcgagacc agcctggcca acatagtgaa acccccatct ctactaaaaa 4081 tacaaaaatg tggtggtgct cgcctgtaat cccagctact agggaggctg aggcaggaga 4141 atcgcttgaa cctgggaggc agaggtggca gtgggccgag atcgcaccac tgcactccag 4201 cctgagtgac acagagtgag accctgtctc ccaaaaataa gaagttattc ttactggaag 4261 tgaaaattgc ctcgtgatga taagagctcc ttcagaaatg tcagcatagc caaagccttt 4321 tgaaggttta gtaagaagca gagaaagtgc ctgaagctat ctggggaatg ccttagccct 4381 tgctaacgca gcagagctgg ggccatgcca gggagaatgg ctgcccacat cctggtttcc 4441 tctctccgag cagactcagc cgaggacgtg cgcaaggtgt gccaagactg ccccctgctg 4501 gccccgctga acgacaccag ggtggtgcac gccgcgaaag ctgccctggc cgccttcaac 4561 gctcagaaca acggctccaa ttttcagctg gaggaaattt cccgggctca gcttgtggta 4621 aagactgaga ttcttttgac aggttgggca gttcggtggc acttcgggaa tgtactgtac 4681 gtggtggagc gggaggcagg gcaagaacag gcgcaggggc agcgatgaga aagcaaggag 4741 agggttgttt ggaaagggaa gaaagcatcc taagggggta tgaggctcct gagtgtcatg 4801 aggaccccaa caccctcagc gcctccccca tgctgagcca ctgtaacgtc cagcagccac 4861 agctgccggc aggtacatcc ccactccctc cgttccagct aaaaccaaag ctcagtgtca 4921 gctggtagag tttgcccacg tcggccagaa gcactcactg taaatttgct gggctccagt 4981 accacccatc tccgctgaac atctgccaca gactcgtaat taatactcac ttgtgctgac 5041 aagcttataa 
tggcaagatc ttaaaatgcc tttcgagtca ctggagaaaa catctcattg 5101 tactgtgggt ggtttagcac attggaattc aacagaattc aaatgtttaa gaaaatgtat 5161 tctggatatc agccatggcc atacttggaa atacgctagt atagacggca attctattaa 5221 tcagaatatg tgattctcag aacatcccca ccccagacta caccaaataa cagatatttt 5281 attgtgtcca tatgctccaa ctactttaaa aaagaaaagc tcaagtgata tcttccatac 5341 tttcatctaa atcttttcat ttgagcctgc tctatgaaac aggtggaaga ggtattaatc 5401 tcttcacttt cccaccctat cttggaataa cctgaacctt gggtatcaag tgcagcccaa 5461 gagtgagggc tggggggagg cagggttccc actcctatca gtctaaggct ggccttctga 5521 ttccggtttc ctatctggaa actcacctcc accctgaagg accggtgatg gaaactttcc 5581 cctcctacaa gggagacaca acccctacct ctaaagcaca agcacttgag aacacaaccc 5641 cataacaact tccctatgta aaccattgag ggacatgtct tctgggccga cgcatggtct 5701 gcatgaatgg tgctccccga aggaggctac ttcccgctct ccttctctgc ccttttcatt 5761 gtaagtcatc tttcctcaag agcattttca tgtactcttc tcagcccctc ccaccttcta 5821 cctatgtgga gtttacagtg tgtggcactg actgtgttgc taaagaggcc acagaggcag 5881 ccaagtgtaa cctgctggca gaaaaggtga gtgggccggg accttggggt gttaccactc 5941 ggacagagct gtttgtggaa cagaacatcc ttggttagtt tgtttcttgg ggctgcagac 6001 agagaataac agtgaaaatc ccctctccct gtggatcacg gaaagcctcc ttttagggtg 6061 tcacctcatc cctttaagag ctgtcatcaa atcatctcac ccactggaag cacatgaagt 6121 taggagaaag agagaggtta tttgttaatg aagccaagtc acgcccaccc actgggaatg 6181 tgaagtgcac atttcctaga catataactc tgatacaaaa gctttcaagt ccttgagcca 6241 ataatgtaca cttctaggat ttcagtctta agaagtcatc aagtggccag gcatgatggt 6301 tcatgcctgt aatccagcac tttgggaggc caagacgggt ggatcgggag gtcaggagat 6361 cgagaccatc ctggctaaca tggtgaaacc ccgtctctac taaaaataca aaaaaattag 6421 ccaggcttgg tggtgagcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 6481 gtgtgaaccc aggaggcaga tgttgcagta aactaagatc gtgccactgc actccagcct 6541 gggcaacaga acgagactct gtctcaagaa aaaaagaaaa agaaaaagaa ttcctccgtg 6601 acatttgaca gaatatatct ataaaaatga tttattatgg atataaagag accaaaaaag 6661 agagatctgt atgtccaaca ggaaggtgtc attgaataat ccatgcacat cagtaaatag 6721 aaaattgtgc agacactaaa 
aattgtgttt tcaaggaata atgaatgata tgagaaaatg 6781 ctattatggc aagtgaaaac acacaggata caacatcgta tagtcacaat gatctcaatt 6841 tttaaatcat atttaatagt attttaaaat aagttagaaa tgcatcaatg ttaacagtcc 6901 ttctttctag gccaccacca gaaagggatt atgggtaatc tctctcactc tccaagtatt 6961 tctgtatttc catgttatat atagaatcat atacctccca caagcagaaa ctataacttt 7021 aagaaaaatg gtttttccaa ctaatttaag gttggcgcgt caatgaaatt gggggggatc 7081 catttttgaa attagttaaa ataaatcctc tttctctgtg ggcagcaata tggcttttgt 7141 aaggcaacac tcagtgagaa gcttggtggg gcagaggttg cagtgacctg cacggtgttc 7201 caaacacagg taacagctcc gtgaatattc ttgcctacac cttcagaata caatgacccc 7261 ttcacattta tgcagtgcag tagtgatgac aggacatttg ctctcccgtg cttctgaatc 7321 tcacagtatg aaataacact ggggtatgcg gaatcatcaa caaatggaag gatattttag 7381 ctatgccttt ccctcccacg aactagtgac atacgggaag aaccatctta ctgtgtagtt 7441 gacaaagcca cctttttatt tgtgggaggt gggagtggtt ttctgagttg cagagaccag 7501 gtggccagat ctacctgtta gctcccagtg gctgcagctt cagatgacaa agagggtggc 7561 actgctgggc aagggtgagc cataggtggg gtgcttttac tcattggaca tatgtgtgta 7621 agtccaccat cacaaagaca atcctagtga ggccggggca acataggcca gtcacccctc 7681 cttgtaacct tgatgacaat cccttgtact taggtaggtc ctttcttgct agactctttg 7741 caaataaaaa tgtataatgt gaggaaattg ggtgccagtg ccacctgggc ctgtgggttg 7801 tcttgcctgg gaggaggaag caaactaact gaaggaaatg gtcctttttc cagcccgtga 7861 cctcacagcc ccaaccagaa ggtgccaatg aagcagtccc cacccccgtg gtggacccag 7921 atgcacctcc gtcccctcca cttggcgcac ctggactccc tccagctggc tcacccccag 7981 actcccatgt gttactggca gctcctccag gacaccagtt gcaccgggcg cactacgacc 8041 tgcgccacac cttcatgggt gtggtctcat tggggtcacc ctcaggagaa gtgtcgcacc 8101 cccggaaaac acgcacagtg gtgcagccta gtgttggtgc tgctgctggg ccagtggttc 8161 ctccatgtcc ggggaggatc agacacttca aggtctaggc tagacatggc agagatgagg 8221 aggtttggca cagaaaacat agccaccatt ttgtccaagc ctgggcatgg gtggggggcc 8281 ttgtctgctg gccacgcaag tgtcacatgc gatctacatt aatatcaagt cttgactccc 8341 tacttcccgt cattcctcac aggacagaag cagagtgggt ggtggttatg tttgacagaa 8401 ggcattaggt tgacaacttg tcatgatttt 
gacggtaagc caccatgatt gtgttctctg 8461 cctctggttg accttacaaa aaccattgga actgtgactt tgaaaggtgc tcttgctaag 8521 cttatatgtg cctgttaatg aaagtgcctg aaagaccttc cttaataaag aaggttctaa 8581 gctgaatgtg gtcatgctta ttgcgacttc atcccagctc ccctcacatg catagccttt 8641 taccccaaca aacacagtgt ccctaatcaa aaccaaagtg aaaagagaac caaaagagaa 8701 caaaaacctg ctgtattgcc agatacagga aaaagtgaga ctaggatc // LOCUS D83195 4592 bp DNA PRI 21-APR-1997 DEFINITION Human DNASE1 gene for deoxyribonuclease I, complete cds. ACCESSION D83195 NID g1197172 KEYWORDS DNASE1*1978G; deoxyribonuclease I; DNASE1. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4592) AUTHORS Yasuda,T. TITLE Direct Submission JOURNAL Submitted (19-JAN-1996) to the DDBJ/EMBL/GenBank databases. Toshihiro Yasuda, Gunma University School of Medicine, Department of Legal Medicine; 3-39-22 Showa-machi, Maebashi, Gunma 371, Japan (E-mail:tyasuda@news.sb.gunma-u.ac.jp, Tel:027-220-8031, Fax:027-220-8035) REFERENCE 2 (bases 1 to 4592) AUTHORS Yasuda,T., Kishi,K., Yanagawa,Y. and Yoshida,A. TITLE Structure of the human deoxyribonuclease I (DNase I) gene: identification of the nucleotide substitution that generates its classical genetic polymorphism JOURNAL Ann. Hum. Genet. 59 (Pt 1), 1-15 (1995) MEDLINE 95283231 REFERENCE 3 (bases 1 to 4592) AUTHORS Yasuda,T., Nadano,D., Takeshita,H., Tenjo,E. and Kishi,K. TITLE Molecular analysis of the third allele of human deoxyribonuclease I polymorphism JOURNAL Ann. Hum. Genet. 59 (Pt 2), 139-147 (1995) MEDLINE 95351715 REFERENCE 4 (bases 1 to 4592) AUTHORS Yasuda,T., Nadano,D., Takeshita,H., Tenjo,E., Sawazaki,K., Ootani,M. and Kishi,K. TITLE The molecular basis for genetic polymorphism of human deoxyribonuclease I: identification of the nucleotide substitution that generates the fourth allele JOURNAL FEBS Lett. 
359 (2-3), 211-214 (1995) MEDLINE 95172237 REFERENCE 5 (bases 1 to 4592) AUTHORS Yasuda,T., Nadano,D., Tenjo,E., Takeshita,H., Sawazaki,K., Nakanaga,M. and Kishi,K. TITLE Genotyping of human deoxyribonuclease I polymorphism by the polymerase chain reaction JOURNAL Electrophoresis 16 (10), 1889-1893 (1995) MEDLINE 96154578 REFERENCE 6 (sites) AUTHORS Iida,R., Yasuda,T., Takeshita,H., Tsubota,E., Yuasa,I., Nakajima,T. and Kishi,K. TITLE Identification of the nucleotide substitution that generates the fourth polymorphic site in human deoxyribonuclease I (DNase I) JOURNAL Hum. Genet. 98 (4), 415-418 (1996) MEDLINE 96384954 FEATURES Location/Qualifiers source 1..4592 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" exon <951..1091 /number=1 intron 1092..1419 /number=1 GC_signal 1204..1209 GC_signal complement(1295..1300) exon 1420..1567 /number=2 sig_peptide 1421..1486 /gene="DNASE1" gene 1421..3944 /gene="DNASE1" CDS join(1421..1567,1896..1984,2149..2232,2628..2743, 2989..3101,3177..3331,3711..3807,3897..3944) /gene="DNASE1" /EC_number="3.1.21.1" /codon_start=1 /product="deoxyribonuclease I precursor" /db_xref="PID:d1012508" /db_xref="PID:g1197173" /translation="MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLV SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERY LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG DAVAEIDALYDVYLDVQEKWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLI PDSADTTATPTHCAYDRIVVAGMLLQGAVVPDSALPFNFQAAYGLSDQLAQAISDHYP VEVMLK" mat_peptide join(1487..1567,1896..1984,2149..2232,2628..2743, 2989..3101,3177..3331,3711..3807,3897..3941) /gene="DNASE1" /product="deoxyribonuclease I" gene 1511 /gene="DNASE1*4" allele 1511 /gene="DNASE1*4" /citation=[4] /frequency="0.0012" /replace="g" intron 1568..1895 /gene="DNASE1" /number=2 exon 1896..1984 /gene="DNASE1" /number=3 intron 1985..2148 /gene="DNASE1" /number=3 exon 2149..2232 /gene="DNASE1" /number=4 intron 2233..2627 /gene="DNASE1" /number=4 exon 2628..2743 /gene="DNASE1" 
/number=5 intron 2744..2988 /gene="DNASE1" /number=5 exon 2989..3101 /gene="DNASE1" /number=6 gene 3012 /gene="DNASE1*3" allele 3012 /gene="DNASE1*3" /citation=[3] /frequency="0.0116" /replace="g" intron 3102..3176 /gene="DNASE1" /number=6 exon 3177..3331 /gene="DNASE1" /number=7 intron 3332..3710 /gene="DNASE1" /number=7 gene 3398 /gene="DNASE1*1978G" allele 3398 /gene="DNASE1*1978G" /frequency="0.68" /replace="c" exon 3711..3807 /gene="DNASE1" /number=8 allele 3737 /gene="DNASE1*2" /citation=[2] /frequency="0.4369" /replace="g" gene 3737 /gene="DNASE1*2" intron 3808..3896 /gene="DNASE1" /number=8 exon 3897..>4066 /number=9 polyA_signal 4060..4065 BASE COUNT 956 a 1347 c 1312 g 977 t ORIGIN 1 ttaaaaatta gccaggcatg gtcccgtacc catagtctta gctactcagg aggctgagga 61 gggaggatta tctgagccct ggcggttgag gctataatga gccatgattg tgccactgca 121 ctccagcctt ggcaacacag tgtgagaccc tgtctcaaaa acaataaaaa cccaaaacaa 181 aagaaccaag aaattactgg acctgaggcc tggcctttag ctgctgtttt gtttttgtga 241 cctggtcact cgggatcccc tgggcctaaa cacacagcct attgtctacc tcaagaaggc 301 tccccactgc ttggctggca attggggtgg gctttgcagg ccccacctgt cctgtcccca 361 cggcgctggt gctgcaggcc cccaccactg cttgttccga gctcccccag cctcctgcag 421 agttgcctgc acctgatggc gatgaatcag gaaggcaggc gtgtcctggg ccacagagca 481 gtcatggctg tcagccacca gggggctcca tttgcacctt tggatgtggc tttggcctct 541 ttgtccaaag tgaccttggg gcccccagac aagagacagg gagactggag cccagtccca 601 ccctcccgca catacctggc ccatccctgc cctatcctgg aagatggggg ccaccacacg 661 tgcaagggac acgggatagg aacctttggc cttgttatca gacattttaa aactaagtgc 721 aaacgtgatt atcaggtgca gtttttacag cagcaagaaa cctgtgctta cagaaagaaa 781 cacgtgctag caacccacct atgcggaaag ccacacagag ccattgtttt ctgtactctc 841 aggtgacggc tcacatttgc cccagggaag gtcacagctg cctgaacttt taaaactccc 901 agacacgcac tgcctgtgca ggatccggag cccagcagca ctgccagggc cttgaagtgc 961 ttcttcagag acctttcttc atagactact tttttttctt taagcagcaa aaggagaaaa 1021 ttgtcatcaa aggatattcc agattcttga cagcattctc gtcatctctg aggacatcac 1081 catcatctca ggtgagcacc aggtggagtg 
cctctgggtg actggccggt ttggagcagg 1141 gagggaggct tagagtctca tcctccagca gccagtgagg cggaggctcc agcgtccctc 1201 ccgggcgggt tttctggtgg atggaggagt gactcggggt cctctacgtg gtgccagctg 1261 tttggctttc tggacgttgt aggaaagggt ttcccccgcc tgcgtccccc tgaccttgag 1321 ctccaccagc ccctgccagc tgggctccag aaggctggag tgctgtggca gggatgacgt 1381 ctcacttctg ttatgtctct gtgccctgtg ctctcccagg atgaggggca tgaagctgct 1441 gggggcgctg ctggcactgg cggccctact gcagggggcc gtgtccctga agatcgcagc 1501 cttcaacatc cagacatttg gggagaccaa gatgtccaat gccaccctcg tcagctacat 1561 tgtgcaggtg aggccagggc agcctccccc caaaagcaga ggagctctgg agtctagggc 1621 tggtgggcag ggccagccct atggagccac agggtgtcgg gtgtggggta ctgagcacca 1681 ctgctcccag cacggtggaa caggctcttg gctgtggacc aaggtcctca tccctgctgt 1741 gctgtccctg gctggcagca ggagcccagg cagaaacatg aggctgcggt taaaccgagc 1801 aatgccacga gcatcagctg tggctccctt tgtggcgctg tagggtccct gggtggcacc 1861 agccctgctc agtaccactg tggccctgcc cccagatcct gagccgctat gacatcgccc 1921 tggtccagga ggtcagagac agccacctga ctgccgtggg gaagctgctg gacaacctca 1981 atcagtgggt gacagtggca gggtcatagg aaggtgacat ctcgtccacg gcacagcctc 2041 acttcacttg ggccccaagg gtggggacct gggcactgcc tcgctatcgg cagccagagg 2101 ggtcccctat ggcccccgcc actgggacct tttgtttctt caatccaggg atgcaccaga 2161 cacctatcac tacgtggtga gtgagccact gggacggaac agctataagg agcgctacct 2221 gttcgtgtac aggtgggtgg tctagaaagc caggaagccc ctccctcacc tgggagggcc 2281 ccaacagagc agggaagtag tttgtcctat tagtttgtcc tatggtaaga acctgaggct 2341 tcagagcagg gtccattttg gagtgaaaac accccaaggt cctggaccaa tgggttgagc 2401 aggtgcctgg ctcccccgcc ctcctgtcgc ctgggtcccg gaccaatggg ttgagcaggt 2461 gcctggctcc cccgccctcc tgtcgcctgg gtcccgcacc aatgggttga gcaggtgcct 2521 ggctcccccg tcctcctgtc gcctgggacg ggacgcctgc ctcctgggaa gcaggagtgg 2581 ggagcttcca gcctggggtc acctcctcct gcccggcctt cccgcaggcc tgaccaggtg 2641 tctgcggtgg acagctacta ctacgatgat ggctgcgagc cctgcgggaa cgacaccttc 2701 aaccgagagc cagccattgt caggttcttc tcccggttca caggtgggtg ctgcctgggc 2761 cagggtgggg ctcggcttgg cgcttatggc ctccaccccc 
tcctagggaa cctggaatgc 2821 ctgtgtcaca cactgccctc ccagtccctg gggcttgggt tttccattca agtcatttgg 2881 aaaatatcca ccccccgggg ggactgtcat gatacatagt tccagctgac atggtgactg 2941 aacctgcccc cagggagtgt gcctcacacg acgtggctgt ctccacagag gtcagggagt 3001 ttgccattgt tcccctgcat gcggccccgg gggacgcagt agccgagatc gacgctctct 3061 atgacgtcta cctggatgtc caagagaaat ggggcttgga ggtgaggccc tcccaggggc 3121 agtgggcacc agcggcctcc gcatgtccca gggccacagg cagcgtttcc tggtaggacg 3181 tcatgttgat gggcgacttc aatgcgggct gcagctatgt gagaccctcc cagtggtcat 3241 ccatccgcct gtggacaagc cccaccttcc agtggctgat ccccgacagc gctgacacca 3301 cagctacacc cacgcactgt gcctatgaca ggtgagcagg gcctcgcgct tagggcagac 3361 tgagggcacc tccaagggca gccgtgactc ataggtccgg cttcagaagc ctcaaagcct 3421 ttgaacactc acccaactga gcttcagttg atccactaca gggaacagaa taacaagagc 3481 cacgattttt taggtttttt cggataagca catctgggga taagaggaga ggcagacacc 3541 taggctgtca tgtggtttcc acattgaggg gcacagacca gggtgtgcag ttttgggcac 3601 ccacagacct gcactggcag gtcccagggc tcttagttta gttcctgcgg gtgctgagcc 3661 aggcccatgt gtgaaagggg aacctacttt ctcttcccaa cacccatcag gatcgtggtt 3721 gcagggatgc tgctccaagg cgccgttgtt cccgactcgg ctcttccctt taacttccag 3781 gctgcctatg gcctgagtga ccaactggta tgtgtcctcc cttgcacagc cacatgagga 3841 tgggacacag gagctcaggt aggctcagcc cagaccctgt gcccacttgc ctgcaggccc 3901 aagccatcag tgaccactat ccagtggagg tgatgctgaa gtgagcagcc ccctccccac 3961 accagttgaa ctgcaggaag agaggaccca tcctgccaca ggacccagaa aaaaagccca 4021 acacacactc gggttaagaa atacctttaa atttaggtaa ataaagctca aggaggtggg 4081 gctgtcatct gtgtggtcag tccttctggc cccctggctg tcagtgtcgc tccagggcct 4141 tgacaagcag ctcattcaag cggcccacca tggccctagg gtcgtcaaca agtccagcag 4201 caatcatggc gttctcgtat atctgaaagg caagaggaga aacccattat gaggggcatg 4261 gggcaccttt tcattttttt ttttttgaga cagagtctca ctgttgccgt ggctggagtg 4321 cagaggcacg atctcggctc actgcaacct caggtgatcc ccccgcctcc caaagtgctg 4381 ggattacagg cgtgagccac cacgcctggc catttttccg tttttaaaac agaatttggc 4441 tgggcacggt ggctcacgag gtcaggagat tgagaccatc ctggccaaga 
tggtgaaaac 4501 cccatctcta ctaaaataca aaattagcca ggcgcatgcc tgtagtccca gctactcggg 4561 aggctgaggc aggggaattg cttgaacccg gg // LOCUS D83657 4092 bp DNA PRI 22-AUG-1996 DEFINITION Human DNA for CAAF1 (calcium-binding protein in amniotic fluid 1), complete cds. ACCESSION D83657 NID g1502284 KEYWORDS CAAF1 (calcium-binding protein in amniotic fluid 1); S100 protein family. SOURCE Homo sapiens adult DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamamura,T., Hitomi,J., Nagasaki,K., Suzuki,M., Takahashi,E., Saito,S., Tsukada,T. and Yamaguchi,K. TITLE Human CAAF1 gene--molecular cloning, gene structure, and chromosome mapping JOURNAL Biochem. Biophys. Res. Commun. 221 (2), 356-360 (1996) MEDLINE 96192053 REFERENCE 2 (bases 1 to 4092) AUTHORS Yamamura,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 4092) AUTHORS Yamamura,T. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) to the DDBJ/EMBL/GenBank databases.
Tokujiro Yamamura, National Cancer Center Research Institute, Growth Factor Division; 5-1-1 Tsukiji, Chuo-ku, Tokyo 104, Japan (E-mail:tyamamur@gan2.ncc.go.jp, Fax:4302) FEATURES Location/Qualifiers source 1..4092 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /dev_stage="adult" /map="1q21.2-q22" TATA_signal 1971..1976 exon 2001..2048 /number=1 intron 2049..2985 /number=1 exon 2986..3143 /number=2 CDS join(3006..3143,3631..3771) /codon_start=1 /product="CAAF1 (calcium-binding protein in amniotic fluid 1)" /db_xref="PID:d1012700" /db_xref="PID:g1502285" /translation="MTKLEEHLEGIVNIFHQYSVRKGHFDTLSKGELKQLLTKELANT IKNIKDKAVIDEIFQGLDANQDEQVDFQEFISLVAIALKAAHYHTHKE" intron 3144..3630 /number=2 exon 3631..3890 /number=3 polyA_signal 3868..3873 BASE COUNT 1091 a 942 c 958 g 1101 t ORIGIN 1 ctgagtatct gggactacag gtgcctgcca ccatgcctgg ctaatttttt gtatttttag 61 tagagatggg tttcaccatc ttagccagga tggtctcgat ctcctgacct cgtgatccgc 121 atttagcaac ctcttgcctg gatggctggg atcctgggct agtcccttct ccaggcagat 181 gtgaaaaaaa aaaaaaatct caggatccca aacttactaa gccagaggga aaagtcaagc 241 tgggaaatgg gtcatgcaag cctgccccca cccaggttgg tttctaaata agatagctac 301 aaagataaaa agccatacac caatctcaca acttgcccac aaggaaattc cttgtgagcc 361 tcaagatctt taccctaaaa cagttctatt gaatttcacc ctggcagtgt aattgacagc 421 ttatcttcac aggtgtggga caaagggcag aacataaagt catccctctg ctcacccaag 481 acaaatccat atctgattgc ttcctctccc ctatttttgt tatcttatgt aaaaatgcag 541 attcactgag ccagatgaag gcataagtga ctatttcctc tactccgctc ccacatgaaa 601 attgtgtatt cagtgaaaga ttgatcaaag actcaaaagg atgcaatcac ttgtctctta 661 tctactgaca catttaaaaa atatttcttc ctcttccccc aatatccact gtttctcctt 721 caaatattga agccctcaaa atcatctttg gagaaagacg tagacctacc tctgggcacg 781 catccttaac tttggccaaa taaacctcct aaaatgattg agattcacct tggtcattgt 841 cattgattta cacaagcctg atcatcctgc tctggtggtg tcctcctggg cctagtgaaa 901 caaatactgt caatggctta acaatgttgc actgagattg tgtcagtcaa ggtgacacat 961 ggatcggaga gacagacttt tgcctccaga tccagagagg tcggtgggca catgaaatga 1021
actagctttg ctaaggggga tatctgtgtt gtggggtaga tggcagcaca ggttctggca 1081 tggccttgat tgctcagcaa tcttattgac ctctaggcat aaaccaaggg cagagtgctt 1141 ttggatgtcc cttagctttg gtgcgacatg gggaatgtac agatgagact taatccattt 1201 tgggcaggtt tctgaatctt gaaggaccac ctctccatga atcctgggca tctgttgtcc 1261 cttgctgcct ctctgtgagt aataaagttg ctttactaaa actcgtgtga atgttctgtc 1321 ttactgaact catgaacatg gtaaagtaac ctaggcatgg tgaactttct tcacacacaa 1381 tgattaccct actccagctt tctaggtctt tgtaacagcc aagttagaaa gttgggaaag 1441 caaaggggag gcatagtagc attgctctct ccttttttga aaaccttgga atctattttg 1501 aaaactgatc ctctgctcca gtgatgaact ttgcatccca gaacgactta ggaggaggtc 1561 tccctaccta tctccttagc tgcataaaga tctttaagtt gatattccat ttgccaaaac 1621 tgaagagctg gtaacctcat ctgttaaaac aagacactca ggctgtaaag tttgttctcc 1681 tagtctttta ttagggagca gtggtgtaga gctaaacctg aaacctgggc cctggctcag 1741 ggctaaaaaa ggttcagggc acccctgtgc gggtgggtgg agtgggggtg attttgtttc 1801 aggctgagca ggagggcaaa attcagtctg ggtacaatct gggctgctcc tgtggcagcc 1861 taagccacag gtttgaagct tctattgagc aacagagtac tttacataaa ttgcctcaag 1921 atgaatcaga gatttcttca cccaagggct aagatgaagc ctgaactgat tataaaacct 1981 tccttggctc agtgcccttc accactgctg gctttttgct gtagctccac attcctgtgc 2041 attgaggggt aagtgcattt tccttggatg ggttcttagg tggtccacag ggtagtgact 2101 gcaggtgacc actttgaaat cacaaggtgt tggaaaccac aaccctccac ttgcctgagc 2161 acctctgggg tgatattgca ttttggaccc aagtagatca gctcttgttg tgctatgtag 2221 actcagtccc atgatttaag aatggctaag caggccgggt acggtgctca cgcctgtaat 2281 cccagcactt tgggaggccg aggcgggcgg atcatgaggt gaggagatcg agaccatcct 2341 ggctaacatg gtgaaacccc gtctctacta aaaaaataca aaaaattagc cgggcgtggt 2401 ggcgggtgcc tgtagtccca gctactcggg aggctggggc aggagaatgg cgtgaacccg 2461 ggaggcagaa cttgcagtga gccgagatcg ccctactgca ctccagcctg ggcggcagag 2521 tgagactccg tctcaaaaac aaacaaacaa acaaacaaaa aacccaaaac aacaacaaca 2581 aaaaaagaat ggctaagcag gggtgggtta tggaccattt gtgggggata tgtggccata 2641 gtgccatcac ctgaggaaag caaacagcct ggccaatcac agaacccttc ccaagggacc 2701 ttgagtaagc 
aataccctcc aagttatgca ccccggtgga tgaggtggta tggaacatgg 2761 gcaactgagg ctgatttttt tagacgttag acagtattat atgtttctcc tccccatcct 2821 atccacacat gagaagttca ccatggatca aacagagcag agaccacaaa atagatatgg 2881 aggatgttat gttttcttta ggcccttgat tcagaaagag aggggagagt ggaggttgag 2941 gggtggggga gaaactgcct ctagacttcc cttgtttcct tgaaggttaa cattaggctg 3001 ggaagatgac aaaacttgaa gagcatctgg agggaattgt caatatcttc caccaatact 3061 cagttcggaa ggggcatttt gacaccctct ctaagggtga gctgaagcag ctgcttacaa 3121 aggagcttgc aaacaccatc aaggtaggtg atgcccctct cactgcaaac ttgcccatga 3181 ccctgcactg aggggagtgg caaggcctgg cagaggatgg tgggacaagg actggggtgg 3241 ggttggcccc agtttgagct cccaaaggat gctctccccg ctgctgtggg agaggtcctt 3301 tgcttccctc tctcaataac tcacttaagg ctctactcag ttcctctctc tgagtctgtt 3361 tcctcatatg taaaatggag agaggcaggg cacgggaggg gttgaatgag tgagcagaag 3421 agtaccttcc aggtcaagcc ttgttaattc cagggtggtt catgaccatt gtgttctctg 3481 ttttgttaca ctggattctc caacacagtg ttggctaatg agttaatgtg cctgggcctt 3541 cacaacccga gctctgaggg ataagaggcc taagaaaggg tcttgtgtgg aggttctgaa 3601 ctcaaggtct ctgttgtcct tttttcctag aatatcaaag ataaagctgt cattgatgaa 3661 atattccaag gcctggatgc taatcaagat gaacaggtcg actttcaaga attcatatcc 3721 ctggtagcca ttgcgctgaa ggctgcccat taccacaccc acaaagagta ggtagctctc 3781 tgaaggcttt ttacccagca atgtcctcaa tgagggtctt ttctttccct caccaaaacc 3841 cagccttgcc cgtggggagt aagagttaat aaacacactc acgaaaagtt ctgaatagtg 3901 tctgtgaact tttttctttc catgcctgcg gttcccaaac ttgatcacac cttacccctg 3961 ctcttctggc agtctgattt agatccttag aactcctgcc ccatttgttc atttgatcac 4021 caattattta tttatcgaat tatttattga atgtccacca ttcattcatt cattcactca 4081 ctcaacaaat at // LOCUS D84344S3 1822 bp DNA PRI 18-AUG-1997 DEFINITION Homo sapiens DNA for SM22 alpha, complete cds. ACCESSION D84342 NID g2340831 KEYWORDS SM22 alpha. SEGMENT 3 of 3 SOURCE Homo sapiens peripheral blood lymphocytes DNA, clone_lib:EMBL3. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1822) AUTHORS Yamamura,H. TITLE Direct Submission JOURNAL Submitted (06-APR-1996) to the DDBJ/EMBL/GenBank databases. Hisako Yamamura, Osaka Medical Center For Cancer and Cardiovascular Diseases, Gastrointestinal Oncology; 1-3-3 Nakamichi, Higashinari-ku, Osaka, Osaka-fu 537, Japan (E-mail:QYJ02406@niftyserve.or.jp, Tel:06-972-1181(ex.2365), Fax:06-972-7749) REFERENCE 2 (bases 1 to 1822) AUTHORS Yamamura,H., Masuda,H., Ikeda,W., Tokuyama,T., Takagi,M., Shibata,N., Tatsuta,M. and Takahashi,K. TITLE Structure and expression of the human SM22alpha gene, assignment of the gene to chromosome 11, and repression of the promoter activity by cytosine DNA methylation JOURNAL J. Biochem. 122 (1), 157-167 (1997) MEDLINE 97420698 FEATURES Location/Qualifiers source 1..1822 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocytes" /clone_lib="EMBL3" /tissue_type="peripheral blood" intron <1..64 /number=1 exon 65..256 /number=2 CDS join(77..256,368..545,842..944,1275..1419) /codon_start=1 /product="SM22 alpha" /db_xref="PID:d1022696" /db_xref="PID:g2340833" /translation="MANKGPSYGMSREVQSKIEKKYDEELEERLVEWIIVQCGPDVGR PDRGRLGFQVWLKNGVILSKLVNSLYPDGSKPVKVPENPPSMVFKQMEQVAQFLKAAE DYGVIKTDMFQTVDLFEGKDMAAVQRTLMALGSLAVTKNDGHYRGDPNWFMKKAQEHK REFTESQLQEGKHVIGLQMGSNRGASQAGMTGYGRPRQIIS" intron 257..367 /number=2 exon 368..545 /number=3 intron 546..841 /number=3 exon 842..944 /number=4 intron 945..1274 /number=4 exon 1275..1822 /number=5 BASE COUNT 390 a 497 c 558 g 377 t ORIGIN 1 ccgggtgaaa gcagagtgct ccctgaccct ctgcccctcc ctcctccacc ctggcctgct 61 ttagctttcc ccagacatgg ccaacaaggg tccttcctat ggcatgagcc gcgaagtgca 121 gtccaaaatc gagaagaagt atgacgagga gctggaggag cggctggtgg agtggatcat 181 agtgcagtgt ggccctgatg tgggccgccc agaccgtggg cgcttgggct tccaggtctg 241 gctgaagaat ggcgtggtga gtggcaccct gggctagggc gctggggggc 
tggggtgtga 301 ccccctgtga gtcctgggcc aatccctgag gactgctaag ctgcgtccta tgccctatgc 361 ctggtagatt ctgagcaagc tggtgaacag cctgtaccct gatggctcca agccggtgaa 421 ggtgcccgag aacccaccct ccatggtctt caagcagatg gagcaggtgg ctcagttcct 481 gaaggcggct gaggactatg gggtcatcaa gactgacatg ttccagactg ttgacctctt 541 tgaaggtaga gaggagaatg ctgggggagg aggtgggcag gaggacaggg tgctgggaca 601 gggagagggt atgaccaaat atgccacaac taggggtgtg ctcgcccgca cacagcaggg 661 atgggatatg ccgagaataa cacgccacgc tcacagggcc cactgagagg cctcccttga 721 attggggaca actcttggcc ctggtttggc catttttttg tgagagacgg gggcaggccc 781 tggcttggag tcttgtttat acgttcttga tgttcatctc ctctctcctg tcttctcaca 841 ggcaaagaca tggcagcagt gcagaggacc ctgatggctt tgggcagctt ggcagtgacc 901 aagaatgatg ggcactaccg tggagatccc aactggttta tgaagtatgt ggcccccagg 961 gagcttgagt ctccgcatgg ggtgggaggt ggcttgttct aaggagcttg cgggaaggat 1021 taggggaagc agatagccaa gaaaggataa agtgagggtc tgggatgggg aataatgggt 1081 ccttaatact ccttgacccc tccctttcca ccctcctgcg ctcagtctcc ctagcctatg 1141 aggcaagcta gattagggaa aaaaagtgca acaggaaggc aatgggattg ggctaggacg 1201 taacagaggg atcagaaaac gggtggaaaa cacacagttc taccaagtct ttatcctgct 1261 tcctcctctt ctaggaaagc gcaggagcat aagagggaat tcacagagag ccagctgcag 1321 gagggaaagc atgtcattgg ccttcagatg ggcagcaaca gaggggcctc ccaggccggc 1381 atgacaggct acggacgacc tcggcagatc atcagttaga gcggagaggg ctagccctga 1441 gcccggccct cccccagctc cttggctgca gccatcccgc ttagcctgcc tcacccacac 1501 ccgtgtggta ccttcagccc tggccaagct ttgaggctct gtcactgagc aatggtaact 1561 gcacctgggc agctcctccc tgtgccccca gcctcagccc aacttcttac ccgaaagcat 1621 cactgccttg gcccctccct cccggctgcc cccatcacct ctactgtctc ctccctgggc 1681 taagcagggg agaagcgggc tgggggtagc ctggatgtgg gccaagtcca ctgtcctcct 1741 tggcggcaaa agcccattga agaagaacca gcccagcctg ccccctatct tgtcctggaa 1801 tatttttggg gttggaactc tc // LOCUS D85429 7246 bp DNA PRI 18-FEB-1997 DEFINITION Human DNA for heat shock protein 40, complete cds. ACCESSION D85429 NID g1816451 KEYWORDS HSPF1; Hsp40; heat shock protein 40. 
SOURCE      Homo sapiens (sub_species:Japanese) placenta DNA, clone:Hsp40-No4.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae;
            Homo.
REFERENCE   1  (sites)
  AUTHORS   Hata,M., Okumura,K., Seto,M. and Ohtsuka,K.
  TITLE     Genomic cloning of a human heat shock protein 40 (Hsp40) gene
            (HSPF1) and its chromosomal localization to 19p13.2
  JOURNAL   Genomics 38 (3), 446-449 (1996)
  MEDLINE   97131529
REFERENCE   2  (bases 1 to 7246)
  AUTHORS   Hata,M., Okumura,K., Seto,M. and Ohtsuka,K.
  TITLE     Genomic cloning of a human heat shock protein 40 (Hsp40) gene and
            its chromosomal localization to 19p13.2
  JOURNAL   Unpublished (1996)
REFERENCE   3  (bases 1 to 7246)
  AUTHORS   Ohtsuka,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-MAY-1996) to the DDBJ/EMBL/GenBank databases.
            Kenzo Ohtsuka, Aichi Cancer Center Research Institute,
            Laboratory of Experimental Radiology; 1-1 Kanokoden, Chikusa-ku,
            Nagoya, Aichi 464, Japan
            (E-mail:kohtsuka@aichi-cc.pref.aichi.jp,
            Tel:052-762-6111(ex.8843), Fax:052-763-5233)
COMMENT     Sequence updated (23-May-1996) by: Ohtsuka Kenzo.
FEATURES Location/Qualifiers source 1..7246 /organism="Homo sapiens" /sub_species="Japanese" /db_xref="taxon:9606" /chromosome="19" /clone="Hsp40-No4" /map="19p13.2" /tissue_type="placenta" CAAT_signal complement(1859..1863) misc_feature 1878..1887 /note="Sp1 binding site" misc_feature 1945..1954 /note="Sp1 binding site" CAAT_signal complement(2024..2028) misc_feature 2054..2068 /note="heat shock element 1" misc_feature 2079..2093 /note="heat shock element 2" TATA_signal 2104..2109 exon 2140..2390 /number=1 gene 2180..4589 /gene="HSPF1" CDS join(2180..2390,3483..4063,4359..4589) /gene="HSPF1" /note="Hsp40" /codon_start=1 /product="heat shock protein 40" /db_xref="PID:d1013503" /db_xref="PID:g1816452" /translation="MGKDYYQTLGLARGASDEEIKRAYRRQALRYHPDKNKEPGAEEK FKEIAEAYDVLSDPRKREIFDRYGEEGLKGSGPSGGSGGGANGTSFSYTFHGDPHAMF AEFFGGRNPFDTFFGQRNGEEGMDIDDPFSGFPMGMGGFTNVNFGRSRSAQEPARKKQ DPPVTHDLRVSLEEIYSGCTKKMKISHKRLNPDGKSIRNEDKILTIEVKKGWKEGTKI TFPKEGDQTSNNIPADIVFVLKDKPHNIFKRDGSDVIYPARISLREALCGCTVNVPTL DGRTIPVVFKDVIRPGMRRKVPGEGLPLPKTPEKRGDLIIEFEVIFPERIPQTSRTVL EQVLPI" intron 2391..3482 /gene="HSPF1" /number=1 CAAT_signal complement(2543..2547) /gene="HSPF1" misc_feature 2592..2606 /gene="HSPF1" /note="heat shock element 3" exon 3483..4063 /gene="HSPF1" /number=2 intron 4064..4358 /gene="HSPF1" /number=2 exon 4359..5760 /number=3 BASE COUNT 1591 a 1884 c 2111 g 1660 t ORIGIN 1 agatctgcct gctggatgct cctgatagtt atttatgtaa ctatccgcca tgatgctcct 61 gatagttatt tatgtaactt ttttatgggg aggctgaggg tctcactgtc acccaggctg 121 gagtgcagtg gcacaatcat agctgcctgc agcctcaaat tcctgggctc aagagatcct 181 cccgcctcag gaggcaggag gatggtcttc ttcttcagga gtccctgtct gcccaggggc 241 agctgtcact ggaatctttt agctgccagg aatgaggtga tggctgaaga gcccaccact 301 aagaaaatgg ttagtcaccc ctttgacaat gcttcctttg tgccatgcct gctctgagca 361 ctttgcaaat gttgactcat ataaacttca cagccctcat gatgtgggca caactgctat 421 cctcattcct cattgtgcag atgaggaagc taaggccagc aaaggcctag taactaaccc 481 aaggttaccc agccgggaaa tgctggagct gggaatgaag cccagctgtc 
tggttccaga 541 gccagtgtta catggctctg acctcccagg cagttcttcc aggaacctcc cccacttccc 601 tgggcccttc ccaggtggaa cccctttctg agcctccttt tcccatcagc tctgtccttg 661 acagaggcac cccagacttt cagagccaaa cgcgatcgtg ttgcctacaa ttgaggaaac 721 agaggccttg aaaggtgtgt gacacaccta tgtcccccca aattataaag gtagttgggg 781 aagaaaaaga agcacgtgtg taagaacggg gaatatttat gaggttctca ctgcgttcca 841 ggcactgggg acccagtgca gacaggacct ggcccctgcc caggtggcga tgtcagactg 901 ggagggaaga cacattgaaa gccaagagac aagataaatt tcaaatttag cggcaagtga 961 tcttgagagg gcagtaaagg gaagaaggcg ggactgggag gaccttggga gatgcgcttc 1021 cgtccagatc ctcaccttgc tttaggaggt gagaacagag gcagtcttcc gtcccaaaat 1081 ccttctcccc acggttggga gaatagcgct ggtgggtttt ttgttcattt tactttattt 1141 tacttatttt atttatttat gagacggagt ctcgctctgt tgccgaggct ggagtgcaac 1201 ggccggatct cggctcactg caacctccgc ctccccgatt caagctattc tcaggcctca 1261 atctcctgag tagctgggat tacaggttcg tgccaccaac gcccgggtaa tttttatatt 1321 tttagcagag atggggtttc accatgttgg ccaggctggt ctcgaactcc tgacctcaag 1381 tgatccgccc gcctcggcct cccaaaatgc tgggattaca ggcgtgagcc accgcgcccg 1441 gcctgtttgt taatcttaaa tagaaacgga gtctccttat gttgcctagg ctggtctcca 1501 actcctagga tcaagagatc ctcctgtctt ggcctcagga agtgatggga ttacaggcat 1561 gaggctccgc gcccggcctg ataggttctc ttattgccat gttgtagata gaagggggcc 1621 cctgggttag gcagacacag gttaggtagt tcgtccggcg tcaccgggcg ggagcccacc 1681 gcaggcccca cgagacaagg gcgagcaggt tcatcgcctc ccggggaaag gactgggtag 1741 tcccaggccc caccccttcc tggaacagca agttcgctgt gggtcccgac ctactgaccc 1801 aggcggaggg cgggacgcag ggtagctggg gccgcgtaga gagggaaaga aggtcgcgat 1861 tggctcgtcc cgaaaggtgg gcggggcctc ctccgacctg tgcgcgcgcg ccgcggggag 1921 gtgttgccga gggcggagcg ggagggggcg tggccccgcg cgggcggccg ttgacggcga 1981 ggccccgccc cggatgtcgc gtgtcgctga aagggcggcg gcgattggcc ggcgccgcgg 2041 gggcgggcgg ggcggaaggt tctggagggg gctggcgggc tctggaagct tccgccggac 2101 gggtatatag agtccgggac tggtcggcgg cggagccggg ggacggcgac agcgggtcgg 2161 cgggccgcag gagggggtca tgggtaaaga ctactaccag acgttgggcc tggcccgcgg 2221 
cgcgtcggac gaggagatca agcgggccta ccgccgccag gcgctgcgct accacccgga 2281 caagaacaag gagcccggcg ccgaggagaa gttcaaggag atcgctgagg cctacgacgt 2341 gctcagcgac ccgcgcaagc gcgagatctt cgaccgctac ggggaggaag gtgtgtgctc 2401 gcggccaggg gcgcggcccc gccttttgac agacagggaa actgaggcac gctggcccct 2461 ctcggcggcc ccccgggaag gacgccccgg gccgtggggc cgagctgccc cgccctccgg 2521 tctcgcgggc ctccgccctc ggattggcgg ccgcgcggtg ggaggaggag ccttggccca 2581 gccgctcggc cagaagcttc tagcatgtct ggggctgccc ctccccgggc gcctcctccc 2641 gcggccccga gcagccccgg ggtgcggtgc atgggggtgg gggagccggg ggtgacgacg 2701 gggacggcgc ggcggagccc gctgcggacc cgggctcacc tgggctcggc cgccggggtc 2761 cgcggggcgg cgcctccggc tcagctgcgg ggcgaggggt tgtgaatgca ggagacgacc 2821 ccgttcgtgg gcttgggggc tgggttggga taattcccgg gaagtgatga cctggccccg 2881 gcagggcacg caggaggcag cggccgccca ggtccggaga gcggggccgc ctcggagggt 2941 aaggagagca gggctttttg ttcctgtgcc cgttgatgat tcaggctggc ttctgggctt 3001 gatctggagc tgactttttg tgcaaggatg gtggcagtga atgctaatgt gtgaggattt 3061 aaaaactccc tatttatttt tgtttttttt aatcttctct tcctacgcac cattagccct 3121 aaaacgttga gagtaaacac ttaccgaaag ccgaccctgg accatgtgtc tttagccatc 3181 tcttggtgat gttgccagga ggatgggtca gtccaggagt ggcagggcct aggtggcacc 3241 tggaggtgaa cacctaaccg catgagtcct tttccactgt caggctacgc ggaaatactg 3301 ttggtagggg ggcgtctctg ctgttggcag tgagaggtag aaacttgggg ttcttccttg 3361 ctctcccctg acagacaaaa aagttcctcc cccatggctt ccaagctgtt tccccctgta 3421 agggaagctc gagagagagt ctgctgttca gccatctgac tgcttctaaa tctgcttttc 3481 aggcctaaag gggagtggcc ccagtggcgg tagcggcggt ggtgccaatg gtacctcttt 3541 cagctacaca ttccatggag accctcatgc catgtttgct gagttcttcg gtggcagaaa 3601 tccctttgac accttttttg ggcagcggaa cggggaggaa ggcatggaca ttgatgaccc 3661 attctctggc ttccctatgg gcatgggtgg cttcaccaac gtgaactttg gccgctcccg 3721 ctctgcccaa gagcccgccc gaaagaagca agatccccca gtcacccacg accttcgagt 3781 ctcccttgaa gagatctaca gcggctgtac caagaagatg aaaatctccc acaagcggct 3841 aaaccccgac ggaaagagca ttcgaaacga agacaaaata ttgaccatcg aagtgaagaa 3901 ggggtggaaa 
gaaggaacca aaatcacttt ccccaaggaa ggagaccaga cctccaacaa 3961 cattccagct gatatcgtct ttgttttaaa ggacaagccc cacaatatct ttaagagaga 4021 tggctctgat gtcatttatc ctgccaggat cagcctccgg gaggtaaggt gccaggtggg 4081 gcggtgtctg tagagggcga tggctgcttt ttgaagcagt ttagtatttg ctgaaccaat 4141 gtgttggggt gtgtggtgcg tgccaggctt ggggcacaac agtgaacagg actgaccagg 4201 gcgctgttgt ggagctttcg tggtggggac gtgtagattt gggggccagg tcttagctgc 4261 agaggcctga tgggtcttat ctatggcaga cggcctctgg gcaagagctg agcctcttga 4321 gccagcttcc atctaatgct gtccttttgc ttttcaaggc tctgtgtggc tgcacagtga 4381 acgtccccac tctggacggc aggacgatac ccgtcgtatt caaagatgtt atcaggcctg 4441 gcatgcggcg aaaagttcct ggagaaggcc tccccctccc caaaacaccc gagaaacgtg 4501 gggacctcat tattgagttt gaagtgatct tccccgaaag gattccccag acatcaagaa 4561 ccgtacttga gcaggttctt ccaatatagc tatctgagct ccccaaggac tgaccaggga 4621 cctttccaga gctcaaggat ttctggacct ttctaccagt tgtggaccat gagagggtgg 4681 gagggcccag ggagggcttt cgtactgctg aatgttttcc agagcatata ttacaatctt 4741 tcaaagtcgc acactagact tcagtggttt ttcgagctat agggcatcag gtggtgggaa 4801 cagcaggaaa aggcattcca gtctgcccca ctgggtctgg cagccctccc gggatgggcc 4861 cacatccacc tccagtccct ggccaggggt gagaggcaga ccagcagatg gacttgatcc 4921 ctctgtgtct ttttgcttct ggctggtaga taatgtcaac ctgcagtctt gattcccaga 4981 ccctgtacac tcctcctttt ctgttgtgtg atcagtttgt gctttattct gtatttgtct 5041 cccatgtctt gctcttctcc tggagaattc tgtcttctct ttggccatct caaattgaga 5101 acctaaacta ttcctgcaga actgcctggt tggcgtccac aagcaatacc tctcgttcca 5161 gcaggaccaa gggagccagc ctccagtgag tgactccagc aagtgcagcc acctctccct 5221 tgatggtctg ggagcctggc ctcagcaagg ggccttcctg acctctggct ccagtgaagc 5281 tgaatgtcct cactttgtgg gtcacactct ttacatttct gtaaggcaat cttggcacac 5341 gtggggctta ccagtggccc aggtaatttt ttgtttcatg gactatggac tctttcaaag 5401 ggatctgatc cttttgaatt ttgcacagcc ctagatacaa tcccttttga taaaagggtc 5461 tttgcttctg attacaggag cactgtggaa cgtctgtaaa tatgttttta taattccatg 5521 tatagttggt gtacactcaa aacctgtccc cggcagccag tgctctctgt atagggccat 5581 aatggaattc tgaagaaatc 
ttggggaggg aaggggagtt ggaacaaatg tctgttccct 5641 ggaggccagt ccagtgctca gacctttaga ctcattgtaa gttgccactg ccaacatgag 5701 accaaagtgt gtgactagtc aatgaagtgc gacagcatta aagactgatg ctaaacctca 5761 ggggagcggt cctgtgactc tgtttgaggg ttctgctggt tttgggggtg gagtggagag 5821 ctgggcatcc ttccaaattc aatcaaggtg agagggtggg atgggcagga ggcaagtgcc 5881 ctgcaaggga aacgtaagtc tcccttcctc agccataagt gggttggaga acttctaccc 5941 agactcgggg tccaaaaaac cagagaccca agcacagtgg ccttggggat cattttttga 6001 tgaaggttgg agtcaaagta gcggtgggga gaggcagggt ctggctccta attctgcctt 6061 tcttcagggt ccctggcttt ttcagcagac cttggctctg gggccaaggg gccaacccag 6121 ggtctggcct cttaattcct tatcactcct tccctgctag aacgggctgg gcgtcctgaa 6181 ggctggccaa acccgagctc aaggtcctgc ccaggactag ccccactgta gagaggaccc 6241 atttcctgcc ttggctttat ttgcaggcta ctaaagctgc ttttactttg taactttctt 6301 taaaataact ggttttatta taaagtaatt gctcaggctt aaaacaatgt attgctgctt 6361 tgttacaaaa tgttctaaag tggaaacact gtatatagac caggagcagt ggctcaggca 6421 tgtaatccca gtaccttggg aggccaaggt gggtagatcc cttgagctca ggagtttgag 6481 accagcctag acaacattgt gaaaccccac ctctacgcaa tctaaaaatt cagccggaga 6541 atgcagcgta cacctgtagg gccagctatt atgggactga ggctggagga tcaattgagc 6601 ccaagaggtg gaggcagtag caactgtgtt tgtgccactg cactccaacc tgggtgacaa 6661 agtgagaccc tgtatcaaga aaaaaaaact aaaacacagt atccaaaagt caaactgggc 6721 cagacgcagt ggctcacacc tgtaatccca gcactttggg aggccaaggc gggcagatca 6781 cctgaggtca ggagtttcag accagcctgg ccaacatggt gaaaccccgt ctctattaaa 6841 acaaaaagcc aggcgtggtg cacgtacctg tagtcccatt tactcaggag gctgaggcag 6901 gagaatcact tgaacatggg aggcagaggt tgcagtgagc aaagatccca ccactgcact 6961 ccagcctggg tgacagagcg agtctgtgtc tcacaaaaaa gtgtcaagct aagagttgct 7021 ttcccttccc actgaaacag ttgaagcgct gaaaacaggc taggggcagc cctgccagtc 7081 caggagaact ttctgcaatg actagaacgt tctatatacc gtttacttaa acatcttaca 7141 aattcaataa aattagaatc agttcctcag tcctgccaca cttcagttgc ttagcagcca 7201 cacatgtatg tggccagtgg caactgttgg acaagtcaag tctaga // LOCUS D86992 24132 bp DNA PRI 23-APR-1997 
DEFINITION  Human (VpreB) DNA for immunoglobulin light chain, complete cds.
ACCESSION   D86992
NID         g2114212
KEYWORDS    VpreB; immunoglobulin light chain.
SOURCE      Homo sapiens human/rodent somatic cell hybrid DNA,
            clone_lib:specific cosmid library; LL22NC03 clone:123E1, upstream
            contig.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae;
            Homo.
REFERENCE   1  (bases 1 to 24132)
  AUTHORS   Shimizu,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-AUG-1996) to the DDBJ/EMBL/GenBank databases.
            Nobuyoshi Shimizu, Keio University School of Medicine,
            Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku,
            Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp,
            Tel:03-3351-2370, Fax:03-3351-2370)
REFERENCE   2  (sites)
  AUTHORS   Kawasaki,K., Minoshima,S., Nakato,E., Shibuya,K., Shintani,A.,
            Schmeits,J.L., Wang,J. and Shimizu,N.
  TITLE     One-megabase sequence analysis of the human immunoglobulin lambda
            gene locus
  JOURNAL   Genome Res.
7 (3), 250-261 (1997) MEDLINE 97228902 FEATURES Location/Qualifiers source 1..24132 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone="123E1, upstream contig" /clone_lib="specific cosmid library; LL22NC03" /map="22q11.2" /tissue_type="human/rodent somatic cell hybrid" misc_feature 1..24132 /note="itv-region III" exon 18391..18436 /gene="VpreB" /number=1 gene 18391..18914 /gene="VpreB" CDS join(18391..18436,18523..18914) /gene="VpreB" /standard_name="immunoglobulin lambda gene" /codon_start=1 /product="immunoglobulin light chain" /db_xref="PID:d1020781" /db_xref="PID:g2114213" /translation="MSWAPVLLMLFVYCTGCGPQPVLHQPPAMSSALGTTIRLTCTLR NDHDIGVYSVYWYQQRPGHPPRFLLRYFSQSDKSQGPQVPPRFSGSKDVARNRGYLSI SELQPEDEAMYYCAMGARSSEKEEREREWEEEMEPTAARTRVP" exon 18523..18914 /gene="VpreB" /number=2 BASE COUNT 6219 a 5791 c 5738 g 6384 t ORIGIN 1 gatcaatcac cccaaacagt ccagtgaccc ctgatggttg agcagcagca ctagaacccc 61 aaatttctag gggagaccca gcttccagtt tattaagttc catgcactta attcttgttc 121 tgcttgattt tgggtcagca gttacatgaa ctcacgtgtt tctcaaccag tgttctggag 181 atctggctca gtgcagggct gtggtttcaa agttattcaa gcaatgccac caaaagcctg 241 taccccagaa tacctgccat aggccatcct gtctatccct ggagacagtc ccttcttatt 301 gaagatgaag cactttggcc cgtagctggt tgccagagct ttcagaaaag cgtcggagta 361 aaacaaccat ctctgatgac aaaagactta aaatggctgt gaagagctct attaattcct 421 caaatattca ttgctctatt ttttatctcc ttttataact cctactgcca ggctattggc 481 ctttctactt ctatcctctc cccgacattc ccatcttagt aaatgactcc acaagtactt 541 cagctcccat tcaaggacct tggattaccc ttgggtcctc tctcttactc aacagcccat 601 cctatgccaa gtctcaccaa ttcttctctc tctctacaca cacacatata caaacttatt 661 tgtaaacaat tattttaaaa aatctctctt ctgaggttcc agctttcttc ttcttttatt 721 tttttgagac tgagtctcgc tttgtcaccc aggctggaat gcagtgacac gatcttggct 781 cactgcaatc tctgcctcct ggtttcaagt gattctcatg cctcagcctc ctgaggagct 841 gggactacag gcatgcatca ccacgcccag ctaatttttt tgtattttta gtaaagacag 901 gatttcacta tgttggcctg gctggtctca aactcctgac ctcagataat ctgcccacct 961 tggcctccaa agtgctagta 
ttacacgatg agccactgtc cccagcctct ctctcttttt 1021 tttttttttt tttttttgag acagggtctt actctgtcgc ccaggctaga gtgcagtggt 1081 atgatcttgg cttactgcag ccatgatctc ccaggctcag gccatcccac ctcagcctcc 1141 tgactagctg ggaccacagg catgctccac catgcccagc taatttgtgt attttttttg 1201 tagatacagg gtttcactat gttgcccagg ctggtgtcag gctcctaggc tccagtgata 1261 cccccacctc ggcatcccaa agtactggga ttacaggggt gagccaccac acccagtcac 1321 agctttcttt ttagagttca ccacagtttg agtgattatt tggaattttt aagtggcaaa 1381 aattacaaca atatatttgt aaatgaaaga gaattgtgaa atgtatttga aattttaact 1441 ttaagatttt ctttaaaata cccatattga atgaagagta tcaaagtaac tattgcttaa 1501 aaaatggggc ttcatttggg ttaaaacaca agaatcatga acatgatcaa aattcttcaa 1561 agtgcttgct tggaatgata cagagaggat tagcaaatac aaatagtttt tttttgtttg 1621 tttgtttttt tttttttttt tttttttttt ttttttaaga gatgcggtct cactatgttg 1681 ctcaggctgg tttcaaactc ctgagctaaa gcaatcctcc tgtcttggcc tccccaagtg 1741 ctaggattac aggcattgag tcaccactgc tggccaagat gctgatcttt ttgttgttgt 1801 tgttgttgcc taggctggag tgcagtggcc tgatctcagc tcaccacaac ctccgcctcc 1861 cagattcaag caattctcct gcctcagcct cctaagtagc tgggattaca ggcatgcgcc 1921 accacgcccg gctacttttt gtgtatttag tagagagggg tttctccatg ttgcccaggt 1981 tggtcttgaa ctcctgacct caggtgatcc acctgcctca ggctcccaaa gtgctggtat 2041 tacaggcatg agccaccgcg cccagtgatg ctaatcttat gaaagacaac caacaacaaa 2101 ttctgaatac taaagtgtgc gaaatgtatt taattctttt catagcacag atgagttaac 2161 ccacgatgtg ttgtcccagg cactatggag tcatcttgaa tttcatgttc taaacaaaca 2221 cccttggcct ggaggaatgc acctgcatat cacgacagga cctagtgtct cagatttagt 2281 ccggcagctt tgaatttgga aatcctcttt ggagcttctt ctgacacagc atcttttctc 2341 tctgggaagg taggtgacac ctgagaacag aggcatggtg tggcatcaga gatggaggta 2401 agaattattt ttctaccaca atcccaggaa aagactctcc tgttcctggt gagggtgaga 2461 gctctggctg aggatgtcct ctttcttctt tcttctccag aggatttttc agtctctctg 2521 caaatggctg cttcacagcc agggttcatc gaaacccctg gatcgtcatg gccagtggtt 2581 tgcgtctctc cagcgccact ccatacgagg gtctttctgc atcctgctct catggtggac 2641 ttctgagggc catgctcctc atttgcaggg 
ctcacaaata ttctgtaaat atccaccagt 2701 gttctgataa cggggtgacc caggacaaac tgaccgtctc cagggttcag gtcccaggca 2761 tgtctagctc cctctttcct ttcactgaaa actaatgtcc tgtttttttc cgtatctata 2821 tggacctttc tgggttcagt ggtgtgggaa aaatagattg ttggtgaaga ctcatcttca 2881 tctcctgaac tgccatcatt gcagtgttca gagtggggat ggtgcttgga atcattccct 2941 tgaccttgcc ttcgaccttg cattagtttt gactttaaaa tgtccttatg aaaattcccc 3001 caagtacgga agcccatagt ctcatcagag atcccacctt ttccctgttc ccctccattt 3061 ttttttcttt tttgaaatgg agtctcactc tgttgcccag gctggagtgc agtggcgtga 3121 tctcggctca ctgcaacctc cacctcccag gttcaagcga ttctcctgcc tcagcctccc 3181 gagtagctgg gactacaggt gcccgccacc atgcccggcc aatttttttt atttttagta 3241 gagacggggt ttcaccatat tggccaggct ggtctcgaaa ttctgatgtt gtgatctgtc 3301 tgccttggcc tcccaaagtg ttgggattac aggcatgagg cactgcaccc agccccttcc 3361 cctctgtttg tgaactccat cccaggatca tggttgagtt cacccagcag gttttcttgc 3421 ctctccagtt cttccagtcg agcccacagt tcctcatttg taagtaagca tccttggact 3481 tgagatcatt ttcaaaacct gaggttttgg attttgaatc aggtttatgt gcaatccagg 3541 gttttctctt taattcagac ttgttttgaa catgttttct taggtcaaca agatctccat 3601 ccccatcacc ccatccctgc agatctgctg tgaatgcaac tttagattca aagtttctcc 3661 tggcttttgt aaagtcatgt agtgctttcc tcacacactc cttcctatgc tccactaacc 3721 ccactgcttg ctttgcagag cactggcaaa accagttgtc cccgaggaga agagtgattt 3781 tgctggtatc cacaagcctt cctggtatag aaggcaagag gcccagttgg caccatgagc 3841 tcgtaagaca gcctattggg caaggtttgg agtcgctctt ggagggtatt acagtccccc 3901 tttacctttt gccagtgtgc aatttcatct tggcacctag agatcatctt ttcctgttct 3961 ccttggggcc tcataacctg ttgaggatgc agctgtttga aggtggccga ggaggctgca 4021 cggtggggca cacggtgggg ctccatggct cctggttctg cgtgtgcgct gtggtcggca 4081 ctgctcaggc tgctgccccc agtcatggcg actgcagtgc ccaacttcct ccagctgagt 4141 cctcatgaga tccttcctgc aatcagtgct cccagctcct cagtcccttc aaaagataaa 4201 tcattccgga gatgggcggt ggtgatgtag catgacaatg tggctacttt caatgctacc 4261 gaactgcata ccaaaaaatg gttaaaatgc tgaattttat gttatgtata ttttaccaca 4321 ttaaaaaaat attagctggg tacagtggct cacacttgta 
gtcccagcac tttgggaggc 4381 tgaggcagga ggattgcttt tagtccaggc atttgagatc agcctgggca acacagtgaa 4441 acccccatct ctacaaaaaa actcacaaaa ttagctggcc aagttgattg tgtgcctgta 4501 atctcagcat ctctggaggt ggaggtcggg ggattgcttg agcccaggag ttcaaggcct 4561 cagtgtactg taatcacacc actgcactcc agcctgggtg gcagagcgag atccagtctc 4621 caaaataata atagtaataa ataatagtaa taataataaa taaataagtg ataataaata 4681 aaaatatcca cttgggataa ttatcacaag aactttgctg tcatggttgt gattttttgg 4741 aacaaattct catggttcta actaaaccaa atcaaataca acattgattt gaacgcccat 4801 gctcacagca gccttattca caaaaaccac aaggtagaag caacccaaat gtccatcgac 4861 agatgaatgg atcaacaaga tgtggtctct ccacacaatt cagccttaaa aaggaaggaa 4921 attccaacac atgcgacaac atgcaggaac tttgaagact ttatatatac taagggtacc 4981 caccccctac ctcactgtat tcatatgttg aaatctcaac ccccaagctg atggtattag 5041 ggtatgggat tattagattc aaagtttggg acataattag gtcatgaggg tggagctccc 5101 atgatgggag ttgtgacctt ataaaaggga ttccagaggc ccctcttacc ccttctgcca 5161 cgtaacaata caacaggaag tcagaagtct gtgaatgaaa gagggccctc accagaaccc 5221 aaccatgcta gcccttcatc ttggacttcc agcctccaga gccatgaggt atcaatgtct 5281 gttgtttata aaccacccag accatggtac tctgttatgg cagcccgacc tcactaagac 5341 actaatcaaa acaaaccggt ccacaaaagg acaaacactg tatgattcca cttacatgag 5401 gtacctagag tagtcaaatt cctacagaca gagtagaatg gtggtttcca ggggctgagg 5461 ggagagggaa atggagagtt gtttaatggc atggagtttt agttttgcca aataaaaaag 5521 ttctggatta gttggacaac aatgtgaatg tacttcatgc ttctgagctg tacacttaaa 5581 aacggttgcc tgggcatggt ggctcatgcc tgtaatccta gcacattggg aggccagggc 5641 agaggactgc ttgagccgag gagttcaaga ccagcctggg caacatagca agactctgtc 5701 tctacaaaac atttttaaaa aaattcacca ggtgtggtgg tgcacacctg tagtcccagc 5761 tacttgggag tctcaggtgg gaggactgct tgagtctgga tgttcgaggc tgcagtgagc 5821 tgtgatcaca ccactgcact ccagcctggg caacacatag tgaagccgta tctttaaaaa 5881 aaacaatggt gagactgggt gcgttggctc acacctataa tcccagcacc atgggaggct 5941 gaggtgggtg gattgtttga gcccaggagt ttgagaccag cctgggcaat gtagcaagac 6001 cctgtctcca caaaaagtta gctgggcatg atggtgcatg tctgtggtcc 
cagctacttg 6061 ggaggctgag gtgaaaggat tacttgagcc taaaagattg aggctgcagt gagctgtgat 6121 tatgccacta tactccagcc taggtggcag agtgagacct tgtctcaaaa aaaaaaaaaa 6181 atggataagg tggtaagttt tatgttatgt gtattttatg ttttttttat tttttaaaga 6241 aacagggtct tgctatgttg cccaggctag agtgcaatgg ctattcacat gtgtgatcag 6301 agtgtactac agcttccaac tcctcagctg gagttgaggt gattctgcct cagcctcccg 6361 agtagctgag accacaggtg tgcacagctt tgcctggctc cattttacta cactacagtg 6421 tctctctttt tacatggttg ttgcagtaaa agaaaattca gtgtatatag aaacatcaca 6481 agcaattgtt ttaagattat atgtaaagtg gaaataagtt gtagactcat atagttgtca 6541 ataggttttc taacataaga actttgggtg gggcacttga gaactgattc agatggaaga 6601 tggctctagg ctggtggaac agtgttgaga attcctgaat gtgtagtatt cctagacctc 6661 tgtccaccaa attaaagcag gtcccccatc accatgatcc tacagggggc atccccctta 6721 gtaacccatg gaatctggcc cggcctcttt ctctagccta aattgagaca ttctcctcct 6781 tgtttctcta ggcctcagcc acactggcct tcttctattt ccaaacacac accaagctct 6841 ctcctgaaac acagccttcc cacacaccgc atctctgcct agttcataca ctgccggcaa 6901 attgtgtatt ggggctgggc ccttcccact gttcagctgt catcctattt ttttcctttg 6961 tcaggggtcc tttgaaaact acgtattcca gaccaggtgc ctggacattc tctcttatga 7021 cacttattaa aacatgttgt tggctgggtg cggtggctca cgcctgtaat cccagcactt 7081 tgggaggccg aggcaggtgg attgcctgag ctcaggagtt tgagaccagc tgggcaacat 7141 ggtgaaaccc cgtctctatt aaaatataaa aaatcagccg ggcatggcag tgtgtgcctg 7201 tagtcccagc tactcgggag gctgaggcag gagaattgct tgaacctggg aggcggaggt 7261 tgcagtgagc tgagattgca ccactgcact gaagcctggg agactctgtc tccaaaaaaa 7321 acaacaacaa aaccaaacaa aaccaaaaca aaagaataaa aaaaaatgtt gttgtcatta 7381 ttttcccgct agactcccac aagattcccc actagactct aagcttcatg ggaggagggg 7441 ctatgcacat tcacagtgtc tgctcttacc agagagtctc acagagcagc cctaacaggt 7501 tagaccaaag aacaggtcat ataaaaaccc aagttccaat ttctcctgag atgtttttct 7561 cctaaaaacc aagttaagaa ctctaggcct ctactgggaa gctgcctccc cattcagaca 7621 gggtctcccc attaccacaa tcccttctag tcctctcttc tacttgcctc cctaggtgat 7681 cattgcaggc atctgagttt gtaaacccga gtttacaaaa ggggattaga gaaatgccca 
7741 catcaaatga tctcattcac ccatttctca gattctaggt cctgatggtt ggcacatgtt 7801 gcaatgagtt ggatgggggg agtccagcca ggacctcagg gtagccacag acttcttcaa 7861 tttccaccct ctaaggtttg gtcctcagat tggaattatg tttcctcaat tctctgcaga 7921 caggaagtgt tgagatctga ctcttagtag taatttaaat gagagaaaga gaggaggagg 7981 gaggtatttt tatatcctat ttgtatctaa ataccagaaa aggtagggag tgggggctta 8041 cacctgtaat cccgacactt tgggaggcca agacgggaga atctcttgag cccaagagtt 8101 tgagaatagc ttgggcaacg tggcatgacc ctatctctac aaaaaataca aaaattagtt 8161 ggacatggtg gcatgtggct gtagtcccag ctatttggga ggctgacatg ggaggattgt 8221 tttagcctag gaggtcaagg ctgcagttaa ctgtgattgt accactgcac tccagcctgg 8281 gcgacagagc aagactgtct caaaacaaac aaacaaataa acaaacaaac agaaaaacaa 8341 acaacccggg aaaatggtac tttgggaaca aggaccttgg tcaacccaat tccttattga 8401 aggaaataac gtatacagtg gtccatttct tttttttttt tttttgagac ggagtctttc 8461 tctgtcgcca ggcttgagtg ctgtggcgtg atctcggctc actgcaacca ccgactccct 8521 ggttcaagcg attctcctgc ctcagcctcc caagtagctg ggattacagg catgcaccac 8581 catgcccaga acatttttgt atttttagta gagatggggt ttcacaatgc tggccaggat 8641 ggtctcaaac tcctgacctc gtgattcacc tgcctcgacc tcccaaagtg cgtgctggga 8701 ttacaggtgt gagccactgc acccagccta cagtggtccg tttctaagac aaaatgcttt 8761 gaattggctt aggtcagcaa cctacagaag aaacatgata cactaggtcc ctgcttggat 8821 agccagccaa tgcctgcttg tcagcctccc ccttccccac cttcgcccct tagttgcctt 8881 cacctgaacc aaagtagttt agtctaagat gaaagtttac tagcctgcaa aatagcttgt 8941 tttgtctgtt cttagcctgc ccagctactt aggtcataag tctaacactt gaagagcccc 9001 taagctaact aagattacaa tgcattgtgg gctgcaacaa aatgcagcaa aacgacccta 9061 aaataaacaa aaaacaaaaa actcctggcg ctcccaccca acaatcaaca ggcgagaagt 9121 ttatgacccc gtagtactca gcctatgagg aactggggga gggacctgtg cactagggga 9181 taaactgctt gttgaaagtg tgctgggtgt gcctgtcaga cacctgatct tgcaagactg 9241 tcattaaaag tctcactttc actgttctcc gggtctccga gtccattctt tgggttggga 9301 tgggcgagac tgtttctcac attatgaaaa gctgaaggtt aagtgaactt ggggaatact 9361 gcctcctgct ggctggatgt catggaagcc tggcccatca gcagagaatg ataagctgga 9421 
gttctggagg gtctgatttc caaggaatca cccccttgga agctcctcta atacaccctg 9481 atgagattaa cctgattagg tccattgttt gataaaaaga ggaggaagtt taatcacttc 9541 cacattcttt gcccacgact aaccctgatt ggatttgggg ctatggttca aatgtatgac 9601 cgattgattt ttttcacttc ttctgttgga cttttgaaaa aaagttctct gaaaaggaat 9661 ttagaggaaa gagacttatt ccagtgaaca gttacagttt gcaaacccaa cagacacaac 9721 ttttggtaca aaaggaagtt acattccaga gaacaagggg aggctttgtc ttttctagag 9781 aaaattcccg ctcagaatcc cactcaggtc tgcttatgca aatgaaagat tcaaactttt 9841 tagttctgat tacttggctc tagctgggtt ctgattagtc aaagcaggtc acagtctact 9901 ggtggcgtaa acaggaacag gcagctatgt aagtcccaaa gtttttccag aaactcaaag 9961 tatgtgtgtg acctctggtc agcaaataac cacttgtctc aaatttaaat tcaggcccag 10021 gattcatctt gaggaattgg ctctttcagg gttcagagac caatcaaaat gtctatcaga 10081 atggatgtgt taaaaggcaa ataatgggca gggcatggtg gctcaagcct gtaatcccag 10141 tactttggga ggccgaggca ggtggatcac ctgagctcag gagttcgaga ccagcctcgc 10201 caacgtgatg aaaccccgtc tctactaaaa atacaaaaaa aattagctgg gcgtggtggt 10261 gtgcgcctgt aatcccagct acttgggaag ctgaggcaga agaatctctt gaaccctggg 10321 ggcggaggtt gcagtgagcc aagatcatgc cactgcactc cagcctgggt gataagagca 10381 agactctgtc taaaaaaaaa aaaaacagca aataacgttg tcctgtgcca tgagccacat 10441 actcaaatat ttacacaggt ccaggggaag taaagagtaa atatggccac caggcgcggt 10501 ggctcacgcc tgtaatccca gcactttggc ggctgaggcg ggtggatcac gaggtcagga 10561 gatcgagacc atcctggcta acacggtgaa accccgtctc tactaaaaaa tacaaaaaaa 10621 ttagccaggc gtggtggcgg gcacctgtag tcccagctac ttgggaagct gaggcaggag 10681 aatggtgtga acccgggagg cggagcttgc agtgagccga gataatgcca ctgcactcta 10741 gcctgggcga cagggtgaga ctgtctcaaa aaaaaaaaaa aaagagtaaa tatggcctga 10801 tgtgaaaaaa aattataacc accagaattt cctaaaatgt attttattga tctatagtgt 10861 tattgcacaa catccacaaa tatctatgat gacaattgtt taaaattcaa gttaaaatac 10921 aaaatttaaa aaagtttacc tacttttacg ttgttaattt atgactcttt gggtttgaaa 10981 tgagatttga tgtcattgta tttatagata ttccttgact tcttgtgtta tgtaccaata 11041 agcacattgt taagttgaaa acatcataaa taaaaaatgt gtagctgggc 
caggagctat 11101 ggctcatgcc tataatccca acactttggg atgccgaggc aggaggatca cttgagctca 11161 agagttcaag accagcctgg gcgacaaagc aagaccccca tctccacaaa aaattaaaaa 11221 attagccagt tgtggtgctg catgcctggg gtcccagcta cttgggaggc tgaggcagga 11281 ggatcatttg aacccaggat gtcgaggctg aagtgagcta tgacggtgtc actatgctcc 11341 agtctgggta acataatgag accttatttt taaaagaaca cacaatgagt agctgagtag 11401 ggctgcggct cactgccact gccagtgttt gaagaacagt tcccactgaa tgcatctcaa 11461 ttttgcactg taatcaagta gaaaaactgt aagtccaatc attgtaagtc tgggaccatc 11521 ttgtccatag tcttatgaca gaatactgtt taaaagtgta gatctatccg aatgttgatg 11581 aatttattgt gaaatgcttc cccttgtgtt gcttctatgt gatctatgtg cagcagtgga 11641 catttttttt ttaagtgaaa aacaaatcat gcccctccct tcattctctc ccccgtgatg 11701 tttcccttag agaacattcc tcccttccct tcccagtccc cacctcagag atggggcaac 11761 tcctcccttt ttatcacact tcttccactc tgaggctcta ggcaatcagt tctcggtcac 11821 tcccacccat gtgaaggttt tttttttttt tttttttttt gagttggagt ctcgctctgt 11881 cgcctaggct ggagtgcagt ggtgcaatct cggctcactg caagctccgc ctcccgggtt 11941 cacgccattc tcctgcctca gcctcccgag tagctaggac tacaggcacc tgccaccacg 12001 cctggctaat ttttttgtat ttttagtaga gatggggttt caccatgtta gccaggatgg 12061 tctcgatctc ctgacctcgt gatccgcctg ccttggcctc ccaaagtgct gggattatag 12121 gcgtgagcca tcgcgcctgg ccccgtatga aggtttttga tgatgcttgc tagaatggac 12181 ccagataatt ttcgtgtgga cagaaggtag atttttagta gctcttgtta gaggtcatgt 12241 aggagattgt ctcagcttga gctgctattg caaataccga tactgtagac tgggtggctt 12301 gaaaacacca gaaatgcatt cctcagagtt gtggaggctg gatgtccaag ataggacagg 12361 tctgagatca gggtgccagc atgcttcagt tctggtagga atcctcttcc aggctacaga 12421 ctgccaactt ctcgtgtctt cgtgtggcag aaaaagggct aaaaagatct ctgggatttt 12481 tttttttttt ttttgagaca gagtctcact ctgttgccca ggctggagtg tagtggtgtg 12541 agtggcatga tcttggtata ctgcagtctc cacctcccgg gttcaggcaa ttttcgtgtc 12601 tcagcctccc gagtagctgg gactataggc gtgtgtgacc atgcccgact aattttggta 12661 tttttagtag agactgggtt tcaccatgtt tgccaggctg gtcttgaact cttggcctga 12721 agtcatgtgt ctgccttggc ctcccaaagt 
gctggggttg caggcacgag ccaccgtgcc 12781 cagcagggcc tcttttgtaa aggcactaat cccattcgtg aggttctgcc ctcatgacct 12841 aattacctcc tacaggtccc atgtctgagc accatcacac tggcatttag gatttcagct 12901 tatgaatttt gggaggacac aaacattcag ttcataacag cattgtgttg tcacccactt 12961 tttttttttt ggtaggaaca gtgtgtcgaa ctctggtctc aagtgatcct gctgcgtcag 13021 cctcccaaag ctttgggatt acaggcacgc actgccaagg ccagccacat ccatattctt 13081 aatctccacc atggcctcat gagcctggtc ccctgtcagt gggctagggc ctcgacatgg 13141 gtaatacgca tattagctga ggttcaggaa cccaactgac ttgtgtaaga atttccagct 13201 ggccccgcag aacaagaggc taaagagtgg ggagcagtgc tagtgttgcc actgctacag 13261 ctctcgggca gtattcacta ccctcatcct gtatgtgtgc ccaggtgctg ggtgagcatt 13321 tcactccaca cgagaggcat catgcattcc ttcattcgtt cgttcatgcc gttatactat 13381 agtgaacaat aaactcttga ggaatgtcgg gtttagggga gccagtcccc tgttcagttg 13441 aaaatctgcc tggctatgta taaattttga ctcccccaaa acttaactac taatagcctg 13501 ctgttgactc gaagccttac tgatgatatc aacagtcaat taacacatac tttgaatttt 13561 atacgtatta tgtactgtat gcattttttt ttctttgaga tggagtctca ctttgtggcc 13621 ctggttggag tgcagtggca ccatctctgc tcactgcaac ctcccagatt taagtgattc 13681 tcctgcctca gcctcccaag tagctgggat tacaggcatg tgccaccaag cctggttaat 13741 ttttgtattt ttagtagaga tggggtttca ccatgttggc caggctagtc tcaaactcct 13801 gacctcaagt gatccacctg cttcggcctc ccaaagtgct gggattacag gcgtgagcca 13861 ctgcaccctt cctgtactgt attcttataa taaagtaagc tagagaaaag aaaacgttat 13921 taagataatc ttaaggaaga taaaatctat ttactattca ttaggtagaa acagatggat 13981 tatcctacag gaggaggagg aagaggaggg ggttggtttt acagtctcag gggcgaaaga 14041 ggcataagaa aatctgcaaa taagtggacc cacgaagttc aaattcctgt tgttcacagt 14101 tagtggtaca caggagacac taggtaagct aaaggttggg ctgggcatgg tggctcacac 14161 ctgtaatctc agcactgtgg gaggtcaagg cgggaggatc gcttgaggcc aggagtttga 14221 gaccagcctg gccaacatgg tgaaatgctg tctctactaa aaatacaaaa agtagctggg 14281 cattgtggca tgtgcctgta atcccagcta cttgggaggc tgaggcacaa gaatcacttg 14341 aacctggaag gcagaggttg cagtgagcca aaatagtgcc acttcactcc agcctgggtg 14401 acagagtgag 
actgtctcaa aaaaaaacaa acaaaccaaa aagattagct aaaggttgaa 14461 ctttcagtga tgaactaaac agaaacaatc tcatagagat tgtggacagg agaagaaaca 14521 agcacaggtg cagttaaaac ccgtgataaa tgccccggca gaaacaagtg ttaggccagg 14581 cgtgatggct catgtctgta atcccaacac tgtgggaggc caagacagga ggactgcttg 14641 aggccggggg tttcaagcct gcagtgagct atgattacac cactgctctc ctgcctgggc 14701 ggtagagacc ctgcttaagg aaaaaaaaaa gaaaggctgg gcgccgtggc tcacgcctgt 14761 aatcccagca ctttgggagg ccgaggtggg cggattacga ggtcaggaga tcgagacgat 14821 cctggctaac acagtgaaac cccgtctcta ctaaaaatac aaaaaaaatt agcagggcgc 14881 ggtggcaggc gcctgtagtc ccagctactc gggaggctga agcaggagaa tggcgtgaac 14941 ttgggaggtg gagcttgcag tgagctgaga ttgcgccact gcactccagc ctcggcgaca 15001 gagcaagact ctgtctcaaa aaacgaacaa acaaaaaaac aaaacaaaac aaaacaaaaa 15061 cctaaaccaa aaccaaaacg cagcatagtc catgagagag gaagagcagg gcaggcctac 15121 tcacctgggg ctgtcgggga ggctgcccag gaagggactc tgaagtgagc acctggagga 15181 aggaggttag ttctgaggga ctggaattca tcctggcatt gggatcacgg gcaaaggtcc 15241 tgtggtgaca aggagactgg tgcgagcctg gcttagcacc atgtgagagc agaggggtgg 15301 aaaggcacta ggtaagcatg ccttcggatg tgaaatgagg gagacgggga ctgggggacc 15361 ctgacatgag ggaggaggaa aggagatggg aagaatgggt gaccagaggg accactagac 15421 caccgagggt gagagcttgc taatatcaga gccatgtgta ggagaatgaa gtgcaatctt 15481 ggatagacct ccagtgcagg tggtccagcc atcaatgacc ataaccacat gcatgattgt 15541 aggtggaacc agaagaacca cccaggcaac tcacagaact ggggaactaa tcaagtcttg 15601 ttttaaatga ctacattacg ggtggtttgc cacgtagcag tggatcgctg aaacagttta 15661 aaatatctat tcagccgggc gtgcggtggg tagctcacac ctgtaatccc agcattttag 15721 gaagctgagg caggaggatc tctcgaacgc aggaatttga gaccagcctt gagcaaaatg 15781 gcaagaccct gtctccacaa aaagtaaaaa aaattagctg ggtgtggtag catgtgcttg 15841 tagtcccagc tacttgggag gctgggacag gagaattgtt tgagcccagg agttcggagg 15901 ctgcagtgag ctgtgactgc accgctacac tccagcctgg acaacagagt gaaaccttgt 15961 ctctaaaaaa aaaaaaaaaa tctacaaacc aggcaggagc actgaatagc tcctgtctcc 16021 cctgagaccc ttgtagacct tgaggtctgt tacagagctc accaaccttc tcttggtcta 
16081 gaattatcta aaaagagttg agtgcagcca gaaaggactg aatacatggt gttcttcaat 16141 cccatcaaac tgctgtgccc tgcagatgat ccctgagtcc aggtgttggt gggttgagga 16201 caaggctgag ggaacagagg tgggaacagt ggcccaggac caggagaatg cagagttcag 16261 agtgcatgac ctgcatctaa ctctagatgc tccaggggtc tgagaccatc tccagcaggg 16321 tctggaaggc ccagtcattc cccagcacgc acagtcacca ctggatacta tcttgctctg 16381 tgagtgatgc aaaggtcctg ggtcagtgaa gcctgtgaaa caaagttggg accagagtgt 16441 ctctacggtt gagggctggg cttatttgct ctctaagtat aattgtaaga gcaaatagaa 16501 acacctatcc tctctgagct taaaaggatc aaaaagctct gacagtgtca gtggtagggc 16561 tagttcaggg gctgcagtag tgctggtgcc cacaggtgga accttctcct gccctgattc 16621 ccaaggactg gtgctctcgg aggggtctgg gaggtatatt tctctttcac aaatacgtgc 16681 acaacacatg aatgagctac tggatgaggg aataaatgaa tgaatgagtg aaaaaagctt 16741 cctttctcct ctccaggagt tagcaatcac tggttagttc cccactgccc accagcctgt 16801 gcagaggagg gaggagcaac aggtcataga tgtggttgca gctgttactg gacagagtgg 16861 tcaagacatc ctgtgtctgc agaggccctc cccagccctc acctggacag tggagctgtt 16921 ggccagagag gcagtgggaa gaagggtggg gctcccagag tcgacccagc gggctgaaac 16981 tggtgcagtc tgcgcctcac caggcagaga atggatgtgg ggcagcttga caccgagcag 17041 gggctcatct gagcctcagg ggtggttgct ctatcaaagt gtttgcaagt aaacagtccc 17101 agtgcacacc cagctgctca gtgccacctc gtgcacccag ggccacttca gccttgatca 17161 caggcctgca caggttccag gggaggagac actccagctt tttgagggag ggtgagcatg 17221 tgggatcaga gactttaagt ccaaatttcc ccttaatttt tttttaaatc ttgcaggtag 17281 tatgcaaaca tacatttaat gtattatttt tattctcatt tatttttttg aggcagggtc 17341 tccctctgtt ggccaggctg gagggcagtg gcgagatctt ggcttgctgc agcctcaact 17401 caacctccca agctcaagtg attttcccac ctcagcctcc caagtagttg ggactgcagg 17461 tgcctgtccg ccatgaccag ctaattttta aatttttttt gtagagatgg ggttttgcca 17521 tgttgctcag gctagactcg aactcctgag ctcaagtgat cctcccacct tggcctccca 17581 aagtgctggg attataggtg tgagacactg catccggctc ccaaatttcc cctttttata 17641 aggacaccag ttgtactgac cggggcccca cctctatgac ctcatctaac cttcattaac 17701 tccttaaagg ccctatctcc aaatacaggc acactggggg 
ttagggcttc aacctatgga 17761 tttgggggat gcaattcagt ctatcacagt gccaagctgc acctgctcgg tgagccagaa 17821 caggggtggg gaaggcacca ggccctggtg cttctttccc atgcctgagt cactctactt 17881 ggttctagga gggagaggag catgggctga aggcagtgct gcaacatgga aggggcagaa 17941 catcccaacc cgctctgggc ccccagtgtc ttctgctgta caaagcaggt gttttcaccc 18001 cccgaggatg tgtcctccct tcctggctca ccaagcattt aaatgtagac agaaaattta 18061 tttatctggg gtgcggtttc ctcaggggga agttgaggtc acgacccctg aggtacctta 18121 acccaaaggc ctccagggca ctggccccag agtctccagc aggtgcttcc tccctgaatg 18181 cttccatatg gcaagatcag agccacaaat gctgccccga cactgttccg gtaaccgacg 18241 ttgcagaccc aagaaagtct gggatggtag tggccaggaa cggggtgtgc cacggcccca 18301 cctctcagag cagggaggag gatgctgggc cagcccttgg ggaccccagg caccgtggcc 18361 acaggagtca gagctctgca tgtctgcacc atgtcctggg ctcctgtcct gctcatgctg 18421 tttgtctact gcacaggtga gggaaccccc agatcccaaa gactcctgcc ccttccttca 18481 tcctgccctg cccccacggc ccacatgcat ctgtgtcacc aggttgtggt cctcagccgg 18541 tgctgcatca gccgccggcc atgtcctcgg cccttggaac cacaatccgc ctcacctgca 18601 ccctgaggaa cgaccatgac atcggtgtgt acagcgtcta ctggtaccag cagaggccgg 18661 gccaccctcc caggttcctg ctgagatatt tctcacaatc agacaagagc cagggccccc 18721 aggtcccccc tcgcttctct ggatccaaag atgtggccag gaacaggggg tatttgagca 18781 tctctgagct gcagcctgag gacgaggcta tgtattactg tgctatgggg gcccgcagct 18841 cggagaagga ggagagggag agggagtggg aggaagaaat ggaacccact gcagccagga 18901 cacgtgtccc ttgaactgaa gacagcagag gcacgcatcc ccttggagag actgtcatgg 18961 aagagggtgg agccgccgcc cgaagcgccg aggaggctga gccactcagc atctcctggt 19021 cctgcagtgt tgctgtaaat ccccattgga gactgcatta gggaattaaa gctgcttgtc 19081 actttttgct gagtttggtc tgactgttgt gcgacttcgt agtaccagcc ttgggcaaag 19141 gcccggggcc ccgggagact ggcagatcac tgtggtgagc actcacctga gtccttactt 19201 ttgtgggcca tgggaggcaa ggggttgacc gaagcaccct ggaggggcat ccatgcccag 19261 gacaggaccc tctggtgtca tctgattctg cccagcggtt acagggcaaa caagcttaga 19321 aagcacctgg tctgggcctc cagggagaac aggagccagg taataggcct agagtgggtt 19381 ctccaccttg gtcctatcga 
cgttttgggc tggacagttc ttggttgtgg ggctgtcctg 19441 cggattgttg aatggtgagt ggttatctgt ggcctctacc caccagatgc cccttcccag 19501 ttgtgacaaa caaaaatgtc tccagacatt gccaaatgtc cccaggtggt aaactcacac 19561 tgattgtgaa ccactggtct agatgagcaa gtaaatgcat ttacttattt gtacaagaga 19621 gtggcaggtg tcggcaaaca tgttcttttt ttcctttttt tagggataaa gtcttgctct 19681 gtggtccagg ctggagtgca gtggtgcaat catagctcac cgcagcttca aactcccagg 19741 ctcaagtgat tgtcctgtct cagcctcctg agcagctgag actacaggcg tgcatcacca 19801 tgcttggcta tttatttagt tttatttttg gagagatgaa tgtctcacta tgttgctcaa 19861 gctgatctca aactcctgag ctcgagtgat ccccctgctt gggcctccta aagtgctggg 19921 attagaggcg tgagccactg cacccagccc attttctata aagagccagg tggtaaatat 19981 ttttggcttt gtgggctatg cgatctctgt cacaactgtt caattctgtc ctagtagcaa 20041 gaaagcagcc acagacaaac atgtcaacaa agaggcgtgg ctgtgttctt ataaagctgt 20101 atttacagaa acagatgggg ctacactggg ctgggtatgg cgcccaggct gtagtttgct 20161 gagccctgct cgggagcact ccagctcctg tctccaggtt ttaagcaagc catgcttggt 20221 gggataatca gagtcccccc tcaacttatc ttgcctgtga ccccaggcag aagcctctgg 20281 acccatggag atacctagag gcatgaagga cgtggtctca ctgagactgt gttggactgt 20341 ctgggtgaag tagcaatggg gagctccctg cagccccggc ccacagcaag gccccctgag 20401 cccgtgggtg gtgggtctga gtgtgccttg cagagaggga ctctctacac tgggtgtcca 20461 cacaacctgt agacatgcct cactcagaga ggtgccgggc aaaccatctc ccttgtaacc 20521 ccaacttgcc tgggaacgtg cattcacccc agctaaaaag atcctgaggc atctccccag 20581 agaggtggac acatgtcagt atcagcgttc catgtgtggt catgagtccc tctcttgtta 20641 aacagtggtc tccagcactt tgggaggccg aggcgggtgg atcacctgag gtcaggagtt 20701 agagaccagc ctacccaaca tggtgaaatc tcatctctac taaaaataca aaaattagct 20761 gggtgtgttg gcacatgccc gtaatcccag ctacttggga ggctgaggca ggagaattgc 20821 ttgaacccag gaggcagagg ttgcagtgag ccgagatcat gccactgcac tccagcctgg 20881 gtgacagagc gagactccat ctcacaaaaa aaaaaaaaaa aaaaaaagag acagtgatct 20941 cagcttatgc catactctat gcactgggtt cttcctgcgt caggctttta cccctgggcc 21001 aggcccacat tttaaaacat tattatttac ttgtttatta ttttatttta gagataggat 21061 
cttgctatgc tgcccaggct ggactgctgt ggctattcac aggttcaatt catgatgctc 21121 tacagccttg aactcctggg ctccagcgat cctcctgtct cagcctgctg agtggctgaa 21181 ctacacgtgt gagtcactgc acctggccat ggcccacagt tttaaagacc agctagatgg 21241 ttccagagga actctgtctt ctgcttgtaa cccgtccctc tcttgcctcc atccttcatg 21301 attctgggat tccatgctat gagaagagat ctagaaagcc gggagctgag ttggactcct 21361 gtatcatcct catgccccca ccccaccagc tcaatggaaa atgagacctg gaactcaggc 21421 ttgaggccag actgagtgtg agtcttagct ctggaggagt cacttaaaat ctaagccaac 21481 atttccaatg ttgcatgtaa aggggataat catagaactg attaacttat ataaaaagtt 21541 tttatctggc caggcgtggt ggctcatgcc tgtaatccca gcactttggg aggccgaggc 21601 gggcagatca cctaaggtca ggagtttgag gccatcctgg gcaacatggt gaaaccccgt 21661 ctctaccaaa aatacaaaaa ttagctgggc atggtggcgg gtgcctctga tcccagctac 21721 ttgggaggct aaggcagaag aattgcttga acctgggagg cggaggttgt agtgaaccga 21781 aatcgcacca catcactcca gcctgggtga cagagcaaga ctgtctctca aaaacaaaca 21841 aacaaacaaa caaaaaggtt ttatccatca tgagtgctca acaaaataat agtatcatga 21901 tttccccatc agtcacctgc ctggccttcc ctgttgtcaa atcctttcct ttagtctgag 21961 tcatttctca ccttgaccat tatctggtct ccttcccatc catcctcctg ctgcatgcag 22021 tctataccca ccactcctca tctttgttgc tcctagatcc agccttgcat gcattcctcc 22081 tctctaaccc tttattgtct cccacactct aaagaacata tgcaagccac atctgtaatt 22141 tccaattttc tagtggctac attattttta aacagtaaaa acaaatagat gaaatgattt 22201 tttaatgata tattctattt aacataataa atccccaata ttaccaattc aacctataat 22261 caatataaaa attagttact agatagttta aacttgtttt tcttttgtat tcacacttca 22321 aaatctggta tgtattttat attcatgtct tggataattt gctcaagtgg ataatctggt 22381 cagccaagtg gtcaagagct acatatgcct tgtgaagaga ccactttggt tgaggggatg 22441 caattcagcc tccttcccat gacacagcct ttgaaggctg gctcaggctt ggtcccaagt 22501 cccatttctt gtgcaccctc tctccctgtt tctgccatgt taagccaact gcaactgatc 22561 aaacacccga gtctcttggt tgcttctaaa cctttgccca agctgtttac tgccaggact 22621 tcctgtccct ttgttgtggc ctccatctgg ctgcactgac tcggggctcc acaactgtct 22681 caaatctcac tccttgggca agacttcccc gagctcttca tgcagttatt 
aacttcctcc 22741 actgtgcaaa cagcatgttt tcctaactat atttagcttc tactgcacag ttataaattt 22801 atttagcaaa gctccctccc ctactaatct gcagattctc agtgggcagg atctactcaa 22861 cccattgatg tgcctgataa acacttggca attagacagt tgaatgaaca aatgaatgaa 22921 tgtgtaactg aatgaatttt cgggttccaa ggaccctgac attctgggat cccattaggt 22981 ggctggggtc accaccttag agaattgtga ttccctaaga agggagaacc agcccagacc 23041 ttctcgatca tattagtgac tggccacatt tcctctcctt tctgattctc tctctccccc 23101 acacccttac tgcatgacag tgatgagaaa atccacttga tcaaccaaac tacaactttc 23161 ggcctcatcc ctatccctcc cttgccttct gaccagggcc acagtcaccc tgtgggttct 23221 tccttttgag tgtctctcac tcccttcctc ctccatatct ctgagaccag cctccagccc 23281 tgctggcctg ctctgctgcc ttctggctct ttgttgaacc caagaccagg gtttatatat 23341 gttaaatata aatatatttt tccatatgaa atgtaaacat attttaaaca ttaaatataa 23401 atatacatct tagatatggc ctgtgttgga atgtgtaata gattgagtat ataatgtcta 23461 ttcaatataa aatttatatt tatatatgca gtaatgattc aggttgattg tagttaagaa 23521 aaacaagctc caaattcgag agaaatatgt aagaagagag acaggaagaa aaaataatga 23581 ggcaggtaaa tgcaacagac aattcgagac ccacaagtgc agagcaggct tcccagaccc 23641 ggataatgtc tcctgggctg ataggaagcc ctcaaccccc caagtccttc tcagccataa 23701 accgcctgag cacagagcca cagggaccgt gttggggctg ggcctctcga ctttagttcc 23761 tctcattctg tgcaaaagga aaaacaattc agaatctaca gaggtttaga tgtgtgtaga 23821 tgtggacaga gaagtcctgg cacggtggtt cactgcccaa gaagacagtg agtccctgga 23881 gggatagaag aatatacatc atgctaatat atgtcatccc agtactttgg gaggccgagg 23941 cgggtggatc acttgaggtc aagagttcga gaccagcctg gccaacatgg tgaatccccg 24001 tctctactaa aaatacagaa attagttggc tgtgggggtc gatgcctgtg atcccagata 24061 ctcgggagac agaggcagga cgaactgctt gaacctggga ggaggaggtt gcagtgagct 24121 gagatcatgc cg // LOCUS D87675 301692 bp DNA PRI 16-SEP-1997 DEFINITION Homo sapiens DNA for amyloid precursor protein, complete cds. ACCESSION D87675 NID g2429080 KEYWORDS amyloid precursor protein; APP. SOURCE Homo sapiens cell_line:2Fur DNA, clone_lib:P1. 
ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hattori,M., Tsukahara,F., Furuhata,Y., Tanahashi,H., Hirose,M., Saito,M., Tsukuni,S. and Sakaki,Y. TITLE A novel method for making nested deletions and its application for sequencing of a 300 kb region of human APP locus JOURNAL Nucleic Acids Res. 25 (9), 1802-1808 (1997) MEDLINE 97263807 REFERENCE 2 (sites) AUTHORS Hattori,M., Sakaki,Y., Tsukahara,F. and Furuhata,Y. TITLE A novel method for making nested deletions and its application for sequencing of a 300 kb of human APP gene JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 301692) AUTHORS Hattori,M. TITLE Direct Submission JOURNAL Submitted (04-SEP-1996) to the DDBJ/EMBL/GenBank databases. Masahira Hattori, Institute of Medical Science, University of Tokyo, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo Japan, 108 (Tel:03-5449-5623, Fax:03-5449-5445) FEATURES Location/Qualifiers source 1..301692 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="2Fur" /chromosome="21" /clone_lib="P1" /map="21q21.1" exon 9001..9204 /number=1 gene 9148..294603 /gene="APP" CDS join(9148..9204,64121..64288,86196..86325,122920..123032, 125075..125268,154226..154428,176087..176254, 178853..178909,193794..193927,200243..200317, 201043..201201,220515..220643,221581..221680, 264310..264531,271195..271248,278599..278699, 284404..284550,294502..294603) /gene="APP" /codon_start=1 /product="amyloid precursor protein" /db_xref="PID:d1023126" /db_xref="PID:g2429081" /translation="MLPGLALLLLAAWTARALEVPTDGNAGLLAEPQIAMFCGRLNMH MNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGR KQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSE KSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYA DGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTT TTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTE EYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQ 
KAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANER QQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHT LKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELL QKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSF GADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYE VHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDA AVTPEERHLSKMQQNGYENPTYKFFEQMQN" intron 9205..64120 /gene="APP" /number=1 exon 64121..64288 /gene="APP" /number=2 intron 64289..86195 /gene="APP" /number=2 exon 86196..86325 /gene="APP" /number=3 intron 86326..122919 /gene="APP" /number=3 exon 122920..123032 /gene="APP" /number=4 intron 123033..125074 /gene="APP" /number=4 exon 125075..125268 /gene="APP" /number=5 intron 125269..154225 /gene="APP" /number=5 exon 154226..154428 /gene="APP" /number=6 intron 154429..176086 /gene="APP" /number=6 exon 176087..176254 /gene="APP" /number=7 intron 176255..178852 /gene="APP" /number=7 exon 178853..178909 /gene="APP" /number=8 intron 178910..193793 /gene="APP" /number=8 exon 193794..193927 /gene="APP" /number=9 intron 193928..200242 /gene="APP" /number=9 exon 200243..200317 /gene="APP" /number=10 intron 200318..201042 /gene="APP" /number=10 exon 201043..201201 /gene="APP" /number=11 intron 201202..220514 /gene="APP" /number=11 exon 220515..220643 /gene="APP" /number=12 intron 220644..221580 /gene="APP" /number=12 exon 221581..221680 /gene="APP" /number=13 intron 221681..264309 /gene="APP" /number=13 exon 264310..264531 /gene="APP" /number=14 intron 264532..271194 /gene="APP" /number=14 exon 271195..271248 /gene="APP" /number=15 intron 271249..278598 /gene="APP" /number=15 exon 278599..278699 /gene="APP" /number=16 intron 278700..284403 /gene="APP" /number=16 exon 284404..284550 /gene="APP" /number=17 intron 284551..294501 /gene="APP" /number=17 exon 294502..295722 /number=18 BASE COUNT 85134 a 59131 c 61215 g 96212 t ORIGIN 1 gatcatgatt ggtaaatctg tagtgactaa ttgcccactg ctgcctattc catctgacct 61 aaattcctca ggtcttctaa cattaagacc 
tcttcctggc cggttgtgct ggctcatgcc 121 tgtaattcca acactttagg cagctgaggc aggcagatca cttgaggtca agagttcaag 181 accagcctgg ccaacatggt gaagccctgt ctgtactaaa aaatacaaaa attagccgga 241 cctggtggtg cgtgcctgta atcccagcta ctcgggaggc tgagtcagga gaatcacttg 301 aacccggggc agcgggggag gctgcagtga gtggagatca aaccaccgca ctccagccca 361 ggtgacagag caagagtcag tctcaaaaaa aaaaaaacaa aaaaaaaaac ctcttcctat 421 agctaactcc cacttaccac ccccatcatg aacactcttg atgtatttac atggtttctc 481 cttcgaacat cctcctttct tctttcttaa tggttgttat caaataccct gataaaaaac 541 aaaaacaaaa aacctcctct gaaggtccct tattcaccct tccaacgcta caggtctgta 601 actctcattt tctttttaaa aaatttttat ttttttaatt tattttattt tttttttcag 661 acggagtctt gctctgtcgc ccaggctgga gtgcagtgac acgatctcgg ctcactgcaa 721 cctccacttc ccaggttcaa gcaattctcc tgcctcagcc tcctgagttg ctgggattac 781 aggtgcctac caccacacct gacctcaagt aacccaccca cctcgacctc ccaaagtgct 841 gagaatacag gtgtgagcca tcatgcctgg ccaaaatttt taaattttaa aaaatatatt 901 ttattttttg tagagacagg gtctcatttt gagcccaaac tggtcttgaa ctcctaggct 961 caagtgatcc tcctgccttg gcctcccaaa atgctgggat tataggcaca agccaccagg 1021 cctgatcctt acttttcttc tgatgaattc acatatatgt gcacaaatac tttatactaa 1081 attgtattta ctgatgtact tttttcactg tgccttttct ttttcttgcc cagatatttt 1141 tctcatataa acattagctc cttaatggga gcaaatgaac cagttttttt ttaattccca 1201 cccaaagtga gaatataaaa attttttatt gatccaccaa tactgaacac tttcatttct 1261 aatagttata tttaactgaa taaattacac acgggacaaa aatgttattt aagggataaa 1321 gttgggtgtt tgctcaggga caacgttgta tattgaatga tttggtgctt ttgtgaattt 1381 atcattcaaa agaccatcgt gatggctaaa taacagaaag gagagcttta ttggcaatat 1441 caatttgcaa acccggaaga catagtcttc ggtgtatgct gaatgtggtc tctcttcaaa 1501 agagaggaag gacagttggg tttcatgcct cacagggtct gtttcacaca gtggagtcat 1561 acatattcag caggtttgga ggaaaagata tacatattta tgaggggagc tgagtgcatg 1621 tgcaatgggt aaatatgtat gtgacatccc atgtacactt tggggcaggg ttttagtgtt 1681 aaaatgaggt aaaatttggc tctttacatc aaaaggtgaa ctacaggacc caaagacagt 1741 ttgtgcacag cctctaataa actggctgac actggcttaa ggtctgcaat 
tgcttatcag 1801 aaaagaatgt ttgtaaggct ggtcctcatt ccaattagag ttgtagtggt ctgggttgta 1861 aatcacagga tggggctgat agttcctatt attagggagt ttagagccat agaaattgag 1921 aaattggtca tgccagccag tccccgaacc ctaaccctgt aggtaacttt gtttccttaa 1981 ccttacagtc catcttaggt gataaagggg tgtctgtttt ggtatctcac atcacaaatt 2041 gttggttggt ttgtgtgttt gtttcatcat tcaggatgtt gtttctttag ggaatgtgaa 2101 cctgaattct caaggcttgt tagactgtaa tgttcccatt cattttaggt ttagctcatg 2161 cttctctagc cacagccttc acttggattt taaaagttga attactcatc aaagtctcta 2221 ggacacgaaa gacaatcctt aggtatgatt tgaccagtaa aaaagagatc cagctgcctt 2281 gaagcataag atcccctcgg ctccaatgtc tatcactaat attcagtgtg gcaaggatcc 2341 caggccacag agctgtggct tcctgcagct gctctgggga gtgactctct tggagcatgt 2401 gatgtggtct tccattgtgc aggaccagcc cagtggcatc ctttcaacac ctctggcaag 2461 cagcctttcc aagcacgggt gccgtctgaa aacaggaggc atatctttca catcctaggc 2521 acacgcccta gggagtggtc agggttttgt ccagttctca gcaaactagc tacagctcca 2581 tcccttactc ccacactcaa gagagatact agaatacaac tgagagtagc ctgatatgat 2641 gctaacctcg agttgctttt atttaaatta aaataaatca accagacaca gtggctcatg 2701 cctataatcc ctgcatttta ggagatcaag gagggtggat cacgagctca ggagtttaag 2761 accaacctgg ataacatggc aagaccccat ctctacaaaa agtacacaaa ttagctggac 2821 atggtagtgc gcacctgtag ccccagctac tctggaggct gaggtgaagg atcacttgag 2881 cccaggaggt agaggttgca gtgagctgag attgtgccac tgctaataat taattaaata 2941 attaattaaa ttaataaatc gtgccacttt attaaataaa taaaacaaga gtaaatcact 3001 cacaaatttg gagcttttat tagcaaaaca ttacttagga aatctaaata aataacacgg 3061 ggttgacagc cattgttcta actggcagcc cctggcaagc tcaaagccag gattatgctg 3121 gtcacttaag tgacagctat tgcgaattgt tgttctctca agaaaaaaga accgatttct 3181 atggtaaacc aggcactgtg ctgggtgcct ttacaattca tcaccacacc acctaatgaa 3241 aggagcattc ttcagaaact gtagtgctca ggctttctca aggcctgagt tcttttccac 3301 cagagcatat tgttgcccta ttatccaaag ttctctaagg aagagaactg acgtaagacc 3361 cacatggctc cattacatct tctggctact tgattgattt tcatactccc tacctctggg 3421 gttggtatgt actatctatt tctttctcct ctcgttcttc ctttttattc cataaaatac 
3481 aggaatattc ctgtacatta gtccttgcag caaccttgga attactacat tcctcaaaca 3541 agttatggaa gccagctgcc aatattggtc cctggttaaa cagtgaattc tgttgttcca 3601 tagagttact actgaaatac ctaagccatt ttgtaaaata taatttagtt gatctgaagg 3661 ctgtctctaa agcagtttta tgtagtgatt acagagaagg actaatttca agagtatttt 3721 attgtttaaa aaaatgtaaa cattttatgg atgcactagt gaagtaaaga ccaataaatg 3781 aagcagtaac tttaataaaa gggtaagtaa aatgtcacat cctctgccta tattcaggtc 3841 tgttaggtat gtgtagttaa atgtaggtaa gttagttgat aattatttat ttaagcattt 3901 ctttatgtct actcattaaa aagaaaaaaa gattaaaaga atgttactat gtgaaaaact 3961 gcccatcact ggggaaaaga attttattat gcaaagcttc aacgctattt acagtttaga 4021 cttttgtagc tattgaaggc tgacattgag ataaagaagt taatcatgtc cttctgtctt 4081 ggaggaggta gaaagagatg agaatgaata caattcagga tctacttctg gtctttgatg 4141 aggagttagc acacggttct gggaggaaag acaggttaag aggcatgtga aactctcaaa 4201 tacgtcactg cgtctgccaa cgtacatgat acccagcaag ctcacatctt catggaaagc 4261 atggtaattc ccaacactac cggaagtctg gagtggctaa gtaatccata tattcaacca 4321 ggaagcagct aaagaaatat tctaattacc taggaaggtt tctgatttca aaaggacatg 4381 aataaaaagt agaaggaatc cactcccaag gacggacatc agagtagctt aaaatgtgag 4441 aataatttta ggggaatttt agaggtttgg ttatagactt atgttccccc aaaattcata 4501 tgttgaagcc ctaaccccca gtaccttaga acatgactgt atttgggtag ggcctttgaa 4561 gagctaatta aattaaggcc actggcgtgg gccctaatat aatctggctg gtattcttgt 4621 aagaggagga gattaggaca cacagaaata ccagaggtac ctgtgcagag gaaagaacgt 4681 gtgaggactt agcaagggtg cagccatctg caagccaagg agacctctga ggattccaat 4741 cctatctgca tcttgatctt agacttttct ggaactgtga gaaaataaat ttcttggttt 4801 aagccaccca gtctgtgata ttttgttatg gcagctctag taaactaata cagattttaa 4861 atgtcattaa atgtcaatgt ttaagctttg acaaaatttt ctaaaggaaa gtataaaagg 4921 tcattttctt tcttttcaga gcctgatgat tgcgggaggg gtaagccagc tgcatgggga 4981 tcatgatgca atgctgatgc aggacagaca gaaagtagat ctcttccatt tctatttttt 5041 ttttttctgt tgagttgaat gatcttcaga ctgaaaatga aagaaaggtc actggaaata 5101 aaggccaaag atgagtgaca ggattataga ataagtctta gctgttctaa agaaggacat 5161 
attatgtacc cccaccccca aattcatatg ttgaagtcct aacccgacag tgtctcaaaa 5221 tgtgaccata tttggagata gggtcaaaga tgtaattaag gttaaatgag gtcattagca 5281 tggatcctaa cccaatatct gctgtcctta taacaagagg agattagggc acagtaagac 5341 acagagggaa gaccatgtga gaatacaggg agaaggtggc catctgcaag ccaaggagag 5401 aggcctcaga agtaaccaac tcagccaaca cctcgatttc agacttccag cctcctgaaa 5461 tgtgaggaaa tacatttctg gtgtttgatc catccagtct atggtaagtt atggcacccc 5521 tgcagggttc atctggctca gacttaacga ttgcttttgg tgatatttat agggcacaga 5581 taacagccta aacacaagac gacagaaacg cggcccagca gactatgcat aaaatagaaa 5641 tggggtatct ggaccaattg gagtctgcag tgggatgcgg ttactaaaac agtcaaatgc 5701 aacatgaggc tccaggcaga gtagtgggca acatctccca tgttgcagca gtcagagcac 5761 acttcgagta ctgtaaaaag acacagacaa gccagaacac atttagagaa tggccaaggt 5821 gtggaaggaa ccagaaacca tgccattatg caactgttga aggaagtgcc tgttttacct 5881 tgtgaagaga agactctaga ggaagaagta gcatgaaaac cgctggcaaa tttgtaaaga 5941 tctgaagtgt ggaaaagaat tattctgctt ggtcactggg gatacaagga tatctgagtg 6001 ggagtttaaa ggcgggggat gtgagcttta aatgggataa gaacattcta gtaaccagaa 6061 atgcccaaag atagaatgca cagtctggag agccagtgaa tatctcacaa atggagacac 6121 ttgaaactag gatggggatg ctgttgtagg aattccagca gacaagtggt tgttggttcc 6181 ttccccaact ttgtagggtt ataactaggg atgttcctgc gttttctgct tggaggatct 6241 gcaagacacc tcagggcagg aaatggcatt aaatgcagaa cagagctagt ggctgaaaag 6301 caaaaagcca tcaggatctc tgagtagtga aggaaccaga gaacatgcag gcaatgtcca 6361 tcattctgac gcaatcagca gcataatcat cttcccccag gaacatcttg accagggaat 6421 gtgtcagtgt ggtgaatttc aacagtggaa agagaaactg ctaaatctaa gaactttaat 6481 ttttatagat tatgatctca tctctacaat tttgaatttc atgctcaata aaagttcctt 6541 actctctttt tttttttttg agacggagtc tcgctctgtc gcccaggctg gagtgcagtg 6601 gcgcgatctc ggctcacttc aagctcagcc tcccgggttc acgccattct cctgcctcag 6661 cctccccagt agctgggact acaggcgccc gccacgacgc ccggctaatt ttttgtattt 6721 ttagtagaga cggggtttca ccgtgttagc caggatggtg ttgatctcct gacctcgtga 6781 tccgcccgcc tcagcctccc aaagaaaagt ccctcactct taaagttgcc tcctccttcc 6841 cagggctggc 
ttcatgggca tgcaaccctg gagagtctca caggccctgc ggtgggagga 6901 gccccatgct tggtttaacg ctctgccatt gccatcttaa aattcttaat ttaatttttt 6961 ttcttttttt tgaggtggag tctcgctctg tcgcccaggc tggagtgcaa tggcacaatc 7021 ttggctcact gcaacctccg cctcccaggt tcaagcgatt ctcctgcctc agcctctgga 7081 gtagctggga ttacaggcag gagtaaccac gctcggctaa tttttgcatt tttagtagag 7141 atgggggttt caccatgttg gccaggctgg tctagaactc ctgacctcag gtgatctgcc 7201 cacctgggcc tcctaaagtg ctgggattac aggcatgagc caccaggccc ggccttaaaa 7261 ttcttaataa tgtaacaaag ggtctcacgt ttgcattttg cagtggactc tgcaagattt 7321 gtagctttgg accacgtttc tctttgcatt cagatacctt cttttttgcc ttatttgctc 7381 atgcagaccc ggaacaaata cggaattgcg gtgggtaaat gtggtgcaga aagtgaacaa 7441 ctgggtttgt cctgtcactt taggcttttc cctgctgtcc cagcttcatg tcacttactt 7501 gctattagat ttgggagttc attagcttca ttttcctgat gtataaatag gaataatagt 7561 aacagcctct ttggcttttg taggaagtaa atgacatgaa gcgtataaac aaatactgca 7621 tgacaataaa tatttgtcct tatttgttga ggacatccaa aggacattca ggggcaaaag 7681 taatccaaga gtcaagactg aatgcctagt gcgggaaaag acacacaaga caacatttag 7741 gggagctggt acagaaatga cttcccagga aggaagtctg taccccgctg gctgagccat 7801 ccttcccggg cctaggcacc cttgtcagcg caatgagcaa gggagagaag gcaggctgca 7861 gtgcagccct cagaagggcc agagcactcc ctggcttcag tccttcgctc caagccctgt 7921 gtggagtggg ctgtggcttg gtaactaaat gctacttcag gtcaagagca ggggatatat 7981 ctgggcagtt ctagagcatt ctaaactatc tggacactaa ctggacagtg gacggtttgt 8041 gtttaatcca ggagaaagtg gcatggcaga aggttcattt ctataattca ggacagacac 8101 aatgaagaac aagggcagcg tttgaggtca gaagtcctca tttacggggg tcgaatacga 8161 atgatctctc ctaatttttc cttcttcccc aactcagatg gatgttacat ccctgcttaa 8221 caacaaaaaa agaccccccg ccccgcaaaa tccacactga ccaccccctt taacaaaaca 8281 aaaccaaaaa caaacaaaaa tataagaaag aaacaaaacc caagcccaga accctgcttt 8341 caagaagaag taaatgggtt ggccgcttct ttgccaggtc ctgcgccttg ctcctttggt 8401 tcgttctaaa gatagaaatt ccaggttgct cgtgcctgct tttgacgttg ggggttaaaa 8461 aatgaggttt tgctgtctca acaagcaaag aaaatcctat ttcctttaag cttcactcgt 8521 tctcattctc ttccagaaac 
gcctgcccca cctctccaaa ccgagagaaa aaacgaaatg 8581 cggataaaaa cgcaccctag cagcagtcct ttatacgaca cccccgggag gcctgcgggg 8641 tcggatgatt caagctcacg gggacgagca ggagcgctct cgacttttct agagcctcag 8701 cgtcctagga ctcacctttc cctgatcctg caccgtccct ctcctggccc cagactctcc 8761 ctcccactgt tcacgaagcc caggtgggcc gtcggccggg gagcggaggg ggcgcgtggg 8821 gtgcaggcgg cgccaagggc gcgtgcacct gtgggcgcgg ggcgcgaggg cccctcccgg 8881 cgcgagcggg cgcagttccc cggcggcgcc gctaggggtc tctctcgggt gccgagcggg 8941 gtgggccgga tcagctgact cgcctggctc tgagccccgc cgccgcgctc gggctccgtc 9001 agtttcctcg gcagcggtag gcgagagcac gcggaggagc gtgcgcgggg gccccgggag 9061 acggcggcgg tggcggcgcg ggcagagcaa ggacgcggcg gatcccactc gcacagcagc 9121 gcactcggtg ccccgcgcag ggtcgcgatg ctgcccggtt tggcactgct cctgctggcc 9181 gcctggacgg ctcgggcgct ggaggtgggt gccgcgcctc ggaaggcggg gggaggctgc 9241 acggtgggga cgcgataccc cccaagacct taacccaagt ctttaatgca gagaagccgg 9301 gggtccgtca atgggacccc tctcctctcc gcccccgctt gcggacgtcc agcgcatccc 9361 cgctttcggc ccagccctgc cccagggagt cgcgctccgg cccgctgaga gggagcgggc 9421 gaggcgctgg tctccctggt tccgcgccag cccggggcga gaagggtagg gggcgaccct 9481 gagcccagac cccgacttag tccctgcctt ggaagcgggg gtcgggggag gcgagagaca 9541 ttcagacagg ggggaagggg gaaggagcac gtggggaaaa ccgaaaacgc agcgtcccta 9601 aagccagtcc ttcgctcttg aatcgttgcc ctttcctttc tcttgggccc tggggaggag 9661 gaggaggagt cgggacagcc caggaagcta ggagcccctc gcttttccct gcgttctccg 9721 cctcgtttca ccttcctctc tcacccccat cccccttcct tgctccaacc tcttttcgcg 9781 gctgctggcg gcggcagcct gggtgcgacc ctagcccgag ccggcgtcgg ggtcgcgcac 9841 cgccttccac aggcaaactt tgtgcatgtc cgcgtctccc ttttgtgtaa cttgcgagaa 9901 atgggagggg tcggagaccc atagccaccc cgcgccttcc ccaagtctgg acgtcccggg 9961 agttcgctcg caccctaagc tctgtctgag aggcagaagg tccgcgggaa caaaagccgc 10021 gacaccccgc cctcgccgtg ggccgagccg cgttagtctc agtgcgccac cgaggctccc 10081 gcccggctcc cttcccgggg ctgcgcgtcc gcctgggagg gggcagagac gctgccgggg 10141 cttctggcgg agcctcgggg cccctctgct tcccaccctc ggatcacctc cgaggcttaa 10201 ttctgtgctg ggctgtaaag 
tccttggcgg ggaggaaata aggcggagag tgggatgtgc 10261 gtctggaaag ggcagtggcg gatggtgggg aggcacgggc tctgtcctgt gcatccaagg 10321 aggcgtttgt ctgattaagg cgcggcctct tccctggcag ctctggggac tctggtttag 10381 ttcccctggg ggcacaggat gctggggagg gtccgaaggg tctttttttt agggtgcaga 10441 taaaaggatc gaattgagtg aagattaaga cggagaagat ggcgcctctg cagtgcagca 10501 aagaaaagct gtgtggaggc tgcagcctag tgaaatccac ccaccactag gtgtataaca 10561 agtcttcccc attcatccaa ctctcgaaca ttcagctaac gcctgtgcac aatgactgtt 10621 tttctttaca tctgaagatg cttagaagtt gtttgcatga cagttgggcc agaggaggct 10681 gtatttcact tcggacttcg gttgggttcc tgtttggcat cttaacattt gcaaccttcc 10741 ttacccgttt ttcccctctt ttcccccagc ctcctagtaa ttttccaaaa gaaaaaaagc 10801 aatttccggg cagatgtttt cctgtaattc cactgaggct gtacagacta tcacaccatt 10861 agttttttag tttctaattt gcaaggtcat tgaacttttt tctcttctga tttttaccca 10921 caagtggctc aggaatcacc ttagggctag ttaagacagg ttgttttaag agttaaagaa 10981 tttcgttttc catgttttaa aaaaatgcac ttaactaaga gaattctgat ttctctcatg 11041 acttacttac atttatttgt agaattattc cagttttccc gaatggtctt actacagatt 11101 aaaaaccata catttaagga attaaaagca tatgataacc gtatttttga agcttttgtc 11161 aaaacttaaa tatgtaactt tgttgagcaa tatttctttc tgtataaaaa gaggaagcat 11221 gtaggcagat tcgaatttag tttttggtct gtttaattgc attaatatgc tctttctgcc 11281 atcttttggg accttttcag tagctattgt taactccttt aaagggtgga ggtgagattc 11341 acattatgcc attttgcgaa gtgttggtag aagggtaggt aattcagtct actgaggttt 11401 gagggtggtt tatctggttc tctctgtgtc cactgaaaag atgtaattac gagttgactg 11461 tcttttatag agcttagaaa ataggtctgc tgcaggtggc tgtagaaaag aaatgaataa 11521 gggcaatgat aatggggcct tggctatcgg gggaaacatg gttattatct gggttgatga 11581 ttttggctgg tatgttgata gaggtgtgag tttaacaaac cctaatctat atgatgtttc 11641 actaccctaa tatttattta tttatttatt tttaaatcta gcttcctctt aagttttatt 11701 tccctctgtt gagggtttaa cctagtcaca atcatcttac ctttgtggta gtgctaattt 11761 atagtttatg gaaacctgct gctactgact agtacctcgc taaaggtaga agccgacaca 11821 cagaatagta aagctcaggt ttgatggtga tctgtaagga gtcacggttc ttgctaatcg 11881 
atgagaaacc atgataactg gcagactgca gaagcttttc tcttgtcacc ggaaccagca 11941 gaggctcctg aacgacattt gcaaggctat tagtactgcg gagaagttac tagatggaat 12001 aatactggtc ctgactgagg aaatgaaaag gaacatgcaa gttcacgagg ttaaatgagg 12061 aatggctgat tcaaaaaaat taaaaagaaa taacgtgatc aacaatattg gtttgaagga 12121 atgggagagg ttgtctttat gttaagtctt gttgctagag gatctctatt ttatgttact 12181 gttttttgca tactagagtc ttctacagat atttgtatga atgttgtcag tcattctgga 12241 aataagcata gctattactt gcatttcacg tggccactaa actatcagtg agaagtgcat 12301 taaattcagt agaaatgtac attgtttctg caaattgtat taccttgaga atcttacaaa 12361 gcatctttca ttaatacatt tcttttaaaa tctctatcaa ttacaaatgt aaagataaac 12421 ctgtacagat gcattttaaa tttaaagctt ccagaatgta aggtttagtt cgactgcata 12481 cctcattacc tccccttccc tcctccctcc ccttactccc aggaagtttg cagggtaaat 12541 aaaacatgca catgataata aaacaaagtg gaatgtacta aaataccaaa aggaaaatac 12601 agattaggta tactgagtaa atatgttaca gagaaaactt cctgtattta aaaagtaagg 12661 aacaatggga tgagatcata gataatcctg gagcatttcc tggtagtaaa tttccagcca 12721 ggtttatatc ccaggtctgg ggattggagg ttaagtttct gcatatacaa tatttgttta 12781 cattcctctc tgcctctacc agacagacag acagacacac acacacacac acacacacac 12841 cctctgtctc tttctctctc tttctctctc tctctctctc tctctctctc tctctcactt 12901 gagtgacaga tgctggtgca gggtattctc aacaggagta gggaatttga aataaggagt 12961 tgagaggtca aaggtaaggg gctgaatgca ggaccagaga gcttcagtag catcttgatg 13021 ggcagagtat gagatgaaca gtggcaacga ggaggatagg gaaaaatcac attttctgtt 13081 tgtgttttgt ttttgtctgg tggttttcaa cctctctctg tttcacttga ctatgcttga 13141 ttatgaaatc tttttttttt ctttttttaa tttaaagatc tcaactggct ttgttgtgat 13201 tctacattca ggcaacacta ttctgtaaaa tagtataaat gttccgatga gttgaacaga 13261 ggagattggc tttgtagaca gaaaagggct gaagaaaaca gaaataaaaa aatcttttcc 13321 ttttcatagt gagagctcag atgcaaatgt atggagtggg gatggatggg aaatgccgtc 13381 ttttgcaaac ccttggaaac gctgtcaccc tgtgggtctc acagccagtg tacagataac 13441 accctcatct gtacacagaa gacagaaggt ggcaggggaa ggaatgagca caccccagaa 13501 tgagtagcaa gtttaatcaa taaaatatgt atttcttttt ttattgttgt 
ataagaagaa 13561 aagcagacag aactcaaacc ctgaagctga ttttgcaagt gaaaagctta gtaaaggggg 13621 aaatgttatg taactatttt ttcagtttag tgtggatggc ttaagaaata atcattactg 13681 tgaggacact agtatttttc ttggaaaaaa aagtttggca aattttcctc tttagattaa 13741 tcttggttat agattattat tatgcttgag tggtacctct tcagaagtag aaatctggca 13801 aaaactggag tacaatgatt gctctaggaa taaattcaca tatttaaatt taaaagcctg 13861 gatatgtgca tcaattaata gctttatgaa acagggatgc ttagtagatt agctagctga 13921 gtattcagag agggccacag aagtctcagg ttcacatgta ttaaatctgc tgaaatagaa 13981 acctaagaga tcattttatt taggtttatt ttgtgtggct acatgttgaa gaaacatgca 14041 atagagattt agcttgaacc ttgcgagact cacagtatct tacgaaagaa gatagttaaa 14101 atggggaaga tgagaggttt ttaaaaatgt cacctctgta gaacgaacca gagttgttct 14161 caatttataa tgttgaattt gatgccaggc gtgatgttag gccaataaca ctgagtcatt 14221 gagtgtttct caggtatgta gagaaggcag cattgcacac taatcaggag agtttagacc 14281 agggcaggaa tgcatgggca taaattctgt ctttgtttct tttgagctag tgaccatggg 14341 caagtttctg tgcctcagtt ttttcctttg taaaatggga tgagtacttc tagttattga 14401 ggatcgaatt aatatctttg ttaaatgctt gaaacagtgt gtgagttaat atatatgtaa 14461 cctgatggca tagcataagc actgtgtgtt agttgctgct atgtatggtg atgatgttga 14521 tgtacaatgg gtatatgtaa gtttcctgca tagttgaaag ttaaaaatct agatgatgtt 14581 tcacttattg acaaattact tattggccaa ttaagactat tgcaggaaat tgacagattg 14641 ttgtctatga gtttgatcta aatttaaaaa gcacatggta tctcagatac aatttttaga 14701 atacatgtta taaaatttca tttattaaat atgtattgtg tatttcaatt ggttttttga 14761 aaaatgaaaa taattcatgc tcattgaaaa atttaaaaca gtacagaaat gtgcaaaaag 14821 cataagcatt ttttacacaa atgggaaata cctagttttt ttttttcttt ttttccacat 14881 aacatggatg tagacctttt tttttttttt ttgagacaga atctccctct gtcacccagg 14941 ctggagtgca gtggcgcgat ctcggctcac tgcaacctcc gcctcctggg ttcaagcgat 15001 tctcctgcct cagcctcctg agtagctggg actatgggca ctcgccacca tgcccagcta 15061 atttttgtat tttttaagta gagatggggt ttcaccatgt tggccaggct ggtctcaaac 15121 tcctgacctc aagtgatctg cccacctcgg cctcccagag acctattttt taatgttgta 15181 cacactggca gcttgccaca gatgtctaat 
attgagcaag tttatacact ggcttgatta 15241 ttaactatta gttggctatc agctatgtgc aaggaactgt gaacagactt ggtttagttt 15301 tcagtagaat ctttgctagc aaaacgaata aacaggcttt gccaatctgg gtagaaatgg 15361 aagatttcaa atgccagcaa aagaattttg gtcctaatat ggaagcttga agttatgttc 15421 acccaaggac agggaaacca aaaagcagat tgcttgagaa tgcggcattt ctctaggaaa 15481 tcagtttcaa aataaaatga attttttgtt ttgttttgtt tttgagacgg agtctcgccc 15541 tgtcgcccaa gctggaatgc agtggtgcga tctcagctca ctgcaatctc cgcctctcag 15601 gttcaagcga ttctcctgcc tcagcctccc gagtagctgg gattacaggc gtgtgccact 15661 atgcccagct aatttttata tttttagtag agacggggtt tcactatgtt ggtcaggctg 15721 gtctcaaact cttgacctcg tgatccgcct gcctcggcct cccaaagtgc tgcgattaca 15781 ggcgtgagcc accgcgccca gctgcagtaa agtgaatttg tgtaaccagc catgttatgg 15841 ctgttacatc tttaaaactt tatagtgatc tgctgtttaa ttgaaattat atattttttt 15901 taaaaatcag catatgttgc tcaaaaataa aaggttttta gcttcaaaag caggccaagg 15961 atatttgaaa gccttgagtg tatgaaactg gtgatactgt ggttaaatga tgatgaacgc 16021 agggaaaaat caagaagcag agtcccaaag aattcacatg ggtttttgtt aaccagctca 16081 tgtaattata attgtcctgt ccaactaaag acaagaagac aagaagtggg ttcacatacg 16141 aggatgtgta atgacagaag ccatatttgt gataaacatc ctgattacag tagatacagt 16201 gaggagtgat taagatggtt ccagaagcag gagtggaggg ttcatgagca gggatgagga 16261 ataaatagga caaacatttt ggaaaaggga aaagatgaat aagcctgttt tgtaaattaa 16321 tggatcctga tgaatgaagt gcagtgattc cttaaggtct tattaatgat ggaccccaaa 16381 cagtgtttga atctcttaat ttccagatgg ttcctttttt tcttcacaaa gtcagaaggt 16441 tttcttatga gaaatttgat tcaacaagaa taattgtcac ttcttttgtt ttttggtgat 16501 tttttttttt tttttttttt ttttttttga gactgagttt tactgtgtca cccaggctgg 16561 agtgcagtgg cgtgtcttgg cttactgcaa cctctgcctc ccgggttcaa gcaattctca 16621 tgcctcagcc tcctgaatag ctgggtttac aggcatgtgc caccacgcct ggctaatttt 16681 tgtttttgtt tttttttttt ttagtagaga cagggtttca ctacattggc cagggtggtc 16741 tcgaactcct gacttcaagt gatccaccca cctaggcctc ccaaagtgct gggataacag 16801 gtgtgagcca ctgcacctgg ccaatttttt ggtgattttg ataggaattt ttttttttgc 16861 ctgatgcttt 
ttttgagaga atctaataat ttgtctctta tttaataatt tggctttcat 16921 tagcttccag aaagccacat actgtcttca ttgaaaaatg taaggcatcc tagatgatac 16981 agactgataa attacttttc agccataaaa gatacttatg agaataagag atctgttgca 17041 taaagaaaag tcatttgtgt taagtgaaga aatgcattat tactcttgct gctttgctaa 17101 atttcaaaca ttgaataact aatatatttt gaccccctgg agatatttta caggaagaaa 17161 aatgagaggt ttaatcctac tgttgagata gacttaacgt gttttagacc tgcccctcat 17221 cccaagttac tacaaactca tttgtttagt ttttccttgt ctgtgtgctc gttatttcac 17281 agtaaaatta gtgaggcaac ataaatttta tttttaagag aaagtaaaga aaatggcttt 17341 atccattaaa ggtttattca tcttaaaaaa ggacttgtta tatctactcc tactattgga 17401 tgaattttca aaagcatatt gatactgtgt ttggttttgt ttgttttcgg gacaagtcct 17461 cactctgtca cccaggctga agtgcagtgg tgtgatctgt agccgtggcc tcctgggctg 17521 aagccatctt cctgccccag cctccagagt agctgggact acaggcatgt gccaccacat 17581 ctgactaatt tcttaaatgt tttgtagaga tggggtcccc actatgttgc ccaggctgat 17641 cttgaactcc tgggctcaag cgatcctcct gcctcgacct tccaaagtgc tagattacag 17701 gtgtgagcta ttgtgcctgg ccaatgttgt ttttataaat atcattactc agttattttg 17761 tttataattg aattgatagg ctcatagaat aaagatggat aagcactgtt tcttaactgt 17821 atttattgat ttttttatct ttcaaaaaaa aaatcctcat ctacttcaaa gattaagcct 17881 ttaaaaccga tgcaaagttt ttgaagtctt agcttaagtc ttcagagact aacattaaaa 17941 cccatcatat gaaaagtgat atgtagtagc aaatattctc catcctgatg ctctcaattg 18001 cttattatgg tttaacacaa atataatttt gtcattacag taaattagtt tgaggatata 18061 actgaagagt acctaagaaa ctcccataga aatgtgaaaa tatgcagact aattattcta 18121 agaataatta atcatagata cttacatatt aaattgatta gcaatttaaa attaaagcca 18181 aatttctgtg gaaattagct tattttcata cacaaagaaa cctagattct taggggtaca 18241 accaagaaag ggccactcct tcatttcaca aatattttcg aggtccttgc ccccaagagg 18301 gtagacagaa taaaaaggag gaaggaagtg acttggaaaa aattaagttg ggtaaggaat 18361 ataagagaat gctacgtaga aggatgtgtg ttgtgaatgc aatagtttct gtagtattgt 18421 caaagttctc tgatcatttc agtatgtgaa ggtaaactta gataactggg ggaaggaact 18481 atatatttac tacattgtaa aagaataatt gtagacagaa gatctacttt gggtgattaa 
18541 atgttgaaaa caagtggtgg gagagagaag ttagtgttga ttgtgatcac acacttttct 18601 atacagtaat ttgtttgatc actcatgata atgccttaca ttttgtatgt actaaataaa 18661 tgagttctct agaaattcta tgttgtggtg ataaagaatt acgttcagat tgaagattcg 18721 aacagctgtg caaccttctc aacatttaag aaactgcgaa caagcataga aagaagatta 18781 aagcctttta aataagattc ttttttttca gcctcttttt ggaacatttt attgtgaaaa 18841 ggtttcataa atactgtcaa tacatcacgt tcagtcattt gcaattgtat ttcaaaaatt 18901 cagttggcct gcaaaaagaa aaaatgcctc atttttgaaa tatttcatcc atgccctgat 18961 tgattgggac actttcaata gagggtgcct agacataatt ttgtgtgaag aagttagtag 19021 agtttggaaa gaaaatgtgt attggaatta aatttagtgg catgaaaatg aaaatttctc 19081 tgtcattgtc tttttttttg gctccttttt tggccatttt acagctagtg tttagtgtga 19141 caaggatcaa aattggccct ttaaaaaaat atattgaatt ctattacctc tgttatctaa 19201 aagaattgta tttttcaatt tgaagagctt gaagccaata ctactatata ccattaactt 19261 gacaggaatt gaaaatctaa ttgaggtcat tatgattagc cttgtatctt gctagtattt 19321 tctcaatctt attttgaatg ttttgtgtca agtgttaaat cctcttatac cacagaatcc 19381 agaaaattta aatgcaacag aagtaaaacg aatataaaaa ttctatggtt gttgtggagg 19441 ggcgttcttg gctaacccat agggtgtttg aatgtaaagg cagcttacag aactgctgct 19501 agttgaattg tggagggttt cacttttaag agtacatttg agacaaaatt gcaattttgg 19561 aatcacttat tcctgcttgt tcagtgaaat aggagagggg ggaattcctg agggaagctg 19621 ggagatttcc aggtgatgat agagtagcca tgtgagtatt tggctgaaac agagtagaag 19681 ccacctctgt ctgttggaca gaggtggcct taacactgag tatcaatcag catggccatt 19741 agagctgtcc aggacaaaca taatgaatgt tgtggtggca gagagggtat acttaaatga 19801 acaaaattgg aggctgactt tccttgtttc ttggtagttc ctttcttcgt atttgctatt 19861 tggatatttc acttcacttt tgtgagctag actcaaaggt taaatttata tcattgtgaa 19921 ttttctgggt catattgtag ggacacagat gggaagcatc ccttcagttt attgagagta 19981 tttcttatgc gcatgtgttt ttaattgtat ttattgattt aataaatagc cgatgagatg 20041 agaaaaaaat attagtgtct tgttgttgca gagaaaaaga cattggatgc attcttagaa 20101 tttattatca catgcgaagt tatttagatt tacaaagcaa tctgacatgt tgacttcatt 20161 gctcgtttgc tcatttctat cagaggttga acacaaagcc 
aagcctaagc ccgtgattct 20221 tctgtgtgca gatgaggggg aaggtttaat gcagaatgtt tagctgaata tttggattgt 20281 gcattcccct tttatgtctc tactgcaagt tttctcaggg catttgtgat tgacttctgt 20341 gatgttgaca tttatttgga gttagatgtt taaaaaatta ctaatttatg gcctggtgcg 20401 gtggcttaca cctgtaatcc cagcactttg ggaggccaag gcgggtggat cacgaggtca 20461 ggagatccag tccatcctgg ctaacatggt gaaaccctgt ctccactaaa aatacaacaa 20521 attaaccggg cgtggtgaca ggcacctgta gtcccagcta ctcgggaggc tgaggcagga 20581 gaatggtgtg aacctgggaa ggcagagctt gcagtgagcc atcgtgccac tgcactccag 20641 cctaggcaac agagcgagac tccgtctcaa aaaaaaaaaa aatttactat cttattgttt 20701 ctctcatttt gaatttgtga aattattatt cattttatgg acatactgca tggattctgt 20761 gtttcagtta tgacaggaga tctagaatct gggtggggct gtgagtaaat agaaaggtaa 20821 ctcagtctgt gccttgagga attccgcagt ccagtggagg aagaaagaga cggagaaata 20881 attaggtaca gtgtgccaaa ggctataata gaggaggtac aaactcttcc cagagaaggc 20941 aggcagaatt tgaactaggt ttcaaggatg tgtgaaggtg aactggggtg gggagggctt 21001 cccagacaga aaaaacagcg agggtgtaga gtttcaaagg gggaagccca ggggtataat 21061 aagtgaagga gacagtgtga aattataaat tttggatgcc cttaatgagt ttgaacttca 21121 ttttcttaga aatggagtac ttttaaatgt tcctaaaatg gggacgaggc attgatattc 21181 atggttcagt taggtttagg agcttttgta gaagtccttt tcaaggtctg cttaatctca 21241 tgggccagat agtgacacct cgatgcaagg gagcatgtaa atgtgctcta ttagctggga 21301 gcattgctat cagaaattac atttccggct ctcttcttaa gggtaaagga gaggtaggat 21361 attgagtggc aactggcatt ctgggatgta gaagaaatac agtaccataa agggatagag 21421 gaaaatgggt agacttaacg ttctgaacca aagctaagat tctccgtgag cgattagagt 21481 aggcttttgg gatttgctta ccttctctta tctgctagcc agccacacag gaagcacatc 21541 tttgtggagt gacagaggag gaggtaggaa atgctttctc atttttatgg tagaaacatc 21601 caagagctga gccttcattc tggctgagag aactctagat ggttcagact tgcattcctg 21661 ggaaggcctt ccagctagag tgatggaaca aagcaatgat ttctgtggcc ctgtgtggac 21721 agcagtggca gtcagtttac tctccctgtt ggttaactgt ctttcgtttg atctagagac 21781 ttcctcaaaa caattttgtt gtgcttgttc tttgaaagca acagcagaca aaacaagctt 21841 tgaggttttt agaaaagagc 
ttgacactga gcacctcttt tgagctagat ctgtctgtca 21901 tgctgaacct gttttctgtg gctgcgtagc ttttcaaatc tgtttatgct tgctggtgtt 21961 attgaaacag aatgggaaga tctatgcaaa aaggaacatt gatcttaata aatttatgtc 22021 agtaagaatg ggactgccat tttaaaattt attttaattc cttacctatt ttcttgttca 22081 attattctaa aagtgattaa atgaagatac gtcacagaaa gtatgctttg aaatcaagat 22141 gtggggaaaa cctgtagttc tgagtaggcc tgatagaatc atcattgtta gagtatttgg 22201 aggttttgta gtagggattt tgtcttcact taaatattta gttgaaatat gttccaatag 22261 tttgacacat tacagaatct gtctgaaaga aatgtctaaa atatgattcc tcagaagctg 22321 ctcttggttt tactttctgc tgttggtttt actttctaac atttcatttt gatcccgcag 22381 ttttaagagt aatcaaaaag gccgggcacg gtggctcaca cctgtaatcc cagcacttcg 22441 ggagactgag gtgggtggat cacttgaggt caggagttca aggccagcct ggccaacatg 22501 gtgaaacccc gtctctacaa aaaatacaaa aatcagccga gcgtggtggt gcatgcctgt 22561 agtctcagtt actcgggagg ctgagacagg agaatccctt gaacccagga ggcggaggtt 22621 gcagtgagcc gagatcatac cactgcattc cagcctgggc aacagagcca gactctgtct 22681 caaaaaaaaa gagtaatcaa aaagctacat tatttaaatc ttggcatgtc tttaaaaaat 22741 gttaaactgt caggaaaatt atcacttctg gataccacct tcagaaaaac acatgaaaat 22801 ttgtgcatgt ctcaaatagt tgttaattta tgaaaaatta gaaagagtgt ttgaatgtat 22861 atttctgtca tattatattg ctcatatttt taaattatac tttttttacc cgtgaagccc 22921 atatagttaa gtttgaaaat tttaattata ttagaatttg aaaattaaat aaatctcctt 22981 taattcatgc ataattttct aatagtcata tagtttgtat tatataattt gagaaacttg 23041 gttctaaaaa aattgtctgc caaagaatgt atatttctga attctgttct gctttcatac 23101 atctgaaaat atttgagcta tcataactgg ctatgcttca aacccagtct gaaatgggtg 23161 acctttgctg gctttggatt gcagagctgg gggctgttgt ttgctttatc tgatttaaaa 23221 tagtcagtag tttattttgc aggcagttag atgtatacct ttgcataaaa tcattgaact 23281 gaaaaggttg gcactccaag aatacgttgt gagaaaagag aacttgagta ctcactctcg 23341 actcacctga atcttattaa tattttattg attaatatct gttgtatttt taagtaaata 23401 agctctctag aaactatctc tgcttctttg ttcatttaac atatattaaa caaattgata 23461 cagattgcaa aaattagttg caacgttcct agttagatat caatgaaata aaaatggtct 23521 
attcatacga gttcatttta gttattccta ctaatatcaa atgtgtttta aaaaatggga 23581 gattataatc tttttatgaa gtaagataaa ttattacttt gtatatttaa ttcaaagaaa 23641 acaaaaaggc cacaagtgcc tgcataaagg cacatccact taggattatt gttaaggcct 23701 acagttgata attaggattc aggcatagca taaatgaagt ggtatctttt tttttttttt 23761 tttttttgag acgaagtctc gctctgttgt ccaggctgga gtgcagtggc acggtctcgg 23821 ctcactgcaa gctctgcctc cctcctgggt tcacgccatt ctcctccctc agcctcccga 23881 gtagctggga ctacacgttc cgccaccact cccagctacg tttttttgta tttttggtag 23941 agacagggtt tcaccatgtt agccaggatg gtcttgatct cctgacctca tgatccaccc 24001 gtctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccca gccagatatc 24061 aattattttt agacccaatg atatttgctg ttgatttatt gttggtgcac aaaataagaa 24121 aacaaaagca tggttttaaa acagcccttt acacatttaa tatggaaatg gtaacctaca 24181 ggtctgtatt ccatacagat tcctgggctc attttatctg tgaattcctc tgggcgtcat 24241 gcatgcccag gactgacttt aaagcagaat tatccagtag aggtataata ggaccgttag 24301 atctaattta aagttttcaa gtagccacat aaaaaaataa gcaggcgaaa ttaattttag 24361 tctgtcacta aattcaaaat gttatttcaa cagggggtca atataaaagc tatttatgaa 24421 atacattatt ttttcatatt atgtcttcaa aatgttcaca tttcacatat ttatttttta 24481 ttttattatt actatttttt tgagacggag tctcactcag tcggttgccc aggctagagt 24541 gcaatctagc ttgctcattg caatcttggc tcattgcaac ctccacctcc tgggttcaag 24601 cagttttcct gcctcagcct cctgagtagc tgggattaca ggcgtgtgcc atcacaccca 24661 gctaattttt ttatttttgg tagggatgag atttcaccat gttggccagg ctggtcttga 24721 actcctgacc tcaagtgact tgccgctttg gcctcccaaa gtgccgggat tacagttgtg 24781 agccaccgca cctgattaca tatttcacat attttaattc agaccagctg catgtcaagt 24841 acttagtagc cacagatggc tagtggttac tgtattggac agtgcagctc taggagagtc 24901 tcttcaacct gcagctgtct aggaagagaa tcatagtttc tggaacaggt agaatctata 24961 agacctcctt tttaagcttc tccactttac agatgaagaa agtaatatct ggagatgttg 25021 agttgttcaa aatcacccac attttaagga tcaagagccc agtccctgtt ctctaacttc 25081 taatctcttt cttcatcact ttttctgctt catgttgctg catccccttt agttactaag 25141 gtactaacgt taggttgttg ctatcagtta aatattaagc caggagcttt 
aagacaatct 25201 ggcccacttg aaggcatggc ctgatttgga caaaacttgt ggggtaggat gaatagagtt 25261 accaggcata tcttcaacct ctctgtggac tagtattata catatatgag atgatgtagg 25321 gtccagtgat cagttcatct ttttctattg tgagcactgt tggaatagga atggggggac 25381 cttttgggac ccaggatatg tgattctgaa ccctgaaatg tcagatgccc actaaaagga 25441 aggggagaga tggacctgtc attaccttac acattttatt gtctgtatgg gtatgtaaac 25501 atgtacagat tataaggaat tgcacttcat ttctgggctt ctggaaacac aggtattaag 25561 atttcattta cctccccttg ttttggaagg cttattgtgt gtgtgcccat agttcttacc 25621 cagttacact taaagtatga tactttttga cagcttaagc ctattgcatt cctggcatgt 25681 aaaatagtta ttttcagaat gtaagcaaat gtggaaagct tactgcctga catggagggt 25741 tggagaagtg ggattggtgt tcttttgttg taaactcatt tttgtgaaat atttgcttta 25801 caacgtgaag catgcatgat ggatgaactt ttagaattga aagttatttt tcatttcatt 25861 cctgaaatat ttaccagcat ggatgttgac tgagtgcttt gaagagtttg cttagtcata 25921 taacttaggt tttatgattt tagatttaat taaacttaaa gttttcctgt ttgatcctca 25981 ggacttctca tctgtggcaa ggctttactc acaaaggaaa tatcaacctt gaagttaagc 26041 atgccctgtg tttctttata ttctgatgaa aacgattgaa gtttgccagt gctgtgtctg 26101 ggacaatgct tagaggattc atggcttgta ggaatgagtt aaaagtacaa ggatattggc 26161 tggcatggtg actgtcgcct gtagtcccaa aactttgaga gtctgaggtc agaggattgc 26221 ttgagctcag gagtttgaga caagcctggg cagcataagg agatcccatc tctatgtttt 26281 ttaaaaaagt gaaaaacaac aacaacaaca aggtacaggg gtatctgcgt ttccttaggc 26341 agctgtctta tataccttta accttaaaaa aatccaccat ttttaaaaca gtgttttctg 26401 ttgtttttct tagtaaatat aaacaatagg aactgaagtt tagggcttaa aatttttgag 26461 agaataattt ttatttcagt agcttttggg gtacaagtgg tttcttgtta cttagttgaa 26521 ttacgtagtg gtgaattctg agattttagt gcacttaagt agggtacctt gtacctaatg 26581 tctagttttt cttttaatcc ctagtcccct attaccctcc tcctcgtgag tctctaaagt 26641 ccaatatgtc cctctgtgtg cctttgctta ttcctaactt agttcccact tataagagag 26701 aacatacggt ttttggtttt ccaattcctg aattacttta cttagaataa tggcctccag 26761 ctctatccaa gttgttgcaa aagacattat tttgttcctt gtaatggctg agtagtagtc 26821 cctgatatat atatgccaca tcttctttat 
cctctcatta gtcagtggcc acttaggttg 26881 gttctgcatc tttgcatttg tgaattgtgg tcctgtaaac atatgtgtac aagtgtcttt 26941 ttcatacaat gacttctttt cctccgggta gatacccagt attaagattg ctggatcaaa 27001 tggtagatct acttttaact ctttaggaaa tctccataat gttttccata gaggtggtac 27061 tagtttacat tctgagagaa taatttttgt aaaattatgt ttaagctaga tttggaattg 27121 ggaaagaatt acattcttta taggaagctt ttgaaaaata tatttaaaaa tatataaaaa 27181 tatgcataat cctaccaccc acaaatgtct gtattattat gtcatttcta aaccattttt 27241 gagattatgc tatagattac gttttagatc ctacatttta tttcttaaca ttaaatctca 27301 tactatttta agttctttgt acatgtaatg tttactgaat gggtaaatct gtttagggaa 27361 actggagcta tgtagttgtg gacttgtgct tttgttttgt tttgttttgt ttgtttgtac 27421 atttgctttt tagaaatgtc tattcagatc cttggcccat tttttttttt tttttttttt 27481 ttgagatgga gtcttgctct gtcgcccaga ctggagtgca gtggcgcgat ctcggctcac 27541 tgcaagctcc gcctcccggg ttcacgccat tctcccgcct cagcctcccg agtagctggg 27601 actacgggcg cccgccacca cgcccggcta attttgtttt tgttttgtat ttttagtaga 27661 gatagggttt caccatgttg gccaggatgg tcttgatctc ctgacctcgt gatccccccg 27721 cctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgtgcccag acagacatgt 27781 gcttttttat ttggcatttc tagcctgaac ataggctgga agtagtgact tagagaggga 27841 ttgtttgcca aatatcaggt aatattttaa ttatttatta tttattgtaa tgctgtgaat 27901 caggagtccc taacccctgg gccatggact ggtaccggtc tgtggcgtgt taggagcttt 27961 gctgcacagc agtaggtgag caatgggtga gtgagtgttg ctgcctgagc cctgcctctt 28021 gtcacatcag cagggcatta gatgctcata ggagcgcaaa ccctattgtg aaccgcatgt 28081 aagaaagtga gagctgcgtt gcacactcct tattagaatc taatgcctga tgatctgagg 28141 tggagcagtt tcatcccgaa accaaaccac aaaattgtct ttcatcaaac cattccctgg 28201 tgccagaaag gctggggacc actgctataa attgctgctg tcaagtagtg ttacgcatgg 28261 tggtctttaa cataaaaaaa aaaggtaatg aacgtacacc acacagatta gcaaaaataa 28321 ggaaaggctg atctagtagt tcaaacaagc aatcctctta tttaatggtt ctgtgattgc 28381 tcgtgtaggc agaagcttaa ctgtaagttg ttgcatttga caggcaaatg tagctggagc 28441 tggaatgtag atattagtga aattgtgatt tcagtttact ggagccattc agtgggagca 28501 tgaagtgggt 
tacatgatta cacggatata aataaagcgt cctccttgcc acttcactgc 28561 tgttaaagtc tctcctgatg ttatgtggtg aaaaacaagt caaaaacatt aattctgtca 28621 agtttccaaa gatataagaa tctttgacca gaaaagagac ggtatttatc ttcaaattta 28681 acaaattgtg atgaaatgct gctgttacag aacctagtgt aaggtcgtgg agtccattca 28741 gccagacaca tgtaattttg attattaact gccgtttctg tggtcagtcg gccagtttta 28801 tatatttcac attataggta tttggagcaa agacagtgtt gtttattgtt tactttaaaa 28861 aaatacaata ctgtaactac ataatgagta aatgttgata ctgttgaaaa ctaataaaaa 28921 caaattctta cactatcaac atcacagaaa ttctaattgt aagaaaatat tgatataata 28981 tggaaatccg ttcatttatt tggtgcatta attgaacacc aacactatgc caagcattgg 29041 gatacaacca tgaaagaggc cagcttcctg atctcacaga gcttgtactt tgttgggtgg 29101 ggcagagagc aaatcagtca atctgtctgt ctgtctgtct atctatctgt ctatctatct 29161 gtctatctag atgtcaatca gtatactctg tgccacttgg tggtcagtgc tatgaagaaa 29221 aattaagcca gtgaggagct ggatagagac acagtctgga agggtatgaa ggggatcaga 29281 atatgctacc cccaaatatg ccatttcagc aaagtactaa tttgagctga agacaatcga 29341 gaaatagcag acccaggaca agctttaccc cctttttata aaaattttgc tgtgtgaagg 29401 tatcctcctt cccctctcca tccctggaga gaccactaga ggtggcacca tcgcctgcag 29461 aaacaaacct tattgaaata acccttatct tctattagtt tcccccatat atttacctta 29521 ccacacttcc cctggaagcc ctaatcccct ctcctttgtc ttgtgacttc tccacagttt 29581 attgctcttt gttaaaatgg tgtgtatgct ctcaggccta actgcttctt ttgggttttt 29641 gcttttctgt ggagctccct cctctcccct ccctcatgta aaaataataa cattaaataa 29701 aatttgtgtt ttttcccctg cgaatctgtc tcgtcagttt aattggcaag gcctcatgtg 29761 ctgaacctaa cagggtagag aaaaaaggtt tttttcccct tttctacaag tattaaacct 29821 ggagagaaca atacagtgta aaggctgtga ggtgggagca tgctggggtg ttggaagaag 29881 cacatgagct tacaaacatt ccatcaattt gcattcattc caaaaattgt atgcaatttt 29941 tcagtgacga ttcccattga tcaatcattt ataaaaacta atggtgaggg tcttccatac 30001 tcagtttact cccttcttgg aaaagaatgt acattttaat caagatgatt tcatgaaaga 30061 cattctgtac tccaaacatt taaaagagcg tttagagtat acagcaattc atcttttctt 30121 ctgtatttta aaatttaaga cagggagaat tattgtatta tacttagttc agatttggtg 
30181 gtggttttgt tttttggttc cagtgaaaag aaattgcatc agtaacctgg cggtcaaccg 30241 tgccatattg tttgtgattt attagtctgg aaaaaataac tgtatttctg gcatatgtaa 30301 aaatagtttg ctgagaggcc tccttggttg tggcaatagt aatactaatt accaggtact 30361 gtatgtagct cattctatga accaggcctt gttgaaaagt gctttcactt aattttttct 30421 tttcagcagc ctgtgaggta cctactgtta tccccagctg ggagggagct ggccagtgga 30481 ctgggtttaa actatgtttt cattgtcttt actcttaacc acttatcttc tctgcctcca 30541 gcaaaataag tcaaagagtt cagccagtgt ttcttcttgg actaagcagt cagcctccta 30601 agtaagcaag ccttggaatg caattgattg gtaagatact aaagtaatca gtaaggccct 30661 cagttcctgg ctgccagctt ctttcaccct ctcctgcagg aagttctgaa tgctgatatc 30721 tctacttggt gtcttctctc cccctcccaa ttccattatt tccctttttc tcaatcccaa 30781 gtgaaggaag ttctgagtgt ggatgttgga gtcattcaga ccttggatct gagtcacaac 30841 tctttcaatt tactagaaat atgactgcaa gaaaatgatg taagcttccc taaaccaatt 30901 tccacatcta caaagtatct aaatagattg cttaggagaa aaattgaatt aaattgaata 30961 aagtgcaaag cacagaacat ggccaggagt aggccctgga tgaatgaggg aaaaatggtt 31021 ctgattttta tttttgagcc tctttgtaac atttgttggc cccttcagaa taatccttag 31081 agttcatatc agccaccctt catccctcct gggatgggcc aggggagagg cccagttgcc 31141 tgtgacactg cttcgtttca catttcactg ctttagcatc tcaacatttc attgataaca 31201 tttttgatgc ctgaaaacag aataacttat ttgtaattct acatacattt tgtttgtgat 31261 caagttgtta gagtttgaaa tgagcctcat ttccttgggt tttgtccctc cccatctgac 31321 cttcctcttc tcagtgaacc ctgaagaaaa aacactagca aaggaatctc aagaatgttg 31381 gttcgggaca cagctctctc catctactct cctgccctag tcctgcccct ccttgtgttt 31441 tcagcggtga ccaggcactg gtgacagcct tggctgcttg gaccagctgg aggtagctgc 31501 aaaatgcctg cctgctagct ttgaaaaatc tctgccaacc tcattgaaat aattgtttac 31561 cacatatcag aggggactgg gaagtgggta attgagagat ctttttgaag gggtttgtgg 31621 cttctccact tgcagaggct tcctgaaatg gtgggtaatg ctcatgcctc agggagaatc 31681 aatatccttc tcctaaaacc tgctcacacg tagctctaag tattagacat gcatttttgt 31741 taacatttta tcagtgcatt tttctacctg tgtcatttgg accttagatt cgggagaaaa 31801 tcactgtgta tctttggtgg tttcaggtgg ctcttaaatt 
tcattaattc acaaaatctg 31861 aattgaatgg aatagtctgt tttgggaggt ttttttgggg gggttgggga gggattggat 31921 ttttcagtcc ttccaagaga cttcaggttt tttttttgat gttctgatta gcccggtgtg 31981 tgtaaggccc tatcaggtgt tcgcctgtgt gatcctatca cacccacatc ctgtagctgc 32041 atccaagtgc tttgaggccg acttaatggg gattaactta acagttgctc tgtgggaaca 32101 gatgctcttt gccgcttatt tctaataaat tgattgagcc atttaaaaga tttttttttt 32161 cttttaagtt acatcttctt aagaaactgt agacaataaa gcaaaactaa aatacttggg 32221 tctaaaatta gtgtattata tttaaaacac tgggcttcag actgattata tcatgctctt 32281 ttactgactt tctagtcaga tgcttattta taaaaataaa gcttatataa agggcagcat 32341 tcaaatgtta caccagtcgg ttttagcagt ttgaccttcc acagtagcta aatatttttt 32401 acactgagag aagaattctt taacattata tgctatcagc atgtcaagtg aattctgagc 32461 ttgcatgcaa cagatgagac cctaggtgga aacataaaaa cataccagtg tgaaagagaa 32521 aagcaggctc tctgtctttg ggaaggagtt ctgcagggag gggtgtttat aagtgtagta 32581 gatacttagg aaaattaata caattattat aaaagtcatc tgggaaaaac cattaggtaa 32641 gaatagccaa taaaaaacca gaagcaaaat gtaaagatat agaaattaaa actgcacatc 32701 ccatgcatag ataggtcaat ggggcaaact ggaagtctag aaataaagtt gtgcatatat 32761 tggaaattag tgcaagataa aagtgtcttt ttacatcaga aaggggaaag atggattatt 32821 tagttaatgg tgttgggtta tctgactgta catagaagga aggtgggtaa ttatttaccc 32881 agtaccaata ctggattacc tagtattatg tacccagtac agtttgtcag caacaaaagg 32941 caaacttttt tattccatga agtgttctta caaatcagta agaaatacta aatatgaaca 33001 tcttcagaga aaaatgaaca tttttgaaaa cttattatga aaagaattaa acatacacaa 33061 gagagaatag tgcgctgaat gctcagcaat gatcagctct cacccagtct tgttttatct 33121 gtattcccct ttctcaacct gctcaattct cttttccctc actgggttat tttgaaataa 33181 tttcaattct ataattttat gtataaatat ttcagtgtgg atgtctaaca gaatctctat 33241 tagacgtgga caaaaaaggc aaagaatttt atagtacaca ggaaaaaata cacagccaat 33301 aaacctatga agatatgctc aatctcatag tcgtcaggga aatgcaaagt aaaacagtga 33361 tataacattg ttttttaaaa ccctgatggg caaagactaa aaaggcttgc cagattggca 33421 ttttcatcta ctgtcaattg ataacagata tttctttgct tggagtacag tggcacagtc 33481 tggactcact gcaacctctg 
cctcccaggt tcaagtgatt ctcctgcctc agcctcccga 33541 gtacctggga ttacaggtgt gagccaccac gcctggctca ttttttgtat ttttagtaga 33601 agcagggttt caccatgttg gccaggctgg tctcgaattc ctgacctcaa gtgatctgcc 33661 cactgcaggc atgagccgct gcatgcagct ggtacagaaa ttttttagat cagtcattta 33721 gaattttatg tacatatccc ttgatctagg aattctagaa actaattagt tcatagtata 33781 ttgtataaat gcgcacagat gaggatatta tgagagcatt atttgtatca actaagcaag 33841 cctttaaaaa actcttatac agtagaatat tatgctgata cattaaaagt gggcataacc 33901 atactgatgt atagaaatat ctaaaatcat accgatataa gtgaaaacat tcagaatgca 33961 ggcataatca cagattaaac agtaatttat ataaaaaaca gaaaatttat atgtgtatta 34021 tgtcagtagc agattcaagt gaatctgaca aaatctgact ctgggcaatg agaacattgg 34081 aatggaggag aaggtagtac atatttactt tgtatattat caccttttgt actatttgag 34141 tctttttaac tgtacgcaga tactttagag caggggtccc caacacccag ccatggacag 34201 gtactggtcc atggcctgct aggaaccggg ccacacagca ggaggtgagt ggcaggcagg 34261 tgagccagtg agcattactg cctgagtgct gcttcctggc agtagattct cacaggagca 34321 caaaacctat tgtgaactaa acatgggagg gatccaggtt gcctggtcca gtccttgtga 34381 gaatctaact aatgtctgat gatctgaggt ggaagagttt catcccaaaa ccatcccatc 34441 ccatccagtc tcatgccatc ccacccccaa caccccatgg ataaatggtc ttccacaaaa 34501 ccagtccctg gtgccaaaaa ggttagagat cactgcctta gaggctgctc cctcttggca 34561 catcttgaat gtaggactcc ttggttctgc cattcctggt aggtagtcag ccagcttttg 34621 atagttctcc ccagagactt catcccacag gcaagccatc ccattccacg taaaacatct 34681 ctaatgttag gaaggcacgt attttgatgt gatatctaga atcctcccag ctgttcttca 34741 gaggaacatt acttttataa agttgaaaag aacgttttat tgtattataa aactaggtca 34801 tttcagtgaa tttcatctat tttcatgggt aactgattct tatatcaaaa cttgcttttg 34861 aaagtactgg aagcataaat tattacttca ggcccatttg ctttactttg ttcagcacat 34921 tgaacatcaa ggagcaactc cctcgatcta aaatttctag cattggtgct tgtaatatca 34981 tttttcaagt tgacattaat gctatctttg gaattctcat ggtaagtcag aagaaagcag 35041 tttagtggaa ctgttctttc tcccactgaa ttgtatactg agatctgtta tattttctgc 35101 attttaaccc agaaagttgt tcaagagaat tgaccatgat aagacatctg tagcttattt 35161 
tgtatttttt taatggaaaa ttgtaactga cttttaaatt taagattgag tgtagccttt 35221 cccaaaccaa tagctaatgt ttttataact atttagtgta ggagattata ataaaatcca 35281 cattgcttaa gatataaaat gtttgtgcag taataggctg tataaaaaca gtaaattgag 35341 attgtatcag gaataatgta ttataaacca tatatgaaac caggtactaa cttggtattc 35401 tgcatagctt tatcagaaag gcagagcaaa tactatctca aaatgaagta taaacaaaat 35461 gaggttcagt gagtgatgtg gtttggctcc atccccacct ggatctcatc ttgaattgta 35521 gtccccataa tccccatgtg tagtgggagg gaccaggtgg agataactga atcatagagg 35581 cggtttcccc catcctgttc tggtaatagt gagttagttc tcacgagaac tgatcacttt 35641 ataagggact tttcccccct tttgcttggc acttcttctt cctgccacca tgtgaagaag 35701 gacgtgtttg cctccccgtc taccctaatt gtaaggttcc tgaggcctcc ccagccatgc 35761 tgaactgtga gtcagttcaa cctgtttctt ttataaatta ctgagtcttg gatatgtctt 35821 cattagcagc gtgagatcag attaatacag ggaggctaca taatttgacc caagtagcaa 35881 taaacaatag gggtgaggtt gaaatccagg cccctggtac ttatggcctt aagctaagtg 35941 aaataagcca gacacagaaa aacaaatact gcatgatttc atttatatgt ggaacctaaa 36001 acactcaaac tcatggaaac aagagtagaa gggtggtgtt accagggttt ggatgtgggg 36061 gaagtgaggc gatgttggtc aaagggtgca aacttgcagt tatgagatta atacctaatg 36121 tacagcctag tgactagtta ataatagtga attgtacact tgaaaacgtc tgcgagagta 36181 aatgttaaat attcttacca cacagaacaa aaaaagtaac tctgaggtga tggatatgtt 36241 aattagcttg attgtgataa tcatttcaca gtgtatacat atatcaaaac atcacgttgt 36301 atacattgaa tatacataat ttttgtcaat tatacttcag taaagctagg gaagaataaa 36361 agaagtgacc taaccaggac acctatctgt agctctatgg taaagccaaa agggataata 36421 acttttgatt atcaaaactt catcatcatt attatttttt aagactgggt cttgctatgt 36481 tacttaggct ggtctcaaac tcctaggcta agtgatcctc ctgccttggc ctcccgacat 36541 attaggatta taggcatgag ccatcacacc cagccagttt tgaaggtatc tttattacat 36601 ttttttgttt aaatatttaa tagtaagtgt ttaacttgaa gcattttttt ctgccttaaa 36661 attgattttc ccactcattt tatggatcaa aactattatc agattttaat tcccactcca 36721 ccacattgtt tataaaagtt gacaattatg gaaagtaaag taatactgaa tttttctaat 36781 gcctatcttt gacatggaat aaagattata aaatttctta taacttgaat 
gagtttttaa 36841 aaattggaca tctttttata aaaatctagc attgaccatt tcaacataag ttttacttat 36901 ctctgaaata caggtagtac ctatttttat taattttctt aagtgatgct gagtgacact 36961 atttttgttt tttttcactt tttctttacg ttttttttga gacaggatct cagtctgctg 37021 ccaaggctag agtgcagtgg tgtggtctta gctcactgca gcctcggctt cctgggctca 37081 agcgatgctg ccacctcagc ctcctgagta gctgggacta caggtttgca ccaccaggcc 37141 cagcagtttt aaagattttt tgtagagatg gggtcttgct gtgttgccca gactggtctt 37201 gaactcctgg cctcaggcga gcctccgacc tctatctccc aaagtgctgg aattatagac 37261 atcaaccacc gtgcctggcc ccagtgacac tgtcttaatt ttgaataggg aagtatctaa 37321 tggtattaaa atggtctggc agagaagggg atatgaggct tggtgaaaat aagcacagga 37381 tgcaccatct ttgagtaagt gttagtttct gcagtgagcc agcagcaaca ttcttagcag 37441 agcctagaaa gatcgagtga gggctgtgtg gtggtgcagc ctcttaagtc tctctggttc 37501 cacaacatgc tcagattggt ggctttgata ttttctccat ccaggacgca ttgtgctaag 37561 cttagggaag gtccagtgga tatttaccaa gcctcttata ctgccccgca tgttgccatg 37621 agacagcagc gttatcacct taaatggtgc aaatactcct tgtgcatacc ctacccctga 37681 gatctaacat ctggattctg gatactgttt cctattcttc catcttgctg catggttccc 37741 aggttatgta tatagtgtgg ctaggcacca cacatggccc tggctccttg cagccataca 37801 gaggtgacct aactggatgt aattgggtat gtaccgggtg ttggctacct gtgcaggttg 37861 agggccttcc acaggtctag caaagcaatt cattgccttt ctgaattttt gaatgcttca 37921 tttgtgaaac ccttgatgta ataactgttg atattatttc tttttgaaaa ctccccttac 37981 tcttccctgt ttcttcattt gtaaactgga accgatagaa aagtaccaaa cttagggttg 38041 ttgtgaggcg taaaagaaat aatacagccc agagaacata gtatatactc agcaagtatt 38101 tattaatatg actactaagc cagcattggt gattgattgg ctcatagtgt cttctatttg 38161 ccagatgttt tcagaggctc tgttcatgtt catacccatg tttgtaaatg acagaatgta 38221 aatttcttaa tcctctgtct tctcctttag catcagctct tgtccccaaa gtaaaagaac 38281 tcagtcatcc agaacccagt tttgtagatc tatttttcaa aattgggcag ttaatagaaa 38341 attaaaagca gtctagggaa aagcctttcc tttgggtttg ttgctttctt ttctgtcata 38401 ataagactgt gcccaaactg tagggaaaaa tcaagaaact gtaaaataca tcatcagggt 38461 gtttatagta aatgtaaaac gttaagtaac 
atttgagtta atctgtgctt gtacaaacct 38521 cagaagctta aataatgaac caatgacata taatttagaa agttttcatt agactctttg 38581 cgttggtatt tgcaatatca gatttgcatg tcatcattat tagccttcct ctattttgag 38641 tgttgactag gtgccaggcg tcatgttgag aaatttatgt gcttatttaa ttctcacact 38701 gactccatga gatagatact ccattttaca gatgaggacc ccacatcata accttttggc 38761 cagaattaca gttagtgtta gaatttaaac ctatgcagtc tcattccaaa tcctctgtct 38821 ttactcctcc agaatgctcc attaaatgaa ttgtttctta taatccggtt gttttaacgt 38881 tggaaaagaa cttgagaaca tttagcccaa ccaccctatg aagtttggca gtacttcttg 38941 ccattctttc aatagataat ttatttgagt ttaagggaga gtctatgaat taacactagt 39001 accaaaactc ccatgagaaa ataatgtaga cagcttttgt taaaaaagca gaatttttaa 39061 aggtgttcat tcagttatct tcttgcctaa caagttcctg tgttgtgata gactttgaat 39121 ttgttaaata tgaattggct tcctgattct agctgttttg ccagaggcct gatttagaga 39181 agggattaaa ccactggtca accttggatg gaatttgggc cagcagttcc ttctccacag 39241 cccctgacac ttggctttgt gccaagaaaa tcctacaaaa ggaaattgat ttactctacc 39301 caagacagag ttccggaggc tttggttgtc cattgttagc ttagaagttg gcgcagtgtg 39361 cgtgtgatcc acgcctaaat agcacagcct tgctgtgcgt ggtagaagtt gggttagtgt 39421 tgacatgctg ttgactcacc ctcccgagga tggaagctct ggcctgggtc aagttgtggt 39481 cactgcagtt aacagtttgt tgatctcagg gagtattcca cagttgctga tgtaattgac 39541 aatgattgga gccagctctt ccccagattc aaatggacca attagaggac ttgttggttc 39601 tgtttatcaa ctatgtaagt gagtactgtg aagttggcat tgtttttagt ctgtatttgg 39661 gtaagatgaa actctgttct cagacagtaa tatctttcaa aagtacaatg ttacttattg 39721 tgaccaaagt agggatgaag tttgcaaatt atttggtgta atgatttaat ttggccatcc 39781 atgcagtgca gttctctacc agtgtgattg ttctctttgt aaaatactct tactttgcca 39841 gaaaattgtt tgtttgtttg tttgtttgtt tgtttgtttg tttgttttga gacggagtct 39901 cgctctgttg ccaggctgga gtgcagtggc ccgatattgg ctcactgcat ccttcaagca 39961 attctcctcc ctcagcctcc tgagtagctg ggactacagg tgtgtgccac cacgtccagc 40021 taatttctgt atttttaata gagacgaggt ttcatcatgt tggccaggat ggtcttgatc 40081 tcttgacctc gtgatctgcc tgcctcggcc tcctaaagtg ctgggataca ggcgtgagcc 40141 actgagccct 
gttgaaaatt cttattaata agatgtttct catagaatgt ttcaaaaaat 40201 tgtgactggt ttcaaaatca gatgatttgc aagaattagc catagtataa acacatagtt 40261 ctcaaaccgt tttctgggag tgggcccaga gatctattta acaaaacctt gagtagagtc 40321 tggtgcaagt taaaattgga gaaccactaa tgtaacctta ccttatctgt ggaccacatt 40381 ttgagctttg gaagaaataa ccctttctat ccttcacaag gtaacagatg atttttttgc 40441 acaattattt gataatgagc ttatatttga gattaaggta tgcacacgat aatgtacatt 40501 ctcactctag gtctggggac cttgattctt tgactcttct gtccttttct ctccagttac 40561 ttggttttgt gcattaacaa acttttcaat ttacttattt ttttttactt atttacttat 40621 ttctcatccc ttacgtaatg ttgcatcatt gttttttgta gcctctgcag ttcaagatac 40681 taatcacccc tttctaccac attaaaatac agactctacc taaaatcatg tatgggagta 40741 acttgatata tacgaaagtg taaccgtgtc acaattcccc gccccacccc gccagtctac 40801 ataaaactgc aaacacttta ttgcttacca agaaagtaga caataatatc gaagaaagcc 40861 ctccttttcc tcttctttga ccaacaaggg agcagttcct gacaaatgtc agtttgcttt 40921 ttgagttcct taggtttctg tgttgtgttt gcaatggata aatattgagt aacaggtact 40981 gactgtattg tatacattga aaactatcca ccaacagaat atttgcagga ggattttgtt 41041 ttgttttgtt ttgttttgtt ttgttttgtt ttctgagaca ggctatctct ctgtcaccca 41101 ggatggagtg ccatggagtc atcatggctt actgtagcct taacctcctt gtctcaagcg 41161 atcctcccgc ctcagcctcc taagtagcta ggactacagg agaggaacac catgcctggc 41221 taattttttt tttttttttt gtagagacag agtttcgctg tgttggccag gctagtgttg 41281 aactcctggg ctcaagaaat ctcccaaagt gctgggatta taggcatcag ccactgtgcc 41341 tggcttgaga atgacattct ataagtagtt tcagtccctt agtttaatcc catattttga 41401 tcctataaca tgtacaatat ttgtgtatag tatatgatct gccaactctc taagaagatt 41461 ttaaaactta ttatgttaaa tttcagacat atacatcatc cagcttcaaa aattgaccac 41521 attttgctgt tcttgtttca gcacctcccc ctgccccatt ttcttttctg aagcatttta 41581 aaataattct caggtatcat atcattttac ttctgttcat aggtttctga aaggatttga 41641 tgcagaatgt tattattaaa ccagtataca gagagaaagg ccactggata ttagtacatg 41701 ttttggtttc agacggtcct tttaagagta tattaaatca cataaggatt ttctgagtac 41761 ccattagaac cacaaaaaat acctatgata cccttcttac tgcagtagaa gatgagtttc 
41821 ctagggtagc cctttgttga tggaacaaac atgtattttg tttcatcagg aaaatcattg 41881 gaacagccat gactcagatc cactaaatga aggaacaaaa ataggacctg cctcacctcc 41941 tgattttttt ttaaatctct aggatgagta accagaatat ttttatgttg tgctaagtag 42001 gtttgtttgt ttgtttttta ctctgactta tattatttga attgtagaaa tatattgttt 42061 ttactttttt ctttaaaatt ctatctggag tttctaaatg agagattgta gctcctaaaa 42121 cctttcttag ttttggtgtc ttgctttctg ttcttagcgt ctgtggtcat ttctaactcc 42181 cagtgaggaa agagctctct ggtcttgttt ctttgttaaa tgaaggagca aacctccgtc 42241 tcgttgatcc actcccaacc cacttgtttt gtaacaatga aagttatctc atacttaaaa 42301 gacaacttga aattataaca cacctgtaac aaagtaaagg gtatgggttt ttacttaggt 42361 ttaaaaaaag ggcagtaacc cttttcataa aaaagcaaaa tttagatggg atagaacccc 42421 cgtttatagg aaaagatttt aaaatctcat ttgctctcat tgaaacctta accttctctc 42481 aaatactggc gcacctctga ggaacttcaa atacggtttg aaaactaatg gtttaaggag 42541 ggccaggcgc agtggctcat gcctgtaacc ccagcacttt gggaggccga ggtgggcgga 42601 tcacttgatg ttagcctggg caacatggcc aaatgctgtc tctactgaaa ataggaaaaa 42661 ttagctgggt gtggtggcgc atgcctgtag tcccagctac tcaggaggcg gaggtggagg 42721 cgggaggatc tcttgaaccc aggaggcata ggtgcagtga gcagagatgg atctcttgaa 42781 cccaggaggc ataggtgcag tgagcagaga tcgcgccact gcactccagc ctgggtgaca 42841 ttgagactgt ctcaaaaagg aagacgaatg gtctaaggaa actagtttgt ggtttctcag 42901 cggagaatga cattctagtt tctacagatc tggcaacaga agacaggcac ataaactggc 42961 ctgctttgga gtcagggcca agacttaaat ccagatttaa aaccagcttt ttttttttct 43021 tcttctaaaa caaccaacct gtgttttagg tccagtgtct gccttgtaag ttgctcaaag 43081 ctcagataca aggaagtaat cagcacacat ttttgtgata acattcctga aaagaaaaag 43141 atgaggtgga ctaaggtaaa tacactgctc cttaaaaatc catattttta gattatttat 43201 aaatcaaatg ggcatggtta ccaaagatga attctagagg tgtaaaactc cagtcttata 43261 cctcaaattt cagttttcct tagggagcaa aacacttcca tgatcttact gaggacctaa 43321 aaaagcctgt acatatttta aaaatagcac tccctagata aagcctgatg gatccagggt 43381 acagtagaat catgaaattg cctgtttctt tttaaaagat gtatatgatt gctcggtggc 43441 tgtatattgt ccatcgtgtg cgtgcctgtg tggtatgttg 
gttctttcag agaggcagtg 43501 gcgtgcagag tgagaggatc acagatgtgt atgtgacaaa actatttctt ttctttcttt 43561 tcttttcttt ctttctttct ttctttcttt ctttcttttc tttcttttct tttcttttct 43621 tttctttttt cttttttgag acagagtctc actctgttgc ccaggctgga gtgcaatggt 43681 gtgatcttgg cttactgcaa cctctgcctc ctgggttcaa gcgattctcc tgcctcagcc 43741 tcctgagtag ctggattaca ggtgtgcacc acgatgcctg gctaattttt gtatttttag 43801 tagaggcagg gtttcactgt gttgttcagg ctggtctcga acttctgacc tcgtgatccg 43861 cctgcctcgg ccacccagcg tgctgggatt acaggtgtga gccaccgtgc ctggctgaca 43921 aaactatttc taaaacacaa gtccagatat tcctaccaag ggctctctgg ggcttcgttt 43981 tgatttatat gcaaaacact cctaatgtgg tgacattggg gagataaaga tctgggatga 44041 tgagagagaa aggcaggtag gacacatgcc cttccagtaa ctttcccagc tagggcttgc 44101 atttaacctc tgatggaggc ctcaccatcc aggcagatgt acttgcttag gaatgagttt 44161 gagataataa ggctaaatgt gctgccttct gtatataatt tgcatttgtc ctaatgtacg 44221 actaaggaaa tgaaaaatca aagtttgagt gactaaaata aaagaggtct gtggagtaat 44281 tggattgaat ttgcaccaag tcaaggagaa aacttctgta ttttgtatgt tttgtcatgg 44341 ttctatgatt acttgttaga aagtgatata aaattcaata acttagtttg atatgaactt 44401 ggctatcttt aagttataga agtcgtgcat ataatgtggc tgttttcaac ccccataccc 44461 tttctgagtg gttggagatg tcttgaattt cgaagtgaga aggtctggtg ctgttacata 44521 gataatcatg ttagccagct ttaccttctg ataagctctc aatttaagaa agtacttcta 44581 atcctggctt tatctttggc ttgctatctt attatgcaca gagcataatt gtaagaaggg 44641 aataatatct cttgctgtct attcactaag aatagtgtga ggacagggct cttaaaaaca 44701 tgaagagatg ggcactattg tatagatata gagatttgat gaaatgttaa aatgttggat 44761 gattttaaat gacaagtgga gtggtgagaa gtttattttt aaccttatgt ggataccttg 44821 gatatttaga aacatcgaag ttttaaggaa gaacaaatat cttgactcca agaaaatggt 44881 catgtgatac tgtggcatct atttggtctt cctgatgtat atgtttcctg acatttagct 44941 cctaaaatcc ttggaatctc cagtgctaag aaggtcttat atgctaatga gatgactggt 45001 ggctgggtaa cctaccatgt gattagaggg ttaggacttt cagtcaccct ccccaacctc 45061 caggagctga agtttgagtt gataaactaa tgatcaatga tacctacata atgaagcttt 45121 cttaacaacc ctgaaggacc 
gcgttctggg aacttccaga tagctgaaca catggaggtt 45181 cctggagggt tggatgtcag gaaagggcat ggaatctcgt tttctcttca gccgcacctc 45241 accccgtgca tctcttccat ctggctgttc atctgtgtgt cctttgtaat atcctttaaa 45301 ataaatgggt gaatgtaagc aagtatttct cttagttctc tgaaggtgct ccagcaatga 45361 atcaaaacga agtaaggggt ctggggaatg gtagccagtt ggtcagaagc acaggctaca 45421 acctggggct tgtgattgac atctgaagtt gggggagttt tgtggtactg agccctcagc 45481 ctgtgggatg tgtaatctcc aagtacagtg tcagaatcca gttgaattag aggacactca 45541 gctggcatcc attattggca tgcagtattc cttgtcatga gcctgggcaa catagggaga 45601 cctcatctct acaaatatag ctgggagtgg tggtgtgcac ctgtggttcc agctactcag 45661 gaggctgagt gggaagatcg cttgagtcca ggaggcaaac attgcttctc ctccaaatgc 45721 tgtactttaa catgtttgga aactcctgca aaggaatttc aacataagcg aagtcaatgg 45781 gttataaagg actgaatgcc attttgtaaa tgtcagaaat caaagtcaca accaagttaa 45841 gttctagatt agggccagga agagaaatgt attatctggt caaccatatc aaatctgtat 45901 tttggtggaa gagcattttc ttatattggt tctgattggt ttgatacgga cagggaactg 45961 ttttctttat tccagttatt tgttttgttt tgtttctgag acagagtctc gctctgtcgc 46021 ccaggctgga gtgcagtggt gcgatctcgg ctcactgcaa cctctgcccc tcaggttcaa 46081 gcgattctcc tgcctcagcc tcccgagtag ctgggattac aagcgtgctg tagtttttgt 46141 atttttagta gagatgaggt ttcaccatct tggccaggct ggtcttcaac tcctgacctc 46201 atgatcccac ccgcctcggc ctcccaaagt gctgggatta caggcatgag ccaccgcgcc 46261 cggcctattc tagttgttaa tcagttaaca tagccaaaag atatagaata agattgtaga 46321 ctgaatccat agtctaatga tttgtcttcc tagtaattgt gaataattca gacattcaaa 46381 attagtctgg catcatacaa acattagact agtcctaaaa tgaggatatt ggcaagtgtg 46441 ttaataaaaa ttgcatatgc agtattaata ggttccttac gtaggctgcc gattactcag 46501 gaactgtcag atgttttgtt ttgttttgtc tttgagagat ggggcagggt tttactctgc 46561 tggagtgcag tggcacaatc ttggctgact gcagccttga cctgctgggc tcaagcgatc 46621 ctccctccac agcctcctaa gtagctggga ctacaggcat gcaccaccat gcccagctaa 46681 tttttgtatc tttttttaga gacggggttt tgccatgttg cccaggctgg tctccaactc 46741 ctgagctcag ggatctaccc acccatgcct ccctaagtgc tgtgattaca ggcatgagcc 46801 
actgtgccca gcccccagat ttttaatctt ctctttgaca ttgagtttca gaaatgaagt 46861 ataaatggtg aatttttagt gtagaatatg atgcactagg aagatcactt acccgtttat 46921 tcactttctt agaccatgtc aacccgattc tgaatttagg gttctgttgt gatggtgttc 46981 tttagaagtg attttaacac tcaaaatact atagtctgta acataattta ctgatttttt 47041 aatttttgca aactaagttg aagtaatcaa gtatttatga gttctttggg tgagaccatc 47101 tgtcaaagaa ttctgggctt aatggagata catatatgtt gtgaatacta aacagatatc 47161 caagtaataa agcagaaatc atatgctgtg taacagaaat gtaaagttct gaattagttc 47221 acgagtagga gagaaacttt tttttaaagg tagaggagaa ctaaaagtaa tttcttaaaa 47281 gagtgcctat tttacatctt gaatattaaa agtctaattc agtgccttgg gatcttgttc 47341 tttttgtaga ttttgatttg aatgaacttc agtttgctga aatgtctaat gggtgcaaag 47401 tacagattta attataaggt tgagctaaaa tcagaattta agcatatggc atatataata 47461 ccaatttaat ctttccccct taaatttagt tatttttagt gagtcccaaa caccatattt 47521 gagacatcac ttttttcttt tttaaagaga tagggtctcc ctatgtgagg ctggagtgca 47581 gtagctattc acaggctcga tcatagcaaa ctacagcctt gaactcctgg ccccagtgag 47641 tgtaatgtct caccatgcca aagtactttt atcttaaatt gcttattttt ttgtttattt 47701 ttttaactga ctctgtttac aaaattaacc ttttatctag tgacagctag attgtatcac 47761 atttgtcatc tatggacaac tgatttttag ttgtttaata tggtaagttt attattgttt 47821 ttccttattt aagaaaacag gatctgagtg aaaaaaaaaa aaatcttaat tcaactatca 47881 cgattccagt aggaagtaga cagacattca tctgtataca ttaggtgtca aagtagagtt 47941 ttgtggttta tcattctgaa gatttgagat tcaagacatg actatctttc agtcatattg 48001 tgcaatagat aacatataat gtttctcact gatgagaaca gttggttaga tctgccacac 48061 tgagcgtgta gctggggttg acgacacacg tgttgcagtg tcgttaccag tttgtctctt 48121 ttagagacaa cttggcaggc agacgtcttg cagataaaca aggaaagatt gtctatttta 48181 aatggcataa gacattcttt aaaccagtgt gtcatatgtg gttatatagt cagcattttc 48241 tttagtgcag gaaatcctct ctcagggttc ctgaataaat agcaggtatc tttgtaggga 48301 tagctcgcag gagatttcaa atgctgtgta ttcatcagtt caggctcccc agatgtctta 48361 cgaacgtgct cattttagtt ttcttttctt ttccctcttc ctgctagata caagttgaga 48421 caatatgatt agctcttaaa gtgcttactc tccattttat gtatattact 
gcattttaat 48481 cctcataata tcctgtgagg tgggatgtta ctatgtccat ttgagagata aaggaaatca 48541 agaggtaaag ttaactggct ttctccaaac agctggtaat tactaaatac agaattgact 48601 tttgcctttt gattccaaag atcatattgg tactttcatg tgtgggtccc ttcagccagt 48661 tcctacagtg accctgaccc agctgtaagt cccctgaggt ctctagagat atagacagtg 48721 atggattaga ccctgttcat agttgcagat acaagacata gagaggtttc tagcaataat 48781 ttttttagtt atatattttc atagggcttt tgatttatgt aacctattag tatatatttg 48841 ttattccagt agagttaata cattattcaa agagattcca gtttttaagc cttttctccg 48901 tttgtctaag ttgttcatta tgaaatgatt ggctagaatt tgaaagtctt tacctacaga 48961 gagattctga ctggcttgct tggtcttgga accctagtta tgagtgcatt gtcttgcttc 49021 ctggtgaact ttgcccagaa atagttacca cctcattttt tttttttaaa tgaaagctgg 49081 ataattaacc ctcaagattt cttaattaaa agattactat tataaattga cccacaggga 49141 gttaatacaa taaaattatt tttgatcatt ttatatgaaa taattatggg gcaatcttct 49201 gtaatcttga gttgaagggg ttatcttgct taatgttaat aacttttgat acaccaacta 49261 ccttttaaaa attaacgttt tttttttttg cttctgtgga agggaggctg tattttgtag 49321 gatttacagt ttacatagtt attgttttat atgatgtcag ttggattaat tatgttttga 49381 gtggaaactc tgtttggttc aaccaatatt tgaatgctaa ttttgttctg tgaacaaata 49441 gaaatacaga gaacagtaag acatgactct ttccccctaa gtaaaggctt aggttggagt 49501 tttcaacctg ggaattactg acattttgag ccaaataatt gtttatgttg ggtgggggag 49561 ggcctacagc ggaggctgcc tgtacattgt aggatgttta gtaacatccc tagcttctac 49621 ctactagata gatgccagta gcaccaccca agtcgtgaca atcatctttg ccaagaactt 49681 tgaacctcta cctagaaaca gttcataggg ctcaaaaagc aaagtggcct aggacccatg 49741 tagttaacaa gcttaccctg ttggagaacc actgcatagc tgaacaacat gggtgagact 49801 agtgtcaggg taaaggagct cttgactgga gattggccgt acacagaaaa caaagtgagt 49861 gttatggata tttggggatt taccttcatt agggcacagt tggttagcgt taggtgtaca 49921 ccaacattat tccctcttta gaaacagcca tgcacgtaac attttagcac atctgaagag 49981 gcacttctat tctaattacc taaaattctt attatctaga attacatgct atgaacaaca 50041 gcagtaaagt cagaagccag aatctatgtg atgttatctg ttttgcgatc acagttgtgt 50101 tcttctctta gtttatttga acaagtttgt 
gaaaaaattt ttgtgcttat gtaaatactc 50161 tttgcttaga ttttttggta gaatgtaaat ctttcccaca cagtgcaata cccctcctgc 50221 ccaagggcga gatatagatt gtacagtttg ttatttctgt gttttagggc ttattcctct 50281 gtagacatgt caaaaagaca tccgtcgggg gtaaggaatg cattagcaat tcaagctgat 50341 aatctgcatg gtgagagtcc agataatttt ttccccaggt cacataggca gctctgcaca 50401 agtctgttaa acatatgcta gttttgaaaa ttaaaataag ttcccatcac tcaaggcaat 50461 gaaacgtaga tgaattgctc tgccagattg actcttagaa ctcctgatgt gatgtggggc 50521 tagacagaat gggatgtttg ctaataaatc agggaggtag tgattgtttt ccttcttttt 50581 ctcccccacc cccgtcagga caaactcaca ttctgtagcc tggtgcactc taataatgcc 50641 tgttgacctt ggccatgaaa gcactctaga ctcaccccct aaacaccgca ccgaatgtgc 50701 tcatccttga aatttacaag ccttagaatc ttccctacct ccactcctgt ttactgcctc 50761 ctgttactcc tggtgcttcc agtcagaccg tagtcttggc tgtgcagtac ctttaggagc 50821 cctcaaaaat gtttaatttc ttttaaaatc agaaggaaaa ataaagaaca agataataat 50881 catgaatata caataatgaa tccagcctga cttagagttg tctttatacc aatcaacaga 50941 gccctaaaat atagctttta atgttttttt aagagagaaa gaggcccacg aaagttgtcc 51001 tggggccccc gcatcctcct catacactgg tgggatgctg ctcttgcttg ctttggaatg 51061 tggcagcagc tcccagtgac ttcacacgtt tattacaaat tcaaggtcat ttggacgcta 51121 gccttgtgct acctttcatg cttcctgtcc gacccatact cagcctgggc atgttttctg 51181 ctctgatctt tggtctgcct tccttctcct tctccaagtc aaaccagtca ctctttcctg 51241 agagctcttc ggtgtaccct tgccagacct ctgcattaac actccagcac ctttggttag 51301 ccaatttggc cttatttgtg tctccaccat tggattatct gtccatcaag gtcaggaatg 51361 ttttcgggtt accccactgt ccccaacttt gaatatgttc actgcctgga aaatgtttat 51421 ctgaatataa ggcatcaagc cagaacttgc ccaaaactta acctaggact caaacagctt 51481 tgaaatctta tctgtgtaga gtgggaacgg catgtgtttg ataaccatca gcatttgctg 51541 ttttgcttag ccatcaacaa tccgtttcta tctcttcctc atagtatttt cagctgtatt 51601 tttgtacagc agtcattcac gttaggtgtt aggttacact gaaatctctg acaaccagtt 51661 ctgtgctact gaaggctttt agcagcagga ttaaaaaaat agttgattgc acattcgaag 51721 tctcactgct cacactgttt atgtaattct tggagtccag ttagatcatc tgatttttaa 51781 aatgttgcct 
gaattacaaa ggagttcagg aatgtgttat caaagaatta ggaaattaac 51841 attgaaattt attgagcaac atagctaata ggttgtcttg ctttttaagc cttcttgttt 51901 tctaggagga gcctgaagtc cgtggacaag ttgaactgta gctggtattt aggaattggt 51961 aaagacagag gtgtggatgc tatgttttct tttctctgtt aggaaattac tctcctggtg 52021 ccacaataaa aaatttcaca ggtatctaaa gtggaattta ttttgtattg ttccacacct 52081 ttgtcaataa ttacattttc taccttctag tttattactg ttattatttt tcatgccagg 52141 tgtttcttaa ttgttggcat aagtatttat tgaatgctta tatgtgggag tgagagagtc 52201 aggccagtgc taaagagtag ggaataaccc ctagacagtg aagtaacttt tgtcttgttt 52261 gcacattaaa aaaaaaaaaa gattaggaat ccttgctcta tatactagaa atgtttgatg 52321 ttctcttttc ttctcttcaa acagagaaaa cttagatgaa tacttatagg ctctactgtc 52381 aactaactgc accctcctaa tattggcgtc tgcatatttt taccattgtc aatatcagat 52441 tgcctctgat gttgtgtcac tgaaactgat agtaacatta catccaagag taagactgca 52501 gtgtgatttt tggctcttgg tcttctgttc tttcggcaac aattttttat aaatatagtc 52561 aagaaaatag aaataagagt cttcactgct acctcccctt cagaattatt ttctcctagt 52621 attgtggttc ctgatagcct ttattcctat agaaatggca attcgaaaaa ttccttctgt 52681 gaatctgagg gctctacaaa gggagagctt ctttgatagt gctttggtgt aagttattta 52741 agaaggacaa tctacccagg cttatcaagc aaataacctg gaatttaaac ctggagtatt 52801 ctttagtgtt ccaagtctgc aggaaaagaa gtaatcttgg cttaaactaa ataagttaac 52861 agaatacaat ttgaacaatg aacaagaatg ctgtttgaaa attatcttgg gtttttatgt 52921 caagttaata ttctcactga atttctatta tgtttgtaaa gcagctttct cctagaccaa 52981 aaatcctagt aataaaacca caacagtatg agagttataa gcacacagta aaggctgatg 53041 agtaaatgtt atgacacttt gaaaggaaat aaaccagacg atataatgta aatatccttt 53101 gtgcttatca aaaataagtg acttcatgtt ctgttcctgc aatatttctg taaatctgct 53161 aaaaatagaa aggtaaaggt ctaagatcat aaatctctga tgccctttcc ttaaaagtaa 53221 ggaattcttg tcttggattc ttatataata tacttattta taatctcact tgttctgatt 53281 gcaggtggcc tcttactatc tttacatcca tgtgaatgtg tgtgctttga tgtgtacata 53341 tgagagacaa gaacaaaagt aaatgtcaaa tatccgaatt cctgtttcag tgagctttca 53401 cggagtaaat tctttttgtc tacttgcctg tgatctcttc taatatgact tgcgttaatg 
53461 acctttaaga aatcttttaa taaaataaaa aattttatta aaatctttta ataaaaaaat 53521 tttattaaaa tcttctaata aaaaatttta ttaaaatctt cttcatacgt gttacttgaa 53581 tttactcatg gcaaggacca gttagtacca ttttatgaaa tcacaagctc atgggcttgt 53641 gataaaacat ataagtgctt taaatttggc attggtaacg gcaggccgac tcacttgaaa 53701 aatgctctga actgctctgc ctcatgctgg tgggaagtca gatggaaatg actttctaga 53761 gagtttagct ggcctccaag cttagtgggt actcactctc cttatccccc gatagctgct 53821 ctgagatttt aatgcacatt gaagacaata tgataactta acctggaaaa ttagcatttt 53881 ggtaagggtc tttaacttca ttagtatttt cataccactt tggtattaga aattgtaaag 53941 atattaaagg gtatggacaa tgtggaatgt atgtaccaga ctcagcagtg tatactgact 54001 ccctcactat agcccagaac tggaaatccc gctaactacc tctccccatc ctagcctgaa 54061 ttgccctctc cacctatgtt ggttttacaa tggcctgacg cctgaaagct ggtagaaact 54121 gtataggaaa atagggttgg gataaatttc tgggctgaca aaggatagag gaatatgctt 54181 ctttttcccc ttgtccaagt atatcagcaa ctctaagatg cagatttatc agtatggctc 54241 tactgggcat gttctagtta aatggtagac ccactacctc ttctggaatc ttgttttcca 54301 ttatattccc acctgcagct tttcaaagac ttattctctt ctctaatggt ctcttggttc 54361 ttttttttgc ccatgctcta cgttccgagt gggtcaacat tgcagtgggt caatgagaag 54421 aaatagaggg gaaaaataaa agaaccagaa agaaacctgt taagtttgca cttacttgtg 54481 cattaagcat attagcgtaa gccttaaact ctagaaaact cactgtagtg ctgtaaagta 54541 ccactgataa agtaaaagtt catttttatg ctgtgtcgtt atttagtgat gaatatgata 54601 ttgcataaaa gatgaggatc tgtaaggcca ggcgcggtgg ctcacgaggt caggagttcg 54661 agatcagcct gaccaacatg atgaaacccc atctctacta aaaatacaaa aattagccgg 54721 gtgtggtgac gcgcgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 54781 gaaccaggga agtggaggtt gcagtgagct gagatcacgc tactgcactc cagcctgggc 54841 gacagagtga gactccgtct ccaaaaaaaa aagatgagga tctgtgtata actgccaggt 54901 gttgtgtgag gtaccatgaa taacctggca atatcctaac tgtgcaggta tgaggagtga 54961 cccccaggaa tataattgca attaaacttg caaattatgt ggggatactc tatttcaggg 55021 caataaaata aattatgtct agagctcata actcttacac atatttggct aggtctatgc 55081 agcaaaaccc aggtgataac tgaggcttta actttgggat 
ttgtttggtg ctgactctgc 55141 cacatggtga aatttggtag gatctcagtg cgagtgtcac atgatcagtc tgcttgaccc 55201 aacatgccag aattccattt ctcagcccac tctcttctct gcatacatac ctacctcccg 55261 gagagggctg ggcgggagcc tacgtgtggc aagagccctc actgcttttg ttacacagat 55321 ttcaacacat atggcttgat ggagtgtgat tgcagtagca tgaatgtttg gtgatgaaat 55381 ttgcatgtcc tttgctggga gaagtataat tataagattt ttgtattcac atagccacat 55441 gtgaggtatg agcaggaacg gggtctattt aatagcatta gattgctttc agatttgaaa 55501 tatttccttt tagaaatata ttgtgtaaac aaaatatgtt taattttttt taaaaactat 55561 gcctttgtac aaattgttta tcatttgcag aggtctaaaa aatgtaggag aaaagaagct 55621 cacaaaatca atcagatcta aagattgcaa agattgatta ccatctcctt aaaagcttca 55681 gtacatgtgt ttacagtatc attctatttg cctctgctca gattctttcc aattctagga 55741 caacccaaag tgtatagatc ataagccata actgatccta ccttacataa tacattaaat 55801 ataaaatatt tatggccagg catggtgacc agttctgtaa tcccagcact ttaggagacc 55861 aaggcaggag gaacacttga ggccaggaag ttgagaccag cctcggcaac atactgagac 55921 cttgtctcta cagaaaaatt agccaggtat ggtggtatgg gcctgtactc ccagctgcct 55981 aaaaaaaggc aacttgagaa agtttctacc ttctgaaaca atctatgcct tgattctatc 56041 acttagtatt ttagttattc ctatcaaata catgactgtg ttaatttctt ttgccacaaa 56101 aatgctgcaa agccacaaac tcactccaaa atttagtggc ttagaacaat aaccatttat 56161 tatagcccac aagctcaagg gtcagctgat cccggcttag cttggctaca tttgatataa 56221 aacagtagaa atttagaaat tgaatatatg tagaatatct cttgtaccaa aaatgaagag 56281 gaataataga ttttgatttt tctttgttgt attctgtgct gggtgctaat ctgtcctagc 56341 tgtcagaaca agtagaagcc atactggaaa atcatttatt tgttactatc aaatttattt 56401 cagtgaatgc cattcaaacc actctctctg aggagcccag ttaccaagtc ctccaccttg 56461 ttagcaacat ggacttaaat gcactaattc tagaatgaga tcttagcaaa gtaccagcat 56521 ttgagatttg tgcaaattga gctcttaaac tgtgtgtata tgtgtgtgtg tgtgtgtgtg 56581 ttcattctag cactcccact acataccata caagtagctg tatagcaggt atatatacat 56641 gtgtatatgt acataggtag gtatatgtca actgttgaaa aactagaaaa tacacattaa 56701 ataaagtaag ggttgataaa cattagtagt tggaacattg ggagatttgg acttctactt 56761 tgcagattca gttggctgca 
ttagatttaa ggaaagaact aactttggag aaatttgtgt 56821 ttgcatttca ggagagattt cttaaagagg aaactaccag aggttagtag atattaacat 56881 ctagtgattt aaatttgaga aatgcaaatt agacatttaa gctgtgggtt tgcttcccgt 56941 tttacttgct ctaaaagtaa tttttttaaa agccagtaac aaagacatgc aaaatagtac 57001 tgccatgggg gaaaaattac ttgaaaataa tattcaagat tgaagaatta aaaaagcttg 57061 acatctagta ccaggaatgc cagctggctt gttctgtcaa aaagcaagca tgtgctttgg 57121 aataacactc ttgagtagac tctaggcggg ttttgagccc ttctgattat ctaaatagat 57181 gtggaagtga cttatcttgg tttttctgag tcaaatgaga taatgagcaa aggccttagt 57241 aggtctttga tttttgcctc taaccctttt tctatctgca tgttcacaag gtcaaggctg 57301 atgagatatg gcaatattga gagagtcaga ataggataaa gatctcaaat actatatata 57361 tatttttttt tcattttttt ttccctttgg tgactgtatg agacatgtca gtaaaacttt 57421 aaagacaggc caggcgttgt ggctcatgcg tataatccca gcactttggg aggccgaggc 57481 gggcagatca cttgaggtca ggagtggcaa catggcaaaa acccgtctct actaaaaaat 57541 acaaaaatta gccaggcgta gtggtgcacg cctgtagtcc cagctactca ggaggttaaa 57601 gcacaagaat cgcttgaacc caggaggcag aggttgcggt gagccgagat caccccgctg 57661 cactccagcc tgggcgactg agcaagactc tgtctcagaa aacgaaacaa aacaaaagac 57721 acacagaaaa actgtaaaga cagaatgaaa gagatgaact ttttacatct cgctcatttc 57781 atcacatact atatagtatg tgtatggttt catatatgtt agtgtgtgtg tatgtatata 57841 tatgaagtta aaagaaaaca ttcacctttt aagactaaat ggacattgac tagattgtaa 57901 actgtttttg aggctaagag tttgtttgcc atgcataagg ctctattttt tgtaattcca 57961 cttttatatt catttatatc agtcctagca gtagatgcca ctcttactgg gtcctgccta 58021 aatgcatgct tcaaatgttg gttattaatt tcacgagtag tgaataacaa agactcctat 58081 tgagggacct cttgatttcc tactgggcac cagcagcaag catgacttac tggccttctt 58141 atgagtgggt aggagttttg ggtcaacttg agaacatcta tcctttgagt agtaaaaaag 58201 cttggtatta ggttggcaca aaagtaattg cgatttttgc accaatctaa tatcaacaga 58261 ggccgtgaaa aacatcaagg gaggatagct tgtaattaga agccagatca acagtgaccc 58321 aaaaagaatt tcaggtttcc aggagtgaaa tatgtagatt actcctctga aaggaattgt 58381 ttttctccta ttcgctttca tggcaaatca ttttttaagc cgaaactcaa gctctgaaat 58441 
aattgtgtag caaatagtgg ctgtgaatag atgtggcaac tctccacaag ggtaaataga 58501 aaccacttgc agtttcagtc atcacatttt atttttcttt tttaatagct tctctatttt 58561 tgctcaccac aaatacatgc acacacacac acaggggcaa agccttggaa aattgcaatt 58621 ttccatttta atttgagaga cagcccggtc acattaacag aacccttgac ggaataggtt 58681 gaattccctt ctctgtccct ataaccccat ggccttggac atatccttac ctggtgccct 58741 ttcctccctc ttatggtcca cttttgtgat tctgatttta atttcagttc tctgttagtg 58801 atgaaatgag caactgcaga tgttccgctt acactggaaa tacttttttg gcatttaaaa 58861 gaaaacccta cagatgaatt attcccctga aaggggttca ctcagatttt ctagattatg 58921 attctagaaa atagttcttt acatttgaca atggagaatt taaatgtttt aatatgaaaa 58981 tgtaaagcac tcttgtgggg tatagttgaa ttatgcccag gttaagggaa tccagtattt 59041 atgtcaggaa accatggatg tgatagaaat agtatcgcta caaagctggg cacagtggct 59101 catgcctgta atcccaacat tttgggaggc cgaggcagga ggatcacttg aggccaggag 59161 ttcgagacca gcctggccaa catggtgaaa ccctgtctct actaaaaata caaaaattat 59221 ctgggcgtgg tggcacaggc ctgtagtccc agctacttgg gaggctgagg caggagaata 59281 gcttgaacct aggaggcgga gaggttgcag tgagccgaga tcatgccact gcactctagt 59341 ctgggtggtg gagtgtgaca gtgtatccaa aaaaaaatac tgagattatg caagaatctg 59401 actcgaagaa gtttgaactg attaaaaagt accaaatact tcattatatg aggccagcta 59461 tttagactag tcagcgggtg gttcaaatgc agaaaacaga tggctttcat tagccagaaa 59521 ccttaaaatt acataaatcc acggatgcaa ctaagctatg gatctgtgcc ctctccttgc 59581 caaacaggac ctctcagcta gcaaaacaga gagctccttg ttcacttttc cctcctagtt 59641 gagaccctgt tggggaggaa tgctgccaag cctgctctat aaggaatttt gatctcctag 59701 atgataacag agagaaactt tttggagaaa gggaacaccc tgtttgaaag gacaacaaga 59761 gaacttgttc aagaaggtgg ctttgcattt aattgaaaat tgtagcaact gtaggcgtag 59821 ggtttccaga ttcacatctg cagactggtc cagattaggt atatatgttt atattcgttt 59881 atgttatccc cttaagcctc ttttaaagcc atttctaaaa attgctttgg aaagtccatt 59941 tcacttcagt attctttata ttgtggctca aaaatatttt ttttctggta acaaaaatat 60001 tttctttctg tttttaagac ttgtagcatt tttaagtttg atatgtacgt ctctgtagtt 60061 cagagcactt ggtaaacagc ctcttgcttt tgaggcaatt tatgaatgct 
tcattgtttt 60121 tagctggtta ttttgttcat tgtcaggaac actgttttcc catgggaaag catctccagt 60181 tttatgatgt cagtggcaag tcattcagtt tacaagaatt accttcatat tcagtcttag 60241 aaaagcatag accacaaacg gccagttata tcacagttct catcttttaa atgttgtctt 60301 agatttttgc aaagtaatct taactcacct tagaatagtg ttaagacata aatagaacta 60361 aaatgtagta gttaagtggt tatatttccc atctactaag gtatttatat ttcttttctt 60421 ttagaaaata aaaatgttta aaaaattagg aataattatc tttaggatga ctttgtaaga 60481 ctgtgctttt ttactggcag caaatacctg ccagatttca ggtatttggc aggtaaattg 60541 aatcttaaaa gatttcaatt ttaagagtgt ccatattaac agatcctgaa taggaccctg 60601 tgtatcattt attaagtcat ttcttggagg cttcactcag gaaaagggag ttaacccttg 60661 gatggcaatg tttaatgtgg ccacatttcc ttctggctga atgccaggga gtggctgcgc 60721 ttagcaacca ggctttgttt cataaatggg ttatttgatg caggtgagga tgccgtgagt 60781 tgtttgtctt gttttaattc agggagacag gcatatgtag tgaaagcatg gaattgtttg 60841 cagatgggct tcagtaagaa taggaaggtt tctggttcaa tttgtatctc caagctaatt 60901 gctctgattt ccatatacac atctggtgtt ggttcggggg gaaaaaattt tacttgagaa 60961 acatggcttt tgtgagttga tatttatttg tgaagggcct cttgtaatgt caaatgtaag 61021 aagatctata cacacaacca attttacaat ttctaattca acagcttacc ctattattga 61081 ggattcgcag tttaaattat taaatgatct ccacttgaag aaaatagagc atgttaatca 61141 tcacgattct ccaaacattt atttttaaag ttaatattaa atctcatctg gtggaaaacc 61201 tgacaggaac tgaagcatat attttctagt ctaaactctc attgttaaaa tgaagacaac 61261 atgagacagt tatagcgccg taatcaagtt atttgaaaaa aatgttttaa aaattaccac 61321 tttttggtag tcaacataca cacatatgcc aacagatggc attaaaaaat ggcaagaaca 61381 tgagacaatt atagggtcat aataaattta tttgaaaaaa atgtttaaaa aattaccact 61441 ttttgatagt cagcatacac acatatgcca acagatgaca ttaaaatagc tgtaaagtca 61501 agaagcaggg aggatgacaa agtgaaaaat agaagaaatc aaaggaagtt aagtcaaaag 61561 aggaaaaaaa aatgaagcat ttacttggtt tgtcaggtag ggtgatccag tgtattttgc 61621 caaaaacttt cctggtttta gcaccaaatg tctcagaaca tggcagctga tgacctccct 61681 cacagtgagt gatcccagag agcaatgaag aattgtgatg cccttcatga ccttgttttg 61741 gatgttacac atcatcatgc ccttcagtga 
cctttctggt agcaaagggt ttgtttttaa 61801 aagagttgca gcatctttaa gtctaatata tacatgctcg gaagttacac catcatcact 61861 tctgctgtac tccatttgtt gcacagacag accctgatac attgtggagt ttcccagggt 61921 ccatgaatag caggaggcag ggactattgg gggccttctt ggaggctggc taccctgggg 61981 agagaaattg tcaatttcct ctccttgcgg ggaggaaatg gaatcaagtg catttgtatt 62041 gacaagaagg tagaacgctt tccttagaca ttgtaatgaa cagattgtgg ataaacaaaa 62101 gtacagagag agtttgaggc agaggaggag actgtttcaa cagaatattt accagctatg 62161 gtaaaggctc attagtgtct ccgtacagtt aacaacatgc tgccaagact ggaaatcttt 62221 gggacaggaa tgaattcgga aggtgaacag aataaagtac tgtgggaaca atggcgattc 62281 cattagtgga ttgtacattc cttacgggca gggccctcca tagcatctga tgtagggcct 62341 tatccacagg tatctgatta ttatttgact ttctttaatg tttttgcagt gttattatct 62401 atactactta tttcagtaag attgacaaca tttgcatttc gatttctaag atcaagacaa 62461 tgaatttcct cttaatccct caggaaagct gtgaaattta aatttataga gcattctact 62521 tagcctgctg ctttgcatct gtcagagaat gagacccccc agtttcatgg aagtcacagg 62581 aagtggcaag gggctcagat ggagttgagg ggttctagct atttctgcag atgatctgga 62641 tattttggca tcggctgctt tgacttaacc atctggcaat gtaaaggcag aatttcgtta 62701 aatattggaa aagttacttg aaaacatggc tctagagatt ttgccagcag cttttgcact 62761 catgacttgg ttcaatctaa gacctgtttt gtttttttgt tgtaaggacc taagtcctct 62821 gttgggtttt gacgaagtat gtgtaaaaat gactaaaatg gccagaggta tcttatttta 62881 catttatgtg ggaagaaatg tttggaaact gtttcatttc tttaactgaa aatttctcag 62941 tgcaatttta gttggtgttt ttcctcctgt ggacagtttt taggaatgat acaagttatc 63001 gtggaatcac ccagttcttg agtgtgtgaa cgaatggctt ctgctggaag ttaattcttc 63061 ttggcatggg tgtctcacat ttccccagca gctggatttg acaagggagg gcctatcatg 63121 attaatatgc cttctaattg attggtttaa taacaccttt actcctgtgg atatattttt 63181 catttcttgc tgaatttttt tttctttttg agataccaga acctgttaag tgacacggta 63241 tagatatttc aggacttcca ttttctcttt taaaaatatt tttgtaaagg taatgcctgt 63301 acacagaaga gtttgaaaat acagaaggca ccctttttca tgaagttatt ttgtgaagtg 63361 gtctattgcc tgcaatggta atagctagga ccccttactt actacccagc aggcactatg 63421 tagagctcca 
aatgagaaaa tgttggctcc ttgcagatgg ttgggagagg agacaccagc 63481 actcttaaca gacttgttaa gagttgagac tgataaccag agaatattct gggggaaacc 63541 aggagggatc gccaacatac tcagttatat ttgtggatgg tgaattctac agtgaatttt 63601 tctgttcaaa atcataatac agtcacttgc taactaccaa cgccatactt gctctcagtg 63661 ctgagaggtg gcacaaatat tattttcagt cttggtcatc aggcagggcc tggcaggcat 63721 catgctctgt gtgagtcagt ggagtttatt ataagaatgc ggtagcctcc acagtgatta 63781 tatttattat aggaaaacgt gtaattttgt tgtcagaacc tttcagcagg taccacaaaa 63841 tctcatatgg gaaaacgtgt aaatttttca tgaataaatt cctttcggta ttggtaattt 63901 ccttctctgt tttagtataa aagaaagcat tgccttatag gtggctaaga acatctgtag 63961 catacataac acttaatcta aaggagtgtt gaagaccggg ctgattccta attgagaatg 64021 agcaagaata gaactctttt tgatagttat ccctgttctt cctccaagcc tctgccttgg 64081 agctatggat actataacta actgaagctt cttctttcag gtacccactg atggtaatgc 64141 tggcctgctg gctgaacccc agattgccat gttctgtggc agactgaaca tgcacatgaa 64201 tgtccagaat gggaagtggg attcagatcc atcagggacc aaaacctgca ttgataccaa 64261 ggaaggcatc ctgcagtatt gccaagaagt aagtcctgtc cggtggctag caattcacgt 64321 tggatcacat gcatttgttt tcaaaaaatt taacttcttg tattttgcat cagtatttta 64381 accctacagt aaaaatcttg gttcctaatg attcaccata ccattaatat atttatttgc 64441 attaccctat gatatacata taaatgtttt taaaattatg atgtcgtatt atgaccatca 64501 ctaaacagta gtttaagatg tcacagcact tttttttttc ctcactctgt cacccaggtt 64561 ggagtgcagt ggcaagatta tggcttactg tagccttgac ctactgggct caagcaatcc 64621 tcccacctca gcctcccaag tagctggact ataggcacat gccaccaggc ctgactgatt 64681 ttttttaatt tttagtagag atggggtctc cttgtgttgg ccaggctgtc tgaaactcct 64741 gggctcaagc gatcctctgg ctggacctcc caaagtgctg agattacggg tatgagccac 64801 catgctcagc cctccacact cctttcatct tcagtatata tctcaccacg gctctacagt 64861 caaggtcttt tggttaaaag tgctagcatt aatttttttc tgttaccagc ttgatacata 64921 ttttagttca tttgtatctg tttgtattct attatagttt ggttttttcc tcccttatta 64981 attttacttt ttgactatca tgttagttct atgaaatata ctcagaaagc tcacttttca 65041 ctcattctct attgataaac attttcattg ttcctgattt atttttttgg caaaagaaaa 
65101 tacatatttt tccttcttgt acaaaaaaag tacaaagttc actgttctgc actttgcttt 65161 tttttttttt ttttttttta acttaacttt gtatcctgga aatcatcctt tgtcatttcg 65221 tggagatctt catttttgtt taattctttg tagtcttttt gctgttacta tacagggtgc 65281 tcctttctta ttcaatttaa aatagctgga gttccaggca ttttatctca cacttcttag 65341 atagttatct ctgcattgtg actgcatctt aagaacatta gaagtaatta gtcttatgct 65401 tgttggcaaa gcagaattct cgacactcat tataaacagc agatagcctg gcaacatttt 65461 cttttgttgt aatccttttt gtactagaaa tttttcttta gcctacttgg ataatgaatg 65521 tttgttgttt aattaatatt actattctat tgagatcctt ttattttaaa ttgtaaatac 65581 taaaattatt attattaaaa ggtttaattt gttttgatag cgctctgttt aagaaaggac 65641 aagctctaat tactaattgg tgttttgaga agcaaaaagc atctacatta tcacaatata 65701 ttacatatta agtaattttt atcttgaaag gagtatgaac catgatgtga aatgatgaat 65761 agtggaaatc actaatactt aagttcatta tatgtatttt ttgttttgtt ttgttttttt 65821 ttgagatgac atcttgctct gtcgcccagg atggagtgca gtggtgcaat ctcagcttac 65881 tgcaacctcc acctcccggg tataaatgat tctcctgcct cagcctcctg agtagatggg 65941 actataggca cccgccacca cgcccagcta aatttttttg tatttttagt agagacaggg 66001 tttcaccata ttggccaggc tggtcttgaa ctactgacat tgtgatccac ccatgtcagc 66061 ctcccaaagt gctgggatta tagccgtgag ctaccacgta tggtccatta tatgtatttt 66121 tacaagaaca ttctgtgtgc tgatttacta agtatggatt gataataaat cagcttttgc 66181 ctgggtgtga cttttataaa gtgtcaatgt tagaaataat atttacattc tgctgaagag 66241 ttcttgtagt acaattgtgt agataagtta taaattttat aaatcagaac tctgtgcata 66301 cttatatgtt tttgatacca tttttttgat atatgcatac agactatgta ttctgagtgt 66361 caaataaact ttttttaaag gctagaatta aaatttagtt tatgacattg accaaattgt 66421 ccttttctgg tgaatcgact attttgatac atgtgtcata gtttgcatct aagtttttct 66481 tcccttagtt tcacgtatga ataaccagat gttttcacat ttgtatggtg gccataccaa 66541 aaattactct tcttgacccc aggaacaccc ctcttttccc gtttcactta gagatcctac 66601 taatgcctgg gtagctttac tttctgtctt agcctgttgt ttgtcccagg gtgtattagt 66661 ctgttctcac attgctgtaa taaactacct gagactgggt aatttataaa gaaaagaggt 66721 ttaagtggtt cacggttcca aaggctgtat gggaagcatg 
gctggggagg cctcaggaaa 66781 cttagaatca tggcaggagg tgaaggggaa gaagacacat gttcacatgg ccagagcaag 66841 aaagagaaag agtgtgaagg gagaggtgct acacactttc aaacacccgg atctcatgag 66901 aacactgtca cgggacagca ctcgagggat ggtgcttaaa cattgaaacc acccccatga 66961 tccaatccct tcccaccaga ccccacttcc aacactggga attacaattc gacatgagat 67021 ttgggtaggg acacagagcc aaactgtatc aaagggttga gcacagagtg gttcttccca 67081 gttctggact ttaccagatt gttggtggcc accatgagtg gggagaatta gccctttgct 67141 gtcaccttcc tcttgcaatt ctggtattat ctgtcctctg cctaccaccc acccacggaa 67201 gtcttgttgg atgatacaca ccagccagat tcttcaacat taggcgccat aagtaagaaa 67261 gggccaaaac atcatacctt gtgagccatt ttaatggatt tgggctttat tttgtagcaa 67321 caggaagtca ctagacttta ttttttatct atttattttt ttcttttttt ttcttttgtt 67381 tttgagacag agtcttgctc ttgtggccca ggctgcagtg cagtggcaca atctcagctc 67441 actgcaacct ctgcctcctg ggttcaaatg attctcctgc ctcagcctcc cgagtagctg 67501 ggattatagg cgcctgccac cacgccggga ttttttttat ttttagtaga gacagggttt 67561 caccatgttg gccaggctgg tctcgaactc ctgaccttgc gatccaccct cttcggcctc 67621 ccagagtgct gggattacag gcgtgagcca cccgcacccg gcctagactt tattttttca 67681 gagaagagga gtgattgtag atttgcattt aaacaaatca ccttcttgct ttatcactat 67741 gtcccctacc atatatcttc gttctcccag agactgccat gctgcagcct ctagccgtgt 67801 gctgtccagt actgtagcca ctggccctat gtggttgttt ggcacttgaa atgtggctag 67861 tggaaactga tgtgtgctat aagtgtaaga tagataccgt atctcaaaga tataatgtga 67921 gaaaacctgt aaaatgatca cattcatttt tatataggtt taacattgaa atagttatgt 67981 tttgaatatg ttgttaaata catcataaat gttactgagt tgtaactatg tgattaaaat 68041 tgatttcact tgtttctttt gacgtttcac gtggcagtga ggaaattgag gttccatagg 68101 cagttgcatt ctgtttccgt ggaacagtgc tgttctctgt gctatcccga gccaccagtg 68161 actctgagct gtaagggtct tagatcccat gatctggcaa gtgggagagg gaggtagaaa 68221 tgagtccctc cataccttca accagaactt cactgcatgt tgcaagtcag ggaacaaggc 68281 ttgtttgggg tccttttaaa tacatctttc attaaataca tttggatgta agatgattat 68341 atgcctttct agaaccctgt ggtgctctct tcctcctcag acataatgtc tttggatact 68401 aagggtgcat tgatgaacag 
aacacagggc ctgttcttca aagggggaga aaagcttgta 68461 aaatcataat tttagcaatc taaaaactac atggtgacag tgacactaat accatcagca 68521 ttgagcattc acaaagggtc acaaagggtc agctactatg ctaagtactt gcactcattt 68581 tacccactca gtgacacctg tggagaagaa ccgaagaaag caccttgggg aattgcagca 68641 ttgaaggcaa acaaagagag acgattcccc atgagggaga tgagcagtgg ccagagagat 68701 aggataggaa gtaggaggcg ttacatgctg gaaggcaagc catgtaaatt tctaaatgag 68761 gggaaagata agagtgcaag catgatgcaa acggaaagca gttgaagtgt agtttttgac 68821 atggtaaaat catgtctgtc atgcagactg aacacatgtc aaatatttta ttcaattcat 68881 taatgagatg aactgtaaga tgtgaaaaat ggttcatttt gacttggaga gatttaaaat 68941 tccttcccca gcacacttga gttgcatggc tatgaacagg aatcatctgg aattatctgg 69001 aatatcgtca aattatttcc atggtacagg aacacttgca ttgctatggg gtgtctgcag 69061 tatccatctt tacaaagctc taagggtgtg gaatagcttg taaggatgta agtaataagg 69121 gtgtaagaat tttaagggtg taagtcgtaa aatagcttat ttagtagcaa gtcaactgga 69181 ggaacatgct gtagaatcag ataatccttt ccccgccacc ctcagggtct gcaagacaac 69241 acctgcattt caatgctagg acacccagca atctcttttg ctcatatctt gtttttggac 69301 atttgtatgt gtgtgtgaca aaacattgtt aggatctggc aaatagaagg tcattgttga 69361 ccctgtggaa tagcatgggg agaagttacg ttgtagtcca ttgcaggaaa aacaacgtgg 69421 gtcaatggtg agtcagctgt ggactcctgg aggaaagacc tagaggcaaa aagctttgag 69481 gtcattgaag acaaaaggaa aggctttgtg attttgattt ttaaatggaa ggaatttggg 69541 cctaaggaag cttaaactaa actgtgggaa ggactgatac ccatcctaag cactagttgc 69601 aggtatctct taagaactag gttttggccg agctccgtgg ctcagacctg tagtcgcagc 69661 gctttgagtg agaggatcgc ttgggcccaa gagtacaagg ttacagtaag ctatgactat 69721 ggtattcagt tctgtaagaa ctaggcagaa aggagatcca gtggatatgt gattgcctgg 69781 gtcttggaaa ggggagggaa catctccttt tctgaggaag aatggggaga tagggtgggt 69841 taattaggca gtttcttttg ccaggtgata aaaaccccct aaacaatttg tattttatat 69901 atggagaata ttttctccca ctgatgggaa ggatcgtagt gtgggggatg cttacccaat 69961 ttcttaaaaa tgtacacaga aatgagatcc ctaggtgctg gaaccgaggg tcagtactgc 70021 ctagaatttc cccattaggg tctcttttgt ttctgcttac cttcctgctg tgtgtttact 70081 
ccttttctcc tgctaagatg ggtgtcctca aggagaccag ggatgttggt agccacgacc 70141 tctggtagtg ttagtcctgg tttggcatcc caactttgtt ctgcccatgg gcattatcag 70201 ggagtgcttc tggttggcac tgctttgagt tacacgtctg cccttggagc caccaccgta 70261 tctgcaggga ctcagtatca ggactagccc atcttggcct tgtgggtgtg gccgtgtgat 70321 gtggggagag gagctgttgc cagaagaggg atgaggaatg gcatgggcaa aatagaagaa 70381 tgaccccagt gctgcaacac agtgggtatg gattagaaaa ggcttgtagg ctggacaggg 70441 cgaggtgagg gagtgcatca tgcattagga ttttaagtgg ctttttgggg agtattgagc 70501 aggtggcagg ggagacaagt tcactcagaa gttttagaca gtgttgacat tttctttcaa 70561 attgtgcatc ttgaattctc tagaacattc tttatagata tttcaggatc ttcctgccac 70621 tttccacatt gtagctattc tggtgttttt cttttataat tagaaataat gaatagttct 70681 catttgtagg cttggatttt catgcgtgtg ttggaaagtg ttcgttttgc cttttttttt 70741 taacttcatg tttttatttt ttaaaccgta ttgatggatt cattgctagg aacggcaagg 70801 aaaagaatac tctcacaatg ttaccacgtt tccttctcac aggttttata agttgtatta 70861 tttttatatt ttcaaagtta taatcttacc atctcctctg ttgccatact tctcattcat 70921 agggtctgtc tttaactgga ctcattgttt acagctagtc ttttcatcag atcatctcaa 70981 accctgtatt acattaatat tttgattccg ttttgttgga tgactttcat cttaatcatt 71041 cttattgttg aacaatgttt ttcttattat tgaacaattg cttcattagg tatagaattc 71101 ctggttgatg ggtgtttgat tcagcaccat gaaaattcac ggtattctgg tatctgtcat 71161 tgggcaaatt cttcttcttt tttttttttt ttttttttga aaaaaatgcg tggaaaccat 71221 tagttaccat tgacagagct actcatttaa tgattttttt tttctcccaa tagtggtgga 71281 tgaatttttt actttgcgtt ctcactttgt gtctctcatg accaaaatat tttagagata 71341 actagctggc taagtctgag gtggctgtta acagtctggt cctaggatag aacaaaacca 71401 acagaactca tcaccgtcac gtcgttttga ctttttgctc acaagtttaa ctcattgttt 71461 aatttgtttt tccaaaggat tgctttgcag tatacatgtt ggtgcatagc ttaacatgat 71521 gataaataca taactttgac tgtgaattat ctactctgct cctttcatag tctcatgctt 71581 ctctgaattt gtattttgtg ttactgggtt gttttgttgt tatcattcat agtgtttttt 71641 ttgcccttga ttctttgaac caggttttgg tcaagggcaa ggttcaagag aaaaagtaaa 71701 aacacagtct tttatttttg ctcttggtta taaacccttg taaagtttga 
ttgagaagat 71761 aacttggatc ttatggttta aattgagtct ttgtttaatc ccaagtcttc caaattcttt 71821 gtgcttatgt attctttcct ctttcccaat cgtaatagtt ttgcttctgc ctagtcattg 71881 gtcaaatacc tgcctttcaa tgcatttaaa aaatattttg tgttagcacc atttgagatt 71941 tttcccccct tttttttcaa gtatgtattg aggactgtta tgggatgaat gtttgtgttc 72001 ccccaaaatt tagatattga aaccctaacc ccaggctatt cggagatggt attcggagat 72061 gatacctttg ggaggtgata actgttagat gaggtcatga gtatgtgccc ccgtgatgga 72121 actagtgccc ttataagaaa agataccaaa gagcttgcct ttttctccct gctcaaacga 72181 agaagtcaca taaggacaca gcaaggaggt ggctgtctgc aagcaaggaa gagcggcctc 72241 accagaactc ggtcatgctg gcaccttggt tttggacttc cagcttccag agctgtgaca 72301 aaagacattt ttgttgttta agccattcac agtaaggcag cattttgagc taagacaaag 72361 acctactgtg gcttctgtag gcaaaagccc tacagaccca aagtgttgtg agaattaaca 72421 tttctaggga tcatgcctta cagtggattt tatccaacac ttagctctga cctagagcac 72481 gctagtcgtc gttaggtaaa tgcttgtgga tggtagtttc tggcacagag attggagcgg 72541 taatgggcag gagttctgta tccagcttat gctgagagag cttcccataa aatggagcct 72601 atactttcct tttgcagaga ccatttctat ccattgatag aattcttttt tttctttttc 72661 ttttttttga cgggatctca ctccatcgcc caggctggag tgcagtggca tgatgtcaac 72721 ccactgcaac ctctgcctcc tgggttcaag tgattctcat gcctcagcct cctgagtagc 72781 tgggattaca aggtgcacgc caccacatgt ggctaatttt tgtattttgg tagagatggg 72841 gtttcaccat gttggctagg ctggtcttga actctagtga tttacctgcc tcggcctccc 72901 aaagtgctgg gattacaggc atgagccact gcacccagcc ttccatcgac agaattctaa 72961 agacatgttt aggcaagagt tggtggaatg ttggcgtgta gtccagattt gccttcagtt 73021 tgtttttatc cgcttcaaat gcaaaggcca tgataattca atagtaaagt cataaatcca 73081 tagtatgact taaaatattt ggttgtaata ttacaaagta tgcttccaaa cacattaagc 73141 atgcaaaatc cttatattgt tctttttgtt tagtgtttag ggtagtttgt gtgaatcata 73201 gatgaccaca gtagactgtt gagttgttat tttcttttga cattctttga tagaaaacag 73261 tgcttttcct ctgttattta tactagaagc aaaaagtatc tctcatagca ttatggaaac 73321 aactcgaata caactacgtc aattattcat cttttttttt tttttttttt tttttttgag 73381 acagagtccc actccatcaa ccaggctgga 
gtgcagtggt gcagtcttgg ctcactgcat 73441 tctttgcatc ctgggctcaa gtgattctcg tgcctcagcc tcccaagtag ctgagattac 73501 aggtgcccac accattctca gataattttt gtatttttac tagagatggg gtttcaccat 73561 gttggccagg ctggtctcga actcctgacc tcaagtgatc cacctgtctt ggcctcccaa 73621 agtgctggga ttacaggtgc gaaccactgt gcctggcagt tattcatcat ttaacagcat 73681 aaagtggcca cttgctttaa taaactatta tagtaccaaa attttggcct gaggaattta 73741 taaatataaa tagaacaaaa caaaaataat attccccttg aaacatattc taaaaccagg 73801 ggctaaaagt tagaattgtt agattttatt gtcctgtttc tttgggatga aatgaaattt 73861 tttaaaaacc atagttttgg ccgggcgcag tggctcacgc ctgtaatccc agcactttgg 73921 gaggccgagg cgggcagatc acgaggtcag gagatcgaga ccatcccggc taacacggtg 73981 aaaccctgtc tctactaaaa atacaaaaat atttgccggg cgtggtggtg ggcacctgta 74041 gtcccagcta cttgggaggc tgaggcagga gaatggcgtg aacccgggag gtggagcttg 74101 cggtgagccg agatcgcgcc gctgcactcc agcctgggcg acaagagcga gactccggct 74161 ccaaaaaaaa aaaaaaaaaa aaaaccacat agttttactt aaaccaaagg ctttgggatt 74221 ccagaagaat aaaggaagtg tacttcaatt ctacatctca agaatctatt tcttcccaaa 74281 tttaatacag gcaaatttag tctatattta ttgggcactg ttaggttttt gtgtcaggtg 74341 ttgctgagtg gcgggatacc attcttggcc cataaggacc ttataaatct attaatacaa 74401 ttaaattgtg aagaaaattt agaaaattgc aaaggtagaa tgaggttgca aattggaggg 74461 tcttattctg gagataggtg gaagctatgg cagattttta gcatgaatgc tttctggtca 74521 agacactttt agaaaaatta gcttgtttcc aagagtagat tagattgggg agagaaggaa 74581 ttcaaattac aaaataatag tgtgaaaagc attgtgggag atcatcctgt ccatcccctc 74641 attcatgata tcctgaggtg ctgtattaag ggcatcattg ctctttgtca cccgagcagt 74701 ccaggaaggc tgcctgttca gtcttagact gtatcctcac agtcttaaac catatcctca 74761 ctttatgata gaaacacaca ggcaagtgtt gtaagagtta catattgtca gttaaacaag 74821 aaattggatg agtccccata ctattcttcc tcttttaaag agagaacttg ggccgggtgc 74881 catggctcac acctgtaatc ccagcacatt gggaggccga gtcgggctgg tgacgagatc 74941 aagagatcaa gaccatccta gccaacatgg tgaaaccccg tctctactaa aaatacaaaa 75001 attagatggg catggtggcg cacgcctgta gtcccagcta ctcgggagac tgaggcagga 75061 gaattgcttg 
aactcggagg ccgaggttgc agtgggccga gatcacgcca ctgcactcca 75121 gcctggcgac agagtgagac tccatcccaa aaaaaaaaaa aaaaaaaaaa aggaaaaaaa 75181 aagagataac ttgaaacagg ctcttccagc ctctctattc ttttaataac atgcagtgca 75241 cttcttggag ctaggggcta gggtctaggg gcttcttgaa ggtgttgatg tctctggtaa 75301 cttacactgc agggcaggtc tctggaggga actttgaaca ggttaagcct caaatgcagg 75361 cagagatttc caattgggaa agttaggtga acgtggcttt atcatctctt ttattggttt 75421 atgggtattt tctctttttc ctgttttctt tcccaccatt taacattatc tgttctctgc 75481 ctttttaacc ttctggcttt aaatgaacag ttgcttaatc tatagattaa caggatcagc 75541 aaacaaagca cagcagtgcc tgcacaaagc agaaactgcc tgttagatga gttgagcaaa 75601 accatttcct ttgcccaact ttctcctcta aaatacataa ctgaacatgg gattcccaga 75661 cacgaattgt ttctcaacct ttctttttcc attctacagt tgtatgaaat taaaattaaa 75721 taaaaagcaa acaacagagg tagtcttttt aatcttaccg agttggaagg tatactatat 75781 acattatcct atggtagaag gtctggtaca ttcttgacca ttattacttt gctatgggca 75841 cccttcaccc ttgtaggctg aaaattggca aatcataggc acgtatacat tgtggttttg 75901 gaatttgaaa gatctcaaat tcatattgag tgagacagtt tttgaatgga gggatcaggc 75961 tgtctatatg tggatatatt ttatgcattt ctggtctgta taaatgtaag gattgcttta 76021 ctgaccgtag atttcttcta gggttaaaaa gttgaaggca gtttggatat tttaaaataa 76081 gaaaattagc cctacattga aagagagaat aggaaaatct aggctatgtg tacacatctg 76141 cttgtgtttt cacaattgtg tcctgtccta tttattttta tttctctccc atctcttccc 76201 tcccacccac aatttgtttt tcattcaaca ctttgaaatt agaatagctt atctatcaaa 76261 caaggttctt tgacaggatt ttccaactct gagtatgaaa gtgtggatat aacaatatat 76321 ttatatagaa tgtcagaagc agagtttggc cagattattg ggaaatattc taacgccctg 76381 cagcatttta ccgcctttgc ttttacataa ggccatcatt tggaaacata gcatgctgat 76441 acctcccatt attgcagtag cgtttcatca gctgcacgct gcctctcagc gcctcattaa 76501 gttgcaagca gagaacgttt tcgttttcca gttctctttc tcaattaagt tagagattgt 76561 attttcagta ttcctcatta ttaaaagaat ttttgttgag aatagtttaa gatgtgatat 76621 ttctttgtat tccaggaaaa tttaaagaat gtcattgaaa tccagaaaca gaagtatata 76681 ttgtctagga catgctttgt cttggtttat atgcctaagt ccgagtagtt taaaatttat 
76741 gattaatata ctctttgaat gcttagaaat ggcatcgatt ttagttgttc tagtttcatc 76801 ttaaataaat tggagtaatt actaactcag gattatgatg tgctacacat aaaacatttg 76861 cagtgatgtc tggtacttag gaaaccctca gttatgacat ttttcttact ttcttcttgg 76921 atatgttctt tgaaaaaggg actggtatgg aaaagctttg agacgatgaa aatatacaag 76981 ataggctatc tctgaaacaa tttggtatag agcatttgat tctctgtttc atgtatatat 77041 tcatatatgc tagctgcagt catgcagaca caatttggat ctgggttttt ttttttttcc 77101 tttttcacat gcagttataa atttagaaac ctggctgttt cctctggact aaatgtattt 77161 atttaaattt tcagtgcttt aaatgcaaaa ttacaagcat atgtacaagt agaaagaata 77221 gtaaaatgaa ccctcccatc tatctgtcac tgaattgcag taatgatcaa atctagtcag 77281 tcttgttcct gagtcagtct ggaatgcagg ttttttccct taactattca ggttctccag 77341 caatcctggc aaaacacatg cgggtttcag ctcatctctt tcagtgacct gtaaattaag 77401 atgtgctaaa gcatcacatg ttactcattg acgttttgtt aatatattta aaacatttta 77461 ttagtaccac ataattttta gcgttttcaa aaagaaataa gaatttggct aaaggtgaat 77521 tttaaaaatt gaatgactaa gtagaaaagt aggcaaactg ttttcagaat tagtagcaat 77581 ttccaattcc atttactctg ttcattaact ttgttttcag ccttttgtta gccatgtttg 77641 attttgtgtt tttgaaatac ctttggtcat tttaggactt ttggcattaa tcacctgtct 77701 taggtggcct gtttacaaaa gggacttttc ggttctccat ttgcccttaa tgttgggaat 77761 agagacaaag tttctgtagg actctgacaa catatggctt tacaaagttt ttttttaata 77821 tatacatttc tcttcttggc tttcttaaat atcatcaatg tcttttcttg ctttacgtat 77881 ttagcaactt aaatgtcatc ttgtctgaga tcaatatatt tccaacttgg agagatagtg 77941 agaagagaga cttcagattt ttaaaatgtg gaacaaaaca ttctgatggt gaccaggttt 78001 taaggtattt cttcagtaga caaaactact taggctaccg atgataagct tttaatatta 78061 ataataatta ttatatataa aaatattaat ataatatgat ctaagattgt ttctattccc 78121 cctctttcct ggttgttagg atttggaatt tcttatgcac tctttttttt tttttttttt 78181 tttttctgag acagagtctg gctctgcctc gcaggctgga gtgcagtggc gcgatctcgg 78241 ctcactgcaa gctccgcctc ccaggttcac gccattctcc tgcctcagcc tcccaagtag 78301 ctgagactac aggcgcctgc caccatgccc ggctaatttt tgtatttttt agtagagacg 78361 gggtttcacc atgttagcca ggatggtctc gatctactga 
ccttgtgatc cacccgtctt 78421 ggcctcccaa agtgctggga ttacaggcct gagccaccgc gcccggccta tgcactctta 78481 aaattatttc atgtactgat acattcactt tcaatgataa aacatggctg tatgaccagg 78541 taggtaggct gcttttcaga ctcctgtact aactgaatgg gccccattag cccaagtgag 78601 agcttaactt gtcactcttc ctaaatgcct tcctcagctg agtgcttatt tacagtaatc 78661 tactgcttat agcagatgct atgatttgta taagctgtcc atatctactt ggaagaacaa 78721 tattgaaagg cattgagata cttactttcc ctcaaaagcc cataatccaa caactattta 78781 gaatatctta tttaagagat aggtttataa tacccaaagg gaacttaatg tatactgaaa 78841 gtccctgatt gaatggattc tgaccttaac cagaaatatt aattacatgc ttatcagaca 78901 cttactgtgc tatcccaggg tttgccactg ttgagggaac accagatgaa tcatattgtc 78961 cctgccttca aggattatat attctaatag gacagacagc ccatcagcag aataggaggt 79021 agcaagtggt cagggatcag ataaagtgat caaggcaagt ggaggatgca gtggtaggac 79081 agaatggtct gcgcacagag attgagggca tcatgtggtg ggaattagtg aactagcatg 79141 caagccgggc ctttttcttg gcttttgttc aattggaagg agcataaagc tactaaagga 79201 gcttatgttt agaaataggg cttattgaag gggacaaggc atctcctaga atccaaatgc 79261 acatcccaag acttaaaaga gttagagggc acaggcaggc cctgtcagat ccggtctctg 79321 ccagatccag tcttgtctgt ttctcatctt cccttgtctc tctgagtatt ccctgttttc 79381 ctttcctgga ctaactgctt cttccagatg cctagtttga gtccacccat ggttttttgg 79441 gggcttttct ttgttttttt gttacgttct gtttttgttt tttgtttttt tgaggcgggg 79501 tctcactctg ttgcccagac tggagtgcag tgacgtgatc ccagttcact gcagcctcga 79561 cctcctcgct taagctgtcc tcccacctca gcctcccgag tagctgggac tataggcgca 79621 agccaccatg cacccatggg tgttgacatg ccacctgtgg ctcagactta tgctcacctc 79681 cagtcaagtt tcttccttat ctaacgacag cctttgtgtg ttcgtgaaag cccttgaggg 79741 aaagaatgct acgtacagtt tggttttctt gggctctaat aagctatagg ggatggtgct 79801 gcagagtttg aaatacagtc actaatgagt cctccttcag agagctgagg gtgggagcac 79861 cacataaaag gagttggggg tcagcagcct cctgatagtc tgaaagccaa agatgatgga 79921 tgtggggctc cacggggagg ctataacatg ccacgttatg ttttctgccc ttgaggaaag 79981 cttcttttca ccaggcaggc agtacagata caaacttaat agcgatgtga ggcacactga 80041 taaataccag aaaacattct 
gtaagagggg ttagcattat ttcactgtgt gcagaggagg 80101 tttgtccatg tggggggacc ctaatgggcg taggcatttg aagtgtagtc taagggatct 80161 taggtaagag tgtgtttgga acacagtgta cgtatttata cagcaaatgc tctgtgtggc 80221 tataatttag gaaaatttgt aggacttgag gacagcttat ttttctacta gctgacagag 80281 gcttgggttt gcttctgctt taagtataag tatctcgagt ccttttttaa aaaggaaaat 80341 caaggaaata gttggaattg ggtttcacat aaaaaacatt ttagaaaaga aggggagtat 80401 gagagttagg atttgctaat gtttgctgta ttagcaaatt atttgcaatg cttgtcatat 80461 tagcaataca tgtatgtttt ggaaaacaca tgcaaattaa gcagcttaag gaacttttat 80521 ctttatttcc tatgtaagat ttgagatgct gcaaagaaaa tattaaaaca ttgaaccgat 80581 acttacagta gtcctcctat atctgcagag gataacttcc aagaccccca gtgaatgccc 80641 gaaaccttag atagtaccaa acccaatata tactatgttt ttttcctata ggtacatacc 80701 tgtgataaag ttgaatttac aaattaggca cagtaagtaa ttcacaacaa ctaataatag 80761 aatagagcaa ttatgacact acgccaccat cactactctt gcactttggg gccgttatta 80821 agtcaaataa gagttacttg aacacaagcg ctgtgatgaa gtgatcaatg gcaggtagca 80881 tgtacagcat ggagacgctg gacaaaggga tgattcactt tctggactgt gtgggtttga 80941 gatttcatca cactgctcag aatggtgcgc aatttaaaac ttattgttgg ctgggcatgg 81001 tggctcatgc ctgtaatcct ggtactttgg gaggccagga caggaggatc tcttgaggcc 81061 aggagtttga gaccagcctg ggcaacatag gatgacccta tctctttctt aaaattaaat 81121 ataggccggg cacggtggct cacgcctgta atcccaacac tttgggaggc caaggcgggt 81181 ggatcacaag gtcaggagat tgagagcatc ctggccaaca tggtgaaacc ctgtctttac 81241 tgaaaataca aaaattagcc gggtgtggta gtgggtgcct gtagtcccag ctactcagga 81301 ggctgaggca ggagaatcgc ttgaacctgg gagatggagg ttacagtgag ctgagattgc 81361 accactgcac tgcaacctgg gcgacagagt gagagtccat ctcaaaaaaa aaataaatta 81421 attaaataaa tagaaaaata aaactcattt atttatagag ctttctattt aatatttttg 81481 gactgaaact atggaaagca aaaccacaga taagggagga ctactatagt tatgataaaa 81541 atcaggaact taatctagcc acgagaacat ttgtatttct gatgaaattt ttattcttgt 81601 ctaaaatgtt gaactatatt tatatttata tctttaaata taaatgaact atatttatat 81661 ctttataaat gagctatata tttatatttt atataaatat aaatattttt atatctatgt 81721 
aaacataaaa aggctttttt tcctctctgg atgtttactg gggttattta ataaaaacct 81781 agggatgata ttgggaattt ttgttggaat tttaaatttt aacgtgtttg tttctttttt 81841 tgttttgttt cgttgtttgc tttctgagct gggagttggc tgtgcatgtg gcagttgctt 81901 tatcaaaagg gtcttgctct gcttcagaag aagttttgtt accttgattg ctaatgttct 81961 cccccttaaa taaacctcct gatgttggaa cagcaaaatt gcattatgtt tggaaaatct 82021 aaattaataa tcccttatct ctgaataaaa gtaactgaat caggttgaat ttactgcttc 82081 cattttaata ggacagtagg aaaaaactat gcatgggctt tatgtggttc aaattttttt 82141 tttttttttt ttgagaccga gtctcgctct gtcaccaggc tggagtgcag tggtgtgatc 82201 tcagctcact gcaacatccg cttcctgggt tcctgcgatt ctcctgcctc agcctccgga 82261 gtagctggga cttaacaggc gtgtgccatg acgcccagct aatttttgta tttttagtag 82321 agacagggtt tcaccatgtt ggccaggatg gtcttgatct cttgacctcg tgagccgccc 82381 gcctcggcct cccaaagtgc tgggattaca ggtgtgtgcc actgtctccg gcctggttca 82441 aattttgtaa tctccacatc catctgcata ttaacaattg tcattgttgg tgttatagag 82501 agagaagctg aagcacagaa aataacacgc atgttctaga tttcctggtg ggaactggaa 82561 gtaaattctg tgtttggctg atcactgtat aaagttctag tactagtctt tagggaaaag 82621 ccccgtcttc atggatttta atttgtcaat ttgcaattct acctacttgg aaatagtcaa 82681 cctaatcagt tactgccctg ttgcaccaac tattaccata ggaatctgaa gatggtatgc 82741 aaccaacatg aaaaagagat ccttttctcc tggtaatcag tttgtaccct acaaacaatt 82801 cgttggtttt aagttgtata gagggaacca tgatgtgaat atgcctgcca ctgggtttat 82861 agaaaaccga tgactcctga ggaaattctc agcagggttg tctttcagtg aaaagaaccc 82921 tttagaaaga tgcagtaact cacactttac tgcacataaa gtcatgtccc tcagggtgga 82981 ctctcaggtg tcatacagtt tatgactcat gagtgctgca gctttgtgat ctaataaaca 83041 agtaaagacg gtgggagaat tatattcctg tacaaaaagc cacttcacct ttcattatat 83101 aggaaaaaag ccatgggaat attaaatttt tagtggcatt tttagagcat tttctgtgat 83161 ctaaagtgcc tgcattgaga tattaagata cttaagaaca aagtaaaata aagagaagta 83221 ttaatagcag gatccctttg gagctaggtt actgtggcag tggtttcaag aataactctc 83281 tcagcaccca ccccaggatt ctgatttaga gatttctttg tgtgttctct tagcatatac 83341 ctctaccact gagtttatcc cctgctgcta cactgattga taccttattt 
ttaaattttt 83401 gtttttaaaa tagacggttt tttagagcag ttttagattc acagaaaagt tgagaggaag 83461 gtacagaggt ttcccatata cttgctacct gcacacatgc ttagcctctc ctattataaa 83521 catcccccac agaatggtac atttgttaca atcgatgaac caacattgat acatcattag 83581 cacccaaagt ccatagtttc cctgagggtt cactcttggc ggtatacatt ttttgggttt 83641 ggacaaatgt ataatgacat atgtccatca ttgtagtacc atacagagta gtttcactgc 83701 cctaaaaatc ttctgtgctc tatctgttca tcctccccct gcccccaacc acccaaaccc 83761 gtggcaacca ctgatctttt tagtctccat agttctgcct tttccagaat gtcatatagt 83821 gggaagaatg tcatatagta tggagcctat tcagatgggc ttctttcact tagcaatatt 83881 agttaagttc ctccatgtct tttcatgggt tcatagccca tttcctttta gtgttgaatg 83941 acattccatt ttctaggtgt gtaccccaca gtgttattta ttcattcaca gtaccttatt 84001 ttttaactga tttttttggg gtctgggact aaggtgaggt aaacaaggca gtcaggaatt 84061 gagattaaga atttttaatg taatattttt aaaaatcaaa attactgcaa aaaaactcca 84121 tgatgaacaa aatatcaact ttttagacag agattgagtc acgctcaatt gacatcaagc 84181 catgcatagt tctcctactg ccttattaaa atggaagcag taaattcagc ctgattcagt 84241 tacttttatt caaagataag ggattattaa tttagatttg tcaagccctc tgaatgtcat 84301 ggacaattgt gagtgaaatt ccgtgtatcg tctaactccc catggatgtt atcttgctca 84361 gaaaccttgt tctccctcca aggaaggcac tacttctcct gtagccttct caggtcgaag 84421 ttcttttctc aaaatcctca cagcgttgcc cctagaggtg ggaatgtcct cttttcctct 84481 tcatttcctc ttttcctctt catttttgtt tccaaaccac tcttcctttc tcttccctaa 84541 aagcccccct acctcagctt gaagtgcatg ctttcagaca aatcactttg ctcctccacc 84601 tgttggtcat ctccagacct ctttctacct catcgatgtt caccattcag actttgaaga 84661 ttttagctcc tgccttgctg tcatcaggtg tagctgcttc atttgctgat tcttaagtga 84721 tcctaatatt gatctcaatg atccttccag tattccaatt tgtcagttca tgattctcat 84781 ccctccagtg accttgtttt catcattctc catcacccat tgcttacaat ggtcatatta 84841 tactccttgc aatgaccaat cactgcaact cctccaaaat tttaatttca attatttgac 84901 tctccaagta ctacttatta ccacctcact ttctccagtc ctgtgcgcca tacattcttt 84961 agcccctgga gtgtgatcag tggactcaga ttccactggt ccctaccacc ttttacctgt 85021 gtctaattcc cctcatgttc tcacttttgc 
cacttgagat tgcatggccc agcattctaa 85081 tcattccctt gcttataccg tcacttctct attcgttgtt atttatctgg caaaacaacc 85141 accctgttgc aatgcaaggt ttatgcaagg gaatgcacat tttcgaaata gatttttatg 85201 tttatgtttt tctttcaaat ctctactcat taataaatag aagtggtttg atatttcaca 85261 aatgtgattt aattgatcaa gtttcacatt tcattctaag tgctcagtta aacaccatga 85321 tgtattcaca acagtaatca aagcaagggg ctgtagttag ttataggagc aaagagaatt 85381 catgagttta tttttatcta aatgacaccc ttgataactt ttcgataagt tattttcatg 85441 taaattactt ggaataagag caggattttt atatagaaag ctttctatgt gtcatatcct 85501 gtaaaatgtt tgtccgatta gtgagagata gggggaggtt gaattttact aaacataata 85561 tccccagcca tatctactgt aacgtttgtt taggacatta cagactatta tcctacacca 85621 atttatttca catcaaaaat caccccatgc ttgtaagaat cctcctttcc ccctcttgtt 85681 tctattttag ctcccctgga aaaataatat tttaagtcta aatgagtaca ataaattaaa 85741 aagttgcatt gcttaatatt ttgaatggaa tagctagatg cttgttcatt tgctatcttt 85801 ttctttgtac tgaatttctt tagcttggtc agaggtagag caggcttgac tcttagtgga 85861 tagtggtata ggttgagtat cccaaattta aaaatccaga gtccgaaatg ctcaaaaacc 85921 caaaactatt tgcatgtcaa catgatgctc aaaggaaatg ctcattggag tattttggat 85981 tttggacttt ccggattcca gatgctcaac cagtaagtat ataatgcaga tgttctaaaa 86041 tcggaaaaag tcagagacct tgaaagcact tctggtccca agcattttgg ataagggaca 86101 ctccacctgt accttacagt ggaggcttgt tagatgcttg taaatgccag cccctgcctc 86161 aagtaacaat tgattctttt tgtgtgctct cccaggtcta ccctgaactg cagatcacca 86221 atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg ggccgcaagc 86281 agtgcaagac ccatccccac tttgtgattc cctaccgctg cttaggtgag ccggccggcc 86341 gtggggctgg tgttgattgg gggcctggtc ttgagggaag aaaaagagga tgctcctgtt 86401 aggtcacata cacagacttg ttcttcagca cattgccact ctgtgttgta ctgtgttttg 86461 gactcttgca gttacattct gtgcactgac cctataggag cagtattttt gagttccctg 86521 cctcagaatg aatttaccca gggtgtatat tgaaattaca aattcctggg ccagttccag 86581 gactcctgaa tgaaaaatgc ctatagtagc ggatccggga attcttattt taccgtatcg 86641 catagatgat tctcatgaac aggggccttg tgtgtttctt cacatagact ttctagaaga 86701 aagaatctaa 
tgtgaagctg cagcattttg ttaatttcta aaaaaaaaaa aaaaaaaaaa 86761 aaaaaggctg ttctacaata actactcctc ttatttggtg atagtagagg aattggaatt 86821 gggaaggctt tttttctgct tacatcaaac tggaggaaac agtctgatta ctaatgttta 86881 atgtttgttt tgtcacaaac ctcaaattta cctcaacatt ttatttaaaa aaattgagta 86941 gttttatgtg gtatcgtgct cttttcctct ggaaactgtc atactccata tttaatctaa 87001 cttggttatt agtctgttcc tagttgttgt ttttatgtga ccagtcattg cagaagtcaa 87061 gttctttgct ctggtatctt acatctagat gttttccaca agtgtattca tcagtaggca 87121 ggttgtaaat aaggatatag agattgtctg tggagattgt ggacaaatat tgatgtatat 87181 tctgatgact tcttataaag gttaaaagga atatttcaaa gaaaatccaa cagcaaactt 87241 acaaagtaag gatatttact gaagattccg gagatctatg taatatatgg tcaaaaatga 87301 atgtcctaag agcatactga ttgctgttca ttcatgtgca taatttgaga cataaaatga 87361 aaacactggg cattagaaag aagatttcac ctggtgaatt tacttacaat gtaattgatg 87421 aagagagtgt agaaaactag atttcttaaa catctgataa ttcaagagtt tcctttttcc 87481 tccttattcc ttgttcctat atgctttata aggcctcaac tcttttctgg ttcatcaaaa 87541 agtgcagttt aattagatga aaataagcag cacttaaatt ctagatacag aactcctact 87601 caaaaaaata tagaagtcat ctgtccccag gaactgtgta ttttatactc acttagttat 87661 tacaccaaga aataacacac agagattgaa caattcagta caagttagga agtagatcat 87721 gagtctgtta atttgatggg ccaggctttg ttaatgattt gtttcttttg gttcttactt 87781 acctgttcta tgaagtattc caaatatatt ttcattagaa tgtaagctca ctgagaacag 87841 gtagggcttt ggatgggcat tattgctcct ggtatgacac tcaaaactgc agtcatgttt 87901 cagaagtgaa cttatcattt tacatagatt tttctgcttc atattctatt cagaatgatt 87961 tgtattaaag ttgatggaaa gatgaaatcc acattaagat tgttatttac tacctagtgc 88021 cagttgttgt agcaaatatc ctttgaaccc caggtctatt ttatcctatc tgttgatcac 88081 ttcctttgcc cctgctcttc aagcctcctc atctctgaaa agctctcctt ccactgggtg 88141 tggtggctca agcctataat cccagcactg taagaggccg aggtgggaag attgcttgag 88201 cccagcagtt agagaccagc ctgggcaaca taatgagacc ccgtctctta aaaaaacaaa 88261 aacaaacaaa aaaacctcca agtgaacgaa cagtacaaca agtaataaag gatataaaag 88321 atgttctcag atctcaatat caagtcagaa gattttataa taatattata attacacgga 
88381 gactgtccaa atatttgtat ttgttgtgct acactgcatt atgcaccatt tagggaaaca 88441 acagcctaag taatgtggtt ttaaataatt aatctcatct accaaatacc tgtttgatag 88501 taaagttaat gacctgtcct caccttatgt ttttaaccat tatggaaaga gaaccacttg 88561 aaaatatatc attccactga cagataatta cccagctgtg aagcacaggc tggagactgg 88621 caaggagaaa tggagagaag cctataaaat aattctttct cagggaagga gtttattcct 88681 gccaaggctt tgccatcaac cattatcaga tttctctttc atcctccgaa cagattctgg 88741 gctcctttga tgtggaggta cataattcaa aaagttgggt cacttgtgac atgttaagca 88801 aatttctcag ttctggttgt gtctgatgtg aaatgagcac gttgacacta actgttcttg 88861 actactatgg gactagataa tcatttgcta tttcctgtag gtacttggcg tgttgaatca 88921 ttttaccata ttttctcatt ttatgaatca ctggtccttt tccattggac ctgaaattag 88981 accctaatat gacaagtaga gcatatgttg tctcttactt tgttattaac cttagattag 89041 aattaaggaa tataggtgct gcttttttct taggtagtgg caaattttcc aaaggtctca 89101 agcccagagt ataatacaga gttgataaaa taacctggta gagtatgtag agcatgcctt 89161 gtctttcagg aaaaggaact gaaaaacata ggttttccaa taggtgtaac ataggtaaca 89221 tagggacata ggtgtaactg atgtacattt tacttgcact taatagacag gcagtgttag 89281 ttcttcgtct ctcttcacag gctatacaac taccaagatc cagagaggtt gaatttcttg 89341 ctataactca cagcaaataa atgggagagt cagcactaca gtccaagctc tcttgattat 89401 tggtccagtg ttttttacag tatacacaaa aatttgtttt aaaacaaata ttcactatga 89461 attagagaca atagcactaa aaaagaaact cacagctatc gttgtgttga cagaggagac 89521 aaccatgacc tagtatagga taaatctcca tgtgatatgt tttctacttt atagtagtag 89581 taggtaacat tcttacatat tattcctgct taacaaagaa ttactgttca gcattaacat 89641 aagtatgctc taaaatctct tgactgtatc caaatatatg ttttagacaa gagttttttt 89701 tttttactga cctagcaaac aaaataaaat tatcttaaac attaagcgtg ttaaaattca 89761 aaacaaagct tgaagagaga aaaatgcttg ttatgtgtgg catacaacct tcctaacctc 89821 tttatcagat gaaggatcca gtgtggcact ggattatttt tctgtaatat tttgcctcgg 89881 acatatgtgc tatacatttc ttgtctgcat ggggaaagta tgtcaggttg ctgttgccac 89941 tttcttccca tggagtggat aatttccctt gagaataagt tggctttctc atctcaccct 90001 ttgcttttgg tagtacctct ggaaccagag ggtaagattc 
atcttaatca tgtgaacatt 90061 ttattaatta tctcatgtgt gattcaagca catcattact gttactcaag gagaatattt 90121 tctttgcaaa gctttcaacc ctttccattt gaatgactgc tcttcatggt gtaatagatc 90181 cactgttgaa attggcactt ttctgtactt gtctctgacc agcaaaggca aggaacattg 90241 cgtaaaataa attactgttg aattattggg gcaaagtcag gttaatagct taatccaaaa 90301 aaggaattac aagtgaaaga attttacctt aaatgtcatt ctgtttcatt ttgtaattaa 90361 gaccttccaa atgtgtaagt agtaatgacc agaaatctta tcagatatgg tcaaatgaga 90421 aaggtattca attcatatta attttgaatt ttattgtttt aatacaattt taggaaaagc 90481 atgccttgtt ttctgggttt cttggctgcc tttgtatcaa ttatctacag ctatcttctt 90541 tagcatggaa aggatgtatt tgaatactgt gacattttat accagggtaa ttctttgaat 90601 ttatgtaaac aacaactctc ttgagacaca tcagtacaac acaatagtca gaaatgtcac 90661 atgagagttc accaaacttt aaagatggcg atgaggatgg aaaaacattt ttattgtaga 90721 ctttaaggat cttgtaaatc cccgtatgta atccagtgtg ctagagaatt ggtgcccatg 90781 gcgcctcctc tccccaccga cctgcactgt ggaaatccta ctaattttga agtcatggga 90841 gatgcatgtg tgggacctat acatgcactg tgatgagaat gatgacttca tcattctcat 90901 tatttaatga gcccagctgc tgcatacaag aaggatgcaa attcttgaat gttcttgatt 90961 gtttacatca tggtctcata aggaatgacc agaattgctg ggcatttctt catcaggaaa 91021 ggctaatcag ccatccttgg agagacatca gactccttta tatgtggcct tcccatggga 91081 cctttgaatg gtcaacgtta atcatgcttc agaaccagca tactaagtaa aagaaaacat 91141 gtccggaaaa gaaatcttat cagtgtttat ctcccaacag tgggattaca ggtgattttg 91201 attttattta agcaattttc tatgactttg tataataaat gagtagcttt tataatcagc 91261 aaaagataga ctatgaaatc agtacatata attttgtatt tgtatgtagt accattgaag 91321 gataatagaa aaaagtcaaa agacttgaaa gtcgacacag aatgatcatt aatttgtttt 91381 atctcttggt agttgtactt tatgagcttt atacatacag tatgggaaaa tacactttag 91441 ttttgaaaac accatgatgc caaaataaaa gaaaataacg ttcattcata cgatcttcaa 91501 caaaatttaa gcagcattaa tcatcactga ttcttcagga agattcttga tcttccttaa 91561 ttttttttcc ctccaaaatt gcatatggca gccacatttg tttcaggcca gaaggaagaa 91621 tgggaagagc ttacatttat ctcacagatt ccttctgggt gaggaagact tagaaagtga 91681 agccctcttc catttgatta 
agcaagaaca cctataaaaa tggtgtcttc ggccgggcgc 91741 ggtgctcacg cctgtaatcc cagaacttcg ggaggccgag atgggcggat cgcaaggtca 91801 ggagattgag actgtcttgg ctaacaacgg tgaaaccccg tctctactaa aaatacaaaa 91861 aaattagcca ggcgtggtgc gggcgcctgt aatcccagct actcgggagg ctgaggcagg 91921 agaatggcgt gaacccggga ggcggagctt gccgtgagcc gagatcgcgc cactgcgctc 91981 cagcctgggg gacagagcga gactcagtct caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 92041 tggagccctt ctaggtaggt catcctccat ggctcactgc agcacttcct tggccctgtc 92101 ccaagttcca taattgcctc aattgcattg gctcctgctt gatttgcttt ctaaaactca 92161 ggacagagga tgagttgagg agactgactt ttttaacaat cgcattttga aaagaaggaa 92221 gcgtagtggt gtttcttggc tccgtttcca gttttgtttc ttttcatttt aattattgta 92281 cctggtattg tgtcctcttt ctgggacttt taacaactaa tgttttgatt gataagagct 92341 ctaatagcat gggtgattta tacttcatta catcttcagt tttgtcttct ttactggagt 92401 tgtattcagg tgtcattaag taacgaggca ttaaaaagaa acgataactt ttgaaaaact 92461 gctgttatgg cttatgagtg ggctttgctg atattttctt gaagcacaga atttatctat 92521 ctctgtaggt cctagatgat aggtgagaaa aagaaaccca gcttatttgg catgatattg 92581 tcccaaggct tgttagtaat tgctttgaaa aatgaatagc aaggctaatg gacatctgat 92641 ttgtagtaca tgctaagaat tgcaaatgtg tgctcaaaag aaatttctct gattacaaaa 92701 gtaacatata ttcaacgtac aaagttgaga aaatacaacc actagctaaa aataatcaaa 92761 accatttggc ttccttgcag tctacatttt tttcatgtat gttttgcttt tgtttttagt 92821 gttatttaat cgtttcctgt tttgtgactt gctttttaca cctaacattt ttagaatact 92881 ctacctgtga aattaaatat tcttggagag cataattttt aatgtagcat cacaccagac 92941 actgtaatgt gtttactctt ttttatggtc atttaggtgg ttttctttat tttgtggtgg 93001 tgtaatagtt gttataaatg atgctgtgat aaacatcctg gtagataaat ttgtgtttag 93061 ttgattttta tataacagtt attggcggaa tgccttgcaa atcttatagt tggtgtaata 93121 aaatatatgc gccacataat tgtttatgaa aatatggagg aaaattttaa tactgattta 93181 tgatcctgag tttaacataa gcatggatac ccctacattt agctatgatt tttttattga 93241 tagagtaaaa gcgcctgtaa tttctactta aggccattta aaataattac tatttaaatc 93301 tatttaaaca attttaaata ttgcttttcc aaatttccac cttaggaagt agttaattga 93361 
atattctcag agaaataagt cctacaaaat tgaccaataa aaggtcagtt acgtattcat 93421 tgtgtacgtt ttattgatca cccacagtgt gtgagcccct ggaggtgggt gaaatgtggg 93481 cctccctctt acagagttga tagtgggcct gaaggggatg ggatttcagc tcaggaattt 93541 aaggaagact tgacaggaag tagcattgtg agccaagatt tcaaggataa gatttcaagg 93601 caagaaagga gatgtatttt aggtattcag acagagggtc aacaaagtct ttaaggtagg 93661 aactagttca cttttcaaaa agaagaggta attggtcatg ataagagtag tagtttttaa 93721 aaatcattac tgttttcatt catttataat gaatactatt attttataat tttaaagttt 93781 atatccaaat aattgttttc aagatggtaa tgaatgtata ttgcttttat gtatacacat 93841 agtacttttt gctttttggt aactaagttt ataatttttg agggcaggaa atatttttgt 93901 ttttcactta caaagaaact atattttaat aagagttttt tgacagcagt gctttatgcc 93961 aaaggaaaat aaagcaacat ttaacatatc aaagaaagcc aatgcaaaac aaggattatt 94021 tatttattta tttattttat tcatttttga gatggagttt cactcttgtt gccccggctg 94081 gagtgcaatg gcgtgatctc ggctcactgc aacctccact tcctgggatc aagcgattct 94141 cctgcctcgg cctcctgagt agctgggatt actggcaccc gccaccacgc ccagctaatt 94201 ttttgtcttt ttagtagaga tggggtttca tcatgttggc caggttggtc tcaaactcct 94261 gacctcaggt gatccaccca tcttggcctc ccaaagtcct gggattacag acatgagcca 94321 tcatgcccag ccaaaacaag gattttatat tcaatcaaac tgacattcag atatgatcac 94381 atgaaataaa tatcaacgta caataactca gggcatattt tcacattatt ttcctgtaga 94441 atctgctaga gaacaagttt cagataccca aaatacctag agaaaaactg atataaggat 94501 gggtaatgag cattaaggag aaacataaaa actaagactc aacgatggta aaaagtatga 94561 taatatagta ttccacagct acatgctgtg acaataaaga gattctgtaa ctgtcaggcc 94621 tctgagccca agctaaacca tcccctgtga cctgcacgta ttcgcccaga tggcccgaag 94681 caagtgaaga atcacaaaag aagtgaaaat ggcctgttcc tgccttaact gatgacattc 94741 caccacaaaa gaagtgaaaa tggccggtcc ttgccttaag tgatgacatt atcttgtgaa 94801 attccttctt ctggctcatc ctggctcaaa aactccccca ctgagcaccc tgtgaccccc 94861 acccctgccc cccagagaac aacccccttt gactgcaatt ttcctttacc tacccgaatc 94921 ttacaaaatg gccctacccc atctcccttt cctgactctt tttcggactc agcctgcctg 94981 cacccaggtg aaataaacag ccttgttgtg acagtaactt taatggggaa 
gaatgagaag 95041 aagatgtggg taaagaaaca actgtttttc tattaatggt ggtagtatta gtgtgtaatt 95101 ctgagcctgg tgcataagta atatggtgta acataagtga atagttgtgt gatattttaa 95161 ttcaatcatc ccatgtattt gagaatcaga attttgatgt ggtaaaagaa gattcacatg 95221 taacatataa tctattaaaa agataccctg tagtcatgag tttgactttt aggtatcatt 95281 tctcagtata acatattgtt gtagatttca gataacattt tttccccata aatactgtcc 95341 gctgcaggcc taaaaataat gaaaaggcct aaaagaccaa ctcagtagcg gtaaacacca 95401 gtactgcacc tagattatgg tctttaggaa ccattttcca gtaaaagata ccagtaatct 95461 ttggaaagca ggattttttt ttcttttttt ttttttttga gatagagtct agcttcgtcg 95521 cccaggctgg agtgcagtgg tgcgatcttg gctcactgca acctctgcct cctgagttca 95581 agccattctc tgcctcagcc taccgagtag ctgggattac aggtgcctgc caccactact 95641 ggctaatttt tttttgtatt tttagtagag acggaatttg accaccttgg ccagtctggt 95701 cttgaactcc tgacctcgtg atccacccgc ctcagcctcc caaagtactg agattacagg 95761 catgagccac cacaacccgg cggaaagcag gaaatttttt aaaagctatt ttaattcagt 95821 catgtaaaaa aaatgagcag aaagaaacaa aaaacttaaa aaaaaactta aaaaatgagc 95881 agaatcttag gaatcacctg cgactttggt tattcatgct tagccatttc tataggaaat 95941 tccagcacta tttgagttag accactaatt atatgtgtgt tgtagctttc tccaagagat 96001 aatggaattc acaaatcaaa actttgagat tattaagtgt tacaagtgaa ttataacccc 96061 agtcatttgc tttcaaatga acagctaaac atattttgtg ttaatttctt atggatgctg 96121 ctttgtttgt ttgtttgttt attgagacgg aatttcactc ctgtcaccta ggctgaagtg 96181 caatggtgag atctcggctc actgcaacct ctgcctcccg ggttcaagtg attctcctgc 96241 ctcagcctcc cgagtagctg ggattacagg cgtgcaccac cacacctagc agatttttgt 96301 atttttaata gagacagggt ttcaccatgt tggccaggct ggtttcgaac ttctgacctc 96361 aggtgatcca cccaccttgg tctcccaaaa tgctgggatt acaggcgtga gccaccattc 96421 ccggcctaga taggctttta gttggtttcc atatacagtg tatataggga gaaacagact 96481 aagacagaca ctggcatcca cagaagtaaa aataaaaagt aaggtataat ttaaatttca 96541 cacggtcatg gtcacatttt tcagcttttt gcacctatgt tgtataaaaa agacaaaaat 96601 gcctttagag gctcaagttt tcaggacttc ctgaagctgt gtcatagttt aaaaatgcca 96661 taaaaatgcc gtcagattta ataacgtatc 
agctgaacac tggatttctg ttatgttgag 96721 tcttcaaaaa tttctccctg tatgaggaat taatactgtt atgtctgatc tggtggtctg 96781 aaatgtgaac ttttctgggc caaccagtaa acagctgctg ttgctttata gctgcgtaat 96841 tcagtcttgg aaattattag ataattccca caccatttca gcttgctgtt ttaccataaa 96901 tttaatgtga acagtatttc tagctaacct caagctgtgt ttattgtcgt ggaattttct 96961 cttttattct gaggtaaatg attacaaatg gttatgatga ggaatcaaag aagaaagtgg 97021 actttcttca aagttctttt tctactctag ccgcttctga gcatctttct caagcactgt 97081 gtttccatca gtccctaccc caagcagaga aaggaaacag ttttcccctt tgtcttcaga 97141 accctcttct catcgtgtca ttttccaaag caaataaaaa tcttgagata ttgtcttgag 97201 tttgattttt tttttttttt tttgagatgg agtttcactc tttgttgccc aggctggagt 97261 gcaatggcac aatctcagct caccccaacc tccgcctcct gggttcaagt gattctcctg 97321 cctaagcctg ccaagtagct gggattacag gcatgtgcca ccatgcccag ctatttttct 97381 gtttttagta gagacggggt ttctccgtgt tggtcaggct ggtctcgagc tcccgacctc 97441 aggtgatccg cgcctcggcc tcccaaagtg ctgggattac aggcgtgagc caccacaccc 97501 tgccgagttt gatgtttttt ttaaccattc aagccagtaa ataactttcc agacagctgt 97561 agtagaaaac agaatttgtg tgtgacaaaa cttaagattt tgggattgag agctctatgg 97621 tttatgccat gtggataatt gtagccagat gaagaatatg aaatttaagg aagtgtttag 97681 agtggtacag acagctatta ttacgattct tggggtagaa gagtagctta catacaacat 97741 aaaataatgg cattaactga attaacttca gcctttgcca tgaaacccaa gctttcatta 97801 tgttttaaat aatgaactac attaaaaata gtatataaat atgtaaaata tattagacat 97861 agacttctca taataaatta tgggaacttt tgatgtattt cttccctagg aaaagaacta 97921 aacaagtgct tctttgtgga cttaaaattt ttcagtttac tattgtgaat taggtacagc 97981 atatctttat aattccagat aatacccata aaagcagaac tgtttattta cttattatga 98041 agggaatact gaaatgaaat taaggacaat ttttttctca cgtattatat ttttaaaaag 98101 ttatttgatg tttgtgatta aataaagaat gaatttttaa agccagttct caattagcac 98161 tgaggaaaag aatatgcttt ttttcaggta cacaaataag catcatggat ataagaaacg 98221 acataaagat gaatctacag tctggattaa gacgtatttt aatttcagtg tcaaaggctg 98281 ttaggaaata ttcagatttt agtacttgac aaagaaggtt ctcaatattt gtttcagtgt 98341 tcagtgaaag 
gataaattct gtcgtctttc tcttttatca tattagtcac tgagtggttt 98401 cattaaacat tccctcttcc atatcccctg ataaatttca acttgttggt gatgcatccc 98461 aggagtgtct gtcttttttt tttttttttt ttccttctcc tgtcagtgag tcttaggctt 98521 tggctttgaa accctatagt gaagggaaat ctcaggcaac agctgtgcaa ccgtctggag 98581 agcagctagt ccagattgga ggagacgaac gaggctggtc gcaggagaga ggcctccagg 98641 aaaaaatggc attgatgcat tacctgcgtt aaagattttc ctgtacttag catgcttctt 98701 agggttctac tggagcgttt gggaaagata cagtattgat ggatgcatag aaaatgaggc 98761 aaatttttta aatggggcaa atattaactg tagacaaaga aaagccaaaa gtacaaaata 98821 acagcgttgt aagatacgaa gataaaaata cgcctaagta tgagcgttct ggcagactca 98881 caatacagat gccatatatg gggtaaaacc ataccagtag gatggtggga agggaaaggg 98941 gtgggctcat gttccatagt gggaagacct acagcggaaa aatcaggaga atagttattt 99001 aagcacagag tttagggagt aagttgcaga agagacaact aaatgagatg gccgtgaatg 99061 atttttatga cctcctcttg ttatggagaa atcgatcaag gaggcctata tttgatggca 99121 ggaagaagtt gagtttaagg tttccagaca tttgttttaa tctggaaaat ttcttttttt 99181 ctttctttct tttttttttt tttttgagac agagtctcac tctgtcgccc aggctggagc 99241 gcagtggcgc agtcttggct catcgcagcc tccgcctcct gggttcacgc cattctcctg 99301 tctcagcctc ttgagtagct gggactacag gcgcccacaa ccacgcccgg ctaatttttt 99361 gtatttttag tggagactgg gtttcatcgt gttagccagg atggtctcca tctcctgatc 99421 tcgtgatccg cccacctcgg cctcccaaag tgccgggatt acaggcgtga gccactgcgc 99481 ccagcctaaa atttctttta agcttagaag ttgcaagtca agttactgtt aaaggcgtcc 99541 taactttgaa ttagataaag ctaaactgtg gtttaattga gctgagtttg ttgtatgcta 99601 tatctcaact tcacaagatg tttgcttttt cttggctata acctaagatt atacaaatat 99661 aaaattacct ttatgcccat ttgatagact ttagggctga atttccctgt gtaagatgtt 99721 gtagaggttt gtacttgtga ttgtttttac tctagaagca tgtttatctt cactccttta 99781 gtgacacgtg tttgttcagg aacttcaaac tcaatgtgat caaaactaga ctcatgttcc 99841 ctgaatagtc tctcctgact acaaatctgt tggaaacttc caatcctaat taataattgt 99901 aaacttttta atgtaagtca tatgaggggt tgttgtgagc ataggaggta gggagatggt 99961 agtgtgagct ggagcatatg gtaaaagttg tttgaaagaa agtagaccca tgatgtggat 
100021 cctgagggat acgtaaagaa aatgaagagg tgtagattta aaaatgttta tggggcctgg 100081 gcgtggtagc tcacacctgt aatcccagca ctttgggagg ccgaggtggg cagatcacct 100141 gaggttagga gttggagacc agcctggcca acatagtgaa accccgtctc tactaaaagt 100201 acaaaaatta gccaggcatg gtgatgggcg cctgtaatcc cagctactca ggaggctgag 100261 gcaggagaat cacttgaaac cgggaagcgg agattgcagt gagccgagat cgcgccactg 100321 cacgccagcc tggatgacaa gagcgaaact ccgtctcaaa aaaaattaaa caataaataa 100381 ataaatatat tcatggggat gttccaggga aagtagtttg tgttggtagc cttgagaaat 100441 aataatagaa aattagaata gaactaagtt ggttcattaa agttggaaca cttgtagaga 100501 tttttgattt ggacagacat taattcattt gattatttct cttattcgat gtccagaaaa 100561 atatctgtac attcaatgtg gttttatttg tactgcacat gtcattaata ggctaaagtt 100621 acctctaaga tacctgcttt cttagtcctt gttacgctgt aaattctggg catcaggata 100681 tagcttttta ttatgttagc tcttttaata gatgatacaa ttggattggg gaaaaaactg 100741 attatactct ggagtaatac tttcatagcc tgtagtatcc tgcaaatagc tcctggtcat 100801 cttcgaattt tactagcaaa cttgtagggt aataggttgg gacaaagtat tataatcagt 100861 ggctttgagg gcctgataag aggaaacaat aattgtgcag ctatattgtg aataagtagt 100921 gcatgatgct gatggttaac cctttcctgc ggtgctcaaa cattgggaaa taattgcttt 100981 ttgttcaaaa tgtaagctct tcgataattg agacttttgt caattcactg aaatattccc 101041 agagcttgaa catgtctggt acatagtaca tatttaataa atatatagat ttaatgaatg 101101 aattgaggat gtagccaaat attggaaaag gaaagggaaa agaaagaata gcccaaacat 101161 tagaaagaca gaagatgtgg agcagcaatt tcacatgcct aaaagaattt ctgagattat 101221 ggcaaaatca tttactttac ttactcctta ttgatcaaca aaggaagagc aggttttatg 101281 taacaaccaa gaagtaacat agggatctca gaagagaaga aaccaagtgt ctttggatct 101341 taataaaaag aacattcaac tccagaaata gggttaaacc tatcctacac atgtctagta 101401 aaaagtcagt gtatcagagg catctgtgtt tagagataaa acataccatt agattcattt 101461 ataatagttt cattgtcctt agtttttaag gctaatgata aagatttttt tttttttgtg 101521 agacacagtc tctctgtcac ccaggctgga gtgcagtgac gcaatctctg ctcactgcaa 101581 cctccgcctc ccgggttcaa gcgattctct tgcctcagcc acccaagtag ctaggactac 101641 aggcacacgc catcatgcct 
ggctaatttt tgtattttta gtagagaggg ggtttcacca 101701 tgttgcccac gatggtctcg atctcctgac ctcaacagat ctgcccgcct cggcctagtg 101761 ctggaattac aggcgtgagc caccacgcct ggccttaaga attaattttc acagaaatct 101821 gcttctaaat attattcagt gataatagat tttagaaatt tcagattctt atgttgtacc 101881 ttaaaggtta ggatactatt tataaatgct aaacatgaac tcctgtcatc ctattttaat 101941 gtctttatta agtgattaca tgaatgtggc gtttcagtag gtgtttggtg tattaataga 102001 tgtaggtact cccctcttaa acctttaagt attttcatgc catgaaaaat ttggggggca 102061 ttctctttat ttattgatta ggtacctgta ttttcaagaa tattactcag cttcagttcc 102121 cccaaatctt ttttagtgtt aactaccttc cgagaacgtg gtataacatt ttaaaatata 102181 ttttttgaag gtttggatgt cactgaagat ccccctattt attggatggt atctttatca 102241 gctgggaaga taattagttt cccaatgttg ctatattttt atgaataggt cctgtagctc 102301 ttgctttctt gaaagcttag aattgattgt aagccatgga ctaatgatat tctccaagca 102361 tatttaacat gaataggctc atgttcagta tgtcaagaag ttaccctgtt gtactattta 102421 aataatttga gcttagtttt ctttttccac ctttgttcta aattcttagg tgagtctgaa 102481 ttcctgggca tctgtgcaca cagcttcttc agtaggtcct ttactctcat tgtatctgat 102541 ctttgacttc ataaagaccc ttggcctact gaccctgtat ttaatttcac atctcaatag 102601 tatttagcct tctcatattt ttcctcttct ttagagctcc ctgatccctc actccagtca 102661 ttcaactctg tgtattcctt taacttccac cctcgctctg tctcttcaca gatatgcttc 102721 tttatatggt agtcagaagt tcctgtgtga tcttcgtaac ctccagttcc ctccacagtc 102781 acctgttatc aggcatctgt tccttgagct ccagggaatg agctctttgc tatcattctg 102841 gttgtcggac tcactgggca ctgttgtgtc ctccacagtc tctcagtggc atcggttgat 102901 gggtgcattc tccccacact gatgtcattc tgctgcttct gattttcttt ctgctgtcct 102961 ccattttcct ttcaggctcc cttccccgtc ctccccgcgc caccttttgc aatatttagg 103021 attcagaaat tccaaatatt gcaaaatatt tagaatcaga attcagaaat tctaaatatc 103081 agtgtttaat aggagtaccc tggggttgtc cattttctca cacaacacat acttgctagg 103141 agatctcata agtgcagtgg tttttcagag gcctgcattg atgattcctg agtcattatg 103201 tccaggccag gcctcataca tactatttac tggttctctt tctgcgggtg ctccaccaag 103261 aattcagact caatggaata tttcttctaa gatgtacacc 
agttccttat caaaaaatgg 103321 caaattcaga cttcttaggt taaacagaga actcgtgaac atggcatatg tctgttattc 103381 catttttctc tgttggctta ttgctgagta ccatttggga taaaaatctg tttcaggcaa 103441 aggataggat tgttttcagc ttttattaaa ttctcctaga cccttgattt tatgcagaga 103501 actaattaaa tttgaatgaa tttaatgtag aaaccctgct attgatgcag aaatcaacct 103561 gattctggac tagcaccaga atcaactatt tttgttgttt ttgctttctt tttttttgct 103621 tttgctttct ctgccggtaa ataacaacta gtatgtaaga atcaaatgaa tcaagcataa 103681 atccctagga aatatttatg cattcatttt taacaattta aagatgtgaa gatatataat 103741 taagcataat cctctcaaag tttgacgttg aaaagtattt cagatacttc aacttaaata 103801 tattttttgc aaaatatctt ttttaaaaaa acagatgtgg tctcagcaat gtaattatta 103861 cagtgtgagg agagcttaca gttgtaccaa aatttataac taggaagaag agaatgtgaa 103921 ataatccgga aacctacaaa actgtcctat acaggaatac caattaaaca gctactaaaa 103981 atgagtgtaa aatgacaata tggctttatc aagtgcttag ggtctggaaa aatgcacata 104041 ttgcctgtta tttattgtta tacttaatga atgttctttt gcacgtgaga actgttataa 104101 aagatgtctt ttgtacatac ttggtacacc agttggcttt ttaacccatt agctgttttc 104161 ccccaaaggt ggcctactct agatgacgtt aaaaatacac attttccagg gacttatata 104221 aatagtcaac tgccctatta tggaaactta aaagcgaagg ccctgaaata tccagacagt 104281 ttgaacataa accaaaaatg aagtgaatct ctagaattgc tgtgtaatag cgtgagtggg 104341 gagggggtcc ctcaagagag ggaactgacc tttttcttcc tcgtgggtga ccttgagtac 104401 acactttatt tagtcattaa aagggtcacc tgaagcttag agaatgaaga tcctacagcc 104461 tgttaatttt atcattgtta gaaactctct gcctttggcc tgaaaatggt aaatcacctt 104521 tcttagaaga ttctcatatc cagatattct taccaaatca gaatcttttc cagcattttc 104581 ttcaaaaaac cacagttctg ggttaaaaca cagcagatcc taactgtaat tttcctttca 104641 tgtacatttt agtcatcttt ataattttaa atgaaagaca ggaactacat tatttgaaat 104701 aagggttaac cccctctaat tataagcatt aatcacacac agttttctca tttctaaatg 104761 ttaaatataa aatgggagct atctcttcag acacttttct gttgccttct gtgtgcggca 104821 ggaccttact tttaagctta gtttgattta aaggctgagt ttctgcttgc ttggaatgct 104881 tgctcttcaa aatatttgct ccctggagag ggaaactgaa gatggagctt tcacgtgact 104941 
ttggattgct gacctggaag acgtttgggg taataggatg ggctagggca tggttccact 105001 gctttctgtt tgagtggggt caggaaagga aggcggggct cgctgtgtgc aaacaggaaa 105061 acatagtctg aagatgtttg aagcctgagt gcactttcat ttgatgcctt gctttattgc 105121 ttaagagaat tttaaactgc cgaatccgca aatgtgctgc agcctataat gacacatatt 105181 taaatgattt ttgcatctaa acaaatgcat tcactctttg cctggtgtgt acaatttgaa 105241 tgacactcgt gctctgcctg gaaggaacag acgtttttca ttataaaatt cagtcaagca 105301 gaaaagtagc aaaggaagca gaaatttcca tttacatcta tgattgaagt tgtagtgata 105361 aaagaggatg actaagttga agagtagcct tgttctcatt ctgctatctg acatgggatc 105421 ttgtcacttt ggtaatagtt tttaaactta cattattgag gaaggctctg atcagaagtc 105481 tcaaattata catgttcttc aaccttagat ttttctgggt gttttgtttc aaaagtctca 105541 aattatatat gttcttcaac cttagatttt tctgggtgtt ttgtttggtt ttttggttat 105601 ctatagatag atttgagtat ctcttttttt cttaaaatcc gttacttact agtaataaag 105661 acttaagtgg tagctttacg ctccattatc acagagattc acagtacttt ttagttttta 105721 cttgtgagcc tcagcaggtg tagacagata ccttagttgc cactgataac atccagttgc 105781 cactagatca agaaatggtt caaaattctc caggggcatt ttctgctaaa gaagagttgg 105841 tgatcttcct tttggaagcc ttaatgtact taaaccttct tccttagaag tccagaaatt 105901 caattatttc agctctctgt ggagtgataa gtcctgccaa attgatttac aataatgaaa 105961 ctaagccttg tttacaaatc taagtggtga tttctaagtc cacagaggcg tggtaacaga 106021 gagagagatt ctgatttatc tgggggtcta tccgtgtttg aacagattgt gtgatttttg 106081 aaacatgatt caaaagggat gttgataaat tctaaaatac atttttattg aagagcttga 106141 tgaatcatgt atacttactt tatttgttat tctattagat attgtcttcc tcatccctag 106201 gtggtgttgc tagtacagtg acagaattca aatccttctt tttcctctgc tctaacagag 106261 ctttgtgaac tcatagcaat attgtaatgc ctggtaaaag gtggtgccag tttcaagtac 106321 tatatcctaa ttacttcatg gctaactctt gttattgtta acaatattgt taactctttt 106381 gttaaatgca gtgacatctt ttttctttct cattctctgt gtactttttg ccctaaatag 106441 ttctgcatct tttaagatac aagtttgctt tttcttggag gtttttattt ggaaaatgcc 106501 tgaaatatat agaccaacat ttgttcctcc aatggttaaa ttacattgtc tcccaaagtg 106561 aggcatttta agtctggact 
atacccagga gaatctaaaa gaggcagtta ataagctctt 106621 gaagagcaca gtggggtggt tgtgtattta ttgagtctca cacttaggag agaaataggc 106681 tggaccaaat gagcctgggc atcaaaagca tgtgggtggt cgttggtgat atgtgcagtg 106741 atgagtgtgt gtgctgaggg tcctagacca ggctggcagg gaaatgaact gaaggcagtg 106801 aagcctcccc tgataactgc agagagtgct gcagcatcca ggacctcagg atcatgatga 106861 acaagctgta gtggatggaa gggagaggaa ctatgggaag gatgagtttc tggccagaaa 106921 gtggtttact ggagtgctgt gtctgagtat tacatgatct gaggaccatg aggtccagct 106981 ttgggagtat ggagagtttg agtggagtag agttatagtc ccatggtctt ctgattcaca 107041 gacattcatg agcgataagt atctattgag aaagtggaag catgattttt atgacctcct 107101 cttgttgtgg ggaaatcgat caacggggcc tatattggtt gggttctaca accaaataga 107161 atgtgttcag caggagtcat gttttgggct cctcacaatg ttcagtgtta gttctgttcc 107221 tttctcttgc ttccacatag gaaagacttt gacccaatga ggaattagtc attgtccttt 107281 gggtacactt ccggaatttg ataatcagtt gggcacttga ttctagtaca tttgctaaaa 107341 gtaattaata ccctgcaatt ggaagcatgc agcaagcgac tgggaatgct aagtgaaagg 107401 tcagaattga aagagtggag gaggctgttg acccttaggc atggatgata tgtctgagtg 107461 agaaagcaag aagagtagaa gacagagcct tactttataa tcagaattat gaaaagagga 107521 gccaatgaga gggaaagaat gattggcttg agatagattg gtggaaactg gagaagattg 107581 aatttaaaga agaaaagtgg gccaagttca gtggcccatg cctagaatcc cagcaaattg 107641 ggaggctgaa gctaggagag ttgcttgagg ccaggagttt gagatcagct tgggcaacgt 107701 ggcgagaccc ctgtctctac aaaaaataat taaaaaaaaa attagccagg catgttggca 107761 tgtgccggta gtcccggcta cttggaaggc tgagacagga aggttgtctg agcccaggag 107821 gctgcagtga gctatgatca cagcactgca ctccagcctg gttgacagag ggtatccccg 107881 tctctttaaa aaaaaaaaaa aaaagagaat gtggcagagt agtcagagaa ctggagggct 107941 gagaagagac tgatggcatt tgttgttctt gaccttgaaa gaagagttgc agattgttgg 108001 agcaaggcca gatggtaata ggttggaaag aacaagtgag gggcgtgaga gtgacagtct 108061 acaaccccgt taagaagtta tctgtgaaaa tgcctcttcc tgtcttgatt atagcctccc 108121 tcgcacatgg ctttctgagt atggtatgtg aaaacagcat gctgtatatt tttaccttcc 108181 catatgagtt aaggttttgg taatgacttg tggttctttt 
gcctcacgga acaccaacct 108241 ggttggtatt ttgtcggcaa ttatgtggtt tttaaaaagc cccgtatata accttaatta 108301 attgcgtctc taaaatctat caggctttga gttttttgtt tttttttttt aaataaggca 108361 gaaagaaagg aaagtgttag gagaaagcaa ccagtgagaa tacctatatt ttgaagaaac 108421 aaagtaggga atttagaatt atctctacta tatctgcttt ttcagctcat ctctcgggat 108481 acagaggact gaactaagct aaagtaggtt caagtctaaa ggaaacctta aattaatttc 108541 tgtattagaa ttaatcacac acctataaat acaatttatg atatgagaac tgcaacaaga 108601 gaaatggggg gagatgagca gtttgctgat caatagggtt aatgatggta acatagaatc 108661 ctagtgatat gcctcccatt gactaaaatg tatgcgttaa gatagcatag cgcataaaat 108721 ggtgtactgc agtctcagtg tattcaagct tattatggtt tctctgccta ggtttgtgtc 108781 tttggcttta gattttctcc cataacccca acctgtccac attcaagtca ctactccttg 108841 aaatatatta ttgatctcta aaatctgatg ctaagtctgt cttgcccaac ggtggccact 108901 agccacccat ggccattgaa tatttgaaac ctggctagtc caaactatgc tgtgagtata 108961 aaaaacccac cagatattaa aaagttagtc tgggagaaaa aaacaaatgt agaatgtctt 109021 aatgataatt atttacattg attgcatgtt gaaatgatga aatttaggat gtactgggtt 109081 aaaatatatt aataaaatta attttacctg tttcttttta cctttttatg ttaatgtggc 109141 tacaggaaat tttaaattag atatgtggcc tctattatat ttctgtgaga caaagctgct 109201 ccaagctgtt ttggtccaga actatatctc ggttcttttg tatcctacat ggacctgcat 109261 agaatggtat ttagtaatac tcagcagtag cagagatgat ctcttatcaa attatgttaa 109321 gttttaattt ttattcttgt ctctcaccct atgtgcacct ctctggccag gaggcattta 109381 tcatatcacc ttttaatatt tggaaatcac ttgatctgct tgagtcattg tgttgttttc 109441 aaatctagaa ctctttctac cttattttgt aacttctata attaaacaaa aggcgaacca 109501 ggagcagctt ggctcttagg gatgtgtgaa aaggcgctca gtccttaagt tggtgccaaa 109561 agtgacaaat ctataggatg agatgcaggt ttatggaaga gaagcatgaa aatttttatt 109621 ttttaaaagg acacagcaag ctagtcacca aaagacagat acttgattct acttatatgc 109681 ggtacctttt tgaatttgaa gagtcaaatt caaaaagatg gaaaacagaa tggtggtttc 109741 caggttctgg aggtggggga atggggagtg gaggtggggg aatggggagt ggaggtgggg 109801 gaatggggag ttgtttaatg gtgtgcaggg ttttagttta gcaagatgaa aacagttctg 109861 
gagatggaag gtagtgatgg ttgtacaacg atatgaatct acttaacact aaactgtata 109921 attaaaagtg ggtaagcaaa gagtgtctta caaagcaata acttgtaact taaaattttt 109981 aaaaacctta atgagattag ttgtaatggg ttatcttaag tagaccctta ctgccggaag 110041 gaacagctga atcacatcac ttgggagctt gttagaaaag cagactcagt cccaccctag 110101 acttgtccaa tcaaaacctg cattttaaca aggcccccag gtgattcata ggcactgtaa 110161 agcttgagaa gccctgctcc aaatatcatt tctcaaagtg tgatccatga gccacctgaa 110221 gcagagccac cctgggaagg tctgttttca ggcgcgtccc agaacacgta aatcagatct 110281 aggctgtgga gctagggaga agatgaatct gcagactgat caggatccca ggtgattttg 110341 aagcatggtt tgagtttgaa aacccctaat ctggaaagct ttaacaacta ttactcttac 110401 aattccatct tttaacgaat gttggaatta ttaaatactt gtaacaagaa gtcctgttat 110461 tgataaaatc aagatatgac taatagtgca gtcaaaaaga ggcaacaaaa aagcacctat 110521 aacttatctt gggttggtta atgaggaagc tgtcattcaa aattagtaag actactgtaa 110581 catttcttgg ggtcctgaac attggaatga gagtcggttt tcagagttga gcccacctct 110641 gtcattagct ttgtgcaaac ctcatcaggc tcaagattcc tcatttgtaa aggggaatga 110701 attctctgac cagttagttt acattcctaa gacaaggtta aatttggaaa gcaaagaatt 110761 ttcggtatat atgtgataat tttgaaattc ttgaattttt aaaaaattta aatgccacat 110821 tacttatatt tttttctaaa aaaaagtcaa accaagcgct gtctagttat taatagatac 110881 cacagatgag atgcctgagt tggtaagctc tttttaattc tagttctttt caactacaat 110941 aattttatag actttttttt gaaggctaga atgttgatgt ggaatcaaat aggtatgcct 111001 ctatgtatgg tagatgcacc tgacagggat cacttgactt gaacatgccc tgacaatgat 111061 cctgtgtggc aggtgcacct gattctgagt tctgagttaa gaaatccggg agtgggccgg 111121 gcggggtggc cggtgcctgt aatcccagca ctttgggagg ccgaggcagg tggatcattt 111181 gaggtcggaa gtccaagacc agcctggcca atgtggcaaa accccgtctc tactaaaaat 111241 acaaaaacca gctgggcgtg gtggcgcagg cctgtagtcc cagctacttt ggaggctgag 111301 gcaggagaat tgcttgaacc cgggaggtgg aggttgcagt gagccaagat cacgccagtg 111361 cactccagcc tgggcgacag aatgagactc tgtttcaaac aacagcaaca acaaaaccca 111421 aaacaaaaaa acaacaacaa agaaatccag gagtggctaa cccagagagc cattccttat 111481 ctgtaaggaa catctgagcc 
ctcatctcct tctgtggaac atgggcctcg ccggggattg 111541 agaccctttg ttatgagtta agtggaagtt gccaggtgga gattgttagg gggagggtgt 111601 taaatgagaa tgctgtagag accacatgcc ttttgcaagc ggctgcggtt cctctgtcca 111661 acccaccacc actggactct ctaccctgta tgcaagcccc cagtaaaacc ctatgagaca 111721 tacggttcac tagctctggg cctcttctgg ctcttgaacc tggtgccatc cacattgcag 111781 tcagtaggag tttggcatga caccttatta tgtgaacttt tatatcaagc cattaaattt 111841 ttttaaaata gactttatat tttacaacag ttttaaattt acaggaaaat tgagaggata 111901 gtccagagag ttcccatagg ctttgcatcc agttttccct attgttaaca tcttatatta 111961 atatggtaca tttgttgcag tcagtaaacc aattttgata catggttatt aactaaagtc 112021 tatactttat tcagatgtcc ttagtttttg tctaatgtcc tttctgtgtt ccaggatgcc 112081 atccaggaag gcgcattata tttaattgtc aggtctccat agcctccttt cctcttggct 112141 atgacagttt ctttatactt ctctagtttt ttaataacct taaaggtttt gtattggtca 112201 gataatttgt aggattcccc tctgtgggaa ttttctcatg ctttgcctgg ggttattggt 112261 tttaagaagg aagaccacag aggtaaaatg cccatttcat cacatcatat caagggtacg 112321 tgtttatcaa catgatacag ccctgttggc attgaccttg atcacctggc tcaggtggtg 112381 gttgttgagt ttctccactt gaaaattact cgtcttccca ctttatgtgc tctactcttt 112441 ggaaggaagt cagtttgcac agcccacaat taaggagcag agaattatgc ttcattttct 112501 tgagggcaga gtatctacat aaattatttg gaattcttct gcgcaggaga tttgtctctt 112561 ttttcctatt tatgtattca ttcattcagt catttagact catggatatt tacattgtac 112621 tttgatttct aatccagcac tacttcataa tttttattgc tcaaattatt ccggctttag 112681 ctattagaag ccctttcact tggttcctat gctcatttga catagcctct tttttttttt 112741 tcttacaaga tgctgcagtc tcatcttgca tattttcagc cctaatccta gaatcagctg 112801 ttcctccaag gagcccttgt tccttttatt ggagagtaat gttattggag aatagtatta 112861 aaatctggat gttaaggatg ctcgttgcca ctggagtttc tttgctttta agacttctca 112921 gttgatagag caaggcaatg tatttgtcca tgttcaccct tatgttatgc agatctataa 112981 atatttctat agctaactag ttgtatttat attaagctat aatatctagt ctatattaag 113041 ctaaatacaa gttcatccgg atgtttccta ctctaacgta ttaccacatg gacactatat 113101 ctctcctcct tggttatgta taaattctca ctccatccac 
catctatgta cttaattgtt 113161 cagtttcagt atatgtgcgt agccgtatca gaattgttaa cccaggctgg gtacagtggc 113221 tcatgcctgt aatcccaata ctttgggagg ctgaggcagg tggattgctt gagcccagga 113281 atttgagacc agcctgggca acatggcata accccatgtc tactaaaaat acaaaaaaat 113341 tgctggtcat ggtagcatgt gcctgtagtc ccagctaccc gagaagctga ggtgggagga 113401 tcacctgagc ccaggaagtc aaggctgcag tgagctgtga tcgtgccatt gcactccagc 113461 ctgggagaca gagtaagact ctttcttaaa aaaataaaaa ataaaaaata taaaaaaatt 113521 tgttaaccca cacctatatg gatcttttta aaagctgcaa gactgctaac aaatatttat 113581 ttatttattt atttatttat ttatttgaga cagagtctcc ctctgtcacc caggctggag 113641 tgcaatgaca caatctcagc ccactgcaac ctctgcttcc ccggctcagg cgattctcct 113701 gccccagcct cccgagtagc taagattaca ggcacccgcc accatgccca gctaattttg 113761 tatttttagt agagacgggg tttcaccatg ttagccaggc tggtctcgaa ctcctgacct 113821 caggtgatcc acttgcctcg gcctcccaaa ttgctgggat tataggtgtg agccaccatg 113881 cctggccata aatacttatt ttgttcccgt ggtttttttt ttcttttctc ttttttaaag 113941 taaatccttt gcactatttt tattatttta agaccagtgt tattcagagc ctgggttaaa 114001 cactattctg tgtgtgtgtg tgtgtgtgtg tgtgtgtttg tttgtttgtt tgttttttga 114061 gacagagtct cgctctgtca cccatgctgg agtgcagtgg tgtgatcttg gctcactgca 114121 acctccgcct ccctggttcg ggagattctt cggcctcagc ctcccgagta gctgggacta 114181 taggtgcgcg ccaacatgct cagctaattt ttgtattttt agtagagacg gagtgtcacc 114241 atattggcca ggctggtttc aaactcctga ccttgtgatc cacctcggcc tcccaaagtg 114301 ctgggattac aggcatgagc caccactccc ggccagcact attctgtctt atactcactc 114361 tgccatctgt gtgtgtatat ggtacgtatg tgaggctgga tctgtgcact tggaaaagtt 114421 ggagagatga tttaggtaaa acttttagtt ttcacttctt ttgccttttt gcttatcttg 114481 atccatttag ttcctaccat ctggcaaata gttctcatct gaagaatccc atctgtcagt 114541 tctcactatt atccggtaat agttacagtg aaatgctggt gacggcaaca gccacttttt 114601 aaaaggatgt gtaattcacc ctcatgtcct ctttgagaac ccagtaactg cccacactct 114661 gaagctaaaa ccataacagt tttatcacat ggaaagtttg ttgagaaatg tttggctgtt 114721 tctgtgaagt aatcgtcccg tctcttgagt acagatgcct cccaggaaaa ttttcccagc 114781 
ctatttgtta atgtgaaagt tacaaaagat agcatataat gagcacattc aagacaaaaa 114841 aatttgtgga gtaagagtct gaaatggtaa aataaaaaat cacttaagaa aacatttatc 114901 tgtagctgtc attttcatat acacatatct agtgtataca agttagtaag tacataattc 114961 tcaaaagttc tacctgtcta cacatctaca cagaaaaatt aggaaagtct tgtccagact 115021 tttccatcat cttcagtata aagcatcttc tagctgttgg aatagaaaca agctagattc 115081 ttgactcctt tctatttgat tatttagatc tgaccttcag acgacatcca gaatctgaga 115141 acacgcacca cctctactgc tctcatcctg gtccaagtca ttgtcatctc ccacctggat 115201 taatgccaga gcccccttcc tggtcttctc acttctagtc ttgaccccct agggcaagaa 115261 taattagctt agaaaaccac cccagagcct tccagcatta ctctgagtac tggccagagt 115321 tggacaatgt ttaaggcccc gtgcagtcag ctgtccctgt cctgtcctgg ctgtcactca 115381 ctcgcttgcc cactgatctc ttccacatca ttcctccacg acagcactga tctccttgtt 115441 cttctgtgag ttcaccaggc atgcatctgc ctcaagactt tgtgtaatac ttgttgcttc 115501 ctctggctag aatattcttc cctttgattc tgaatacttt gttccctcac ttcctcaagt 115561 cttcactgaa atgtcatatt ttcaagtagg cattccctga ccaccctatt taaaatttca 115621 cccctcagaa aaaacaaaca aaccacccct ttctctgctt tatctatgta acatttatta 115681 ccatccaaca gacactgtat ttttatctgt attttttatc gcttccccgc taggatcata 115741 tatcgtttta tctgttttgt tcactgctgt attcttagca ctgagacata attgtaagta 115801 tgcagtgaag agttagtgga taaaggacag tggctgcaag aggggattta ggaaacaccc 115861 cctcccagcc aggaccagca ctcggtgcaa gagttgaaag agttgatcac tcagattctc 115921 tgcttctcca gtcgtaggga ctccaccccg ctgtgttgat tgctgtcatg tattcacccc 115981 atcataagca gtactcacat gtaaaacagt accatgtcat ctctgtgggg ctacaagtag 116041 gtggtgatgg tgcaatgaac atataaaacc ccagtgtctt cagtaggtgc gaaataggtg 116101 attagtgaat gatgaagagt agctgttgct gttttgatta ctatgattat tattatgaga 116161 aactactgtg gttgatgaga tagatactgg ggagaaggat actattaata taatttattg 116221 agcaactttg ttctgggcac tagttgagta ctttagggat attaattcat ttaattctcc 116281 cagttctgcc atttactctc tatgtgacct ggactgttac atagtctctc agtctcaccc 116341 tctctttttt tttttttttt tttttttttt tttttttgag acggagtctc gctctttcgc 116401 tcagtctgga ctgcagtggt 
gctatcttgg ctcactgcaa cctccacctc ccaggttcaa 116461 gcgattctcc tgcctcagcc tactgagtag ctgggattac aggcatgcac caccacgcct 116521 ggctaatttt tttgtatttt tagtagagac ggggtttcac catattagcc agactggtct 116581 cgaactcctg accttgtgat ccacccgtct cggcctccca aagtgctggg attacagact 116641 tgagccactg tgcccggccc ttgccttctc ttataaagtg aaatcattgt agctgctttg 116701 gtgtggttct gaggacggtg agagaagacg cacgtgaagt attgagggtg gtggcaggca 116761 cacagttcag agttactaac aatactggtg ttttatcttc acaaggcaga tctgggccca 116821 tagcacaagg aaaccgaggc tttggcattt tacagtaatt acattgccag aacatttccc 116881 ccactttctt gaatcatctt tacctcttga gagatgagga agagagggca cacctcgggc 116941 agaagtccaa agacctcagg catttttggc ctttcattcc catgattctg gccaacatca 117001 ttttagtccc cagcagttca tagaaatatt gtcttttaat ctgatattca aataatctgt 117061 gccttgaaga acctgtgtat ggctctagaa tcttttacat taatctactc aaccccactc 117121 tgatgctcaa aacttcagca gtctgttttt tttttatttt aatgtcttta ttattatttt 117181 tttttttgag acggagtctc gctctgtcgc ccaggctgga gtgcagtggc gcgatctcgg 117241 ctcactgcaa gctccgcctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag 117301 ctgggactac aggcgcccgc taccacgccc ggctaatttt ttgtattttt agtagagacg 117361 gggtttcacc gtgttagcca ggatggtctc gatctcctga ccttgtgatc cgcccgcctc 117421 ggcctcccaa agtgctggga ttacaggcgt gagccaccgc gcccggccta atgtctttat 117481 tttaaacatt tatttttaga ggtagggtct ctctaagtta tgcaggctgg cctcaaattc 117541 ctgggctcaa gtgatcctcc caccacagcc tcccaagtag ctgggatgac acccccaggt 117601 gccgccatcc tggcagcaga gtcagttttg atacatcttt cccttctcat tcttcccaat 117661 gtcatggatt gggaagttac tgtggtttta gttgcagtac tttttactct gaattctgtt 117721 tgcttcctgg aattctgtct gtcttttgaa aagacaacat aaagaaatcg aaaaatatac 117781 tttaaattcc atagtgtggg agcctctgct actcgcctgt ttgatattca ttcccatttt 117841 ttcctttcta aaagaattcc agattttttt tgttcagggt gacaatgtgc tctgctgaaa 117901 atactttcac agattccctt gcaggtgcat gtgaccagat cacagtaatg gcacatatgg 117961 cctcagtgga atttgctgag gagtatgtcc ctcccaagta acagcaaagc tttattagag 118021 gaaaaaacaa caacagaaaa cctttcacct ttgtcttgtt 
tctcctgcct tctgtctgga 118081 acttggacac agagtcttga atttctgtga ccataaaaca cgaggataaa accaacgtgc 118141 tcaggatggc agtacagaaa gataggaata cgctggtccc tgatggcatc atggaatatc 118201 tgcagtggcc ctagcttcag tgtcacttct taggtgagat acataatcct accgtgtgtt 118261 taagccacgg tttgacatgt ggtctattct ctgccgctga ctgcattccc agctgttcta 118321 tatggcttgc tagctaattt aagtctagtt ttattgtcaa attgaccaca ttgtagatta 118381 taatattaaa gtacttcaca gtctaaaagt gaactacttt tgtaaaagcc aaaggaaatg 118441 ggccaaatga accattttga acagagttta aatgagtgag ttgagttacc actcttggat 118501 ggagaactgg agacctggat tttcagtcta gttcttctgc cagttggctc tgtgggcaag 118561 tcacttgacc tgagctcttt tcctcccatc tgcaaaatga ggtaatttga gctagacagt 118621 cctcccaaag accaccaaag agctagcaga ttgaggcctg cagactccta ttgaatttgc 118681 catccttgtt caccttgcca agacttcata gccaagaagc cagacccaga ttttgttcat 118741 tagctttgaa aacacccttt cagttgggtg tgtgtgtgtg tgtgtgtgtg tgtggtgtgt 118801 gacatgcgtt ttgaatataa attgtgaatt gttttagcaa gaaaccaggt tgtatgtttc 118861 tcaataaaca tgcatactgt atattgatga atttatccac ttgaaataaa tggtgccaaa 118921 ttgcctaaag tttgctgtaa acaggaacat gatttctcat tatgatgttt aatttgtttt 118981 agtctttgat ctcttacatc agaattctgt taagaactct ttttacctga aagatcaaag 119041 caaataagca gaagcattta taaaaataag gtatacgtgg ctgtggagag aaactccagg 119101 tatatacgat cataggcttt ttaaaattaa taaactttgt ttttaaaaac aaagcagtgg 119161 aagttcacag caagtttgag tagaaagtac agtttccatg taactgctgt cctagcaaag 119221 atttttaagt acaatatttt ttaattaaaa agagaaaata taaaccgtgc taagccagtt 119281 ggtgaaaaag gcttcattat acttttagca ctcttaacgc ttattaaaag tttttaaaaa 119341 cagatttttg tggttgaatc gcttacaatt tctgatgtgt aaccttctaa catgtttcat 119401 caaattttaa aagtaatgta agcagtgctg tccactgcta aattttattt taatgttaga 119461 atacaatgag gtttctattt tgccattagt cctgagagct tattaagtaa ccagttttaa 119521 tcctttaaaa aaccattctc attttgaggg acacatattt cttagctttg tgcaatataa 119581 gaagctaaga ccatcttggc agtatcaaat gaaatgtttt gcaagttaga aagttcagta 119641 accacttata aacataatga acaattgtta tagtcagtct aatgcagaaa tacaggtgtt 119701 
gggggcaggg gggtggtggg tgggcagact tggtccacaa gtagtagttt gccaacctct 119761 gtcctaaaca caagttcaag aatatggttc tcttggatag attttctgcc agatgaaagt 119821 aagagtaaca tccttagcta atgaactcct aatatccaag gcaactttta attaattcag 119881 gcttttaaga aaagcttgtt taccaactcc tcattaaaaa tgttcagact ggctggtcgt 119941 ggtggctcat gtctgtaatc ccggcacttt gggaggctga ggcaggcaga ttgcttgagc 120001 ctaggagttc gagaccagcc tgggcaacat atctagaccc tgtctctgca aaaagtgaaa 120061 taattagctg ggcatgatgg tgcacagctg tagtcccagc tactcaggag gctgaggtgg 120121 gaggattgca tgacccccgg aggtcgaagc tgcagtgagc cattgatcca accactacat 120181 tccagccagg gtgacagagt gagaccttgt ctgaaaacaa acaaacaaaa agattaatgc 120241 agtacatgtc cttttggtga ataagtaaat atgttttggt aactttagct ttgcattata 120301 atctagtcgt cagcagattg actaagggca gataaaacaa gatgatttgg ttttgtggtt 120361 tggcacttac tgtctgattt atgcccccaa gaactcttgc aggccaaggt cgagtgtatc 120421 tgatacctgc tctggctctt ctcacgaact gatttagtag ctttgggcca gcattagatt 120481 ttttaaactc tttgacctat ttgtacatga caggcattta atctttccag tagttacttg 120541 tcaaatagaa aagcatcact tcttagtttg gacatttgca aatatgactg agcagagctg 120601 acagatgagg tgggcatctt caggccatag ggtacttctg gcctcctgta acctagaaaa 120661 ttgggatgcc tcagatatca ggtattcttc caagcagcta gggttagtcc ctgacctcag 120721 aaaccctgag agccctcaga aaccctgaga aatccacctg ggaatggtgt atttgttttg 120781 gatgagatta cctttacttt ccattctcca ggtcttcttg tgttttttgt ttttgttttg 120841 ttttgttttg ttttatatca ttctatgtaa aagagaaatg tggtatcatt tgcattatca 120901 tgttatttct ttccttataa tttatatctt aatgcttatc tctggggcta ggttataatc 120961 tcctgagtta ggtagcctta ggtttcttgg cggagattcc tcccctggga agtgaggttt 121021 ataaaagaac ccgtgatgtg gccctgtgga gctgcatggg gtgtgtgctc agtactcact 121081 ggtttcttcc ctgcaaagtg acacctgcag cctttggggt accatgtctc ataatttttg 121141 cattccctgc aacacacact atactgctat gggcatcatt gatcatttta aagatttaag 121201 gaatgacaag atacagtgct tattctcccc tatgatttgc ttagattttt atctaaatag 121261 tacttattga aataagtact tcagtatcat gctacatctt acatgtttga gctagaaaca 121321 gactaaagtt taaacccttt 
tatgagaatg aaaagttact tccttcccct tcttagggct 121381 tttaatccct ctttatgttg catttcagaa aatgccctat agcgtactat gaaagaaaga 121441 cattcattcc tgtctttgct gctgtgtgga cgatctgtat tccttgctcc gtgggagttg 121501 tgggtttatg tctattcttg catgttttgc ctgaaggacc acagtgacct acttggtggt 121561 ctccctgctt tcatggtttc actatgctga tctatccttc acttaattat agagtctatc 121621 ttaaaaaaaa aaaaaaaaaa aaaaaaactt ttggcctatg ttgcttcttc caaaatggca 121681 tccaaaacct aagcctcttt aaaattggag tccagcatgc ttccccagtg ctacttcctt 121741 ctggactccc tttgccactt gctttctgct tccacccata tccagatggg ctgcaaatat 121801 gccttgctgg ttctttactc tgtctttgca caggaggctc cctcattttg aagggtcttt 121861 tgctacccat ccatccatcc atttggattc tataatagag ttcttgatgt ttgacagaac 121921 taggccagga attgtggggg tataatggca agcaagacag agtctatcct tacgttatgg 121981 aagttcaatt catttcccat gtggcatact acttctcacc attcagattc atctcgaata 122041 ttgtttcttc tttgaggttt ttcccactgc acccaagata ctggcctgac actttcctta 122101 tctctattac agaatttaac ttgttcattc attaacagat atttattgcc tactatgtgc 122161 aagaatctgt gctaaaagct gggaatattg cagtaaacaa aaattaagtt tacagcttac 122221 atataatgtg gtaattatcc atcttttacc ttcattagac tgagttcatg taggacttta 122281 tgtaatatct ttatatcttc aataccagaa atgcctagaa ataatttgca taatatacat 122341 tttctttaaa gatttaacaa aagtctgagg gacagttttc agaataaact gaattaaact 122401 ctctctctct tatgtgtgtt atgtaaacat atatataaaa tacacttttg ggttttttgc 122461 ttaggaagta gaaaaatgct aaaatttctg ttacttacaa gcagattaaa atgtccaaga 122521 aagaaaagta gggatttata ttgattctgc tcaaatgctt acctagacaa tgtcaatatt 122581 tatattttac tcccctgttg tagtagaatg gtaattggtt ctctagttaa gcagcttttt 122641 taggattaat tttgaagcaa gaaattcaga ttaattaagg gacaatttca cttgcagcat 122701 ttaggcatta ttttaaccat caaacttgaa gtggcaagaa acggagttgc ttaaggaggc 122761 cagacgtaac attctaaagg tagtagggtc ttgattgggt tgcttaggca ttaaaaggct 122821 gtttaacttg tcttgaagtc tatctttcct tgatgtcttc tgcggtaaga acactgtgat 122881 acagatggaa tgacgggaag tggttttcct ttctttcagt tggtgagttt gtaagtgatg 122941 cccttctcgt tcctgacaag tgcaaattct tacaccagga 
gaggatggat gtttgcgaaa 123001 ctcatcttca ctggcacacc gtcgccaaag aggtaccagc ccataaattc tttcttattg 123061 caaagtgaag atttcctggg gacgtgctta atggcatttt aggggtattt tgaggcaaca 123121 gctatgttga taagttattt aagaaaaaaa cccagtgcta ctatccgtaa cccagatttt 123181 agtttttaga gtgattttag gttcacagca aaatcgaaca gaaaatagag ttcccgtgta 123241 ccccctgtcc ccacacatgc gcagcctccc catccatgaa tatttttaaa aagtcaaaaa 123301 tagggccagt tgaactagat tcattatgtg cctgtgccgt gtttcgcaac ttaaaaatat 123361 aagttgtctg tgccatgcct tataacttaa aaatataatc tctaatagtt ataactttaa 123421 agtttaccct tcagattatt tttatgcacc tgttgttttc ttttgtcata tacaattttt 123481 taataattta gccttttccc ctgttatcta ttagtgaata cagtggtaga acagtgaagt 123541 agctttttta taccagcctg aaagctccag ttagaattca gtttctcagt tataccatcc 123601 agttttatat tcagcaggcc atgcctacca tcttgatttt tccacagggt gagtctgacc 123661 gagcacttgg ttaacctctg cttttggcta agccaataaa agatgtaaga tacttagcat 123721 gaaaaggtca ctgtgtggta agaccacctt cacagctgct gtgtcccctt gataggtttc 123781 ctaataaatt tatataggtt tatttgtggc aagatcagtc ttaggctaac tagaattagg 123841 ctcctttatc tagctcatag tgcttcactt gagtcaattt tgatcccctg tccctcaccg 123901 gaacatgtct ggagacattt tttgttatta cagcttgagg gagtggtgtt actggcatct 123961 gatagtgccc agaggccaga gattctgcta aacatcctac agtcaacagg ataaccaccc 124021 gccaccccca agaattaggc agtccaaaac atcaatagtg ctggggttga gaaatctgat 124081 ctagcttagc ttaaccattg tatcatagct ttatagctgt tagacagaaa ataccactga 124141 agatttagta ttattttatg cctcttataa aatagtatat ttggatccac tgtagaataa 124201 aatgtttttt atgtaaaaat gtgttttgga ggtttaagag cttgctgagt ttaagaaaac 124261 aaaccaaata atttgaagtc cagctaccta tttgataacc tcatttttta ttccttctgt 124321 gttttcaaaa gaaaaatcct ttgtctctaa gaaacggaaa tgactcccat aaggtcaaaa 124381 tacagtctgt aatattttca ttgtctacct ttatgtcttg aattattcat ccttcattta 124441 tcactagtgg atgctctcag aactttatgc taatctagat ttgtgcaaat ttagcaaatc 124501 ccagaaaagc aaaaagacat tcaaggagaa atctctaatg actgtaagtt ctgattctta 124561 gtggctgcag gggataattt cttctcttcc caggattgtt actctgaaag tgaaccaaga 124621 
gagtcagaat ttctaggata aggcaggtcc atgtgagata aacttccatc tgtaagggag 124681 gtaggagtga gctggggatc tacctggaat gcgcacaatt tctggataca cgaattaagc 124741 cagaaggcat taattctctg gtatctgttc accagtggag actgctacat cttatttccc 124801 aattagtttc cattccagtt gttcgtataa acctctatta tataacatcg tggtcttatt 124861 aaataaataa taaacaagaa taaaaagagt accaaagtgt aacccatgct aaaaaaaatc 124921 ctgttgtaaa ttcactcaaa aagcaaataa tcttttcctt tgtgaaattg gttcctaata 124981 tattgggtct gcatgttgat tattttatgt ggagttttct tacaatgaaa cacatctact 125041 ctaccactca ctgttttctc cttacacttt gtagacatgc agtgagaaga gtaccaactt 125101 gcatgactac ggcatgttgc tgccctgcgg aattgacaag ttccgagggg tagagtttgt 125161 gtgttgccca ctggctgaag aaagtgacaa tgtggattct gctgatgcgg aggaggatga 125221 ctcggatgtc tggtggggcg gagcagacac agactatgca gatgggaggt aaggtggcct 125281 ttgtgttcag cctcagagat gctgaaacat cttgtatgga gtatttgtat cctgtaaatt 125341 aatctttctg tttatcactg aaaaggtctc tgcccactcc catcagagtc tgctgttatg 125401 caaaaatctg aactatgaat ttttatggca tcctgttgaa ttaataatat cagtcaccca 125461 tcacagagtt aattttaact atttaatatt aaacttggga tcaaaatccc actgataact 125521 atcataggtt actggtagtt ctaacaggga gttgaaataa taatggcgtt ctgttttggt 125581 cattaattta aaaatatttt taaatgctct ctgggattgt cagtcattag tttaaggtga 125641 attttcgtta aaggtttggt ttgagatttt gggcgttctc tgtgtggatg tgtaagggtt 125701 tttttttggc cttttatcat ttcttttcct tcatagtgga agtcaaaggc attaataaat 125761 gcttgttaat tttttttaac tacctttgat ttgttgattg taaagaaacc tattttcgcc 125821 tctttttctt agggagctta gtcccgttac ttcttacctg attctcattc ttaactgaag 125881 gcaaacatct atattcaaat cataaaatta taaaatgaga atctttgggt tggaaatgcc 125941 atctggttca ggcattgatt ttatgctgtc tcatcccccg aagggtgatg attgtatcac 126001 ttatcttatt ccttcagtat tctcttagat gttccctaga gctacagatt tccaaacagc 126061 acattcaata ggagttgtaa gtttataatc ggcaaactac aggatgcttg tttttttttc 126121 ttttaatttc tggtgtaatc tgctactagg gcccccatag agtttattgt ctatatcagc 126181 tagaacactg ggaccacagt cataaactga aggatccagg caaactgggt catatggtta 126241 cccccaccaa ttacagtctt 
cagtgatgat gtcactatcc aggttgatat tcagttgaaa 126301 tgctagccag ctccttccta gttaccttca tttacatccc atttcagtac ctgaacacag 126361 tcatgcttta gactctgaaa ttccttagaa atagtaccat aaggtttata gctttgatat 126421 tcctctctga tcatgatctt ccctcataat cacctcccca gttcaggctt ttagtccatt 126481 atcctctttt tccctgcctg cctttctaat caggatggaa tcaatggata accattctta 126541 cctattccat tgaacatcct tgacttggct gaaacccttc acattttcta aacatatttt 126601 tgtcaattct ccatagctag gaaaaattaa tttctttctt tagttgcttc tgggatttct 126661 aaagtatcaa gtcatgtaac cttaccactt aattccattt cttattcatg ctgtaacatc 126721 agctgggatc ttaatgttac tagatgcacc taattatctc tgtcttgatg acctatcatc 126781 ttccccatag cagttgatcc agcccaactt cattctcctt ggttcccagc gtaactctct 126841 ctcatgctct acagatgacc cctccaccgt catccattgt gaatttctgc aacatggtac 126901 ctgacatttg gtaggtgccc agtacgtgtt tgttgaataa accattgaat tttctcctgt 126961 acttcagcaa ctcttgtacc ttcttctatc cctttgtttc ctgtcttccc ttttatgatt 127021 atgttctgct ggactctcaa tcatctcctc tccctttctt gcattttcag tctccctctc 127081 ccccacaccc ccaggcttca tccaccctcc tcccagataa ttctttccca tgtgttacta 127141 tctttctact atgtgttttt aatcctgttg acagttggta ggaaatattt gatggtttat 127201 tctacctgta gaaaagtcca aattctcttg cagggcattc agaagtctct tacatggctt 127261 cagcctgcat tttcagcctt atacatgtat gtgactctcc tggcatccta aaaccattga 127321 ccttttcttt ctctaggctt tcattcccac cgttctgccc catctgaaat atttttcctt 127381 tttttgtcaa atggggttaa tgtatgataa agatatttgt caactctaaa gtgccacttc 127441 ttatgacccg tcaaaatcta tatatttata aatgttcacc tgaaaagatg cagcctttac 127501 aagtctttac aggtgccttg ctctgctata ctacccttgc tttacctttt tcttataaga 127561 ccataactta tcttccaatt aattgtagtc atatctcttt ttatctgtta gaatccaaac 127621 tatagtttat gtgtctggta tgtcttgcac ataggtcctc aataaatatt agacaaagta 127681 aattgtatat tttaaatttt ttttcatagc ttttggtaag gacttcatgg tccacgattg 127741 gatattgtag aaagcttatc cagcccactg cccaggacga ctttgaatgc ggaccaacac 127801 aaattcataa actttcttaa aacattatga aatttttttg cctttttttt tctataggcc 127861 ctgttataga gcctcccatc ttttagcaaa gaactgtcta 
atacaaatgt aatgtgagcc 127921 acatactttt tcgttttctt tttttaagat ggagtctcgc tctgtcaccc aggctggagt 127981 gcagtggcgt gatctcggct cactgcaacc tccacctccc gggttcaagc gattctcctg 128041 cctcagcctc ctaagtagct gggattacag acatgtgcca gcatgcccag ctaatttttt 128101 tttttttgta cttttagtag aggtggggtt tcaccatgtt ggccgggctg atctcgaact 128161 cctgacctcg tcatccacca acctcagcct cccaaagtgc tgggattaca ggcttgagcc 128221 accgtgcctg gccttacttt ttcattttct aatagacagt ttaaaaatag taaaaagaaa 128281 caggtgaatt taattttaac aacattttat ttaatccagt atatctgaaa tatcatgtga 128341 ccaatatgaa aaatttaata agatagttca cattcttttt ttatgtttct ttctttcgta 128401 agaagtctta acacttcttt taacatttaa cagcacgtct caattcatac tagccacaca 128461 ctgtgtttga tagttacatg tggctggtga ctactatatt tggacagtgt agttcctgga 128521 tgacttgaaa tttcttatca ctctttattt ttccattaag acagacggtt tgtataacct 128581 agtaatgttt tctaaggctg agccactgaa tttttattat tgtttttaac tattcttgtt 128641 tttaaaattt aaaaaaattg gtttggattc agtataatta gtatatcagg tcacttgcac 128701 cattttattt tttaacttct tttcctgtag ccctgcaggc gggcagggct tctttcatgt 128761 ggacaaaacc agaaaggtgt tgcatcactg atccacacac aaggtgggct cctgccaact 128821 ggcagcctcc ttgttccagg aaccagccct ccttttgaca cttgaaatgt aaaatcagaa 128881 tagaagttta acgtacctga cttaaaacat cttcagggga tgctacgtga caattcttga 128941 agaaggagaa acgagctacc attttcctgg aaacaatatt tgattcacag tattttgttg 129001 tttggtagtt aatgaggctt gttttgatag aagatgagtg tttttctatc gctaatgggg 129061 catttagaga aacattcctt gtaggatgag ctctgtttta tgacctcatt gaaaaagagg 129121 tatgaaaaag aggtaaatct ttaacatagg caaaacaggt agtgtttcag aaatttatgt 129181 gtaaattacc gagtatattt ttctccagtg ttgtaatggt catattctca gatcacctca 129241 aaatacctat ttatgcagtc tgtaatgtag cataatccag actgttacta acattatgta 129301 gcataatgtt aataataata attacgtaca taatgttaat aatacacaga ctccagccat 129361 cgtgggtaag atttttttcc atgggaaaat tcatttggaa agctaacttg taagactagc 129421 caggtaggca gcctagccct ggaggcagag ctgctcctca cctggaactg cagaatctga 129481 ttgaacccag tggtccatgg gggttgctga attcgtaaag caaatgaaat gttagaagac 129541 
cctctaattg ttgtagagtc acacttttcc tcttgcagag gaggtactca catttttttt 129601 aagaaacaat ctttaattat cgatgtattc ttttaggttt aggtttgcag acactggtga 129661 gcgtgtgctg agaggtaact tgatgtatct ttctggataa ttagatgatt ttttttttct 129721 atttctttga acattttttt tttaaagagc tgcttaagat ctggaggagt gattcctttt 129781 gaccgtgtct tattttcatt aagactgtat aacttggaag agtttttttt tttctttttc 129841 tttttctttt ttttttaaga tgaagttttt gctcttgttg cccaggctgg agtacagtgg 129901 tgtgatcttg gctcactgca acctctgcct cctgggttca aacgattctc ctgactcagc 129961 ttcccgagta attgggatta caggtgttca ccaccacacc cagctaattt ttgtattttt 130021 agtagagaca gggtttcacc atgttggcca ggttagttaa tgttactctc tggtttgggg 130081 ggcagtgata tgttttggct tttttcccac ccagatctct tcttgaattc ccatgtgttg 130141 tggaagggac tcggtgggag gcaattgaat catgggggca agtctttccc atgctgttcc 130201 tgtgaagtga ataagtctca cgagatctga tggttttaaa aagaggggtt cccctgcaca 130261 agttctctct ctttgcctgc tgccatacat gtaagacatg acttgctcct ccttgccttc 130321 caccgtgatt gtgaggcttc cccagccatg tggaactgta agtccagtta aacctctttc 130381 ttctgtaaat tacccagtct caggtatgcc tttatcggca acatgcaaat gaactaatac 130441 agctcggggt ggcttgtgtg ggtgcgtatg gccatgtatg caggtgtgcg tgcattggaa 130501 tatacaattc cgtattgaac ccagtttaaa cttcatcagt tctggtgatt gctggagtgt 130561 cacatatggc aagacgtcac cttgccctct aactacacca gaaaaccagg ccagcaccat 130621 tgactatcta gatacagtgg gccgtttata ttattattgt ttcatatttt tagattattt 130681 ggtaaacaat ttacagaact aagaataaag gaatagtgag aaataactat tagcaatttt 130741 aataagagtt aacatagtaa atattttaat tttacgaaga atttaaatat ggtaaaccca 130801 catgtttatt ctgtattttc ccagttttga aggtatggat ttctgaaatg agcattatcc 130861 tttggaatgc agctagctgc taatcacctg ccatgcacga acctgaaata tatccgtctt 130921 acaaatactg gttctacatt tcaaatacat ctctattagg ttcatcatat ttcttggtct 130981 ttttttttta tacttgcaat ttcatttaat tattgtaaaa atttaagata tttaaaacat 131041 tgacagctca taattttatt gaagagagaa tatagccatc ttccctcagg aataaaatgt 131101 tttcaagttt ggatttgtaa acagctatta aaatgatcaa gccagtcttt ttttccccct 131161 ttttttctgc attatctctt 
ctaatcacct tagagagatt cctgccttgg gcatttgtcc 131221 acttcctaat ggtatcaaat aaatgtgttg cacagacata aaatgtgttg cataagcatc 131281 ataaaatggc aaggaatact gtcacatcat tttagcaatg atggttaatg tttttgtcac 131341 ataatattga gtgatgatat ccttcctgtt gaatgactaa gtggatgaga ataccatatt 131401 tcctttttat tcttagaaat acattgctca gccgggcgca gtagctcacg cctgtaatcc 131461 cagcactttg ggaggccgag gtgggccgat cgcctgaggt tgggagttcg agaccagcct 131521 gaccaacatg gagaaagccc atccctacta aaaatacaaa attagctagg cgtggtggcg 131581 catgcctgta atcccagcta ctcgggaggc tgaggcagga gaatcaattg aaccagggag 131641 gcagaggttg cagtgagcca agattgcacc atcacactcc agcctgggca ataagagcga 131701 aactccacct caaaaaagaa agaaagaaag aaagaaatac attgctcact tgtcaattct 131761 tgagcttgag aaaattcatc taggatatca tgtgaaggct catagcctaa ggttttactt 131821 acgtgatcaa tttcattctt cttctggttt atctaattgg taaaatgcct ccttctgtca 131881 gttgttccta tttgccaata tgtggaagat gatttttgac tatattcagt gagtctccag 131941 acttcttttt atcatgcttt ttaaagagat gggacctcac tctgttgccc agacttgagc 132001 gcagtggcta ttcactggtg tgagtatggt gcattacagc tgcaaactcc tgggctcaag 132061 tgatcctcat gcctcaacat cccaagtagc tgggactatg cagacttttt ttaaaccatc 132121 ttgaaagatg atgcagtact ttgggaacac agcaataaaa atgtgataat ttccctattg 132181 catcaccttt ctaggcaatt atatcattct ttcatgctca tattattttg tcttctttag 132241 cacagttttt aaatagttga tggctaatga tgaaataatt cttcggatga taaatatttt 132301 gtaacatagg aacacaatat aggaaaaaga tcgaggttca taccatatct cagactcatg 132361 gttaagtaca agctagaatg agttttagtc gattaaattt ctttttctct atgaatggat 132421 aaagtgagaa tatcagccca caagtagtta gaagtgctca gaactcaaag gtaaaattgt 132481 gtccaagacc tgatggtatt ctaaaggtta acagcctatc ctacttacca agaatttaca 132541 tttattatga ggcctgtaca acctgatgca ggaaaatact tggcataatt ttctataagt 132601 taggtagtta ggtgctcaac aagtatttga gttgaatcac attggcattc acatcaaaca 132661 catacaccta cgtcatcatg ttaacttaag gttaatagtg tcaccaaaac taactggtct 132721 ttaaaaaaga aaaactaggc caggcatggt ggctcatgcc tgtaatccca gcactttggg 132781 aggccgaggc tggcagatct cttgatgtca ggagttcaag 
accagcctgg ccaacatgga 132841 aaaaccccct ctctactaaa aatacaaaat agccaggcat ggtggcatgt gcctgtaatc 132901 ccagctactg gggagactgc cagagaatcg cttgaaccca ggaggtggag gttgcagtga 132961 gctgagatcg caccactgca cttcagcctg ggcaacagag caagactctg tctcaaaaaa 133021 aaaagaaaag aaaagaaaag aaaaactaag gaatatcatt ctagacatgc agtaattaag 133081 taaaattgca tcaggagtct aataacatag catgatgggt cccatgtttc attatttcca 133141 tgggcagaag agggatggct acattataaa taaaggaaaa ttgtaagaat aagaaaagat 133201 gaagtatgaa atatttccaa ctcaaaaaca aactgtatct gagcatgcag aggtacaaaa 133261 tgtcaccttt ttcctccagt gtaatgaagg gcactaaacc tcttgagcat ttgatatgtt 133321 ccagatacta tattaaagtg attttttttt ttaggagaca ggttcttgct gtcacacatg 133381 ctggagtgca gtattgcaac catagatcag tgtaaccttg aactcctggg ctcaagtgat 133441 cctcctgcct cagcctccca aatggctaag actacaaaca cattgccacc atgcctgcct 133501 aatttatttt gatttattta ttttttttgt agagacaggg gtctcactat gttgcccaga 133561 ctgttttcaa actcctggcc tcaagtgatc ctcctgcctg ggcctcccaa agtgttgaga 133621 ttacaggcat gagccaccac gccagaccaa ggggtgaaat cttgataaca ataatcaatg 133681 cagtcatcat aaaacgcttg cacaggctca atttgtaggt ggcacagtgc tacatctagt 133741 caaaagcaaa tttaattatg gtcattgtgc tctgagaact tttctcagaa aataataacc 133801 agatggtatt atagatggtg catttcacaa atacatgatt tgtagttagt gaaaacttgt 133861 cattcaatca tgtaataaat tatgcacatg ccggtcttcc acttagctag tgatgttgaa 133921 acattcatag ccttacctga tgagctcagg ctgtcatgag aatttatgtc tattcagtac 133981 tttgtcttta aaattaggtg agagagaaag agaggctgta tttgactata tatcttttta 134041 tttgtatctc tggagttaca agtttttgaa tatataatct ttcagctgcc tttctgcatt 134101 gcattttgag aaatgatatt aaatgctcaa ggaaaaaaaa cctaacaact ttagagggcc 134161 gcaaagtgaa tcatacaaaa gataaattaa agctttaaat ctttgccagt ggatgagtca 134221 agtaagagtg gatttaatga aatggctcat ggctttgaga agcagctgtc ataatgctgg 134281 gttggcagct tccaggactg tgtgaggctt tctgactgtc tatactgaaa taggagaact 134341 catttgaaaa tgacgaatag gagaatagca gaaacttgca ttttcttttg agggaactgg 134401 ctgccagttt tgaggtcctt atttagaaat gctcgattcc tgtattgaat aaacttggac 134461 
tacatgtatg tcaaaaagaa aagcaaaaca cttttatagc cttaagcaga ttacagatgt 134521 gtggccaaac cagtggggtc tgtttttgtt tccaagcagg tgatggctgg aagaggagat 134581 gcctctgtag tgcagggctt gagttgctgc tgtctgagct acatctggca atgagactgt 134641 gctgcttact gagacctaat ggcacaggtt gccccagtaa caggatattg ctgaatgaga 134701 aatactgtag gattcatttg taagagttat ttggagggtt ggattattca gtgctaagga 134761 tcctaagcta ctgatgcatg gactcttttc ttccctccag tatcttttaa ttaaactgta 134821 caaattccct ggcttaaaaa tagtaggaaa agaggaagaa aatacagatt cttatgattt 134881 taactggtct ctggcaatag tacaccgaaa cattaatttc ttgttgaaac catctggata 134941 tttgtatctt cagtgtaaaa cccctaattt agataggact ctccattgtg acatggaaaa 135001 ataacccttt aattagttaa ttaattttac tgtggcctca aaatctacat attagcatat 135061 caagccaaaa actgttggct ttacttggga gccatgtggc ttaaactagt accagtaact 135121 taccacaagg aactcagcaa aattataagt ccatgtttgt tttggaagga aggcaattat 135181 tacgaaaata atttagagat gttcaaacta tttcctttga gtagaggtag agagtgaaca 135241 aaatctgtcc caagtgggag ctaactagct tccttttttg ccctccctgt ttacaaagta 135301 acacttttag tttactaggt aacatttgca gtatccccag tgtttcaatg taaaattgta 135361 ggtggtagac taatgccact tagccaaagc catgatcacc tgaattttta atatatattt 135421 ttgtatggag tgagatactg ggagaacagt cttcattagc ttaaactcta gaatgtactg 135481 tttgtttgtt tgtttgtttg tttgttttga gatagagtct cgctctgtca cccaggctgc 135541 agtgcaatgg tgcaatctca gctcactgca acctcccctc ccaagttcaa gtggtactcc 135601 tgcctcagtc tcccgaggat tacaggtgcc tgcttccaca ctcagctttt tttttttttt 135661 tttaatggag acggagtctc actctgttgc ccaggctgga gtgcagtggt gtgatttggg 135721 ctcactgcaa cctctgcctc ccaggttcaa gcgattctcc tgccttagcc tcccgagtcg 135781 ctgggattac aggtgccaac caccatgcct ggctaatttt tgtattttta gtagagacga 135841 ggtttcacca tgttggtcag gctggtctcg aattcctgac ctcaagtgat ccactggcct 135901 cggcctccca aagtgctggg attacaggca taagctacca tgcccagcca agaatgtaca 135961 atatttttaa agaaatgcgg tttattacca taaaatacag gttgagcatc cctaatccaa 136021 aatctgaaat gccctaaaat ccaaaatgtt ttgagcactg acatgacagc aaacgtagaa 136081 aattatgtga ctttatgtgt 
tcaatgtata caaacttcct ttctgaaaca aaaattattt 136141 aaaatgttgt ataaaattgc cttcaggctg tgtgtataag gtatttttga aacataaatg 136201 aattttatgt ttatacttgg gtcccatctc caacatatct tatatgtatg gaaatatccc 136261 aaaatttgaa aaaaatctga aatctgaaac acttctggtc ccaggcagtt tggataagga 136321 ttactcaatc tgtaaataca gtagctcaaa ttgattttgt aaggcagtca agttattatt 136381 tatagtattt taatttgtat tattctttta atatagcaat tcattttata tatttatata 136441 aataaatttt ttgcaattga gcaaattaaa aaaattaagt gcttccttca gatctacatt 136501 ttattttcta tttgttatca atttaataag taaaggacac acagatcatc tgattgtgtt 136561 gtcagacatt tgaaggtgca actgatgaat gttcactgtt gtactttgtt ttgtcatgta 136621 aattgtttta aagagaagtg aatgttgagc atatagtagg ccacacgtgc ctgtggattg 136681 gctcattgtt ttaggaatgt gtaattctgg aaaagatttg ccaagggtat tcaaaagtat 136741 agtttgatat tagaaattat ttgaaaataa tctttatagc atagcaccat tttatttaga 136801 tcaatagtaa ttcaagcagt atacataatt tgctattgta agcgtagcat ataacctaag 136861 ttttctttta catttgagac cactgtttgc tgttggtcga atttggtctt ccaagatttt 136921 ttgcttagtg acctaattgc cagcctttgt gaagagctgt tacttattgt caaaccaggc 136981 agtaagggtt tgcttttatc tttttatcag ttctttcaat gctagtagtc atgagattaa 137041 tgtagttgga tactaaaatg ttaatacgtg ttagataaca aactgggaaa gagatgaatg 137101 acatcatgaa ggaaagggca gtattgcact tacagtgatt tcatgatgat agaactaacc 137161 tgcctaaaaa gacattgggt ggaaacagaa accaggtgta atagacgcat ccaagtagtc 137221 aaggaaagaa aagacttgta agtaaataag ataatctttg cttcaaaaat atcttcaaag 137281 gtttaaaatg tgctttaccc tcatatataa tctttttgag cctatttctt tacctcaaac 137341 atggggaaaa ttgcaataat ctgtatcacg gtatttgctt ccaaaacgac taattggtgt 137401 aaatgcatat caagccggta aagtttacag tggacttgtt taaaatgcgt agattggaat 137461 tgttggggga gcaggccagc agcatgggat gttttttttg tttgtttgtt tgtttaagac 137521 tctgtcgccc agactggagt gcagtggcac aatctcggct cactgcaacc tccgtctcct 137581 gagttcaagc aattctccta cctcagcctc ccaagtagct gggattacgg gtgcctgcca 137641 ccacgcccag ctagttttta catttttagt agagacaggg tatcaccacg ttggccagac 137701 tggtctcgaa ctcctgacct caggtgatcc gcctgcctcg 
ggctcccaaa gtgctgggat 137761 tacaggcgtg agacaccgcg ccagctgggg tgattttttt cttagtcaaa tcacttctct 137821 acccatcatc tcacactgtc gaatacaaag actctccttg gggctcaatg actcatctcc 137881 cacctctggt ttgataatca tggctattat aacatgttcc ctttttaatt agttatttta 137941 aaattattgt aggtcttgta atcataatca ataataatac gtaatggctc cttaagcact 138001 tctatcctag ataaggttga gaacattatt aaaattagtt ttggtagtaa tcaagtaatg 138061 ataattctta tttatgaatt taaatatgta ggagtttttt tattttaggg aagcaaaaag 138121 gactagaaag tatgatgtct catgattaga aaactgctat aaaaatgaag gtagaattga 138181 tttgggctgt tttggacatt ttaaatagaa atttaaatag gtattcaaat agcaaagtca 138241 ttctacttta aaacttctat acttttttag gatcttcaat tttatattta aattatattt 138301 ttgaaatggt tttagaagaa atattgtatc ttaaactcta actactaata agtaagagaa 138361 gatatatcaa aggtttgatc ttttatctcc tagaatatac tattcattcg tttcccattg 138421 tacgtcattt gaatgttaat ggttagcctc aaatggtttt caacataatt gtatagctac 138481 tattattgga aaattggagc caaggactca tacacatatc agaagtggta atgaagcact 138541 ttggttttat gaaaatatgt ttacaaggta gtcacaggat agcaatattg catttactgt 138601 aatacaatga tggtggaact aagtgcagtg gaacaggatg ctgagaccac agttaccgca 138661 tctctaagtg cctcgtctag ttatggtcca acctcagata taactcattg gtgttgaaac 138721 acacctagca acattttaag catagaccgt tcattaaaag gtaccttttt aaaaattttt 138781 atttttaaaa attgacatgt aataattgta cacatttaca gggtacacag tgatcagatc 138841 agactgatca gcatattcat ctcaggcatg tatcatttct ttgtgttggg aaaattcaat 138901 atcctgcttc tagctgtttg agattatgtc atatattact gttaactgta gtcatcctac 138961 agtgctgtag agcactagaa cttattcctc ttatctagct gtaatttggt atcctttaac 139021 aaatctttct ctatcctcag tttctaccta cccttcccag cctccatcat cctctgttct 139081 actctttacg tctatgaaat caacattttc tagcttctgc atatgagagc atgtggtgtt 139141 taactttctg tttttggctt atttcactta acatatgtcc tccagttcca tccatgttgc 139201 ttcaaatgac aggattttat tcttttgtat ggcttaatag tatttcatta tacgtgcatg 139261 cacgcacgga cacacacaca tacacacaca cacacacatc acattttctt catccattag 139321 tctctttgga catctaggtt attccatatc ttggctattg tgagtagtgc tgcagtaaat 139381 
atgggagtgc agatacctct tttaatatac tgatttctct tcctttagat aaatgcccag 139441 tagtgggatt gctggatcgt atggtagctc tatttttagt tttttgagga atctccatac 139501 tatcctcaaa tggttgtact agaacagagc ttttttacct tccttccatt tgggaatctt 139561 cttacccaac atgtatatta gtaacttggc tatgtatgtt ccattctctg tctcaaatac 139621 tgtgtctagg acatctgcaa tctttggcta tgaccacgac tgtaaagtgc atgacgcttt 139681 tgctctgtca tcagtattag ttgacaggca agtaagatag tacttctctt cataacccct 139741 ttacctgccc agcagaatac cagtacagac agacccagaa tatctctggg ctcaagaatc 139801 cagacattta taggatggga cttttatcta taaatcacct taagattagt aagttcttag 139861 aattggttac ctttgccatt tctcagaagc cctttgtcta gtaacctgtg tgtcactgtc 139921 atttccagcc tgtctttcct acacattgtt tatgcaggct ttaggagtgc ccacccccca 139981 gttctacccc ccacccccac cctagccttt gtggttttaa ttatcctgtc ttggctgatt 140041 ttgttttgtt ttgacatttc atccattaaa ttcctgacac tggctatctc tttacaatta 140101 ttatactatt tttctttctg gctcgctctt cctgattgct gtaatccatg cccttgactt 140161 taaataccat ttgtaagtag actaggccaa ggtttatttc tctaggccag actctaccta 140221 gttatttcat aggcatatga atcctaacat gtccaaatga gagttcttaa ttcccctgct 140281 acccagtcct caatgtggtt ctctcaaagc tattctatct cagtaatggt cctatgccct 140341 ctcctaggga ttattgctga gaacctagaa ctcatcatcg tctcttctgc acatcctatg 140401 aattctggat ttatccactt atctccatct tttccagcac taatctaaac tcccattgtt 140461 tcttgctgga gttatttccg tatcctgcta atctctgctc tcgttctctc tccctgacac 140521 atgcgctgct ttcttctccc acaccaagcc attctctgta cagcagccag gatgtagagt 140581 gtttatttac ataaagcaga tcctggaacc cccaaattta agttgtgttc gtggcttacc 140641 aatacgctta aaaaccaagt gcacagagag aaaacaatga agccttattt ggatatgctg 140701 aggataattg ataacgggta acaactaaca aacacgtagt tggtgtcagg tttgacctcc 140761 tacattcctt acaggtatta gtgtatttga cccagctcct ccctacctcc gcagtatcac 140821 cccatgcctc tctttccccc gactctcccc tgctcagtgt cacacaagtc aatatcccca 140881 gcttgctaac ccccgggctg acagatctca ttgtctataa tggatgttcc aaggtcccac 140941 cttcagcctc agttttgttg ccatctttcc agaatgaatt cccttgatca ttctgttttt 141001 tggagacaag gtcctgctct 
gtcacctagg ctggagtgca gtggcatgat catggctcac 141061 tgtagccttg acctccctgg ctctggagat cctcctgagt agctgggact acaggctgca 141121 cctggctaat ttttttttaa tttttagtgg agacaagatc tcactttgtt gcccaggctt 141181 gtctcaaact cctgaactca agcagtcctc ccacctccgt ctcccagagt gctagaatta 141241 caggcatgag ccaccatgcc tggccctgat cattcttttc tgaagaaaac accccttctt 141301 cattattctc aatcctagca gcctatttac ttccattctg aaatgaatca cattttagta 141361 ttttattggt tcgttgtcgt tctctccccc catagaatgt aatctctctg agtacaggga 141421 gcttatcttt ttcccttttg gagcctcgga tccagcagaa tgccagacac ttagcagcca 141481 ctagcatttg atgagtgaat ggcttttcat acacaatttt agacattgtt cctgattggt 141541 ctctctagtc aataatagat ttatcaaatc taatcgataa tagatttctt tttcttttct 141601 ttttttcttt ttctttttct tttttttttt gagaagtctt gctctgtcgc ccaggctgga 141661 gtgcagtggc atgatcttgg ctcactgcag gctctgcctc ccaggttcac gccattctcc 141721 tgcctcagcc tcccaagtaa ctgggactat aggcgcccgc caccatgccc agctattttt 141781 tgtgttttag tagagatggg gtttcactgt gttagccagg atggtctcga tctcctgtcc 141841 tcaagatccg cccgcctcag cctcccagag tgctgggatt acaggcctga gccaccgtgc 141901 ctggccgata gtagatttct tttattgatt agagagacta atcagcaaca atgttttaaa 141961 attctcacat agatctctga atgggaaagc agtcttgaat tatgcttgca gggagagaga 142021 ggggaaggaa ggaggaggaa tggtataatt agacttcata aggctagatt ttgtttctac 142081 ttaaataatg gtatcacagt gggacatatt atccatgcag aaagtgagtg ggcatctggt 142141 ctattatctg tggtctttca ggaatggcag ctctttagtt taaaaaacaa caaagtgcag 142201 cactgtttat ttaatggaaa tagaaaagca gtgcctgtgt attagtggct cagatttttt 142261 tgttctttgt tttcagctgt ctgctgtatg cctctccagc ggggtcttac agaatctcaa 142321 gtctaccatg atcaaagctg aacccattag ctccatttct taattcctcc tgtatcctct 142381 gttaccatta gtaactattc tccaactcaa agcatcagat tcctcctctt actcactccc 142441 ttcatcccca gtcaccagtt ctgtgctgtt attttatcca agttctcagt aggcttcacg 142501 agcaccgcct tcacatccca actgtcgctc ctccacctat ctgtcctcca tgcttctgct 142561 agctcatgtt tgttgcaata caactaacct tgccactctt ctgcttcagg cttttctact 142621 aggctctgca tctagatgat ggtccagact cctcagcctg 
tgaaacaaag cccttcataa 142681 aagagcagag ctacagctag tcagctccct tcctgcagtc caccccctcc tccaccatac 142741 tgagaattgg caattcctgt agcactcttt gtatgagtcc attttcacac tgctataaag 142801 atataaccaa gactgggtag tttataaagg aaagaggttt aattgactca cagttctgca 142861 tggctgagga ggcctcagga aacttacaat catggcagaa ggggaagcaa acacatcctt 142921 cttcataagg cggcaggaaa gagaagagcg aggagcagag tgggaacagc cccttataaa 142981 actgtcagat ctcctgagaa ctccctcact gtcacgagaa cagcatgggg aaaccactcc 143041 catgatccaa ttccgtccca ccaggtcttt ctctagacac atggggatta tgaggattac 143101 aattcaagat gagatttgag tgaggacaca gccaaaccat atcactcttc atctgctacg 143161 ggtgccattc cctgtttcta cagtgtcctg tttgctacct tctttgtaga tttctattca 143221 ttcatcatgt cccagtttac ctgttacctc ttcaacagat tattcttcag tctgtccctc 143281 ccaaactcca ccaatgaaca ttctttgttc agcccattat tgttaatatg cacttcactg 143341 aaaacactgt tttatgcttg tttacatccc atctctgcag ggctctgagc ctatagtcag 143401 cagttgggaa cttactaggc atagcgccca gtaaaggata gctctttata tatatatata 143461 tatttttttt ttttctaaaa taataagccc caagaaaaac attacctgtg aaaaacaaat 143521 gatattttta taccattttc aaaacctaat agcaataaaa agaaaagaaa tgtcataagt 143581 atacatttaa gctgtagtta aaaacagtaa ataggcacag tatttaactc ttatcatgtg 143641 ctgttccaag tgtgttattt acatcattgg ctgttcacac catcccctga ggtggttaat 143701 atttccatcc catgttaaag atgaggaaac caagtcacag cagcctaagt gacatgttca 143761 tagtcacatt ggagctggca ggtggcagca ctgagaatga aacccacgtg gctgagctct 143821 ggaaagtcac cattaatcac cacactagac cttctctcta ggatctacat aactagaaca 143881 tacaattaac tggagtatat gtaagtgtag tcgttccttg agaaaggaaa agaattcaca 143941 aatttcatag cctcatccct tgtgcatttg tggcattgtg ctaggggtgt ctatttgata 144001 tttgacccta cagatttctc agatgccaac caaccacaga tactttaaaa aaagatactt 144061 aatacttacc aatgcaagaa agtgggcttg ttggaagagg aaaaatgtta gctcactaat 144121 ggaaaatcaa ctaaaagtct tacctctcca gaacatgctt ttcctaaatg ttgtggctaa 144181 cccaagaagc tacatcatct ttacctttat gatacaattt aaagtacttt tctacttact 144241 tcaagccctc agagcacaga ggcaagggag gccttgggct ataaaccctg tagcctgagc 144301 
cacatggctt cttctcctgg acagatacct ggcatattct gccactgagg atcagtggtg 144361 cagatgtatc tgttttctcc taggaatggg gctccactta cgaatctggg gcaggacaag 144421 gtattgctgc attttgggga gcgtgctgga aggaggagca gagtttgttg tttttgcttc 144481 ctgatatgga ctggctgtgt ctccattcaa atatcaactt caattatatc tcccagaatt 144541 cccacatgtt gtggagggac ccaggggagg taattgattc atgggggctg atcttctctg 144601 tactattctt gcgataatga ataagtctca caagatctca tgggtttatc aggggtttct 144661 gcttttgctt cttcctcatt tttctcttgc cttttcctca tttttctctt gccgccacca 144721 tgtaagaaat gcctttcacc tcccgccatg attctgaggc ctccccagcc atgtggaact 144781 gtaagtccca ttaaacctct ttttctcccc aatcttgggt atgtctttat cagcagcatg 144841 aaaacagact gatacacttc cttaacagaa ccaattattt tttcccaaaa aagaggggga 144901 gaagcccaga agcatgcagc ttctctcttc tttctaagga ataggtcttc acaaagtgcc 144961 cccaaaagaa aaggaccagg cttttctctt actttgtgac tttcctaggt tgctttgtag 145021 ccattttgga cacagaccac ctatagctgt gtagtagttc tcatgtgtat ggacacagtg 145081 gaaggagatg ccaggcacat agttcttgct gcattgacac cttttcaggg catcttcttt 145141 taccatcttc tgtagaaact gcaacagccc ggccaactac cagccccagc ccacccctag 145201 ctttgctgct ttcattttct ctgtaacact tgtcaccttg taatgtgcta gtgtaacatc 145261 gttgttttag ttttcttata tctttccttc cctgtagaat gtaaatacca ggaaggcagg 145321 gaattttgtc tgctttattc tctgcagcat ccctagactc cagagcagac cctggaactc 145381 aaatatttgt tagacagcca aaggtcatat acttccaact ctccacccaa aatatatatg 145441 gatcacccat ccttagtagc agctaaacct ttggacatga ggaatttgct cgttcacagt 145501 tttctggcct agcatctgat taaccaaact ttttcacata tcaagaattt aaaatgactc 145561 caaaactttt tatgacctta ggtgcttctc cttaattcta gttggttcat aaagctgtca 145621 gaacaataat ctgcttttga taatatatga cttttcttgc ctccaagata gcctttaaaa 145681 aagcaattta gccttttact gtaattatat tccaaattta agtcttgcca gaatatcatt 145741 ttcccccttt aaacatactt tttatgatgc caagatattt taaagaaaag ttcagatagg 145801 aagcatattt aagatgtcag tatatgtctc taaaatctaa gaacattttt ctacagagtc 145861 cagatgatac catctcaccc acaaaattaa cataaatttt tagtaccatt ggataccatg 145921 tccatattca gattctccct 
gagttccaga aaggtccttc acaggttatg catttcacat 145981 cctatccggc accagggcca gcatcttaca tccttaattc ttaggattca gaatagcact 146041 ttttatatcc tgacactgac ttgctgaaga agctgtacca gttttcctat aaactgtcca 146101 ctccttggat ttgtcttata tcatgtccca ggggtattgt ttagattgtt gctctattgc 146161 taggatattc atgttaccct aagtgtgtac tgagatagca gagtaaagag agattgtctt 146221 ttaaaggttc tgcccaggaa ctgctggttt acacaaaagc tgggctttag tgcttctcgg 146281 atctctttgc ttgtttcagt aatttttttt tagaaattca ctaattcact aacgtaagtg 146341 gggcagggtt tacaggagca tgtgttgaga gaagttggtt tgactttaaa tggtcatttt 146401 tacaatccgt atttaggcct tacaggtcgc atacaatctc tgtcacatat tgctgtcatt 146461 tttaccccct ttcttttctt tttataatcc tttaaaaaat gcaaagccca cccttagctc 146521 acaggatgta caagaacaag ttgcgggttg cagtttgcca agacctgctc tgtgccactg 146581 ctgacttgcc attcttggag cccactttct gttcttgtca gccaattccc tttcaggtgc 146641 tcaggtttcc ctccctcctg tcaccgggcc tttgcacatg ctctacccac cctgcctagt 146701 ggagagctat ctttggcccc tggacttcat tcacttcgga ccttctttgg gtcacatttc 146761 agtcatgcct cactagacac ctggaatctc tgcctcacta gacacactag cactcactct 146821 tttgggtctt ctggctgtat gtaattgtta tcagggcagt aattttgcat ttatacaggt 146881 gctgaggtca gtgtctggcc tttggctact tgccttgtga gccacctttg ctttccagtg 146941 tacccctagc tcctggccca gtgtccagct tcccagcatc cattcatgtt ttccctggag 147001 accattctgt gcccagcagc cagagacatc attgcaactt aaatctcatt gagtcgttcc 147061 agttgctgta agaaatgtaa ccgtactggg taacactatg cctgtctgtc tccaccacca 147121 gatcttatac ttcctccctc atctgctact ccctaagctc actggccctt tgttgttttt 147181 gctgaacgtg ctaatctctt cctcacacct gggcctttgc acttgctctt acccctgcct 147241 cagccatcca tgcactctct tggttttgcc gtgcccgact tcagcatttt tagttcacat 147301 ggctcttccg tggggccctt ccctgaccac cctgtttgaa atgctaccca tccctggcac 147361 acatctatca tagtcattca ttagcaatta gccccatctc ctttccctat agtacttttc 147421 cataattttt tttcttgctc atattacacc tccccaacta atagatagat tccaagaaat 147481 tgagtgcttt atcttcacta cctggaatag tactggcact tggtaaagca cagcgaatat 147541 tgttgattga attactaggg gttggcttag agaggtactt 
aatggaatga ttcaatggat 147601 gaatgaatga atgcattgcc acttttttgc tcattatcat caatttgtac atatgccttt 147661 caactcaaat atttatccat tggctaaaga aaaaaaatac ctgaatcccc gtgtctctgc 147721 agtttaccct tcctcactgt taaaactcag tgcctgtgta aaatttcatt atattttatt 147781 tacttcttat tccacttgtg cattatgtgg atattcctgc ctcatcccca cagccaccta 147841 tcacttccgg ccaactcaga tgcctctggt cgcaacaatc acttctctgt tgaatttctg 147901 aagcgtttaa tctgcagtat gtctgttact catacagcag ttagtgtgaa ttgtctcact 147961 cagaataagt attcattaaa tgtgtaagaa tgatgatata gcctaatttc agatcaagat 148021 tgtgatttat agctcattct taatcagctg tgtcatgact ttatttattt attccttttt 148081 tttttttttt tgagacagag tctcactctg tcacccaggc tggagtgcaa tgcagcttct 148141 gcctcctgag ttcaagtgat tctcatgcct cagcctccca agtagctggg attacaggcg 148201 tgcaccccca cgtccagcta attttttgta tttttagtag agacggaatt tcactatgtt 148261 ggccaggctg gtctcgaact cctggcccca agtgatctgc ctgcctcggc cccgcaacgt 148321 gctgggatta caggcatgag ccacctcctc aggcctgagt catgaatttt tggtagtcca 148381 gtcatgtggt tatcttccta aagctggagg ccttaaacct agggccccag gtgggcttat 148441 gcctttagct gcaagtaaca agaaaccctt ttcaaccttt ttactgattt atttagagag 148501 aagtacacat aagaggtaat gcaagtaagc aaagtttttt aagaccacac aactagatag 148561 ccagcatccg gaccagaaca cagagacttg ctggccccta gaatcctctc tcctggcccc 148621 tgccagttac tggcctttcc cctgagttct atttgcctgc agtttgcttg ttttcatatt 148681 taattcaaat agaattaggt gatgtgtccc cttgtgtttc gcttctaaca gtgcctttat 148741 gcgggagaag tactggctat ttcttacttt gccctccctc cccggaaaca ttttcctaat 148801 ttagtagcag taatttcaaa aactgctttc ctttcggaca aactgactta tccagccatt 148861 gtaccttcca gtttagactc tggcagtttg gagagttgag acagagagaa ggagaaccgt 148921 ctgtttatct tcttgaaaaa aactactgta ttttctctct cttctgactc tgggagcctc 148981 tagactcttt tttcccattt tcatatccac agaggaagag tctgaacctc ttcctgctga 149041 gctttcccct ggttggttgc atttcaggaa ctgacattgg ttgtcttccc tcttccctca 149101 tcagtgcaag taggttggat ctcatgatag tccatattca cataaccatg agaatcagag 149161 ctttctctct gtcatttcct ccctgtttat ctgtagcaag aagaaaaaag tccgtcttaa 149221 
cagagactgt gaaagaataa atatagactt tattgtttta gagcagtttt aggttcacag 149281 caaaattagg cagaaggtac tgagattctc ccatatactc cctgttcata cacttgcaca 149341 gcctctccca ttatcagcgt cccccttaga gcgatttgtt acaatcagtg aacctacact 149401 gacaccatta ttatcaccca aagtccatag tttacattag cattctctca gcattgttta 149461 ttctgtgagt ttggaaaact gtctaatgac atgtacccac cattgtagtg tcatactgag 149521 tacttttact atcccccaaa tcctctgtgt tcctcctgtt cgaccctctc ttgcctttaa 149581 cctttccaag ccactgatct gttactgtct ataattttgc catttctaga atgtcatatg 149641 gctttcttag aaaatacaaa tggtgacttt tttggcatga ctttaataaa cagaaatgaa 149701 aattggtgtg tgaatcaaaa tctttcactc tgtaaaggga cttatcaggc ggcacctgca 149761 tgtggaatta tatcaaggtc agctttagtt agaaacagaa ggagagtcac ctttgtgatg 149821 tcatttctcc ttcatccaag taaaatctcg catgctgaca cataaaattt tatcatcagt 149881 gatttatttc ctcgtaggaa tgtccctttg accttattaa attcagatac tctggggatg 149941 gtcagcattt tcagatggtt cccttgttct gtaaaaatgt ttgttattgg tgtttaaaag 150001 ctgtaaaagc cacccactat aaattgatag ccactaaagt ctgcgaatgt ggtatgaatg 150061 ttccactatg ttgccaagcg tttattttat tgtgagttgt tatttgattg tttgtaagga 150121 aattgtttta tgatgctcag ctttatattt ccataaactg ttcctccctt gcaactcctt 150181 aaactgtgat gagcttttca aaacaagtcc acaggacaac agtctgtcca gcatttgaag 150241 aagtggtgga aaagtcattt ttttcctttc agtgtttcta gctgttcatg aaaactgcag 150301 ttaaaagttg taatgttgtt gtgtttggca ttctttggga agatgcagca ttattcaaac 150361 aaaaccgtgt gctcttgttt taaatagtgt ccaaatgaat aattgaattc atggacatct 150421 aaatttaatt ccagaaatca aagctatggt aataagaact tggacctgag ttctcttagg 150481 ccaagtaatt taattcctta gccggttgaa ctatttatta cgtgtctccg ttcctggtta 150541 ttatgccagt aaattgctca ggcagatgca gggatggatg atgggcatat aattcactct 150601 ttttcacggc tagacacata ccagaatgca gatcacacta aacagctgtt tttacaaaca 150661 tggtttctat tcagagcttt aagggccaca tccaagtaat aacaaagaag cctggagaag 150721 gaacctgggg atacccaggg tcatgttttc accatcccac ctggaaacac ggtgctgtcc 150781 aaaccctgga tggtacactt ggacatcgtc attccagtca tggttcaaga tcacgtgtat 150841 acatcatgat ttaaagagaa 
tactggtttg atttttcttc ctgactttcc ctgaattttc 150901 cacttaaatt ggaaaacagc gctagttagt gcccatgatt tgattgaata agtaagagat 150961 ctcaaacacg gtaaattaac tgatgacatg catcaggtca tctttctttg attcaccagt 151021 aatacaacag tgagatccta cagaaaaatg caaaggcaga agctcagggc agctgagcaa 151081 ctgttaagga ttcttttgaa atgtagcccc acgtagtaaa ctttcgtgca ataatttggg 151141 cagtcaaaaa aaaaagtcag ttatttcacc tcatttattt ctacgaggag ataaaggctt 151201 ccaacatctg attttcagtg ccagtgtttt ctgtttctct tttcacctat ttcgtcctag 151261 tcttgtggaa atgttacaat aactgattag gattttctct gtattaacac ttcatttatt 151321 aacatatgag taagttgcta ttaataaatt attctgctta atcagtttct ttttattctt 151381 tagccacatg ctaaacttct tcggatgtct tcacttaata tccttacatc atatttaatc 151441 agagttagaa aggcgttcca cttgggggtc atacacattt ttttgaaact tttataaacg 151501 ccctttatac tattcttgta tcctacgtaa tcagagtcag gtgttccatg gggatcatac 151561 acatctttat acaacttgtg taacacatgc tttaaatcct gaggtaatca gagaattcct 151621 gccattgatg gattaatctc tttacatatc tttccctttc actctcagca aaactactta 151681 tggcggtcct gttagacttt gaaagccttt caggtttatg gaggagtctt cctttctatt 151741 gtcgtagatc atggaatcat agtgttcgct tgtctcacgt gtttggaatc tgatttccta 151801 cataatgctg tgctctcact acgagtccaa ggcttcctgt ggctacaaat cccgggttcc 151861 tatcactttt cccaatgatg agcttctcct tgttggccag aatcagctct gtggtaataa 151921 ttcccttctg ctttctccct ctcgcttttt tttttccttc tgggtacatt ataatgtcag 151981 catagcagct tttaattcta ttactatggt acttctcgaa tagacctgtg gtagttgccc 152041 cagaaaggtt gaaaccccct gccacatcta ggtctttctc ctgtattggt tttgtgatct 152101 gatcaaggac atctattcct ctagtcctgc taggtggtat acgataggct tttactatta 152161 gatattcaaa tgctgtcaac catgcctgcc ttgacaggta catgatttcc acaaggattt 152221 cctaataggc ctttttaatt ttgttctaga catgcccttc ttttgccatc caaatttgct 152281 tattttttac aaactatgat gcatcagcat gtaaatgttt aataccataa atatttctga 152341 tcctcttccc aatttcactg agttactggt tatctcttgc cattattctc agcatattgc 152401 agtggatctc agaatatttt cccctgacca ccagcatcag tattacctgg gaatttgaaa 152461 tgcacattct caggccccac caaggcctca tgattcagaa 
actctggagc tgcattagaa 152521 actgccacag gcacctggtg atctgtgttg gtttttgttt gtttttaagg agacagtgtc 152581 tttgtctccc tggctggaat gcagtgatgc agtcatagct cacagcagcc tcgaactcct 152641 gggctcaagc aatcctcccg ccccagcctt ctgagtagct ggaactacag gtgcacacca 152701 ccatgcctgg ctggtttttg tttgttttta aggaggcagt gtctttgtct ccctagctgg 152761 aatgcagtga tgcagtcata gctcacagca gcctcgaact cctgggctca agcaatcctc 152821 ccgccccagc cttctgagta gctggaacta caggtgcaca ccaccatgcc tggctttttt 152881 tttttttttt ttttaatttt tttggagaca gggtctcacc gtgtggccca ggctggtctc 152941 aaactcctgg cctctaggtt ttttcccacc tcagccttcc acagtgctgg gattacaggc 153001 atgagccacc acatttagca ggtaggtgat ctgtgtttca ataagccttc caggtgattc 153061 tgatgctcac tgaaatttga gaaccactga aatatttggt tttctacaaa acataagcag 153121 ttagtaacca tagggtacgc aactaaatag ttccagcagg catccctgac cacagggcat 153181 tgcagattta cttaagtcag tttcatcttc cacctgtgtg agttccagca ggcatccctg 153241 accacagggc attgcaggtt tacttaagtc agtttcatct tccacctgtg tgagttccag 153301 caggcatccc tgaccacagg gcattgcagg tttacttaag tcagtttcat cttccacctg 153361 tgtgagttcc agcaggcatc cctgaccaca gggcattgca ggtttactta agtcagcttc 153421 atcttccacc tgtgtgtcca gtttaagctc tcagttgcag tttttaaacc aatttccacc 153481 agcatatact gagtatccgc taccgtgcaa tgcgttgcgg ctcttgtatg tgcactatga 153541 gaagtaattc ccataatacg gcaaactcag ttttgatatc caccacccca ctttttataa 153601 tggactacat ttttcagagc agttttaggt ttacaaagaa ttcaagcaga aactacaaaa 153661 agttccctct acctctcact tccccactcc ccaacagttt cccctggatt aacattttgc 153721 cttagtatgg tacgtttgaa ccaatatcga tacactatta ttaaccagta ttcctgtttt 153781 aacatgagag gcgacttttt gtgtagtata gtctcttggt tttgacaaat gcttaatgct 153841 gtgtatctat tattacttat gacagtatcg tgcagagtag tttctaaaaa tcctctctac 153901 tccacctatt catcccttcc ccactccccc tgaaccactg gcaaccgctg atctttttat 153961 tgtctccgta attttccctt ttccagaatg tcctgtggtt agactcatgg catccccttt 154021 tagcaactgc agaaactaag aactgtcccc aaggataaaa atttttgcag acccactgag 154081 ctgggattta tacttagacc tgctgaattg aagttttggg tgtgtttctt ctcaaattgc 154141 
caaaattcca tatggacgac ttttcttttc cttccctgaa atgtgtttaa ttgacttttt 154201 ctgttatttg tgtttgcctt cacagtgaag acaaagtagt agaagtagca gaggaggaag 154261 aagtggctga ggtggaagaa gaagaagccg atgatgacga ggacgatgag gatggtgatg 154321 aggtagagga agaggctgag gaaccctacg aagaagccac agagagaacc accagcattg 154381 ccaccaccac caccaccacc acagagtctg tggaagaggt ggttcgaggt aatccaccat 154441 ttgcttggat tccccccacc cccaaggaaa agaaagcgta ataccagagt tggaaatatc 154501 caccctagca ccactgcctt ccccaatcaa aaacatgttt ttttttccaa aaggcttctt 154561 atgcttgtga aatttttttg gtttaacaag caaacaattt caaataatgt gaaatcttta 154621 ttatacagtt tgttttgtac cttgtatatg ctgcttggca aatcccaagt taattctaca 154681 agactccgcc caaccaggta gtcatctaat ttgacaaaca ctgagttagg tgactccttg 154741 gtattccttt ggcagtgagt gtttgttata taatattgtc tgttaccacc tagacgatga 154801 gctccttgag ggtaaggact tgattttatt cagtattgca cagagtgcct ggcttgtagt 154861 ttcgataaat cttagttgag tgagtagttg tgattagatt tcattaaaga aggagatttg 154921 ggccaggcac agtgcctcac acctgtaatc ccagcacttt gggaggccga ggcaggcgga 154981 tcacctgagg ccaggagttc aagaccagtc tggccaacat ggtgaaaacc catctctact 155041 aaaaatacaa aaatgagcca ggcgtggtgg cacgcacctg taatcccagc tatttgggag 155101 gctgagacag gagaattgct tgaacccagg aggcggaggt tgcagtgagc tgagatcaca 155161 ccattgcact ccagcctggg tgacagagca agattgtctc aaaaaaaaaa aaaaaaaaaa 155221 agatttggtg agtatcgcat tattttatgc tatgaatttt aacagaggtg atttatgttg 155281 agtttacatt gttttattgg gaagtacggc agtgctttct tcaagaagta aatatagtat 155341 taatgtctta gactttctct attgaattaa tgtttgtgtt gcaaatattg tgaactaaaa 155401 gaactaaagg tcatacttgt ctccatgaaa taaaaacaaa cattcagttg atattaaatg 155461 accaaatgtt acagaaacga gtaaactgtt ggattgagtt ccttaaagag tatgcatttg 155521 ttccttaaag agttctgcat ttcatatctg ccatctgctt ggcagatagt aaaagatgct 155581 tctagagcta gattgaatga atatgtactt gactgaattt taatgtactg caaatactta 155641 taaaggtcaa ccacaaatgt tacagcattt acaaaaagtg gctttctctc actttttgct 155701 aaagtttaat gatttcagat ttattgttct attagagctt aacatgggct attttgtctc 155761 cctcaactat attcatagtt 
gtcactatag tattttcttc cctttgttgc attttttacc 155821 ttattaaggt gtaatttaca tataagtttc atttgctttt aagtatataa ttcagcggct 155881 ttaagaaatt tgccaagctg tgcaactatc accataatct aatcttagaa cattttcgtc 155941 actccagtaa gatcacacat actcactccc agttaagccc tctactcacc tctagcccca 156001 agcaacaact aatctacttt ctgtctctat agatttgccc ttttttggac tgttattata 156061 agtgggatca tataatacat ggtcttttgt gtccgacttc tttcacttaa cacagtattt 156121 ttgaggttct acatttcttt aacttgagct tttaaaattg agattattct gaattaaaca 156181 ctcagcgtta gcttaaaatt tattgaaagt tttacatgtt ctttgaaaaa aaagaacaga 156241 gtgcaagatg gattaagttt atcattccct acttaatgac agtttcttac tcactttagc 156301 acataagtaa atgtgctagt aaatgcgcat tttgtttaaa ccaccatttt tcaaatactg 156361 agacatgctt tctcagggtg tttttcatac aactgttagg accaactatg gcagatctgg 156421 tcaaatgaac ttgttagcct gatgcctgta gcagggaagg acagcttttc agggcttctc 156481 ttctcgaaac ctaaatgaag tgtgaataca aattacagca gctagcagcc tgcagtggga 156541 ctgaaagtcc ttggtgaatc tccatcaagc tgggctgttt aatagaacca gtttagtgtc 156601 ccattttact tgaaacatta gattttgagc acctgcttgg tacaagccat tttgctaggc 156661 catgtggaat ggatgatgag cacacacaga ccagctaacc ctagtggatg ttaaaatctc 156721 ttaggaaata gaagtcaaag aagtacaaag ccatctgtgt tgtaagaaat gtgaacacag 156781 aacctgcctc cgagagtaag agggaacttc tggattgatc agagaaggtt ccatggagag 156841 catgacattt gaactagagc caaaagtgtg ggtgggtggt gtgtctacca gtggattttg 156901 ctgtagagtc atcaggacta gcaaaacaaa tgtggtgcca tggaaaggca tatttaggag 156961 acaggattgg tccaatttaa gctagagcag atgtctttta ggttgggacc aaaatataga 157021 gaccttcaat gccaggctgg caaagatggg cattaactgc aggtgattaa ttaacaaata 157081 ttaattcaga agctactgtg cgctgcagat ttttccagga gacaataaag agattttgtt 157141 ttgatgctag tactgttgag gtaacaggtg aacaaggagt gacatcagtt atgcagtaca 157201 atgagaggat gccacataaa cttaggccca tattctagaa caagacaatt tgcttgagat 157261 aaattggtct cattatcgtc tgcattagtc cagttctttt tgcttgagtt ttgcaggtag 157321 tattatttga tattaagacg gaacacagag atgttgtatt gttttatttc ctgtagaaag 157381 acgaaggcag ttaaaagaat ttcataaaca aaacactcca 
ggctaaaggc aaaaaagaaa 157441 tgaaccttaa aaaataacaa gttaataatt ttgttggcaa gaagagcaac aaaagcaacc 157501 ttctgaaagc aagatacctg gtcattgaga gatagcaact gttaatgccc cagaaataga 157561 ggaatcttta atgcaggtcc attcattcca cctcaaactc ctgaacaaaa gctctaattg 157621 aggcaagtct tcctgggaaa gggggcctaa cagaaacatg ttatggtcag ttgtgcccat 157681 gcgagataat ttttttgtgc ttgtagctaa ttccagtaaa gaagtagatg ttttagaggg 157741 taacgttaaa ggggtcagct gcaaagagtc cttttcctcc tcctcactgt tacaaaaggt 157801 gacttgtggt tcttccctaa ttgagagcaa tggcaacgcc ataggtgatt actcattttg 157861 taaaactcta attacaagta ggttgcttct gtgttcaaca taaaaattag aagtgacttg 157921 caaagatgag atactgcagg gaggcttaca atcttgtttt cagtgttagt gtttcagtac 157981 ctggaacaaa atgtgattta cagccatcct ttctaagaaa attgtcatca ttaccaacat 158041 attgctactg ataccagtcc ttaatgtgct tcataaaaaa ctcaataatt tcataacccg 158101 ttaaaaccgg aaagaaatgg agggaagaaa aataagacac ttttagcgta gtcataatct 158161 tctctgccat attgtgttag aatcgatgat acccaaatga gaagatgtct gttctctgtc 158221 actgattcca gtagaataaa attattgttt aaaagtaagc acattcgctg cactccagcc 158281 tgggcaacag agtgggaccc tgtctcaaaa aaataaaaat aaaagtgagc acacgattgc 158341 aagatctatc tggttttggg ggcttattgt catctttggt tgttaaaaaa atacttggat 158401 attattggtt attggtctag tggaataagc cattaccgat ttgagtacaa cagtggtgta 158461 tttagatact tgctcctaat tttctgttct tcaagaaggg attgtacttc agaaaccaga 158521 gatatcaggg tagttttgat tgagccataa attctaaatt ctgattgaaa tgggtattgt 158581 aataaatggc ataatctttt atttaatcta gccaattgtg ccctgtccag aggaactgga 158641 catttcttcc ttatgttcca tttattcagc aactattcgt tgtacccctg ctgatggcta 158701 ggctctttgc tactgtggaa ggcatcgttg gggcagagca gacagcattt tacctcctgg 158761 gagactggca gatttttgaa gcctttgttg cagatataga ttttcacggt caccactctt 158821 gccctttaat gtaatttaaa tttttttttt tttttttttg acacgggatt ttgctatgtt 158881 tcctaggctg gattcaaact cctgagttca aacctcagcc tcccaagtag ccgggaaaac 158941 aggcctcttg tgctttaaag cagtgattct ctgtctttaa accttgtaac tatgctgtgt 159001 ggtggaaatt ttacttcccc gtttaaagcc aagatctcta aggccaagaa tttttttttt 159061 
tttttttttt gagatacagt ctcgctctgt cacccaggct ggagtgtagt ggtgcgatct 159121 tggctcactg caagctcggc ctcccgggtt cacaccattc tcccgcttca gcctcccgag 159181 tagctgggac tacaggtgcc tgccacgatg cccggctaat ttttttgtat ttttagtaga 159241 gtcagggttt caccatgtta gccaggatgg tctcgatttc ctgacctcat gatctgccca 159301 ccttggcggc accccccgcc ccccgccccc cgccccccac aaagtgctgg gaatacaggc 159361 caccatgcgt ggccgaaaaa aattttttta attggcctgg gctgggcgca atggctcatg 159421 cctataatcc cagcaccttg ggaggccgag gcaggcggat aacgaggtca ggagattaag 159481 gccatcctgg ctaacacggt gaaaccccat ctgtactaaa gatacaaaaa ttagccaggt 159541 gtggtggcat gcgcctgtaa tcctagctac acaggaggct gaggcaggag aatcgcttga 159601 acctaggagg cggaggttgc agtgagctga gattgtgcca ctgcactcca gcctgggcaa 159661 cagagcgaaa cttcgtctca aaaaaagaat aaaataaatt gtcctgggtc acataactgg 159721 ttaagtgatg aaatcagaat ttggacctag ataggccagg atatttctgc tacactcatg 159781 tcaaaattac actttttatt ttctttataa atacacacaa attcatttca acctgtgttc 159841 ccttgtaaat gcatatacct ttgatatagg catatattca tctattattt tacactgtgc 159901 atttctgaat ttttagaatt tttacttgaa gtgatcaggc tggcgctgtg gctcacgcct 159961 gtaatcccag cactttggga ggctgaggtg agcagatcac ctgaggtcag gagttagaga 160021 ccagcctggc caacatggtg aaaccccgtc ttcactaaaa atacaaaaaa attagctagg 160081 catggtggcg cgtgcccata atcccagcta ctcaggtggc caaggcaggg gaattgcttg 160141 gacctgggat gcggaggttg cagtgagctg agattgcgcc actgcactcc agcctggcga 160201 caagagcgag actccgtctc aaaaaaaaaa aatagaaaaa tgatcggggg tccacaaata 160261 agagtaatga tcaagggtca gcaaattggg gccaaaaata gcctgttttc ctgttttcta 160321 cacaaaagtt ttattaggca cagcccatcc catttgttta aaaattgtct atggatgttt 160381 tcatgttttg gaagtagagt tgaggatcgc aaaagagatt gttttgcctg caaaactgaa 160441 aacatttact atctggcctt ttcagtaaac catttgctaa cccctgattc tcactataag 160501 acggtgactg tgaagttctt tctatttcac atactgccat atcaagttga tttcttttga 160561 ggtagggttt gttttttttt ggttagagat agactagtat agtgtttgag agaattttta 160621 tgaccttggg ccagttaagt aacctgtttt ttatgtgtgt agtgtagatg gtaatgttat 160681 cttgcatgtt tcttgtaaga 
atgactttag gtaacttatg ttaaacgctt accatcatgc 160741 ccagtaagtg ttcaatttac aaactcaagc tgtccagatt caaatctttg ccttacaagc 160801 tactaactgt gtaaatttgt ataaatcaaa taaccttgaa tcctcattgc ttctttttgt 160861 aaaataggga ttaataagac ctatccaata tttaactatt ttgtgaggat taaattaatt 160921 aatttatagg aaacactttg cataattcct ggtatacagt aactgctcaa ttaaccttag 160981 ttgctctctt agttatggtt atatcatcat tccgatcctc cattgcattt gaaacattag 161041 caaactaatg tttactacta atgaggaatg ttaaaggtaa accaataagt tgacttttaa 161101 ttataaaagg cataaatgta acctatccat atctattggt ataataaagc tgatatttag 161161 agagaacatt atgttatgat taaaagaaac atttcttgct taaatttttt ttgtgtgaat 161221 atacactgta tattgaaata tctgcactta aattttaaat ccagctgaaa attggataag 161281 atagtattag atgccactgc atttcaagat agagttttat ttcttgcagg tattttaagc 161341 cttgtgggag ttcttctgtt aagtgaacag atagagacac tgatacagat ggctctttca 161401 ccgtctcatg tgatgataaa agctaaaagg tttgatccga atattgctta tattgggcta 161461 agtgattgac cttggcagaa ttaaattatt tttcttcttc tggtgacaat taatctggga 161521 gaagtaccat ttgatttatt gctatacatt tcaccaatat ccacaaggca cttgtaggta 161581 ccagacatgg cattggttgc tgggggcaca gagataggaa caaagtggct tgctcaggtt 161641 gcttctagcc atatctgcat tgtccttgag agggatgagt ctctcaggcc tgcctaggaa 161701 ggaattttgg cagcatggag aagagaaagt gctcagcccg gaacaaaagt tttcacctac 161761 tatgttgcag agaagttggc gtatcagtaa ttataaagga agaataggga tttactgggt 161821 gatagagcac caaatagaat gaatgcaata catacttgaa gtaaatcaga gtagcttgta 161881 ttgatctcaa ggggagaaga aacaaagctt agcttaattc tcatgcctct gttgttactt 161941 gtttgtaatg gacatgacac ttacaaggta tcctataatt cactaacggg atttttactt 162001 atgattaata tttaactctc cacaaagctt ggggtaaact gtgaacttaa cttgctgtga 162061 acccgaacat gaactcacat acttcagttg gaggcctcag tagtagcatg ctgtacaaaa 162121 atgaaactaa agctccaggg agtatttggg aacagggcaa aagggtaatg gctcctcctt 162181 atagagcaag cctcaggcct ttacctttgt gttttgaaga tggagaggat gcacttctac 162241 tctcgcccgg gttgtagcat gcttgctaat gatgtcaaga gaactagtgt cagcactgac 162301 catagatgtg acaggtaaca gcctgttatt ttgtgcccca 
tgtcatggca gccattgatt 162361 ccgaggtgtt tgatacacac tttgatagag gcagagtgaa gcaggacatg gacaagcatc 162421 tttccgtgta gtttaatggt ctcaatctag aaagcccact cagataatta ggttgcttaa 162481 aatgtgatga gactagggaa tagagccata tagaagtgtg tgtgagagag aattcagtgg 162541 tttagtatat tcacaaagat acacatttga caccagcctg ggcaacatag tgagaacctg 162601 gctttaccaa ttaaaaaaaa atagaatagc agaagtaatc agtgaaaggc attttaatac 162661 aattatatct agatttatta aaaataaaat tttagtccaa tttaattaaa ggccagaaac 162721 tagattttta aagaatatta gtctttctca tattttcccc aaatcagttg ctttaaaaaa 162781 aaaaaagtgt gcagatgcct caggcacaat aactgtactc aactcaagaa tttgcaatta 162841 agaaaaaaaa aaatcaaacc tcaaagaaga aatccagcaa ctcagaggtt caggttcatt 162901 taaacatgca tacatttaca tgggcacact ttttttgttt tgttttgttt ttaagatgga 162961 gtttttgttt gtcgcccagg ctggagtgca atggcgcgat ctcggctcac tgcaacctct 163021 gcctcccggg ttcaagtgag tctcctctct cagcctcctg agtagctggg attacaggcg 163081 cacacaaccg cgcctggcta actgttgtat ttttagtaga gacaggtttt caccatgttg 163141 gccaggctgg tcttgaactc ctgacctcag gtgatccacc tgccttggcc tcctaaagtg 163201 ctagcattac aggtgtgagc cactgcacca gctgggtgca cctttttgtt tgtttgtttt 163261 gttttgtttt gttttgtttt gttttgtttt gttgacatgg agtctcagtc acccaggctg 163321 ggattcagtg gcacgatctt ggctcactgc aacctccgtc tcccaggttc aagcaattct 163381 caattctcct gcctcagctt cccaagtagc tgggactaca ggtgtgtgca accacacctg 163441 gctaattttt gtatttttgg tagagatggg tttcaccacg ttggccagac tggtctcgaa 163501 cttctgacct caggtgattc acccaccttg gcctcccaaa gtgctgggtt actggtttga 163561 gccaccacac ccaaccgggc acacgttttt agccaagcca taggtctgat taagtacagt 163621 cagattcctt gagcaggtta caaagctgtg gatttgaatg gagatcttta gttcccttgc 163681 agaacaatga gtttccaaaa gcagagtggc ttacagccta accagctgct tctctcagcc 163741 tttaaccatt ttgccctcct gaaaattcat ctttggtttc caaccacctc ttccaatatg 163801 cctgactttt tgtgattctg ttagtagaag agtaaaaagt caaggggagg caagaatgaa 163861 tgaatctcat ataaaaatga tgctcaagag aatcccttaa ttttcaagac agcagcccaa 163921 attgtaaaca ttgagacttt tacagtgtta aaaaaaaaaa agtgtttaca ggccaggcat 163981 
ggtatctcac acctgttatc ccagcacttt gggaagctga ggcaggatga tcgcctgacc 164041 ccaggagttt gcaagcagcc tgggccccat agggagacca aaaaaaaaaa ttgttgtttt 164101 ggctgtagtg tcacaggctt atagtcccag ctactcagga ggctgaggtg ggagggtcgc 164161 tggagcccag gtgaggtcaa gcctgcagtg agccgtgatc acaccaccac actccagcct 164221 tggtgacagt gagaccctgt ccaaaaaaaa aaaagattaa ggcagaggaa atgttaaagg 164281 ttacgctata tttagttgat aataatggta agactgttca ggtgttttaa ctatgagagt 164341 agggttgcct ctttcttaaa tttatatatg ttcttattgg gtataatttc tagacagaat 164401 atgcaagtat cacagaatat ggatggcaat tttatgttgc ttgcacgtga ctttaggttc 164461 ttatgtttca tgataggagc aaagaaaagt cttgcttgcc aattcttatg attgcatctg 164521 ttctggaaat ttactagcat ttgttaccca ggatttctga agtgctggtg actgctgccc 164581 agtagtcttg gtgggcaaag ttactaaaat gcatcacgaa tccttctggt gtttgcttgg 164641 gctatggcag ggtttctgga ccttagtact attgatgttt tgggccagat aattgttgcg 164701 gatgattgtc ctgtgcattg taggatgttt agctgcatca ctagcctcta ctcgatgcca 164761 gcagcacccc caagtgatga caatcaaaaa tgtcccctgg gaggagtagg ggatagactc 164821 tctctggttg caaaacactg ggctatggtg ttgagatctg cagtgatgag cagctttaga 164881 tatgtgaaca gaagcccagg cgtggtggct cacacttgta accccagcac tttgggaggc 164941 tgaggtgggc ggattacttg agatcaggag tttgaggcca acctggccaa catggtgaaa 165001 ccccgtctct actaaaaata taaaaattag ccgggcatga tgatgggtgc ctataatccc 165061 agctacttgg gaggctgagg caaagaaatc acttgaacct gggaggttga ggttgcagtg 165121 agctgagatc gtgccattgc actccagtct cggcagacag actgagatgc cgtctcaaaa 165181 ataaataaat aaaaataaaa aagtgaacag agaaataccc gtgcaatact gtgtctcatc 165241 ccccaaagac ggagtcttgc tgtcgcccag gctggagtgc agtgacacga tctcagttca 165301 ctgcaatctc tgcctccatg gttcaagcga ttcacctgcc tctgtctccg agtacctggg 165361 attacaggtg tgcctcacca agaccagcta agttttgtgt gttttttttg ttttgttgtt 165421 ttttgctttt tttttttttt ttttttttga gacgaagtct cactctgtca cccaggctgg 165481 aaggcagtgg cacagtcttg gctcactgca acctctgctg cccaggttca agcgattctc 165541 ttgcctcagc ctcccaagta gctgggatta caggcacctg ccactgcgcc cggctaatat 165601 attttttttt tttgtagttt 
tagtagagac agggtttcac catgttagcc aggctgatct 165661 tgaactgctg acctcaagtg atccatccat ctcggcctcc caaagtgctg ggtttacagg 165721 catgagccac caagcccggc tggtctcatt ttaatgtaag ctctaacttc tgtatcactg 165781 ctttgcacat tgtagataac cagtgaacat tttttattgt ataagaaaaa gtaaaataaa 165841 caaaaatatg ccaaagcaaa tatgataaat acacatttag taaggtaaaa gcaagtattt 165901 gataaaatgt gtaatgtagt ctagtagctt tatttagaaa cagaaaacaa cataaaatgt 165961 tgacttgaaa cattgcgtat gcatatttgc catgtagaca gtgagatatg gaggtaaact 166021 gctttcctag ttttgttttg tttttttttt tttttgttca aaaggaattc actctcccct 166081 tgtccactgt tataaaatag caaatggtga ttagcatctg tgctgtaacc agagacactg 166141 ctgttcccag ctggaatctg taggttgttt gcctgctaga gattgtgaca agaagccagt 166201 taacatggag aggtgtacat gtcagtcttc cccagaaaag gacatgaaac agtaggacca 166261 tccttccaac ccatgaccta catctagagg cattcttttt ttcttttttt ttttttaaac 166321 cggaacagaa atatcatcta attctttctt tacagagaag tggtataagg aagtacattc 166381 tggccaggca cgatggctca tgctgtaatc ccagcacttt gggaggccga ggtgggtgca 166441 tcacctgagg tcaggagttt gagaccagcc tggccaacat ggtgaaaccc ctcgctacta 166501 aaaatacaaa aattagccgg gcgtggtggc acacacctgt ggtcccagct actcgggagg 166561 ctgaagcagg agaatcgctt gaacccggga gacggaggtt gcagtaagcc gagttcactc 166621 cattgtactc tagcctgggt gacagagcga gattcgtctc aaaaaaaaaa aaagaaaaga 166681 aaagaaaaag gaagtatatt ctaagccctt gaattgatct tggctttctt cattattttt 166741 acaagtaaaa ataaaaactt tgatatttct gatgttctgg gatattatag gaatgatacc 166801 acttgaattc agctttctaa gaagtctggt atttcaatag tcttaatatg agtaagaaaa 166861 ttatgatctt ggtactggat ctaaggcaga ggtatcgaag ctgttttaat gttgcttgag 166921 ataggacaga acttcagagt aaaataaatg tgtatagaca atctgttcaa tctgttcaaa 166981 ttaagttgtt gagctgtata tttgaatctt aagactacta attttgttgt atctgtcttt 167041 tcccaaatat atgtgtactt cttttaataa gccaaaacct atgcaacttt tactaattct 167101 tccctaacta cttcttttgg aaaatgatta aatcttatgt ctcacgtttt gacatatcta 167161 ttgcatttaa aagataaggt tgatcatgct gcttctcaaa ctgagatatc cattcgagtt 167221 atccaaggcc cttggaaaac aatcgttgtt gagtctgtat 
ggggggcctg agtcccaggt 167281 ggtgttgact ctgctggtcc atgatcctat tttgagtagt agactctagt agcgtctcct 167341 gccttccata tggtgcagat ttgcagcatt gagacaggaa gtaagaaact gaaaaggaaa 167401 accattctta cctttggtgt tattgttctt ggtatattac agataccaac aagtatgagg 167461 caatcctgag acatctacct actgagcttt tctttcctta aattcacacc tcaagaccaa 167521 ggaaaccgaa aaactcttac agtctcccta catgtgttca cttgtgcccc tatttcctgt 167581 cattaacatg gtagtgtagc caactcctgc tagcctgtaa aagctaatta ttacagtata 167641 gaggaatttt gtaagccata gccattagta gcttgaaata ccaatggtgg gagtatttat 167701 accatggcag ttggcataca ctgtaactca agagtgttct atttgtttgt ttttctagac 167761 agtgccagtt tcccagcata ctcctgtctc tcattcagta agacaaagca cttcccctta 167821 ttttccaata caggacatga tatcccgagt tttgagcttc ttgttaatag cctgttgcac 167881 gctaaccacg cagctcagtg aagagcaagt ttattcccat aacaaacagg aaacaaaggt 167941 tgaggaaaaa ccacagtggt acccttttgt gagtgttccc taacaccgtc acgggagtcc 168001 tgattagtct aaatacttta caagtaaatc tccttcaggt ggcattttat aaacaatctt 168061 tcatcatgga gagatgagaa ttgggtcagg tataattaga atgacagatg tattggctag 168121 cctgagagaa ttaacgcaat tcagtaaaaa ggctagccaa ctgcggatat cttctagtta 168181 cttccgtgac aaattagcac catcctcttg tactgaagca gtgctttcat tagcatgcac 168241 agctgtttgc atcatctttg tttgtatttc aatccatatt tggagcttaa gtgagatttg 168301 tcataagact tcagttcaag cttgcttgcc taattttaca tgcttctatt tagtaaaata 168361 tgattactag gtcataaata tttgataatt tattgcctta ttagagtaat tttctaaaca 168421 tgttttttga cgactgcttg ccactacgta caagcacttg aaaaatgttc ttttctatta 168481 aaagtacatg gttccagaat tgggaaaatc acgttcttaa aaaataacca gttttctatt 168541 gtgttggctt ttaaaatttt ttcatcagta gaatcaaatg caaatattgt ctgcttattt 168601 ttcaacagaa tacaagtttg attttaataa cttaggctca tgaaatgtca ccacgtagta 168661 atacttagaa ttacagcaat aacttaggct aaaaaaataa agttgttatg aactacattg 168721 agctcagaca gatatatgta ttatgtaaac atttatataa aatgtatata aattgtatat 168781 tgtattatat aattatatat tgtaatatat agtatataat atataattta taaatataat 168841 acataaatta tatttataaa ttataaagtt tttatataat atattataat ctaatatatt 168901 
ataattatat ttataaatta taacgtttat ttgaaatatt gaagttttgt atacttagtg 168961 gattttaata tttatgtcgt tttcatttta ttccactgtt acattaagga agtaacttat 169021 tacaaaaata aattatattt gtcttacatt tgcccttttt actctactta aaattggaga 169081 aaaagagttc aagataggga atttctgtca gaaaggagaa atagttgtat tatgtgaagt 169141 ttaagtagat attcttgtag agtatggaga aaataatgct ataaaatgtt gccttttaaa 169201 aaaaagttca gaaaatatag ttcctagctc tctctaaatg actagtttga ttgtgactgc 169261 tttcccaaaa tttgtaaatt taaaatttat acatcacatg gaatatagtt ttatttcaca 169321 tacattccag taggctgaga atacttgaaa ttcaatgatt aaaaatgaaa taataataac 169381 cagctttgct ttattctaaa accagcttta ttctttattt accactttat ttacagcgaa 169441 ttatttgcta atctgttaac tttgcttcat aagaaggcat tataattatc gtacatttga 169501 aagtggagat tttctcccca ttctggttgc cagtttcaca actgtgttca ttctaaattt 169561 attaaaatat gtaaaatcag ctttctaacc ttggttgtcc acaaagaatt tcactccaat 169621 ttgaaatatc tattcagaac taattttttg tggttgagca ttattttttg aacaatgaat 169681 tgcataatca ttttgtataa tccagggagc tctggcttag attctgtatt attttgtgtt 169741 atactgattt ttagaaaatg gaatgtagcg ttgacacgag cacaaatctc agcaagggcc 169801 taccttacaa agataaaatg cgtggagttt tattagaagc acatgatttt gctttgtgtt 169861 tcgtatacac cagcatttca gtgtagtaaa tggcattttg tcttgaagag tgagcagagt 169921 tgttacaaaa gaaggcataa tgcttccttg ccagcagaat tcagataagt gaatatcaga 169981 aaagctattt gtgatcttga aagtgtaaga aattcaacat caataaagga ttgtggttcc 170041 ttagatagcc atgtaagtct ttgcctatgt ttctaacatt tctctttagg aaatattctt 170101 gcttttattt ttctgaacaa attttctcca ttattatctc tgatatttac tttgaacaga 170161 aaggtggatt ggaaatcaaa aaataagttt gatttcaaaa atgatttcaa aatgttagtt 170221 ttctcagtat tttacatttt gagaagagaa caaagaaggc agatgaagag ttttttgttt 170281 tttcttaaat ttgttttctt ttcaacttta tttttaattg acataataat tacatatagt 170341 atacattttg aagatacggt tttaattttc acaggaatct tatttgggtt tagttatata 170401 aggtctaatt ttgcatcctc tacatttagt ggctgattta taaatttcca ctgagtgata 170461 atagtaatta atgtgaaaca ggttttttgc tgatgttttt agaagtttca gtagattagg 170521 aaggaagaaa ttatgctatt 
gagaaaaata gaaaatgtaa acataagcat ttgaactctc 170581 tcttgagata tttttatctt ccatgaaact agatgcggct tattgaagat gtgtttcagg 170641 tctcagtgct ctaaaactct tctacttggt gagcatcttg attggttaga tctcttacta 170701 atatcccatt gctagagagg gatgtaaagg aaattttctc aaactgagat ttccaaatct 170761 tccaatctct gagtcaagta aagagacaag gttagaactc ttaggcaaac tgaaaaatca 170821 atattcctat attatcatta tttttttctt tttttgacac caagtctcac tctgtcgccc 170881 aggctggagt gcagtggcac tattttggct cactgcaatg tacgcttccc aggttcaagt 170941 gattccagtg cttcggcctc gcgagtaggt gagattatat gtgcacgcca ccatgcccag 171001 ctaatttttg tatttttagt agagacaggg tttcgccatg ttgactacgt tggtcttgaa 171061 ctcctgacct caggtgatct gcccacctcg gcctcccaaa gtgctgggat tacaggcttg 171121 agccaccatg cctggcttcc tatattcatt tttttatgtg ggctttttaa aatcttttca 171181 atagcaaaga cttgacacca acccaaatgt ccatcaatga tagactggat taagaaaatg 171241 tggcacatat acaccatgaa atactatgca gccataaaaa aggatgagtt cacgtccttt 171301 gtagggacat ggatgaagct ggaaaccatc attctcagca gactatcaca aggacagaaa 171361 accaaactct gcatgttctc actcataggt ggaaattgaa caatgagaac acttggacac 171421 agggcgggga acatcacaca ccagggcctg ttgggggggt ggggggttgg gggagggaaa 171481 gcatcaggag aaatacctaa tgtaaatgac gagtttatgg gtgcagcaaa caacatggca 171541 catgtatacc tatgtaacaa acctgcacgt tgtgcacatg taccctagaa cttaaagtat 171601 aataaaaaag aaaaaaaatt aaaaatcttt tctttgagac aattgtagat ttacatgcag 171661 ttgaaagatt taatagaggc cgggcgcggt gcctcacgcc tgtaatccca gcactttggg 171721 aggccaaggc aggcggatca tgaggtcagg agatcgagac catcctggct aacacagaga 171781 aaccccgtgt ctactaaaaa atcgaaaaaa attagccggt gtgtggcggg tgcttgtaat 171841 cccagctact caggaggctg aggcaggaga atggcgtgaa cctgggaggc ggagcttggc 171901 gaagcctgca gtgagtggag atagtgccac tgcattccag cctgggcaac agagcgagac 171961 tctgtctcaa aaaaaaaaaa aaaaaaaaaa aaagaattaa tacaaagaga ccctgcatag 172021 tcttcactca gtacagtatc acaactatta aattgacatt aatcccttaa cactgcttta 172081 acccaagtct atgctgctgc ttctccgtgt tcacttctga gaggcactgg ccatttcttg 172141 tgtgcttatt catactttgt gtaaaagcat tagcatcata 
catcgctggt gctgacccta 172201 ccctcagtcc cagcttctcc cagagtaggc taagattctc ttttctacta catggaaaat 172261 tgttctggaa taataaaggg ttagcaaaaa agtcagagaa aacaaacagt tgcattccct 172321 ttccctgtag tagagtagga atataattat gagtgataga ggatacgttc cattcctttc 172381 ccagtcctct ccgaagtcag cttcactgct gtaggaagtt gctgttctca ttatttccct 172441 gagggaaagt gttgggaagt gagagttgag gtgatgggga atcagggttg ctgacaattg 172501 ggggaattat aacgagaagc tctttttcac caggagaggg gttgcatgtt aagagaaggc 172561 ctatctgctc atcctccatg tcagcagatg agaaccagtt ggacaaggta gctgtctctt 172621 agagattgag aagtaacaaa agcacactgt cagtcctgtt cttccagcct cacgtacatc 172681 ttctccatac attatgtatt tgacattaaa ttaacataca ctgttacacc cattcctgac 172741 cgcattatct caggtagcct tttgtatttg agtattgtgg gatctgttgc tattgcccat 172801 gatgcccact tggactccac caaggcagtg aaactcttgg gggccgttca aagtcaagag 172861 tgccaactgc ccctaacagg attgggagaa ccaaccaaca ctccccgaat agggactaac 172921 tcactctttt cttggccttt atgaaaggaa ggagttaagg ctcttacagc tggagactcc 172981 agcaagagtt gtaggctcaa cgtgtagctt atactattta cttaattaac atgtaaatgg 173041 attttgagtt ctgaatgctt gttctagttg attagggctc atttagcctg gcttctacga 173101 attagctata acaaatggtt tcatagctgt tatagaatga ggcagaagaa aataaaatga 173161 atgtagcatt atgtttaatt agtggtatag gaaaatgtta atataatgcc atttagattc 173221 tggactgggg agattgtgcc tttatcataa gtaagaaatt gttgctaggt ttcacataag 173281 ggcacatagt tatcatgaat catctccttc gatcatcatt cttttatccc attacttgac 173341 ttgcacccat ccaaaattct accactttta tttaatatat agactcataa agacaaggtt 173401 tgtgcatctg cagagctgtg atgttcactg cgaggcccgt ggcacacatc tataagaaga 173461 cacatatttg gtactctaag atgatacttt tttttaatgc caaaatgtga gggtcagcat 173521 aaattgccaa ggcatggtaa attttgttgc caatgctccc gtgactatgg ggtgaaggga 173581 ttctgtgcct cccttggtag caatgaccac ggtgcttgca atcccagtgc caattcccgg 173641 cacagtgcag gctctggagg cactgctgcc ttgcagtctt attggaatta acactgcctg 173701 agtaagagat taataatggg tattgttgtg tcttagaaac gttaaagagc atgtgacatg 173761 aatatgtaca cattggtgaa ggagatacag agaaggaaag caaagtgaag ctgtagatgt 173821 
ttgttacaac tgaaggtaag ctctgggagc agagaccact gctggctata gttgatggga 173881 tccacaatgt caatacaaat ggggactaac tattgtttta cttccctata cacgttgctc 173941 tttcagtaga tgcaataaag tactggtaaa accagaggtg gctaccatca cgatgatgtc 174001 aacaggaggg acagtcagca ctaagcccag aaggtgtcaa acactccgca ggagaaatgc 174061 gccatgcaac ggacatgaag atgatctgac actcttcacg tggttttcag atggaaacgt 174121 ggctacgaaa gcatcaacct cattatcatc catcattaag gccatctcac tcagtactgc 174181 tgctttcaaa gtccacgctc ccaaagcaaa ttggatttct gtacacaata ctcttacagg 174241 atgaaaccca accaacttgt tgacgtaagt atcccatcta cttcgcaatt ttatttatct 174301 tccaaaatta aaggactggc accctgattt attaaaagtg aattgggtct agggaccata 174361 tcccctctga gttactgaca gagcagcttc tggcctgtga agctcaaagc catgcctaga 174421 tgtgatggat ccttgtcttt gattggctaa ttagaatcaa tctgttgaag gaccagtggc 174481 agtggctcac gcctgtaatc ccagcactta gagaggctga ggcaggtgta tcacttgagg 174541 ccaggagttc aagaccagcc tggcctacct ggtgaaaccc catctctact aaaaatacca 174601 aaaaattagc caggcgtgct ggtacaaggg taatcccagc tacttgggag gctgaggcgc 174661 aagaatcact tgaactcagg aggtagaggt tgcagtgagc caagattgtg ccactgggcg 174721 gcagtgcaag accctgtctc ataaaaagaa aaaaagaatc tgttgaatat tcctaacttg 174781 gaagtaatgg gtgatgcgtg gctttcatct tttagaaagg tgttttttcc tctgtcagta 174841 aaactaagaa gacatctgtg tgtgctaagg cgtcagtcta gcaagggcag ggcacaaacc 174901 cagagagaga attgggctgt tgtttcactt ccaataaaaa tgggatatac atgggaggta 174961 gagttataat tctactcagg attgacttta tacatgcaat tggatgggtc actgattgac 175021 tttgaggcaa tcagggatga gtgaaactga tcttggggag ggtggcaaga catgttgaca 175081 ggtaagatgt gttttaatgt ataacagtgc gtggtgtgta ttaatcagtt ggagttgatt 175141 gactgcgttt tattgtttgg aacattcctg cccaagggtc aggtaaatat acaaaatata 175201 caaataatac agatttttaa tacataaata tcaattaagt agcataatta aataatacat 175261 tgttatagtt gtacttaatg tgtatgtaaa tatatacaaa taatacagat ttttaatata 175321 taaatatcaa ttaaatagca gaattaataa tacgttgtta tagatgtact tcatgcatat 175381 gtaattaaat tattaatgta taatatataa atacatattt taatatacaa ataaaatggt 175441 cttaaatcag agttaattta 
agaccgttta tattagtaca tatcttaata tacaaatgaa 175501 tggtcttaag ttaatgtcat tgagtataag tcacatttac agatatgaca tgtgtttcca 175561 gataaagtca atggacagac agcttccaac actggaaaac attcaaaata tatgcaatta 175621 attttatatt tgagttaagg caatcattta ttaaggaaag caatcagatg tttgtgaggg 175681 cttaaacttt cgcttcataa ccaaagcagt ttccttacat gaaagttgac ttcttaaaaa 175741 gagaagttta ttttttatga tggaggattg ttgtgatgtg cagatacctt ccgtcatttc 175801 atactcatct aaaagcagcc aaatgatctt ttcacctttg gccggaatcc ttttattgct 175861 accgagtcct gcactcatta aaccagatga cagtgcatgc tctgagaggt gtaaatctct 175921 agtcctggtg gccagttaaa ttcctcagta aatgtttggt agatgctgcc taataaacca 175981 gtccaggttg ccactgggag gattaaaaga agtaaacgtg tatacatgaa cagagagaca 176041 gtgccttttc atgctaaatg tggttcccca catctcctct gattagaggt gtgctctgaa 176101 caagccgaga cggggccgtg ccgagcaatg atctcccgct ggtactttga tgtgactgaa 176161 gggaagtgtg ccccattctt ttacggcgga tgtggcggca accggaacaa ctttgacaca 176221 gaagagtact gcatggccgt gtgtggcagc gccagtaagt ggacccttct tcgagcctgg 176281 ccacctttcg tctctctcgc cactgactct gctttttgta acagattgat tttcctggtt 176341 cttgggaatg ggcctgttgc taccactaac cacatttctg tccacttctc taattgctca 176401 gagtctccgc agtatgttca atcatgagca cacctctccg tctttccctg ataaagcatg 176461 gccatggatg tgttctcttc ctagctgtag cacatatgtc ttgcaatcca gagggacttt 176521 tgagtgcttc tcttttaaac aaagctggag tggctgtttt gtcttctgca gccagccgtt 176581 tctgcaaatg catacttatg tgtttgtccg tgtacatatg tgttcatgtt tatgcatgtg 176641 cactacctgt gcagatactg tctaccactg gaagcctggc tgatgcagct tgagttgcat 176701 tgataaggtg tatggctgtg atgaatacct ttaagttggt tcataacatg tcttttattg 176761 tgctgatgag ggaatgtgat tgaattgagc tcatgcatct caaatgccgg agggtagaag 176821 tgcccgtctt gccccctccg agctcagcac acagacattt gccaggtcaa acttctgctt 176881 cttctcttga aagtagaaga aaacttttga cttaacgtga gaaactttca atgacataag 176941 gctcgaggaa cataatttta ttttttttgt ttttttaaag acagggtctc attctgtctc 177001 acaggctgga gtgcaatggc gtgatcttgg ctcactgcaa cctccgcctc ccgggttcaa 177061 gtgattctcc tgcctcagcc tccccagcag ctaggatcac 
aggcacatgc cactgcgccc 177121 ggctaatttt ttgtattttt ggtagagaca aggcttcacc atgttgacca ggctggtctc 177181 gaactcctga cctcaagtga tccacctgcc ttggccatcc aaagtgctgg gattacaggc 177241 gtgagccact gcaccccgcc agaacataat tttcaaagtc tttctcttga acattaagct 177301 tttcataatt ccgggggata aaaatatgaa gtcatgatga cttaagtgca tgatgaaatt 177361 ttccccaaag aggtaattct caggaggtcc agtaccagat gtgagacttc catgtcgtta 177421 acagacttcc aagtgaaatt cccaattgtg cttatttatt tatttattta tttatttgtt 177481 gttgttgttc atgagtagga agtatgaact actcaagaag taattattca gctcagcttg 177541 ataactaaaa atgaaactca ataactgtgt attacttttt tgagagactc ctgcctaatt 177601 ctccgctgtg ttgcaagggt ggcagccaca tttcctgcct gagattccta gggattatat 177661 aaattgtttt gcagcaagat cttaggatta agcaaaacaa cagaagggcc tgcaccatta 177721 atacccccac tgtaaaggag cccaggatct aagaggaatg taatggaaaa gctcctgttg 177781 gtctcctgcc atcgccagtg aatgagatga ctgcttcagt cccatgtgtg aacaggtgta 177841 ggaactgata acctcaaggg cttttttttt ttttttttct ggctctgatt ggatgattgc 177901 agcttctgga tcaaaggact cccttcttca gtttcaaaca aagcagtcaa ttaggtcagg 177961 tattgcgaac acaatggaga attctgttgc ctgcagcctt tggaacaaag gccttttgtt 178021 acagtgattc tcactggttt agccggtggg agaggtttcc tggcagattt gcaacttttt 178081 tctccctttc cctgggtgga ttcattttct gaaatatcat gtatagtttt tggtaataag 178141 agatttattt ggttttaatt attagcagtg atgaatttgc ttcaagagat taaaattctc 178201 tccacctcac cccagtactg ccatcccttc acctaacact ctctcccttt gatcagtttg 178261 ggctgggagt gaccagtggg cacctgtcca ggtgtcctgg gtggcacaca gtttgctgag 178321 ggttttgcaa aaggtcgatg ctcccaacag agatgtatta gatgaggcta gctatgcctg 178381 attttagagg ggaaaggtga cagaataatg tatgaagaag gcaaaggctg aagatcagtt 178441 agatgggtgg acaagaataa gtagagagta taagcctcag caaacagcca gagccgagag 178501 ccacatgctg tcctatgaat gagtgaaata aatggagttt ttgtgaaaaa atgcatcgaa 178561 gggaattggg gatatttggc acaaaaggag caagaaatcc tcttctgtta agaaacctgt 178621 tgttactcgg atatactatc agagacaaaa tatccagcat tctcaaatgt taacttctta 178681 cgaaaataga tcttatgttt atatgttcat tttggttttg ttggagggac caaacctaag 178741 
tgagtgattt tgtttgttag gttgtttttt tgtcagtgga ctcgtgcatt tcagccatca 178801 ttcccatgtt tctctttttg tttttagtta tgttctctta ttttttccat agtgtcccaa 178861 agtttactca agactaccca ggaacctctt gcccgagatc ctgttaaacg tacgttgtca 178921 ttcacctgag ggaagggaag aggggaggag gatgctgctt ggttcacata actccagcat 178981 catcaccttc tttgcatggt tttgtgtttc ttgaacacct gtcttagtaa aatgtttctt 179041 cccattacct tgcttgtaat tacatctgat tttgccagac agcttgagat gttgggctaa 179101 gagcatcatt gactaagttt cttctatttc tgaccaattt cctttttatt tagtctggtt 179161 ttattgaata ttatatggac aacatcattg tattgtattt gccattacta ttttatttcc 179221 taaaagctat cagtgtaact gagagcaggc ttagcctctc actgcttttg cagaactgaa 179281 gaacaagggc taggtgcagt ggaaggaaag tgactttact tagcaaagct agcaatgggg 179341 aaatggtcca ggctcctgcc tcaaagcaac catctcaaat tttggattaa aaaacaaagg 179401 cttaaaaagg ggagcttgga atgcagggca tgaaggaggg gtgaggaggt gctgctatac 179461 aggacttgtt ccaaagactt gagttattat ccagttcggt aagtgggctg gcgcatcccg 179521 cacaatcggg ttgtaaatta actgcagcct tgaggtgatc tcctgatgag ggagaattct 179581 gaaggtgcct aagtcagtca gtggaacttc taagcaagca ttaataattt taataataat 179641 tataagcgta attagataat ggaagcgctg tgcagggagt gcctggtggg aagaaagaga 179701 gcaaaggtta ccatttcatt actaaaactg gaaggaaaag aaaggaagaa agatctttaa 179761 aatagagtac tcggttacat caagatgcca tgatttaaca agagactata atacataagt 179821 aacttgtttc ctcagaaaaa gtacttaata ttcatggtat tgtagtacat ggactaattc 179881 gtacacgcag atttcaattt atgtgtgtta taagacccgg tttcaacttt agacatgaag 179941 ggtcactttt agacattagg aacccctctt tctcctctcc cagaaggcat aatggtaggt 180001 tagaggtgct caccttttga ctaggaagta gagaaagagc ctcctataaa gaaatgcatt 180061 gtccagccac tgcaggcctt acttacagtg tcttatgttt tgtttttatt tcagttgtaa 180121 tatcagacga gtttagtttt aaaatgtggt tttagtactc atatttgaat ttggattcca 180181 tccagaaata gatttatcca gtgttaggtt agggatgata tatgccattt atgcttatga 180241 ttgctggaag aagaggcaca atttcagttg taacttagat attttataat tagcatttgt 180301 gtacattatc ctgaaaaaaa aaaaccctca tctcaaaact gatgaagcca gttatctttt 180361 ttaaagatga taaattatag 
atgaaggatt aggtatttta aaaaagactg catcctaaac 180421 tataaagttt tttaatggcc tccataaaat attcagaata attcacatta tgtataactt 180481 ttgaaggatg tggtttgtac tgctgcaaca taaacgtttc ttttgctgtt gtgaaaaaag 180541 catttttctc taaaaacact gtaagtacag caagtgggtt gacagactag gaaaaagctc 180601 tagctattcc actatagtgt tgacataggt acgcatgttg agttgcattg taccgagcat 180661 tttcacagat catttaattt ttcttcagaa cactttgtgg gcattttcct cattcctttt 180721 gtacatacga gaaaaaatga ggcttcaaga gcttagtgac ttgtcccaag ccatcgaaca 180781 tggcaggacc aggctttgaa acccaaaatg acgttttggc cgcactccac accaaatggc 180841 taagcattgg ttctgacatt gttttgccgg ttagtgaaat cacatttgcg attcagaagc 180901 acattttaag aatacagaaa acagtgaaaa gaggatgaag aatgttaaca ttttttccta 180961 aatgccatga atagagcacg catcagtaaa ttaaatgtct ttcccattct tttataaaac 181021 agtgattatt tttctcttgt ttcattttag catttataag aattctattt ttccattttt 181081 aataactgat ggaaacaagc atcattgtct gataaattta ttttcatgct gtacacgtaa 181141 ctaccacttt catcatccac tgttccaaac agatactgaa agtctctttt gtaattttct 181201 ccttcccagt aattttctcc atctacttgc agtgtatttg ggatcctgac acctttccat 181261 tctaggagga tgaagtgctt tgggcagcta tattttcttg tctgcctaca gagactattc 181321 aatcacagca agtgaagggt tttgcagtgc tagtacaaac agacaaggaa aaaaacatat 181381 aatttctagt tatttatagc ctgtgcagct atgtggtctt atttatagca aactgcacat 181441 ttgatttgaa aaagtcatgg tcatctggag gtcaaattca gatggcagac tcaattcaga 181501 gaaaaggagc tgaactagaa ggtaagcaag cctgttagac tgggcaactt ggacttgact 181561 ttgctccttc aacaatctag agccagttgg atgatatttc tgcacaaagg tggttggttt 181621 tacaggttgc tcagtctaag ctgcttctag tcaaataaca aatgtgaaag gagatagaaa 181681 aagctcttca gttgaactga taaaagatta cacttaggga gaaaatgtat gatggtttgc 181741 attataaaaa caataatttg ccataaatca ggcatttcta ggtttcaaac ttttgtcata 181801 aaaaccaatt ttgagcattg caattttagg gacatatttt attgttgtta ttttaaataa 181861 aaatctaaac cacaaatata aggctgagag ctttcatttt attcgtgaac ataggaattc 181921 aaatcattaa aaaaaaaccc tttctcattt gggataaaga atcttacgag agctgagtta 181981 atttctgaca ttagttttgc acctgcccat ttgggtttag 
gacagtgcaa aaggtttcac 182041 atttaatctt caaaatatag tatttgtaga caataaaatt atataaatat ttgtggattc 182101 atttatattt taaatatgag agtgtgacca ttttttcctg gtgctgtctc catcagactg 182161 catcccaaaa ggcaaacaga ccaactcctt aatggcattt aatctctcaa agagcatcat 182221 gccaattttt agtcagcaga ttgcaccatg tgatctggcc aaagtgtgtc ccatctcagg 182281 ctccttcatt tgcgctgaca tgtggtacaa acaactaaac caaggtttct caacctcagc 182341 actattaaca ttttgattca aataattaat ttttaggagt ctgtcctagg catatgggtg 182401 tttagcagta cccctgaccc ctacccacta aatgccatta acacaactcc agctgtgata 182461 actaaaaatg tctctagaca ttgctaaatg tgcccagggg gtcacattct ggttgagaat 182521 cctgttctag agtagtagtt tccccacatc tgactcagag atagaactct cctggtgtaa 182581 acacgtgtac tttccacaaa tttgggggca tgttgaaatg ttagaatgtc agcatcatcc 182641 aattctcgac agatgtttga aattgaattt gttagactat tgtccaagct ctcaatattt 182701 gcaaggtaaa gtggaagttg ctgttgctgt cctgactata gtattgaagg tctcgtagtt 182761 ttcctgctct aatgtaagct actgtggcag cagaagagaa accagcaaat cccttagttt 182821 gaagtcattg gttcacctat gcaccacccc tttctcctct ctgtttcatc ttagtttcct 182881 agaatcccag gatatgctgg aggacaacca tctgtagaga tgtcctatag tcttgctaca 182941 caggttattt gtactttagg attatgtttg atggccttat agtttgtgca gaggtgtagg 183001 cctgacccca tgcaggattg agggagggaa aaagattgat ttctaggcac atcagaatgg 183061 tccagatgac atgaaacaga ctggtacact tacagagaat atgaaaagaa ttggaactta 183121 acaagtcagg ccccaagttc catccccagg ttccaagttt gttgccgcag aagctcataa 183181 ttttcaaatc gtagaaaact cacaccacgc agcctttcaa tggaatatgc caccttaacc 183241 ccagggtcgt tgtaaagaca ggagccataa gacagaagtc tgaatgccta ggccatttaa 183301 tctttggcca aatttgaact attgttctta atcacactat tcttgttctt ttcactgtgc 183361 agtagcaaat gttagttccc attagcacat tctgcaatac cattggaact aaaactgcca 183421 aattagatta ggcaacctgg gattacactt tgctccccaa ccaacctagg gagggagaac 183481 aattttaaag cattaaaaga tgtgatggtg aattaaacta cctgtcacgt cagtgccagt 183541 atattctatc agattaacaa gtgtgaaaag ctaaatcttt ggtatctgga aatgcttaga 183601 taggaagaat ggctttttct tcaaagccat aataattaga gatgacccta aaattgaaag 183661 
gaactggatt gcttagcaac tcagcatttt ttaaaaagaa gggccatcta ctttgaatac 183721 aggttaaata tctcttctct aaaatgggac cagaagtgct tcagattttg gatttcggaa 183781 tattgatgta ttcatgatga aataacatga aattcattta tgtttcatgt acaccttata 183841 caaaagtctg aagataattt tatacaattt tttttttttt ttgagatgga gtctcgctct 183901 gttgcccagg ctggagtgca gtggcgtgat ctcggctcac tgcaacctct gcctcctggg 183961 ttcaagagat ccccctgtct cagcctccca agtagctgga atgaatgaca ggcgcctgcc 184021 accacacaca gctaattttt gtgtttttta gtagagatag ggtttcacca tgttggccag 184081 gctggtcttg aactcctgac ctgaagtgat ccaccagctt cggcctcccg aagtacttcg 184141 attacacgtg tgagccacca cgcttggcct atacagtatt tttgataatt ttgttcatga 184201 aacaaagttt ctgtacgttg cactgtgaga gaacaaaggt gttgctatcc cagccaccat 184261 gatgtcatgt cagtgctcaa aaattatcag attttggagc atttcagatt ctcagattag 184321 agatgttcta cctgtggtaa aagaatgttc attatgtctc tgctctgttg tttagctatt 184381 tgccaaaaga gggtaagtgg atcgtttaat aaagtggtca gcaaatattt ttcatacaag 184441 gccagattat agatatttta ggctttgtag atcatgcagt ttgtcataac tacttaacta 184501 tgcagtggta gcatgaaagc aacaatagac aataagtaaa caaatagaca aggctgtgtt 184561 ccgataatac tacttttata aaagtagcca gcagctgggt gcagtggctc acgcctgtaa 184621 tcccagcact ttgggaggcc gaggcgggca gatcacctga ggtcaggaat tcgagaccag 184681 cctggccaac atggcgaaac cccatctctg ctaaaaacac aaaaactagc tgggcgtggt 184741 gatgggcgcc tgtaatccca cctactcagg aggctgaggc aggagaatca cttgaacccg 184801 ggaggcagag gttgcagtga gctgagatca caccatcgca ctccagcctg ggggacaaga 184861 gcgagacttc atctcaaggg ggaaaaaaaa ggtagccagc aagctgatcc ttggtttaat 184921 acagcaggtt tttttatttt tatttttatt ttttttccaa tagctttggg gatacaagtg 184981 gttttagtta cggggatgaa gtgtatagtg gtgatgtctg agagtttagt gtccccattt 185041 cactgagtag tgtacattgt acccaatacg tagtttttta tccctctcct ccttcccact 185101 cttcctcatt ctgagtctcc aatgtccatt atgccactct atgcctttgc ataatcatag 185161 cttagctccc aatttataag ggaatacata ctgtatttcc attcctgatt acttcactta 185221 gaataatggc ctccacctcc atccaagttg ctgcaaaaga cttttatttt gtggtttttc 185281 tggcctacta gtattccgtg 
gtatatatac accacatttt ctttatccct cattggttga 185341 tgggcactta ggttggttcc gtatctttgc agttgtgaac tgtgcagcag ttatttttga 185401 ccccaggatg tgcgaggcca ggtattttga gagacaccag caaaggaatc tgagtactgg 185461 ctatcaatac ctgataattt tattaatgag ttgagaccca gatatgccgt agaggtaaat 185521 aaccatataa aggaaatata gaagtgtagg agaagattgc tctggaaggc tggggtaaaa 185581 tgaaatcagc tttatcctta agacctccat agaggttaac tgtaccatct ccttttactg 185641 ctacagttta tatgaaactg gacaatattg aagattttga cttttcgaac tcatctattg 185701 agtgttgagc agctatgaca ggttatttct ttatgtgcct ccattaacga agtcaagaat 185761 agcttctgac aaattgacaa aaaacaataa cctaaggtta acccattctg gacgccagtt 185821 attgctattg cttcacatat attttttaga ttcaaacttg aggttgcttt taagttttag 185881 tgtgatgtgc tgaatagaga atagatgtta ataaaaaaat aatgccagca ttcgttcctt 185941 cacattcagt tcatatattt gaaaaaattc ccatgtcagt atgtgtccat ggacataaaa 186001 aagaaatgta gcttgactag agtaaacaca tcatagatta tatttttatt accttatgtc 186061 catgtgatgc attgatagtg ttctccacca actatgattg cactgactat aaaattgtaa 186121 tacaagggca tattaggggg agaatgcctt tgttttttct tttgttgtta ttttgttttt 186181 aagagagtct ggtattacac tggaggatca caaactgtat tttcttgtta taaatggctc 186241 ctctatcttg gacttaatgg ttaggataaa tatttgaatg tagaatgcgt taggagtttt 186301 aactattttt gaaattagga aaatggagag acactttttt tttttttaga gacacttttt 186361 ttttaagtta taataagcat ggcactcgga aagctctgcc atgtttcaag attaaagttt 186421 cacatgaacg ctaacaagtt tatatttcac tcatatataa atgatcctcc ttgtctcctt 186481 tcctaaatta tagccatctc aaatgacaac attgctgggt cggtatcaaa tagcattttc 186541 aatgataagg caatgattta tataggtctt aatttgtttt aatcttctgt taccaaggct 186601 gtagagataa attgcttttc tgctgatgaa aggagactaa cccaattttc ttgttagaat 186661 ctaatattaa tatatcgact gaatgccatt cataagcctt ggtttaggga aacattcagc 186721 cttctggcct gttggagatg gccacggtca ttgctgtttg aaggtgggtc atttcttctt 186781 ttgtgtcctg agtccattgt gactgtaaaa ttatgggtca ccccagaagc ctgaggaaac 186841 aaatgcatat caaaatgata ggggcaatct ttcacagtgc ctttaatttt gaccccacat 186901 tcttatatga atgtgaagta ccacattagt atgaatatga 
aatggagtgc tccgcaatga 186961 ttacaaaatc agaactaaaa tgagggaaat cagtgcatgt acagttcaac ttaatggatt 187021 atcctctaag gagtcttcag acaggctagg agagtgacgt ctgcctcatc ctgaagatat 187081 tagcaggttt tcccctgttt taaagtaact tcccttcaaa tgctaacagt tttctcagag 187141 gaaacacagg tatggaaaat gtttcgttgg tgtcttggcc ccagtgatgg aggagcattt 187201 tgtgatagga acgttgggtg cagcatcgcc atttatgaat aaactgaaag ctaacctgtg 187261 ttacttcacc ccagaaaaca gagctctggc aggtattgag aatagatcac agtgtgaagt 187321 ggcagtattg tgacaagcat gcgtgtatca gtattaaaat tgaaatgagc cctcacgcac 187381 atgtgaaagc agtttctccc gttttcatat tcagttccct gaggtaatac agcattttgg 187441 tgtgagagaa agaagggaat ctgacccttc tcgtgtggga agggggctga gggcaacaac 187501 ctcctgtgac agccaccctg gtccaagcca ccttcatctt tgtctccttt tttttttttt 187561 ttttttttga gacagagttt cgctgttatc acccaggctg gggtgcaatg gcacaatctt 187621 ggctcaccac aacttccgcc tcccgggttc aagcaattct cctgcctcag cctcccgagt 187681 agctgggatt acaggcgcct gccaccacgc ccagctaatt ttagtatttt tagtagagac 187741 agagtttctc catgttggtc aggctggtct caaactcctg acctcaggtg atctgcccgc 187801 ctcagactcc caaagtgctg ggattacagg cgtgagccac tgcacccagc ctgtcttttc 187861 ctcttaaaag atcttccttc tacttttgtc ccatagcagc cagagtgatc cttcaacaaa 187921 acgccccatc acatatactc cccctcaaac agcccacgct ggtccctcag aatccttact 187981 gtggctttaa ggctgtacat gatgtccccc atttcttccc tgcctctccc actacttcct 188041 ctcttgcttc tttagctctg gccatgcggg tctcgtggaa tttgccagcc agcttcaact 188101 cttagggctt ccgtgctagt tttgtcatct gcctgccatg tccttccagc agccagctgg 188161 gaagatctga atgtcacctc ctcacttaag actctgcctg gtcgctcact tactgcggtc 188221 tacctttccc ctctcccctc agctctcccg atccttctga actctgctct gtttttcttt 188281 ttcctctttt tctccataat acaaatcaca tattaacaat atgacttact ggcttttaaa 188341 aaattcttgt ggtaattgtt tctttcccta gaatataatt ccaccaggat cagggccttc 188401 atgtttttct cattggtttc tcccatgtgc ctcacatata ataagggctg caatgttggt 188461 gggctggagg aataaaagaa tgaacaggcg cactaaaggc aggatagttt gtccatgtca 188521 gagctgctgt ctcagctgct ttgatacctc tgacttccca gggtggcaaa gaataaggat 188581 
gtggcttatg tagaaggcgt tagtcatcca aaatgccact gtatatttgg aagcaaatat 188641 tcaccttgaa gcagtgatgt actaggaatt agaactgctt ttctgttaca ttttttcatt 188701 tgttatgagt gtttagatac acatgatcac tattaaaagg aagggaggta atagcctata 188761 tatcctacca tgcccctcaa ggctctgtta ggagactaaa atacaaaggt catttaaaaa 188821 gttaaaagct tatgcatgtg atgttggtat tgctgctaat gactaggcat ctagaggtca 188881 aaacctagag ctaaggatat tgaaccctgt gttttgtctc gttttaagtc taggtttttg 188941 tggagggggg aacataccta gccagcttgc ttgcagaagg catgagtgag caggctgaag 189001 gagggaccca aggccagaac gtcctccata ttctagctcc atgggctctt tcacttgatc 189061 cctgctttat gacccaactt ctgttgaagg atttgccaga tagtgtttgg ggcaggaaca 189121 gctgtttcat agaaaacaag ccaacggaca tcataacaaa gcatatggtt gttttgccaa 189181 gttgtgctgc tcattaacaa aacaaaacaa caaaaaccct ttacagtgcc ccagcatgtg 189241 gatggattgg ccctttacta ctctaaaaat tagctttaca gcctttttat ggcttaccca 189301 atcttaacag actttatttg tgtgtgttta gggatgcaaa tgggtatgtt aaatgatagt 189361 gttttgcctc tacgcaataa aatagggtaa ttttttcttc ccatgtatta aacgagaaca 189421 cagaacatgc ctagcatggt gggctctgat gaaaagaggc tgttatcatt tggccaacat 189481 ggaaatccta gcctttattt atttatttat ttgagacgga gtgccaacat ggaaatccta 189541 gcctttattt atttgtttgt ttgtttgttt gagacggagt ttcgctcttg tagcccagac 189601 tggggtgcac tggcatgatc tcggctcacc gcaacctccg cctcccaggt tcaagcgatt 189661 ctcctgcctc agcctcccaa gtagctggga ttacaggcat gcgccaccac accctgctaa 189721 tttttttatt ttattttatt ttagtagaga cggggtttct ccatgttgat caggctggtc 189781 tcaaacttcc gtccacctcg gcctcccaaa gtgctgggat tacaggtgtg agccacccca 189841 tccggccgaa atcctagcct ttatcagagg ttccgtggaa tctcagaggt ctgtgagcag 189901 gataaactgc tttatggtag tttaaaatgc aaaaccagaa ataaaccctc ctaaagccag 189961 tttgtttcca ctttctaagt ggttacagtg acctgtgttt taagtacctt aaatatatca 190021 gtgacctgct gatatacagc ttcctgtttc ctctgtgtcc tgtgacatat cctgacattc 190081 cttgagcccc actcccttcc caaaaggaag taccttgatg ggtgaggaca tagggcacac 190141 gtgtctctgg ccttctgtgg acataatcac aaaatggtag tcagtcctat gaatttaatc 190201 tctttaggat ataaggatat 
tcacgaaata gaagttgagc tctctggaca tactccacct 190261 taaaattgct ctctccttcc gggatcctca ccagctgcag agcagtagta atgaactatt 190321 aaattcattt aacaatatca gcatatgcta tccttctctg cttgactaga cagtaatctg 190381 gcttgctggt tggtatgaat ataggataat ttcactcaca ctaattgaga tggggagttt 190441 gtatgaatgg ctgaaaaatt tgagttaaag ataaaaagag acacagcttc ctcaactatg 190501 ggaaataact gggactgtta acaaagattt acttgaagag aaattgtatt gctcagggtt 190561 ctccagagaa acaggaccaa taggatatat atgtatagga agagattgat tctgcagaat 190621 tgactcacac aacaattatg gaggctaagt cccacaatct gttgtctgca agttggagac 190681 ccaggagagc tggtggtatg atttagtcca aatctaaagg cctgagaacc agggaagaca 190741 atggtgtaaa tttcagtcca gggagaagag aagaccaata tcctgactca ggcagccagg 190801 aaataaaaag ggtgaactgc tccatcctct gcccttttgt tttcagaccc tcagcagatc 190861 gggtgattcc tgcccatgtt gaggagggtg atttacttta ctgagtgcac caattccgat 190921 gctaatctca tccagaagca ccctcacaaa gacactcaga aacaatgttt aatctgggca 190981 ccttgcggcc caatcaagtt ggcacagaaa attaattacc acagaaatat ttctggattc 191041 aatggattaa gttggtaata tagtattcat attcatttcc acgtgcctaa aaagaaggtt 191101 tttatccttt ttaatatcct cagtcctatt aaacatacct cacattcatg ttccatggtt 191161 attctcttat tatgttgcta agtgcatttc tacttaggcc aatctgatta tagtgattca 191221 gattatagtg tttatcttct ttcctttata cagtagacag tgttcgcaaa ccataggagc 191281 tgaaaaaatt tccttcttat atagtaagtt ttatgattcc agttttccaa aggggtgctt 191341 atgttatggt tataacagtt acagccagat gactatcttc atcagcaggt ataaccccaa 191401 atatttatac attttcattt gtttcaaaca taaaatatgg tttgcctaaa catgtcattc 191461 tgccagattg gactatgcaa aagaatgtta ctgttttatt ttctgctttt actttaactt 191521 gttctcattt gagctctacc agttaagatg agaagatgac taaagctagg gaagcacatt 191581 ttaaatgcta aataccatgt tcatttcaag ttgggaactt gaaatgcaag tatttttaac 191641 ttaatcatta agttaaagtc agtaagaacg tggtatttag catttattca ttttaactta 191701 atgattaagt taaaaatact tgcatattgg tttgcttgaa gctatggaga aaaacattta 191761 ccttttttcc aaaaaggtct taatattcaa aattattgtt tcaatcttac aaattaacaa 191821 tttacttatt tttgatactt tcggcctttt ttatgtgggc 
tacacctaac attttttgaa 191881 aaatacatac tgcttttcct gtaggttgca ctttgtgatc ctgtgtgtaa ttcattttca 191941 gtggttctat tttatttatt ttttgttagt ttgtattttc ttttgtttct ttttctcaat 192001 ttttaaaatt ttaaaacttg tccttttctt ttttttcttt attattgcac tcttctacca 192061 aaacattttc agtggctttg aataaaagta tcagaacttt cccgtggtta gcattagctc 192121 tgagatttga acaggatgta ttaagcagag gaggagggaa ggccactaac atgtatgagc 192181 tccactgtgt gtcaccctct ctgccggatg acttacatgt ggtatggtat ctcattttcc 192241 cagctctcct atcatgtcct gcggcatgtt atcaccactt gataaaaaaa ttgaaacatg 192301 gagtgcttaa gtttcctgcc ccaggtcaca gctgatctga aaaagaaatg ggattcaaat 192361 ccaaatgtgc ttaattgcaa atccccacat gactctttcc ttccttcctt ccttcccccg 192421 ttgctgccaa tgtaagaagg aacaccactc cctcatttgc tacagagaag acaggggcag 192481 ctgtgtaaag gcaccaggaa tatgccctgc ctccttcctt ccacttgaaa gcaacatctg 192541 gatagacact caactgcttt tattttttga aaaatgatta ttaatcagga ttgaagtgga 192601 gggaaaaaat caaaacgaat gtttcttttt aaaatgaaag atactgttta attatttaac 192661 agccttccgg ttttagttaa tatctttgaa gactttggat taagattcca gacattttct 192721 tgaaggatgt agaaaacgta tttaacttca agtttagttt caaaatgact tagctatcaa 192781 ttcttctcat tgaagtcctc tttgagatat tatatcccaa atgtaataca caggtaataa 192841 ttcaatagga atacttggaa cattggtgat atttgaataa ccttttgaat gaaacaattg 192901 atttgatctc aaaagatagt gaaaacgtct cattttgttt tttatgttaa atatgcttcc 192961 tcttaagtca aaattctttt ttctttttgc ttttaagctc ataaatactc tgaattctca 193021 ctaatctttt ggaaaattgc ttgtataggt attataattc acttggctca tgagttacat 193081 aatatagcag tatatgttgc aattcaaata ttgcttagca gtattagttg aattcaaaat 193141 agaaatcact ttctttttga caactagcat aatgaattca ttgactattg tcaaatttat 193201 gaagacagaa aagtcctttt gtgattcaaa aatgtgttta ttgatgttta tatatctttt 193261 gtcgtgtttc catactgtct catgaaggtt gtcttcgtaa ttcagttatc taaatacatc 193321 atacatttac cttttccctg gggtgccgtt attctacaac aggctaatta cagatatctg 193381 tggtttctgc aggagtctta gtgtatgata ttgatgatat catcatttac tttcaaaatt 193441 gcaaaatagt aatttaggaa agcatgggga tttggaaaaa catgtcttca gcaccaactg 193501 
tttttgctct ttgcatgctt gtttcgtaaa gaactctaat gctataattg caaaatggac 193561 cattttaaag atttttcctt cattctgtac ttgggagtgg tgaaagacat ccttactgtg 193621 ctgcacagtg tctcatggtg ttctcttaaa cagcattaac gtcttgtatg cgctgcttta 193681 ctaaattctc tgttctgaga aataactgaa aatacggctt tctattaaac gagtggatta 193741 ttctgttgtt gttggctttt ttttctcaaa cctccttctc ttctacttta tagttcctac 193801 aacagcagcc agtacccctg atgccgttga caagtatctc gagacacctg gggatgagaa 193861 tgaacatgcc catttccaga aagccaaaga gaggcttgag gccaagcacc gagagagaat 193921 gtcccaggta agtctggctc ttccatcatt cagccctacg atattgggaa acctgagctt 193981 gcctctgcct cagtctcccc acaggcctgc tggctttatc aagatcttta aagatgtaaa 194041 gttctaattt taaatgttta ctgtgtgggc acagtttgtg ggttttttgc atcctccgtg 194101 aacatgctgc ctaggaagga tccaggtata gttggatttt gtaagcacaa attaaatgat 194161 gtctgtaaca ttctctctgg cgaagaaagg tacttggcat tattattctg tgtatatgta 194221 gtacctgtaa tcaacatgtt gtaaattaaa atccatcaaa atgctcttta gtaaagatga 194281 ggttttgtat taaatcccag actgttataa ttgtgaacct acgattgttt acagtctgta 194341 aaatcataga gacttaaaac tgagctgaac catagtgatt atttcatcta gtctcctaag 194401 tgatacatag tggaatagag gttcgagaga gtggcttccc attttagcac aaagcaaaat 194461 cagtgagcca gagacaacat ttagaatcca ggcctttctt gctcctagta cagtactcct 194521 ttcattatac tttgctgaaa aaaaaaaaaa aaaaaaaaac ctgttttggt ttggtttgtt 194581 tttttttagc aggtccttaa ttaagcaaat taaaaaataa cttatgtcta aaacatggct 194641 aatccatata tactatcaac tacaattgaa gctatgtctg ctaccaggca ctggatgttg 194701 tattggagtt tgataatatg gaggatttta gcccttgctc agattcctca ttcattcatt 194761 cattcaatca acaaatatgg acttattatt ttccagtcac ttttctgatt tgtagtctac 194821 aaacctagga gtgaatgtgg ggggaaaaga agtaggacat aaaagtgaat gaataaagat 194881 ctcaggttaa agtaagccat tcatttttag tgttaatatt ctgtggcaaa tatcagaaga 194941 taggagaaat ttttgtctta gcacagatgc ttaatctctt gagagacatt aaatgcaatt 195001 tatgaattag actgctgcgt attttctaag atcacagtca gtctttagta gacaattaat 195061 ctttcctgta atttctgatc tgggtccctg ctgaccccag gaagattcct ccctcccagg 195121 gccaaatgaa tggcccctga 
gcatgtcttt cccatgcaaa ccagtcaatc aaaagctcct 195181 tactcccaac caccttctta tactctgggc cactattcac ctgctctaat caccccagga 195241 ccaggtacca gaccactagg gacaacctca tgccccatca ccattatttg aactaatcag 195301 tcctaaaccc gcctaccctg cttcgctcat tctttccatg aaaaccacaa taaaaggctc 195361 ttgctcatgt tttcccctcc tgaccaaccc tggtgcttcc catatgaccc tgcatgtagt 195421 ggcatgtacc ggcttcttgg gagctgagta acaaactgtc ttttccatga taatcatctc 195481 ctgacctctt ggcctcacct gaataataag aaaatctacc cttgaaaaca taaaactaat 195541 agtttatcta gaaaggtaca acacaggtac tgttttttgt ttgtttgttt tgttttttgt 195601 ttgtttgttt tgccttaaga tgaataacaa tacttggatc tactttggta taataacata 195661 aagaggaaaa aagccttata atcacagtac ttttggctct ggaatataat gtgcagttct 195721 agtggctgcc cacgatgaaa acctaacagc taattgaact agaaagctag ccacagggaa 195781 ttattaagtt ttcacctgat gccaagaaaa tcagatcttt ccttaggaaa gacacaggca 195841 ctgttcggaa agcattggag gactatgaat catacttatt gccactgaaa tgtcctcatt 195901 tgaatttaca tcctatctcc cttctttatg aagctttcag aggcttcctg tgccctcaca 195961 aatccacttc tgggcacatg aagtcgtaaa aaaagaacaa tgacaaaaat aaacaccgaa 196021 tgctcagctt gtcattttca gcgcctacca ctgtaccctg actctggtct tcctaccacc 196081 tcttccttct caactgctac ccttgctttc tgtacagcgt taagctgctg ttctgatcag 196141 gccactctgc tcatcgtctg ttcatctcat cttgggcttc cctttgccct tggcattgag 196201 ttcttctgga aatgccctcc cactcttctt cacgctgaca catctcaccc attccttaca 196261 ccccacttca tactcagaat gagacaggca gtcattgctc ttacctttca tttaattcta 196321 aatatttgct ccccggtttt gttagtgatt tctggacaca cctgcattat ctttccaagc 196381 tggatagaaa ccttgagacc tcaagtcatg aagcctgcta cctgcattca tcgtggaccc 196441 actgtagcat tttttttaat gtgtgttttt ttaatcctgg attttttctc ctagtatatc 196501 ttgttcattt ttgaccttat taagaaaact tatttcaaat tctctgttag ttttagatgt 196561 gacaggtctt ctttaaaata ataatttaac ttttagcaca tctgtgtaac ttgtgactta 196621 gttcatgatt ttgtagattt tgttcatctt ttagtcatca gatatatctg atataaccta 196681 agagatgttg tcccagagca gagaatgttg tctactagat tccagtggtc ataggaatgt 196741 gagacacctg aaatcaatat gtatcttcat agtctgtggt 
gctgaaatct agggactttt 196801 caactttaaa cttgcctgcc tcttaaatct ctacttagta gtagtcattt tctctgcatg 196861 aaacagccaa gtctttgaag attgataaat ccaaatgaaa ttgatacctt tgtgcctgca 196921 gtggaaaatt gctttataag aatccataga gccactgcaa gggccaggtc ctgagaattt 196981 taaattttaa aaattgtgcg gttttttaaa ttaagtttta ttgtagtttt tgttatatgt 197041 tttcttacaa atacagagga ttttggaatt ttttcatatt ttggtttgtt gttgccattg 197101 tggagtataa ctctaaacct gtagattttt gttgctaaac ctgtagattt ttgttgctat 197161 atgtcgtgac tagcatctag tcaaatagaa atctacagct gaaccatgtg tgagcacttt 197221 tattacttat aaacttacta cttttagtat atgcttaaaa tgaagtcctt caaattgaaa 197281 gctgacccaa ctcataggtt tttatttctt atcttttttg gagatggagt cttgctctgt 197341 cacccagctg gagtgcagtg gcacaatcta gactcactgc aacctccaac tccctggttc 197401 aagcgattct cctgcctcag cctccccagt aactgggatt acaggcacgt gccaccacgc 197461 cgagctaatt tttgtatttt tagtagagac agggtttcac catgttggcc aggatggtct 197521 cgatctcctg accttgtgat ccacccgcct gggcctccca aagtgctggg attataggca 197581 tgagccactg tgcctggcct aggtttttat ttttaagata agcagcaaca ttttaagagc 197641 gcccctttct cctgtcacta aatataaata aacttagctc acctaatgaa ttttttattt 197701 tataataaaa aattgggtac cacagggtac aaaaacatga tctttagagc cacagagatc 197761 ttggtttata ttcttccctc tccctttttt acctgcatag cctcgagctg gtaccttaac 197821 ctggttcgta aaatgggatg gccctggcta atctcacaag ctgttgtgag gaatcaatta 197881 aattagatag attttgtaaa gctcttaata cagggtcagc agtaatgcta cttgttatca 197941 gtcccacagt gaaattcact gttcacaggg agatggtcag cctacatata gtgtgtattc 198001 atttgcagta gagtcttagt gggatggtgc aggtgatttg tttatttgtg ggggtatagg 198061 atattgttaa attgttctga aaatcaaaat gtgagttaat ttctatagta atgcttattt 198121 gcaactcatt agctaaataa atctctttaa ttatttcatg gattccccca gcttaaagct 198181 taaagtccag tttcttaaag tgtatgaggt aaaatggaaa aaatgtaaat gtaatatggg 198241 aaatcagttc tcacattgca gtagaacatt gaacttctat ctcaaactaa ttttgagatt 198301 ccttgcttga agaattgttg gtagagtacc accactgtga acattcccaa agcaggaaat 198361 tgaaatattt atattgcaag gcttatctca tctataaagc aaagatggtc ttccatttcc 198421 
cccttgccca caggttccct ttattaacgt ggtagaagca ccaaagcaga tgggacaata 198481 tgtttctttt cttccatgtt ctattcttgt tcattgcagc agggcatttg tcccaatttt 198541 gctgtgaacc ttgagattat ttttgatatg cttttcaagt gtaatgaaat atttattcca 198601 tcttgtttcc tccatggttt tacaaaaaca acaagaagaa gctaattgcc tatcatatac 198661 aaagaatgaa tgccaatgga caataaaatc tgtaattagg aacccacatg gtgatttgat 198721 atagtattgt ctcatagaag tcttctatct atagttattt attccaagcc tctttttctt 198781 tcccatcttg aagtatgtta gaagagtcta ttccttaagg ctaagtacct acaaaaaaca 198841 ggctcttata atacagtaat caagttttct catgctttgg gagtacacag agtatgcaag 198901 tacacatctg gcaccaaacc aatattttgt ttccttgatc tttcttcatt ttacctagtt 198961 cagagttgca cttcaaacaa attctacttg tcatacactt gaaaaggcag ctgtaggtgg 199021 taggaagtta atattttctt tgaacaacat acagaggtgc agacttttac tttttgtatc 199081 aggttaatga gtgaacttct ctcagggaaa tccaggtgaa catcaaagtt gagagcagga 199141 agcattgatt tcctacaaaa cacattcacc tgatatctta gtcagtttgt gttgctataa 199201 tagaataccc cagcctggat aatctgtaaa gaaaaagaag tttaaaattc ggctctcagt 199261 tttgcaggtg gggaagtcca agaacatggt gctggcatct gttgaggact tccttgctgt 199321 atcatatatg gtggaaggaa gaagggcaag aggagggcaa aactgagttt tataacaacc 199381 cacttctcac aataacaaac ccactccccc cataatgata ctaatttact catgaggaca 199441 gagactgaat cacctcttac aggacttacc cctcaacatt gttgcactgg ggattaagtt 199501 tccaacacat gctttttagg ggttacattc aaaccatagc agttggttac taaattagtg 199561 ctacttgtaa aaaaatacat gtttgctgca ttcagaatat tacatgtttt cttggagttt 199621 ttggaaggac ttagcgtaat ccttgctgtt catgacttag gttggcaagc tcaaagggcc 199681 ttaattcccc atcagttctt taaaggggga taggtcttag gatattagac agcagaagtc 199741 ttctttttct tcccccatta tgtactgctt tttggcattg tattcatgac acacacacac 199801 ataaaatgtc atattttctt taaagtcaaa atatacagtt taaaatggtt tcagttgctt 199861 tcataagaga gtaaaactta ggtgttcagt ttaatattgg aagaccttgt tcagttaatg 199921 tgtgccttcc atgaatagtt ggtatttgtt cacatcaaat caaatgactg tcagttctcg 199981 gggtaaaaaa agaccaacct tgactcaaaa agctatattc ctgcaataaa tatgaacatt 200041 tatgtttaat atactgctgt 
gattaattat ttatatccta gtgggaggtc aaatattctt 200101 cagataggaa ggggtatgta ataaattttt taaatatcta aaatgacaaa aatagaggaa 200161 aaacataatc atccatccta ttaagtctgt attcaaagga tgaactgatg attttaaatt 200221 caaatgtttc cttaatttat aggtcatgag agaatgggaa gaggcagaac gtcaagcaaa 200281 gaacttgcct aaagctgata agaaggcagt tatccaggta aaacctgaac ccatttccta 200341 ccaaacatca catgccagca gtctttttaa tgagtgtctg caggtttacc tttatctgcc 200401 agcacctcac gtgtgattgt tcatagtccc atgaacttca gctgttctta atcattgtct 200461 agtaattgag ttgtttaaat cctgccttat atttgctccc aattatttca agttataatt 200521 cattgccata aaactcacat tgtaacttgc aatctacttt ttattaataa tacatctttt 200581 cagtcaccac aatcttctgc tgcttctccc tccagcaaaa attatgagaa tttgaatttt 200641 tgatggaaac cagtctcaaa attaccttaa gtaaaccaac agtttattat ggtcaattac 200701 ttgaggtact gttagtaaga gagggatttt tttttaatat gtaatataag agtttcaaga 200761 gtgcatatat ttatccaaag ttattagtca aatcagcctt cagagagcct atgaaatact 200821 gattttcaat gagaaagtag gtttgatgag ggttggagag tgcaagaaag tgaagtaagc 200881 tataaaacgt agaattagtc aatgttggaa tgactatgca gtctttagga tactttttag 200941 cactagaaga aaatggaata ggatgtcttt tttagaagac ttgaaattgc tgcttcatcc 201001 tacttattca gtccccatgg acatatgtgt ttatgatggc agcatttcca ggagaaagtg 201061 gaatctttgg aacaggaagc agccaacgag agacagcagc tggtggagac acacatggcc 201121 agagtggaag ccatgctcaa tgaccgccgc cgcctggccc tggagaacta catcaccgct 201181 ctgcaggctg ttcctcctcg ggtaggtctc gctgcagccg agttcacact tcaggtcaca 201241 gcacagacag taagggtggg gcactgggaa ctggaagcca tacaaaaaga atgaggagaa 201301 atgccttgag cactgttatt cagaggttca acccctgtcc attccatctt gaaggtcaaa 201361 gggtcacagg gcagctacct ccacaaggtc atctctacac agcaggtact cacacttccc 201421 cacagagcag ccagacagga ctcccaggga ctcacattgc aagccctgac ttgatttaat 201481 attcctgagg tggtgacaat tgatggcata gttcaagttc gaacacaaaa gctggaagga 201541 tgtcttccag tcccttcagg ctgctgtaac agaataccat agactgggta ccttatcagc 201601 aacagaagtt tatttctcac agttgcagag gctgagaagt ccaagatcaa gacatcagca 201661 tacttggttg tctggtgaaa gtttgcttcc tggttcatag 
aacttaaatt ttgtttatct 201721 tttatttatt ttatttttgt ttttgtttgt attttgctgt atcctcatat ggtggaatgg 201781 gcaagagagc tctctgggcc ctcttttata agggtactaa tgtcaatctt gaaggctctg 201841 ccctcatgcc taattgtctc ccaaagcttt caccctcaaa tagcatcaca ctgtggctta 201901 ggtttcaaca tatgaattgg gggtgggaga gacacaaaca ttcagaccat ggcaaaggat 201961 tttagaagta accttgccca gccccctcat catgtaagta taataatata actaaggtca 202021 aggaaagttg agatttggct gagttgtaag aatttttagt ggaccagaaa gaattagaac 202081 tcaaatttcc atctcagagc agtggccata ttctctgtgg ggaagcattt gtcagtcatg 202141 acattagcaa tgatgagact tattcatggt tttcagtgat cattgcccca aaaaggcatg 202201 ttgagtggta caggcataga tgtgttcacc ggcattctgt gggagagctt acttggaatg 202261 gaaggaactt tctgagcatg agtgtgtaac ttatgtccac ttgttctttt tttttttttt 202321 tttttttaaa tgagatggag tctcactctg tcacccaggc tggagtgcag tggtgcaatc 202381 tcagctcact gcaacttctg tctcctgggt ttaagcaatt ctcatgcctc agcctcccaa 202441 gtacctagga ttacaggtga gccacgacac ccagctaatt tttgtatttt agtagagaca 202501 gggtttcaac atgttggcca ggctgatctc gaactcctga cctcaggtga tccacccacc 202561 tcggcctccc acagtgctgg gattacaggc gtgaaccacc atgcccagcc atttgttctt 202621 tctgtttgtc attgagcaag ggatattgaa agctctgcct gtgattgtgg accgggccat 202681 ttttctaaac agtgctatca gtttttattt tgtatagttt gaggctgtgt ttttaggcgt 202741 gtaaatgttc caggctgtta ggccctcctt gagcctcttc ttatgaaatg atctttttat 202801 cctttgtaat gttttgctcc aaaatctact ttaatattaa taaagctatt ccagctttct 202861 tttgattaat attaatgtgg tgcatttttc catccttata ctcttattga ccttgtttgt 202921 ctttatattt gaagtgtatt ttttgtaggc agcatgtaat tgggtcttgc gttttccttt 202981 ggattgggaa gaccatgtac actgaatgag gttatggata tggttgtcac tgaatctgtc 203041 atcttgatat ttgtttgcta tttgtccagt ctgttccttg ttcttctttt ttcctctttt 203101 ttttttggat taattagaaa ttttttatga ttctatttta tctcttttgt tggtttgtaa 203161 gttacaacta gtttgttatt ttactggttt atctcctttg tgggtttata agttataact 203221 agtttgttat tttactggtt gctttaggct tatagaaaac attctaaact tgtcacgatg 203281 atgttatgtc agttcaaatg atattatacc actccacaga cagtacttta caaaagtact 203341 
tccatttttc atctctctct ctctctgtaa gcggttgttg tcatactttt tccttactac 203401 attgttgatg tttttcttta aacagttatt ttttaatgag ttgtaaattt aaaaaaaaaa 203461 caacatgttt acccatgaag tcatcatttt gggtgctgta aactctttgc gcagatccag 203521 ctggtatcat tttccttctg cctgagagat ttcctttaaa atgtcttaga gactgggctt 203581 ggtggcttgt acctgcaatc ccagcaactc aggaggttga gataggagat cacttgaacc 203641 caggagttcg agactgcagt gagctgtgat tgcatgactg tactcccatc tgggtgacaa 203701 agggcatcct gtctctttaa aaaaaaaagt ttgagtgcac ataagttggt gatgaaatct 203761 ttgtttggct ttgaccaaaa atgtcccttt tatacctttg gttttgaaaa atatttttgc 203821 tgggtaaaaa atactggatt gaggtttctc tttcactgcc tgaaagatga tactctgctg 203881 tcttatttgt gttgatatct ttatctttat ttctctaaaa ataacatgtc ttttttttct 203941 gtttttacag ttttctctat atcactggtt tttaacaatt taattttgat atgccatggt 204001 atcattcagt ctgcccttaa ttccatctgg ccgtttttca tcttatcacc tcacatactg 204061 tagttgttgt atatagaaat ttaacctggg ctacccccaa cacccataat atcgtctgta 204121 tcttaacttt tgaacctatg gaatacagtt attatatgaa aactgtttaa atatccatgt 204181 ttgctaattc taacatctct gtcaactttg aatcaatttt gattgcttat tattacatgt 204241 ggcatttccc tgtatcattt tatgcctcat aattttttat taaatgtcgg acgctgaatt 204301 taaacctgtt gaatgctagg tgtttttgta atattcttga gcttttttct gtcacatagt 204361 tgagttactt ggaaacagtt tgatcctttt gagctttgca tttttagatt tgttaggtgg 204421 gatagaagca gttatcagcc taggactagt tattccttac tagtgaggca agacccttct 204481 cagtcatcta ccatgaatgt taagttttct cacctgactc gtgggaagag gcattattcc 204541 tgtccctgtg tgaacaccaa gcactgttaa tgctaattct ttttggtgat tttccccctg 204601 gcttcagttt ctttcacatg catgtaccca tcattactca gctatattct cagggatcac 204661 tctgcacatc tctggattct cttttggagc aactctctcc tcaccaatac cctgtcctgt 204721 gaattacaac taccttgatc tctccacaca cttcatgtcc tccactccac gagtcctcta 204781 ggctctgtgt gggtttcccc acctgaaacc atgacctgga aattcagggc catgagtttg 204841 gcagctgtaa gtcctcattt gtttcccctt ttcaggaatc agtgtccttc acctgatgtc 204901 taatgtcttg cagaaatgtt gtttatgcca ggtgtggtgg ctcacgcctg taatcccagc 204961 actttgggag gccaaggcgg 
gcggatcacg aggtcaagag atcaagacca tcctggccaa 205021 catggtgaac cccgtctcta ctaaaaatac aaaaaattag ccaggtgtgg tggcgcacac 205081 ctgtagtccc agctactcag gaggctgagg caggagaatc acctgaacct gggaggcaga 205141 ggttgcagtg agccgagatt gtgccactgc actccagtgt ggtgacagag caagactcca 205201 tctcaaaaaa aaaaaagaaa cgctgtttgt atatgaatat attgtccagt tattggtggt 205261 tcaggcagga gggtaaatcc agttcttctc actcaatctt ggctggaatt gtttgtgcat 205321 tttatgcaac ttaaccagaa agtcacccat ttgcaccgat tttcctcacc attttgattt 205381 ttctttctat ggataaagtt acagtgcaca gaaagccatg aggagccgtt aaagagctcg 205441 tggcattctt ccaagacgta agatactatt taccctgttc taccgatgat atcactgcaa 205501 cctcgggaga tggagcggct ggttcagggt gacactgctg ggaagggaga gggtagagcc 205561 gcgagtcact gtggtgcctt ctcactagaa cgagcttcct ctctaaacaa agagcttttg 205621 aaaaagaata ttcatattag cgttttcatt ttctatttgc caaaattctc tatcttagca 205681 aggcacttgt caggaaattt ttttttatat ctctgcagaa caaaagttcc ttggttcctc 205741 tcttttttgg agaacacgct ttcaggagag gagccgtgtg atcatttgcc agagaaggat 205801 aggctggtgg gtgctgagac cagaatcttt atagcagaaa cttgtgaagc gttccagctg 205861 aaagtccaat ctctaaagta agatctttgc taaatggaca ttgatgggac taaaaataag 205921 ctgagggctg aatggaggct ttaacaaatg tacatttaaa ctctcagttt agtctcatca 205981 gccatcacag gaaaggcagt ggcaagaatg tttgccgaag tgtcatcttt tatcaaaggg 206041 gataatttta gagtggttgt aatcgtgctt attgtgggcc tccgcattgc ttcatatgga 206101 attaggcaac atgttcattt gaaaaaaaaa ttaaaaactg gattaggaag aaatatttta 206161 ctgtcaaaaa tcagatcact gttgttgatt gatctaagga ttcctgttaa attcttaact 206221 cggatttatt tatttattta tttattttct ctctctctct ctctttctct gtttctcttt 206281 ctctcctgcc ttccttcctt ccctccctcc ctccttcctt cttcccttcc tttcctttcc 206341 tttccttttc tcttcccctc ccctcttctc ttctctaatc ttctcttttc ttttctcttt 206401 tcttttcttt tcttttgttt tcttttcttt tcttttcagg gtcttgctct ttcacccagg 206461 ctggcatgca gtggtccagt cgtagctcac tgcagcccca acctcctggg ctcaagcaat 206521 ccttttgcct cagcctcatg agagttgggt ctacagaagc tccaccatgt gtggctaatt 206581 tttaaaaaat ttttgtaaag acagggtctc gctgtgttac 
ccaggctagt cttgaacttc 206641 tggcctcaag cgatcctttc acctcggctt tttaaagtgc tgatattaca ggcatgagcc 206701 accatgactc acctttaaat ctttcctttg taataaaatg tggctgggtg gtcacaggcc 206761 tggagtctag ggggcattac ttgccatttt tgttaagtta ccatctttag gtgttcattt 206821 aagttttaat gttgtatggc ttgacctggc acttaggtct gcactttggc aggcacaaga 206881 aaattgtaaa taccttcttt tttttttttt tttttttttt tttgagacag agtcttgctt 206941 tgtcgcccag gctggagtgc agtggcgtga tcttggctca ctgcaagctc cgccttctgg 207001 gttcacacca ttctccagcc tcagcctctc aagtagctgg gactacaggc acccgccacc 207061 atgcctggct aatgtttttt tttttttgta tttttagtag agacgggctt tcaccgtgtt 207121 agccaggttg gtctggatct cctgacctcg tgatccgcca gcctccgcct cccaaagtgc 207181 tgagattgca ggcgtaatta aatgccttct tgaggacttt tataccttcc tcttgaagac 207241 ataaatccag agcttactgg tggttgtctg aaaaagatca catcttgtaa taggagtcat 207301 acataaatat ttctgagatc tccctcatac ttaaccaccg tgcctggaaa atgcagacaa 207361 ctgtattcag taaatggtat ttgaatataa gtaaatcaaa agtgtcgttt ccatagattc 207421 tgtagatgtt accacttaga ataattacct gtctaaccct taacaaattc taccgtatat 207481 tctcagactt tgcaaacacg ggattccaaa acaatgtgga ttgttttcag tttctttgaa 207541 gctcagattt aaatcatact tttactcagt ttttaaacct aaacgtggaa gtgttatgag 207601 atgttgtttt tgtgcagcta gaattccttg ttctactaat gaatttcatt tcctgagtcc 207661 ctgtgtctga gtatactgta tgtttcaaat ttctgattta gttatgtacc tataggcaaa 207721 tacactgaat ggtcaaccta aaaatgcatt gtttgactca tatttagagt catacatata 207781 taggcacatg tatttaaaat aaatacttta ggctagatgt ggtggctcac agctgtaatc 207841 ccagcatttt gggaggctga ggtgggagga tcccttgagc ccaggaattt gagaccaccc 207901 taggcaacat agaaaggccc tatctctaca aaaaaaaaaa aaaaaaaaga tttttttaat 207961 tagccaggca tgatggtgcg cgcctgtagt cgcagctact ctggaggctg aggtgtgagt 208021 gtgaaccagg aagtcaaggc tgcagtgagc tgtgatggtg ccactgcact ccagcctgga 208081 tgacaaagca agacgctgtt ccaaaattaa aagttaaaaa atgagataaa tacttgaaat 208141 attacaatta ttaaaatagt tgttttcccc atgggaaaaa aaagttgagc agagctgtgt 208201 ctttgcactc agaaaaaact cttactctgt tgaaggctag tgactcttct aaacaccaac 208261 
cccttcacct ttttactttc cttagacttt ctttggaaaa tcaaggagtt agcacttttt 208321 ccccctggaa gaattaaatg cctctccaga actgcagcac aggaggttaa cttctaccac 208381 ctttggaggc ctgcaagcta ccagattaaa tacttaaagc taagcattca tcatttctta 208441 acctgccctc tacagtcatg cacttcctgc tttttacttt gtcagggctc gtactcaaac 208501 tattcatagc ttcatgaaac aggaactaaa tgctgtctga gcctttcccc ttgtaactct 208561 cagcgttctg cttatccttc atagacagaa gatgatgata tgtcctgact gttgaagaga 208621 aattagactt aatgtgatcc tgtacagtta aaccaaaggc cagataggga ttttttattt 208681 caatcaactt tctattgtag gcctaggttt ggaaaacctt agcaccccct aatttgtaag 208741 acttgaactg ttttaaagat aagggataca cagccaaagc tttctttgta attctgagat 208801 ttttggtgtt tttacagata cttagtctaa atctttcatc agacagttat catgattatt 208861 attctgactt caacaactgc attttctaaa gggaagtgag aattaggaaa cattttcctg 208921 tacatctgtt tatctatccc agttgtgttg tttggatttt ttcagtgtct taacgattga 208981 aaccatctta tccatactga aattcatttc atgcttctat ctgctgcttc acattcaggc 209041 tatcagttta attgagcaaa aaaaaaatgt tggcagtttc tatagcatcg gcatccttct 209101 ttggtttctc atttttgaat ggttcagcca catgttaaat tttgctgaga tctctggaaa 209161 cttcttcaca taattctttg atagaacatg aattctgtga gactcttcat taaagagcac 209221 ggtttcatca ctaagttgag attttgctca aggctatttg gtaaattcca gcagaagagc 209281 aataaatgtt caacaagaaa cagccgtcat tggagataac aagggtagac attaatggga 209341 ggtttgtcac aggtaaacag gtgatggcat gagaaggttt aaaccacctc ataaataaat 209401 ttttcgccac tatatgatgt ggattacaac taaaccaaac tttaaccaat gtgggtgttg 209461 gtgtaccttt cttttctaaa gcaaaaaaga aattgtggta ctaaacttga gcttttatat 209521 gataccgcta attgcccttc acccaacaaa ccaattattt ctacccatca cttttcgttg 209581 caaaagaaat tatcttataa aatatatgta tatagcatat atctccaaag caaatatggg 209641 acatctgtac ctcaatgtaa ctgatagtaa gttaaaaatt atttgctgag ttcttggtat 209701 gtggacagat gaatattata atgtttaagg cattccacca gcaggttttg aatttaacca 209761 aaattttatg aagctaatca gaatggaatt aaaagttgaa agactgccag gccaaaaaga 209821 tcatctttca gaaaagtaaa taattgctct atacttacat atttattatt tatttgttca 209881 ccatccgttt atttttagtt 
gagttctgta tacattggtg agcaaagggg gactcattag 209941 cttgggcact cctaagggca aacccaataa cagatagtga atgggaagga aaatgagaga 210001 attagcattt agtgaacaaa atcttataaa ctatgctgaa tggtatacat gctctttcag 210061 taaagccact cagtactatt acttccctac cttctgaatg tttaccaagt tcgataaaat 210121 cttgctacat acaaatggga agtatgtaac aagagatact agtggtttct gggtttaatt 210181 aagttacata gaacattaag aagtatgttg ggaaggcatt agcttaaaag ctgtcttttg 210241 ccactgaggc catattcatc acccagattc cctccacagt ggagcactgc tcagtcactg 210301 ccttcagaat gcaaataggt cttcacagga tgctattaaa acccatagca ccgtatggtt 210361 ttacgctctg ctttcttcca gcaaacctac gtattttaat gttaatatat atgttcattt 210421 tatctttgag aatgaaaatg tattatgaaa atataaatat ttagtttttg tagacaaata 210481 gcacatttaa agatccagta gttattcacc tgcctggtat gactgataga gaacagatgg 210541 gtagcaaaga atcctcaccc tggaaacatg tacgcataca cacgcaaaca caatctcgca 210601 taagctttaa ggggttcatt cattttgtga aggttgacca tggataccag tttaaagatg 210661 ccaggtgatg tttggctttt aagtatcatc accttgagtt tccttatata aatgttttaa 210721 tatcaacaaa aaactcagaa tttctaatag gtcataacac tgcctaggta tgggaagagg 210781 agagggtaaa attaaaagaa atgcaaatgt acttttcttt gaataggcta accagttaga 210841 ggatgtatga agcaggacca ccatagtcca agtatgaata tgcaaaatta aagcagattg 210901 ctgtaaatct ctcttaaaaa gtcaagaata tgtcacagtt ggcatttttc ggatgatcag 210961 accgtttgac aaatctctgc aagggtccga gttagtcgaa catatctttt gcttatgatt 211021 tcagcaacac tgtgcgcttt gccttctggg ttgtattact aattctgttc atcattaagt 211081 gtacacatgt ttcactgttt gactaaacaa agcttgaaga catacaagtc caaacatttt 211141 ctgtcctttc tgtattccca ggtcagaaga tcttagctct ttatccattc tgtgctagag 211201 tttgtctgaa agcatgaatc atggggttgt gttgggtgct aaatttcagt tctatatgct 211261 gccttcacta atttatgact gtggtctgaa caaccaaaat aaattatttc tgagaactag 211321 caaaataccc tggaaaaaat ttatgaaacc taccagattt gtaaaaaggt cagccaggcg 211381 tggtggctca tgcctgtaat cccagcactt tgggaggcca aggcgggtgg atcacctgag 211441 gtcgggagtt caagaccagc cggaccaaca tggagaaacc ctgtctctac taaaaataca 211501 aaattagctg ggcatggtgg cgcatgcctg taatcccagc 
tactcgggag gctgaggcag 211561 gagaatcgct tgaacccggg ttgcagtgag gggagatcac cccattgtac tccagcctgt 211621 ataacaagag tgaaactccg tctcagaaaa aaaaaaaaaa aaaagaaaag aaaaaaggtc 211681 actaagtgat gagagaatta aaaccactcc tattttctac cttttataac ataaactctt 211741 gtctttactt cccaggtttg ccatctttat aacctgtaac taggagcaag tgatttaata 211801 aagggattcc tcctattccg tctgcaaatt ggagttaatt tttctcctct ttctgcaatt 211861 tctgttgttt tagcttttgt tagccataat tagaaaatca aaaggattca ttgaaagttg 211921 agacaaaatt gtgaagtagg aagatgtctt cctccgtaag aggagaaaga ttgttagagg 211981 aagtacatgg aggctttggt gacacatccc tatgaggaag tgatctgctt ccacgggaag 212041 gagagaacag atgatagatg actcctagat gctcctgcgt catgttcagt acaaaatcag 212101 agccgcatca gcaaaccgga taccctggga aggaacattg ggataactcc ttggccataa 212161 acaaaagttt tgctctggcc ttatgactaa gtggattaat agagattagt tgttcaatat 212221 ccttccatct aatggatcca tattgttaaa acttaagaaa caccaaaggg ggagagtgtg 212281 agaatctact gccattttct ccattgcttc attcaagact tctgtgtcca ccttggcatg 212341 ggattatcat ttgccctttt atactcacct taaactacaa agatcatttg tttgcctttc 212401 tgaagaaagg aaggtggata aattttgcag gagactttaa aaattcagct caactcctgg 212461 cattaaagac aataaaggtg gtgattcaga ggtccccatg gcagtttccc caaatattac 212521 gtttaactaa gatcatctca agcacagaga gacttaactc tctaaagtgt gatttttaaa 212581 cctgcacacg gaaattttct acattttata tgctcaagat ttcttgaaat ccagggtgta 212641 aaactgtgaa aataatagtt ctgaacctat ttatgaaagc aggagagaat tgctttgttc 212701 agttttggaa accaccaggg tgcaatttaa tgctccagga gagggttttc tgtgacctgg 212761 aaaggttggg gtgattcccg tacagtgagt catttacttc cttcttggca ggattatgcc 212821 cagggctttt aaacgctctt gacacttcac tgccttccat aagaaacggg tttagggcac 212881 aagtgtctat tatgataaga tgccagattt ctagatcaag cataaagctg tggagtggga 212941 caataggact taataatttt aatacacttc ccacgtgaac tgtcaagtaa taagctacat 213001 attcagacgt gactggcatt gagtcatcct gccattagtc aaatgtatga gacttttccc 213061 aagtcccagc aacaccaaat ttgagaaatc gtataaaaga aaaatgaaaa agtaaatacg 213121 tctaagaaat ggggaaatca ataaaaaatg tactagcacc aggggcggtg gctcacgcct 213181 
gtaatcccag cactttggga ggccgaggca ggtggatcac ctgaggtcag gagtttgaga 213241 ccagcctatc caacatagtg aaacccttct ctactaaaaa tacaaaaatt aggtcggtgt 213301 gagggtgcac gcctgtaatc ccagctgaac tcaagaggct ggggcaagag aattgcttga 213361 acccaggagg tggaggttgc agtgagttga ggtggaggtg caccactgca ctccagcctg 213421 ggcgacagag tgagactcca tctccaaaac cgtgttagct tttatcagga aaggccatta 213481 aattttcttc aatattacct acatcaattt gacatcaaac tataagaaca caatatgtga 213541 gatgttttga aattaatgaa atgaatcttc tctttgagac gagtttgtgc acgctatgaa 213601 gtccattaaa tattttctgg gaagagttcc cagttaaaac tactttctaa agtacccaag 213661 cacactatat agatacatca agtaggttta ttaatgtcag tatttcggtg aaaattggga 213721 aggccactat tctgagtaga gaggtgtttg ttttctactt gaagactatg aaggatattc 213781 tgcactactg ggcctatttc tgaaattgta tctagctgca agttactcat taatttaatt 213841 aacaaatatt tattagcaac caaattgtgg caggaactgt gcctagaggt agaaatacat 213901 tacgctcttg tttccagaga ggaaaatgag taatgttcta aaatgtaaat cttatgtttc 213961 tctttcctct tgctgtaaat gatgtctaag ccatttcggt ggtgcctgtt cttaatcctt 214021 taataaatta ttttgaatga ttttttaaaa ggcagtgtga ttgtgaggtg agtttagtac 214081 aaattaaaaa atgtcacatc agttgcaaac agacagcaca taattatgag ctaaacactg 214141 tttcagaaac agtagtgaca ataaatagga ttcggtataa ttagtcctcc tgggacagat 214201 ctcttcctct gttctttgcc atatatttta ataatgagat tagccatcta atgccaatat 214261 aaccagactg tagtattaat acacagaaaa attattgcta gtaacagaag atttaaagac 214321 atgagggtga tccggattat atgcctgcag gttaaaaaaa aaaaatgttt cttaaactct 214381 ctttatcaat atccttttcg tgagccactg aattcttttg tttttctgat catatgaata 214441 acccgtctta agtatgaata ttgtgacaat agtaatagca attactttaa gagaatatgt 214501 ttctatagag ataacagagg ggtttttccc taccattcct gtatgaaggt ttctgtgttt 214561 ttaattaaaa ttttaagaaa ttcatctttg tcttagcata ataaaataaa cctcactaga 214621 caatcaggac accttgttat tcttagatac attattatag ctcaagaaat aacaaatgtt 214681 aagcctggag gatttgtatt ttggccaaaa gttgtgaggg caattattta tttatttgag 214741 tgaaatatgc aggaatcata tatcaaaagg agggaggtct tttttttaac ctcaacaaaa 214801 atataaaagt atagggaact 
gttatctatg tgaacccagc aaatctgaga caggtctcag 214861 ttaatttaga aagtttattt tgccctggtt gaggatgcac ctgtgacacg gcctcaggaa 214921 gtcctgatga catgtgccca aagtggttgg ggcacagctt agtgttatgc atttttaggg 214981 agacatgagg catcagtcaa tacatataag aagtacattg gtttggtctg gaaaggcgag 215041 acagcttgaa acaaaaagca ggaagactcg aagtggggag gggccttcta ggtcagagat 215101 aggtgagatg aacagttgca ttcttttgag tttctgatta gcctctccag ggtagacaat 215161 cagggtgatt ggctctaagc agttcccagc tagacttttc cctttagctt agtgatttgg 215221 gggccccaag aattattttc tttttacacc tgtgtttttt ctttcttaac taaaaatagt 215281 atttgtttct gagggataga gataactacc attttatttg gtatgattta tttgcctacc 215341 gtccaacaaa gaaggtaatc tgattttgtt ttcatttcca agcatgattc ctagagatgt 215401 gtttaggtgg ccaaatcaat gaagcagaac cctgcaaatt catttattgg gagatatcct 215461 aaggtcttcc tgaattatct gatgttccca ggctttgctt atccaatagg cattaggaag 215521 aaaaggcttg tgtttatgta gatgcacaca aaatgcgcat ctcagcttac ataaacagtg 215581 aaggggtttc ccaggtctca ttatccctgg aggagctgag gcctggggag tcctcagagc 215641 ctcctttgct actctgttat tgtggcaggc caaatctcac taatgcaggc ctccgtaaca 215701 gctgtttccg tactgactga gtggttcagt taaatactaa aagctaaaaa aaagccagcg 215761 cccttataca gaggctggaa tgtaacaaaa gcccaccaag agttttgcct aggcctttcc 215821 tggcaaaacc aaggaattct taacaggacc catttaggat taacaagttt tactgggagt 215881 ctgaagaaac tctacaggcc tccaagaaca agtttactgg aggtctgcag gaactcctca 215941 aacctccctg atttagcagg agacaagata agagtaatca ccccagcacc tagacccatt 216001 tagattaagt aaatttactg agactccaga ggaaggtctt caggactcag accttagtta 216061 tggattaaag aagttaatca cttctgtcat gagatgaatg cacacttaca cgtagacata 216121 cagcttagaa ggcatataag ttctggaaac tttgtaattt tgagttggtc tggcactaat 216181 ttccaggcca tctccctgta aaaggttaca aaaataaaaa ctctcttcct cccagttcat 216241 ctgcatctcg ttattgggct gtgaggtata gtagcccaac cctcagttca gtccgggaac 216301 attattatca aactctgggg attttcagga taagttagga atatattttg aaggagaatg 216361 gcagtattaa atgagatcag caattgaaaa catgtgaata cattcaccat ttaattaccc 216421 atctcaatag aagataagga tgaatttagc actagaaacc 
atgaatgtca caaaagccta 216481 actttaatac cttgaatcac tttccagtaa tcctaatcct ttatccacat ggatagagga 216541 cattcctatc ttgctctttc ttctccccac acattagctt aataattagc taaataattt 216601 catgcattag ctaaggaata atggaactcc agaaacctag gacagtaatt gagagactga 216661 acaagctaaa tctgaacctt ctgcacaatt aaaggagctg atgttagcat ggcccacaga 216721 atattacctg cacttgtaaa aactagaaag tcaggcaagt actcctggaa cagactgtct 216781 tttctaacca tgcacttatt tctttgtttt taactttgtt tttgtttgtt tgtttttgag 216841 acggagtctc gtcctgtcgc ccaggctgta gtgcagtggt ggaatctcag gtcactgcaa 216901 gctccgcctc ccaggttcaa acaattctcc tgcctcagcc tcccgagtag ctgggactac 216961 aggcgcccac caccatgcct tgctaatttt tgtattttta gtagagatgg ggtttcacca 217021 tgttggccag gctggtctgg aactcctgac ctcgtgacct gcccgcctcg gcctcccaaa 217081 gtgctgggat tacaggcgtg agccacggcg cctggcctgt ttttaacttc gtaagttgct 217141 ttgtagaact atcttaaact tggtctattt tagattctcc ttgaagggcc agatcacaaa 217201 gtgattagga gaaggaggct ctggagccaa gacagtgaga ctaacccatt cttcaaatct 217261 aacactaact agctgtgtaa ccattgggcc agggatgtag cctatgtaac cacaatctcc 217321 tttgtgcttc atgcctcacc aataaggtgg ggagagctgc tatgactctg tcatagggct 217381 gagacgaaga tttgatgagt taccagcaaa aagcatttaa tatactgtct cattcagaag 217441 cagtcaatac acattgaact cttattactt agaggaaaga aaggggtcga tgatttatag 217501 ataaggcctg agtgaaagaa ataaatgtgt attgagtatc agcaatggac tacgcatcat 217561 tctgggtaac tatcactgaa cacatttaat ctgcaaatac tatctgaaag atactgccat 217621 ctgaattgca taggtgggaa aagtcaaggc ttttgtgctt atttttaaat gtatgttaac 217681 atgaataatt aattggccgt gcactggtcg cacgcctgta atcctagcac tttgggaggc 217741 caaggcgggc agatcacgag gtcaggagat ggagaccatc ctggctaaca tggcgaaacc 217801 ccgtctctac taaaaataca aaaaaattag ctgagcatgg tggcacgtgc ctgtagtccc 217861 agctactcag gagactgagg caggagaatc acttgaaccc aggaggcgga ggttgcagtg 217921 agccaagatc acgccactgc actccagcct gggagacagg gcgagattcc atctcaaaga 217981 ataataataa taataattag tgggaatata tgctgtttgt tgtgatagga gaagaaaagg 218041 ggagatccag atgttggtca tcagaaatga cctgcatctt ctctaaattg aaatttagct 218101 
tgaagtagta atgccaaggc cagcctgggg ccatggcatc ctggtgtacc tgcttgttgg 218161 tgacagtcct tttaggacaa aatatcttac aaaaacattc atagatgact aactggaaaa 218221 aaagaagtta tgatgaaaag caattcttac tgtttttgtt ttttgttttt tgtttttttt 218281 aattaaaaga caggcagtta ttctctttcc caggcttgag tacagtcgtg caatcagggc 218341 tcactacagc ctcaacctcc tgggttcaaa taatcctcct gaccaagctt cccaagtagc 218401 tggaactgca ggcatgtgtc actgcaccta gctaattttt aaaattttct tgtagagaca 218461 ggttcttgta atgttgtcca ggctggtctc aaactcttgg ctgcaagtga tcctcctgcc 218521 tcagactccc aaagcaatcc ttactttttt aattgaagtg cttttttttt ttcttattta 218581 aactcttaaa aattcatccc attttcttca aaacaataaa tcctttacaa gggagaaaaa 218641 tgaaatagag caactaagaa agcatccatg aaggccagga aggaacatca tggttaaatg 218701 cgattagtca cagccccttc tgtctccagt cttggatgaa taatggtaac agaactagag 218761 agatttctaa gaccttggtg cttaataaca agcaagcatc gagggtggtt gtgcaagatc 218821 ctgatcacac aactctgcat atcaaatatc cttcacaatc aacaagagtc agcctttaat 218881 ctctgatgtt ttcaaatgtt cttttttttt aattcacatg tataacctat cctcttttca 218941 atatatagag atatagctta tcagtcatta cctcttttac tacttttctt ctgagattta 219001 aaaggtcatc ctctgatttg taaattcaac cagatttcac agttcagatg agagttgtaa 219061 taagttgcca agaactgttg caagctgtga tggctgtctt tctggggatt atagagaagc 219121 acatttcttt taggaaaaaa gaaggcagat gagcccaagt tgttttattt ttattgtgta 219181 tatgtatttg ttttctcaag tacagtagca tggcctctgt ggcaggcatt caccaagcaa 219241 atacagagca ccctctgggt caagcactgt gttattgggg ttccaaataa agaaaaaaga 219301 ccaaatccag ccctgtggat ctcacattaa agaggggagt aagaatcata gataatttaa 219361 tgagttaggt acattaatgg ggatttgcac agtatttcat aggagcacag aaaaagggta 219421 cctcgttcca cctgagatgg cagggagatg gctcctaagt taactaagca aaaggggaag 219481 gagaggcaca tcccatagga tatggataac acaacaaaag ctccattacc acccgcccca 219541 ccaccacaca caccatctct ggaataaatt ttgtaccttc caactggttt tcagggtact 219601 catgacacat gggttaataa tagctaggtt tgcccaaatg accaaaattc tattcccaac 219661 atgaattttc acaactggtt tttttgtttg tttgttttaa tgaagaagaa atgttgtgtt 219721 tttggaaatc atcttacaaa 
atcatgtagt ggaatgcata tagtgatgaa attgttgaag 219781 accagggaat ctttgtcctt agtttttatt aggatggatg ttttcaggta aaactaaaat 219841 aaaaatgaaa tggaaccaat tagaaagaaa ataggagcac ttaagacgaa atgtaattgt 219901 tgctgaaaga attagaatct gatagcttag aaagaactta attaaatttc agagtaaaat 219961 aaatacagtt tagcaatggc ctttctccta gaaaaatata atgtctctta cagttcagtt 220021 actccttggc ttacaaaata gtggttaaat ttcctgtttt tctcattttg atttcgtatg 220081 ttacatgtta ctggtgatgg gattatctgt gtgacattac ctctggtctt caaaatgtac 220141 acgtagcaat gtagtctcac atatgcccct tgggtgcgaa aagagaagga cctaacttaa 220201 cctgacagac ctcaaatctg ttctctttct atagcagagt actcctattt aaaaagaaaa 220261 tgttgagtaa ggaggacagt catcatatat gtcgagaatg ttaaatgctc ttgtggaagt 220321 gaaaagcagc cgatctaagt aattacccat taagaaagga gtaggagaaa tgtttagaag 220381 gcagacctgc aaaaaggtga ctcacagtac gttcacatga ccggatgcat tagtggaacc 220441 tctaacccat cgccaatgga agaagcagtg ttttgcacaa acttgaaaaa gagtttttca 220501 ttttcctccc acagcctcgt cacgtgttca atatgctaaa gaagtatgtc cgcgcagaac 220561 agaaggacag acagcacacc ctaaagcatt tcgagcatgt gcgcatggtg gatcccaaga 220621 aagccgctca gatccggtcc caggtaagcg tggggtataa tcatcttctg cagctttgac 220681 aatccaggat tcttgcttct tgggttggtg acaggatccc acgcattctc cgagcaccta 220741 taggtgtttt ccctatggca agagctcttt aacttctggt aaattctaaa agcttgtctt 220801 agagcaatat gtggtttggc attttctgag ttatgaaaat aacaaatgta aaaatcttag 220861 tctactgcta actgattgct aaggaactga ttaaaaccaa tgactaaaat gagaaaaaca 220921 aaagttgtat aaactaaaat cttgaaaact ggttcaactg aagtatggca ggcaatcata 220981 aacctactgg aatgttgaga tttcatgtga gcctaaatga agcccagtaa taatgtattt 221041 tatactcccc ggttcccaaa ggctgaaaac tgggatctac cttaggaaat gtgaaatttt 221101 acattttctt gaatgaaaga gttaaatata tatagcatga aatagccaat caaattgaaa 221161 tacgttgttt tctaggggga aatttgcaaa gaaatgtatt taggcatgag ggtcctgtta 221221 agagagcctc agtattagga atcagaggtg gcagaggtag cctttttaat tggctgttat 221281 aagcatacaa gataatggta agtttcaaga tcaagtctta aacctgagct caaaaagtta 221341 gagaacagaa ataggagaac aagaaaaata aagttgccct 
ttcagttcaa cctttttaac 221401 agtagcagtg ttttgtcagt tttcttttgg aagtttgtat ctacctgcag tgcttggtaa 221461 gaaaaataac ctgggcgaca gagcgagact ccgtctcaaa aaaaaaagaa aaaaagaaag 221521 aaaaagaacc attcctaccc ccagacattg ttgacctgga gttgtcatcc tttgatgcag 221581 gttatgacac acctccgtgt gatttatgag cgcatgaatc agtctctctc cctgctctac 221641 aacgtgcctg cagtggccga ggagattcag gatgaagttg gtaagtaagc tgttcttttg 221701 atgctgcaca tggacatgta ttttccccca gaggaaaatt gaggaagtga gttatctgtt 221761 tgaggaatat ggaagttaat agaacagctt gccttttacg atgaaactgt caggctggaa 221821 ctagcttact agccttagca attgaatcag ttcttacctc tccatacttg taagactgga 221881 ggaatgtcgc ctgacttagg gtatagtttt gtaataattg gacacttgtt attgaattat 221941 acgtggcagc attttttcta atttgcccaa tcttgttatg gctctcaaat tatgatgctt 222001 tggtacttta caattaatgg ggaaaaaaac tcattcttca tcttttgagt gtcacgattg 222061 ttccatctta caaatgaaaa agctcaaact taaaatttga gtaacttact ggcctgtgtc 222121 acatgcatta taagtagcac agccaaaaca cataactgtg acttgtgact ttgaggttat 222181 tgttatattc agtaagtcac aattctcttt gatggctgag cagggaagcc atgagtgttc 222241 tctgaaccac atcattgttt aaacttaaaa gatagtgggc tattaaactt atattctgaa 222301 ataattgtac tctcacaaga acttggaaaa atagtacaga gaggtctcat gcatccatca 222361 cccacatgcc tcaccagtga catcttacgt aactatagtc cattatcaaa accaggaaac 222421 tgaccttgtg tgcccactgt cttttatgtc cctagttttc ctaatgacta atgatttcat 222481 ttaagatctc tcatttctgt ttctcttgga aagagaatca aataccacat gagtctctgt 222541 cagctgccaa aatgcccata actgtttaac ttacttcaca aaggtacata gatgacgatt 222601 atgttggtgc atctttcaat ataagcatat gtacctatac tggagattta gtgtgtagtc 222661 taccaacata taagcaatat gtatctggca tatccttccc cataaagaat cctacccttt 222721 gttaattggg ttggaaatag atacaggtta tcatctggaa tcagagcacc caggatgttg 222781 tagatggatg tacccattac agtaagtccg actttgaagt gcctgaaaca ccgctcaagg 222841 gaacagctgg ccctcacttt ttcccatttt atgccagtgt tattctgctc attctttcct 222901 gatttatgtg gccttgtaaa tattgtttgt attgctgtga aatatttggc atgtgtcaag 222961 tgatgatgtt ctgttcagta ctgcacatta atggacggaa tctgaaatta tactttgagc 223021 
tacataactt gcagaagaaa cttcatatgt cagggaggtg cagtgtccca gaactttggg 223081 aggccaaggt gagcagatca cctgaggcca ggagttcgaa accagcctgg ccaacatggt 223141 gaaaccccct ctctaattta aaaaagaaga gaaaaaaagc aacttcatag tgtgccaagc 223201 ttttgttcag aaggagcaac tcagtgttga gcaagaattt atgaagatag tggaggtgga 223261 gtttttagtc tacaccatga attgagaaca gagatctcca attaattttt cactgtatca 223321 agtttcttta gcaaacaaat atacttttca ggtatacttg gagaatagac atcagtaagg 223381 aacctttaaa atactatgta ttaaataagt gtaaagaagt tctccaaaaa gttgtagaag 223441 gtacaaaatg aaataattta tcagtcttaa tgaactgaag ctaaaatctt ctagaaaaca 223501 tggtttgttt gtttgtttgt ttgtttttta cagctatctt aagcctgttt tcacatgtga 223561 cttcagttga agtggcatag gtaataacta tagggaaata agccattaca aaggactaaa 223621 aagttgatga ccaatattcc accctcaccc agcaattatt gttgactctt tataattaaa 223681 ttattagaac tttttatttg gcactatact actgaaaaca ttagccacta attagtcaca 223741 cttccacaga acagatggtg gttcagatat cttgaatatt acgaaatgcc caagaatagg 223801 gcagcacaat aaacttcaaa tctcagtacc agggaggata tttagataac tagtatcatg 223861 gaatcttttt ctgtaaaaag aaaaaaaaaa acagccatta ggttggatta agaacatatt 223921 agggggtgca gcacaccaac atggcacatg tatacatatg taacagacct gcacgttgtg 223981 cacatgtacc ctcaaactta aagtataata aaaataaaga acatattacg tatgtgctct 224041 taatgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtat gtatgtaata tgctctcaat 224101 ccacccatta gttgggtttt tttaagtaat gacatatata tcattctaaa tataccagat 224161 gccacattct gaaagataat cttttttaaa aacctttcat ttggaatcct gggtgcttcc 224221 atttttaaat gactgttctc tatagaaaga gaagtttgca ataacgaacc ctgtacacta 224281 tctgagctgc tagtgtcaaa gctacagaag tggtttgctg gggagaatag cttttctggg 224341 ttgaacctag ttgtcatcta tagtcctttt tcatttgaag gaacatttaa aacaatacgt 224401 ccctttttag taggaatata gttgaaactt caagattggc caggggtgct aattgcttaa 224461 gccaaatgtt attagaggtt cctcatgtta ttctttttac ttttgtatgt gtttaaaatt 224521 tctcataata aaaagtgttg cttttagctt actttctaaa taggttcttc atagattttt 224581 tttccaggtt atgtcttgta gtcgttctaa tattctctga ttaaatgctt tgtcattagt 224641 ctgagttgca tttatttgct 
atttctgtat ataatcaaat tttagatttg gagtcagttc 224701 agtacaaata gcatactaaa attcagagac actggcaaaa ggagaaaatt ttaaagtccc 224761 taaaaatgca tgccttttgt atagaagttt gtgcttataa atacaagcaa ttaggatgtg 224821 cttaatgcag tcaaacatct gtagagagat acaatggttt aattaaataa tggtaacagt 224881 tgctttatgg ataagaggtt ttcaccagtc tggggaactg ccagtatctt tagtgggaga 224941 ggtatatctg cttcagtaaa ttatgcttcc aattagaatt ctcttttatg ggtgaaaatg 225001 aggaagaatg gagtcatgct gagaagaagg aatgctatcg gtgaagggga aaattgacat 225061 taaatttgtg agcagtaaac tgtgaccaga gcatagggtt catcatgctt gtgagttctg 225121 gccctggcct tgtatcttcc attgtgcttc cctgaggcat cctgtctgtt catgctactt 225181 tcccactaac ttcagtgaat cccaaattca tgggtctagc cacagccagc gtactgaatc 225241 ccagtgccat attccagacc ccaaatatcc catgtgcctc tgaacctgca atcaatgtgt 225301 ataacactca tgtccttctg gaaatttact tgttttacac actctattct gagttctctt 225361 ggttggccta aactatccat gatgccttcc tctccctcac ccattatttc ttattagttg 225421 caagttctgt atatttggcc ttctgtgttc atgtgaaatc tctgcacatc ccctacttac 225481 ctgagatcct gtcctctcat cagtgtcatt agactcccag ccagtcaccc tacctcttat 225541 ctcacattgg ccccctccca cttcaattcc tccttttcgg aactatgcaa gttcccttaa 225601 aaacaaaaga caaaacaaaa aaaacagctg caggccggcg gcagtggatc acctgaggtc 225661 aggagttcaa gaccagcgtg gccaacatgg tgaaacccca tctctactaa aaatagaaag 225721 attagccagg catggtggca ggcacctgta attcccagtt actcaggagg ctgaggcagg 225781 agaatcactt gaactcggga ggcagaggtt gcagtgagcc aagattgcac tattgcactc 225841 cagcctgggc gtcaagagca aaactccgtc tccaaaaaaa aaaaaaaaga aaaaaaaaaa 225901 gctgcttgtg gtttttcctt gcttgaaaag ttgcaccttt caagattcaa gtaaaatgcc 225961 caagtcctat aaaccctttc taatttgcat agagaacgga gaaaagagct ggcatttgag 226021 tgacaaccat gtaaccaact aatgagtgcc tctttgtgat aatgctgatg gtcacattct 226081 ttctttatga atgttccccg ttgaggtaca ggaccgccat atagagatgt ggcagtggca 226141 ggtggtttag ggtaagagtg gagactgata attttattct acatcccact caaggagcca 226201 agccctctgg cattgtgcac acccactgga agtgggctgt tcttgcattt cacacaaagg 226261 cagcatatgg gctagtgaag atactcggaa attcagtgat 
agcatcccca gtttacagtt 226321 gaagaaattg aactttgaac ctaagagagt ttgccaactt gcccagccca gatagtaaag 226381 tgatgaaagt ggggtggaag ccaagacctg ccagaagtcc aaaatctgtg ttctttctgc 226441 tgcctatttt cctctctgaa gtggaatttt cccctactta acctcaatgt gaaagcactg 226501 ggtatccacc tccaatacca tactctaact tggagtaaat tatttgttta tatctctact 226561 aaccctccat cctgtgaact cttccagata tatgttggtc atccttgcct cctcaagatc 226621 cagcaaggac cctggtactt tttagatatt agtaattata aatataaggt tgaagccagt 226681 ccccaggata aaaaggcaga tgctccttaa cttggactga tgaagtgggc tcctccgtaa 226741 ccctaaagac ctgaccaaaa ctccttcctt gacatctagc gtaagcacca cacaaaacaa 226801 atttctttcc taggccacag catcctctaa agctgtgatt gatcaagcta actaaatgga 226861 tgtggttggc caagataaca aagcataaat cagggttctg aggaccttta ttaataaaat 226921 tttacaacat ggtccttctt tgtaccatga ccctctgcta tttggtcatc tacttgattt 226981 aattgtgctg tcctgtttga ccgccaacca cagtctgaag ctattcccat gacaaataag 227041 tgtgaatttt caagtgtcct gttgtgttat aaaaggcccg tgttttattt tataatttca 227101 tatttttttc attcaatttt aattattatg tgcaagttcc caggctcagt catttattaa 227161 tgtttttatt ttttaaaaag ttattttcag gcagattatg aagatttctt gcaaagaatg 227221 gcctcaggca ttgtttgtat agagataaaa tctcctcttt taaatcacta tgtaggtctt 227281 cattaatatt ttttcattca acaaacgatg gaatcattat ttcagtgaaa ctatggctcc 227341 tcttcagtgt tcatattctg ctttaattat ttattaatcc ttcaaaaggg cagaatccct 227401 tcatcctttc aacctttgct ttgtcaaatt gttacttaaa aatgatggtg ttttctttgt 227461 cctttgtaat cttccttaaa tctctgctaa aaggattcta ttctactctt tgtattggca 227521 ctgaggttat ttatcaaagt gtgcttctct ttgcagctac ctcaaggcta aaaatgtagc 227581 tcttttcaga gatggctgtt ttacagaatg ctataataac caatttatga gaaaggcagt 227641 aggaaataaa gagggaacaa aggcataatt gtttcatacc catttttttt cttctagcat 227701 tcaaaattaa taaaaacatg aattattata aaccagatgc gggaaagatc tctggctgaa 227761 taaatccata gtgaatgggg caggattgaa agatatggcg gggaatcaag aaattcgcaa 227821 taatgaaagg atgcttattt tttattacaa ttttactgtg gcctcttgat aaaatgttcc 227881 actcttcagg aaaaatgcag catggttatt tcaaagacag tgtttccata ttctgtttct 227941 
aataagcctg ttgatttggg ggtttagtgg gtccaaagta gactaggttt aaaatagaat 228001 atttatagca cttcccagcc caacaaaata gaaatttaat gttattgaga atttcaagta 228061 tccttatgtg taattgttag tcaaggatga acatcaaaaa aaaaaaatca taacagcaaa 228121 tagagagctg tcactttgtt cttataagaa cataggtgtg actttgtttt tcataagaaa 228181 tgacttttaa atgtatatag atttaatggt ttacagctga cttaagtatt tggacataag 228241 aacttttttt tttttttttt ttttgagatg gagtcttgct cagtcaccca ggctggagtg 228301 cagtggcgtg atctcggctc actgcaacct ccgcctcccg ggttcaagtg attctcctgc 228361 ctcagcctcc caagtagctg agactacaga cgtgtgccac cacacctggc taattttttt 228421 gtatttttag tagagacagg gtttcaccat gttggccaag ctggtctcga actcctgatc 228481 tcaggtggtc tgcccatctt ggccccccaa aatgctggga atacaggcgt gagcccccac 228541 atctggccaa gaacatttat acatattatc aacaaccaaa gccttgttcc cccattggtt 228601 tcataatagt ctcagagctc cttattcata ttttccctag tgttattaag taacagacat 228661 gaggcctgta agtttagctg cctgtgtcta aatgtatatc tttgacccaa gacaatttct 228721 ctttcaaaat agaaccactc atttataaat gaaaatgctt tgtcaaatga atggcaaagt 228781 acggtatact attaagacag cctcctagtt ttctgacatt tggactgcca tgttgaaatg 228841 tctccacaca gcaagataca aatgtgtgaa tgccagtaca cttcatacct taggtcccca 228901 actggtctgt ctgcagactt gaggttacag acctgcaggt gtcgaactca ccccatagtg 228961 gtggttagta gagtgctttg tatacgttta gctctctatg ccttttttgg ttgtgatatt 229021 aggtaagctc tctttttctt gctttggagc ttaattatgg aatctgtaaa tatagtgcat 229081 atgtcatcca gatgatctgt ggaagtcatt gataatgacc tttaaaaatt cacatgtaat 229141 acagcatccg ttttcctgaa tgatttttat agttttacca ttggcaaaca gatactttct 229201 tttatgtatt tctactccat aattttagga aaaagtaaat ccttttagct acattaaaaa 229261 gaatacactt tctaggaata atgaatgtaa aaaaagtttc cttttttaaa aaaatattgc 229321 gctctctacc attctggcag acaaattgtt gaccttaata tgtataattc agtgcagtat 229381 attgatcttg tttattgtgc tccaaatgga cgctggcaga aatttgagga attaaatatc 229441 aaccacagtg ttttcttcat gcgattgcaa tggtttgtta tggattattt tgttcagttc 229501 aaataaatga tgtacgttca gattctttgc ccaatttttt aattgggttg tttaccttac 229561 tttcttaatt ttatttttat 
ttttattttt atttttattt ttggagacag agtcccgctc 229621 tgtcacccgg gctgaagtga agcggcatga tctcagttca ctgcaacctc cgccttacgg 229681 attcaagcga ttctcgtgcc tcagcctccc cagtagttgg gattacaggc acatgccacc 229741 acatccagct aatttttgta tttttagtag agacagggtt ttaccatgtt tcccaggctg 229801 gtctcaaact cctgagctca ggcaacctgc ccacctcggc cttccaaagt gctaggatta 229861 caggcgtgag ccactgcact cggcctgctg ttattattga gttgtaagaa tttttatata 229921 ttctggatac tattaaattc ttatcaaata cattatttgg aaatatttta cccattctgt 229981 aggttgtatt ttcacttcct taataggatc cttttgacac acaaaaattt tcaatttaga 230041 agtctatttt atcagttttt tcttttgtgg cagttactat ggtatggtat ctaggaaacc 230101 attgtccgaa acaaggttac aacgatttat acctgttacc ttctaagagt tttgtagttt 230161 tagctcttac agttaaatct ttgattcatt ttaattttca tatatggtgt gaggtagagg 230221 tttgacttta ttcttttgca tataaatatc cagtgtccca gcaccatttg ttgaaaaaga 230281 ctattcttgc caggtgcagt ggctcatgcc tgtaattcca gcattttggg aggccaaggt 230341 gggcagatga cttgagccca gaagttcaag accagaatgg gaaacatggc aagaccacat 230401 ttctacaaaa aaattatcca ggcatgataa catctatttg tagtcccagc tactcaggag 230461 gctgtggtgg gaggatctcc tgagcctggg gtggctgagg ctgcagtgag ccttgatcac 230521 gccacctggg caatagagca agaccctgtc tcaaaaaaag gaagaaaaag actattattt 230581 cccccattga atggtcttgg cactattaca caaaatcaat tgtccataga taatatgggt 230641 ttatttctta attcttagtt cttttctttg atctgtgtgc ctgtgcttac tgtagtacca 230701 cactgttttg attattgtag ctttgtagta aattttgaaa tcagcaagtg tgagtccgtt 230761 atctttgttc ttctttttca atattacttt gagtatccgg agtctcttgc attacaacat 230821 gaattttaag atcttcctca ccccctgccc ccactaccat ttctgcaaaa aaaaaaaaaa 230881 aaaaaaaagt gcggattgca ttggatctgt agatcatttt ggggaatatt tatatataat 230941 gttaatgagg ttttaaacaa cacttttatc tactagaact gatatttgcc tcgaaatgat 231001 tccaccttga gccccattca ggaacgttag tattttagaa gtagcaagag aactgattta 231061 gcatgctatc atcttttact ccaaattttg gtgagattcc agagaaatta tttgtgagct 231121 tttccttaag ccatgatgac tgacatgtca gtagtttttg tctgttttta tgccattctt 231181 ttatttttgt gcatgtgtgg gcacctcaac caaatggatg 
gagcctgagg catggtctcc 231241 cacgccaaag tctgccgttt cttttgatga tccaatgcag ttccaaagct attttctcaa 231301 ctcaccagct cgtttgtatg gccctgtaat ttgaaacaaa atctaaaaag aaatccaaat 231361 taaaatgatt gggaagtgaa gtcagaccac agtggtggct gtggaactac ttcctggcat 231421 ggaacactag agcagatgtg tcctgctttt tttgtagagt cttgggaaca accaatgatt 231481 ttagactgga acaagggctg gttattctgt ttttcacaag catctgcctt caaggttcat 231541 tttttaaaag aggaggggaa aaaaatcagt gatgtttgcc accctgagta tggagatgaa 231601 tccaaattga gattaatttt gaaatatgtt tcccgaattt tgagattaat tttgaaatat 231661 gtttccacaa acactaactg ggaatgacat gagtttgttt catgtgagac ggagtttgga 231721 agacatgtta atttgcttgt tggtgttcaa ctcagaaaca aattttgcat gtattttctg 231781 agtaagtgcc attattagtt aacagttttc ctgccgtatg cactatgggc tgctttagaa 231841 agtaaatcat agaaggttag ttataatatc tcatcaaaaa tagtggatcc atctattaac 231901 cggttccaag ttatgagacg tgtccaaagg aatgttgact tggtgtgcac ctaaacacta 231961 tttgtgaaga ggctttggac ttcgagcatg aaacggggtc agagggtcaa ggccagggcg 232021 ttacttaatt tcagctcctg ttgatggatt cattgacttc agatagtgcg tttaaaaaag 232081 cctgtttaat gtgtttgtct tttgataatt attatgtatt tacatatgct cctaaaagca 232141 tgagttaaat cttatttctt tgaaacagtg cttggcaaca acttgtattg tcttgtacat 232201 gccttttgtg tctttgcttt tggttggggg ggggcattga aatttgtctg cccatcccct 232261 gagggcagat actgaggtgt ggacctatgt cttcagcttc ctgtatagca ttgaagttgc 232321 tatagctgca tgttctctaa aatttttata catgcgattt aattgttaaa gcttatgatt 232381 tttaagtata gtattctgat agcaaataga ttttgtcaaa taaaaactaa ctttatcatc 232441 accactgaga tatcactagg tatatggctt agttctcagt caccactttt atttgtaagg 232501 aagtcttaga agttttcaca agtgtatatg tgatagactg ggcacagtgg ctcacgcctg 232561 taatcccagc actttgggag gccgagtcag gcaggagttc gagaccagcc tggccaacat 232621 ggcaaaaccc tgtctctact aaaaataaaa aaattagccg ggcatggtgg cacatgcctg 232681 taatcccagc tactcgggag gctgaggttg gagaattgaa cctaggaggc agaggctgta 232741 gtgagctgag atcctgccac ttcaccctag ccagggcaac agaacgagac tccatctcaa 232801 aagagaaaaa aaaaaaaaag ctgggtgcgg tgctcacgcc tgtaatccca gcactttggg 232861 
aggccgaggt gggcagatca cctgaggtca ggagttcgag atcagcctca acatggagaa 232921 accccgtctc tactaaaaat acaaaattat ccaggcgtgg tggtgcatgc ctgtaatccc 232981 agctacttag gaggctgagg caggagaatt gcttgaacct gggaggcaga ggttgtggtg 233041 agctgagatc gcgccattgc agtccagcct gggcaacaag agcgaaactc catctcaaaa 233101 aaaaagaagt atatatatga tgcttgcttt agcagcacat atactaaaat tagaatgatt 233161 aaagagatta acctggcccc tgcacaagga tgacatgcaa attttgtaaa gtgttttata 233221 tggaacttta cttacatgag gtacctaggg tagtaaattc atagatacag aaagtagaat 233281 gctgattgcc aggggctgag aggatgaggg actgtggagt tagtgtttaa agggcacaga 233341 ggttcaattt ggcaatattg aaaatagttc agtggggaag atggatggcc gtgatgcatc 233401 gtccatcaca ttgcacagca tcgtgaaata tacttaatac cctgagcgtt acacttaaaa 233461 tggttaaaac agtaagattt atgttatata tatttaccat gataggatat atatatgtat 233521 attcatatca ttcgccattg tgaatagcaa ggagattgaa gaaataagga cacaggatct 233581 tcaataaata tgtattaaat gcttcccatg ttccacattc tttgtcattg ccagacacgt 233641 cacaggtata agggcccaat ctttgttact caaataattt agaataattt agaatttgct 233701 tggggagact ttggataaac agttaagaag ctgtatccat taggtaaagg gacattgctt 233761 gctgcgtctc gtagcatttg gtagtctcgt tagacagccg ttcttactcc tatcgtttgc 233821 tgaaatttat tagcaaagga aagaatctta gttttcagct cgttcagccg catgatttca 233881 taggctcagg aaaaccatgt aatatgccca aggtcagacg tgatgaataa gactaaagac 233941 tgaatctatt ccagtgtctc tgtgtccagt gatacttact tactgctgtt gccaagttaa 234001 acaaaactgg gtttgcagag ctgtttattc accttcagca tcttttcatt gaaaatgcgg 234061 tagatagaag aaatgtccct cttttagcat ttgtacctca gtaataagat gcttcccaag 234121 ctcaaagact ccacaccctt aaatatgtaa gtcagtgtta agatatttaa agagaacatg 234181 aatatcagct acattgtgca tgttgcagga attttatatt attttgtatc ttctgcttag 234241 aagtcataag ccagcccatg aaccatgcca acaaacatac cttcttccat gtgattattg 234301 cagccagatt tttcagtgca ggacaagcaa agttttctct gaaagcatga gataataatt 234361 ctcagccagg tcttcgccat ttgcaaggga tcagacataa gagtttttag ctctgttgat 234421 gttaatagct agagcccctg ttagcagatg gcatgttagt ggcctgttgc tgctggctat 234481 taaatcatac acattggcca 
ggcacggtcg gtcacaccag taatcccagc actttgggag 234541 gctgaggcgc acagatcaca aggtcaggag tttgagagca gcctgaccaa catggtgaaa 234601 ccccatctct actaaaaata aaaaactagg caggcatggc ggcacatgcc tataatccca 234661 gctactcagg aggctgaagc aggagaatca cttgagcccg ggaggcgaag gttgcagtga 234721 gccgagatca cgccactgca ctccagcctg ggcgacagag cgagactccg tctcaaaaat 234781 aaataaataa aaatattaaa agtaaatgat acacatcatg agatcatctc ccaaggcagc 234841 agagaatttt gcgtattttt ttcttagttt atattgttgc aacagagtaa ttcttcttga 234901 tcacagactt tgcttttttt gttgttattg tttctaagtt tcacttgatc ctcaaaataa 234961 cctgagacag acaaaacaga taggttttcc cattaatctc cttcatgagt ctgactttgc 235021 cagccaggct gatcttatat ccatagccct ggatgcattt atttatgtta ttccatctgc 235081 ctataatgcc catccccaat cttggcctat tacatgtgtc taacattact ttgatgtgta 235141 catgtcagtg aattaccact taactgattg ttagcttttt gagagtatgt atcaaatctc 235201 atgtgcattt gtgtgtcctg aaacaaatta ttaggtataa agtatgtatt gcaaatacca 235261 tgcccaggga atatttcaga gctgtggttc tccaagtgtg gtccttagtc tagcaacatc 235321 agcatcgcct ggcatcacct gggaacttgt tggaaaagca cattccatga tcccacccct 235381 gacctcctga atcagcaaag ctagggatgg ggctgagcaa tctgtatttt atcaagctgt 235441 ccaggtgaca ttgataaatg ctaaactttg cgaattactg atttagagac taatcaactc 235501 attaaaaagt taataaatag gtgagaatcc attgtagcac cccagtcacc tatgctatta 235561 tcttctaaat tcacattata caaattgtat gccacaaaaa tgtttaaggt gaaaacccaa 235621 aggaattgat gcattggatt aattcacaaa aatttaacag ctgcaattca tagaaaaaaa 235681 atgcatgcat atctgataat tagttgtaga ttaaaaggtg aacaacttct tattattaat 235741 acccaatgaa aatgtaattt tttaaaaatt atggccaatt ggtagttttt taaaaatctg 235801 ttatgccttg tcatgataag gataggtgaa aaggaacttt cataaactat tgctaagtat 235861 atgggtgaaa ccttttagac tatttgggtc tacctaataa attaaaatgt tcagtctttt 235921 ctaattttag caccgtattt ctacagatac actggcatat ctgtggagag gggcacttct 235981 gcttgtcacc tgatatccct tactactact aatattttta taagcattat tacctttaaa 236041 atcatgttaa agggtaaatg atgtgcaaca gactcagttt ccccttctct gactcctgtt 236101 tagtttacat ctcttttctc tctctcattt gggggctcaa 
tacccttaag attatagaga 236161 aattgcagta ggtaatgttg gctggaaatg ctaacttgta agaaacttgg gtggaaatgg 236221 tttaagccat cagcctaaac gatcagtgaa aatgtcagac attctccgct ttcttcttcc 236281 aagatactat actctactgt ggccttagag ttcttaattc ctatataatt ctgattaaaa 236341 aggaaggaga tgctcctaat ataccttgct gttgtcccct cctgttcatt agggtagctg 236401 atgtcagtca aaataaagtc tagaaattct tgagtacttt ttttttttgt aatatactga 236461 gaatgcttga agcaaactaa ccaaatttaa gtgttttaag gatattaaca gtaacaggag 236521 agcccctcga ttttctgcta agaatcaacc tgaacagttt ccagactgct ggttggcaat 236581 gtcagtatca tgagaaaaac tgaaaaaata agaaacttta gagaaaaaga ggaatcagtg 236641 caggctaggg gaatgaactt ggtaatccag tagtctttcc ctgccagcat tttgattgat 236701 aagctaagtg tctgcttttt ctcatccctt attttttggc aggaaaccag tattttgata 236761 cttgaatgcc ttctttctat cttaagggag aagccttctt aaacttcaag tgtgaagtag 236821 atcctttagg tataggatga aaaattaagt gggactattt gtgaattact aaatcgcaac 236881 cattaaagaa actaatgccc taattagcct gtcttttgac agacatggca tccatatgta 236941 agtctaatat tgtatgcaaa tgtgagacag gttttgcttt ctagatctct ttgatgtgaa 237001 aggaaaagct ggcttcctgg cttccttccc atgttctgta tggacctgcc ttcatcctgc 237061 tgccaacagc tcctcaaggg aacatgaaag tggaattaaa ccctcctggt tttcagattc 237121 agaatcctgg gtgattaagt ggcttgatca aaatcagaaa ctgtgaaagt gctaaaaatg 237181 gtcaaggccc aaggagttcc tcatgtgtgg tcaaaagcct ttcagaaatc aaagtgtgtc 237241 tttaaatacc tggggtgcaa aaatcagatg ttttttattt aaagaaaaat aggtatatta 237301 gagcagaact gggggccctc tgtaggagcc tgagtctggg tgtttactgt cgtctaaggc 237361 cttgttaatg tgggcttgag gcttcggaat ccacgaacag gggaagctag cgtctctcgc 237421 tgactccagt cctagctcga gctttccata ctgccctacg tgacttctgc acttttgtgc 237481 catcagccag tagactgaga aaagtatctg gtatttgtcg gtgtcttcgg tttagctgct 237541 ttaagtcctt ggatggaaaa ggctgattgt catcctgaaa tcattgtctc ctaatgtgta 237601 attggggcca agtgaagcac agtaatttag aggaattcta gttaaaaata cagattcttg 237661 ggctctacct cagacctaca gagatctgca tttttcttat tcccccaggt gatcctttaa 237721 acattcagag gtttttaaaa ctcatgctta tatgcaataa caaaaatata aggattctta 237781 
gaaacttgtt atatttgtct tgaacatggg atgacagaca ttgtcatttt cattgaaaaa 237841 aaaaatgaag cagagactca gaatgtattt agagacaact ctaaatacat tctggcagag 237901 cctaactact ggattgcaga gatcctccct gcttccccca gtccccatct ataaactggc 237961 catgtgaatt tctagtggtg ttcacttcca aagtggtgat tttataaaat attacaatga 238021 agcattatta acctcaagaa gttcagatgc atagagagtt tatttgggtg tggcagcatt 238081 agtggctagg ttttatttat ccaagataag acttcttatt agctatgaaa agggattggc 238141 ataaactgct tattaatgtc cattaactgt atctaaaggg aataaacagt ataaaaggcc 238201 aggcatggtg gctcacacct gtaatcctag ccctatggga ggctgaggca ggaggatcat 238261 ttgagcccag gagtttaaga ccagcctggg caacatgggg agaccctgtt tctacaaaaa 238321 gtcaaaaaaa taataagctg ggtttagggg tacacacctg taatcccagc tagtcaggag 238381 gctgtggttg gaggatcact tgagcccagg agttggaggt tgcagtgagg cctggcggca 238441 ccactacact ccaggctggg tgacagaggg acgaccttat ctcaaaacaa acaaaaacaa 238501 aaaactgtat aagcataaga tgggcttgag tcacaaaatt ctgcctttca taactttccc 238561 ccatttgctg tttgatttgc tgtttcagaa attgtgaacc tacttctcta cttatagaac 238621 atttacctcc ctcttagatt cttccctttc tatcattgat actgacattc tgcttaaaga 238681 attatccatt taggctctct tcaaatccaa agtgatatta ttttatttta ataagagatg 238741 aaagggaaaa ttgacaaaat gtgcctggag ttgtcagcat tgtaaatgat ggactgtaaa 238801 agcttcgtct tggaagattg tcagagagcc tagtgccagc tcaaaaagtg tggtcttttt 238861 agagaattga cagaaatgtt acaaagaaaa catgtgctga ataccaactc caggtgcgaa 238921 tcctgagaag caactcactc tattctgcag taggagatga attttgcagg ttatacagca 238981 gagcaattta aactcccttg tttcttaaag tatatcctat agtgacttgg gatggaagtc 239041 atatactatc cccttttttg ttccttcaaa gtaaatacac cacagagctg gagatgtccc 239101 agacacaggg ggaagctgtt tggatttttc ccactgcctg taatgaagca agatagggag 239161 gaagaacttg ggaacttgcg tggcaaatgg tcaagtaaag aaaaagaaaa gctttttctt 239221 tacttgacca tttgtcacat gtgctttcaa atcctgaact gagtcacatc tgaaataaat 239281 cacttccttg tcatttccat gaaaggggca gtacaagaaa ttccttgcac atccctggtg 239341 agccccttgt tactctcaga ttcagctatt gccttcagga aaaaaagaaa aagaagccga 239401 agaaagaaag aaaacagaat 
cctctagaaa aagctattct gacctgtgct ctttcttttg 239461 tctgagtgtg actagataaa tgcattagtc tgctagggct gccataacaa aataccaaaa 239521 taccacagac tgaatccttc aacaacagaa attatttttc tcatgcttct ggaggctgga 239581 agtccaagat caaggtacca gcaggtcggt ttctgaggag gcctctcttc cttgatttac 239641 aaatggccac ctacgcatca tgtcctcaca tggtctttct cagagagaaa gagaggtgtc 239701 tcttcctctt cttcttttgt tttgttgaga cagagtttcg ctcttgttgc ccaggctgga 239761 gtgcaatggc acgatcttgg ctcactgcaa cctccacctc ctgggttcaa gcgattctcg 239821 tgcctcagcc tcccgagtag ctgagattac aggcatgcct ggctaatttt gtatttttag 239881 tagagacagg gtttctccat gttggtcagg ctggtctcga actctcgacc tcaggtgatc 239941 cacctgcctt gccctcccaa agttctggga ttacaggcgt gagccccgca cccggcctcc 240001 tccttttctt ataaggacac ctgtcctatt ggatcagagt cacaccccca tgacctcatt 240061 taaccttaat tatctcttta aaggtctgat ctccaaaggt cccttgaggg ttagggcttt 240121 gatacatgaa ttttcagggg cacacagttc gatccatagc agttggctta ttcatacttt 240181 gggtagatcc tcttcatcgt tcctcagcta ctgtctattt agtgcacagc ttctttatcc 240241 taaagctagt caggggaggc atggaaggta ttatttctcc tgtttctgca cctaaggcat 240301 taaaacgagg ttgtggtctg gcagcctaaa ctcggccttg atgaacttca aatcctagac 240361 ctccttggta tgtgaagtgc atacacttgt atggagggat ttgacaggtt tttttttttt 240421 tttttttttt gagacagagt ctcactctgt tgcccaggct ggaatgcagt ggcacgatct 240481 cggctcacta cagcctccgc ctcccggttt caaatgattc tcctgcctca gcctcctgag 240541 tagctgggat tataggcgcg caccaccaca cccagctaat ttttgtattt ttagtagaga 240601 cggggtttca gtatgttgat cgggctggtc tcgaattcct gaccttgtaa tccactcgcc 240661 tcggcctcgc aaagtgttgg gattacaggt gtgagccacc gcgcccagcc cagatttttt 240721 taaattagtt tttagtgtat ccacactgtt aaagcaggtg gatactgctg gatagtataa 240781 caaagtatag tgtgacaaac ccataaccaa gcattgttag ctatattttc tcatgtcttg 240841 agccaaaagg tcacagttaa cctaatgttg ctactaaaca ctcataccat aaaacattct 240901 attccagtct tccagatttt acacaagaat aaaatgaaaa cttttagact actggagatt 240961 tttttttaaa tacacaattt gttgaatgtg aacgctaaat tgaaattgac atgctaagtt 241021 ttatagatta cttcagctgc cctgacttca gtaaatttag 
ttgtgacaca ggtacaacag 241081 ctgacttctt tctggcatta gaggtggtga tgtttcgtca ctctggagtt agtaaaacct 241141 gaaacccgcc aggcatttga cacacagttt gctgccaggg acagctcagc ttctggctta 241201 ttggagagga aatgacctaa cctctgtgtt tacaaagtgt ttgaaactga ggcccttcca 241261 cttccttgct gagtgtaagg gttgacctgg catccaaaat ccccatgatc ttctcaggat 241321 aagtcatgca gccattggtg gtcaagaaag gatttgttcc ctgtggaagg gcattggaag 241381 aggaacagag ctgaaacaca cgggtgttca gagggaaaac atctctgcac ttaagagggc 241441 tagattagac tattagacta ttatcataga gctcgtgatt catctccttt gtttaattcc 241501 agccttgagt aaaggactca ctaagaaacc acattcggcc gggtgttgtg gctcacgcct 241561 gtaatcccaa cactttggga gaccgaggca gatggatcac aaggtcagga gttcaagacc 241621 agcctggcca atatggtgaa accccgtctc aactaaaaat acaaaaatta gtcggggatg 241681 gttgcgggcg cctgtagtct cagctactcg ggaggctgag gcaggagaat tgcttgaacc 241741 tgggaggaag aggttacagt gagccgagat cacgctactg cattccagcc tggtgactgg 241801 agcgagactc catctcaaaa aaaaagaaac cacattctat tccagtatat ttattataac 241861 cagaaattct tgtcttatca gcaaatagct aatattcact tacatgggca gtgtattaga 241921 ttcctagggc tgctggaacc agttaccaca aaccgcatgg ccgaaaccaa cagaaatata 241981 ctctcattat ctcattattc tagaggttag aagtcctaaa gcaaggtgtt gccagggcca 242041 cactgcctct tacagctcta ggggagaatc tgtttcatgc ctttcgcttg ctctagtgct 242101 gctgccagcc cttgatattc ctttggttgg tacatgcacc cctccaatct ccacctctgt 242161 cttcacatgg tgttactccc catgtctctg tttctgtgtc ttctctaata tgattaaggg 242221 cccacccttc tccattgtga cctcaatctt atatctgcaa agaccctgtt tccaagtaag 242281 gctgtattct taggtgtagg cattaggact tcaacatata ttttggggga aaattcaacc 242341 cataatgcgg ttatgttttt attttccttg acttttaaga atgtgaacta ttgtcaaatt 242401 ttttcaacca ctatcaaatt taaatctaag aactacatgt cttaaattta tgacctcttc 242461 attaaaaaaa aaataataac atttaattgg taccaaaacg tgaaagagtt taatgctttg 242521 catttgcttg aggctgatta atgagtaaat gtgtcaactg tggctgtttt aattagccag 242581 ggttaattaa tgtttaccac acttggaaga tgaaaaggca tgtataaatg ctctataatg 242641 tgaatcattt tttaatttgt tgttgaagct tatagtgtag ccaagtaagg attgcttaca 242701 
tcatacttac aaagggaggt tcattaatgg atacaaatac acagtttgat agaagaagca 242761 agacctagta tttaattatc agcaaggtga ctaatttaca gtaatctact gtatatttaa 242821 aaatagctag gagaacctaa tttgaaagtt tctagcgtaa agaagacagg ctgggcacga 242881 tggcttacac ctatagtccc agcacttggg aaggccgagg tgggaaaatc acttgacccc 242941 agcaattcaa gactagccag ggcatcatag tgagacccca tctctacaaa acaaatattt 243001 aaaaattagc caggtgtgat ggtgcatgcc tgtagaccca gctgctctgg aggctgaggt 243061 gagaagatca cttgagccca gcagttcaag gttcaagtga gctatgatca catcactgca 243121 ctccagcctg agtgacagag agagaccctt cctcaaagca aaaagaaaag aaaaacattt 243181 aagtgatgga tatcccatta tactgatttg atctttccaa attatatgaa tatattaagt 243241 tttctcatat aatctgaaaa tatgcacatc tattatgtat caataaaaaa taaaatttta 243301 aaaatgtaaa aaacgatgtt ggagcactat ggtacttaac gtagaggctc tgacttttga 243361 ggtacattca ttacctgtta ctgaagttga gtatgcttta atattgtatg tttagaaaag 243421 gtagttacat ttttaaatct gattgtgtaa gaagaaaaga gtagaaatgt ttggggccaa 243481 gtcaataatt atttagtata tcataatttg agatgaaatg tgtttttttt tttctttttt 243541 agtttcaagt tgaaacattt taagaacaga taaggaaagg ctaactgtgc tgagatgaaa 243601 ataacatttt taaagtaact atcttaaaga aaatacattt ctagaatttt gtgtaaattt 243661 tcaacaaaga ctcagaagcc tggatgaggt cttgtgccaa gactggaaag tgtggggatt 243721 agaaaaagtt tgaagagatt gttgtgagat cctctcgtat gaggcctgaa gctgtacatc 243781 catggaaagc agttacccca ggcgggaaaa caggttatcc attctcgagt gagtggccgg 243841 gtgggaaata gtccatcatg tcgaatggat cgtactttcc agcaccaaga tttgtggttc 243901 tctaggtgaa taagatgaca ccccttcact tcacttcact tcaccattgt ctgcatggag 243961 tacgtatcta aaaacaaatt agcaaaaagg ttgggtctac tttacttttc atctcctaca 244021 ctattttctt ctttgttctc ggcaggaaaa gatgaagttg cattttatta ttatctcata 244081 ggattcttat tctaaaatga tttgatttaa aacccattgc actttatttg gataaaaaat 244141 gcttcactct ctagttctta acaataacac cagcctgatt ttattacgtt aagcactgca 244201 attagacaat acaaagaatt tgcagcaaca acatgttatt ttatcatggg tggaactgtt 244261 cattgtctag cagaatttat taccagataa atcccttttg gggaatctaa tgtggatgtt 244321 tttttcaaag cgcctgtgga 
gttctgctag tattgggaac gtttccattt tcattctgat 244381 gcttccctgt gggctgtttc agcatagaag tgagctctct gggtccatgt ttccattggc 244441 tccacccaat attctgaaag attgcaactc aacatgtgta aacagaacct ttctccccca 244501 tattttctct tttcgtagat taaactcagg tcaagaatta gaagtttatc ctccaccccg 244561 atagcatctc gccctatccc aaataccgcc accaaaatgc ccactgtttt tccttgggcc 244621 agttcctgaa atatctttct ggctgtatca ctcttgtgtg catccttatc ggctatgctt 244681 ttggccattt tttactgctg actttgtctg gttagtcttt tataatctcc aggcttctcc 244741 tccatccttt ttttacagtt tacctgttca tgaagcgcag gtcacttgac ccgtagaatt 244801 cccataggct gggattttgc taattgcata ctcatggtat aggtcagtgt attcctctgc 244861 cctctgtatc ccacaaactg aaagcctctt ggctaagctg cttccctttt cacttccctt 244921 gtaccacagc agccagaaga gtcacagatc ctgggcagct tgtcctctgt gcctcggtac 244981 atgcacttgt atctgctagg acaatgatct ctttccctgg ggccccactg taatatcact 245041 agtcacatcc ctaaagccca ggcttgtttc tttgtcattc tgtcggtatc cctaccataa 245101 cacttgtaat agtaattatt tttatgccca tcttttcttc tgaaccatga gcccttcaag 245161 tgaaaagaaa ttgatgaagt tatttctagc actcccctgg aaaaaggaag gcaggcaggg 245221 agggaggggc tcactagttt atgataaagt ggtttgattt tcttggttgc agattatttc 245281 ttttaaagtg actcttttca gcttctctca ctcctgtgca tcgtgaatgc tctcctgaag 245341 acagacgctt tccattcatt tccaagcccc ccatgtgaag tgctttctgt tgccaggctt 245401 ggtttcccca cacgttgtta aagaagttaa agaagcttgt gaataagcta aagcagagga 245461 gaaagaacca ttgctatatc tagaattgtt caccagaaag caagttttgt gttaaagaaa 245521 acacgagtag gccactgtgc agaagtgctt acagggtttg tgtgcttttt aaaatctgca 245581 taaataaaca tttcaaatga gctatcatat atacaaggat gatgtgctgt ctgtgggaaa 245641 aggacccaat cctgtgttgt gattttacat atatatatat atataatttg tgctattaca 245701 cacttgcctg atccccttat aattatctgc ataagtcttt ttaggctctt tgaatgaaat 245761 tcactatgta tggatgttga aaatcatcct ttctaccatg agctcatcag acttgctaac 245821 caattctttg catcctttga agtataaaat tccatcaatt cagcatgtat caacatgaat 245881 caattagaat gcactttctt cacagcatgg cggagttttt gctggctgcc gggagcggca 245941 ctgtttccca attcattcat ttgcttgggc ccgccccagg 
atccttctta tcccggtcag 246001 atagacttct ctcatgagct tgagagacgg ggctggtcac acaggacacg gctttaggga 246061 cacgactcac tgaaactaca aaatgagtgc tgctcttaga gttgtaccca cagcctgcac 246121 agctgttatg gcaatgctgt gggcaagtct tgctacggcc cattctcctc cctcccttgt 246181 cctgttggaa ggttcaggtt ggagctgaaa ttcagtccag cacgcagctg gacaagccat 246241 gtctcctgta tgtgactcac tccccagaga aagcatcatg tggactagta agggcttggc 246301 tttatcagaa gcaaaggaga ggtttacatc tcttcttgct ctctcttagt gggttcagag 246361 ctccagcaga aaaaatggga cactgtggat tgatctccag aaaaccccaa aaccagcact 246421 gaaggctcca ggagattatg cctttagatt gtatacaggc accagaatga cttttgcctt 246481 tataaaaatg cgtgcaagag catgaaaact aatatttgag gaattttttt ttctcactgt 246541 actttctggt cctcactgaa agcattggca gttttaaaaa atgaaatgtg atactgttat 246601 agacgtagca aacctcacgt ttcatttttt ccaattcatt ctatatctag gaacttactt 246661 agaaggagaa acaaactgta cccagaaaaa ggaatgaaac tggagaattc ttcctcatga 246721 ttcctctgtg tcctcaggcc taattatcct cattccaaag actctccatt gcctcaataa 246781 cttcgaaaaa tttactctcc ttaagcatag tcataaaaat tgcatagtaa agtgataaag 246841 agacgaaaat aatgctgtaa gaataaaacc attaaaaaaa aaccactact cttgattatt 246901 tctgctgtta gaattgtaaa tgaattctca ttcctttttt ctacattttt ggcatgaaat 246961 ttatgttact cttataatta gtaaaaacat tatattttat taataaaaga agactcacag 247021 tgaagtcata acatatgctg cagaggttag actcttcaag tcagatttga agagagaatt 247081 tcacgtaaca atcaccttga aatgtcttca ggtcagccat caaagtacac atctctcaac 247141 cagccaggca gccagttttt cctccagcat gtgaagagtg aagagatagc tgtttcttgg 247201 cttaaagacc atatgaaata attgacttat gtttaaaaga tatgagggat caggccgggt 247261 gcagtggctc atgcctgtaa tcccagcact ttgggaggcc gaggcgggtg gatcacaagg 247321 ttagaagatc gagaccatcc cggccaacat ggtgaaaccc tgtctctact aaaaatacaa 247381 aaactagccg ggcgtagtga tgcgcgcctg taatcccagc tactcaggag gccgaggttg 247441 ccatgagccg agattacgcc tctgcagtcc aacctggtga cagagcaaga ctctgtctcc 247501 agttaaaaaa aaaaaaaagg catcaaaatt acattcactt ctatttttat ttttgtttgt 247561 agaagacatg cagtttgtat cacattaatg ggcatatgat gtaatttcac tataaatcga 247621 
accagaggaa gacatccaaa tatgtataaa ttaaagctga caatgcctac agtactaaac 247681 tttttgtttt agtataattg gagacttaga aaagttctaa aaatatattt aaaaattttt 247741 aaaggaagtt gcagacataa tggcccttta cacctaaata cttcactgta cgtttcctca 247801 aaacaagaag atatgtgtgt gtgtgtttgt gtgtgtgtgt tgtaatctta atacagcatc 247861 tttcagagat tatttctgta gtatataaac ttggtttttt tgtttgtttg tttgtttgtt 247921 tgtttgtttt gagatggagt ctcgctctgt cgcccaggct ggagtgcagt ggcacaatct 247981 tggctcgctg caagctccgc ctcccaggtt cacaccattc tcctgcctca gcctcctgag 248041 tagctgggac tacaggcgcc tgcaccatgc ccggctaatt tttgtatttt tagtagagac 248101 gcgatttcac ctcgttagcc aggatggtgt cgatctcctg acctcgtgaa ccgcccacct 248161 cagcctccca aaatgctggg attacaggcg tgagccaccg cgcctggcca taaagttgtt 248221 ttttagtctc tactttaagt tcctgacaac ccaggcagga ccaagactcc ctaatgtagt 248281 tggaaatcag atcagcacat tggcacaagt tctacaggta aacgggaaaa aggagcccta 248341 caatctgctt ttagatgcca gggagcctgg ctggtcttta tcttgattac tgactgatgc 248401 cactggtagg tctaagttct ttgatctatg atgtttgaaa gccaaaagga aatgtgtttt 248461 aaaggaaaag ggacttcttc tgtaccctca gtctaagcca gacaagtgat atttcaggaa 248521 aatgtcccta gcaataagga atattttcac aaatgatgtg aagctttgtt tgtgtccctt 248581 ccagggcagt ttgttaccat aaggggacaa agcatgcttc atttaaacat gtgtggccag 248641 gaaactgtta tttttaaaca aaaataaacc ccgatttaga catgtagtct gagttttaac 248701 tctgggagat tttcaagatt gatagtgtag ctaaaaataa ctgacctaat tttctggagg 248761 tcaagataaa aactggattt atggtacagc atttactttg tgataatatg tgtataaatt 248821 cttccccaag ttggaaaatg tggccgcgtc atacagggct gtatctctct tgaccattga 248881 agtgcctagg tcgctacact acaatttgac ttagagttgg tgaaacttct ggccattcac 248941 gtagccatcg tattcatcct gagaaaggga aacctaaaga cttgggagag tgttctggga 249001 ggaaagtgct attgagaatg ggctgtccaa gcaaagtgac aggcaggcct cacttagcta 249061 gaagagggag cttttcagcc ctgctccacc atcaggaatc tccgtgacct tggactactg 249121 ctttaatctc ttttatcctg gcttctaaaa gataccgtcg aattcattct ttttcaaacc 249181 tatcaaaaat ttgaaatgta tgaaaaagag ggtgcttgtg atgatagatg ggtttatgct 249241 tccttttttt tttttttttt 
tttttttttg agacggagtc tcactctgtt gcccagactg 249301 gagtgcagtg gcgcgatctc ggctcactgc aagctccgcc tccggggttc acaccattct 249361 cctgcctcag cctcccgagt agctgggact acagtgccac caccacatct ggctaatttt 249421 ttttgtattt tcagtaaaga tggggtttca ccatgttagc caggatgctc tcgatctcct 249481 gacctcatga ttccaaagtg ctgggattac aggcgtgata tgcttcttta atattagttt 249541 ctaatccaag taatatagcc atgtcctcta taaaatggaa cttttcatgg aacagtgaga 249601 aggttgcatt aggaattggg atacttgggt tcagatttca gttctgccac ttgctagttt 249661 taagtatact tctcaggatg gatgtcagtt tgttcaactg tcagtgggag tagtgatccc 249721 actaaggaag attattgtgg ttatcaaatc agcagatggg tatgaaaagg ccatgagtgc 249781 agttctcatg gttatgaagg tgtcataact ggcatagtcc catcacagta tggctttggc 249841 atcgcagatg gctgtagcat atatgctaat cagaacatcc ctaccgtcct taaagatctc 249901 atgggttctc tcccaaatca gctgctccac cgacctacaa gtaagtccat gcttctttag 249961 tatatttact cactagtgaa ggcccgtctg ctaaaagtgt cttcagtagg ggagagctgc 250021 tcttttctgg gaaacccagc aatgaagttt gtcctgtgat gccctctaca cttactgacc 250081 tgtttcctga gaacttcccg taacaaatgg tccagggaag aaggaagtct gtgaccagtc 250141 tttcctttcc tgtggctttc tggttcagaa atgtgaccct gatgataaga aacttcaagg 250201 catgactctg gtgttttggc aatattcccc tggcagtatt gtttcttggc tcacactcat 250261 ctcatgcctt gctttgtttg ctttgttttt gagaattgtt tttttgtttg ttgtttgttt 250321 ttggttttgg cttgtgagac agagccttgc tctgttgccc aggatggcgt gcagttggca 250381 caatctcggc tcactgcagc ctccacctcc cgggttcaag caattctcct gcctcagcct 250441 cctgaataat ggggattaca ggcacacacc accgtgcctg gctaattttt gtattttttt 250501 agtagagacg ggtttcacca tgttggccag gctggtcttg aactcctgac ctcaagtgaa 250561 ccacccacct cggcctccca aagtgctggg gttacaggcg tgagccacca cgcccagcca 250621 ttatgtttgt ttttctttca tgaagggata accgaggctg ttgtgtggtg gtgtcttggt 250681 acagctcgtg gctgtggtca gtgtggtgga gggtaggtct tctcagatgg taacatgcat 250741 gagagtcacc tggggatcac attaacatgc agattctgat tccctggatc tgggtgggac 250801 ctgaggttct gcatgtctaa caaactccca gagaacaccc atgtagcaag agtacagaca 250861 aaaggattca aagcgaaggg actgcaagaa ccatggaacc 
agcctcccac gattttgcgg 250921 atgacgaacc cgagccttgg ggagtttcag aggccatgga gagatgaagt ataataaaca 250981 gcatagcagt tgtatgttga gccaagaatc agacatgcac agaactgctg attggaaaat 251041 taaatctgtt tctcatagcc ctggtgtggc aaaatgccta gctggaagtg ggtgaccgct 251101 tgatttacag tgggcctcct gggtagggaa aattagtcca tcccatttac agctacttcc 251161 aatgcagctg tgtcagggga gagagaagaa aggacatgga agagagcgta gaggtatgat 251221 cttcagtcct ttcaggcagg cacataagta aatttttatt taaaagataa gaattggtca 251281 tatgcgaatc tcctgttaga taagattaaa aacaaatttc cattttgtgg agaagttgca 251341 gataaatatt tttccatggg atccaaaaga atcaaaaagg aaagaaaata gtaatagcag 251401 tggaggttcg atcctagaca gaagaaaatc cagcaagccg agggacttca tattctcact 251461 ggtgtccaaa tcagagcttt tttttttttt ctttgtttac tttaagttct gggatacatg 251521 tgctgagcat gcaggtttgt tacataggta tacatgtgcc ttggtggttt gctgcaccca 251581 tcaacccgtc atctaggttt taactccgca tgcattaggt atttgtccta atgctctccc 251641 tcccctttca ccccacccct ggacaggccc tggtgtatga tgtccccctc cctgtgtcca 251701 tgtgttctca ttgttcagct cccacttatg agtgagaaca tgagtgtttg ggtttctggt 251761 cctatgttag tttgctgagg atgatggcct ccagcttcat ccatggccct gcaaaggaca 251821 tgaactcaac cttttttttt ttcctttttt tttttttttt gtatttgcaa attcaaatct 251881 tttttttttt ttattatact ttaagtttta gggtacatgt gacattgtgc agcttagtta 251941 catatgtata catgtgccat gctggcgctg cacccactaa ctcgtcatct agcattaggt 252001 atatctccca atgctatccc tcccccctcc ccccacccca ccacagtccc cagagtgtga 252061 tattcccctt cctgtgtcca tgtgatctcc attgttcaat tcccacctat gagtgagaat 252121 atgcggtgtt tggttttttg ttcttgcgat agtttactga gaatgatggt ttccagtttc 252181 atccatgtcc ctacaaagga catgaactca tcatttttta tggctgcata gtattccatg 252241 gtgtatatgt gccacatttt cttaatccag tctatcattg ttggacattt gggttggttc 252301 caagtctttg ctattgtgaa taatgccgca ataaacatac gtgtgcatgt gtctttatag 252361 cagcatgatt tatagtcctt tgggtatata cccagtaatg ggatggctgg gtcaaatggt 252421 atttctagtt ctagatccct gaggaattgc gacactgact tccacaatgg ttgaactagt 252481 ttacagtccc accaacagtg taaaagtgtt cctatttctc cacatcctct ccagcacctg 252541 
ttgtttcctg actttttaat gatcgccatt ctaactggtg tgagatgata tctcatagtg 252601 gttttgattt gcatttctct gatggccagt gatggtgagc attttttcat gtgttttttg 252661 gctgcataaa tgtcttcttt tgagaagtgt ctgttcatgt cctttgccca ctttttgatg 252721 gggttgtttg tttttttctt gtaaatttgt ttgagttcat tgtagattct ggatattagc 252781 cctttgtcag atgagtaggt tgcgaaaatt ttctcccatg ttgtaggttg cctgttcact 252841 ctgatggtag tttcttttgc tgtgcagaag ctctttagtt taattagatc ccatttgtcc 252901 attttggctt ttgttgccat tgcttttggt gttttggacg tgaagtcctt gcccacgcct 252961 atgtcctgaa tggtaatgcc taggttttct tctagggttt ttatggtttt aggtctaacg 253021 tttaaatctt taatccatct tgaattgatt tttgtataag gtgtaaggaa gggatccagt 253081 ttcagctttc tacatatggc tagccagttt tcccagcacc atttattaaa tagggaatcc 253141 tttccccatt gcttgttttt ctcaggtttg tcaaagatca gatagttgta ggtatgcggc 253201 gttatttctg agggctctgt tctgttccat tgatctatat ctctgttttg gtaccagtac 253261 catgctgttt cggttactgt agccttgtag tatagtttga agtcaggtag tgtgatgcct 253321 ccagcttcgt tcttttggct taggattgac ttggcgatgc gggctctttt ttggttccat 253381 atgaacttta aagtagtttt ttccaattct gtgaagaaag tcattggtag cttgatgggg 253441 atggcattga atctgtaaat taccttgggc agtatggcca ttttcacgat attgattctt 253501 cctacccatg agcatggaat gttcttccat ttgtttgtgt cctcttttat ttccttgagc 253561 agttggtttg tagttctcct ttaagaggtc cttcacatcc cttgtaagtt ggattcctag 253621 gtattttatt ctctttgaag caattgtgaa tgggagttca ctcatgattt ggctctctgt 253681 ttgtctgttg ttggtgtata agaatgcttg tgatttttgt acattgattt tgtatcctga 253741 gactttgctg aagttgcgta tcagcttaag gagattttgg gctgagacaa tggggttttc 253801 tagataaaca atcatgtcgt ctgcaaacag ggacaatttg acttcctctt ttcctaattg 253861 aatacctttt atttccttct cctgcctgat tgccctggcc agaacttcca acactatgtt 253921 gaataggagc ggtgagagag ggcatccctg tcttgtgcca gttttcaaag ggaatgcttc 253981 cagtttttgc ccattcagta tgatattggc tgtgggtttg tcatagatag ctcttattat 254041 tttgaaatac gtcccatcaa tacctaattt attgagagtt tttagcatga agggttgttg 254101 aattttgtca aaggcttttt ctgcatctat tgagataatc atgtggtttt tgtctttggc 254161 tctgtttata tgctggatta 
cctttattga tttgcgtata ttgaaccaag ccttgcatcc 254221 cagggatgaa gcccacttga tcatggtgga taagcttttt gatgtgctgc tggattcggt 254281 ttgccagtat tttattgagg atttttgcat caatgttcat caaggatatt ggtctaaaat 254341 tctctttttt tggttgtgtc tctgcccggc tttggtatca gaatgatgct ggcctcataa 254401 aatgagttag ggaggattcc ctctttttct attgattgga atagtttcag aaggaatggt 254461 accagttcct ccttgtacct ctggtagaat tcggctgtga accatctggt cctggactct 254521 ttttggttgg taaactattg attattgcca caatttcagc tcctgttatt ggtctattca 254581 gagattcaac ttcttcctgg tttagtcttg ggagagtgta tgtgtcgagg aatgtatcca 254641 tttcttctag attttctagt ttatttgtgt agaggttttt gtagtattct ctgatggtag 254701 tttgtatttc tgtgggatcg gtggtgatat cccctttatc attttttatc gtgtctattt 254761 gattctcctc tttttcttta ttagtcttgc tagcggtcta tcaattttgt tgatcctttc 254821 aaaaaaccag ctcctggatt cattgatttt tttgaagggt tttttgtgtc tctatttcct 254881 tcagttctgc tctgatttta gttatttctt gccttctgct agcttttgaa tgtgtttgct 254941 cttgcttttc tagttctttt aattgtgatg ttagggtgtc aattttggat ctttcctgct 255001 ttctcttgtg ggcatttagt gctataaatt tccctctaca cactgctttg aatgcgtccc 255061 agagattctg gtatgttgtg tctttgttct cgttggtttc aaagaacatc tttatttctg 255121 ccttcatttc gttatgtacc cagtagtcat tcaggagcag gttgttcagt ttccatgtag 255181 ttgagcggct ttgagtgaga ttattaatcc tgagttctag tttgattgca ctgtggtcgg 255241 agagatagtt tgttataatt tctgttcttt tacatttgct gaggagagtt ttacttccaa 255301 ctatgtggtc aattttggaa taggtgtggt gtggtgctga aaaaaatgta tattctgttg 255361 atttggggtg gagagttctg tagatgtcta ttaggtccgc ttggtgcaga gctgagttca 255421 attcctgggt atccttgttg actttctgtc tcgttgatct gtctaatgtt gacagtgggg 255481 tgttaaagtc tcccattatt aatgtgtggg agtctaagtc tctttgtagg tcactcagga 255541 cttgctttat gaatctgggt gctcctgtat tgggtgcata tatgtttagg atagttagct 255601 cctcttgttg aattgatccc tttaccatta tgtaatggcc ttctttgtct cttttgatct 255661 ttgttggttt aacgtctgtt ttatcagaga ctaggattgc aacccctgcc tttttttgtt 255721 ttccatttgc ttggtagatc ttcctccatc cttttatttt gagcctgtgt gtgtctctgc 255781 acatgagatg ggtttcctga atacagcaca ctgatgggtc 
ttgactcttt atccaacttg 255841 ccagtctgtg tcttttaatt gcagaattta gtccatttac atttaaagtt aatactgtta 255901 tgtgtgaatt tgatcctgtc attatgatgt tagctggtga ttttgctcat tagttgatgc 255961 agtttcttcc tagtcttgat ggtctttaca ttttggcatg attttgcagc ggctggtacc 256021 ggttgttcct ttgcatgttt agtgcttcct tcaggagctc ttttagggca ggcctggtgg 256081 tgacaaaatc tctcagcatt tgcttgtctg taaagtattt tatttctcct tcacttatga 256141 agcttagttt ggctggatat gaaattctgg gttgaaaatt cttttcttta agaatgttga 256201 atattggccc ccactctctt ctggcttgta gggtttctgc cgagagatcc gctgttagtc 256261 tgatgggctt ccctttgagg gtaacccgac ctttctctct ggctgccctt aacatttttt 256321 ccttcatttc aactttggtg aatctgacaa ttacgtgtct tggagttgct cttctcgagg 256381 agtatctttg tggcgttctc tgtatttcct gaatctgaac gttggcctgc cttgctagat 256441 tggggaagtt ctcctggata atatcctgca gagtgttttc caacttggtt ccattctccg 256501 catcactttc aggtacacca atcagacgta gatttggtct tttcacatag tcccatattt 256561 cttggaggct ttgctcattt ctttttattc gtttttctct aaacttccct tctcgcttca 256621 tttcattcat ttcatcttcc attgctgata ccctttcttc cagttgatcg catcggctcc 256681 tgaggcttct gcattcttca cgtagttctc gagccttggt tttcagctcc atcagctcct 256741 ttaagcactt ctctgtattg gttattctag ttatacattc ttctaaattt ttttcaaagt 256801 tttcaacttc tttgcctttg gtttgaatgt cctcccgtag ctcagagtaa tttgatcgtc 256861 tgaagccttc ttctctcagc tcgtcaaaat cattctccat ccagctttgt tccgttgctg 256921 gtgaggaact gcgttctttg gaggaggaga ggcactctgc gttttagagt ttccagtttt 256981 tctgttctgt tttttcccca tctttgtggt tttatctact tttggtcttt gatgatggtg 257041 atgtacagat gggttttcgg tgtggatgtc ctttctgttt gttagttttc cttctaacag 257101 acaggaccct cagctgcagg tctgttggaa taccctgcag tgtgaggtgt cagtgtgccc 257161 ctgctggggg tgcctcccag ttaggctgct cgggggtcag gggtcaggga ccacttgagg 257221 aggcagtctg ccgttctcag atctccagct gcgtgctggg agaaccactg ctctcttcaa 257281 agctgtcaga cagggacatt taagtctgca gaggttactg ctgtcttttt gtttgtctgt 257341 gccctgcccc cagaggtgga gcctacagag gcaagcaggc ctccttgagc tgtggtgggc 257401 tccacccagt tcgagcttcc cggctgcttt gtttacctaa gcaagcctgg gcaatggcgg 257461 
gctcccctcc cccagcctcg ctgccgcctt gcagtttgat ctcagactgc tgtgctagca 257521 atcagcgcga ttccgtgggc gtaggaccct ccgagccagg tgtgggatat agtctcgtgg 257581 tgcgccgttt cttaagccgg tctgaaaagc gcaatattcg ggtgggagtg acccgatttt 257641 ccaggtgcgt ctgtcacccc tttctttgac tcagaaaggg aactccctga ccccttgcgc 257701 ttcccaggtg aggcaatgcc tcgccctgct tcggctcgcg cacggtgcac gcacccactg 257761 gcctgcgccc actgtctggc actccctagt gagatgaacc cggtatctca gatggaaatg 257821 cggaaatcac ccgtcttctg cgtcgctcac gctgggagct gtagaccgga gctgttccta 257881 ttcggccatc ttggctccca cctcccgaac tcatcctttt ttatggctgc atagtattcc 257941 atggtgtata tgtgccacat tttctttatc cagtctatcg ttgatggaca ttttggttgg 258001 ttccaagtct ttgctattgt aaatagtgct acagtaaaca tatgtgtgca tgtgtcttta 258061 cagtagaatg atttataatc ctttgggtat attcccagta atgggattgc tgggtcaaat 258121 ggtatttcta gttctatatc cttgcagaat ctcctcactg tcttccacaa cggttgaact 258181 aacactctca ccaacagtgt aaaagcgtgc ctatttctcc gcatcctcgc cagcatctgt 258241 tgttgccaga ctttttaatg atcaccatcc caactggtgt gagatggtat ctcattgtgg 258301 ttttgatttg catttctcta atgaccagtg atgatgagct tttttttttt ttttcatatg 258361 tttgttggcc acatacatgt cttgttttga gaagtgtctg ttcatatcct ttgcccactt 258421 tttgatgggg ttgtttttct tttttttctg gtaaatttgt ttaagttcct tgtagattct 258481 ggatattaga cctttgtcag atggatagat tcagatcaga gctttaacag ccagacttta 258541 cagggaaaag gctgagtggc acctccttta tatgaagaaa atgttcttct ccaacccact 258601 tgtgttcagt ggatggaaga agctgttcag taattataac tgacacttgt aacacataac 258661 tgattgattt aatgcccata aattttattt tctttaatcg taagatgtga ccgttaaatt 258721 cttagaagac atttggctag tggattaata attaaaagaa tattggatga attcatctgc 258781 ttcctattca gcagttcttt tatttataag gctatttttc tcaaaatgaa tgatcatgaa 258841 tatctaagaa aaaacaaaat atttagctgt ttgaagatga aattattatc ttttgatgtt 258901 gttttcttta aagtgtttac aatgataact aagtggtagg gcttcttccc tctgttactc 258961 taagtatgcc tcgtatacat cctcaaagca tttttttttt tttttttttt ggagacagag 259021 tcttactcgg ttgcctaggc tggagtgcag tggcatgatc tcagctcact gcaacctctg 259081 tctcccaggt ttaagcaatt 
ctcccgcctt agcctcccaa gtaggtggga ttacaggcgt 259141 ccgccaccat gcacggctaa tttttgtatt tttagtagag acggggtttc accaagttag 259201 ccaggctgct cttgaactcc tgacctcgtg atccacccac ctcggcctcc caaagtgcta 259261 gattacagac gtgagccacc acgcctggcc acctcgaagc atttgtaaaa gagcaaggtg 259321 atggggttca ttaggtgttt ggtatttgaa attaaatacc tctgttattg atgaagccat 259381 attttaaata ttatcaccac taaactcagt taaatatttc cactgttaac tacaaggttc 259441 aaatgcttgg atattataga agatatttga cttcaaagat aaacgctaca attttgggtc 259501 ataacaaata gcttttagaa ggggaaaaaa taagtcttgg gaggaaaata agcagtctct 259561 tagtttaata tagcggttgg caaatataga tgcttaaaag ttctgaaaat ccccattgca 259621 ggccatcaga gtgttgtaat taagtattaa attaattcag ataaaaggat aatccatcag 259681 ggccactaat gtagtggaag tgtggtttcg gtgtgattag tctcaccatg gaagtcaaag 259741 aaagtcccac tcagaatgaa caatgaaaga atcatatttg tctctgggta ttttgtcata 259801 ggctcaattg cccagtgcca gtgtgtaagc cactggtaaa atcctgagcc cttagtcatg 259861 tctacaggta gctggctgcc ttagacattt tatgatgacg aaattgaatt ccacttctga 259921 taatggtaat ccatcatgtc tacctctttt tttctcttaa aactacactg acggccaggc 259981 gcagtggctc ccgcctcatc ccagcacttt gggaggccaa ggcaggcaga tcgcctgagg 260041 tcaggagttt gagaccagcc tggccaacat ggagaaactc tgtctctact aaaaacataa 260101 aaactatccg gtcatggtag tgtgtgcctg taatcacagt actcgggact gaggcaggag 260161 aatcgcttga acccaggagg cagaggttcg ttgttctgaa tgacatagcc aggggtggtt 260221 atttctaaat tttctgtgtt gaattaagat aggaatgtgg atttcttcct attttgaaca 260281 acgaccaaaa aaaaaaaacg aaggctaagt gtgtttcact ctgctaaaaa tgtacctttt 260341 cttgaatctg tcactttttc acttacattt tctgaaaatg aatataatta accaccatgg 260401 ggctcagcat gctggtggcc atttggagga acttttctac caacagtgta acagagaaaa 260461 gagaaaaggg tactacagcg tacctcaacc tcccctgact gttctgtgaa gtacaagtgt 260521 aagatttctc actgtgatct tttgctcctc tgccttttta ctgccaaggg ctaagatggg 260581 cacagggccc tttccagctg cccctgcctg agaaatgctt atttgatgac caaaagcaat 260641 aaccattcca taggactaac ggtgattaag tcagcttaca aatccacaag ggtggaagag 260701 actgctaaag atattggaac agacatgctt aataatgcct 
aatttctaca gctacttttg 260761 tgataatacg agaggctgaa atagataggg gctgaacatg ggaaggctgg tgttatgaag 260821 cccaggactt ggatttcagc actggtgctt actggccatt gggtcttgaa gagattggtt 260881 cctttctccc agcctcagtc tccttgtccc atcagattaa taatgcctgc cttgggagag 260941 gagagtcata caggagaggg catttgagct gggctctaaa gagttcaaat aacgtagatg 261001 agcagagatg ggcttccaag atagcaggaa tagaaaacaa aagtgtcaga atgggtatgc 261061 atggtctttt gtgggaatac cccaagctat ttagcctgag agagaggtgt ggtgtagaca 261121 gctggagctg tgagttccat ggctagaaat acggggtggg cccagaccac agagggcctt 261181 cagcatctgg caaaggagtt ggaattaatt cggtagaaaa tgaagaccca ctgaaacaaa 261241 gagggagcca ttcttgtgtg tcaggaacta agatttcagg aaccaaacgt aaaataaaaa 261301 ccagcaggga gaccattgtc tcagtcggtt tgtgctgtta tcatgatacc agagactggg 261361 taatttataa acaaacaata gaaatactac aattctggag gctgagaagc ccaggatcaa 261421 gccgggtgca gtggctcacg cctgtaatcc cagcactttg ggaagccgag gcgggcggat 261481 cacgaagtca ggagatggag accatcctgg ctaacacggt gaaaccccgt ctctactaaa 261541 actacaaaaa attagccggg cgtggtggcg ggcgcctgta gtcccagcta ctcgggaggc 261601 tgaggcagga gaatggcgtc aacccgggag gcggagcttg cagtgagccg agattgcgcc 261661 actgcactcc agcctgggcg acagagcaag actccgtctc aaaaaaaaaa aaaaaaaaaa 261721 aaggcgcctt gtggctgcat cctccagagg cagcaaaaac tgttctcaca tggcagatgc 261781 acataaaggg gcctgtgctg gtttcctcta gcccttttag aaggcagtag tccattcggg 261841 ggcacggggt ggaactgtca tgacttgatt tccagaagac cccatctctc actactacca 261901 cagtggagat ttaagtttca acacatgact ttgggggaca ttcagaccat agcaagcagt 261961 gaggagacta tcgtaattat ccaggcgaaa agtaatcata ttcctagggg cggtgagggt 262021 ggtgtggaaa agcaagcagg aaatggctga gagaattcag aaatagaatg caaaaggatt 262081 tgtcggtagt gtgagtactg gacacgaaga aatacaggga gtcctggatg gctctgaagg 262141 attcaagcct gtgactggga gggagaatgg ttctgccatg attagaaaac aggaaaggtg 262201 gaagggaaag ccagggacct tgagtacagt tgtgtaccta ccaagtgcat ggatcaggag 262261 ggagccctaa gttggtatct ccagcaaact gttggaaata tacaaatttg agtgggaggg 262321 agaagtcaaa gctggggaag tacatttcag attcacccaa aggtaggaga tctttagagt 262381 
ttcaagagtc gatgggattt ccaaagggga gaattgtaga aagagatgga aagagataca 262441 agaagaaaat gacagggatg aagagaggag aaaaaaagag taaaaagagt tattcagaga 262501 tgtcacaggg aagaagataa gtactgtgta tgcgaaacag tgagaacatt tccaaataaa 262561 ggacatggta agtgatagta gataccacag acccaagaac agcgtagccc ctggggagtt 262621 caagagcttt ctcgctggca aaggttattt cctgtcaaga tgaatgtgta tgcaccaata 262681 attattagta ttccagaata tcctctcttt agcatagtaa gtgcttaaca gtcacaagca 262741 aaagatttat ataaaagaaa tgctcattta atttatttaa tcagacattg gctaagttat 262801 tagccgctac ctatggcttt gtaaaacatg aaaaagttgg aggaaatcta atgagggtct 262861 tttatctgta agggaaaggt catcatgggc atcaagtatt gaattttaga aacggtgcca 262921 cagtgaatag ggatatatta taactcagat tagacttcaa aaggactaat ttgcttggga 262981 cttccatcat ttaaaatcac ttgcaataaa actgactaca tgaaaagata ttaatcacaa 263041 gctttaatgc tattccatca aattatttta atgggtatat ttgaacagaa ttttaatttt 263101 tcagatacat tttttgtcaa tagaaaagag aatttgaaag atgattgtaa aatgttacaa 263161 atcaaattta aatcctgatc ttattttcat tttctctagc aataaaacta accagtgatg 263221 tgtctgaaat atatactaca gcaaactctt gacacagcta tgattggaaa atgagacatt 263281 ttggttaatc aaatattatt gttaatagtg cagtgctgtc tgtcggtctc taatttgtaa 263341 aaatcagaga aaaacatggt gaatctttgg tgacataaga gaagactcac aaatcttcat 263401 atcctggggt cagtttttaa agcagtaatt acacatgata caagattata ttctaagctc 263461 tctttttcca gtaagaaaag ataagtcaat gatgttccat aaagctgatt cttgtttaat 263521 caatttttaa agtggggggg taggcggcac tgacagcatt ctttggaaag ggaagcaaca 263581 gaggagaaag gcaggatatt tggtgtctgc tcacattggg aaatacgggg agccaggggt 263641 ggttaagtcc agagaaggct ggatgtctgt ctgcagggct acatctggaa gggcaaggga 263701 acagtttcct tgttgcctca gagcataggt catgcagcac aagaggagca tttaggaaat 263761 gcagattttt tttcaaggca agacagtgaa aactattcaa gaaggtagca aactactggc 263821 cctttaatgc ttttcttatc cttgcaagtg tgcaaggcaa gacaggagat tgtctgttgt 263881 ctttcagctt gaatgagatt gtgcctcact tactgccgcc acaccaaagc ctggcgttta 263941 ttcctgtgcg atcattgtat actagggacg tctaactgtg tcaggaggaa atggcagctg 264001 agcccctgtg gcctttcacc 
tctagcagtg ccacgctgca gcacattggc cggctgccat 264061 gctcctgcct gcagtcctgc cagcagctct tgtgagctgc gactttgcat aagtagcttg 264121 aatgccatgt gcctcagttt tcacatctgt aaaagggaga tgataatggt acctatgtca 264181 tggctctaaa cgcgatcatg cacgtgaaag cagttgaagt cttgcctggc agaagtaaat 264241 ggtggctgct gctgctgctg ctgttgtgat tgttgttact caccaaagag atggttttgt 264301 ttggtttaga tgagctgctt cagaaagagc aaaactattc agatgacgtc ttggccaaca 264361 tgattagtga accaaggatc agttacggaa acgatgctct catgccatct ttgaccgaaa 264421 cgaaaaccac cgtggagctc cttcccgtga atggagagtt cagcctggac gatctccagc 264481 cgtggcattc ttttggggct gactctgtgc cagccaacac agaaaacgaa ggtaagagtc 264541 ccctgagcca gcaagggcgt tctgggaggt atatatacac atacataacg tgtgtgcaag 264601 agagagagtt atcttttgta tgttcttgag tggtgaattt tttttgatgt agtcataaat 264661 attttcatta taattttact ttaaaaagta aatactagga gttactatgt cattaaaata 264721 atatatctgt gtaaattatt ctttatattt tttatgctgc aaagcatgta aactgccaat 264781 tgttggaaaa gacaacaact ccattgtata ggtggcaaaa caaaatggta atgatttaca 264841 taacatttgc aggtaaaggg cctcttgttt cgcacatgtc caggtttgca ttctagggaa 264901 tattatcctt cgagtgaaca ctcccggacg aaggggagcc acagtgtatt attgaagagg 264961 cagttcttca caaggaaaat gtgtggcaat aacattcttc taatttttat tttccagttc 265021 ctcttcttaa tataaaatgg tacattatat aacaaaagtg gatacataaa gagcatgtac 265081 tgtatatcat attattatgt ttatgacttt ttcaaatgag gaaatggagg caccaggaga 265141 ctaagttctt aagtgatgac ttttggtagg aattttttag aaatgttttt taactgtgag 265201 gtatattgtt ccctctggaa tggaagcaca aaattcaact tgtgtttatg acgtaatgac 265261 ctagaatgcc accttccaaa aaagcagata gatagtgaaa taaggaaaaa ttacagttta 265321 ccaacagaga ggagttttga gggcctgcag agatgcagat ttgacaggca ccaaagtagt 265381 cttcactggc tacttattct tacatgaaat ttcagattga attaaatgtt aaaggcaaag 265441 atagagaact acatataaat aaaatttaat tttgcttaag acttaatttg tgtttagaaa 265501 attaaattta aggcagctgc caaaattaag gagcacattg ctctaaatgt tttttccttt 265561 ggtacctcag agaagtaaga ttcttctcta tcaacaaatt tgagcagtag atggcatcct 265621 ttttcacagg gatttcttag cagaaacacc tgttcaggca 
agaatatctt tcagcttctt 265681 ttatcttttt ctacatgttc agaagtatac aattacacac aaacaccttg atttctagat 265741 acaaataaca tgttttttaa gaacacaaca gcactttacc tgattttctt tatcaaagca 265801 gcaaatgttg atggagattt catatttata atatacctta taggccaggc atcgtgggtc 265861 acgcctgtaa tcgcaggact tagggaggct gaggtgggca gatcgcaagg tcaggagatc 265921 gagaccatcc tggctaacat ggtgaaaccc cgtctgtact aaaaatacaa aaaaaagtta 265981 gccgggcgtg gtggcgggca cctgtagtcc cagctactcg ggaggctgag gcaggagaat 266041 ggcatgaacc cgggaggcgg acgtggccag tgagctgaga tcacaccact gcactccagc 266101 ctgggtgaca gagcgagact ttaaatttaa aaaaaataaa taaataaagt aaaaaatata 266161 tataccttat aacatgcaat tagtactaaa tctaaatata atattagacc ttaagtgtac 266221 tggacaagtt ttttctaaag aaccattgca gaaaatatat agactatttt cacaatgtga 266281 atcatcaccc caaatttgtt tccttaattt gaaaaagggc accttacagt aatagcaaac 266341 agcatgttac tgacaatgta aaaataatta aaactttacc atttatgaaa ataattctta 266401 atcacataga agaaaacaat tttagggaca gttcattatc aaagacctgg aaaactgtcc 266461 tgttaaatgt attgagtgca gattgttttg tttctttaca ggtagataga ttttggtcat 266521 tgaaggtcct cttcattcag tctctaatga gaatttttag tgaaaaatgt ccgatctcat 266581 tttgtaaatt gaatgcacgc taaatttgct cttttactaa cagcatacct tgttgcatca 266641 gccttaggag aaaatctgcg tgcacacgtg tgaatgtaca cacacacaca tacactgtga 266701 acccttggta attgccagga tgtccatttg agcaaagccc ttgattaagt acttctggat 266761 acttgacaaa cacgcagcta ttacattgcc taggtctcat gaaaagagct gatattcttg 266821 ttggatgatt ctgtttgaac agtagaatct ggtataacaa ttttgggagg caatagaact 266881 tgcagtgtca gagtcttgag ttgcagtact cttcattatc agtcccaaca ttgacatata 266941 tatattgatt atagcaaaca atttaaaaag ccacttaagc aatttttttt tttttttttt 267001 tttttgagac ggagtctcgc tctgtcgccc aggctggctg gagtgcagtg gcgcgatctc 267061 agctcactgc aaactctgcc tcccaggttc atgccatttt cttgcttcag cctcccgagt 267121 agctgggact ataggtaccc gccaccacgc ccggctaatt ttttgtattt ttactagaga 267181 tagggtttca ccgtgttagc caggatggtc tcgatctcct gacctcgtga tccgcccgcc 267241 tcagcctccc aaagtgctgg gattacaggc atgagccacg gcgcccggcc ccaattggtt 267301 
tttaaataaa atagacggcc atcatctcat aggatattaa atactacccc atacccctga 267361 actcctgaat gatacagacc cgagtttaaa ttccagccct gctatgtgct tctgagcaag 267421 ttacttagcc tctctggctg cttcctcgtc tgtaaaataa ggttgacaat aattcctaac 267481 tcagaggtat agtatgagca ttaaataaga aggcataaaa aagctcttag caaaattcct 267541 ggaacgtagt aaatgtcagt ggtagctttt ttttttttta atggagccag gatcttatta 267601 gagagacttt atcagcctgt ctcaagatga atcaccttat ggcctcaaca atgtatcctt 267661 aaaataatga atataataag ccatataatg agatacagtc ctaaagccgt ctctcttcta 267721 acaccgtctc ccacctctcc ccagaaactt gttagtcttt ctttgtacag acatgtgcac 267781 acacacacac atacacccaa ctactgaaaa tatctaaaat tctcaaagga tttccttttg 267841 ttcttcatgc agtcctcatt atcatagttt caaatgctta ccaaaaattt gttgtttttc 267901 ttaagaacaa tcagtgaaat ccagaggagg ttctgatgcc cattttttaa atttacattg 267961 aagaaaataa gcatatcaga gttttccaat gtgctgtgat ttgccttaac aattttttat 268021 tctttggttt tcacatccac aaaacctaaa tgaatggcta ggctgaaatg aagctgttgt 268081 tacagaattg tgctcacaga tcgtgaaatg atctggctgg aagcattagc gttgatgaca 268141 tttgtgtttg ttctactttt tcctgactgc ctgagaaggg cagccacaaa gagcagcata 268201 gcatgcagtg actgccagca agtctttgtg gccaacagtc tgcctctttg tagataatac 268261 ctagatataa tatgtagata aaataatatt tttaaagaga caaagcaggt cttataagac 268321 ttgtatggat gtatttgcaa aatgtgaagc tgccttctag tatattttta actagcttta 268381 tagaggcaat gaggaagcca atttaaatgt tgaaatttag attaaataaa tatttcatta 268441 aacttacaaa ccagttcttg tcagcctctt atctatatgt aaaactggtg tactgctgat 268501 tcctcattac ctttggcctg gtagagagtt tgtacaaggc agcacggtta atagttgagg 268561 tttattacaa agaataccag aagtaacaga ctcaaaaatc ccagaggtct gaccattctg 268621 tttctctcct cttacagcaa aggatttgat tcaaagctac tgagagcaag agaataactc 268681 atttctgctt acattttatt ttaactctat taaaagttaa agctaagata atgtatttct 268741 tgtttttaaa ttactaaaca gccaatgtaa gatgtttttt agcacttcat tgatgaagtt 268801 actgtctggt taagtaaagg tcaattggaa aaattggaaa gtatttgcag tggattggga 268861 aataacacac tactttgttt tgttggtgtg tattcttttt tttcggcaat gatatcataa 268921 ttgcctttta aaaaactttt 
aatgtctgct ccctgagcta cttatttttt gagcattaat 268981 ttatactaaa tcagcttcaa tcttttaaag cagattaggg tttatgtcac tgaaactagt 269041 tttgtttaat cttcctaaaa aatagggtta gggttttatt attgttttta aataatacat 269101 gcctatttaa aaaaatagaa gaagaagaag aagagcgggg agaggcagag aaacttatag 269161 gacaatatca tttgtaagtt tctttcacat aagtatacgc atttgaaccc tggaaaaacc 269221 atttactttg tgacacaact gaagttcaga aagaatgtga ttagccccag atcacagaga 269281 aagtaagtgg tgacattagc agtggcccgt gggactacca gctcctcagc ctgcatttaa 269341 accacaggta cacttaactt actgccccga gttgcatata cattttctaa taacgtaact 269401 attgatgtat gtgctcacca ccacttcaag gtacgcgttt gccctcgtgc atgtggtctg 269461 ttgtgttctt aactgcagtt gaagcactat tgggtccctc atcatcagga aattctcaca 269521 aagtacaatg acctttcaac taactgcatt ttacaaggag taatacagtt actcattaca 269581 ggttgctgca aataggaaaa aaaatgaagt atagaaaaac caaacctcat tgtcgaaatg 269641 tatttgggag ttgcagaagg aagtgaacta tgtcttcctc taatgcacta gtgaaatact 269701 agagaatagt cactctcgca tcagtagtta ctatggaata tttaatcata ggatatttaa 269761 tcagtgcagt tctgtagtat ctggtagggt ctcaaacagg ctttaaaggt tacttatttg 269821 ttcctcacca taattagaag atgcagatgt taacatttct gtttccatat ggagaaacag 269881 ccacagaagt ggcgcatccc aaggccacag acattgtaaa tgttgggccc aggcctggga 269941 ttagatgcct caccctcctt ttgactccac catgctgcca gtctctacat gagaaaggac 270001 caaattttta tttcatccaa atatcctagt gctcccggaa atattttctt atattttgga 270061 gacttggctt actgccactt tttttctccc tcactgtttc tattaacatg tggcatccaa 270121 aatcgatttg gactgagaat cccacctttg aatgcctggc ctgtgtgcat cagtaagctg 270181 gatggttttg ctgaaattgt tagaagccat ttcatcgagc acaccgtgtc tccttcagca 270241 cctgtgctca ctttgtctta cctttttccc tttacacttg ccaccttacc aaagcagctt 270301 ccccctgcac ctacctccgt ggcttcattt accctatgac cattattaat cctaaaaact 270361 tcacttttgt ccccattcct tgtttcttca aacgccagcg taaaacaagg aagtattttc 270421 tgagcatcag aagtgctttg gaaaggtcag atgatggtga ggaatatcat gctgtccaca 270481 ctgctttctt tgggtcggac ccagaacaca taaaaaggga aaattgtgaa caatggcaaa 270541 tataaaatga aatatcctca tttcacacgt ttcagctgta 
tgtcatctgt caaatggaag 270601 ccagcctttt tgaaaaagtg agaaaaagac tgcggtttcc ctcttctgcc ctcatactaa 270661 ggacaaattt gattcactca ctaaatggca gattatatag taaaactttg ctctcctgat 270721 aaattcatat ttgcacttcc cctggaaatt taacagagtc cttttgtagt taattaactg 270781 taaaataaga taagattggg cagttgtaaa aggaagtatc tttaaggttt tctgctaggc 270841 tgggaatatt ttatttctaa tcaaaagcca agaaactatc tctaaactag ctcagaggga 270901 aatccaccct ccccacccct cttctttgat gttgctgcgt atcctggtga gggcttttat 270961 tagaaaacta aggagcagga ctggcacatc aatagcgata agccatttaa agtggtgccc 271021 ctcctggtcc tcgatgctcg ggattaactc actgtggttc ttttctggct gctccttttt 271081 tgtaacttgt ttattgcata tgcttttttc cctgctcgac tatgtttggg agccacgact 271141 taccatcttg atttgtcttg tttgctttct gtgtcccttg cttcctgtgc ccagttgagc 271201 ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt atcagaaccg 271261 cttgacttgt gcctctccgc atcttgggcc taagctcgag cagacgttgt ctcctttctg 271321 cattttaagc ctctctcaat tgtacttact gcagagtaga atttgaagtg accttagaaa 271381 ccaaactttt aaaaattccc cttgactatt ccttactctt catttcccaa gttccgagtg 271441 aaatgatttt ttttgtttct gagctttccc tttgaaacct aaaagagagc ccttccaaat 271501 ttgggttttt ttttttcttt tttctttaaa acaaacaaaa caaaacaaaa acagagacga 271561 ttatatacag cctgtcctta aaaacagaag cccagataga tgtagactct caggtgatga 271621 cttggatacc agtgtgaaca tctctccttt cttgtttggt gtccagatgg gtaattttct 271681 aactgaactg ggctcaaact tcaatcagtt actgaagtag acatgatttc ctgaaaatac 271741 cagaattggt ttctttccag agcccaaaat acatcttctg ttttgtgggt tgtatgtact 271801 tattctcaat gagtggctaa agtaacacaa cagatgtgga tgcataattt tgcacataag 271861 cccttttttt atgtattact taaaaatccc attaacactg aaatagacac ttcttttttt 271921 acagtttcac aaacattaag aagaaacaca tttttatttt atgaataatg tcatagcaca 271981 gcagaagcat atatgtgcct ggtgtctact ctatgaacaa cctcacccaa ctgatagtag 272041 ccagtgcttt cagtgaatca aaaagaggaa gaaagttatt gaattgacaa atttgtgtat 272101 tccagggaga ttttaatccc aaataacaca tacttttcca tatttatttt cttctttatc 272161 ctcataatgt ccccaggcga tagtcaaaga acagggaatt aactttcttt gaagatatta 272221 
atacacagag gtcaaccttt gctgcctggt tcagatacag tcagcagcag accaggagac 272281 ccaccctgtc cctcacactt ccaccacagg ctcgagccat gactgcctcc aagcacggtc 272341 actgatgaag actgagagtc tgcctgtggc atcttcatgc ctttatgctg tgattaacag 272401 tggcagcaga gacagtagtg tgacacgtct tgacagctgg tctcgttgat ttatgaattt 272461 ctgtctgtag gcttttaagg ggaacttaaa ttatagtgag tcagagacag tttcagttct 272521 tctaatgtcc tccttattgt caatgtcatg taagggaatc tcaaggagcc aggagttgac 272581 attcctgttc tgctcagcca acttgtaagg cctcgggctg agggtcagtt cagttctaat 272641 tgcaaccaca gccagctcag gttggtcatt tgccttggga cttttttcat ggtgcttctc 272701 tttaaaagac atgctcttca tctacaagca taaaatgtaa cttagaaagc gctcattcga 272761 ctcattcaat tttatcaaat ttctacattg ggccaggaga gcctatatgt tactattgaa 272821 acatccagga ccaggtgaca ctaccagtca atgttttttt cttttttaaa aaaattgttt 272881 tttgattttt tttttttttt tttgagatgg agtctcgctg tgtcccccag gctggagtgg 272941 agtggcacga tctcaactca ctgcaacctc tgcctcgggg gttcaagcaa ttcttgtgcc 273001 ccagcctctc aagtagctgg gactacagat gcgccccacc atgcccagct aatttttttg 273061 tattttcagt agagacggga tttcaccatg ttggccaggc tggtctcgaa ctactgacct 273121 caggtgatcc acccacctcg gcatcccaaa gtgctgggat tacaggcata agcaccatgc 273181 ccggcctatt cttttaaaaa aaattttttt taatctaaat ctaacaaaaa ggacacctcc 273241 aaacgctgtc aaacaaaaac agtgtgtttt aaaggaagag ctagagaata gttaatggta 273301 ttacactggt gggagaaaac catccaaaac ctttctcata acagtaagaa tatgatgtca 273361 cttggggatt tcacaaagat caatccacta aattctttct aggtttgaaa gaaattaaac 273421 ggtagggtcc ctgctctcag gagcttgagg tataatagaa ggaattttgc atatttggaa 273481 acatgagatg gaatttggaa gacaaggaaa tacaaagcaa cattaagtgc taacaagaat 273541 gttctgtgat gggaggagat ggagtactaa cccaggggca tgttaagtag agaacttact 273601 ggagatgggc ttccagagag gttatgaaag atttgaggca cctggattgg caaagaagaa 273661 gggaacaatt ctattcagaa gagagggtgg cctaagccaa gcttcctttc atacacagcg 273721 taggcttggg ttgcccagta agatggctaa agttgagaaa gaatgctcag atatgaagtg 273781 aggaagattc tagttaaggt ttgtctatga atagcagtat ttactgcaca tgtattaagc 273841 catgcgtgtg ttctgttttt 
gtgcttgtag taagtttatc aaacatttgg gctaatttcc 273901 ccgtgttttg tagcatagct ttggttcttc ttcacatttt agagaccacc cacttgagtc 273961 taatgtgccg aattatatct ttagcctttc caaacttaga aagccttccc aagattgtca 274021 tgccgttcaa gtgatggcat tgtttttgaa taatcactgt tgtcctcaga aacaagttaa 274081 aaaaaggttt ttgtttttcc cccacccagt gctttataat catgactgag gaaaatggct 274141 tctccttaac tctcatatct aaaaaaacag aatatggtag cttccagcat gcagtgtcaa 274201 gtagggacca ttgtaataac agtggtgatt tgggggcata gtggccatta tagctcctcc 274261 tcctggctta cttccaaaga aaaaatagac tctgaaggac taattataca gtttgaccag 274321 aataaaaaag aaaaacacat tagcgagctc ttcacatagg gtagaaagct accctttctg 274381 ttacaattca tcaccaccct tctgcagtcc aaataaaggg aaaccaggtc cttgacccga 274441 gccatgcaag aggtgtcatg aaggacatac ctcccttagc ccccacatga acaaaaggct 274501 ctgtccactc atcatgactt acaggaagag agagctggag tcaccagggc ttagggtgaa 274561 atcagcacaa ctcaagaaag acctctaagt aagcctgcaa accaaggtcc ctgccattta 274621 tttgaaacta ctcatcacat tttattacat gcattatcac cttattatga atatttccct 274681 atgtgccagc ttactgattt tgtcttttag taagatttgc ttaatgtaaa aagtcttccc 274741 tagatgacta ttcatgctaa acaaacatga actattatag caagagctct tctgccaaga 274801 gcttctcaaa ctattgagct gtaaacatga atttatctgt catgctgact tgataatatg 274861 aatttttttt tttggaccac caaaagtatt cacagtatca tttcaaaagt agtgtgggct 274921 tgtaaagttt ttagtggaga gctcttggtt ttgttttttt tttttttttt tttttaaaac 274981 aggatttgtc tctgtttcca ggctggagaa cagtggcatg ataccagctc actgcaaccc 275041 ccatttcctg ggctaaagct atcctcccac ctaagtttcc catgcagctg ggattacagg 275101 gatgtgccac cacacctggc tcatttttgt atttttagta gagacagagt tttgccatat 275161 tggctaggct ggtctcgaac tcctgagttc aaactgtttg cccgcctcag cctcccaaag 275221 gctgggatta caggcatgag tcactgcacc tggccgagag ctcattcttt tttttttttt 275281 ttttgagatg gagtctcact ctgtcgccca ggctggagtg cagtggcacg aatctcggct 275341 cactgcaagc tctgcctcct gggttcacac cattctcctg cctaagcctc tagtagctgg 275401 gactacaggt gcctgccacc acgtccggct aattttttgt atttttagta gagacggggt 275461 ttcaccatgg tcttgatctc ttgacctcgt gatcctcccg 
cctcggcctc ccaaagtgct 275521 gggattacag gtgtgagcca ctgtgcccgg ccagagctca ttctttactt ttatttttca 275581 gtataaatat ttggctccaa aatctaaaag ggaagactct tcaaactgtt caagaaagtt 275641 ttcttaaaaa tctcagaaat atagttctaa cctattgtgt tgctagatct tttatttatt 275701 tatttatttt tggagacaga gtctcattct gtcacccagg ctggagtgca gtggcacaat 275761 ctcagctcac tgcaacctcc acctgccggg ttcaagcgat tcttatgcct cagccttctg 275821 gagtagctgg gattacaggc acccgccacc acacctggct aatttttttt tttttttttt 275881 tttttttttt tttgtatttt tagtagagac agggtttcac catgttgacc aggctggtct 275941 caaactcctg acctcaagtg atccacccag cttggcctcc caaagtgctg ggattacagg 276001 cgtgagccac cgtgcttggc acgttgccag atcttaataa cagaaaagga gtgagctcta 276061 aagaaatcac tggttatata aggatctcag atttaaaagg gctatagaaa gattaaaata 276121 tacagccagt gagaattaag agagttttag catcactacc ctataaaaat aagtgtcaga 276181 aaggttcgtt tcagcagaat gagttgaaac ttaaccaaaa aatacttaac aatattaaag 276241 taagaaaata aataagaata gagaactctg tgggtctcgt agcaagaagt tgggggttga 276301 gggttaggtc attgagagga ccagagttaa agaatgagaa agggaaactg tcccacttcc 276361 attgtgcctt ttatgtctat attaaagaga atgaacttct gatgaaagga aattaaagcc 276421 cgagatagat caacatgaga aaagaaagcc ctagcccatt agttacctat tgctacgtag 276481 caaataacgc gagaatttag cagcttaaaa caacaagcat tagtttctta cagtttttat 276541 aggtcaggat ttgggatgtg gtttagctgg atggttctgg cacagtgtcc gtcatgtggt 276601 tgcagttaag acactgcctg agctaaagtc atctgaaggt ttgattggac ctggtgggtc 276661 tgtttccaac gtggctcact cacgtcactg ttactaagag acctcagtat cttaccatgt 276721 gaactctgta atgctgcttg agcgtcccta caacatggca tctcgcaccc ctgcaccaag 276781 aggtccagga aagagattaa ggaagtttta gtgccttttt tggcccagtc tcagaagtca 276841 tacaccatca cttctatcat gctctgttca ctagagcaag tcactacatc ccgaccacat 276901 tcgaaggaag ggaatcaagt tctccctctt gaaggaagga gtatcagcaa atttgtggat 276961 atattttaaa ctatgacatc ttgctaccca agtccaagtt cttatgaccc agaccaactg 277021 caggctaaga gaacttgaag gggaagtagc accattgcct gtggtctttg tggaaagctg 277081 ggaaggtagg gataggaaaa tgaagttctg aatagcaaaa aggagagaag atagcttcac 277141 
aatctgtaag ccaggtatag agtgacattg attgtgggtc tcttttcaaa taggttatca 277201 agtggaagct ttttgaacaa ttggatgagg aaatggtgct aggtcacaac cagcgtggat 277261 ttcctaggtt ttgtcagatt atccttacct tctccttgta taggctgtgc tcttagactg 277321 gcacatcata tgctgtggac attgtgcttt tcagtgtcag caagatgtgt cacaagtttc 277381 ccagaatctc gtagacaaga ttgaaagagt gattaggcca agagcaggaa gtgaattccc 277441 ctgggagcca aaggaattgg gaatgtgtag cccaagtaag acaagaacca gcaggaacat 277501 gcctctcctt agggtcgtga tacctgttca aggttttaat gtggaaggga ggattaggct 277561 tgctctgtgt tgaatcaggc tcaaaggatg gaagttacag ggaagctgat tctggcttca 277621 tgtaaaaaaa ggacagtttg ggcaggcaaa tctatcaaaa aatggaggga aattgataca 277681 ttcctctatg ttcaaacagg aactgacaat ctgcccctgg gtgggaacac ggtagagaag 277741 atgacttcaa aagccctttt catcctaaaa ttctgatgtt tgataattaa atgttatagc 277801 atggacactg acatttacat tttttactta tgtttttggt ttttaaatga ctctgcattt 277861 tgttttaagc ttcaaattat tatttgaata atgaaattca tcagaacaat tagtgttaag 277921 aatcatatag caatttatag aaaaggaaga gttcgtaggt tataaattct gttagttgct 277981 aagaagcatt tttaaaatta tgtactatag ctctttattc agcagacgaa ccaattacaa 278041 tctgtgtaac tagaacactt gactaaaatt atataatttt tacaacgctt cactgcatag 278101 atacatgaac ataatttatt tgtaattgga acaaagcccc aaagtagcag ttttgttcta 278161 ccaggtaatt aatgctcatt tttaaagcct tttattatta tttctgaagt aatgagtgca 278221 catggaaaaa gacacataat aggctaaaca ataagcccgt aagccaagcc aacatattcc 278281 aggaacaaat ccttgccaac ctctcaacca ggatttaact tctgcttttc ccccattttc 278341 aaaaattata gcatgtattt aaaggcagca gaagccttac tttcaggttt cccttaccct 278401 ttcatttctt tttgttcaaa ataggtagta attgaagttt taaatatagg gtatcatttt 278461 tctttaagag tcatttatca attttcttct aacttcaggc ctagaaagaa gttttgggta 278521 ggctttgtct tacagtgtta ttatttatga gtaaaactaa ttggttgtcc tgcatacttt 278581 aattatgatg taatacaggt tctgggttga caaatatcaa gacggaggag atctctgaag 278641 tgaagatgga tgcagaattc cgacatgact caggatatga agttcatcat caaaaattgg 278701 tacgtaaaat aatttacctc tttccactac tgtttgtctt gccaaatgac ctattaactc 278761 tggttcatcc tgtgctagaa 
atcaaattaa ggaaaagata aaaatacaat gcttgcctat 278821 aggattacca tgaaaacatg aagaaaataa ataggctagg ctgagcgcag tggctcaagc 278881 ctgtaatccc agcactttgg gaggccaagg cgggtggatc acgaggtcag aaattcgaga 278941 ccagcctggc caatatggtg aaaccccatc tctactaaaa atacaaaaaa gattagctgg 279001 gtgtggtggc aaacacctgt agtcccagct gctggggagg ctgacgcagg agacttgctt 279061 gaacccagga ggtggaggtt gcagtgagct gagatcgtgc ctaggcgaca gagcgagact 279121 ccatcccaaa aaaaaaaaag aaaagaaaga ggctgtatgt atagttcttt cagactacaa 279181 ggcagcaaag ttcgtgcatg actcgggact taaagtggaa ttaatttcaa tatagcagcc 279241 actttgactt ccactgtgtt ttctgggaaa ataggtttac aataggttta tttgaaggat 279301 caaacacatg catacactgc ttggttttac agaacacttt atgtggctta aattcacatc 279361 cggaactgtc ttcctttacc cattcatttc tcccccagct ctttcttttc attccctccc 279421 ctacctccca tgatttaact tctcttgcaa gagtaagatc atggagtgag caggacccca 279481 tgatgttccc gatagtgtta ttcatcaaaa ggtttgtgca aagaagacag cagcttcctt 279541 ttcagatgaa atcacttttc ccccctaatg ttagaattgg agtaaatcaa aaagccacat 279601 ctcctttgtg gtcagctcta gtagttatat aaaatccttt accaaaagct tagaaatgga 279661 gataaatcaa atcgtggatt atgttagggt tccatcttat cagtaggtgc agtaagaggg 279721 ttaaattaat gaagacgaca attttatcac attcagtggt ggacagaaaa atggtaagaa 279781 aatttccata gcaataatac ttaaagttat ctcaggcact tcttttgttt tgttttgtgt 279841 gtgtgtgtgt gtgagtgtta cttttttcca agcagaaaat gtcttttcaa tattcataaa 279901 gttgataaat cctagtatta atctctaaaa gaaacacctc caaattatta tttatgcctt 279961 acttgactcc aaataattgt agcaaataaa aaactgactt gggatttgga tttgcattct 280021 taactcccat agttcttttt ttgtagaaag acatttagct ttttgaagca tggttttcat 280081 tggcaagata atctagtatc agttgttata agatcagggt ttctcttgat gaggctgttg 280141 ctgaagaggt taataaaaac tggggaacca ctaaagagtt gaagagttgg tggggtagaa 280201 agctgacgat taatgtacag atttgcattt gtcggggcct ggggcctgtg tcatataagc 280261 ccatccccac aattacacta acgcctataa tgcgacagtg actaatggca gcggcagcaa 280321 ttaggagaat cagctccctc tactggacta gttaagataa tgtattataa ttagtgcaat 280381 gaatattaca aaattacagt attttcttaa aggcacaggc 
atatgtccag acttgtattt 280441 attcctgatt acctcacact agtatattag ctaattaatg atttgctttt cataaaaatg 280501 ttgagctagc atatttgttt agtaaaggga ataattatga acaatttctc attttgttat 280561 aaaccacgag taaaacactt ttagaagttg cttcatttgc tatattttat attgcctttc 280621 cagattgcta gtatgttagt ttcagcttag aaaaatcagt catttgacta ccttgaggct 280681 aaattgaaag aattttttag ggaggtacag gcataccttg gaaatttcag aacaccacaa 280741 taaaacaaat tttgcaacaa aatgtcacac aaattttttg gcttcccagt acatataaaa 280801 gttatgttgg ccaggcacgg tggctcatgc ctgtaatccc agcactttgg gaggccaagg 280861 caggtggatc acaaggtcag gaggtcgaga gcatcctggc caacatggtg aaatgccatc 280921 tctactaaaa atacaaaaat tagctgcata tggtggcatg tgcctgtagt cccagctact 280981 cgggaggctg aggcaagaga atctcttgat cccaggaggc agaggttgca gtgagccgag 281041 atcgcactcc agcctgggca acaagagcaa gactctgtct caaaaaaaaa aaaaagtttg 281101 tatttacact atactctagt ctgttacatg tgcaatagca ttatgtctaa aaaaaaaaaa 281161 agccatgtat atactttact tttattagta aaatatttta ttactaaaaa tgctaatgat 281221 catctgagtc ttcagcaagt cctaatcttt ttgctcgtgg agaatcttgc cttgatgttc 281281 actgctgcag gctaatccag gtggtggctg ctgaggtttg tggtggctgt ggcaatttcc 281341 taaaataaga caataatgaa gtttgctgca tcgattgact cttcctttca caaaaagatg 281401 tctctgtagc atgtgatgct gtttgatagc attttgctca cagtagaaca gctttcaaaa 281461 ttggagtcag tcttcgcaaa tctagccact gcttcatcga gtttatgtga tatcctaaat 281521 cctttgttgt catttcaaca gtgttcatag catcttcacc aagagtagat tccattgcaa 281581 gaaaccactt tctttgctca tccataagag gtaactcctc atccattcaa gtttgatcat 281641 gagatggtag cagttcagtc atgtctttag gctccacttc taattctagt tctcttgtta 281701 tttccaccac atctgctgtg acatccttta ctgaagtctt gaactcctca aagtcaccca 281761 tgagagttgg aatcattgtc ttccaaactc ccgttattgc tgatgttttg atctcccatg 281821 gatcacagat gttcttaatg gcatctggaa tggtgaatcc tttttagaag gtttgcagtt 281881 tactttgccc agatccatta gaggaatcgc tatctatggc agtgaatagc cctacaaaat 281941 atatttgaaa tatattttga aataataaga ggctgaaata ataagacttg aaagttgatc 282001 cacaggctgc agaatggatg ttgtgttagc aggcatgaaa acaacagtaa tctccctgta 282061 
cctgtccttc agagctcttg ggtgactagg tgctttgtta gtgagcagta atattttgaa 282121 atgagtcttt tctgagcagt aggtctcaac agtgggttta aaatattcag taaaccatgc 282181 tgttaagaga tgtgctgtca tcccagtttt gttccattta tagagcatgg gcacagtaga 282241 tttagaataa ttcttaagcg ccctaggact tttacaatgg tcagtcttgg cttcaaaaac 282301 aaagtcgcca gctgcattag cccctaacaa gagagtcagc ttgttctttg aagctttgaa 282361 ggcaggactt ctctctagtg atgaaagtcc tagatggcat tttcttccaa ataagtctgt 282421 ttcatctcca ctgaaaatct gttatttagt gtagccacct tcatcaggag tcttatctag 282481 atcttctgga taacttattg cagcttctac atcagcactt gctgcttcac cttgcacttc 282541 tgtgttatgg agatggcttc ttcccttaaa cctcatgaaa tcaacctctc ccaacttttt 282601 tttcttctgc agcttcttca cctctctcat ccttcataga attgaagaca gttgtgggcc 282661 ttgttcttga ttaggctttg gcctaagaga atgttgtggt tggtttgatc ttccatccag 282721 acaactaaaa gttttctccg tatcagcaat aaggctgttt tgctttatca tttgtgtgct 282781 cactggagta gcacttttaa tctccttcaa gagcttttct ttgcattcac aacttggcta 282841 actggcacaa gaatcttggc tttcaatatg ccttgcccac taagcttaat tatttctagc 282901 ttttgcttta aagtgagaga cctgtgactc ttcctttcac ttgaatactt aagaggccac 282961 tgtagggtta tttactggcc cactttcaat attgttgtct ctcaagggat agggagcctg 283021 ggagttgagg gggtggagtg gccagtttgg ggggcagaca gaacacacac aacatttatt 283081 gatttagttt gctgtcttta tgggcatgtt tcatgacact ccaaaacaat taaaatagta 283141 acatcaaagg ccactgatca cagatcacca aacagatata ataattttta atgtttgaat 283201 atttaaataa ttaccaaaat gtgacacaga catgaagtga acctgtgttc actgttggaa 283261 aaatggcacc aatagacttg ttcgatgcag ggttaccaga aacctttttg tttttttttt 283321 ttaaatacta tctgtaagca ctactgtaca ataaagctat gtatgcctgt attttcatta 283381 agttgcagag caaacgtggt aatatttagc tttagtttta cttcatctgg cataagatca 283441 actccttata taacaagatg ataaaagttg tggtgtgctt ctataatttc attcaagtag 283501 ataaagttga aaaataatga cttgctttta taaacagtat gaagcaatgt agtgcagtaa 283561 atgaaatttt attccttctt tacaatgttc tcaaaattat ttttatgttt aatccaaata 283621 aagagcaaga ataaagcaac atttcagatt ttggtttctg gagacaatag ttagaaagca 283681 tgagttatga gtgacttaaa 
attcttgttg cctgtacttc actttgaaat aacattatgc 283741 tttaaaaagc attacactgc taaaggttaa ttagaattct gcagaattac tatagctaaa 283801 agtaggtaac aagatatctt tttttctatt gtttaactcc tttgtttcag aatgcctatt 283861 cctgtgcatt aaaagtgtcc ctccaaggaa attaggacat ctgcagagtt gaaaaacacc 283921 taagtctcag tcacttagag tcacacatca gggctcagag tgctatgact aggaaaatgc 283981 tgacctcctt tcattagtat gatcgtgcct ttccagcttt tgatagatcc aagcgctatc 284041 ttcccaccac tcaccaaatg ttccacctgt caaagggttt caggtccctg cagacttcgg 284101 ttttgacctg tggggaaagt agacttcctc gaactgggga agccacatgt tgtacatcct 284161 tctataaact atgattatca ttcttagtag gaaaatatgt gatttctttt tttttttttt 284221 tttttaaagt aagcatcaaa tatttgacca accagttggg cagagaatat actgaaactt 284281 tttatataac ctcatccaaa tgtcccctgc atttaagaaa tgaaattctt ctaattgcgt 284341 ttataaattg taaattatat tgcatttaga aattaaaatt ctttttctta atttgttttc 284401 aaggtgttct ttgcagaaga tgtgggttca aacaaaggtg caatcattgg actcatggtg 284461 ggcggtgttg tcatagcgac agtgatcgtc atcaccttgg tgatgctgaa gaagaaacag 284521 tacacatcca ttcatcatgg tgtggtggag gtaggtaaac ttgactgcat gtttccaagt 284581 gggaattaag actatgagag aattaggctt agctttttgc taagaactag ctaagtatct 284641 cttttaaaaa acgaatcagt gtgcttccat gatgcttggg ttacagttgt tctttcttgt 284701 tttggttttc attcattgca acttaccgtg aatattctgc tcaaggtatt gagagtgtgt 284761 gttgttatct caacttacaa tttgtgttga agttatcaaa ataatacaaa tgataatgca 284821 tgactttaaa aaagcatgat tactcttaga cttttttttt tcaaacacaa gcaacaaaaa 284881 ccagtacaag aagttaaaaa taaaaataga agtaggaagt tgctttttcc aaaaaaacta 284941 aacatgttta ttcttggatt tcttttgtct taaatagtct ggcctctcaa tacaaccagg 285001 taatagtaaa taatgtgact tttcagtgca gactcttacc tttaaagttt ctcaggaaat 285061 cacactatac caaatggccc accaaaacat acatgattgt tatttgagag ttctgcaacc 285121 tattaccctg cacttccacc tgaaatatct tagtttcaag gtgctatact aaaacaagaa 285181 agagagaaag aaaatttacg aataagttat ttttatgcat ttgaaatgcc atttggatgg 285241 gggaaaaagt gttaaaatct cacagtatat gcaacttttt aggaaactgt cttgcagaga 285301 gtatagtgta aacataactt ttatatgcac agggaaactg 
aaaaatttgt gtgacattta 285361 attgcgatat tttaagagtt atggaatttt tagagctgaa agatttattg atgatgacca 285421 atcaaattga tcttctcatc ttgtagatga gggaactgag gccgagacag aagggttcaa 285481 attatgcaag gtcaagctca gagtccagat ttcccctcat attccccatt gctttttttt 285541 tttttttttg agacagtctc gcgctgttgc ccaggctgga tcttggctca ctgcaacctc 285601 tgcctcctgg gttcaagcaa ttctcttgtc tcagcttcct gagtagctgg gactatgggc 285661 acacgccacc atgcccggct aatttttgtt tttagtagag acagggtttc accatattgg 285721 tcaggctggt ctcgaactcc tgacctcagg tgatccaccc gccttggctt cccaaagtgc 285781 tgggattaca ggcgtgagcc accgcgtctg gcccccattg ctttttccat cagagaaaag 285841 tccaggtgtt tctagggagt tgaagattcc actatcccca aggaaatggg ctctaaacca 285901 ttcttgatta gaagaggaca ggcagagttt aaataaatga aatttgtggt gaagccagaa 285961 atttcatttt gctcagcaat taaaacctct gcagactaca acagaagctg ctgatggaaa 286021 ttggggtgga gcctcatttc cccattcata ctgacgtgga tgagcagtgt gaggaaggaa 286081 accaagagaa cagcaggcag cagggtggaa aggaacggac tgacccatag atcagagtgt 286141 gagcacataa aagcaatgac cccactgtca tgtttggcca ttgcaccttt tgtgatagta 286201 tctaagctta tgtccctcaa aacagtgtaa gataggtttt cattagtatc agcttatttc 286261 atttctgctc aaattaattt ttaaaaaaat ttaattccat atttgagcaa ataaagatgt 286321 cttcatactt gagtaattca agaccccttt tttgctgctg ttgttgtttt ggtttttttc 286381 atggttttgt tgttttcttt tgttgttgtt gtttctttgt tttttgagat ggagtcttgc 286441 tctttcaccc aggctggagt gcagtggcat gatctccgct cactgcaacc tccacctccc 286501 aggttcaagc gattctccta cctcactctc agcctcctga gtaactggga ttacaggggc 286561 ataccaccac gcctggctaa tttttgtatt tttagtagag acagggtttt gccatgttgg 286621 ccaggctggt cttgaattcc tggcctcaag tgatccgccc tccccaacct cccaaggtgc 286681 tgggattaca ggcatgagcc accatgcctg gccttcaaga gcacttttta gtttatttaa 286741 tataattctc tttgttgccg gtctgtccgt tgaagttgcc ttctttgatg ccagaggcca 286801 aagacttggc ttcctcgggc ttccataaca aagtaacaca gatgggtgga ttaagcaaca 286861 gaaatttacc ctctcacatt tctggagtgt aagatcaagg tatcagccag gttggttcct 286921 ccccaggcat ctctccttgg tttgcagatg cctgtctgct ccctgtatct tcacatgacc 286981 
ttcactctgt ttttctgctg tgtcctaacc tcttcctaca aggacaccag tcatagtgga 287041 ttaggatcaa ccctgatgac cttgtttaac tttaataacc tcttcgaccc tatctgcaaa 287101 tacagtcaca ggtggaggta ctggcagctt aggacttcaa catatgaatt ctagggggac 287161 acgttttagc ccgtaacaga ccccagtgat gctgagcata aaagcaaggc ctccacctgg 287221 agctccttcc cctggctgtg cgtgagacca ggtcctacct cagacctaag gaaaaagaat 287281 ccccaaatag gaggctgcga atctgagtgt tgaaaatttc cccaaatgaa tgagatgtgc 287341 agtcggattt gagaatccct gacacatcat tttaacactc cttcctttta gttctccgcg 287401 caagcctcag ggagtggaaa atggtatctc tcacctccca gataaccggt tgccttttaa 287461 gtgctcttcc agctgcctga attaaaacgc tcttttttta gtcagtttac caagcagtgt 287521 ctcattggac ttaaaatgtc agaggagaag ttgacacttt gacccagatt ctgacattaa 287581 tttttaagcc aggaggtatc cttgcaggag caaaatcggg aaggatggct gttcagctct 287641 tctctcctca ttttgctgtc tactggagag caactggaag cctttctaga tgaggggctc 287701 cacctcattc gtgtttgttt tgtttatttc ctctcagctc cttcccctca agacctttgt 287761 gactgtcctt cctgaaagca gccctctgtg cagccccata gtgtgtctgt acagcaatgt 287821 tgaaatagac cacattccgc ttttctgtcc tcaaaaaaga tgtagtttag ctgggatggt 287881 agaacctgaa cacagggagt tgcaacccag cctagggatg ttcccaggtg agagagcacg 287941 gtgggctttc agagcaccat ttttcttagc attccagcgt ctgcacccac agaagtgatg 288001 cttctgcaac atcctcttcc taccttaaca ggcactctca ccctcaagta catggaaata 288061 cctttatttc cggctctggc taaccttcat taatcagtaa ccctacttca gtaagaacaa 288121 aaatgagtac atcaatattt ccagttggac tcacctcagt cactcaggac agaataccat 288181 tgtacttaca taccatcgtg cctgaagaag tcatgatgaa agctttggtt tccaggagat 288241 tttttgtttt tagagtgaaa aatgcccgca gaagatcaca gagataggaa ccctaggaaa 288301 aaaactcctg agaacaaatg tataaatgaa acaagcctag aacagtcaca taccacttca 288361 cactggagag aacgttctga gaaatgtggc atggggcggt ttgatcgctg tgcaaacatc 288421 ataaagtaca tgtacataaa tctagatggt atagcctgct acacacctgg gctacacacc 288481 tacaactatt gctcctaggc tccacacctg tgcagcatat tgctgcactg aatactatag 288541 acagttgtaa cacactggca gggatttgcg tatctaaaca aactagaaaa ggtacagtaa 288601 aaaatacggt attatggggc 
caccatcata tgtttggtct gtcgtggact aaaatgtcat 288661 tatgtggccc ataactgtat ttttattggt agagttgtta gcatttggaa atgagcactt 288721 gctagttgtt gttttttttt tttttttttt ttttcaaata agcaggaaat attgacttac 288781 tgaatttact tagccataag gccagagagt cactctgtga gccattggcc acgctttccc 288841 aaggaggtaa ctgtgcctta ttggtgcttc aagcagtgaa gcaatagccc tttattatta 288901 ccactcccca aatgtgccct ccatgtcaga atagccctct ggtggctgga aatctcagca 288961 ctagctgaat taaccacctt gctcctgttt taggacgttc ctgcaaagcc cactaatccc 289021 aggatctgtt tagtagcaag cggtactttt ctacattcac atacatagac actggatatt 289081 taaaaatacg tattgtagct ctttctgcta ctctccccag acatgtcacc tttaataagg 289141 agaagttggg ggcacagctt ctctttatta taggatgtga aaaagttttc ctctcacaga 289201 tctgatcact tgaacaccag aagtctcccc gcccccacct cctggttatc tcattaatga 289261 tctaaataag aagagagttc cttttatttg gcttttatgg aggtatggca tatgactttt 289321 tctaagtaat atgctgtcac actggtgact gtataaaaag atgtccttag aggtcctttc 289381 ttcccggcca gaagctgacc caccaggata cagacgcact gacgttggca ttgtagagta 289441 gttggagaga gtggtcaggt tgaatggaaa gccacgataa ggaaaagtga gggggggtgc 289501 agaacctagg agtggcttac agtgagcagc aggaagagaa attggaagat caaaggaaag 289561 catcaaaaat gatcacaggc caggcgtggt ggctcacacc tgtaatccca gcatttgggg 289621 aggccaaggc aggtggatca cttgaggcta ggagttcgag tccagcctgg ccaacatggt 289681 gagaccctgt ctctaataaa aatacaaaaa attagccagg cgtggtggca tgcgcctgta 289741 gtcgcagcta ctcaggaggc tgaggcagga gaatcgcttg aacccaggag atggaggctg 289801 cagtgagcca aaatcacgcc actgcactcc agcctgggtg acagagcaag actgtcttaa 289861 aaaaaaaaaa aattatgaag ggatagcaaa ggagttcctg aacctgcctg aagaattgct 289921 tgatgttggc cataaacata gtcaactgcc ttctagttat atataactat agagagatgg 289981 catcaaaatt aggaaaacaa gcatctatac aactaggtag caggcagtta ctgggaggtt 290041 acgagggata agtcactagc ttgaaaactt cctattttct ttgcttttat tttttttttc 290101 cgcagacttg tggctccttt gatgtgtcac actattgttg tattgttcgc cttagttccc 290161 ttaccctcct ttactcgtct ggtccatctc aatgacagga ttctgtcgtt tggcctgact 290221 cagtcccttg tataaaagga catctattaa tccacatcct 
gctggttact gtacataata 290281 aaatgtaaac gtggtgtttc ctgcctctcc tccaagctca cacacccgca tgtaaatact 290341 ctaggggcaa taggaggaga cctagaaact aaggtcaccc aaatagaatc ttgtccctgc 290401 aggcggatag ctgagctcta aattactctg gatacagtga aggactaaat ggttccaatg 290461 aatcaagttc agggagaaaa gggtcagggg gtatggaggt tagccaaaga gtgtcaaacg 290521 ttctcctcaa agctatttta gaacaaggct ggtcaggcta tttgggctca gctcttttca 290581 gcgtgtgtca taggcagcag ccagttgaag ccatgccacc atgcagtgca ggggtccagg 290641 tctcaggaag gtgcagcgta ccgagagtca catgaagacc tcagatcatt tcctgctgct 290701 gtgtaatcca ggcagtctgg gaagagaatg aatattcagt ggttgtatgt atgtgtgtct 290761 tcatcccagc aagtgacaga tgttcctgcc cagttggaaa atggaaattc tgacttcatt 290821 gtcttatcat tccttccaga acaagcacca tgggctcaaa ccctccctgg ctctgcttta 290881 agggaaataa tattgtgtct gtgcagaggc cctgtctgca cagggtcgtg gtagggctgg 290941 ccttcacatg gtggcccatg ccactgctgt agcttagcca tgaggaagag attggtgagc 291001 ctgctggaga ggaaggctac ccttacttgg tgctcctcct ttatccatag acatgagtac 291061 tcagattttc gcatctagct ttcaaggggc tggagattaa agatccaatc catactcctt 291121 ggatgccctt ttgtcaagat aatcccttct tctgtagctc ctaagccctc taaggtctga 291181 gggccaagct gccccatggc aggaatggca aaatgtagct gtcctctcag tggcaaaagg 291241 taataggatt atcaagtagc aaaacagaag ccaagaaaag acaggagagt tagaggtctg 291301 gattgtgcac cagccttgga gcctcgcatc tctagcaggg cactcttaag cgttaggctt 291361 tggagtttga agaggagata aacctatcaa attgccaggt gttcccctgc ccttgatgca 291421 tgcaaatgat tgttgacagc cagtgtgatg gtagatgtgc ctcctggaag aaataaaagt 291481 ataatattcc aacaatcaag cagcattgtg gcttttttct acaatcagaa acatagcagc 291541 tttgtgcttg atctcctaag caagataatt ttttttaacc tgggcattct ttgagttctc 291601 agtttcgagg tgatggggag caaaaatgat gcccaagttc aaatggcgta ctgctgcaga 291661 taggctgtct cctggcacac agatttacag agtgaatgaa atctctcatt cattggtgct 291721 aatctttaac catggttttt aacttacaag ttggaggcta attacacagt taccagttaa 291781 agccaggtga cctcatttgt tgacaacagt gcacagcaaa ggaaagagga gaggataagg 291841 aaaatatctg tgatttgctc ccagaaggac agtgaagctg agcatccatt tgagcagagt 291901 
cctctgagta ctggccttgg tcagagcttg ttggtgggag tgtgctaagg ggagatgcct 291961 aggtctgtca ttggagtaag ctgagggttt ggatgcaggg gaactcaggc agctctgagc 292021 cacagggtgg tagatgggag ggattcacag tggcgcacag tctcttccat aaatggaaac 292081 gtaagaatcg gggcacatgg ccatggcggc tcatgcctgt agtcctagtg ctactttagg 292141 aggccgaggt aggcggatca cctgaggtca gtttgagacc agcatggcca acatggtgaa 292201 accccatctc tactaaaaat acaaaaatta gccaggcgta gtggcgggtg cctgtaatcc 292261 cagctactcg ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt 292321 gagccgagat tgcctgggca acaagagcaa gactccaact caaaaaaaat gggcacataa 292381 tggatggaac attctgtccc aagggatgag ggagctgtat atctacctat tccagctgcc 292441 ttgggattag ccactggtag tcattccacc tcactcccag actgaaaaga gtcaatcact 292501 tcatgaagtg gggctatccg aagggaccac aggggacctg attatactga cagatttatt 292561 tttatttatt tacttttttt gagacagagt ctccctctgt tgcccaggcc ggagtgcagt 292621 ggcacgatct tggctcactg cagcctcagc ctcctgggtt caagcagttc ttctgcctca 292681 gcctctggag tagctgggac tacaggcacg tgctactgtg cctggctaaa tttttttgta 292741 tttttgttaa agatggggtt tcactatgtt ggccaggcgt gtctcaaact cccgacctcg 292801 tgatccaccc gccttggcct cccaaaaggg ctgggatttt gagccacctc acctggcctt 292861 ttttttcttg gagatggagt ctcactctgt ctcccaggct ggagtgcagt ggcacaatct 292921 tggctcactg caaactccac cccccaagtt caagccattc tcctgcctcc gcctgtggag 292981 cagctgggat tacaggcgtg caccaccacg cctggctaat ttttgtattt ttagtagaga 293041 caaggtttca ccatgttggc caggctggtc tcaaactcct gacctcaagt gacgcaccca 293101 cctcaggctc ccaaagtgct gggattacag gagtgagcca ctgcacccgg ccatattgac 293161 agatctttta gtctgaattg ttatcagcca gtgtggtcta gtcatgacat atctagctcc 293221 atctcgagac ctcatgtctt actgctgaca ctacatcctc gctggtactt tttcttcctt 293281 ggcttcaggg gcattgctgt gtaggctcag atccatgcca gatggggtgt gtagggggaa 293341 aagctgcata agggaggaat gtcacttggc aacactcaac aagggggaat gggcctgcag 293401 tattctccca gagatgaatg agcagcccat tctaggaaat atccctttga aggaatatct 293461 tctctatggt acatccaaac ttttctctgg aaaacatgcc tcttcattct gatgactttc 293521 tatttattct cttcatctta 
ccatggtttt acaaaagagg aattttaaca tactcattca 293581 gcagagttaa ggacctcctc tcttctgcga acggggagag gtataacaaa tatagtaaaa 293641 caagaacagc cactgcccac gttgtagggg taagtaccct gcccaacctg cagacagtaa 293701 atgccacaca cagctctaac agtgacaggg ggagaaaaac gtcttcagtt actggcctga 293761 ccttagaggt aaagcaacat attaacagag agggtctgat tttaaacttt gagctgaaca 293821 tacttctata gtatcttaat tctcagtttt taaaaaatta catcatctag aatggcgtcg 293881 tccttgctct tccttttttc ttgtgccaaa taatagcccc ctttggtata tgttattgtt 293941 ttaattgatt ctcaaacctc tccttaacat tttaacaaac ataaaatttg atagaaatgg 294001 gctagactaa attcagcagt ccaggaaatc gagtattttt ctacacagat ttgagtgtca 294061 cccaagatac agtatacttt gctgaatctc cctccatgat actcttgtgg ttttatctat 294121 tttaccaaaa cacttccacg taaaacaggc ttccgttgac tctgtttgtc ctttatggac 294181 ccaacagggg aggaaaaaat accattgcac tgggtttatc cacttactgt ggagttgggg 294241 aaccagaaac aaccactgga cacgttgcaa gcgcgtcaag gtttccaccg ccacctggtg 294301 gctagatgga gcacagcatc tgtgtaactg gtgggcaaag gcgttctgct ccaagatgtc 294361 aaagtggggg agaaaattat gggtgttctg caatcttggc aaaagaagag tactgtactc 294421 cttatctttt tactgcttct ccatgttcac ccttaaaaga ttgattttta ttttttactc 294481 agctctcctc ttgtttttca ggttgacgcc gctgtcaccc cagaggagcg ccacctgtcc 294541 aagatgcagc agaacggcta cgaaaatcca acctacaagt tctttgagca gatgcagaac 294601 tagacccccg ccacagcagc ctctgaagtt ggacagcaaa accattgctt cactacccat 294661 cggtgtccat ttatagaata atgtgggaag aaacaaaccc gttttatgat ttactcatta 294721 tcgccttttg acagctgtgc tgtaacacaa gtagatgcct gaacttgaat taatccacac 294781 atcagtaatg tattctatct ctctttacat tttggtctct atactacatt attaatgggt 294841 tttgtgtact gtaaagaatt tagctgtatc aaactagtgc atgaatagat tctctcctga 294901 ttatttatca catagcccct tagccagttg tatattattc ttgtggtttg tgacccaatt 294961 aagtcctact ttacatatgc tttaagaatc gatgggggat gcttcatgtg aacgtgggag 295021 ttcagctgct tctcttgcct aagtattcct ttcctgatca ctatgcattt taaagttaaa 295081 catttttaag tatttcagat gctttagaga gatttttttt ccatgactgc attttactgt 295141 acagattgct gcttctgcta tatttgtgat ataggaatta 
agaggataca cacgtttgtt 295201 tcttcgtgcc tgttttatgt gcacacatta ggcattgaga cttcaagctt ttcttttttt 295261 gtccacgtat ctttgggtct ttgataaaga aaagaatccc tgttcattgt aagcactttt 295321 acggggcggg tggggagggg tgctctgctg gtcttcaatt accaagaatt ctccaaaaca 295381 attttctgca ggatgattgt acagaatcat tgcttatgac atgatcgctt tctacactgt 295441 attacataaa taaattaaat aaaataaccc cgggcaagac ttttctttga aggatgacta 295501 cagacattaa ataatcgaag taattttggg tggggagaag aggcagattc aattttcttt 295561 aaccagtctg aagtttcatt tatgatacaa aagaagatga aaatggaagt ggcaatataa 295621 ggggatgagg aaggcatgcc tggacaaacc cttcttttaa gatgtgtctt caatttgtat 295681 aaaatggtgt tttcatgtaa ataaatacat tcttggagga gcaccattgt gctggtgtga 295741 atgattccat agtaacaatc ttgaccattt actgacgtac agaccagtga gaagtcttcg 295801 catgttgggt acccacacct gttgtgtctt aattgcaagt ctgagtagga agttggggcc 295861 aacatgtgtc tcccagtgct gggaaaatat ttcatagacc taatttacag tctttacttg 295921 atctaaaaca ttttgctgcc atattttggc cctcaagttt gtcccaaatg agagacaaag 295981 ggaaaagttc cagggaaata aaaattaaga cagctgatta tctgtaaagc atggtttctc 296041 atcctgaacg ctactaacat tttgcaggga ataattcctt gttgaaggga gttgtcctga 296101 ccagtgtagg atatttattt attttattta tgttttttga gagtctcgct ctgtcaccca 296161 ggctggagtg cagtggcaca atctcggctc actgcactcc agcctgggag acagacgaga 296221 ctccgtctca ataaataaat aaataaataa ataaataaat aaaaggaggg cctggcacga 296281 atgacatgca gggaaggcag tgagcaggtg gaggtccctg tactcgttgt ggtgccttat 296341 ctaccaggcg gttgagttga cgtctttgtg gacagaattc gattacaaag gtgtccgaaa 296401 gtctccaggt gggggagagt tttgtagcgg gcataatttg ggttgtaaat tgactgttgt 296461 ctcctgagca gtctcctggt gagggagagt ttcactggag ctttgaaaca catgttagac 296521 agccttgccc tgtagggagt gtctcgtgaa ggagaggtaa acaattataa ttgcatttct 296581 aaagggctaa gtaggacatg ggaaactgga aaatggagaa aagaagagaa aaaaatagta 296641 actcattctc tttttcttag aaaaatgagg atacttggtc acatcctcac atggtgaaga 296701 ataagatggt ctcttcattt cccttcaaca ctagtcccat cgtgaggctc tgccctcatg 296761 acctaatcta atcctaactg cctcccaaag gtcccacctc cacatcccat tattggggat 296821 
tagggtttca acatagggat ttgggggaaa cacaaacact cagttcatag tgatatctga 296881 ctgcgattcc aaaggtctga ggacacattg gccctgttgt ttttgctgtg ttgtccttgg 296941 cctcaggtga acattttagc tttagattgc aacattttaa gctaaaaaca ctaagctaga 297001 aaattaaact tatctcgtca cattctgcat ggtgaacaaa ctcaaaactc acctgagtgt 297061 atcttgtttt catgagaaaa aatgaccggt catctccctt gccctgttag ggacagttag 297121 acatgactgc tgtatgaaga tatgtacatg ttaacaattc atatgtacaa gctactgttc 297181 cattttgcct ggcccaaggg gaggtagcta gctagttggg gtagcagggc gaattaaccc 297241 taatataata gtacatttct aatgccacca atgaaatgaa aaccagctct caaacatggc 297301 actatatggg atggcattcc aaggccaact cagttacagt aactacagct ttaactttga 297361 tcattttgtt accagaaagg ggtcccaatc cagaccccaa gacagggttc ttggatctca 297421 tgcaagaaag aattcaaggc gggtccatag aggaaaagga aatgaagttt taaagaaagt 297481 aggaaaataa agagtggttg ctccataggc agagtgaacc taagggctgc tggttgccca 297541 ttttattgtt atttcttaat tatttactaa acaagggctg aattattcat gcctcccctt 297601 ggtagaccat atagggtaac ttcctgatgt tgccatggca tttgtaaacc gtcatggctc 297661 tagtgggagt gtagcagtga ggacaaccag aggtcactct cattgctatc ttggttttgg 297721 tgggttttga ccggcttctt tactgcaact tgttttatga gcaaggtctt tatgacctgt 297781 atcttatgca gacctacaat ctcatcctgt gacttagaat gacttaaccc tctgggaatg 297841 cagcccagta ggtcccagcc ttattttacc cagcccctat tcaaaatgga gttgctctgg 297901 ttcagacgcc tctgacaatt ttaaagctaa tgaaaaaaat gtctagccaa atgtatatat 297961 acatacatat ataattatga aaagataatg ccttataatc ttgattctta aattgaggta 298021 tattgatagt attctcccaa tggtctgggg attaatcaat aggtaattta tacaaaactg 298081 aagtgctgcc aaatagaatt ataggagtag ctgctgcgca aaaaagatta atgagatttg 298141 gggagacttt gctgtggcta atatgatggg tcaggaagag tacaatctca gtacaatttt 298201 aagttgtatt tatcttatga ataattgggc tattctttca tatgcccaag ggccatttta 298261 tagctctata tttcttttgt gttctgtttg gggcttgaag tctctacttc tgtatcggga 298321 ttgcaatgcc tgctgtctta ctgtttccat ttgtctggta aattttcgtt cattccttta 298381 tttttagcta tttcaattgc tgcattttcg gtgtgtatct tgtgtgcagc acacagttga 298441 gtttttgcta tacgagccaa 
tccgaaaata atctttttta ataggcaaat taagccgatc 298501 tagttttgat ataactacca tgtttgatgt caattctgtc atattttaga ttgtaattgc 298561 tgaatgtatt atatttactg tgtctatcct cttgcgtttt cttttggctt atgaattttg 298621 tgtcttttat ttatttattt atttatttat ttatttattt atttagagac ggggtctcac 298681 tctgtcgccc aggctgcagt gcactggtgt gatctcggct cactgcaacc tccacctccc 298741 ggattcaagc gattctcccg agtagctggg actacaggca cgtgccacca cacccaacta 298801 cttttttatt tttagtagag acggggtttc accatgttgg ccaggctggt ctcaaactcc 298861 tgacctcagg tgatccacct gcctcagcct cccaaagtgc tggaattaca ggtgtgagcc 298921 accgcgctgg gtcttgtatc tttattctaa tgggttgctt ttatactaat cttctgtagt 298981 gctcttactc cctttttttt tttttttttt ttttttttaa gacggaagtc tggctctgtc 299041 gcccaggctg gagtgcagtg gcgcaatctc ggctcactgt aagctctgcc tcccgggttc 299101 acaccattct cgtgcctcag cctcccaagt aactgggact acaggcaccc accaccacgc 299161 ccagctaatt tttttgtatt tttagtagag acggggtttc accatgttag ccaggatggt 299221 ctcgatctcc cgacctcatg atccacctgc cgtggcctcc caaagtgctg ggattacagg 299281 catgagccac tgcgcccggc cccttttttc tttcttaacc tgttactatc tgtcttttct 299341 agatgatact gattcccacc tattacctat atgtcagtca atggtgttac tctacttttc 299401 cccttttctc tcccttctcc caccttttta gttgtatttg ttctacttta tcagtatata 299461 atgtttacat atccatgttt ttcattaacc catgaacgaa ctcttacatt attctcagct 299521 ctacagtcaa atatgtgaaa tgatctctgt tggtcctttt gttcccagaa atattctggt 299581 tgggtgaaac tcaccctcca gtaaattgct cagcaagagc acgtgtactg tattcctgga 299641 attctggtga gtttgaaact gattttctgt gaccttgaaa cttgaagagc agcttgtttt 299701 tccttaggtt tcttgaaaat ggtgttccag tgttttcttg gtttgcatgt tatttttgag 299761 aaatctgagg ccaatcttta attctctttg taaatatttt ttgggcctgg aagccttgag 299821 gatttttttt ttataagtct aataattttc cgagctatct cagaattatt tgtggtcagt 299881 tttcctatat accacactgg gccctttcag tgtagagttg actcagttct tttatttctg 299941 gaactttttc ttggattaga gttttaaata ttctgttata ttgtttttct tttcttcagg 300001 aacaccaata atacaaatat gcaacctcct ttgtctggct tccatttcaa ctatcttctc 300061 tggtcctttt tattttatct ccttttcatt ctcttggcta 
tccttctttt attcaatacc 300121 ctttattaag tttttatttg aacccattct cttcggggca ccttgtaatc tgttctttat 300181 tactgataaa gttttgtgat tttcttctat tttttttcct gaatttaatc agctctggct 300241 tcattttctc ctattttttg ttgatttctc tacctagttt ttatatttcc gattcaacat 300301 attgatattt tccatgttcc cagatgcagg ttggagaata tttcataaat attgagtgtt 300361 gtcttagaat tttcatcaag ttcatggtta tatcgaggag acatcaccag cagaaaaact 300421 ttgatttgca acttctgact ttggcagttg ttctctatag aagttggtcc actccttcct 300481 cttcctgttc attctgtggg caggagacct agttaggagc agcttcttcc ctcagttcaa 300541 caatgttctg tgcagttgct tctgtggctg atgttgacag gaaagataga agagttgttg 300601 tgtctcctat tttctgcagg atgcttaatt ttgttctgtc tcctcttctg ctgtgcctgt 300661 ctcccagaag caccactcct ctccctgtgt ctgattctcc ccaacaagca tggattctcc 300721 atgtctggtc tgcacactgt caggccccac tccggagcct gggctctgaa ctgccaagtg 300781 cccatcattg ctcgtttttt ttggtttttt tttttttttt ttttttttga gaccgagttt 300841 cactcttgtt gccgaggctg gagtgcaatg gtgcaatctt ggctcactga aacctctacc 300901 tcctgggttc aagcaattct cctgcctcag cctcctgagt agctgggatt acagacgcac 300961 gccaccatac ccggctaatg tttgtagctt tagtagagac gaggtttcac catgttggcc 301021 aggaggtctc gaactcctga cctcagatga tccacccacc tcggcctccc aaagtgctgg 301081 aattacaggc gtgagccacc atgcccggcc ttgttcattc tctttgtaaa tattgtttca 301141 ctttatcttt tgggagtggc tttgtctgtt ttttctctaa gttctctgct tccctccacc 301201 cttttcccca atctcttgag actcctctcc cctgatttct gcacccatag gtttgtggtg 301261 gctgcttgct gcaggcccac agtgggcact ctgttggatt tttttttctt ctatgcacag 301321 ataattggaa aattgcagct tctcttatct cttagtcatg ctgaaagagg gacatttacc 301381 accattatga ggaaatctgc acactggtat atttttaaag tgccccagat gattctaata 301441 cacagctgtg gtggagaaca tcttgttata tcatcataat gtgcttttag tacattttac 301501 atatattttc ccaaaagtcc cttggcttcc cagccatgac tgtctcattt ttctcctacc 301561 tgagtcctgt aggtaagaga agcactgtcc ttcacatctc tgtttctcaa agaattcacc 301621 tacatcagaa tcacctatga gcctggccaa acattgcagt tttccagatc ccattctggg 301681 cctactgaga tc // LOCUS D88010 3270 bp DNA PRI 13-JAN-1997 DEFINITION 
Human DNA for ribosomal protein S13, complete cds, U14 small
            nucleolar RNA, complete sequence.
ACCESSION   D88010 D88011 D88012
NID         g1754632
KEYWORDS    U14 small nuclear RNA; ribosomal protein S13.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae;
            Homo.
REFERENCE   1  (sites)
  AUTHORS   Kenmochi,N., Higa,S., Yoshihama,M. and Tanaka,T.
  TITLE     U14 snoRNAs are encoded in introns of human ribosomal protein S13
            gene
  JOURNAL   Biochem. Biophys. Res. Commun. 228 (2), 371-374 (1996)
  MEDLINE   97079189
REFERENCE   2  (bases 1 to 3270)
  AUTHORS   Kenmochi,N., Higa,S., Yoshihama,M. and Tanaka,T.
  TITLE     U14 snoRNAs are encoded in introns of human ribosomal protein S13
            gene
  JOURNAL   Unpublished (1996)
REFERENCE   3  (bases 1 to 3270)
  AUTHORS   Kenmochi,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-SEP-1996) to the DDBJ/EMBL/GenBank databases. Naoya
            Kenmochi, University of the Ryukyus, Biochemistry, School of
            Medicine; 207 Uehara, Nishihara, Okinawa 903-01, Japan
            (E-mail:kenmochi@med.u-ryukyu.ac.jp, Tel:81-98-895-3331,
            Fax:81-98-895-7049)
FEATURES             Location/Qualifiers
     source          1..3270
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     exon            <1..48
     CDS             join(26..48,190..238,421..499,2040..2209,2466..2566,
                     3196..3229)
                     /codon_start=1
                     /product="ribosomal protein S13"
                     /db_xref="PID:d1014222"
                     /db_xref="PID:g1754633"
                     /translation="MGRMHAPGKGLSQSALPYRRSVPTWLKLTSDDVKEQIYKLAKKG
                     LTPSQIGVILRDSHGVAQVRFVTGNKILRILKSKGLAPDLPEDLYHLIKKAVAVRKHL
                     ERNRKDKDAKFRLILIESRIHRLARYYKTKRVLPPNWKYESSTASALVA"
     intron          49..189
     exon            190..238
     intron          239..420
     exon            421..499
     intron          500..2039
     repeat_region   1200..1490
     snRNA           1795..1885
                     /note="snoRNA, U14-S13-3"
                     /product="U14 small nucleolar RNA"
     exon            2040..2209
     intron          2210..2465
     exon            2466..2566
     intron          2567..3195
     snRNA           2919..3010
                     /note="sno RNA, U14-S13-5"
                     /product="U14 small nucleolar RNA"
     exon            3196..3270
     polyA_signal    3250..3255
BASE COUNT      800 a    714 c    787 g    964 t      5 others
ORIGIN
        1
ctttcgttgc ctgatcgccg ccatcatggg tcgcatgcat gctcccgggt gagctcgggg 61 catcaagccg gattgctggg cggggggtgg gaggaagaca gggagtgtgg gcagcgggcc 121 gaggggatga tgttctgggc tgctctggca ctagccgcca cctcacctcg agactgcttc 181 tctccccagg aagggcctgt cccagtcggc tttaccctat cgacgcagcg tccccactgt 241 aagtagcgcg ctgggaccgg ggagaatccg gggagggggt tggcatttgt ctcgggtgaa 301 gcgacgccag ggtgaggaac ttgcgtgtat gaggagcgcg gttttgcgga aggagagacc 361 gctgttctgc ggcgccattc ctgggttctc atcctaaggc tgctttctat tccataacag 421 tggttgaagt tgacatctga cgacgtgaag gagcagattt acaaactggc caagaagggc 481 cttactcctt cacagatcgg tgagtgtttg tgtctaacat agcctatttc gcctgtcctc 541 gtgtgacttg taggatctag taggtggtaa agttatttta aaaatagcca aaactctggt 601 ctcgccaccg tgctagaggc atcattgtaa gcacttcaca tgtactgatt tcgtttaatt 661 ctgaccacaa tactttgaaa taaatcttca tggctttcac cccagtttac agttgaggaa 721 gcagaggcac actgccgctg gatactaacc agccccaaat cacacagctg tcaagtggag 781 gagttggaat tggctaagca gagtgcctga cacagttcct cacagttagg tgtggtttga 841 atctcagctg cgtgtgactt cgtgcaagtg gcttcattca gcaaatgttt accttctatg 901 tgctgaacat ggttctgggg cctgagaaca tactactctc cgactcccct gcccaaaaaa 961 gccagccgca atgtgcctgt agtctcagat acttgggatg cagaggtgag aggatcacca 1021 gagcctagaa gtcccaggct gcagtgagct atgatgaagc cactgcactc ctgggtgaca 1081 gagagacccc acctcttttt tttttttgag acagtcttgc tcttgtcgcc cagtctggag 1141 tgcagtggcg cgatctcagc tcactgcaac ctccacctcc tggatttcaa gcgattctcc 1201 tgcctcagcc tcccgaatag gtgggattac aggcgcccgc cagcaggccc agctaatttg 1261 tgtgtttttg tttgagaccg attctcactt tgtcacctag gctggagtgc agtggtatga 1321 tctcggctca ctgcaacctc cgcttcctgg gttcaagcaa gcaattctcc tgcctctgcc 1381 tctggagtag ctgagactac aggccgcacc accacgcccg gctaattttt gtatttttag 1441 tagagacgag gtttcaccat gttggccagg ctggtctcag actcctgacc tcagatgatc 1501 cactgcctcg gcctaccaaa gtgctgggat tacaggcgtg agaccaccac ctgatcccac 1561 ctcttaaaag gggnaggggg ntcgcnnaaa aanctgtaca ccggagcttg gagtctaatt 1621 ggaggtatga cacatcaaaa gaaaatatgt catagattgt aagtagacag agctcattag 1681 aggtttaaaa gaaataacgt 
gtaaagtttt tagaagaata tctgaccaga gttctcagta 1741 aatgttagct gtggaatctc attaatccct ttggtcatct gtattccagc aagctcactg 1801 tgatgatggt tttccaacat tcgcagtttc caccagaaag gttttcctta gtgttgggta 1861 aaccttcctt ggatgtctga gtgagcttta tgtgcattct cacgttggtt tctgtgatgt 1921 ataaaagtat acccccatct tacagatagg aaactatgcc ctgatttctg agccttgaca 1981 tcatcacatt ctccagtttg tgtttcgtgt tgaaagttaa aacgtgattc ccttttcagg 2041 tgtaatcctg agagattcac atggtgttgc acaagtacgt tttgtgacag gcaataaaat 2101 tttaagaatt cttaagtcta agggacttgc tcctgacctt cctgaagatc tctaccattt 2161 aattaagaaa gcagttgctg ttcgaaagca tcttgagagg aacagaaagg taagctaata 2221 aataaaacct ggtgctaagt aggactgctt gtactaaatt ttttcggtaa aatgtaatgc 2281 atctggtttg aattatccaa ggcttagtac acaaaccagt gtagtgctgc tttgaagttc 2341 aaattctatc caaagagatt ttcattatag aaagtgcatg gtagctctac attctcttat 2401 taatgtttga taatgttagg tcattttggg tggttttctt gaattgcacc aaattttatt 2461 tttaggataa ggatgctaaa ttccgtctga ttctaataga gagccggatt caccgtttgg 2521 ctcgatatta taagaccaag cgagtcctcc ctcccaattg gaaatagtaa gtatcaactc 2581 ttttgtcgtt gttatcaaga ataggagtca gccagtagta aaagtcctag tagtaaatat 2641 gttcagcctt gggggctagt agtaaatatg ttcagccttg ggggctattt tgttaactgc 2701 tctttgagct ctggcctggt agtgggaaag cagccacagg caatacataa tcgaatgggt 2761 gtggctgctt gtctttaata aagacaagca gagggccata gttgcagacc ccagatttag 2821 accatgaata agccctttga ctctctttta gatagtaggt agattttcct ttcatacctc 2881 tactcccagg agtttgaatt aattttcaca tgaattgctc actatgatga ttggttgcca 2941 gacattcgca gtttccacca gaaatgtttt tccttatgtt ggccagttct tccttggatg 3001 tctgagtgag catcttcatt cattgtgctt gaacacaatg agttggttca ggtattcagt 3061 agtttggtgc actttatata gtttggattt attaactgtt tgagaatagc accattagtg 3121 gggagattct gttaatgaat gtggagggac cagtattaat cctgattttc tttctttccc 3181 cttttgcttt tccagtgaat catctacagc ctctgccctg gtcgcataaa tttgtctgtg 3241 tactcaagca ataaaatgat tgtttaacgt // LOCUS D89060 9836 bp DNA PRI 05-DEC-1997 DEFINITION Homo sapiens DNA for oligosaccharyltransferase, complete cds. 
ACCESSION   D89060
NID         g2687867
KEYWORDS    oligosaccharyltransferase.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Yamagata,T., Tsuru,T., Momoi,M.Y., Suwa,K., Nozaki,Y., Mukasa,T.,
            Ohashi,H., Fukushima,Y. and Momoi,T.
  TITLE     Genome organization of human 48-kDa oligosaccharyltransferase
  JOURNAL   Genomics 45 (3), 535-540 (1997)
  MEDLINE   98035877
REFERENCE   2  (bases 1 to 9836)
  AUTHORS   Yamagata,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-NOV-1996) to the DDBJ/EMBL/GenBank databases.
            Takanori Yamagata, Jichi Medical School, Department of Pediatrics;
            Minamikawachi-machi, Kawachi-gun, Tochigi 329-04, Japan
            (E-mail:momoi@ncnaxp.ncnp.go.jp, Tel:+81-285-44-2111,
            Fax:+81-285-44-6123)
FEATURES             Location/Qualifiers
     source          1..9836
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     CDS             join(136..340,548..658,5359..5445,5767..5870,6012..6106,
                     6840..6933,7175..7323,7774..7921,8578..8698,8823..8929,
                     9022..9171)
                     /note="48kDa"
                     /codon_start=1
                     /product="oligosaccharyltransferase"
                     /db_xref="PID:d1024551"
                     /db_xref="PID:g2662375"
                     /translation="MGYFRCAGAGSFGRRRKMEPSTAARAWALFWLLLPLLGAVCASG
                     PRTLVLLDNLNVRETHSLFFRSLKDRGFELTFKTADDPSLSLIKYGEFLYDNLIIFSP
                     SVEDFGGNINVETISAFIDGGGSVLVAASSDIGDPLRELGSECGIEFDEEKTAVIDHH
                     NYDISDLGQHTLIVADTENLLKAPTIVGKSSLNPILFRGVGMVADPDNPLVLDILTGS
                     STSYSFFPDKPITQYPHAVGKNTLLIAGLQARNNARVIFSGSLDFFSDSFFNSAVQKA
                     APGSQRYSQTGNYELAVALSRWVFKEEGVLRVGPVSHHRVGETAPPNAYTVTDLVEYS
                     IVIQQLSNAKWVPFDGDDIQLEFVRIDPFVRTFLKKKGGKYSVQFKLPDVYGVFQFKV
                     DYNRLGYTHLYSSTQVSVRPLQHTQYERFIPSAYPYYASAFPMMLGLFIFSIVFLHMK
                     EKEKSD"
     exon            136..340
                     /number=1
     intron          341..547
                     /number=1
     exon            548..658
                     /number=2
     intron          659..5358
                     /number=2
     exon            5359..5445
                     /number=3
     intron          5446..5766
                     /number=3
     exon            5767..5870
                     /number=4
     intron          5871..6011
                     /number=4
     exon            6012..6106
                     /number=5
     intron          6107..6839
                     /number=5
     exon            6840..6933
                     /number=6
     intron          6934..7174
                     /number=6
     exon            7175..7323
                     /number=7
     intron          7324..7773
                     /number=7
exon 7774..7921 /number=8 intron 7922..8577 /number=8 exon 8578..8698 /number=9 intron 8699..8822 /number=9 exon 8823..8929 /number=10 intron 8930..9021 /number=10 exon 9022..9171 /number=11 BASE COUNT 2278 a 2383 c 2621 g 2554 t ORIGIN 1 tcccaccagg gcacttccgg cggcgctctc cgcgccttat cgccaaagct gcggctctgg 61 acgcccagcc gcggcgtatc ccgatcactt ccgggtagtg ctccacgggc acgagccgcg 121 attgggctac cgtagatggg gtacttccgg tgtgcaggtg ctgggtcctt cggcaggagg 181 aggaagatgg agcccagcac cgcggcccgg gcttgggccc tcttttggtt gctgctgccc 241 ttgcttggcg cggtttgcgc cagcggaccc cgcaccttag tgctgctgga caacctcaac 301 gtgcgggaga ctcattcgct tttcttccgg agcctgaagg gtgagagcgg ggtccgaggg 361 ggtagcgtgt gggggcctgg ggttggtaca gacatcgagg gcattgacca ctctccttcc 421 ctctcccttt cctggagaca ggccgcccgg gtctggcctg ccgcttggcg tgcgggtccg 481 cgatgtctgc agcaggggtt tcccttcgcg tcccgtcttc ctcaactacc tgttttttct 541 tgtccagacc ggggctttga gctcacattc aagaccgctg atgaccccag cctgtctctc 601 ataaagtatg gggaattcct ctatgacaat ctcatcattt tctccccttc ggtagaaggt 661 aagggttccg cgtcgcccct agcactggaa ttcgagaccc gggacttgat ggattgcttt 721 cttctggttc ttgggcatgt cctgggtcac aggatgggtt gccagtcctg gatggtgcta 781 ccacggagga taaggaaaat gctcatttct gacctgaccc taggagggta catctttcta 841 taagaaacag ataaggattg tggagtcaag tgttggtgta gaccctggaa cccagggccc 901 tgctgctctg ctgagaaggg agtttactct gtaaaggggc tgataaaaag atggaaagct 961 cctgggttgg gtttggctgg gtggaaagag gaaatggcct ggtattaata tggattgcta 1021 taaagaggca ttgtgcattg tggcagatgg gtctttagaa ttcagataga ttcatagtct 1081 cgggctgaaa aaagagcctt agcaattatc taccacccca cccctcctcc catttgaggc 1141 tggataaata tgtccagcag ccttgcagtg ggctagtcgt ttaacagaat gaatatcttg 1201 ggcagattat ccagttgata cttgagtatt cttgttttac gaggcaatcc actccaccac 1261 tgaacaattc tgttaagaag ttcttatatt aagaaatatt gtctcctgct ttattttgtt 1321 cttatgctcc ccagcacctt agagaataga tagaatgctt atcttgtttg caggcatagc 1381 cttcacataa tcacagctta ttgggttgtt tctcctccag actgaacatc cccagctcct 1441 tagctatttc tcttttgcca tacttttttg gtgagcatgt tagtcattct cttctttatt 1501 
tgcttcagtt tatctttttc ttaaaagact caagttgtcc ctgtctgggt aagacaggac 1561 tgccttcctc ccttcagtta atgaagctta ggaagttctt cattggtgcc tggtgtggtt 1621 tgtttgttta attgagacag ggtcttgctt tgtcccccag gctgagtgca gtggtgctat 1681 catggcttac tgcagcctcg acctcccagg ttcaagtgat cctccctcct cagcctcctg 1741 aataactggg acatggcttc tttattctct gtagttcatg cgagattcag ttccaccgaa 1801 tccctcaagt acttccccta ggcttctgtt acttactgtg tcactcccaa ctcccttgca 1861 acatgtgctt ttggaattgg ctttttggag gcaagattag ttaggcggat gtcaccccaa 1921 attggtagaa gttgaaatct tctgatgact gggctttggt gcagtaactg gtgtttgcca 1981 agtttttgct gtgtttgaaa ctttgaaata ggtaccttac agtggactgg gtgagatctt 2041 aaggataggg agtctgccct ggaggagttt tttcagtctg tttttgagga aataaagacc 2101 tgaaatgagt taccaaaccc tgtgtcagga tgtaagatga agcggaggca ctgtcagctt 2161 gtataggtca gaaggaaaaa agcagtgaat gagactgctc gggagagggc tgagagagga 2221 tactgttggt gggagaaatg acagagagtg taaaacagtg tttcccaaac tttagtcatt 2281 tgcacacgtc cttcatggtt tttaccatgt tacgatctcc tgtgctaata actacctaat 2341 attttcctcc aaattgacct gcttaaaaaa aaaaaaagaa aaggctaggt atgataggct 2401 atccctgtaa tcccatcact ttgggaggcc gaggcagaag gattgcttga gtccaggagt 2461 ttgagactag cctgggcaac atagtgagac ctcatctccc ccaaaaattg acccaaataa 2521 acggggcgtg gtggtatatc cctgtagtcc caactactgg ggaaactgag gtgggaggat 2581 ttctgaccct cggtggtcca ggctgcagtg agctgagatc atgccgactg cactccagcc 2641 taagcgacag agcaagaccc tgtctccccc caaacaaaaa aaagaaaaga gaaaaaaaac 2701 taaaacaaca ttttaaaagg gaaacaatag cgagcccaga attacccccg aaaatggaaa 2761 tccagtgtca tcatttgatg tacgtaggag gtaactataa atagaataaa aacaacaccc 2821 gtaagttcta gccagatgct gctgcctgcc gaaggctttg aacctgaggt tccctcttga 2881 aaaaggagat tagcaagcat tagaagctct aacaactagt atcaaactaa gacattctct 2941 ttgatggaga aagcttgaaa aataaggaaa gggagtaact ttctgtctct gagattcaag 3001 gttatttaaa gcccggttcc ccgcattcgc gttgctgcct tttctgcctc tggtggagtg 3061 ggttccacac tttggaaggt gctgctgtgg gaaatcctca ggcgttcctt gggaggaagc 3121 tgcagggcag ggttcggggt aggggtttgg tgtctgctcc cagggccctg ccctcgtccc 3181 atgactactg 
tcactgtcac cctctcaggt gagtgagtgc tggcctggct tccctgtctg 3241 tgccccagtg gcttctttgc cctctggcat cggaaggctc ccaggtcact ggcaatcatg 3301 tctacaaggg aagaaatttt cagcacatcc tctatacccg ggaaaaaccg gatccctgtt 3361 atagatgggg aagtgggttc agaagggtga agggatttgc ctagttttaa ctccaggtgg 3421 tctcgatttc aaagcccgtg ttctgtcagc tttacaagag tttctcaaca ggtgcattcg 3481 cgacgtttca gactggataa ttttatagtg aggctgtccc tgtgtactgt agggtgttta 3541 acagcacccc gggccctctg cccattaatg ccagtggcca cttccctgtt atgtgacggc 3601 gaagaaatgt ctccagacat tgctctgagg ggcaaaatcg gccaagattg caaaccactg 3661 cactgtgcca ataggctgct tttggaatgt ccaagactgc ctggcctctc caaaggagag 3721 tagatagatg gctttttgaa tgtagcatga gaggcagagg tagagaatat aggagcttct 3781 gcgtatgcct gaaagtcagt caggtaaggc gcttttagtt ttctgctgaa gattacccct 3841 caaaaaacag ggctcttata ttcagatctt tacaaagtca tttccacctg ctactgaggg 3901 tctaatggac acacttctct gtccagtctc agaaattggc atgtccgtcc ttccggttgc 3961 tcagccagac tccttggagc ttttctaatg cccctgtcga agcatcagca ggccctgcgg 4021 ctgtgccatc agaaggcgtg gaggcctgct gcgcctccct ctgttgctgc tccaaggtcg 4081 gctcagcact cttcactgcc tagattttca cagaggcctt ctaaccccat caccatggcc 4141 actttcaccc tcgtacccta gtctctaccc cgctgccagg atcgtccctc aaaaatagga 4201 atgagatcct gccaggcccc tgctcagaat cgtccagcgg ctccccacgg caggtccctc 4261 gtgtggggtg cagagccctg cgcagcctgg gccctcactt cacccggcct tacctcctgc 4321 tgctcactcc tttctctcca gctaagccac actgggcccc tcgctggtga ccgtgtccac 4381 ccccctcggg gactttgcac ctggtgttca ctgtgctggg aatcctctga ccccacagac 4441 tgcctggctc acgcttacat cttttcagat gccacttgca ggtcttcctt aaacaacctt 4501 ggcacaagag ctcccctccc acattgtcca ctttgttttt cttctcggct cacaagaatt 4561 actcatttaa tgaataaatg gattactgag ccctgtcttc ataaccttgg tgtgaatatc 4621 agagtacact ttggggagac tctggaccct caaatagccc cacccagtgc tagttttaga 4681 tgcttatggc agtagtcaca tgatccaatt aattgaccag aaaaggatag gatctcacag 4741 ttacagatgt tggaaattaa tgagcccttt tggtgatcac ttttttaaaa agcagtgtac 4801 tgggtagctc tgttgtggat gtcatggaga gaatctgtag gattaatagg gaaaaatgga 4861 ttaatgccag accagggagt 
actattgtca agtgatggca tcactggaga tgcagaagtg 4921 ctcttcctca gaccatttag acagggtgga attgacagcc tctggaaact gcctccagtg 4981 atcagcatca ctagcgtgaa aggtgcaggt gatgagatga aatatgggca gtacggcatt 5041 tccaaatgca tatttcctat gtgagttgaa atgtgtacag cttacttgac cagtgcaatt 5101 agtggatgtt tgactgtttc ctccaaacct ccagaagttt ctaattctga gccataattt 5161 tgagtcttca cgctgcgagg atgggccctt gcccgttctg ggatgtggtg tgggaaccct 5221 ctccctcatt gtgcccccca gagggtgctg ctctgtgagt gatcgtggct gctctgccct 5281 tcatgtcccc tccctgccag tgtcagagga gcagggagag gagacctctg ggtcactgtg 5341 tctggtgcct gctttcagat tttggaggca acatcaacgt ggagaccatc agtgccttta 5401 ttgacggtgg aggcagtgtg ctggtagctg ccagctccga cattggtgag tccgacctgg 5461 gaggttctgg tgctttccac ttctccaggg agttggagag cattacaaac agaagctgcc 5521 aaattagcat tcactgggtg ggccttagct ggggagtcag tgagggaacc gttttctcag 5581 ttcagaatct gactcctgga ggcagaatac cacactgctg tccaggggtc catctgggtc 5641 tgagcccagc tggatgaagc tgcagggcac tggggctgac agagagggag tcacaaggaa 5701 gccaaggaca cttggcccaa ggcccccctg taagggctca gctaggtgtc tcttcggtct 5761 gtgcaggtga ccctcttcga gagctgggca gtgagtgcgg gattgagttt gacgaggaga 5821 aaacggctgt cattgaccat cacaactatg acatctcaga ccttggccag gtaatcaggc 5881 ctgtcacctt ccaggtttca ggggcatatg gcaaagctct gcttagccaa aagccaagag 5941 ggtgtgagga ggttctccct ggcttgggga accttttcct gaccacccac ctttccatct 6001 actctttgca gcatacgctc atcgtggctg acactgagaa cctgctgaag gccccaacca 6061 tcgttgggaa atcatctcta aatcccatcc tctttcgagg tgttgggtga gtgagtgagc 6121 agggagagga cagcagaaac ctgggggaag aaggggtcac ttaatttttg tgggaaatca 6181 aggcaaaatt gtaattcttt aagggtacgt agatgggatt aatttgttat attaaggccg 6241 agtatagtgg ctcaacctgt aatcccagcg ctttgggaag ccctagccag gaggttcgct 6301 tgaggccagg agtttgagag cagcctgggc aacacagaac tcctctcttt aaaaaaaaaa 6361 ccaaaaaaac caaaaaaaaa aaagttggcc gagcgcagtg gctcacacct gtaatcccag 6421 cactttggga ggctgaggtg gacagatcac ccgaggtcag gcgtttgaga ccagcctggc 6481 caacatggtg aaaccccgtc tctgctaaaa atacaaaaat tagacaggta tggtggtgtg 6541 tgcctgtagt cccagctact tgggaggctg 
atgcaagaga attgcttgaa cccaggaggc 6601 agaggttgca gtgagccaag gttgtgccac tgcactccag cccaggtggc agagcgaggt 6661 ctgtctgaaa caccaaacac caaaacaagt cagactccat tgttatatca agacaagagc 6721 taagtggttg gcaattaaga aaaaaggatt cattcctggg aagaactggc tctcggggat 6781 cccatgtcct tgggctcagt tcctggggcc agtgctgggc tgctctcgtc ctgctgcagg 6841 atggtggccg atcctgataa ccctttggtg ctggacatcc tgacgggctc ttccacctct 6901 tactccttct tcccggacaa gcctatcacc caggtaaggg cttcacaacc ttgaggcttc 6961 gaaataaaaa tgaagcagaa cagcacattt ggctgaggcg aaggaaatga ctagttggtg 7021 gggtggggca tgggcagaag cggtgtggtc tcagcctccc ccttcaggta gggccagggt 7081 cgttttggtc tggcctgctg tcctcttgtt cgggaggcgt gcagcttttc ccttacttgg 7141 aactgcttgg gtcacagctt cgttttgttc ccagtatcca catgcggtgg ggaagaacac 7201 cctcctcatt gctgggctcc aggccaggaa caatgcccgc gtcatcttca gcggctccct 7261 cgacttcttc agcgactcct tcttcaactc agcagtgcag aaggcggcgc ccggctccca 7321 gaggtagctg ccaggaggct taggggccgg ggatccgggt ggtgcagtgt gtgtatatgg 7381 ggaggactca gaaggggaag aaggaggctt ttagctaaaa gtgtcagagg ctgagtgttt 7441 caggtgctgt cagattattt cagggcttta ttgctatatt gctggtattg gcacttccaa 7501 ttttttaagg tagtagtgac ttatctagag atccctaacc tccaccgttc cttttagaaa 7561 tttgtgtatt tgttgtgcag atgacaagca accagccata tcttttcgtt tgtttcctgc 7621 gttccctttt aaccatgagt ccctaggtgc ccaaggccta caggcagaac ttcgctaggg 7681 tactgtcatg gtggacatga gtggtccctg cccatcaggc aacactggaa aggtggctct 7741 ggtggccaag ctgctcctgt ttccctcctg caggtattcc cagacaggca actatgaact 7801 agctgtggcc ctctcccgct gggtgttcaa ggaggagggt gtcctccgtg tggggcctgt 7861 gtcccatcat cgggtgggcg agacagcccc acccaatgcc tacactgtca ctgacctagt 7921 ggtaagagat gtttggggtg ggagggccca aagggtttgc caagctgact gagctcagct 7981 cagaagcact tattgaacac ttctttgctg gcctaaaggg cataaataaa gatgattaag 8041 ttaatcccta cacctaagca tgccaggtga gatgcagagc ctggcgtgaa aactgatgtc 8101 ctaataggag cacttctttt ggaagaggtg tgtgtatttg ctatggcggg cagcagtggc 8161 aggggcttca ctgattaact ctgcccagga agttggactt aataagagaa ataggtcatt 8221 aggatgaatt tgttaggcaa gagaaagcga atatggaagc 
atgaatactt gaaagtgcct 8281 ggcacatctc ggggcaggag ggtggtttgg gtccagactt tgatgggcca tgaatactgt 8341 gtgaaggaat ctagcccttc agagaagagc cagagatcct gtgttggggg aataaccttt 8401 ctgaggagag agactggtta ctgatcaggg ccgggtggca ggggcatctg caggtcttct 8461 gaactcctgg gcattcctgc catgttcgtc gggtgggaat taagtgcagc tctgcctgca 8521 ggtggtggaa ggcctccagt catggccacc cctggctaac ctgccctttt atctcaggag 8581 tatagcatcg tgatccagca gctctcaaat gccaaatggg tcccctttga tggcgatgac 8641 attcagctgg agtttgtccg cattgatcct tttgtgagga ccttcctgaa gaagaaaggt 8701 gagtgtagtt cctgcacgag gatgctgcca gaaaacggcc ccaggcctca cttgctctcg 8761 aggcccaacc aggaaaagct ctgggaaacc cctcccggtt attcttgttg gttcttccgc 8821 aggtggcaaa tacagtgttc agttcaagtt gcccgacgtg tatggtgtat tccagtttaa 8881 agtggattac aaccggctag gctacacaca cctgtactct tccactcagg taagcacagg 8941 tgacagcttc tgttttcagc tagcatgtgc cctgcctggg ttctgcagtg ggcaggtcct 9001 gccacctctg tcctcctgca ggtatccgtg cggccactcc agcacacgca gtatgagcgc 9061 ttcatcccct cggcctaccc ctactacgcc agcgccttcc ccatgatgct ggggctcttc 9121 atcttcagca tcgtcttctt gcacatgaag gagaaggaga agtccgactg aggggctaga 9181 gccctctccc cacagcgtgg agacggggca aggagggggg ttattaggat tggtggtttt 9241 ggtttgcttt gtttaaagcc gtgggaaaat ggcacaactt tacctctgtg ggagatgcaa 9301 cactgagagc caaggggtgg gagttgggat aatttttata taaaagaagt ttttccactt 9361 tgaattgcta aaagtggcat ttttcctatg tgcagtcact cctctcattt ctaaaatagg 9421 gacgtggcca ggcacggtgg ctcatgcctg taatcccacc actttgggag gccgaagcag 9481 gcggctcacg aagtcaggag atcgagacta tcctggctaa cacggtgaaa ccctgtctct 9541 actaaaagta caaaaaatta gctgggcgtg gtggtgggca cctgtagtcc cagctactcg 9601 ggaggctgag gcaggagaaa ggcatgaatc ccagaagcag agcttgcagt gagctgagat 9661 cacgccattg cactccagcc tgggcaacag tgttaagact ctgtctcaaa tataataata 9721 ataataataa taataataaa ataaagcgag atgttgcctc aacttcacct ggccccggtc 9781 cggtttgcta atggctgttg gtggggttgc tgttggttat actgactgat caaatc // LOCUS D89501 7201 bp DNA PRI 05-MAR-1997 DEFINITION Human PBI gene, complete cds. 
ACCESSION   D89501
NID         g1854451
KEYWORDS    PBI; salivary proline-rich protein P-B.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae;
            Homo.
REFERENCE   1  (sites)
  AUTHORS   Isemura,S. and Saitoh,E.
  TITLE     Nucleotide sequence of gene PBI encoding a protein homologous to
            salivary proline-rich protein P-B
  JOURNAL   J. Biochem. (1997) In press
REFERENCE   2  (bases 1 to 7201)
  AUTHORS   Saitoh,E. and Isemura,S.
  TITLE     Nucleotide sequence of gene PBI encoding a protein homologous to
            salivary proline-rich protein P-B
  JOURNAL   Unpublished (1997)
REFERENCE   3  (bases 1 to 7201)
  AUTHORS   Isemura,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-NOV-1996) to the DDBJ/EMBL/GenBank databases. Satoko
            Isemura, Nippon Dental University, Junior College at Niigata;
            Hamaura-cho 1-8, Niigata, Niigata 951, Japan
            (E-mail:i_sato@ja2.so-net.or.jp, Tel:025-267-1500)
COMMENT     Sequence updated (24-Feb-1997) by:Satoko Isemura.
FEATURES Location/Qualifiers source 1..7201 /organism="Homo sapiens" /db_xref="taxon:9606" exon 636..723 /number=1 intron 724..1982 /number=1 exon 1983..2050 /number=2 gene 1997..6874 /gene="PBI" CDS join(1997..2050,6524..6874) /gene="PBI" /note="similar to salivary proline-rich protein P-B" /codon_start=1 /db_xref="PID:d1014669" /db_xref="PID:g1854452" /translation="MKSLTWILGLWALAACFTPGESQRGPRGPYPPGPLAPPPPPRFP FGTGFVPPPHPPPYGPGRFPPPLSPPYGPGRIPPSPPPPYGPGRIQSHSLPPPYGPGY PQPPSQPRPYPPGPPFFPVNSPTDPALPTLAP" intron 2051..6523 /gene="PBI" /number=2 exon 6524..7031 /number=3 BASE COUNT 2351 a 1397 c 1209 g 2244 t ORIGIN 1 gagctcgagg atcatccagc ataggaaaaa ttatcctcag tgcctagatt tatcaaaaag 61 acaaggctat aaatttccat tagacatcca cttcaaagta aaaatatctg tataattttc 121 ccataatcaa atctttgcat gtcatcatat tttcatttaa ttaaaactct caagttaata 181 acaacaggga ccacatgatt tttaaaacac ggttgttctc agttcttcac tgacaaaagg 241 gaacttccta tagaaaatac aaaagatggc ccattaaata aaatagaaaa tacaaaagat 301 ggcccattaa aattactcgg gatagaaaaa gttggtgctt tttctttaaa ataaagagga 361 aatggtgcgc cacaggcccc agattgtttc cctgaagtga gaggatatta tttgttgaat 421 ctgtttcccc aggaaacaac aaatattgaa caggtcaatg cttttcagaa atgagaccca 481 aaaaatgcgg aaaaagactg ccaaattggc aagaacaact ttcctatctt tgtttgaaga 541 gcatcagccc agaagttatt ttatttccaa gtagcctttc ttttactgcc aacagaaatc 601 ttttgcaata tataagaact gaagtttctt gtctcttttt cacctttatt ttctgtcttt 661 caactggcaa gagtcatttt gaccagcagg ttaatcaact ctaagacaga tcctcacgca 721 aaggtatgta cctggaaata ttaactgaag aaattttttc tcagactttt catgagcttt 781 ttataggaca ttcattatca ccaaaaacat aagcaccttc accttatgta aaattgtaag 841 tcgttaaaca tatcaaatct tgaaaaatta tgttattcac atgtcagagt aaagcattca 901 tgttgggcat tgaggacaat tagtccaccc aagtacacaa ttgacttttc cccttgatgt 961 tccttaaagt tcttgaaact tcataatgat agattattct cagatgtact ctaaaagagg 1021 gaattgttcc acagtatatg tttattatta gtttttcttt tgtttttcaa acttcttata 1081 agaacggatc ctgggaaaag taatccttaa gggaatagcc acacccacat acaattgaat 1141 atccagtgaa attttgtcca tataattctt tcaaaacttg 
cgttgttcag acaacacact 1201 caacctatgt gcagagttga ctgtcaggaa gccatttgta caatggtgca tggagttgct 1261 ttcccagaac tcaagctatc aggtacttga aatatttcac aaattgtgag ccaactgtga 1321 ataaaatatc tggactcctc catattctct gacttggtct ttgatctacc atatacggat 1381 ttttggaagc actttgagac cacacatcac agactaaatt aatttagtgg aatgttgtta 1441 aaactaggag atatataata tgactctttt tttcttacac tacatcaaac agagtgattt 1501 ctttcttttc cttatttttg agatatccca aggaattaca ctaacatatg atctagttaa 1561 tatgttaata tttcaaaccc aagcctatag aaatctataa aacaatgtac ctgctattta 1621 aaaatgcaaa agtattcaaa tgtaaataat tgtacatgtc aaagtagtta cctctaaagg 1681 caaaaactgt agttacaaaa gcttctattg ttcaatcatt tatgaacaac tctttggcat 1741 tatctccagt atcagtttac aagtcaaaaa taaaaactac ttccactatt ttgaagtcaa 1801 attttataag cctactaatg ttgtcccaca gtaatactga atccaaaata atgttaggca 1861 aatttcctaa aaaggctaca gtagtatatc tgtactacat tatatccatg tgttgagaca 1921 ttttaaatag tactttttaa taaaaaacaa tcatactgat cacctattgt gcttactttc 1981 agaggcaact gaaaggatga aatcactgac ttggatcttg ggcctttggg ctcttgcagc 2041 gtgtttcaca gtaagtatca ttaatcacga tcacacatct ttatactttc tcattaacca 2101 ttacttcaga tttcttttta cattaatgat gttacctttt cttatatatt agtaactatt 2161 aaccattgat gaaacactgt tctagtggtt aaatttaaat ttaaaatcta atttaattta 2221 aatagcccca tatgtctgat ggctacaata ctggacagct caagtctgta atatcagact 2281 gcattaaaca gataagtata aatgtaggtc cccatactct agcaattaca caataggtaa 2341 aactctataa aatgagatag tttaaagtag ttgttcctaa catatggagt cacctaactc 2401 acttattaaa aatatagtat taaagtcctc atccaagcct gaataatagg acctgggaag 2461 atgcattttg cagaggattt aggagaatct ttgcatgcta aattttgagg aaataataaa 2521 catttatagt aacacctacc atatattaaa aactaggtta aggacaacta cccttattgt 2581 ccttaaccta gtgtttcaga attttttaat gcatattttt aaaagtaata gaatgatact 2641 caatttcaac agaagggtca attaattcca gtgttacaaa gaggagatta attccagagt 2701 aacttctccc aaagtaccat aaaatctatt atatcaaatg ggaagatatg aaagctgatt 2761 ttttagaaga ctatgacttc cacaaagcag gcaacataaa attatatcat tctgaaacat 2821 gaaacataaa ataatacctc tttcttctac ttttaaaaac ttgaataaaa 
acaggacatt 2881 ctgagatgac tgaagtattc tgactttggc caccttcctg catgcattat gggtggacat 2941 aatttttgat aagccaataa aacatattga catccgttaa atgtcatgtc ttaatgactt 3001 catcataaag ctgattttca acattttcac atcctcaaaa tacaagagaa aaacatttta 3061 ttcaggcttt tattatacaa atagtaactt tttacaagat aaattagaaa aacagagctt 3121 tccaataata taagcagttt aaatagaaat gagccttttt ccagttgata tgataatgat 3181 gattaataat tactgaacat tggaaatgtg aagggtagca tgtcctttgc atatacatat 3241 ttgtagatgg ggtaagtcat catattcagc ctgttacctt ttgagtgttt tctctgttat 3301 aggtaatttt ccaaatgcat tacatacatt atctcttctt taatcctcaa agcaagtctc 3361 actattttta atctcactaa aatacaaata aatttagatt caaattatgc atgtcttagc 3421 tggattattg ctaaattgaa taatgtagga gattaacatt aattatagcc ctttgagagg 3481 ggagtgtttc ttcctaatat agtattttca tcattatcat cattttactt tagccagttg 3541 tggaaaggga gcttatcact gcgattgggt taagagaaca ggaaaagaat agatatttac 3601 tatgaagatt tttcattaca taatatttag atcaacatcg gtatatcttt aggaaaaggt 3661 actttcagct aattttttaa ttgtctcaac aataaaaatg tgtgaaatat ttttccatac 3721 taaggatttg ttttcttgac aataaaaaat caaatcctga acttccacat aacctgacta 3781 gtccttggat tggcagccct tgggtcaaac accctgcata acaacttgaa ccatcactag 3841 ctagagaaga caaatgcaag gctggtccca cagaagtgcc ttggaatttt ttgtttgttt 3901 cattggtgat atgaataaaa cttttaccgg catgttaagg gcaaaagcca ggttgaagag 3961 caaatgggaa gtgagaagtg gggttggcaa atttagaata ctctttcata gagctttgct 4021 ttgcaggaaa gaggtgaaaa ggaggtcaca acaacaaggg tattggtggt tgaaactgaa 4081 tcttttataa acagggaaat atcaatttat atttatttct caaatgatgt cttcgacctc 4141 tctgtacgtg ttaggtgaaa gcggcaagat tagaatcaag aggtcaaatc aacgggccct 4201 acagtaattt gggtaaaagt aaatgacaga ttagaatgaa agcaagaaag atgataagaa 4261 gtggttagat ttagaattca gggagtggga tgaagagctg ataggatagg gagaaaaaaa 4321 attgagttga acataaatga tagtgttttt ctgaagaaaa tgagctaatt aaatggcaga 4381 tattacttgc agaaagtaaa agagaagact gacagagcca ggtccctcag aaaacaatta 4441 taaagaaaaa gaaactgtct agctatcaat gactccatgg ttgtgtggcc agtaaatcag 4501 ggaagttcgt ttctaattgc ctgaaatatc ttggtgaagc aggagaaaag ccagtttggt 
4561 gggggtagat gagagctcag gatgaagtgg agaatgttta taaaattggc atcaacttgg 4621 aaaattacag gcctctgggg aagtgggaag agataatcta aggtaagata ctcaaaaaag 4681 gatttcaggc agcactgaag gacgctaact ttctgatggt acagtgcttg tgacatcctc 4741 tagcattgtt aagaacctgg aactaggaat ccaggagaaa gatggtttac cgtaatacta 4801 aagtttagga ttagcaaggg taatggccag acaaataggt gaagggactg agaaccacaa 4861 gaagtgaatg tagttttgtg ccctatataa accatctgct ccaagcaacc tggagttctt 4921 gccgtcagta gagtggaact tcattcttac cccagagcat ttgcatattt tccttctatt 4981 tggcatgctc cctccttatt ttgttatctt taagcctctt ttcaataact ccttatcaca 5041 gagacaatcc tttatcattt tcctatccag aactgatctc tttaaatccc taatgtattt 5101 gttccttgca ctacttttca gctattttta gatttttaat ggagatgggg tcttattatg 5161 ttgtccaggt tggtcttgaa ctgctggcct aaagctatct acctacctca gcctcccaaa 5221 gtgctgggat tacaggcatg agccaccata tccagcctca gctattttca tttctaaaaa 5281 ttgtactcta tgtaaaccag acctgagata agcaccaccc taggcacttt acaggtgata 5341 tttcacaata gatgtgacaa caaccgcagg agatgggtac ttttatgaac atcattttcc 5401 tgataagaaa aagagactca aaaaggattc tatactcacc ctatctctat gtgccagaag 5461 gagagctcaa acactggcat gaaatcccag acactcattc cattcatttc acttcccttt 5521 agatcacatt tatccccccc tgtattgtaa ttactcagcc tgtggattca tagaagacat 5581 gtttgtaccc ctggcaaact gccaggttga cattcatgac atgaatgaac cagctccttc 5641 attagctcat tgcacaaaat agttgctcac atgtgtttca caattaatta atgttcaaag 5701 cccagaacaa ccctgaactc ttgaattaga cataaaaaga cgtgtctttc ccatgctgca 5761 aaggcattta tttccatttg gaatttggaa tgacagactt cactacaggc caaggataag 5821 atttttgctc attatctatt ttttccagtc aatactatcc aagctttcat tgagagccta 5881 ggtattcaga gggcaaaaga gaaacatttt tctttgatac agaagttcat acatcataac 5941 ttagatacat ttaacaattt tctattgctc aataacaaat taccactaat tagtagcctt 6001 taaaaactca ttattatctc agtttctttg gccataactc tgagcttggc ttaggtggct 6061 tctctgctca gagtctcaca ggctgaaaag gtgtgtccat tgctgtattt tttttttctg 6121 cacctcagtg tcctttttta agcttgcatg gttgttggca ggattcagtc ccttggttgt 6181 tggactgaag cccctatttt tttgctgcct atgagccagg ggccactctc agctcctata 6241 
gccagtctac catgtacttc cacatggccc tcgccacaac acggcatctt gattcaaagc 6301 ccagaggaaa gcatttgctg cagcttcaaa tctctctgat tccttctcac attgtagaac 6361 ctgttttaaa agggctcacc tgataaggtc aggccagcat gtgccagcag ggagtagaaa 6421 tcttgaggcc atctcagagt tctgcttacc atagaaacca ttcaccctta tatttaactg 6481 tgaaatatct gatttaaaat tatttacttc tttgtttcca cagcctggtg agagtcaaag 6541 aggccccagg ggaccatatc cacctggacc actggctcct cctcctccac cacgttttcc 6601 ttttggaaca ggatttgttc caccacccca tcctccaccc tatggtccag ggagatttcc 6661 accacccctt tctccaccct atggtccagg gagaatccca ccatcccctc ctccacccta 6721 tggtccaggg agaattcaat cacactctct tcctcctcct tatggcccag gttatccaca 6781 gccaccttcc caaccaagac cctatccacc tggacctcca tttttccctg taaattctcc 6841 aactgatcct gccctcccta ctcttgcacc ctaaatacag acaactgcaa caggtgccac 6901 cacccacaaa agacaacact accctcgtaa ctactgcttc tactacccaa aaataagaat 6961 ttcaacacta cttccaagag acttttagat aaaatcactt ccatttttgg atgagaataa 7021 agatttccaa agcactgagc ttttgggaga aatatcttag aaattgtgaa acgatcccca 7081 tgaaccttta tatcagtagg ggaaaataaa gaattgagca acaatatgaa gtatccactg 7141 ttatcagagc caatagttta caccccagtt atgtcaccta aatgaatatt agtgctaaca 7201 g // LOCUS HS1D3HLH 2481 bp DNA PRI 21-APR-1995 DEFINITION H.sapiens Id3 gene for HLH type transcription factor. ACCESSION X73428 NID g313212 KEYWORDS early response gene; transcriptional factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2481) AUTHORS Deed,R.W., Hirose,T., Mitchell,E.L., Santibanez-Koref,M.F. and Norton,J.D. TITLE Structural organisation and chromosomal mapping of the human Id-3 gene JOURNAL Gene 151 (1-2), 309-314 (1994) MEDLINE 95129881 REFERENCE 2 (bases 1 to 2481) AUTHORS Deed,R. TITLE Direct Submission JOURNAL Submitted (18-JUN-1993) R. 
Deed, Paterson Institute for Cancer Research, Dept of Gene Regulation, Christie Hospital NHS Trust, Wilmslow Road, Manchester, M20 9BX, UK FEATURES Location/Qualifiers source 1..2481 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="peripheral blood" /cell_type="lymphocytes" /clone="cos 5 Hind III fragment" /clone_lib="pcos2embl" /map="1q36.1" /chromosome="1" variation 584 /note="g/a polymorphism" TATA_signal 636..640 exon 665..1038 /number=1 /evidence=experimental gene 739..1205 /gene="Id3" CDS join(739..1038,1146..1205) /gene="Id3" /codon_start=1 /evidence=experimental /db_xref="PID:g313213" /db_xref="SWISS-PROT:Q02535" /translation="MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDM NHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTA ELAPELVISNDKRSFCH" intron 1039..1145 /gene="Id3" /number=1 exon 1146..1219 /number=2 /evidence=experimental intron 1220..1739 /number=2 exon 1740..2190 /number=3 /evidence=experimental misc_feature 2123..2129 /note="immediate early motif" polyA_signal 2173..2178 BASE COUNT 521 a 662 c 699 g 599 t ORIGIN 1 agctttcttc ttttccctgt tgctcaaata aatagtgttc tttgctcaaa ccccctttcc 61 ctcctccttc tgcaatctca gcgcctagcg aaatctgttt tcttcattgt aacctcagct 121 tcaccgcaat taattttttt tccctctggt cacaagataa ttcctgacgc cagtgagtct 181 ggaggtcaga cgaacagcaa attggggaac aaggcggcac taattcctta caagttcctt 241 gaaaaatctt tcgcttaaaa aaaacggggg gtggggggag cttctttgct gttcagggat 301 ttatgcctcg cggagctgtg gctcgaacca gtgttggcta aggcggactg gcaggggcag 361 ggaagctcaa agatctgggg tgctgccagg aaaaagcaaa ttctggaagt taatggtttt 421 gagtgatttt taaatccttg ctggcggaga ggcccgcctc tccccggtat cagcgcttcc 481 tcattctttg aatccgcggc tccgcggtct tcggcgtcag accagccgga ggaagcctgt 541 ttgcaattta agcgggctgt gaacgcccag ggccggcggg ggcagggccg aggcgggcca 601 ttttgaataa agaggcgtgc cttccaggca ggctctataa gtgaccgccg cggcgagcgt 661 gcgcgcgttg caggtcactg tagcggactt cttttggttt tctttctctt tggggcacct 721 ctggactcac tccccagcat gaaggcgctg agcccggtgc gcggctgcta cgaggcggtg 781 tgctgcctgt 
cggaacgcag tctggccatc gcccggggcc gagggaaggg cccggcagct 841 gaggagccgc tgagcttgct ggacgacatg aaccactgct actcccgcct gcgggaactg 901 gtacccggag tcccgagagg cactcagctt agccaggtgg aaatcctaca gcgcgtcatc 961 gactacattc tcgacctgca ggtagtcctg gccgagccag cccctggacc ccctgatggc 1021 ccccaccttc ccatccaggt aagcctcgaa gtcgggacag ggctgaacac ccaggcaagg 1081 atgctgcggg accctcggag ctcccgattg cctcgcgtaa ctcttccctc ttttcctcta 1141 atcagacagc cgagctcgct ccggaacttg tcatctccaa cgacaaaagg agcttttgcc 1201 actgactcgg ccgtgtcctg acacctccag gtgagtatct cctctcttgg agagggaggt 1261 ttaaacggca agtcctggag ttggcagacg ttttgaaaaa ttgccactca ctcggtttag 1321 ggaaactgag gccagagagg gacaagtgac ttgcccatgg ttgcatcaaa tgaatggcag 1381 agtcagtttc catgtgatgt gcatttaagc cttaatgcgc ctggccctgc ctccgcagtg 1441 gccgaggtct ggcaagtaga catggtccga ctaaatacaa gtctttctgt tccatgttgt 1501 ataggagctg tcttcggcag ccccctccca gctagtgtca attccaagta ggaggggtag 1561 cgcaacgtcc gcctgtggtc tttggcgcca actgggtggg ggcagcgtgg ggggcggagt 1621 tatcaggctg gaggtacaga ccaagtttcc tccctggcgc cggccagtct gcggacggcc 1681 cccgcctcgg cacgctcggc ggaaactgac tgctccttgg tcttctttcc tcccccgccc 1741 agaacgcagg tgctggcgcc cgttctgcct gggaccccgg gaacctctcc tgccggaagc 1801 cggacggcag ggatgggccc caacttcgcc ctgcccactt gacttcacca aatcccttcc 1861 tggagactaa acctggtgct caggagcgaa ggactgtgaa cttgtggcct gaagagccag 1921 agctagctct ggccaccagc tgggcgacgt caccctgctc ccaccccacc cccaagttct 1981 aaggtctttt cagagcgtgg aggtgtggaa ggagtggctg ctctccaaac tatgccaagg 2041 cggcggcaga gctggtcttc tggtctcctt ggagaaaggt tctgttgccc tgatttatga 2101 actctataat agagtatata ggttttgtac cttttttaca ggaaggtgac tttctgtaac 2161 aatgcgatgt atattaaact ttttataaaa gttaacattt tgcataataa acgattttta 2221 aacacttgtg tatatgatga cacccgtctc cattaagtac taatgatgct ttctcgcaca 2281 tggccgaatt ttgggagctt tgggaaagtg aacttgctta ttctacgaga gggaaatgaa 2341 aaactgcctg gttgagaggg gatggggtgg agagagaagg gttcatgatg ggagtctcat 2401 gtccattgag ggatgggtgc agagaaaagt tctggctctg cctcattatt tcagagatga 2461 aaccagagac tggtgcaagc t 
// LOCUS HS2OXOC 2518 bp DNA PRI 30-JUN-1993 DEFINITION H.sapiens gene for 2-oxoglutarate carrier protein. ACCESSION X66114 S51486 NID g23843 KEYWORDS 2-oxoglutarate carrier; 2-oxoglutarate carrier protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2518) AUTHORS Walker,J.E. TITLE Direct Submission JOURNAL Submitted (04-JUN-1992) J.E. Walker, M.R.C. Lab of Mol Biology, Hills Road, Cambridge CB2 2QH, UK REFERENCE 2 (bases 1 to 2518) AUTHORS Iacobazzi,V., Palmieri,F., Runswick,M.J. and Walker,J.E. TITLE Sequences of the human and bovine genes for the mitochondrial 2-oxoglutarate carrier JOURNAL DNA Seq. 3 (2), 79-88 (1992) MEDLINE 93091249 COMMENT See also X66115. FEATURES Location/Qualifiers source 1..2518 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="placenta" exon <1..115 /number=1 prim_transcript <1..2518 mRNA join(<1..115,718..870,955..1161,1330..1420,1503..1693, 1773..1824,1951..2110,2365..2518) CDS join(21..115,718..870,955..1161,1330..1420,1503..1693, 1773..1824,1951..2106) /codon_start=1 /product="2-oxoglutarate carrier" /db_xref="PID:g23844" /db_xref="SWISS-PROT:Q02978" /translation="MAATASAGAGGMDGKPRTSPKSVKFLFGGLAGMGATVFVQPLDL VKNRMQLSGEGAKTREYKTSFHALTSILKAEGLRGIYTGLSAGLLRQATYTTTRLGIY TVLFERLTGADGTPPGFLLKAVIGMTAGATGAFVGTPAEVALIRMTADGRLPADQRRG YKNVFNALIRITREEGVLTLWRGCIPTMARAVVVNAAQLASYSQSKQFLLDSGYFSDN ILCHFCASMISGLVTTAASMPVDIAKTRIQNMRMIDGKPEYKNGLDVLFKVVRYEGFF SLWKGFTPYYARLGPHTVLTFIFLEQMNKAYKRLFLSG" intron 116..717 /number=1 exon 718..870 /number=2 intron 871..954 /number=2 exon 955..1161 /number=3 intron 1162..1329 /number=3 exon 1330..1420 /number=4 intron 1421..1502 /number=4 exon 1503..1693 /number=5 intron 1694..1772 /number=5 exon 1773..1824 /number=6 intron 1825..1950 /number=6 exon 1951..2110 /number=7 intron 2111..2364 /number=7 exon 2365..2518 /number=8 BASE COUNT 449 a 745 c 736 g 588 t 
ORIGIN 1 ccgagggcca ttgagtggcg atggcggcga cggcgagtgc cggggccggc gggatggacg 61 ggaagccccg tacctcccct aagtccgtca agttcctgtt tgggggcctg gccgggtaag 121 ctcaggcccc agggatggga aaagaggagg aagggaaggg caaaaggaac cgggcccggt 181 tcatctctgg gtctgttgca gtgaactgtg ccggcacggg gtcagcgcga tctccagcgt 241 attgcgagtt ccccgatggg ctgctagggc cccattgcag gacccttatg gactgcatgc 301 agggtcgttg cacaaccccc gccggccagc tgagtttccc agcgccattg ctaagctcgg 361 ccgggcggca gggggttcca ggccattaaa tggctcctgc aggactacct ggagtcctaa 421 atcttggatg ccccttcctc ccctccctgt gtcggactgc gtgcagcccc aggtcactgc 481 ttgactgtca tgaagctgct tggagcactt tcattgcatg gcccctactg gacaactaga 541 cttttcagac cgttgcactt ggtgtacata tagactctta tacacatgct caccagcatc 601 ctgattcttg aatcgcatgg caggccctga cctttgtaac ctcttgctgc caggctccag 661 tgggcactgg tagatctgga atgcctggcc cttcactcca cctctgtctt cccccaggat 721 gggagctaca gtttttgtcc agcccctgga cctggtgaag aaccggatgc agttgagcgg 781 ggaaggggcc aagactcgag agtacaaaac cagcttccat gccctcacca gtatcctgaa 841 ggcagaaggc ctgaggggca tttacactgg gtattggggc ctcaggatgg agggtagact 901 gtgggttggc agctctagac cttggcctga tatgctgacc cctctgctcc tcaggctgtc 961 ggctggcctg ctgcgtcagg ccacctacac cactacccgc cttggcatct ataccgtgct 1021 gtttgagcgc ctgactgggg ctgatggtac tccccctggc tttctgctga aggctgtgat 1081 tggcatgacc gcaggtgcca ctggtgcctt tgtgggaaca ccagccgaag tggctcttat 1141 ccgcatgact gccgatggcc ggtgagttcc aagtctgaac ctaaccccca gcccccattc 1201 cctcgaatct agaatgaaag gaccagtttt ttatcccgga tctggagcag agtgtttacc 1261 tgttctggcc ttcctttctt tgcacctgtg ggcaatcctg atcttgacct ctttctgcct 1321 tcccaccagg cttccagctg accagcgccg tggctacaaa aatgtgttta acgccctgat 1381 tcgaatcacc cgggaagagg gtgtcctcac actgtggcgg gtgagtggag gggctggaga 1441 cttgggggct gtgaggtctg gttttagggg tttctgactc ttccccgctc ccctccctcc 1501 agggctgcat ccctaccatg gctcgggccg tcgtcgtcaa tgctgcccag ctcgcctcct 1561 actcccaatc caagcagttc ttactggact caggctactt ctctgacaac atcctgtgcc 1621 acttctgtgc cagcatgatc agcggtcttg tcaccactgc tgcctccatg cctgtggaca 1681 ttgccaagac 
ccggtgagtg tgcagcctgg gcctggaggt gggtgggagg gtgccctttc 1741 ggtctctcat gcccctgcgt cctctcctgc agaatccaga acatgcggat gattgatggg 1801 aagccggaat acaagaacgg gctggtgagg aagccattct gggggcctgg gagggggtgt 1861 cggatcccct gggaaaggtg tgaaggcagc tgggtgaggg ggatggggac ctagagtcct 1921 agccccagcg ccctgcctgt ctgtgtctag gacgtgctgt tcaaagttgt ccgctacgag 1981 ggcttcttca gcctgtggaa gggcttcacg ccgtactatg cccgcctggg cccccacacc 2041 gtcctcacct tcatcttctt ggagcagatg aacaaggcct acaagcgtct cttcctcagt 2101 ggctgaagcg gccgggggct cccactcgcc tgctgcgcct atagccactg cgccctgggg 2161 gcctgggctc tgctgccctg gacccctcta tttatttccc ttccacagtg tgtttcttcc 2221 tctgcggtaa aggacttggt ctgttctacc ccctgctcca gcttgccctg ctcgctcctg 2281 atcctgtgat ttctctgtcc ttggctattc ttgcagggag ctggaaaact tcctgaggat 2341 ttctggcctc cccctgggtt ttagtttcag ggcacacagg acagcagaag atcccctttg 2401 tcagtgggga aaccaaggca gagctgaggg gacagggagg agcagaagcc atcaagatgg 2461 tcaaagggcc tgcagaggga gatgtggcct tcctccccct cattgaggac tcaataaa // LOCUS HS370M22 143747 bp DNA PRI 16-DEC-1997 DEFINITION Human DNA sequence from PAC 370M22 on chromosome 22q12-qter. contains GRB2 ADAPTOR LIKE PROTEIN, UBIQUINOL-CYTOCHROME C REDUCTASE IRON-SULFUR SUBUNIT PRECURSOR (UQCRFS1) exon, ESTs, STS, CA repeat and CpG island. ACCESSION Z82206 NID g2677628 KEYWORDS 22q12-qter; CpG island; GRB2; repeat polymorphism; UQCRFS1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 143747) AUTHORS Burgess,J. and Whiteley,M. TITLE Direct Submission JOURNAL Submitted (09-DEC-1997) sanger.ac.uk/HGP/Chr22/) Sanger Centre, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is not the entire insert of clone 370M22. It may be shorter because we only sequence overlapping sections once, or longer because we arrange for a small overlap between neighbouring submissions. 
During sequence assembly data is compared from overlapping clones. Where differences are found these are annotated as variations together with a note of the overlapping clone name. Note that the variations annotated may not be found in the sequence submission corresponding to the overlapping clone as we submit sequences with only a small overlap as described above. This sequence was generated from part of bacterial clone contigs of human chromosome 22, constructed by the Sanger Centre chromosome 22 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/ This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone 370M22 is at 1 in this sequence. The true left end of clone 140N12 is at 69666. The true left end of clone 496C20 is at 143644. 370M22 is from the library RPCI3 constructed at the Roswell Park Cancer Institute by the group of Pieter de Jong. For further details see http://bacpac.med.buffalo.edu/. 
FEATURES Location/Qualifiers source 1..143747 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q12-qter" /clone="370M22" /clone_lib="RPCI3" repeat_region 1093..1122 /note="15 copies of 2 mer 90 % conserved" repeat_region 1495..1797 /note="AluSp repeat: matches 1..303 of consensus" repeat_region 2171..2375 /note="AluJo repeat: matches 100..302 of consensus; incomplete repeat" repeat_region 2443..2519 /note="MIR repeat: matches 191..260 of consensus" repeat_region 2664..2966 /note="AluSp repeat: matches 1..303 of consensus" repeat_region 3146..4279 /note="TIGGER1 repeat: matches 2418..1298 of consensus" repeat_region 4273..5430 /note="TIGGER1 repeat: matches 1172..1 of consensus" repeat_region 6761..7062 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 7771..8069 /note="AluSc repeat: matches 1..299 of consensus" repeat_region 8289..8692 /note="MLT1A1 repeat: matches 3..365 of consensus" repeat_region 8919..8993 /note="MIR repeat: matches 64..139 of consensus" repeat_region 10022..10149 /note="MIR2 repeat: matches 1..131 of consensus" repeat_region 10152..10411 /note="AluSx repeat: matches 40..299 of consensus; incomplete repeat" repeat_region 11784..11920 /note="MIR2 repeat: matches 146..5 of consensus" repeat_region 12411..12704 /note="AluY repeat: matches 293..1 of consensus" repeat_region 13230..13305 /note="AluSp repeat: matches 218..293 of consensus; incomplete repeat" repeat_region 14003..14383 /note="L1 repeat: matches 5005..5390 of consensus" repeat_region 14238..15135 /note="L1PA15 repeat: matches 1..904 of consensus" repeat_region 15486..15509 /note="6 copies of 4 mer 100 % conserved" repeat_region 15511..15810 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 16405..16692 /note="AluJo repeat: matches 13..302 of consensus" repeat_region 16693..16724 /note="8 copies of 4 mer 97 % conserved" repeat_region 16879..16980 /note="MIR repeat: matches 188..74 of consensus" repeat_region 18353..18613 /note="AluJo 
repeat: matches 299..1 of consensus" repeat_region 19394..19747 /note="THE1B repeat: matches 364..1 of consensus" repeat_region 19832..19904 /note="7SK repeat: matches 193..121 of consensus" repeat_region 19942..20004 /note="7SK repeat: matches 124..62 of consensus" repeat_region 20085..20281 /note="L1 repeat: matches 5190..5387 of consensus" repeat_region 20137..20812 /note="L1MB8 repeat: matches 1..729 of consensus" exon <20814..21617 /note="match: P47985" /product="UQCRFS1" misc_feature 21614..21904 /note="match: STS G13234" repeat_region 22036..22213 /note="L1ME2 repeat: matches 710..902 of consensus" repeat_region 22368..22434 /note="MIR2 repeat: matches 144..76 of consensus" repeat_region 24469..24679 /note="L1 repeat: matches 5137..5350 of consensus" repeat_region 24576..24682 /note="L1MC1 repeat: matches 5..113 of consensus" repeat_region 24683..24984 /note="AluY repeat: matches 1..301 of consensus" repeat_region 24986..25447 /note="L1MC3 repeat: matches 100..572 of consensus" repeat_region 25448..25719 /note="AluJo repeat: matches 1..267 of consensus; incomplete repeat" repeat_region 25728..26165 /note="L1MC2 repeat: matches 639..1077 of consensus" repeat_region 26374..26597 /note="MIR repeat: matches 1..234 of consensus" repeat_region 27032..27165 /note="MIR2 repeat: matches 128..2 of consensus" repeat_region 28225..28525 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 29230..29375 /note="AluY repeat: matches 155..301 of consensus; incomplete repeat" repeat_region 29383..29450 /note="L1 repeat: matches 4369..4440 of consensus" repeat_region 29493..29659 /note="L1 repeat: matches 4425..4590 of consensus" repeat_region 29682..29981 /note="AluSx repeat: matches 299..1 of consensus" repeat_region 30073..30758 /note="L1 repeat: matches 4687..5390 of consensus" repeat_region 30616..31458 /note="L1MB5 repeat: matches 1..896 of consensus" repeat_region 31493..31559 /note="L1ME1 repeat: matches 844..910 of consensus" repeat_region 31583..32083 
/note="L1MC2 repeat: matches 561..1077 of consensus" repeat_region 32716..32876 /note="FAM repeat: matches 161..1 of consensus" repeat_region 33765..33935 /note="AluJo repeat: matches 302..135 of consensus; incomplete repeat" repeat_region 33936..34227 /note="AluSg repeat: matches 292..1 of consensus" repeat_region 34229..34354 /note="AluJb repeat: matches 143..19 of consensus; incomplete repeat" repeat_region 35317..35619 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 35620..35667 /note="12 copies of 4 mer 88 % conserved" repeat_region 36768..36899 /note="MIR2 repeat: matches 1..135 of consensus" repeat_region 36978..37103 /note="FLAM_A repeat: matches 130..5 of consensus" repeat_region 37123..37421 /note="AluSx repeat: matches 298..1 of consensus" repeat_region 37432..37484 /note="FLAM repeat: matches 126..72 of consensus" repeat_region 38450..38534 /note="MIR2 repeat: matches 146..60 of consensus" repeat_region 39764..40023 /note="L1PA2 repeat: matches 584..843 of consensus" repeat_region 41080..41162 /note="MIR repeat: matches 206..119 of consensus" repeat_region 41173..41227 /note="MIR2 repeat: matches 146..87 of consensus" repeat_region 43019..43321 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 43807..43939 /note="MIR2 repeat: matches 11..145 of consensus" repeat_region 44033..44334 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 45203..45499 /note="AluSq repeat: matches 1..294 of consensus" 5'UTR join(46642..46832,92595..92608) /note="GRB2L" repeat_region 48291..48350 /note="MIR repeat: matches 159..216 of consensus" repeat_region 48870..48934 /note="AluSx repeat: matches 1..65 of consensus; incomplete repeat" repeat_region 48978..49278 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 49337..49391 /note="MIR2 repeat: matches 139..85 of consensus" repeat_region 49611..49909 /note="AluSx repeat: matches 2..302 of consensus" repeat_region 50022..50650 /note="MER42c repeat: matches 1409..774 of 
consensus" repeat_region 50893..51101 /note="MIR repeat: matches 12..225 of consensus" repeat_region 51108..51321 /note="MIR repeat: matches 236..21 of consensus" repeat_region 52992..53261 /note="AluJb repeat: matches 36..300 of consensus; incomplete repeat" repeat_region 53518..53605 /note="MIR repeat: matches 56..144 of consensus" repeat_region 55909..56208 /note="AluSg repeat: matches 300..1 of consensus" repeat_region 57291..57440 /note="MER5A repeat: matches 11..183 of consensus" repeat_region 57641..57933 /note="AluJo repeat: matches 1..292 of consensus" repeat_region 57982..58157 /note="MER20 repeat: matches 29..214 of consensus" repeat_region 58413..58713 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 60409..60710 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 64499..64657 /note="AluJo repeat: matches 137..302 of consensus; incomplete repeat" repeat_region 65359..65466 /note="MIR repeat: matches 220..105 of consensus" misc_feature 66394..66519 /note="match: Z16533 STS containing (CA) repeat" repeat_region 66446..66491 /note="26 copies of copies of AC 100% conserved; differs from Z23853" repeat_region 66706..66905 /note="MIR repeat: matches 37..225 of consensus" repeat_region 67304..67432 /note="AluJo repeat: matches 302..176 of consensus; incomplete repeat" repeat_region 68263..68558 /note="AluSx repeat: matches 1..298 of consensus" repeat_region 69006..69104 /note="MIR2 repeat: matches 1..95 of consensus" repeat_region 69921..70217 /note="AluSx repeat: matches 1..296 of consensus" repeat_region 71545..71681 /note="MIR2 repeat: matches 2..144 of consensus" repeat_region 74632..74758 /note="MER42B repeat: matches 1143..1273 of consensus" repeat_region 74763..74794 /note="16 copies of 2 mer 88 % conserved" repeat_region 75455..75724 /note="MER33 repeat: matches 45..322 of consensus" repeat_region 75823..75989 /note="MIR repeat: matches 43..213 of consensus" repeat_region 76591..76671 /note="MIR2 repeat: matches 59..142 of 
consensus" repeat_region 77931..78235 /note="AluJb repeat: matches 1..299 of consensus" repeat_region 78247..78492 /note="MIR repeat: matches 260..14 of consensus" repeat_region 79061..79361 /note="AluSq repeat: matches 1..302 of consensus" repeat_region 80218..80390 /note="AluJb repeat: matches 128..300 of consensus; incomplete repeat" repeat_region 80677..80720 /note="11 copies of 4 mer 82 % conserved" repeat_region 82166..82211 /note="23 copies of 2 mer 83 % conserved" misc_feature 82463..82669 /note="match: STS G04359" repeat_region 83187..83304 /note="MIR repeat: matches 153..28 of consensus" repeat_region 84783..85083 /note="AluY repeat: matches 1..301 of consensus" repeat_region 85166..85424 /note="AluSx repeat: matches 37..298 of consensus; incomplete repeat" repeat_region 85868..85901 /note="17 copies of 2 mer 88 % conserved" repeat_region 86665..86720 /note="MIR2 repeat: matches 145..94 of consensus" repeat_region 88004..88295 /note="AluJo repeat: matches 1..286 of consensus" repeat_region 89221..89506 /note="AluSc repeat: matches 299..1 of consensus" repeat_region 91361..91662 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 91663..91808 /note="MIR repeat: matches 204..51 of consensus" CDS join(92609..92686,101321..101412,105557..105676, 111492..111660,116407..116586) /codon_start=1 /product="GRB2L" /db_xref="PID:e1216623" /db_xref="PID:g2695869" /translation="MEAVAKFDFTASGEDELSFHTGDVLKILSNQEEWFKAELGSQEG YVPKNFIDIQFPKWFHEGLSRHQAENLLMGKEVGFFIIRASQSSPGDFSISVRHEDDV QHFKVMRDNKGNYFLWTEKFPSLNKLVDYYRTNSISRQKQIFLRDRTREDQRVRWARA LYDFEALEDDELGFHSGEVVEVLDSSNPSWWTGRLHNKLGLFPANYVAPMTR" repeat_region 93459..93527 /note="3 copies of 23 mer 100 % conserved" repeat_region 94779..94999 /note="MIR repeat: matches 259..30 of consensus" repeat_region 97527..97828 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 98339..98435 /note="MIR repeat: matches 221..130 of consensus" repeat_region 99117..99414 /note="AluJo repeat: matches 295..1 of 
consensus" repeat_region 99908..100196 /note="AluSx repeat: matches 292..1 of consensus" repeat_region 100581..100878 /note="AluSc repeat: matches 3..299 of consensus" repeat_region 101659..101958 /note="AluJb repeat: matches 301..1 of consensus" repeat_region 102001..102270 /note="AluSx repeat: matches 294..1 of consensus" repeat_region 103166..103451 /note="AluJo repeat: matches 12..296 of consensus" repeat_region 103915..104060 /note="MIR2 repeat: matches 1..146 of consensus" repeat_region 106167..106445 /note="MER39 repeat: matches 16..298 of consensus" repeat_region 106456..106715 /note="AluJo repeat: matches 301..41 of consensus; incomplete repeat" repeat_region 106791..106861 /note="MER39 repeat: matches 434..511 of consensus" repeat_region 107643..107938 /note="AluJo repeat: matches 1..294 of consensus" repeat_region 108163..108468 /note="AluSx repeat: matches 297..3 of consensus" repeat_region 109238..109526 /note="AluY repeat: matches 1..290 of consensus" misc_feature 109482..109984 /note="match: STS G33279" repeat_region 110452..110563 /note="MIR repeat: matches 33..152 of consensus" repeat_region 110614..110894 /note="AluSx repeat: matches 5..302 of consensus" repeat_region 111789..111931 /note="MIR repeat: matches 48..201 of consensus" repeat_region 112249..112463 /note="MIR repeat: matches 38..256 of consensus" repeat_region 112483..112516 /note="17 copies of 2 mer 91 % conserved" repeat_region 112884..112934 /note="MIR repeat: matches 181..231 of consensus" repeat_region 112909..112973 /note="MIR2 repeat: matches 84..141 of consensus" repeat_region 113961..114026 /note="33 copies of 2 mer 88 % conserved" repeat_region 114588..114706 /note="MIR2 repeat: matches 17..146 of consensus" repeat_region 115134..115436 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 115619..115882 /note="AluJo repeat: matches 299..1 of consensus" repeat_region 116142..116265 /note="AluJo repeat: matches 2..125 of consensus; incomplete repeat" 3'UTR 
116581..116730 /note="GRB2L" repeat_region 117479..117543 /note="L1 repeat: matches 3836..3777 of consensus" repeat_region 117873..118004 /note="3 copies of 44 mer 80 % conserved" prim_transcript <118442..>118798 /note="match: EST F21127" repeat_region 120615..120656 /note="21 copies of 2 mer 83 % conserved" repeat_region 120657..120958 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 121085..121146 /note="AluSc repeat: matches 258..194 of consensus; incomplete repeat" repeat_region 121156..121268 /note="MIR2 repeat: matches 30..146 of consensus" repeat_region 122626..122759 /note="MER3 repeat: matches 2..132 of consensus" repeat_region 123266..123564 /note="AluSg repeat: matches 1..299 of consensus" repeat_region 123607..124175 /note="L1MC1 repeat: matches 30..621 of consensus" repeat_region 124185..124490 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 124491..124962 /note="L1MC1 repeat: matches 604..1079 of consensus" misc_feature 124959..125267 /note="match: STS L18079" repeat_region 125089..125144 /note="14 copies of 4 mer 91 % conserved" repeat_region 126470..126596 /note="MIR repeat: matches 9..142 of consensus" repeat_region 126741..126779 /note="FLAM repeat: matches 86..124 of consensus" repeat_region 126782..127076 /note="AluSx repeat: matches 296..1 of consensus" repeat_region 127094..127271 /note="AluJo repeat: matches 131..302 of consensus; incomplete repeat" repeat_region 127284..127582 /note="AluSg repeat: matches 1..299 of consensus" repeat_region 127650..127784 /note="AluSq repeat: matches 1..135 of consensus; incomplete repeat" repeat_region 127785..128084 /note="AluSq repeat: matches 1..301 of consensus" repeat_region 128086..128260 /note="AluSg repeat: matches 126..300 of consensus; incomplete repeat" repeat_region 128261..128288 /note="14 copies of 2 mer 100 % conserved" repeat_region 128291..128587 /note="AluSx repeat: matches 1..297 of consensus" misc_feature 129677..129859 /note="match: exon trap H55317 clone 
C22_318" repeat_region 129700..129813 /note="MIR repeat: matches 25..151 of consensus" repeat_region 129961..130296 /note="AluSc repeat: matches 2..299 of consensus" misc_feature 131684..132022 /note="match: Z23853 STS containing (CA) repeat" repeat_region 131855..131904 /note="25 copies of AC 100% conserved; differs from Z23853" repeat_region 132088..132357 /note="AluSq repeat: matches 33..303 of consensus; incomplete repeat" repeat_region 132662..132851 /note="AluJo repeat: matches 111..296 of consensus; incomplete repeat" repeat_region 132903..132948 /note="23 copies of 2 mer 94 % conserved" repeat_region 133113..133404 /note="AluSc repeat: matches 1..289 of consensus" repeat_region 133407..133704 /note="AluSq repeat: matches 4..303 of consensus" repeat_region 135744..136040 /note="AluSq repeat: matches 1..298 of consensus" repeat_region 136951..137099 /note="MIR repeat: matches 2..151 of consensus" repeat_region 137100..137262 /note="AluJo repeat: matches 300..126 of consensus; incomplete repeat" repeat_region 137557..137680 /note="FLAM_A repeat: matches 126..1 of consensus" repeat_region 137692..137832 /note="MIR repeat: matches 152..11 of consensus" repeat_region 137852..137950 /note="MER5A repeat: matches 11..109 of consensus" repeat_region 138055..138159 /note="MIR repeat: matches 192..98 of consensus" repeat_region 138428..138467 /note="MIR2 repeat: matches 97..136 of consensus" repeat_region 138516..138799 /note="AluSg repeat: matches 13..298 of consensus" repeat_region 139351..139430 /note="MIR repeat: matches 60..145 of consensus" misc_feature 139983..140988 /note="Putative CpG island" repeat_region 141614..141666 /note="MIR2 repeat: matches 124..71 of consensus" repeat_region 142475..142694 /note="MIR repeat: matches 249..2 of consensus" repeat_region 143445..143510 /note="MIR repeat: matches 149..85 of consensus" BASE COUNT 41994 a 31978 c 30808 g 38967 t ORIGIN 1 gatcattttt gtaataactg ttgcctctac aaaaatggcc tgatgcttag agctgagtta 61 taattgtcct 
ttttgtattg ttctgtcttg ccatcaagca gcaatgatcc ttcagtccat 121 ctcttccttc aaacttcaaa agcatcgaaa caaaaaccac ccaggctatc tactagaatt 181 aaaacttggg aaaacatcca agaaaagaaa aaaaaaaaga agtagaagtg gtgatgtgta 241 tataacacct tctttaaagt gcctcttttc ccttgctgat tcaccataaa gaagggaagc 301 acagctgttt cagctctagg tcctaaaggc tggaaatcac atttcctgca ttcaatttta 361 catttcactg taactaaata acttagagac atctttagac tccttctaat gaaaatttac 421 ctgtatgata tcatcacttg cctccatttt ctctaatgca aatgttagtc aatttcaatt 481 tctctcttct ccagtacatc atccacttac ccccattgcc tctctacagt ctatctttaa 541 acactactgt ttcctcattt ctataaccct aggagcaggt cctggatgcc tggattgaaa 601 caactgtctc ttacaagctt cgttcaattt gtatctcctt catttctaca caaagcaata 661 agaattatct tccacaaata gctgttttca gtaataagga agctcagttc tgtgactaag 721 ctaaacattc cttctgttct ccaaggccta caatggttcc tagttgctac tcatatcaaa 781 ctgagtttat tagctcattt atgaattgct ctgagtactc tcagctatct acccctactg 841 ttttacatat tcaaatttat atttttattc catttagtca gacaactctg tattaccatg 901 cttaatctgt gtcttgcttg tattctttac cattattgta gaaatgattt gtggtcccga 961 gcatttattc tctccatctt ccttaataac agaatctctt aactttcatc tgggcacatg 1021 atatttctca acctcctttg tagctaggag tgtcagtggg actaagtact aaacaataat 1081 atgtgacttg ttaaaaaaaa aaaaaagaag aaagaaaaaa aagggaagag acattcatat 1141 tttcaatctc ccttcctttc tcctggctag aaggcagatg tgatggctgg agctcaagca 1201 gcaatgaatc aaaaagtgga agtggatcat gaagtggaac ttaaacgctg agtttggtgg 1261 agcaatgaga ctcaaaaata gccctgaagc caacagtgta gagagccata tcagtcatag 1321 aacacctaca agagagtatt tcagaagaga gaaataaatt tgtatgttta agctgctagc 1381 atttttgtgg cctctactat cataacttgt ggatgcacct ccttctaact aatctaccta 1441 tctttgagtt ttaatttgct acttctcctg tcccattaag aatgtccccc tcgtggccgg 1501 gcgcggtggc tcaacgcctg taatcccagc actttgggag gctgaggcgg gcagatcacc 1561 tgaggtcggg agttcaagac tagtctgacc aacatggaga aaccccgtct ctactaaaaa 1621 tacaaaatta gccgggtatg gtggcatgtg cctgtaatcc cagatactgg ggaggctgag 1681 gcaggagaac cacttgaacc tgggaggcag aggttgcggt gagccaagat tgcaccattg 1741 cactccagcc tgggcaacaa gagtgaaact 
tcatctcaaa aaaaaaaaaa aaaaaaaaag 1801 aatgtcccct caatttctta ttttttccca ccccaactcc aaactgtcag ttttctttct 1861 tcctttaaaa tacagcttca gattcaaatg cctacatgga attggttagt attgccaagt 1921 ttacttaatc caattactct tttttgaaca caaacacaga catctgtata tctaggatgt 1981 actcatatag aaagcagata aatgaaataa aaataacaga tttaaaacaa atttactata 2041 gattttacat ccaaacattc actgttcgct ttaaatggta ctcttactat agatatacaa 2101 taatgatgcc atcattgact aaagcttttt tagaacatct tttttgtgaa acatacttaa 2161 aagcacatgg tagcaagatc ttgtctcgaa aaaatatttt ttcaagtcag ctgggtgtgg 2221 cggtgtgtgc ctgtagtccc agctagttgg taggctaagc caggaggatt acttgagtcc 2281 agtagttcaa ggctgcagtg agtcattatc acaccactgc gctcctgccg gggtgacaga 2341 gcaagaccct gtctcaaaaa aaaaaaaaaa agaaagaaag aaaaggaaaa gaaaaagcac 2401 acagtacatc acaagctata gcctccaata gagcagaaat acagtcaatt tacttaacag 2461 gcttagaaca atgtctggca ttatagtgag gagacactca atatgtgttt gtcgtattaa 2521 aaaataataa attgtagcaa aacttcattt gaaagtggat tcactttttt aaaatagtca 2581 agtcattcat agctcagtgt agtgaatgag acatgtcata aagatggaga atatcatttt 2641 aggtttaaaa acgcaatctg tggggctggg cgcagtggct catgactgta atcccagcac 2701 tttgggaggc tgaggcaggc agatcatctg aggtcaggag ttcgagacca ccctgaccaa 2761 catgaaggaa ccccgtctct acttaaaata caaaattagc caggcgtgat ggcgtatgcc 2821 tgtaatccca gctactctgg aaggctgagg caggagaatc gcctgaaccc tggaggcaga 2881 ggttgcagtg agccgagatg gtgccattgc tctccagcct ggacagcaag agtgaaactc 2941 cgtctcaaaa aaaaaaaaaa aaaaaaaaaa aaaagcaatc tgtgactaca aaacaataaa 3001 ctagaatttc ttgcatgaat cataaaatca gagttgggtt acttctgtat caatttaggg 3061 attcatttct taagcttacg attcaccagg ccaagtttgc tccagtttca gagcatctac 3121 cctgctgttc cctctgcctg atatacaggc acaccttgaa gatactgcgg tgggctgggt 3181 tccagattac tgcaataaag caaatattcc aataaagcca ctcacacaaa catttttgtt 3241 tcccagtgca tataaaagtt atgtttacat tagactgcag tctacttagt agactaagta 3301 tgcaatacca tgctgcctaa aaaatatata tacataatta aataatactt tattgctaat 3361 ttaaaaaggc taacaatcat ctgagccttc agcgagtctt catctttttg ctggtggagg 3421 gtcttgtctc catgttgagg gctgctgata gatcagggtg 
gttgctgctg aaggttggga 3481 aagctgtggc aattttttaa agtaagacaa caataaagcc tgcaacatca actgactctt 3541 cctttcacaa aagatttatc tgtagcatgt gatgctgttt gatagcattt tacccacagt 3601 agaacttctt tcaaaattgg agtcaatcct ctctaatcct gccactggtt tatttactaa 3661 gtttatgtaa tattctgaat tctttgttgg cattgcaaca atgttcacag cttcttcaca 3721 ggagtagatt tcaactcaag aaaccatttt ctttgctcat ccataagaag tatctcctca 3781 tctgttcaag tttgatcatg agattgcagc aatccagtca catcttcagg ctccacttct 3841 aattctagtt ctcttgctat ttccgccaca tctgcagttc cttcctccac tgaagtcttg 3901 aacccctcaa agtcatccat aagggttgga atccacttct ttcaaactcc tgttaatgtt 3961 gaaattttgg cttcctccca tgaatcacaa atgttcttaa tggcatctag agtggtgaat 4021 cttttccaga aggtttttaa tttactttgc tcagatccat cagaggaatc actatctatg 4081 gaagatatag ctttatgaaa tgtatttctt cagtaataag acttgaaagt tgaaatcact 4141 ccttgatcca tgggctgaag aataaacact gtgttttagc aggcacgaaa acaatattaa 4201 tctccttgta catctccatc agagctcttg ggtgaccagg tgcactgtca atgagcagta 4261 atatttttaa agaaatctta agggcactag gattttggga atggtaaagg agcagttggc 4321 ttcaacttaa agtcaccagt tgcattagtc cctaacaaga gagtcagcct gtcctttgaa 4381 gctttgaagc caggcattga cttctcctct ctagctatga gagtcctaga tggcatcttt 4441 ttccaacaaa aggctatttc acccacattg aaaatctatt tagtgtagcc cccttcatca 4501 attaccttag ctagatcttc tggataactt gctacagctt ctccatcagt acttgctgct 4561 tcactttgta cttgtgtgtt atggtgacag ttgctttcct tgaacctcat gaacccacct 4621 ctgctagctt caagattttc ttctgcagtt tcctcttctc tctcagtctt tgcagaatta 4681 aagacagtta gaaccttgct ctggattagg ctttggctta agtgataatg gggctagttg 4741 gatcttctat ccagaccatg caacctttct ctataacagc aataaagctg tttcccattc 4801 ttatcattcc cgtattcatt ggagcagcac ttctaatttc cttccagaac ttttccttta 4861 cattcataac ttagctaact gtttggcaca agaggcctac ctttcagcct gtttcggctt 4921 tcaacatgcc tttctcacta agtttaacca tttctagctt ttgaattaaa gtgagaaacc 4981 tgcgactctt ccttacactt gaacacttag aggccattgt agggttatta attggcttaa 5041 tttcaatatt gttgaccctc agggactagg gagacccaag gagagagaga gagatcggga 5101 tcagtgaagc agtcagaaca cacatgacat ctatcagtaa gttcactggt 
cttctacagc 5161 catggtttgt ggtaccccaa aacaattaca acagtaacat ctaagatcac tgatcataga 5221 tcaccatgac acatataata ataaggaaaa agttaaaaat attgcaagaa ctaccaaaat 5281 gtgacacaga cacaaagtat gcacatgcta tcagaaaaat ggtgccaata aacttcttgg 5341 gtacaggatt gccacaaacc ttcaatttgt aaaatatacg ttatctgtga agtgtaatta 5401 agcaaagtgc aatcgatcga ggtatgcctg tatgtgggtt gctccctcac ttcattcagg 5461 tctgtttgaa gattcacttc ctccaaaagg ccctctctca ttacgtgacc taaaataaac 5521 ccacccctgt cactctgttc cctgaccctg ctttcctttt ctttacagga cttgttatta 5581 tctgacatat tttatgcttg tgacagagac atcccagtct ctaccactag aatgttagtt 5641 ccatgataaa agaaaactta ttttactcac tcctatagtt ccaaatttct gctgaatatc 5701 caaattttaa ataaattcaa taattatgtg tttacaatga aagaagaaag aactgacaaa 5761 agaaaggaag aaggaagagt aagaaaggta acatcgaaga agcaaaagtg gggaagaaac 5821 tctggatgag tcataattgg ttattcttct ctgttactct gtttttagca tcagtttgat 5881 ttaaaggctc ccatgagaaa atttcagaat gagtggctgt ctctttcgga gatttatttc 5941 aaaagaatca gatcccctaa agatcccacc acttttagaa gccggtaggt agtatttgct 6001 actataatta caatggctac cctcttataa tgtctctctc caatcccaag tgtggatgtt 6061 cactctatgt ctcactgggt gaacgtttga aactcttgat taagacagaa aagtacctga 6121 atgattttta gcaagcttaa cttgtgttgg tattactagt aacattgtta catttttgca 6181 cttaactgac agttggaatt atctatttgt actgaataca aagggacagt tctctaagtc 6241 acatttcact aatgtttttg cttcctcaag ctgtccattt ctcatacaca acttatcaac 6301 agggtttcta ctggagaagt tagagcaaaa tgacattttt taaagtttta gcaaatttgg 6361 tagaaaagat gatagcattt ttctactcat gtctattata acattataaa taaatccctc 6421 tgttataaaa gttcagtagt ccatatttaa agttttaagc aaaggaaagg gtaaaaggta 6481 ttgtcctaga ataattttag agcactttta ggattctgaa gcatacttta gaaaatttca 6541 attacttgat ttatgttaca ggatcaaaga gacctcatac gcacataact agacacctag 6601 tcttagaaaa ctgtactcct tcactttcca gttaaggata aaaaacttaa aattgaatca 6661 cgatgccaaa tagggtagtc actagtttaa taccttaaaa gctcaaaaac attttatttg 6721 aatcaaagat aactgcattc agaaaacagt cataaaactg ggctgggcac agtggctcac 6781 gcctataatc ccagcacttt cggaggccaa ggcaggagga tcacttgagg tcatgagttc 
6841 gagaccagcc tgggcaacat gctgaaaccc tgtctctact aaaaatacaa aaattagctg 6901 ggcatggtga tgggtgcctg taatcccagc tactcaggag gctgagtcag gagaattgct 6961 tgaacccagg agacagaggt tgcagtgagc tgagatagtg ccactacact ccagcctagg 7021 agacagagca agattccgtc tcaaaaataa ataaataaat aaataaagca aacaaataaa 7081 ctttaaaaag gtcataaaat cttctattga atattcttcg ttgttgtttt ctatagtatg 7141 aagcctaata tggtaatagt aatactaaat aatattaata atttagtact aaaacataat 7201 attttatatt tttaaatcat ttttaatgat acctgttgca ttttaggcca atgttcatta 7261 tacgtacttt tattatgtaa ccttccaaac ttaggaagct tatacttctt ctctgaagca 7321 gaaatatccg gtgtgggggc agaagtgcac gctgtcagtg agttacttga accaagttgt 7381 cttttagaaa acaatataga gtgggaggta cgctgtctag tccgacatgc cacttccctc 7441 tctttacaca gcaatggttc atccatcagc aatgtgatga cttgcttaga tttttcccgg 7501 atataataac ctgaaaaata cattgactct gcttagaaaa ggaaatacta gttaaggcta 7561 taattaattt tttttctaca gaaatttttt taaaaataaa aatctcagca gtgaaaaaac 7621 ctgcagcaaa taaagaacta ctgataagta tattcctttt cttatacttt aaaaggtaaa 7681 taagtcatta atatatgata aaattatata ctggcattac atttttcact ttaaatacat 7741 tattggattt tccttttaaa agcaatatat ggctgggtgc ggtggctcat ggctgtaatc 7801 tcagcacttt gggaggctga agtgggcaga tcacgaggtc aggaaatcaa gaccatcctg 7861 gccaatatgg tgaaaccttg tctctactaa aatacaaaaa actagctggg catggtggca 7921 tgtgcctgta gtcccagcta ctcaggaagc tgagacaggg gaatcacttg aactcaagag 7981 gtagaggttg cagtgagctg agattgcacc actgcactcc agcttggcaa gagagcgaga 8041 ctctgtctca aaaaaaaaaa aaaaaagaaa gaaaaagcaa taatactttg caataatcaa 8101 acaatatagg catacaaaga aggggtatga attagatcta gatatactga cctagatgta 8161 tgaccaggac acatggttaa agatttagag taacatacac aataggattt gaagctgtaa 8221 catatagagg aaaagaaatc accatatatg tacacggaag agagaagcag ataaagaaat 8281 acatacaact gtggtttgaa tgtatcccct ccaaaattca agtactgcca atgtgatagc 8341 attaagaggt ggagccttta agaggtgatt atgccatgag gtcacctcct ttgtgaacag 8401 actaaaggtc cttataaaag aggcttccca cagcatttgt ctcttgcttg ccttccacca 8461 tcgcctttcg gaccttctgt cacatgagga cacagcattc ctcccttcca gaggatgcag 8521 
catcaaggcg ccatcttgga agcagacagc agccctcact agacaactga acctgccagt 8581 gccctgaacc tggactctcc agcctccaga accatgagaa aataaatttc tcttctttct 8641 aaattaccca gtctgtgata ctctgttata gcagcacaag cagactaaga cagatagaca 8701 gacaggcaag aatataggaa ggcaggcagg caggcaaaca gatggcatac tcagaatatt 8761 gttaacatct gttaccttac aggggtacag ttggggtggg acaagaaaag ggagagcaat 8821 tggcttttct ctgacgtaac atcctatcaa aacaatttgc tgctttttaa cagagtgaca 8881 gagcacagtc aaagacaaga attctagtga gagtgatagc tctgtcctta accatctgtg 8941 tgactttcag tcagtcaggt ggtctatctg ggcctgagtc ttctatctga aaagcagcac 9001 aagcagtatc tgtatggttg aattcccaga tacaaaggta cactgtaacc tataactcac 9061 tcgaatataa ggtattacta ttttttcaac cttcttccta gaaaaacgat aaaggaaatc 9121 acttaccttt ctccatcttg ggatgaactc acaaacatgg cttcccaagc cacctttcaa 9181 aaataaactg aaaagtccct aatgatccac ttgtatgttc attttgggct ctaggtgtga 9241 tgaattaaag tattaatgtt ttctaactat aatttactag cagtggaagt gatgagaaat 9301 ggtcaaattt ggggtatatt tagatgacag tggccaacga tgttctctca ctgcgggtca 9361 cttcattaag gactggtgtg gacagtgtca gcaacctttg ctcaagttgc catcttttcc 9421 aagattattt tatctaatct cacttcttag ttatttcttc cttccaaatc tactttataa 9481 ctgcttcaaa tcatatcatt ctaaacactc ttagcttcct atgatcagtt tgcattccat 9541 aacccgaatt tccacaatgt tttgggtaac cattcagtgt cgtgtgataa acagaagtcg 9601 gctgtgaatt ttagcctttg ggaacaatat gtcagacact tatctttccc ctgaaaaatg 9661 gacaaagatt ggttttggag gcacaaagaa catgttttcc catttttaac atcaggaact 9721 catctccttc caccttcccc ctcattcctt ctactctagc taccctggtg ctttattctg 9781 caatcccacc aagctcttca tctacttaga gccatgtttt ggctgttgct tctgccagga 9841 actctttgtg tggctctatc cctgattcat tcaggtctca agtgtcacct cctcagagaa 9901 tctcccttct gcctaccctt tctaaaatag ccccaccctc atcactctct attcctttat 9961 cctgctttat ttttcacatt cgcatttatc attacctaat attatatatt tattggttta 10021 cttatttatt gtttgtttcc tcaccaaaaa tataagctcc atgaggggca gagaagtctc 10081 attttagaca ctgatatctt cagtgtctgc aagagtgact ggcacacagt agatgctcag 10141 taaatattta ctgagtgacc aaggtggata gatcacttga ggccaggagt ttgagaccag 10201 
cctgggcaac atggtgaaac cctgtctcta ctacaaacac aaaagttagc tgggcacggt 10261 ggtgcacacc tgtaatccca actactcagg tggctgaggg atgataatcg cttgaaccca 10321 ggaggcagag gttgcagtga gctgagattg ggccacagca ctccagcctg ggtgacagcg 10381 tgagactcag tttcaaaaac aaaacaaaaa atttaccgag tgaattaatc ataaaggaat 10441 aaattaccaa aaagctgcta attttcttct tgacatagtg taactggaac cagattgatg 10501 aaaaatacta agtcaattta caacttagct gcctaagaca gttgcttggc ctgaaataat 10561 ttcttgtaac actgctctca ttaagttaga actagaaatt atagaataga tatgaaatat 10621 tctccttgga gtctgtgtgg attggtggga tttgaggaca gctgcaatag ttctaagaga 10681 caagcaatta caccaagcaa agagccagtt gcaatagtgg aggacttctc cccagatcat 10741 cagagcccct ccctgaggta ggactcagca gcaccaggaa gatcaggcca cagcagttca 10801 ccaaacatag ggaataaatc acccagaagt gagcttctgc ccactaccct agcttgctat 10861 catcctagag aaggtactag agatactaga tactagcctg gctgttcaaa agcatcaaag 10921 ttgaaagtga gaaaatctaa gaagaacgta ataaacctgc ttcaaggatg agttaaataa 10981 caacaacagt aataattagc accttcttac tactaaacaa tatgtttctg aaaagaaagg 11041 gaggacagca taccaacaaa gaatagtact ggaagtagaa aacttaggca gcaaaattaa 11101 gaaatggagt tggaagtaga aaacatggat tctctgaaat cagaaattaa actagtccaa 11161 cagatcaata cccaaggata ttacataaaa cacatcaaga taaaaaggtt gttagatgag 11221 gtgcaggtta aagtgaaaat aaagctatat acagttttaa ttgggaatct tttggctcta 11281 agtgacagaa actccatttg aactgggtaa tgcaaaagct agggaaaacc aggggttgca 11341 actgagcctc aggtacaaag gaagccaaga ctggaaaact accaggacat ctctcccctg 11401 ccatctctgc ttctctagga aatgcttgca ttttctcttc ttgcaagttg gcttctttta 11461 tctggatggt tatggccacc aacagctcta gattcacact gccacctgag aagtctccct 11521 cttcaaattt ggagtatcag gagaaggact ccaagtgacc cagcttgaaa cagtcacatg 11581 cccatcacca tggccatagg gtgtgatgcc acgattggca acctcagtta gagcacaacg 11641 ggggcgtgag ggtagggtcc aggtgacaga aaaatgaaca agtagacagc caacttgagg 11701 agtcctccca aacaaacaaa aaaaggacaa ggaaatcaaa ttagtagagc aactatgcaa 11761 gagatcaagg cccaaggaca ttcatttatg cagtccatta ataatttctg gaatacttat 11821 gtttcaggaa gtattctaag caatgggggt gcaagtcaga aaaagacaag 
gtccctgcac 11881 ttgtgacact tacattgtag ttggttgaaa cagccaataa tcataaacaa gataatttca 11941 gataaatgct ctaaaggaaa taaagaaggg tgatgtgcca gactttaatt ttgggtgtgg 12001 tcacggaaag tcttgcagaa gaggtaacat ggctgagtct tgaatagtaa gaagcagcca 12061 gctctgcaca gacattaggg cagagtgcgc cagacaggga acagctccaa tttcttcagg 12121 aaagccttcc acggacagtg aacaccctct cccgctgtgc cctatgcaga catgggcctc 12181 tgacttggcc aggccctgct ttcaaacact gtcacagtac cttacccttg tccttcacag 12241 cacttccaca gttagtcatt gcccatttgg taactgtctt ttccctccaa ttaactgtga 12301 catccacgag tccagactat gcttgtttaa gagacactta agaaaagttt atgaaatgaa 12361 tgaatatgct gaggaaaagt tttccaaatg ctgaggagaa atctacatac tttttttttt 12421 ttgagacgga gtctcgctct gttgcccagg ctggaatgca gtggcacgat ctcggcccac 12481 cgccagctct gcctcctggg ttcaagcgat tctcctgcct cagcctccgg agtagctggg 12541 actacaggcg cccgccacca ggcccggcta atttttttgc atttttagta gagacggggt 12601 ttcaccgtgt tagccaggat ggtcttgatc tcctgacctc acgatccacc cacctcagcc 12661 tcccaaagtg ctgggattac aggcgtgagc caccgcgccc ggcccgagaa atctatatac 12721 atttacagct gaccatcgtg taaagagatt gcaaatatgg taaattcaaa gatctttggc 12781 attgagatgc accactagga aaaatatgta tcagaagatc tttatactaa aacacgtgct 12841 aggcttcaaa caatcgtaaa aatcttaata ctcctgtctc tattcctgac ctactgccct 12901 accaaaatct actcttcttc ccatattcct ttctcggtga cttttatctc tactctaagt 12961 tagaaatatg aacatatacc tcccaaattc agttatccac gtctattgac tatacccact 13021 aattatcttc caaatccatt ctctcctctt catcttgatt cagaccttga tcatttgttg 13081 cctgtggtgt cctaaattac ttacttagct ttagtctccc ttccaatcta ccctccacaa 13141 tgccttcagg taacgcggtg ggggtagaaa gggaggggag aagtatctaa aatatctgat 13201 caagtttcca ggatagaaaa ccttcagttg gttgcagtga gccgagatcg cgccattgca 13261 ctccagcctg ggcgatgaga gcacaactcc gtctcaaaaa aaaaatccct ccagtcaagt 13321 tgacacagta agttcatgct tcaaagtatc cccctgttct attccaaata cacagcaata 13381 atagacactg ccagtctcaa aaacaagata aaaaccccca cgaagcagaa atataacaga 13441 aatgaaaaag cagtgagcga ggctgaagcc aagggctgct gggctcttga tctgccagcc 13501 cagagttcag ctcctcaggt gttaggaaac 
cacagctggt ctcactcttt aatgttatgg 13561 gggcagtgtt tgtgagagaa gcctgggaag aaaaggagaa gaaaaaaaaa aaatagccat 13621 caactgcttc ccgggataca acttatttgt ggctgtggat ctaaaagcag gaggatacac 13681 atgcagtctt aatacttgga actaaatgaa gccacctgct tgctcagccc ctgggtctag 13741 gacattctag tattatcaca gggcatgtgc cccaagccaa ctttataagg cctgtacagg 13801 agggagtcta gaagacctgg gtagagggaa ctggagaaac ctttatttta ttaaaaaaaa 13861 aaaaaaaaag gaaggaaatg aacatgagtg gttgtgggaa aaaatctccc actcaaaata 13921 gcctaaacac taaaattcta aaatacacat ttttaaaatc taacgctaag aaagtcagcc 13981 aatgaaatca gcaatctatg taagacctca aactataaga atcccagaag aaaagctaga 14041 aaacaccacg ctggacattg aacttgggaa ataatttatg agtaagtcct caaaagcaac 14101 tgcaataaaa cccaaaaatt gaccagtggc acctaattca gctaacgagc ttctgcacaa 14161 caaaagaaac tatcagtgta aagagacagc ctatagaatg ggagaaaata ttcataaact 14221 atgcatctga caaaggtctg atattcagaa tctataagga acttaaacaa ttaaacaagc 14281 aaaaaacaaa taacccaact aaaaaatggg caaaacatgc acagatactt ctcaaaagac 14341 atacaattgg ccaaaaaaca cacagaaaaa atgctccaca tcactaacca tcagagaaat 14401 gcaaagcaaa accacaatga gataccatct cacaccagtc agaatggcta ttatcaaaaa 14461 gccaaaaaac aacagatgct ggtaaggctg tggagaaaag gggacacttg tatactactg 14521 gtgggaatgt aaattcgttc aaccatggta gaaaacagtc tggagatttc tcaaataact 14581 gagaactgga ttcaacccag caatcccatt actaggtata tatccaaaag gaaatacata 14641 attctgccaa aaatacacat gcacttgtat gttcactgca gcactgttca caatagcaaa 14701 gacgtggaat caatctaggt gcccatcaat ggtggattga atgaagagag tgtggtacat 14761 ataccacgaa atactatgca gccataaaaa agaatgaaat catgtccttt gcagcaacat 14821 gggtgcagct ggaggccatt atcctaagca aattcatgca ggaacagaaa accaaatact 14881 ggatgttctc acttataaat gggagaatgg gagctaaaca ttgggtaccc atagacataa 14941 agatggcaac accagaaaca ggactactgg aagtgggagg agagaaggga ggcaagggct 15001 gaaaaatgca ctattgagta ctatgattag tacacgggtg atgggatcaa tcatacccca 15061 aacctcagca tcacacagtg tatccaggta acaaacctgc acatgtaccc cttgaatcta 15121 aaataaaagt taaaattgta aaaaaaaaaa aaaaaccagc aatcacaata taaatccaca 15181 ccaaatgaaa 
tcaatgtttc tataggatct tctatggaat ctatattcta tgatttgaaa 15241 accctctaac aaagattttt aaataaagag attaatacat gactcatatt gacaacaagg 15301 aatataaaag tataaaacaa aacaaaggca gatatgaaac aggaacagat ggatatgaaa 15361 aaaatcaatt aaaatcctag aaatgaaaac tacagtcatt gagatttttt taaaaaaatt 15421 caacatccaa ggtaaaaacc agactagaac agctgaagat agatttggta attggagaac 15481 agtactttat ttatttattt atttatttat ttatttattt atttattttt gagacagtct 15541 caatgtctca acctgccacc taggctggag tgtagtggca caatctcagc tcactgcaaa 15601 ctccgcctcc tgggttcaag caattctcat gcctcagcct cccgagtggc tgggattaca 15661 ggcagacacc accaggcctg gctaattttt gtatgtttaa tatctttgcc atgtcggcca 15721 ggctggtctt caactcctgg cctcaagtga tccacctgcc ttgacctccc aaagtgatgg 15781 gattacaggt atgagccacc gtgtccagcc aacaggagaa cagtactaag gaattctccc 15841 aaaatacagc acagagaggc aacaagatta aaaagattaa ttaaaagtca taaagaacag 15901 attataggcc ctaatattcc tctagaggaa gtttcaaaag aaaattaagg taatagaaaa 15961 aaagcaatat tcagagacaa gagttgagaa ttttacagaa aagacataaa ttttcatgtc 16021 aaaaatgatc tccaaagacg aagtaagata aataaaaata aatctatacc agaaacactt 16081 cagaaaaata gaagaatatt taggataaag agaaaatatt aaacctacaa attccttctt 16141 tgggtttata atagcatact tgccattcag tcggcccatc agccattgtc agcaaatgct 16201 gatttcccac ttccaccaca gaaagagaag aaactgagat acaaataaag tatcctcagc 16261 cctccaaacc tagtaaggga ggccaatgtg tgaatcaata attatggcag tgtgacaggg 16321 ctacaatgta tacgtggaca aggtgctagg gacaagtaga gggtgaccac tttacagaac 16381 agtagaaaag acctcctttt ctactggctc atatctgtaa ttctaacatt ttgggaggcc 16441 gaggtgggag gatcacttga ggccaggagt ttgagaccag cctgggcaac atacagagat 16501 cccttctcta cattaaaaag aaagaaagaa agaaaattag ctagtcatgg tggcaaacac 16561 ttgtagtccc cactactcag gagacaactt gagcccagga gttggaagct gcagtgagct 16621 gtgatcacac cattacactc caaccggggc aatggagtga gagcccatct tctaaaataa 16681 ataaataaat aaataaataa ataaataaat aaataaataa atacggacct tcctagtgag 16741 tatctgaatg gagctttatt tgaacatgaa tatctgttct aggaaatgaa aataataaaa 16801 gttaaaatct atcaagagct gatagtagta actaatatcg acaggtaact tactatgtga 
16861 ctaactctca gacatatctc taatcctcac aaccatcctc agagttaagt ccattttacc 16921 aaagagaaaa gtaaggttta gggtgatgaa gtaactttcc tgaagtcata aagctaataa 16981 actagaaagt ggaggttggc tttaatttct ttgactgacc ttagcatata cttaccagtg 17041 atttcatttg atgagaacct taagagagaa tagttctgtt gacacagcac tgaaagggct 17101 aaactggttg ctttcccttt tttgtttact gtgctttgtc gtccatctga gtacaataaa 17161 taaagagctc cgtgggaaga ctagccagct gggtaaaagt cacatttaag cttaaaagaa 17221 ttcatgcttt gcttcttgct atcatgatta ccaacaggat tacatttgtt gaaaaggaac 17281 acaaggctaa atattcaact attttaaata aaagtttttg gtttaaattc acaatgatac 17341 aacttctaac aacaaggctt atgatgacct aaagcctgta acatatgttt gttaaaggaa 17401 actatttgtt catcaaaaat atttgttaca ggaaattaca tttgctcact taagctataa 17461 atgaactcag aagcctgtgt taattaaccc accaaccatg agtgcattga aattgcacta 17521 atttattaat ctaatccacc atcatatatt aggtgggttc accgattaga gaattctgac 17581 tctcgatgtc tgacactagt ctgaaaatta aatctgatcc acttttaagc tagaatgttt 17641 cctataatgc ttggcccctg tatcatacta ctaggaactg cctcatatac cccaaatcca 17701 tcccttctgt gagtgtatta atgcccccaa attttagtta tttggcttta tagccttgac 17761 atggctaatc aacatctaga gcaatttaaa aaatatttcc tattttattt ttcttgaatg 17821 tattttttcc agatatcttg gctttattca attcattata attgccttga aaaggaagac 17881 ctcagacatt gctttttcct tctattccta gggttcccgc tttgtctaaa cctttttcaa 17941 gggccaaata cttgatgttt tctctctttc tatgagaatc taaactcaaa aaacaacctc 18001 cccccactgc ccaacaaaac ctagtgtaaa tataaataaa tgtattatgt cttccacagt 18061 atttgccttt catattttaa tttatatctc atctatagct taaaggctta gttcttagtc 18121 taagatattc aaaatctggt gagcaaatag gtaggcaact ggtatgagca catccctgag 18181 tgttggagga tacaatggga gaggatacaa tggggagtat tatactaata cagaaatcgg 18241 tcattagtat aaatgtgaag cacctcattc ctaaattagg tgactgaaaa tgaaaatgaa 18301 aagctacaat ggcaagaata tcaccaatga atgctacttc tagtaaaatc actttttttt 18361 ttttttttag agacagggta gagtgcagtg gcacaatcat agctcactgc aacctcaaac 18421 tcttgggctc aagctatctt tccacctcag ccccccatca gccatgccta gctaattttc 18481 ttttctttct ttttcttttt ttttaatgta gagacagggt 
ctcactatgt tgcccaggct 18541 ggtcttgaac tcctgacctc aggaaatcct cccacagtgc tgggattaca ggagtgagcc 18601 accacactca gcctaacata acttttaata acaacattga aatctgtctc tttaaaatgt 18661 ggccctgttt ttaaaattat acctttaatt gaaatgtatg tgtgtatgtg tatatgaaat 18721 agtttattta aataaaatta attgcagttt catgacttag ctaattgaga attaaaaaaa 18781 cacacaaatc tataatccat acaattccta gaatctgaat tggttttaaa agaatactgt 18841 aatttgtgta aataatttga tgatatttta aaactttcat atttgattag atatttttca 18901 gataatttta attaagttca cactagtatt tcagtgaaac cccatttgga tatttttaag 18961 actcaactag gagaggctgt ttagtatact aatagcatat gactaaatat caggatatat 19021 tatttagatg agttctaatt gtgttcagcc ataaattgga tatgtaacct tacctaaata 19081 acttaatatc tctgagcttt gcctccaagt ctctgtaaga gagaaaatac ctgctcttca 19141 tttttcataa gatgctttaa ggagatcaaa caagatcatg catatggaaa tactctatta 19201 ttaaaatcta atacttctca agcaattgga taccatacag ataagaatga actctatctc 19261 acaatataca caaaagctaa ttctagatgg atcacagacc tagttaaaac tataaatatc 19321 tggaagaaaa catagaacat attccttggg ataggcaagt ttctctgaca aggtaaaaaa 19381 ggcattaacc atatgtatta gtccactttc acactgctga taaagacata tcaaagactc 19441 ggcaatttac aaaagaaaga ggtttaatgg acttacagtt ccacatggct agggaggcct 19501 cacaatcatg gcagaaggca aggaagagca agtcacatct tatgtggatg gtggcaggca 19561 aagaaagaga gcttgtgcag ggaaactccc ccttataaaa ccaccagatc ttatgagact 19621 tattcactat catgagaaca gcacacaaaa gacctgcccc catgattcaa ttcctcccac 19681 agagtccctc ccacaagaca tggaattcac gatgagatat gggtggggac acagctaaac 19741 tatatcacca taaaagaaaa acagattgaa cttcatgaaa attaccaact taaaactatt 19801 tttaccccat gttctcctac aaatagatct tgagagcttg tttggaggtc ctaaaaaggg 19861 agtgcagcta ctcctatacc cttgaccaaa gaaaaatcct cttctatcag caaaggtcat 19921 cctctttgac caagaatcca actttaggag agacacacat acagcggtga ggaaggaaaa 19981 ggactcctgc ctagccagcc agattgccct cacatccaaa atgacaaact tctactcatc 20041 ataagacaac attagaaatg gaaaagacaa gctacagaat ttttgtagct tgtaggatgg 20101 cagaaaatat ctataatatg tctatctgac aaagaactca tatccaaaat atgtaaaaat 20161 tcctagaaat caataataaa 
aagataaaat gttaaaaata ggcaaaatat ttgagcaggc 20221 aatacttgaa caggcaaaag atgtccttgt gggcaaaaat catatgaaaa ggcacttaac 20281 aacttttcag ggaaatgcaa attaaaacca aattgaaaca acattatact cccaacaaaa 20341 tggctaaatt gaaaataata acattcctaa gtgttgataa ggatgtgaac aacaaattct 20401 catatattgg tagtgaaatc acttaagaaa actttagttt ctagtcatat ttaatggttt 20461 ctacaaccca gaaatttcac tcttcaggat ataccaaaga gaaccgaatg catatcgcac 20521 aggaatgttc ataagagttt tattcacaat aactcaaagc tataagcatc ccaaatatta 20581 tccatcagca aaagatggat aaactgtggt atgcacatac aattgaatac aactcagcaa 20641 taaaaacaaa tgatattctg atatacataa caatatggat gactctcaga aacattatgt 20701 tgagagaagc cagaaactga agaacacata cggtatgctt ccatttatat aaattttaag 20761 aacaaatgca acaaatctat acagttaaag aaattcggtc ggtcgcagcc cgctcgggcc 20821 cgttcgcgcc cgtcctgtcg gccacgtccc gcggggtggc gggcgcgctg cggcccttgg 20881 tgcaggccac ggtgcccgcc accccggagc agcctgtgtt ggacctgaag cggcccttcc 20941 tcagccggga gtcgctgagc ggccaggccg tgcgccggcc tttggtcgcc tccgtgggcc 21001 tcaatgtccc tgcttctgtt tgttattccc acacagacgt caaggtgcct gacttctctg 21061 aataccgccg ccttgaagtt ttagatagta cgaagtcttc aagagaaagc accgaggcta 21121 ggaaaggttt ctcctatttg gtaactggag taactactgt gggtgtcgca tatgctgcca 21181 agaatgccgt cacccagttc gtttccagca tgagtgcttc tgctgatgtg ttggccctgg 21241 cgaaaatcga aatcaagtta tccgatattc cagaaggcaa gaacatggct ttcaaatgga 21301 gaggcaaacc cctgtttgta cgtcatagaa cccagaagga aattaagcag gaagctgcag 21361 ttgaattatc acagttgagg gacccacagc atgatctaga tcgagtaaag aaacctgaat 21421 gggttatcct gataggtgtt tgcactcatc ttggttgtgt acccattgca aatgcaggag 21481 attttggtgg ttattactgc ccttgccatg ggtcacacta tgatgcatct ggcaggatca 21541 gattgggtcc tgctactctc aaccttgaag tccccacgta tgagttcacc agtgacgata 21601 tggtgattgt tggttagaga cttggactca agtcataggc ttctttcagt ctttatgtca 21661 cctcaggaga cttatttgag aggaagcctt ctgtacttga agttgatttg aaatatgtaa 21721 gaattgatga tgtatttgca aacattaatg tgaaataaat tgaatttaat gttgaatact 21781 ttcaggcatt cacttaataa agccactgtt aagcactgtt atgctcagtc atacacgcga 21841 
aaggtacaac gtcttttagc taattctaat taaaaattac agactggtgt acaagatact 21901 tgtgaaatct gtaactgact ttattttctt gcccaaatat ttgcttcctg ctttgcgtca 21961 gggacgcaga ttctgcaagg tcattgttgg gatgaagtaa aattaacggg tcatctaaaa 22021 aaaaaaaaaa aaaaaagaaa ttcaaaaagt agttgcctct ggagaggagg gaaagcaagc 22081 agggatgaag gaactggatg aggtgattaa aatgttcttt attgtgctat gggtgatggt 22141 tccactcatc caaattcatc caactgtata cttaagattt aagtatgtca taatatgtaa 22201 attatatttc aataaagtgg ttagcataaa actattttaa aaagtaattt gtagtgaaga 22261 aagggaaatt tttgtattag tcaactctga catttctatt ttttatcttt ctaaaaaaag 22321 tcaattgctt taaatttgct aaaggtataa aaatatcaag gtattaatca ttcattcatc 22381 aaatatatat tgagtgatcc taccccaagg caaggtacta agtgctagga acactgttct 22441 tgcccttgaa aaattcagag ttttgcaagg gataaagatg tgtacataaa tggactgtga 22501 cagaggcata cagacattgt aggcataaaa gagagaatgt cagactacct gcgaatgtta 22561 acgagattca ggacgatcac ttgggaacct tgaatcacct gtggtaacta tggtgcccac 22621 tcctgtgagg ttgagcctaa attgtaatgg gtcttaacct aagataaaag agactgaccc 22681 agccttgcca actgtttcta gctgtccacc caagaccttc aaattagtga aggattttta 22741 aagcctcaag agccacaata aagagcacat ggtgcatgct tcttgccatc cttgacctaa 22801 acaacttaga aacggataaa aaaaaggata aacctgactt ctgatccaaa atgtaaaaag 22861 ttggagtcgc cactccatcc taacaagtaa aaagctgaac atacagaaaa atcaactctt 22921 cttggatccg tcagagaggt gtagtcacag ggcaaaccac ttagccccac actgagactg 22981 acaggcaaat acagtacaga gaatcatggc ttgctggagc agaaacctcc aaggaaacaa 23041 gtgctgaggt cgggaagcct gaactataac tgagagataa aaactccagg ggaagtcttc 23101 agggcctccc tcacttttgt tttacctcaa ggatctcgac caagttctcg cagtaaatac 23161 tggagaaaaa tcccctggtg cttctagcag gggaaggaaa aaaaggaacc attcagaatt 23221 ttccagagca ctctgttctt aacaagttct gcccttagga gaagctagtt aaccaaagcc 23281 taacctgatg gggtcttaac agagcctaac tgatctggaa gatcagtaag cctaacttac 23341 tgccacatca ctaaaggctt atttacctga gatcctctta ctgtatctca tgcccagcta 23401 tcaagaaaaa atcataagac atactaaaag gaaaaaatca cagtttgagg agacagaaca 23461 agtaacaaaa cccggtgcag atagggccga gatattagaa ttattagact 
aggaatttaa 23521 aataactatg attaatatgt aagggctcta acagataatg tggacagcat gcagcataga 23581 tgggtgatgt aagcatgaga cagaaattct aagaaggaat caaaaagaaa tgctagcgat 23641 caaaaatatt gtaacagaaa tgaagaatgc ctttggcagg ctttctggga gactggacac 23701 agctgaggaa agaatctttg agctggagga catctcaata gaaaccacta aaactgaaaa 23761 gcaaagagaa aaaccgaatg acaacaacaa taacaagaac agaatatcca agaactatgg 23821 gacaactata aaaggtgtaa catatgtgta atggaaatta acaaagaaga gagacacagt 23881 aacaggagaa atatgtgaaa cagtcactga taatttcccc aaattagtgt tatacactaa 23941 ccacacatca aggaagctca gagaacatta agcaggataa atggaaattt ttttttcaaa 24001 aaaaaaaaaa tactcctgaa agaaaccaga ggcaaaaaac accttacgta tagaggaaca 24061 aagataagaa ttacattcaa cttctcagaa accaaaacta agaaagtgaa ggagaaataa 24121 agacttttta gccaaacaaa aattgaggga atttgtggcc agtaacctac cttgcaaaaa 24181 atattcaaag aaattcttta gagaaacaga aaatgataaa gttcagaaac tcagatctgc 24241 ataaagaaat gaactgcatc agaaaaggaa taagtgaaaa taaaaacttt cttattattt 24301 attaataaaa taatagcaac aatgtatttg attatacatg cttacacata tgcttatgta 24361 taagcaaaat aagtaacagc aatgatacaa gggacaggaa gaagaaatca gaattatttt 24421 attattataa ggtactagca ctacccatga ggcaatatag tgttattcat taaaattaaa 24481 atttctgatc tgcaaaagac acggtcaaga aaatgaaaat atgagccaca gactgggaga 24541 aaatatttgc aaaaaaaaat gagacgaagg agtgttatct gaaatataca cagaattctt 24601 aaaactcaac agtaaaaaca aacaacctga cttaagaaag ggcctaaaac cttaacagac 24661 acctcattaa agaagatata caggccgggc acggtggctc actcctgtaa tcccagcact 24721 ttgggaggct gaggcgggcg gatcacgagg tcaggagatc gagaccatcc tggctaacat 24781 ggtgaaaccc catctctact aaaaaataaa aaaaattagc tcggtgtggt ggtgggtacc 24841 tgtagtccca gctacttggg aggctgaggc aggagaatgg tgggaaccca ggaggcagag 24901 cttgcggtga gccaagattg caccactgca ctccagcctg ggtgacagag tgagactctg 24961 tctcaaaaaa aaaaaaaaaa aaaaaaaaaa agatatacag atggcaagta agcatatgaa 25021 agatcctcca catcatacat catcagagac atgcacacta aaataacaat aagagatgct 25081 acatacatac ctattaggat gaccaaaatc cagaacactg acaacaccaa atgctggtaa 25141 agatgcagag catcaggaac tctcattatt 
gctggtggga atgcaaaatg gggctgccac 25201 tttaaaagac agtttggcag tttcttataa aactaaacac tcttgccata caattgagta 25261 catcatgctc cttggtattt actccaagga gttgaaaatt attgtccaca taaaaaactt 25321 catacagttg tttatttgta attgtcaaaa cttggaagca atcaaggtaa cgttcagtag 25381 gtgaatggat aaatttactg tgattcatcc caacaatggg ctactattaa gtgctaaaag 25441 gaaatgaggc tgggcgcaat ggctcatgcc tataattcca acaatttggg aagccaagac 25501 aggaggattg cttgaggcca caaattcaag accagcctgg gcaatgtagt gagactctgt 25561 ctccacaaaa aattttaaaa tgagctgggc atggtggtgt gcacctacag tcccagctac 25621 tctggaagct gattctcctc tgagaggatt gcttaagcca aggagttcaa ggcttcagtg 25681 acttctgata atgccactgc actccagccc agtgacagaa acctacttaa aaagggtaca 25741 tgttaaatga ttccaactaa atgacatttt ggaaaaggaa aaattatgga gacagtgaaa 25801 agatcaatgc ttgcaggggc tgaaggaggg aggaaaaaat aagcacagca cagaaaaatt 25861 ttagggcagt gaaaccactc tgtgtgatat tataatgatg aatacatgtc attgtacatt 25921 tgtccaaacc tataaaatgt acaacaccaa gagtgcacct taaagtaagc tatggacttt 25981 gggtaaaaat gatacataac tgtacgttca tcgattgtaa caaatgtacc actttggttg 26041 gggatgttta taatggggga ggctacgcat gtacagggga agagggtatt tgggaacact 26101 cagtattttc ctctcagttt tgctgtgaat ctgaaactgc tctaaaaaac gaagtttatt 26161 tttaaaaaac tttttaataa aaaggataca tttctgaatt cttgtcccag gatagtgatc 26221 agtattagtt cagtctcagt cccttccatc ctgctatatg taactggtta aatgtttggt 26281 caaagctgga aacatgacac atgctgacag gatggagtaa aggagtgaac agaatgatca 26341 ggatgaggat cacatcctta attccttcat tcaacaatat actgtagttg ttgagaacat 26401 gtattctgga gccaaaatgc ttagattgca atcttgactc tgctaactag ctgtgtggag 26461 agttattcaa cctctgtgca ccttggtttc cccatcagca aaaaggagcc aacaacattt 26521 cccatctcaa agggttgttg tgaagattaa atgaattcat acatgccaag tgttgagaat 26581 gccacttgtt acaaagtgca gtaaacatgc ttgctaataa aagagcccaa aagttagcaa 26641 atatagaaca aatttatttt taaacgctac gttagttaca ttttcaaaca atatgctcaa 26701 tatgtccttc tatagcctag acactgtttc ctgtgaagta cctctaactt taaagcaatg 26761 tatccaactg gtgactcttt taaaactaca ataagtacaa tatagaatat caccaaagtc 26821 tggaaacaca 
gggaaaatca tgaatttatt ttttcagatt aacaattaga ctcactgctt 26881 aaattcagag atatatcacc tgaaaagctg tctagagatt aaggaaaaat gcgttgagaa 26941 attatttgaa aaagtaactt attattctcc gatgtttccc aatttttctg atttgttttc 27001 ccaatgtttt tcctactagt ataactatta ttaattactg agcatgtgcc atgtgacagg 27061 cactgctgag gatactggag atggggcata gacaaagcgg atataaaagg caaaatgtcc 27121 ctgctgtcgt gaaatttaca ctttagtagg gatagacaat aaacacatcc taaaatagaa 27181 tatacactat gtcagatgag aagcgctatg gggaaaaacg aaacagaagg tggccagggc 27241 atatcagtac aacaaggagg ccaggctgct tttttataac tagggtagac ctagaaggcc 27301 tctaataaga ctttctgaga agagatccaa aggaggtgga agagccatgc aggtttctgg 27361 aaaacagcat tccaggcaaa ggcaacgcaa gtgtaaagac caaataagag agccaagaaa 27421 gccttttttt ctatagtatc ttcttttgtc ttcatctctc ttgataacct gtaactaggg 27481 cagactccag atatagatta ctcggaccaa ataatttctc tttcattcat ttctcagtac 27541 agacttgaca ctgccttctt tcttctctga gacaacaaaa tcatttttat ctagcagttt 27601 ttctttcttg tttttttcta tgacttccat ttgataaaaa tttactgtct actctgtgca 27661 ggcctgatgg tgaaaggatg attaacatat ggtacatgcc ctcacaatct agttaaaatg 27721 agacatataa aatcaataaa atataccata aaaagcactg ctttgatatt atgaataaag 27781 tactacagca attaattctg cccattgagg gaaagagtag ttggggatag agggaaaaag 27841 gattctgagc taggatttgg agactaagta ggagttcaaa acatggacag cgataaaaaa 27901 atattccaag gggaaagaca aatcttaagc ataagcacag aaatgtctaa gcatacaata 27961 tattaggaat gattgttaat gcaatgtgct acattgaaga ttagggaaat aagagcagaa 28021 aagtagattg gaaacatgtt aaggggcctg actttatcct gagagccagt ggtttgcaaa 28081 ctattttgac tgtacacctc agcagtaaca aattgttgtg catgcatata tcctcaataa 28141 atgcacattt atttaagtat gagtgaaatt atgccaagag ctactcttct agtatatcac 28201 attcattaga aaatacacaa agagggctgg gcacggtggc tcacacctgt aatcccagca 28261 ctttgggagg ctgaggtggg tgaatcacta ggtcaggagt tcaagaccag cctgcccaag 28321 atggtgaaac cctgtctcta ctaaaaatac gaaaattagc agggcatggt ggcaggtgcc 28381 tgtagtccca gctacttggg aggctgaggc aggagaatca cttgaacctg gtggggcaga 28441 ggttgcagtg agcctagata gcgccactgc actccagcct gggtgacaga gtaagacttc 
28501 atctcaaaaa aaaaaaaaaa aaaaaagaaa gaaagaaaag aaaatacaca aagagaaata 28561 tgtaaaagat aatattttcc cctaatggaa tctttttcat taatttttat tagtgaataa 28621 tgtgtacaat gttatcaatc aattgctaat gtaccttcca taatcaacaa atgaatgcac 28681 tatgcttctg ttgacatttc agttctaact tgtatttttc ttgctctctt ggtggtcata 28741 aatcatcaag agtactcata aagtctagca tagtgtactt gtacttactt ttttggtgct 28801 aagatcaatt gttagggaat ctgaaagatc tttatgatag tcaatttaaa cacattcatg 28861 tataccttat cttatacata taaaaggtac aaatgcataa aaagacaact tctttcatga 28921 taaaaacact cgacaactag gaatagaaga gaacttcccc accctgataa aggacatcta 28981 tgaaaaaacc catagctaac atcatacttg atgacgggaa gctttcctcc aaagagcagg 29041 aacaagacag gatgtctgct ctcaacactt ctttctagcc aaggaatcta gattggagag 29101 gaagaagtga aactatttgt atatgacagg atctagtata tagatcaata tacaaaaatc 29161 atttaatcag gtttatttct atacaccagc catgaacaat ctgaaaatga aattaagaag 29221 acaattccag cctgtagtcc cagctactcg ggaggctgag gcaggagaat ggcgtgaacc 29281 tgggaggcgg agcttgcagt ggctgagatc atgccactgc actccagcct ggatgacaga 29341 gcaaggctct gtcccacaaa aaaaaaaaaa aaaaaaagac aattccattt actagagctt 29401 caaaaagaat aaaatgctta gaaataactt tcagaaatgc aagaacttct gaaattaatt 29461 ggaaaataat ctttaaaatg aaaaaaactt aagacatgaa agacttgtac cctgaaaact 29521 acacagcact gttgaaagaa atggaaaaaa gatctaaata aaaagaaaga catcctgtgt 29581 acaaagattg gagaaattag tattgttaaa atgacaatat accccaaatt gatctacaga 29641 ttcaatgtaa gcccatcaaa attctagcca aattctagcc atcttctttt ttttttctga 29701 gacagagtct cactctgttg cccaggctgg agtgcagtgg tgtgatcttg gctcactgca 29761 accttcacct cctaggttca agcaattctc gtgcctcagc ctcccgggta gctgggatta 29821 caagtgcacg ccaccatacc tggctaattt ttggcatttt tagtaaagat ggggtttcac 29881 catgttggcc aggctggtct tgacctcctg gcctcaagtg atctgcctgc ctccacctcc 29941 caaagggctg ggattatagg cgtgagtcac cgcgcctggc cctagccatc tttttaacaa 30001 aattcgtaag ctggtactaa acttcaaatg aaaatgcaag agactcagaa tagaaaaaaa 30061 aaaaagctta tgaaagaaca atgttggaga ctcacactcc ttaattttaa aacttactac 30121 aaagctactg taatcaaaac agtacagtac tagcataagg 
cctgacatac agatcaatga 30181 aacaactggg agtacagaaa taagcccata catcttattt ggttgattga tttttgcaag 30241 gatgccaata taactcaaca gggaaaaaat ggtcttttcg ataaatggta gcagggcatc 30301 ttcatgaaaa agaatgaagt taaacttcac agcacataca caaattaatt caaaaaggac 30361 ccaggatctg aatctacaag ctaaaactat aaatgtctta gaaaagatag gtataaatct 30421 tcatgttctt gtattggaca atggagccca tagtttctta tatatgacac ctaaagcaca 30481 agtaaccaag gggaaaaata gatacattgc acttcaaaat ttaaaaattt agtgcatcaa 30541 aagacacctc caacaaaatt aaaggaatct acagaatcgg agaagatact tgcatatcac 30601 ctttctgata agggtcttgt atccagaata tataaagaac ttttaatact caaacagttg 30661 ttttaaaaag ttaataatgg gcaaaggatt ggaatagatg ttacaccaaa gaagacatac 30721 aaattgccaa caacgacatg aaaagatgtt cagcatcatt agtctttagg gaaatgtaaa 30781 tcaaaaccac aatgagatac caattcacaa ccagtaggat agctataatc aacaacaaca 30841 aaaaaggata ataacaagtg ttggcaggaa tacagaaaaa ctgggacatt tttaaactgc 30901 cagtggagat gtaaaatggt acagccacta tggaaaaact tagcagctcc tcaaaaagct 30961 aaacactgac ttatcatatg atccaacaat tccactccta agtatacacc caaataaacc 31021 gaaagccaga acttgatact tacactccca tgttcagtgc aatagctaaa agaaacaacc 31081 caagagtcca tcaacagatg agtaaatttt ttaaaatgtg taatgtacat acaatagaat 31141 attattcaga cataaaagga ataaagttct gatacatcct acaacatgac aacgctgagg 31201 acattatggt aagtgaaata agcgacacaa aataacaaat actgtatgat tccacttata 31261 agagatacct agaacaggaa aatttagaga cagatagggc cttggtggaa cgggaatggg 31321 gagctattgc ttaatgctta cggggtttct gtttggggtg atgaaagaat tttacaaata 31381 gtggtgatgg ccgcacaaca ctgtgaatgt aattaatgca actgaattgc acacttacaa 31441 aggcaaattt tatgttatag ttacaattct aaaaaattat actgtaatac accaaaatcc 31501 atataactgt acaccttaaa tgggtgaatt gtgtggtatg tgaattatat ctcaataaag 31561 tttttttttt taaaaaaagg gaagaaaaaa ataaactctc aagctatgaa agaacatgga 31621 ggaatcttaa gtacatattg ccaagtgaaa gaaaccagtc tgaaaaagct acatacttta 31681 agattccaac tatacgacat ctgaaaagac aaaactatag agatgataaa aagaccagtg 31741 gtagccaggg attcaagagg agaggaaaag gatgaatagg tggaacacag gacatatcta 31801 ctctgtatga tagtggaatg 
gtggacacat gacaccatgc atttttcaaa acccacagaa 31861 ctgtagaaca caaagagtga ctcctaatgt aagatatgaa ctttcattaa caataatgta 31921 tcagcatttg ttgataaact gtaacaaatg gaccacacta atgcaagatg ttattaagag 31981 aaattgtgtc tgagtgggga agggtatgtg ggaactctgc gtaactttct gcataatttt 32041 cctgtaaacc taaaactgcc cttaaaaaaa ggtctatttt taaaatgcat gcttcccaga 32101 gagaatgctc tgaattttat cctcttctca tgattcataa gactattcat tttgaacaca 32161 aattataatt ctacagtagc tagagaggaa gcactgagcc ttttgtcact gtagaatatc 32221 aggcactact gaatcatctc aggatcctat cctttcccta ctgcacctcc tcttccgccc 32281 ccgaaagaat tattttatga tctaaatttt tgtattcata taactgcttc catctagata 32341 gtctttgaag attccagcct gtggaaatat ttccttaatt tatgactaaa ttccagttta 32401 attataatta taaattacag tgacattatt acaccaaatt gccatgcctc ctattttatt 32461 cacttttttc attttctatg aggtatcatt ttatcagaaa cctaagtgtg ttcacttttt 32521 ttttcctttt gaaaaaagta ttgttctcta aggcactgaa gactcaatac agaggattaa 32581 attggaaact ataaccgtcc cagaggaaag acagctagta catttaagta caactcttcc 32641 ccacctcctg gtataaaatg atcccccaaa actcctaatc tttattggca catgctataa 32701 actttttaaa aaaactttag agatcggggt ctctctatgt tgcccaggcc ggagtacagt 32761 ggctattcac aggtgcaatc tttcctcact acagcatcaa atgcctgggc tcatgcagtc 32821 ctcccacctc agcctcccat gcaccaggat taccggtgta caccaccaca cccagcttat 32881 ataaacttct ttaaccatta ccttggtctt ttccagcttc atctatgtgc tgaaaatctt 32941 ttagtgtttg aaggttacag aacccctctc tgcaatgctg aataactttc tttgatccat 33001 tcttgatgag ataatccatt agggtaaggg atttatacac gtggcgccag ttcttcccat 33061 ggtcattgag tctgtgccac agcatattca taatctctga gagagaaatt gtgttgaaag 33121 tcaagtcact gatatctaac atcagagaac tagagggacc ccaagggtcg ttagaagttg 33181 cttccctgac ttttatttca gcatctgagt aatttttcac aaagtttttc acttgtctcc 33241 tgaacgccat aagtaataca agttccaagg tgagatggag gatgaaagca atggaataac 33301 agtttatgtc acgggtttat aaaactcttg acaggtaatt ggtccccagt tctgctgctc 33361 ccaaatacaa ataattaatc tctatgttga taaagtatca atttcctgca aggaaagcac 33421 aaaaaaattt acattaaact tttttttaag tctatagaag accaaatcaa tctgaaaatc 33481 
taggttttgt ttctaaatca gcccctcact tacagcagcc agaattgagg gtaccatata 33541 actgttccta ggataggcat gggaaaatcc ctcatgagaa attggcctat attatttaaa 33601 gaaatatagg ttgtctttct agtcataaaa gttcctggag aaaatatcat tcaacctctt 33661 tggccagaaa tataatggct ataaagcatt acatggacag cagacaagca acattttaaa 33721 aaacagaact tgcactctga catctctttc tttctttctt tctttttttt tttttttttt 33781 ttttccaaag acagggtatt gccctgcttc ccaggctgaa gtacagtggt gcaatcatag 33841 ctcactgcag ccttgaactc ctaggttcat gcaattctcc tgcctccgcc tcccaagcag 33901 ctggaactat aggtgtgcgc caccatgcct ggctattttt ttttttggag atagagtctc 33961 cctctgttgc ccaggctgga gtgcagtggt gcgatcttgg ctcactgcaa cctctgccac 34021 ccgggttcaa gcgattatcc tgcctcagcc tcctgagtag ctgggattac agatgcacgc 34081 caccacacct ggctaatttt tgtattttta gtagagacgg ggtttcacca tgttggcgag 34141 gctggtctca aactcctgac cttgtgatcc ttccacctca gcctcccaaa gtgctgggat 34201 tatagccgtg agccaccgct tccggcctgc ctggctaatt tttaaatatt gtgtagagat 34261 gggggtctca atatgttgca caggctgttc aaaaattcct ggcctcaagt gatcctccca 34321 cttcagcctt ctgaagtgct aggatttcag gcatctcttt atattgtcac atataaaaat 34381 attatattct gagagtaatt aaaatggggg taaaattgag tcttagaatc acccactgac 34441 cctatgactg tattcttttt agaactacaa gtcccgagaa tcagatctaa atatagtctg 34501 ctttcttgct aaacctaaaa gaaaggagga tcaactatga agacaaacag ctgtatatca 34561 gtggactcca cagctaattc atggtaccct aaaacctcta ttttagtcca gataagcctg 34621 gtaaattaga gggaaaggtt aaattacttt atttgttcac ctttggaaaa agaagaatga 34681 gtttaggatt aaacagtgtc aacagtaaga ctcaaataat gcaaggattt ctctaaatcc 34741 tgcagtgatc agcctgtgtc tatcagatct caaaacaagg acacacagag aggcccatat 34801 actaaaactt aaaagggaaa tgcagccccc aaactttaca taatgcttgt gaatgatcaa 34861 ttcagtttat aggaaaccat tccctttctc tttttcatta ttttgtatta aggagggtca 34921 gtcagtttgc aatttctcta gtgattcacc actgctggct gccaagacag tgtaccaaat 34981 tgatttcaag ccctgacgaa catccagacc caacgaaacc tagaagatct ataatcagat 35041 tttactagag aaactgcaca catacaaaag aaaaaatgtt aaaatccata acaaatgtgt 35101 tcactgcttt tagttttgct tccctttctt gaccttctca ggtaactagc 
tcagccttga 35161 ttaatttctg tatgtgtatt aaaggaagac ataaattttt agaagcatcc caaataaact 35221 atcacttctt ttttttaaaa aaatgagata acctcttttt ggtaaaaata caacacactc 35281 ttaactaagc aagacagaaa atgtacctgt ataagtggtt gggcgtggtg gctcacacct 35341 gtaatcctag cactttggga ggctgccgcg ggcagatcac ctgagctcag gagttcaaca 35401 ccagcctgga caacatggtg aaaccctatc tctactaaaa acacaaaaaa ttagccaggc 35461 ttggtggcac acacctgtaa tcccagctac ttgggagact gaggcataag aatcactgga 35521 acctgggagg cagaggttgc actgagccga gattaggcca ctgcgctata gcctgggaga 35581 tagagtgaga ctctgtctca aaaataaata aataaataaa taaataaata aataaataaa 35641 taaataaata aaaaagaaaa tgtattaaat atctgtttaa ttaagatttc tcagaaagac 35701 actatcatta accctaggaa gaaaaagaga agttgtgttt ctacaacttt ctgattttct 35761 ctttttcgta cttcaaacac agtcattttt atttggcacc ttcctgcagt gaaaaactga 35821 tttataatat ttcagcaaag ctattaaaat tgtcaagcag gagggttttt ttttctattg 35881 tccttttttt ccttctacca aactcactca atgcagagat gcactatgac attggcttag 35941 agcaacaaat atcgatctgt ttcttcacga ttctattttc ttttaaagcc ctgttagaat 36001 cataacttgc tggaattgaa caacaaatgt taattttttt ttcttttttg gatagctatc 36061 tatctatatt ttggaagaac aatcgtaatt tttaggtttt tctacattct ctaacctata 36121 ttttaaactt ttaaaaactg ttttcttaaa tgaattttct aacccagaaa aatatcaaaa 36181 taatagcctt ctaatccaat agagtagttt aaaatgaatt tacatgacac catatcagaa 36241 ttaaatacaa ccccaaccat tagaaaaatc gcaaggaaaa aggtctagag aagtgaattc 36301 ttatcactta ctaatgtaca atgacttgaa ctttaccttg tgatatgaac gtagaatatg 36361 taaaacacca ttctactact gctgtctttt caaattgaca aggagctcaa ttaattggtg 36421 ttagttgtaa attatatgtt ggtttattat gtgttttaga cacctgacta catgaaaatg 36481 ctgtttacat acagcatcaa ataactgtga gattgtaact acagtgatgt gacaaaaatt 36541 gacatgagac aaaattagtg cccaagaggg taaatttcca catttgaatc ttggttgtct 36601 ggctacccaa ctcactataa tatttagaaa tagagggaag aagttaggta tttcatgtca 36661 caaatctgag aggtcttcac cgacctcact aactacttag gtctccccag aacctggtta 36721 atttccttca ttgcacttgt aacaatctgt acttacctat ttgtcatttg ctcagtgcct 36781 ttcttcacca ccaaactgga agctccttgg 
gggcagggac cttgacggtc ttatttacca 36841 ctgtaatcct atccctaaca gctaattagc aacagtagat actcaagaaa catgtgttgt 36901 gcaaataaca caatgattga atgattctgc cttgggataa acctgcactg attctatagc 36961 tttataatta taatcacttt tttttttttt ttgagataaa gcctcactat gtagcccagg 37021 ctgcttgcaa actcctgggc tcaagcaatc cccctgcctc agcctcccaa gtagctggga 37081 ctacaggtgc ataccaccaa gcctaactaa ttaaaaaaaa aattttattt ttttttttga 37141 gatagggtct tgctctgtca cccaggctgg agtgaagtgg cacgatcttg gctcattgca 37201 acctccacct cccaggttca agcaattctc ctgcctcagc ctcccaagta gctgggactg 37261 cagccatgag ccaccaagcc cagctaattt tttgtatttt tagtagagac ggggtttcac 37321 catattgacc agggtggtct tgaactcctg acctcaagtg atccgcctgc ctcggcctcc 37381 cagagtgctg ggattacagg tgtgagccac cgtgcttggc ctcaaaaaaa attttttttt 37441 tagacacgag ttcttattgg ttgcccagat ggtcacaaac tcctataaag catgatatta 37501 aaaaaataaa taaataaaat ctagctctca taattgtcaa gtaaaattct atagttttga 37561 gcaaatattt tcaaatcatc aacagatatt tcttgagtac ctaccatttg tatagctttg 37621 tggagaaata taaaacatga acctcacatt aaaacttacc attcagttaa ggagctttct 37681 ctcctaaata attcccagca aaacaaatga ttacgtagaa aaggtctttt agtgttattt 37741 ttagatctgc ccactttcac aatgctgttt ccttcccccc atttcatctg aattctaaca 37801 ggaaatgatt tatcagctgg cattaaacta ttggtccacg atcttttatc ccaatccttt 37861 gaggccagat gatttgtgga aattctcttt tgggtgcggt ggtaggggtg gaattttcaa 37921 gagtagtgcc atgtctatac catatattac atgataccca cagtaaggtg tggagctgct 37981 ctgaacaatt aaacacatta acatgtctgc aataaaaaca tgtaaatatt tggaataaag 38041 actacaaatg gcctcaggcc aagttttacc accaaatgag ttggaggaaa agttgatttc 38101 caaaagtttt tggatttcag gattgagaat aaggaacaga ggagctgaac tgactttatc 38161 caaagacttg ggacaataat gacaatagca tctaagaggc tgagtgtcca gagctatctc 38221 tgaaaagatg catttaaggt ggaactatga gaaaagaaca gctcttgtgt ttgtacaggt 38281 tctgcctata gacatgagtt tgactcgtaa ggaaatgagc tctggccaag aatgctacgg 38341 gaaaatggtt atagagacta ccatgccatt aggaggtgtt actgttttga tgaaaagaat 38401 actgtcagag gcagagaatc taatcttttc cagccctaag tgtgtgttca ttaatttatt 38461 caactaataa 
tcttccagag cttaccatgt ggcaggctcc attctaggaa atggagaaac 38521 aggtgcacaa gacacagtcc tcacaccctg gagtaaacag ctactcttca ttgaacgctc 38581 acaatgtgct cacacctggg ccttcacatg cctcatctct ctcctttaat taggattcaa 38641 agccatgatt tgaggtcctg ctttgccatt tcctaactaa gacttaacaa cttctctaag 38701 tctcagttcc ctcatccgca acacaaagaa taatcgccag acagggccgt gggggaggaa 38761 gaaacgagat gctttggaaa gaattttgaa aatggcgggt ggaggctgag caagaatcaa 38821 ctttctgtgg gcagggcctt tcctcggcat cctcgtcttc attcatacaa cccccacttc 38881 agatgacgac tgtgtaccag atgagccagg tgctgggcgt tcggccacac agagaatcag 38941 accagcaatt accctaggag aaccggcagt ggaggggaag gctggacctc aggaaagcct 39001 ctagaaggag agagggacgt gatggcgtca cgtcacgcca cagggatttc acctcctgcg 39061 gatggtaatg gcctcgccgc cctcccgccc ccacctggac tccgacggtg ccggcaagaa 39121 acgggaggag ggaaggcaac cgatgtttcc agacccgcct tgagccccac caccactccc 39181 aagcagaggc tgttacctga gaggaaagtc ggccgctgcc caacgaccaa acttcctctc 39241 cccaggatat tccacctact ccagccaggc tcaagggccc aggccggtcc ccacttccgc 39301 ttcctcactt tctggcctgg tgcagttcct gtcagggcgt caagcaagat ggttgcagat 39361 acctgagctg gagggtacca aggggaatga aggtaagtgt cttcgtttca caggctgccg 39421 ggctgtgaca ctgccctaat cctcctttca aagaataaat tttttttttt ttttgagatc 39481 tgggaaatag tttaatcaga cactagcatt tattcttagc tgctttctta attaaataaa 39541 aatgagtata aattaattta ctaaatgttc acaacgtaaa aacggacatt aaaaaaagaa 39601 aaaaaatgcc aaggtaagct gataatgttt ttaaaaagta gacaataaaa taatgtatgt 39661 aacttaaaat ttgccatgat aatttattct taaaatggaa aatcagtaaa atactcagac 39721 ttctagcata atatctcttt ttttttattt attattattt aagttttaag gacatggatg 39781 aaattggaaa tcatcattct cagtaaacta tcgcaagaac aaaaaaccaa acaccgcata 39841 ttctcactca taggtgggaa ttgaacaatg agaacacatg gacacaggaa ggggaacatc 39901 acactctggg gactgttgtg gggtgggggg aggggggagg gatagcattg ggagatatac 39961 ctaatgctag atgacgagtt agtgggtgca gcgcaccagc atggcacatg aatacatttt 40021 taaaagtgtg ttactactgt catttcaact gactagaaat accaccagca cgtgattgcc 40081 acctcaaagg actgagaatg taatagtagt ctatgaagag cagtgtgtgt gatggacaat 
40141 gatggggtag ctccagtgac tgatggcttc aggtgccact gacgggtaat tgttgatgta 40201 ttcaactgat tttccgtgag cacccacttt gtgccagaca gtaaggactc ggagataaaa 40261 agcagccctc tgctggatac gtttacaggc tagtggaggt gacaggcacc gtcagagaag 40321 attgtaacaa gatatgtatg gcatagtaag agagaagaga tcatctaatt ctgccactga 40381 gtatttaagg gcaggtttca cagaagagtt tacatctgaa ttgaatctta aaggatgaca 40441 gggagttcac tggatagaca aagtagagga ggacgtaaga ggcaaaggac acatcatgtg 40501 taaaggcatg aaagctgagc cagcctatta tcttggaaaa actacatgta ctgtagttac 40561 tacaactgtc acaaaggcta ttagtggagc agcagaggaa ggttaaataa cagagggaag 40621 ttggatcatg aagaacctta tccatactac aaggtgattt tatccagtag gcattgaaaa 40681 atgtaatcga acctgtgttt taatggaata ttgactagga tgaggggaaa agacagagga 40741 taacaaaaca cttagtagac aaaagcaaca tttgtttcta tctcctacta cttcagtcat 40801 aattataata ataatagccc acattattga tactgacatg tagataggtg aatccaatat 40861 aaaatgatag gaggtatgaa agaagttggt gagtttggaa tattatttga ctggctataa 40921 tatagagtaa tggggtggga gccagaggga aatcgaaaag gtgacctgga aaacattgtt 40981 cagggccttg aatgctatgt tgaggtattt tgactttata aaatagtggg ggacaaacta 41041 caagtatttt atacattgag gtatagttta tgaagttttt ttatatgtat agcttatgta 41101 aatctcaaag gaatgctgag aagtaggcgt tattatccca ttgtacaggt gagaaagctg 41161 agtttgtgga atatgcatac attctacatt attgagcacc tgctatttgc caagcactat 41221 gctgagctat gagactagaa ggaagttgta gtttaatact ctaatagagg aaacaggaat 41281 aaatcagcaa ttgtaagtag ttgctataat aaaggcaaac aatgttacgg gagtgaaagg 41341 gggacttgtc tatatttgca tagtcaggga aggcttcctg gaggaggtga gcttaaactg 41401 agtttggaga aacaaataga ttacagtaat tgcccagaga tatttaacta ggaagcagtt 41461 tacctgaaac ttgaacccaa atctcctgtg tctagctctt tcccctactc caaggaccaa 41521 tctttggtcc ttgattttaa gctctatcag caactcagtc acaagtggac ttgttatgtt 41581 tggagaaggt cagagattcc aattttttat ttttaaattc attcattttc tccttttaaa 41641 atctctgtgt gtgctcaaag agtagattgg aagcagagga actggaagaa gaacagaatg 41701 ttctgatagt tcaggcaata agtgatgaag gccataacta aggcagaggt catgggcata 41761 gtgtgaatct ccactctgtt cactctcctc acacatatac 
ttttacatgt aatgttaaag 41821 tgtagcagtt atgggatcgg gctctgaata tatgtggcaa tgagagagaa ggaggagtct 41881 acagggacta agtttctttt tggggtgacc aagtaaatga ctgttgctgt tcaccaagat 41941 gaacatgatc cgaggggtca gtttggcagg aaagctgggg aattctgctt cggacatttt 42001 gaatctgagc aacaggtggg agatgttcag gggacagttg aatagagata actgaaagaa 42061 gagggtttgg ggctaggttc atagacttca gagtagtcag catatagaag gtgattgagg 42121 aggcagatgg gaagtaggag acagggtaag attaaagact gaagtcccaa taaacagtga 42181 gggagttggt taggcttggc aaaagtgtca agtgttacca gcttagtgga attaaccctg 42241 ttcaagtgac aatatgtaag gggtccttct tatctgcttt cctccatttg ggggcacaat 42301 ttagttgcaa taccgtaaca tccactaaat acaaatattg ctctatggta gcttggttgg 42361 ttttctcctt taattttttg aacttgaggt tctataaaag tgaacacagt ttgcttcacc 42421 tgacttcccg gacaccattt ctccttctca tctccttgac tcatccgttc tcaattctat 42481 gacctctcaa ctttgaagga gcccattctc taacctcttg tcttctctat ctatattcac 42541 tccacaggta gtgttatcca cttgcaaagc ggtaaatact cttcgattct gacaactctc 42601 aaggttttct cttcagcttc tcctgacctc caaactcgtg tatctaacta ccttgacctc 42661 tactcctggt cttgccttcc aaaccttctc tcccacggtc tctcctgtct cagtaaatga 42721 cacatccttg cttctaggtt ctcaggccat gaacctggga gtcaccttaa ctcctgtgtt 42781 tgtctcttat agctctcatg atatccatga gcttaccttt gaaatacaag gtggctaccc 42841 agtctagata ataaatgatt tattgatgct acattctaat acaagataca ataacattat 42901 ctatgtttct agactttatg atcaccctgt atattcaaaa tccaaccact tcccaccact 42961 cacatcatcc tgttttaagc cacaaccatc tctggtctgg actatgaaag tagcctcttt 43021 tttttttttt ttttttttga gatggagttt tgctcttatt gcccaggctg gagtgcaatg 43081 gcgcaatctc ggctcactgc aacctccgcc tcccgggttc aagcgattct tctgcctcag 43141 cctccctagt agctgggatt acaggcatgc gccaccatgc ctggctaatt ttgtattttt 43201 agtagaaatt gggtttctcc atgttgatca ggatggtctc aaactcccga cctcaggtga 43261 tccatccgcc tcagtctccc aaagtgctgg gattacaggc atgagccaac catgcctggc 43321 caacagtagg ctcctaactg gtctctcttc ttctgttctt gctcccttgc tatctaattt 43381 cagcacaaca gccagagcta cccttttaaa acttaaaacc caagtcagat catactactc 43441 cccagttcaa aatcttccat 
tggcttccat ctcactcaga gtaaaatcca aaggcctcgc 43501 catggcaaca aggccctaca tgttctgctc tctctctgat ctccatccca actcctctcc 43561 acctcactca cagtgttaaa catgtctcca cctgaagtct ttgcacttgc tagtccctat 43621 acctgcaaaa gctatttccc cagatgtcca tccaggaagt actggcttct tcactttgtt 43681 ctagactttg ctcaaatgtc ctttgatttc ccaacgtaaa atcgcaaccc ccatctcagc 43741 cacactccct gtggctcttg catcatttta tttttctccc taacagtagg accatctgat 43801 gtaccttctg tttctccctt taccagaatg taagatccag gagggcagag attatgtttt 43861 gttcactgat agttttccag cacctagaac aggactaaca cagagtaggt gttcaaactt 43921 tatttattga ataaatgaaa cgagaatgtc tgaatgcaca ctattaaata agaagtacat 43981 attttaccat aaagttaaat tgagcataat tatggcaaga agttcagata ttggccaggc 44041 gcagtggctc atgcctgtaa tcccagcact ttggggggcc aaggtgggca gatcacctga 44101 ggtcaggagt tcgagaccag cctgaccaac atggcgaaac ctcagctgtc ctaaaaatat 44161 acaaattagc tgggtgtggt ggcaggtgcc tgcaatccca gctactcggg aggctgaggc 44221 aggagaatca tttgaactta ggaggtggag gttgcagtga gccaagattg cgtcactgta 44281 ctctagcctg ggcaaccgag cgagactcag tctccaaaaa acaaaataaa acaaaaaaca 44341 aaacaaaaca gaagttcaga tataaacaaa tgacccctca aaaggaattg attaaaaaaa 44401 ttttaaacaa agtatggttt aaaaaagaaa cttctccact gtggctaagt ccatatttca 44461 catgttactg tatcctcctc atcttgctgc tatttttctg agtatctcat gatagtgagt 44521 actgaaatgc tggcattctc acattagggt ctttattctt aggatctgcc tttgcactcg 44581 ttcagttaca gagctgttat gaaataaacc actcaagtgg ccttgcttcc ctcccactat 44641 gctctacata acatgtctgt caatcagtac ttactaccaa aataacttgc ctttgagatt 44701 tttctgttct gcaatctgtc ccatgcttgt aaggggaagg attatattct gctccctggg 44761 gagtagtgcc tggtgcaaaa ttagcttcct cattagggta caagctgaaa acaccatgat 44821 actttccatg taacaccatt caacattttc ccttcgctcc cactccccaa ccatagaatt 44881 ttagggactc ctttaaccat ttcaaaacca tcgacttggg agctatcaaa gaataaggac 44941 tgtagccaga ggatttcagc atttaaaaac ttggcaatag agctgttcag tgtaatgatc 45001 caagaaggtc cttagaaggc tgaataattc tgaggtcaag gttgagacgt tattattttt 45061 aaagtagcaa gatgttgtta tttttaaatt ttatctttat aatgtggaaa tagcagagat 45121 
ccttttcaga tataaccagg atttaagaaa atttgacaaa gttgcattaa taaaaagaaa 45181 gcctgttatt aaggagctac agggccgggt gcggtggctc atgcctgtaa tcccagcact 45241 ttgggaggtt gaggcgggcg gatcatctga gttcaggagt tcaagaccag cctggccgac 45301 atggtgaaac cccatctcta ctaaaaatac aaaaaaaatt agccaggcat ggtggcaggt 45361 acctgtaatc ccagctactt gggaggctga agcaagagaa tcacttgtac tcgggagacg 45421 gaggttgcag tgagccaaga tcatgccatt gcactccagc ctgggcaaga agagtgaaac 45481 tccgtctcaa aaaaaaaaag gagttatgga tagaaaatta cttcctttaa ctcattcgac 45541 aaatatttac tgagcatttt ctgtcacctt gacactttgc tgattcttct tttctatttc 45601 aataaaaatg ttgggcaatt ataaccaagg aaaaggtttt aagaaagtaa attttaagca 45661 aggtttagat ataaagaagc aaagagatgg aaaaggatgc cgtttcacat tgcagaaaaa 45721 gtttaaattt tagaacccca aattgaatct ctccttgaca ctctcctctt ggtattccag 45781 ggattaatgc taggaacatc ccagtgagcc aacatttccc tcctcatctt ggatccagtc 45841 cccaaactct gaatggtgcc acttagcaag tagcaatgtc tctaattttc tgtaaaagat 45901 gcaacgttgg caatccattt tctagcaaga gcaaagagaa aggaagaggc cttcacaagt 45961 tgcagttgat atagggggaa gggtgagtgg tacatttaaa ggtaagaact aaggaagttg 46021 aaggtaacaa aactcaaggt caatggaaag tcttaaatcc cagaaagcct tcagtctaag 46081 actcaacaga aattgaagtc ccagaaattg aaggaactgg ttccacctca tgctggactg 46141 tggcaaacct ggggtgagaa tgcatcccag ctcagaatgt ttcttctttt tttgtcaaga 46201 gtaatattag aggctgagtg tatgaacaaa gcatctttcc tgggatttta acattgtgag 46261 tgatcgaggg tgctcttaaa taagttcccc caagaagccc taatgttcct ctggatgcat 46321 gattatacta agagaataag agttttcctt aagatcagct tgctgatgga ccatatcaag 46381 aactataaat tttcttatgt gaacacaagt gttcaggggg gatattgtta aaactgaaac 46441 caagcataaa gaagcaaaag tgtttcagtg aacctcctgc attttagttc aactctggca 46501 gcaagaaaac aaaaccacat aatctaaggc tcacgcccaa ctctccgaag ccccatatgt 46561 gctagttcct gtgtggtttg cctacagact gcagggctga ctagggttag tttggaggag 46621 ggagtaagag gtggggagga ggaggcacag ttaatggatc tgtaaacttg caccctcttt 46681 cagagtggta catggaagac agcacaaagt ggatccatac tctgaaatgc agtaactctg 46741 atgcttgaat ttgtctccct tcttgccaga aaggattcta ataactcggt 
gtcaaagcca 46801 agacataaac tcaacccctt ctcttccaaa aggtatgtca ttatacaaca tctgtacata 46861 tactctagaa actttgaaca gttttataat ttgcagaatg ttctgtatga gttaattaaa 46921 atgaatctca ctagttctct aaatattgct tttaagtccc atcagctaag gtctctaaac 46981 cttttgctgt gacaatgttt gtggtgatgg gatttgactt ctatcaaatg aactgggaga 47041 acacttgttc cttacccagt ccataacaca aaaatctcca caaactggct tcagaagcat 47101 ctcagggaaa gattttcgac cttgggatca tggaggtaaa attccacttc accatggtca 47161 gaatttaatc caaattggaa gggattggag ggggtggtag agggctgagc agtgatgaat 47221 gaacactagc actctgaggt caggatttct ctctttgctt gaatttctag ttctgttcgc 47281 atttttgtta cctgtacaaa ggacaccagt tttcattcct ttacgaaaag aaaactttat 47341 gttgaagtgt attttggaag ggtattgtct ctcttcgtac tagccgttag gctccatccc 47401 tatgactgtc tctggtttag gcgtccagtc tgaagatagc tgctttgtgt tctttgtccc 47461 aactcttcct gaacgtgagg attgaacata ccaaccaaag gtttcactaa tttaaaatgt 47521 actgtattgc cctgacaggt atactgtcaa aaaaaattta ccgtatttat ctacctggaa 47581 ttaggccttc ttaataaagt gtatgcagca taatttgcac actattgaac atacacatcc 47641 ctagataaac taggagtggg aagtgtttta gatctaaaca tccaatggga aaaatttcgt 47701 gtgttctcgt tatgttatct gaggatggga aacttcaggt cgcagaatct tttgttctat 47761 gcagcaatgt ttgtcataaa cggtttagtt ctttagtggc tcctttttgt ttcttatttt 47821 tggtttcgga aagagtattt agcagaagtt actctccttt aacaatccag tattttatgt 47881 aaaactttat agaataattt tatagtgatt ctctttacaa aaccatctgc tattaaataa 47941 ggaacactgt gtaaatagag gaccacctaa gtatatttaa ttttcaaata cgaaaaaaat 48001 ctgactcata cccttttttg tttttttgta aaagagaggt ggccctatat tattgaataa 48061 aactgctttt ttagattgaa cttggagtgt ctcccatgtg gtaaactgaa atcttattct 48121 ccccaaattg taatagattt tatcttttta aaaatgactt tactgaagca gaggagaagg 48181 agatgatcaa tctattcaat atgaagaaat atctctgttg ctaatacaat atccttttga 48241 atatctccca catcttaggc cctggtgaaa atccctttca attgttctgt ctacctcgta 48301 gggatgttgt aagaataaat gagttaacat agatatgaag tgccttagga aaaaggctct 48361 ataaaaatgc aaggcgatat catgattagt tgtttctttg actcttgatc attttcttcc 48421 cacttcttat acaaaatgtc tctcacttat 
ttcctgaagc catctctttt agaactattg 48481 cttcctagaa gagctgaaaa tgatcctagt tccaacatat ttattgattc ccagataata 48541 acacccactg gtgttgatga ctgacagggt tcatgcctaa gcttgagtag aacttgccct 48601 tttttcctgg gtcaagaaaa gtctgcaatt gttcgaaaat gtgtcagtaa gtgtgattcc 48661 atcacgattt cttaaataag taacagttgc tgctggataa tcctgggggc agatgattca 48721 tttttgtgcc cctatcaaac aaactcaaaa cttgttaaag aatctttata agttgtttga 48781 atcaaacaat agggtcgcta ggtccccatt ttgtccattt gaggtatttg cccagcatac 48841 cttggtctgt aaagaaatag cacagccttg gctgggcgcg gtggctcatg cttataatcc 48901 tagcactttg ggaggccgag gtgggcagat ctccaaaaat aaataataat aatgataata 48961 acacagcctc atgagtcttt tttttttttt tttttttgag acagggtctc actctgtcac 49021 tcaggctgga gtacagtggc acaatcacag ctcactgcag cctcaactac cctgggttta 49081 gtgatcctcc cacctcagcc tctcaagtag ctgagactac aggtacacac caccacacct 49141 ggctaatatt gtatttttag tagagacagg gttttgccat gtttcccagg ctggtcttga 49201 actcctgaac tcaagcgatt cgcctgcctc agcctcccaa agtgttggga ttacaggtgt 49261 gaaccactac acctggcctg ccctttcttt tctcttaaaa attcttaaga aagcatggtc 49321 ctgtttaaca aataagcatt ttacaaacat ttatcgatca attactatat gccaggcctg 49381 gcactaggca ccaagtcctc ttcaacagtg ttaacattaa gtttcaataa acttgagtca 49441 gtaggaattc agtaaataat ttttaaagat tgttttcttt ttaaatatgt atttataaac 49501 aaaaatttta gagaattaaa agaaactatc acgaagtgca tgatgataaa agtcttatat 49561 actctccaaa gaaaatgtta agacatacct taaggagata gtgaagaaca gccgggcaca 49621 gtggctcaca cctgtaatcc cagcacttgg gcggccaaag caggcagatc atctgaggtc 49681 aggagttgga gacaagcctg gccaacatgt tgaaaccctg tctctaccaa aatacaaaaa 49741 ttagccgggc atgatggcag gcgcctgtaa tcccagctac ttgggagact gaggcacgag 49801 aattgcttga acccgggagg cagatgtttc agcgagctga gatcgtgcca ctgcactcca 49861 gcctggtcga cagagcaaga tcctgtctca aaaagaggaa aaaagaaaag aaaagaaaag 49921 atagtgagaa aataactgaa cataaaacat aaagaaaagt tgaaatatta gtactcacta 49981 cctaggttca acaattgcta actctttgcc gtatttgctc catctatacc tatttctttc 50041 tgaattactt gtaagttgca gataaaataa tacttcactc ctaaataatt cagtactcat 50101 ctaagaataa 
aggcattcct ctatctaacc acaattccat cattaggcct aagaggataa 50161 acagtaagtc cttaatatca tctgatagct agtgcagatt taaccttccc catttgtccc 50221 ccaaatgtct tctataccat tttttaaaaa aagaatctaa tttttaaaaa taagaacata 50281 ttttgcagtt gggtctcatg cttctttctt caacattgga tgtttttaaa gattacgcca 50341 gttctcttgt accaggtccc actttctgaa ttcgttgttg cctcatggtt aatttgttcc 50401 tctacccact ttatttctca taagctccaa gttaggacta aaagcatgat ttgattttga 50461 ttgaacgttt tttgcaaggt gccactgggt gctccatatt gcatcttatc aggaggtaca 50521 gaatgccagg ttgttccact gtgcatgatg ccaagtttga tcagggcttg aggtagtgat 50581 cttgctcaga ggatctcttg ttgtacaggc actttttcac cctttgcaat tagcgagtac 50641 tctcaggggg tatctcaaag tgtatctcat aaacactttg gcatggtggc aatatcctat 50701 ttcccaacaa actctcaccc ggtgacttta ggtaataatc tttgcctaaa ttgattattt 50761 cattggagtt catatatatt tttaatacgc tttgaagatg gactcgacat gcaaattcat 50821 cattatcagc agcaatagtg gctggaaata tttattgagt gcataagttc aggtaataat 50881 aataatagta ggtaatgttt attgagtgct tgctgtggag ccaagacagc ctcggtgtga 50941 gcccagtctt tcacttacta agcgaatgaa ctgggacgag ttattctttt tgtgcttcag 51001 tttcttaagc tataaaatgg gattactaat ggtgcctatc tcgtgggatt gttctgaaga 51061 gtaaacgatt tatacattta aggttcctaa aacagtgcct gttattattg ttatttgcca 51121 agcattgttc taatactttg taggtactaa ctcgtctact tcttctaatg gtgcaatgaa 51181 gaaagtgcta ataataccct tttttcacaa tgaggaaact gaggtacgga gaggtctagt 51241 acgttgcaca aggtcactta actagtacct ggtggagcca ggatttgaac ccaggctgtc 51301 tggttcctga gcactctctc tgatgatggg cacatgcagt gaggagtgat acaagtggag 51361 ctgagtggaa gtagccctca cacacactca gacttgaagt tattgaaaat ccgggaggtt 51421 gccacccaaa cacatttaag gtttaattca tctgaataag aacagcattc cactaagccc 51481 agataagtac tcttattaca tttacatagc ttaaggctct gtacaagaaa ccttgaacat 51541 aaaatacttg gaacagaaaa gagtaggaga actatgaagt aatgggggga aaaaatccct 51601 gaaaggaaca atacaagcgg ggagttatga gacctgggac ctcagagcct cggggtggaa 51661 gatgatattt tgatataatc aggagggaga aatttcctct catttttcac atttctgtac 51721 tatataagat tttaccataa aacgcaagag tataagccat tgtcagattt ataaaaggcc 
51781 atggtggatt tccttaagag acgatggatg actgttgtta gtaggtaaat tccctggttg 51841 gttatctcat acactcagat attccataag tcagcctgtg gcttctgaaa ctgtattgag 51901 aaaaagctgt ataaactagg tgttggtcca tgctccttca ttagcattac aaaagggttc 51961 gactgctgag ttgaatctta tccactcctt cagtggccat tacagaaagc tctcaagtgt 52021 aattttctaa agagaactaa aaataaataa gaaaaaaaaa atccagcatt aataatggtt 52081 ctttttgcat ttccaacaca gtcattgcca aattggttct gcctttttat gttttttaat 52141 gaaaagcagc ctaattgatc aattggtcat gaataatact tcttcgtttg tttttctcca 52201 caccccccag ccttctagaa ccacgggctt aatgtgtcgg cctacaaatg gttcatgatt 52261 tgtgtttttt gtctagtgct ttatgggaaa atattagaaa caaactgctg agaactcagt 52321 aggaactcag taaataataa gtagatagtt ctcttgttaa atatgatttt aaagaaactc 52381 tttctaaata actaggaaat attacaaggg aagtacaaag tggggtggaa agcctggctg 52441 acttgtaatt tgaaatttct cttcatcatt ttataggtca cacagtttaa ccctttataa 52501 aacaaaatgt aaagtctgat agctgaaaag gctctcctcc gctaaataga tttgtttgaa 52561 aatatagatc attaaatcgg taagaataac aggagaatct caaccaaaca ttgagacaac 52621 taagaaaatt gtctaaatct aataccttat atcagtactt taagttgacc ataaatacgc 52681 tgatatccat agcacatagg ttcttactgg cagtggtggg gtgggtgggg gaaggggagt 52741 gtcacagaca cattcaagat gtcaacgaaa actatgaaaa ctctccccaa tggggaaaaa 52801 aaaccaacga tgcctagcac ttatcctacg caactttgta cagaatttcc gtgggaaatt 52861 ctggtgtcca tccatcataa atcacctggt gtccacccat atatcctgaa taggaaaaat 52921 tgtctttctc tagagttatt gtacactgat cctgatggag ctttaaaaag agttgtcgta 52981 caggctcatt gactttggga ggacaaggtg ggaggattgc ttgagcgcag gagattgaga 53041 ccagcctggg caacatggtg aggccccatc tctacaagat attcttttaa aaaattagcc 53101 aggcccagtg gcccgcacct gtagtccaag ttactccaga ggctgaggca ggaggatccc 53161 ttgagcccaa gagatcaagg ctgcagtaag ccgtgttcac accgctacac tccagcctgg 53221 gcaacagagt gagatcctgt ctcaaaaaat aaataagtaa ataaaaagag ttgccattat 53281 cttgtttgat ctggctatga tctttgaggt agacagacca cagattttac ctctcattct 53341 gcagagtggg aaaagcaagg cttggagagt ttccgggact tgctcaaggt aatttgctaa 53401 tgtctgccgg gacgtagcac caagcctgtc acgctgtagg 
ctgggcccct tccacaacac 53461 tggtccaaac acaggattct ggagatgcag gagccctcga tgggcttctg gaaagctgaa 53521 ttctggctct accagagcta cccacgtgat cttaaacaag tcacctaact tccctgagct 53581 tcagtttctt ccttagtcaa gtgagaggtt agactgtgtg ttctctgaag ccctttgatt 53641 ttctgcagaa aaccacctaa tcccactggg tctgggaaca gagggaaatg accttttctg 53701 ccatagatgg caagaccctt cacccttccc tgtagggcat aagagctggc atccaattct 53761 taacggattc ccttgcaatg tccaaatccc attagaaact acaaacatct caatagtgcc 53821 tttgtgggct gcttgggtgt catgccgctg gctgcctcac ttgctgcagg gctgactgtc 53881 cacatcaatt atactccaac cctgtgctct gcaatgggca gacatcccat gagattttat 53941 tacccctctt gggtagttca gggagtcata tggaagtaaa taatgctgat gggcttattt 54001 tgggctgata gatccttatc cttctaaaga ggttatgcct ttcaaagggt acaaaacctt 54061 ctccttgatt ttttattacg aacagttata tcctttaatg tcccagaatt gttcttggaa 54121 gttggagact cacaagaggg cctgcagctg tgaatgcaga agtcctgaga ctaggggagg 54181 acatcaccac tctcccttgg cctttggagt catccaagct aagtgctaat atttgtgatt 54241 aacaggtgtt gataggaaat tgggcccctg tgtctcagtc tttttgtgct tcaaattttc 54301 taagtgaact tcaggctggc aagtggagtt agatcaactt gttgcagaca gtatcatctg 54361 ctcggtaccg tgtcaggtca ctggactgtg ggaggttata ctgaccagtg atgagagtga 54421 cagagcatga gcttggctcc cgtatcctgt gttctgtgct cagcctcaca gatgactcac 54481 tgtgcctcat cgactcctct gtcctcgagt gaaattatgc cattctttaa cccactatac 54541 aagagacttt tgaagattca tgtgtggcta cccccaacaa tatctcattt gtacaatgca 54601 gaacatggtg ctttcttgtg taaacaaact cattttgtgg aatttttatt ttctagatca 54661 atatttttaa atagactact aattcatttt ataaaccggg tactataaga ttttatgagt 54721 tgacacttgg taccctgtaa gtatccaaat aatcaaagaa ggagaaagtg tgaagtcaag 54781 gaatccccta gctgaaaaga aattcccaat acctggaaca aaaaagatgg agcttttaga 54841 gagcagactt tttctccagc aaagatccac aacttccatt tttaaatgta agacgtaagg 54901 gagatttatg atacagccta ctttgtcaga tcatattatt tcataaagtg aagggtcaat 54961 taggtgttat tattatcaag aaagcatgct gtaatgcttc gtcttgtaga tgctgtaaaa 55021 attacctgtc tgattacttt aattcatctt ttggctactg tctgcagaaa ttcactcagt 55081 ccctagctca caggtatgca 
gacgtactga gaacactttg ggaaataaaa gcaaaaaaaa 55141 cattgggtcc tttcccctaa ggaatgaaca gtgtatttgg gaaactagga aaaacaaaca 55201 aacaaaaaaa tggtaccaca atgcaaaact gatgaggaga ttaaaaaaca gctcaagatg 55261 gagattttta gaaagaactt cttgaaggag gtgcatctgt aggctgagag agatggttta 55321 gcaagcaggc atcaatgcat ccacaaagct ttacacttca aggaaatcat aggagaaaca 55381 gggtcttctt tgctgaagat ctgtttattt tgcaaacaac aaagagtcag aagagccatg 55441 tgaagtttcc atggggaatc cagaaactaa acttctgcca ctggaaatta cctcaagatc 55501 ttttcccaga tagcctgctc ttcccctgct tccaatcatg tccctcaaag tcttcttttt 55561 caagcataaa tatccttagt tcttgtaacc actcctccta tgaggcagtt ttgcgtgctg 55621 ccattatttt aagcacccgc ctctgattgg actctagctt agggcctccc ctaagtcacg 55681 ggactcagaa cggagcaacg ctctccagga gtgatctgag gatgcagaac accatggggt 55741 agttgcctcc cttgatctgt agcctatact tctattaata tggcccatga ttgcatgagc 55801 ccttttggca cccatgtcac actgctgact catatcgaaa ttgtggtcaa ctaaaaccca 55861 tgagttccca tgcactacta ctaagccaga tctttctttc tttttctttt ctttttttct 55921 ttttttttga gaccaagtct cgctctgtcg cccaggctgg agtgcagtgg cgcaatctcg 55981 gctcactgca acctctgcct cgtgagttca agtgattctc ctgcctcagc ctcccgagta 56041 gctgggatta caggcgcccg ccaccacgcc cggctaattt ttgtattttt agtagagacg 56101 gggtttcacc atgttggtca ggctggtctt gaactcctga cattgtgatc cgcctgcctc 56161 ggcctcgcaa agtgctggga ttacaggcat gagccaccat gcccagccag atctttctta 56221 ttctgtactt acactgttga tttaaaaaaa aaaaaaaacc ctaggacaga ggtttaaatg 56281 taaagacatt tatttctatt tcactcgatc attttgataa caacttgtcc aaatctttca 56341 gagctagggt attagcttcc cctccctgca ggtgagtact tacctgctgg tgcagctaat 56401 ctcatgatgg atctctggcc tctaaagggc cagaacttca ttgtctcttt cctatccata 56461 aattctctgc aaccgtggat gaatctgcat ccttagacaa actagtgggc ctcaatcttc 56521 ctaagtaaga taggaataag tgtgaatggc catttcctcc ttgcggtgag gatatgcgac 56581 aatgagttcg atgtctgcaa aaccctaagg aaatgtcact cagaggcagc attagatgag 56641 gagagaaaaa gtttatttca ggctctcagc agctttcaga agacttgatt atcctgatca 56701 ctacgtgtga aagttaatga cttcattttt gcagttattt taaaaatata actggtttat 56761 
ctgatttttg gttgtagtaa catcacgtga tggtcagtat cattttacac attatttgta 56821 atgtgagcaa gtgccataaa gcatatctgc aagactcatt taaaatgact ctcttccctc 56881 catcacctcc cccagcccca tcctcaactc taccctcttc tgtttgtaat acgggactac 56941 aaagatctta tagtttaggt ttccgtcaga cttttggaaa cctttaaaag acttaccaag 57001 ctagcttaga agttcataga ttttcatccc tggccagcgc agagagctca caggtaactt 57061 gtataatatt tatcatttca agctcataga aatgtcatat ttcaaaacac atccctgaga 57121 gcagtttttt tgttccttat agacaggtgc agggaacatc caaatttttc tagtctgcca 57181 ttctaatgca cattgtgatg ttgctcttgt gaccctataa ttattaattt tccagtagcc 57241 cctccccaac ctttcacgtg atttttaatc cagggcctgg gcactcgtag cagagtgtga 57301 tcctaggacc agcagcaacg gcaccgccgg ggagcctgtt agaagtgagg atctcagacc 57361 ctgcctcaaa cctattgaat cagaatctaa actgtgaaaa aattcccctg gtgattctta 57421 tgcacagtaa aatctgagaa gaacagatct agagtgaagg gattttcact cccattttac 57481 ccgctgtaga ccttatgttg taatacaaaa cttacaaact tccatttggt gtttattaaa 57541 aggttagagt ccttagttaa aatgttacca gattcaaatg ttctgtttat atattcagct 57601 atttctcagt gtatgtaact ggctttaaaa tactcttgtt gaccaggcat ggtggctcac 57661 gcctgtaatc ccaacccttt gggaggccag ggtgggagga ttgcttgagc tcaggagttt 57721 gaggccagcc tgggcaacat agcgagatca catctctaca aaaaattaaa aattagccag 57781 ggatggtgat gcatgcctgt agtcctagct acaggaggct gaggcaggag gatcacttga 57841 gcccaggagt ttgaagttgt aatgagatat gactgagata ctgtactcca tccagcctgg 57901 gtgacatagt gagaccttgt ctctggaaaa aaagtactct tgcctttctg ggttcttgtg 57961 gagcaggggt tctcccccac tcccaccagg gacatttggc aatgccagga aacatttttt 58021 gtcacagttt gcagggagag gtcctactgg tgggtagaag ctagggatgc tgctgagaat 58081 cctacaatgc acaggatggg cccccacaac aaggacttaa ctggccccaa atgtcaatag 58141 tgacaaggtt ggaaaacacc agtatatgat ttataaagac tcagatgtgc gaccataaac 58201 catagaatgc cacaggaaaa tagagtcacg tggcatgtca ctatatgacc aggcaaagaa 58261 caaatagacc agcagtggcc ggagtgatgg tggtggggtt gtgagtggca gggaatggaa 58321 aatccagcca cagatgaaag gagacatcta ggaccaatgc taactggtac tccgtgcaca 58381 aaggcagtga gctctagagg acagggctaa aaggccggtc acggtggctc 
acacctgtaa 58441 tcccagcact ttgggtgtcc gaggcgggtg gatcatctga ggtcaggagt tcgagaccag 58501 cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaaaattagc taggcatggt 58561 ggcgggcgcc tgtaatccca gctactcggg tggctgaggt gggagaatcg cttataccta 58621 ggagctggag gttgcagtga gcagagattg caccactgca ctacagcctg ggtgacagag 58681 caagactcca tctcaaaaaa aaaaaaaaaa aaagaggaca gggctgatgt tcagggactg 58741 gacaccaggt atgagcccca ggaggggtgg aaggaggtgt gggagaaata gagagagaga 58801 aggcttcagt tctagagtgg ataccaggta tgatctcaaa atcgtgttag acatggaaaa 58861 caagaaagga tttcaagtat tagcatagga gtacacaggc agatttcagg taaatccagt 58921 aagatgaaga ctcaggaaca aggcagaatc ctgcttgcta gaactggggc gcctcagagc 58981 caggccagca gcaggcatga ccgatgctga gtcagccaag tgttcaacta gagtgagctt 59041 cttaaatccc tcccaactga gaaggggctt agtgctgagt ctacagcatg gttgacttgc 59101 aagaaggggt caagggctcc cagctatgga gaaaggtaga ccagaagtaa gccagaagta 59161 ggccagtgac agtggccact gaaaaatcac gggataagaa atggaacaga actcgagagg 59221 tcctctgggt cattccccta ctgcagcgtg cacctaaatc atttcagaaa gattatctac 59281 gcttcaactt taacattttc agagaaagat ttcttaaaga ccagtgggtg ccttataatc 59341 catcgaagtc acagcttctt ccccatttct aaccctaaac acctcttcct gggatgtaaa 59401 gcgagtgtct ttcatcccac cctgcacaca gtgtgatttc tatcattcag taatgataaa 59461 gaacacaaac tgccgtctgc aactgccaag cacagccacc ccttttccag gctaaagcat 59521 ctcaattcca ttagtctgtt gccctgagag tctctccctg accttttaat cccctcatta 59581 tcctctggac ctgctccaag ccttgcctgc ttgaaatagt taccaccaag tatgtatttg 59641 ttaaataaaa gctgtgtgca gactgccctt ttgccatgga acacaaagga ggaggaatat 59701 aaaagaatat acacagtttt gttggttttt attagtctct ctcctctctc tccaaggagt 59761 tgtcaatctt ctctgcactt atagtctagt tggagcacta gtccttaggc taaattccac 59821 caaaacattt tggttgttca gcaaacgctt tcactgtgac cctcacctaa gaacaataga 59881 gtacaccatc aaatgggata gctctaatgt gaatttagct ctaaataaaa ctaggataaa 59941 tgcatttgtt cagttctaga gaaggcatca tactgtcctg tggatttttc atttcgaatg 60001 agctgcaaaa caaagttctt ttccatggaa aggatttact caactgaatt cactcacttt 60061 gctgtagctg gattaatctc atttcacatc 
atgtctctaa gtagaaaagt aaactgattt 60121 acagcaggct aagtggatat gttgatgcat ccaacatcag tgagatttat tccaactaga 60181 ttgcaaccag aaaatggtgt cctggttgtt tttgcttcta taaagtagtc gcaaaagtgc 60241 accactttcg tactcgatct gaaaaacagt ttatcaaagt tacaacagat gaggtattgt 60301 gggatgcgac agaacgtacg tacgtcaaat gaaatgagca tttctgcctt cccagatctt 60361 gaaatggtct tttgccccac aaccttataa aaaaaaatat tcattttagg ccaggcacgg 60421 tggctcacac ctgtaatccc agcactctgg gaggctgagg caggtggatc actcgaagtc 60481 tggagttgaa gaccagcctg gccaacatgg tgaaacccca tctctactaa aaatacaaaa 60541 attagctgga tgtggtggtg ggtgcctgta atcccagcta ctcaggaggc cgaggtggga 60601 gaatcacttg aacccgggag gcagaggctg cagtgagcca agatcacacc actacactcc 60661 agcctgggcg acagagtcag actccatctc aaaaaaaaaa aaaaaaagaa aagaaaaaaa 60721 agaaataaat aaataaaatc attttattct atccccatgt tctgaaagcc accctttttt 60781 atttttgagt tgtgtttcac ggggcactgc tcaaggtaaa cacccagccc cacttgtggt 60841 tgagcaggca ttgaccccac actggctccc tgccctggag ccagccacca ccctgttcct 60901 tgctccacct tctgctttct gcagcacact ctttcttctt tttcttctgc atcactctgc 60961 atttccaaag ccacaatcgt gtcatcctct cgggactctc agtctctgtc tatgctggtg 61021 ccctctcctt tgtctctccc agattgctga ctttattcca gcttctgctt ctaacacaat 61081 aagactgcta ttggaaatgt ttttctgact tctatagctg gcagtttcta taagcagaag 61141 ttactttcaa aggaaagaac ttgtggagat gagcctgggg tgtcacaggg ctgttacatg 61201 ggtcaaatgt aataacgtgt atcaaaaagc aatttgtaaa ctgtaacatg ctcttcaaaa 61261 gtaaattatt actgtattat atgactttag ggtttaaaaa gcagctggat gatatgtaaa 61321 aattcaaggc tcagtagcat cattggaaaa gaaaattaga aaatggggag gggcacggac 61381 gcttccatca acaagctagg ggcctctcaa cattttttcc agccaaagca cacccattta 61441 gctgagagca tacactctct tctgaggact gcctgctcac cattctggac ctaaggcctt 61501 ccctcttgct taagctcccc catccctctc atcgccacct tcccaaatcc tagctgaccc 61561 caagggctaa cccccaatcc accctccctt cccagctctg ccccagatta agtgaatctc 61621 tttctctcca gcactcagtc ttatgttaga attatttatg tgggcattga tcttcctgga 61681 ttgactgtga cctgctgaag gcagcggtgg tcctggcatc tgccacagcc ccctccccca 61741 acagcacttc 
atcacctccg tgaacggaag tgaatggaag gaacagctca attaggatgg 61801 cagaggatca aggagagaat tccacggacc caaaacatcc tgcaaaaagg ctttgtcccc 61861 agaggttggt cttcaatttg gaaaaattcc tacctgagca actgtaggac ccctctgaaa 61921 tggaacaggg tctattagac tcataaaaag cctcagtgat gagttggtta ttattattct 61981 ctttactttt ctgtacattt aaagtgtttt atattttaaa ttgaatgtaa aaagatccca 62041 tcctatttaa aagttagtga gttgaataag tttcattata acttgatttt cttcctaaag 62101 gttttcctat agttcaaatt ataaaagtca ttcttagatt cccagcttag ttcccccatg 62161 aaatcacgag ttcttagact gtgtttcaca caatgccagg tcacattgca cccttactga 62221 aaggaacttc ttgtgcagag tgcggggctc ttggagaaag aaaagtacca cttttttagc 62281 acattaaaaa caaatccttg agctgtttta ttattttaat actttacctt tagtggagca 62341 gaaatgattt atttgatggt tttggcaggt gatttttttt ttcggaatct tctttaacac 62401 tattagccat gtaactattg ccaagtacat aataagtact caataattat tattatcttt 62461 taattaaaat gtgatgcact tctaaactaa aggtaacttc agtcatgtag aatagaaagc 62521 aatgggtgga gcttccaatt gcttctcttt ggtattcatt agaaatcatt tttgaaactg 62581 agaagttggg ggaaagatca gacattttta tggaaactgc attgatgaca gcagagtgta 62641 attttgttgg tagtgatttg atacacaatt atttgaggga tttactgtgc tcatggatta 62701 gcgctgtgtt gttactgtaa cagaggattg gcgcagagcc ctggcccagt gcaaagctct 62761 ctaacatcgt ttctcttcct tcatcagcag cccaaagccc ccctctctac aactccatgc 62821 agcacagtcc tggcacaaac ccctaacctg aaattccagc caatttcact aacaacacgc 62881 ctctcccctt cacctgccag atgcttctcg ttctcagaca ctgcaccctg aaattgggag 62941 ctttcactga gttaaaaagg caaatataga attacctagc tgtatcagta ttttaagtgc 63001 acttgtagac ttaaatgaga gctcttgtac aaaaccactt gtattctttc ttaagatatg 63061 ctaatttttt tctctctttg attgaactgt tgaccttctg gatatatgac tagttattac 63121 ttcaagtatg tttggaacta ttcttgccct aactgttcta aaatgtaatt agcattgaga 63181 cattattgta gaagcacagc acctggggct gggatctcat tagatggcaa aaattgagca 63241 gaattgtgca tgctccatgc tcaaaatatc tctaagaaaa attaatggca ttagggggaa 63301 agttaaaatt ggtgatttcg taggcccttt cttgctcaca tatttgattc gtataaggat 63361 ccaaacagaa aggagattgt ggttcttctg tttaaaaaat gaagacaact ctgctttctt 
63421 agaactcttc ttatgtaatg tggctgctgt cacatctctg ggtcaaccaa aaccaaaata 63481 atacattttc tggctctagc tttttccccc ctcaagaaat gcatagcatt tttaggcatt 63541 atctgatttt tccttgcaac aacagatgtt acagagaatg aatattctgt aagaggtagt 63601 taacaaagtg tggaaaaact gaggcccaaa gaggtactat aactcacggg tttttcacgc 63661 tcctccctga acattcccgc atctcttcta ctggggaggg agctctttat tttctgaatt 63721 gaaaaccatt tcatgcctcc tgctcagcta tttcagtact ggctcctaag tttggcacca 63781 atcccaactg ttatgaaatc gcagacgaat aataacgaag ttcttcattg ctgggatccc 63841 aaaaggttat catttcgagt tcattgcagg gcactttaga cgtccatatt gggccctaaa 63901 tgattggaac catcaggaga gggcttgcct gactctgcca ttcacctcat ttgtaagtct 63961 ctgccgtgtt tcgacactga gttgtgtctt gttgtttaaa aataacttct cattataata 64021 acattttccc ttaattgaaa aaatatttct gacctgaatt tattattgtc atagctgtgt 64081 gagcaagcag agtcttgttt gcctctacaa tatattaata atgtattaat tagcttccca 64141 gtgctgccca aacaaggaac aaaaaggatt tatctaaatc atgaaaactg aagtgaagcg 64201 aacactttgg taattatatt ttaagggcag tctcagcatt tgatagtgtg gcaaattttt 64261 gtcctgcctt ttgagagaag ataaccaaaa acaccactca attaacaaaa cgttgtttag 64321 tacctactat gcacaaagtc tgagaggtaa aagaggttaa gacaaataaa attgtccaat 64381 ttatttttat ggccatagtt aaaaggcaat aaaggcaaga cactgtgagt attttgattt 64441 agctacaatt agctgaaata tactacatca aatcatatga aattgacaat atctgggggc 64501 tggctgccgt gactcatgcc tgtaatccca gcacttttgg ggcccaggat aatcacttga 64561 ggccaggagt tcaaggctac agtgagccag ggttgtacca cggcactcca gcctgggcca 64621 cagtgtgacc ctgtctctaa agaaataaaa ataaaaataa atagaagata ttgacaatag 64681 ctgaccctat tttttacccc aaaagtggtg attttatata atttaatcta atattaaggt 64741 attcttgtca tactgtacca gctcctaaac attatacttt tcattttact tctatttcta 64801 agctatcatt attacaacaa ccaagagaag gtaaatacag gtcagagatt tgactcctgt 64861 tttttgtatg tatttatttt tgtagattca caaataactt atattttgca gttttaaggt 64921 ttctaattat attagaccct gttttttatt ttatttagag tcattgtcct tgctgcctgt 64981 ccttcacatg tgacaagtca attcttcatg ccatttcctt cacaatcttc atgcaaactc 65041 atcatctcca tttccatcta ttcttcagag atgtatgatt 
gcagaaaact cccctttttt 65101 gaaatattgc tcaatagaga ttaaatctaa cacccagttt tacctgtcta tccttgaatt 65161 acaggttact gagaagtcag aacagatagc catagaagta ccccactaaa tccattcttc 65221 tatctccttc aatatttgac aaatatttac taagcatcta tcttgagcaa gcattggagt 65281 aagtcctccc ataaatatga agttgtgtaa ataaggatgc atgcattcat gtaaatacaa 65341 gtttaatgct ggatatatat tgttttaagt gcctgatatg tattagtcta tttattccta 65401 tctaagggga taagtgccat tgttgtttcc atcttaaaga tgaggaaact gaggccagag 65461 agattatacg aactcctaaa accctaactc ttaaccacag cattcagaat ctatctggtt 65521 gtagtggagg acccaatgtc aaaaaagcaa gtgcttttcc cagaccatga acacaatgtg 65581 tgacaaatta gcagttcttc cccttgccct aagatagtaa atatccaccc aggccaactt 65641 gtgtctgtca gagaacaaag agtgggcttg tcacactgtt ttctcccaag agggtccatc 65701 tgcctccgct cctttggtac acgctagggg acacacactg tccagcatat tcttgcagag 65761 tcatatctta ccccacatct tatcacaacc ctgctgaatg tcaggcacta tcaagaaaat 65821 gttcactcat cttcttcccg ttatgcccaa tgtgtgaact gataacacat ctaaattcaa 65881 agaaaatcaa cagacatcac ctcaatcaac ctgaaaagtc tggtgactca ttcccaccct 65941 tctcacgggg ctcctgctgg gctgcctggt tctccggaac cagcctccag gcctcctgca 66001 gccggggtgc agctcagtgt tcgagccccc tctctgctgc ccatcagatg agtgttgact 66061 ttctttggca caggcatttg agttgcattt gaattgcaga acatgctgcc cccattctgt 66121 gctgactgtg accccacttc atgtgcccca gaggcaggct aaacaagctt gccttggaaa 66181 aaataacagg gaagaagatg ctgactctcc aacccccaga gtaagttccg tcactctgga 66241 ggtaaagtca taatatctaa atatggctcg attcatgtca ggtcttataa atacacacac 66301 acacgtggat gctgaaaccg tgcatgccct gacagccaaa aaaaaagtca aaacgactta 66361 ctgtctcatt agcaggacaa gacctgatag gccagctcac cagaataaag aatacaaaaa 66421 atgggtattt aacttctcta cacagacaca cacacacaca cacacacaca cacacacaca 66481 cacacacaca caatctgatg taacgacctc aagagagctg gttgccctgg caatgaagaa 66541 tcgtttgtat attcaaggat tttcttgttt aaatttaagc tagccacttg aacttctcga 66601 gctttcttga gcttagcttc aggcctccat ccccagacag aggtctgaca tctatgtcag 66661 cctctagcaa ggcaggaagc ttagaagtcg gaccaaaacc tcacaagcca gccagcctgg 66721 atttgaattt cagctctatg 
gtttctcagc tgtgtgacct gggttagctg cttaaccacc 66781 ttgaacctct atttcctcac ctgtacaatg ggaataataa aattaaaaca aacttgcatg 66841 ggttcattat gaaaattaga ttgattgatg acataataca tagcaagcat ttagagctgt 66901 gcctgattcc gaatgtgagt tatctgtaat agtgacttga aatgggttta cactgcctgg 66961 caggaaacca ttcaattcct ttttgttttt gactttaatc tttgaagcgc tttcatctca 67021 tttcttctta ttttgtgttt gtggagaagg aggatgttta gccatccttg gagtacaatc 67081 ttttattaaa aaccattatt aagtctcttc aagtttctct tttgtagtaa atgatctaga 67141 ttacttaatg gttctgtttt acatgtttta atcatttatt tgaaccatct ccagatgtag 67201 tacacacaga aatattgtct aggaaaatgt gttactgaaa ttccttatca tctgtggtcc 67261 attagctcaa atgtttaaaa cccattgttt ttatttttat tatttattta tttagttatt 67321 tttgcaagac agggtctcac tcttttgccc aggctagagt gcagtggcac aattatagtt 67381 cattgcaggc tcgaactcct gggctcaagc aatcctccag cctaagcctc cccaaaactt 67441 cttattctta agagcaaagt tgagttccat catattgtca gttggtttca ctccatcata 67501 ttgccaaaac cacctaattc tcagatgcat agtttagcac aagccaaacc agaagagacc 67561 ataggtcatt cattcattat tcattcaaca tttactaagt gactgctgta tgccagaggc 67621 tgggctatgc tgtaaagtca caaagaagat acttctcagt tgggagagct tacagcttga 67681 gagagaagac acacaagcaa acaaatcatt acaaatgcct ttccattggg tgccatactt 67741 gaaatgagaa gatgaggcca gggtaccctt ggggcagaag ggagggagga ttaagctcag 67801 ggaacagctt tacagagaag gtaacatttg caccatatta aaggatggca agaattgagg 67861 agtacaaaaa ggagaatagt ccaagaagaa acactcaggg atgatttgca cattgtaaat 67921 ttaggtaacc ttacatgcct tacatggggc tgctgtgact gaggcacagg gtcaaggaag 67981 ggagcagagt cataaagcta cccaggtcag ccaaggccag gctaggaggc ctgagcttta 68041 tcctgcaggt gtgcagagcc agaggagagt gcaagaagag acatggcaca gtctgatctg 68101 ttttgcacaa atcactctgg aagcaatgag gaggggagaa tggaggtatt tgagcctaga 68161 gtcaggaagg agaagtagga tgccagggtt gtggggaaag acgataaagc ccacacaaag 68221 gagggctgat ggaaagatgg aaatcaattt caggtatatt taggccaggt gcggtggctc 68281 gcgcctgtaa tccccacact ttgggacgcc gaggcaggtg gattgcctga gctcaggagc 68341 tcacaaccag cctgggcaac atggtgaaac cccgttctac taaaatacaa aaaattagct 68401 
gggcatggcg gcatgcgcct gtagtcccag ctattcggga ggctgaggca ggagaattgc 68461 ttgaacctag gaggcggagg ttgcagtgag ccgagattgc tccactgtac tccagcctgg 68521 gcgacagagt gagattcagt taaaaaaaaa aaaaaaaatt caggtatatt taaaatatag 68581 tgtttagaga cccattagtg gaagatacca gatttgagaa aggggaggaa aagacacgga 68641 ggagtccaga agaaatttcc agatttctgg cctggataac tgtgagggca gtggtgccag 68701 atagtgccag aatgaaatat atagattggc acaaatccat ctctcagaaa tgcacctctg 68761 agacatctgt tctgctgaag gtcagtaggg aaggccaact tcatgtgcaa agagcaatat 68821 gtgtgtattt cttactagct attcttttta aaatacttct taatatatgg gacatacatg 68881 agctaagata gtattcatgc aggaggtttc atttttcatt ttctggcaat tgtgttttca 68941 aatatgagat gctagcatta tttcagttat ctatgcctag catagtatat atctacggtt 69001 caagattatt aattgtctcc cctgcctccc cactagaagg taagctttct gaagccaggg 69061 attttgtctc ttttgtcctc atctccattc ctagccctta aaactaagtt tgactcccaa 69121 ttatcgagta aatgattgaa tttccatctg agtaaaaaga aacagtcatt tcctttacta 69181 atgaggaaag aaaaggaaaa caacaaacaa atctggactt aaaacagttt ttacaaaaag 69241 atccaatttg caaaaatgga actgaagttt aaaagctact tttctccctg cagctctact 69301 tctgggaaag tcacagctgc cagcttagat ggtggttttc actgcccggt tacaaattct 69361 gcacagttgt acatattggc agggctcagc ttccttaaat gccattggat cagtagggaa 69421 atggtgattt tccttattga tttaatgcca gttcctcttt tatgaaaaat cccacgtgct 69481 gctcactggc gtatgtacca taaaaactga acgtaggggg ccagaaatta agtaagtcac 69541 tcatgttctt ctccagccat gtgacgttaa accatctcta ctttcaggct ggagtgttca 69601 atcattctcc ctttaaattt agcagagtga ttattcctcc tggtggattg cacatatgct 69661 cctttgatct atgaaatact cattctggat gaactttaag ctcccaggaa atgcatgtaa 69721 aatactgttt ataaatcgcg ggagaaatgt ttcacattct tattactccc aagtaatcat 69781 accaagcctc tctctccaag attcgctttt tttcctagaa gagggaaaaa gaacttcggg 69841 cagtagcact gtgacaatca tcaccacggt cgtactttcc atgagtttta cgcagtaggc 69901 tcacggaagt gacaatgcat ggtcgggcgc ggtggctcat gcctgtaatc ccaacacttc 69961 gggaggccga agtgggtgga tcaactgagg tcaggagttc aagaccaccc tggccaacat 70021 agtgaaaccc tatctctact aaaaatacaa aagattagcc ggacgtggtg 
gtgggcgcct 70081 gcaatcctag ctactcggga ggctgaggca ggataatcac ttcaacccgg gcggcggagg 70141 ttgcagtgag ccaagattat accactacac tctagcctgg gtgacaaaac gagactccgt 70201 ctcaaaaaaa aaaagaagtg acaatgtagg atacccattc atcagcaaat atttaatgtg 70261 agcaggacaa tgtgctaagt ccatagactg caaagaggag tgaggctggg ttcccatcct 70321 ctggggctta caaggtagct ggagaagtaa aactcaaacc acaaagtgaa gttaatagtc 70381 accaatgaac aacatagtaa gctctgcact ggttcagaag aggacaggag taacaagaca 70441 gtaatcctaa aggagatgga cttgaatttg agctggacct gaagagtcag gagctagagt 70501 gagtaaaggc agagtggtgg gaacatttgt gaatcaaagc tcaggtgggg gttacggcag 70561 gcagggtggg gatagactga ggatgacttt gaatgccatg ccactctaag gactttggac 70621 ttgactctac aagcaaggtg gaattattaa gagattttga gtagaaatga tgtgagctga 70681 acagtgcttt tgagaaacta aatctagcag ggatcagtgc tccaaattaa ttggacatgt 70741 ttcagacttg agatcatgag accctgaccc aggctgggga caatgggatg tctccatcat 70801 acattacatt cctgtgaata cttcttacag attcatatga aattctgcct caatgtgcct 70861 actgtctctg cactgtaaga cagtgatatt gtgatcctgt gtgctgcttg gaaccagcct 70921 cattccttta gcaccccagt ttggacttgg agtgggccca agtgcagcct gtacacaaac 70981 ctcaaagaat ccaggcagca tagatgtaaa agcttggagc cctccaagtc cacctcctca 71041 acttaccaag aaataagcta actcttgctc atctccctaa tgacggattc agcatgtggc 71101 ctcagatctt cctgtgctcc agcagtgctt ctgaggccag tccaaattct gctttgaggt 71161 gtgcgtgtgc caattgtggt gcaatccaga agcattttgg acgaatgaag gagctgtttg 71221 tgttttcttg ccctaattga gatagttgaa gctcctctcc cttactctac accaagcccc 71281 ccagatccgg tacctccctg agtccccgac ttcatctctg ccttgcttct tttccacaac 71341 accctatcag ttcttgcatc ctgccattgg ttgtgctgtc ctcattatct gaagccttta 71401 cctcccctct taggcctcat cacctccagg agactctcca tggcctcagc ccctccctta 71461 ggtggaccaa gcttcctcct tggagctcca tgctaccctg catagggctc agtcactttg 71521 cctccatgat gtcttgcaat tgtctattta tatgtctgcc tcctgtcctt caactgtgag 71581 ctccctgagg aaaggaactg tgggtatgca tcattgaata ctcagcatct agcacggtgc 71641 ctggtacaca ctaggtattg aaatgtctga taaatgaatg acagttgcat atttgtcttt 71701 ttggtatatt cattcatttg gtaaatattt 
attgagtaat tacttataat tgaattggcg 71761 ctgtgtagtt acacttaaac atcaaaatga gcctatcctc tcaagctcca aattatgcat 71821 gcataaatta tacatgtgaa gtgttaacat gcacacgcag cagagagtgt gcgtatctac 71881 aaataccaga gctagcaaat gaaataaact gtgtgccaga tgtccctgaa atttacaaag 71941 aaccaagaac caaaaatgac agttagctcc gtaagcccaa caaaagaaaa aaaaaaaaaa 72001 aaggaaaaga gcaaaaaagc aaaagaacaa aagaaccttg agcataagga agtggctatg 72061 gttttttcta catgctctag ctgggggtgg ggaaaagtca taaaagcatt ctatgctgga 72121 ccggttcgtg ttaagtctga gttcctcaat cgctggctct caggagctgt ttggaaaatt 72181 cgttgccatg catgagtcgg gggattggtt ttttttccac tggaaattgt taatggggac 72241 tgcattagcg gccccttatc gctttgcctg atagagagac agcaggttcg cccaaacaat 72301 tctggaaccc tgatcactgc caattagggg agtattgtct ggctgacata atcaagtaag 72361 tataaatata tttatcgtga ttggaatggg gaagggagac tgcagaaaga ttgaggcaga 72421 agctgccacc cactgcctgg ctccagcctg gatccagccc agtttgtcac cctgtgctca 72481 gcagaggcgg tcactggctt tctgaggtgg aatgcacagt tgagaattca ggactagaat 72541 tgcttctccc agagagaaac ttgtctcttc tcatttttcc tcccctcttc ttctcttttt 72601 ttcatgctca cttctgtgag agctgaaaag agaagaaaaa gcagatgcca cgttgatcac 72661 gttttaattc tgcccccccg gccatgctat ccccacttac cttctctgta gtggaaagtg 72721 atggtgaggc acagtgccct gactggacct cccatcctct cggcccgccg gcatgtgcaa 72781 gttggtctgg cattgggttt atttcattgt tgatttgtgg agctttttgg gaatacctgt 72841 gatgtgaatc agccctggga ttgaaagacc ctgaaaaaac agtcccttgc agtgggctag 72901 tgacgggagg aattgctcac atttaaacag ctgaaaagca gacatgttta agtcacatct 72961 ggtggcattc ctgttgtctg aatggagagc aaggtctttt taagcatctg cccagatttc 73021 aaaatgcagg ctgcatcttt tcttcctcaa acaaatggtc aggatcggat atgaatggcc 73081 taatcgcatc gggagaggta aacaaaactg atctcatggc tggatttcaa ccttggtcca 73141 tccaggccca atcgatctaa ggcatgactt ttttttttcc tctcacccat gactacttga 73201 taatggccat ttccggggcc aagttcattt cggctctttt ccaccatccc ccgtgtgctg 73261 ccctgtctca ccacttgagg gctgaggccc cttcccaatg gctacaacag ggtaagggga 73321 ccctgtcctt ctgtctcctt catcctatag tttgtcttca cagctgcaaa ttgctctcag 73381 gcctgccttc 
ctggatttga caaaaactat tttcttactg gtactatgga taccacaagt 73441 ggagatgtaa gtaggaaggt gcaggtccac aaataaatag ggagcccaga tgggagaaat 73501 gtcattagag agatcactgt ccctaaaggc atctgaccac ccatgttcct gcctctttct 73561 gcgaccattt tcctccttcg gctgctcaga aattgtcgag gccaaactcc tttcacccag 73621 attccagaaa tacacttatt tcctttgctt tccaaatctt aatggatgaa acacctcttt 73681 ttttttgtct tctatctctt gccagaaata ttgttttttg agaaccagag ctcatttatc 73741 tccaagggag aaagaaattc attccatgtt tacccattaa caaattacca cttacatgaa 73801 aatactggga cttctctcat tctttgctaa taaatgggtt attcaccttg gttgaaccat 73861 ggcagaggca gatggatagc cactattccc ttcaagttgg cttaatattc tctcgttggc 73921 ttttgtctcc ttgaagcaca gagatttctc agagctttct ttgcttgatt ctgcttttaa 73981 ggaacgtgcc caacttattt gagtctgttt gtttagtaag ctgaattgat taaatccctg 74041 gcatttatgg aggagcctac tgtgttgggt gctatgggtt cctccccact tgaaagttga 74101 attggagtca cccatctgaa tactaactcc ctcctaatcc attggttttc ctgctgtgta 74161 gacaggggta tgaagtagcc atttaccaag agatagtgcg atgtcataaa ccgaacatga 74221 gttctaaagt tagcaagacc tggtttgtac agtagtgcag gcttgggcac gttgctgctc 74281 tcttttgagt cacacattcc tcattataga tgctgatggg aacaacagag ctgtctgttc 74341 atctttgctc ttctccataa tgagataaag ccaaacagac agtgaagagc ctggtctaac 74401 cacataacgt tggtgcaaat tatatatggt cataagaagg taatttttac ctgagcattt 74461 aggtggcatt tggtgtggta tttgatgcat gtcatttagt tacagagata aaacctcctg 74521 aacacacata aagctcttgg cctggttgaa cattacatct cacagatttt tatgtttttt 74581 actaaggata cttcctgcta ggaagttgta aaagagaaga tggaagtctt tttaatggta 74641 ttgtagcaat atcagtttcc tagctttgat actgtactat ggtgatgtaa gatgtcccca 74701 ttaggggaag tggggtgaag agtacagggg acctctgtac tatttttgca atttcctgag 74761 agacagaggg aaagagagaa agagagagag agaggaggga atgagtggag tgacatgaaa 74821 aaggagtgtt catcttggct tatgtccctg gctggacaaa tatttacatt ttgtttgcct 74881 ctttgtatag attttgtgaa tagtcatggt taaaagctag cattcccttt tctgtggaag 74941 agaagaaaga taatagaatg agctgagaat ccctgccaac attatattag aaagccacct 75001 acttcctgcc tctcctcacc ctccttacct atacaaactt accttctgca tctttgtcct 
75061 ccttctttcc cattcaaagg ggagatgcgt ctcctgctga acaaatgaac tcctccgcat 75121 gaccaaggtg tcccatccct ccctgctgtc tccaaaggtt cagtcctgct acataatctc 75181 tctaggtgaa ttccttctta ctcagggttt tcacaatcat ctatattgga atagcaccca 75241 aattaatagc tctacccaga cttttttatt gagttctata cctgcatagc caatccttga 75301 ctgggagtct ctacatagat cattcacaag catttcaaac acccctttaa ttatctcctc 75361 tttctgtatt cccttatctc agaaaatgct accaacatct accctaatgc ccccagcagg 75421 aactcaagag tcgttctaga cctatgctgt cccaagtact tcaatgtggc ttgtccaaat 75481 tgaaatgtgc tgtaagtgta aaatacacac cagatttcaa agacttagtt tggaaaaaat 75541 aatgcaaagt atctcaataa tattttatat tgatttcaaa ttgaaatgat aatattttgg 75601 acatattggg ttaaataagt tattaaaaat aattttcttt tactttttaa agtgccacca 75661 aaatatttaa aatggcataa tggctggcat ttgtgacttg cattctattt ctattggata 75721 acaccattct agactctttt ctgtctttta gcccccttcc aactggtcat aattcacagt 75781 gattcccttg agttaagaca agggagaata ctgtttaagt gcacaggctg agtttgaatc 75841 ctggctccac cattttttga tacatatgaa cttgggtgat tatttaaccc ctttgagctt 75901 cactttcctc atctgtcagc tggatacaat agcagttctt attttgagat tcttgcatgc 75961 tttaaatgag ataatgtgta aagtacttac tatggagata aaaacatgtg gaaagaccca 76021 gtgtcaaaag aaagagcatg atctgtcact gttgtagttc aggccctcat cattactagc 76081 caagactact gcgatgcttt ctccatcttc ggttcaccct gccctccatt ccttcatact 76141 gccatctgat taaacattcc tgaatgcaaa tgttttcaaa tgtccaatga cttcccatca 76201 tccttggtat gaagtcctaa gtccctggca tgctttcaat gtcctccatg atctgccccc 76261 agtccaatca ctgatcccat ctcctgccag tccatgattg aaccagttgg gtccccaact 76321 cctatttgca cacaccattc tctctgtgtg tcatatatga gctctctact ctgcctagct 76381 acttgtctgc aagactcagt tcaattgtca cctcatttgg gaagcttcca ctaatccctc 76441 ctgcacctgt ctccgtccta ggctagatta ggagcctgcc ctttgggttc ctgaaccact 76501 tgaggtgtgg ttcctttatc atgctgatct cactggtttg gctgacttgt ttttcctaca 76561 agactgaact gtctgaggtc ttaagatggg ctgtctagtc cacactggat cttcatgctc 76621 tacacagggc ttgacacaga gtggggcaca ggaaatgttt gttgaacgaa tacacaaagc 76681 atatggatac tcctgagaga gagaaggtgg gcaagacagc 
cttgtgaagt taagcagcag 76741 aaaatctgaa aacgaacctg gatagaaatc accaattcct ggaagtacag agagaggaag 76801 gagagagaag aatctgtgca aggagtcagt aatggaggac ttgaaatgaa aggctattaa 76861 agagagaaga tcattcaaac aacagaacca catttggtac tgttggtggt tttgtaactt 76921 cctctgaaat ggttccagga agagacaagg ttctaggatg gtgcaaatca tataaagaaa 76981 ggatggggta ttataagcca attcaaggat tatttgtcat caggactaga tacatgggtg 77041 gtacaagtaa atgtcaacct gatgactaca cgatgggact agtaataatt tactggggct 77101 ggtgcccagg aaattttttg ctgtggtgca gatgccaagc atggaaaaat ttgacgagtt 77161 ataggcatga gaaagtagat ggcgaatttg agagtaacaa tgtaaaatgc cagagtaatc 77221 acaagagaaa tgtctgaata tgtaagagag tttagaagat ttctgcttta ccacaaatac 77281 ggacttcaag aaaaattccc ttataagaac agaaattcca atatataaaa cttatttatc 77341 aaaacacaca gtattccata atatataatg gtaaaagcat gccttttgac ccacacaata 77401 ttaaaagcag caaataacac aagaaaggta gagaaaacag aaggtacaac attttgtggc 77461 tcacaggata tatgagtgag attatatact tctattccaa accacaacct tcactagaac 77521 atctaaaggc ttcattctag aaccttcata ctgcttatcc caaaggctag cccactcccc 77581 ggttttcatg taagaaatct tgtgggacag gcgttgcatg aggtgagggg cagcagggaa 77641 attcgagatg aaagggtgaa aagatcaaac gagtccttgc tattcttctt aacaatgctt 77701 cagtccacca ctttgagatt tctgtggaga aagtaaagat ttgggatata catatactaa 77761 aacaactatt ggttgcttat ctgaaaatta actttagctg ggcattctgt attttatctg 77821 acaaccctac ttggtgaggt gtacattttc ttattcatac ttcctgcgag ccacaacatg 77881 tcttaatagt tgttagctat taaggactat gcataaaagt gacaatagct ggctgggcac 77941 agtggctcat gcctttaatc ccagcacttt gggaggctga ggcaggagga tcacttgagg 78001 ccaggagttc aagatcagcc tgagcaacat agggagaccc ctgtctctac aaaaaaaaaa 78061 aaaaaaaaaa aaagctggat gtggtggcat gcacctgtag tcccagctat ttgggaggct 78121 gagacaggag aatcacttga acctgggagt tggaggctgc agtgagccct gattgcctca 78181 ctgcactcca gcctgggcca cagaatgata cctgtctcaa aaaaaaaaaa aaaaagtagc 78241 aatagctaac aactattgaa ttattgagtg cctactatgt gccagacacc atactaagtg 78301 gattagatgg atattccaca tattcttccc aagaacatta tgaggcaggc actattatca 78361 tccctgtttt taaaatgagg 
aaactgaggc tcagagatgt taaataactt gctaagggtc 78421 actcagaaaa ctagtagtag agaagggttt gaatgctggc agcctgactc caaggctcat 78481 gtgcttgcct accttaccca gatggcggca gcactgagca aggggtagat aacttcaagt 78541 cccctggaag aataagagaa agtgagacca tggggtatga ggggggcagg ccctgagtaa 78601 agctttaaag gagcagttga aattagaccc ggtgcctttg tttaagtaag aagaatgtgc 78661 tagttctaaa cagatgcagc cccatactgg gactcacagc ttgatgagaa gcgcactgga 78721 tgcatttcag cccttactgc aacctcagcc tgaggccttt atgcagtgag caacctgcac 78781 aaccatacat agcaatgtga gtttgatggg agaaggaaga ccagtgcaga gacaaggatg 78841 tcgatggagt gcttagagaa caaagggaag ctatttacta aagcatgttt gcatggtgca 78901 ggtgattttc taaagttaat tttgacagac aggatgggga aagattgttg aaggcctgaa 78961 atgacaagag agacttgagt ggatctggga ggtcatgtga caaacatgac attaacaaat 79021 tctaaatgga aaaaatgacc ctcagaaagt tgtgtgtcaa ggccgggtat ggtggctcac 79081 gtctgtaatt ccagcacttt cggaggccaa ggcaggtgga tcattagagg tcaggaattc 79141 aagatgagcc tggccaacat ggtgaaactc catctctagt aaaaatacaa aaattagcca 79201 ggcgtggtgg cgggcacctg taatcccagc tcctcaggag gctgaggcag gagaatctct 79261 tgagcctggg aggcgaatgt ttcagtgagc caagattgcg ccactgcact ccaacctgag 79321 caacagagcg agaccctgtc tcgaaaaaaa aaaaaagaaa agaaagaaag ttgtgtgtca 79381 aaagcttcaa cacacagggg cctttgctct gtcaagcaag gagaataggg tggaggagga 79441 acaggaacac agtagctaag gtgacagctg cctcacaaaa ggaaagtcta tggcttgcaa 79501 aagaggtcga ggaaagcgga aaagcaattg ccgaagaggg acagtaagtg aaaagatgag 79561 tttcttccac ttggcatctc cccaaatatg gccctgaagg aagagttgac cctgttacgg 79621 tttctttctt tttcctgtgg ggttattaga gtgacatata agaaaagggc cacaggcacg 79681 gcacagcttg atttcagcca ggctttgcat gacaccctac agagcagaat gaggacagat 79741 aggctgaagg ataactactt aactgggtag gtttcaagtt gatactcaga gtgttgatta 79801 aagggtggtc aatgacaacc tggaggaggt ctttaggtaa agaaactgag ttcaaagata 79861 accaacttgc atgaatgaag gagccaggat ttgaacccag ccctacctga ttcaaaagtc 79921 cattttccat ttattataaa acaaaaacca aaaaggcatg aagttcactt atgaaagaag 79981 agttgaaaga tataaccaga gggttaggag tacaaatcca tgcccgaatg aattgtgata 80041 
ggctgagaaa ttggccaaag ggggagaaca aaaacacagt gaaagtcaga aaggttaaat 80101 gtgaagccct gcattttggt ttcaagaaac ccgttgcaga ggtggagagg tccagcttga 80161 ccatcttcat gttgggaagc acccgggctt ttgttgtagt tattgttgct taattcccaa 80221 acagcagcgg ggtgtggggg cacacgattg tagccccagc tactcgggaa gctgaggcag 80281 gaggatgcct aaagccccgg aggtcaaggc tacagtgacc catgatcacg ccactgcact 80341 ccagcctggg caacagaaca agaccctatc taaaaaaaaa caaaagaaaa caaaaaacca 80401 gccaacagtg taacacaatt acaaaatctt catgtacaaa tgaagtgatt aaatcatcct 80461 attctactca gcccagctca gaccatatct tgaatactgg gttcaagtct agataatgta 80521 ttttgtattt tgagaaaaat attgaataac tgacaggaga cctgaggaaa atgaccagga 80581 gatcaagagt tggaaccttg tcacatgaag ggtggtcaag gactctgggg ttatttcagc 80641 ttagaaatgg gaatttggga tttgtgattc tcattttctg tttcttttgt tcgttcattc 80701 tttctttcat tctttctttt tctttcttgt tgtggttctt agcacagaat tcagatgaaa 80761 gagcttttga ttcaacgtaa acaattcacc aagctgtcag ctcactccaa tcccagccgt 80821 ccccgatgcc cattgtctat catctcaatg aattagggaa aaagactgac tttctgggcc 80881 acttccaact ttgagaatct aggatagggg cactatttaa tcaaaactgg caagtaaaaa 80941 gaaagagaac tattttaaac aagacaaata caagaggtgt tgaaccaaac tcaaattaga 81001 aagaacaaaa tttcccaaaa ataagaatat tataattcat ttgatgaaag gaatttgcca 81061 tattatccct tttactggca tttttttagt acctttactt gtaaaaatgt caggccctcc 81121 cacttttctg ggatagactt atcgaatgaa gcaaggggag cttctctatg gatagatgat 81181 cacttttgca agctttgaga tcagtgagct gaaaaaatga ccagcattcc aacaacaaat 81241 gagggcttgg gagcgtaagt aaaagtgaca gcaagattaa agtaattgtg gtcatgccgt 81301 cctctgctga cacttttgtg ttttgcccag atgagaatta aggtagccca gggcacaggc 81361 agtgaggaaa aggagagggg cgctgagcgt atttgcagca tttgtgaacc atgtcacagc 81421 cctggcgaaa acagagcaag aagatggagg ctttcaaatc catgataaat aagccatgct 81481 ttcagagccc aaatcagtaa aaacaaacat taagtgaaca gaagggaaga gggaaaaaaa 81541 gagaacatga ggccaccata aaatcctggt gatatattcc aggggaaatc atgatgtgaa 81601 tagaaaatga gaagtaaaag aaaatgaagc ctggagattt gtaaaatgtc tgtctgtctg 81661 tctgtctctc tcgactgtca ttcccatcta cccctgccac ccccaccccg 
ccctccaccc 81721 ttctgatggg gttcattaga tgatttctga ttcaggggtg gggggcggta tttttaatgc 81781 ttcacttatg tttgggtttt gccagcagga tccgaggtgt ttctgtagac aattgttatt 81841 gctttaggaa acaataagaa taaaatgagg aaacatcaag tgatacatca gtctagcagc 81901 agagtgaggc tcgaagtagc atttccagcg aggttggtgg tctctttttg ccttattttt 81961 gttgttttgg attgatgttt tcatggtgtc attttaataa gaaaagtgtc agagaattaa 82021 aagtggaagt atgcgagata cactttctct aactggattc atggtaagtt attaatgttt 82081 caaggagaaa gaaaagaact tggaaaaagc agaatatatt gtggggaaag ttgatgaagg 82141 gaaaggtaag agaactgtga caaatgagac acagacagaa ggagagagag agaaagagga 82201 agagagagag acagggtggt gggggggacc ggcagaagaa aaggaaaaga gcgacagttt 82261 gtaagtctat tagctgagat tgtttccatt tctgcaattg ccttaaagcg catgattgct 82321 tggtctgagg tctgccaggt cgattgcagt ctttacctct ttgaccaaga agtaatgggc 82381 ttgaatttcc tgggtatgtg aaaattgctc ttgaatgttt tggctaaagg acagccaccc 82441 aggaaggttg agagggccta agctagtgag cagtcatcct gaccaagtag gagggatttg 82501 gttatgttgc tcgtagcagg agaggcagag cagcaaagtg aagagaacac acagccttag 82561 gtgccaacag acatgtactg aaatctcttg cctccatttc ccacctctaa aatgcaaagg 82621 atgatagtaa caataataaa tgttgcaacg ggtgaccgat tataaagttc ttgtcacctg 82681 atagaccata tcaatccatg gtaacttatt cttagctggt tcacgtttct aggcattggc 82741 acagtctggg tggtgtgtgt cccctcaaag gtagcatcac agtgactgag ctctgcatag 82801 atggcaaagt gtagaagcct ggtcagtttt ctgtttctcc atccaccatt tctccccact 82861 tccccttggt tagtgctgtc ttctctcttt cctcactgaa ttgtcttgat tagtgtcata 82921 ctttctgata acactgcagc taggactcca caggcaatag ggagaagcta gcgcctgccc 82981 agggaaagtg aggaaaaata ggtgcttgag aggctgaaaa gtagtatacg cttggtggcc 83041 ttacactttg tagaaacagt tttggatttg tctctacaag aaaggattgt agataaccag 83101 aatgcaaaaa tgctttgaag aaagagacat tgagcattgg aaccaagtat tctacgtcct 83161 cttccctgca acactgggga ctaaggatta ttatctgcat ttctggtgaa gaaactgagg 83221 cacagagaag ataaactatg gttctccggc tacaaagtaa gcagcagagc ctagagtttg 83281 aagccaggtc tgacacctaa gtcccgtagt aaacctccca ctgaggtagg caagcctcgc 83341 cttgcagaga gtggcaaatt agggaaaaaa 
gaatcaaatg tgcttgagtt taacagaaca 83401 tatacattat acttaaagag aaagatactt ttcttacaaa aagtttgagg acaaatgaga 83461 cttcatgtga tgcgtcttgc actaaaaaaa aaaagttaga ttatataagc cagactggcg 83521 ttgattaaag aagagcaagg taataaatac gcataagaag aaggaagtgg ctggctctac 83581 cttgggcatc acccaagtta caggcagggg aggggatagg ccctgcatgg ggacagtctc 83641 ctgacttgtc ttcttccttc tctacttccc tcccttcact acccctccct ccaaaaccca 83701 aaaaacctta agcaggaaaa aagggagcta attaaaaaag agaaaaacat ccttgagacc 83761 aatgaagtct ccataatcca cagtgacaga cagggacctg cagaacagca atttgattaa 83821 ataaggatat aagggattta aaaggactct ccagttgtgt aaagtgtgtc catacgtctg 83881 aaaaataccg tatttatttg gctcccaaac ctgaaaacct tctgacgggg gagattgttc 83941 ccatttggaa actaggaaga aatcacaagg caaggccatg gggttctggc agggctgggg 84001 atcccatgtt gggtggaaac cccaagaagc cagccaaggg tctgcaggag gggccctggc 84061 agagagaagc attagaatgc aagggcaggt aaagccaccc tcacagaagg tgactgtggg 84121 ctgagcctga ggcccttccc agatggggaa gagagaacag cacacctctc ctgcccacgc 84181 ctcggtatct cactcaagaa tcagctgccc gttagtcacc tgtcatcgga gacagctctg 84241 ttggcggctt tcagaaaaag cgaagcagcc agaggatgaa tcgttattcg gggaacacac 84301 ctgcacacag gacaggaatg gatgggagtg gggagcgccc tggggctact gttcaaacgc 84361 catcctcatg gtctccaaga cttgcatcag cagctcagcc gctgatgaat tataattctc 84421 ggacaagctg cagaggcccc cctcgttcca tgtgttcacc cctgccgctc atactggggg 84481 atttatgctt cctggggcta ggaggagaaa ataatgagtc actggggaca agagaaaaac 84541 aaaaatgccc tgtatgtgca aaaccaccta aaatggaggc ttttagagga cactcaaatc 84601 ccagccaact agccagaaag ggttttgctg ggggtatatt ttatgtgtct ttgctatctt 84661 tggatgcagt aaagccagaa atgtttatat ctcttggagg gatcacctgt gcaaacagga 84721 caggtcctct tgagaaggct gtttgttctc ctgggatctc gaacctgaga agaaatggtc 84781 ctggcctggc gcggtggctc gcgcctgtaa tcccagcact ttgggaggcc gaggcaggca 84841 gatcacaagg tcaggagatc gagaccatcc tggccaacac ggtgaaaccc catctctact 84901 aaaaatacaa aaaattagcc gggcgtggtg gcaggcgcct gtagtcccag ctactcagga 84961 ggctgaggca ggagaatagc atgaacccgg gagctggagc ttgcagtgag tggagatctt 85021 gccactgcac 
tccagcctgg gggacagagt gagactccgt ctcaaaaaaa aaaaaaaaaa 85081 aaaagaaatg gtcctgcatc tctgacttgg gaggtccaat taagcagagg aactgttggc 85141 tgtaaacctt ctaaagcaga ggatcctttg ggaggctgag gcaggtgatc atctgaggtc 85201 aggagttcaa gaccagcctg gccagcatgg cgaaacccca tctctactaa aaatataaaa 85261 attagcggac cgtggtgggc tgtacctgta atctcagcta ctcgggaagc tgaggcatga 85321 gaattgcttg agcctgggag gtggaggttg cagtgagctg agattgtgcc actgccctcc 85381 agcctgggtg acagggcaac tccatctcaa agtaaataag taaagcagag gaccctgatg 85441 gccaccccca agttcatagc tacctgcaca ctttatgtgg tgaagcactg gtggggggcc 85501 caggaagccc tgcatgagtt ggttgtagct gccatgattc caactcccac gccaccctct 85561 gcacctagcc acagagccgc atctccagag aaaaccagaa gcaagacccc accaaaaaag 85621 aatgtgagag tctattttaa aggctccgga agatgagtta gaagtgattc aagagcaggc 85681 acagtaaagg aatatgtaag ccttggtaga cattcagtaa gaagggtaac atttccaaca 85741 tgaccagctg attaacaggt attctctaag cagggtgtct gctggtttag aatagagcaa 85801 aaatattgtc atctgagagc ccaaaatggg gaaaggctac tccttggtct cttggtgagg 85861 cccagcctgt tttgtttttt gtttgttttt tttttttttt tcactttccc actttatttt 85921 cagaatagat tgatttgggt ctctagaaga atatttaagc agtggcttac aggcgagcca 85981 taaggtagtt gtccgttcta tgcttacaaa ttttcaagaa gggccatctg taacctttct 86041 cagtttcctg ttcctatgtc caatgcccaa taagaggtaa taaactaagg cagaaaggcc 86101 ccaaacacag atgtaagatg cagagcaaag aataacacac agtaggggtt tcaatacagg 86161 atcattaacc cagaatccaa gcaggagtga gggaagtcca ctcaggagga atgggaacca 86221 cctctccagg ctgctcagga ggaatttaag catcagacag tgtgtaggag aagatgacca 86281 cgaattcttt gacatctaag caattctgta ttttaaaggt ttgagtcttt gaaatgttaa 86341 aattgatgcg ccttaaaact catgagagac aatgaagaca atcaccgaac ctgtctagtt 86401 aaaggttacc gggactcaag atgcaaagaa aagttacgag atggaattga tggtttgggg 86461 tgcaccattg ccctctattg gattttaaat aaatcagacc tcttagtgta ggtatgttgc 86521 actatctctg gctgagtttg ctgagaagtc agggcacctc tgagtgggaa gttaattgat 86581 ccttttaaat ggctccttct gtctcctgct ccttccctga ggaaatcaac cagagtaatg 86641 ctgagggcag gtcatttgca tgtcttcatt catccaacaa acatttcctg gctgcctgct 
86701 aagcccgggc taggcattgt agagaatcta agaaaaaata acaactgcca ttcgttggac 86761 acctactgtg tatcagagtc tacgcattaa tacagaacaa ccccacggag taactctggt 86821 gactcccatc ccacggatgt ggaaatagga cactcaggaa gtggcacatt ttggattcaa 86881 atccaggttc gtctactact aggcctatat tctgtccact aaactacaac ttttcccctt 86941 aagaagttta taaactatta gggggagata ggatatagat ataaatgtat atatcagggt 87001 ttgacaaact acagccagtg ggccaaattc agcccatcat gtgtttttga ccttgagcta 87061 atttacattt ttacatggtt gaaaaaaata tcaaaagaat aataatactt cctgtcatgt 87121 aaaaatgcta tgaaattcaa atctcagggt ccacaaataa agttttatgg gaaccctgct 87181 atgcccattc atttaccatc atctaaggcc accttcctgg ctacaaaaac acggcgagtc 87241 attgtgacag agactagatg gccaacaaag ccaaaatatt tactctctgg ctctgcacag 87301 aaaaggtctg acaattccac atctgtactc caaggtgaaa tgttctatat caacaggaga 87361 ggtgtttttt ttttttgtct gtttgttttg ttaagtgctt gaggagttca gaggagggga 87421 aacaaatgtt tatttggccc acagtgtgca gagctggagg ggataggcag gtgtgctctg 87481 ttttaagact caatgcatga aaaccctgac tttaaaacag ataaattaaa aggtagttat 87541 ttgcataaca aattttcctt tcattttata aagcgattca ccacagggtt cctttataga 87601 gtgagcctcc ataatacatg ctaatatgat tccatttaca aatactctat tcattataac 87661 ttttcaagaa tatacctatt gtgtacagtg aggcttaact gtaatcattt ttcctggcag 87721 tataaggttt cctacttctg aaaagtgatt aagctgacct tgtcaattcg actgggggag 87781 gcggcactct cctctactgt ctccgagagg ggctttacct gtcgaacaac aaggctgtgg 87841 attctgtctc tgcattgtgt gatgttgagt agctggtata attttacatt ttcactaagc 87901 tcctctcagt agtagcagca gccaaagtcc taaagtgtaa actactctca ctattctctt 87961 acttcatttc gtatacaaga tacactaata aaaattattt gttggctgga ccacggtggc 88021 tcatgcctct aatcccagaa ctccaggaag ctgaggcagg tggatcactt gagcccagga 88081 gttcgagacc agcctgggca acatagtgag accccgatct ctaccaaaaa aaaaaaaaaa 88141 aattagccgg atgtgatgac acacatctgt ggtcccagct actcaggagg ctgaagtggg 88201 agaatcctgt gagctaaaga gttcaaagcg gcagtgagct atgatcacac cactgcacat 88261 cagcctggtc aacagaggaa aactttgtct ctaaattaat gaactaatta attaattttg 88321 ttgctgatgt cctgacaaat atgtgacaag caattcttgt 
aaaaagcaat gtttccctct 88381 ctttttaaaa aatttcaaga cattatctta tctgaattac aacataaact ggggcaagta 88441 ttcagattag agctctcccc agcaaaatag gaagtgatag caatttagga cctgtctcct 88501 tctctcactg ggtcctgagc ctgaatcttc acatagcttt gtggtgttat gcaattgccc 88561 tttgtaagag atacataaaa tggaaacctt agctattcat agccaactta ggaggagctg 88621 aatttgatca agaagtttct tgccgggtgc cattttgatg gtaactacgt gtttctgaac 88681 aaatccactt ataatctggg tttccaatgg aggttatttt aaagagctgt agcatggctt 88741 ttgttttcat cctctgtggt ggctatattt tagaagtaga caagcattcc tggtatatag 88801 cctgaatatt ttttaagatt tttttggtag taaaagaagc tcacatggaa aggcaagggt 88861 agaaaaaatc atgttcaggg cagtcctgtg tggcattacc ctctggtgaa gcagatccac 88921 actgcatgct gctaatggct attgcgtgac agctcttaaa ggggtaagcg agagaggtgc 88981 agagggctgg tgagaagcag agctagcact gaggccctac aaatggcatc accatctatc 89041 aggttggcgg ggtgggggtt gggggctgct gctggaggtg gtttggctgc ccctagaggc 89101 tgcagcgtct cacactgtat gggcaggtag catttcagcc atttctaaag cccatcactg 89161 gcagggaaag cacatttaat ccgtctgtga ggttctctaa aaaggggcct ctcttttctt 89221 tctttttttt tttttttttt aagatggagt ctcactctgt cgctagactg gagtgcagtg 89281 gtgcaatctc ggctcactgc aacctccacc tcccgggttc aagtgattct cctgcctcag 89341 cctcccaagt agctgggact acaggcacac accaccatgc ccagctaatt tctgtatttt 89401 tagtagagat ggggttttca ccatgttggc caggatggtc tccatctctt gacctcatga 89461 tccgcccgcc tcctgggatt acaggcgtga gccactgtgc ctggccgagg ggtctctctt 89521 ttcagaagtc agaagcgctg aaattgaaaa cagcagcaag aaggcttcat tgcattcttg 89581 aattcaggct gttgcactcc gccccccaca actctgtcta aattgttgct acccagttta 89641 tatctgcatc atattttttg ttgttgtttc tccaatgcag tatgtagtgg tggcaggtac 89701 agatcataaa acaaatggat aatcagattc ctttttccct ggtggtaatt tttaaacttt 89761 tcaaaattaa tgctaatgga tatagaggaa tgggatatac atttccatgc atttttatga 89821 ccaaacagaa aaccgacccc tgagggtgac atatttacta ttctctacta ctaggggtgc 89881 ttcaagggaa gagtttttga agccctgtgg tttaaacaaa aagagaacat accccacccc 89941 accctgaaca aacagcagct gggcagattt catctcacaa atggaaatgg gagcaaacaa 90001 attttttcaa agcgatttta 
taaagagtgg gcctgtttag ttgggttttt tgagccagga 90061 ttaatattac aagctgggga gctctgtgac atggctgtgt cgttggcttt gccataccgc 90121 ctacctcctg ggggaaggca gaggaaagtg ctgctattca gaagcaattt tacttctgcc 90181 tcggccttct cacttctcac tgaagtgatt tacagccagc ctggacccag tgcccaagcc 90241 tgagctgagt tacatgtcat gttggaaaga tcttccccaa atcaaggtct tatctagagc 90301 ttgcctggga tattttttca tttctaattt atgtaacttt ctaatatgtt tagcatttgc 90361 ccttcccttg tctccatcct gcttattaag gtgaatttgc atacaaatgg gaccttggtc 90421 ttcaggacca ttcacctgac acttgatccc acgaggcgac cactgtctac tgttgatacc 90481 atgaggatga cccacagaag tcatgtctct ctgcaaacta gaggcaaggc cttaattaga 90541 agaagccttg tgtgttaaaa tgatggggtg aatcaaatca gcaaaggcat ttagcattaa 90601 cttaggaggg cagaatactc cttcctgttg agtgggtcgt cctaggtcac aacaatttta 90661 atttgccttc gatagatgga tcttgaagaa agtgtacaaa aagtatattt gtgatggtgt 90721 tgtttgggag tacgttttga ctcaaagagt taactgtgac attttacttt ggctcctggt 90781 ctattttttg cttaacatct acctttgaat gggtaatagt catctctata aattaccttc 90841 gaaagattga acaattgtgg agaatcagag aaatgagccc cttgctatca ccctgttgca 90901 gtcggcttta atcctctgat ctcctggcgt ttcaccctct cactggcagc caccttcctt 90961 atttacccaa atcttgttaa cttgggtagg aattcccact gagaggacaa atgtggtccc 91021 agttgactgg tcgctgaaag tctattaggt tggagagtat ttattttcta ccctaaaggg 91081 tttgaaactt agggaaattt tgatcaacaa gaaaataaca gctttcaaga ttgtcctgac 91141 agttaagctg tcttagcaga gaatcacagg atttgatagt catggagatt agttctcaaa 91201 tttcaatgag aaaagttcat aggagatgga gtgaggaaag aaaacttgtg ctgattttgg 91261 tcaggctacc tttgctgggg aaaaattcat tccataaaat tattttaaaa cgtaaatact 91321 atttagcttt atctggtgta atactttgca tctataattt ttttgttttg ttttgttttt 91381 gagacagagt cttgctctgt cacccaggct ggagtacagt ggcatgatct tggctcactg 91441 caacctctgc ctcccaggtt caagcgatta tcctgcctca gcctcccaag tagctgggat 91501 tacaggcacg tgccacggca cccagctaat ttttgtattt ttagtagaga cagggttttg 91561 ccatgttggc caggatggtc ttgaactcct gacctcaggt gatctgcccg cctcgacctc 91621 ccaaagtgct gggattacag gcatgagcca cagcgcccgg cctgcatcta taatttgatt 91681 
taaccttgca actttattaa ttgaatatta tgatcaccat tttatagatg aggaaatgga 91741 gactccaagg gtttaattac ttgcccaaga gtctacagct atctaccaac aaaactggga 91801 ttggagccag ctgtcttacc attacatccc cgctgcctcc taaagcacag ctctagttca 91861 tactaaatgt attacatagg ccgactcatt tatctttaca tgggtctcat gcattacaag 91921 aagggggcca ctattttaac tcaattcaaa aatgttaaaa tcaaaattcc caaaatgtac 91981 ttggtttata atatatcata aaataagtaa tctgaccaat tgagcctccc aactcgaaga 92041 ccaagaactt ctttctactc ccaaaccatg cagaaggaag ctgagttccc cactggccaa 92101 tgctgtctaa ttctcttaaa tgattcagat gtaaaagagg aaatggcatc cgtctaatta 92161 acactgcctg gttaaatata gctgccaagg gacagccttg ggacagtgtt tacagaatag 92221 tctttccagc ccctcaggcc tcaagttaca gatgatagca agccttccct ccttatcagc 92281 tgcaaacaga gatgccccca aaagtggtgt caggccagga tgatgaaagg agaagggaaa 92341 ggctgcggat ggccttcatt tcctgctcct ggcatctggg gattccgaga atgatcccag 92401 gacctagatc ctatagattc cggcagctct tcctgcaaac acaagccctt gttgtgtgcc 92461 tgagccttgc tctccggcct ggagacatcc atctgcccac tctcctagga actgctgatt 92521 accaggaagc accatcggct ctgctgccct ccccctggca gagaggcaca atgaccacat 92581 tatttctctt ccagcttcac gttacagcat ggaagctgtt gccaagtttg atttcactgc 92641 ttcaggtgag gatgaactga gctttcacac tggagatgtt ttgaaggtag gtgacctggg 92701 gccccagggg agcagtaggg agtttcagtt actcagtaat ctgacagtcc ctgcctttgt 92761 ctcttaagaa ctgcatgtgg ttccttttaa aaagagaagt tcaccttgat tccctttgtg 92821 cacccttata ccagacacca accaggccgg gtgatggaaa tacagtcggc ataagccacc 92881 atccttgtcc tgggaggttc gtagtctgga gaggggaaca catccaccca aagggctgtt 92941 acagatgttc ttggctaagt tctcagggag cacaggtgag ggaacaagtg attccatctt 93001 gaaagatggg ggttgtagga gtgtgtgcaa tgtgaacgtc aggaatgcac tgaaggtgac 93061 ttgggctttt agctaaagct cagagatgtg caagcccatg ctgaatgcag ggagcagcgc 93121 caggctcagc agagctggaa ctcaggagga gaggttgtgg gtggagacag cttgggaaag 93181 gcagccacag aggacatggc tcaacaaagg ccatgcacac ctggccactc aggggctttt 93241 taaagcttgt cctcatcctg cacttgagtt gtagtggcca cttgctgagc tgaagtggcc 93301 ttggacttct cccacattca tgcaccatac tttcagactc tgcatcttag 
cccttgggtt 93361 tttgttttgt tttgtttgtt tgcaaaagag cataagccct tcaaaagggc atttgggagt 93421 gtgtggagcc agaccccctc tgcctgacct cagagcaccc tcagatagcc tgatccctgg 93481 tcctcagata gcctgatccc tggtcctcag atagcctgat ccctggtcct cagcagtccc 93541 aatgagcctt ctcggacgcc cagcaacact gaggccatta gtgtctcacc accgagaatt 93601 aggtttgtta tacaaagaac cttgtcagca tgtgagaggg tgacattgaa agtgtctgag 93661 gagcacaaaa caccacagaa tggcttcttt tggaatatga acatgcccat taggtagcct 93721 tctaactgct ctcaactttc aagatctagt aacatagtgg caaagcatag gccttcacaa 93781 acttaaaata tgagttgggc ctcctgaccc aggaaaacct gtctgtcact gctttggggt 93841 cttacgaagc tcctggaggg acttgccaag cagcgagaag ttcctgcctt ctctgtgggg 93901 cactgtgatc atgtgactgt catccactgg agaccatcct tgtaccaact tccaccctcg 93961 tcttccatta caaagagaac ctggtatctt aagttttctc tattcctcag aaaaggaaaa 94021 aggaaagaaa aaaaaaaaaa actttgctca actctgcttc ccaatccagc tcctgatttc 94081 tttcctcccc tttgttgcta atactgggaa gaattctcta cactgggcgt gcttcacttc 94141 ctcacttccc actcattcca caactctctg gaatctgcct tttaccctcc ctgcaacatt 94201 ctcttgaagt tcaccagtga tctccttaat gtgtagtata atgaccctat ggaaatgtct 94261 gtgggattga cactgctaac tggccctctc ctggaagccc ctttctccct ggctttggca 94321 gcaccaccct ctccgggttc ttctcctccc ttcctgacca ctcctacagg tctcatttcc 94381 aagcattctt tttcctctct tgatcctgaa atatacactt ccctgaaggt gctgcctttg 94441 gctttctgat ttttcttctc actctgagcc acctcatcca cttcttcagc tgcaactatc 94501 acctctgtgc caaagaccct caagtctgtg gctcgtgttc tagctctggc cttgctcttg 94561 aactccagac ccagacttcc aattgtctgc tggatgtccc cacatgaact ccagccccta 94621 actcaggatg ccgagaatca ctcaaacccc catgcccatc tcacccccat ttccacccaa 94681 gccactcttc tttccatgtt ctaattatta tcctaggtca gtcacaagct agaagcccca 94741 attatcatca atgcctcctt taccaaatcc tggtgaccag taaccaacgt ctcctgagct 94801 cctgttcagg gccaggatct agctaagtgg tttgcaggca tgatttcatc atctcagcaa 94861 acctgtgagg taggcattca tttcccctcc atttcataga tgagaaactg aggctaggga 94921 ttaaatcaca tgtccacagt cacacagcta gcatgtgacc aatccaggac ctggacagaa 94981 ctctgtctga ctcctaagca ctaaactctg 
ctctgaatgt atttttttcc tcatatctct 95041 caagcaagtc tcttcttcac cagaaaggag tgtgaggcag taaaaaggag catagacttt 95101 gagtcagatg gatctgagtt tgaatcccag cttactaacc atggagactg gggcccagtc 95161 atctgccccc tttgagcttt agtgtcttcc tcagcaagtg gaggtagtat ccatcttgcc 95221 gggttgtgga gaggattcga ggtgatgacc ctgtcccaac gtagacgccc aataaatggt 95281 ggcccgcatt atcgggatca tccatccccg ctgcttctga ctgctcagat cattgccaca 95341 gctccttgac tgacctcatt acctcttgtc tttttcctcc ttcactcatt cttcttacta 95401 gtcccagagg gatctttcca gacaaatcag atcatatcat ttccttgctc aaaagcattc 95461 aggggtttcc aattaccaat tgattgagac tgaaaacatc tggccccaaa ccactgtcca 95521 gtgtcattat gcacatgtac acatgtgtgc acacacccac acagagacat ctttgtagat 95581 gtaacgcccc caggttgcac tgcaatgtat acaccgccct ttctcagcct ctcccctata 95641 tactctgatt gcccctctcc tacacacaca tacttgacct cctctaccat ggaaacctcg 95701 ctggtcctct ccactcccta ccatcccagg gaaacaaatc actccctctt cagctcctcc 95761 ccaacatttc acataaactt cactcatttc caggcactta tgactttgcc tcacatggca 95821 gctatttata ggcaggccat gaccctgtgg tctcatgcac cctctatttc tacctctcat 95881 tgctgccttt gccccgtggg atcctaattg ctcgtttatg tgttgtctac tccactagca 95941 aaagctgtgc aagggcaggg gctgtttctg cctcaatgtc tagaagagct gtgcctagaa 96001 ctccctgaca cttgacgaat gaatgaactt tcaattcctc tccattccct gagagcaatg 96061 tcatgcacgt gtgcacacaa tgccaggttc gtctttgtgt cttcacagca tctaacacaa 96121 gtcatggccc aaaatagatg cccagtacat atttcttgaa ttgctaaatt gcatttcttc 96181 ctgttcattt agaattccat tggaaagcca accccctccc tcttttcaaa ttctgtccat 96241 tgcaggtaaa gagcaaaatg gggagggctt gttgactgag ccccacagaa tgtctacagg 96301 atggtgattg gttcccatgg ctgggactcc ccacatgtga ttgcaatgtc tgccccccac 96361 cttcctcaat gggtaacaca cccaggatcc tcacaaatac tttcaaagcc agctcctgac 96421 taccccagca attcgcttct gggattctct gacttgtttg gccttttgtg ctaaagacta 96481 aacagaggtt ttcattagaa aggtacaaaa aagtctgagc agccacccac cctcaacaga 96541 ccatcagttt cctggggcag aggccccagc ttatttatct ccatatttca tgcacctaag 96601 tgcaatcagc aggtgctcac tagatggtga tctacacggc atgtagatca agatggtgga 96661 caggaaaacg 
ctgccaaatg acttgatcct gcttctccta tgacggaaac atttggaaac 96721 tcaggaatca gatttttttc ctattcatgc cacaaaccac catacctgca gctacacaca 96781 gcaccagcct cacatttgaa tttgccatta aatttgccag tagctttgaa aaatatctct 96841 tgacccagtc catttctgtc tccttctgta atgcagctaa tgcgacctca tttcttcctg 96901 ctaaaatagc aacctaaaag cactaagagt taattccaca taattcattt cttcaatact 96961 tcagattttt gctatgtttc taaaagctac aatttcatcc aatttaaaaa ttcttccagc 97021 cgtaatttgg cccttctctt cccacgtcta acttattcac aggccatgta acccaaaaaa 97081 tcatctctta gggaacatta taagaattaa gaaacagtca agagtaaaac ttcaagtgtt 97141 caggctgaac gccttgctca ggtttacacg atgggtacac gagatagggg gaaccgaaca 97201 gatggagaga tgagaagaac aggaagtgtg gggccctggg cccacaacgg caggaggaaa 97261 ccatctcccc tcccctcaaa tacttttctt tgcagaaaat gagaattttt tttaaaaaaa 97321 aacctcacct tctatacatg gtaaagagcc tgcaaattca cctctctaga aacacagtgt 97381 taggcagacc aagcaccagg tggcggagag aaatggctgg cattgtggga taatctttca 97441 gtttctagag gaggtctgag gcatgataac taaaattaca gagctgcttt caggtcagaa 97501 aatgaagtac aaagtgtgtg gttttttttt tttttttttt ctttttgaga cagagtcttg 97561 ctctgttgcc caggctggag tgcaatggcg caatctcggc tcactgcaac ctccgcctcc 97621 tgggttcaag caattctcgt ggctcagcct cccaagtagc tgagattaca ggcatgcacc 97681 accatgcctg gctaattttt gtctttatgg tagagatcag gttttgccat gttggccagg 97741 ctggtctcaa actcctgacc ttaggtgatc cacccgcctc agcctcccaa agtgctggga 97801 ttataagcgt gagccatggc acccggcctc aacgtgtggt tcttacggtg acaaggtagt 97861 ttccaaatgg cctcttttac ccctttctgt gtgacatttg ctaccctggc caaacaccac 97921 attcccatgt gacgggagaa aggaattcca atgccatgaa ttcagaagcg atggatggag 97981 tgcagtgtgc tcttgtgttt ttacgtggct acgaggcgct gctgctgggt ttcctagttc 98041 ttttgaccta gattttctgc ttcctccctt gccccctcag cacacacata cccacatgtc 98101 cttagtgaca ccagcagtca catttgatgt tagaaactat aaagttttct gttctcagca 98161 atctcaaact caaatatggg acacactttt attccttcag aataaatgac taaattcaaa 98221 acatacacac ataatgcttt aaagaacgta taaacccttc catgtcctat tcagagctgg 98281 aggcctccct cataagaata ccacatttga tggaattgta ttttttgaaa agagtctaca 
98341 ctttctgaag gactttcagg taccagggct catttgatcc tcacatcagc tctgtgaggt 98401 gctaggtggc tattattatc tccatttttc aaatggctga ttgacttggc caagatcaca 98461 tatctagtga tggagccaag ctagaaacca ggtcttctcg ttcccaactc acaacactgt 98521 ccccttcctt ccaccttgga gtgatgttag aaataggtca tcatctccac tcaagcactt 98581 agttctttcc attatcttcc catcttcttt tctgggagaa agaagcccaa gggccctgtg 98641 acttcaactc cacccacttt ctgatgactc tgaaacccca agccccaacc ttagcctctc 98701 tcctgggtcc agactgacat atccagctgc tccctgagct tttcttctta gttgaagaat 98761 tcaactcaca gtcacccgca tgcaatctga agcccatgag ccctttctca gccctgcctc 98821 ctctcatgta tcttctattt ctgtaaagcc gtcaccacct actcagtctc ccaagccaga 98881 aatccagtgg catcccagag tcctcctctc ccttagcttc caagattggt caagaaatcc 98941 tgcctgtttg cttcctgaat ttctctaaaa tatatcctca cctcctctta gcctaggccc 99001 tatcatctct tgactggagg cctgcgctca tcttcctgca gtttttctgc cgactttagc 99061 cttgcaatct atcttcacat tctgatctct gatacagagc tctgtcgctc tctcgctctc 99121 tcttaattag agacagagtc tctctctgtt gtccaggctg tagtacagtg gtacaatcag 99181 agctcactac agcctttaac tcctgggctc acaccctcct cccgcttcag cctcctgagt 99241 agctgggact acaggtgcgc ttccaccacg ccttgctaag ttttattttt attttcatag 99301 agacaaggtc tcgctttgtt gcccaggctg gtctcaaact cctgggctca agcgatcgtc 99361 cacttgggcc tcccaaagtg ctgggattac aggcatgagc catcgcaccc agccagagct 99421 ctttctgaac tctaattctt actgttactt ccctgctcaa aatcctttaa gggctccccc 99481 tacatttcaa gatattcaca gtctacattc cagccacctg gaggcatttc attcccagaa 99541 cacaccacac ctcctcagag ctgaaggagg aagtccattg ccatctgact tggcctcagt 99601 aactcctttc gttgccaagg caagtggtct ctcctctaca tatggatgcc ttttctatct 99661 atttctgctt tcttctttat catcacactt ctatcagtta agatattttc aggagcaaga 99721 aacataaact aaaaatggct taaacaataa caaagacttg taatttctca aagtccagaa 99781 gtaggaaggc tcctgggtat catctaattc agcggttcag tgacattatc agagttcttt 99841 ccatcttttg ctctgctatc cttatttaga attattgttc tagtctatat caaagactta 99901 aaaaaaattt tttttttgag agaggcttgc tctgttgccc aggctggagt ccagtggcaa 99961 gatcacggtt cactgcaacc ttgcctcctg ggttcaagca 
attctcctgc ctcagcctcc 100021 cgagtagctg ggattacagg catgtgccac caggcccggc tcctttttgt atttttagta 100081 gagatggcat ttcaccatga tggacaggct ggtctcgaac tcctggcctc aagtgatcta 100141 tgtgccttgg cctctcaaag tgctgggatt acaggcgcga gccaccacac ctggcctcag 100201 actttcaaaa cattctcagt ggcggtgtca aatattttgt cttttaatgg tgaatctcat 100261 tttttgagaa accagaagcc acatacagtc aagtctggtg agtaatatga gtaactgagg 100321 tagacaataa catttggggt cacagtcaaa gtcaggccac agagtaacaa gacagaactt 100381 ttggttttgt ttgggatctg gctttaaaag caagtttaga aaaggcatcc caagtatctt 100441 gggttccagc caaatatctg tgaaatacac agcatgtgaa cacaaagagt tactgtttcc 100501 aaagctggcc ccactgggat gtttaagttt taatctgcct gtttaaaaaa caataacaaa 100561 ttgaattttt gaaaagtttc ctgggcctgg tggctcacgc ctgtaatccc agcactttgg 100621 gaggccaagg caggtggatc actaggtcaa aggatcaaga cagtcccggc caatatggta 100681 aaaccccgtc tctcctaaaa atatgaaaaa ttagccaggt gtagtggcgc atgcctgtaa 100741 tcccagctac tcaggaggct gaggcagggg gatcgcttga acccaggagg tggaggttgc 100801 agtgagctga gatcccgcca ctgcactcca gcctggcgac agagcaagac tctgtctcaa 100861 aaaaacgaaa aacaacaaca aaaaaaagag aaaagtttcc tggttacctg gaagagaaaa 100921 aaaaaaatac accgtggagt aaccattctt ccttctcacc atctgctgag atgaggaact 100981 taaggtgcta actgggacaa atgaatttcc aaaatagaaa aagggattca gggaagtcct 101041 tgagtgctga atttgggagg ccaagccaca gtggggtgac atgagtggac ggcacaagtg 101101 gacagaaata acctggggtc aggagaagct gactccaaag gtaaacccac ggctcctcca 101161 caggcctggg gtaaatccta gaacagctgt tgagttcttt caacttatta acatattttc 101221 ctctaagtcc aaaagggcct tctttgcctg gcagcccctt tctcttgttt ttttggtgtc 101281 ctccctccaa tgtctttgtc tttccttttc ttccatgtag attttaagta accaagagga 101341 gtggtttaag gcggagcttg ggagccagga aggatatgtg cccaagaatt tcatagacat 101401 ccagtttccc aagtaagtat ctgcagcctg ctgagatggg cctagccaca cacactcctc 101461 ttacatttcc agcagtcact acagttatcc atgggaaccc aagacattgt caaaaatggc 101521 tgcctctgga gctctctccg gagagcctcc agagcatgca gctgaaggcc agggcaggag 101581 ggagactttg catttcattt cattcccttt tgatgtttac accatgtgca tgtattgcta 101641 
ttttaagttg ttcttaactt tttttttttt tttttgggag ttggggtctc cctctgtcac 101701 ccaggctgga gtgcggtgac tccatcacca ctcactgcag cctaaaactc ctgggctcaa 101761 gccatcctct cacctcagcc tcccaagtag ctgggattac aggcacatgc caccacgcca 101821 ggctagtttt tgtattttta gtagagacag ggttttacca tgttggccag gctggtccca 101881 aactcctgac ctcaagtgat ctgccgcctc agcctcccaa agtgctggga ttacaggcgt 101941 gaaccaccat gcctggccta ttttaagttt tgggtttttt tgggtttttt tattcatgtg 102001 tttatttgtt ttgagacaga accttgctct gttgcccagg ctggagtgca gtggcatgat 102061 ctcggctcat tgcaattctc ctgcctcagc ctccctagta gctgggatta caggtgcatg 102121 ccaccacacc cagctaactt ttgtattttt agtagagatg gggtttcacc atgatggaca 102181 ggctggtctc aaactcttga cctcaggtga tccacccgcc tcggcctccc aaagtgctgg 102241 gattacaggt gtgagccact gcacccggcc tattttaagt ttttataaaa tgagtttgta 102301 aactcaaatt atctatagtg atttcaatca cggcgttccc cagaattcca aaggggacgc 102361 tattggccac cccacttcgg gagctgctct cctaagcgct tctctgtgac ctcctaatcc 102421 taaccatcct caccaggctc tcacccttca ctcagtgggt ggacagacac tccctgcctt 102481 cagtaaagca tgctcacccc aatgagagaa gtcagcgggg ggaaggggaa attctcttcc 102541 taaagaggaa atgaaaaatt gagattgggc ttcgtttgtc tcaagaattg accacaaggg 102601 ttagatagaa atctaatctt gcctcagaac cagggtctag ctcaaagagc aaaagcttgt 102661 tagaaaacca cagtagtatt ggtgactcca gcaaggtgca agagcaccag tcttggtaaa 102721 cctgaactga ccaccaaacc aagagaatga ccactgacct tgacggaacc cttgatgctg 102781 cccaggaatc tgctaggtga cctctgcata ccccacgtgc tcctcaaacc ttcagcccct 102841 ccgtgcatca ctcttagccc gtgaccttga ttccaggttc tctggaacaa tgaaagccac 102901 cagaagtgaa gctcccagac ccccctcccc caccgcatcc acctgcctcc tgcaagtgtg 102961 ctcaaacagt cggcctttcc tgcagctggc tggaaggaac tgtcactgct tctagcagaa 103021 attctcctct ctctctcttc cacgtcaacc tcctttccct ctctgctgga tcattcctgt 103081 cagcaaacac acacgctgca atttctccca acttaccaaa acacaaacaa acaaaaccaa 103141 acccagaggc caggtgggtt ttgttgtggc tgatgcctgt aatcccagtc ctttgagagg 103201 ctgagggagg aggatcactt gtggctagga gtttgagacc agcctgggca acatagcgag 103261 ccccatctct acaaaaaatt 
tttaaaaatt agctggtcgt ggtggcatgc gcctatagtc 103321 ccagctgctt gggaggctga ggcaggagga tcgcttgagc ctgggagttc aaggctgtgg 103381 tggaccatga ttatatcact gcactccagc ctgggcagca gagagagact ctgtttcaaa 103441 gaaaaaaaaa atttaaataa ataaataaat caagaaaatt cttaatccca aatccccttc 103501 cacttgccac tccatctctt caaaaggagt gtgtgcattt gctacttcca actcctcatc 103561 tcccgaccct tcccaccagg ctccccatgg gctgtagctc ccaggggtca tctcagcatg 103621 acctgccctg gggatccccc ctcccactgg aacacctctt tcattttggc tccaggacgt 103681 ccttcctccg gttgtcctcc cactgccctg gcttctcctc ctcagtctct tgtgttggtt 103741 cctcccactc cccagtcccc tataggttta gaatgtctcc ttttcagtga ggtctttctc 103801 atcaccatac cttacattaa tctcccaccc acagcgcccc atctctcctc tcagttttca 103861 tgttctccac agcacttacc atcatctggc acgcttcaga tcttaattat tttgttcttt 103921 tgtgtgtgtc cccctcgact agaatgtaag ctccctgagg gcaggggtgt gtgtctgttc 103981 tgcccactac tgtgtcccca gtgcccagga ggggcccagc acagtactag atgttcagta 104041 aagctgttga atgaatgaat gaatgaatga acgaacgaat atgtattatt gaaaccatga 104101 ccactcatgt ctaatatgca atgtgtggtg atttttacaa atctttaaag aagcccccta 104161 tcagattgca gtcagaggac aaatgtgtat ttcctaaatc cccttcactt ctgaaatgtt 104221 tcctttgcga tatggctgct gtggtgcata ctccattcct cttcccatcc ccattacatt 104281 tcccacacgt ggcagtggtg tgtgtggtgg cctcttgttt agccctgtag aattgtgtgg 104341 caaatactca ggcagaatct ccaggccacc agaaactgct gctttcagcc ctctcagagc 104401 acagccaaac ttccccctca gtcccagtgg gggactcagt ctccagtaag tacatccttc 104461 ctgctaccta tgtctcagtt tcccaaattc tagaaagcac agagaattgc tcacaaagga 104521 atccaaagcc aaggcctgac gggcttttat cttaaaggaa catgcgtatg agccttctgg 104581 tgacagcaat tagagcagcc accttgaagc aatgtgacac agtcccacct ttggccgctg 104641 agtgattgca gacactcatt ttgcttgtct gtggtggaga gaggtctctg gcctcctgct 104701 ttgaggctgc agccacagct tgcctggccc tgtgtaagtg tttgacctat ttcatacatg 104761 ccacgctgta tgcagctgta cctccagggg ctgccaccct gaccattctg ctaaaagagt 104821 aatttctcct gtttcttctt ttccacagta aagattattt cccagcctgg atttttgggg 104881 caggcccagg atctggttgt atagacgccg ctctctggcg 
gctgccagcc tcgctgattt 104941 ccaagcatgt acactggact ccctaggaac actgcagcct ctgctaggca cactgagcaa 105001 agcccccttg tgctccatgc cagcctaggt aacgtgacgc cctgggctgt aactttcctt 105061 taatacccag ggaggaacct ctaggccact gagacctcag ggcaggcatg agcaccccag 105121 gccctgctca gccggctgca cctggttggc tccagtgcct gaagccttcc cttccctccc 105181 cagtgcatcc tctgtcctcg cagctctggt ctatctcttc cgcagctgcc ggcaggcggc 105241 tgcttcctgc agtgagtcac tcccacactg ccccattttg gtgcagcgac acatttccat 105301 gattcgtttc ccaggctgga gagagacttc tcctggagag aaggaacaaa gagggtgaat 105361 caaggggaat gctccctcgc ctttcctgaa agctgcctgg agcactggtt tgatttcaaa 105421 cagcacctgg tctctctctg ctccctactg tacagagtcc ccagcctctt tccctctggg 105481 tcacctcttc tccacgtctc cctcttgtca ctcctgtgtc tcatcctgat ccttttgttt 105541 gatctcgccc ccacagatgg tttcacgaag gcctctctcg acaccaggca gagaacttac 105601 tcatgggcaa ggaggttggc ttcttcatca tccgggccag ccagagctcc ccaggggact 105661 tctccatctc tgtcaggtac tgaccattcc tgacactgcc ttggcccttc tcggcctgtt 105721 aacctcacag aatgctgaag ccatgaaggg acctctggaa tatgacccaa tggtcagaag 105781 catggtggcc taggggccaa gcatggcagc agcttcacaa cctgctgctc ccagtgcctt 105841 cccaaaacat gcacacacca ctctttagaa cactggtgtg tgtttgaaat tatttctacg 105901 cccatctgcc tccccttcta gcccaggtgc ttctggaagg gcaggatttt cctctacttc 105961 atctttatgc ctttaacagt gtcttgcata aaagagagac actcagtgag tatttgtgga 106021 aggaaggagg gataggagga ggagaactcc cacttgtcca gatccatctg ggggtttaaa 106081 tcttaacctt ttcttttatg attgatgcat atctgaagaa gagtcagaga aaaatagaaa 106141 tctcaagtca ctgctgtagg atatacaggg gcggaaaagt tgtggcacct ttctttgccc 106201 atcagaggag tcacagctga cactcctaca agacaggtta acaagaggag agtggacaca 106261 tttatttaat caaagacttg tgtgacacag aagccttcag aaatgaagac ccaagaccca 106321 gggaaagccg tctattttta tgcttaggtt tgtcaaagga cagacagccc tgtagaaatg 106381 ggattggaca aagggggttt gatctaaggt aacagactga ggtaggaaac tgaacaaggc 106441 ctgtcccaag tcttcttttt tttttttttt tttggagaca gctgttgccc aggctggagt 106501 gcagtgacgc aatcattgct caccacagcc tctgtgtccc aggctcaagc catcttccca 106561 
cctcagcctc tcaagtagct ggggatacag gtgcacacca ccacacctgg ctaatatttt 106621 tttaacttat ttctttgtag agactaggtc ttactgtgtt gcctgggctg atctcaaact 106681 cctggcctcg agtaatcctc ctgccttggc ctcccccata gtgacccttc tagtctctct 106741 gacccactgt ggggaagagg aattctaatt tctatgactc actcaggggg agaaagagga 106801 gcgggagaca ggagggcagg agaaggccaa aggaagactt tgcttccaag gctgcttctg 106861 aaaccttccc atctcctttg gttcgaagca ctcagcatgc caaggggtgt cgttttctga 106921 gccccaacaa atacatgtat agttgtaggc agctgaacaa aaagaaaggc tacttcaaca 106981 cagtacagta cagcaagacc gaaaaccaca gaacagaggg ccaggagtgg gggaggactg 107041 taacagcgaa atctcactaa ggtgcagcaa cccataggaa gtgactcagt ccacgcgtag 107101 acattacaca gccccaaact cagaagccca gttcggaaca cactgtgtgt ggtcagcatg 107161 ttgaggatcg caagtcacgt atcagacttc accacatgtg aaactgcaag aatataatgc 107221 tgttagcaga tcaggagaag tacagcaaaa ccttactgtc aagttattgg caggaaatac 107281 cgatctggtt tattcaatat attttgaaag gcataaagct ttgttttaag gacagaaatc 107341 caatctcaag taataaactg tggccacttt gctgtttaga aagaatggat atttgaaagg 107401 ataggtagga aggacttaaa aataacaggt caacaatttg tatgcataca tacctttaaa 107461 gtggtctctt tagcattgcc cttgcaatct tgcttaaact cttttcttca gtaaaaactt 107521 tattttcata gcttggagaa tggttaactt tagttcaggc caatattgat gtatcacaaa 107581 ttctaatctt gttaggtctt aattaatgaa atttcgctcc attaacatat attacaagtc 107641 agggtcgggc gccgtggctc atgcctctaa tcccagcact ttgggaggct gagttgggag 107701 gaatgcttta agccaaggag ttggagacca gccggggcaa ctagtgagac ctcatctcta 107761 ctaaaaataa aaaattaacc cagtgcagtg gcgcatgcct gtggtcccaa ctacttggga 107821 ggctgaagtt aaaggattgt tggagctggg cggttgaggt tgcagtgagc caagatcgat 107881 cgcaccactg cactctagcc tgggtgacag agcaagaccc tgtctcaaga gaaaaaaatt 107941 atttacatat attatataca tatattatta gaaaaaacaa ttatacatat atttaacatg 108001 cgtgtatata tattacaagt caaatctatt tcttaagcct cccctcttta ccccatcata 108061 aacttgacca agtaatttgt tagactgagg ggtatctctt gggcctttgg tttgcatgaa 108121 gatgactctg gctttttcag ctagaaacta tggtttttgt ggtttcgttg gttttttgag 108181 acagactctc gctctgtcgc 
ccaggctgga gtgcagtggc gcaatctcgg ctccctgcaa 108241 cctccgcctc ccaggttcaa agcaattctc gtgccttggc ctcctgagta gctgggacca 108301 tgggttcgca ccaccacgcc tggctaattt gtgtgtgtgt ggtttttttc agtagagacg 108361 gggtttcgtc atgttggcca ggctggtctt gaactcctgg cctcatgtaa tccacccgcc 108421 tcagcctccc aaagtgctag gattacaggc gcgacccact gtgcccggac ccagaactat 108481 ggtttatatt tccttaatta aagtgagttc tacaaaacag taggccttga cttctaccga 108541 agtcttcccc tgatattcag gcgatgaagc ttctagttat ggtgacaatt accgtaatcc 108601 ttatcacgtc tttgcagaaa tgtagtataa tattttccaa agcaaaatca ttattaatta 108661 tttttacatc tgtcatgggc acagataatt aagagctaat tgtagttcta tcactcagca 108721 tttgttaagg gcctgaaaca ccagattttg gagtccagcg ccctgagtcg catcctttcc 108781 tttgttcctc ttcaggcttg agacaatggt caggccatcc tctccttgta gacccagata 108841 accagctgca accctcacac ccatccttac agcttatgag agtaaaaggg tcaatgtttc 108901 acactctgag ttctttgagg tggtggatga aaaattataa tatcacaaga atattttctc 108961 ctcctcacct ggatgtgaag gataatgact ttgagttgat ccatgttgtc ttagaatgaa 109021 gaaagtatct gttggtggcc aaaaccaagg ggaagaccac aaagtgattg caatatgttg 109081 attaccaccc gaaatcagtc aagcaagagg ccacttgtac aacctgtggc ttcacccagc 109141 tatgagcaaa tcccagcaca taagatgaga ttgagggtca gaatttccat ctccttctta 109201 acacaggtcc ctaaatccat ccctaagaaa cctccatggt caggcgcagt ggctcacgcc 109261 tgtaatccca gcactttggg aggctgaggt gggcggatca tgaggtcagg agatcgagac 109321 catcctgtct aacacgatga aaccccgtct ctactaaaaa tacaaaactt agccgggcgt 109381 ggtggcgggc gcctgtagtc ccagctactt gggaggctga ggcaggagaa tggcatgaac 109441 ctgggaggtg gtgcttgcag tgagcagaga tcacgccact gcactccagc ctgggcgaca 109501 gagcgagact ctgactcaaa aagaaacctc cacttactct cagaggtatc tttactctcc 109561 gagacccttt aaaattcttg acctttcttt tacatcaggc aattggcatc actagtgcca 109621 gtttattttt agatataaat gtatgtctgt ttgaaaaata aactttacat ccaaaagaga 109681 atgacttggc aatgaagtaa aactggatct aatggtaaag gctcagccca cccttcctag 109741 atggataaaa tccattttct ctagccctgg ttcgaatgtg tcccatcact tgtgagtcca 109801 gactggcaac aaaaggtgct gatctaaaag aagttatcgt 
ccagtggcag ggtctgggga 109861 aggggcggca ggcgccatgt ccagccacga aggtggcaag aagaaggcac tgaaacagcc 109921 caagaagcag gccaaggaga tggacgagga agagaaggct ttcaagcaga aacaaaaaga 109981 ggagcagaag aaactcgagg tgctaaaagc gaaggtcgtg gggaaggggc ctctggccac 110041 aggtggaatt aagaaatctg gcaaaaaata agctgttcct tgtgcctaag gagacggtga 110101 ccctttattt catccgtatt taaacctctc tattccctcc tataatatct tttgccacct 110161 atagttggaa ttaagtgtcg tcttggagct gttgtacatt taagaataaa cttttgtaaa 110221 aaaaaaaaaa aaaaaatctt ccagtggctc atcatctctt tagttgtttt cactaagtca 110281 ttcctaccat aactgtgaat ttaaagtaaa accagctcag aatcttgcca gaatctgctc 110341 tttggtcgtt gttctaccct aaactttgta tcacctgaaa ttaaaccaac tcatttgaaa 110401 gaaaaaagaa gttatcatgt ttaggtggcg tgacaggcca agctcactag actggagtca 110461 gagactcagg gttctggcct cagcccttac tcagcagtga ccttgaccaa gcccctgcac 110521 ctctgtgagc ctcagtgtcc tcatctgtaa aatggagaag taaggctaaa caatctccaa 110581 catccttgcc aattaaaact ctatcatttc acagggcgca gtgcctcaca cctgtaatcc 110641 cagcactttg ggaggccaag gcaggtggat caccggaggt ctggacttca agaccagcct 110701 gaccaacatg gcgaaacccc gtctttacta aagacttggt ggtgggtgcc tgtaatccca 110761 gctcctcagg aggctgagac aggagaatcg cttgaacctg ggaggcggag gttgcagtga 110821 gcagagatca caccactgca ctccagcctg ggcgacagag tgagactcca tctcaaaaaa 110881 acaaaacaaa caaacaaaca aaaactctgt atttcataca aagctatttc accaaaagct 110941 gaaaacactt tgtaacctta tttatgcaaa acataaacca agatcaagag atccttatta 111001 catagagaaa aaatgtgaca tatttcttag gaaagagagt agcttcttac ctgccagtgc 111061 tagcttatgg gcttaaagga gatggttcaa agaatttctg gtaacagaaa gataatagga 111121 gcttgaaata cccctgttga catcttgcga ataaacattg tgaaaaggtc tttgatctct 111181 ttgtcgttaa gtcacttgga acatgctttt cttgtaaaag gttagttcca gtgtccaaaa 111241 caaattttag ctcatctgat taccttctgc tgatgtaaat cccgtctagc atagcctgta 111301 tcctaagtgg tccatcctac aaagttcctg ccgaagcctg actgggctgg cccacctgcc 111361 cagtgatgag tgagtcatcc atatctacat ctcctgctca aagagaactc caccatccca 111421 gatgtgagca gcctggaggt ggtgacatta tcaccgtgta acatacaatt ttgttttgcc 111481 
ttgatctcca ggcatgagga tgacgttcaa cacttcaagg tcatgcgaga caacaagggt 111541 aattactttc tgtggactga gaagtttcca tccctaaata agctggtaga ctactacagg 111601 acaaattcca tctccagaca gaagcagatc ttccttagag acagaacccg agaagaccag 111661 gtatgctcca gatccagtcg accccaatct agagatttta ggcagcctga gttctcacat 111721 gaaccaccag gaaaaatgca taggatgggt agatcctctg agtatactga aagacagagc 111781 tcagagctct gaggtcagat ctcaacttaa cacttcctag tcgtgtgact ttggaaagtt 111841 ctctgcgcct cagtttcctc atccataaag tagggacaat ggtaatatct acctcccaga 111901 atggctgtga aggttggaga agataatgta tcccacgcct catgtcattc ctcatcacac 111961 tgcgctcagc aagcatcagc tcttaccaca ggtgctgttt ttttgttatt catagacatt 112021 gctacaccaa agttttcaac ctagctcaag agttgcccaa ggatgaagga gttgggggtg 112081 ctacagacag ggacattagg gaacagccta gagttcagga gttaaatcag catgtggagt 112141 tttatgatgg gattgggggt agggaggaca tgggaggaga aaactgcact tggaattatt 112201 gctgttgcaa ctgtttgcaa tacagactat tgctttaaaa ggaaatgcgt cagacagacc 112261 tggtatgaaa tcctgcttta atcgcttagc tacctgtatt acctagccaa gttaagaaac 112321 catctgagta tcagttttct catctgtaga atggagagag caataaaacc tccctcactg 112381 ggctgttgtg aggattaaat gacttaatac agctaagcac taccagagtc tggtccatat 112441 taaacactaa gaacatgtta acttgcctcc cataagtaaa ttcacacatg cacacactca 112501 cacacacaca cacacaccaa tttatgaact atttttcaaa ggaaataaca tggttataag 112561 agacaggtat cataggaaga agcattcact agaatagctg aaatcctagg gacaggttat 112621 ttgaaaagat attgagactg tgaaagcaat gtgtgctttg gggaagtcaa gaaaatcctc 112681 ctttttcgag tccctagttg cttttaagcc tgagaagtta ctgccaaagg agaaaagaag 112741 ttaccaccaa atgcacacag aagttcagaa aggaaaacaa aggctagcca agaccttgtg 112801 aatcactcta accaggaaac ggacagaggt attaactcac taggatgggc attcagagaa 112861 gcaatataaa attgcattaa atgggattaa atgagttgat atttgtaaag cgcttagaac 112921 agtgcctggt atatgctgcc ctcagggtcc tacataattg tttgttaaat gaaactaaca 112981 atacggtatt tctgatttgt aaggttgttg taacattagg gaaaaaggcc ttacttcttc 113041 ccccacgtcc agcccggcgg caggcctggt tcccatggtg ttgcaagaat gttgctgttt 113101 gggtctctat caccaggctg 
taaatgacag ttcaaaatca tgcaaagtca acagtggagc 113161 ttatctggag ggtagattgt ccagtggtgt ggcaacatgt gataagaagg ggtggcattg 113221 tatcagtaaa atggaatatt tcagcttgat tacacagaga aaaaacgagc gccctggaaa 113281 tggactaaat ggctaccagg caaaaagtgt tagtctgtga accagagacc tgcactccta 113341 aaactgggag gatctgagaa tgcagaggcc actttggaaa atgtgaaggt ctaaagcatc 113401 aattatcttt tagctgcccc caccccacca gctatttctt ggagccagat gagcagggtc 113461 tgggaacaac tgcccgaggg tagggtaggg ccagacgatc agagaggagg atgcatctgc 113521 agcatctctg ttctttctcc cagggtcacc ggggcaacag cctggaccgg aggtcccagg 113581 gaggcccaca cctcagtggg gctgtgggag aagaaatccg accttcgatg aaccggaagc 113641 tgtcggatca ccccccgacc cttcccctgc agcagcacca gcaccagcca cagcctccgc 113701 aatatgcccc agcgccccag cagctgcagc agcccccaca gcagcgatat ctgcagcacc 113761 accatttcca ccaggtatct ggaaagaagg cagtgggcac agtacggtgg aaaggagaac 113821 ctgcctcccc tccaggcctg gcccagcctc ttgagttctc agagagttca tgatgaagac 113881 cctggacccc cccaaatggc agaaataggg aaactgttct tggcaaagac atctctatga 113941 attaaatggc tcccacctga acacacacac acacacacac actgtctcac acacacacac 114001 acacttaaac tcacacacac acacacttcc cctgaggacc cagtaaacaa ttgcttcatt 114061 ttcctatcag cacaattcca aatctattca tgccccttag cccatctggc tgtcagatcc 114121 acaagagaaa aaggcttaat acccacaaca gagccttgtt tattacttat ttatctccct 114181 ccacccttgt tcccgatggg atttcaggcg gcttactgag atggacaata cggccagata 114241 acatcattcc tgaattcctt agcagggcac acaaagtcct ttgtgagctg ccagaatcta 114301 tttccccagc ctcgtccctg ctgcctccct gccaactccc cgtgcaaaca cacattccag 114361 caacccccca cgctgcctta gttacccgaa tggtccattg cgtttcctct catgcctttg 114421 tggcgtttta tatgctgatc tctctgcctg atgagcccct ccctgggcag tcctacccct 114481 ctaagctgga gttcaagcat gacctcctct gtgaagcctt ccttgatttc ccgggcagat 114541 acagacattt cttctctctg ctcacttacc acttaaatgt gcatggttcc ccctctaaat 114601 ggaagctcca tgaagccaga gaccttctct ttccagctat atttcccttg tgtctgctac 114661 tggaaacatg gtaggtgcct gataaatatt tcttgactga ataaataaaa cctgcctttg 114721 ccttctttgc cttcagaatc taaggtgcct ttagacctta 
ggtcctcctc aacgtggagg 114781 gtttattgtc attatcagta agtattcctg gaatataccg gctctgtgga tggcatctga 114841 gcaaggcact gggggaaccg gtgggagatg tcctggcagt ggggtgacca gtcttctgtt 114901 gtatgtttct aggaacgccg aggaggcagc cttgacataa atgatgggca ttgtggcacc 114961 ggcttgggca gtgaaatgaa tgcggccctc atgcatcgga gacacacaga cccagtgcag 115021 ctccaggcgg caggggtatg ggaactgtcc ttctctggga tccctgggga aaggccttgg 115081 gacagaggtc aatggaggag atgaggcagg agagactcaa accacacaga tctggctggg 115141 cacggtggct cacacctgta atcccaacac tttgggaggc caaggcaggt ggatcacctg 115201 aggtcaggag ttcgagacca gcctggccaa catggtgaaa cctcacgtct actaaaaata 115261 caaaaattag ccaggcatgg tggcagatgc cagtaatccc agctacttgg gaggctgagg 115321 caggagaatt gcttgaaccc aggaggcaga ggttgcagtg agccgagatt gcgccatttt 115381 actccatcct gggcaacaag agcgaaactc cgtctcaaaa aacaaaacaa aacaaaacaa 115441 aaatgcacac acagatcaac gagaggtgat ggaatgtttg gggaggcctg gggctgagcc 115501 agcaccgtcc agtctgttga tctcagtctg gtgccaggcc ttccctgttc tcatagttct 115561 cagttctagt tctttatcat ctttatcata gtcctttacc attgccttta tcattgcatc 115621 ttttttttgt tttttgagac agggtctcac tctgtctccc aggctcaagt gcagtggcac 115681 aatcatggct cactgcagat tcaaactcct aggctcaagc gatcctcctg cctctgcttc 115741 ccaagtagct aggtatatta tttgtagcga taagctcttg ctatattgcc caggctggtc 115801 tggaactcct gggctcaagc aatcctccta cctcagcctc ccacagtgct gggattacag 115861 gcatgagcca ccgtgcctgg cctgtgatcc atcttgactt gtgacttgat tgccctaaca 115921 ccaagggctg ttgtttgggg catgttgccc ttctgtttgt cactcaagat aaaaatacca 115981 cctattgatg tggagagagt ggttctcagg gagtctattg gggctttgga caataacatc 116041 tgttggtaaa agtcctgggt ctccagagtg acgctggtgc gttctgggaa cttcttgctc 116101 aggcctgtgc ataggtatct gatttaaatt gcagcagggt agccgggcat ggtggctcac 116161 acctgtagtt ccagctactt gggaggccaa ggcaggagga cttcttgagg ccaagggttc 116221 aaaaccagcc tgagcaacat agcaagaccc catctctaca aaaaaagttt ttttaagctg 116281 cagaggggtc agccttcagg gaatgttgct ctcaaagact tggatgtggt gggaggagcc 116341 agcagaccct cccctgcctg cccagcccac agctcagcag agcctcttct gtgcttcccc 116401 
caacagcgag tgcggtgggc ccgggcgctg tatgactttg aggccctgga ggatgacgag 116461 ctggggttcc acagcgggga ggtggtggag gtcctggata gctccaaccc atcctggtgg 116521 accggccgcc tgcacaacaa gctgggcctc ttccctgcca actacgtggc acccatgacc 116581 cgataaactc ttcaggggac agaagctttt tgtctggagc tgcccacaag aaagagggca 116641 aggaaaaaag gctggactcc atgactatat atacatacat ctatctacat ctgcctgtgt 116701 acacacacaa ctttttatac tagtaattta ttggcaattg ggctggtaat tagttgatgc 116761 aaaagggaac tcaggtggag aataatattg acacttgctt ttctgccccc ctcaggggtg 116821 tgtggaaggc agtgggggag ttgggagggg ggcagggaaa tgaaatggag ttttgtcctg 116881 gccttcagct gtcactgctt cctccttgtc ttgggaattt tcacggagaa cagctaagca 116941 gagaccacac ctcggcactg gacacagaac aacagggtgg ggttgaactt ggtggggcac 117001 gttctagctg acctgcaagc cccgctcacc tggagggctt gcagagcagc tctcctcctg 117061 ttctccaagg ggtgggcact gttgcattag gaattaaggt gcagcccagt gctgcgggca 117121 gccaagcgag tctgagcggg gatgtgaagg caggacgggg ttggagagac cctgcctgtg 117181 caaggctggc ctcctagtca acacctagct gagacattca tctctgttca aattaatatt 117241 ccttgggtct ttcaaatgaa tgtttcaatg tgaggagcga agtactggga aatcgaggga 117301 ttttccacaa tccccaagca gcttactcag tggaacctct gtttccccag ctatgaagga 117361 aatagcatcc tacattccgt aaagtgcttg aagaagctct agctgagaat gctgaatttt 117421 acagtcattg ttttatggtt acaagggtat gtcttcaggg ctctgttcat tttccctctt 117481 gattttctct tttgaagatg ccttgcccag tggcatggaa agaagcccac ttgatgatag 117541 gagcctgact atgaattgat tgactaaaca tgaccccagg tggaagctgg ctttgacacc 117601 agcttcctgt tcagtctgac cactgctgtc ctcctgctgc ccaagagttg cacctggagc 117661 aaggaacccc accatggctg tcgctctcca tcccatcacg ctagaatcat gtgtccaagg 117721 gctcactctg gaggtgcaca gcacaggtca gcctggccag gggcgaagga gacagtagag 117781 aggaagctca gggccttagg ggaggccggg tgcaaacccg ttctgcacca agtgcactcg 117841 gagtttgtgg gtatgggtgt gtacccctgc aggtgtgcac atgtgtgctt gcacgcacat 117901 atttgtgcac tcctgtgtgt atacatgtgt gcttgtgtat gcatatgtgt gcattcctgc 117961 atgtgtggac atgtgcgtgc atgcatctgt gtgtctgtgt gtgtgctgag acaggaaagg 118021 gggtgaaagt gttggtgagg 
gagcctggaa gttttctctt ccccaacctc tcttgctcta 118081 aggagggatg gggttggggg cagccgttat tgaaggtgat cggagaagaa agattttctg 118141 actcagaagt gactgccagt gtagcacaag cagtgtccct tgtgactgtg attctacagt 118201 tctctgatcc tcatgtttcc tttagaggaa agaggaaaaa aggaactctg tggtgggtat 118261 tgggagggaa aagaaaatag cctggtggag gcaggaggga gtcgagtgtg agtaaggagc 118321 acctgcagct tttggaagtg aaagcagaga gagggaaagg tagctaagac atccaggagg 118381 atcaaggggc agcgtgagag gcacaggggg aggaaaggga gagggtgagt ggggccaggg 118441 aagagaacgg tgtggagttg gtgggcagat ggtgcaggag gagggctggg cagtggggag 118501 aggaagaatc agagcagaga aaccgctgct gcggaaggca gggagcaaag aagcataagg 118561 aaaagaacag gagaaagaac aaaggcagag aagacctcct cagctatttt ggtgctaggt 118621 aatgtgaaat gctgcagtaa ataaaagcca taagtaatgt ttgattttac agtatttaca 118681 aaccatatgc tttaaacagc ccacaaactc catccagttg tctgatgatg tgggaggacc 118741 agctctagtt agacggaggt atgtgattcg attctggtgg agactttgtg ccttgaaagg 118801 cttctcagga aagcagtaat aaaacaaagt gtccttgttt ggccaggccg tgtctggtgt 118861 ctgcgttgca tacccctcag gtagagagtg agaacgagac cagggggtga caggggtcat 118921 cggcctgcag gagctgaccc cgtaggcagg cggtatgagc agagctttca gaacccctgg 118981 agctcggtgt tctgttgtag caaaagctac tgaagagtat tttgtactga gatgccaaag 119041 catctttccc ttgctcatca gcctccattc agaaggaagc gggaaagtca cccaaaccac 119101 ctacaaggaa tgagggctct tagcggccag gtaaagagga caagaaccaa actgtccaca 119161 ttatggaggg gagggggaaa gagagctcaa cccccctcag cctctggtgc tgccaggggg 119221 agcccctccg catgtgtccc cacgggccct attttctacc tgacagcatt tcagcaccag 119281 aggagggttg agccagctcc tctcaccagc cacagaacca gcagccaaca aacacattgt 119341 ccccgagttt ggctgcaaca ctctatgtga cttgggaact tgaagcttcc agagaaaagc 119401 caggtcctgt tggaccagtt gttgggtttt cctaaccact tgctgtgcct caccgcattt 119461 caaatatctg gcacctgcca ctgcccaccc acttttagat ggccctaggg tgatctaacc 119521 atggtgcttc tcaaaattta aagtgcctac gaattgcctg gggaccgtgt tgaaatgcag 119581 cttccaactt ggtcctgaat ttctaacaag ctactgggtg ctgttgctag gtccacatgt 119641 taaggagcaa aggcaaagaa cttagcagtg agcacaactc 
taattgggct cttcttctga 119701 ggatattcgg ccattttttc ttcgctgcta acgaataggt cactttggct ttgggacaga 119761 tgcaattcag aacggtcatt atgacgtcat tcatatttgg ctacccaggt gacttattat 119821 tttagcatcc ttatctctag aattaaaaaa atgtaatgtg agaggaattc agatgaaatg 119881 caaacaggaa tttcctataa taattactat ttcctaatag taattatttt aaacatcagc 119941 agagattgca gaagttgtaa aagctttgtc aggaaatctt cagtcataaa attgactctg 120001 gtcagatgtg gccctggttg aggcagggga ctgtgctccc agctaaccta aaacacaaca 120061 gcccacactc gggacagatt agcatggcca aaatatttac tgtttgcctt gtctaagcca 120121 acattttagc ccctctatac caaggcaagt gtgtgagaaa aaccttgatt gtctcaatgc 120181 tttttaaaaa atgcctcctc agttcctcca ctttactcca tcaccttggc catgccacca 120241 tcatctcttc ctttggcaac tgtcagcttt catacctttc tccctgcatt ctctcccgta 120301 atctggtcct ccacacagat gccaaagcga tctttgtaaa aataaatccg atcacatcac 120361 tccccttccc actggattta ggcttaagtc caaagtccta atgtggccct gtgaagtccg 120421 gcccctgcct ccctcctcag cctcctcttc tgcctcccct gccccttgct tttatgctcc 120481 tgccccactg gcctttcagc ctccaaatgg ccaaggattt gtccccaaac ctttcacact 120541 tgccattctt gctacctgga ggcacatccc ctgctaaccc tgctaactct tgttcatctc 120601 caggtctggg ttggtttttt gttttctgtt ttttgttttt tgtttttgtt tttgtttttg 120661 tttttgtttt tgttttgaga tggggtctgg ctctgttgcc caggctggag tgcagtggca 120721 cagtctcagc tcactgcagc ctctgcctcc caagttcaag tgattctccc acctcagcct 120781 cccgagtagc tgggattaca gggtcatgtc accacacctg gctaattttt gcatttttgg 120841 tagagacagg gtttcaccat gttggccagg gtggtcttga actcctgacc tcaggtgatc 120901 cactcaccct ggcttcccaa agtgatggga atataggtgt gagccaccat gcctggccca 120961 ggtctgggtt taaatgtcac ttccaaacag tcctcccctg ccctctaatc taaaataatc 121021 cttcccctaa gatggtctta tgaatcctgt gcttttcctg aatagcacgt acacagatga 121081 tggtgccaga ctgagctgta tggctattac agctcactgc agcctctgcc tcccaagttc 121141 aagtgaactt ggagctgtaa gctccacgga gcaggggtca cctctgtttt gtaaaccatt 121201 gtacacccag catctagcac tatgcaaggc acacagaggc tcagtaaata tttttctaat 121261 gaaggaatga ttggacctgc ttcatctcat ctatttactc attttgtaac tcaactgtct 121321 
tgggcctaac agacctgtag atgcaaacaa ggtcatattt acaatgatgt ccaggttcca 121381 gtagctttag ccactgtctt cggattctcc tccagttaac ttctgagagg ccaagagagg 121441 gaggctgtta tctctcacgt gtttaggcac cagtatactc acgcttccta ctctggttcc 121501 agtgttctgt tgccccacgc atcctctgct gacaatgcct tcccagatgc tgcggatcca 121561 ctatgcacca catttatctg ctggtgaatt gacccaatcc ttcaggacac caaatctgcc 121621 ctgcagtcca catgacaagg agattgggga agaggtagag gggagaatgg ccagcctgtg 121681 cccaattctg agaggcactg cttggaccct ggctccctca gcatcttcca tgagcacagc 121741 tggctctctt ctcagccgag ccctccctcc atcagacaga acaagggata ccaacccttc 121801 cctccgtctc tctcttctca ctcttccctg ttgcctcttt ctccttgagc tcaaaagaag 121861 gggaatgtac ctacatgcca tcgttagctc acagctaacc tgaaacacaa cagcccacac 121921 tggggactga ttagcatggt caaaatattt gctgttttct cagtctaaag tcaacacttc 121981 agcctgtcca taccaaggca agggtgtcct ctgcaccaca ccatttgtgc agtagatact 122041 acctagaagc ctagtagctg tgcactcact tctattgttt taagttttgc tgtaattcat 122101 tcattaagtc aacaaatatt tacaaagcac ctcttcaggg tcagacacaa tcagccatcg 122161 ggtccttcac attctctcaa ctctgtccct tctctgcacc cattccacag ccactgcctt 122221 acttcaaacc tctctcactt ctttccctga tgactgcaat agtctcagca ctcgtctccc 122281 tgcctttctg gctggccccc tgtaatacag cctccaacgc agtcagggag atcttattcg 122341 cagacaggct gatgatacca agctcctgct caaatctttg ggttctccat agtcttcagg 122401 ctggagttta acctccttag cttcgcacta aaggcctttt gtgatctgcc tgctgccttg 122461 ttcgttcagc cacatcttct acctctccag tcccaaacac tctttacttc agtcccttag 122521 aactatctgt ggttcccaga atgaggcatg ttttctaaga cctctggtct cctacattct 122581 gttctttttg cctggaatgc tttctcccac tgttttgtct agacaagggc tgtctattag 122641 gactttctgt gatgagggaa aagctctgta tctgctccat ccaatgtggt agccactagc 122701 cactagccac atgtggctat tgaatacttg aaatatgcct ggtataaatg actgaaattg 122761 ccacatgtgg caaagagctg ccttataggg cagcgtgggt ccagacaact gttacttact 122821 ttgaaatcat tagctcaggg ctccaggtct ccactgggaa accatcccaa tattatggtt 122881 gatgtaagtg cccccgttcc cccaccccca cttatgttcc tgtagtaacc ttgttctccc 122941 tctatcccaa tactcagaac 
accattgtca gtgattcact gatctgcctc ctctgctaga 123001 ccaaatcttt tttaggtcaa agatatgcag tgcagtcaat aaatggttga tgcctagatg 123061 cagtgacgtg ctggtaaaga tttaataatt ggctctctct aagaagaagc cctaacttgt 123121 ggtatttgcc aattctatgg tgtaaacact cccaccatat gccaagagaa tgagaagaca 123181 agtcacagac caggagaaaa tatgtacaga agacatatct gataaaggac tatgatacac 123241 aaagaactct taaaactcaa caataggcag ggcacagtgg ctcacgcctg taatcccagc 123301 actttgggag gccgaggcag gtggatcatg aggtcaggag tttgagacca gcctggccaa 123361 gatggtgaaa tcccgtctct actaaaaata caaaaattag ccgggtttgg tggcgggcac 123421 ctgtaatccc agctacttgg gaggctgatg caggaaaatt gtttgaaccc gggaggtgga 123481 ggttgcagtg agccaagttc ctgccacttc gctctagcct gggcaacaga gcaagactcc 123541 atctcaaaaa aagaaagaaa gaaagaaaga aagtaagaaa caaaaaaaac caacaacaac 123601 aacaaaaaaa ctccacaatt agaagatgaa caacccaatt taaaaatggg caaaagacat 123661 ggacagctca tggaagaaga tatagatggc aagtgaggag ataaaaagat gctcaatatc 123721 atatgtcatt aggaaaatac aaattaaaac aatgaaatat caccacacac ctataagaat 123781 gccccatatc tataatactg acaacaccat atgctggtga tgatgtgaaa caacaggaac 123841 tctcattcat tgatggtggg aaggcaaaat ggtacagcta ctttggaaga gagttcgacg 123901 gtttcttgca aaactaaact tacgctggtc ataggatcca gcattcacac tccttggtat 123961 ttactgaaag gtgctgaaaa catccacaca aagacctgga catggatatt tatagtggct 124021 ttattcataa ttgccaaaac ttagaagaaa ccaaaatgtc ctacaatagg tgaatgaata 124081 aataaactgt ggtacctcca gacaatagaa tagtatttag aaagaaataa gctaccaagc 124141 tatgaaagat gaaaaccctt aaatgcatat tattattatt attattatta ttattattat 124201 tattaagatg gagtttcact ctgtcaccca ggctggagtg cagtggtgta atctcagctc 124261 actgcaacct ccacctccct ggttcaagca aatctcctgc ctctgcctcc caagtagctg 124321 ggattacagg cgcccgtcac cacgcttggc tgaatttttt tttttttttt agtagagacg 124381 gggtttcacc atgttggcca gactggtttt gaactcctga cctaaggtga tctgcccacc 124441 tcagcctccc aaagtgatgg tattacaggc atgcgatgct gcacccaacc cttaaatgca 124501 tattactatg tgaaataagc caatatgaaa aggctacaca ctgtaccgtt tcaactacat 124561 ggcatgctgg aaaaggaaaa actagagagg cggtaaaaag 
atcagtggtt ggaagggttg 124621 agggtgggag gcatgaatac gcagagcccg gaggattttt agggcagtga aactcctctg 124681 tgtgatgcca caacaataga cacatggcat gacacatttt tcaaaaccca taggatgcac 124741 aacaggaaga gtgaacccta atgtaaacta cgcatttcgg gtgattatgg tgtgtcaatg 124801 taggctcaac aattgtaaca aatatatcac tctgctggag gatgttgaca gtgagggagg 124861 ctatgcgtgt gtggggacag ggggtatttg ggaactccct gtgtcttctg ttccattttg 124921 ctgtgatcct aaaactgctg aaaaaaaaaa gtctacttaa aaaagaacaa ctcccaccat 124981 ggcagatttc aggctaccaa catgacatca ctgaatccag agttgggaag agatgtacac 125041 aattgttttt catgggcaga taggagccag ccccagcaca ccaccgcatg agtgagtgag 125101 tgaatgagtg aatgaatgaa tgaatgaatg aatgaatgaa agaaatgtgc cagatgctgg 125161 gaataccagg gacgccatca ggctctttag ggagaagtct attcagacct gcgagagctg 125221 ggcatgcagc tgctagtcgg atttcaattc caactgagga gataaaggat cagggagtcc 125281 catctttgat ttagtcagta aatcgaagat atgggacatt tgttttcttt gggtgcttta 125341 cctttaatac caagacttgt caaggcatct ctattaagtg gcccactcag acaagtgcaa 125401 agtgagtgag ctggaggcga aaaaaacaga ggcctgacca aaaggctcta gagcaacaag 125461 aggctcatga tttcccatca caagaagtcc agagacaaga cagccattgg ctggcttaac 125521 acaatggccc aacaatgcat cagtgatgtc tctgagcatc agtggtgtct cttctcagtg 125581 gcttacagct ccagacatca catgcagacg tgacaatgtc taatggagaa ggggcacatt 125641 ctctcccaat tctactcaat gagaaaccct ttccttcctg gaggtccttc agcggccatt 125701 cacacagatc ccactggcaa gactggatcc catgctgttc ttacaccagc catttccaag 125761 agaatgaggc caccataact ggcttaggcc aattggtagg acccttcagg aggcagaacc 125821 tgggaaccta ggcagggctc tgcccacagg aagaagcagg tgagcgtgtg ggggtggcgg 125881 atggttgtca aaacccaata tacagggaac attacaggaa gagtcactta ctttacaatc 125941 atgtcatttc ataaacattt cttgacataa ttagggaaca agaggcagca tgaaggagca 126001 cagcttccct gccattcccc ccgccacagt gagacgcctg caaggacaca gacacagcct 126061 ttccatcagc cttcctctgt ggatccccca tccacactga gccagcaggt caatggggac 126121 aggttccatg ctgttcagag aaaccagcac tgtacacagt ggggccctgc atccactgca 126181 ggggtggacg cctcagccag cccaggactg tgccgtcatc cctgagactc actgatcctg 126241 
atgtgagact ctccccgaca ggggcttcca ggcatggctc tatgcagggt catttgtggc 126301 cacactatga cgtgggcgat cccatttcta cacatctaaa ttccccaggg caggcagcat 126361 gtggtatctc actccctcgg ctctaacctg gttatgggaa gttgctaggt aactgcttcc 126421 cacactggag gaggcttagt taccatggaa agtccagatg ggtggggggg cacagaggaa 126481 gggagcactg gctggggggt ctgcagctgg gattctgtcc ccactctctc ctgagcccct 126541 ggacctaagg ggagtctctc aacccctctg ggcctcagtt tcctcctagt aaaatgaagg 126601 aaaggctggg tcacttcttg ctctaaacca ctgatgcatg tggatggcat accctcagtg 126661 agcactgatc ctcagacacc tcttcggcaa ggagtgactc aagatgggtg cagaaaagga 126721 atgcattaag tttatgtctt agcctgggca acatagcaag actctgtctc tacaaaaaac 126781 attttttttt tttttgagac agagtttcac tctgtcaccg aggctggagt acagtgacgt 126841 gatctccact cgctgcaacc tccgcctccc gggttcaagc gattctcatg cctcagcctc 126901 cctagtagct gggattacaa gcatgtgcca ccacacccag ctaatttttg tatttttagt 126961 agagacaggt ttcaccatgt tggccaggct ggtctcgaac tcctgacctc aagtgatctg 127021 cctgcctcag cctcccaaag tcctgagatt acaggcatga gccacagcgc ccggccctca 127081 attttttttt tttaaattag ctaggtgtgg tacatacagg cacctgcctg tagtcccagc 127141 tactccagac gcagaggtag taggatcact cgagctcagg agtttgagac tgcagtgagc 127201 tatgattgtg ccaccacact cagcctgagt gacagagcaa gaccctatct taaaaaacaa 127261 aaacaaaaca aaacaaaaca aaaggccagg cgtggtggct catgcctgta atcccagcac 127321 tttgggaggc tgaggcaggc agatcatgag gtcaggaatt tgagaccagc ctgaccaaca 127381 cggtgaaacc ccatctctac taaaaataca aaaattagct gggcgtggtg gcacacgcct 127441 gtaatcccaa ctactcagga ggctgaggca ggagaatccc ttgaacctgg gaggcagagg 127501 ttgcagtgaa ccgagattgc accactgcac tccagcctgg gcgacagacc gagactctgt 127561 ctaaaaaaac aaaaacaaaa aagttttttc tttaaaaaaa tagagaaact agcaagcttg 127621 tgtctctatt aggataaata aaagctatag gccaggcgca gtggctcaca cctgtaatcc 127681 cagcactttg ggaggccgag gcgggtagat cccctgaggt caggagttca agaccagctt 127741 ggccaacatg gtgaagtccc gtctctacta aaaatgcaaa aattggctgg gcatggtggc 127801 tcatgcctat aatcccagca ctttgggagg ctgaggtagg tggatcacct gaggtcagga 127861 gtttgagacc agcctggcca 
acatgatgaa acccgtctct actaaaaatg tgaaaattag 127921 ctgggcatgg tggcgggcgc ctgtaatccc agctacttgg aaggctgagg caggagaatc 127981 acttgaatcc aggagatgga ggttgcagtg agctgagatc acaccattgc actccagcct 128041 ggtcaacgag agtgaaactt catcccaaaa caaaacaaaa caaagcaaaa attagccggg 128101 catggtggtg ggcccctgta atcccagcta ccctggaggc tgaggtggga gaattgcttg 128161 aacctgggag gtggaggctg caaggagtcg agatggcgcc actgcactcc agcctgggcg 128221 acagagcaag accctgtcta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 128281 aaaaaaaatt ggccgggcgc agtggctcac acctgtaatc ccagtacttt gggaggtcaa 128341 ggcaggcgga tcacctaaag tcaggagttc aagaccagcc tgaccaagat ggtgaaaccc 128401 tgtctctact aaaaatacaa aaattagcca ggcatggtgg ggggcacctg taatcccaac 128461 tacttgggag gctgagacag gagaattgct tgaacctggg aggtggagat tgcagtgagc 128521 cgagatcgtg ccactgcact ccagcctggg caacagagca agattgcacc tcaaaaaaaa 128581 aaaaaaagtt atagagattc tcccttctgc taggagaaat ttcacttctg acatcaatat 128641 tcacaagtcc tggggctccc taggtcctct tctgaaccga ggcttcctaa aaaaaaaact 128701 taggatgcaa tgtgtgtccc taaaacaccc agcttgctaa agcatgatgt cctttcagac 128761 agggtctccc agttcttggg ctctgctggt ccttctgccc cagatggagc catggcacag 128821 gcccagccct ctctaggttg ctgggaccca tcctgactcc tccaatatgg gccactgcat 128881 aagcctcttg ggtctgctat ttgttccttc tttcattctg agatccagcc ccaccttctt 128941 cccttgcttc tgcaataaga tctctgccca ggcccagatg ctgggaccag tgcagtaccc 129001 agctccccgt tcacctgcct tgttcatgcc aatcccattc ccagaatcca tatgacattt 129061 ccaagcacta gctgccatcc tcctgtctac acgcccgaat ctccatgatg cagtggtggc 129121 tccctctccc accatgcatg tcacccactc agcactctga tgaccttgtg ccggagcccc 129181 tgctcctgct ggctgcacaa cacccccacc accatctgta cccacagtca caggaggccc 129241 ctgtaatccc cgaagacagg ttggacccca tcccaccccc agaagctggt ctttactccc 129301 acttccttct cggtctccac ctccagggtc cagttttaag ttttggccac tcctgatccc 129361 tccccaccac catacaaaca tgacaaacag ggcaatgtct gagtgtggct aaggacgctg 129421 cagtcatcat atcaggtaac attctccatc cccactcaac aactgagttt tctcctggct 129481 gggagcagaa ggagaaattc acatctttta catatagagt 
gagcctctct agggtgtcag 129541 ggtcagtggt ctctcagcta acagctttgc tctggggcat tcaagaacct tctggccctg 129601 cagagaaaac cccacccttg agctgtacac cctgtccctc ccctcctttc taactattct 129661 ctcttctctc ctccaggcaa tgtggagagg tggaaaagtc acaagctttg agtcagaaac 129721 acgtgggctg cagtccagac tctgctactt acgctatcag cctggacaag tttcttgacc 129781 actctgagct gatctagaga actggggata atagcactga gttgtcatga gaagtaaaca 129841 aaattatagg aactacttgg taagaaccta gcaagtggga gagacatcat gactgctttc 129901 ttcattcttt ttattcctat ggcccaggcc aagcaacagg gcatgaagag aagaaactca 129961 gctgggcgtg ggggctcacg cctgtaatcc cagcactttg agaggctgag gcgggcagat 130021 catgaggtca agagatcgag accatcctgg ccaatatgat gaaaccctgt ctctacaaaa 130081 atacaaaaat tagctgggtg tggtggcgca cctgtattcc catctacttg ggagctggga 130141 gagggagctc ccgcgtaggg agaatcactt gatcactgag gcaggagaat cacttgaacc 130201 tgaaaggcgg aggttgcagt gagccgagat cgccccactg tattccagcc tgggcaacag 130261 agtgagactc catctcagaa aaaaaaaaaa aaaaaaaaaa gagaagaaac ttggtccatt 130321 cctaaggttc ctggccctcc caggagtctt gcccagcctc tacctcaccc ataactactt 130381 tccatttttg gcagaaacag cctcttcttc acccctccct gccccacccc agtgggactt 130441 gttgctgcaa cagcctcctt ccctttgaaa cttgcctgtc ctgtgttccg ccctccccgg 130501 ggtaatctct ctgcacccat caaactgata acttgccaac acagcacatc aactttggga 130561 gagctcccct ccctcacctc acccaccctc ctccagggta cacctgtcct gtgatggcca 130621 cctgcaaggg acagagtgct gagggtgttg gccaagtaaa ggaataattt ttgaagggag 130681 aatgccaggt taggcagggg tgtcagaggc atcccgggtc acttcctctg gtggctctcc 130741 actgctttgc aaggcatccc acccagatca gtcactggct ttactagaat gcccttcccc 130801 tgccaccccc agcttggggt tcaagctgcc tcttggaggt ggggctgcat ggaggcaaga 130861 catctgggcc accctcaacc cctgcacctt gccctgcacc tgccctcgac ccctgcacct 130921 ttctccttct tcggtaactg tgctcacata ggtccctcca cctccaccta ggggcctcct 130981 tcctttctat ttacctctgt tcacatccta gccagccttc aaggccaggg aaaaggccac 131041 ccccttcaca aagtcatctt tgattccacc ctcagcctag agcaagcagc cttgctgaac 131101 cttcttctga cgggtgctct ctctgccagc cttgccctaa aagggtgagt gcactcaggt 131161 
cctgtgccct ggaatgtcca ggttccaggc cccaaggaca aggctgaact aacactaagg 131221 aaataattct agcccaggcc tcttggttgg ggtcagagct gctttaaact ttgcctaaaa 131281 atgtttctgg tgctgccagc cataaagcgg ggaggcactc ccttgagtca ggtgtgtctg 131341 actcaaaagc actccgcctc tgtcatgtgg gtgaagttgt tcacccaagc agccctgcgc 131401 tgcctgcaca gcctgccagg aacctctgct ccaagactct ctgttcctgt gtggctacct 131461 agagaaagca agaaaagacc ccatgcctcc ccgaggcttc ctttctgaaa cttccctagt 131521 ctcagaacct gagcagccct ctttctgtaa tgctaaatgt cactggagcc caccgcgggc 131581 tctccctacc tcgggtccag cccatgctct tggtcaggtg catccatctc aagcatcatt 131641 tttatcctgc cattccccag tttcccagcc ttccatggct cccagctgaa aacaaaatgc 131701 aaactcagcc tggaattcac gcactcaaga gcatctgtga gacaacttgt atttctggct 131761 ccttgtcatt ttcccatccc tgaaatacac atatatgtac atatttacac actggtacac 131821 acatacacat acatgtactt acatacacac atgtacacac acacacacac acacacacac 131881 acacacacac acacacacac acaccatgac ctaaacgagt cagttggttt agtctgcatt 131941 taccttaagc actacattta ctcagtcact cactcactct tcatccactc aactctctca 132001 gtaccaggtc tgttctagag ctggaacata actttcctgc ttccctacca tcagctacta 132061 cctagaatat cctcccctgt cccttcaagc actttgggag gccaaggtag gtggatcacc 132121 tgaggtcagg agttcgatac cagcctgacc aatatggtga aaccccatct ctactaaaag 132181 tgcaaaaaat agccaggcgt ggtggcaggt gcctataatc ccagctactc aggaggctga 132241 aacaggagaa tcacttaaac ctgggggatg gaggttgcag tgagccgaga tcatgccact 132301 ttactccagt ctgcgtgaaa gagcgaaact ctgtctcaaa aaaaaaaaaa aaaaaaaaaa 132361 aaagcattaa agaccagagg gtagaactca agctaatctg ccacggtcat ctctccagtg 132421 accagctctt gctctggatc aatgcacatc aggaacacac cgtggggtag ttctaacaaa 132481 caacccctgt gcggagagtc ttcaggcggg gagaagcgcc aaacccttag gcctcttcct 132541 aggcagcctt tggattaaac cctctctctt cctgatgctg gtcctccatc cgcactggct 132601 ggagaaaagg cctcctttgg actatgcctc ccgggaggac atggatgccg aaatccttct 132661 tcatctttaa agataatcaa aaatgtctac cagagccagg catggtgact gaagcctgta 132721 atcccagcta cttgggaggc ttaggtgaga tcacttgagg ccaagagttc aaggttgcag 132781 tgagctatga ttgtgtcact 
gcactctagc ctgggcaaca gagcgaaatc ctgtctctaa 132841 aaagaaaaaa aggtccactg tgtacaccct accacacata tgcattctct ctctctctgt 132901 gtcacacaca cacacacaca cacacacaca cacacataca cagtcacact gggcaacaga 132961 atggctgaaa agactcatgt tcaattaagt tgtctttggc atttgggttc ttatttctat 133021 gttcctatgg tatttacggc aggggaagag aggaaaccct actaagtttt gagaaagaat 133081 tcccaatccg tttgcagtaa agagggtgag ggggccggac gcggtggctc aggcctgtaa 133141 tcccagcact ttgagaggcc aaggtgggcg aatcacgagg tcaagagatt gagaccatcc 133201 tggacaacat ggtgaaaccc cgtctctact gaaaatacaa aaattagccg ggcttggtgg 133261 tgggcgcctg taatcccagc aactcgggag gctgaggcag gagaatcgct tgaacccagc 133321 aggcgaagat ccgagaagtg agccaagatc acgccactgc acttcagcct ggcgacagag 133381 caagactccg tcttaaaaaa aaaaggcggg tgcagtggct cacgcctgta atcccagcac 133441 tttgggaggc tgaggtgggc agatcacctg aggtcaggag tttgagacca gcctgaccaa 133501 tatgatgaaa ccccatctct tctaaaaata caaaaattag cctggcgtgg tggcatgcct 133561 gtaatcccag ctactcaaga ggctgaggca ggagactcgc ttgaacctgg gaggcaaagg 133621 ttgcagtgag ctgagatggt gccactgcac tccagcctgg gggacaagag agaaactctg 133681 tctcaaaaaa aaaaaaaaaa aaaagcgggt gacctcgggt gaaagcccag gggttagggc 133741 ctaaggcctg ggaacaccct gatgaatgtg acagtcctgc catcaaggaa ctcacaactg 133801 ctgggggaga cagaccatga gaacacagag cctggatgct cagacaggtc tagcgggaag 133861 caaggaaagt ccctgcaccc ggacttagag gcccaggaga gactctacag agggagtgac 133921 caccaggcta aagacaagag gagccagaag agtctcccgt gggctgggga gaaacacgtg 133981 gtagcagcag aagcttgggg aggggaagaa aagtaggtct gaatcctaaa aggcctcgag 134041 gcgcttgggc tgtagcctgc agggataggg acccttggca gggttttgtg cagggaaagg 134101 acataatggt gtttaccttt tagaagagat ctaaggggag aatccggtta gggagcccag 134161 ttaggaggct cttgcaaggc tcttgcaagg atgtccagga agggaggagt ggggagtggc 134221 agcagggaag gagggagggg cagtgacgga accacaccac gtcaccccac atcacatcac 134281 accctagccc agagaaagcc tcttctggtc tggtggccta agggaaagga aagagtcaaa 134341 atgagcccct gtttcccatg ctggtgagat ggaggctggc tagtggaggg ctgggtcgtt 134401 ttggacatgg taaacccagg aggaggggca tgcctggggg 
gcataggaga aggggctgac 134461 aggtgccatt tgggacaagt tgtgttggag gtgcctgggg gatgtgtgac tggagatgac 134521 cagcaggcag cctgatgaac tggcatggat tgaggagagg gttctggagt tgagctagag 134581 tcatccaagc acagggaagc gaggttgccc acggatgggc gagatgtgaa aggcgggaag 134641 gtcagggcag ggtcccgagg aacagcgcgt ttcagagatg gatgaaggag aatgagaagg 134701 agtggcaggg aggcaaacaa acaaacaaaa aacaggaatc accaattcag agaaaacagt 134761 gcaatggcag agtcttcagg ggactgggag cctctgtacc tccaaggctg gagaagcagc 134821 aaagcccctg agtctggaag gtctgtgggg gcaggaactt gcctggtaca taattagtgc 134881 tccataaaca gtcatttaat gaacagatga aagccgtagg tgagttccga cacaggtgag 134941 agacctcggt gtatgcgtat tgtggagcag tcgagtggat cagacgggaa ggttgcttaa 135001 gtcgttcagt cctaggatct ctgcctctcc tcccagtggg cctgggggaa ccccttcccc 135061 ttattctctc tctcaccctc tctggccaac caagaatttt acagaggcat gagagagaga 135121 gcccctgaag ttttgtagag atgctcatga cacagattgc tacagaggaa aaaagtcacc 135181 cctggaaaac tcatctaaga atatatcatg aagtgggcta ttttgggact caaattacac 135241 aagagggcct gggaagtgtg cagggatgaa cacagctaca attaatatta actagaaggt 135301 gtcatgtctt tatcttgtga acacacagcc ctcagtgcct ggcgggtggg cgagaggaag 135361 gtgaatccct cccgccatgt acccaggcag gagttgggtg gttttggggg agggtgtctg 135421 ccggaggaag gaggagggag ccaggcaggg ttgcggggaa ttgctggagg aggcgccagt 135481 tctcaggaag cctgaagtca ccttgggcaa ggtggcctcc gaagttggaa atgtaactac 135541 gatcgggcct cacctagttc agggaaagtc cagactcagg accagagaga ccgggagagg 135601 accaggggct ttcctagggt gtcatggcct catggagtgt ccctggaaaa gaagttactt 135661 agcttaatct atagggcctg tgcctgaaag aaatcagttt ccactctcta aaaagagatt 135721 tttttaaaaa aaatctcaat cacggccgga aacggtgggt caggcctgta atccagcact 135781 ttaggaggcc gaggcaggcg gaccacctga ggtcaagagt tcgagactag cctggccaac 135841 atagtgaaac cctgtctcta ctaaaaatac aaaaattagc taggcatggt ggtgcatgcc 135901 tatagtccca gctactcggg aggctacggc aggagaattg cttgaactag ggaggcggag 135961 gttgcagtga gccgagattg caccactgca ctccagcctg ggccacaaaa gtgaaactct 136021 gtctcaaaaa aaaaaaaaaa tctcaatcac attgcacaga gtattttgct gggccggagg 136081 
gtggaaggcc tggggatcag gctatgtgga aatgcatccc agtatttcca gtcgccagtg 136141 gatcatcttg aactggatgt ctcccctcca gaagcctcag cttattcatc tgtgacacct 136201 cacaataatc cgtttcacgg agtacttcaa cagatttggg gaaaggggga tgggtcaaat 136261 tagataagtg tacattaaaa ggcctccata gcctgcaacc tctctgtgag tggcatggta 136321 tcacttatca acctgccttg taaagactgc tgtgagaatt tgactcccag ctaaactaag 136381 agagtcacca gggcagccag ctatcatgtc tgattcctct gtgttattct cagaggcaag 136441 cgtgttatct atctgtgtta ttatcagagt caagtgtgtt atctagtact gcttatctat 136501 taactgcctc caactgttgg ttgaataagt aagtcattgg tacataaagg tgaatatcac 136561 aaaacaagca caatatttca tattagttag atctaggaaa gaccttttcc aatgtgaact 136621 gttactaaaa tattctgttc tgcttctaac cttctgtcca actgcaaatc acatcacata 136681 atatttgtgg tctttgagag caaaaaccat gccttaatca ttctgtttca atgtgagctc 136741 cgggtgggca aggacttggt tttgctcaga gcctgcactt ggaaacgggc cttgcgcttg 136801 gtaagactcc atgagtattt ggtaaataaa caaatggttt ctaaccctta acacagagct 136861 acgcattcag caaatgttgg ttggattaat tattaaacaa gggggatact ttttctttct 136921 aatatctctt tctaacattc ccgactgagg cggcgtagca gaatggttag gtaggcgggc 136981 tctggaatca gatggtctgg gtttgaattc tgcctattct acttactggc tgggtgacct 137041 agagtagcta ctgaacctct ctgtgcttca ggtttctcac ttgtgaaatg aggataacat 137101 tttttctcct ttttttttct gttgcccaga ctggagtgca atggtgcaat catagctcac 137161 tgcagcctca aagtcctggg ctcaagtgat cctcctacct tggcctccgt aagcactggg 137221 attacaggtg tgagccactc agctcagcta acaatttttc taataggatt acagggttca 137281 tttaaagtgt acataaaata attagcacat agtaattgct acttttttgt gaccatatca 137341 gtctcaacaa ccttcactga ctgcccttta accactggat agacctaaag tcgaattatc 137401 cataatttgg ccccatatcc ccctctttcc agtcccatct tcccatactt taacactagc 137461 tgccctttac tggctgctac ctgtgccaga cattcttcta agtggtctat atccattaac 137521 ttgtttccct tctcaaataa cctagtaagg ttggacttca tttttaagag acagggtctc 137581 actatgttgc ccaggctgaa ctcaaactcc tgggcttaag ggatcctctt gcctcagcct 137641 cccatgtagc caggactact ggcccatcac tgcgcccagc tgaggtggga cttattgccc 137701 atatttcact gatgacaaga 
caaaggccca gaaaataaaa ataactttcc tggggacaca 137761 cagctaatat taggaaagct ggattcattc ccaggactct gtgatatgaa agttcatgct 137821 gttatttacc atctctgata tccttgccat gcaaagtttg gtcagtggac cagcactcct 137881 agcattacct gggggcttgc tagaaatgca gagtctcggg cctcacccca gacctactga 137941 atcagaatct acattttaac aagccccctg gctgcacagg aaaattttgg aagcactatt 138001 gtaaaccact ttctaccagg gatacacgag tactaattaa gtcaaagaag tgttctcgtt 138061 taatccctac accaacgtgg gaggtcagga gagttaagca ttattatccc attttgcaga 138121 tgaggagact aaggctctga aaggcatctt gaagtgactg agtccacata agcctgatgc 138181 cagcataatg ttctgactaa actcagtttt aagtccctcc acctataatg tcttcattcc 138241 ctctctccac cttgaattct tctcctcctt ttaggctcca cttgaaatcc cttcttgctg 138301 gacaaagccc ttcttaccca gttcccactg ggcagattgc tccttccttc atattcccat 138361 aagctccctc cacatacctt tctcttggct caaattcttt aagaaatgta cactgggtgc 138421 aaactgtgta ccaggcacat ggtaggcatt gagtaaatgt ttgttgagcc catttgctcc 138481 caaagcattt ctgactgtgt ttattaaaac actattgggc catgcctgta atcccagcac 138541 tttgggaggc caaggtgagt ggatcacaag gtcaagagtt caagaccagc ctggccaaat 138601 ggtgaaaccc taactctact aaaaatacaa aaattagccg ggcacggtgg caggtgcctg 138661 taaccccagc tactcggagg ctgaggcagg agaatcgctt gaacccaggg ggcagaggtt 138721 gcagtgagcc aagattgcac cacggcactg taacctggtc gacagagtga gactccgtct 138781 caaacaaaac aaagtaaaac aaaacaaaat aaacaaaaaa agcccaaaga aaccactctc 138841 attactccta tcctcttctg tgtttgtctg tctctcccac tgccctgagc tcctgtgtgt 138901 ctccagacta ttgagtctgg catcttgtag ggcgtcaagt cgtaataaac taagtgctct 138961 cccatctaag gacacaatct gtctatatgt acttaagagt ggagcattac caggagtgaa 139021 gctttccagt tcccaagagg ggtaaaggac aagtgaggtg atggacacct tgctgctcac 139081 cactcccaat ctttgggctg acacaggatg ccagaactga ttattgtaaa aaaaaaaata 139141 ataataacaa taatgtgcag acatttgaaa atctaactct cattttttcc agagcatgaa 139201 aggaaggcct agaaatggta agtgaccttt tcaaatcccc tggggtgaaa accaggtccc 139261 caataacctg tcgtttcaca agaaaaggag aaagggtaaa ggcggccgtg atgtcagcga 139321 aagacctcca agacagaaaa ccttgcagtc cccgtgctct 
gcctctttca cgggagacct 139381 gggcaacgca cacgttgcta ggcctcaatt tcttcaactg taaaatgagg gggttggagg 139441 agcttgaaaa gctccttcca gctctgagat gctgcgctat aaagtgagag aaatgtgaac 139501 ctttagttgt tatgatctga aaatattttg aatgtggaga atcaaattgt agagcagtgc 139561 atgaacaaca taatatgtta atgactaaga cgattgtatt taatccgatg ggaaacagca 139621 gagctgctgc caccgctgct gtgtgtgtgc gtgtgtgtgg tgtgcacgca cgtgcggttt 139681 ccttccttga gctcaaacat acattttgaa tcactaatat agtaatcatg tctctggcag 139741 cctatcttgc ccaaacacac acggagagac accacacacc ctacttcaaa ttaaggtggg 139801 ggcgggggga attccactgc tgctctttac atcatttctg tttggatagc tattttttaa 139861 aagaaagttt tacagtgaag gtgttggact cccagcccag aagggctctg atctcagaac 139921 tgggacgcaa gggtggggag ctcgagctct cattagcatt tcgcttttgc acaccccctg 139981 cccgctcctc agccgtcaca atcccgtcct gcagcaggcc cccccaccaa caccggaccc 140041 tcgacgggga aggggtcagc tcggcggccc cagaggccac tgggccagaa cacgtgctcc 140101 ggactccacg tgcgcggaaa atgaactcgt acccgcgacg cggccggcgg gggcggatgg 140161 cagcgagggc cggacagcga acctttaccc cgcggaggag tcggggccgg agcccgcacg 140221 tgggtctgcg acggccccgc cccctggggg cgtcccctca gggactctag ccccggctgg 140281 tgggccctgc aggcccgggc ccgaccgcgt ccgccaggcc gcccggaggt cccggcgagc 140341 ccggcggagg cggcggcccc tccccgcccg acggccacgc ccgcgcggat tggccgctcc 140401 ccgacggcgg ggcggggcgg gacgggccgg ggcagagctc gcggccaggt gaggcgcccc 140461 gcccctcggc ggctccaggt gcggctgtgg gacctcggac cgcggcgggg ccggggccag 140521 ggccggggcc ggggccgggg cgccatggcc gagtcccagc tgaactgcct ggacgaggcg 140581 cacgtgaacg agaaggtgac cgaggcgcag gccgccttct actactgcga gcggcggcgg 140641 gccgcgctgg aggcgctgct gggcggcggc gagcaggcct accgcgagcg gctcaaggag 140701 gagcagctgc gggacttcct ctccagcccg gagcgccagg ccctgcgggc cgcctggagc 140761 ccctacgagg acgccgtccc cgccgccaac gcccggggca agagcaaggc caaggccaag 140821 gcccccgcgc cggcgccggc tgagtccggc gagtccctgg cctactggcc cgaccgttcc 140881 gacaccgagg tgcctcctct ggacctgggc tggacggaca ctggtttcta ccgcggcgtg 140941 agccgggtca cgctcttcac ccacccgccc aaggacgaga aggcgccgca cctcaagcag 141001 
gtggtcaggc agatgatcca acaggcccag aaggtaggcc cccgccttcg cccccacacc 141061 gctgggacct cggccccagt cccctggacc gggccccacc tcccaggcag ggcccggggc 141121 aggccgccct gagcaccctc tggaaaatgg gccccaaccc gcccctactt cctggggctg 141181 cagtgaggcc tgtagagcga atggctggga acgtgtttgg caaattacaa ggtgctggac 141241 agatgtcagc cgtcggtatt ccagcgttca gaggcactgg ctctgctctg tacttctgaa 141301 acttccctgt ccagccttcc tccccaggga agactggggg ccgaggccct gtggttaaaa 141361 tgcaccactc aaaggtcttg atgataacct acgtgctgcc cagaacatat cccgagccgc 141421 tttctcatgt gatcgtcaaa acaacttcag ggagtgggtg gcggtgacct tcccggtccc 141481 acctctgcca tgctgctcct tggaccaccc ctctcctccc atagtcaaag ctgagctggc 141541 gccctcacag tcacagcctc tcctgggccg ggtactaggc cccagctgcc tggaatgaat 141601 gggggacggg gtatattgag gacctactgt gtgcctggcc ctgtgctggg cgccttggca 141661 cagcaggtca cctgactcgg cagctcagga aagggctcct acttagcata tgtgtcctgc 141721 ttgagagttg gaaggagggg ctgtttccag ggtggctggt gaggccagtc cactgtggaa 141781 caggtggaag actttaatct ttatgccttt gggtgggagt cctgggcagc ttccagatca 141841 aatgtaggat gccccactac atttgaactt tcaataaaca acacataact ttttagtatc 141901 agcttatccc aaacatgaca tgagacgtac tgatactgaa aaattcttcg ttgtgtatct 141961 gaaattcaaa ttgaatggga acgttttgta cttttatttg ctaactctgg ctgccctaca 142021 ctgagatttg ttaagaaaca gcagcagctt ttggtgccca attcagaata atacaacagg 142081 agtggcttgg gttcaccact caaattccac tcagtgagtg aggttggagt cctggttccc 142141 ctcgaggctg aagaggcctg aggagggaaa atagagctag aaagaccagt gctcaggctc 142201 atcttcctgg aaggggtgat ttaggtagtt ggggtgatct agatagttgg ctacattggc 142261 taccagaagg aagggttaac tttccactgc gcacatttct accctccctc tctataattc 142321 ctccctgtct gactgagttt tctttgtata ttgtgttctg cagaagaaaa aggcaagatt 142381 ggacacagaa gagggaaaaa ggatgtaggc ggcaatacag gctgcagctg gggttagggg 142441 tcaagtctca tgatgtcata ctaacaaagc ctctatttat ttagcattta atatgtgcca 142501 tccactgtgt taatgactca cctgcatttt ctcattagct tttcacaaca accgggtgaa 142561 tatgtgtttt ccttattttt ggagaaaaag acatcaggat tcagaagttt cttgccagag 142621 tttccatagc tagtaagtgg 
ctgagccagg acccaaactc caaagccctt gctcttaacc 142681 actgtgctac actgccccag tgtctgaaat aatggtgcag tccgattaag caaggggttt 142741 gaatgaagct agcctggtca caacctggag tgcaggtggt gtacccctgt tactagaagg 142801 gggagactgg gcttctggcc actgaggcat ctagtcccat gtgggatgac aggactcttt 142861 cctgcatagt gaggtatttt tgactcttta aatatttaac aacagaaact tttaacaatt 142921 gtgaacccat ccaataagta caggtttaac cagtgatttc ctgagggctg actgtgtgcc 142981 aggcactacc tagccatgaa gaagatacac ggtgtggctt tgccctccaa gagcttaaca 143041 aatgctaggg ctctgctgga aagaaaagag tttttaaggc tgaatgtgcc taatctggga 143101 aggcttcctg agggaggtga gttgcagaat tggaaaggga gagggtcaac tggaatgggc 143161 actgtttcag taagagagct ttttgcttcg aaaaagcaag gaggtttgat tggcaggtaa 143221 aaagagctgt cgtgggggga ggtgagtact gtgtggggcc tggtttgggg accctaaaac 143281 tgggcccaag gcaggcaacc acccccaagg aagcaacagt gttggacact gtctgcgccc 143341 acccaagtat caggacctcc gggtccccac ccttaatcac tgtcctgcct gtccgcacag 143401 cctcttcttc ccgaagaccc tgagaagaag ttgggacaga gaggttgacc ctgtttcaca 143461 ggtgaggaaa ctgaggcaca gagaaggaaa gagacttgcc cagaggtcac gtgggcagag 143521 gtcacttggt tctagaaaaa cccagcttct gctctaggga ctgcaccttc tgatcactgc 143581 ctgtccccca cccccacgcc agcctgtcca gctggcaggc agccttccct ggccattaac 143641 ttagatcctt cctcctcccc tttcgccctg agggagggag tttcctttgt ttctagtgag 143701 agtctggctg ctgtgaggcc aggtggggca gactgagatg gtttaca // LOCUS HSACKI10 6483 bp DNA PRI 26-JUN-1997 DEFINITION Human gene for acidic (type I) cytokeratin 10. ACCESSION X14487 NID g28316 KEYWORDS acidic cytokeratin; cytokeratin; cytokeratin 10; cytokeratin type I; intermediate filament protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6483) AUTHORS Rieger,M. TITLE Direct Submission JOURNAL Submitted (08-MAR-1990) Rieger M., German Cancer Research Centre, Im Neuenheimer Feld 280, D-6900 Heidelberg, F R G REFERENCE 2 (bases 1 to 6483) AUTHORS Rieger,M. 
and Franke,W.W. TITLE Identification of an orthologous mammalian cytokeratin gene. High degree of intron sequence conservation during evolution of human cytokeratin 10 JOURNAL J. Mol. Biol. 204 (4), 841-856 (1988) MEDLINE 89125611 COMMENT See also , and J. Invest. Dermatol.91:572-578(1988) for related data. FEATURES Location/Qualifiers source 1..6483 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="blood (peripheral)" /clone_lib="EMBL3" /clone="lambda KH10(5)" /chromosome="17" /map="proximal to 17q21" repeat_region 89..420 /note="truncated Alu" -10_signal 1614..1621 exon 1649..2300 /number=1 mRNA join(1649..2300,3142..3224,3592..3748,3836..3997, 4085..4210,4525..4742,5098..5499,5805..6170) prim_transcript 1649..6170 CDS join(1674..2300,3142..3224,3592..3748,3836..3997, 4085..4210,4525..4742,5098..5499,5805..5811) /note="unnamed protein product" /codon_start=1 /db_xref="PID:g28317" /db_xref="SWISS-PROT:P13645" /translation="MSVRYSSSKHYSSSRSGGGGGGGGCGGGGGVSSLRISSSKGSLG GGFSSGGFSGGSFSRGSSGGGCFGGSSGGYGGLGGFGGGSFHGSYGSSSFGGSYGGSF GGGNFGGGSFGGGSFGGGGFGGGGFGGGFGGGFGGDGGLLSGNEKVTMQNLNDRLASY LDKVRALEESNYELEGKIKEWYEKHGNSHQGEPRDYSKYYKTIDDLKNQILNLTTDNA NILLQIDNARLAADDFRLKYENEVALRQSVEADINGLRRVLDELTLTKADLEMQIESL TEELAYLKKNHEEEMKDLRNVSTGDVNVEMNAAPGVDLTQLLNNMRSQYEQLAEQNRK DAEAWFNEKSKELTTEIDNNIEQISSYKSEITELRRNVQALEIELQSQLALKQSLEAS LAETEGRYCVQLSQIHAQISALEEQLQQIRAETECQNTEYQQLLDIKIRLENEIQTYR SLLEGEGSSGGGGRGGGSFGGGYGGGSSGGGSSGGGYGGGHGGSSGGGYGGGSSGGGS SGGGYGGGSSSGGHGGGSSSGGHGGSSSGGYGGGSSGGGGGGYGGGSSGGGSSSGGGY GGGSSSGGHKSSSSGSVGESSSKGPRY" mat_peptide join(1674..2300,3142..3224,3592..3748,3836..3997, 4085..4210,4525..4742,5098..5499,5805..5808) intron 2301..3141 /number=1 exon 3142..3224 /number=2 intron 3225..3591 /number=2 exon 3592..3748 /number=3 intron 3749..3835 /number=3 exon 3836..3997 /number=4 intron 3998..4084 /number=4 exon 4085..4210 /number=5 intron 4211..4524 /number=5 exon 4525..4742 /number=6 intron 4743..5097 /number=6 exon 5098..5499 /number=7 intron 
5500..5804 /number=7 exon 5805..6170 /number=8 polyA_signal 6144..6149 polyA_site 6170 BASE COUNT 2075 a 1267 c 1341 g 1800 t ORIGIN 1 aagcttctaa ttgcagttca accacctgtt acatatcttc aggaaaaaat cacaacctct 61 caacttcaac ttcctcttct ataaattaga aataacaata accacacctg taaccccagc 121 actttgggag gccaaggcag gcagatcaag aggtgaggag attgagacca tcctggctaa 181 catgatgaaa ccctgtctct accaaaaaga caaaaaatta gccaggtatg gtggcacaca 241 cctgtagtcc cagctactcg ggaggctgag gcaggagaat ggcgtgaacc cgggaggtgg 301 agcttgcagt gagccgagat ggcgccactg cactccagcc tgggcgacag agcaagcctc 361 cgtctaaaaa aaaaaaaaga aagaaagaaa gaaagaaaga aaagaaataa taataaccac 421 cattcctatc tcaacagctt gttctagaaa tttttaaagc acagtatcac aaacagcact 481 acataattgt aaaacatgta tgaatatata catccaaaca acagcaatgt catagcctat 541 gggtagatat aatcttatac aatgtaccaa aatcccaatt tacttcacta gacaaactgt 601 tataccaaat tctgtacaca gtatatccaa gaaaatgtgt tgtttttatt gagaaactga 661 acctagcttg ggaacacatg tgcacagtct agttcataat atttggtgca agtatcattc 721 tctaatatag atttacattt ttgcaagcaa atttttactt gcaatcgtaa catatccaaa 781 ttttcccttt ttactcaatc agaacttagt gtaaagtact acaagttagt tcttcggatt 841 tcatgctaag aaaataatgc agattttctg cattattatg gtcttcacag aaaccttaac 901 tatgatgaat ttaaaagtgc aaaataatcc aggataactt tatgatttca cattttttaa 961 tgttaaaaat aatgccatca ttaattagaa aattctaaaa tcattacttc cactttctta 1021 ggcaaaatat caatatactc tcatttgcca aataaattaa aagatctcct acaaacacaa 1081 tctcctaaat tgtggtttta tggctttaat gttttatgtg tggcaactat tgatgctagt 1141 taaaatttta gaaactcttt ctttttgatt ccctacagtt gtctacaaga accttattgt 1201 agcatgatcc tgccagactt tatactattt gttgctccaa ttaaaactgt ttaaaacatg 1261 aatttgaaaa atcttatttt aactataatt ttgtagctga aacttttttt tctaaacttt 1321 gcaaacattc tatgcaacct gaattagtgc tgagaaaatt ggatcttaat ggttgctcaa 1381 tgttcttcaa caggtgaaaa gcataataaa acatgctcat ctgaactcca cccattttca 1441 atttcaacat agcatacctc gtgtttattc ttagggcaaa ttcaaaattg tacatattag 1501 gattggttat tactgaagat aatttatgca atcataagcc aaagatgcta agttggcaaa 1561 aagaaaacaa tgtaagtaag caaactctaa 
cacatgtgga cacaccctct cagtatataa 1621 aggcttgtca ctgtccttgg tagcaggcac tccctgggct aaacagcatc accatgtctg 1681 ttcgatacag ctcaagcaag cactactctt cctcccgcag tggaggagga ggaggaggag 1741 gaggatgtgg aggaggagga ggagtgtcat ccctaagaat ttctagcagc aaaggctccc 1801 ttggtggagg atttagctca ggggggttca gtggtggctc ttttagccgt gggagctctg 1861 gtgggggatg ctttgggggc tcatcaggtg gctatggagg attaggaggt tttggtggag 1921 gtagctttca tggaagctat ggaagtagca gctttggtgg gagttatgga ggcagctttg 1981 gagggggcaa tttcggaggt ggcagctttg gtgggggcag ctttggtgga ggcggctttg 2041 gtggaggcgg ctttggagga ggctttggtg gtggatttgg aggagatggt ggccttctct 2101 ctggaaatga aaaagtaacc atgcagaatc tgaatgaccg cctggcttcc tacttggaca 2161 aagttcgggc tctggaagaa tcaaactatg agctggaagg caaaatcaag gagtggtatg 2221 aaaagcatgg caactcacat cagggggagc ctcgtgacta cagcaaatac tacaaaacca 2281 tcgatgacct taaaaatcag gtaagaggta tttttaaatc cagctttaag tatcttgtcc 2341 atgtaatcca gacagatgaa tcttaaatta agcacaatgt ggctgttcac tatgcttacc 2401 catgttactt tcttccttca aaaataaccc agtctcatca aagataaaca tctgtgaaac 2461 tatggtcatg gcaatcttca tccagcaagt gtgctacttg tcttaagagg atgggagatt 2521 tactaagcac ttttgaggtt ttaatgagca tacaatgagt ccacagttaa aatatgctag 2581 gctatttaca aatgtagaaa ctgaaaaaaa aaatcatgat atgaatcaga acaaaatgtt 2641 attcagactg ataacaagcc atattcagta ccaacatggc aagaaaaata aattttccag 2701 tatgaaaatg ggacactgct tgcttctaag gaatttctga attgtaccta ttgtgtacca 2761 gttcagagtg tatttattta ttagtattta tcatgagtta aacaaatgca ggtgtgagtc 2821 agccaaagca tggctgaaat acatggaaat cacatagtct aaaagaggag ggcacactta 2881 caggaataca tctatataat tccagttagt tttcagaaag gaataattcg tgtacagaaa 2941 tacaagactg gagaaattcc aagagaacaa ataattcaaa gttaagtata tgggtaagcc 3001 tgcaatattt catatttaaa ataaaaaatt ttcccaagat tttgtaagag aacaacataa 3061 aagtgcagag tgcatctatg tcactacaaa agccatatct gcatctgacc tcttctcaaa 3121 taactgtgcc tctccctcca gattctcaac ctaacaactg ataatgccaa catcctgctt 3181 cagatcgaca atgccaggct ggcagctgat gacttcaggc tgaagtaagt taagtgatcg 3241 ttgtataata ctatcacaac gaatacatca gtggttttta 
acaatgactt gggatgccct 3301 caataacatt tacatttttc tgaattcacc caaagttaaa tagtattgga gttatctgag 3361 aaattttcca tgtcagtgtt acctttttgg caatattaaa ggaagaaaat gcatattaaa 3421 gtaactgcta aggttttttc cattaaacca ctattacttc taagagaact gtacatgaca 3481 aatattgcca ttacatgaga tcaactatgt agttgctttt taaatagtct ctgcccagat 3541 acatctcccc tatataagtt ataaccagta ttgatatcat gcttgtttca ggtatgagaa 3601 tgaggtagct ctgcgccaga gcgtggaggc tgacatcaac ggcctgcgta gggtgctgga 3661 tgagctgacc ctgaccaagg ctgacctgga gatgcaaatt gagagcctga ctgaagagct 3721 ggcctatctg aagaagaacc acgaggaggt gacacaaaag ttatactttt cccagccaaa 3781 agagagttca ttatggtcct cgtgtagcca ataaatcttt ctgttcctca aacaggaaat 3841 gaaagacctt cgaaatgtgt ccactggtga tgtgaatgtg gaaatgaatg ctgccccggg 3901 tgttgatctg actcaacttc tgaataacat gagaagccaa tatgaacaac ttgctgaaca 3961 aaaccgcaaa gatgctgaag cctggttcaa tgaaaaggta aagtaatctt ccttatagtg 4021 aaactcatgg aggttttatc atttcagaat ttcctcaccc ttttccttgt ttttaatact 4081 ctagagcaag gaactgacta cagaaattga taataacatt gaacagatat ccagctataa 4141 atctgagatt actgaattga gacgtaatgt acaagctctg gagatagaac tacagtccca 4201 actggccttg gtatgttaac tctcatgaaa tgacttcaac tttatcatac aaagtttcat 4261 gctcacctaa gaatatgcaa tgcaacaaaa aaatgcagag ttggaggtaa gaaagagaaa 4321 acaaagtgaa gctcatgtta atggaggaaa agtactacta gtgttgatct aaaagtgctg 4381 aaactgaaat ggtgccatta aacatacaac aaattctgtt cattttctta ttcttctata 4441 taatgcctta ctaaataatc aaataagcgt caccatactc aactgaacaa ggaagtcact 4501 aagccacaaa aaaatccgtt tcagaaacaa tccctggaag cctccttggc agaaacagaa 4561 ggtcgctact gtgtgcagct ctcacagatt cacgcccaga tatccgctct ggaagaacag 4621 ttgcaacaga ttcgagctga aaccgagtgc cagaatactg aataccaaca actcctggat 4681 attaagatcc gactggagaa tgaaattcaa acctaccgca gcctgctaga aggagaggga 4741 aggtaaatta taacatgaaa agttatccca gtttctttta ttcaatattc cagatagcaa 4801 ggcttatcta aaccccaaga agatgccaga gaatgagagg aagggaggag agagggtaga 4861 gtacagaaaa aggagtacgc aaccgcaatc tcactttctc atgaatttgg cccaaaatga 4921 ttcttaagag ttctgtgaac ttaacattgt tttcaaagga tgggttttaa 
aatatatacc 4981 tggcagggtt ttattttttc aacacgtttt gcttattttc taaattaacg gcaactggaa 5041 agctacccac cgttttccaa cgttagagat aaccgaatgt gacctcaccc cgtttagttc 5101 cggaggcggc ggacgcggcg gcggaagttt cggcggcggc tacggcggcg gaagctccgg 5161 cggcggaagc tccggcggcg gctacggcgg cggccacggc ggcagttccg gcggcggcta 5221 cggaggcgga agctccggcg gcggaagctc cggcggcggc tacgggggcg gaagctccag 5281 cggcggccac ggcggcggaa gctccagcgg cggccacggc ggcagttcca gcggcggcta 5341 cggtggtggc agttccggcg gcggcggcgg cggctacggg ggcggcagct ccggcggcgg 5401 cagcagctcc ggcggcggat acggcggcgg cagctccagc ggaggccaca agtcctcctc 5461 ttccgggtcc gtgggcgagt cttcatctaa gggaccaagg tcagcagaaa ctagctgggg 5521 taatctagaa ttagttttaa cttcctgtga tggttttttt gcgctttaag ctctagagtt 5581 gttttaaaaa attaaaaatc ttagagacgg ttccgtttgc atttgttcac aaactactct 5641 taacaccagc cgtgaaaaat ggcatgatca aaatgtcata ccttaagcat ttttttgggc 5701 ttaacaatgt aaagttgaaa tttccttctt tttacaatat ttgcttgtta attactaagg 5761 atccctacag actgtttaaa attttttttc catcattcac acagatacta acaaaaccag 5821 agtaatcaag acaattattg aagaggtggc gcccgacggt agagttcttt catctatggt 5881 tgaatcagaa accaagaaac actactatta aactgcatca agaggaaaga gtctcccttc 5941 acacagacca ttatttacag atgcatggaa aacaaagtct ccaagaaaac acttctgtct 6001 tgatggtcta tggaaataga ccttgaaaat aaggtgtcta caaggtgttt tgtggtttct 6061 gtatttcttc ttttcacttt accacaaagt gttctttaat ggaaagaaaa acaactttgt 6121 gttctcattt actaatgaat ttcaataaac tttcttactg atgcaaacta tcccaatttg 6181 tcagaattta tctttactta agtacataat actctttaaa attaaagatt agtaacccat 6241 agcagttgaa ggttgatgta tccagaaatt cggaagacag aactattgtc atgccttttc 6301 taagtttttt aatcatgtat gttcagacca ccgtcagtaa attcactgag taaagtctgt 6361 aaatccccaa tattactctt taagatacac aatatgtgga aggctcccag ctctctggct 6421 ttaaattatt tcaatcctgg aaattctgga atatctcaaa tataaccccc aaaataataa 6481 taa // LOCUS HSACTH 8658 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens gene coding for ACTH and beta-LPH precursors. 
Gene codes for the common precursor of the pituitary hormones corticotropin (ACTH) and beta-lipotropin (beta-LPH). ACCESSION V01510 K02406 NID g28341 KEYWORDS Alu repeat; beta-lipotropin; corticotropin; lipotropin; opiomelanocortin; proopiomelanocortin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8658) AUTHORS Takahashi,H., Hakamata,Y., Watanabe,Y., Kikuno,R., Miyata,T. and Numa,S. TITLE Complete nucleotide sequence of the human corticotropin-beta-lipotropin precursor gene JOURNAL Nucleic Acids Res. 11 (19), 6847-6858 (1983) MEDLINE 84041488 REFERENCE 2 (bases 1 to 8658) AUTHORS Takahashi,H., Hakamata,Y., Watanabe,Y., Kikuno,R., Miyata,T. and Numa,S. TITLE Direct Submission JOURNAL Submitted (27-OCT-1983) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..8658 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="p23" CAAT_signal 616..624 TATA_signal 653..659 mRNA join(681..766,4475..4626,7513..8345) exon 681..766 /number=1 intron 767..4474 repeat_region 1086..1405 /rpt_family="Alu" exon 4475..4626 /number=2 gene 4495..8184 /gene="POMC" CDS join(4495..4626,7513..8184) /gene="POMC" /codon_start=1 /product="proopiomelanocortin" /db_xref="PID:g28342" /db_xref="SWISS-PROT:P01189" /translation="MPRSCCSRSGALLLALLLQASMEVRGWCLESSQCQDLTTESNLL ECIRACKPDLSAETPMFPGNGDEQPLTENPRKYVMGHFRWDRFGRRNSSSSGSSGAGQ KREDVSAGEDCGPLPEGGPEPRSDGAKPGPREGKRSYSMEHFRWGKPVGKKRRPVKVY PNGAEDESAEAFPLEFKRELTGQRLREGDGPDGPADDGAGAQADLEHSLLVAAEKKDE GPYRMEHFRWGSPPKDKRYGGFMTSEKSQTPLVTLFKNAIIKNAYKKGE" intron 4627..7512 /gene="POMC" repeat_region 5334..5657 /rpt_family="Alu" repeat_region 6511..6658 /rpt_family="Alu" repeat_region 6776..7112 /rpt_family="Alu" exon 7513..8345 /number=3 BASE COUNT 2105 a 2225 c 2229 g 2099 t ORIGIN 1 ctgctcttca cagcatcacc ctctccccat ttaatggttt aggttaacag gactttttcc 61 ttgaggcttg ggacacggaa gggagcctcc 
cctaaaccag gcccttggag agcaggcccc 121 aggggagcag tgcaactcac cttcacaccc acaagacggc tcctgacttc tgctccctcc 181 tcccctcccc aaagtggaac agagagaata tgattcccca cgacttccac atcacagttt 241 ccaaacaatg gggaaatcgg aggcctcccc gtgtgcagac ggtgatattt accgccaaat 301 gcgaaccagg cagatgccag ccccagcacg cacgcaggta acttcaccct cgcctcaacg 361 acctcagagg ctgcccggcc tgccccacac gggggtgcta agcctcccgc ccgttctaag 421 cggagaccca acgccatcca taattaagtt cttcctgagg gcgagcggcc aggtgcgcct 481 tcggcaggac agtgctaatt ccagcccctt tccagcgcgt ctccccgcgc tcgtcccccg 541 tctggaagcc cccctcccac gccccgcggc cccccttccc ctggcccggg gagctgctcc 601 ttgtgctgcc gggaaggtca aagtcccgcg cccaccagga gagctcggca agtatataag 661 gacagaggag cgcgggacca agcggcggcg aaggagggga agaagagccg cgaccgagag 721 aggccgccga gcgtccccgc cctcagagag cagcctcccg agacaggtaa gggcgcagcg 781 tgggggaccc gtgctctttc cccgggatcc cctgtccccg tcctcgcgat gcagtcggcc 841 ggctccggct ccgaaggcgg acctgggcgc ctctggctct ccgcggtccc gagttctcga 901 caaactttct gcgccgactg cggcatgaga agccgccagt agctgagctg gagggcccac 961 gtccggcccc tgggcggacg gccgcgaagc tgcaggcgct gtctccaggg agccggcggc 1021 ctcctctccc ccaggggctc gcggcggtcc ggaggctccg agagcttgct aggaggtctt 1081 gggacaaccc ggtctttttt tttttttttg agacggagtt tcgctcttgt tgcccatgct 1141 ggagagcaaa ggggtgatct ctgctcaccg caaccttcgc ctcccgggtt caagcgattc 1201 tcctgcttca gcctcccgag tagctgggat tacaggcatg cgccaccacg cccggctaat 1261 ttttgtattt ttagtagtga cggagtttct ccatgttggt caggctggtc tcaaactccc 1321 gacaacaggt gatccgcccg ccttggcccc ccaaagttct ggcattacag gcgcgagcca 1381 ccgcccccgg ccagcccggt cttttagtat ctcttgctcc cagtttccag gataggtgtc 1441 acatcttgaa agtcaaattc catacacgct atcgcaaatt aatgttggaa acggggcagc 1501 agagaaaagg ataaaagtca taatgaacgc cctgccttcc ggattttttc ggattcagac 1561 ccctgaatcc ttgtttcctt gcccacctta gcgcacccga ggtggccgcg ctatgataat 1621 tacatgataa ctgggtcaat tacaatgcag aatagttggg tctcttctct ccaagaccta 1681 gctggggtta aaaacaggtg gccggggcgg gagctgtcct agatcctgaa acgcactgtc 1741 tagtttcgga tgccctcaac agaaccgggg tggacggttt atggcgcaga 
tcctgggttg 1801 agggcacggg cagccatttg gaatgatcaa ggctcaggta aggggcgttt ccagcgaagg 1861 agagacagtc cacttggcat ttggattccc caaattcttc atgtttaaat ggggcaggga 1921 gggttcttac agaatggctg gaaggagcca aggaaaataa aagtgtgtgt ggattttttt 1981 tgtgtgtgtg tcagtttata aactctgcac agattatggc cactttaatg acttactgtt 2041 cctttgatgc ttttgttata ggactcgatg catgtatgtc atggtgtaag gacaaaactc 2101 ggcccctgtg ctcctctaat ctttacaaaa ggtcatggcc agcgtgcagt tttacagtaa 2161 caagcaaaat gatttgttga gctcatagag agcccctcac acctatgaag ttctaataag 2221 tgtagttcta ctataaagtt aatctcagga tgagcaaatt tcaagtttct atttttccag 2281 agctttccat ttttggatta taatactttc cctacttaaa aaagcacaac atttgatatt 2341 tccccaataa tttgttgctt taaaaatgac acaaaaggta ctatttgttc attgtagaga 2401 actgaaaata cacataagca aatacacata cacataagca aaatatacaa tacaaacaca 2461 agaccatctt tcagggaaga atctgaagtt ttagcaatag cagccatcta accagtttag 2521 caacagaata taagctctga gagggtggga gtgaatatgt taccacattg tacaacacag 2581 cacatagggc ataaggaggg gaaatgctct ctggggcttt ccaggaaggc ctgaagtcat 2641 tgcttctagc aaatggaaat cactccagag tagttatctt tgacaagaat tgaaatataa 2701 ttgagggaac tatcagacct gtaagatttt gttttttcct ttactaatat gttactttac 2761 atttgcattt ggtgacatac gtaactacca tttttctgtg actgtaacat ctgggcattt 2821 ttcagagcta aatgtgctat ggtcaacttg gagctttaat ctaattgcct ggtccaccaa 2881 gttctggctg tgtacttgaa tagatcactg gcagggtaca atgggaacag cctgtccctt 2941 ggagccagga gaggacacca aggttgacca aagctcgttc agttgcccct ttagccgaag 3001 cgcacctggg ccagtcactg gctgccagtg ccatctaatg gctgctctga aaatgctcag 3061 ccttgcccgg caacccttca gaagctagca ccgtgcaggc ccagcgcctg gggaataggg 3121 cgagggtggg gtagagagaa ggaagtggcc tcctgaagta gaaatcagcg cttcagagga 3181 ctttcacttc caaagcctcc cctatataaa aaagatttgg cccacgcctc cccaaatgag 3241 agatttattt taggcaaact tattttaaaa tgccagcgtt cattaggagt gacaagacac 3301 ttagtcatcc acgctttaat gtgaattact tttctcatct aattacattt ctttctagca 3361 gctggctgag aagatcttct gaaatccaaa atgattgtag ggttggcggt gagctgatct 3421 ccggcctcga ggtggcttca gggggcccac ctggttaagg gaaatttggc agtgcgaggg 
3481 tagtgctgga gagaggggtg ggtacagggg gctaggggca ccatggatgc cccctcctta 3541 ctgtcccctg gtgtcttgac ctcagcttct gcccacaggc acttgctgga ttctccaaaa 3601 gtatctgcag tggctgttcc accaggaggt aattcccttc tggtctcttt cccctccaca 3661 tctgcatcct cttcaaatcc tgccatttca gaccacattt gagagctcta gagaacaaga 3721 catctgacac gtgacgtgtc cagaagatga gccagatttc aaagaactga gatctgcttt 3781 aaaaacgaag ctctccaaag ttactggagt ctgggtaata gtgatcacca gagtaatttg 3841 tgtgcaggac atcaaatcag gctgctcgaa atgctgccta aattggccag tggttttatt 3901 tgcttttctg tcaacctaat attcatagga aatagagttt cagaggaatg ataggatcct 3961 ggtggaataa aaagggaaaa gaccatcttg agcaggagtt tcagggtcct ccgtttttcc 4021 caagttactt tcactcctga gatcttgcat gttagaacta cagcttaatg tagtgaaata 4081 ggaaagttct ctgttaggag cttagcctta ccttgtcatg gacattaaag taattgtctc 4141 tctttgggct tcaattttcc catctctcat gggaagggct gaaccaagca atccccaaaa 4201 tagcttccag ccttaacctt tttaggggtc tcgtttaaat agaagataac agggaaatgg 4261 tcacagttta cccaggtcca ttccctcctc cttatcacaa cttataccac cgctgtactg 4321 cacacctcct ttctcagcat tgctgctgtc cttaaaatgc ctttaactcc acaagagagt 4381 gtgttgttaa tgttggctca aggtccttcc tggtgagtgg ccaacattgt tttgctcctt 4441 gcaggggtcc caccaatctt gtttgcttct gcagagcctc agcctgcctg gaagatgccg 4501 agatcgtgct gcagccgctc gggggccctg ttgctggcct tgctgcttca ggcctccatg 4561 gaagtgcgtg gctggtgcct ggagagcagc cagtgtcagg acctcaccac ggaaagcaac 4621 ctgctggtac gtgggccatg actgccatct tggcttagac attagatggg actggagctg 4681 ggaaagctca aaagaaaagg gtgtggggaa agggaaattc attcccagtg ataggcgtga 4741 ttcaatccag ggcaggagca aaactttgca gtgaagtaag aaatgggaga agaaatcagg 4801 gaaggaagca gcttcaggga gaggggttga gtccacaatt tctgcttggt tatccttact 4861 tcttgcccca tcttttatgg agaccttgaa ccctttaagc tagagatggt gctataagag 4921 caataatgga cccctcaatc tattctgtac tttacatctt tagcttccca aactattcct 4981 ttttaagaag ctcatatcac ttgccatttt cattccatat ttcttaccct tttatctact 5041 accggttgca aaaccagcca ggtagttctt caaatcatct ctggaagaag gaaaaaccag 5101 gggccctttt ttttttttct ttaattggtg ccaaatgtct catgtttatt ctggaggact 5161 
ggccttctgc tgtgttcctc tacagtcttt ccagagcatg tgaaggcctt tgcatcaggc 5221 aggagctccc tccaggtcac cacagggtgt atgtatctgc ctgtgggggg tgtgtgtgtg 5281 tgtgtgttgg ggggcataaa tgagtaatga tgccaaatcc agagattaaa aggcacactg 5341 agaccaggcg agatggctca tggctgtaat cccagcactt ttagatgcta aggtgggagg 5401 attgcttgag cccagggatt caagacaagc ctgggcaaca tagtgagacc tccacttcta 5461 caaaaaataa aaaagttagc cagatgtggt ggcatgtgcc tgtagtccta gctacttggg 5521 aggttcactt gaggccagga gtctgacgac acagtaagct atgatcacac cattgcactc 5581 cagtctgggt aacagaatga gaccttgtct caaaacaaaa caaaatgaaa caaacaaaca 5641 aacaaacccc catactgtta gtgtcagtga ccggaatttt aatcttgttg ccatcacctg 5701 gcaggtgctg agggtggaat gtacataact acattctgtg tattttgtca atgcagaagc 5761 tgagttaagg tgaagataga atgaggtcct caaagacaca gaccagtttt catgtgtaat 5821 ataaaataga aacaaagagc ccaggggatt ctgtgagttc cagtttggaa agacccaaga 5881 gtctcttgac ttgagacacc cacagcacag ctcaccaggg agggtgcact ggacacagtc 5941 aggacccatg ggttctagac ccagttttga ggtgtgggac cttgaccagg tcctatcacc 6001 tctctgagtc tcctgtttca ctatctgtcc acgggagggg agtgtaaatt agttttttcc 6061 attgttaacg ttccacagag ttgtaattct gaacacctgg agtaggcaat gtccagctca 6121 acagagtggg taggatcctt ttattttctc ctttgctatt cccaagaaag agagcagcca 6181 gtgagctttt catcttttta tcactgaaaa ctcaaggctg cagcctatgc agccattttc 6241 ctaagctaat atgtaccaca atagagtcct ctagggacaa ggagcagaga cacaggttcc 6301 acagacggtg caatggaaat aacgctagct ttccacccct ccctccagtc agaatgagat 6361 tacagggaaa taagcttgcc ccagagctca ctgggggatc tctcagaaat cagctcagaa 6421 gtcgtgaaag aaccaaggtg cagttttgga ggcttagtgc agagatggag ctggggtagg 6481 gcataaagta ggttttccat cactgaggta aggttgaggc attatttttt attttttgtt 6541 tatttattta tttttttgag acggagtctc gctctatcac ccaggctgga gtgcagtggc 6601 gcgatctccc ctcactgcaa gctccacctc ccaggttcac acaggttgaa gcattattaa 6661 aaatatgttt aaaaatatgg gccctagtag ccagacttct atcacctgga gagattatcc 6721 cccaaatttc agccccactc ccctcctgga cttgaattaa accatatgta tttattcaat 6781 attcttttta tttatttatt tatttttttg agacggagtc ttgctctgtt gccctggctg 6841 gagtgtggag 
tgcagtggtg tgatcttggc tcactgcaac ctctacctcc caggttcaag 6901 cggttctcct gcctcaggct ccagagtagc tgggattaca ggcgcccgcc accacaccca 6961 gcttatttat ttatttatac tagagatggt atttcaccat agttggccag gctggtcttg 7021 aactcctgac ctcatgtgat ctgcctgcct tggcctccca aagtgctggg attataggtg 7081 tgagccacca tgcccggccc tcaatattca ttaagtgcca acaactacca cccgtctgcc 7141 tttcttggag ccactccttt atgtcaggca tatgacagta agactttggt cctgttcaca 7201 aaagctaggg gtggctagat ggctagacaa accatggaat gggatgggaa gtgtgttgca 7261 gttgccagca gaagcatgaa ggggatggga caaaagaggc ggtggcaaga tcttagatgc 7321 ccacgagtgc caagaaagca ggtgggcaga cctgctctgt agggaggcct cgacgcttga 7381 cacgcccgac actgtgccct gtgtcctcgg cacgtggcga gggcggccag ggcctaggcg 7441 cagtgacggg cgcggcagcc gggccggggt gcggggcacg ggctgccctc atgccctcgc 7501 gtcttccccc aggagtgcat ccgggcctgc aagcccgacc tctcggccga gactcccatg 7561 ttcccgggaa atggcgacga gcagcctctg accgagaacc cccggaagta cgtcatgggc 7621 cacttccgct gggaccgatt cggccgccgc aacagcagca gcagcggcag cagcggcgca 7681 gggcagaagc gcgaggacgt ctcagcgggc gaagactgcg gcccgctgcc tgagggcggc 7741 cccgagcccc gcagcgatgg tgccaagccg ggcccgcgcg agggcaagcg ctcctactcc 7801 atggagcact tccgctgggg caagccggtg ggcaagaagc ggcgcccagt gaaggtgtac 7861 cctaacggcg ccgaggacga gtcggccgag gccttccccc tggagttcaa gagggagctg 7921 actggccagc gactccggga gggagatggc cccgacggcc ctgccgatga cggcgcaggg 7981 gcccaggccg acctggagca cagcctgctg gtggcggccg agaagaagga cgagggcccc 8041 tacaggatgg agcacttccg ctggggcagc ccgcccaagg acaagcgcta cggcggtttc 8101 atgacctccg agaagagcca gacgcccctg gtgacgctgt tcaaaaacgc catcatcaag 8161 aacgcctaca agaagggcga gtgagggcac agcgggcccc agggctaccc tcccccagga 8221 ggtcgacccc aaagcccctt gctctcccct gccctgctgc cgcctcccag cctggggggt 8281 cgtggcagat aatcagcctc ttaaagctgc ctgtagttag gaaataaaac ctttcaaatt 8341 tcacatccac ctctgacttt gaatgtaaac tgtgtgaata aagtaaaaat acgtagccgt 8401 caaataacag cagcatggat cggaggagca cagtggtttc catgcggtag gatatttcac 8461 aggacttagt gagcgtgaaa ggaaaatgtg cttcctgccc ccacccccaa atggatcttc 8521 gagggatcag atagtttggg 
tgaaggcaca gggtggctcc agcacctcta ggatggccgt 8581 attttccaca cactccactg agtgggagac tgctcagcta gcacacgtgt aaaggcagga 8641 ttcctgcaag agtgaccc // LOCUS HSALADG 15913 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens ALAD gene for porphobilinogen synthase. ACCESSION X64467 NID g28579 KEYWORDS delta-aminolevulinate dehydratase; porphobilinogen synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15913) AUTHORS Wetmur,J. TITLE Direct Submission JOURNAL Submitted (17-FEB-1992) J. Wetmur, Mount Sinai School of Medicine, Dept of Microbiology, 1 Gustave L Levy Place, New York NY 10029-6574, USA REFERENCE 2 (bases 1 to 15913) AUTHORS Wetmur,J. TITLE RsaI polymorphism in the human delta-aminolevulinate dehydratase gene JOURNAL Nucleic Acids Res. 19, 4307-4307 (1991) REFERENCE 3 (bases 12399 to 12412) AUTHORS Wetmur,J.G., Kaya,A.H., Plewinska,M. and Desnick,R.J. TITLE Molecular characterization of the human delta-aminolevulinate dehydratase 2 (ALAD2) allele: implications for molecular screening of individuals for genetic susceptibility to lead poisoning JOURNAL Am. J. Hum. Genet. 49 (4), 757-763 (1991) MEDLINE 91377738 FEATURES Location/Qualifiers source 1..15913 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /chromosome="9" /map="q34" prim_transcript <1..>15913 repeat_region 482..791 /rpt_family="Alu repetitive sequence" repeat_region 1588..1881 /rpt_family="Alu repetitive sequence" protein_bind 2111..2116 /note="consensus (A/T)GATA(A/G). GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" protein_bind 2123..2128 /note="consensus (A/T)GATA(A/G). 
GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" protein_bind 2143..2148 /note="consensus (A/T)GATA(A/G). GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" GC_signal 2556..2561 GC_signal 2610..2615 GC_signal 2684..2689 mRNA join(2742..2803,10451..10563,11841..11891,12387..12483, 13076..13211,13332..13415,13515..13603,14180..14235, 14321..14408,14483..14569,14901..15030,15643..15798) /gene="ALAD" /note="housekeeping promoter transcript" gene 2742..15798 /gene="ALAD" exon 2742..2803 /gene="ALAD" /note="untranslated, for housekeeping promoter transcripts only" /number=1 intron 2804..10375 /gene="ALAD" /note="for housekeeping promoter transcripts only" /number=1 repeat_region 3167..3474 /rpt_family="Alu repetitive sequence" repeat_region 6093..6409 /rpt_family="Alu repetitive sequence" misc_signal 6551..6555 /gene="ALAD" /function="CACCC Box" /phenotype="Erythroid regulatory sequence" misc_signal 6558..6562 /gene="ALAD" /function="CACCC Box" /phenotype="Erythroid regulatory sequence" protein_bind 6593..6598 /gene="ALAD" /note="consensus (A/T)GATA(A/G). GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" protein_bind 6624..6629 /gene="ALAD" /note="consensus (A/T)GATA(A/G). GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" protein_bind 6723..6728 /gene="ALAD" /note="consensus (A/T)GATA(A/G). 
GATA-1 is also known as Eryf1, NF-E1, EF-1, EF-gamma-a and GF-1" /bound_moiety="GATA-1" /function="Erythroid transcription factor" mRNA join(6902..6968,10451..10563,11841..11891,12387..12483, 13076..13211,13332..13415,13515..13603,14180..14235, 14321..14408,14483..14569,14901..15030,15643..15798) /gene="ALAD" /note="erythroid-specific transcript" exon 6902..6968 /gene="ALAD" /note="untranslated, is the first exon of erythroid-specific transcripts only" /number=2 intron 6969..10375 /gene="ALAD" /note="erythroid-specific transcripts only" /number=2 repeat_region 7603..8237 /rpt_family="Alu repetitive sequence" repeat_region 8925..9240 /rpt_family="Alu repetitive sequence" repeat_region 9463..9750 /rpt_family="Alu repetitive sequence" repeat_region 9771..10070 /rpt_family="Alu repetitive sequence" exon 10376..10563 /gene="ALAD" /number=3 CDS join(10451..10563,11841..11891,12387..12483,13076..13211, 13332..13415,13515..13603,14180..14235,14321..14408, 14483..14569,14901..15030,15643..15704) /gene="ALAD" /EC_number="4.2.1.24" /codon_start=1 /product="delta-aminolevulinate dehydratase; porphobilinogen synthase" /db_xref="PID:g28580" /db_xref="SWISS-PROT:P13716" /translation="MQPQSVLHSGYFHPLLRAWQTATTTLNASNLIYPIFVTDVPDDI QPITSLPGVARYGVKRLEEMLRPLVEEGLRCVLIFGVPSRVPKDERGSAADSEESPAI EAIHLLRKTFPNLLVACDVCLCPYTSHGHCGLLSENGAFRAEESRQRLAEVALAYAKA GCQVVAPSDMMDGRVEAIKEALMAHGLGNRVSVMSYSAKFASCFYGPFRDAAKSSPAF GDRRCYQLPPGARGLALRAVDRDVREGADMLMVKPGMPYLDIVREVKDKHPDLPLAVY HVSGEFAMLWHGAQAGAFDLKAAVLEAMTAFRRAGADIIITYYTPQLLQWLKEE" intron 10564..11840 /gene="ALAD" /number=3 repeat_region 10893..11194 /rpt_family="Alu repetitive sequence" exon 11841..11891 /gene="ALAD" /number=4 intron 11892..12386 /gene="ALAD" /number=4 exon 12387..12483 /gene="ALAD" /number=5 intron 12484..13075 /gene="ALAD" /number=5 repeat_region 12508..12809 /rpt_family="Alu repetitive sequence" exon 13076..13211 /gene="ALAD" /number=6 intron 13212..13331 /gene="ALAD" /number=6 exon 13332..13415 /gene="ALAD" /number=7 intron 
13416..13514 /gene="ALAD" /number=7 exon 13515..13603 /gene="ALAD" /number=8 intron 13604..14179 /gene="ALAD" /number=8 repeat_region 13688..13973 /rpt_family="Alu repetitive sequence" exon 14180..14235 /gene="ALAD" /number=9 intron 14236..14320 /gene="ALAD" /number=9 exon 14321..14408 /gene="ALAD" /number=10 intron 14409..14482 /gene="ALAD" /number=10 exon 14483..14569 /gene="ALAD" /number=11 intron 14570..14900 /gene="ALAD" /number=11 exon 14901..15030 /gene="ALAD" /number=12 intron 15031..15642 /gene="ALAD" /number=12 exon 15643..15704 /gene="ALAD" /number=13 polyA_signal 15775..15780 /gene="ALAD" BASE COUNT 3865 a 4208 c 3891 g 3949 t ORIGIN 1 ctttttatat ttcaacagtt tctctactta tgtccttttt cgattacagg ggccaaccca 61 gaataccaca ttgcatctag tgaactcact ttttaagact gagatgaaat tcaaagaaca 121 tagaattaac catttttaaa tgagcagttc agtggcattt agaacagtca caatgttgtg 181 caaccactac ctctatctag tttcaaaaca ttttcctcag gccctggcaa tcagcagtct 241 gctttcttgt tcctttttta tgcttttcac agcatttagc acaatgccat atacctacct 301 agaagacact cagaatagtt tagttttaat tttttattga aacgaaatat actcataaaa 361 gacacaaata agtgtggagc ttaatgaatt ttcacgaatt aaacacagcc atgtaactca 421 cacccagacc aataaacatt aactgcatcc aaagaaacct ctttagtctc cccctgccgc 481 cttttttttt tttttttttt tttttttttt ttttctggag acagggtctt gctctgtcat 541 ccaggctgga gtggtacgat cacagcttac cgcagccttg acctcccagg ctcaagcaat 601 cctcccatgt cagcctccca agtagctggg actacaggtg tcaccaccat gcccagctac 661 tttttgtaat ttttgtagag actgtttcgc catgttgcca aggttggtct tgaactcctg 721 ggctcaagcg atccaccagc cttggcctcc caaagtgcta ggattacagg tgtgagccac 781 cacacccagc ctaatctccc tttcaatcct ccccatctcc ccagtggaaa tcactattct 841 gagtcctaac aacacagctt aattttgctt gattgtgtag ttaatataaa tgaaatcata 901 ttatatactc ttttgtgtct cacttctttt actcaacaat atgtttaaga tttattcata 961 ttattgcatt cacaatactg aaagattaag gtaaaatgtt tcaaaaggtg gaggctgggg 1021 ggagagtggg aacaggtctt tcagtcaaga agaagaaaag acatggggga aataagctac 1081 accaaacagc aacattttaa aatttccgtt ctatttgggt tcttagatgg ttctaattat 1141 ttttagttac 
caataacctc agggaaggaa tgagtgtcag gaaagaaatg tcgggcaggg 1201 aatctaggaa aaaacaaata ataattgccc ctatggagtg tggttattag caggcattgc 1261 agagagagcc cagtgtaaca tgggggaaat tatttaatct agctggccaa ttgttttccc 1321 aatccgtaaa gtgagaatag taacatcgtt gctgtccaaa attagacaaa tgaatgtaaa 1381 atggcatggt tactgtccct gactcttaat cgacaaaaca ttatttgctt tctatcccct 1441 tccttgttga atacagatca gaactccagt agctggaaac taagcagtgt catatttccc 1501 tagccttacg ccgctaggcc ccttaaattt aaaatataaa gtatacaaca ggtttggggt 1561 acagattttg agaaatttaa aaaaaaattt tttttgagac agagtctccc tctgtcaccc 1621 aggctggaat gcagtggcat gatctcagcc cattacaacc tccgcctccc gggttcaagc 1681 gattctcctg cctcagcctc ccgagtagct gggattacag gagcccgcca ccacgcccgg 1741 ctaatttttt gtatttttag tagacgacag ggtttcacca tgttggtcag gctggtcttg 1801 aactcctgac ctcaagagat ccgcccgcct tggcctccca aagtgctggg attacaggcg 1861 tgacgaccga accgggctga gaaataaaat ctgagaataa gtgtaatgta acccactagt 1921 gtctctacat ttgaaagtca acttttcaat aacccctgaa gaaccacctg aaaggcattg 1981 cttcgctagg gcgatgatgc ccgatcgcca ttttactcat ccaatcctac tggtttaccc 2041 aggccccata agcctcaaga tccctcttgc cctgcacatc tccaatctcc ataaagacct 2101 ttgatcggat ctatcattgt acctatcata ggtctgatgc ccctatcaag acttggagtt 2161 ttcctaaacg cccatgtctt tcacatcaca tctctcaact catagcattg ccgtaccctg 2221 agaaataaat gaagctggta caaattttct ctgaaaactt ggaagcggcg ctgaactcaa 2281 caactttagc ttagtccgca aaaatggact tagctaatac tgtgttaggc ataatggtgc 2341 tgaacggctt tgacatccaa ttattctttc accaagccat taaccagagc caagtttgcc 2401 ctaggctggg caaaggaatg ctgtgaagtg aaacctgggc tgtaaccgcc tccgaggccg 2461 gctcacaccc agggaccctc ccaagcggtt cctgcgcccc ctcgtggcct ccctcctgca 2521 gttcgctgca caagcgcagg gcccggagcg gggacgggcg ggccctccag gccctcgaca 2581 cacccaattg gccaggcctc gttgcgtgtg ggcggggccg cggctggagc ggaggagccc 2641 cccagaagac cgccttcagg gggcgtggcc tggactgtgg agggggcgga ccgtggggaa 2701 gctgcttccg ggtgagcccc ccccgctctt acgcggtctg tgggagaccg gagcgggaga 2761 cagcggtgac aggagcagcg gccgggagcc cttagggagg caggtgagcg ggggctggat 2821 gcggggagat agcgggcctg 
gagggcggct ccagggcgag gcggggactg tggcaccaag 2881 agtcctgcgt ccccagattg tgctgtgcgc ctgaagccct ggcggtgcag ccgtgcacgg 2941 agtcccgtgg agcgtttgtt tgggctggat tatcccgctg cagccgaggt tgggggcccg 3001 ggttcggggc tccgcacctc ttctctgggg ttcaccctct cggataccgc ctgggcagac 3061 tccatcgatg cagaacccga ggtgcgggtg gtggttggtg ggggggtagg gaggtgaaaa 3121 caataacagc cgttttgtac taagcgaagc gtgcattgac cattcattta ttttatttat 3181 ttatttattt ttgagacgga gtttcgctct tgttgcccag gctggagtgc aatggcgcga 3241 tctcagctca ctgcaacctc cgcctcccgg gttcaagcga ttctcctgcc tcagcctgcc 3301 aactagctgg gattacaggc gtgtgccacc acgcctgcta atttttgtat ttttagtaga 3361 gatggggttt ctccctgttg gtcaggctag tctcgaactc ccgagctcag gtgattcgcc 3421 cacctcggcc tcccaaagtg cagggattac aggcgtgagc caccacgccc ggcctgaccc 3481 attcatttct tacaacaacc cattgaggtc ggttctagtc cccatttact aaacgaagaa 3541 actgaggtca cacaactaga aagctacaga actaggattt gtacccacat ctatcagact 3601 ccagagccca catacttaac catcctgcta ccaacagcag ccaggcccag agtctggctc 3661 acccaaagct ctccacaagc ccaacctgct gggaaacaga tttgatatct atccttaaag 3721 tatggaggtg tcatggtggg cactgaggac tgggctgctg gagaccttcc cccagatctg 3781 ccgctgaccc actgcgtgat cctgggcctg tgtctacccc ttgagcccag tttcaccctt 3841 gtttcagtga tacaatcagc aaactttaag aagactctta gccaagatat tctaggattc 3901 tgggaatggc agcttctcct acgtctaaat ggctaaccca gaggcgatca ggctcttcaa 3961 agcccccagg gcaaggctgg ggtctcattc ttctctttaa tccctgtgtc tagcacttca 4021 gcaggcccat agtagatatt caagaaatgt gggttggctg aataaacact tcatctcttt 4081 tcagaactgc atgacagctg ctctcctggg gcctttgtgg ggctttggat ttctcagttc 4141 tggccacctt ctatctagct ggctctcctg actgcctcct ccctggccag tggcagagcc 4201 tgccagggct ggcagcaggt cctaaccctc tgtgcgaaga cccacagcct ctggaatcct 4261 cagcatagta caacttgcag gggtgggggg tggcacatga agctcatagc ccctcctggt 4321 agcagtgctc agggcccaac ttcagttact cccagtttgg ggatgttgcc ttccatggga 4381 ggagatactt tcatattaaa tctgaaacac ttcttatcct tcactgtccc ctcttccatg 4441 aaaccaccct tgactcagag acacagtccc tctgacagca cctggtccac ccctctactc 4501 tggcagcccg gtggtgatta ttttgttttt 
cctccaccac ataacctgct ggaaggcagg 4561 gactccactt cagaaatctt ggaaaaccct gtggcccaca agggtttgag ctcatattcc 4621 catctggtga ggtgcagaag gcacagagct gcgccaggac atctcgttcc aacccagccc 4681 cactcctaat agtaataaaa gtagctacat gtttactgtg cagctacttg gtgccagaca 4741 cctacctacc ttatccctat caataatctc cacagccttg caagttagag atatcatgcc 4801 tattttgtag attatgaaat gaacactgaa aggtcaatta ctttgcccaa aattacacag 4861 cagagcatag accaaacctg ggactgtctg accctgaaac atggatcttt ctgcaccgcc 4921 ctatactctg cgtcccagca aagctaacct gtctgatgtg gttgggcaga ggtgctggac 4981 ctcctggcgg ctcgctcctg tcttcagtgt ggtaatgtca caggcattta aattgtggtt 5041 tgcaagctcg cttgatttgg gagaagtgcc ctcccagtct gtgaaattct cccaaactct 5101 aagggccttt ttgcaaagca aaacccaccc tgcatttggc tctgccaata tgtgattcag 5161 ggggtgtttt cccactcaca gaatggagtg ggaaactcca cttattaagt acctcttaca 5221 atggaatgac tctatggtgt aggtatcagg gcactgaaat agaagccagg agctgcaggg 5281 tcagtttata gctctgtatg tcactcactg tgatctccaa aagtcccgtt gcctctttat 5341 accccatttt ccacattaaa gtgtaaggga ttaaacagga ttggcaattt tcaagctttt 5401 aatattttca gcactgacag ccgtttctca aatcaaatct gtctctgaat cttaaaattc 5461 aaaaacagat caaagtggac tgctgtagtg agactgttcc ctttactcta accctgcccc 5521 ggtctcaggc tcctaggcag tcctgcccca gggtttctgc taagcaggag ccagttgttc 5581 cctcagcctc ctaaaagcga tggggagttg aggccagtgg aatgcagctt tggattcaga 5641 agccagaaaa caacagagat tgtttgtggg aagtgagtca gagagattag acctctcttc 5701 aacatctgac tagactctag aatggcatta agcaagtcac tgaaccgctc agagcctctg 5761 tttccttatc actaaaagtt aggacaattc ttgcaatgaa atgagatact actaccaagt 5821 acttcagtgc tgtgcccagc acagaacgaa tacactcagt atgtttttct tttgtttctt 5881 ctgtctctcc agcacccaaa gcagaaacag tatgggcaga gagacgccta ctgaatcagg 5941 tgtttatcag tattatgaaa tacagtatat aaaatttagc atttggagtc cagaacaaat 6001 tcaaattctg gctcagtcac taagtagcta tgtggtcttg ggcaactggc ttacattttg 6061 gtaattctct caatatgcaa aaatattctc tcggccaggt gcggtggctc acgcctgtaa 6121 tcccagcact ttgggaggca gaggcgggcg gttcacttga agtcaggagt ttgagaccag 6181 cctggccaac atggcgaaac cctgtctcta ctaaaaatac 
aaaaattagc caagcatggt 6241 catgcatgcc tgtagtccca tctatttggg aggctgaggc aggagaatca cttgaaccca 6301 ggaggtggag gttgcagtga gccgagatcg aaccactgca ctccagcttg ggtgacagag 6361 ccagactctg tctcaaaaaa aaaaaaaaaa aaaagaaaag gaaaaaaaac tcctggcaga 6421 catgagagca aaattgagta cgaaatgggc agctgggacc tcatatctac tgaataattg 6481 tctgtttgga ggttgagaca tgactgggga ggcctggccc ctgcccctgt gttaggtgat 6541 ggggctggcc cacccagggg tgtggcagca gggctggccc ctgtgtgact tgtgataacc 6601 ccaccctacc aaggaggaag actggataaa atggcctgag atggctgaag gcagctaaag 6661 gggcctaaga gactatgggg agaagggcag ccagttccta ggcagccctg ccccactgtt 6721 tctgataagc aggaggcagg tgttccctca gcctcctgaa agcaatgagg agttgaggcc 6781 acgggaatgc agctttggat tccagaagcc aaggacctgt tacaaaggtg gttttctggg 6841 aagcaaatta gagggattag acctggcttc gacatctgac ttgactccaa cctaccaaag 6901 gatctcccag ttcacctggg agttaacaca ccagggttct agtccttttg ctcgctgtgt 6961 ggctttgggt aagtccttgc ccctctctgg accttgaggt cccattcata caaaaagcag 7021 ccggactcag gcctgccagc ccccattttt gtggctcttc cccgcccctc cccatccttt 7081 ggctatggga tagtctccca ggcacctcca aaaggaagtg tctaaaacag agctctccac 7141 ctcctctcca agcccagctc cccactgccc ccttcctgcc ctgtccctgt ctccactgcc 7201 ctttagatca ggcaggcagg caggcctaga gcatcctccc tgagacactg cctgttggct 7261 gccttaagac gccccttcct tctctgcccg cagcctcccc agcctctgca ttcctctgct 7321 tcctgactgc tctgctcagc ctgcaggatt gatcttccca aagccaggtg tcaaccaggc 7381 cactaaccct acccactgct cagaaatctt cccacagcct cagcctggcc ccctcttatc 7441 catcccctcc atacacactt gcctcagctc ctcaaaacag tccacagcca tcctccttcc 7501 ccacctttgc ccctttcttc ccttcctaag gaaatattgg cccccttctc tccaccaggc 7561 tgaacccact catcctgcaa aaccaagttc aacgccctgt cattttcttt tttttttttt 7621 tttttgagat ggagtctcac actgtcgccc aggctagagt gcaatgggtg cactctcggc 7681 tcactgcaac ctctgcctcc taggttcaat cgattcttct gcctcagcct ccagagggaa 7741 ctctgactac aggtgcacac caccacaccc agctaatttg tatttttatt tttatttcta 7801 tttatttatt tattgtgaga tggagtttct ctcttgttgc ccatgctgga gtgcaatggc 7861 acgatctcgg ctcattgcaa cctccgcctc ccaagttcaa aggattctcc 
tatctcagct 7921 tctgaagtag ctgggattac aggcgtgtgc caccacaccc ggctaatttt tttttatttt 7981 tagtagagat ggggtttcac catgttggtc aggctggtct caaactcctg acctcaagtg 8041 atccacccac ctcggcctcc caaagtgctg ggattacagg cgtgagccac cgtgcctggc 8101 ctaatttttg tatttttagt tagagacgga gtttcaccac attggccagc ctggtctcaa 8161 actccagacc tcaagtgatc cacccgcctc agcctcccaa agtgctgggg ttacaggtgt 8221 gagccactac gcctggccct gtcttcttct aatatgcctc ttctgacctc ctcactccca 8281 ttggagccat ggctcacttc tggtcaggct ccatgcactc aatcagccct gcctgccagg 8341 actgtagtgc ccacttgatg agaaatcacc tgtgaaaatg ccaggcatgg tctggcacac 8401 ggtaggcgct caagaaacaa atgttatgcc ccagcttctt ctgtcttgtg tccactaatt 8461 tggcatttgg agcatatttt tgtacttcac gtaattaaat atttactgtg tgccaaggtc 8521 agtgtgtgac agagcactgt aggggaggca aaaatgaacc aactctggac cttccctgca 8581 gaagctctgt ctactggcat gaggtaagac aggtccacag ataacttaca taagagagac 8641 tgcaatctgg acaagacagc ctcagtggga gagggttgga gaaccctgta gatttgggtg 8701 cggggaggct gtggggagag cacagccttt caggtaaagg gacaaggtta ggaaagcaag 8761 aaggtgtagc ttgtgctcgt ggagcccagg aaactccttg ggctagaaaa gatttggtgg 8821 cactctatac agtgtatgtg tctgtctgtc tatctaacat gaatggtact attttctcct 8881 attggtgtgc attgtaccta acaaaaaatc catgtcccaa gtacttttct cttttctttc 8941 tttctttttt tttttggaat gaagtcttgc tctgtcgccc aggctggagt gaagtggcgg 9001 tatctcggct tactgcaacc tcctcctcct ggattcaagc agttctcctg cctcagcctc 9061 cccagtagct gggattacag gcacgcacca ccacgcccag ctaatttttt ttgtattttt 9121 agtagaggtg gggtttaacc gtgttggcca ggctggtctc gaactcctga cctcaagtga 9181 tctgcccatc tcagcctccc aaagtgctga gattataggt gtgagccgcc atgcccggcc 9241 aacagtactt ttctctagag gtagaattgc aggggatttt tctctttttg cttatctgca 9301 cttttctaaa agttctgcca tgaacacggg cattttttgt ggaataaaca aagtttaaat 9361 gttggctaga tcagtgcttc tttatgcata aggaaggatt agttcttttt tttctatttc 9421 caatttattg attctttgat aaaatgcaat gcaataaaat tattattttt ttttagagac 9481 gtctccctct ctcgcccagg ctggagtgca gtggtgcaat catagcttac tgcaaccttg 9541 aactcctggg ctcaagcgat cctcttgctt tagcctcctg agtagctggg actacaggtg 
9601 tggaccacca cacctggctg atttttaaac tttttgtaga tacagggtct cgccatgttg 9661 cccaggctgg tctcgaactc ttgggctcaa gccatcctgc cacctcagcc tcccaaagtg 9721 ccgggattac aggcgtgagc caccatgccc agctagaaaa ttactttttt ggctaggcac 9781 cgtggctcac gcctgtaatc ccggcacttt gggaggctga ggcaggcaga ttgcttgagc 9841 ccaggagttc gagaccatcc tgggaagcat ggcaagacct ccatctctac aaaaaattcg 9901 aaaattagct ggatgttgtg gtgcacacct gcagtcccag ctacttggga ggctgagttg 9961 ggagaaacag ttgagcccgg gaggtcaagg ctgcagtgag tcgagattgc accactgcac 10021 tccagcctgg gcgacagaga ccctgtgtga aaaaaaaaaa aagaagagaa ttttttttaa 10081 acagtcattg cttgctcaga tgtttacttt aaaagataat aatgaacaag aagcagtcac 10141 ataaaataca agcccaaatt ttatatcatt agattctgat tgtcatgaaa gtttctaaag 10201 acttactttc atttctcaac ttaccttgtt gaccagcagg gattggtgaa ccaggctgtg 10261 agtagcattg ggctagagag aggggaggca ggaatctaga agagctgttt tccagatgtg 10321 accatctcct gaggacaggg accatgtcct atgtgccacc catcaccccc cacagacaga 10381 gcctgcagcc aatgccccag gagccctcgg ttccaaccaa ctgatgcccc tgtgcccact 10441 ggcccacgcc atgcagcccc agtccgttct gcacagcggc tacttccacc cactacttcg 10501 ggcctggcag acagccacca ccaccctcaa tgcctccaac ctcatctacc ccatctttgt 10561 cacgtgagtc tccaagaatg ggccaggcct ctgctctgct ggttggggtt ggggttgggg 10621 agggagtgtt gactggagcg ggcatcagta tggctggggg tggcaaagtg agctgtcagc 10681 ttgaaattca aggcactgga agcaggctac ttggattaag gacaggaatc ttaggaacaa 10741 aacaaacttt gaaagaactc attcatccca tttggaaaat tagaagaata acccttgcct 10801 gccatcctga gctcttgcag taagacagaa gctgagaagg tgctctgtac attgtaaagt 10861 gctatgtacc tgtaagagat ggcagtcatt gaggctgggc acggtggctc acgcctgtaa 10921 tcccagcact ttgggaggct gaggcaggcg gatcacgagg tcaggagatc gagaccatcc 10981 tggctaatat ggtgaaaccc tgtctctact aaaaacacaa agaaattagc caggcgtggt 11041 ggcgggtgcc tgtagtccca gctacttggg aggctgaggc aggagaatgg cgtgaacccg 11101 ggaggcggag cttgcagtga gccgagattg caccacttca ctccagcctg ggcgacagag 11161 ccagactcca tctcaaaaaa aaaaaaaaaa aaaagagatg gcaatcgtga ttgttaataa 11221 taatgcagac atttactgag tacttactat ctaccaggta ctatgctaag 
cacctacaca 11281 cattatctca ttcaattctg agagcatttg tatgaagaag gagtagctat cctctagaac 11341 atcagctcca tgagggcagg gatgtttgtc tattttgttc actgttgtat catcagggcc 11401 tagaacagta cttggcacat aataagtact caataaatat ttgttgaatg aatgaattaa 11461 ccacgcatga tatagatgaa ggcctaaggc tcaaagagat gatagaactt ggccacggtc 11521 acccaggcag taagtggctg ggatagaaag caaggacctg ccaaattcag agtccaagtt 11581 cttaaccact taattccttc ctgtaattac cgttctttta gtacagttgc tagtgttgtc 11641 actgttattc ttgttgttcc tattattatt tcaggccctg ggcttggcca ggcagggaag 11701 ccagacactg gatcccatcc tcctcccacc atctccactt ccatatttct ttcctgcttc 11761 ccaaccatcc ctctcagtcg cccccgcacc actggccctt cccacagcta ccaatccata 11821 tcccaccccc gctcttgcag ggatgttcct gatgacatac agcctatcac cagcctccca 11881 ggagtggcca ggtaggagac gtggagttgg ggggccagcg ggtggtggag ggagagattc 11941 cacaggtgga agtgctggga ggcagaagca gacctaggaa gtagaagatg cggacagaca 12001 gacattagct cagtagagga aagggtttcc ccggggccag agctgttcca cagtggaagg 12061 ggcagcccca taaagtaaag agctacccat cacccgagac gtcgtggcag aggctgttgc 12121 agaagggagc tgaactgcag atgggagttc aaaaagaggg cctcgaagga gccttccaca 12181 gccgaattcc ggagctctgc tactcagggc ctcagtcttc cctcctattt agtggatgca 12241 tccctgcccc ttctgtcctg ggggcttgag ccctcctggt gccatatgca gcttggtttc 12301 taacagaggc acacagtgtg gtggggtccg gaggaccgtt gcctgggacc tgccttcctt 12361 caacccctct acccacaccc acacaggtat ggtgtgaagc ggctggaaga gatgctgagg 12421 cccttggtgg aagagggcct acgctgtgtc ttgatctttg gcgtccccag cagagttccc 12481 aaggtgaaga atcaaaggaa gggctaagaa gggaggttgc ctcacgcccg taatcccagc 12541 actttgggag gccaaagtgg gtggatcact tgagcccagg attttgagac cagcctggac 12601 aacatggcaa aacccatctc tacaaaaaat acaaaagtta gctgggtgtg ggggtatgtg 12661 cctgtagtcc cagctactcg ggaggtggag aggtgggagg attgcttgag cccagaaagt 12721 cgaggctgca gtgagccaaa atcgcgccag tgcactctag cctgggtgac agagcaagac 12781 cctgtctcca atacaaacag aaaaaggaag ggaggttggg caaaggtgga ctgagggtcc 12841 acactgactg caccctcact cccacattgt gctggccctg gggccacagg tgaatggacg 12901 tggtctttgc ccttaagtca gcacccatgt 
agggtcggtc ctctgtgctt ccttatccag 12961 gggctgtgat gatgaaggaa ggagaaggcc agggctatgc tctgtgatgg ctgtcatcct 13021 gccttccaaa gctacatgta atagacacac tgctttgtcc ctcccctgcc cctaggacga 13081 gcggggttcc gcagctgact ccgaggagtc cccagctatt gaggcaatcc atctgttgag 13141 gaagaccttc cccaacctcc tggtggcctg tgatgtctgc ctgtgtccct acacctccca 13201 tggtcactgc ggtgagttcc ctccctccca ccagccctgc tgccacccac actcctactg 13261 cccacttctc aacagggtgg ggacagccag ggcccaaggt gctccccaaa acccagtcat 13321 ctgtcctgaa gggctcctga gtgaaaacgg agcattccgg gctgaggaga gccgccagcg 13381 gctggctgag gtggcattgg cgtatgccaa ggcaggtgag tgaaccacca gcagggatgg 13441 gcacctctgg gtcaggaggt ggcagagtgg ctaggagggc cccagagttc tgaaggccac 13501 cctctgcccc ccaggatgtc aggtggtagc cccgtcggac atgatggatg gacgcgtgga 13561 agccatcaaa gaggccctga tggcacatgg acttggcaac agggtaaggg cagggaatgc 13621 agcacagggc tggcaggaga tagtctgcac cagccctgcc cccgtgtctg ctaagaatca 13681 cagaactgcc gggcgtgttg gctcacacct gtagtcccag cactttggga ggctgaggca 13741 ggtagatcac ttgaggtcag gggttcaaga ccagcctggc caacatggtg aaaccccatc 13801 tctactaaaa acacaaaaat tagctgggcg tggtggcagg cgcctgcaat cccagctact 13861 ggggaggctg aggcaggaga atcgcttgaa cccacgaggc agtgagctga gatcatgcca 13921 ctgcacttca gcctggatga cagagctaga ctccatctca aaaaaaaaaa gaatcacaga 13981 actgaagaca gtgctggatg aggctttggg gaaccattta aacctctggg cctctgcagg 14041 gaaatcaagc ccagcactcc aacaggacca gaacacaggc agtctccttc ccagcctagg 14101 ttctttctct ccctgccaca tcaccctggg atacctggca agggccgaat aagccaagac 14161 ctccattgtc tccccatagg tatcggtgat gagctacagt gccaaatttg cttcctgttt 14221 ctatggccct ttccggtgag caggggtggg caggggtctg ctgtgaatcc ctgccctttg 14281 gcccaaagct ggagcccacc ctgatgactc tgctttgcag ggatgcagct aagtcaagcc 14341 cagcttttgg ggaccgccgc tgctaccagc tgccccctgg agcacgaggc ctggctctcc 14401 gagctgtggt gagtgactag gacttgagcc ccaccctcag ccccctccta ggcaccaccc 14461 acattatacc ctcatccctt aggaccggga tgtacgggaa ggagctgaca tgctcatggt 14521 gaagccggga atgccctacc tggacatcgt gcgggaggta aaggacaagg tgagcacagg 14581 tacgaggcaa 
agggggctca gggggctggg acagagtttt ccacagactc tggaatctca 14641 gagttggaag cagtttgccc ttaagcatgc atcctctcct ccccttccct gcccaggaac 14701 catcgtggcc ttctatgtcg gggcttgcac gagcctcaaa cagccctgct ttaacagttc 14761 aagagtgggc caggctgcca gccgcagtaa cccaggacac ggggctcaag atggtcacag 14821 attgagcagg ggggaaggga cgcttccaga gccacatcca ccctccattt cagcctgtct 14881 ccctgtctgc ttccctgcag caccctgacc tccctctcgc cgtgtaccac gtctctggag 14941 agtttgccat gctgtggcat ggagcccagg ccggggcatt tgatctcaag gctgccgtac 15001 tggaggccat gactgccttc cgcagagcag gtaggcaggc aagggtgggg tgttttgacc 15061 tgcgccacag ggactgataa gcactctgcc tagatcgggg aacgacgtcc tgagagcttg 15121 ggatcttatt ccgggaatta ctagtgatct aaacagacac acactgagga agagatatgg 15181 aactgcagca tagaacacgg cccggtgaag caagcagagc ccttcatttt tggttgtgag 15241 aacgtggcaa gccacttctc tgaacctcag tgtcctcacc cataactgga taactgggga 15301 taagatacct ggtgcgtggt tgtcctgagg attaaatgaa gtaatatcac tccataaagg 15361 ggactcattt tgttagaatt gcacaccagc atgggaagga acttgcctct tatatttcct 15421 tcactgtgca ttttattctt tggtaaactg aggccccaaa agaggaaatg acttgcccaa 15481 gaaatagagt ttcccaaagc tgggctccgt ctcatgtggt gtgcccacag gctgtgcttc 15541 ttcatggtag ccttcttccc cgcctggcct tcccatcgca gaaggtgtgc tcagagctga 15601 tcagcgtccc cccagcaact ttctgcatct ctcccaacac aggtgctgac atcatcatca 15661 cctactacac accgcagctg ctgcagtggc tgaaggagga atgatggaga cagtgccagg 15721 cccaagaact agaactttaa aacgttcccg gggcctcaga caagtgaaaa ccaaagtaaa 15781 tgctgctttt agaactgtgc cctcatgccc tcttcctgct cacatgctag cggggcccag 15841 cagccctggg tggttttgcc agcatgctaa ctcttgtaac tcgcagctgc atcctatgag 15901 ctctcccaag ctt // LOCUS HSALDCG 5198 bp DNA PRI 17-FEB-1997 DEFINITION Human aldolase C gene for fructose-1,6-bisphosphate aldolase. ACCESSION X07292 M84921 NID g28600 KEYWORDS aldolase C; fructose-1,6-bisphosphate aldolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 5192) AUTHORS Salvatore,F. TITLE Direct Submission JOURNAL Submitted (06-APR-1988) Salvatore F., Ist. Scienze Biochimiche, II Facolta di Med. e Chirurgia,, Via S. Pansini, 5, 80131, Napoli, Italy REFERENCE 2 (bases 1 to 5192) AUTHORS Buono,P., Paolella,G., Mancini,F.P., Izzo,P. and Salvatore,F. TITLE The complete nucleotide sequence of the gene coding for the human aldolase C JOURNAL Nucleic Acids Res. 16 (10), 4733 (1988) MEDLINE 88247784 REFERENCE 3 (bases 1 to 5198) AUTHORS Salvatore,F. TITLE Direct Submission JOURNAL Submitted (13-JUL-1988) to the EMBL/GenBank/DDBJ databases REFERENCE 4 (bases 1 to 5192) AUTHORS Salvatore,F. TITLE Direct Submission JOURNAL Submitted (25-APR-1990) Salvatore F., Ist. Scienze Biochimiche, II Facolta di Med. e Chirurgia,, Via S. Pansini, 5, 80131, Napoli, Italy REFERENCE 5 (bases 1 to 5192) AUTHORS Buono,P., Mancini,F.P., Izzo,P. and Salvatore,F. TITLE Characterization of the transcription-initiation site and of the promoter region within the 5' flanking region of the human aldolase C gene JOURNAL Eur. J. Biochem. 192 (3), 805-811 (1990) MEDLINE 91006178 COMMENT Data kindly reviewed (13-JUL-1988) by Salvatore F. 
FEATURES Location/Qualifiers source 1..5198 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda EMBL3" /clone="lambda C1" /chromosome="17" misc_feature 1..1243 /note="5'-flanking region" old_sequence 25..27 /citation=[1] /replace="ac" old_sequence 34..36 /citation=[1] /replace="gc" old_sequence 48..49 /citation=[1] /replace="gct" old_sequence 63..64 /citation=[1] /replace="tcg" old_sequence 68..69 /citation=[1] /replace="tct" old_sequence 75..76 /citation=[1] /replace="gcc" old_sequence 84..88 /citation=[1] /replace="tcaagcg" old_sequence 92..94 /citation=[1] /replace="acg" old_sequence 97..107 /citation=[1] /replace="gacggacg" old_sequence 263..264 /citation=[1] /replace="tgg" old_sequence 357..359 /citation=[1] /replace="at" promoter 453..457 /note="CCAAT box [4]" promoter 491..520 /note="TATA-like sequence [4]" old_sequence 657..660 /citation=[1] /replace="gacgt" old_sequence 1047..1049 /citation=[1] /replace="tg" old_sequence 1096..1098 /citation=[1] /replace="cg" misc_feature 1130..1135 /note="Sp1 binding site [4]" exon 1189..1282 /note="exon 1 [4] (start was 1244 in [1])" /number=1 old_sequence 1205..1207 /citation=[1] /replace="ag" intron 1283..2516 /number=1 exon 2517..2640 /number=2 CDS join(2529..2640,2727..2938,3055..3109,3205..3365, 3483..3566,3819..3993,4126..4325,4434..4529) /codon_start=1 /product="aldolase C" /db_xref="PID:g312137" /db_xref="SWISS-PROT:P09972" /translation="MPHSYPALSAEQKKELSDIALRIVAPGKGILAADESVGSMAKRL SQIGVENTEENRRLYRQVLFSADDRVKKCIGGVIFFHETLYQKDDNGVPFVRTIQDKG IVVGIKVDKGVVPLAGTDGETTTQGLDGLSERCAQYKKDGADFAKWRCVLKISERTPS ALAILENANVLARYASICQQNGIVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALS DHHVYLEGTLLKPNMVTPGHACPIKYTPEEIAMATVTALRRTVPPAVPGVTFLSGGQS EEEASFNLNAINRCPLPRPWALTFSYGRALQASAVNAWRGQRDNAGAATEEFIKRAEV NGLAAQGKYEGSGEDGGAAAQSLYIANHAY" intron 2641..2726 /number=2 exon 2727..2938 /number=3 intron 2939..3054 /number=3 old_sequence 3017..3019 /citation=[1] /replace="tc" old_sequence 3035..3037 /citation=[1] /replace="tc" exon 
3055..3109 /number=4 intron 3110..3204 /number=4 exon 3205..3365 /number=5 intron 3366..3482 /number=5 exon 3483..3566 /number=6 intron 3567..3818 /number=6 old_sequence 3653..3659 /citation=[1] /replace="gcaat" old_sequence 3698..3700 /citation=[1] /replace="tg" old_sequence 3786..3788 /citation=[1] /replace="ac" old_sequence 3808..3810 /citation=[1] /replace="tc" exon 3819..3993 /number=7 intron 3994..4125 /number=7 exon 4126..4325 /number=8 intron 4326..4433 /number=8 exon 4434..>4529 /number=9 old_sequence 4638..4640 /citation=[1] /replace="ac" old_sequence 4687..4689 /citation=[1] /replace="ac" misc_feature 4916..4921 /note="pot. polyA signal" old_sequence 5055..5056 /citation=[1] /replace="cgt" old_sequence 5122..5123 /citation=[1] /replace="act" BASE COUNT 1093 a 1480 c 1358 g 1267 t ORIGIN 1 tctagagaaa tatcctagag gagaaccttc ggcgcctctg acaaggagtt agaaaaacta 61 gatgacattg ttcagcatat ttataaggta taggcaggag ggaggtgata tcactaatga 121 acctggtaca aggcctggcc ccaggaggta ctaatgagca cagatctggt tcccagcctg 181 tgctgaagag ctgggcctga atcactattg caggcttctc tgtggtgtct gttttctccc 241 ctctaattca gcggctgcct cctggtggga gccctgggcc ttaagtctgt ggtcctgagc 301 tgcagttttc tgcctctctt cccagccctg ctctctattc cagaggtggt gaggggattg 361 caaagaacta cagggattgc tggaatttct gagctaagaa actgaaagcc agaatctgct 421 tcacctcttt ttacctgcaa taccccctta ccccaatacc aagaccaact ggcatagagc 481 caactgagat aaatgctatt taaataaagt gtatttaatg aatttctcca agcttacgga 541 atcttacttt ttgggtgaga ggggggcgga tgatgaagtc agggaagaag caggcttgtg 601 cctcctgcct gggcaccctc catagatccc tctccacctc ctctgcatcc aaagcaggct 661 gcctgccctt cctcccttcc tcgtactttc accaaaaact ctgggcactg gacgtacttg 721 ttccctcgtc cgcctggagg aaacagaaag cttcattctc tactttcccc agcctcgggg 781 tataccctgg caaatatggg gtgaggacag agaaggctca ggttcattac ttcctgccct 841 gaacttgggc accttgaaga tgcttctgcc cccacctact caacctgttg ttactgttct 901 ggtgcctcct cttcaccaga ctaagcttaa tccccagcat ccagcaagtg cccggcacgt 961 ggtaaaggga ttgtggcaga aatgaacgca aggtcatggc caggctcccc gaggtgggag 
1021 ggcagtccct aacagccacg gatgcctggg cgctcactcc acctcgccgc cgcctctgag 1081 ggcgtggtct tgccccggac accagtcctg gggagggggt gtggtcaggg cgggcatgca 1141 gccacgcccc cggaggagtc acgtagctct gcgacatccg cagcctcatt taccagaggg 1201 agccagggct gcagcctcat ctgtttgcgg atcagaaccc gagctgtgct tgtggctgcg 1261 gctgctaact ggctgcgcac aggtaaggcc aggcaaggcg ggagccactc aacatctccg 1321 cttcatttcc ttggctcttc ccctgccatc ctcgtcctgc tacctgggac tccgggatgt 1381 gccctttcga ccctttccta acatctttgc tcctttccgc gtacttgaaa ccccatggct 1441 caacctcttc tgttctatcc ctcttcagct ctcccagctg aacctcagct ttccaatccc 1501 aaatcctcct ctacttgttg gatttttcct tggagtctgt ctctcctacc cagagcattt 1561 ccttctcagc ctgctccctc tcctcctagg ctaggtcctc tctccagttc tcccgccttc 1621 tctgcccccc ggtctaggtc tctcccgctg actggttagc ctgcatcacc actagctccc 1681 ctcagtctca tctctctctc aggctctcta ctcctttcag ccttggtcct ctgccctccc 1741 gtctggtgtc tattgtggga attaatcttc ctattgctat cctcctgcca aacagctaag 1801 tgcttcctgg cacagagata cccggagtgt cactagccct atctccacac actaacacag 1861 ctgaccgctg ggccctcttt cctacatcac tgcacgactg gggccaacct ctggtactgg 1921 gtgggaatag gcaataggca taggcaggca ggtttgagta cagaaaagaa gctgcaggag 1981 cctgtgactg gtatttgtgc cactcctact ccctacctgt tcttccaacc ttttcctcta 2041 gaagctgaga gaagagggtg gcaataagta cttttgcctc attctgaagc cttggaagta 2101 agtacacttt ccctaggggt cctgtggagg atgagaaaag ggaagctgga aaggccagga 2161 cttttgccta ctcaacaagg gaccaagttc agtgaaagaa gggttggcat ccttgattgg 2221 gcagcagatt tatcagaaga gctgtggctt cagggctgct cacctcccca cccccaccct 2281 gcatctttcc ccagggctgg gaaggatgcc taccagggac aaaaggagat gtgggaactg 2341 gagccctaag cttgctagct gtcagaagga cctggtgcca cttgatgccc aggactcatg 2401 ccaaggactg ctgccctgtt cccagcccct tgcttgatgg ggaggccatt tggcccatct 2461 ggcaggagag gcagcagagg gtgaggtctg gcttttttat cttgtctcca ctccagggag 2521 ctgtcaccat gcctcactcg tacccagccc tttctgctga gcagaagaag gagttgtctg 2581 acattgccct gcggattgta gccccgggca aaggcattct ggctgcggat gagtctgtag 2641 gtaagtggac atctgtagcc aggtagggta caggtggcta ggggaccctg gggatgttct 2701 
cactgcctct ctttgtttgc ccctaggcag catggccaag cggctgagcc aaattggggt 2761 ggaaaacaca gaggagaacc gccggctgta ccgccaggtc ctgttcagtg ctgatgaccg 2821 tgtgaaaaag tgcattggag gcgtcatttt cttccatgag accctctacc agaaagatga 2881 taatggtgtt cccttcgtcc gaaccatcca ggataagggc atcgtcgtgg gcatcaaggt 2941 gcagcccctg gccctgctct gaatggaagc tgggtgtgaa aataagcttg tgtaggaggg 3001 gtagcaagga gaatcctgcc tggattcaac cctctgcttg tacttcctct acaggttgac 3061 aagggtgtgg tgcctctagc tgggactgat ggagaaacca ccactcaagg tataggatgg 3121 gtgggcttga ggaccaaaga ggtgttagat agttgatgct ggtaaaagag gggcagagta 3181 atgaggttgg cactgtgctt gcagggctgg atgggctctc agaacgctgt gcccaataca 3241 agaaggatgg tgctgacttt gccaagtggc gctgtgtgct gaaaatcagt gagcgtacac 3301 cctctgcact tgccattctg gagaacgcca acgtgctggc ccgttatgcc agtatctgcc 3361 agcaggtgtg tgtgttggga gggtggtgag ctaggtgccc tgtatgcctg gtggggagag 3421 agtacaaggc tttcttcatc tcccctactg cccctcccaa gcatctctgc tcttgcctgc 3481 agaatggcat tgtgcctatt gtggaacctg aaatattgcc tgatggagac cacgacctca 3541 aacgttgtca gtatgttaca gagaaggtga gtccacacct gggcacacaa acatactgca 3601 gggacagctc ggcaggagtg tctgttcccc agaaccccca gcttagatcc aggcacactt 3661 tcccctagca tttttcactt catcccgcgc acaggcctgg tgatctcgag cctgtactag 3721 ccctacagtc tgtgcccatc tacccctaca tagggagcat cgagcagtaa ccagtggggg 3781 cccagaccct tagtaaacct cctctaatcc ccacccaggt cttggctgct gtgtacaagg 3841 ccctgagtga ccatcatgta tacctggagg ggaccctgct caagcccaac atggtgaccc 3901 cgggccatgc ctgtcccatc aagtataccc cagaggagat tgccatggca actgtcactg 3961 ccctgcgtcg cactgtgccc ccagctgtcc caggtactac ccagctccct aacctgctcc 4021 tatccctaag gcccatcttc aggtccttct tgtggcctta gggggttccc tatcctggaa 4081 aaattgggag tgaccagtca gtttgtcttc tctcctccac actaggagtg accttcctgt 4141 ctgggggtca gagcgaagaa gaggcatcat tcaacctcaa tgccatcaac cgctgccccc 4201 ttccccgacc ctgggcgctt accttctcct atgggcgtgc cctgcaagcc tctgcagtca 4261 atgcctggcg agggcaacgg gacaatgctg gggctgccac tgaggagttc atcaagcggg 4321 ctgaggttgg gagctacagg tggtggtggg tgggggcagc accagaggct atagctgggc 4381 agggcttggc 
agctgtgggc tggctactgc ttactccagg ctcccttttg caggtgaatg 4441 ggcttgcagc ccagggcaag tatgaaggca gtggagaaga tggtggagca gcagcacagt 4501 cactctacat tgccaaccat gcctactgag tatccactcc ataccacagc ccttggccca 4561 gccatctgca cccacttttg cttgtagtca tggccagggc caaatagcta tgcaggagca 4621 gagatgcctt cactggcacc aacttgtctt tcctttctct cttcccttcc cctctctcat 4681 tgctgcacct gagaccatag gatgggagga tagggagccc ctcatgactg agggcagaag 4741 aaattgctag aagtcagaac aggatggctg ggtctccccc tacctcttcc agctcccaca 4801 attttcccat gatgaggtag cttctccctg ggctctcctt cttgcctgcc ctgtctcctg 4861 ggatcagagg gtagtacaga agccctgact catgccttga gtacatacca tacagcaaat 4921 aaatggtagc aaaacattct actttgcctg tctgttttac acatcaaatt ccagcctccc 4981 agtttctgat ctctgctaat tctatctctg ggccctctga ctctggaggt ggagagggtg 5041 ggatttgagt cttactgggc ttcaagttat ggaggaaggg caatgcagtc accatcccca 5101 gctcaggctc ttgctctctt gatgtccaag tctggagtgg ggcaatgagg aagactgcaa 5161 gtcttctagg gactgccaca tcagtcgatg ggctgcag // LOCUS HSALDOA 7530 bp DNA PRI 20-MAY-1992 DEFINITION Human aldolase A gene (EC 4.1.2.13). ACCESSION X12447 NID g28613 KEYWORDS aldolase; aldolase A; alternative splicing; Alu repetitive sequence; fructose-1,6-bisphosphate aldolase; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7530) AUTHORS Izzo,P., Costanzo,P., Lupo,A., Rippa,E., Paolella,G. and Salvatore,F. TITLE Human aldolase A gene. Structural organization and tissue-specific expression by multiple promoters and alternate mRNA processing JOURNAL Eur. J. Biochem. 174 (4), 569-578 (1988) MEDLINE 88271327 COMMENT In liver, skeletal muscle and fibroblast primary transcripts are found that differ in their 5'-ends: exon 1a and 2a are found exclusively in liver, exon 1b is found only in skeletal muscle and exons 1c and 1d are found in fibroblasts, hepatoma cells and skeletal muscle. Exons 2b - 9 are found in all mRNAs. 
FEATURES Location/Qualifiers source 1..7530 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda Charon 28" /clone="lambda A3." promoter 1084..1088 /note="TATA-box (exon 1a)" precursor_RNA 1153..7330 /note="primary transcript found in liver" mRNA 1153..1216 /note="Exon 1a" intron 1217..1391 /note="Intron Ia" mRNA 1392..1473 /note="Exon 2a" intron 1474..4160 /note="Intron IIa" promoter 1658..1660 /note="CAAT-box (exon 1b)" promoter 1684..1688 /note="TATA-box (exon 1b)" mRNA 1710..1754 /note="Exon 1b" precursor_RNA 1710..7330 /note="primary transcript found in muscle" intron 1755..4160 /note="Intron Ib" promoter 2675..2680 /note="TATA-box (exon 1c)" mRNA 2699..2866 /note="Exon 1c" precursor_RNA 2699..7330 /note="primary transcript of longer fibroblast RNA" promoter 2734..2738 /note="TATA-box (exon 1d)" mRNA 2760..2866 /note="Exon 1d" precursor_RNA 2760..7330 /note="primary transcript of shorter fibroblast RNA" intron 2867..4160 /note="Intron Ic" mRNA 4160..4293 /note="Exon 2b found in all mRNAs" CDS join(4182..4293,4378..4589,5574..5628,5743..5903, 6228..6311,6420..6594,6748..6947,7035..7130) /codon_start=1 /product="aldolase A" /db_xref="PID:g28614" /db_xref="SWISS-PROT:P04075" /translation="MPYQYPALTPEQKKELSDIAHRIVAPGKGILAADESTGSIAKRL QSIGTENTEENRRFYRQLLLTADDRVNPGIGGVILFHETLYQKADDGRPFPQVIKSKG GVVGIKVDKGVVPLAGTNGETTTQGLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPS ALAIMENANVLARYASICQQNGIVPIVEPEILPDGAHDLKRCQYVTEKVLAAVYKALS DHHIYLEGTLLNPNMVTPGHACTQKFSHEEIAMATVTALRRTVPPAVTGITFLSGGQS EEESSINLNAINKCPLLKPWALTFSYGRALQASALKAWGGKKENLKAAQEEYVKRALA NSLACQGKYTPSGQAGAAASESLFVSNHAY" intron 4294..4377 /note="Intron IIb found in all mRNAs" mRNA 4378..4589 /note="Exon 3" intron 4590..5573 /note="Intron III" repeat_region 4943..4960 /note="direct repeat 1" misc_feature 4944..5319 /note="alu repetitive sequence" repeat_region 5301..5319 /note="direct repeat 1" mRNA 5574..5628 /note="Exon 4" intron 5629..5742 /note="Intron IV" mRNA 5743..5903 /note="Exon 5" intron 5904..6227 
/note="Intron V" mRNA 6228..6311 /note="Exon 6" intron 6312..6419 /note="Intron VI" mRNA 6420..6594 /note="Exon 7" intron 6595..6747 /note="Intron VII" mRNA 6748..6947 /note="Exon 8" intron 6948..7034 /note="Intron VIII" mRNA 7035..7330 /note="Exon 9" misc_feature 7308..7313 /note="polyA signal" polyA_site 7330 /note="polyA site" BASE COUNT 1543 a 2216 c 2141 g 1630 t ORIGIN 1 cacgcctgta gttcagctac tcagaaggct aggcagaact acttgaaccc agagtggagg 61 ttgcagtgag ccaagatcgc accattgcac tccaccctgg gcaacagagt gagaccctgt 121 ctcaaaaaaa aaaaaaaatt aaaaataggt aaattccatt ttcaaagata cctgtaaaga 181 tttttaaaat cattcatgaa tgcggtctgt tcgttgcaca gagtagatgc tcaaaaatgg 241 tgaatgagac cctctatttt ggtctcatgc tgaagaagtc cataaagccc acaagtcatt 301 ttcatgatgg acagaaaaat gtgtgtgctt tctctgtcta ctgcctcaac tgcacagacc 361 ccgggactgt agcagaacca tcttttgagc ttgacaccgg gaggcccaaa ttctagcacg 421 ggacctaggg ccagttgctc tctggtcctc agtctcctca cccataaaat gggaaggaga 481 gaaccctgaa tcattgcttc tagcttctga actcagttgt tcagaacaag gactcactgc 541 tgatttttca acagcacagg gaattgcact gttcctggaa tgatggacag taccctctgt 601 tccactgggc aagtgagatt tcccaggcct ctctgttccc ttctcccctt agagagcaac 661 agacgtgtgg ccccaccacc tccctcaacc ctctctctcc tccctcagga ctgggcacct 721 cgctgccccc gctgctgccc actctgcgac tgtgcctgta cgtgccagct ccccgactgc 781 cgagacctca actgtctctg cttcgagatc aagctccata ggacccaggc ccctgcctct 841 ggggagcggc cagcccccag gcccatgtgc cctcctccct gaagagcctt tccccacgcc 901 actggaacca cagatggcct gccgagcacc caggcctggg aactggaatg gcagcgcagg 961 gcctggctcc tgcagggcag gactcttggc cggctggacg gcagctcctc tggagggcca 1021 gaaaagagag ggctagtgct cggcagtgcc ctgcttccct tcccctccac acgtcaacga 1081 ttctatttga agttgggcag ggggtggcgc tgctcaccac acacaagtgt tataggagga 1141 gtctggcccc gtagtaccgg gtacgcaggg gtgcctcaac cacactccgt ccacggactc 1201 tccgttattt taggaggtag atgtagtgcc agtatctact ctccttctta aaaaaaacca 1261 gggctccaga gaatcagaac agccaccatc accgcaggga gtcaagggag gagggagatt 1321 agagaaggag ccagggaggg tggcagggag gccacgtgat ccgagtcccc tcaccccttt 1381 
ccttcccaca ggtccctggc caaagattta tttctcttga caaccaaggg cctccgtctg 1441 gatttccaag gaagaatttc ctctgaagca ccggtgagtg ggcaggggct ctttgtcccc 1501 aatcaatcag ggccgaccca agtcttcctc ccccttcccc atgccgggcc ccacgatagt 1561 gtgaatgtca ggggcttcag gtttccctaa atataggtcc ctgccagagg atccgtggcg 1621 ggaaaagggc aggggtcatt agagaagatc ggggacacat gtggggctgg gcaggagctg 1681 ccttataacc acccgggaac ccctagctca ctcgctgctg accaggctct gccggctcct 1741 tgccctcgcc gcaggtgggc cccttgcagg accggccggg tgggatgggt tggaattggg 1801 ccaacaggtc cagatgggtc caggtaggag gggagatttg gacgatagga gcagggggct 1861 cagcatctgg gaggcagatc agttcgggga cggattttct tttggagaag gaagtcaggc 1921 tcaaggaaga cgtttggcag gaactgtgac cccgcatgcc agaggccgag cagcggccgt 1981 gcatagccgc gcattctggt tttctgtggc gcagaggact accagcctgg ctgcggcggc 2041 ccggcggaga ccgccaccat gcgcgaccca gcccgcctgc cagcctggaa ctcggatggg 2101 gaggtctgcc tccgcgcgcc gctagtttcc gaccgccttc tcgctcccct gctgtcctct 2161 cgatgccctt tcctccgcct cccttgacgc ctgggccagt gacaggtgtc gtccgcgccg 2221 attcagcccg cgggcgaggc agctaacgca cgactgcgcg atgtggcccc tatggtgaca 2281 cgctgcagcc gcgaagaccg gaagctgggg ccccgggccg cgcgcgctgg gcctgggagg 2341 cgaaactcag cttccttcgt ttccgacttt tccatccgcg tcctccactt ccccgttccg 2401 ccctccccca ttgccaacat tctggctgag atcagcgccc agagcgcgcc aggctggggg 2461 aaaggagcag aagggagggc cctagcgacc cgcgggatgt ggtccgagtc acgtccgagg 2521 gggtggggag ggatcgtgtt ctcgcgcccg ccccttccta gcgcggcctc tggctcgcct 2581 ctcgggggcg gcccgtagcc cagtccgtcg cctgccattg gacgccgccc gctcctcgta 2641 aaggaaaaag ctcggcggag ggcggagtgg tgcctttaaa aggccggcgc cgccttccgc 2701 ctgcaccgcc tcctgcgccg ccccttccga ggctaaatcg cttcctctcg gaacgcgccg 2761 cagaaggggt cctggtgacg agtcccgcgt tctctccttg aatccactcg ccagcccgcc 2821 gccctctgcc gccgcaccct gcacacccgc ccctctcctg tgccaggtga gcgcccctct 2881 tgcggggacc cagggaccgt ggagagggtc ttgggggcag tggcgggttg gcgtccgcgt 2941 ggaggcctcc cccattcgcg cccatgccag cgtctcccca ctaccaggca cacacaggct 3001 ccccggcccc tccagcctga ggtcctctaa ctgcgcaatg cagctgcgcg cgctgagtca 3061 tggcggggag 
gaagccggac gagatgaagg accattctcc cccttttctt gcagggaccc 3121 ctgtggcaaa ggattagggc cccttaccgc tggcgtggat cctaagaggc agtgaggggt 3181 gggggccggc ccatgtacag ccccagggtt ctcgcaagtg ggagcttggt ttctgtcctg 3241 ggaaacgggc gcccttcgcg aggagggaaa cccctcgcgt gcttgatgcc cccttaacac 3301 tttccctgtc tctccttatc gggcgaccat tgattctgag cccggaacag ctgcagccat 3361 gcgaagcgac gggagcattt ttcaggggaa ggcgcttgct cctccacgtt cttgccccgt 3421 aggaacagtg acgatggcaa agcttaccgc tttcctgcct gggctagggc tagttccgcc 3481 gccctttcct gggcttctcc ttgctctctt atatttttcc taatgcccct ttcctaccac 3541 ccgccccctc cttgtgggga aaagcctgac cttgggatgt ccttgaagcc ttggagcccg 3601 ggccagccct gggatcttga ggggattgga aggaacaccc agtggcagtc agaagagctg 3661 ggttctaacc tcagatctgg ctccggggtt gctgtgtggc ttgaagcaca gacctttccc 3721 atatctgggc cctttcccac gagggtgttg ggccctctgc ttgattcacg atctttacat 3781 tctaaaatac tccggttcgg ttttgttttc aggcaaggtg accccatggc aaggcgcaag 3841 ccagaagggt ccagcttcaa catgacccac ctgtccatgc tatggccttt tcctttcccc 3901 cagttgccag tgggcaactc caccctcagc tgggcaacac ccagcaccag acagagttag 3961 gaaaggtacc ggggcaggcc tagcaaaggg aagttgggcg taagagagag ctggggacca 4021 gaagtgcccc agggcctgct gggtgtgggg caggggaggt agggaacatt tccctgacct 4081 ccaggagagg ggccctggtc atcgggagat gatgggaaac cctagctaac tagtccttcc 4141 cctctgtttc ctgtatccag gaacttgcta ctaccagcac catgccctac caatatccag 4201 cactgacccc ggagcagaag aaggaactgt ctgacatcgc tcaccgcatc gtggcacctg 4261 gcaagggcat cctggctgca gatgagtcca ctggtcgcgg gcaggagaca gaatgggtgg 4321 agggtgcagg gttgggagtg gcaggctgat cccctaattc ccatgtgaca ctcccaggga 4381 gcattgccaa gcggctgcag tccattggca ccgagaacac cgaggagaac cggcgcttct 4441 accgccagct gctgctgaca gctgacgacc gcgtgaaccc cggcattggg ggtgtcatcc 4501 tcttccatga gacactctac cagaaggcgg atgatgggcg tcccttcccc caagttatca 4561 aatccaaggg cggtgttgtg ggcatcaagg taaggggagg gcctccggac gtgaggtttg 4621 agtaggaagt ggaggaagga aatccaggtt agtgaggcag ggaatgaatg ctggattggt 4681 ggcctagaga cttgcatgga gcctgcttca ggcttagggc atttactcga cattttttat 4741 cctcacaatt ctgagagaac 
agtatcattc ccactttaca catgaaaatc tcagaagctc 4801 agggaagtga agtgttttgc tcagagtaag tggcagagcc cagtctgaca cccaattcac 4861 tcaacatttc tgttgtcaat aacatcccag tgtatcctgt ctcagaggat tgttactaag 4921 tgaactaagt gaaaatattt taatttaatt gtaactaagt atttgagtaa ctaaacattt 4981 taagtaaatt tttatttttt ttttttttga gatggagtct cgctgtctcc caggctggag 5041 tgcagtggtg cgatctcttg gctcactgca agttccgcct cccaggttca cgccattctc 5101 ctgcctcagc ctcccgagta gctgggacta caggcgcccg ccaccacccg gctaattttt 5161 tttgtatttt tagtagacac tgggtttcac catgttagcc aggatggtct cgatctcctg 5221 atctcgtgat ctgcccacct cggcctccct aagtgctggg attacaggcg tgcagccatg 5281 cccggcttaa gtaactaaat attttaattg taactaagtg aaaatatttg ccagccctga 5341 agatgcagtt taagggatta acgtaaaata gtgggggaag actggggcta aagaagagga 5401 aagaggggca cgccagctac ctaggaggct gaggcgggag gatcacttga gtccaggaag 5461 gggaggcttc aggtgagctg agattccacc actgtactcc agccgaggcg aaagtggaaa 5521 gggtgctaga ggtcatttcc tgtgtcttaa tgttgttacc ctgaccccaa caggtagaca 5581 agggcgtggt ccccctggca gggacaaatg gcgagactac cacccaaggt gagaactgtt 5641 tgattctctg ccctacgaac ccaaccagag caggtttggg tgctgggagg agtggaaacc 5701 acatgcccct cccaccctgc tctgaccttc ctcttctctt agggttggat gggctgtctg 5761 agcgctgtgc ccagtacaag aaggacggag ctgacttcgc caagtggcgt tgtgtgctga 5821 agattgggga acacaccccc tcagccctcg ccatcatgga aaatgccaat gttctggccc 5881 gttatgccag tatctgccag caggtggcct gcaggtcctc aataggcaac ctcctacctc 5941 atttggttcc agtgttgtta atttgcctat taactgccat gatgcctacc tccccaaaag 6001 caagcattag ctttggcgcg tggaggcact caagggctgt tgaaggcaga ggggccaagg 6061 agggatggtg ggtggatctg aggcggctct tgtctcctgt aatctagggc tttgaagcct 6121 gagtccctgg catcatcaag atacggtctt gaccagtggc tgtggagaga tgtaggtggg 6181 actctgggtt aggaggcctc acagtgaccc tgtccctcgc cctgcagaat ggcattgtgc 6241 ccatcgtgga gcctgagatc ctccctgatg gggcccatga cttgaagcgc tgccagtatg 6301 tgaccgagaa ggtaaatggc tacctgcctg accagtgcaa ggtggctggc cggggaccct 6361 ggggctaacc cctatcctct cctccacccc actacccacc gtgcgcctgc tctgctcagg 6421 tgctggctgc tgtctacaag gctctgagtg 
accaccacat ctacctggaa ggcaccttgc 6481 tgaatcccaa catggtcacc ccaggccatg cttgcactca gaagttttca catgaggaga 6541 ttgccatggc gaccgtcaca gcgctgcgcc gcacagtgcc ccccgctgtc actggtgagg 6601 cccactcatc ttgatctcta tgcagtagat aagctccacc cacaacccta tgcccatttg 6661 gacggatttc ccatggcaac ttccaccagc tcctgccagc ttcctgggtc tctgacacag 6721 ccccctctgc taccccctgc actacaggga tcaccttcct gtctggaggc cagagtgagg 6781 aggagtcgtc catcaacctc aatgccatta acaagtgccc cctgctgaag ccctgggccc 6841 tgaccttctc ctacggccga gccctgcagg cctctgccct gaaggcctgg ggcgggaaga 6901 aggagaacct gaaggctgcg caggaggagt atgtcaagcg agccctggta aggataggca 6961 ggaggtgggc agggtgcctg ggtggatggg actcggggaa gagcccttct cactccaccc 7021 ctctccctgc ttaggccaac agccttgcct gtcaaggaaa gtacactccg agcggtcagg 7081 ctggggctgc tgccagcgag tccctcttcg tctctaacca cgcctattaa gcggaggtgt 7141 tcccaggctg cccccaacac tccaggccct gccccctccc actcttgaag aggaggccgc 7201 ctcctggggc tccaggctgg cttgccgcgc tctttcttcc ctcgtgacag tgttgtgtgg 7261 tgtcgtctgt gaatgctaag tccatcaccc tttccgggac actgccaaat aaacagctat 7321 ttaaggggga gtcggccgtc cgtgtcttgt ggtgtctaat gcaggggagg gcctggggag 7381 gtagcagagc ccagaagaag aaagagcccc tgttctctgt tttttctggg cagaaaagga 7441 gtgaaaggtg gaaggacctt cctgctctgt tttatacttg gccagggctt caagaaaggc 7501 tgagagctgt gacattttct tcactgcagg // LOCUS HSAPC3A 6065 bp DNA PRI 03-NOV-1994 DEFINITION Human apolipoprotein CIII gene and apo AI-apo CIII intergenic region. ACCESSION X01392 NID g28725 KEYWORDS apolipoprotein; repetitive sequence; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6065) AUTHORS Protter,A.A., Levy-Wilson,B., Miller,J., Bencen,G., White,T. and Seilhamer,J.J. 
TITLE Isolation and sequence analysis of the human apolipoprotein CIII gene and the intergenic region between the apo AI and apo CIII genes JOURNAL DNA 3 (6), 449-456 (1984) MEDLINE 85076175 COMMENT The intergenic region between the apo AI and apo CIII genes shows the presence of an Alu-type repetitive element localized approximately 1400 bp from the apo CIII. A second repetitive element is found in intron III of the apo CIII gene. FEATURES Location/Qualifiers source 1..6065 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 87..95 CAAT_signal 101..109 TATA_signal 161..168 prim_transcript 190..3322 mRNA join(193..225,851..918,1054..1177,3015..3322) exon 193..225 /number=1 intron 226..850 /number=1 exon 851..918 /number=2 sig_peptide join(864..918,1054..1061) CDS join(864..918,1054..1177,3015..3135) /codon_start=1 /product="apo CIII" /db_xref="PID:g296632" /db_xref="SWISS-PROT:P02656" /translation="MQPRVLLVVALLALLASARASEAEDASLLSFMQGYMKHATKTAK DALSSVQESQVAQQARGWVTDGFSSLKDYWSTVKDKFSEFWDLDPEVRPTSAVAA" intron 919..1053 /number=2 exon 1054..1177 /number=3 mat_peptide join(1062..1177,3015..3132) /product="apo CIII" intron 1178..3014 /number=3 exon 3015..3322 /number=4 polyA_signal 3296..3301 misc_feature 3323..6000 /note="intergenic region" exon complement(6001..>6065) /note="exon 4 (apo AI)" polyA_signal complement(6018..6023) /note="polyA signal (apo AI)" BASE COUNT 1183 a 1762 c 1641 g 1477 t 2 others ORIGIN 1 ctggcgggac agcagcgtgg actcagtctc ctagggattt cccaactctc ccgcccgctt 61 gctgcatctg gacaccctgc ctcaggccct catctccact ggtcagcagg tgacctttgc 121 ccagcgccct gggtcctcag tgcctgctgc cctggagatg atataaaaca ggtcagaacc 181 ctcctgcctg tctgctcagt tcatccctag aggcagctgc tccaggtaat gccctctggg 241 gaggggaaag aggaggggag gaggatgaag aggggcaaga ggagctccct gcccagccca 301 gccagcaagc ctggagaagc acttgctaga gctaaggaag cctcggagct ggacgggtgc 361 cccccacccc tcatcataac ctgaagaaca tggaggcccg ggaggtgtca cttgcccaaa 421 gctacatagg gggtggggct ggaagtggct ccaagtgcag gttcccccct cattcttcag 481
gcttagggct ggaggaagcc ttagacagcc cagtcctacc ccagacaggg aaactgaggc 541 ctggaagagg gccagaaatc acccaaagac acacagcatg ttggctggac tggacggaga 601 tcagtccaga ccgcagtgcc ttgatgttca gtctggtggg ttttctgctc catcccaccc 661 acctcctttg ggcctcgatc cctcgccgct caccagtccc ccttctgaga gcccgtatga 721 gcaggagccg gcccctactc cttctggcag acccagctaa ggttctacct taggggccac 781 gccacctccc cagggagggg tccagaggca tggggacctg gggtgcccct cacaggacac 841 ttccttgcag gaacagaggt gccatgcagc cccgggtact ccttgttgtt gccctcctgg 901 cgctcctggc ctctgcccgt aagcacttgg tgggactggg ctgggggcag ggtggaggca 961 acttggggat cccagtccaa tgggtggtca agcaggagcc ccagggctcg tccagaggcc 1021 gatccacccc actcagccct gctctttcct caggagcttc agaggccgag gatgcctccc 1081 ttctcagctt catgcagggc tacatgaagc acgccaccaa gaccgccaag gatgcactga 1141 gcagcgtgca ggagtcccag gtggcccagc aggccaggta cacccgctgg cctccctccc 1201 catcccccct gccagctgcc tccattccca cccgcccctg ccctggtgag atcccaacaa 1261 tggaatggag gtgctccagc ctcccctggg cctgtgctct tcagcctcct ctttcctcac 1321 agggcctttg tcaggctgct gcgggagaga tgacagagtt gagactgcat tcctcccagg 1381 tccctccttt ctcccggagc agtcctaggg ccgcgccgtt ttagccctca tttccatttt 1441 cctttccttt ccctttcttt ctctttctat ttttctttct ttctttcttt ctttctttct 1501 ttctttcttt ctttctttct ttcctttctt tctttccttt ctttctttct tttctttctt 1561 tctctttctt tctttctttc ctttttcttt ctttccctct cttcctttct ctctttcttt 1621 cttcttcttt tttttttaat ggagtctccc tctgtcaccc aggctggagt gcagtggtgc 1681 catctcggct cactgcaacc tccgtctccc gggttcaacc cattctcctg cctcagcctc 1741 ccaagtagct gggattacag gcacgcgcca ccacacccag ctaatttttg tatttttagc 1801 agagatgggg tttcaccatg ttggccaggt tggtcttgaa ttcctgacct caggggatcc 1861 tcctgcctcg gcctcccaaa gtgctgggat tacaggcacg agccactgcg cctggcccca 1921 ttttcctttt ctgaaggtct ggctagagca gtggtcctca gcctttttgg caccagggac 1981 cagttttgtg gtggacaatt tttccatggg ccagcgggga tggttttggg atgaagctgt 2041 tccacctcag atcatcaggc attagattct cataaggagc cctccaccta gatccctggc 2101 atgtgcagtt cacaataggg ttcacactcc tatgagaatg taaggccact tgatctgaca 2161 ggaggcggag 
ctcaggcgta ttgctcactc acccaccact cacttcgtgc tgtgcagccc 2221 ggctcctaac agtccatgga ccagtaccta tctatgactt gggggttggg gaccctgggc 2281 taggggtttg ccttgggagg ccccacctga cccaattcaa gcccgtgagt gcttctgctt 2341 tgttctaaga cctggggcca gtgtgagcag aagtgtgtcc ttcctctccc atcctgcccc 2401 tgcccatcag tactctcctc tcccctactc ccttctccac ctcaccctga ctggcattag 2461 ctggcatagc agaggtgttc ataaacattc ttagtcccca gaaccggctt tggggtaggt 2521 gttattttct cactttgcag atgagaaaat tgaggctcag agcgattagg tgacctgccc 2581 cagatcacac aactaatcaa tcctccaatg actttccaaa tgagaggctg cctccctctg 2641 tcctaccctg ctcagagcca ccaggttgtg caactccagg tggtgctgtt tgcacagaaa 2701 acaatgacag ccttgacctt tcacatctcc ccaccctgtc actttgtgcc tcaggcccag 2761 gggcataaac atctgaggtg acctggagat ggcagggttt gacttgtgct ggggttcctg 2821 caaggatatc tcttctccca gggtggcagc tgtgggggat tcctgcctga ggtctcaggg 2881 ctgtcgtcca gtgaagttga gagggtggtg tggtcctgac tggtgtcgtc cagtggggac 2941 atgggtgtgg gtcccatggt tgcctacaga ggagttctca tgccctgctc tgttgcttcc 3001 cctgactgat ttaggggctg ggtgaccgat ggcttcagtt ccctgaaaga ctactggagc 3061 accgttaagg acaagttctc tgagttctgg gatttggacc ctgaggtcag accaacttca 3121 gccgtggctg cctgagacct caatacccca agtccacctg cctatccatc ctgccagctc 3181 cttgggtcct gcaatctcca gggcttcccc tgtaggttgc ttaaaaggga cagtattctc 3241 agtgctctcc taccccacct catgcctggc ccccctccag gcatgctggc ctcccaataa 3301 agctggacaa gaagctgcta tgagtgggcc gtcgcaagtg tgccatctgt gtctgggcat 3361 gggaaagggc cgaggctgtt ctgtgggtgg gcactggaca gactccaggt caggcaggca 3421 tggaggccag cgctctatcc accttctggt agctgggcta gtctctgggc ctcagtttct 3481 tcatctctaa ggtaggaatc accctccgta ccctgccttc cttgacagct ttgtgcggaa 3541 ggtcaaacag gacaataagt ttgctgatac tttgataaac tgttaggtgc tgcacaacat 3601 gacttgagtg tgtgccccat gccagccact atgcctggca cttaagtgtc atcagagttg 3661 agactgtgtg tgtttactca aaactgtgga gctgacctcc cctatccagg ccacctagcc 3721 ctcttaggcg cacgtgaagg gaggaggccg gatgggctag aggttggagt aagatgcaac 3781 gaggcactat tcttggctcc accacttgat atcagcctca gtttcttaca tgtaaagtgg 3841 atacaaccgt accccctcca 
ccgtaggttt gccgtgagat tgaaatgaga gagcgttcga 3901 accgtttggc acagcacctg cacgtaaaga tgcttgatca atgttgtcat gattacagtt 3961 gagctgactg ggcccttggg accggactgg agtggtgggg ggcagtgtcc tgggaccaaa 4021 aagaagcaca aggtctccca atagaggctg cttcctttgt gtccccacca cccgaaagat 4081 gtcaggtcag agagcccgag agctgcagat ggcttgagta gggctccact cttcagatca 4141 aaaaactgtg gcccggagag gcgaaggcac ttggccagca tcacagagcc agcacgtggc 4201 agggccagac cttgagccca ggtcagctgc gtgtattctg ctcagttggt gcagaaaaca 4261 gttttgtcac tcctatgtca ggtgttaggg actcctttac agatctcagt ggcatcagta 4321 catccagccc cacctggaga ctgctttctc tctgaaaatt ccccagggct tctctctggg 4381 ctgagagatc tcagcacccg tatctagaaa atgttcccac ccagacctgg ctggatgact 4441 gctgttgtag ctctggaagg ttaggaacta aaaagcccac tcctttacct agggtagcta 4501 agatacactg gagatgggga catggggatg gggccgatta tccaggggcc tgcatgaggg 4561 ggcaaaaggc cctgcagaga gagggtaggg aaggcactgc cagatctgtg aagccatgtg 4621 cgtgcagcgg ggacattcag acatgagtgc aaggagggac cgtgagcagg gaggtcatgt 4681 gagaatacac aggcatgcct gcacacccat gtgaacttga gtgccaggcc acacactctt 4741 tttttttttt tttttttttt tagctggagt cttgctccgt cgcccaggct ggagtgcagt 4801 ggcatgattt cggctcactg tgacctctgc ctcccaggtt caagcgattc tcctgcctca 4861 gccttcctag tagctgggat tacgggtgca agccaccatg ccagctgatt ttttttttgt 4921 atttttagta gagacagggt ttcaccatat tggccaggct ggtctcaaac tcctggccct 4981 gaagtgatac gcccacctca gcctcccaaa gtgctgggat tacaggcttg agccaccgca 5041 cccgacccgc acactctttt caataatcat ggatggccag gggtgcaggg tctaaaaagc 5101 gctgcctagc ccatcctgct gttcactggg caagcgacgt cacaggtcca ggcttcagtg 5161 tcctcatcca tgctctgcgt ctgatggcaa tctagccagg atgtggggaa gggaggatgc 5221 agtgagagca cagatatgag agcatcttgg aaataaaaat gtacctgcaa gaggtggtgg 5281 tgaattttct tactcaggcc agcttctgcc agggctggca gaaagagggg gtggcatggc 5341 atggagccgc agggggtgga ggactggctt ccactgctgt gcctgaggaa gccgcggctg 5401 tttctgggcg ggatgggagt agtgggaggg ggatactggc cttgtgagaa gaaaagggaa 5461 gtgtctgttt gagaggtttt tgaattagta aaggaggaca ggcgcaaact ccaagcgctt 5521 cacttgcacc cgggaccaaa ccccaatccc 
agtggctggc tccctgaggc gccccgctcc 5581 gtcccgcccg ctgacagcgg ctgggctgga gaaggctcta tacggacaca cctctgggga 5641 cggggaaccc gactgctccc agctaaagca accgctgttt cctggcccgc ctcagacagg 5701 ctgcaggcct tgtttgagcc cctttcaggg cacctggcct tggattgtct gtggctttgc 5761 ctngtccgct gtgacttcct ttctacttga gccttgctaa ggcagactct actccctcac 5821 tcgtaagcag ccangcgtcc agcaggtcct ccaacgtcga tcttggccct aagacgtcca 5881 gtctgggcac ggagttgttg agatccggca ggaagtccct gctccagggc caaaggcccc 5941 tcccgggctc ccccggatgt ccccgcaccc ccctctattc tcccaaaaga aagaagctgc 6001 ttcccacttt ggaaacgttt attctgagca ccgggaaggg gggcggcggc gggcgcctca 6061 ctggg // LOCUS HSAPOA2 3360 bp DNA PRI 16-FEB-1995 DEFINITION Human gene for apolipoprotein AII. ACCESSION X04898 NID g28743 KEYWORDS apolipoprotein; apolipoprotein A-II; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3360) AUTHORS Shelley,C.S., Sharpe,C.R., Baralle,F.E. and Shoulders,C.C. TITLE Comparison of the human apolipoprotein genes. Apo AII presents a unique functional intron-exon junction JOURNAL J. Mol. Biol. 186 (1), 43-51 (1985) MEDLINE 86089113 REFERENCE 2 (bases 1 to 3360) AUTHORS Knott,T.J., Wallis,S.C., Robertson,M.E., Priestley,L.M., Urdea,M., Rall,L.B. and Scott,J. TITLE The human apolipoprotein AII gene: structural organization and sites of expression JOURNAL Nucleic Acids Res. 13 (17), 6387-6398 (1985) MEDLINE 86016095 COMMENT An Alu repetitive element is located around the polymorphic MspI site at pos. 3033. FEATURES Location/Qualifiers source 1..3360 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature complement(59..64) /note="seq. pot. involved in steroid hormone/receptor binding" misc_feature complement(632..637) /note="seq. pot. 
involved in steroid hormone/receptor binding" CAAT_signal 1100..1108 TATA_signal 1148..1153 prim_transcript 1174..2507 /gene="apo AII" mRNA join(1174..1210,1380..1455,1749..1881,2277..2507) /gene="apo AII" exon 1174..1210 /gene="apo AII" /number=1 gene 1174..2507 /gene="apo AII" misc_feature 1182 /gene="apo AII" /note="5' end of cDNA" intron 1211..1379 /gene="apo AII" /number=1 exon 1380..1455 /gene="apo AII" /number=2 CDS join(1404..1455,1749..1881,2277..2394) /gene="apo AII" /codon_start=1 /product="apolipoprotein" /db_xref="PID:g671882" /db_xref="SWISS-PROT:P02652" /translation="MKLLAATVLLLTICSLEGALVRRQAKEPCVESLVSQYFQTVTDY GKDLMEKVKSPELQAEAKSYFEKSKEQLTPLIKKAGTELVNFLSYFVELGTHPATQ" sig_peptide 1404..1455 /gene="apo AII" intron 1456..1748 /gene="apo AII" /number=2 repeat_region 1711..1742 /note="(GT) 16, pot. z-DNA sequence" misc_feature 1711..1748 /gene="apo AII" /note="functional acceptor site variant sequence" exon 1749..1881 /gene="apo AII" /number=3 misc_feature 1749..1765 /gene="apo AII" /note="pro-peptide" intron 1882..2276 /gene="apo AII" /number=3 exon 2277..2507 /gene="apo AII" /number=4 polyA_signal 2487..2492 /gene="apo AII" polyA_site 2507 /gene="apo AII" BASE COUNT 904 a 849 c 832 g 775 t ORIGIN 1 cccgggaggt ggaggttgca gtgagccgag atcatgccat tacgctccag cctgagcaac 61 aagagcaaaa ctctgtctca ggaaaacaaa caaaaaaacc tgcacatata cttctgaatt 121 taaaacaaaa gttaaaaaac aaagatttct tggtctctgg tcactacctc cctcatcagc 181 tttgcgcctc cactgtcacc ctcaggaatg ttccacatac tcagcgagta tgcttggggg 241 gcaaaagggt gaaagataca aaagcttctg atatctattt aactgatttc acccaaatgc 301 tttgaacctg ggaatgtacc tctccccctc ccccaccccc aacaggagtg agacaagggc 361 cagggctatt gcccctgctg actcaatatt ggctaatcac tgcctagaac tgataaggtg 421 atcaaatgac caggtgcctt caacctttac cctggtagaa gcctcttatt cacctctttt 481 cctgccagag ccctccattg ggaggggacg ggcggaagct gttttctgaa tttgttttac 541 tgggggtagg gtatgttcag tgatcagcat ccaggtcatt ctgggctctc ctgttttctc 601 cccgtctcat tacacattaa ctcaaaaacg gacaagatca tttacacttg ccctcttacc 661 
cgaccctcat tcccctaacc cccatagccc tcaaccctgt ccctgatttc aattcctttc 721 tcctttcttc tgctccccaa tatctctctg ccaagttgca gtaaagtggg ataaggttga 781 gagatgagat ctacccataa tggaataaag acaccatgag ctttccatgg tatgatgggt 841 tgatggtatt ccatgggttg atatgtcaga gctttccaga gaaataactt ggaatcctgc 901 ttcctgttgc attcaagtcc aaggacctca gatctcaaaa gaatgaacct caaatatacc 961 tgaagtgtac ccccttagcc tccactaaga gctgtacccc ctgcctctca ccccatcacc 1021 atgagtcttc catgtgcttg tcctctcctc ccccatttct ccaacttgtt tatcctcaca 1081 taatccctgc cccactgggc ccatccatag tccctgtcac ctgacagggg gtgggtaaac 1141 agacaggtat atagcccctt cctctccagc cagggcaggc acagacacca aggacagaga 1201 cgctggctag gtaagataag gaggcaagat gtgtgagcag catccaaaga ggcctgggct 1261 tcagttgtgg agagggagag agccaggttg gaatgggcag caggtaggga gatccctggg 1321 gaggagctga agcccatttg gcttcagtgt cccccaaacc cccaccaccc tcttcctagg 1381 ccgccctccc cactgttacc aacatgaagc tgctcgcagc aactgtgcta ctcctcacca 1441 tctgcagcct tgaaggtggg tgtgatatgg agagggggcc aaaagggaga atgtgctggt 1501 ggttatagct acctatctgc ctgtgctctt atatccaggt ctggcaacca aagggctcaa 1561 aaggaagatc attccctctc taaagcccag aaagcccatc caaagatctg gctggatacc 1621 tttttgggga gggggcagag agctggggtg ggtactggca gaggtatggc aggaggccag 1681 gattcactgc tgtggaccca gctgaaaaga gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 1741 gtgggcagga gctttggttc ggagacaggc aaaggagcca tgtgtggaga gcctggtttc 1801 tcagtacttc cagaccgtga ctgactatgg caaggacctg atggagaagg tcaagagccc 1861 agagcttcag gccgaggcca agtaagtctc agggcaaggg gttcaggggc tgtggaactg 1921 tggagagaaa gaagggaaga tgagaggtcc cacagaagtc tgaacccagg ggtggggatt 1981 agggcagatt aggcttaaat tgcagagaaa aagtatttca tcacccaaag atcccacacg 2041 tctcttagat agagaggaac agcaagaact gggccttgaa tttcagtctc tagagtctgt 2101 ccctctacct agcaaaggtc ttgactctat tcctacctag gggctttgcc atgcgatgga 2161 ccaggcacta gagtttgggg acctgagtca gtctgctctg acctcccacc accaccaagg 2221 cccctgccag tgcctagggt ccctcagatt aaactctaat cccctcacct atccaggtct 2281 tactttgaaa agtcaaagga gcagctgaca cccctgatca agaaggctgg aacggaactg 2341 gttaacttct 
tgagctattt cgtggaactt ggaacacacc ctgccaccca gtgaagtgtc 2401 cagaccattg tcttccaacc ccagctggcc tctagaacac ccactggcca gtcctagagc 2461 tcctgtccct acccactctt tgctacaata aatgctgaat gaatccagct ctgagcctgg 2521 tatgtttggg ggactgggaa aagtagggga gtaagggagg agaagaggaa ggaaaaggaa 2581 aaatctgctt ctagaaggag agaggtttga gtgtggaggg gtgaagaaag gattgaagac 2641 acaactgatg aaaatgacag gatgagggtg ccatgattct ccaaacccag agtctcctac 2701 agcctgggca cgactctgca ggtgaacact aaagaggctt tgcattgcac aaggaactag 2761 gagaggaagg aaggattcac aaactgaatc cctcgatttc tggaggtcat agaaaatgag 2821 ggtaccctgg tggggtgcag tggctcacac ctgtaatccc tgcactttgg gaggccaagg 2881 taggtggatc acttgagatc aggagttcca gaccagccta gccaacatgg taaaaccccg 2941 tctctactaa aaatacaaaa attagccagg tgttgtggca cgtgcctgta atcccagcta 3001 ctcgggagac tgaggcatga gaatcttttg aaccggggag gcggaggttg cagtgagctg 3061 acatcgtgcc actgcactcc agcctaggtg acagagcaag actccgtctc aaaaaaaaaa 3121 aaaaaaaaaa aaaaaagaaa gtaaagaaaa aaagaaaatg agggtacccc tcataatttc 3181 ctgttagtca ttctatgaag aaaagaaagc ttcccaaggt gtcacccgtg gccctccttt 3241 cccttctgag ccaggggaac actgtgtttc cccctttccc acaataaaag acttgagttt 3301 gctcctctcc ctagaagtgc tctaatttct ccatttaaaa cctcttatct agaccaggca // LOCUS HSAPOAIA 2209 bp DNA PRI 03-NOV-1994 DEFINITION Human fetal gene for apolipoprotein AI precursor. ACCESSION X01038 NID g28769 KEYWORDS apolipoprotein; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2209) AUTHORS Seilhamer,J.J., Protter,A.A., Frossard,P. and Levy-Wilson,B. 
TITLE Isolation and DNA sequence of full-length cDNA and of the entire gene for human apolipoprotein AI--discovery of a new genetic polymorphism in the apo AI gene JOURNAL DNA 3 (4), 309-317 (1984) MEDLINE 85026665 FEATURES Location/Qualifiers source 1..2209 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 8..13 mRNA join(39..56,254..316,504..660,1249..1907) exon 39..56 /number=1 intron 57..253 /number=1 exon 254..316 /number=2 sig_peptide join(274..316,504..514) CDS join(274..316,504..660,1249..1852) /codon_start=1 /product="apolipoprotein AI precursor" /db_xref="PID:g296635" /db_xref="SWISS-PROT:P02647" /translation="MKAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVD VLKDSGRDYVSQFEGSALGKQLNLKLLDNWDSVTSTFSKLREQLGPVTQEFWDNLEKE TEGLRQEMSKDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLHE LQEKLSPLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEALKENGGARLAEYHA KATEHLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKLNTQ" intron 317..503 /number=2 exon 504..660 /number=3 mat_peptide join(515..660,1249..1849) /product="apolipoprotein AI" intron 661..1248 /number=3 exon 1249..1907 /number=4 BASE COUNT 450 a 688 c 685 g 384 t 2 others ORIGIN 1 ctgcagacat aaataggccc tgcaagagct ggctgcttag agactgcgag aaggaggtgc 61 gtcctgctgc ctgccccggt cactctggct ccccagctca aggttcaggc cttgccccag 121 gccgggcctc tgggtacctg aggtcttctc ccgctctgtg cccttctcct cacctggctg 181 caatgagtgg gggagcacgg ggcttctgca tgctgaaggc accccactca gccaggccct 241 tcttctcctc caggtccccc acggcccttc aggatgaaag ctgcggtgct gaccttggcc 301 gtgctcttcc tgacgggtag gtgtccccta acctagggag ccaaccatcg gggggccttc 361 tccctaaatc cccgtggccc accctcctgg gcagaggcag caggtttctc actggccccc 421 tctcccccac ctccaagctt ggcctttcgg ctcagatctc agcccacagc tggcctgatc 481 tgggtctccc ctcccaccct cagggagcca ggctcggcat ttctggcagc aagatgaacc 541 cccccagagc ccctgggatc gagtgaagga cctggccact gtgtacgtgg atgtgctcaa 601 agacagcggc agagactatg tgtcccagtt tgaaggctcc gccttgggaa aacagctaaa 661 gtaaggaccc agcctggggt tgagggcagg ggcagggggc agaggcctgt gggatgatgt 721 tgaagccaga ctggccgagt 
cctcacctaa tatctgatga gctgggcccc acagatggtc 781 tggatggaga aaccggaatg gatctccagg cagggtcaca gcccatgtcc cctgcaaagg 841 acagaccagg gctgcccgat gcgtgatcac agagccacat tgtgcctgca agtgtagcaa 901 gcccctttcc cttcttcacc acctcctctg ctcctgccca gcaagactgt gggctgtctt 961 cggagaggag aatgcgctgg aggcatagaa gcgaggtcct tcaagggccc actttggaga 1021 ccaacgtaac tgggcaccag tcccagctct gtctcctttt tagctcctct ctgtgcctcg 1081 gtccagctgc acaacggggc atggcctggc ggggcagggg tgttggttga gagtgtactg 1141 gaaatgctag gccactgcac ctccgcggac aggtgtcacc cagggctcac ccctgatagg 1201 ctggggcgct gggaggccag ccctcaaccc ttctgtctca ccctccagcc taaagctcct 1261 tgacaactgg gacagcgtga cctccacctt cagcaagctg cgcgaacagc tcggccctgt 1321 gacccaggag ttctgggata acctggaaaa ggagacagag ggcctgaggc aggagatgag 1381 caaggatctg gaggaggtga aggccaaggt gcagccctac ctggacgact tccagaagaa 1441 gtggcaggag gagatggagc tctaccgcca gaaggtggag ccgctgcgcg cagagctcca 1501 agagggcgcg cgccagaagc tgcacgagct gcaagagaag ctgagcccac tgggcgagga 1561 gatgcgcgac cgcgcgcgcg cccatgtgga cgcgctgcgc acgcatctgg ccccctacag 1621 cgacgagctg cgccagcgct tggccgcgcg ccttgaggct ctcaaggaga acggcggcgc 1681 cagactggcc gagtaccacg ccaaggccac cgagcatctg agcacgctca gcgagaaggc 1741 caagcccgcg ctcgaggacc tccgccaagg cctgctgccc gtgctggaga gcttcaaggt 1801 cagcttcctg agcgctctcg aggagtacac taagaagctc aacacccagt gaggcgcccg 1861 ccgccgcccc ccttcccggt gctcagaata aacgtttcca aagtgggaag cagcttcttt 1921 cttttgggag aatagagggg ggtgcgggga catccggggg agcccgggag gggcctttgg 1981 ccctggagca gggacttcct gccggatctc aacaactccg tgcccagact ggacgtctta 2041 gggccaagat cgacgttgga ggacctgctg gacgcntggc tgcttacgag tgagggagta 2101 gagtctgcct tagcaaggct caagtagaaa ggaagtcaca gcggacnagg caaagccaca 2161 gacaatccaa ggccaggtgc cctgaaaggg gctcaaacaa ggcctgcag // LOCUS HSARS81S 2063 bp DNA PRI 11-SEP-1996 DEFINITION H.sapiens ARS gene, component B. ACCESSION X99977 NID g1536901 KEYWORDS ARS. SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2063) AUTHORS Mastrangeli,R. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2063) AUTHORS Mastrangeli,R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) R. Mastrangeli, Istituto di Ricerca Cesare Serono Spa, Department of Drug Discovery, Via di Valle Caia 22, 00040 Ardea (Rome), ITALY FEATURES Location/Qualifiers source 1..2063 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" TATA_signal 524..529 exon 552..635 /number=1 sig_peptide join(578..635,1041..1048) /gene="ARS" gene 578..1820 /gene="ARS" CDS join(578..635,1041..1160,1687..1820) /gene="ARS" /standard_name="ARS(Component B)-81/s" /codon_start=1 /db_xref="PID:e265523" /db_xref="PID:g1536902" /translation="MASRWAVQLLLVAAWSMGCGEALKCYTCKEPMTSASCRTITRCK PEDTACMTTLVTVEAEYPFNQSPVVTRSCSSSCVATDPDSIGAAHLIFCCFRDLCNSE L" intron 636..1040 /gene="ARS" /number=1 exon 1041..1160 /gene="ARS" /number=2 mat_peptide join(1049..1160,1687..1817) /gene="ARS" intron 1161..1686 /gene="ARS" /number=2 exon 1687..2010 /number=3 polyA_signal 1992..1997 BASE COUNT 384 a 642 c 651 g 386 t ORIGIN 1 tggcccatgc taccctcacc tgacacctgc ttcctacctc tggtttctac tttgcaggtg 61 tgtatcaggt gtacacagac caggtagagg tctgtggaga gggctgcagg ccaggctgca 121 gggaaggggt gccaggcggg gctagagcaa caagggcaga ggctacactg aacctgggtc 181 ttaagggtcc cccaggctgg ggctgggtgg cctatgtgaa ccccagaggc acagccagga 241 catgggggct catcagaggg gcagtctgag ctcagcagga aaggccttct ctgtcagagc 301 tgtcccagga ccactggaca tggctgagga acagtgagtt ccccagtgtt ggaggtgtgc 361 aagcagaggc ctggccatcg tcctcagaca gagctcccag atccagctcc ctgcccgtct 421 gccatgttcc tgccagctgc ctccccactg ggccctttac cacgttcctg actcacacgg 481 ccggttctgc caccgcccag aagccggtgc ccaagggcct ggctataaat ccttgatgtg 541 aggctggcta cctctcatca cttctgagca cggagcaatg gcctctcgct gggctgtgca 601 gctgctgctc gtggcagcct ggagcatggg ctgtggtgag tgggccgcag gctggtgggg 661 
accctgcctc tgagcttgtc tgcccacctc ctagggggat ggggctgttg ggggtgcttt 721 gtggctgaga gcctccttag gcctccatga ggctcaccct cctcattctc agtgagcctc 781 ctgggtccca gagcccagct tcaccctggg acaggggtca cggctccact ctgcaggaag 841 ggagactgag gcttggtgga gggatgcagc attcaagtct gtggctcagc tcagttagag 901 aaagctgcca gagaggcccc ttgaagggct gcccggggcc ttgaaagatg tcagcgagac 961 tccttcagcc cctgcctcct ggttccagga tgaggccacc gaggtcaggt gatgaggttc 1021 tgcccccatc cctcacccag gtgaggccct caagtgctac acctgcaagg agcccatgac 1081 cagtgcttcc tgcaggacca ttacccgctg caagccagag gacacagcct gcatgaccac 1141 gctggtgacg gtggaggcag gtgaggccag gccccacggc agccctgggt gcactggagt 1201 cagggccacc tcccccaagt gcctccctcc tttgctggtg ctcctcccgg cccaaaaggc 1261 agcaggtggg atgggcagaa caggctgcca caccttggca ggggtgcctt ccacgagggt 1321 ggcacagccc cctcagagac ccagtcctgg ggcaccaggc gctggaggtg ggtggggctt 1381 aatggccggg gtaccctggg gggctcaaac cccagctctg acacagaccc actgggtggt 1441 gttgccacag cctctgggct cgggctccca tctcagcgca ggcacttcag aggtctgaca 1501 aggcctaata attcatgaac aggtcacagt cagaggaggg ctgggccctg ggtggcttca 1561 cagatgtgga ctattgggaa cagggatcac agggagggtg aggtcaggcg acggcggctg 1621 ggagcagtgc agcagcaggc aggcgctgca ggggagtgag ggttctgaca ctggcccacc 1681 ctgcagagta ccccttcaac cagagccccg tggtgacccg ctcctgctcc agctcctgtg 1741 tggccaccga ccccgacagc atcggggccg cccacctgat cttctgctgc ttccgagacc 1801 tctgcaactc ggaactctga acccagggcg gcagggcgga aggtgctcct caggcacctc 1861 ctctctgacg gggcctggct ccacctgtga tcacctcccc ctgcttcctg ctgctgtggc 1921 acagctcact catggggtct gaggggagag aagcacacca ggggcgccct ctgccttcca 1981 taccccacgc ttataaaaca taactaagcc aagagtggac atgacttttg tccttcctgg 2041 gcactgacaa cctacagaac tgg // LOCUS HSARYLA 3637 bp DNA PRI 24-APR-1993 DEFINITION Human DNA for arylsulphatase A (EC 3.1.6.1). ACCESSION X52150 NID g28859 KEYWORDS arylsulphatase; lysosomal enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3637) AUTHORS Kreysing,J. TITLE Direct Submission JOURNAL Submitted (26-MAR-1990) Kreysing J., Georg-August University Biochemie II, Goalerstr 12D, 3400 Goettingen, F R G REFERENCE 2 (bases 1 to 650) AUTHORS Kreysing,J., von Figura,K. and Gieselmann,V. TITLE Structure of the arylsulfatase A gene JOURNAL Eur. J. Biochem. 191 (3), 627-631 (1990) MEDLINE 90361046 COMMENT See for mRNA sequence of arylsulphatase A. Data kindly reviewed (02-NOV-1990) by Hall L. FEATURES Location/Qualifiers source 1..3637 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="G1/1" /cell_type="leukocytes" /clone_lib="EMBL-3" /chromosome="22" misc_feature 191..200 /note="GC-box 1" misc_feature 201..210 /note="GC-box 2" misc_feature 213..222 /note="GC-box 3" misc_feature 240..249 /note="GC-box 4" prim_transcript 256..3356 mRNA join(256..847,997..1237,1352..1570,1645..1814,2127..2251, 2342..2469,2720..2822,2938..3356) exon 256..847 /number=1 CDS join(630..847,997..1237,1352..1570,1645..1814,2127..2251, 2342..2469,2720..2822,2938..3257) /EC_number="3.1.6.1" /codon_start=1 /product="arylsulphatase a" /db_xref="PID:g28860" /db_xref="SWISS-PROT:P15289" /translation="MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSS TTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPL EEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLT CFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRP FFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLV IFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL DLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTG KYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATP EVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA" intron 848..996 /number=1 exon 997..1237 /number=2 intron 1238..1351 /number=2 exon 1352..1570 /number=3 intron 1571..1644 /number=3 exon 1645..1814 /number=4 intron 1815..2126 /number=4 exon 2127..2251 /number=5 intron 2252..2341 /number=5 exon 2342..2469 /number=6 intron 2470..2719 /number=6 exon 
2720..2822 /number=7 intron 2823..2937 /number=7 exon 2938..3356 /number=8 polyA_signal 3351..3356 BASE COUNT 566 a 1290 c 1107 g 674 t ORIGIN 1 agccgctcct cctctgagaa gctccggacc cgagaggaca ccgacactgc gcagcgccga 61 gcccgcgcgc agcccggacg cctcagccag ggccgaccgc gcagaggaag ctcccagagc 121 ccgtttcaag accgcagcca acagcctcag gcgcacacgg cggcctcgga gcgagcacgc 181 gcagcaacgc ccctcgcccc ggcccgcccc cggccccgcc ccgcaagggt cacaggtcac 241 ggggcggggc cgaggcggaa gcgcccgcag cccggtaccg gctcctcctg ggctccctct 301 agcgccttcc ccccggcccg actgcctggt cagcgccaag tgacttacgc ccccgaccct 361 gagcccggac cgctaggcga ggaggatcag atctccgctc gagaatctga aggtgccctg 421 gtcctggagg agttccgtcc cagccctgcg gtctcccggt actgctcgcc ccggccctct 481 ggagcttcag gaggcggccg tcagggtcgg ggagtatttg ggtccggggt ctcagggaag 541 ggcggcgcct gggtctgcgg tatcggaaag agcctgctgg agccaagtag ccctccctct 601 cttgggacag acccctcggt cccatgtcca tgggggcacc gcggtccctc ctcctggccc 661 tggctgctgg cctggccgtt gcccgtccgc ccaacatcgt gctgatcttt gccgacgacc 721 tcggctatgg ggacctgggc tgctatgggc accccagctc taccactccc aacctggacc 781 agctggcggc gggagggctg cggttcacag acttctacgt gcctgtgtct ctgtgcacac 841 cctctaggta aagagggggc cgcgcctctt ccccgccccg accctccatc cctttcctcc 901 caatggattg caggggggcg ggaaaaacgt ctgtctctct ctctagggaa ggccacattt 961 ctgtctgtct cagggactct gtgacttgtc ccgcagggcc gccctcctga ccggccggct 1021 cccggttcgg atgggcatgt accctggcgt cctggtgccc agctcccggg ggggcctgcc 1081 cctggaggag gtgaccgtgg ccgaagtcct ggctgcccga ggctacctca caggaatggc 1141 cggcaagtgg caccttgggg tggggcctga gggggccttc ctgccccccc atcagggctt 1201 ccatcgattt ctaggcatcc cgtactccca cgaccaggta ggaaccaccc gggccctcag 1261 ccaccctccc acctcccaaa gtcccccagc cccttgactg tcccgcagcc ccacctgcca 1321 gcccagccct cacggcagct gcccgcctca gggcccctgc cagaacctga cctgcttccc 1381 gccggccact ccttgcgacg gtggctgtga ccagggcctg gtccccatcc cactgttggc 1441 caacctgtcc gtggaggcgc agcccccctg gctgcccgga ctagaggccc gctacatggc 1501 tttcgcccat gacctcatgg ccgacgccca gcgccaggat cgccccttct tcctgtacta 1561 tgcctctcac gtaagtgatc 
ttggcccaac cccctggctg cccgttgacc cctacccagt 1621 gctaactcca gtctttgccc ccagcacacc cactaccctc agttcagtgg gcagagcttt 1681 gcagagcgtt caggccgcgg gccatttggg gactccctga tggagctgga tgcagctgtg 1741 gggaccctga tgacagccat aggggacctg gggctgcttg aagagacgct ggtcatcttc 1801 actgcagaca atgggtatgc cagcagggca gctgggtgct ccggccctgt cacgggccag 1861 ggcctggagg ccttgcagtt cagctgcttg ccaagaacat agtgggtgag ggggtgccag 1921 gagatgctgg ccacgttgca ggggcccaag gtgtagtcag gagacacagg tgcacagaga 1981 gctggtcttg gtaggcctgg gaggtgccgg gctcatgctg ggcacctccg ggcaagcttt 2041 gtgacttaga ggtgtggggc cactggtcac cctcggtggc tcagaggctg tggctccatg 2101 gctcatgagc gcctcctgtg tcccagacct gagaccatgc gtatgtcccg aggcggctgc 2161 tccggtctct tgcggtgtgg aaagggaacg acctacgagg gcggtgtccg agagcctgcc 2221 ttggccttct ggccaggtca tatcgctccc ggtcagtccg caggccctct ccttggaacc 2281 ctggccccac caccccaacc ttgatggcga actgagtgac tgaccagcct cctgccccca 2341 ggcgtgaccc acgagctggc cagctccctg gacctgctgc ctaccctggc agccctggct 2401 ggggccccac tgcccaatgt caccttggat ggctttgacc tcagccccct gctgctgggc 2461 acaggcaagg tagggccggt gacccctgat cccagatcct tggcccctgt cctggccttc 2521 ccctggggtg agtgtggcag tgctgagagt ctgtgcctca gtgcctcctg cactgagtgg 2581 catccaagtg gcgccacctc tcaggttcct gggtgggcaa gaagcggtgc acgtccaggg 2641 cctcccacca gggctggcag cccaggtatg tgcagtgctt gggcctgccc cgccccgtga 2701 cccctgactc tgcccccaga gccctcggca gtctctcttc ttctacccgt cctacccaga 2761 cgaggtccgt ggggtttttg ctgtgcggac tggaaagtac aaggctcact tcttcaccca 2821 gggtaacccc tccccgtgga tccctccccc cgaacctgct gacccctccc cggagcccta 2881 gatccctggc ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct 2941 ctgcccacag tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc 3001 atgagccccc gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg 3061 ggggtgtggc cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca 3121 aggcccagtt agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc 3181 ccgccctgca gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc 3241 cagatcccca tgcctgaggg cccctcggct 
ggcctgggca tgtgatggct cctcactggg 3301 agttgtgggg gaggctcagg tgtctggagg gggtttgtgc ctgataacgt aataacacca 3361 gtggagactt gcagctgtga caattcgacc aatcctgggg taatgctgtg tgctggtgcc 3421 ggtcccctgt ggtacgaatg aggaaactga ggtgcagaga ggttcaggac ttgtacaaga 3481 tcacccagcc agaaagaggt tgggctggga tttgaaccct ggtgtcgtgg ctctggaagc 3541 tgccctggcg ctccttggtg atctgcgtgg gtctgtgcac acaggcacac gtcagccaca 3601 aggcacatgg acgagcgcac gtgcttgagt gcaggac // LOCUS HSASML 4311 bp DNA PRI 04-OCT-1994 DEFINITION H.sapiens genes for acid sphingomyelinase ASM. ACCESSION X63600 NID g556808 KEYWORDS acid sphingomyelinase; ASM gene; sphingomyelin phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4311) AUTHORS Hofmann,K. TITLE Direct Submission JOURNAL Submitted (27-DEC-1991) K. Hofmann, Institut fuer Biochemie, Universitaet Koeln, Joseph Stelzmann Str. 52, 5000 Koeln, FRG REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 4311) AUTHORS Newrzella,D. and Stoffel,W. TITLE Molecular cloning of the acid sphingomyelinase of the mouse and the organization and complete nucleotide sequence of the gene JOURNAL Biol. Chem. Hoppe-Seyler 373 (12), 1233-1238 (1992) MEDLINE 93183402 REFERENCE 3 (bases 1 to 4311) AUTHORS Hofmann,K. TITLE Direct Submission JOURNAL Submitted (04-OCT-1994) K. Hofmann, Institut fuer Biochemie, Universitaet Koeln, Joseph Stelzmann Str. 
52, 5000 Koeln, FRG FEATURES Location/Qualifiers source 1..4311 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="genomic" /clone="CTG3" protein_bind 70..79 /note="putative" /bound_moiety="Sp1" mRNA join(<230..541,1008..1780,2833..3004,3234..3310, 3513..3658,3815..>4224) /gene="ASM" gene 230..4224 /gene="ASM" exon <230..541 /gene="ASM" /number=1 CDS join(230..541,1008..1780,2833..3004,3234..3310,3513..3658, 3815..4224) /gene="ASM" /EC_number="3.1.4.12" /codon_start=1 /product="acid sphingomyelinase" /db_xref="PID:g556809" /db_xref="SWISS-PROT:P17405" /translation="MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLVLALALALA LALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPICKGLFTAINLG LKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFEDDMVEVWRRSVLSPSEACGLL LGSTCGHWDIFSSWNISLPTVPKPPPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTD PDCADPLCCRRGSGLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWT GDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHESTPVNSFPPPFIEG NHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYALSPYPGLRLISLNMNFCSRENFWL LINSTDPAGQLQWLVGELQAAEDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTL AAQFFGHTHVDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSRSS HVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAWHNLVYRMRGDMQL FQTFWFLYHKGHPPSEPCGTPCRLATLCAQLSARADSPALCRHLMPDGSLPEAQSLWP RPLFC" intron 542..1007 /gene="ASM" /number=1 exon 1008..1780 /gene="ASM" /number=2 intron 1781..2832 /gene="ASM" /number=2 repeat_region 1970..2104 /note="first half" /rpt_family="Alu" repeat_region 2133..2407 /rpt_family="Alu" exon 2833..3004 /gene="ASM" /number=3 intron 3005..3233 /gene="ASM" /number=3 exon 3234..3310 /gene="ASM" /number=4 intron 3311..3512 /gene="ASM" /number=4 exon 3513..3658 /gene="ASM" /number=5 intron 3659..3814 /gene="ASM" /number=5 exon 3815..>4224 /gene="ASM" /number=6 BASE COUNT 844 a 1291 c 1159 g 1017 t ORIGIN 1 cctccgcctc cgcagcgttg acagccgccc gccaccgaga gatcagctgt cagagatcag 61 aggaagagga aggggcggag ctgctttgcg gccggccgcg gagcagtcag ccgactacag 121 agaagggtaa tcgggtgtcc ccggcgccgc ccggggccct gagggctggc tagggtccag 181 gccggggggg acgggacaga 
cgaaccagcc ccgtgtagga agcgcgacaa tgccccgcta 241 cggagcgtca ctccgccaga gctgccccag gtccggccgg gagcagggac aagacgggac 301 cgccggagcc cccggactcc tttggatggg cctggtgctg gcgctggcgc tggcgctggc 361 gctggctctg tctgactctc gggttctctg ggctccggca gaggctcacc ctctttctcc 421 ccaaggccat cctgccaggt tacatcgcat agtgccccgg ctccgagatg tctttgggtg 481 ggggaacctc acctgcccaa tctgcaaagg tctattcacc gccatcaacc tcgggctgaa 541 ggtgagcact gaaggggctg cagtggagga ggccgaaagg agtgctgggg ctgggggctg 601 gggctgatgc tggtgcgcct gggctcagaa tgcatccctg atggagaggg tggcatctac 661 aatccatcac tgagtttgct cccctttggg acacccatgg ctacatgcca ccatcacccc 721 attgtgacct ttgtgaagta agaaaataat gcagacagtg cctgaggaag tcagcttgcc 781 aagcaaaggc ctcatgccac aggccggctg agctaaagag aagcgatggc ctggtgctgc 841 ctgagttaca ggcaatatct ggaaggcaaa ggtgtgcact gagcttggtg cactgagtcc 901 tgcccagccc cagtttggaa atggaggccc aaggggtggt ggccaggggt tggcctggtt 961 cctctgctct gcctctgatt tctcaccatg cgctcctccc actgcagaag gaacccaatg 1021 tggctcgcgt gggctccgtg gccatcaagc tgtgcaatct gctgaagata gcaccacctg 1081 ccgtgtgcca atccattgtc cacctctttg aggatgacat ggtggaggtg tggagacgct 1141 cagtgctgag cccatctgag gcctgtggcc tgctcctggg ctccacctgt gggcactggg 1201 acattttctc atcttggaac atctctttgc ctactgtgcc gaagccgccc cccaaacccc 1261 ctagcccccc agccccaggt gcccctgtca gccgcatcct cttcctcact gacctgcact 1321 gggatcatga ctacctggag ggcacggacc ctgactgtgc agacccactg tgctgccgcc 1381 ggggttctgg cctgccgccc gcatcccggc caggtgccgg atactggggc gaatacagca 1441 agtgtgacct gcccctgagg accctggaga gcctgttgag tgggctgggc ccagccggcc 1501 cttttgatat ggtgtactgg acaggagaca tccccgcaca tgatgtctgg caccagactc 1561 gtcaggacca actgcgggcc ctgaccaccg tcacagcact tgtgaggaag ttcctggggc 1621 cagtgccagt gtaccctgct gtgggtaacc atgaaagcac acctgtcaat agcttccctc 1681 cccccttcat tgagggcaac cactcctccc gctggctcta tgaagcgatg gccaaggctt 1741 gggagccctg gctgcctgcc gaagccctgc gcaccctcag gtacttatcg tccgtggaaa 1801 cccaggaagg gaaaagaaag gtgaatgaaa gtgaagggag aagggaacct ggggcattgt 1861 ctctgattgc tctagcatga gtccttagtg ctcttcattt 
ggctccccta atctgactcc 1921 tccttccctt tctactgttt tgccgcacca ggcttttttt tttttttttt tttagtttag 1981 tttttgtaga gacaagatct tgctatgttg cccaggctgg tctcaaacac ctaacctcaa 2041 gcaatcctcc cgcctcggcc tcccaaaatg ctgggaccac aggcatcagc tactgctcct 2101 ggccctccct tttttttttt tttttttttt ttgagatgga atcttgctct gttgcccagg 2161 ctggagtgca gtggcaccat ctcagctcac tacagcctcc acctcctggg ttcaagcaat 2221 tctgcctcag cctcccaagt acctgggact acaggtgcac gccaccacac ccagctaatt 2281 tttgtatttt tagtagagat ggggtttcac catgttggcc aagatggtct tgatctcctg 2341 acctcatgat ctgcccacct cggcctccca aagtgctggg attacaggca tgaaccactg 2401 cacccagctt tccagccctc cctttctact cttatctcca gccaccctcc ttcaaaggtc 2461 tggcagcata acctctctat gccccagctg tgtctttgct catattggcc ctctggaaat 2521 gatttccccc ttttttttaa gtgctccagt ttttcccacc ttatccatcc catgtcatct 2581 tccctctgtg tggtccttgc ttcccattct agctaactct tatccctccc ccatactcct 2641 ggagccctct gccctcagag tcttttgtgt cacacagacc caataattag aactgtttgg 2701 tctctggcta gactgtgagc tccttgcagg tggggaagat gtcatgtatg cttttaccct 2761 ccacccaaat gcccagcaca ggaggaccag gattggaaca agtgttgacc tctcatgttt 2821 actttgtttc agaattgggg ggttctatgc tctttcccca taccccggtc tccgcctcat 2881 ctctctcaat atgaattttt gttcccgtga gaacttctgg ctcttgatca actccacgga 2941 tcccgcagga cagctccagt ggctggtggg ggagcttcag gctgctgagg atcgaggaga 3001 caaagtgagg gccagtagtg ggaacacggt ggtgctgggg gacaagcagg ctcctgttga 3061 gctggagcac ctctgggcac agaagtttta ttttcctggc attcccaaca agtgttccct 3121 ggggattcag ctcatggtca ctgttgaaag ccttcattca gtcccccttt ctctagccag 3181 ggctgcctgg acccctggat gccctgatta ccatccttaa ttctccctac taggtgcata 3241 taattggcca cattccccca gggcactgtc tgaagagctg gagctggaat tattaccgaa 3301 ttgtagccag gtaggacgga gatgagggtg ggaataggga cagggtgagt gtctgaaggc 3361 tgaaaattcc cttgagcatc tcaccatccc tgttgtccca tggagtgggg aggctcctca 3421 ctagaacagg ttggagaaag agggcatcct atctccccag atgtcttcct acccctccct 3481 agaatcttct gaatgtagta ccttctggcc aggtatgaga acaccctggc tgctcagttc 3541 tttggccaca ctcatgtgga tgaatttgag gtcttctatg atgaagagac 
tctgagccgg 3601 ccgctggctg tagccttcct ggcacccagt gcaactacct acatcggcct taatcctggt 3661 gagtgaggca gaagggagcc tcccttatcc tggagttggt gggatagggg aaggaggttg 3721 gagccagagc ctgcaaagca tgggcaggat gtgtggcccc tccctggagt tacccttgct 3781 ccttgcccct ccagtcagcc ccacatcctt gcaggttacc gtgtgtacca aatagatgga 3841 aactactcca ggagctctca cgtggtcctg gaccatgaga cctacatcct gaatctgacc 3901 caggcaaaca taccgggagc cataccgcac tggcagcttc tctacagggc tcgagaaacc 3961 tatgggctgc ccaacacact gcctaccgcc tggcacaacc tggtatatcg catgcggggc 4021 gacatgcaac ttttccagac cttctggttt ctctaccata agggccaccc accctcggag 4081 ccctgtggca cgccctgccg tctggctact ctttgtgccc agctctctgc ccgtgctgac 4141 agccctgctc tgtgccgcca cctgatgcca gatgggagcc tcccagaggc ccagagcctg 4201 tggccaaggc cactgttttg ctagggcccc agggcccaca tttgggaaag ttcttgatgt 4261 aggaaagggt gaaaaagccc aaatgctgct gtggttcaac caggcaagat c // LOCUS HSAT3 14206 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens gene for antithrombin III. ACCESSION X68793 S52236 S52240 NID g28906 KEYWORDS antithrombin; antithrombin III gene; AT3 gene; plasma protein; serine proteinase inhibitor; serpin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14206) AUTHORS Olds,R. TITLE Direct Submission JOURNAL Submitted (12-OCT-1992) R. Olds, Institute of Molecular Medicine, John Radcliffe Hospital, Oxford OX3 9DU, UK REFERENCE 2 (bases 1 to 14206) AUTHORS Olds,R.J., Lane,D.A., Chowdhury,V., De Stefano,V., Leone,G. and Thein,S.L. JOURNAL Biochemistry In press REFERENCE 3 (bases 1 to 14206) AUTHORS Bock,S.C., Marrinan,J.A. and Radziejewska,E. 
TITLE Antithrombin III Utah: proline-407 to leucine mutation in a highly conserved region near the inhibitor reactive site JOURNAL Biochemistry 27 (16), 6171-6178 (1988) MEDLINE 89050967 REMARK Erratum:[Biochemistry 1989 Apr 18;28(8):3628] REFERENCE 4 (bases 1 to 14206) AUTHORS Olds,R.J., Lane,D.A., Ireland,H., Leone,G., De Stefano,V., Wiesel,M.L., Cazenave,J.P. and Thein,S.L. TITLE Novel point mutations leading to type 1 antithrombin deficiency and thrombosis JOURNAL Br. J. Haematol. 78 (3), 408-413 (1991) MEDLINE 91337920 COMMENT related sequences: M21636-M21645. FEATURES Location/Qualifiers source 1..14206 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="q23-25" allele 152..259 /note="polymorphism" /citation=[3] exon <605..645 /gene="AT3" /number=1 mRNA join(<605..645,2944..3310,5843..6058,6964..7101, 7912..8302,10336..10400,13775..>13951) /gene="AT3" gene 605..13951 /gene="AT3" CDS join(605..645,2944..3310,5843..6058,6964..7101,7912..8302, 10336..10400,13775..13951) /gene="AT3" /codon_start=1 /product="antithrombin" /db_xref="PID:g28907" /db_xref="SWISS-PROT:P01008" /translation="MYSNVIGTVTSGKRKVYLLSLLLIGFWDCVTCHGSPVDICTAKP RDIPMNPMCIYRSPEKKATEDEGSEQKIPEATNRRVWELSKANSRFATTFYQHLADSK NDNDNIFLSPLSISTAFAMTKLGACNDTLQQLMEVFKFDTISEKTSDQIHFFFAKLNC RLYRKANKSSKLVSANRLFGDKSLTFNETYQDISELVYGAKLQPLDFKENAEQSRAAI NKWVSNKTEGRITDVIPSEAINELTVLVLVNTIYFKGLWKSKFSPENTRKELFYKADG ESCSASMMYQEGKFRYRRVAEGTQVLELPFKGDDITMVLILPKPEKSLAKVEKELTPE VLQEWLDELEEMMLVVHMPRFRIEDGFSLKEQLQDMGLVDLFSPEKSKLPGIVAEGRD DLYVSDAFHKAFLEVNEEGSEAAASTAVVIAGRSLNPNRVTFKANRPFLVFIREVPLN TIIFMGRVANPCVK" intron 646..2943 /gene="AT3" /number=1 repeat_region complement(2193..2469) /note="Alu repeat element" exon 2944..3310 /gene="AT3" /number=2 intron 3311..5842 /gene="AT3" /number=2 repeat_region complement(3784..3965) /partial /note="Alu repeat element" repeat_region complement(4007..4284) /note="Alu repeat element" exon 5843..6058 /gene="AT3" /note="A" /number=3 intron 6059..6963 /gene="AT3" 
/note="A" /number=3 exon 6964..7101 /gene="AT3" /note="B" /number=3 intron 7102..7911 /gene="AT3" /note="B" /number=3 repeat_region complement(7277..7558) /note="Alu repeat element" exon 7912..8302 /gene="AT3" /note="4" variation 8131 /gene="AT3" /note="polymorphism" /replace="a" variation 8160 /gene="AT3" /note="polymorphism; Pst1 site" /replace="a" intron 8303..10335 /gene="AT3" /number=4 variation 8521 /gene="AT3" /note="Nhe1 site; polymorphism" /replace="t" repeat_region 8670..8687 /rpt_family="att" repeat_region complement(8692..8951) /note="Alu repeat element" repeat_region complement(8994..9264) /note="Alu repeat element" repeat_region 8994..9264 /note="Alu repeat element" repeat_region complement(9362..9633) /note="Alu repeat element" repeat_region 9823..9867 /rpt_family="att" repeat_region complement(9871..10152) /note="Alu repeat element" exon 10336..10400 /gene="AT3" /number=5 intron 10401..13774 /gene="AT3" /number=5 variation 10427 /gene="AT3" /note="Dde1 site; polymorphism" /replace="c" repeat_region complement(10732..11011) /note="Alu repeat element" repeat_region complement(12124..12405) /note="Alu repeat element" exon 13775..13951 /gene="AT3" /number=6 BASE COUNT 3715 a 3338 c 3217 g 3936 t ORIGIN 1 ccacaggtgt aacattgtgt tttccttgtc tgtgccaggg acaccttggc atcagatgcc 61 tgaaggtagc agcttgtccc tctttgcctt ctctaattag atatttctct ctctctctcc 121 ctctctccat aaagaaaact atgagagagg gaattacagg tagagggcta gaagtttttg 181 gacattaact atttctatct tctgatttag ttaacgagaa acaaaaaatc ctgcagacaa 241 gtttctcctc agtcaggtat ttcctaacca agtttgaggg tatgaacata ctctcctttt 301 ccttttctat aaagctgagg agaagagtga gggagtgtgg gcaagagagg tggctcaggc 361 tttccctggg cctgattgaa ctttaaaact tctctactaa ttaaacaaca ctgggctcta 421 cactttgctt aaccctggga actggtcatc agcctttgac ctcagttccc cctcctgacc 481 agctctctgc cccaccctgt cctctggaac ctctgcgaga tttagaggaa agaaccagtt 541 ttcaggcgga ttgcctcaga tcacactatc tccacttgcc cagccctgtg gaagattagc 601 ggccatgtat tccaatgtga taggaactgt aacctctgga aaaaggtaag 
aggggtgagc 661 tttccccttg cctgccccta ctgggttttg tgacctccaa aggactcaca ggaatgacct 721 ccaacacctt tgagaagacc aggccctctc cctggtagtt acagtcaaag acctgtttgg 781 aagacgtcat ttcaagtgct ctccctccca ccccacctct tggggtaagg cctttcctaa 841 gctacccctt gggtccctag cctaagaaac aagggggatg tcatccctgg tgtaaagatg 901 ctgtgcagga agtcagcact cacgggatcc aggggacgct ccaaggggaa tccccagggc 961 ctgccatcca tccgggaaga gagcaaatgc tacccatgag gacctcctca ctcccttttt 1021 gctctttctt ccactcagat ccaccccact ccacccccac ccaaatccca gtgacctttg 1081 actaaagggc caaaactgct tccttttctc acaatgagag ttgtccctcc ctcaatgcca 1141 cacacactcc cttcttcatc tgagttgtca caggaggcta gaaacggggt ggtggcacaa 1201 ctgtcttggt tttaatttgt gcttcatagc cctcccaggt cctctcagcc tcaaattgca 1261 tttccaaatg tagttgaagg gacagagtgg gcaaccgaag cagcagtgga gatgggaaga 1321 tgaatggcag ggtcctctcc tctctctctc tgcttcttca gcctgccttc cacatctccc 1381 ttggtgccgc tgcttctctc cggctttgca cctctgttct tgaaagggct gcagaactgg 1441 actcagacca cgcaagaagg caagtccccc tcagctgccc cagcttccag ccagccccag 1501 gcttgcccaa cggaccacgt ccgtgaatct gcactgggtg cctgtctttc tctcccagga 1561 gaagatggga agatccagta cccacacaca gacccccttg tgtacacgca ggaaccataa 1621 accagctgga ggcagcccct gccccaccct gtcttatcta caaaaaatat tacaagagac 1681 tttatctctt gatttgcttc atcgagtgtc ccaactacct cattttttta aaatgtgaaa 1741 ttagcttcat ttaccttcat tgaatccatg ttggcgacta ttaaaaattc caggcaataa 1801 aaagggatga gagcctgaac taaagcagtg gcaataactg gtgaaagagt aaaaaaacag 1861 aactgattga ctctggggtg aactgattga ctctggggtt tgactaaatg aggaggagag 1921 agggaggaat ccagggtgat tctcaggttt ctgtacggga ttcactgagc ccactcacag 1981 gagcaggcct gtgggggaga attaattacc agttcagttt ggtcctgttt ccctgaagaa 2041 cttgtaggag ttcctggtgg aactgtccag caaatagtca gtctggagct cagtggaagg 2101 gttagggctg gagctagaga tgtaggaatc ttcagcacac agatattgcc attgtttttg 2161 tttgtttgtt tgtttgttgt tgctgttttg aagacacagt ctcactttgt cacccaggtt 2221 ggaggtcagt ggcacaatct cagctcactg caaccttcgc ctcctgggtt caagtgattc 2281 ttctgcctca gcctccctag tagcttggga ctacagtgtg cgccaccaca cccagctaat 2341 
ttttgtattt ttagtagaga cagggtttca ccatgttgtc caggctgatc tcgaacaccc 2401 aacctcaagt gatctgcatg cctcagcctc caaagtgctg ggattacagc gtgagccacg 2461 cacccggcca gatattgcct ttgctccatc catttcttct ttttctcttg tgttgctgaa 2521 atctctctgc ctgcatctat cagagtcctt ccccaaacag tttctgtaga tggctccccc 2581 taccaccctg actcttcact gggcactaaa gccgattttt taggcatgca cattccatgt 2641 cacaaacagg aagcttctca ttcttttttc tcccagcgtg gggaattgag cacataatac 2701 tccaaataac catcagatga ttctaattcc aacatgacca cgtccaggca actgaactgt 2761 cccctggcaa gaagtctagg actgaacctg tcccgggccc ctgtacttgg ttcaaaggat 2821 ttagcctttc tcttggccac accaggtggg ctggaatcct ctgctttact ggggcaaccc 2881 tgtggtgggc agtggggcta ggggttgcag cctagcttaa cttggcattt tgtctccttg 2941 caggaaggtt tatcttttgt ccttgctgct cattggcttc tgggactgcg tgacctgtca 3001 cgggagccct gtggacatct gcacagccaa gccgcgggac attcccatga atcccatgtg 3061 catttaccgc tccccggaga agaaggcaac tgaggatgag ggctcagaac agaagatccc 3121 ggaggccacc aaccggcgtg tctgggaact gtccaaggcc aattcccgct ttgctaccac 3181 tttctatcag cacctggcag attccaagaa tgacaatgat aacattttcc tgtcacccct 3241 gagtatctcc acggcttttg ctatgaccaa gctgggtgcc tgtaatgaca ccctccagca 3301 actgatggag gtacgaccaa aggtcttctg cccagccacc ttgttaggag cacctttggg 3361 cttccatagg cccaagtcca atgattcctc aaccaacact gcagccacta ggggcgctca 3421 ttatgcatta cgattccctt tgaacatcac tgtgttataa ttccctttga aaatcatttt 3481 ttaaaaaatt agccaaggaa tcttggctat ctacttttta aatcctggtt tcctcttttg 3541 agcaccttaa aatgggggaa ggcttgtatc ttctctcaac ttcttttcag taattctttc 3601 atctatatgt ttactcatta atttgatcat ttatttattt attcattcag cacttcctct 3661 gtgccaggca atgtgtagtg ccagtccctc ctctggtgga agaagagtag ctttaccata 3721 tggtgacatc aggcatatag gctctcgtgg aaaaaaattc taggatagta tttttttttt 3781 tttgagatgg aatctcgctc tattgcccag gctggagtgc agtggtgcag tctcggctca 3841 ctccaaactc tgcctcccag gttcaagcaa ttctcccacc tcagcctcct gagtagatgg 3901 gattacaggc acacgccatc acgcccagct aatttctata tttttagtag agatggggtt 3961 tcaccacgtg gccagactgg tctcaaactt tttttttttt ttttttgaga cggagtctcg 4021 ctctgtcatc 
caggctggag cgcagtgcac gatctcagct cactgcaacc tctacctccc 4081 gggttcaagc aattctcagc ctcagcctcc cgagtagctg ggattacagg cccccggcac 4141 catgcctggc taattttttt tcttcttagt agagatgggg tttcaccatg ttggccaggc 4201 tagtcttgaa ctcctgacct cgtgatccac ctgccttggc ctcccaaagt gctgtgatta 4261 caggcgtaac gaccgcgcct ggcctcaaac tcttgacttc aagggatcgg cctgacttgg 4321 cttcccaaag tgctgggatt acaagcatga gccactgcac ggggcctagg atggtatatt 4381 gagaccaggg gcccaggaaa gccaagagaa gcctcaagga cgtgagagtg tttctggctc 4441 tgggaagtat ggatcatttc agctcagtga cttagttccc acccccttcc ccccactgcc 4501 ttttgtggga gggaagtagg gcatgataag atgaaatgtc atagattgat tgatcactgt 4561 tggcctctgg ggctatgaca agtcatggat ggaaacacta gatctttaat ctgtccttgg 4621 cttggctgca tgacagtctt tcttcaagtt ggatcacact ttggaagcag agttcatcaa 4681 tagggaggca tgagtccctt caagatggta tacggtgctt atttgaaact tggacactaa 4741 agtctgtggg tcttaggagg gttccttcta ttctagtggt caatttccat ggaacttcat 4801 cacctttgct cagggctctg gggtgagtta acccaagtct tcactctttg aaagaaattg 4861 tagatttaaa aactctgaag acacataata ctgccttctc tgggcccttc agtcattttt 4921 gtatacattg gtactggtct caaagtactt ccatatcact catgtctctg tccccaggta 4981 agatcttaaa tgtaaccctt cctacaagaa gaacagaaca gaacactgct tccaaaccac 5041 acatgttcct ttggtcctcc cctctacaca aacgccatgt gttgggaaag cagggtgaga 5101 ctaaatctct ctggagaaaa gagaaattca gcaccaagct tttgatcaaa agcataatcc 5161 ccccctaaaa aaagtgccta ttggagcaaa atcaggaaaa ccaaaggcag agaacagata 5221 aaaccaaaag gccttttgta gcctgaggag agagcatgga aagggcagga ggggaacagc 5281 ctcacccatt ttgccttggg gatggtgaag gtgggcattg ggggattcca acttcaaagc 5341 atggatgact tctaagtcct tttcagccct gagctcttag attctgagcc tgtttaatcc 5401 cttgctgata gattcactct tcctttttca cccctaccac cagtatccca gagcctccat 5461 gagcagctgg ccccagtaga tgccacaaaa gtgtttgtta cgagaaggac accgtctgat 5521 tctcttctct gtccagaatc accaagagga cttttcccat tccagcaaga aaacgtctgt 5581 gtgttgatct agaggcgttt agagacttta ggtggcaacc tagtctctct ttttcccttt 5641 atccttccta cccttcattc ttcttttatc cttttattca tcagaacaca agagttgagc 5701 atttatgctg tcccaggtac 
tgtgcttgaa ggagttaaca actgaggtgg ctattagtca 5761 gagactgacc agcatgtgct caccacccat gttaactagg cagcccacca aacccaccac 5821 catttttttt tgacttctat aggtatttaa gtttgacacc atatctgaga aaacatctga 5881 tcagatccac ttcttctttg ccaaactgaa ctgccgactc tatcgaaaag ccaacaaatc 5941 ctccaagtta gtatcagcca atcgcctttt tggagacaaa tcccttacct tcaatgagac 6001 ctaccaggac atcagtgagt tggtatatgg agccaagctc cagcccctgg acttcaaggt 6061 gagttgcaga tgttacccct gacctccgag ttcttcctct ccactcagag attgaggagg 6121 tggagaaaca gcatccaaat tcacactgct ttgctgctga agactgctgg agggctgact 6181 aaaagttaga acccctgcaa tagttattct tacttgaaac ctgagaaatc aaaggtatcc 6241 atgcttggat tgtagtgact gcccagaaaa catgaattaa taatcaattc ttcattccat 6301 ccaccaactt caaatatata ccaaagggtg ttttgaagat gccagttcta caagatatct 6361 tacttaattt gaactgttat catggtcaaa taaagttggt acatgatgca tgttacattc 6421 tcctcttgga gattcatgaa gcacatgggc ctatgaaggt tctgagaaac tctgcaacaa 6481 agaaatctgt tggctttatt caatcggcat tcctcaaatg tatttgactg catgggcatt 6541 tctctcctcc atataacctg ccaaccccat ataacctgcc aaccccatat aacctgcaat 6601 cattcattgc ttcccctggc acatgccttg gaaattctac ttttgtgagt taaggttttc 6661 caaagtcaga gaaaataata ttttatcttc tttttcccag actattttcg ccttccttct 6721 tttcatttat ttcttcctat ttctttttgt ctttttcttc tgataatatt tattaactac 6781 aggaaagatt catggaacta tattagatat gtgaggcttc ccaatttggg ttagagcaat 6841 ggcttcttaa tcaaatggtg ggaaaggaca gagggatggt gagaaaaata aaatgctgcc 6901 tgggaaaatg gagaagccaa ttgaatagca caggtgagta ggtttatttt ctgttctcct 6961 caggaaaatg cagagcaatc cagagcggcc atcaacaaat gggtgtccaa taagaccgaa 7021 ggccgaatca ccgatgtcat tccctcggaa gccatcaatg agctcactgt tctggtgctg 7081 gttaacacca tttacttcaa ggtactcaga atggccctgg agagacccca gggacttcct 7141 cttgctcttc agcttacccc cttttttttt aaatggcgag accgaagccc tgagagggca 7201 aatggactgc cgaaagctac acaggtacag gtcagcaggg caggtcaatc tattatttat 7261 ttatttattt atttttgaca gagtctcgct ctgtcgccca ggctggagtg cagtggcgtg 7321 atctcggctc actgcaagct tcgcctcctg ggttctcggc attctcctgc ctcagcctcc 7381 caagtagctg ggaatacagg cacccaccac 
catgcctggc taattttttg gttttttttt 7441 agtaaagacg gggtttcacc gtgttagcca ggatagtctt gatctcctga cctcgtgatc 7501 tgcccacctc ggcctcccaa agtgctggga ttacaggcat gagccaccgc gcccggcaga 7561 ttggcttctt tcacctagta aaatgcattt actgttcctt tgtgttttcg tggctttgtc 7621 actcatttct tcttagcatg gaatagcatt acatttggtc tggatgtacc acagtctgtc 7681 tattcatcta ctgaaggaca ttttggctgc ttccaaggtt tgacagttat gaataaacct 7741 actcataatt ccatcattct gacacagcca ttgttaacct ttttgtgcat atcccgccag 7801 tcttttttcc gaataattat atattaatgt aacactataa tatggatatg tctgtgtcaa 7861 taactatcct cctatgaatg tttgtgttct tactttgtga ttctcttcca gggcctgtgg 7921 aagtcaaagt tcagccctga gaacacaagg aaggaactgt tctacaaggc tgatggagag 7981 tcgtgttcag catctatgat gtaccaggaa ggcaagttcc gttatcggcg cgtggctgaa 8041 ggcacccagg tgcttgagtt gcccttcaaa ggtgatgaca tcaccatggt cctcatcttg 8101 cccaagcctg agaagagcct ggccaaggtg gagaaggaac tcaccccaga ggtgctgcag 8161 gagtggctgg atgaattgga ggagatgatg ctggtggtcc acatgccccg cttccgcatt 8221 gaggacggct tcagtttgaa ggagcagctg caagacatgg gccttgtcga tctgttcagc 8281 cctgaaaagt ccaaactccc aggtttgtct aggaaggagt ttcctccctt ctccacccgc 8341 aaggtagtct gaccaaaagt ggaagagttg gagaaagaat agaaaggagc aacaagtcag 8401 gactcctgga tactgatcct agtttctact gctaatttgt ggaaatctct tttccttttg 8461 agacctcagt ttcctcttct gtaaaaggga agtttgttct tggatctcca tgggcccagc 8521 cagcactggt gccctgtgag tctgtatcag gtagaggaga tgggaccagg tggagaggaa 8581 tttgaaaggg cattggaatt cagagcaaag agacagatat taagagctgg ggaaatgtgg 8641 ttcccattac acaggcctca ctgacattta ttattattat tattattact tgagacagag 8701 tcttactctg ttgcccaggc tggagtgcag cggtgcgatc tcggctcact gcaacctctg 8761 cctcccgggt tcaagcgatt ctcatgcctc agcctcctga gtagctggga ttacaggcac 8821 acgtcaccat acctggtaat ttctgtattc ttagtagaga tgggtttcac catgttggcc 8881 aggatggtct tgaactcttg accttgtgat ccgcctgcct tggcttccca aagtgctggg 8941 attacaggcg tcacgaccgc acctggcaca ttaaaatatc ttttaaagaa gttggctggc 9001 cagggtggct cacgcctgta ataccagcac tttgggaggc tgaggtggga ggatcgtcta 9061 agcccatgag ttcgagacca gcctggacaa catagtgaga 
tggtctctac aaaaaataaa 9121 aaaaattagc caggcatggt gacgcacacc tgtagtccta gcttcttggg aggcagagct 9181 gggaggattg cctgagtccg ggaggtcaac gctgtggtgt actgtgatca caccactgca 9241 ctccagcctg agcaacagag tgaggtccta tcactaaata aataaataaa taataaaata 9301 gtttacgatg ttaagtaatt agatttatct ttattgacct tttttttttt tttttttttt 9361 tgagacgaag tcttgctctt gtcccccagg ctggagtgca gtggtgcaat cttggctcac 9421 tgcaacctcc accccccaga ttcaagtgat tctcctgcct cagcctccca aggagctagg 9481 attacaggcg cctgccacca cgcccggcta atttttgcat ttttagtaga aacggggttt 9541 cactatgttg gccaggctgg tcttgaactc ctgacctcag gcgatctacc tgccttggcc 9601 tcccaaagtg ctgggattac aggcgtgagc cactgtgcta ttgggctgtc tttaagctag 9661 ttttgaaaac taaaaatgtt gccagactgg aaagaaagat gttccttctg gatggagtga 9721 gttttttctg taagaacaga gtcttgccgt tctctctcca caaaaagctg aagcctgaga 9781 atgaattatc aggagccatg ctgaacaagc ccaaagtact ttattattat tattattatt 9841 attattatta ttattattat tattattttt gagatgcagt tttgctcttg ttgcccaggc 9901 tggagtgcag tggcgtgatc ttggctcact gcaacctcca cctcccgagt tcaagcgatc 9961 tcctgcctca gtcttccaag tagctgggat tacacgatgc gccaccacac ctggctaatt 10021 tttgtatttt cacgatagag acaaggtttc accatattag ttagagtgtc tccaactcct 10081 gacctcaggt gatctgtaca ccttggcctc ccgaagtgct gggattacag gtgtgagcca 10141 ctgcacccag cccccaaagt actttattat ttttaacaca tattcattgt gagagtatga 10201 ttaggtgaag atttaggatt tcttcttatg tttcaaaaag ccccaaagga tctcttaatc 10261 caaactgaat tcccatctgt ggattgaagc caactttctc ccatctcaca aagacttctc 10321 cggtcttcct tccaggtatt gttgcagaag gccgagatga cctctatgtc tcagatgcat 10381 tccataaggc atttcttgag gtgagtacac cttccccact ctcttagggt acagaaagga 10441 gatgcatgaa cagcaggaac acgtggaaaa ggcctgtttc cagtgttaag gcatgcaaaa 10501 ggcctccaca ggctgctata atacagccct ctccaaaacc ttcatggtgt gattgttctg 10561 ccttccctcc cactacctct tctgtagcag gtcaagcggg aacacaaaca tttagggagg 10621 gtgatatagg aaaagaagcc agcaaaggcc atcaagaaga aatttacagc atgaggagaa 10681 ccagaagagt atggggtcgc agaaacccag ggagaatttt tttttttttt tgagacagag 10741 cttcgttcgc tcgttgccca ggctagagtg 
caatggtgcg acctcactac aacttctgcc 10801 tcccgcgttc aagcgattct cctgcctcag cctcctgggt agctgggatg acaggcatgt 10861 gccatcacgc ccggctaatt tttgtatttt tagtagaaac agggtttctc catgttggtc 10921 aggcaggtct tgaactcccg atctcaggtg atccacctcc ttggctcccc caaagtgctg 10981 ggattacagg catgagccac tgcacccggc catacctagg gagaagtttt aagaaaatgg 11041 atagcatcta gtaagaagac tcctgggctg ggcatggtgg ttcacacctg taaccccagc 11101 accttgggag gctgagatgg aagatcactt tgagcccagg agtttgcaac cagccctgag 11161 caacatagtg agaccctgtc tctaccaaaa aaatcttaaa aaaaaaaaaa aaagtttgga 11221 gactgcccat agtttacctt tccctgagga cagaatagtg tggccacatg cctaattgta 11281 atggatgaag agcaaatgga aggtaagaaa gggaagctgg tgagtgtgca tcagtgtctt 11341 aaagtgtgct ccaactagag cactagacta cactggagga aacgaaaagg tggtcaaata 11401 aatgcatatc ctctcatggg agatgaacag tacacactga catgctgagg tctgacaagt 11461 cccacagtaa agaagacggt tgaatatcac ttaacgtgtt cccccaaatg agatgtgcat 11521 ggaaccctgt gttagagtaa tagcgtgtac agcctgtgga acttctggtt ctcaagtaaa 11581 cactaaacta tggaccaaca gcagtagtta tctggggagc tttatctttg gagattctgg 11641 atgggccctg agaatctctg tatgtactaa actcctcagg ggattcttat gcaaacaatg 11701 agatttggga accactggta tagactattt tttgcgggag ccaggctgtg agggatagga 11761 gattggacaa tggtagagat gttcctagaa tcaaagaaat cgaaaagaat gaaggttgta 11821 gtcaaagaga aaggtttcag aggatggtgc taagtaaata tagatccagg gatcaaactc 11881 agaggaaagt ggaattttaa cgggagcgag gaaatgtgat agcttgaaag aacccagtta 11941 taatctgaga aagatgctat taatataatt tcaaaggtag agtagttctg agatggcaag 12001 tccaaggtat agccatggac aggtttgctt aagtggaata aggcaatgct cattaggttt 12061 gatgaaagaa atgggagact agggtgttga acgggtctcc taatgagatt tttttttttc 12121 attgaaatgg agtcttgctc tctcgcctag gctggaatgc agtggtatga tctcagttca 12181 ctgcaacctc cgcctcctgg gttcaagcga tcctctcgcc tcagcctccc aagtagctgg 12241 gactacaggt gcccgccacc acgcccgact aaatttttgt atttttagta gagacggggt 12301 ttcaccatgt tggccaggct ggtctcaaac tcctgacctt aagtgatcca cctgactcgg 12361 cctcccaaag tgctgggatt acaggcatga gccacgtgcc cggcctactg agatattttt 12421 aattgcctca 
aatgatagca ggagttggag tggacagaaa ggctaagtgc aaaaatcatc 12481 agtgtgggga tataatctat aggacaatga atgtcaatga cctttaagac aatagcaaga 12541 gtagaggtat tgaggtcaga acaagggatt ttacaagagt gctgtattaa tggttttgga 12601 agttaagatg acactgctca caccctcttt cacatggatt tttggaagaa agaacactta 12661 ggaagactgc aagggaaatt gagtcctcag ggttttaact ctcattgaat atcctctggt 12721 aaggactcca gttagaagtg gtcaactcag acctccttga ggggtctgag ttactattag 12781 gaagaagcag aggtctggat tcattttatc cacctgagcc cagtacacaa tatgtaagat 12841 ttttccatgg ttcttacaac aaagccgttt tctttgaaaa ccttgggaat ccttaataaa 12901 caggacccca actaattgta gacctgagaa gccattaaaa accagaatct gatttgaata 12961 aaaggatcct ggtcatgcaa acactctagt ctgctaataa gctaataatt tagtgctgga 13021 atgagcatga aataggtaat atggggagat agcgggtaag gaagggagga acaaaggaag 13081 gggaaggaag agtgagaagg aaggagaaga catcatcaac cagctccaca aaacccaggg 13141 agccggttaa tcatgtgctt tcattaagag cagaaacaga gttttagtga tattctgggt 13201 cctgaggcaa aattttctga aggtgtttcc ctctagatcg ctatcagcca tgttcaaata 13261 ccattgtttc agtctattac tccaagaaaa tggcatctcg tccagccaga gaacccacct 13321 cttttcatag gccctaggtc ctgagtggct ctttggagta gctgtatctt ggatcttgat 13381 gctccaagag tgaaactgtt tctttcaact atggagttca gatcttgagc caaaaatctt 13441 tcagcggctg gtacaaaaaa aatccgctgt aaaaccattt acaatggtac cagccagaaa 13501 tgtgataacc tgtgtacatt cagatttctg ggttacctga atggaactct tacacttatt 13561 acctagcaca aggcttggac aaacacaagt accttacatt ctctgcatga aagaatgagt 13621 gaaagtagga ttctggaggg aatccaacct gacccaaatg tactttttac tggaaaacaa 13681 aagcatttga ggaattgctg tgtctgtgga tgatttacct gccaaaatga acggcagagt 13741 ggctaattta gttttattcc catgtgacct gcaggtaaat gaagaaggca gtgaagcagc 13801 tgcaagtacc gctgttgtga ttgctggccg ttcgctaaac cccaacaggg tgactttcaa 13861 ggccaacagg cctttcctgg tttttataag agaagttcct ctgaacacta ttatcttcat 13921 gggcagagta gccaaccctt gtgttaagta aaatgttctt attctttgca cctcttccta 13981 tttttggttt gtgaacagaa gtaaaaataa atacaaacta cttccatctc acattataaa 14041 tggactctgc atttgaaatg aagacaagga aaggggaaac atgctattgg ggcacatggt 
14101 aaaatgatgc cttcaagttg ttctttaccc agtaaccaca tctggatcaa gaaaatgagg 14161 gagagagcga taaaagatgg tagacagcca gaaagggaag ggagag // LOCUS HSATPCP1 9457 bp DNA PRI 19-JUL-1993 DEFINITION H.sapiens gene for mitochondrial ATP synthase c subunit (P1 form). ACCESSION X69907 NID g38429 KEYWORDS ATP synthase; ATP synthase c subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9457) AUTHORS Walker,J.E. TITLE Direct Submission JOURNAL Submitted (29-DEC-1992) J.E. Walker, M.R.C. Lab. of Molecular Biology, Hills Road, Cambridge CB2 2QH., UK REFERENCE 2 (bases 1 to 9457) AUTHORS Dyer,M.R. and Walker,J.E. TITLE Sequences of members of the human gene family for the c subunit of mitochondrial ATP synthase JOURNAL Biochem. J. 293 (Pt 1), 51-64 (1993) MEDLINE 93319529 FEATURES Location/Qualifiers source 1..9457 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="AT5B1" /clone_lib="lambda 2001" repeat_unit 1..224 /rpt_family="Alu" repeat_unit 273..442 /rpt_family="Alu" repeat_unit 606..904 /rpt_family="Alu" repeat_unit 1449..1768 /rpt_family="Alu" repeat_unit 1769..2111 /rpt_family="Alu" repeat_unit 2627..2945 /rpt_family="Alu" repeat_unit 2646..3233 /rpt_family="Alu" GC_signal 4041..4046 repeat_unit 4100..4792 /rpt_family="Alu" CAAT_signal 4538..4542 CAAT_signal 4641..4645 CAAT_signal 4671..4675 CAAT_signal 4697..4701 TATA_signal 4708..4713 mRNA join(4733..4842,5341..5388,6304..6381,7088..7266, 7587..7802) /gene="P1 gene for c subunit of human mitochondrial ATP synthase" prim_transcript 4733..7802 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" gene 4733..7802 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" exon 4733..4842 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=1 intron 4843..5340 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" 
/number=1 GC_signal 5002..5007 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" TATA_signal 5279..5280 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" exon 5341..5388 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=2 CDS join(5350..5388,6304..6381,7088..7266,7587..7701) /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /codon_start=1 /db_xref="PID:g38430" /db_xref="SWISS-PROT:P05496" /translation="MQTAGALFISPALIRCCTRGLIRPVSASFLNSPVNSSKQPSYSN FPLQVARREFQTSVVSRDIDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIGYARNPS LKQQLFSYAILGFALSEAMGLFCLMVAFLILFAM" intron 5389..6303 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=2 exon 6304..6381 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=3 intron 6382..7087 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=3 repeat_unit 6418..6699 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /rpt_family="Alu" exon 7088..7266 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=4 mat_peptide join(7154..7266,7587..7698) /gene="P1 gene for c subunit of human mitochondrial ATP synthase" intron 7267..7586 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=4 exon 7587..7802 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" /number=5 polyA_signal 7780..7785 /gene="P1 gene for c subunit of human mitochondrial ATP synthase" repeat_unit 8879..9176 /rpt_family="Alu" BASE COUNT 2505 a 2208 c 2318 g 2426 t ORIGIN 1 gatcaagacc atcctggcca acacggtgaa acccatctct actaaaaata caaaaaaaaa 61 aaaaaattag ctggacatgg tggcaggcgc ctgtagtccc agctactcgg gaggctgagg 121 caggagaatg gcgtgaaccc aggaggtgga gcttgcagtg agccgagatc gcgccactgc 181 actctagcct gggcaacaga gcaagactcc gtctcaaaaa aaaattattt atctatctac 241 ctatctatct atctatctat ctatctatct ataaattagc tgggcgtggt agtgggtgcc 301 tgtaatccca gctactgggg aagctgaggc aggagaatcg cttgaaccca ggaggtggag 361 gttgcaggga 
actgagactg cgccactgca ctccagccta gtgacagagc aagactctgt 421 ctcaaaaaaa aaaaaaaaaa aattaggcag gcgtggtggc aggcacctgt aataccagct 481 acttgggagg ctgaggtagg cgaactgctt gaacccagga ggcggaggtt gcagtgagcc 541 gagattgcgc cattgcactc cagcctggca acaagagcaa aactccatct caaacaaaaa 601 agagaggccg ggtgcagtgg ctcaagcctg taacaccagc tctttgggag gccgaggcgg 661 gcagatcacg aggtcaggag atcaagacca tcctggctaa cacggtgaaa ccccgtctct 721 actaaaaata caaaaaaaat tagccgggcg tggtggcggg cgcctgtggt cccagctact 781 cgggaggctg aggcaggaga atggcgtgaa cccgggaggc ggagcttgcg gtgagccgag 841 attgcgccac tgcactccag cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa 901 aaaagaaaga aaaaaaaaag aacttttgct ttggacccag acagacctgg gttcaaagca 961 cctagcacag tatttgtcat ataatgaggt atcgataaat gatagcttat aacactaaca 1021 caaagaaaga ctggataatc tgctcagaag ataccagtta tgtgttgtag ccatggtagt 1081 gagctcaaaa ctccactctc ctttttgttg ttgttgttgt cgttgttgtt tttgagatac 1141 ggtctcgctt tgttgcccag gccagtgcag tggtgcaatc agagttcact gaagccttga 1201 cctcctgagc tcaagtgatc ctcccacttc agccccctga gtagctggga ctacaggtac 1261 ctgccatcac accgagctaa tttttttttt tttttttttt ttttgtagag atgaggtttc 1321 gctatgttgc ccaggctggt ctccaattcc tgggctcaaa tgatctgccc accttggcct 1381 cccacacact ttcaattata ccatgctagc aggatatttt aaatctccac atgatatgaa 1441 aatgtgctgg cccagcacag tggctcacgc ctgtaatccc agcactttgg gaggctgagg 1501 caggtggatc acctgaggtc aggagttcga gaccagcctg gccaacatgg tgaaaccttg 1561 tctctactaa aaaaaaaaat ttttaaatta gctgggcgtg gtggcccgca cgctgcggtc 1621 tcagctactc gggaagctga ggcaggagaa tggcttgaac ccaggaggcg gaggttgcag 1681 tgagccgaga tctcgccact gcactccagt ctgggcgaaa gagtgacact cagtctcaca 1741 aaaaaagaaa aaagaaaaaa aaagaaaatg tgctgggcac ggtggttcat gcctgtaatc 1801 ccagcacttt gggaggccaa ggcgggcaga tcacgaggtc aggagttcaa gaccagcctg 1861 gccaacagag tgaaacccca tctctactaa agatacaaaa aaattagcca ggcatagtgg 1921 tgcgcacctg taatcccagc tactcaggag gctgaggcag agaattgctt gaacccggga 1981 ggcagaggtt gcagtaagcc gaaatcacgt cattgcactc cagcctgggt gacagagcaa 2041 gactctatct caaaaagaaa agaaaagaaa 
agaaaagaaa agaaaagaaa agaaaagaaa 2101 agaaaagaaa atgtttttct ggccaggtgc aggtagctca catctgtaat cccaacactt 2161 ccagaggctg aagtgggagg atccttgagg ccaggagttt gagaccagcc tggacaacat 2221 agcaagatcc cacctgtagt cctagatact tgggagagtg aggagggagg gttacttgag 2281 cccaggagat taaggctata atagtgaggg atgattgcac cactgcactc cagcctgggc 2341 aacagagtga gaccctgttt ctaaaaacaa aaaaattatt aaaaaaaaaa aagttttctt 2401 aagagtccag acttgtgaat tgccagatta gtgtaatttt taaaatatgt ttctattata 2461 aagtacccat actcataaaa atataaatca atttattaca ccctctagaa ttcactatta 2521 attttcaaca tttttttcat tctttttcca tgcatatttt ttcacaattc tatgcatagt 2581 tttgcattat aaaatatttc tcaatataaa atcttcttca aggccaggcg cagtggctca 2641 tgcctgcaat cccagcactt tagaaggcca aggcgagcag atcacttgag gtcaggaatt 2701 caagaccagc ctgaccaaca tggtgaaacc ccttctctac taaaaataca aaaattagcc 2761 gggcatggtg gtgcgcgcct gtaatcccag ctacttggga ggctgaggca ggagaattgc 2821 ttgaaccggg gaggcaaagg ttgcagtgag cagagaccgc accaccgcac tacagtgtgt 2881 gaaacagagt gagactccgt ctcaaaaaat aataataata aaaataaaat attcttcaaa 2941 aacatggccg ggcgcgctgg ctcacgcctg taatcccaac actttgggag gctgaggcgg 3001 gcggatcacg aggtcaggag atcgagacca tcccggctaa cacggtgaaa ccccgtctct 3061 actaaaaata caaaaaatta gccgggcgtg gtggcgggcg cctatagtcc cagctactcg 3121 ggaggctgag gcaggagaat ggcgtgaacc cgggaggcgg agcttgcagt gagccgagat 3181 agcgccgctg cactccagct tggcaacaga gcgagactcc gtctcaaaaa aaacacatga 3241 ttttatttcc tgttttattt ctgcttattt catcaagtga aaatatcata acttatttca 3301 ccttttactg tcagtggaac atttacttgt ttccaatttc gacaaactct gcaaatgcac 3361 atctttttaa aagaatctgc atttctgatt cttcaaaata aattcctaga agtgatatta 3421 ctggatgaaa agtggatgga gagcttaaag gctcttagtt actcaagaat tactgggtca 3481 aagaggatgg cccattcaca gcctttgaat aaaaataaat attgccaaac tgctttcaag 3541 gaaggttttc taattcatcc tctcaccagc aatataggag agcgtgtggc tcacattaat 3601 aaaaagaact ctgctgtttt gagcagtact aagcggagcg tttaaactta tatttctttg 3661 attaggaagg ctgacattcg ttcctgcaat tcatatttct tccttactga attgcctgtt 3721 cctatccttt gctcgttttt ctatagggat attatttcat 
tctttcagtg ttgtttttac 3781 aaaataaaca atttttttgt ttgttttttc ccccgagacg gaatcctgct cagttgccca 3841 ggctggagtg cagtggcgcg atctcggctc cctgcaacct cggcctcccg ggttcaagcg 3901 attctcctgc ctcagcctcc caagtagctg ggattacagg cacgcgccac caccatgccc 3961 ggctaatttt tgtattctta gtagagacgg ggtttcacca tgctggtcag gctgatctcg 4021 aactcctgac ctcaggtgat ccgcccgcct cggcctccca aagtgctggg attacaggcg 4081 tgagccactg cacccggccc aacaaatgtt ttagaaagcc atctggatga tagtccatga 4141 ctgctgaaca ccggctgagg tccacaaaac tgcgctcacc gcttggggac tttccagccg 4201 gcctttactc tccatggaaa gcggtctttc cacaaacctc cacggtggga attactgcaa 4261 aggcatcggc tctcccggcc tcgaatatct cgcgggatct cagctgtcct tgggcacccc 4321 cctcagcgca actaccaatc ccaacatgca acgcgccccg tggcccaagc acccgaatga 4381 aggctcgggg ccaaagtcat gtgtctgact gcgtcacacg tcccgtcatc tctcccacgc 4441 ctttttcttc agcgtacacc ttggtcgcca tcttgtgcgc gaggttatgg aacctctgag 4501 tcccgccaac tctatggtcg agcgtttcag ggatcggcca atcgtaatcc agatctcgta 4561 ggaaagcccc gcctctatcc gcatggaggc gggaattgcc acgaagctcc tgtggaggga 4621 gaggaagcag ctgcggaaag ccaataagag tggggaatcg atgacgtcaa ccaatgggga 4681 cgcggggata ttacggccaa tgagaatgga gaaggtccag gacacgtggg tgggggaagc 4741 tgagggctga gaccaagggc taaagctggg aggtgagtct gtcaccttga gccgggcgag 4801 cgctgtgggc caagcagggg ttgcagggca gtaggagtgc aggtgacttg gggccgggag 4861 ccagcaatag gcagggttgt gggctgcgca gtgtggaggg gtatttcccc ccagtgctgg 4921 gagaggctat ggcaatctag atatggcctt atcaatgaat gttaattacc aacggtaatg 4981 aacgattgac tgcctcggct ggggcgggtc ggggactgct gtgtaagatg ggcaggatcc 5041 tgcgcgacac ctgcccagta tccgcctcca gtcttcccga gagaccattc cctttcgcaa 5101 catttccttc ctggggcttg ggcacatttc cttgacttcc gatctccatg actttgcttt 5161 gaagacccac tgctctgtcc ctctgacctc taatgccctc ctcagctggg gcacggtgca 5221 aggaagagcg tggtctaggg aggacggacg ggggtgaagg ggacgaccaa aggtcgtcat 5281 tataagcaaa caaccagggt ctcagtggga ttattattac tattttttcc ccctctgcag 5341 actgaaaaaa tgcagaccgc cggggcatta ttcatttctc cagctctggt aaggtgccgc 5401 gccgcggtgc tctgtagtac caggtgtatg gtgtggacgc catcagtgtt 
taatgaatga 5461 acccagggga gcttggcgtc ctgatgatgt aggctctttg aggtggctgt aggatgtgat 5521 gaaggtaacc cctgtttatc tgggcctgtg aaccaagcta tcttggcttt ggaatgtggt 5581 cagagagtct ggatttgatt tttatcccca gtctgtcctg gccatagaga cagcccacta 5641 aaacagtggt tttcaacagt ggctgcacgt tagaatcatc cagggaactt tttaaaatcc 5701 tgatgcctag gctgcacctt agaccatttg catcagactt tgggaatggg ggtggaaccc 5761 aggcatctgt atcttttaag acttctccag atgattccac cgtgcagcca agtttgagaa 5821 ccactgcact aaaaattatt tcaaaaaagc catgttacag atggctctcc ttggatggct 5881 attccagtcg gtttccgggg ctgactggcc cactccattt agggtgtgta gctgtgagtg 5941 tattttagca gaataaaatt tcccccatct gattacttgg tgatctgggt ataaaccttt 6001 cctgtacctg ctttggtatt agtgtgggtt cttgttccat tctgagaagc tgattaaagc 6061 tgtgcccaaa gaagtgaact ttggtttggt tttgaatggt gaaggttgcc actgctgact 6121 atatcaagct attgagacag ctgtctctgg ggcaggccta tctgatagga acaggaataa 6181 attgaccttg aaatacattg aacttattct gtttatgaaa tctgtagtca tttttgacag 6241 ctcagcaatt aggatttttt ttccttgact gatttggtag gatgtggctt tctgatttta 6301 cagatccgct gttgtaccag gggtctaatc aggcctgtgt ctgcctcctt cttgaatagc 6361 ccagtgaatt catctaaaca ggtaagggag gaatagctct cttaggaatg ttcccaaggc 6421 ccaggatggt ggctcacacc tgtaatccca gctctttgag aggctgaggc gagaggatca 6481 cttgagccca gttgttcaag accagcctgg gcaacacagt gaaaccccat ctctacaaaa 6541 aaaataaaaa aattagccat gtgtggtggc atgtggctgt agtcccagtt actccagggc 6601 tgagatgaga ggattggtta atcccagagg tcgagactgc tgtgactatg ccactgcact 6661 ccagactggg tgacagagca agaccctgac tcataaaaac ccattaatta atttttttta 6721 aatcaaggaa tgttcccaga caggggtcct tcagcttaag ttgcctttag accaaagtgg 6781 gaggagagtt ggagaatctg tatgctttaa tagttctaag tcccccaagg taagccttgt 6841 aggtaagaaa cttataaaag tgaggtgtgc ctaacccatg aaatttccct agtcctgtct 6901 gctgtctgac aacagactct aagactaatt cccagcatac tctgcaatca gagtattctg 6961 catttaccag gcagtttgcc aacagtttca gagctcaggt gggtggggtg aaggaggtag 7021 agtcagccac ctgtccttat gccatactat ctctctgcta tctcgcctgc cttgcttctc 7081 tttctagcct tcctacagca acttcccact ccaggtggcc agacgggagt tccagaccag 
7141 tgttgtctcc cgggacattg acacagcagc caagtttatt ggtgctgggg cagccacagt 7201 tggtgtggct ggttcagggg ctggcattgg aaccgtgttt ggcagcttga tcattggcta 7261 tgccaggtaa gtttgggtgg tctacagcat ctcccactgt aaattccacc ccgtttgggg 7321 aagcctcagc tggaggagct cccttcagga gccttcaggt tgatttcatc tacatcatag 7381 tttctccaga gaaaacatgc atccagcctg gctcacttaa acttaccagg ttggcccact 7441 ctgcttgtcc atccaaatcc ccaggatctg tgcaggctag gccctctccc aggagtaaca 7501 gtccccattc acctcaccct cctgtgtcct cccctcccct caccccttcc tctccctcaa 7561 ctggcaatga tctctgtccc tcccaggaac ccgtctctca agcagcagct cttctcctat 7621 gccattcttg gctttgccct gtctgaggcc atggggcttt tctgtttgat ggtcgccttc 7681 ctcatcctct tcgccatgtg aggctccatg gggggtcacc ggcctgttgc tactgcaact 7741 ccacaccatt cttggtgctg gggtgtgtta agctttacca ttaaacacaa cgtttctcta 7801 aacccctgtc tgtgcctctg tcctttgacc ttcaggaggg ccttggtaga agggtgagga 7861 gagggaattt gctcagccat gggtgatggg aactcctcag ggctgagaag gagcactgcc 7921 atgactgtaa gaacagaggg ttctcatgtt ctcccaggag gaatatgggt ttgtctgagt 7981 aatgagattt ttgccttttc ccttcagatt tattacagtt ggccatctgt atccataggt 8041 tccatatttg tggattcaac taacagtaga cagaaaatat tcagggaaaa aagtgaatgg 8101 ttgtgtctgt actgcacaag tacaggcttt ttttttcttg tcattattcc ctaaacaata 8161 cagtatagcc attatttata tagcatttac ctagtattta ggtattgtaa gtaatctaga 8221 gatgatttaa agtatatggg aggatgtgca taggttatat gcaaatacta caccatttta 8281 tataagggac ttgagcatct gtggtatcca ctggggtcct ggagccaatc tcccacagat 8341 accaaaggat aactgtaaca actgcatatt aataggttac atttttgaag ggcttaattt 8401 acaccaagca ttttacattc gttatctcaa caaccctaag agttataagc acaactttaa 8461 aactgttttg caaatgacag acccagggtt tggggaagca aaataactta tccagtatca 8521 cagagctagt aagtggcaga ttggggattt ctcccccagg ccagcttaaa ccttaactgc 8581 actgttgtcc tatctatttt cttttttttt tttttttgtc ccatctattt tcattctggt 8641 cacacaacaa ttaatgtggc acagaaaggg aaacttagtc ccaaagaagt tgtatagcag 8701 gtgaaaggta gaggtaggct tctatctgcc tgtgtttagc cccatttgta ggtttcattt 8761 tcactatcaa ttagcctttc tcagagtatc taaatgttct taaagcagat aaagtacaca 8821 
atcctggttc ttggagaatt tatcacctta gttttgctag gataaaagac tatctggagc 8881 caggcatggt ggctcacgcc tgtaatccca gcactttggg aggctgaggc cggcggatca 8941 cctgaggtca ggagttcgag accagcctga acaacatgga gaaacccagt ctctactaaa 9001 aatacaaagt taaccaggca tggtggcgca tgcctgtaat cccagctact tgggaggctg 9061 aggcaggaga atcacttgaa cccaggaggc ggaggttgtg gtgagccgag atcatgccat 9121 tgcactccag cctgggcaac aagagcaaaa ctccgtctca aaaaaaaaaa aaaaaagatg 9181 atttgggggc gtgtgccaaa taatgaattc cataaagagc ttcgtccagc atcttccttg 9241 ggttctcaaa gtctatgccc accagtcttg cccctcttgg ctaaagtggg actgataacc 9301 catgtaaaat tagctaagtt gacatatcaa cagttgctgc ttctctaagt ccttgcagga 9361 gcaagccatg gacagagaac tcatgcaaag atgctgggaa cgtggtctgt ctcaaattgg 9421 atggcttcaa aaggagtagt aacgtaaagt gggatcc // LOCUS HSB3A 3683 bp DNA PRI 18-MAY-1993 DEFINITION H.sapiens gene for beta-3-adrenergic receptor. ACCESSION X72861 NID g298094 KEYWORDS beta-3-adrenergic-receptor; promoter; transmembrane receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3683) AUTHORS Emorine,L.J. TITLE Direct Submission JOURNAL Submitted (05-MAR-1993) L.J. Emorine, Inst. Cochin de Genetique Moleculaire, CNRS UPR 0415, 22 rue Mechain, 75014, Paris, FRANCE REFERENCE 2 (bases 1 to 661; 1817 to 3683) AUTHORS van Spronsen,A., Nahmias,C., Krief,S., Briend-Sutren,M.M., Strosberg,A.D. and Emorine,L.J. TITLE The promoter and intron/exon structure of the human and mouse beta 3-adrenergic-receptor genes JOURNAL Eur. J. Biochem. 213 (3), 1117-1124 (1993) MEDLINE 93279311 REFERENCE 3 (bases 1 to 3683) AUTHORS Emorine,L.J., Marullo,S., Briend-Sutren,M.M., Patey,G., Tate,K., Delavier-Klutchko,C. and Strosberg,A.D. TITLE Molecular characterization of the human beta 3-adrenergic receptor JOURNAL Science 245 (4922), 1118-1121 (1989) MEDLINE 89368947 COMMENT Related sequences: M29932 & M62473. 
FEATURES Location/Qualifiers source 1..3683 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="8p11.1-8p12" /chromosome="8" protein_bind 15..22 /note="CRE element" /bound_moiety="receptor" protein_bind 119..126 /note="CRE element" /bound_moiety="receptor" protein_bind 254..259 /note="GRE element" /bound_moiety="glucocorticoid receptor" protein_bind 264..270 /note="AP-1 site" /bound_moiety="AP-1" CAAT_signal complement(365..369) CAAT_signal 386..390 mRNA join(441..1842,2868..>3525) exon 441..1842 /number=1 misc_feature 441..492 /note="CAP site" CDS join(638..1842,2868..2889) /codon_start=1 /product="beta-3-adrenergic receptor" /db_xref="PID:g298095" /db_xref="SWISS-PROT:P13945" /translation="MAPWPHENSSLAPWPDLPTLAPNTANTSGLPGVPWEAALAGALL ALAVLATVGGNLLVIVAIAWTPRLQTMTNVFVTSLAAADLVMGLLVVPPAATLALTGH WPLGATGCELWTSVDVLCVTASIETLCALAVDRYLAVTNPLRYGALVTKRCARTAVVL VWVVSAAVSFAPIMSQWWRVGADAEAQRCHSNPRCCAFASNMPYVLLSSSVSFYLPLL VMLFVYARVFVVATRQLRLLRGELGRFPPEESPPAPSRSLAPAPVGTCAPPEGVPACG RRPARLLPLREHRALCTLGLIMGTFTLCWLPFFLANVLRALGGPSLVPGPAFLALNWL GYANSAFNPLIYCRSPDFRSAFRRLLCRCGRRLPPEPCAAARPALFPSGVPAARSSPA QPRLCQRLDGASWGVS" intron 1843..2867 /number=1 exon 2868..>3525 /number=2 polyA_signal 3520..3525 terminator 3567..3576 BASE COUNT 604 a 1147 c 1060 g 872 t ORIGIN 1 agatctcacc aagctgaggt cttgggagag gagatactgg ctgagcccta ttacttaatt 61 taaaatacct taggggaggc cacccaagtg gatgcggggc tcctgtgaat cctttgcttg 121 actccagcgg gttacctttg cctctgatac ataaagggtg gggatgggag cgctctcctc 181 tctccttccc ctgccttgct gtgggaactt ctgggaaagg aggtgcaggg ctccaggaag 241 ccagtgccca gggagtgcta tgctgagtcc aggagcctgg ccacggcagg ggtggacaga 301 tggtggcaga ggaaccacgg tgtcccttcc tccagattta gctaaaggaa acgtggagca 361 tcccattggc catcctcccc actctccaat tcggctccag aggcccctcc agactatagg 421 cagctgcccc tttaagcgtc gctactcctc ccccaagagc ggtggcaccg agggagttgg 481 ggtgggggga ggctgagcgc tctggctggg acagctagag aagatggccc aggctgggga 541 agtcgctctc atgccttgct gtcccctccc ctgagccagg tgatttggga gaccccctcc 601 ttccttcttt 
ccctaccgcc ccacgcgcga cccggggatg gctccgtggc ctcacgagaa 661 cagctctctt gccccatggc cggacctccc caccctggcg cccaataccg ccaacaccag 721 tgggctgcca ggggttccgt gggaggcggc cctagccggg gccctgctgg cgctggcggt 781 gctggccacc gtgggaggca acctgctggt catcgtggcc atcgcctgga ctccgagact 841 ccagaccatg accaacgtgt tcgtgacttc gctggccgca gccgacctgg tgatgggact 901 cctggtggtg ccgccggcgg ccaccttggc gctgactggc cactggccgt tgggcgccac 961 tggctgcgag ctgtggacct cggtggacgt gctgtgtgtg accgccagca tcgaaaccct 1021 gtgcgccctg gccgtggacc gctacctggc tgtgaccaac ccgctgcgtt acggcgcact 1081 ggtcaccaag cgctgcgccc ggacagctgt ggtcctggtg tgggtcgtgt cggccgcggt 1141 gtcgtttgcg cccatcatga gccagtggtg gcgcgtaggg gccgacgccg aggcgcagcg 1201 ctgccactcc aacccgcgct gctgtgcctt cgcctccaac atgccctacg tgctgctgtc 1261 ctcctccgtc tccttctacc ttcctcttct cgtgatgctc ttcgtctacg cgcgggtttt 1321 cgtggtggct acgcgccagc tgcgcttgct gcgcggggag ctgggccgct ttccgcccga 1381 ggagtctccg ccggcgccgt cgcgctctct ggccccggcc ccggtgggga cgtgcgctcc 1441 gcccgaaggg gtgcccgcct gcggccggcg gcccgcgcgc ctcctgcctc tccgggaaca 1501 ccgggccctg tgcaccttgg gtctcatcat gggcaccttc actctctgct ggttgccctt 1561 ctttctggcc aacgtgctgc gcgccctggg gggcccctct ctagtcccgg gcccggcttt 1621 ccttgccctg aactggctag gttatgccaa ttctgccttc aacccgctca tctactgccg 1681 cagcccggac tttcgcagcg ccttccgccg tcttctgtgc cgctgcggcc gtcgcctgcc 1741 tccggagccc tgcgccgccg cccgcccggc cctcttcccc tcgggcgttc ctgcggcccg 1801 gagcagccca gcgcagccca ggctttgcca acggctcgac gggtaggtaa ccggggcaga 1861 gggaccggcg gctcagggtc gggaagcatg cgatgtgtcc gtgggtcaac tttttgagtg 1921 tggagtttat taagagaagg tgggatggct ttgcttggag agaaaaggga acgaggagta 1981 gcgaaccaaa atgggaccca gggtcctttt ctttccggat ccagtcacta gggtagaagc 2041 aaaggagggc gagcgggccg tcgttcctca cccaaggacc caaggtgcgc caccggaaag 2101 cgctgcggtg tcccgaggac tctcgcctcg cctggtcggc tttagggatt tttttttttt 2161 ttaaatagag acagggtttc gtctctgtcg cccacgcggg aatgcagtgg tgcgatctca 2221 gctcactgca gtcttgaact cctggctcct gggctcaagc gatcctccca cctcagcctc 2281 ctgagtatct gggactacag 
gcgagcccca ccaatcccag ctatttttaa aatttcttgt 2341 agagatgggg tcttgctatg ttgcccaggc ttgtcttgaa cttctggcct caagtgatcc 2401 ttctgcctca gccttccaaa gcattaggat tacaggccgg agccagggcg ccgggtcggc 2461 tctagttttg gttttccagc tcagttcttt gcccccctcc cccgatttct tgccatcact 2521 agacctggct cggacttgaa ggcagggcta gtgccccccc acccgccccc caagccctcg 2581 gcctcagttc tgggttttct caaaggtttg acagctgtgg aggtgagaat ccacttccgg 2641 tatgaagtac agttgtgagt gaggagcctg tgagtgcaga tgtgtgccct cccgctccct 2701 gggctgggtt ggagtaggga tggggtgggg cgtgtgtggc tgggtggtgc cctggcgttt 2761 ttgtgtaact aaatatgcgt tccagggtct ctgatctctg tcattcccct cagtgcacct 2821 gttgctcctt tcaccccagg gtctattatc tccacttttt ttcccagggc ttcttgggga 2881 gtttcttagg cctgaaggac aagaagcaac aactctgttg atcagaacct gtggaaaacc 2941 tctggcctct gttcagaatg agtcccatgg gattccccgg ctgtgacact ctaccctcca 3001 gaacctgacg actgggccat gtgacccaag gagggatcct taccaagtgg gttttcacca 3061 tcctcttgct ctctgtctga gagatgtttt ctaaacccca gccttgaact tcactcctcc 3121 ctcagtggta gtgtccaggt gccgtggagc agcaggctgg ctttggtagg ggcacccatc 3181 acccggcttg cctgtgcagt cagtgagtgc ttagggcaaa gagagctccc ctggttccat 3241 tccttctgcc acccaaaccc tgatgagacc ttagtgttct ccaggctctg tggcccaggc 3301 tgagagcagc agggtagaaa agaccaagat ttggggtttt atctctggtt cccttattac 3361 tgctctcaag cagtggcctc tctcacttta gccatggaat ggctccgatc tacctcacag 3421 cagtgtcaga aggacttcgc cagggttttg ggagctccag ggttcataag aaggtgaacc 3481 attagaacag atcccttctt ttccttttgc aatcagataa ataaatatca ctgaatgcag 3541 ttcatcctcg gccccctttc cctccgtttg ttttcttttc ataatccact tactcccttc 3601 ccttctactc tgctggcttt tgacagaggc gtaaattagg cctaatcctc actcttttct 3661 tcctaatgtt catcaaagaa aaa // LOCUS HSBGPG 1675 bp DNA PRI 24-APR-1993 DEFINITION Human gene for bone gla protein (BGP). ACCESSION X04143 NID g29449 KEYWORDS bone gla protein; osteocalcin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1675) AUTHORS Celeste,A.J., Rosen,V., Buecker,J.L., Kriz,R., Wang,E.A. and Wozney,J.M. TITLE Isolation of the human gene for bone gla protein utilizing mouse and rat cDNA clones JOURNAL EMBO J. 5 (8), 1885-1890 (1986) MEDLINE 87004555 FEATURES Location/Qualifiers source 1..1675 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 420..425 exon <496..559 /number=1 CDS join(496..559,817..849,1026..1095,1297..1426) /codon_start=1 /product="BGP" /db_xref="PID:g29450" /db_xref="SWISS-PROT:P02818" /translation="MRALTLLALLALAALCIAGQAGAKPSGAESSKAFVSKQEGSEVV KRPRRYLYQWLGAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV" intron 560..816 /number=1 exon 817..849 /number=2 intron 850..1025 /number=2 exon 1026..1095 /number=3 intron 1096..1296 /number=3 exon 1297..>1426 /number=4 polyA_signal 1562..1567 BASE COUNT 302 a 524 c 519 g 330 t ORIGIN 1 acggggctga cagtagaaat cacaggctgt gagacagctg gagcccagct ctgcttgaac 61 ctattttagg tctctgatcc ccgcttcctc tttagactcc cctagagctc agccagtgct 121 caacctgagg ctgggggtct ctgaggaaga gtgagttgga gctgaggggt ctggggctgt 181 cccctgagag aggggccaga ggcagtgtca agagccgggc agtctgattg tggctcaccc 241 tccatcactc ccaggggccc ctggcccagc agccgcagct cccaaccaca atatcctctg 301 gggtttggcc tacggagctg gggcggatga cccccaaata gccctggcag attcccccta 361 gacccgcccg caccatggtc aggcatgccc ctcctcatcg ctgggcacag cccagagggt 421 ataaacagtg ctggaggctg gcggggcagg ccagctgagt cctgagcagc agcccagcgc 481 agccaccgag acaccatgag agccctcaca ctcctcgccc tattggccct ggccgcactt 541 tgcatcgctg gccaggcagg tgagtgcccc cacctcccct caggccgcat tgcagtgggg 601 gctgagagga ggaagcacca tggcccacct cttctcaccc ctttggctgg cagtcccttt 661 gcagtctaac caccttgttg caggctcaat ccatttgccc cagctctgcc cttgcagagg 721 gagaggaggg aagagcaagc tgcccgagac gcaggggaag gaggatgagg gccctgggga 781 tgagctgggg tgaaccaggc tccctttcct ttgcaggtgc gaagcccagc ggtgcagagt 841 ccagcaaagg tgcaggtatg aggatggacc tgatgggttc ctggaccctc ccctctcacc 901 ctggtccctc agtctcattc ccccactcct gccacctcct gtctggccat caggaaggcc 961 agcctgctcc 
ccacctgatc ctcccaaacc cagagccacc tgatgcctgc ccctctgctc 1021 cacagccttt gtgtccaagc aggagggcag cgaggtagtg aagagaccca ggcgctacct 1081 gtatcaatgg ctggggtgag agaaaaggca gagctgggcc aaggccctgc ctctccggga 1141 tggtctgtgg gggagctgca gcagggagtg gcctctctgg gttgtggtgg gggtacaggc 1201 agcctgccct ggtgggcacc ctggagcccc atgtgtaggg agaggaggga tgggcatttt 1261 gcacgggggc tgatgccacc acgtcgggtg tctcagagcc ccagtcccct acccggatcc 1321 cctggagccc aggagggagg tgtgtgagct caatccggac tgtgacgagt tggctgacca 1381 catcggcttt caggaggcct atcggcgctt ctacggcccg gtctagggtg tcgctctgct 1441 ggcctggccg gcaaccccag ttctgctcct ctccaggcac ccttctttcc tcttcccctt 1501 gcccttgccc tgacctccca gccctatgga tgtggggtcc ccatcatccc agctgctccc 1561 aaataaactc cagaagagga atctgtgggc ctgtgagtct gtccagttta tggagtgtgg 1621 gagggaggtg tcaggaggat gggggtgagg aggttttacc ttcttcagtt ctaga // LOCUS HSBSF2 5961 bp DNA PRI 24-APR-1993 DEFINITION Human (BSF-2/IL6) gene for B cell stimulatory factor-2. ACCESSION Y00081 NID g29494 KEYWORDS B cell stimulatory factor-2; BSF-2/IL6 gene; hepatocyte stimulating factor; hybridoma plasmacytoma growth factor; interferon beta II; interleukin 6. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5961) AUTHORS Hirano,T. and Kishimoto,T. TITLE Direct Submission JOURNAL Submitted (21-SEP-1987) Toshio Hirano, Institute for Molecular and Cellular Biology, Osaka University, 1-3, Yamada-oka, Suita, Osaka 565, Japan REFERENCE 2 (bases 1 to 5961) AUTHORS Yasukawa,K., Hirano,T., Watanabe,Y., Muratani,K., Matsuda,T., Nakai,S. and Kishimoto,T. TITLE Structure and expression of human B cell stimulatory factor-2 (BSF-2/IL-6) gene JOURNAL EMBO J. 
6 (10), 2939-2945 (1987) MEDLINE 88082664 FEATURES Location/Qualifiers source 1..5961 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="tonsillar mononuclear cells" misc_feature 879..909 /note="sequence homologous to 5'flanking region of IL2" misc_feature 993..998 /note="enhancer-like element" promoter 1016..1022 /note="TATA box like sequence" misc_feature 1027..1032 /note="enhancer-like element" misc_feature 1073..1078 /note="enhancer-like element" misc_feature 1082..1087 /note="enhancer-like element" misc_feature 1084..1089 /note="enhancer-like element" promoter 1108..1114 /note="TATA box like sequence" promoter 1131..1138 /note="TATA box like sequence" exon <1223..1241 /number=1 CDS join(1223..1241,1396..1586,2634..2747,3454..3600, 5342..5509) /codon_start=1 /product="B cell stimulatory factor-2 (BSF-2)" /db_xref="PID:g29495" /db_xref="SWISS-PROT:P05231" /translation="MNSFSTSAFGPVAFSLGLLLVLPAAFPAPVPPGEDSKDVAAPHR QPLTSSERIDKQIRYILDGISALRKETCNKSNMCESSKEALAENNLNLPKMAEKDGCF QSGFNEETCLVKIITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLIQFLQKKAKN LDAITTPDPTTNASLLTKLQAQNQWLQDMTTHLILRSFKEFLQSSLRALRQM" intron 1242..1395 /number=1 exon 1396..1586 /number=2 intron 1587..2633 /number=2 exon 2634..2747 /number=3 intron 2748..3453 /number=3 exon 3454..3600 /number=4 intron 3601..5341 /number=4 exon 5342..>5509 /number=5 polyA_signal 5831..5836 polyA_signal 5909..5914 BASE COUNT 1682 a 1324 c 1373 g 1582 t ORIGIN 1 ggatcctcct gcaagagaca ccatcctgag ggaagagggc ttctgaacca gcttgaccca 61 ataagaaatt cttgggtgcc gacgcggaca gagattcaga gcctagagcc gtgcctgcgt 121 ccgtactttc cttctagctt cttttgattt caaatcaaga cttagaggga gagggagcga 181 taaacacaaa ctctgcaaga tgccacaagg tcctcctttg acatccccaa caaagaggtg 241 agtagtaatc tccccctttc tgccctgaac caagtgggct tcagtaattt cagggctcca 301 ggagactggg catgcaggtg ccgatgaaac agtggtgaag agactcagtg gcagtgggga 361 gagcactggc agcacaggca aacctctggc acaagagcaa agtcctactg gagattccaa 421 gggtcacttg ggagagggca ggcagcagcc aacctcctct aagtgggctg aagcaggtga 481 agaaatggca 
gacaagcgcg gtggcaaaaa ggagtcacac actccacctg gagacgcctt 541 gaagtaactg cacgaaattt gagggtggcc aggcagtcta caacagccgc tcacagggag 601 agccagaaca cagaagaact cagatgactg gtagtattac cttcttcata atccaggctt 661 ggggggctgc gatggagtca gaggaaactc agttcagaac atctttggtt tttacaaata 721 caaattaact ggaacgctaa attctagcct gttaatctgg tcactgaaaa aaaaattttt 781 ttttttcaaa aaacatagct ttagcttatt ttttttctct ttgtaaaact tcgtgcatga 841 cttcagcttt actctttgtc aagacatgcc aaagtgctga gtcactaata aaagaaaaaa 901 agaaagtaaa ggaagagtgg ttctgcttct tagcgctagc ctcaatgacg acctaagctg 961 cacttttccc cctagttgtg tcttgcgatg ctaaaggacg tcacattgca caatcttaat 1021 aaggtttcca atcagcccca cccgctctgg ccccaccctc accctccaac aaagatttat 1081 caaatgtggg attttcccat gagtctcaat attagagtct caacccccaa taaatatagg 1141 actggagatg tctgaggctc attctgccct cgagccaccg ggaacgaaag agaagctcta 1201 tctcccctcc aggagcccag ctatgaactc cttctccaca agtaagtgca ggaaatcctt 1261 agccctggaa ctgccagccg gtcgagccct gtgtgaggga ggggtgtgtg gcccagggat 1321 gcggggcgcc agcagcagag gcaggctccc agctgtgctg tcagtcaccc ctgcgctcgc 1381 tcccctccgg cacaggcgcc ttcggtccag ttgccttctc cctggggctg ctcctggtgt 1441 tgcctgctgc cttccctgcc ccagtacccc caggagaaga ttccaaagat gtagccgccc 1501 cacacagaca gccactcacc tcttcagaac gaattgacaa acaaattcgg tacatcctcg 1561 acggcatctc agccctgaga aaggaggtgg gaaggcttgg cgatggggtt gaagggcccg 1621 gtgcgatgcg tctcccctcc ctgcgtgtgg gggggctgcc tgcataagga ggtctttgct 1681 gggttctaga gcactgtaga tttgaggcca acgacctaga ctgacttctg tatttatcct 1741 ttgctggtgt caggaggttc ctttcctttc tggaaaatgc agaatgggtc tgaaatccat 1801 gcccaccttt ggcatgagct gagggttatt gcttctcagg gcttcctttt ccctttccaa 1861 aaaattaggt ctgtgaagct cctttttgtc ccccgggctt tggaaggact agaaaagtgc 1921 cacctgaaag gcatgttcag cttctcagag cagttgcagt actttttggt tatgtaaact 1981 caatggttag gattcctcaa agccattcca gctaagattc atacctcaga gcccaccaaa 2041 gtggcaaatc ataaataggt taaagcatct ccccactttc aatgcaaggt attttggtcc 2101 tgtttggtag aaagaaaaga acacaggagg ggagattggg agcccagact cgaattctgg 2161 ttctgccaaa ccagccttgt 
gatcttgggt aaattcccta ccacctctgg actccatcag 2221 taaaattggg ggtggactag gtgatctcat agatccttcc tgctggaaca ttctatggct 2281 tgaattatat tctcctaatt attgtcaaaa ttgctgttat taagtatcta ctgtgtgcca 2341 ggcactttaa ataaatattg tgtctaatct tcaaaacaaa tttgcaagga aggtttttgg 2401 agataaggaa actgagactc aggattaagt aacacaccta aagtcaaagg tgagcttgga 2461 actgaaccca agtgtgcccc cactccactg gaatttgctt gccaggatgc caatgagttg 2521 tagcttcatt tttcttagag actttcctgg ctgtggttga acaatgaaaa ggccctctag 2581 tggtgtttgt tttagggaac ttaggtgata acaattctgg tattctttcc cagacatgta 2641 acaagagtaa catgtgtgaa agcagcaaag aggcactggc agaaaacaac ctgaaccttc 2701 caaagatggc tgaaaaagat ggatgcttcc aatctggatt caatgaggta ccaacttgtc 2761 gcactcactt ttcactattc cttaggcaaa acttctccct cttgcatgca gtcctgtata 2821 catatagatc caggcagcaa caaaaagtgg gtaaatgtaa agaatgttat gtaaatttca 2881 tgaggaggcc aagttcaagc ttttttaaag gcagtttatt cttggacagg tatggccaga 2941 gatggtgcca ctgtggtgag attttaacaa ctgtcaaatg tttaaaactc ccacaggttt 3001 aattagttca tcctgggaaa ggtactcgca gggccttttc cctctctggc tgcccctggc 3061 agggtccagg tctgccctcc ctccctgccc agctcattct ccacagtgag ataacctgca 3121 ctgtcttctg attattttat aaaaggaggt tccagcccag cattaacaag ggcaagagtg 3181 caggaagaac atcaaggggg acaatcagag aaggatcccc attgccacat tctagcatct 3241 gttgggcttt ggataaaact aattacatgg ggcctctgat tgtccagtta tttaaaatgg 3301 tgctgtccaa tgtcccaaaa catgctgcct aagaggtact tgaagttctc tagaggagca 3361 gagggaaaag atgtcgaact gtggcaattt taacttttca aattgattct atctcctggc 3421 gataaccaat tttcccacca tctttcctct taggagactt gcctggtgaa aatcatcact 3481 ggtcttttgg agtttgaggt atacctagag tacctccaga acagatttga gagtagtgag 3541 gaacaagcca gagctgtgca gatgagtaca aaagtcctga tccagttcct gcagaaaaag 3601 gtgggtgtgt cctcattccc tcaacttggt gtgggggaag acaggctaaa gacagtgtcc 3661 tggacaactc agggatgcaa tgccacttcc aaaagagaag gctacacgta aacaaaagag 3721 tctgagaaat agtttctgat tgttattgtt aaatcttttt ttgtttgttt ggttggttgg 3781 ctctcttctg caaaggacat caataactgt attttaaact atatattaac tgaggtggat 3841 tttaacatca atttttaata gtgcaagaga 
tttaaaacca aaggcggggg ggcgggcaga 3901 aaaaagtgcc atccaactcc agccagtgat ccacagaaac aaagaccaag gagcacaaaa 3961 tgattttaag attttagtca ttgccaagtg acattcttct cactgtggtt gtttcaattc 4021 tttttcctac cttttaccag agagttagtt cagagaaatg gtcagagact caagggtgga 4081 aagaggtacc aaaggctttg gccaccagta gctggctatt cagacagcag ggagtagact 4141 tgctggctag catgtggagg agccaaagct caataagaag gggcctagaa tgaaaccctt 4201 ggtgctgatc ctgcctctgc catttctact taagccaggg tttctcatat gttaacatgc 4261 tagggaattc cctgggcatc ttcttgtggt gtggagtctg acttagcaag cctcgggtgg 4321 gtttgagggt caaatttcta ccaggcttat atccctggtg atgctgcaga attccaggac 4381 cacacttgga ggtttaaggc cttccacaag ttacttatcc catatggtgg gtctatggaa 4441 aggtgtttcc cagtcctctt tacaccacca gatcagtggt ctttcaacag atcctaaagg 4501 gatggtgaga gggaaactgg agaaaagtat cagatttaga ggccatgaag aacccatatt 4561 aaaatgcctt taagtatggg ctcttcattc atatactaaa tatgaactat gtgccaggca 4621 ttatttcata tgacagaata caaacaaata agatagtgat gctggtcagg cttggtggct 4681 catgcctgta ttccctaaac tttgggagcc taaggtgaga actccttgaa ctcctaaggc 4741 caggagttca agaccagcct ggataacata gcaagacccc atctctacaa aaaaccaaaa 4801 ccaaacaaac aaaaatgata gtggtgcttc cctcaggatg cttgtggtct aatgggagac 4861 agaacagcaa agggatgatt agaagttggt tgctgtgagc caggcacagt gctatataat 4921 cccagcgcta tgggaggctg aggtgggtgg atcatttagg ccaggagttt aagaccagcc 4981 tggtcaacat ggtaaaaccc catcttactt aaaaatacaa aaaagttagc caggcatggt 5041 ggcatacacc tgtaacccag ctactcagga ggctgaggca catgaatcac ttgaacccag 5101 gaggcagagg ttgctgtgca ccactgcact ccagcctggg tgacagaacg agaccttgac 5161 tcaaaaaaaa aaaaaagaag tttgttgcta tggaagggtc ctactcagag caggcacccc 5221 agttaatctc attcacccca catttcacat ttgaacatca tcccatagcc cagagcatcc 5281 ctccactgca aaggatttat tcaacattta aacaatcctt tttactttca ttttccttca 5341 ggcaaagaat ctagatgcaa taaccacccc tgacccaacc acaaatgcca gcctgctgac 5401 gaagctgcag gcacagaacc agtggctgca ggacatgaca actcatctca ttctgcgcag 5461 ctttaaggag ttcctgcagt ccagcctgag ggctcttcgg caaatgtagc atgggcacct 5521 cagattgttg ttgttaatgg gcattccttc ttctggtcag 
aaacctgtcc actgggcaca 5581 gaacttatgt tgttctctat ggagaactaa aagtatgagc gttaggacac tattttaatt 5641 atttttaatt tattaatatt taaatatgtg aagctgagtt aatttatgta agtcatattt 5701 atattttaag aagtaccact tgaaacattt tatgtattag ttttgaaata ataatggaaa 5761 gtggctatgc agtttgaata tcctttgttt cagagccaga tcatttcttg gaaagtgtag 5821 gcttacctca aataaatggc taacttatac atatttttaa agaaatattt atattgtatt 5881 tatataatgt ataaatggtt tttataccaa taaatggcat tttaaaaaat tcagcaactt 5941 tgagtgtgtc acgtgaagct t // LOCUS HSC1INHIB 18687 bp DNA PRI 17-JUL-1991 DEFINITION Human gene for C1-inhibitor. ACCESSION X54486 NID g29534 KEYWORDS C1 inhibitor; glycoprotein; protease inhibitor; serpin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 18687) AUTHORS Fothergill,J.E. TITLE Direct Submission JOURNAL Submitted (17-AUG-1990) Fothergill J.E., University of Aberdeen, Department of Molecular & Cell Biology, Marischal College, Aberdeen AB9 1AS, Scotland, UK REFERENCE 2 (bases 1 to 18687) AUTHORS Carter,P.E., Duponchel,C., Tosi,M. and Fothergill,J.E. TITLE Complete nucleotide sequence of the gene for human C1 inhibitor with an unusually high density of Alu elements JOURNAL Eur. J. Biochem. 197 (2), 301-308 (1991) MEDLINE 91224119 COMMENT inhibits serine proteinases C1r and C1s of the first component of human complement; belongs to the serpin class of inhibitors; defects in the gene are associated with 'hereditary angioneurotic oedema (HANE); for overlapping sequences see: X07427-33; X07577. 
FEATURES Location/Qualifiers source 1..18687 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /clone_lib="cosmid genomic" /clone="6d" repeat_region 266..542 /rpt_family="Alu" CAAT_signal 1080..1083 misc_feature 1100..1105 /note="IL-6 element" CAAT_signal 1122..1125 misc_feature 1152..1166 /note="c-myc element (H-DNA)" exon 1183..1220 /note="C1 inhibitor" /number=1 /evidence=experimental mRNA join(1183..1220,1747..1819,3378..3876,5533..5667, 9507..9710,9905..10044,15213..15432,17824..>18341) /note="C1 inhibitor" intron 1221..1746 /number=1 exon 1747..1819 /note="C1 inhibitor" /number=2 /evidence=experimental CDS join(1769..1819,3378..3876,5533..5667,9507..9710, 9905..10044,15213..15432,17824..18077) /codon_start=1 /product="C1 inhibitor" /db_xref="PID:g29535" /db_xref="SWISS-PROT:P05155" /translation="MASRLTLLTLLLLLLAGDRASSNPNATSSSSQDPESLQDRGEGK VATTVISKMLFVEPILEVSSLPTTNSTTNSATKITANTTDEPTTQPTTEPTTQPTIQP TQPTTQLPTDSPTQPTTGSFCPGPVTLCSDLESHSTEAVLGDALVDFSLKLYHAFSAM KKVETNMAFSPFSIASLLTQVLLGAGENTKTNLESILSYPKDFTCVHQALKGFTTKGV TSVSQIFHSPDLAIRDTFVNASRTLYSSSPRVLSNNSDANLELINTWVAKNTNNKISR LLDSLPSDTRLVLLNAIYLSAKWKTTFDPKKTRMEPFHFKNSVIKVPMMNSKKYPVAH FIDQTLKAKVGQLQLSHNLSLVILVPQNLKHRLEDMEQALSPSVFKAIMEKLEMSKFQ PTLLTLPRIKVTTSQDMLSIMEKLEFFDFSYDLNLCGLTEDPDLQVSAMQHQTVLELT ETGVEAAAASAISVARTLLVFEVQQPFLFVLWDQQHKFPVFMGRVYDPRA" intron 1820..3377 /number=2 misc_feature 2416..2454 /note="putative Z-DNA" exon 3378..3876 /note="C1 inhibitor" /number=3 /evidence=experimental intron 3877..5532 /number=3 repeat_region 4168..5356 /rpt_family="Alu" exon 5533..5667 /note="C1 inhibitor" /number=4 /evidence=experimental intron 5668..9506 /number=4 repeat_region 5973..9027 /rpt_family="Alu" exon 9507..9710 /note="C1 inhibitor" /number=5 /evidence=experimental intron 9711..9904 /number=5 exon 9905..10044 /note="C1 inhibitor" /number=6 /evidence=experimental intron 10045..15212 /number=6 repeat_region 11175..14726 /rpt_family="Alu" exon 15213..15432 /note="C1 inhibitor" /number=7 
/evidence=experimental intron 15433..17823 /number=7 repeat_region 15917..16541 /rpt_family="Alu" exon 17824..18341 /note="C1 inhibitor" /number=8 /evidence=experimental repeat_region 18394..18669 /rpt_family="Alu" BASE COUNT 4918 a 4535 c 4335 g 4899 t ORIGIN 1 gatcctcctg cctcagtctc ctaaaaggct gggattacag gtatgaaccc ccaggcctgc 61 ctacttcgtg tgttttatct ggcaatgctg agaggacagc tagatgtcga ggacagccag 121 atttcgagga cagccagacg tcaaggggga aagcaacttg ctggctcaca cagccagcca 181 ggtctattgt ggagccctag tcgcaaacac atacacccca gctttcaagt ttttaaatat 241 ttatttagtt agttatttgt tttttgagac agtcttgctc tgtcacccag gctggagtgc 301 agtggtgcaa tttcagctca ctgcaacctc tgcctcccag attcaagtga ttgctgcctc 361 agcctcccta gtagctggga ctacatgtgc gcgcctgcac tgcccagcta atttttgtac 421 ttttagcgga aacggggttt ccccatattg gccagagtgg tctcaaactc ctgacctcaa 481 gtgatccacc cacctcggcc tcccaaagtg ctgggattac aggcgtgaga ccgtgcccgg 541 ctagcttttt tttttttttt ttttttaatt aggttgaatg cttgttgagc tgggtgcctc 601 acatccgttg tctgttatac aaagaactac tgattatggg tctctacgag acaaaaatgg 661 cagaacacct acagggcaca ggggagatgg gagcagggat taggcttcag aatacttctc 721 caaatgaggg tgatcagggc ccgacatttc actgctctaa ggctcagtat cttaactatc 781 aaatgggtac aatggggcac aaagggttac tttgaggaat aacggaggtg agcaatttcc 841 aaaaagttca ttcctttaaa aactgcggag cacattttgt gtacatgctg tttttattgt 901 tactcatgaa gacgcggggg aagctttggt tgtgtaaagc tgagaactgc acccaagctt 961 ccccgttcac cccacctacc aggggatttg ggctaaattc ctggcctggc ccacaaagaa 1021 gaccaagcgg tcagtcccat tccgtcccat cctgatttac aggaactcac accagcgatc 1081 aatcttcctt aatttgtaac tgggcagtgt cccgggccag ccaatagcta agactgcccc 1141 ccccgcaccc caccctccct gaccctgggg gactctctac tcagtctgca ctggagctgc 1201 ctggtgacca gaagtttgga gtaggtttgg tgctgggcag gggtggggag tagggtggaa 1261 agcatggagt gaagaggtct agggaggggg tctcctcacc cccgccttcc tgcccgcctt 1321 gatctcgggg gtctctatag gcttgcttcc acctgggact tctgcctcct cctaccccag 1381 cccctcccgc ctcaggcctg ttgtgctcag ccccccagga cctcccctcc cccacgcctc 1441 tggcctcatt gtttggttaa agcaggaccc cctccccctc 
ccaccacctc ccctccgact 1501 gaacagatgg acagagaccc gggcccacgg ggagaggaag ggccagccgg tgccgggaaa 1561 gggaagcggt ttggggaaaa caaaacagag ggaggagcca gggagaaggt ggccccagga 1621 gggaggagga gggaattcgc taagagggac tggggcctga gacggaatgg gggcgggccc 1681 cgggcggggt gggggcccct gggctcccag ggtgggagct ggctccgagg ctggctggct 1741 ccgcaggtcc gctgacgtcg ccgcccagat ggcctccagg ctgaccctgc tgaccctcct 1801 gctgctgctg ctggctgggg tatgtggtcc cttgtgggat gggggacggg ggtggagacg 1861 ggaggcggga tggtgcgggg tgcgggcggt ggctgaggat taacccttca ggctccgggg 1921 aatgaggaga gctcctcttg ggatcattga gtgtgatcct tgcacacgca ctcgtagatg 1981 gtggaaagag ctcaggactc agacagatgc aagttctaat cttggcgggc catgtaggag 2041 ctgcgcacgt ggtcctgagg acgtcacttt tctcagccta ctcctcctga tttataaaat 2101 gttgaataaa cagatcccac cctgtccacc aagcaggctt ggcaggagga tcacataggg 2161 caaggagatg ggaagcattt tgtgatatca aagcaagagt ttgttgtaga aaagagctgt 2221 ggggaggaga ggtttggctg ctgtgacaca gacggtggtc ccaggctagg gcctgtgaag 2281 acagggaaga atggtagcaa acatacgtgg gtctgggtgg gggtcaatag cagtgagagg 2341 tcccagtggt ctgggaggtc atggactaga aataaggcta agaagtgcag gatcccagga 2401 gaaaggtagg taggggtgtg tgtatgtgcg tgtgtgtgtg tgtgtgcacg cgcaagttgg 2461 tatcacatac tcagcagatg atgtagttat gagcagggag tttggaatca gagaccttgt 2521 tttgacccct agctgtttct cccatttctg tgagaccctg gactagttaa tatgtgtcgg 2581 tatgagtttc tctacctgtg aaatggagct aatgcgtggt ttctcagtgg aggatctgtt 2641 gacattgggg ctggataatt cattgctgag gggctaccct gtgcatttta ggatgttttg 2701 cagtatatct ctagcctcta cgcactagat tccggtagca tcttccctgc tgtctccctg 2761 tgacaaccaa aaatgtctct ggacattgcc aaatatcccc tgggagtcaa aattgtttca 2821 gagccgctgg cctatactac ttctctctcc ctcccttcct gcctcccttc tttcctcctt 2881 ccctttcttc gttcccttcc taatgtgtac tgtttttcca cacccactaa tgggttgcaa 2941 cccacagttt aaaaatttac tgttctaaat caaggagcac aagttcaaat gtctgtacta 3001 gccaagcaag tgagtcaaac aggtagatat ttacaggaaa ttgcaagaac atagataaat 3061 gacaacactc agcttcactg tttggaaaac aacttcctac agggcagtaa ttggtcagag 3121 attacagagt ccctgactat ccctcatctt ctgcagagca cattcctgtg 
cacccccacc 3181 ctcaccctgt attgcccctt ctctgaggaa ttagtggtgg tggttctaag acagattgct 3241 catctgctgc actgtcagaa attactctct tgtacaggac attttccaca tccacacctt 3301 ctcttcctgc tttgagtatt ttagaggact gtgcctcgta gtaagaaaaa aatgaaactc 3361 agtttcttga accacaggat agagcctcct caaatccaaa tgctaccagc tccagctccc 3421 aggatccaga gagtttgcaa gacagaggcg aagggaaggt cgcaacaaca gttatctcca 3481 agatgctatt cgttgaaccc atcctggagg tttccagctt gccgacaacc aactcaacaa 3541 ccaattcagc caccaaaata acagctaata ccactgatga acccaccaca caacccacca 3601 cagagcccac cacccaaccc accatccaac ccacccaacc aactacccag ctcccaacag 3661 attctcctac ccagcccact actgggtcct tctgcccagg acctgttact ctctgctctg 3721 acttggagag tcattcaaca gaggccgtgt tgggggatgc tttggtagat ttctccctga 3781 agctctacca cgccttctca gcaatgaaga aggtggagac caacatggcc ttttccccat 3841 tcagcatcgc cagcctcctt acccaggtcc tgctcggtaa gaccctgctt gaattctctc 3901 caggtcattt gttggacact cccataagag tcaccaatcc agacacttac aaagccatgc 3961 ctctgggaag aagctgtaaa aatgggctat tatatattgg gggtggggta gagggatgta 4021 tcttttcatt cttgaacatt ccatcatttc acagtgatgt aataggcacg attgcttgta 4081 aaactctgtg actatacaag aacatataaa ataaggtcgc agccactaac catgtttcat 4141 ggcaaggaga ggtgataaga aagatgaaat taggcgcagt ggctcacgcc tgtaatccca 4201 gcactttggg aggccaaggc gggtggatca cctgaggtca ggagttcaag accagcctgg 4261 ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa ttatcaggga gtggtggtgc 4321 atgcctgtaa tcccagctac ttgggcagct gaggcaggag aatcgcttga acccaggagg 4381 tggaggttgc agtgagccga gaccgcacca ttgcactaca gtctgggtga cagagcgaga 4441 ctctgtctca aaaaaaaaaa aaattatcag agatagacct agagtagatg tggttagtac 4501 tgccttctag ctctgtgacc ttgggcagat cactttaacc tctctgagcc ttgagtcctc 4561 ttgtgtaaaa tagtgatgat gctatctacc tcaaaagatt aagaagcaga aagccaggcc 4621 gggtgcggtg gttcacacct gtaatcccag cattttagga ggccgaggag ggcagatcac 4681 gaggtcagga gttcgagacc agcctgacta acatggtgaa accccgtctc tactaaaaat 4741 aaaaagaaat tagctaggca tggtggtgca cacctgtaac cccagctact caggaggctg 4801 aggcaggaga atcacttgaa cacgggaggc agaggttgca gtgagccgaa atcatgccac 
4861 tgcactccag cctgggaaga ctgagcaaga ctctgtctca aaaaaaaaaa aaagaagcag 4921 cctagtgtct gacttagtgg gaggtcaaaa aaatgtaaat cctctgccat cttgagggat 4981 tactgtcaag tcccatttgg taattaccct aggaatggca caaacaaatt actacaagca 5041 gtggggacag agctattact ccccagagag aattctaaaa aggctacaga atctttcttg 5101 gctgggcacg gtgactcaca cctataatcc tggcactttg ggaggccaag gcaggagttc 5161 aagaccagcc tggccaacat gttgaaaccc catctctact aaaaatgcaa aaattagcca 5221 ggcatagtga tgcatgctta ttgtcccagc tacttgggag gcagaggtgg gaggattgct 5281 tgaacctgga gattcaagtg agctgagatt gcaccactgc attccagcct gggccaacaa 5341 agcaagactc tgtctcaaaa aaaaaaaaaa aaaaaaaaga gagagattga gagaacattc 5401 cagctcagat gatctgtgat cccctccaaa gcagggaata ccctccattc cagcctggtc 5461 cccaaccctc attcccaagg aaggcccccg actcatcctg caagtatctt tcatctctgc 5521 cctttgttgc aggggctggg gagaacacca aaacaaacct ggagagcatc ctctcttacc 5581 ccaaggactt cacctgtgtc caccaggccc tgaagggctt cacgaccaaa ggtgtcacct 5641 cagtctctca gatcttccac agcccaggtg agtgcccagg aatgggcagt gtctgcagag 5701 gagggtcctg agaggactct gaagggggac ccagcgctgg ggaaagaaag gacagaggga 5761 atgttggagc tacagtatca gggatggact gcagagcagg tgaagacctt ggcaggagca 5821 ttaggtcact ccaggaacta gactgttctt ctaatgagac cttagacaag tctctggcat 5881 tcatcaactg ctttagaata aaaataaccg ggcaggtaca gtaaaatagt gatgatgcta 5941 tctacctcaa aagattaaga agcagaaagc caggctgggc gtggtggctc acacctgtaa 6001 tcccagcact ttgggaggcc gaggcaggtg gatcacgagg tcaggggttc gagaccagcc 6061 tgaccaacat ggtgaaaccc tgtctctact aaaaatacaa aaattagctg ggcatggtgg 6121 cgggcacctg taatcccagc tattcaggag gctgaggcag gagaattgct tgaacctggg 6181 aggcggaggt tgcagtgagc cgagatgacg ccactgcact ccagcctggg cgacagagca 6241 agactccgtc tcaaaaaaaa aaacaaaaac aaaacaaaaa caaaaaaaaa aaacaaagaa 6301 ggagaaagcc gggccgggca tggtggttct catctgtaag ttcaaggagt tgaaggtatg 6361 ctaggacttt gggaggccaa ggccttcaag accagcctgg gcagcatggc gaaacctgtc 6421 tccattaaaa aaaaaaaagt tgggggtacg gctgggcatg gtggctcaca cctgtaatcc 6481 cagcactttt gggaggctga ggtgggtgga acacctgagg tcaggagttc aagaccagcc 6541 
tggccaacat ggcaaaaccc tgtctctatt aaaaacacaa aaattagcct ggcatggtgg 6601 caggcgccta taatcccaac tactcaggag gctgaggcag gagaatcgct tgaacccaag 6661 agagtgaagg ttgcagtgag ctgagatcat gccacttcac tccagcctga gtgaaacagc 6721 aaaactctgt ctcaaaaaaa aaaaaaggaa gaaagaaaaa aggccaggcg cggtgactca 6781 cgcctgtaat cgcaacactt tggcaggccg aggcaggcga ttcacaaggt caggagttcg 6841 agaccagtct ggctaactaa catagtgaaa ctccgtctct actgaaaata caagaaatta 6901 ccctggcatg gtggtgtgca cctgtaatcc cagctactca ggaggctgag gcaggagaat 6961 cgcttgaacc tgggaggcag aggctgcagt gagccgagat cgcgccactg cactccagcc 7021 tggatgacag agcaagactc tgtctcaaaa aaaaaaggcc gggcgcggtg gctcatgcct 7081 gtaatcccag cactttggga ggctgaggcg ggcggaccac gaggtcagaa gatcaagacc 7141 atcctggcta acaagatgaa accctgtctc tgccaaaaaa atacaaaact tagccgggca 7201 tggtggcagg cgcctgtggt cccaactact tgggaggctg aggcaggaga atggcatgaa 7261 cccgggaggc ggagcttgca gtgagccgag attgcgccac tgcactccag cctgggcaac 7321 agagcgagac accatctcaa aaaaaaaaaa aaaaaatggg gcgggcgggc caggcgcggt 7381 gtctcacacc tgtaatccca gcactttggg aggctgaggt gggcggatca cttgaggtca 7441 ggagttcaag accagcctgg ccaacaccag gaaaccctgt ctctgttaaa aatacaaaaa 7501 ttagccaggt gtggtggcac gcgcctgtag tcccagcagg agaataactt caacctggga 7561 gacagaggtt gcagtcagct gagatcgcac cactgcattc cagcctgggt gacagaccga 7621 gactctgtct caaaaaaaaa aaaaaagaag aatacccata tgcattcatt aatatatagg 7681 gctagagggc tagagagcta tagacataaa atagacaaaa aatttttttg ctcatttttg 7741 ggtcaaagga gtcttgggac tctaattctt ttaatttttg tgttatgtga atttgttatc 7801 atttacatgt attatgttat taagtaggta ataatgataa tactaataat aaacttacaa 7861 aacgatccaa tgtagttgtt ttcagacttt gttcctcgga gccctaggac tttgcaaagc 7921 tgtttctgga gtcatggtgg gagtggggtg tggggagcct gagcagtggg gaagggttca 7981 ccacttcctg tgagagggaa acggaccagc tgggctctga gctccccacc cagcctcagc 8041 cagggtccat tttatattgt gggcttcaga catgcctttg tttgaaagca gttctgctgc 8101 tttaaaatgt ttgataacca ttgaactaat ctacccacct ccctttttaa aaaaaaaagg 8161 aaaacttttt tttaagctgt tttttttgtt ttttgttttt ttttaaactg taaaaacaat 8221 acttaggtct 
ggctcagtgt gcctgtaatt ccaacactga gaggctgagg taggaggatt 8281 gcttgaggct agaatttcga gactagcctc tgggcaacat agacagacct catctctaca 8341 aaaaatttta aaaattatcc gggtgtggta cgtgtctgta gttccagcta tgaaggactg 8401 agacgggagg attgcttgag ccaagaattc gaagttagag tgagctatgt ttatgcctct 8461 gtactctagg ttgggtaaca gagtaggacc ctgtctctaa aaaaaaaaaa atttaattta 8521 aattttaaaa actcacctat caccccatca tctgaaaaac accctctttc agatgtggga 8581 aaaacatggc tcagaggtgt tcaagaaatg ttgcatttat ttattactta caaagctgta 8641 agtcagcttt atccgctgtc ttgctaggtt ggtgaaaaat acgtaacagc attcaatatt 8701 agggcttcta tccccatcta gctgcacact ggagctgtgc caagcaaggt tcttggtctt 8761 atgttgtact taggcatcag caaagcctgg gaaaatatct tttttttttt tttgagacag 8821 aatctcgctc tgtcgcccag gctggaatgc agtggcacca tctcagctca ctgcaacctc 8881 cgtctcccag attcaagtga ttctcctgcc tcagcctccc gaatagctgg gattacaggc 8941 tggtcttgaa ctcctggcct ccagtgatcc acccaccttg gcctcccaaa gtgctgcgat 9001 tacaggtgtg agccaccatg cccagctggg gaaaatatct ttaatctact tagtcctaga 9061 aaaattatct ttcaagtgct tttttttttt ttttttcagt ggactgaagg cttttgcaga 9121 gcaaaagcca ggaagttttg tcacaaactt ccaccaaggc caggaagcag agtctattcc 9181 gtgccaaaat taagggaaga aaagagggaa aatctaggca aatggaattt aatatctcaa 9241 aaagattatc ttgccgagcc atagtcacat agcaagctga ggcagcggag gcttggggct 9301 acttctcttg ttcttggttc tgggtttacc ttctttgggc cttatttgcc acatctgtaa 9361 gaggagtggg ctggaccgcg ctccaccatg ccgtattcac taagtgagca gatagaacca 9421 tagaaagcat gctcactctc aaatcgtgct catggaaaga acgacgtgtt caggactcat 9481 gcctcccttt ctcaacatac ccccagacct ggccataagg gacacctttg tgaatgcctc 9541 tcggaccctg tacagcagca gccccagagt cctaagcaac aacagtgacg ccaacttgga 9601 gctcatcaac acctgggtgg ccaagaacac caacaacaag atcagccggc tgctagacag 9661 tctgccctcc gatacccgcc ttgtcctcct caatgctatc tacctgagtg gtaagggtgc 9721 ccttagccag ttagtcttcc cattctgggt ccttcttccc ctcctggctt caaagcccac 9781 ttaaccccaa gttctacaat cggatctcaa tgtccctgca ctactctttg ctaacaaggc 9841 ttttagctcc tcttcatcct tttcctacct gcattagagc aaccctccca cctcttccct 9901 ctagccaagt ggaagacaac 
atttgatccc aagaaaacca gaatggaacc ctttcacttc 9961 aaaaactcag ttataaaagt gcccatgatg aatagcaaga agtaccctgt ggcccatttc 10021 attgaccaaa ctttgaaagc caaggtaagt tcttaacctt tccttctcct gtttgaaacc 10081 tacttgagtc tcctgacttt ttttctgctg tagtcccatc attttggggt acatgcttac 10141 aaattcatca cttctactcc ttccatctgt atttccaccc tatcttttct ccttctcctt 10201 tctctagcct tggccaaggc agacattgta tattttaggc tggaaacagg attctcaagt 10261 ttcttcatgc atctctacta tattgaatgg caaaatgtga gtcgtgttcc tattctacac 10321 atctgtttct ttttccagtc aactcatcag aagactctgg tggcttagca agtcctgtgt 10381 gtatgtgggt acctgtgtac agcatacata tacatgtgta gacgtgtata agtatatcct 10441 tgtatatgca tatatgtggt aagtgcatgt gtgcacatga gttaaatggc ttaagtgcat 10501 tagcagaaat tgaacaattt agtttgatga gtagggaggg agagaagaca gaatatggag 10561 cccagaggat cccaactggc aagctatctc atccccagtt accagatatt tccacctttt 10621 taggatggca aaacacagac ctatgagaca gaaaacagta agagacaatg tctcaatacc 10681 tggctgagaa acaaaggctc tgttcttgga acatgctagg cttggttcca cattgggagc 10741 tttccttagc ttttgtttct gcctgtatcc ctcttcactc agatcctcac atagcttgtt 10801 ccttcttgtc atttgggtgt cagctcagag gtcaccttgg tggagaagtc ttctccccat 10861 taaagcagcc tcctcctcac cctgtaatat cactcctgtt ttattatatt cattacactc 10921 atcaccatct gaaattatgt catttatttg cttatttgcc taggacattg tctgtctccc 10981 taagatactg tctctctact tgaatacagg aacctgtctt gtttccttct gtatctgtag 11041 tgcctagaac aatgcctcct atcataggtg ctcattcagt gtttgctgaa taaatgaatg 11101 aatgaaatag agggtgccaa agagagctac tcagtaaaag tggctggaag acttcataga 11161 ggagggtttt tttcgagaca gggtctcact ctgtcactca agctgcagtg cattgacatg 11221 atctcagctc actgcaatct ctgcctcccg agttcaagca gttctcctgc ctcagcctcc 11281 caagtagctg ggactacagg agcccgccac cacacctggc taatttttgt atttttagta 11341 gagacggggt ttcaccatgt tagccaagct ggtctcgaac tcctgacctc aagtgatcca 11401 caccctcagc ctcccaaagg gctgggatta caggcatgag ccacagcgcc cagctgagga 11461 ggttattttt gaccattctt tgctcagtta tttatccact catggactta tccacccatc 11521 tatccatccc tatctaatgt atttgaccat tcatgactca tttagcagaa ccaggacagc 11581 
ctgtataagg cacagaagag tgaatatatg tgacaggtcc agagcattgt tcaaattcaa 11641 gatacgttgg gggaggtata aatatgagat ttttttcaaa tcaattggaa tggctattta 11701 atcaaagcca tagatgttag gtggaaagta ggaattaaga acttaacaag ttaggtttaa 11761 atgtaactat tcgaattata agaacaagta tttcctctct aaaaatgttt ttaaagtttc 11821 acacaaaact ctaaaccaaa aatcattgta aaaatccctt ggagtagact gttgaaacca 11881 ttgaggtgag gtgataggtt aaggtgagca ttctggacat ataatatcaa taataaatat 11941 tcaaatacaa agcaggtata atatgaagag taatttctat ctccccaagc caatatgagt 12001 tttgtttgtt tgtttgtttt tcttttgaga taaagtctca ttctgttgcc caggccggag 12061 tgcagtggca caatcccagc tctctgtaac ctctgcctcc caggttcaag cgattctcct 12121 gccttagcct ccctagtagc tgggattata ggtgcctgcc accactccca gctaattttt 12181 gtatttttag tagagatggg gtttcaccat gttggccagg ctggtcatga actcctgacc 12241 tcaagtgctc gtccgtcttc ggcctcctaa agtgctggca ttacaggcgt gagccactgc 12301 acccagcctg agtttttttt ttttttttaa gccaaaggag gaggagtagt tttgtgcaca 12361 aaagccacta tttagagttc cttaggtttt aaacagataa ttaatttaat gtactaacaa 12421 atgcaaaatt aaaattttac ttatgccttt taattttaag ctgaacttgc tacattaaga 12481 gtagggtgat ttggctgggc gcagtggctc atgcctataa tcccagcact ttaggaggct 12541 gaggcaggtg gatcacttga agctaagagt ttgagaccag cctggccaac atggtggaat 12601 tctgtctcta ctaaaaatac aaaaattagc caggcatggt ggcacatgtc tgtaatccca 12661 gctactcagg atgctaaggc acagaatcac ttgaacctgg gaggcagagg ttacagtgag 12721 ccaaaatcgc gccactgcac tccaacctgg gcaacacagc aagactctgt ctcaaaacac 12781 aaaaaaagaa taggctgggt acagtggctc acgcctgtaa tcccagcact ttgggaggcc 12841 gaggcaggtg aatcacctga ggtcaggagt tcgagaccag cttggccaac attgtgaaac 12901 cccatctcca ttaaaaatac aaaaaataag ccgggtgtga tggtgcacgc ctgtaatccc 12961 agatacttgt gaggctggga cagaagaatc gcttgaaccc gggaggcgaa ggttgcagtg 13021 agccgagatc atgccactgc attccagcct gagcgacaga gtgagacctt gtctcagaaa 13081 aaaaaaaata gggtgatttt tatataaaaa cactctcatt tgttcttgaa ttaattgcca 13141 tattgaacag attaaactcc tttaaaccat tttttctatc tacaaacata attagtattc 13201 ccataagcat tttaagctcc ctttaaaaat ttaatatgct cgattttctt 
ttttctttta 13261 tttatttatg tagttttccc agacagggtc ttactctatc acccagtctg gagtacagtg 13321 acacaatcct cacttactgt aacctcaagc tccttgggat gaagcagtcc tcccacccta 13381 gcctcttaag tagctgagat tccaggcatg agccacctca cccagctgat tttcttttta 13441 aacacatcta atacctgaca ggtcgtaatt acgactgtgc ttggccatat catcctaaca 13501 cctaaagctc acttgtaaac tgaaaattaa atttgaaggg tgaattaact tctagccaac 13561 attccaatat cattctcaca tttaatgaaa ttatcctaca actttgctta aaagcataaa 13621 actagcatgt cagagtctct taaaaattac aatcactatt ttaaataaca aacatacagg 13681 ttaccctgaa cttaacacct ggttttaaca tgatgaagtt ggtttctttt gtcattctta 13741 tactaaattc taaatcttcc tggcacttaa ggatatttac ccaaggaagg gggtgagatt 13801 taccggctca ggtacaccga ggttactggc ttgaaaattc aagaccacag catggtaaat 13861 ttttcttcag agattcaagg ttcatacata ggacttgaag ggcacatgct actcaaatga 13921 gtatatccaa cttgagtgct acatcaacag ggaccccatg tccactcttt tcagtcgaaa 13981 ttggcttact atctacttct tctgggctat tcaggttccg acaattgatt gctttcagtt 14041 aatttcttta ttgcccagtt gaattctgca ttcttctaga ttctttttgt tccttcagcc 14101 tcaatcataa taccatttaa tgacagtctc atatcctatt agactagggt ttctcagcct 14161 tggcactatt gacatattgg gtcagataat tctttgtcct gtgcattgta gaatttagca 14221 tcttccttag cttctactca ctagaagcca gtagcactgt gtgcacatcc tccttgcccc 14281 ctgccagcta tgacaaccaa aatgtctcca gacattacca aatgtccact gagaggcaaa 14341 ttcactccga gttgacaacc cctgtattaa acccaggggc cctgcacttc ttacatactt 14401 ttgtctggtc accagttgaa agtccttctt cttttttttt tctctttttg agatggagtc 14461 tcgctctgtc accaggctgg agtgcagtgg catgatctcg gctcactgca gcctccacct 14521 cccgggttca agcaattctc ctgcctcagc ctcctgagta gctgggatta caggcacgtg 14581 ccaccaccca gctaattttt gtatttttag tagaggtggg gtttcatcat ggtggccagg 14641 atggtctcaa tctcttgacc tcgtgatccg cccacctcgg cctcccaaag tgctgggatt 14701 acaggtgtga gccaccgtgc atgaccaaaa gtccttctaa tagcaaagtg gaaaggtagg 14761 ctgtgaacaa tcttgaatgc catactaaga tttgtactta ttctcatagg cagtgaagag 14821 ccatcaatag tttttacaga ggctgacctg ataactctgg cggttcaatg tgcatgctat 14881 atactatatg cacaacacat taggaaggaa 
gggaacagcg ctcagagaaa tcaaatcact 14941 tgcccatatt acatagctcg tcagtggtgg agtcagggta taatcagaaa ctcatgctcc 15001 ttcctctacc tctccccagc acagagttgg atcagcactg ttcacccagc tggtatcccc 15061 atttcatggt aaagcctgga agcttaggtc tgactgatgc ttgttgaaat acagactgtg 15121 ggagcagact gcaggacagc attgtgacag agggtggggc caggagagag atgcggtagg 15181 aagactgtta agatgcatct cttattttct aggtggggca gctgcagctc tcccacaatc 15241 tgagtttggt gatcctggta ccccagaacc tgaaacatcg tcttgaagac atggaacagg 15301 ctctcagccc ttctgttttc aaggccatca tggagaaact ggagatgtcc aagttccagc 15361 ccactctcct aacactaccc cgcatcaaag tgacgaccag ccaggatatg ctctcaatca 15421 tggagaaatt gggtgagctc tggcagctta gggttactcc caggccatca gaggagaaag 15481 ggggatccct aagatgtagt tagcattctc tagagtattt tttacatcca taatctcagt 15541 ttgtcctgca accctgcaag ttagaggggt aggtgctatt atcccattgg atcattgtgg 15601 aaatggaggc tcagaagctt tatgtgactt acccagagat tccatgattt acccttgagc 15661 tagtcagagg caaaccagaa caccgaccca ggtctccagc tctcctgagt tttttctatg 15721 gttccttgtg acataggcag tggaacagtg ggacaggtaa ccgaggtgaa ttatggatgc 15781 tccttcccca gacacatttc aaaacagtca gccaccccgt ggaatatgca agaagctatc 15841 tggaagtgca gatctggagg cttctgagtt tacgagaggc aacagagact ccattttctt 15901 tttttttttt ttttttgaga cggagtcttg ctctgtcacc caggacaggc tggagtgcag 15961 tggtgcgatc tctgctcact gcaagatccg cctcccagat tcacgccatg ccattctctc 16021 gcctcagcct cccgagtagc tgggactaca ggtgcccgcc accacgcctg gctaattttg 16081 tttttgtatt tttagtagag acggggtttc actgtgttaa ccaggatggt cacaatctcc 16141 ggaccttgtg atccacctgc ctcggcctcc caaagtgctg ggattacagg catgagtcac 16201 cgcgcccagc ccagagactc cattttctaa cctttgtttt tttgtttgtt tgtttttgag 16261 acggagcctc actctgtcac ccaggctgga gcgcagtggc atgatcttgg gtcactgcaa 16321 cctccgcctc ctgggttcaa gtgattctcc tacctcagcc tcccaagtag ctgggattac 16381 aggcacctac caccacgccc agctaatttt tttgtatttt tagtagagat ggggtttcac 16441 catgttggtg aagctggtct tgaactcctg ggctcaagtg atccacccac ctcggcttcc 16501 caaagtgctg ggattacagg tgtgagccac tgtgcccagc ctcattttct aacctttgat 16561 ctcatgtcca 
gccctgtcac ttcatttctt gttagagaat ttgaccgcct aacacatgat 16621 tccatttcct tgtatatgtc atctgtatag gaagaaaacc tcttttttat tttcattcaa 16681 cacttctggc caccaaacag gtgtgggagt gttttctaag caattctcca gttatctggg 16741 gacaccaact gggtgtccta caattgaatc caattctaac actctacctg gaaataacat 16801 cagatcccgt gcagtttaag tgctccgccc cagcaaattt ccccccgttt ccaatgtcag 16861 ttacaagact gggttgtcac ctgtgcatca gaccaaccag ctataatttg gagattccca 16921 caactccctg ctagggtttc atcatttgct agaatagctg acaaaactca ggagacactt 16981 acatttactg gtttattatg taaaggatta agtaaaggat acagatgaac agccagttga 17041 agagatacat agggcaaaat ctggaagggt cctgggccca gcagcatcta tcctgtggag 17101 ttctgtttgc taccttcctg acacatgtat gtgttcacca accagaaact ctcagaaccc 17161 tgtactttag gagaagtcat catgtaggca tgattgatta ttacccaatc tccagctcct 17221 ctcttctccc cagaggatgg agagaggggc tggaaagttc caagcttcta atcacacttg 17281 gtctttctgg tgaccagccc caatccagga gccagcagga gtagcctcat tagcacaaaa 17341 ggcattccta tgacccagga aattccaagg gatctagtag ctctgtgtca ggaactggga 17401 tcggagacca aatattagat ggtctaatat ttggagctaa agatgctcct atagcatatt 17461 ttcccattat ttcacagcat catggcattg catataacct cattttttca tatatgacct 17521 atctctgatc tattacctca cttccaaggt cctgggagat aaggaggcaa tgacatcatt 17581 tttcagggtt gttttaggga tgaggggcaa ggcctctcac cttgagagag agtggcagtg 17641 ttgggcccac taatagagga tcccacgaac tgccagaggg tacagtatgt aatctggcaa 17701 acaagggaag aggaagagag cactgggact caggatgaac ccagagaatt caggacaaag 17761 gtctccatca gctgagggta tcatgctggc ttctgactct gtttttctct ggttttgccc 17821 tagaattctt cgatttttct tatgacctta acctgtgtgg gctgacagag gacccagatc 17881 ttcaggtttc tgcgatgcag caccagacag tgctggaact gacagagact ggggtggagg 17941 cggctgcagc ctccgccatc tctgtggccc gcaccctgct ggtctttgaa gtgcagcagc 18001 ccttcctctt cgtgctctgg gaccagcagc acaagttccc tgtcttcatg gggcgagtat 18061 atgaccccag ggcctgagac ctgcaggatc aggttagggc gagcgctacc tctccagcct 18121 cagctctcag ttgcagccct gctgctgcct gcctggactt gcccctgcca cctcctgcct 18181 caggtgtccg ctatccacca aaagggctcc ctgagggtct gggcaaggga cctgcttcta 
18241 ttagcccttc tccatggccc tgccatgctc tccaaaccac tttttgcagc tttctctagt 18301 tcaagttcac cagactctat aaataaaacc tgacagacca tgactttctc tgctttgcct 18361 ttgttttttt attttttatt ttttattttt tttaggcagg gtctcaatct gtcacccagg 18421 ctagagtaca gtggtgagat catggttcac tgcagcctta acctcctgag ctcaaacaat 18481 cctcccatct cagcctccta tgtaggtggg accacatggg cttgccacca tgcccagcta 18541 atttttaaat tttttgtaaa gacagggtct tactatgttg cctaggctgg tctccaactc 18601 ctgggctcaa gcaatcatcc taacacagcc tcccgaagtg ctgggattgt aggcacacca 18661 tacctctcct ttgcctttga atatcag // LOCUS HSCBMYHC 25000 bp DNA PRI 14-SEP-1993 DEFINITION Human gene for cardiac beta myosin heavy chain. ACCESSION X52889 NID g29726 KEYWORDS beta myosin heavy chain; cardiac beta myosin heavy chain; myosin heavy chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 25000) AUTHORS Liew,C.C. TITLE Direct Submission JOURNAL Submitted (16-MAY-1990) Liew C.-C., University of Toronto, Dept. of Clinical Biochemistry, 100 College Street, Banting Institute, Toronto, Ontario M5G 1L5, Canada REFERENCE 2 (bases 1 to 25000) AUTHORS Liew,C.C., Sole,M.J., Yamauchi-Takihara,K., Kellam,B., Anderson,D.H., Lin,L.P. and Liew,J.C. TITLE Complete sequence and organization of the human cardiac beta-myosin heavy chain gene JOURNAL Nucleic Acids Res. 18 (12), 3647-3651 (1990) MEDLINE 90301496 REFERENCE 3 (bases 1 to 25000) AUTHORS Kristensen,T., Lopez,R. and Prydz,H. TITLE An estimate of the sequencing error frequency in the DNA sequence databases JOURNAL DNA Seq. 2 (6), 343-346 (1992) MEDLINE 93075997 REMARK Erratum:[DNA Seq 1993;3(5):337]] REFERENCE 4 (bases 1 to 25000) AUTHORS Fougerousse,F., Dufour,C., Roudaut,C. and Beckmann,J.S. TITLE Dinucleotide repeat polymorphism at the human gene for cardiac beta-myosin heavy chain (MYH6) JOURNAL Hum. Mol. Genet. 
1 (1), 64 (1992) MEDLINE 93244759 REFERENCE 5 (bases 1 to 25000) AUTHORS Warlick,C.A., Ramachandra,S., Mishra,S. and Donis-Keller,H. TITLE Dinucleotide repeat polymorphism at the human cardiac beta-myosin heavy chain gene (HMSYHCO1) locus JOURNAL Hum. Mol. Genet. 1 (2), 136 (1992) MEDLINE 93244775 COMMENT See for overlapping sequence. Data kindly reviewed (26-JUL-1990) by Liew C.C. FEATURES Location/Qualifiers source 1..25000 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cardiac" /cell_type="myocyte" CAAT_signal 2456..2461 TATA_signal 2510..2514 CDS join(4394..4594,4897..5040,5328..5484,5616..5643, 6245..6353,6436..6528,6630..6693,7147..7245,7463..7566, 8215..8353,8779..8897,9024..9173,9455..9625,10220..10529, 10809..10876,11251..11338,12037..12154,12303..12426, 12692..12828,13086..13341,13967..14209,14394..14570, 15754..15899,17029..17119,17838..18227,18463..18589, 18777..18895,19065..19261,19786..19969,20153..20318, 20479..20603,21157..21465,21638..21838,21962..22087, 22198..22473,23358..23453,23566..23700,24227..24244) /codon_start=1 /product="cardiac beta myosin heavy chain" /db_xref="PID:g29727" /db_xref="SWISS-PROT:P12883" /translation="MGDSEMAVFGAAAPYLRKSEKERLEAQTRPFDLKKDVFVPDDKQ EFVKAKIVSREGGKVTAETEYGKTVTVKEDQVMQQNPPKFDKIEDMAMLTFLHEPAVL YNLKERYGSWMIYTYSGLFCVTVNPYKWLPVYTPEVVAAYRGKKRSEAPPHIFSISDN AYQYMLTDRENQSILITGESGAGKTVNTKRVIQYFAVIAAIGDRSKKDQSPGKGTLED QIIQANPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKLASADIETYLLEKSRVIFQ LKAERDYHIFYQILSNKKPELLDMLLITNNPYDYAFISQGETTVASIDDAEELMATDN AFDVLGFTSEEKNSMYKLTGAIMHFGNMKFKLKQREEQAEPDGTEEADKSAYLMGLNS ADLLKGLCHPRVKVGNEYVTKGQNVQQVIYATGALAKAVYERMFNWMVTRINATLETK QPRQYFIGVLDIAGFEIFDFNSFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEW TFIDFGMDLQACIDLIEKPMGIMSILEEECMFPKATDMTFKAKLFDNHLGKSANFQKP RNIKGKPEAHFSLIHYAGIVDYNIIGWLQKNKDPLNETVVGLYQKSSLKLLSTLFANY AGADAPIEKGKGKAKKGSSFQTVSALHRENLNKLMTNLRSTHPHFVRLYHPNETKSPG VMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRILNPAAIPEGQFIDSRK GAEKLLSSLDIDHNQYKFGHTKVFFKAGLLGLLEEMRDERLSRIITRIQAQSRGVLAR 
MEYKKLLERRDSLLVIQWNIRAFMGVKNWPWMKLYFKIKPLLKSAEREKEMASMKEEF TALKEALEKSEARRKELEEKMVSLLQEKNDLQLQVQAEQDNLADAEERCDQLIKNKIQ LEAKVKEMNERLEDEEEMNAELTAKKRNVEDECSELKRDIDDLELTLAKVEKEKHATE NKVKNLTEEMAGLDEIIAKLTKEKKALQEAHQQALDDLQAEEDKVNTLTKAKVKLEQQ VDDLEGSLEQEKKVRMDLERAKRKLEGDLKLTQESIMDLENDKQQLDERLKKKDFELN ALNARIEDEQALGSQLQKKLKELQARIEELEEELEAERTARAKVEKLRSDLSRELEEI SERLEEAGGATSCQIEMNKKREAEFQKMRRDLEEATLQHEATAAALRKKHADSVAELG EQIDNLQRVKQKLEKEKSEFKLELDDVTSNMEQIIKAKANLEKMCRTLEDQMNEHRSK AEETQRSVNDLTSQRAKLQTENGELSRQLDEKEALISQLTRGKLTYTQQLEDLKRQLE EEVKAKNALAHALQSARHDCDLLREQYEEETEAKAELQRVLSKANSEVAQWRTKYETD AIQRTEELEEAKKKLAQRLQEAEEAVEAVNAKCSSLEKTKHRLQNEIEDLMVDVERSN AAAAALDKKQRNFDKILAEWKQKYEESQSELESSQKEARSLSTELFKLKNAYEESLEH LETFKRENKNLQEEISDLTEQLGSSGKTIHELEKVRKQLEAEKMELQSALEEAEASLE HEEGKILRAQLEFNQIKAEIERKLAEKDEEMEQAKRNHLRVVDSLQTSLDAETRSRNE ALRVKKKMEGDLNEMEIQLSHANRMAAEAQKQVKSLQSLLKDTQIQLDDAVRANDDLK ENIAIVERRNNLQAELEELRAVVEQTERSRKLADRELIETSERVQLLHSQNTSLINQK KKMDADLSQLQTEVEEAVQECRNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMKKN MEQTIKDLQHRLDEAEQIALKGGKKQLQKLEARVRELENELEAEQKRNAESVKGMRKS ERRIKELTYQTEEDRKNLLRLQDLVDKLQLKVKAYKRQAEEAEEQANTNLSKFRKVQH ELDEAEERADIAESQVNKLRAKSRDIGTKGLNEE" exon 4394..4594 /number=3 intron 4595..4896 /number=3 exon 4897..5040 /number=4 intron 5041..5327 /number=4 exon 5328..5484 /number=5 intron 5485..5615 /number=5 exon 5616..5643 /number=6 intron 5644..6244 /number=6 exon 6245..6353 /number=7 intron 6354..6435 /number=7 exon 6436..6528 /number=8 intron 6529..6629 /number=8 exon 6630..6693 /number=9 intron 6694..7146 /number=9 exon 7147..7245 /number=10 intron 7246..7462 /number=10 exon 7463..7566 /number=11 intron 7567..8214 /number=11 exon 8215..8353 /number=12 intron 8354..8778 /number=12 exon 8779..8897 /number=13 intron 8898..9023 /number=13 exon 9024..9173 /number=14 intron 9174..9454 /number=14 exon 9455..9625 /number=15 intron 9626..10219 /number=15 exon 10220..10529 /number=16 intron 10530..10808 /number=16 exon 10809..10876 /number=17 intron 10877..11250 /number=17 exon 
11251..11338 /number=18 intron 11339..12036 /number=18 exon 12037..12154 /number=19 intron 12155..12302 /number=19 exon 12303..12426 /number=20 intron 12427..12691 /number=20 exon 12692..12828 /number=21 intron 12829..13085 /number=21 exon 13086..13341 /number=22 intron 13342..13966 /number=22 exon 13967..14209 /number=23 intron 14210..14393 /number=23 exon 14394..14570 /number=24 intron 14571..15753 /number=24 exon 15754..15899 /number=25 intron 15900..17028 /number=25 exon 17029..17119 /number=26 intron 17120..17837 /number=26 exon 17838..18227 /number=27 intron 18228..18462 /number=27 exon 18463..18589 /number=28 intron 18590..18776 /number=28 exon 18777..18895 /number=29 intron 18896..19064 /number=29 exon 19065..19261 /number=30 intron 19262..19785 /number=30 exon 19786..19969 /number=31 intron 19970..20152 /number=31 exon 20153..20318 /number=32 intron 20319..20478 /number=32 exon 20479..20603 /number=33 intron 20604..21156 /number=33 exon 21157..21465 /number=34 intron 21466..21637 /number=34 exon 21638..21838 /number=35 intron 21839..21961 /number=35 exon 21962..22087 /number=36 intron 22088..22197 /number=36 exon 22198..22473 /number=37 intron 22474..23357 /number=37 exon 23358..23453 /number=38 intron 23454..23565 /number=38 exon 23566..23700 /number=39 intron 23701..24226 /number=39 exon 24227..24244 /number=40 misc_feature 24630..25000 /note="putative VECTOR sequence M13mp18" /citation=[3] BASE COUNT 5912 a 6790 c 6491 g 5807 t ORIGIN 1 gaattcatgc atgcccctgg cccctcactc tccccacaag gaatgtgcat gtgtgcaaag 61 ggtaggcctg gatgccattt ctacaaacag ttgactaaag cagtccagag cagggatctt 121 tctttgggtc attgttatgt ccttattgac cagaataatg cctggcacac agtaggcact 181 caataaatac ttgttgaatg gaggaatctg caaagctgct cagtcttcag acagcagcaa 241 gcagcaagga agagaatcat gatgtaacca agagcaggtg ctggaggagc tagggagtga 301 gtggggatct ttaaggacag caggccctct gatctgtctg agctgtgagg caaagcctag 361 ttagggaggt gagttctatc tgcaatgaga gggggcatgg gtggagaaca gtaacaccaa 421 acaccaacag accagagcct cccaaacccg 
aaaggagaga aacaaagtgt gattgatttc 481 ccaaatgcag aaaagactac ccaagtcccc agccaaggaa ggaggctggc ctgtggtgaa 541 gacactggct ttgaaatggc tcaacatcgt cccaagcacc ctttgtgcat cttaattgcc 601 ctgggaaaca tgggaggggt ggcagtttgg agcttggaaa gctggaaggt ggggacgaaa 661 ggagtttttg cccctcatcg ccaatccttg aggttcctac agggaacctc agacaacagc 721 tgtcttaggg gtttccttcc agctccaagc agctgtatta tctatagctt tgagagggtc 781 tgtctctgtc aatggggctg attcctcagc acctggcagg gcctgggggg ggactgccag 841 agtctgttct gccgggcatg ttagtagcac agactactgg gataacctca gcaactgtat 901 acctttagca ctgcagagct gcggtgttgt cagacacagc ccttgcagtt cgtggtctac 961 ccccacttgc ctgcaccccc cttatagctg ctcaatgtcc tcagactgta ccttagtgcc 1021 ttggttacca agcaaccccc acagaggcct cctcctggat tttcccttcc ctgctatgcc 1081 cacccccacc ttcaaaaatg tacctgctgc acccatcccc caggcctgcc tcactccttc 1141 caagctgaca gctaatcaga tgggactaac tccatgcgcg gaagagctgt gcttgggaag 1201 aagggggaga tttcacacca aatcgtgcag tggttgccta gacacggagc cagtcaagct 1261 tgcatgtttt gttgctagaa gagcatggca tcggtggtgt tctgtgctcg caggctgggc 1321 ccagctccct ttcctcccct cctttgtggc tgttcctcag gcctgcagag gggaagattt 1381 ttcctgtcct gttactctgc ctcacttgcc agaacactag tcgcctccag ctctggataa 1441 agctgaggct gggtgggcca gacaacagct gtcttagggg tctccttcca gctccaagta 1501 gttgtattat ctatagctct gagagggtct gtctctgggg ctcaggcctt tagtacctgt 1561 ggggcagagc aggaacgtga gtcaggcaaa ccgctccaaa ctaggaccaa tgcagctggg 1621 tcctgctctc ctgcgcatag ggggacccca cataggggga ccccacatct ttggaatccc 1681 agcccacctt tccaggctac cctccacagg ccgggctcca ctcccatctg ctgaggtttc 1741 ccaccttgaa tcctgctgcc ccgattagct gtataacctt aaagaatgca ctgtccctct 1801 ccattaaaat gaagtgcttg gatggattgc taaaggcctg tctggctcgg aggcttggtg 1861 cctcaacaca ttgcctgctg gtccaaggaa atcagtgcct gagccagagt ccccatctct 1921 aagctccatg gttattgttc ttgccacctg gctaggaaat gtccttccag ctgccccagt 1981 ctagctgcct caccctgggg ccatgccccc aactctgtcc tacccttctc tgctgctgac 2041 actcgcccct tcccagcttc cagttggata caggacctgg gccaggagag cagggaggac 2101 actgtggaaa atgcggccag gccatcaggg gcctcgcagc 
aggggactgg aagggggagc 2161 agtgtccagg gccagaagtg ctgcgggaga gccaggacat tggctgcctg tggtcttggt 2221 ggtcgtggtc agttccctct cctgccagct gtggaatgtg aggcctggcc tgggagatat 2281 ttttgctgca ctttgagcca ccccgccccc tggaactcag accctgcaca gtccatgcca 2341 taacaatgac gaccacttcc aattgtttcc tagctcgaga gagaggcggg gaggggagca 2401 ctgtttggga agggggggag cctcgggggg atgcttctag tgacaacagc cctttctaaa 2461 tccggctagg gactgggtgc cgttgggggt gggggtgcct gctgccccat atatacagcc 2521 cctgagacca ggtctggctc cacagctctg tcctgctctg tgtctttccc tgctgctctc 2581 aggtaggagc gggagctgga ggctttactc tgggataagg ggggctccag gcttaggaaa 2641 gggattcctc ttggaatagc aagcttcatg caggacttca tgcagagtac caggtccagt 2701 cactgggcac acatgtgcag gtctaaacat gggcgtatgt gccacaggag ttcctagggg 2761 aagatatctg catctgagca tatgggacca atatgcatta cagggtgtgt gtgtgtgtgt 2821 gtgtgtgtgt gtgtggacaa gcctgcatag tctgaatgtg agaaagggta tgtacctcac 2881 tgactgatac agaaatatca agtgggagat gtgggtgtat gatcacgtct gtgaacgagg 2941 aggttccttt gcagggaggg agaaagaatg gtggagaagg attctaggaa gtgtgtaatg 3001 ccttcatgtg attctgtgag gcctggaatg tctctgcaga aggaactcac aggcacgggg 3061 gtgtaggatg ttccaggtct ttgtgtactg aagaacacgc agatgtgatc tacttacgtg 3121 gcacacatcc tgaaatgtgc acatttgctg ttgaggcata tatggcaccc tggggttggg 3181 gtgcggattg aggtatggga aggaagcctg tcttcatgtg agaggcatgc ccaagcagca 3241 ggcatgagtg ttcataacga gggcttgtgg agggtgtttg gctatgtgat gggtgtgtct 3301 gtgagaggaa gcccacaagt gtcttctctc tgagaatgtc cacaaagagt acaacaaaat 3361 ggggacagga tggaaaaaga agtctcttta gagtcatcag acttgcattc aagccttggc 3421 tctcccactt gttgtcatgg gctcttggcc aagcaaccca tttaacctcg atttcctcat 3481 ctgtcaaatg gggtgaagac tgagtaggag aaagtgcatt gaagtgcttt tagactggaa 3541 agtgccgaac aaatgtgacc attagttgtc tctcagatgg ctcaaagaaa ggtaaccact 3601 agatgggtac caggcagggg tgggaccatc tcgcaggagt gaagaggatt agggctcaag 3661 gttgagttga caccaggaag ctgttagaac tagttcttga ggaaaaaggg aatagggttt 3721 gaaaaacaag aaggatggga ccagagaagc tgaccaccct ttccaccttg ttttggaaga 3781 ggggtagctg aagaacagag ccagggagaa gcaggaaggt gggactggtc 
aggttgggca 3841 cagcctgccc tgacacagcc tcttccctct ctccaggtcc cctgcaggcc ttggcctttt 3901 cctcatctgt agacacactt gagtagccca ggtaagaaaa gctgaagcta gagtgttgaa 3961 aatctagtaa gactgggcat tagagctccc caaagccagc ctatggaact ctagtttctg 4021 ccatgtgctg gagagaatat cttggctatc aagagctact acctgtgacc agggggccag 4081 ggatagatga ggtcagagat ggaaatagtt agatttcttc ctcctacagg gagctcagaa 4141 tgctcttcct tcccctagga gagccccagc ttgaacaata gtttgttgtt ccttttaccc 4201 tggcaggttt gctaggctgc tcctctggtt aaggggaacc tcaaagagga agggactcac 4261 tggtaactcc tcttgactct tgagcatggt gctaggtttt ggggctccca ctgaagggga 4321 gagcccaggg agggaaggga agaaatgggc agatgggagg gcagccagct tctgctccac 4381 tccaggcaca gccatgggag attcggagat ggcagtcttt ggggctgccg ccccctacct 4441 gcgcaagtca gagaaggagc ggctagaagc gcagaccagg ccttttgacc tcaagaagga 4501 tgtcttcgtg cctgatgaca aacaggagtt tgtcaaggcc aagatcgtgt ctcgagaggg 4561 tggcaaagtc actgccgaga ccgagtatgg caaggtgggt gtcaggctga tgtgagagtc 4621 caccctgcca cctgtacacc tgggtgacag agaggggtac acccatgccc catgcccatg 4681 cagacctgag gatggtccca ttctccttcc ctttggggaa gaatccagac cctccaaaga 4741 gccctctcct tcccaatatc ccttctgtct tccctgtgag atcctggttc cttctctctt 4801 gagcactatt gccctgtcac tcaccaactc ctaaccctct tgaggaagga ggggaagccc 4861 aggctgacag gagggcttgg gtgggggctc ttgcagacag tgaccgtgaa ggaggaccag 4921 gtgatgcagc agaacccacc caagttcgac aaaatcgagg acatggccat gctgaccttc 4981 ctgcatgagc ccgcggtgct ctacaacctc aaggagcgct acggctcctg gatgatctac 5041 gtgagtcctg cccctggccc tacgttggga tctctgttct tgctccatcc atgtccaccc 5101 caggagccaa gagtgtcttc tgtttgtgtc taggcagggt tacactctaa ctcgtcccaa 5161 catccttggt tcaattccaa cactctgggg actggcatta ctcagattga gtgcatgcag 5221 aggttttcct tctcttcttt ctctcctggg atctttctct aactcccaaa atcaccagcc 5281 ctcccccttc gcaactggca agtcactgct ccttttctaa tccccagacc tactcgggcc 5341 tcttctgtgt caccgtcaac ccttacaagt ggctgccggt gtacactcct gaggtggtgg 5401 ctgcctaccg gggcaagaag aggagcgagg ccccgcccca catcttctcc atctccgaca 5461 acgcctatca gtacatgctg acaggtgaga ggccctggaa ggtcttcctg aagggaactg 
5521 ggataggccg ggagggagag ggagaaggaa gggagaagcc ccacgagagc atcctgtgca 5581 gctcctaacc tttccccccc accctctccc cacagacaga gaaaaccagt ccatcctgat 5641 cacgtgagtt agctgctact gtattccctt tcagaatctc cctgatccca gcctccagcc 5701 ccatcaggtg tcccaggccc cgaggcatag actcagcctc tttccccgca acctcgtgcc 5761 catctccttc ttctctgacc attaccctga cccctctctt tgctctccct cttcctttct 5821 gctttccatc cccttcatgt ttttcctctc ccacctcctt cacccactct ccaactcatc 5881 acgcgtttat cagagctcat tatgcagctt cacctctgca ttttctttcc cttcagaaac 5941 atttcccatt cttccagccc taccaacttt tcccaaaaca gcccaggccc atctcccctg 6001 tttactttgc ccagccaagc acaggctcag cctgcccaaa ccagtgcctt ctggtggcta 6061 ccttcctggt gccagccctg acctctgtgc atcagaagac agttgtggat tttgggtgta 6121 gataacctgg cagcttcttg cccagggctc ctctctcggg taggcccagg cattctctcc 6181 tgatttgagg cttgtcggtc tccagtagta ttgttcactg cccaataagc ccctgtcttc 6241 acagcggaga atccggagca gggaagacag tcaacaccaa gagggtcatc cagtactttg 6301 ctgttattgc agccattggg gaccgcagca agaaggacca gagcccgggc aaggtagcct 6361 gctgccctcc aaggtcctgt accgcagaaa gggagggaga agagctctca cctgcctcct 6421 tcttggcctc tgcagggcac cctggaggac cagatcatcc aggccaaccc tgctctggag 6481 gcctttggca atgccaagac cgtccggaac gacaactcct cccgctttgt gagtggtccc 6541 tgaccttggc cttgggactt ggactggtgg aggaatggtc tcagatatga gccttccccc 6601 aactcatcac cactctcttc catctctagg ggaaattcat tcgaattcat tttggggcaa 6661 caggaaagtt ggcatctgca gacatagaga cctgtgagtg ccatgaatct gctaggctca 6721 gcctaagctc acccttgctc tagaccatct ggtcttgacc tctctctctc tcccctccct 6781 ccctctgttt ttctcctctt taagtctctg tctgtaggtg tctctgtctt caggtctaca 6841 tatctgtctc tctctgagac ttcctctgca tctttctcca tttctgtctc tgcatggcta 6901 ggtgtctttc tctgggattt ctctctgaga tcattttaat ctcttcctcg ggtctcgttc 6961 tatctctcgt gtgtagactc tctttgtgtg tcctctgtgt ctcacaactc tagatgtctc 7021 tctccgtgcg tctctctcct ctccctctct cgtcgctctt tgtcgtctgc tgcatcatta 7081 tcattaattt cctgtgccaa ccctagactt ttctttctct ccttcttctc cccacctgtt 7141 cagcagatct tctggaaaaa tccagagtta ttttccagct gaaagcagag agagattatc 7201 
acattttcta ccaaatcctg tctaacaaaa agcctgagct gctgggtgag tcagagccac 7261 cgactgagac caactatctc atgcactgtc cttgcttgct gactcagctg catgcctggt 7321 tcatggcact gctggatcaa ttcagtggat gtgggccaag ccaagccctg tcctgtgctc 7381 ttcctcaggc atgtgctgtg gcgagcagcc tccatgagag ccgggggctt gtgtcccacc 7441 ctaaccatgt tttttcccct agacatgctg ctgatcacca acaaccccta cgattatgca 7501 ttcatctccc aaggagagac caccgtggcc tccattgatg acgctgagga gctcatggcc 7561 actgatgtga gtgtgtgagg acccagccag ggggtgggag gattggcagt gaggggcaaa 7621 caggggtcca aaagcagagc taagacgctg gccattggtt gtttaaagtg gtaatggacc 7681 tcctgagagc cagaatgttg ccctgtgatc agggtgtact ggggaatgga tgaggtgaga 7741 caggactggc cagtatctgg tgttctatca aaccaggaag agctgaaacg catgagacag 7801 agagacacac aagccaagaa acaagcatca ccgagggagc aggacgagga gggaaggcca 7861 tgcagacggc atggggctgg agggggctgt taggtgaaca gatgcagaca ggtatggaag 7921 gccaggcagg aagcaagtgt agggccagga agcataagtg ggtaactcag aaaagctgtg 7981 gccactgaca ctggggctgc ggccaactga cgtctgctct gctcttgcat cttacatcct 8041 ttgttttgga gttccctgca tcttacatgg ttttgttttg gagctcccca ctgggtatgg 8101 gagtctcaga accccacagg attaaggata cagttttctc tccaacttac aagggatctc 8161 acttacccat catacttctt ttttggggtc cgccaatatg ggggcctcct acagaacgct 8221 tttgatgtgc tgggcttcac ttcagaggag aaaaactcca tgtataagct gacaggcgcc 8281 atcatgcact ttggaaacat gaagttcaag ctgaagcagc gggaggagca ggcggaaccg 8341 gacggcactg aaggtgggag gcagggattc ttgggggcag ctgtcaagtc atggagggcc 8401 atgtctgctc agcagtcatc tctcttctct tttttctttt ttttttgaga tggagtcttg 8461 ctcttgctct gttgcccagg ctggagcgca gtggcacgat cttggctcac tgccacctcc 8521 acatcctggg ttcaagtgat tctcctgcct cagcctcccg gagtagctgg aattacaggc 8581 atgcaccacc atgcctggct aatttttgta tttttagtag agatggggtt tcatcatgtt 8641 ggccaggttg gtctcaaact cctgacctca agtgatccgc ctgccttggc ctcccaaagt 8701 gctgggatta caggcatgaa ccacacacct ggccagcagt catctcttac caactttgct 8761 cttgcctttt ccttccagag gctgacaagt ctgcctacct catggggctg aactcagccg 8821 acctgctcaa ggggctgtgc caccctcggg tgaaagtggg caatgagtac gtcaccaagg 8881 ggcagaatgt 
ccagcaggtg ggtccatctt cagatgataa tgggtgggca gggtagggag 8941 actggcatgg tgggatgaga gttttcaagt tcactcttcc caacaaccct gctcaatatg 9001 ggctctctct tcacgccttg caggtgatat atgccactgg ggcactggcc aaggcagtgt 9061 atgagaggat gttcaactgg atggtgacgc gcatcaatgc caccctggag accaagcagc 9121 cacgccagta cttcatagga gtcctggaca tcgctggctt cgagatcttc gatgtgagtt 9181 gggacccctg ggagtgggag aacaatcact cactcgctcc cacattcaac agctatttct 9241 tagagccagc tgtggaccag acatgggaag gcagtgggga ctgtgtggtg acagaggcag 9301 tcatttactc tgtcttcagg ggaagccctc cttcactgcc ttgacatgga ggggaccagc 9361 cacgcctgct gggctcaggc acagtggacg gcacagcccc aatggccact cacacccact 9421 ttctgactgc tcccacccct catgccccct gcagttcaac agctttgagc agctctgcat 9481 caacttcacc aacgagaagc tgcagcagtt cttcaaccac cacatgtttg tgctggagca 9541 ggaggagtac aagaaggagg gcatcgagtg gacattcatt gactttggca tggacctgca 9601 ggcctgcatt gacctcatcg agaaggtgcc tctttggcct taccacctga attccatcct 9661 cgacaccaac aagaacacat agacaaaata gcagcccctc tcccttctgg ggaatataaa 9721 aatagaaggg gctgaaggat ccgagccctt ggttcagagc ctatgttgtg ctgagcacca 9781 ggacagggtg gcacagggac agctgatttc caggggcttg agagtggaca catggaggat 9841 gggcactgca tgatgacctc cacactgcat gtttattggg cctggtatgc atccaaggat 9901 caggaagtgt gagggtggtg taggggatca taagagtgca cctattttca accctagcat 9961 ctcaggcatc tgggtcgtgg agtggtgtgt acagcatcga tagaatccat attcccagac 10021 tttcacaaag gtccttctgt catcagacaa aatcctccac cttgaaccgg ttccagtcag 10081 tagataactg tactcagagc tgagcctact accttaacac ccaacatggc acctccacga 10141 gcaagtatat tgaccataga gcagaatcca tgtcacctgt gtgaaggaca ctcagtgatg 10201 ctctctcctg cttcctcagc ccatgggcat catgtccatc ctggaagagg agtgcatgtt 10261 ccccaaggcc accgacatga ccttcaaggc caagctgttt gacaaccacc tgggcaaatc 10321 cgccaacttc cagaagccac gcaatatcaa ggggaagcct gaagcccact tctccctgat 10381 ccactatgcc ggcatcgtgg actacaacat cattggctgg ctgcagaaga acaaggatcc 10441 tctcaatgag actgtcgtgg gcttgtatca gaagtcttcc ctcaagctgc tcagcaccct 10501 gtttgccaac tatgctgggg ctgatgcgcg taaagtaggg actgaggctc ccggtacagg 10561 aggagcaggg 
attctgccaa ggttctgagc caggtcaatt gctacacccc accacttcaa 10621 tgtcaggttg tacccaaggt ttacagactc agtgggatgg aactgggtga agaaactgag 10681 gcagccacat tgaaggccct cacccagggg cccaaatgcc agcaaggatg taaagagggg 10741 ctgtgattct ttactcacac cctacctccc cacactgatg cttcttttgt tgactctcct 10801 tcctgcagct attgagaagg gcaaaggcaa ggccaagaaa ggctcgtcct ttcagactgt 10861 gtcagctctg cacagggtga gtgggacaca gccccagcca acttggctcc ccatctgccc 10921 aaccccaccc cagccccacc cttccctgcc ttgttcatcc cctactcctc ccgttccctg 10981 tctccttgtg cattcggacc attttcactc tgtcttctct tcccgtcatc tcctggcctc 11041 ttcacttatt tacttcccat ctcccttctc tcctctttcc cttctgtctc ccaccctctc 11101 tccttatccc tgtctcgccc cagggcccct tcatctctgt gaccttctga attctctcat 11161 ctctcttttc cttccttctt ctcctctctt cttcctgcat ctctttctgg cattttcttt 11221 acttttctct attgcacttt ttggccacag gaaaatctga acaagctgat gaccaacttg 11281 cgctccaccc atccccactt tgtacgcttg tatcatccta atgagacaaa gtctccaggt 11341 gaggccacaa actcaggcca acccactgct gggcatacac cgccctggga acagggacct 11401 cctaggacat ctccctacct accaccacag tggtttccaa accacaattc ttctcctccc 11461 atcccgcatg aggcctccca gggtacctcc ctctcagcgc taccttgcac ttgcagtgtg 11521 taatgagggt gatgtgtaag gcactctgat ggacattacc tcatcagtgg ccctacaacc 11581 ccttggatag ctggttgtcc aagacaagaa tcttagttca actttaaatt tacctgcagg 11641 aaccacttaa tcccccaatg agacatattc aaattgttaa tccatttttg gctaattagg 11701 agcagaggct cagggacctg ggttcaagac ctggctctgt tcttgatcag cttagtgaac 11761 ttgggccagt cccataaccc tctgggctta atttactcat ctagaacatt ggatatatct 11821 gttctggcta ccacctatga taataggtac tgggaatcat ataaaaaagg aatgtgaaag 11881 ccttttagtt aaaaatgtgg ctagaagaaa atgaaacaaa tgattattac taactgactt 11941 gctaagatta caagctaatc agtgacaaag ccaggatcag aacccagaac ttcagtccag 12001 ttctcacaga ctcctcctac ttccttcttg ccacaggggt gatggacaac cccctggtca 12061 tgcaccagct gcgctgcaat ggtgtgctgg agggcatccg catctgcagg aaaggcttcc 12121 ccaaccgcat cctctacggg gacttccggc agaggtgggt atgagggtgc accagaagct 12181 catagaacag ggggagccag gctgccctga tgggaatggg atctgcaggt gaccctgaat 
12241 tcctatgggg cagagcagac tcactgcaga gcatgggtga ctctggacac ttccctcctc 12301 aggtatcgca tcctgaaccc agcggccatc cctgagggac agttcattga tagcaggaag 12361 ggggcagaga agctgctcag ctccctggac attgatcaca accagtacaa gtttggccac 12421 accaaggtga ggaaaggaga ctaattaatt aaaggaaaga catctctttt ccattgactc 12481 ctctgatgct tttcctgttg taatcatctt agcaaaatct cttacctgta tgctacccct 12541 cccagtggaa catctagcac cactcccctc atcccagctc cagctgccat tgacctctcc 12601 tgcaatcctt ttctaggatg ttacccttcc taaggtaatc ccacatctct ttcctcgtac 12661 cctccatgtc atgccacaaa cttgcctgca ggtgttcttc aaggccgggc tgctggggct 12721 gctggaggaa atgagggacg agaggctgag ccgcatcatc acgcgtatcc aggcccagtc 12781 ccgaggtgtg ctcgccagaa tggagtacaa aaagctgctg gaacgtaggt gagagatctc 12841 aagaggaggt ttcccgcttc tctgaggccc aggctggttc aggggcagtg tcaggaaaaa 12901 aagctcagca gatcttcaaa cacagagacc tgcaggaggg gctcatatga acacactgca 12961 gtcacagggt cagaggcctc aggaagggtg ggagggatga aggaaatggg atattcccaa 13021 ggtttcagga cctcaggtag gaaggaggca ggaggctcag cactcctttc aatgggcccc 13081 tgcagagact ccctgctggt aatccagtgg aacattcggg ccttcatggg ggtcaagaat 13141 tggccctgga tgaagctcta cttcaagatc aagccgctgc tgaagagtgc agaaagagag 13201 aaggagatgg cctccatgaa ggaggagttc acagccctca aagaggcgct agagaagtcc 13261 gaggctcgcc gcaaggagct ggaggagaag atggtgtccc tgctgcagga gaagaatgac 13321 ctgcagctcc aagtgcaggc ggtgaggctc ctgggctact gttggctctt ccaccctgct 13381 ctgccttcac ttcccacaga ccctgcacct ccctgcacag gggctcactg tcttgttccg 13441 ttcagtcaga ggactctggg atcaacactt ctaaagacct ccaaagaggt ccctcaggag 13501 gaggaaaggg cagagggaaa aggagctgac ttttaaaagc acaggctttc tacttgggtt 13561 caacctagat acacagctta attattttgt gaccttgggc agaccatgca tcatctttga 13621 gcctcaaggg cctctagaca gggaataaag ctgtccattt cattaagcca tcatacataa 13681 gtgcttggca tcagcaaatg caattcccac ccttcgatgg aggagaccag aggaagaggt 13741 ccagatgaag atttctggtt cctttcctct ctgctcccct ccccagtgtt cccaagttat 13801 actatcctga aactcttccc ctccccatac ttctgaagct cttgccaagt ggggatatcc 13861 tagagttacc ctcctatttg agtgatgtgc ctctccttcc 
ctcctacctg caagaatgga 13921 ggaccttacc ccctgaacag cctcccctct gttcctcacc ttcaaggaac aagacaacct 13981 ggcagatgct gaggagcgct gtgatcagct gatcaaaaac aagattcagc tggaggccaa 14041 ggtgaaggag atgaacgaga ggctggagga tgaggaggag atgaatgctg agctcactgc 14101 caagaagcgc aacgtggaag atgagtgctc agagctcaaa agggacatcg atgatctgga 14161 gctgacactg gccaaagtgg agaaggagaa acacgcaaca gagaacaagg taagggcagc 14221 ctccctttgg ctccagcccg ggtctcatca ggactctcag accatactga ccttgaccca 14281 ggctagccac tcggcatcca gagagcagca tggaccttga catggagctc cccacagatg 14341 gcaccaagct ggtgaccttt gaccctaaag gagatgggat tcttggtcgg caggtgaaaa 14401 acctgacaga ggagatggct gggctggatg agatcattgc caagctgacc aaggagaaga 14461 aagctctgca agaggcccac caacaggctc tggatgacct tcaggccgag gaggacaagg 14521 tcaacaccct gactaaggcc aaagtcaagc tggagcagca agtggatgat gtgagtagat 14581 tgagagttgt ggggcctaga tatgccatgt ctatctgtgc ccagagctct gtgtgtgtgt 14641 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtttgt gtttaatgtt tatagggggg 14701 tagggttaga ggatgttaac ttttctgata tctttctaga aaggagctgg ggaatgaaaa 14761 gtcagaggtg gtgatcaatt ctgaattatg tatctaatct cactggtatg atcaataata 14821 gagatgagtt ttcttattta atttaagtgc aatcccatat atctctgcct ttgacacaag 14881 atttagaggg tcatctcata agacatgata aatttctttg gaaattttgt gaggctcccc 14941 aagtagaatt tttgaaattc caatatttaa gggcagaatc tgatcattgt tccctctgat 15001 gcacatatgt ctgttgactc acttatcact ctgtatttta attatttaag cttctttaga 15061 taggaactta gatctattca atttggggac tgcagtgctt agcaagagcc tgatgcccag 15121 taggtgttta taggatgctg aaagaatgaa tacggcggcc agcggtggct cacacctgta 15181 atcccagcac tttgggaggc caaggcgggt ggatcacgag gtcaagagat cgagaccatc 15241 ctggccaata tggtgaaacc ccatctctac taaaaataca aaaattagct ggatgtggtg 15301 gcgtgcgcct gtagtcccag ctactcagga ggctgaggca gaataatccc ttgaaccagg 15361 gaggcggagg ttcagtgagc cgagatcgcg ccactgcact ccagcctggc gacaagtgag 15421 actccataaa aaaaaaaaaa aaaaaaagga ggaaggaagg aaggaaggag agaaagaagg 15481 agagaaagaa agaaagaaag aaagggaaag aaagaaagaa tacggctttg gggtttcatc 15541 ccacagtcat ttattccttt 
atctataaaa tatgattcag catcccctat atctaggcaa 15601 tctcacagtc ccctaattaa catttggccc actctgggga aggtttccca agtcctgaac 15661 acaagattta ccaagtcctg aggtaactga acaacaaaat caccaagtca ggatgctttg 15721 cctcactggg cctgggctcc ttctctctac cagctggaag gatccctgga gcaagagaag 15781 aaggtgcgca tggacctgga gcgagcgaag cggaagctgg agggcgacct gaagctgacc 15841 caggagagca tcatggacct ggagaatgac aagcagcagc tggatgagcg gctgaaaaag 15901 tactgtgtcc cctccctgct gcccttccgc ctccccagcc cataacagta caagcagacc 15961 caatacagcc atcctctgag gcaacaacct cacgcagcct ccacaagtgg aggtgcaaac 16021 catgggaaaa ggcctcccag gcaagggctt tcctccccta gtcccagata tcagggttag 16081 ttagatggtc ctccctggca aaaagtccat tgtatcccat gagtcctttc ttcttgttag 16141 gcactcagtg ggaatgaaaa taacagcctt cccccttcat aacacatctc tctttggaga 16201 ccagttcctt gacatggagg aaggctccac atgaggagcc tcttaggaaa gtccctttgt 16261 ggcttttcac agagagccca cagggcacca agaaatgccc tcacattaaa gaagtcaggg 16321 gtacagggcc atttctttgt ttgcttaact tgtggttcca attcttagtg tagtcttcca 16381 aatgtataga cttccagact tggtaaacag tgagcttgca gaacatattt tgataattga 16441 aggtcttatg gtgtttcagg ctcaggactt tgaaaggctt tgagaatcat cataatctac 16501 atcctcatca agtttgtgtt tgcacaggaa tcttcatacc ttcccaggca cagaatgtgt 16561 tgaataaatg aataaataca cgatcctatt tgattctcaa aacacctttt aaggatggta 16621 tctttaattt cactttacca aagaggaaaa taagtttcag aaaggttagc tgagttaccc 16681 aaagcaatag tgacaaaatt taaagaccat cagagctaaa aaggaccttt gaaacaatgt 16741 agtccgaccc cctcatttta cattatgaga aaacaaggac caaagatggg aatgtcctgt 16801 agagcccagg ccccatcacc ctgtctacag cactcctccc attacattct tgttggattt 16861 caatctcacc gcaggtgtta cacttccagg gatcctacat cacccccaga agctgttttt 16921 ctaatgaaat cctactcttt acctgtatca ttaccatttt caccactggt ggaccctcct 16981 ggaggcccca cgagtctccc ttacctcacc atccctcctt ccccacagaa aagactttga 17041 gctgaatgct ctcaacgcaa ggattgagga tgaacaggcc ctcggcagcc agctgcagaa 17101 gaagctcaag gagcttcagg tgaggtctgg acacccacgc gggatgctga gtccctgggc 17161 tggttctgtt tcccctgccc cctcctctag gtgtcctgtt cacagcatta caggaatgat 17221 
taatgtgtcc atggaggcag gcacgcagct atcacagtgt gcccatggat acatccaaaa 17281 ggggtcagtg atccacagag acctgaaaag gtcacaggcc tgagggcaag aattcctggc 17341 ttcctgtctc agctctgcct tagttgggtg acctctggga gcccactcca gatctctggg 17401 tctccatgcc ctcccctgtt acatggaggt ttttaagagt ccttcgactc tgatggcatg 17461 actttcctcc ttactctaga gcagaagctg tgttaggtct ttagttgtct ggtcaagaca 17521 aatacggctc tctcattttc aacttttgtt atttttttat gactaaactg cttaaaatat 17581 ccccagaaca caggtctctt tgtccatctt taagagcaag tttcaaccat ttccttccaa 17641 taaagagcat gtagactgtc aggccctcgc ctaaaggaag gctcttcaaa ctatcttccc 17701 caaatccttt aattcccacc tctccactgg aagagctaaa ctgacttgct gttccagaga 17761 agccgaggag ccttttagag ccggggggat tccagtggag gggtccaggc gtgggtctga 17821 gccttgtgtc tgaccaggca cgcatcgagg agctggagga ggagctggag gccgagcgca 17881 ccgccagggc taaggtggag aagctgcgct cagacctgtc tcgggagctg gaggagatca 17941 gcgagcggct ggaagaggcc ggcggggcca cgtcgtgcca gatcgagatg aacaagaagc 18001 gcgaggccga gttccagaag atgcggcggg acctggagga ggccacgctg cagcacgagg 18061 ccactgccgc ggccctgcgc aagaagcacg ccgacagcgt ggccgagctg ggcgagcaga 18121 tcgacaacct gcagcgggtg aagcagaagc tggagaagga gaagagcgag ttcaagctgg 18181 agctggatga cgtcacctcc aacatggagc agatcatcaa ggccaaggta ggctctgctc 18241 ggcctccctc ctccaacttc ctcctcccac ctcccctttc tgtccatgta gtgtctcttc 18301 cttcatggtt cattttccca cttcccttcc tctgcctggt tctcagatcc cctccttctt 18361 ttcacacctt ccctcttctt ttctccttca gttgcacctc ttacacccct tcattccccc 18421 cgcccccaac atccatcata ttaactctcc ttcccttctc aggctaacct ggagaagatg 18481 tgccggacct tggaagacca gatgaatgag caccggagca aggcggagga gacccagcgt 18541 tctgtcaacg acctcaccag ccagcgggcc aagttgcaaa ccgagaatgg tgagcctaga 18601 gcaggggctc catggttccc accacagtct ccccaacctc ctgagccact ccagcctcgg 18661 gcacaagcca atacacgcac acacagacac agagtctctc caggaaaatg gggcctgagg 18721 gctagaggag gaggtgggga tagagaggag tgctgatcta gaatggtgct ccccaggtga 18781 gctgtcccgg cagctggatg agaaggaggc actgatctcc cagctgaccc gaggcaagct 18841 cacctacacc cagcagctgg aggacctcaa gaggcagctg gaggaggagg 
ttaaggtaag 18901 ggctagactc gcgcacctgg cccaagcaag gagcacactg actagccttg catcaaatca 18961 cttcccttcc cagcaataag gctggctgtg gaccaacagt tctccaagaa ttcttgcttt 19021 atggagaaag ctgaacccac ctcctggtgc ccacccctcc ccaggcgaag aacgccctgg 19081 cccacgcact gcagtcggcc cggcatgact gcgacctgct gcgggagcag tacgaggagg 19141 agacggaggc caaggccgag ctgcagcgcg tcctttccaa ggccaactcg gaggtggccc 19201 agtggaggac caagtatgag acggacgcca ttcagcggac tgaggagctc gaggaggcca 19261 agtgagttct gagcagcctg acttctggct gaggcccctt tgcaggcagg actcagccca 19321 gccccagcct cagcagatcc cacacaggcg atgcttagct agtgtttgac aacacaggag 19381 gactctgccc cggccccacc tccttctcct ctcagggaag ctttttgctc gaattatgtt 19441 tctgatccga atataagacg tacaaaaggt ttgtctgagg gcagagtgct tccgtctggg 19501 gcagggcact gtggcaggga aaggcagtgg ggagggctgc agaagcccat acctcctcaa 19561 tgtccatagc gcagaggctg ggccagggtc agagggtgcc tgggtctcca cgccctgctg 19621 ggatcctctg cctgatgttc tcggcccctg ggacctgtcc tcaggcttct ccagctacac 19681 ttctgaggtt tcaaggattg tctttgagaa ggttcatgtt gtttacctct ttcccccatc 19741 cacaccctcc atcctcccca ccctctgcca ccctcccctg ggcaggaaga agctggccca 19801 gcggctgcag gaagctgagg aggccgtgga ggctgttaat gccaagtgct cctcgctgga 19861 gaagaccaag caccggctac agaatgagat cgaggacttg atggtggacg tagagcgctc 19921 caatgctgct gctgcagccc tggacaagaa gcagaggaac ttcgacaagg tgggccctgg 19981 gtggggcccg cagccagcat gcagggcaag ggggcatgag gggttcagtg agaggccaga 20041 gccatcctcc ttggaggtgg gggaggaggc tgagcccagg caggtcctga gacagaccct 20101 ggacatgggg ctgaggcttg ggggctgaag agtgagcctt ctccccgggc agatcctggc 20161 cgagtggaag cagaagtatg aggagtcgca gtcggagctg gagtcctcgc agaaggaggc 20221 tcgctccctc agcacagagc tcttcaaact caagaacgcc tatgaggagt ccctggaaca 20281 tctggagacc ttcaagcggg agaacaaaaa cctgcagggt gtgctggggg cccaagaggc 20341 tggggagggg ctgcactgca gtgttcccat atggtgccac ccaagggctc caagaggcgt 20401 ctgggcaaag aggcgtgtcc ctccaactcc actggacctc agcagccctc aaaccgagtt 20461 accgtgttcc ccacacagag gagatctccg acttgactga gcagttgggt tccagcggaa 20521 agaccatcca tgagctggag aaggtccgaa 
agcagctgga ggccgagaag atggagctgc 20581 agtcagccct ggaggaggcc gaggtgtgtg tgtgtgcagg gcacggggtg cagggagctg 20641 agctcaggct tttggtccct gttctcatcc cccgctttgt cgttttgttc actgtccatc 20701 cccagagcct aggacagagc ctggcacata gtaggcattc agtgaagatt tgtggagtga 20761 atgaatgacc agtcactgaa ctctcttcac ataaacattt tacccattgc atcttagctg 20821 agcttcccat ttccacatac ctcccggttc ccacctctgc ctccctctgt tttcttttat 20881 actctatacc tgattgcctc tgtttccttg gcaacacttt ttctcttcat tttcctcctt 20941 gtcttcatag ctggctctcc cctgtgaggg cctgaatcac tttctcttag gcttattttt 21001 attttctctt agattctctt ttcctccacc ttctgcttct ttgaaccact tacaccactc 21061 ttgaagtcac ttcgtatcca tgattagtga gcaggccccc accctgccct gtgccctgac 21121 tgtctgcctg catcccctcc cccaacccct tcccaggcct ccctggagca cgaggagggc 21181 aagatcctcc gggcccagct ggagttcaac cagatcaagg cagagatcga gcggaagctg 21241 gcagagaagg acgaggagat ggaacaggcc aagcgcaacc acctgcgggt ggtggactcg 21301 ctgcagacct ccctggacgc agagacacgc agccgcaacg aggccctgag ggtgaagaag 21361 aagatggaag gagacctcaa tgagatggag atccagctca gccacgccaa ccgcatggcc 21421 gccgaggccc agaagcaagt caagagcctc cagagcttgt tgaaggtact cacccagagg 21481 ggactggcct ccacgtggcc tggcgtaagc agtagtgtct tgatacaggc accagattcc 21541 tcctgcccct aggttactgc agggacctct gacaggtgcc tttagtgaag ggaaccgagg 21601 ctggctccct gctcatgccc actctcctga tcctcaggac acccagattc agctggacga 21661 tgcagtccgt gccaacgacg acctgaagga gaacatcgca atcgtggagc ggcgcaacaa 21721 cctgcaggct gagctggagg agttgcgtgc cgtggtggag cagacagagc ggtcccggaa 21781 gctggcggac agggagctga ttgagactag tgagcgggtg cagctgctgc attcccaggt 21841 gagcaagctc ccctgctgat tcctgaaggg agcacaggct ggggctcagc aagcaaggct 21901 tagagctatg catagatgct caatgcttaa cctacatcta cccaaccctc ccccaaccca 21961 gaacaccagc ctcatcaacc agaagaagaa gatggatgct gacctgtccc agctccagac 22021 tgaagtggag gaggcagtgc aggagtgcag gaatgctgag gagaaggcca agaaggccat 22081 cacggatgta agtcccccac tccaccgacc cgatccagac cagtgtctct ccgtgggctg 22141 ggcagcaagt gtgtgaggac ttgaccagac catgtgccac ctctctcctg cacacaggcc 22201 gccatgatgg 
cagaggagct gaagaaggag caggacacca gcgcccacct ggagcgcatg 22261 aagaagaaca tggaacagac cattaaggac ctgcagcacc ggctggacga agccgagcag 22321 atcgccctca agggcggcaa gaagcagctg cagaagctgg aagcgcgggt gcgggagctg 22381 gagaatgagc tggaggccga gcagaagcgc aacgcagagt cggtgaaggg catgaggaag 22441 agcgagcggc gcatcaagga gctcacctac caggtgcgac gggcggtgac tccaggcaga 22501 gccctggcac catagccaca gtgacaacca gctgaggaga atgaagagtt tgctcttagc 22561 ctcttccagg gcgaggatgg gaatgcagcc cccgtttcac tttgcctagc cctgccccac 22621 tctgaatgtc cctagctcag aggtcagtct ctgagcctcc tggcctggga gctccatctc 22681 aacacccact cttagcccat cgccagggga cacacacaga attccagaca aagctcaccc 22741 agtacagctc accagaaagc aaatatgtag ccagggtcac ccccaaaaga caccaaaaac 22801 acacccatcc cattcaaaca agcataaaaa ctttttcatt gcccagggca gtgtggactc 22861 tagctttctg aggtgttttc ttagaatcag actctgaatt agaatttgtt tcttttccaa 22921 tccaggatgt cactatctct ctaagcacac ttgtccttaa cttagattaa gtcagtgttt 22981 ctggcctgca ttgcagccca gaaattccat cacactcata gccccaaagt cactcccagc 23041 atgtgccgag gtccagagca gggacttcat cccgcatggt ccccggaccc tgaaaaacct 23101 gctccctgga gcactgcaca cacacaattt tgtgcataat ttcagagtcc cactagttct 23161 ttggactctt tcctggtaca gatcatttaa aatatttaca catatttggt aagaaattat 23221 aaggagaatt caagtgttta gtgaggatca gaaagtagaa ttgggtcagg atatcagatg 23281 aagcagggca gggcggaggg gatgctacct tctatgactg tgccatcttc accccctgcc 23341 taccctctgg cccccagacg gaggaggaca ggaaaaacct gctgcggctg caggacctgg 23401 tagacaagct gcagctaaag gtcaaggcct acaagcgcca ggccgaggag gcggtgagtg 23461 accctgctgg ggactaggcc caggggaggc ataggagagc tcgtccccaa gccaggagtc 23521 tgagaaccca ggccccctct cacctcatgc tcccacctcc cgcaggagga gcaagccaac 23581 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagcgggcg 23641 gacatcgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcacgaag 23701 gtgggtccct cttttgggct ttgctagtca cccccacagc aggcataccc agacagagca 23761 ccctcaaacc cgggatgctt ccttttcatt taattccaca cacttgagcc acatggcccc 23821 agaggcaagg taatgcagtt ctctctcatt tcaaggatta ttccagctct aacttttttt 
23881 tttttttttt ttgagacggt gtctcactct gccgcccagg ctggagtgca gtggcacaat 23941 ctcagctcac tgcaacctct gcctcctggg ttctcctgcc tcagcctccc aagtagctgg 24001 gattataggc atgtgctacc atgtcatgct aatttttata ttttagtaga gatgcggttt 24061 caccaatgtt ggccaggctg gtcttgaact cctgaccttg tgatcccact cgcctcagcc 24121 tcccaaagta cggattacag gcgtgacagc ccagtccagg ctcagctcta aagttcagcc 24181 tcagaactgc cccgaacatg atcctggctt tgtttccttt caaaagggct tgaatgagga 24241 gtagctttgc cacatcttga tctgctcagc cctggaggtg ccagcaaagc cccatgctgg 24301 agcctgtgta acagctcctt gggaggaagc agaataaagc aattttcctt gaagccgaga 24361 tcctgactct tcttcactgc cctgattgaa gccgagatcc tgactccaga ctcttcttca 24421 ctgccctgag gacccgggga tggggtctgg ggaggtggaa atgcagggga gtgggggcaa 24481 attcaaaggc ctggcctagg aggcttcgtg gagcagcact gaaaacccct ttcacttccc 24541 ccaaaccagc caaatgcagc agaggcgaca cctctcctcc cagccttctc cctggtccag 24601 ccaggattta aaggagattt agggagacct cttacgcttc atggtcataa gggtctagct 24661 ctgttggcca gaatgtccct ttattactgg tcgtgtgact ggtgaatctg ccaatgtaaa 24721 gtaatccatt cagcgattga gcgtcaaaat gtaggtattt ccatgagcgt ttttcctgtt 24781 gcaatggctg gcggtaatat tgttctggat attaccagca aggccagatg tttgagttct 24841 tctactcagg caagtgatgt attactaatc aaagaagtat tgctacaacg gttaatttgc 24901 gtgatggaca ggactctttt actcggtggc ctcactgatt ataaaaacac tctcaggatt 24961 ctggcgtacc gttcctgtct aaaatccctt taatggcctc // LOCUS HSCD14G 1570 bp DNA PRI 23-JUN-1993 DEFINITION Human gene for CD14 differentiation antigen. ACCESSION X06882 NID g29736 KEYWORDS antigen; CD14 antigen; monocyte differentiation antigen; surface antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1570) AUTHORS Goyert,S.M. TITLE Direct Submission JOURNAL Submitted (22-FEB-1988) Goyert S.M., Hospital for Joint Diseases, 301 E. 17th St., Cellular and Molecular Biology Unit, New York, NY 10003 REFERENCE 2 (bases 1 to 1570) AUTHORS Ferrero,E. and Goyert,S.M. 
TITLE Nucleotide sequence of the gene encoding the monocyte differentiation antigen, CD14 JOURNAL Nucleic Acids Res. 16 (9), 4173 (1988) MEDLINE 88234022 COMMENT the authors also sequenced corresponding cDNA Data kindly reviewed (28-MAR-1988) by Goyert S.M. FEATURES Location/Qualifiers source 1..1570 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /clone_lib="(lambda)gtWes" /map="long arm of chromosome 5, 5q23-q31" exon 51..158 /number=1 CDS join(156..158,247..1371) /codon_start=1 /product="cd14 protein precursor" /db_xref="PID:g312399" /db_xref="SWISS-PROT:P08571" /translation="MERASCLLLLLLPLVHVSATTPEPCELDDEDFRCVCNFSEPQPD WSEAFQCVSAVEVEIHAGGLNLEPFLKRVDADADPRQYADTVKALRVRRLTVGAAQVP AQLLVGALRVLAYSRLKELTLEDLKITGTMPPLPLEATGLALSSLRLRNVSWATGRSW LAELQQWLKPGLKVLSIAQAHSPAFSYEQVRAFPALTSLDLSDNPGLGERGLMAALCP HKFPAIQNLALRNTGMETPTGVCAALAAAGVQPHSLDLSHNSLRATVNPSAPRCMWSS ALNSLNLSFAGLEQVPKGLPAKLRVLDLSCNRLNRAPQPDELPEVDNLTLDGNPFLVP GTALPHEGSMNSGVVPACARSTLSVGVSGTLVLLQGARGFA" sig_peptide join(156..158,247..300) intron 159..246 /number=1 exon 247..1494 /number=2 mat_peptide 301..1368 /product="cd14 protein" polyA_signal 1477..1482 BASE COUNT 314 a 486 c 453 g 317 t ORIGIN 1 cagaatgaca tcccaggatt acataaactg tcagaggcag ccgaagagtt cacaagtgtg 61 aagcctggaa gccggcgggt gccgctgtgt aggaaagaag ctaaagcact tccagagcct 121 gtccggagct cagaggttcg gaagacttat cgaccatggt gagtgtaggg tcttggggtc 181 gaacgcgtgc cactcgggag ccacaggggt tggatggggc ctcctagacc tctgctctct 241 ccccaggagc gcgcgtcctg cttgttgctg ctgctgctgc cgctggtgca cgtctctgcg 301 accacgccag aaccttgtga gctggacgat gaagatttcc gctgcgtctg caacttctcc 361 gaacctcagc ccgactggtc cgaagccttc cagtgtgtgt ctgcagtaga ggtggagatc 421 catgccggcg gtctcaacct agagccgttt ctaaagcgcg tcgatgcgga cgccgacccg 481 cggcagtatg ctgacacggt caaggctctc cgcgtgcggc ggctcacagt gggagccgca 541 caggttcctg ctcagctact ggtaggcgcc ctgcgtgtgc tagcgtactc ccgcctcaag 601 gaactgacgc tcgaggacct aaagataacc ggcaccatgc ctccgctgcc tctggaagcc 661 acaggacttg cactttccag cttgcgccta 
cgcaacgtgt cgtgggcgac agggcgttct 721 tggctcgccg agctgcagca gtggctcaag ccaggcctca aggtactgag cattgcccaa 781 gcacactcgc ctgccttttc ctacgaacag gttcgcgcct tcccggccct taccagccta 841 gacctgtctg acaatcctgg actgggcgaa cgcggactga tggcggctct ctgtccccac 901 aagttcccgg ccatccagaa tctagcgctg cgcaacacag gaatggagac gcccacaggc 961 gtgtgcgccg cactggcggc ggcaggtgtg cagccccaca gcctagacct cagccacaac 1021 tcgctgcgcg ccaccgtaaa ccctagcgct ccgagatgca tgtggtccag cgccctgaac 1081 tccctcaatc tgtcgttcgc tgggctggaa caggtgccta aaggactgcc agccaagctc 1141 agagtgctcg atctcagctg caacagactg aacagggcgc cgcagcctga cgagctgccc 1201 gaggtggata acctgacact ggacgggaat cccttcctgg tccctggaac tgccctcccc 1261 cacgagggct caatgaactc cggcgtggtc ccagcctgtg cacgttcgac cctgtcggtg 1321 ggggtgtcgg gaaccctggt gctgctccaa ggggcccggg gctttgccta agatccaaga 1381 cagaataatg aatggactca aactgccttg gcttcagggg agtcccgtca ggacgttgag 1441 gacttttcga ccaattcaac cctttgcccc acctttatta aaatcttaaa caacggttcc 1501 gtgtcattca tttaacagac ctttattgga tgtctgctat gtgctgggca cagtactgga 1561 tggggaattc // LOCUS HSCD1R3 6325 bp DNA PRI 25-JUN-1997 DEFINITION Human CD1 R3 gene for MHC-related antigen. ACCESSION X14974 X15110 NID g29767 KEYWORDS CD1 antigen; CD1 gene; cell surface glycoprotein; glycoprotein; membrane protein; MHC related antigen; T-cell differentiation antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6325) AUTHORS Calabi,F. TITLE Direct Submission JOURNAL Submitted (25-APR-1989) Calabi F., MRC Laboratory of Molecular Biology, Hills Road, CB2 2QH Cambridge, UK REFERENCE 2 (bases 1 to 6325) AUTHORS Calabi,F., Jarvis,J.M., Martin,L. and Milstein,C. TITLE Two classes of CD1 genes JOURNAL Eur. J. Immunol. 19 (2), 285-292 (1989) MEDLINE 89196496 COMMENT *source clone=lambda R3G1 see X14975 for CD1 R2 gene; see also JO4142 for CD1 sequence. 
FEATURES Location/Qualifiers source 1..6325 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell prolymphatic leukaemia" /cell_line="D-PLL" /clone_lib="D-PLL-lambda 2001" /chromosome="1" /map="q22-23" mRNA join(>1805..1865,2158..2424,2734..3012,3580..3858, 4638..4737,4833..5598) exon <1805..1865 /number=1 CDS join(1805..1865,2158..2424,2734..3012,3580..3858, 4638..4737,4833..4854) /codon_start=1 /product="cd1 r3 protein" /db_xref="PID:g296639" /db_xref="SWISS-PROT:P15813" /translation="MGCLLFLLLWALLQAWGSAEVPQRLFPLRCLQISSFANSSWTRT DGLAWLGELQTHSWSNDSDTVRSLKPWSQGTFSDQQWETLQHIFRVYRSSFTRDVKEF AKMLRLSYPLELQVSAGCEVHPGNASNNFFHVAFQGKDILSFQGTSWEPTQEAPLWVN LAIQVLNQDKWTRETVQWLLNGTCPQFVSGLLESGKSELKKQVKPKAWLSRGPSPGPG RLLLVCHVSGFYPKPVWVKWMRGEQEQQGTQPGDILPNADETWYLRATLDVVAGEAAG LSCRVKHSSLEGQDIVLYWGGSYTSMGLIALAVLACLLFLLIVGFTSRFKRQTSYQGV L" intron 1866..2157 /number=1 exon 2158..2424 /number=2 intron 2425..2733 /number=2 exon 2734..3012 /number=3 intron 3013..3579 /number=3 exon 3580..3858 /number=4 intron 3859..4637 /number=4 exon 4638..4737 /number=5 intron 4738..4832 /number=5 exon 4833..5598 /number=6 polyA_signal 5580..5585 BASE COUNT 1690 a 1379 c 1715 g 1541 t ORIGIN 1 aagctttttt gggcaaatta cacataatta gataggctgg gttagggctg tctacaagat 61 cccagcctag agtgtggtgc agtagcctat tttctctagg actcttgctg caacaggata 121 tgggagaagg gaaacattaa cgttagcaga atgggaggaa tcagattatg gagacatgat 181 ccttgccttc tggcacctct cccagctgat ttagaagagg acacagaggg agacagacat 241 atggccattt ctataattat aaactctgct acatgctgag aggatggctt ctcaaagcca 301 gttagtgtac aagcagtgga ggttgggact gccctacttc tcctgcctag ggtcacacag 361 tctggcagat ctggaaagac ttccctgaga aagtgacctt ggaactgata ccttagtgga 421 aataactagg cattgggagg gggtgaggtg aggagaaggt caagaacttt catgtagagg 481 gaactttgtg cagagacctt agtcaggaag aggttttgca tagtggagga actgaaagct 541 aacagccagg ctacagtaca gagcaaatga tgctggggtg tgaggtgatg tcaaaaggta 601 gacaccaggg ctagagggcg cataatcttg caggccgtgc taatgatttt gttttctacc 661 caagaatgct acttcaggag atcagtcaaa 
aggttagggt gatggtgacg gtggaaatat 721 agaaaaagga atttattcaa gaggtattca aaaagaagtg aaatagagat gagttggaga 781 tagattggat aaaaagagtg agggagaggg aggtgtcagg ctgcagccca ggttctgagt 841 tgccaaagag gagtcttttc ctccaggtga gctaaggctt agaaccacga tttttggtca 901 ctttttctta ttttcagtgt gagaagtggc gagttggctg tccaggtaca cactgtcatc 961 ttacacttac tgtttaagac agaggcgcat atttggaaag tgaatgagtc aggagccaga 1021 ggagaaggta tgtcggcaga gcctagggcg agaggaggaa gagagtctgg gggcgcgtcc 1081 ccaaaaagga gacagggaag acagagcaga ggccggggag gggagagcaa ggaccatgac 1141 aggaggaaag agaggctagg gaacactgag gtggaggaat cctgggatat gacagttgta 1201 aagaattgta gccaacctaa tccagttctc tcaatgtgca actgaggaaa ttaattttca 1261 tgcgtttact ttattgtaaa tgtggtactt gcacttgaga aattttggaa aacataaaga 1321 tgtaaagcaa attaagatca ctcatagtct ttccacctag agacatgtac tgctaaagca 1381 aggactttga tccttttttc cctttgcatt tttaccatgg ttggaattgt aacacagaca 1441 caaccttatg tcctgcttcc aaaacataaa taatggttgt aaagcgtccc acacgttgac 1501 ccaaagtctc ctttgaaaca ggaaattgag acacgccggt tgtgaaacct actgaagtga 1561 gcggcggcgc caggaattcc tgggaccccg acctctttgc agctcgcaca gctaagggcg 1621 agggcgccct tcggcagaag cagcaaaccg ccggcaagcc cagcgaggag ggctgccggg 1681 gtctgggctt gggaattggc tggcacccag cggaaaggga cgtgagctga gcggcggggg 1741 agaagagtgc gcaggtcaga gggcggccgc agcgggcctc cgcgaggtcc ccacgccggg 1801 cgatatgggg tgcctgctgt ttctgctgct ctgggcgctc ctccaggctt ggggaagcgc 1861 tgaaggtggg tggaacgagg gcgcttgagt gcactcgcgg gagggcggag agagggagct 1921 gggtagggac ggggagggca acgcctgatg gggactggtg agacccggga cgcactggcg 1981 cgatctaggt agaaaactcg ctgctccctg gctccgggga gaggcagcgc ggcacagagt 2041 tcgctggcat cagccgcctc ctgaagctca tctcctcttg tttctttctt ccttctcttt 2101 atgctggctg ctctcccggc cacttgctac acgcctccaa tcttcattct ctcccagtcc 2161 cgcaaaggct tttccccctc cgctgcctcc agatctcgtc cttcgccaat agcagctgga 2221 cgcgcaccga cggcttggcg tggctggggg agctgcagac gcacagctgg agcaacgact 2281 cggacaccgt ccgctctctg aagccttggt cccagggcac gttcagcgac cagcagtggg 2341 agacgctgca gcatatattt cgggtttatc gaagcagctt 
caccagggac gtgaaggaat 2401 tcgccaaaat gctacgctta tcctgtgagc tgagggatag gatcctgggc cggtacccaa 2461 ggggagagaa tggccacaga aactcaactg ggagactgtg gcaccacctg atgagattct 2521 ctgctctgtc caccctcttc tgatttccct tctacctgga gatgtcccag gctttgactc 2581 ctcaaatgtc cctcgttcct gcctactcca ggtcacttac tttcctttcc ctgaagtctg 2641 ggtccccatt ataacctgca catcaatttc ttctctttca tctctcccag tcttttaaac 2701 ccttctttga tctttctcca ttcctctcca cagatccctt ggagctccag gtgtccgctg 2761 gctgtgaggt gcaccctggg aacgcctcaa ataacttctt ccatgtagca tttcaaggaa 2821 aagatatcct gagtttccaa ggaacttctt gggagccaac ccaagaggcc ccactttggg 2881 taaacttggc cattcaagtg ctcaaccagg acaagtggac gagggaaaca gtgcagtggc 2941 tccttaatgg cacctgcccc caatttgtca gtggcctcct tgagtcaggg aagtcggaac 3001 tgaagaagca aggtcagcct gccttcctta ctccctgcat tccacttcag ggctccaaac 3061 gggcttttcc attccagggt tctcatccct ttgagcattc aaagaagagg aaggcccagg 3121 aggggctgga taaggggtga gggtatttat tcatttcaca gacatcaact gagcacctct 3181 tgggttctat gaattcaatt aatgaacaac tgggggtgaa aggtgtcagg cagttagtta 3241 ggatccctgc ttggtagggg gatgttaatt aatgcaatgc ttgaaatagg ctacatatgt 3301 tacttcaaaa caaagggtgt tataggggaa cagacaaggg gcattcatag gaggctggga 3361 ccagagaaga agaaggagta tcagggcagg ctccctgaag aaggtgggac aaatggattg 3421 aaagttgtgg gagacttcat ttccagaagt gaacatgtca ggggcataca ggaaggaggg 3481 aaataaagac ctgaaagtca aaaaggatga tatgaggcct ggagcctcta atgcagagtt 3541 ttcactttaa gatccccccc attcccttgt tggatacagt gaagcccaag gcctggctgt 3601 cccgtggccc cagtcctggc cctggccgtc tgctgctggt gtgccatgtc tcaggattct 3661 acccaaagcc tgtatgggtg aagtggatgc ggggtgagca ggagcagcag ggcactcagc 3721 caggggacat cctgcccaat gctgacgaga catggtatct ccgagcaacc ctggatgtgg 3781 tggctgggga ggcagctggc ctgtcctgtc gggtgaagca cagcagtcta gagggccagg 3841 acatcgtcct ctactggggt gagaaaaagc tgggcccaag ctggaaatgg caggaggtgg 3901 tcctcaggca tagagggagg cactggggtg ggatgtggct tgatataccc aggttagagg 3961 agtttcagag atgaggcccc cagtaaaagg atagagagag gggttccaga cacaggaagg 4021 agggaaataa agacctgaaa gtctaaaagg attgaaatag tgctctctat 
atacaagaag 4081 aaacaagact acaaaactca gggatccaga agtaggcagc gaataaacaa tcccaggacg 4141 attcattcct tggcttaaag ggagccaggc ctttgggaag gcaggttagc aaagatagct 4201 tggttggttg agtgctccct gtgtaggggc caatgaatct gcatataata tctcatctca 4261 tcattcacag ttttctttga agcaagtact gctcttttcg ttatatagat gaggaaatga 4321 aggcggagat aggttaactg cccggagcca catagaaagt ggacgggctg gatttagatt 4381 caggtctgcc ttgctacaaa gctgttgccc tttccctcta tgctacacta gcaaatgcca 4441 aagtgggctg aatttgggat gcccaggcac tgaaaacaag atggggaatc taaacttatg 4501 aggaatagaa attggggttt taagtggagg aggaaataag aagcacctgg taccctcaca 4561 catgcctaga ccaggggatt ggatatatgt agagaggggt cctgtgctga gagacagcct 4621 atgttcctcc agagcaggtg ggagctacac ctccatgggc ttgattgcct tggcagtcct 4681 ggcgtgcttg ctgttcctcc tcattgtggg ctttacctcc cggtttaaga ggcaaacgta 4741 agttctcccc tttccctttc ctcaactctc tctccccttc attcctggct tcccttttcc 4801 ttaatggtct ttccctttct attctctcac agttcctatc agggcgtcct gtgactcgcc 4861 ttgccacatc tgtgtctctg gaacccagga cctctggacc tcaggttccc aagacttcag 4921 tcctggtctg ctcaggaatt gaagatgtaa ggaattgaag ataggagaga taccttgaaa 4981 aagtagagaa cagtcatgag gcagctttca tcacaccctt ttaacattta tctaaaagaa 5041 tttaaattct ttttcaaaaa ttacactaca agtttataag cccaaatggc tctgtgaaat 5101 cagaagtgca aaggtgtgca aacttgtatc tgaagaccta ccagggacaa gcaggtaaga 5161 gctgatgtga gtgtgtgtga tgggatctgt aaggaactgg aacacacatg tcctatccaa 5221 aggaatcagc tgcagctgct tgttgtcaag tataaagtca ggacctggct tggctttaac 5281 cgtttttcaa gaaaactgga aatctggatt ttcagcgaac atgcctgatt ttaaaaggtt 5341 gactcaagtt tttacaaaat actatgtggg acacctcaaa tacataccta ctgactgatg 5401 acaaacccag gagtttgtgt gtcttttata aaaagtttgc cctggatgtc atattggcag 5461 ttggaggaca cagtttctat tgtaaatttg gatttacgac tgaagaagga cattttctct 5521 ttaaaagaaa gttaggttat aagaaacaga ggcgtctcac atttttactt ggtgtaatta 5581 ataaacgaag ataaatcata gtgtatgtgt attatgttga aaaaaactac tctgagtaag 5641 gatatttctc tcaaatggtc atcacttttt ttccttggag gaagattgtc atgaaggcat 5701 cccttccttc ccaaaacgac aacggcaaca acaacagtct cctttactta cagctattaa 
5761 aagagacaat gtagggaaag ggccaacacc acttgggcaa gcatgagctg cagcgatgat 5821 ggtgatggta gcccagccat gctgtgttct tatcttagag aggacagagc aggaaactta 5881 caggggtagc agacctttct gatgccaaag aaataaggag gccaaaatca tgtttgccta 5941 gctgggaggt aaatctcttg gcagtcaggg tccctgaaca cccagaaaaa ataagtgaga 6001 cttaactgtt ggagagtgtt tgcttgagaa aagtaacatt tcacactctc tatactagac 6061 ttgcacatat gaatttagga tgtgcgtgaa gattctgcta gcttcaacat atcccaaagc 6121 acttggatat gcctataatc caagtgcttt gggaggctga gataggagga ttgtttaagg 6181 ccaggcattt gaaatcagcc tgggcaacat agtgagaccc tgtctctaca aaaaattaaa 6241 aaactagcca cccatggtgg tgagcgcctg tagtcctagc tactctggag gctgaggtgg 6301 gaggatccct tgagcccatg aattc // LOCUS HSCDIR2 10351 bp DNA PRI 24-APR-1993 DEFINITION Human CD1 R2 gene for MHC-related antigen. ACCESSION X14975 X15110 NID g29842 KEYWORDS CD1 antigen; CD1 gene; cell surface glycoprotein; glycoprotein; MHC related antigen; T-cell differentiation antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10351) AUTHORS Calabi,F. TITLE Direct Submission JOURNAL Submitted (25-APR-1989) Calabi F., MRC Laboratory of Molecular Biology, Hills Road, CB2 2QH Cambridge, UK REFERENCE 2 (bases 1 to 10351) AUTHORS Calabi,F., Jarvis,J.M., Martin,L. and Milstein,C. TITLE Two classes of CD1 genes JOURNAL Eur. J. Immunol. 19 (2), 285-292 (1989) MEDLINE 89196496 COMMENT see X14975 for CD1 R3 gene. 
FEATURES Location/Qualifiers source 1..10351 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda R2G4" /cell_type="B-cell prolymphocytic leukaemia" /cell_line="D-PLL" /clone_lib="D-PLL-lambda 2001" exon <5484..5541 /number=1 CDS join(5484..5541,5901..6167,6792..7061,7319..7597, 7990..8087) /codon_start=1 /product="cd1 r2 protein" /db_xref="PID:g296640" /db_xref="SWISS-PROT:P15812" /translation="MLLLFLLFEGLCCPGENTAAAEEQLSFRMLQTSSFANHSWAHSE GSGWLGDLQTHGWDTVLGTIRFLKPWSHGNFSKQELKNLQSLFQLYFHSFIQIVQASA GQFQLEYPFEIQILAGCRMNAPQIFLNMAYQGSDFLSFQGISWEPSPGAGIRAQNICK VLNRYLDIKEILQSLLGHTCPRFLAGLMEAGESELKRKVKPEAWLSCGPSPGPGRLQL VCHVSGFYPKPVWVMWMRGEQEQRGTQRGDVLPNADETWYLRATLDVAAGEAAGLSCR VKHSSLGGHDLIIHWGGYSIFLILICLTVIVTLVILVVVDSRLKKQR" intron 5542..5900 /number=1 exon 5901..6167 /number=2 intron 6168..6791 /number=2 exon 6792..7061 /number=3 intron 7062..7318 /number=3 exon 7319..7597 /number=4 intron 7598..7989 /number=4 exon 7990..>8087 /number=5 BASE COUNT 2694 a 2278 c 2257 g 3122 t ORIGIN 1 aagcttccac agtgtggaag gggcccccag cggttgccac tgctagctgg gggcagcctg 61 cttttattct tttatctggc cccacccaca tcctgctgat tggtagagcc aagtagtctg 121 ttttgacagg gtgctgattg gcgagtttac aatccctgag ctagacacaa atgttctcca 181 cgtccccacc agattagcta gatacagagt gtggacacaa aggttctcca aggccccacc 241 agagtagcta gatacagagt gttcattggt gcattcacaa accctgagct agacacaggg 301 tgctgattgg tgtgtttaca aaccttgagc tagagacaga atgccaattg gtgtatttac 361 aatccctgag ctagacataa agattctcca cgttcccacc agactcagga tcccagctgg 421 cttcacctag tggatcctgc actggggctg cagatggagc tgcctgccag tcccccgccg 481 tgaacccgca ctcctcagcc cttgggtggt tgatgggact gggcgccgtg gagcaggggg 541 ctgcgctcat cggggaggct ccggccgcac aggagcccat ggagggggtg ggagtctcag 601 gcatggcggg ctgcaggtcc cgagccctgc cccacgggga ggcagctaag gcctggtgag 661 aaatcaagca cagcgctggt gggctggcac tgctggggga tccagtacac cctccgcagc 721 cgctggcccg ggtgctaagt ctctcattgc tcggggccgg cagggccggc tggctgctcc 781 gagtgcaggc ccgccaagcc cacgcccacc ctgcactcca gctggaaccg cccgcagccc 841 aggttcccgc 
tcgcgcctct ccctccacac ctccctgcaa gctgagggag ccggctccgg 901 ccttggccag cccagaaagg ggctcctaca gtgcagcggc gggctgaaag gcttctcaag 961 tgctgccaaa gtgggagccc aggcagagga ggcaccgaga gcaagcaagg tctgtgagga 1021 cgccagcacg ctgtcacctc tcaatgccac tgcactccag cctgggtgac agagtgagaa 1081 tccgtctcaa aaagaaaaaa aaaagaaaaa atgtaactat gtgaagtgat ggatatggta 1141 ataacttgac tgtggcgatt acttcacagt atatacatat atcataacat catgttgtgc 1201 accttaagta tacccaattt aaaaaaaatt aaggtagaat ataaatatat aatttatcat 1261 cttcaccatt ttaaaatgta cagctcagtg gcaataaata cttttttata ctcaattttt 1321 atttgttggt tgtacctcac taaagctgaa aaatatattt gaataaatgt taaagaaatg 1381 taagtataca caaacaagaa aataagtatg tagaaattgc aatctttcct gaaaataagc 1441 caaaagaata aaactcgtat ttaaattctt tgaaaaaaaa ttagcagatg gaacattttc 1501 ttacatttgt acaaatgtac tcaaaaattt ttattttaaa tcaattttaa tgagttatct 1561 cccataaatt cttacataga aagattaagt tgtccaaagg cacatcttat aatatgtttt 1621 gagagaattt tctggttttg gcaaaagggg atctgcgaat gtgactgggg cagcctcagg 1681 aaaacataag agggggaatt tctttttaaa agagacctct ggccacagac actaagtatt 1741 tttagtaaaa aatgtggtta ggcactgatg gaatattttt tttttttttt tttttgagac 1801 ggagtctttc tctgttgccc aggctggagt gcagtggcac aatcttggct cactgcaacc 1861 tctgcctccc aggttcaagc gattctcttg cctcagcttc tcgaatagct gggaccatag 1921 gcacgtgcca ccacgccctg ctaatttttt tgtattttta gtagagaggg gtttcaccgt 1981 gttagccagg atggtctcga tctcctgacc tcctgatctg cctgcctcgg cctcccaaag 2041 tgctgggatt acaggagtga gccactgtgc ctggcccact gatggaattt tgtaaagcag 2101 gaaggaaata attgtaccat ataatgttac aaaaattaaa attaatatat attttttctt 2161 tggaatttat cttctggata attttaattt agaaggaatt tcattatttt gttaatgatt 2221 atggggttgc tgtcagacat tatttctact ttgctcagta atttgcagtt atctgtgagt 2281 tctgtagcct caaaattaaa aactagggtg tggttatctt tacttacata aaagtgcaat 2341 tttttgtgga ataacaacat tgttttattc tctaaatata ctttgcttta ttttactttg 2401 caaatgattt taaaattctc cctctttttt ttttttttga aacagagtca cactctcctg 2461 cccaggatgt agtacagtgg cacaatctca gttccctgca atctccacct cctaggctca 2521 agcaattctc ctgcctcagc 
ctcctgagta gctggaatta caggcacaca tcatggcacc 2581 tggctaattt ttactttttt tggtagagac agggtttcgc catgttggca aggctggtct 2641 cgaattcgcc tgccttggct tcccaaagtg ctgggattac gggtgtgagc caccgtgcct 2701 ggcctgatta aaaattctaa tggccggggc acggttgctc acgcctgtaa tctagcactt 2761 tgggaggccg aggcgggtgg atcagaggtc aggagatcga gaccgtcttg gtcaacatga 2821 agaaaccctg tctctactaa aaatacaaaa aatagatggt catggtggct cttgcctgta 2881 atcccagctg ctcaggaggc tgaggcagga gagtcgtttg aaccagggat tcgggggttg 2941 caatgagccg agatcctgtg gctgcactgc agcctgggca acagagcaag acagcaagac 3001 tccgtctcag gaaaaaacaa aacaaaacaa aaattctaag gaatacagtt acttgtagta 3061 ttgatgtttg gcataggtca ttccaatctt tttattatgt attattgttt ttgtaggtta 3121 ttatatattc tcttgaatat ttatctttgt ttactgttca taattttatt atgaaaccaa 3181 ttttggattt acatagcgat ttatagatag atttacatag caacttatag ctttgaatgt 3241 atttgtagaa aagaaagagt attgaaaata aatgtagaca ttctactcag gaaacaagga 3301 aacctttttt taatctttct ttaaccttag aatttagaaa tgataataag atgaaactag 3361 gtttggacct ttttcttttg tttttttgtt tgcagttttt ttttcttttt ttaattattt 3421 ttatttattt atttattttt tattattata ctttaagttc tagggtacat gtgcacaacg 3481 tgaaggtttg ttacatatgt atacatgtgt cgtgttggtg tgctgcaccc attaactcgt 3541 catttacatt aggtatatct cttaatgcta tccctccccc ctccccccac cccacaacag 3601 gccccggtgt gtgatgttcc tcttcctgtg tccatgtgtt ctcattgttc aattcccacc 3661 tatgagtgag aacatgcggt gtttggtttt ttgtccttgt gatagtttgc tgagaatgat 3721 ggtttccagc ttcatccatg tccctacaaa ggacatgaac tcatcatttt ttatggctgc 3781 atagtatttc atggtgtata tgtgccacat tttcttaatc cagtctatca ttgttggaca 3841 tttgggttgg ttccaagtct ttgttattgt gaatagtgcc acaataaaca tacgtgtgca 3901 tgtgtcttta tagcaacatg atttataatc ctttgtgtat atacccagta atgggatggc 3961 tgggtcaaat ggtatttcta gttctagatc cctgaggaat cgccaccctg acttccacaa 4021 tggttgaact agtttacagt cccaccaaca gtgtaaaagt gttcctattt ctccacatcc 4081 tctccagcac ctctccagca cagaaagagt attgaaaata aatgtagata ttctactcag 4141 gaaacaagga aactttttat ctttctttta acttagaatt tagaaatgat aataagatga 4201 aactaggttt ggaccttttt caataattct 
gaccaccgtt cagtaaattt actgtataag 4261 aaaagtagag gtttggggtt gaattttgtt ttattttttc ccaaagatgt ctatctctga 4321 tatcttaggg cctggcaatt gtttttggtc ataggtgggc tctgaataaa tgtcagttga 4381 acaaatatat aaaaaccaaa caaaggaatg ttatttcaac tgccttcaga gtgtcacaac 4441 aaatatcaat gaaaaagaaa caaaaataaa gactcaatta attctccagt gaacacttca 4501 gtcctctaaa ggtacttccc tgtgtccacc caaactgctc tctaaccgaa cgcaagcaag 4561 gggactccaa tgctctcagg ccatgaacta atgaggagct tcatgcctgc cctcagtctt 4621 tcttcatacc cagggcaaag ctggcccaca gacttccaga catgctctct tcgtttgggt 4681 agagatgggg atccaagtcc cctatgtatt tcctccagag ccacctgtaa aacatgggtg 4741 aaatctacct gccctctagc tcaccacttc caggtcaagc ccactcatgg accactttaa 4801 cagtgacagt tggcttgact ggggaaactg aaaaaaagga gctttcttta gattttctga 4861 taatctgcat tgccattgct gctccttttt cagcagcaca gagagttgga ttatggaggg 4921 ttgcctgctt attatgcagc tacatccagg atgctgagtg tcttctgatc atgatcaaga 4981 gcagagtgaa gagatacagg cggaggagag gatcacagag aggtagaagt gggagactgg 5041 tggataggtc actaggcacc actgagtgac taccatcact tctcacagat tagattattg 5101 acaagccaag tatggatttg atttctacaa ggccaaaagc tactccatct tctagccaga 5161 ccacagactt tgttttaagc caggactcgt tttccagcag ttagtgcatt agggaggttg 5221 ggaggatgta cacaagagca gggagaaaat ctggagccag acagtgagag aggcagaggg 5281 gaaatgaaaa accccgtggg aggctgcagc aaagcgaggc agtgtggctt ctctgacagg 5341 gaagtcagca gagggagagg tttgtctgtc tgtacagcaa gggaagtcag acgagagtgc 5401 aagagggtgt ggagaggggt actgatatct gaattattag ggcaggtgtc ctgccaagga 5461 atccctcctt taacagagct tcaatgctgc tcctgttcct cctcttcgag ggtctctgct 5521 gtcctgggga aaatacagca ggtaagaaga gtgcaggtgg aaagatacct atggtagggc 5581 accagagggc tgagaggaag ctctggggag gtcctggggg agggagcagt actcttctag 5641 gatgcccttg gaatatgcct ttcaggctag ttccaggcag agaattcttg ctctcagtct 5701 cagtttttgt ctctgatttt ggagaaagga agctggcccc acaggaaaag ggtattggag 5761 tatgtacaag ctacctaact gtctctcatc tctgggttcc ttttttccct ttggcatcac 5821 ttttcccatc cctttacatt ctctctactt gtcatttccc tctctctcag ctccccaggc 5881 tctacaatcc tatcatctag cagcagagga gcagctgtcc 
ttccgcatgc tccaaacttc 5941 ctcctttgcc aaccacagct gggcacacag tgagggctca ggatggctgg gtgacctgca 6001 gactcatggc tgggacactg tcttgggcac catccgcttt ctgaagccct ggtcccatgg 6061 aaacttcagc aagcaggagc tgaaaaactt acagtcactg ttccagttat acttccatag 6121 ttttatccag atagtgcaag cttctgctgg tcaatttcag cttgaatgta agttcgttgc 6181 tctaagctga taatttgcct gggaacacca actatttcca aatgaagata gatatataga 6241 ctctgaccat catttaacct tactaacctt gttccccact ctctgactcc cactccctcc 6301 tctgcttcac ccttcaccac cacccacact ccaccatata cacaaaaggg cctgcatgta 6361 catatctcaa catgaatata gcttcatgtc tggctctttg gaatgattgt ctcctctgga 6421 tcttctgccc ctcattcctg ccctcagact cagcctttct caaccctctt tctgcccttc 6481 tttatccttt gcctgagtgt tgacatggac tggcctgtac ctaaccactt tcacgtgaat 6541 tatttatgac caatctccta tcttcctgat agccttccca tctaccactt tcccattagt 6601 tatttcaaag tatctttatt atcttcaaat tttcttccca caaattttct tcccctttgc 6661 cagtaaactc tagtctccat atgatttccc agcaaatttt tcttcccttg aatctcttta 6721 ctgtctaaat tgtttgtttt tcttccttgt cattctttcc ataatgatct ctcttccctg 6781 tccactctca gaccccttcg agatccagat attagctggc tgtagaatga atgccccaca 6841 aatcttctta aatatggcat atcaagggtc agatttcctg agtttccaag gaatttcctg 6901 ggagccatct ccaggagcag ggatccgggc ccagaacatc tgtaaagtgc tcaatcgcta 6961 cctagatatt aaggaaatac tgcaaagcct tcttggtcac acctgccctc gatttctagc 7021 ggggctcatg gaagcagggg agtcagaact gaaacggaaa ggtgagccca actctctctc 7081 tcccctcttg ttcctagtac tataactctc atatttgaat ttgcctctca tcatcatttt 7141 gaaagacata gtgagagact agagaatgag atgtgtgggt tcaggactgt ttcttagaca 7201 agagaaagaa gtgattacta aatcactctt agtattatta caaaggcacc tgagtctctg 7261 agctctggcc tggggtgccc ttcaaaattc catttttttt ctatcttctt cttcctagtg 7321 aagccagagg cctggctgtc ctgtggcccc agtcctggcc ctggccgtct gcagcttgtg 7381 tgccatgtct caggattcta cccaaagccc gtgtgggtga tgtggatgcg gggtgagcag 7441 gagcagcggg gcactcagcg aggggacgtc ctgcctaatg ctgacgagac atggtatctc 7501 cgagcaaccc tggatgtggc ggctggggag gcagctggcc tgtcctgtcg ggtgaaacac 7561 agcagtctag ggggccatga tctaatcatc cattggggtg agaaacagct 
gaggctctgc 7621 tgggaaataa tgaaaatagc cctggggctt ttgagtgtgg ggctgaggaa atgggtagga 7681 atgctaggta caagaagggt aaaactggga caatcaaaat aaagaaggat agagtatgac 7741 agtagttaaa ttttaagaaa atggaagtag agaattagac atactaacag aaaaaggagg 7801 aggaactagt gatttagtgg gagagggttg ggaggagatc acagacaaag gatcaggagg 7861 aattgaaatg agggctttgg aaaacccaga tgaaaattct aggaaggtcc cacccttgtg 7921 aaatgggaaa tctcagcttg gtggaataga gtattttagg gttggtattc ttattctatc 7981 cccaaccagg tggatattcc atctttctca tcctgatctg tttgactgtg atagttaccc 8041 tggtcatatt ggttgtagtt gactcacggt taaaaaaaca gaggtgagct ttttcttgtt 8101 ctttgtttct tcagcctgta atcaattcat tcctttttcc tctatcttct tttttttttt 8161 ctctgccttt cccctttatt tcctttcttt gagattaata tcctcctctt ttcccacagt 8221 tcaaataaga acattctttc tccccacaca cccagccctg tctttctcat gggagccaac 8281 actcaggaca ccaagaattc aagacatcag ttctgcttgg cacaagtatc gtggatcaaa 8341 aacagagtat tgaagaagtg gaagacacgc ctaaaccaac tctggtgaca tttgctttac 8401 cttatacata aaatccttgt ctgcatcttc ttaaacaccg tccatgtccc ataagggaag 8461 catgctttta tttaaacagt ttatactagc aaagatactg acccctttag gaatactttt 8521 tccccatctt ccagagattt tttttttcct gctttggcta catatccatc attgtttatt 8581 tttgaaacta taatccagat acttcttttt catggattcc cgagatcacc caattgatag 8641 ctcttctgta ctccccaaat tgaactgatc ttcacaagca cattcatctc ttcctagctc 8701 tgaacagtag ttatttaggt ttttgctctt tttttttttt taatctcagt tgcttgaaag 8761 taggatttag gtatttgtgt ctgtattcat gaccaaaaat cttatctgaa ttcagggcca 8821 gcttcataag catgtgacct gtgcagacac atggaatcgt atgctctgca agaacccatg 8881 ctagatttaa tgctctgctg ttgtcatctt ggaatcctta gtcattttca aacaagagat 8941 attgtatttt catttttcac tgaccccaca aattatatag ctgattcagt gtgaatgtaa 9001 tatttctcaa taaatgctga ctgaaataaa tttgttgtta gatcaagtga tgttgtttta 9061 ctccattccc tctctgatat ctctgcgtca tttgactgtg ctcttgctaa aactcttcct 9121 ctctcccttt gtctctcacc agaggtgaag gaggcagatg aatgcctcag actccacagt 9181 gattggaatg ttaatgaatc ttcagggata tttaggaaaa taataacata agggctatct 9241 cctgtcaaag tgaggagagg tttgaactgg cattcctact tccctgagcc atgtgtatct 
9301 atgtgtgtct ttgtggatgt gcatattcat gggagaaatt catattttga gcaattagaa 9361 ggaggacata gggtttgagc cagagctaag gcaatgaagc aaaagagtga ttggatgcca 9421 gttaagctac atgcttgttt ttatttatta gtgtggctta ttattgttat tattatttac 9481 agtttgtttt tctttagttt ttatatggat tcttacattt acataattta cttcatttga 9541 gatacatttg gttgagccta ctggagagca ggaatctggt agaagtgcag tgtttatata 9601 catgtccagt caacccctcc cttggatgga ataattgagc agattgtgca tatttctttt 9661 cttttttttt gagatggagt ctcgatctgt tggccaggct ggagtgcagt ggctggtgtc 9721 agatcactgc aagctccacc tccggggttc atgccgttct cctgcctcag cctccccagt 9781 agctgggact acaggcgcct gtcaccactc ccggctattt tttttttaat ttttagtaga 9841 gacggggttt cgccatgtta gccaggatgg tcttgatctc ctgacctcgt gatccacccc 9901 ccttggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccag ccggttgtgc 9961 agatttttta gtggaagtta taaacttcat gctgggcatt ttcaagtaat cagcaagggt 10021 cccatgatca cacatccagg ctgcactctt actcatgaac aaggtatgag gccagccttt 10081 tcactgttga gagtccatca ccttccatat ggcacatcca atcagtctat ctagagtcag 10141 gccactagaa tcgggagagg aaggtagaat cttaaagggt tgtttggagc aacaaatgat 10201 gaaaggggtc agggttccat cacgtggtat aactctacag gggaggacct ctattagaat 10261 gaagaggttc aagtaggaac ctttccgttt ttggcacaag gtgtgcgaaa ggcacaagct 10321 atacctctgg agacaataga tcctcgagct c // LOCUS HSCEATG 4137 bp DNA PRI 07-MAR-1997 DEFINITION H.sapiens carcinoembryonic antigen gene. ACCESSION X62151 NID g29856 KEYWORDS carcinoembryonic antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4137) AUTHORS Barnett,T. TITLE Direct Submission JOURNAL Submitted (12-SEP-1991) T. Barnett, Miles, Inc., 400 Morgan Lane, West Haven, CT 06516, USA REFERENCE 2 (bases 1 to 4137) AUTHORS Barnett,T. 
TITLE Genomic DNA sequence upstream of the translational start of the carcinoembryonic antigen gene JOURNAL Unpublished FEATURES Location/Qualifiers source 1..4137 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV-immortalized lymphoblast" /cell_line="AG7647 (NIGMS Repository)" /clone_lib="EcoRI segment of cosmid; c103.2 from Alzheimer 6 cosmid library" /clone="CEA4kbN-term" 5'UTR 1..2616 CDS join(2617..2680,3605..3972) /codon_start=1 /product="carinoembryonic antigen" /db_xref="PID:e306038" /db_xref="PID:g1877203" /translation="MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFN VAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIY PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYRE" repeat_unit 3469..3500 /rpt_family="(AC)" BASE COUNT 1048 a 1200 c 1059 g 830 t ORIGIN 1 gaattccgga tccggatcta tgtccccgcc caaatctcat gtcaaattgt aaaccccaat 61 gttggaggtg gggccttgtg agaagtgatt ggataatgcg ggtggatttt ctgctttgat 121 gctgtttctg tgatagagat ctcacatgat ctggttgttt aaaagtgtgt agcacctctc 181 ccctctctct ctctctctct tgctcatgct ctgccatgta agacgttcct cgtttcccct 241 tcaccgtcca gaatgattgt aagttttctg aggcctcccc aggagcagaa gccactatgc 301 ttcctgtaca actgcagaat gatgagcgaa ttaaacctct tttctttata aattacccag 361 tctcaggtat ttctttatag caatgcgagg acagactaat acaatcttct actcccagat 421 ccccgcacac gcttagcccc agacatcact gcccctggga gcatgcacag cgcagcctcc 481 tgccgacaaa agcaaagtca caaaggtgac aaaaatctgc atttggggac atctgattgt 541 gaaagaggat ggacagtaca cttgtagcca cagagactgg ggctcaccga gctgaaacct 601 ggtagcactt tggcataaca tgtgcatgac ccgtgttcaa tgtctagaga tcagtgttga 661 gtaaaacagc ctggtctggg gccgctgctg tccccacttc cctcctgtcc accagagggc 721 ggcagagttc ctccctccca ccctggagcc tccccagggg ctgctgacct cctcagccgg 781 gcccacagcc tagcagggtc caccctcacc cgggtcacct cggcccacgt cctcctcgcc 841 ctccgagctc ctcacacgga ctctgtcagc tcctccctgc agcctatcgg ccgcccacct 901 gaggcttgtc ggccgcccac ttgaggcctg tcggctgccc tctgcaggca gctcctgtcc 961 cctacacccc ctccttcccc gggctcagct gaaagggcgt ctcccagggc agctccctgt 1021 gatctccagt acagctcagt 
ctctcacagg ctccgacgcc ccctatgctg tcacctcaca 1081 gccctgtcat taccattaac tcctcagtcc catgaagttc actgagcgcc tgtctcccgg 1141 ttacaggaaa actctgtgac agggaccacg tctgtaatgc tctctgtgga atcccaaggc 1201 ccagcccagt gcctgacacg gaacagatgc tccataaata ctggttaaat gtgtgggaga 1261 tctctaaaaa cagcatatca cctccgtgtg gcccccagca gtcagagtct gttccatgtg 1321 gacacagggg cactggcacc agcatgggag gaggccagaa gtgcccgcgg ctgccccagg 1381 aatgaggcct caacccccag agcttcagaa gggaggacag aggcctgcag ggaatagatc 1441 ctccggcctg accctgcagc ctaatccaga gttcagggtc agctcacacc acgtcgaccc 1501 tggtcagcat ccctagggca gttccagaca aggccggagg tctccttctt tgccctccag 1561 ggggtgacat tgcacacaga catcactcag gaaacggatt cccctggaca ggaacctggc 1621 tttgctaagg aagtggaggt ggagcctggt ttccatccct tgctccaaca gacccttctg 1681 atctctccca catacctgct ctgttccttt ctgggtccta tgaggaccct gttctgccag 1741 gggtccctgt gcaactccag actccctcct ggtaccacca tggggaaggt ggggtgatca 1801 caggacagtc agcctcgcag agacagagac cacccaggac agtcagggag aacatggaca 1861 ggccctgagc cgcagcttca gccaacagac acggagaggg agggtccccc tggagccttc 1921 cccaaggaca gcagagccca gagtcaccca cctccctcca ccacagtcct ctctttccag 1981 gacacacaag acacctcccc ctctacatgc aggatctggg gactcctgag acctctgggc 2041 ctgggtctcc atccctgggt cagtggcggg gttggtggta ctggagacag agggctggtc 2101 cctccccagc caccacccag tgagcctttt tctagccccc agagccacct ctgtcacctt 2161 cctgttgggc atcatcccac cttcccagag ccctggagag catggggaga cccgggaccc 2221 tgctgggttt ctctgtcaca aaggaaaata atccccctgg tgtgacagac ccaaggacag 2281 aacacagcag aggtcagcac tggggaagac aggttgtcct cccaggggat gggggtggat 2341 ccaccttgcc gaaaagattt gtctgaggaa ctgaaaatag aagggaaaaa agaggaggga 2401 caaaagaggc agaaatgaga ggggagggga cagaggacac ctgaataaag accacaccct 2461 gacccacgtg atgctgagaa gtactcctgc cctaggaaga gactcagggc agagggagga 2521 aggacagcag accagacagt cacagcagcc ttgacaaaac gttcctggaa ctcaagctct 2581 tctccacaga ggaggacaga gcagacagca gagaccatgg agtctccctc ggcccctccc 2641 cacagatggt gcatcccctg gcagaggctc ctgctcacag gtgaagggag gacaacctgg 2701 gagagggtgg gaggagggag cacagagact 
ggctggggtc tcctgggtag gacagggctg 2761 tgagacggac agaggctcct gttggagcct gaatagggaa gaggacatca gagagggaca 2821 ggagtcacac cagaaaaatc aaattgaact ggaattggaa aggggcagga aaacctcaag 2881 agttctattt tcctagttaa ttgtcactgg ccactacgtt tttaaaaatc ataataactg 2941 catcacatga cactttaaat aaaaacataa taactgcatc agatgacact ttaaataaaa 3001 acataaccag ggcatgaaac actgtcctca tccgcctacc gcggacattg gaaaataagc 3061 cccaggctgt ggagggccct gggaaccctc atgaactcat ccacaggaat ctgcagcctg 3121 tcccaggcac tggggtgcaa ccaagatcac acaaatccct gccctcatga agctcatgct 3181 ctcatgggga ggaagacaga catacaaaga gatctagaat gtgaggtcag gtgttgacaa 3241 gagccctgga ggaatagagc agggaaaggt cagaaaagga agacccaggg tctctagagg 3301 aggtgtcagg gaagggatct cccaagaatg ccctgatgtg agcaggacct gaaggcaatg 3361 gggagggagc cgtgaagacc cctggaaaag cagattccac acagggaaat gccaaggtcg 3421 gaggtgctaa ggaaatagga gacacactgc tgaccttgac ctagtaggac acacacacac 3481 acacacacac actcactcac tccagggctg ggggatgaag agacctgctc aggacccagg 3541 accccatttt tccaccctaa tgcataggtc ccaatattga ccgatgctct ctgctctctc 3601 ctagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 3661 acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 3721 catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 3781 ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 3841 atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 3901 accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 3961 taccgtgagt gattccccca tgacctctgg gtgttggggg tcagttctac ttcccacaca 4021 caggattatc aggcctgggc tgtgctgtgg ccccctctgc attacgcacc atgttagggt 4081 ttgggcattt agtgcaggat acacacagaa gagacaaact tcaacagatc agaattc // LOCUS HSCFOS 3565 bp DNA PRI 21-NOV-1994 DEFINITION Human cellular oncogene c-fos (complete sequence). ACCESSION V01512 NID g29903 KEYWORDS oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1  (bases 1 to 3565)
  AUTHORS   van Straaten,F., Muller,R., Curran,T., Van Beveren,C. and
            Verma,I.M.
  TITLE     Complete nucleotide sequence of a human c-onc gene: deduced amino
            acid sequence of the human c-fos protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80 (11), 3183-3187 (1983)
  MEDLINE   83221560
COMMENT     Data kindly reviewed (10-OCT-1983) by F. van Straaten.
FEATURES             Location/Qualifiers
     source          1..3565
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     mRNA            join(132..429,1183..1434,1866..1973,2088..3239)
                     /gene="c-fos"
     precursor_RNA   132..>3515
                     /gene="c-fos"
                     /note="possible transcript"
     exon            132..429
                     /gene="c-fos"
                     /note="(alternate start site)"
                     /number=1
     precursor_RNA   132..>3259
                     /gene="c-fos"
                     /note="possible transcript"
     mRNA            join(132..429,1183..1434,1866..1973,2088..3515)
                     /gene="c-fos"
     gene            132..3515
                     /gene="c-fos"
     exon            136..429
                     /gene="c-fos"
                     /note="(alternate start site)"
                     /number=1
     precursor_RNA   136..>3259
                     /gene="c-fos"
                     /note="possible transcript"
     mRNA            join(136..429,1183..1434,1866..1973,2088..3239)
                     /gene="c-fos"
     precursor_RNA   136..>3515
                     /gene="c-fos"
                     /note="possible transcript"
     mRNA            join(136..429,1183..1434,1866..1973,2088..3515)
                     /gene="c-fos"
     CDS             join(289..429,1183..1434,1866..1973,2088..2729)
                     /gene="c-fos"
                     /codon_start=1
                     /db_xref="PID:g29904"
                     /db_xref="SWISS-PROT:P01100"
                     /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPV
                     NAQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPS
                     AGAYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRE
                     LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSV
                     ASLDLTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPA
                     SSRPSGSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSC
                     TAYTSSFVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL"
     intron          430..1182
                     /gene="c-fos"
                     /note="intron I"
     exon            1183..1434
                     /gene="c-fos"
                     /number=2
     intron          1435..1865
                     /gene="c-fos"
                     /note="intron II"
     exon            1866..1973
                     /gene="c-fos"
                     /number=3
     intron          1974..2087
                     /gene="c-fos"
                     /note="intron III"
     exon            2088..3239
                     /gene="c-fos"
                     /note="(alternate stop site)"
                     /number=4
     exon
2088..3515 /gene="c-fos" /note="(alternate stop site)" /number=4 BASE COUNT 780 a 954 c 978 g 853 t ORIGIN 1 gcagccgggc ggccgcagaa gcgcccaggc ccgcgcgcca cccctctggc gccaccgtgg 61 ttgagcccgt gacgtttaca ctcattcata aaacgcttgt tataaaagca gtggctgcgg 121 cgcctcgtac tccaaccgca tctgcagcga gcaactgaga agccaagact gagccggcgg 181 ccgcggcgca gcgaacgagc agtgaccgtg ctcctaccca gctctgcttc acagcgccca 241 cctgtctccg cccctcggcc cctcgcccgg ctttgcctaa ccgccacgat gatgttctcg 301 ggcttcaacg cagactacga ggcgtcatcc tcccgctgca gcagcgcgtc cccggccggg 361 gatagcctct cttactacca ctcacccgca gactccttct ccagcatggg ctcgcctgtc 421 aacgcgcagg taaggctggc ttcccgtcgc cgcggggccg ggggcttggg gtcgcggagg 481 aggagacacc gggcgggacg ctccagtaga tgagtagggg gctcccttgt gcctggaggg 541 aggctgccgt ggccggagcg gtgccggctc gggggctcgg gacttgctct gagcgcacgc 601 acgcttgcca tagtaagaat tggttccccc ttcgggaggc aggttcgttc tgagcaacct 661 ctggtctgca ctccaggacg gatctctgac attagctgga gcagacgtgt cccaagcaca 721 aactcgctaa ctagagcctg gcttcttcgg ggaggtggca gaaagcggca atcccccctc 781 ccccggcagc ctggagcacg gaggagggat gagggaggag ggtgcagcgg gcgggtgtgt 841 aaggcagttt cattgataaa aagcgagttc attctggaga ctccggagcg gcgcctgcgt 901 cagcgcagac gtcagggata tttataacaa accccctttc aagcaagtga tgctgaaggg 961 ataacgggaa cgcagcggca ggatggaaga gacaggcact gcgctgcgga atgcctggga 1021 ggaaaagggg gagacctttc atccaggatg agggacattt aagatgaaat gtccgtggca 1081 ggatcgtttc tcttcactgc tgcatgcggc actgggaact cgccccacct gtgtccggaa 1141 cctgctcgct cacgtcggct ttccccttct gttttgttct aggacttctg cacggacctg 1201 gccgtctcca gtgccaactt cattcccacg gtcactgcca tctcgaccag tccggacctg 1261 cagtggctgg tgcagcccgc cctcgtctcc tctgtggccc catcgcagac cagagcccct 1321 caccctttcg gagtccccgc cccctccgct ggggcttact ccagggctgg cgttgtgaag 1381 accatgacag gaggccgagc gcagagcatt ggcaggaggg gcaaggtgga acaggtgagg 1441 aactctagcg tactcttcct gggaatgtgg gggctgggtg ggaagcagcc ccggagatgc 1501 aggagcccag tacagaggat gaagccactg atggggctgg ctgcacatcc gtaactggga 1561 gccctggctc caagcccatt ccatcccaac tcagactctg agtctcaccc 
taagaagtac 1621 tctcatagtt tcttccctaa gtttcttacc gcatgctttc agactgggct cttctttgtt 1681 ctcttgctga ggatcttatt ttaaatgcaa gtcacaccta ttctgcaact gcaggtcaga 1741 aatggtttca cagtggggtg ccaggaagca gggaagctgc aggagccagt tctactgggg 1801 tgggtgaatg gaggtgatgg cagacacttt tactgaatgt cggtcttttt ttgtgattat 1861 tctagttatc tccagaagaa gaagagaaaa ggagaatccg aagggaaagg aataagatgg 1921 ctgcagccaa atgccgcaac cggaggaggg agctgactga tacactccaa gcggtaggta 1981 ctctgtgggt tgctcctttt taaaacttaa gggaaagttg gagattgagc ataagggccc 2041 ttgagtaaga ctgtgtctta tgctttcctt tatccctctg tatacaggag acagaccaac 2101 tagaagatga gaagtctgct ttgcagaccg agattgccaa cctgctgaag gagaaggaaa 2161 aactagagtt catcctggca gctcaccgac ctgcctgcaa gatccctgat gacctgggct 2221 tcccagaaga gatgtctgtg gcttcccttg atctgactgg gggcctgcca gaggttgcca 2281 ccccggagtc tgaggaggcc ttcaccctgc ctctcctcaa tgaccctgag cccaagccct 2341 cagtggaacc tgtcaagagc atcagcagca tggagctgaa gaccgagccc tttgatgact 2401 tcctgttccc agcatcatcc aggcccagtg gctctgagac agcccgctcc gtgccagaca 2461 tggacctatc tgggtccttc tatgcagcag actgggagcc tctgcacagt ggctccctgg 2521 ggatggggcc catggccaca gagctggagc ccctgtgcac tccggtggtc acctgtactc 2581 ccagctgcac tgcttacacg tcttccttcg tcttcaccta ccccgaggct gactccttcc 2641 ccagctgtgc agctgcccac cgcaagggca gcagcagcaa tgagccttcc tctgactcgc 2701 tcagctcacc cacgctgctg gccctgtgag ggggcaggga aggggaggca gccggcaccc 2761 acaagtgcca ctgcccgagc tggtgcatta cagagaggag aaacacatct tccctagagg 2821 gttcctgtag acctagggag gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc 2881 aaggacttga aagcatccat gtgtggactc aagtccttac ctcttccgga gatgtagcaa 2941 aacgcatgga gtgtgtattg ttcccagtga cacttcagag agctggtagt tagtagcatg 3001 ttgagccagg cctgggtctg tgtctctttt ctctttctcc ttagtcttct catagcatta 3061 actaatctat tgggttcatt attggaatta acctggtgct ggatattttc aaattgtatc 3121 tagtgcagct gattttaaca ataactactg tgttcctggc aatagtgtgt tctgattaga 3181 aatgaccaat attatactaa gaaaagatac gactttattt tctggtagat agaaataaat 3241 agctatatcc atgtactgta gtttttcttc aacatcaatg ttcattgtaa tgttactgat 
3301 catgcattgt tgaggtggtc tgaatgttct gacattaaca gttttccatg aaaacgtttt 3361 attgtgtttt taatttattt attaagatgg attctcagat atttatattt ttattttatt 3421 tttttctacc ttgaggtctt ttgacatgtg gaaagtgaat ttgaatgaaa aatttaagca 3481 ttgtttgctt attgttccaa gacattgtca ataaaagcat ttaagttgaa tgcgaccaac 3541 cttgtgctct tttcattctg gaagt // LOCUS HSCGN3PRO 1202 bp DNA PRI 21-APR-1997 DEFINITION H.sapiens promoter region of collagenase 3 gene. ACCESSION X81640 NID g1945758 KEYWORDS Collagenase-3; promoter region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1202) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (14-SEP-1994) C. Lopez-Otin, Universidad de Oviedo, Dept de Biologia Funcional, Area de Bioquimica, Fac. de Medicina, C/Julian Claveria S/N, 33006 Oviedo, SPAIN REFERENCE 2 (bases 1 to 1202) AUTHORS Pendas,A.M., Balbin,M., Llano,E., Jimenez,M.G. and Lopez-Otin,C. TITLE Structural analysis and promoter characterization of the human collagenase-3 gene (MMP13) JOURNAL Genomics 40 (2), 222-233 (1997) MEDLINE 97237040 COMMENT related sequence: X75308. 
FEATURES Location/Qualifiers source 1..1202 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q22" /clone_lib="lambda gt10" promoter 303..309 /note="Pea-3" promoter 353..359 /note="Ap-1" TATA_signal 372..379 mRNA join(427..550,644..>889) exon 427..550 /number=1 CDS join(431..550,644..889) /codon_start=1 /product="collagenase 3" /db_xref="PID:e118563" /db_xref="PID:g1945759" /translation="MHPGVLAAFLFLSWTHCRALPLPSGGDEDDLSEEDLQFAERYLR SYYHPTNLAGILKENAASSMTERLREMQSFFGLEVTGKLDDNTLDVMKKPRCGVPDVG EYNVFPRTLKWSKMNLTYR" intron 551..643 /number=1 exon 644..889 /number=2 BASE COUNT 367 a 239 c 235 g 361 t ORIGIN 1 tctagaatca gtactaagtt tctctttatg gaagtaaaca tgccatcttg atacacttat 61 tattcgaaga agcaaaagta gatacgttct tacagaaggc aaaaaaaaaa aatttttgct 121 aagtgaagta aaaaatgtac tactctctgc ttcttcccac agtatccata aatatgctga 181 ggccgtttat tttgccagat gggttttgag accctgctga aacaagagat gctctcattt 241 atatttccct caaattctac cacaaaccac actcgggagg gaaaagaaaa agtcgccacg 301 taagcatgtt taccttcaag tgactaggaa gtggaaacct atccataagt gatgactcac 361 cattgcaggc ctataaaagt aaaggtaatc tctgcggaaa gacaacagtc cccaggcatc 421 accattcaag atgcatccag gggtcctggc tgccttcctc ttcttgagct ggactcattg 481 tcgggccctg ccccttccca gtggtggtga tgaagatgat ttgtctgagg aagacctcca 541 gtttgcagag gtagagtatc ttgccaatcc tgatgatgcg gttggtacat ctcagaaatg 601 tccttctcct tttaaacgca gttcattttg gtgtcttttc tagcgctacc tgagatcata 661 ctaccatcct acaaatctcg cgggaatcct gaaggagaat gcagcaagct ccatgactga 721 gaggctccga gaaatgcagt ctttcttcgg cttagaggtg actggcaaac ttgacgataa 781 caccttagat gtcatgaaaa agccaagatg cggggttcct gatgtgggtg aatacaatgt 841 tttccctcga actcttaaat ggtccaaaat gaatttaacc tacaggtaaa tcataggcta 901 tctttctttc atatttgtta gccttttctt gtagtattaa atgtctaata ttaagtaggc 961 cttattattt aattggcagt taaattccag aatagcttta atatggttgc cgagaatacc 1021 tcataacatc tattattgtt acattaactc agcattattt tttagtgggc atctcaaagt 1081 atttctttta gagatgaaat caaaagagaa ataaaagtag gttaaagaaa atttgcaaac 1141 atagtacatt aggtgtattt attacaacag 
tgctcaagat gaagagttcc actctttcta 1201 ga // LOCUS HSCKBG 4200 bp DNA PRI 24-APR-1993 DEFINITION Human gene for creatine kinase B (EC 2.7.3.2). ACCESSION X15334 NID g29962 KEYWORDS creatine kinase; creatine kinase B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4200) AUTHORS Mariman,E.C.M. TITLE Direct Submission JOURNAL Submitted (23-MAY-1989) Mariman E.C.M., University of Nijmegen, Dept of Human Genetics, Geert Grooteplein Z18, 6525 GA Nijmegen, Netherlands REFERENCE 2 (bases 1 to 4200) AUTHORS Mariman,E.C., Schepens,J.T. and Wieringa,B. TITLE Complete nucleotide sequence of the human creatine kinase B gene JOURNAL Nucleic Acids Res. 17 (15), 6385 (1989) MEDLINE 89366665 REFERENCE 3 (bases 1 to 4200) AUTHORS Mariman,E. and Wieringa,B. TITLE Expression of the gene encoding human brain creatine kinase depends on sequences immediately following the transcription start point JOURNAL Gene 102 (2), 205-212 (1991) MEDLINE 91340154 COMMENT The sequence overlaps with that reported by Mariman et. al. in Genomics 1:126-137(1987)J03036 and by Daouk et. al. in J. Biol. Chem. 263:2442-2446(1988)J03531. Data kindly reviewed (13-NOV-1989) by Mariman E.C.M. 
FEATURES Location/Qualifiers source 1..4200 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CK3-1" /tissue_type="brain" /clone_lib="EMBL3" /map="chromosome 14q32.3" CAAT_signal 724..728 TATA_signal 742..749 CAAT_signal 754..758 exon 808..875 /number=1 mRNA join(808..875,1136..1340,1464..1618,1691..1823,2208..2379, 3050..3173,3331..3520,3600..4075) intron 876..1135 /number=1 exon 1136..1340 /number=2 CDS join(1148..1340,1464..1618,1691..1823,2208..2379, 3050..3173,3331..3520,3600..3778) /EC_number="2.7.3.2" /codon_start=1 /product="creatine kinase B" /db_xref="PID:g29963" /db_xref="SWISS-PROT:P12277" /translation="MPFSNSHNALKLRFPAEDEFPDLSAHNNHMAKVLTPELYAELRA KSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVAGDEESYEVFKDLFDPIIEDRHGGYKP SDEHKTDLNPDNLQGGDDLDPNYVLSSRVRTGRSIRGFCLPPHCSRGERRAIEKLAVE ALSSLDGDLAGRYYALKSMTEAEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIW HNDNKTFLVWVNEEDHLRVISMQKGGNMKEVFTRFCTGLTQIETLFKSKDYEFMWNPH LGYILTCPSNLGTGLRAGVHIKLPNLGKHEKFSEVLKRLRLQKRGTGGVDTAAVGGVF DVSNADRLGFSEVELVQMVVDGVKLLIEMEQRLEQGQAIDDLMPAQK" intron 1341..1463 /number=2 exon 1464..1618 /number=3 intron 1619..1690 /number=3 exon 1691..1823 /number=4 intron 1824..2207 /number=4 exon 2208..2379 /number=5 intron 2380..3049 /number=5 exon 3050..3173 /number=6 intron 3174..3330 /number=6 exon 3331..3520 /number=7 intron 3521..3599 /number=7 exon 3600..4075 /number=8 polyA_signal 3952..3957 BASE COUNT 586 a 1443 c 1477 g 694 t ORIGIN 1 gatcagtttt tttttttaat cgcacttatg cttattgttt attagcgttt cctcccatct 61 ttgcctgaag tctccgggga ctgcctttgg gggtcgggta aacttgtccc ctgcgaagag 121 ggcccagggt tggggtctgg aaactccgag gctgcacttg ccagcggcct cttaaggcca 181 cagcgtcccc gtggtttctg gctcgcagcc ccccgagacc caggacttgt ccaaggtcag 241 ggcaccgcgg gtgcccccgg gctgggccgc agcagactgc gcttcccgcg cgccttcgct 301 ttgcaccagg atcgcccagg aaatgcctgc gggcaccttg aggaaggtcg gcggctccgg 361 gccagctcgc actggccggg gtggggcggg ggccgtacct gctgcggaag ccccgaaagc 421 tttcgcccgg cccctcgccg ccgccgcggg ggctggctgg actaggcggg caggctcgag 481 gatgcggatg aacccaagcg 
tcctcgagtg cccggaggct ctccgcctca gtttcccgcc 541 cagaggcaag ggcgtgcgag gggatccaga tatccaagga cctgaggttt cggcctcgag 601 gtcttgggcg ggggactggg caggctgcgc ggggtcccag cgaggggaca gctcgggtgg 661 gcggccaggg tgttgggggc tgcgggcggc ggacaaagcg gcggcaccac cccgcggcgc 721 gggccaatgg aatgaatggg ctataaatag ccgccaatgg gcggcccgcg ttgtgcccct 781 taagagccgc gggagcgcgg agcggccgct gttcgcctgc gtcgctccgg gagctgccga 841 cggacggagc gcccccgccc ccgcccggcc gcccggtgag tgggcccggg ggccgggggc 901 gtccgcgccc gggctagggg cgctgcgagc aaagggggcg cgtcgcctgg agcgcgcgcc 961 ggaccggccg ggggtccccg gcgatgatgg cgctccccgc gcgcgctgcg gaccccgctg 1021 accttggccg cgtcccgggg ggcgccgggg ggcccggcgg cgggggcctg agtggtacgc 1081 gggagcccgg gaaccccggc gtgccggtcc cctctgaccc cgcgtctccc cgcagcccgc 1141 cgccgccatg cccttctcca acagccacaa cgcactgaag ctgcgcttcc cggccgagga 1201 cgagttcccc gacctgagcg cccacaacaa ccacatggcc aaggtgctga cccccgagct 1261 gtacgcggag ctgcgcgcca agagcacgcc gagcggcttc acgctggacg acgtcatcca 1321 gacaggcgtg gacaacccgg gtacgcgacc cctcggggcc ggggtcccgg ccccccctcc 1381 ccccgcgcag ccgcagggtc ctcagcagcg cgctcgggcc cggcagtgac gtcactgtcc 1441 ccgtcccgcg ccccctcccc caggccaccc gtacatcatg accgtgggct gcgtggcggg 1501 cgacgaggag tcctacgaag tgttcaagga tctcttcgac cccatcatcg aggaccggca 1561 cggcggctac aagcccagcg atgagcacaa gaccgacctc aaccccgaca acctgcaggt 1621 gcggggctgc gggcgggccg ggcgggcggg gccggggtct tcgggcgctc actcccgtct 1681 cgcctcccag ggcggcgacg acctggaccc caactacgtg ctgagctcgc gggtgcgcac 1741 gggccgcagc atccgtggct tctgcctccc cccgcactgc agccgcgggg agcgccgagc 1801 catcgagaag ctcgcggtgg aaggtagggg ccgggcgggc cgaggggcgg cggcggccgc 1861 gtccccctcc cggcgcggtc cccgcccgct tttgtttacg tcgcccggga gcggcagccg 1921 ccgtcgcgct cttatctgcg cgcgcccggg ttcagtttcc cggacccacc gagggacgga 1981 ggcccagccc ccgcgcccac agcggcctgg ggcccaggga gggcgggtcc tggcgcgggg 2041 tcaccgcctg ggaccgtcgc ccgggccgtg aggactggac gcccgcagat ccgggcgggt 2101 ggggccctct gacgtccccc gaggtggggc acgggggcgg gcgggtccgc gctgcgggct 2161 ggaggggcgg gcgcgggagc ccagcgtcct 
gagcgcaccc ctcgcagccc tgtccagcct 2221 ggacggcgac ctggcgggcc gatactacgc gctcaagagc atgacggagg cggagcagca 2281 gcagctcatc gacgaccact tcctcttcga caagcccgtg tcgcccctgc tgctggcctc 2341 gggcatggcc cgcgactggc ccgacgcccg cggtatctgg tgcgtgtccc tctgcgccct 2401 ctcgcggcgt cctccctccc cgctacctcc gctttccctc tcgcccccct cgcgggggtg 2461 gggcccctcg cggcgaggag gaggaggagg aggagggagg ggccggccgc gctccgggtc 2521 tgggttccgt gccgcgcctc ctcctgcgcc ggtgaccttg gccgagcagg tgcgttaagg 2581 gactgggccc cggcccgtgg gggctcagga ctcagcaaca cctccccacc ccgagacgtg 2641 aggtgggggc ggggctctct ggcgcctctc cccgacggcc ctgggagctg gagctctttg 2701 ttttcttttc tcactcctcc gccgctggga ttctaccagg ggctggtgac gccaaagctt 2761 ctccaggggc agggctccta cccccactgt ggggggcggg tcgggctgtc ctggcggtcc 2821 ctggccccgc cccacctcgg gccacagcgc atgatggcag ctggggttct cctgctgtga 2881 ggcgtcccgg ttcccccgcc cgccccgtgt tggcgggtgg agtcttggca gcagcctcca 2941 ctcctgggca tggcagggag cagcacctca gggacttggg aagttccttt ggtctggggg 3001 cggcctgggg cttttttctg ggtatgccct gagaccagcc ctcccgcagg cacaatgaca 3061 ataagacctt cctggtgtgg gtcaacgagg aggaccacct gcgggtcatc tccatgcaga 3121 aggggggcaa catgaaggag gtgttcaccc gcttctgcac cggcctcacc caggtgccag 3181 ggacggggca ggcccagacc ccagggcccc agcagggatg tgggtgcccc agcatcagtc 3241 cccccggggg atttccggca ctggggagtc tcagggcctg taggggtttc aggcaggcct 3301 tctccctcat accctcttct ccgtctgcag attgaaactc tcttcaagtc taaggactat 3361 gagttcatgt ggaaccctca cctgggctac atcctcacct gcccatccaa cctgggcacc 3421 gggctgcggg caggtgtgca tatcaagctg cccaacctgg gcaagcatga gaagttctcg 3481 gaggtgctta agcggctgcg acttcagaag cgaggcacag gtgagcaggg caggtgctgc 3541 ggcttcccgt ggcctttggg cagccctgtt tcctccgccc tgacttgctg tctccccagg 3601 cggtgtggac acggctgcgg tgggcggggt cttcgacgtc tccaacgctg accgcctggg 3661 cttctcagag gtggagctgg tgcagatggt ggtggacgga gtgaagctgc tcatcgagat 3721 ggaacagcgg ctggagcagg gccaggccat cgacgacctc atgcctgccc agaaatgaag 3781 cccggcccac acccgacacc agccctgctg cttcctaact tattgcctgg gcagtgccca 3841 ccatgcaccc ctgatgttcg ccgtctggcg agcccttagc 
cttgctgtag agacttccgt 3901 cacccttggt agagtttatt tttttgatgg ctaagatact gctgatgctg aaataaacta 3961 gggttttggc ctgcctgcgt ctgagtggtg cctctccttt cccagggggg agggggaagg 4021 gcagcagcca ggccccagga gtcttgagtc ctgggcctgc tgtgggcctc gccttctgtg 4081 agatgggaca agagccagga ggtggccact ctgttctgcc tgccctacct agtccatggg 4141 ccccttccct cgtgtctatc gggctgtgca ggcaggaaca tgggagagag cgagggagga // LOCUS HSCKIIBE 5917 bp DNA PRI 24-MAR-1995 DEFINITION Human gene for casein kinase II subunit beta (EC 2.7.1.37). ACCESSION X57152 NID g29968 KEYWORDS casein kinase; cytoplasmic protein; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5917) AUTHORS Pyerin,W. TITLE Direct Submission JOURNAL Submitted (09-JAN-1991) W. Pyerin, DEUTSCHES KREBSFORSCHUNGSZENTRUM, BIOCHEMICAL CELL PHYSIOLOGY, INST OF EXPERIMENTAL PATHOLOGY, GERMAN CANCER RESEARCH CENTER, IM NEUENHEIMER FELD 280, D-6900 HEIDELBERG F R G REFERENCE 2 (bases 1 to 5917) AUTHORS Voss,H., Wirkner,U., Jakobi,R., Hewitt,N.A., Schwager,C., Zimmermann,J., Ansorge,W. and Pyerin,W. TITLE Structure of the gene encoding human casein kinase II subunit beta JOURNAL J. Biol. Chem. 266 (21), 13706-13711 (1991) MEDLINE 91310643 REFERENCE 3 (bases 1 to 5917) AUTHORS Walter,P. TITLE Human casein kinase II: structures, genes expression and requirement in cell growth stimulation JOURNAL Adv. Enzyme Regul. 
34, 225-246 (1994) MEDLINE 95028755 FEATURES Location/Qualifiers source 1..5917 /organism="Homo sapiens" /db_xref="taxon:9606" GC_signal 328..332 GC_signal 414..418 CAAT_signal 481..487 GC_signal 589..593 exon 683..1011 /note="casein kinase II subunit beta" /number=1 mRNA join(683..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; major start sites" /evidence=experimental GC_signal 698..702 mRNA join(715..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; major start site" /evidence=experimental CAAT_signal 759..765 mRNA join(795..1011,1623..1705,2672..2774,3344..3459, 3906..3981,4128..4317,4645..4875) /note="casein kinase II subunit beta; minor start site" /evidence=experimental intron 1012..1622 /number=1 exon 1623..1705 /note="casein kinase II subunit beta" /number=2 CDS join(1634..1705,2672..2774,3344..3459,3906..3981, 4128..4317,4645..4735) /EC_number="2.7.1.37" /note="protein kinase" /codon_start=1 /product="casein kinase II subunit beta" /db_xref="PID:g29969" /db_xref="SWISS-PROT:P13862" /translation="MSSSEEVSWISWFCGLRGNEFFCEVDEDYIQDKFNLTGLNEQVP HYRQALDMILDLEPDEELEDNPNQSDLIEQAAEMLYGLIHARYILTNRGIAQMLEKYQ QGDFGYCPRVYCENQPMLPIGLSDIPGEAMVKLYCPKCMDVYTPKSSRHHHTDGAYFG TGFPHMLFMVHPEYRPKRPANQFVPRLYGFKIHPMAYQLQLQAASNFKSPVKTIR" intron 1706..2671 /number=2 repeat_region 1934..2212 /rpt_family="ALU" exon 2672..2774 /note="casein kinase II subunit beta" /number=3 intron 2775..3343 /number=3 exon 3344..3459 /note="casein kinase II subunit beta" /number=4 intron 3460..3905 /number=4 exon 3906..3981 /note="casein kinase II subunit beta" /number=5 intron 3982..4127 /number=5 exon 4128..4317 /note="casein kinase II subunit beta" /number=6 intron 4318..4644 /number=6 exon 4645..4875 /note="casein kinase II subunit beta" /number=7 BASE COUNT 1363 a 1452 c 1672 g 1430 t ORIGIN 1 gatctgtcgg ttggggtcct acttttacat aacgccccca caatgccctt cgccttcctc 61 aacgtggccc 
ccgctccaag cccattttct ggagccagga atccactctg tgggttagga 121 aaggccctca ggaggcggag ggaaacctgt ggaatgccga gaagccgtgt aatgaaataa 181 cggtcacggc ctggcccctc accattactc tgaccagggt tcgaaggtca cacttagagc 241 ctaaggggaa atggagaagt gcaaagggac gagcagaatg gctggcacca cctcaggtta 301 gcgcactggg acgttccagt tctcacaccg cccaccccac cccacccaag tcctacgcac 361 ggagccaagc cgcacctctc ccctcatgag gcaggagccc cggaggaaac agtacgcccg 421 tcaagggtct ctggcgggac tgattcgcac taggggccca acaggcaata aggacccagc 481 ggattggccg aggataggcc agtcccctgg gcagcagcgc cgcgccggga ctagagggga 541 acgtgaggag agctgcggaa agagatccag cctggctccc tcctttcccc gccctaagtc 601 agcctcttca cccagtgagc acaaaactgt attgcccaga ctcccgggcc ccgaacgcca 661 tacctggctt ccgcttccgg tggcttctcg ttgtgccccg cccgcaagcg ccctcctccg 721 ggccttcgtg acagccaggt cgtgcgcggg tcatcctggg attggtagtt cgctttctct 781 catttagcca gtttctttct ctaccgggga ctccgtgtcc cggcatccac cgcggcacct 841 gacccttggc gcttgcgtgt tgccctcttc cccaccctcc ctaatttcca ctccccccac 901 cccacttcgc ctgccgcggt cgggtccgcg gcctgcgctg tagcggtcgc cgccgttccc 961 tggaagtagc aacttcccta ccccacccca gtcctggtcc ccgtccagcc ggtgagtctg 1021 aagtcgtcgc tgctccgagt cccttgtcgc tgggagcggc acatggggtc tccggacttt 1081 gatgtggggc gggggaggaa gcgaccaggt ccggcacgaa ggagggagag gtggcctgag 1141 gagcggaggg gggatgtgtg gattccggtg aaagggacct gacaatcgcc cccaacccgt 1201 gagaaaagga ggagcccagt tcttgcttga gaatgataaa cttggaaacc cttgggaaag 1261 gcgtgggggt catgcagaga cttgtattgg tagggagcct gagtcgaggt ccctgccgga 1321 gttgacacag aggagagagg gccctggcct tcgggagctc cagggatgtg ggtcgggctg 1381 gtgggtcaaa gtatctgttg gcttctttca agtggtggga ccccaaagaa tgtttaactt 1441 caaagaaaag gggctgagat gtaaattaga ggagctggag aggagtgctt cagagtttgg 1501 gttgctttaa gaaagggtgg ttccgaattc tcccgtggtt ggagggccga atgtgggagg 1561 agggaggata ccagaggcag ggaaggagaa cttgagcttt actgacactg ttctttttct 1621 agctgacgtg aagatgagca gctcagagga ggtgtcctgg atttcctggt tctgtgggct 1681 ccgtggcaat gaattcttct gtgaagtgag ttctcttcaa cctccctact tgccagcttc 1741 acatatcttc ccaccagacg ttccttcaca 
tattccactt ctacactgtt ctcttacatg 1801 ctatttgaaa acttcctatc agcaaagagt cccccctata aaccccgacg aacctgtgct 1861 aaagtggcaa aactggggcc caagtcctga gtctgccacc gtccagcaat ataacgttgg 1921 gctagtcaat ttgtgtcttt ttcttttttt tgagactggg tctcactctg tcaccgaggc 1981 tggagggtag tggtgcgatc tcggcttact gccacctctg cctcccaggt tcaagcgatt 2041 ctcctgctcc agcctcccaa gtagctggga ttacaagtgc ctgccaccat gcctggctaa 2101 tttttgtatt tttagtagag acagggtttc actatgttgg caaggctggt ctcgaactcc 2161 agacctcagg gtgatctgcc tgcctcgggc ctcccaaagt gctgggatta caggcgtgag 2221 cattgcgccc ggcctgtatc ttttgttact aaagtggcac tgctagtact tgtctcaggt 2281 ggcctttagg aaaactgaaa tgctacacat tgaaatgttt tgttcagaaa ccatgctgtt 2341 cagcttccac cttccttagc cagctgagag gacaaaactg gttcctagag acgggataca 2401 ggagtggagt agggacaaag atcttggaaa agaatgtcta agaaaaagat tgctgtatct 2461 acttatcctt agaaaagaaa agccaaagct tttatgggag agagtgtagg tgaactaggg 2521 agagacacaa gtacttctgc tgagttggga gtgagaaaca agcacaacag atgcagttgt 2581 gttgatgata aggcatcact tagagcattt tgcccaggtc aaagatgagg attttgatat 2641 gggttccctc ttggcttcca tgtcctgaca ggtggatgaa gactacatcc aggacaaatt 2701 taatcttact ggactcaatg agcaggtccc tcactatcga caagctctag acatgatctt 2761 ggacctggag cctggtgagg caccctcagg gttgttttgt gtgtgtgcgt gcactatttt 2821 tctcttcaaa tctctattca cttgcctgaa ttttgccaaa tttcctttgg ttctctgatt 2881 tctttaaccc caaattcatg ctttattttg atcctccacc tgactcttgt ctagttttgt 2941 gacgtatatc acttgttctc atgttttcta aatccgcaat tcagacctat tccaaaatgc 3001 gtttcctcag ggtctggttt gttgtctgtt tctcctgctt tgcaccttcc agtctagagt 3061 ttcatcttct gcattgacat tgttgcagtt atgtattgag gagggagttg ggagggagag 3121 caaggagcag aggctgaaaa ggtgtgaggg gaaggcagag ctgtcttcgt ttgatgcaag 3181 ggtcagaagc ccaggtttct gggtcccatg cccagatgtt ggatggggta aggcccaaaa 3241 gtaggtgcta ggcaaactga atagcccgca gcccctggat atgggcaggg cacctaggaa 3301 agctgaaaaa caagtagttg catttggccg ggctgtggtt cagatgaaga actggaagac 3361 aaccccaacc agagtgacct gattgagcag gcagccgaga tgctttatgg attgatccac 3421 gcccgctaca tccttaccaa ccgtggcatc gcccagatgg 
tgaggcctct ctgctcctac 3481 ctgcctcctt ctgagcagta agagacacag gttcctgcag caagaagtca tgtttaagcc 3541 ctgtttaagg aagctagctg agaagagggg aagaacccca gaacttgggc ctgggaattg 3601 aattctgatt gggggtcatc ctgaagggat tgttttcagg gagggagaca gaccttgaat 3661 cagagagttg tgatagactg cctcttcctc aaggaacaaa caacaaatgg ctctgatggt 3721 ttgtagccct gccctaattt ggaagaaagg caacacagaa gtttgagagc ccatctagtc 3781 cagagaaggg ggcctctgga cagagttgga aggagtgccg acagagttgg tatgggttgg 3841 gctgcgaagg gagttgcctc ttctttacat ctacctgcca accccttcca ttgtattcac 3901 ctcagttgga aaagtaccag caaggagact ttggttactg tcctcgtgtg tactgtgaga 3961 accagccaat gcttcccatt ggtgagtgtt gaagaaggga aaggaaagca ccgtgtggca 4021 gtcttatggg aaggagttgg ggctcaacac attggagcct gagtcctgag gggaggttag 4081 gtaggaatag ggggatacct ggcctgctga gtctggctgt ctcccaggcc tttcagacat 4141 cccaggtgaa gccatggtga agctctactg ccccaagtgc atggatgtgt acacacccaa 4201 gtcatcaaga caccatcaca cggatggcgc ctacttcggc actggtttcc ctcacatgct 4261 cttcatggtg catcccgagt accggcccaa gagacctgcc aaccagtttg tgcccaggta 4321 gggagcaggg agagtcatta agggtcaaag gaaaggccca agatccccca gagaggggag 4381 gacagggcat ggccctttct tgaggtctgc ttctcccaga atcagggcat ctccctgctg 4441 agtgactgtg ggaaagttat ttgattatct gtgcttgagt taccttattg tagaatgttc 4501 ttgagctgag aagttgggaa ccacgaggct ttagctctga gcaggtccat agaggagctc 4561 aggtggggag gtgggaatgc aggtgactgg cagggcctgg atggggctca tgctgctgcc 4621 tctctgacct ctgccctggc ctaggctcta cggtttcaag atccatccga tggcctacca 4681 gctgcagctc caagccgcca gcaacttcaa gagcccagtc aagacgattc gctgattccc 4741 tcccccacct gtcctgcagt ctttgtcttt tcctttcttt tttgccaccc tttcaggaac 4801 cctgtatggt ttttagttta aattaaagga gtcgttatcg tggtgggaat atgaaataaa 4861 gtagaagaaa aggccatgag ctagtctgct ggtgcttgct gttggggaag ggaaggtgat 4921 ggtgtgttgg actccagggg ccctcatggc ccagcccacc ctccccagat tgaaaaccag 4981 gacagatttg tgctcagtgg attgggtggt gtttttagta tggagcagaa cagaattcct 5041 aggactgcgt gtgatgaaat gcaaggtcaa aaggaaaaga caaagcatat ttcaaagatg 5101 agaaatattt gtttggatat ctatgactgt ctgtttatac tgtaaggggc 
ttaatcagca 5161 gctccatctt ttagttttag ttctaaagga aaagtagcct aaagtcagta taactaaagg 5221 gtggaacgag gtgggacaag gtccggaatt gctgctcagt gatgtgtgtg tgcctgccgc 5281 tggtggagct gagactgctc actctcagaa ggatggggat gcttgatttc ctggccaggt 5341 tgtcccagca cagtggggat tggccctgtt gtatgacgaa gacagcacat ggtggcagag 5401 atagatacta acccatggac tttccaaggg agggaatagg tctttggagg gtatgcaaga 5461 caaaggtaga cactggataa agaacccggt agtgcccagg tattacccca tctgggccat 5521 tactcccaca ctcaggaacc agacgttgtg ggtgaggaca tgctgtccct cctgccaagt 5581 aataacttcc ttcccagcca ggatcctgcc ccaagtagga atatagctct gcatttacag 5641 cagctcctgc tcagaccttg tcaaaaccac cctgcagctt aggattaagg agcatggtca 5701 caggaaggtg gggtttcagg gcatcccctc aggaactgcc catctcccca gaattccaaa 5761 atgaaggtcc atatgcttgt aggtgtgctg gtcatggtgg gctcacagta ggaaagggta 5821 agtggggccc aggggcaggg agggaggaag gggtaactga gtccaggaag ggggtggagc 5881 gtggccatgg aaatcgggct ccacggccca gggatgg // LOCUS HSCLN3 15997 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens CLN3 gene, complete CDS. ACCESSION X99832 NID g1834481 KEYWORDS CLN3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15997) AUTHORS Mitchison,H.M., Munroe,P.B., O'Rawe,A.M., Taschner,P.E.M., de Vos,N., Kremmidiotis,G., Lensink,I., Munk,A.C., D'Arigo,K.L., Anderson,J.W., Lerner,T.J., Moyzis,R.K., Callen,D.F., Breuning,M.H., Doggett,N.A., Gardiner,R.M. and Mole,S.E. TITLE Genomic structure and complete nucleotide sequence of the Batten disease gene, CLN3 JOURNAL Genomics 40 (2), 346-350 (1997) MEDLINE 97237055 REFERENCE 2 (bases 1 to 15997) AUTHORS Mitchison,H.M.M.M. 
TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) H.M.Mitchison, University College London Medical School, Department of Paediatrics, The Rayne Institute, 5 University Street, London WC1E 6JJ, UK FEATURES Location/Qualifiers source 1..15997 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cosmid NL11A" /chromosome="16" /map="p12.1" mRNA join(1102..1283,1437..1515,3600..3696,4325..4396, 5249..5328,5449..5534,6340..6412,6500..6643,8860..8972, 10297..10343,10424..10492,10587..10642,10771..10864, 15092..15232,15334..15689) /gene="CLN3" gene 1102..15689 /gene="CLN3" exon 1102..1283 /gene="CLN3" /number=1 CDS join(1238..1283,1437..1515,3600..3696,4325..4396, 5249..5328,5449..5534,6340..6412,6500..6643,8860..8972, 10297..10343,10424..10492,10587..10642,10771..10864, 15092..15232,15334..15453) /gene="CLN3" /codon_start=1 /product="CLN3 protein" /db_xref="PID:e283670" /db_xref="PID:g1834482" /translation="MGGCAGSRRRFSDSEGEETVPEPRLPLLDHQGAHWKNAVGFWLL GLCNNFSYVVMLSAAHDILSHKRTSGNQSHVDPGPTPIPHNSSSRFDCNSVSTAAVLL ADILPTLVIKLLAPLGLHLLPYSPRVLVSGICAAGSFVLVAFSHSVGTSLCGVVFASI SSGLGEVTFLSLTAFYPRAVISWWSSGTGGAGLLGALSYLGLTQAGLSPQQTLLSMLG IPALLLASYFLLLTSPEAQDPGGEEEAESAARQPLIRTEAPESKPGSSSSLSLRERWT VFKGLLWYIVPLVVVYFAEYFINQGLFELLFFWNTSLSHAQQYRWYQMLYQAGVFASR SSLRCCRIRFTWALALLQCLNLVFLLADVWFGFLPSIYLVFLIILYEGLLGGAAYVNT FHNIALETSDEHREFAMAATCISDTLGISLSGLLALPLHDFLCQLS" intron 1284..1436 /gene="CLN3" /number=1 exon 1437..1515 /gene="CLN3" /number=2 intron 1516..3599 /gene="CLN3" /number=2 exon 3600..3696 /gene="CLN3" /number=3 intron 3697..4324 /gene="CLN3" /number=3 exon 4325..4396 /gene="CLN3" /number=4 intron 4397..5248 /gene="CLN3" /number=4 exon 5249..5328 /gene="CLN3" /number=5 intron 5329..5448 /gene="CLN3" /number=5 exon 5449..5534 /gene="CLN3" /number=6 intron 5535..6339 /gene="CLN3" /number=6 exon 6340..6412 /gene="CLN3" /number=7 intron 6413..6499 /gene="CLN3" /number=7 exon 6500..6643 /gene="CLN3" /number=8 intron 6644..8859 /gene="CLN3" /number=8 exon 8860..8972 
/gene="CLN3" /number=9 intron 8973..10296 /gene="CLN3" /number=9 exon 10297..10343 /gene="CLN3" /number=10 intron 10344..10423 /gene="CLN3" /number=10 exon 10424..10492 /gene="CLN3" /number=11 intron 10493..10586 /gene="CLN3" /number=11 exon 10587..10642 /gene="CLN3" /number=12 intron 10643..10770 /gene="CLN3" /number=12 exon 10771..10864 /gene="CLN3" /number=13 intron 10865..15091 /gene="CLN3" /number=13 exon 15092..15232 /gene="CLN3" /number=14 intron 15233..15333 /gene="CLN3" /number=14 exon 15334..15689 /gene="CLN3" /number=15 BASE COUNT 3717 a 4011 c 4147 g 4095 t 27 others ORIGIN 1 ctgcagcagc ctggagccaa cccaatggct ccctgcctct ggtgctctat ttcttatcac 61 catacaggtc tctaaggtca cctttaggtc taacattgta tgaatgattg tcagcaactt 121 tccaattttt ttcatatata tatatttttt ttattttatt ttttactttt tctttttttg 181 agacaggatc tcactctgtc acccaggctg gagtgcagtg gcgtgatcat agctcactgt 241 agcttcaacc tcctgggcac aagtgatcct cccacctcag cttccaaggt acctgtgact 301 acaggaatgc gccaccatgc ccggctaatt ttgttatata tatatatata tatatttgta 361 ccttggtttt ctcatctgta acaccagggg ttgaagtcct gtgatatttt ttggttctag 421 gataatttgg aaatagggat ctcttaacat cttggatcct gccaatagat cattcccgaa 481 atccctttag atgagtgtta tgtttactta ggtctctaaa acaatttgta aggtcaacag 541 atggggataa gaatgacctg aagctggtcg tctttgttcc caaccagaaa cagtcctscc 601 cccaaaaaaa aaaaaaaaaa aaagggagta ggcaggtggc catatttgtt tctggcaagt 661 gagtgctgaa ggaaaggagc tgaagcctcc cacagtcata actggtgctg gcaggctact 721 gtctcggtct tgggcgccac tgatctaagg tcacggctct gcttgctgct cccacccgct 781 ccagtttaaa acctgcggtt ccagggttct ccagcccctc cctttttcac gctccgaagc 841 cgagaaggcc caaagcgaag acagagagga cccggaagta gggaaaacct ctgagcacgt 901 gatggggaac acgcggtgct gtcacgtgat ccgacaaacg cctctgcata gtgcagaaca 961 ttctgctgct cttaaaggta caggcctcag ggtccctgct gtagacgggg cgggggagag 1021 tacgatgggt ggggcgtggt gggtcgtagg gcgctcgaga tggagccccc agcttccttg 1081 atggatcgcg gggcgcgagt gccctagaca agccggagct gggaccggca atcgggcgtt 1141 gatccttgtc acctgtcgca gaccctcatc cctcccgtgg gagccccctt tggacactct 1201 
atgaccctgg accctcgggg gacctgaact tgatgcgatg ggaggctgtg caggctcgcg 1261 gcggcgcttt tcggattccg agggtgagta ttcccgccca ccctcatgga acgaccacct 1321 ctggctcgca gcccacctga ggggaatgag agctgactcg gccccgggag gacacgcaag 1381 gagcggtcgt gcacctgaga gtaggtgggg gtcatgccct cttctcgctc tcccagggga 1441 ggagaccgtc ccggagcccc ggctccctct gttggaccat cagggcgcgc attggaagaa 1501 cgcggtgggc ttctggtgag tggcctgaga cttcagcgag tgacaatggc atgagaagag 1561 ggaagtgacc ggaggaaagg gggaagaaag agttaagctg cgcaaaggag ctgaacccac 1621 aaatatttac tgaatccact ggtttgcccc aggggcagca tgtaaaattt ccaaacttca 1681 ctccttatca ccttccggat ttagcaagat cctcatacct accacctgta ggattattta 1741 cttttttctt ccagccaatt cagccttaac gttttttagt gcacatattt ttaaaaagaa 1801 aactttaaat cacgactcca cttggaaaac cagcgttttc ctagtgaacg tttaaataaa 1861 tgcatcacta ctaaaacttt tttatttttt taatttattt tatcttattt ttttttcttt 1921 ttgagacagg gtcttgctct gtttcccagg ttggaatgca gtagtgcaat cacagctccc 1981 tgcagcctgg aactcctggg ctcaagtgat tctcccacct cagcctccaa gtaggtgtgg 2041 ctacaggtgt gcaccaccgc atctgcctaa tttttatatt ttttatagag atgggggtct 2101 crctatggga cctgggctgg tctcaaactc gtggcctcaa gtgatcctcc tacctcaacc 2161 ttccaaagtg ctgagattac aagcatgagc caccatgccg ggcctgaacc tttttttttt 2221 tttttttttt ggagacggag tctctctctg tcctccaggc tggagtgcag tggcgcaatc 2281 tcggctcact gcaaactccg cctcccaggt tcacaccatt cttttgcctc agcctccgga 2341 gtagctgaga ctacaggcgc ctgccaccac gcccggctat ttttttgtat ttttagtaga 2401 gatggggttt caccgcgtta tccaggatgg tctcgatctc ctgacctcgt gatccgcccg 2461 cctcggcctc ccaaagtgct gggattacag gtgtgagcca ccgcgcccag cttttttttt 2521 ttttttttaa attttatttt atttttgaga cagagtctca ctctatcacc caggctggag 2581 agcagtggca caatctcggc tcactgaaac ctccacctcc aaggttcaag ytattctcct 2641 gtctcagctc cccgagtagc tgggattaca ggtgcgtgcc accagacaca gctaattttt 2701 tgtattttta gtaaagacag gatttcacca tgttggtcag gctggtctag aactcctgac 2761 cttaggtgat ccgcttgcct ccacttccca aagtgctggg attacaggcg tgagcaccgt 2821 gccccgccct gacatagggg ctttgggatc acagacttgg attcacttcc agcctccaag 2881 gcctctccca 
gaaactctct gtgaccactc tccttttttt ttttttttga gatagagtct 2941 cgctcttgta gcccaggctg gagtgcaatg gcacaatctc ggcttactgc aacctccgcc 3001 tttcgggttc aagtggttct cctgcctcag cctcctgagt agctaggatt acaggcatgt 3061 gcaacttcgc ccagctaatt ttgtattttt tagtagagac ggggtttctc catattgctc 3121 aggctggtct cgaactcctg acctcaggcg atccgcccgc ctcggcccct caaagtgctg 3181 gaattacagg tgtgagccac tgcactcggc ctcttccttt ttatttttta tttttgtgac 3241 agggtcttgc tctgtcaccc aggctagagt gcagtggcat gatcacagtt cactgcagcc 3301 tctgactcct ggactcaagc aatcctccca cctcagcctc ccaagtagct gggactacag 3361 tgcaagccac cacacctggc taatttttaa attttttgta gagatggggg tctcactatc 3421 ttgcccaagc accaaagcct gtatttttga ccccaacatt gaatgaggat gggatctggt 3481 gatagaggga aaaaaaagag ggggctgatt gaagggcaca ggtaagggaa ggtttggcac 3541 aaaggctaca gatggctgca ggatgaggag ggagttgaga cctgtcctgc tccttccagg 3601 ctgctgggcc tttgcaacaa cttctcttat gtggtgatgc tgagtgccgc ccacgacatc 3661 cttagccaca agaggacatc gggaaaccag agccatgtaa gtgactctct accaccacca 3721 ccatggttag tccctgtggg aagatgaggg ggtgggacaa ggtggggtaa agtttccagt 3781 ctctggctgg gcacagtggc tcacacctgt aatcccagca gtttgggagg ctgaggcggg 3841 cggatcactt gaagtcagga gttcgagacc agcctggcca acatggtgaa actccgactc 3901 tactaaaaat accaaaaact acctgggtgt ggtggcacac acctgtagtc tgagctattc 3961 aggaggctga ggcaggagaa ttgcttgaag ccaggaggtg gaagttgcaa tgagccaaga 4021 tcacaccact gcactctagc ttgggcaaca gagtgagaca ccatctcaaa aaaaagaaag 4081 ctcttgaatg tgttatctaa acaaagtcca tttacagggc catctgtggc ctgtggtcta 4141 gtagtgtgag actcatatgg aaaccccctc ttcattgtgc atgcagagac gctgaggtcc 4201 tgagaagtca gtcactgctg acccaaagcc atccttcaag tgaaggcaga gctgggacgg 4261 gagccaggct ctgtgtgtct atccctctgc ctccagggtg aaagacatgc ttttctccca 4321 ttaggtggac ccaggcccaa cgccgatccc ccacaacagc tcatcacgat ttgactgcaa 4381 ctctgtctct acggctgtgc gtactcatct cacctggtcc ttgcctgacc cagggcccct 4441 gggtctatag accccactcc cagcccttca ctacccagct gggactgtca tgattaaaac 4501 agttgagacc tgggctgggc gcactggctc acacttgtaa tcccagcact ttgggagacc 4561 aaggtgggag gatcacttga 
gcctaggagt ttgagaccag cctgggcagt atagtgagac 4621 ccccatctta aggaaaaaaa ttttaaaaaa aatatatata tataataatt gagaccttaa 4681 tataatttct gaggctgagg cggatggatc acttgagacc aggagttcaa gaccagcctg 4741 gccaacatgg tgaaacccca tctgtactaa aaataccaaa aattagccag gcatggtggt 4801 gcacgcctgt aatcccagct actcaggagg ctgaggcagg agaatcactt gaacccagga 4861 ggtggaggtt gtagtgagcc aagatcgagc cactacactc cagcctgggt ggcagaatga 4921 gactcactct caaaatatat atatatatag tttctgagtc ctttctgtct gcaccatata 4981 ttagctcatt taagccacac agcagtcctg tgagctaggc gctatgatat tcccattttc 5041 cagatgagga aactgaagct cagagagtat aaactcttga cacacaacta aggagtggga 5101 gagctgagac ttgaacccag gcgtgcctga ctccagagcc tgtgtttgta gcaggcctgt 5161 ttggccagct cctgcctctc cttggccacg tggttgggag ggttgtcccc tggaagctct 5221 gcggtctcac tctattctcc tgtcccaggc tgtgctcctg gcggacatcc tccccacact 5281 cgtcatcaaa ttgttggctc ctcttggcct tcacctgctg ccctacaggt ctgggtgagg 5341 gtagtgggag gcagggtggg caggagctga gaaaggggag gctgggatgg ctgagatgct 5401 gagagtagag accgaccttc cccctccctt cccttctcac cccctcagcc cccgggttct 5461 cgtcagtggg atttgtgctg ctggaagctt cgtcctggtt gccttttctc attctgtggg 5521 gaccagcctg tgtggtgagt gtgtggttct gtgtcagatg gggagccccg aggaaccaca 5581 tcagagcatt tgtgggaaga gtctccccag cctcccagag gaaagggatt cattctgtca 5641 cccttagaag cctgctaggg ctatcagcag taggcgatgg gagactggga caatttggag 5701 gggtaggcag tggaggagat gggagaaaat ggatgactta gatggagatt gaggtgaaca 5761 cagtcacgac tctgtgatgg accagccaca gtgactcatg cctataatcc cagcactttg 5821 gaaagccatg gcagscagat cacctgasgt caggagttcg aaaccagcct ggccaacatg 5881 gagaaacccc gtctctacca aaaatacaaa aatyagctgg gtgtggtggc aggagcctgt 5941 aatcccagct actcgggagg ctgaggcagg agaatctctt gaacctggga ggcagaggtt 6001 gcagtgagcc gagatcacgc cactgtactc cagcctgggt gacagggcga gactccgtct 6061 ccaaaaagaa aagaaaagaa aaagactgat gaaggggcag agacatcaag ggtgcatgtc 6121 tgcccctggt ctgataactg ggtggatgga ggtgccactc tccatgaagg gacacgcagg 6181 ggagtggggc tctgcttcag acctggaacc tggcctatgc atgggatcta ttggagcctc 6241 tatgagctga tactgaggag gccatggcca 
gacacattag aggcctgggc agtgtggcaa 6301 ggtgtggtgt gaccatccca gtgcttgtcc tccccccagg tgtggtcttc gctagcatct 6361 catcaggcct tggggaggtc accttcctct ccctcactgc cttctacccc aggtaagcag 6421 gtggagcagg gagtgtgggg agaggctgtc ccatggtcag cctaggtcct cctgaatgtt 6481 cctgtgttct ccttcccagg gccgtgatct cctggtggtc ctcagggact gggggagctg 6541 ggctgctggg ggccctgtcc tacctgggcc tcacccaggc cggcctctcc cctcagcaga 6601 ccctgctgtc catgctgggt atccctgccc tgctgctggc caggtgagct gccctgagcc 6661 gggagggaga ggggtccaag gagagaaaac ttggccatgg ctgggtgtgg tggctcacgc 6721 ctgtaatccc agcactttgg gaggccaagg agggcagatc gcctaaggtc aggaaaccag 6781 cctggccaac atggtgaaac cccgtctcta ctaaaaatac aacaattagc caggtgtggt 6841 ggcgggtgcc tgtagtccca actactcagg aggctaaggc aggagaatcg cttgaacccc 6901 ggaggcagtg gtggcagtga gccgagatcg tgccattgca ctccagcccg ggcgacagag 6961 ttagactctg tctcaggaag aaaaaaaaaa aaagaaagaa aagaaaactt gattatgatt 7021 gcaatcttca agtccctacc ttgctgtgaa gggaggcgga atctggactc tgatagcccc 7081 aggtgtgagt cctggagctg ccacttttta gctttgtagc gttgaacaag ttactccacc 7141 tctctgaacc ctcagtttcc ccatatctca aatggcagtt gttcttgctt tccttggagg 7201 tgatgagggt aatgcattca gcacagtgtg gttcccaagg tgattagaag ttggatgagg 7261 gtggacytta wttgattagt tccttttttt tttttttttt tttttaatca gtgttgacca 7321 ggttggcctc gaatgtgtaa ccttgcctgc ctgagtgcca gggcaacagg cctgaaccat 7381 ggcgactccc cttttttttt tttttttttt tttttttttg agacggagtc tcatggtgtc 7441 actcaggctg gagtgcagtg actggtgtga tctcagctca ctgcagcctc ggcctcccgg 7501 gctctagtga ttctcctgcc tcagcctccc gagtagctgg gactacaggc ccatgccacc 7561 atgcctggct aattttttat atttttagta gagacggggt ttcgctgtat tggccaggct 7621 ggtctcaaac tcctgacctc aagtgatcca cctgccttgg cctcccataa tgctaggaat 7681 acaggcgtga gccaccgcgc ccggcctgat tagttctgtg tattttcatg catattacaa 7741 aacactttgg ccgggcatgg tggctcacat ctgtaatccc agcactttgg ggaggccgag 7801 gcgggcgcac cacgaggtca ggagtttgag accagcctgg ccaatatggg gaaaccccgt 7861 ctctactaaa attcaaaatt attcccccag gtaaattggg agggtttggg gggaaggttt 7921 tgtttgaacc cccggggggc cgggggttgg cagtgaaccc 
ctggatttgt ccactgcaat 7981 tccaggctgg gccaacagga gcgagactcc gtttgaaaaa aaaaaaaaac acacactttg 8041 aactagccag acacacctgg cccttcacaa gattagtgct taggacaagt ctaagtagaa 8101 aaagtgagtc catttaaaga aaaatattaa gtaaaaatat atatatacag gaggaatatg 8161 gatatggcaa ataacatgaa gccagaacct aagtgactaa aatcgaggaa gttctggcct 8221 agcacagagt gaacgtataa ttatggggag ctagaccgcc attagatata tggtgccaga 8281 aaaacagact gactcgctag gtatgaccca gcagtccgaa caagggcctg ctgagcacag 8341 tggctcacag ctgtaatccc agtgctttag gaggctgagg ctggaggacc actagaggct 8401 agaaatttga gaccagcctg ggcaacatag cgagactcca tttatacaaa aaatggaaaa 8461 tatgaggcgg gcttggtggc acgtgcctgt agtccccgct acttgggagg ctgaggcggg 8521 aggattgctt gagcctggga ggtcgcagct gcagtgagct atgattgcac cactgcattc 8581 cagcctggat gctggagaaa gaccctgtct ctgaaaaaaa tcaaaacaaa atgaaggccc 8641 atgaatagga ataaggagat aaattcaggt ccaaagaaag aagcctttgt ccacaattgg 8701 agctctcccg cacagcgtag gagccttaga ggcagtgagc tacccatctt tgaaagtgtt 8761 caaaggtaga ggtgttccct gggaaaagtg gcctagatgg tccctgggga ccttcccagc 8821 cagtagggtt ttctgaccct gccttcatcc tactcctagc tatttcttgt tgctcacatc 8881 tcctgaggcc caggaccctg gaggggaaga agaagcagag agcgcagccc ggcagcccct 8941 cataagaacc gaggccccgg agtcgaagcc aggtaggaga cacagaccct cagagaggtc 9001 actttctttc tctctgggtt tggccttttc ctctctgcaa taggcaaagt taagagggga 9061 aagagagtga gtgtactggt tatgggaaaa gcctttgtct atttgagaaa tacttgttga 9121 ggacgagcac agtggctcac gcctgtaata tcccagcact ttgggaggct gaggcaggag 9181 gaccacttga gctcagaagt ctgagtccag cctgggcaac agagcaagac cttgtcgcaa 9241 aaaaaaaaaa aagaaaagaa aggaaggaag ggagggagag aagaagggat attaattgag 9301 tacttaccat gtgccaggca ttccaatcac aagtgctgag gccctgcggt ggggataagc 9361 tgggatgttc tagaaccaga gaatggccag tgagactggg cagggtgggc caatgcccat 9421 ctgtggagct gtacaagaga cattggccgg gcgtggtggt tcacacctgt aatcccagca 9481 ctttgggagg ctgaggcaga tggatcattt gagatcaaga gtttgagacc agtctggcca 9541 acatggtaaa aaccccgtct ctactaaaaa taaaaaaatt agccaggtgt ggtggtgcat 9601 gcctgtagtc ccagctactt gggaagctga ggcacgagaa tcacttgaac 
ccaggaggca 9661 gaggctgcag tgagctgaga ttggaccact gcactgcagc ctgggcaact gagtgagact 9721 ctgtctcaaa aaataaaaaa aattaaaaat caaagagacg tgaagaggtc tcacctggtt 9781 agcttttatt ttcatcatga taaagtatgt atattacttg aagtttacca ttttattatt 9841 ttttatctta ctttattttt ttgagcggga gtttcgctct ttgttgccca ggcttggagt 9901 gcaatggcgc aatctcagct cactacaacc tctgcctccc gggttcaagt gattctcctg 9961 cctcagccac ccgagtagct gggactacag acacctgcca ccacacatgg ctaatttttg 10021 tatttttagt agagatgggg tttctccatg ttggccaggg tggtcttgaa ctcctgacct 10081 caggtgatcc gcccacctca gtctcccaag gtgctgggat tacaagcatg agccactgcg 10141 cccagccatt ttaaccattt ttaagtgtcc aatccagtgg catggaagtg agttcccact 10201 gttgcgtggt ctggttaatt ctgtcatacc ggtgcctctt tgccgagtct tcagtgtgaa 10261 aacttcatct gcccttctct ctctgttccc ctgcaggctc cagctccagc ctctcccttc 10321 gggaaaggtg gacagtrttc aaggttcgga tgatggctgg ggtatgtccc tgtggggctt 10381 gctcacctcc aggccccccg cttatatctt ctgccttttc cagggtctgc tgtggtacat 10441 tgttcccttg gtcgtagttt actttgccga gtatttcatt aaccagggac ttgtaagtga 10501 ggggtgctag gaggggtgtg gaggtggcga ttggggctgg gacccacaca gccccgtcca 10561 tctcccctgt ctggtatttg ttgcagtttg aactcctctt tttctggaac acttccctga 10621 gtcacgctca gcaataccgc tggtaagagg agcgagggca gtgggctggg agggcgccgt 10681 ggtgatgcag ctgccctgcc cagtaggcac cggggggagc gggatggtcc ctggaggagc 10741 ctcctctcct ccccccaacc ctaacctcag gtaccagatg ctgtaccagg ctggcgtctt 10801 tgcctcccgc tcttctctcc gctgctgtcg catccgtttc acctgggccc tggccctgct 10861 gcaggtacca aacccctgcc cctcacttca ctcccacctt ggctcccagc ttggctccca 10921 gcttccccaa accccctgct tcccactatc agtgggaagt gtaaattttt tttttttttt 10981 tggagacaga gtctcgctct gtcgtccagg ctggagtgca gtagcgcaat cttggcttac 11041 tgcaacctct gcctcccggg ttcaagtgac tctcctgcct cagtctcctg agtagctggg 11101 attacaggcg cccgccacca tacctggcta atttttgtat ttttagtaga gatgggattt 11161 tgccatattg gctggtcttg aactcttgac ctcaggtgat ctgccggcct cagcctccca 11221 aagtgctggg attacaggga agtgtaaatt gtacactatc atttcttagc acagggaacc 11281 aaagtccagc agagctccat gtaaaatgta tagcctcagg 
agccaagcaa aactgagctt 11341 cagtactcag ccactgtgtg acttagggca ggtctcagaa cctctctgag cctcaatttc 11401 ctcatgtata aattgggtgt ggccaatacc tatttttcag ggtagttgta agaaatagag 11461 atgagagaaa gaatcgaaga acaagatttt gtagtgtgtg tgtgtgtgtg cgtgtgtgtg 11521 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg ttttaagcat tgggattggg 11581 ccggcccagg tggctcatgc ctgtaatccc agcactatgg gaggctgagg caggtggatc 11641 atttgagctt ggtagttcaa gtccagcctg ggcaacatgg cacaatcaca tctctacaaa 11701 aattagccag gtgcagtggc gcacacctgt agtcttagct actgggaagc tgaggtggga 11761 ggatcacttg aacccatgag gtcaaggctt cagtaaccaa ggtcatgcca ctgcactcca 11821 gcctaggsaa tagagcgaga ccctgtctcc cagaaaaaga aaaaagaaag gaaaagtcgg 11881 aaccttcctc tgtcgcctag gctggagtac agtggcatga tcataggtca tttactcagg 11941 caaagagacn ncttcaaata atgtttctaa ctattaagnn aatgtacagt tcttttattc 12001 tagatagtta attataatag aattggtkrm tttgaagtca gtgggttatt tatcttgagg 12061 gtggtgagca aattcaactg cccttggcaa acagtagttt acagctttaa gggsgcaaac 12121 acccttcctt acaacagtct aataaaagga gtttcatata accagcattc attgtttcaa 12181 ggttacaaat ccacagcccc tgcaggtaac tgttacaata aactgtcaac tgtgacagga 12241 aagtcttctg cctttgaaat ttaggaactt ggccagtcct ggtggctcac acctgtaatc 12301 ccagcacttt gggaggccaa ggcaggtgaa tcacctgagg tcaggatcag gagtttgaga 12361 ccagtctggc caacacagtg aaaccccgtc tctattaaaa atgcaaaaat tagttgggcg 12421 tggtggcacc tgcctgtcat ctcagctact cgctaggctg aggcaggaga attgcttgaa 12481 ctccggatat ggaggttgca gtgagccgag attgcagtac tgtactccag cctgggccac 12541 agagcgagac tccatctcca aaaaaaaaga cttaggaact caattytgtg atagccaaat 12601 acagaaatag agattatact ttaggstagg tgttgtggct cacacctgta atcccaatgc 12661 ttttggaggs tagggtggga ggatcaattg aagccaggag cttgagacca gactggacaa 12721 matagtgaga ccccccacct ttaccaaaaa tttaaaaaaa aaaataagcc aggtgggaga 12781 attgcttgag cccagaagtt tgagagagca gcctgggtaa catcacgaga cctcatctct 12841 acaaaaaaca acaacagggt gtggtgacat tcacctgtaa tcccagctac tcgggaggct 12901 gaggcaggag gatcgcttta gcccaggagt tccaggctgc agtgagccat aatcattcca 12961 ctgtacccca gcctgggtga 
gaccctatct ccaaaaaaaa gattacactt tcatattatg 13021 gaaaaagcat gtaaatatcc tccagcctcc ttccttattt atctgagcag tgctgcaagc 13081 ccagggtttg tacccacaaa accccaaatt tataactcag ggtgcttgta gataccactg 13141 aaacatgtag cttatggttt taagctacat gtttttagct ataatgtttt ttgttttgtt 13201 ttgttttttg agatggagtc tcgctctgtt gcccaggctg gaatgtagcg gtgcaatctt 13261 ggctcactgc aacccctgcc tcccagattc aagcgattct caagcctcag ccttccgagt 13321 agctgggatt acaggtgcgc caccacgccc agcaaatttt ttgttttgtt ttgttttgtt 13381 ttgtttttta gagacagagt tttgctcttg tagctcaggc tagagtacaa tggcgtgatc 13441 ttggctcact gcaacctccg cctcccaggt tcaggcgatt cttctgcctc agcctcctgc 13501 gtagctggga ttacaggcac gtgccaccac acctggctaa tttttgtatt ttttagtaga 13561 gatggggttt cgccatgttg gccaggctgg tcttgaactc ctgacctcag gtgatccacc 13621 cgccttggcc ttccaaagtg ctgggattac aggtgtgagc caccacacct ggctagaggt 13681 gtgtcctttt taatgctcat ctttaagtct acctcagatt taatgaatta ggacctctag 13741 ggtaagcccc gcttggggca tttgtaccta gatccccagg cgattcttac atatacgaaa 13801 agtatgaacc gccacgagga ggcctacaca ctgcacaaga ggcaatgtgg cgtggtcagt 13861 gctgggagga tggcacctcc ttgagagact ggggcacaga ggtgggacct acaccctggg 13921 gtggggatcg aagaggcttc ctggccaggt gcggtggctc ctgcctataa tcccagcact 13981 ttgggaggct gaggcagggg ggcagcgact gcttgagacc aggagttcga gcccagctga 14041 ataggatgcc cggctactcc caccttgttt taccatagtg aaccaacggt ttttccaata 14101 ggggcaattt tgctgcccag cggatgtttg gcaacgtctg caggcatgct ttggttgtca 14161 caactgaggg atgctatggg catctagcgg gcagaggcca gggatcctkc taaatgcgct 14221 gccatataca ggacagccct caccatkcag aacaaccagc accaaacgcc aagagtgctk 14281 caattgagaa atcctggttt cgatggtccc acttccctta aggaaccccc aatttttttt 14341 tttttttttt tatgagacag gatcttgctc tgtcacccag gctggagtgc agtggcacaa 14401 tcatagctta ctgcagcctc cacctcctag gktcaagcaa tcctcctgtc tcagactccc 14461 gagtagctgg gactgcaggt gcgtgccacc atgctcaact aatttttaaa ttttttgtag 14521 agacggagtt ttgttatgtt gcccaggctg gtctcgaact cctggcctca agtgatcatc 14581 ccaccttggt ctcccaatgt gctggtatta cctgcatgag ccactgcacc aggctgaggt 14641 
tgagaatttt tttttttttt ttttgggatg gagtctctct ctgttgccca ggctggagtg 14701 cagtgctgcg atcttggctc actgcagcct ctgcctccca ggttcaagtg attctcctgc 14761 ctcagccttc caagtagctg ggactacaga accactacac ccagataatt tttgtatttt 14821 tagttgagat ggggttttac catgtttagt agagaccagg ccaggctggt ctcgaaatcc 14881 tgacctcagg tgatctgccc acctcagcct cccaaagtgc tgggattaca ggtgtgagcc 14941 accgcaccgg gctggtgttg agattttatc tagacctggc agccctctcc ttcaccagta 15001 actcctaaaa ccagggaccc ctggagggga ggccacctct ccttccctgc cccgccctgg 15061 tcccaggctt agcctctccc cctgttcaca gtgcctcaac ctggtgttcc tgctggcaga 15121 cgtgtggttc ggctttctgc caagcatcta cctcgtcttc ctgatcattc tgtatgaggg 15181 gctcctggga ggcgcagcct acgtgaacac cttccacaac atcgccctgg aggtcagcat 15241 tggccgggca agggctgggg gtggcctgtc cagggacacc cagggcaggg atgtctggga 15301 ctgaagcctc acccctgctc tctgccctcc cagaccagtg atgagcaccg ggagtttgca 15361 atggcggcca cctgcatctc tgacacactg gggatctccc tgtcggggct cctggctttg 15421 cctctgcatg acttcctctg ccagctctcc tgatactcgg gatcctcagg acgcaggtca 15481 cattcacctg tgggcagagg gacaggtcag acacccaggc ccaccccaga gaccctccat 15541 gaactgtgct cccagccttc ccggcaggtc tgggagtagg gaagggctga agccttgttt 15601 cccttgcagg ggggccagcc attgtctccc acttggggag tttcttcctg gcatcatgcc 15661 ttctgaataa atgccgattt tatccatgga cttcttatat cgtttttgtc tctaaaaaga 15721 aacttttatt atgaaagtaa tacatgccca ctttcctata tatagacagc ataaagaagg 15781 tatgtcagca gatttctccc ttttttgatt gtttgtttgt ttgtttgttt gttttgacag 15841 agtctcactc tgtcacctag gctggagtgc aatggcgtga tcttggctca ctgcmacctc 15901 cacttcccgg ttcaagcaat actcctgcct cagcctcccg agtagctggg attacaggtg 15961 gtcactacca tgtctggcta atttttatat ttttagt // LOCUS HSCOMT2 3651 bp DNA PRI 18-APR-1997 DEFINITION H.sapiens gene for catechol O-methyltransferase. ACCESSION Z26491 NID g403303 KEYWORDS catechol O-methyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1  (bases 975 to 3651)
  AUTHORS   Tenhunen,J., Salminen,M., Lundstrom,K., Kiviluoto,T.,
            Savolainen,R. and Ulmanen,I.
  TITLE     Human catechol-O-methyltransferase: Genomic organization of the
            gene and its expression from two distinct promoters
  JOURNAL   J. Biol. Chem. (1993) In press
REFERENCE   2  (bases 1 to 3651)
  AUTHORS   Tenhunen,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-SEP-1993) Tenhunen J., Orion-Farmos
            Pharmaceuticals, Valimotie 7, Helsinki, Finland
REFERENCE   3  (bases 1 to 3651)
  AUTHORS   Tenhunen,J., Salminen,M., Lundstrom,K., Kiviluoto,T.,
            Savolainen,R. and Ulmanen,I.
  TITLE     Genomic organization of the human catechol O-methyltransferase
            gene and its expression from two distinct promoters
  JOURNAL   Eur. J. Biochem. 223 (3), 1049-1059 (1994)
  MEDLINE   94333369
REFERENCE   4  (bases 1 to 3651)
  AUTHORS   Gastier,J.M., Brody,T., Pulido,J.C., Businga,T., Sunden,S.,
            Hu,X., Maitra,S., Buetow,K.H., Murray,J.C., Sheffield,V.C.,
            Boguski,M., Duyk,G.M. and Hudson,T.J.
  TITLE     Development of a screening set for new (CAG/CTG)n dynamic
            mutations
  JOURNAL   Genomics 32 (1), 75-85 (1996)
  MEDLINE   96230328
FEATURES             Location/Qualifiers
     source          1..3651
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     intron          1..984
     exon            985..1074
     intron          1075..1340
     promoter        1120..1490
     exon            1341..1629
     CDS             join(1341..1629,1765..1958,2246..2377,2839..3039)
                     /codon_start=1
                     /product="catechol O-methyltransferase"
                     /db_xref="PID:g403304"
                     /db_xref="SWISS-PROT:P21964"
                     /translation="MPEAPPLLLAAVLLGLVLLVVLLLLLRHWGWGLCLIGWNEFILQ
                     PIHNLLMGDTKEQRILNHVLQHAEPGNAQSVLEAIDTYCEQKEWAMNVGDKKGKIVDA
                     VIQEHQPSVLLELGAYCGYSAVRMARLLSPGARLITIEINPDCAAITQRMVDFAGVKD
                     KVTLVVGASQDIIPQLKKKYDVDTLDMVFLDHWKDRYLPDTLLLEECGLLRKGTVLLA
                     DNVICPGAPDFLAHVRGSSCFECTHYQSFLEYREVVDGLEKAIYKGPGSEAGP"
     misc_signal     1341..1343
     misc_signal     1491..1493
     intron          1630..1764
     exon            1765..1958
     intron          1959..2245
     exon            2246..2377
     intron          2378..2838
     exon            2839..3304
     misc_signal     3037..3039
     polyA_signal    3305..3310
BASE COUNT      758 a   1063 c   1059 g    771 t
ORIGIN
        1 agtattgctg ttcagatagc ctttatttgg
gtatatattc tacactgttt ttaaatatgg 61 agagtaacca aaatggccca ttatctgacc acacaaatac tagtagtcat tatagataaa 121 ccatagcaga taaataatag taaacaaagc aacaggctgt gtcattggaa atccccacca 181 tgaagaaagg agcaaggtga aaacttctgg ctgcttcagg tcatgcatgg tccctctcca 241 ccatcgttcc ccctgtcatc ttcctgccag aataaggacc ctggtacctt agggaagcac 301 catctcttgt tttttcccca cgagccctgt gggtcatggc acgtcctgcc ccgctgggaa 361 aacacagtgg gccacgggtt tccctgcagg cctggaccct tcccaagggt agcagcagaa 421 ggcagcacga ttcccactcc tgcagctgtg acagggcacc cccactgtca ctgagccctg 481 caccgggttc catcacctgc tcggggctct gcctttggcc ttttcctgtg aactgcatgt 541 tggccactgt acctatctgt ctctcatctt tttttcttac gggtttgggt atgttcttgg 601 taaaccagcc cttggtctta cacatcattt ccaaggtact aaggactctt caggggaaat 661 acaacttgag cagagtggtt ccctcctctt gtggttcaca aggtgcaggt gcacacacac 721 ataccacagg gcagtgtgac aggaccagag actgcccctg gggtccctgg ctgggggaca 781 ctagtaggga tgtcccttgc ctctctgagg ccttctgctg tctcttctga ggccggaaag 841 gcgaagcact gccctcgccc tgctagggaa ggctcaggcc aggctggccc tatccgggga 901 aggggctcag gtatctggac cttggtcatc gccaggttag ggtttatgtt gatgattatc 961 caaaggcaaa attgatttcc acagaaataa catctgcttt gctgccgagc cagaggagac 1021 cccagacccc tcccgcagcc agagggctgg agcctgctca gaggtgcttt gaaggtgagt 1081 tggccaacgg aagccggggc agtgccaggg tgggacagaa gaggcacaca cctgctctgt 1141 ctacccgagg gcaccagagg gcacgagaag gctggctccc tggcgctgac acgtcaggca 1201 actgaggcac aaggctggca tttctgaacc ttgcccctct gcgaacacaa gggggcgatg 1261 gtggcactcc aagcaaaggg gcgtgtgggt gctgcaggag gagcacagag cactggcgcc 1321 cctcccctcc cgccctgcag atgccggagg ccccgcctct gctgttggca gctgtgttgc 1381 tgggcctggt gctgctggtg gtgctgctgc tgcttctgag gcactggggc tggggcctgt 1441 gccttatcgg ctggaacgag ttcatcctgc agcccatcca caacctgctc atgggtgaca 1501 ccaaggagca gcgcatcctg aaccacgtgc tgcagcatgc ggagcccggg aacgcacaga 1561 gcgtgctgga ggccattgac acctactgcg agcagaagga gtgggccatg aacgtgggcg 1621 acaagaaagg ttggggttcc gggccagcag gtgctcagct ctgggacagg gacccaggac 1681 caggcatcaa atcccgtgcc tggggatcca agttcccctc tctccacctg 
tgctcacctc 1741 tcctccgtcc ccaaccctgc acaggcaaga tcgtggacgc cgtgattcag gagcaccagc 1801 cctccgtgct gctggagctg ggggcctact gtggctactc agctgtgcgc atggcccgcc 1861 tgctgtcacc aggggcgagg ctcatcacca tcgagatcaa ccccgactgt gccgccatca 1921 cccagcggat ggtggatttc gctggcgtga aggacaaggt gtgcatgcct gacccgttgt 1981 cagacctgga aaaagggccg gctgtgggca gggcgggcat gcgcactttg atcctcccca 2041 ccaggtgttc acaccacgtt cactgaaaac ccactatcac cagggtcatc ccagaaccct 2101 aaagaaaact gatgaatgct tgtatgggtg tgtaaagatg gcctcctgtc tgtgtgggcg 2161 tgggcactga caggcgctgt tgtataggtg tgtagggatg gcctcctgtc tgtgaggacg 2221 tgggcactga caggcgctgt tccaggtcac ccttgtggtt ggagcgtccc aggacatcat 2281 cccccagctg aagaagaagt atgatgtgga cacactggac atggtcttcc tcgaccactg 2341 gaaggaccgg tacctgccgg acacgcttct cttggaggtg agccccaacc aggatggcat 2401 ccgtgccagc tgctgcccag agcccattca gtcagcctca gcctctccaa agagccaggc 2461 attccagtag agccctgtgt ggacacagct cgctctggag gcaccacctg aggtctggga 2521 gtgtggggga ctgaggaggc cctgtggtgg gtggagatgg gtggggagct gggccagggg 2581 ctggctgggt ggcctgttgg gaactgggga gccaagcggt ccctgtcctc acggggccca 2641 tgttctgaag gtggcaccca agtcttgtac agtcctttcc tgcaggagtc acgctgggca 2701 ggaagtggaa acctggcccc aggggctagg cacaggcagt ggtgccgtgg cctagtgagg 2761 agcacccatc ctggtttggg gcaggttctc tgggcacctc tgacctctca cctcccccac 2821 cccccggtct gtttgcagga atgtggcctg ctgcggaagg ggacagtgct actggctgac 2881 aacgtgatct gcccaggtgc gccagacttc ctagcacacg tgcgcgggag cagctgcttt 2941 gagtgcacac actaccaatc gttcctggaa tacagggagg tggtggacgg cctggagaag 3001 gccatctaca agggcccagg cagcgaagca gggccctgac tgcccccccg gcccccctct 3061 cgggctctct cacccagcct ggtactgaag gtgccagacg tgctcctgct gaccttctgc 3121 ggctccgggc tgtgtcctaa atgcaaagca cacctcggcc gaggcctgcg ccctgacatg 3181 ctaacctctc tgaactgcaa cactggattg ttctttttta agactcaatc atgacttctt 3241 tactaacact ggctagctat attatcttat atactaatat catgttttaa aaatataaaa 3301 tagaaattaa gaatctaaat atttagatat aactcgactt agtacatcct tctcaactgc 3361 cattcccctg ctgcccttga cttgggcacc aaacattcaa agctcccctt gacggacgct 
3421 aacgctaagg gcggggccct agctggctgg gttctgggtg gcacgcctgg cccactggcc 3481 tcccagccac agtggtgcag aggtcagccc tcctgcagct aggccagggg cacctgttag 3541 ccccatgggg acgactgccg gcctgggaaa cgaagaggag tcagccaagc attcacacct 3601 ttctgaccaa gcaggcgctg gggacaggtg gacccgcagc agcaccagcc c // LOCUS HSCOSE 3522 bp DNA PRI 15-JAN-1992 DEFINITION H.sapiens mutant coseg gene for vasopressin-neurophysin precursor. ACCESSION X62891 NID g30135 KEYWORDS neurophysin; vasopressin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3522) AUTHORS Schmale,H. TITLE Direct Submission JOURNAL Submitted (28-OCT-1991) H. Schmale, Universitaet Hamburg, Universitaetskrankenhaus Eppendorf, Inst. fuer Zellbiol. u. klin. Neurobiol., Martinistr. 52, 2000 Hamburg 20, FRG REFERENCE 2 (bases 1 to 3522) AUTHORS Bahnsen,U., Oosting,P., Swaab,D.F., Nahke,P., Richter,D. and Schmale,H. TITLE A missense mutation in the vasopressin-neurophysin precursor gene cosegregates with human autosomal dominant neurohypophyseal diabetes insipidus JOURNAL EMBO J. 11 (1), 19-23 (1992) MEDLINE 92155158 COMMENT See also x62890. 
FEATURES             Location/Qualifiers
     source          1..3522
                     /organism="Homo sapiens"
                     /isolate="Diabetes insipidus patient IV-3"
                     /db_xref="taxon:9606"
                     /tissue_type="blood"
                     /cell_type="leukocyte"
                     /clone_lib="subgenomic plasmid library"
                     /clone="IV-3-3"
                     /chromosome="20"
     TATA_signal     1059..1065
     gene            1139..3244
                     /gene="coseg"
     CDS             join(1139..1258,2635..2836,3012..3184)
                     /gene="coseg"
                     /codon_start=1
                     /product="vasopressin-neurophysin precursor"
                     /db_xref="PID:g30136"
                     /db_xref="SWISS-PROT:P01185"
                     /translation="MPDTMLPACFLGLLAFSSACYFQNCPRGGKRAMSDLELRQCLPC
                     GPGVKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKACGSGGRCAAFG
                     VCCNDESCVTEPECREGFHRRARASDRSNATQLDGPAGALLLRLVQLAGAPEPFEPAQ
                     PDAY"
     exon            1139..1258
                     /gene="coseg"
                     /number=1
     mRNA            join(1139..1258,2635..2836,3012..3244)
                     /gene="coseg"
     intron          1259..2634
                     /gene="coseg"
                     /number=1
     exon            2635..2836
                     /gene="coseg"
                     /number=2
     allele          2657
                     /gene="coseg"
                     /note="nucleotide different from normal allele in ADNDI
                     patient"
     intron          2837..3011
                     /gene="coseg"
                     /number=2
     misc_feature    2872
                     /gene="coseg"
                     /note="insertion in ADNDI patient compared to normal
                     allele"
     exon            3012..3244
                     /gene="coseg"
                     /number=3
BASE COUNT      592 a   1158 c   1167 g    605 t
ORIGIN
        1 ggtaccgggc ctgaacccca gaccagcggg cattctggtg gcccagggag agaggccatg
       61 tcctctttag acctgccacc ttgggattag ggaccaaaag tggctttctc aggctgggtc
      121 catttaggta tggctcctcc tgcaccttcc cccaggccca gacacccccc acccccggct
      181 ctcccacccc aggcctgaga tgtacctgcc ctggttgcca catactaagg tcctggtggg
      241 ggtaggaatt gggacaaact gttgtcaggt ttcctggggc tccccccgcc tttgaacctt
      301 acttggcatg gagttctttc tccccatagt agctctccac actcatgttt ctacctggga
      361 gggggtgacg ggtcatgtgc caatgtgtcc ccaaggcccc cagtttcagg gcctgagtcc
      421 ccatgccctg ctctatgggg gtgcgggggt gtcatgtctt gcactttccc cagcaacctg
      481 ccacccccct accctgtgag aacaagcccc tctcattctg tgtcccttga ctccacgacg
      541 gcggctgcct tgtccacgag gcagggattc tcatctctgg tgcagcccct gtcctcacca
      601 ctctgggctt gtgctgttct tggtccctta tcttgttgtc acagggctaa ccctccccac
      661 acacctggct gtcccctgcc ccacagctcc taggccaggg cctgtccctg cccctaacac
      721 gcagcccctc
ggttcctctt accctcttct tggaccctac gccatttctg ggtaatcaga 781 aagctcagag atggtctcca ggtgaccccc cagtctgtcc ctctccctga atccagagcg 841 ctgcagtcac agtagaggca attgctgtca ttccgcggcc catgacagcc tggcggcccg 901 tacccctccc ccatgatccc ctgcacagac aggcccacgt gtgtccccag atgcctgaat 961 cactgctgac ggctggggac ctggcggccg tgggctcctg gggagccact ggggaggggg 1021 tggcggccgc gtctcgcctc cacgggaaca cctgcggaca taaataggca gccagcagag 1081 gcagcagcac agagccacca agcagtgctg catacggggt ccacctgtgt gcaccaggat 1141 gcctgacacc atgctgcccg cctgcttcct cggcctactg gccttctcct ccgcgtgcta 1201 cttccagaac tgcccgaggg gcggcaagag ggccatgtcc gacctggagc tgagacaggt 1261 acttcccact gtgggccatc tcagggctgc catagcgggc agtgctgaca ccctgggtca 1321 ggggctagga aagagggaag tcatgggtgg tggtagcctt taggggaagt tcgggggagg 1381 aagagggagg catggcatgg ctgggcagag gagccaatgg ggtgggccag aggggaccag 1441 gctttggagg aggctgggag aggctgaagg cgctcctggt cactgtcgcc atccagacag 1501 ggatgcagga aaatgaggga tgcttccccg gtgactgggc ttggggctgg atagggagaa 1561 cggggcatca tggcctcccc tgtgcccatg gcgttcttgc atctggactg gctggggcag 1621 cagaggctcc atcctaccta gcattggagg ctttcctcat ccagccccag cctcccagcc 1681 acaggcgccc aggcccccac acagaagatg gccactggtc tgagcgcgct tgagtggggc 1741 atcctgtggg gaagttctgc tgggaacctg gcctaattct atagtgctgg acgtttcctc 1801 catttccagc agagctgaag gaaatccaat cacgatgtgc atgcaattct gtccaggctc 1861 aatgatgagc ccttgagcaa attagaccac accaggctca gctaaaagtc taatgcgcta 1921 tccattgcgc cagagaaccg gctgttgagc agatgagagt ggccgctcgg caacccccgc 1981 agcctctctt cctcctgcta ggctccttta gggtcctgag gcacctgggt gtccgtgctc 2041 gcctctaggt ctcaggcccc tgccacccac ctgataggtc ataggtggct gagcaggggt 2101 cagggctcca gctgaggccg acaagcttgg cgggggccag gggcaaggca agagaggaga 2161 caggaaatgg gaagggccgg ggttctggat gggtagggcc tctccgcatg gtgtagtggg 2221 gaagggggtg ggcccgggct caagccgcag cagggcgagg aggaaggagg aagggtctgg 2281 agtggtggag ggtggggcag ctgcaacagt ggcgcccacc agcgatgacc ccgaggctcg 2341 aggaagggct ccccacgctg tagtccacgg gagacccgtc cctagctgag ggtgaggacg 2401 ctgagggctg tcaccgagag 
gtcatccaag aaaccaaggt gccgagcaga tctggacgcc 2461 ccgcccgtga ccgcggtcga ggcccagtgg cgcccgagcg tgcctgcagc cgcagccccg 2521 gtgtcccgcc cgcactccga gccctggacc ccagcatccc cgcctcgctg cgttcccctc 2581 caacccctcg actcccggct cccctcctcc cgctcacccc gcccgtcccc gcagtgcctc 2641 ccctgcggcc ccggggtcaa aggccgctgc ttcgggccca gcatctgctg cgcggacgag 2701 ctgggctgct tcgtgggcac ggctgaggcg ctgcgctgcc aggaggagaa ctacctgccg 2761 tcgccctgcc agtccggcca gaaggcgtgc gggagcgggg gccgctgcgc cgccttcggc 2821 gtttgctgca acgacggtgc gcggcggggg cgggcctggg gctggggggg ggcgcagacc 2881 gcttgggtgg gggggacgcg ggcctgcggc ggggtggggg ctgcgtcggg cccggcaggg 2941 agggtgtggg ccccccgcac cccgagctgc gcccgcccca gggcgcccgt gctcacacgt 3001 cctcccggca gagagctgcg tgaccgagcc cgagtgccgc gagggctttc accgccgcgc 3061 ccgcgccagc gaccggagca acgccacgca gctggacggg ccggccgggg ccttgctgct 3121 gcggctggtg cagctggccg gggcgcccga gcccttcgag cccgcccagc ccgacgccta 3181 ctgagccccg cgctcgcccc accggcgcgc tcttcgcgcc cgcccctgca gcacggacaa 3241 taaacctccg ccaatgcacg gcctcgcgtc tgtctcagtc tctggcggga agagggaagg 3301 ggagagaggt gggagcgcgg acccccgcca ccacgcccac cggccagtcc ccggacctga 3361 ggtcgtgggc agatccaccc cagagaagca acaggtcccg tagaggaagc gatctgggac 3421 ccgcagaggt gtcgctagac cgagggacag ggcgaattgg gaggcagggg agggggagac 3481 cagaggccga gagtggcctt ggagggggtg ggttgaggat cc // LOCUS HSCPH70 6711 bp DNA PRI 26-APR-1993 DEFINITION Human cyclophilin gene for cyclophilin (EC 5.2.1.8). ACCESSION X52851 NID g30167 KEYWORDS Alu repeat; cyclophilin; cyclosporin A-binding protein; peptidylprolyl isomerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6711) AUTHORS Hofer,E. TITLE Direct Submission JOURNAL Submitted (23-APR-1990) Hofer E., Sandoz Ltd, Preclinical Research, 386/625, 4002 Basle, Switzerland REFERENCE 2 (bases 1 to 6711) AUTHORS Haendler,B. and Hofer,E. 
  TITLE     Characterization of the human cyclophilin gene and of related
            processed pseudogenes
  JOURNAL   Eur. J. Biochem. 190 (3), 477-482 (1990)
  MEDLINE   90322991
COMMENT     See also for cDNA sequence and for related processed pseudogenic
            sequences.
FEATURES             Location/Qualifiers
     source          1..6711
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="CPH-70"
                     /cell_type="leucocyte"
                     /clone_lib="Lambda EMBL3"
     misc_feature    71..360
                     /note="Alu repeat"
     misc_feature    361..562
                     /note="Alu repeat"
     protein_bind    781..790
                     /note="SP1 binding domain"
                     /bound_moiety="Sp1"
     protein_bind    1371..1380
                     /note="SP1 binding domain"
                     /bound_moiety="Sp1"
     TATA_signal     1585..1591
     exon            1616..1728
                     /number=1
     mRNA            join(1616..1728,4173..4203,4318..4406,4628..4800,
                     6215..6561)
     CDS             join(1660..1728,4173..4203,4318..4406,4628..4800,
                     6215..6350)
                     /EC_number="5.2.1.8"
                     /codon_start=1
                     /product="peptidylprolyl isomerase"
                     /db_xref="PID:g30168"
                     /db_xref="SWISS-PROT:P05092"
                     /translation="MVNPTVFFDIAVDGEPLGRVSFELFADKVPKTAENFRALSTGEK
                     GFGYKGSCFHRIIPGFMCQGGDFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMAN
                     AGPNTNGSQFFICTAKTEWLDGKHVVFGKVKEGMNIVEAMERFGSRNGKTSKKITIAD
                     CGQLE"
     intron          1729..4172
                     /number=1
     misc_feature    2054..2063
                     /note="general enhancer core"
     protein_bind    2522..2537
                     /note="NP1 binding domain"
                     /bound_moiety="NP1"
     misc_feature    3207..3445
                     /note="Alu repeat"
     protein_bind    4094..4100
                     /note="AP1 binding domain"
                     /bound_moiety="AP1"
     exon            4173..4203
                     /number=2
     intron          4204..4317
                     /number=2
     exon            4318..4406
                     /number=3
     intron          4407..4627
                     /number=3
     exon            4628..4800
                     /number=4
     intron          4801..6214
                     /number=4
     misc_feature    5026..5314
                     /note="Alu repeat"
     misc_feature    5340..5626
                     /note="Alu repeat"
     misc_feature    5755..6043
                     /note="Alu repeat"
     exon            6215..6561
                     /number=5
     polyA_signal    6538..6543
BASE COUNT     1552 a   1625 c   1717 g   1817 t
ORIGIN
        1 gaattccctt gtaaggtttt cttaacaaaa caccagtcac ataagtgcat tttattttat
       61 atttttgttt atttatttga gacggagtct cttgtctctc aggctggagt gcagtggcgc
      121 catctctgct cgctgcaacc tccacctcct gggttccagc gattctcctg cctcagcctc
      181 ccgagggggt agctgggact
acaggtgcgc accaccatgc ccagctaatt ttgtattttt 241 cgtagagatg gggtttcacc atgttgtcca ggctggtctt gaactcctga cctcaggtga 301 tcctcccgcc tcggcctccc aaagtgctgg aattacaggc gtgatccacc gcacccggcc 361 tattttttga gagagggtca cactctgtcg tcccggctgg aatgcagtga tgcgatcacc 421 gcccactaca gcctcgacct ccgggctcaa gcaatcctcc ccgcccagcc tcctgagtag 481 cgagcgcctc gacgcccagc taatttttat ttttatttat ttttttgtag agacggcgtc 541 tctctaagat gcccaggctg gtggccggtg tcgaactcct aagatgaagc gatcctcccc 601 ggccttggcc tccgcgcctc ctaaagcgcc aggtatgagc caccgcgcct ggcctacaag 661 tgcattttaa ttaaagtatt attaatgtct ttgcctgaag aaattcgctt ttaaattgtg 721 acttatcttt cacccaaaaa tcaaagcaca attcagcccc gaggcggggg cggtaggagc 781 tgggcggggc gggggcaggg aaagaccagg agcagagatt caaaaagagt aagagggcaa 841 aatgtgcata atgcatcttc acaggtaaga gcctggccag gctcctgttt taatggcttc 901 ctcctgaaga agattcaagc agagtgtaag atattttcgg aaagtagagc attttgaaag 961 catttcataa tctctcaaaa ccggagactg ctcctgtccc acctcgttag agaaaacagc 1021 gatgctcaaa ggcaacctcc ttcctgacat tgcctggtag gacgcgacgt ggtgtttgcc 1081 cgcgcggaat gcggacgcaa ggctgctcct aggtctcggg gacgcgccat ccccatttcc 1141 gctcgcggag gcgtagggtc cgggcgcggg accccagtcg accttgactg gcggcgcgac 1201 cttgaggcct gcgttcgcct cagttgcccc ctctgtgcaa tggggagacg cgcctcatcg 1261 cttgacaacg gccgaagagc cgccgcgctt ccgtctcccg cgtgcgcgcg ccatgctgcc 1321 cacccccgtt ccgcactgac cctcccccgt gccccgcgtc ccgtactgcc gccccgcccc 1381 gagtcccatg ccgcagccac cgcgacggag cccgcaggcg ggaacctgcc tccgcgcgtt 1441 agcgcgcacg cgcgcctcat gtgtcgtccc catcagcgcc ggcttccgtc tataggccag 1501 atgcactgtc actctggcga agtcgcagac ccgattggcc gggacggagg cgcgagaccg 1561 ggttgcgggc ggggccgaac gtggtataaa acgggcggga ggccaggctc gtgccgtttt 1621 gcagacgcca ccgccgagga aaaccgtgta ctattagcca tggtcaaccc caccgtgttc 1681 ttcgacattg ccgtcgacgg cgagcccttg ggccgcgtct cctttgaggt cgggcgggcg 1741 gcggcgtgcg ggaatggggc ccagaaagtg ggccggggtc ggggtgggtg gtagcgcccc 1801 aaaggcccgg gcgcggggcg accctgcttg aggggcgagc gcgggcgggc tgcggcgcca 1861 tttcctgacg aggggccatt ttgggaggtc cgcgagtcgc 
gggaggaggc cgggacgcgg 1921 cggacaaagg caggcggggc ggctgcgagg ccgttggggg agggggcccg cgtccgcccg 1981 cccgcctcat gtggccgcgc cctgtcctgt ccgacgcacg tgctcggcgg ccgcgctcag 2041 gtccgcgcct tgagagtcgt tgtccgccct agcttggcct gggcgccgca gaccggagcc 2101 agaagcacgc tcgcgggggc ttgcgaccgc cttcctggga agctgtcccc tggcaggcat 2161 gggtgcttta catcctgagc tgggaagctg tttgcttgag ggtttttctc aaggatcgag 2221 gcgcggtgtg agcccgtcca tgctcggtcc tgtagatccc gggaggccat gttataaaag 2281 gagacttgct gggatgtgac gggttgccac ttgaaatatc ttccatttgg ataaagtagg 2341 aatatttata catgtgcccc aaacgtccct ccgtgtcccc cacccccaag cggaaatgtg 2401 aaaatgggcc ttgcctttgc tggtgcccaa ggaccgcctt ccactgcagt gacggcgctg 2461 gcgggggagg cgctcttgag cccctcccga ttgtccctct gcctagcaag caagttgcga 2521 ctggccacaa ggcaggcctc ttccgaccaa ggtggattac cagtgattac ctaattagtt 2581 ttgagagcgt taaatgagtt cttaaagatc agttgtaatt atagcatagt atctaaactt 2641 ggcgcgtgtc ttcaaagtta aatattgagt acgattccgt tccagttaac atggatagac 2701 cttagggagt agcgaaatag gatgttagtg gttttattcc tttaaatcac atctcaaaag 2761 gccaccaatg gctagtttgg atcttattcc gaaaatagat tgatcctcat gcagtcttcg 2821 tgaggacaga gcgatttcct tgttgcctac cctgtccata gtgcctggca cataggcact 2881 gaaacactgc atgttaatcc acaccccacc ccacctatga gtgtagtcaa agctggtaag 2941 tgacaagggc tttcgtggaa acttggcctg acctaatgtt gggcatcagg ttacccaaag 3001 agcttcaggg aaatgagaaa ggacttgcag gtcttgatga gaatggaggg gtaactgcca 3061 atgagggctt tggctttagc gaaagtctga aagggaagcc ataggaactt aaacgtaccg 3121 actataaagc tctgagaaaa gctgatgttt tagaaagacc atacattcta ggtacaaata 3181 cctaaaaact aaaaaataag tacgttggcc aggcgggcgg atcacgaagt caggagattg 3241 agaccatcct gggcccctgg tgaaacccca cctctattaa aaatacaaaa attagctggg 3301 cgtggtggcg cttgcctgta atctcagcta ctctagaggc tgaggcagga gatcgcttga 3361 accccggagg cggaggctgc agtgagccga gatcgtgcca ctgcactcca gcctggtgac 3421 agcgagactc ttgtctcaaa aaaaaaaaag tacattgcta taagagaagt gcacacggat 3481 actagtagtt aattcagtca catctgtgaa atagcttata aaatgctact tttaaacaag 3541 ctgtttttat gaaagggctt gtaaatgttt atggtattta agctacctct 
ctagccataa 3601 cgtattatac attcaagaaa ggttcaaaac cagatatact agaaaccaat ctttattttt 3661 taccccacta ctaggtaagg gcctggatac caagaagtga ctgctcatct aatccataaa 3721 gctatgttaa cagattggag gtagtagcat tttcattaca agtgactaaa agaacagctg 3781 tttacccctg atcgtgcagc agtgcttgct gttccttaga attttgcctt gtaagttcta 3841 gctcaagttg gggggtggtg atagacattt aagaagccat atatcttttc agaagtaggt 3901 gtgatgtact aaaagtttga gacactttct agaagtctca ctatttaagt tatgactagt 3961 attggatttt tggcatgtct ttgggtttca tgtttcttaa cccaactgcc tgcagggcct 4021 tatggctgtc aggagcagtt cttgggaatt aaagtaatta ctgaagaagt attctagtga 4081 gaaaatgaat ttatgactca gaagccccta aagacatggg tactaagcaa caaaataagc 4141 agatgttaat taactgtaat tttctcttac agctgtttgc agacaaggtc ccaaagacag 4201 caggttggtc cattttctaa gtttaacaaa gatgttccaa ttgtgacagt ttgtgtgtgt 4261 gtgtgtatat atatattttt atgtatgtat atatgtgttt aatttttttt taaacagaaa 4321 attttcgtgc tctgagcact ggagagaaag gatttggtta taagggttcc tgctttcaca 4381 gaattattcc agggtttatg tgtcaggtac gaaatttact gaattttatt ttatttgggt 4441 tgctcccttc atttgggatt gagccagaat atttcaggat acacatatct gaactgttac 4501 tctaccattt cggttctatt taacccttct attcagtttg aacttgggtt taaagtttga 4561 accttgcaga tttggcacac ttcatggtta tgttgtcaga agtgacattt ttcctatatg 4621 ttgacagggt ggtgacttca cacgccataa tggcactggt ggcaagtcca tctatgggga 4681 gaaatttgaa gatgagaact tcatcctaaa gcatacgggt cctggcatct tgtccatggc 4741 aaatgctgga cccaacacaa atggttccca gtttttcatc tgcactgcca agactgagtg 4801 gtaagggtac aacatggcac actaaccacc tgactaaatg aaaagttgcc ctggggggaa 4861 cggaacaaac actacttttc ttcaaccttt gcttccacag actttttcat ccctaagata 4921 ctagaagaag agcatacata aatgacaaat atagccaatg tgatacagaa tgtcagatac 4981 tatgatagaa acttggccct tagctgggtg gttgaattag gtgctacttt tttgagatgg 5041 agttttgctc tgttgccagg ttggagtgca gtggcacaat ctgggctcac tgcaacctct 5101 gcctcctggg ttcaagcgat tctcctgcct tggcctcctg agtagctgag aatacagatg 5161 tgtgccagca tgcctggcta attttttgta tttttgtgga gacggggttt catcatgttg 5221 gccaagctgg tcttgaactc gtgacttaag gtgaaccacc tgccttggcc ccccaaagtg 
5281 ctgggatttc aggcatgagc cactgcgccc aaccaattaa gtgctttttt tttttttttt 5341 cttttctcag actggatctc gctcttatct cccaggttgg agtgcagtgg tgccatctca 5401 gctcactgca acctcctccc gggttcaagc aattcttctg cctcagcctc tcaagtagct 5461 ggaactacag gcatgcacca ccactcccag ctaaattgtg tattattagt agagcgggat 5521 ttaccatgtt gtccaggctg gtctcgaact cctgggctca agtgatctgc ctgccttgac 5581 ccccccgaag tgctgggatt acaggcatga gccactgtgc ccacccaatt aagtgctgct 5641 tttatgttac tattaataac atgcggttgg ttgggttttt tgtttctttg gggtttttgt 5701 tttgttttgt ttgtttttgg gggagggggg cgcaattcat tctatatgtg taactctttt 5761 ttgagatgga gtttcgctct gtcgcccagg ctggagtgca gtggcgcgat ctcggctcac 5821 tgcaagctcc gcctcccagg ttcacgccat tctcctgcct cagcctcccg agtagctggg 5881 actataggca catgccacca tgcccggcta attttttgta tttttagtag agacagggtt 5941 tcaccgtgtt agccaggatg gtctcgatct cctgacctcg tgatccgccc gccttggcct 6001 cccaaagtgc tgggattaca ggcgtgagcc accgcacccg gcctatatgt gtaactcttt 6061 aatggtaatt ggagaatcat gtttaatgac atttagtaca aaaggcttca gttaaaaaaa 6121 aaaaaaaaaa gctacctttc tcgtcttggt tcatgacaca tggaggctgc ttgtttgtgg 6181 ttgccagtca taatgattgt tcttcctttt caaggttgga tggcaagcat gtggtgtttg 6241 gcaaagtgaa agaaggcatg aatattgtgg aggccatgga gcgctttggg tccaggaatg 6301 gcaagaccag caagaagatc accattgctg actgtggaca actcgaataa gtttgacttg 6361 tgttttatct taaccaccag atcattcctt ctgtagctca ggagagcacc cctccacccc 6421 atttgctcgc agtatcctag aatctttgtg ctctcgctgc agttcccttt gggttccatg 6481 ttttccttgt tccctcccat gcctagctgg attgcagagt taagtttatg attatgaaat 6541 aaaaactaaa taacaattgt cctcgtttga gttaagtgtt gatgtaggct ttattttaag 6601 cagtaatggg ttacttctga aacatcactt gtttgcttaa ttctacacag tacttagatt 6661 ttttttactt tccagtccca ggaagtgtca atgtttgttg agtggaatat t // LOCUS HSCRTRGN 11000 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens creatine transporter gene. ACCESSION Z66539 NID g1628386 KEYWORDS creatine transporter. SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11000) AUTHORS Sandoval,N., Bauer,D., Brenner,V., Coy,J.F., Drescher,B., Kioschis,P., Korn,B., Nyakatura,G., Poustka,A., Reichwald,K., Rosenthal,A. and Platzer,M. TITLE The genomic organization of a human creatine transporter (CRTR) gene located in Xq28 JOURNAL Genomics 35 (2), 383-385 (1996) MEDLINE 96299791 REFERENCE 2 (bases 1 to 11000) AUTHORS Sandoval,N., Platzer,M. and Rosenthal,A. TITLE Genomic Organization of the Human Creatine Transporter Gene JOURNAL Unpublished REFERENCE 3 (bases 1 to 11000) AUTHORS Sandoval,N.N. TITLE Direct Submission JOURNAL Submitted (27-OCT-1995) Natalia NS Sandoval, Department of Genome Analysis, Institute of Molecular Biotechnology, Beutenbergstr. 11, Jena, 07745, Germany FEATURES Location/Qualifiers source 1..11000 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cosmid IDs: 8B7" /clone_lib="QIZ, X chromosome specific library" /chromosome="X" /map="q28" prim_transcript <2202..>10685 CDS join(2675..2936,4475..4606,5404..5653,6075..6207, 7141..7275,7363..7466,7562..7686,8005..8117,8230..8367, 8444..8546,8633..8733,8819..8989,9174..9314) /codon_start=1 /product="creatine transporter" /db_xref="PID:e206450" /db_xref="PID:g1628387" /db_xref="SWISS-PROT:P48029" /translation="MAKKSAENGIYSVSGDEKKGPLIAPGPDGAPAKGDGPVGLGTPG GRLAVPPRETWTRQMDFIMSCVGFAVGLGNVWRFPYLCYKNGGGVFLIPYVLIALVGG IPIFFLEISLGQFMKAGSINVWNICPLFKGLGYASMVIVFYCNTYYIMVLAWGFYYLV KSFTTTLPWATCGHTWNTPDCVEIFRHEDCANASLANLTCDQLADRRSPVIEFWENKV LRLSGGLEVPGALNWEVTLCLLACWVLVYFCVWKGVKSTGKIVYFTATFPYVVLVVLL VRGVLLPGALDGIIYYLKPDWSKLGSPQVWIDAGTQIFFSYAIGLGALTALGSYNRFN NNCYKDAIILALINSGTSFFAGFVVFSILGFMAAEQGVHISKVAESGPGLAFIAYPRA VTLMPVAPLWAALFFFMLLLLGLDSQFVGVEGFITGLLDLLPASYYFRFQREISVALC CALCFVIDLSMVTDGGMYVFQLFDYYSASGTTLLWQAFWECVVVAWVYGADRFMDDIA CMIGYRPCPWMKWCWSFFTPLVCMGIFIFNVVYYEPLVYNNTYVYPWWGEAMGWAFAL 
SSMLCVPLHLLGCLLRAKGTMAERWQHLTQPIWGLHHLEYRAQDADVRGLTTLTPVSE SSKVVVVESVM" polyA_signal 10034..10038 /note="putative" polyA_signal 10665..10670 /note="putative" BASE COUNT 1994 a 3474 c 3479 g 2053 t ORIGIN 1 atgggaggat gggagggtga gggaatgggg gatgggtatg taaatgggct ctgtaattaa 61 agaatttaaa ttaaaataaa tgatatattt aaaaaagata caaatggttt tacctatcag 121 gaaaccacta tcatatgaac tggaaacacg caaaatacat cagagggttg ccatgcaaag 181 cttggagaac atcagttcta gtgtgacatt tgttatttaa acacgagttt tctgctgtac 241 aggattagaa ttctgtaggt ttggtttttt ttgttttgtt ttcttttgtt ttaagtcgaa 301 tgtgttgtat gtgctgtgtt ctctagtaag agtcatttcc atgctcctga gacacctgtt 361 ttgatgtcgg aagaatgtag gtttttgttc atgccatgaa cagctctgct cactcagcac 421 tccttggccc tgagtgaggg caggaacctt taaacatgac ctgcccttcc tggagttctc 481 tgctcttgtc ttctctcttc ctctggccac actccttgat ggactttcct gaagaagatt 541 tctcttgctg ttcacttgct tggccctgtg cctgaacctg gaaacacccc agttggtgtc 601 acactcaatt atactatctg ccactctctg gtattgcttt tgtctttctg actgttccct 661 ccaacttaaa attcaagggt tggcattaga gagcctggtg tgacatacac aggaaggctg 721 gggacctctt ttcagctcac atctggaact agaatttgag ggaaatttgc tgatttttct 781 ttcttggcct tttgtatgac ttttccagtt ttctctccta gagtcagtta atataatgca 841 aggacatgag atctgaacct tgagatccag acctacaaag cacagtcagt atttgtaaca 901 tttggctctt gatggacact atacaaaaac caagtgttgt gggctggagc tacggctctg 961 caattgagag cactttctgc tcttgcagag gacccacatt tggttcccag cacccacatg 1021 gcagctcaca atgatctata actccagcac caggggatca gatcagggag cttgaagctg 1081 aggggggcac actttacctc ccaggccagg acaatgacca cttccttccc caccccaccc 1141 ccaggctact cttagcccta gaaaattcta aacaagctgc tcagctggcg gcggagaggc 1201 agcccaacaa gctggctctt gctagggagg cctggggggt cctggggaga ggaacacggg 1261 gtgggtgggg ggcgggcagc caggacctca ggcctgaggc ctttggggaa gggtctgtgc 1321 acctgccagg caccaggggg cagccttgcc ttgttcccgc tccagtcccc tcaagtccga 1381 agcccctacc cactctcacg ccaggcaggg gtgggggccg ccggggtcat ttacccgggc 1441 cccttctctg ccttgatgac aaagtcgagg cttgctcatc agccaggcag gctcccctct 1501 gcccactgtg gagacacaga ggcctgtcac 
ctgaagagct ggtcccggcc tccagcttcc 1561 agggtagccg ggaagctgta gcccccagtg ggcagcggtg gagagagctc aaggaaggag 1621 ggagcaccgg gaggagacgg ctgcagcctg ccaggagcgg ggagaaaggg agagaagggg 1681 aggcggaggg ctgagggggc ccgggggacg tcttcccagg gctgggaggg gccggccggg 1741 aagcctgggc tgcactagga gccggcgacc ctggggcgag gggcggcccg gagccctgcg 1801 ggaggagctg gcggccgccc caggtagcaa ccatcctgcc tcccgctgga gcggcgtctc 1861 ctccccggga ggagggcagg gaggaggtgg gcggagtgtg acgaggaggg cgggagggag 1921 ggatgcggga gggggagggg gaggggggcc ggccggccgt gggggtgggg cgatagtgac 1981 atcaccccgg agtcggtttt taagcggcgg ccggccgggg acggggaaga gagggatagt 2041 cggagcgagg tggcgagtcg ctgagcccgc cgcggccccg agagcggctg cagccgccgc 2101 cgccgggaag gagagggcga ggcgcgcccg agccgccgcc gccgccgcca ccgccgccgc 2161 cgccaccacc gccaccggag tcgcgggcca gccgggcagc ctccgcgggc cccggccggg 2221 gcggggggcg cgggccacag gcccctgctc cggccgccgc ttgcagaccg cgggcgccga 2281 tgtcgcccgc gccccgctag gctgagcctc gggtcgggcg aggagccgcc gcagccgccg 2341 ccgcccgagc cgcgggcagg agcctcggga gccgccgccg ccgccgccgc cgcccggccg 2401 ggccccgccg ccgcccgcgc gcccccgggc ccccgacaca catgagattc ttcaggctca 2461 ctttcaagtg cttcgtggac tgcttctgac tgcgccgccc gcgccccgca ccccgccgcc 2521 cgcccgccgc cccgtccccc ggcccggccg ccccccggcc cccggccggc ccgcgccctc 2581 ggggccctcc ccggtgccgc cggtgccccc cgcctgaccg ccgccccccg tgaggcgccg 2641 cgaccccggc ccggccgtgc ggcccgccgg ggccatggcg aagaagagcg ccgagaacgg 2701 catctatagc gtgtccggcg acgagaagaa gggccccctc atcgcgcccg ggcccgacgg 2761 ggccccggcc aagggcgacg gccccgtggg cctggggaca cccggcggcc gcctggccgt 2821 gccgccgcgc gagacctgga cgcgccagat ggacttcatc atgtcgtgcg tgggcttcgc 2881 cgtgggcttg ggcaacgtgt ggcgcttccc ctacctgtgc tacaagaacg gcggaggtga 2941 gttcccccgc ccgccgcggc ctcctccccc agcaggccgc cggcccccgc ccgacccccg 3001 gagccgccgc ggaggggtga agtccgggca acgggtggcc cccgggcacg cgggggtcgg 3061 ggccgcccct cgtccgccgc tgccgctcgg tggccgggcc gggcgcctcc acccccctcg 3121 cagtcatgtg cctggcatgg tggggggagg gggccggcga tgcccgcgag gctgcccccc 3181 agactcccgg gctgggagga gcgattggcc gccgaggtgg 
gaaagcaggc ctgcgccttg 3241 gggtctccgc gaggtaagga gccctggctg cccccacggg tcgggcacac aagcggcaca 3301 ttgtgtgggc cccccacgtg tgcacacaca cgaacacaca cacacacaat gggccactct 3361 gtccctcccc ctgccctccc ctcccctcgc ggccctcccg cccctcccct ctggcccggg 3421 cctggaacac tgggtgcccg agccaggctt gggaagcctg cggcctggcc cgcctggcgc 3481 cgccactgga cacactgcat gcacgtccca tgcccgcccg cccgcccgcc cgcccgggcc 3541 cagcttagca acagcgatgg gcacgcgtgt gtcctgtgac tacaaaacag cactggggtt 3601 gctggaagcc gaagtgaccc ggtgatgggt gggaaacaga ggtccagagc aaaggccttt 3661 gcccaaggtc aggagaagga tgctgggacc tggagtcagg caagttgcag ccaagctcag 3721 cctctgagta gtggagcgag cccagccagg gcaagggtag gaggcccaga gaggagaagg 3781 gggtagtggc acccagctct ccctgccctt ctgccacccc caccccagcc tgctggcctc 3841 aggagatagg cctgtgtcac gccctgccta tctcctgcag agcctgactc cctggccttg 3901 ctaaggccgg cctggcccct cttccgcacc tgtatccctc tgtccttgca catcgccatc 3961 ccaccagcag gggactgtga cccacccacc ctctgcctta gacctcacac ttgcaggcaa 4021 gcgtccaagg gcaggacagt cgcgctccct gcctttggat gagcccccca ggcctgatca 4081 cccagccttg gcacacatgc acacatgcac gtgccctcac tgtgctgcct gaaacaggga 4141 attgcagcac tagggacagc ccgcgtgtct gagcgtgtgt gtcctccatg gccatcgccc 4201 caagtgaccg tgggggtgga agccctgggg gcctagggcc cctctgccac ccagggaata 4261 gggctccaat ggctcagggg ctactgtagc ccctcttcaa cacactcaac ccaccccctc 4321 aagactccac ctggggcctg agtcagtggc cacccctaca ctgactcacc cagtcggaag 4381 ttgtgatggg gcctttggag tctgggctgg cccgctgggc ctgggcagcc tggctggggg 4441 ccaccctgag tccacgctgt gcctccaccc ccaggtgtgt tccttattcc ctacgtcctg 4501 atcgccctgg ttggaggaat ccccattttc ttcttagaga tctcgctggg ccagttcatg 4561 aaggccggca gcatcaatgt ctggaacatc tgtcccctgt tcaaaggtga gcagcccttg 4621 gccagcctca gggactgccc ccttctccca gctggctccc acttgagaaa tcttttcctg 4681 tcgtgagcac caggcctggg gccacgtgat ggcgtcccag tctcgagggg ggagcctgga 4741 ggagatgttc aggccgcaca gcgaacttgg ggaagcgggg actagagggg gcataggcag 4801 ctccacaagg caaggacagg ccaggcatag ccgggctggg gacgggacct gcccagcagc 4861 acccttggct ctctaggtag gtcctactgt tactatcccc aaggacgctg 
gggcacagac 4921 aggtggagcg acgtactgag gttgcccact gcaggggcga ctgtctccaa cactacctca 4981 ggcgactaga aacccccccc ccccaccacc accatcaaca ccagctgctg aggactggag 5041 gctactgggt ggccaggcag aggcttggac ctcctggaac cgccatggtg gcagtgggac 5101 ccacagaagg ggccaggtgt atgaggctgg agactccaca gcacttggtc agatggggac 5161 aggaggagag gggctcgctc tgccttgggt ctagggggcg gctggaggag aggagacagg 5221 ctggggagtc agcgcagtgt tggggctcac acaaggggga gcccagggga gtcaggagca 5281 ccacaaacaa ggctccagga ggacagatgg tgggagcacg gccagcctgg gtggggacat 5341 aaaggggtgg cagggggagg tggccaggga agaatctaca tggcaaggac ttcccggccc 5401 caggcctggg ctacgcctcc atggtgatcg tcttctactg caacacctac tacatcatgg 5461 tgctggcctg gggcttctat tacctggtca agtcctttac caccacgctg ccctgggcca 5521 catgtggcca cacctggaac actcccgact gcgtggagat cttccgccat gaagactgtg 5581 ccaatgccag cctggccaac ctcacctgtg accagcttgc tgaccgccgg tcccctgtca 5641 tcgagttctg ggagtgagtc cggcacctct gggccaagcc catcccatcc cccaggtctc 5701 cctcatgttg cccggctcca ggggagtggc cctgaggggg caccagggtg ttgcctggca 5761 gtccatcctg gaccctgcct gcccttgcct gtcctcggag agtcctgggg ccagcctcgc 5821 tcctgggttc ggcagccgat cactgtcctg gtcactcccc cctgatgggg gagctggggc 5881 tgcatgtgag gtgggatggg agtggcctcc caatggccag gggatcgtgg gctccaggcc 5941 cagcccaatt ggacaagagg gacccgctga accctgggct gtgggagaga agggagccac 6001 aactcctggg ggtggaccct gtggctccat cctctgctgg cacaggcctc atgggacctc 6061 cctccctccc ctaggaacaa agtcttgagg ctgtctgggg gactggaggt gccaggggcc 6121 ctcaactggg aggtgaccct ttgtctgctg gcctgctggg tgctggtcta cttctgtgtc 6181 tggaaggggg tcaaatccac gggaaaggta ccactagagg catgcagcgg ggagggtggc 6241 tcagccctgg gagccggatg tctgtgccag gcacacctgt ggcaacggga ggtgaccaga 6301 cagagtctag ccctaaggaa gggggaggta ctgaaagcca agcaatgctc cccaccctgc 6361 aaatccaggg cccagcagcc tttgctcctg gggatagagg ccctggcagg cactgtccct 6421 tccctgtgcc catcaccccc actggtgccc tcctgccagt ctctgactct tgtgacagtc 6481 tggtggacct ggtctggcca tctgttacct atcttgcctt ggggacccag agcagagtct 6541 ggccacatcc cttgggggct cctggtcagg ctggggagtc acctgaacaa agaagacagt 
6601 gtctagagct gtgggacatg gccagctccc tgggggacaa ggtccccaga gcagcatgtg 6661 ggaagagggg gcagacagtg tggcagctgc atctcgcctg cctctgcctg gcccagttcc 6721 actctccacc tgctcaaccc ccacctctct ccagaagagg agggggaccc gacccggatc 6781 caatatcccg ctccctgcct gggcctccca cacctgcact gcccacacac tcatacagct 6841 ctcactcccc acgtgctcca cgcctcctgt ccccactgag gagagctccc agaggctcgc 6901 ctgctcccca ccgacacgcg tccctgcaga caaacgaggc gcccagggag cttccccact 6961 gcacttggcc agggctgccg gggcgcagcc ttgcccctag cttcctctgg cgggagccat 7021 ggctcggagg acaatgggga cctctgaaca tacctgcccg caagggggac cggaggcgct 7081 gggagtgggg gtgtgaggga ggtggtgcca cagcctccgc tgagcagcct ggccccccag 7141 atcgtgtact tcactgctac attcccctac gtggtcctgg tcgtgctgct ggtgcgtgga 7201 gtgctgctgc ctggcgccct ggatggcatc atttactatc tcaagcctga ctggtcaaag 7261 ctggggtccc ctcaggtgag gtggaggtgg agaggctgca gcagggcgct gcgggggagc 7321 cctgcaggcc cctcatgcct gcgctctccg gcccttctct aggtgtggat agatgcgggg 7381 acccagattt tcttttctta cgccattggc ctgggggccc tcacagccct gggcagctac 7441 aaccgcttca acaacaactg ctacaagtaa gcaccgccgc cctgccaccc gtgccctgtc 7501 ctgccctgcc ccgccctgcc cagcagccta acccatccac tctggcccct ccacccctca 7561 gggacgccat catcctggct ctcatcaaca gtgggaccag cttctttgct ggcttcgtgg 7621 tcttctccat cctgggcttc atggctgcag agcagggcgt gcacatctcc aaggtggcag 7681 agtcaggtag ggccctaccc ccagccccgc ctccagagca gcgagtgcta cccagatgca 7741 tgatgtacag gaacatgcaa tagaaatgct gaaaagtgac gaggattcaa acggaacttg 7801 tcagattgtg ggcctgtggg ggcaggtcct gggatttgtc aatgttgaca gagaaaggac 7861 ctcccagccc ctgccgcacg acccagggtt gacagcgcct ctgaggcagg cgtgggcatg 7921 ggcgcgagtg ttgcaggcag ggctcagggt gcgcacaggg caggacatcg gctacaaggt 7981 ctagagcctg cacctttccc acagggccgg gcctggcctt catcgcctac ccgcgggctg 8041 tcacgctgat gccagtggcc ccactctggg ctgccctgtt cttcttcatg ctgttgctgc 8101 ttggtctcga cagccaggtt tgcatggggc tctgggacag ggagccagga ggggggcgga 8161 gggagggctg caggcaagga aaggggtgga gggcggtgcg gggctcggcc tgagctgccc 8221 tggccacagt ttgtaggtgt ggagggcttc atcaccggcc tcctcgacct cctcccggcc 8281 
tcctactact tccgtttcca aagggagatc tctgtggccc tctgttgtgc cctctgcttt 8341 gtcatcgatc tctccatggt gactgatgtg agtggggtgg ggggtctgcc tgtgacctct 8401 ggtggccgtc tgccatcctc cctgactggg ctctgtcccc cagggcggga tgtacgtctt 8461 ccagctgttt gactactact cggccagcgg caccaccctg ctctggcagg ccttttggga 8521 gtgcgtggtg gtggcctggg tgtacggtag gtcatggctg agggctgggc tgggggatgg 8581 tggcggggaa ggcaggtctc cagcttggcc ctcccgcctc acctcgccgc aggagctgac 8641 cgcttcatgg acgacattgc ctgtatgatc gggtaccgac cttgcccctg gatgaaatgg 8701 tgctggtcct tcttcacccc gctggtctgc atggtaaggg ctgggggagg tggggcaggg 8761 cggggggcga ggcagggcgg ggtaggggcc ccattaaccg cagcattctg gtccgtaggg 8821 catcttcatc ttcaacgttg tgtactacga gccgctggtc tacaacaaca cctacgtgta 8881 cccgtggtgg ggtgaggcca tgggctgggc cttcgccctg tcctccatgc tgtgcgtgcc 8941 gctgcacctc ctgggctgcc tcctcagggc caagggcacc atggctgagg taaggctccc 9001 gcccggcccg ccctcccctc ccctgctgtg aacattcaac ccagcctgct tcctagccag 9061 ggagtggccc cgactagggt ggcaggcagt gggaaccgga gagaggcaga ggaagtcacc 9121 gtggggacga gcaggtgacc ctgggggctt cagcatgtcc tcctctcctg cagcgctggc 9181 agcacctgac ccagcccatc tggggcctcc accacttgga gtaccgagct caggacgcag 9241 atgtcagggg cctgaccacc ctgaccccag tgtccgagag cagcaaggtc gtcgtggtgg 9301 agagtgtcat gtgacaactc agctcacatc accagctcac ctctggtagc catagcagcc 9361 cctgcttcag ccccaccgca cccctccagg gggcctgcct ttccctgaca cttttggggt 9421 ctgcctgggg gaggagggga gaaagcacca tgagtgctca ctaaaacaac tttttccatt 9481 tttaataaaa cgccaaaaat atcacaaccc accaaaaata gatgcctctc cccctccagc 9541 cctagccgag ctggtcctag gccccgccta gtgccccacc cccacccaca gtgctgcact 9601 cctcctgccc ctgccacgcc caccccctgc ccacctctcc aggctctgct ctgcagcaca 9661 cccgtgggtg acccctcacc ccagaagcag cagtggcagc ttgggaaatg tgaggaaggg 9721 aaggagggag agacgggagg gaggagagag aggagaaggg aggcagggga ggggcagcag 9781 aaccaaggca aatatttcag ctgggctata cccctctccc catccctgtt atagaagctt 9841 agagagccag ccagcaatgg aaccttctgg ttcctgcgcc aatcgccacc agtatcaatt 9901 gtgtgagctt gggtgcgagt gcacgcgtgc gtgagtacgg agagtatata tagatctcta 9961 tctcttagca 
aaggtgaatg ccagatgtaa atggcgcctc tgggcaaagg aggcttgtat 10021 tttgcacatt ttataaaaac ttgagagaat gagatttctg cttgtatatt tctaaaaaga 10081 ggaaggagcc caaaccatcc tctccttacc actcccatcc ctgtgagccc taccttaccc 10141 ctctgcccct agccaaggag tgtgaattta tagatctaac tttcataggc aaaacaaaag 10201 cttcgagctg ttgcgtgtgt gagtctgttg tgtggatgtg cgtgtgtggt ccccagcccc 10261 agactggatt ggaaaagtgc atggtggggg cctcggggct gtccccacgc tgtccctttg 10321 ccacaagtct gtggggcaag aggctgcaat attccgtcct gggtgtctgg gctgctaacc 10381 tggcctgctc aggcttccca ccctgtgcgg ggcacacccc caggaaggga ccctggacac 10441 ggctcccacg tccaggctta aggtggatgc acttcccgca cctccagtct tctgtgtagc 10501 agctttaacc cacgtttgtc tgtcacgtcc agtcccgaga cggctgagtg accccaagaa 10561 aggcttcccc gacacccaga cagaggctgc agggctgggg ctgggtgagg gtggcgggcc 10621 tgcggggaca ttctactgtg ctaaaaagcc actgcagaca tagcaataaa aacatgtcat 10681 tttccaaagc aggctcctgc ttccgcctct gctgctctaa ggaaggggtc ggggtacagg 10741 aggcaggggg aacctcctcc agctggagct gctgccgtga gcaaggctct gctctggagg 10801 cctctgcggc cggcaccctt ctggggactg ggaagggggc agggaaggca gcagcccagg 10861 ggaaggcctt gtccccctgg agccgaggca gttggggaga gcaggacgag agtgagctgg 10921 agagcagcca cacccgcggg gaagggtggc gtaaagccat gggtgctgaa attttcaaaa 10981 tgttacccca agaatttgtc // LOCUS HSCSRP2S2 7942 bp DNA PRI 16-SEP-1997 DEFINITION Homo sapiens cysteine and glycine-rich protein 2 (CSRP2) gene, exons 2 through 6, and complete cds. ACCESSION U95018 U72536 U72537 U72538 NID g2078337 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7942) AUTHORS Weiskirchen,R., Pino,J.D., Macalma,T., Bister,K. and Beckerle,M.C. TITLE The cysteine-rich protein family of highly related LIM domain proteins JOURNAL J. Biol. Chem. 270 (48), 28946-28954 (1995) MEDLINE 96081967 REFERENCE 2 (bases 1 to 7942) AUTHORS Weiskirchen,R., Erdel,M., Utermann,G. and Bister,K. 
TITLE Cloning, structural analysis, and chromosomal localization of the human CSRP2 gene encoding the LIM domain protein CRP2 JOURNAL Genomics 44 (1), 83-93 (1997) MEDLINE 97432823 REFERENCE 3 (bases 1 to 7942) AUTHORS Weiskirchen,R., Erdel,M., Utermann,G. and Bister,K. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) University of Innsbruck, Institute of Biochemistry, Peter-Mayr-Str. 1a, Innsbruck A-6020, Austria FEATURES Location/Qualifiers source 1..7942 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ICRFP700J1150Q06" mRNA join(U95017:874..955,404..516,3304..3472,6235..6364, 7008..7101,7619..7931) /gene="CSRP2" /note="CRP2" /product="cysteine and glycine-rich protein 2" gene join(U95017:874..1178,1..7942) /gene="CSRP2" intron <1..403 /gene="CSRP2" /number=1 exon 404..516 /gene="CSRP2" /number=2 CDS join(405..516,3304..3472,6235..6364,7008..7101,7619..7695) /gene="CSRP2" /note="CRP2" /codon_start=1 /product="cysteine and glycine-rich protein 2" /db_xref="PID:g2078339" /translation="MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNL DSTTVAIHDEEIYCKSCYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTT NPNTSKFAQKYGGAEKCSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTE KEGEIYCKGCYAKNFGPKGFGYGQGAGALVHAQ" intron 517..3303 /gene="CSRP2" /number=2 exon 3304..3472 /gene="CSRP2" /number=3 intron 3473..6234 /gene="CSRP2" /number=3 exon 6235..6364 /gene="CSRP2" /number=4 intron 6365..7007 /gene="CSRP2" /number=4 exon 7008..7101 /gene="CSRP2" /number=5 intron 7102..7618 /gene="CSRP2" /number=5 exon 7619..7931 /gene="CSRP2" /number=6 polyA_signal 7915..7920 /gene="CSRP2" 3'clip 7932..7942 /gene="CSRP2" BASE COUNT 2311 a 1595 c 1552 g 2484 t ORIGIN 1 agatcttcag ctgagttcaa aaaaatatgt ttattaaaaa tgggcatatt agggaggggg 61 acaaggcttg cctttcccct ccccacgctt ctttcttaac tcctgtggat tctaatggca 121 tagcttgttt tgtgcctggt gacctgtact attgtactga atgcaggtcc tttgtctggc 181 aggaaaaaaa aaaaatcttg tgcacgacaa gccagtttcc taatgaaaga gttgttattt 241 gggagaggag atgaggttat tttggaggaa cgcaggcctt tgaaagatgt 
ggttagagag 301 gttcataaag aaacgaaaat ctgcagtccc tgggcttaaa tagccctcta gggagccgta 361 cgaccagcac agggacattc taatgaatcc tttattcctt cagaatgcct gtctggggag 421 gtggaaacaa gtgtggggcc tgtgggagga ccgtgtacca cgcagaagag gtgcagtgtg 481 atggcaggag cttccaccgc tgctgctttc tctgcagtaa gtagctctcc atgctccacc 541 tttgacggaa tttgtatgta cagagcttca atatgtctat ttaaggaaaa attgtggttt 601 tgccaacaaa gatcattttg aaagcacgaa tggtttttac ctgtcttctt agatgttgta 661 aatgccaagc ctagcagcga agacaggagg ggtggcccaa gacacccagc ctccctttgt 721 gtccccatag cttttggcct ccattttggc ctttattttg ttaacacatc tgcctccccc 781 acctcactga gtgctgggtg tcacttatgt ggtctggcac ctagcacagt ttctggcaca 841 ttacaagggc ttagctccca tttttgagta aatgattgct cctcatccac cccagctttc 901 tctttctcag gttcccttta atgacaacaa agtcttcggt tttaaaactg tagagtagca 961 acttaaagca cttcattaat catcttctga tttaatgaca gaattttctt taatgaatga 1021 ttatgtcagg taaacccacc catacgggca gatgtttgcc ttatcaatag tattgtacag 1081 ggaaaattcc tacagaaaac cataggttaa aaataggtta cattgagaaa taatcttcag 1141 gaaaagactt ctttctcaaa tttgaattgt ccttttccct cattggagcc agcattcaag 1201 gtatctggac ctctttaata gcataactaa actgctttgc agacaaccag aagcagacaa 1261 cagaaagttg cagttacaaa aaaataaaaa tactacatag gaatgttcac tatgtatgcc 1321 ttcctcagaa gacttggttt aaccattgct tttagaagtt acttttttaa aaaaggaatc 1381 tatttttcag caggcttgta accaagaaag tatggaaaca gaatgcaaat actttagaaa 1441 ttgctctcag tttcatctgt ctaggttaag ccacaatgtc tacaccatag taacatatgc 1501 gtgtatgctg ttgacatttg ttttactaaa tgtcaagttc tctactggtc atttctacga 1561 atcctttcta atatctgtct ctgctgacta atgcctttta aggaaataat tcagagtagt 1621 ttgcagacct atgagaattt tactgcagag aacagcatga ggactgggga agaaaggggt 1681 ggttatatag taacactttc atatacctac acttgttttg gttcatacag gtttttcaaa 1741 acagtattta catttattct ctctgttaag ggtagttgtc cttcagcagt gcatggtgtt 1801 aggttaacta tggtttaaga tatagttatc tttaggcaca tgtatttcag cacttgagcc 1861 tcaacagtta gattccagag tagatgccat ttcattctac catagaagac tttagtttaa 1921 gataatagac taaatttaaa ccagccagct agtcctgctt ccctcccctc ctctttgctg 1981 
tccttccttt ttattcaacc cccccccttt tatttttctt acttttaatt tttatgggta 2041 catagtaggt atatatattt atggggtaca tgagatgttt tgctacaggc atacaatgta 2101 tgttaatcac atcatcttaa atggggtatc cattacctca agcatttatt atttttttgt 2161 taccaacatt ccagttaata ctgtcttatt tttaaatgta caatgattgt tgacagtaat 2221 catcctgttg tgctatcaaa tactagatct tattcattct aactatattt ttacacccat 2281 taactctcct cacttccttc ccgacaccct acccttccga gcttctggta accatcattc 2341 tactctgaat catcatacaa ttgtctttta aataagggga agaaattctg atctttgatt 2401 tccagctgcc agattgtgct accagtgagt ttcctcacaa tttacttttt tctcaagctt 2461 aattgaagtt ttagaaaatg aaataatgtc tggatgaaac tcgatgacat ttggcagttt 2521 tatgaagtgg gacacgtgga aatgtcctgt ggcaagagat ccatcaagaa agaatgcatt 2581 ctcctttggg acagaaggga aaacaaatgc agactttaaa gcacttgacg attttcaggc 2641 agactcatca ctgctggaga agctgctggg gagaggaaag tgtgatactt cccagtctag 2701 actagtgatg aaaactagaa ttcaggattt aaaactatta ttatcccagg aggtgctttt 2761 gggggaggaa aggctacaga ggcaagtccc cgaagcactg agcttccctt ctgttctgta 2821 ttaggttgtg atttaacagt acttggttaa gagaaggctg aaaagattaa gagtgttgag 2881 ggttgggagc aaggtggtgt tttattcctt ttgtttaagc atttttatag ccacctttga 2941 aacttttctc ccttaatatg atctcccaga aagggaaatc atagtaagag tttgatcagc 3001 accacaggaa agggaataaa aaataaacga aacaaactaa aaataaagct tgaggcttct 3061 ggatgttata ttctgataca tttctgaagc aactgtcata tccatggagg aatgtaaata 3121 atcctcgaat gtagctggcc agtccagatg ctgtctcatg gcttttggga agccacagga 3181 taaaccagga atgtgaggga gcctcagcct tcaccatttg taggccatgt gtctgtgcca 3241 ccattgttct aaggaaaaag gcatctttgg aacatgtgta aagtctaact ttttcccctg 3301 cagtggtttg caggaaaaat ttagatagca caacagtggc aattcacgat gaagagatct 3361 actgcaaatc ctgctacgga aagaagtatg ggccaaaagg ctacggttat ggccagggcg 3421 ctggcacgct taacatggac cgtggcgaga ggctgggcat caaaccagag aggtgagttg 3481 ggagaccgtt ggttaccttc agaaatgagt tgcgacagtt agtgccgctt gctgaacagg 3541 aacctggcag tctccagatt tttagttgaa actaagcaaa taaaatggga ctctgatttt 3601 ctaatgtcaa ctagttcaaa atattgagtt ttggacctaa caaagttctt attaataatt 3661 ccagtgattt 
ctacttctgg ctttatagca ttccaagctg tatcaaggag tcgtgttctc 3721 aaaagcaagt gatttgtatt tacttcaatt tcacagcacc ttggagaccg gatgtcatga 3781 aattaaatgg tagagaaatg tgctgtgtga atcatgagag gtgtttctca tgtgcaatgt 3841 aagcagccaa tgaaaattcc taagtcactc cttctatgta aacatttttc atcaagaaat 3901 cacttttctc taacttaaaa ttttttactt aaaaatggaa attacttctt atttttgatt 3961 ctttgagaga gtagtgttaa actttaaagg ctttgttgcc aacgtatcaa ggtgctaaat 4021 ttgtgaaagt tcacttagta tttcatattc aggttaaata acttatttgt ttagaattcc 4081 ccccacacac tttttttttt cttttttgtc acagaatctc actctgtcac ccaagctgga 4141 gtgcactggc acaatctcgg ctcactgcaa cctctgcctc ccaggttcaa gtgattctct 4201 tgtctcagcc tcctgagtag ctgggattac aggcgtgcac caccacaccg ggctaatttt 4261 tgtatttttg gtagaggcgg ggtctcacca tattggccag gctggtcctg aactcctggc 4321 ctcaagtgat ctacttatct cggcctccca aagtgcatgg aataacaggc atgagccacc 4381 gtgcccaacc tggaatttcc tcctaactcc tacttttaac ccagtgattc cactatttga 4441 acttcacatt ttcacatcca ttttatattg agaagtccaa aattggtgta atagtccaac 4501 aaaatggcct ttggacaaga ccataaacca tgccagccac cgtgcagcat tagcatatgt 4561 taactcgtgt aatcctcaca acagtcctac aacatagggc ttcctctctt tcttcagaac 4621 agatgaagac aagaaaaatt aagtaaaaac tttgcccaaa gtcacactgt tagttaattg 4681 tagagcctgg aacttaacct agtctaactc caaaccttat gacctgaatg aaacgtggtg 4741 cttggcttgg ctaaaataat aattaccatg tattgagtcc ctgttgtctg ttgaacattt 4801 atcttcattc tgtataatcc ttagagtaac tctgaggtat actttgtcca tttatagatg 4861 acgacactaa gagtcgggtc acacaaataa gtgagggagc caggatttga atcaatgtcc 4921 atttggttcc aaaatctgtt ctcttcctac tgcagtatgt actacattga ggctcacctc 4981 tgtgttcttt gagcttcttc caccctcaga gtttctgggc catgatgccc agcttacctt 5041 tttatctttg ttaaaattta ttttacttgt tctgctctcc ttttccatct tgttacatct 5101 atttttgtgt tttaaaggtt ttgttgaaaa cctttcaaaa tccatatccc ctcactcaac 5161 ctgcatctcc ctcaatccaa accagattta tccatgaccc caaaagtaag tcagtgacta 5221 tggccatagt tttaaagcac aaagaccagt gaaccctcct ttttttaaaa aaaaaaaaaa 5281 atatatatat atatatatag acagggtctt gctatgttgc ctaggctggc ctcgaactcc 5341 tgagttcaag ttatccttcc 
acattggcct cccaaagtgc taggaagaca ttgtttttcc 5401 agtaggaaaa attaacagtt gtggagtgct atcattttat ctgctcagct aatttataag 5461 gttctaagag ggcttaaaaa aaaaaaagtc ttctcaaagc ccgagaatgt tctgcacata 5521 ctaggtacac cttactgaat ggaaacagtt cctactgata gcaagaaata tggtttactc 5581 agtgatctgt aaatagtgac aaaaatacca cccttaccag agaaaaatga aagataaaca 5641 gattgcatag taataatcaa aatttataaa atatttttcc atgaagaaag ttggctattc 5701 acactttcct ttattcaaga tatactggga gactaagtaa tctataaaac atgcttaaat 5761 ttcttagtta cagaccaaag ttaacttacc tatttgtgga atttttaaac tacttaatag 5821 tggtttttcc gactcctgtg aagtaaaaag ctaggtatga ttccagtgat attattctgg 5881 agtaaaaata tatgctctgg taagttttta taaaccgtaa cataattgct tagtggagag 5941 gccaacactg aggagtaaat gaattctgtt tagactcaga gggacagatc tttacgtttg 6001 ggtccctatc tggttttttg aaataataat gagcccatag ataggttcaa gtagtaactc 6061 tggctggcac ctgagtcact tcttctttat tccctcccct gtgctggcct gcaaaggctt 6121 ccagctttga ggcagtgtag cggttaagtc ccagagttct attgttcctg ccttgtgtac 6181 ttaaaagggc cctgtggaaa ctcttgccta cttttttttt cctcttttca caagtgttca 6241 gcctcacagg cctacaacaa atccaaacac ttctaaattt gctcagaaat atggaggtgc 6301 tgagaagtgt tccagatgtg gggattctgt atatgctgcc gagaagataa ttggagctgg 6361 aaaggtaaag tgctatttgg aattaataac agcaaatgac ttctctctgc tccctttctc 6421 ccctaaccct tccacattcc ttgcttgggt gatgtcttac aaatcactgt cttatatgaa 6481 acttttcttt aaatgcattt ccatcagaat cattaacttt tttgaagtga ttcccttatt 6541 tgacaagtca tttttctgtt taaaagaact tgttttcata gggggcactc ctgaaggtta 6601 aactttaaaa ggccagactg taagggaatc tccaggttat tcagtatagt gaaaaaagca 6661 ggaatccgag tcaggacacc tgatgcgaat tcagtgtgtg gatttggaaa agttaccagc 6721 taacctcttt gagctgcgct gttcttggcc agaaaaaaga tggtctcttt aaaggagcag 6781 actgggcttg tggttaggac tgaggctgct cttagggaaa cagcctttct gtgggtgatc 6841 aagagtgcag atactgagga ctggtttcag ccttctcatt ggagctgaac cacagaagtc 6901 aggagcttgc tttgagatga tcatcctgag ccacttcttg ccatacatct aaattgaact 6961 gtaggaaaca ggcctctcaa acatgctgac acattcatct cttccagccc tggcacaaaa 7021 actgtttccg atgtgcaaag tgtgggaaga 
gtcttgaatc aacaactctg actgaaaaag 7081 aaggtgaaat ctattgtaaa ggtaaaattc atttcagtta ctgctgtcca tatgaatatc 7141 cacactggtt cttgtttaat gaaatgtcac tggcattaaa aaaaaaaaaa gaaaagtatg 7201 ccactatttt ctgaaaataa aatatcccaa gatgcttata gatgacagag aaccacagtt 7261 aagttcctga tcatttaaaa ctccccacca gataaaaaca ccacaaatag ccatcttttg 7321 ccttcatgtc ttgagcaaag cgtaagtata tggaagaaaa cagacaaata cataccttac 7381 ctggcctgac tcatgcttgc aactcagcag gaccacgctg aacactatag ggccagaatg 7441 tgtggacttg tctttgcagc tttaagatat ttagtacatg acacatttaa catcttaaat 7501 aagttgagct cttttacaca gcagtttaat ttatgggggt ttgttgagtg ggtcttaagc 7561 agtgctagca aagcttaatg gacttgcact aactatcctt cctccctttc ctttgtagga 7621 tgctatgcaa agaactttgg gcccaaggga tttggctatg gccaaggagc aggggctctt 7681 gttcatgccc agtaagatgt aaaccctgaa ctaaacatca cacactgaga atctcttcat 7741 aatctaggca cagataatct ttaacactaa actactgtga aattctacca gcattaagta 7801 ctgtatatcg ccctgtactt ggataggctg gctaactcgt aggaagagag cactgtatgg 7861 tatccttttg ctttattcac cagcattttg ggggaacatt tcttttacat tttaaataaa 7921 acttcagctt gatttgggtg tt // LOCUS HSCST3G 7292 bp DNA PRI 01-DEC-1993 DEFINITION Human CST3 gene for cystatin C. ACCESSION X52255 S38807 NID g30257 KEYWORDS cystatin C; cysteine proteinase inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7292) AUTHORS Abrahamson,M. TITLE Direct Submission JOURNAL Submitted (22-MAR-1990) Abrahamson M., Dept of Clinical Chemistry, University Hospital / Lund, S-221 85 Lund, Sweden REFERENCE 2 (bases 1 to 7292) AUTHORS Abrahamson,M., Olafsson,I., Palsdottir,A., Ulvsback,M., Lundwall,A., Jensson,O. and Grubb,A. TITLE Structure and expression of the human cystatin C gene JOURNAL Biochem. J. 268 (2), 287-294 (1990) MEDLINE 90303202 REFERENCE 3 (bases 1 to 7292) AUTHORS Abrahamson,M., Jonsdottir,S., Olafsson,I., Jensson,O. and Grubb,A. 
TITLE Hereditary cystatin C amyloid angiopathy: identification of the disease-causing mutation and specific diagnosis by polymerase chain reaction based analysis JOURNAL Hum. Genet. 89 (4), 377-380 (1992) MEDLINE 92316504 COMMENT See for mRNA sequence. FEATURES Location/Qualifiers source 1..7292 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda C3" /cell_type="leukocyte" /clone_lib="Charon 4A" /chromosome="20" GC_signal 352..357 GC_signal 939..944 TATA_signal 960..965 prim_transcript 1010..5287 mRNA join(1010..1327,3580..3693,4948..5287) exon 1010..1327 /number=1 CDS join(1085..1327,3580..3693,4948..5031) /codon_start=1 /product="cystatin C" /db_xref="PID:g296643" /db_xref="SWISS-PROT:P01034" /translation="MAGPLRAPLLLLAILAVALAVSPAAGSSPGKPPRLVGGPMDASV EEEGVRRALDFAVGEYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGRTTCTKT QPNLDNCPFHDQPHLKRKAFCSFQIYAVPWQGTMTLSKSTCQDA" sig_peptide 1085..1162 mat_peptide join(1163..1327,3580..3693,4948..5028) /product="cystatin C" intron 1328..3579 /number=1 exon 3580..3693 /number=2 intron 3694..4947 /number=2 exon 4948..5287 /number=3 polyA_signal 5271..5276 BASE COUNT 1508 a 1881 c 2316 g 1587 t ORIGIN 1 tctagacctt gaccaaatgc tcacggatac tcaggacctc cgaagactcc ggaaggccga 61 gacctctatc tttccctttt ctgaggaaat gaagctaggg ggtcgtatgc ttttagggcc 121 cagggggcct tgacacctct ctggagggca gggtgaccta agcttgggag gagggacagg 181 tctttgggaa gagatggagc tgagggggtg gagtgggtag aaggggggca taggtgtgga 241 aggggtgggg gaagaggggc caggagtggg gtgagaagag gggaaagaag aggataggag 301 ggacagggac agggagaggc caggaatggg taggaaggga agacagagga ggggcgggac 361 gagggagggg cgatgaagag gggccgagct ggggtggggg cgaagtggga gcaaggacag 421 gagtggagga gggagatgag gggcatggga ggcggtgtct gggggttgaa aggggaaagg 481 gacagaggag aaggagcctg aagagtggcg gcatggaggg gcccgaagtg gggaggtggg 541 aatggtcgga tggatgggga ggaatgggga ggataggaga ggcaggggag gctggaagac 601 agtaggagag gagggggagg ggaggggagg ggatggatgg gaatggatag ggaaggccag 661 ggagactggg gggccagggg aggaggggag ggagagggga ggcgggaagg gagaggagag 721 
gccgggaggg gctgggaggg gagggaaggg gatggatggg gaaggacagg gaagcctgga 781 ggggcagggg aggctgggat gggaggagac tgatagggag ggacctggag gcggggagag 841 gcaggggagg cggggagagg caggggaggc ggggagggag aagggaggtg ggagggacga 901 ggcgttccgg aaaagggagt gcaggccgcg gtggggtggg gcggcgaagg ccggaaggga 961 taaaaccgca gtcgccggcc tcgcggggct cacggcctcg cctcggtatc gcagcgggtc 1021 ctctctatct agctccagcc tctcgcctgc gccccactcc ccgcgtcccg cgtcctagcc 1081 gaccatggcc gggcccctgc gcgccccgct gctcctgctg gccatcctgg ccgtggccct 1141 ggccgtgagc cccgcggccg gctccagtcc cggcaagccg ccgcgcctgg tgggaggccc 1201 catggacgcc agcgtggagg aggagggtgt gcggcgtgca ctggactttg ccgtcggcga 1261 gtacaacaaa gccagcaacg acatgtacca cagccgcgcg ctgcaggtgg tgcgcgcccg 1321 caagcaggtg cgtgccgccc cccgcagggt ccgaagcccc ggccccgccg tcccagcctc 1381 cccccgcgct gctcccggac cccgtgctgc tcctctccgg cgcctgggct tctcacccgg 1441 accctgttcc cggtcttcgc tgtcaccccc gagcccttgg cgggcgtgtc cgggaatgcc 1501 ctgagtctgg cctggcctgg aaccgcacgg acacgtcagg tccgcgcccg gcgcctaaga 1561 tcagcttcca ggagccagct cgagctgcgc cgcagcgcgg gggcaggcac aggacgtccc 1621 acacagctct gtcccgacct cctgggacgc tgggctccgg gtgcgctgct gaatacgcag 1681 gagaaggaaa aacagagccc ctcatctggc tgcctccttg ttcttccacc aaataatttt 1741 taagcacttt tgtttctttt aactttttat tctcccgtaa tttcagactt acagagcagc 1801 tgcaaaatag cacagggcat ccctgtagat ctttcaccca gatccctcag atgttagcac 1861 agatgttaac attttgccct gtttgttccc cccagctcac tttccgaatg tgaatatggg 1921 tgtgtgtata tgtactgtat atgcattttt tgcctgaatc ttttgaaagt aaacacttca 1981 gtgtgttttt cctaaaaaca agggcaatat cgcacataat cactattcca ttttgaaaat 2041 cagcactgac acattcatga tctaatgctc agactcattc agctttccct gtctgttgac 2101 ggccttcgtg ccctaggacg ctcctgggcc atgagtgcat gagttcagcc ctgtcctgtc 2161 ccctcggcct cttttaacct gcagcagcag ctgggtctgt gccacgaccg tgccatttcc 2221 cagggtctag caggtgtgga tggagactat gctgactctg ggtgggcttg atgctgctca 2281 agatgagatc taggccatgc ggctcatcct cctcccagaa tactctccgc aggggccaca 2341 cgtgagcctg gcaccttgtc ctgcagagcc ctgcttccct ccccaagtca cacccctggg 2401 cacagtcccc 
taggactagc ggccttcacc ctcaggccgg ctgaccaccc cctacatccc 2461 agggcagctg agtccctgct ggggtggagc atgcctgacc ctgcctctat cagctgatgc 2521 agagttagac ctcagccaga tgaagacacc cagcagaccg gagtaggggg tcggatcggg 2581 agggagcttc agtagggcta ccaggcccag cttgacctgc atcccatggc agagcagcga 2641 acagtgacac agactttaga gctcctccac cttctcttgg aaattcagag gagtccagac 2701 cagccgcgtt tctcccgcag ccgtcagctg gggccctctc cctgcacagg agggccattc 2761 cctggtgtag ggtccccgct ggcctgcacc tctctctcac tgggagtgaa gcatggggca 2821 ttgtagatcg ttgggccctg gagcctattt tacagaggag cagactgaca ccagagggat 2881 cacaggcctt gcttgtgctt ctacaggact tgtgtgtggt tccacagggc aaggtctagc 2941 accctggtcc cagggtccct catcccatgc ttctccacag ttctgacaag tcatgttttg 3001 gggcggcact gtgcagggaa agcattcagt tctcttctga agttgcaacc ctaagacatg 3061 caggtgtgtg actcacttta gaaatattgc cttgaaaatc acacctggaa tggaagcacg 3121 tgggaagcaa tgtttattgg cctaaaacat caatgtatgt gagcatctca tctcctagtg 3181 agaaatgagg aaaaatacct ctgggttaaa tggcaggaat gagatgctct gtggactgaa 3241 tgccaggaac tggaagtttg ccgaaatttc atcatcacat gagaaccttc ctagaataga 3301 tccagtgtcc ctgccccctg ggtcataggt agcggaattc agttaatcct tggcattggc 3361 atagagaaac aagttactgg ggaggcctgg ggcatggcat ccctgccagc tggcaggagg 3421 aggtggcttg tgtgccttgc aggtgacaat gtgggcagct catgaaggta ggcttgaagc 3481 cccaggcaag cccagtgacc cggtcacagt gaagtgcctg tgtgtgtaag aaactgacag 3541 aacgtgctgt ccctgcctcc tgctctttca catgtgtaga tcgtagctgg ggtgaactac 3601 ttcttggacg tggagctggg ccgaaccacg tgtaccaaga cccagcccaa cttggacaac 3661 tgccccttcc atgaccagcc acatctgaaa agggtatgtg ccttatatgg gtccagggcc 3721 agtcatacac tgcagagggg tgtgtgtgtg tgtgtgtgtg tgtgatgcac atgttctgca 3781 gggtacgtgt gcatgtgcct gagtgtgtgt acacgtggag atgcatatgt gtttccaagt 3841 ataggtttgt gtggggaagt gcacgtgagt gtgtgcaagt gggtgtgtgg atgtgttggg 3901 gaagtgtggg ggtttatgca tggataggtg tgggtgtgtg agtatatgtg tgtgcacata 3961 tgtgtggggg tgtctgtgca catacattca catacatggg ggtttctgtg aacatacatg 4021 ctgtattatg aggaattacc tgccctgtgt gtgtgtgtgc atgtgggtag atagatgtat 4081 gagggagtat gtggattcat 
gcatagatcc gtatgggtga aggttagggt gagttcatgt 4141 agatgcctat gtgtgtgcat gtagacgggg tggtgtggag aggagtgatg aatttgattt 4201 gctaagaggg ctttagcttg ggattggggt actgggagct ccaccctatg tgctttggag 4261 tgttgcctac tggactgcag gaagcagctg cagggctggg tgctgggcag ggagaaaggg 4321 ctctgtctaa tcccagcctt aggcacctgc ccacagccac ggccatgctg agcagattag 4381 agggtagtag aggcctgtta gcagccaaac cctcagacct gccccagctc acccaacacc 4441 agcttctcca aggacccagg atttcttgtg acgtctctgc tcaggggaga gccacactct 4501 ccttgtcatc tcctcaccac cccatgtgct atgacttgga atttccagtt ccctgggttc 4561 ttccctctcg cccttcatag tgctggcctg agggctggag gtggaaggag ctgggggcag 4621 tgagctgcct ccccctgccc tgtaccctta gggctcccga ggccttgcac aggctgctcc 4681 tcacagggct gtgctggggc aggaatcctg caggctgggg tgggggccca gtgccaccag 4741 gtagagttgg agccctggga gagggatgga gcagtcactg cccagtccag cggtgctctg 4801 ggatcagcgg gggtgggtgg aggggtagat aagggcccag tgtttcacct ccatccttct 4861 tcactcccac tctgatgtcc cgtgcctggg catcttctgc tttgaactgt aacccacact 4921 gattgtcccc tctgtttcct ttcacagaaa gcattctgct ctttccagat ctacgctgtg 4981 ccttggcagg gcacaatgac cttgtcgaaa tccacctgtc aggacgccta ggggtctgta 5041 ccgggctggc ctgtgcctat cacctcttat gcacacctcc caccccctgt attcccaccc 5101 ctggactggt ggcccctgcc ttggggaagg tctccccatg tgcctgcacc aggagacaga 5161 cagagaaggc agcaggcggc ctttgttgct cagcaagggg ctctgccctc cctccttcct 5221 tcttgcttct catagccccg gtgtgcggtg catacacccc cacctcctgc aataaaatag 5281 tagcatcggc tccctctgag ttcttggctg tctggggatg tgcacacagg cagggtttcc 5341 gcagttcctt tatgaagcct ccttgtcctg ctggtgtgaa gatgagagga gtacctggga 5401 gctgacgcgg ccacagcaag gccatcaggc aagctgctgc tataggagtc ctgagtctca 5461 gagcagggag agcagccagg ggctgggaaa caaggcgttg agccacagca gcccctggtg 5521 agggggtcag ggagaggagg ggcccaagct gctgccccca gaagctgggc tggtgatttg 5581 gtctgagctg ctggtcagac caggaggagg gagctggctg tgtccataga caggggccag 5641 gcctcggtgg agctcgccag gtcatagatt ccatctgtgc ttgcagagtt gagcagaccc 5701 cagggactga gctgctctca tcaaatcccc tagacactaa atagtgatca atgcgtgctt 5761 ctttccaaga gacttaggcc cctacgtgga 
aacaaagtcc aaaaagcctg tctgtgtaac 5821 tgagcaggag aaatgccagc cagaccagag accagcaaga tcacagagaa agtggcacct 5881 gctgtcatcc atcctgggag gctgtgtcct cccagagttt tacaggctca tagccaccag 5941 ccctttctct ggacttggat ttaaaacaag ttgttagtcc agcttcatgg tagggatgaa 6001 gttaaagcac aaatgagacc gacaatgcag tggactcttc tttttccaag gaggattaag 6061 agagaacatt cccttcatgt cagcattatc ctatagaata gtggatcttt gtgccacaag 6121 caggaggcta cagtcctaag acacagccca ccctgcacag tccttccctt tgccatttta 6181 ggacaggggt gcagccctta cagccagtgc tgctggggca ggcctcccag gtaccttcca 6241 acctcggagc tgcactgacc ccctcttttg gctgctgcca tgaacactga aggtgaaatg 6301 atgaaggaaa atccacccca gtagcgccaa ggccagcacc ccacatgctg tccctgcaaa 6361 gtcagagcag gggacacggc agggagagtg gggagcgtct gcttcctttt cctctccacc 6421 ctgacttgct ttaacagggt ctttggggtg agggaagtgt agcaccccct aggatgccat 6481 gagctgagga gatttgagtc tgggcctcac cctgctgggg gtgggtggtc ttagtcactc 6541 agttttctgt ggggggttgg gggatagaag tggtggttgt ggtcagtggt aaagggtccc 6601 tgcagggagg tgctcctggt tttctacctc ggtgaaggat ctttgacaca ggcaacctcc 6661 atccaggcac ccagtggaaa cggacagctg accatatagc ccagtgcctt ttctgacaca 6721 cttcttcctt ggggaagttg aggttggggg cgggctgtgc acagggctcc ccagcctccc 6781 tgtgtgtctg cttccttccc tgccctccgc ccggtggccc catggcccac agcccacagg 6841 gaagcaagtt ggtgccaccc ctgcctcccc tccgcagccg gcgggcctgt aaatgcctcc 6901 cttgggcccc catgaggcca gtgagccaga agcccctctg acccagaggt cactgtgacc 6961 agcacatcct ggagaggagg cctcgccctt gcacctttca tctgcaagag ttacttttac 7021 atttatttta aatgttgatt taagtatata cttcatctac agttatccat ataaatagat 7081 tcattacagg gcctggctct gaataaacac agtaagttgg agaggctttt tcattgatct 7141 ggttctacac tggtggcttt ataacatagt gctttgacac ctcttgcatt gtgttgctgg 7201 tgaccattgg ggtgggatca ggaagggccc agtagcttct gagctcccta gcattcttgt 7261 cctgaaggaa gccagccttt gcagaaggat cc // LOCUS HSCST4 4285 bp DNA PRI 13-NOV-1991 DEFINITION H.sapiens CST4 gene for Cystatin D. 
ACCESSION X59964 NID g30263 KEYWORDS CST4 gene; cystatin; cystatin D; cystatin superfamily; cysteine-proteinase inhibitor; extracellular protein; secreted protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4285) AUTHORS Otin,C.L. TITLE Direct Submission JOURNAL Submitted (29-MAY-1991) C.L. Otin, Departamento de Biologia Functional, Unversidad de Oviedo, Facultad de Medicina, C/Julian Claveria s/n, 33006 Oviedo, SPAIN REFERENCE 2 (bases 1 to 4285) AUTHORS Freije,J.P., Abrahamson,M., Olafsson,I., Velasco,G., Grubb,A. and Lopez-Otin,C. TITLE Structure and expression of the gene encoding cystatin D, a novel human cysteine proteinase inhibitor JOURNAL J. Biol. Chem. 266 (30), 20538-20543 (1991) MEDLINE 92041895 FEATURES Location/Qualifiers source 1..4285 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /cell_type="blood leukocyte" /clone_lib="Charon 4A" /clone="ChC21" GC_signal 344..349 TATA_signal 389..393 /note="tata box-like sequence" exon <489..719 /gene="CST4" /number=1 mRNA join(<489..719,2546..2659,3894..>3977) /gene="CST4" gene 489..3977 /gene="CST4" CDS join(489..719,2546..2659,3894..3977) /gene="CST4" /codon_start=1 /product="cystatin D" /db_xref="PID:g30264" /db_xref="SWISS-PROT:P28325" /translation="MMWPMHTPLLLLTALMVAVAGSASAQSRTLAGGIHATDLNDKSV QCALDFAISEYNKVINKDEYYSRPLQVMAAYQQIVGGVNYYFNVKFGRTTCTKSQPNL DNCPFNDQPKLKEEEFCSFQINEVPWEDKISILNYKCRKV" intron 720..2545 /gene="CST4" /number=1 exon 2546..2659 /gene="CST4" /number=2 intron 2660..3893 /gene="CST4" /number=2 exon 3894..>3977 /gene="CST4" /number=3 polyA_signal 4212..4218 BASE COUNT 895 a 1115 c 1236 g 1039 t ORIGIN 1 gaattcccag accatgacca cgtgctcctg gatgcccctg acatctgaag actttagaag 61 gccaagccct gtatttctcc ctttcctgtg gacataggtg gtcaccaggt ttcacagccc 121 agggagcggt gtcacctctc tggaatccag agtgagccaa gctttagagc actgaggaag 181 aggggcaggg ctgtggggag gttgcggtca ggacaggagt 
ggggaggagg atgaggagga 241 ggcatgggag gagcgaaggg aacttaggga gggagggagg tggttggggg tggaaaggga 301 gatacagaca gaagaggagg ggtagaaaga ggaggaggag ccagggcggg gccagggagc 361 aggtggggtt gggggacacc caagtaggat aaatgcacag ctagcttctg gcctgggctt 421 cctgcctcag ccttctgctg ctttgctctc tgggagatcc agctccttgc tctgtgtccc 481 agtagaacat gatgtggccc atgcacaccc cactgctgct gctgactgcc ttgatggtgg 541 ccgtggccgg gagtgcctcg gcccaatcta ggaccttggc aggtggcatc catgccacag 601 acctcaatga caagagtgtg cagtgtgccc tggactttgc catcagcgag tacaacaagg 661 tcattaataa ggatgagtac tacagccgcc ctctgcaggt gatggctgcc taccagcagg 721 tgcgtgctac caccaccctg ggggtcctga gtcccagctg ggttttttgc ctcacccccc 781 agagcactcc cagcaaatca acattatcta aaccgcagac tcattcagct ttctctgact 841 gtctgctgat ggtcttcatg ccctaggaca ctccttggcc gtgagtgcat gagttcagcc 901 ctgtcctgtc ccctcggcct cttttaacct gcagcagcca ctgtgtctgt accatgactg 961 tggcatttcc cagggtccag caggtgtgga tggagactgt gctgactctg ggtgggcttg 1021 atgctgctca ggatgagatc caggccacga ggttcatctt cctccctgag tcctctccac 1081 aggggccaca cgggaacctg actccttgtc ctggatagcc ccgcttccct cccaagtcac 1141 gcccctggca cagcccgtta tggctagtgg ccttcactct caggctggct gaccaccccc 1201 tacagcccag gacagctgag ttcctgctgg ggtagagcat gcctgaccct gcctctgcca 1261 gctaacgcag agttagacct cagcaaaatg aggacagcaa tcacccagca gagtgaagga 1321 ggtggttggg tccagaggga ggaagcttca gcagggctac cgagcccagc ttgacctacg 1381 tcccatggca gagcagcagt gacacagcga ccacagggct atatggcctg ccagccttta 1441 gagctcctcc accttctctt ggaaagtcag aggagtccag accagccctg tttctcctcc 1501 tgcagccctc tccctgcaca ggaggggcat tccctggtgc tgtggtccct gctggcctgc 1561 actccctctt aagtgtgtca ctcactggga gtgaagcaca gaatgatgta gatcgttggg 1621 ccctggagcc tattttacag agcagcagac tgacaccgga gggatcacag gacttccatg 1681 tggttctaca ggacttgtgt gtggttccac agggcaaggt ctagcaccct ggtcccaggg 1741 tccctcatcc catgcttctc cacagttctg acaagtcatg ttttggggcg gcactgtgca 1801 gggaaagcat tcagttctct tctgaagttg caaccctaag acatgcaggt gtgtgactca 1861 ctttagaaat attgccttga aaatcacacc tggaatggag gcatgcagga ggcaatgttt 
1921 attggcctaa aacatcaatg tatgtgagca tctcatctcc tactgagaaa tgaggaaaaa 1981 tacctctggg ttaaatggca ggaatgagat gctctgtgga ctgaatgcca ggagctggaa 2041 gttagctgaa atttcatcat caggtggcag ccttcctagg atgggaccag tgtccctgcc 2101 ccctgagcac aggtagcaga attcagttaa tcctttgctg tgggaaagag cattcacttg 2161 gtgcttagac cctgccctgc aagcctggtg ccaggactgt ttctttgtcc caccctggct 2221 ggttccccag agctgtgcct tctgttcctg aatccagtga agggggtttt aggccctggc 2281 ttccatctgc cctgcccctg cttctctttc tactgggctg catgagacag cttgcttgag 2341 atacccagag aaaacaagtt actaggaagg actgggacac actaccactg ccagccagca 2401 ggagaaggtg gcttgtgtgc cttttgggtg acagtgtggg catgaagccc caggcaagcc 2461 cagtgactca gtcacagtga agtgcctgcg tgtgcatgaa actgacagca tgctgcccct 2521 gcttcctgct cttccacgcg tgtagatcgt gggtggggtg aactactact tcaatgtgaa 2581 gttcggtcga accacatgca ccaagtccca gcccaacttg gacaactgtc ccttcaatga 2641 ccagccaaaa ctgaaagagg tatgtgcctg atgtgggtca ggggcatcaa gtaccgcaga 2701 gcagtgtgtg catgtgtgtg tgtgtatgtg tgtgtgcacg catctgtact cctgcacatg 2761 ctttggaggg catgtgtgca tgtgtgcaga tatttgtggg gccacgtatg caaggatgta 2821 ttcatgtgca tgggaggatg catgtgtgtt tggcacacat gtgcagatgt gtattgtgaa 2881 gtcaatgaat gtgtttgtgc atatggagtt ttgtgtatgc atgaatgaat gtgtgggatg 2941 gtgtacacat gtggctatgc atgtagaaag atgcatatgt gtgcatacat gaggaagatg 3001 cacaggagtg tgtgtgtgca tgtgtgtgga tgtgtgaagg agtatgtggg ttcatgcata 3061 tattcgtttg ggtgagggtt gctgtgagtt aatatagatg cctatgtgtg tgcagatgga 3121 gtgggtgtgg ataggggtga tgaatttgtt ttgctaggaa gactttagct tggtaatggt 3181 tactgggagg tcaactctgc ctgctttggg gtgttgtctg ttggactgga ggaagaagct 3241 gctgggctgg gttctggtca gaaagaaggg gctctgtcta accccagcct caggcacctg 3301 cctgcagcca cagccactct gagcacatta gaaggaatta aatgcctgtt agctgagaag 3361 ccctggacct gccccagctc acccaacatc agcctctcca agaacccagg atttctttcg 3421 aggtctctgc tcaaggcaga gccacactct ccttgtcatc tcctcaccac ctcgggcact 3481 ttgagttgca atttccagtt ccctgggttc ttccctctgg cccctcttag tgctggcctg 3541 ggtgctggag gtggaaggag ctggaggcag tgagctgcct cccctgtcct gcacccctga 3601 
ggctcccaag gccttgcaca ggctgctcct catagggctg cgctgggaca ggaatcctgc 3661 aggctggggt gaaggcccaa tgtcacctgg tgacttggag ccttgggagg ggcaatggaa 3721 cagtcactgc ccagttgggc tcagtgccct ggaactcggc aggggtaggt tggggtcagg 3781 gggcagtgtc tcatctccaa ccattttcac ccccattctg atgtcccatt cctgggcatc 3841 tttggcttta actgtaaccc acagtcattt tctccatctc tttcttttca caggaagagt 3901 tctgctcttt ccagatcaat gaagttccct gggaggataa aatttccatt ctgaactaca 3961 agtgccggaa agtctagggg tctgtgcaag gcctgtcaca ctgaccacct cctactccca 4021 ccccctgtag tgctcccacc cctggactgg tggcccccac cctgggggag gtctcctcat 4081 gcgcctgcac caggagacag acacagaagg tggcaggagg cctttgttgc tcagcaggag 4141 gctctgccct tgctccttcc ttcttgcttc tcatagcccc agtgtgcagt gcacactctc 4201 ccacctcctg caattaaaca gtagcattgc ctccctctga gttcttggct gtttggggat 4261 gtacacacag gcagggtttc tgcag // LOCUS HSCTAS 9967 bp DNA PRI 14-MAR-1995 DEFINITION H.sapiens gene for cone transducin alpha subunit. ACCESSION Z18859 NID g312991 KEYWORDS cone transducin alpha subunit; transducin; transducin alpha subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9967) AUTHORS Morris,T.A. and Fong,S.L. TITLE Characterization of the gene encoding human cone transducin alpha-subunit (GNAT2) JOURNAL Genomics 17 (2), 442-448 (1993) MEDLINE 94010942 REFERENCE 2 (bases 1 to 9967) AUTHORS Morris,A.T. TITLE Direct Submission JOURNAL Submitted (01-DEC-1992) Morris A. 
T., Purdue University, Biology, 702 Rotary Cr., Indianapolis, Indiana, USA, 46202 FEATURES Location/Qualifiers source 1..9967 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Lymphocyte" /clone_lib="Clontech-Human lymphocyte genomic" /clone="HGLG3" mRNA join(1..331,2576..2618,2901..3042,4294..4451,6647..6775, 6984..7113,8979..9132,9539..9967) /gene="Human cone transducin alpha subunit" exon 1..331 /gene="Human cone transducin alpha subunit" /number=1 gene 1..9967 /gene="Human cone transducin alpha subunit" CDS join(214..331,2576..2618,2901..3042,4294..4451,6647..6775, 6984..7113,8979..9132,9539..9729) /partial /gene="Human cone transducin alpha subunit" /codon_start=1 /db_xref="PID:g732760" /translation="MGSGASAEDKELAKRSKELEKKLQEDADKEAKTVKLLLLGAGES GKSTIVKQMKIIHQDGYSPEECLEFKAIIYGNVLQSILAIIRAMTTLGIDYAEPSCAD DGRQLNNLADSIEEGTMPPELVEVIRRLWKDGGVQACFERAAEYQLNDSASYYLNQLE RITDPEYLPSEQDVLRSRVKTTGIIETKFSVKDLNFRMFDVGGQRSERKKWIHCFEGV TCIIFCAALSAYDMVLVEDDEVNRMHESLHLFNSICNHKFFAATSIVLFLNKKDLFEE KIKKVHLSICFPEYDGNNSYDDAGNYIKSQFLDLNMRKDVKEIYSHMTCATDTQNVKF VFDAVTDIIIKENLKDCGLF" intron 332..2575 /gene="Human cone transducin alpha subunit" /number=1 exon 2576..2618 /gene="Human cone transducin alpha subunit" /number=2 intron 2619..2900 /gene="Human cone transducin alpha subunit" /number=2 exon 2901..3042 /gene="Human cone transducin alpha subunit" /number=3 intron 3043..4293 /gene="Human cone transducin alpha subunit" /number=3 exon 4294..4451 /gene="Human cone transducin alpha subunit" /number=4 intron 4452..6646 /gene="Human cone transducin alpha subunit" /number=4 exon 6647..6775 /gene="Human cone transducin alpha subunit" /number=5 intron 6776..6983 /gene="Human cone transducin alpha subunit" /number=5 exon 6984..7113 /gene="Human cone transducin alpha subunit" /number=6 intron 7114..8978 /gene="Human cone transducin alpha subunit" /number=6 exon 8979..9132 /gene="Human cone transducin alpha subunit" /number=7 intron 9133..9538 /gene="Human cone transducin alpha subunit" 
/number=7 exon 9539..9967 /gene="Human cone transducin alpha subunit" /number=8 BASE COUNT 2912 a 2249 c 2166 g 2640 t ORIGIN 1 gtaggtctcc agttgaagta gggagtctca gtcaatgtag gcagagtaca agaccctaca 61 gcctgctctc tcacctgcca tcgtacagac cagcttttag gggagccaag ttgggatact 121 caatcccaac ttttttcctt ctcttccatc tcacatacag gaaaccttac gagagaggat 181 taggggcctg aaaaagctga caagacggca aatatgggaa gtggagccag tgctgaggac 241 aaagaactgg ccaagaggtc caaggagcta gaaaagaagc tgcaggagga tgctgataag 301 gaagccaaga ctgtcaagct gctactgctg ggtgagtgag atgggaagat gagccagaga 361 aggcaggggt ccttcctact ttcctgaagg gttggtgggt tctacctcac cccatgggaa 421 ggaagggtgg caggtcattt ttcctctcct ctacacagtc tggctagggg atcaggagat 481 ctagagctga gttaatatgg ggcctaaaca gccaccccaa ggggatccag aatgccaagg 541 ctattccaga gtttttctac tcttgagcga ggaatagtgc aagggccacc atgaacccac 601 ccatttccaa atctaaccaa acctaacaca tccttttggc ttcaaggacc ctggacttgc 661 agactgtacc caatctgcag agaactctaa gccaagaaat cagaagagaa caggaccttc 721 cctcaccaac aggctcacaa gtcccaccat acagtcagtg ccaacagtac cagagatagt 781 cccactggtt tttgccaagt atagtggttc cttcttgctt tcagtaaaaa ctttggaagt 841 agggggctct gaggaaggag gaatggtgtc tttatgtaca gcagtccctt cctggctctc 901 tattcaatag ctgcctgcaa agctcctgcc agatggaagg ttcatcaact tgatgagctc 961 ctaagcagat cactggtctg tgctgagaaa ataaaagcac ctcaatttgt cagggaaatt 1021 gatcacagct gtaaataaaa ccaagacaag aacatttgag acacgtggct taggaaaaca 1081 aaccactggt accacagaag tagggtagct ggagcaggta gggtctacgt agcagaagat 1141 tagatgcctg agctgggttt ccaagccccc ataagggatc tgggagctga cgcactaggc 1201 taaggcacct tcttttcccc cagctgatct gtggcacagt cgtaaggaca cactaaagga 1261 gcatatcttt gtaagctgga ccagactcta aggagcccag gaggttacgc agggggaaac 1321 agagatggtg gggccactga gagatctttt aagcctaagc agatttcttc tacattcagg 1381 ataagctgct tagagggaac aagcacaagc gaaataggag gagttcgagg cattagggta 1441 gtataaactc agtactagaa ggtatgagtt tttgatggag agagcagagt gtgaatgagg 1501 acattaggac acattagtca ataaagggaa cccacttagc cccatccaag accaggttga 1561 gcaacatggt gaaaacctgc ctctacggag 
gggcgggggg aaatggccgg atgcagtggc 1621 gcacgcctgt aatcccagcg ctttgggagg ccaagatggg tggatctacc tgaggtcagg 1681 agtttgagac cagcctggcc aacatagtga aaccccattt ctactaaaaa tacaaaaatt 1741 agtcaggcat ggtggcatgc gcctgtagag tcccagctac tcaggaggct gaggcagaag 1801 aattgcttaa aacccaggag gcagaggttg cagtgagcca agatagcacc actgcactcc 1861 agcctgggag acagagcaag actgtctcaa aaaaaaaaaa aaaatgtagt cgggcatgat 1921 gtcgcacacc tggaatccca actactcagg agactgaggt gggaggatca cttgagcctg 1981 ggaagtcaag gctgcagtaa gccgagattg tgccattaca ctccagcctg ggctacaaac 2041 ctgagaccct gtctcaaaaa aaaaaaaggg aaccagcaag gtgatgaaaa tatagttatt 2101 ttggtgaaat gatccagctc tctccaatcc taccccaaga tcactctcct gagtcaaaaa 2161 taggcagaga ggagaaatgt taaaggactc cccctgaatc tgttagtggt tttcaacagg 2221 aagattttga ctgccagtgg tcatttggaa atgtctggag acacttttgg ttgtcataac 2281 tagggaagtg ggatgctatt ggcctctagt gggtagaggc caggaatgct gctgaacatc 2341 ctaccatgca tgagacagcc caaactaaat agtactgaag ttaagaaact gctctaaatc 2401 caggttgaat ggcctgagct caagcctgcc agaaattgag ggcagcagtc atccctatgt 2461 attctcccct aacaagaccc ccaagcaagc agtggctctg acttctccca ggccatctcc 2521 tggaaggctg aggagaactg gtggaaatcg aaagcataag catttttcct tccaggtgct 2581 ggggagtcag gaaagagcac catcgtcaaa cagatgaagt gagtagaaac aaagccccaa 2641 aagacaagat agggtgaaga agtcagtaca gccagtggag gtattcaaag tgaaaggctc 2701 tttagcctca aggagcccag gtataaagga tctgattcca atgcccctca tctgtacccc 2761 ttgtccctca ccctccactt tgagaagcag tagcaacaga gagataggat tcttatgatc 2821 ctttaaatac ccccaaattc ctaatccctt aggtctggtt actcaggtcc agctaaagac 2881 agagtgtctg ccccttgcag gatcattcac caggatggct attcaccaga agaatgcctg 2941 gagttcaagg ctatcatcta tggaaatgtg ctgcagtcca tcctggctat catccgggcc 3001 atgaccacac tgggcatcga ttatgctgaa ccaagctgtg cggtatgtga ttactattat 3061 gtggttaagg gtggaagcag aaaggctagc aagaagaaac ataccagagg ccaacaaact 3121 atatggaaag atgggtaaga aaaatagtaa ctaaaatccc acctgctggg tgggatctca 3181 ctgctaggtg tactgtgtat accatctcaa ggccctttac ccatctacgc agtaagacct 3241 taaggtagac ggtacctttc taattagcct cattttacac 
atggaaaaac tgaagttcag 3301 agaagttaat taactgccca aggtcatgta gataaatagc agagactgca tttgcattgg 3361 gctgcttgac tacaaagctg aatatttttc ataatacacc acaattatgc agtcctggag 3421 aagtaagcaa catccaccct tatttttact gggacagggt cttcctctct tacccaggct 3481 ggaatgcagt ggtgtgatca tggctcactg caatctcacc ctcccaggct taagtgattc 3541 tcccacctca gccccagtag gtgggaccac aggtgtgcac caccacaccc agctattttt 3601 taaaattttt ttgtagagaa ggggtcttgc catcttgtct aggctggtgt caaactcctg 3661 gccttaagca atcctcctgc ctcagactcc gaaagtgttg agattacagg tgtgagccac 3721 cgtgcctgtc ctcaagccct actttataac acatatttac aaaataaagc tgctcacagc 3781 ttcctctgct tttttacttt accatttaaa ttgctaccct tgggttcctg gccatggaac 3841 ttttgtaagt gaaatcccta cttctagtac tggacttttt ctagctttca atcgctgaag 3901 aacaaaggtg atgggtccct gtcttctatc ttgtctattt catggtcaat ctggccttta 3961 cattttcagt ctctctgcag cagaggaccc acgttttgtg gcggtaaact ttcagactaa 4021 ggatggaagc cagaaactcc agagaggaaa tacctgggac ccctgccccg ctttagacac 4081 accccagggc tgggtatagt gccaccctct gcatctgggg agcatactca aaaattcaac 4141 agtatgtttt cttacataga atcttcactg gatactgctt ccatcttagg tcttcgtagt 4201 aatgtaaaat gatttccatt cagcagtatt cccagaagcc tcctggaaag ctattactcc 4261 tgtgaagttc ttaaccaggt ttctgcatta caggatgacg ggcgacagct caacaacctg 4321 gctgactcca ttgaggaggg aaccatgcct cctgagctcg tggaggtcat taggaggttg 4381 tggaaggatg gtggggtgca agcctgcttc gagagagctg cagaatacca gcttaatgac 4441 tccgcatctt agtaagactg actggtgaga gggtgggttg atgcttaagc aatcttctag 4501 ccagtcttct ctctggttgg gagaaacctc acccaaccca aaatttcagg cattgaaagc 4561 tggagaccca gactgaattc agcctgtgga tctgttttgt taggcttcag caatgttttg 4621 aatttaatgc tatggggaga tctgccacag ttgtcatgac tttctattgc tttacactgg 4681 ctcactacag cctacaaggc tgaagatgca aaattcaaaa gacgtgttct gcctagtagc 4741 cctcagccct gttctcttaa ttgaggtgta cactaagcct ctccttccag aaccctaaga 4801 tagccctgca gttctgtcat acactctcgc aggagtaatg tgattcatga tggctctttc 4861 agggcagaat tttctatctg gtaatctagg atctacttag gccactaagc agaaatgagt 4921 ctcctacacc tagagcatct tatgtcacga tgccaaggca ggggacacaa 
actacaaggg 4981 caaaccaagg gaaggtggtc aggaattagt accatcattg caaagctgct tccaaaaagc 5041 taagggacat aatttatcaa tcctacctca gacagtatgg gatcatagac aagtatgggc 5101 ttcaaagtca aacctgagtt tatttaacct aaccatctgt ctcctaatct tgtaataatg 5161 ctatcactta ctgttcccgg agcactcaaa gcccagtcac tgctaggtgt actgtgtata 5221 ccatctcaac gccctttacc catctacgca gtaagacctt aaggtagatg gtacctttct 5281 aattagcctc attttacaca tggaaaaact gaagttcaga gaagttaatt aactgcccaa 5341 ggtcatgtag ataaatagca gagactgcat ttgcattggg ctgcttgact acaaagctga 5401 atatttttca taatacacca caatttctct tgaccctagc caagtattta actcttctga 5461 atttctccat ctgtaaaatg gggatatgaa acttgatggc ttaggagttt aaacgagaat 5521 ctgaacatta aaatgcctgg catacaatag gcacttgtgt tacttgtact tccctcctcc 5581 tctcagtcat ctcagttcac atctgccttc ctcttgcctt tttttttttt tggagacgga 5641 gtctcgcttt gtcacccagg ctggagtgca gtggcgcaat ctcggctcac tgccagcttt 5701 gcctcccggg ttcacgccat tctcctgcct cagcctccca agtagctggg gatactggcg 5761 cccgccacca tgcccagcta attttttgta tttttagtag agatggggtt tcaccgtgtt 5821 agccaggatg gtttcgatct cctgaccttg tgatccgccc gcctcggcct cccaaagtac 5881 tgggattata ggcgcgagcc accgtgcctg gccacctttt tttttttttt taaaaataac 5941 tttttctttt ttaaggtaac tttcaaacat acaaaagtag aaataacata atgaattacc 6001 ttcaacagtt atcaactcat ggccaatctt atttcttttt acctccattc cctactggat 6061 tattttgaag caaattccag gtatcatttc atcctaaaaa tgtcattatg gatctctaaa 6121 atattgaccc tttttataaa aagaatataa tcacacccaa atagttaaca attccttaat 6181 agcatcaaat atcaccaagt cggtgttcaa atttccccag ttatctcaaa tgtttcctcc 6241 ccagtttgat tcaggatata aacaaagtcc atgtgttgca ttggttcatg tctttaaagt 6301 ctctttcaat ttactgcctc ttgatggaac ttttagcacc cactaaatct cattcaaaaa 6361 tctacaaaag cttgcagtca gggcaaaccc aaggaaaact tatctaactt gatttaaatg 6421 cgctggaact ccagcctcaa atgggcttgt ccctcagtcc atcccccatt tcctctagtt 6481 aatgacatat gaaggtttga aggaaagaat cctaaattta acaatatggc ttctagtcct 6541 gatttctatc accaaattag ctgtgtgact ttgggtaaga taccttttac gtctcttagc 6601 ctcgtctgtg gaaaatttgt cattgatgct aatcactttt ctctagctac ctgaaccaat 
6661 tagaacgaat tacagaccct gagtacctcc ctagtgagca agatgtgctc cgatccagag 6721 tcaaaaccac aggcatcatt gaaaccaagt tttccgtcaa agacttgaat ttcaggtaag 6781 tgcatggttc cctagggcat ctcggataca cgtgtggaat cctgaaaagg ggcacaggca 6841 ggcttcttgg cctttggtga agcctaatct aaattactgt tctgcttctc ccccttttcc 6901 tctcccactt cacggggtga ccatcaaagg catataggtc gtatgttggg catacctatg 6961 aaaatagctg cttcttccct caggatgttt gatgtgggag ggcagagatc cgagagaaag 7021 aagtggatcc actgcttcga gggagtcacc tgcatcattt tctgtgcagc cctcagtgcc 7081 tatgatatgg tgctggtgga agatgacgaa gtggtgagtg gcctttgcat caagcagctt 7141 tggtagaaca agttctcccc atgacccttt ctctaagcct tgtgtcactc tactgcccca 7201 acttaggtaa tttcagtcta gcagccctcc agcagaccaa tcaatgtctc atgcaaataa 7261 ttctaaaaaa caacttcttc tgcaggttct agattagcat tttagagctc caaatttact 7321 gacagtgagc ttggtctcaa attagacatc taagtatcac ttggacatca caaagctcat 7381 aagaggaatt gagtgcaaag agataaggga ccatcaacta ggcaaagcaa aggagttaca 7441 cttagtactc tcccaaattg cctaaggaag gagatgaaaa tgacagaaca gagaaaataa 7501 catatgatat gaatcttcat tgcaacataa tagaagggtt gagctagtaa ccccacttag 7561 gaggctaaaa atgtactgtc cgtaggagtt taaggagaga ctggcagacc agctttctct 7621 catgccaatt aaattggcag ctggaagact acccaagagt ggttctcttt agcctgtaga 7681 attctgtagg acaggagttc tataggacaa gtgttagagc ccagccagtt tctgaatttg 7741 ggaaaggtta gaggtgagaa aaacgttaat ttcacccaag catctgcttt ctgaatttgg 7801 gaaaagttag aggcgtgaaa aacgttaatt tcacccaagc atctgctcac aaatggaggt 7861 ccacaccttg ctgatgacct ctgaatttgg gaaaggttag aggtgtgaaa aatgttaatt 7921 tcacccaagc atctgcttac aaatggaggt ccacagcttg ctgatgaaag ggatactcct 7981 atcccttgcc acagcttgtt ctcttccctt ccctttggta gttttaactt cacattagag 8041 cactctgaat atcgtctaat caaaatgtct tacagagcta ttcacttccc atctttaagc 8101 ctaaagatta cagtctatga gactttccat ctttaaccct aaagtctatg agtctatgag 8161 gtttattaaa gtctatgaga cattaataaa acaagtctat gagaccttaa agggttgtac 8221 aggagtatta tggggaaaaa gccacaaatg ggattgttct tgctttatta gataagtaga 8281 ctgaacgcaa gtcagctgat agtatacttc aaaaccctaa agacctgctc cctaaaagca 8341 
agctgggctg gggcaatggg cagcctctgc agatatgcag cccgacttct cgctaagtag 8401 caatcagaga aggaaatgag agagcagaaa tgcttgtggt atggcactgg gaaattctct 8461 aactctcacc atgtggcagc aggaccaaag tagcccaaac tgagatctgg gaccccatga 8521 aagaagccta tcaaaatcat cctggagatg catatgggca catgctaact tgggcctgtt 8581 tcaacccatt atcagcacta cttataaaat gtcaagttct cagttgcatc ctggctgcta 8641 aagatctgca taacacatta tagacctata tgccagccac tatcatggac aatatacata 8701 cacaatctca tttagctttc attgtaaccc tataagatag gaaaacagac tcagaaaaag 8761 ctcaataatt ttccacaagt cacacagcta ttagaaagat agggaactag aatgatccca 8821 catctctctg gctttactct ggtacattga gataagctgt tctctgtctt gctttttaca 8881 tttggagcac tggtctctcc aaggggaaca agagcaggaa gtaggtagat attctataag 8941 ccaaatctga tatttccaat ggtgtttcct cttactagaa tcgtatgcat gagtctttgc 9001 atctgttcaa cagcatatgt aaccacaaat tctttgcggc tacttccatt gtcctctttc 9061 tcaacaagaa ggacctcttt gaggaaaaaa tcaagaaagt ccatctcagc atttgttttc 9121 cagagtatga tggtaagtgt caggggctgg aaataataat aatgcctttt agtagagact 9181 ggcaattgtc tcatttttta ggccaagatg acacaaagga acttaaggga gaaccttggg 9241 cacagttaca gggtttaaat tcagatactc tggaatacag caggcattag atgcaggaga 9301 gccactgact tcatatgata cctactgaaa accaaaggtg gaaagacacc tctcctcaat 9361 ttcttttcaa ctaaagtgag aaacactgga gtgcaataga gaatcttccc tccaaaaata 9421 ggcccccaac tgctgttgtc taataacatt tcaaggatca agtcaatcac ctaaagtgag 9481 tcagcaacta acaagggttc atttattcta ctttttcact atttttctgg aaaaccaggt 9541 aacaactcct atgatgatgc ggggaattac ataaagagcc agttccttga cctcaatatg 9601 cgaaaagatg tcaaagaaat ctacagtcac atgacctgtg ctacagatac acagaatgtc 9661 aaatttgtgt ttgatgcagt tacagatatt atcatcaaag aaaacctcaa ggactgcggc 9721 ctcttctaat cctcaccatt cctcaggtat aagttctata aacaggcttg gaatctgggt 9781 aattaaaaac agaaaattat agtcaatata ccatgacatg aagaatgaat ccattctttg 9841 gagatggagt atacatgact gcaactgtat ttcatacgtt cttttcaaag tgggatagct 9901 attgcagctt aaagagcaca ggttccagta ctggttttcc aacttaatac aaaactgtga 9961 atacttt // LOCUS HSCYCLA 8363 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens cycA 
gene for cyclin A. ACCESSION X68303 NID g510603 KEYWORDS cyclin A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8363) AUTHORS Henglein,B. TITLE Direct Submission JOURNAL Submitted (07-SEP-1992) B. Henglein, INSERM U. 75, 156 rue de Vaugirard, F-75015 Paris, France REFERENCE 2 (bases 1 to 8363) AUTHORS Henglein,B., Chenivesse,X., Wang,J., Eick,D. and Brechot,C. TITLE Structure and cell cycle-regulated transcription of the human cyclin A gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (12), 5490-5494 (1994) MEDLINE 94261611 FEATURES Location/Qualifiers source 1..8363 /organism="Homo sapiens" /isolate="FR" /db_xref="taxon:9606" /cell_type="Placental Tissue" /clone="#492" prim_transcript 877 /note="alt. transcription start site" prim_transcript 925 /note="alt. transcription start site" exon <1180..1392 /gene="cycA" /number=1 gene 1180..7072 /gene="cycA" CDS join(1180..1392,2162..2405,3618..3730,3944..4167, 5130..5337,5797..5910,6533..6666,7024..7072) /gene="cycA" /codon_start=1 /product="cyclin A" /db_xref="PID:g510604" /db_xref="SWISS-PROT:P20248" /translation="MLGNSAPGPATREAGSALLALQQTALQEDQENINPEKAAPVQQP RTRAALAVLKSGNPRGLAQQQRPKTRRVAPLKDLPVNDEHVTVPPWKANSKQPAFTIH VDEAEKEAQKKPAESQKIEREDALAFNSAISLPGPRKPLVPLDYPMDGSFESPHTMDM SIVLEDEKPVSVNEVPDYHEDIHTYLREMEVKCKPKVGYMKKQPDITNSMRAILVDWL VEVGEEYKLQNETLHLAVNYIDRFLSSMSVLRGKLQLVGTAAMLLASKFEEIYPPEVA EFVYITDDTYTKKQVLRMEHLVLKVLTFDLAAPTVNQFLTQYFLHQQPANCKVESLAM FLGELSLIDADPYLKYLPSVIAGAAFHLALYTVTGQSWPESLIRKTGYTLESLKPCLM DLHQTYLKAPQHAQQSIREKYKNSKYHGVSLLNPPETLNL" intron 1393..2161 /gene="cycA" /number=1 exon 2162..2405 /gene="cycA" /number=2 intron 2406..3617 /gene="cycA" /number=2 exon 3618..3730 /gene="cycA" /number=3 intron 3731..3943 /gene="cycA" /number=3 exon 3944..4167 /gene="cycA" /number=4 intron 4168..5129 /gene="cycA" /number=4 exon 5130..5337 /gene="cycA" /number=5 intron 5338..5796 
/gene="cycA" /number=5 exon 5797..5910 /gene="cycA" /number=6 intron 5911..6532 /gene="cycA" /number=6 exon 6533..6666 /gene="cycA" /number=7 intron 6667..7023 /gene="cycA" /number=7 exon 7024..>7072 /gene="cycA" /number=8 prim_transcript 7341 /note="alt. transcription stop site" prim_transcript 8266 /note="alt. transcription stop site" BASE COUNT 2448 a 1521 c 1761 g 2633 t ORIGIN 1 aagctttgta tattcttata tttatatata aatataaaaa tttgttaaag gcacgtatag 61 ttaagagagt tttattttaa taaggtcata ttgttttaca tgttcaaaaa actttacttc 121 tgaaaggaac ataattatat ctaggtcact agaacgtcat tgtgtttttt gttggttgcc 181 acagcttggg gaaaaataga aaaaaattaa tgactgtatt tgaatatttt gtaatcgact 241 gctatttatt atatatatca acagtagctc aaggtgccat cttaaattaa ttgcatcttc 301 attaggaaaa ataaaaagca taaaacacaa tttctggtta ctatgaataa acgcctaaat 361 gttaagatga cattacagtc ttgacacttg agtactgtat tactatgtga gctccgtgtt 421 aaataattta tgcacattat ttaatcctaa caaccatatg actgtagtta ttagtcccta 481 ttaacacata agaaaacgga gaatcggaga tactgaaaaa cgtgccccag attttagacc 541 tttggaaaaa gtcacttaag ctaactagac gtcccagagc taaaggctgg gcaacccaaa 601 tgatagtcgc caaagtttaa ttccgtttaa ttccctaaaa ggcttagagt cagccttcgg 661 acagcctcgc tcactaggtg gctcagctta aaataatcgg aagcgtcggg ccctaaatcc 721 tacctctccc cgccccgcgc aggcgttttc tcccgcccca gccagtttgt ttctccctcc 781 tgccccgccc ctgctcagtt tcctttggtt tacccttcac tcgccctgac cctgtcgcct 841 tgaatgacgt caaggccgcg agcgctttca ttggtccatt tcaatagtcg cgggatactt 901 gaactgcaag aacagccgcg ctccagcggg ctgctcgctg catctctggg cgtctttggc 961 tcgccacgct gggcagtgcc tgcctgcgcc tttcgcaacc tcctcggccc tgcgtggtct 1021 cgagctgggt gagcgagcgg gcgggctggt aggctggcct gggctgcgac cggcggctac 1081 gactattctt tggccgggtc ggtgcgagtg gtcggctggg cagagtgcac gctgcttggc 1141 gccgcaggct gatcccgccg tccactcccg ggagcagtga tgttgggcaa ctctgcgccg 1201 gggcctgcga cccgcgaggc gggctcggcg ctgctagcat tgcagcagac ggcgctccaa 1261 gaggaccagg agaatatcaa cccggaaaag gcagcgcccg tccaacaacc gcggacccgg 1321 gccgcgctgg cggtactgaa gtccgggaac ccgcggggtc tagcgcagca gcagaggccg 
1381 aagacgagac gggtaaaggg atgcgggata tctgcaggag ggtgggtcga gcagggtttg 1441 gcattggctt agcaggcaga gaagagtggc gagaaggcat gtcctgggct tttggaaggg 1501 ggccagggtc tggagtgcag ttggtgctgg tgtgtggttt gccacagtag gagttctccc 1561 atattagcat caggaccccg ccaacactgt gtgtatgctg ccggtctgct ctctataggg 1621 ggcatctgcc tgtacataat ggggtcaaac caagctctaa aacgtgtagt cttcgctcag 1681 ctccgcgcct ttctctccac ttttaaaccc caggcactgc tgtaggactc tgacccctat 1741 cctcctcacg cttaagagat gacctctact ttagaaaagc gtgtacaaaa tactttgctt 1801 ttggcaaatt ccgccatttt agccggattt gctctgttgc tccaccctcg ggcgacagtg 1861 gtgaaccaac aatttttttg ctgtctttcc taaacttgtc acgtattggc ctgtcacccg 1921 acccttcttg tggccctgat aagttttgca taattccacc tagtgttatc tattgatagc 1981 ctttgtggga atgcctgtga caaatgggaa catcccttct cttttgaata ctgaaactct 2041 tctttgtccc agaaagttta attcctgata gagtatttgg gagaaaaagc aaaggccaac 2101 acccataaga gaaagaatgc aagactagta ggctcaaagc cagttattaa ttttttttta 2161 ggttgcaccc cttaaggatc ttcctgtaaa tgatgagcat gtcaccgttc ctccttggaa 2221 agcaaacagt aaacagcctg cgttcaccat tcatgtggat gaagcagaaa aagaagctca 2281 gaagaagcca gctgaatctc aaaaaataga gcgtgaagat gccctggctt ttaattcagc 2341 cattagttta cctggaccca gaaaaccatt ggtccctctt gattatccaa tggatggtag 2401 ttttggtaag ttttaaggaa aatctgtgtg gaattacggt aattataatt attagatgat 2461 attaatgatt tgcattttaa gatgtcttat gtagtcattt tgactatagt gagttaggtt 2521 taatgtaaca aggcttagaa aggatgtgac ttatctcaga tcttaaatgc ttattttaat 2581 ctgatcccat aaatgtgttc ttggttggga gaatagattt cagttttatt taactcactt 2641 atatatcact tagtgctatt tgaagatgca gtcttaaata ttgtgattac ataacgtctc 2701 ttccagtttt ataattttgt ttctataaaa ccagtctgga ttagcatctg ataccagaaa 2761 gtgacacttt tcctcctaca agtaaatagt tgtaccatgg ctgtttactc ttacccatgt 2821 attaaagtag cttctgtaaa cagctctttg cagtttgtac tatttctttg ccctagggat 2881 attgagtcac tggacagctc actaaatcca atctcatttt agttttagtt gtattgattg 2941 tgaggaagga ggggtgagcc ggaaaagtcc tttggaaata ataatttgtg tgtttttatc 3001 aaaattttgc agaaaatgtt taatctcaag gtgatctttg aagtgaaagg aataatggat 3061 
gatactcttt gcttcaaaat aatttatact acaaaaaata aattatttat agtgttaatc 3121 catttgccca taacaaacct aatttcttaa taacacttgt taatagaggc catttaactg 3181 atgctagcta ttgcctactt gtgctttata tatgtcagac tctgggatct gcagtttctc 3241 ttgggacaaa tcttagatat tatctcaaaa cttggatttt aaaagtgttc tgtcctaaca 3301 gaaccaccat gccatttgct cctttctgtt gtggaagatg gatactttaa gataatacct 3361 gattctcatt ttcctgttag gggcctacca atatgttttt tttttttccc tcattaacta 3421 aaatcattcc aaagttctat gtacagagat actgagacta gggtaaaata ttctacatat 3481 tttgtcatga gaagctcttt ttagtttgaa gaaataagtg ctttgttaat atgagattcc 3541 ttccatggca gtcattttta gctattaaac taaataggca aacaattaat ttaatcatat 3601 cttttgaatt gtcccagagt caccacatac tatggacatg tcaattgtat tagaagatga 3661 aaagccagtg agtgttaatg aagtaccaga ctaccatgag gatattcaca cataccttag 3721 ggaaatggag gtaaaggttc tctgaatcca gtttgtataa tgtatttgtt aaatgtggct 3781 aaatgaaatt gagaggataa tgtttaaatt tttaaactaa tgaattattc taatgctaca 3841 gttaagttta tagcatgtag cctctcgtaa ttaatggtaa aataatctaa ttgcagtttt 3901 tacactagaa taattttaac actataaata attttctcaa taggttaaat gtaaacctaa 3961 agtgggttac atgaagaaac agccagacat cactaacagt atgagagcta tcctcgtgga 4021 ctggttagtt gaagtaggag aagaatataa actacagaat gagaccctgc atttggctgt 4081 gaactacatt gataggttcc tgtcttccat gtcagtgctg agaggaaaac ttcagcttgt 4141 gggcactgct gctatgctgt tagcctcgta agtctacctt ggtttgttta aaagtgatca 4201 cgaccttttg tctagggaag aaattgattt ttttaatggt gaccttttaa catacaccag 4261 gcagagtcag tgactacttt agggatggca atgtgctggg gcttagggca gtggaggagg 4321 atgaaactgt tctaatctag aacactaatt ttcccccaac tggattcctt gaacatgata 4381 tagtacacta ggttacaaat tttaaaacat tttccttcca atattacttt ctggcatata 4441 agcagtgtct cttttattaa gaagtaaagg ctgggtgcgg tggctcgcgc ctgtaatccc 4501 agcactttgg gaggccgagg tgggcggatc acttgaggtc aggagtttga gaccagcctg 4561 gccatcatgg cgaaaccctg tctctacaaa aatacaaaaa ttagctgggc atggtgatgc 4621 atgcctgtaa tctcagctac tagggaggct gaggcaagag aatcgcttga acccaggagg 4681 tggaagttgc agcgagctaa gattgcacca ctgcactctg gcctgggcga cagagcaaga 4741 ctcttgtctc 
aaaaaaaaaa aaaaaaaaag aaagaaaaac taaatttaaa aatgaaaaca 4801 tgaaacatgt tttcctaaga cttgtatttc ctaaattttc agctacttgg tgaactctgg 4861 cattgttccc atctcgagaa gtctcatgcc tttcatttac ttcctttgac tgaactgggg 4921 aaaatatctc aaaagtcctt tattgatttg ggccttttaa ttagcaatta gctatccagt 4981 ctctactaga aatgatgatc aagtgtctgg aaatgcagtt ggtgtggatt ttggattcat 5041 tccatattct gctaatggga tttcagaact gggataagga agcttgcttc tcttttagga 5101 ctgattactt aaaacttttt tcattgtaga aagtttgaag aaatataccc cccagaagta 5161 gcagagtttg tgtacattac agatgatacc tacaccaaga aacaagttct gagaatggag 5221 catctagttt tgaaagtcct tacttttgac ttagctgctc caacagtaaa tcagtttctt 5281 acccaatact ttctgcatca gcagcctgca aactgcaaag ttgaaagttt agcaatggtg 5341 agttaacttt cattttgttg aatggtgatt ctttggtaat aattaggggt aagccctaat 5401 tcctttgtaa aggaacctag ttcctttgta cagcagtggt gaagagattt agtattcaga 5461 gggagattaa tgctgcagta tggaagaaag cctactgctg gtcaaaagat aatctatatc 5521 aggaaagcac tagaagcaga ggtctaagga tgacatcttg gggatgggta gtcagaagag 5581 ttgtttttta aaagttaata aaggagggaa aatacaggat gaggatggaa caaaacacta 5641 aaatgattga tatgtttaaa gatgaggctg tgagatattt tttctagagt tgctacagag 5701 ccgttatcaa aggcaggaaa aaattaggtt agaaggcaga ggtttgacat ttgaattcta 5761 aacgctcttg tgatcataaa gtatttttta ttccagtttt tgggagaatt aagtttgata 5821 gatgctgacc catacctcaa gtatttgcca tcagttattg ctggagctgc ctttcattta 5881 gcactctaca cagtcacggg acaaagctgg gtatgtatta cggtcttcac acacctatct 5941 tgtgactgaa atgcctgtgc cagattaaat aataattgtc ctacaaaact gagggttgcc 6001 tagatgttct tacttggaaa aacttcaaat atatatagtt gacctttgaa aaacacaggt 6061 ttgaactgtg tgagtccact tacacatgga tttttttcca atcaaatgtg gactgaaact 6121 acagtattca ggggattcac aacccatata tatggaaggg tgactgcagg acttgaatat 6181 gtacagattt gggtatattt gggaagggtc ctggaaccaa tccctcatgt acactgaggg 6241 atgattatat actgtttaag gcatgttacc tgttgaacag tgtttgttag gtagtaagct 6301 tagttgtaac ttggattgaa aggtgagctt ttttaaaagc atctgacctg ttgctggcag 6361 ggaaagtagt agaaaatgga tcgagggatg gaataatggc ctgagaaaac tagtagttgg 6421 aaaagttaaa gattagtact 
ggaatttaac ataagcttaa tgccaactat ttactgtgtt 6481 tggtgtgtta ctaaaagcaa aactcagggg ttctctattc ccttctctgt agcctgaatc 6541 attaatacga aagactggat ataccctgga aagtcttaag ccttgtctca tggaccttca 6601 ccagacctac ctcaaagcac cacagcatgc acaacagtca ataagagaaa agtacaaaaa 6661 ttcaaagtaa gaattaactt gcatattgaa gcttactttt ctagattatg gttcagagat 6721 ctttgacatt ttaaccttat tcatgaatct gccatagcct ccttgagaag ctgttctcta 6781 aattggttat gtgtttgaag aaggttgtag ttcatgcttt ctgccctggg tctgggggag 6841 aagagagaag gaagggggca tgcatgtgtg tttcagggac cagacagtgt ttctaaaacc 6901 atcaagtcgg tcagacagaa atagcatttt ttagttccca atacatgagt acaaacatta 6961 ttccatgatc tcctttaccc agtgagtgtt tactaaaaat gctttttttt tccctactta 7021 caggtatcat ggtgtttctc tcctcaaccc accagagaca ctaaatctgt aacaatgaaa 7081 gactgccttt gttttctaag atgtaaatca ctcaaagtat atggtgtaca gtttttaact 7141 taggttttaa ttttacaatc atttctgaat acagaagttg tggccaagta caaattatgg 7201 tatctattac tttttaaatg gttttaattt gtatatcttt tgtatatgta tctgtcttag 7261 atatttggct aattttaagt ggttttgtta aagtattaat gatgccagct gtcaggataa 7321 taaattgatt tggaaaactt tgcaagtcaa atttaacttc ttcaggattt tgcttagtaa 7381 agaagtttac ttggtttact atataatggg aagtgaaaag ccttcctcta aaattaaagt 7441 aggtttagga aaacagaccc tcaaattctg acattcattt tcctaagcaa ctggatcaat 7501 ttgctgactt gggcataatc taatctaagc atatctgaat acagtattca gagatagata 7561 cagtagagat tccccagact ttttcgctct ttgtaaaacc tgtttgttta ggttttgcga 7621 ggtaaactca acagaggttg ggagtggaag agggtgggaa gctttatatg caaattaaca 7681 gacgagaaat gctccagaag gtttattatt ttaaagcaca ttaaaaacaa aaaactattt 7741 ttaaaatcct gctagatttt ataatggatt tgtgaataaa aaatacccag ggttctcaga 7801 atggaataaa tatccctttt aatagttata tatacagata tacaactgtt agctttaatt 7861 ggcagctctc ttcttttttc ttcttttcac tggcttttta cttggtgctt tttcttgttt 7921 tgcactggtg gtctgtgttc tgtgaataaa gcaaagtaag aatttactaa gagtatgtta 7981 agttttggat tattgaaata agaggcattt cttagttttc cagtaggatc taaaatgtgt 8041 cagctatgag taagactggc atccaagaag tttatattat agatttaggt cctaattttt 8101 ataaatcaca aggtaaaaaa atcacagaac 
agatggatct ctaatgaaaa agggatgtct 8161 ttttgtttat agtcatgtgg caagatgaga gtaaaaccag agagcaaacc tctataagtg 8221 ttgagtatat gtatacattt gaaataaacc agaaatttgt taccttattt tctttggatt 8281 cttgtctggt tccaaaatga tcatttcttc ttcttcacta tctgagagta ttatgggagc 8341 atctttgttt ttttattaat tta // LOCUS HSCYP450 8028 bp DNA PRI 29-MAY-1992 DEFINITION Human gene for cytochrome P(1)-450. ACCESSION X02612 NID g30340 KEYWORDS cytochrome; cytochrome P; cytochrome P450. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8028) AUTHORS Jaiswal,A.K., Gonzalez,F.J. and Nebert,D.W. TITLE Human P1-450 gene sequence and correlation of mRNA with genetic differences in benzo[a]pyrene metabolism JOURNAL Nucleic Acids Res. 13 (12), 4503-4520 (1985) MEDLINE 85242117 COMMENT Data kindly reviewed (08-JUL-1986) by D. Nebert. FEATURES Location/Qualifiers source 1..8028 /organism="Homo sapiens" /strain="Breast carcinoma cell line, MCF-7" /db_xref="taxon:9606" misc_feature 1050..1146 /note="97 bp region with 88% homology to mouse P(1)-450 5' region" misc_feature 1197..1252 /note="56 bp region 76% homologue to mouse P(1)-450 5' region" misc_feature 1282..1364 /note="82 bp region 71% homologue to mouse P(1)-450 5' region" misc_feature 1544..1604 /note="61 bp region 85% homologue to mouse P(1)-450 5' region" precursor_RNA 1599..7915 /note="cytochrome P(1)-450 primary transcript" mRNA 1599..1694 /note="exon 1" intron 1695..4338 /note="intron I" mRNA 4339..5189 /note="exon 2" CDS join(4365..5189,5746..5872,5960..6049,6141..6264, 6410..6496,6689..6974) /codon_start=1 /product="cytochrome P(1)-450" /db_xref="PID:g30341" /db_xref="SWISS-PROT:P04798" /translation="MLFPISMSATEFLLASVIFCLVFWVIRASRPQVPKGLKNPPGPW GWPLIGHMLTLGKNPHLALSRMSQQYGDVLQIRIGSTPVVVLSGLDTIRQALVRQGDD FKGRPDLYTFTLISNGQSMSFSPDSGPVWAARRRLAQNGLKSFSIASDPASSTSCYLE EHVSKEAEVLISTLQELMAGPGHFNPYRYVVVSVTNVICAICFGRRYDHNHQELLSLV 
NLNNNFGEVVGSGNPADFIPILRYLPNPSLNAFKDLNEKFYSFMQKMVKEHYKTFEKG HIRDITDSLIEHCQEKQLDENANVQLSDEKIINIVLDLFGAGFDTVTTAISWSLMYLV MNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSLVPFTIPHSTTR DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFLTPDGAIDKVLSEKVIIF GMGKRKCIGETVARWEVFLFLAILLQRVEFSVPLGVKVDMTPIYGLTMKHACCEHFQM QLRS" intron 5190..5745 /note="intron II" intron 5873..5959 /note="intron III" intron 6050..6140 /note="intron IV" intron 6265..6409 /note="intron V" intron 6497..6688 /note="intron VI" mRNA 6689..7915 /note="exon 7" misc_feature 7897..7902 /note="polyA signal" polyA_site 7915 /note="polyA site" BASE COUNT 1825 a 2141 c 2126 g 1936 t ORIGIN 1 ctgtgttccc ttctctgtca atcgccagca cctccgaaca gctcctggca catagtagtt 61 gcttggtaaa tatttgtgca atgcacgagg ccgcatatga ccggaatggg aggtgagggg 121 attattttct ggcctggacc agcgacggat ggtggtgcca ccgggttggg gagcacgtcg 181 gggatggcgc gtaacgatgt tagctggggc caggttgagc taggcacgca aatacaactt 241 tttttttcct ggaaaccctg taacaggaag gttccggagg gcgggacagc gtcggaggca 301 ggcagctagg ccatgccaaa tggcactggg gcttcgtgtc gtgcacaggc gtggaccgaa 361 aatgcggaca catgcaggct gcctctcctc gcaggcagaa gccacacgca gacctagacc 421 ctttgcaccg catcccctta ttcaatcgcg cacccgccac ccttcgacag ttcctctccc 481 tccaccccaa ccccacgccg cgcgcgaggc tggcccttta agagccccgc cccgactccc 541 tcccccctcg cgtgactgcg aggcgccgcg ccgggccggg gaatgggtcg gctgggtggt 601 gcgcgggctc cggtccttct cacgcaacgc ctgggcaccg cgctccgggc caggtggggc 661 ggggacgggc cgcctgacct ctgcccccta gagggatgtc gccggcgcac gcaagctaga 721 gccgggggta gggtgggggc tccgcgccag gtgccccctc cgtggtccct gggcccgagt 781 ctttccgtgg ccccccgccg ccggatttct gtgctctgcc aatcaaagca ctagccaccc 841 cgggagccaa gagggaccct caagggccgg tgggtcctgg ctggagggac cgcgcgttgc 901 aatcagcact aaggcgatcc tagaggctgc gaggagccgc tagtgagcgc tcagcgagcc 961 tgccccttcg ccatccattc cgatccttca atcaagaggc gcgaacctca gctagtcgcc 1021 cgggctctgg gggacaggtc cagccccgcg gcgcctctgg ccttccggcc ccgtgacctc 1081 agggctgggg tcgcagcgct tctcacgcga gccgggactc agtaaccccg ggaaggaggt 1141 caccacgggg cagccccgcc 
cccgcctgcc gagtcctggt aggctgtagc gctggggagg 1201 catctgcacg cccagcgttc cagtgggtgc aaaaatgacg aagaggagtc cccgcgcccc 1261 aggatggagc ttcccgtacc ctctcttcgg gctgtcctgg gacttctccc tcaagccccc 1321 tcctcggctg ggttctgcac tgcccttggg acgccttgga attgggactt ccaggtgttc 1381 ccagccctca cccctctatg tacaggcacc gagatgtgtc ccatagtggg ttcttgccca 1441 cccgaccccc cacccccgcc gccctccgcc acctttctct ccaatcccag agagaccagc 1501 ccggttcagg ctgcttctcc ctccatctca gctcgctcca gggaaggagg cgtggccaca 1561 cgtacaagcc cgcctataaa ggtggcagtg ccttcaccct caccctgaag gtgacagttc 1621 cttggaacct tccctgatcc ttgtgatccc aggctccaag agtccaccct tcccagctca 1681 gctcagtacc tcaggtgagg ttgctggggg acttctggct tgccctttct ctcccaataa 1741 aaggaacatt ttggtgcctc caggacttct taggtagcta cctgtctagc acctccaaaa 1801 agggaggctc agagtgtttt tagtgaccag gcagtctagc ccctagtggg gaaactgagg 1861 ccaggggaag aggaggactt gcccatggtc cacagctggg acaatagagg cagatctgtg 1921 caggagtccc aggcctttcc tatctcattg accatcttct ttgtcctttg ctgggagaca 1981 atcaggttga cagattgcca actgcaggga gctggaaata ccagtcccta aaaactcacc 2041 agtcacatct cccttggcct cctaccatct tacaaaggct gcaggtcctt gggataccca 2101 ctgtgcagaa ggggacacca tagcacacca aagcctggca ctgtcccctg ttgactcagg 2161 gatctagtgt gctttgatat ttagcccctc caggaagcct ccctccacta taatacttgt 2221 ggtaggaacc atccatctcc ctgtcttgtg aggttctcct gtggggagcc taactggtaa 2281 gactgtcagg ttccccacag cagatctggg ttttctcttc cctctggatc cattcagctg 2341 actgtcttgg acttggtgcc ctgcctacca ggaaccagac tctcacacaa ggcctctgca 2401 ctctcaaatc tcgtgctatt ttcccacttg gggcctgcac gcttaaattc acattattta 2461 accaactatc tatttagcat ctgctatgtg ctagatgcta ttcgtgcaag cattgatcac 2521 aagagatcaa agtgcctgct ctgttggagc ttacatttta gttgagggat atagcctatc 2581 tgcaacataa atacgcaaaa tatacagtat gtttggagtc tctaagtgct ataaaattaa 2641 gtagagagaa ggatagaaac tatgagtcag ggtaacagtt ttaggtggga tggccaagga 2701 aggtaaggaa tacctgagac tgggtaattg ataaagaaaa gaggtgtaat tggctcatgg 2761 ttctggaggc agtaccagca tggatgcagc atctgcttcc ggtgcgggcc tcaggaacct 2821 tacaatcatc ccagaaggca aagcgggtgc 
aagagaaagc aggaacaagg gtagggggag 2881 gtggccacat acttttaaat aatcggatct cacatgaact tagggtgaga actcacttag 2941 caccaagggg atggtgctaa accatttatg aggatccagc tgggtactct gaactgagag 3001 atcttgtctt accctctctc agatgttgaa attggacccc agaaaagtaa aatgtgcagt 3061 ccaagatcac tttgactaga atgttgggtc tactgacctc tagtccaggg taacaggcag 3121 agatgcctga tatgttggag agagtggttt atgaatttaa acaccctttt taggtcaagc 3181 ttacagagaa agtattgcct cagtttcctt tcagtttaga tccattcctg atttccctga 3241 ttccagtctg gggttttctt acagcctagt gggaaccttc catttattct ctgctctctg 3301 gtaacctgca aaagggggag gtccaaactg ttcattcatt gagaattgag ccaaaaaaaa 3361 accctggcct ggatttctct gactaaagag ctcaatctag ctggtctcca gctgccttta 3421 aactgtggat cagttgaact cagagagctg gggctgaaag caatgtggtt tgggaagtgg 3481 ccatcctgtc ccctatctct tcctcagtct gtcctggggt ctggtgggat ttcctgcatc 3541 ctgggattga gcagcagaga actgagggtt ctgggaaaat tccttcattg atctgaccac 3601 tcttcaaaag gaggtacatg tgacagcagc tggaattcca ggaagtgtcc ctaaggagtc 3661 tgagtcctgg ctctgtcact gattgctggg ggactttgga caagtcagcg ctgcttcccc 3721 caaatgttaa atgaggctaa aggatcagat gatccctaat tcccctgaga acatgatttt 3781 cctaagtgtg gtgggtagga ctcggggtga tacaccttga ttatgtgttt attttaatgt 3841 gttaggggaa aaatcaaact tgtactttca catagatatg tttatgatga tatgtaggta 3901 gatttaaagt tgattttagg aaaatgtgaa gtaaataaca gtactggtac tatgtggaaa 3961 tcacagattg acattggaag atgtcagttt agattggtac agcagttaag agaaagttct 4021 ggaattaggc aacttggctt gggagtctgc atccaacact ttctggctgt gtgaccttgg 4081 gcaagttact taccttctct gagcctcatt gcaggctact tcatagattt gtggccaggg 4141 ctgaatgaag aaggactgtg aaattcttag cccacagtgg tagttcaaca acttctcccc 4201 acctcaccct cttcttcatg cccagtaagc agttctggtg gaaaggtttc ccctttccct 4261 gacactctag atattggctt ttctcatccc ccaatctgac ggcttgactt tttttcttcc 4321 tgcaccttct ctcagcagcc acctccaaga tccctacact gatcatgctt ttcccaatct 4381 ccatgtcggc cacggagttt cttctggcct ctgtcatctt ctgtctggta ttctgggtaa 4441 tcagggcctc aagacctcag gtccccaaag gcctgaagaa tccaccaggg ccatggggct 4501 ggcctctgat tgggcacatg ctgaccctgg gaaagaaccc 
gcacctggca ctgtcaagga 4561 tgagccagca gtatggggac gtgctgcaga tccgaattgg ctccacaccc gtggtggtgc 4621 tgagcggcct ggacaccatc cggcaggccc tggtgcggca gggcgatgat ttcaagggcc 4681 ggcccgacct ctacaccttc accctcatca gtaatggtca gagcatgtcc ttcagcccag 4741 actctggacc agtgtgggct gcccgccggc gcctggccca gaatggcctg aaaagtttct 4801 ccattgcctc tgacccagcc tcctcaacct cctgctacct ggaagagcat gtgagcaagg 4861 aggctgaggt cctgataagc acgttgcagg agctgatggc agggcctggg cactttaacc 4921 cctacaggta tgtggtggta tcagtgacca atgtcatctg tgccatttgc tttggccggc 4981 gctatgacca caaccaccaa gaactgctta gcctagtcaa cctgaataat aatttcgggg 5041 aggtggttgg ctctggaaac ccagctgact tcatccctat tcttcgctac ctacccaacc 5101 cttccctgaa tgccttcaag gacctgaatg agaagttcta cagcttcatg cagaagatgg 5161 tcaaggagca ctacaaaacc tttgagaagg tacagtctgg ggaaaggcag gtgggtggtg 5221 ggaagcagat ggcatcaggg cctgggaggt caaaggtaag aggaactgca tggggcttgg 5281 caagttctca ggaaggcttc aacctgggat cttgaagcca gttgtggaat agtcttcttc 5341 cttggcaggg aaatagaact ttatcttgac taacccttcc atgcttccac agacagcaat 5401 agccactgaa gagtcattac atgcaggacc ctggggcagt tgctgaggga gccaggaaga 5461 gttaaagagt tcatctcctg gcctcataat agaaagaaga taaagagcac atccagatga 5521 taacagtaca agacgctgtg tgtcaaatgt cccatgagtt ggggctagca gcattctaag 5581 gtagtaggag agagccttgc agaggcagag agaagctggg acaacagcct caggggggca 5641 tcacctccag aacaaatgtg ctaggaatag tgaaggacca gacctggatg gagaggtagc 5701 tctgggtttg agatcttgct cacctgtgga ctttccctac ctaagggcca catccgggac 5761 atcacagaca gcctgattga gcactgtcag gagaagcagc tggatgagaa cgccaatgtc 5821 cagctgtcag atgagaagat cattaacatc gtcttggacc tctttggagc tggtatggtt 5881 accccattgt gtccttcctg tgctcaagtg ccctgacctg ctgctgcctg cctaacttct 5941 tctgttctac cctgtccagg gtttgacaca gtcacaactg ctatctcctg gagcctcatg 6001 tatttggtga tgaaccccag ggtacagaga aagatccaag aggagctagg taggtagtgg 6061 ctcccttcaa aggggtcagt gccaggggtc tggccaggtc taggcagccc ctgcatccat 6121 cttgtccctg tgttctgcag acacagtgat tggcaggtca cggcggcccc ggctctctga 6181 cagatcccat ctgccctata tggaggcctt catcctggag accttccgac 
actcttcctt 6241 agtccccttc accatccccc acaggtaagg ccccatggag ccactgctgt ctgttactga 6301 tcttactccg gaccacatac ctgattaggg ttagtgggag ggacacggca tgggagacag 6361 ggagatttgc ctgttgccct gagcctgact gagcttcctt tctccctagc acaacaagag 6421 acacaagttt gaaaggcttt tacatcccca aggggcgttg tgtctttgta aaccagtggc 6481 agatcaacca tgaccagtaa gttcagagat gcagaggaaa ggctgggtcc accctcttaa 6541 gctcttatat atgattaata caatcattgc attgatcctc ctgtccatgg gctgcttgcc 6601 tgtcctctat cctttggggc tggagctcca ctcacttgac acttctgagc cctgaactgc 6661 cacttcagct gtctccctct ggttacagga agctatgggt caacccatct gagttcctac 6721 ctgaacggtt tctcacccct gatggtgcta tcgacaaggt gttaagtgag aaggtgatta 6781 tctttggcat gggcaagcgg aagtgtatcg gtgagaccgt tgcccgctgg gaggtctttc 6841 tcttcctggc tatcctgctg caacgggtgg aattcagcgt gccactgggc gtgaaggtgg 6901 acatgacccc catctatggg ctaaccatga agcatgcctg ctgtgagcac ttccaaatgc 6961 agctgcgctc ttaggtgctt gagagccctg aggcctagac tctgtctacc tggtctggtt 7021 gggcagccag accagcaggc tggcctatgt ggtctaagat tcagcctgaa actcatagac 7081 actgatctgg ctgcagtttt gctatctggg ctgtgggcaa gcctaaggga tcctgcctgc 7141 ccctaccctg gacttgcctc tgcacaccct ccagagacaa caggtaaaac agggccacat 7201 agatgctgat ggagccttcc caagttgtgc ttgagccagg aggcctgcta gggttaggag 7261 gtccttaggc ctctgagaag ctctgaagaa ctctctggaa gcccctgggc ccagtaccta 7321 gctggctctg tgagggtgct gactggcttc agcaagttag aactagccaa accaggaccc 7381 tgtccaatct ttgacaattg ggagctgcca agagtgaagg gaagagacag cccaggatac 7441 tggcacagag gtagtctcac tgcttgaact aggctgagca atctgaccct atgggtctag 7501 gacacagttc ctgggaacat cacattcctc tgcccttcct gcaggcagga acaaacaggg 7561 ctgccttctg gccttgtaag acccttattg ctgtcctgga ggggctgggg acttgtgtct 7621 gcggggatcc agagcgcaca gggagtgcac atatccaggc accaggacta gggctggagt 7681 gagggggggg tatttcaatt accttctatt ggtctccctt ctctacactc ttgtaataaa 7741 atgtctattt ttaatgtttg tacacaacaa tccttctatt ctagcctgca ttgagcttgc 7801 atgcttgcat aagagcttaa gaaccattga tttaatgtaa tagggaaaat tctaacccag 7861 gtatccaaaa atgtgtaaga acaactacct gagctaaata aagatattgt tcagaaaatc 
7921 catatggtgg agattttttg gaatcataaa tagttcatca ctcgtctaaa tactcaccct 7981 gaaccccatt ctgtgttggg tttactgtag ggaggaagaa gaggaggt // LOCUS HSCYTOK17 5260 bp DNA PRI 30-JUN-1993 DEFINITION H.sapiens gene for cytokeratin 17. ACCESSION Z19574 S51477 NID g30378 KEYWORDS cytokeratin 17. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5260) AUTHORS Troyanovsky,S.M., Leube,R.E. and Franke,W.W. TITLE Characterization of the human gene encoding cytokeratin 17 and its expression pattern JOURNAL Eur. J. Cell Biol. 59 (1), 127-137 (1992) MEDLINE 93105967 REFERENCE 2 (bases 1 to 5260) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (13-JAN-1993) Zimbelmann R., German Cancer Research Center, Heidelberg, Fed.Rep.of Germany FEATURES Location/Qualifiers source 1..5260 /organism="Homo sapiens" /strain="Cytokeratin 17" /db_xref="taxon:9606" TATA_signal 51..56 mRNA join(85..580,1626..1708,2147..2303,2904..3065,3567..3692, 3779..3999,4100..4122,4970..5219) prim_transcript 85..5219 /citation=[1] exon 85..580 /number=1 CDS join(149..580,1626..1708,2147..2303,2904..3065,3567..3692, 3779..3999,4100..4122,4970..5064) /function="intermediate filament protein" /citation=[1] /codon_start=1 /product="cytokeratin 17" /db_xref="PID:g30379" /db_xref="SWISS-PROT:Q04695" /translation="MTTSIRQFTSSSSIKGSSGLGGGSSRTSCRLSGGLGAGSCRLGS AGGLGSTLGGSSYSSCYSFGSGGGYGSSFGGVDGLLAGGEKATMQNLNDRLASYLDKV RALEEANTELEVKIRDWYQRQAPGPARDYSQYYRTIEELQNKILTATVDNANILLQID NARLAADDFRTKFETEQALRLSVEADINGLRRVLDELTLARADLEMQIENLKEELAYL KKNHEEEMNALRGQVGGEINVEMDAAPGVDLSRILNEMRDQYEKMAEKNRKDAEDWFF SKTEELNREVATNSELVQSGKSEISELRRTMQALEIELQSQLSMKASLEGNLAETENR YCVQLSQIQGLIGSVEEQLAQLRCEMEQQNQEYKILLDVKTRLEQEIATYRRLLEGED AHLTQYKKEPVTTRQVRTIVEEVQDGKVISSREQVHQTTR" intron 581..1625 /number=1 exon 1626..1708 /number=2 intron 1709..2146 /number=2 exon 2147..2303 /number=3 intron 2304..2903 /number=3 exon 
2904..3065 /number=4 intron 3066..3566 /number=4 exon 3567..3692 /number=5 intron 3693..3778 /number=5 exon 3779..3999 /number=6 intron 4000..4099 /number=6 exon 4100..4122 /number=7 intron 4123..4969 /number=7 exon 4970..5219 /number=8 terminator 5062..5064 polyA_signal 5194..5199 BASE COUNT 1086 a 1361 c 1716 g 1097 t ORIGIN 1 tggaaacaga ggagcaagcc tgttgtaatc gctacgccca cttggtggcc tataaaggaa 61 gcgggcgaac cccggcagcc ctacacaact tggggcccct ctcctctcca gcccttctcc 121 tgtgtgcctg cctcctgccg ccgccaccat gaccacctcc atccgccagt tcacctcctc 181 cagctccatc aagggctcct ccggcctggg gggcggctcg tcccgcacct cctgccggct 241 gtctggcggc ctgggtgccg gctcctgcag gctgggatct gctggcggcc tgggcagcac 301 cctcgggggt agcagctact ccagctgcta cagctttggc tctggtggtg gctatggcag 361 cagctttggg ggtgttgatg ggctgctggc tggaggtgag aaggccacca tgcagaacct 421 caatgaccgc ctggcctcct acctggacaa ggtgcgtgcc ctggaggagg ccaacactga 481 gctggaggtg aagatccgtg actggtacca gaggcaggcc ccggggcccg cccgtgacta 541 cagccagtac tacaggacaa ttgaggagct gcagaacaag gtagggcctg ctggtgggag 601 gggtctccgg ggggcatgac ttcttccccc caactcctgc cttggccaaa ggcctggagt 661 ccagccatag ggtctcaggg agccaagggt ggtttggctg tggcttagct tctgggaacc 721 tgccttgggg cccctgtgtg gctcacatcc cccttttctg gggacagcaa gctgagtcag 781 gaacaaagag gccttgtgga gccccttgga gcctcagttt ctccctcgtg gagctccgcc 841 acctggagag gttgtaggat gaggcaggag gatgcagatg gagggctggg cccgtgggct 901 gctggatgtg tggtgtcttg ctcctttgga gcaggggtca acaggagggg gttttgggtg 961 atgtggagtg ggggtgtctg agtgagctcc tgatagcacc tgctgcagga ggaggcagaa 1021 aggagggggc gggacctcag tgtggggaag gcctctgatg tgccttattt ggggattttt 1081 ctggcttctc ctttcctcct gctgtccctt aagacgggct cagcaaaccc cagggggcgg 1141 ggctgctggc tggcgcccag ggttagggat taggaggggt cctgactttt gatttggaac 1201 cactctttaa tgagggctcc tttagcctcc ttttggggga gcctgtcagg ggcaccctct 1261 agctgactgt aaaacgaggg ggttgcccac atcccctccc ttgttctaga attctgggac 1321 agcttctgcc ctggggacat tttcccattc ttttctggtt gcctcatact cccagccagc 1381 tgtctcttct cctttaaggc cgagcctgcc atgggggtct 
ggtggggtac tgagtatcgg 1441 gggaagaaga ggcacctttc agcccttcag actcctgttt gcccctcctc tgccaataat 1501 acagcacggg gcaagggagg ggctgggcga gaagagaggc cctgaggcag gaagatctgc 1561 tcagaaccct ggtgtgggct cagccacccc catccaatga cctgactact ctcccttctc 1621 ctcagatcct cacagccacc gtggacaatg ccaacatcct gctacagatt gacaatgccc 1681 gtctggctgc tgatgacttc cgcaccaagt gagtcctagc tgtggccttg ggcagcctgg 1741 gccagctggc ggaggatctc agggtacccc tcctgacccc aggattcctt ggttgcttgt 1801 ggcaaggccc aggagctcag ggtggggcag tcctaggagc cccactcctt agtccaggat 1861 gcagtgaagg cagccagttc tgaaggttgc tgagcttagg cagggaatag aagagaagga 1921 ggggaggcgg gaggcgggag gcagagagaa gtaaggaagc tggtgggcgt gggatctggc 1981 cctgtgatgg tcccggggtc ccggggctgg aattcgtttc cactcgaccc tctcatcagc 2041 ccttccaact ccttagagtc ctggcaaaat gaaggcaggt gagcagccag gacctggatc 2101 tgcaggtcca agcagcctgg gctgaagtct ctgattccca cggcaggttt gagacagagc 2161 aggccctgcg cctgagtgtg gaggccgaca tcaatggcct gcgcagggtg ctggatgagc 2221 tgaccctggc cagagccgac ctggagatgc agattgagaa cctcaaggag gagctggcct 2281 acctgaagaa gaaccacgag gaggtgaggt ggctggggca gaaggtcaaa gatgctgagg 2341 agtgggtggc agagccctgg ggctgggcaa tggctgaggc cgtgggagag agcagagcag 2401 gtgcaccggg attagtcacc ttagagggct tccctgtctg cagagccctg atccttgggg 2461 tccagtgtgc agggcagact cctctttgta ccacactgct tctctgtaca caaggaacct 2521 cccaggggcc tgcagaggct ccctctacct accctgcctc cctcatgagg gtgggggata 2581 agtaaggaaa tcttgtccca tttcaaactc tcaaagctga acatctacat agaagcttgg 2641 aaattagagg ggaacttttg ggggcatagg cctaataatt agattttatt ttggagagtc 2701 cttggtctaa tgggggagat agagtctgat ggtggaggca atactgagca gatgaataaa 2761 aatcatttag agggtcagat agagcagagg gagaacaaag gaggggtcct tgtgggaagt 2821 ggggtcccct tgtgggggaa ggcttgggag tgagaggtca ggatgggtcc agatgcgcac 2881 atccacatcc cctttttcca taggagatga acgccctgcg aggccaggtg ggtggtgaga 2941 tcaatgtgga gatggacgct gccccaggcg tggacctgag ccgcatcctc aacgagatgc 3001 gtgaccagta tgagaagatg gcagagaaga accgcaagga tgccgaggat tggttcttca 3061 gcaaggtggg gggtgctgca ggccagaggg ctctcttagg ggctggggct 
caggggcctt 3121 agcactgaca gtaagcccac ggccagatgt cttggaggtg cccctcctca gtaagctgca 3181 tggaccacag ggtcacccac tgcatcaaca gacatggagc tgagctcaag ctgggatctg 3241 gcgggtgggt ggggaggtag ggagctccct cagaataaag gcagagggta aagaccttgg 3301 gagtccccac gtctccctca agaagtcagg aactagcatc aagagccagg ctacatgttc 3361 tggcttgttc tcaagtttcc ggtctgtgcc tcccacacgc tgggattaac cataaaaagt 3421 taacatttca aatggcatgt ttctgggctt tgggatgtgg gaagctggtg agaaggcaca 3481 ctctctccac agttagattt gggaggaggc ctgacttggg agagggatcc aggctcacac 3541 cgccctgtcc tgtgtcctgt ctgcagacag aggaactgaa ccgcgaggtg gccaccaaca 3601 gtgagctggt gcagagtggc aagagtgaga tctcggagct ccggcgcacc atgcaggcct 3661 tggagataga gctgcagtcc cagctcagca tggtaggaac agtcctgtgc atgggggagg 3721 gcccagaaga gggcattgac catcctcact gacccctggt cttcctgccc tcctgcagaa 3781 agcatccctg gagggcaacc tggcggagac agagaaccgc tactgcgtgc agctgtccca 3841 gatccagggg ctgattggca gcgtggagga gcagctggcc cagcttcgct gcgagatgga 3901 gcagcagaac caggaataca aaatcctgct ggatgtgaag acgcggctgg agcaggagat 3961 tgccacctac cgccgcctgc tggagggaga ggatgcccag tgagtggggg cgcctggggt 4021 cagggctggg ggtctcttgg caggggtggg gctctcagac ccacatctaa tttcctctct 4081 gttttttttt ttctttcagc ctgactcagt acaagaaaga acgtaagtat ctgcgtggct 4141 tcggccctgg gggttgggtg catgggacag gcagcccacc tgcacgttgc tggggcaggg 4201 tccccaggag ctccaggagt tgatggctgt ccctcagcag ggagaagtga ctcattagca 4261 ctgaggattg atactcagga aaagatcaaa tgagagagac actgtctggt ctgatggggg 4321 tggggcaggg aactggtcct taccttggag atcctagtct gatggaggag acaggtccca 4381 gctctggaga ttgtcatctg atgggaagat aggaacatgg tctcatgatc tttgctcttg 4441 acagcttttg gatgagcaaa aacagtcctg tctctggggt ctgtagcctg atgggagata 4501 gggacatggt ccttgtcctc caaattccaa tctgatggag aatatatgat cctagcctta 4561 gggttcctag tctaatggag gagatagggg cctggttttt gtcttggcga tcccagtctg 4621 atgggggaga tggtagaaaa tctatgtcct agagactgca gagaaatgga gacggatttc 4681 atggagtcta catggctcct ccctggcagg acacactgga tcagaatcaa ataacccatc 4741 tggggaggca agactcacac agggccaccg gcagagggat gggatggaag ggaggtggtg 
4801 gcagggacag gagggatgtg tgtgcagtgt gatgtcgagg tgccagtgga ggcactcaca 4861 gcacctgggg gaggatgagg gagagagccg gctcctgtcc atgagggcta gggggcaagc 4921 gagggcctcc tggcccctac ccactttaaa ttgcctgctt ctcctgcagc ggtgaccacc 4981 cgtcaggtgc gtaccattgt ggaagaggtc caggatggca aggtcatctc ctcccgcgag 5041 caggtccacc agaccacccg ctgaggactc agctaccccg gccggccacc caggaggcag 5101 ggaggcagcc gccccatctg ccccacagtc tccggcctct ccagcctcag ccccctgctt 5161 cagtcccttc cccatgcttc cttgcctgat gacaataaag cttgttgact cagctatgaa 5221 atgtgtcctt gttctggccc ctgaagtggg gactagggac // LOCUS HSCYTOK20 18061 bp DNA PRI 21-SEP-1993 DEFINITION H.sapiens gene for cytokeratin 20. ACCESSION X73501 NID g402644 KEYWORDS cytokeratin; cytokeratin 20. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 18061) AUTHORS Moll,R., Zimbelmann,R., Goldschmidt,M.D., Keith,M., Laufer,J., Kasper,M., Koch,P.J. and Franke,W.W. TITLE The human gene encoding cytokeratin 20 and its expression during fetal development and in gastrointestinal carcinomas JOURNAL Differentiation 53 (2), 75-93 (1993) MEDLINE 93366034 REFERENCE 2 (bases 1 to 18061) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (14-SEP-1993) R. 
Zimbelmann, German Cancer Research Center, Division of Cell Biology, Im Neuenheimer Feld 280, 69254 Heidelberg, FRG FEATURES Location/Qualifiers source 1..18061 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Intestine" CDS join(4891..5280,7429..7511,9313..9469,9822..9983, 10145..10270,11719..11939,12667..12704,13626..13723) /function="intermediate filament" /codon_start=1 /product="cytokeratin 20" /db_xref="PID:g402645" /db_xref="SWISS-PROT:P35900" /translation="MDFSRRSFHRSLSSSLQAPVVSTVGMQRLGTTPSVYGGAGGRGI RISNSRHTVNYGSDLTGGGDLFVGNEKMAMQNLNDRLASYLEKVRTLEQSNSKLEVQI KQWYETNAPRAGRDYSAYYRQIEELRSQIKDAQLQNARCVLQIDNAKLAAEDFRLKYE TERGIRLTVEADLQGLNKVFDDLTLHKTDLEIQIEELNKDLALLKKEHQEEVDGLHKH LGNTVNVEVDAAPGLNLGVIMNEMRQKYEVMAQKNLQEAKEQFERQTAVLQQQVTVNT EELKGTEVQLTELRRTSQSLEIELQSHLSMKESLEHTLEETKARYSSQLANLQSLLSS LEAQLMQIRSNMERQNNEYHILLDIKTRLEQEIATYRRLLEGEDVKTTEYQLSTLEER DIKKTRKIKTVVQEVVDGKVVSSEVKEVEENI" exon <4891..5280 /number=1 intron 5281..7428 /number=1 exon 7429..7511 /number=2 intron 7512..9312 /number=2 exon 9313..9469 /number=3 intron 9470..9821 /number=3 exon 9822..9983 /number=4 intron 9984..10144 /number=4 exon 10145..10270 /number=5 intron 10271..11718 /number=5 exon 11719..11939 /number=6 intron 11940..12666 /number=6 exon 12667..12704 /number=7 intron 12705..13625 /number=7 exon 13626..>13723 /number=8 terminator 13721..13723 BASE COUNT 5854 a 3590 c 3325 g 5292 t ORIGIN 1 tgtaccgagc tctaatacga ctcactatag ggcgtcgact cgatcatccc aacacccagg 61 tattaagcct agcgtccatt agctattctt cctgatgctt tccccccaac cacaggtgcc 121 agtgtgtgtt gttctccgcc atgtgtccct gtgtggaaaa tataatatgc attcttaacc 181 tgtcaatatc taaagttaat ctgtatgttt gctgttttcc taaacagtcc cgcaatttag 241 aacaatttaa ctccattcat ccccttccag ccctcatcta ctttaattcc atattgtttt 301 ttcttctcaa tagatatatt attttatagc caatacttgt ctttacattt acatttggtc 361 gcatgcttag cactctattt gttcattatt ttaattcttt tatgcatctc agaactttca 421 tctggaatca tcttccttct gcttaaaaca cattctttca gattgctctt agtgaaaact 481 tagaagttga tggcaaattc agttttttaa
atgttgaaaa tggctttatt ttacccttct 541 tcaaatataa tgttaccgag tcagaaaatt ccggaatggc agttatttcc tcccagcaca 601 ttgaagattc tattccattt ccagtgtcca ttgttgcttt tgaggagtca gtcatcagtc 661 taaccaccat tcctttgttg gtgatttatt ctctatctgc ttgtaattat ctggttttct 721 gcctcaatgt gtgtagtgat ttatttttat tccttttgcc tggaattcat taggtctcct 781 taatctgaga acttgggttt tttttaaaga attctataaa attctcactt ttattttgtt 841 gaatgttttc ttattctctt gtgtcttctt ctggaactct gattagacat atttagacct 901 tctaattata gccttcatgc cttcttgact tcctccttcg tcacttagct gcattctgga 961 aattttcctc agagctacat tttgttcaat aatcctggct taatctgttc ctggatccat 1021 ccactgaact tgtaatttta acggcattat ttatttcctc tggtttgtat ttcagatctg 1081 cttggtactt gctccaattt taagtttgtt ttattttcta aaacaattta tatttatatt 1141 cttcatccaa tcattctata gccactatga gacaataaat aaatccatcc aactcatcaa 1201 aacacagctg agaattacca aataagtaca aacaaaaacc taaatatgac tgattaactg 1261 tgcagaatac aatgatgtaa gtttactata catcaacctc cttgtaccca tctcctcgct 1321 tcaacaaata actcagggca aaacttcctt catccatact tccaccctcc ttctccctgt 1381 ctgaacaaga tgccactcca gagactgcag gttctatgtc tatcatatgt gattgcaata 1441 gcttggtcat ggtagaatac aaaaaaaaag aaaaaagttg aaagccctct gaaaactgta 1501 gtgcttataa tagccttccc aactactggc catacttaat attaatcatt aacactctca 1561 gtagagtgtc tcagaagtca aaatgtatct cctactagct acaaggaagt tgggatctaa 1621 taagtattta tttacagctg cagtcaccca acagcctgaa ataccagggt ctctaaattc 1681 atgactttgt gtatgaacaa gggtcaatag gatctttact cagtcctcga gatataactc 1741 agaaaaatga attgctgtga aaaactcact tgatttcctg tggtcacatt ataagatcga 1801 ataatctaca aaagtacaaa actcagtgat atagagaagc cagcaaattc tcctgccctt 1861 atataaacag gtgggaggct ttccagtaga attttttcta ctccaacaaa ttggtagatt 1921 gaatgactat acctcataat gatgacacta ttgtatcata ttttagttta aatttaaaac 1981 ctcaaaaact taccaaaact agccaaagat ttctggaata agatatagac aaagttattt 2041 ctcaaaaaaa agaaaaaaat tggctgggcc gggtgcagtg gctcacgcct gtaatgccag 2101 cactttggga ggctgaggca ggcagatcac aaggtcaagg gatcaagatc aggctggcca 2161 acgtggtgaa accccatctc tactaaaaat acaaaaatca 
gctcggcggt ggtgcccacc 2221 tgtaatccca gctactaggg aggctgaggc aggagaattg tgtgaaccgg ggagggggag 2281 gttgcagtga gctgagatca tgccactgca ctccagcctg gtgacagagc aaagctccgt 2341 ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa cggctggaca caggggctca cacctgtaat 2401 cccatcattt tgggagggtg gggtgggagg atcacttgag ctcaggagtt tgagtccagc 2461 ctgggcaaca tagtgggacc ctgtttctac caaaaataaa taaataaaaa ttatgtaagc 2521 atggcgtgtg tgcctgtagt cccagctact tgagaggctg aagcaggagg atcacttgag 2581 cctgggaggt caaggctgca gtgagctgtg attgtgtcac tgcactccag cctgggtcac 2641 acagtgagac cctgtctcaa aataaaaata aaatggaaaa taactaaaaa tgggattttc 2701 aactctttga tataaatagg tccagccata aggaagaatt gtgctacaaa agagaacaaa 2761 aatcatcatg tttaacaact gcattcatgt tcttttctaa ttataaagct aactattgat 2821 tttataatgc ttctacacac cttgatcatg tgttagcata tgaaactctt agctgagacc 2881 aacttactat aaaacttgtg taatttaaat agtataattt tccaatgact ttaatctaac 2941 aacctcctac cccattaatc aacaccaaaa acaattcatc tttgaataca tgtataaaga 3001 acatagttca tgtgtttata aaaagttctt ctctgctaca gagtttagct gtgacatcta 3061 ttaacaaagt aatttataac tctaataagt tagcaaggtc ttcttgctac cagtccagtc 3121 attggtggtg agtactcagt attggctgat gaatacatga atggataaaa atataagagc 3181 tgcccacaca tataccataa aatgcataaa atagattcca ttttatctca atttttattt 3241 ctgtgatata actaagataa aatcaaacac aatgtattat tttaaagtat actgtttata 3301 agagagaaat aataagatct gaccttattt tagaagaaag ttgtgtgtaa aagaaattta 3361 aactcaaaag taaaaaagcc actggtttct aaaaactcaa aaaaaatttt ttttaagtga 3421 ctggctttca gaacaacagc aacaacaaat aagccactaa aataagacaa ttaaattcca 3481 attttttttg catttacttg cttgcttaat tatatacaaa tctcaattcc attaaagtat 3541 caggatgaca attaattaga tgtttttagt gacttttcca atcccttcta attaacttgg 3601 aagtgattaa ctgattatct aattatatta attagtgatt atctaattaa ctaaggaagt 3661 gatttacatg cttcatggga agtgtgaatt ctcagccagg gggtttgaat gaagggaagc 3721 aagtgactca ggcatttatt ggttgcttac aatcagattc ccaccgatga gatcatgccc 3781 atgtgtctag aggtccacac acaagcagct gcctccagcc caaccctgga agctgagtgt 3841 agtcccaagt ccccgcctgc accccttcct atataaacag ccactctcgc 
tgtgattatg 3901 ggggcataca aggaagatca ggcttccatg gaagcttctg caactgaaat ccactgggga 3961 ctttagaagg agcacctgta agtagcttca ggatggatgg ggcagagtct gtctcatggt 4021 gggtgttaag ctggcctcat acaaaagagg agacccatcc tgataccatg ccaggtacaa 4081 agcactgtgt gatttattct aagtgtcttg gtaacaggac ttgaagccag gtggtgactg 4141 gaaacctaaa atgcaatatt tagtcataga acattagaaa tagaaagaaa ataacctccc 4201 aaatcatcta atccatcact cagctaaatt catttaattc acatccccaa actacatctc 4261 caaatgtaca atggtttcat tcctggttta gggattcgag gttttgtgtt tgtacaaaat 4321 attttaatag tcttataaaa taagatgtcc aaaacacaat aaacttaaaa gccaaacaca 4381 agcaaagcat gaaacagagg ttctgaagag caactcctca atacaaatca aacattgtcc 4441 tttatctcca ttgtgacttt cagttatgta ggcacatctg cctttttaga tacctaaatt 4501 atacatgagg attgtaaata ttctttgtgt gtatgttagg atgaaaaaaa cctgccaagg 4561 caggcgcatg ggctaaaaga atacactttc ctgtttcctt ttattacctg tttaattaaa 4621 aaaaaaaaaa gttgcccctg ataaaagcct ccaagaaatg cctctagtgg tattttctgg 4681 aattgaatta agcatattaa catattgcta tacaaagcgg ggggcgtgga agtaaggcaa 4741 ggtggactgt ggtgccaatc ctagtgacat gtcagcagag gaggagtttc ttgcctgtgg 4801 acttcataaa aggctagctc aacaccctcc atgagacaca ctctgcccca accatcctga 4861 agctacaggt gctccctcct ggaatctcca atggatttca gtcgcagaag cttccacaga 4921 agcctgagct cctccttgca ggcccctgta gtcagtacag tgggcatgca gcgcctcggg 4981 acgacaccca gcgtttatgg gggtgctgga ggccggggca tccgcatctc caactccaga 5041 cacacggtga actatgggag cgatctcaca ggcggcgggg acctgtttgt tggcaatgag 5101 aaaatggcca tgcagaacct aaatgaccgt ctagcgagct acctagaaaa ggtgcggacc 5161 ctggagcagt ccaactccaa acttgaagtg caaatcaagc agtggtacga aaccaacgcc 5221 ccgagggctg gtcgcgacta cagtgcatat tacagacaaa ttgaagagct gcgaagtcag 5281 gtgagagatg atgcttgtgt ttcctactct gtgtttagct tcaagataaa tcaagaggtt 5341 atctatgtta ggtaggtcca aatggacttt gtaaagcaaa ttaggctaaa atgttacatc 5401 tataaaaatt ctttcatcta cctttagtgc tggagtacct gactgtacaa taggatgacc 5461 ttaaatcatg ctatttttaa tgtttaacac aattacatac aaaatatgaa cattttatcc 5521 tccggaataa aatgaatctt tgtcacagct tatcacatca gggctaaatg tatagtgaac 
5581 aatctttctg aaaagagcta aaaattaatt acgtgataaa tctctcttag tttttcacct 5641 agtccttatc aagttttgaa accccagtaa attcaaagac ttttctcctt ttatttaagt 5701 aaatgttata atatgactat tagaatactg tactgcagat ttaaatagca aagtgattct 5761 gggtggaaac agtgtggcag gaatatgggt cttaatgatg cctggatttt taagtgccct 5821 gtttaataac atgaccatca gcatcaagca gttcagaaac tacttctaca gtaataaaat 5881 gttctattgc ccccaattca tgtttattac aatcatatca caacctcctc ctatgtcatc 5941 aaaaaaaggg tggagctctg tattttatca gtttcatgga ataaaagcag aagactttgc 6001 tcctaattga aaatttcaga tatgacacag agcacttgct tcaaattaaa gttcttatct 6061 aattaagaaa agtgttcact gtaatttgtg ttaagattca tactccttta gagcaaagcg 6121 catttactta ccagcaagtc tgtttttctt gagttgtact aaacatcagg gcaatttgat 6181 tcggactcca tgggaagtta ctttggaatt ttagaaacta taggcatgtg aagcaatggc 6241 atttaaatag cgccttccgg aataaagttc atcctctgca agctctcact aaaaatgttc 6301 gaaccacacc tgtctcgtgt ttctgactct agtttggttc atctgaaata cacagcacag 6361 gccatccgtc ccattgagct ggacagtccc aaccatagga agcagccctc tccaaaactg 6421 ggtccattga aagagaaatg tcattgtaac atcttctgca agtagatttt tcatctatat 6481 gcctccatca tgcatttcgt tctttcatgc atcatgttct ttcattcagc caatactgaa 6541 tgaacactaa ctaggtgctg ggcaccatgc tggggactga ggaatattat ggtggacaca 6601 atatgattta tgaggataca gggatggaca gagggaccag cacacagagg gataatgaga 6661 gcacaggcag atcatccagt tcagttttga gagcatctgt gaaggctcct atgaagcaat 6721 gttgtcgaac ctgagacatg agggaggaat gggagttcac cagaccacag gggctaaaag 6781 aaatgtccta agtagaaaca tcaaaggcaa ggcactcaat aaggtgtgat gactcaagtt 6841 gagttaaatt atccttctgt aaaatcacaa gaaagtagac caaaccctgg ggcatctaga 6901 gttgcatttt atgactctgt tatgttaaat aactatgtca ttattttttc tatatatttt 6961 catgtagaaa tacttgaaat gaacatcatt tttgacataa tagtcactgt agctggggat 7021 aaactaaatc cttgtaaact cctatccagt tgaaaatgtc attcttggcc gggcacgtgg 7081 ctcacgcctg taatcccagc actttgggag gccgaggcag gcagatcacc tgaggtcaga 7141 acttcaagac cagcctggcc aacatggcaa aatgctgtct ctactaaaaa tgcaaaaatt 7201 agccgggcat ggtggcggat gcctgtaacc ccagctactc gggaggctga ggcaggataa 7261 
ttgcttgaac ctgggaggca gaggttacag tgagccgaga ttgtaccatt gcactccagt 7321 ctgggcaaca gagcaagatt ctgtctcaat aaataaataa ataaataaat aaataaataa 7381 ataaataaat aaaaaagaaa atgtcattct ttctcatgta ttttccagat taaggatgct 7441 caactgcaaa atgctcggtg tgtcctgcaa attgataatg ctaaactggc tgctgaggac 7501 ttcagactga agtaggttcc ctaatacgtg gcaaaagttt ctgaaaaaga attcctttag 7561 tagtccttcc agatactcag ctttccatat cattgttgat aaaggaagca cggttcaatg 7621 tccagaatcc tgaagcctaa aggaggttag aagctacatg tatgaagcta acccagcact 7681 cagggatggc cttctctttc ttgatccccc ggcatgtaac taaacacctc cagtaactac 7741 tattctcctt ggttattact ggttgctaaa ttttttttta gtaacctggt atctaaacct 7801 aattctgccc tttggagtaa taacaaagta atttacaatt ctccttctcc ctaagtgcct 7861 ttctgttatc taaagagagt cttgtgtcta cccttggtga ttctccaagt aattctccac 7921 gttcattcaa gtgcctgcaa gtgtatgacc caagttccag gggtgactct tgatgatttc 7981 tggcttgcca gtgttttaca ggatggtctg gccaaccaaa gaaccaggac caatctataa 8041 taattagctg gcgtgatgcc gtatgtctgt aatcccagct actccggagg ctgaggcagg 8101 agaatcgctt gaacctggga ggcggaggtt tcagtgagct gagatcgtgc cattgcactg 8161 caacctgggg ggcagactga gaccttgtct caaaaaaaaa aaaaaggaag aaagaaagaa 8221 aagaaaaaga aagtaaaaat tgtgttcact ttctttgtag tcatcacacc acatgtgacc 8281 agacctgttc cttgcaaaca gcttccacac taaggcctct tcatcctgaa tttgtacaat 8341 gcattaacac caaaaagccc tttgtggtta gaagggtagc cttttaatgc tccaagggat 8401 taacaagaag gaaataggaa atcaaatcca aagatgaaag cagtaaaggt gcattacttc 8461 caattttacc tagcactgag tgtcacattg cagtgtcatt ttttaaagtt ggatatttta 8521 ggaaactggg cagggcatgc atgatcgtaa cgcctgtaat cccagcactt tgagaggctg 8581 aggcaggagg ctcacttgag tcctggagtt tgagaacagc ttgggcaacg tggcaaaacc 8641 acatctctat acaaaaatac aaaaaaaatt agctgggcat ggtggcatac atctcgtgtc 8701 ccagctactt gggaggctga ggtgagagga ttgcttgagc cccaaaggtt gaggctgcag 8761 tgagccatga tcacaccatt gtactccagc ctgagtgaca gagtgggacc ctgtctcaaa 8821 tacacacaca cacacacaca cacacaccac acacagtgtg tatgtaagat tgtagaggag 8881 gatgtagagc tgtttgagat aattcacttt ggatgtctct gttcacaaag taataaaaat 8941 aaatcgatca 
tgtacattca ttaagtaaaa ctaaccatta tttaatatca ataataagaa 9001 ccctttgcca acacaataat taacacaatt taatttctta taagataaat tctagaattt 9061 agaagtgttc aaaattattt cagattgcct ttttaccagt caccccaaat tatagagatt 9121 atattattga gcacattttc tgactcctag gttcttatgt aaatttcatg attgtgtaaa 9181 ggcagacatt ataaagtatt gaaattgatc tcctcataag ccacatttaa aaacctatcc 9241 cattatatta gattctctcc ttataatggc ttcagaagga ccagttatct ctgtacacta 9301 attaattcac aggtatgaga ctgagagagg aatacgtcta acagtggaag ctgatctcca 9361 aggcctgaat aaggtctttg atgacctaac cctacataaa acagatttgg agattcaaat 9421 tgaagaactg aataaagacc tagctctcct caaaaaggag catcaggagg tgagaaaata 9481 ttcagaagtg gtattggaaa caatggaatg gttctatata atactaataa taggaggagc 9541 gggagaaaca ggagaagggg gaagagttgg tggtgatagt gacagagatg atgacgatga 9601 caataatgat acaagcccta ccttcttttc ataatgttgt tgtaagttaa atgacttaga 9661 gcagcacctg gaccacaata agcccaacac aagttattat tttatatctt ttttacttat 9721 tcccaatgaa agaggtcatg agactcctta tgtcttttct gcccagcctt atctgagaat 9781 gtgcttcaga ctaaaatcaa tcagaaattt cactttcata ggaagtcgat ggcctacaca 9841 agcatctggg caacactgtc aatgtggagg ttgatgctgc tccaggcctg aaccttggcg 9901 tcatcatgaa tgaaatgagg cagaagtatg aagtcatggc ccagaagaac cttcaagagg 9961 ccaaagaaca gtttgagaga caggtaacca cacaattcta aagggtgagc aaacgtgtag 10021 atgctttcct ccagaaacag ataactcatt ttctttttca tttgttcatt cttcctttct 10081 ctttctgtct tttctttctt atttccaccc ctcaactatt tttttttcac tcttggcact 10141 gtagactgca gttctgcagc aacaggtcac agtgaatact gaagaattaa aaggaactga 10201 ggttcaacta acggagctga gacgcacctc ccagagcctt gagatagaac tccagtccca 10261 tctcagcatg gtaaagcata tctaacttct ctttctcaat ctagtatgtg tttaccaagg 10321 tcctctgtta ggaactatag aaatgcaaag acctacaaga aataaccctt ccccttgcag 10381 aactggcagg gaaacaggcc gaacaactga ttataattaa aggacagaga gaaatcgagg 10441 aaagggcaat gtactgcatg aatgcagagg aaagagtaaa tctggaagat ttcacaggga 10501 aagtggcatt taaaccagat ctttttgtaa cttttaagtt caggagtaca tgtgcagatt 10561 tatgatgtag ttaaacttgt gtcatgggga tttgttgtac aggttatttc atcactcagg 10621 tattaaacct 
aatacccatt ggttattttt cctgatcctc tccctcctcc taccctccac 10681 cctctgttag gcccccatgt ctgctattcc cctgtatgta tccattatac cagatctcaa 10741 aatctaagat aaaattttaa gatggagaat ggactagaga cattccaggc caagcaaatc 10801 agcaagccca tgacaaatta tgttgtggta agcagaatcc ctcaaaccac agtggcctgc 10861 agccatgata ctacatccac cgtgacattt cctttccaag ggcaaggcta aaggggtagg 10921 ccctacctag gacatgccag tgtcctggca gaaagaaaag agcaatggtt ggtccacttg 10981 atagctctta aagcttctcc ttagaaaagg catttgtcct ttgggaggcc aaggcaggag 11041 gatcatttga ggtcaggagg ttgagaccag ctggccaata tggtgaaacc ccatctctac 11101 taaagataca aaaataagct gggcatggtg gtgtgtgcct gtaatcccag ctacttggga 11161 ggttgaggca ggagaatcgc ttgaacccag gaggcagagg ttgcagtgag ctgtgatcac 11221 attactgcac tccagcctgg gcgacagagc aagactccgt cccaaaaaaa aaaaagaaaa 11281 agaaaaggca tttgtcagtt tgacctgcat tttattaacc aaagagagtc acatgaccca 11341 gccaggtgtc aatggattaa aggaatctaa tcctgcaatt gagaaatgcc acagatcaca 11401 tggccaggtc tgatgaagga gtgagaggta taacatcttc acacagtggg gaagcaagta 11461 ctgagaacaa tgctacaatg cgccgcactg aaagaacagt aggtgcaaag ctacaagcaa 11521 agaaaaaccc caagagctaa tgatcctagc tggctggctg aggttgccta gttgggaaga 11581 gtgggagatg gagctagaaa tatacaccat gcccacagtg atgaactgca gagacccttc 11641 tcaaaggaaa ttaaaattaa cccatctctt gaatcaagtc ctcacataag aagtgcccta 11701 tttttcttgt gctcatagaa agagtctttg gagcacactc tagaggagac caaggcccgt 11761 tacagcagcc agttagccaa cctccagtcg ctgttgagct ctctggaggc ccaactgatg 11821 cagattcgga gtaacatgga acgccagaac aacgaatacc atatccttct tgacataaag 11881 actcgacttg aacaggaaat tgctacttac cgccgccttc tggaaggaga agacgtaaag 11941 taaggctctt agaatcaagg aataggtgtc aatatctgta tgcacttcta ttttaatgtc 12001 cctgtcactc attacccagc accaatgcaa tccctaggac agaagcaatt atactcacac 12061 atgcctcacc acgaacaaca aaacgaaaat ataccaaaaa atacacacac gcccaaaata 12121 tcaagtacag ccttgagatc atgtggtagg actgagttct accacatgat catttggaga 12181 taattctcca aatgatatag attatgctga tattaatttt catcattaat atataattga 12241 aggcataata accttttgga aattctaatt gagagctcat gaattagaca ataagctgct 
12301 gaggtccacg ggagccagtc tgagaatcat aagtgtggca gcagcaactg tcttcagata 12361 ccatacctaa gaatatcaac aagagaacac aagatttaaa cttccattgt aattttgtta 12421 ttttaattag aggaacttct tagcatatat taaactggtc agttttaaat ctatgtattt 12481 ttcaagttaa aaagtaaaat gctcaagttt gcaataaaag caatgtaaaa gggaaagata 12541 tatgaaagca ataacacatt cacttggact ttgaaatata aaagacagga gtgcattcca 12601 ttttcaaaac agcaataatg cttttttctg tttctttttt cttcttttca ttaaaaaaaa 12661 aaacagaact acagaatatc agttaagcac cctggaagag agaggtaagt tctaaatttt 12721 tgcacatttt ctatgacatt cagctgcttt ctattaacta catgccactg ataaaaagta 12781 aagtgaggct gttttagtct gttttctgct gctgtgacag aatacctgag actggataat 12841 ttacaaacaa tagacgttta cttggctcac agttctggag gccagaaagt ccaatatcaa 12901 ggtgccggca tcttgtaagg gccttcttgc tgtgtcatcc catggcagaa agtagaagtg 12961 caaaagagtg tccatgagaa ccagagagca agaggaggct aatctttctt ttagaactag 13021 cccactctca caataactaa cccattatca tggtaaccac attaatccat ccatgagaga 13081 agagccctca tcatctaatc accttttact aggccgcacc tcccaacact gttacattgg 13141 ggattaagtt ttccacacat gaactccagg gaacacattc aaaccatagc aaaggccaat 13201 tctcaaggga gcagatatga caccagggat tctaaaaatc ttgtacatgg cataaagaaa 13261 ccctccacat gggaacttgg gcacactcct cagaatgggg tgattttctt ggcctgtgct 13321 attgctatct tgaaacatcc tgacatcctg aaaaggaaaa acaagacaaa atttgaattc 13381 cacttaaatt caagataatt tatttcttga atttttaaga gtaggacatt tgtaaatttg 13441 gggagaagca cattttctgc atctatttat aaaatggact taagattttt tgagaaagag 13501 gatgggttta acatattttt ctaaaagaag gaaataagta aaaagaataa taatttaaag 13561 ttgaataata taaatcagca gagtcatgtg cctgaattaa catttaatgt ttaatatgaa 13621 ttcagatata aagaaaacca ggaagattaa gacagtcgtg caagaagtag tggatggcaa 13681 ggtcgtgtca tctgaagtca aagaggtgga agaaaatatc taaatagcta ccagaaggag 13741 atgctgctga ggttttgaaa gaaatttggc tataatctta tctttgctcc ctgcaagaaa 13801 tcagccataa gaaagcacta ttaatactct gcagtgatta gaaggggtgg ggtggcggga 13861 atcctattta tcagactctg taattgaata taaatgtttt actcagagga gctgcaaatt 13921 gcctgcaaaa atgaaatcca gtgagcacta gaatatttaa 
aacatcatta ctgccatctt 13981 tatcatgaag cacatcaatt acaagctgta gaccacctaa tatcaatttg taggtaatgt 14041 tcctgaaaat tgcaatacat ttcaattata ctaaacctca caaagtagag gaatccatgt 14101 aaattgcaaa taaaccactt tctaattttt tcctgtttct gaattgtaaa accccctttg 14161 ggagtccctg gtttcttatt gagccaattt ctgggttaat cttattgatt tttcagcatc 14221 agtacaactc tacaaccttt gagctatatc tgctttttcc cattgcttcc actgcctttt 14281 aaaactcaac acagcttttt gaataatttg agagtcaaat tcaatcacaa atgctgagac 14341 gaataagagt gaagtacact atacttaaaa tggaaataga ttaaaaacaa cattactgaa 14401 acccttcgca aggcaaaatg tgtctccttt tgataataag ctgcatatac tatcaggtcc 14461 tctctttctt tatatggtga acatatattt ttaatgaaat gtctctcatt tttttaataa 14521 cagatttatt gagatataat tcacacacca tgaaattcac ccttacaaaa cgtacaattc 14581 agtggtcttc agtatgctta caatgttttg caaccatcac cactatctag ttttagaaca 14641 cttcatcacc ccaaaaggaa atcttgtacc tattagtagt caccgccttt tcccttcctc 14701 ccagccccta acaaccacta atctacttcc tgtctctacg gatttgccta ctctggacat 14761 ttcatataaa taggttaata cgatgtgtcc ttttatacac aaatgttcat agcagcatta 14821 ctcataaaag ccccaaagcg aaacacctca agtgtccatc aaccgatgaa tggataaaca 14881 aaatataata tatccacaca atagaatctt attcgtcaat aaaaaggaat gaagtactga 14941 tacatgctat aacagagatg aacttcgaaa acatgctaag tgaaagaagc caaatccaaa 15001 aacaataaaa acacatattg tatcctcacc ctttttgcat tttagtgagc aatcattgca 15061 tatgaatgtt tatgggaaaa atcaatgtgt gctaaatcat tgtattccag taaatagatt 15121 ggacttaaaa cttgatacag aagttgcaaa taagtgggat tgagtttgat tattatatag 15181 aaaataatta catgattcat ttaagaataa taatatccac catttattga gcacttacta 15241 tgagcctgtg tgccaaacat ttcatgcatt tctcatttaa ttctcacaat aatcctgtga 15301 ggtagaagct attaggttga atcatatgaa cttgccaata tatgataatt tctaagagtt 15361 gggaattttt gaggatgtga atggtaccac tttgaattcc taagatgtaa tataatatct 15421 aacacatagc aggcacttga ttcattattt taaattgaaa gaataaagtt ttttaagctt 15481 tccaatatat gataatttct gactttcaga aatagcaatt ttatatgcta ttatatagca 15541 tatataataa ggttcagcct tattatgtta cccccacttt acatatgagg ataaatgagg 15601 actcatatga agacatgaga 
taaagacttt cccaaagtca agcagttacc aagtagtaga 15661 gcgagactga acctcagcgc tgtttctcta aaaccaggac accctcataa gcaactaatt 15721 acataacaaa gcaatacatg attcacagtt gaaataggca cttgctatcc gcagttattt 15781 tgttgttttc taattgtcat tttcatcagc cagacacaac agccaattgt ggcaaatgtc 15841 cagctttggc tcgtaacatc acacatgact tgattcagta caacttttgt cagaaaaggt 15901 attctcacct attctcattg ccttcttttc caaagtgaag agatttcact catttttttc 15961 ttaattttct tccaagtcac gctagctagt aagttgcatt taaagatgtt aagaatattt 16021 aaaagtgaat tctttttcac ctactgagtc acattccaga atagtctgaa actttgacat 16081 gcaaatacca gactgtgaat ctgattaata agaaacctat gcagatgggt ttgtaactga 16141 ttaggcctga ccactgttat atggcaatga tgacactgtg ttaaaagagg taggattgtt 16201 attcttaatt taggatggtc ctcttttaac tacctgttaa aagaagtaga attgttattt 16261 ttagatcaga aaaatgaaga ttttttcttc ctcttgcttc tttggtctgt ctctagtttc 16321 ttccaagcaa tctttcaaag aagtgagtga gagctacata catgtggacc agatggtgga 16381 gttctacctc cagttctgcc agcagttact gtgtgaaagt gaagcattaa tctttgtatc 16441 tatctatatt tctgatatga gctaagaact agaatgacat aatcctattt ctgacaggta 16501 aacagatcca gagagaagtc tttatcacac attagaatta tcatcaatgt attttttcat 16561 tttatatgaa attgtcagtg cagaagtgaa gctatcaaac tcagagagag cccctagagg 16621 tgaggttagg atggaaataa ctattttttc tgagctctgg agattatttc tacccagagt 16681 tctgaatcat ctaaaaagga gaatgacatg gagtgataca aaatcaaaac atggccagtg 16741 acctcctcag acactctgtt ctcatccaca tgcatcagga tcagcctcag gtacctgatt 16801 agacccccaa gtaacaattc caacttaaag cattagtagt gatttttatt tttgaattct 16861 ctttcaaaca tctctgtttt ctctccccta ggctcatttc caagattcct ctcatcaggg 16921 tattatttta tcatcttcct gtctcctacc actttcaaaa ttctcccctg ctttcaagaa 16981 ccattgatct ctggagccca gaagtcaatc tcactcatac cccttaggtc cttgtttaac 17041 cctcatctca aaagcacaat acataaaatg taggcaaccc aaaattctct cagtcactca 17101 caggatttta taatgttatc caaagtgaaa gatgttacag atcaagggat cagcgagttg 17161 cagaaaatga caggtctgtc ttacgggaaa aaactcaggt agaaaaccct gcaaagacca 17221 ggacagaaac caacctgaca tgaaaagatg agggtatgat ttactcaata aaaatctaaa 17281 
gacttgtaaa aaccacataa ctaattttct ctattttgaa catttcctaa actctacaaa 17341 aatggaagat ataattcttt tgaacagttt gccctggcta taatcatttt cctctctgcc 17401 tcaacttatg caatcactct gccaacaagc ccaagattgt ttattgtttt ttacatcacc 17461 actgccttat gcattaatgt ggctgtcaaa gaaaattatt ttcacttcta aaacctctaa 17521 agcagagtct tgattattta acttagcctc ccacttaaaa aaaaaaaagc cactgaagaa 17581 gaaatatttc ccttttcaac ttctgaagtt gctccccatt cataactgaa ttcagaataa 17641 atattggtca aagcatacca gtaaattagg gcacagtcat tttcaaatga aatgaatttc 17701 aaatgacaca ttcaaacaca ttaagactta acttctttca aatgaaatca ttcaaggagt 17761 gcagtgataa agttcagcga aacactaggc taggactcag gataaaaaat aaattagatg 17821 tagtacctac ctaacaagct tagtctagga taatatgtta tatgtataga aacataaaca 17881 aataatatat aaacataatt tttagagatt gagcaatatt atttaaattt taccataggc 17941 attagagttg gaaagtgtac ctttcagcac aaaatcaatt ccaagttcaa aaattcaact 18001 taatcatatt cccgtattat gtgatcgagt cgactccctt tagtgagggt taattgagct 18061 c // LOCUS HSDAO 9903 bp DNA PRI 19-JUL-1994 DEFINITION H.sapiens diamine oxidase gene. ACCESSION X78212 NID g463242 KEYWORDS amiloride binding protein; diamine oxidase; histaminase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9903) AUTHORS Chassande,O., Renard,S., Barbry,P. and Lazdunski,M. TITLE The human gene for diamine oxidase, an amiloride binding protein. Molecular cloning, sequencing, and characterization of the promoter JOURNAL J. Biol. Chem. 269 (20), 14484-14489 (1994) MEDLINE 94237856 REFERENCE 2 (bases 1 to 9903) AUTHORS Barbry,P. TITLE Direct Submission JOURNAL Submitted (16-MAR-1994) P. 
Barbry, IPMC/CNRS, 660 route des Lucioles, ZA de Sophia Antipolis, 06560 Valbonne, FRANCE FEATURES Location/Qualifiers source 1..9903 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /chromosome="7" /map="q35" exon 901..955 /number=1 mRNA join(901..955,4840..6425,7145..7430,8880..9012,9322..9668) /product="amiloride binding protein" intron 956..4839 /number=1 exon 4840..6425 /number=2 CDS join(4856..6425,7145..7430,8880..9012,9322..9588) /codon_start=1 /product="amiloride binding protein" /db_xref="PID:g463243" /db_xref="SWISS-PROT:P19801" /translation="MPALGWAVAAILMLQTAMAEPSPGTLPRKAGVFSDLSNQELKAV HSFLWSKKELRLQPSSTTTMAKNTVFLIEMLLPKKYHVLRFLDKGERHPVREARAVIF FGDQEHPNVTEFAVGPLPGPCYMRALSPRPGYQSSWASRPISTAEYALLYHTLQEATK PLHQFFLNTTGFSFQDCHDRCLAFTDVAPRGVASGQRRSWLIIQRYVEGYFLHPTGLE LLVDHGSTDAGHWAVEQVWYNGKFYGSPEELARKYADGEVDVVVLEDPLPGGKGHDST EEPPLFSSHKPRGDFPSPIHVSGPRLVQPHGPRFRLEGNAVLYGGWSFAFRLRSSSGL QVLNVHFGGERIAYEVSVQEAVALYGGHTPAGMQTKYLDVGWGLGSVTHELAPGIDCP ETATFLDTFHYYDADDPVHYPRALCLFEMPTGVPLRRHFNSNFKGGFNFYAGLKGQVL VLRTTSTVYNYDYIWDFIFYPNGVMEAKMHATGYVHATFYTPEGLRHGTRLHTHLIGN IHTHLVHYRVDLDVAGTKNSFQTLQMKLENITNPWSPRHRVVQPTLEQTQYSWERQAA FRFKRKLPKYLLFTSPQENPWGHKRSYRLQIHSMADQVLPPGWQEEQAITWARYPLAV TKYRESELCSSSIYHQNDPWHPPVVFEQFLHNNENIENEDLVAWVTVGFLHIPHSEDI PNTATPGNSVGFLLRPFNFFPEDPSLASRDTVIVWPRDNGPNYVQRWIPEDRDCSMPP PFSYNGTYRPV" intron 6426..7144 /number=2 exon 7145..7430 /number=3 intron 7431..8879 /number=3 exon 8880..9012 /number=4 intron 9013..9321 /number=4 exon 9322..9668 /number=5 BASE COUNT 2526 a 2646 c 2664 g 2067 t ORIGIN 1 aagcttcaag ttgactaagt gtgttcacct tccttcatct agctgccagt gcacactgaa 61 atggcttgca ttgcgggcag caggaaggat gatgatgaga aatacaaaac ccctataacc 121 acaaagcggt gtgcaatccc tgagcaacca aaccttggta atgggggatg atgacccctt 181 ggcactcctg agccacgaac ttttcctgac tctattcttt cctttccaac taggagttac 241 cccggtcacc acgttcgccc ttctcaacct gactaagtgg gcgttagcac tgtccgccca 301 ctgcatgggc actacctgcc ccagtgtcct ctcccctctc cgctctgtgc catagcctta 361 gctaccctgg ccagctttct 
cccagccaag tccttcctcc tttctgaatc tggtcagggt 421 ctcaaagatc gttaaaaagc cgccaaagtt tgcaactcgt cttgagtggc tgctcctccc 481 aaacggttct gcctctggtg tcttgttcta ggagcggttg aattctctgt tttgctgtgg 541 gtgccagcct ctatcagccc ctgccctggt ccatctgtca gcttcaggta agactaccgg 601 ctaggaaatg gccccttgtc cttatctctg cctctaaaat atgccaccta aattaacaga 661 tcgtggcatt gcatggctcg tcacagagaa gccaaggcaa attcttgtgg aaggcggcac 721 ttgtctggaa acagactcag gacaagtgtg ctgcacagcc agcttcctca aacccactct 781 cgccactgta gctgcccaca gaggcacctc ccaagccata ggcccctcct aggccagctc 841 aggcctcacc ttgtccctcc ctgcagggcg gggtctgatt gacttcaaga aggagtcaga 901 gatcagctta agggcaaagg ctggaagcag agcgaactgg gagcagagca cacaggtgcg 961 ttctgatggg tattcgggag agtcaagaag gggctgacct ggaggacaga aaagcagtgg 1021 aaggagggca cagagtgagg ggcagaagaa acaggacaca gagggaggag ggccaggaag 1081 tgagggcggg agcccagcac atcacagaag gtgcccgggt gcttggtggc aggcaacagc 1141 cggccagggt ggagtgggga aagctcagag aggagggtga gcagggagag ggcttgcttc 1201 ctgttcctgc ctggaggtct caccagccac cacccccact gccactgcac atggggacag 1261 aggtttgtcc tggggaaagc aaaggtctct tccacatctt tctccttcct ggaaaagcaa 1321 aaagaaacgt gggtataaga ggttgtaaaa aaggagaccc agccgcgaaa ggacatggag 1381 gtaccctaaa tgcacactgc caagcaaaag aagccaatct gaaaaggctg cagactgtaa 1441 gattccaaat acaggacgtt ctggaaaagg caaaactatg gagacaataa aaagagcagt 1501 gattgctagg gtttcggagg aagaggtgga gcacagaaga ttttggcgga ggtggaacta 1561 ttctgtgtga tattctaatg gtggatccat gtcattgtcc gtttgtccaa cccatagaat 1621 atgcagcagc aagagtgagt cctgatgtaa actgtgggct ctgggtgaga gtgatgtgca 1681 ggttcatcag ccacaaatgt atggctctgg tgtggatatt gatagtgggg aggctgcgtg 1741 tgtaggggct gtgagtatac agaaactctc agtacctttc gctcaatttt gctatgaacc 1801 taaaattcct ctttaaaaga ctcctattta aaagacttta atagacttta taaagtttgt 1861 taaaagactt ttaatagact ttttaaagac ttttaacaga ctttataaag tctattaaga 1921 aatctattaa agtctatttt taaaagtcta ttaaaaaaaa agtctactat aatcccagca 1981 ctttgggagg ccaacgcagg aggatcactt gagcccagga gttcaagacc agcctgggca 2041 acatagtgag atcctgacta tatatatata tatatattta 
tttatttatt tatttattta 2101 tttaagtcta ataagaagtc tatttggggc caggcctggt ggctcatgcc tgtaatccca 2161 gcactttggg aggctgaggc aggtggacca cttgaggtca ggaattcaag accagcctga 2221 ccaacacggt gaaacaccat ctctactaaa aacaaaaaat tagccgggca tggtggtgca 2281 cacctgtaat cccaactatt ccagaggctg aggcaggaga attgcttgaa cctgggaggc 2341 agaggtagca gtgagccaag atcgcatcac tgcactccag cctgggctac agagcaagac 2401 tgtctcagga aaaagaaaaa atagtctatt taaagaagaa aaaaaagagg cctggccaat 2461 atttaatagg agaacacaag aactcacctc atatccttcc tgctggctct gggttccgcc 2521 tgcagcccct gcctcacctg ctccctccag gtaagcaggg tctccgagag cccaagatca 2581 tgtgggttcc agggtcttct gccagcacca gagggcaaga caaggcaact agagggacat 2641 ccccatgaga ggccatgacc atggagcaaa acgagggcaa ggggagagcc ctcggggaga 2701 ggggcatggg aaagggagag aaggacatct cagatcagcc tttaccgtgg cctccagggc 2761 tagagagccc tcgggtgatg gaacaggccc agccaacacc tttcatttgt ctatgaatca 2821 agaactgccc atcgtgtaca aattaaatac aaagtgctaa ggggactagt ggagaagtat 2881 gttagttggg gccaaaaaca gcattcctcg gcagtgcggg gtttcagctg acctcaaaag 2941 gtggcaaagg atgtgacact ggcccagagt cgggtgggaa gggacaatgt gcagctgagc 3001 ttctctggag gaaaaaagcc acgggaaaac agggcctcta gttaacacat gggttgtgtt 3061 cccaacgctt gtgcttcgga aatgtgatgt gtaaatgcat tttccaaaaa caaaagcaag 3121 aaatgaaaag cgctcatgat ggtcaggttt cctcagtggg tcactccacg atacacccaa 3181 agtggagctg ctgcacctca gttacacctc acaccctgtg ccgccgactt tccaggtgaa 3241 atcctcgggg cctcccttgc aggcagcctg ggccccctgg gaagaggctg ggccaggcag 3301 aggtctgaga caggagtgga ttaaccctct ctgctacctg tctgtctgca caggagaacg 3361 agttttctcc aggtttaccc ccagggcttc tgatttaggg agcaaaaggc aactacctat 3421 gaaaatatga aaacaaggat ctctcagagt ccaacccact catgcatcca ccttataaat 3481 caagtgtctg tgcctcaggc actgcttgca ggaagtgaaa agcacagctg agacagggag 3541 gaatcctgaa gcggggctcc cagctcaacc cagagccttg aggaaggggg tgcaaggtgg 3601 caaaggcagt gcttctgtag ggaaatttcg cggtggcggc agccttccag ggaggcccac 3661 atcacctagg aggacaagtt agggtctgga gaaactaagg ctgggagaca gctccagcca 3721 gcatctgtag caacagcgca acaggcaaga gtaacagtct cccagagact 
ccgcacacag 3781 cacatgggaa gacagagctg tggccaccac gtctcccagc cctgcaggag gctcacagct 3841 acaagcacag gattttacag ctgcttgggg aatggggtta tgaggtgaag accctgaccc 3901 accctccctg agaaaggaaa aggcatcttc ctaagatgaa accctccctg aggtcccctc 3961 ctagtagcct tgggtggaca agagaggtga atggggcaca aagaaggcta ggcaggactg 4021 ggcacctcta cctcaccccc aaagccatca gtctgcagcc tagtcacacc tgaatcactc 4081 tacattgcgt aaaactttaa atgggctggt gtcgatagcc tgtatttaac aagaagagca 4141 gctaacattt ctcatgtatt tcccatatgg cagacattgt tacacacact ttatatttac 4201 tgtttaattg atttctctcc acaaccctag gagctaagca ctgttgtcat tattttcatt 4261 ttaaaactaa gaacatgaag gcccagagag tttaagcagc gtgcccaagg ccacacagca 4321 agtcctagtg gcatgagtca ctgggacttg actcagatag tttgactcta gagcctacct 4381 ttgaatttct acactaaatt tcaaaatctg tctacctaaa aggtatttcg ttgtgtcaga 4441 aatgtagttt tgctgttaat acaggtgtct tcttctccag gacacaagaa gcacctgggg 4501 gcatgtgcag gggcccagcc tggcctctgt gactcacgca cccctggctg agataataca 4561 caaaaacaag ggtgttcccc tgagtaagga aggctgcagg actgaaggac atttacagag 4621 gagtgaggag agggcagaaa taaatggatg gatgcacagg cagcggcccg acggcgctgg 4681 ggactatgca tgggccccag ctgccccagg acagcctcca gcttggggca gggcaagggg 4741 aggaagctca gtccatggga aaattccatg gccctaacct gagggaagcc catctctgcc 4801 cataagacaa ctaagttcat ctcctctatt gcattccaga gccgtggagc gagagatgcc 4861 ggccctgggc tgggccgtgg ctgccatcct gatgctgcag acggccatgg cggagccctc 4921 cccggggact ctgcccagga aggcaggggt gttttcagac ctaagcaacc aagagctgaa 4981 ggcagtgcac agcttcctct ggtccaagaa ggagctgagg ctgcagccct ccagtaccac 5041 caccatggcc aagaacaccg tgtttctcat cgagatgctg ctgcccaaga agtaccatgt 5101 gctgaggttt ctggataaag gtgaaaggca tcctgtgcgg gaagcccgtg ccgtcatctt 5161 ctttggtgac caggagcatc ccaatgtcac cgagtttgct gtggggcccc tgccagggcc 5221 ctgctacatg cgagcactgt cccccaggcc tgggtaccag tcctcctggg catcgaggcc 5281 catctccaca gcagagtatg ccctcctcta ccacaccctg caggaagcca ccaagcccct 5341 gcatcagttc ttcctcaata ccacaggctt ctcattccaa gactgccatg acagatgcct 5401 ggccttcacc gatgtggccc cccggggtgt ggcttctggc cagcgccgca gttggcttat 
5461 catacagcgc tatgtagaag gctactttct gcaccccact gggctggagc tcctcgtgga 5521 tcatgggagc acagatgctg ggcactgggc cgtggagcag gtgtggtaca acgggaagtt 5581 ctatgggagc ccagaggaac tggctcggaa gtatgcagat ggagaggtgg acgtggtggt 5641 cctggaggac ccgctgcctg ggggcaaggg gcatgacagc acagaggagc cgcccctctt 5701 ctcctcccac aagccccgcg gggacttccc cagccccatc catgtgagcg gcccccgctt 5761 ggtccagccc cacggccctc gcttcaggct ggagggcaac gctgtgctct acggcggctg 5821 gagctttgcc ttccggctgc gctcctcctc cgggctgcag gtcctgaacg tgcacttcgg 5881 cggagagcgc attgcctatg aggtcagcgt gcaagaggca gtggcgctgt atggaggaca 5941 cacacctgca ggcatgcaga ccaagtacct cgatgtcggc tggggcctgg gcagcgtcac 6001 tcatgagtta gcccccggca tcgactgccc ggagaccgcc accttcctgg acactttcca 6061 ctactatgat gccgatgacc cggtccatta tccccgagcc ctctgcctct ttgaaatgcc 6121 cacaggggtg ccccttcggc ggcactttaa ttccaacttt aaaggtggct tcaacttcta 6181 tgcggggctg aagggccagg tgctggtgct gcggacaact tcaactgtct acaattatga 6241 ttacatttgg gactttatct tctaccccaa cggggtgatg gaggccaaga tgcatgccac 6301 tggctacgtc cacgccacct tctacacccc cgaggggctg cgccacggca ctcgcctgca 6361 cacccacctg attggcaaca tacacactca cttggtgcac taccgcgtag acctggatgt 6421 ggcaggtagg actcaaagcg agactctccc gttcaaacat ctgcatccag ccaataactt 6481 aaactcccag gagacggcac tataactccc tggtggtgga aagttaggag catttgccaa 6541 gcctgcttct gtaaaaccta aagctagaaa tttgtgtgtc cctgaaaaat ggtgaaagcg 6601 aacagtgccg agggttgccc tggatgcaac agtgtggacc ctgtcctgac cctggggctg 6661 tgcatggggc aagggagggg acggtgccta tgagggtaag aaatgtcact gtctagcgct 6721 ttgagatgga gtgtagtatt gtctgggaat attcagggtg tctcctggat gatatcacca 6781 taaaagaatg gcatgggtag aataactaat gctgccagac ccaacaaggc caccaggacc 6841 acacgggcgg ctcctacccc tcagcccagt cacctctctg gacagagcct cccatgcctg 6901 gagaaaaatg gtgccctgct ctaggattgc agccacagca gatgggcatg agcacagctc 6961 ttgagaggaa cttcccagag tggtaatgga atcaccacag ccgcccaggg catcccccaa 7021 catcccggtt attcaagaca gttggctgtt ggatgtggtg gaggatggag atcctggggt 7081 agaatgagga ggtttcagtt ctgacagctt gattctcgtt ctccttctcc ctgcatacct 7141 
ccaggcacca agaacagctt ccagacactg cagatgaagc tagaaaacat caccaacccc 7201 tggagcccaa gacaccgcgt ggtccagcca actctggagc agacgcagta ctcctgggag 7261 cgccaggcgg ccttccgctt caaaaggaag ctgcccaagt acctgctctt taccagcccc 7321 caggagaacc cctggggcca caagcgcagc taccgcctgc agatccactc catggccgac 7381 caggtgctgc ccccaggctg gcaggaggag caggccatca cctgggcaag gtgaggaaga 7441 cccagggggc ctgggggagg gtcagtggct tccctcagtg tgtgtgtctg tgtctgtctg 7501 tgtttgtgtc tatggggttc ctagtctatg tggatataca catatacgta tgtgtatatc 7561 tgtgtgtaca attgaccctg aacaatatgg ggggtagggg tgccaacccc ccaaagcagt 7621 aaagaagcca agtgtaacta tgactccccc acaacgtaac tacgaatagc ctactgttaa 7681 ccggaagcct taccactaac agaaacattc cgtgaacaca tattttgtag gttctgtgtg 7741 tcagatactg tgttcttaca ataaagtaag ctagagaaaa gaaaatgtta ttaataaaat 7801 cctggccggg cacggtggct cacgcctgta atcccagcac tttgggaagc tgaggcagac 7861 agatcacgag gtcaggagat caagaccatc ctggctaaca cggtgaaacc tcatctctac 7921 taaaaacacc aaaaaattag ctgggcatgg tggcaggcac ctgtagtccc agctacttgg 7981 gaggctgagg caggagaatg gcgtgaaccc gggaggcgga gcttgcagtg agccgagatc 8041 acaccactgc actccagtct gggcaacaga gcaagactag ttctcaaaaa aaaaaagaaa 8101 aaatcctaag gaaaagaaaa tatatctaca atccattaag tgggagtggc tcatcccaaa 8161 ggtctttatc ctcgtctcct tcactttaag tgggctgagg aagaggagga ggaggggctg 8221 gtcttgctgt ctcacggtgg cagaggcgga agaaactcca aatgaaagtg gacttgttgc 8281 tgggcgcagg agctcacgcc tgtaatccca gcactttggg aggccaaggt gggcagatca 8341 cctgaggtca ggagttcgag accagcctgg ccaacacagc gaaaccctgt ctctactaaa 8401 gatacaaaag ttagccgggt gtggccgtgg gcgcctgtaa tcccagctac tcgggaggct 8461 gaggcaggag aatcacttga acctgggagg cagaggttgc agtgagccaa gatcgcatca 8521 ctgcactcca gcctgggtga cagagcaaaa ctccatcaaa aaaaaagaaa agaaaagaaa 8581 gaaggaagga aagagagaga aagaggaccc atgcagctca cacctgtgtt gtgcaagggt 8641 tgcctgctgt tctgggcctg tgtctctgtg catttgggga agggggcggg gaggactcct 8701 gtgggtcacc tgaacccggt taacagcagc ccgtcctgct gactcccagg acagatgatc 8761 tgtcatcttg gggaaggcaa tcatcttcct ctattctgac cctgaggctg ttccctgggc 8821 aggactgagg 
gggcccagcc cagggccctg agccaagctg cttcgcctgt gcctggcagg 8881 taccccctgg cagtgaccaa gtaccgggag tcggagctgt gcagcagcag catctaccac 8941 cagaacgacc cctggcaccc gcccgtggtc tttgagcagt ttcttcacaa caacgagaac 9001 attgaaaatg aggtactgcc ctgtccccag ccctgcccgg tgctggccct gcctccttcc 9061 agctcagccc aggaccatcc tcatcaccat cagggagtcc caaccaccct ttgcccagat 9121 ctgtccccag tcgcaggagc tgtgccttgc tgtgtggacg gcaagttcag aggtcacaac 9181 agagctgctc atctctttaa aaaggggctg gagaggaatt cagcaagttt ccaggcagaa 9241 ctgaaaatga ccaaaggcta gagtggcctc cagtggtcag tactcagccc tgcccactga 9301 agcccaccct gtctcctgca ggacctggtg gcctgggtga cggtgggctt cctgcacatc 9361 ccccactcag aggacattcc caacacagcc acacctggga actccgtggg cttcctgctc 9421 cggccattca acttcttccc agaggacccc tccctggcat ccagagacac tgtgatcgtg 9481 tggcctcggg acaacggccc caactacgtc cagcgctgga tccctgagga cagggactgc 9541 tcgatgcctc ccccttttag ctacaatggg acctatagac ctgtgtgacc agcccccagt 9601 tcctccccca gttcctccca ggaagcccag gagcctcact ggggcagaca ataaactctc 9661 agagcctcgc tctgtgtgct gcttcttgcg gggagcacag ggccatgtgt gtaggaaaca 9721 cacgaacaga cgtgcacaca cacacagacg tgcacacaca cacagacatg cacacacaca 9781 cagacgtgca cacacacaga cgtgcatgca ctcacatgga catgcacaca catggcatgt 9841 actttattca cactggtcta ctgcagtcca gaaaagccac cattactaac aaaaaggaag 9901 ctt // LOCUS HSDNAAMHI 8407 bp DNA PRI 01-MAR-1996 DEFINITION H.sapiens gene for anti-mullerian hormone type II receptor. ACCESSION X89013 NID g1212943 KEYWORDS SF-1 response element. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8407) AUTHORS Picard,JY. TITLE Direct Submission JOURNAL Submitted (22-JUN-1995) J.Y. Picard, Inserm Unite 293, Inserm Endocrinol. 
Du Developpment, Ecole Normale superieure, 1 Rue Maurice Arnoux, F- 92120 Montrouge, FRANCE REFERENCE 2 (bases 1 to 8407) AUTHORS Imbeaud,S., Faure,E., Lamarre,I., Mattei,M.G., di Clemente,N., Tizard,R., Carre-Eusebe,D., Belville,C., Tragethon,L., Tonkin,C., Nelson,J., Mcauliffe,M., Bidart,J.M., Lababidi,A., Josso,N., Cate,R.L. and Picard,JY. TITLE Insensitivity to anti-mullerian hormone due to a mutation in the human anti-mullerian hormone receptor JOURNAL Unpublished FEATURES Location/Qualifiers source 1..8407 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="q13" misc_feature 464..472 /note="SF-1 response element" mRNA join(734..860,1165..1347,1586..1777,2042..2119,2332..2450, 2566..2796,5773..5887,6330..6502,6708..6855,7023..7159, 8056..8407) exon 734..860 /number=1 CDS join(812..860,1165..1347,1586..1777,2042..2119,2332..2450, 2566..2796,5773..5887,6330..6502,6708..6855,7023..7159, 8056..8352) /codon_start=1 /evidence=experimental /product="anti-mullerian hormone type II receptor" /db_xref="PID:e185726" /db_xref="PID:g1212944" /translation="MLGSLGLWALLPTAVEAPPNRRTCVFFEAPGVRGSTKTLGELLD TGTELPRAIRCLYSRCCFGIWNLTQDRAQVEMQGCRDSDEPGCESLHCDPSPRAHPSP GSTLFTCSCGTDFCNANYSHLPPPGSPGTPGSQGPQAAPGESIWMALVLLGLFLLLLL LLGSIILALLQRKNYRVRGEPVPEPRPDSGRDWSVELQELPELCFSQVIREGGHAVVW AGQLQGKLVAIKAFPPRSVAQFQAERALYELPGLQHDHIVRFITASRGGPGRLLSGPL LVLELHPKGSLCHYLTQYTSDWGSSLRMALSLAQGLAFLHEERWQNGQYKPGIAHRDL SSQNVLIREDGSCAIGDLGLALVLPGLTQPPAWTPTQPQGPAAIMEAGTQRYMAPELL DKTLDLQDWGMALRRADIYSLALLLWEILSRCPDLRPDSSPPPFQLAYEAELGNTPTS DELWALAVQERRRPYIPSTWRCFATDPDGLRELLEDCWDADPEARLTAECVQQRLAAL AHPQESHPFPESCPRGCPPLCPEDCTSIPAPTILPCRPQRSACHFSVQQGPCSRNPQP ACTLSPV" sig_peptide 812..862 sig_peptide join(812..860,1165..1166) intron 861..1164 /number=1 /evidence=experimental exon 1165..1347 /number=2 /evidence=experimental intron 1348..1585 /number=2 /evidence=experimental exon 1586..1777 /number=3 /evidence=experimental intron 1778..2041 /number=3 /evidence=experimental exon 2042..2119 
/number=4 /evidence=experimental intron 2120..2331 /number=4 /evidence=experimental exon 2332..2450 /number=5 /evidence=experimental intron 2451..2565 /number=5 /evidence=experimental exon 2566..2796 /number=6 /evidence=experimental intron 2797..5772 /number=6 /evidence=experimental exon 5773..5887 /number=7 /evidence=experimental intron 5888..6329 /number=7 /evidence=experimental exon 6330..6502 /number=8 /evidence=experimental intron 6503..6707 /number=8 /evidence=experimental exon 6708..6855 /number=9 /evidence=experimental intron 6856..7022 /number=9 /evidence=experimental exon 7023..7159 /number=10 /evidence=experimental intron 7160..8055 /number=10 /evidence=experimental exon 8056..8407 /number=11 polyA_signal 8389..8393 BASE COUNT 1932 a 2262 c 2140 g 2073 t ORIGIN 1 tctactaaaa atacaaaaat tagccaggtg tggtggcagg cgcctgtaat cccagctact 61 caggaggctg aggcaggaga atagcttgaa ccaaggaggc ggaggttgca gtgagctgcg 121 atcgtgccac tgcactccag cctgggggag aagagcgaga ctccgtctca aaaaaaaaaa 181 aaaaaaaatc cacatagcaa aatttaccat tgttactgtt ttcaatgtgt tttttaaaaa 241 ttcaggtaag ttttgtatac tttctgattc ttaagttgga aaacaaggta acctctaata 301 tgggctgtga ggccttcctc tgcccaagca gttgcagata tgaaccctga atataaagag 361 atcttaggct agaaaggatc ttgggcagag ctgaatggct ctaagcattt gacctcacat 421 tggtttcctc ctgagaagag tcaacagagt ccagcatctt cttccaaggt cagggaaggg 481 caaagatttg aaattaggca tcactgggtt ctcagctggg cctccaagtt cccctcctct 541 cactatgcag agcaaagagg aggttgcaga agagaagagg atatgagatt ggttctctgc 601 tcctcccttt ctttccctgc tttccccaca cccaccctct cccccacagc agccttctcc 661 cccacagagg ctgggatagg atgagggggg cggagttggg gactgaggga tcagaagccc 721 caggatgccc tgtatctgaa gaaagatttg gccaggggca gctgtgctgg cttatgctct 781 tctccttctg ctgctgccat cctccagcaa gatgctaggg tctttggggc tttgggcatt 841 acttcccaca gctgtggaag gtaagtgtct acagggaggg gaagggtctc tccatccatc 901 cagcaaggga aaggggcgct tgaagcaaga gccacccctt tggaagagtg gtgagtgggc 961 tgggtgagta agggtgaagg atagagccat gtgtccccat ggcagggctc aggttccagg 1021 cctctgctga ccctgcttcc 
tcctgtggct ttaccatact gacgctggga tgtggaacat 1081 gttttgtcta ttcttttggc cagttttttg cctctgcatt cactcccacc ttgaatcttt 1141 tcctttcccc accctgggcc tcagcacccc caaacaggcg aacctgtgtg ttctttgagg 1201 cccctggagt gcggggaagc acaaagacac tgggagagct gctagataca ggcacagagc 1261 tccccagagc tatccgctgc ctctacagcc gctgctgctt tgggatctgg aacctgaccc 1321 aagaccgggc acaggtggaa atgcaaggtg aatggcaaag tatatggcag gtgatggcta 1381 gggtgggaga cagacacatc ctggggtgtg ggtggcaacc aagggggaag gggagaaata 1441 gaacatctgg tgggaaagaa aagcccatga gagctggaag ggacgcctct gatagagaag 1501 ggatttaccc tctgtttcca caccccattg tgctttcttc cttgcccccc ctttctctcc 1561 tcttccccta atcccatccc atcaggatgc cgagacagtg atgagccagg ctgtgagtcc 1621 ctccactgtg acccaagtcc ccgagcccac cccagccctg gctccactct cttcacctgc 1681 tcctgtggca ctgacttctg caatgccaat tacagccatc tgcctcctcc agggagccct 1741 gggactcctg gctcccaggg tccccaggct gccccaggta gccacccaag ggtactgaag 1801 cctgatgggg gctggggccc aggttaggat gagaggtgga accagggcca gttctcaccc 1861 tactcccgcc ccacgctttc ctcctcctgg ccttgggaag ttggttgccc tggtgactgg 1921 gagataaggg gtcttgtgac cagggtgggg gtgggttgag acgcaagctc tcaggagggg 1981 aagagcagag agcaggtttg ggtcagtgct ctccagcctg cattcttgcc ttgatgtcca 2041 ggtgagtcca tctggatggc actggtgctg ctggggctgt tcctcctcct cctgctgctg 2101 ctgggcagca tcatcttggg tactaatcca ccccatccct cccttgtgac cccaagacat 2161 tgccccaaaa gctctgaccc ctctccaggc acccctgacc ccatggtctt gggatctcta 2221 tagcctgatt cccggactcc catgacctct cacaaagctc cctttccacg aagtcccttt 2281 tcctgtccct atgcatttgc accctgaccc taaggctctt gtctgttcca gccctgctac 2341 agcgaaagaa ctacagagtg cgaggtgagc cagtgccaga gccaaggcca gactcaggca 2401 gggactggag tgtggagctg caggagctgc ctgagctgtg tttctcccag gtgccccagg 2461 gagggagaga agggctcctc tgggcactcc tggaggttgt gctggggagg aatcctggcc 2521 ctgttatagc tcagaggccc acactcagca cagtgtcccc agcaggtaat ccgggaagga 2581 ggtcatgcag tggtttgggc cgggcagctg caaggaaaac tggttgccat caaggccttc 2641 ccaccgaggt ctgtggctca gttccaagct gagagagcat tgtacgaact tccaggccta 2701 cagcacgacc acattgtccg atttatcact 
gccagccggg ggggtcctgg ccgcctgctc 2761 tctgggcccc tgctggtact ggaactgcat cccaaggtga gcaccaagga gtgtatatgt 2821 gtgtgtgtgt gcctgtgtgt atgtatagag gtgggggcta catggcagct gggccctgtt 2881 gattgcttct gctttcgatt ttctcttttc taaaacatta aaaatggcca ggcgaggtgg 2941 cccacacctg taatcccagg actttgggag gccaaggtgg gtggatcgct taagcccagg 3001 agctcaagac cagcctaggc aatatgagga aaccccatca ctgcaaaatt acaaaaatta 3061 gctcagcatg gcagtgcaca cctgtagtcc cagctactca ggaggctgag gtggaaagat 3121 cgcttgagct taggaggttg aggctgcagt gagccgtgag catcccactg cactccagcc 3181 tgggagacag agcgagacca tgtgaaaaaa tagtaacagg ctgggcacgg tggcagacgc 3241 ctgtaatccc agcactttgg gaggccaagg tgtgaggatc acttgaaccc aggaattcaa 3301 gaacagcctg ggcaacaccg tgaaacccca cctctacaaa aaacaaaaaa ttggctgggt 3361 gtgatggcac atgcctgtgg tcccagctac ttgggaggct gaagtggaag gagcacctga 3421 gcctggggag gtcaaggctg gcagtgagcc aagattgtgc cactgcactc cagcctaggt 3481 gacagagtaa gaccctgtct caaaataata atttttattt atctttttta ttacataaac 3541 aatacatgtt tatatatttt tattttattt tggtgggagg acggagtctt gctctgttgc 3601 ccaggctgga gtgcagtggc acaagcttgg ctcactgcaa cctcagcatc ccaggttcaa 3661 gcgattctcc tccctcagcc tccccagtag ctgggattac aggcgcttgc caccacgcct 3721 ggctaatttt tgcattttta gtagagacgg ggtttcacca tgttggccag gctggtttca 3781 aaatcctgac ctcaagtgat ccacccatct tagcccccca aagtgctggg attacaggtg 3841 tgagacaccg tgcccagcca aaaaaaaaaa aaaaaaaatt gttaaagaaa tatcttcagg 3901 cacagccctc caaactcatt cgtattcatt caacaaataa gtatgaacta tatgtcaggc 3961 actatgctag aggccacaaa aaacacacag tccctctttt acagatttta tagactagca 4021 ggggagacag aaatggatca agtaaacacg agtgtcaaat gtcaagtgtg ctaagtgcta 4081 ggagagaggg ccaatgaact ataagaataa taagatgtga cttgaagggg agagcaatga 4141 gcaacaagaa gaggaaaagg taggcagagg ccagccacac caggccctgt gggcctgtta 4201 aatatttggg gctttttaag ttttcatacc atttcccatc tgcttcctgc taggcattac 4261 gctacccgct ttttatttta ttttatttta ttttattttt tagacagagt ctcattctgt 4321 tgactaggct agagtgcagt ggcacaatct tggctcactg cagcctctac ctcctggttt 4381 caagcaattc tgcctcagcc acctgagtag ctgggattaa 
ggcgtgccgc cactactcct 4441 ggctaatttt tgtattttta gtagagacgg gatttcaccc tgttggccag gctggtctca 4501 aattcctgac ctcaagtgat ctgcctgcct cggcctccca aagtgctgga attacaggcg 4561 tgagccaccc cgcccggcca tgctacacgc tttaaacaca tttccaaact tacccttcac 4621 aaacacatta tgagatagga atgattatcc acatggggct cagaaaagga aagtcatttg 4681 cccagaatct atgtgactct agagctggct cctcatcatc gtgtctattg cctcctgttt 4741 acccctccat caggactgtg tgtgtctctg tccacgtgga agagctcccc acaactcact 4801 ttgccctatt gtaagtagtt caacctccac cacctgtttc agagcccaaa tgcagtggct 4861 cttccctgaa ctttttcagc acctacagct tctgtacttc tgaaatcttg aatgatggaa 4921 agtcccttct ttgaccatat acctcctggc ccagctcaaa aatcactgtt tttttgtttg 4981 tttaatttct ctgacttctc caggcagggt tagtcattct tgctttatgc cccttggcac 5041 tcggttaatt ttcctatctt ggggtttact gcattcttta tctcttacct tccctgagcc 5101 atcttggtgc ctgaacacag ttcgtggcca ccgaaggtgc tcgatgacac ttgttgaatg 5161 aggggttcta ttgctccctt ccctccaatt cccacttgcc ctccacctgc tactctgtta 5221 tctactggct ctgcccgtct atgtgtacac atgcctgaca cctcagacaa ggtgactcat 5281 gaaggatgcc cctgactagc ttgccctgcc tcctctgtta ctgtctttgc cgcatgtctt 5341 ctgagtcagt ataaggtggt ggttattagt aagtgatcct gagtcagact gcttggattc 5401 aagctccact cctgcctcac cgactgactt caagagagtt tccttagttc tgtaaggaaa 5461 agctcttcat ctgttaaatg ggggtgtgac aggagcacct accttctagg gttttcacaa 5521 ggattacagg aggccacatc cactgggtct ggaccacagt aagtgcccag tacccttgat 5581 ctagagttac tatttttcat gtcaattgat gcacagtctt actcctctac ctattgtctt 5641 ggccagcatc catcagtgtg tctgtctgct ggggagatgc agggagaaga ctgtcaattg 5701 atctctgctc cctgggatgg atcagccgtc tccagctttg tgtaccatcc ttttctctct 5761 gcgtttccca agggctccct gtgccactac ttgacccagt acaccagtga ctggggaagt 5821 tccctgcgga tggcactgtc cctggcccag ggcctggcat ttctccatga ggagcgctgg 5881 cagaatggtg ggtgagctgg gcataggaag tcaagggagc cacagtgcta tgtttgtgat 5941 tctgccttag tttggagggg aaagattggg tcaaaagagg gaggaagagc cgggcacggt 6001 ggctcacgcc tataatccca gcactttggg aggccgaggt gggcggatca caaggtcagg 6061 agatcgaggc catcctggct aacatggtga aaccctgtct ctattaaaaa 
tacaaaatat 6121 tagccgggcg tggtggcatg tgcctgtagt cccagctact cgggaggctg aggcaggaga 6181 atcacttgaa cccgggaggc ggaggttgca gtgagccgag atcgcgacac tgcactccag 6241 cctgggcgac agagcgagac gccgactcaa aaaaaaaaaa agagggagga agaaaatcca 6301 tgttccttca accttggatt cccccacagg ccaatataaa ccaggtattg cccaccgaga 6361 tctgagcagc cagaatgtgc tcattcggga agatggatcg tgtgccattg gagacctggg 6421 ccttgccttg gtgctccctg gcctcactca gccccctgcc tggaccccta ctcaaccaca 6481 aggcccagct gccatcatgg aagtgagttc tctggataac tggtgaggcc caggatgatg 6541 ttggtgctgc tgatggcaat gcagccattg tgtgtcaaca gttgtagcaa tacctatagc 6601 atttgggaca ttgctgagtc tgtagttggg gggatattgc atggaccatt gctgcaatga 6661 ggattgccac agagatgatt cttggccctt cgtgccttgc tctccaggct ggcacccaga 6721 ggtacatggc accagagctc ttggacaaga ctctggacct acaggattgg ggcatggccc 6781 tccgacgagc tgatatttac tctttggctc tgctcctgtg ggagatactg agccgctgcc 6841 cagatttgag gcctggtaag gatgggtggt acagtcccct ctcctgggct cccccccgcc 6901 cattctaggt tcaccccaac ctgacctggc ctgagaaagc tctgctcttc cctgtcttgc 6961 cctttctaca tggtaggcac ccctaggact aactgatacc cagcccctct accttcctcc 7021 agacagcagt ccaccaccct tccaactggc ctatgaggca gaactgggca atacccctac 7081 ctctgatgag ctatgggcct tggcagtgca ggagaggagg cgtccctaca tcccatccac 7141 ctggcgctgc tttgccacag taagaggcct aggctgttgg tctgggaacc tggagagtgg 7201 gggctgggca tgggcttcaa ggacgtctct gccagagtgt ctgtctactc ctatctccac 7261 ttatctccca tcactccttg gttcatgctc agctggaact gggcaagcct cctctccccg 7321 tcagttcatc ctcttccacc ctaagtctca cacagtcgat tccatctacc tgagacacac 7381 accttcctcc tgctcaacct tgcccagcct gtctctcctc tctgccatca gttagccctg 7441 ttcctcagtc cccttctcca ggaagccctc cttggctttg ctcttgccct gagctctgcc 7501 ccatctgctc tcctaataca gtaaagccca aaaatcaatt caataagtcc ccattctgtc 7561 atacattgat gatgtttctg ctgtgaccca ataaaaggca gttctaggta taaaaggtga 7621 aagcagtggc gtcacagtga aagttcaggg ctttgcacaa atactttttt aacaaaatgt 7681 cccttctgta cctgatctga aatttccacg gcacaagtct taggttggct gacagaaact 7741 gggcattact tgcagaagac tctggctctc tgagaggaaa gagagtaatt tcccattagc 
7801 atattaagct gtcaacctac attagctcct cagaggaaac agtgggctat acagaaggcc 7861 cccagagagc ctgtttcata gggagcaaga ctcagagggc tggaattcag caagagggag 7921 aggagggagg ctccaggaaa gatctcagaa gtggatgttg aaagcaggag agtgatggac 7981 actgaagatg gcttttaacc ctggggccca ctcaagatcc tagggtcaac ccttcctccc 8041 tgtcattccc cccaggaccc tgatgggctg agggagctcc tagaagactg ttgggatgca 8101 gacccagaag cacggctgac agctgagtgt gtacagcagc gcctggctgc cttggcccat 8161 cctcaagaga gccacccctt tccagagagc tgtccacgtg gctgcccacc tctctgccca 8221 gaagactgta cttcaattcc tgcccctacc atcctcccct gtaggcctca gcggagtgcc 8281 tgccacttca gcgttcagca aggcccttgt tccaggaatc ctcagcctgc ctgtaccctt 8341 tctcctgtgt aaatatgcag tttatgtgtc atcaatgtac atgccaacat aaatatggcg 8401 attgtat // LOCUS HSDNAMIA 3573 bp DNA PRI 12-MAR-1996 DEFINITION H.sapiens MIA gene. ACCESSION X84707 NID g683459 KEYWORDS melanoma growth regulatory protein; mia gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3573) AUTHORS Bosserhoff,A.K., Hein,R., Bogdahn,U. and Buettner,R. TITLE Structure and promoter analysis of the gene encoding the human melanoma-inhibiting protein MIA JOURNAL J. Biol. Chem. 271 (1), 490-495 (1996) MEDLINE 96132947 REFERENCE 2 (bases 1 to 3573) AUTHORS Buettner,R. TITLE Direct Submission JOURNAL Submitted (10-FEB-1995) R. Buettner, University of Regensburg, Institute for Pathology, Franz-Josef-Stauss-Allee, 93042 Regensburg, FRG COMMENT Sequence overlapping with that under the acc#X75450. 
FEATURES Location/Qualifiers source 1..3573 /organism="Homo sapiens" /variety="caucasian" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /clone_lib="lambda FIX II" mRNA join(1316..1513,1595..1728,2812..2922,3240..3334) /gene="MIA" exon 1316..1513 /gene="MIA" /number=1 gene 1316..3334 /gene="MIA" CDS join(1387..1513,1595..1728,2812..2922,3240..3263) /gene="MIA" /function="melanoma inhibitory activity" /codon_start=1 /product="melanoma growth regulatory protein" /db_xref="PID:g683460" /translation="MARSLVCLGVIILLSAFSGPGVRGGPMPKLADRKLCADQECSHP ISMAVALQDYMAPDCRFLTIHRGQVVYVFSKLKGRGRLFWGGSVQGDYYGDLAARLGY FPSSIVREDQTLKPGKVDVKTDKWDFYCQ" intron 1514..1594 /gene="MIA" /number=1 exon 1595..1728 /gene="MIA" /number=2 intron 1729..2811 /gene="MIA" /number=2 exon 2812..2922 /gene="MIA" /number=3 intron 2923..3239 /gene="MIA" /number=3 exon 3240..3334 /gene="MIA" /number=4 BASE COUNT 877 a 810 c 967 g 909 t 10 others ORIGIN 1 tctagacana taaaaataaa agaaatcatc caagaatggt gacttgccta ctattctact 61 cgagaggctg agaggggagg atttcttgac cccgggagtt taaggatgca gtgagctatg 121 atcacatcac tgtacttcag cctgagcaac agcaagatcc tgtctctaaa aattaaataa 181 ggctgggctt ggtggctcat gctgtaatcc cagcactttg gaaggccatg gtgggcagat 241 tgcttgagcc caggagtttg agacgaggct gggcaacatg acgaaacccc ggctctacca 301 aaaaatacaa aaaattaact gggcataatg gtacatgtct gtggtcccag ctactcggta 361 ggctgaggtg ggaggaatgc ttgagcccag gaaatagggg ctacagtgaa ccaggatgat 421 gccagtgcac tccaacctgg gcaacagagc aagactctac ctcaaaataa tttaaaaaaa 481 tggattaatt gggcataggt ggcttgtgcc tgtagtccca gttactcagg agcctgaggt 541 gggaggattg cctgagtcta ggaggttgag gctgcagtga gccgggatgg caccattgca 601 ctccacctgg gcaacagggt gagaccctgt ctcaaaaaag aaaaaaaagg gaggggttat 661 aatcactcct cctgacatga tacagagtat ccatttgagt tcataacata aatatgtact 721 tggtgaatgc tctgtaacta ttggtgaatg ctctgtaact attggctttt ttattgttcc 781 cattttacat ataaggaagc tgaggctttg tgaggagaaa tagcttagcc caggtcatcc 841 agtgggaagc gtctggtgca gaggaatagt gatcagggtg ggactttgcc tagcctaagg 901 
ttcagcatac aatattcagt cagtactcaa gggctgggct gtttctggta atcaaagggc 961 ctgccttgtc ctcctccccc acagcaggaa attccaaggt ggttttcttt acaggctcct 1021 ccgcttctgt ggccagaggg gacagcggag gaccccaggt acctaagcca actcaagaga 1081 agatggaatt gaatatttca accaccttat ctaggcctct gtgattgttg aggagggggc 1141 tgtcactggg aaagttgtga gctgctttgg accttatctg ggaatttcct tgggccttac 1201 agctttaccc tatccttgaa atggttctgg tttcatagca acttctaggt ggtgtgggcg 1261 aagtttggga ctggtttagg gcggggacaa gaccaagaac acaagtttcc ttgtacggga 1321 gagagggagg ggaggaaatt ggagacccca gcaccccctt gctcactctc ttgctcacag 1381 tccacgatgg cccggtccct ggtgtgcctt ggtgtcatca tcttgctgtc tgccttctcc 1441 ggacctggtg tcaggggtgg tcctatgccc aagctggctg accggaagct gtgtgcggac 1501 caggagtgca gccgtaagaa tggggagggg tagaattggg cttgggtgtt agcctgtgtg 1561 gatgtgctgc attccccttc tattccttcc ctagacccta tctccatggc tgtggccctt 1621 caggactaca tggcccccga ctgccgattc ctgaccattc accggggcca agtggtgtat 1681 gtcttctcca agctgaaggg ccgtgggcgg ctcttctggg gaggcagcgt gcgtcttggg 1741 agagtgaaag agggaagggt acagagctgg ggtagactca ttatccccat gaagggaaga 1801 tttgaggggg gtgaactgaa atagacattg tggggggata ttgttactta ctttatttta 1861 tttgcttatt attttttaat tttttccgag acagagtctt gctctgtcac ccaggctgga 1921 tgcaatggca cgatctcggc tcactgtaac ctccacctct tgggtttaag cgattctcca 1981 gcctcagcct cccaagtacc tgggattaca ggcatgcacc accacacctn ntaatttttg 2041 tatttttagt agagacaggg ttttaccata ttggccaggc tggtcttgaa ctcctgacct 2101 catgatctgc ccgccttggc tcccggagtg ctgggattac aggtgtgagc cactggcccc 2161 ccagcctatt ttcactttat ttaccaattt taggacctga tatggtccca nnntctgttc 2221 tagatctaga caccaagata caacaacaaa tgatcctttt tattctaatg gagggaaatg 2281 aacaaaaagc aaggcataaa aaatagcagc agccgggcac agtagctcac acctgtaatc 2341 ccaagtaagg ccaagtnngg aggatagctt gagcccagga gttcgagacc agcctgggca 2401 acatagcaag acccccatct ctataaaaaa aaatttaaaa ttaactgggc atcatggcat 2461 gtgtctgtgg tcccggctac tcgggaggct gaggtgggag gattgcttga tcccagaagt 2521 tgaggctgca gtgagccgtg atcatgctac tgcacctcaa cctggccgac acaatgagac 2581 cctgtttcca 
aaataataat aataaaagca aatatgcgct gctgtgagaa ttaacagaga 2641 cttacttggg tgttcagaaa gggcctctga acaggtggca tttaagctga gattcatatg 2701 acaaggatgg agcagttatg tggagatcag ggagagggga gaatgcaaag gccttcagca 2761 ggcacaagct tgccatcttc cagaccctag cttttaactc ctcttcccca ggttcaggga 2821 gattactatg gagatctggc tgctcgcctg ggctatttcc ccagtagcat tgtccgagag 2881 gaccagaccc tgaaacctgg caaagtcgat gtgaagacag acgtgagtgt catgggggct 2941 ggcaagaaat gtggggggag gacccttagg ttgtggggat gggcaaaaat gctcccacac 3001 ttggctccct ggccgcctag gtatgtgcgc tgggagaaat tctttccctg cctcaatttt 3061 ctcaccagta aaatgggtcc agttgggagg tgcaaagatt agagggctct aggctaattt 3121 gcatagcann tgtgtggcca gacctgggcc ctgcagctgc agcctttgct aaaaccacta 3181 gatcctttgt ggtgtgaccg ctggttttct ttccactgtt tcccctttct ctttttcaga 3241 aatgggattt ctactgccag tgagctcagc ctaccgctgg ccctgccgtt tcccctcctt 3301 gggtttatgc aaatacaatc agcccagtgc aaacggctcg tctccgtggt ctttggggtg 3361 gggtagggta gggtggggac tgtacaaatg aaatgtttct ctaggttgct gaatctaacc 3421 aattaacccg ctgcctgtgg taacgtcagt ggttgctagg cagagtttcg ctgatgaaag 3481 ccctgtgcag taggagcgct cctaagctta ggtttcgaca caagcaaaga aaacctaagc 3541 agcccaacta gggattgtag tgtcctctct aga // LOCUS HSE48ATGN 3186 bp DNA PRI 28-DEC-1997 DEFINITION H.sapiens E48 gene. ACCESSION Y12642 NID g2739293 KEYWORDS E48 antigen; E48 gene; LY6 family. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3186) AUTHORS Brakenhoff,R.H. TITLE Direct Submission JOURNAL Submitted (16-APR-1997) R.H. Brakenhoff, Free University Hospital, Dept Otolaryngology, de Boelelaan 1117, 1081 HV Amsterdam, NETHERLANDS COMMENT Related sequence X82693. 
FEATURES Location/Qualifiers source 1..3186 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="q24-qter" TATA_signal 1503..1507 exon 1531..1606 /number=1 gene 1555..2849 /gene="E48" CDS join(1555..1606,2384..2482,2614..2849) /gene="E48" /note="member of Ly-6 superfamily" /codon_start=1 /product="E48 antigen" /db_xref="PID:e314115" /db_xref="PID:g2739294" /translation="MRTALLLLATLAVATGPALTLRCHVCTSSSNCKHSVVCPASSRF CKTTNTVEPLRGNLVKKDCAESCTPSYTLQGQVSSGTSSTQCCQEDLCNEKLHNAAPT RTALAHSALSLGLALSLLAVILAPSL" intron 1607..2383 /gene="E48" /number=1 exon 2384..2482 /gene="E48" /number=2 intron 2483..2613 /gene="E48" /number=2 exon 2614..3186 /number=3 BASE COUNT 633 a 1002 c 1023 g 525 t 3 others ORIGIN 1 ggatccccgg catctgagac gtggggctca gcaagggaga gatgccggga ggaagggcca 61 gagggtctgt ctcaagccca ggcctgactg gccccagtac ccacggtgct cccaaaagcc 121 tcagcccagc ccaggaccac cctcctcgga accggccctg ccctgccctg ggcctgttgt 181 tgtctgttca tcagccacgt tggcagcggc ggcaggaggc tgggcccagg gctgacctcc 241 atctgtgccc cggctcccac tcaccttcag cctagccagg gcaggggcaa agcaagtctg 301 tgggcccctc cacaccactc aggtgggtgc tgcccagcac aggcgaacgg caaaacgtgc 361 tgggctgaac agtggatggg ggccccaaag aggttgggtt cttgcgcgag gcggtgggtt 421 gttcagagac ccgttcttcc agggaaagaa attccttcct gtgagtataa agagggtgaa 481 ggcggggact ggcttccaac tcaacgggcg gcagaccaag aacctgcttt gaggccctgg 541 gtgctggggc ccagccccaa agagttcacg attttgggga gactggagcc gggacaccta 601 gagcaatagt tacaaagtgg ccgggatggc gctgccccca acgtagcagg aggggcctgg 661 agagtggggg aaggaatccc acggggaggg cttctggggc cctggagggc gggaggcagg 721 gctggtggag accaggaggg ctcagtctct ccctgcacaa cgcagggtgg agttgctgcc 781 tccacagccc gggccttctg cccacagggc tccccaggca acttcgaggg ccgggtcctc 841 acagtcctca ctctgtccag tgctcacctc actgctgccc cagctgccct ccctgcctcg 901 gagtggcctc tgcctctgcc tctgcctcac acaccccatc agcccacagc caccccgagc 961 ccttgcagct caaggacctt ctgcatcccc ggcagcaacc ctgagcccag agccaggaga 1021 gtaggaccag cacctgccct tcctacctcc cctgccctgg acacccacgg gcacccacag 1081 ggcaccgggc 
acaggctgca actctattga gggaactatg gggggaatgc agggacaaag 1141 acacaggatg gtgatggatg gtggggggtg gggggcgggg ggtagcgggt ggcaggtcag 1201 agacaaagtg acccaggaac ggacggcagc gagagaacaa gacccacaga gacagagaag 1261 tgaaagtggg agagatggag gcacagcagc aggaacagca gggccaggag gccactgggc 1321 cccgttgcgg agggcagacc tctgcctggc ggggggcagc aggggcagcg cccagcgaca 1381 gccaggctgg tgggcgtggg ccaggcaggg tggggcaggt gcccgtcctg ggctgacgct 1441 tgcacagagc cctgcagaat gccgctggct ggcggactgc ccacccccgc ccagcccgtg 1501 cctataaggc cttggcaatg caggggcccg cactgctccc agacgacatc agagatgagg 1561 acagcattgc tgctccttgc aaccctggct gtggctacag ggccaggtga ggggctgggc 1621 caaggggacg ggcgggtggg cggwtctggc ctgtgtggtc accaaatact aggagcctgg 1681 agctgacgac atctaggcac agagaacccc gacactgtgg gtgctagact cttcaaattt 1741 aaaacaacat ggagcccgaa tcttaacgag tgtcagattt acaatgtttc agaatctgag 1801 gatcttagaa ctgactcctg gagtcacgga ccctgagaga gtggagcagg gggttgtctt 1861 gagatgcttc tcactgggag agcggccatc tagtccctgc caccagcccc tcccggtccc 1921 caggtctccc ccagcacagc ccaggacagc ccaaaggcca ccaagccctt cctgtcatag 1981 gctcagagca gccaagctgg ctgcccccgg ggccttagct ccgtcaggcc ctcagaaaca 2041 tagccatctc ctccagggac atcccctggg gtggggacaa ganccagtgt cctcttgnca 2101 gtcgcagagc tccctgagac cttggaggaa aggcttgcaa cacaccctag acaggggcag 2161 ggcaggacac ctcttccaag ccggcccagg ctggaataca tgtcccctga ggctgcactc 2221 agggcacatg ccctgccagg acccagagcc cgggtttggg ggcctagggc aagggcagca 2281 gggcctctgt ccaggagggg tctctgcagg tgggggagca ggccctgcta agtcaccagg 2341 aggcctctgt tgggcccgct gagccggact ccctccccgc cagcccttac cctgcgctgc 2401 cacgtgtgca ccagctccag caactgcaag cattctgtgg tctgcccggc cagctctcgc 2461 ttctgcaaga ccacgaacac aggtgggtag catctgtcca ggtgtccctg gggctgggga 2521 catccctggc tgttgtgggt gttggctgga aggttgtggg gaggtggggg gcagaggtgg 2581 ctgcctggcc tgaccacact cacctgtgcc cagtggagcc tctgaggggg aatctggtga 2641 agaaggactg tgcggagtcg tgcacaccca gctacaccct gcaaggccag gtcagcagcg 2701 gcaccagctc cacccagtgc tgccaggagg acctgtgcaa tgagaagctg cacaacgctg 2761 cacccacccg caccgccctc 
gcccacagtg ccctcagcct ggggctggcc ctgagcctcc 2821 tggccgtcat cttagccccc agcctgtgac cttcccccca gggaaggccc ctcatgcctt 2881 tccttccctt tctctgggga ttccacacct ctcttcccca gccggcaacg ggggtgccag 2941 gagccccagg ctgagggctt ccccgaaagt ctgggaccag gtccaggtgg gcatggaatg 3001 ctgatgactt ggagcaggcc ccacagaccc cacagaggat gaagccaccc cacagaggat 3061 gcagccccca gctgcatgga aggtggagga cagaagccct gtggatcccc ggatttcaca 3121 ctccttctgt tttgttgccg tttattttgt actcaaatct ctacatggag ataaatgatt 3181 taaacc // LOCUS HSEDMDGEN 3339 bp DNA PRI 27-OCT-1997 DEFINITION Homo sapiens EDMD gene. ACCESSION X86810 NID g1199823 KEYWORDS EDMD gene; emerin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3339) AUTHORS Bione,S., Small,K., Aksmanovic,M.A., D'Urso,M., Ciccodicola,A., Merlini,L., Morandi,L., Kress,W., Yates,J.R.W., Warren,S.T. and Toniolo,D. TITLE Identification of new mutations in the Emery-Dreifuss muscular dystrophy gene and evidence for genetic heterogeneity of the disease JOURNAL Hum. Mol. Genet. 4 (10), 1859-1863 (1995) MEDLINE 96121582 REFERENCE 2 (bases 1 to 3339) AUTHORS Toniolo,D. TITLE Direct Submission JOURNAL Submitted (02-MAY-1995) D. 
Toniolo, Instituto di Genetica Biochimica ed Evoluzionistica, CNR, Via Abbiategrasso 207, 27100 Pavia, ITALY FEATURES Location/Qualifiers source 1..3339 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /cell_type="lymphoblasts" /clone_lib="Xq28 specific cosmid lib in lawrist" /map="q28" exon 723..914 /number=1 gene 833..2545 /gene="EDMD" CDS join(833..914,1038..1142,1290..1367,1582..1715,2101..2150, 2230..2545) /gene="EDMD" /codon_start=1 /product="emerin" /db_xref="PID:e1154169" /db_xref="PID:g2564243" /translation="MDNYADLSDTELTTLLRRYNIPHGPVVGSTRRLYEKKIFEYETQ RRRLSPPSSSAASSYSFSDLNSTRGDADMYDLPKKEDALLYQSKGYNDDYYEESYFTT RTYGEPESAGPSRAVRQSVTSFPDADAFHHQVHDDDLLSSSEEECKDRERPMYGRDSA YQSITHYRPVSASRSSLDLSYYPTSSSTSFMSSSSSSSSWLTRRAIRPENRAPGAGLG QDRQVPLWGQLLLFLVFVIVLFFIYHFMQAEEGNPF" intron 915..1037 /gene="EDMD" /number=1 exon 1038..1142 /gene="EDMD" /number=2 intron 1143..1289 /gene="EDMD" /number=2 exon 1290..1367 /gene="EDMD" /number=3 intron 1368..1581 /gene="EDMD" /number=3 exon 1582..1715 /gene="EDMD" /number=4 intron 1716..2100 /gene="EDMD" /number=4 exon 2101..2150 /gene="EDMD" /number=5 intron 2151..2229 /gene="EDMD" /number=5 exon 2230..2869 /number=6 polyA_site 2848..2854 BASE COUNT 564 a 1078 c 987 g 710 t ORIGIN 1 gcgcgcaggc cccgcccctc tcaccccgcc gcacgccaca gggtgacgtc tgggctccca 61 gccgcatcgc cctgactccc gcgcgggccc cgccccctgc cgctagccaa tctgtgcgtt 121 tgtgactttt gggcccgcag ccccgcctgc tcccacagcg ataccggttt gcattgccct 181 gactcccgcg cgggccccgc cccctacgcc gctagccaat ccatgcatta gtggcgtccg 241 ggctcgcagt accgctcgct cccaccgcga gaccttctgc tccgcgcccg cgcgggcccc 301 tccccctcca tcgctagcca atccccgttt tgtgacgtat gggctcgcgg ccccgctcgc 361 tcccaccgcg agaccttttg ctccgcgccc gcgcgggccc cgccccctcc atcactagcc 421 aatccccgtg cttgtgacat atgggcttgc ggccccgccc gctcccatcg cgagaccggt 481 tcccaccgcc ctgactcccg ggcgggcccc gccctctccg ccgctagcca atcctcgcgt 541 tgatgacgtt tgggctcgcg gccccagcct cccagctctc agggcacggc cggtctgtgc 601 cggctgctcc cgcggttagg tcccgccccg cgcagcgcgc 
gcagcctgcg gagccagcgg 661 ccgtgacgcg acaacgattc ggctgtgacg cgacaacgat tcggctgtga cgcgagcgcg 721 gccgctcccg atgcgctcgt gccgcccccg ccgtgctcct cggcagccgt tgctcggccg 781 gttttggtag gcccgggccg ccgccaggcc tccgcctgag cccgcacccg ccatggacaa 841 ctacgcagat ctttcggata ccgagctgac caccttgctg cgccggtaca acatcccgca 901 cgggcctgta gtaggtacgc ggcggcgggc gggacccctt ccgggccccc tcctcgtgct 961 ccgcctcgcg acctccccgc tgccctcccc gcgcgccttc cccggcccgc ggccctgacc 1021 gccccgtgtc cggccaggat caactcgtag gctttacgag aagaagatct tcgagtacga 1081 gacccagagg cggcggctct cgccccccag ctcgtccgcc gcctcctctt atagcttctc 1141 tggtgagagc ctcgcctgtg gggacagcct gggacgcggg gaggatgggg tcgcgagggt 1201 gtggcagggg ggccggtcga gagcggcact ggagaaaggg gagggaagtc tgggggggca 1261 aacagttctg tctcctcctt tcaatccaga cttgaattcg actagagggg atgcagatat 1321 gtatgatctt cccaagaaag aggacgcttt actctaccag agcaagggta aggcaggggt 1381 tgggtgggca cgctggcacc ttcacccgac ttcgtcaggg accccgctca cagggaggac 1441 ctgagacctc agtcccaacc actccagcag ccttaggagg gagaaactgt tacaggtccc 1501 gaaatgggat tcagattagg gccatcaggc caggcggggc acaccgatgc cccctctgct 1561 accgctgccc cccttcccaa ggctacaatg acgactacta tgaagagagc tacttcacca 1621 ccaggactta tggggagccc gagtctgccg gcccgtccag ggctgtccgc cagtcagtga 1681 cttcattccc agatgctgac gctttccatc accaggtgag ctggctggca ggcgtcctgt 1741 acttgggtac aacctagggg atcgcggctg tgtttggata aatccagggg ggcactgggt 1801 acaaatggtg gctcttgggc ctccggggag actctgtgtg actagagcac cctggtctgg 1861 gatctaggct cagactcttc ctgagagtcc tgggggcaaa aggggatgct ggggcatgag 1921 cacaagtggc aaggccccat ggataaaggg ctgaacaccc agagccattc aggagggtgt 1981 gggttcctgg cctctaacca aaggtcagag gggactggct ggggaagttt ggactgaggg 2041 acatgacagg gccatggtgg ccctgccagc cagtcccctc gccctgactc tcttctgcag 2101 gtgcatgatg acgatctttt gtcttcttct gaagaggagt gcaaggatag gtgcgtagtg 2161 ggggagccca gggacgggct ggttctgggt ccaggctcct ggcccacttg ctcccctctt 2221 ttgcctcagg gaacgcccca tgtacggccg ggacagtgcc taccagagca tcacgcacta 2281 ccgccctgtt tcagcctcca ggagctccct ggacctgtcc tattatccta 
cttcctcctc 2341 cacctctttt atgtcctcct catcatcttc ctcttcatgg ctcacccgcc gtgccatccg 2401 gcctgaaaac cgtgctcctg gggctgggct gggccaggat cgccaggtcc cgctctgggg 2461 ccagctgctg cttttcctgg tctttgtgat cgtcctcttc ttcatttacc acttcatgca 2521 ggctgaagaa ggcaacccct tctagaggga gccatgaggg tctgggcttc agagctaggt 2581 ctttggggaa gtcctggctg actgccttag cagtgggggt gggggtgggg gcaggggcag 2641 gggctttatg tgtttttgct tggggggcgc tgggcctagc ccagagtagt gcttgctccc 2701 cctgccttgt cccaccaggg aggcagcaga ctcaggccct ccatggtcct ctttgtcatt 2761 ttgttgacat gcattcctcc ttttgtcatc ttgttggggg gaggggatta accaaaggcc 2821 accctgactt tgtttttgtg gacacacaat aaaagccccg tttatttgta atgcgttggc 2881 tcttcctgga ggagagggtt gggctcccat ggcaagggcc tctgcgtctt ggggctccag 2941 gattgcaatc cggctttgtt gggtccgcat ttttgcttta gtctggggat aggaatcaaa 3001 tgttacccag agatgtttgt gttttgtttg ggagttttat tccctaactc attccccaaa 3061 gcacgtgtaa ctgcttatac atataatcgt ggtacaacaa ggtatataca gagaacccac 3121 ttggaaattc aggcaaagct gcatgcacgc taccagcagt ctgcgggtgt tttaactgga 3181 aaaagctgaa gtccacctcg gtgtccaatg gcatggggat ggaaagaaaa tgaggcgtct 3241 ctggcacatc attctcagct cctggaactg ctgcttgttt aacatgggag aaaagctcca 3301 aaggctgaaa tgccccatca tccctgggtg attgaattc // LOCUS HSENO2 10905 bp DNA PRI 25-JUN-1997 DEFINITION Human ENO2 gene for neuron specific (gamma) enolase. ACCESSION X51956 NID g31164 KEYWORDS enolase; neuron-specific enolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10905) AUTHORS Feo,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-1990) Feo S., Istituto de Biologia dello sviluppo, del consiglio nazionale delle ricerche, via archirafi 20, 90123 Palermo, Italy REFERENCE 2 (bases 1 to 10905) AUTHORS Oliva,D., Cali,L., Feo,S. and Giallongo,A. 
TITLE Complete structure of the human gene encoding neuron-specific enolase JOURNAL Genomics 10 (1), 157-165 (1991) MEDLINE 91257823 COMMENT For overlapping sequences see and . FEATURES Location/Qualifiers source 1..10905 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hematopoietic" /cell_type="PBL" /clone_lib="EMBL3" /clone="lambda G3" /chromosome="12" /map="12p" repeat_unit 205..508 /note="ALU-like element" mRNA join(1100..1309,2450..2546,3044..3139,3297..3355, 3662..3731,4202..4335,4561..4783,6188..6385,8198..8399, 8673..8781,8961..9019,9348..10312) /gene="ENO2" /note="human gamma enolase" gene 1100..10312 /gene="ENO2" exon 1100..1309 /gene="ENO2" /number=1 intron 1310..2449 /gene="ENO2" /number=1 exon 2450..2546 /gene="ENO2" /number=2 CDS join(2462..2546,3044..3139,3297..3355,3662..3731, 4202..4335,4561..4783,6188..6385,8198..8399,8673..8781, 8961..9019,9348..9417) /gene="ENO2" /codon_start=1 /product="human gamma enolase" /db_xref="PID:g31165" /db_xref="SWISS-PROT:P09104" /translation="MSIEKIWAREILDSRGNPTVEVDLYTAKGLFRAAVPSGASTGIY EALELRDGDKQRYLGKGVLKAVDHINSTIAPALISSGLSVVEQEKLDNLMLELDGTEN KSKFGANAILGVSLAVCKAGAAERELPLYRHIAQLAGNSDLILPVPAFNVINGGSHAG NKLAMQEFMILPVGAESFRDAMRLGAEVYHTLKGVIKDKYGKDATNVGDEGGFAPNIL ENSEALELVKEAIDKAGYTEKIVIGMDVAASEFYRDGKYDLDFKSPTDPSRYITGDQL GALYQDFVRDYPVVSIEDPFDQDDWAAWSKFTANVGIQIVGDDLTVTNPKRIERAVEE KACNCLLLKVNQIGSVTEAIQACKLAQENGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLMRIEEELGDEARFAGHNFRNPSVL" intron 2547..3043 /gene="ENO2" /number=2 exon 3044..3139 /gene="ENO2" /number=3 intron 3140..3296 /gene="ENO2" /number=3 exon 3297..3355 /gene="ENO2" /number=4 intron 3356..3661 /gene="ENO2" /number=4 exon 3662..3731 /gene="ENO2" /number=5 intron 3732..4201 /gene="ENO2" /number=5 exon 4202..4335 /gene="ENO2" /number=6 intron 4336..4560 /gene="ENO2" /number=6 exon 4561..4783 /gene="ENO2" /number=7 intron 4784..6187 /gene="ENO2" /number=7 repeat_unit 4977..5260 /gene="ENO2" /note="ALU-like element" repeat_unit 5323..5625 
/gene="ENO2" /note="ALU-like element" exon 6188..6385 /gene="ENO2" /number=8 intron 6386..8197 /gene="ENO2" /number=8 exon 8198..8399 /gene="ENO2" /number=9 intron 8400..8672 /gene="ENO2" /number=9 exon 8673..8781 /gene="ENO2" /number=10 intron 8782..8960 /gene="ENO2" /number=10 exon 8961..9019 /gene="ENO2" /number=11 intron 9020..9347 /gene="ENO2" /number=11 exon 9348..10312 /gene="ENO2" /number=12 polyA_signal 10294..10299 /gene="ENO2" polyA_site 10312 /gene="ENO2" BASE COUNT 2285 a 2856 c 3103 g 2661 t ORIGIN 1 ctgcagggga aagttggtgg tgtatgcagc tggacctagg agagaagcag gagaggaaga 61 tccagcacaa aaaatctgaa gctaaaaaca ggacacagag atgggggaag aaaagagggc 121 agagtgaggc aaaaagagac tgaagagatg agggtggccg ccaggcactt tagatagggg 181 agaggcttta tttacctctg tttgtttttt tttttttttt tttttttttt tttttgcgag 241 gtagtcttgc ttagtctcca ggctggagtg cagtggcaca atctcagctc actgcaactt 301 ccacctcctg ggttcaagca attctcctgc ctcagcctcc cgagtagctg ggactacagg 361 cgcatgcaac cgcgcctggc taatttttgt atttttagta gaaacggggt ttcaccacgt 421 tagccaggat ggtctggatc tcctgacctc gtgatctgcc cgcctccgcc ttccaaagtg 481 ctggattaca ggggtgagcc acagcgcctg gtccctattt acttctgtct tctacctcca 541 ggagatcaaa gacgctggcc ttcagacctg atcagactcc caggggcagc caccacatgt 601 atgacagaga acagaggatg cctgtttttc cccaaagctg gaaattcatc acaacctgag 661 gcccaggatc tgctctgtgc cggtcctctg ggcagtgtgg ggtgcagaat ggggtgccta 721 ggcctgagcg ttgcctggag cctaggccgg gggccgccct cgggcaggcg tgggtgagag 781 ccaagaccgc gtgggccgcg gggtgctggt aggagtggtt ggagagactt gcgaaggcgg 841 ctggggtgtt cggatttcca ataaagaaac agagtgatgc tcctgtgtct gaccgggttt 901 gtgagacatt gaggctgtct tgggcttcac tggcagtgtg ggccttcgta cccgggctac 961 aggggtgcgg ctctgcctgt tactgtcgag tgggtcgggc cgtgggtatg agcgcttgtg 1021 tgcgctgggg ccaggtcgtg ggtgccccca cccttccccc atcctcctcc cttccccact 1081 ccaccctcgt cggtccccca cccgcgctcg tacgtgcgcc tccgccggca gctcctgact 1141 catcgggggc tccgggtcac atgcgcccgc gcggccctat aggcgcctcc tccgcccgcc 1201 gcccgggagc cgcagccgcc gccgccactg ccactcccgc tctctcagcg ccgccgtcgc 1261 
caccgccacc gccaccgcca ctaccaccgt ctgagtctgc agtcccgagg tgaagccccc 1321 gccaggccca gagcccctgt ggccccctcc cttcgccgcg gcgcccctgc ctcctttacc 1381 cggtgccgct gcgcacctct ccgcatctct ggcccggtgc agctgcgcac ctgctccgcc 1441 gcccgcgccc agggcgcctt ccctccctgg ccttccccgc ccgctgcctt cgctttctgg 1501 ccctctgcgc gcctatctct aactgcgcct ctccaccctt gcctgcctct ctcccggtgc 1561 tcgccctcat ctacgtctat tgttctaagt tggacgtctc cggcgtgcat tttatcctag 1621 agcgtccctt ttggtttgca tttgggaaat gtcttctctc accgttcctc acctccccca 1681 gacttccctt gactcccctc tctcctgctc ttccccacgg cgcccctctc cgttcgcgct 1741 tcctcccctc tgctgcacgg gggagagatg gaagaagtgg ggatctgtcg aggcgcagag 1801 gggaggacag gcccagctcg cctccactcc ccaccggctc ttatcctctt cacttcccgc 1861 tgcaaccccc agggactgca gggcctttct caggaccctc ttccccgaca cctgtattgc 1921 atgcgccttt cgcgggaaga ggagggatac acgtttggga gagagtggga ccttggggaa 1981 ggggcagtgt tcaggcagct ggggtgttaa gttggggagg tatgggggct ctggaagaga 2041 ggcccaggca gcccatcctc ttcttggctc ccaggagaaa tgcaagcctg gcagccattc 2101 cgctctgagg agatctggga acccaccggc tggccaagct gcaaagaggc gcgggaacat 2161 gtcgtgccct ccctcctcac tgcagtgttc tccatctcat catttgatct tacccgcccg 2221 ggactgcagt gacctcccct tcctcactgc attgaccttc tcctcttcca ggggggaggg 2281 ggaatccctt ctacagccct agactgcagc cagcgcctcc ctacccaccc cccacccact 2341 gcagtaacct ctttcccatc cctcccagcc tcccgccccg cccattgatc tcaggctcca 2401 cccctctaag cctcttatct ttctccttcc ttcccttcca cccgaggaga tcccagccat 2461 catgtccata gagaagatct gggcccggga gatcctggac tcccgcggga accccacagt 2521 ggaggtggat ctctatactg ccaaaggtaa tgggtgtggc atgggccttc ctacagccct 2581 agcttttcca cggccaggct gggttcggcc aggggttcgg aggccttttt tgatacccag 2641 gggtatgggg tgctgggcca ggctcacaag cctgggttgt ggcggttgga tcctccttgt 2701 ccgggagcca gggtaggtgg gtctgtgttc gagtttagcg tgtgagcatg tcctgcctcc 2761 gtgtgtctgc ctgtgcactt gcatgtgtgc agacgtgtgc tgcaagcaat tttctttctg 2821 cgggctccca cttgtgcatg tggggcctca gatgggtgga tgaggaggcc acttcttgtg 2881 catctgcctc agtgtgtgtg tggggtgggg gtggggggat gttagggagc agtgtgggga 2941 gagagctaga 
tgtggtaggc aggcagttta gagccagaag gctgaaggac tccctggccc 3001 ctgtgtcctc ttcactccct ctcattccat gttcctcctt taggtctttt ccgggctgca 3061 gtgcccagtg gagcctctac gggcatctat gaggccctgg agctgaggga tggagacaaa 3121 cagcgttact taggcaaagg tgaggtccct tctcttttcc agactctccc ccacctcagc 3181 cttatgcccc tacctcacac cagtccccag tcctcctcta gcatggcttc ccctcctccc 3241 attgatccct tccgcccctc ctggcccgac ccagtccagc ctcttccttt ccccaggtgt 3301 cctgaaggca gtggaccaca tcaactccac catcgcgcca gccctcatca gctcagtgag 3361 gcctgctctt tgctggggat agcagggcca gagttctgga aggaatcccg gagcagggca 3421 ggaggaaggg aagaaagaag gcccactctt aggaatcatg gttacaaggg ggaagggtgg 3481 ggaacagctt ccttaatgca ccctgctccc atgggagttc aggtccccta atccaggtag 3541 gcccctgtca cagggacctg gttggaccct ggccaaatgt gagcttgggt gtgaatgagg 3601 ggaccctctg ccttaggctc agcctccagc ctggccctgg gtgatggagc tctgccctca 3661 gggtctctct gtggtggagc aagagaaact ggacaacctg atgctggagt tggatgggac 3721 tgagaacaaa tgtgagccgg gccggagaaa gtgggaagcg tcagggtggg gaggcgtgga 3781 gcagatagag agctgaaggg ccagtgctgt agtggcttcc tcaggaatga ctgtcagggg 3841 cattctcctc tcaaagccag agcaagggga gatgagttta gctgcagagg gaaggaccga 3901 cagtaggcag aaggaagacc ttctttgcag catacagagg agggggatgg cctgagagag 3961 tctgtggtct cagggatatt tagaaagagg tgtggctctc tgccgtttcc aatctcctcc 4021 tccccaccca ttcctctccc tgctgttttc aagagagcat ggatgaggtg ttgctggggc 4081 aggggtgggg agggaggagg gggtctcctt tactggctcc ttttggagac tacagatgga 4141 ggcaggagct agaaaggaga aggggacatt tggctcagca ccttcctcta taatctccta 4201 gccaagtttg gggccaatgc catcctgggt gtgtctctgg ccgtgtgtaa ggcaggggca 4261 gctgagcggg aactgcccct gtatcgccac attgctcagc tggccgggaa ctcagacctc 4321 atcctgcctg tgccggtgag caataagcca cggtgcggct ctcccagggg cgggtggggg 4381 agggagcatg caactcatga ggaatgatgg gaggaaagtg aattgaggga ggtaaagagg 4441 aaggatgggg acgtgagact tagtccggaa agctggggga agtttgggat cttgggttaa 4501 cactcctggg gcgggcaggg aggggctctt tgacccttct gtctttctgt ggctccccag 4561 gccttcaacg tgatcaatgg tggctctcat gctggcaaca agctggccat gcaggagttc 4621 atgatcctcc cagtgggagc 
tgagagcttt cgggatgcca tgcgactagg tgcagaggtc 4681 taccatacac tcaagggagt catcaaggac aaatacggca aggatgccac caatgtgggg 4741 gatgaaggtg gctttgcccc caatatcctg gagaacagtg aaggtgaggc caggagcccc 4801 actcccagcg tgctaagtct taccctattg tgggacatca gaaagggtga cacagttcac 4861 caagtcctga gtaggcgtgg agggtcctag gactctgcaa actccaaaag gtaccagttc 4921 ttagagtgga ttgcagagag cctgccaaat tcacatgcag acctaggggg acagtatttt 4981 tttttttttt tgagacggag tcttgctctg tcacccaggc tggagtgcag tggctcgatc 5041 tcagctcact gcaacgtccg cctcccgggt tcacgccatt ctcctgcctc agcctcccga 5101 gtagctggga ctacaggcgc ccgccaccac tcccggctaa ttttttgtat tttttagtag 5161 agacagggtt tcactgtggt ctcgaactcc tgaccttgtg atcggcccgc ctcggcctcc 5221 caaagtgctg ggattacaag cgtgagccac cacgcctggc caggggacag tcttttacct 5281 gcctagccag atgtgttagc atctgttaag ttgccactgg aaggccgggc gcggtggctc 5341 acacctgtaa tcccagcact ttgggaggct gaggcgggtg gatcacctga ggtcaggagt 5401 ttgagaccag cctggccaac aaggtgaaac cccgtctcta ctaaaaatac aaaaattagc 5461 cgggcatggt ggcgtgtgcc tgtaatcaca gatactagcg gggctgaggc aggaggatcg 5521 cttgtacccg ggaggcggag gttgtggtga gccgagatca tgccactgca ctccagcctg 5581 ggcaacagag cgagctccgt ctcaaaaaaa aaaaaaaaaa aaaaagttgt cactggagcc 5641 cttgggaata ctgggagatg gtctggatga cctgttggat tcatcccatc ccattcgaaa 5701 tctgtcctgt ccccgtccca acctcctagg cctctagaat ccctaatttt tctgtgcctt 5761 gggggaaact gtatagggaa tggaaagaat ataggtagtg gttaagagtt aaggttctgg 5821 ggccaaataa cctggattta tttgaacctt gacttgtgga gttactgctg gtgaactttc 5881 ttacttctct ctggaaataa taacagaatc tagctcatgg tagtgtgagg cttaaatgaa 5941 atatatataa aatgcttaga tgacatgata ccaatgaaag tatagtaagt attattaaga 6001 gaattccatt cctctgtgtt cctagaagat ggcctctcct ccagacctgg gataacccca 6061 acagcatccc cgccacactt ccctcaggaa cagacctcca cctctgccct gaatgtcttt 6121 tctttccctc ctcctccttg cccatccctc ctgcttgtac tataatctca ctgtattctg 6181 tccccagcct tggagctggt gaaggaagcc atcgacaagg ctggctacac ggaaaagatc 6241 gttattggca tggatgttgc tgcctcagag ttttatcgtg atggcaaata tgacttggac 6301 ttcaagtctc ccactgatcc ttcccgatac 
atcactgggg accagctggg ggcactctac 6361 caggactttg tcagggacta tcctggtgag aggaagtggt gtgaggggga ggtctggggg 6421 caggcaggga cgtgtcccag caactctgga ccttatgggg tgctgactca ggcaccaggt 6481 gggggtgtcc taagaagaac ctgagaacca gggagagggt gcaggagcca cctgcaaaga 6541 ctgggctttg tatgtagtgt aaaaaatgca ggtacccgtg accaatctgt tctgtctcag 6601 atcctgatta aagtcatggg ttctgaaaac tactggggtc atggggaagg ctctggaagg 6661 aaccagggat gatatgagtg tgactggatc tagcagagaa ctagaacatt tcagtatctt 6721 tgattgatga aattgtggat gctgaatgga gctgggactg atgtgtgagt agaaagaagg 6781 ctgaggggga ctgaattagc ttctgcaagt ctgcccaggg cctttatctc aggatggagg 6841 cagttggtcg ccatcttcct gacacagagc gaaagaaaat caacataatt gcagaattaa 6901 ggatttggat tagcaatgag gaagggcttg ctggcagtga gaacaggaaa atgcaaagct 6961 aggttgtgaa cccttgttac tggaagattt tttttggggg gggggtgggg ttgttttttt 7021 tgttttgttt tttttttttg agacagagtt tgactcttct tgcccaggct ggagtgcaat 7081 ggcgccatct cggctcacca caacctccac tcccggattc aagcgattct tctgcctcag 7141 cctcccgaat agctgggatt acagcatgtg ccaccatgcc tggctaattt tgtattttta 7201 gtagagacag gggtttctcc atgttggtca ggctggtctc caactcctga cctcaggtaa 7261 tctatccacc ttggtctccc aaagtgctgg gattacagcg tgagccacca cgcctggctg 7321 gtttgctttt aattaacttt tttttttttt tttttttttt gtagagacag ggtttaccat 7381 gttgctcatg gctggtctca aactactggg ctcaagtgat ctgcatgcct cggcctccca 7441 aagtgctggg attacaggca tgagccactg tgcccagcct atggaagtgg cgtgagatct 7501 ctgctcactg cacgattctc ttagcctccc aagtagctgg gattacaggc gtgcaccacc 7561 atgcctgcta atttttgtat ttttagtaga gatggggttt tactatgtta gccagggaac 7621 tcctatcctc aagtgatccg ttcacctcag tatcccaaag tgctgggatt acaggcatga 7681 gccactgtgc ctggcctctc catgtaaggt tttatgaaat aagaatcagg agccaggcgt 7741 ggtgctcatg cctgtaatcc cattactttg ggaagccgag gcaggaggac tgcttgaatc 7801 caggagttcg agactggcct ggcaatacag tgagacctca tctctacaaa aattttaaaa 7861 attagctgag tgtggtgcca cacacctaaa gtccctgcta ctcaggaggc tgaggtggca 7921 ggatcacttg attggggagg tggaggttgc agtgagctga gattgtgcca ctgcactcca 7981 gcctggatga cagaatgagg ctctgtcaaa aaaaaaaaaa 
aagttaagaa tcagttaggg 8041 caggcttaca ctggggggat ttgtcttagc aaggatgagc aggtgtagtt aaccaagggc 8101 ctgtccattt cagggaataa aggggcatgt tcctgcctga tgtagagacc cagggaagat 8161 gaacacctcc cccctcccca ccatgctctc tctgcagtgg tctccattga ggacccattt 8221 gaccaggatg attgggctgc ctggtccaag ttcacagcca atgtagggat ccagattgtg 8281 ggtgatgacc tgacagtgac caacccaaaa cgtattgagc gggcagtgga agaaaaggcc 8341 tgcaactgtc tgctgctcaa ggtcaaccag atcggctcgg tcactgaagc catccaagcg 8401 tgagtgactt ctggccctct cctgtgtggt cctcgtttct ataagactcc ttttgcaagt 8461 gctccagcct aattctaccc aggggtgcca aagagagcgg ggaacctgga atcatcctca 8521 cagttctctc acctctgccc ctccacccct gattctctgc tcccctccca gatagctttc 8581 ccctagatgt ttcctgacat agaccaaggt tggggctggg aagagagtgc ccagtgtgag 8641 agctggagaa tcagtgctgt gtgtggatac aggtgcaagc tggcccagga gaatggctgg 8701 ggggtcatgg tgagtcatcg ctcaggagag actgaggaca cattcattgc tgacctggtg 8761 gtggggctgt gcacaggcca ggtgagtgag gcagcctggt gagtgaagag aactctctgt 8821 gggattggta tttctagctc acccacctgg tctctccttc caggtgtttg agggtgtcag 8881 gggagtttca ggagagcaga agtttccttt caggggtgag agggcagtca ctgagctgca 8941 aatcctttga aatgtttcag atcaagactg gtgccccgtg ccgttctgaa cgtctggcta 9001 aatacaacca gctcatgagg tgagggtccc tggggtggga gcccctggcc cagatggcta 9061 aaggccccat ttgcctgcca gaccatctgt agcaccaagg gcctggataa cagtccattt 9121 cctggataac agtccaacag ataatattgg tttttgcttc ctgggtttat tgatggcctg 9181 attgacaaat cccagagatc acatgggaaa gccagggaat gctaagcctt ggggcaggac 9241 acaaaagcag gtggtgtggg ggtggttgga gtctggggga cccctagaga gagaagcagg 9301 atcctcctgc atccctgacc acttcctttg tggttcatct ctctcagaat tgaggaagag 9361 ctgggggatg aagctcgctt tgccggacat aacttccgta atcccagtgt gctgtgattc 9421 ctctgcttgc ctggagacgt ggaacctctg tctcatcctc ctggaacctt gctgtcctga 9481 tctgtgatag ttcaccccct gagatcccct gagccccagg gtgcccagaa cttccctgat 9541 tgacctgctc cgctgctcct tggcttacct gacctcttgc tgtctctgct cgccctcctt 9601 tctgtgccct actcattggg gttccgcact ttccacttct tcctttctct ttctctcttc 9661 cctcagaaac tagaaatgtg aatgaggatt attataaaag ggggtccgtg 
gaagaatgat 9721 cagcatctgt gatgggagcg tcagggttgg tgtgctgagg tgttagagag ggaccatgtg 9781 tcacttgtgc tttgctcttg tcccacgtgt cttccacttt gcatatgagc cgtgaactgt 9841 gcatagtgct gggatggagg ggagtgttgg gcatgtgatc acgcctggct aataaggctt 9901 tagtgtattt atttatttat ttattttatt tgtttttcat tcatcccatt aatcatttcc 9961 ccataactca atggcctaaa actggcctga cttgggggaa cgatgtgtct gtatttcatg 10021 tggctgtaga tcccaagatg actggggtgg gaggtcttgc tagaatggga agggtcatag 10081 aaagggcctt gacatcagtt cctttgtgtg tactcactga agcctgcgtt ggtccagagc 10141 ggaggctgtg tgcctggggg agttttcctc tatacatctc tccccaaccc taggttccct 10201 gttcttcctc cagctgcacc agagcaacct ctcactcccc atgccacgtt ccacagttgc 10261 caccacctct gtggcattga aatgagcacc tccattaaag tctgaatcag tgcactgttg 10321 tgtctaagga gtcttactct agtccctatg aggggagaga agatggagca cctggaagct 10381 ggtgaaactg gatagcagag ctgggggggc acaaaaagag gaagacaaac tgaacaaata 10441 tggccgagat gatggcactg cctaccccat tctggctagg tggggtgcat gtggcccctg 10501 ctttcttagc agaaggcttg gctcccagac gcaggtgaat taaggggttc aagagcccct 10561 aaaagcataa aatattttgt gtgtgtgtgt gtgtgcacgc gcattttggg ggaaaggggg 10621 tctaaggtgt tttcatatcc aaagggcttg tggactggag cagctcctgt actgggcctc 10681 tgccaacaaa accctggctg gttctcgaat ggaacaggac ttcatggcca tcacccactg 10741 caagatgggg aaatgggaag gaagaatggt tccgggggta gtatacggaa ggacctaagg 10801 aaacagagtc ctcaataaac tgaagattca ggaacaaaag tgcttaacag aaccctggct 10861 gggtcagact aacagtaggt ttccaatatg tggctagaga cgtac // LOCUS HSEOTAX 5533 bp DNA PRI 30-SEP-1997 DEFINITION H.sapiens eotaxin gene. ACCESSION Z92709 NID g2462477 KEYWORDS eotaxin gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 39) AUTHORS Hein,H., Schluter,C., Kulke,R., Christophers,E., Schroder,J.M. and Bartels,J. TITLE Genomic organization, sequence, and transcriptional regulation of the human eotaxin gene JOURNAL Biochem. Biophys. Res. Commun. 
237 (3), 537-542 (1997) MEDLINE 97445071 REFERENCE 2 (bases 1 to 5533) AUTHORS Hein,H. TITLE Direct Submission JOURNAL Submitted (06-MAR-1997) Hein H., Christian-Albrechts-Universitaet zu Kiel, Dermatology/Hautklinik, Mol.Biol.Lab.609, Schittenhelmstr. 7, Kiel, Schleswig-Holstein, D-24105, Germany REMARK Revised by author 23-JUN-97 FEATURES Location/Qualifiers source 1..5533 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /cell_type="fibroblast" TATA_signal 3116..3121 exon 3159..3290 /number=1 mRNA join(3159..3290,4502..4613,4992..5533) gene 3215..5097 /gene="eotaxin" CDS join(3215..3290,4502..4613,4992..5097) /gene="eotaxin" /codon_start=1 /db_xref="PID:e329504" /db_xref="PID:g2462478" /translation="MKVSAALLWLLLIAAAFSPQGLTGPASVPTTCCFNLANRKIPLQ RLESYRRITSGKCPQKAVIFKTKLAKDICADPKKRWVQDSMKYLDQKSPTPKP" intron 3291..4501 /gene="eotaxin" /number=1 exon 4502..4613 /gene="eotaxin" /number=2 intron 4614..4991 /gene="eotaxin" /number=2 exon 4992..5533 /number=3 BASE COUNT 1507 a 1320 c 1140 g 1531 t 35 others ORIGIN 1 sncmgagtca gatacaggga armcssccmg ggggvgcybc ccggggggac acccsgggat 61 actttttarg ycttgggggg ttyssccccc yttkmtttgm ccgcgatttt tktgatgccc 121 cgcmrggggg gsggagccyt wkggaaaaac gccagcaacg gggccttttt avggkbcctg 181 gccttttgbt ggcyttttgc ctcacatgtt ctttcctgcg ttatcccctg attctgkgga 241 taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 301 cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc 361 gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag 421 tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt 481 tatgcttccg gcccgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 541 cagctatgac catgattacg ccaagctatt taggtgacgc gttagaatac tcaagctatg 601 catcaagctt ggtaccgagc tcggatccac tagtaacggc cgccagtgtg ctggaattca 661 ggctctaata cgactcacta tagggcgtcg actcgatcaa ggcccctcat tcatcaggca 721 ccaaatcaca ggcaagaaat tggtctactc tacttcattg ctgaaaagaa agcctccaag 781 aaagaaagag aaaaagccct tgaagctgcc cctgcttgct 
agggatggcc agtgagctgg 841 atgaaacagc tgcagagctt ggatatctaa agattgattt tatttatttc cactgagcca 901 ctgaggtttt tcaggcgtag agtaaatcca aagcccatta actccaaagc cattttccaa 961 ggaaaacaca aaaatgaata atcgcttgct gataattcct ttaattcccg ccagagttat 1021 gaaattattc ttactacaga aacaatcatt aaaccacgtc agacaaacac ctctgaggac 1081 aataatattc ccaaatttat tctgccgaga attttttaaa acttcgtatc tgccacatgt 1141 gcaactcagg cattcttgga aaccatccct gcggtagcat atctttactt ggaagcagtg 1201 ggaaaacttg gaagaatatt tggagaaata tccagactct gaggtctttc caacattccc 1261 aatctgccct tctttttttt tttttttgag acacagtctc actctgttgc ccaggttgga 1321 gtgcagtggc atgatctctg cccattgcaa cctttgtccc ctgtgttcaa gcaattctca 1381 tgcctcagct tcctgagtag ctgagactac aaacaccatc caccatggct ggctaatttt 1441 ttgtattttt agtaaagaca ggttttcacc atgttgccca ggctggtctt gaactcctga 1501 cctcgagtga tctgcctgcc tcagtctccc aaagtgctgg gattacaagt gtgagccact 1561 gtacccggcc ctattcccta ataataaaaa acagtaactt ctactcactg agtatctaca 1621 gtgatttagt cattcttcta agcagtttag atacatgaac tcatttaatt cacacgacaa 1681 ttccctaaag tacacatgat tagtatcctg ctcttaccct agcagagagt ggttaagtaa 1741 gttgtccatg gtatcacagc tagtcagtca cagagccatc atccaaatgc agatatcctg 1801 aattcaggtt ctacattaga ctaacccacc gggaatggag caggaaagaa cagggaagac 1861 tccacatttt tggcctctat ttggtaatta tagttaactt tttaggtaat tatagaccaa 1921 ttatcctaga tgggcactta gagactttgc aggacagcaa gagctgtctc taatcctgtg 1981 cccatgacag acatcaccag tcaaccacaa cacagtattt aactaacgca agtcaactcc 2041 tcagaatctt taacatacct tgtttgtgct actgtaccaa tcaatcaatt tgatatgaga 2101 gtgtgcagga aaaaacagga aacaggtttg cagtacctcc acaccagtat tcaatgctgt 2161 aatccgctgc agtgactcca ttaaatactt tgcctccctt ataccctctc caactagggt 2221 gcctagtgtt atgaacaaag ggatatgtat aggttcttgt gttgcctctc tctttgatat 2281 ttttagccat cagatacctt gtctgcaatg tgtgctcaga gagtgagggg ggaactagat 2341 gattgatttt ccaaawtgtg tccctaaacg tgctccctgg ggaataaggg cacgagaggc 2401 tgcctattct atttcaaaca aatccccttc actacagtgt atttgatgag ttggggtttg 2461 ttttaattcc atttggaaaa gggctttagc agctaagcaa atggttttaa 
agtgcctcag 2521 aagtcaagat taatagaaac tatccagttc tgatgtccta tcatgctaaa atttcaggga 2581 ctaagattct gtgatcatta cattgaaaca cagcagcaaa gctgtggtgt gttgtccttc 2641 ctggttcaga gatgcaacta tgtgcagggc tgctgagctc tctctgcatc tgggtgggag 2701 cctaatggaa gttttggggc tccttactgg tctccaaaat cctcaagacc accatgtgaa 2761 cacaggaatc aaggaaggtt cttagatcga ctcatccccc aggcctttgg tttccttgct 2821 cctttccccg actacaggtg tttcatttca actcatcccc tagggccttg gttttcttgc 2881 tctcttcccc cactacagat gtttaacttc atttcataac cacatattcc cctccttttc 2941 caaggcaaga tccagatgga ttaaaaattg taccaagtcc ctcctactag tcttgcctct 3001 cttctgttct gcttgacttc ctaggatctg gaatctggtc agcaatcagg aatcccttca 3061 tcgtgacccc gcatgggcaa aggcttccct ggaatctccc acactgtctg ctccctataa 3121 aaggcaggca gatgggccag aggagcagag aggctgagac caacccagaa accaccacct 3181 tctcacgcca aagctcacac cttcagcctc caacatgaag gtctccgcag cacttctgtg 3241 gctgctgctc atagcagctg ccttcagccc ccaggggctc actgggccag gtaagccccc 3301 caactcctta caggaaaggt aaggtaacca cctccagagc tactaggtca gcaagaatct 3361 ttacagactc actgcaaatt ctccatttga aaaataggga aacaggtttt gtgggtggac 3421 aagaaatgcc tcaaccgtca catccagtca ctggaagagc cagaactaga aagctcccga 3481 gtcttttccc acattcaaga gggttgctgg gtgcatccat acccagctat ccttacagtg 3541 tttgggaatg gggaatggct ctgtcttact gtgggcatgg tgggcatttt tggcagtggg 3601 agagaaggaa aatctgttga ttagaagctc agtatgttaa ttcgactcca ggacagcttt 3661 cagagacagt ggctaagaga agaacgaggt cccaggggga tctcttgagg tgacttattt 3721 tgacactctt tgggaaagtt atctaggaga tttgttccat aactcatttt cccatactct 3781 ggtgacaaat ttactgagtg tatcggtccc actgagccag tgcatagcat ggtaacaaac 3841 agttctaaat tatcaatgac ttaacagaat taactaaatt aacaaaagtt actttctcac 3901 ttgtactaaa tatctataat gtatgggctc aggcttctgc attttatact caggattcta 3961 gactgatgga gaagttgccc atgtggggga acattgatgg atactgtgat aaagcagaag 4021 aaagctctca ggagtcttgc ataggcaatg cactgtggct caaaaatgac acccatcact 4081 ttgtctcctt ctttattgat caaaactaat taatgcctcc aaccaaacaa aagtggccaa 4141 gaaatgcaag tctaccttgt gtctcaaaac agaggatgga gaatatttgg tgaaaattac 
4201 catgaccatc acatggccac gtaggtcttt ataatgacag agctagcatt tgtcacattg 4261 accaagcttt gtccatacac tctacagtaa tgatgagtcc tcagtgcaca ggggaggatg 4321 ctgaagacac aggacagcat cctccagaca cataagactt cagagcagag ggattctccc 4381 tccacctctc gcaattcctt gctttctcct aacttccttt acaaagtcat gcttggaaat 4441 gtctatgtat catcatgtgg ctcatttttt tctctgttca ctttttttcc ccaaaattca 4501 gcttctgtcc caaccacctg ctgctttaac ctggccaata ggaagatacc ccttcagcga 4561 ctagagagct acaggagaat caccagtggc aaatgtcccc agaaagctgt gatgtaagta 4621 aataaagttc accctcccct ggacaaaaaa ataatgtcta gggcacagag tcaagaactg 4681 tgtcacagtt gctgggagtc atagactctg atagtttgac ctctatggtc caattcatta 4741 attttcacaa gtgagtgttc actcccagct ccctgcctgg gagattgctg tagtcatatc 4801 aatttcttca agtcaagagc aaagatggtt ttactgggcc tttaagagca gcaactaacc 4861 caagagtctc atccttcctc ctctccgtag caaccctttg tccaggggca gatggtcctt 4921 aaatatttgg ggtcaaatgg gcagaatttt caaaaacaat ccttccaatt gcatcctgta 4981 tctccccaca gcttcaagac caaactggcc aaggatatct gtgccgaccc caagaagagg 5041 tgggtgcagg attccatgaa gtatctggac caaaaatctc caactccaaa gccataaata 5101 atcaccattt ttgaaaccaa accagagcct gagtgttgcc taatttgttt tcccttctta 5161 caatgcattc tgaggtaacc tcattatcag tccaaagggc atgggtttta ttatatatat 5221 atatatatat ttttttttaa aaaaaaacgt attgcattta atttattgag gctttaaaac 5281 ttatcctcca tgaatatcag ttacttttaa actgtaaagc tttgtgcaga ttctttaccc 5341 cctgggagcc ccaattcgat cccctgtcac gtgtgggcaa tgttccccct ctcctctctt 5401 cctccctgga atctttaaag gtcctggcaa agatgatcag tatgaaaatg tcattgttct 5461 tgtgaaccca aagtgtgact cattaaatgg aagtaaatgt tgttttagga atacataaag 5521 tatgtgcata ttt // LOCUS HSERPG 3398 bp DNA PRI 22-JUN-1993 DEFINITION Human gene for erythropoietin. ACCESSION X02158 NID g31224 KEYWORDS erythropoietin; glycoprotein hormone; hormone; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3398) AUTHORS Jacobs,K., Shoemaker,C., Rudersdorf,R., Neill,S.D., Kaufman,R.J., Mufson,A., Seehra,J., Jones,S.S., Hewick,R., Fritsch,E.F., Kawakita,M., Shimizu,T. and Miyake,T. TITLE Isolation and characterization of genomic and cDNA clones of human erythropoietin JOURNAL Nature 313 (6005), 806-810 (1985) MEDLINE 85137899 COMMENT Data kindly reviewed (24-FEB-1986) by K. Jacobs. FEATURES Location/Qualifiers source 1..3398 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(397..627,1194..1339,1596..1682,2294..2473,2608..3327) exon 397..627 /number=1 sig_peptide join(615..627,1194..1261) CDS join(615..627,1194..1339,1596..1682,2294..2473,2608..2763) /codon_start=1 /product="erythropoietin" /db_xref="PID:g312304" /db_xref="SWISS-PROT:P01588" /translation="MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVLQRYLL EAKEAENITTGCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSEAVL RGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPDAASAAPLRTI TADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR" intron 628..1193 /number=1 exon 1194..1339 /number=2 mat_peptide join(1262..1339,1596..1682,2294..2473,2608..2763) /product="erythropoietin" intron 1340..1595 /number=2 exon 1596..1682 /number=3 intron 1683..2293 /number=3 exon 2294..2473 /number=4 intron 2474..2607 /number=4 exon 2608..3327 /note="3' untranslated region" /number=5 BASE COUNT 698 a 1034 c 991 g 675 t ORIGIN 1 agcttctggg cttccagacc cagctacttt gcggaactca gcaacccagg catctctgag 61 tctccgccca agaccgggat gccccccagg aggtgtccgg gagcccagcc tttcccagat 121 agcagctccg ccagtcccaa gggtgcgcaa ccggctgcac tcccctcccg cgacccaggg 181 cccgggagca gcccccatga cccacacgca cgtctgcagc agccccgtca gccccggagc 241 ctcaacccag gcgtcctgcc cctgctctga ccccgggtgg cccctacccc tggcgacccc 301 tcacgcacac agcctctccc ccacccccac ccgcgcacgc acacatgcag ataacagccc 361 cgacccccgg ccagagccgc agagtccctg ggccaccccg gccgctcgct gcgctgcgcc 421 gcaccgcgct gtcctcccgg agccggaccg gggccaccgc gcccgctctg ctccgacacc 481 gcgccccctg gacagccgcc ctctcctcca ggcccgtggg gctggccctg caccgccgag 541 
cttcccggga tgagggcccc cggtgtggtc acccggcgcc ccaggtcgct gagggacccc 601 ggccaggcgc ggagatgggg gtgcacggtg agtactcgcg ggctgggcgc tcccgcccgc 661 ccgggtccct gtttgagcgg ggatttagcg ccccggctat tggccaggag gtggctgggt 721 tcaaggaccg gcgacttgtc aaggaccccg gaagggggag gggggtgggg cagcctccac 781 gtgccagcgg ggacttgggg gagtccttgg ggatggcaaa aacctgacct gtgaagggga 841 cacagtttgg gggttgaggg gaagaaggtt tggggggttc tgctgtgcca gtggagagga 901 agctgataag ctgataacct gggcgctgga gccaccactt atctgccaga ggggaagcct 961 ctgtcacacc aggattgaag tttggccgga gaagtggatg ctggtagcct gggggtgggg 1021 tgtgcacacg gcagcaggat tgaatgaagg ccagggaggc agcacctgag tgcttgcatg 1081 gttggggaca ggaaggacga gctggggcag agacgtgggg atgaaggaag ctgtccttcc 1141 acagccaccc ttctccctcc ccgcctgact ctcagcctgg ctatctgttc tagaatgtcc 1201 tgcctggctg tggcttctcc tgtccctgct gtcgctccct ctgggcctcc cagtcctggg 1261 cgccccacca cgcctcatct gtgacagccg agtcctgcag aggtacctct tggaggccaa 1321 ggaggccgag aatatcacgg tgagacccct tccccagcac attccacaga actcacgctc 1381 agggcttcag ggaactcctc ccagatccag gaacctggca cttggtttgg ggtggagttg 1441 ggaagctaga cactgccccc ctacataaga ataagtctgg tggccccaaa ccatacctgg 1501 aaactaggca aggagcaaag ccagcagatc ctacgcctgt ggccagggcc agagccttca 1561 gggacccttg actccccggg ctgtgtgcat ttcagacggg ctgtgctgaa cactgcagct 1621 tgaatgagaa tatcactgtc ccagacacca aagttaattt ctatgcctgg aagaggatgg 1681 aggtgagttc cttttttttt ttttttcctt tcttttggag aatctcattt gcgagcctga 1741 ttttggatga aagggagaat gatcgaggga aaggtaaaat ggagcagcag agatgaggct 1801 gcctgggcgc agaggctcac gtctataatc ccaggctgag atggccgaga tgggagaatt 1861 gcttgagccc tggagtttca gaccaaccta ggcagcatag tgagatcccc catctctaca 1921 aacatttaaa aaaattagtc aggtgaagtg gtgcatggtg gtagtcccag atatttggaa 1981 ggctgaggcg ggaggatcgc ttgagcccag gaatttgagg ctgcagtgag ctgtgatcac 2041 accactgcac tccagcctca gtgacagagt gaggccctgt ctcaaaaaag aaaagaaaaa 2101 agaaaaataa tgagggctgt atggaatacg ttcattattc attcactcac tcactcactc 2161 attcattcat tcattcattc aacaagtctt attgcatacc ttctgtttgc tcagcttggt 2221 gcttggggct 
gctgaggggc aggagggaga gggtgacatc cctcagctga ctcccagagt 2281 ccactccctg taggtcgggc agcaggccgt agaagtctgg cagggcctgg ccctgctgtc 2341 ggaagctgtc ctgcggggcc aggccctgtt ggtcaactct tcccagccgt gggagcccct 2401 gcagctgcat gtggataaag ccgtcagtgg ccttcgcagc ctcaccactc tgcttcgggc 2461 tctgggagcc caggtgagta ggagcggaca cttctgcttg ccctttctgt aagaagggga 2521 gaagggtctt gctaaggagt acaggaactg tccgtattcc ttccctttct gtggcactgc 2581 agcgacctcc tgttttctcc ttggcagaag gaagccatct cccctccaga tgcggcctca 2641 gctgctccac tccgaacaat cactgctgac actttccgca aactcttccg agtctactcc 2701 aatttcctcc ggggaaagct gaagctgtac acaggggagg cctgcaggac aggggacaga 2761 tgaccaggtg tgtccacctg ggcatatcca ccacctccct caccaacatt gcttgtgcca 2821 caccctcccc cgccactcct gaaccccgtc gaggggctct cagctcagcg ccagcctgtc 2881 ccatggacac tccagtgcca ccaatgacat ctcaggggcc agaggaactg tccagagagc 2941 aactctgaga tctaaggatg tcacagggcc aacttgaggg cccagagcag gaagcattca 3001 gagagcagct ttaaactcag ggacagaccc atgctgggaa gacgcctgag ctcactcggc 3061 accctgcaaa attgatgcca ggacacgctt tggaggcgat ttacctgttt tcgcacctac 3121 catcagggac aggatgacct ggagaactta ggtggcaagc tgtgacttct ccaggtctca 3181 cgggcatggg cactcccttg gtggcaagag cccccttgac accggggtgg tgggaaccat 3241 gaagacagga tgggggctgg cctctggctc tcatggggtc caacttttgt gtattcttca 3301 acctcattga caagaactga aaccaccaat atgactcttg gcttttctgt tttctgggaa 3361 cctccaaatc ccctggctct gtcccactcc tggcagca // LOCUS HSFAU1 2016 bp DNA PRI 21-JUL-1993 DEFINITION H.sapiens fau 1 gene. ACCESSION X65921 S45242 NID g31304 KEYWORDS fau 1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2016) AUTHORS Kas,K. TITLE Direct Submission JOURNAL Submitted (29-APR-1992) K. Kas, University of Antwerp, Dept of Biochemistry T3.22, Universiteitsplein 1, 2610 Wilrijk, BELGIUM REFERENCE 2 (bases 1 to 2016) AUTHORS Kas,K., Michiels,L. and Merregaert,J. 
TITLE Genomic structure and expression of the human fau gene: encoding the ribosomal protein S30 fused to a ubiquitin-like protein JOURNAL Biochem. Biophys. Res. Commun. 187 (2), 927-933 (1992) MEDLINE 92412144 FEATURES Location/Qualifiers source 1..2016 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CML cosmid" /clone="15.1" mRNA join(408..504,774..856,951..1095,1557..1612,1787..>1912) /gene="fau 1" exon 408..504 /gene="fau 1" /number=1 gene 408..1912 /gene="fau 1" intron 505..773 /gene="fau 1" /number=1 exon 774..856 /gene="fau 1" /number=2 CDS join(782..856,951..1095,1557..1612,1787..1912) /gene="fau 1" /codon_start=1 /db_xref="PID:g31305" /db_xref="SWISS-PROT:P35544" /db_xref="SWISS-PROT:Q05472" /translation="MQLFVRAQELHTFEVTGQETVAQIKAHVASLEGIAPEDQVVLLA GAPLEDEATLGQCGVEALTTLEVAGRMLGGKVHGSLARAGKVRGQTPKVAKQEKKKKK TGRAKRRMQYNRRFVNVVPTFGKKKGPNANS" intron 857..950 /gene="fau 1" /number=2 exon 951..1095 /gene="fau 1" /number=3 intron 1096..1556 /gene="fau 1" /number=3 exon 1557..1612 /gene="fau 1" /number=4 intron 1613..1786 /gene="fau 1" /number=4 exon 1787..>1912 /gene="fau 1" /number=5 polyA_signal 1938..1943 BASE COUNT 421 a 562 c 538 g 495 t ORIGIN 1 ctaccatttt ccctctcgat tctatatgta cactcgggac aagttctcct gatcgaaaac 61 ggcaaaacta aggccccaag taggaatgcc ttagttttcg gggttaacaa tgattaacac 121 tgagcctcac acccacgcga tgccctcagc tcctcgctca gcgctctcac caacagccgt 181 agcccgcagc cccgctggac accggttctc catccccgca gcgtagcccg gaacatggta 241 gctgccatct ttacctgcta cgccagcctt ctgtgcgcgc aactgtctgg tcccgccccg 301 tcctgcgcga gctgctgccc aggcaggttc gccggtgcga gcgtaaaggg gcggagctag 361 gactgccttg ggcggtacaa atagcaggga accgcgcggt cgctcagcag tgacgtgaca 421 cgcagcccac ggtctgtact gacgcgccct cgcttcttcc tctttctcga ctccatcttc 481 gcggtagctg ggaccgccgt tcaggtaaga atggggcctt ggctggatcc gaagggcttg 541 tagcaggttg gctgcggggt cagaaggcgc ggggggaacc gaagaacggg gcctgctccg 601 tggccctgct ccagtcccta tccgaactcc ttgggaggca ctggccttcc gcacgtgagc 661 cgccgcgacc accatcccgt cgcgatcgtt tctggaccgc 
tttccactcc caaatctcct 721 ttatcccaga gcatttcttg gcttctctta caagccgtct tttctttact cagtcgccaa 781 tatgcagctc tttgtccgcg cccaggagct acacaccttc gaggtgaccg gccaggaaac 841 ggtcgcccag atcaaggtaa ggctgcttgg tgcgccctgg gttccatttt cttgtgctct 901 tcactctcgc ggcccgaggg aacgcttacg agccttatct ttccctgtag gctcatgtag 961 cctcactgga gggcattgcc ccggaagatc aagtcgtgct cctggcaggc gcgcccctgg 1021 aggatgaggc cactctgggc cagtgcgggg tggaggccct gactaccctg gaagtagcag 1081 gccgcatgct tggaggtgag tgagagagga atgttctttg aagtaccggt aagcgtctag 1141 tgagtgtggg gtgcatagtc ctgacagctg agtgtcacac ctatggtaat agagtacttc 1201 tcactgtctt cagttcagag tgattcttcc tgtttacatc cctcatgttg aacacagacg 1261 tccatgggag actgagccag agtgtagttg tatttcagtc acatcacgag atcctagtct 1321 ggttatcagc ttccacacta aaaattaggt cagaccaggc cccaaagtgc tctataaatt 1381 agaagctgga agatcctgaa atgaaactta agatttcaag gtcaaatatc tgcaactttg 1441 ttctcattac ctattgggcg cagcttctct ttaaaggctt gaattgagaa aagaggggtt 1501 ctgctgggtg gcaccttctt gctcttacct gctggtgcct tcctttccca ctacaggtaa 1561 agtccatggt tccctggccc gtgctggaaa agtgagaggt cagactccta aggtgagtga 1621 gagtattagt ggtcatggtg ttaggacttt ttttcctttc acagctaaac caagtccctg 1681 ggctcttact cggtttgcct tctccctccc tggagatgag cctgagggaa gggatgctag 1741 gtgtggaaga caggaaccag ggcctgatta accttccctt ctccaggtgg ccaaacagga 1801 gaagaagaag aagaagacag gtcgggctaa gcggcggatg cagtacaacc ggcgctttgt 1861 caacgttgtg cccacctttg gcaagaagaa gggccccaat gccaactctt aagtcttttg 1921 taattctggc tttctctaat aaaaaagcca cttagttcag tcatcgcatt gtttcatctt 1981 tacttgcaag gcctcaggga gaggtgtgct tctcgg // LOCUS HSFBRGG 10564 bp DNA PRI 21-NOV-1994 DEFINITION Human gene for fibrinogen gamma chain. ACCESSION X02415 M10014 NID g31306 KEYWORDS fibrinogen; glycoprotein; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10564) AUTHORS Rixon,M.W., Chung,D.W. and Davie,E.W. 
TITLE Nucleotide sequence of the gene for the gamma chain of human fibrinogen JOURNAL Biochemistry 24 (8), 2077-2086 (1985) MEDLINE 85252774 FEATURES Location/Qualifiers source 1..10564 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region 1280..1309 /note="direct repeat" repeat_region 1310..1339 /note="direct repeat" CAAT_signal 1691..1694 TATA_signal 1724..1728 exon 1748..1876 /number=1 precursor_RNA 1748..9825 /note="fibrinogen gamma chain variant gamma'primary transcript" mRNA join(1748..1876,1973..2017,2207..2390,2510..2603, 4211..4341,4645..4778,5758..5942,7426..7703,9342..9511, 10054..10271) /product="fibrinogen gamma chain" precursor_RNA 1748..10271 /note="fibrinogen gamma chain primary transcript" sig_peptide 1799..1876 /product="fibrinogen gamma chain" CDS join(1799..1876,1973..2017,2207..2390,2510..2603, 4211..4341,4645..4778,5758..5942,7426..7703,9342..9511, 10054..10068) /codon_start=1 /product="fibrinogen gamma chain" /db_xref="PID:g577054" /db_xref="SWISS-PROT:P02679" /translation="MSWSLHPRNLILYFYALLFLSSTCVAYVATRDNCCILDERFGSY CPTTCGIADFLSTYQTKVDKDLQSLEDILHQVENKTSEVKQLIKAIQLTYNPDESSKP NMIDAATLKSRIMLEEIMKYEASILTHDSSIRYLQEIYNSNNQKIVNLKEKVAQLEAQ CQEPCKDTVQIHDITGKDCQDIANKGAKQSGLYFIKPLKANQQFLVYCEIDGSGNGWT VFQKRLDGSVDFKKNWIQYKEGFGHLSPTGTTEFWLGNEKIHLISTQSAIPYALRVEL EDWNGRTSTADYAMFKVGPEADKYRLTYAYFAGGDAGDAFDGFDFGDDPSDKFFTSHN GMQFSTWDNDNDKFEGNCAEQDGSGWWMNKCHAGHLNGVYYQGGTYSKASTPNGYDNG IIWATWKTRWYSMKKTTMKIIPFNRLTIGEGQQHHLGGAKQAGDV" intron 1877..1972 /number=1 exon 1973..2017 /number=2 mat_peptide join(1973..2017,2207..2390,2510..2603,4211..4341, 4645..4778,5758..5942,7426..7703,9342..9511,10054..10065) /product="fibrinogen gamma chain" intron 2018..2206 /number=2 exon 2207..2390 /number=3 intron 2391..2509 /number=3 exon 2510..2603 /number=4 intron 2604..4210 /number=4 exon 4211..4341 /number=5 intron 4342..4644 /number=5 exon 4645..4778 /number=6 intron 4779..5757 /number=6 exon 5758..5942 /number=7 intron 5943..7425 /number=7 exon 7426..7703 /number=8 
intron 7704..9341 /number=8 misc_feature 8324..8468 /note="region identical to exon 9 coding region (aa 360-410)" exon 9342..9511 /number=9 intron 9512..10053 /number=9 misc_feature 9808..9813 /note="pot polyA site for gamma' chain" polyA_site 9825 /note="polyA site of gamma' chain" exon 10054..10065 /number=10 misc_feature 10247..10252 /note="pot polyA site" polyA_site 10271 /note="polyA site" BASE COUNT 3337 a 1936 c 1991 g 3300 t ORIGIN 1 ctacacactt cttgaaggca aaggcaatgc tgaagtcacc tttcatgttc aaatcatatt 61 aaaaagttag caagatgtaa ttatcagtgt actatgtaaa tctttgtgaa tgatcaataa 121 ttacatattt tcattatata tattttagta gataatattt atatacattc aacattctaa 181 atatagaaag tttacagaga aaaataaagc ctttttttcc aatcctgtcc tccacctctg 241 catcccattc ttcttcacag aggcaactga ttcaagtcat tacatagtta ttgagtgtta 301 actacaacta tgttaagtac agctatatat gttagatgcc gtagccacag aaatcagttt 361 acaatctaat gcagtggata cagcatgtat acatataata taaggttgct acaaatgcta 421 tctgaggtag agctgtttga aagaatacta atacttaaat gtttaattca actgacttga 481 ttgacaactg attagctgag tggaaaagat ggatgagaaa gattgtgaga cttaattggc 541 tggtggtatg gtgatatgat tgacaataac tgctaagtca gagagggata tattaaggag 601 gagaagaaaa gcaacaaatc tggttttgat gtgttcactt tgttataatt attgattatt 661 tactgaatat gaatatttat ctttgttttt gagtcaataa atataccttt gtaaagacag 721 aattaaagta ttagtatttc tttcaaactg gaggcatttc tcccactaac atatttcatc 781 aaaacttata ataagcttgg ttccagagga agaaatgagg gataaccaaa aatagagaca 841 ttaataatag tgtaacgccc agtgataaat ctcaataggc agtgatgaca gacatgtttt 901 cccaaacaca aggatgctgt aagggccaaa cagaaatgat ggcccctccc cagcacctca 961 ttttgcccct tccttcagct atgcctctac tctcctttag atacaaggga ggtggatttt 1021 tctcttctct gagatagctt gatggaacca caggaacaat gaagtgggct cctggctctt 1081 ttctctgtgg cagatggggt gccatgccca ccttcagaca aagggaagat tgagctcaaa 1141 agctccctga gaagtgagag cctatgaaca tggttgacac agagggacag gaatgtattt 1201 ccagggtcat tcattcctgg gaatagtgaa ctgggacatg ggggaagtca gtctcctcct 1261 gccacagcca cagattaaaa ataataatgt taactgatcc ctaggctaaa ataatagtgt 1321 
taactgatcc ctaagctaag aaagttcttt tggtaattca ggtgatggca gcaggaccca 1381 tcttaaggat agactaggtt tgcttagttc gaggtcatat ctgtttgctc tcagccatgt 1441 actggaagaa gttgcatcac acagcctcca ggactgccct cctcctcaca gcaatggata 1501 atgcttcact agcctttgca gataattttg gatcagagaa aaaaccttga gctgggccaa 1561 aaaggaggag cttcaacctg tgtgcaaaat ctgggaacct gacagtatag gttgggggcc 1621 aggatgagga aaaaggaacg ggaaagacct gcccaccctt ctggtaagga ggccccgtga 1681 tcagctccag ccatttgcag tcctggctat cccaggagct tacataaagg gacaattgga 1741 gcctgagagg tgacagtgct gacactacaa ggctcggagc tccgggcact cagacatcat 1801 gagttggtcc ttgcaccccc ggaatttaat tctctacttc tatgctcttt tatttctctc 1861 ttcaacatgt gtagcagtaa gtgtgctctt cacaaaacgt tgtttaaaat ggaaagctgg 1921 aaaataaaac agataataaa ctagtgaaat tttcgtattt tttctctttt agtatgttgc 1981 taccagagac aactgctgca tcttagatga aagattcgta agtagttttt atgtttctcc 2041 ctttgtgtgt gaactggaga ggggcagagg aatagaaata attccctcat aaatatcatc 2101 tggcacttgt aactttttaa aaacatagtc taggttttac ctatttttct taatagattt 2161 taagagtagc atctgtctac atttttaatc actgttatat tttcagggta gttattgtcc 2221 aactacctgt ggcattgcag atttcctgtc tacttatcaa accaaagtag acaaggatct 2281 acagtctttg gaagacatct tacatcaagt tgaaaacaaa acatcagaag tcaaacagct 2341 gataaaagca atccaactca cttataatcc tgatgaatca tcaaaaccaa gtgagaaaat 2401 aaagactact gaccaaaaaa taataataat aatctgtgaa gttcttttgc tgttgtttta 2461 gttgttctat ttgcttaagg atttttatgt ctctgatcct atattacaga tatgatagac 2521 gctgctactt tgaagtccag gataatgtta gaagaaatta tgaaatatga agcatcgatt 2581 ttaacacatg actcaagtat tcggtaagga tttttgtttt aatttgctct gcaagactga 2641 tttagttttt atttaatatt ctatacttga gtgaaagtaa tttttaatgt gttttcccca 2701 tttataatat cccagtgaca ttatgcctga ttatgttgag catagtagag atagaagttt 2761 ttagtgcaat ataaattata ctgggttata attgcttatt aataatcaca ttgaagaaag 2821 atgttctaga tgtcttcaaa tgctagtttg accatattta tcaaaaattt tttccccatc 2881 ccccatttat cttacaacat aaaatcaatc tcataggaat ttgggtgttg aaaataaaat 2941 cctctttata aaaatgctga caaattggtg gttaaaaaaa ttagcaagca gaggcatagt 3001 aaggattttg 
gctcctaaag taaattatat tgaatgtgga gcaggaagaa acatgtcttg 3061 agagactaag tgtggcaaat attgcaaagc tcatattgat cattgcagaa tgaacctgca 3121 tagtctcttc ccttcatttg gaagtgaatg tctctgttaa agcttctcag ggactcataa 3181 actttctgaa cataaggtct cagatacagt tttaatattt ttccccaatt tttttttctg 3241 aatttttctc aaagcagctt gagaaattga gataaatagt agctagggag aagtggccca 3301 ggaaagattt ctcctctttt tgctatcaga gggcccttgt tattattgtt attattatta 3361 cttgcattat tattgtccat cattgaagtt gaaggaggtt attgtacaga aattgcctaa 3421 gacaaggtag agggaaaacg tggacaaata gtttgtctac ccttttttac ttcaaagaaa 3481 gaacggttta tgcattgtag acagttttct atcatttttg gatatttgca agccaccctg 3541 taagtaacta caaaaggagg gtttttactt cccccagtcc attcccaaag ctatgtaacc 3601 agaagcatta aagaagaaag gggaagtatc tgttgtttta ttttacatac aataacgttc 3661 cagatcatgt ccctgtgtaa gttatatttt agattgaagc ttatatgtat agcctcagta 3721 gatccacaag tgaaaggtat actccttcag cacatgtgaa ttactgaact gagcttttcc 3781 tgcttctaaa gcatcagggg gtgttcctat taaccagtct cgccactctt gcaggttgct 3841 atctgctgtc ccttatgcat aaagtaaaaa gcaaaatgtc aatgacattt gcttattgac 3901 aaggactttg ttatttgtgt tgggagttga gacaatatgc cccattctaa gtaaaaagat 3961 tcaggtccac attgtattcc tgttttaatt gattttttga tttgtttttc tttttcaaaa 4021 agtttataat tttaattcat gttaatttag taatataatt ttacattttc ctcaagaatg 4081 gaataattta tcagaaagca cttcttaaga aaatacttag cagtttccaa agaaaatata 4141 aaattactct tctgaaagga atacttattt ttgtcttctt atttttgtta tcttatgttt 4201 ctgtttgtag atatttgcag gaaatatata attcaaataa tcaaaagatt gttaacctga 4261 aagagaaggt agcccagctt gaagcacagt gccaggaacc ttgcaaagac acggtgcaaa 4321 tccatgatat cactgggaaa ggtaactgat gaaggttata ttgggattag gttcatcaaa 4381 gtaagtaatg taaaggagaa agtatgtact ggaaagtata ggaatagttt agaaagtggc 4441 tacccattaa gtctaagaat ttcagttgtc tagacctttc ttgaatagct aaaaaaaaca 4501 gtttaaaagg aatgctgatg tgaaaagtaa gaaaattatt cttggaaaat gaatagttta 4561 ctacatgtta aaagctattt ttcaaggctg gcacagtctt acctgcattt caaaccacag 4621 taaaagtcga ttctccttct ctagattgtc aagacattgc caataaggga gctaaacaga 4681 gcgggcttta ctttattaaa 
cctctgaaag ctaaccagca attcttagtc tactgtgaaa 4741 tcgatgggtc tggaaatgga tggactgtgt ttcagaaggt aattttttcc ccaccatgtg 4801 tatttaataa attcctacat tgtttctgcc atatggcaga tacttttcta agcaccttgt 4861 gaaccgtagc tcatttaatc cttgcaatag ccctaagagg aaggtacttc tgttactcct 4921 atttacagaa aaggaaactg aggcacacaa ggttaaataa cttgcccaag accacataac 4981 taataagcaa cagagtcagc atttgaacct aggcagtata gtttcagagt ttgtgacttg 5041 actctatatt gtactggcac tgactttgta gattcatggt ggcacataat catagtacca 5101 cagtgacaaa taaaaagaag gaaactcttt tgtcaggtag gtcaagacct gaggtttccc 5161 atcacaagat gaggaagccc aacaccaccc cccaccaccc caccaccatc accacccttt 5221 cacacaccag aggatacact tgggctgctc caagacaagg aacctgtgtt gcatctgcca 5281 cttgctgata cccactagga atcttggctc ctttactttc tgtttacctc ccaccactgt 5341 tataactgtt tctacagggg gcgctcagag ggaatgaatg gtggaagcat tagttgccag 5401 acaccgattg agcaatgggt tccatcataa gtgtaagaat cagtaatatc cagctagagt 5461 tctgaagtcg tctaggtgtc tttttaatat taccactcat ttagaattta tgatgtgcca 5521 gaaaccctct taagtatttc tcttatattc tctctcatga tccttgcagc aaccctaaga 5581 agtaaccatc atttttccta tttgatacat gaggaaactg aggtagcttg gccaagatca 5641 cttagttggg agttgataga accagtgctc tgtatttttg acaaaatgtt gacagcattc 5701 tctttacatg cattgatagt ctattttctc cttttgctct tgcaaatgtg taattagaga 5761 cttgatggca gtgtagattt caagaaaaac tggattcaat ataaagaagg atttggacat 5821 ctgtctccta ctggcacaac agaattttgg ctgggaaatg agaagattca tttgataagc 5881 acacagtctg ccatcccata tgcattaaga gtggaactgg aagactggaa tggcagaacc 5941 aggtactgtt ttgaaatgac ttccaacttt ttattgtaaa gattgcctgg aatgtgcact 6001 ttccaactat caatagacaa tggcaaatgc agcctgacaa atgcaaacag cacatccagc 6061 caccattttc tccaggagtc tgtttggttc ttgggcaatc caaaaaggta aattctattc 6121 aggatgaatc taagtgtatt ggtacaatct aattaccctg gaaccattca gagtaatagc 6181 taattactga acttttaatc agtcccagga attgagcata aaattataat tttatctagt 6241 ctaaattact atttcatgaa gcaggtatta ttattaatcc cattttatag attaacttgc 6301 tcaaagtcac attgctgata agtggtagag gtagaattca gactcaagta gtttaacttt 6361 agagcctgtc ctcttaacaa ctatcctggt 
tgaaaagcaa atacagcctc ttcagacttc 6421 tcagtgcctt gatggccatt tattctgtca aatcatgagc taccctaaaa gtaaaccagc 6481 tagctctttt gatgatctag aggcttcttt ttgcttgaga tatttgaagg ttttaagcat 6541 tgttacctaa ttaaaatgca gaaaaatatc caaccctctt gttatgttta aggaatagtg 6601 aaatatattg tcttcaaaca catggacttt tttttattgc ttggttggtt tttaatccag 6661 aaagtgctat agtcagtaga ccttcttcta ggaaaggacc ttccatttcc cagccactgg 6721 agattagaaa ataagctaaa tattttctgg aaatttctgt tcattcatta aggcccatcc 6781 tttcccccac tctatagaag tgttgtccac ttgcacaatt ttttccagga aagaatctct 6841 ctaactcctt cagctcacat gctttggacc acacagggaa gactttgatt gtgtaatgcc 6901 ctcagaagct ctccttcttg ccactaccac actgatttga ggaagaaaat ccctttagca 6961 cctaaccctt caggtgctat gagtggctaa tggaactgta cctccttcaa gttttgtgca 7021 ataattaagg gtcactcact gtcagatact ttctgtgatc tatgataatg tgtgtgcaac 7081 acataacatt tcaataaaag tagaaaatat gaaattagag tcatctacac atctggattt 7141 gatcttagaa tgaaacaagc aaaaaagcat ccaagtgagt gcaattatta gttttcagag 7201 atgcttcaaa ggcttctagg cccatcccgg gaagtgttaa tgagctgtgg actggttcac 7261 atatctattg cctcttgcca gatttgcaaa aaacttcact caatgagcaa atttcagcct 7321 taagaaacaa agtcaaaaat tccaaggaag catcctacga aagagggaac ttctgagatc 7381 cctgaggagg gtcagcatgt gatggttgta tttccttctt ctcagtactg cagactatgc 7441 catgttcaag gtgggacctg aagctgacaa gtaccgccta acatatgcct acttcgctgg 7501 tggggatgct ggagatgcct ttgatggctt tgattttggc gatgatccta gtgacaagtt 7561 tttcacatcc cataatggca tgcagttcag tacctgggac aatgacaatg ataagtttga 7621 aggcaactgt gctgaacagg atggatctgg ttggtggatg aacaagtgtc acgctggcca 7681 tctcaatgga gtttattacc aaggtatgtt ttcctttctt agattccaag ttaatgtata 7741 gtgtatacta ttttcataaa aaataataaa tagatatgaa gaaatgaaga ataatttata 7801 aagatagtag ggattttatc atgttcttta tttcaactaa gttctttgaa actggaagtg 7861 gataatacca agttcatgcc taaaattagc ccttctaaag aaatccacct gctgcaaaat 7921 atccagtagt ttggcattat atgtgaaact atcaccatca tagctggcac tgtgggttgt 7981 gggatctcct ttagacatac aacataaatg atctggatgg attaacatta ctacatggat 8041 gcttgttgac acattaacct ggcttcccat gagctttgtg 
tcagatacac gcagtgaaca 8101 ggtgtttgga ggaacagaat aaagagaagg caagcactgg taagggcagg ggtttgtgaa 8161 agcttgagag aagagaccag tctgaggaca gtagacactt attttaggat gggggttgga 8221 tgaggaggct atagtttgct ataagcttgg aatggtttgg aacactggtt tcactcacct 8281 acccagcagt tatgtgtggg gaagccttac cgatgctaaa ggatccatgt tacaataatg 8341 gcattatttg gaaatcccag tggtattcca tgaataaaac cactatgaag ataatcccac 8401 tcaacagact ctccgttgga gaaggacagc aacaccaccc tgggaaagcc aaacagtcag 8461 accagacctg tttagcatca gtaggacttc cctaccatat ctgctgggta gatgagtgaa 8521 accagtgttc caaaccactc cgggcttgta gcaaaccata gtctcctcat ctaccaagat 8581 gagcaacctt acctcctgat gtcctagcca atcaccaact aggaaacttt gcacagttta 8641 tttaaagtaa cagtttgatt ttcacaatat ttttaaattg gagaaacata acttatcttt 8701 gcactcacaa accacataat gagaagaaac tctaagggaa aatgcttgat ctgtgtgacc 8761 cggggcgcca tgccagagct gtagttcatg ccagtgttgt gctctgacaa gccttttaca 8821 gaattacatg agatctgctt ccctaggaca aggagaaggc aaatcaacag aggctgcact 8881 ttaaaatgga gacataaaat aacatgccag aaccatttcc taaagctcct caatcaacca 8941 acaaaattgt gctttcaaat aacctgagtt gacctcatca ggaattttgt ggctccttct 9001 cttctaacct gcctgaagaa agatggtcca cagcagctga gtccgggatg gataagctta 9061 gggacagagg ccaattaggg aactttgggt ttctagccct actagtagtg aataaattta 9121 aagtgtggat gtgactatga gtcacagcac agatgttgtt taataatatg tttattttat 9181 aaattgatat tttaggaatc tttggagata ttttcagtta gcagataata ctataaattt 9241 tatgtaactg gcaatgcact tcgtaataga cagctcttca tagacttgca gaggtaaaaa 9301 gattccagaa taatgatatg tacatctacg acttgtttta ggtggcactt actcaaaagc 9361 atctactcct aatggttatg ataatggcat tatttgggcc acttggaaaa cccggtggta 9421 ttccatgaag aaaaccacta tgaagataat cccattcaac agactcacaa ttggagaagg 9481 acagcaacac cacctggggg gagccaaaca ggtcagacca gagcaccctg cggaaacaga 9541 atatgactca ctttaccctg aggatgattt gtagaaaatt aactgctaac ttctattgac 9601 ccacaaagtt tcagaaattc tctgaaagtt tcttcctttt ttctcttact atatttattg 9661 atttcaagtc ttctattaag gacatttagc cttcaatgga aattaaaact catttaggac 9721 tgtatttcca aattactgat atcagagtta tttaaaaatt gtttatttga 
ggagataaca 9781 tttcaacttt gttcctaaat atataataat aaaatgattg actttatttg catttttatg 9841 accacttgtc atttattttg tcttcgtaaa ttattttcat tatatcaaat attttagtat 9901 gtacttaata aaataggaga acattttaga gtttcaaatt cccaggtatt ttccttgttt 9961 attaccccta aatcattcct atttaattct tctttttaaa tggagaaaat tatgtctttt 10021 taatatggtt tttgttttgt tatatattca caggctggag acgtttaaaa gaccgtttca 10081 aaagagattt acttttttaa aggactttat ctgaacagag agatataata tttttcctat 10141 tggacaatgg acttgcaaag cttcacttca ttttaagagc aaaagacccc atgttgaaaa 10201 ctccataaca gttttatgct gatgataatt tatctacatg catttcaata aaccttttgt 10261 ttcctaagac tagatacatg gtacctttat tgaccattaa aaaaccacca ctttttgcca 10321 atttaccaat tacaattggg caaccatcag tagtaattga gtcctcattt tatgctaaat 10381 gttatgccta actctttggg agttacaaag gaaatagcaa ttatggcttt tgccctctag 10441 gagatacagg acaaatacag gaaaatacag caacccaaac tgacaatact ctatacaaga 10501 acataatcac taagcaggag tcacagccac acaaccaaga tgcatagtat ccaaagtgca 10561 gctg // LOCUS HSFESFPS 12263 bp DNA PRI 26-JUN-1997 DEFINITION Human c-fes/fps proto-oncogene. ACCESSION X06292 M14209 M14589 NID g31348 KEYWORDS fes/fps cellular oncogene; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 855 to 12263) AUTHORS Roebroek,A.J., Schalken,J.A., Verbeek,J.S., Van den Ouweland,A.M., Onnekink,C., Bloemers,H.P. and Van de Ven,W.J. TITLE The structure of the human c-fes/fps proto-oncogene JOURNAL EMBO J. 4 (11), 2897-2903 (1985) MEDLINE 86055727 REFERENCE 2 (bases 1 to 1315) AUTHORS Roebroek,A.J., Schalken,J.A., Bussemakers,M.J., van Heerikhuizen,H., Onnekink,C., Debruyne,F.M., Bloemers,H.P. and Van de Ven,W.J. TITLE Characterization of human c-fes/fps reveals a new transcription unit (fur) in the immediately upstream region of the proto-oncogene JOURNAL Mol. Biol. Rep. 
11 (2), 117-125 (1986) MEDLINE 86284598 REFERENCE 3 (bases 1 to 12263) AUTHORS Jucker,M., Roebroek,A.J., Mautner,J., Koch,K., Eick,D., Diehl,V., Van de Ven,W.J. and Tesch,H. TITLE Expression of truncated transcripts of the proto-oncogene c-fps/fes in human lymphoma and lymphoid leukemia cell lines JOURNAL Oncogene 7 (5), 943-952 (1992) MEDLINE 92237021 FEATURES Location/Qualifiers source 1..12263 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="q25-qter" CAAT_signal 248..256 /note="pot. CAAT-box" mRNA join(<512..512,1005..1226,1379..1552,2926..3022, 3152..3335,5267..5404,5478..5597,5801..5923,6053..6239, 6361..6444,6940..7149,7511..7633,8015..8068,8663..8781, 9056..9150,9247..9370,9610..9767,9892..10014,11372..11732) exon <512..512 /number=1 intron 513..1004 /number=1 exon 1005..1226 /number=2 CDS join(1014..1226,1379..1552,2926..3022,3152..3335, 5267..5404,5478..5597,5801..5923,6053..6239,6361..6444, 6940..7149,7511..7633,8015..8068,8663..8781,9056..9150, 9247..9370,9610..9767,9892..10014,11372..11514) /codon_start=1 /product="NCP92" /db_xref="PID:g31349" /db_xref="SWISS-PROT:P07332" /translation="MGFSSELCSPQGHGVLQQMQEAELRLLEGMRKWMAQRVKSDREY AGLLHHMSLQDSGGQSRAISPDSPISQSWAEITSQTEGLSRLLRQHAEDLNSGPLSKL SLLIRERQQLRKTYSEQWQQLQQELTKTHSQDIEKLKSQYRALARDSAQAKRKYQEAS KDKDRDKAKDKYVRSLWKLFAHHNRYVLGVRAAQLHHQHHHQLLLPGLLRSLQDLHEE MACILKEILQEYLEISSLVQDEVVAIHREMAAAAARIQPEAEYQGFLRQYGSAPDVPP CVTFDESLLEEGEPLEPGELQLNELTVESVQHTLTSVTDELAVATEMVFRRQEMVTQL QQELRNEEENTHPRERVQLLGKRQVLQEALQGLQVALCSQAKLQAQQELLQTKLEHLG PGEPPPVLLLQDDRHSTSSSEQEREGGRTPTLEILKSHISGIFRPKFSLPPPLQLIPE VQKPLHEQLWYHGAIPRAEVAELLVHSGDFLVRESQGKQEYVLSVLWDGLPRHFIIQS LDNLYRLEGEGFPSIPLLIDHLLSTQQPLTKKSGVVLHRAVPKDKWVLNHEDLVLGEQ IGRGNFGEVFSGRLRADNTLVAVKSCRETLPPDLKAKFLQEARILKQYSHPNIVRLIG VCTQKQPIYIVMELVQGGDFLTFLRTEGARLRVKTLLQMVGDAAAGMEYLESKCCIHR DLAARNCLVTEKNVLKISDFGMSREEADGVYAASGGLRQVPVKWTAPEALNYGRYSSE SDVWSFGILLWETFSLGASPYPNLSNQQTREFVEKGGRLPCPELCPDAVFRLMEQCWA YEPGQRPSFSTIYQELQSIRKRHR" intron 1227..1378 
/number=2 exon 1379..1552 /number=3 intron 1553..2925 /number=3 exon 2926..3022 /number=4 intron 3023..3151 /number=4 exon 3152..3335 /number=5 intron 3336..5266 /number=5 exon 5267..5404 /number=6 intron 5405..5477 /number=6 exon 5478..5597 /number=7 intron 5598..5800 /number=7 exon 5801..5923 /number=8 intron 5924..6052 /number=8 exon 6053..6239 /number=9 intron 6240..6360 /number=9 exon 6361..6444 /number=10 intron 6445..6939 /number=10 exon 6940..7149 /number=11 intron 7150..7510 /number=11 exon 7511..7633 /number=12 intron 7634..8014 /number=12 exon 8015..8068 /number=13 intron 8069..8662 /number=13 exon 8663..8781 /number=14 intron 8782..9055 /number=14 exon 9056..9150 /number=15 misc_feature 9061 /note="start of 0.95 kb fes/fps cDNA" intron 9151..9246 /number=15 exon 9247..9370 /number=16 intron 9371..9609 /number=16 exon 9610..9767 /number=17 intron 9768..9891 /number=17 exon 9892..10014 /number=18 intron 10015..11371 /number=18 exon 11372..11732 /number=19 polyA_signal 11708..11713 polyA_site 11730..11732 /note="polyA sites" BASE COUNT 2497 a 3568 c 3664 g 2534 t ORIGIN 1 gaattccgtg aggtggggag ggctgggacc agggttccct ctttctcttc tgcggtggcc 61 ctggcctggt gctaggactg cgcgcctccc ctcagtaccc gcggacaccc tgggcttccc 121 tgggcccagc atctgcctgg ggcctcgcct gggctccccc tcctgacccc caccttgcgc 181 cccttcccgg tgttcccggg gcgctgccgg gccctggggc ctgcggggcg cgggcggctc 241 ttggctgggc cattctttcc cggccccctc ctcccttccg tttccgtggc cgtgcggccg 301 gctagaggct gcggcccagc gcggagcagg ggggctggca ggcgtcggga cggtcgggcc 361 ggtcccgccc gccccttccc ctccacaggc ccgccccggg gcctgggcca actgaaaccg 421 cgggaggagg aagcgcggaa tcaggaactg gccggggtcc gcaccgggcc tgagtcggtc 481 cgaggccgtc ccaggagcag ctgcccgtgc gggtacctct agccccgggg cctggaggag 541 cggtgggagc tgggggcgcg gcaggcaggg gcagagcagg cgttccgagg gccagagacc 601 cacccaggtc ggggtagggg ccgcggaagg gcggggatgg ccgcaggggc agggctcagg 661 ctgtgggcgc ctgaggcttc agctggggca ggcttggcct gtcgaggacc tgggcaaggg 721 tgtccctgta aggggtggtg ggtggaaggg cctggggagg gaggctccag 
gttggctcct 781 gttcccgaac gtgcggagga gaccctgacg ctaaggaagc aatgagggcc agtccccagg 841 ccaggctgct gctgggtacc catggctgcg tgtgagcgag gcaggacccc acctcctccc 901 cgtctgcagt ccatcctgac cctacagtcc ccagtctcct cgtcccatgc ctccgtctcc 961 agctgctgcc ttgcctccag ggatggcccc ttttctgtcc ccagaacagc actatgggct 1021 tctcttccga gctgtgcagc ccccagggcc acggggtcct gcagcaaatg caggaggccg 1081 agcttcgtct actggagggc atgagaaagt ggatggccca gcgggtcaag agtgacaggg 1141 agtatgcagg actgcttcac cacatgtccc tgcaggacag tgggggccag agccgggcca 1201 tcagccctga cagccccatc agtcaggtgg gtctctatgg gactctggtg ggtgctggcg 1261 tatctgcctt ctccttcctc tcctgggggc cctctggggc agtggctgga gatctggcag 1321 gccaatgctt gggagccatt gtgcccccct ccctgcctcc cccatctgtg ctgtatagtc 1381 ctgggctgag atcaccagcc aaactgaggg cctgagccgc ttgctgcggc agcacgcaga 1441 ggatctgaac tcagggcccc tgagcaagct gagcctgctc atccgggaac ggcagcagct 1501 tcgcaagacc tacagcgagc agtggcagca gctgcagcag gagctcacca aggtgagcgg 1561 gcagcactgg ggcttcggtc atttctgtct aaattttgag cctcgaaggg gttgttttgc 1621 acaagaggcc ctggattcac tggggaagtg taagtccctg accgcaggcc tggcttgctc 1681 taaccttgat gtagcttcct ctcttccttc ccctacgttg agctggcttg cagcaaggcc 1741 tctctgtgct ttttctgtgc ctgggcaaag tgctgggagt gtaaggatga gtgaccggtc 1801 acgtgcctgg gagaagctca gaatcggtac tcgcctccac actgtgccat ctggctctgg 1861 gttctgagag tcagggagag gaatgagggt cagtctgttt gccttcgacc tatgcagcct 1921 cctctcaggg cccgagactg ggcagcagca tggccccccg aaggtcgagg actcgggccg 1981 tgaagtcagc ctgcctaggt tgaatcccac ccagctcctc cgtctagagg ctgtgtgatt 2041 tggaactatt tatctgggag cctagtgccc ccattcagtg tgctggtcac cctccctgca 2101 ccacacccct tcctcaagtg cagagcccag ccttgccatg gacccacagc ggcccctggt 2161 ggcccaccct ggccccattc ctcgccccaa aagatcatct gattcaaggg tgggcccatt 2221 tttataaagt tttgctggaa cacagctatg cccctttgtt ttcatattgt ctgtgactac 2281 aatgacagag ttgagtaatt gtgacagagg ctctatggcc tacaagccta aaatatttat 2341 ttactatctg gccctttaag aaaaagactg atctagtcga ggaatctagc tcagttacag 2401 atggggaaac tgaggttggg cgcttgccca acatatccca gcacataaac aggagaactg 2461 
ggacgagaac actgatctcg ggctgtcatc tattcctact gccaagaaca taatttgcag 2521 gacccagtgc aaagtgaaat tgtgggggtc tttgttaaaa gattgctagg aatttccagg 2581 tggcaataat ggagaatgaa accaagcaca gggcccttct acatgtggag ccccgtgtga 2641 ctgcacaggc cgtgcacacc tgcaactggc cctgcctgcc accaggctac cactgtcagt 2701 ccaaggaggg accgttgtag cctgtagtct acctctttgc ctccccaagg ggtctgtctt 2761 taacaggctc tctgatcttt gactctcacg tcagcagcca gctttcccag aagtctccag 2821 gtgctccttg cctgacgaca ggacctttcc agggcttcac cccaggcaag aatcttccac 2881 aactggggac ctgctgcccc acactggcct ctcctctctc cctagaccca cagccaggac 2941 attgagaagc tgaagagcca gtaccgagct ctggcacggg acagtgccca agccaagcgc 3001 aagtaccagg aggccagcaa aggttcgtgg cttcccttgc tggcagggag ggaatccgaa 3061 gccagtgctg acctgtcctt gggtacccag agagtggggg ctgcctgggc ctccatgctg 3121 tcatctatac cccttgcccc ccttctggca gacaaggacc gtgacaaggc caaggacaag 3181 tatgtgcgca gcctgtggaa gctctttgct caccacaacc gctatgtgct gggcgtgcgg 3241 gctgcgcagc tacaccacca gcaccaccac cagctcctgc tgcccggcct gctgcggtca 3301 ctgcaggacc tgcacgagga gatggcttgc atcctgtaag cccgcagccc cgtcccctgg 3361 cccccaccct tgagcagccc taagcccagc catcaggccc agaggcagga cccagaaaat 3421 ccattgctgg gaaggtgctg gccatgtaac cacatgagaa cgggacctgg gccaaggatt 3481 ggaaacaggc aacttacctc tgaattacac tattccaggg tctcattatt ccagggtttt 3541 attacattca ttgagcactg ttctgggctc tggattatac cagagaacga tggtagacaa 3601 aaacatctgt cctcagggat ctttcgtgtt agtggagtga ggatgtgagg agcactaaga 3661 gccatggaga aaaataaagc aagagaagtg gatcgggacc tgggagcacg gaggcaaggg 3721 aggaggtgac agttgtccat agagtgatct gggaaagcct cttgagaggt gacattcaaa 3781 gaggcccctg agaggggtac gggagtgaat catggggcta tttggagaaa gaccattcca 3841 gaaaggagga cagcaattac acaggccttg aggtaggaga gtaccaggga ctaatagcca 3901 ggaaccagtg gtgcctctga gagtgaggga gggggagagt catacacgag gctggaggag 3961 gcaggcgtca agggctactg ggtgatagag ggtctagcag ggccatggtg aggactttgg 4021 ctctgggtga acaagaatgg catgatctga cctctgtttt tttgtttcat tttgttttaa 4081 ctttttttga gtcagagtct cgctctgccg cccaggctag agtgcagtgg catgatctcg 4141 gcttactgca 
acctccgcct cccaggttca agtgattccc ctgcctcagc ctcccgagta 4201 gctggaacta cgggcatgcg ccaccacacc cagctaattt ttgtattttt agtagagatg 4261 gggtttcacc atgttgccca ggctagtctc taattcctgg gctcaaagcg atttgcctgc 4321 ctctgcctcc caaagtgccg ggattacagg catgagccac catgcccagc cctgacctct 4381 gttttaataa ggccactctg gctgctgtgc tgcaaataga cttcagggag caaggacaga 4441 agctgggagg ccagagagca ggctcttgcc ataatccaga tccaagcttt tggccactag 4501 gacggggagg tagcaatgga ggtgaggcgc ggtcaggtcc tggaaggtga agccagtggg 4561 atttccctat ggattggaag tggggcgtga aatagaggag tcaggggtca ctctggggat 4621 ttggcctgga gcagctggaa gatggagtgg ctgttaattc atgtagggaa ggctgtggga 4681 agaagaggtt taggagacaa ggatagcagt tcatttattt atttatttat ttatttattt 4741 atttatttat ttatttagag atgtagtctc attctttcgc caggctggag tgcagtggcg 4801 cgatcttggc tcactgcaac ctccacctcc caggctcaag cgattctctt gcctcagcct 4861 cccgagtagc caagtagctg ggactacagg catgtgccac catgcctggc taatttttgt 4921 atttgctttt tcagtagaga tggggtttca ccacgttagc caggctggtc tcgaactgac 4981 ctcaggcaat ccacccgcct cgacctccca gtgttggtat tataggcgtg agccactgtg 5041 cctggcccac tggatcctta ttacaactgc cagtgtccct cttatatata tcaggaaata 5101 gaagattagg gagaggttaa ataatttgcc tagagtggca tggctagctc gaagtgaggc 5161 aggggtcaac cccagccctg actccaaacc cagggtccta ggcctgaact gcccagcctt 5221 gcccagcctg aggctcccct gactggggat cccgtctcgg gggcaggaag gagatcctgc 5281 aggaatacct ggagattagc agcctggtgc aggatgaggt ggtggccatt caccgggaga 5341 tggctgcagc tgctgcccgc atccagcctg aggctgagta ccaaggcttc ctgcgacagt 5401 atgggtaagc cccgtccttg ctcctgctgg gcccagggct gctggcctgt ccactgacgg 5461 ggcgctgtcc cccacaggtc cgcacctgac gtcccaccct gtgtcacgtt cgatgagtca 5521 ctgcttgagg agggtgaacc gctggagcct ggggagctcc agctgaacga gctgactgtg 5581 gagagcgtgc agcacacgtg ggtggtggct ttgcacctgg gctgcggcgg ggctcccagc 5641 agaccacgag tgtttatgta ggcagggcta ggtcgtggag actgtccaca cagagctgtc 5701 accaggtggc cgggcttgct tggctctaca gggatgcact ggacctgggt tgagggggca 5761 ggagggctcg gttctaatgc tgcccttctc ttgggtgcag gctgacctca gtgacagatg 5821 agctggctgt ggccaccgag 
atggtgttca ggcggcagga gatggttacg cagctgcaac 5881 aggagctccg gaatgaagag gagaacaccc acccccggga gcggtgagtg ggcccctgcc 5941 tgcagcagcc tcctgggcct ccctccctcc tacctaccct aactgctgct ggctacccgc 6001 cgcagaccga gcccttattc tcatccaccc tcccacccgc ccctgcctgc agggtgcagc 6061 tgctgggcaa gaggcaagtg ctgcaagaag cactgcaggg gctgcaggta gcgctgtgca 6121 gccaggccaa gctgcaggcc cagcaggagt tgctgcagac caagctggag cacctgggcc 6181 ccggcgagcc cccgcctgtg ctgctcctgc aggatgaccg ccactccacg tcgtcctcgg 6241 tgagctgccc catccgcggc cgctgcccgc caccggcctg cccacctggg ctgcgctcct 6301 cattttcgcc ctccccctcc ctaagcctgg ccacccgctg acgtctgtcc ctggcctcag 6361 gagcaggagc gagagggggg aaggacaccc acgctggaga tccttaagag ccacatctca 6421 ggaatcttcc gccccaagtt ctcggtgagt ggcgcccagc tgggcccccc tactgttgtg 6481 tttcgagttt aatcactggg atgtcctaga gaggaggctc tgcccaggct gcttgtattg 6541 ggaagttcct ctcttccctg ggattccagg ctgcagatgt ccccagaccc tgcccctgtg 6601 acccctccct ttccatcgcc ccagtgtgct aaagggacca gcaacctcga ctattccatg 6661 gctctccctg cttcaggagc ggttgggggc ctgtggcctg gaggaggagg caccagcttg 6721 gtttggggtc ttcctgcctg ggcttccctt cccagctctg cccagcgtga gcctgggcca 6781 gtccaatgcc cactccaggg gcctgtggat ggctctgcat gccactccat ggttgtaagg 6841 gctgagggca tatagggggg agagagagac ccccggctgc ccccacggcc tcttcaacaa 6901 ggtggttaag tgactcctcc tcgatcctcc cttgcccagc tccctccacc gctgcagctc 6961 attccggagg tgcagaagcc cctgcatgag cagctgtggt accacggggc catcccgagg 7021 gcagaggtgg ctgagctgct ggtgcactct ggggacttcc tggtgcggga gagccagggc 7081 aagcaggagt acgtgctgtc ggtgctgtgg gatggtctgc cccggcactt catcatccag 7141 tccttggatg tgagtggggc tgggacccga gccttccagg cctcactctt cccctccctt 7201 cccttcccca agggaaatgg cctttcaggg tagggggtag ctgccaggtc ttggatgcct 7261 ccctagcagg gctggctgga aggggccaca gagaccaccc tgtccctgca acaaaataga 7321 ggcttaagtg tgagtcctcc cctggtgggg cagcaggatg tcatgtgcca tcagatggca 7381 tcttttctgg aggtctctct gcccctggtc ctgggcaggc cctttctccc ctgctgctct 7441 ccctttcccc ctcccagggc tcacgccccc tcagaatgga ggctgctgac cccgggtccc 7501 tgccctgcag aacctgtacc gactggaagg 
ggaaggcttt cctagcattc ctttgctcat 7561 cgaccaccta ctgagcaccc agcagcccct caccaagaag agtggtgttg tcctgcacag 7621 ggctgtgccc aaggtgagcc tgcacccagc ctggcccatg ccacctgtgg cagggcttgg 7681 ggagtgtggg tcaggcccac ccagcgtctg agcagaaagg gctttccagg ccctccgtct 7741 acatacaaga tgcagagtga gtgaccctca gggccagcct tgctctaggt ttggaatgtc 7801 agggccactc ctatgccatg ggctgtacac accaggttgg tgcttacctg gtcagggcac 7861 ctgcctggac cccgtagtca tctcagtgtg ctccccacgt ggtcccaccc ctggtcacat 7921 atggaggcgc caaaaaatgg aggacacagc ccttctaagg gcccagcacc ccttttcttc 7981 agacttctga tcccctgtct cctctcttcc ccaggacaag tgggtgctga accatgagga 8041 cctggtgttg ggtgagcaga ttggacgggt gagtgcgcct ctgctggcct ccttgtcgct 8101 ggcgacttct cctgagtcgc gcctggcccc ctgccctacc acccagagac ctccctgccc 8161 catctgattc cccacttgta ccccgactcc ctgcccagcc cccaccacac accatcctcc 8221 aggaaacggg acagtaccta ccctgaaaac tcccagcaga cagctctgcc agcaccctga 8281 cctcatcacc cccacccagg ccgcccccat cgagctcttg tgtgcacgca gggagacacc 8341 ctgttactgt aagccataag atacctgttt agggaagaag tcactgtcct aaaaatcaga 8401 atgcttttca aacccaaggg agagtgattt ttggatttcc atgtcacttc tctcaggaag 8461 ggtggcacat cggaggcaac tttccctgcc tgccccatgt gctctctagg ttccccagcg 8521 agggtcaaac tcccagagag cctgggtgag gggtccgaac acgggggccc ctcacccagg 8581 ggtaggaagc agaatgggta ggaagcggag aagagaactg cgggactggg aaggccgtgg 8641 taggagccca agaccgtttc aggggaactt tggcgaagtg ttcagcggac gcctgcgagc 8701 cgacaacacc ctggtggcgg tgaagtcttg tcgagagacg ctcccacctg acctcaaggc 8761 caagtttcta caggaagcga ggtgggtgat aaactaatga tcaccacggg tcccgcatac 8821 acagaggtta cactgcatgg cacagtgtga agtgcttgac caccgtggtg gtgtttagtc 8881 ctcgaggccc cccattgcgg gtagtacccc cttataatgc cgaagggtag aggctgcccc 8941 aggtcacacg tccgggtctg ctggccttgg aggccaagct cttctcccat catccctggg 9001 gggccctggg gaggcgggcc tggccacgta gatcctgagc agcagtgccc tccaggatcc 9061 tgaagcagta cagccacccc aacatcgtgc gtctcattgg tgtctgcacc cagaagcagc 9121 ccatctacat cgtcatggag cttgtgcagg gtgagcgcgg ggcgctgagc tccaggtagg 9181 gcgcgcagcc tggtcaggtg gcagccttac ctcaggaggc 
tcagcagggg tcctccccac 9241 ctgcaggggg cgacttcctg accttcctcc gcacggaggg ggcccgcctg cgggtgaaga 9301 ctctgctgca gatggtgggg gatgcagctg ctggcatgga gtacctggag agcaagtgct 9361 gcatccaccg gtgagtgggc ggtggccacg ggccctgcca acacccccga ccagagtcaa 9421 gaggtaccta tacccctagg gccccccgct ggaccatcag gcatcagctc cagaggggga 9481 gttggcctct gtggtagaca ggggtgccca gggccgggag cagcttttgt ccttggcttt 9541 cctagagtgt tcagccaggg ctgggcaggc gactgttggc caaatgagcc cctgccctgt 9601 ctcacccagg gacctggctg ctcggaactg cctggtgaca gagaagaatg tcctgaagat 9661 cagtgacttt gggatgtccc gagaggaagc cgatggggtc tatgcagcct cagggggcct 9721 cagacaagtc cccgtgaagt ggaccgcacc tgaggccctt aactacggta cctagtccct 9781 gtctaccctg gactccatgg ccagaggcca ggcctgggtc ctgccggctg cctcgccctg 9841 gccccaggga gggtgcactc acgctgcctc acctcctcgc ctcctctgca ggccgctact 9901 cctccgaaag cgacgtgtgg agctttggca tcttgctctg ggagaccttc agcctggggg 9961 cctcccccta tcccaacctc agcaatcagc agacacggga gtttgtggag aagggtaagc 10021 accctgtgat gacagcagcc tcaggctgca ccctcttcca gatgctccag ccggactctt 10081 ctaactccct taatgccaac cttcccacca ggctgaataa gaataacctg gccagttgct 10141 cacgcctgtc atcccagcac tttgggaggc tgagctgggt ggatcacttg agcccaggag 10201 ttcaagatca gcttggacaa cacagtgaaa ctccatctgt acaaaaaata caaaaataga 10261 ctgggcacgg tggctcacac ctgtaatccc agcactttgg gaggccgagg caggtggatc 10321 acctgtggtc aggagtttga gaccagccag accaacatgg tgaaacccca tctctactaa 10381 aaatacaaaa attagccagg catggtggca cgtgcctgta atcccagcta cttgggaggc 10441 tgaggtggga gaattgcttg aacccaggag gcggaggctg cagtgagccg agattgtgcc 10501 actgcactcc agcctgggcg acaagagtga aactccatct caaaaaaaac caaaaaacaa 10561 aaaatacaaa aattagctgg gtgtggtgac atgcgcctgt agtccctgct actcgggagg 10621 ctgaggtggg aggatcactg gagcccggga ggtggaggtt gcagtgagct gagatcatgc 10681 cactgcaccc caacctgggt gacagagaga gagagagacc ttgactcgaa aaagaaaaaa 10741 acctgggcgc agtggctcac gcctgtaatt tcaacatttt gggaggctga ggaaggtgga 10801 tcacttgagt ctaggagttt gacactagcc tggccaacat ggcaaaacct gtctctacta 10861 aaaatacaaa aaattagcga ggtgtagtgg 
tgcaagcctg taatcccagc tacttgggag 10921 gctgaggcac aagaatcgct tgaacctggg aggtggaggt tgcagtgagc tgagatcaca 10981 ccactgcatt ccagcgtggg tgacagagca agactccatc tcagaaaaag aaaaaaaaaa 11041 atagaatatc cctgtagcta ctactgagtg agcacctggt ctgtgctagg tcacatgtta 11101 tttcatttgc tcatcactac atgtgtggta gggattaata tgtccctttc tcagatggaa 11161 aaacaggctg gcagagggga cacagctagc acgtggtagg attaggatca gaagccaggc 11221 ctctttgtcc tttgggccct tggtggagaa cagtgcatcc ttcagaacag tgcatcttaa 11281 gcagctccta tggctcatgg tatcccccag agtctgccga ggaccctcaa actccctcct 11341 catgcctggt gtgctgtgcc tctcctcaca gggggccgtc tgccctgccc agagctgtgt 11401 cctgatgccg tgttcaggct catggagcag tgctgggcct atgagcctgg gcagcggccc 11461 agcttcagca ccatctacca ggagctgcag agcatccgaa agcggcatcg gtgaggctgg 11521 gacccccttc tcaagctggt ggcctctgca ggcctaggtg cagctcctca gcggctccag 11581 ctcatatgct gacagctctt cacagtcctg gactcctgcc accagcatcc acactgccgg 11641 caggatgcag cgccgtgtcc tctctgtgtc cctgctgctg ccagggcttc ctcttccggg 11701 cagaaacaat aaaaccactt gtgcccactg aacactcctg gcatgtgcac tcctctggaa 11761 ggcaggtctc agaaggcaca agtgccggta tggtggcctt ggggaaggag gaggacaggc 11821 agtatgcatg gggcagagct gacatgattt agtagcagct ggatgtgaga catgcggaag 11881 gcgggggaga gatcaggatg atatacaggc tatggccaga tggcggtgtc atcccctgaa 11941 ataggattat aggaagagga tcagagcttc gaggaggatg ttgagtttag agatgttgca 12001 ttttattgga gataaaagtg tgggtgaagc caggtgtggt ggtagacacc tgtagtccca 12061 ggtacttggg aggccaaggc atgtggattg cttgagccta gtttgagacc agcctgggca 12121 acatggcaaa actccatctt tacaaaaaca aaaaacaaaa aaccaagaaa attagccagg 12181 cgtggtggca cacacctata gtcccagcta ctcagaaggc tgaggtagga ggatcaattg 12241 agcctcggag gtcgaggctg cag // LOCUS HSFGFR4G 13095 bp DNA PRI 03-FEB-1998 DEFINITION Homo sapiens FGFR-4 gene. ACCESSION Y13901 NID g2832349 KEYWORDS FGFR-4 gene; fibroblast growth factor 4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 13095) AUTHORS Kostrzewa,M. and Mueller,U. JOURNAL Unpublished REFERENCE 2 (bases 1 to 13095) AUTHORS Kostrzewa,M. TITLE Direct Submission JOURNAL Submitted (17-JUN-1997) M. Kostrzewa, Inst. fuer Humangenetik, Justus-Liebig-Universitaet, Schlangenzahl 14, 35292 Giessen, FRG COMMENT Related sequence: X57205. FEATURES Location/Qualifiers source 1..13095 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /clone_lib="Pieter de Jong's PAC" /clone="32C5" /map="q35" exon 1152..1255 /gene="FGFR-4" /number=1 gene 1152..12477 /gene="FGFR-4" mRNA join(1152..1255,3778..3921,4621..4884,4975..5055, 5167..5333,5914..6037,6550..6740,6875..7013,7367..7560, 7635..7780,7883..8004,9558..9668,9761..9951,10285..10407, 10515..10585,10832..10969,11643..11748,11878..12477) /gene="FGFR-4" intron 1256..3777 /gene="FGFR-4" /number=1 exon 3778..3921 /gene="FGFR-4" /number=2 CDS join(3831..3921,4621..4884,4975..5055,5167..5333, 5914..6037,6550..6740,6875..7013,7367..7560,7635..7780, 7883..8004,9558..9668,9761..9951,10285..10407, 10515..10585,10832..10969,11643..11748,11878..12027) /gene="FGFR-4" /codon_start=1 /product="fibroblast growth factor 4" /db_xref="PID:e1249321" /db_xref="PID:g2832350" /translation="MRLLLALLGILLSVPGPPVLSLEASEEVELEPCLAPSLEQQEQE LTVALGQPVRLCCGRAERGGHWYKEGSRLAPAGRVRGWRGRLEIASFLPEDAGRYLCL ARGSMIVLQNLTLITGDSLTSSNDDEDPKSHRDLSNRHSYPQQAPYWTHPQRMEKKLH AVPAGNTVKFRCPAAGNPTPTIRWLKDGQAFHGENRIGGIRLRHQHWSLVMESVVPSD RGTYTCLVENAVGSIRYNYLLDVLERSPHRPILQAGLPANTTAVVGSDVELLCKVYSD AQPHIQWLKHIVINGSSFGADGFPYVQVLKTADINSSEVEVLYLRNVSAEDAGEYTCL AGNSIGLSYQSAWLTVLPEEDPTWTAAAPEARYTDIILYASGSLALAVLLLLAGLYRG QALHGRHPRPPATVQKLSRFPLARQFSLESGSSGKSSSSLVRGVRLSSSGPALLAGLV SLDLPLDPLWEFPRDRLVLGKPLGEGCFGQVVRAEAFGMDPARPDQASTVAVKMLKDN ASDKDLADLVSEMEVMKLIGRHKNIINLLGVCTQEGPLYVIVECAAKGNLREFLRARR PPGPDLSPDGPRSSEGPLSFPVLVSCAYQVARGMQYLESRKCIHRDLAARNVLVTEDN VMKIADFGLARGVHHIDYYKKTSNGRLPVKWMAPEALFDRVYTHQSDVWSFGILLWEI FTLGGSPYPGIPVEELFSLLREGHRMDRPPHCPPELYGLMRECWHAAPSQRPTFKQLV 
EALDKVLLAVSEEYLDLRLTFGPYSPSGGDASSTCSSSDSVFSHDPLPLGSSSFPFGS GVQT" intron 3922..4620 /gene="FGFR-4" /number=2 exon 4621..4884 /gene="FGFR-4" /number=3 intron 4885..4974 /gene="FGFR-4" /number=3 exon 4975..5055 /gene="FGFR-4" /number=4 intron 5056..5166 /gene="FGFR-4" /number=4 exon 5167..5333 /gene="FGFR-4" /number=5 intron 5334..5913 /gene="FGFR-4" /number=5 exon 5914..6037 /gene="FGFR-4" /number=6 intron 6038..6549 /gene="FGFR-4" /number=6 exon 6550..6740 /gene="FGFR-4" /number=7 intron 6741..6874 /gene="FGFR-4" /number=7 exon 6875..7013 /gene="FGFR-4" /number=8 intron 7014..7366 /gene="FGFR-4" /number=8 exon 7367..7560 /gene="FGFR-4" /number=9 intron 7561..7634 /gene="FGFR-4" /number=9 exon 7635..7780 /gene="FGFR-4" /number=10 intron 7781..7882 /gene="FGFR-4" /number=10 exon 7883..8004 /gene="FGFR-4" /number=11 intron 8005..9557 /gene="FGFR-4" /number=11 exon 9558..9668 /gene="FGFR-4" /number=12 intron 9669..9760 /gene="FGFR-4" /number=12 exon 9761..9951 /gene="FGFR-4" /number=13 intron 9952..10284 /gene="FGFR-4" /number=13 exon 10285..10407 /gene="FGFR-4" /number=14 intron 10408..10514 /gene="FGFR-4" /number=14 exon 10515..10585 /gene="FGFR-4" /number=15 intron 10586..10831 /gene="FGFR-4" /number=15 exon 10832..10969 /gene="FGFR-4" /number=16 intron 10970..11642 /gene="FGFR-4" /number=16 exon 11643..11748 /gene="FGFR-4" /number=17 intron 11749..11877 /gene="FGFR-4" /number=17 exon 11878..12477 /gene="FGFR-4" /number=18 BASE COUNT 2417 a 3946 c 3917 g 2815 t ORIGIN 1 gaattcaagc tgtcctccta cctcagtctc ctagtaacta gaactacagg cacccgtcat 61 tgtgcccggc cattccacag ctatttacat aattaactgt tcagttacaa ttctggcaag 121 cgtgctgaag aaataacagg tcctaaataa ttacagctag tattacaaag caagcgctgc 181 atgccaggct tacacatact gctaacatgt ctactttttg caacaacctc ttgtgggtga 241 gactatcatt tgcctgtttt actgaaagga cagtgaagac tgaggccttg tgtgatggtg 301 ctgagagctg ggcctatgac gtccagccct gtagtctgtg ctctgaacca cttcaccctg 361 tttctttctc catcctcaca gtagagacgt catctaagag gagcttgcaa gtccagaggg 421 cttcctgaag 
gaagtgagct gagaataata tgaaattaac cagaccaagg ggcattccag 481 gaagagagaa tggtacgtga aactgcccag gagtagcagg gtgggagcca ggaggcgtgg 541 gtcatggcaa tctatgtata aagagctatg ggaatctata ggaaaattta agggagtggc 601 tgacacagtc agatttgtgt tgtaagtatc actccgcagg ttgcgcaggg aacaaatgag 661 ggagcagaag gaaggggttc tcctgtccgc tgcacggcac tgcgcagaag acaggggagc 721 caggcattcc ctgaagggtg aaaagcaagg agtagagctg ggtagtagac tagaatttag 781 gagcctggcc tggggcctgg gtggggcgaa agaggcggag ctgaatgggg tgtgtatagg 841 ggggttgcgt gtaggggtgt gtgtataggc tggggcgggg tcccgggagt gggctgactg 901 ggtcgggggc ggggctctcc aggtgggcgg ggatcttggc cacccctggc cacacctctc 961 tccggctcga gctggtctag gcggggcggg cccgaggggg tgtggcagga ggtgggcggg 1021 cccgggtggg gggggggggc gtggaaggag gggcgggccc gagcaggagg gggcgggccc 1081 gaggggcggg gcgggacagg aggtgggccg ctcgcggcac gccgccgtcg cgggtacatt 1141 cctcgctccc ggccgaggag cgctcgggct gtctgcggac cctgccgcgt gcaggggtcg 1201 cggccggctg gagctgggag tgaggcggcg gaggagccag gtgaggagga gccaggtgag 1261 caggaccctg tgctgggcgc ggagtcacgc aggctcgagg tgagccggaa cccttgtggg 1321 cccggctgcg ctcccagccg ccagggggcg agaggcggcg gggctacggg gactgcccct 1381 cccggcgcag gggacctggg cgtccgccgg gcggcagggg gtggaggggg cggtaaatca 1441 gtaacccgca gtgcacacag ggccttttgt cccgctccgt ccaaagagca ccccggccgc 1501 ggagctggtt actcattgcc caccgaggcg ggggcaggct ggccctgtgc agctaccctc 1561 gggacccatt gattcgcacc tccccccagg ctggcccggc aagggtgggg gaggacaagc 1621 gcgcttgtcc ctgcggctgt cttcgcgccg gcggcagaga tgagggacct gaggccccga 1681 aaagttcagt cacttagtgc ccgggggcct ccagcgcgag tgcgggaggc tgaaggagaa 1741 cccaggactg tctgatgcct aaggcaggcc ctccattccc acgtgggggg tggtcggtca 1801 gcggtcagca gccacgggtg actcgactaa ggactctgat atcagggcag cctggggtag 1861 gaataaactc cccgggcctc cccacccact cccagcccaa gctgtgtacc caaagagctg 1921 ccctccctgc caagccgagc ttggtaggga gttttaccaa ggaggatccg actggattcg 1981 agagttgagg tgggccagag acagcagtat ctgagtcagg tagagaagag caatgagggg 2041 cacagaggga tgggcaagag agcacatgtg cccagttttg aaagccaatg gcttcagcgc 2101 tcctgaaggg gcagacggtg 
tgaccaaaga gataggcagc ggcagagagg gagccctagg 2161 atgttgagct ggatcctgct gggcacaggt agccattaag ggcttgcaag ctggggggca 2221 tgacatggca gacttgcagg tttttttgtt tgttttttta ttttatttta tttttttatt 2281 ttgttttttt tgagacggag tctcactctg tcgcccaggc tggagtgcag tggcgcgatc 2341 tcggctcact gcaagctccg cctcccgggt tcgcgccatt ctcctgcctc agcctcccga 2401 gtagctggga ctacaggcgc ccgccaccgc gcccggctaa ttttttgtat ttttagtaga 2461 gacggggttt caccgtgtta gccaggatgt tctcgatctc ctgacctcgt gatccgccca 2521 cctcggcctc ccaaagtgct gggattacag gtgtgaacca tcgcgcccag ccgacttgta 2581 gttttttaaa actctgctgg aagatgaagg ttgaagagcc gagggagagg atgtttccag 2641 aggcccatgc aagagatggc catgacctgc cttgagaagg ggcaggggaa gccagatgga 2701 ctggaagtgg agtggcagtg accaaggagg aggaggtgtg ataggcttcc cacgcagggt 2761 agatccagag acaccagtgc cacccatagg cccctaggac tgcagtggtc acccgattcc 2821 tttgtcccag ctgagactca gttctgagtg ttctattttg gggaacagag gcgtccttgg 2881 tagcatttgg aagaggatag ccagctgggg tgtgtgtaca tcacagcctg acagtaacag 2941 catccgaacc agaggtgact ggctaagggc agacccaggg caacaggtta accgttctag 3001 ggccgggcac agggaggaga acattccaac actctgtgtg cccagtgccg acgcacgttc 3061 tctcttttat cctcaaaaca gtcctatgag gatataagcc agagagagac agagacaagg 3121 aattacaagt tggtgagagt caggatttga acttggctct ggcagatgga aaattagggt 3181 ctgtattctt tacaaaaccg tgtgtgcctc agatggagtt ggtgcataac aagcagaggt 3241 atccagggtc gcggtcctgc ttgccacgga aggggccgcc ttgtcagttg tgaccaccca 3301 gccctggaaa tgtcagtaat gctgtaagga gtggggatcg gatcagatgc catccagatg 3361 ctgaagtttg accttgtgtc atttttcact ttcttttttg gctcttctgc aatcaattca 3421 tttatttagc aaaaaagaaa ttatgtgtgc cgagagcatg cagaagatat gtctccgttc 3481 tctgcttccc tccaaaaaag aatcccaaaa ctgctttctg tgaacgtgtg ccagggtccc 3541 agcaggactc agggagagca ggaagcccag cccagacccc ttgcacaacc taccgtgggg 3601 aggccttagg ctctggctac tacagagctg gttccagtct gcactgccac agcctggcca 3661 gggacttgga cacatctgct ggccacttcc tgtctcagtt tccttatctg caaaataagg 3721 gaaaagcccc cacaaaggtg cacgtgtagc aggagctctt ttccctccct attttaggaa 3781 ggcagttggt gggaagtcca gcttgggtcc 
ctgagagctg tgagaaggag atgcggctgc 3841 tgctggccct gttggggatc ctgctgagtg tgcctgggcc tccagtcttg tccctggagg 3901 cctctgagga agtggagctt ggtatggctt ctgaggtggg agagggtggc aggggtggga 3961 agagtgggca ccaggagggg gctgctgggc tgagcaaagc tggaaaggat ccttgcccag 4021 gccctgagaa ggtggcggca gggcagggct caaccactga gactcagtca gtgcctggct 4081 tccagcaagc attcatctat cactgtgtct gcgagagagg actggccttg cagggcgcag 4141 ggccctaagc tgggctgcag agctggtggt gagctccttg cctgggtgtg tgtgcgtgtg 4201 tgtgtgtgtt ctgtgcactg ggtgtgtgac ctaggaggtc caggcagcat gtgtggtata 4261 agcattatga gggtgatatg ccccggtgca gcatgaccct gtatgtggca ccaacagcat 4321 gtgccttgtg tgtgtgtgtg tccgtatgtg tgtgtgtgta tgcgtgtgtg tgtgtgtgtg 4381 tgtgtgtctt ggccactgtc atgtgcacta aatgctgtgt gtgtgacatg ccccaagagt 4441 gtggcatttg ccctgggtgt ggcatccgca gcatgtggct gtgtgggtgt caaggagtgg 4501 tggctccttc agcatgcgtt gcgaagtgct tgtgccctgc atgtgcggtg tgttctctgt 4561 acacaggagg ctgcctcaga tggggctgcg gggtctgctg acctctccct ctgcccacag 4621 agccctgcct ggctcccagc ctggagcagc aagagcagga gctgacagta gcccttgggc 4681 agcctgtgcg gctgtgctgt gggcgggctg agcgtggtgg ccactggtac aaggagggca 4741 gtcgcctggc acctgctggc cgtgtacggg gctggagggg ccgcctagag attgccagct 4801 tcctacctga ggatgctggc cgctacctct gcctggcacg aggctccatg atcgtcctgc 4861 agaatctcac cttgattaca ggtggtaaga gactctagca gggagtgaag ggatgccagg 4921 ggagacagac ctgcccccct tggaccttag atgctccctc tgtccctgat gtagactcct 4981 tgacctccag caacgatgat gaggacccca agtcccatag ggacctctcg aataggcaca 5041 gttaccccca gcaaggtcag taggtctcca aggacttgtg tcccgctgct gctcatctga 5101 tcactgagaa gaggaggcct gtgtgggaac acacggtcat tctaggggcc ttcccctgcc 5161 ctccagcacc ctactggaca cacccccagc gcatggagaa gaaactgcat gcagtacctg 5221 cggggaacac cgtcaagttc cgctgtccag ctgcaggcaa ccccacgccc accatccgct 5281 ggcttaagga tggacaggcc tttcatgggg agaaccgcat tggaggcatt cgggtgagtc 5341 tctgggttcc aagaccgtct gctcccccat tttcattcct tcatcagtcc cctcatacct 5401 acaagcatac ctataaatca atcgaatgag tgaagcgatt gcggggcccc ggaaggagcc 5461 ctggactgtg gacctgggca gctctggttc cccttctgct 
actctctggc aagtgactta 5521 acctctcagc ctcagcaact ccctttgtaa agggagaaga atcactgact ggttggtctg 5581 cataagcctt agcatctcat cgtcttgatg agaccctgca gggtcggctc catgctgtca 5641 tgaggcaact gagcctcaga gaaggcaagg gttggctcaa agtagcacag ctagggagag 5701 ggagagctaa aattccaaag gctcaaaccc aaggctcaag cgccctgggg agcctactcc 5761 tttgtgccat agtccttggc ctgggcctga tgttctcagg gcctagagag cttgacaaga 5821 gccctgtggg caggatgagg atctagcctc ctggtcctct ggcccccttg gtggacatgg 5881 tccggtggtc ccggacactc tctctgcctg cagctgcgcc atcagcactg gagtctcgtg 5941 atggagagcg tggtgccctc ggaccgcggc acatacacct gcctggtaga gaacgctgtg 6001 ggcagcatcc gctataacta cctgctagat gtgctgggtg agcgcggggc tgggaacagg 6061 ggaggcctga cccattttgg gctcagttgt gccctcttgg tggggtctag tctggcaggc 6121 aggatggact cagatgagtc aggcagcttg gtgagcaggt gggtcagggg aaagcacagg 6181 ggttagtgtg gggctggagg agcagaggtc tgccaagagg aaaaacaaga aggacatcca 6241 ggcagagggc gcagcccgag cggagggcct gagtataaca aacgccctgc acttgcaggc 6301 cagcatattc gtagggcgtg gcgtttatat ggggagccag gtggtggagg gttttgaatg 6361 ctaggctgag atgttgtcct tgacccgaag caatagggag ccagggaagg tttaagcagg 6421 gtaagcagga gacagacaag aagctgcaga aaggtccctc ccttgaactt gaggaaggct 6481 ggagggaggc aaacagggtg cttctatggg tgccagtggt cagggttgac tgtctcgccc 6541 ggtccccaga gcggtccccg caccggccca tcctgcaggc cgggctcccg gccaacacca 6601 cagccgtggt gggcagcgac gtggagctgc tgtgcaaggt gtacagcgat gcccagcccc 6661 acatccagtg gctgaagcac atcgtcatca acggcagcag cttcggagcc gacggtttcc 6721 cctatgtgca agtcctaaag gtaaaaggtg caccctgctg cagcctgggc cccattcttc 6781 tcccaccttg ggttgggggg ctccccagct tccctgttgg ccacagtgtg gccccaggcc 6841 ctgctgtgac cccagagcat gtcccccacc ccagactgca gacatcaata gctcagaggt 6901 ggaggtcctg tacctgcgga acgtgtcagc cgaggacgca ggcgagtaca cctgcctcgc 6961 aggcaattcc atcggcctct cctaccagtc tgcctggctc acggtgctgc caggtgagca 7021 cctgaagggc caggagatgc tgcgagatgc ccctctgggc cagcagtggg ggctgtggcc 7081 tgttgggtgg tcagtctctg ttggcctgtg gggtctggcc tggggggcag tgtgtggatt 7141 tgtgggtttg agctgtatga cagcccctct gtgcctctcc acacgtggcc 
gtccatgtga 7201 ccgtctgctg aggtgtgggt gcctgggact gggcataact acagcttcct ccgtgtgtgt 7261 ccccacatat gttgggagct gggagggact gagttagggt gcacggggcg gccagtctca 7321 ccactgacca gtttgtctgt ctgtgtgtgt ccatgtgcga gggcagagga ggaccccaca 7381 tggaccgcag cagcgcccga ggccaggtat acggacatca tcctgtacgc gtcgggctcc 7441 ctggccttgg ctgtgctcct gctgctggcc gggctgtatc gagggcaggc gctccacggc 7501 cggcaccccc gcccgcccgc cactgtgcag aagctctccc gcttccctct ggcccgacag 7561 gtactgggcg catcccccac ctcacatgtg acagcctgac tccagcaggc agaaccaagt 7621 ctcccacttt gcagttctcc ctggagtcag gctcttccgg caagtcaagc tcatccctgg 7681 tacgaggcgt gcgtctctcc tccagcggcc ccgccttgct cgccggcctc gtgagtctag 7741 atctacctct cgacccacta tgggagttcc cccgggacag gtgcgctgag ctgtgtgggg 7801 gcagggacgc gggcgccggg ttgcagcccg ccctccgcag gagtgactcg gaggtctgag 7861 gctggacttt ctccatctcc aggctggtgc ttgggaagcc cctaggcgag ggctgctttg 7921 gccaggtagt acgtgcagag gcctttggca tggaccctgc ccggcctgac caagccagca 7981 ctgtggccgt caagatgctc aaaggtgagt gtggcccggt gtggtggctc acacctgtaa 8041 cgccagcact ttaggaggct gagggtggga ggatcgcttg aatccaggaa ttcgaggcca 8101 gcctgggcaa catggcaaga cttcatctct acaaaaaaaa aataagaaaa ttagttgggt 8161 gtggtggtgt gtgcctttag tctcagttac tagggaggct gaggcaggag gatcccttga 8221 atccaggagt tggaggttgc agggagccat gatcacgcca ctgtattcca gcctgggcaa 8281 cacagtgaga ccctatctga aaaaataaat aaataaataa aaataaaagg tgaacgtggc 8341 agcctggagg aggtgctatg gcattgggac taatagaagg ggctcacggt gccaccaggt 8401 gagccctgga gctgggagag gctgtgggat cccaccctta aacctgcaat tcacctctgc 8461 tcctgaccct ggcaagtgac ttctgagcct cagttttccc ttgtgtcata tggggtagat 8521 aacagtccct actcccagcc caaggattgt ggaaagtgcc tggctcatag tcagggctca 8581 ataaatcttc accactgggg tgatgatgat gagaagaatt tggtgtgaca ggcttgatat 8641 cctgtgtcag cattagtctg tgtcagcttt gacttcacat ctctttgtca gcctcacagg 8701 ccctctacct ccttccttat ggttcccccc agacacaccc tcagcctccc ttggaccctc 8761 cctaggtctg ccccccacgt ccactgctgt aggaggacag cccttctgct tgcacccagg 8821 cccagccccg gggtgctctt gctgggcact cctgcacccc acccatcagg gcctctcctt 
8881 gcagttcccc agccccctct gcaagaatgg cctccactgc tcttctgctc ctcccctcct 8941 ctctacacag ctggggccac ctggtgctcc ctgggaggca gggattgaga aatgcacatt 9001 gtgtcattgg cccagggcca caggtcagcc ccaggggctc agccagagaa gccaaagcag 9061 ccttcttccc aagctccccg gctgcacccg gcctgccgcc agctccctga attcccaggc 9121 cagttggaag ccaggccctg gtcaaacaga ccccagggcg ccagcctgct ttccgcaccc 9181 agaagctctg accccatgcg gggactaccg ctgacccctc cagcggcagc ttccttcctt 9241 ccttcctgct ccgagctctt cccctctctc ctgtgtcctg ggcctgcccg ctggaaggcc 9301 tgcctcttag atccttgata cagttgcatc cttgcaactg ctgtgacagg cagggtgtga 9361 cccactgctc tgtttcccac aagacgaacc tgaggttcag agacgctagg agactttttc 9421 aaggccacac agcctagcaa ggattcagcc ctagacctac gtagccctgg tccagtgctg 9481 cttgtcctgc acctgcctct gcatgctccc tcgtgcagtt ggagggcagc ctctcacccc 9541 gtctgctgcc cttacagaca acgcctctga caaggacctg gccgacctgg tctcggagat 9601 ggaggtgatg aagctgatcg gccgacacaa gaacatcatc aacctgcttg gtgtctgcac 9661 ccaggaaggt ggggccgagg cggggctggc tgcacgggcc gttagggtgc agagccaaag 9721 ctttggcagc ctctccacgc tccctccact ccctctgcag ggcccctgta cgtgatcgtg 9781 gagtgcgccg ccaagggaaa cctgcgggag ttcctgcggg cccggcgccc cccaggcccc 9841 gacctcagcc ccgacggtcc tcggagcagt gaggggccgc tctccttccc agtcctggtc 9901 tcctgcgcct accaggtggc ccgaggcatg cagtatctgg agtcccggaa ggtataggcg 9961 ctagggctct gagcccctct cagtctctcc agctccactc tcaggcctgt ggcattcaat 10021 gtcccgactt ctccctctct gctctttttc atgaccccac ctcagtgtcc ccaggcattc 10081 acgctttcct gcattcccca ctcgttcctc acccttcccc agaggggaga ggggacgcag 10141 gagaaggcac tccccgtttc taaaccttga cctcctcctc tgtaaagtgg gtggagggcc 10201 cctgcccccg ggcctgctgg ggggtggtgt gtgctcaact ccaggccagg tgtcctgagg 10261 cacccaagcc cccgctccct gcagtgtatc caccgggacc tggctgcccg caatgtgctg 10321 gtgactgagg acaatgtgat gaagattgct gactttgggc tggcccgcgg cgtccaccac 10381 attgactact ataagaaaac cagcaacgtg agggagatgg ggcagaactg gatgggggtg 10441 gaggggcact gggcccgggg tggcaggcac gaggacctgt gggactctgc actgaggccc 10501 tctctcccct ccagggccgc ctgcctgtga agtggatggc gcccgaggcc ttgtttgacc 
10561 gggtgtacac acaccagagt gacgtgtgag tcctgccggc ggtcactgtc ctaccccaca 10621 aaaagggcaa ggcactgccc aaagtcacgt ggccccagga gtcatgcgct cgagggctcc 10681 ttcagatttg gtctgggacc cgagtgggcc cagactccag gaggagccca ttccccaaca 10741 gctgtggtgg gtcatgtctg tggggtcccc cgtcctagcc ccggtcgtag ggagggcgct 10801 gagccacact gagccctggc cctgcctcca ggtggtcttt tgggatcctg ctatgggaga 10861 tcttcaccct cgggggctcc ccgtatcctg gcatcccggt ggaggagctg ttctcgctgc 10921 tgcgggaggg acatcggatg gaccgacccc cacactgccc cccagagctg tgaggcctca 10981 ccctgccctc gaccccactt tccagtcctc ctcctcctct gccctgacca tggcctcagg 11041 gtgtgtcccg gccagaagga caacactaac aacaactcct cgtcctcctc ctcctcttcc 11101 tcttcctcct cctcctcttc ctcctcctcc tcttcctcct cctcctcttc ctcctcctct 11161 tcctcctcct cctcttcctc ctcctcctct tcctcctcct cctcttcctc ctcctcctct 11221 tcctcctcct cctcttcctc ctcctcctcc tcctcctcct cctcctcctc ctcctcctcc 11281 tcttcctcct cctcctcttc ctcctcctcc tcctcctcct cctcctcctc ctcctcttcc 11341 tcctcctcct cttcctcctc ctcctcctcc tcctcctcct cttcctcctc ctcctcttcc 11401 tcctcctcct cttcctcctt ctcctcctgc tcctcttcct cctcctcctc ttcctcctcc 11461 tcctcctcct cctcttcctc ctcctcagcc tagtggagtg tcctggcctg gcttctactg 11521 atgaccctcc tatccctcat caaactcccc accaaactcc tccccaccca gagaaccccc 11581 ggtcctcccc ttcctcctga aggcctgagg ctccctgtga ccctccgccc cacctctcgc 11641 aggtacgggc tgatgcgtga gtgctggcac gcagcgccct cccagaggcc taccttcaag 11701 cagctggtgg aggcgctgga caaggtcctg ctggccgtct ctgaggaggt acagcccctc 11761 ccacccacca cctccctctg cctgctcccc tccaggcctc atctggcctg accgcgtgga 11821 catgcgcccc gtcccatccc gggcgctgca gaggctgacc agctccgttc cccacagtac 11881 ctcgacctcc gcctgacctt cggaccctat tccccctctg gtggggacgc cagcagcacc 11941 tgctcctcca gcgattctgt cttcagccac gaccccctgc cattgggatc cagctccttc 12001 cccttcgggt ctggggtgca gacatgagca aggctcaagg ctgtgcaggc acataggctg 12061 gtggccttgg gccttggggc tcagccacag cctgacacag tgctcgacct tgatagcatg 12121 gggcccctgg cccagagttg ctgtgccgtg tccaagggcc gtgcccttgc ccttggagct 12181 gccgtgcctg tgtcctgatg gcccaaatgt cagggttctg 
ctcggcttct tggaccttgg 12241 cgcttagtcc ccatcccggg tttggctgag cctggctgga gagctgctat gctaaacctc 12301 ctgcctccca ataccagcag gaggttctgg gcctctgaac cccctttccc cacacctccc 12361 cctgctgctg ctgccccagc gtcttgacgg gagcattggc ccctgagccc agagaagctg 12421 gaagcctgcc gaaaacagga gcaaatggcg ttttataaat tatttttttg aaataaagct 12481 ctgtgtgcct gggtcttccc tgagcaacat ggagtggggt gaggtggagg gatccctcca 12541 gcagagttct gcctacagga cacggactga gggcactgga ccaggccatg ggctccgcca 12601 cctccactgc cccaggagcc agtgtgtgcc tatctgggtc cgcctgtccc accagcccca 12661 tcttgtgtct gcgacagtgt gaatgagtat taatgggctg agtccgcatt gcactataca 12721 cggtgggact cctgtaccct ctgcacatgt gtgtgtgtgc atgtgtgccc tgcagctgtc 12781 cccaagggag ctggcagccc ccctccccca tctgctcagc attaaccaag ctgaccgtta 12841 acacagcatg aaaatctgag agccagcctt aggccgcggc ccgctcccac gctctgccgg 12901 ctcaggctgg gggcttgtgg aggccatgcc cgccccgccc tggccagtct cccgggcagc 12961 agctggttgc cgcccgcctg ggctgcagct gtccctgcct gcctggtctt ccactgagga 13021 gccgtcacag ccctgtactc agagctcctc agagtgagca gcttctcaag gctctgagcc 13081 tggaacctcc ttccc // LOCUS HSFOLA 6045 bp DNA PRI 31-OCT-1996 DEFINITION H.sapiens gene for folate receptor. ACCESSION X69516 NID g288876 KEYWORDS folate; folate-binding; gene family; membrane glycoprotein; transport. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6045) AUTHORS Elwood,P.C. TITLE Direct Submission JOURNAL Submitted (12-NOV-1992) P.C. Elwood, National Institute of Health, Bldg 10 Room 12N 226, 9000 Rockville Pike, Bethesda, MD 20892, USA REFERENCE 2 (bases 1 to 6045) AUTHORS Page,S.T., Owen,W.C., Price,K. and Elwood,P.C. TITLE Expression of the human placental folate receptor transcript is regulated in human tissues. Organization and full nucleotide sequence of the gene JOURNAL J. Mol. Biol. 
229 (4), 1175-1183 (1993) MEDLINE 93188012 REMARK Erratum:[J Mol Biol 1994 May 13;238(4):639] COMMENT Related sequence: J02876. FEATURES Location/Qualifiers source 1..6045 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human lymphocyte EMBL3" GC_signal 89..99 GC_signal 94..104 TATA_signal 197..204 CAAT_signal 296..304 TATA_signal 385..392 misc_feature 495..499 /note="Ets motif (Pu box)" misc_feature 553..558 /note="Ets motif (Pu box)" misc_feature 559..565 /note="Ets motif (Pu box)" exon 611..737 /gene="placental folate receptor" /number=1 mRNA join(611..737,2383..2556,4687..4875,4993..5128,5287..5758) /gene="placental folate receptor" gene 611..5758 /gene="placental folate receptor" intron 738..2382 /gene="placental folate receptor" /number=1 GC_signal 802..902 /gene="placental folate receptor" exon 2383..2556 /gene="placental folate receptor" /number=2 CDS join(2407..2556,4687..4875,4993..5128,5287..5579) /gene="placental folate receptor" /codon_start=1 /product="folate receptor" /db_xref="PID:e279925" /db_xref="PID:g1655592" /translation="MVWKWMPLLLLLVCVATMCSAQDRTDLLNVCMDAKHHKTKPGPE DKLHDQCSPWKKNACCTASTSQELHKDTSRLYNFNWDHCGKMEPACKRHFIQDTCLYE CSPNLGPWIQQVNQSWRKERFLDVPLCKEDCQRWWEDCHTSHTCKSNWHRGWDWTSGV NKCPAGALCRTFESYFPTPAALCEGLWSHSYKVSNYSRGSGRCIQMWFDSAQGNPNEE VARFYAAAMHVNAGEMLHGTGGLLLSLALMLQLWLLG" intron 2557..4686 /gene="placental folate receptor" /number=2 exon 4687..4875 /gene="placental folate receptor" /number=3 intron 4876..4992 /gene="placental folate receptor" /number=3 exon 4993..5128 /gene="placental folate receptor" /number=4 intron 5129..5286 /gene="placental folate receptor" /number=4 exon 5287..5758 /gene="placental folate receptor" /number=5 BASE COUNT 1507 a 1429 c 1515 g 1594 t ORIGIN 1 tttacatgtg tccaggaggt ctcgaacctc tgacctcagt gatctgtctg cctcagcctc 61 ccaaagtgct gggattactg acgtgagcca ccatgcccag cctatacctc aacttctacc 121 tatgtccacc tggctgccca taaattaacc caatagctgt cacttaagcc tactcagtat 181 gtgccaggct tttccctata caattgtgaa caattttcta 
gtgagcaaac tgaagctcag 241 taaggtccag tgacttttcc caaggttgtg caagagatgg agctctcatt gggtcccatt 301 ggcctgaccc taaagcctgg gttcttttcc accagaccta atctccatcg agctggcctt 361 atcctaagaa ccacttgggg tatctataaa atccagatgc cccctggtga tgagcaattc 421 tctagatttt gatgaaagtt gaatgtgtgg atgctggaat gagtaaatta acaagtaagg 481 agatgaatgc aagcaggaat gactaaatgg acagactcag ggagccttga agagggtggg 541 gtctggaagg gaaggaagag aggaaggaga atagctaagt agggagattt cactcagtgc 601 ttaccagagc gcgttgtcta ccctgtaccg aagacagagg ctgtggggac agcctagggg 661 cctggatcta ttgcctactt agagagaggc caactcagac acagccgtgt atgctcccag 721 cagcaacgga ggttcaggca agatgcccga aggagggaag ggtgacaagg gcagtgggga 781 gacttggaga gtttgtgcag aggggaggaa cacacctttc tttctgtatt gtattgtatt 841 gtattgtatt ttttgagaca gagtctcgct ctgtcaccca ggctggagtg cagtggcacg 901 atcttggctc actgcaacct ctgcctcctg ggttcaagtg attctcctgc ctcagcctcc 961 tgagtagctg ggattacagg tgcccaccac ctcgcctggc taatttttgt atttttagta 1021 gagacggggt ttcatcatgt tggtcaggct ggtctcgaac tcttgacctc aggatccacc 1081 cacctcggcc tcccaaagtg ctgggattac aggcgtgagc cactgtgccc ggccacatct 1141 ttctttagaa agatcatgct ggctcctgtg gggaaggcaa cttgaagggg aagaaattgg 1201 aagagggaag acttgacact aaacatagag ccatggtcag taagtttttg gagggacctt 1261 taccagagag ggagaggaag cagtatccat tatccactct ttagtgacag agcccaaagg 1321 aacagctctg taatggctgg gggtgggggc aagggcagca atgaacagag gaggcgaggg 1381 ctgctcctgt tgttgagtgc cttccaggac actaagtgct ggtctggctg ctccaacatg 1441 ggaaactttc atgttgtctc tcagatctaa gggtctctct gcatttttaa aaaaattatt 1501 ttgaattttt taaacttcta tcgtttttca ggtggttttg gttgcataga taagttcttt 1561 agtggtgatt tctgaatttt agtgcaccag tcaccggagc agtgtacact gtactcagta 1621 tatagtcttt tattcctcac cccctcctca ccttgcccca ccttgcccca cccccagtct 1681 atatcattct catctgcatt tgttttttga gaggtagggt gaggtctgct gcattagccc 1741 agagacagct gaggaggctg gaaaggaaaa tggctttcag agaataggga aggaaaaagt 1801 ctgaggaagc aaaactactt aaaaaacact gctctttcca tttccgtatc atttaaccaa 1861 acaccatcct gtgggttctg tcacttgatc ctcatgctga tctgagaact cggcagagct 
1921 gggtcactgc cttccccaag gctaacctgg tttctaagct ggcacaagac cccaggtgat 1981 ctatcatcta accacagagc ccttgctgga cagagggtga agacattatg tgtccctggc 2041 tctgttcagg gaagcagagg atagacctga aggaatttaa ttacttgtca tagataaagc 2101 ctagagaaaa ggtgagggcg caaagtatct cctccaaatt gggaagactc ccccagggct 2161 cactcactaa cgccagttct cctgagcctt taaagcctgg gggtgaggga gccctcctgg 2221 tcaaccctct ctaccctagc ctcagggagc ttcagggccc cagcattgaa ggaacagggt 2281 ctgacctcat ttgccaccgt agggatgggg agactgaggc aggaggtgaa tggctcccag 2341 cttggagccc tttcccctca ggacttggtt tccctaccct agctccgcct gcagggacag 2401 aaagacatgg tctggaaatg gatgccactt ctgctgcttc tggtctgtgt agccaccatg 2461 tgcagtgccc aggacaggac tgatctcctc aatgtctgta tggatgccaa gcaccacaag 2521 acaaagccag gtcctgagga caagctgcat gaccaagtac ggctggagtg tgcctctgct 2581 aaggaggggg cttgttctaa cagggaggag aaagtcagga tggtgggaga gggattgagg 2641 ggtcagatac ctccacatcc tgaagttttc ctgtgggaga agatgaaggt ggagtaggaa 2701 gagtggctga ggtgatttta ggggggccct ccccggaggt ggataccatg ttgacaatga 2761 tattgagtcg tcattcgatg ggcacctgtc agttgtcatg tgttttttgt acaaaatttc 2821 atattcccag gggttcttgg atgtaggtag atgatattct ccacattaca caagtaagtg 2881 aaaatgaggc tcacagaagc acatgggccc acacaggagc tggacatgaa tggcccctgt 2941 ggagggtagg aataggagtg ggttaggctc ctcctttggt gggtgacaga ctagggagtc 3001 gctggctttt cccaccctca ctaagtgcca tttccatgga gtgccaaggg agaagaggag 3061 gaggcccttc tggctgtaat ttgaggcaca ggggctggac attcacacag tctatataca 3121 tgtatgccag gggattgcag ccttattaga ctcaatgtct cccttttatg acagattttt 3181 ttttagattc tcttttctat cctgcaatga aattcaaaga acctatttgt atgcataatt 3241 tttgcaaata tcaacataat gctctgtaat ataaaggaga aacacacaga agtaacttgt 3301 aataaaataa tatatatttc aatatgtcag ttctctggta tgactacatt agaaggcatt 3361 aggaaatagc acatgcttgc ttttgtcata aaatcattat aagtggggag ctacaaatga 3421 agattgatac aggtatattc tattggtgac ttaaacacta taagatacta taatataagc 3481 aatgttgctg ttgatgtcat acttttctaa aatagtgaat aattctagta gaataaacca 3541 catagtacag ccttctattt catggtattt gcagtcctgg aaaacccagt ctagattaaa 3601 
atcttgtgaa aacatgatgt gcctatatgt aagatagaga tacatttaaa aagtagggct 3661 tttgcttact tttttttttt tttttgagaa ggagtctcac tctgttgccc aggctggagt 3721 gcaatggtgt gatctcagct cacggcaacc tccgctccgg ggttctggtg attctcctgc 3781 ctcagcctcc caagtagctg gggcacccgc caccatgcct ggctaatttt tttgtatttt 3841 taccagagac gaggtttcac catgttggcc agggtggtct caaactcctg acctcaggtg 3901 gtagacctgc ctcggcctcc caaagtgctg gggttacagc tacgagccac cacgcccagc 3961 ctggcttttg cttacttttt ccagcaaata ttttcatggg cctactatgt gtctggcact 4021 gcctagccac tgggacacag atcttactac atgccaagaa gaataaatat attcaaactc 4081 cattacacca tggctcgctc cctctcttcc tttgctcaaa atgtcaccaa tgtggcattt 4141 cctaacctat ttaaaatttc agcaattcca cattaccatt tcctgctcat atcatttaac 4201 ttttcccata tcatttatca tatcctgaca taccatatat atatattttt taagtttctt 4261 actgtctgtc tcctctgacc agaatgaaaa ttccatgaat acatatttct gtctgttttg 4321 ttctcttctg tattcccagc atctataacc gtgactggca taaagtaggt gctcaataat 4381 ttttaaatga gtaagtgaaa ggacttttta tgaagtgtta catacctcat aaatgataac 4441 tattattccc aagacagggt ttcccctggg ctcccacagt cccctgatca cacggttgta 4501 attgtgtttc ctccaagaaa ggggttcctg gaggcctagg agagagggac catcatctgg 4561 gaacctgagt gttctcagga caacctgcct agggcagagg agtaagaacc aaatggggga 4621 gagacacgag gtggcaggag gaggagggta tggggaggca cttagtcctg tgtcttcccc 4681 acccagtgca gtccctggaa gaagaatgcc tgctgcacag ccagcaccag ccaggagctg 4741 cacaaggaca cctcccgcct gtacaacttt aactgggacc actgcggcaa gatggagccc 4801 gcctgcaagc gccacttcat ccaggacacc tgtctctatg agtgctcacc caacctgggg 4861 ccctggatcc agcaggtagg gtgtctcccc cccacccacc ccagcagact gccatccccc 4921 tcagtcactt caaggcgatg gctgccagca tccctggctg agaggagccc tgcctcccca 4981 cctcccaccc aggtgaatca gagctggcgc aaagaacgct tcctggatgt gcccttatgc 5041 aaagaggact gtcagcgctg gtgggaggat tgtcacacct cccacacgtg caagagcaac 5101 tggcacagag gatgggactg gacctcaggt gagggtgatt gagttggggt taggaaaaag 5161 gagattgagg tagggtttgg aaaatcctca aggatttggg gtggggtgaa gatttctggg 5221 ggtggccaga aatgagcttt gggcccaggg gctgaaagtc tgtgtccacc atgcctctcc 5281 ctgcaggagt 
taacaagtgc ccagctgggg ctctctgccg cacctttgag tcctacttcc 5341 ccactccagc tgccctttgt gaaggcctct ggagtcactc atacaaggtc agcaactaca 5401 gccgagggag cggccgctgc atccagatgt ggtttgattc agcccagggc aaccccaacg 5461 aggaagtggc gaggttctat gctgcagcca tgcatgtgaa tgcaggtgag atgcttcatg 5521 ggactggggg tctcctgctc agtctggccc tgatgctgca actctggctc cttggctgag 5581 ttcagtcctc ccagactacc tgccctcagc ttggataacc aggctgggct cagctcagct 5641 cccacaaatg acagcccctt aagcatgctt ctattagtca cctaaccctc tgtcacccag 5701 tctgttgctg ctccatggtg gggccaagag tcacttctaa taaacagact gttttctaat 5761 aattccatgt ctgtggaatt gttttggttg tgagtttgtg ggtgggtggg agacagattc 5821 tacggcttct ggattccttc aaattagagc aactagacct gtgtttgaat cccagctctg 5881 ctactttgtg aaaatgaaac agttgcttca ctgctctaaa cctttgtttc cttctggata 5941 aaaagaggat aatagtcctg ccctcaggct tgtgaaatat acaggagctg ttgcctagaa 6001 agcccgtgag cccagtgtct gcacattaat agtcttcagg gatcc // LOCUS HSG17G 5443 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens G17 gene. ACCESSION X80700 NID g625185 KEYWORDS G17 gene; PBX2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5443) AUTHORS Aguado,B. and Campbell,R.D. TITLE The novel gene G17, located in the human major histocompatibility complex, encodes PBX2, a homeodomain-containing protein JOURNAL Genomics 25 (3), 650-659 (1995) MEDLINE 95278934 REFERENCE 2 (bases 1 to 5443) AUTHORS Aguado,B. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) B. Aguado, MRC Immunochemistry Unit, University of Oxford, Dept of Biochemistry, South Parks Road, Oxford OX1 3QU, UK REMARK revised by [4] SS REFERENCE 3 (bases 1 to 5443) AUTHORS Monica,K., Galili,N., Nourse,J., Saltman,D. and Cleary,M.L. TITLE PBX2 and PBX3, new homeobox genes with extensive homology to the human proto-oncogene PBX1 JOURNAL Mol. Cell. Biol. 11 (12), 6149-6157 (1991) MEDLINE 92049345 REFERENCE 4 (bases 1 to 5443) AUTHORS Aguado,B. 
TITLE Direct Submission JOURNAL Submitted (16-JAN-1995) B. Aguado, MRC Immunochemistry Unit, University of Oxford, Dept of Biochemistry, South Parks Road, Oxford OX1 3QU, UK FEATURES Location/Qualifiers source 1..5443 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="lymphoblastoid cell line-ICE5" /clone_lib="cosmid" /clone="COS E91" /chromosome="6" /map="6p21.3" exon 1..506 /gene="G17" /number=1 mRNA join(<1..506,1405..1478,1697..1944,2045..2235,2419..2554, 2806..2959,3300..3388,3511..3597,3727..>5443) /gene="G17" /product="PBX2" gene 1..5443 /gene="G17" CDS join(286..506,1405..1478,1697..1944,2045..2235,2419..2554, 2806..2959,3300..3388,3511..3597,3727..3819) /gene="G17" /codon_start=1 /product="PBX2" /db_xref="PID:g634053" /db_xref="SWISS-PROT:P40425" /translation="MDERLLGPPPPGGGRGGLGLVSGEPGGPGEPPGGGDPGGGSGGV PGGRGKQDIGDILQQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSI RSSQEEEPVDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGGVSPDNSIEHSD YRSKLAQIRHIYHSELEKYEQACNEFTTHVMNLLREQSRTRPVAPKEMERMVSIIHRK FSAIQMQLKQSTCEAVMILRSRFLDARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEA KEELAKKCGITVSQVSNWFGNKRIRYKKNIGKFQEEANIYAVKTAVSVTQGGHSRTSS PTPPSSAGSGGSFNLSGSGDMFLGMPGLNGDSYSASQVESLRHSMGPGGYGDNLGGGQ MYSPREMRANGSWQEAVTPSSVTSPTEGPGSVHSDTSN" intron 507..1404 /gene="G17" /number=1 exon 1405..1478 /gene="G17" /number=2 intron 1479..1696 /gene="G17" /number=2 exon 1697..1944 /gene="G17" /number=3 intron 1945..2044 /gene="G17" /number=3 exon 2045..2235 /gene="G17" /number=4 misc_feature 2228..2235 /gene="G17" /note="homeobox in exon 4" intron 2236..2418 /gene="G17" /number=4 exon 2419..2554 /gene="G17" /number=5 misc_feature 2419..2554 /gene="G17" /note="homeobox in exon 5" intron 2555..2805 /gene="G17" /number=5 misc_feature 2806..2853 /gene="G17" /note="homeobox in exon 6" exon 2806..2959 /gene="G17" /number=6 intron 2960..3299 /gene="G17" /number=6 exon 3300..3388 /gene="G17" /number=7 intron 3389..3510 /gene="G17" /number=7 exon 3511..3597 /gene="G17" /number=8 conflict 3576 /gene="G17" 
/citation=[3] /replace="c" intron 3598..3726 /gene="G17" /number=8 exon 3727..5443 /gene="G17" /number=9 conflict 4544 /gene="G17" /citation=[3] /replace="t" conflict 4657..4661 /gene="G17" /citation=[3] /replace="gt" conflict 4785..4787 /gene="G17" /citation=[3] /replace="aggt" conflict 5196..5198 /gene="G17" /citation=[3] /replace="ga" conflict 5378..5380 /gene="G17" /citation=[3] /replace="ggcta" BASE COUNT 1163 a 1610 c 1388 g 1282 t ORIGIN 1 cctccctcct ccctctctct cacacacacc cccgcttggg cctcctctct ctctccggct 61 ccattttctc cgccgccggg ggccggggtc tcctgtgggg ggcccagccg gtatcccagg 121 tctcccttca gtgccggggt gaacccccgg gggagccggg agccgggggc agacgggcgg 181 gggttggggc ggagggagca gcggccccag cgagtttggg gggagaagta accaggcggg 241 gggaggggcg gagcagggag ggggcctcag ggcccccccc cagctatgga cgaacggcta 301 ctggggccgc cccctccagg cgggggccgg gggggcctgg gattggtgag tggggagcct 361 gggggccctg gcgagcctcc cggtggcgga gaccccggtg ggggtagcgg gggggtcccg 421 ggaggccgag ggaagcaaga catcggggac attctgcagc agataatgac catcaccgac 481 cagagcctgg acgaggccca ggccaagtga gtgcccccac tccgggaccc cacacagacc 541 cagcaaaccc cgttcacatg ttctgaatct tctgggagcc ccccccaact ccagggccct 601 ctccaggatc caacagctct cttctctcct tattcctggg agcccataga aaagtgatcc 661 ctctcaaacc tcccttcacc cccaggccct gaaaccttca cagagggaac ccccggtggc 721 ccggctcccc actcctaacc ttttgccgac ccctgcagtc tcctggaaca gccccatccc 781 cgggagcccc ctctggctcc cagactaaga aactgttctt gggctacgtt atcttctccc 841 ctaactctcc acccagcccc ctcattctct ccagatgtgg agacctccac accctctcca 901 gagcccctaa agctcctctc cactgctcag ccagacacta ggtgcatcaa agcctcccac 961 ctgctcagcc ccaggacccc ttcacacacc ctacactgat ctccccagtt agctcggcac 1021 ccccagcccc actctgccac ctcaaactct gactcttctc aaccccagcc tctgtctctc 1081 tccctctgaa acctaccaag tcactttcct ttctccatcc actcccagat tcctcctcct 1141 acctttctag accatctccc aaagcccgca gcctttaacc tgctgcctgc atcttccctg 1201 tgtctccctg aagctgagga gcttccccat gctctgggag ctgatctttt cccaagaact 1261 cctcattcca cccccaactc attccacccc caatccgctt cctccctccg cagactgacc 1321 
ctcctccctc cttgttctca ggccccctgc tctgtttctc tagctcctca acttttctct 1381 ttccccactc ccactcctcc caaggaaaca cgccctaaac tgccaccgaa tgaagcctgc 1441 tctctttagc gtcctgtgtg aaatcaagga gaaaactggt atgtgggccc cccccggatt 1501 gctcaactct gggaacagaa ccctgttcat tatagggcta gagtgtgaca acttggggcc 1561 ctgaggaaag taaggagtca gggggactgg ggaaggaacc aaagcctggg aacttggctc 1621 tccaggaagc accaggagga ctgagcactg ggtattgggg tctctgggtc cctaagtcca 1681 ctcgcctgca tgctaggcct cagcattcgg agctcccagg aggaggagcc ggtggaccca 1741 cagctgatgc gcttggacaa catgcttctg gcagagggtg tggctgggcc cgagaaaggg 1801 ggcggctcag cagcagcagc tgcagccgct gcagcctctg gtggtggtgt gtcccctgac 1861 aactccatcg aacactcgga ctatcgcagc aaacttgccc agatccgtca catataccac 1921 tcggagctgg agaagtatga gcaggtaagg agaggaggct tgggtgggtg gagggaaggg 1981 ctcttgcagg ggaatcccat ggtcaaaggg ctcctcctca ccagcccact ggcccccact 2041 acaggcatgt aatgagttca cgacccatgt catgaacctg ctgagggagc agagccgcac 2101 caggcccgtg gcccccaaag agatggaacg catggtgagc atcatccatc gaaagttcag 2161 cgccatccag atgcagctga agcagagcac ctgcgaggct gtgatgatcc tgcgctcccg 2221 tttcctggat gccaggtggg cccagggacc ccaggctggc ccccagcact gggctccttc 2281 ccattcctct ccaagaccct gagctgccat gctgcacaac atggtactcc atgacaatgg 2341 tgactctggg gtcatgccat gtgacagccc tgccaggaca tcaacatcct cctcactgct 2401 cttctccctc ctctgtagac gaaagcgccg taacttcagc aaacaggcca ctgaggtcct 2461 aaatgagtat ttctactccc acctgagtaa cccatatcct agtgaggagg ccaaggagga 2521 gcttgccaag aagtgtggca tcaccgtgtc tcaggtatta tggaggttgc gggaggagtt 2581 gtcaggcaaa gtgcacgcat ctcagctagg tgcagtggtg tgttcctgta atcccagcta 2641 ctagggaggc tgaagtggga ggatcacttg aattggagac cagcctgggc aacagcatag 2701 tgagaccagg aagcaaaaaa aaaaaaaatg ctgtcactca catcttattc agtgaaggac 2761 ttcagaggca aatgtttcta cctgaccctc ctttctgccc cacaggtctc caactggttt 2821 ggcaacaaga ggattcgcta taagaaaaac atcggaaagt tccaagagga ggcaaacatc 2881 tatgctgtca agaccgccgt gtcagtcacc caggggggcc acagccgcac cagctccccg 2941 acaccccctt cctctgcagg tggatcccac tgtcaccccg gctgactgtt ttgcacactt 3001 cctgcttttg 
ttcccacttc ctatctaggc aggatcatag cagagagggg gccttttggg 3061 gtgagaggga ccgagctgag ataggctgga gatgtcaggg gacagaggcc attccagtga 3121 tcttagttct gcctttcttc ccacgggtgg ccaaggaaca gcctgctctt tctgtgtgtt 3181 ggaatgttat tttgtggata attggagtat agtagcatgt ccccacaaga gttgagagtt 3241 gtggttcatc ctctaccatc acgggctcta ttacactctt ccctctctgc ccccacaagg 3301 ctctggcggc tctttcaatc tctcaggatc tggagacatg tttctgggga tgcctgggct 3361 caacggagat tcctattctg cttcccaggt cagatgccca tctcctctcg aatagggctt 3421 tccccaactc catttcctct actttaggat acaagacctc tttcctctga ggcttctctt 3481 cactgtcata ccttcctctg ctgcctgcag gtggaatcac tccgacactc gatggggcca 3541 gggggctatg gggataacct cgggggaggc cagatgtaca gcccacggga aatgagggtg 3601 agtggatcct gaagctcctc tctgtccagt tctcacagga cagaggggca ttttccctag 3661 taatgttgtg cccacacagg gttcccaggg ctttctctgt tttctgtact ctgtctctcc 3721 tttcaggcaa atggcagctg gcaagaggct gtgaccccct cttcagtgac atccccaacg 3781 gagggaccag ggagtgttca ctctgatacc tccaactgat cttgcccctc agggtcacag 3841 gggtgggggc tctcacaagg cgacttgaag aggacgcagg cttccagagg acaaacccca 3901 atacaggaga agcacaagac agagaagggc caatggggtc atcccctccc taacgagact 3961 ctctgtgctg ggggtgctaa ttacatggca ggaagaatgg ggcctctaag gggagtgtgg 4021 ggtctgtctc tccctttttt ccatcttttt cctctctcgc tttctttctt acacagaaac 4081 atacacatac cgagaaacct atttctcaga cccctttttc tcctctgtct ttctctctcc 4141 ctctcccaca cctcacacac acatactccc acttgcaact attctgtttc tctcctgggc 4201 tcccccactt tcccttcccc accccacttg tatgctctgg aatctgtgga gacgccagcc 4261 ctgcccaatc agagatgcca aaaatgggga catgacttct ggacagagga catgggccac 4321 gcccccatgc atccccaccc ccgcccctcc ggacggctta cttacctcat acgcagctca 4381 tcttaaacca atagaatcgc tcggtggacg agagtgtctg actcagatat ctacctcgga 4441 gggagtttct gctactttag ggaattattg actgggcttt ggggttgaac tttttttttt 4501 ttaaagaaag aaaaagaaac cctgggatcc atctgttttt tttgttgttg ttgttgtttt 4561 tgttgttgtt ggtggtggtg gtggtggtgg ttcttaattt ttaatttagt ttggggaagt 4621 agcttgtttt tttttttata aatatgttga tttcttgttt tctttttttt ttatttctta 4681 ctttcccata ttaggggtga 
tagccaaagg ggttctggta agagaaaggg ggacaaacag 4741 aactggtaaa gaggcccccc tggctccagg cctgtccatc aggaagtaaa ttttacaggg 4801 caccaagctt tgccccctaa aatcccttag gtgttctttg ttcatgcagg caggtttctg 4861 ccgcatttga tgtggaggca gtgaagggct tgccctgctg gcctctcatc ccccttcttc 4921 ccacaaccct tgggcagggc tggactcagt aattttgagg aaattgaaga tgccatcttc 4981 ccctgtgagt gacatgtctt taatttttta aaaaactact atttgaaaat tggaggggga 5041 agaatgggaa gggagttatt gccaaatatg ttaaatatgg gttggggtgc ttgtatatgt 5101 atcttcctca atttccccat aaatgaggta tctttttgtc acaccaaaat caaggggtag 5161 ggagagggag gaggttgcaa aaagccagat gtgggggaaa agtaacatca acactgtccc 5221 atcctcagcc ctgaactagc taccatctga tcccctcaga cattctcagg attttacaag 5281 actgtcagag tggggaaccc ctcccattaa agatccgggc aggactggga caggttggaa 5341 gtgtgatggg tgggggggtg ggaggcatgg gccgggggca gttctctcct cacttgtaaa 5401 cttgtgtagt ttcacagaaa aaaaacaaaa tgcagtttta aat // LOCUS HSGCAP2 3600 bp DNA PRI 17-AUG-1996 DEFINITION H.sapiens GCAP-II gene. ACCESSION Z70295 NID g1495450 KEYWORDS GCAP-II; guanylyl cyclase; uroguanylin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3600) AUTHORS Maegert,H.J., Hill,O. and Forssmann,W.G. TITLE Structure of the human uroguanylin / GCAP-II gene and expression within the gastrointestinal tract JOURNAL Unpublished REFERENCE 2 (bases 1 to 3600) AUTHORS Pardigol,A. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Andreas Pardigol, IV - Molecular Biology, Lower Saxony Institute for Peptide Research, Feodor-Lynen-Strasse 31, Hannover, Lower Saxony, 30625, Germany FEATURES Location/Qualifiers source 1..3600 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda FIX II, Stratagene, Cat. No. 
9462039" /sex="Male" TATA_signal 950..954 exon 979..1110 /number=1 5'UTR 979..1020 /evidence=experimental gene 1021..3167 /gene="GCAP-II" CDS join(1021..1110,2251..2437,3106..3167) /gene="GCAP-II" /codon_start=1 /product="uroguanylin" /db_xref="PID:e233860" /db_xref="PID:g1495451" /translation="MGCRAASGLLPGVAVVLLLLLQSTQSVYIQYQGFRVQLESMKKL SDLEAQWAPSPRLQAQSLLPAVCHHPALPQDLQPVCASQEASSIFKTLRTIANDDCEL CVNVACTGCL" intron 1111..2250 /gene="GCAP-II" /number=1 exon 2251..2437 /gene="GCAP-II" /number=2 intron 2438..3105 /gene="GCAP-II" /number=2 exon 3106..3390 /number=3 3'UTR 3168..3390 polyA_signal 3374..3379 BASE COUNT 804 a 1030 c 1056 g 710 t ORIGIN 1 ttctctctga cccaaccttc aaccactggg gcagtaggtt atgcattgcc tagaaggcaa 61 ctgtaagatt caggggctgc tatggatgca ttacccaaga gataagggca acctcaggtg 121 ggccggggcc ctcacagcaa ccttacgaga ggggttgtat catgcccatt tgatggaaga 181 gaaaactaag attccattag aattgatgtg acgtatccta aatcacacag ctgcagcagg 241 actaggattt gaatgcgggt caatctgcct ccaacactac tgccttttcc tccatgctgg 301 attaggctat tggcttctga aagcacagac aggtgagatg ggtcaaacat ggcctgaagc 361 acccaggcac aggtgggaga tggcaagcag aggcaggttc aaaacgtggg gccaggactg 421 gtctgaaaga tagtcgttaa atagcactga ggagagggag tgagagagat gtgggtcttg 481 ggggagtggg gtcttgagaa ccctgaaatc agcagtgtct gcccactttg ccctggacca 541 gcctctctgt ctgtatcctt cccccagaga agggaccatg ggctgacctg ggccaaggac 601 acctggctgg agccccttct ctgacattaa ctagctccgt gacctgggga aagccaccct 661 ccctcctgag cctcactttt cctgtctgta aaatggcctc cccgtcatca cacccacaca 721 ggtgtcttga ggagtgaatg atggtgtgct ccaaggcccc catggagagc ccagcacatg 781 gcaggcactc ggtattcaag cactgtgttc ccatagactt tgggctcctg ccctcctcgc 841 ccccaacccc cttcctagga accagccccg gcttgtcaga gagccagcta gctgtcaacc 901 ccaccccagc tggccagtta atgagtgacc aggactcctg agcctgacct ataaggaggt 961 gctaggcagg gacacagatg ggagacggtg gacagcggca gggggaaccc agggagcgcg 1021 atgggctgca gggctgcatc agggctcctg ccaggagtgg ccgtggtcct cctgctgctg 1081 ctgcagagca cacagtcagt ctacatccag gtgagtccct tggccagcgt tccctttgcc 1141 tgaagggccc 
catggtggga ggctaggctg gagagggtgt actgggaatt cagaggggca 1201 ccgggggagc acggggcccg gggctcagcc caacacccat cacagcccaa agaccaaggc 1261 agctggtggg tgggggcagc cagatgtttg atggcagtga tgatggcaat gacaacacca 1321 aactgattca gcgctgctcc cctcaaaggc ccgtgctggt cacagaggtc agggtgtgga 1381 acagaaggct cctccttgca tgggaggagg ccagaaaggg aaggagacac cctgttatgc 1441 tcccacagtt ccgtatattc acccacatgt aggtggagga ggacaggctc cagaaaaatt 1501 ttcctgagaa gtaacccttg agctggcatt gcaagatgaa caggaggttg ccagacacag 1561 atgatgggga aactgttcca ggccaagggg cacagcatgt gcaaagacca gatagaagac 1621 tatgggggtt ccagtgccct tccagggggc atgggagagt ggcagtagtc gtcacagcag 1681 ggcctgtctg ccacgtcaca ggggatgctg ctggcctcgc ccagcatagc agagacgtct 1741 gtggccctgg gtgtcacatc cctgactctg ggcatggtct ccttgggaag aacaaagtgg 1801 gtcccagaga ggggtgcagg agacttggcc tccccaacag ccctggcctc agagaccagc 1861 tacttggggc tgccatcact ctgccatgtg agctcaggca ggcgtactca gttagtgatt 1921 gtgccctggt accaggcccc cacgggagat gtcagatgag ggaactgagg ctgagtgggg 1981 aacagacagg cagccggtca tgggcagagg tgggatttga accagggctt gtctgtctct 2041 aaagcctcga accgctgtgc tccacagccc cgctgcttct tcctcacacc tctcaccgca 2101 gcggctcttt ccagtgtggc cctggacttc ccatagagac ggggaaggct ctggaggatc 2161 tgcagggttt gagcaaccct gggtgtcatg gaagtgcctg cccctggacc aggtcttgac 2221 ctgctcctat ctcctctccc ctgtctgtag taccaaggct tccgggtcca gctggaatcc 2281 atgaagaagc tgagtgacct ggaggcacag tgggcaccca gcccccgcct gcaggcccag 2341 agcctcctgc ccgccgtgtg ccaccaccct gctctgcctc aggaccttca gcctgtctgc 2401 gcctcgcagg aggcttccag catcttcaag accctgagta agtgcccccg ctccctgcag 2461 aacctcgctc tgtctcctcc cacgcccagg ctcctccacc tgggttgttt ctcttaagca 2521 cccagcggct gaggatgggg agattcaagt aactcaggcc ctgtcccact ggctcccagg 2581 actccccggt ttgggcactg tgagtcaggc atgcctggaa cctgcttcgt ccccttagtc 2641 cacgccaggc acagacccat caacagacaa tggctgcgtg gagcagagag gagagctggc 2701 tgacatctgc gatgggacag gggtgtgaaa ggcttccaag aggaggtgtc agccaagctg 2761 aaacctgagg gatcgtgcag agccaggcac gagaatgtag caactagcta gcacttactg 2821 agcacctact gtgtgccggg 
acttgtcggt cctgcattat ctcctcacag tgaccctgtc 2881 catttagcaa acaagaaatc tcaggcacag ggagtgtgag tgacttgccc agtggccgct 2941 aggaaggaga gcatctggga cttgaaccca agccctcact ccacctctag acgttcctgc 3001 tcccagtctg tggaagatgc cgcccattcc caagaagacc tcttcttctg cctggccaaa 3061 gagggcccag gttcaggtca actgaccagt tctctccctg cccagggacc atcgctaacg 3121 acgactgtga gctgtgtgtg aacgttgcgt gtaccggctg cctctgagat agccctgggt 3181 accctgagcc caccagggac acctcgccct tcagcccacc accctggcag gcttccatcc 3241 ccgtccatgc tcaagatggg tccctggcca ccatggtcat caccaccctt ccagggcctg 3301 agcagctgga tctggtacaa agcaatcgga catagagttg gagggggagg cccctgaggc 3361 agcccagctc ctgaataaag attctacaac acacgagtcc acgtgtcctt tgttcatccc 3421 caggagccat gggaggagct tctggagaag aagtgtggat tgaggagaaa gacgggacta 3481 aaaataccag gcaggaattt tcctgaagtt ttcaaggccc ggggagttga ttgaacccca 3541 tcccgaatat gagggagttg aggctcggag caggaatgag gcatccagga tcacagagct // LOCUS HSGCSFG 2960 bp DNA PRI 24-APR-1993 DEFINITION Human gene for granulocyte colony-stimulating factor (G-CSF). ACCESSION X03656 NID g31687 KEYWORDS colony stimulating factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2960) AUTHORS Nagata,S., Tsuchiya,M., Asano,S., Yamamoto,O., Hirata,Y., Kubota,N., Oheda,M., Nomura,H. and Yamazaki,T. TITLE The chromosomal gene structure and two mRNAs for human granulocyte colony-stimulating factor JOURNAL EMBO J. 5 (3), 575-581 (1986) MEDLINE 86220137 COMMENT Data kindly reviewed (19-JUN-1986) by S. Nagata. 
FEATURES Location/Qualifiers source 1..2960 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 296..301 exon 329..403 /number=1 mRNA join(329..403,580..743,1122..1229,1374..1520,1685..2702) prim_transcript 329..2702 CDS join(364..403,580..743,1122..1229,1374..1520,1685..1849) /codon_start=1 /product="G-CSF protein" /db_xref="PID:g296647" /db_xref="SWISS-PROT:P09919" /translation="MAGPATQSPMKLMALQLLLWHSALWTVQEATPLGPASSLPQSFL LKCLEQVRKIQGDGAALQEKLVSECATYKLCHPEELVLLGHSLGIPWAPLSSCPSQAL QLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIWQQMEELGMA PALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLAQP" sig_peptide join(364..403,580..629) intron 404..579 /number=1 exon 580..743 /number=2 mat_peptide join(630..743,1122..1229,1374..1520,1685..1846) /product="mature G-CSF protein" intron 744..1121 /number=2 exon 1122..1229 /number=3 intron 1230..1373 /number=3 exon 1374..1520 /number=4 intron 1521..1684 /number=4 exon 1685..2702 /number=5 polyA_signal 2685..2690 polyA_site 2702 BASE COUNT 599 a 839 c 917 g 605 t ORIGIN 1 ctgccgcttc caggcgtcta tcagcggctc agcctttgtt cagctgttct gttcaaacac 61 tctggggcca ttcaggcctg ggtggggcag cgggaggaag ggagtttgag gggggcaagg 121 cgacgtcaaa ggaggatcag agattccaca atttcacaaa actttcgcaa acagcttttt 181 gttccaaccc ccctgcattg tcttggacac caaatttgca taaatcctgg gaagttatta 241 ctaagcctta gtcgtggccc caggtaattt cctcccaggc ctccatgggg ttatgtataa 301 agggccccct agagctgggc cccaaaacag cccggagcct gcagcccagc cccacccaga 361 cccatggctg gacctgccac ccagagcccc atgaagctga tgggtgagtg tcttggccca 421 ggatgggaga gccgcctgcc ctggcatggg agggaggctg gtgtgacaga ggggctgggg 481 atccccgttc tgggaatggg gattaaaggc acccagtgtc cccgagaggg cctcaggtgg 541 tagggaacag catgtctcct gagcccgctc tgtccccagc cctgcagctg ctgctgtggc 601 acagtgcact ctggacagtg caggaagcca cccccctggg ccctgccagc tccctgcccc 661 agagcttcct gctcaagtgc ttagagcaag tgaggaagat ccagggcgat ggcgcagcgc 721 tccaggagaa gctggtgagt gaggtgggtg agagggctgt ggagggaagc ccggtgggga 781 gagctaaggg ggatggaact gcagggccaa catcctctgg aagggacatg 
ggagaatatt 841 aggagcagtg gagctgggga aggctgggaa gggacttggg gaggaggacc ttggtgggga 901 cagtgctcgg gagggctggc tgggatggga gtggaggcat cacattcagg agaaagggca 961 agggcccctg tgagatcaga gagtgggggt gcagggcaga gaggaactga acagcctggc 1021 aggacatgga gggaggggaa agaccagaga gtcggggagg acccgggaag gagcggcgac 1081 ccggccacgg cgagtctcac tcagcatcct tccatcccca gtgtgccacc tacaagctgt 1141 gccaccccga ggagctggtg ctgctcggac actctctggg catcccctgg gctcccctga 1201 gcagctgccc cagccaggcc ctgcagctgg tgagtgtcag gaaaggataa ggctaatgag 1261 gagggggaag gagaggagga acacccatgg gctcccccat gtctccaggt tccaagctgg 1321 gggcctgacg tatctcaggc agcaccccct aactcttccg ctctgtctca caggcaggct 1381 gcttgagcca actccatagc ggccttttcc tctaccaggg gctcctgcag gccctggaag 1441 ggatctcccc cgagttgggt cccaccttgg acacactgca gctggacgtc gccgactttg 1501 ccaccaccat ctggcagcag gtgagccttg ttgggcaggg tggccaaggt cgtgctggca 1561 ttctgggcac cacagccggg cctgtgtatg ggccctgtcc atgctgtcag cccccagcat 1621 ttcctcattt gtaataacgc ccactcagaa gggcccaacc actgatcaca gctttccccc 1681 acagatggaa gaactgggaa tggcccctgc cctgcagccc acccagggtg ccatgccggc 1741 cttcgcctct gctttccagc gccgggcagg aggggtcctg gttgcctccc atctgcagag 1801 cttcctggag gtgtcgtacc gcgttctacg ccaccttgcc cagccctgag ccaagccctc 1861 cccatcccat gtatttatct ctatttaata tttatgtcta tttaagcctc atatttaaag 1921 acagggaaga gcagaacgga gccccaggcc tctgtgtcct tccctgcatt tctgagtttc 1981 attctcctgc ctgtagcagt gagaaaaagc tcctgtcctc ccatcccctg gactgggagg 2041 tagataggta aataccaagt atttattact atgactgctc cccagccctg gctctgcaat 2101 gggcactggg atgagccgct gtgagcccct ggtcctgagg gtccccacct gggacccttg 2161 agagtatcag gtctcccacg tgggagacaa gaaatccctg tttaatattt aaacagcagt 2221 gttccccatc tgggtccttg cacccctcac tctggcctca gccgactgca cagcggcccc 2281 tgcatcccct tggctgtgag gcccctggac aagcagaggt ggccagagct gggaggcatg 2341 gccctggggt cccacgaatt tgctggggaa tctcgttttt cttcttaaga cttttgggac 2401 atggtttgac tcccgaacat caccgacgtg tctcctgttt ttctgggtgg cctcgggaca 2461 cctgccctgc ccccacgagg gtcaggactg tgactctttt tagggccagg caggtgcctg 
2521 gacatttgcc ttgctggatg gggactgggg atgtgggagg gagcagacag gaggaatcat 2581 gtcaggcctg tgtgtgaaag gaagctccac tgtcaccctc cacctcttca ccccccactc 2641 accagtgtcc cctccactgt cacattgtaa ctgaacttca ggataataaa gtgtttgcct 2701 ccagtcacgt ccttcctcct tcttgagtcc agctggtgcc tggccagggg ctggggaggt 2761 ggctgaaggg tgggagaggc cagagggagg tcggggagga ggtctgggga ggaggtccag 2821 ggaggaggag gaaagttctc aagttcgtct gacattcatt ccgttagcac atatttatct 2881 gagcacctac tctgtgcaga cgctgggcta agtgctgggg acacagcagg gaacaaggca 2941 gacatggaat ctgcactcga // LOCUS HSGEBCMA 3802 bp DNA PRI 25-JUN-1997 DEFINITION Homo sapiens gene for BCMA peptide. ACCESSION Z29574 NID g471244 KEYWORDS BCMA peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3802) AUTHORS Laabi,Y., Gras,M.P., Brouet,J.C., Berger,R., Larsen,C.J. and Tsapis,A. TITLE The BCMA gene, preferentially expressed during B lymphoid maturation, is bidirectionally transcribed JOURNAL Nucleic Acids Res. 22 (7), 1147-1154 (1994) MEDLINE 94218235 REFERENCE 2 (bases 1 to 3802) AUTHORS Tsapis,A. 
TITLE Direct Submission JOURNAL Submitted (24-JAN-1994) Andreas Tsapis, inserm U301, Institut de Genetique Moleculaire, 27, rue Juliette Dodu, Paris, 75010, France FEATURES Location/Qualifiers source 1..3802 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="germ line" /tissue_type="placenta" /chromosome="16" /map="p13.1" /germline TATA_signal 190..196 exon 259..606 /label=exon1 CDS join(477..606,1349..1495,2715..2992) /codon_start=1 /product="BCMA peptide" /db_xref="PID:g471245" /db_xref="SWISS-PROT:Q02223" /translation="MLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNAS VTNSVKGTNAILWTCLGLSLIISLAVFVLMFLLRKISSEPLKDEFKNTGSGLLGMANI DLEKSRTGDEIILPRGLEYTVEECTCEDCIKSKPKVDSDHCFPLPAMEEGATILVTTK TNDYCKSLPAALSATEIEKSISAR" intron 607..1348 /cons_splice=(5'site:YES,3'site:YES) exon 1349..1495 /label=exon2 /product="BCMA peptide" intron 1496..2714 /cons_splice=(5'site:YES,3'site:YES) exon 2715..3214 /label=exon3 /product="BCMA peptide" polyA_site 3124 /evidence=experimental polyA_signal 3192..3197 polyA_site 3193 /evidence=experimental BASE COUNT 1042 a 765 c 822 g 1159 t 14 others ORIGIN 1 cccctgcagc tggcctcaat gttaagatct taaggggcac agcacagacc ttgtcttgtc 61 tatctccctg gcacctctca cagagtacag cttattaaca gatgttccag aaactgttac 121 tgaacaatcg gcttgatgct gtgggcttgt ctgcatcttg caactgtcac ctggctgaga 181 aatttcttct ataaataagc agtttctgtt tcagatgtga tatgccctga tatttacacc 241 ctgtctctta ccccatccaa gactcaaact tagaaacttg aattagatgt ggtattcaaa 301 tccttacgtg ccgcgaagac acagacagcc cccgtaagaa cccacgaagc aggcgaagtt 361 cattgttctc aacattctag ctgctcttgc tgcatttgct ctggaattct tgtagagata 421 ttacttgtcc ttccaggctg ttctttctgt agctcccttg ttttcttttt gtgatcatgt 481 tgcagatggc tgggcagtgc tcccaaaatg aatattttga cagtttgttg catgcttgca 541 taccttgtca acttcgatgt tcttctaata ctcctcctct aacatgtcag cgttattgta 601 atgcaagtaa gtaatattgc ttgaacgatt attcattggt gtgaactatt ctgtctatat 661 ggactgctta ttcagagaat caacataatg ggcatgatgg tgagttttct tgaatcaaaa 721 agagaaagga agcaaggcag tgattttaat gtttatggaa acaaagtaat 
tatttggaac 781 tgaacttgat atgattcagc actattagca acatagattt tttttaaaaa tcagctcttc 841 taattaagtg atatttagaa tttaaaagtc aatgttcatt aattaaggtg attgaatgga 901 aataatccat acttgattat tttgctatca aaacaatcca taattcatta tttttagcaa 961 aataatcaag tatgacagcc gggtgcggtg gatggctcac acctgtaatc ccagcacttt 1021 gggaggccga gatgggtgaa tcacctgagg tcggcagttt gagnncagcc tggccaacct 1081 ggtgaaaccc tgtctctact aaaaatacaa aaaattagct gggcatggtg gcacaggtct 1141 gtaatcccag ctactcggga ggctgaggca ggagaatcgt ttgaacttgg gaggtggagg 1201 ttgcagtgag ccgagatcgc gccactgcaa ctctagcctn ggcaacagag caagactttg 1261 tctcaaaata aataaataaa taataacaat aaagtatgtg aatattatgt tatcagctca 1321 ttatctgtct gatgttcttt tcataaaggt gtgaccaatt cagtgaaagg aacgaatgcg 1381 attctctgga cctgtttggg actgagctta ataatttctt tggcagtttt cgtgctaatg 1441 tttttgctaa ggaagataag ctctgaacca ttaaaggacg agtttaaaaa cacaggttgg 1501 tttgatggtg aatctttgaa atctatttcc aggggatggc tattgtgagt ttcagttcct 1561 tttctttttt tagcgttgac tatttcactt cgttacagcc ctttcgaatg tgttagaaca 1621 ttgttacatt aaatgaactt ggtagaggtg agccatcttc attctgattt tgacaccttg 1681 gcagattttc tacaatgtca gtcttctcca ggattcttcc actgttaatt actctattga 1741 aagtactaag gctttcttgg gaaacatcag tctctttgac taaagttagc acatcgatta 1801 aatgccacat tatcagaaac tcctagccag gtctgctact gtcaggaaaa gcatatttgt 1861 cnagatctat ggtcakgntt ttatacaaat ataggtgtnn yttgcntgag tgaacanttt 1921 actactgnna aaatgttaga aatgaataac cagttgctcc tgaattattt gaggaatcat 1981 ctaaaaaata attattttta agcaatagag aaccagtccc agaaaaatga atgttctact 2041 taagtgcctc ttaagataaa aaatacttct gcagcacctt tgctcatgat tggattccca 2101 agcatgtaca gccactgccc tatttctgta tgcatttatt tatttattta tttatttatt 2161 tatttagaga tggagtctcg ctctgtggcc caaggctgga gtgcagtggc gtgatctcaa 2221 ctcactgcag cctctgcctc ttgggttcaa acaattctcc catctcagcc tcctgagtaa 2281 ctggactata ggtatgtgcc atcacctccg actaattttt gtactttttg gtagagacag 2341 ggtttcatca tgttggccag gatggtctca agctcctgac cacaagtgat ctgcncgcct 2401 cagcttccca aagtgctggg attacaggcg tgagccacag ggcccagcac atactcattc 2461 
ttttttactg aaaagatctg tttcaagctg ggtgttggtg gctatggagc tgtagtccga 2521 ctgctctgta ggctaacgtg ggaggattgc ttgagcccag agtttgaatg cagcctgggc 2581 aacacagtaa gaccccacct ctaaaaaatg aaaaaatctc tctcacattg ctttgagtcc 2641 cgatgtgtac tgctaagact ctcatgacca cattctctgt gaagtttggg ttaagttccg 2701 ttctacataa ttaggatcag gtctcctggg catggctaac attgacctgg aaaagagcag 2761 gactggtgat gaaattattc ttccgagagg cctcgagtac acggtggaag aatgcacctg 2821 tgaagactgc atcaagagca aaccgaaggt cgactctgac cattgctttc cactcccagc 2881 tatggaggaa ggcgcaacca ttcttgtcac cacgaaaacg aatgactatt gcaagagcct 2941 gccagctgct ttgagtgcta cggagataga gaaatcaatt tctgctaggt aattaaccat 3001 ttcgactcga gcagtgccac tttaaaaatc ttttgtcaga atagatgatg tgtcagatct 3061 ctttaggatg actgtatttt tcagttgccg atacagcttt ttgtcctcta actgtggaaa 3121 ctctttatgt tagatatatt tctctaggtt actgttggga gcttaatggt agaaacttcc 3181 ttggtttcat gattaaagtc tttttttttc ctgacatcta agtttttatt aacgtgagtt 3241 tttaaaaaca agcatgtata ccagtgtggg gggtgagggt gggagagaaa ggtgggaggg 3301 ggaaagaatt ctaacctatt gataataaag ctccagtttt ggccaggcgc ggtgctcatg 3361 cctgtaatcc cagcactttg aaaggccgag gcgggcagat tacctgaggt caggaatttg 3421 agaccagcct ggccaacatg gtgaaaccct gtctttacta aaaatacaaa aattagctgg 3481 gcatggtggt aggcacctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 3541 gaacctggga ggtggaggtt gcaatgagct gagatagcat ccctgcactc cagcctgggc 3601 aagagggtga gactccgtct caaaacaaaa caacacaaac aaacaaaaag tacctccagc 3661 ttcatcttct gctggatttt atagcgcccc caaagatatg tggtccttaa aaattgtata 3721 ccacttattc aggagtcttg ttcctgaaag ggttgttctt gttacagccc tagtctgggc 3781 tgtaatcagc ttcttaaggt cc // LOCUS HSGGL2 1628 bp DNA PRI 15-NOV-1994 DEFINITION Human a gamma-globin gene. ACCESSION V00513 NID g31728 KEYWORDS gamma-globin; germ line; globin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1628) AUTHORS Slightom,J.L., Blechl,A.E. and Smithies,O. 
TITLE Human fetal G gamma- and A gamma-globin genes: complete nucleotide sequences suggest that DNA can be exchanged between these duplicated genes JOURNAL Cell 21 (3), 627-638 (1980) MEDLINE 81064665 COMMENT KST HSA.AGAMGLOBIN.GL.1 [1628]. FEATURES Location/Qualifiers source 1..1628 /organism="Homo sapiens" /db_xref="taxon:9606" misc_RNA 57 /note="capped by 7mGppp" exon 57..201 /number=1 prim_transcript 57..1628 CDS join(110..201,324..546,1413..1541) /codon_start=1 /product="gamma globin" /db_xref="PID:g31729" /db_xref="SWISS-PROT:P02096" /translation="MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFD SFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPE NFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTAVASALSSRYH" intron 202..323 /number=1 exon 324..546 /number=2 intron 547..1412 /number=2 exon 1413..1628 /number=3 polyA_site 1628 BASE COUNT 429 a 332 c 438 g 429 t ORIGIN 1 ggccggcggc tggctaggga tgaagaataa aaggaagcac ccttcagcag ttccacacac 61 tcgcttctgg aacgtctgag attatcaata agctcctagt ccagacgcca tgggtcattt 121 cacagaggag gacaaggcta ctatcacaag cctgtggggc aaggtgaatg tggaagatgc 181 tggaggagaa accctgggaa ggtaggctct ggtgaccagg acaagggagg gaaggaagga 241 ccctgtgcct ggcaaaagtc caggtcgctt ctcaggattt gtggcacctt ctgactgtca 301 aactgttctt gtcaatctca caggctcctg gttgtctacc catggaccca gaggttcttt 361 gacagctttg gcaacctgtc ctctgcctct gccatcatgg gcaaccccaa agtcaaggca 421 catggcaaga aggtgctgac ttccttggga gatgccataa agcacctgga tgatctcaag 481 ggcacctttg cccagctgag tgaactgcac tgtgacaagc tgcatgtgga tcctgagaac 541 ttcaaggtga gtccaggaga tgtttcagca ctgttgcctt tagtctcgag gcaacttaga 601 caactgagta ttgatctgag cacagcaggg tgtgagctgt ttgaagatac tggggttggg 661 agtgaagaaa ctgcagagga ctaactgggc tgagacccag tggcaatgtt ttagggccta 721 aggagtgcct ctgaaaatct agatggacaa ctttgacttt gagaaaagag aggtggaaat 781 gaggaaaatg acttttcttt attagatttc ggtagaaaga actttcacct ttcccctatt 841 tttgttattc gttttaaaac atctatctgg aggcaggaca agtatggtcg ttaaaaagat 901 gcaggcagaa ggcatatatt ggctcagtca aagtggggaa ctttggtggc caaacataca 961 
ttgctaaggc tattcctata tcagctggac acatataaaa tgctgctaat gcttcattac 1021 aaacttatat cctttaattc cagatggggg caaagtatgt ccaggggtga ggaacaattg 1081 aaacatttgg gctggagtag attttgaaag tcagctctgt gtgtgtgtgt gtgtgtgtgt 1141 gtgtcagcgt gtgtttcttt taacgtcttc agcctacaac atacagggtt catggtggga 1201 agaagatagc aagatttaaa ttatggccag tgactagtgc ttgaagggga acaactacct 1261 gcatttaatg ggaaggcaaa atctcaggct ttgagggaag ttaacatagg cttgattctg 1321 ggtggaagct gggtgtgtag ttatctggag gccaggctgg agctctcagc tcactatggg 1381 ttcatcttta ttgtctcctt tcatctcaac agctcctggg aaatgtgctg gtgaccgttt 1441 tggcaatcca tttcggcaaa gaattcaccc ctgaggtgca ggcttcctgg cagaagatgg 1501 tgactgcagt ggccagtgcc ctgtcctcca gataccactg agcctcttgc ccatgattca 1561 gagctttcaa ggataggctt tattctgcaa gcaatacaaa taataaatct attctgctga 1621 gagatcac // LOCUS HSGLA 12436 bp DNA PRI 25-JUN-1997 DEFINITION Human GLA gene for alpha-D-galactosidase A (EC 3.2.1.22). ACCESSION X14448 NID g31755 KEYWORDS alpha-D-galactosidase A; Alu repetitive sequence; galactosidase; GLA gene; glycoprotein; hydrolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12436) AUTHORS Bishop,D.F. TITLE Direct Submission JOURNAL Submitted (25-FEB-1989) Bishop D.F., Mount Sinai School of Medicine, Division of Medical and Molecular Genetics, Box 1203, 100th St and Fifth Avenue, New York, NY 10029, U S A REFERENCE 2 (bases 1 to 12436) AUTHORS Kornreich,R., Desnick,R.J. and Bishop,D.F. TITLE Nucleotide sequence of the human alpha-galactosidase A gene JOURNAL Nucleic Acids Res. 17 (8), 3301-3302 (1989) MEDLINE 89263745 COMMENT library=lambda EMBL3; clone=L-W2, L-B5 and L-B18; Data kindly reviewed (16-June-1989) by Bishop D.F. 
FEATURES Location/Qualifiers source 1..12436 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="lymphoblast" /chromosome="X" /map="q21.33" repeat_unit 848..857 /note="direct repeat 1" /rpt_type=DIRECT repeat_unit 864..873 /note="direct repeat 1" /rpt_type=DIRECT repeat_unit 881..890 /note="direct repeat 1" /rpt_type=DIRECT repeat_unit 897..906 /note="direct repeat 1" /rpt_type=DIRECT TATA_signal 1089..1094 prim_transcript 1120..11266 exon 1120..1373 /number=1 mRNA join(1120..1373,5094..5268,7269..7446,8321..8412, 10131..10292,10510..10707,10978..11266) CDS join(1180..1373,5094..5268,7269..7446,8321..8412, 10131..10292,10510..10707,10978..11268) /EC_number="3.2.1.22" /codon_start=1 /product="alpha-D-galactosidase A" /db_xref="PID:g31756" /db_xref="SWISS-PROT:P06280" /translation="MQLRNPELHLGCALALRFLALVSWDIPGARALDNGLARTPTMGW LHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCIDDCWMAPQRDS EGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFA DWGVDLLKFDGCYCDSLENLADGYKHMSLALNRTGRSIVYSCEWPLYMWPFQKPNYTE IRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWNDPDMLVIGNFGLS WNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPLGKQGYQLRQG DNFEVWERPLSGLAWAVAMINRQEIGGPRSYTIAVASLGKGVACNPACFITQLLPVKR KLGFYEWTSRLRSHINPTGTVLLQLENTMQMSLKDLL" sig_peptide 1180..1272 mat_peptide join(1273..1373,5094..5268,7269..7446,8321..8412, 10131..10292,10510..10707,10978..11265) /EC_number="3.2.1.22" /product="alpha-D-galactosidase A" intron 1374..5093 /number=1 repeat_region 2851..3150 /note="Alu repetitive sequence" repeat_region 3155..3443 /note="Alu repetitive sequence" repeat_region 3828..4125 /note="Alu repetitive sequence" misc_feature 4211..4512 /note="Alu repetitive sequence" exon 5094..5268 /number=2 intron 5269..7268 /number=2 repeat_region 5351..5644 /note="Alu repetitive sequence" repeat_region 5759..6060 /note="Alu repetitive sequence" repeat_region 6306..6595 /note="Alu repetitive sequence" exon 7269..7446 /number=3 misc_feature 7317..7322 /note="pot. 
N-linked glycosoylation site" intron 7447..8320 /number=3 misc_feature 7945..8229 /note="Alu repetitive sequence" exon 8321..8412 /number=4 misc_feature 8350..8355 /note="pot. N-linked glycosylation site" intron 8413..10130 /number=4 repeat_region 9508..9793 /note="Alu repetitive sequence" exon 10131..10292 /number=5 misc_feature 10137..10142 /note="pot. N-linked glycosylation site" intron 10293..10509 /number=5 exon 10510..10707 /number=6 intron 10708..10977 /number=6 exon 10978..11266 /number=7 misc_feature 11203..11208 /note="pot. N-linked glycosylation site" misc_feature 11211..11215 /note="snRNP binding site" polyA_signal 11250..11255 polyA_site 11266 /note="polyA site" misc_feature 11330..11620 /note="pot. N-linked glycosylation site" misc_feature 12101..12398 /note="pot. N-linked glycosylation site" BASE COUNT 3308 a 2864 c 2649 g 3615 t ORIGIN 1 cccttctgta ggggcagaga ggttctactt cattactgcg tctcctggga aggccatcag 61 gactgctggc taaagtggga accaggactc tttgtgagtt aagaatttgt gtatttatat 121 gtgtgttata cacatttttt aaaaaactgt aacgacatca ggttgagcag tcgtctccgg 181 gtggtgaatt atgtgtattt ttaaatttta tactatattg ttatttttca aatgttcgaa 241 attgaatatg tagattgttg ttatcagcag aaaaataaac attattcaaa tactctattc 301 agtaaagtaa tttattgggc gcctttgtca agcacgcatt tgcctagatg tgactctaca 361 gataaaattc acttggggcc tccccttaca gacaatcagg cagtggagac tgagtgcctg 421 aatggataga ccagcactca gaccactatt ttcagtatct gtttttctta actcagggcc 481 gtggttttca aacgtttttc gccttacggt cacccttagg gtcccccgag accggcccag 541 acagacagat atacaaaaac acatacacag tcatgagcgt ccaccatttc cccaccaggc 601 gcagcacagg cggcttcccg gcactgagat gggggggagg agggagagag cgcgaggggg 661 gaggggaaag cagagaacga aagaggcgga ggcggccccc gaaccccgct ctggtcttca 721 tcatcaccac ccctgggtcc ccagttccca cccacacacc aacctctaac gataccgggt 781 aattttcctc cttcttccct caaacggcta tagcgagacg gtagacgacg accagaacta 841 cttctgctca cgtaagcgag taatcacgtg agcgcctacg tcatgtgaga tctcggtcac 901 gtgagcaact ctcggcttaa actcgggatc actaaggtgc cgcacttcct tctggtatgg 961 aaatagggcg 
ggtcaatatc aagaaaggaa gagggtgatt ggttagcgga acgtcttacg 1021 tgactgatta ttggtctacc tctggggata accgtcccag ttgccagaga aacaataacg 1081 tcattattta ataagtcatc ggtgattggt ccgcccctga ggttaatctt aaaagcccag 1141 gttacccgcg gaaatttatg ctgtccggtc accgtgacaa tgcagctgag gaacccagaa 1201 ctacatctgg gctgcgcgct tgcgcttcgc ttcctggccc tcgtttcctg ggacatccct 1261 ggggctagag cactggacaa tggattggca aggacgccta ccatgggctg gctgcactgg 1321 gagcgcttca tgtgcaacct tgactgccag gaagagccag attcctgcat caggtatcag 1381 atattgggta ctcccttccc tttgcttttc catgtgtttg ggtgtgtttg gggaactgga 1441 gagtctcaac gggaacagtt gagcccgagg gagagctccc ccacccgact ctgctgctgc 1501 ttttttatcc ccagcaaact gtcccgaatc aggactagcc ctaaactttc tctgtgtgac 1561 ctttcctggg atgggagtcc ggccagcggc ccctgtttct ttctctctct ctctctctct 1621 cgttctcctt ctctttctct ttctcttctt tcctctctct ttctctctct ccctgcccgg 1681 ttctcttttt tcactgctcc ttgcagagca gggccacccc ataggcagtg tgcccaaagt 1741 agccctgccc ggttctattc agacccttct tgtgaacttc tgctcttcct ctgccgggtg 1801 ctaaccgtta gaacatctag ggtgggtagg aggaatgggg aactaagatt cgtgccattt 1861 tttctccttt tggggtcgtg gatttctcgg cagtatctcg agggagttag agagaccata 1921 aggtcgctga gatctctccc acctcgccca tgagcgtggc atcaggctgg aaggttgaca 1981 tggaggaact ttatacattt acacctttgc gtgagggttg aggctggatt agataggtat 2041 tgaacatatc tgaccctcac aatccttatc tgtaaattgg gattacaacc ttttaatttc 2101 agggagctga caaaaaaaat ctgaaaaata gttcttatct cacacaggtg agttttcaag 2161 gagataacct atttaaagta catagcacag cgcttgacca ttcaactgcg cttacagagc 2221 aaatgttcaa tgggaaaatg aatgtaaatc tacaaatctg aatgaatatg tgtatttttc 2281 tggagagagg atatttacct ttcttcaaat tctcaaaggg ctctgtgatt taaaaaaggt 2341 taggaatcac tgatagatgt tggtaaaagg tggcagtcac agtacatttc tgtgtccata 2401 agttattcct atgaatatct ttatagataa agtcaggatg ttggtcagac atcacagaag 2461 aaattggcct tgtaagtttc atgtgaccct gtggtacagt atgtgtggca attttgccca 2521 tcacggattt ttttttattg gtatttgcat ctgattataa aactaatgca tgatcattgc 2581 aaaaaatgta gataaagaag agcaaaatga aaataaagat ttccccccac cgttccacca 2641 cccagaaata atcatggttt 
aaatgttaat atacaacctt acaattgttt tctatataaa 2701 tgaaaacata gatttcttta tttcattatt ttccataaaa aatggatcat gtttatgtca 2761 tgtttggcta atggcaagac cctggcaccc agtctgggct caaattctgc ctcattgtta 2821 cttagccctg tgacattggg taaattacac tttttttttt tttttttttt tgagacgggg 2881 tctcgctctg tcgcccaggc tggagtgcag tggcacgatc tcggctcact gcaagtccgc 2941 ctcctgggtt cacgccattc ttctgcctca gcctcccgag tagctgggac tacaggcgcc 3001 tgccaccacg cctggctctt tttttttttt tttttttttt tagtacagac ggggtttcac 3061 catgttagcc agggtggtct caatctcctg acctcgtgat tcgcccgcct cagcctccca 3121 aagtgctggt gtgagccacc gtgcccagcc ttactttttt ttttgagagg gggtctcact 3181 ctgtcaccca ggttggagtg cagtggcgcg atctctgctc agtgcaaact ccacctcccg 3241 ggtttaagca gttctcctgt cgtagtctcc tgagtagctg ggattacagg cacaccacca 3301 cggccagcta atttttgtat tttcagtaga gacgggtttc accatgttgc ccaagctggt 3361 ctcgaactcc tggcctcaag tgatctgccc gccttggcct cccagagtgc tgggattaca 3421 ggtgtgagcc accgcacccg gcctcttttt tcttttttag tctatcatac cttgcaaata 3481 cagtggttct tcctatgtgt tggttttgat atttatgtaa tcaaacacat cagtttttcc 3541 tttctgattt ctgactttgg ggtcatgctg agaaagtcct ttcctacctg aagataatac 3601 agtatatacg tttcttacta gtatttttgt ggatttttaa aatatttaaa tctttagtcc 3661 atctgaactt gttcttctat cagaaatgcc acatttaata aataataagt cccatggtat 3721 cagatggctg gaaggacctc tttcgaaact ttgtttaatt ccattaatct gtgtattctt 3781 attctaatgc taatagttcc acactagctt cctttatctt ttttttcttt tttttttttt 3841 ttttgagctg gagtttcgct cttgttgccc aggctggagt acaatgtcac gatctcggtt 3901 caccgcaacc tccgcctccc aggttcaagc aattctcctg cctcatcctc gcgagtagct 3961 ggaattacag gcatgcgcca ccacgcctag ctattttgta tttttagtag agatggggtt 4021 tctccatgtt ggtcaggctg gtctcaaact cccagcctca ggtgatctgc ctgcctcggc 4081 ctcccaaaat gctgttatta caggcgtgag ccaccacgcc cagccttcat cttttaatga 4141 atgtacatgt atgtaatctt ttaggtgaac tttttgtaat gttgtgccaa gttccttaaa 4201 aagccctttt ggaagctggg caggtggcca cgcctgtaat cccagcattt tgggagtctg 4261 aggcaggtgg atcacttgag gccaggagtt caagactagc ctagccaaaa tgcaaaaccc 4321 tgtctctact aaagatacaa aaattagccg 
gatgcgatgg cacatgcctg taatctcagc 4381 tactcgggag gctgaggtag aagaatcgct tgaaccgggg aggcagaggt tgcagtgagc 4441 aagatggcgc cactgcactc cagcctgggt gacagaggga gactccatct caaaaaaaaa 4501 aaaaaaaaaa aagataaaaa ggaaacctaa gtactcttgg gctttgttaa ggattttgtt 4561 aaatatacaa aggattgcag ggaaaattaa cttattttta atattgagta tgcttatcca 4621 agagcaaaat aatatttctc catttattca aatcatttag gagcatcata gttttaacat 4681 atgggccttg cacgtatctt aaatttatct ctaggcattt taggttgttc agttgttctt 4741 gtgaatggga tctttttctc caaataggat tattgttgat atctgttgat tatgttaact 4801 ttgtagtttc tgactttact gaactgtctt cttagatcta atactctttt caatttcatc 4861 atatatttct cattcctatt ttgtttgggg tttttagggc gggaatatta acgggataag 4921 agagacaaaa gaaaatctgg aaaaacaatt cattttacct tacattgctt gtgattacta 4981 ccacactatt actgggttgg aaaaaattgt gaaatcccaa ggtgcctaat aaatgggagg 5041 tacctaagtg ttcatttaat gaattgtaat gattattgga atttctcttt cagtgagaag 5101 ctcttcatgg agatggcaga gctcatggtc tcagaaggct ggaaggatgc aggttatgag 5161 tacctctgca ttgatgactg ttggatggct ccccaaagag attcagaagg cagacttcag 5221 gcagaccctc agcgctttcc tcatgggatt cgccagctag ctaattatgt gagtttatag 5281 ataatgttct tgttcattca gaggactgta agcacttctg tacagaagct tgtttagaaa 5341 cagccctcat ggccgggcgt ggtggctcac gctgtaatcc caacactttg ggaggccgag 5401 gcgggtggat cacctgaggt caagagttca agaccagcct ggccaacatg gtgaaacccc 5461 aactctatta aaagtacaaa aaattagctg ggcatggtgg tgaacgcctg taaccccagc 5521 tacttgggag gctgaggcag gagaatcgct tgaacccagg aggtggaagt ttcagtgagc 5581 tgagatcacg ccattgcact ctagcctggg caacaaaaga gaaactccat ctcaaaaaaa 5641 aaaacaagga aaaaaagaaa cagccctcat gacacttaga aagtagaata gctggctgtt 5701 atctgaacat tgaattgtaa ggcttatcag gtggactttg cattccatca gcagacaatt 5761 tttttttttt tttttttttg agatggagtc tcattctgtc tcccaggctg gagggcagtg 5821 gtgcgatctc ggctcactgc aagctccacc tcctgggttc atgccattct cctgcctcag 5881 cctcccaagt agctgggacc acaggcaccc gccaccatgc ccagttaatt ttttgtattt 5941 ttagtagaga cggggtttca ccatgttagc caagatggtc tcgatctcct gacctcgtga 6001 tccgcccacc tcggcctccc aaagtgctgg gattacaggc 
atgagccacc gcgcctagcc 6061 tacaaatgtt ttgtaatagc tcttgaggcc catcttggag ttctcctttt gctaaaacca 6121 ctgaactctc taggaggaaa aaggaacttg gttcttgaca tatgtgtgca tgtatttcca 6181 tataaccttt aggaagctat tgcaatggta ctataaacta gaattttaga agatagaagg 6241 aaaatattct ggagatcatt gaagagaaat ggagtccaac actagttaaa gatgatgaag 6301 acagattttt ttttttgacg gagtctcgct ctgtcgccca ggctggagtg cagtggcaca 6361 atctcagctc actgcaaccc tccacctctt gggttcaagt gattctcctg cctcagcctc 6421 ccaagtagct gggactacag gcgcacacca ccacgcccgg ctaatttttg tatttttagt 6481 agagacaagg tttcaccata ttcgccaggc tggtctcgaa ctcctgacct tgtaatccgc 6541 ccaccttggc ctcccaaagt gctgggatta caggcatgag ccaccacgcc cggccgatga 6601 agacagattt tattcagtac taccacagta gaggaaagag ccaagttcaa ttccaaatac 6661 aacaaagaca ggtggagatt tatagccaat gagcagattg agggggtcag tggatggaat 6721 atttaagaag acatcaaggg tagggagctt cttgctaaag cttcatgtac ttaaacaaga 6781 agggtggggg atgagggaaa ttgatcagat atcaatggtg gcagtattga cttagcagga 6841 ttcttgctaa gaggtcttgc taggacagac ataggaagcc aaggtggagg tctagtcgaa 6901 aagaaggctc atcagagaag tctaactaaa gtttggtcaa gaagagtctt tgtcaaggta 6961 aatctatcat ttccctcaaa aggtaatttt caggatccca tcaggaagat tagcatggct 7021 gctagctttc tcctcagttc tgggctatag ctcacatgcc tagtttgaac tagctcagca 7081 gaactggggg atttattctt tgtcttccaa caaactcatc tggatgattt tgggggtttg 7141 tggggaaaag cccccaatac ctggtgaagt aaccttgtct cttcccccag cctggaatgg 7201 ttctctcttt ctgctacctc acgattgtgc ttctacaatg gtgactcttt tcctccctct 7261 catttcaggt tcacagcaaa ggactgaagc tagggattta tgcagatgtt ggaaataaaa 7321 cctgcgcagg cttccctggg agttttggat actacgacat tgatgcccag acctttgctg 7381 actggggagt agatctgcta aaatttgatg gttgttactg tgacagtttg gaaaatttgg 7441 cagatggtaa tgtttcattc cagagattta gccacaaagg aaagaacttt gaggccatgg 7501 tagctgagcc aaagaaccaa tcttcagaat tttaaatacc ctgtcacaat actggaaata 7561 attattctcc atgtgccaga gctcccatct cttctctttc agttcattaa ttaattaatt 7621 aattcatgta aaatccatgc atacctaacc atagctaata ttgtgcactt ataattcaag 7681 agggctctaa gagttaatta gtaattgtaa ctctctataa catcatttag 
gggagtccag 7741 gttgtcaatc ggtcacagag aaagaagcat cttcattcct gcctttcctc aatatacaca 7801 ccatctctgc actacttcct cagaacaatc ccagcagtct gggaggtact ttacacaatt 7861 taagcacaga gcaactgcct gtccctgctg ctagtttaaa catgaacctt ccaggtagcc 7921 tcttcttaaa atatacagcc ccagctgggc atgatggctc atgcctgtaa tcctagcact 7981 ttgggaggct gaggcgggtg gattacttga ggtcaggagt tcgagaccac cctggccaac 8041 atggtgaaac cccatctcta gtaaaaatac aaaaattagc tgactttggt ggcacatgcc 8101 tgtaatccca gctacttggg aagctgagac agaagagtca cttgaacctg ggaaacagag 8161 gttgcagtga gccaagatcg caccactgca ctccaccctg gatgacagac tgaaccccat 8221 ctcaaaaaat taaaataaaa taaaataaaa taactatata tatagcccca gctggaaatt 8281 catttctttc ccttatttta cccattgttt tctcatacag gttataagca catgtccttg 8341 gccctgaata ggactggcag aagcattgtg tactcctgtg agtggcctct ttatatgtgg 8401 ccctttcaaa aggtgagata gtgagcccag aatccaatag aactgtactg atagatagaa 8461 cttgacaaca aaggaaacca aggtctcctt caaagtccaa cgttacttac tatcatccta 8521 ccatctctcc caggttccaa ccacttctca ccatccccac tgctgtaatt atagcctaag 8581 ctaccatcac ctggaaagtc atccttgtgt cttccccttt atttcaccat tcatgtcctg 8641 tctatcaaca gtccttccac cagtatctct aaaatatctc ctgaatcagc ccacttcctt 8701 ccatcttcac tacatgcacc ctggccttcc aagctactat cggctctcaa ccagactgct 8761 gggaccacct gatctctctg cttccactct gtctcaaccc ccatctattt tccaagcagc 8821 actagagtta tcatattaaa atgtaaatat cagttttttt tttaaagaaa aaaaccctga 8881 gacttaacag agttataaaa aatataaatg tcatcatcag ttccctgctt aaaaccctta 8941 actcgcttcc aattgcactt ggaatgaaac caaactgcac tgatccagcc cttgcctgcc 9001 tccccaaagt ccaaggggtc atggctcttt ccctggctac actggttttc tttctgtccc 9061 tcaacactgc aagcctattg ctgccccagg gcctttacac ttgctttttt tctgcctaga 9121 acagttcttc cccaaagatt tttaaagggc cgggctcctt aacattgaag tcgcagacca 9181 aacgccacat atgcagacag ttcttctcta actactttaa aatagccctc tgtccattca 9241 ttcttcatca cattaacctg tttaattttc ttctcagagc tccacactat ttggaagtat 9301 ttgttgactt gttaccatgt ctccccacta gagtgtaagt ttcatgaggg cagggacctt 9361 gtctgacttt gactgtatct ctcgcatatg gttaagtgtt aaatagttat ttatggaatg 
9421 aatccctatt attccctcat tatctctgca aaatagtctt ttttctcaac atcttaaacc 9481 tgatatccca cctgcctatc tacaaacttt ttttttgcga cagagtctca ctgtcaccca 9541 ggctagagtg cagtggcgcc atctcggctc actgcaacct ccgcctcccg ggtttaagcg 9601 attctcttgc ctcagcctcc cagtagctgg gattataggc gtgcgctacc acatctggct 9661 aatttttgta tttttagtag agatggtttc accatgttgg ccaggcttgt ctcgaactcc 9721 tgacctcaga tgatccacct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc 9781 accgtgccca gcctctacaa actttttatt ccattaacaa actatatgct gggatttaag 9841 ttttcttaat acttgatgga gtcctatgta attttcgagc ttttaatttt actaagacca 9901 ttttagttct gattatagaa gtaaattaac tttaagggat ttcaagttat atggcctact 9961 tctgaagcaa acttcttaca gtgaaaattc attataaggg tttagacctc cttatggaga 10021 cgttcaatct gtaaactcaa gagaaggcta caagtgcctc ctttaaactg ttttcatctc 10081 acaaggatgt tagtagaaag taaacagaag agtcatatct gttttcacag cccaattata 10141 cagaaatccg acagtactgc aatcactggc gaaattttgc tgacattgat gattcctgga 10201 aaagtataaa gagtatcttg gactggacat cttttaacca ggagagaatt gttgatgttg 10261 ctggaccagg gggttggaat gacccagata tggtaaaaac ttgagccctc cttgttcaag 10321 accctgcggt aggcttgttt cctattttga cattcaaggt aaatacaggt aaagttcctg 10381 ggaggaggct ttatgtgaga gtacttagag caggatgctg tggaaagtgg tttctccata 10441 tgggtcatct aggtaacttt aagaatgttt cctcctctct tgtttgaatt atttcattct 10501 ttttctcagt tagtgattgg caactttggc ctcagctgga atcagcaagt aactcagatg 10561 gccctctggg ctatcatggc tgctccttta ttcatgtcta atgacctccg acacatcagc 10621 cctcaagcca aagctctcct tcaggataag gacgtaattg ccatcaatca ggaccccttg 10681 ggcaagcaag ggtaccagct tagacaggta aataagagta tatattttaa gatggcttta 10741 tatacccaat accaactttg tcttgggcct aaatctattt ttttcccttg ctcttgatgt 10801 tactatcagt aataaagctt cttgctagaa acattacttt atttccaaaa taatgctaca 10861 ggatcatttt aatttttcct acaagtgctt gatagttctg acattaagaa tgaatgccaa 10921 actaacaggg ccacttatca ctagttgcta agcaaccaca ctttcttggt ttttcaggga 10981 gacaactttg aagtgtggga acgacctctc tcaggcttag cctgggctgt agctatgata 11041 aaccggcagg agattggtgg acctcgctct tataccatcg cagttgcttc 
cctgggtaaa 11101 ggagtggcct gtaatcctgc ctgcttcatc acacagctcc tccctgtgaa aaggaagcta 11161 gggttctatg aatggacttc aaggttaaga agtcacataa atcccacagg cactgttttg 11221 cttcagctag aaaatacaat gcagatgtca ttaaaagact tactttaaaa tgtttatttt 11281 attgccaact actacttcct gtccaccttt ttctccattc actttaaaag ctcaaggcta 11341 ggtggctcat gcctgtaatc ccagcacttt gggaggctga ggcgggcaga tcacctgagg 11401 tcgggacttt gagacccgcc tggacaacat ggtgaaaccc catttctaat aaaaatataa 11461 aaattagcca ggtgtggtgg cgcacctgtg gtcccagcta ctctgggggc tgaggcatga 11521 gaatcgcttg aacccgggag tggaggttgc attgagctga gatcatgcca cctcactcca 11581 gcctgggcaa caaagattcc atctcaaaaa aaaaaaaaaa gccaggcaca gtggctcatg 11641 cctggaatcc cagcactttt ggaagctgag gcaggcagat cacttgaggt taggatttca 11701 agaccagcct ggctaacata gtaaagccct gtctctacta aaaatacaaa aattagccag 11761 gtatggtggc gagcttctgt agccccagct actcaggaga ctgaggcagg agaatcactt 11821 gaacccggga agtggggggg tgcagtgacc caagatcacg ccactgcatt ccagcctggg 11881 caacagagca agactccatc tcaaaaaaaa aagttctatt tccttgaata aaattttccg 11941 aagtttaaac tttaggaata aaactattaa acccgtattt actcatccag atacccaccc 12001 cccttgttga gattctctcc caattatcaa aatgtgtagc atatttaact accaagagct 12061 aaacatcatt aagactgaaa tgtattaaga aggatgtata ggccaggcac ggtgtctcac 12121 gcctgtaatc ccaacacttt gggaggccaa gtcgggcgga tcacgaggtc aggagatgga 12181 gaccatcctg gccaacatgg tgaaaccccc tctctactaa aaatacaaaa attagccagg 12241 caggtggcag gcacctgtaa tcccagctac tccagaggct gaggcaggac aatcacttga 12301 acctgggagg cagaggctgc agtgagctga ggttgtacca attgcactcc agcctaggta 12361 acgagcaaca ctccatctca aaaaaagaaa aaaaaaaaga tgtataattt ggaactgtta 12421 agaggcattt taaaga // LOCUS HSGLTH1 1020 bp DNA PRI 26-JAN-1995 DEFINITION Human theta 1-globin gene. ACCESSION X06482 NID g31775 KEYWORDS globin; theta-1-globin; theta-globin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1020) AUTHORS Hsu,S.L., Marks,J., Shaw,J.P., Tam,M., Higgs,D.R., Shen,C.C. and Shen,C.K. TITLE Structure and expression of the human theta 1 globin gene JOURNAL Nature 331 (6151), 94-96 (1988) MEDLINE 88122547 COMMENT Data kindly reviewed (20-Jan-1989) by Shen C.K.J. FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cosmid Calpha3'Bg" promoter 23..27 /note="CAAT-Box" promoter 57..61 /note="TATA-Box" precursor_RNA 128..972 /note="put. transcript (K562 cells); maj." mRNA 128..375 /note="put. Exon 1" misc_feature 177..182 /note="GGGCGG-Box" misc_feature 190 /note="alternate pot. transcription initiation site; min." misc_feature 242 /note="alt. pot. transcription initiation site; min." CDS join(281..375,460..664,774..902) /codon_start=1 /product="theta 1-globin" /db_xref="PID:g31776" /db_xref="SWISS-PROT:P09105" /translation="MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYF SHLDLSPGSSQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLL GHCLLVTLARHYPGDFSPALQASLDKFLSHVISALVSEYR" intron 376..459 /note="Intron I" mRNA 460..664 /note="Exon 2" intron 665..773 /note="Intron II" mRNA 774..972 /note="Exon 3" misc_feature 952..957 /note="put. polyA signal" polyA_site 972 /note="put. 
polyA site" BASE COUNT 146 a 362 c 357 g 155 t ORIGIN 1 ccactgcact caccgcaccc ggccaatttt tgtgttttta gtagagacta aataccatat 61 agtgaacacc taagacgggg ggccttggat ccagggcgat tcagagggcc ccggtcggag 121 ctgtcggaga ttgagcgcgc gcggtcccgg gatctccgac gaggccctgg acccccgggc 181 ggcgaagctg cggcgcggcg ccccctggag gccgcgggac ccctggccgg tccgcgcagg 241 cgcagcgggg tcgcagggcg cggcgggttc cagcgcgggg atggcgctgt ccgcggagga 301 ccgggcgctg gtgcgcgccc tgtggaagaa gctgggcagc aacgtcggcg tctacacgac 361 agaggccctg gaaaggtgcg gcaggctggg cgcccccgcc cccaggggcc ctccctcccc 421 aagccccccg gacgcgcctc acccacgttc ctctcgcagg accttcctgg ctttccccgc 481 cacgaagacc tacttctccc acctggacct gagccccggc tcctcacaag tcagagccca 541 cggccagaag gtggcggacg cgctgagcct cgccgtggag cgcctggacg acctacccca 601 cgcgctgtcc gcgctgagcc acctgcacgc gtgccagctg cgagtggacc cggccagctt 661 ccaggtgagc ggctgccgtg ctgggcccct gtccccggga gggccccggc ggggtgggtg 721 cggggggcgt gcggggcggg tgcaggcgag tgagccttga gcgctcgccg cagctcctgg 781 gccactgcct gctggtaacc ctcgcccggc actaccccgg agacttcagc cccgcgctgc 841 aggcgtcgct ggacaagttc ctgagccacg ttatctcggc gctggtttcc gagtaccgct 901 gaactgtggg tgggtggccg cgggatcccc aggcgacctt ccccgtgttt gagtaaagcc 961 tctcccagga gcagccttct tgccgtgctc tctcgaggtc aggacgcgag aggaaggcgc // LOCUS HSGLUCG2 10050 bp DNA PRI 20-APR-1995 DEFINITION Human glucagon gene. ACCESSION X03991 NID g31786 KEYWORDS glucagon; glucagon-like peptide; hormone; preproglucagon; preprohormone; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10050) AUTHORS White,J.W. and Saunders,G.F. TITLE Structure of the human glucagon gene JOURNAL Nucleic Acids Res. 14 (12), 4719-4730 (1986) MEDLINE 86259053 COMMENT See also which is derived from the same library. Sequence discrepancies may indicate that the two clones represent different alleles or may be due to cloning artefacts. pos. 3516 in corresponds to pos. 
9 in (the first eight nucleotides of which are probably a cloning artefact). FEATURES Location/Qualifiers source 1..10050 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="fetal liver genomic DNA" /clone="lambdahGCG1" promoter complement(535..538) /note="CAAT-box (complementary strand)" promoter 577..581 /note="TATA-box" exon 603..698 /note="exon 1; untranslated leader (5' UT-region)" intron 699..3665 /note="intron I" exon 3666..3766 /note="exon 2" CDS join(3675..3766,5339..5500,7177..7314,8683..8826, 9481..9487) /note="prepro-glucagon" /codon_start=1 /db_xref="PID:g762941" /db_xref="SWISS-PROT:P01275" /translation="MKSIYFVAGLFVMLVQGSWQRSLQDTEEKSRSFSASQADPLSDP DQMNEDKRHSQGTFTSDYSKYLDSRRAQDFVQWLMNTNRNRNNIAKRHDEFERHAEGT FTSDVSSYLEGQAAKEFIAWLVKGRGRRDFPEEVAIVEELGRRHADGSFSDEMNTILD NLAARDFINWLIQTKITDRK" sig_peptide 3675..3734 /note="signal peptide (AA -20 to -1)" intron 3767..5338 /note="intron II" misc_feature 5339..5396 /note="glicentin related pancreatic peptide (AA 12-30) (5339 is 3rd base in codon)" exon 5339..5500 /note="exon 3" misc_feature 5397..5402 /note="spacer (AA 31-32; 2AA)" misc_feature 5403..5489 /note="glucagon (AA 33-61; 29AA)" misc_feature 5490..5500 /note="spacer (AA 61-65; 4AA) (5500 is 2nd base in codon)" intron 5501..7176 /note="intron III" exon 7177..7314 /note="exon 4" misc_feature 7177..7195 /note="spacer (AA 66-71; 6AA) (7177 is 3rd base in codon)" misc_feature 7196..7306 /note="glucagon-like peptide 1 (AA 72-108; 37AA)" misc_feature 7307..7314 /note="spacer (AA 109-111; 3AA) (7314 is 2nd base in codon)" intron 7315..8682 /note="intron IV" misc_feature 8683..8725 /note="spacer (AA 112-125; 14AA) (8683 is 3rd base in codon)" exon 8683..8826 /note="exon 5" misc_feature 8683..8826 /note="pro-glucagon (AA 112-159) (8683 is 3rd base in codon) (8826 is 2nd base in codon)" misc_feature 8726..8826 /note="glucagon-like peptide 2 (AA 126-159) (8826 is 2nd base in codon)" intron 8827..9480 /note="intron V" misc_feature 9481..9484 
/note="glucagon-like peptide 2 (AA 159-160; 35AA total) (9481 is 3rd base in codon)" misc_feature 9481..9484 /note="pro-glucagon (AA 159-160) (9481 is 3rd base in codon)" exon 9481..>9487 /note="exon 6" misc_feature 9950..9956 /note="pot. polyA signal" misc_feature 9965..9974 /note="region of pot. polyA sites" BASE COUNT 3397 a 1698 c 1746 g 3209 t ORIGIN 1 gaattcattt attaaaacag aacacatagg ggtttaatca atatccttaa attttccaca 61 aacataacat aaataaactc cacgttgtga ggaagagagg atttttaata catatgtgtt 121 gaatgaatga tcattattta gataaatgaa tgactgaagt gattgttata ttcaggtaaa 181 ttcatcatgg ctaggtagca aaccaaagac ttgtaagaac ctcaaatgag gacatgcaca 241 aaacagggat ggccatgggc tacgtaattt caaggtcttt tgtcttcaac gtcaaaattc 301 actttagaga acttaagtga ttttcatgcg tgattgaaag tagaaggtgg atttccaagc 361 tgctctctcc attcccaacc aaaaaaaaaa aaaaaagata caagagtgca taaaaagttt 421 ccaggtctct aaggtctctc acccaatata agcatagaat gcagatgagc aaagtgagtg 481 ggagagggaa gtcatttgta acaaaaactc attatttaca gatgagaaat ttatattgtc 541 agcgtaatat ctgtgaggct aaacagagct ggagagtata taaaagcagt gcgccttggt 601 gcagaagtac agagcttagg acacagagca catcaaaagt tcccaaagag ggcttgctct 661 ctcttcacct gctctgttct acagcacact accagaaggt aagatgatta taaaattgta 721 aatcctgttt ggcggacagt gaagtatttt taagggatca aaatattata aattaaaatg 781 ttgttctttc atcttagact ttattactaa tagtacacag agaggaactg agatggaaaa 841 ggttatatca aatgcattta cgtgtacttt aatatagcga acggcaaagc gagttggaaa 901 aataattata tgcaaaaata taaaacagaa aaaaaacaaa gattcaaatc aatgcacttg 961 ttataatact taactgttat gagagttgta ttttaaaata attggtaaca ttttgaagaa 1021 taaatatttt tcttggactt gatagatctg tatactactt tgaacagaag gcgtctttta 1081 aagtaagcag aaatgtgtcc attagggagc ctataacaga aattgctttt taccaattaa 1141 aatctctggt tttccaaaag agcaaattaa ataacatctt tcaaatattc aaatttcaga 1201 catctatcaa aaaatcaaag tccattaaag caactctttg taaataaaac atagcgtact 1261 tggtgcagag aacaagtgtc atctatttgg ggggttttct ttggatgcat ctgagtgaag 1321 ccatacatta ataactattt actatgtatt gaggataaat agtaataaaa tctaaaatag 1381 ctaccctctg caataaaatt 
ttaatttgtc tttttagatt aggctcctag agacaaaaaa 1441 caaatttaca aagatctttg atggagtaaa tgattaagtg gtgtattttt ccatacatac 1501 tgaagaccta ttacataaaa tgagctaacc agtttgaaaa aggatttata tggactacaa 1561 tgccaacaca tttggggata gaacataatg tcttcttttt taggtctaag gttaatcttt 1621 ctgtaagact tcctacttat tattgttacc atactttcct ttaaatttta gtggaaattt 1681 gtagttctta caaaactgca ggctctcttt agactaccta aagagggggg aaattaactg 1741 gaaattattt tcttattgaa taagatttta atactaaaat aacataaatc tctactttct 1801 cagcaattgt attcaaacaa ttaactctat tctttttttt ttcacaattc gatagacttt 1861 cacaaacaaa taagtgagat ttttaaagcc agttctttct tcagaagaag tcatttattg 1921 ttggtaatat tattagatac ggacgctcac ttctttatgt ggtttatggt tgttctttct 1981 tccatcagct cctgaagtgg caggatacct cctgctctaa atcctcctca atggaacata 2041 ataggagata ttgcaaaacg cttaggactt tgggtggaat gaattttttt ttcagtcttg 2101 tgaaagacta aaagctttta caagaaactt ttcaacagtg gtataatttt taaaagaagt 2161 gcttacttta atatttaaat attcctgcaa gcacggatat aacaattgag agttataatc 2221 agtagatact acctaaatat atggcaaggt ttccaaagtc tgccaaccgg ttaaactgaa 2281 gcatagcaaa atgtcaaatt ggtaatttga tgttgatgaa tgaagaaagg tgaatatatc 2341 attcctatta tgttcctttg tctaatcatc cattcaatgt acaaaaaact taatatttca 2401 aagaaaaaag ttactgctgc aggtattaca gtgtctatgt tttagtgcca gagatagtaa 2461 atacagctaa tatattaagg aaaagattga tagtttaaaa atgataatat ttgtgaaatc 2521 cttattccaa accaaatgtc atttgctaaa aatattcatt caaattattt gtggataagc 2581 atagtcaaac ttggattgaa atgcccgctt ctcaaaggaa acaccattgg gtttcaaagt 2641 agccgagatc catgacaaag acggacttga cgttcccatt cagaaaatag aagcgcttgc 2701 tctgtgggac attctgtccc acattcatgt tggtcccagt ccccgtcgac actgcatcat 2761 cacatcctta acatactatg gcttcgcagg gcatctgctt gtgacacagg gacaagcaat 2821 caggtgtatc ctataaccaa gtctatggtg tcaccagcag tgaatgacat gtcctcatct 2881 aagtccaatt agaaaacaac ctggggactt ttatgacttt agaagcaagc tctaattttt 2941 ttttacatag taatctctta gttttttagt tgtgattgtt cacgttgcca gaagcattag 3001 ttcttgttgc aaattgcttg tctttaaaat gattttttcc ctttaattta gaacaggtca 3061 aagcacccta aaatacctca aatctgttgg 
ccgctttata attacaaagt ctagaccaag 3121 ttacactatt taaaatataa atgaataata catctgaacc gacagctggt atgccaagtt 3181 gtagcctgag gctgctttaa gatggttgaa atgcagtata atgtaaatgc ttggaagtga 3241 aggactatta attgaaagcc agactgatag tgaggggggt ggcatgatgg ataaggtgcc 3301 attcaggcta accatatgta gcatatcatc tacagtgggt attttaaatg ttgattgtgg 3361 acattttaat ttccagtctg gaaatgagcc ttctaattat taaacagaag tgtccctaca 3421 ttaatgaaaa agcttaatgg tggaacacat cttttgtgtc cttttttgcc tccaaaaaat 3481 ctggggaaat aatataaccc actatttaaa attaagctga aaatataatc agaataaaag 3541 tgataacact agctttttcc ttctacttat gatatttatc tagtcaaatc taattaattt 3601 agcctgacat gtttaaaaat ccttgcctgc ccccctcacc ctacccccat tctgtgttct 3661 gacagacagc agaaatgaaa agcatttact ttgtggctgg attatttgta atgctggtac 3721 aaggcagctg gcaacgttcc cttcaagaca cagaggagaa atccaggtat taaatccgta 3781 gtctcgaact aacatatcaa tatggttgga ataaagcctg tgaaaactat gattagtgaa 3841 taaggtctca gtaatttaga ataaatattc tgcacaatga tcaaatgttt aaagtatcct 3901 tgtgataaaa gcagactttg ttggagtgta gatcagagtt tgtgtttttt ataataaaag 3961 gggagagaaa tgttgaaaaa catgttacag gggaaattat agctccttga aatctacaga 4021 tgatcctatc tatcaattca ttactcacac actgcataat agattacttc aattattcgc 4081 accaactctt ttctcttttt tcctccttta aaaaatagca ttatttcttc ctaaatgtaa 4141 aagccataca tgttcatgat ggaaagtatg aaaaatatgc aaaaaatatt aagtactcaa 4201 aattcctctg tccaaagaaa gctattcaaa gtaaacaaat ttggtgtatt actttcctgt 4261 gttttacgta aactgtacat aaatatctct tggctcatta tatggctttg tatcatgctt 4321 aatatcttaa cattgtatta ttatacactt ttattatatt attaaaaata tttgttaaac 4381 atgatattta atggcagcat actgtttcat ctcataaatg tacaatattt gtttagctac 4441 ttatttggga agcttaagag gtcttaaaat ttattccaca aagaatgttg caaacaacat 4501 tttatataca cactttccta catttcttac tatttttttg cacataggtt tctagttgta 4561 gaaatactgt gtcaaagggc ctgaatgagt ttaaaccttt tttaatatct tgccaacttg 4621 ctttccagaa agttcataca catgcacact ctcatcagca gtatagaaat actcctcata 4681 atatcctcct cagcattcaa gctttgctaa tttcataggt aaaaatagta tctcatttct 4741 gagtattaat tacattgact attttcccat gtatttacca 
tgccaaccaa ctgttgatta 4801 gtggcatagg gattattcct gatacagttt caaattagca agtatccttt ttcttgcctt 4861 tctttgaact tctagttgtc ttccttccac tgctagccac catataacta gataattgtg 4921 ctgttttcat attaatatat gcaaaacaaa attgggagct taaatctgct gcccaaagaa 4981 atgatttggg attcttcact cttttaaaat attcaagtca ctactctctt ctgtcactat 5041 tagcataatc atgagtagat taattagtag aaaaagattt gtttaatcct atttcagaaa 5101 aaaagaagtc agttgagaaa catgtttcta gaattcagaa taatagcgta tgcaactata 5161 ctaaaacgaa gggattctca tgacaagcta aggaagatct ttctaaacta cctattgaaa 5221 tactctagat gcctgcctta ctgttttata tggtcttgta ttttgtagtg aagatgcttc 5281 tcaagtgagt ctactcttga ggagagattt atgttgtacc aatcactgtt cttcacagat 5341 cattctcagc ttcccaggca gacccactca gtgatcctga tcagatgaac gaggacaagc 5401 gccattcaca gggcacattc accagtgact acagcaagta tctggactcc aggcgtgccc 5461 aagattttgt gcagtggttg atgaatacca acaggaacag gtaagagtct aagcctggct 5521 caaaacttgc ttataaatgt attaaatagg tctaaaattt ctcttgacat tattaagttc 5581 ttatgctcgt caaatagtgc ccataagccc tgatcatttt gaagatgttt agggttgggg 5641 atgctctcct tagtattatg cccccgaatg tcttaacccg cacccctcta aaccacttac 5701 ccaacctcca gtgtgacata gcctttaaat tattttccca gtggtagtcc tagcaatatt 5761 tacttgctta ctcattttcc cttaaagtga aagtttactt ttgtggacct acgaggttac 5821 caatctaatt tctaccaaaa gatagagcaa atataacatt ctttcgtggg gttcaaaccc 5881 aacattaaaa ttatccccca gccaagggta aggaatggaa acagatgcat tccttttctt 5941 gtagaataaa atccaagaca tgcatttaaa attacagcct agctgcacag catagagaga 6001 gaaggcagca aataagaaca caagcctatc gtaaggcata cattttaatg ttggattttc 6061 tttgacgagt gttgggtttt ttgctgcatc taggctactg gaggaaatta tccgcaaggt 6121 ttcacagaga tggtggaatt tcacaatttt tttggagggt ggcccacaag aggaagggag 6181 ttatgcatag aaagtagctg aagaaatcaa aaaatcagga aaaggacata gatttgtctg 6241 gatagactgg agaagagaga gagtgagagt aagaaaggag agagagagag agagagagag 6301 agagagagag agagaacaag caagcaggag gaataggaga aacagagttg ctaatcaaag 6361 gttcttaaag tagagggtac agccacagga agatggctat catggtcaga gcagacactg 6421 ggagaggtcc ctcttaagga ctacagtgaa aggtgagaaa tcacaaattt 
ataaaaatgc 6481 aaacaactgt cctcattttc tagaatggga atgccacaac ttggtaaatg gatgtaaagc 6541 aatcctcaca ctcaagcagt gttgcctatg tgtttaaaat gaggtgatat ggggtaatat 6601 gtgccaggga gactgacagc agctgatggc tcacctccac ggtgctctgc acacacttga 6661 tgttgatatg gggtcatctt ctttgctttc cttgttacta gcagatcaca cattttcttt 6721 aatccattat caagtggtcc ttttggtaac aattcaagat ggtcctttgc tttccctcaa 6781 atcaattaac taactctgac cccatactta ctaataacaa acagctgcct tgcttgcaag 6841 tcacatagtg cagtggatag cctgtatcca ttgagactga agtcaaaggt aaggcagctt 6901 taataaccaa tgacatacat aagcagggga ttgcagatgg gtatgctggg aagggagggg 6961 gagctttagc ccaccgattg ttattagctc tcttccacca gtttcaatcc agaacattaa 7021 tgtagcttca cgacaaatcc cctagcaacc ctcatctcct aaactcccta agccccttct 7081 acatagaatc ctgaacccaa agttgccact gcttataggt gagaaccata ttgccaaaga 7141 gacagatctt caatttaact ttcacatttc tttcaggaat aacattgcca aacgtcacga 7201 tgaatttgag agacatgctg aagggacctt taccagtgat gtaagttctt atttggaagg 7261 ccaagctgcc aaggaattca ttgcttggct ggtgaaaggc cgaggaaggc gagagtaagt 7321 ctgtacattc ttatttgaca ttttttgcct tgatgcagaa aatttaagac tacagttatc 7381 tatatatgga tctggattac agaagcaatt agtagtcttg caaagtaagg aaataattcc 7441 tattgatgaa aaacagtata taaaagttaa acccattttg tttttggtac taagtattaa 7501 taatagagcc aaacaggtta catttgtatc cccttatagt tgcattataa ttaggtatta 7561 aatctttgcc aaaccaggcc acctctgaga gtaaatagga tgtttttaat aactacgctg 7621 agacaaattt aaactaggac tgttctaggg gagctagaga taaggaaaaa gaagaaacag 7681 tgctgaaaac ttcaaatatg taaagaaaac atataatatt taaagcttaa ctttaaacat 7741 ttacatatgt gatcaacata tatttatata ttaataaata ttcagataaa aatgtgatga 7801 agactgaaag tgaccaaacc taagatgttg ataaaatgat attcaaagta caaataggta 7861 aaaacatcca tatatttaac tacatatcca attttgtatg gggctgtcat atagatatct 7921 actaaataat ctaagttgaa aaacaaccaa gaccatcaat tacttgctta gatcttaaca 7981 cagccaaaca gacccctgaa ccatctcatt ttcttccgat tttttttgga gagatgaaat 8041 atgagagacg gagaatttat gttcaactct gatttttaaa ttagatttaa aacaagctta 8101 ctgaaattta aagagatctc aagatgaaag agaactagaa taatggttgg tttttaaaac 
8161 attaataatt aaacttataa aaccaatggt aaaatagttt ctcactcttg acgatatttt 8221 gcagtgtttt aaagggatac caaaaattct gcaatagtaa accagtgaaa gagaaaaatc 8281 taatatagat gaagctttaa cctcttaata ctgcattttg caaggctgtt cctgcaagct 8341 ctggttttat aggatatgat atatttagtt gaattacaga ctaataatct caacaatagt 8401 ttctgtattg tcaatatact aaaatcttca aaacagccta gaagattgaa aagggcatga 8461 aattatgcag gcttggtatt agatcccagc tctgctactt tctctttcca gtagtcactg 8521 gtccacatgg gtttttatca ggtatttcac agtacaccct ttaaaaacag aatcctttac 8581 tgtttcctca agacacttgt gcatgttacc agtggtagac aatctgtgat catttattga 8641 aaatatctaa tcaaacgcta attttaacac ttattttctt agtttcccag aagaggtcgc 8701 cattgttgaa gaacttggcc gcagacatgc tgatggttct ttctctgatg agatgaacac 8761 cattcttgat aatcttgccg ccagggactt tataaactgg ttgattcaga ccaaaatcac 8821 tgacaggtga ctgcttttta gttaattctg aaaaccatca aattctcata agtactgatt 8881 gtcatactgc tgcaagctgt tccccatgta ggggaaagtc tataatctct tattatatag 8941 gaaataatta tgcatgttta agtcatataa gagtacatca ttatgtttgt tttgtatcat 9001 tacagtttgt atctatttat tcatttattc cagaaacatt tactgagtgt ctattagagt 9061 caagaagcta gaaacacaga tagaaatgaa acatgggaaa tgttacatca ttcacatatt 9121 gtcacattaa aaattgctat gttgaaattt cagcacaagg tgaaatgcag catatacaat 9181 attacatact taatttaata agaagagtag tgagaactgg acaccgaaaa atacttatct 9241 tgatgaattt tgatttttta ataattattc ataattgttt gatagcattg atgagcagac 9301 tttaggataa ataatcttta aatgaaaata ttttaagagt ccaaaggatg gagggattta 9361 tggacaaagg gattaataaa caacttttat caaaataaat cattgaaata ttttctggat 9421 aagttaatga tatcatctta ttataattat gttaaatcat tttctttttt taatctctag 9481 gaaataacta tatcactatt caagatcatc ttcacaacat cacctgctag ccacgtggga 9541 tgtttgaaat gttaagtcct gtaaatttaa gaggtgtatt ctgaggccac attgctttgc 9601 atgccaataa ataaattttc ttttagtgtt gtgtagccaa aaattacaaa tggaataaag 9661 ttttatcaaa atattgctaa aatatcagct ttaaaatatg aaagtgctag attctgttat 9721 tttcttctta ttttggatga agtaccccaa cctgtttaca tttagcgata aaattatttt 9781 tctatgatat aatttgtaaa tgtaaattat tccgatctga catatctgca ttataataat 9841 
aggagaatag aagaactggt agccacagtg gtgaaattgg aaagagaact ttcttcctga 9901 aacctttgtc ttaaaaatac tcagctttca atgtatcaaa gatacaatta aataaaattt 9961 tcaagcttct ttaccattgt ctgatttctc tttacgtccc cctttgtcat gccccactcc 10021 ttccaagcac cctgttttct aagctgcagt // LOCUS HSGMCSFG 3043 bp DNA PRI 03-JAN-1991 DEFINITION Human gene for granulocyte-macrophage colony stimulating factor (GM-CSF). ACCESSION X03021 NID g31858 KEYWORDS colony stimulating factor; granulocyte-macrophage colony stimulating factor; growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3043) AUTHORS Miyatake,S., Otsuka,T., Yokota,T., Lee,F. and Arai,K. TITLE Structure of the chromosomal gene for granulocyte-macrophage colony stimulating factor: comparison of the mouse and human genes JOURNAL EMBO J. 4 (10), 2561-2568 (1985) MEDLINE 86030234 COMMENT Data kindly reviewed (02-SEP-1986) by S. Miyatake. FEATURES Location/Qualifiers source 1..3043 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 597..603 /note="put. TATA box" misc_feature 620..622 /note="transcription initiation region" CDS join(662..820,914..955,1643..1768,2578..2685) /note="put. 
GM-CSF precursor" /codon_start=1 /db_xref="PID:g31859" /db_xref="SWISS-PROT:P04141" /translation="MWLQSLLLLGTVACSISAPARSPSPSTQPWEHVNAIQEARRLLN LSRDTAAEMNETVEVISEMFDLQEPTCLQTRLELYKQGLRGSLTKLKGPLTMMASHYK QHCPPTPETSCATQIITFESFKENLKDFLLVIPFDCWEPVQE" intron 821..913 /note="intron I" intron 956..1642 /note="intron II" intron 1769..2577 /note="intron III" misc_feature 2998 /note="polyadenylation site" BASE COUNT 668 a 824 c 878 g 673 t ORIGIN 1 ttctcagagt ggctgcagtc tcgctgctgg atgtgcacat ggtggtcatt ccctctgctc 61 acaggggcag gggtcccccc ttactggact gaggttgccc cctgctccag gtcctgggtg 121 ggagcccatg tgaactgtca gtggggcagg tctgtgagag ctcccctcac actcaagtct 181 ctctcacagt ggccagagaa gaggaaggct ggagtcagaa tgaggcacca gggcgggcat 241 agcctgccca aaggcccctg ggattacagg caggatgggg agccctatct aagtgtctcc 301 cacgccccac cccagccatt ccaggccagg aagtccaaac tgtgcccctc agagggaggg 361 ggcagcctca ggcccattca gactgcccag ggagggctgg agagccctca ggaaggcggg 421 tgggtgggct gtcggttctt ggaaaggttc attaatgaaa acccccaagc ctgaccacct 481 agggaaaagg ctcaccgttc ccatgtgtgg ctgataaggg ccaggagatt ccacagttca 541 ggtagttccc ccgcctccct ggcattttgt ggtcaccatt aatcatttcc tctgtgtatt 601 taagagctct tttgccagtg agcccagcta cacagagaga aaggctaaag ttctctggag 661 gatgtggctg cagagcctgc tgctcttggg cactgtggcc tgcagcatct ctgcacccgc 721 ccgctcgccc agccccagca cgcagccctg ggagcatgtg aatgccatcc aggaggcccg 781 gcgtctcctg aacctgagta gagacactgc tgctgagatg gtaagtgaga gaatgtgggc 841 ctgtgctagg caccagtggc cctgactggc cacgcctgtc agcttgataa catgacattt 901 tccttttcta cagaatgaaa cagtagaagt catctcagaa atgtttgacc tccaggtaag 961 atgcttctct ctgacatagc tttccagaag cccctgccct ggggtggagg tggggactcc 1021 attttagatg gcaccacaca gggttgtcca ctttctctcc agtcagctgg ctgcaggagg 1081 agggggtagc aactgggtgc tcaagaggct gctggccgtg cccctatggc agtcacatga 1141 gctcctttat cagctgagcg gccatgggca gacctagcat tcaatggcca ggagtcacca 1201 ggggacaggt ggtaaagtgg gggtcacttc atgagacagg agctgtgggt ttggggcgct 1261 cactgtgccc cgagaccaag tcctgttgag acagtgctga ctacagagag gcacagaggg 1321 gtttcaggaa 
caacccttgc ccacccagca ggtccaggtg aggccccacc cccctctccc 1381 tgaatgatgg ggtgagagtc acctccttcc ctaaggctgg gctcctctcc aggtgccgct 1441 gagggtggcc tgggcggggc agtgagaagg gcaggttcgt gcctgccatg gacagggcag 1501 ggtctatgac tggacccagc ctgtgcccct cccaagccct actcctgggg gctgggggca 1561 gcagcaaaaa ggagtggtgg agagttcttg taccactgtg ggcacttggc cactgctcac 1621 cgacgaacga cattttccac aggagccgac ctgcctacag acccgcctgg agctgtacaa 1681 gcagggcctg cggggcagcc tcaccaagct caagggcccc ttgaccatga tggccagcca 1741 ctacaagcag cactgccctc caaccccggt gagtgcctac ggcagggcct ccagcaggaa 1801 tgtcttaatc tagggggtgg ggtcgacatg gggagagatc tatggctgtg gctgttcagg 1861 accccagggg gtttctgtgc caacagttat gtaatgatta gccctccaga gaggaggcag 1921 acagcccatt tcatcccaag gagtcagagc cacagagcgc tgaagcccac agtgctcccc 1981 agcaggagct gctcctatcc tggtcattat tgtcattacg gttaatgagg tcagaggtga 2041 gggcaaaccc aaggaaactt ggggcctgcc caaggcccag aggaagtgcc caggcccaag 2101 tgccaccttc tggcaggact ttcctctggc cccacatggg gtgcttgaat tgcagaggat 2161 caaggaaggg aggctacttg gaatggacaa ggacctcagg cactccttcc tgcgggaagg 2221 gagcaaagtt tgtggccttg actccactcc ttctgggtgc ccagagacga cctcagccca 2281 gctgccctgc tctgccctgg gaccaaaaag gcaggcgttt gactgcccag aaggccaacc 2341 tcaggctggc acttaagtca ggcccttgac tctggctgcc actggcagag ctatgcactc 2401 cttggggaac acgtgggtgg cagcagcgtc acctgaccca ggtcagtggg tgtgtcctgg 2461 agtgggcctc ctggcctctg agttctaaga ggcagtagag aaacatgctg gtgcttcctt 2521 cccccacgtt acccacttgc ctggactcaa gtgtttttta tttttctttt tttaaaggaa 2581 acttcctgtg caacccagat tatcaccttt gaaagtttca aagagaacct gaaggacttt 2641 ctgcttgtca tcccctttga ctgctgggag ccagtccagg agtgagaccg gccagatgag 2701 gctggccaag ccggggagct gctctctcat gaaacaagag ctagaaactc aggatggtca 2761 tcttggaggg accaaggggt gggccacagc catggtggga gtggcctgga cctgccctgg 2821 gcacactgac cctgatacag gcatggcaga agaatgggaa tattttatac tgacagaaat 2881 cagtaatatt tatatattta tatttttaaa atatttattt atttatttat ttaagttcat 2941 attccatatt tattcaagat gttttaccgt aataattatt attaaaaata tgcttctact 3001 tgtccagtgt tctagtttgt 
ttttaaccat gagcaaatgc cat // LOCUS HSGROW2 1964 bp DNA PRI 16-DEC-1994 DEFINITION Human germ line gene for growth hormone (presomatotropin). ACCESSION V00520 J00148 NID g31906 KEYWORDS complementary DNA; germ line; growth hormone; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1964) AUTHORS DeNoto,F.M., Moore,D.D. and Goodman,H.M. TITLE Human growth hormone DNA sequence and mRNA structure: possible alternative splicing JOURNAL Nucleic Acids Res. 9 (15), 3719-3730 (1981) MEDLINE 82014939 COMMENT This entry was previously called . See for mRNA sequence (with some differences). FEATURES Location/Qualifiers source 1..1964 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript (274.275)..1905 mRNA join((274.275)..344,601..761,971..1090,1184..1348, 1602..1905) exon (274.275)..344 /number=1 CDS join(335..344,601..761,971..1090,1184..1348,1602..1799) /codon_start=1 /product="growth hormone" /db_xref="PID:g312406" /translation="MATGSRTSLLLAFGLLCLPWLQEGSAFPTIPLSRLFDNASLRAH RLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLELL RISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPR TGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF" intron 345..600 /number=1 exon 601..761 /number=2 intron 762..970 /number=2 exon 971..1090 /number=3 intron 1091..1183 /number=3 exon 1184..1348 /number=4 intron 1349..1601 /number=4 exon 1602..1905 /number=5 BASE COUNT 451 a 550 c 550 g 413 t ORIGIN 1 agggcaccca cgtgaccctt aaagagagga caagttgggt ggtattttct ggctgacact 61 ctgtgcacaa ccctcacaac actggttgac ggtgggaagg gaaagatgac aagccagggg 121 catgatccca gcatgtgtgg gaggagcttc taaattatcc attagcacaa gcccgtcagt 181 ggccccatgc ataaatgtac acagaaacag gtgggggcaa cagtgggaga gaaggggcca 241 gggtataaaa agggcccaca agagaccggc tcaaggatcc caaggcccaa ctccccgaac 301 cactcagggt cctgtggacg ctcacctagc tgcaatggct acaggtaagc gcccctaaaa 361 tccctttggg 
cacaatgtgt cctgagggga gaggcagcga cctgtagatg ggacgggggc 421 actaaccctc aggtttgggg cttctgaatg agtatcgcca tgtaagccca gtatggccaa 481 tctcagaaag ctcctggtcc ctggagggat ggagagagaa aaacaaacag ctcctggagc 541 agggagagtg ctggcctctt gctctccggc tccctctgtt gccctctggt ttctccccag 601 gctcccggac gtccctgctc ctggcttttg gcctgctctg cctgccctgg cttcaagagg 661 gcagtgcctt cccaaccatt cccttatcca ggctttttga caacgctagt ctccgcgccc 721 atcgtctgca ccagctggcc tttgacacct accaggagtt tgtaagctct tggggaatgg 781 gtgcgcatca ggggtggcag gaaggggtga ctttcccccg ctgggaaata agaggaggag 841 actaaggagc tcagggtttt tcccgaagcg aaaatgcagg cagatgagca cacgctgagt 901 gaggttccca gaaaagtaac aatgggagct ggtctccagc gtagaccttg gtgggcggtc 961 cttctcctag gaagaagcct atatcccaaa ggaacagaag tattcattcc tgcagaaccc 1021 ccagacctcc ctctgtttct cagagtctat tccgacaccc tccaacaggg aggaaacaca 1081 acagaaatcc gtgagtggat gccttgaccc caggcgggga tgggggagac ctgtagtcag 1141 agcccccggg cagcacaggc caatgcccgt ccttcccctg cagaacctag agctgctccg 1201 catctccctg ctgctcatcc agtcgtggct ggagcccgtg cagttcctca ggagtgtctt 1261 cgccaacagc ctggtgtacg gcgcctctga cagcaacgtc tatgacctcc taaaggacct 1321 agaggaaggc atccaaacgc tgatgggggt gggggtggcg ctaggggtcc ccaatcttgg 1381 agccccactg actttgagag ctgtgttaga gaaacactgc tgccctcttt ttagcagtcc 1441 aggccctgac ccaagagaac tcaccttatt cttcatttcc cctcgtgaat cctctagcct 1501 ttctctacac cctgaagggg agggaggaaa atgaatgaat gagaaaggga gggagcagta 1561 cccaagcgct tggcctctcc ttctcttcct tcactttgca gaggctggaa gatggcagcc 1621 cccggactgg gcagatcttc aagcagacct acagcaagtt cgacacaaac tcacacaacg 1681 atgacgcact actcaagaac tacgggctgc tctactgctt caggaaggac atggacaagg 1741 tcgagacatt cctgcgcatc gtgcagtgcc gctctgtgga gggcagctgt ggcttctagc 1801 tgcccgggtg gcatccctgt gacccctccc cagtgcctct cctggccttg gaagttgcca 1861 ctccagtgcc caccagcctt gtcctaataa aattaagttg catcattttg tctgactagg 1921 tgtcctctat aatattatgg ggtggagggg ggtggtttgg agca // LOCUS HSGTRH 7224 bp DNA PRI 25-JUN-1997 DEFINITION Human gene for gonadotropin-releasing hormone. 
ACCESSION X15215 NID g31955 KEYWORDS gonadotropin-releasing hormone. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7224) AUTHORS Hayflick,J.S. TITLE Direct Submission JOURNAL Submitted (09-MAY-1989) Hayflick J.S., Oregon Health Sciences University, 3181 S W Sam Jackson Park Rd, Portland OR 97201-3098, U S A REFERENCE 2 (bases 1 to 7224) AUTHORS Hayflick,J.S., Adelman,J.P. and Seeburg,P.H. TITLE The complete nucleotide sequence of the human gonadotropin-releasing hormone gene JOURNAL Nucleic Acids Res. 17 (15), 6403-6404 (1989) MEDLINE 89366682 FEATURES Location/Qualifiers source 1..7224 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /clone_lib="Maniatis lambda Charon" /chromosome="8" /map="p11-p21.4A" TATA_signal 1009..1114 mRNA join(1039..1192,2063..2204,3720..3815,5960..6160) exon 1039..1192 /number=1 intron 1193..2062 /number=1 exon 2063..2204 /number=2 CDS join(2064..2204,3720..3815,5960..6001) /codon_start=1 /product="gonadotropin-releasing hormone" /db_xref="PID:g31956" /db_xref="SWISS-PROT:P01148" /translation="MKPIQKLLAGLILLTWCVEGCSSQHWSYGLRPGGKRDAENLIDS FQEIVKEVGQLAETQRFECTTHQPRSPLRDLKGALESLIEEETGQKKI" intron 2206..3719 /number=2 exon 3720..3815 /number=3 intron 3816..5959 /number=3 exon 5960..6160 /number=4 polyA_signal 6138..6143 BASE COUNT 2145 a 1491 c 1421 g 2167 t ORIGIN 1 aagctttttg ctttctattc attcattcgt tcattcattc attcaaacct atacttaccg 61 aatgctcact aaatgccggg ggtttattaa gagagattta aataagatgg gatctttgac 121 tattacaggt ttcagcctag gggtaaatta ggggaagaca accatgtatt caaataaatg 181 taattaagag taatggttgt gtgtgtattt tacatgcttg tcctgtgtaa ataacacgtc 241 cacggttgca cctctggggt ggaacatcta taaaatttag ataatgatac ccactttgca 301 tggctattgt aatgagtgct cttatacatt tgctatttat taaataacta taatttctca 361 tctttctgtt cccactgccc ttaagagtga tttgcatatt taactcaata agcatctact 421 gaaatgagtt gatctgttga tgtaagtctg 
ctcaatatgg tcttgctctc agaatatgtt 481 tcttgccttt ttgatgcttt agaaggcttt caaggtaagt caagcaggga acctggtggg 541 gtagatgagg gaattttcaa acacacaact gtctgattta ggatcctaca tggacttggt 601 atatagtgtc acttacttgt aaatcagatt tttaaaattg gaagcaactc tgtgatcatc 661 tagtccatct agtctacacc cttcctttta caaatgaaga atccaagagc cagaagctcc 721 cagacatcct gactcaatgt cctatatttg ttgtatagcc tcctttgtgg aagttatgta 781 tgcatttgac ttcacttaat ctaagacatc tattttcctt gaactcttga taggtctgct 841 ggtttcctca agggaatcca atctagctgg attttaatct ctttgaattg tgtcctcagc 901 tataaaagtt ttagctgagg ttttaatggc tgcacttaag taaatctaac agatatacca 961 gggggtgttc caattacata caccattaaa gggctttatg tgaggatttt taaaaattac 1021 cattaaaaaa aaaaaagcat agtccatttg cagtataatt taccagcagg aaagatttca 1081 atgtcctgga aaaattccct ataaaaagga agataggaaa acagaaaagt cacagtactc 1141 aacctacttc aagggaagat tgggatcttt ttggctctct gcctctaaac aggtaaaagg 1201 ctttgtatta tttctagcac gagtttttct tctttagatt gcatgctatt gtatgtctac 1261 agggcatttg acagcccaag ggctaaatcc aggtgtgacg gtatctaatg atgtcctgtc 1321 cttcactgtc cttgccatca ccagccacag agatccaggc tttggggact cccacagctt 1381 atcgaccagt gtttgattta gtttttagcc tctttcccat caaatgaaaa ttaacttgga 1441 gacacatttc attagaaaat tagaggcccc cttggctagg aaggcatctg gtctgggact 1501 aactactttg aacagtgttg agtcctctct cccacagatg gttcagccag cagtaatgct 1561 aggcaagact gaaggataaa tagaaaaatg tcattagtac catggggtag ccatgtaatg 1621 tcaagcaatt ttatattagc cagagattcc tagtaggagc tacttttctt aacagatgac 1681 tcagttctct ctatctcagg aatgaaagag ttgaagacca atccacaaca ggggaaatgt 1741 taaggcaaaa tgatgaactt gataagggat gaattatggg gtttggataa ccaaacaata 1801 aaaataaaag tatagactat tttagtacta aaaaggtcct gaacatgtga gcttaagtac 1861 tcattttgtc cccagtggct aagaaactaa aggcaagcca gcaagtgtct ctgagtttca 1921 gtgtctgtat gtaaaaactg actctgactt ccatcttctg cagggttagt gatacagatg 1981 ctagcttttt cactaaagag gtcttttagt ttatactcaa ccttgtctgg atctaatttg 2041 attgtgcatt catgtgcctt agaatgaagc caattcaaaa actcctagct ggccttattc 2101 tactgacttg gtgcgtggaa ggctgctcca gccagcactg 
gtcctatgga ctgcgccctg 2161 gaggaaagag agatgccgaa aatttgattg attctttcca agaggtaagt ttctctcagc 2221 ttcaaaataa gacatagtga tttgattcaa tttaactata ttaaacattc aggatagccc 2281 caatgtcaat attctatgat gttgtaccct agatgctcca ggtgagataa ggcacttaca 2341 aagtagaagt cccattccta ctttcagttc acacagggac taaacaaaga gctggaaaaa 2401 ttccaaaaga atatattaat agcaacaagt gtgagacagc acgtcatact ctgagtgtat 2461 gggattgcta aaggaattag aaacaatggg atagggtcaa ggtctgtatc agaagatgat 2521 tctttgggat ttggaaaaaa cgttagaact tcgtattttt tttcctcatt tcatctttta 2581 aacatatcta tgctattaga tcagtacatt tttataattt ataaatgagt atccttatca 2641 aggatgcaat gccctataat ttcttttcac tgataggggc atttaagcaa agttggagac 2701 tggtagaata caggaattag caaactcaag atgataagat aacgtagtag aaaacatgct 2761 gatttaaatt catatagatt agattatagg actggcacca aacctaacgt gaggtacttg 2821 ctttcttgtt ttttggggtt attttctaag acagggtctc gctgtgttat ctaggctgga 2881 atttagtggc acgatcacag ctcactgcag cttgacttct tgggctcaag tgatcctccc 2941 acctcagccg tctgcatagc tgggaccaca ggcatgcact atcacaccca actaattttt 3001 aaaacttttt atagagatgg ggtttcccta tgttgcccag gctggtctca aactcctggg 3061 ctcaagggat cctcccacct tggcctccca aagtgccggg agatttttgt ttataaaata 3121 tgaccactca tcagggtcat gtaaggaaag aagccatcta tgttagctga ttaacctgaa 3181 aaataaccta ggactgagaa tgggaaaaat tttaaatcat ttcattatca ttggaaggaa 3241 ttatctcttt tctgagaaat aataaagata atttagtatt aaagaagacc cagaatctga 3301 agcctcttct ctgcaggtta tacatgaagc aaatctcatt gaactataac atatttagta 3361 aaacctagaa aataaaaacc aacctttttt acaactataa actcttgggg ttttttgctt 3421 tttgtttttt tgggtatact gactctcatg aggctcaaag tgcctccctc ttttcatctt 3481 taaggggaaa atatgatact tcttactgtc tccattatct ccagatcccc atgccattca 3541 gtagaaatgt cagatggcag atctgtgtcc ttaaagtccc tatgctaatc ctgcaacttt 3601 cccaatctcc ccagccccac atccccttga ccccactctc cacaattttt tggtggaaat 3661 ggaaaacacc atttcatttc tttatctcca tctcaaagca tcacattctc tttcttcaga 3721 tagtcaaaga ggttggtcaa ctggcagaaa cccaacgctt tgaatgcacc acgcaccagc 3781 cacgttctcc cctccgagac ctgaaaggag ctctggtaag ttaaagtgat 
cataacatga 3841 tcacagcata gagctctaga ggtggataag cctttgggga tcacttagga cagctacctc 3901 ccagatactg tggggcttac attcctgact cctctgttac ctcctgtggt aggaccgtgt 3961 ttcatgacaa tcccagttgg tggttagaca gcactagggg ctgaaacgtt tttttgtttg 4021 tttgtttttt ctgaagttga attgaaatct ctctgtaact tttatctctt aatcctggtt 4081 ctagcttttg ggataactaa caaaacaaat ttcttcccac tgctgcattt catttcttca 4141 agtaaaatca ccaaaccccc tagactactc cactccaggc tactccacag ccctccacct 4201 gatcctgcaa ctgtgcttta tcttacatgg ttttccagaa ccttggggaa tagagacatg 4261 agaaacactg ctgtagatgg gttttttttt ctcttctttg gaatgaaaaa tgccaaacta 4321 ctaaatttat aatttagaga gtgatggact tgatttccag tttcctgata ggacaataat 4381 cacctccaaa ttccaccccc caaaatggaa atacactaat catattaggt ttttgatgaa 4441 aaagtataaa gagaattgaa tgtataaatt gaatctttta aaaaaattat ttgttgagac 4501 aggatcttgc tctgttcccc gggctagagt gcaatggtgc aatcaatgct caccacagcc 4561 tcaacctccc aggctcaagc aattctggag actagattag cctctctagt agctgggact 4621 ataagcactt gccaccacac ccagctgatt ttatttttta attttttgtt agagacaggg 4681 tctatgttgc ccaggctggt ctcaaactcc cagcctcaag atccacccaa agtgctggga 4741 ttaaaggtgc aagccactgt gcctgactta agttgaatct tggattcaat gttgatattc 4801 tctgatctct attgtccact tatctgcagc aatcagaagg cattacagtt aatgatcagt 4861 tatgcctagg agctgggaaa gcccaaataa atcatatata aaaataagct gtaattttaa 4921 ttgtctacag tgacttcaac ttaatatacc cacagaacaa agaaaaaagt gggcagacgt 4981 cgttatttcc ttttttgttt tttttttgag acggagtctc gctctgttgc caggctggag 5041 tgcagtggcg caatctcggc tcaatgcaac ctccatctcc tgggttcaag cgattctcct 5101 gactcagcct cccgagtagc tgagattaca ggcatgagcc accacactga ctaatttttg 5161 tatttttagt agagatgggg tttcaccatg ttggccagga tggtctcaat cccttaacct 5221 tgtgatccgc ccactggcct cccaaagtgc tgggattaca ggcatgaggc caccacgccc 5281 gagcctattt cctttctttt tctaatcttg cttactgcat tacaaaaatg gcaagcagtg 5341 aaatttgtca aacatgacat tatgaagaaa ttgaagcaaa ggctggttta atagcaaagt 5401 aattgaccag actttttttt cacttccttc ctcacaactc atccttaaac tattaatgta 5461 gattttatgt atattaagtg cttaaaaaga cccaatcggc caggcacagt ggctcatgct 
5521 gtaatcctag cattttggga agccgaggta agtggatcac ttgaggccag gagttcaaga 5581 ccagcctggc caacatggtg acaccctgtc tctactaaaa ctataaaaat tagccagaag 5641 tgcggtgatg catgcctgca atcccaacta ctagagaggc tgaggcacga gaatcatttg 5701 aacctggaag gtggaggttg ccatgagctg agatcatact actgcactcc agcctgggtg 5761 acagagtgag gttctgtctc aaaaaaaaaa aaaaccagaa aaacaaaccc aatttatcat 5821 gtctccctag cactaactag agcacaaaat caaacagacc aattccttcc agactgatat 5881 tttagaaatt aaaatgtcaa aatgtaatga aattcagctg gtaaagtcag tcttgatata 5941 tttgttatat atttttcagg aaagtctgat tgaagaggaa actgggcaga agaagattta 6001 aatccattgg gccagaagga atgaccatta ctaacatgac ttaagtataa ttctgacatt 6061 gaaaatttat aacccattaa atacctgtaa atggtatgaa tttcagaaat ccttacacca 6121 agttgcacat attccataat aaagtgctgt gttgtgaatg aagtggcata cctgttaaat 6181 ctttctccaa ctcagaactc cgggggaagg atcactgtaa acccaccaaa gggagccctc 6241 catgtgtgta tacaggtggc agatgggagg gcaggtaaga taaagtgtct gttgttgaca 6301 aaagggatct caggctctcc agcacccata ccctgcatct acccacaagc agaacagcca 6361 catactggtc cagccagaaa aagctgattc agctccagtt tctctgggat tataacttta 6421 tcttcgacca tactcttcag aagttgaggt ggggccacgg ccaaggcttt cttccacttg 6481 gaaagaagtt ccctcccttg atcgtctcca aaccctttga aagtttactg gaacccaaat 6541 gaggcctggg ggtaaggaga gggggcctcc aaggactcct agtccagcgc tccttctggt 6601 cccacgcaat ctatccaagt ggtgcacact gagggttggg actgagacct aaaaaataat 6661 agagtgttca tctgcttgat ctgtttggtt tgcatttaag aaacacaatg gagtacagac 6721 agaccgttgg agatgggact ctattgttcc tatgtcccct ggtcagatag cattgccacc 6781 attctttctc caaaaagacc aggggcaggc tttgtgctac atatgtgcac agatattttt 6841 actaaccatc ataagaacct tggattgtag ttagtattgc ctttgcttta aaatgaggaa 6901 acgggctcag agagtttaag agacccagcc atgtttggtt ccatgttcac tattctagta 6961 ttgccctatg atgagcagct aacagaactg gactttggag gctgggttat tttgtccatt 7021 ggagctgaaa caggaaacta gggttcatag gtcaatgtag atttagcaat gactaatcct 7081 ccataaggcc ccagacacta tgacattgat gttctcattt aggccacata acttccaagt 7141 ggctatgaga ccattggaaa aaaaaaaaca aatttcagat aacttgccca atgtctcagt 7201 
aacaaactga cctgtcaaag atct // LOCUS HSHAP1 2957 bp DNA PRI 25-SEP-1992 DEFINITION H.sapiens HAP1 gene for AP endonuclease 1. ACCESSION X66133 NID g32021 KEYWORDS endonuclease I; HAP1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2957) AUTHORS Robson,C. TITLE Direct Submission JOURNAL Submitted (01-JUN-1992) C. Robson, Imperial Cancer Res Fund, Mol Oncology Laboratory IMM, John Radcliffe Hospital, Headington, Oxford OX39 9DU, UK REFERENCE 2 (bases 1 to 2957) AUTHORS Robson,C.N., Hochhauser,D., Craig,R., Rack,K., Buckle,V.J. and Hickson,I.D. TITLE Structure of the human DNA repair gene HAP1 and its localisation to chromosome 14q 11.2-12 JOURNAL Nucleic Acids Res. 20 (17), 4417-4421 (1992) MEDLINE 93027134 FEATURES Location/Qualifiers source 1..2957 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lorist" exon <587..587 /number=1 intron 588..769 /number=1 exon 770..895 /number=2 gene 838..2699 /gene="HAP1" CDS join(838..895,1106..1293,1859..2051,2182..2699) /gene="HAP1" /codon_start=1 /product="AP endonuclease 1" /db_xref="PID:g32022" /db_xref="SWISS-PROT:P27695" /translation="MPKRGKKGAVAEDGDELRTEPEAKKSKTAAKKNDKEAAGEGPAL YEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSEN KLPAELQELPGLSHQYWSAPSDKEGYSGVGLLSRQCPLKVSYGIGDEEHDQEGRVIVA EFDSFVLVTAYVPNAGRGLVRLEYRQRWDEAFRKFLKGLASRKPLVLCGDLNVAHEEI DLRNPKGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTYMMNARSK NVGWRLDYFLLSHSLLPALCDSKIRSKALGSDHCPITLYLAL" intron 896..1105 /gene="HAP1" /number=2 exon 1106..1293 /gene="HAP1" /number=3 intron 1294..1858 /gene="HAP1" /number=3 exon 1859..2051 /gene="HAP1" /number=4 intron 2052..2181 /gene="HAP1" /number=4 exon 2182..>2699 /gene="HAP1" /number=5 BASE COUNT 712 a 713 c 714 g 818 t ORIGIN 1 ctgcagatag cactgggaaa gacaccgcgg aactcccgcg agcgagaccc gccaaggccc 61 ctccagggac ctgtcttcct aacgtccagg gagcccgagc caactcgcgc cttacattcg 121 tatccgtttt 
cctatctctt tcccgtggtc agcccagcct tctccactgt ttttttcctc 181 ttgcacagag ttagaatctt aagtcagtgt cacacaatgt gctgtgcatc tggcacaacg 241 ataaacagcc gagggagggt tggggactaa gtgcctagag aattagagga gggaggcgag 301 gctaagcgtc cgtcacgtgg tgtcagacag accaatcacg cgcattcttc ggccacgaca 361 agcgcgcctc tgatcacgtg accaggtccg ctacccacgt gggggctcag cgtgcaccct 421 tctttgtgct cgggttagga ggagctaggc tgccatcggg ccggtgcaga tacggggttg 481 ctcttttgct cataagaggg gcttcgctgg cagtctgaac ggcaagcttg agtcaggacc 541 cttaattaag atcctcaatt ggctggaggg cagatctcgc gagtagggta caaggcacta 601 tgaaatgatc tagtttcgtg ggtgaggggc tgaagggcct atgatgcacg gaggcgggga 661 aaggatttag agataacgtg gtttaaaggc gggacctggt gcggggacgc tccttgggag 721 gagtcttctc ccagccttag ctggtttcat gatttctttg cgtctgtagg caacgcggta 781 aaaatattgc ttcggtgggt gacgcggtac agctgcccaa gggcgttcgt aacgggaatg 841 ccgaagcgtg ggaaaaaggg agcggtggcg gaagacgggg atgagctcag gacaggtaag 901 ggaatgaaat cagcccttct tcctagaagc tgcggcgggg gtgtttgtca ttcccttgat 961 gtacggtaag tacgggccga ctcatttttg caggggtttg tgaagaagtc gcaggaaccg 1021 taggctttcg ttgggtctat agttaacgcc ggatcgcagt tggaaaccac cagctttttg 1081 tcagtatata ttactcattt tatagagcca gaggccaaga agagtaagac ggccgcaaag 1141 aaaaatgaca aagaggcagc aggagagggc ccagccctgt atgaggaccc cccagatcag 1201 aaaacctcac ccagtggcaa acctgccaca ctcaagatct gctcttggaa tgtggatggg 1261 cttcgagcct ggattaagaa gaaaggatta gatgtgagtg gaatttgagg gaaagagaca 1321 ttttttagta ttgaatggtc ttagggttta gtcacccctt ttctccgttt agccttcagg 1381 ctgttttatt tttctcctgc ccgtagtttt ctgtggggct tccccagtct tgccagttgt 1441 atttcctaaa tgtctgttcc ttcacttcca ttgccatttt cttttttagt gttctctcct 1501 cttcccagaa tgttgcaaaa acctcttcac tatacttcct ccattttatc ttcctgcatt 1561 gcattccata tgaagcatgt cctccattcc attaaccata gcttaaaatc ttagcttgct 1621 atccactgcc tatagaaaaa acacatctcc ttggcatagc atgtaagact ttcttacctc 1681 tctatatttg ttttcattta tctagcttag aattgtttga atattgtgct gcttgactcg 1741 aactccttag gccaagagac tgtttaaccc gtgcgtatct atgacttagc atatagatta 1801 ttcaataaat gttctgctga attgataata 
cgttttccac ctttcttttc acttacagtg 1861 ggtaaaggaa gaagccccag atatactgtg ccttcaagag accaaatgtt cagagaacaa 1921 actaccagct gaacttcagg agctgcctgg actctctcat caatactggt cagctccttc 1981 ggacaaggaa gggtacagtg gcgtgggcct gctttcccgc cagtgcccac tcaaagtttc 2041 ttacggcata ggtgagaccc tattgatgcc taatgcctga actcttcaaa accaattgct 2101 aattctctat ctctgcccca cctcttgatt gctttccctt ttcttatagt tttttatgct 2161 aattctgttt catttctata ggcgatgagg agcatgatca ggaaggccgg gtgattgtgg 2221 ctgaatttga ctcgtttgtg ctggtaacag catatgtacc taatgcaggc cgaggtctgg 2281 tacgactgga gtaccggcag cgctgggatg aagcctttcg caagttcctg aagggcctgg 2341 cttcccgaaa gccccttgtg ctgtgtggag acctcaatgt ggcacatgaa gaaattgacc 2401 ttcgcaaccc caaggggaac aaaaagaatg ctggcttcac gccacaagag cgccaaggct 2461 tcggggaatt actgcaggct gtgccactgg ctgacagctt taggcacctc taccccaaca 2521 caccctatgc ctacaccttt tggacttata tgatgaatgc tcgatccaag aatgttggtt 2581 ggcgccttga ttactttttg ttgtcccact ctctgttacc tgcattgtgt gacagcaaga 2641 tccgttccaa ggccctcggc agtgatcact gtcctatcac cctataccta gcactgtgac 2701 accaccccta aatcactttg agcctgggaa ataagccccc tcaactacca ttccttcttt 2761 aaacactctt cagagaaatc tgcattctat ttctcatgta taaaactagg aatcctccaa 2821 ccaggctcct gtgatagagt tcttttaagc ccaagatttt ttatttgagg gttttttgtt 2881 ttttaaaaaa aaattgaaca aagactacta atgactttgt ttgaattatc cacatgaaaa 2941 taaagagcca tagtttc // LOCUS HSHCC1GEN 4037 bp DNA PRI 01-OCT-1995 DEFINITION H.sapiens gene for chemokine HCC-1. ACCESSION Z49269 NID g1004266 KEYWORDS chemokine. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4037) AUTHORS Pardigol,A., Maegert,H.J., Cieslak,A., Hill,O., Schulz-Knappe,P. and Forssmann,W.G. TITLE Nucleotide Sequence of the Gene for the Human Chemokine HCC-1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 4037) AUTHORS Pardigol,A. 
TITLE Direct Submission JOURNAL Submitted (18-MAY-1995) Andreas Pardigol, Molecular Biology, Lower Saxony Institute for Peptide Research, Feodor-Lynen-Strasse 31, Hannover, Lower Saxon, 30625, Germany FEATURES Location/Qualifiers source 1..4037 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ph3b7" /dev_stage="adult" /tissue_type="placenta" /clone_lib="lambda FIX II, Cat.Nr. 946203, Stratagene" /sex="male" TATA_signal 727..733 /note="putative, determined by consensus rules." 5'UTR 764..833 /note="first base determined by means of consensus rules" exon 764..912 /note="first base determined by means of consensus rules; base 780 is the first base of cDNA (Z49270)" /number=1 CDS join(834..912,3021..3135,3585..3672) /codon_start=1 /product="chemokine HCC-1" /db_xref="PID:g1004267" /translation="MKISVAAIPFFLLITIALGTKTESSSRGPYHPSECCFTYTTYKI PRQRIMDYYETNSQCSKPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN" intron 913..3020 /number=1 exon 3021..3135 /number=2 intron 3136..3584 /number=2 exon 3585..3817 /number=3 3'UTR 3673..3817 BASE COUNT 1023 a 1048 c 1004 g 962 t ORIGIN 1 gagctccgtt gggagtccca tgtttcttta tggcataatg ggtgagaaca cagacttgga 61 agccaaacca cctgaatttg aaccccagtt ccatttacca actgtcaaaa gcttaggctt 121 tgattctaag cctgtttcct caactgctgt tctaaagatt aaataggcta atattcataa 181 ggcaactggg acagtggctt gtgtgtatag caaccattat ataagtgaat tatctactga 241 gcaccacagc acttcttcac tccatggtgt ggtgaccaga atggagatga gacagagaac 301 tgcaggttct gcttcgagtt taagttagga tttcccttga ccaatgagac ctgacttgga 361 ggagtcctgg cctcattcca ttaccccaaa caccctctag tctctagatg aacagatcct 421 gaatgtccag gccccacgtg gcctgttcta aggcctgaga tggaattgga tacaggacac 481 atccagcctt gagatctttt gctaagtgtg acacagtgcc cccagccctg tgctcatgtt 541 catgcctagg gaaaggcttc tatcaaaaga gttgaacttc ttcccactgg ggatggaaga 601 ccatttcctc ccttaaacct tggctctccc tgcttccttc aggccaccaa caacacatgt 661 gcaggatatg aaattgctga ggcatcactg ctttcctact tcccttccaa gtctcagctc 721 ccttatttta aaaaatattt ggcctcaatg atcatttctc aacaattcct caccgcagga 781 gcctctgaag 
ctcccaccag gccagctctc ctcccacaac agcttcccac agcatgaaga 841 tctccgtggc tgccattccc ttcttcctcc tcatcaccat cgccctaggg accaagactg 901 aatcctcctc acgtgagtgc aatgccttgt cttccttcca acctagagcc tgcagggaaa 961 taagcaggag tgaggttggg gctcagggga agaccaggag cagggactca gaaaggaggg 1021 ctggtatctt cttgaaattg tgtgtatagc aacattatat aaatgaatta tctactgagc 1081 accacagcac ttcaccccat ggtgtggtga gcaggatgga gatgagactt aggactgtag 1141 gttctgctta agagtttaag ttgggatctt ccagccttga ccaatgagac ttgacttggg 1201 agactccagg cttcattcca ctaccccaaa tgccctctag tctccaaata aacagatcct 1261 gaatctccag gcctcacatg gccttgatct cttatcattg ccccccagga ccagtccccc 1321 cttgccctca aggacatgga gtgagaccag cctgcctctc tactccctca atttctctct 1381 ctttgccgct aagcaaaaga gtggcccacc ccatttgggg tatatttcct cagggagatt 1441 aggagcagtg tcttgagccc ctcaagggca tttttctatt ggcctcctga ggtttgggcc 1501 cagcctgctt ccagcgtcac ctgtgcccag tgagtgcagc attgcttggg tatgggctgg 1561 ggggaaacac gacagtgtgg ggtccatcct aggccccctt ttctcagctg atttcttaga 1621 ataagctgcc tttagagata accaaaacta tttatcactc ttccatttta cctactctcc 1681 ttttcagaaa ctggggggaa accgaaggtt gttaaaatac agctaaagtt ggtgggtatg 1741 tgcacagttt gacttgccct ctccgatgtc atttgtcagc tcagaggaac aaggtgggag 1801 agtataggag ctctgactgg gtctcaggaa acaggggccc cttatgccgt tctttggatc 1861 gtgaggatgc tgcctggaat ggagctggaa aacaggatga gacccttcca cccagacatc 1921 tggccaccct cagtgacctc tgaggccatt gtgatgcaca tccatgattc tatgaagcag 1981 ggtcacataa catgcacaca cctgatttct ccactccata accacaacat gtgcctgttt 2041 gtacagggct cttggcctac aatgtccttc ctgctacctc tataattcaa gcttggggtg 2101 gctgctgtca ccttgcttct cctataaaag ccatgaaact tctcaatcag aaaatagatg 2161 aaaaaatcac ccaatccagt gatttttaaa actttttaga ccacaaaacc ttttcttcaa 2221 gcaatatctt ccacagaggc ccaatatgta aaacagaaaa aatgggttga gtagggtaca 2281 agacaccact ctcaaatgca gcaaggcctc cacaatagtc cctgaggccc ccagagctca 2341 gtgtaaaaac cactgatgca gtccaagggc ctcatttaca gaggagggaa cagggggaaa 2401 gtaaaatggc cacagtacac aggaagcaca ggcaaggtta ggttaggatt tgggtgccct 2461 gactctgtgg cctttgtcct 
tggggcttgc tgtgggcatc ctgctctctc tgcaggttgt 2521 cggttcaatg gggacatggg cagggtggag cactaggagg ggctgggttt gcattcccaa 2581 atggcatgtc tccaaatccc tattgggatt tcttccaaat attcctccta tttggagcac 2641 ctttcccgaa taaggcatga aggctgcatg atattggcca agtccctagc cttctctgcc 2701 agtcggcccc cagagatggt gtaagaagat ctgagtgtgc tgctcttcaa tcctggagtt 2761 gaaagtcatc caccagtctt tccaagaggg gttgaagaaa aggaggaagg gtgattgatg 2821 atgagggagg agaaaaagaa gagcccagga gtaccatgga gaaggagaag agaagatgag 2881 gaaagcctac tctcccctcc aagttctgag gggctgtctc ctccttcctt ccctcctcca 2941 tgccctcagc ttgcaggagc agccaatggt atggccttta acaaggggcc cctcctcagc 3001 atctgatgct ctctcctcag ggggacctta ccacccctca gagtgctgct tcacctacac 3061 tacctacaag atcccgcgtc agcggattat ggattactat gagaccaaca gccagtgctc 3121 caagcccgga attgtgtagg tggtacacac acatcacact ggggggagag ggagccagca 3181 gggcctcctg gagggaagca gggagtggtg gtggaatggg gacccccagc gtacctccca 3241 ggtgtgacta catggggaga ggcagctgag gggcaatctg agcgctttct ggctggagcc 3301 tgcaggagcc atggggaaac tgaccccatg gatggggaga tgacagagaa gggagaagaa 3361 ggcaagaggg cacttcctca gggggacaca gagactagat gggtctaggg gtcctaggaa 3421 ccgaagagta tgtctcagag aggagactgg ctctaagctg cctctgtgga agaaaggaaa 3481 agcagtatag gtcaggtggg gaatttagga gggagggaag atgggctgtc tcttccggcc 3541 actgggcccc tcggtttgtg atccttctcc ctcttgctcc acagcttcat caccaaaagg 3601 ggccattccg tctgtaccaa ccccagtgac aagtgggtcc aggactatat caaggacatg 3661 aaggagaact gagtgaccca gaaggggtgg cgaaggcaca gctcagagac ataaagagaa 3721 gatgccaagg ccccctcctc cacccaccgc taactctcag ccccagtcac cctcttggag 3781 cttccctgct ttgaattaaa gaccactcat gctcttccct ggcctcattc ctttctacgg 3841 gatttactca ttggccatgc actgaggaca ccagggtgtg gcaccctcgg catcaagcct 3901 cgctctgcag aagttttggt ggagcctggt acaaaaaata ggtcaggcct gcaatgcagg 3961 tagtgagaag cagaaagtga gaaagaaaag cagtgtaaag accgtctcct cctcagcagc 4021 aacagtagca gaccccg // LOCUS HSHCF1 17760 bp DNA PRI 19-AUG-1996 DEFINITION H.sapiens HCF-1 gene. ACCESSION X79198 NID g558348 KEYWORDS HCF gene; host cell factor. 
SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17760) AUTHORS Frattini,A. TITLE Direct Submission JOURNAL Submitted (10-MAY-1994) A. Frattini, ITBA CNR, Via Ampere 56, 20131 Milan, ITALY REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 17760) AUTHORS Frattini,A. TITLE Direct Submission JOURNAL Submitted (04-OCT-1994) A. Frattini, ITBA CNR, Via Ampere 56, 20131 Milan, ITALY REFERENCE 3 (bases 1 to 17760) AUTHORS Zoppe,M., Frattini,A., Faranda,S. and Vezzoni,P. TITLE The complete sequence of the host cell factor 1 (HCFC1) gene and its promoter: a role for YY1 transcription factor in the regulation of its expression JOURNAL Genomics 34 (1), 85-91 (1996) MEDLINE 96299663 REFERENCE 4 (bases 1 to 17760) AUTHORS Frattini,A., Faranda,S., Redolfi,E., Zucchi,I., Villa,A., Patrosso,M.C., Strina,D., Susani,L. and Vezzoni,P. TITLE Genomic organization of the human VP16 accessory protein, a housekeeping gene (HCFC1) mapping to Xq28 JOURNAL Genomics 23 (1), 30-35 (1994) MEDLINE 95130107 COMMENT Related sequences: L20010 and D10711. 
FEATURES Location/Qualifiers source 1..17760 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="X.3000.11" /clone_lib="genomic pWE15 cosmid" /clone="430" /chromosome="X" /map="Xq28" exon 101..249 /gene="HCFC1" /number=2 mRNA join(101..249,543..703,1393..1601,2520..2604,3178..3284, 4413..4592,4666..5025,5336..5496,6061..6264,6579..6803, 6942..7046,7290..7509,7763..7905,8060..8198,8412..8632, 9250..10726,11022..11185,11834..12278,12633..12950, 13081..13199,13301..13438,13790..13975,14246..14546, 15171..15234,15399..17227) /gene="HCFC1" /product="host cell factor" gene 101..17227 /gene="HCFC1" CDS join(205..249,543..703,1393..1601,2520..2604,3178..3284, 4413..4592,4666..5025,5336..5496,6061..6264,6579..6803, 6942..7046,7290..7509,7763..7905,8060..8198,8412..8632, 9250..10726,11022..11185,11834..12278,12633..12950, 13081..13199,13301..13438,13790..13975,14246..14546, 15171..15234,15399..15438) /gene="HCFC1" /codon_start=1 /product="host cell factor" /db_xref="PID:g558349" /translation="MVEYGKYSNDLYELQASRWEWKRLKAKTPKNGPPPCPRLGHSFS LVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRPGSGVVAWDIPITYGVLPPPRE SHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHS ATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLNLDTMAWETILMDTLEDN IPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLETEKPPPPARVQLVRA NTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSVPANPPKSPAPAAA APAVQPLTQVGITLLPQAAPAPPTTTTIQVLPTVPGSSISVPTAARTQGVPAVLKVTG PQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSSPQMSGMAAL AAAAAATQKIPPSSRPTVLSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVMVSVSNP ATRMLKTAAAQVGTSVSSATNTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKTITL VKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTI LKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIIT QAGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKG APGQPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLA GAGGHSTSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQP VSQPTQVTLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTL VCSNPPCETHETGTTNTATTTVVANLGGHPQPTQVQFVCDRQEAAASLVTSTVGQQNG 
SVVRVCSNPPCETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTNTATTAMSSV GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKPQCQTRQTSATSTTMTVMATGA PCSAGPLLGPSMAREPGGRSPAFVQLAPLSSKVRLSSPSIKDLPAGRHSHAVSTAAMT RSSVGAGEPRMAPVCESLQGGSPSTTVTVTALEALLCPSATVTQVCSNPPCETHETGT TNTATTSNAGSAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQPPAGRPCET HQTTSTGTTMSVSVGALLPDATSSHRTVESGLEVAAAPSVTPQAGTALLAPFPTQRVC SNPPCETHETGTTHTATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTT VSSTLTRAVTTVTQSTPVPGPSVPPPEELQVSPGPRQQLPPRQLLQSASTALMGESAE VLSASQTPELPAAVDLSSTGEPSSGQESAGSAVVATVVVQPPPPTQSEVDQLSLPQEL MAEAQAGTTTLMVTGLTPEELAVTAAAEAAAQAAATEEAQALAIQAVLQAAQQAVMGT GEPMDTSEAAATVTQAELGHLSAEGQEGQATTIPIVLTQQELAALVQQQQLQEAQAQQ QHHHLPTEALAPADSLNDPAIESNCLNELAGTVPSTVALLPSTATESLAPSNTFVAPQ PVVVASPAKLQAAATLTEVANGIESLGVKPDLPPPPSKAPMKKENQWFDVGVIKGTNV MVTHYFLPPDDAVPSDDDLGTVPDYNQLKKQELQPGTAYKFRVAGINACARGPFSEIS AFKTCLPGFPGAPCAIKISKSPDGAHLTWEPPSVTSGKIIEYSVYLAIQSSQAGGELK SSTPAQLAFMRVYCGPSPSCLVQSSSLSNAHIDYTTKPAIIFRIAARNEKGYGPATQV RWLQETSKDSSGTKPANKRPMSSPEMKSAPKKSKADGQ" intron 250..542 /gene="HCFC1" /number=1 exon 543..703 /gene="HCFC1" /number=3 intron 704..1392 /gene="HCFC1" /number=2 exon 1393..1601 /gene="HCFC1" /number=4 intron 1602..2519 /gene="HCFC1" /number=3 exon 2520..2604 /gene="HCFC1" /number=5 intron 2605..3177 /gene="HCFC1" /number=4 exon 3178..3284 /gene="HCFC1" /number=6 intron 3285..4412 /gene="HCFC1" /number=5 exon 4413..4592 /gene="HCFC1" /number=7 intron 4593..4665 /gene="HCFC1" /number=6 exon 4666..5025 /gene="HCFC1" /number=8 intron 5026..5335 /gene="HCFC1" /number=7 exon 5336..5496 /gene="HCFC1" /number=9 intron 5497..6060 /gene="HCFC1" /number=8 exon 6061..6264 /gene="HCFC1" /number=10 intron 6265..6578 /gene="HCFC1" /number=9 exon 6579..6803 /gene="HCFC1" /number=10 intron 6804..6941 /gene="HCFC1" /number=10 exon 6942..7046 /gene="HCFC1" /number=12 intron 7047..7289 /gene="HCFC1" /number=11 exon 7290..7509 /gene="HCFC1" /number=13 intron 7510..7762 /gene="HCFC1" /number=12 exon 7763..7905 /gene="HCFC1" /number=14 intron 7906..8059 
/gene="HCFC1" /number=13 exon 8060..8198 /gene="HCFC1" /number=15 intron 8199..8411 /gene="HCFC1" /number=14 exon 8412..8632 /gene="HCFC1" /number=16 intron 8633..9249 /gene="HCFC1" /number=15 exon 9250..10726 /gene="HCFC1" /number=17 intron 10727..11021 /gene="HCFC1" /number=16 exon 11022..11185 /gene="HCFC1" /number=18 intron 11186..11833 /gene="HCFC1" /number=17 exon 11834..12278 /gene="HCFC1" /number=19 intron 12279..12632 /gene="HCFC1" /number=18 exon 12633..12950 /gene="HCFC1" /number=20 intron 12951..13080 /gene="HCFC1" /number=19 exon 13081..13199 /gene="HCFC1" /number=21 intron 13200..13300 /gene="HCFC1" /number=20 exon 13301..13438 /gene="HCFC1" /number=22 intron 13439..13789 /gene="HCFC1" /number=21 exon 13790..13975 /gene="HCFC1" /number=23 intron 13976..14245 /gene="HCFC1" /number=22 exon 14246..14546 /gene="HCFC1" /number=24 intron 14547..15170 /gene="HCFC1" /number=23 exon 15171..15234 /gene="HCFC1" /number=25 intron 15235..15398 /gene="HCFC1" /number=24 exon 15399..17227 /gene="HCFC1" /number=26 3'UTR 15439..17227 /gene="HCFC1" polyA_signal 17203..17209 /gene="HCFC1" intron 17228..>17760 /number=25 BASE COUNT 3395 a 5476 c 5117 g 3747 t 25 others ORIGIN 1 gctcagcccc tctcaccaag cttgacggct ggcacatttg gggaatgaga ggaatggggg 61 ctcccgggtg ttctgacttc tgcctccttc tgttccccag caaccaacca gtggttcatc 121 ccagccgtga ggggggacat tccccctggg tgtgcagcct atggcttcgt gtgtgacggg 181 actcgcctcc tggtgtttgg tgggatggtg gagtatggga aatacagcaa tgacctctac 241 gaactccagg taaagcagcc ccaggttctc actctgcctc cttggcccca tgtgacctga 301 cggtagtggc ttcagggttg aggcaagcgg gaggtggaag ggacgcttga gagtctgcca 361 aggctgctct gaacttggcc cacctcagaa gaaggcagca aggaccaaca cccaggacct 421 tcttgctcca gcaaggcacc tcgctcaccc ccggccattt cctcctccac cacaagttgc 481 cccgggtccc tagtggtggc cttctcatcc cacggttcag tctccaggtc ttgattttgc 541 aggcgagccg gtgggagtgg aagagactca aagcaaagac gcccaaaaac gggccccctc 601 cgtgtcctcg actcgggcac agcttctccc ttgtgggcaa caaatgctac ctgtttgggg 661 gtctggccaa tgatagcgag gacccaaaga 
acaacattcc aaggtgaggc ttggtccttt 721 tctctggaac cccccgggtg ggtgtgctct cacatgcaca gggctccatg gtgtcctaaa 781 gaatgtgcgc anagagcccc gtggccgggt ccctttcctc gtgctgggtt gtaaatgaga 841 gagcggcacc ggatgccctc tcaggagctt tcgggctcca aaatcgctac tgaccctgag 901 caggcctcgt gggaggaggg ccaagctgga ggtggaagcc actttgcctt gggctggccc 961 agctctcgga gtcggcttag agctgctcgc gttccttctc agcctccgct gcctacaagt 1021 ttcctgatgg gtattttcca aggcccgtgg caaggactct ctgtctggag gtagtcccaa 1081 gccagcgccc cctcttgctc tgcggagccc tcatataata cccagctttg tgtgcatggc 1141 gggggcctgc tgctgggtct ggactcgggg ctgctgtgca gcctcaccca ccctgtgtcc 1201 cctgttccag ttggactcat cctagggagc ttctggcagc cccttcctca agagctaggg 1261 tagaagaact tcagagctcc cacagcccta gcccctttgt gttctgggga ggctgccact 1321 cactgatagc catccagcga agcctctggc gtgaggagtc gagacttgcc gtaactgatc 1381 tttcctcgtc aggtacctga atgacttata tatcctggaa ttacggccag gctctggagt 1441 ggtagcctgg gacattccca tcacttacgg ggtcctacca ccaccccggg agtcacatac 1501 tgccgtggtc tacaccgaaa aagacaataa gaagtccaag ctggtgatct acggcgggat 1561 gagtggctgc aggctggggg acctgtggac cctagatatt ggtgagcagg gtggtagcac 1621 cccagcagcc atgggacaaa gcgggtctgc ctcttggctg gccccgtggg tggcgctgtt 1681 gtatggtgag gtggggctct tagggagctg gagcatacca cttcgtgctt tcctcttgtc 1741 atggtgggcc agggtgggga accctggatc gtccctaggt ttggggctgt tatctctgcc 1801 ctgtgctggc tgagtgggct gaattggtct gcgggcttat aagagaataa gtcattaagg 1861 ctttctagaa tggaattctc tggatccggg tagcattagt ccctcttgac tcgtcagtgg 1921 tcagactgtg tggagtgctg catcctggtc tgactgctgt ggtaccattg gatgccggta 1981 aaccgtgagg ccttcgggga ctgcttgggc cggtcaccag gtggtggtat caggagtggc 2041 tgggaaagat tactaactgt ggatctccgg aggcctcttc agctgggaga gaggagatgg 2101 cgatgttgtc ctatagatca gagtcggcac tccaaagaag cagtttcccg tgagagaatt 2161 ttccaacagt ttgccaaaat gtctgtagcc taaggtggtt atggcggtga acgtatttca 2221 tggagagttc ggggtgtgcc caccactcct cccccagcac agagtgcagt cctggctcag 2281 ggcaggtgct caggatgtac ctgaaggagc gctattgcta gcctggtcga aaccgcctgg 2341 gtaacacagg aatctagcta ccatgacaac agtcagaggg 
cctcgcgtag gctataggga 2401 agccagggat ggaattctct cactgagaca gatcctacct tcctgagcta gcttgaatcc 2461 agggaacgac ctgtgtgcag tctaccttgt gtaacccttt cccctccgtc tctctgcaga 2521 caccctgacg tggaataagc ccagtctcag cggggtggcg cctcttcctc gcagtctcca 2581 ctcggcaacc accatcggaa ataagtaagt ggctgctcac ctgcgtctct cttctctaga 2641 ctggcgtggc ccctcccccg aagggctctc actcagtttt gtcctgtcag accctaaggc 2701 ctgaggatgc ctctttctgg ctgggctggc atttggcagc cctgtttttc cattacgacc 2761 ctgggggacc aaggaaagac agattcaggg gggagagggc cacacacaag ctgagtctga 2821 gaccctcccc gggcctgtga ggctccgccc tatcaggtac agagtgttct aggtcctctg 2881 tgcaagcacc tgggtgtctt tgtggatgtc caggtcctaa agggaaccta gaaactgtgg 2941 acatgaaggt gagctttaga tgagctgaac ggcactttta tgtggtagag gtgacacagt 3001 tctcctgagg gctgagccac caggcaggag agggccacgt caggaggctt cacgcatcct 3061 tggggactct cctggggaga gccagaggac tgagctctag aactggggta gcagaggtgg 3121 cctctagctt ggcaccaact ctacaatcct ttcccactct tccctcctca ctcccagaat 3181 gtacgtgttt ggtggctggg tgcctctcgt catggatgac gtcaaagtgg ccacacacga 3241 gaaggagtgg aagtgtacca acacgctggc ttgtctcaac ctgggtatgg cctcctcggg 3301 ctccgagtgg cctgtgggag gctgcaggag gctggccctt ctcagcctaa gagctgtgtg 3361 gggtgtgcgg ggagcaagct gggcccccgg ctccctttgg aatccactgg aaggatccag 3421 tgggcagagc tctctgaagc aatgttccgc ctctgagttt tttgaatgag ctctccatcc 3481 aaatggccca acatccaaaa gtgcaaaagc ctctacggtg cagggcctcc ttccctcctc 3541 ccggttctcc ggttctgctt ctcagaggca gccatgtggg ttggtttcct gtccaccctg 3601 caggggctct tcatatctga gcattagcca taggtccctt taaaaacaaa cagtggcagc 3661 ctggcgcacg cagtccgaca cctggccttt ctcatccgcc gtatcttgga gatcactcca 3721 cgtccctgnn ccgcagcagc ctccttcgag ttgcggttgt gctgctgtaa cttgtacagg 3781 cnccgctccg tgagtatcct tncnccatct ccacccgtgt gcaagtgtat cctaggggtg 3841 aaaacctaga agtagggttg ctgtccgatg cggctgaact gccctgcaca gaggctgtnc 3901 ccacgtaggc gcctccagtg gtgccctcac ggaatggtca ggccactctt tgccaagcct 3961 gatgggtggc caagctcccc ctnnnnctca ccttgagccc cagctgctgc gggcctgaaa 4021 ataaaaatgc caggcnccga tggttgaagt ttagatttct tttgcgtgag 
agtgtgtgtt 4081 ttttccttaa gatctttcta tcacgttttc cctaaaccac tttcccattg ccgtctcttt 4141 ctgtgagttg tagtgttctt taacgggcaa ggaggtaaac ctctgactgt gtttctcatg 4201 gttttcttag ttggccgttg gtggtttgta attttatttt ttgtgttgct ttttgccaca 4261 aagggtttct ttgttttgtt gtttttaatg taataatcca ataacttctt gattttaatt 4321 cccatccttg acaacaaaag aacnaagggn aaccaggggt ggtgggtttc ctccatcctc 4381 cttgctgacc tcccttgtct gccctccccc agataccatg gcctgggaga ccatcctgat 4441 ggatacactg gaggacaaca tcccccgtgc tcgggctggc cactgcgcag tcgccatcaa 4501 cacccgcctg tacatttgga gtgggcgtga cggctaccgc aaggcctgga acaaccaggt 4561 ctgctgcaag gacctctggt acctagagac aggtaggccc ggcccagccg actcccctcc 4621 tcccatagcc tcccncccct tgtgaccaac tgcgttctcc cacagaaaag ccaccacccc 4681 cagcccgagt acaactggta cgcgccaaca ccaactccct ggaggtgagc tggggggcag 4741 tggcaacagc cgacagctac cttctccagc tccagaaata tgacattcct gccacggctg 4801 ctactgccac ctcccctaca cccaatccgg tcccatctgt gcctgccaac cctcccaaga 4861 gccctgcccc agcagcagcc gcacctgctg tgcagccgct gacccaagta ggcatcacgc 4921 tcctgcccca ggctgccccc gcacccccga ccaccaccac catccaggtc ttgccaacgg 4981 tgcctggcag ctccatttct gtgcccaccg cagccaggac tcaaggtgag ccaggggacc 5041 agggacgttt gaaagtaggg ggctagcctc cggggagagg cagcggccaa gtaggggctg 5101 cgagacggca gcctagacag caggccagca gccgggagct ttgggcttgt ctccaagggt 5161 tgtaggggga tgctaaagcc tggccccacc tgtggaaatg gtcactctca gggcacagag 5221 aagagccaga ctcccacaca agcaggccaa cagaagtggc tcagggctgc ctgcaaaggg 5281 gctggggaca tcgtgcagaa ggtgatggct gacggctggc cccttctctc atcaggtgtc 5341 cctgctgttc tcaaagtgac cggtcctcag gctacaacag gaactccatt ggtcaccatg 5401 cgacctgcca gccaggctgg gaaagcccct gtcaccgtga cctcccttcc cgccggagtg 5461 cggatggttg tgccaacaca gagtgcccag ggaacggtga ggagtggtca ctggggcggg 5521 tgggatttcc tgttgatcca gtggattaag ttgcaagtgt actgaggggt tcacatccct 5581 ctcgggctct ggtaccacgc accccacacc tttccctcct gggacgtgca gatggggaaa 5641 cgctcctgcg aggtcatggt gggaattaat aggttaatgc ctagaaggcc taacctgctg 5701 cctgatccta agagcatttg ccacgtggac tttcttgtcc tgcagcctga gaggaggatc 
5761 tagccagctg ggcttgccac aggcccatga gaatcaacgg gacctgactc tggtctttag 5821 aaaggatcca gagctcccag ctgcctcttg ccacgggggt tgccgtgcat ggtgggtggt 5881 ggganccctc tgtgtctggg gccaggagag cggggtctgg cagtagagaa agctgagcca 5941 tgctggcggg atgaggctcc caagagcagg gcagcccctg ggcaggttcc ggtgaggtcc 6001 accaagcagg gaggtgggtg gcacctgtga cctggcctca catgcccgtg cctgctgcag 6061 gtgattggca gtagcccaca gatgagtggg atggccgcac tggccgctgc ggccgctgcc 6121 acccagaaga tccccccttc ctcgcgaccc acggtgctga gtgtcccagc gggtaccacc 6181 atcgtgaaga ccatggctgt gacacctggc actaccaccc tcccagccac tgtgaaggtg 6241 gcctcctcgc cagtcatggt gagcgtcttc cggggcttgt cactgtggtc aaacaagcat 6301 agtcattccc tccgggcttt agagttagcc gttggaccgt gggctgcaga ataagctgtc 6361 tccattaggg gcgtcatggc ctcaggtgag agctgcaggc tcagcctcct cccttgcctg 6421 ttaaaggaag ggggaaccaa gctgttcttg ggcttgggag ccaggaggag gaggaaggtg 6481 agtgttgagg caggagctat gggcccaccg tggacatctt gggtggtttc cccatcgctt 6541 tctttgcctg gcctgtcact cgtctctgcc tcctccaggt gagcaaccct gccactcgca 6601 tgctgaagac tgcagccgcc caggtgggga catcggtttc ctccgccacc aacacgtcta 6661 cccgccctat catcacagtg cacaagtcag gcactgtgac agtggcccag caagcccagg 6721 tggtgaccac agttgtgggc ggggtcacca agaccatcac cctggtgaag agccccatct 6781 ctgtcccagg aggcagtgct ctggtgagtg atgggtgctg ccaggctacg gccactgcca 6841 gtggtgaatg gccgccctgg gccagctcca ggagagtctc agccaggcca tgcccttgtt 6901 tggtttccag atcactcctt gctccatccc ttccttttta gatttccaat ctgggcaaag 6961 tgatgtcggt ggtccagacc aaaccagttc agacttcagc agtcacaggc caggcgtcca 7021 cgggtcctgt gactcagatc atccaggtga gctctcagtc tttgcacata cgcaagagtg 7081 ggggcctggg tgccaggcca caggaagaac tcatgtccct gcccatggaa gatcccatga 7141 gcacacatgg ctcagcaagt tctgctgctc actcccctcg cccacagcct ccagaacatt 7201 ccctgagcgc caaggctgcc ggcggagagt agccagacca cacttccctg gcagtgatgc 7261 cggctcatgc acctgcttcc ccctttcaga ccaaagggcc cctgccagcg ggaacaatcc 7321 tgaagctggt gacctcagca gatggcaagc ccaccaccat catcactacc acgcaggcca 7381 gtggggcggg gaccaagccc accatcctgg gcatcagcag cgtctccccc agtaccacca 7441 
agcccggcac gaccaccatc atcaaaacca tccccatgtc ggccatcatc acccaggcgg 7501 gcgccacggg taggggcctc ccccgacata tgagtgtctg caagtcctca ctggagggga 7561 aatggagtcc gggttataag gcgtgctgtg tttctgttta gtttaccctt ttctatcttc 7621 tccttgctgc cttttgtgtc taagggtagt gaggagatgg gcagggctat ggctgtgact 7681 tgtggatgtt cattatagtc taggggcagc aggggcagcc ctcacagagc ctgtctcccc 7741 tgggccccac tctcccatcc aggtgtgacc agcagtcctg gcatcaagtc acccatcacc 7801 atcatcacca ccaaggtgat gacttcagga actggagcac ctgcgaaaat catcactgct 7861 gtccccaaaa ttgccactgg ccacgggcag cagggagtga cccaggtgag gcactcccag 7921 ccgtctctca gcccagtgtc ctagtgcagt ccaccccagc acgccacgca ggtcctcccg 7981 acaggcctgg gaggggcagc atcggtggca tctaccagcc tggtggtgac gttggtggca 8041 ctttcctctg gtgttccagg tggtgcttaa gggggccccg ggacagccag gcaccatcct 8101 ccgcactgtg cccatggggg gtgttcgcct ggtcacaccc gtcaccgtct ccgccgtcaa 8161 gccagccgtc accacgttgg ttgtgaaagg caccacaggt gtgtacccat aggccaggtc 8221 ccctgccacc cctagattat aacgaagcag ggctggcctt caatccccat ggcacagctg 8281 ggtggcacag cagctcctct ctcccgagcc ttctatgcaa gcagctccag agtgccccag 8341 cctgtgcagg aggagctgct gggaaggaga cagcagcagc cttggcccaa ctcttgcctc 8401 ctttccttca ggtgtcacga ccctaggcac agtgacaggc accgtctcca ccagccttgc 8461 cggggcgggg ggccacagca ctagtgcttc cctggccacg cccatcacca ccttgggcac 8521 cattgccacc ctctcaagcc aggtgatcaa ccccactgcc atcactgtgt cggccgcaca 8581 gaccacgctg acagcggcag gcgggctcac aaccccgacc atcaccatgc aggtagggca 8641 ggcctgaggc cacgtgtggg cgagggaccc cacacaagtg gaagcacgtg aggctcacgc 8701 cgtccgtcca caggcctttc cttcctggtg ttgcgctgct gccgggtgca gccatggcag 8761 gcctttcctc acctggggca ttttccctgg tgtcacatgg tccctgtgtc atagtccgtg 8821 gtttggctgt catggtttgg ccatgcctgg tgtttgaaca ccgggccgca gctcgtctgc 8881 atgtgtgtgc agtgttctgt tcatacatat taagaaaacg gtgaaacgtt tgttgttggc 8941 ctttcagcat tgagggggat ttccattctc tgacaaagga cttgacttcg gccagcctcc 9001 tgccttcccc ttgccctgtc ctcacagcct cttcctcatc ccagttgcct ttgaggcaaa 9061 gtggtgcctc ttccctggcc tctgagcaag tcagtgtctt gggggcaggg gtgagtgggt 9121 cttacgtctg 
ccagggccgt tgagaagtgg tcagtggtcc cagcctgtgt ggtgacagtg 9181 caggacagtg ggagggacag gcagacagcc agctgcagca ctaactctcc tcctcgtgtg 9241 tctccccagc ccgtgtccca gcccacccag gtaactctga tcacggcacc tagtggggtg 9301 gaggcccagc ctgtgcatga cctccctgtg tccattctgg cctccccgac tacagaacag 9361 cccaccgcca cagttaccat cgccgactca ggccagggtg atgtgcagcc tggcactgtc 9421 accttggtgt gctccaaccc accctgtgag acccacgaga ctggcaccac caacacggcc 9481 accactactg ttgtggctaa ccttggggga cacccccagc ccacccaagt gcagttcgtc 9541 tgtgacagac aggaggcagc tgcttctctt gtgacctcga ctgtgggcca gcagaatggt 9601 agcgtggtcc gagtctgttc gaacccgccc tgcgagaccc acgagacggg caccaccaac 9661 accgccacca ccgccacctc caacatggcc gggcagcatg gctgctcaaa cccaccctgc 9721 gagacccacg agacgggcac caccaacact gccactacag ccatgtcgag cgtcggcgcc 9781 aaccaccagc gagatgcccg tcgggcctgt gcagctggca cccctgccgt gatccggatc 9841 agtgtggcca ctggggcgct ggaggcagcc cagggctcta agccccagtg ccaaacccgc 9901 cagaccagcg cgaccagcac caccatgact gtgatggcca ccggggcccc gtgctcggcc 9961 ggcccactcc ttgggccgag catggcacgg gagcccgggg gccgcagccc tgcttttgtg 10021 cagttggccc ctctgagcag caaagtcagg ctgagcagcc caagcattaa ggaccttcct 10081 gcggggcgcc acagccatgc ggtcagcacc gctgccatga cccgttccag cgtgggtgct 10141 ggggagcccc gcatggcacc tgtgtgcgag agcctccagg gtggctcgcc cagcaccaca 10201 gtgactgtga cagccctgga ggcactgctg tgcccctcgg ccaccgtgac ccaagtctgc 10261 tccaacccac catgtgagac ccacgagaca ggcaccacca acaccgccac tacctcgaat 10321 gcaggcagcg cccagagggt gtgctccaac ccgccatgcg agacccacga gacgggcacc 10381 acccacacgg ccaccaccgc tacttcaaac gggggcacgg gccagcccga gggtgggcag 10441 cagccccctg ctggtcgccc ctgtgagaca caccagacca cttccactgg caccaccatg 10501 tcggtcagcg tgggtgccct gcttcccgac gccacttctt cccacaggac cgtggagtct 10561 ggcctagagg tggcggcggc acccagcgtc accccccagg ctggcaccgc gctgctggct 10621 cctttcccaa cacagagggt gtgctccaac cccccctgtg agacccacga gacgggcacc 10681 actcacacgg ccaccactgt cacttccaac atgagttcaa accaaggtaa gtgaaggtgg 10741 ccagcctggc tttcatggtc tcccagcccc ccaggntgaa aggnnaagac aacgatggcg 10801 
ggggtccgtc cagagtccac cagtcaggct ccttggnacc caatttacca gtggggagag 10861 ctggggngtc ctggctgtac ctgaagcaga cagaggtctc caggagcctg tcctgcctta 10921 tctggccctc cccttggtgg tagaggaagc ctgtggtggt caagccaagt acaggccggg 10981 ggggctcctg ctgagcagcc gcctgcctgt cgaccccaca gaccccccac ctgctgccag 11041 cgatcaggga gaggtggaga gcacccaggg cgacagcgtg aacatcacca gctccagtgc 11101 catcacgaca accgtgtcct ccacactgac gcgggctgtg accaccgtga cgcagtccac 11161 accggtcccg ggcccctctg tgccggtaag agcccagggc ggcgtgttct tccccgcctt 11221 cccggcctac gcccgtgcag ttccacactc agacgccaga tgtcgctctt gtaccaggca 11281 tggggccgct cctggtccct tttccggacc ataaaagggc agcctcggtg cctgctgcgg 11341 caggaggcag gagcctcacc gtcccccgca tctctggggc gcttttattt ctgggactgg 11401 ttctcatcgc gtgtcacttc accgcagcct gtataagaca tggcgggggt ctgagtgccc 11461 atttgtccag gccagggtga ctgaggacgg gttaaacctg accgggggtc tcaagggcct 11521 ttctcctcat gctggtcaga gaaccactgt cccctcccta tttaacatgg gctggagctg 11581 gcccatcaag aatgaaaact taggcccaca aggaaaggtg agggctggct tgcttctcag 11641 ccctgctgcc tggctctggc ccctctcctc ccctctgaca ggggcttctc tccctttcta 11701 gaagatctca tcaatgactg agactgcccc aagggctctg actactgaag tccccatccc 11761 ggccaaaata acagtgacca tagccaacac agaaacttct gacatgccct tctctgctgt 11821 tgacatcctg cagcccccag aggaactcca ggtgtcgcca ggtcctcgcc agcagctgcc 11881 accacggcag cttctgcagt cggcttccac agccctgatg ggggagtccg ccgaggtcct 11941 gtcagcctcc cagacccctg agctcccggc cgccgtggat ctgagcagca caggggagcc 12001 atcttcgggc caggagtctg ccggctctgc ggtggtggcc actgtggtgg tccagccacc 12061 cccacccaca cagtccgaag tagaccagtt atcacttccc caagagctaa tggccgaggc 12121 ccaagctggc accaccaccc tcatggtaac ggggctcacc cccgaggagc tggcagtgac 12181 ggctgctgca gaagcagctg cccaggccgc agccacggag gaagcccagg ccctggccat 12241 ccaggcggtg ctccaggccg cgcagcaggc cgtcatgggt gagtgccggt gcaaccctgg 12301 ggaagccggg ccgctcgggg gagcaatagg ccgcggagac agcctgggaa gaggagccca 12361 tttccctcga gggtggcatg agccagatgt gatggggagg acgagcacag gggaccccag 12421 gcaggccaag tagtagagac tgccgtgtag aacaggctgg ctggggttag 
cgcctgggaa 12481 acaggcaggg cccctcggcm rggcttagcc tacagtggag cacggtagga ggtggacggg 12541 gtctctagga caggctttgg gggcctccgc ccgggctgcc cgcttctggc cagccttcct 12601 agtaagtagt gtttctctcg accctggagc aggcaccggc gagcccatgg acacctccga 12661 ggcagcagca accgtgactc aggcggagct ggggcacctg tcggccgagg gtcaggaggg 12721 ccaggccacc accataccca ttgtgctgac acagcaggag ctggctgccc tggtgcagca 12781 gcagcagctg caggaggccc aggcccagca gcagcatcac cacctcccca ctgaggccct 12841 ggcccctgcc gacagtctca acgacccagc cattgagagc aattgcctca atgagctggc 12901 cggcacggtc cccagcactg tggcgctgct gccctcaacg gccactgaga gtaagcgact 12961 gagggcgaat tggtgctggg gtggtggggg cacccaggga ggtgagggca ggacgggagg 13021 ctcggtctca cttgggagcc aggtttcccg agtctttccc gctgacacct tctttcctag 13081 gcctggctcc atccaacaca tttgtggccc cccagccggt tgtggtggcc agcccagcca 13141 agctgcaggc tgcagctacc ctgaccgaag tggccaatgg catcgagtcc ctgggtgtgg 13201 tgagtcgggt gtgttggggc tcctgagtcc gtggcctggg gggcagcact gcctctaggt 13261 gctgagggcc tgatctcgtc gcaccgattc tgtcttgcag aagccagacc tgccgccccc 13321 acccagcaaa gcccccatga agaaggaaaa ccagtggttt gatgtgggag tcattaaggg 13381 caccaatgta atggtgacac actatttcct gccaccagat gatgctgtcc catcagacgt 13441 aagtgtcccc aggtgctgat gtcctcaggt gggtgggtct tggctccacc accctgagcg 13501 gggcttacag gaaactcctc catctttggg ccaagctgag tagccaggaa gcctgaggaa 13561 ggtgggcaga gaccatcccc cctgaagcag tagaaaagga gagctttggt tccttcccgc 13621 cagtaccggg ccacagtcag cctcgccacg tccctccttg ccgtgagggg ccagccaccg 13681 ctgacctctt ttccctctgg cccatagtta ccatggacct gcagtgttcc tggggtggcc 13741 cagagctgcc ttctccacag tggcgttcat gccactcctc ttttcctagg atgatttggg 13801 caccgtccct gactataacc agctgaagaa gcaggagctg cagccaggca cagcctataa 13861 gtttcgtgtt gccggaatca atgcctgtgc gcgggggccc ttcagcgaaa tctcagcctt 13921 taagacgtgc ctgcctggtt tcccaggggc cccttgtgcc attaaaatca gcaaagtgag 13981 tcttgctgtg gggtggccag tctgtccctg gttggtttct gcctgcctca gcacatgagg 14041 tggccggggg ggggcatagg agagctaggt tttgagtccc agtttgtctt agtccttcca 14101 cttgggggaa gatctctttg aagctccttg 
aactaccaca tggaatatct ctactgtgtt 14161 agaaaatggc ttatgggaac catgattagt cagttctgga agcctctagc ctgttctgtt 14221 gaccttctga ccccctggct tctagagtcc ggatggtgct cacctcacct gggagccacc 14281 ctctgtgacc tccggcaaga ttatcgagta ctccgtgtac ctggccatcc agagctcaca 14341 ggctgggggc gagctcaaga gctccacccc ggcccagctg gccttcatgc gggtgtactg 14401 cgggcccagc ccctcctgcc tggtgcagtc ctccagcctt tccaacgccc acatcgacta 14461 caccaccaag cccgccatca tcttccgcat cgccgcccgc aatgagaagg gctatggccc 14521 ggccacacaa gtgaggtggc tgcagggtga gtgtccttcg tggagctctt caggcacagg 14581 gaaggcccag gaagccgggt gcagcaggag gaactgtcac catctgagat tttttcccat 14641 gtgcggcgtc tccctgcttc ctttggcgcg cacacagccc cccacgggtg gccatcaagt 14701 agagcagggg acccagctcc gggcctgggg gtgcagggga cgccacccca ggattgtttg 14761 aggggacaag cagtgagagg gctggcggga aagtgtaggt ttccaggccc gaaaacagca 14821 cgtcccaggt ccctggagtg cagcaggtga gggatggagg ccagcctgcc tcctgtcagg 14881 tcatgctcct gctccagctc ccctggggct ctgggacctg ggagggcagt gagaggtggg 14941 agctgagctg agtgggtttc taacctacgt ccagagggga gtgagtggtg tcgtgttctg 15001 gtcccaagga gtggcgtaaa aggtgaccac ccataggcag gcgccccacc cttgccagct 15061 gcgggtcatg ccatgtggca ccaagagact ctgcaggcgg gcagggcgca tgtgttcctt 15121 gctctcgttc tgtcacaatg ctcgtcacgc atctctcccg ttccttccag aaaccagtaa 15181 agacagctct ggcaccaagc cagccaacaa gcggcccatg tcctctccag aaatgtaagc 15241 aggaagcccc tctcgggaac cttctgagac taggggtgga aatagctggg ttcagcctct 15301 tgtcaaatga ctgggggccc ctaaccggcc ctgcaccagc assgtccagt ggtgcagggc 15361 ccacactaac tttctctttc ctttttctgc cttccaagga aatctgctcc aaagaaatct 15421 aaggccgatg gtcagtgaga ggaagctgac tagcccctgg attcttctcc agacccccct 15481 gcttcaggaa cacccgccag ggcccacccc tcccaccccg tcccagcatt cgcacttcac 15541 cctcgcgagc cgctgttcac tcctctcccc tttctctttc tctctgtttt taaaataatc 15601 taaagaaagc acattttacc attgctgttg ggaggaagca gaggcagatg ggaaagcaga 15661 gagaggagcg cgcttccttt cctccccgct gccgcccacc ctggggagag acttttgcgg 15721 ggagggaagg cggacgtgag gacagccagc tccccctccc aaggctgtgc gttcctgagg 15781 gccaggtcgg 
gggcaggcat ggaggggagg aaaggcgtcc ctcttggccc tccccagagt 15841 ggctttcctg gcaccctggc ctgggtgtct ggttctgttt tcttttcttc cccttgtgtt 15901 tccagtcacc taacttccct tcctcaggct cccccggccc accctgctca gtgaccccac 15961 aggaagctta cacattttct cagaggcctt tgtgctccca cctcttctac cctccccctc 16021 ttctttccca ttttaaaaaa gaaaagaagg aaaaagaaaa aaggggcaag gagccccgcg 16081 gcggcctggg cagcgcctgt gcagacctcc ctgcaggccg cactgccaac tgctgcattt 16141 gttgtgtttt ttaggttgca attggtgaag ttcacacttt cattgtaatt ttagcgtgtg 16201 gggttttgtc ccttttttgt tgttgttagc tgtgtacaga atgtgtaatc ttttttcttt 16261 tctctttttt ttgttttgtt ttgttttgtt tttgtttttt tacttttttc ttcttggcta 16321 attcttggca gggatctttc tggaggaaaa gctggggcca gccagggcag gagaggtgtg 16381 aaatctgcca cgaggggcct gctgtttgcc acccagccca acttcctgtt gctggcccct 16441 gccctctgcc cttttgcctg tcctcaggcc gctggaacaa aggaaggaca gctcattcct 16501 catgggcgat cactccgcat ctatagggtc gagcctaggg gagcttgagg gagggctggg 16561 gcctccttgt cctggatttc cagctctccc catgccccct ccctgagcac caccggcacc 16621 gcctcccaaa cagggctgct ggtttccgca gccactgctc cacctccccc aaatcgtcat 16681 ggaaagggtg gagaatggag gggaaccagg cgtccttgga ggcagcttgg gagggtgact 16741 gtgtagtgtc acccacaagg gaggctaggg caatggagca ggccaccagc agcagctgtg 16801 cagcatggaa ctcaggccag gctccgaggc tgggggatct gcttggagtt ttctgccccc 16861 caccccaaac ttctgtcgag gagcaaggct tgccagcaag tcagaaggat ttgaaccgag 16921 cagccaatct ttccagccct cccctaccga cctctgcctg gagacgcagc agcctgtgtc 16981 ctccagggcc tctggtttgt tgtattatag tatatttcgc tgtggaaaat gtcacgttta 17041 gtcaccttgg agcccactca cctggtcctg ttgttttacc ccatcccttc tctcgcgcgc 17101 ctattgattt gtttctgagg agagtacacc gttcactatt gtagagtaac ccctgtgact 17161 caatattacc atagtgcgat gtcgttttgt gctattttga acaattaaaa gacttttttt 17221 gaaataacca ctaggtgtct cactgtgtcc cggcatcctc tcaggaagcc tgagcgggaa 17281 ggctgagaac ctgcccgagg ctcacgggag ccccctggag ggagacctga gctctggtga 17341 aagcgtagaa ggccccagag tgaggcctct gctgtggcct ccggtctttg gaccaagtcg 17401 gaccagggtc cagcaggttg cttggtagct gccttgcagg gcaagggcca aagagcccct 
17461 tgggcagctg ctgctgactt cctgtgtctg gggcccagat ctccctccct cccctcctgc 17521 actagaggat aggatggggc aagggcctgg caggtcacca gcggcagcct ccttgggtgg 17581 gttgggggga ggtgggctga gtcaagtaca ccacgggtac tgcagaggtg gatcccttcc 17641 tctggcttcc tcttaaatgt gagccatgcc cccaagggaa gaagcaagag cttgtgatcc 17701 agactgacca cctagcagct gggtcggtag gacagctggg atcccgtggg ctgaatttgt // LOCUS HSHFE 12146 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens HFE(HLA-H) gene. ACCESSION Z92910 NID g1890179 KEYWORDS haemochromatosis; HFE gene; HLA-H gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 858) AUTHORS Albig,W., Burmester,N., Bode,C., Doenecke,D. and Drabent,B. TITLE The haemochromatosis candidate gene HFE (HLA-H) of man and mouse is located in syntenic regions within the histone gene cluster JOURNAL Unpublished REFERENCE 2 (bases 1 to 12146) AUTHORS Albig,W. TITLE Direct Submission JOURNAL Submitted (14-MAR-1997) Albig W., Georg-August-Universitaet Goettingen, Biochemie und Molekulare Zellbiologie, Humboldtallee 23, Goettingen, FRG, 37073 FEATURES Location/Qualifiers source 1..12146 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ICRFy901D1223" /clone_lib="ICRF YAC-library" /chromosome="6" /map="6p" exon 1028..1324 /number=1 gene 1249..8035 /gene="HFE(HLA-H)" CDS join(1249..1324,4652..4915,5125..5400,6494..6769, 6928..7041,7995..8035) /gene="HFE(HLA-H)" /function="iron metabolism" /note="haemochromatosis candidate gene" /codon_start=1 /db_xref="PID:e308008" /db_xref="PID:g1890180" /translation="MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGL SLFEALGYVDDQLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDF WTIMENHNHSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEP RAWPTKLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTS SVTTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGE EQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILRKRQG SRGAMGHYVLAERE" 
intron 1325..4651 /gene="HFE(HLA-H)" /number=1 exon 4652..4915 /gene="HFE(HLA-H)" /number=2 intron 4916..5124 /gene="HFE(HLA-H)" /number=2 exon 5125..5400 /gene="HFE(HLA-H)" /number=3 intron 5401..6493 /gene="HFE(HLA-H)" /number=3 exon 6494..6769 /gene="HFE(HLA-H)" /number=4 intron 6770..6927 /gene="HFE(HLA-H)" /number=4 exon 6928..7041 /gene="HFE(HLA-H)" /number=5 intron 7042..7994 /gene="HFE(HLA-H)" /number=5 exon 7995..9050 /number=6 intron 9051..10205 /number=6 exon 10206..10637 /number=7 polyA_signal 10617..10622 BASE COUNT 3383 a 2474 c 2911 g 3378 t ORIGIN 1 ggatccttta accgaggaga ttattatagc cggagctctg aagcagcaat ctcagttctt 61 gtgatagtga gcaaagaact acaaactaac accaaaatgc aagcttaaag caaagtttat 121 tgaagcacaa taatacactc tgagggacag cgggcttatt tctgcgaagt gaactcagca 181 cttctttaca gagctcaagg tgcttttatg gggtttgtgg ggaggagttg aggtttgggc 241 tgtatctgag tgacaggatg atgttatttg attgaagttt atagctatac aatctaaaat 301 taaactgtgc atggtcttac ctataatttg ttaagaaaag cctcccaggg atgggggggc 361 aaaactgtat gtaaattcta ttataatgat ggcatgatga acttggggtg aacttgaaga 421 caggcttttg tgttgttggg catgtgccac cttagggaat ttccacctgt accctccttt 481 ctctttctcc aggatatttt ggccacagac tttatcataa actccatccc ttagggtggc 541 attagggtag tcttgggcct gaatttaggt gggccagtgg ctgtcttagt gacagccttt 601 ccgctctctt ctgtcatccc ctcccaactg ctaatgtcta actacctaac aattacccat 661 taaatcagtg tgtctggggt taggagcagg cctcaatatg tttaatcatt ctccagataa 721 tcccaatact gtaaagtttg tgaaacactt gtcagataat tcaattatga aggctgtgga 781 acgtgtttca gtaggatcta attggttaat gttatgactt aattaatttg aatcaaaaaa 841 caaaatgaaa aagctttata tttctaagtc aaataagaca taagttggtc taaggttgag 901 ataaaatttt taaatgtatg attgaatttt gaaaatcata aatatttaaa tatctaaagt 961 tcagatcaga acattgcgaa gctactttcc ccaatcaaca acaccccttc aggatttaaa 1021 aaccaagggg gacactggat cacctagtgt ttcacaagca ggtaccttct gctgtaggag 1081 agagagaact aaagttctga aagacctgtt gcttttcacc aggaagtttt actgggcatc 1141 tcctgagcct aggcaatagc tgtagggtga cttctggagc catccccgtt tccccgcccc 1201 ccaaaagaag cggagattta 
acggggacgt gcggccagag ctggggaaat gggcccgcga 1261 gccaggccgg cgcttctcct cctgatgctt ttgcagaccg cggtcctgca ggggcgcttg 1321 ctgcgtgagt ccgagggctg cgggcgaact aggggcgcgg cgggggtgga aaaatcgaaa 1381 ctagcttttt ctttgcgctt gggagtttgc taactttgga ggacctgctc aacccaatcc 1441 gcaagcccct ctccctactt tctgcgtcca gaccccgtga gggagtgcct accactgaac 1501 tgcagatagg ggtccctcgc cccaggacct gccccctccc ccggctgtcc cggctctgcg 1561 gagtgacttt tggaaccgcc cactcccttc ccccaactag aatgctttta aataaatctc 1621 gtagttcctc acttgagctg agctaagcct ggggctcctt gaacctggaa ctcgggttta 1681 tttccaatgt cagctgtgca gttttttccc cagtcatctc caaacaggaa gttcttccct 1741 gagtgcttgc cgagaaggct gagcaaaccc acagcaggat ccgcacgggg tttccacctc 1801 agaacgaatg cgttgggcgg tgggggcgcg aaagagtggc gttggggatc tgaattcttc 1861 accattccac ccacttttgg tgagacctgg ggtggaggtc tctagggtgg gaggctcctg 1921 agagaggcct acctcgggcc tttccccact cttggcaatt gttcttttgc ctggaaaatt 1981 aagtatatgt tagttttgaa cgtttgaact gaacaattct cttttcggct aggctttatt 2041 gatttgcaat gtgctgtgta attaagaggc ctctctacaa agtactgata atgaacatgt 2101 aagcaatgca ctcacttcta agttacattc atatctgatc ttatttgatt ttcactaggc 2161 atagggaggt aggagctaat aatacgttta ttttactaga agttaactgg aattcagatt 2221 atataactct tttcaggtta caaagaacat aaataatctg gttttctgat gttatttcaa 2281 gtactacagc tgcttctaat cttagttgac agtgattttg ccctgtagtg tagcacagtg 2341 ttctgtgggt cacacgccgg cctcagcaca gcactttgag ttttggtact acgtgtatcc 2401 acattttaca catgacaaga atgaggcatg gcacggcctg cttcctggca aatttattca 2461 atggtacacg gggctttggt ggcagagctc atgtctccac ttcatagcta tgattcttaa 2521 acatcacact gcattagagg ttgaataata aaatttcatg ttgagcagaa atattcattg 2581 tttacaagtg taaatgagtc ccagccatgt gttgcactgt tcaagcccca agggagagag 2641 cagggaaaca agtctttacc ctttgatatt ttgcattcta gtgggagaga tgacaataag 2701 caaatgagca gaaagatata caacatcagg aaatcatggg tgttgtgaga agcagagaag 2761 tcagggcaag tcactctggg gctgacactt gagcagagac atgaaggaaa taagaatgat 2821 attgactggg agcagtattt cccaggcaaa ctgagtgggc ctggcaagtt ggattaaaaa 2881 gcgggttttc tcagcactac tcatgtgtgt 
gtgtgtgggg gggggggcgg cgtgggggtg 2941 ggaaggggga ctaccatctg catgtaggat gtctagcagt atcctgtcct ccctactcac 3001 taggtgctag gagcactccc ccagtcttga caaccaaaaa tgtctctaaa ctttgccaca 3061 tgtcacctag tagacaaact cctggttaag aagctcgggt tgaaaaaaat aaacaagtag 3121 tgctggggag tagaggccaa gaagtaggta atgggctcag aagaggagcc acaaacaagg 3181 ttgtgcaggc gcctgtaggc tgtggtgtga attctagcca aggagtaaca gtgatctgtc 3241 acaggctttt aaaagattgc tctggctgct atgtggaaag cagaatgaag ggagcaacag 3301 taaaagcagg gagcccagcc aggaagctgt tacacagtcc aggcaagagg tagtggagtg 3361 ggctgggtgg gaacagaaaa gggagtgaca aaccattgtc tcctgaatat attctgaagg 3421 aagttgctga aggattctat gttgtgtgag agaaagagaa gaattggctg ggtgtagtag 3481 ctcatgccaa ggaggaggcc aaggagagca gattcctgag ctcaggagtt caagaccagc 3541 ctgggcaaca cagcaaaacc ccttctctac aaaaaataca aaaattagct gggtgtggtg 3601 gcatgcacct gtgatcctag ctactcggga ggctgaggtg gagggtattg cttgagccca 3661 ggaagttgag gctgcagtga gccatgactg tgccactgta cttcagccta ggtgacagag 3721 caagaccctg tctcccctga ccccctgaaa aagagaagag ttaaagttga ctttgttctt 3781 tattttaatt ttattggcct gagcagtggg gtaattggca atgccatttc tgagatggtg 3841 aaggcagagg aaagagcagt ttggggtaaa tcaaggatct gcatttggac atgttaagtt 3901 tgagattcca gtcaggcttc caagtggtga ggccacatag gcagttcagt gtaagaattc 3961 aggaccaagg cagggcacgg tggctcactt ctgtaatccc agcactttgg tggctgaggc 4021 aggtagatca tttgaggtca ggagtttgag acaagcttgg ccaacatggt gaaaccccat 4081 gtctactaaa aatacaaaaa ttagcctggt gtggtggcgc acgcctatag tcccaggttt 4141 tcaggaggct taggtaggag aatcccttga acccaggagg tgcaggttgc agtgagctga 4201 gattgtgcca ctgcactcca gcctgggtga tagagtgaga ctctgtctca aaaaaaaaaa 4261 aaaaaaaaaa aaaaaaaaaa aactgaagga attattcctc aggatttggg tctaatttgc 4321 cctgagcacc aactcctgag ttcaactacc atggctagac acaccttaac attttctaga 4381 atccaccagc tttagtggag tctgtctaat catgagtatt ggaataggat ctgggggcag 4441 tgagggggtg gcagccacgt gtggcagaga aaagcacaca aggaaagagc acccaggact 4501 gtcatatgga agaaagacag gactgcaact cacccttcac aaaatgagga ccagacacag 4561 ctgatggtat gagttgatgc aggtgtgtgg agcctcaaca 
tcctgctccc ctcctactac 4621 acatggttaa ggcctgttgc tctgtctcca ggttcacact ctctgcacta cctcttcatg 4681 ggtgcctcag agcaggacct tggtctttcc ttgtttgaag ctttgggcta cgtggatgac 4741 cagctgttcg tgttctatga tcatgagagt cgccgtgtgg agccccgaac tccatgggtt 4801 tccagtagaa tttcaagcca gatgtggctg cagctgagtc agagtctgaa agggtgggat 4861 cacatgttca ctgttgactt ctggactatt atggaaaatc acaaccacag caagggtatg 4921 tggagagggg gcctcacctt cctgaggttg tcagagcttt tcatcttttc atgcatcttg 4981 aaggaaacag ctggaagtct gaggtcttgt gggagcaggg aagagggaag gaatttgctt 5041 cctgagatca tttggtcctt ggggatggtg gaaataggga cctattcctt tggttgcagt 5101 taacaaggct ggggattttt ccagagtccc acaccctgca ggtcatcctg ggctgtgaaa 5161 tgcaagaaga caacagtacc gagggctact ggaagtacgg gtatgatggg caggaccacc 5221 ttgaattctg ccctgacaca ctggattgga gagcagcaga acccagggcc tggcccacca 5281 agctggagtg ggaaaggcac aagattcggg ccaggcagaa cagggcctac ctggagaggg 5341 actgccctgc acagctgcag cagttgctgg agctggggag aggtgttttg gaccaacaag 5401 gtatggtgga aacacacttc tgcccctata ctctagtggc agagtggagg aggttgcagg 5461 gcacggaatc cctggttgga gtttcagagg tggctgaggc tgtgtgcctc tccaaattct 5521 gggaagggac tttctcaatc ctagagtctc taccttataa ttgagatgta tgagacagcc 5581 acaagtcatg ggtttaattt cttttctcca tgcatatggc tcaaagggaa gtgtctatgg 5641 cccttgcttt ttatttaacc aataatcttt tgtatattta tacctgttaa aaattcagaa 5701 atgtcaaggc cgggcacggt ggctcacccc tgtaatccca gcactttggg aggccgaggc 5761 gggtggtcac aaggtcagga gtttgagacc agcctgacca acatggtgaa acccgtctct 5821 aaaaaaatac aaaaattagc tggtcacagt catgcgcacc tgtagtccca gctaattgga 5881 aggctgaggc aggagcatcg cttgaacctg ggaagcggaa gttgcactga gccaagatcg 5941 cgccactgca ctccagccta ggcagcagag tgagactcca tcttaaaaaa aaaaaaaaaa 6001 aaaaagagaa ttcagagatc tcagctatca tatgaatacc aggacaaaat atcaagtgag 6061 gccacttatc agagtagaag aatcctttag gttaaaagtt tctttcatag aacatagcaa 6121 taatcactga agctacctat cttacaagtc cgcttcttat aacaatgcct cctaggttga 6181 cccaggtgaa actgaccatc tgtattcaat cattttcaat gcacataaag ggcaatttta 6241 tctatcagaa caaagaacat gggtaacaga tatgtatatt tacatgtgag 
gagaacaagc 6301 tgatctgact gctctccaag tgacactgtg ttagagtcca atcttaggac acaaaatggt 6361 gtctctcctg tagcttgttt ttttctgaaa agggtatttc cttcctccaa cctatagaag 6421 gaagtgaaag ttccagtctt cctggcaagg gtaaacagat cccctctcct catccttcct 6481 ctttcctgtc aagtgcctcc tttggtgaag gtgacacatc atgtgacctc ttcagtgacc 6541 actctacggt gtcgggcctt gaactactac ccccagaaca tcaccatgaa gtggctgaag 6601 gataagcagc caatggatgc caaggagttc gaacctaaag acgtattgcc caatggggat 6661 gggacctacc agggctggat aaccttggct gtaccccctg gggaagagca gagatatacg 6721 tgccaggtgg agcacccagg cctggatcag cccctcattg tgatctgggg tatgtgactg 6781 atgagagcca ggagctgaga aaatctattg ggggttgaga ggagtgcctg aggaggtaat 6841 tatggcagtg agatgaggat ctgctctttg ttaggggatg ggctgagggt ggcaatcaaa 6901 ggctttaact tgctttttct gttttagagc cctcaccgtc tggcacccta gtcattggag 6961 tcatcagtgg aattgctgtt tttgtcgtca tcttgttcat tggaattttg ttcataatat 7021 taaggaagag gcagggttca agtgagtagg aacaaggggg aagtctctta gtacctctgc 7081 cccagggcac agtgggaaga ggggcagagg ggatctggca tccatgggaa gcatttttct 7141 catttatatt ctttggggac accagcagct ccctgggaga cagaaaataa tggttctccc 7201 cagaatgaaa gtctctaatt caacaaacat cttcagagca cctactattt tgcaagagct 7261 gtttaaggta gtacaggggc tttgaggttg agaagtcact gtggctattc tcagaaccca 7321 aatctggtag ggaatgaaat tgatagcaag taaatgtagt taaagaagac cccatgaggt 7381 cctaaagcag gcaggaagca aatgcttagg gtgtcaaagg aaagaatgat cacattcagc 7441 tggggatcaa gatagccttc tggatcttga aggagaagct ggattccatt aggtgaggtt 7501 gaagatgatg ggaggtctac acagacggag caaccatgcc aagtaggaga gtataaggca 7561 tactgggaga ttagaaataa ttactgtacc ttaaccctga gtttgcttag ctatcactca 7621 ccaattatgc atttctaccc cctgaacatc tgtggtgtag ggaaaagaga atcagaaaga 7681 agccagctca tacagagtcc aagggtcttt tgggatattg ggttatgatc actggggtgt 7741 cattgaagga tcctaagaaa ggaggaccac gatctccctt atatggtgaa tgtgttgtta 7801 agaagttaga tgagaggtga ggagaccagt tagaaagcca ataagcattt ccagatgaga 7861 gataatggtt cttgaaatcc aatagtgccc aggtctaaat tgagatgggt gaatgaggaa 7921 aataaggaag agagaagagg caagatggtg cctaggtttg tgatgcctct ttcctgggtc 
7981 tcttgtctcc acaggaggag ccatggggca ctacgtctta gctgaacgtg agtgacacgc 8041 agcctgcaga ctcactgtgg gaaggagaca aaactagaga ctcaaagagg gagtgcattt 8101 atgagctctt catgtttcag gagagagttg aacctaaaca tagaaattgc ctgacgaact 8161 ccttgatttt agccttctct gttcatttcc tcaaaaagat ttccccattt aggtttctga 8221 gttcctgcat gccggtgatc cctagctgtg acctctcccc tggaactgtc tctcatgaac 8281 ctcaagctgc atctagaggc ttccttcatt tcctccgtca cctcagagac atacacctat 8341 gtcatttcat ttcctatttt tggaagagga ctccttaaat ttgggggact tacatgattc 8401 attttaacat ctgagaaaag ctttgaaccc tgggacgtgg ctagtcataa ccttaccaga 8461 tttttacaca tgtatctatg cattttctgg acccgttcaa cttttccttt gaatcctctc 8521 tctgtgttac ccagtaactc atctgtcacc aagccttggg gattcttcca tctgattgtg 8581 atgtgagttg cacagctatg aaggctgtac actgcacgaa tggaagaggc acctgtccca 8641 gaaaaagcat catggctatc tgtgggtagt atgatgggtg tttttagcag gtaggaggca 8701 aatatcttga aaggggttgt gaagaggtgt tttttctaat tggcatgaag gtgtcataca 8761 gatttgcaaa gtttaatggt gccttcattt gggatgctac tctagtattc cagacctgaa 8821 gaatcacaat aattttctac ctggtctctc cttgttctga taatgaaaat tatgataagg 8881 atgataaaag cacttacttc gtgtccgact cttctgagca cctacttaca tgcattactg 8941 catgcacttc ttacaataat tctatgagat aggtactatt atccccattt cttttttaaa 9001 tgaagaaagt gaagtaggcc gggcacggtg gctcacgcct gtaatcccag cactttggga 9061 ggccaaagcg ggtggatcac gaggtcagga gatcgagacc atcctggcta acatggtgaa 9121 accccatctc taataaaaat acaaaaaatt agctgggcgt ggtggcagac gcctgtagtc 9181 ccagctactc ggaaggctga ggcaggagaa tggcatgaac ccaggaggca gagcttgcag 9241 tgagccgagt ttgcgccact gcactccagc ctaggtgaca gagtgagact ccatctcaaa 9301 aaaataaaaa taaaaataaa aaaatgaaaa aaaaaagaaa gtgaagtata gagtatctca 9361 tagtttgtca gtgatagaaa caggtttcaa actcagtcaa tctgaccgtt tgatacatct 9421 cagacaccac tacattcagt agtttagatg cctagaataa atagagaagg aaggagatgg 9481 ctcttctctt gtctcattgt gtttcttctg aatgagcttg aatcacatga aggggaacag 9541 cagaaaacaa ccaactgatc ctcagctgtc atgtttcctt taaaagtccc tgaaggaagg 9601 tcctggaatg tgactccctt gctcctctgt tgctctcttt ggcattcatt tctttggacc 9661 
ctacgcaagg actgtaattg gtggggacag ctagtggccc tgctgggctt cacacacggt 9721 gtcctcccta ggccagtgcc tctggagtca gaactctggt ggtatttccc tcaatgaagt 9781 ggagtaagct ctctcatttt gagatggtat aatggaagcc accaagtggc ttagaggatg 9841 cccaggtcct tccatggagc cactggggtt ccggtgcaca ttaaaaaaaa aatctaacca 9901 ggacattcag gaattgctag attctgggaa atcagttcac catgttcaaa agagtctttt 9961 tttttttttt gagactctat tgcccaggct ggagtgcaat ggcatgatct cggctcactg 10021 taacctctgc ctcccaggtt caagcgattc tcctgtctca gcctcccaag tagctgggat 10081 tacaggcgtg caccaccatg cccggctaat ttttgtattt ttagtagaga cagggtttca 10141 ccatgttggc caggctggtc tcgaactctc ctgacctcgt gatccgcctg cctcggcctc 10201 ccaaagtgct gagattacag gtgtgagcca ccctgcccag ccgtcaaaag agtcttaata 10261 tatatatcca gatggcatgt gtttacttta tgttactaca tgcacttggc tgcataaatg 10321 tggtacaagc attctgtctt gaagggcagg tgcttcagga taccatatac agctcagaag 10381 tttcttcttt aggcattaaa ttttagcaaa gatatctcat ctcttctttt aaaccatttt 10441 ctttttttgt ggttagaaaa gttatgtaga aaaaagtaaa tgtgatttac gctcattgta 10501 gaaaagctat aaaatgaata caattaaagc tgttatttaa ttagccagtg aaaaactatt 10561 aacaacttgt ctattacctg ttagtattat tgttgcatta aaaatgcata tactttaata 10621 aatgtacatt gtattgtata ctgcatgatt ttattgaagt tcttgttcat cttgtgtata 10681 tacttaatcg ctttgtcatt ttggagacat ttattttgct tctaatttct ttacattttg 10741 tcttacggaa tattttcatt caactgtggt agccgaatta atcgtgtttc ttcactctag 10801 ggacattgtc gtctaagttg taagacattg gttattttac cagcaaacca ttctgaaagc 10861 atatgacaaa ttatttctct cttaatatct tactatactg aaagcagact gctataaggc 10921 ttcacttact cttctacctc ataaggaata tgttacaatt aatttattag gtaagcattt 10981 gttttatatt ggttttattt cacctgggct gagatttcaa gaaacacccc agtcttcaca 11041 gtaacacatt tcactaacac atttactaaa catcagcaac tgtggcctgt taattttttt 11101 aatagaaatt ttaagtcctc attttctttc ggtgtttttt aagcttaatt tttctggctt 11161 tattcataaa ttcttaaggt caactacatt tgaaaaatca aagacctgca ttttaaattc 11221 ttattcacct ctggcaaaac cattcacaaa ccatggtagt aaagagaagg gtgacacctg 11281 gtggccatag gtaaatgtac cacggtggtc cggtgaccag agatgcagcg 
ctgagggttt 11341 tcctgaaggt aaaggaataa agaatgggtg gaggggcgtg cactggaaat cacttgtaga 11401 gaaaagcccc tgaaaatttg agaaaacaaa caagaaacta cttaccagct atttgaattg 11461 ctggaatcac aggccattgc tgagctgcct gaactgggaa cacaacagaa ggaaaacaaa 11521 ccactctgat aatcattgag tcaagtacag caggtgattg aggactgctg agaggtacag 11581 gccaaaattc ttatgttgta ttataataat gtcatcttat aatactgtca gtattttata 11641 aaacattctt cacaaactca cacacattta aaaacaaaac actgtctcta aaatccccaa 11701 atttttcata aactcagttt taaactaact ttttttcaaa ccacaatctg atttaacaat 11761 gactatcatt taaatatttc tgactttcaa attaaagatt ttcacatgca ggctgatatt 11821 tgtaattgtg attctctctg taggctttgg gtataatgtg ttcttttcct tttttgcatc 11881 agcgattaac ttctacactc taacatgtag aatgttacta caatattaaa gtattttgta 11941 tgacaatttt atttgaaagc ctaggatgcg ttgacatcct gcatgcattt attacttgat 12001 atgcatgcat tctggtatct caagcattct atttctgagt aattgtttaa ggtgtagaag 12061 agatagatat ggtggatttg gagttgatac ttatatattt tctatttctt ggatggatga 12121 atttgtacat taaaagtttt ccatgg // LOCUS HSHH3X3B 3276 bp DNA PRI 14-MAY-1996 DEFINITION H.sapiens hH3.3B gene for histone H3.3. ACCESSION Z48950 NID g761715 KEYWORDS histone H3.3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 261 to 3276) AUTHORS Albig,W., Bramlage,B., Gruber,K., Klobeck,H.G., Kunz,J. and Doenecke,D. TITLE The human replacement histone H3.3B gene (H3F3B) JOURNAL Genomics 30 (2), 264-272 (1995) MEDLINE 96163879 REFERENCE 2 (bases 1 to 3276) AUTHORS Albig,W. 
TITLE Direct Submission JOURNAL Submitted (31-MAR-1995) Albig W., Georg-August-Universitaet Goettingen, Biochemie und Molekulare Zellbiologie, Humboldtallee 23, Goettingen, D, 37073 FEATURES Location/Qualifiers source 1..3276 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /cell_line="GM2132" CAAT_signal 626..630 TATA_signal 696..700 prim_transcript 741..3043 exon 741..847 /number=1 5'UTR join(741..847,1320..1329) intron 848..1319 /number=1 exon 1320..1457 /number=2 gene 1330..1909 /gene="hH3.3B" CDS join(1330..1457,1541..1694,1781..1909) /gene="hH3.3B" /function="chromatin condensation" /codon_start=1 /product="histone H3.3" /db_xref="PID:g761716" /db_xref="SWISS-PROT:P06351" /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRP GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLV GLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA" intron 1458..1540 /gene="hH3.3B" /number=2 exon 1541..1694 /gene="hH3.3B" /number=3 intron 1695..1780 /gene="hH3.3B" /number=4 exon 1781..3043 /number=5 3'UTR 1910..3043 polyA_signal 2233..2238 polyA_signal 2463..2468 polyA_signal 3027..3032 polyA_site 3043 BASE COUNT 803 a 710 c 849 g 914 t ORIGIN 1 catatggaaa tcaaaaaata ttttgtgaca ccttaaaaac atacaaaaca gttcctctaa 61 aataatttat taacatgttt atttgtattt attatttacc ccttcccacc ccaccaagag 121 acctgctttt tgttaaaaat gtatgtgcat gatatatgaa atacttgata atataacatg 181 atttactgaa ttctaattct cacactacgc acttaactat caaaatatta tgaaaggtaa 241 ataatccagg gaacacattt tattttacat cggaaaatat ttacaaaaat tttgttaaag 301 tagcctctct gggtgcatag cattcgcttg atgcctgtga gtttagactc ctgctcattc 361 tgatagagac caatgcaatt catttgcatt gactttctat tgcagagctt tggaaccaag 421 ggaaggatcc aaacatagct ccattttatc caatcagaag ggccgcttag tgctaaactc 481 tgattcgtcc attcctcttt gcatccatcc aatcattagg ccactttaac agccaatcag 541 cggcgtagat atgcaccaat catggaacag tgccgaagca cggcctttgt gtcgggggat 601 gtcagcgcgt cggcgaaagc gcccaccaat agaaaaagtc gttggtgtat gcaaataagg 661 gttctatgac gcagagacgc agcgtgaagc gtgcctataa aaacggaggc gacgcggggg 
721 cttggagcgc agagcggttt ggtcgttcgt tgggcggtgc tggtttttcg ctcgtcgact 781 gcggctcttc ctcgggcagc ggaagcggcg cggcggtcgg agaagtggcc taaaacttcg 841 gcgttgggta ggcgttcgta ggtttacccg cggcttcagg ttctgccaac cgttggcgcc 901 gcgcgccgcg gccgttggga gccgagcggc gggggctgct cgggccgctg gcgtcgggtc 961 gggtatgggg ccgggaggtt tccccttggg gctttgttcc ggaactgtgt ccccgccttt 1021 tcccggcggt gattcagagg tcccgacgcc gggttccggg cagtcgggcc tccgcccgtc 1081 gaggaaggga agtgactcct ccccgcgctc cccctcccgg gggaggctgc cgctcctcgc 1141 agcctgagtc attaggggag ggggaggagg ttggcggccc tcggccatct gctgccgcga 1201 ggtggggcgc ggggaggcgg gaggcccggc ggtggggcca ggatctgttc cctccccccg 1261 cagccctggg gggcggcccg ggtgagcgcg gcgccttatc ttcggggcgt ctttcttagg 1321 tgaaagaaaa tggcccgaac caagcagact gctcgtaagt ccaccggtgg gaaagccccc 1381 cgcaaacagc tggccacgaa agccgccagg aaaagcgctc cctctaccgg cggggtgaag 1441 aagcctcatc gctacaggta ggtcgggcgg gggaacaatg gcccggcggt ggccggcttt 1501 gtgcggcagc gtccgctcac tcctcccctg ctcgctgcag gcccgggacc gtggcgcttc 1561 gagagattcg tcgttatcag aagtcgaccg agctgctcat ccggaagctg cccttccaga 1621 ggttggtgag ggagatcgcg caggatttca aaaccgacct gaggtttcag agcgcagcca 1681 tcggtgcgct gcaggtaaga caaaggcctg gagccggggg agggctgggc ggtttccgct 1741 ccccgagtgg gattaatagt gcggctctcg tcctcaacag gaggctagcg aagcgtacct 1801 ggtgggtctg ttcgaagata ccaacctgtg tgccatccac gctaagagag tcaccatcat 1861 gcccaaagac atccagttgg ctcgccggat acggggagag agagcttaag tgaaggcagt 1921 ttttatggcg ttttgtagta aattctgtaa aatactttgg tttaatttgt gacttttttt 1981 gtaagaaatt gtttataata tgttgcattt gtacttaagt cattccatct ttcactcagg 2041 atgaatgcga aaagtgactg ttcacagacc tcagtgatgt gagcactgtt gctcaggagt 2101 gacaagttgc taatatgcag aagggatggg tgatacttct tgcttctcat gatgcatgtt 2161 tctgtatgtt aatgacttgt tgggtagcta ttaaggtact agagttgata aatgtgtaca 2221 gggtcctttt gcaataaaac tggttatgac ttgatccaag tgtttaacaa ttggggctgt 2281 taagtctgac catacatcac tgtgatagaa tgtgggcttt ttcaagggtg aagatacaag 2341 tcttaaccac agtgtaactt acagtttcct ttaaaaaaaa aaaaagtaaa cctggcagct 2401 atagaataca 
ctatgtgcat ttataatagc tattttatat attgtagtat caacattttt 2461 aaattaaatg ttttacattc acaagtggtg gggagtcttg tcattaaggt gtgtgtaatt 2521 tagagtccag ttggttttct tctgactgca cttgttctca tagtagtaaa atgctatgcg 2581 catttatacc ttgcataagt cctcattcta ccacatgtta accctctagc tgataatgca 2641 aacactaact gggggatttt atttataagg gctctagaaa aaacgagtta ttcacaccag 2701 catcatctta actaacattc tgaactagtt agtgcagctt ttcattgtgt tgtgtggttg 2761 gtctcataac taggttgagt ttttctcctc tgctgaggaa acagtaccga agttcttttt 2821 cttgtggcat ttgtattata aaaacttggt gtgggggagg agcacaaaac tccagcccac 2881 tgaacctctg ccaattaaga tggtgttggg ttaggttaca tctggttact gtcctgggaa 2941 aatcattttt atagagatgg ccttccaagt ggttttaaaa tttactgaag tttttaggtc 3001 aattatgtat gttgactaaa tttacaaata aacttgttta tccaactaag tgtccaaaac 3061 ctaaattgaa tgtactaagt tttcacatgt cccattatct aggtccttgt atactaatgt 3121 tttgaactta gatcatttca ggtgttgttt ggtggataaa ggaacctttt atttataaag 3181 atactgtaga aagcatgtga acagctctct gcttgattaa gatgccataa tagtgctgta 3241 tttgcagtgt gggctaagac aaagtatatt aagctt // LOCUS HSHHA2GEN 14117 bp DNA PRI 13-NOV-1996 DEFINITION H.sapiens hHa2 gene. ACCESSION X90761 NID g1668739 KEYWORDS hha2 gene; keratin intermediate filament protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14117) AUTHORS Rogers,M.A., Winter,H., Langbein,L., Krieg,T. and Schweizer,J. TITLE Genomic characterization of the human type I cuticular hair keratin hHa2 and identification of an adjacent novel type I hair keratin gene hHa5 JOURNAL J. Invest. Dermatol. 107 (4), 633-638 (1996) MEDLINE 96420675 REFERENCE 2 (bases 1 to 14117) AUTHORS Schweizer,J. TITLE Direct Submission JOURNAL Submitted (14-AUG-1995) J. Schweizer, German Cancer Research Center, Research Program 2, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG COMMENT Related sequence X81419. 
FEATURES Location/Qualifiers source 1..14117 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="circulating lymphocytes" /clone_lib="lambda DashII" /clone="ghkI2.12, ghkI2.17" /chromosome="17" /map="q12-21" TATA_signal 4506..4511 /note="putative" exon 4520..5048 /number=1 gene 4710..11791 /gene="hHa2" CDS join(4710..5048,5730..5812,5977..6133,7463..7624, 7703..7828,8856..9076,11662..11791) /gene="hHa2" /codon_start=1 /product="HHa2 hair keratin type I intermediate filament" /db_xref="PID:e195474" /db_xref="PID:g1668740" /translation="MACLPSVCLPTTFRPASCLSKTYLSSSCQAASGISGSMGPGSWY SEGAFNGNEKETMQFLNDRLASYLTRVRQLEQENAELESRIQEASHSQVLTMTPDYQS HFRTIDQLQQKILCTKAENARMVVNIDNAKLAADDFRAKYEAELAMRQLVEADINGLR RILDDLTLCKADLEAQVESLKEELMCLKKNHEEEVGSLRCQLGDRLNIEVDAAPPVDL TRVLEEMRCQYEAMVEANRRDVEEWFNMQMEELNQQVATSSEQLQNYQSDIIDLRRTV NTLEIELQAQHSLRDSLENTLTESEARYSSQLAQMQCMITNVEAQLAEIRAELERQNQ EYQVLLDVRARLEGEINTYRSLLESEDCKLPCNPCSTPSCTTCVPSPCVTRTVCVPRT VGMPCSPCPQGRY" intron 5049..5729 /gene="hHa2" /number=1 exon 5730..5812 /gene="hHa2" /number=2 intron 5813..5976 /gene="hHa2" /number=2 exon 5977..6133 /gene="hHa2" /number=3 intron 6134..7462 /gene="hHa2" /number=3 exon 7463..7624 /gene="hHa2" /number=4 intron 7625..7702 /gene="hHa2" /number=4 exon 7703..7828 /gene="hHa2" /number=5 intron 7829..8855 /gene="hHa2" /number=5 exon 8856..9076 /gene="hHa2" /number=6 intron 9077..11661 /gene="hHa2" /number=6 exon 11662..12091 /number=7 BASE COUNT 3568 a 3182 c 3484 g 3883 t ORIGIN 1 tttctttcaa aatatttctg ctcattgaca atgcatctga tcacccaaga gccctgatgg 61 agctgtccaa ggagattaat gtttttgtgt ttgccaacac aacatccatt ccacagccca 121 tggatcaaaa actaattttg actttcaagt cttattacct aagaaatata tttcataagg 181 ctatagctgc catacagatg attcctctga tggatctggg caaagtaaat tgaaaacctt 241 ctgaagagga ttcactattc tatatgtcaa tatttgtgat tcaagagagg aggtcaaaat 301 atcaacatta acaggagttt ggaagaagtt gattccaact ctcatgtatg atgttgaggg 361 gcttaagact tcagcagagg aagtaactat agatgtgttg gaaacagcaa gagaattaaa 421 attagaatgg agcctgaaga 
tgtgactgaa ttgttgccat gttatggcaa aacgaacgga 481 tgaggaactg cttcttattc atgaacaaag aaagtggttt cttgagatgg aaactactcc 541 tggtgaagat gctgtgaaca tcactgaaat ggcaacaaag gatttggaat actacatcaa 601 tttagttaat aaggcagtgg cagggtttga gaggactgac cctagttttg aaagaatttc 661 tactgttaaa atgctatcaa acagcatcac atgctacagg gaaatctttc atgaaaggaa 721 gagtcaactg atgcggcaaa cttcactgat gtctcatttt cagaaattgc cacagccaca 781 ctaacttgca gcaatcatca ccctgatcag tcctcagcca tcaacattga ggcaagacct 841 tccaccagca aaaagattat aacttgctga aggctcagat gatccttagc atttttagca 901 agaaagcatt tttaaaatta agttatatac attgttttta ggccataatg ctatggtaca 961 cttaatagac tatagtatag tgtaaatata atgtttacat acacagaaaa accaaaaagt 1021 ttgtgtgact cactttatgg ttacatttgc tttattgtgg tggtctagaa ctgaacatgc 1081 actatctctg aggtatgcct gtatcttcct ctatcttcag tgggtctcca aaccacctga 1141 aggacatgct aaacacaggc tgctgcatcc cagccccagc ctcagagttt ctgattcagc 1201 cagtctggga tgagcctgaa aaactgccat ttcttttttt tttacatttt atttttattt 1261 ttatcaaagc agtgtatcta cattgtttaa ataaaacaac attaaatagc aaatatttaa 1321 aaactgcaac atctatgcct tctttctttc tttattgtta ttatacttta agttttaggg 1381 tacatgtgca caatgtgcag gttagttaca tatgtataca tgtgccatgc tggtgtgctg 1441 cacccactaa ctcgtcatct agcattaggt atatctccca atgctatctc tcccccgtcc 1501 ccccacccca caacaaatga gaacacatgg acacaggaag gggaacatca cactcttttt 1561 tttaaaaatt ttactttaag ttctgggatg catttgcaga atgtgcaggt ttgttgcata 1621 ggtatacatg tgccatggtg gtttggctgc acctatcacc catcatctag gttttaagcc 1681 cgcatgcatt aggtatttgt tctaatgctc tctctcccct tgccccccat cccccgacag 1741 gccctggtgt gtgatgttcc cctccctgtg tccatgtgtt ctcattgttc aactcccact 1801 tatgagtgag aacatgtggt gttcggtttt ctgttcctgt gttggtttgc taagaatgat 1861 ggtttccagc ttcatccatg tccctgcaaa agacacgaac tcattctttt ttatggctgc 1921 atagtatttc atggtatata tgtgccacat tttctttatc cagtctgtca ttgatgggca 1981 tttgggttgg ttccaagtct ttgctattgt aaatagtgct gcaataaaca tacttgtgca 2041 tgtgtcttta tagtagaaag atttataatc ctttgggtat atacccagta atgggattgc 2101 tgggtcaaat ggtatttctt ggttttagat 
cattgaggaa tcaccatgct gtcttccaca 2161 atggttgaac taatttacac tcccaaccaa cagtgtaaaa gctttcccat ttctccacag 2221 cctttgccag catctgttgt taccagactt gttaatgatc accattctaa ctggcatgat 2281 atggtatctc attgtggttt tcatttgcat ttctctaatg accagtgatg atgagctttt 2341 tttcatatgt ttcttggcca cataaatgtc ttcttttgag aagtgcctgt tcatatcctt 2401 tgcccacttt ttgatggggt tgtttgttat tttcttgtaa aattttgttt aagctccttg 2461 tagattctgg atattagacc tttgtcagat gggtagattg caaaaatttt ctccaattct 2521 ataggttgcc tgctgactct gataacagtt tcttttgctg tgcagaagct ctttagttga 2581 attagaccca tttgtcaact ttggcttttg ttgcaattgc ttttgctgtt ttagtcatga 2641 agtctttgcc catgcctatg tcctgaatgg tattgcctag actttcttct agggtgaaaa 2701 ctcacatttc taacatgttc ctgagtcagg ttgatgctga gagtgactga taacacctta 2761 ttataataat tatagttttt gggtgagagg attaaatggg caaattaatg ccaagcactc 2821 agcaccatgc ctggtatttg tatacattcc acaagtgctg gctatgattc tgaagggtgg 2881 cctgatgagt ctcatccctt gtagttggta ttgccatcac acctccttcc atctgatgct 2941 ataatcttct ctaaataggt aatccagaca aggtactgga atttgtagtt gttgcagaga 3001 atcaaattgt gagagtattt gatctgaccc cctcgattgg catgaagaaa ctgaggtctg 3061 gagagacaaa attactttcc caaggccaaa tggcaagtca gaggcagtat atctacgctc 3121 tccttttctt atgcaatgaa tgagctgggt ggcattttcc ctttcctgtt cttgtactga 3181 catttctggg aatatgtgaa acataaggca agtacttata ccccaaaatt atatcaagaa 3241 gattactgaa taaaaaagtg gctatatagc acacacatac taaatgtgaa gctcacagct 3301 tttgcagcca ggactgaaaa ccactgctct agcatgttgc cttcttaagt gaatgcccag 3361 ggcttctata gttgggcaaa tatgctccct gtctcctggc ttagctcatt ccagggctat 3421 agaacatcct ttcccaaggg agtggattca cactgcttcc atagtctgag atcctgaagt 3481 gagccttccc catctgccac aacagaaagt aaaagtagaa cctgtccaac tattctcagt 3541 ctgttcaaca aacaatttat tgagcacttg acaattgtgt atatggtgca gtgataattg 3601 aatggatggt ccagaggggg caagacagag cagctgccac ctggagatgt tcatgatatg 3661 gacttttcca agaggaagag accaactgga aagatgtacc tgattcttag gcttctgttt 3721 gggtcttttt tttttttttt tttttgcctt ttgaaggggt gcagcattcc tggaggtact 3781 gcaataccag gtcaacatgt agagtgaaca gagcaagctc 
ttattccatc tccctgctcc 3841 caaaatccat ttaatatgtt gtcctcagat ggaggacgta tcagatatta agacgataag 3901 aacagatacc acacttgatc ttagccaaaa ggctgcacaa agaagtgatg ctgcctatgt 3961 cttgagttca ttctctcccc actgatatta ttttcttccc cttggcagga agatgatgtc 4021 tgttaggaag cctctgaggt tcctgttcct ttctgctgga tttatgccgc tgccagcatc 4081 ggagcagttt ctaccaccca cacatttcct tgttaaatag ccaggcctct tcccatgggg 4141 aatgctttca ttaaaagagg cagccactgc tgcacagacc tgcaggcttc tcagggctag 4201 agcaggcggg ggtgcagtgg gcaaagccag tggaggcaca ggctgggttg tggcagtctc 4261 ctggagggcc ctgccctagt aatgagggcc caggcatgcg gctgaccctt tgaagatgtg 4321 tcctgaagct ctctcatggt gatcaatgac aggaacccag actcctgctt tagccaaatg 4381 ataagtttgg cctcttttat tggaaaccaa attacaaatt aattagcagc ttcctctggg 4441 gctgggtgtg aacatcaaca ccacccaatg atgaatttct atcatgagcc ccctcactgc 4501 aagggcataa aatggcccgg gcggcatggg gtctgtagac atccaggtag ctgtggctga 4561 ggagaaaggg cctctccaac atgacatcct cctgctgtgt caccaacaac ttgcaagcct 4621 ctctcaagag ctgcccccgg cctgcctcgg tctgttccag cggcgtgaac tgccggcctg 4681 agctgtgcct gggctatgtc tgccagccca tggcatgcct gccttcggtc tgcctgccca 4741 ccaccttccg gccagccagc tgcctctcca aaacctatct atccagttcc tgccaggcag 4801 ccagtggcat ctccggctcc atgggccccg gcagctggta cagcgaaggg gccttcaatg 4861 gcaatgagaa ggaaaccatg cagttcctta acgaccgcct ggccagctac ctgacgaggg 4921 tgcggcagct ggagcaggag aatgcggagc tggagagcag gatccaagag gcctctcact 4981 cccaggtgct caccatgact cctgactacc agtctcattt caggaccatt gaccagctcc 5041 agcagaaggt gagggcagcg gtgaggtgaa taggctctct gggaagggaa ctggactagc 5101 tggcattcca gattggaatc tcgttagctt attaagctat gttcaggaca aagagacttc 5161 cctagggcat agggttattt tataatttga gcactcagcc tgaggctttc atgtggagag 5221 atctgggatc tagtctcagt tctaccattg cctcattgca tgactttggg ggtcccatcc 5281 cttccccagg cctcagtttt ctcatctgta aaacagggat aataatggtc gttatctaat 5341 ggggttgtca aaaggatttg atgagatgat gcaggtcaag tgcttggaac agcccctggt 5401 ccacggtaag ttttcagtaa atgtcaaaga cccttctaac tgtcacatga gtgacttcag 5461 acatgagatt cttcccttcc acattgcttg gcacatccaa aatggggaat 
ttgaatttac 5521 gaagcttcag gttcttaaaa aatacatctc aagttctcca aggactagca attcgctaaa 5581 tatctcccag agttccaggg aagaggacct tctgcaggga tggctgcagg gctgctggat 5641 cctacctttg ctgctgtctt ctcttcattt gggttcttct tgcttcgtct catcctgaac 5701 taaccctctc catgtgcctt ggcctacaga ttctgtgtac caaggcagag aatgccagga 5761 tggttgtgaa cattgataat gccaaactgg ctgccgatga cttcagggcc aagtgagttc 5821 agtcgggggg ctggagctgg ggaggacctg tcctcatagg ctctggggca actttccatt 5881 agtttcacgg agggttggaa agtgccggca gtttaaggcc ttccctgagt tctgcattct 5941 gtttcaccct tggttgctga ccctgtcctt gtgcaggtac gaggcagagc tggccatgcg 6001 gcagctggtg gaggccgaca tcaatggcct gcgcaggatc ctggatgatc tcactctgtg 6061 caaggctgac ctggaggccc aggttgagtc cctgaaggag gagctgatgt gcctcaaaaa 6121 gaaccatgag gaggtgaggc tgggaagtcc cgctgaagtg gccccgggaa gcagaggggg 6181 aggaacgtgg ggtatggggt tggataggcg tgggttgaaa ttcccaagcc tgccacatgt 6241 tgttttagtg actttgccca atttatggaa tcttcctgag cctcctcttc tgtaaaatgg 6301 ggacaacatg atcacgcagg gttattgtga ggatttaatg gacaggatat ggatcatgga 6361 aatccccaag gcatgggtat gaaacacctg ccacttggtc aactctcaga agtgtagccc 6421 ccttcccttc tgcatttcct gggctagtgt gactgccaag cactcactag tggcaactgc 6481 atttttttct ctcgagagcc acacagcaga ggtagagtgg tgcagtggtg ccggggtgag 6541 ttatgttcca gatctcacgt tgaatggcct gtccatctct gcctggctca actctcagaa 6601 gcagtcccat ctcttctgag agggagtaca gctgcagtgg tctcctcttt ttgcccctat 6661 ccttattttg tcctcccttc tgtttgcata taaaatgctc aagctgaagc cctttacttt 6721 ctgatttttc cttatctcct gaagtttctt ggaggggaag ccctctgctt tgggcacctg 6781 tgtgctgcca agcccacctg agccatggtg tttttccccc ctccctcctt gactctcaac 6841 ctcttgactt gggatttgaa tgaacaagtg cctctgaatc ttggctgggt ttcctcaggg 6901 cttaagtgta aagtaacaat cagtcaccac gtactcaccg agcaccctcg ggctcctgac 6961 tcatcttgct caaatcagag aactgggaac ggcaccaaga agccactaat gagaagttat 7021 cacacatcct ggcagactca gtgacaactt tccctcttgg cagccaagcc tgggaggcag 7081 ctgccctaac tcgggccttt aactagtcaa gccaggctcc tgctaccctc ctccaggatg 7141 aacacaggtg gggagggaga ctgggaatac taggggtacc ggtttcccat tttagcccaa 
7201 atgcatcaaa caaaccaggg tgctgctctt ggcttctgcc aaagtgagag gaagtgttgt 7261 gttgcagtga ggttcccatc gcaggggtat tcagctggag tttgaagagc actgggatgc 7321 tgctgagtgg actgcagtcc ttaggggcca tctaatgttt caatctttag caacttttgt 7381 tgcatctttt catgtctccc tggggaggtg agccaggatc ctatgattgt cacatttttc 7441 tggaggtctt gtgccttttc aggaagtcgg ttcccttcga tgccagcttg gggaccgcct 7501 taacatcgag gtggacgctg cacccccggt ggacctgacc agggtgctgg aggagatgcg 7561 gtgtcagtac gaggccatgg tggaggccaa ccgcagggac gtggaggaat ggttcaatat 7621 gcaggtgggc ctctcacggt ggggatggcc tcctccatat ccctaggaag ggactctagc 7681 cttctccttc ccccaactgc agatggagga gcttaaccaa caggtggcca caagctctga 7741 gcagcttcag aactaccagt cagacatcat tgacctgaga cgcacggtca acacgctgga 7801 gatcgagctg caggcccagc acagcctggt gagagctgct gggtgggcac ccatccctcc 7861 ggatcctagg cggtactgag cataggtgca ggtccccagg aaagaggaag aggaggctca 7921 gatttcagcc accatggatg ctcatcctgt tgacttttcc cggagggagg tttctcccga 7981 gatccagctc agagataaaa aagggatgtt tcaaatcaga catgggttag gtgacactgt 8041 caaactcaac tccactaaga aggcttgttc tgtgcttagc ctgcccttcc aaacctatgg 8101 atctcaatat cacccatcct gatacccagg ttcttttctg gaccaactga accagagtct 8161 ctggaggtgg gacctgatca cagctatttt tttttttttt gagatggagt ctcactctgt 8221 tgcccaggct ggagagcagt ggcatgatct cagctccctg caacctctga cccgctgggt 8281 tcaagtgatt ttccagcctc agcctcccaa gtagctggat tacaggcgtg caccactatg 8341 ccctgctaat gtttgtattt ttagtagaga gggggtttca ccatgttggc caggctggtc 8401 tcgaactcct gacctcaggt gatccacctg ccttggcctc ccagagtgct gggattacag 8461 gcatgagcta ctgtgaccgg ccagccatgg gtattttttg agggctccca tgtggcgcta 8521 atgtgcagct aggtttgaaa acccctgttc taaatgatgc cggcagggag ggtacttggg 8581 aaatctcagt ccaatcctga aggcagacaa aggttgcgga agaaaggagg gatttaggat 8641 cagatttacg aatagaaact gtggttccat aatgtaccag ctgtttaccc ttgaacaagt 8701 catttgacct ttctgggctt ctgtttccaa agtgactggt gtagggaggg cttcatttcc 8761 agcatcaaat ggagatttgg ctcttcttgg ttctttctga agcaggccat ggtaaacagc 8821 tcccttcctc atggttatgt cttcctttgc cttagaggga ctccctggaa aacacgctga 8881 
cggagagtga ggcccgctac agctcccagc tggcccagat gcagtgcatg atcaccaatg 8941 ttgaggccca gctggctgag atccgggctg agctggagcg gcagaaccag gagtaccagg 9001 tgctgctgga cgtccgggcc cggctggagg gcgagatcaa cacgtaccgg agcctgctgg 9061 agagtgagga ctgcaagtat gcaggcccag ctgaggctta gagagacgtg ggcagggatt 9121 ctgggaggtt ataggaagca actggatcta cccttgaggg accatcagct tagaaccctg 9181 tcctgactat ggagccatta agaagctggt atgctctgaa ggaagtcagg cagtggtgtt 9241 catgctgcca tcctgaacca agcccctcgg agaccattct atctcattcc aagctggcaa 9301 gctccttcta agtgcccacc atggggcagg tgctatggag gacaccaaga tagaggaaga 9361 cagggcattt gcctcctgtc atttccatat gtttagggag ataggcagac aggtgactgg 9421 aggtcatggc ctgtccggag cttaggatga aaagcagctt tattaatagt acctacacat 9481 ctgctcccac tcttacccag cctcacctca tagctccatt cctctcagaa cggagatttt 9541 ggcatgtcag caggacaata ggtagccttg tgtaattgag ccctggtggg gagcaggaca 9601 ggaaagatca gccggggccc atttatggag aacaacacgg gtcatactgg gaagggaggg 9661 ctttaattta taggtgagtg gaaattgttt tgaggaggaa atgagagaat ttactgtgtg 9721 tcttctccac tagatggtaa acatctagaa tgcagagaca ttataataca attttattcc 9781 ctgtacgtgg cacatagtag atgctcagta aatgtctctc aagctcaaag ctgttcttca 9841 ggaagatggc cctgatagca gcaggaagaa ctgagtagag ggaggggaac caggcaggga 9901 aactgtccag gaggccctgg ccacagccta ggtaagcaat agggagggcc tgagttaggg 9961 cagcagaggc atcagctcta gacactgtgc aggtagaatc agcaggactt ggtggctgtc 10021 tgaatgcagg gtacctctgg gccatggaca cccggtgggc tgacgactgt tgtagctgtt 10081 tcttattccg catttggcgt tgcttctcca tcattcagaa tctataactt cagggcaagt 10141 gttgtgtcaa acatttgcaa ggaccaggcc attaacatgc atgaatgacg tgggtcatac 10201 tgagatggta gaaaagcaga aagctcttgc cttgtccaat ccaggcaatg gcatgccctc 10261 agggccactc tactgtgtga gaagcaggtc caatattgct gatcttccaa tagttccagg 10321 gaagctgaga atctgggttt ttaaaaatgt taaattctcc tgattcttaa gtattttcaa 10381 aaaattaaag aaaatatata gtgccaggca aatggaacac atttcaggtt gcacatgatc 10441 ctcaggcctc ccattggttc tctgagctac tgggtttcca tccagcatcc agtgtgttgt 10501 tcctggtttt gagtgcatgc ctgtcagtct ctgagtcatc ctttttcctt tcaccatgta 10561 
ttaattcttc attcatttat tgttttgtct gatccaaata ttcttattag gtgcctattc 10621 tatgtgaggt atgcagggtg ggcatgggtc tgtggctgct ggcctcactg cttggccggg 10681 gagacagacc ataatagaat gattactact cacgatgaaa ggagatacat gtaccatggg 10741 ggctttgtct cagagaggtg ggggaggctt caaggaggac gtgacagttg agttgagctc 10801 ttaaacaaga gaagaaatgt aggtgagtgg agaggggaag agggttccag agatgtacgg 10861 cacaggcaca agccctgtgg cctgagcagt acagtccctg caggagctgg aagaaggtca 10921 gagtacctgc agctcccaga gtaatggagc tatcaggtga ggctggggca agaggtggga 10981 gcaggatcat gcaggtccta ttaaggaagc ttcttttctt tattttattt tattttactt 11041 taacttctgg gatacatgtg cagaacgtgc aggtttgtta cataggtgta catgtgccat 11101 ggtggtggtt ttgttgccta tcaacccgtc atgtaggttt taagccctgc atgcattaga 11161 tatttgtttt aattttctcc ctccctgctc ccctcacctc tcgacaggcc ccagtatatg 11221 atgttccccg gcctgtgtcc atgtgttctc attgttcaac tctcacttat gagtgagaac 11281 atgtagtgtt tggttttctg ttcctgtgtt agtttgctga gaatgatggc ttccagagga 11341 agctgctttt catcctgagc tcaaatggaa gccactgaag gttttaggga ggggagggac 11401 ataattggat ttgtgactgt agaagattgc tctggctact aagtggacag tggttaggag 11461 gggcccaagt ggggttgggg agatcagtta ggaggccatg aggtgactca ggcaaagatg 11521 gcggaggttg ggaccaggga ggctgggcag agaaagcacg gaagatgggg ttgagaggca 11581 tccgagggga gaattggcag gacctgtggc cgagtgggcc ttctctacta atcctgtttc 11641 tctttagact ttctcctgca ggctgccctg taacccatgc tccactcctt cctgcaccac 11701 ctgtgtgccc tccccatgcg tgacccgcac cgtctgtgtg ccacgcactg ttggcatgcc 11761 ttgctcaccc tgcccccagg gccgctactg aagtcccttt gtgccagtgg atcctggagg 11821 gcctggggct gggcagcctg gtattcagtg gccaccagaa gagcagggcc agccccggtc 11881 agcaaggaag accctgagca ggaccgtgga tcacctgcaa caagctctga tactccaggg 11941 gatacttaag ccctcatcac ttcaaaactg cctctttttt ccatgggtga actgttctct 12001 ttggtgatgt ttctggttgt ctgtgctgcc tcaaagagcg tgtgttctta gttaactggc 12061 aaatagagct gtactcagtg gccttgcaaa catgtctgtc tctgtttgtc acttacgctg 12121 ctgcatccac aagccaatcc tactcaattg ggcttaagag gaacgtgggc aaattctgta 12181 tttattttta tgctccttct gcttccatag aggcttgaga ggtgttcact 
aaaagggccc 12241 gcatgccata aaccagttaa aactaatcaa ttactctaga gccaagtaat aaaagaataa 12301 agagaggagg gagataatta tgccagaaac ctaggccaaa ttactgtaat tgagaatcat 12361 atcataataa acccacccct aaatctcatt acagctggta caatgtgatc attcattctt 12421 tcaaaatatc ttcactgagc agctactggg tgcaggttct gcattagagg ctggctaatc 12481 caaggaagag ttcccagaca cattgttagt aatgcttcat ttaatccttg caacaatccg 12541 tgaaaaatat gccattattg catccatttt gtaaatgagg aaactgaggc ccagagaagt 12601 taagaaactt gcccaaagtc acacagcttg ttagtggcag acccaggact gaaatcgagg 12661 cctttgggct ctagagatgc tcaaccgatt cacattcaca gtcctcacta tttgcaaact 12721 acagctgggt gcagggggta ttaaaaatgc aagtgatcac caccattcaa acacttgtaa 12781 ttacaggagg agctaagacc catgtgcata agatgccact cctttcttca taagggccat 12841 ataatagtaa cagtaataat agtaataatg gcaacggtta ctaattcttg agcacttata 12901 atgcactgag tactgtgtgg agcatattac ataaattaac ttatgcagtt ttcatgacca 12961 ccttgtaagg tacacatagt atccatttta gacatggaaa tggaggcata gggtggtcaa 13021 gttagttgtt gaaggttaca tgcaaggaca aagacttaaa cccaagtcta gcttcacagc 13081 agtgttattt taaccattct aactgccaaa ttcctaccca gaaagagtaa acactagtca 13141 agatttggag aaagtcttaa gctgagagga tcctgaaagg cttcttgtag ctggtggcat 13201 ttgaaatagg tcttggaggt tgaatagaag gtctacaggg caccagatag gcaagtaagt 13261 gtgtggggat cttcaggaga gcagggggtc atgcttggaa ggcctcaagg ggtcttgctg 13321 gctggagcag tgagttcctg tgagaggctg actggggatg aagctcaaat ggtagaaagg 13381 catcagagag taggggggcc ttgggtaccc cacaaaaagc ctggattctg gactctatcc 13441 tgaaggcaat gggagggctg ctgcaggatt tgagcccaaa gatgacatga cttgagtggc 13501 atcttagaaa gtatcaccaa gtaacacaga caggatagct aagaggaggg gttaggctgt 13561 ggaagaagct aacagggtct caggcaagac aatgtcaggg accatggaaa aataaggaat 13621 caatctaaga gacactgtga tggacctgac ttggcaatgg attggccatg gcaggtaaag 13681 aggagagagc tggggacagg aatcttgaac acctttcaga acctcaccct ccaaacacac 13741 agttcttcct taatgagctg agatgatgtt tctattaagt atcctccctc tggccttgcc 13801 aagaaatgat gaaaaatgga ttggatcctg aagctgcctg caggctgctc tccagacatg 13861 atcctgcagg catccctggc agacaaggtc 
attagcctga cagcagggac atgaacatac 13921 tgcttagcaa gctgtggttc ctggttgatg gatgggtaaa atttcaagaa gctgaaatgc 13981 caagagagag gggttctggc taattgaatt ttctcataac cgcgtgcaaa ccagcaatct 14041 ttaatttcaa ccccggtgca aaacttttct ggaatgtgct cagcttgata aacaacacgc 14101 agaacagacc aaagctt // LOCUS HSHLAA1 3840 bp DNA PRI 07-JUN-1995 DEFINITION Human HLA-A1 gene. ACCESSION X55710 NID g32152 KEYWORDS HLA-A1 gene; immunoglobulin; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3840) AUTHORS Girdlestone,J. TITLE Direct Submission JOURNAL Submitted (01-OCT-1990) Girdlestone J., Medical Research Council Lab. of Mol. Biology, Hills Road, Cambridge CB2 2QH, UK REFERENCE 2 (bases 1 to 3840) AUTHORS Girdlestone,J. TITLE Nucleotide sequence of an HLA-A1 gene JOURNAL Nucleic Acids Res. 18 (22), 6701 (1990) MEDLINE 91067475 FEATURES Location/Qualifiers source 1..3840 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 445..449 TATA_signal 471..475 exon 523..597 /gene="HLA-A1, MHC Class I" /number=1 /product="HLA-A1 Class I Antigen" CDS join(523..595,726..995,1236..1511,2091..2366,2469..2585, 3028..3060,3203..3250,3420..3424) /gene="HLA-A1, MHC Class I precusor" /codon_start=1 /product="HLA-A1 Class I Antigen precursor" /db_xref="PID:g860968" /db_xref="SWISS-PROT:P30443" /translation="MAVMAPRTLLLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEP RFIAVGYVDDTQFVRFDSDAASQKMEPRAPWIEQEGPEYWDQETRNMKAHSQTDRANL GTLRGYYNQSEDGSHTIQIMYGCDVGPDGRFLRGYRQDAYDGKDYIALNEDLRSWTAA DMAAQITKRKWEAVHAAEQRRVYLEGRCVDGLRRYLENGKETLQRTDPPKTHMTHHPI SDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSG EEQRYTCHVQHEGLPKPLTLRWELSSQPTIPIVGIIAGLVLLGAVITGAVVAAVMWRR KSSDRKGGSYTQAASSDSAQGSDVSLTACKV" sig_peptide 523..597 /gene="HLA-A1, MHC Class I" gene 523..3424 /gene="HLA-A1, MHC Class I precusor" mat_peptide join(523..595,726..995,1236..1511,2091..2366,2469..2585, 
3028..3060,3203..3250,3420..3421) /gene="HLA-A1, MHC Class I" /product="HLA-A1 Class I Antigen precursor" gene 523..3421 /gene="HLA-A1, MHC Class I" intron 596..725 /gene="HLA-A1, MHC Class I" /number=1 exon 726..995 /gene="HLA-A1, MHC Class I" /number=2 /product="HLA-A1 Class I Antigen" intron 996..1235 /gene="HLA-A1, MHC Class I" /number=2 exon 1236..1511 /gene="HLA-A1, MHC Class I" /number=3 /product="HLA-A1 Class I Antigen" intron 1512..2090 /gene="HLA-A1, MHC Class I" /number=3 exon 2091..2366 /gene="HLA-A1, MHC Class I" /number=4 /product="HLA-A1 Class I Antigen" intron 2367..2468 /gene="HLA-A1, MHC Class I" /number=4 exon 2469..2585 /gene="HLA-A1, MHC Class I" /number=5 /product="HLA-A1 Class I Antigen" intron 2586..3027 /gene="HLA-A1, MHC Class I" /number=5 exon 3028..3060 /gene="HLA-A1, MHC Class I" /number=6 /product="HLA-A1 Class I Antigen" intron 3061..3202 /gene="HLA-A1, MHC Class I" /number=6 exon 3203..3250 /gene="HLA-A1, MHC Class I" /number=7 /product="HLA-A1 Class I Antigen" intron 3251..3419 /gene="HLA-A1, MHC Class I" /number=7 exon 3420..3421 /gene="HLA-A1, MHC Class I" /number=8 /product="HLA-A1 Class I Antigen" misc_signal 3422..3424 /gene="HLA-A1, MHC Class I precusor" /note="termination codon" polyA_signal 3718..3724 BASE COUNT 769 a 1072 c 1164 g 835 t ORIGIN 1 aagcttactc tctggcacca aactccatgg ggtgattttt cttctagaag agtccaggtg 61 gacaggtaag gagtgggagt cagggagtcc agttcaggga cagagataat gggatgaaaa 121 gtgaaaggag agggacgggg cccatgccga gggtttctcc cttgtttctc agacagctcc 181 tgggccaaga ctcagggaga cattgagaca gagcgcttcg cacaggagca gaggggtcag 241 ggcgaagtcc cagggcccca ggcgtggctc tcagggtctc aggccccgaa ggcggtgtat 301 ggattgggga gtcccagcct tggggattcc ccaactccgc agtttctttt ctccctctcc 361 caacctacgt agggtccttc atcctggata ctcacgacgc ggacccagtt ctcactccca 421 ttgggtgtcg ggtttccaga gaagccaatc agtgtcgtcg cggtcgctgt tctaaagtcc 481 gcacgcaccc accgggactc agattctccc cagacgccga ggatggccgt catggcgccc 541 cgaaccctcc tcctgctact ctcgggggcc ctggccctga cccagacctg 
ggcgggtgag 601 tgcggggtcg ggagggaaac cgcctctgcg gggagaagca aggggccctc ctggcggggg 661 cgcaggaccg ggggagccgc gccgggagga gggtcgggca ggtctcagcc actgctcgcc 721 cccaggctcc cactccatga ggtatttctt cacatccgtg tcccggcccg gccgcgggga 781 gccccgcttc atcgccgtgg gctacgtgga cgacacgcag ttcgtgcggt tcgacagcga 841 cgccgcgagc cagaagatgg agccgcgggc gccgtggata gagcaggagg ggccggagta 901 ttgggaccag gagacacgga atatgaaggc ccactcacag actgaccgag cgaacctggg 961 gaccctgcgc ggctactaca accagagcga ggacggtgag tgaccccggc ccgggcgcag 1021 gtcacgaccc ctcatccccc acggacgggc caggtcgccc acagtctccg ggtccgagat 1081 ccaccccgaa gccgcgggac tccgagaccc ttgtcccggg agaggcccag gcgcctttac 1141 ccggtttcat tttcagttta ggccaaaaat ccccccgggt tggtcggggc ggggcggggc 1201 tcgggggact gggctgaccg cggggtcggg gccaggttct cacaccatcc agataatgta 1261 tggctgcgac gtggggccgg acgggcgctt cctccgcggg taccggcagg acgcctacga 1321 cggcaaggat tacatcgccc tgaacgagga cctgcgctct tggaccgcgg cggacatggc 1381 agctcagatc accaagcgca agtgggaggc ggtccatgcg gcggagcagc ggagagtcta 1441 cctggagggc cggtgcgtgg acgggctccg cagatacctg gagaacggga aggagacgct 1501 gcagcgcacg ggtaccaggg gccacggggc gcctccctga tcgcctatag atctcccggg 1561 ctggcctccc acaaggaggg gagacaattg ggaccaacac tagaatatca ccctccctct 1621 ggtcctgagg gagaggaatc ctcctgggtt tccagatcct gtaccagaga gtgactctga 1681 ggttccgccc tgctctctga cacaattaag ggataaaatc tctgaaggag tgacgggaag 1741 acgatccctc gaatactgat gagtggttcc ctttgacacc ggcagcagcc ttgggcccgt 1801 gacttttcct ctcaggcctt gttctctgct tcacactcaa tgtgtgtggg ggtctgagtc 1861 cagcacttct gagtctctca gcctccactc aggtcaggac cagaagtcgc tgttcccttc 1921 tcagggaata gaagattatc ccaggtgcct gtgtccaggc tggtgtctgg gttctgtgct 1981 ctcttcccca tcccgggtgt cctgtccatt ctcaagatgg ccacatgcgt gctggtggag 2041 tgtcccatga cagatgcaaa atgcctgaat tttctgactc ttcccgtcag acccccccaa 2101 gacacatatg acccaccacc ccatctctga ccatgaggcc accctgaggt gctgggccct 2161 gggcttctac cctgcggaga tcacactgac ctggcagcgg gatggggagg accagaccca 2221 ggacacggag ctcgtggaga ccaggcctgc aggggatgga accttccaga agtgggcggc 2281 
tgtggtggtg ccttctggag aggagcagag atacacctgc catgtgcagc atgagggtct 2341 gcccaagccc ctcaccctga gatggggtaa ggagggagat gggggtgtca tgtctcttag 2401 ggaaagcagg agcctctctg gagaccttta gcagggtcag ggcccctcac cttcccctct 2461 tttcccagag ctgtcttccc agcccaccat ccccatcgtg ggcatcattg ctggcctggt 2521 tctccttgga gctgtgatca ctggagctgt ggtcgctgcc gtgatgtgga ggaggaagag 2581 ctcaggtgga gaaggggtga agggtggggt ctgagatttc ttgtctcact gagggttcca 2641 agccccagct agaaatgtgc cctgtctcat tactgggaag caccttccac aatcatgggc 2701 cgacccagcc tgggccctgt gtgccagcac ttactctttt gtaaagcacc tgttaaaatg 2761 aaggacagat ttatcacctt gattacggcg gtgatgggac ctgatcccag cagtcacaag 2821 tcacagggga aggtccctga ggacagacct caggagggct attggtccag gacccacacc 2881 tgctttcttc atgtttcctg atcccgccct gggtctgcag tcacacattt ctggaaactt 2941 ctctggggtc caagactagg aggttcctct aggaccttaa ggccctggct cctttctggt 3001 atctcacagg acattttctt cccacagata gaaaaggagg gagttacact caggctgcaa 3061 gtaagtatga aggaggctga tgcctgaggt ccttgggata ttgtgtttgg gagcccatgg 3121 gggagctcac ccaccccaca attcctcctc tagccacatc ttctgtggga tctgaccagg 3181 ttctgttttt gttctacccc aggcagtgac agtgcccagg gctctgatgt gtctctcaca 3241 gcttgtaaag gtgagagctt ggagggcctg atgtgtgttg ggtgttgggt ggaacagtgg 3301 acacagctgt gctatggggt ttctttgcgt tggatgtatt gagcatgcga tgggctgttt 3361 aaggtgtgac ccctcactgt gatggatatg aatttgttca tgaatatttt tttctatagt 3421 gtgagacagc tgccttgtgt gggactgaga ggcaagagtt gttcctgccc ttccctttgt 3481 gacttgaaga accctgactt tgtttctgca aaggcacctg catgtgtctg tgttcgtgta 3541 ggcataatgt gaggaggtgg ggagagcacc ccacccccat gtccaccatg accctcttcc 3601 cacgctgacc tgtgctccct ctccaatcat ctttcctgtt ccagagaggt ggggctgagg 3661 tgtctccatc tctgtctcaa cttcatggtg cactgagctg taacttcttc cttccctatt 3721 aaaattagaa cctgagtata aatttacttt ctcaaattct tgccatgaga ggttgatgag 3781 ttaattaaag gagaagattc ctaaaatttg agagacaaaa ttaatggaac gcatgagaac // LOCUS HSHLADMBG 6933 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens HLA-DMB gene. ACCESSION X76776 NID g512471 KEYWORDS HLA-DMB gene. SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6933) AUTHORS Radley,E., Alderton,R.P., Kelly,A., Trowsdale,J. and Beck,S. TITLE Genomic organization of HLA-DMA and HLA-DMB. Comparison of the gene organization of all six class II families in the human major histocompatibility complex JOURNAL J. Biol. Chem. 269 (29), 18834-18838 (1994) MEDLINE 94308138 REFERENCE 2 (bases 1 to 6933) AUTHORS Beck,S. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) S. Beck, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London WC2A 3PX, UK FEATURES Location/Qualifiers source 1..6933 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cosmid HA14" /chromosome="6" /map="p21.3" promoter 19..28 /note="NFkB" promoter 73..82 /note="J-box" promoter 119..128 /note="J-box" promoter 134..147 /note="Sp1" promoter 313..322 /note="J-box" promoter 439..448 /note="J-box" promoter 440..458 /note="X-box" promoter 478..487 /note="Y-box" CAAT_signal 513..517 promoter 574..588 /note="Sp1" promoter 582..591 /note="NFkB" promoter 740..749 /note="J-box" gene 756..6592 /gene="HLA-DMB" CDS join(756..810,2598..2879,4107..4391,5911..6027,6186..6221, 6576..6592) /gene="HLA-DMB" /codon_start=1 /db_xref="PID:g512472" /db_xref="SWISS-PROT:P28068" /translation="MITFLPLLLGLSLGCTGAGGFVAHVESTCLLDDAGTPKDFTYCI SFNKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQ PFWGSLTNRTRPPSVQVAKTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHS SAHKTAQPNGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPMQTLK VSVSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS" exon 756..810 /gene="HLA-DMB" /number=1 /evidence=experimental intron 811..2597 /gene="HLA-DMB" /number=1 promoter 829..838 /gene="HLA-DMB" /note="NFkB" repeat_region 1564..1814 /rpt_family="MER21" repeat_region complement(1899..2025) /rpt_family="MER21" exon 2598..2879 /gene="HLA-DMB" /number=2 /evidence=experimental intron 2880..4106 /gene="HLA-DMB" 
/number=2 exon 4107..4391 /gene="HLA-DMB" /number=3 /evidence=experimental intron 4392..5910 /gene="HLA-DMB" /number=3 exon 5911..6027 /gene="HLA-DMB" /number=4 /evidence=experimental intron 6028..6185 /gene="HLA-DMB" /number=4 exon 6186..6221 /gene="HLA-DMB" /number=5 /evidence=experimental intron 6222..6575 /gene="HLA-DMB" /number=5 exon 6576..6915 /number=6 /evidence=experimental polyA_signal 6910..6915 BASE COUNT 1858 a 1423 c 1717 g 1935 t ORIGIN 1 acaatggagg cccatggagg gattttccag gcccctattc ctgatctcca tggtggcgct 61 gcttccctta ttctgcagtc ttgctttgag accagacaat tctgtactct gtctgtgcct 121 tcttccttct cagcagccct gcctctgcca cctgccccag aagggctctg ggtcctctca 181 tctcaggaca gacaagggtt tttgtaatta tcttcttggg taacctccct gtaaggaata 241 ggagaggtta tctggtcctg gtcttttggg agagcattga gaagaaaaag ttaaaccagt 301 gggcagccca ggctgcatac atcctgggaa tattgcgatt attctctcca gtaatctacg 361 gaaatctact ggttgttctg cagaaaaaga attaagagag acaaggagac ctgttgtcta 421 agactaaggc aggatgacgt ttacctagta actgatgatg ctaggctgag gcactcagtg 481 atttgtctct acatttgtcc ctgcctacct agccaatctg tccctgtttg ggacactgga 541 ctcccgtgag ctggaaggaa cagatttaat atctaggggc tgggtatccc cacatcactc 601 atttgggggg tcaagggacc cgggcaatat agtattctgc tcagtgtctg gagatcatct 661 acccaggctg gggcttctgg gacaggcgag gacccacgga ccctggaaga gctggtccag 721 gggactgaac tcccggcatc tttacagagc agagcatgat cacattcctg ccgctgctgc 781 tggggctcag cctgggctgc acaggagcag gtaaggacac ttcttctggg gactctccct 841 tcccctgctc ctgtttcagg gtaagggtgt tccgttttgt aattgcattt gacaccccag 901 atagtttgtc tccctggtat acattcctat agcactttgt actttgtagc aattttaatg 961 taattaatct gtataattat ctgtgcagtg tatattccct gctggaatat aggcaaggac 1021 aatgttcatc ttatttattg ctgcctcctc agctcctagc acagtgcctt gcatgcagca 1081 agtgcttcat aaatatgtgc gaagtgaata tttaatattt ccagcacaat acaaggctga 1141 ctctttctct tgaccctttt tctctctcaa taatttgcct tactgaaggt ctgtgttctg 1201 ggcaaattgt catgtttaaa catgcaaata atctcggggg gctactccta tccctgtgct 1261 tagtcttgca taaagaggag actggatcta aaaacttatc tactacttct actgactccc 
1321 tcaaatcaga ctttcagaaa cttcagtgta tgagcttggt cagtagatgt tccctgagca 1381 ggaaatctgt gccagactag ctggatgtca ccaaggctta ggttctgagc tgaatatagg 1441 aaaaatcaac tttttttctt ctatatgctc acactcaaca cttctttgac caactgtgtg 1501 aggttttttt ttttttttta ctcataccaa ccaattctcc tatattagct ggatatccta 1561 taattcaatt ccattgtgac attaactaga gttaacatag acaccaaagg ttaaagactc 1621 agtcccataa gactgcctcc atttcagaca ccaatcacaa gtagtaggtt cccaaattac 1681 ccacatcttc tgtccaactt gcctacaaat cagaggttcc catgaccccc tccttggggt 1741 tggtaatttg ctaaagtggc ttatggaact caggaaaagt ttacttatta ttgtagattt 1801 tttacaaagg atattttaat tgataataat aattataaat attcatgggg tggataatga 1861 tgttttagta catgtaatgt acagtgatca gatcagatca tcatctcatt tatcatttct 1921 ttgtgttgga aacattccat atccttcttc tagcgatttg aaatgatata aagtattatt 1981 gttaatgaca gtcatcccac agtggtatag aacactagat cataattttg catcctttaa 2041 caaaggatat tttaaagcat acaaaggaac atccagatga agagatacca gatctggaag 2101 ggtcccaatc acaggagctc tgtccccaca gaattggggt acttcacctc ctggcatgtg 2161 gatgtgttta ccaactgaga agttctctga actccatagt tccgggattt ttatggaggc 2221 ttcatcatgt aggcatgact gattattaac tcaatctcca gccccttccc cttcagggag 2281 tatgggggat gggactaaaa gttccagact tctaatcatg acttggtctt tctggtgacc 2341 agcccctcct gcaggagccc accaagagta cctcattaga acaaaagaca ctcctgttat 2401 ctaggaaatt ctaagcgatt aggcactcta tgtcaggaac cagggtcaaa gacaaagctc 2461 tgtgcagagc tcctaaatat acgtctgtat gttttatata tttattattt gtgtatattt 2521 attatttata tatttattta tttattgcca ggccagtaga agacattgac ctgttctccc 2581 ttccctggct cctctaggtg gcttcgtggc ccatgtggaa agcacctgtc tgttggatga 2641 tgctgggact ccaaaggatt tcacatactg catctccttc aacaaggatc tgctgacctg 2701 ctgggatcca gaggagaata agatggcccc ttgcgaattt ggggtgctga atagcttggc 2761 gaatgtcctc tcacagcacc tcaaccaaaa agacaccctg atgcagcgct tgcgcaatgg 2821 gcttcagaat tgtgccacac acacccagcc cttctgggga tcactgacca acaggacacg 2881 tgaggagaga ggggtgcaga ggggctacca ggaagtgcag ttaggagggc aggccaggga 2941 ggatcccaca gtggcccagg ggtttgagat ttgagcagca aataagagaa aatgtgtgga 3001 
tctgaaatgt agaaagacgg aggattgaac ctcaagggga acaaggtggc tgacgtgagt 3061 ggaacaggag taaagaaggg gaggtgaggc ttgaaccgcg aggtgccatg tggggagctt 3121 atgcagaggc tggggcatct caggatgcat acccaagatg ttcttgcctt gttatcccag 3181 attttgatgt tccagatctg atgtgggccc aggcatggga atatttggaa tcccagggga 3241 ttctgacaca tgctttttct cacccttaaa ctcttgcatt gacaatggct tgaagtttgt 3301 gaaagtaaac ttgaagatct ccacagtaca gaacagtgtg tctcaagggt ggtttctgaa 3361 ccacctttct cagaattgcc tggaggaggt gcttgttaaa gatgcaagcc ccttagtacc 3421 actccagatc tgttgaaaga aagtatctgg ggatacagcc taggaaatct gcattttaac 3481 ataattcctt ggatttttat ttaagattgt gtttgaaaaa tgtcaagata gaggcaagca 3541 agaggatcac ctaggagagt aaattagtaa aagatggcag tattagcaat ctctttagtt 3601 tgactacatt cattccaaat ttaagagtga gtcctaagtt aggcttgttt ccttgaacta 3661 tgtgaggaga aaaagcttta actagcaaaa gaacgtatta aacaagattt ggagaaaaat 3721 tccttttcca ccttaaaaaa acccaatgta caactctgga ttactcttag cttccttatt 3781 tcaaatactt tccagtttat gtacttgaaa taaatacaac aacttctaga acagcttgca 3841 gttcagatct ggcttttact aattgtaatc aaacataatt ctggaggagg aaagaaagaa 3901 aggggcaatg aaggaatggg agagaagaaa gagtaatgca ggaatacatt ctaacggttc 3961 cccttcaagg ggcagcatgg cagagggggc tggggtggaa agtgggttgc aaaatctacg 4021 aagagttgcg atagggaaga aaccaggttg aggaagcagc cagaatgtca ccctccttcc 4081 taaacatgtt tttttctcct atgcagggcc accatctgtg caagtagcca aaaccactcc 4141 ttttaacacg agggagcctg tgatgctggc ctgctatgtg tggggcttct atccagcaga 4201 agtgactatc acgtggagga agaacgggaa gcttgtcatg cctcacagca gtgcgcacaa 4261 gactgcccag cccaatggag actggacata ccagaccctc tcccatttag ccttaacccc 4321 ctcttacggg gacacttaca cctgtgtggt agagcacatt ggggctcctg agcccatcct 4381 tcgggactgg agtaagtgta tggcagatgg atggaattag ggtcaaagca gagaaaatga 4441 gatgtggatc gatacatggt acatggtaga cagcgaagtg ctgaaaatgg ggactgagtc 4501 tggaggaact tacggggggc ttaggaccag aatggggaaa tgggataaag aaatggaaat 4561 atttaggttg gtgcaaaagt aattgcagtt tttgccatta ctttcagtgg caaaaaccgc 4621 aattactttt gcaccagttt aatatttagt ctgtgctatt gctgctctgg tggtgtggtt 4681 gatgttgctg 
cgtctatgtt tgagggtgag aggggagcgt gcttgctttg aaatgaggat 4741 gtaaatttgg caatcatatt ttcagaaccc caaattgtaa tacactattc tagcctcctt 4801 agatttcaac tattctggtg ccagaagcag atgggagctg aaggaatgat gaaggttgaa 4861 gaaggggggc ttttcttggt gtggggcagt actgcatttg gcctgctcta ccaagcatac 4921 gggagtagta aagccacggc tggcagacca tttggcatgc atgctcaggg gccagtggat 4981 aaagaattac ttacagttca aacactgttt gaactcagtg tcgggagtag ttaaaggtat 5041 cgtgagaagt tgcacacagc tttggggact cttggaaaag aaagaggaag aaatgaggaa 5101 gaggaagggt gtctacaaag ggccagagaa caggatctca gatcagctgc tgtaaccagg 5161 tttccccttg tgggaagtgt tgtttcttgc tgggcagttg ggaagggaat ggagaacaga 5221 gaagagagtg gaaatcacat gctcacttga actttcctgg ggaacgtctc ctcacagcgt 5281 gcacaagagc ctccctttag aaatggagtg ttcattttat catgggaaaa gaatctgagt 5341 gggacatgat tcagaacagg accggcccaa ggaagtgcag gggctgtgga gtgggatgga 5401 gacaagctct gaaaggacac atgggagatc tagatgtaga aggtacacaa gtagtaggat 5461 aactcacagg atggatccac tggaggttaa gacatgtggt aagacagtgt aataggaagc 5521 tgctcagttg gagaaagtaa ggaagcaaac attgttaccg tgggggcaat ggagaggaca 5581 gtgaggagcc ctttatcctg ataagggtgg ctttgaggta aaggaaggaa agaggatgcc 5641 ttgagaggcc ccactgtatt agagaggacc tggaagccag gatgctaatt ctggggagat 5701 ggattcccca ggcttactct aggagtagag gtccatggga cgagggtttg atttgagaaa 5761 gatcattttc ttgggagtgg gtggtgtgag ctagaccctt ggagctggga taaaggacct 5821 tttaacccac tgagaggtgg ctgcaataaa tggaattgcc ctgggggtga gcaacagaaa 5881 ctgggtcaag taagtttcta ttttttgcag cacctgggct gtcccccatg cagaccctga 5941 aggtttctgt gtctgcagtg actctgggcc tgggcctcat catcttctct cttggtgtga 6001 tcagctggcg gagagctggc cactctagtg agtgactcgc tgaactccca tccccactct 6061 tggtcccact ctctgcttac tttctgtttg tgattaactc tctccttcct actgcatttg 6121 ctatgaatac tgctagatat tttcatccac aaagactggt ataatcaagt atcttcctct 6181 cttaggttac actcctcttc ctgggtccaa ttattcagaa ggtaacatct ctgttggtct 6241 gtttccctac ttgccctttg gtaggggtgc gggttagagg ggtcagtgtt gggttcaact 6301 aatcttgatt attatatggg tgagcttcca tgaggatcta ggcaagggca tgatttaagc 6361 tgccattgct aggattaaga 
gcaggaagga gcatcctcct cttctaccaa gtgggatgtc 6421 tgtggagagg aggctgaagg tgcttccttt gtattagttg ttggtgccct ggagttttca 6481 gtatcactgt attaaggcat gggatggtta cagtgacaaa cgatgggggc aagttgggtt 6541 gaagcctcat tatctccctt ttatttattc tgtaggatgg cacatttcct agaggcagaa 6601 tcctacaact tccactccaa gtgagaagga gattcaaact caatgatgct accatgcctc 6661 tccaacatct tcaaccccct gacattatct tggatcctat ggtttctcca tccaattctt 6721 tgaatttccc agtctcccct atgtaaaact tagcaacttg ggggacctca ttcctgggac 6781 tatgctgtaa ccaaattatt gtccaaggct atatttctgg gatgaatata atctgaggaa 6841 gggagttaaa gaccctcctg gggctctcag tgtgccatag aggacagcaa ctggtgattg 6901 tttcagagaa ataaactttg gtggaaatat tgt // LOCUS HSHLADZA 5691 bp DNA PRI 19-AUG-1996 DEFINITION Human HLA class II alpha chain gene DZ-alpha. ACCESSION X02882 NID g32216 KEYWORDS cell surface glycoprotein; class II antigen; major histocompatibility complex; surface antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5691) AUTHORS Trowsdale,J. and Kelly,A. TITLE The human HLA class II alpha chain gene DZ alpha is distinct from genes in the DP, DQ and DR subregions JOURNAL EMBO J. 4 (9), 2231-2237 (1985) MEDLINE 86081729 FEATURES Location/Qualifiers source 1..5691 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..83 /note="homology to Alu I repeat" promoter 1649..1669 /note="put. promoter sequence" promoter 1686..1700 /note="put. promoter sequence" CAAT_signal 1720..1726 /note="pot. 
CAT-box" exon <1816..1897 /number=1 sig_peptide 1816..1890 mRNA join(<1816..1897,3091..3339,3759..4040,4136..4271, 4510..4790) CDS join(1816..1897,3091..3339,3759..4040,4136..4271, 4510..4513) /codon_start=1 /product="DZ-alpha protein" /db_xref="PID:g1335103" /db_xref="SWISS-PROT:P06340" /translation="MALRAGLVLGFHTLMTLLSPQEAGATKADHMGSYGPAFYQSYGA SGQFTHEFDEEQLFSVDLKKSEAVWRLPEFGDFARFDPQGGLAGIAAIKAHLDILVER SNRSRAINVPPRVTVLPKSRVELGQPNILICIVDNIFPPVINITWLRNGQTVTEGVAQ TSFYSQPDHLFRKFHYLPFVPSAEDVYDCQVEHWGLDAPLLRHWELQVPIPPPDAMET LVCALGLAIGLVGFLVGTVLIIMGTYVSSVPR" mat_peptide join(1891..1897,3091..3339,3759..4040,4136..4271,4510) /product="mature DZ-alpha protein" intron 1898..3090 /number=1 exon 3091..3339 /number=2 intron 3340..3758 /number=2 exon 3759..4040 /number=3 intron 4041..4135 /number=3 exon 4136..4241 /number=4 intron 4272..4509 /number=4 exon 4510..4790 /number=5 polyA_signal 4767..4772 polyA_site 4790 /note="put. polyA site" misc_feature 5467..>5691 /note="homology to Alu I repeat" BASE COUNT 1353 a 1567 c 1308 g 1463 t ORIGIN 1 ctgcagcgag ccatgattac ttcactgccc tccagcctga gcaacagagc aaggccctgt 61 ctctaaaaaa taaataaata aatgcaccac tcttagttat gtataggcca gtccaaagat 121 aaagaaaacc gtaataaatt agaatgacaa aatccacatg cttgatccaa taaacatgta 181 catgtataaa attctgcatc acatttatta gagcccatgt gaaatattta cagaagttca 241 ccacactcca agccatagag gattatcatg taattaaatt ataagttaat aacaaaaaac 301 atactttaaa aaatctatgt atgttttcaa ataaaaaaaa acacccgata ctcgactgtt 361 attaaccaaa gaagcaacat tctttatcat gagtttttca tggctttccc gtctttcatc 421 tcatggctcc agctcccacc tcacttgggg ggaggtgaag gtggcaaggg tcctaaccag 481 ggcaggacca aagcagtgaa gacactgttc attcatgcgc aggtgtctcc taggcgccta 541 tgatgtgcta ggcattacgt agacactgtg gactagaaac gaaaaaacta agattagggc 601 tccccataaa ccgggccatg caaatcaatt actagctgtc atttgattaa tttgaagtca 661 gaagctgtgt ctgccccact taaaataggc atcccttatt cattgaggaa agagcattgg 721 agcaggtttt accagtccac agactacccg agaaccacct ggggagcatg ctgaaaacac 781 aggttccctg gccctgcctt ggaaaggcta atctaaccca 
ttaagtaaag aggaaactca 841 ggaacacgta ttataacgaa ccccccaggt gattctcatt tggaaccaga gacccgcatg 901 atttcctagc tccctccaag ttgaaaatca tcgatttcat ggtctttgac acatggccta 961 atttggttag atttctcttt acacagacgt tagctttacg ttgttgttgt tttgttttgt 1021 tttgtttttt aaatatatat actggactaa attctcccag tgcctcttat tagccgagcc 1081 caaccagaag ccaaaggcaa gggtgatgcc gtcctccagc ctgtgtcagc ctggaggagg 1141 tgcccgggtc ctggagggac caacagagag gaccagcacc tctgccctcc tcggggagtc 1201 cacgggagct ctttacactt tgctttagac caaacaactc caacaattac acattttttc 1261 ttacacaatg aaattaaata ataataaaag aaaaataaaa taacattttt gaaagtgatt 1321 taaagtttta agaggttttt attccattgt gtagccaatg ttttcctttt aaagattatt 1381 ggtaaagttt attttacttt ctaactcaaa atttgtccac ggaaacttct tgggaaagta 1441 ggattccctg catcgcagat ggacatagag agaggatttg ttggagctca gctgcgcggt 1501 gctctgacgg ccttttctct ctcttcaggg ggtcccttct tggaggacgt ggtaatagtt 1561 ggtccaagcc cctcccatcc tccaaaactt cctttggcct ctcctggggg cccagtgaat 1621 ccttccacct tctcaccccg gctctttccc tctttaccca gcaacagata cattcactca 1681 gagaatttct gtgattggct gaagacagaa ggggtcgccc ccatcctcga atctgttttc 1741 ttcttcttta cctccgcctt gttcctgtcc tcaccacacg gactgagact gatttgatta 1801 aagcaccaga gtgtaatggc cctcagagca gggctggtcc tggggttcca caccctgatg 1861 accctcctga gcccgcagga ggcaggggcc accaagggtg agtgcgaggg cgaggagggt 1921 gcggcgggga gcagagattt acggagttgg gttacatgag gaggtggcat ggaggatgct 1981 tgttcctctc gctctctggt ttatgggcaa cttccttcac caagagacac ccaatccccc 2041 tcatctctgt cacatccact ctggacctaa atgaagatgc agctcggtca gctgcgcagg 2101 tgccccagtc agcctttgct gacgttcaga tttctcctca ttccttcctc cttcctgaga 2161 cccaaacctc cacccaacag atgccagcaa gcaccctgat ttctctacca cccctggcct 2221 ggaatgtgcc cgatcaagtc cagttctgtt gcagtattta tgcccatgcc ggtagttaac 2281 tatttacctg tctttgttct tcgggagaca tgagctgggg tgcgggtcta cagatggttc 2341 atcttttttt tcttttattt ccctggcccc ccattgtgct gggtgcatgc tagttcctca 2401 ataactgttg ctcaaacaac ttcatagagt tctacaagaa ttaaaactta atccctaact 2461 tccagaaaac tagacaacag ttatggaaga gccacactca gtcataatgc 
tctgagatgg 2521 aggaattggg acatgaacct tgacttctga ctcctcgtcc agtgctcttt gtaatgcctt 2581 gagttgcctc tccctatccc cttggtctct gggtcactaa ccttaattct tacccctgcc 2641 agccgtggcc tgtttcccct cacacccacc tgcactgcat ctctgtgcaa agctccatac 2701 tctcctgtcc ttaatcattc ctcctcatcc cacccccaca gtcccaccag ctcacaacac 2761 agggcctgac aacacagcca gggcagatga ccacagccaa gatttaaatt ctggaacccc 2821 aagcatgatt ttaggcagtc cctcctcttc ccatcctgat atgccagact gcactgtctc 2881 tgtgccgact gcagtgtgct gggatgagcc tctttcttcc tgttctctct ccctttctct 2941 ctcccagggc cagcccagtg tcagaacagg actctgtccc cacacagaac ccaggacggg 3001 gcccaggctc agggactcaa caatcacata ttgtggatga gacagacaca tttttttctc 3061 tctccttgac cctgaactcg ccaaacacag ctgaccacat gggctcctac ggacccgcct 3121 tctaccagtc ttacggcgcc tcgggccagt tcacccatga atttgatgag gaacagctgt 3181 tctctgtgga cctgaagaaa agcgaggccg tgtggcgtct gcctgagttt ggtgactttg 3241 cccgctttga cccgcagggt gggctggccg gcatcgccgc aatcaaagcc catctggaca 3301 tcctggtgga gcgctccaac cgcagcagag ccatcaacgg taccggccct ccctctgccc 3361 acccagtcag gcgggaaggt ccagagaaac ttcctcccag ttcctaggtt cccatcactc 3421 tggggcgcgc tctcacgccc gcgcctgtca tgccctgttc ctttctttcc caggaggctc 3481 caggtcttcc cagacccctt tggcacccct ctccttgagg aatgacacct ctcacccgga 3541 ctcccgccca gggaccagtc aaatatagga gctcctggcg tccccactcc ctccccagtc 3601 tcctctccct ctgtttccct cctctcctgc cccagtggat accccagagc atcccctgcc 3661 cacagatggc tacaaagggg gaacgtccct taatcccagt cctagtaagg ccctggggtg 3721 agggatgagc ctgtggactc agggcccgtt cctcctagtg cctccacggg tgaccgtgct 3781 ccccaagtct cgggtggagc tgggccagcc caacatcctc atctgcatcg tggacaacat 3841 cttcccccct gtgatcaata tcacctggct acgcaacggc caaactgtca ctgagggagt 3901 ggcccagacc agcttctatt cccagcctga ccatttgttc cgcaagttcc actacctgcc 3961 cttcgtgccc tcagccgagg acgtctatga ctgccaggtg gagcactggg gcctggatgc 4021 gccactcctc aggcattggg gtacggagcc ccctccccat gcaccctcct ggccccaggt 4081 ttcctttact ctagaatcct ttcatatacc accgactcct tcctttctct cctagagctc 4141 caggtgccta ttccaccacc agatgccatg gagaccctgg tctgtgccct gggtctggcc 
4201 atcggcctgg tgggcttcct cgtgggcacc gtcctcatca tcatgggcac atatgtgtcc 4261 agtgtcccca ggtgcagagg ccccgggagt ctggggggtg ggggaggaaa gtggatgact 4321 ctgaacagga cgtgggtgga gaatcagaga ttctgttgtg gggaaagaag tcagaaaaga 4381 aatgggcagg gagaaaagaa gcagaggtgg ggtgagagag tgaggttttg ggggaggtgg 4441 gcactcagag ataggatccc agcatattga aattgagcaa cctcgatcgt atgttttctg 4501 ctattttagg taatgatcct tctgagagaa atgacttgtg ggagacaccc tgcagatcct 4561 catgggtttg tgacagcccc tgcgtgctca gtgcccttta agtgcatccc gctgtgctga 4621 ctttgagtgg gatcaacatc tgtcctacgg gtcccctctt ttttggcccc agtattcatg 4681 gcagggtttg ttggacacct actagcttcc cttcccattc aacacaaaca cacattcttg 4741 ctctacccaa agctctggct ggcagcacta aatgctttgg tggtgtttgc actgtgtcct 4801 ttccaggcct tggccagttc ttccaggggt gaggcatgtg gtgctgggga ttggcagccg 4861 tcctggggcc cacacaggtg tgtcttgctc catttggccc attgtgtgtt actttgtgaa 4921 tgagccattt cacatggact tcatgaaatt tgcctcctga gttcaggttt accctgaaag 4981 ggatgcagat tatcctgttc ctcacgaccc cctcagctaa caacagttct gaagggtgct 5041 gggacaagac aggctcatgg ggactccact cctgcctggg tttactctgt atgaagaggc 5101 cactggtatc ctgccatgat gttatctcct ttttctactt tccctagagt cccatgcatg 5161 ataaagagag gcccaaggct tggataaggt ggccacttcc ctcagtggag tcagtcatgt 5221 taggtaggag gtggtagagt cggtctgcaa ggtatctcgt aagaggggag gtccacctag 5281 acacattcta aatatgtggc ctagaagatt ttggtctact tttctgtgaa caaaatttaa 5341 aacatacaaa gagataaatc accataccac atagtttatg tcaagaccaa aatgagcaat 5401 acagattacg gttttcaaac cagaatgcac ataagaactg cttgggatcc ttttaaaagt 5461 acaggcattg gcctggtgca gtggctcatt cctgtaatcc cagcactttg ggaggccaag 5521 gggacagaac tgcttgaggc caagaggtgg aaaccatctt gggctacata gagagacccc 5581 atctctacaa agaaagattt aaaaattaac cagccatggt ggctcgcacc tgtattccca 5641 gccactgggg aggctgaggc cggaggagtg cttgagccca ggagttcaag g // LOCUS HSHNRNPA 5368 bp DNA PRI 24-APR-1993 DEFINITION Human gene for heterogeneous nuclear ribonucleoprotein (hnRNP) core protein A1. ACCESSION X12671 NID g32344 KEYWORDS hnRNP A1 proten; ribonucleoprotein; RNA binding protein. 
SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5368) AUTHORS Riva,S. TITLE Direct Submission JOURNAL Submitted (23-AUG-1988) Riva S., Consiglio Nazionale Delle Ricerche, Istituto Di Genetica Biochimica Ed Evoluzionistica CNR, Via Abbiategrasso 2D7, 27100 Pavia, Italy REFERENCE 2 (bases 1 to 5368) AUTHORS Biamonti,G., Buvoli,M., Bassi,M.T., Morandi,C., Cobianchi,F. and Riva,S. TITLE Isolation of an active gene encoding human hnRNP protein A1. Evidence for alternative splicing JOURNAL J. Mol. Biol. 207 (3), 491-503 (1989) MEDLINE 89342435 FEATURES Location/Qualifiers source 1..5368 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pES5" /tissue_type="liver" /clone_lib="lambdaCh4A." misc_feature 695 /note="mRNA initiation site" exon 695..813 /number=1 mRNA join(695..813,1377..1493,1789..1935,2084..2294,2388..2480, 2567..2659,2794..2868,3806..3961,4252..4311,4543..5240) misc_feature 715 /note="mRNA initiation site" misc_feature 735 /note="mRNA initiation site" CDS join(799..813,1377..1493,1789..1935,2084..2294,2388..2480, 2567..2659,2794..2868,3806..3961,4252..4307) /codon_start=1 /product="hnrnp a1 protein" /db_xref="PID:g296650" /db_xref="SWISS-PROT:P09651" /translation="MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCV VMRDPNTKRSRGFGFVTYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHL TVKKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVD KIVIQKYHTVNGHNCEVRKALSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGR GGNFSGRGGFGGSRGGGGYGGSGDGYNGFGNDGSNFGGGGSYNDFGNYNNQSSNFGPM KGGNFGGRSSGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF" intron 814..1376 /number=1 exon 1377..1493 /number=2 intron 1494..1788 /number=2 exon 1789..1935 /number=3 intron 1936..2083 /number=3 exon 2084..2294 /number=4 intron 2295..2387 /number=4 exon 2388..2480 /number=5 intron 2481..2566 /number=5 exon 2567..2659 /number=6 intron 2660..2793 /number=6 exon 2794..2868 /number=7 intron 2869..3805 
/number=7 exon 3806..3961 /number=8 intron 3962..4251 /number=8 exon 4252..4311 /number=9 intron 4312..4542 /number=9 exon 4543..5240 /number=10 BASE COUNT 1476 a 1052 c 1270 g 1570 t ORIGIN 1 gggattgaga gtgatcactc acgctaacgt ctgccctgtt cctgtatggt gaggccgcac 61 cacaagccac caccgccgcc gccttctgcg caacgccaac cgcccgccaa aacggatcct 121 tccctgcgcc tgcgcaacca atcttgggac cggacctttt ttctccgccc actacgcatg 181 cgcaaagcta ggacaaactc ccgccaacac gcaggcgccg taggttcact gcctactcct 241 gcccgccatt tcacgtgttc tcagaggcag gtggaacttc ttaatgcgcc tgcgcaaaac 301 tcgccatttt actacacgtg cggtcaacaa gagttcattg caaaaaaatt gttacctcct 361 agctgcttgt ctaatacata gtgttaatca tgctttgcca agcgacttga ctgtaatatt 421 tgcgcgtgga agattaaaaa gatgttaaac acccaaggta gattcaaatg tgaatgattg 481 gtcggttggc caatcagact ggttaacaat aacattactc gggaaccaat ggactccaag 541 gggtggagac ggcgtagaac gaccgaagga atgacgttac acagcaatgt ggcaccacag 601 gccaatagca gggggaagcg atttcaagta tccaatcaga gctgttctag ggcggagtct 661 accaatgccg aaagcgagga ggcggggtaa aaaagagagg gcgaaggtag gctggcagat 721 acgttcgtca gcttgctcct ttctgcccgt ggacgccgcc gaagaagcat cgttaaagtc 781 tctcttcacc ctgccgtcat gtctaagtca gaggtgagtt aggcgcgctt tcccacttga 841 attttttcct ctccctttcc tgaatcggta agatgctgct gggtttcgtt ccttgcacca 901 gcccattcta cagttccttc ggtcgctgcc acggcctacc cctcccaaag ttcaagtcgc 961 cattttgtcc tcttgatcgc catgaggccg ctctccgcca accatgtgtt atcatgcggg 1021 actcgttact cgtagcaaaa ttcttaggca cacaggatct ttgtcttttt ttaaaccttg 1081 ccttggtgag cgagttttct aaagagcgat tagtcccatt gtggagatgc acccctaccg 1141 cccaagcctt tgttgcgcgt gcgtcggaag gcgactaggg acgcatgcgc ttgcgatttc 1201 ctagcactcc caactccagc atacggcctc ccttgatagg cagaagcacg tgtcttgttg 1261 cgacctgaac gaacaataag tgctaggtac acagttggtg tctagttttt cttttcctcg 1321 atggaaattg tttcgtgttg tagcccattt aacacttccc cctcccccca ctctagtctc 1381 ctaaagagcc cgaacagctg aggaagctct tcattggagg gttgagcttt gaaacaactg 1441 atgagagcct gaggagccat tttgagcaat ggggaacgct cacggactgt gtggtaagat 1501 ttggaaggga caaagcagta aaacagccga tttccttggc 
ttatcttggt gcagtcttct 1561 ccgaatgctt atgaaagtag ttaatagcat tatagttaga gctttgttgg caaaggaacg 1621 tcctgctttg attttaaaag ctaacctctt aaatctaagg gtagtgggaa actggacgaa 1681 ctttttataa aaggctggtg taaagtttcc tattgcccta ttcaaagtta aaataacaaa 1741 agcttttgcg gtcagacttt gtgttacata aattaacact gttctcaggt aatgagagat 1801 ccaaacacca agcgctctag gggctttggg tttgtcacat atgccactgt ggaggaggtg 1861 gatgcagcta tgaatgcaag gccacacaag gtggatggaa gagttgtgga accaaagaga 1921 gctgtctcca gagaagtgag tgggtttttt ttcttcttct tcttaaactt acttggatat 1981 gtgctgctat gaacttaaga ttcgggagtt ttctaaactt accaaaattt tttattcgag 2041 tataggcttt gctaatctaa acctatggtt tttctcctat taggattctc aaagaccagg 2101 tgcccactta actgtgaaaa agatatttgt tggtggcatt aaagaagaca ctgaagaaca 2161 tcacctaaga gattattttg aacagtatgg aaaaattgaa gtgattgaaa tcatgactga 2221 ccgaggcagt ggcaagaaaa ggggctttgc ctttgtaacc tttgacgacc atgactccgt 2281 ggataagatt gtcagtaagt atcagatagt ggcatttagt aagggttcca caatctgtat 2341 ggcattctaa accctgatac catgttgtat ctatgttttt tttttagttc agaaatacca 2401 tactgtgaat ggccacaact gtgaagttag aaaagccctg tcaaagcaag agatggctag 2461 tgcttcatcc agccaaagag gtatgcttgt tgcttaatta aaccttaaag gtaactttga 2521 gttactccag tatgaatgat ttaatgctta aacttcatgt cttaaggtcg aagtggttct 2581 ggaaactttg gtggtggtcg tggaggtggt ttcggtggga atgacaactt cggtcgtgga 2641 ggaaacttca gtggtcgtgg tatgtatggt ttatctacat gtagttctga cttctcacca 2701 tctttgctat gaagatttta cagtacggga actgcattca gaatgtcact ttaagtccaa 2761 gtcatactta aaacttgaaa ctttttctta caggtggctt tggtggcagc cgtggtggtg 2821 gtggatatgg tggcagtggg gatggctata atggatttgg caatgatggt aagtttttta 2881 ggaataagta gagaaaaatt cctggcaacc tggatcttta gaataggtta gtagagacta 2941 aaattctggt gcatgtcaaa ctcaactttg cccataacac gcatgctgtg agcaggcctt 3001 cagccgttac acttgcacaa gttttcattg tcaaatactt ttgtcttatt gagaagaatt 3061 gtattcttgt aggtggttat ggaggaggcg gccctggtta ctctggagga agcagaggct 3121 atggaagtgg tggacagggt tatggaaacc agggcagtgg ctatggcggg agtggcagct 3181 atgacagcta taacaacgga ggcggaggcg gctttggcgg tggtagtggt 
aggtatccag 3241 tgatccaagt acttggtgtg acagctagat tagcctttta gagcttgggt tctggtgctg 3301 ttgaagcatt gtgtggtaca ctgcatggta tattaaaaac aaatgggctt gctatgctac 3361 ctcctcctag ctttaagctg gggccgcctc actcccaaat agtagagata agtggatagt 3421 gttgtctttg agttagatta gtatcataga aggatttagt attttaactc ctttgggacc 3481 ttaggcgctt agttgatgta tccaagatac ttctgcttgc tgtggccctg gatccgtgaa 3541 ggccttcaag gctgaagggt atgcttgtgc cactctgaaa atctctttat tttatgtcat 3601 ggtgagttag gccagttttc tttgtattac tggattattc aactgaatgc ctttcccaga 3661 gaatgaaatg caaagattgg agtcaccata gtttgggaga aaggaaggct gataactcaa 3721 ccttatttta ttctgactgc taaacagaat tggaaactaa catcatcctc aggtaacaga 3781 taaaggccct ctttcccatt cataggaagc aattttggag gtggtggaag ctacaatgat 3841 tttgggaatt acaacaatca gtcttcaaat tttggaccca tgaagggagg aaattttgga 3901 ggcagaagct ctggccccta tggcggtgga ggccaatact ttgcaaaacc acgaaaccaa 3961 ggtatggtat ctatgtaatt ttggataatg tcaaaagagt gtctgtagct actgctggga 4021 agaaagccct ttaactgcta tgtctgggca gcaaaacgtt tatagtttag aaccttcaga 4081 aagtgataat ttgatcacaa attagaaaaa tcatgggacc tctttaccac ctcccttgta 4141 gtagggccat ttttaaatgg ccagacactt gaatttaact tttattatcc caaatatgaa 4201 aacattactg ttggcacttt gaaactttaa aagaaaaatt gtacttttca ggtggctatg 4261 gcggttccag cagcagcagt agctatggca gtggcagaag attttaatta ggtaagtaag 4321 cacctttttg tgtgttgaca taatttttta aattgctgat gaacccaata accctaatgt 4381 agctgagcag tgcaacatag ttaacattat aattgcagta attgtggata taaagttaat 4441 attcagatca gcaaaatttg tgggaaacaa acttgatatt ggattgtagc cttgagtctt 4501 aatatgttta gattaacaac tctattccat attgttcaac aggaaacaaa gcttagcagg 4561 agaggagagc cagagaagtg acagggaagc tacaggttac aacagatttg tgaactcagc 4621 caagcacagt ggtggcaggg cctagctgct acaaagaaga catgttttag acaaatactc 4681 atgtgtatgg gcaaaaaact cgaggactgt atttgtgact aattgtataa caggttattt 4741 tagtttctgt tctgtggaaa gtgtaaagca ttccaacaaa gggttttaat gtagattttt 4801 ttttttgcac cccatgctgt tgattgctaa atgtaacagt ctgatcgtga cgctgaataa 4861 atgtcttttt tttaatgtgc tgtgtaaagt tagtctactc ttaagccatc ttggtaaatt 
4921 tccccaacag tgtgaagtta gaattccttc agggtgatgc caggttctat ttggaattta 4981 tatacaacct gcttgggtgg agaagccatt gtcttcggaa accttggtgt agttgaactg 5041 atagttactg ttgtgacctg aagttcacca ttaaaaggga ttacccaagc aaaatcatgg 5101 aatggttata aaagtgattg ttggcacatc ctatgcaata tatctaaatt gaataatggt 5161 accagataaa attatagatg ggaatgaagc ttgtgtatcc attatcatgt gtaatcaata 5221 aacgatttaa ttctcttgaa tgaaatgaca actgtatgga tttgggactg gcagagattt 5281 ggactttccc tacccactcc ccctgataat aatgttgaat gcttctatca caattcaagt 5341 tcaaagctct gctagggaat agaaacta // LOCUS HSHOX3D 4968 bp DNA PRI 15-JAN-1992 DEFINITION Human HOX3D gene for homeoprotein HOX3D. ACCESSION X61755 NID g32387 KEYWORDS homeoprotein; HOX3D gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4968) AUTHORS Mavilio,F. TITLE Direct Submission JOURNAL Submitted (09-SEP-1991) F. Mavilio, Institute Scientifico San Raffaele, Dep of Biology and Biotechnology, Via Olgettina, 60, 20132 Milano, ITALY REFERENCE 2 (bases 1 to 4968) AUTHORS Arcioni,L., Simeone,A., Guazzi,S., Zappavigna,V., Boncinelli,E. and Mavilio,F. TITLE The upstream region of the human homeobox gene HOX3D is a target for regulation by retinoic acid and HOX homeoproteins JOURNAL EMBO J. 
11 (1), 265-277 (1992) MEDLINE 92155167 FEATURES Location/Qualifiers source 1..4968 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pcos2EMBL" /map="q12-q13" TATA_signal 2251..2256 /evidence=experimental mRNA join(2280..2808,3507..4589) /gene="HOX3D" /evidence=experimental misc_feature 2280 /gene="HOX3D" /note="transcription start site" /evidence=experimental exon 2280..2808 /gene="HOX3D" /number=1 /evidence=experimental gene 2280..4589 /gene="HOX3D" CDS join(2355..2808,3507..3721) /gene="HOX3D" /codon_start=1 /evidence=experimental /product="homeoprotein HOX3D" /db_xref="PID:g32388" /db_xref="SWISS-PROT:Q00444" /translation="MSSYVANSFYKQSPNIPAYNMQTCGNYGSASEVQASRYCYGGLD LSITFPPPAPSNSLHGVDMAANPRAHPDRPACSAAAAPGHAPGRDEAAPLNPGMYSQK AARPALEERAKSSGEIKEEQAQTGQPAGLSQPPAPPQIYPWMTKLHMSHETDGKRSRT SYTRYQTLELEKEFHFNRYLTRRRRIEIANNLCLNERQIKIWFQNRRMKWKKDSKMKS KEAL" intron 2809..3506 /gene="HOX3D" /number=1 /evidence=experimental exon 3507..4589 /gene="HOX3D" /number=2 /evidence=experimental BASE COUNT 1037 a 1522 c 1332 g 1077 t ORIGIN 1 ctctgggtcc gttctcgaat atttaataaa actgatatta tttttaaaac tttataccgg 61 gaattctgtt aattcgctcg cccgccggcc tcagggtgag agctggagca ggctccaagc 121 gctcccgcgc ctgccggccc cctgcaagcc cccagccctc tgcactcgcc ttccccggag 181 cccaggcgcc cggccccggc cttctgttcg ttctcgcgat cgcactgagg ggctgggagg 241 cctctctggg ctccctgcgc ggagcaacac acacaccctg cgggaaaaag ccgcgtagga 301 ttttgttgcg gggggagctg ggatcagtcg gtttgggcac caaccccacg caggcctttc 361 ttcctgcctg tggcagccgg tgtcctcggc gtccgaaccc tgaggcgccc cgaaagacca 421 gtaaagaagc actatccagg caccagggtc tcctccaccc actcccggga gttggaggcg 481 gggggcgccg tgagagcaag ccccaggccc gcgccaagct ggctgttgcg aaacgtagcg 541 gaggcggaga gcggagggaa agagaagacc gcgggactgg agccctgtcg gcagggattg 601 agcgattcaa ggacatctcg agagagaaag cctggaaact gagctctcgc ctgctctggt 661 actcctgacg tggccgggac aaaattcctt cttcattaca gctccagccg ggctggttcc 721 ttacttttct ctcctcaaac gggtgaagag gcgggtgagg agcgcagccg gaccctaacc 781 atttatcaac cctggccttc tctctccaag 
cctggccaga gaaacctgag tgggcctaca 841 gggagaggtg ttgaggaggg tctccggagg gaaaaaatcc gccagagttt tccgggccct 901 tcagattccc tccagcgccc ccaccctcgc ctccagtgcc tccaccagtg aggagggggc 961 cttctcctgg cggggcggag atttcctggg aagaagcagg cagggaagta tgagacttgg 1021 ggctacgggg gaagaggatg gggccgaatg gctgcagata ccctgcgaag gctacccagg 1081 cgcctgccct gcccaccccc caggtccgcc ccattcctct tggtagctca gaaattgctc 1141 tcttgtgaac cttatggatt caggtccctg gaaagcctag cccagagacc ttttccccaa 1201 agggcacata acgggccttt tcctctgcca tgctcacgta atatttgtgg aggcttcaat 1261 cttgcttctc catgtcctca attaacatct attttctgtc taaaacctct ggcataggta 1321 ccacataagg atcgcacaca gggaaacaga gcagtcctct agaccatatg tctggctctg 1381 tccgtgtcca cacacacact tgaaaattgc cttgttgcca ctctggggac cctagcatgg 1441 tgaggtctag ccaactccac ctggctcctc acctccagat ccatcccaaa cagcctgcaa 1501 caccctcagc ttcatgaccc ccctatcttg ttctgaccac tcaggatccc tcctcctggg 1561 cgtcctgtct acctcttcag aagctaattc tactttctct tgctgatagg cacctcttca 1621 gcagtagtgg agccttaggg cctgagaggc aagggacttc tacctttggt ttagggaacc 1681 taagttaaac cccattggtt ctccagcctg ggcataggcc accttaccta acttaatccc 1741 ctttcctcag ccatttttga agattggaag accctggagg ttgggagaac gggactttca 1801 ttacaaaaag cactctgtgc gttataagtg gggcctcgtc ttttcccaat tttaaataga 1861 atttaaaagg ctagctcagg ggttgcccca acaaccagct tctccttcac agtgtaggaa 1921 atccagccac cggaaagcaa gctggcgctc caggatgcaa ttcccccata taggcaccag 1981 gtgtcgctgt gggcttgttg tcccggctac ccccaattcc aagaaccttt tttttttttt 2041 ccctccccct tccctctttc tctctctcac tccctctccc ccttggttgg gctttgccaa 2101 catatcgaga tgcttttcgc cggcttccat cactaacctc ccggaggtca tcaagccaaa 2161 tttatgagtg gccgctcgag tcacgtgact ctatttaagg ctcccttatt tgggaagagc 2221 gcataggata aagaaagaga tatctccacc tataaattgt ccactttgga gaacaaaaaa 2281 cccctcaact tcaaagagtc acaaatcacc cttaatcaaa aagggtgcag aaattttttt 2341 gggccctccc cgccatgagc tcctacgtag ccaattcatt ctataagcag agccccaata 2401 tccctgccta taacatgcaa acttgtggga actatggatc ggcctcagag gtgcaggcat 2461 ccaggtactg ctacggcgga ttggacttaa gcatcacttt 
cccaccgcct gcgccttcca 2521 actctctcca cggggtagac atggctgcca acccccgggc tcaccccgac cgccccgcct 2581 gcagcgccgc ggccgctccg ggacacgctc cgggcagaga cgaagcggct cctctgaacc 2641 ccgggatgta cagtcagaag gcggctcgcc cggcgctgga ggagcgagct aagagcagtg 2701 gggagatcaa agaggagcag gcgcagacag ggcagcccgc cggactgagc cagccaccgg 2761 ccccgccaca gatttacccg tggatgacca aactgcacat gagccacggt aaactttagg 2821 acttcatttt gcgctctcgg gtccgcctgg gttttatagg ccatgcgggg caaataaaga 2881 aaaaaaacct gcggccataa attttacgat ccaggcatca atggctcgta aaactgtcca 2941 ctaaaaggct tagaggctgt gtgcgcccaa atttacgacg acataattgg atcataggaa 3001 caaaacgtgt ataaaaggca atattcaatt tttgggggag agggagggag ttaaaaaaat 3061 agagggatct gaagggtgag gagcgcgggg ctccagagcg gggatccccc cgcggctccc 3121 tccctccctc ccctgcggag ccggctccgc cggcttgcgg ctccggagga ttccagcgac 3181 tcgggagggg cgggaggggg gtccccggtg ctcggatctc gagggtgctt attgttcggt 3241 ccgagcctgg gtctccctct tccccccaac cccccctcag cccctccggc tgcagagtga 3301 aggctgcggt ggaaagtttc ctgcctgggc ggaggcgcct ctcccggggc tgggctgggc 3361 tggcccgcct gcgcccgcgt ggcctgtctt gcggctctcg cctcctctcc ctccggccgc 3421 gggtgggggc ctgggctggg gtggggacgg gggaacgctg caagctattc accccttcct 3481 ggcttggggt ggggtttatg ttccagagac ggacggcaag cggtcccgaa ccagttacac 3541 gcgctaccag actctggaac tcgagaaaga attccacttt aaccgctacc tcactcgccg 3601 caggcgcata gagatcgcca acaacttgtg tctcaatgag agacagatca agatctggtt 3661 ccagaaccga cggatgaagt ggaagaaaga ttccaaaatg aaaagcaaag aggctcttta 3721 gaggcagcgg gggaggcccg cagagcgcgc ccctagccgg ttcctgtccc tgcgcctttc 3781 cttttcgcct ttcctctcta tatttcgggt cgggggcagg tgctggagca ctgggctccc 3841 gggccccaca gacaaaagcg cttttccttg gcattccgca tccctaccga cccagggttc 3901 ccgcggggct gtcggcgctg ccccatctcc cctcagctcg gctcagctcg gtacccgggg 3961 cccagggaag ctccgtagga cttccccgga gggctgcggc gtacaggctg gcgcagaacg 4021 aaccttggcc tgggccgtat ctccggctcc cagcctcagc gcggccctcc cgagttaagg 4081 tgggcccggc ccgcgccaca ggaccctcgc cggaccctct aacctcgccc tctcctttgt 4141 tcctggctgg acgggttaga cagccaaagg ctggcgagag tctggcccta 
aactcggggt 4201 gcttccttgt agcgactaaa ctagattttc acttatgaat gatttgcata tgaaaggaga 4261 gcatcggcct agggccccca cagttgctct atgctttcca aaccttatct ccacaacctc 4321 ttccccccaa aacccgggaa cctccccagc ctgcgcctgc tgcatgccct ctcaggccgg 4381 cagccccagc ctgctagcta gctcaactag tggggtttcc tggcactgga ccccagcaag 4441 tggtcctaga ggccctttgc tgtcccatag tccctgccac gaatttctgt gccctcctga 4501 cccattgctg ttgtccaact atttattgac tctgggtcct tcctgaaact atattttgtc 4561 atatcaaata aagagagaac aggactaaag atgcagtggc tcctgtctgt ttggggcatg 4621 tattgggtaa aattgtctaa atgggctgtg agatgtagaa ggagacccac aaactttaag 4681 ccaccccttt aaaaaatttt ctagcttcta ggggcagtga gtagagggag gtgaaatgag 4741 ctcctttcct gttccttggc tgcaacaaga gcctctgggg gaagagaagg gcacctaggc 4801 ctcgagtact ggcttttccc atgctgtggc tgtccaaggc cttcacgcct ctgcactcca 4861 ggaggcctct ggactggcag cagcccccac tgtgtgccaa agctgcggtg caggaaccag 4921 tggctccacc gtgctcagcc ttaggacctt ctgggcctct gaagatct // LOCUS HSHOX51 6305 bp DNA PRI 25-JUN-1997 DEFINITION Human HOX 5.1 gene for HOX 5.1 protein. ACCESSION X17360 NID g32394 KEYWORDS DNA-binding protein; homeobox; Hox 5.1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6305) AUTHORS Cianetti,L. TITLE Direct Submission JOURNAL Submitted (20-DEC-1989) Cianetti L., Dept. of Hematology-Oncology, Istituto Superiore Di Sanita', Viale Regina Elena, 299, 00161 Rome, Italy REMARK (revised by [3]) REFERENCE 2 (bases 1 to 6305) AUTHORS Cianetti,L., Di Cristofaro,A., Zappavigna,V., Bottero,L., Boccoli,G., Testa,U., Russo,G., Boncinelli,E. and Peschle,C. TITLE Molecular mechanisms underlying the expression of the human HOX-5.1 gene JOURNAL Nucleic Acids Res. 18 (15), 4361-4368 (1990) MEDLINE 90356367 REMARK (revised by [3]) REFERENCE 3 (bases 1 to 6305) AUTHORS Cianetti,L. 
TITLE Direct Submission JOURNAL Submitted (14-SEP-1990) to the EMBL/GenBank/DDBJ databases COMMENT See also (HSHOM4) for HHo.c13 clone. FEATURES Location/Qualifiers source 1..6305 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda 13G." /dev_stage="adult" /chromosome="2" /map="q31-37" TATA_signal 347..350 exon 376..2051 /number=1 mRNA join(376..2051,2593..6053) prim_transcript 376..6053 CDS join(1619..2051,2593..2927) /codon_start=1 /product="hox 5.1 protein" /db_xref="PID:g296652" /db_xref="SWISS-PROT:P09016" /translation="MVMSSYMVNSKYVDPKFPPCEEYLQGGYLGEQGADYYGGGAQGA DFQPPGLYPRPDFGEQPFGGSGPGPGSALPARGHGQEPGGPGGHYAAPGEPCPAPPAP PPAPLPGARAYSQSDPKQPPPGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTA YTRQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNT KGRSSSSSSSSSCSSSVAPSQHLQPMAKDHHTDLTTL" intron 2052..2592 /number=1 exon 2593..6053 /number=2 misc_feature 2619..2801 /note="homeobox" polyA_signal 3192..3197 polyA_signal 5971..5976 polyA_signal 6039..6044 BASE COUNT 1587 a 1433 c 1686 g 1599 t ORIGIN 1 ggatcctggt gggggagggt ggttaataaa gccgccatcc ttgggatgga ttatttttct 61 ttctttcttt ctttttttct ttcttaagaa gaatattctg gttgttcgcc tgcttggtaa 121 ccctgaccct ggcagaagaa tgagggaact cattgcttca aattgtcgcc aagcccatta 181 ggctacctga actgtctcag aaagtgcggg tggctgcgtc gaacggtggt ggctcagagg 241 aagagattgg ggccggcagc gacctaggta cctcactctg ggtgggaccc agaggttgta 301 acgttgtcta tatataccct gtagaaccga atttgtgtgg tatccgtata gtcacagatt 361 cgattctagg ggaatatatg gtcgatgcaa aaacttcacg tttcttcgga atagccagag 421 accaaagtgc gacatggaga ctagaagcag ccggcgctgg tcagccgcct cgttctgttt 481 tattaccttg gactccagga ggatcagctg cgcctggtga catagagcag cttttcctct 541 ccagaagctc ctcaccttta aacagagtat cctctgggtg ctgaaaagaa agaaagacag 601 aaagagagaa agagagagag agagagaaag agagaatgca agcctaattg gttgcatgga 661 tgcagggcca aagggctagg ttttggggta ctagggagtg aggtacaagg ccagcttgcc 721 cagtcccagc tctgccctcc aggaacatga ggtgcaaagg tacccaaatg ggggcttgct 781 tgtatttggg gcctgtggga agaaagcaag cttcaaagaa gcccagtggg gagctctagg 841 
gtgcattttg acaaggtgga ggtgcccttg ccaccatccc agcccacccc cagctacatg 901 ggcaagggca gcgagggccc cctgctattt tggcagggcc cagctttggc tgggaacccc 961 cgggcctggg cactggtaga aagcatggcg gttactcatt gcctaatttg attcaagctg 1021 gccagattct ggtaactttt gggtgaccct gatgaagaca aagccaggac ggcggccttt 1081 gtatggcaga tccctgctcc cgccggctgc aggcagggcg ggcaggcagg aaccctcctc 1141 gcctggggca ctctgcccaa ctcagaggcg agttcaccca cccacctttc attgctctgt 1201 accccaatag gaggattcat tctcccttga gctgtgccta cttggtgtcg gggggcgggg 1261 gttgcattca gctgggggtg agtggaaggg ccacggaagg ttggcaaaat cagtggcaga 1321 caaaagctgg gattacctga ggggaatggg gtgctgggga ctggaactac attaatatct 1381 ggcaggggct ctcaaatgtg ccatagcaag ctacttgatt acacgtatgt tatttagtta 1441 aatttgtgaa aattatgaga tgctcaccaa cccggtgata aacttgctcc ctcgccattg 1501 gctggcctgg tcacatggct gcccaacttt attcagttga cagcaagtag gagggcccta 1561 tggaaggaga aaaaaagaca acacgagaaa aattagtatt ttctaccttc tgaaattaat 1621 ggtcatgagt tcgtatatgg tgaactccaa gtatgtggac cccaagttcc ctccgtgcga 1681 ggagtatttg cagggcggct acctaggcga gcagggcgcc gactactacg gcggcggcgc 1741 gcagggcgca gacttccagc ccccggggct ctacccacgg cccgacttcg gtgagcagcc 1801 tttcggaggc agcggccccg ggcctggctc ggcgctgcct gcgcggggtc acggacaaga 1861 gccaggcggc cccggcggtc actacgccgc tccaggagag ccttgcccag ctcccccggc 1921 gcctccgccg gcgcccctgc ctggcgcccg ggcctacagt cagtccgacc ccaagcagcc 1981 gccccccggg acggcactca agcagccggc cgtggtctac ccctggatga agaaggtgca 2041 cgtgaattcg ggtaaggcta gggtccagta acctttctgt ccacatccca gcccgttagc 2101 ctgggtcctc tggaaggggg tgcgagtagg tgggggcgtg tggagcttcc atgggcgccg 2161 caattactct ccccataaat ttttatagct gagggagcag gtcaggacca tgtggctggc 2221 tgctcggctg tgggcgcaaa agggggtggg gatggggggg tgggggagga ctccattttc 2281 agagcagggg gaaggctgtg gaggagcggg ggatttccaa aatgcttgag ggttccggac 2341 ctggtggtgg gcccagaaga aggagcacat ttggggatcc cgcaagcctg gggtatgtgg 2401 gtgtgtttga ggaggtgggt gggagtgagc gtgtgcgccg gggagagggc gggagggagg 2461 aagcaagcga gcttgggagc gcgcggggag ggccgcgggc ctcggggcgc gccaggaagt 2521 gagcggcgga 
ggcgaggggc ctaactagtg gccgggcgct gacctgcctg tcctgtctgt 2581 tttgtctcgc agtgaacccc aactacaccg gtggggaacc caagcggtcc cgaacggcct 2641 acacccggca gcaagtccta gaactggaaa aagaatttca ttttaacagg tatctgacaa 2701 ggcgccgtcg gattgaaatc gctcacaccc tgtgtctgtc ggagcgccag atcaagatct 2761 ggttccagaa ccggaggatg aagtggaaaa aagatcataa gctgcccaac actaaaggca 2821 ggtcatcgtc ctcatcttcc tcctcatctt gctcctcctc agtcgccccc agccagcatt 2881 tacagccgat ggccaaagac caccacacgg acctgacgac cttatagaag tggggaccct 2941 gggcccatct ctccctgcgc accaggctga gccgaagctg cggggcaggc cgggcctgct 3001 gtcacctcgc tgggctctaa ggtactgtgg ggtggacctg ggacaagcag gccgccctcg 3061 gactaggtta gcatcctgcc cgagggcagc cccctcccta gagcgggatg gggatgggag 3121 ggggggcggg attctctctc taagtatatt atatggcagg agctactgag aacataaaat 3181 cttggcgagt cattaaactt atgaaaatca ccgctcttgg attttgaatt tgcaaatgaa 3241 ggttggatgc tttatcccac tgtgaatttg gacattctcc cccactccac ccctccaggg 3301 tgctttgtgg cttaataatg tgggggagtt gaggcagaag gttggccacc cctgtgctag 3361 gtgctttcag tggaagccag agagctgggt caggatttct ggactttctg ggttgtctat 3421 ggaatttcat gtgattaaaa aatatatatt ttgctcccag tggccccacc tccaaagaaa 3481 tgggtctaag aaggaagtaa aaatgggtta ttttatgttt agatatttgc ttaaatattt 3541 atttgttggg aaatgtggta cagaaataac tgacaccttc atgccaaaaa tcttaaaaag 3601 gtgaaagggg ctgaacttca gggagcagaa tcagagatat gtgcacttac ttctgaactc 3661 cacccctccc cactctctgg aaatgtatat aggggggcct taacccttcc agaaggaagc 3721 aaaggattca ctcaaagttg cattcttgaa aatatatttc cacatgtgtt tttttcagca 3781 ctgtgcttac aaccagttct gggtgattaa aggaaaggga aaaaaaccaa caaatggtcc 3841 aacattttcc ttctggggaa agaaaacaaa acctctatgc actgggtcat tagataatga 3901 ctgaattttc tgttccaact ggattccaaa tgccctaaat accctcatat agcagtgttt 3961 tacaggaatt agtgtatggc ccgtgtaggg gaggggctgt gcagtgggga gaaagtggga 4021 aggtgaggaa ctcttgcttt aagaaggaaa aaaaaaaaac cctaattgaa tctagaagtc 4081 cacaaaagtt agccttagag tttttttccc cctgaagttt taattttttt aaaaaccaaa 4141 tctaaggaag ttttcctcag ctcattaatt agaagcagaa tttgtaaaag tataaaagtt 4201 ttcaagcact cgtctttgcc 
ttgagaatag tggtttttta aagaatcact ctcaacaggg 4261 gagatgtcct ctagtcgttt ttcttctgcc tctcctggga agggttcaaa gttcattttt 4321 ctaaaatgct gaccctcaag cataaggagg aagaagtcaa agttaatggc cagagttcat 4381 atactcagat gaaaccagtc ttcccaaggc ctcaggctcc aaaaaaggtt gtagctatca 4441 aaaagtgacc aaagtgggaa agggagaaag gataagctta aaatttaatt ttaagatcca 4501 gaaggggggt atttttttca gtacttcaaa aacactttag aaggtttctg ttgtaattta 4561 aaaaatatat ttaagtggga gggaaaaaag agtttctctg taggcttgtt ctttggctgt 4621 gtctcctgag agctgagggc aggtattcac tgcagtccct aggctgaaat tccgcttctc 4681 tgaagtgtct tccaagcctt ggtcttttgt attagaccct ggggactgct ctttgtttct 4741 ccttggggtg agcctggctc tcagacttgc acatggcaat acttgaatgt caccacgtcg 4801 ggatattaaa gatggatatt cgtgcattat tcacatcatt gtttctatga caaaaagcac 4861 agagttcata catagtcaag acgtcttttt ctgacgccct cacgttgaga agctgaaaag 4921 gtattttacc gaagttcggg taaattacag aatcaggttc atccagagga caaattttct 4981 atttgattag ctgtatttca gccgggagga ctgacctcta aacccctaac cttttggact 5041 ctaactaccc ttctcttctt ttttcctctc taacatggag agcagtcttt ggatgtacca 5101 tttgaaagga gccgctatcc ttaggcaagt tggaaagttg ctaagctgct ttcctaaaac 5161 ccaaatctgt ctatacattg aaccttctct ttgagaaggg gaaaaaggta tatattttca 5221 caacatccaa ttacatatat ataatagaga tttgttgtat agattttccc ccacctcaga 5281 agttcaggtt actcaccccc agtttcatac caaatgccac acaggcttaa ctgactgcat 5341 ccctgcccca gaggaaagcc aagaaacatg ttttcatgag gaaaacccaa gctccttctc 5401 aaacatagcc ccactacttt ggaaagtaac ttaatcagag aaacaacttc tttgtttata 5461 agtctcagct ctccttctca gcttggaggg attcttttga aatgttaatg gagcctggat 5521 ggcccagagt gcagccccca accctgaggt cccagtcgga ccccagcatc catttgggcc 5581 cacaggagtg ggccagggaa ggggtagggc cccgtaacca cttagggcag ggaaggaaat 5641 gggtttccat ctgagaacgt gctttggaga aagctaggtg tggaaaagct ccaatgccca 5701 tttgctatta tttgtttcca gtttgttcct ttaaatatga gccagaagtg tttgtgttgg 5761 tgttttaaaa acaaaaacaa aaaccgtgtt ggggtcctga ctgggggagg gggagagtga 5821 agtgtttgct gaggacattg ctcctctgac tcccatctca ctttgtccat cgcagccttt 5881 tgttgggaga tgacactgtc agtcagccca 
tgatgtctgt tcacacgaga tgctttttta 5941 atagaattga ccaatgtttt gctgccactg attaaagtat tatttatact aattgttgct 6001 tgtagttttg atgtaattca ttgatctata tttaaaataa taaaaggtgt agcaaaatct 6061 ccctcctgtt tggtgcctta acagaagcat tcatcctttg ttaagtcttc taaaagctaa 6121 catttaacat aaacaagtta ttattttctg caataaatta ggcaccattt tttggggggt 6181 gcctaaagtg tgaaggttaa accatgtaag gcttagcaat tctattatta ccacctcctt 6241 aatgtacaca cactcccagt tggccaccat attttgtgag cattgggaag cctggggttg 6301 aattc // LOCUS HSHSC70 5408 bp DNA PRI 09-MAY-1995 DEFINITION Human hsc70 gene for 71 kd heat shock cognate protein. ACCESSION Y00371 NID g32466 KEYWORDS heat shock cognate protein; hsc70 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5408) AUTHORS Dworniczak,B. and Mirault,M.E. TITLE Structure and expression of a human gene coding for a 71 kd heat shock 'cognate' protein JOURNAL Nucleic Acids Res. 15 (13), 5181-5197 (1987) MEDLINE 87259994 REFERENCE 2 (bases 1 to 5408) AUTHORS Rensing,S.A. and Maier,U.G. TITLE Phylogenetic analysis of the stress-70 protein family JOURNAL J. Mol. Evol. 
39 (1), 80-86 (1994) MEDLINE 94343547 FEATURES Location/Qualifiers source 1..5408 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phage521" /clone_lib="human liver DNA in lambda L47.1" misc_feature 4..17 /note="heat shock consensus element" misc_feature 109..113 /note="regulatory sequence" misc_feature 178..191 /note="heat shock consensus element" TATA_signal 199..203 mRNA join(227..304,1035..1244,1567..1772,2097..2249,2337..2892, 3104..3306,3535..3733,3881..4113,4445..>4630) exon 227..304 /number=1 prim_transcript 227..>4630 intron 305..1034 /number=1 exon 1035..1244 /number=2 CDS join(1040..1244,1567..1772,2097..2249,2337..2892, 3104..3306,3535..3733,3881..4113,4445..4630) /codon_start=1 /product="71 Kd heat shock cognate protein" /db_xref="PID:g32467" /db_xref="SWISS-PROT:P11142" /translation="MSKGPAVGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAF TDTERLIGDAAKNQVAMNPTNTVFDAKRLIGRRFDDAVVQSDMKHWPFMVVNDAGRPK VQVEYKGETKSFYPEEVSSMVLTKMKEIAEAYLGKTVTNAVVTVPAYFNDSQRQATKD AGTIAGLNVLRIINEPTAAAIAYGLDKKVGAERNVLIFDLGGGTFDVSILTIEDGIFE VKSTAGDTHLGGEDFDNRMVNHFIAEFKRKHKKDISENKRAVRRLRTACERAKRTLSS STQASIEIDSLYEGIDFYTSITRARFEELNADLFRGTLDPVEKALRDAKLDKSQIHDI VLVGGSTRIPKIQKLLQDFFNGKELNKSINPDEAVAYGAAVQAAILSGDKSENVQDLL LLDVTPLSLGIETAGGVMTVLIKRNTTIPTKQTQTFTTYSDNQPGVLIQVYEGERAMT KDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKG RLSKEDIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDED KQKILDKCNEIINWLDKNQTAEKEEFEHQQKELEKVCNPIITKLYQSAGGMPGGMPGG FPGGGAPPSGGASSGPTIEEVD" intron 1245..1566 /number=2 exon 1567..1772 /number=3 intron 1773..2096 /number=3 exon 2097..2249 /number=4 intron 2250..2336 /number=4 exon 2337..2892 /number=5 intron 2893..3103 /number=5 exon 3104..3306 /number=6 intron 3307..3534 /number=6 exon 3535..3733 /number=7 intron 3734..3880 /number=7 exon 3881..4113 /number=8 intron 4114..4444 /number=8 exon 4445..>4630 /number=9 polyA_signal 4843..4848 BASE COUNT 1411 a 1079 c 1420 g 1498 t ORIGIN 1 gagcttgaaa gttccagaac gctgcggtga gtgcgttatc gtgaggcggc gcggtggggt 61 
gggtgcggaa gggggcgagg cgaggagtgg agccgcgttg tgattgtgat tgggtcttgt 121 aagggcagcc ggactctatt ggccgggaac ctaatgcagg aagcaggcgg accccttctg 181 gaaggttcta agatagggta taagaggcag ggtggcgggc ggaaaccggt gctcagttga 241 actgcgctgc agctcttggt tttttgtggc ttccttcgtt attggagcca ggcctacacc 301 ccaggtaaaa cctctgctca agagttgggt tgtgggtctg ggagcgtgca gcctccacac 361 aggcctgttg ggcttgctga ggcttggggg ttctgagaat ctcgtcgagg cgagtgtgcg 421 gctccttcta ccggcttaaa gggcctcagt tttcggtggg atggcagcgg tatttggttg 481 cagccggcag acggaaatgt agggagtggg ccgcatggcc ccaggggagg ctgggagacg 541 cccggccgcg tggcggggga gggttgctgc atcggtttgc ctggcgcgcg gggaagtgga 601 gccagcgttt tctttcaccc agttccctgc ttagtccagt cccaccgtgg ttcttcagag 661 ctgttcttgg cgtgcttcca gtatgggggt acattccgga gtagttaaaa gcccgttgac 721 tcccgggggg cactggcacc tggcgaggga ggggaacaga cagtgctcag ttcggggtaa 781 gaccacgtgt tgagcaacgc cccacgccgt ctgggtcgat gggtccttca tctagggcgt 841 gctgtgctgc ggttggcacg gcaacctgga ctgcagcact agttctggac ctcgcgcgtg 901 cttagacagg aggtgatggg cactattacc tcttggcagt ggccatacgt ttttcctggt 961 taagtgttct gttaagggat gagggaaata ttttgattaa ttgaattttt aaaccagatt 1021 tttctttttt tcagcaacca tgtccaaggg acctgcagtt ggtattgatc ttggcaccac 1081 ctactcttgt gtgggtgttt tccagcacgg aaaagtcgag ataattgcca atgatcaggg 1141 aaaccgaacc actccaagct atgtcgcctt tacggacact gaacggttga tcggtgatgc 1201 cgcaaagaat caagttgcaa tgaaccccac caacacagtt tttggtgagt tcctaatttt 1261 aaatgacaga acaaatataa acagggctag gaagcacaaa agtttatgaa acgtgaggag 1321 ggaacttttt gattttagaa aaactgagct gagagacttg ttatcaagtc tgttataaaa 1381 caggttgtag aaacctttca ggctgaaatc tggataacgt aggaggttga agtttgaacc 1441 tttgctaggt atatggtagt tgaattcacc tacctatgaa ctgttaggta tttgagtaat 1501 catggacttg agttttatct gaagagctat gaaattgaaa gtgttttcat ttgacacctt 1561 ttacagatgc caaacgtctg attggacgca gatttgatga tgctgttgtc cagtctgata 1621 tgaaacattg gccctttatg gtggtgaatg atgctggcag gcccaaggtc caagtagaat 1681 acaagggaga gaccaaaagc ttctatccag aggaggtgtc ttctatggtt ctgacaaaga 1741 tgaaggaaat tgcagaagcc 
taccttggga aggtgaggtt ggtttttcag tatggggtgc 1801 attccggagt agttaaaagc ccgatgactc ccgggggcac tggcacctgg cgagggaggg 1861 gaacagatgg ggctcagctc agggttaaga ccacgtgccc aacagtgccc taggctctct 1921 aggtagatgg gtctgtcaac accagaaacc agtgaatctt gacaattaca cagtaattta 1981 cattttggtg gggggggtgc tccagctgtt gtttcaccag cattaatcca tttgctggag 2041 tttgcatata tgtaagtata atagttacca atctgtggtc ttttccttat tcctagactg 2101 ttaccaatgc tgtggtcaca gtgccagctt actttaatga ctctcagcgt caggctacca 2161 aagatgctgg aactattgct ggtctcaatg tacttagaat tattaatgag ccaactgctg 2221 ctgctattgc ttacggctta gacaaaaagg tatgtaccat ttgtgatgca agttcggatt 2281 attttaagat taatttgatc catcgtaaat ttaaatgaga ttgtttttaa cggcaggttg 2341 gagcagaaag aaacgtgctc atctttgacc tgggaggtgg cacttttgat gtgtcaatcc 2401 tcactattga ggatggaatc tttgaggtca agtctacagc tggagacacc cacttgggtg 2461 gagaagattt tgacaaccga atggtcaacc attttattgc tgagtttaag cgcaagcata 2521 agaaggacat cagtgagaac aagagagctg taagacgcct ccgtactgct tgtgaacgtg 2581 ctaagcgtac cctctcttcc agcacccagg ccagtattga gatcgattct ctctatgaag 2641 gaatcgactt ctatacctcc attacccgtg cccgatttga agaactgaat gctgacctgt 2701 tccgtggcac cctggaccca gtagagaaag cccttcgaga tgccaaacta gacaagtcac 2761 agattcatga tattgtcctg gttggtggtt ctactcgtat ccccaagatt cagaagcttc 2821 tccaagactt cttcaatgga aaagaactga ataagagcat caaccctgat gaagctgttg 2881 cttatggtgc aggtaacaat ggtatctcaa ttaaccctaa aggcaggcag gcccaaggtg 2941 actcgctgtg atgagtgatt gttaaacatt cgtagtttcc accaaaagct tggctaatga 3001 tggcaacacc ttccttggat gtctgagcga gtgatagtta aaacaggagc tatgtactgg 3061 gttttctttt aacttctttt aacgttaact ttttgtttgc tagctgtcca ggcagccatc 3121 ttgtctggag acaagtctga gaatgttcaa gatttgctgc tcttggatgt cactcctctt 3181 tcccttggta ttgaaactgc tggtggagtc atgactgtcc tcatcaagcg taataccacc 3241 attcctacca agcagacaca gaccttcact acctattctg acaaccagcc tggtgtgctt 3301 attcaggtat gtttctgtac ttctcttgtt tggcttactg ataacagata aagggaagtc 3361 ttgactgact cgctatgatg atggattcca aaaccattcg tagtttccac cagaaagtct 3421 tatgttggcc agttccttcc ttggatgttt 
gagcgaccat tcttccttag caggacccta 3481 gcactgtcac agacctggag tccattgtag taatttgttt tatttcctac caaggtttat 3541 gaaggcgagc gtgccatgac aaaggataac aacctgcttg gcaagtttga actcacaggc 3601 atacctcctg caccccgagg tgttcctcag attgaagtca cttttgacat tgatgccaat 3661 ggtatactca atgtctctgc tgtggacaag agtacgggaa aagagaacaa gattactatc 3721 actaatgaca agggtaagga ggcactgtca tctggtcttg acagggataa tggtatttca 3781 attgagttac tggtgaataa gggcgtctag ctaagagaaa ctagagttac acatacacag 3841 gtaatttaag gcttttactt agagttaatt tctttcctag gccgtttgag caaggaagac 3901 attgaacgta tggtccagga agctgagaag tacaaagctg aagatgagaa gcagagggac 3961 aaggtgtcat ccaagaattc acttgagtcc tatgccttca acatgaaagc aactgttgaa 4021 gatgagaaac ttcaaggcaa gattaacgat gaggacaaac agaagattct ggacaagtgt 4081 aatgaaatta tcaactggct tgataagaat caggtttgtg tttttttttt tttttttcct 4141 cccccacgca atggagggga aggggatggt aaaccaagct tgagctggat ttcagtgtag 4201 ggtcacaatg atgaatggtc caaaacattc gcggtttcca ccagaattca aggtgttggc 4261 aactaccttc cttggatgtc tgagtgaccc aagatgttaa ggaagaataa ggccctattt 4321 taatgttggt atgggccctc ttgtaagagt ttgctccaga cttttagtat cagattgcgt 4381 cagggagaaa gaagggttat taacattaaa agaacttgca gtaattcctt tttctcttcc 4441 tcagactgct gagaaggaag aatttgaaca tcaacagaaa gagctggaga aagtttgcaa 4501 ccccatcatc accaagctgt accagagtgc aggaggcatg ccaggaggaa tgcctggggg 4561 atttcctggt ggtggagctc ctccctctgg tggtgcttcc tcagggccca ccattgaaga 4621 ggttgattaa gccaaccaag tgtagatgta gcattgttcc acacatttaa aacatttgaa 4681 ggacctaaat tcgtagcaaa ttctgtggca gttttaaaaa gttaagctgc tatagtaagt 4741 tactgggcat tctcaatact tgaatatgga acatatgcac aggggaagga aataacattg 4801 cactttatac actgtattgt aagtggaaaa tgcaatgtct taaataaaac tatttaaaat 4861 tggcaccata caattgcttt gagtctttaa ataatctccc aggccagcgg tgggagaagt 4921 aggcttaggt gattatgtga ctcttacttt ctccttcctc ttaagcttga gttaacaagg 4981 gctgggtggc aagttgccct tcagagcatg tggatggtac attttggaat tcagagcttt 5041 gagaagggga gcataagaaa ttggatctgg atcaaactaa ccttagtcct taggctggag 5101 aggcagaagc tgacttaatg gtgttttcta aacttattct 
gtgtgtaagc ctgcctagga 5161 gcagaggctt tcctggaggg ttgtgctaga tgagtaagaa tttagataca gaatcaaata 5221 atgggcagtg aatattaagc tacatggcag aggtatctga atgtcaatcc cttatatgag 5281 ccactgccct gtgggcttcc atttcttctg agttaagatt attcagaagg tcggggattg 5341 gagctaagct gccacctggt taattaaggt cccaacagtg agttgtgata gcctagggga 5401 gcaggctg // LOCUS HSIFNAR 32906 bp DNA PRI 25-NOV-1996 DEFINITION Human IFNAR gene for interferon alpha/beta receptor. ACCESSION X60459 NID g32671 KEYWORDS Alu repeat; interferon; interferon alpha/beta receptor; interferon receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 32906) AUTHORS Lutfalla,G. TITLE Direct Submission JOURNAL Submitted (20-JUL-1991) G. Lutfalla, CNRS, Lab. of Viral Oncology IRSC, 7 Rue Guy Mocquet BP8, 94801 Villjuif Cedex, FRANCE REFERENCE 2 (bases 1 to 32906) AUTHORS Lutfalla,G., Gardiner,K., Proudhon,D., Vielh,E. and Uze,G. TITLE The structure of the human interferon alpha/beta receptor gene JOURNAL J. Biol. Chem. 267 (4), 2802-2809 (1992) MEDLINE 92129376 REFERENCE 3 (bases 1 to 32906) AUTHORS Burglin,T.R. and Barnes,T.M. 
TITLE Introns in sequence tags JOURNAL Nature 357 (6377), 367-368 (1992) MEDLINE 92278426 FEATURES Location/Qualifiers source 1..32906 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DAUDI" /chromosome="21" /map="21q22.1" prim_transcript <1..31791 prim_transcript <1..32293 mRNA join(660..829,11201..11324,16768..16943,19033..19187, 19300..19441,21017..21131,24861..25060,25159..25313, 28528..28678,29408..29553,31085..31791) exon 660..829 /number=1 mRNA join(660..829,11201..11324,16768..16943,19033..19187, 19300..19441,21017..21131,24861..25060,25159..25313, 28528..28678,29408..29553,31085..32293) gene 754..31318 /gene="IFNAR" CDS join(754..829,11201..11324,16768..16943,19033..19187, 19300..19441,21017..21131,24861..25060,25159..25313, 28528..28678,29408..29553,31085..31318) /gene="IFNAR" /codon_start=1 /product="interferon alpha/beta receptor" /db_xref="PID:g32672" /db_xref="SWISS-PROT:P17181" /translation="MMVVLLGATTLVLVAVAPWVLSAAAGGKNLKSPQKVEVDIIDDN FILRWNRSDESVGNVTFSFDYQKTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKL RIRAEKENTSSWYEVDSFTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDG LSFTYSLLIWKNSSGVEERIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPV HCIKTTVENELPPPENIEVSVQNQNYVLKWDYTYANMTFQVQWLHAFLKRNPGNHLYK WKQIPDCENVKTTQCVFPQNVFQKGIYLLRVQASDGNNTSFWSEEIKFDTEIQAFLLP PVFNIRSLSDSFHIYIGAPKQSGNTPVIQDYPLIYEIIFWENTSNAERKIIEKKTDVT VPNLKPLTVYCVKARAHTMDEKLNKSSVFSDAVCEKTKPGNTSKIWLIVGICIALFAL PFVIYAAKVFLRCINYVFFPSLKPSSSIDEYFSEQPLKNLLLSTSEEQIEKCFIIENI STIATVEETNQTDEDHKKYSSQTSQDSGNYSNEDESESKTSEELQQDFV" intron 830..11200 /gene="IFNAR" /number=1 misc_feature 1609..1892 /gene="IFNAR" /note="duplication copy 1" repeat_region 1890..31802 /rpt_family="Alu" /rpt_type=DIRECT repeat_unit 1890..2135 /gene="IFNAR" misc_feature 4083..4388 /gene="IFNAR" /note="duplication copy 2" repeat_unit 9294..9476 /gene="IFNAR" exon 11201..11324 /gene="IFNAR" /number=2 intron 11325..16767 /gene="IFNAR" /number=2 repeat_unit 12422..12714 /gene="IFNAR" repeat_unit 12747..12926 /gene="IFNAR" repeat_unit 13307..13513 
/gene="IFNAR" /rpt_type=TANDEM repeat_unit 13996..14328 /gene="IFNAR" repeat_unit 14507..14829 /gene="IFNAR" repeat_unit 15413..15725 /gene="IFNAR" exon 16768..16943 /gene="IFNAR" /number=3 intron 16944..19032 /gene="IFNAR" /number=3 repeat_unit 17440..17757 /gene="IFNAR" repeat_unit 17892..18182 /gene="IFNAR" repeat_unit 18341..18641 /gene="IFNAR" exon 19033..19187 /gene="IFNAR" /number=4 intron 19188..19299 /gene="IFNAR" /number=4 exon 19300..19441 /gene="IFNAR" /number=5 intron 19442..21016 /gene="IFNAR" /number=5 repeat_unit 19640..19938 /gene="IFNAR" repeat_unit 20533..20853 /gene="IFNAR" exon 21017..21131 /gene="IFNAR" /number=6 intron 21132..24860 /gene="IFNAR" /number=6 repeat_unit 22903..23199 /gene="IFNAR" repeat_unit 24320..24600 /gene="IFNAR" exon 24861..25060 /gene="IFNAR" /number=7 intron 25061..25158 /gene="IFNAR" /number=7 exon 25159..25313 /gene="IFNAR" /number=8 intron 25314..28527 /gene="IFNAR" /number=8 repeat_unit 27301..27611 /gene="IFNAR" repeat_unit 28111..28407 /gene="IFNAR" exon 28528..28678 /gene="IFNAR" /number=9 intron 28679..29407 /gene="IFNAR" /number=9 exon 29408..29553 /gene="IFNAR" /number=10 intron 29554..31084 /gene="IFNAR" /number=10 repeat_unit 29830..30126 /gene="IFNAR" exon 31085..32293 /number=11 exon 31085..31791 /number=11 repeat_unit 31504..31802 BASE COUNT 9789 a 6631 c 6954 g 9532 t ORIGIN 1 gggcccgtgg ctgttctctc caagggacca tctcgcccct cagccaagtc gcccggaaaa 61 cgagcgctcg accgcctctg ccccgctctc ggtctgcaca cagcaacggt ctggtcgctc 121 agccacttcc tccttccagc ctcatctggt tcccaggccg ctggggactc ccaacgccac 181 tgtccaagac tctagggtca gcaagcgccc cgggcggaga agggcgagga cgaagagcgc 241 cgggccgcga ccaggagccc acccgcgccc tccgactgca gacatgggga agagacgcgg 301 ggactccaaa gtcgctgggt ctgcgcaggt gtgtgccgcg atcctgtgaa ggtcaaggcc 361 tcctgtgagg gggagtcgtc ctggaatgcg atggtgaagt gctccagacc ggccataggc 421 cggaaagagt gaggaagaag agaatgcagg aggcctgcga tttctaaggc gcgcgcgcac 481 aggggtgctg caattaggat ggggcaatgg gagcttggag aaggggtgct agctaggagg 541 aaaggcgcgt 
gcgtggagga acggcgcgtg cgcggagggg cggtgtgtgt gtgtgtgtgt 601 cagaagaggc ggcgcgtgcg tagaggggcg gtgagagcta agaggggcag cgcgtgcgca 661 gaggggcggt gtgacttagg acggggcgat ggcggctgag aggagctgcg cgtgcgcgaa 721 catgtaactg gtgggatctg cggcggctcc cagatgatgg tcgtcctcct gggcgcgacg 781 accctagtgc tcgtcgccgt ggcgccatgg gtgttgtccg cagccgcagg tgagaggcgg 841 ggaggagagt cttggcgcag ggcgggaggt agggcacgca gctgggctac gggggcggcg 901 atgctgttgg gggcgacaga cgcccagtct gggaaacctt cggtccactt tgccgcgcca 961 aagattaaac ccgacctggg ctcgcaaatc aaccaggaga aagtggtgtt ctgggtcctc 1021 tcttgccgct tgcctgtgcc gtgtacggtc ctcgggagcg ccgggtccac ccgtgaaatg 1081 gcgtgcagag ctttgtgtcg agtttgattc tttccgggaa agtaccgcgg ctccgctggt 1141 ctgtttgata ataaacaaaa catttcccga aatgcatttc ctcaatctgg tgaataggcg 1201 cccagtccca cgggggagtg agctaagtgt ggccaggatt tctctgccct caggattcat 1261 acaggccaat aaaaaatgac agtcggagtg actgaagcat aattgtcagt tctcgaggtt 1321 tactaagcct gcttcagggc gcatccggga aaaacgtgag ccacagacac gtctgtgact 1381 ggttttccag aggttttcag gaggtggggc aggaaggcag taaggcaaat agttctgtga 1441 gaatttagtt agtgcccaag aaatctgtat tttaagatac catgagtgtc ttgaagagaa 1501 aaagagagta aagggagtca gttatgcaga tgcctctggg taggtggagt cttgactttg 1561 ttttgcacct gagaagataa gcttgtaatt gacatggtca gtgtggaagg cctgaactag 1621 tttttcagat taactttgga atgccatttg ttgagaggag aggtccattc acattgttgg 1681 ggggcttaga attttatttt tggtttactt aactacctac ttttttccca aagaaataat 1741 catgttcaaa actcaatgaa atgttatctc aatgagattg taatgttgct cagtctttca 1801 gaagtcataa aaatagtcat actttttaaa ctagtaaatg caattctagg acccaagact 1861 aagaaaactg tccaaaatat gggaagacat ttttatttat actttttttt ttttttttga 1921 gacagcctgt tgcccaggct ggtgtgcaga ggcgtgatct cagctcactg caacctctgc 1981 ctcccaggtt caggcgattc tcctgcctca gcctcttgag tagctggaat tacaggtgca 2041 cgccaccaca tccggctatt ttgtatttct agtagagatg gggttttgcc atgttggcca 2101 ggctggtctc gaactcctga cctcaaacaa tccacttcag cctcccaaag tgctgggatt 2161 acaggtgtga gccaccacgc ccgatctatt tatgttttag actttagttt taggaactag 2221 acttaggttg tagacctgaa 
gttacagttg gcatcccctt gtttatggga gaccagcaaa 2281 gaattgactt atgaatgatc tgtcggggta gcccttcgca gatggctgag tctttttatc 2341 cttccttggg gatttggctg aagcataatg ctagtaacag ccattcattt ggaagagggt 2401 gttgcagtca cttggcctct gggcttaact aacttccctt ttgcgtaaga aagttgggag 2461 aggtcctgag attttttttt ttcttttaca aaaataataa aagaaaaata caaaggcagg 2521 gcgtcaccag gctaatttgc tgaaattctt catacttact tgggatgcat gggggaatgg 2581 caactattac aaggttgata gagagtgaga ggtaattttg tagtgaggag ggtacccaaa 2641 gagaatcagg gctagcatcc cagtgcagga gtccaggtgg tctgtgagcc tgcaagtgag 2701 ttcaaaatac tttacctgtt ggcttaggaa accatatacc tgtgtggtac atgatatcct 2761 atttctcaaa cttttccaca aatatgtagt atacttttcc taggatgtgg gtactagggt 2821 cctggtgcat cccatagtaa tttctttgaa gtcttttgtt tatgaaaaac ttgcattaaa 2881 ctcatgtttg gtttgatgtg aaaaaagctt tcccagaatt ttcatataaa attgacattc 2941 caggagggtg tgcattaaga aattaattta aaattaggtt agcggacagg tggcaagata 3001 taatttatag gcatttatca tttcttaagc tcttagctgt gtgccaggca gtaatctaag 3061 tgttttaaag gtataaactc attaagcttg cagagcagtg tagatactgt tatgcttagt 3121 taagagacaa ggaaactgag caacggagag atcaagtaac ttgcctgagg ctgtataggc 3181 cagtccgtgg aggagtggga agtcaaatcc tgagtctgga ctaatagcct tggggcttaa 3241 ccagactaga ctgttttccc aaaggccaaa caatttaaat tcctgattgt ccacttgtat 3301 gttaagggcc taattatcag tatctttcag ttaactgtga gcccaaaaat atctgagacc 3361 ggtctgaaat caacttagaa agtttatttt gccaaggtta aggatgcatt catgacacag 3421 cctcaagagg tcctggccac atgtgcccaa ggtggtcaag gtccagcttg cttttataca 3481 ttttagggag acatatcaat cagtacatct aagatgtaca tttgattggt tcgatctgga 3541 aaggcaggac aactcaaagc aaggcttcca ggtcataggt agatttaaag attttctgat 3601 tggtaattgg ttgaaatcaa tggaaaggaa tgtctgggat gtcctaaggg gttgtggaga 3661 ccaaaatttt atcatgcaga tcaagcttgt ccaacccaca gcccactggc cgcgtgcagc 3721 ccatgatggc tttgaatgca gtccaacaca aattcgtaaa ctttcttaaa acatgagggg 3781 tttttgtttt gtttgttttc tttctgtttt tttgtttgtt tgtttttagc tcatcagata 3841 ttgttagtgt attttgtgtg tggcccaaga cagttgttct tccaatgtgg cccacggaag 3901 caaaaagagt ggacaccctt gatgcagatg 
aggcctgcag atagcaggct tcagagagaa 3961 tagattgtaa ctgtttcttg tgaaacttaa cgtctgtgtt gttgttaatg gtggttagct 4021 tttctgaatt ccagaagaga ggagggtata atgaggcata tccaaacctc acttcgcatc 4081 atggcctgaa gtagtttttc aggttagctt tggaatgccc ttggtcagga gaaggggtcc 4141 attctgatgg ttgggggact tagaatttta tttttggttt acgtaactat ctactttttc 4201 cccaaaaaaa atcatgttga aaactcaaga ccttaagttt acagctcaag ctgatcctga 4261 aagttatctc agtgagattg taatgttgct cagtctttca gaagttgtat aaatgtacct 4321 tttagcctag taaatgcaat tctaggactc tagattaaga aaactatcca aaatatggga 4381 agacatttac ttacatattt gtacagatat ctatatttgg atttatgtca gacttaaaga 4441 tatagtaaac cattttggtt cgtgaaaatg aatgtgaaag gtgaaagcac caaaattata 4501 aatgcattat atatatatta gattaaatgt ccatttatac atttacatgt ggggtattaa 4561 ttgttaatat actaatataa acattttaat atactagtaa attaatatat ggtatgttaa 4621 tgttacataa caatttatga aaaaattatt aaaaggagat tgccaataat aatagagcaa 4681 aatctgtgaa aactaaaatc ctgtttaata actttatttc caagcccctg tccatttatc 4741 agtcagttat ttgttggtct tccaaatgcc aacatgcctg taacttcagg gtctccttct 4801 tttagcagta aaggattttt tttctccctg aggaaatcac tagctgtacc ctataacatt 4861 ttctattctt gtctcaaggc ccaaaattac tagaattata atattctgca caagatattg 4921 taaatactac aaatgccatc agatcttttg ttaaactctg taaacccagt tgctaccctt 4981 tggctgcata gaatctataa attctaacaa atggatgaga accaagttaa gaattctaga 5041 gtgggataag gggatgtgat gtatgctatt cagggatgaa gttttagaga ctccaagtat 5101 gaaacagaaa attgaggatg gggagacctg atgggtacca cattaagttt aaaaaaaaaa 5161 aagtgttttc tcagctgtct ccccttttga ccccatacta taagatctaa ctggaaagag 5221 caagaagctg tctcagttaa gaaagttcta tacaagaacc cctgtggtag aacaagtcta 5281 caacaagata acatgattgc cccaaggcag aagtaaaaga aatattgtag gaaccataaa 5341 tgaatttaaa agctgatctt ttaaaaaata tataacttat caaaggtgag agatcagtcc 5401 tttatgatat ttaaggagtt cctgtgaatg caccattatg ctggatactc tgaaaaatgc 5461 aaatcaagta tgaagcgata taaattctgc ccttaaggaa cttgcaatct agttggagat 5521 ttatgtttaa cacaaagaag cagtttttac aaatacttct tttgaggatt ctggaaagat 5581 ggcagagtag aaagcaccag gaatctgtct cctcacctag 
acaagaattg tactggcaga 5641 atctgatgta actattttgg aactctggag tctattgcag gcttgcaact tccaggggac 5701 agacttagat ggtaaattgc agttaatttt ggtctatttg agctcttagt acagtagcag 5761 ctacccatct tccacctctc aacccttagc agacagctgt gcacgttatt agagcaatct 5821 gcacacaact tacaggaatc aggtgggcaa aaagaaacac accatccaga tatctgggat 5881 ctgtgctctt gattggtcat tgctccttct cgtcacaaaa gcgcaaagag atgggcggtt 5941 gttgttgtta caccttactc attgttacaa ccccctcccc actcactccc ctgccccatg 6001 gatttaaagg gctagaaatc tccctctgct tcatttttct ctttcacttt tgggagccgg 6061 atattaaaga ctaggacatt taaaaacaac tgcatttatg gggaaaatta gaaagtcact 6121 gtacatacct agggaaaggc tcaggagaga ccttaagttt atagctcaag ctgatccttg 6181 gcacaggatt aacaacaatt acacagcaat taaacaaagc aaacacacac acacacaatg 6241 tagtctggga gagtgatgtt agcaagatgg ccaactagaa gccccttctg ctcatcccct 6301 cataagacag ccagaacaac aaataaacaa ctacatttta gcaaacttaa ctaagagtgc 6361 tagagtgcat caaagaagta acagaaatcc tagtgagcag agaaaagtgg gatggccaca 6421 tggagaatgg gaggaaatgc caggattcct aggatcatct gggaacgagg aggaacttct 6481 tcctatggca aggagataaa caagaagatc ccagctacac catgggcacc aacagatctc 6541 accactgggg ttccttagag tccagacagc aactaagcca agctgaggaa gctgcctaga 6601 gtctgcacag ctgtgttccc tccagagaag gagccaacac tgtgctccac ctgctgtggc 6661 ccacacagct actgtgccac tccaccttgg aactggaact acttctggag tgtgtcttgc 6721 ttcggggact tgtagccatg gcttcctttc atctgaacca cgccaggctg gtggctcaac 6781 atccctaagc taaactttaa gcagctgtta cacccttccc tgtggagtca agcagaggtg 6841 gaactactcc acctactcct accccactgc cctttggcta gagctgaagc agtatcctat 6901 ttcctgggaa cacactacct tggccaccca gagcagtcat gcacccctgt gcctaagctg 6961 aagtggtata cggcattcca gggaaacagg gcctgaacca cccagagcag tcacactccc 7021 cgggcctcag ctgaaggagc acatcactcc ctgcagaatc agtgccctgg ccaaaccgag 7081 cagctatgca tcccagggct gagctgatgt agtatcctat gtctcaggga aacagagcag 7141 tggctgagct gagacaccct gttctacagg ccaaataact gtagtacccc gcttccctgg 7201 agctggacta gtcctctaga gcctgagctg ctgagacact ctgaggagtg gagtcatcac 7261 tgtgctgttc cctacccccc attcccagcc cagacaacag ctatgcttca 
ccattttgta 7321 gtacttgctg ctgctgctac acttggcctc acagtctggg gtactgctta tccctagcta 7381 ttccaagatt tagagtcatg acttcatggt gcctcatccc ttgggacctg tgttgccact 7441 gagccctatt agctcatatt cccaaattaa aaccataccc tgctccccag gcccaaacct 7501 ccagagaacc ccttctttat agtcaggcca gtgctatgcc ctgaccccca taggggtaga 7561 atcaaagcta caacccattc tctgggccta agctgccagg aagcacctcc aggtcacaaa 7621 tcctggctct gtgagcaact acgtccaagt cctgccacag agagcaaacc cacctcagca 7681 accaagtgtc tcagtaggtt tgtgagatcc tgagcctagg acctttgccc agtagccact 7741 ctgaatacct tcatctggaa tcaagagctg ctgcagctgc atatatatag cccctgtcag 7801 acctgacacc aagaggtgtt gccttggctg agtctcccca ttgtggggaa gacaagaata 7861 ggaggatcct aaaagccttc aacactgaga acattaacaa cctacactac cactgccaca 7921 aacttctgca gcctaggccc ctgaggcacc catttactac tgacattgaa cacaggtaaa 7981 gcagctgctt taccatacca ctgcatctgt caggaaacag tcaccacacc cttcgaaacc 8041 agcatactga aacccaactg ctaataaaag actttatctt tgaaagccac tgtctgtaaa 8101 gtttggaaga ggtaattatc ctatcaacac aaagacacaa aaacatgaaa aagcaaggaa 8161 atatgacacc accaaaggaa agtaagaact ctctaatggt ggcccaatga aaagaagatc 8221 aacaaattgc tggaaaagga attcaaacta atgatcttaa ggaaactcaa caagatacaa 8281 gaaaatacag atagttcaat gaatcaggaa aacaattcac aatacaaaaa attcaacaaa 8341 aaaattaaaa catgatgaaa aacaaccaaa caaatcctac agttgaagaa ttcaatgaat 8401 tacatttaaa aaatacaata aagagcttca gtaacagact tggtaaagca gaagaaagaa 8461 tctctgaact tgaagatata ccatttgata ttacttaatt gtagaaaaaa caagtgaaac 8521 cctacaagac ttatgggaca tcattaaagg aacaaacgtt catactatgt aagttacaga 8581 aagaggagaa aaggtaagag gcatatcaaa ctatttaatg aaataatagc tgaaaacttc 8641 ccaagtctgg agagagatac agacatccag atccagaaag ctcaatgatt cccaaataga 8701 tttaacccaa aaaggttctc tccaaagcac attattatct gttgtcaaaa gtcaaagaca 8761 gagaaagaat tctaaaaata gcaagagaaa ggcatcaagt cacatttaag aaaatcccca 8821 ttcagctaac agcagatttc ttcacagaaa ccttacagtg cagaagagga tgggatgata 8881 tactcaaaca gccaagaatg ctatgcccag caaaactatc cttcaggaat gagggagaaa 8941 tacagttttt cccagacaag caaaaactga gggaattcat cactgctaga ccagccttat 
9001 gagaaatgct ccagggagtc ctacatctgg aagtgaaaag agagcagtca gtatcttgaa 9061 aacatggaaa agtataaaac tctctgatag agcagataca caaaggagaa agagaaaaga 9121 attaaacctt atcaaaccat caaaccacaa tgataaataa taagagagga agaaaggggc 9181 aaaggatatt attagaaaac tttaacaaaa tgacaaatgt aagggctcaa atatcaataa 9241 taatcttaaa tgtaaatgga ttaaattcca tacttaaaga gactggcgga atattttttt 9301 ttttttcttt tttttttttt tgagacggag tctcgctctg ttgcccaggc tggagtgcag 9361 tggcgcgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 9421 agcctcctga gtagctggga ctacaggcgc ccgctaccac gcccggctaa ttttttgtat 9481 ttttagtaga gacggggttt caccgtgtta gccaggatgg tctcgatctc ctgacctcgt 9541 gatccgccca cctcggcctc ccaaagtgcg gaatattttt taaatgactc aattatatgc 9601 tttctgtaag aaacccactt cacctataaa gacacatata aactgaaatt gaagggatgg 9661 aaaaagttat tccaagcaaa tggaaaccaa aagctgtcag gaattgctat acttatgtca 9721 gattaaacag acattaagtc aaaaactgta aaaacagaca aagaaggtta ttatataatg 9781 ataaagggac caattcagca agaggatata acaattctaa acatacatgt acccaacacc 9841 agagcaccca gatatatgaa gtaaatatta ctatacgtaa agggaaagat agactccaat 9901 atcatagtag ttggggactt cagcacccca ctctcagctt cggacagatc agcaaactcc 9961 aagtacagtg aactcaaaga gaccacacca agaaacagtc aaacttttga aagccaaaga 10021 cagagaagtg acttgtcata cgcgagggat ctttgataac atgatcagca gatttctcac 10081 ctgatacttt ggagaccagt gtattcaaag tgctaaaaaa aacaaaacca aaaaactcaa 10141 aaccctgtat atgcaaaatt gcccttcaaa agcataagga aattaagatg ttcttagata 10201 gacaaaagct aagggagttc attaccgcta gaactgccct gcaagaaatg cttaaggaag 10261 tcctgaaagg tgaaatgaaa gggcactaga cagaggggac agtgcagatt caggcaggtg 10321 caatggcttc tggggaagac ccagccagtg ggctattgcc agcatccagt cagctgccac 10381 cttccctgac cccagtgtca agtgatgcgc agggtgatcc aggtgtgtga ggggcagctg 10441 gatgtccaga ctgagggcac tggtgccatc agtggctatc ctaccactca attcatgacc 10501 caggtggtga tccagggtga tatcaccagt tatgatgcag ttaacacaga ggggaaagct 10561 gctgagatac actatacttt cccacccctg cagtgggaga tggggcaggg ggtaccacat 10621 cagggagtac agctgctgtt gttactaccc agggctcaga ggcactgctt gggcaggtga 
10681 ctcttcctgg cactggtcaa ttatttgtga tgatgtcacc acaaggagta ctgcagggag 10741 gaagccagca cttgattgcc cctaggactc aaccttattt atttggacgt aacaccattg 10801 taagtcaaga agcatctcta tctatctatg aacatacaat gtgaaatcaa attgaggcca 10861 taaacagttg aacctcatca taagagtgac tcatcggtgt cacttctaag accaagaaaa 10921 attctatttt tatgcagaga ccatttgttt tttaattaaa agacaactct atttcaaaga 10981 ggaaaaaaaa tcaactcttt tttcagctct ctcagatgca gaaatgtaat tatcagcatc 11041 cctgtgctgg gagcaatcat tagttttttt agatttttaa aatatacttc attatgtttc 11101 actgtattca ttctttaatc agggatgtga gggatagaat aacatttaga atatctgtac 11161 agtttgtata taatgttcgt atttcttgtt gcttttatag gtggaaaaaa tctaaaatct 11221 cctcaaaaag tagaggtcga catcatagat gacaacttta tcctgaggtg gaacaggagc 11281 gatgagtctg tcgggaatgt gactttttca ttcgattatc aaaagtatgt gactctactt 11341 actgatttgt cagaatgacc tgaataattt ttacaagttt aacaacacca taatttttag 11401 atttggaaag tgtttggttt ttctattttt tggaaatgtt acgcctattt tacataatat 11461 ttttaacttt gtttctgtag agacttagtc aaatacatct ttgggtgttg cagcaaaaaa 11521 ttggggatga gggtggtaga cagcgtctct aacccatagc cattccttct ttcttctggg 11581 tgttccagct tctttttttt ttttttttta attattatac tttaagtttt agggtacatg 11641 tgcacaatgt gcaggttagt tacgtatgta tacatgtgcc atgctggtgc gctgcaccca 11701 ctaactcgtc atctagcatt aggtatatct cccaatgcta tccctccccc tcccccatcc 11761 cacaacaggc ccagagtgtg atgttcccct tcctgtgtcc atgtgttcta ttgttcagtt 11821 cccacctatg agtgagaata tgcggtgttt ggttttttgt tcttgcgata gtttactgag 11881 aatgatgatt tccaatttca tccatgtccc tacaaaggac atgaactcat cattttttat 11941 ggctgcatag tattccatgg tgtatatgtg ccacattttc ttaatccagt ctatcattgt 12001 tggacatttg ggttggttcc aagtctttgc tattgtgaat aatgccgcaa taaacatacg 12061 tgtgcatgtg tctttatagc agcatgattt atagtccttt gggtatatac ccagtaatgg 12121 gatggctggg tcaaatggta tttctagttc tagatccctg aggaatcgcc acactgactt 12181 ccacaatggt tgaactagtt tacagtccca ccaacagtgt aaaagtgttc ctatttctcc 12241 acatcctctc cagctccagc ttctttcaat ataagttggg gtctgagcta gggtatatct 12301 tgaagatatg gcattgtact ccaaaaggtc catcgaagac 
cttggaatag gccaccaggt 12361 ttcctgtgat caggctttcc tattatctcc atgatatact atattttaat tttaggtaca 12421 cttttttttt tttttttttt tgaggtggag tcttgctgtg ttaaccaggc tggagtgcag 12481 tggcacaatc ttggctcact gcaacctcca cttcccggtt tcaagcgatt ctcctgcctt 12541 agcctctcga gtagctggga ttgcaagcac atgccaccat gtttgactaa tttttttatt 12601 tttagtagag atggggtttc gccatgttgg ccaggctgat ctcaaacttc ttacctcagg 12661 tggtctgctc acctcagcct cccaaagtgc taggattaca ggcgtgagcc actgccctgg 12721 cctagataca ctttgtaaat atattgttaa ctgggcatgg tggcatgtgt ctgtgatccc 12781 acctacttgg gaagctgagg cagaagaatc gcttgaaccc aggagacaga ggttgctgtg 12841 agccaagttt gcaccactgt actccagcct gggagacagc aagactccat ctcaaaaata 12901 aataaataaa taaaataaaa taaaaatcga tatattggac cttgcatttc taaagactca 12961 gtttttctgg agtgttgatg tctcttcctt agtactttct acttaattat tcaagaggag 13021 aggcaggcag aaggaggaat atagaaggga acacagatca ggaagtacag tactttgcct 13081 tccgattatg tgttggggcc ccacctgcca accatgttcc cctgtccaag aaaaagcagg 13141 caatgaagaa ataaggcatt aggccctccg gaaccaggag cccaccactt cagccacagc 13201 tgtgacacta ttctcgagtt gtaactaaaa ctggcatatg aatctcacag cttcagtaat 13261 tgtgtatgca tatataaaca aatacacact gtatatatgc atatatacac tatatataca 13321 tactgtatat atacattaca ctatatataa tacatactgt atacatacat atacacacat 13381 tgtgtataca tacatataca cacattgtgt atacatacat atacacacat tgtgtataca 13441 tacatataca cacattgtgt atacatacat atacacacat tgtgtataca tacatataca 13501 cactatatat acatatacac actgtatatg tatatgtata tacacacaca aattcatata 13561 tgacttttct cagaattgat catttaaaat tttttatctc tagagatttt cattaaattg 13621 aactttgagt ttggttacca gttttgtttt ttcccttagt ttggaatttc cttatgaaca 13681 actattttag gtagagaagg aattatttca gataccctta tttaatgtta cttgtataaa 13741 gaaactattt gcaaaatatg catccaacaa gagcttaata tccaaggaac tcaaacaatc 13801 aacaacaaaa cccaaatcca tcaaaaggta ggcaaaagat aggaatagat atttttcaaa 13861 agaaaacata cagtggccaa caagtatgtg aaaaaatggt caacatcact ctaatcatca 13921 gagaaatgca aacgaaacct gccatgagat attgtctccc cgcagtcaga atggctgtta 13981 ttaaaagtta aaaaggctgg 
gcaaggtggc tcacacctgt aatcccagca ctttgggagg 14041 ccaaggcagg tggatcacct gaagtcagga gttcgagacc agcctgacca acatggtgaa 14101 accccaccag cctgaccaac atggtgaaac cccatctcta ctaaaaatac aaaaattagc 14161 cggatgtggt ggtgcgtgcc tgtaatccca cctactcggg aggctgaggc aggagaatcg 14221 cttgaaccca ggaggcacag gttgcagtga gctgaaatcg tgcccctgca ctccagcctg 14281 ggtgacagca agacttcatc tcaaaaaaaa aaaaaaaaac ccaacaaata ttgaatatat 14341 ttcatgccaa acactcttct gactgcttag actgcctcag tgaacaggag aggcaaagat 14401 ggcatgaagt tatattatga aacggatttc atgaagtaaa tattacatta gttacatatt 14461 catatttcat attctgttct gagctaatta ttgcatatga taatagtttt tttgttgtgg 14521 gttttttttt ttgagacgga gtcttgctct gttgcccagg ctggagtgca atggcgcaat 14581 ctcggctcac cgcaacctcc gcctcccggg ttcaagcaat tctcctgtct cagcctcccg 14641 agtgcctggg attacaggca tgcacccccc acgcccggct aattttgtat ttttagtaga 14701 gacaggattt ctccatgttg atcgggctga tctcgaactc ctgatctcaa gtgatccacc 14761 cccctcggcc tcccaaagtg ctgggattac aggtgtgagc caccgagccc ggccaataat 14821 gattttttta agaaaatata aaagtagctt ctgctgcaat atcttgatta gatgtaaaac 14881 agttttaatg gagcatgctt cagcatctcc tgaaatagga aagacagatg taaacagtga 14941 ggttgggttg aaattatatt aagttttaaa tttttgataa ggttttctgt ttataatgag 15001 agtttctgga tttttagaaa tagattctga tgtggaacct tttcaagaat gtttaaggta 15061 tttttaatgc cagctctaat ctcaggcctc agatatgtaa aagtggaagt gaaaactggc 15121 caagaatgaa gtatttcata tgctagtgta tatttttggt aatttcttaa aaatgtttcc 15181 attttgaaaa acaaggtaga cttgtatttg gttcagcaac aaaattattt tggtattatc 15241 tctggtttct aatatctaaa tcgtctttaa aatggcatct gtgatattat taaaaataaa 15301 taaatcagct aatgaattag gcttgacaaa aatgccttcc cagaagatag tttctctgct 15361 acttatggtc aatgggctta acataggtaa aagttctcaa atctgctaaa ttggccgggc 15421 atggtggctc acgcctgtaa tcccagcact ttgggaggct gaggtgggca gatcacctga 15481 ggtgaggagt ttaagatcag tctagccaac atggcaaaac cccatctcta ctaaaaaaaa 15541 aaatacaaaa attagccaga cgtggtggca cacgcctgta atcacagcta gttgagaggc 15601 tgaggcagga gaattgcttg aacctaggaa gtggaggttg cagtgggccg agattgcacc 15661 
actgcactcc agcctgggca acagagtaag actctgtctc aaaaaaaaaa aaaaaaaaaa 15721 agaaatctgc taagttcgtt tctatccctc actttccctg caagctcttc ttcccctttc 15781 tgggcaacat gtccttttct gctttaggca ctccatgccc actttatttt tcatgtttgc 15841 cccttttttg aaaggaagaa tctaatatca cttcagcttt ctgtttggcc agttccactt 15901 ccaaaaattt atttgtttat gtaaacattt tcagttttgt ggtttctgat gatgtaatag 15961 aagaaattac ggagacaggc aagcagaagg gatggtttgc ttttggagag ggcagaggaa 16021 accttgaaac catactgatt ctgaggtctt tcttagggaa ggtaagtcta ttcataggag 16081 gaggtcagaa gggaatcatc aaatccaatt ccccagtttc tatttaattt tattttatta 16141 gagacagggt cttgctcggt tgctgaggct ggagtgcagt gatatgatca tagcctactg 16201 cagcctgcag ccttgaaccc ctatgctcaa atgaccctct cacctcagcc tacccagtag 16261 ctaggactgt ggcatgtgcc accatgccct gctttgcctt tttttttttt atttctcata 16321 gagacagggt ttgctatgtt gaccaggctc caaactttca ataaggaaga taagatccag 16381 aaacacaaaa tagtttacca aaagtcacac agctattttt aacagagcca acatactaaa 16441 tccactttta ccttatgggt catttatttc tctgcttcct gaagcaacca cccacaaaat 16501 tatataaaga actgtatttt aaaattcagt ttcataagta ataacttggc ttatatgcat 16561 tgaaaaagag tggaagggtg tatgctaaaa tgttaatagg acattagctc aagtagaaga 16621 aataactctt aaaccaaaaa tagaaataac tcttaaacaa gagttattaa gagttaagaa 16681 ataactctta taccagtaaa tagaaagtat ttgacactta catttataca tttgctcact 16741 cattcatttg ttttttttac tttaaagaac tgggatggat aattggataa aattgtctgg 16801 gtgtcagaat attactagta ccaaatgcaa cttttcttca ctcaagctga atgtttatga 16861 agaaattaaa ttgcgtataa gagcagaaaa agaaaacact tcttcatggt atgaggttga 16921 ctcatttaca ccatttcgca aaggtaagaa aaagttgcta gctgaattat attctttagt 16981 aaatattacc agagcagttc actttccaag ccattcattt gcatgatgca aaatctaaca 17041 tcttttaaaa agaacaaaaa ttcccttaaa cctatatctt cttcctgcta tggctccatt 17101 tgtcctactt tccctgtagt agtggttctc aaagagtgat cctcaaggcc ctttcagggg 17161 gctgtaaaga caaagctata tttatgacaa atctaagact tagttgcctt ttttattttt 17221 gttctttctt gtattcacag tggagttttc cagaggaggc tgtttgatgt gtgatgtggt 17281 gatatggtca ctgatttaac atagaattca atgtgaaatt catctgcctt 
ctttattaat 17341 ccaaacattc aaaagatttg gaaaaaaaat gtaaaacagc accactctga agtttctatg 17401 ttccttttgg caaatgtttg ttaccataaa ataggtttat cattgttatt tctttctttt 17461 tttttttgac acagagtttg tcttgtgcca ggctggagtg caatggcaca atcacggctc 17521 actacagcct ctgcctcctg ggttcaagtg actctcccgc ctcagcctcc cgagtacctg 17581 ggattaccag catgcatcac catgcctggc caattttgta ttttagtaga gacagggttt 17641 ctccatgttg gtcaggctga cctcaaactc ccaacctcag gtgatctgcc cacctcaacc 17701 tcccaaagtg ctgggattac aggcgtgagc caccatgccc agtccattgt tatttcttaa 17761 tggattaatt agttaaaatg gattgaaaac tttaccagtt ttaatgttta atatgctaat 17821 ataaaaatat acagcccaca taaataatgg ctgtttggga ttctcgataa ttcctttaag 17881 aatgtaaagg agctgagtgt agtggcccac acctgtaatt ccaacacttt gggaggctga 17941 ggtgggagga tcacttgagg ccagaagttc aagaccaacc taggcaacat agtgagaccc 18001 catctctata aaaaatatat aaaaattagc caggcatggt ggtgcacacc tgtagtccca 18061 gctattcagg aagctgaaga gggaggatca cttgagcctg ggagatcaag gttgcagtga 18121 gctgtgtttg tgccactgca ctccagcctg agcaacagag ctagaccata tctcaaaaaa 18181 aagtggggat cctgagacca aaaggtttga gaaccactgc cctgtaccaa gaatgatctg 18241 tactcccttt ctatactaat cttccaattc cctgtcagcc ctctccattc aaattctgtt 18301 cctcctcact taaaaccact ctttttggcc aggcgcggtg gctcacgcct gtaatcccag 18361 cactttggga ggccaaggcg ggcagataac gaggtcagga gatcgagacc atcctggcta 18421 acacggtgaa accctgtctc tactaaaaat acaaaaaaaa aaaaaaaaaa aactagccgg 18481 gcatggtggc gggagcctgt agtcccagct actcgggagg ctgaggcagg agaatgacat 18541 gaacccggga ggcggagctt gcagtgagcc gagatcgtgc cactgcactc cagcctgggc 18601 gacagagcga gactccgtct caaaaaaaaa aaataaataa accactcttc tcaaggtcac 18661 agcagcctcc acaatgccaa gcaaattccc agtcctcttc tggcttcatc cctctgtagc 18721 atagatgtgg tcatcactca gtttttcctt tactggatct ctgggacact gctctttcct 18781 ggttctcctc ctgctcattg atcgcttcat cttcccaact tttcagtgtt agactgctgc 18841 tgggctcagt cattgaacct cttctctttc tgtatcactc cactggtgat cttacagctt 18901 tctatcctat ctgtatgctc ttaactccca gaagtggcag gcacatatta agtgctcaga 18961 attatttgtt gaatgaaggt tttggcattg 
tattaataaa gttccatagt aattgttttg 19021 atttttttgc agctcagatt ggtcctccag aagtacattt agaagctgaa gataaggcaa 19081 tagtgataca catctctcct ggaacaaaag atagtgttat gtgggctttg gatggtttaa 19141 gctttacata tagcttactt atctggaaaa actcttcagg tgtagaagta agcattattt 19201 ttacctctgt ttaatcgatg tgagagaaaa attaggcgaa ttaatcctaa aatttgactt 19261 tatacttttt taaagaacca acttatattt gtgttatagg aaaggattga aaatatttat 19321 tccagacata aaatttataa actctcacca gagactactt attgtctaaa agttaaagca 19381 gcactactta cgtcatggaa aattggtgtc tatagtccag tacattgtat aaagaccaca 19441 ggtaaggaag atgttttgtt ttagattcaa taaatatata aacagattgt caattttggc 19501 atcttcccca tattgctgaa gtttacatga taggtcaata tatgttaaaa acattgtaac 19561 atttacataa gcaaaataaa tgttacttgg gatttttgtc tcaaatagta atgaaaatta 19621 attctactta aaagttcagg ctgggcgtag tggcccacgc ctgtaatccc agcactttga 19681 gaggctgagg tagctggatc acttgaagcc agaagttcga gaacagcctc gccaacatgg 19741 gaaaaccccg tgtctactaa aaataattag ctgggcgtgg tggctcacgc ctgtagtggt 19801 agctacttgg gaggctgagg cacaagaatc ccttgaacct gggaggtgga agttgcagtg 19861 agccaagatc gcgccactgc actccagtcg ggagcaaggg ggacaaagag taaggctccg 19921 tcttagaaaa aaaaaaaagt tcaaatattt tgtaatgaca aaacttttca tttgttcaga 19981 atccacatga agcaggaggt agctgaacta tgagttctgc aagtgggaag aaaaggtggg 20041 caagtagttg ctagtagaca agttattgtt caagaaagca tacctttgta agtggaatgt 20101 taaaatatga aagtatcctg atacatttag taacatcttt ccaataatta aggtcaaggg 20161 caggagctgg gatttgttta ttttaattgt aaatttcata tgcttttttt tttaatcaaa 20221 cattacggaa gggtttaaag tgaagtcttc ttctcccttc ttatccccaa ttccattcac 20281 tagagataag cattgcttag cagtttgtgt atatcctttt cacaactttt ttcaggaatt 20341 taatggtaag gatataccac agcacccaca ggtttccttg ggttttgttt tttaaaaatt 20401 acaaaaataa aaatcttaca tgaagtattt acagcttgct tacttaaccc agtgtgttgg 20461 ggatattcta tatcactcaa tacagatcta ttacgttctt catttgatct tagccatcta 20521 ttgcattctt tcttttttac ttatttattt atttatttat ttatttattt atttttgaga 20581 cagagtctca ctctcttgcc cagaatggag tgcagtggct caatctcggc tcaccgcaac 20641 ctctgcctcc 
cagcttcaag cgattctcct gcctcagcct cccgagtagc tgggattaca 20701 ggcgtgtgcc atcatgcctg gctaattttt gtatttttag tagaaatggg gtttcaccat 20761 gttggccagg cgcgtctcga actcctgacc tcaggtgatc catccacctc agcctcccaa 20821 agttctggga ttacaggtgt gagccactgc gcccggcctg tagtctttca aatggctgca 20881 tagtattaca tagtgtgaat acaaaccata tttaactatt accttataat gataaaatgc 20941 gagcctttat cttcttgcca gttatctcac ttgagtaaaa atgtgtgctt ttttttatct 21001 gttctttggc ttctagttga aaatgaacta cctccaccag aaaatataga agtcagtgtc 21061 caaaatcaga actatgttct taaatgggat tatacatatg caaacatgac ctttcaagtt 21121 cagtggctcc agtaagttcc attccataaa tttccttttg cccagtttgt tttgattatg 21181 cttcttttcg ctctgcatca gtcaccaggt ccttgcacac agaatgaccg actgggaggt 21241 gggtacgata tattggaaac gtgagaaatc tcatttctaa ccccagttct gccagtaact 21301 aactggatga cattgagcta atcgcttaac ctccatgggc ctcaatttct tcacttgtaa 21361 aatagatgga ttctaatatc tggggttctt actggcttaa gaaaaaaaag aaagtgtaac 21421 tagttgcatg tgttatttcc ccaatctgga agactcgaca gttgaccctg atctttttct 21481 aagaactcac ttcttccatg agtccttaaa atgatctcag tccactcact tgttcaattc 21541 aaaaaattag ccttcctatt ttctttatta ctttagtggt tttgacaaaa ttttgtagta 21601 gatacttcgg tgggaaagaa agttggaaag aactatgttc tatgtggttg agcagaggct 21661 agctaaacat ggatgaagtc caggcgctgc ataaatgaaa gagaactggc ctggtgcagt 21721 ggcgcatgcc tatcgtccca gctactcagt gggctgaggc aggaggatta cttgaggcca 21781 ggaattgagg ctgcagtgtt gtatgatcac acctgctagt agccgctgca ctgcagcctg 21841 ggcaacatag tgagaccccc atctctaagc attaaaattt ttaaaaagct gataaaagag 21901 aactgcatgt agtttatatt gattggaaaa ttgtaggaga tgaatagaga aaagagaaga 21961 ggaaaattaa gtagaaggaa aaatgggtta tggggagaaa agagatggaa agaaagatat 22021 acattggtgg cagaggagta gagagagcaa aggcaagtat agctaagaca ggaaaggaaa 22081 agtcagaaga aactggaaag aaacagaaaa aatgagggag aagggaagtg taagtaccgc 22141 tatgatcaat ttctatcttt tagtgttaat tttgaaccaa gggtgactgg tagaatgcaa 22201 atgcatttat gtgcatatgt gccccacaat gcatatataa cctcacaaac caatcaggaa 22261 actgttcagt agtcttttgt gtatacctca ctttcagaag tgaataaata atagggtcac 
22321 caaatagata atgagaggta tttctgatag ctgatctcta agcttctttt gggtacttac 22381 catattgtac aaagcactga agaaatggaa agagtcagca gaatcaaacg tggttcaaaa 22441 attgaaagca gttagccaac attgcaaagt gttgtgtggt ttaggactaa agagtggtat 22501 acaaattata agggttgcag aaattgaaaa tgtaaaggtg aataggggct aaagagacca 22561 aagaagccat aaggaaagag ataaggtatg aacaaggcct tgattgttaa ttcagatttt 22621 ttttaagtag acaactatta gatgttagtt ttctgtgtct gtaactaatt accacagact 22681 tagcaactta aacagctgtt tattgtctca aagtttctgt gagtcaggag tccaagcgtg 22741 gcttagctgg ctcttctgct aagcagtcaa ggagtcagtc attcatacaa gacttggggg 22801 ccgctcccaa gctcacatgt tgttggcaga atttatttct agagctcatg gtggctcact 22861 tcttagaggc catcaggaag ggagagtctg acctctagat cctttttttt tttttttgag 22921 acggagtctc actctgtcac ccagactgga gtgcagtgac atgatctcag ctcactgcaa 22981 cctccgcgtc ctgggttcaa gtgattcttc tgcctcagcc tcccaagtag ctgggactac 23041 aggtgtgcac caccacaccc agctaatttt tgtagtttta gtagaaacgg ggtttcaccg 23101 tattggccag gctggtctcg aactcctgac ttcaggtgat ccacccgcct cagcctccca 23161 aagttctggg attataggcg tgaggcaccg cgctcagcca ttttttttgt atttttagta 23221 gagacggagt ttctccatgt ttgccaggct gttcgcaaac tcctggcccc aagtgatctg 23281 cccacctcag cctcccaaag tgctgggatt agaggcatga gccaccatgc ctggcctaga 23341 tcctctttta aggggttcat ctgattaggt caggcccacc cagaatagtc tctcatttga 23401 ttcaactgat ttgggacctt aattacatct gtgaaatacc ttctcttctg ccacttaatg 23461 taacctaatt atgagaggag ggcctccatc gtatttacag gtcctgccca tgctcaagag 23521 gaggggatta tacaggatgt gtacaccagg gggcaggaat cttgggggcc atctcagaat 23581 tttgccttct ccagtgggga ggatgtcctt ctagagtagc agaacatttg gacaccaggt 23641 ggcaatggga agaaagctga ataagtagaa tgggacaaga ttagggtcaa cctcaggctg 23701 tcctgtaggc ttttggggat tttgtgttgg ggtgatagtt aacctgagct ctggatttga 23761 ggaaggtgca tgcggcgaca tggtatgtgg gagagtggac taaggagaca ctaaagagag 23821 ataaggaaac cagtcaggag gctattagag ttctgggctt gaactgagat gcttttagct 23881 gttggtttct gggcaagtta cttaacctct ttagtctcgt acttatactt gctcttataa 23941 ccccgcagtg tgtctctatt ttagcactta atcagtgtcc 
attggtatgc tggcgcatgt 24001 ttagcagctg gctttccaga aagaaaaagt cctattttgt agtgtttgct gatttttatg 24061 atataaatat tcccaccatg gccactttca agccaccagc ctgatgtccc tgaatatgga 24121 gttgggaaca tatgcacagt agcacaccac tatcatatag ctacaataga taaagatagc 24181 cccaagagct tagaaaataa aatgtagtaa aataattagg acaagatgaa tttatttatt 24241 acctctgtct ttaaaacaat tttttatttg caaatttata taacttaatt tttagtaatg 24301 gtttttcttt aaaaaccagt ttgtatttta ggaggccgag gtaggtggat cacctgaggt 24361 caggagttca agaccagcct ggccaacatg acgaaacccc atctctacta aaaatacaaa 24421 aattaggcag gcctggtggc gtgcacctgt aatcccagct actcaggggg acagaggcag 24481 gagaattgct tgaacccagg aggcagaggt tgcagtgagc caagatccta ccactccact 24541 ccagcctggg tgacagagcg agactcggtc tcaaaaaaaa agccaaaaaa aataaaccag 24601 cttgcagcat tcctggaaat tctaactaac agatgttctt gcatattgat atgagccacc 24661 tccagcagag cacaacatga ccacagtctg gaacagtctt tggttttctt ttatgttaga 24721 tgcatatctc ttccattgtt tgtgagtttc ctgagtgtgg atactattta tttctgtaac 24781 cttagcccct aacatagtgt ctggcaattg taaatactta ataaatatct aatgaattta 24841 aaaaatattt gtcttaaaag cgccttttta aaaaggaatc ctggaaacca tttgtataaa 24901 tggaaacaaa tacctgactg tgaaaatgtc aaaactaccc agtgtgtctt tcctcaaaac 24961 gttttccaaa aaggaattta ccttctccgc gtacaagcat ctgatggaaa taacacatct 25021 ttttggtctg aagagataaa gtttgatact gaaatacaag gtaaggcagt agtttttact 25081 ggagattgta attctctggt gcaagttttt aaaattgttt ttctaattga acattatttc 25141 tttacaaatt ttttctagct ttcctacttc ctccagtctt taacattaga tcccttagtg 25201 attcattcca tatctatatc ggtgctccaa aacagtctgg aaacacgcct gtgatccagg 25261 attatccact gatttatgaa attatttttt gggaaaacac ttcaaatgct gaggtaaaaa 25321 gactgtatag tataattttg taacttagag ttataattat gatttgggta aataaagctt 25381 gaatgtaaaa tttgggggaa atttttaaac tttatgtggg ctggatgcag tggcctgtaa 25441 tcccagcact tcaggaggcc aaggcgagag gatcacttga gcctaagggt ttgagaccag 25501 cctgggcaac atagggagac cctgtctcaa taaaaatttt aaaaaattag cctggtgtgg 25561 tggcgtgcac ctatattccc agctacttgg ggtgggaggg tcacttgaac ctgggaggtc 25621 aaggctgcag tgagccatga 
tcgtggccac tgcactccag cctgggtgac agagtgactg 25681 acagcttgtc tttaaaaaaa aaaaatgtga ttaactcaga tattaacaaa atgaagatta 25741 tgagcatttt tcatgttttg cactgtagag ttatgggtga gctgcatctg ggccccagtt 25801 tgcttttaaa atacagattc ctgagcctat tgttactgaa tcattatctc tggggatgga 25861 actcaggaat ttgcattttt aacatgattc ccatgtattc atctaaacct gggattctca 25921 gcactattga cgtttgggct ggatagttcc ttgggagggg gctgtccagt gcagtgcagg 25981 atgcttggca acattctggt ttctgcctac tagatgccag tagtgttcct tcctagttgt 26041 gacaaccaaa aacatcttca gatattgtca aatgtcccct ggtggctcct ccaaggggac 26101 aaaaattggt tcttatggga caaaagaatt tatgtagtac agttgtttgt tttcctccaa 26161 aaccattgga aagcattctt ccaaagttca gctttgccca acaaaatctt attccttagt 26221 atttaatttt atgatggggg aaggattagg aaaaaattgc ccaaaaagtt gtttagttgg 26281 ggaggtaatg aaaaaagggt tgacaaacac tggtctaaac cctaaaattc agataccact 26341 gacaaagata atatataccg tcagataata tacgtgtttc ccagatcact tccccctcgt 26401 ggacttgtag tcaagaatcg acctttaaaa ctcctgtcca gtcaggctgg taaattcttt 26461 aacttcacct cattttctat tggaaatcta agcaaattcc attgatttgg ctgcttccct 26521 ttttaattgt ctgacaaccc tgtaaccata gcttaatgta gcccattgag aattatggtc 26581 ggttttcaga tgtctgtaga tcagacaaac ctaaatttct atctcactgc taaactgtgt 26641 aaccttacgt acgcagggtg ctttacctct tggagacttt gcttcctctt ctctgaagtg 26701 gtggtaatca taacaactaa atttttcaaa aggatgttta taaggtttaa aaggaagtaa 26761 caaatatgaa aatacctagc acattgctgg acaagtagta gagattcaat aaattgtagt 26821 tgccatctta acctatactg gataatatat caataattgt tactatagaa actcttattg 26881 aatacttata tgtaaggtac tatgctaagt attgtacatg aattgtctca gtcattcttc 26941 aaaacaaccc tgccggatga tgtgattggt tacattgctg tttggcaaat acttgggacg 27001 gtgacttctt ccctgcacca ccagaccgta gatgctggag cccaagtaga tgtgctgaga 27061 acagaggcct taagtgtgct tgcagggttt ggcttggctc ttgcactgct cctatttgcc 27121 aggaagaaaa tgtattccac atagtctctg gtccaaggag gattagaggc tgctggagct 27181 gacctaaacc ccatgcacag cctccgtggg agctgcccag ccagtctgta gacttgtgag 27241 tgagaaacta cataatcgtt ttgtgagcca ctgaacgtta aggcagtttt gttatctata 27301 
tttttttatt ttattttatt tttttttttt ttttttgaga caggttctca ttctttttcc 27361 caggctggag tgcactcatg tgatgtcagc tcactgcagc ctcgacttcc tgggcttaag 27421 tgatcctccc acctcagctt cccaagtagc tgggactaca ggcacatgcc accatgccta 27481 gctaattttt tgtatttttg ctaaagatgg agtttcacca cgttgcccag gcttttttgg 27541 tggtggtggt ggtggttgtt ttggaaatag ggatttcctc agcctcccaa gtaacagaga 27601 ctacaggctc acaccaccat gcctggccaa tttttttgaa atttcttgta cagacagagt 27661 ctagctatgt tgcccaggct cgtcttgaac tcctgggctc cagtgatcct cctgcttggc 27721 ctcccaaagt gctgggatta caggtgtgac ccaccggctt gttagacatt attctggcga 27781 tagctgactg atacagatag cagcaataac tgtcccgatt ttatagatca gggaatcagg 27841 tagagagaaa ttaaataatg cttgcaagac cacacagctg taagtgatac tcttaataag 27901 gcagtttgag gccaaaaccc ctactcttaa ccattctgtc actaatcaga ggtaatacaa 27961 tggcaattac cccagacttg gcatgtactt gtaaatcaat tagaattctc ttaggaatag 28021 atcatatgct tttttggcag cagcattttt aactttctga tttggttata gtggtgtatc 28081 taaaacaaat ttatatttct cacacatata ctctgtaatc ccagcacttt gggaggccaa 28141 ggtgggtgga tcacttgagg ccaggagttc aagaccagcc tgaacaacat ggtgaaaccc 28201 catctctacc aaaaagaaaa aaaaattagg cgtggtggca tctgcatctg taatcccaac 28261 tactcaggag ggtgaggcag gagaatcgct tgaacccagg aggcagagat tgcagtgagc 28321 ccagattgca ccactgcact ctagcctggg tgacagagcg agactgtctc acaaaaaaaa 28381 caaaacaaaa caaaacaaaa caaaaaatac ataccaacta cgtgggagag gccaatgtta 28441 gactgaacat aaaaaattga gaaagcacat attccctgat ttcttgaggt gactaaattt 28501 tatcagtgat ttaattatat tttctagaga aaaattatcg agaaaaaaac tgatgttaca 28561 gttcctaatt tgaaaccact gactgtatat tgtgtgaaag ccagagcaca caccatggat 28621 gaaaagctga ataaaagcag tgtttttagt gacgctgtat gtgagaaaac aaaaccaggt 28681 cagaatcttt tattgtcttt tttaaaaatg tagctagaca taataaaagt aattctatac 28741 tgtacattga aaattgtaaa acattttctc tttactgcaa aaaatatata gaaagaatgt 28801 tttcttcatg aactacatga atcaaaagta gactttttag aaaatatttg taacgcttaa 28861 ctctcaagtc ggtgttgttg gatgctttat atttcatcca gtatccctat aattaatttc 28921 cttaatgtat tgctctttaa catttaataa aactatttta aatttttaga 
atataatcct 28981 taacataatt atcatgtaga aatcacttag ttcaattgtg agttttttaa tgtgggaatt 29041 ggtttagtct cattttctat tttacagttt ctagtgttga ccggtagagg cttagtcaca 29101 gaatatctat taaattttgc tagttgtatg gtgtaaaagt gctgaaggat acttgcatgt 29161 ttggcctgta tgtcaatatt gtatttctcc ctgaggatct cttacttcag ttcccacgcc 29221 aatctttaaa tgtaaatgtc attgcctatg gttgctgtgg actcagtgct accagaaata 29281 tttttaagag tattatttaa tagattttat atttcctttc atatggctag tctttcacac 29341 agctgtcagc actgttaagg catttgtatg tcaaaatata tgctaaatat cacgtatatc 29401 tttttaggaa atacctctaa aatttggctt atagttggaa tttgtattgc attatttgct 29461 ctcccgtttg tcatttatgc tgcgaaagtc ttcttgagat gcatcaatta tgtcttcttt 29521 ccatcactta aaccttcttc cagtatagat gaggtatgtt acttttttta tttttttgtc 29581 aacagctagg aaatgaacag aaaatgtgtt tgatttcaac agtatatagt aggttttctt 29641 gatatccaga aaataataga gactgatttg ggtatcttct tcaaagcttt agccaattaa 29701 ctttaaaaac agtaatttca tgtaataaca tagcatgaga tagtaatgat tgtccttaat 29761 ttcatatttt tctggcaatt cctagattca ctgtggcatt tgttttaccg tttaaagcct 29821 gtgattcttg gccaagcgtg gtggctcacg cctgtaatcc cagcactttg ggaggctgag 29881 gcgggtgggt cacttgaggt caggagttcg agaccagcct ggtcaacatg gagaaaccct 29941 gtctctacta aaaatataaa attagccggg cgtggtggca catgcctgta atcccagcta 30001 ctcgggaggc tgaggcaggg ggatcgcttg aacctgggag gtggaggttg cagtgagccg 30061 aggtcatgcc actgcactcc agcctgggtg acagagcaag actctgtctc taaataagta 30121 aataaagtct gtaatgtgat tcttctcagt acagacagtc ccaaacttac gctgtttcaa 30181 tttgtgatgc gtttgtcaga acgtaattcc attgtaagtt aggagcatct gtatatatcc 30241 agaagactaa aatattctat cagcgtagaa agtatattct atggatgaaa tcatacaaaa 30301 tgaaataact aaaaacatga gacttttttt atgacacaca tcacagtttt ttgtttggca 30361 cttctatgaa gctagcatat taaatgaaag cactgaatgg tgaaggccta aacatagaat 30421 ctgttcctgg gaaaccatgt gctcatctag ggtgggtttt gtgaatttga gcaagtcact 30481 aagggcaggt aagcaagtac ccttggctgt aaaccacccc atagaggtgt taggagcaaa 30541 ggagttaaac cataagaacc ttaccactct aaaaatgctc cgtaaatatt aacaatttta 30601 ctattatacc cctgagtttt taaatgtcat 
tcctgtgtac tgtaatattc atagctatta 30661 tttaaaatag acttaaaaac tagttacaat agctaataat ttctcaattg tgcttcttct 30721 ggatatatat gtgttggata caaacatttt tattatttca aaaaaaaaaa gtcatgatcc 30781 cagagtccgc ccactcctgt cctttcccgt gttcttcccc gcctaccccg atggcacagt 30841 gtacctttct taggtactct tcaaagactc accacagaag gtactaagat atgagtgacc 30901 tcactaatga tgcttttaaa cattataagg caattagtat gttcttaggt gactttttaa 30961 tatgcatgcc agaagatagg ttttctcagt aatggatgta agaaactaaa gctattacaa 31021 ctagaaaagg aatttttatt attttaaata attgatttct actctttccc tttttttaaa 31081 ttagtatttc tctgaacagc cattgaagaa tcttctgctt tcaacttctg aggaacaaat 31141 cgaaaaatgt ttcataattg aaaatataag cacaattgct acagtagaag aaactaatca 31201 aactgatgaa gatcataaaa aatacagttc ccaaactagc caagattcag gaaattattc 31261 taatgaagat gaaagcgaaa gtaaaacaag tgaagaacta cagcaggact ttgtatgacc 31321 agaaatgaac tgtgtcaagt ataaggtttt tcagcaggag ttacactggg agcctgaggt 31381 cctcaccttc ctctcagtaa ctacagagag gacgtttccc tgtttaggga aagaaaaaac 31441 atcttcagat cataggtcct aaaaatacgg gcaagctctt aactatttaa aaatgaaatt 31501 acaggcccgg gcacggtggc tcacacctgt aatcccagca ctttgggagg ctgaggcagg 31561 cagatcatga ggtcaagaga tcgagaccag cctggccaac gtggtgaaac cccatctcta 31621 ctaaaaatac aaaaattagc cgggtgtggt ggcgcgcgcc tgttgtctta gctactcagg 31681 aggctgaggc aggagaatcg cttgaaaaca ggaggtggag gttgcagtga gccgagatca 31741 cgccactgca ctccagcctg gtgacagcgt gagactcttt aaaaaaagaa attaaaagag 31801 ttgagacaaa cgtttcctac attcttttcc atgtgtaaaa tcatgaaaaa gcctgtcacc 31861 ggacttgcat tggatgagat gagtcagacc aaaacagtgg ccacccgtct tcctcctgtg 31921 agcctaagtg cagccgtgct agctgcgcac cgtggctaag gatgacgtct gtgttcctgt 31981 ccatcactga tgctgctggc tactgcatgt gccacacctg tctgttcgcc attcctaaca 32041 ttctgtttca ttcttcctcg ggagatattt caaacatttg gtcttttctt ttaacactga 32101 gggtaggccc ttaggaaatt tatttaggaa agtctgaaca cgttatcact tggttttctg 32161 gaaagtagct taccctagaa aacagctgca aatgccagaa agatgatccc taaaaatgtt 32221 gagggacttc tgttcattca tcccgagaac attggcttcc acatcacagt atctaccctt 32281 acatggttta 
ggattaaagc caggcaatct tttactatgc attaagacct ctgattcaaa 32341 acttattaga acagtagctt ctgctggaat ttgcaatcac tgaagtcata gaaaataggt 32401 aactatctaa ttagagaaat aattgttgta ttttaagatc tgagagtgtg tacaagtttt 32461 agtatacatg ccatgccaga agatagtgta tgcaagaagt cttgggacca gaaaatggca 32521 atgataggag actgacatag aagaagaatg cttccctagg aaaaaggtcg ctggctttgg 32581 tgcaagagga agaagaatgt tccactggaa gcctgagcac ctaatcagct ctcagtgatc 32641 aacccactct tgttatgggt ggtctctgtc actttgaatg ccaggctggc ttctcgtcta 32701 gcagtattca gatacccctt ctgctcagcc tgcttggcgt taaaatacaa atcattgaac 32761 tgagggggaa aaatgtaact aggaagaaaa acccaattta agaaattaca taatgctttc 32821 caaaggcacc tacaacttag ttttaaatta cttgctactg gggattaccc atggatatcc 32881 ttaataggca ggaagtctgg gaattc // LOCUS HSIFNG 5961 bp DNA PRI 15-NOV-1994 DEFINITION Human immune interferon (IFN-gamma) gene. ACCESSION V00536 NID g32675 KEYWORDS interferon. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5961) AUTHORS Gray,P.W. and Goeddel,D.V. 
TITLE Structure of the human immune interferon gene JOURNAL Nature 298 (5877), 859-863 (1982) MEDLINE 82272342 FEATURES Location/Qualifiers source 1..5961 /organism="Homo sapiens" /db_xref="taxon:9606" exon 347..588 /number=1 mRNA join(347..588,1828..1896,1992..2174,4600..5319) prim_transcript 347..5319 CDS join(475..588,1828..1896,1992..2174,4600..4734) /codon_start=1 /product="immune interferon" /db_xref="PID:g32676" /db_xref="SWISS-PROT:P01579" /translation="MKYTSYILAFQLCIVLGSLGCYCQDPYVKEAENLKKYFNAGHSD VADNGTLFLGILKNWKEESDRKIMQSQIVSFYFKLFKNFKDDQSIQKSVETIKEDMNV KFFNSNKKKRDDFEKLTNYSVTDLNVQRKAIHELIQVMAELSPAAKTGKRKRSQMLFR GRRASQ" intron 589..1827 /number=1 exon 1828..1896 /number=2 intron 1897..1991 /number=2 exon 1992..2174 /number=3 intron 2175..4599 /number=3 exon 4600..5319 /number=4 polyA_site 5319 BASE COUNT 1829 a 1012 c 1170 g 1950 t ORIGIN 1 agcaaatgat caatgtgctt tgtgaatgaa gagtcaacat tttaccaggg cgaagtgggg 61 aggtacaaaa aaatttccag tccttgaatg gtgtgaagta aaagtgcctc aaagaatccc 121 accagaatgg cacaggtggg cataatgggt ctgtctcatc gtcaaaggac ccaaggagtc 181 taaaggaaac tctaactaca acacccaaat gccacaaaac cttagttatt aatacaaact 241 atcatccctg cctatctgtc accatctcat cttaaaaaac ttgtgaaaat acgtaatcct 301 caggagactt caattaggta taaataccag cagccagagg aggtgcagca cattgttctg 361 atcatctgaa gatcagctat tagaagagaa agatcagtta agtcctttgg acctgatcag 421 cttgatacaa gaactactga tttcaacttc tttggcttaa ttctctcgga aacgatgaaa 481 tatacaagtt atatcttggc ttttcagctc tgcatcgttt tgggttctct tggctgttac 541 tgccaggacc catatgtaaa agaagcagaa aaccttaaga aatattttgt aagtatgact 601 ttttaatagt acttgtttgt ggttgaaaat gactgaatat cgacttgctg tagcatctct 661 gataggctgt catctcttgt aggcagtcat tttgagattt ggtgttattt tgttaattat 721 tgactagatg agttccttga ctaaataatc tagatattgt tttaaccttc tgctcagttt 781 gtatagagac ttaaaaggga tttatgaatt ttccaaaaga tgggcataat atgggtatga 841 agcataatga tgttaataat tttgtggtgg gaactcattc agttgtgata gtcaaggagt 901 atgcagattg aaaaaaatga ttggttatta gtttttgact tctcagactc taaggtcaag 961 attagcatta 
aaaaggtaat aggaaatgtt tacaaattaa agtcaaaaag gtccttaaag 1021 ctttggctta aaaaaataac tgataggtga ttttctccaa aaagtgattt caacattctg 1081 cttctctatc tatattactt gtgaagtatt ccggaacttc gttgctcact gggattttgg 1141 aagaattatg attctggcta aggaatgttt aaaaatttta agtgaatttt ttgagtttct 1201 tttaaaattt tattgatggt taatgaaaag tttttacatt ttaaatattt cattatttgt 1261 ttaaaactta gctgttataa ttatagctgt cataataata ttcagacatt cacaattgat 1321 tttattctta caacacaaaa tcaaatctca cacacacaca cacacacaca cactcgcaca 1381 tgtttggaac tatcttttaa agctcgtata ataataccct acaggaaggc acagtagatg 1441 taatagaaac ctgtaccatt ggggggcagt attttatagt ggggtggctt tgctgttttt 1501 tgtttttgta ttttttagcc tagcttgaaa atactttctt tagcttacta tagtttttgg 1561 gacctttgga gtatcagctt tgttgagctc atttgtgaca ttgcaattta atggttatat 1621 tgggaaataa aaaagctaaa agaacataat agtctttgtc tatatctcac ataagccttt 1681 tgggaatact tattgttaga actaagcaga agagttgaaa aggaaatcag tgaatattgt 1741 cacatctgag ttcaatgaaa cttgaaatat atttttaagg caatttatgg gctaattgta 1801 aaccaatttt ttcttttttt tttttagaat gcaggtcatt cagatgtagc ggataatgga 1861 actcttttct taggcatttt gaagaattgg aaagaggtaa gctgaatatt cccatttggc 1921 taattttcct gttgcttgct ttctgatgga taaattcaca tcatcctctg tttgtgctct 1981 ttccttccaa ggagagtgac agaaaaataa tgcagagcca aattgtctcc ttttacttca 2041 aactttttaa aaactttaaa gatgaccaga gcatccaaaa gagtgtggag accatcaagg 2101 aagacatgaa tgtcaagttt ttcaatagca acaaaaagaa acgagatgac ttcgaaaagc 2161 tgactaatta ttcggtgagg ctatttaaat tctttctttg gtttcattgc cgagggtctt 2221 gcaaagcatt tattctccag aaagtagaca ttagctattt aacagttgct aaagctatga 2281 actcaactca tggctgaaac tctaccttac tatttccatt cgtgtttggg tgactttgca 2341 aagccagtaa gagaatcgct gaagtatgta atgtagagaa atgctggcat tgtaactatt 2401 gcgtaaagac aggtgagttg acaaattcca gtgaagagga agtaggtgag gaagaagcag 2461 ggagtactga gaagcagttc tctcattgtc ccttgctcat atgatggaaa ttctcttact 2521 ttgaatgaga ggctgtctgt cttaatggaa agagcagtgg gaggagctga gaagatgtgt 2581 gttctcctcc caactcagcc accaaggaac tgtgatgaat cacatggctg gctgggctca 2641 gtttcctcat cttaaaagga 
aactgttagg ttcactgtat aagtttgatg accttctttg 2701 ctccaaaact ctacaatgca aagaatagaa aatgagaatg agatagaaga aagctacagt 2761 ctttgaatag gtaccaggga caccccactg caagtctcta gccaacctat cagattgtac 2821 tgcccaatta gaagcaagaa tggttgctgt ttgtttgttt ttagggaaaa atagatagaa 2881 tttatacctt atgaaaagat tgttctatca actctctatc aactttcaga atatctcagc 2941 tggagaactc cttagactcc taagtcttac ctcatgaact tgtatcttta agttatggct 3001 tctataaaca gaaagataac gttgaggcat aaagacaaat catgtttttc agaatgtttt 3061 ctagaagaca aaggcctcta gattcctttg gggttgactt tgatataaat gggctcaaat 3121 gagagggacc agggtcttca agctagcatt tgtgttctta ggatatgtgc tcagctttca 3181 ctattgctgg gcctgcctct cactcctctc atgtaagccc ccagaaacag aaaggagaga 3241 catggcaaca ggtctccttt ggttataaac tagacactca gcacttgttt ctaatccagt 3301 ggtgcccctg gcttactgtt cagtcctgga taagtctctt agtttcttgg tgatgatttg 3361 aacattggaa agtaaaatct gtcacttgca aacacacagc ttgtcgaaaa ttttttctac 3421 tctgcaggaa ctgggcctta aaaaaatgaa aaaaaatctg tggtttcttc cttctggaag 3481 ctacaaacct cctgtttctt gatgggcaat cttgagtgag ctctattaat tattattctc 3541 tttggctcag ttgctaagct attttatgca tgttatgccc tttgacaatt agtctttagc 3601 tgtaatcccc cagccatcct cagaaatgtg gtgaggtagc catagtgttc ccaagattag 3661 aaaaatgtaa tggcagagcc aagaggaagg taaatggtcc acatcttatg aagcatcatc 3721 taaatggccc tattggttag agtgaggaga tgcaagtagt tcaatttgct tgcctagaag 3781 gcagggtact ggaaaagttg ttgcaattct taattttaaa ctttatatat cagtaagcca 3841 tatataaata tgattggggg tgtttatttt aaaatctatt atggaaattg agagactgac 3901 ctaatctggg agaaattaaa aattacagtt ttcactcgtt ttggatttgg tgttttctag 3961 ggtacctaac ctagatcagt ggttctcaaa cttaggtgga tgtcagaatc acctggggag 4021 cttagtgaat gcacagggca cagtccttcc acttcatgca cctggatctc tgaggtcttt 4081 gacaggtttc cggattaatc tgctatgcac aacagtgaga atcattgacc tatagttact 4141 catttgatgc atacaggaaa gactgaagta taaagtgata taattggtag attgatgata 4201 gagaggtcat agaaacagtc tcatcctcct ttagatgaga aaatagaagt tcagagaggt 4261 taagtagctg gctcaaggtc agaattattg catgcatgag attcaaaccc acctttttat 4321 gctgactcca caaccaggag tcttttcact 
atataatttc aagaattcta tagaagtaga 4381 tttaaagata tgtgatggac tccaccacat tatagcacaa ctagaaatgt aattgtaatt 4441 tttagcttca actgctgaag aagtaaatat tgtatattaa ggtaatacgg tccatttttt 4501 aaaggaatac ttttattttc actgaccatc atgacattag cagaatatcc tgatggctta 4561 tatgcctgaa attaattttg ctcttttctt tcccgatagg taactgactt gaatgtccaa 4621 cgcaaagcaa tacatgaact catccaagtg atggctgaac tgtcgccagc agctaaaaca 4681 gggaagcgaa aaaggagtca gatgctgttt cgaggtcgaa gagcatccca gtaatggttg 4741 tcctgcctgc aatatttgaa ttttaaatct aaatctattt attaatattt aacattattt 4801 atatggggaa tatattttta gactcatcaa tcaaataagt atttataata gcaacttttg 4861 tgtaatgaaa atgaatatct attaatatat gtattattta taattcctat atcctgtgac 4921 tgtctcactt aatcctttgt tttctgacta attaggcaag gctatgtgat tacaaggctt 4981 tatctcaggg gccaactagg cagccaacct aagcaagatc ccatgggttg tgtgtttatt 5041 tcacttgatg atacaatgaa cacttataag tgaagtgata ctatccagtt actgccggtt 5101 tgaaaatatg cctgcaatct gagccagtgc tttaatggca tgtcagacag aacttgaatg 5161 tgtcaggtga ccctgatgaa aacatagcat ctcaggagat ttcatgcctg gtgcttccaa 5221 atattgttga caactgtgac tgtacccaaa tggaaagtaa ctcatttgtt aaaattatca 5281 atatctaata tatatgaata aagtgtaagt tcacaactac ttatgctgtg ttggactttt 5341 tctaagtgag acctggagtg aaagaactac ctattaatga attagtaggg aggggagtct 5401 tcttagctgt gaaaatttta gagttgcatt tggttccatt aaatgtggta tttctttcca 5461 ctagcatttt gttggctttc gcttttccag ttagcagctc tttgaattat ctttctaaga 5521 tacagattta attatgtcac tattcaattc agaggttctg ctatggaatg tagtttaaac 5581 tgcttagctt ggcacacaga gatttatttc tagccccttc tccaccttcc tatttcctcc 5641 ttcgtttcag aatcttcctc tccctcatcc aatgctggca aacaccagtg ggggtggagt 5701 agtgggtgta agctctaggg agaaggcttg gattggaatc caagttattc cattacaagt 5761 agtgtgacct ttaatacatt atgtatattg tctaagtttc agctttattg tctgaaaaag 5821 aaaaataatt gtgtgttcct cataatattg tggtacgaat tgattctttc actcaagaaa 5881 tatttactgg agtacctact acatgcctgg tgctgttgta gaccttgaga taccttactc 5941 aagcaaaaca gccaaggatc c // LOCUS HSIGK12 2352 bp DNA PRI 16-FEB-1995 DEFINITION H.sapiens germ line pseudogene for 
immunoglobulin kappa light chain leader peptide and variable region (subgroup V kappa I). ACCESSION Z00010 K02093 X00903 NID g33146 KEYWORDS germ line; Ig light chain; immunoglobulin; pseudogene; signal peptide; variable region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2352) AUTHORS Pech,M., Jaenichen,H.R., Pohlenz,H.D., Neumaier,P.S., Klobeck,H.G. and Zachau,H.G. TITLE Organization and evolution of a gene cluster for human immunoglobulin variable regions of the kappa type JOURNAL J. Mol. Biol. 176 (2), 189-204 (1984) MEDLINE 84267839 REFERENCE 2 (bases 1491 to 2192) AUTHORS Straubinger,B., Pech,M., Muhlebach,K., Jaenichen,H.R., Bauer,H.G. and Zachau,H.G. TITLE Molecular footprints of human immunoglobulin gene evolution: a new sequence family JOURNAL Nucleic Acids Res. 12 (13), 5265-5275 (1984) MEDLINE 84272216 REFERENCE 3 (bases 1 to 2352) AUTHORS Zachau,H.G. TITLE Direct Submission JOURNAL Submitted (01-AUG-1984) Prof. H.G. Zachau, Universitaet Muenchen, Goethestrasse 33, 8000 Muenchen 2 FEATURES Location/Qualifiers source 1..2352 /organism="Homo sapiens" /db_xref="taxon:9606" unsure 263 /note="may be C or T" misc_feature 299..313 /note="put. regulatory element pd" misc_feature 330..339 /note="put. 
regulatory element dc" TATA_signal 372..379 exon <427..481 /note="leader peptide" /number=1 CDS join(427..481,608..723) /codon_start=1 /product="immunoglobulin kappa light chain subgroup V" /db_xref="PID:g642240" /translation="MDMRVPAQLLGLLLLWLPGARCAIQLTQSPSSLSASVGDRVTIT CRASQGISSALA" intron 482..607 /number=1 exon 608..903 /number=2 misc_feature 906..912 /note="heptanucleotide box, not canonical" misc_feature 1581..2064 /note="L-sequence region" unsure 2257 /note="not unambiguously determined" unsure 2300 /note="not unambiguously determined" BASE COUNT 685 a 497 c 510 g 657 t 3 others ORIGIN 1 cgggatcata tgagagtctt ttggagacct gataatcata ccgtctaaca ttttattata 61 tatttcctac aaacaagaat attctcctaa ataatcccca tacaccaatg aaatacatta 121 ctccatcaac tcctgaggaa tatttcaaat tgtcaaaaaa aacctaaaaa atgtctctca 181 taataaaata gttcccagta gaaacacatt ctctggagac aaatttgtgc taccctggtc 241 ttacctggga cacctgggga caytgaactg gtgctgagtt actgagatga gccagccctg 301 cagctgtgcc cagcctgccc atcctctgct catttgcata ttcccagaac acaacctcct 361 gccctgaaga cttcttaata ggctggtcac acttcttgca ggagtcagac ccactcagga 421 cacagcatgg acatgagggt ccccgctcag ctcctggggc ttctgctgct ctggctccca 481 ggtaaggaag gagaacacta gcagtttact cagcccaggg ggctcagtac agcctggcta 541 ttcagggaaa ttctcttact acatgattaa ttgtgtggac catttgtgtt tatgcttcca 601 atctcaggtg ccagatgtgc catccagttg acccagtctc catcctccct gtctgcatct 661 gtaggagaca gagtcaccat cacttgccgg gcaagtcagg gcattagcag tgctttagcc 721 tgatatcagc agaaaccagg gaaagctcct aagctcctga tctatgatgc ctccagtttg 781 gaaagtgggg tcccatcaag gttcagcggc agtggatctg ggacagattt cactctcacc 841 atcagcagcc tgcagcctga agattttgca acttattact gtcaacagtt taataattac 901 cctcacatag tgttacaaac ccgaacataa acccccaggg aagcagatgt gtgagactgg 961 gccgccccag ctgcttctcc tgatgcctcc attggctgag agtgttcctc agatgcagcc 1021 acactctgat ggtgttggta gaggaggata tgagatcacc tctgcatccc aatttctttt 1081 tcttttctca gccccagctg cacagacatt acaatgcctc tgctgattta ataaagatag 1141 agatcatgac acctgaagag tctagtttat ggctttggtt agaattcata taacagagaa 
1201 gaagccatta tagatattct aagcaggaat agtcttaata gatagaatta gagtctaaag 1261 tattgaagtc taaataaaat gtacagataa atttaatgtt ttatttgcta agaaattttt 1321 gccaaatggg gcatacagga aaactcaatg gtcttcaata tgttggaaga gcaaagagtt 1381 ttattaaaag ggaaattatt acctattgtt ctttgagaaa ttttgttggc tgtagtaagg 1441 gttgggagct ggcaagctca gactggtaag cagtggtggt caaactgaat cctagaatta 1501 tattaagtta tctcagaagt tgtgggtaaa tttgctttca ggttacaata agccaaagca 1561 gtgaagcttg cagagaattt tgttactgaa atgccaggga ttcagtatag atcctgcggc 1621 tcaccacaca gaaagccaat cactaagaca acaagtgttg tcaaagaaca ggctttaatc 1681 aggtgctgca gccgaggaga cgggacacca ttctcaaatg tatctccctg acaaaataaa 1741 ttaggcgttt atatagcagg gaagaaatgt ggaaaacagg aattagagag gggtaaggaa 1801 gataatttag tcaacaggaa gctggaggtc agttaggcaa tcataatggg tgaagggtct 1861 gatgtctcac tgtccccatt cagtgatata taactttcag ctccttgata gtatctagag 1921 gcctgatggt tggtttcctg aaaaaagaac tcagattaac aaatgtaact accttgagtt 1981 ttaagactgg gggagtcagt ttctatgttt attcaaaaaa tcataaacat tagttccatg 2041 ggataatagg gtctatttca attgcattct tcagacaata ttttgcaccc tgagtggttt 2101 tccctcctgg tttcttggct ctgttgggta tgtcaagaat gaccgaattc tatgattaac 2161 ttttacacta caacctttca aagccaagga tatagtagtc aggcaggttg acattagaag 2221 caggattctc tggtactccc tcagaaaata gaatgcntct cgccactgaa gtatgggcta 2281 tctaaccatg tggtcctcan tcctgtctga aagcttaagg gtggggttgc agctgctctc 2341 agcttcctat ag // LOCUS HSIL05 6684 bp DNA PRI 03-JAN-1991 DEFINITION Human interleukin-2 (IL-2) gene and 5'-flanking region. ACCESSION X00695 X00200 X00201 X00202 NID g33783 KEYWORDS growth factor; interleukin; T-cell growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6684) AUTHORS Holbrook,N.J., Lieber,M. and Crabtree,G.R. TITLE DNA sequence of the 5' flanking region of the human interleukin 2 gene: homologies with adult T-cell leukemia virus JOURNAL Nucleic Acids Res. 
12 (12), 5005-5013 (1984) MEDLINE 84247353 REFERENCE 2 (bases 1 to 6684) AUTHORS Degrave,W., Tavernier,J., Duerinck,F., Plaetinck,G., Devos,R. and Fiers,W. TITLE Cloning and structure of the human interleukin 2 chromosomal gene JOURNAL EMBO J. 2 (12), 2349-2353 (1983) MEDLINE 84131950 REFERENCE 3 (bases 1 to 6684) AUTHORS Taniguchi,T., Matsui,H., Fujita,T., Takaoka,C., Kashima,N., Yoshimoto,R. and Hamuro,J. TITLE Structure and expression of a cloned cDNA for human interleukin-2 JOURNAL Nature 302 (5906), 305-310 (1983) MEDLINE 83167472 FEATURES Location/Qualifiers source 1..6684 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 1339..1344 /note="TATA-box" precursor_RNA 1363..6403 /note="primary transcript" CDS join(1416..1562,1653..1712,4005..4148,6010..6120) /codon_start=1 /product="interleukin" /db_xref="PID:g33784" /db_xref="SWISS-PROT:P01585" /translation="MYRMQLLSCIALSLALVTNSAPTSSSTKKTQLQLEHLLLDLQMI LNGINNYKNPKLTRMLTFKFYMPKKATELKHLQCLEEELKPLEEVLNLAQSKNFHLRP RDLISNINVIVLELKGSETTFMCEYADETATIVEFLNRWITFCQSIISTLT" intron 1563..1652 /note="intron I" intron 1713..4004 /note="intron II" intron 4149..6009 /note="intron III" misc_feature 6382..6387 /note="polyadenylation signal" BASE COUNT 2342 a 1113 c 1064 g 2165 t ORIGIN 1 agtggttttt ggagtcagta cattctcttt tcaaatcctt ctctgcccct tactggcaat 61 aagggctgag tgacctagag gcaaattact taacttctct gagcctcagt tttctaatct 121 gcaaaatagg agccatcact tcacaagtct gtaagactta tattagacta agtgcctgcc 181 tgtacactgt tctctttttc tctctttcta tatacctgaa ggcattatag tgctagatgt 241 ctgtttaaag accagacaat attgtcttaa aaaaacaaac aaaaacacag acaataccat 301 ctttaaaaaa aaaaaaaaag tccaggtaag aaataaataa ggccatagaa tggaagcttt 361 acaaggactc tctttgagac aggatctcct caagtgtccc caggttaaat tagaagtata 421 tatccgtaca attgttcagc cagtttgtgc actgtactga ggatgaatga acacctatcc 481 taaatatcct agtcttctga ctaaaaacaa gatcatattt cataacgatt attgttacat 541 tcatagtgtc ccaggtgatt tagaggataa ataaaaatcc attaaagagg taaagacata 601 aaaacgagaa acatggactg gtttacacat aacacataca 
aagtctatta taaaactagc 661 atcagtatcc ttgaatcgaa acctttttct gagtatttaa caatcgcacc ctttaaaaaa 721 tgtacataga cattaagaga cttaaacaga tatataatca ttttaaatta aaatagcgtt 781 aaacagtacc tcaagctcaa taagcatttt aagtattcta atcttagtat ttctctagct 841 gacatgtaag aagcaatcta tcttattgta tgcaattagc tctttgtgtg gataaaaagg 901 taaaaccatt ctgaaacagg aaaccaatac acttcctgtt taatcaacaa atctaaacat 961 ttattctttt catctgttta ctcttgctct tgtccaccac aatatgctat tcacatgttc 1021 agtgtagttt tatgacaaag aaaattttct gagttacttt tgtatcccca cccccttaaa 1081 gaaaggagga aaaactgttt catacagaag gcgttaattg catgaattag agctatcacc 1141 taagtgtggg ctaatgtaac aaagagggat ttcacctaca tccattcagt cagtctttgg 1201 gggtttaaag aaattccaaa gagtcatcag aagaggaaaa atgaaggtaa tgttttttca 1261 gactggtaaa gtctttgaaa atatgtgtaa tatgtaaaac attttgacac ccccataata 1321 tttttccaga attaacagta taaattgcat ctcttgttca agagttccct atcactcttt 1381 aatcactact cacagtaacc tcaactcctg ccacaatgta caggatgcaa ctcctgtctt 1441 gcattgcact aagtcttgca cttgtcacaa acagtgcacc tacttcaagt tctacaaaga 1501 aaacacagct acaactggag catttactgc tggatttaca gatgattttg aatggaatta 1561 atgtaagtat atttcctttc ttactaaaat tattacattt agtaatctag ctggagatca 1621 tttcttaata acaatgcatt atactttctt agaattacaa gaatcccaaa ctcaccagga 1681 tgctcacatt taagttttac atgcccaaga aggtaagtac aatattttat gttcaatttc 1741 tgttttaata aaattcaaag taatatgaaa atttgcacag atgggactaa tagcagctca 1801 tctgaggtaa agagtaactt taatttgttt ttttgaaaac ccaagtttga taatgaagcc 1861 tctattaaaa cagttttacc tatattttta atatatattt gtgtgttggt gggggtggga 1921 gaaaacataa aaataatatt ctctcacttt atcgataaga caattctaaa caaaaatgtt 1981 catttatggt ttcatttaaa aatgtaaaac tctaaaatat ttgattatgt cattttagta 2041 tgtaaaatac caaaatctat ttccaaggag cccactttta aaaatctttt cttgttttag 2101 gaaaggtttc taagtgagag gcagcataac actaatagca cagagtctgg ggccagatat 2161 ctgaagtgaa atctcagctc tgccatgtcc tagctttcat gatctttggc aaattaccta 2221 ctctgtttgt gattcagttt catgtctact taaatgaata actgtatata cttaatatgg 2281 ctttgtgaga attagtaagt aaatgtaaag cactcagaac cgtgtctggc 
ataaggtaaa 2341 taccatacaa gcattagcta ttattagtag tattaaagat aaaattttca ctgagaaata 2401 caaagtaaaa ttttggactt tatcttttta ccaatagaac ttgagattta taatgctata 2461 tgacttattt tccaagatta aaagcttcat taggttgttt ttggattcag atagagcata 2521 agcataatca tccaagctcc taggctacat taggtgtgta aagctaccta gtagctgtgc 2581 cagttaagag agaatgaaca aaatctggtg ccagaaagag cttgtgccag ggtgaatcca 2641 agcccagaaa ataataggat ttaaggggac acagatgcaa tcccattgac tcaaattcta 2701 ttaattcaag acaaatctgc ttctaactac ccttctgaaa gatgtaaagg agacagctta 2761 cagatgttac tctagtttaa tcagagccac ataatgcaac tccagcaaca taaagatact 2821 agatgctgtt ttctgaagaa aatttctcca cattgttcat gccaaaaact taaacccgaa 2881 tttgtagaat ttgtagtggt gaattgaaag cgcaatagat ggacatatca ggggattggt 2941 attgtcttga cctacctttc ccactaaaga gtgttagaaa gatgagatta tgtgcataat 3001 ttaggggtgg tagaattcat ggaaatctaa gtttgaaacc aaaagtaatg ataaactcta 3061 ttcatttgtt catttaaccc tcattgcaca tttacaaaag attttagaaa ctaataaaaa 3121 tatttgattc caaggatgct atgttaatgc tataatgaga aagaaatgaa atctaattct 3181 ggctctacct acttatgtgg tcaaattctg agatttagtg tgcttattta taaagtggag 3241 atgatacttc actgcctact tcaaaagatg actgtgagaa gtaaatgggc ctattttgga 3301 gaaaattctt ttaaattgta atataccata gaaatatgaa atattatata taatatagaa 3361 tcaagaggcc tgtccaaaag tcctcccaaa gtattataat cttttatttc actgggacaa 3421 acatttttaa aatgcatctt aatgtagtga ttgtagaaaa gtaaaaattt aagacatatt 3481 taaaaatgtg tcttgctcaa ggctatattg agagccacta ctacatgatt attgttacct 3541 agtgtaaaat gttgggattg tgatagatgg catccaagag ttccttctct ctcaacattc 3601 tgtgattctt aactcttaga ctatcaaata ttataatcat agaatgtgat ttttatgcct 3661 tccacattct aatctcatct ggttctaatg attttctatg cagattggaa aagtaatcag 3721 cctacatctg taataggcat ttagatgcag aaagtctaac attttgcaaa gccaaattaa 3781 gctaaaacca gtgagtcaac tatcacttaa cgctagtcat aggtacttga gccctagttt 3841 ttccagtttt ataatgtaaa ctctactggt ccatctttac agtgacattg agaacagaga 3901 gaatggtaaa aactacatac tgctactcca aataaaataa attggaaatt aatttctgat 3961 tctgacctct atgtaaactg agctgatgat aattattatt ctaggccaca gaactgaaac 
4021 atcttcagtg tctagaagaa gaactcaaac ctctggagga agtgctaaat ttagctcaaa 4081 gcaaaaactt tcacttaaga cccagggact taatcagcaa tatcaacgta atagttctgg 4141 aactaaaggt aaggcattac tttatttgct ctcctggaaa taaaaaaaaa aaagtagggg 4201 gaaaagtacc acattttaaa gtgacataac atttttggta tttgtaaagt acccatgcat 4261 gtaattagcc tacattttaa gtacactgtg aacatgaatc atttctaatg ttaaatgatt 4321 aactggggag tataagctac tgagtttgca cctaccatct actaatggac aagcctcatc 4381 ccaaactcca tcacctttca tattaacaca aaactgggag tgagagagaa gtgactgagt 4441 tgagtttcac agaaacgcag gcaagatttt attatatatt tttcaagttc cttcacagat 4501 catttactgg aatagccaat actgagttac ctgaaaggct tttcaaatgg tgtttcctta 4561 tcatttgatg gaaggactac ccataagaga tttgtcttaa aaaaaaaaac tggagccatt 4621 aaaatggcca gtggactaaa caaacaacaa tctttttaga ggcaatccca ctttcagaat 4681 cttaagtatt tttaaatgca caggaagcat aaaatatgca agggactcag gtgatgtaaa 4741 agagattcac ttttgtcttt ttatatcccg tctcctaagg tataaaattc atgagttaat 4801 aggtatccta aataagcagc ataagtatag tagtaaaaga cattcctaaa agtaactcca 4861 gttgtgtcca aatgaatcac ttattagtgg actgtttcag ttgaattaaa aaaatacatt 4921 gagatcaatg tcatctagac attgacagat tcagttcctt atctatggca agagttttac 4981 tctaaaataa ttaacatcag aaaactcatt cttaactctt gatacaaatt taagacaaaa 5041 ccatgcaaaa atctgaaaac tgtgtttcaa aagccaaaca ctttttaaaa taaaaaaatc 5101 ccaagatatg acaatattta aacaattatg cttaagagga tacagaacac tgcaacagtt 5161 ttttaaaaga gaatacttat ttaaagggaa cactctatct cacctgcttt tgttcccagg 5221 gtaggaatca cttcaaattt gaaaagctct cttttaaatc tcactatata tcaaaatagt 5281 tgcctcctta gcttatcaac tagaggaagc gtttaaatag ctcctttcag cagagaagcc 5341 taatttctaa aaagccagtc cacagaacaa aatttctaat gtttaaagct tttaaaagtt 5401 ggcaaattca cctgcattga tactatgatg gggtagggat aggtgtaagt atttatgaag 5461 atgttcattc acacaaattt acccaaacag gaagcatgtc ctacctagct tactctagtg 5521 tagctcgttt cgtctttggg gaaaatataa ggagattcac ttaagtagaa aaataggaga 5581 ctctaatcaa gatttagaaa agaagaaagt ataatgtgca tatcaattca tacatttaac 5641 ttacacaaat ataggtgtac attcagagga aaagcgatca agtttatttc acatccagca 5701 
tttaatattt gtctagatct atttttattt aaatctttat ttgcacccaa tttagggaaa 5761 aaatttttgt gttcattgac tgaattaaca aatgaggaaa atctcagctt ctgtgttact 5821 atcatttggt atcataacaa aatacgcaat tttggcattc attttgatca tttcaagaaa 5881 atgtgaataa ttaatatgtt tggtaagctt gaaaataaag gcaacaggcc tataagactt 5941 caattgggaa taactgtata taaggtaaac tactctgtac tttaaaaaat taacattttt 6001 cttttatagg gatctgaaac aacattcatg tgtgaatatg ctgatgagac agcaaccatt 6061 gtagaatttc tgaacagatg gattaccttt tgtcaaagca tcatctcaac actgacttga 6121 taattaagtg cttcccactt aaaacatatc aggccttcta tttatttaaa tatttaaatt 6181 ttatatttat tgttgaatgt atggtttgct acctattgta actattattc ttaatcttaa 6241 aactataaat atggatcttt tatgattctt tttgtaagcc ctaggggctc taaaatggtt 6301 tcacttattt atcccaaaat atttattatt atgttgaatg ttaaatatag tatctatgta 6361 gattggttag taaaactatt taataaattt gataaatata aacaagcctg gatatttgtt 6421 attttggaaa cagcacagag taagcattta aatatttctt agttacttgt gtgaactgta 6481 ggatggttaa aatgcttaca aaagtcactc tttctctgaa gaaatatgta gaacagagat 6541 gtagacttct caaaagccct tgctttgtcc tttcaagggc tgatcagacc cttagttctg 6601 gcatctctta gcagattata ttttccttct tcttaaaatg ccaaacacaa acactcttga 6661 aactcttcat agatttggtg tggc // LOCUS HSIL1AG 11970 bp DNA PRI 24-APR-1993 DEFINITION Human gene for interleukin 1 alpha (IL-1 alpha). ACCESSION X03833 NID g33785 KEYWORDS Alu repetitive sequence; interleukin 1 alpha; inverted repeat; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11970) AUTHORS Furutani,Y., Notake,M., Fukui,T., Ohue,M., Nomura,H., Yamada,M. and Nakamura,S. TITLE Complete nucleotide sequence of the gene for human interleukin 1 alpha JOURNAL Nucleic Acids Res. 14 (8), 3167-3179 (1986) MEDLINE 86205226 REMARK Erratum:[Nucleic Acids Res 1986 Jun 25;14(12):5124]] COMMENT Data kindly reviewed (10-NOV-1986) by Y. Furutani. 
FEATURES Location/Qualifiers source 1..11970 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 254..393 /note="Alu repetitive sequence" repeat_region 1375..1382 /note="direct repeat 1" misc_signal 1375..1390 /note="pot. transcription activator (seq. homolog. to adenovirus 2 major late promoter transcription factor (MLTF) binding site)" repeat_unit 1375..1382 /note="inverted repeat A" repeat_unit 1383..1390 /note="inverted repeat A'" repeat_region 1383..1390 /note="direct repeat 1" TATA_signal 1407..1413 prim_transcript 1438..11643 exon 1438..1488 /number=1 mRNA join(1438..1488,2153..2207,3166..3214,4103..4325, 6262..6432,7815..7939,10290..11643) intron 1489..2152 /number=1 exon 2153..2207 /number=2 CDS join(2161..2207,3166..3214,4103..4325,6262..6432, 7815..7939,10290..10490) /codon_start=1 /product="IL-1-alpha" /db_xref="PID:g33786" /db_xref="SWISS-PROT:P01583" /translation="MAKVPDMFEDLKNCYSENEEDSSSIDHLSLNQKSFYHVSYGPLH EGCMDQSVSLSISETSKTSKLTFKESMVVVATNGKVLKKRRLSLSQSITDDDLEAIAN DSEEEIIKPRSSPFSFLSNVKYNFMRIIKYEFILNDALNQSIIRANDQYLTAAALHNL DEAVKFDMGAYKSSKDDAKITVILRISKTQLYVTAQDEDQPVLLKEMPEIPKTITGSE TNLLFFWETHGTKNYFTSVAHPNLFIATKQDYWVCLAGGPPSITDFQILENQA" intron 2208..3165 /number=2 exon 3166..3214 /number=3 intron 3215..4102 /number=3 exon 4103..4325 /number=4 intron 4326..6261 /number=4 misc_feature 4893..5174 /note="Alu repetitive sequence" exon 6262..6432 /number=5 intron 6433..7814 /number=5 misc_feature 7695..7744 /note="poly [dA-dC] tract" exon 7815..7939 /number=6 intron 7940..10289 /number=6 misc_feature 8466..8483 /note="poly[dA-dC] tract" repeat_region 8912..9137 /note="5 x 46 bp repeat" misc_feature 9770..9806 /note="poly [dT-dG] tract" exon 10290..11643 /number=7 polyA_site 11643 /note="polyA site" misc_feature 11863..11970 /note="Alu repetitive sequence" BASE COUNT 3708 a 2489 c 2226 g 3547 t ORIGIN 1 aagcttctac cctagtctgg tgctacactt acattgctta catccaagtg tggttatttc 61 tgtggctcct gttataacta ttatagcacc aggtctatga ccaggagaat tagactggca 121 ttaaatcaga 
ataagagatt ttgcacctgc aatagacctt atgacaccta accaacccca 181 ttatttacaa ttaaacagga acagagggaa tactttatcc aactcacaca agctgttttc 241 ctcccagatc catgcttttt tgcgtttatt attttttaga gatgggggct tcactatgtt 301 gcccacactg gactaaaact ctgggcctca agtgattgtc ctgcctcagc ctcctgaata 361 gctgggacta caggggcatg ccatcacacc tagttcattt cctctattta aaatatacat 421 ggcttaaact ccaactggga acccaaaaca ttcatttgct aagagtctgg tgttctacca 481 cctgaactag gctggccaca ggaattataa aagctgagaa attctttaat aatagtaacc 541 aggcaacatc attgaaggct catatgtaaa aatccatgcc ttcctttctc ccaatctcca 601 ttcccaaact tagccactgg ttctggctga ggccttacgc atacctcccg gggcttgcac 661 acaccttctt ctacagaaga cacaccttgg gcatatccta cagaagacca ggcttctctc 721 tggtccttgg tagagggcta ctttactgta acagggccag ggtggagagt tctctcctga 781 agctccatcc cctctatagg aaatgtgttg acaatattca gaagagtaag aggatcaaga 841 cttctttgtg ctcaaatacc actgttctct tctctaccct gccctaacca ggagcttgtc 901 accccaaact ctgaggtgat ttatgcctta atcaagcaaa cttccctctt cagaaaagat 961 ggctcatttt ccctcaaaag ttgccaggag ctgccaagta ttctgccaat tcaccctgga 1021 gcacaatcaa caaattcagc cagaacacaa ctacagctac tattagaact attattatta 1081 ataaattcct ctccaaatct agccccttga cttcggattt cacgatttct cccttcctcc 1141 tagaaacttg ataagtttcc cgcgcttccc tttttctaag actacatgtt tgtcatctta 1201 taaagcaaag gggtgaataa atgaaccaaa tcaataactt ctggaatatc tgcaaacaac 1261 aataatatca gctatgccat ctttcactat tttagccagt atcgagttga atgaacatag 1321 aaaaatacaa aactgaattc ttccctgtaa attccccgtt ttgacgacgc acttgtagcc 1381 acgtagccac gcctacttaa gacaattaca aaaggcgaag aagactgact caggcttaag 1441 ctgccagcca gagagggagt catttcattg gcgtttgagt cagcaaaggt attgtcctca 1501 catctctggc tattaaagta ttttctgttg ttgtttttct ctttggctgt tttctctcac 1561 attgccttct ctaaagctac agtctctcct ttcttttctt gtccctccct ggtttggtat 1621 gtgacctaga attacagtca gatttcagaa aatgattctc tcattttgct gataaggact 1681 gattcgtttt actgagggac ggcagaacta gtttcctatg agggcatggg tgaatacaac 1741 tgaggcttct catgggaggg aatctctact atccaaaatt attaggagaa aattgaaaat 1801 ttccaactct gtctctctct tacctctgtg 
taaggcaaat accttattct tgtggtgttt 1861 ttgtaacctc ttcaaacttt cattgattga atgcctgttc tggcaataca ttaggttggg 1921 cacataagga ataccaacat aaataaaaca ttctaaaaga agtttacgat ctaataaagg 1981 agacaggtac atagcaaact aattcaaagg agctagaaga tggagaaaat gctgaatgtg 2041 gactaagtca ttcaacaaag ttttcaggaa gcacaaagag gaggggctcc cctcacagat 2101 atctggatta gaggctggct gagctgatgg tggctggtgt tctctgttgc agaagtcaag 2161 atggccaaag ttccagacat gtttgaagac ctgaagaact gttacaggta aggaataaga 2221 tttatctctt gtgatttaat gagggtttca aggctcacca gaatccagct aggcataaca 2281 gtggccagca tgggggcagg ccggcagagg ttgtagagat gtgtactagt cctgaagtca 2341 gagcaggttc agagaagacc cagaaaaact aagcattcag catgttaaac tgagattaca 2401 ttggcaggga gaccgccatt ttagaaaaat tatttttgag gtctgctgag ccctacatga 2461 atatcagcat caacttagac acagcctctg ttgagatcac atgccctgat ataagaatgg 2521 gttttactgg tccattctca ggaaaacttg atctcattca ggaacaggaa atggctccac 2581 agcaagctgg gcatgtgaac tcacatatgc aggcaaatct cactcagatg tagaagaaag 2641 gtaaatgaac acaaagataa aattacggaa catattaaac taacatgatg tttccattat 2701 ctgtagtaaa tactaacaca aactaggctg tcaaaatttt gcctggatat tttactaagt 2761 ataaattatg aaatctgttt tagtgaatac atgaaagtaa tgtgtaacat ataatctatt 2821 tggttaaaat aaaaaggaag tgcttcaaaa cctttctttt ctctaaagga gcttaacatt 2881 cttccctgaa cttcaattaa agctcttcaa tttgttagcc aagtccaatt tttacagata 2941 aagcacaggt aaagctcaaa gcctgtcttg atgactacta attccagatt agtaagatat 3001 gaattactct acctatgtgt atgtgtagaa gtccttaaat ttcaaagatg acagtaatgg 3061 ccatgtgtat gtgtgtgacc cacaactatc atggtcatta aagtacattg gccagagacc 3121 acatgaaata acaacaatta cattctcatc atcttatttt gacagtgaaa atgaagaaga 3181 cagttcctcc attgatcatc tgtctctgaa tcaggtaagc aaatgactgt aattctcatg 3241 ggactgctat tcttacacag tggtttcttc atccaaagag aacagcaatg acttgaatct 3301 taaatacttt tgttttaccc tcactagaga tccagagacc tgtctttcat tataagtgag 3361 accagctgcc tctctaaact aatagttgat gtgcattggc ttctcccaga acagagcaga 3421 actatcccaa atccctgaga actggagtct cctggggcag gcttcatcag gatgttagtt 3481 atgccatcct gagaaagccc cgcaggccgc ttcaccaggt 
gtctgtctcc taacgtgatg 3541 tgttgtggtt gtcttctctg acaccagcat cagaggttag agaaagtctc caaacatgaa 3601 gctgagagag aggaagcaag ccagctgaaa gtgagaagtc tacagccact catcaatctg 3661 tgttattgtg tttggagacc acaaatagac actataagta ctgcctagta tgtcttcagt 3721 actggcttta aaagctgtcc ccaaaggagt atttctaaaa tattttgagc attgttaagc 3781 agatttttaa cctcctgaga gggaactaat tggaaagcta ccactcacta caatcattgt 3841 taacctattt agttacaaca tctcattttt gagcatgcaa ataaatgaaa aagtcttcct 3901 aaaaaaatca tctttttatc ctggaaggag gaaggaaggt gagacaaaag ggagagaggg 3961 agggaagcct aatgaaacac cagttaccta agaccagaat ggagatcctc ctcactacct 4021 ctgttgaata cagcacctac tgaaagaact ttcattccct gaccatgaac agcctctcag 4081 cttctgtttt ccttcctcac agaaatcctt ctatcatgta agctatggcc cactccatga 4141 aggctgcatg gatcaatctg tgtctctgag tatctctgaa acctctaaaa catccaagct 4201 taccttcaag gagagcatgg tggtagtagc aaccaacggg aaggttctga agaagagacg 4261 gttgagttta agccaatcca tcactgatga tgacctggag gccatcgcca atgactcaga 4321 ggaaggtaag gggtcaagca caataatatc tttcttttac agttttaagc aagtagggac 4381 agtagaattt aggggaaaat taaacgtgga gtcagaataa caagaagaca accaagcatt 4441 agtctggtaa ctatacagag gaaaattaat ttttatcctt ctccaggagg gagaaatgag 4501 cagtggcctg aatcgagaat acttgctcac agccattatt tcttagccat attgtaaagg 4561 tcgtgtgact tttagccttt caggagaaag cagtaataag accacttacg agctatgttc 4621 ctctcatact aactatgcct ccttggtcat gttacataat cttttcgtga ttcagtttcc 4681 tctactgtaa aatggagata atcagaatcc cccactcatt ggattgttgt aaagattaag 4741 agtctcaggc tttacagact gagctagctg ggccctcctg actgttataa agattaaatg 4801 agtcaacatc ccctaacttc tggactagaa taatgtctgg tacaaagtaa gcacccaata 4861 aatgttagct attactatca ttattattat tattttattt tttttttttg agatggagtc 4921 tggctctgtc acccaggctg gagtgcagtg gcacaatctc ggctcactgc aagctctgcc 4981 tcctgggttc atgccattct cctgcctcag cctcccgagt aagctgggaa tacaggcacc 5041 cgccactgtt cccggctaat tttttgtatt tttagtagag acggagtttc accgtggtct 5101 ccatctcctc gtgatccacc caccttggcc tcccaaagtg ccgggattac aggcgtgagc 5161 caccgcgccc ggcctattat tattattatt actactacta ctacctatat 
gaatactacc 5221 agcaatacta atttattaat gactggatta tgtctaaacc tcacaagaat cctaccttct 5281 cattttacat aaaaggaaac taagctcatt gagataggta aactgcccaa tggcatacat 5341 ctgtaagtgg gagagcctca aatctaattc agttctacct gagtaaaaaa atcatggttt 5401 ctcctccatc cctttactgt acaagcctcc acatgaacta taaacccaat attcctgttt 5461 ttaagataat acctaagcaa taacgcatgt tcacctagaa ggttttaaaa tgtaacaaaa 5521 tataagaaaa taaaaatcac tcatatcgtc agtgagagtt tactactgcc agcactatgg 5581 tatgtttcct taaaatcttt gctatacaca tacctacatg tgaacaaata tgtctaacat 5641 caagaccaca ctatttacaa ctttatatcc agcttttctt acttagcaat gtattgagga 5701 cattttagag tgcccgtttt tcaccattat aagcaatgca acaatgaaca tctgtataaa 5761 taaatattca tttctctcac cctttatttc cttagaatat attcctagaa gtagaatttc 5821 ccagagccat gaggatttgt gacgctattg atatgtgcca ctttgcactc tctgtgacat 5881 atataattat ttttaatgca ttcatttttt tctcagagtg cattcgtttg aaaacataga 5941 cgggaaatac tggtagtctt ccttgtcagt tagaaacacc caaacaatga aaaatgaaaa 6001 agttgcacaa atagtctcta aaaacaatga aactattgcc tgaggaattg aagtttaaaa 6061 agaagcacat aagcaacaac aaggataatc ctagaaaacc agttctgctg actgggtgat 6121 ttcacttctc tttgcttcct catctggatt ggaatattcc taataccccc tccagaacta 6181 ttttccctgt ttgtactaga ctgtgtatat catctgtgtt tgtacataga cattaatctg 6241 cacttgtgat catggtttta gaaatcatca agcctaggtc atcacctttt agcttcctga 6301 gcaatgtgaa atacaacttt atgaggatca tcaaatacga attcatcctg aatgacgccc 6361 tcaatcaaag tataattcga gccaatgatc agtacctcac ggctgctgca ttacataatc 6421 tggatgaagc aggtacatta aaatggcacc agacatttct gtcatcctcc cctcctttca 6481 tttacttatt tatttatttc aatctttctg cttgcaaaaa acatacctct tcagagttct 6541 gggttgcaca attcttccag aatagcttga agcacagcac ccccataaaa atcccaagcc 6601 agggcagaag gttcaactaa atctggaagt tccacaagag agaagtttcc tatctttgag 6661 agtaaagggt tgtgcacaaa gctagctgat gtactacctc tttggttctt tcagacattc 6721 ttaccctcaa ttttaaaact gaggaaactg tcagacatat taaatgattt actcagattt 6781 acccagaagc caatgaagaa caatcactct cctttaaaaa gtctgttgat caaactcaca 6841 agtaacacca aaccaggaag atctttatta tctctgataa catatttgtg aggcaaaacc 
6901 tccaataagc tacaaatatg gcttaaagga tgaagtttag tgtccaaaaa cttttatcac 6961 acacatccaa ttttcatggc ggacatgttt tagtttcaac agtatacata ttttcaaagg 7021 tccagagagg caattttgca ataaacaagc aagacttttt ctgattggat gcacttcagc 7081 taacatgctt tcaactctac atttacaaat tattttgtgt tctatttttc tacttaatat 7141 tatttctgca attttcccaa tattgacatc gtgtatgtat ttgccatttt taatatcact 7201 agacaattca atcaggttgc tacgttggtc ccttgggttt actctaaata gcttgattgc 7261 aaatatcttt gtatatatta ttgttttttc tcctatcttg taatttcttt gagcacatcc 7321 caaagaggaa tgcctagatc aatgggcaca aataatttga cagctcttat taaacattat 7381 tctgtaagta aaaactgaac tacttttcag tatcactagc aacatatgag tgtatcagct 7441 tcctaaaccc ctccatgtta ggtcattatg aacttatgat ctaacaaatt acagggtctt 7501 atcccactaa tgaaattata agagattcaa cacttattca gccccgaagg attcattcaa 7561 cgtagaaaat tctaagaaca ttaaccaagt atttacctgc ctagtgagtg tggaagacat 7621 tgtgaaggac acaaagatgt atagaattcc attcctgact tccaggtatt tacaccatag 7681 gtggggacct aactacacac acacacacac acacacacac acacacacac accatgcaca 7741 cacaatctac atcaacactt gattttatac aaatacaatg aatttacttt ctttttggtt 7801 cttctcttca ccagtgaaat ttgacatggg tgcttataag tcatcaaagg atgatgctaa 7861 aattaccgtg attctaagaa tctcaaaaac tcaattgtat gtgactgccc aagatgaaga 7921 ccaaccagtg ctgctgaagg tcagttgtcc tttgtctcca acttaccttc atttacatct 7981 catatgtttg taaataagcc caataggcag acacctctaa caaggtgaca ctgtcctctt 8041 tccttcctac cacagccccc acctacccac cccactccca ttgattccag aggcgtgcct 8101 aggcaggatc tatgagaaaa tataacagag agtaagagga aaattacctt ctttcttttt 8161 cctttccctg cctgacctta ttcacctccc atcccagagc atccatttat tccattgatc 8221 tttactgaca tctattatct gacctacaca atactagaca ttaggacaat gtggcctgcc 8281 tccaagaaac tcaaataagc caactgagat cagagaggat taatcacctg ccaatgggca 8341 caaagcaaca agctgggagc caagtcccaa aatggggcct gctgcttcca gttcccctct 8401 ctctgcattg atgtcagcat tatccttcgt cccagtcctg tctccactac cactttcccc 8461 ctcaaacaca cacacacaca acagccttag atgttttctc cactgataag taggtgactc 8521 aatttgtaag tatataatcc aagaccttct attcccaagt agaatttatg tgcctgcctg 8581 
tgcttttcta cctggatcaa gtgatgtcta cagagtaggg cagtagcttc attcatgaac 8641 tcattcaaca agcattattc actgagagcc ttgtattttt caggcatagt gccaacagca 8701 gtgtggacag tggtgcatca aagcctctag tctcatagaa cttagtcttc tggaggatat 8761 ggaaaacaga caacccaaac aaccaacaaa agagcaagat gctgcaaaaa aaaaaaaaat 8821 gaatagggtg ctaagataga gaaaagtggg agagtgctat ttagacaaag tggtaaaaac 8881 aaagcccctt gtgagatgag agctgccgac agagggggcg ggtcatggtt gtgggttttt 8941 gggtaggaca ttcagaggag ggggcgggtc gtggttgtgg gtttttgggt aggacattca 9001 gaggaggggg cgggtcgtgg ttgtgggttt ttgggtagga cattcagagg agggggcggg 9061 tcgtggttgt gggtttttgg gtaggacatt cagaggaggg ggcgggtcgt ggttgtgggt 9121 ttttgggaca ttcagaggag tctgaatgca cccaggccta caacttcaag atggtaaagg 9181 acagctccaa ggatcagaag aagcattctt ggaactgggg cattttgaga aggaggaaaa 9241 atatgcagag actagtgctt gcagagcttg catttggatt tcatttgagg tacaatgaaa 9301 acccattaat gggtttcaca cagtgcaatg gcctgacctc acttatattt cctaaaatag 9361 aaaacagatc agaaggaagg caatagagaa gcagaaagtc caatgaggag gtttcacagc 9421 agtcatgggg gtggggtaag gaaaagaagt ggaaagaaac agacagaatt gggttatatt 9481 ttggagatag aaccaacaga aggaagagga gaaacaacat ttactgagaa gggaaaaagt 9541 aggagaggaa taggtttggg aaataaatcc tgctgacatt ggaaacccca aggaagcctc 9601 aaaagtatat ttacttgctt tagatttaaa agaataggaa agaagcatct caacttggaa 9661 tttgaaatct atttttccat aaaagtattg ttaaattcta ctcatactca caagaaaagt 9721 acattctaaa gagtatattg aaagagttta ctgatatact taggaatttt gtgtgtatgt 9781 gtgtgtgtgt atgtgtgtgt gtgtgtttaa ccttcaattg ttgacttaaa tactgagata 9841 aatgtcatct aaatgctaaa ttgatttccc aaaggtatga tttgttcact tggagatcaa 9901 aatgtttagg gggcttagaa tcactgtagt gctcagattt gatgcaaaat gtcttaggcc 9961 tatgttgaag gcaggacaga aacaatgttt ccctcctacc tgcctggata cagtaagata 10021 ctagtgtcac tgacaatctt cataactaat ttagatctct ctccaatcaa ctaaggaaat 10081 caactcttat taatagactg ggccacacat ctactaggca tgtaataaat gcttgctgaa 10141 tgaacaaatg aatgaagagc ctatagcatc atgttacagc catagtccta aagtggtgtt 10201 tctcatgaag gccaaatgct aagggattga gcttcagtcc tttttctaac atcttgttct 10261 
ctaacagaat tctcttcttt tcttcatagg agatgcctga gatacccaaa accatcacag 10321 gtagtgagac caacctcctc ttcttctggg aaactcacgg cactaagaac tatttcacat 10381 cagttgccca tccaaacttg tttattgcca caaagcaaga ctactgggtg tgcttggcag 10441 gggggccacc ctctatcact gactttcaga tactggaaaa ccaggcgtag gtctggagtc 10501 tcacttgtct cacttgtgca gtgttgacag ttcatatgta ccatgtacat gaagaagcta 10561 aatcctttac tgttagtcat ttgctgagca tgtactgagc cttgtaattc taaatgaatg 10621 tttacactct ttgtaagagt ggaaccaaca ctaacatata atgttgttat ttaaagaaca 10681 ccctatattt tgcatagtac caatcatttt aattattatt cttcataaca attttaggag 10741 gaccagagct actgactatg gctaccaaaa agactctacc catattacag atgggcaaat 10801 taaggcataa gaaaactaag aaatatgcac aatagcagtt gaaacaagaa gccacagacc 10861 taggatttca tgatttcatt tcaactgttt gccttctgct tttaagttgc tgatgaactc 10921 ttaatcaaat agcataagtt tctgggacct cagttttatc attttcaaaa tggagggaat 10981 aatacctaag ccttcctgcc gcaacagttt tttatgctaa tcagggaggt cattttggta 11041 aaatacttct cgaagccgag cctcaagatg aaggcaaagc acgaaatgtt attttttaat 11101 tattatttat atatgtattt ataaatatat ttaagataat tataatatac tatatttatg 11161 ggaacccctt catcctctga gtgtgaccag gcatcctcca caatagcaga cagtgttttc 11221 tgggataagt aagtttgatt tcattaatac agggcatttt ggtccaagtt gtgcttatcc 11281 catagccagg aaactctgca ttctagtact tgggagacct gtaatcatat aataaatgta 11341 cattaattac cttgagccag taattggtcc gatctttgac tcttttgcca ttaaacttac 11401 ctgggcattc ttgtttcatt caattccacc tgcaatcaag tcctacaagc taaaattaga 11461 tgaactcaac tttgacaacc atgagaccac tgttatcaaa actttctttt ctggaatgta 11521 atcaatgttt cttctaggtt ctaaaaattg tgatcagacc ataatgttac attattatca 11581 acaatagtga ttgatagagt gttatcagtc ataactaaat aaagcttgca acaaaattct 11641 ctgacacata gttattcatt gccttaatca ttattttact gcatggtaat tagggacaaa 11701 tggtaaatgt ttacataaat aattgtattt agtgttactt tataaaatca aaccaagatt 11761 ttatattttt ttctcctctt tgttagctgc cagtatgcat aaatggcatt aagaatgata 11821 atatttccgg gttcacttaa agctcatatt acacatacac aaaacatgtg ttcccatctt 11881 tatacaaact cacacataca gagctacatt aaaaacaact aataggccag 
gcacggtggc 11941 tcagacctgt aatcccagca ctttgggagg // LOCUS HSIL1RECA 12565 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens gene for interleukin-1 receptor antagonist. ACCESSION X64532 NID g33798 KEYWORDS interleukin 1 alpha and beta homologue; interleukin 1 receptor antagonist. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12565) AUTHORS Carrier,M.J. TITLE Direct Submission JOURNAL Submitted (17-FEB-1992) M.J. Carrier, Yamanouchi Research Institute, Littlemore Hospital, Oxford, OX4 4XN, UK REFERENCE 2 (bases 1 to 12565) AUTHORS Lennard,A., Gorman,P., Carrier,M., Griffiths,S., Scotney,H., Sheer,D. and Solari,R. TITLE Cloning and chromosome mapping of the human interleukin-1 receptor antagonist gene JOURNAL Cytokine 4 (2), 83-89 (1992) MEDLINE 92338323 FEATURES Location/Qualifiers source 1..12565 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda EMBL3 genomic library" /chromosome="2" /map="q13-14.1" TATA_signal 5949..5953 exon 5989..6120 /number=1 mRNA join(5989..6120,7952..8040,9418..9530,11029..12386) CDS join(6005..6120,7952..8040,9418..9530,11029..11244) /codon_start=1 /product="Interleukin-1 receptor antagonist" /db_xref="PID:g33799" /db_xref="SWISS-PROT:P18510" /translation="MEICRGLRSHLITLLLFLFHSETICRPSGRKSSKMQAFRIWDVN QKTFYLRNNQLVAGYLQGPNVNLEEKIDVVPIEPHALFLGIHGGKMCLSCVKSGDETR LQLEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESAACPGWFLCTAMEADQPVSLTN MPDEGVMVTKFYFQEDE" intron 6121..7951 /number=1 exon 7952..8040 /number=2 intron 8041..9417 /number=2 exon 9418..9530 /number=3 intron 9531..11028 /number=3 exon 11029..12386 /number=4 BASE COUNT 3217 a 2980 c 3072 g 3294 t 2 others ORIGIN 1 gtcgacctgc aggtcaacgg atctgagagg agagtagctt cttgtagata acagttggat 61 tatataccat gtcctgatcc ccttcatcat ccaggagagc agaggtggtc accctgatag 121 cagcaagcct gggggctgca gcttggtggg tagaggtact caggggtaca gatgtctcca 181 aacctgtcct gctgccttag ggagcttcta 
ataagttgat ggatttggtt aaaattaact 241 tggctacttg gcaggactgg gtcagtgagg accaacaaaa agaagacatc agattatacc 301 ctgggggttt gtatttcttg tgtttctttc tcttctttgt actaaaatat ttacccatga 361 ctgggaaaga gcaactggag tctttgtagc attatcttag caaaaattta caaagtttgg 421 aaaacaatat tgcccatatt gtgtggtgtg tcctgtgaca ctcaggattc aagtgttggc 481 cgaagccact aaatgtgaga tgaagccatt acaaggcagt gtgcacatct gtccacccaa 541 gctggatgcc aacatttcac aaatagtgct tgcgtgacac aaatgcagtt ccaggaggcc 601 caaatgaaaa tgtttgtact gaaatttgtt aaagcttccc gacaaactag atttatcagt 661 aaggattgtt ttctgcaagg gggatgaaac ttgtggggtg agccatttgg gctgaggagg 721 agggaggttg gagctgagaa atgtggagac aatttccctt tagaaggact gaatctccct 781 gcctctctgg ggtgcggcag ccagcaggat ccaatggtgt atatgtctcc ccagctcccc 841 attcagtgat atcatgtcag tagcttgaaa ttatccgtgg tgggagtatt atgtcatgga 901 aattggcaaa tggaaacttt tattggagat tcaattgtta aacttttacc agcacaacac 961 tgccctgcct tcagagtcaa tgaccctatc caagtttaat ccatctgtcc actgtctcca 1021 acacgatctt tataaaacac acctgacaac attacccttt tattcagttt tttaaaagat 1081 aagtttccag ctcatcgggg tggctttaaa ggccatttct cctctggacc tcacccaact 1141 tttcaaatca cttttcctac ccctacctct aaatgctact caaactccag ccatcctgaa 1201 taataagact tttgaaaagt agattatggg ctgggcacag tggctcacac ctgtaatccc 1261 agcactttgg gaggccaaga tgggtggatc acctgaggtc gggagttcga gaccagcctg 1321 actaacatag tgaaaccctg tctctactaa aaatacaaaa ttagttgggg gtggtggcac 1381 aagcctgtaa tcccagctac tcaggaggtt gaggcagggg aattgcttga acctgggagg 1441 cggaggttgc ggtgagccta gattgctcca ctgcactcca gcctgggcaa caagagcgaa 1501 actccatctc aaaaaaataa ataaataaat aaagtagatt acatcagata cctctggcct 1561 aggttgttta tgaccaactc tcctgctgag aataactaga aaagctagac aaaacatatt 1621 tccaaaagat ctctttggag gcatcagaga atggccaagg ctgtaaggaa ctgcctgagc 1681 ccagagaggt ggagcccagc actggtgccc tttactcctg gggacatgtg ctggtttcaa 1741 aaacttcagc tgagcttttg agcattcatg gaacttggtg ggggagatga aatttgtacc 1801 ttaaatcctg cctacaggga gggtccctga taatccccac ccaatttgga aatctgggtc 1861 agccttcaca ggtactgaag ccctcctctg aatgatctca agtcctgcta 
gggtagaggt 1921 tacctgcttt tgaaaggctc ctggcctacc tgtgcagcag gagcaaaagt gaaccatctc 1981 agggtacaga taacaatcat ccagagcctt gaatgacctc tactgtgctt aatatatagt 2041 attcagcagt cagtaaaaag gatttaggca catgcaagat gacctgtgta tcagggagaa 2101 ataggcaata aattgagatc cagcagggat ttgaatcatg gatttgaatc aggggcagcc 2161 ttcgaaagaa ctatggagaa tatactcaga tttaaaacat aagattggaa tttttggcag 2221 agaactaaca actgtacaaa aaaggaacca aatggaaatc ctagaactga aagatgcaat 2281 taaccgatgt tgagaaatag ccaacatcta ttgaacactt cccatgtgga cagctgtgct 2341 aaacacttta caggcatcaa cataagatgt gtccccttac agcagtgcag tgtccctcct 2401 aagacatgga cagcctggtt tccctatctc tctgcttcat caaaacccct ttacgtgggg 2461 cttagacact cctgttgtct ctagtgtcta gtagcacagg gctcagcaca tggaagccac 2521 tagatacaat ttgatgacca ggacctccga tgaaagccat gggtgctgat tgggaaggca 2581 ttgtctttta tgtgctatgg tcttaaagct tcatccagga agcagaactc ggggggtgct 2641 gaggacccag aaccgagaat aagattagtc agagatttcc tgtgggcaga aatcataagg 2701 acgccaactg tttgggtgag ataagacgaa accaagagtg gacttgtggc cagaagcgtg 2761 aggaagaggg agagagcttc ccttgtcccc tttcttcctc tccctaagcc acagtgattg 2821 acagcccccc cgctttggag tcagagcagg cttgagactg gactgggaaa ggagggtggg 2881 tcaggataca gagcaggaag gctgggagtg cagggcagga gcaaggggct ggggcattca 2941 ttgtgcctga tctctcccac tttacctggg gtaaagaagc atatgcaaaa gccacggtgt 3001 gagtatttcc caagtgccag ggtcagggca tgattcatca cgtgcagcat ttcattcaat 3061 ccttatagta accgatgatg tggcttctat tattagctct atcagataat gaaactgaga 3121 ccaagacagg ctctgcacat tgtgtggggt aatgacacag ggggattcag acctagactc 3181 cataactcct gccccaggga ccacccccac cctcaccctg tgcatgtcga caaaggacag 3241 actgggccac ttctcaggac acagcgggga aatgacacag agcagggagg ttccaggagc 3301 cccgagcgtc ttttctccag gagaatactc tctgaattca gactggggtc agagaaacat 3361 ttacccagga gccgcagtgt gggtggggct ttttacttga aacgctgtct gaaggcagtg 3421 gcaggatgaa ctctccaccc taccttggca agccacttct cttctgcaat ctgtaaggac 3481 attgttgaga gaattatggt cttccaattc cggagggttg aagaaagaca aataggagag 3541 aacctatcat agtcaggtgc tagctgcctt ctctttcaga gagtgtgaga ataaagtgat 
3601 acacttgatt attagcaaat actttggaaa ttttaaacgc taatattcaa cacactctgg 3661 aagaggcaaa taagtagaca ggttcatata catcatctcc ttcagctagt cctcacaaaa 3721 acaaacaaat gaataaacaa aattcttctt tggccctcat aggaagacac tgtttcttga 3781 acgtgtttca aaaaggatgg gtgactcact caaggtcaca ctgtttatga ggacagtaca 3841 ggaatacaga catgccattt tgcctgaaaa aatccatcac ccagggaggt gacacaattt 3901 tgcagaaatg ttctatttcc tctgaaggat acattcttta aacctttggg aaattcattc 3961 atagtcttcc tcctttgaag gattactctc tggacacaaa gtgtttgatt ctgatttgtt 4021 ggttggaaga tgtgttggtt gagagaaaga ttctgatttg ttggttgaaa atagactcat 4081 caagatcaac tgctgtagta gtaaatattt tgacattttg tctgtattcc tgtgctgccc 4141 tcacaagctg catcaccttg agtgagtcat tcatactttt ttgtttgttt ttgttttgga 4201 gatggagtct tactctgttg cctaggctgg agtgcggtgg cgtgatcttg gctcactgcg 4261 acctccatct cctgggttca agtgatcctc ctgcctcagc ctcccgagta gctgggatta 4321 caggcacatg ccaccatccc tgctaatttt tgcattttca gtagagacgg agtttcacca 4381 tgttggtcag gttggtcttg aactcctgac ctcaggtgat ccgcccacct cagcctcccc 4441 aagtgctggg attacaggtg tgagccaccg tgcccagccc agccatcatt tttgaaacac 4501 gtttgagaaa tagtgtcttc ctttgagggc caaggagaca ttttttttgt ttatttgttt 4561 gtttttgtga ggactagctg aagggggtga tgtatattaa cctgcctact tatttgcctc 4621 ttcccagagt gtgatgaata ttagggttta aagtttctga agcatttgtt aataaagccc 4681 ggggctggag gtcagaagac ctggatttct ctgcatactt ttgccatcag caagctgtgt 4741 gaccttggac agatcccttt tttgtctaaa tctttctgag tcttcttgaa aacaatgcca 4801 ggttgggaca ggatgattgc caagctcccg tccagctcta aaacactgca acgtatgctt 4861 ctgcaccagc actgtccatc ctgtagatca tgcagaaatt ctcttcaact ttttcctacc 4921 cataaaatag gagcatgctt acctttttcc taatgttcca ggccccgggt ctagatattg 4981 taagtaagga agttaatgtg tatcagagcc cattatgggc cagaagttct cctcttcctt 5041 cctacacctg cttcctccct ccctccctcc ctctttccct tccttccttc catccatttg 5101 tgaagaagac atgatcaccc tcattctgag agtgaagaga cagaggctca actaatgaaa 5161 tgatttgttc aaggtcacac gggtggcaca aggcaagtgg cagaggttga atttagaccc 5221 attcctgtcc aaatgctgag tttatgtcat cgtcccgaga ccataacttt aaagatgtaa 5281 
gatagtggga aaagagttga tttcaaagca cctctcagaa ggactcactt tacatcaggg 5341 gtcagcagac tcaggccaaa tccggtccat tccccgcttt tgcaaagaaa gttgtagtgg 5401 aacacagcta ggcttattga tttatggatt gccaacgtcc ttttgtgaaa cagacagctg 5461 agctgagtaa tcgtggcgca caaaacctaa aatatttact atctcgtcct ttacagaatg 5521 tttgccaatc tatggtccgg agtccaaggc tgtccatttt tcaaagaaca caaagtgaca 5581 tgagactgtc ccatgtgcag ggagccctat cattttatta tgaaaaaacg gcctttctgc 5641 tcaaatctgt tttttaaaaa gtcaacaaac agactctggg tacctgtcag gaacagtagg 5701 gagtttggtt tccattgtgc tcttcttccc aggaactcaa tgaaggggaa atagaaatct 5761 taattttggg gaaattgcac aggggaaaaa ggggagggaa tcagttacaa cactccattg 5821 cgacacttag tggggttgaa agtgacaaca gcaagggttt ctctttttgg aaatgcgagg 5881 agggtatttc cgcttctcgc agtggggcag ggtggcagac gcctagcttg ggtgagtgac 5941 tatttcttta taaaccacaa ctctgggccc gcaatggcag tccactgctt gctgcagtca 6001 cagaatggaa atctgcagag gcctccgcag tcacctaatc actctcctcc tcttcctgtt 6061 ccattcagag acgatctgcc gaccctctgg gagaaaatcc agcaagatgc aagccttcag 6121 gtaaggctac cccaaggagg agaaggtgag ggtggatcag ctggagactg gaaacatatc 6181 acagctgcca gggctgccag gccagagggc ctgagaactg ggtttgggct ggagaggatg 6241 tccattattc aagaaagagg ctgttacatg catgggcttc aggacttgtg tttcaaaata 6301 tcccagatgt ggatagtgcg accggagggc tgtcttactt tcccagagac tcaggaaccc 6361 agtgagtaat agatgcatgc caaggagtgg gactgcgatt caggcctagt tgaatgtgct 6421 gacagagaag cagagagggg caccaggggc acagcccgaa ggcccagact gatatgggca 6481 aggcctgtct gtgctgacat gtcggagggt cccactctcc agggaccttg gtttccccgt 6541 ctgtgacatc tgtgacatga gagtcacgat aactccttgt gtgccttaca gggttgttgt 6601 gaaaattaaa tgcacagata atagcgtaac agtattccgt gcattgtaaa gagcctgaaa 6661 accattatga tttgaaaatg gaatcggctt tgtgagacca tcactattgt aaagatgtga 6721 tgctgataga aatgacagga ctgcttgtgc atgccctctg cagtgtgaca ttccagcagt 6781 gaaatcatgt tggggtgact tctcccccac tctgaccttt atgtttgtct gggccgaggc 6841 tgcaagtcgg gctctgtggg tgtatgagtg acaagtctct cccttccaga tatggggact 6901 gtctgcttcc ctaggttgcc tctccctgct ctgatcagct agaagctcca ggagatcctc 6961 ctggaggccc 
cagcaggtga tgtttatccc tccagactga ggctaaatct agaaactagg 7021 ataatcacaa acaggccaat gctgccatat gcaaagcact ttggtttgcc tggccacccc 7081 tcgtcgagca tgtgggctct tcagagcacc tgatgaggtg ggtacagtta gccacacttc 7141 acaggtgaag aggtgaggca caggtcccag gtcaggctgg ccggagctct gtttattacg 7201 tctcacagct ttgagtcctg ctctcaacca gagaggccct ttaccaagaa gaaaggattg 7261 ggacccagaa tcaggtcact ggctgaggta gagaggaagc cgggttgttc ccaagggtag 7321 ctgctcctgc aggactctga gcaggtcacc agctaatgga ggaaaggctc tagggaaaga 7381 cccttctggt ctcagactca gagcgagtta gctgcaaggt gttccgtctc ttgaaacttc 7441 tacctaggtg ctatggtagc cactagtctc aggtggctat ttaaatttat acttaaatga 7501 atgaaaatag aagaaaattt aaaatccaga cccttggtca cactatccac atttaaagag 7561 gtcaatagcc acatgtggtt agtggccacc ctattgggca gtgcagctac agaacatttt 7621 tgcatcccag aaagttcttt tggatgttgc tgctctacag catgctttgc tgaaacagaa 7681 gtgccttccc tgggaatctc agatgggaag caagtaagga ggggagtcaa atgtgggctc 7741 actgctcacc agctgtgagg gttgggcctg cctcttaacc attgtcagcc tcagtcttct 7801 catccatgca tgccgtgggt atactaaaat actatacccc tggaagagct ggatgcaaat 7861 ttgacaagtt ctgggggaca caggaaggtg ccaagcacaa ggctgggcac atggtggctg 7921 tgcactacag ctgagtcctt ttccttttca gaatctggga tgttaaccag aagaccttct 7981 atctgaggaa caaccaacta gttgctggat acttgcaagg accaaatgtc aatttagaag 8041 gtgagtggtt gccaggaaag ccaatgtatc tgggcatcac gtcactttgc ccgtctgtct 8101 gcagcagcat ggcctgcctg cacaaaccct aggtgcaatg tcctaatcct tgttgggtct 8161 ttgtattcaa gtttgaagct gggagggcct ggctactgaa gggcacatat gagggtagcc 8221 tgaagagggt gtggagaggt agagtctagg tcagaggtca gtgcctatag gcaagtggtc 8281 ccagggccac agctgggaag ggcaaatacc agaaggcaag gttgaccatt cccttcctca 8341 agtgcctatt aaggctccat gttcctatgt tgttcaaacc ctaactcaat cccaaattaa 8401 tccaccatgt ataaggttga gctatgtctc ttattcctgg acaccatact cagccatatc 8461 tggtccacac attaacagct ggatgacctt gaagaagctt cacccactct gttcctcagc 8521 tttcccttca gtgggatgat atcaactgga caacaggatg tgcgattctt ttagttccag 8581 ccttccagga tgttttcact cccctgtttg ttgttgtagg atggtattac ctccaccttc 8641 ccaccttccc tatgccctgg 
ttctgtctcc tgtgcctcgc tctgaaagtg gatgagacct 8701 acaattcctg tcctggtagt tctcctaatg aacacactga agcacgagga agctgagatt 8761 tttgttgcta catgagagca tggaggcctc ttagggagag aggaggttca gagactccta 8821 ggctcctggt ggagccccac tcatggcctt gttcattttc cctgcccctc agcaacactc 8881 ctattgacct ggagcacagg tatcctgggg aaagtgaggg aaatatggac atcacatgga 8941 acaacatcca ggagactcag gcctctagga gtaactgggt agtgtgcatc ctggggaaag 9001 tgagggaaat atggacatca catggaacaa catccaggag actcaggcct ctaggagtaa 9061 ctgggtagtg tgcatcctgg ggaaagtgag ggaaatatgg acatcacatg gaacaacatc 9121 caggagactc aggcctctag gagtaactgg gtagtgtgca tcctggggaa agtgagggaa 9181 atatggacat cacatggaac aacatccagg agactcaggc ctctaggagt aactgggtag 9241 tgtgcttggt ttaatcttct atttacctgc agaccaggaa gatgagacct ctctgccctt 9301 ctgacctcgg gattttagtt ttgtggggac caggggagat agaaaaatac ccggggtctc 9361 ttcattattg ctgcttcctc ttctattaac ctgaccctcc cctctgttct tccccagaaa 9421 agatagatgt ggtacccatt gagcctcatg ctctgttctt gggaatccat ggagggaaga 9481 tgtgcctgtc ctgtgtcaag tctggtgatg agaccagact ccagctggag gtaaaaacat 9541 gctttggatc tcaaatcacc ccaaaaccca gtggcttgaa acaaccaaaa ttttttctta 9601 tgattctgtg ggttgaccag gattagctgg gtagttctgt tccatgtggt ggaacatgct 9661 ggggtcactt tggaagctgc attcagcaga gtgccaggct tgcgctgggc atccaaggtg 9721 gtccctcatc ctccaggctc tctttccatg tgatctctca gtgtttaaga gttagttgga 9781 gcttccttac agcatggcgg ctgacttcca aaagggatta ttccaaaaag agcctcaaca 9841 tgcaggcgct tattatgact tctgcttgca tcatcctatt ggccaaagcc agtcacgtgg 9901 ctaagtctag ccccctgtga gaggagactg cataagagtg tgaacaccag gagacacggt 9961 cactgggggc caccactgta accatctacc acaggacctg aatctctgtg tgctactccc 10021 ttgctcaagg gcccccctac ccacgcagac ctgctgtctt ctagcaaagc ccatcctcag 10081 gacctttctc ttccaatcct tattgactca aattgattag ttggtgctcc acccagagcc 10141 ctgtgctcct ttatctcatg taatgttaat gggtttccca gccctgggaa aacatggctt 10201 tgtctcaggg gcttgctgga tgcaacctta acctcaatgt gagtggccat actgtggcac 10261 tgtcccatcc ctcaccaggg acactgttct ggagggtgac tgcctgttct gtgaggagtg 10321 gggatggcta ggacattgca 
tggaacacac caccacccca tcttctcaga gctcaaaccc 10381 tgacagaaca ccagctccac aggccttggc ttctgctgat ggtgccgtgt atttaccaga 10441 cttagtggtc caaggccaga gtggcagatt tcccaaagtc aaggtgtgac agtgggacag 10501 cctctttgtg tctttgctgt cctaagaaac ctgggccagg ccaggcgcag tggctcacgc 10561 cttgtaatcc cagcactttg agaggccaag gtgggcagat cacgaggtca ggagtttgag 10621 accagcctgg ccaacattgg tgaaaccctg tctctattaa aaatagaaaa cattagacag 10681 gtgtggtggt gcatgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 10741 gaacccagga ggtggaggtt gcagtgagcc gagattgtgc cactgcactc cagcctaggc 10801 gacagagcaa gactccgtct cgggaaaatt aattaataaa taaataaacc taggtcccag 10861 agtcccacag aatggcagac aggagcacct gggggctttt agggtatggc atttcccctg 10921 tactaactct gggctgtcca gaggcgattt catggcgtgg agtggagagg gaggcagcac 10981 aggacttcct aggcctcagc tctcacctgc ccatcttttg atttccaggc agttaacatc 11041 actgacctga gcgagaacag aaagcaggac aagcgcttcg ccttcatccg ctcagacagt 11101 ggccccacca ccagttttga gtctgccgcc tgccccggtt ggttcctctg cacagcgatg 11161 gaagctgacc agcccgtcag cctcaccaat atgcctgacg aaggcgtcat ggtcaccaaa 11221 ttctacttcc aggaggacga gtagtactgc ccaggcctgc ctgttcccat tcttgcatgg 11281 caaggactgc agggactgcc agtccccctg ccccagggct cccggctatg ggggcactga 11341 ggaccagcca ttgaggggtg gaccctcaga aggcgtcaca acaacctggt cacaggactc 11401 tgcctcctct tcaactgacc agcctccatg ctgcctccag aatggtcttt ctaatgtgtg 11461 aatcagagca cagcagcccc tgcacaaagc ccttccatgt cgcctctgca ttcaggatca 11521 aaccccgacc acctgcccaa cctgctctcc tcttgccact gcctcttcct ccctcattcc 11581 accttcccat gccctggatc catcaggcca cttgatgacc cccaaccaag tggctcccac 11641 accctgtttt acaaaaaaga aaagaccagt ccatgaggga ggtttttaag ggtttgtgga 11701 aaatgaaaat taggatttca tgattttttt ttttcagtcc ccgtgaagga gagcccttca 11761 tttggagatt atgttctttc ggggagaggc tgaggactta aaatattcct gcatttgtga 11821 aatgatggtg aaagtaagtg gtagcttttc ccttcttttt cttctttttt tgtgatgtcc 11881 caacttgtaa aaattaaaag ttatggtact atgttagccc cataattttt tttttccttt 11941 taaaacactt ccataatctg gactcctctg tccaggcact gctgcccagc ctccaagctc 12001 
catctccact ccagattttt tacagctgcc tgcagtactt tacctcctat cagaagtttc 12061 tcagctccca aggctctgag caaatgtggc tcctgggggt tctttcttcc tctgctgaag 12121 gaataaattg ctccttgaca ttgtagagct tctggcactt ggagacttgt atgaaagatg 12181 gctgtgcctc tgcctgtctc cccaccaggc tgggagctct gcagagcagg aaacatgact 12241 cgtatatgtc tcaggtccct gcagggccaa gcacctagcc tcgctcttgg caggtactca 12301 gcgaatgaat gctgtatatg ttgggtgcaa agttccctac ttcctgtgac ttcagctctg 12361 ttttacaata aaatcttgaa aatgcctata ttgttgacta tgtccttggc cttgacaggc 12421 tttgggtata gagtgctgag gaaactgaaa gaccaatgtg tyttycttac cccagaggct 12481 ggcgcctggc ctcttctctg agagttcttt tcttccttca gcctcactct ccctggataa 12541 catgagagca aatctctctg cgggg // LOCUS HSINSU 4992 bp DNA PRI 30-MAR-1995 DEFINITION Human gene for preproinsulin, from chromosome 11. Includes a highly polymorphic region upstream from the insulin gene containing tandemly repeated sequences. ACCESSION V00565 NID g33930 KEYWORDS germ line; insulin; repetitive sequence; signal peptide; tandem repeat. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1925 to 3715) AUTHORS Bell,G.I., Pictet,R.L., Rutter,W.J., Cordell,B., Tischer,E. and Goodman,H.M. TITLE Sequence of the human insulin gene JOURNAL Nature 284 (5751), 26-32 (1980) MEDLINE 80120725 REFERENCE 2 (bases 1928 to 3651) AUTHORS Ullrich,A., Dull,T.J., Gray,A., Brosius,J. and Sures,I. TITLE Genetic variation in the human insulin gene JOURNAL Science 209 (4456), 612-615 (1980) MEDLINE 80236313 REFERENCE 3 (bases 1 to 2227) AUTHORS Bell,G.I., Selby,M.J. and Rutter,W.J. TITLE The highly polymorphic region near the human insulin gene is composed of simple tandemly repeating sequences JOURNAL Nature 295 (5844), 31-35 (1982) MEDLINE 82125365 REFERENCE 4 (bases 1 to 4992) AUTHORS Bell,G.I. 
TITLE Direct Submission JOURNAL Submitted (01-APR-1982) to the EMBL/GenBank/DDBJ databases COMMENT This entry is assembled from and replaces the previous entries and , and contains other new data. Some sequence and feature data have been adapted from the Los Alamos sequence data base entry HUMINS1. The immediate translation product of the gene is preproinsulin. The signal peptide facilitates membrane transit of the insulin precursor, and is cleaved off in the process. In the resulting proinsulin molecule, the peptide chains A and B are joined by the connecting peptide C, which is believed to help in the formation of the disulphide bridges required for insulin. (See [1].). FEATURES Location/Qualifiers source 1..4992 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1340..1823 /note="polymorphic region (tandem repeats)" conflict 2101 /note="G is AGG in [2]" /citation=[2] misc_feature 2156..2161 /note="TATAAA (Hogness) box" mRNA 2186..2227 /note="preproinsulin mRNA (part 1)" precursor_RNA 2186..3615 /note="preproinsulin primary transcript" intron 2228..2406 /note="intron" allele 2401 /note="T can be A (see [2])" mRNA 2407..2610 /note="preproinsulin mRNA (part 2)" CDS join(2424..2610,3397..3542) /codon_start=1 /product="preproinsulin" /db_xref="PID:g758088" /db_xref="SWISS-PROT:P01308" /translation="MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCG ERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSL YQLENYCN" sig_peptide 2424..2495 /note="peptide pre (signal peptide)" mat_peptide 2496..2585 /note="proinsulin peptide B" mat_peptide join(2586..2610,3397..3476) /note="proinsulin peptide C (part 1) (2610 is 1st base in codon)" intron 2611..3396 /note="intron" conflict 3068 /note="G is GG in [2]" /citation=[2] allele 3229 /note="G can be C (see [2])" mRNA 3397..3615 /note="preproinsulin mRNA (part 3)" mat_peptide 3477..3539 /note="proinsulin peptide A" allele 3551 /note="T can be C (see [2])" allele 3564 /note="A can be C (see [2])" conflict 3638..3639 /note="TT 
is T in [2]" /citation=[2] unsure 4381..4441 /note="sequence unconfirmed" BASE COUNT 849 a 1553 c 1755 g 835 t ORIGIN 1 ctcgaggggc ctagacattg ccctccagag agagcaccca acaccctcca ggcttgaccg 61 gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc 121 tgggacacca gctggccttc aaggtctctg cctccctcca gccaccccac tacacgctgc 181 tgggatcctg gatctcagct ccctggccga caacactggc aaactcctac tcatccacga 241 aggccctcct gggcatggtg gtccttccca gcctggcagt ctgttcctca cacaccttgt 301 tagtgcccag cccctgaggt tgcagctggg ggtgtctctg aagggctgtg agcccccagg 361 aagccctggg gaagtgcctg ccttgcctcc ccccggccct gccagcgcct ggctctgccc 421 tcctacctgg gctcccccca tccagcctcc ctccctacac actcctctca aggaggcacc 481 catgtcctct ccagctgccg ggcctcagag cactgtggcg tcctggggca gccaccgcat 541 gtcctgctgt ggcatggctc agggtggaaa gggcggaagg gaggggtcct gcagatagct 601 ggtgcccact accaaacccg ctcggggcag gagagccaaa ggctgggtgt gtgcagagcg 661 gccccgagag gttccgaggc tgaggccagg gtgggacata gggatgcgag gggccggggc 721 acaggatact ccaacctgcc tgcccccatg gtctcatcct cctgcttctg ggacctcctg 781 atcctgcccc tggtgctaag aggcaggtaa ggggctgcag gcagcagggc tcggagccca 841 tgccccctca ccatgggtca ggctggacct ccaggtgcct gttctgggga gctgggaggg 901 ccggaggggt gtaccccagg ggctcagccc agatgacact atgggggtga tggtgtcatg 961 ggacctggcc aggagagggg agatgggctc ccagaagagg agtgggggct gagagggtgc 1021 ctggggggcc aggacggagc tgggccagtg cacagcttcc cacacctgcc cacccccaga 1081 gtcctgccgc cacccccaga tcacacggaa gatgaggtcc gagtggcctg ctgaggactt 1141 gctgcttgtc cccaggtccc caggtcatgc cctccttctg ccaccctggg gagctgaggg 1201 cctcagctgg ggctgctgtc ctaaggcagg gtgggaacta ggcagccagc agggagggga 1261 cccctccctc actcccactc tcccaccccc accaccttgg cccatccatg gcggcatctt 1321 gggccatccg ggactgggga caggggtcct ggggacaggg gtccggggac agggtcctgg 1381 ggacaggggt gtggggacag gggtctgggg acaggggtgt ggggacaggg gtgtggggac 1441 aggggtctgg ggacaggggt gtggggacag gggtccgggg acaggggtgt ggggacaggg 1501 gtctggggac aggggtgtgg ggacaggggt gtggggacag gggtctgggg acaggggtgt 1561 ggggacaggg gtcctgggga caggggtgtg gggacagggg 
tgtggggaca ggggtgtggg 1621 gacaggggtg tggggacagg ggtcctgggg ataggggtgt ggggacaggg gtgtggggac 1681 aggggtcccg gggacagggg tgtggggaca ggggtgtggg gacaggggtc ctggggacag 1741 gggtctgagg acaggggtgt gggcacaggg gtcctgggga caggggtcct ggggacaggg 1801 gtcctgggga caggggtctg gggacagcag cgcaaagagc cccgccctgc agcctccagc 1861 tctcctggtc taatgtggaa agtggcccag gtgagggctt tgctctcctg gagacatttg 1921 cccccagctg tgagcaggga caggtctggc caccgggccc ctggttaaga ctctaatgac 1981 ccgctggtcc tgaggaagag gtgctgacga ccaaggagat cttcccacag acccagcacc 2041 agggaaatgg tccggaaatt gcagcctcag cccccagcca tctgccgacc cccccacccc 2101 gccctaatgg gccaggcggc aggggttgac aggtagggga gatgggctct gagactataa 2161 agccagcggg ggcccagcag ccctcagccc tccaggacag gctgcatcag aagaggccat 2221 caagcaggtc tgttccaagg gcctttgcgt caggtgggct cagggttcca gggtggctgg 2281 accccaggcc ccagctctgc agcagggagg acgtggctgg gctcgtgaag catgtggggg 2341 tgagcccagg ggccccaagg cagggcacct ggccttcagc ctgcctcagc cctgcctgtc 2401 tcccagatca ctgtccttct gccatggccc tgtggatgcg cctcctgccc ctgctggcgc 2461 tgctggccct ctggggacct gacccagccg cagcctttgt gaaccaacac ctgtgcggct 2521 cacacctggt ggaagctctc tacctagtgt gcggggaacg aggcttcttc tacacaccca 2581 agacccgccg ggaggcagag gacctgcagg gtgagccaac cgcccattgc tgcccctggc 2641 cgcccccagc caccccctgc tcctggcgct cccacccagc atgggcagaa gggggcagga 2701 ggctgccacc cagcaggggg tcaggtgcac ttttttaaaa agaagttctc ttggtcacgt 2761 cctaaaagtg accagctccc tgtggcccag tcagaatctc agcctgagga cggtgttggc 2821 ttcggcagcc ccgagataca tcagagggtg ggcacgctcc tccctccact cgcccctcaa 2881 acaaatgccc cgcagcccat ttctccaccc tcatttgatg accgcagatt caagtgtttt 2941 gttaagtaaa gtcctgggtg acctggggtc acagggtgcc ccacgctgcc tgcctctggg 3001 cgaacacccc atcacgcccg gaggagggcg tggctgcctg cctgagtggg ccagacccct 3061 gtcgccagcc tcacggcagc tccatagtca ggagatgggg aagatgctgg ggacaggccc 3121 tggggagaag tactgggatc acctgttcag gctcccactg tgacgctgcc ccggggcggg 3181 ggaaggaggt gggacatgtg ggcgttgggg cctgtaggtc cacacccagt gtgggtgacc 3241 ctccctctaa cctgggtcca gcccggctgg agatgggtgg gagtgcgacc 
tagggctggc 3301 gggcaggcgg gcactgtgtc tccctgactg tgtcctcctg tgtccctctg cctcgccgct 3361 gttccggaac ctgctctgcg cggcacgtcc tggcagtggg gcaggtggag ctgggcgggg 3421 gccctggtgc aggcagcctg cagcccttgg ccctggaggg gtccctgcag aagcgtggca 3481 ttgtggaaca atgctgtacc agcatctgct ccctctacca gctggagaac tactgcaact 3541 agacgcagcc tgcaggcagc cccacacccg ccgcctcctg caccgagaga gatggaataa 3601 agcccttgaa ccagccctgc tgtgccgtct gtgtgtcttg ggggccctgg gccaagcccc 3661 acttcccggc actgttgtga gcccctccca gctctctcca cgctctctgg gtgcccacag 3721 gtgccaacgc caggcaggcc cagcatgcag tggctctccc caaagcggcc atgcctgttg 3781 gctgcctgct gcccccaccc tgtggctcag ggtccagtat gggagcttcg ggggtctctg 3841 aggggccagg gatggtgggg ccactgagaa gtgactctgt cagtagccga cctggagtcc 3901 ccagagacct tgttcaggaa agggaatgag aacattccag caattttccc cccacctagc 3961 cctcccaggt tctattttta gagttatttc tgatggagtc cctgtggagg gaggaggctg 4021 ggctgaggga gggggtcctg cagggcgggg ggctgggaag gtggggagag gctgccgaga 4081 gccacccgct atccccagct ctgggcagcc ccgggacagt cacacaccct ggcctcgcgg 4141 cccaagctgg cagccgtctg cagccacagc ttatgccagc ccaggtccag ccagacacct 4201 gagggaccca ctggtgcctt ggaggaagca ggagaggtca gatggcacca tgagctgggg 4261 caggtgcagg gaccgtggca gcacctggca gggcctcaga acccatgcct tgggcacccc 4321 ggccatgagg ccctgaggat tgcagcccaa gagaagcagg gaacgccagg gccacagggg 4381 cagagaccag gccagggtcc cttgcggccc ttagcccacc ccctcccagt aagcaggggc 4441 tgcttggcta ggcttccttt tgctacagac ctgctgctca cccagaggcc cacgggccct 4501 agtgacaagg tcgttgtggc tccaggtcct tgggggtcct gacacagagc ctcttctgca 4561 gcacccctga ggacagggtg ctccgctggg cacccagcct agtgggcaga cgagaaccta 4621 ggggctgcct gggcctactg tggcctggga ggtcagcggg tgaccctagc taccctgtgg 4681 ctgggccagt ctgcctgcca cccaggccaa accaatctgc acctttcctg agagctccac 4741 ccagggctgg gctggggatg gctgggcctg gggctggcat gggctgtggc tgcagaccac 4801 tgccagcttg ggcctcgagg ccaggagctc accctccagc tgccccgcct ccagagtggg 4861 ggccagggct gggcaggcgg gtggacggcc ggacactggc cccggaagag gagggaggcg 4921 gtggctggga tcggcagcag ccgtccatgg gaacacccag ccggccccac tcgcacgggt 
4981 agagacaggc gc // LOCUS HSINT1G 4522 bp DNA PRI 03-JAN-1991 DEFINITION Human int-1 mammary oncogene. ACCESSION X03072 NID g33935 KEYWORDS int-1 oncogene; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4522) AUTHORS van Ooyen,A., Kwee,V. and Nusse,R. TITLE The nucleotide sequence of the human int-1 mammary oncogene; evolutionary conservation of coding and non-coding sequences JOURNAL EMBO J. 4 (11), 2905-2909 (1985) MEDLINE 86055728 COMMENT Data kindly reviewed (15-JUN-1986) by R. Nusse. FEATURES Location/Qualifiers source 1..4522 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 259..263 /note="pot. TATA-box" CDS join(465..568,1282..1535,2238..2503,2966..3454) /codon_start=1 /product="int-1 protein" /db_xref="PID:g33936" /db_xref="SWISS-PROT:P04628" /translation="MGLWALLPGWVSATLLLALAALPAALAANSSGRWWGIVNVASST NLLTDSKSLQLVLEPSLQLLSRKQRRLIRQNPGILHSVSGGLQSAVRECKWQFRNRRW NCPTAPGPHLFGKIVNRGCRETAFIFAITSAGVTHSVARSCSEGSIESCTCDYRRRGP GGPDWHWGGCSDNIDFGRLFGREFVDSGEKGRDLRFLMNLHNNEAGRTTVFSEMRQEC KCHGMSGSCTVRTCWMRLPTLRAVGDVLRDRFDGASRVLYGNRGSNRASRAELLRLEP EDPAHKPPSPHDLVYFEKSPNFCTYSGRLGTAGTAGRACNSSSPALDGCELLCCGRGH RTRTQRVTERCNCTFHWCCHVSCRNCTHTRVLHECL" intron 569..1281 /note="intron I" intron 1536..2237 /note="intron II" intron 2504..2965 /note="intron III" misc_feature 4410..4415 /note="pot. 
polyadenylation signal" BASE COUNT 805 a 1523 c 1320 g 874 t ORIGIN 1 cagctgagtg aggcgggcgc gcgtgggagg gtgtcccaag gggaggggtc cgcggccagt 61 gcaggcccgg aggcgggggc caccgggcag ggggcggggg tgagccccga cggccaaccc 121 gtcagctctc ggctcagacg ggcgggaacc acagccccgc tcgctgccca ttgtctgcgc 181 ccctaaccgg tgcgccctgg tgccacagtg cggcccggag gggcagcctc ctcccgtcac 241 ttcagccagc gccgcaacta taagaggcgg tgccgcccgc cgtggccgcc tcagcccacc 301 agccgggacc gcgagccatg ctgtccgccg cccgccccca gggttgttaa agccagactg 361 cgaactctcg ccactgccgc caccgccgcg tcccgtccca ccgtcgcggg caacaaccaa 421 agtcgccgca actgcagcac agagcgggca aagccaggca ggccatgggg ctctgggcgc 481 tgttgcctgg ctgggtttct gctacgctgc tgctggcgct ggccgctctg cccgcagccc 541 tggctgccaa cagcagtggc cgatggtggt aagtgagctg gtgcggggtc gccacttgtc 601 ccgcggcaca gagccagggg ccaaccctac ccagctccca cgctctggga tccgtctgcc 661 gacaggctcc ctccccgctc tgacttccct ccgcgacacc gaagggcgat ctggcatgaa 721 actgccccag actccagctc tgtacaagtg gggcgaatga tccgcccgcg gaggcctaag 781 ataccccagg cagggagccc actctcatct agcaccgccc ttcccctttg agcgccaact 841 ccagcctcac ggcggtggct caccacaggt ttccccacct cgggaagtga agggccagga 901 gttcgcctag aaaggagggg agaagagggt gggactccta agcatttcac gccttgggtg 961 ggcaagaact gcaggccatg attatctcgc tcaggctgac cggaagaggc tcggagatcc 1021 aaggtagaca ctcggtctcc gggtacctcc tctgtccagt ctccggacct agggctcagg 1081 cgagcagccc tgggactact gggcacacac aagtctggac gcccagttct ttcaaattag 1141 tgagcctggg agagcgggta ttattaatct cccgccattc tctccagcca cataccccca 1201 ggaagaggac cgggtggcac agtttttatg gttagggtgc ggatcccctt cctgagcctg 1261 agctatcata cgtcccacca ggggtattgt gaacgtagcc tcctccacga acctgcttac 1321 agactccaag agtctgcaac tggtactcga gcccagtctg cagctgttga gccgcaaaca 1381 gcggcgcctg atacgccaaa atccggggat cctgcacagc gtgagtgggg ggctgcagag 1441 tgccgtgcgc gagtgcaagt ggcagttccg gaatcgccgc tggaactgtc ccactgctcc 1501 agggccccac ctcttcggca agatcgtcaa ccgaggtggg tgcccaggaa ggcgacgctt 1561 ccgggagcag gggaaacgcg gggtcacccc cagggcatgg gcgggcgagt tcagagaagg 1621 tgtcccaggc gcctggaggg 
tcacacaatc aaccttgcca agtgcctcgt gcccagcgcc 1681 agctcggggc cagacttcta ccaggcgttt tccagccgtg caccctggaa acgaagctta 1741 acttttctga gctactgccc cagataaaga aagtttcggg tcgcggacgc cggctgaccg 1801 ccgctttccc ccagcctctc tcaaaagcgc ctgggaagct gctctctgca ggcgtgtgtc 1861 tggcctctcg cccagcaagg cttgcaccgc caaaatgggc cgaaagtttt gggctgcgaa 1921 gaagtcttgg ggatgtatgg ttcttccgct cccctctctt cggtttgtct ctctggggct 1981 gctccacttc cgctatcgag ccaaaatgcg ccctagaatc tcccagtaag gtgtgattac 2041 gcccgtggac gtggctgcgt gcccacgcac ctgctttctc tactagccct agagaccagc 2101 tttccagcac tgccggccct ggtcctcagg actcaaagtg cggagtcggg ggtgggattc 2161 cggtcccaag cccttcatga gggtgctggc cgcgccccgc gtaccccctc gctgatcccc 2221 gctcccttct cccacaggct gtcgagaaac ggcgtttatc ttcgctatca cctccgccgg 2281 ggtcacccat tcggtggcgc gctcctgctc agaaggttcc atcgaatcct gcacgtgtga 2341 ctaccggcgg cgcggccccg ggggccccga ctggcactgg gggggctgca gcgacaacat 2401 tgacttcggc cgcctcttcg gccgggagtt cgtggactcc ggggagaagg ggcgggacct 2461 gcgcttcctc atgaaccttc acaacaacga ggcaggccgt acggtgagct ttgagaggct 2521 ccgcacccta agcggagcgg caggggccaa cctcgggctg gggaagtgac ggtcggtgag 2581 ataaggcaag gggcaccagg agagggcgtc ctgggagagc cggaggcttg gaacgaagac 2641 ggagaataga ggagacagtg gctgagggca aaggtatgtc tggcccgcgg acaggtagaa 2701 gaggttgcaa atcaagcaca gtctcttcgc tgtacagatt cgaaaaataa gcctgagagg 2761 ccgagactga ctcgccgcgg cggagcaggg ttgggcaggg tttccaaatc tcagcggaac 2821 atttcgcgcc tcccttcccc tgggctcagc taggcctggg cctttgctga ggtccggccc 2881 ccgtggcgtc cgggagaggg cagtgtctgg gagggtgact ctggcccggt gccctgggac 2941 actctttctt cccctatccc cgcagaccgt attctccgag atgcgccagg agtgcaagtg 3001 ccacgggatg tccggctcat gcacggtgcg cacgtgctgg atgcggctgc ccacgctgcg 3061 cgccgtgggc gatgtgctgc gcgaccgctt cgacggcgcc tcgcgcgtcc tgtacggcaa 3121 ccgcggcagc aaccgcgctt cgcgagcgga gctgctgcgc ctggagccgg aagacccggc 3181 ccacaaaccg ccctcccccc acgacctcgt ctacttcgag aaatcgccca acttctgcac 3241 gtacagcgga cgcctgggca cagcaggcac ggcagggcgc gcctgtaaca gctcgtcgcc 3301 cgcgctggac ggctgcgagc tgctctgctg 
cggcaggggc caccgcacgc gcacgcagcg 3361 cgtcaccgag cgctgcaact gcaccttcca ctggtgctgc cacgtcagct gccgcaactg 3421 cacgcacacg cgcgtactgc acgagtgtct gtgaggcgct gcgcggactc gcccccagga 3481 acgctctcct cgagccctcc cccaaacaga ctcgctagca ctcaagaccc ggttattcgc 3541 ccacccgagt acctccagtc acactccccg cggttcatac gcatcccatc tctcccactt 3601 cctcctacct ggggactcct caaaccactt gcctggggcg gcatgaaccc tcttgccatc 3661 ctgatggacc tgccccggac ctaacctccc tccctctccg cgggagaccc cttgttgcac 3721 tgccccctgc ttggccagga ggtgagagaa ggatgggtcc cctccgccat ggggtcggct 3781 cctgatggtg tcattctgcc tgctccatcg cgccagcgac ctctctgcct ctcttcttcc 3841 cctttgtcct gcgttttctc cgggtcctcc taagtccctt cctattctcc tgccatgggt 3901 gcagaccctg aacccacacc tgggcatcag ggcctttctc ctccccacct gtagctgaag 3961 caggaggtta cagggcaaaa gggcagctgt gatgatgtgg gaatgaggtt gggggaacca 4021 gcagaaatgc ccccattctc ccagtctctg tcgtggagcc attgaacagc tgtgagccat 4081 gcctccctgg gccacctcct accccttcct gtcctgcctc ctcatcagtg tgtaaataat 4141 ttgcactgaa acgtggatac agagccacga gtttggatgt tgtaaataaa actatttatt 4201 gtgctgggtc ccagcctggt ttgcaaagac cacctccaac ccaacccaat ccctctccac 4261 tcttctctcc tttctccctg cagccttttc tggtccctct tctctcctca gtttctcaaa 4321 gatgcgtttg cctcctggaa tcagtatttc cttccactgt agctattagc ggctcctcgc 4381 ccccaccagt gtagcatctt cctctgcaga ataaaatctc tatttttatc gatgacttgg 4441 tggcttttcc ttgaatccag aacacaacct tgtttgtggt gtcccctatc ctcccctttt 4501 accactccca gcttggaagc tt // LOCUS HSINT2 11608 bp DNA PRI 25-JUN-1997 DEFINITION Human int-2 proto-oncogene. ACCESSION X14445 NID g33937 KEYWORDS growth factor; int-2 gene; proto-oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11608) AUTHORS Dickson,C. 
TITLE Direct Submission JOURNAL Submitted (27-FEB-1989) Dickson C., Imperial Cancer Research Fund, P O Box 123, Lincolns Inn Fields, London, WC2A 3PX, ENGLAND REFERENCE 2 (bases 1 to 11608) AUTHORS Brookes,S., Smith,R., Casey,G., Dickson,C. and Peters,G. TITLE Sequence organization of the human int-2 gene and its expression in teratocarcinoma cells JOURNAL Oncogene 4 (4), 429-436 (1989) MEDLINE 89239468 COMMENT tissue=placenta; library=cosmid; clone=C1; Data kindly reviewed (04-Jul-1989) by Smith R. FEATURES Location/Qualifiers source 1..11608 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="q13" misc_feature 445 /note="pot. transcription start site" misc_feature 479 /note="pot. alt. transcription start site" misc_feature 507 /note="pot. alt. transcription start site" misc_feature 527 /note="pot. alt. transcription start site" CDS join(936..1155,3445..3548,9197..9592) /codon_start=1 /product="int-2 preprotein" /db_xref="PID:g312409" /db_xref="SWISS-PROT:P11487" /translation="MGLIWLLLLSLLEPGWPAAGPGARLRRDAGGRGGVYEHLGGAPR RRKLYCATKYHLQLHPSGRVNGSLENSAYSILEITAVEVGIVAIRGLFSGRYLAMNKR GRLYASEHYSAECEFVERIHELGYNTYASRLYRTVSSTPGARRQPSAERLWYVSVNGK GRPRRGFKTRRTQKSSLFLPRVLDHRDHEMVRQLQSGLPRPPGKGVQPRRRRQKQSPD NLEPSHVQASRLGSQLEASAH" sig_peptide 936..986 exon <936..1155 /number=1 mat_peptide join(987..1155,3445..3548,9197..9589) /product="int-2 protein" intron 1156..3444 /number=1 exon 3445..3548 /number=2 intron 3549..9196 /number=2 exon 9197..9929 /number=3 polyA_signal 9911..9916 /note="pot.
polyA signal" polyA_site 9929 /note="polyA site" BASE COUNT 2166 a 3822 c 3181 g 2439 t ORIGIN 1 ggatcctccg aggcctgcca gacgagagct aaccccacca ctccgggtgc ccagttggtg 61 tgcgcaaaag gagatagcat ctccagaccc ggcctgcccc gcgccttgca aggctgagga 121 cagcgccact gctcctgcag gaagcgccgg gcgcagacac aaacccggag ctccccacgc 181 gtgcccgcgc cccggagcct ccccgccgcc gccctccgcg gtccccttca ttatgcgccg 241 cctttaatgg gcgtttgtca gctcgacttc cccgcaagtt gttttcacgg acatcagtca 301 tcggcggcgg ccccattgtg cagggggatt gatggggagc gggagggggt gacagggcct 361 ggggcggcct cctcaaggcc tcggtctata attttcggag gcataattgg tctgggggag 421 ggggcggggg aggggcgggt aggggacctt tcagagccag gagggctttc gggggcgtgg 481 ggcgcgctgc ggagcggagc cgcggctcga cggcggtgcg ctggcggcga gtgtatgcag 541 acggcgcccg gcccgaaccc cgagccccgc ggggctcccc acccgccggc ctcccgcccc 601 tcccgcgcct ccgcctgggg accacgtcgg ccttttgttg gcgaaccgtc ctttctttca 661 gcgctttgcg cagcaacgga aatttcattg ctcctgggtg gaaattaaag ggactcgcgt 721 tccctctctc cctctccctc tcccactctc cctctctttc tctctctcgc ccacccttcc 781 cccttcttcc cccacctttc ccgcgaagcc ggagtcagca tctccaggcg cgggatcccg 841 ctccgagcac ctcgcagctg tccggctgcc gccccttcca tgggcgccgc gctcgcctgc 901 agccgccgcc gccgcggggc gggcgcgatg ccacgatggg cctaatctgg ctgctactgc 961 tcagcctgct ggagcccggc tggcccgcag cgggccctgg ggcgcggttg cggcgcgatg 1021 cgggcggccg tggcggcgtc tacgagcacc ttggcggggc gccccggcgc cgcaagctct 1081 actgcgccac gaagtaccac ctccagctgc acccgagcgg ccgcgtcaac ggcagcctgg 1141 agaacagcgc ctacagtgag tgccggacgc tgcggggccc cgggggaagc ggcgccggga 1201 ggggtcgggc ccgggagaag gcgcgctgcg gggccccgcg ggggaggcgg cgccggggag 1261 gggtctggcc cgggagaagg catggtgcgc ccggggtgtc cgggaaaaga ccgtctgcct 1321 cccgctgcca gaaggggaat gccagggtgc cctcctcaac ctacacgtcc gggagaagag 1381 cgcgcgctgg ggttcgagca gaactcgggg gcatgggcgt tcacgtccga agagggcgcc 1441 ggcggctgtc agagtccgtc cccggtccgg cccgctggtc tgaggggacg cgagctaggc 1501 gaccccgggg ctccaggctc tgctgctttg ggcagctctg acaagtttaa gccctttgaa 1561 tgtgtcggag aaaaggagcc ggacaaggca tttatcatct tctttattat tttgacgact 1621 
tctcctcctc ctgccgaccc ctggacgccc cgccgtcccc tccgccctcc ccgctcgcgg 1681 ctgcccaggt ggccgccccc gctgctgcct ctgcgcggag gactcttgcc ctgcggagct 1741 cggttccctg gccgcggccg ccaccgacag ttttccccgc gctggaatct gcacctcccc 1801 cgcctccgcc ctggccgcac agcgacagga gggtggaggc cctcgccgtc gggacgtgcg 1861 gatcacgagc gcggaacggt gtccgcccgg gcctgggggt gcagacacac acactccggc 1921 ccccgcacgc aggacccgcg gcccgggctg cgctcaccgc gggagtctgc cggactacac 1981 ggttgggctc ctctggtcac agggacctga ggcccgcgcc gagccctttg aggagcagga 2041 tagcggagct cagggcccgg gaggaccgcg tggggagcgt gggagggcag tcaagaccac 2101 agctgtcctc gggtgctccg cgcggcgctc ggctccggcc ccgggcgcag gaagcggttc 2161 cgccgcttga aggtggcggc ggcggcctca gcaaaccgcg gcttcctcca ggaaagccgc 2221 agccctgaga gggcgtcctg gggacatgcg cctccggagc cgcacggtgg gcaccagctg 2281 tcaccagggg gtccgagtgc gcggaattcg tctcactaag acactccggt tctctccaaa 2341 gccaggctcc ccctcggagt ctcacagcat ccaaacttct tggtgttggc tgctcacggg 2401 gaggggaggg cgcgcgcccg cagccgcccc tgtcctgcgt cgagactcgt gcttcgctgg 2461 tccccggtca ggcaccgccg atgccgccca gcctgcgcac tgggaaggcg ggaggctcgc 2521 agcctgcacc acagcacccc tgggctggag caaaagcccg gtggtgaccg cgtctgtgct 2581 cggaccgcgt gccaggaggg cgctcctgag aggcatgcct tggagagggt gcaggcagtc 2641 gccccaatcc cctcgcagcc tctattggga gacaatgacc ccaacccgtc ctttgatgta 2701 gtcccccgcc cccagccccc accacagcac tcgtgtatcc agaaggaaag gcgggagggg 2761 agatttaaac tttcttatcc ctggggagtg ggtgagaccc gtccagcttc cctctgcggg 2821 ctccagggtc taaactggct tcctgccctt ccaggtcccc cggcgagtac aatgatctcc 2881 cgccaggtct attgccagca tgatttgggt ggcaggcacc ctggctgtgt tattgccagt 2941 ggtttatatt atcagcctgg ctcatcaggg cccagctttc ccgctggcag gcacaggcat 3001 tggggaccgt gcagtcccag agctcagact cccatggctg ccgccagggc ttgatttccc 3061 gctcaagcac atgtgcaggg gcctgacagt cacccagcca tagtggtccc cacttgcttg 3121 tctgtcggat gatgatggca gctggcaagg tgttgtgagg ctcgaatgaa agccagggtg 3181 caggcagcgt cagcaggtgg gaatgttaat ttcattgtta ccctcccaag tttgaatgaa 3241 gagaagcctt ccctgatgtt cccttctcta cctcctgtcc ggtgacttct cccttcctga 3301 acttccactc 
actccccgac ctgtggggtg tgtgctaccc tccccagccc tgagcccggg 3361 gtgtggcttt gggcagggcc agctgggcct cagccgcccc ctccgtgggg gcggcgggag 3421 tgaggcacct ctcatttctt ctaggtattt tggagataac ggcagtggag gtgggcattg 3481 tggccatcag gggtctcttc tccgggcggt acctggccat gaacaagagg ggacgactct 3541 atgcttcggt gagtccaggc tgtcacgtgg gtgggcgctg acggagtagc ggtctggcct 3601 gcacatcaag ccaggggacg ggggatgtgg gcagtagaat gctttgccaa gggtacatgg 3661 gatccaagct aggaccagac cctggcccag ggctccacgc ccagtgatct ttggctgtgt 3721 ctgttagagg ctgctccgcg agtgacctgg agcctggccc tgggggacct gggatctgga 3781 gaatgccagg ttagcaaagg tttcccccat ccttgtcatc actaccaccc tcccttgaga 3841 gtagctgaga gcagggaagt gattgttctc caaatttcca ccaaaataaa tggacccaag 3901 acaacctggg agtgttcctt ggggcagcca gagctctcct gggagcatgg agaggagggc 3961 gctggcccag ctccaccggc tctctccaag ggtggaaagg cgagagctga agcctaggag 4021 gggaaggggg atggcgccca tgtcccgggc acacaggagc cttggagccc tggcttggga 4081 cccagtctat ctactgcctg ggcctccgtg taaccaaaaa gggtctcctt attctgtggg 4141 tatggcagcg cccctcttca taaggggtag gggtggggaa tacacagagt aaaatacatg 4201 tcacagggaa atttgctacc tccaactagt cattacatgc aatttggctg atacttcctt 4261 gggcaatgag aggttttcca tccatgagta gcctggctga cgcggcccaa ggacaatctc 4321 cctgcagtga gctctctgct cagtcctgct cacagaggac acatcccgca gctccctctc 4381 gcagaagctg atgatttcat cacagatttt tagccgtttt gctaaaggaa ggtccagaaa 4441 gccgggatgc gccccttcat tttctctggt ccagaggcta ctccctcctt cctcccatcc 4501 actcacccat ccacccatcc acccattcac ccatccactc atccacccat ccacccatcc 4561 atccatccac tcatccaccc atccacccat ccatccatcc acctatccat ccatccatcc 4621 acccatccac ccatccttcc atttacccat ccaccatcca cctatccatc catccatcca 4681 cccatccacc catccaccca tttacccatc cacccatcca cccatccacc tatccatcca 4741 tccatccatc cacctatcca cccatccact catccactca cccatccacc tatccaccca 4801 cccactcacc catccatcca cccacccact cacccatcca tccatacacc tatccatctg 4861 acattcagtc cattcatcag tcagtgtcct ggaagctgtg tctgggagca ggggccctca 4921 tggtgccagc ttctgccata ggggagccag gatttggaga gacagaataa agcatgacat 4981 gagggggtcc cagtggttca 
gagcccacct ggggcttccg tgcctgagaa ctcacagcct 5041 ggctttgaga agggtggaca gaggctccta gccccaccag ggatgcttag ccaagcagtc 5101 tggctggagg gtgtggaact ttccagagtg cccagtggag aagttcccat tgccctgcag 5161 acaacctggg aggcatagta gcctggatgc caactccgtc acctgccacc tgccacctga 5221 taccacagtc ctgcgtgtgg ccagcttgcc taggagtgct gtcccaccac ggggctccct 5281 ggactttgcc ggcagcatgc ttgccctcag gccagctcat ctcactgtgg gccagtgagc 5341 actgcaccct gacactcctg gtctggccct gacctccctt ctgaggtata gaccaccctc 5401 ccccaacatc ctgctgactc agacactcca accgaactca gcagctcccc gactccccat 5461 cttaatcatt gccccaccag ccactcactc tcacccagag tggactggga gtcatggtag 5521 acccctcctg ctattcagtc catccctagt cctcagacca cgcccctgca tctcttcagc 5581 ccacacacct cacctccctc tgcctcatcc ccgagagcac gccttaaaag gcaggctcat 5641 cacttcccac ctggatcact gctccagcct ctcctacctc cagtccatct gtttgtcaag 5701 ccatccatcc atcttccttc cttctttccc cttcctttat tctcttcacc catctgtcca 5761 tacacctctc cttccttcct tctttccttt ctctcatcct tctttccatc catccttcca 5821 accgttcatc tgtccatctg cccattcaca tgtccatccg tccatgctcc ctccttcctt 5881 ccatcctttc acccatccac ccatccaccc atccacccat ccaccaatcc atccagcaat 5941 ccacccatcc atccagccat ccacccatcc atccgtccac ctatccatcc atccacccat 6001 ccatccaacc agccatccac ccatccatcc atccatccat ccatccatcc atccacccac 6061 ccacccatcc acccatccat ccatccagcc aaccagccat ccacccatcc acccatccac 6121 ccatccatcc atccatccat ccatccatcc atccatccat ccacccatcc acccatccat 6181 ccatccagcc atccacccat ccatccatcc acctatccat ccatccacct atccatccat 6241 ccacccatcc atccatccag ccatccaccc atccatccat ccacccatcc acccatccat 6301 ccatccatcc atcaatccat ccatccatcc acccatccat ccatccatcc acccacccat 6361 ctgacattca gtccttcgtt caccagtcag tgtcctgggc acttaccatg tgttgcactc 6421 tgctagtgca tgttaaaggg gcttcctaaa aaagcaaatc agcccagttc atggcttctg 6481 aggactccca tgatgcgcag gcatccccca ggcaccttag ctccacccac agagcagcag 6541 cccaggtgcc atctgctccc aggggggccg tcatccagag gttgagcctc ctccagcttc 6601 cggaagcccc ctcaactctc caagccatct gtactgctca ggccacacct tcactgcctg 6661 gaacgccctc ctccagaaaa tgcccatgag 
aaagtgcccg ccccgccctt acagctccct 6721 ttagaagtca cctcctcaac aaagcagctt cggagtcggc ttcctcccca cccttcatgg 6781 ggtggaagcg gccctggggg aggggcctgt ggaaccaggt ctggtggggc ggcagtgtca 6841 ggacacacat tggaaagtgt tgtcacagcg agtcccaact gcagtagctc tggagtctat 6901 ggccccggcc cctgagccaa caccttctgc cctggtcatc ttggccaccc caggtgccct 6961 accacgtgtc agcaattgat ctacactgcc cccaatctcc cacctcggga gagcctgagc 7021 cccctgccac ctgaggctca caggcacctg tccttcagtg acccacccct caagtggccc 7081 cccaaagaga aagcctattt cctggcctct ctggccctga gaggactcac ctgctgcagg 7141 atcgagggac ctgggagaag cttgggtcct gccctgagct tctcacagtc tgccatggga 7201 gccagacact aacatgttgt acccagtgtg tggggtgggg aagtcggccc tgagcccacc 7261 cgctcattcc cagatagtcg ctgagcccca ggctatgtgg gggtcaggca cagatcagac 7321 actgttctat cccctgtaat gtagggcctg gcaccacccc tgggcctcag tttccccatc 7381 tataaacatg gagaccggtc ttggatagta acatccagac tgtggcccag ggggcactgg 7441 gcctccaaga gggctggaga gcaggggggc ttggggttat ggggagaggc agcccctacc 7501 accaggtcaa gccttccacc cttccctgct ctgggctttc atgtcgctga aaaacaggat 7561 ggggtgcttg aagcatcttg ttccctgagg tcctgcgagg tcagatgctg ctcggtccag 7621 tctgggagcc tctggaggcc atttccactc tccctctctc ccacgggaca ccacacccca 7681 gatggggaca gaccatgagg gcctccgact cctcctgcgg ccagtcccca ggaggaaaga 7741 gagatcctac tgtctgcctt gcacctgctg cttccacctc ccaccaccct ttctggtttg 7801 gagggagcag ctcctgtcag tcatcccctg agggaggtgg ccctaggcat taccacttgc 7861 ctgtagctgg gactgaagcc tagtgggtct gcataaggca tacccacctc ttccctacgc 7921 cacagactaa gaagacccca tgaggcagcc cttgagggaa actcccacgg ccaggccttg 7981 gggatgttgg tgtaagctcc ttcagttcag agcttcactg tgcctttgag agggcagggt 8041 cccttctgcc cttcccccac tgcaccctgg gcctgacaag ggtcccatta atctctgatg 8101 acaaagaggt gtcctctctg tcctgcttgt ggtgggacac cccagcttct gctctcatcc 8161 taagaacagc agtatctgtg gcactgattg atcacgtgcc tcatgccagg ccccagcccc 8221 tgctctgtgt tgtaacctct tcaacctgca aggcgagatc ctccagctta agagagtgtt 8281 gctcaggggt caatgacccc agctggggcc cctgagcgca gcgtttaatg gagagtctgg 8341 gctcatcaaa cctggctgtg tccccttccc ttcctgcttt 
ttgtgttctt tcttcttttt 8401 acttttctgc aatttctttt tttttctttt cttttttttt ttttttgagt cagagttttg 8461 ctcttgtcgc ccagactgga gtgcagtggc acaatctctg ctcactgcaa cctctgcctc 8521 ccgggttcaa gtgattctcc tgcctaagcc tcccaagtag ctgggattgt gggcgcacgc 8581 caccacaccc ggctaatttt tttgtatttt tagtagagac ggggttttgc catgttgccc 8641 aggctggtct cgaactcctg acctcaggtg attcacctgc ctcggcctcc caaactgctg 8701 ggattacagg catgagccac cgtgcccagc ctctgcaatt tctttaaaag agatctgggt 8761 gttgtcatct tcctttgtcc aaatgcccag tccttgctga cctccacact gccagcagac 8821 tgccagggca cctgggttgg ccggcctggt ctctgctcca cagacaaccc tacacatccc 8881 tttgctgtgt cagcgccctc agatggggga cagaggctgg tggaacccag agagtaggag 8941 ctgggagttg tctggcacag ctttgagtga aggtgacctt taggcagaag gccaagttca 9001 ccaggcacac aggggaggaa ggacatgctg ggcagagggc agggcctagg caaaggtgtg 9061 actggctgag agtgcgctgg ggtttggcac tggaccgaac agcctcacag gaggggaggg 9121 aggcatcagg caaccctggg ccctgacgct gccgcagtct ccccggggca ctgaccatga 9181 tatctcatcc ccgcaggagc actacagcgc cgagtgcgag tttgtggagc ggatccacga 9241 gctgggctat aatacgtatg cctcccggct gtaccggacg gtgtctagta cgcctggggc 9301 ccgccggcag cccagcgccg agagactgtg gtacgtgtct gtgaacggca agggccggcc 9361 ccgcaggggc ttcaagaccc gccgcacaca gaagtcctcc ctgttcctgc cccgcgtgct 9421 ggaccacagg gaccacgaga tggtgcggca gctacagagt gggctgccca gaccccctgg 9481 taagggggtc cagccccgac ggcggcggca gaagcagagc ccggataacc tggagccctc 9541 tcacgttcag gcttcgagac tgggctccca gctggaggcc agtgcgcact agctgggcct 9601 ggtggccacc gccagagctc ctggcgacat cttggcgtgg cagcctcttg actctgactc 9661 tcctccttga gcccttgccc ctgcgtcccg cgtctgggtt ctcagctatt tccagagcca 9721 gctcaaatca gggtccagtg ggaactgaag agggcccaag tcggagctcg gagggggctg 9781 cctgcaatgc agggcatttg tgggtctgtg tggcaggaag ccggcaggga agggcctgag 9841 tgccagccct ggcagactga ggagcctccc aggagcagcg gggcagtgtg gggctttgtg 9901 tcatcacaac attaaagtat tttattctac tctgtcgttt ggtagaccgt gatgcaggct 9961 gaggagcgct tgccgccttt actggaacgt gcttgcttcc agcacagcag aatccgcgct 10021 ggcatcagcc tgcgtcagct gctgctttaa gaggaagacg gcattcccag 
aaatcgggct 10081 aaaggtgcat ttcagttccc tggttttaga aagttacgtt tttttggatg gttggaaaca 10141 agcaaaggca tgtttgtgca tgtgtgtgca tcgtgtgtgt gtgcgtggag agaaggatcg 10201 tgtatttctg aaagcgtgag tgtgcatgtg ggtatgtgtg atcttgtgtt agtggtacct 10261 gtgtgaggac atgtatgtgt gtgtgtgttt ctgggtgtgt ctgaatgtgt gatgtgtgtg 10321 tgtctgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtggaga gagagagaga gagtttactt 10381 tctttgaaaa ctctaaaaag cctctccctc tggaagctgt gtgcttctcc agggaccctt 10441 tagagcaact gtgtcaggtc aggcagcaca gaaacttcct ttatccttac aacctgctct 10501 tggggcccgt gcaccctgtc tttacctaga aggtgaggct cagagaagca attactcagt 10561 ggccggcccc tgccttggac taggtgcctc ctcacacctg ttccccaaca atggcatggg 10621 tggaatcacc tgggccggcc caggtgagag ccagcatggg cagtgtacta acctctcctg 10681 gcacttggca ggatgggcag ggtccaggtg aggaggctct cctgagcctg ggactgtgag 10741 gaccatcgct ctctgttccc atgccctccc aggggtcaga gagcccagac tcagagagcc 10801 cagggtcaga gaacttaggg tcagagaggc cagagtcaga gagctgagac tcagagagcc 10861 cagggtcaga gtgtccaggg ccagagagct tgtggtcaca gagcccagac tcagagagcc 10921 cagactcaga gccacctgat tggttagtgc agactcgcca aacccacagg gaggctgggc 10981 tcctccctgg cacgtgtgca acacaagtga aaatctcggt gcctccttca gcccccagcg 11041 catgtcagat ttcccggaat ggctcccctg cagctgcgaa cattcctggc agtcaacagg 11101 agcagcacgc agctgagctc tgctgtgggt tttgttgttt ctctagagtg agatggggca 11161 ggggctgcca tcactccctc cttgcagatg atgaccctga gtcctggcaa ggggaacttg 11221 cccggggctg tgtcaacaca ggggaagcag cagtactcag tgctgcagga tcaacagatg 11281 gtccctgatg aaggcgtagg agacactggg ggctcttgtt taacatgtaa aacagctttg 11341 acaagagaat gtggattttt cgcagctgat ggctgtgcca tggtcacctt cttccccaca 11401 ccagagtcca agggacttca ttttgtgtgt gtgtttgggg ggtcatgggc tgaattatgt 11461 ctcctcccca gagttcatct gttgaagtcc taacccctag taactcagca tgtgacctta 11521 tttggaatag ggtcattaca gatgcaactg gtgaagatga ggtaacatag gagtagaatg 11581 acccctgaat ccattgtgac caggatcc // LOCUS HSL7A 5506 bp DNA PRI 24-APR-1993 DEFINITION Human L7a gene for large ribosomal subunit component (L7a). 
ACCESSION X52138 NID g34202 KEYWORDS L7a gene; ribosomal RNA; ribosomal subunit; RNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5506) AUTHORS Pietropaolo,C. TITLE Direct Submission JOURNAL Submitted (23-MAR-1990) Pietropaolo C., Dipartimento di Biochimica e Biotecnologie mediche - Ed.18, Via Sergio Pansini, no.5, 80131 Napoli, Italy FEATURES Location/Qualifiers source 1..5506 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="COS 1.1" /clone_lib="cosmid h2a" repeat_region 1068..1363 /note="Alu-like sequence" exon 2040..2072 /number=1 mRNA join(2040..2072,2746..2866,3375..3524,3740..3880, 4084..4163,4441..4571,4849..4918,5109..5273) precursor_RNA 2040..5273 /note="primary transcript" CDS join(2070..2072,2746..2866,3375..3524,3740..3880, 4084..4163,4441..4571,4849..4918,5109..5213) /codon_start=1 /product="L7a protein" /db_xref="PID:g34203" /db_xref="SWISS-PROT:P11518" /translation="MPKGKKAKGKKVAPAPAVVKKQEAKKVVNPLFEKRPKNFGIGQD IQPKRDLTRFVKWPRYIRLQRQRAILYKRLKVPPAINQFTQALDRQTATQLLKLAHKY RPETKQEKKQRLLARAEKKAAGKGDVPTKRPPVLRAGVNTVTTLVENKKAQLVVIAHD VDPIELVVFLPALCRKMGVPYCIIKGKARLGRLVHRKTCTTVAFTQVNSEDKGALAKL VEAIRTNYNDRYDEIRRHWGGNVLGPKSVARIAKLEKAKAKELATKLG" intron 2073..2745 /number=1 exon 2746..2866 /number=2 intron 2867..3374 /number=2 exon 3375..3524 /number=3 intron 3525..3739 /number=3 exon 3740..3880 /number=4 intron 3881..4083 /number=4 exon 4084..4163 /number=5 intron 4164..4440 /number=5 exon 4441..4571 /number=6 intron 4572..4848 /number=6 exon 4849..4918 /number=7 intron 4919..5108 /number=7 exon 5109..5273 /number=8 BASE COUNT 1228 a 1572 c 1431 g 1275 t ORIGIN 1 ggatcctggt cagcaggctt gcaggcccca gtgctgaccg ctccacctgc ctggaggccc 61 cccccacttc tcccctcagt ctgcttttct cctgctcatt ggctgtgtgc caggccccga 121 ggtaactgtt ttacacagat aaactcacca gtcatcacaa ggaccccacc atgcaggaga 181 aactgaggca ctaaaaaagt gaaatctcca 
tccaaatcca tactgttagt aagggctgga 241 cgtgccatta tttggtttgc ttatctactg accagctccc tgtcgactgg gtgcttcctg 301 cctagaacac tattcactcg cgctgctcac gccctggcct ctcctgtctc caccccttct 361 cagactctgc tccccttgtg ggccacccca cccccacctt ggcggtcttg atgatctcgg 421 tgaagttgtc catgatggac ttaatgtcgt ccttcagccg cttgttgtag gactgcaggg 481 ctctctgctg ggccatggcg agcctcaagc agcgcagcgg agacctggga cctagagtgc 541 agcacagacc tctgagtgca ggcagagtct accccagcca ccctctatgc cccaacctta 601 gcagaacacg cagatttctg gggattccct ttctgccaaa taaagtcaat cactgaaaca 661 cagatgaact cattccatca gcagacaccg caggggtgcc aggactggcc ccgcctttgc 721 cagagcaagc tcatcctgga actcctggcc agccctccaa gccctgtcca gggctccagt 781 ccagacttca cgtccaggcc cacagcgtgt cccagggagc cctgcaaact caacaccagc 841 tccccgtccc cagcccttgg cttaacttct tagcccgccg ccaccttcgc cacgccctcc 901 catccgaaga gtccttccaa ttcgacccct ctgcaccgcc tacatccacg ctttccttcc 961 actcccactc aggctgcccc gctccagacc tcatccctgc actcggggcc ccgctccctc 1021 cagtctctgc gcggcaggga agaggtccta aaaagtggtc attccaaccg ggcgcggtcg 1081 tcacggcctg taatcccagc actttgggag gccgaggcag ggatcacctg aggtcaggag 1141 tttgagacta gcctgaccaa catggtgaaa ccccaactct actaaaaata caaaaattag 1201 ccgggcgtga tggcaggcgc ctgtaatccc agctactcgg gaggctgagg caggagaatc 1261 gcttgaaccc gggaagcaca ggtcgcagtg agccgagatc gcgccactgc actccagcct 1321 gggcgacagg ggggagagct acgtctccaa aagaaaaaaa aaaggggggt aattccactc 1381 cctcgcttaa catccctcat ggttccggtg cgccggacaa ggggctcaag ttccgcgccg 1441 ccgtctcggc ctctgccctc cagccgcact ggacggcctc ggcgctggag ttgccctgcg 1501 ctggggccgg gcctttgcgc gcggtgctca ggggagcccg gggcccccta ggttcggagt 1561 ctggcgcacg accgagcgga ctcctggacg cactcgcatt gtttgtgccc atttttggcg 1621 gggtgtggga aataagtcac acgcaggaaa ggggatctcc gaccccagcg cctacgcacc 1681 cacccacccc cactcccgcc cacacaccca ccccccctcc atccccaccc cccaccacac 1741 cctcataccc gccccagcgc ccgcacacca gacgccggcg tcccgcgggt cggcctaggg 1801 cggggtggtc aagtgcctct gcgacccgca ctttcccgcg tctctcccac ggcctggccc 1861 tcccgccgca gtctctcttc cccggccgcc gcggtccgaa aacctagtca 
gccgccgcag 1921 cctctcggcc ccgcctcgat ttttagcttt ataggaatgc tgttgcttta aatccgaaat 1981 ccgtgccggt atcaactctc gcgatctccg aggccgcata catattaccc acaattccct 2041 ttcctttctc tctcctcccg ccgcccaaga tggtgagtga gctgtagttc cgtggcacta 2101 tagccaggtt ccggctgtat ccgctgccat cctcctccag gcgcgcctcg gagggcctcc 2161 tgctcctcct ggcgctagga gagccccact cggtgtggca cggagacacc gaggtggatt 2221 agagccccac ttggtgtggc acggagacag ttgagatgga ctagagcccc gggcggccga 2281 gagcggaatg cgttgttccc ggtgtcgcag ggctgggtgt cgcaggcctg gagcacgcag 2341 tgcgggctcg gagccctagc gtctctcggg tctgctgggg gccgctccag aggccttgtg 2401 agcgacgagt tctgagcccg cccctgttgc ttctagagcc tctggggccg cgactcagag 2461 gagtcacgag tccggggtgt ctcctgggtg ggcgacgcga agagagcgtg gtctcgggct 2521 tagccttgct ctggccactc ggggttcccg gggctgcatg cttgtgcggc tgaatgtgag 2581 atgctcctgt cgaggggtgg tgctgggggg ttgcagaaag ctgctcgcca gcttagttca 2641 ggcaggtgct gtcagcgtcc cttgttttgg aggagccagc ctgagcccta cccccgacga 2701 agcgagtgga ggcggcggtt taactgacgt tttctttctg cccagccgaa aggaaagaag 2761 gccaagggaa agaaggtggc tccggcccca gctgtcgtga agaagcagga ggctaagaaa 2821 gtggtgaatc ccctgtttga gaaaaggcct aagaattttg gcattggtaa gtaacaaacg 2881 gcagaatgaa aacggtctat gtttttctca agggaaggtg gtaattgggt tgtgttgtat 2941 cttgtaggtt ttagtgggtg taaagtggtc gcagtcctta atttgtgtct cttagagacg 3001 ggggcaatga tacatgcttc ttgctttcat tgggagttgc tgagcgagca ttcagctcaa 3061 tatggtagtg gccttgaatt cagcttagcc atctggaaac aagtacagta gcagtgtcgc 3121 agcgaggtac taggactgca attctgctgt acttcgtggc accttggctt cttgttagat 3181 gaggaaaagc atcgtgctct ttgttctcag gtgtttgtgt gcagatgatg taaaagaata 3241 tttgctatct gagagatggt gatgacattt taaaccacca agatcgctga tgcaccaaca 3301 cccttcctag tggccccaga catgaacttg acatggaatt tgagcctcac tcggtgtcac 3361 cctttcattc tcaggacagg acatccagcc caaaagagac ctcacccgct ttgtgaaatg 3421 gccccgctat atcaggttgc agcggcagag agccatcctc tataagcggc tgaaagtgcc 3481 tcctgcgatt aaccagttca cccaggccct ggaccgccaa acaggtgagg ttctgtggcg 3541 tggaaaggag tttctcaggc aaggattcct tatttctacc agaactcaga ggggatggtc 
3601 ttcaggcttc ttcgaactgc aggttgtcat taaacttata gtcatatagc aggaccgcag 3661 tccagcattt gttattaagt gttaagtgac aaggattaga accttgactc caagcctaaa 3721 ctgaagtgtg tttttccagc tactcagctg cttaagctgg cccacaagta cagaccagag 3781 acaaagcaag agaagaagca gagactgttg gcccgggccg agaagaaggc tgctggcaaa 3841 ggggacgtcc caacgaagag accacctgtc cttcgagcag gtcgagtcag gccccacctt 3901 cagggtcgaa cactgggggc gggctcgttc agcagtcgat cgtcaaaatt tcttcggcct 3961 cgaaattcac tcgtcgaaga gtcaaaaccg agctttttaa cactgagtca gcagctgagc 4021 ccagcagctt cttgtgacta gagcaggccc tgtgagtgct cacaaagtgg ttgtgtgttc 4081 taggagttaa caccgtcacc accttggtgg agaacaagaa agctcagctg gtggtgattg 4141 cacacgacgt ggatcccatc gaggtgcgtt tgcctgttga ctgctaaccc aagggcttct 4201 ggcagtacca ggaagagaga gtagacctaa tgccaagtca gtgatgggac cgaagtgggt 4261 gagggcagta ctgacacaga tccaacacat gcgtggctct tgcaatgatg tgaatctctc 4321 actgaattca accttgaagt gcgaatccat gagcttttta accctgagca attgttacaa 4381 gctaactgaa atttgctgct tttggtcaaa atacagtctt cagctaatgc tttcttccag 4441 ctggttgtct tcttgcctgc cctgtgtcgt aaaatggggg tcccttactg cattatcaag 4501 ggaaaggcaa gactgggacg tctagtccac aggaagacct gcaccactgt cgccttcaca 4561 caggtgaact cgtaagtaca catagcctgg ccccaaactt ccccccagtt catttaatcc 4621 atgcctcaca gttgtttcct tttgccttaa aggccaatct tttagtttaa gaaatatatt 4681 tatctgaatc ttttgccaat gatggttaag aatttcttca ccgtaataaa ccatgtggtc 4741 agcattgcat ctgaggcaaa agactgtctt gagctaaaag gtatttttgc attctaaaag 4801 ggaaactaag gcaaaaaacc cacttttgtt tcccctcctg ccttttaggg aagacaaagg 4861 cgctttggct aagctggtgg aagctatcag gaccaattac aatgacagat acgatgaggt 4921 aagaggcagc tttacaccaa aatactgtca ttcacaaatc tttctcccaa ataactggct 4981 ggcttaacct atgagaagtt ctatctgacg atcagcttgg aacagccaaa cagaattaac 5041 gcaactaata accttgaaaa tctcagaaaa cagtaagcca agctaactgc ctctttttgt 5101 cttttcagat ccgccgtcac tggggtggca atgtcctggg tcctaagtct gtggctcgta 5161 tcgccaagct cgaaaaggca aaggctaaag aacttgccac taaactgggt taaatgtaca 5221 ctgttgagtt ttctgtacat aaaaataatt gaaataatac aaattttcct tcagccagtg 5281 
tctgttgagt atctcgggtt gaatcttact tggggttagc aagtatcttt ttgagacaca 5341 gcctcactct gtcgcccagg ctggagtgca gtggtgtgat gtcaaagcaa ccttcgcctc 5401 ccaggtttag gatattctgg agcctcagca tcccaactgg ctggcccata tttgtgtttt 5461 tggtagagat ggggtttcac catgttagcc aggctggtct gagctc // LOCUS HSLCATG 6901 bp DNA PRI 24-APR-1993 DEFINITION H.sapiens gene for lecithin-cholesterol acyltransferase (LCAT). ACCESSION X04981 NID g34286 KEYWORDS Alu repetitive sequence; phosphatidylcholine-sterol acyltransferase; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6901) AUTHORS McLean,J., Wion,K., Drayna,D., Fielding,C. and Lawn,R. TITLE Human lecithin-cholesterol acyltransferase gene: complete gene sequence and sites of expression JOURNAL Nucleic Acids Res. 14 (23), 9397-9406 (1986) MEDLINE 87091568 COMMENT See M12625 for LCAT mRNA Data kindly reviewed (14-JUL-1987) by McLean J. 
FEATURES Location/Qualifiers source 1..6901 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="(lambda)charon30" CAAT_signal 709..716 TATA_signal 786..790 prim_transcript 809..5052 mRNA join(809..990,1724..1880,1960..2075,2170..2265,2349..2573, 4458..5052) exon 809..990 /number=1 CDS join(837..990,1724..1880,1960..2075,2170..2265,2349..2573, 4458..5032) /codon_start=1 /product="lecithin-cholesterol acyltransferase (LCAT)" /db_xref="PID:g34287" /db_xref="SWISS-PROT:P04180" /translation="MGPPGSPWQWVTLLLGLLLPPAAPFWLLNVLFPPHTTPKAELSN HTRPVILVPGCLGNQLEAKLDKPDVVNWMCYRKTEDFFTIWLDLNMFLPLGVDCWIDN TRVVYNRSSGLVSNAPGVQIRVPGFGKTYSVEYLDSSKLAGYLHTLVQNLVNNGYVRD ETVRAAPYDWRLEPGQQEEYYRKLAGLVEEMHAAYGKPVFLIGHSLGCLHLLYFLLRQ PQAWKDRFIDGFISLGAPWGGSIKPMLVLASGDNQGIPIMSSIKLKEEQRITTTSPWM FPSRMAWPEDHVFISTPSFNYTGRDFQRFFADLHFEEGWYMWLQSRDLLAGLPAPGVE VYCLYGVGLPTPRTYIYDHGFPYTDPVGVLYEDGDDTVATRSTELCGLWQGRQPQPVH LLPLHGIQHLNMVFSNLTLEHINAILLGAYRQGPPASPTASPEPPPPE" intron 991..1723 /number=1 exon 1724..1880 /number=2 intron 1881..1959 /number=2 exon 1960..2075 /number=3 intron 2076..2169 /number=3 exon 2170..2265 /number=4 intron 2266..2348 /number=4 exon 2349..2573 /number=5 intron 2574..4457 /number=5 repeat_region complement(3306..3585) /note="Alu repetitive sequence" repeat_region complement(3609..3914) /note="Alu repetitive sequence" repeat_region complement(3916..4207) /note="Alu repetitive sequence" exon 4458..5052 /number=6 polyA_signal 5028..5033 repeat_region 5424..5748 /note="Alu repetitive sequence" repeat_region 5961..6261 /note="Alu repetitive sequence" repeat_region 6352..6643 /note="Alu repetitive sequence" repeat_region 6656..6901 /note="Alu repetitive sequence" BASE COUNT 1473 a 2028 c 1933 g 1467 t ORIGIN 1 ctgcagacat ggagttcctc gaggtgctga ccgagggcct tgagcgggtg ctgttggtgc 61 gcggtggtgg ccgtgaagtc atcaccatct actcctgagc ccagtgtcat cttgtggcct 121 ggagtcgagg tcttggccag gacataacaa gctgtggtct ggggtaacag cctcttccca 181 gcacccacct gccagccctg cttgcctggc cctgtcctgg acccagcttt gctaggtctc 
241 cttggaaacc aggcctgggc ctcaaaatgg agatggatcc caggtcttgt gggaccctgg 301 gatgtttggg gactttacta tctagcaccc cagtaggcct gtcctggcca gagaagactg 361 gtaggggccg agtggggttt gaaggcagcc ggcccggccc agcccaggag cgctatttat 421 tgcatattta ttgtttggat gtcaccatca gagacgaagg gaagggtagc cagggaggga 481 gtccagccca gctgcctgca ggaagatctg gctcagtcta ctatgggcag ggccccccac 541 caagctgagc cgaatggaga cagctgagct gaggcctgac tttttcaata aaacattgtg 601 tagttctggg cctcctgctg ccccggctct gtttcccctg gcgccaagag aagaaggcgg 661 aactgaaccc aggcccagag ccggctccct gaggctgtgc ccctttccgg caatctctgg 721 ccacaacccc cactggccag gccgtccctc ccactggccc tagggcccct cccactccca 781 caccagataa ggacagccca gtgccgcttt ctctggcagt aggcaccagg gctggaatgg 841 ggccgcccgg ctccccatgg cagtgggtga cgctgctgct ggggctgctg ctccctcctg 901 ccgccccctt ctggctcctc aatgtgctct tccccccgca caccacgccc aaggctgagc 961 tcagtaacca cacacggccc gtcatcctcg gtaagccccc accaggcccc tgatgcacca 1021 cgccagaccc tggggagcct gggccccagc ccctggcagc tgacctggcc aaagcccttc 1081 tgccctgcat aagccccgac ataagtacct gccctggtgt ggggaggggc caaaagcttg 1141 tcccttagag gaatgacgtc ccttctccca ccacactgtg actctcagtt gtctaaccca 1201 ggggggcgga gtgggggacg gggtgtgcct gaggtcttgg ctggggcatc acaagctgtg 1261 gtcagtcaca gccacaccag actctgggcc aagcccacca ctccttcctt ggcccccacc 1321 caccaaggac aagatgccca gcccaggatc ggtgagcagg agaggcccat ccatgcccgg 1381 cccctattag gcccagcccc catgccccca gacctatctg ttcccacctt ggactttggc 1441 aataaaggag cgccagactg ggtgtttgct ctgcagaggc aggggtcagc ggccttggcc 1501 tcagcacctc agcgccttcc cttcctcagg gaagcctggg ctttggctac tggggggaca 1561 gcagggagag ggggtgtaag caggggaggg taagtgtgct ttgtacctgg gggttgaggg 1621 tatgggaggt ggggggtggg tctggtcact gcagcatctg gggtgacggg ggtaagggtc 1681 acggggggaa tccagagtcc agagtgaggg ctgctgctca cagtgcccgg ctgcctgggg 1741 aatcagctag aagccaagct ggacaaacca gatgtggtga actggatgtg ctaccgcaag 1801 acagaggact tcttcaccat ctggctggat ctcaacatgt tcctacccct tggggtagac 1861 tgctggatcg ataacaccag gtacagccat gtgctccacc ctagccccaa cacgctgccc 1921 cttggctact 
ggctgctgag tggcaccccc gccccgcagg gttgtctaca accggagctc 1981 tgggctcgtg tccaacgccc ctggtgtcca gatccgcgtc cctggctttg gcaagaccta 2041 ctctgtggag tacctggaca gcagcaagct ggcaggtttg tgtcagaggg cagggctggg 2101 gctccaggcc tgggtgctgg cccacagcag gcatggccca agcccccggt gctgctggtc 2161 cccccacagg gtacctgcac acactggtgc agaacctggt caacaatggc tacgtgcggg 2221 acgagactgt gcgcgccgcc ccctatgact ggcggctgga gcccggtgag tgtctctgcg 2281 gatgaccggc ttggggtggg gcaggtgccc cagaccccag ctgccctgac cccttccacc 2341 cgctgcaggc cagcaggagg agtactaccg caagctcgca gggctggtgg aggagatgca 2401 cgctgcctat gggaagcctg tcttcctcat tggccacagc ctcggctgtc tacacttgct 2461 ctatttcctg ctgcgccagc cccaggcctg gaaggaccgc tttattgatg gcttcatctc 2521 tcttggggct ccctggggtg gctccatcaa gcccatgctg gtcttggcct caggtgagaa 2581 ggcctcgaac acttaggtcc agcgatgggt gagaccaagc tgatcctggg cctgccttca 2641 ttgcggctcc tgctcacagt ggcctctagg ggtgctatct accactcctg ggctggcatg 2701 cttgctgtca ctggccccca gagcagtgac cctggcctga gcaattaggg tggctccttc 2761 cagagtctgt gtcagtgatg gcaaaggggc agtgaacaca gaaagtgaat cccagctatc 2821 tgctcccagg gtttgttctt gtagccccag agcctgcctg cccagccctt gcctgccttc 2881 cacttgctcg caggagtgcc tttgcccagg gatgtgcttc actgaggatg gatctgccag 2941 agctagggcc agaccccccg aggccccacc tgctcttccc tagggagtgg caccagggcc 3001 cagcactgac acagctacca cctactcccc accccctgta ccctgggagc tggtctggag 3061 agagaaacac agtctggaca agagaaacgc tcatcagaca ccaccaataa acatcaaaca 3121 gacaccatct tgtttccccc tttctggagc acaactctgt ggcccccatt gctgcataag 3181 ccacacaagg agcagaaaga cacatgcccg agggaggaca gccagcaccg cccccaccat 3241 ccccacacat cctgggcctc acccaccatc cccacacacc ctggtcctca ggaagccccg 3301 cctacttttt tttttctttt tcagacaggg tcattctgtc gcccaggctg gagtgcagtg 3361 gcgtgatcat agccgcagtc tcaatctccc tggcccaagc aatcctcctg cctcagcctc 3421 ctggttagct gagactatag gcacacaaca ccacacctaa tttatttttg ttttttagta 3481 gagatgaggt cttgctatgt tgcccaggct ggtctcaaac tcttcacctt aagtgatctt 3541 cctgcctcag cctcccgaag tgctgggatt acaggcgtga gctactgtgc tgggcctttt 3601 aaaaaatgtt tatttgttta 
tttatgtatt ttgagatgga gtctcgctct gtttcccagg 3661 ctggagtgca gtagtgcaat ctccactcac tgcaacctcc atctccccag ttcaagtgat 3721 tcttctgcct cagcctccct agtagctagg atcacaggca tgtgccacca cgcctggcta 3781 atttttatat ttttagtaga gattaggttt ccccatgttg gccaggctgg tctcgaactc 3841 ctgaccttaa gtcatctgcc tgcctcggcc tcccaaagtg ctaggattac aggtgtgagc 3901 caccgtaccc ggccctattt atttattttt taagctggaa tctcactgtg tcacccaggc 3961 tacagtgcag tggtgcgatc atagtttact gtaacctcaa attcctaggc tcaagcaatc 4021 ttcctgcctt tgcctcctga gtagctagga ctagaggtgc actccactaa gcccagctga 4081 tttttttttt tttttttttt gtagagacag ggtctcactg cattgcctag tctggtcacg 4141 gactcctggc ctcaagtgat cctcctgcct cagcctccca aagtgttggg attacagggg 4201 tgagccatgg tgcctgtccc tgccatcctt ttgaagccct acagctccac ccaacagagg 4261 tcttatcagg gcttctcatt gagtaagctg acactgagca tcattgaata tcaggcctgc 4321 tcaagcctgt ggcttagagt ctgtgtctag attgggcagg gacaagattg agcatctggc 4381 tgagcctaca ctcagcaggt tgtgggccag gggtagccag gcctggctcc ctgtcccacc 4441 ttgctccata tccacaggtg acaaccaggg catccccatc atgtccagca tcaagctgaa 4501 agaggagcag cgcataacca ccacctcccc ctggatgttt ccctctcgca tggcgtggcc 4561 tgaggaccac gtgttcattt ccacacccag cttcaactac acaggccgtg acttccaacg 4621 cttctttgca gacctgcact ttgaggaagg ctggtacatg tggctgcagt cacgtgacct 4681 cctggcagga ctcccagcac ctggtgtgga agtatactgt ctttacggcg tgggcctgcc 4741 cacgccccgc acctacatct acgaccacgg cttcccctac acggaccctg tgggtgtgct 4801 ctatgaggat ggtgatgaca cggtggcgac ccgcagcacc gagctctgtg gcctgtggca 4861 gggccgccag ccacagcctg tgcacctgct gcccctgcac gggatacagc atctcaacat 4921 ggtcttcagc aacctgaccc tggagcacat caatgccatc ctgctgggtg cctaccgcca 4981 gggtccccct gcatccccga ctgccagccc agagcccccg cctcctgaat aaagaccttc 5041 ctttgctacc gtaagccctg atggctatgt ttcaggttga agggaggcac tagagtccca 5101 cactaggttt cactcctcac cagccacagg ctcagtgctg tgtgcagtga ggcaagatgg 5161 gctctgctga ggcctgggac tgagctgggc accctagatg tacagctgcc cactctcctg 5221 gtcgagctgt tgaggcagtg tgcaccgtgc ctgcctctgt gctgggcgcg gggactggag 5281 ctggctccac ccacagccct gtcagaggag 
cacggggcgg tgggggggcg gtgacaattt 5341 gagctgtctc tcccagctcc caaaagggca ggtgagacga ccttttgagt gctgggtgaa 5401 tgacagggcc acaagtgttt agaggccggg cacggtggct cacgcctgta atcctggcac 5461 tttgggagac cgaggcaggc ggatcacctg aactcaggag tttgagacca gccaggccaa 5521 catggcagaa acccccgtct ctactaaaat acaaaatatt tgccaggcgt ggtggcatgc 5581 atctgtcgtc ccagctactc aggaggctga ggcacgagaa tcatttgaac ctgggaggtg 5641 aggtcactgt gagccgagat cacgtcgttg cactccagac tgggcggcag aatgagactg 5701 tctcaaaaaa aaagagactg ggtctcaaaa aaaaaaaaaa aaaaaaaagt tagaattaaa 5761 atctgggagt gacaaccatt aatcagatca tttcactaat ggaagtttaa ttactaatga 5821 agctaaatgc tctgagaaaa gcttaggaag cacaagaggc tgagcctttc aggtcagcaa 5881 agacttccca gaggaggcag tgcctacact gaggtcagag tgacaagaag agtaatggac 5941 cactgtaaag acttgggttc ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt 6001 gggaggccga ggcgggtgga tcatgaggtc aggagatcga gaccatcctg gctaacaagg 6061 tgaaaccccg tctctactaa aaatacagaa aattagccgg gcgcggtggc gggcgcctgt 6121 ggtcccagct actcgggagg ctgaggcagg agaatggcgt gaacccggga agcggagctt 6181 gcagtgagcc gagattgcgc cactgcagtc cgcagtccgg cctgggcgac agagcgagac 6241 tccgtctcaa aaaaaaaaaa agacttgggt ttgacttgat tgagcccagg agttcgagac 6301 aagcctgggc aatatagtga gacctcatct ctacaaaaat tttaaaaatt agcctggtgc 6361 ggtggctcat gcctgtaatc ccagcactct gggaggccga ggtgggcgga tcacttgagg 6421 tcagaagttt gagaccaccc tgaccaacat ggagaaaccc cgtctctact aaaaatacaa 6481 aattagccgg gcatggtggc gcatgcctgt aatcccagct actcgggagg ctgaggcagg 6541 agaattgttt gaacctggga ggtggacgtt gcggtgagcc aagatcacac tattgcactc 6601 cagcctgggc aacaagagca aaactccgtc tcaaaaaaaa aaatttattt ttaaattagc 6661 caggtgtagc cacagctgta gtcaaatcta ctaggcaggc tgaggtggga ggattgcttg 6721 aacctgggag gcagaggttg cagtgagcca agatggtgcc acggcattcc agcctgagca 6781 acagcaagac cctgtgtcca aaaaaaaaaa aaaaaaaaac cgtaaaatag gccaggcaca 6841 gtggttcatg gttataagcc tagcactttg gaaggctgag gagggtggat cgcctgagct 6901 c // LOCUS HSLH01 1662 bp DNA PRI 31-JUL-1997 DEFINITION Human beta-LH gene (luteinizing hormone gene beta 
subunit). ACCESSION X00264 NID g34351 KEYWORDS complementary DNA; glycoprotein; luteinizing hormone; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1662) AUTHORS Talmadge,K., Vamvakopoulos,N.C. and Fiddes,J.C. TITLE Evolution of the genes for the beta subunits of human chorionic gonadotropin and luteinizing hormone JOURNAL Nature 307 (5946), 37-40 (1984) MEDLINE 84093590 FEATURES Location/Qualifiers source 1..1662 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide join(550..564,917..961) /gene="HCG" CDS join(550..564,917..1084,1319..1561) /gene="HCG" /codon_start=1 /product="beta-Luteinizing hormone" /db_xref="PID:e332193" /db_xref="PID:g2292893" /translation="MEMLQGLLLLLLLSMGGAWASREPLRPWCHPINAILAVEKEGCP VCITVNTTICAGYCPTMMRVLQAVLPPLPQVVCTYRDVRFESIRLPGCPRGVDPVVSF PVALSCRCGPCRRSTSDCGGPKDHPLTCDHPQLSGLLFL" gene 550..1561 /gene="HCG" mat_peptide join(962..1084,1319..1558) /gene="HCG" BASE COUNT 274 a 530 c 527 g 331 t ORIGIN 1 aagggagagg tggggctcgg gcttaatccc tccttggggg gcatctgggt caagtggctt 61 ccctggcagc acagtcacgg ggagaccctc tctcactggg cagaagctaa gtccgaagca 121 gcgcccctcc tgttaggttg gactgtggtg caggaaagcc tcaagtggag ggttgaggct 181 tcagtccagc actttcctcg ggtcatggcc tcctcctggc tcccaagacc ccacaattgg 241 cagaggcagg ccttcctaca ccctactccc tgtgcttcca gcctcgacta gtccctagca 301 ctcgacaact gagtctctga ggtcacttca ccgtggtctc tgcctcacct ctggcgctag 361 acccgtgagg ggagagggct ggggcactct gctgagccac tcctgcgcct ccctggccat 421 gtgcacctct cgcccccggg ggattagtgt ccaggttacc ccaggcatcc tatcacctcc 481 tggtggcctt gccgccccca caaccccgag gtataaagcc agatacacga ggcaggggat 541 gcaccaagga tggagatgct ccaggtaaga ctgcagggcc cctgggcacc ttccacctcc 601 ttccaggcca tcactggcat gagaaggggc agacccgtgt gagctgtgga aggaggcctc 661 tttctggagg ggcgtgaccc ccagtaagct tcaggtgggg cagttcctga gggtggggat 721 ctgaaatgtt ggggcatctc aggtcctctg ggctgtgggg tgggctctga 
aaggcaggtg 781 tccgggtggt gggtcctgaa taggagatgc caggaagggt ctctgggtct ttgtgggtgg 841 tgtaccacgc gggatgggaa ggccaggact cggggctgcg gtctcagacc cgggtgaagc 901 agtgtccttg tcccaggggc tgctgctgtt gctgctgctg agcatgggcg gggcatgggc 961 atccagggag ccgcttcggc catggtgcca ccccatcaat gccatcctgg ctgtggagaa 1021 ggagggctgc cccgtgtgca tcaccgtcaa caccaccatc tgtgccggct actgccccac 1081 catggtgagc tgcccggggc cggggcagat gctgccacct cagggccaga cccacagagg 1141 cagcggggga ggaagggtgg tctgcctctc tggcctgcgg ttggggaatg gggtgtggga 1201 aggcaggaac agagggcttc ctgggctcct gagtctgaga cctgtggggt cagcttggga 1261 gctcagctga ggcgctggcc taggcacatg ctcattcccc cactcacacg gcctccagat 1321 gcgcgtgctg caggcggtcc tgccgcccct gcctcaggtg gtgtgcacct accgtgatgt 1381 gcgcttcgag tccatccggc tccctggctg cccgcgtggc gtggaccccg tggtctcctt 1441 ccctgtggct ctcagctgtc gctgtggacc ctgccgccgc agcacctctg actgtggggg 1501 tcccaaagac caccccttga cctgtgacca cccccaactc tcaggcctcc tcttcctcta 1561 aagaccctcc ccgcagcctt ccaagtccat cccgactcct ggagccctga caccccgatc 1621 ctcccacaat aaaggcttct caatccgcac tctggcagta tc // LOCUS HSLPAPGEN 3131 bp DNA PRI 09-DEC-1996 DEFINITION H.sapiens LPAP gene. ACCESSION X97267 NID g1729765 KEYWORDS CD45-binding protein; LPAP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3131) AUTHORS Bruyns,E., Mincheva,A., Bruyns,R.M., Kirchgessner,H., Weitz,S., Lichter,P., Meuer,S. and Schraven,B. TITLE Sequence, genomic organization, and chromosomal localization of the human LPAP (PTPRCAP) and mouse CD45-AP/LSM-1 genes JOURNAL Genomics 38 (1), 79-83 (1996) MEDLINE 97124850 REFERENCE 2 (bases 1 to 3131) AUTHORS Bruyns,E. TITLE Direct Submission JOURNAL Submitted (17-APR-1996) E. Bruyns, Institute of Immunology, Ruprecht-Karls-Universitaet Heidelberg, Im Neunheimer Feld 305, Heidelberg, 69120, FRG COMMENT Overlapping sequence: X81422. 
FEATURES Location/Qualifiers source 1..3131 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cosmid library" /chromosome="11" /map="11q13" promoter 60..600 mRNA join(636..701,1968..2808) /evidence=experimental /product="LPAP" exon 636..701 /number=1 /evidence=experimental CDS join(699..701,1968..2585) /codon_start=1 /evidence=experimental /product="LPAP" /db_xref="PID:e242825" /db_xref="PID:g1729766" /translation="MALPCTLGLGMLLALPGALGSGGSAEDSVGSSSVTVVLLLLLLL LLATGLALAWRRLSRDSGGYYHPARLGAALWGRTRRLLWASPPGRWLQARAELGSTDN DLERQEDEQDTDYDHVADGGLQADPGEGEQQCGEASSPEQVPVRAEEARDSDTEGDLV LGSPGPASAGGSAEALLSDLHAFAGSAAWDDSARAAGGQGLHVTAL" intron 702..1967 /number=1 /evidence=experimental exon 1968..2808 /number=2 /evidence=experimental BASE COUNT 603 a 1019 c 983 g 526 t ORIGIN 1 tgccagcggg tcgcaccggc tagctggctg ccagagcctc tgaggcagcg caggggtcag 61 ttcccacccc cacccgtccc aggcccaggc cgaagccagc gcccagcttt cctcactgtt 121 cctgtggagg atgtctacgc ccaggcgagc tcctcgacct ctgagggacc atctccccga 181 ccactgccca gccctctgct ccctccccag aggaggcggg agggtgggct ctatattttc 241 attccaaata aaattctctt tctaaaagcc aggctgacct gctgctgcaa gggtggggct 301 gagaagtctg gaaacctgga gacagaggat gtcacaggag tcacacatgt gcagaaggtc 361 tgggggcagt gagcagcccc acaggcattg ctcccctgga ttgtcaggca gggaaactga 421 ggcagaagac ctgagaacac atgcaaggct gcatccagct tcaagttggt gctgcttctt 481 ccttggagga caggcccggc agcccagcct cctccagaga gactggggag ggtctggtgt 541 ggaccagggg tcctgcagca gggaggggca ggtggggtat gtgggcagga agcggaagcc 601 tgggccaccc ttcactgcag acgagcactg agctcacttc tcgctcgaca cagccagagc 661 tggaggtggg tgcccggcac ggaggggcct gcggaccaat ggtaagcctc ggagcccccc 721 gggttcttgg gatgggggcc acaggcggtc ggggctagaa tgtctgggcc tgtggagcta 781 aggagggccc cacattggcc ccagggaaga ggtgagacct aagaggagta tggcagaggc 841 tgaacctttc tctcacctga gctgggctcc ctgggactcc aggcctggcc cccagcaccc 901 cataccggac caggcccctg tcaggagccc tgggaaggcc cttacctggt cttcctgtga 961 gcgggaactt gcattcctgg agcctgggcg gacagcctgg ggtggtgggg gaggctggct 1021 gcacctcagc acccctcccc 
cgacaccccc cacctagcct gttccgaccg cagacttcct 1081 ctggcagctg cgcccgcctc ccccagcccc ccccagcccc cggccctgca cgcctgtggc 1141 cctcagggac tgggagtgga acggaaaccc tgccgctggg gcagccccgt ggtggggagg 1201 gaggaagagg ggcctcacgg acccccgttt ggggacctgg ccaagcagaa gatgagcagt 1261 tgcctctggg tgcatccagg cccctccatc ccccatccca ggcctcaggg agagccagcc 1321 ctgacaccag ctagcaacct ccttccctcc ctcccatctc ctctcccacc cacccaggca 1381 gcctagacac atttaatcca tacttattga gcacctacta acatgcttga cccaaaaagc 1441 ccccgtttcc tagcagctta ttgtgggggg tagataagac aatagacata aaaaatgagt 1501 acagttatct cctgcgttag gtgacatgga aggaaaaagg cactgagtgc tggggggtgc 1561 tggggtgggc tgcagtgata gacatcaggg tagaggttaa ggtcaggttc agcctcactg 1621 gggtgaagtt tgagcacggt gagcaggcca tgcagcccgg gggaggggag gatgggagga 1681 ggtggagctt tccgggcaga gggaacagcc agtgcgaagg ccccaggcag gtggcttaat 1741 gcagctgttg ggggaggtga gtggtaggga ggaggctgga gggatggggg ctgatctcac 1801 agggccagag cctggttgac caaataaggc cttggccttt tctgcttggc tgtcccaaga 1861 ggatcccaaa gagaaaaaaa cgaaagtggt cttggtcacc cagcctgccc cacaccaggc 1921 cccaccccag gtgctgagcc ctctgagccc ctgcctgtct cccacaggct ctgccctgca 1981 ccttagggct cgggatgctg ctggccctgc caggggcctt gggctcgggt ggcagcgcgg 2041 aggacagcgt gggctccagc tctgtcaccg ttgtcctgct gctgctgctg ctcctactgc 2101 tggccactgg cctagcactg gcctggcgcc gcctcagccg tgactcaggg ggctactacc 2161 acccggcccg cctaggtgcc gcgctgtggg gccgcacgcg gcgcctgctc tgggccagcc 2221 ccccaggtcg ctggctgcag gcccgagctg agctggggtc cacagacaat gaccttgagc 2281 gacaggagga tgagcaggac acagactatg accacgtcgc ggatggtggc ctgcaggctg 2341 accctgggga aggcgagcag caatgtggag aggcgtccag cccagagcag gtccccgtgc 2401 gggctgagga agccagagac agtgacacgg agggcgacct ggtcctcggc tccccaggac 2461 cagcgagcgc agggggcagt gctgaggccc tgctgagtga cctgcacgcc tttgctggca 2521 gcgcagcctg ggatgacagc gccagggcag ctgggggcca gggcctccat gtcaccgcac 2581 tgtagaggcc ggtcttggtg tcccatccct gtcacagccg ctcactcccc gtgcctctgc 2641 ttcccaagat gccatggctg gactggaccc ccagcccaca tgaccatgcc tcagactgtc 2701 accccctacc agttcccaag tccatgtgta 
ccccgctcac cacgggaacg gcccccccca 2761 accacaggca tcaggcaacc atttgaaata aaactccttc agcctgtggc cctgtggtcc 2821 tacagagacc cctccctcct ggaccagggg ctcctcctgg cacaatccaa cccaaccctg 2881 cccctaggca tgcagcacaa agagccaggt cagcaccatg attcagccct ttaatcttcc 2941 acgggagcag ttgagcgcgg ggcgtggcgg gcggccctcc gtgcccatga ttcaggggca 3001 cagctgcccc agcagacaca cactttcata cgcactcaca ccccaccccc agacacaccc 3061 ccaggtctct ggaactggcc cagggtcctg ctgctctcac agccgcagga cagggctcaa 3121 gggctaccct c // LOCUS HSLWBGTPT 1946 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens LW gene. ACCESSION X93093 NID g1491707 KEYWORDS LW blood group; LW gene; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1946) AUTHORS Hermand,P., Le Pennec,P.Y., Rouger,P., Cartron,J.P. and Bailly,P. TITLE Characterization of the gene encoding the human LW blood group protein in LW+ and LW- phenotypes JOURNAL Blood 87 (7), 2962-2967 (1996) MEDLINE 96219993 REFERENCE 2 (bases 1 to 1946) AUTHORS Bailly,P. TITLE Direct Submission JOURNAL Submitted (16-NOV-1995) P. Bailly, INSERM U76, INTS, 6, rue Alexandre CABANEL, F- 75739 PARIS CEDEX 15, FRANCE COMMENT Related sequences L27670, L27671, S78852, and S78853. 
FEATURES Location/Qualifiers source 1..1946 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /germline /cell_type="leukocyte" /clone_lib="HL11115 (Clontech)" /clone="LW clone 1" /chromosome="19" /map="p13" promoter 1..404 exon 405..807 /number=1 gene 414..1506 /gene="LW" CDS join(414..807,938..1240,1388..1506) /gene="LW" /note="transmembrane protein" /codon_start=1 /product="LW blood group" /db_xref="PID:e214717" /db_xref="PID:g1491708" /translation="MGSLFPLSLLFFLAAAYPGVGSALGRRTKRAQSPKGSPLAPSGT SVPFWVRMSPEFVAVQPGKSVQLNCSNSCPQPQNSSLRTPLRQGKTLRGPGWVSYQLL DVRAWSSLAHCLVTCAGKTRWATSRITAYKPPHSVILEPPVLKGRKYTLRCHVTQVFP VGYLVVTLRHGSRVIYSESLERFTGLDLANVTLTYEFAAGPRDFWQPVICHARLNLDG LVVRNSSAPITLMLAWSPAPTALASGSIAALVGILLTVGAAYLCKCLAMKSQA" variation 712 /gene="LW" /note="A in LWa, G in LWb" /replace="g" intron 808..937 /gene="LW" /number=1 exon 938..1240 /gene="LW" /number=2 intron 1241..1387 /gene="LW" /number=2 exon 1388..1924 /number=3 BASE COUNT 382 a 571 c 584 g 409 t ORIGIN 1 agctttctca actgcctcag ccttgtgtga gttgagggga ggtgtcacat ccagctggag 61 tcctttctaa gcagccacag cctgagcctc ccacttcctc ccccaagaaa acattgtggg 121 ttgatggcca taccctgagg ttctggtcca aatcggactt tctatgacct tctgggtctc 181 tagtgaaaac taaagactcc tctccagaaa aaaacatttg gtttctaatg aggcctggaa 241 tcttattctt gacctgggga gcggaatccc tttttgcagt actcccgggc cctctgttgg 301 ggcctcccct tcctctccag ggtggagtcg aggaggcggg gctgcgggcc tccttatctc 361 tagagccggc cctggctctc tggcgcgggg ccccttagtc cgggcttttt gccatggggt 421 ctctgttccc tctgtcgctg ctgttttttt tggcggccgc ctacccggga gttgggagcg 481 cgctgggacg ccggactaag cgggcgcaaa gccccaaggg tagccctctc gcgccctccg 541 ggacctcagt gcccttctgg gtgcgcatga gcccggagtt cgtggctgtg cagccgggga 601 agtcagtgca gctcaattgc agcaacagct gtccccagcc gcagaattcc agcctccgca 661 ccccgctgcg gcaaggcaag acgctcagag ggccgggttg ggtgtcttac cagctgctcg 721 acgtgagggc ctggagctcc ctcgcgcact gcctcgtgac ctgcgcagga aaaacacgct 781 gggccacctc caggatcacc gcctacagtg agggacaggg gctcggtccc ggctggggtg 841 aggggagggg gctggaagag 
gtgggggaag ggtagtttga cagtcgctct atagggagcg 901 cccgcggacc tcactcagag gctccccctt gccttagaac cgccccacag cgtgattttg 961 gagcctccgg tcttaaaggg caggaaatac actttgcgct gccacgtgac gcaggtgttc 1021 ccggtgggct acttggtggt gaccctgagg catggaagcc gggtcatcta ttccgaaagc 1081 ctggagcgct tcaccggcct ggatctggcc aacgtgacct tgacctacga gtttgctgct 1141 ggaccccgcg acttctggca gcccgtgatc tgccacgcgc gcctcaatct cgacggcctg 1201 gtggtccgca acagctcggc acccattaca ctgatgctcg gtgaggcacc cctgtaaccc 1261 tggggactag gaggaagggg gcagagagag ttatgacccc gagagggcgc acagaccaag 1321 cgtgagctcc acgcgggtcg acagacctcc ctgtgttccg ttcctaattc tcgccttctg 1381 ctcccagctt ggagccccgc gcccacagct ttggcctccg gttccatcgc tgcccttgta 1441 gggatcctcc tcactgtggg cgctgcgtac ctatgcaagt gcctagctat gaagtcccag 1501 gcgtaaaggg ggatgttcta tgccggctga gcgagaaaaa gaggaatatg aaacaatctg 1561 gggaaatggc catacatggt ggctgacgcc tgtaatccca gcactttggg aggccgaggc 1621 aggagaatcg cttgagccca ggagttcgag accagcctgg acaacatagt gagaccccgt 1681 ctatgcaaaa aatacacaaa ttagcctggt gtggtggccc gcacctgtgg tcccagctac 1741 ccgggaggct gagttgggag gatcctttga gccctgaaag tcgaggttgc agtgagcctt 1801 gatcgtgcca ctgcactcca gcctggggga cagagcacga ccctgtctcc aaaaataaaa 1861 taaaaataaa aataaatatt ggcgggggaa ccctctggaa tcaataaagg cttccttaac 1921 cagcctctgt cctgtgacct aagggt // LOCUS HSMB1GENE 6502 bp DNA PRI 30-SEP-1996 DEFINITION H.sapiens MB1 gene. ACCESSION X95586 NID g1262337 KEYWORDS MB1 gene; proteasome. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6502) AUTHORS Abdulla,S., Beck,S., Belich,M., Jackson,A., Nakamura,T. and Trowsdale,J. TITLE Divergent intron arrangement in the MB1/LMP7 proteasome gene pair JOURNAL Immunogenetics 44 (4), 254-258 (1996) MEDLINE 96337914 REFERENCE 2 (bases 1 to 6502) AUTHORS Beck,S. TITLE Direct Submission JOURNAL Submitted (07-FEB-1996) S. 
Beck, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London WC2A 3PX, UK COMMENT Related sequence S74378. FEATURES Location/Qualifiers source 1..6502 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="MB4" /chromosome="14" /map="q11.2" exon 588..785 /gene="MB1" /number=1 CDS join(588..785,1799..2105,5577..5863) /gene="MB1" /codon_start=1 /product="proteasome" /db_xref="PID:e223606" /db_xref="PID:g1262338" /translation="MALASVLERPLPVNQRGFFGLGGRADLLDLGPGSLSDGLSLAAP GWGVPEEPGIEMLHGTTTLAFKFRHGVIVAADSRATAGAYIASQTVKKVIEINPYLLG TMAGGAADCSFWERLLARQCRIYELRNKERISVAAASKLLANMVYQYKGMGLSMGTMI CGWDKRGPGLYYVDSEGNRISGATFSVGSGSVYAYGVMDRGYSYDLEVEQAYDLARRA IYQATYRDAYSGGAVNLYHVREDGWIRVSSDNVADLHEKYSGSTP" gene 588..5863 /gene="MB1" intron 786..1798 /gene="MB1" /number=1 repeat_region complement(1332..1580) /rpt_family="alu" exon 1799..2105 /gene="MB1" /number=2 intron 2106..5576 /gene="MB1" /number=2 repeat_region complement(2252..2332) /rpt_family="alu" repeat_region 2334..2556 /rpt_family="alu" repeat_region 2578..2880 /rpt_family="alu" repeat_region complement(3005..3311) /rpt_family="alu" repeat_region complement(3371..3654) /rpt_family="alu" repeat_region complement(4519..4803) /rpt_family="alu" repeat_region complement(4969..5169) /rpt_family="alu" misc_feature 5190^5191 /gene="MB1" /note="deletion of about 1kb" repeat_region 5204..5503 /rpt_family="alu" exon 5577..5863 /gene="MB1" /number=3 BASE COUNT 1551 a 1518 c 1655 g 1778 t ORIGIN 1 tttattttgt tgcgttgttt taccactaat tgctgtggtt taaataatat agggaaatgg 61 cctcagaatg gatcattatt ttacccaatt ttagaaggtg cccttaaaca agatggcacc 121 tcaggcgcgt taacaaggtg ctgaagcagc agcgatgtaa cctgttgctt ggacaccagg 181 ctggagtggg aggactcacc gctaagggtt cagaaccgat tttcaaccag tcacgccctt 241 ctcaaagatg gccgacctca cttcccttac gcaacatgga cgttttcagt cacttcctgt 301 gattttgagg ttattctgat ccaggactaa cttgcatgac caaagccaag atggcaggta 361 cgtcctttta gaatttaagg ttagtgaagc catctatttc cccgaccccc ttcagtgaaa 421 atggtctttc gcatctggct cttctttttg gaggcgtgct 
tgccagcagt caaaatggct 481 ccattccgga atagatttaa taggaagtga agctgtgacg gcgaggcgtt gcccggccta 541 tctttgctag gcgttctcag aattagttct ttctgcccac actagacatg gcgcttgcca 601 gcgtgttgga gagaccgcta ccggtgaacc agcgcgggtt tttcggactt gggggtcgtg 661 cagatctgct ggatctaggt ccagggagtc tcagtgatgg tctgagcctg gccgcgccag 721 gctggggtgt cccagaagag ccaggaatcg aaatgcttca tggaacaacc accctggcct 781 tcaaggtgtg gagccagccc ccttgccagg ctgagtactg aacgcccgcg acttgcctgg 841 cctcccagcc tgaccggagt ttgcggtggc cttggccacc aacccaggcg gaggctgggt 901 cctctcgttc ggaccgcagc agttttgctg tctcattggc ccaggaggcc cctgtctttg 961 ctctggttgg aaagggtggc ggtggggccg ggggtaattc gggggaagaa ggtgaggcct 1021 cttgggttga ttcctaggcg accccaggtt gctgctgttc aggatcttct agctgggaaa 1081 ggccgcaaca acaatgaatc ttggccgagg gaaggctatt gtaaacgttt accaaagttt 1141 tggaaactgg gcggcccctc aaggcagggg ttaactgaaa ctggcatgaa agggaaacgt 1201 tggtgaaggt aagcattgtt actaagaatc aaacatttgg ggttccttgc cccactcctg 1261 caataactag ctctagattg agaaatcatt tgatttcatt gaacctgttt cctcatctat 1321 aaaatgattt gttttttttt tttttttgaa acagggcttt tcgctgtcgc tcaggctgga 1381 gtgcagtggc tccagcgcct cagcctccag agtagctgac actataggca cacaccacca 1441 cgacaggcta agttttttta ctttttgtag agatggatct ccctgtgttg cccaggctgg 1501 tctccagcac ctggcctcaa gcggtcctcc cgccttggtc tcccaaaatg cctgggttac 1561 aggagtgagc caccacgccc atcctataaa atgaatttaa tatgggttat atccatgtca 1621 agggattgat gtgaagctcg ggtgacatta atatatggtg aagtatagaa gtatactgtg 1681 aagtataaaa ggacggagaa agagatgtgc tgggtttgga ttgatgcaaa aagaagtatg 1741 taggagtgtt tttgtggtct tatgtggcct gttttgtgtt ttcctctgat cttaacagtt 1801 ccgccatgga gtcatagttg cagctgactc cagggctaca gcgggtgctt acattgcctc 1861 ccagacggtg aagaaggtga tagagatcaa cccatacctg ctaggcacca tggctggggg 1921 cgcagcggat tgcagcttct gggaacggct gttggctcgg caatgtcgaa tctatgagct 1981 tcgaaataag gaacgcatct ctgtagcagc tgcctccaaa ctgcttgcca acatggtgta 2041 tcagtacaaa ggcatggggc tgtccatggg caccatgatc tgtggctggg ataagagagg 2101 ccctggtgag ttaagctgca accacatgat ccttgggctg acactacagt 
gaggaggggt 2161 tggatggaca gatgaattaa agggcttggt gtcaatgctg aagaagtgta attaggtcct 2221 tagcccttcc ttctctcttt ttttttttcc tgttttttta gactgtgtct cattctgtcg 2281 ctcaggctgg agtgcagtgg catgatctca gctcactgca gcctccgccc ccaggttcag 2341 gaattcaaga ccagcctggc caacatggcg aaaccccatc tctactaaaa atacaaaaat 2401 tagcccggcg tggtggcggg cgcctgtaat accagctgct caggaggctg aggcatgaga 2461 attgcttgaa cccaggaggt ggatgttgca gtgagcagag atcgcgccac tgcactccag 2521 cctgggcgac agagtgagac tctgtctcaa aaaaaagaat ttaagaattc gttgtaaggc 2581 cgggcgtggt ggctcactcc tgtaatccca gcactttggg aggctgagac agtggatcac 2641 gaggtcagga gatcgagacc atcctggcta acaaggtgaa accccgtctc taccaaaaat 2701 acaaaaaatt agccaggcgt ggtggcaggc gcctgtagtc ccagctactc aggaggctga 2761 ggcaggagaa tagtgtgaac ccaggaggcg gagcttgcag tgagtcgaga tggtgccact 2821 gcactccagc ctgggcaaca gagcgagact ctgtctcaaa aaaaaaaaca aaaaaaagaa 2881 ttcgttgaat cccacatgtg gcagtagtgc tatttcaagt gttcattgac catgtgtggc 2941 ttgtggctac tgtattggac tctaatatat ggttgtccct caaaagttag tcatttctct 3001 tctgtttttt tctttttttc tttttttgag atgcaaagtt tcactccgat cgtccagtct 3061 ggcgtgtaat ggcattgttt cagctcacta caacctctgc ctcctgggtt caagtgattc 3121 tcctgccgca gtctcccaag tacctgggat tacaagcgcc caccagcatg cccagctaat 3181 tttgtatttt tagtagagat ggggtttcac catgttggcc aggctggtct cgaaatcctg 3241 acctcaggtg attcacccgc ctcagcctcc caaagtgcca ggattaaagg cgtgagccgc 3301 tgcacccggc ctcttctctt tttttcttta gagagtcttt ggcctagaag gggccttttg 3361 ggctcctctc ttttttgagg gggtggggat ggagtctcgc tctgtcgccc cgactggagt 3421 gcagcggcgg gatctcagct cactgcaatc tctgcctccc aggttcaagt gattctcctg 3481 cctcaacctt ccatgtagct gagactacag gcatatgcca ccatgcgcgt ctaatttttg 3541 tatttttagt agagacaggg tttcaccatg ttggccaggc tggtcttgaa ctcctgacct 3601 caggtgatct gcccacctca gcctgccaaa gtgttgagat tacaggagtg agccattgcg 3661 cctagtcctg ggctcatttc ttttatttct ttgctggtta taatcaccct tacaatcatg 3721 ctaggtccgg ttctatcatc tctgaaaata agttccttag cctgtttttc ctttccagca 3781 tttaagaaac ttgccagaca gttcaacttt gtatttaatg tcatttcctc atcagccagt 
3841 tttgaattgc ccgtttagta aagttggagt atagctagtc accttctaat tttcccttcc 3901 atatgcttaa aaataattat tctttcctcc ttttgatttt gcttcattca tttacctgct 3961 tactaaatac ctagttatct tatagaacaa ttatagggta cctaccactt tttcacttgc 4021 tttcaaacga gaatagaaag ggctgtaggg agtatccact tactacccaa tggaggatga 4081 aaggaaatta aaaaccggat gtgagagagg agctcacttt gttactttgg aaattgagat 4141 gcttttctct gaaaggtggg aaaagacagt tccatacatt ttacgtgatc cagggaagca 4201 ttttggaaat gttgctttga acctttagct ggcggatttg ttctaggtga tggaaacagt 4261 gtaaaagagt acagggaccg ggtaacaagg aaaatccagg gagactgagc atcatttaaa 4321 cagtgccttc ttcaagaggc cttcctgatg accctgtccc attcattgtg agtcagcccc 4381 ttgcctcata ttacagttgg cccttgagca acatgggttt gaatcactca ggtccactta 4441 tacgcaggta tttttcaacc taattctgat tgaaaataca cattcccaga atttgaaacc 4501 caaatataca gaggaccaat ttatttattt atttatttat ttatttttga gatggagttt 4561 tcctcttgtc acccatgctg gagtgcaatg gcgcgatctc agttcatcgc aacctccacc 4621 tcctgggttc aagtgatctc ctgcctcagc ctcccgagta gctgggttta cagacacccg 4681 ccaccctgcc agctaacttt tgtattttta gtagagacag ggtttcacca tgttggccag 4741 gctggtctta aactcctgac ctcgggtgat ccacctgcct cagcctccca aagtgctggg 4801 tggcgtgagc caacatatct ggccaagttt tcatatatgc aggttctgca gggccaactt 4861 tgggacttga gtgtgcatag attttggtat atgcaggggt cttgggacaa atccccttca 4921 tataccaaag gatgactagt taacccaagg ctgaataaga taaataacct tttttttttt 4981 ttattgagac ggagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctctgct 5041 cactgcaagc tccgcctccc ggattcacgc cattctcctg cctcagcctc ccgagtagct 5101 gggactacag gtgcccgcca ccacgcccgg ctaatttttt gtatttttag tagagacggg 5161 gttccagcct gaccaacatg gtggaacctt gaacttactg ctggccgggc gaggtggctc 5221 acgcctgtaa tcccagcact ttgggagtcc gagaagggca gatccccaga ggtcaggagt 5281 tcaagaccag cctgaccaac atggtgaaac cccgtctcta ctaaaaatac aaaaattagc 5341 tgggcgtggt tacgagcgct tgtaatccta gctacctggg aaactgaggc aggagaatca 5401 ctttaaccca ggaggcggag gttgtggtga gccgagattg cgccattgca ctctagcttg 5461 ggcaacaaga gcaaaactcc atctcaaaaa aaaagaacta ctgccttggc atatgaatta 5521 
gatatcaaaa ggggtgatct tttttttttt cctgctaacc tcatctccct ttccaggcct 5581 ctactacgtg gacagtgaag ggaaccggat ttcaggggcc accttctctg taggttctgg 5641 ctctgtgtat gcatatgggg tcatggatcg gggctattcc tatgacctgg aagtggagca 5701 ggcctatgat ctggcccgtc gagccatcta ccaagccacc tacagagatg cctactcagg 5761 aggtgcagtc aacctctacc acgtgcggga ggatggctgg atccgagtct ccagtgacaa 5821 tgtggctgat ctacatgaga agtatagtgg ctctaccccc tgaaagaggg tggatgcagc 5881 tgcttgtgtt tcttggggtg actgtcattg gtaatacgga cacagtgacc catcctccat 5941 cctatttata gtggaagggc cttcaattgt atcagtactt ttttttaagc tctggcacat 6001 tgacctctat gtgttaccag tcattaatga gctgctgcag aggtgactat ttgttttact 6061 ttcttggatg ttaaacatta cactactcac tactcaatct cagcctgtgt tgcatctttt 6121 gtccacactg tccctcctcc aagcactttt cagtcaaata agccaggaca gtggaaggga 6181 agtgactgta attacaggaa acagtggtgt gaagacattg tctagtgaat acattaatgt 6241 ccccactcat gaagctaata acggagaagg agagagaggc agggaggagg gagacacttc 6301 tcgatttggt ttacaaagag atgaaaagtt gatgaggtgg aaagatgcac agggctgtac 6361 caggcagggc tccagcggaa aaggctgtca ccctgtggtg ggggaaggga ggctgaggct 6421 acatgcagag cggaagggtg aagagggtaa atcttgactc ttaaaccaac tgggaaggga 6481 aggcttggac cctgggaagg gg // LOCUS HSMECDAG 4370 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens gene for Me491/CD63 antigen. ACCESSION X62654 S37524 NID g430755 KEYWORDS antigen; ME491/CD63 antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4370) AUTHORS Hotta,H. TITLE Direct Submission JOURNAL Submitted (17-OCT-1991) H. Hotta, Kobe University School of Medicine, Dept of Microbiology, 7-5-1 Kusunoki-cho Chuo-ku, Hyogo 650, JAPAN REFERENCE 2 (bases 1 to 4370) AUTHORS Hotta,H., Miyamoto,H., Hara,I., Takahashi,N. and Homma,M. TITLE Genomic structure of the ME491/CD63 antigen gene and functional analysis of the 5'-flanking regulatory sequences JOURNAL Biochem. Biophys. Res. Commun. 
185 (1), 436-442 (1992) MEDLINE 92287132 FEATURES Location/Qualifiers source 1..4370 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood leukocytes" /cell_line="peripheral blood leukocytes" /clone_lib="LN87" /clone="lambda2" /chromosome="12" /map="q12-q13" enhancer 529..536 /note="AP-1 binding site" promoter 612..621 /note="Sp1 binding site" mRNA join(658..715,1316..1392,2328..2516,2701..2775,2870..2965, 3403..3543,3768..3851,4038..4190) /note="ME491/CD63 antigen" exon 658..715 /number=1 intron 716..1315 /number=1 exon 1316..1392 /number=2 CDS join(1327..1392,2328..2516,2701..2775,2870..2965, 3403..3543,3768..3851,4038..4103) /codon_start=1 /product="ME491 /CD63 antigen" /db_xref="PID:g430756" /db_xref="SWISS-PROT:P08962" /translation="MAVEGGMKCVKFLLYVLLLAFCACAVGLIAVGVGAQLVLSQTII QGATPGSLLPVVIIAVGVFLFLVAFVGCCGACKENYCLMITFAIFLSLIMLVEVAAAI AGYVFRDKVMSEFNNNFRQQMENYPKNNHTASILDRMQADFKCCGAANYTDWEKIPSM SKNRVPDSCCINVTVGCGINFNEKAIHKEGCVEKIGGWLRKNVLVVAAAALGIAFVEV LGIVFACCLVKSIRSGYEVM" intron 1393..2327 /number=2 exon 2328..2516 /number=3 intron 2517..2700 /number=3 exon 2701..2775 /number=4 intron 2776..2869 /number=4 exon 2870..2965 /number=5 intron 2966..3402 /number=5 exon 3403..3543 /number=6 intron 3544..3767 /number=6 exon 3768..3851 /number=7 intron 3852..4037 /number=7 exon 4038..4190 /number=8 polyA_signal 4164..4170 BASE COUNT 817 a 1125 c 1467 g 961 t ORIGIN 1 ggatccctgg aaaatggaga agctgtgcta atagaggggg gccagaaatc cccactctag 61 aatgctgtag aatgttggga gacacccagg atgtgagcca gggactttct ggaagtgttt 121 gttctggccc cacccgaccc caggcagtcc ccagctgtct gcacagtcgg atggggaggg 181 ggcttgcaca gagttggagc cagaggagag agctggctca tcccctacgg taggatgggg 241 aaacctcaca gaccacattg tcacccggcc tcagctctcc gccccggcgc tcagagggta 301 actctcaccc acctcgtccg cttctctgaa ccagagtgac ccaggctgcc gtccgccccg 361 ctctcctacc ccgagttggc acggaggtat agcgccagag gggggcccca ggcgccccga 421 ggtcttgccc ttgcggcttt ccctttgcgg gggtgggcgc cttcttccgg gtaggggcca 481 cgtggccctg gccgggcggg gggctcggcc 
caccccgcgc cgggcccagt gactcaggcc 541 gcagctgtta ccgcgtcaca tgagggaggc cggcggccac tcggcggggg aggggaccgt 601 ggctggagcc cggggcgggg ccgcgcggca ggcggggcgg gagccggggg gcgcagctag 661 agagccccgg agccgcggcg ggagaggaac gcgcagccag ccttgggaag cccaggtgag 721 gccgcgggga cccggcggac ttcggcgcgg gctgggagag accacccgca gtcccttcct 781 ccagggctcg ccccaggccc gcggagactc gtccggcgct ctccagccgc cctggagcaa 841 ctttgcttcg aagttccctg ggcccgggca gggtgtggcg ggggtgctgg ctgcactttt 901 tttttttttg gtgggcgggg aggtcagagc ccaggggaaa ggggctgggg cgggggtcgg 961 ggttgggctg aagaccgttt ctccaggctg ggtgttgggt ggggaaaggg tgtttgacgc 1021 ccggagactg gcgggccgag ggtcggcgtc aggctcaagc ccctgccggc gcccttggat 1081 ctgtcctggg ccccagaggc ggcggggcgg ggtccgggct tcgcggccgt cgcccccacc 1141 cctctcctgc gggtaaagag ggggctggga gacgtggatc cgaccgcggg tttccgggtg 1201 ggaggggccg ggaatcgggg ccgtgaaggg aagccctcga gggccggccg ggagcccgga 1261 aaccgaggtc ccccttcggc cagttttaat ccccccgccc tcccctctgc cccaggcccg 1321 gcagccatgg cggtggaagg aggaatgaaa tgtgtgaagt tcttgctcta cgtcctcctg 1381 ctggcctttt gcgtgagtgg cgaggggtcc tggggggcgc gccggcaggg tgcggggaga 1441 cctggctcca cagtggtgag ggctgaggga atgagaatta aggcagctgg agaccagatg 1501 tggaaagaaa gccagggaaa gacagcgcag ccactccctc ctccagtttt cctggacacc 1561 tgaggcttgt ttccaccgag aatggccatc ctgggtgtgc agcaacccct acccccatcc 1621 ctcctggccc ttcctgtctg gtgacccaac cacttgtgat atggggcagt ggggcaacca 1681 gccaaccgcc tgatgctatc gccatggggc ttcccttcgc tctcccttct cccctctctc 1741 cctccccaac cctctcatcc ctgtccagcc cctctccccc tttccctcct cccagcgctt 1801 gggggtgagg aagggctctg cccacttcaa tttgctcctg ccctgggctc ctggcttgag 1861 aggctactgt ttgtataggg agggacacat cactcttgct cccttttcct cttcccttct 1921 cctggcctaa ctggccagag gaaccttagt cacagctgat gcgcgcgtgg atgtgcactg 1981 gaggatagtg agggggcagt tctgggcgcg aattgagctg ggagccctgg aggcctgtag 2041 aaagcattat tctaggtcag aaatacgtgt cttgaggtca aggctttctg ggaggttttt 2101 gaatttggtc atgtccctga ttaaagagcc gggaggggat caggggaggt aggggcaggc 2161 aaggatgggt catcccccta ccccacagcc aaagaaattc 
cctagaggtg tgagaatctc 2221 tcttggggat ggggtcaagt ctgtgtccta ggtgtacccc ctgccctcca ccatggaggg 2281 ccactgtgtt gtggatatgg cctaatcctc acctgatttt ctcttaggcc tgtgcagtgg 2341 gactgattgc cgtgggtgtc ggggcacagc ttgtcctgag tcagaccata atccaggggg 2401 ctacccctgg ctctctgttg ccagtggtca tcatcgcagt gggtgtcttc ctcttcctgg 2461 tggcttttgt gggctgctgc ggggcctgca aggagaacta ttgtcttatg atcacggtgc 2521 gttggggcgg accaacatgg gcggtctggg actgccagag cagcttggtt cgggctcagg 2581 aaggtgggtg aggtgggtgg gtgaggaaga cctttgtgag tgtccaggga ccggctgtgc 2641 caaaacgggc ccctgtaatg catagactcc aaacttgact tctgtccttt gctcctgcag 2701 tttgccatct ttctgtctct tatcatgttg gtggaggtgg ccgcagccat tgctggctat 2761 gtgtttagag ataaggtaag cagggaataa tgggaagggc ctgcctcagc cgggtctggg 2821 accctggtgc ttgaattctg actgcaacag tgctcctccg tactctcagg tgatgtcaga 2881 gtttaataac aacttccggc agcagatgga gaattacccg aaaaacaacc acactgcttc 2941 gatcctggac aggatgcagg cagatgtgag tggggttgct gggagcaggg gtggtgggaa 3001 tagagaattc tcatctccaa ggctgggtgt ggtgactcac gcctgtaatc tcagcacttt 3061 tgaaggccga ggtgggcaga tcacctgagg tcagaagttc aagactagcc tggccaacgt 3121 ggtgaaaccc tgtctctact aaaaatacaa aaattagcca ggtgtggtgg tgagtgcctg 3181 taatcccacc tactcaggag ggtgaggcag gagaatcgct tgaacccggg aggcggaggt 3241 tgcagtgagc cgagattgag ccaccgctgg gcacttgtct gggcaacaga gcaagtctct 3301 gtctcaaaaa aaaaaaaaaa aaaaaaaaag aattctcatc tccatattga aggtgaaggt 3361 tcacttaaca accttcttct ccatatttct tccctccccc agtttaagtg ctgtggggct 3421 gctaactaca cagattggga gaaaatccct tccatgtcga agaaccgagt ccccgactcc 3481 tgctgcatta atgttactgt gggctgtggg attaatttca acgagaaggc gatccataag 3541 gaggtaggga ggagactgtg tatgggactt gggaaacctg ggaaatgttt tagaatgcaa 3601 gagacccagg atggaggtgg acagggtcag ggatggggag gggatctgga aatgctaagg 3661 gtggggtgaa aggtacaggg gcagggaaga gactgatgga ccaagggagg gggttggggt 3721 gggaagtttc tgatggtgct tttgtccttt gtgcctgcca ccttcagggc tgtgtggaga 3781 agattggggg ctggctgagg aaaaatgtgc tggtggtagc tgcagcagcc cttggaattg 3841 cttttgtcga ggtaagagat aaccctgagc aggggctggg actctgtcca 
gggtgcctgg 3901 agggaagggg ctgaggggag gagggtgctg aacaggaggt actggcctgg gagtgtgtgt 3961 ttggagtctc ttgcatgggt ctgaggtagc tagcccttct gactcctctc cacccctgtc 4021 cccatcgggc cttctaggtt ttgggaattg tctttgcctg ctgcctcgtg aagagtatca 4081 gaagtggcta cgaggtgatg taggggtctg gtctcctcag cctcctcatc tgggggagtg 4141 gaatagtatc ctccaggttt ttcaattaaa cggattattt tttcagaccg aaaagagatg 4201 gtctgagttt gtcttagagt gatgcttgat tccttccttt ccttactgat gttccctgtc 4261 ctctgggacc ttaatgcatg tgtacttcga ggtctatttt ggggggtgtt tgggaaagga 4321 ggactttgcg aaggtgttgg ttgggcaggc agtgggagaa gtgaggatcc // LOCUS HSMED 5292 bp DNA PRI 24-APR-1993 DEFINITION Human bone marrow serine protease gene (medullasin) (leukocyte neutrophil elastase gene). ACCESSION Y00477 NID g34529 KEYWORDS elastase; medullasin; serine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5292) AUTHORS Naruto,M. TITLE Direct Submission JOURNAL Submitted (09-NOV-1987) Naruto M., Basic Research Laboratories, Toray Insustries, Inc., 1111 Tebiro, Kamakura 248, Japan REFERENCE 2 (bases 1 to 5292) AUTHORS Nakamura,H., Okano,K., Aoki,Y., Shimizu,H. and Naruto,M. TITLE Nucleotide sequence of human bone marrow serine protease (medullasin) gene JOURNAL Nucleic Acids Res. 15 (22), 9601-9602 (1987) MEDLINE 88067782 COMMENT This cDNA encodes the full protein sequence of human leukocyte (neutrophil) elastase (HLE), which was reported by Sinha et al. in PNAS USA 84:2228-2232(1987). FEATURES Location/Qualifiers source 1..5292 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="tonsil genomic library in lambda gt WES lambda B" repeat_region 287..551 /note="tandemly arranged direct repeats" CAAT_signal 1114..1118 TATA_signal 1230..1234 exon <1287..1353 /number=1 sig_peptide join(1287..1353,1786..1805) /product="pot. 
leader peptide" CDS join(1287..1353,1786..1942,2173..2314,4485..4715, 4882..5088) /codon_start=1 /product="serine protease" /db_xref="PID:g296665" /db_xref="SWISS-PROT:P08246" /translation="MTLGRRLACLFLACVLPALLLGGTALASEIVGGRRARPHAWPFM VSLQLRGGHFCGATLIAPNFVMSAAHCVANVNVRAVRVVLGAHNLSRREPTRQVFAVQ RIFENGYDPVNLLNDIVILQLNGSATINANVQVAQLPAQGRRLGNGVQCLAMGWGLLG RNRGIASVLQELNVTVVTSLCRRSNVCTLVRGRQAGVCFGDSGSPLVCNGLIHGIASF VRGGCASGLYPDAFAPVAQFVNWIDSIIQRSEDNPCPHPRDPDPASRTH" intron 1354..1785 /number=1 exon 1786..1942 /number=2 mat_peptide join(1806..1942,2173..2314,4485..4715,4882..5085) intron 1943..2172 /number=2 exon 2173..2314 /number=3 intron 2315..4484 /number=3 repeat_region 2538..2957 /note="tandemly arranged direct repeats" exon 4485..4715 /number=4 intron 4716..4881 /number=4 exon 4882..>5088 /number=5 polyA_signal 5146..5151 BASE COUNT 1129 a 1526 c 1563 g 1074 t ORIGIN 1 ttgtcagagc cccagctggt gtccagggac tgaccgtgag cctgggtgaa agtgagttcc 61 ccgttggagg caccagacga ggagaggatg gaaggcctgg cccccaagaa tgagccctga 121 ggttcaggag cggctggagt gagccgcccc cagatctccg tccagctgcg ggtcccagag 181 gcctgggtta cactcggagc tcctggggga ggcccttgac gtgctcagtt cccaaacagg 241 aaccctggga aggaccagag aagtgcctat tgcgcagtga gtgcccgaca cagctgcatg 301 tggccggtat cacagggccc tgggtaaact gaggcaggcg acacagctgc atgtggccgg 361 tatcacaggg ccctgggtaa actgaggcag gcgacacagc tgcatgtggc cggtatcaca 421 gggccctggg taaactgagg caggcgacac agctgcatgt ggccggtatc acagggccct 481 gggtaaactg aggcaggcga cacagctgca tgtggccggt atcacggggc cctggataaa 541 cagaggcagg cgaggccacc cccatcaagt ccctcaggtc taggtttggc caggtttgga 601 aaaacacagc aacgctcggt aaatctgaat ttcgggtaag tatatcctgg gcctcatttg 661 gaagagactt agattaaaaa aaaaacgtcg agaccagccc ggccaacacg tgaaaccccg 721 tctctactaa aaatacaaaa aattagccag gcgcagtgct cacgcctgtg atcccagcac 781 tctgggaggt gaggcaggcg gatcacccga ggtcagctgt tcaagaccag cctggccgag 841 tgggcgaaac actgtctcta ctacaaatac aaaaattagc cgggagtgga ggcaggtgcc 901 tgtaatctca gctattcagg aggctgaggc aggagaatca cttgaacctg ggaggcggag 961 
gttgccgtga gccgggatca cgccaccgca ctccagcctg ggcgatagag caagactctg 1021 tctccaaaaa aataaattaa aaaacccaca ttgattatct gacatttgaa tgcgattgtg 1081 catcctgaat tttgtctgga ggccccaccc gagccaatcc agcgtcttgt cccccttctc 1141 ccccttttca tcaacgcctg tgccagggga gaggaagtgg agggcgctgg ccggccgtgg 1201 ggcaatgcaa cggcctccca gcacagggct ataagaggag ccgggcgggc acggaggggc 1261 agagaccccg gagccccagc cccaccatga ccctcggccg ccgactcgcg tgtcttttcc 1321 tcgcctgtgt cctgccggcc ttgctgctgg ggggtgagtt tttgagtcca acctcccgct 1381 gctccctctg tcccgggttc tgttcccacc tctccataga gggccccacc agtgtgggtc 1441 cctcatcctc acaggggagg tgccagctgg gacaaggaga ccagaagaga ctgaggttct 1501 gagcggtgaa gccaccacca ggagcccaga gttggggttt gaaaaccggg gagggggggg 1561 gtggcaggtc gccctctggg ttcaagtcca ggtctgtctg tgccttggag gggcaccgtg 1621 gggaggtccc tttgcctctc cgtgcctcag tttcctcatc tgaacaacag gggtgcgaac 1681 ggccccgatc ccgtgggttc ccggtggggg atccagaggc cccgtggccg ggaggggaca 1741 ggctccttgg caggcactca gcacccgcac ccggtgtgtc cccaggcacc gcgctggcct 1801 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc 1861 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg 1921 ccgcgcactg cgtggcgaat gtgtgagtag ccgggagtgt gcgcgcccgg ctcggacccc 1981 gcgtcccggt ctgtgaggtg ggtgggggga ggccggggcc ggggctgctg gcgggggggg 2041 gtccgtccag ggcccgcggg gcccctcgag caccttcgcc ctcaggcccg tcgccggatg 2101 gggacgacaa ggcgcggctg agccccgacc cccggggccg cccctgagcc ccgcctctcc 2161 ctcttttggc agaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata acctctcgcg 2221 gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg gctacgaccc 2281 cgtaaacttg ctcaacgaca tcgtgattct ccaggtgccg ccgggcgggc gggggcgagg 2341 ggcggaggcc agaggcctgg ggagggtgga ggcctgggga gggtggaggc tgcgacggag 2401 gggcgcgtcg gggccgctcg tggggacctg gggtggcatc gtgggctggg tggtcccctc 2461 tccgcgcctc ggtctgcacc tctgtgaaac gggaaaatac ccgccatggg ccgttgaggg 2521 gttaaatgag atcctgcagg gaggccccga tctgctgtca atcaacaaac ttactgagaa 2581 gggaggcccc gatctgttgt caatcaacaa acttactgag aagggaggcc ccgatctgtt 2641 gtcaatcaac 
aaacttactg agaagggagg ccccgatctg ctgtcaatca acaaacttac 2701 tgagaaggga ggccccgatg ttgtcaatca acaaacttac tgagaaggga ggccccgatc 2761 tgctgtcaat caaccaaact tactgagaag ggaggccccg atctgctgtc aatcatcaaa 2821 cttactgaga agggaggccc cgatctgctg tcaatcaaca aacttactga gaagggaggc 2881 ccccgatctg ttgtcaatca acaaacttac tgagaaggga ggccccgatc tgctgtcaat 2941 caacaaactt actgagattc tgtgtgtctc tccattcacc agtcctgtgg cccagggcag 3001 gggccgcctc tgtctttggg aaaaggggca aaagtcccca cctttccacc cctgtccgcg 3061 gcttgcagtt ctggttattt cctgggcgcc gggccccgtg gctcaggcct gtcatcccag 3121 cactttggga ggctgaggcg ggtggatcac gaggtcaggt gttcgagacc agcctgagca 3181 acatagtgaa accccgtctc tactaaaata cacaaaaaaa aaattagccg agtgtggttg 3241 tgggtgcctg taatgccaac tactcaggag gctgaggaag gagaatcgct tgaaccccgg 3301 aggcggagat tgcagtgagc tgagatcaca ccactgcact ccagcctggg tctcaaaaaa 3361 aaaaaaaaag attcctccct gggaagggtt agagggagag tttccttgtc actaagtttt 3421 ctcatagctc tcacccagtg cagtggcgcg atcgcagctc actacacctc catctcctgg 3481 gctcaagcca ccctctcagc ttggaatggg gggtagctgg aaccacaggt gccaccacgt 3541 ggtccaccac gtctggctaa tatatatata tacacacaca catacatata ttataaataa 3601 taaatatata ttttatttaa ataaaatata taatatttat aattatttta taattataat 3661 aatatttata taattataaa tatcatttat aattataata tttattattt tataaaataa 3721 taaatataaa atatataaaa atatttttat aaataataaa atatatatat acacacatat 3781 atatatattt tttgagacaa gtctcgctct gtcgcccagg ctggagcgca gtgcacaatc 3841 tcactcactg cacctccgcc tcccaggttc aagcgattct cctgcctcag cctcccaggt 3901 agctgggact acaggcgccc gccaccacgc ctggctaatt tttggtattg ttagtagaga 3961 cggggtttaa ccatgttagc caggatggtc ttgatctcct gaccttttga ttggcccacc 4021 tcagcctccc aaaatgctgg gattataggc gtgagcaccg cacctggcaa ttttttttta 4081 ttatttttgt agacatgggg ctttgccaca ttgcccaggc tggtcttgaa tgcctggcct 4141 ggcctaagtg atcctcctgc ctcgccctcc caaagtgctg ggcttacaag catgagccac 4201 cgcgcccggc tgtagttttt ttgttaactg agcacctact gcttcctgca ctcaagccac 4261 atccagggac aacctccaac gccctgagcc ttggtgacgg ctcccactct acagatgggg 4321 aaaccgaggc ttgccttggg 
gagcagagtg tggggtgggt atcctgccct gcaggatccc 4381 agaaccacag tggaacctga gatggggaaa ctgaggcccg gagaggggag ggtcatcatc 4441 actgccccgt gtgacgcgct gacgatctgt ccccaccgcc acagctcaac gggtcggcca 4501 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg 4561 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg 4621 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca 4681 ctctcgtgag gggccggcag gccggcgtct gtttcgtacg tgccctgggt gtccctctgc 4741 tccccacccg ctcccagccc ggtactgcag caacaggcac cgtggctaga ccctaggatg 4801 ggacttccca accctgacac gtcggcgggc aggtgggcag ggcctcgcag tccagcttcc 4861 ccaccttgtc tgcctccaca gggggactcc ggcagcccct tggtctgcaa cgggctaatc 4921 cacggaattg cctccttcgt ccggggaggc tgcgcctcag ggctctaccc cgatgccttt 4981 gccccggtgg cacagtttgt aaactggatc gactctatca tccaacgctc cgaggacaac 5041 ccctgtcccc acccccggga cccggacccg gccagcagga cccactgaga agggctgccc 5101 gggtcacctc agctgcccac acccacactc tccagcatct ggcacaataa acattctctg 5161 ttttgtagaa tgtgtttgat gctccttggc tgtgtgattg ggtgttgaaa atggtcagta 5221 ggtcgggcgt ggtggctcac acctgtaatc ccagcacttt gggaggttga ggcaggcgga 5281 tcacttgagc tc // LOCUS HSMGSAG 1895 bp DNA PRI 19-DEC-1990 DEFINITION Human gene for melanoma growth stimulatory activity (MGSA). ACCESSION X54489 NID g34625 KEYWORDS beta-thromboglobulin; beta-thromboglobulin superfamily; growth factor; melanoma growth stimulatory activity. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1895) AUTHORS Richmond,A. TITLE Direct Submission JOURNAL Submitted (20-AUG-1990) Richmond A., Vanderbilt University, Dept. of Cell Biology, Nashville, Tennessee 37232, USA REFERENCE 2 (bases 1 to 1895) AUTHORS Baker,N.E., Kucera,G. and Richmond,A. TITLE Nucleotide sequence of the human melanoma growth stimulatory activity (MGSA) gene JOURNAL Nucleic Acids Res. 18 (21), 6453 (1990) MEDLINE 91057157 COMMENT See also X12510. 
FEATURES Location/Qualifiers source 1..1895 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /clone_lib="lambda fix blood genomic" TATA_signal 21..27 mRNA join(51..229,328..451,565..648,1180..1895) /gene="MGSA" gene 51..1895 /gene="MGSA" prim_transcript 51..1895 /gene="MGSA" exon 51..229 /gene="MGSA" /number=1 CDS join(130..229,328..451,565..648,1180..1195) /gene="MGSA" /codon_start=1 /product="melanoma growth stimulatory activity preprotein" /db_xref="PID:g34626" /db_xref="SWISS-PROT:P09341" /translation="MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQ CLQTLQGIHPKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLN SDKSN" sig_peptide 130..329 /gene="MGSA" intron 230..327 /gene="MGSA" /number=1 exon 328..451 /gene="MGSA" /number=2 mat_peptide join(330..451,565..648,1180..1195) /gene="MGSA" /product="melanoma growth stimulatory activity protein" intron 452..564 /gene="MGSA" /number=2 exon 565..648 /gene="MGSA" /number=3 intron 649..1179 /gene="MGSA" /number=3 exon 1180..1895 /gene="MGSA" /number=4 BASE COUNT 466 a 501 c 450 g 478 t ORIGIN 1 gctttccagc cccaaccatg cataaaaggg gttcgcggat ctcggagagc cacagagccc 61 gggccgcagg cacctcctcg ccagctcttc cgctcctctc acagccgcca gacccgcctg 121 ctgagcccca tggcccgcgc tgctctctcc gccgccccca gcaatccccg gctcctgcga 181 gtggcactgc tgctcctgct cctggtagcc gctggccggc gcgcagcagg tgggtaccgg 241 cgccctgggg tccccgggcc ggacgcggct ggggtaggca cccagcgccg acagcctcgc 301 tcagtcagtg agtctcttct tccctaggag cgtccgtggc cactgaactg cgctgccagt 361 gcttgcagac cctgcaggga attcacccca agaacatcca aagtgtgaac gtgaagtccc 421 ccggacccca ctgcgcccaa accgaagtca tgtaagtccc gccccgcgct gcctctgcca 481 ccgccggggt cccagaccct cctgctgccc caaccctgtc cccagcccga cctcctgcct 541 cacgagattc ccttccctct gcagagccac actcaagaat gggcggaaag cttgcctcaa 601 tcctgcatcc cccatagtta agaaaatcat cgaaaagatg ctgaacaggt gagttatggt 661 ttccatgtac acaggcgact ggagccgttg gtcagaaata ctggcatgtg ccccctaaaa 721 ataaaatcag gaaaacccag gggttagttg aaggactaga aattgggatt attgttttca 781 caattaaggt ttcctttacg 
ataattactg ctctggtgcc agaggatatt cccaatgcct 841 ggcgtcccca ccctggttct tccttcgttc caatgaatgt aggtaaaact gccttcattt 901 gaggcccagt aggacaaaca gcaacaggtt ctggctgttt ttaatccaat agtacagtgg 961 agaccaccgc cccaccccac ccccattcct aaaagagcat cccaagctta gaggtccctg 1021 ccacacagca cagctgtcat aggcagtagc cacttggttg ccaggctggg gaaactgcat 1081 tcggagaact ctagaggctg gaggagcagg gcaggagaag agtgttgtgc aatcagcttt 1141 cccgagcacc tactcagggc acccattttc tcattgcagt gacaaatcca actgaccaga 1201 agggaggagg aagctcactg gtggctgttc ctgaaggagg ccctgccctt ataggaacag 1261 aagaggaaag agagacacag ctgcagaggc cacctggatt gtgcctaatg tgtttgagca 1321 tcgcttagga gaagtcttct atttatttat ttattcatta gttttgaaga ttctatgtta 1381 atattttagg tgtaaaataa ttaagggtat gattaactct acctgcacac tgtcctatta 1441 tattcattct ttttgaaatg tcaaccccaa gttagttcaa tctggattca tatttaattt 1501 gaaggtagaa tgttttcaaa tgttctccag tcattatgtt aatatttctg aggagcctgc 1561 aacatgccag ccactgtgat agaggctggc ggatccaagc aaatggccaa tgagatcatt 1621 gtgaaggcag gggaatgtat gtgcacatct gttttgtaac tgtttagatg aatgtcagtt 1681 gttatttatt gaaatgattt cacagtgtgt ggtcaacatt tctcatgttg aaactttaag 1741 aactaaaatg ttctaaatat cccttggaca ttttatgtct ttcttgtaag gcatactgcc 1801 ttgtttaatg gtagttttac agtgtttctg gcttagaaca aaggggctta attattgatg 1861 ttttcataga gaatataaaa ataaagcact tatag // LOCUS HSMHCPU15 5833 bp DNA PRI 29-JUL-1993 DEFINITION H.sapiens gene for major histocompatibility complex encoded proteasome subunit LMP2. ACCESSION Z14977 S47822 S47823 S47869 S47870 S47871 NID g34655 KEYWORDS major histocompatibility complex; proteasome subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5833) AUTHORS Fruh,K., Yang,Y., Arnold,D., Chambers,J., Wu,L., Waters,J.B., Spies,T. and Peterson,P.A. TITLE Alternative exon usage and processing of the major histocompatibility complex-encoded proteasome subunits JOURNAL J. Biol. Chem. 
267 (31), 22131-22140 (1992) MEDLINE 93054490 REFERENCE 2 (bases 1 to 5833) AUTHORS Fruh,K. TITLE Direct Submission JOURNAL Submitted (04-AUG-1992) Fruh K., Scripps Research Institute, Immunology, 10666 North Torrey Pines Road, La Jolla, California, USA, 92037 FEATURES Location/Qualifiers source 1..5833 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="HLA DR 7" /cell_line="MANN" /clone="U15" /chromosome="6" prim_transcript 465..5833 /gene="MHC-encoded proteasome subunit gene" exon 465..570 /gene="MHC-encoded proteasome subunit gene" /number=1 mRNA join(465..570,2424..2491,3547..3678,4289..4418,4648..4789, 5688..5833) /gene="MHC-encoded proteasome subunit gene" gene 465..5833 /gene="MHC-encoded proteasome subunit gene" CDS join(511..570,2424..2491,3547..3678,4289..4418,4648..4789, 5688..5815) /gene="MHC-encoded proteasome subunit gene" /codon_start=1 /db_xref="PID:g34656" /db_xref="SWISS-PROT:P28065" /translation="MLRAGAPTGDLPRAGEVHTGTTIMAVEFDGGVVMGSDSRVSAGE AVVNRVFDKLSPLHERIYCALSGSAADAQAVADMAAYQLELHGIELEEPPLVLAAANV VRNISYKYREDLSAHLMVAGWDQREGGQVYGTLGGMLTRQPFAIGGSGSTFIYGYVDA AYKPGMSPEECRRFTTDAIALAMSRDGSSGGVIYLVTITAAGVDHRVILGNELPKFYD E" intron 571..2423 /gene="MHC-encoded proteasome subunit gene" /number=1 exon 2424..2491 /gene="MHC-encoded proteasome subunit gene" /number=2 intron 2492..3546 /gene="MHC-encoded proteasome subunit gene" /number=2 repeat_unit 3208..3505 /gene="MHC-encoded proteasome subunit gene" /rpt_family="ALU" exon 3547..3678 /gene="MHC-encoded proteasome subunit gene" /number=3 intron 3679..4288 /gene="MHC-encoded proteasome subunit gene" /number=3 exon 4289..4418 /gene="MHC-encoded proteasome subunit gene" /number=4 intron 4419..4647 /gene="MHC-encoded proteasome subunit gene" /number=4 exon 4648..4789 /gene="MHC-encoded proteasome subunit gene" /number=5 intron 4790..5687 /gene="MHC-encoded proteasome subunit gene" /number=5 repeat_unit 5076..5354 /gene="MHC-encoded proteasome subunit gene" /rpt_family="ALU" exon 5688..5833 
/gene="MHC-encoded proteasome subunit gene" /number=6 BASE COUNT 1490 a 1311 c 1545 g 1487 t ORIGIN 1 gtcctcccct actggcggct gggggaggga acgagggcgg ggctctcgga aagtcccagg 61 aacaggctga tcctgcgctg gcgagaagct cagccattta ggggaaagcg aaatcgaaag 121 cggccgctac tcactagata acgcctactt ccaaaagtgg cctgcccaga ctattttggt 181 agcaagcgtg gaaatcagat ctgagaatct cgggagcagc cctggtgccc aattttctcc 241 atcacgcaca cccttctcgc ctctccctgc ctcctgcctt tccacttgca ccagttttcc 301 caccccagcc tcagggcggg gctgcctcgt cacttgtctc ggggcagatc tgccctacac 361 acgttagcgc cgcgcgcaaa gcagccccgc agcacccagg cgcctcctgg cggcgccgcg 421 aaggggcggg gctgtcggct gcgcgttgtg cgctgtccca ggttggaaac cagtgcccca 481 ggcggcgaag gagagcggtg ccttgcaggg atgctgcggg cgggagcacc aaccggggac 541 ttaccccggg cgggagaagt ccacaccggg gttaatgggt ctgggcttga gggttggcag 601 aggggtggag gagatgcagc ggccagggga ccctggaagc gcgcgcggag aagtgaatgc 661 agagaccaac gggagcgcag ggaggtcgcc tgtagcagcc agcgcttgca acccgcaatg 721 agcatagagt atttcttttc tgaggggggt cgtctagagt gtccgtgaag ggaacaggca 781 cgcgaggctg gtggaaaaag cgggtgcttt gactcttagc tggaagcgtc aacgggaagc 841 tactctaaag cgctttcgct ttcactctgg tcccggacag tgggggctgg ttaaatcaag 901 aaagggggtt ggggatggtg caaagagatg aggaaatggt gccctgggtg aagtagaaca 961 gcacttggga gaaggaaata taggcactta ttgagaagga ccaactcatc acacagactt 1021 ttgataaact tgccactggg caactcttag cccaagcact gataatgggc gttctgtgtt 1081 aactagtgat gcccttccct agcttgaccc aggaaggtct ctccttggcc cagatgctgc 1141 cttactccct tccctgtgtc ttccctgccc actcccatgt gcccactggg ggggactttg 1201 cttaggatgg gcgcctggtg cagatggcag ccccaagact ggctggctgg cttctgctct 1261 ggactactgc caccactcgt ggcttggggg cggctttgtt agagaggaat agcctctaac 1321 ttgaagttaa ccctgttctt tgaccctcta ttcatgataa gtcggtccgt cggaaagcat 1381 actcagagga gcgtcctttg gggccagagt aacttacggc ctggtaagaa agacacagtg 1441 aaaccactta taatttggga aatctcccct cactgccaaa tgagcagtgg caagtaggaa 1501 gtagaagtgg aaacaaggga taagagttag acctgaattt tagtcccagg tctactatta 1561 actctgtgtg actttgcata agtcgtttgc attttctgtg acttggtttc ctcatttgaa 
1621 ccgaggatct ttaaggctcc ttccaactca atagtagaat aaatgtagct ttatcttccc 1681 tcacttcttc ttgattcttt tcttgacctg gaaaagtcag cttaaacttc tcagtcaaat 1741 tatctcttgg tacaaattta cctccctggg tgctgagata tgtatttacc tcttgatcgg 1801 aaattccata actgaaactt ttattttcac ccatctgtat gtgttccttg ctgcttctct 1861 cctgccttgc ccctggacat gctaaccact gccctcctcg attttttcca atgtacagta 1921 aattggaaga gctcacttct gatgaaatgg ggggtgagag tggaggattg tgggaccaaa 1981 aaaaaaaaaa tagactgacc ttgtttccca agatcatagt caattactct gtgttgggtc 2041 tacaccacat ctgcacatac tatgagccct tccgttggag ataattttca cttgcggagc 2101 tgcttcactt ctacctgtag gagcctcatc tccacctctc tacagtggag aggattccac 2161 taggcaagtt ggaacttagg gacacagttc tttctgtgtt gtatcacagc tgggctgtgg 2221 cattcccctg cagccggatg aagcaataga gaaagtggaa agatgaaggg aaaaaaagcc 2281 tgtactgaca gtcagctctg gcctgttact gtgtaatctt tgagccagtc acttcgcctc 2341 tctgggaatg tttcttcttc tctaacatga gggcatcaag gctgttcttg ccctgacatt 2401 ccatattgct gtgtgctctg cagaccacca tcatggcagt ggagtttgac gggggcgttg 2461 tgatgggttc tgattcccga gtgtctgcag ggtgagtaaa agtgaagatg tatgcatttg 2521 gaaagaagct aatggcctca aatacacact ttccttaccc attcatgaaa agactggcaa 2581 actggagcct tggaggaatg gagttgacct tccccaaaag ccactatgat aagctatttg 2641 gtgggtgctt gggtctctga atttgtggag gaggatctgg ggtctgaatg tgtatgtgac 2701 ctgtcccagt agtgtacagg gatgagtaaa ggaatagggt ctgagagggg gacaggagat 2761 agatttttga gggtcttctt tccatctgtg cttagggatc aaaaagatga ttctgtcaag 2821 cagatacctg gtttctcatt taccatatat tgaactattt tggctcttct cccactccta 2881 accaatttcc tcacatgcaa aatgagtata tggggttagg tcaatattac tgacattatg 2941 ttccatagaa cataactctc tcaagattgt taatagcaaa gaaaattgat gaggcatatt 3001 tttcttacct tagcattttt tgctttgtta taaaatctaa gcctgaaaaa taagcctaat 3061 tttgattaac atctgcagtg attaataata tctgagatga ttatttgcct cctgctttaa 3121 tccaagcatt aaacttcatg ctattctctt gtcaaagaaa tttgagagac attgaatgat 3181 caccctcaaa aattcctgag ttctggttgg gtgcagtggc tcacatctat aatctcagca 3241 ctttgggatg ccgaggtggg cagatattta aggtcaggag tttgagacca gcctggccaa 3301 
catgttggga ccttgtctct actgaaaata caaacattag ctgggcttgg tggtgggtgc 3361 ctgtaatccc agctattcgg gaggctgagg caggagaatc acttgaacca gggaggcgaa 3421 gtttgcagtg agcccaagat tgatccactg cactccagcc tgggtgacag agtgagactg 3481 tctcaaaaaa aaaaaaaaaa aaaaacctga gttttaactt ggtgactgtt gactccctcc 3541 tgacagcgag gcggtggtga accgagtgtt tgacaagctg tccccgctgc acgagcgcat 3601 ctactgtgca ctctctggtt cagctgctga tgcccaagcc gtggccgaca tggccgccta 3661 ccagctggag ctccatgggt atgaagctct ggagttctga ctccccaccc actagagctc 3721 ccccaacctg catgaatccc tgtacagtgt gctgttccag gagctggaca ctgggaaatg 3781 gaaaagtctt gtttcggctc ttgctggcac ttgaatctgt cagtttctgc atctgtaaag 3841 tggagataat atagtacctc atgagacggt tattttgaga accacattct atatgtgaac 3901 acagtttaaa agctgtaaat cactatcctg atataaataa tcaggaagaa ggtgatattg 3961 tgacccacca taatatcagg cagttaccat acgagaaatc aaggtcgttg ggacggaagt 4021 aaccttatct gcttttcccc ataagagcag ggtccttgca gccacaagaa agttatgtgg 4081 gtggggctga ccaaaagagt gagcaattga aagcttctta ccagttggtg gtgtgggact 4141 ctggttcccc tgtacatgtg ggagggaggc tgcagtttga gctattgcag ttacagtttt 4201 caggggtcgt ttagcaggga tgatggtaac agtataggag aatgagactt aaaattctat 4261 caacctttat tcctaatatt tccctcagga tagaactgga ggaacctcca cttgttttgg 4321 ctgctgcaaa tgtggtgaga aatatcagct ataaatatcg agaggacttg tctgcacatc 4381 tcatggtagc tggctgggac caacgtgaag gaggtcaggt gagtttctcc caaagcactc 4441 tctcctctgg gcttccccac tctcctgcag aggaagatgg aagtcctatg tcattctagc 4501 aatgagttcc aaggacacta cctctgaaag catagtactt tggggatatg agataccagg 4561 gcttcattgc agggtgcaga gaccacttaa tgtctcagtg ggaaggaagg gcttgatgat 4621 tctttaacct gaggatccct ttcccaggta tatggaaccc tgggaggaat gctgactcga 4681 cagccttttg ccattggtgg ctccggcagc acctttatct atggttatgt ggatgcagca 4741 tataagccag gcatgtctcc cgaggagtgc aggcgcttca ccacagacgg taaccagcca 4801 agtggaaggg tacctgggga gggctttgaa acatgggaag gaagtagatt atgaggaaca 4861 ggaagagaaa tacaggggtg gccatttaag ttaatgccgg gcctggtaca cttttaagag 4921 tgaaaagggg caggacaaat gcaaagctca atggggttct tgggcaatac ggataaccca 4981 gggctgttct 
gagtaaatca aatgaggata cacagtcact gtgagaccca gtggtgtgct 5041 aagcacagtg gctcacacct gtaatgccaa caatttggga ggctgaggca ggaggattac 5101 ttgaccccag gagtttgagg ccagcctagg caagatggtg aaaccctgtc tccacaaaaa 5161 aacaataaaa aaaagtaaaa aaaaaattga cctgggcatg gtggtgaaca cctgtggtcc 5221 cagttactca ggaggttgag gtggaaagat catctgagcc ggggagatca aggttgtagt 5281 gagcggtgat tgaaccactg cgctgcagcc taggtgacag agagagaccc tgtctggaga 5341 aaaaaaaaaa aaaagaacca gtggtgtgct gaggtgtact gaggctggct tgggaccact 5401 catgagagcg gactgttaaa tagtccagga tttgtgaact gcttagctat ttgtaacttg 5461 caattcatca tagcgggagc atttacacca cggacatcag cagatgccac atatggaagc 5521 ctttttgtaa aaaaactgat ttaccagcac accactaaat atgccttcct ggaagatgag 5581 ttttgaggtg aaagtggtag taggcatatg gatggagggg gagtaaaaag atttttgaag 5641 ctaagccatc ctctctctcc ctctctccaa cttgaaaccc tctgcagcta ttgctctggc 5701 catgagccgg gatggctcaa gcgggggtgt catctacctg gtcactatta cagctgccgg 5761 tgtggaccat cgagtcatct tgggcaatga actgccaaaa ttctatgatg agtgaacctt 5821 ccccagactt ctc // LOCUS HSMICAGEN 11722 bp DNA PRI 25-JUN-1996 DEFINITION H.sapiens MICA gene. ACCESSION X92841 NID g1405892 KEYWORDS MHC class I chain-related gene A; MICA gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11722) AUTHORS Bahram,S., Mizuki,N., Inoko,H. and Spies,T. TITLE Nucleotide sequence of the human MHC class I MICA gene JOURNAL Immunogenetics 44 (1), 80-81 (1996) MEDLINE 96217560 REFERENCE 2 (bases 1 to 11722) AUTHORS Bahram,S. TITLE Direct Submission JOURNAL Submitted (18-OCT-1995) S. Bahram, Faculte de Medecine, Centre de Recherche dImmunologie, 4 rue Kirschleger, 67085 Strasbourg Cedex, FRANCE REFERENCE 3 (bases 1 to 11722) AUTHORS Bahram,S. TITLE Direct Submission JOURNAL Submitted (05-FEB-1996) S. Bahram, Faculte de Medecine, Centre de Recherche dImmunologie, 4 rue Kirschleger, 67085 Strasbourg Cedex, FRANCE COMMENT Related sequence L14848. 
FEATURES Location/Qualifiers source 1..11722 /organism="Homo sapiens" /note="maps to class I region of major histocompatibility complex; MICA4 allele" /db_xref="taxon:9606" /cell_type="mann" /clone_lib="M32A cosmid clone" /chromosome="6" /map="p21.3" exon <1..109 /note="leader peptide" /number=1 gene 40..11548 /gene="MICA" CDS join(40..109,6950..7204,7479..7766,8354..8632,8732..8869, 11421..11548) /gene="MICA" /codon_start=1 /product="MHC class I chain-related protein A" /db_xref="PID:e221069" /db_xref="PID:g1405893" /translation="MGLGPVFLLLAGIFPFAPPGAAAEPHSLRYNLTVLSWDGSVQSG FLAEVHLDGQPFLRYDRQKCRAKPQGQWAEDVLGNKTWDRETRDLTGNGKDLRMTLAH IKDQKEGLHSLQEIRVCEIHEDNSTRSSQHFYYDGELFLSQNVETEEWTVPQSSRAQT LAMNVRNFLKEDAMKTKTHYHAMHADCLQELRRYLESSVVLRRRVPPMVNVTRSEASE GNITVTCRASSFYPRNITLTWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRICQGE EQRFTCYMEHSGNHSTHPVPSGKVLVLQSHWQTFHVSAVAAAAAAIFVIIIFYVRCCK KKTSAAEGPELVSLQVLDQHPVGTSDHRDATQLGFQPLMSALGSTGSTEGA" intron 110..6949 /gene="MICA" /number=1 exon 6950..7204 /gene="MICA" /note="extracellular alpha 1 domain" /number=2 intron 7205..7478 /gene="MICA" /number=2 exon 7479..7766 /gene="MICA" /note="extracellular alpha 2 domain" /number=3 intron 7767..8353 /gene="MICA" /number=3 exon 8354..8632 /gene="MICA" /note="extracellular alpha 3 domain" /number=4 intron 8633..8731 /gene="MICA" /number=4 exon 8732..8869 /gene="MICA" /note="transmembrane segment" /number=5 intron 8870..11420 /gene="MICA" /number=5 exon 11421..>11722 /note="cytoplasmic tail" /number=6 BASE COUNT 2414 a 2996 c 2828 g 3484 t ORIGIN 1 cactgcttga gccgctgaga gggtggcgac gtcggggcca tggggctggg cccggtcttc 61 ctgcttctgg ctggcatctt cccttttgca cctccgggag ctgctgctgg tgagtggcgt 121 tcctggcggt cctcggcgga gcgggagcag tgggacgttt ccgggggtcg ggtgggtagc 181 ggcgagcgct gtgcggtcag ggcggggctc ctgtgccctg tcggtggcgc agggagctgg 241 acgcggcccg ttaccgccac acttcagccc tgcttccccg tcacttttca gtcctcctcg 301 ggatcgcgca tcacctgcac tttctggtct cctcctgctc tttctctcct cgcgtctcct 361 ccgcttcctc tcacttttcg gacaaaccag tccttctgag 
gcccatgggt tcccgggctg 421 cctccggggc tgctcctgtg aatggcattc gagtgccctt ccagcgcggc cactgaagca 481 gccacaaccc ccggtgctcg gggcggctct caggtccctg aagtcctgtc ctctcccgga 541 gccgacgtgt tctcagctcc tgggccgcag ctcctggagt aggggccctc ctttctcggg 601 acccggagct ggtgcttcct gctgctgtgg ggactgtggg gggtcctgac tctcaagctg 661 aggggttgga gtctgcaggc tccgggcaga ggattcttcc tgcgacttct ctcatcccca 721 gctcattctc ccctcgcctc tggctccgag ggtcctctcc tctctctcat cccaccccta 781 ctaatgacca gtgatctaag gacaccagat tccctctcac ctcctccctg cccatctcag 841 ggcccgctga gtccttttgc cctcccagct ccctgctacc ccttcctgtg tgctgttctc 901 tgatccattt ctagggtgtc ctctgccctc atcccctgtc cccgccaccg aagtccctcc 961 tgcacccctt atgggccttt cctacaagca gccttcaccc agtgctgccc ctatgcctcc 1021 ccgttcccaa atgtccctga ctctaacttt ctggtgctgc cttttatccg ggggggtctt 1081 ccctccatcc cactcccctc cagaccccca aggggaaccc tgatgctaat ggcagttggg 1141 ccttaggcag ggcgcagggc agcgcagatg ccccctcccc tccagtgcag atgcctgttc 1201 tggaccctgc ctcattgtgg ccccttcccc actccttcat cctcagcctc accctcttga 1261 ggaccccacc ctccagccca caggtgctgg accatccctc cctggtccct ccgcccctct 1321 ccaccttggg accttgtgct gctcctatct cttgcccagc tgccttgggc cctcagcacg 1381 ttctcatctt tcagtgggaa agtgggagtg ctggagcata tgacagtgct gagcatcttt 1441 cccaagcccc accctccccc agagcaccct cccctcctgt cctcacccta ccccaagttc 1501 tcccacagtc actcctgccc catgctcatg ccgccctcca gttcttgctc tgcccatctc 1561 ccctccccaa cccagaccta aaacaggctg ttgggccaac tgttccttga ccttccttct 1621 tttcttttgg ttccttgacc ccagtgggct ctcactcccc acaccgcata tctaaaatct 1681 gttttgcctg ctcttggggt gccactgctc cccctccagc attactcctt ttggcaggtc 1741 cttcctcagg ctgagaatct ccccctctac cttggttttc tctctctggc cagcaccccc 1801 actccttgct ttgtttttaa tttttaactt ttgtttgggt acgtagtaga tatatatgta 1861 tatatttatg gggtacatgg gatattttga cacaggccta caatatgtaa taatcacatc 1921 agggtaaatg ggttatatca caacaagcat ttatcctttc tttgtgctac aaacaatccc 1981 attatgctct ttcagttatt tttaaatgta caataaatta ttgttgactg tactcaccct 2041 gctgtgctat ctactagatc ttattcattc taattatatt tttgtaccca ttattaacca 
2101 tccctgctcc cccactcccc actacccttc tcagcctctg gtaatcatca ttctattgtc 2161 tctccccatg aggtccattg ttttaaattt tggctgccac aaataagtga gaacatgcaa 2221 agtttgtctg tctgggcctg gggcttattt cacttcacag gatgacctcc agttctttgc 2281 aaatgacacg atggctgaat agttctccac atacacatgt acaccacatt ttctttatcc 2341 atgcgtctgt tgatggacac ttagattgct tgcagatctt ggctactttg aatagtgctg 2401 caataaacat ggaaaagtag atagctcttt aatataccga tttcctttct ttggagtata 2461 tgcctaacag tgggagtgct ggagcatatg acagctctat tgtattttta gtttttggaa 2521 gaacctccac attgtttccc atagtggttg tactagttta cgttcccacc aacagtgtac 2581 atcctcacca gcattcctta tttctacatc ctcgccagca ttccttattg cctgtcttct 2641 ggataaaagc cagtttatct ggggtgggat gttatctcgt aggagttttg atttgccttc 2701 atctgttgac gaatgatgtt gagcaccttt tcatatacct gtttgccatt tatatgtctt 2761 cttttgagaa atgactattc agatcttttc tcatttttaa attggattat tatatttttt 2821 ttcctatagt tgttcgagct ccttatatgt ttcagttact gatcctttgt cagatgaata 2881 gtttgaaaat attttctccc attcttggat ggtctcttca ttttgtttat tgtttccttt 2941 gctgtgcaga agccttttta cttgatatga tcccatttat gcaattttac tttggttacc 3001 tgtgcttgtg gggtattact ttaaaaatct ttgcccagtc caatatccta gagagtttcc 3061 ccaatgtttt cttgtatagt ttcatagttt gaggtcatag atttacatct ttaatccact 3121 ttgatttgat ttttgtatat ggtgaaagac agggtctagt ttcattcttc tgcataagga 3181 tatctagttt ccccagcacc atttttgaag agactctcct ttgccaatgt gtgttcttgg 3241 tacctttgtt ggaaatgagt ttactgtaga tgtatggaat tgtttctggg ttctctattc 3301 tgtttcattg gtctgtgtgt ctgtttttat gccagtatca tgctgttttg gttactgtag 3361 ctctgtagta taatttgaag tcagataatg tgattcctct agttttgttc attttgctca 3421 ggatagcttt atctattctg gtttttttgt ggttccatat gcattttagg attattttta 3481 ttatttctgt gaagaatgtc attagtgttt tgatagggat tgcattgaat ctgtagatta 3541 ctttgggtag tatggatatt tcaacaaaac tgattcttcc aatccatgaa cgtggactat 3601 cttttccatt ttttgtgtcc ttcaattttt tgcatcagtg ttttttgttt ttggtttttg 3661 agatggagtt tcactcttgt tgcccaggct agaatgcaag ggtgtgatct tggctcaccg 3721 caacctccgc ctcccaggtt caagctattc ttctgcctca gcctcccaag tagctgggat 3781 
tacaggcatg tgccactgtg cctggctaat tttctatttt tattagagat ggggtttctc 3841 tatgttggcc aggctagtct tgaactcctg acctcaggtg atccacctgc ctcggcctcc 3901 caaagtgctg ggattacagg catgagccac cacgcccagc cacatcactg ttttatagtt 3961 tttattggag aggtctttca cttcttcagt taggtttatt cctcagtatt ttattttatt 4021 tgtagctatt gtaaatggga ttcgtttctt gatttctttt tcagattatt tgctgttagc 4081 actgattttt gcatgttgat tttgtatcct gcaactttac tgaatttgtt cttcagttct 4141 aatggttttt tggtggagtc tttaggtttt tccaaatatc agaccacatg atctgcaaac 4201 aaggataatt tgacttcttc ttttccagtt ttaatgccct ttctttcttt ctcctgtctg 4261 attgctctag ttaggatctg cagtactgtg ttgcataact gtggtaaaat tagtcatcct 4321 tgtcttattc cagatcttag agaaaaggct ttcagttttc ccccattcag tatgttacta 4381 gctgtgagtt tgtcatatat ggcttttatt atattgaggt ctgttccttg tatacttagt 4441 tttttgagag tttttatcat gaagggatgt tgaatttatc aaatgctttt tcagtatcaa 4501 ttgaatgata ctggcttttg tcctttattc tgttgatatg acgtattaca ttgattgatt 4561 tgtgtatgtt aaatcatcct tgcatacctg gaatacattc cacttgctca taaagaatga 4621 tcttttttaa tgtattgttg aatgtggttt gctagtattt ccttgacgat ttttgcatcg 4681 gtgttcatca gggatatagg cctgtagttt tcttttttat gatgtgtctt tgcctggttt 4741 ttgtatcagg atattcctgg ctttgtaaaa tgagtttgga agtattccct cctcctctat 4801 ttttcagaac agtttgaata ggactgacat atgttgttct ttaaaagttt aattgtggta 4861 aattatacat tacataaatt ttactgtttt aaccactttt aagtgtatac tcggtggcat 4921 tagatacatt cacatttttg tgcaacccaa aactctgtgc ccattaatcg gtaactcccc 4981 attcctccct acctctggcc cctggtaacc accattctac tttttgtttc tatgaatttg 5041 accactctag gtacctcatt taagcagaat catgtaatgt ttgtcttttt gtttctggct 5101 tatttcactt ataatatttt tgaggttcgg tgggcacagt ggctcacgcc tggatttcca 5161 gcactttggg aggctgaagc aggtggatca cctgagtttc ggagttcgaa accagcctgg 5221 ccaacatggt gaaaccccat ctctactaaa aataataaaa gttagccggg cgtgatggcg 5281 ggtgcctgta atcccaacta cttgggaggc tgaggcagga gaatcgcttg aatccgggaa 5341 gtggaggttg cagtgagctg agatcaggcc actgcactcc agcctgggca acaagagtga 5401 aattccatct ccaaaaaaaa aaaataaaac aataataata ataatatttt tgaggttcat 5461 ccaagttgta 
gtatgggtca gaatttcatt ccttttaagg atggataata ctcattatat 5521 gtatgtacca catcttggtt atccatccct cagacaatgg acacttgggt tacttctacc 5581 ttttggatat tggcaaatat ttcatttcct ttgggtatat atttatttcc tttgggtatt 5641 tcttttgggt atatatccag aaatagaagc agtacacagg ggcttcattt tctctgtctc 5701 tttgccaacc ttgctctgtg tgtgtgtgta tgtgtgtgtg taggtgtgtg ataacagcca 5761 tcctgattgg tttcaggtgg catctcattg tggtttggat ttgcattttc ctaatgagtg 5821 ctgatattga gcatcttttc atgtgtttgt tgatcatttg taattttctt tgaagaattg 5881 gccatttaag tcttttgccc attttttccc ccacatagct tctcttatca gatatatgac 5941 ttgcaatatt tatttcattt cggggttgat tgctttttca ctctgattgt gccctttgat 6001 gcatagatgt tttgaatttt catcagtcta ctttgtcagt tctttctatt ctatctgtgc 6061 tttggtgtca tatccatgaa agcactgtca aatcctatgt catgaacatt atccccaatg 6121 tttgcttcta agaaattttt aggttttagt tcttgagtgt agagtttagg tctttgattc 6181 attttgagtt aatttttgta tatagtgcaa attaagggtc caattttatt ttaacacccc 6241 ctgcccccag aactatttgc tgaaaagatc aactgactct ttgtcacctg ctcaccccag 6301 tggacactag ctgttccatc caattgctgt cctggggcct tgtcatgcta ctcttccact 6361 ttgaacccaa gcccacaccg ttcgttgctc ccctctggga tactgacccc actataaact 6421 tctctggggc tacaaccttc ctaccctttg tgcctcatga ccaccccctc ccttgtcccc 6481 gccatgccca tgatgagtct cttctcgagg cagctcccct tgcctccatc tcaccctcag 6541 cctatgcacc acagccacac tggacatggg tccctctgag cctgagtccc ttcccattcc 6601 caccatctcc tctggcaaga ccttccttcc accaccttca tgctcctccc ttgcccctgc 6661 agggcagcct ctccccttgg cccctattcc cttagggggc ttgtggccac ccagtccttg 6721 cacctggcct acaagtttgc catcttcatt cccccttctt ctgttcatca gccccctcct 6781 ctatcctccc accctcacag ttttctttgt atatgaaatc ctcgttcttg tccctttgcc 6841 cgtgtgcatt tcctgcccca ggaaggttgg gacagcagac ctgtgtgtta aacatcaatg 6901 tgaagttact tccaggaaga agtttcacct gtgatttcct cttccccaga gccccacagt 6961 cttcgttata acctcacggt gctgtcctgg gatggatctg tgcagtcagg gtttcttgct 7021 gaggtacatc tggatggtca gcccttcctg cgctatgaca ggcagaaatg cagggcaaag 7081 ccccagggac agtgggcaga agatgtcctg ggaaataaga catgggacag agagaccagg 7141 gacttgacag ggaacggaaa 
ggacctcagg atgaccctgg ctcatatcaa ggaccagaaa 7201 gaaggtgaga gtcggcaggg gcaagagtga ctggagaggc cttttccaga aaagttaggg 7261 gcagagagca gggacctgtc tcttcccact ggatctggct caggctgggg gtgaggaatg 7321 ggggtcagtg gaactcagca gggaggtgag ccggcactca gcccacacag ggaggcatgg 7381 gggagggcca gggaggcgta ccccctgggc tgagttcctc acttgggtgg aaaggtgatg 7441 ggttcgggaa tggagaagtc actgctgggt gggggcaggc ttgcattccc tccaggagat 7501 tagggtctgt gagatccatg aagacaacag caccaggagc tcccagcatt tctactacga 7561 tggggagctc ttcctctccc aaaacgtgga gactgaggaa tggacagtgc cccagtcctc 7621 cagagctcag accttggcca tgaacgtcag gaatttcttg aaggaagatg ccatgaagac 7681 caagacacac tatcacgcta tgcatgcaga ctgcctgcag gaactacggc gatatctaga 7741 atccagcgta gtcctgagga gaagaggtac ggacgctggc caggggctct cctctccctc 7801 caattctgct agagttgcct cacctccaag atgtgtccag ggaaaccctc cctgtgctat 7861 ggatgaaggc atttcctgtt ggcacatcgt gtcctgattt tcctctattg ttagagccac 7921 tggataaaga cagtgggtca gggactggac catccagtgt tgtaatcagg gcaagtagag 7981 gaccctccga cagaatcctg agcctgtggt gggtgtcagg caggagagga agccttcagg 8041 gccagggctg ccccctctgc ctcccagcct gcccatcctg gagagttccc tcctggcccc 8101 acaacccagg agtccacccc tgacatcccc ctcctcagca tcaatgtggg gatcccagag 8161 cctgaggcca cagtcccaag gcccatcctc ctgccagcct ggaagaactg ggccccagag 8221 tgaggacaga cttgcaggtc aggggtcccg gagggcttca gccagagtga gaacagtgaa 8281 gagaaacagc cctgttcctc tcccctcctt agaggggagc agggcttcac tggctctgcc 8341 ctttcttctc cagtgccccc catggtgaat gtcacccgca gcgaggcctc agagggcaac 8401 atcaccgtga catgcagggc ttccagcttc tatccccgga atatcacact gacctggcgt 8461 caggatgggg tatctttgag ccacgacacc cagcagtggg gggatgtcct gcctgatggg 8521 aatggaacct accagacctg ggtggccacc aggatttgcc aaggagagga gcagaggttc 8581 acctgctaca tggaacacag cgggaatcac agcactcacc ctgtgccctc tggtgagcct 8641 agggtgaccc tggagagggt caggccaggg tagggacagc agggatggct gtggctctct 8701 gcccagtgta taacaagtcc ctttttttca gggaaagtgc tggtgcttca gagtcattgg 8761 cagacattcc atgtttctgc tgttgctgct gctgctgctg ctatttttgt tattattatt 8821 ttctatgtcc gttgttgtaa gaagaaaaca 
tcagctgcag agggtccagg tgagaaaagc 8881 gggcagtttc tggagatggt aaggcccctg tctgggcagt agggtcccct cattgctcct 8941 gcaaagatag gcatgttggt gacaaggctt ctgtaacagg ggatgaaagt tggggaattt 9001 gggaagggaa tgggggcagc atctccatct acacccataa gtgctgccca agcgagggtc 9061 aaacgcccag ctgtggcatc ttcctgctgc aggtgaggag tgggcagcag ggagggctgc 9121 ggcgcctgct ctgtccccat cccggtctct gtgtctcttg gactcactag ggcgcatcca 9181 ggtggggtga gctgggaatc acgtgctgaa tgctgagggc ctggatgatc acggcctcag 9241 agggagcaaa tagtaaaggc agctgtgatc tggggagggc cagaaactgg agaggaatct 9301 gaggagaggc ggtgccccta ttcccttcct ctctgcatcc ccctcccctg tttctccagc 9361 catcggggcg gacaccgaga aaaagaccta tgaggcccag cctgggggcc ctgcctgtgt 9421 agccctttgg agacccctag taacagggag ggtcctgagc acacatggcc atctctgtcc 9481 actgtgcagc tccccatgca cctcctccag gagctttctt ggggttgtcg tgtcctctgc 9541 accattcgag gccctactct ttccaggttc ccacggcctg gcctccctga gtttcttgca 9601 gatgacatgg atgagtagat aagcagatgt ccctgggcca tttgaggagt ggggcccagc 9661 ccctcatcag ggcagctgtg gtccctgttt tcatcctacc tccgagtgtt ttcttctcca 9721 gtccctgagg gacacagtcc tcagggccca tgtttttggg gatttaatct gtgctctgtg 9781 gcctcacctt gccttccctg agccaatttc cctttctaaa ggtggtcact gcctggtaag 9841 tttggagtaa gggacggtca gaatcatttc ccctacagtc aggttgtttg atgggggatg 9901 aaaagagaca gcaggaagtt ttgtgtttct gcaaagacag aagcagttca ggcgacagta 9961 agaggctggg gtgtccagga gggtgtgtct ggcagtaggg tcgctggttt ctcatccttg 10021 aacctaattg cactgtcagt cggcccctca ggcctgagca gatgggaagg tttgtcccct 10081 gccctgcagc aagagggccc tgtccaggag gcacccacaa cagaggcagt gcaggtctgt 10141 ggtcactcct actctcacct gtggcgtctc ccgtagaggg attgtcagtt ctggttccct 10201 gtgggcagga atggtttcct cataggtcac tggagttttg gccaggaaaa gagtatgaag 10261 ttcatgtggc agtttctcaa aattcctgct ttcaatgttg atgtccagta aagatattcg 10321 taatttcagc tctataatct taataggatt tcctctaata ttgtgaagca tattatatga 10381 aacaggaaca caaatttctc aaaattcctg cgatgtccaa taaagatttt cataatttca 10441 gctctgcaat cttaatagga tttcctaata ctgtaaagca tattaaatga aacaggaact 10501 caaatttgga gccccctctc caggaggttc 
tgtgtggaga tggtggctgt ggcagtggca 10561 gttcccaggt gcagagggtg ggcagaggca gcctcaggct aaggggtctc ccctactcca 10621 catggagaaa atcccttgta ggttgcaagg gcagtggccg ggtggaatcc ctgctaggga 10681 cagagcagga aggcctcgca gcctcaccaa gcagcagccc tggggtggag ctgcgtttcc 10741 agggttaagc ggaccaggca ggagtagcgg ttactcaaga gcaggtcaca ggcttgggtt 10801 gtgagggtca ggagaggcca ggcctcctcg agcaaggtgg gggtcccagg gtcaggtcag 10861 gtgcagatcc tgtggcagcc acgtctttcc atgctgggcc tgctgggccc cccaggcttc 10921 ctgatggggt ccccagttag gagctgcctg ctcagggctg ggaggggagg agcactgagc 10981 tgcagataga gggcagagcc cacagtgggc agggcctgcc ctggtgtgta ggtgcctctg 11041 caggagagga gggcctgggg actgagagca agggtcaggg cctctctttg gggaggcctc 11101 tcactgtaac aggactggtc aggcctgaga ggagggcact gggttccctc ttgggtcttg 11161 tcctttagtc ttggggccct ttccctccct gcacgatgag tggtgggcac agggcacggg 11221 ctgatgttga tggagtgatg ggagggaact ggcaggggct gggaaaagca aggagggagg 11281 aagaaaaaag tgggggcctc atcttccctc agagaaaggg caaatctggt tttggagcaa 11341 ctgaagagag aaaagtcccc agggaataaa cacaacactg cacccagtgg agcatttacc 11401 catttccctc ttttctccag agctcgtgag cctgcaggtc ctggatcaac acccagttgg 11461 gacgagtgac cacagggatg ccacacagct cggatttcag cctctgatgt cagctcttgg 11521 gtccactggc tccactgagg gcgcctagac tctacagcca ggcggctgga attgaattcc 11581 ctgcctggat ctcacaagca ctttccctct tggtgcctca gtttcctgac ctatgaaaca 11641 gagaaaataa aagcacttat ttattgttgt tggaggctgc aaaatgttag tagatatgag 11701 gcatttgcag ctgtgccata tt // LOCUS HSMOGG 17538 bp DNA PRI 21-AUG-1996 DEFINITION H.sapiens gene for myelin oligodendrocyte glycoprotein (MOG). ACCESSION Z48051 NID g758091 KEYWORDS myelin/oligodendrocyte glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17538) AUTHORS Roth,M.P., Malfroy,L., Offer,C., Sevin,J., Enault,G., Borot,N., Pontarotti,P. and Coppin,H. 
  TITLE     The human myelin oligodendrocyte glycoprotein (MOG) gene: complete
            nucleotide sequence and structural characterization
  JOURNAL   Genomics 28 (2), 241-250 (1995)
  MEDLINE   96015053
REFERENCE   2  (bases 1 to 17538)
  AUTHORS   Roth,M.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JAN-1995) Roth M.P., CNRS UPR 8291, CIGH, CHU Purpan,
            Toulouse, France, 31300
REFERENCE   3  (bases 1 to 17538)
  AUTHORS   Gonzalez,P., Pinto,F., Nogales,M., Jimenez-Asensio,J., Hernandez,M.
            and Cabrera,V.M.
  TITLE     Phylogenetic relationships of the Canary Islands endemic lizard
            genus Gallotia (Sauria: Lacertidae), inferred from mitochondrial
            DNA sequences
  JOURNAL   Mol. Phylogenet. Evol. 6 (1), 63-71 (1996)
  MEDLINE   96426860
FEATURES             Location/Qualifiers
     source          1..17538
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="ICRFc109A2434"
                     /clone_lib="ICRF cosmid library"
                     /chromosome="6"
     mRNA            join(764..1253,3274..3621,10106..10219,11597..11617,
                     11860..11880,14238..14354,14658..14678,15129..16324)
                     /product="myelin oligodendrocyte glycoprotein"
     prim_transcript 764..16324
     exon            764..1253
                     /number=1
     CDS             join(1166..1253,3274..3621,10106..10219,11597..11617,
                     11860..11880,14238..14354,14658..14678,15129..15142)
                     /codon_start=1
                     /product="myelin oligodendrocyte glycoprotein"
                     /db_xref="PID:g793839"
                     /translation="MASLSRPSLPSCLCSFLLLLLLQVSSSYAGQFRVIGPRHPIRAL
                     VGDEVELPCRISPGKNATGMEVGWYRPPFSRVVHLYRNGKDQDGDQAPEYRGRTELLK
                     DAIGEGKVTLRIRNVRFSDEGGFTCFFRDHSYQEEAAMELKVEDPFYWVSPGVLVLLA
                     VLPVLLLQITLGLVFLCLQYRLRGKLRAEIENLHRTFDPHFLRVPCWKITLFVIVPVL
                     GPLVALIICYNWLHRRLAGQFLEELRNPF"
     intron          1254..3273
                     /number=1
     exon            3274..3621
                     /number=2
     intron          3622..10105
                     /number=2
     misc_feature    4519..4544
                     /note="polymorphic (CA)n"
     exon            10106..10219
                     /number=3
     intron          10220..11596
                     /number=3
     exon            11597..11617
                     /number=4
     intron          11618..11859
                     /number=4
     exon            11860..11880
                     /number=5
     intron          11881..14237
                     /number=5
     exon            14238..14354
                     /number=6
     intron          14355..14657
                     /number=6
     exon            14658..14678
                     /number=7
     intron          14679..15128
                     /number=7
     exon            15129..16323
                     /number=8
     misc_feature
16250..16289 /note="polymorphic (TAAA)n" BASE COUNT 4624 a 4120 c 3991 g 4803 t ORIGIN 1 atggaaatgt tctgtatttg tgttgtctga tgagataacc actaactgta gtgctattga 61 gcatttgaaa catggctagt gtaatcaatg aaccaaattt ttaattttat ttaattgtaa 121 ttaattttaa gtggccacat gcagggagtg actgctgcat tggacagcac ggctctaaat 181 tgagcctttt ttccttattt ggtgaggcat acttgcctta agattgggaa gtctattttt 241 ggaacctgct accaatgctg gtctcacact tgcaattctc agctgagcca agaggtgaga 301 gaaaggtcat tttccattcc aagatctcac tctcccctgt gacactgagg aaactggcaa 361 gtgatgtgaa ggctggagag cgtgtcctgt atgctggctc tgtcccttct gcctgtgttg 421 actgacatag ttagttgctg cccttgctgg tctcccttcc tccaaccttg cctctctgag 481 cacacctgac attcatctca tgacttccct aaaaacattc tttgggaaca agaaactaac 541 aaatcccaag tgacctatca catatacaaa catacagggc agagtttgga ttcgcggtag 601 aagaaaggga ggttagacat taagaagaat ggtctggtga tgacagttgt gagataatag 661 aaacaggaaa aagaaatcta agttttcttt ctttttttaa gaaccaataa taatttctct 721 cttttgacta gtcagtaggg ctggggtgga ttggaggaag cttacatatt ccatgaacaa 781 gcctcttcct aaggtcctgt aagtgatcct gccccactga ttagccccta gaagaccctt 841 caaaggttgg atctccagga gggagtgggg gaggaaagcc ctgtaccagg cagcctctgc 901 tccattgctc tgggggggtg gggaagacaa accctggtca tcccctcagt ctgtagccct 961 tttgtgtgag tgcctggcaa gggtgacgtg gggctgtttc tgcgggcaca gctgcagcaa 1021 ttaccggagt ggaggcaggg cccaggcagc actgccctcc aagatcttcc cttgggcttt 1081 tcagcagtaa ggggacatgc accccaaggg cctccacttg gcctgacctt gctgcggggg 1141 ctctctgtcc ccaggaacag tagagatggc aagcttatcg agaccctctc tgcccagctg 1201 cctctgctcc ttcctcctcc tcctcctcct ccaagtgtct tccagctatg caggtaagac 1261 atgttttttt tcctgccctg gggagaccct gaaaacagaa aggctagttt cctgggggtt 1321 agctccttca aacatcctca agttggtata ttatctttct aaaacataga cctactgaca 1381 tgcctccctt cctcagaaac cttccgtggg tggttcttac agccttcaag atggagtcca 1441 gactcttttt tttttttggg acagagtctc cctctgttgc tcaggctgga gtgcagtggc 1501 atgatctcgg ctcactgcaa cctcagcctc cctggttcaa gcgattctcc tgacttggcc 1561 tcccaagtag cggagactac aggcgcctgc caccacaccc agctaaattt gttcttttct 1621 ttcttttttt 
ttttttttgg gattttagga cagacggggt ttcacatgtt ggccaggatg 1681 gtctcgatct cttgacctgc tgatccgccc gcctcagctt cccaaagtac tgggattatg 1741 ggcgtgagcc actgcactag gcctaatttt tttattttta gtagagatgg ggtttcacca 1801 tgttggccag gctggtctgg aacccctgac ctcaagtggt ctgccctcct cagcctccca 1861 aagttctgag attacaggca tgagccattg cgtctgaccc agactcctta atgtgactaa 1921 ctccaggctt tccttggact acttcttact tgtctttcca gctttgtctt ttcacctctc 1981 caattgagat aaaataataa caacctcttg gagttctcat caggattaca tgaaatgaga 2041 tatgtaacat gcttagcagt gcctgtccat agtaaatctc aataaatgtt tgtggaatta 2101 taatatcttg tcatgtttga gactttgctc tgcataatca ggcaccagta ggtttttata 2161 aaggaacccg tctgtcacgt gcagaggaga aataaacaga aagtttccca tcctcaggga 2221 gccacctgac tgacagaggc acagtgcatc cactctccag gtctagggga gaaagcagcc 2281 ttatttctta gtagctcaga atctgacttg agaaacacat ccacatagaa aaaaacaagg 2341 aactttttcg ggtcagggtc cgggacccac agtgaggtgg aagatacagg ggaaggaaga 2401 gggaaataga gccatcccca gggtggaaga tctcagaaga gaatttggga aacaaggtat 2461 gaacaaggac tgaatagtga gaagtgatgg agagacagct aaagtagatg gagtgtcaaa 2521 accaaaacct ctaagggtag aataggcagc aatttggcca agtcctaaca gggaggccca 2581 taggaggatt caacctcaag atgctgtgcc acattccaag agggaaccta aaggctgggc 2641 tgaagagtca gagatggcta cagctggcaa aaagatgggc agatgctgag aggagatgat 2701 tgctaaaatg ttctgtccag gacattcaca gtatctctat aaccagagtc ttttttgtcg 2761 ttgttgttct caagaaggaa acttgaggcc gggtgtggtg gtttatgccc ataatcccag 2821 cgctttgggg ccaaggcagg cggatcacct gaggtcagga gttcgagacc agcctggcca 2881 acagtgtgaa acctcatctt tactaaaaat acaaaaatta gctggatgcg gcggtaggtg 2941 cctgtaatgc cagctactcg ggaggctgag gcaggagaat cacttgaacc tgggaggcgg 3001 aggttgcagg gaggcggagg ttgcagtgag ccaagattgc accactgcac tccagcctgg 3061 gcgacagaga gtaagactgt ctcaaaaaat aaatgaataa ataaaaagga agaagaagaa 3121 gaagaacaat tgcaatcctc cctggctcta gaatgtcatt taaaagtcga gtgtcttctt 3181 ccttccctgt tttgaagcag cccttctcat gacaggcttg cttgccaagg ttccctctga 3241 ccttaaatct cttccttttg gtgtcttgga cagggcagtt cagagtgata ggaccaagac 3301 accctatccg ggctctggtc 
ggggatgaag tggaattgcc atgtcgcata tctcctggga 3361 agaacgctac aggcatggag gtggggtggt accgcccccc cttctctagg gtggttcatc 3421 tctacagaaa tggcaaggac caagatggag accaggcacc tgaatatcgg ggccggacag 3481 agctgctgaa agatgctatt ggtgagggaa aggtgactct caggatccgg aatgtaaggt 3541 tctcagatga aggaggtttc acctgcttct tccgagatca ttcttaccaa gaggaggcag 3601 caatggaatt gaaagtagaa ggtgagtagt gccatataat attaggtatt aactgttggg 3661 tggccaagaa caattattct ctcaactgag atgagatccc tcaacccaaa catctcagtc 3721 ctgggaatga tttccataaa aatgtacaca tcaataaaca gaaactcatg cttagggatg 3781 tctgttgcat cattattcag agtagcaagg aaattgggat caaaatcaat gcctttgagt 3841 aggtaagtga cagaatgaac aatggtagcc atactgtgaa tattatgcag ggattaaaaa 3901 gattatttta gcactaggcc agatggtttg gggggctcct ctaaggtatt attgagtgat 3961 aagagcaagc tgctgtagga tacaaaaaca aaaacaaaac cctagggcat ggtggtttgc 4021 ctcgcagcta ctcaggaggc tgagacggga ggctggcttg agcccagggg tttgcagtta 4081 cagtgagcta tgattgcacc actgcactcc aacccgggtg acagagcaaa gaccttcacc 4141 cccactccct acccgtctct aaaaaaaaca aaaacaaaaa caaaaaaacc cttgggccca 4201 gcgccgtggc tcacgcctgt aatcccagca ctgtgggagg ccgaggtggg cagatcacaa 4261 ggtcaggaga tcgagaccat cctggctaaa acggtgaaac cccgtctcta ctaaaaatac 4321 aaaaaaaaaa aaaaaattta gccaggcatg gtagcaggcg cctgtagtcc cagctactcg 4381 ggaggctgag gcaggagaat ggcgtgaacc cggaagcgga ggttgcagtg agccaaaatc 4441 cttccactgc actccagcat gggggacaca gcgagactcc gtctcaaaaa aaaaaaaaaa 4501 accctgtatt tgtgagcgca cacacacaca cacacacaca cacacctgtg cttggtccta 4561 gtgaataagc aagtaaatca aatgtctaaa tataattata gaaaggagat gtcacctttt 4621 ggctgtacct ccactatttc attctgcaga attgcagaat ttcttttttt tttcctttct 4681 ttcttttctt tttttttttg acacagagtc tcgctctgta acccaggctg gagtgcaatg 4741 gcgccctccg cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga 4801 ttacaggtgc ccaccaccac acccagctaa tttttgtatt tttagtagag acagggtttc 4861 accaggttgt caaggttggt ctcaaactcc tgacctcagg tgatccactc gcctcagact 4921 cccaaagtgc tgggattaca ggcatgagcc atggtgcccg gcctcagaat ttcattttca 4981 acatgttttg catgatgggt gattttggag 
aatatttttt gctctatcgc aggatgatta 5041 agatgtggac aaggtgaagc cgatggaggg ggagctttga aagttacttg ctatttaatt 5101 gaggaactaa actgctttga gagcctgggg gtcagatcct ctgccttttc ctcctcccca 5161 cctgcagtgc aaacatcaga caattgatca ctattgtatc ttggaggtgg gagtgaccat 5221 tgcagtgctg ggaccagaag atggcattgt atgtggaaca acaaagcact atttctagag 5281 actgcctgca gggatatgga aatagcttta tgtgtctcag aatgttcttc atacagctgt 5341 ttttattggg gaaattctac ttgccgaaaa gtttgatagt gagaccctct ccagtttgca 5401 gatttttctc cttcctgctc aacaacttcc tagctcagta actgcctctc ccaacaaact 5461 ccctcagttt caccacacca aaaaaggaag acaagccggt tgcggtggct cacacctata 5521 atcccaaaac tttgggaggc cgaggcgggt ggatccacct gaggtcggga gttcgagact 5581 agcctgacca acatggagaa accctgtctc tactaaaaac acaaaattag cctggcgtgg 5641 tggcgcattc ctgtaatccc agctgggagg ctgaggcagg agaatcgctt gaaccccgga 5701 ggcggaggtt gcagtgagcc aagatcgttc cattacactc cagtctgggc aagaaaagtg 5761 gaactccatc tccaaaaaaa aaaaaaaaaa aacaaggaag acaaaaagaa aagcagctaa 5821 agactttgcc tcaggggaga aagttctctt ttgggttgct atccacattc caacctcctg 5881 ttcccacctc ttcgtctgca tgcctaagaa actgttttac aagtaaataa gggacgcttt 5941 gtctaggctt tggagccagg aagttgagac aaatttagga atgagatgaa gtaatggtat 6001 tattgcaagt ctcaggtgta actacctctg ctctttctct gaagagtttc taatttctct 6061 tgtttactta tttttttctt gtcatttttg ggattttatt actagttgtc tctaatcctt 6121 tctttaaatt cttcattatg aaacataaaa acaaatgcca ggcgcggcag ctcacgcctg 6181 taatcccagc actttgggag gccgaagcgg gcagatcacc cgggtcagga gttcgagacc 6241 agcctgatca acatggagaa accccgtctc tactaaaaaa tacaaaatta gctaggcgtg 6301 gtggcacatg ccagtaatcc cagctacttg agagactgag gcaggagaat cgcttgaacc 6361 gggaggcaga ggttgcggtg agccaagatc gcgccattgc actccagcct gggcaacaag 6421 agcaaaactc tgtctcaaaa aaaaaaaacc acatacaaac cagagataat attataatga 6481 gcctccaagt gcctaccacc ttgctgcagc acttgtcaat ccagggacca cccacctcac 6541 cggctcccca ctcattacca ccctccccta ctcaattact gaggtaaatc ctaggcagca 6601 tgatcatttc ttttttttct ttttatttat tttgagacag gatctgtctc tgtcacccag 6661 gctggagtgt agtggcatat ctctgctcac tgcagcctct 
gcctcccggg cagaagccat 6721 cctcccacct cagcctacat agtagctggg accacaggca cacaccacca cacactgcta 6781 atgttttgta ttttttgtag agactgggtt ttaccatgtt gatcaggctg gtctcaaact 6841 cctaggctca agcaatcctc ccacctcggc ctcccaaagt gctagaatta caggcgcgag 6901 ccactgcacc cagcgaagaa cactttttaa aaaataaata ggccgggcgc ggtggctcac 6961 acctgtaatc ccagtacttt gggagcccaa ggagggcgaa tcatgaggtc aagagattga 7021 gaccatccta agtaacatgg tgaaacccca tttctactac aaatacaaaa acaaaattag 7081 cctggcgtgg tggcaggcgc ctgtagtccc agctacttgg gagctgaggc aggagaatgg 7141 agtgaacccg ggaggcggag cttgcagtga gctgagatca tgccactgca ctcccccctg 7201 gggcaacaga gtgagactcc caaaaaaaaa aaaaaaagcc ccccctcccc acacacaata 7261 atataaataa ataaataacc acaatactat tatcacatct tacaaactca acaaaaattt 7321 cttaatatca tcaaataccc agtttgtgtt caaattttcc tgattgtttc ataaatatac 7381 tcttacagtt ggtttctttt agcgagattc aaatgagacc cacctgttga cctttgccct 7441 tagggtttcc cagggtctga attttgttga cgacattccc atgttgctat gtaatacggt 7501 cctccatgcc ctgtgttttt ctgtaaactg atagatgtgg aggtgcaatg acatttgtgt 7561 ttgatttact ttggcaaata tagttcatca gtgatactct atacttcttg ttgctttaca 7621 tccggaggct gataatgtct gcttttctct cttttctaat tatttgtgaa aggaaaaatg 7681 tggggggttg ggagaaaaaa acccttaagt acatactcgc taaatcacat tgctacaggt 7741 aacttccatt aagaacttga aagtaaaggt agctgcattt tcccctaggg aacacaatga 7801 tagacaggag ccttagtcta cagcttgaag gattgtaatt atacctaagc aaccctcctg 7861 gaccagttta atgttattag ctgtgatgta tccctacctt tgatgtcatt atccttactt 7921 agctccctta aagcagagat caagatgaaa agggcttcag ctgcagcatg gcacatggag 7981 attagagtgg ggcttttgga tgctgaggag cagacctaga atgggaaata gatgggagcc 8041 acagaagtga aggtccccct ccctcattgc tcaacctact ccacatctcc aggtctgcac 8101 atctgttcag ttactgaatc ctgtgtaagc taccttcttt ttcttttttc ttttatttat 8161 ttatttattt tttttttgag atggagtttt gctcttgtta cccaggctgg agtgcaatgg 8221 tgcaatctcg gctcactgca ccctccaact cccaggttca tgcaattctc ctccctcagc 8281 cttccaagta gctgggatta caggctgcac caccatgtct ggctaatttt tgaaaaatca 8341 gtagagagag ggtttcacca tgttggccaa gccggtctcg aactcctgac 
ctcaagtgat 8401 ccacccacct tggcctccca aaatgctggg attacaggtg tgagccacca tgcccgctgt 8461 aaactacctt cttaaaagct ctagaagagg gcttttaacc ttttgttgtg tgtcatgcac 8521 cttccgcaag ctgatgaagt tgatagaccc atctcagaat tttttttttt tttttgagac 8581 agtgtctcac tctgtcaccc aggattggtt gcagtggcac gatcatgggt cattgcagcc 8641 tccacctccc aggctcaagt gatcctcctg actcagcctc ttgaatagct gagaccacag 8701 gcttgtgtca ccatgcccag gtaattttta attttttttc gtagaggcag ggtctcacat 8761 tatgttgccc agtctggcct cgagaactcc tgggctcaag caatcttcct gccttgggct 8821 cccaaagtgg tgggattaca ggggagagcc accacaccta gccaggagga tgttttaaat 8881 acaccaaata aaacatttat acccaaatac agttatccaa atattaaatt aacaagagtt 8941 agggtgaccc tattaattag tgtaatttcc aaatagtaat gaacataagt gatagtttga 9001 gatttctgtg acttttctaa tgtgacgtga aaatatttgt gatttttctt tttctttttt 9061 ttttttgaga tggagtttcg ctcttgttgc ccaggctgga gtgcaatggc aagatctcgg 9121 ctcacctcaa cctccgcctc ctgggttcaa gcgattctcc tgcctcagcc tcttgagtag 9181 ctgggattac aggactgtgc caccacgtcc agctaatttt gtatttttag tagaaacagg 9241 gtttctccat gttggtcagg ctggtcttga actcccaacc tcaggcgatc cgcccgcctc 9301 ggcctcccaa agtgctggga ttacaggtgt gagccaccgc acctggccaa tatttgtgat 9361 ttttattgac gacaaagtca aaggttctct tcatattatt gtggtgtatc gcctacaagc 9421 ataattaaaa taaacactaa atttcagttt aaagtttact gaaaataaat atgtattttt 9481 tattccctat ttaagctttg aatcccctga cttcctatac cattaccact gtcctagttc 9541 aggttcatgt tgttttttac tttaattgtt atcacagtct cttaacattt ctccctatgt 9601 tctccagtcc tgtaggtgct aaatctgacg tggtcacttc tcagcttgga atccttcagt 9661 gcaccaccac agccttgaac tacatatttg aaatacatat ttattttcag taaactttaa 9721 actgaaattt agtgtttatt ttaattatgc ttgtaggcga tacaccacaa taatatgaag 9781 agaacctttg actttgtcgt caataaaaag tcccttgagg ggacttcaga tgtaagtccc 9841 ttagctgctc gttaaaactc ccccaggctg acccaataca caatcttgac tttaaaccac 9901 ttgtcattct aaatcactag catttcctgg aaaaaaaagc catttttcct tcagggctaa 9961 gctcagggac caattctgtg tcaccttctt tgaatcctga tgatattcac ttctttattt 10021 gacctgattt attgggcccc agacaccatg ctgagtgttg gggattcagc tctggacaat 
10081 gtcaaatgtc agtcctgcct ttcagatcct ttctactggg tgagccctgg agtgctggtt 10141 ctcctcgcgg tgctgcctgt gctcctcctg cagatcactc ttggcctcgt cttcctctgc 10201 ctgcagtaca gactgagagg tacagggcag agggtgggtg gatcaggatc ctttctttaa 10261 atgagctggc ttcttggagc tacaccactt aacatgtatt tgtgagtgac ttctgggttc 10321 agaagttctt ctcactattg agtgataaag aaaaaaaata actccatgat gaaagagttt 10381 tacatcttac ggaatgcttt catatgaata atcggaccta gcatttccct atgagctaac 10441 tatgccatat agtaacccca ttttacagag gatacaactg aggccaggag tagttcagtg 10501 acttactcaa accgatataa cttataagtg gtagagctga ggcctctgta tcatacctag 10561 cagctccatg caacttggga gagtgtgagc ttcgaagtca gacaggtcta ggctattagg 10621 agttttgaat aaagatactg aagtgaaagt ctctaccaca cagtaggcgt tcgaaaattg 10681 tttcctcttt ctccattcaa cactgaggac tcaggttcag ctgctgatga agctcctctt 10741 ttttgcctag agctttcatt ctgagccttc tcctcctacc aagtgtctcc ccaatgccag 10801 agcaggaaga gtcttcactc ctcccaatgc cccacctccc atttgttact aagaggagag 10861 gagaaagtag caaggagggt atggggaatg ttctggggga atgggtgttg gtgcgatcaa 10921 caacaaagtc ctttctctca ccttgaattc atcccagatg cctgcttgtt tacttcttcc 10981 acacaaaaaa aggccttcag ccctcatggc tgagcagaaa gaatctgaat gttagagtca 11041 ggcagcctgg gtttgaattc catctcaggt actgaactct atagcaaaat tcttagattc 11101 tccaagcttc agttgccttg tctgtcaaat agagaaaaca tccttcgtcc taaattgtag 11161 ggaggattaa agtcatgcaa agtgcctact acaaatccag tcacaaagta gctagctact 11221 cactaaatgt tcagctcctc cctcctcatt cagatgggaa gtggctttag ataaacaaag 11281 tggcaacgca gtgggctgga gcagctctgt gaactgagaa tccaagaaaa ggggcgaaga 11341 gcagctggga tgtattggat gcttgtgctg gcttggagca ttgctcacat tctttattcg 11401 ctattgtatc tagactatag ctagagaaag agccgcaacc attggcttta aatccagtgc 11461 tcttcctact ctcctgaggt tgtttccagg ctgcagagaa atagcctgca caaggggccc 11521 aggcgctggg tgtgggaggg tccccaccga gagccagaac atgcaggaac taaaatgttg 11581 cctttttcta ttttaggaaa acttcgagca gagataggtg agttccagtc atcgtttctc 11641 ccaattcttg ccttttggtt ttttggcata acggaaatgg tcccattctt ggaccgtctc 11701 tccctctcaa taccctgttt tcccctcagt ttccctttct 
ctacagtggg tgtgtcgtgc 11761 ctagaacaag ttttaagtaa ttaaataaca aagactcagg ataaaaggat cctttttgga 11821 gtgccctact aaatccattt ccatttgttt ctctttcaga gaatctccac cggacttttg 11881 gtaagttccg gcatgtctag gccctcccag gtcaacttgg tatttcactc tagttccagt 11941 cacctggggg aacaaggacc cctggctcct ggttgagtcc cttcctctct tctcttttct 12001 ttctttaaat aagaagtcat ttgcatttag gattggtaaa atcataataa aaatactcat 12061 gtactgtttt tatgtgccag gcactattct aactacttta caaaaacgtt atcttattct 12121 gtttaactcc ttatgcacat gatctctctt ttcaggaatg ccaaaacaga ggtaaataga 12181 tcgtttacac gtaaacctga tgtctggttg gggaggtgaa acaaacagaa acaagacaca 12241 actgtatcac ctgtacttat atttctgctt tacaaactca ggatgtttcc atgagtacag 12301 aacatgacta atcagagaag acctcataga ggaatagaaa agccaccaag ccccactagg 12361 aattgacccc tcaaggacat ggtttctagc ctttttgttc actgcagatt gcccaatgcc 12421 taaagataat ggcaacagaa gagcacccaa atatttgtta gataaatgtt gcagacacta 12481 gaaggtgtca ttagggcaca gatggtacct tctctgagca aacttccttc acagctcctc 12541 ctcccgaggc tgtaggtgac tctactcttg tcacctggca cacagagttc tatcgtacga 12601 tttaggaaat tagaccagtg tgtggaccac acacacacac atctttacac acccaaagag 12661 gaggaatagt atctttgttt tggaggactt gactatgaaa ggtcttaact cctttttgta 12721 ccatgaatct ctctggcact ccagtgaagt ctaaaggacc cctttgcaga atgtttttaa 12781 atatacacat aaaatagaac acataggatt gcaaaaacaa tcattgtact aaaatacagt 12841 tatcaaccga taatcacatt tgtgatatag taacataaat gtttcttttt tttttttttg 12901 gaggcagagt ttggctcttg tcacccaggc tggagtgcaa tggcgcgatc taggctcact 12961 gaaacctctg cctcccgggt tcaagcgatt ctcagcctcc tgagtagctg ggattacagg 13021 tgcccgccac cacacccagc taatttttgt atttttagta gagactaggt ttcaccaggt 13081 tggccaggct ggcctcgaac tcctgacctc aggtgatcca cctgccttgg cctcccaaag 13141 tgctgggatt acgggcatga gccaccgtgc ccggccataa atatttcttt agccaaagta 13201 atacattaag taatgtagca gcaagtctaa taacctgtaa tttctttctt tctttctttc 13261 tttctttttt tttgagatga agtttttttg agatggagtg caatggcaca atctcggctc 13321 actgcaacct ccacctcctg ggttcaagcg attctcctgc ctcagcctcc caagttgctg 13381 gaactacagg cgcatgccac 
catgcccagc taatttttgt atttttagta gagacggggt 13441 ttcaccatgt tggccaggct ggtcttgaac ccctgacctc aggtgatctg cctgccttgg 13501 ccttccaaag tgctgggatt acaggcatga gccaccaggc ccagcccaat aacctttaat 13561 ttcaacatac taataaacat aaacagtatt tcaagatttc tgcaataact ctaatgggaa 13621 tgaaaacatc tgtggcttcc attggtaatt aagtcacagg tactgctcat attgtggtta 13681 gttgtaaaat gttttggttt gttttgtttt ttccaagact tgggggaatg ggtgttggtg 13741 ggatcaacaa gagtcttgct ctgtggccca ggctggagtg caggggcagg atcttggctc 13801 actgcaacct ccgcctccca ggttcaagcg attctcctgc ctcagcctcc tgagtagctg 13861 gcattacagg catgtgccac cacgcccagc taatttttac atttttagta gagatggggt 13921 ttcaccatgt tggcctggct ggtcttgaac tcttggcctc atgatccacc cgtctcggac 13981 tcccagagtg ttgggattac aggcatgagc caccacacct ggcagttgtt acatttttaa 14041 tgaaagaaaa tgttaaatcc agttattgaa aataaggagg cagtactttt ctcatccaag 14101 ttcatggact ttctgaattt tgtccccaga gtcctttggt gttctaggac cccaggttaa 14161 ggaacccaaa aagacaggtg ggtggggcat gagggggaac acatgttaat ccctgtttgt 14221 tctggtgaac aattcagatc cccactttct gagggtgccc tgctggaaga taaccctgtt 14281 tgtaattgtg ccggttcttg gacccttggt tgccttgatc atctgctaca actggctaca 14341 tcgaagacta gcaggtgcag tggctgggca gcaggcaaga ccaccaaata gtgggggacc 14401 aagtcagctc tgaatgggaa gccaaaagag aatagaacca ggactcaaga ttaggggagc 14461 tgggatttcc ttattcctct gtccccatgc ccaaccccag gctcttctga gaaactgtga 14521 agagaaccac ttactggatc tgtgggatcc cccagtggaa agggcagtgt gggtcactcc 14581 aaatgtccat agggaggatg tggggaaggt gctattcatc ttccactaat cacatatttg 14641 tttctttttg ttttcagggc aattccttga agagctacgt aagttctctt ctctctgtta 14701 taagcagaga ataaaaagcc aggaaaggga gacagaagca acaagaggaa gaggcgggct 14761 attgagggat cacattccca gaggaaagga ggagctggag agcctgggtg gagggaagac 14821 tcctcctggg aggtagaggg caaagaagcc agctgttaga gacacattta caggtggcag 14881 agaagctgga ggcactccta tctgccacct gatccattcc tccttcactg cccctaagca 14941 ggaatccaac cctagctggt ctcattgccc attccacagc aactgcccag tgcctcacct 15001 ctcagatcaa ccattgaggc aggaatggag acaagatgac cccaagggct tttcttctcc 15061 
ctagttcaat ggttttatga tacaaactac tgacatacgt ttttcaagtt attttctcct 15121 tcttctagga aatcccttct gagtgatgtc acatcttggc aggggtggag gagagcctgg 15181 ttgcccaggg atttgtcctt ggggacatct catccatcaa gttgcacact cactggcatc 15241 tttgctatgg ggacattcca atttgcactt tcaggaacac tctgaattcc aagtagaatt 15301 gatttccctt cttctgtcat ctaccttttc tcttcatttt cccattttta ttacccttct 15361 ttccatttct ctctccagtc ttccacctgg aagccctctc tggctaagga caggcaggtg 15421 cccctctctc catcagagga cacctgtact ggagagcaac acaggatggt ctctgccatg 15481 aactggaggc caggaatctc ctcactgaaa attacagtat ggtaactttg caaatggtgg 15541 ttgtttcttc caagactcca gccctgattg cgcaaaactg aaaggcatgt gaagggaagg 15601 aagaggaaga gtgcaaaaca ttgaagagag agctgagtga gctgaagagt gaggatatga 15661 gtagccccaa cccaaacctg gagatgggga gaaacctaca gaatactagc cagagctcct 15721 ccttgtcttg gcagcctact agggacctgg ggaagcaaaa acgaaagctg ggcaacatgc 15781 ctgctttaga atgttttcct tctacttaca catcttccac aggtctcaga atctttcctt 15841 cctctcatcc ttttctccta tctacatatc tatcagagta tccactgttt attcaacaac 15901 tactacttga tggtcagaca caaacaaaca agctaggtgc taattaataa agatacgagt 15961 tttggccggg tgcggtggct cacgcctgta atcccagcac tttgggaggc cgaggcgggc 16021 gaatcacgag gtcaggagtt caagaccagc ctggccaaca tggtgaaacc ccatctctac 16081 taaaaataca aacaattaac tgagcatagt ggtgggcacc tataatacca gctactccgg 16141 aggctgaggc aggagaatcg cttgaaccca ggaggcagag gttgcagtga gctgagatcg 16201 cgccactgca ctctagccgg agtgacagag taagactctg tctcaaaaat aaataaataa 16261 ataaataaat aaataaataa ataaataaaa aataataata caagttttca taagcacact 16321 tctaacccct tgtcttttat gtatttcctt ccttatccac gcacctgtct ccctctactc 16381 cagcctcatt accccagagg tcagtcctca ggaaaactaa acacaaagaa agagctcagt 16441 cagaaaggcc atttatttat gtttcaagat gctcactgcc tcctttgttt tgtctccttt 16501 gcaggccttc tctcttaggc ctcttctcct gggggtatgg atcctggggg gagattgatc 16561 acctccatgc ttccattcct ccccagccat agtggggaca tcatgagaga agccaagcca 16621 ctggcccagg atcacccggc atttatggtg gctgctctgg cacaggtcct tgcctttata 16681 gcccctccag tgatccataa ggccctcttt ctccccaaag gagaggtcac 
agatagggca
    16741 aaggtagctc ttctgcttcc agtgggtctg ctggtgtctg accagcctgg aaaatgagct
    16801 gaaagacttg ctgcaatgga agcagtagtt gggcggctct gtgaggtggc ccttctggtg
    16861 tctggagaga taggatttct tgctaaaagt caaagaacaa tgggggcaac agaagacatt
    16921 gagtcttgag ggcttcactg gatgagagtt ggatctggca tcctgacaga gggttccagt
    16981 gatgggtgcc tgggtcctgg tcacaggtgc ttggttctta agtacagatg cctggttctg
    17041 ggccatagga ccctcagttc taaatatggg ttcctgggac ctggccactg gtgcatggtt
    17101 cacatccaaa agcccctgga tggacctctg gcttctggcg atgggtgtct ggaattcagc
    17161 ctgggtgcct ggaatcctca aagtacactc ctggtttcca tccactggct cctggttttg
    17221 gtgtatcttc tggtggcgtt tgagctcaga ctggtcccgg aagctcttcc cacacacaga
    17281 gcatgaatgg ggccggtaac ccagatggac gcggcggtga cgacttagtc cagaagcatc
    17341 acagtaggtc ttgtcacaga gcgtgcaaca gaagggcctc tccccaagat gcatgcgtct
    17401 gtgatagctg agggacttgg ggctccgaaa caacttccca cactgactgc agctgttagt
    17461 cagcttggga ttgtgaacaa actggtggct atagaggtag gagcgcctgc tgaaacattt
    17521 ggcacaggtg tagcaaaa
//
LOCUS       HSMTIXG      1858 bp    DNA             PRI       15-DEC-1994
DEFINITION  H.sapiens MT1X gene for metallothionein 1X.
ACCESSION   X65607
NID         g517350
KEYWORDS    metallothionein; metallothionein IX.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1858)
  AUTHORS   West,A.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-APR-1992) A.K. West, University of Tasmania,
            Biochemistry Department, University of Tasmania, G.P.O.Box 252C,
            Hobart 7001, AUSTRALIA
REFERENCE   2  (bases 1 to 1858)
  AUTHORS   Stennard,F.A., Holloway,A.F., Hamilton,J. and West,A.K.
  TITLE     Characterisation of six additional human metallothionein genes
  JOURNAL   Biochim. Biophys.
Acta 1218 (3), 357-365 (1994) MEDLINE 94325344 FEATURES Location/Qualifiers source 1..1858 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone="lambda 47" /chromosome="16" /map="16q13" TATA_signal 107..112 prim_transcript 137..>1715 exon 137..235 /number=1 mRNA join(137..235,831..896,1624..1715) CDS join(208..235,831..896,1624..1715) /codon_start=1 /product="metallothionein IX" /db_xref="PID:g517351" /db_xref="SWISS-PROT:P80297" /translation="MDPNCSCSPVGSCACAGSCKCKECKCTSCKKSCCSCCPVGCAKC AQGCICKGTSDKCSCCA" intron 236..830 /number=1 exon 831..896 /number=2 intron 897..1623 /number=2 exon 1624..1715 /number=3 polyA_signal 1843..1848 BASE COUNT 362 a 535 c 488 g 473 t ORIGIN 1 gcgctgggct cacggttgct gcacccggcc caggatacgc ggcgtgcaga ctcagcaggg 61 cggtgcaagg acaggcgggc ctctgcgccc ggccctcttc caggactata aagagagcgc 121 cggcttctgg ggctccacca cgcttttcat ctgtcccgct gcgtgttttc ctcttgatcg 181 ggaactcctg cttctccttg cctcgaaatg gaccccaact gctcctgctc gcctggtaag 241 ggacacctag tccgcgcctt gggatgcccg tttcccagcc acagtacaga ctcttcctgg 301 gtttgaagaa gtcgcattta aagttctgag ctgaaggggc tcctttattt cgttaggtgc 361 tttcttcccg atcacgtccc tgagaccact tctcgcctcc ctgtgcctct aagttagagt 421 tgagggtact gaggcccaag gctgtcctgc tccatgtcac ccagttggtc agggggctgc 481 tggctgagcc ccaatgctct gaccaggctc tggcagtcag ggtggatggg aagtgggggg 541 ccattgcctt ctcggagttc aggacagaag gttctggcct cctgtcttag ccttcctggg 601 ctgtgtctgg agcctgggac cttgcttgtg gggtaaaagc aacagaacac ttgcccttcc 661 caaaatgaag ggagaggaga tggggcttct cttcctctcc cctgagtggg aaaggagctc 721 tgggggctgg tcctgcagca cagaggaggg gtcactgaag cgttattgac cagctgctgt 781 accttctgca tctcactcca cgctcactgc ctttttgctc ttccttgcag ttggctcctg 841 tgcctgtgcc ggctcctgca aatgcaaaga gtgcaaatgc acctcctgca agaagagtga 901 gtgcagggcc ttccctcgga atctggggga tgggccaagt tagagcaggg aacccagagc 961 tctgcaggca ggggcaggcc aatgaccagc ttccccaaac ccctccttca acacctgatt 1021 cagaatcaga cctcaaattg ccttaaaaat gggtgagtcc cagcctctta ttaccaaact 1081 agaaactgag gcccagagag 
gttaccagat agtgttggga acaaagctgg aatgtgaacc 1141 taggtctcct gcctcctgat gcagccttct tcacccttct gggtcctgaa gcacttaggc 1201 ccaggatctg gaagaccccg ggtgatttca aacctaatga tccagtcctt tcctgcaggg 1261 gcagcccaga gcttccctag ccttccccag aactgctgtg tcagggattt gccccctgtc 1321 cggctgtgaa gactttcctc atttaaggca taggttttgg gaactggcct ccttttgttc 1381 ctgtaccccc aatcactacc tgtcccagtc ttctgtcctg tcccagactc aggtggggct 1441 gggcagactt tttcatataa aaccctcatc ccaaagatct accagttctc ttctgacaaa 1501 gccatgccat cctgaaatga tggtcctctg gggctggagg cagggctcga gcaggcttcc 1561 ttggggcagg gcaggtgcct gattgagtct gctctgacct ctcactctcc ccttcttctc 1621 caggctgctg ctcctgctgc cctgtcggct gtgccaagtg tgcccagggc tgcatctgca 1681 aagggacgtc agacaagtgc agctgctgtg cctgatgcca ggacagctgt gctctcagat 1741 gtaaatagag caacctatat aaacctggat tttttttttt ttttttttgt acaaccctga 1801 cccgtttgct acatcttttt ttctatgaaa tatgtgaatg gcaataaatt catctaga // LOCUS HSMTS1G 3002 bp DNA PRI 18-OCT-1995 DEFINITION H.sapiens mts1 gene. ACCESSION Z33457 NID g486654 KEYWORDS mts1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3002) AUTHORS Ambartsumian,N., Tarabykina,S., Grigorian,M., Tulchinsky,E., Hulgaard,E., Georgiev,G. and Lukanidin,E. TITLE Characterization of two splice variants of metastasis-associated human mts1 gene JOURNAL Gene 159 (1), 125-130 (1995) MEDLINE 95331645 REFERENCE 2 (bases 1 to 3002) AUTHORS Tulchinsky,E., Ford,H.L., Kramerov,D., Reshetnyak,E., Grigorian,M., Zain,S. and Lukanidin,E. TITLE Transcriptional analysis of the mts1 gene with specific reference to 5' flanking sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (19), 9146-9150 (1992) MEDLINE 93028421 REFERENCE 3 (bases 1 to 3002) AUTHORS Engelkamp,D., Schafer,B.W., Mattei,M.G., Erne,P. and Heizmann,C.W. 
TITLE Six S100 genes are clustered on human chromosome 1q21: identification of two genes coding for the two previously unreported calcium-binding proteins S100D and S100E JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (14), 6547-6551 (1993) MEDLINE 93342029 REFERENCE 4 (bases 1 to 3002) AUTHORS Tulchinsky,E. TITLE Direct Submission JOURNAL Submitted (11-MAY-1994) Tulchinsky E., The Danish Cancer Society, The Fibiger Institute, Molecular Cancer Biology, Strandboulevarden 49, opgang 7.1., Copenhagen, Denmark, 2100 Copenhagen FEATURES Location/Qualifiers source 1..3002 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda 44" /tissue_type="spleen" exon 664..721 /gene="mts1" /number=1 gene 664..2701 /gene="mts1" exon 953..1000 /gene="mts1" /function="alternative spliced" /product="exon 1b" exon 1659..1815 /gene="mts1" /number=2 CDS join(1675..1815,2537..2701) /gene="mts1" /codon_start=1 /db_xref="PID:g580320" /db_xref="SWISS-PROT:P26447" /translation="MACPLEKALDVMVSTFHKYSGKEGDKFKLNKSELKELLTRELPS FLGKRTDEAAFQKLMSNLDSNRDNEVDFQEYCVFLSCIAMMCNEFFEGFPDKQPRKK" exon 2537..2701 /gene="mts1" /number=3 BASE COUNT 646 a 756 c 852 g 748 t ORIGIN 1 gatttttgtt tctgaatctt tattttttta agagacaagg tcctctgtgt tgctcaggct 61 ggagagcagt ggcttgagca tagccaactg cagtctcgaa ctctgggctc aaatgatcct 121 ctgtctcagc ttcctgacta gctgggacta caggctacag catgcgtgcc cagctaatta 181 aaaaaaaaaa ttgtttttcc tttttataga gacagaagtc tctctatgtt gcctaggctg 241 gtcttgaact cctggcctca gcgatcctcc catctccccc ctagcttttg tgtcaccaca 301 tttccagggc aatctcccac ctgtcaccca ccaccccctg catctccttt cctaggtccc 361 catgggacta ctccctgtcc cccatgctcc aggcacaggc tgccccttcc tccacctctc 421 taaaattagg ctgagctatg tacatgggtg gtgcccatct catccagtcc cctgctagta 481 accgctaggg cttacccgtt accaggtgcg actgggaaca gaggttggtt caggctgggc 541 tggtggaggg tgctgtggca ttaccgcatc agcccacagc aggaaggcag tatccgctct 601 cccctgtccc ctgctatggg cagggcctgg ctggggtata aataggtcag acctctgggc 661 cgtccccatt cttcccctct ctacaaccct ctctcctcag cgcttcttct ttcttggttt 721 
ggtgagttgt gttggcctga ctggcatgca aggggtgtca gaggccaggg ctggggaagg 781 agaaggggag gctggtgggg gccagatgtg ctaaagagat ccagatgtga gattctgatg 841 tggaactctg ggtggattgt gtgcgtgggt gtgcatggca aacacacatg cacgtaagac 901 ggaggaaaaa acaaacagaa aagtgagcaa gtgactgaat ttgagctctc caggtgcttc 961 tgagatgtgg gcttgcacac gctgttgcta tagtacgtgt tggtatgtat gtgcctgtgg 1021 gtatctgcac tggctcatgt ttgctgggtt gcgcactcgg gagcaggaag caaaggaaag 1081 gcagaaggca actgtgggcc tttgtctggt ggtgtgcccc atgagcctct gccctgcacg 1141 cagcagccca gctcgagaag gtgcatggcc tctgcagctt ctcttccaac ccttgcctct 1201 gcacctcact tggcccctcc ccatgctgag agctaagcgg ctgtgcctgg ttttttccac 1261 tgcaggcccc tggcaggcct ccagcagcca cacccagttc tgggagggaa aagaatggca 1321 agggcggggc ctttgtggct gagctgtggg agtggataga ctgagtgagg ggtggaaaaa 1381 atggtgttgt tgagcaagcc tgggaggccc tgggagtttg ctgcccgaat ctccaagact 1441 tgcgcagcgg atcttgcaaa tgttcactgc ccagagcatg tgttccccac tgtgcacacc 1501 ctcccagcca ggtgcggggg cccactgctc tgggctcccc cagggaggga gcagagtctc 1561 gccaagtgct cctggaggga tgggagtgga gcctggcatt ctgaacacat ctctgagggg 1621 tgggattaat aagacggtct ctgtgcctcc tgctcccaga tcctgactgc tgtcatggcg 1681 tgccctctgg agaaggccct ggatgtgatg gtgtccacct tccacaagta ctcgggcaaa 1741 gagggtgaca agttcaagct caacaagtca gaactaaagg agctgctgac ccgggagctg 1801 cccagcttct tgggggtgag tgggtagtgc ctgagtgagt ccccacgtgg ggcatttccc 1861 acagaggagg gcagcagtct tgctctagag cattagctac agagggcatc tatcagtggg 1921 tggctgcctg gggtggaaac acattgaaca ccaccactca ctgcctggcc ccatgctgaa 1981 agagggctga gaatgaatgg gtcagacact gccaggtgct ttgcacaact taactgaagg 2041 gaagactaag ctcagagtgc taagtaactt cccaaggtgg tcagtgtaca caactgccat 2101 ccggaccggg actgtctgac tcttgccatc actccaacag tggacactgt ttgagtttct 2161 atttggcttg tagatgtgaa gacacagatg tggagatgat cacaggcctg cagacgttcc 2221 cttcaaacaa taacaatgta tatttgtatc aaacataaca cagtttatat attgttttca 2281 tgactattac tacctcatgg gattattaga acaaccttgg tgaaaatgta tgtcccgtca 2341 tttttccatt gcacaggtac tcagacttcc ttatccaaga gaccttctcc accctagctt 2401 agccttgagg 
gttggagttc caaactggac ctcccaaagg agcctccctg aactctggtc 2461 tgggagtaga aactgggtct gtcctgctcc acccactggg cttctgtttt ctatctgtag 2521 cctcttctcc ctccagaaaa ggacagatga agctgctttc cagaagctga tgagcaactt 2581 ggacagcaac agggacaacg aggtggactt ccaagagtac tgtgtcttcc tgtcctgcat 2641 cgccatgatg tgtaacgaat tctttgaagg cttcccagat aagcagccca ggaagaaatg 2701 aaaactcctc tgatgtggtt ggggggtctg ccagctgggg ccctccctgt cgccagtggg 2761 cacttttttt tttccaccct ggttccttca gacacgtgct tgatgctgag aagttcaata 2821 aagattcttg gaagttttga ggctgatggt ttgagagact ctgggggcgt gggttgggga 2881 ctgagggata tgttgtgggg tggtggtggg gagagctggg agttgagctg aagttttatg 2941 gacagcagac cagtgaagtt aggggaggga tggcagggtg actacagtgt gtgtgcaagg 3001 ca // LOCUS HSMYBPC3 20662 bp DNA PRI 14-MAY-1997 DEFINITION H.sapiens mybpc3 gene. ACCESSION Y10129 NID g2058321 KEYWORDS MYBPC3 gene; myosin binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 20662) AUTHORS Carrier,L., Bonne,G., Bahrend,E., Yu,B., Richard,P., Niel,F., Hainque,B., Cruaud,C., Gary,F., Labeit,S., Bouhour,J.B., Dubourg,O., Desnos,M., Hagege,AA., Trent,R.J., Komajda,M. and Schwartz,K. TITLE Organization and sequence of human cardiac myosin binding protein C gene (MYBPC3) and identification of mutations predicted to produce truncated proteins in familial hypertrophic cardiomyopathy JOURNAL Circ. Res. 80 (3), 427-434 (1997) MEDLINE 97200835 REFERENCE 2 (bases 1 to 20662) AUTHORS Carrier,L. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) L. 
Carrier, INSERM UR153, Institut de Myologie, Hopital Pitie-Salpetriere, 47 Boulevard de l'Hopital, Paris, 75013, FRANCE FEATURES Location/Qualifiers source 1..20662 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /chromosome="11" /map="p11.2" exon 1..58 /number=1 mRNA join(1..58,1174..1440,2059..2172,2561..2659,2750..2898, 4178..4295,4812..4860,5036..5065,5237..5290,5687..5689, 6072..6089,6346..6509,8199..8331,8561..8563,8678..8802, 8888..8993,9079..9245,9667..9832,10579..10685, 10791..10820,12032..12171,12418..12498,13137..13296, 14021..14125,14236..14424,15668..15802,16470..16637, 17668..17756,17926..18121,18346..18485,18706..18865, 18977..19113,19421..19607,19797..19833,19961..20662) gene 34..19807 /gene="MYBPC3" CDS join(34..58,1174..1440,2059..2172,2561..2659,2750..2898, 4178..4295,4812..4860,5036..5065,5237..5290,5687..5689, 6072..6089,6346..6509,8199..8331,8561..8563,8678..8802, 8888..8993,9079..9245,9667..9832,10579..10685, 10791..10820,12032..12171,12418..12498,13137..13296, 14021..14125,14236..14424,15668..15802,16470..16637, 17668..17756,17926..18121,18346..18485,18706..18865, 18977..19113,19421..19607,19797..19807) /gene="MYBPC3" /codon_start=1 /product="myosin binding protein C gene" /db_xref="PID:e314013" /db_xref="PID:g2058322" /translation="MPEPGKKPVSAFSKKPRSVEVAAGSPAVFEAETERAGVKVRWQR GGSDISASNKYGLATEGTRHTLTVREVGPADQGSYAVIAGSSKVKFDLKVIEAEKAEP MLAPAPAPAEATGAPGEAPAPAAELGESAPSPKGSSSAALNGPTPGAPDDPIGLFVMR PQDGEVTVGGSITFSARVAGASLLKPPVVKWFKGKWVDLSSKVGQHLQLHDSYDRASK VYLFELHITDAQPAFTGSYRCEVSTKDKFECSNFNLTVHEAMGTGDLDLLSAFRRTSL AGGGRRISDSHEDTGILDFSSLLKKRDSFRTPRDSKLEAPAEEDVWEILRQAPPSEYE RIAFQYGVTDLRGMLKRLKGMRRDEKKSTAFQKKLEPAYQVSKGHKIRLTVELADHDA EVKWLKNGQEIQMSGSKYIFESIGAKRTLTISQCSLADDAAYQCVVGGEKCSTELFVK EPPVLITRPLEDQLVMVGQRVEFECEVSEEGAQVKWLKDGVELTREETFKYRFKKDGQ RHHLIINEAMLEDAGHYALCTSGGQALAELIVQEKKLEVYQSIADLMVGAKDQAVFKC EVSDENVRGVWLKNGKELVPDSRIKVSHIGRVHKLTIDDVTPADEADYSFVPEGFACN LSAKLHFMEVKIDFVPRQEPPKIHLDCPGRIPDTIVVVAGNKLRLDVPISGDPAPTVI 
WQKAITQGNKAPARPAPDAPEDTGDSDEWVFDKKLLCETEGRVRVETTKDRSIFTVEG AEKEDEGVYTVTVKNPVGEDQVNLTVKVIDVPDAPAAPKISNVGEDSCTVQWEPPAYD GGQPILGYILERKKKKSYRWMQLNFDLIQELSHEARRMIEGVVYEMRVYAVNAIGMSR PSPASQPFMPIGPPSEPTHLAVEDVSDTTVSLKWRPPERVGAGGLDGYSVEYCPEGCS EWVAALQGLTEHTSILVKDLPTGARLLFRVRAHNMAGPGAPVTTTEPVTVQEILQRPR LQLPRHLRQTIQKKVGEPVNLLIPFQGKPRPQVTWTKEGQPLAGEEVSIRNSPTDTIL FIRAARRVHSGTYQVTVRIENMEDKATLVLQVVDKPSPPQDLRVTDAWGLNVALEWKP PQDVGNTELWGYTVQKADKKTMEWFTVLEHYRRTHCVVPELIIGNGYYFRVFSQNMVG FSDRAATTKEPVFIPRPGITYEPPNYKALDFSEAPSFTQPLVNRSVIAGYTAMLCCAV RGSPKPKISWFKNGLDLGEDARFRMFSKQGVLTLEIRKPCPFDGGIYVCRATNLQGEA RCECRLEVRVPQ" exon 1174..1440 /gene="MYBPC3" /number=2 exon 2059..2172 /gene="MYBPC3" /number=3 exon 2561..2659 /gene="MYBPC3" /number=4 exon 2750..2898 /gene="MYBPC3" /number=5 exon 4178..4295 /gene="MYBPC3" /number=6 exon 4812..4860 /gene="MYBPC3" /number=7 exon 5036..5065 /gene="MYBPC3" /number=8 exon 5237..5290 /gene="MYBPC3" /number=9 exon 5687..5689 /gene="MYBPC3" /number=10 exon 6072..6089 /gene="MYBPC3" /number=11 exon 6346..6509 /gene="MYBPC3" /number=12 exon 8199..8331 /gene="MYBPC3" /number=13 exon 8561..8563 /gene="MYBPC3" /number=14 exon 8678..8802 /gene="MYBPC3" /number=15 exon 8888..8993 /gene="MYBPC3" /number=16 exon 9079..9245 /gene="MYBPC3" /number=17 exon 9667..9832 /gene="MYBPC3" /number=18 exon 10579..10685 /gene="MYBPC3" /number=19 exon 10791..10820 /gene="MYBPC3" /number=20 exon 12032..12171 /gene="MYBPC3" /number=21 exon 12418..12498 /gene="MYBPC3" /number=22 exon 13137..13296 /gene="MYBPC3" /number=23 exon 14021..14125 /gene="MYBPC3" /number=24 exon 14236..14424 /gene="MYBPC3" /number=25 exon 15668..15802 /gene="MYBPC3" /number=26 exon 16470..16637 /gene="MYBPC3" /number=27 exon 17668..17756 /gene="MYBPC3" /number=28 exon 17926..18121 /gene="MYBPC3" /number=29 exon 18346..18485 /gene="MYBPC3" /number=30 exon 18706..18865 /gene="MYBPC3" /number=31 exon 18977..19113 /gene="MYBPC3" /number=32 exon 19421..19607 /gene="MYBPC3" /number=33 exon 19797..19833 /number=34 
exon 19961..20662 /number=35 BASE COUNT 4529 a 5783 c 6125 g 4214 t 11 others ORIGIN 1 catggtgagt agcctggtgt gacgtctctc aggatgcctg agccggggaa gaagccaggt 61 agctttagga ctggggttgg gtctaagtgt gggagcatgg gggtgtctac aattggggag 121 caggctagga ggagttcttg agggggttga cgaggacgtg gcctctgagc cccaggacag 181 gggcagccag tcctccaggg tccttttttg attctcggtt tctttttccc atccctctga 241 tgagaggtga aggtaaaccc aaggtcaggg acgcagagag gggacacagc caagacagga 301 aagcacccgg ggtcaaccca ttctgagaat gcaaagggac cccttccctg tcccattgcc 361 ttcaccttcc acagcaaggt tccccgcgga cagaacccac gtttagtatg gagctgagcc 421 aggctcacgg catgccttca gaccagtctc tacccactat gggtctcagt ctcccaacct 481 ggggtgccgg gggtggtggt ggcattgggt tggttcctcc tggctgtgac attatctgct 541 tcatgagcct gtacggtgtg gcccccatgt tgagggtgac acagcccctc ttctctgaga 601 gaggacagtc aggagcccct gaacccaggc caagggcagg gaaaatcaca gcccagggac 661 ctgggcctgg gctgtgccct tatgaaggtt gtccagggca gagatccagg tcccatcccc 721 cagcgccctc ccaggccagc tttatctcag ttgctgatac ccagcctgag gtccttggag 781 gacaaatttg gccacattcc aagtaggcag acttccccgc cccctgccaa gggcccgaaa 841 tggttggggg aaagggagct gggttcatca cgggtgtctc caagcaggga aagtggacac 901 atgcagaaaa agtcttgttg ggaggtcagg accacagccc tggctctccc gactgctagc 961 tgtgtgatct tggaggagcc gcttagcttc tctgagcctc agtgtcctcc tctgagaaat 1021 gaacagaata actgcaacta cctcagagaa ctggtagagt attacgtgag gggatgcata 1081 gaaagtgcta gcacagtatt tactgagagg ggctgcaggg gcggggtgca cgctccaacc 1141 aggggccgct gtagcctcca cctggccctt cagtctcagc ctttagcaag aagccacggt 1201 cagtggaagt ggccgcaggc agccctgccg tgttcgaggc cgagacagag cgggcaggag 1261 tgaaggtgcg ctggcagcgc ggaggcagtg acatcagcgc cagcaacaag tacggcctgg 1321 ccacagaggg cacacggcat acgctgacag tgcgggaagt gggccctgcc gaccagggat 1381 cttacgcagt cattgctggc tcctccaagg tcaagttcga cctcaaggtc atagaggcag 1441 gtaagatcct gatcagcctt ccctgagttt gggctgctgg gggagggcag cccagcgact 1501 ctccatccat ccagggaaca ggaggtgctt tcacactttc ttgcctttgc tgtggctgtg 1561 ccccccgcca ggaacgccct ttcccccttt tctgcacatg acctctctga tacccccatg 1621 tgccatcctg 
gcttgttcca gccatggggt agtgggagag atgaacaaga cacaacccca 1681 ctctccccct gcagacctcc ctggggagat gacctgcaca gacggccggg aagccccagg 1741 agctgtggat gaggggagtg cagcccagtc tgaggggctg ctctaaggct ggggcagagc 1801 agggatggac ctggagctgt tcagggcaga tgcagaggac atgggaagtg atggcctggg 1861 acggggagga gaatgtgaga ggcaggctgg ctctccagct cagcatgagc tgcctctgag 1921 gggcccaggc agcgggagga cagccatggc agactttcct catccacagc gggctcatgg 1981 gtccaatgct caggagagag agggtggagg ggtctctggt tagtggctgc accacgactc 2041 ccatctgatc ccctccagag aaggcagagc ccatgctggc ccctgcccct gcccctgctg 2101 aggccactgg agcccctgga gaagccccgg ccccagccgc tgagctggga gaaagtgccc 2161 caagtcccaa aggtgagagg ggctgggcag ggaggggcag gatccgtggg tagggcgtgt 2221 ccagggcagg tctcaaaagc ctttgctggg ccaggtgcgg tggctcaggc ctataatccc 2281 cagtactttg ggaggccgag gtgggcggat catgaggtca ggagattgag accatcctgg 2341 ctaacatgat gaaaccccat ctctactaaa aatacaaaaa attagccggg cgtggtggcg 2401 ggcacctgcg gtcccagcta acttgggagg ctgaggcagg agaatggtgt gaacctggga 2461 ggcggagctt gcagtgagcc gagattgtgc ctctgcactc cagcctgggt gacagagcaa 2521 gactccatct caaacaaaca gaaaaagcct ttgctcacag ggtcaagctc agcagctctc 2581 aatggtccta cccctggagc ccccgatgac cccattggcc tcttcgtgat gcggccacag 2641 gatggcgagg tgaccgtggg tgagtgtgag ctgctgtgcc cagcattggg gtgggaaggg 2701 ggggcagcag gacactcccc aagccgggcc tgtcgccctg cctttgcagg tggcagcatc 2761 accttctcag cccgcgtggc cggcgccagc ctcctgaagc cgcctgtggt caagtggttc 2821 aagggcaaat gggtggacct gagcagcaag gtgggccagc acctgcagct gcacgacagc 2881 tacgaccgcg ccagcaaggt gggccagcac ctgcagctgc acgacagcta cgaccgcgcc 2941 agcaaggtgg gtccaccggg gtcgtggaga cacggagaga ggggacatgg agagccctag 3001 aaggcacaag gcacacagag ggcaccatga gacccagagg cacacacgag ctgcaagcaa 3061 ggggtgtggg gaatctggga ggcaggtgga ggggccaggg gcccacaggg agatcccgga 3121 ggcctggggg ccagggagtg catgaggctc atgggagcac tgtcggggac ctgggagact 3181 tggtggacat gatcacctca agggggcacc taataaggtt tggccggggg tgggggggcg 3241 cccagggtgg tcaggaagcc tggaggacac tgtgaagcat gggggcacca gctagtgggg 3301 ttgtaaagcc ccgggatggg 
tgagtatcag ggctaagggc tcaagggact ctcagggatt 3361 aaccagaaca attcctcatt atacagacac ggaaaccaaa gcttgagagg ttaagtggct 3421 gctccaaggt cacttagaca taggcctaga accttaagcc ttcgtttttt gtgttttgtt 3481 tttttccgcc ctgcccacca gaaccttaag gcttaagtcc ccttgtttgt tcactccaag 3541 cctctgcttt tcccaacctt ccccagtcct gggctaggca agggggcctc tagcaagaga 3601 ggccaacatg ttctgctgtg cggggctgtg cattgcacaa aagcctgtat ttcaccacaa 3661 ccgtgtgcca gcagatggca atcatgggtt tcgagatagt cccctttctc tgtacaaaaa 3721 ctctgtatgg gttaactaaa gcctgagggt gcgaagatcg atgagcattt gtcactccca 3781 gcctccttta aatatatata tatatatata atttgagaca gggtctcgct ctgtcgctca 3841 ggctggagtg cagtggctca atctcggctc actgcaacct ccacctcctg ggttcaagcg 3901 attctcccac ctcagcctcc cgagtagctg aaactagagg tgtgcaccac catacttggc 3961 taattttcgt tatttttagt agagacggaa tttcaccacg ttggccaggc tggtctggag 4021 ctcctggtct tatgtgatcc gcccgcctca gcctcccaaa gtggggatta caggcctgag 4081 ccaccgcgcc cggccactcc cagtctcctt taagggtgcg gagccttgtc tcccggcccc 4141 tggtgtcccc tgacgccccg tccctccatg cacacaggtc tatctgttcg agctgcacat 4201 caccgatgcc cagcctgcct tcactggcag ctaccgctgt gaggtgtcca ccaaggacaa 4261 atttgaatgc tccaacttca atctcactgt ccacggtgag ggggccctgg tgtctgtcct 4321 gggctcgggc tccccatggg tcctggtctc ctacctcctt ttcccaacac taaggaggat 4381 gcctcgtccc atccagacat gagtgctggc cacgtgccca gtgctgcaca cacagggtgt 4441 gagagaaacc ccaaggcttg cagggtaggc gtgggggctt agggctgaat ccgggtctca 4501 tctgggtggg actctgtttt atcatcttgg tgtcacggct cctggcccac ggcctggcac 4561 tgtctgaatg tttgggcgat gggtgaaggt gagttgagag atgggtggga attggggaca 4621 ataaaccgtc ccacctgctg gtaccttgac cctgggaaag gctgggaagg tgagatcctg 4681 gggcagatgc ccagcctcat ggggtcatga atgggcaagt ctgtgaatac tcagaggccg 4741 cccccagggc agggcttctc aaacggcccc tctgaagccc cttcccccat ctctccaccc 4801 tttgaaccca gaggccatgg gcaccggaga cctggacctc ctatcagcct tccgccgcac 4861 gtgagtggcc atcctcaggg cctgggggag ccagtgctgg agtctgaggc ccttcagggt 4921 ctcgactggg ggtcagggct ggggatgatt tgcggggcgg aactgaagtc agtgggggct 4981 gcaggggaga gcaagcttcc caggcaggtg 
aggatactga gtctaacccc cacaggagcc 5041 tggctggagg tggtcggcgg atcaggtacc cctgccccag gccctacctg caccatccgc 5101 agacccccag gggtctgaga cgggtgggag cccaatctgg ctagtgtccc tttctccctc 5161 cccactccac tcccatcctg ctcctaatcc ctttccagtc cctcactgca gcctcactgg 5221 gggtccctct ccatagtgat agccatgagg acactgggat tctggacttc agctcactgc 5281 tgaaaaagag gtgagtcctg ggtggcagtt cctggggcat ctggtgaggc ctggctccaa 5341 atcccagcac tgtggtttgc cagctgtgtg accctgagca caacactgca cctctctgag 5401 ttggtttcct tgtctgtcga ggggatgatc ccagtgtgga aggagttaaa gggctctgca 5461 gacattatgt cccgtcaaca gtcatcctca cagtgtcccc cacggccact ccacacttga 5521 ggctgcaaaa gctccctccg taaactccgg ggacccagaa ggaacagaga gagggtcccc 5581 agagctgcag ggtctaccag gtcggcccaa ctgacttatg ttctctctgc cctctctctc 5641 cctctccccc ggcccctcca ttatggctgc tgctgctgtg gcccagagag taagaatcgg 5701 ggtctgggtg tttgggggcg gcagggcgct ggtgggtgca gaggggattc ccgtcccctc 5761 tctgagaggc tgtggtgaga gactcagagc agggtgcggc tccccacgga caggggtgag 5821 tctctgtgcc ttgggcttct cctgccaccc acctatctgt cagagcttag gagggataag 5881 aaaaagtaat gcacacagga ggtcctgcca gagcctccgg agcagcatgg gtgccccaca 5941 acacagggag tgcaagtgcg gaaaataggg aggaagtaaa ggcccatggg tggggtccag 6001 gtctttgagg gaaggtggcc atacctctca tgtgccacct accctttctc ccacccccac 6061 ccctggagca gcagtttccg gaccccgagg tgagtgccca acatagcaca gcaccctcat 6121 cacccctaat tctgccagga gcccctccag atccccagcc cctcttcagc tcccttggcc 6181 cccagtcctg cctggcctgg gacccagagc agctccagct gccccaccag aagggaaggg 6241 gcagagggac aggggtggct acagctcctt ggtcctgggc ccaggaagcc ccgcccaggg 6301 ggctgcagtc ttgcccccgg ccacagccta gactgcggga cacagggact cgaagctgga 6361 ggcaccagca gaggaggacg tgtgggagat cctacggcag gcacccccat ctgagtacga 6421 gcgcatcgcc ttccagtacg gcgtcactga cctgcgcggc atgctaaaga ggctcaaggg 6481 catgaggcgc gatgagaaga agagcacagg ttagcccttc ctcagaggga gaggaagagg 6541 gcacaggcta gcccttccct acacaggaga ggagagggca taggttagcc cttccccaca 6601 ggggagagga gagggcacag gttagccccc cacctcgtcc acatacatgg aaagagcgga 6661 gtccggcttt agcacagaga ctcacaggaa acaagagccc 
aggttaaccc ctccttgtgt 6721 agaaaaagcc aagaatgccc aggttagccc cttaccacag agatagaaga gaagagcaaa 6781 agttaggcca ggacagccac aaggaaaagg acagagctca ggttaaccct cctcaccaag 6841 gaaaaatcag gctacacttt ctttctctac cctcttctga aaagaaatga agcccggtta 6901 accctctcca cacccaaaag aaaagggaag aggcccgcta agcctggaga gccacacaca 6961 ggcaaagaac agagcctggc taatctttct ctcaggcaaa caaagcagga tggcttgggt 7021 taactaccct cacccccagg cagatggaaa gaggagagcc caggaatagc ccttctccac 7081 tgaggggttg acagggcggg tgtgaccggg aaagcagggg tctcacatgc cttctcccca 7141 ccatgtctgg gtctagcaca gggcctggca cacacctcga gcgctctata aacaccgact 7201 gaatggaggg gtcagcttcc atgcaggata tgggagacaa gcccctccca aagcacacac 7261 acaaaacaca cacgatcatg gggacgggga gaggagctgg ttagtccctc cctgctgacc 7321 tcccccaggc cccttatccc tctgccctgc ctccctgttg tctcagcctc actgggtttc 7381 ccatcacccc atcgctaggt ggcctggccc ttaccggcac gtggcccatt taagccaggg 7441 atcccctgaa tgccacaact gtctctggta ctcctgaact gctgagccca gaagggcaca 7501 agccannnnn nnnnnntggg caacagaggt gctagtggag cgctgggaag gagtacacac 7561 aactcagggt gctgagagaa gagaaggcac agcgtcccag ccccagttct ctgacaagtt 7621 ctttctttct ggaagcccta acccatcctg agctgagcag gtggagagga ccccaaggaa 7681 gacagaagga caagtcagac agacagagac aaggaggtgg gttgacacat agacacagat 7741 aaagaaatga agacaccatg cctgtaatcc cagcactttg ggaggctgag gcgggcgcat 7801 cacctgaggt caggatttca agaccagccc ggccaacatg gtaaaacccc gtctctacta 7861 aaaatacaaa aattagctgg gcgtggtggc acatgcctgt aatcccagct atcgggaggc 7921 cgaggcagga gaatcacttg aacctgggag gcagaggttg cagtgagcca agatggcggc 7981 cattgcatcc agcctggccc acagagcaag atttgtccga aaacaaccaa cagacacacc 8041 aaccaaccta aagaaataga gacacagaga agtggaagcg gggcggcaca gaggggattg 8101 gaggggggga cctgggcccc agccacagcc acagtagctt ggcctggggg agcagggtgc 8161 gggcggtggg gtggccgggg ctgaggggtg gtgctcagcc tttcagaaga agctggagcc 8221 ggcctaccag gtgagcaaag gccacaagat ccggctgacc gtggaactgg ctgaccatga 8281 cgctgaggtc aaatggctca agaatggcca ggagatccag atgagcggca ggtgcagcct 8341 ggggtgggga gggggctcgg gtggaggtgg ggacccccat agccttgcct 
cctgcccctc 8401 tccacatgcg tatctctgac tcggtgtggc tctcagcccc atctctctgg gcctaatttc 8461 ccatcctttt gctcctgccg gtccctctct ctctctcctt ttgtctcggg gctcacttcc 8521 tcccgtcggg aacacttcaa cggccccttc tgttctacag caagtaagtt cccctctgga 8581 tggcttgggg tggggggcac agggattatc acggggcgca ctgggcccgg aggggtgtcc 8641 gcagctttcc tgccacttcc ctgcggcccc cacccaggta catctttgag tccatcggtg 8701 ccaagcgtac cctgaccatc agccagtgct cattggcgga cgacgcagcc taccagtgcg 8761 tggtgggtgg cgagaagtgt agcacggagc tctttgtgaa aggtgggcct gggacctgag 8821 gatgtgggaa cctggggagg agatggcctc aggggagcca accctcatgc tcaccctgcc 8881 tggacagagc cccctgtgct catcacgcgc cccttggagg accagctggt gatggtgggg 8941 cagcgggtgg agtttgagtg tgaagtatcg gaggaggggg cgcaagtcaa atggtgagtt 9001 ccagaagcac ggggcatggg tgttgggggc atctgcccag aagaggccac agcacttgcc 9061 acccacccac ccggctaggc tgaaggacgg ggtggagctg acccgggagg agaccttcaa 9121 ataccggttc aagaaggacg ggcagagaca ccacctgatc atcaacgagg ccatgctgga 9181 ggacgcgggg cactatgcac tgtgcactag cgggggccag gcgctggctg agctcattgt 9241 gcagggtgag cctggctggg ggggcacatg aggctttagg gcttggaccc ctcagccccc 9301 accccaccta gccctgtagg ggagcaggca aacctgggct caaggcctct gtgaccttgg 9361 gcctttgaca gcctaaacct catccttctc atctctaaaa tgacgatgct gaggtgaccg 9421 ctccagaggg tactgagagg gtcatacggg cctggtgagc atcaccaccc ctcagacact 9481 tgaggttcct tatgtgcact gcaccatcac aggcaaacat ggcacacaca ggcacacgtg 9541 ttttcacatg cccacatgca cccagacacg tgcgcaccag cgcccatggg cccacacgcc 9601 ctccacaggg attcacgcca cacccacaca ccctcccgag ctcaatggct ctgccctgcc 9661 ctgcagaaaa gaagctggag gtgtaccaga gcatcgcaga cctgatggtg ggcgcaaagg 9721 accaggcggt gttcaaatgt gaggtctcag atgagaatgt tcggggtgtg tggctgaaga 9781 atgggaagga gctggtgccc gacagccgca taaaggtgtc ccacatcggg cggtgagtgt 9841 gcagggcagg tggatgggac aggtggagac tgagatggag acagagagag acacagggag 9901 aaacagagag agagagataa agacaggtgg acagagaggc agatgcacag agacaggaca 9961 gagactgaga cagagaccca cagagaggga gaaacagaca gacccaaaaa gacaaagaga 10021 cagaaagaca gaaatagttc tatagagaca gaaagatatg cacagggagg cacacacata 
10081 ggagagttcg tcagagacag agaagaacag agggacagag acaaagaggc agcaggggag 10141 agacacggac ggggtcgaga gctccccagc accctccccc agcttgtgac cagggcaggg 10201 caggaggcca ccaaggaggc cggctctggc cagaggccct cactgcctct cccgccctct 10261 tgctcatccc tggagcaact gggctggcct gctgggccca agttcacaga gattaatgag 10321 acccagcccc tgcccagtct ccaagggccc agccaaggag atgctttgat gtagaaatca 10381 cttttacgct gcctacctct gaaggcacat taaaaagctg gaaaattaga taatatagaa 10441 aaaaaaaggc atttttaaaa atcagaatac caacaagcca ggacaaggtg agggcctcct 10501 gggaccctgg ctggggtatc tggcaaggcc aggggtgtgt ggcccagtgg ggtcccctga 10561 gccactgctc ccctgcaggg tccacaaact gaccattgac gacgtcacac ctgccgacga 10621 ggctgactac agctttgtgc ccgagggctt cgcctgcaac ctgtcagcca agctccactt 10681 catgggtgag cctgctccag ggtagggtgg gtcggggcag tggggccagg agcccctgtc 10741 actgggccct gcctttgccc ccgtgctact tgctcttcct tctcttgcag aggtcaagat 10801 tgacttcgta cccaggcagg gtaagtcttg gggcccctga gtcttggttc tgctcacttt 10861 cccgcaaccc acccacgcag gcctcccctc cactcacaga gggaaaagca aggatcaaaa 10921 ttgggggtcc tggggcccag ctccagcttt gccactcatg accatgggtc aattataatc 10981 tctgtgagcc tcagtttcct cttctctaaa atgggaaccc tgatagtgcc acctcatagg 11041 atagttataa agatgaggtg acatgatgta tcagacctgt caagtgctgt cacactgtga 11101 ctgttattat tacatctttt cttcttcttc cttttttttt atttttattt tttttgagac 11161 ggattctccc tctgtcgccc acgctggaga gaagtggcgc gatctcggct cactgcaagc 11221 tccgcctccc aggttcacac cactctcctg cctcagcctc ctgagtagct gggactacag 11281 gcgcccgcca ccatgcccag ctaattattt tttgtatttt tagtagagac ggggtttcac 11341 catgttagcc aggatggttt cgatctcctg atctcatgat ctgcctgcct cggcctccca 11401 aagtgctggg attacaggcg tgagccactg cgtccagact gctttgcttt gtttaactga 11461 gggcagattc ttgatttatt tggccaggga acccttttaa tgtgtgcatg tgtgcgtgca 11521 tgtgtgcgcg cgtgcgtgtg tgtgcatgtg tctgtgtgtg tatgtatata tgtgtgtctg 11581 tgtctgtgtg tgtgcgtgta tgtatgtgta tttgtgtctg tgtgcatgtg tgtgtctgtg 11641 tgtgcatgtg tgtatgtgta tgtgtgtctg tgtgtatgtg tatatgcgtg tgtgtgtgtg 11701 tattgtgtgt gtgtgtatgt gtgtggcaag acactgatgt 
tcccccagtt ttcaaaatgc 11761 tatgctaggt gacctgacct gaatattaca agcctcccag ccaacgtcct ggggtgatgg 11821 gaatgttcca gatcttggtt ggattaataa ttgcatgggt gtattcatct gtcagaactc 11881 agcctactgt cagatctgag cataaactat acctcaatac aaatctaaaa aaccaataac 11941 caggggccca ggctcttgag cagccccgcc ccagtgacct gtgctcctcc tggctctccc 12001 gtttctctga actacattgt gtcttctgca gaacctccca agatccacct ggactgccca 12061 ggccgcatac cagacaccat tgtggttgta gctggaaata agctacgtct ggacgtccct 12121 atctctgggg accctgctcc cactgtgatc tggcagaagg ctatcacgca ggtactgtgg 12181 gtccctcctc agtctcccca tctataagat gggtgtgtgg aaccagccaa gtgctaggac 12241 ctgctgcaca ccccacccac atgtgcacgc cctgctgttg gctaccccat gggagagact 12301 ggtcaggggg caggcaaggt gggcagtgtg ggtcaggggg tggaagaatc agcagagggc 12361 cgcttccagg ctgagtcagc tcctctgctc cctacttccc tcctgccctg ttcccagggg 12421 aataaggccc cagccaggcc agccccagat gccccagagg acacaggtga cagcgatgag 12481 tgggtgtttg acaagaaggt gagtgagact gaggtcagga ggaggatcgt ttgtctttca 12541 tcactttcta cctacgtggt gccggccctt ggagaagtct ccataggtgc cacacttggt 12601 cgggtgtgtt gctgccccca gcagggctcc ctgtgggacc ctcacacctc agccagaggg 12661 tctctgttga agttgctgat gcccatcaca aggatgtgga caccctctca ttaccaggaa 12721 cccctcatga ctgggctgct ccctggctcc cttcatccta agctgaccat gaccttgacc 12781 ttgtgctccc tctcaactca ggagtagttt ctgagacact tttctgccag tgtagctgcc 12841 ggggtcttgc tcctgcctgg cactgttggt tcttctgaag gctggggtct tgaggggtcc 12901 tcatgatagc tcacctgaga ggaacaggtg gttcccccag gtcccgtgcc aaatcaagaa 12961 aggcgagccc ctgccttcag tcggtgccca gagatgaatt gtactagaat attgtggatt 13021 cagtcttccc acagactgtg ttccccccat ggacccccgg gacatctggc tgttgggagg 13081 ctgcaaggac ctcctgggtc tgacttggat ctcaccccaa ctctgcaccc ccccagctgc 13141 tgtgtgagac cgagggccgg gtccgcgtgg agaccaccaa ggaccgcagc atcttcacgg 13201 tcgagggggc agagaaggaa gatgagggcg tctacacggt cacagtgaag aaccctgtgg 13261 gcgaggacca ggtcaacctc acagtcaagg tcatcggtga ggccggccgg ggtccaagct 13321 ggagaacaca gaggggcagc cccaaggagg gcccatccgt tcgctcattc agccactcga 13381 caaacatcac gggcatgctc 
cctgagccga gcggcaacag gacaggtttt tctgcccaaa 13441 aggagccccg ctctggggga gatggaggca gatgcatcca gcagatttcc tgcgacactg 13501 ctgtacatga gccatgtcca gagtgccgtg ggagcactga gcagagagcc atccctgtgc 13561 ctgggggtga tgcctttagg aggctgagaa gatggtgagg aaaggcatcc caggcagagg 13621 aaactgtgta caaaggggtg gaggagggaa agggcctttt gggtgtggag aatgaatgaa 13681 acatgtcagg agaggactgg gtggggttgg gaggatcaga gccaggacag gggtgaatat 13741 gggcagtgag aagccttaaa gggttgacgg agggctggcg acatgatcag aaatgcatcc 13801 tgttggagac cagcctggac aacacagtgg agactccata tcactaaaaa aaaaaaaaaa 13861 aaggaaaaga agaaagaaat gcattctaga aagatctctt tggatctggg ggtggtgggg 13921 gcagcctgtg gcggttagtt ggagtgggaa ggggacgagc aacgttactc aaggccctga 13981 gcggggcagg gctgatgtgg gtccatccca ccccatccag acgtgccaga cgcacctgcg 14041 gcccccaaga tcagcaacgt gggagaggac tcctgcacag tacagtggga gccgcctgcc 14101 tacgatggcg ggcagcccat cctgggtgag tgcaagggca ccggatggag gtgtgagggc 14161 gccaaacaga tccgagggaa ggtggtgtgg ggatgcctgg gttccagacc agagctgcca 14221 cctcccctga gccaggctac atcctggagc gcaagaagaa gaagagctac cggtggatgc 14281 agctgaactt cgacctgatt caggagctga gtcatgaagc gcggcgcatg atcgagggcg 14341 tggtgtacga gatgcgcgtc tacgcggtca acgccatcgg catgtccagg cccagccctg 14401 cctcccagcc cttcatgcct atcggtgagc ctgcctggcc tggtcctgcc ccccgccccc 14461 tccccagtta aaaacctcca ataatagcag gtgctctgca ggcagccatg ctagacactc 14521 acatccacag tctcctttca tcctcgactg gggatactca gccgcatttt acagatgagg 14581 aaacaggctc caagcagtta aatgacctgc ttgagatctc ccagctggtt ggggtggagc 14641 tgactctcac actacgccga cccctagttc acgtttcaag cctctctttc catgtctaca 14701 aaatgggaat aatgacactg cctatctcag agagctgtta ggatgatatg ggatgaccag 14761 gcaggccatc tggcatctaa caggtgctca ataaatgttc attgtcccct gttctccaca 14821 cccttcaacc cagggttttc acctatggga tggggcaagg gctgtagtta gtggaggggt 14881 ctcaggggca tctgtctgtg gaaagagtcc tggaccaaga ctcagagatg gcagaggcag 14941 ctccatggtc cccggggggc caactggagt catccctccc tgaagcctca gcttcctcat 15001 ctgtataatg gggtctgagt gggcagaaca gtgttgagac tgtgcaggct cagagacacc 15061 
tgtcccacta ggtggtgatc ggggcagctg ctgtgcaggc agcagacctg gccagatggg 15121 cacacttgga gatgcagatc ggctgtggga gaggtcagct ctcttttagt tttttttttt 15181 ttttttttgg agatggagtc tcgctctgtc acccagactg gagtgcagtc ctggcgcgat 15241 ctggggtcac tgtaaccttc acctcccagg ttcaagcaat tctcccacct cagccttctg 15301 agtagctggg attacaggca tccaccacca tgcccagcta atttttgtat ttttgtagag 15361 acggagtttc accatgttgg ccaggctggt cttgaactcc tgacctcagg tgatcctccc 15421 acctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgtgcctg gctgctgttg 15481 ctgtttgttt ttaagccaat ttacagacat gtcttgtaac agggacaaaa tattccttaa 15541 agtggccaaa agccagctga acaggaagtg ccccctatgt gaccagtggg cagttcagag 15601 tctagggcat ggatctccag cttccccagg cttgctcaga cccctctctg acctctcctc 15661 tgcccaggtc cccccagcga acccacccac ctggcagtag aggacgtctc tgacaccacg 15721 gtctccctca agtggcggcc cccagagcgc gtgggagcag gaggcctgga tggctacagc 15781 gtggagtact gcccagaggg ctgtgagtgt ccccgccccc aacccccctc cccaaaggaa 15841 gaacatgctc accttgccat tgagcagatt cacctgtagt caagtttttt aggcccactt 15901 ttgccgaaag ttgaggacac ccagagaaat atctgtcctg ttttcagaag taaaaaagct 15961 tgcctagtga aaaacaaatg cagagtaaag ctgactgtaa cacttctttg agcagtgcga 16021 aatcagcaac tacattttaa aaacatattt aaggccaggc gcgatggctt acgcctgtaa 16081 tcccagcact ttgggaggcc aaggtgggcg gatcacctga ggtcgggagt tcgagaccag 16141 tctgaccaac gtggagaaac cccgtctcta ctaaaaatac aaaattagcc gggtgtggtg 16201 gtgggcacct gtgatcccag ctacgcggga ggctgaggca ggagaatcac ttgaactcgg 16261 gaggcagaag ttgcggtgag ccaagatcat gccattgcac tccagcctgg gagacaagag 16321 cgaaactcca tctcaaaaat aaataaacaa ataaaaattt aaaaaaacat atttaaatcc 16381 tggagattcc tatcagagga gtgggcagtg ggagtggggt gtcagtggtg acacagcctg 16441 tggccttgcc tccccctccc cacccccagg ctcagagtgg gtggctgccc tgcaggggct 16501 gacagagcac acatcgatac tggtgaagga cctgcccacg ggggcccggc tgcttttccg 16561 agtgcgggca cacaatatgg cagggcctgg agcccctgtt accaccacgg agccggtgac 16621 agtgcaggag atcctgcgtg agtgcccctt tgtgcagtca caagacccgc cattgacacc 16681 ccaggccctt ggtgtccagt ggaggtgggg acagttgggg aggcccagaa 
tgctctgccc 16741 agaacgctgg gcagagacag gaggggttgg agccatcgtc catgcgctag gcctagggaa 16801 aaagtgcaga gagggtggcc aggcacagtg gctcaagcct gtaatcccag cactttagga 16861 ggccaaggca ggcagatcac ttgaggtcag gagtttgaga ccagcctggc caacatagtg 16921 aaaccctgtc tctactaaaa atacaaaaac tagccaggtg tggtggcgcg tgcctgtatt 16981 ccctcaacta cttggggagc tgaggcagaa gaatcgcttg gacccgggag gtggcagttg 17041 caatgagctg agatcacact attgcactcc agtctgggtg acagagtgag acaacgtctc 17101 aaaaaaaaaa aagtgcagaa agggctctct tggtgtatgt cagtccctta gttctggaac 17161 agggaaagta aaagccacag tacccaagct ctcttccagg agtgtctcag ctccaacagg 17221 taaatattgt ctatgtagtc agattggtcc ctctgggatc cacttcccat tcctttgttc 17281 cttgtcccaa caaagggtcc atcttttgta aggattggtt ctcaaagctg gaacgaggca 17341 gcaaactcag gtggtaacaa cacagggcat agggtgtcaa agcccaggct tacaccctgg 17401 tctatgactt actagctgtg tgacttcagt caagttgctt accctctctg atccctgcct 17461 tttgaagtgg tgaggctcgg ggaagccagc tgggcttggg cagcctggag ttgctgtgtt 17521 agcaggagct aaaggaggcg cagggcccag caggcagctt ttttcttagc tgcagctctc 17581 tgggccttgt ctcaagggag gttggagctg tggagcccct cagagctgtg gcgggccctc 17641 acttagctac ccactttata cccacagaac ggccacggct tcagctgccc aggcacctgc 17701 gccagaccat tcagaagaag gtcggggagc ctgtgaacct tctcatccct ttccaggtgg 17761 gactggcccc cttccctgtc ccccagggga gagaggctat agtgtgttgt tcccatccag 17821 tggactgatg tctcagggct ggcctgatct gaggtaccag aggctggggt ggcggccggc 17881 ccttggagtg atccaggttc agggttaagc ttttcctccc ctcagggcaa gccccggcct 17941 caggtgacct ggaccaaaga ggggcagccc ctggcaggcg aggaggtgag catccgcaac 18001 agccccacag acaccatcct gttcatccgg gccgctcgcc gcgtgcattc aggcacttac 18061 caggtgacgg tgcgcattga gaacatggag gacaaggcca cgctggtgct gcaggttgtt 18121 ggtgcgtggc caaggcctcc ttgagccccc ttgtttccct tccctgggct gggctgctgg 18181 ggccaagggt ggggccacgg ggacccaacc cacagctcac attttccagt ccactgcccc 18241 agcaggcctg gccctggttg gcaggggtgg ggtggtccct ggagccagtg accccctgct 18301 cactgtcagg aggcgtggtg acccaactgg gtctgtctcc cgcagacaag ccaagtcctc 18361 cccaggatct ccgggtgact gacgcctggg 
gtcttaatgt ggctctggag tggaagccac 18421 cccaggatgt cggcaacacg gaactctggg ggtacacagt gcagaaagcc gacaagaaga 18481 ccatggtgag cccagggtct ggggtcccca cgtgcaccct gcctaggccc ggcagaccca 18541 ggccgcagct acccttcact gtcctcaccg tggacccctc accctccctc tgctgatctg 18601 aatccctcca tagtaggggt ggagatgcca agggcaccca ggagaggctc tcggcatcag 18661 gaagggcctg gcccagagat gcctcccctc cctccctgcc cccaggagtg gttcaccgtc 18721 ttggagcatt accgccgcac ccactgcgtg gtgccagagc tcatcattgg caatggctac 18781 tacttccgcg tcttcagcca gaatatggtt ggctttagtg acagagcggc caccaccaag 18841 gagcccgtct ttatccccag accaggtgct gtaccctcat tcttccaacc aggggctggg 18901 gcagggaggc tgtgggaaca gggagagggg cctagctttg tgtggccctc tcggtaccaa 18961 gtcctgtcac caacaggcat cacctatgag ccacccaact ataaggccct ggacttctcc 19021 gaggccccaa gcttcaccca gcccctggtg aaccgctcgg tcatcgcggg ctacactgct 19081 atgctctgct gtgctgtccg gggtagcccc aaggtaggga actttaggcg ctggtccagg 19141 ccgaccagga ggcagggctc acaggccccg acgttgagca gtcctctccc cagcctgtct 19201 cccctgcttt ctctccacct tggggtgtat ggaccgggtt cctccccctg ggcagggcca 19261 tggtactcac tcttggttcc atgtttgttt ccagccttgg gcatagtcag ggactctcgt 19321 gggacccccc gagtagaaac acagatgtgt ctccctgggt ccctgccagg tcccctctca 19381 gcctggatgg cttccctccc tctctttacc ttatttatag cccaagattt cctggttcaa 19441 gaatggcctg gacctgggag aagacgcccg cttccgcatg ttcagcaagc agggagtgtt 19501 gactctggag attagaaagc cctgcccctt tgacgggggc atctatgtct gcagggccac 19561 caacttacag ggcgaggcac ggtgtgagtg ccgcctggag gtgcgaggtg aggaccctcg 19621 gggccagggc ctgggagatg ggaagagcgg gcggcagtgg ggaccctggg ctttgctccg 19681 ttgtcctcgg ccaagcacct gccctggctg caagggcacg gaccctgcca cgcccagatg 19741 ggcaatagct tccagaaggc tgggaggaca cagtgacatg gcctcctctt ctgcagtgcc 19801 tcagtgacca ggctggctcc tggggatggc caggtatgtc cacccgacac ccagggcaca 19861 agtcagtggc tgagccctgg cccttggtcc tggggcagcc atcagtaaat gggaggctgt 19921 aggggccctc cattcactcg taagataacc tgtgttgcag gtacaaccgg atgccagccc 19981 cgtgccagga gcctggaggg aagttgggga aacccctccc tactgttgga tgtatgtgtg 20041 acaagtgtgt 
ctcctgtgct gcgatggggg atcagcaggg cagttgtcgg gcagtcctga 20101 gtgggtgttg cacagactgg tccacagggc tcctgaagga agcccctgga tctttggggt 20161 aaaaggaggg tggcctcaag aaacaatgtc tggggacagg cctttctggc ctgctatgtc 20221 ttcccaatgt ttattgggca ataaaagata agtgcagtca cagagaactc actcttcatc 20281 acctgagctc accatagtct aatagaaaac aaccgaaacc aaataattca agcactgctt 20341 attacaattt tactgggtct ctattttacc ctcctacaag cctcagagta cttcgagtct 20401 cccttcacca tttccgacgg catctacggc tcaacatttt ttgtagccac aggcttccac 20461 ggacttcacg tcattattgg ctcaactttc ctcactatct gcttcatccg ccaactaata 20521 tttcacttta catccaaaca tcactttggc ttcgaagccg ccgcctgata ctggcatttt 20581 gtagatgtgg tttgactatt tctgtatgtc tccatctatt gatgagggtc ttaaaaaaaa 20641 aaaaaaaaaa aaggaattct cg // LOCUS HSN10C3 38117 bp DNA PRI 05-JAN-1998 DEFINITION Human DNA sequence from cosmid N10C3 on chromosome 22q12-qter contains platelet-derived growth factor B chain (PDGF-B) and CpG island. ACCESSION Z81010 NID g1621241 KEYWORDS 22q12-qter; CpG island; growth factor; PDGF-B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 38117) AUTHORS Burgess,J. and Odell,C. TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone N10C3. The true left end of clone N10C3 is at 1 in this sequence. The true right end of clone N10C3 is at 38117. N10C3 is from the human chromosome 22-specific cosmid library LL22NCO3, constructed at the Biomedical Sciences Division, Lawrence Livermore National Laboratory, Livermore, CA 94550 under the auspices of the National Laboratory Gene Library Project sponsored by the US Department of Energy. The source of the flow sorted chromosomes was a human/hamster hybrid containing chromosomes Y, 22 and 9. VECTOR: Lawrist 16. 
N10C3 is part of a cosmid contig isolated using YACs from the Sanger Centre chromosome 22 YAC contig described in Collins, J.E. et al Nature 377 Suppl., 367-379. FEATURES Location/Qualifiers source 1..38117 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q12" /clone="N10C3" /clone_lib="L22NCO3L" repeat_region 730..767 /note="19 copies of 2 mer 100 % conserved" repeat_region 806..1001 /note="MIR repeat: matches 260..41 of consensus" repeat_region 1016..1105 /note="MIR repeat: matches 98..184 of consensus" repeat_region 1718..1819 /note="5S repeat: matches 105..1 of consensus" repeat_region 2241..2520 /partial /note="AluSx repeat: matches 1..282 of consensus" repeat_region 2525..2603 /partial /note="AluJo repeat: matches 233..159 of consensus" repeat_region 2607..2658 /note="26 copies of 2 mer 98 % conserved" repeat_region 2678..2813 /note="FLAM_C repeat: matches 133..3 of consensus" repeat_region 3278..3514 /note="MIR repeat: matches 4..260 of consensus" repeat_region 3953..4046 /note="L1MA10 repeat: matches 999..909 of consensus" repeat_region 4072..4230 /partial /note="AluJo repeat: matches 300..136 of consensus" repeat_region 4244..4388 /note="MIR repeat: matches 149..2 of consensus" repeat_region 5073..5218 /note="MIR repeat: matches 148..4 of consensus" misc_feature 5587..8515 /note="Putative CpG island" repeat_region 5789..5830 /note="6 copies of 7 mer 91 % conserved" unsure 6440..6511 CDS join(6585..6647,14677..14773,17026..17115,18723..18928, 20322..20466,24703..24827) /codon_start=1 /product="platelet-derived growth factor B chain (PDGF-B)" /db_xref="PID:e275137" /db_xref="PID:g1621242" /db_xref="SWISS-PROT:P01127" /translation="MNRCWALFLSLCCYLRLVSAEGDPIPEELYEMLSDHSIRSFDDL QRLLHGDPGEEDGAELDLNMTRSHSGGELESLARGRRSLGSLTIAEPAMIAECKTRTE VFEISRRLIDRTNANFLVWPPCVEVQRCSGCCNNRNVQCRPTQVQLRPVQVRKIEIVR KKPIFKKATVTLEDHLACKCETVAAARPVTRSPGGSQEQRAKTPQTRVTIRTVRVRRP PKGKHRKFKHTHDKTALKETLGA" repeat_region 7242..7337 /note="48 copies of 2 mer 88 % 
conserved" unsure 7548..7554 /note="single clone" unsure 7648..7702 /note="single clone" repeat_region 11439..11506 /note="MIR repeat: matches 190..118 of consensus" repeat_region 11579..11768 /note="MER20 repeat: matches 218..1 of consensus" repeat_region 11898..12026 /note="MIR repeat: matches 46..174 of consensus" repeat_region 12303..12345 /note="MIR2 repeat: matches 145..103 of consensus" repeat_region 16518..16549 /note="8 copies of 4 mer 88 % conserved" repeat_region 17345..17483 /note="MIR repeat: matches 194..51 of consensus" repeat_region 17763..18057 /note="AluSc repeat: matches 1..295 of consensus" repeat_region 18185..18260 /note="MIR repeat: matches 193..117 of consensus" repeat_region 18514..18600 /note="MIR repeat: matches 144..50 of consensus" repeat_region 21159..21355 /partial /note="AluJb repeat: matches 93..292 of consensus" repeat_region 21656..21943 /note="AluSx repeat: matches 290..1 of consensus" repeat_region 21944..22244 /note="AluJo repeat: matches 296..1 of consensus" repeat_region 22952..23065 /note="MIR repeat: matches 110..241 of consensus" repeat_region 23157..23292 /partial /note="AluSq repeat: matches 1..136 of consensus" repeat_region 23294..23592 /note="AluSx repeat: matches 3..301 of consensus" repeat_region 23593..23673 /note="MIR repeat: matches 131..47 of consensus" repeat_region 23977..24378 /note="MLT1G repeat: matches 412..12 of consensus" repeat_region 24388..24593 /note="MIR repeat: matches 233..28 of consensus" misc_feature <25233..26863 /note="match: STS G06144" repeat_region 27331..27400 /note="MIR repeat: matches 46..115 of consensus" repeat_region 28857..28913 /note="3 copies of 19 mer 84 % conserved" repeat_region 29303..29379 /note="MIR2 repeat: matches 63..141 of consensus" repeat_region 29647..30222 /note="144 copies of 4 mer 83 % conserved" repeat_region 31470..31520 /note="3 copies of 17 mer 100 % conserved" repeat_region 31591..32482 /note="L1PA2 repeat: matches 893..1 of consensus" repeat_region 
32331..33726 /note="L1 repeat: matches 5390..3981 of consensus" repeat_region 33823..34117 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 35050..35230 /partial /note="AluJo repeat: matches 302..121 of consensus" repeat_region 35270..35392 /partial /note="AluJb repeat: matches 122..1 of consensus" repeat_region 36607..36893 /note="AluSx repeat: matches 288..1 of consensus" repeat_region 36950..37244 /note="AluSx repeat: matches 3..302 of consensus" repeat_region 37860..38054 /note="L1MA10 repeat: matches 291..485 of consensus" repeat_region 38056..38117 /partial /note="AluY repeat: matches 1..62 of consensus" BASE COUNT 7996 a 10576 c 10309 g 9233 t 3 others ORIGIN 1 gatcagagca ccaggagtgg gtcagggctc agccaggctc tctttctttg ctggtcaaat 61 ggtaatcact gtcctaccgc cttcctcagg ctgtcagctc aagagcacga ttggcccagg 121 gtgaatgctc tgcaagggag gctgtcctga tgacgggctc actccaggcc caagaggcag 181 gcagcaaagg acgctgtcag gatgggcctt ctctggggag agccagactt caagcatcaa 241 ggtggggagg gggggcccag acccccgata gggtcatcct ccagcccatc tggcccagct 301 cccctcaccc caggacggtt tcactcgaat tccagccagc ttgcagggca gataggtcca 361 agctgccccg tccaaacatc ctcttccttg tctggagcct ggaggagcta aaaatgcatc 421 agctttggga aaacaaacac tgcgcttccc tcgcctttgt atacaggaca aggcccaagg 481 ccgcagggtg ggctcctcca ggctggctta ggcctggggg tccatctgtc tgccttcctg 541 ggggtctcct ctcctgcctc cctggcactc acaggccagt ggccagagcc tcattagggt 601 gagaaggtct gagagaagtg gcgggagttc tatgtaagaa caacctccac cctctccagg 661 tgaagacctg tggcccctct cctagcgctg gctctggagc tcagaggtga atttgcaagt 721 gaggtagggg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgcg cgcgcgcgca 781 gagcctgcag taataaccat cactgtgatg accaacatct gtggggtgct cactccaggc 841 tttatcactt caggtgaatt aactcattta acccagtgag gtagggacta ttgtccctac 901 gacttacaga ggaggaacgt gaggctcagg aaggttaagg aatttgctta aggtcacaga 961 gttagtagat ggtagggcag ggctttgaac ccaggtggtc tttgacggga agcctagtcc 1021 cttgccctct ctcagcctca ctttcctcat ctgtacaatg ggcatatacg atggcaatgc 1081 cttctccctg ggcgactgtg agggtgaggc aggcagaggg 
gaaggaagtg ctttgtggac 1141 agccaaaggc cctgtagaca gggaggcagg ggatggggca gcaaggggca tgggggagct 1201 gatgagggcc acgcggagtg gggtagggct tggagtttca gaccatttgt ggcatggtct 1261 tggacggtct ccagcttaga gcctcacttc ctctccatca agtatggaga agcagagctg 1321 cctggcccag gtgttgctgg cccctctcag tggcggggag gctcccagtg atcatcgctc 1381 tcacctttct ggacttgagg gcgtgagttt gcccccggga tggaacccaa ctccttcctt 1441 ggtatttctt caccccagcc agtgtcctgg acccccacgc ccagggtgta agatgctctt 1501 catggatgag ggcttggata gctctgagca cacagtctaa aaagccccca accttgggca 1561 gcacccccat cccccacggc tgtggccact ctttttccct ttggctcaga tctgtcctgg 1621 agtggagggt ggagagttac gggggtgtgg gcgcccccat ccctagctct gctggagaac 1681 cttatttccc agtggcctga acgcgaagat gggcgggggc cttcccagtt ctctcattca 1741 agtgctcctc agacccagcc ctgatgaagt tccgaggtca gatgagacca cgtgcattca 1801 gggtggtgtg gcttcagacc cttcccagtt ctgatacctc cagaaggggt gcccttcagg 1861 ggccctgtgc ttccaagctt agaggctcat gctacaagcc cctccccaac tctttccacc 1921 ctttaggaag ggcctggcat ctctgcaata actggtcagg actcagaaag agaaattggc 1981 aggattttaa tcatttattc attccttcag cgagcggcac ttcctgagca gccactgtat 2041 gctaggagct ggcaggaggc ccttgaacac tctggactca gtctagtgct agattgatga 2101 tcaggccatg acgatgccag gtgtgcatgc tgtggagcct ggaacaagtg ccaagcctgg 2161 tgggccttct ggaggaggtg cctgctacac ccagcctggc aggaggcaga cgatcaggtg 2221 ttattaaaaa aaaattttta ggctgggtgt ggtggttcat gcctgtaatc ccagcacttc 2281 gggaggctga gtggggcaga tcacttgagg tcaggagatc gagagcaccc tggtcaacat 2341 ggtgaaaccg tctctactaa aaatacaaaa attagctggg catggtggca ggcgcctgtg 2401 gtctcagcta ctcgggaggc tgaggcagga gaatcgcttg aacctgggag gcagaggttg 2461 cagtgagaag agatcgcacc acggcactcc agcctcggcg acagaccgag accccgtctc 2521 tgagatggct cactgcagtc ttgaactcct gggctcaggg gattctccct cctgactcag 2581 cctcctgagt agctggggcc acactcagat atatatatat atatatatat atatatatat 2641 atatatatat atatatattt tttttttttt tttttttttt ttttggtaat tttttttaag 2701 agatgggatc tcgctatgtt gctgaggctg gtctcgaact cctggctgca catgatcctc 2761 tggcctcagc ctcccaaatt gctgggaata caggggtgag ccaccgtgcc 
tggggaggtg 2821 ttctttacag actgttcctc tgcacagaga aggtacaaga aaggagccct tagcacaaca 2881 ttcaggccca ggtgtcacgc aggtgtcacg tagcattcag gcccaggtgt ctgctgggcc 2941 ctcttctgcc acttctaacc catgtactgg cacctgcact gcaatgattt gggtctatct 3001 cactgagagc atgggaagtc cctgagtgcc gaggctggga ctccttcctc attccatttc 3061 tcagcgcttc acacagggtc tgcaaacagt aggcgttcta cgatgctgct ctgagtagcc 3121 ttccctctcc ctcaacttgt acctaagtca gagttcatat gggtccatca ctttttagct 3181 ggcagagcag gttcccacat attaatgcat gtggcgctca ccaaagcgct gtgtcaaaag 3241 tgtcatcatt gtcatcagca tcacctgcca gaaaattgca cagcacggtg gtctcagcat 3301 gtgggctctg tagctggact gctcagattc aagtcctggc cttaccactt acgggctgtg 3361 tgaccttgaa caagtgtatt ggcctctctg tgcttccatt tccccatctg taagatagga 3421 ataacacatc tcacaggagc atggagaggc tacaggtaga gtaagaaaaa cactagctac 3481 cacatagtca gctcactaat catggtagtt gttaccattc tatttataga agacacagaa 3541 ggtcaaggtc aaatgaccca gctgtccggt ggaagagggt ttttatctct gagtcgttag 3601 acttcaaatc tagtgtttgg gggttatctc ctcgtcccgc tgccccatct cccttgtcac 3661 cctgtctcct aatttatctg tttatacatc tggttgtcct tctcaatggg gctcttcaaa 3721 ggcaggggcc tgcattaggc tggttcacac ctgtctcccc agtgcctgga gcagtgcggg 3781 gccaccagtg aacttgaagg ctgcatggaa ggttccaggc gcttgcttcc tgctaaggtg 3841 ttttgctgag cgcctccaca tggaggacct catgggagcc tccagaccac tccaggagca 3901 gggtaaaaat tctctacaac tgcactgagg tacaacaaac tgcacacatt ttgttgtgat 3961 gtacgtatac acccataagc ccatcaccac aatcaaaaat gacaaacatt tccatcacca 4021 ctaaaatttt gccatgcctc tctgttctct ctcactcctt cccattatag atttattttt 4081 atttctttag agatagggtc ttgctctgtt gcccaggctg gagtgcagtg gcacagtcac 4141 aactcactgc agcctcaaac tcctgggctc aaaacgatcc acagtctcct gagtagctgg 4201 gactacagga gcttgttacc acacccagct ccagtttata aattcatctc cagtttataa 4261 aggaggaaac cgaggtactg agaggttaaa aaaccttcct gcagacactt gtccagcaag 4321 tggccactcc aggatttgga ccaaggtgat gtgtcttcag gctgtgtctc tgccactgtg 4381 ccacgctgct gggtggtagg cagcagtggg tgggtgcctg cagtggtctg taaagaccac 4441 ctgagatgtc cttcctcctc tgttccaccc tgtccaggtc caagaagaca gtctatgaag 
4501 agagagcagg tgtgactctc tcagtgtgct cctctgtgag aagcaggctg acatcccaaa 4561 gggaagggcg gataacagag acagtgcaag cggaggagat gagggtgcct caaagccggg 4621 aggctgggtg atgcaggagc ctgcgtgtcc cgaggggggt gctgggccca gtgtgagtac 4681 gtgtgactgt gactgagaca gtgtgactgc tgaaggcagg gacacagcag ctccctgact 4741 gggggcagaa ggcgttaact gtgtgaaggc tggttgtggg tgggtgggct ctgggcctcg 4801 aacccggggg ctgagggaga tagtaaacag cagggtgact gacgggaaga tcatgttggt 4861 agccctgcga agatgctgca gggctgtggg ggtttgtgtg actttgcagt tcaacaaatt 4921 caaattcagc caacgctggc agggcctgtt gtgccaggca accagctagg aggaggagac 4981 tcggacccag cttgcagctg aagggcgctg gctgccgggt tctgtgggtt caccttgcgg 5041 tgtcttccct tgctaacact gagtccttac aatagcccca tctccaggtt gaggctagat 5101 ggaggggaca gagggaagtg acttgcccaa ggtgacccaa gctcccgagt gccagggcag 5161 gatctgaatt caggctctca gactgcagag cctgagtccc tccctgccat gcctgtgcca 5221 gggtggaaat gtctggtcct ggaggggagc gtggactcct ggccttggct ctggagacat 5281 ccccctagac cacgtgggct cctaacctgt ccatggtcac tgtgctgagg ggcgggacgg 5341 tgggtcaccc ctagttcttt tttccccagg gccagattca tggactgaag ggttgctcgg 5401 ctctcagaga ccccctaagc gccccgccct ggccccaagc cctcccccag ctcccgcgtc 5461 ccccccctcc tggcgctgac tccgggccag aagaggaaag gctgtctcca cccacctctc 5521 gcactctccc ttctccttta taaaggccgg aacagctgaa agggtggcaa cttctcctcc 5581 tgcagccggg agcggcctgc ctgcctccct gcgcacccgc agcctccccc gctgcctccc 5641 tagggctccc ctccggccgc cagcgcccat ttttcattcc ctagatagag atactttgcg 5701 cgcacacaca tacatacgcg cgcaaaaagg aaaaaaaaaa aaaaaagccc accctccagc 5761 ctcgctgcaa agagaaaacc ggagcagccg cagctcgcag ctcgcagctc gcagcccgca 5821 gcccgcagag gacgcccaga gcggcgagcg ggcgggcaga cggaccgacg gactcgcgcc 5881 gcgtccacct gtcggccggg cccagccgag cgcgcagcgg gcacgccgcg cgcgcggagc 5941 agccgtgccc gccgcccggg ccccgcgcca gggcgcacac gctcccgccc ccctacccgg 6001 cccgggcggg agtttgcacc tctccctgcc cgggtgctcg agctgccgtt gcaaagccaa 6061 ctttggaaaa agttttttgg gggagacttg ggccttgagg tgcccagctc cgcgctttcc 6121 gattttgggg gcctttccag aaaatgttgc aaaaaagcta agccggcggg cagaggaaaa 6181 
cgcctgtagc cggcgagtga agacgaacca tcgactgccg tgttcctttt cctcttggag 6241 gttggagtcc cctgggcgcc cccacacggc tagacgcctc ggctggttcg cgacgcagcc 6301 ccccggccgt ggatgctcac tcgggctcgg gatccgccca ggtagcggcc tcggacccag 6361 gtcctgcgcc caggtcctcc cctgcccccc agcgacggag ccggggccgg gggcggcggc 6421 gcccgggggc catgcgggtg agccgcggct gcagaggctt gagcgcctga tcgccgcgga 6481 ccggagcgga gcccaccccc ctccccagcc ccccaccctg gccgcggggg cggcgcgctc 6541 gatctacgcg tccggggccc cgcggggccg ggcccggagt cggcatgaat cgctgctggg 6601 cgctcttcct gtctctctgc tgctacctgc gtctggtcag cgccgaggtg agttgccacg 6661 gcggctgggg ctggttcttc attcattacc ttcgcccccc ccttctgacc gccccctcct 6721 ctccctgcag tgaactttgg acccttgcag cccgcgagcc tgacgccggg cgctgggtga 6781 cctcttcggg ctgggagcga ggtccggggg tgacaggctc taagggaagg caacagcggt 6841 ggctttcttt ccaaccggcg ggcgaatctg gctccctaag ccgttccgtg tcgggggagg 6901 gtgtgtgtgg ccctgtcccc caccctttgg gaacccgaga acaagcccct tcccggccgg 6961 gggagagggg gtggggtggt gcccagggtg cagaaggcag cgcgtcctcc cgagcccact 7021 tcggcgccag cctcggctta ggctctgtcc tgccatcggc ttgcccagga ggtgcaagct 7081 tgcgcaccgg tgccatccac gccgaccttt gacggggctc gggtcccacc ctggctcccg 7141 gtcggggtcc gggtggacgc cctccggcac gcgcagggca tgcccggccc cagagcacgg 7201 tctggaagca gcttgtgtct cgggcgcggg gtttggggtg ggagtaacag ggagagagag 7261 agagagagag agaaagagag agagagagag agagacggag agagagacgg agagagagag 7321 ggagagaaag atagagacgc aggtaggttt cgggcactcg ggtaggggga ggactagggg 7381 ctctgaagga atccagcccc cggggcgctg tgtgcctgag gccttgccgt acctgtgccc 7441 gcgccctgca ggaggaagca cctctctccc ccgcatgcag ggtgcggagc gcgccgcctg 7501 caaaagccta gctgcacggg agatggggta taaaaacatc tgagggagcg ctggggtcgg 7561 agccgtcccc acccctccca gcctcacgta gccgtgagtt ggcaagtcta cccggaatcc 7621 aggtgggaaa gggggcgggg caggacgacg cgcgcggtgg aggggaggag agggcagccg 7681 ggagcgcggc gtgactgggt accgtcccac ctccgcccgg ctgcggccgg gagtccgcac 7741 ctcgcccgca cctcgccggc tcccggggag ggtcggcccc tgcggcctcc agccaggcgc 7801 aggaaggagg cagcggcggc gaaggggtta aggcggaggg ctcggaggcc gcggcccggc 7861 cttgggctgc 
cgccagcgca ggttgttttg accacggagg agccgtcgcc gtctcctttt 7921 gttctcgggg ctcctcgagg gccgcaggcc gcccgcccgg gggccccgcc cttccgcggc 7981 cgccccccgc ggcccgcacc cgggagggag gacgcggggc tcggccgggc cgccttcagg 8041 cccctcccga cgccccctct ccgcctctcc cgcagccccc cgggccgcgg ccagctgttg 8101 gggagggggt ggccggccgg ccccagctgc cgcctcgcct cgccgtggga cctgggggct 8161 gggcccggtg ccatggcgtc ctgggaacgg cggcgccccg tggctgctct ccgcagccca 8221 ccccgcccgg gcccccgact cgctcactca ccccacgcat gcacactcct ggccggaggc 8281 gatgctgcgc tccggcgggc gggcgcgcag agcgacgggc acgcactacc gcggccaggt 8341 cgcgcgcccg cctctgccac gcccgtgcac acgcgggaca cacgcgcgcg caccgcacac 8401 aaacgcacgc acggcccctg cacgcgcggc ctgagcacac gtgcgcgcac acccacgcac 8461 gtacacaggc acgcacgggc ccccgtgcac ccagcgcctg gtgctcgccc ccgcgcaaca 8521 ggtgggccct ccgtgggccc tggcacctca ccacctctgt agcggcccca tttccttcct 8581 ggcgtcctgt gagggaggga gaacctccca tcagcaccac agcacctact tttttttttg 8641 cctcgtcagc ccgacgcccc tcaaacctta cccatctgtg actccttttt ttcaaccacc 8701 tccgcgtggt ggaaaatggt ggtgatgtga ctctgagggg cactgagctg tccagagcga 8761 ttcccccttc acataagcct tcatttgaac ctgcaagact ggagggacct ggcgtgtgca 8821 accgcggagg gggctcccac ccctggctgt tgcattctct tggctgatcc cagcgtgccc 8881 cggggaggcc gctgacagct ggatgtttcc ccagcctccc cttaccattt ccagcttcgt 8941 ccagcacctc ctccttcttt cccacagctc cacgggctcg tgtatctggg gtggaggctg 9001 tggcacagaa actgcctttc tcctcacttt agtcacagca ttcttgaaca catggccaca 9061 ggcgcgatgt atgtggcact ttgcagttta tgaagcactt tgctgctaag cctgagtgag 9121 cctcaggctg gccctggggg aggggacctg catggggatg gaaccacgca ggggtcagtc 9181 caggaaggag ctgtaatggc cagtgctggg agagtcaggg caggcctgct ggtggaggtg 9241 gccttggagc tgtccacgtc ctggtcgtgc tcggactaat ctttcagcag acggcaggca 9301 gccgtgaggc agggctgggt ggagggcctg ccgaggcctc tgaggtgcca tctccaccag 9361 ctgagctggc ttccaggagg gcgagtccca ctgtcacgtg acgcgtctgg cctcagcaca 9421 cttcttccgg gaaagagtga anngggcccc actgcccttt gccatccagc ttcctctggc 9481 tttgctaatg gccctagggg gcaggagacc aactgctgga atcccagagc cctggaggtg 9541 tgcaagggca ggtcaaacag 
aatttggagg atctggtgca agagccagga agagagagag 9601 agagagagtg tgtgtgtgtg tgtgtgtgcg catctgagag agagagagag agagactgac 9661 tgagcaggaa tggtgagatg tttatcatgg gcctcgtaag tacctctcca cgtcttgtct 9721 tcccctcccc acattgagga gcctcttctg tgacaactct tcctatgttc tggtttattt 9781 cattgtttat tacctgcttt ctctactgga gtgtcaaccc cattagagag ctttcctcct 9841 ggtccccact tttagaacag tgcccagagc cctgtggaca atcagtaact attagtgcaa 9901 agaacagtta gtgctgggtt ggtggggagg gcaggacctg gccccctgtt gctccatggc 9961 cccactgagt cctcgctttg ctctgctccg agggcagccc cccttttcag ccctctggta 10021 ggcatcagtc tttcaaagcc ccccatcttg agagagaagc atgtctgtct ctcctctgtc 10081 ttgggacttg ggctaccagg attagacagg acagaggaga agaaacagaa agaaaatcag 10141 actgcaaagg ttctgtgtat ctttagctgg ggctaagttt ctgccaccta aaaaaaaaaa 10201 gagagagaga agattggatt agtgcagagt tcccaaagaa tggcagaatc ccctggcccc 10261 gtcccctggc tgggatgggg atgggcagtg attgccgtgg ttgccagtat tataaatggc 10321 tggactggtt cctctcttcc tgctttaatt ttcctctgct cctgcaagag gtggggaggc 10381 cagggttctg ggcccacgtg aaggagcaga ggctgagcgg gagggtccca agagcaataa 10441 agaaggggaa tgctaggtgg gaaacactgg ttctaatggc ttctgtggtt tgccccgaga 10501 gggcttcttc aaaggggttg gttggctttg gcattcgatc taaataaggc ctgcgactct 10561 caggcaggca ggctctggga ggcctcatca gcttcttcct ctgccagcca cagacaacgc 10621 ccctggttgc ttgggcctgt gtgtcccttg gtgggaatgg caggcgggcc ggggagtgtg 10681 tatctgtgtg tgtgtgctgg gcccaggagg agggtggggt tcaagcccct ttgatctgcc 10741 agcctggttg ggagcagatc actcacctgg cctcacgctc gctcgtgccc ttcctacctg 10801 ctgcagctgg cgctgggggc ggggtcggag gaggctgttt accttggctc ccacggctgg 10861 ctttgcccca gctgcctcct cgccacgccc tcactctgcc agaaaccccg ggcctgagat 10921 cttgggacag cttcttcagg gtgccaggcc tcctttccca tctctgaagt gagctgtcca 10981 cctggaggcc tgcggaacct gtgcccagga aaaccaggct ccgggcggct caccttccca 11041 taccaagaag agcctgtgac tcccaacagg tgctcatgct cgtcatcccc agagcattgc 11101 atcctggagc tgagcacgtg ctgagtgtcc ccccaccctc acccaccccc agccccggaa 11161 gggccttgta agcccacacg gcccaggctc tgccagtgtg gaggtagggt accatttcct 11221 gtggcccagc 
acaagggata atgcaaagtc acgcactctt tcatgggcag gcagctctcc 11281 acccactcct tgtcatcctc aaaaatgtcc tgtgctgctg gccctgaaca cgtgtgccac 11341 tcgctgctgc ccacaaagga gccatccgga aagaattaat gatgatgaca gtgacaactc 11401 attaagctcc cattatctgc tggcagctct gctctgtgca attatgcctc ccagtaaccc 11461 cgggcgaggt attattagcc ccatttaaaa gatgcagaag ccgaggtttt agattgtgac 11521 aagattccaa gaatccaggt tgccctggca tctggagccc ttgctctttc ctccgctaca 11581 gggtttctca gcctctgcac tcctgatgtt tgggaaggcc gtcctgtgcc ctgaagggtg 11641 gtgagcggca tccccggcct cccccaactt aagatgcagt gcacacccca agttgtggcc 11701 accagcgatg ccttcataca ttgccactgt cccctggggg cccactgggc cctggttgag 11761 agcctctgac gtgggacatg ctgcagagta aagagccaac acgaaccttg tgtgatgctg 11821 agggccagtg gccttagaac acaggagagg gaggctgaga cgccccatgg tgggctggaa 11881 caggtgggca gtctggagcc tgggaacgac ccagctctgc cacttgctaa ttgtctggac 11941 ctgttgctgc ttgtttaatt ttttctaagc ttcagtgttc tcatctggag gatggggata 12001 caaatactgc cagcctctta ggattggtag tttatacggt gtttttctca gcgtgtggct 12061 tgtagctggc attcatagat gttagttgtg gtattattac caaatccggg ggcttatggg 12121 tagttctggg catctagtgg ccacattacc aaaagcatct tcagggacgg tctccctgct 12181 cagattaaaa gggactctag ttgctcattt gcctagggtt ggggaagaat atcccaggtg 12241 gccctcctgg tggaggaggc agaagaatca tactggacag aacctggaaa atggtagact 12301 ccttcatgca tttgaccagt atttaccgag gacctactat gtgcctctct ctcctcctca 12361 gaccctatcc tgggctcttt cacactcagc cttggagggg catctcctat gtgtcgtggg 12421 cctggatctg gggtgtccac tcggcctgtt caggggtctg cctcctctgc cgtggctggc 12481 agctgcagcc ggctctccag tgggcaggga gatcttggca caagcactaa atgccatttg 12541 ctactaggcc tttgggcccc tcccctgggg ggttaattga atctttactc cacaaacgct 12601 tcttgttctc cttctgccag tgatccaggg aggcctggct aggcacccgg ctttggcgtt 12661 ttctctgccc cttctaacgc ctcctggcag aattagaatt gaatcgaatg ttcttcaagc 12721 ccctcgattc acagatgaga gccaggacca taggggcagt gaccagccga ggtcacacgg 12781 cagttggtcc aaaggcgtgg gattcagctc tctggcctgc cagttctggg tttgacagaa 12841 ctctcagggg aagcggcttt cgctggtgtt cactggagac caagccctct gttctgtggg 
12901 ttgggtactt tgtcaacatt tggagagcac cggaggcgtg gtctctcttc tgaggtggaa 12961 agaatgaact tgcttgaggg cgggaggagg cagagcccgg aggagatggg gccgaaagtg 13021 aggggcagcg gggcccatac cctcctctgt cttgcacgtg cagcccctgg ctgcccggat 13081 attgaaggaa agaccctagt tctcagccaa gccctgccac ctttttttcc tcaagttgcc 13141 cagcagccct tagacattgg gcggagttgt aagctccatg ttcaaacaaa gacatggaga 13201 ctcaggggtt tcagggaatg atgtagcttt aaaagtggtc acccccaccc cccgccctga 13261 gtcttctggc tcctggatcg atgtgggtgg ccagagttgg tcatggatga acccctggag 13321 gtgttgctag acctcatgat aagctggagg aaacagagct ttcggggaga gccccctgag 13381 gctggccaca gggctgagac gacctcttct cctctccctc gctccccctc cgccgtcagg 13441 cagcgagaaa gcagcatccc ttttgtgaag gtagcctgca ccgaccccct ctgagccccc 13501 cgaccttcgg ctccgggaat gtggagcgaa ccaagaaggg aagtcctccc ttcttaatcc 13561 agggaaactg aggcagaaaa tagtttaacc caaggttcta tagtaatgtg ccaagcctat 13621 tgatgccctc tccaggactg tctgtccaga ccgtgcagcc cgagtgcctg tgcctgcacg 13681 tgtgtgtttg cacacgtgga agtccggttc cagcctttga acgtgcagcg acatcttagg 13741 aattttcttt gcttctcagt ggctggtggt ctggattgtg attctcagtc tcccatctgc 13801 cctcgcctct ttcttgggat ctctggcatc cgccccttgg gtctcctggg actgtggggg 13861 acgtggctgt gaccttccta cggggctgct cttgtgtcat tgtgggacct gcggtttccc 13921 tggaggaagc ccagctccga atgaggcctg ttgaagtgtt atcaggtttc gagtatgttt 13981 ggcaacaggc caaaggggct ctaaaaatag ggtttggttg ggcacagggc acgggtaagc 14041 tttcaggttt cccggccgtg tccgaggatt cccttttcgc ctctttcgag tgagagactt 14101 catagatacc gctaggacct ggggcctcct gaggccatcg cactctgcac gcgtttatta 14161 agcacctctt gtacccagga gcctgggagt acaggctctg tcatcccaaa ggcatgggtt 14221 tgaatgcagc ttttgccaat aattgagcct ctctttgaac gtttgttttc ccctctgtaa 14281 aatgggggtt agattggcca tgggagatgc tggctgtctt cagctgtcag gtgtgttgtg 14341 agactcaggg agaaatcaga ttttggtgca ttcccagaac atgtagtttg gtgggggaga 14401 cagacataga gacaagcagg tccaactcaa agcaagctgg ggttcctgtt gggggttgag 14461 ggtgcaggga ctgagctggg tctcagaggc ttcggcaggt ccaggccccg aggcctttgt 14521 gctcctgatc atcaggcctg gatcctgcct gtccgtctcc 
ctgtgacctt ggagctttcc 14581 acaggagaaa gcgagaaagc gtgtggtgng gggagacagc catgctggaa agcccccact 14641 cccagctcac tcagcctttt ggtgtctgcc cggcaggggg accccattcc cgaggagctt 14701 tatgagatgc tgagtgacca ctcgatccgc tcctttgatg atctccaacg cctgctgcac 14761 ggagaccccg gaggtaaatg gaatcctcgc cccgcgctcc ggccctccga ggagacttta 14821 agagatctgg gaggggcagg acaggaggca tccctccttc ttgacgtctg gagaactaga 14881 ggcccatggc agcccagaga gagcgtggcc acacccatcc agggcagggc cgagtcagca 14941 ggcgggttgg tacctgggac ttggggtgtg gcaggagaag cacccacgtg tggctccggc 15001 ttggtgccaa gggtggggtg ccaggaggag ggagggcggg gagaaggtgc ccagggcatc 15061 tgccgagtat gtctccaggt gtgggcccag ccagggaggg tggcactgga ggagtcatct 15121 ccccgcccct ctggctaggg gcccggtctg tgaggtccct ggggacacct ggcctcgacc 15181 agcaggcagc cctgcccgtg ccccagagct cactcagcga atgggcacgt gctcggtggc 15241 acacacgtgg cggggctggc gggctggatt gacaatgtat ttataaacgc tgtcttcaga 15301 gcaaattcca ttctattcta acctctggcc tgttccctgg agccctggtc agcagccccc 15361 ctgcaccccc aggtcccctt ccctctgggg ttctgtctct ttgtcacttt gtaatccttg 15421 cccagacgct atctacgggg gaacagcatt tcctgccttt gtttcctctc cccgttgggc 15481 ccctggctcc ctctcaaaag cattcccggg ccctttcaat cccgcctgtg ctggggggcg 15541 gtgaggcagg caggaggggg ccccagctgg gcccacctat tgttcgccag ggcccccacc 15601 cgatgtctcc cacaaccccc accctatgcc cgactggccg cccctggcca cacacaatgg 15661 ggcaacttcc aaatttagcc tttctaccat ttctttccaa gcctttacac ccccaccctc 15721 agacggcccc tccacacccc tggtgggggt cgggatcttg gagatgaggt ttcaatagca 15781 ggcctgtttc aaggcaacca tgtggctatt ttttcctaat caacttaacc tttccacaaa 15841 gcgcatcttt tccccatctc ctccccacca gggacattcc agagatggca gggagaaagg 15901 aaaggagcca gagggacaga taggctcgtt gacgtagggg ctgtcaggct ggacggaagc 15961 ggactagtca tcagactccc tgcagtaccc gggacggaca ggcggcaggc gcctttgtct 16021 tggatggggc cagtgggtcg gtgccagaac tggagccatg gggtgctgtg tggaggtggc 16081 aggagcgtgt tcccaatgcc ccttagttta tttgtggctc ctcacccatg agagagacag 16141 aagggtctgt ccacgcctta acatccagct cagaggcctc cctcagccct cacttcccag 16201 cagccaacct ctggccatgg 
gctctggaaa ggaccgatgt ctccaaggag ggctgatggg 16261 actgagatgg tgcccaggct agaaaaggga gctgtcattt gtgaccagag gaatcaacag 16321 acggctgggg gcagaggtgg ggcaacagtt tagagaaggg gagggagaca gacacagata 16381 cagatacggt gtgagggccc cggagctaga gagaccaggc cctagaaggg tgtgtggcag 16441 ggactcccca cccccagcag gcccaactcc cacccctctt ctcctcttct ctgtcttcag 16501 ctcccacccc catccctctc cctccctccc tccctccgtg gctcccgcct gcccagcgcg 16561 gccattgtcc ggggctccag aggggctgtg ttcagggcat gcggtccccc tggggcttct 16621 gggaatttct ccattcagca cttcctatgg gaacgctggg tggaggggca ctggaaactg 16681 gccttcagtg ccctaggtcc cggccccgcc ctggaggccg aggagggttc acttacagta 16741 gcaaagggga agggttattt ttaactccac tgacatgggt tctggccaaa aatgttggct 16801 aaagagccag ggatgcagtc aggatcctgg gcctgcagct gtcctggggg ctgcttggag 16861 ggtccctgct cagggccagg acccgtgcat ttatgggggc tggaaggagg gactgttctc 16921 tggagtgggg gagaggcaaa cctcccgcct gcccagacac cccggcggag ggagcccctg 16981 cccaaggtgg ggccgcctgg tgtctgactg tgacttctcc tgcagaggaa gatggggccg 17041 agttggacct gaacatgacc cgctcccact ctggaggcga gctggagagc ttggctcgtg 17101 gaagaaggag cctgggtaag actgagacac ccaacaaggg tccttcaaat tagcatgggg 17161 gccagggaaa gagaacgggg gcgggcagcc agtcggaggg ccccgagggt cctcttacaa 17221 ctcccgttcc ccccacattc aggactgagc gaactgagag agtaaaacca tccttaaatg 17281 cctgtgttca gtcatcacag cccctcgccc ccacactctc tggtttctaa atatagtgct 17341 gcaaagctca ttcgttcctt atttcaatcc tggaggtaga tattattaac cccatttcac 17401 agagttggaa actggggccc agagagggta aagaactagg cgaggtcaca ctgctaggga 17461 gcggtagggc tgagatccca acctgctccc ctcacttgaa cccagcgtgg tccctgatgg 17521 gctatgggta tctcctgtgg caaagggccg cggcctaggc tggtagacct cgagcaaagc 17581 ctggaacttc ctcctgctcc cttgcagaga cctaaaggag accgtagatc attaggtccc 17641 agagaaaccc tcccaggtga ggcaaatgtc cactcaaacc cctcctcctc tttgacacct 17701 ggagtcccca aaacactgca gttagaggct gggggaacga tgaataagag tcaggtggcc 17761 tgggccgggc atggtggctc acgcctgtaa tcctagcact ttgggaggct gaggcaggca 17821 gatcacgagg tcaagagatg gagatcatcc tgaccgacat ggtgaaaccc tgtctctatt 17881 
aaaaatacaa aaattagctg ggcgtggtgg caggttcctg tagtccatgc tacttgggag 17941 gctgaggcag gagaattgct tgaaccaggg aggcggaggt tgcagttagc cgagaccacg 18001 ccactgcact ccagcctggc gacagagcaa gactccgtct caaaaaaaaa gaaaaaagtc 18061 aggtggcctg agttccaccc ccaatctatg aatgtgggcg gacgggatat tcagctctca 18121 gagcctcctg tcctctgtac atggatagca tcggtaacct cagtgcttgc tgtaggcaag 18181 gcttactcac tttacacctt agatccacat gggagggagg tgctgttatt atcccatttc 18241 atagaggagg aagctgaggc tttgaaagca gcagtgatgc aggaatccaa ggcaccgtga 18301 gacagcagtg cacgcgtggt ttcgagctaa tatggtacct gcataaaaac cacatgttgg 18361 gcaagcccat aaatgtgtaa aatcataaag aagaataaaa tagtcatgtg ccagcagagg 18421 gaagcggaca tttactaagc atttacaaca tgccccacac tctgaatttt ctcacttgaa 18481 actacagtgg ttcttgggtg tagatctcat ggtcccattt tcagatgggg aaacacaggc 18541 ttagagtggc tcagccatct gcccaagtcc catggctaat gacagccagg acttgaaacc 18601 tgaaccctgg ggttgaaggg cgtgagaaag agcagtcccg gaggcacccg ctgagctccc 18661 ctgccaagtg gggggactct ggacaggcct acgctcctga cccaggtttc gtctcctccc 18721 aggttccctg accattgctg agccggccat gatcgccgag tgcaagacgc gcaccgaggt 18781 gttcgagatc tcccggcgcc tcatagaccg caccaacgcc aacttcctgg tgtggccgcc 18841 ctgtgtggag gtgcagcgct gctccggctg ctgcaacaac cgcaacgtgc agtgccgccc 18901 cacccaggtg cagctgcgac ctgtccaggt gcgtaggctc cgcggccccc cgaggctggt 18961 cccggtggtg ggtggagacc acccttctgg ggctcatacc tgaccaggct tccttgactg 19021 ggcagagtgg tgtgtcgaag gacattcctg atttgttcct gggctctgtc cagagaggcc 19081 cactcttggg aagctcttcc taggcaggtg gactgaggcc gaccatgagt cctctgaggt 19141 ggctcaggcc actgtagcca aaagccacag acagaccccc cgggctgggc gaggtgtgga 19201 gccaacgtga agtagggatg tcacctcctt tacaaggaca aaatcgcaaa acgtcccttg 19261 aagagccgta tccgcaaaca tttatctgac actcagaatg ttttggggag ctactcaggt 19321 tcccccaaag ctctccctat tcagggctgt cttgagaaag gctcagatgg gtggccctgc 19381 atctttgaca cctgcttgtc tctctttgac aagctgtgtc tttaataaag agagtgcaga 19441 gaggcccacc aggtgataga cactgcggag ctggggcagg tcccctgaca acagtggaat 19501 gctgatcctg cttttcttcg gagggacggg ggatacatac actggactca 
gggaaaattc 19561 tagagtagag gctgtctcag ggagcaccct ggtctcaccc tagtcggggc agagtcagag 19621 aagctgctgc ttctgcttct tgtggaaaga ttcctgcacg ccccaggcca gcctcgccca 19681 caggctgggc agtgcgcatt tctctagctc agcaccggcc cttgcccgat gctctgcctg 19741 cttagccgca cttccttctc ctgccaccag cccgtggcct ctcctcctgc ccaccttccc 19801 tccctggctt gggccccacg cctgagacta cgaatgtcac tccaccccac tgcccactct 19861 cgtggtatct cctgtcctcg atgcttccag ctctatggat ggacacctga catctgacct 19921 ccccgttccc acctccctcc tagatgaagc ctccccccac ttccttccag acaaccatct 19981 cccgcctctg ccgcagccct gaccttggct ggcgctccgg gaatgaggat accgcaggcc 20041 ccagctcaac ccggaaatgc ctttctggct gtccctctct gggggcactg aggggggccc 20101 caggtcggga gggcttgttt tgatggaaaa gctacgagaa gggcagaggt caaggtcctg 20161 ctattgtttg ggccaaagct tttgccccac agcctgctct ccgggctttc gaggaaagat 20221 tgctgcaggg gacaccgggg tgggaagcgc aagccattcg ggcttcccgg gctcctggct 20281 gtgtggtctc tacagaggca ttttgtggct ctgtcctcca ggtgagaaag atcgagattg 20341 tgcggaagaa gccaatcttt aagaaggcca cggtgacgct ggaagaccac ctggcatgca 20401 agtgtgagac agtggcagct gcacggcctg tgacccgaag cccggggggt tcccaggagc 20461 agcgaggtaa ccacctttcc aggctcagcc ctcagccccc ttcccctgca gcagctgggg 20521 gcaggggccg gtcttaaaaa tggaaatggg atcaggcccg cccccggtta aattcacggg 20581 aggctttctt ttcatcttgc tgagacacaa gccaggttcc tccctggcct gagagcccta 20641 catgaggatt agcccctggc tggctctctg gtctctgcca accagtaccc ttcacctgtg 20701 ccacccccgc cgggcaggcc tctttcagta actaatggag tttctgcctc aaggctttag 20761 cacacactgc tgtggctgaa gaacacttct cccagctctc cacaagccta gctccttctc 20821 ttcattagag tctcagctca gaagtcacct cctcctagaa gtctgccctg accaccccat 20881 ccagagcagc gccatttcca gtcttgctac catagccccc tgctttgttg gatgtgggga 20941 tgggatggtg gaaggaatct gggcctggat gagagaaagc tccgcgcaca tcaacttccg 21001 atagtgagca gcttgtcctg gggatgagaa atcaaacgaa tttctcgtat ccttgagcgg 21061 gggaaccagg accagtgggg agaagccact ggagagagat ttcagctcca tatgcagaac 21121 agctagctgg tgatcagtca ggggtgcaag gtggaattgg cagcattgtg aggcatctct 21181 acaaaaaata caaaaattag ccaggcatag 
cggcacacgc ctgtagtctc agctacttag 21241 gaggctgagg tgggaggatc gcttgaggct gggatgtcga ggctacagtg agccgtgatg 21301 gcaccactgc actccagcct gggcaacaga gcaagaccct gcctcaaaaa aaaaatgatt 21361 aatacgatta gatgagcgtt taaaaaaatc acgtaagtga tagttctttt tcagtaactt 21421 tctattgtta gggacaggac aatcttttca ttctcatttc ttattaacaa gaccctgaag 21481 acggagccat ttcttggcat ctgtgtctta ttcccagagc tgagctgagt cgctggatgc 21541 catggaaagg ctgatcatat gacagcttca aactgacatg ttgtaaatta ttgagttaat 21601 atggagggtg atttctctat taagggaaaa ttatatatat ctatatatat atatattttt 21661 tttgaggcag tctcaccctg tcacgcagtc tggagtacag tggcacaatc ttggctcact 21721 gcaacctcta cctcctgggt tcaagcaatt ctcccgcctt agcctcccga gtagctggaa 21781 ttacagacac atgccactac acccggctaa tttttgtatt tttagtagaa acggggtttc 21841 accatgttgg ccaggctggt cttgaactcc tgacctcaag tgatccacct gcctcggcct 21901 cccaaagtgc tgggattaca ggcgtgagcc accacgccca gcctctttct tttttaagag 21961 acagggtctc tctccatcac ccaggctgga gtgcaatggc atgatcatag ctcactgcag 22021 ccttgaactc cttcctgggc tcaagcgagc ctccctcctc agcctcctga gtagctggga 22081 ttacaggcat gtgccaccac gtccagctaa tattttaagt ttttgtagag atgaggtctt 22141 gattcgttgc tcaggctggt ctctaactcc tgggttcaag tgatcctccc acctcagcct 22201 cccaaggtgc tgggattaca ggcattgagc caccacgcct ggccttcttt atgtttagag 22261 tgagttactt catgtatttg tttgacgtaa ttttctaaac cagtttcttg ggaggcgcat 22321 tataccagtg aaacgtagat gatgtaaagt cccatcattt tagttccggt attttggttg 22381 agtatgacgg tgtttctgtc acctgtccat gggcctggct aaggcttggt agtgtggctc 22441 tctgtgataa tgtggtcagt gtcagacctc ctgggctcag atctgcgtgt tgtcacttga 22501 gctgtgtggt gaccttgtgc ttgcctgggc ttggtatagc ccagtgtcct gctgatggcc 22561 caggctctgg ggaagaaact ccctgggccc ttcagtggcc ctcctggtgt cttctgtctg 22621 aggggatctg aagcccattc aagtccctga actgctagaa aatctgcgca attgcagaga 22681 tgggagagct gcctcgtggg agctggtgtc tgtaaggtgg gaggtcaggt gtcagatccc 22741 ggtaggctgg gagagctcaa aaggctccca ggaaatggct ggccttgagc tcagcagaca 22801 cagctgtccc tgccagggtg gccacctggg aatccaggac ggagagaagt gtataagagc 22861 tccccttgac 
tgaggctcac aggcactgtc gtttgtgctg agtggttgga ggtgtgcact 22921 ctagaggcag atccctgagt tcaaatccca gtctctctgc ttcagtttcc tcatctgtaa 22981 agtgtgggta acagcatctg ccggtaacag cacctgccgt gaagggaaat gagcatagaa 23041 cagcaccagg tacatggtaa acactagctg gggggtttct ggccatttcc atgtattgtc 23101 tcctaacgct aggaggccag ggctgctgtc ttctctgctt taaagatgtg gaaatcggct 23161 gggcgtggtg gctcacgcct gtaatcccag cactttggga ggccaaggca ggtggatcac 23221 ttgaggtcag gagttcaaga ccagcctggc caacatggtg aaaccccatc tctactaaaa 23281 atacaaaaat taaccaggtg tcatggcgta tgcctgtaat cccagcactt tgggaggcca 23341 aggcaggtgg atcacttgag gtcaggagtt caagaccagc ctggccaaca tggtgaaacc 23401 ccatctctac taaaaataca aaaattaacc aggtgtcatg gtgcatgcct gtaatcccag 23461 ctactcggga ggctgaggca ggagaatcac ttgaacctgc gaggtggagg ttgcagtgag 23521 ctgagatcgc gccactgcac tccaacctgg gcaacagagt gagcctccat cttaaaaaca 23581 aataaataaa gatgcggaaa tcgaggcaca gggcagttaa gtgacttgct ccaatgtcca 23641 cgatacgtgg gagagcccag atttgagccc aggtgggtca gatcctcagc ctgagttgtg 23701 gcccatccca ctctaccatt tctcagagtg gcaaggggca ggtaggtgac aggcacaact 23761 gacccgaagg cccagcttca tggtgggagt atgattctgg cccagagccc agtggcctcc 23821 acaccacgtg atcggacggg gctgagaggt tgggaggaaa cctcaatagt aggtcaagct 23881 aaggaaaata agaaaaaaaa gtgagtcgta gcagctgcca tgtgctgacc acttcatact 23941 tggagctttt tattatcttt cccaggtctg ggggttgact gggctcagct gggcagttct 24001 attggcccct ctattggttg cagtcagctg gtggcctggg ctgtggtcat ctggagactc 24061 aacggggaca gctgggcctc ttttcctctc cctgtagtcc agggcctctg gctctacggg 24121 gagagccagt ccccaccagg cctctccatg tggcctctcc agcaaggtag ccagacctgt 24181 tatgtggcag cccagggtcc cccaaagcat gaaggtgaga actgccagcc cttcttaaga 24241 cctctgcctg gaactggcac ggcaccacgt ctgcctcttg ttggttaacc agatccaact 24301 ccaggccaga ttcaaggggc agggaaataa actctattgg tcagtgggca atgacaaaga 24361 acttgaagcc atatttaatc tacccggctc tgtgtccagt actgtgcaaa gagctttgcc 24421 agcatactgt catttattcc tcccaccagt tctatgaggt gggcagtctt attaccccca 24481 ttccacagat ggaggcgcca aggctcagag agtttcagtc attgtcccca aggcacatac 
24541 agctggtgag cagtggagtt gggatccaaa cccaggaccc tggtccagat tcccctcctg 24601 tccctttcct gagcccagaa ggtccatggc aggccttggt cagtggggag agacctcccc 24661 aatggtccac atgctgacga ggtctttctt tttcttgtgc agccaaaacg ccccaaactc 24721 gggtgaccat tcggacggtg cgagtccgcc ggccccccaa gggcaagcac cggaaattca 24781 agcacacgca tgacaagacg gcactgaagg agacccttgg agcctagggg catcggcagg 24841 agagtgtgtg ggcaggtgag ggccaggcgg ggcaactgag gcccagaatc ctgtgttttc 24901 ttctctgtgg tctgtggatg gggtcagggg aaaggatttt tccatagaaa atccagcctc 24961 tgagccttgg tcctccctgt ccagttatca gctggggcag ggtgggtggt gggctttgga 25021 gaccccagcc cagctgtgtc ccagcatgcc ctaagaggat gggaggtcta aatctcaatc 25081 ttcccagccc ctcaatgccc ccagccctga gcattgctgg aaatcagggg tggctagttg 25141 accccagctc cctgcccctg gcatcgggga catatgaaca tgggatggac cctctaggct 25201 gtctctgacc cctgtgtctc tctgcttggc ctgcagggtt atttaatatg gtatttgctg 25261 tattgccccc atggggtcct tggagtgata atattgtttc cctcgtccgt ctgtctcgat 25321 gcctgattcg gacggccaat ggtgcttccc ccacccctcc acgtgtccgt ccacccttcc 25381 atcagcgggt ctcctcccag cggcctccgg cgtcttgccc agcagctcaa gaagaaaaag 25441 aaggactgaa ctccatcgcc atcttcttcc cttaactcca agaacttggg ataagagtgt 25501 gagagagact gatggggtcg ctctttgggg gaaacgggct ccttcccctg cacctggcct 25561 gggccacacc tgagcgctgt ggactgtcct gaggagccct gaggacctct cagcatagcc 25621 tgcctgatcc ctgaacccct ggccagctct gaggggaggc acctccaggc aggccaggct 25681 gcctcggact ccatggctaa gaccacagac gggcacacag actggagaaa acccctccca 25741 cggtgcccaa acaccagtca cctcgtctcc ctggtgcctc tgtgcacagt ggcttctttt 25801 cgttttcgtt ttgaagacgt ggactcctct tggtgggtgt ggccagcaca ccaagtggct 25861 gggtgccctc tcaggtgggt tagagatgga gtttgctgtt gaggtggctg tagatggtga 25921 cctgggtatc ccctgcctcc tgccacccct tcctccccac actccactct gattcacctc 25981 ttcctctggt tcctttcatc tctctacctc caccctgcat tttcctcttg tcctggccct 26041 tcagtctgct ccaccaaggg gctcttgaac cccttattaa ggccccagat gatcccagtc 26101 actcctctct agggcagaag actagaggcc agggcagcaa gggacctgct catcatattc 26161 caacccagcc acgactgcca tgtaaggttg tgcagggtgt 
gtactgcaca aggacattgt 26221 atgcagggag cactgttcac atcatagata aagctgattt gtatatttat tatgacaatt 26281 tctggcagat gtaggtaaag aggaaaagga tccttttcct aattcacaca aagactcctt 26341 gtggactggc tgtgcccctg atgcagcctg tggcttggag tggccaaata ggagggagac 26401 tgtggtaggg gcagggaggc aacactgctg tccacatgac ctccatttcc caaagtcctc 26461 tgctccagca actgcccttc caggtgggtg tgggacacct gggagaaggt ctccaaggga 26521 gggtgcagcc ctcttgcccg cacccctccc tgcttgcaca cttccccatc tttgatcctt 26581 ctgagctcca cctctggtgg ctcctcctag gaaaccagct cgtgggctgg gaatggggga 26641 gagaagggaa aagatcccca agaccccctg gggtgggatc tgagctccca cctcccttcc 26701 cacctactgc actttccccc ttcccgcctt ccaaaacctg cttccttcag tttgtaaagt 26761 cggtgattat atttttgggg gctttccttt tattttttaa atgtaaaatt tatttatatt 26821 ccgtatttaa agttgtaaaa aaaaataacc acaaaacaaa accaaatgaa tccgccggag 26881 gtctgtctgt tggcatcgtg cgtgacaatt aacctttctg ccttggcagg atgtgccgac 26941 agcttgcggc gtgttcctct cactctggga gcctcaggcg tgatctcaca cactggcgtg 27001 cacatacaca cacacacaca tacatgctca cacatgcgtg cacatacacg caggcctgca 27061 acttggggga ggcctctgtc tggcgggaag aagagacaca caggctactc tgttggtctt 27121 ggtcctggca cagctcctga cacgtggact tgtgcgtgtc tctggcagtg acgagagatg 27181 ggtttctgca gcctgagggt ctccatagag gaagtggatc ctcagcatcg tggggtgggc 27241 tggatgtggg gagggagtgc ccactgtggg gaacactgaa ggaagtccca gaggggatgt 27301 gggatttaga aggaaacagt ggaaagccgg gtctgcggtc tggtcttggc ttagccctta 27361 gccagctgtg tggccttggg caagtcacag acccactctg gccagagagg atgaggaaag 27421 gtgcgtgggc tccaagctcc tgtgttctgc gctttctctt gctcatgaag acttgggtac 27481 cagcccataa aaaagaggct aaatgtattc taggtcccac ctgcagcgtg gactggggct 27541 gagagggacc cagaagctaa agatctggct cataagggca atctaaaact aaaaccattt 27601 tgccacctga agtagtgagc tccctgtcac tggaagtgtg taagcgaaaa ctggggaaca 27661 atgtggctga ggttagactg aataacattt gggctcccag caattgattg atcaatggac 27721 tgattcattc cttcactcac tcactttctt gaacctctga catgtgcctg tcacggccag 27781 gagattttat agtcatttga tttgaattca tcctggggtt cctagccctg cccttgagga 27841 agtagagtct acccatggtg 
gtagtgggcc gtgcacatca gagggatctg tttacctggc 27901 agctcctacc cctgcctgcc caatcgtgac tggggccttc ccttctcccc taaatcccca 27961 gtgttctcag tttccagggc acaggagaag gatctggctg ggaggaggaa gagacaactc 28021 caccaggaga cggacatcag aagtcagctg gagagcccag tgcgtgcagc atcctggcag 28081 ctaccaacag ctggtctgtg ccatggattg ctgcaggcac ttccgaggcc tgggtgccca 28141 tttcctgggt gccatcccat acagcagcgc agcagagtcc gtgggccttc caggggcccg 28201 gtgggctttt tcaaggtctt aatgttgcca ggctaaaatg gcattcttcc agaaacccct 28261 ttgctggaga gaaagcattc ttttcttcat gcctttccac tgcactgttg gctgtaatac 28321 agatgtgatg tctggtgctc aagcagccat gttgagccat gaggtgaaag ccacatgctg 28381 aggattatgg agcaagatgg aaggcagggt ccccaatacc tgggctgcct tgttcaggag 28441 ctttcacgga agaaagaaac ttctttctgt ttgagctact ctggttttgg gttttctgtc 28501 ccttgcaact gaacctaatc ctaatagccc cgtttacttt gggagagaga gaacatggct 28561 gctgggctgg tccttttctc tcctaccctc ctcaggcact cagcttcacc ttgcgaactc 28621 ttaggaactc tcctctgttc actcagatcc tcatttcccc aaggtgtctt cttggtcccg 28681 gccctcagcc ttcaaccaga gcgtggagct gtagctttct ttggacagca atttttgctt 28741 ttgtaaaata ataaaatgtg aaacatttac tgctactaaa gtaaactatc cgtgagctcc 28801 tagtacacat aagaccgttt ttgagggatg aaggagataa cagacaggac agggacagca 28861 cgcaggctta gggagggcac gcaggcttag ggagagcatg cagctgctgg tggctaggtc 28921 caagctccac cctttcctcc tctgctgaca ccatgtctgg tgtgtttcca gtattttaaa 28981 gtgtcagctc aggaacccca gaaccccaac acctgtccat gtacttgggt gacagccttg 29041 acccggccac attgttactg atgatcctgg ggaagtccct gcttctcccc aatctcagag 29101 gagctgtgac tcacccctat gctgccttca ggcatccatg caagtgacac ccatcctgtc 29161 ccacctgccc ctgccccaag cccccctttc tcactctgct ctgttgatct ctgtagcatc 29221 tggcctcatc tggctcactc cacgtgtata tctctcctca tctcccttct ccactcccca 29281 tggaaactcc acctggagtt accttgttca ctactctgtc ctcagagcct agaacagtgc 29341 ctcgtaccta gtaggcactc gaccagcctg tcaaatgaag caacatacaa atctgttatt 29401 ggagcaatca agtatgttgg ctccaagtca cagcctgcct ggctggtctt catcacacac 29461 actatggtgt ggctggaaaa tgcaagactc tgagtgtgtc actacagata aaatatttca 29521 
acttcttggc aagctccaca cttagcacac cctcttcctt gattgggaaa tcctggaggg 29581 ttgggatcat gtttaattca ccttctaagt ctatctagcc attgtctctc tccatgtcta 29641 tacttcttca tctacccacc tgtccattca cctacccgcc cacccaccca tctgcgtatt 29701 tatccatcca tccatccatc catccatcca tccatccatc cacctgtcca tccatccacc 29761 catccacccg cccatccacc cacccatcta cctatccgtc cgtccaacca tccgtctacc 29821 tgcctacctg tccatccatc catccgtcta cctgtccacc tgtccatcca tccatccgtc 29881 catccatccg tgcatccatc catccatcca tccatctacc tgtccatcca tccatccatc 29941 tgtccatccc tctgtccgtc tacctgtcca cctgtccatc catccatcca tccacctgtc 30001 cacctgccca tccatccatc catccatcca tccacctgtc catccatcca tccatctatc 30061 catccatcca tccatccatc catccatcca tctacctgtc catccatcca tccatccgtc 30121 catccatctg tccatccacc tgtccatcca tccatccatc catccatcca tccatctacc 30181 tgtccatcca tccatccatc catccatcca tccatccatc ctctttcttc ctcccttttc 30241 tatttttcac cctcacttac tcccaggctc tgttgttgtc accttgttct attacctctg 30301 aggaggttgg tgccacagag gagacaacac atcagacccc cattgctcac tccttctgtg 30361 tgaccttacg gaactgttgc acaggttaga gacatgagga ttgagcactg cccctggcac 30421 atatttggcc tttggtcaat ggtaggagtt gtcatctttt tcctctcagc atcctgaaac 30481 tggctgatca cagcatggtg gactaaagat ggaaacaatg gaaagaatga acacaaaatc 30541 acagaggatt aagagctcag ctgcagtcag tgtgtgcatg catactcagg caccgtcccg 30601 ttattctgct caattaagtt gatgatactt gtcatgtggg gcaagtgagc cccagattgg 30661 ggcttacccc aggaaggttc ttggctttgc tcaggaaaga atccaagagt gagctgatag 30721 gggaagaaaa catctctatt gaagtggaag tgttaacagt tctgagactg ctcctgcaga 30781 gcagggctac tccaaggtca gtatgctgag ggtagcagct caggagcaat tatgcagtca 30841 tatttatacc cacttaatta tatgcaaatt acggggaaaa ttatgcagaa atttctagat 30901 aaagggcagt aacttccggg tcatgggtcg ttgccatgga aagaggcatt aatgtctgga 30961 tgttgccatg gcaatggtaa agtgacaagg cacactgatg ggtgtgtctt atggaaaggt 31021 gcttttgcct cttccttgtt ttagctagtc ctcaatctgg tccagtgtcg agtcttgcct 31081 cctgcctcac ttccagtcct cctgttacag atgaagaaac aagggaaaca agggctcagc 31141 aaggttcacc cccaccaccg gcttcccgtt acacagttat ttctcctgta 
aatgtggtaa 31201 acgaccaagt ccacagtccc cctgtgggga gattcacggc gcagactcca ggtgtaacgt 31261 caggtcttcc ttcattgagg gaggtctggg tttgcaaact aactcgggtc tgggactggg 31321 ggagggaatg cgtgcctctg tccgggggac agtgggggag ggggtctcaa gtcggacccc 31381 tgtttaccaa acccaggcct ttctggttgt aaaagccaca ggtccttgta gggtcccatc 31441 catttttgcc cttgggaact tgtgatccat tagatccacg aatcccttag atccacgaat 31501 cccttagatc cacgaatccc tgttggtcag tctgttggga cagcgacctg gtcctgtgcc 31561 tgttttttct ttttaaatta ttattatttt ttttattaca ctttaagttc tagggtacat 31621 gtgcacaacg tgcaggtttg ttacataggt acacatgtgc catgttggtt tgctgcaccc 31681 atcaactcat catttacatt atgtatttct cctaatgcta tccctccctc agcccccaac 31741 cccacggcag gtcctggtga gtgatgttcc ccaccctgtg tcaaagtgtt ctcattgttc 31801 aattcccacc tatgagtgag aacatgcggt gtttggtttt ctgtccttgt gatagtttgc 31861 tgagaatgac ggtttccagc ttcatccatg tccctgcaaa ggacatgaac tcatcctttt 31921 ttatggctgc atagtattcc atagtatata tgtgccacat tttcttaacc agtctaacat 31981 tgatggacat ttgggttggt tccaagtctt tgctattgtg aatagtgcca caataaacgt 32041 acgtgtgcac gtgtctttat agtagcatga tttataatcc tttgggtata taccagtaat 32101 gggatcactg ggtaaaatgg tatttctagt tttggatcct tgaggaattg ccacacagtc 32161 ttccacaatg gttgaactaa tttacactcc caccaaccat gtaaaagtgt tcctatttct 32221 ccacatcctc tccagcatct gttgtttcct gactttttaa tgatcaccat tctaactggt 32281 gtgagatggt atctcattgt gattttgatt tgcaattctc tgatgaccag tgatgatgag 32341 cattttttca tgtatctgtg ggctgtataa atgtcttctt tggagaagtg tgtgttcata 32401 tcctttaccc attttttgat ggagttgttt gttttttttt cctgtaaatt tgcttaagtt 32461 ctttgtagat tctggatatt agccctttgt cagatgggta gattgcaaaa attttttccc 32521 attctgtagt ttgcctgttc actctgatgg tagtttcttt tgctgtgcag aagctcttta 32581 gtttaattag atcccatttg tctattttgg cttttgttgc cgttgctttt ggtgttttag 32641 tcgtgaagtc cttgcccatg cttatgtcct gaatggtatt gcctaggttt tcttctaggg 32701 tttttaatgg ttttaggttt aacatttaag actataatcc accttgaatt aatttttgta 32761 aaaggtgtaa ggaagggatc cagtttcagc tttctacata tggctagcca gttttcccag 32821 caccatttat taaataggga atccttcccc 
cttttcttgt ttttgtcagg tttgtcaaag 32881 atcagatggt tgtagatgtg tggtgttatt tctgaggcct ctgttctgtt ccattggtct 32941 atatctctgt tttggtacca gtaccatgct gttttgatta ctgtagcctt gtagtatagt 33001 ttgaagtcag gtagcatgat gcctccagct ttgttctttt ggcttaggat tgtcttggca 33061 atgtgggctc ttttttggtt ccatatgaac tttaaagtag ttttttccaa tactgtgaag 33121 aaagtcattt gtagcttgat ggagatgaca ctgaatctat aaattacctt gggcagtatg 33181 gccattttca cgataatgat tcttcctatc catgagcatg gaatgttctt ccatttgttt 33241 gtgtcctctt ttatttcgtt gagcagtggt ttctagttct ccttgaagat gtccttcaca 33301 tcccttgtaa gttggattcc tagggaattg tgaatgggag ttcactcatg atttggctct 33361 ctgtttgtct gttattagta tataggaatg cttgtgattt ttgcacattg attttgtatc 33421 ctgagacttt gctgaagttg cttatcagct taaggagatt ttgggctaag acaatggggt 33481 tttctaaata tacaattatg tcatctgcaa acagggacaa tttgacttcc tcttttccta 33541 gttgaatacc ctttatttct ttctcttgcc tgattgccct ggccagaact ttcaacacta 33601 tgttgaatag gagtggtgag agagggcatc cctgtctcgt gccagttttc aaagggaatg 33661 cttccagttt ttgcccattc agtatgatat tggctgtggg tttgtcataa atagctctta 33721 ctatttatta ttcatattct tattatttat atttcactaa tccctgttgg tcaggctgtt 33781 gggacagtga cctggtcctg tgcctgtttt tatttttttt atttttttta ttttttattt 33841 tttagacaga gttccgttct tgtcacccag actggagtgc aatggtgcaa tctcagctca 33901 ctgcaacttc tacctcctcg gttcaagcaa ttctcctgtc tcagcctccc aaagtagctg 33961 gcgcctgcca ccatgcccag ctaatttttt gtatgtttag ttgagacagg gtgtcaccat 34021 gttggtcagg ctagtctcga actcctaacc tcaggtgatc accctcctcg ggctcccaaa 34081 gtgctaggat taaaggcgtg tgccacagca cccggccctg tgcctgtgac agtgatgcac 34141 ccccaccacc cctggcagtc actgctgcct cgggcttctg tcatctcagg tcttggtttc 34201 cactccagaa atccagaggc aggccactgg ccagacacct tccctggaca ctaacacatt 34261 ccgaagggtg ttgggccctc tgcaaggtgg ctcgccgtgt gggtgtaggg ccggtacttt 34321 gggtccttgg gcattgtggt aaagggtcat ccatgctcag tccattcctc tggctctgtc 34381 atgccttggt gtctctgtca tctccaggtt gggggggggg ggggcaacct ctgcgttggc 34441 cttcacctca cctgccagcc gacaaagcag tggtagaaac aatcagggcc gccccccgct 34501 gatgaagttg 
gcaccttgga tgcagggccc cctgttggtg gagcagcatg gatacattca 34561 gccccatcca aagctgcatt cgctcctcct tgacgtatgt cctcagaacc cactcacaca 34621 tctcccccag atgtctgatg ataggaaatg acattctaca caggttttgg tttgagattg 34681 ctatttaatc tcgcccctca gggcctcctg ccacctgact ccagctgaag actctgagga 34741 tgggggccac atgagggtga ttacagggtc ccttcttcgg caggaccagc cgcatcagac 34801 agcgaagagg agctgctcag ctggcaaggg gacttggcgg ggagtgaggg ttcagggtgt 34861 tccaatcagc cagaatcctc cttagagctg gtgaggccat ggcatcgctc atgatgtaat 34921 tcaccattct gtaggctcga attttgagtt tcatttttgg ctgtctcagt cttaccacag 34981 tgagaattag gaaacatttt tactgcagtt atagaagtcc ttcgttttct ttcttccctt 35041 attttttatt ttttagtttt tttgttttta agacaggttc ttgctctgtc actcaggctg 35101 gagtgcagtg gtgtgatcaa agctcactgc agtctcgatt tccccgactc aagccattct 35161 cccgcctcag cctcccaagt agctaggact acaggtgcac acaccatgcc tggctgatat 35221 atatattttt aaacttttta gtagaaaaaa gtaaaaacag ggtctttact tagtagagat 35281 gggggtctca ctatgttgcc caggttggtc tcgaactcct gagcaaaagt gatcctcctg 35341 cctcggcctc gcaaagtacc gggattacag gcatgaggta ctgcacctgg cccagaagtc 35401 ctcagttttt gatctgggtc ttgagccaag aatttaggaa tttaggagac gggcttctgc 35461 atctttaaca gtgcagtgct gtagagccca ttcagtctaa ttcctcattc ctttcattct 35521 tttctaaggc aatgaattct caatgaagct ttgttttcaa cgggctcttc cttcccggtg 35581 actccaggtg gcaaatctga cattctgctt cccctgcatg ccatggcttc cagatctgag 35641 cccttatgat gtgctgaatt tttactgact tcagccccaa gcagtccaga taaccaattc 35701 cagcttctcc gagcttcctc caacatccac tgcctgtaac cactcctgac accaacttct 35761 atataaggca aggttcttag ttgcaggtaa cataaaccag ctccggttaa cttataaaga 35821 aacggatgtt caaagaatgg atattgaaaa ggatattcaa agaattccag gtgactcatg 35881 cattgctgag caggcttgga gacctggttt tgtggtatgc aaacaggatc agtcctctcc 35941 aatgcagcct gaggtcagtg ctctgaacac ctgtttgaca ctgcagcccc tgtgactggc 36001 accttccaac tgaataccca gcactgggca ctgtggctcc acggctcagc cagggagacg 36061 ctctgcttct ttgcagcact aattttgttg ttcagtcttg agttggtgtg cctgactggt 36121 gagcctcaca gacttcagct gtgtctgaga aagggcgtag tggggatttt tagtttctag 
36181 gcgggcgggc ggggggtggt ccctgggtcc acagcctaca aggtgcggtt cagacggtgt 36241 gtggtcaaaa caacgaaacg taagtggaat gtctgtttca ttctgtacca cgtttcttcc 36301 atttcttcca gctgagcaca ggatgaggct gcctgcccag atagctgtcc cttctgactt 36361 aggatctctt tggctgtaca ttgaaggatg aagaagtgtg tagctgtttt acctatgact 36421 ggattcccag atcctagctc agctgcggac acaccatggg cactcacttc atgttgttag 36481 gtgatggaac atgtcttgtg gggacaggac taagggtgag acacctgccc caggtgcaaa 36541 ggttaaaggg gcaccaaaaa ttctcagtga tcaaaataag tcatcatttc atgcaatatg 36601 tatatatttt ttgaggcaga gtcgtgctct gtcacccaag ctggagtgca gtagcgtgat 36661 ctcagctcac tgcaaccttc gcctcctggg ttcaagtgat tcttgtgctt cagctgcctg 36721 cgtagctggg accacaggca catgccacca cacctggcta atttttgtat ttttaggaca 36781 gacagggttt tgccatgttg gccaggctgg tctcaaactc ctgacctcaa tgatctgcct 36841 gcttcagccc cctgaagtgc tggaattaca ggcgtgagcc accacgctca gcctcatgca 36901 gtattttaag aaatcaaaat taatgcaaac aagttcacca cgagccagtc cgggagtggt 36961 ggctcatgcc tgtaatccca gcactttggg aggccgaggt gggcagatac ttgaggtcag 37021 gaatttgaga ccagtctggc caacatggtg aaaccccgtc tctactaaaa acacaaaaat 37081 tagccagaca tggtggcggg cgcttgtaat cccagctact tgagaggctg aggcaggaga 37141 atcgcttgaa cctgggaggc ggaggttgta gtgagctaag attgcaccac tgcactcaag 37201 cctgggtgag cgagactcca actcaaaaaa aaaaaaaaaa aaaaaaaaaa gaaagaaaga 37261 aaaaaaattt accatgagcc aaaatttcaa aatttcactt caagacagga gttgaaacga 37321 gcacttacac aatgcctcgc tcgcctcact ctaatcttgg ctttgtcaga tcctgtcttg 37381 ttttaaaatg ttgtgctccg ggtgtgtgcc tggcttgcct caccgcagcc cgggccctgc 37441 tgacccagct ccatgttcac tgtcctttct tcttccttct gtggccttgg ttttgggtcc 37501 tcactctctt ttgacccaac cattgagaaa ctcctctggg gttccttctc ctcaccccgg 37561 actatcccac catgagccac gtcactctct gctgccccct taggcttctt gacacagagt 37621 agtcaagcca ctcccttgct cataaccttc tatgacttct gaccccaggc gaggctcaca 37681 ctcaccaaca aggcatgcag aaccctctgt ggtcagccca gggaggcttc ttcctcccaa 37741 gcctccccgc ctgtcacctg ccctcctcct ccatatccag gccgtgcctt tctccacatc 37801 tgtacttcac tgactatctt caaagtcaac ctcaaattca 
ctagaaaacc gcccctggag 37861 gaatgtaaaa tggtatagcc actgtggaaa acagcagggt ggttccccag aaacattcac 37921 acagaattac gatatgaccc agcaatccca cttctgggta gataccccaa agaacttaaa 37981 acagggtctc gaagaaatat ttgcacaccc aggttcacag tggaatgatt cacaatagcc 38041 aaaaggtgga agcatggccg ggcacggtgg ttcacgcctg tgctcccagc actttgggag 38101 gttgaggcag gcggatc // LOCUS HSN44A4 40662 bp DNA PRI 15-APR-1997 DEFINITION Human DNA sequence from cosmid N44A4 on chromosome 22q12-qter contains 14-3-3 protein, ESTs and CpG island. ACCESSION Z82248 NID g1941922 KEYWORDS 14-3-3 protein eta-subtype; 22q12-qter; brain specific protein; CpG island; protein kinase C inhibitor; protein kinase-dependent activator of tryptophan hydroxylase; protein kinase-dependent activator of tyrosine hydroxylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40662) AUTHORS McLaren,S. TITLE Direct Submission JOURNAL Submitted (08-APR-1997) Sanger Centre, Hinxton, Cambridgeshire, CB10 1RQ, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone N44A4. This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone N44A4 is at 1 in this sequence. The true right end of clone N113A11 is at 16736. The true right end of clone N44A4 is at 40662. 
N44A4 is from the human chromosome 22-specific cosmid library LL22NCO3, constructed at the Biomedical Sciences Division, Lawrence Livermore National Laboratory, Livermore, CA 94550 under the auspices of the National Laboratory Gene Library Project sponsored by the US Department of Energy. The source of the flow sorted chromosomes was a human/hamster hybrid containing chromosomes Y, 22 and 9. VECTOR: Lawrist 16. N44A4 is part of a cosmid contig isolated using YACs and markers from the Sanger Centre chromosome 22 YAC contig described in Collins, J.E. et al Nature 377 Suppl., 367-379. FEATURES Location/Qualifiers source 1..40662 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11.2-qter" /clone="N44A4" /clone_lib="LL22NCO3" repeat_region 457..751 /note="AluSx repeat: matches 1..297 of consensus" repeat_region 1361..1641 /note="AluY repeat: matches 301..20 of consensus; incomplete repeat" repeat_region 1699..1872 /note="AluJo repeat: matches 294..120 of consensus; incomplete repeat" repeat_region 1884..2184 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 2693..2834 /note="MIR2 repeat: matches 5..146 of consensus" repeat_region 3862..4017 /note="MIR repeat: matches 12..191 of consensus" repeat_region 4028..4328 /note="AluJo repeat: matches 2..299 of consensus" repeat_region 4477..4679 /note="AluSp repeat: matches 1..203 of consensus; incomplete repeat" repeat_region 4742..4845 /note="26 copies of 4 mer 85 % conserved" repeat_region 5094..5153 /note="15 copies of 4 mer 93 % conserved" repeat_region 5159..5202 /note="MIR2 repeat: matches 75..28 of consensus" repeat_region 6381..6417 /note="MIR repeat: matches 114..78 of consensus" repeat_region 7377..7561 /note="MIR repeat: matches 216..35 of consensus" repeat_region 7866..8012 /note="MIR repeat: matches 236..81 of consensus" repeat_region 8132..8415 /note="AluSg repeat: matches 17..300 of consensus; incomplete repeat" repeat_region 8948..9054 /note="MIR2 repeat: matches 114..10 of 
consensus" repeat_region 9216..9284 /note="MLT1F repeat: matches 541..467 of consensus" repeat_region 9344..9634 /note="MLT1F repeat: matches 372..79 of consensus" repeat_region 9488..9700 /note="MLT1G repeat: matches 224..13 of consensus" repeat_region 9714..9829 /note="AluSx repeat: matches 2..117 of consensus; incomplete repeat" repeat_region 10112..10386 /note="AluJo repeat: matches 273..1 of consensus; incomplete repeat" repeat_region 10518..10919 /note="MLT2B repeat: matches 1..404 of consensus" repeat_region 10926..10969 /note="22 copies of 2 mer 82 % conserved" repeat_region 10975..11027 /note="MLT2B repeat: matches 392..444 of consensus" repeat_region 11868..12172 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 12379..12504 /note="FLAM_A repeat: matches 131..1 of consensus" repeat_region 13224..13385 /note="MIR repeat: matches 191..21 of consensus" repeat_region 13916..13995 /note="MIR2 repeat: matches 146..67 of consensus" repeat_region 14254..14290 /note="MIR repeat: matches 155..191 of consensus" repeat_region 14376..14679 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 15142..15202 /note="MIR repeat: matches 71..131 of consensus" repeat_region 15305..15604 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 16028..16148 /note="MIR repeat: matches 147..21 of consensus" repeat_region 16580..16721 /note="MIR repeat: matches 20..163 of consensus" repeat_region 17527..17602 /note="MIR repeat: matches 84..160 of consensus" repeat_region 18447..18531 /note="MIR repeat: matches 105..192 of consensus" repeat_region 18590..18887 /note="AluSp repeat: matches 2..303 of consensus" repeat_region 19184..19211 /note="14 copies of 2 mer 89 % conserved" repeat_region 19342..19629 /note="AluSq repeat: matches 299..1 of consensus" repeat_region 19630..19929 /note="AluSx repeat: matches 296..1 of consensus" repeat_region 19956..20102 /note="AluSx repeat: matches 159..302 of consensus; incomplete repeat" repeat_region 20110..20147 
/note="19 copies of 2 mer 100 % conserved" repeat_region 20152..20280 /note="AluJb repeat: matches 130..2 of consensus; incomplete repeat" repeat_region 20298..20356 /note="MIR repeat: matches 105..47 of consensus" repeat_region 20857..20953 /note="MIR repeat: matches 48..146 of consensus" repeat_region 20957..21091 /note="AluSq repeat: matches 1..136 of consensus; incomplete repeat" repeat_region 21098..21397 /note="AluSx repeat: matches 1..299 of consensus" repeat_region 21463..21631 /note="AluSg repeat: matches 127..294 of consensus; incomplete repeat" repeat_region 22539..22659 /note="MIR repeat: matches 2..128 of consensus" repeat_region 23337..23530 /note="MIR repeat: matches 49..246 of consensus" misc_feature 23633..24891 /note="Putative CpG island" prim_transcript <24259..>37289 /note="match: multiple ESTs; match: W65437 R56892 N79982 R82499 D54224; match: W97815 C03366 W91698 N21530 F04963; match: H25337 N20527 N71711 H67636 H21979; match: T11718 W61343 W41264 W47608 W75482; match: F08031 H23204 W16498 W46459 W46595; match: W85826 W85827 W46596 W59944 H11706; match: D82630 C21397 D54950 N90630 T28031; match: N31547 N29226 H82692 H79624 N69107; match: R25364 H69844 W94022 W33751 R84813; match: W47413 R74851 R69901 N99662 W65436" repeat_region 24265..24299 /note="5 copies of 7 mer 89 % conserved" CDS join(24419..24505,35825..36478) /codon_start=1 /product="14-3-3 protein" /db_xref="PID:e311853" /db_xref="PID:g1941923" /translation="MGDREQLLQRARLAEQAERYDDMASAMKAVTELNEPLSNEDRNL LSVAYKNVVGARRSSWRVISSIEQKTMADGNEKKLEKVKAYREKIEKELETVCNDVLS LLDKFLIKNCNDFQYESKVFYLKMKGDYYRYLAEVASGEKKNSVVEASEAAYKEAFEI SKEQMQPTHPIRLGLALNFSVFYYEIQNAPEQACLLAKQAFDDAIAELDTLNEDSYKD STLIMQLLRDNLTLWTSDQQDEEAGEGN" repeat_region 25470..25585 /note="MER5A repeat: matches 74..189 of consensus" repeat_region 28075..28360 /note="AluJo repeat: matches 300..1 of consensus" repeat_region 28825..29127 /note="AluSp repeat: matches 303..3 of consensus" repeat_region 29187..29337 /note="AluYa5 repeat: 
matches 296..146 of consensus; incomplete repeat" repeat_region 29371..29497 /note="MIR repeat: matches 163..10 of consensus" repeat_region 29763..29800 /note="19 copies of 2 mer 100 % conserved" repeat_region 30587..30879 /note="AluSx repeat: matches 296..1 of consensus" repeat_region 30906..30939 /note="17 copies of 2 mer 88 % conserved" repeat_region 33090..33190 /note="U13 repeat: matches 1..102 of consensus" repeat_region 34177..34626 /note="MLT1C repeat: matches 464..8 of consensus" repeat_region 35131..35427 /note="AluSc repeat: matches 299..1 of consensus" repeat_region 37956..38255 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 38566..38689 /note="AluJb repeat: matches 127..266 of consensus; incomplete repeat" repeat_region 38751..38774 /note="12 copies of 2 mer 96 % conserved" repeat_region 38801..39099 /note="AluSq repeat: matches 299..1 of consensus" repeat_region 39180..39247 /note="MIR repeat: matches 144..73 of consensus" unsure 39430..39497 /note="single clone area" repeat_region 39470..39768 /note="AluSq repeat: matches 1..300 of consensus" unsure 40029 /note="uncertain base" repeat_region 40208..40490 /note="AluSp repeat: matches 1..283 of consensus; incomplete repeat" BASE COUNT 10205 a 9485 c 10000 g 10972 t ORIGIN 1 gatcctaatt accatctaca tgcccacaag tctcaagatc tctagcccat cccacctctg 61 aattccaacc tatcagtcca gttgccttct tggcctctca ttagagattt aagagaacag 121 tgaacttgat gtgtctaaaa cacaactctt gatttagctc tgcacccaac ccccagctct 181 cacccaaacc tgttccaccc acagtcttcc cggatttagt aactgtacca tcatctaccc 241 agttattcag gcagaaagcc ttggaattat cctatatttc cccttccctc atcccccaca 301 tctaacccat tagcaaaaat atattgtaat ctgacccccc tccccatctc cacttccatc 361 tgatgcgaca caacgttaac tgtcctcttg caccctcctt cgccttcagg aatatattct 421 cacagcaaca gcatgccctc tttaaaatac agaacaggcc gggtgtggtg gctcacgcct 481 gtaatcccaa cactttggga ggccaaggca ggtagatcac ctgaggtcag gagttcaagt 541 ccagcctggc caacatggcg aaaccccatc tctacctaaa atacaaaaat tagccaggca 601 tggtgtcacg tgcctatagt 
cccagctact caggaggctg aggcaagaga atcactggaa 661 cccgggaagt ggaggttgca gtgagctgag atcacatcac tatactccag cctgggggac 721 agagtgagac tgtctcaaac aaacaaacaa acatagaaca ggttgtatca ttcccctgct 781 tgaggatcct tcagcaactt cccatcattt caggatgaaa cctcaactcc ttgccagggc 841 ccataaggcc ctgtgaggcc tgcctgggct tccagaagtt tccattctcc agtctcatct 901 cctcctactc agaatgatgc agcttttcct gttttttgtt ccgtgaacac cccaaggttg 961 cccttgcttc agggcctttg catgtctccc ctcttcttgg aaccctcttc ccaagattgt 1021 ggcttggttg gttccttatt gtgggtgtca ctcgttgcct tctcaggcct tccctgacca 1081 tcctgtctag aaagcagcac ctctgtgtcc tctcgtctct tccccatagc ccttaacatc 1141 atctaaaatt ttctgcatta tctgcctgct tctgcattgt cttttctgct cagtggaatg 1201 taagcactgg gactgttttt ggtccctgct gtagcctcag tgtgcctagc acgtagcagg 1261 cactccatgc atgcatgctt attgacagaa caaaccacat ggattaaggg cctaagttac 1321 aaaccaagtg tcttcgaagt tgttgtccct caaactcttt tttttttttt tttttttttt 1381 gagacggagt cttgctctgt tgcccaggct ggagtgcagt ggcacgatct cagctcactg 1441 caacatccac ctccaaggtt caagcatttc ttctgcctca gcctcccaag tagctgggac 1501 cacaggcgcc cgccaccacg cctggctaat ttgtgtattt tttgtagaga ctgggtttca 1561 ctgtgttagc caagatggtc tggatctcct gaccttgtga tccacccgcc tcaacctccc 1621 aaagtgctgg gattacaggt gcccggccta gtaacatgag ctctcttatt attattatta 1681 ttactactat tattactatt attcatttta gagagaaggt ctctctctgt cacccaggct 1741 ggagcacagt ggcatgatcg tagctcactg cagcctgaaa ctcctgggct cgagcaatgc 1801 tcctgcctca gcctcccaaa gtgtagtgat tataggtgtg agccacctgt gcccagccac 1861 ttattatttt ttaatcatta ttgttttgtt ttgttttgtt tttgagacag agtctcgctc 1921 tgtcgcccag gctggagtgc agtggtacga tctcggctca ctgctagctc tgcctcccgg 1981 gctcaagcaa ttctcctgcc tcagccttgt gagtagctgg gattacaggc atgcaccacc 2041 atgcctggat aatttttgta ttttagtaga gacggggttt tgccatattg cccaggctgg 2101 tctcaaattc ctgaccttag gtgatctgcc tacctcaccc tcccaaagtg ctgggatcac 2161 aggcaggagc caccacgccc agccaatcat tatttttatt accaccacta tcatcatcag 2221 tttgactcct ggaagaggca gctgttgatt cctctctgcc ttcaagtaat aatacttacg 2281 ccacttttca cattcagtgc ccttcagtgg 
cttcctgttt cactcaagaa aaggagaaaa 2341 ccccacctac ctatccagca acatgtgagc ctccacctac ctctccagcc tcatcttgca 2401 tgctctttcc tgttggtcac cgccctgcag tcgtaccagc ctcactgcag ttcttcatgt 2461 gatttatgtt catcccacac caagcactct ccctggaata tgctttccct ccatcttcac 2521 ctggtgaatg cctactcttc cttctttcga tctcagctcg gccattacta tctcagcctt 2581 ccatgacctc cgactagggg aaccctcctt atatcacact ctcagagtaa catgtgcctc 2641 tgctgtagcc ttcatcacag ccgtcagttt acacttattc cttcaacttg aattaatgtc 2701 agccttttct aatcaacact aagctccaca ggaacagaga ccatatcatt ctttgctcac 2761 cttggtactc ctaacgttag aacagtgcct tgcacataga agatatctcg taaataataa 2821 atgaatgaat gagtaatgag aaagcctttg ggttagacgg tcatccttca gaccaagccc 2881 aaaacttccc caagagaaaa tctgctgcct ttgaataaaa ccagcctttc tcactggcct 2941 cctgccatgg gacagaaaca gccttcttga cttccagtga gttctgccgg ctagagacgt 3001 gaccttgggt ggtcacaggg tacccgtgga gtaggaaaag cctttgtccc tctaagctga 3061 gatagaaact ggatatgaaa ccacaatgca atggagaagt ctgtctcatt tcctgagctc 3121 agccatttct cagtgccagg cactttcagc tcttccttcc ctcttcaaac agggtcactt 3181 accgtgtcct gatggtggaa tcctctactt tggtttttct ggccagcagt gttttcttta 3241 atgggttgga gcaaaggttc tcaaaggcct acagacctgg gccagtcccc cacagagtca 3301 gcaccaggac atggtgaaat cagagaaaca gggacaaagt cgggagtttt tatgaagata 3361 aattaattct gaaattgtct gccctacttt tttctgtgtc aggatgccct ttcttttatg 3421 aaacaagggt gctaggagtt ggccattttt ttcagtgtcc ttgtttgtca aaatcaccta 3481 tttaattttc ttttgaaatt tactggtctg tggacagtgg ctgagccagg atgagaactt 3541 ggattttgta gttccaggtt tgcagtctct cccctctgcc acccctccct ggagaggatc 3601 agatgcaggg gagacaggac ggctggctgt aaattattga cgggctgcca tgtggaagag 3661 gggaagcact ggttctgtaa ccagggagca gagtcagggt cactgggtag cagtttcaga 3721 ggggcatatt tcaagtgggt tagaataaac tctttgacag gcaggaattc aacaatagaa 3781 tgggctaccc cacccatggt ggccaggcac aggtgtggca gccatatttg ggctgttgga 3841 gggactatcc ccgcctgcgg gtactggtta aggattttgg ccttggattc aaatcgacct 3901 gggtttcagc actagtttta ccacttttta tctgtgtgac ccggggcaaa tcagtttcct 3961 gctgtttaaa atgggggtaa atcacgcctc cctcagagtc 
ctgtgaggat tgaaagaagg 4021 ggtgccagcc gggcacagtg gctcacgcct gtaatcccag tgctttggga ggccaagatg 4081 ggaggattgc ttgagctcag gagtttgaga ccagccttgg caacatagtg aggcctcgtc 4141 tctaccgcta ctattaataa taaactagct gggcatagtg gcatgtgcct gaaatctcag 4201 ctactcggaa ggctgaggtg ggagaattgc ttgagtccag gaagtcagct gcagtgagcc 4261 aagatcgtgc cactgcaccc cagcctgggt gacagagtga gaccctgttt caaaaataaa 4321 agaaagaagg ggtgcccagg ccacaaaagg gcattttgtg agatgtgggc acattcctgg 4381 cacatctgtc tccagggagt ttactaattc ctggggaata aggaatgcca ctctctggtg 4441 gagaggaaga gactcacaga gaaaaacagt ttatgaggcc gtgtgcggtg gctcacgcct 4501 gtaatcccag cactttggga ggccgaggcg ggaggatcac ctgaggtcag gagtttgaga 4561 ccagcctggc caacatgggg aaaccccatc tctactaaga atacaaaaat tagccaggtg 4621 tggtggctca tgcctgtaat cccagctact cgagaggctg aggcacgaga atcgcttgat 4681 tccaaacatt cattcatcca tatacccatc catccataca cctactcatc caactcgtgt 4741 tcatccaaac atccatccat ccatccatcc atccatccat ccatccatcc atccatatat 4801 acatccatct atccattcat gcatctatac atatacctat tgatcaaatt cttgtccatc 4861 catccatcca cctatttatc caacttctgt tcatccaaac atccacctat ccatctatct 4921 acccatccat ccattcatgc atgcatgcat ccatccatac atacatacat acactcactc 4981 atccgtctat atacccatcc atccatgtat ctatttatct aattcctgta catccaaaca 5041 ttcattcatc catataccca tccatccata cacctactca tccaactcat gttcatccaa 5101 acatccatcc atccacccat ccatccatcc atccatccat ccatccatcc atgtttcaag 5161 cagagaacaa gacaaaatca ctgcattcat gaagcttaaa ttgagtgagg cagggcttgc 5221 agatataaat caaataatag taaataaggg taaaattgtg acaatgataa gtgctacaga 5281 cataaagggc acatggtgct aggagagtcc atcacagggc aatctgacct agtcatgaag 5341 gtcagcaaag gctctccaag tgaccataga actgagaact acagggtaag caggattaag 5401 tagaagaatt ggggaggaaa aaatgttcag gaagagaggg aagggcacgc acagggcaag 5461 ataaagttgg gaagtaggcc tgaccatgca gtgctcttgg gcatgctgaa gattttgatt 5521 ttgattctta gaggttctaa gcaaggagca ggtgacagga tcagatttgt atttttaaga 5581 gattattttg gctgtggtta cagaagatgg aagcggggga tgggatgagc aagtgtgaaa 5641 gccggaggcc tgtgggaagc caatgtagat gtccaggaaa ttcatgatgg 
aaccttggac 5701 tggggaggtg atggggggag gggaggagtg gatggacttg agggccattt aggagataaa 5761 atggacatga ttgggccatg ggttttgtgg gaaggataag ggtgagggag ttatctagga 5821 tgacacccag gtttctggat aaaactgttg ccaggcaaca gagagaaagc cagaagggag 5881 tggggaaggg gtgggacaca ttttcccttg cagttgtttt tatgcccatg tttgcaaaat 5941 aaagggtgtt ggaggtgtgg gcgtgcacag ctccctgact gcccacccaa ggataagaag 6001 actggtttaa gaagattgca tgttgcaggg taaagggagc taggtcttct actctgggct 6061 ctgcatgcag gtaactgtgt gatttcactc ccctggccca ggactctgaa acagacatcc 6121 ctccttgtct ggcaatttca tggcaaaaag cagcctgagt cgtatttgtc cactcatgct 6181 atttacagga ctcctccttg ggaagttatt tcttgtagat ccactttatc cagagcctga 6241 aggtgaaaaa tcatcaagtc tagaatgtga gatctgaaag gaatcacaga gcccattttc 6301 ccaatcttct aattttacac tggggcagcc cccgtgtctg acccatgtct ctatgctact 6361 ctactacctt gcctacagga agagaggtta aggagtttgt ccaaagccac aaagctattg 6421 ggcataagga ggtgacccca cattcctttt cttactttgg gggtggggat tcttctgcag 6481 cctgcagtta tttcctagga cagtggggct aggtagagct gtggcgatga gctaagatca 6541 tagacacagg tgatgctgag catctggggg aataattcat ctgaagctgt gccctgctga 6601 gttggagtcc tttctgactc tttaaagatg cctcttgtca tgcacccagt cgtgactcct 6661 gaatatcctc ctggggttgc aagatgctct ttgcaaaaaa aaaaaaaaaa aaaggaatgc 6721 agaaaaccca gcccaccccc tgaaaatgag acatagagac aaaaaacatg gtttgaacct 6781 cactgcttag gtttccatgg tggctccact ctgctgccct tcccattggc taggtgtcct 6841 gctccagctt tagcaacaga gactcagata ctcatctcct tcctgattgg ccaggctgtg 6901 ctgcagtgcc atggcaacag aggcaggggg accagtctct gcagaacaac ttgagtcacc 6961 atctacctgg ggtgggctga gatccagaga actgcctgtg ggggatggaa aggggaacag 7021 gtagccccca cctaagctca cagtcaggtg ggaacctccc cgccaccagt tgcccagggg 7081 ttttggaggc tggcgggctt ctcctcctca tcttgtctac cctgtatctt gcagcttctt 7141 ccagcctttt cttgctccat gggtcctgtc tccctcagaa gaaaaatcac accactagca 7201 gtgccagagt ttaactcatt cgcccaaagt cgtgcaagtt actaataaag gccagggttt 7261 gaactcgtat ttatattgcc tggtttaaag gctttagagg ggctttgcta gagaatccag 7321 gggtcctctc agttttaata tctatcccct ttttattcac caaacagcaa atgcatttca 
7381 aagcagtctt tcatctaaat tatctcatca tctcaacaac cctatgagtt cggtaagtca 7441 gagattaatg acccatttta tagatgagga cactagagcc cagagaaggc aaattatttg 7501 ttccatgaca cacagcacat tagtgccaga gccgggcctt taacccaggc cttctgaatc 7561 cttgtcagag gtctaccttc caaaccactg acctatctat accagggggg ttccaggagg 7621 gttggaggcc ctctgagggc ttctctgagt ggtctcctct gttacagaag ccagttagaa 7681 tcaggcagtc cgttgtggac caagggttca taccatagcc ctagtctcag ccaggcaacc 7741 cctcaatgtg accctggtct ccaggaaggg tctgttttta actgcaaacc ttgctcttcc 7801 cctcaacccc catttgccaa caggtgaaga agtactcaca agcttacctg caatcggttt 7861 tactgtttct gtgtgcttga agctgtgctg ggcatgcatg ttctcattga attctcatca 7921 caaccttagg agttcggtct tcactgtcca ctcctaatag atgcagaaac tggtgctcag 7981 gaaggtcaag ccgcatcccc aaggtcacac agtcatggag acataatacc aacacaagtc 8041 cacctaacct aaaggagcct ggcttgaact acggtgatgc tctctctgtt cccacagaat 8101 gcagaatgat agctaaaaga ggcacctgga atcactcctg taatcccagc agtttggtag 8161 gctgagatgg gtggatcaca aggtcaggag ttcgagacca gcctgaccaa catggtgaaa 8221 tcctgtctct actaaaaata caaaaattag ccaggagtgg tggtgcatgc ctgtaatccc 8281 agttactcag gaggctgagg caggagaatc ccttgaaccc gggaggcgga ggttgcagta 8341 agctgagatc acgccactgc actcaagctt gggtgacaga atgagactcc gtctcaaaaa 8401 aaaaaaaaaa aagaaaagaa aagaaaaagg caccctggag ctgattcttt ttctcttggg 8461 tatgcatttt ttcctcaact gaagctgtgg ctactggacc aattccttgg cactggaaaa 8521 agcagactct gaggactctg agggacttgg aaatagttgg cccctgggaa gtgactcagt 8581 gtccccaaca ctctctccca ttgcccagtt tctcagcttg gtgatcttac agaagggcgt 8641 ggctgggctg ctggtaattt gcagctgacc tggtggagag aagccccagt aacttccttg 8701 gcctgaatct aagcccctgc tgaaaatgaa cattttggca gtttcctttt cttcctactc 8761 aacttgggct ccctgaagct cttgcccctg aaacacagcc cagacacctg gcctgccctc 8821 ctgccctccc aagggcaccg gttttggaag acacaaccag gcaggagttc ccacctgcca 8881 cgctgctccc ccaagcacca cagcagcaag gggcaggcac acaatctctc catcaacaat 8941 tacacagctt ctctgtgcca gacaccacac acacaggctc aagggagatg gtagggaaca 9001 aaccattctg gcccctgccc tcatggaaac tgcagcctcg tggcggaaat gcactgcagg 9061 
tccataagtg aatacatcat ggtttcagac agtgataaat gccttggaca ggatgaaaca 9121 ggataacggc ataaacagca attggtgggg gtttgagggg ggagggaagg gactgaaggg 9181 gaagggagca ggcacctgaa tgaggaggag gcagttgtct tggttatcta tagccaccta 9241 acaaaccccc aaacctagtg gcttacaaca accatttgtt attttttatg agtctggctg 9301 gcctgagtgg ttcctctgcc cgtttctcct ggtcacacat gcagctgcat tcagctggag 9361 tcacagctga gctggaaggt ccaagaaggc ctctcacaca catctggtgg ttggtgctgg 9421 ctgttggctg gggtgtctca gatccctatc acgtggcctc ccatcctccc gctggtagac 9481 cagctgcctt atgtggtggt ctaggagcag cattccaaga gggggaaggc acaagcttgc 9541 aagatctctt gcggcctaag ctcctgaact cataaagggt cacttctgcc acattctgtt 9601 gtcagagcaa gtcacaagcc cagcccagat ccaaagggca aggaaaggga tgccacttct 9661 tgatgtgagg ggagcagcaa agtatttgtg gccttattta tgagacatga tcagccgggc 9721 gtggaggctc acacctgtaa tcccagaact ttgggaggcc agggtgggca gatcatttga 9781 ggccaggagt tcaagaccag cctggccaac atggtgaaac tccatctttg tattttataa 9841 atttataaat gtatacatat tatattttat ataaaaagag acatgatcca tattgcagag 9901 agtaacaaaa atgatttcga ggtgtaccaa agttgtgatt agtatgaaat cttttttttt 9961 taattttttg agacagggtc tcactccatt gcccacgctg gagtacactg gcacgatcat 10021 agctcactgc atcctcaagc tcccaggctc aaacaatcct cctgccttgg cctcctgagt 10081 aggcatgcac cacaacacca gcaagacgct atcttgccct gttgcccagg caggactata 10141 gcaacacaat catagttcac tgcatcctca aactcctcat ctcaagcgat cctcccacct 10201 gagcctcctg agtagctggg actacaggag tgtgccaccc tgcctggcta tttttttctt 10261 attttttgta gcaatggggg tctcactatg atgcccagac tggtcttgaa cttatgcctg 10321 gggctgtctt cctgccttgg cctcccaaag tgctgggatt acaggtgtca gccaccatgc 10381 ctggcctatg tggccatctt taatctgcga catcatccat gtcgagatct ctgggaagag 10441 tgttccgggc agaggaaaag cacctgcaaa gaccctgggt taggatggca tccagagacc 10501 tgagccttga acaatggtgt gatggttgat actgagtgtc aacttgattg gactgaagaa 10561 tacaaagtat tgatcctggg tgtgtctgtg agagtgttgc taaaggatat taacatttga 10621 ctcagtgggc tgggaaaggc agacccacct ttaacctggg tgggcaccat ctaatcagct 10681 gccagtgctg ctagaatgta aacaggcaga aaaaatgtaa aagagagact ggtctagcct 
10741 cccagcctac atctttctca tgtgctggat gcttcctgcc ctcgaacatc ggactccaag 10801 ttcctcagtt ttgaaactgg gactggctct tcttgctcct cagcctgcag acggcctatt 10861 gtgggacctt gtgatcgtgt gagttaatat gtaataaact cccctttata tatatatata 10921 cacacatata tatacataca cccatatata tacatatata tacacatata cacatatata 10981 tatatatgcc attaattatg tccctctaga gaaccctggc taatacaaat gggttcctct 11041 actcccttag tttcctgtcc tgtaaagtgg gaactaagga tgtctccctg gctccttata 11101 ggaatacagg atgtaagatg ctttgcttca aggctgggtt aggattttgg ttattggcca 11161 gtggcaggga ggagaggcca acacaggagc tcagcttggc ttcccccaag ctcaccagcg 11221 tttgttgagc actgtctggc aggaaggcac agtgctcagc accacaggtc atagctgggg 11281 agtctgaggg ttgctgatct tgccaagctt atgggataaa aagattgctt gctgccccgg 11341 atgtgactga gacataccag atgcatctca gaagaaaaga gctgggagga ctgtggggct 11401 cagggaaggc ctggaggaga tggggttcca acagggcttt gaggggcaca taggatttgg 11461 ctgaaggagg ggaaactgtt gataggaaaa gtataagcgg ctgtggaatt tagtaattca 11521 gtgcaacaaa tgtttgctga acccctacat acgccacgtg aacgggacag acagaggcac 11581 tgtgaggatc cagaggagac acaacacagg tcttgaagtg atgatgcttc agaggcctga 11641 ggatgagtca ggtcaagcac agtggggcac tgggaggagg agatgatacc aagagggttc 11701 caatgaaagg acatctgcaa aggacagagg ggagggaggg cttgagtttc tcaagtaaag 11761 gcaatgagct gtgtctaaag cagagggtcc atgtggaaga gagtggaggg atgagctcag 11821 ggccaaactg gtctcatggg ctaactaaag acttcgtcct ttttcctggc caggcgcggt 11881 ggctcacacc tgtaatccca gcactttggg aggccaaggc aagcagatca cttgaggcca 11941 ggagttcgag accagcctgg ctaacatggt gaaattctgt ctccactata aatacaaaca 12001 ttagttgggc atggtggtgc atgcctgtag tctcagctac tcaagaggct gaggcaggag 12061 aaacgcttaa acctgggagg cagagcttgc agtgagctga gatggtgtca ctgcactcca 12121 gcctgggcga caggcttttg agactccgtc tcaaagagaa aggaagaaaa aaaaggcttt 12181 gtcctttttc ctgagggcaa tgtatgtagg aacaccagat ttagcaaatt aaaatacagg 12241 ttgtcctgtt aaatttggat ttcatatgaa caacaaataa tttctgtagt atgtgtatgc 12301 aatttctggg acattcctat actaaaatat tattcattgt ttgtctgaaa ttcaggggtt 12361 tttttgtttt tctttttgtt ttgttttgtt tttaagacag 
ggtcacatta tgttgcccag 12421 gctggcctca aactcctggc ctcaagcaat cttcctgcct cagcctgttg agtagcacta 12481 cgggtgtgtg ccaccatacc cagcttgaaa ttcagtttta actgagcctc ctgtattttc 12541 tctggagacc ttaagtgtgg gatcatggaa aggttttaag cagggcgagg acaaggtcag 12601 atttacgttt tgggaagctc ccaggattcg tgggaaaata gactggaggg agaccaggaa 12661 ggggcggggg atgaggttcc tcctattgtc ctggccaaag aagtggaggt caccaccagg 12721 aatggggcca ttgcatcatg tcttccaaca catctgcctc ccggactagg catgtggcct 12781 tggataaacc ctagctgatg aagccccaga gggctgcagg ccagtcagac cccagctgtg 12841 ccatgtgagc ccctggcatg gtttactccc tcccagatgc tcatctgcca aacaaagggg 12901 tcagatgatc agagagattg tccatgaggg cccttctgct tctgccatcc tggggccctc 12961 cacccatcct ctctctgata cctctcctgg cgtgggccca ccagggtgaa aagtccaaac 13021 ctgagatgct tggctcccag gtcatccttc ctgtgcctcc ttgctccttc ttccctccca 13081 ccatcactct gcccagtgaa tgcctgaagg gaccccgtgt ctcctctctc caacctcagg 13141 tcccaggcct ggttttggag aagggtctca gcctgtgttg acagcctgcg atgggtgaag 13201 ctaggtcttc ggcatcactc gtttcattga atcttcacag cagtgctaca aggaaagcat 13261 cattcccatt tcacagataa gggggactga ggctcagcag tgcacttgct gacccagtca 13321 cacagctgtg cagggggcaa acctggagtg gagtccaggt tgggctggca ccagtgctct 13381 gctctcccgc tgtctcaatg gaatgtaccc aggagcccct aatcccccat tgatttagtg 13441 ttcgggcttg gtcagcagtc ttcttgtccc actgcaggca tgtgcagccc cggctctttc 13501 tgagttaagc cagggactga aatgggtcgg cagaatgctc atctgttggg cgctttatgg 13561 tttataaagc cctttccctt ctgtgccctc ctcacaaccc catggagact ctgcacggcc 13621 cctccagctc ttagaggctg cattttgcgc agtgaaggcc acattaaata ttaacagggc 13681 agctcctggc cccgctgtga cgtcaattac ggggccgctt ggtgagggct gcggatgcct 13741 ttccaaggtg aagcggtgac cggaattcag ctggcttgaa tggtgtccca ggaggctggc 13801 tgtcaggact cggagaagat gttgctaatg taaggtgtgg caggaatcca aaagctttcc 13861 tctgatcttc ctggctccct cagggcctgg tggcatcctg aggctagcga gatgcattca 13921 gtcatttggc agatatttat tgagcacctg ctatatgcca agcacagtgc gaggctctgc 13981 ggagacggct gtgaatgtga tgaggtcttt gcatagtggt gctccctgtg tgattgaaga 14041 aagggagaaa tccagctctg 
tgggctaggg caggtgccca ctccagcgtg aggccaaggg 14101 atccttccaa aagaagtcag gttgatactg aaacctgaag aagaagccca agttaggtag 14161 ggaaagagga ggggtgagag aatagacata gagaatacct ttcgaggtcg gatcctggtt 14221 ctacctgttc gtagacattt atctacaaat taggttccca ccacgcaggg ctgttgtgag 14281 gactaaatga agccattgct agtgacagca caggctggat gcatagtgag gtgcccagtg 14341 ggagctcatg ggtagctgct gtgaaaatag ctctgggcca cacatggtgg ctcacacctg 14401 taatcccagc actttgggag gctgaggcga gaggatcact tgcatctagg agttcaatac 14461 cagccaaggc aacatactga gaccctgtct ctacaaaaaa tagaaaatta gccaggcgta 14521 gtggtgcaca cctgtagtca gtcccaacta cttgggaggc tgaagtggga ggatcacttg 14581 agtccaggag gtcaaggctg cagtgagcca ggattgcacc actgcactca gcctgggtga 14641 cccaggaaga ccttgtctaa aaaaaaaaaa aaaaaaaaag gaaggaaagg aaagaaaaga 14701 aagggaagta aggatggaag gaaggaaaga aaaagaaaga gagagagaga gagaaaggaa 14761 aagaaaagaa atagtctgtg tcttacatga ttcctttgca ctgccaggaa ttcttgctcc 14821 cattttgtaa gcagagaata tactgaggct cagagaggtc agagaggtga cccgacatgg 14881 ccagggttat actgtggatt agaccccagg ttggacatgt gaccctaggc tctgacttgg 14941 gctttttgcc atactccatc cacaggcaac ccaagcaatt atttcagctc agtgagatac 15001 ggaatcccac cagtgattcc cagtatgagt tccttggctg catgttcaag ggtagtctca 15061 gccttctttg tgataaggcc acagctgtct gatgattagc ccagggttgg gctttgggag 15121 gcctgggcat aggcttgatt tcgctaacct gctgtgtgac catgggtgaa ttactttccc 15181 tctctgagcc tcagttccca cagggccagc gaatgaccag ctgttattgt ctgggtctct 15241 gatcctccca gtagtgctgt ggcaaaggaa ctactttgaa tttcatatta aaaatgaaga 15301 acccggacag gcgcgatggc tcacgcctgt aatcccagca ctttgggagg ctgaggcagg 15361 tggatcactt gaggtcagga gttcaagacc agcctagcca acatggtgag accacgtcca 15421 tacaaaaaac acaaaaatta gccaggcatg gtggtacgtg cctgtaatcc cagctactca 15481 ggaggctggg gcaagagaat tgcttgaatc tgggaggcgg agtttgcagt gagctgaaat 15541 cacgccactg tactccagcc tgggcaacat gcgagacact gtctcaaaaa aaaaaaaaaa 15601 aaaagataag gaaccagcgg tctgagaccg acaaagtggc atgcacaagg ctacacaatt 15661 agcaagtggg gcccaggatt tgggaacaac ccgttgctgg cagagcttcc catcagtcca 15721 
ccatgctgtg ctgcctcttg gagtaagaaa gccaaactga attgtctcca agatctttat 15781 tatgggtctg tgattctgtg agttccctgc tgccccaggc cttcagggct caatgacttc 15841 atggggaagc catttcaggt tgtgctggtc ctccaactgt ccccatttca agtaggtaac 15901 atccccgtag gcagcactcc ctccctccct cagacacaac ctcattcctc ctgaatgaag 15961 aggtctagtg aggacaggga catgcccttt ttattctccc cccactgcac cgcccactca 16021 ccctgacatg cccattttat ggaaaagaaa actgaaactc aggtccaaca acttgcccag 16081 gtttccccag ttactatgga agagctagga tttgaggctg ggtctggtaa ctccagatct 16141 tgtactctaa gtaactgcaa gctggccaga ttcagcagga cagccctcct cctcctctga 16201 gctccctccc ggcttctgca atcctgcctg ccttccccct tgctccctcc ctcttcttgc 16261 ctttcactca gctgtcactt aatgttagct gtcagctgat ttcaggtgtt ccccttcctg 16321 gggtggaagg ggagaggacc aaagatcttg gccattcaca aagggggctc tggtgatggg 16381 agttttgtaa tacgcagaag gtgtaaccaa attcaattca gcattcatgg agcagcgcct 16441 ggatccagga cactcagcca ggcatcccag ggactcagat ggtgaatgaa acacaggccc 16501 cattctccag tgcttacagt ctattacaaa ggaaatggct gcttcctaag gaagtgggaa 16561 gcagaaggta taagtaacaa agagtgtgaa ccttatagtc agaaagacct gggttcaaat 16621 cctaattctt catctaatag cagcatgact ttgggtaaga ggatgccttc ctgagcttta 16681 atttttccat ctgtttaatg ggtgtaagaa gaatagctcc catattaaat gagatcttat 16741 actgcattaa atgacataac aaagtgcttg atgcccagta gcaagccaca gacattccta 16801 ccaatgtcac tctaggcaga cagtggatac cagagtcaca gtgggagtag gtggaaggcg 16861 agagtacttc ttgttggaga aactgggaag gctctttagg ggaggtaaga tttcggctgg 16921 gtattaaagg aatgtaagaa gaagagaatg gggtggaggg atgggaaagg acattctgga 16981 aactcatggg cagaatctgg gaacctggtg ggcctaatgt gtccagggag tgggtgatga 17041 ccttgagaag atgggccacg ggcaggttat gaagggccag gtagaggaag ttatatcttt 17101 aacgagggaa tggatcctac ccacgacccc aggggcagct cagtgctgtc tctgcagaag 17161 ccagccagcc tcctacagga gttggccttc tcatgtggct gtggtcctct cagatccctt 17221 tatctcctga gtcagggcca gcctgccacc attatctaca agaagacaga aggctataac 17281 aatctgcttg cctcctacta acatactggg gaggaaaaga tgttttttag catagttgcc 17341 tggcagagaa ccatggtgaa aattcagtga ccctctccct tgcttctcag 
ggtacagttt 17401 cagcagcagc tgatgacagt tatgctcata gctcaacagc tacaagaatg ttctcagact 17461 ctctgaaggt caggggcagt ggtgggaaga atggtgaaga tggagcagag cccctgctct 17521 ctgtgctgtg gctgggcaag acacttaacc tccctgggac tcagtttcct catctgtaaa 17581 atcaaaacaa taatactgac cttgccttaa ccaagagaca gcgtagtagc aaactgctgg 17641 cccacagcct gctatgaagt aggagttcat taccttcttc gctccaggtc ttgacatggt 17701 ccaaagactt gtcttttgat gcagccctgt tgtatcctct tgagttgtca tgacattgtc 17761 tgctggtctt ccagtggcaa aatatcctag actttcagag ctgaaaaaaa aaggtacttt 17821 gccattcatc attgcagcct cccatttcac agataagaaa tccaaggctc ggggtgaagg 17881 aacagccttc tccagctttc ccggaccatg cacgacagcc cctcccaacc cagcccaccc 17941 agtgggaacc atgcaaaagc ttgaggcagt ttttcagatg gttcaagcca aaacgtctag 18001 aaaattcttg ctaagaaaca caaactccca gaccatctgc agaactgaaa caaaagcctt 18061 tttcgccctc attttaactg caactcaata ggtgtcggct ttgagctctc ccccagagct 18121 gagctcctct gaacaagctg gaggggggcg ggtgctggga ctggggctct gagctccacc 18181 agctcataga accagctcac agattgcagc taacaggaca tttttgcact tcttaatgcc 18241 aaaagctgcc ttggatgctg gtttgcttcc aaggcgcgct ttgaaagagg gatggaaggc 18301 tctagatgat cctgaaatct ctgcgctcaa gatgctctta gttaagacac ggggtgagat 18361 ttggtgtcag caaaaggata ctggggtcaa agtagatggt gcaaagttgg ggtgagtggg 18421 gaaggcttca gatgattgca agaatctgac tcttctaggc ctcagtttcc tcatacatat 18481 agtggggcta acagcctcct tccgagtgag gatgtgggga gataatatga gagtaaagtg 18541 ttcatcctca aatcaggcag agggtaagtg attaaaaaac aagagtcgag ccgggcgcag 18601 tggctcacgc ctgtaatccc agcactttgg gaggctgagg caggcggatc acctgaggtc 18661 aggagtttga gaccagcctc aacatggaga aaccctgtct ctactaaaaa tacaaaatta 18721 gcggagcatg gtggtgcatg cctgtaatct cagctactct ggaggctgag gcaggagaat 18781 tgcttgaacc tgggaggcgg aggttgtggg gaaccgagat cgcgccattg cactccagcc 18841 cgggcaacaa gagcaaaact ccgtcttaaa aaaaaacaaa aaacaaaaaa aaaaacaaga 18901 gccgctgtcg atgttattgc tggtggtttt attatcattg gctgggtcgg caaggtcacc 18961 ttggctgcag gtggcctaca gtgggagaaa aagttaaaat cctatctggg tgacattatt 19021 aataatttgt ggacaggcgg ctgctccagc 
tcaagatggt aaccatgcca acattgcaaa 19081 tgccatctct ccagaacaac atcacgtctc ctttgtaata tgtcccttgg tgtccagact 19141 tctcagcagc tcagagtaag tgtctcagag tatccatgag tggaaaaaca aaaaacaaaa 19201 aaacaaaaaa accaggatgt gccttttctg ggtcatctat tcactgggct ctggtggagg 19261 gttctggcaa tgctttagaa atgcagtccc acgtgaggag cagtcgtgac cactagctca 19321 gggtgggatg caagtgagga attttttttt tttttttgag ccttgttgtc caggttgcac 19381 tgcaatggtg tgatctcggt tcactacaac ctctgcctcc gggattcaag cgattctcct 19441 gcctcagcct cctgagtagc tgggattaca ggtgcctgcc accacgccca gctaattttt 19501 tgtattttta gtacagatgg ggtttcacca tgttggtcag gctggtttca aactcctgtc 19561 ctcaggtgat ccacccacct tggcctccct aagtgctggg attacaggca tgagccaaca 19621 tgcctgacct tttttttttt tttgagatgg agactcactc tgacgcccag gctggatgga 19681 gtgcaatggt gcgatctcag ctcactgcaa cctctgcctc ccgggtttaa gcaattctcc 19741 tgtctcagcc tcctgagtag ctgggattac aggcgcatac caacacaccc cgctaatttt 19801 tgtattttta gtagagacag ggttttgcca agttggccaa ggtggtctcg aactcctgac 19861 ctcaggtgat ccactcgcct tggcctccca aagggttggg attacaggca tgagccacca 19921 cggctggcca gtttttcttt tttaaaaata ttttttgtag tctcaactac tcgggaggcc 19981 aaggcaggag aattgcttga tgaacctggg aggcagaggt tgcagtgaac tgagatcatg 20041 ccactgcact ccagcctcag tgacagagac agacttcatt aaaaaaaaaa agaaaaaaaa 20101 aatataaaat gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt attttgtttt 20161 ctgtagaact ggggttttgc catgttgccc aggctggttt tgaactcttg gcctcatgtg 20221 atccttccac ctcggcctcc caaagctctg ggattacagg catgagccac tgtgccaggc 20281 tattggtccc tttttacaag tgacttgccg gagggtactc agctggtgca ctggcagggt 20341 aggattcgaa cccaagacaa gaaccacgtt ctttctcaaa tcacaccaca cacgtgtgtt 20401 caggaacccg gggggggggg cggtttgttc tgttccccga gtgagaccct gtaccaggtg 20461 aagacccaga cttggctcct ctactgggga cctgctctga gccctgaggc aacaaagtgt 20521 gtttgacatt aaccattgac ctccctcacc tcactcaggt gctgtctttg acgttgttca 20581 aaggccctgt ttatcgatag gacgtagagc cttgggagta ggtgacctgt cgcaggtggt 20641 gaccctgatt tttattgatg aggctcttgg gcgttagcaa ctctctccca agggcaaaga 20701 agatgtgtgc 
tggtaagagt gtgtaggcca catggtggga ggcttatgta tgctttgtat 20761 aatctgagtg accccaacag ggagtcacac tttgagctct tctcagctga caccacgggg 20821 cagtgtgagg ccctcagact tgggtgtctg atggatctga gtttgaatcc ttgctttgcc 20881 cattcccagc tgtgtgactt tgtttaaacc atgggacagc tctgtgtcag tttgcttatc 20941 tgtaaaacgg gaaggaggcc gggctcagtg gctcacgcct gtagtgccat cacttgggag 21001 gccgaggtgg gtggatagct tgagtacagg agttcaagac cagcctgggc aacatggcaa 21061 aaccctagct ctattaaaca aacaaatttt aaaaactggc tgggcgcagt ggctcatgcc 21121 tgtaatccca gcacactggg aggccgaggc aggaggtaca tttgaggcca ggagttcaag 21181 accagcctgg ccaacctggt gaaatctcta ttaaaaatac acacaaaaaa attagctggg 21241 cttggtggtg catgcctgca gtcctagcta ctcgggaggc tgaggcagga gaatagcttg 21301 aacctgggag gcggagattg cagtgagcca agattgtgcc actgcactcc agtctgggca 21361 acagagcgag actccatctc ttagagaaaa aaacaaacca aaaaaaagac taaaaaatcc 21421 atttaacaga tgaggcacaa gagtttaaat aattttctct gaaaaaatta gctgggtgcg 21481 gtgacgtgcg ccttgtagcc ccagctactt gggaggctga ggcaggagga tcgcttgaac 21541 ctgggaggcg gaggttgcag tgagctaaga gagcactatt gcactccagc ctgggtgaat 21601 gggtgagact ctatctcaaa acaaaacaaa agccaacaaa caaaaaaaca ggaaaggaaa 21661 tcggaccagc cttacagggg tatcttgagg gaaaaaaaaa tgcacgtttc tgcaaagcac 21721 cttgcctggt actaggaaca tggtagatgg ttccttagag tgcatgagga tggggtcacc 21781 aaggataggg cagtgtgtga agagggctcg ggacaaagcc ctgaggaacc ctagtatttt 21841 ggggtaggct agaagacaag cccatacagc aaatagaaga gcaatggccg gagcagcaga 21901 agggaagctg ggagagtcgg tcaaggaagg gtgagcagtc gtgcccagtc cagccaagag 21961 ggccagggag gtgggcacta tgaaacctcc ttggctttga caacaaggag ggctctgatg 22021 gccacggtgg aggagtaggg acagggtgag agcacactga gctgatgggt gggtaggagg 22081 tgaggaagag gagatgagga agtttggtga gaagaaaagc ttcatgggag aaccagagtg 22141 tcctttacac atgtcagaaa tcaattattt agaatgctcc agctctacct aattttcact 22201 gttttgaaat caatcttagt ccaacctcat cagaaaagct ggaagggctc taagccctct 22261 aagagctaga aggaaccgag gaggtccccc catccagggc tggccctcct cccatctctc 22321 actgggaccc catccttatg gggggtggag tggatgattc attactatct agcatgctct 
22381 ttaaatatat tcactagggg aaagattagc ctggtttgtt aacctcgatt ccaacagcaa 22441 aagtgactct tcaaagcatg gtgaagcctg gggtagtggc ctcctcctag acccacattc 22501 aaatcctgtt tcatgggccc ccagaagtgt gatcgtttca gcatcatagc aggaagagga 22561 aggcctggga ggcaggaaac ctggtatgga gaccccagct ctgctcactc gctgtgcaaa 22621 ctagggccag cccctccttt ccctcagacc tcagtttccc gttattcagg ataaggggtt 22681 agacaggggt gattctaagg gtcctgccag ctctgacatt tcagcatgtg gatatctatt 22741 ctaaagtcct tgaaacagtt aagtccaatt ctccagatgg aaaacaggtc cagaaaaggc 22801 ctggtctcca gttccctccc ttctcccttc gaacatgtct tactctctag agcaagattc 22861 caggcctgaa atatggcttc ccaattacct aaggatgaag cctcaaatct ttggcctggc 22921 atttaaaaga ctccgtaatc ttgttttacc caacctcact tcccattctt aggcctctct 22981 tctcaccatc tcccaggtca ctgccaagcc tttgccaatg ctgtacccca ggccttaatc 23041 tttttcctga ctctccttca agatccagct tatccctgtg ggctgagcat gcagtagatg 23101 ctcaataaaa ttctcagcga gcccaaggcc ccctctcctg cactgccctt ggcgtgttga 23161 aatcaagtgg gggccttgtt ttaatcatca ttttcctagc ttgtcgggca gcccagtcca 23221 gggcctggca caagggaccc tcagcatcca ataccagcca ggattgggag gggctcagtg 23281 accagccagt tgggccaccc cagctctttg ccacaagaga aagttcagcc tgggtttgga 23341 ctcagatcct ggccccacca cttacaggct gcgtaaccct gagcaagtca ctaacctcgc 23401 ttttcctcag gttccacgtt ttaaatatgg ggagaccagc tgtgcccact gtgggctggg 23461 tagtgaggat taaagaggag gatgcttgta cgggactagc actgggctgg ctcctaatgt 23521 gtcctgaata ttacttgtcc cgccctggcc acccagtccc ctcctctggt ggaaggcaca 23581 gcctgcccaa ggctacgtct caggttgggt tcgattgtgt gtgtctgtgt agcggggctg 23641 tccccgaggg tcggactcga tcctgcccgg accagtacaa ggtgtggccc agcagaacgc 23701 atggacaggc agggctcggg gcgggcggga cggctggggc ggggctgggc ggacccgccc 23761 ctcggggcac gcggtctccc ggcttctccg ccagtatgtt tgcttgcttt tggggccctt 23821 ctcaagccag atctctaacg tggatggagt ggcgcgtttc agggtggggc tatgacaatg 23881 cactgcatgt gcagccgcca tcaccgcgcc ccctccaccg gctccgcagt ctcctgcccg 23941 gctttccctt ctccacctcc atctctgcgg cggtaaaatg cagcactcgc agagcgcctg 24001 cgcggctttc aggccgggcg gcggcggcgg cggtggctat 
ttatagtagg tgacgtcacc 24061 ttgaaataga ccgttagggc cggcccgccc ttcccccctc ccactcccgc cgctccccgc 24121 cgcccggcct ggccactctc actgcgcagg cgtcgtcccg tcctcccatc ccccagcgcg 24181 gccgcgtctc ctccctcggc gttgtccgcg gcgcgagcca cagcgcgcgg ggcgagccag 24241 cgagagggcg cgagcggcgg cgctgcctgc agcctgcagc ctgcagcctc cggccggccg 24301 gcgagccagt gcgcgtgcgc ggcggcggcc tccgcagcga ccggggagcg gactgaccgg 24361 cgggagggct agcgagccag cggtgtgagg cgcgaggcga ggccgagccg cgagcgacat 24421 gggggaccgg gagcagctgc tgcagcgggc gcggctggcc gagcaggcgg agcgctacga 24481 cgacatggcc tccgctatga aggcggtgag cgcgccggga gcccgggcgg ctggccgggg 24541 ggggcctggc gttggggagg gacggggatg gccgcgggcg cgttcccctc ccggccatgg 24601 gcgacccggc gacccggctg ggcgtggccg cccgcccgcc cttagcccgc gcttcccgct 24661 cccgctgggc gccccgccac ttcctgaggc tgggcccagg gtgggggatc cgggagggtg 24721 cagttcggga tcgcgaaggc agccccggaa ggggggcggg ccggtcgggg tcgccacatc 24781 ttagttcgga acccggccgg gggcagaggg tgccctaggg gacgcgaagg agccgcattt 24841 ctcctagagc gtttcaccgg acccgggggc tccccctctc gtcttcctcc gttccccaac 24901 ttggaataaa gaatcaccta gtaagtggcg ccctgtctca gctggttttc tctgccacga 24961 cctgaagtct gcaattccga tctgttttgc tctgctcgtt cgaattgttc tggttccatc 25021 ttcccacgcc tgggggtctg gctttgtgtg cgaagacccc tttcctgcag tctaggcgtg 25081 gacgggggcg ggagagtggg cggagggtgt ggggccccac tccacagccc caggtttgct 25141 gctgcgctgt ctgcttggag attaaaatga aacgtgactt ctaggtgaga cgaatctgtg 25201 tgtctttgca ttcctgagac cctcataaat cgattctgca gcttctccgg tggatgagtc 25261 gtgcccagcc acaccctagc aaaataccgt gggtcacaca ccacctgcat ttctgagttc 25321 cggttaccat ctcactctct ctccacgtta ttttacgttt tctgtaaatg aatatttcct 25381 ctccctgcgt caaggtcaca cagtggtaga acaactgcga ccaccaacct atgttcagtg 25441 tacaacctcc tgagacctgg gaggatcccc ttgaactccg ccttagacct atacattcag 25501 aaattcgggc ggtgagactc agacatctgc attgtaacca gctctcacag gcgattttta 25561 gctggcttaa gtctgagaat tactgcccat gattatagta aatgtatcct ttgctggcct 25621 tttggaggga gttccgaaaa aaaactgcca atacaaaggc tccctaaaat cttaagtcat 25681 ttggaaagct catttgaagg 
cttttgatgt taatatcaga tgaaatatgt agttggggcc 25741 aatatttagg attgttcagg taagaaaggt gcttgaagag ttgcctttcc tctggtgtcc 25801 ttgttacaca cataactggg tgtgtttgga aaccagcaac accgttgccc gtatgttgat 25861 ttgttggtga tcttgacttg agcactatta gcatgctgtt gtggtgaggc aggttttctt 25921 ccagaccggg ggacggtcta ggtctggagg ctgaacttta tgtaattcag ggggtccttt 25981 aaaagaaaat ctaaaatata gttttgcata ttttacaaaa atgggactga cgaatgcatt 26041 tccttggaag ggaccctgaa ccttaatttt ctgtgggata aagctgcctc tgtccacagc 26101 aggtcacact taagcccctc ctccccagta actgagtggt gctgggatta tgcaggctga 26161 ggcctctcca cacattttca tgaagtggtt gctcaccctc taagtgagac gtaaagactg 26221 cagtgttcgg agccctttgg gaaataaatg tggtgaaaag aaaggaggga cgaggtttaa 26281 aggaagggga aatggaggtt aaggaaaaat ggctagaagg tgtgttctgg aacaatcaga 26341 aacctgaggc aacttgaaat gggggaaaaa ggcaagaatg gagagatgtt accagaacga 26401 aatctagttt gagcagagaa aaagactggc tttctagtag gcttgtaaaa tgctttcctg 26461 cagcaagcta cttgtgtcca tagatcttct ctggagggtt aaggtagcct tttgtaaacc 26521 aaaggaatct ataatatcta tatttgaaat aattaatgag ttggaaaagc cacccagcgt 26581 agaagagtca atccaagctt taattctgcc atctcagaat ggtgataaac ccatttctcc 26641 cccatcatct aaatggaata tattgtgctt ataggtattt tctttaaaag aaaaaaaaga 26701 atatattgga agggactggc aggggtcact ttccctgcta acttggtgtt tgtagattaa 26761 attcttaatg ttactgggat gactcagtct ctatgttatt aatctcatca gccttaacta 26821 aggcttttgg aaagggtgta agtttcagta accctctggg tttacctggg ctttcttcag 26881 ggctggggaa gggcaggtaa gtagcctgct gtagggagga tggagcagga cacatgtgtt 26941 aaggatatat tctgatgcca ccagtctgta accttgaaac ttacatcttg gctttaggtc 27001 ctctgaagta taaaactagt gatgttgctg ttattaacaa gtcagttttg ccttcagtct 27061 tctaacttgt tttctgtttc cttcataatt aaaatgctca cttgataata ctgccttcct 27121 gtaatgattg atgttcctca ttctaccagc gtcacctgga ggacagtgtt ggtaagcttg 27181 aagaaaaatc gtgcatactg gaaacaatgg tcttgcctca aacctgttga gaacttctct 27241 tccctgcttc tctccataat ttgaagaagc aagaaatgtt tctgaagtga agcataaagg 27301 agtatcttca agctctatct catacctacc agcttgcata gagtataata gtgggaagaa 27361 
atattgtgaa agattctaga acctaagtcc tttctaggct aggcaattct aggctacaca 27421 attcgtagtc tctgtaaata ccatttgata actgaaataa acctataaag tgggattaaa 27481 tgttagtcca gaagactaag tgatttaagg aaagagggaa aataaccaga tagatttctg 27541 gtggatattc atagacagac ataatttcat catgtattat ggccctaagt catctatttt 27601 atattaaaac atagatggta actgtaaagt tatcaaatgt aaaaattact gggctagcat 27661 tgtgttttaa cactgataca ccagttgatt acagtgttga cagaaaacgg gcagaaacac 27721 gtttaaagta cctgcagaat atatatatag aaatcttttt tttttaatca gtgttgacca 27781 ggttggcctc gaacgtgtag cctcacctcc ccgagtgcca gggcaaccgg cctgagccac 27841 agcggctccc tagaaatctt aagtgcagga aacaaaacta cagtttaagg aaacaggagt 27901 cggagaggag gggcatttgg cccaaccact ggcaaaaggc tgttgttgca acaactgtga 27961 gattctggga agtggggaaa tgagcaggtg ctgtgtaaaa gtgcacagct gactgcccca 28021 ggtgagacag atcagtgtca cagagagtag gaggattttg aggtgtgatt agtatttttt 28081 ttttttttct taaagagatg gtctcgttct ggaacccagg ctggagtgca gtcgtggcac 28141 gatcataatt cactgtagcc ttgagctcct gggctcaagt gaacctcctg agcagctggg 28201 actacagatg catgccaccg tgcccagcta attattaaat tgtatagaca ggatctctat 28261 atggtaccca agttggtctc aaactcctgg cctcaagcag tcctcctgcc tcagcctatt 28321 aaagtgctgg gattacagga gtgagctacc acacccaacc agtaggaaat cttaaagcca 28381 cattgatact acagtttgaa atatgtaaaa actagggagc tatgtttccc actaactggg 28441 catttaaaat gtaatcgtct ttgaggaagg aagaaggctc tggattattc ttgtaatagt 28501 ttgaatttaa aaggtccttt caggtgggca aacagcttac gtggctttct tgcactgaaa 28561 tagaaactgt agtccccatg tgggtttggg tccctgaatt atccccagaa gaacttccac 28621 agtccggaac tgactccctt aaagaaacca aaaatgtatt ttctttggtg gggattttct 28681 ttagttggtc tttgtctggg attttatatc agtgacagac tgtagcgtca cacttttcca 28741 gaaatacggt ttcggagagg gtacccctaa cgttttatgt gtgtgtgtgt tacatttagc 28801 aagtttatat aacatatttt actttttttt tttttttttt ttttgagaca gtttcgctct 28861 tgttgcccag gctggagtac aatggagcaa tctcggctct ctgcaacctc tgcctcccgg 28921 gttcaagcaa ttctcctgcc tcagcctccc aagtagctgg gattacaggc atgcaccacc 28981 atgccatgcc cgcctaattt tgtattttta gtagagatgg ggtttctcca 
tgttggtcag 29041 gctggtcttg aacacccgac ctcaggttat ccacctgcct cggcctccca aagtactggg 29101 attacaggcg tgagccactg cgcttggtta gaacttattt aactttttag gcacttggta 29161 aacggtagct gttagtccat gtgtactttt ttttttggtg ggggggggag tcatgctctg 29221 tcacccaggc tggagtgcag tggcaggatc ttggttcact gcaagctccg cctcctgggt 29281 tcacgccatt ctcctgtgtc agcctcccaa gtagctggga ctacaggtgc ccgccacctt 29341 ccatgtgtac attcttgttt aacaattcag gataggtact gttactcccc tttacaaatg 29401 agactttcag aggttgattt tctcatggtc aaacagctag taagttgtga attcaagatt 29461 caaactcaaa tttagagctc tcattttaac cactatgtaa caatgcccca tgaggcaaag 29521 ggataatatg tctagtatac attctggtat attaaccatt gtcagcactg gtgactgaaa 29581 tccaagaact cttcataagt gggcttacta aacaaataat tttatctgtt gggtgcaagt 29641 catttttttt ttccaaatac gattaagaac aatctgccat ttatgggttt tagctacaac 29701 agagaaacat aagagggaaa ctcacatatc tggattttgc ttgttctgaa ggcttggagg 29761 cctttttttt tttttttttt tttttttttt tttttttttt gcttaacaaa gtattgaatt 29821 tgagtttctc ctgagttaag aagtgcagga ctatgttaca ttttatttgg ggtattttaa 29881 gaaattattt catattcttg gcaccagttg tttctccata ttgcctttaa tgtatttttc 29941 tccctgcacc ctttagtata tttttatgca ggccccaaaa tttctgtctc taggttgctg 30001 gtaaaattcc tcctagtgtc caagtggagg cagcttgtct tctgtcccag cattttggtg 30061 ctcctcccgc ccctactcct ggctgcagtg gcattccctt ctgcggtggt gagtgttagc 30121 acttccaatg atccgaacct ggcacagctt ctgaagcctt caattcggat gcctctaggg 30181 acagtagcat gcagtaatgc cattcagatg gtgttgtatt taatccttgc caatccccat 30241 gaaaatgttc agttatgtca aaagcaaggc aaaaacagtc tcttggctat acaagggtag 30301 ctgttttatt tgactaaaat ttagcttaga gtggatgtta cttacccgaa cttgcctgct 30361 ctgagcttga agtttagcct atttgtggtc ttacagaatt gcagcctcat cctggtgagg 30421 ataaggggct cagcctgacc ctggctggtg atgttcttgc ccagtggcct gtaggacttg 30481 gtctgttggt ggatatcttt ccagcttggg gcaggccagg caagatcctc acttcctaag 30541 cattaacttg ggaaagagct cagcaagctt tcatccctct agctcatttt ttgttttttt 30601 gagacagagt cttgctctgt cgcccaagct ggagtgcagt ggcacagtcc tggctcactg 30661 caacttccac ctcctgggtt cgagcgattg 
tcatgtccca cccaagtagc tgggattaca 30721 ggtgcacgcc accatgccca gctaattttt gtatttttag tagagatggg gtttctctat 30781 attgactggg ttggtgtcgt actccctggc ctcaagtgat ccgcctgcct tggcctccca 30841 aagtgctggg attacaggtg tgagccatca cacctggccc ctcaaaccct caaactgact 30901 tttctcacac agacacacac acatacacgt acacacacac ttctgccaaa gaagcttaaa 30961 ggttttatga tgtggttatg tttacttaat catgagaatc atttactcat atcaaaagca 31021 agctggttga tagcatgtag gtgtgggtag ctataaagga tgagcccata gcatgtgtgg 31081 agctgtgaag gatggcagca tattcgagtg aggtagcatt acccctaact taggcagcta 31141 tggcctggca gctcgggggt ggcatttttg tttgctcagt gttacttttg atgttggatg 31201 tttcccagat tgagaacttt aaattttcct aaatagtccc tgtactgttt ttggcagtgg 31261 atctattttt actcacacac atagtttttt gccttgtgag tagtgggtgg aggcccttag 31321 gggtttttgg gtaaatgaat agtgtcatag gaagataagt gaatgatcag caggccaggt 31381 catttttgac atttagtaat gttgggggta gtcctggagc agagaaaatg tgtgaagtga 31441 gtgccctggg gtggggacta ggtgtagcca aaggctgttt ggtcaggacg tgagtaaaac 31501 tggctgacag ggaagtccaa tccttaaaag aaatggagag gagggccttt gtgcatggga 31561 agagcccatc agtttaaggt tgtccttcag gaagggtgta agtaggatta cagagggtgg 31621 gaaggcaggc aggacaccta gaaagcagag aggctgtgag caggggtggc gaaacccctg 31681 gtttgtggcc cggtgaagac tcagtgccta gtgtggctac actgagggaa ggtgccgtgt 31741 ggcagaaatg actgttgtaa tcttggtgtt agagcggaaa gaggctgaaa actattgtct 31801 aggcctgtct gtaaaattct gtcttgttgt tgcttcatgt tcctgatttg tggagatgca 31861 gcaaatgtga tttcattaga ttttgcagat gatcttaggt agcatggtcg gggcctggag 31921 agcatccatt actctgtgtt gccacccggt gtttgcctct cagtacagtt tatgactgaa 31981 cttttcatta gttctcctgg ctcctgtatt ttggtatttg aagtgttggg aactggtgct 32041 aagtatcatc catcaagatt ggctatccat ctttttgttt taatggactg tctgatttta 32101 catgggtcta taattttcca gtctgggaaa aactcagttt tctttagtta tctggtcaag 32161 tagtaagcca tcttcatttc atgtttcctt ttgttaactg aattataacc caggagtatt 32221 aaaaaatact gtatttaaat gtatccagtt atttcctagt taggctctgg ctggggtatt 32281 aggaacaata tttgcttgct gagatatgtt agtcaccctc agctaatgct gatagtgatt 32341 cattcagact 
ggcaggcaac cagatgattt ttacataaac actaatactc ccagtaaatg 32401 atttttcttg tgaaggtgtc atcataaata ctgcccagta gccttttttg gcagcttatt 32461 cctgatggag aaagaatatg cttacagata cacccaaact tcacgttagt aggaagccct 32521 gccacacctt ttattggtgc acaggaaaaa cgaaatcatg gaacattgga gctgaaggag 32581 gccttggaga tcatttgggc taatcctgct tattttactt tggatgtcca aaaaggttgt 32641 gatttgttta aggtcacata gctagctcag actaaacttc agatgtcttg ggtttaataa 32701 ttataccgcc ttttgtgtta tgcaagcatt ctgtattata catacatgta tttcagattt 32761 gtactcactt taaatgatat actctgggaa gttaccttgg tgtatctttg ccagtttcac 32821 agaagcagct ttgttagtca caggaggaag tatattgtga ctccttatgg atattatttg 32881 tattttaatg ccaaatggcc ttagtttaat tactacaaga tgaaaggagc tctccaaatt 32941 ctgttgttgc cttatgcttt gttttattga atattgctct agaaacgcaa gtcattctag 33001 aggattcaag cattctgaaa tttatcagaa tatttgggat ggaattttga ttcagaagtg 33061 ttgttcgtta aaagactaag tagttactca tctttttgta gttcataagt gtgatgattg 33121 ggttttgaca tgcaggtgtg agatgtgcca ccctcaaacc ttgttacaac atagacatgt 33181 gaccctctga tgtgggcaaa aaaagactaa gttattaaat atttacaaag tctctatttt 33241 ccgtaagaaa atgtatgtaa gtgtttaaag gcacttactg gaggaggaaa ctagtggggt 33301 tataggtgtt cagctggatt taaggggagc atggattgta gattcatggt agactggggc 33361 agatcacagt gagaggtatg gagactacta atgaatatgt gaggatgaat aatgagaacc 33421 ctggcttgac cagagtggag gcagcacccc tggggaatgg tcaggatagc tgttagccaa 33481 agaccacact gaagaaagaa gtgggtaggc aaaggagaga ctttgcctgt gttaagggtt 33541 tgtagcagaa gagtaggggt cctggggaca gggaaatgga gaaattaaga gtgagctggt 33601 atttttattt tgtctggggc ttgggggtag ggaggactca gacaattgct tgttttactt 33661 tgtgagaact tggcacgttt tctgattttt acctgtaata atgattacac ttgctgctta 33721 tgtccttgaa ccactaattc cccaaataaa ccatcattgt tcatgatttc tgatactcca 33781 ctgctcagtc ttctcaatcc attttaagtg tgtctatttg aatatgctat tgataacttt 33841 ccccaatgaa cttggccttt tccctctgga ggttctgtcc ttacagattg gtttaagtaa 33901 tacctttttt gtttagtgtt ttatggtttc ctaagtgctt tgctttcttt tttttcatta 33961 ttgaaattaa agaccttttg gataagcaga gactgattta ataaaattaa agacctttac 
34021 cagtaggtca tattaaggaa tccctctact ttagacgaca caccatcttt taagggcttg 34081 tgaaatgtcc tgggtcaatt taggaagtca ttttctttgc ttgggcccag accaactctg 34141 ttagcaactc tgttaggaac tcttgttagg aaactctgtt agtttcctag ggggtgctga 34201 acctagtacg acaaactgga tagtttaaaa taataaatgt attctctcat agttctggac 34261 gctagaagtt tgaaagtgag gtgtaggcag gccacgctcc ctctgaaggc tctagggaag 34321 aatccttcct aggcactctc ccagcttctg gtggcttctg gcaacccttg gtgttccttg 34381 gcttgtagat gcgtcactcc agtctttgcc tccactgtca catggcattc ttcctgtgtg 34441 tgtgtcaaaa tctcactatc cttataagga cacctgtaat tgcacttagg gcacacccta 34501 atgtccagta tgacttcatc ttaactaatt acatctgcaa agaccctatt tccaaataag 34561 ctcacattca caggaaccta gggtcagaac ttcagcgtat cttttggcgg gggatacagt 34621 tcaaccactg caaagtctaa taatgctctt cctactaact gcatagctag ttcacttgta 34681 gttggcatca tggagaacta agggaagtta aagcttgtga aatttaactc ttccacttaa 34741 aataattgat gcttccatcc ttgatgacca gagtttcctg tgaggtaggt cagagtacaa 34801 cttcctgctg agtagctgtg cttcctttca ctaggagctg ggggtacttt catgtcactt 34861 atgaattgtt cttcattttg gtttgggaga gggctgggag tatcagttag ggttccatgt 34921 gtagccacac agggcgaagc tctggccact ggatctgtgt gatgttccat ctgatctgtg 34981 aagccccacc atctgttaat gtgtatttga ggagtggttg gttctttcca gagtataaag 35041 ctggaagcag agtctggaac acttccagtc tgttgtcttt gaacatttga caaagggacc 35101 gtacgatctt actgttcaga gtatcttttt tttttttttt tttttctttt gagacggagg 35161 cttgctctgt cgctagctag gctggagtgc agtggcacga tctgggctca ctgcaacctc 35221 cgggttcaag agtttctctt gcctcagcct cccaagtagc tgggactaca agcgcgtgcc 35281 accacgccca gctaattttt gtatttttag tagagatggg gtttcaccat gttggccaga 35341 atggtctcga tctcttgacc tcatgatctg cccgcccttg gcctcctgaa gtgctgggat 35401 tacaggcata agccatcgcg cccggcctgt tcagagtatc ttttgcagaa tggctgaagc 35461 tcgaggtttt cttcttcacc attatgtact gctgctgtac aaccttttca taattatcct 35521 tagtcccatt cctctaccaa ggtgaaagca acatcttatc agacccgaat tatgaatttc 35581 tcaagcccct atgattttct gttttgggtg ccaagtattt atctcttctg ttagactata 35641 gtctttcttc acatagggtt catgtctata ttggttttat 
ccatgggtgg tttttattct 35701 ccagagtgac tgacctgttc gtaattccgt tcttgagaag gattgttgat tatgttgaag 35761 ggaaggcttc ttaccaagat tttcagattt tgctttcaat gtttatcttt ttggggtttt 35821 gcaggtgaca gagctgaatg aacctctctc caatgaagat cgaaatctcc tctctgtggc 35881 ctacaagaat gtggttggtg ccaggcgatc ttcctggagg gtcattagca gcattgagca 35941 gaaaaccatg gctgatggaa acgaaaagaa attggagaaa gttaaagctt accgggagaa 36001 gattgagaag gagctggaga cagtttgcaa tgatgtcctg tctctgcttg acaagttcct 36061 gatcaagaac tgcaatgatt tccagtatga gagcaaggtg ttttacctga aaatgaaggg 36121 tgattactac cgctacttag cagaggtcgc ttctggggag aagaaaaaca gtgtggtcga 36181 agcttctgaa gctgcctaca aggaagcctt tgaaatcagc aaagagcaga tgcaacccac 36241 gcatcccatc cggctgggcc tggccctcaa cttctccgtg ttctactatg agatccagaa 36301 tgcacctgag caagcctgcc tcttagccaa acaagccttc gatgatgcca tagctgagct 36361 ggacacacta aacgaggatt cctataagga ctccacgctg atcatgcagt tgctgcgaga 36421 caacctcacc ctctggacga gcgaccagca ggatgaagaa gcaggagaag gcaactgaag 36481 atccttcagg tcccctggcc cttccttcac ccaccacccc catcatcacc gattcttcct 36541 tgccacaatc actaaatatc tagtgctaaa cctatctgta ttggcagcac agctactcag 36601 atctgcactc ctgtctcttg ggaagcagtt tcagataaat catgggcatt gctggactga 36661 tggttgcttt gagcccacag gagctccctt tttgaattgt gtggagaagt gtgttctgat 36721 gaggcatttt actatgcctg ttgatctatg ggaaatctag gcgaaagtaa tggggaagat 36781 tagaaagaat tagccaacca ggctacagtt gatatttaaa agatccattt aaaacaagct 36841 gatagtgttt cgttaagcag tacatcttgt gcatgcaaaa atgaattcac ccctcccacc 36901 tctttcttca attaatggaa aactgttaag ggaagctgat acagagagac aacttgctcc 36961 tttccatcag ctttataata aactgtttaa cgtgaggttt cagtagctcc ttggttttgc 37021 ctctttaaat tatgacgtgc acaaaccttc ttttcaatgc aatgcatctg aaagttttga 37081 tacttgtaac tttttttttt ttttggttgc aattgtttaa gaatcatgga tttatttttt 37141 gtaactcttt ggctattgtc cttgtgtatc ctgacagcgc catgtgtgtc agcccatgtc 37201 aatcaagatg ggtgattatg aaatgccaga cttctaaaat aaatgttttg gaattcaatg 37261 ggtaaataaa tgctgctttg gggatattat ctctgtttgg tcttgatttt tcccccctcg 37321 aggaactgtt taaccagtca 
caaattgtgg tttgaatctc tcaaacatgg aagcatttgt 37381 agtacattat ccaagttttc ctcccctctc attacattac atattttgca gcacacacat 37441 aggattcata gattttaaat agttttctgt gaacccagct aacatagcct ctctgatcaa 37501 ttttcctgat aactggtgct ttttaatgag cctcgcagaa caaaatgcag tcttccacag 37561 tatagcacct ctgaaacatt gtgggctctg ttgaacttga gagtgtattt aacactgaag 37621 gtaaatacgg agcttataca cgtcctcaag ttttcttaat gtaataagat actaggggca 37681 caacttgcac ccctgaaatc tttagtgatt attgttaagc atttagtatt ctttaaaata 37741 gaaaagcaaa cctcattgag ttaaaaagcc taacctaccc cttaaacact tgcttaaccc 37801 caatgtgtct ctcgagcctc agcatcactg gctgactgtt cttgagtgag gtggtttcca 37861 gctcctcacg atttttatca cctccagagt ggcacctagg aatgaacgtt tgttaggtga 37921 gttggttcag ttacacagaa cccttaaacg cctttttcta tttgtttgtt tttttgagat 37981 ggagtctcgc tgtgtcatcc aggctggagt gcagtggcat gatctcggct cactgcagct 38041 tccacctccc aggttccagc cattctcatg cctcagcctc ctgagtaggt gggattacag 38101 acgccaccac cacaccaggc taatttttgt atttttagta gagatgaggt tttgagtgtt 38161 ggccaggctg ctctcaaact cctgacttca tgtgatccac ccaccttggc ctcccaaaga 38221 gctgggatta caagcgtgag ccaccgtgcc cagccttaaa caccttcttg actaaagttt 38281 ggcattagac tgatggaaat ggccatgaca caatctctgt ccttgaattt atattgtgga 38341 gcacataact tgtaagcatg taagtttaac ctgtttctta ttctgaagta gagtatgagc 38401 cccgcccccc atgccggggc cgatgaatac ggctcaggaa gcctgtttca agttgaccta 38461 attttaagat aaagtggtta ttgagtttca cttcagttag taggtcaggt atttgaaagt 38521 caacaagctt tatatggcta atacttgctt ttgataagca cgtagacaaa atgttgctgg 38581 gccatggtgg cgtgcgcctg tagtcccagc tgcttgggag gatggagccc gggaggtcga 38641 ggctgcggtg agccgtgatt gctccactgt accccagcct gggagatagg aagacctttt 38701 tttctctctc tctctgtctc tgatagcaag acctctctct ctctctcttt tctctctctg 38761 tctctctctc tctcacacac acacacaaag ggctcttaca ttttcttttt ttgtttgaga 38821 tgggagtctc actcttgtca cccaggctgg agtgcaatgg cacgatctca gctcactgca 38881 acctttgcct cccaggttca agtgattctc ctgcctcagc ctcctgagta gctgggatta 38941 caggcaccca ctgtcgtgcc cggctaattt tggtattttt gtggagatgg ggtttcacca 39001 
tgttggccag gctggtcttg aactcctgac ctcaagtgat cctcccacct cagcctctca 39061 aagtgctggg attacaggca tgagccactg tgcccggtcg cttttacatt tgaaatgtcc 39121 tttgcagttg ctaaagcatt cacatttacg ttttacagct gtcttctgaa aggttttgtc 39181 ccactttaca gatgaggaag ctaaagtgca aaggttgtgg atctctggaa tttatacagc 39241 tggtaagcta gaggacagct tccacccaag atttattcag aattttaacg gccctctggg 39301 gggcctgggg aaccttggca agccatgcca cctttgaggt tcaggtccct cattttttaa 39361 gtgtaggatt tgggttatgt tttaatctaa gttcatggtg tgtaaggcac cttctttggg 39421 aatgaggtat ggggaagtat ggacatacgt gattaatgtt agccacatag gtcgcgtgca 39481 gtggctcatg cctgtaatcc cagtactttg ggaggctgag gtgggtggat cacttgaggt 39541 taggagttcg agaccagtct gaccaatatg gtgaaacccc gcctctacta aaaatacaaa 39601 aattagctgg gtgtggtggc tggtgcctgt aatcccagct acgatggagg ctgaggcagg 39661 agaatcactt gaacccagga ggcggaggtt gcagggagcc gagattgtcc cactgcactc 39721 cagcctgggc aacagagtga gactcggttt caaaaaaaag aaagaaaagg aggttagcca 39781 cataaataat agatgaggct ctttgaaagg acctaagagg gtgctgtaaa ttacatgtga 39841 tgtaagcctg gaagtcttca ctgctgcata ttttttcccc aattctactt aaaacgtttt 39901 aaaacttgtc aagataactt tctcaaatcg ttaagaaggg gtcatctcaa tgttacagca 39961 tgagaaatta aagtccagag aaatgaaata gctttctagg tcatggcaaa attggtgcat 40021 ctggagtagc attcagatcc cgaatccttg tccctggttc tcttaactgc catgctcttt 40081 gggttgagac tagaaagaaa atggatacaa cacatttcac ccttaggaga ctttcagtgt 40141 ttttaatgtt ttcacttaga atggaagaaa ttatacttga aaagggatgt ttaaatttat 40201 tgatgttggc tgggcgcggt ggctcacgcc tgtaatccta gcagtttggg aggccgaggt 40261 gggcggatca cctgaggtca ggagttcgag accagcctga ccaacatgga gaaaccccat 40321 ctctagtaaa aatgcaaaat tagccgggca tggaggcgca tgcctgtaat tccagctact 40381 cagcaggctg aggcaggaga atcgcttgaa cctgcgaggc agaggttgcg gtgagccgag 40441 atcgcgctat tgcactccaa cctgggcaac aagagcaaaa actccgtccc cgcccccccc 40501 ccccccccag aaaaaattat tgttgctatt gcttcagatt tccagtagct cctcaaaccc 40561 tgctcaaaag tccatttctt tgagttgggg gaaggggtag gattcttgat aatgaagcta 40621 ttgttgggta aattcttggc cgtccttaaa tccttcggga tc // LOCUS 
HSNCAMX1 16288 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens gene for neural cell adhesion molecule L1. ACCESSION Z29373 NID g472988 KEYWORDS neural cell adhesion molecule L1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 16288) AUTHORS Rosenthal,A., Coutelle,O. and Drescher,B. TITLE Genomic Sequence and Structure of the Human Gene for the Neural Cell Adhesion Molecule L1/HSAS Locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 16288) AUTHORS Rosenthal,A. TITLE Direct Submission JOURNAL Submitted (13-JAN-1994) Andre Rosenthal, Genome Analysis, Institute of Molecular, Biotechnology, Beutenbergstrasse 11, Jena, 07745, Germany FEATURES Location/Qualifiers source 1..16288 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="B5" /clone_lib="cosmid" /chromosome="X" CDS join(1533..1608,4127..4141,4672..4777,5015..5217, 6192..6314,6411..6581,6871..6982,7131..7315,7437..7568, 7708..7851,8417..8528,8642..8808,8911..9067,9248..9372, 9460..9570,9817..10014,10246..10316,10499..10721, 11551..11666,11870..12071,12160..12282,12376..12549, 12666..12785,12893..13048,13352..13486,13820..13892, 13990..14001,14475..14706) /codon_start=1 /product="neural cell adhesion molecule L1" /db_xref="PID:g472989" /db_xref="SWISS-PROT:P32004" /translation="MVVALRYVWPLLLCSPCLLIQIPEEYEGHHVMEPPVITEQSPRR LVVFPTDDISLKCEASGKPEVQFRWTRDGVHFKPKEELGVTVYQSPHSGSFTITGNNS NFAQRFQGIYRCFASNKLGTAMSHEIRLMAEGAPKWPKETVKPVEVEEGESVVLPCNP PPSAEPLRIYWMNSKILHIKQDERVTMGQNGNLYFANVLTSDNHSDYICHAHFPGTRT IIQKEPIDLRVKATNSMIDRKPRLLFPTNSSSHLVALQGQPLVLECIAEGFPTPTIKW LRPSGPMPADRVTYQNHNKTLQLLKVGEEDDGEYRCLAENSLGSARHAYYVTVEAAPY WLHKPQSHLYGPGETARLDCQVQGRPQPEVTWRINGIPVEELAKDQKYRIQRGALILS NVQPSDTMVTQCEARNRHGLLLANAYIYVVQLPAKILTADNQTYMAVQGSTAYLLCKA FGAPVPSVQWLDEDGTTVLQDERFFPYANGTLGIRDLQANDTGRYFCLAANDQNNVTI MANLKVKDATQITQGPRSTIEKKGSRVTFTCQASFDPSLQPSITWRGDGRDLQELGDS 
DKYFIEDGRLVIHSLDYSDQGNYSCVASTELDVVESRAQLLVVGSPGPVPRLVLSDLH LLTQSQVRVSWSPAEDHNAPIEKYDIEFEDKEMAPEKWYSLGKVPGNQTSTTLKLSPY VHYTFRVTAINKYGPGEPSPVSETVVTPEAAPEKNPVDVKGEGNETTNMVITWKPLRW MDWNAPQVQYRVQWRPQGTRGPWQEQIVSDPFLVVSNTSTFVPYEIKVQAVNSQGKGP EPQVTIGYSGEDYPQAIPELEGIEILNSSAVLVKWRPVDLAQVKGHLRGYNVTYWREG SQRKHSKRHIHKDHVVVPANTTSVILSGLRPYSSYHLEVQAFNGRGSGPASEFTFSTP EGVPGHPEALHLECQSNTSLLLRWQPPLSHNGVLTGYVLSYHPLDEGGKGQLSFNLRD PELRTHNLTDLSPHLRYRFQLQATTKEGPGEAIVREGGTMALSGISDFGNISATAGEN YSVVSWVPKEGQCNFRFHILFKALGEEKGGASLSPQYVSYNQSSYTQWDLQPDTDYEI HLFKERMFRHQMAVKTNGTGRVRLPPAGFATEGWFIGFVSAIILLLLVLLILCFIKRS KGGKYSVKDKEDTQVDSEARPMKDETFGEYRSLESDNEEKAFGSSQPSLNGDIKPLGS DDSLADYGGSVDVQFNEDGSFIGQYSGKKEKEAAGGNDSSGATSPINPAVALE" exon 1533..1608 /number=1 /label=ex1 prim_transcript <1533..>14706 intron 1609..4126 exon 4127..4141 /number=2 /label=ex2 intron 4142..4671 exon 4672..4777 /number=3 /label=ex3 intron 4778..5014 exon 5015..5217 /number=4 /label=ex4 intron 5218..6191 exon 6192..6314 /number=5 /label=ex5 intron 6315..6410 exon 6411..6581 /number=6 /label=ex6 intron 6582..6870 exon 6871..6982 /number=7 /label=ex7 intron 6983..7130 exon 7131..7315 /number=8 /label=ex8 intron 7316..7436 exon 7437..7568 /number=9 /label=ex9 intron 7569..7707 exon 7708..7851 /number=10 /label=ex10 intron 7852..8416 exon 8417..8528 /number=11 /label=ex11 intron 8529..8641 exon 8642..8808 /number=12 /label=ex12 intron 8809..8910 exon 8911..9067 /number=13 /label=ex13 intron 9068..9247 exon 9248..9372 /number=14 /label=ex14 intron 9373..9459 exon 9460..9570 /number=15 /label=ex15 intron 9571..9816 exon 9817..10014 /number=16 /label=ex16 intron 10015..10245 exon 10246..10316 /number=17 /label=ex17 intron 10317..10498 exon 10499..10721 /number=18 /label=ex18 intron 10722..11550 exon 11551..11666 /number=19 /label=ex19 intron 11667..11869 exon 11870..12071 /number=20 /label=ex20 intron 12072..12159 exon 12160..12282 /number=21 /label=ex21 intron 12283..12375 exon 12376..12549 /number=22 /label=ex22 intron 12550..12665 
exon 12666..12785 /number=23 /label=ex23 intron 12786..12892 exon 12893..13048 /number=24 /label=ex24 intron 13049..13351 exon 13352..13486 /number=25 /label=ex25 intron 13487..13819 exon 13820..13892 /number=26 /label=ex26 intron 13893..13989 exon 13990..14001 /number=27 /label=ex27 intron 14002..14474 exon 14475..14706 /number=28 /label=ex28 BASE COUNT 3053 a 5067 c 4630 g 3538 t ORIGIN 1 ctgcctcagc ctcctgagtt gctgggttta caggcatgag caccacgcct ggcttttttt 61 tttttttttt tttttttttt tgtattttta gtagagacag gatttcacca tgttggccag 121 gctggtctca aactgctgac ctcatgatcc gcctgcctca gcctcccaaa gtgctgggat 181 tacaggcgtg agccaccacg cccagcccta attttgtatt tttactagag acagggtttc 241 accatgttgg cctggctggt tttgaactcc tgacttcagg tgatccaccc acctcggcct 301 cccaaagtgc tgggattaca ggcatgagcc actgtgcctg gccttttatt tttattttta 361 tttatttttt tagacagagt ctcactccat cacccaggct ggggggcagt ggcacgatct 421 tggctcactg caacctccgc ctctcaggct caagtgattc tcatgcctca gcctcccaag 481 tagctggcat tacaggcgcc caccaccaag gctaattttt tttctttttg tatttttagt 541 agagacgggg ttccgccatg ttggccaggc tggtctcaaa ctcccaacct caagagatcc 601 acccacctca gcctcccaaa ctactgggat tacaggcgtg agccaccacg cctggccctt 661 tccccattct ttcttcattt tcctgaaact attttgtgaa atgctgccat cttgggcccc 721 tttcctccct tctctatccc acttcccaat ccccccatta gctttttagg tctgtacagc 781 acccaggcat cagttggctc tgaaagctat agcaatgtga ctgcaagaag aattcttctc 841 tacatgtgtg tgcacacaga gataatagat acagggattc atccccacag atacataaag 901 acatacaaga agacacacgt agctagcaga cttgtgtgtt gattccctca ggcacacaca 961 ctaacacacc cactcatgag tgcaacccct cccagagaag acacaccaca gaaataccag 1021 ctcacataca gctcactccc tgcttgcaga tccacaacca cacacatgcc cacggacacc 1081 tgcacatgca gacacatacg gggatgtgca tgccatgcac aggggagctg atggggcaac 1141 aagttggtgc tgggaaggca gctggcagga aaggtgcaga cagagagtag agagggaata 1201 aaaggcaacc tagaggccca gccagggcta tctcttccag atttctccct ccaaggtggg 1261 gaggggtgcg gatggggagg ggagggctgg agaagccggc cagctggggg cggggctgag 1321 ggcggggcca agagcccggg cttacccaga tgttagtcac taacgtcctt 
ccgttctctc 1381 gtctctcttc ccccaccctt cccttccctt ctcccctctc ccagtgcccc cactcccaac 1441 tcccgcccca agccgcccac cagccccctt cccctccggc cggagcctga accgagcccg 1501 ggtggctgtg ctgcgcggtg ccgccgggaa agatggtcgt ggcgctgcgg tacgtgtggc 1561 ctctcctcct ctgcagcccc tgcctgctta tccagatccc cgaggaatgt gagtagctga 1621 agggagcctg ggcgcctacc ttcgccgcta tgttgggccc ctctttatct tctacctcct 1681 caccttctcc ctgttccgga ccctgactcc tctccctggc ctttcctggg cacagggtgg 1741 gcaggagata gccaggacgt tgggtacccc cggagagaag ggcagggtag aaggtgcggg 1801 gcgtgtcagg accatggaca gggacaggcg cggccgccgg ggcgatttca gcaccaggga 1861 cagcgaacct ggagcagggc cccgaggtct tgggcctcct ccatgctctc cttagcccgc 1921 cccatctctg cccgccccac tccgactcct ccgcttccat cccccaccac aggggtgtgg 1981 cagcaggaga atggggtcgt ccccagccct cagctggctt gtgcaaaggc ctgtgttcaa 2041 gtcttgtccc catttctctt tcccctcctg ggcccttggt caccctgtct ctgaactgag 2101 acaggaaaag gtggcttgaa aaaccctggt cccagcccag cagccttctg gacccttttt 2161 tgtgtgtagt tatttggtgg gggtgggggc agggctgctg gagcatgtgt ctatggttgt 2221 aacatttgtt gggggcacaa gcaggtgagt ggatgttggg gagtgtggcc gtagggttgt 2281 gattaagcat ggatacactg tgtatgtctg tgtgttcttg tgattgactg agcgtagata 2341 ttcgtgaatg tgattttgac ttccttacat gggtgactgt gtgagtcact ctgttactta 2401 ctggccaggc ttgtggaggt cgggaggtat ttgaccacga catttgattg cattgggtca 2461 tgtgtatgtg tgagtggggc tgaatgtaag tacacacttg cgtgtggggg gtgttaccgt 2521 gatggtgttt tctcctgtaa gtagctgtca ggccgtgtga ggggcatgtc acagggtatc 2581 atgtgtgagg gcatttgtgt ctcttgagct accaggctcc agtcccttcc cagcctcacc 2641 ttgacacctt tgtctctggg cctggacaag gaggaggcca gtgtcactga gagacgagtt 2701 ggcggatagg ggttgggaag actgtctgtg ccagcagtgg gacaggggct gcgagagaga 2761 ggctggggag gcacctggcg ttgggaggca gtgagggctt ggaggaggta aaggggctgg 2821 ccccttagac acatacatgg agggccaggg ccaggcagac tcccttggta cagatgcaca 2881 tggacacatg cacacaggtg catgaagaca aggcaggtgc agggatgaag gagcagagcc 2941 gctgagaggc cacacaggtg gaaacacaca ccagtgaggc ttttagaccc agaggacaca 3001 gaggtggata cacaatataa cactgtacag tggaatgcaa gccgtgcatt ctctacagaa 
3061 agatgggagt ggggagagag aactaggcca aaaggccatg gcagctccaa ggctgccctg 3121 gtcactgacc agcacttgtt cttgaccctg atacgacaca gtgggcagtg gcaggcagcc 3181 ggccctccct gttggaacct gtcttccatc tctccctccc cagccactcc agcccccttt 3241 gtctcttttc ctgcttcctc cacctagttg gctttctgca tcctgtcatc cctgtcacca 3301 tctccctccc cagctcaggg ccctctgcag tgtccccagg gagccccctc ctccctccct 3361 ggaccttcac actgaggtct tctgcagaga caaagctttc agaggcagcc tgctgctccc 3421 tccctccctc ttctcccttc ctcgaggccg ccctggccag tgctctgcag agtcctgggg 3481 ctgggacagc agctctggcc tctgccccag agagcttggt ggctgtctgg gaggaaggag 3541 ggagaagagg agggctggcg ccaaccccgc aggctatgga aactcagagg cagcagggcc 3601 tttgggatag ttttcagggg gagggggcag cagggctctg aaggctgggc cctggtaatt 3661 gaaaacactc agctgtgcta attggattag gagggaagca ggctggagca ggggacggtg 3721 ccttccaagg catcctgaat gggtggggtc agttctgcag gccctgccag gccctgccag 3781 ggggctgcaa ggacactcct tgttaaaggc aagaatttct ctctccctgt ctgctagagc 3841 tggaggcatg gcctggctgc ctctgggcta catgccctgt gtgtctatcc agtggagctc 3901 ctctccctgg gcatctctct gtcccatctc cctgggcttg acctgccact gctccagccc 3961 agcgtgtgcc cccttgttgc cactgccagg gcctcctctc ctgccattcc ctctccacgg 4021 gggggctgca ggagcttact atgtcccctg ccatctgccc cactctgggg ctgcctgtca 4081 gtatttctct ccattccctc ccttgctgcc tgcctggatt tctcagatga aggacaccat 4141 ggtaagttgg ggaggggtga gagtgtttcc cagaaggcca aggagaaggg taaggaggcc 4201 ctcagctgag cctgcacccc cgccctcctg agtgccgggg ccctgctcgg ctctccctga 4261 gcaccatgtg cttctctcac ctgtgtgggg ctggctttgt tctgtctctg ccccagcctg 4321 ggtactctgt caccctcact gttcctctct gccattctct ctggtccctt ctcccctgtc 4381 tggggtttgt ccttcttggt ttctgtgtct tggcttttgt tgcccccgct tctttctgtc 4441 ataccctttt tgtctttctc aggcatctct ttgtaactta cgtggctgtc ccctctctcc 4501 ctcttttcat gtcccacaag cctcttttgt tcttcccctc cgttcgtccc cctgtggcca 4561 tggggcctcc aggggctctg gaccctgctt ccctcacgat gggacaggtg ctgaggctat 4621 gacaccagcc aggcagcacc ctcaccaccg ctctctccac cctgtcttca gtgatggagc 4681 cacctgtcat cacggaacag tctccacggc gcctggttgt cttccccaca gatgacatca 4741 
gcctcaagtg tgaggccagt ggcaagcccg aagtgcagtg agtgatctct gccgtctcag 4801 cccctgttgc ctgccactga ccccggcctt gtgcctccct ggactgccta acttctaagc 4861 ccagaagccc caaaccttgc taatcacccc aagtcctcgc catctcccag tccatcccct 4921 gggggccctg gctgcttgca cctgtccctc cagcctgttc cagcccccgt tgggttccag 4981 ggcctcaggc ctctgggcct ttttccgccc acaggttccg ctggacgagg gatggtgtcc 5041 acttcaaacc caaggaagag ctgggtgtga ccgtgtacca gtcgccccac tctggctcct 5101 tcaccatcac gggcaacaac agcaactttg ctcagaggtt ccagggcatc taccgctgct 5161 ttgccagcaa taagctgggc accgccatgt cccatgagat ccggctcatg gccgagggtg 5221 cgaggcgctg cagggaccag ggagggcagg tcccggagtt cgtgtgggat tgtgggctgg 5281 aagctgctgt cagccagttg tctcgtggcg aggggcctgg gtgtcctgcc agccctccac 5341 cccaccccag cttctcccct ggcccatctt gtttcttgtg cagtctcctc ctttgccctg 5401 tctctgtctc cattgcactt ccggtcatct catgcccttt cactccctca ggctttcctg 5461 gtgcctgcct catggctgag caggaaggtg gcggggggag gggggggcgg cctggtgggg 5521 ggagtgactg attctgggca gacctgcctg agtcactgca gacatctctg tctacacctg 5581 cactccttag cagggcccct cttctctggc ttttccactt ggctctccat ctcctcccct 5641 ccacctccga cttggtctct cttgctccac ggctctgtct tgatctgtgt cgccttttcc 5701 ctggggtttt ccctttgcat tccatcctcc ctgagctttc tgcttcctga gcgctctcct 5761 ctcccctccc cacccctcca gttaccttgg gaaccgttga catggtttct gtgtcaggat 5821 gatggaaggg tctgtgactc ctgcatccct ccacagctcc cctcccccat cactgcacct 5881 cagcctcagc ctccccccac ccgtcctttc ctagggtggg cagggagagc tgagaaccgt 5941 ggaaggttaa cccttcctgg cctcctggtc tcctgctcac cctggcctcc ctaagtgcta 6001 gtgctctgct atgagtccag aaggtatcca gagtgtctac ctccttttct gtgaagtttc 6061 tactggcctg accctctcgc cccctcgtcc atgccctggt gtagcccgcc acccccctgc 6121 ccccacctcc cttctcctgc ctagcaccat ctggaccagg aggagagtgt cagcccgtct 6181 gtcccttcta ggtgccccca agtggccaaa ggagacagtg aagcccgtgg aggtggagga 6241 aggggagtca gtggttctgc cttgcaaccc tcccccaagt gcagagcctc tccggatcta 6301 ctggatgaac agcagtgggt gccggcgagc ggctgcctcg tggggtcggg gtgttagcgg 6361 gggtgtcttc ctggacgggg tcatgacttc ggcctccttg ccccctgcag agatcttgca 6421 catcaagcag 
gacgagcggg tgacgatggg ccagaacggc aacctctact ttgccaatgt 6481 gctcacctcc gacaaccact cagactacat ctgccacgcc cacttcccag gcaccaggac 6541 catcattcag aaggaaccca ttgacctccg ggtcaaggcc agtgagtctc agaggctctg 6601 cactctctcc ctgactcctt tcaccctcag gcatggccct tcccaccata ggtccatctg 6661 acaactccag gcttcctcaa gaacagcttg gctcccacct ctaccctgcc cagacccctt 6721 cccaggcgtc gagagcagag acaggacact ctccttaatg gaggaagaca cagaggcccc 6781 ggggctggct ggagtgtcga gtccctcagc cttgcaattc tggggtggag ggaagagtgg 6841 gcagcccacc cacccacttt ctcctggcag ccaacagcat gattgacagg aagccgcgcc 6901 tgctcttccc caccaactcc agcagccacc tggtggcctt gcaggggcag ccattggtcc 6961 tggagtgcat cgccgagggc ttgtgagtct gggaacccag ggcagggcag ggcagggcag 7021 ggttggcgga gccctagcag ggagctcaga ccaccgcggg gtataggggg ctggggtgat 7081 tggccttgtc ctttctccct tctgctctct tccctttgcc tcccgggcag tcccacgccc 7141 accatcaaat ggctgcgccc cagtggcccc atgccagccg accgtgtcac ctaccagaac 7201 cacaacaaga ccctgcagct gctgaaagtg ggcgaggagg atgatggcga gtaccgctgc 7261 ctggccgaga actcactggg cagtgcccgg catgcgtact atgtcaccgt ggagggtatg 7321 gacctcctgg gacagtggcc tgtgatgccc actgtcatgg gggaggggag ggtctgcgcc 7381 gggtcaagag ccagctgctg gctgcgggct cagggcggct ctccccttcc tcccagctgc 7441 cccgtactgg ctgcacaagc cccagagcca tctatatggg ccaggagaga ctgcccgcct 7501 ggactgccaa gtccagggca ggccccaacc agaggtcacc tggagaatca acgggatccc 7561 tgtggagggt gagcagggcc tcaggcagga gtggggagtg tggcgagccc tggggactgt 7621 gggggagagg aagaggaggg tgaggaaggg cctcaacgca gctcaatgcc gggctgtggc 7681 ccatgggccc acctgctgca tgcacagagc tggccaaaga ccagaagtac cggattcagc 7741 gtggcgccct gatcctgagc aacgtgcagc ccagtgacac aatggtgacc caatgtgagg 7801 cccgcaaccg gcacgggctc ttgctggcca atgcctacat ctacgttgtc cgtgagtgcc 7861 ctcccttcct tacctcctaa ctcccagtct gtccctgcac ccactgggcc agcgaccaga 7921 gtggctcagc tggcagctga tgtgtcctcg gcctaccagg ctggcgccga cctttctcct 7981 tgctgacacc cttcgcccca tctcactcag cctcacgata gccgagggag atgtaagagg 8041 ggaaaggtct tgtcactgtg actgagcgga ccgctgtgcc acaggatgtg accaaggcca 8101 gcccaggtgg cacctctcct 
ggaacttcag ggaaggctgg ggactaggcc aggaaggctt 8161 ctggggagag gtgactgtca gttaggcctg gaataacgta aaagttgggt ggctgtgacc 8221 agaggcgcat ctgaggcaga gggcagtgtc tgaaggcaga gagagacaca gcctggcggg 8281 ggcctcagct gggagcaggg aaccaagatt tgcagggctc ttgggggccc gaggaagccg 8341 tgcagagggt ccctggctcc tggcccagct gtggtcccaa gtcctgccct gtctcctgag 8401 gctgttcctc ccctagagct gccagccaag atcctgactg cggacaatca gacgtacatg 8461 gctgtccagg gcagcactgc ctaccttctg tgcaaggcct tcggagcgcc tgtgcccagt 8521 gttcagtggt gagtgtctcg tcctggtagt ggtgagtgtc gtgtcccagt ggccagggag 8581 ccagggaggg cagggagccc aggccagcca gtcagagcca ggcccgccct gctccctcca 8641 ggctggacga ggatgggaca acagtgcttc aggacgaacg cttcttcccc tatgccaatg 8701 ggaccctggg cattcgagac ctccaggcca atgacaccgg acgctacttc tgcctggctg 8761 ccaatgacca aaacaatgtt accatcatgg ctaacctgaa ggttaaaggt caggcaaccc 8821 ttgccacatg gctggcagtg cctctgggag ggagggtctg ggcctctgga ggacaacaga 8881 gtgacttccc cacgcacgca ttaccctcag atgcaactca gatcactcag gggccccgca 8941 gcacaatcga gaagaaaggt tccagggtga ccttcacgtg ccaggcctcc tttgacccct 9001 ccttgcagcc cagcatcacc tggcgtgggg acggtcgaga cctccaggag cttggggaca 9061 gtgacaagtg aggacagtga cggtgaaagg gggcagagtg ggaaaagctg gaagtccaga 9121 cctcttggcc tcgtccttgc tttgatgggg aaggctctga caggagcggg ggaaggagac 9181 aggagggagg gattgggagg ggagaagagc ccagatggca ggaaagacag acacgcctcc 9241 tccgcaggta cttcatagag gatggtcgcc tggtcatcca cagcctggac tacagcgacc 9301 agggcaacta cagctgcgtg gccagtaccg aactggatgt ggtggagagt agggcacagc 9361 tcttggtggt gggtaagtcc tagggtggaa gcctccactc cagaaggccg gggctcacat 9421 ccctgggccc tttcaagcac cgaccctccc caccctcagg gagccctggg ccggtgccac 9481 ggctggtgct gtccgacctg cacctgctga cgcagagcca ggtgcgcgtg tcctggagtc 9541 ctgcagaaga ccacaatgcc cccattgaga gtaagaggct tgagctcagt gccaccccac 9601 cgtcacttct gccagccctg gcccctcctg gggtcctccc tcctcctgga gccttcctgg 9661 ggggacacgg tttgggggga ctcacggttc ctcatgggaa tctggagatg ccagttggcc 9721 tgggtaatca gggacccaga ctgtacccaa cccctccaca gcccttcccc caaagccaca 9781 tgctgatcac tccattgtcg gttcatttct 
tggcagaata tgacattgaa tttgaggaca 9841 aggaaatggc gcctgaaaaa tggtacagtc tgggcaaggt tccagggaac cagacctcta 9901 ccaccctcaa gctgtcgccc tatgtccact acacctttag ggttactgcc ataaacaaat 9961 atggccccgg ggagcccagc ccggtctctg agactgtggt cacacctgag gcaggtgagt 10021 cagggtggca cccactccca tgccacctgg aaggggctcc gggctgtgaa gggagggctt 10081 agagaggtgc catgcccaag ttcctctgct tccaaatttc cagggggagc gcgagggggc 10141 tggactccct tggtgccaga atgagccatt tgttccttgg ccctcggtcc tcgtggctct 10201 ccaaaagagg ctgcagcatt gatgtgttaa tactttcctt tccagcccca gagaagaacc 10261 ctgtggatgt gaagggggaa ggaaatgaga ccaccaatat ggtcatcacg tggaaggtga 10321 gagtccctgg gctgagctgc ctgggggaca tgaggccatg acctgggtgt ctgttgctcc 10381 cccttggcta tgcaaaggaa ggggctgatg tctggggatg ctggtttgtg ctggggggcg 10441 ggccaaagaa tgctggtgtt cttggccaac caactcctct tctgctggcc tcctccagcc 10501 gctccggtgg atggactgga acgcccccca ggttcagtac cgcgtgcagt ggcgccctca 10561 ggggacacga gggccctggc aggagcagat tgtcagcgac cccttcctgg tggtgtccaa 10621 cacgtccacc ttcgtgccct atgagatcaa agtccaggcc gtcaacagcc agggcaaggg 10681 accagagccc caggtcacta tcggctactc tggagaggac tgtgagtatc cggcagggcc 10741 cctcgcccac ggttgagctc gcctgcttcc accccagccc atgtccacac ctctcagcat 10801 tggtgtgcgc ctttcctaag gatgaggata ttctcttaca taaccagagc gacgttatga 10861 aattccccac acttcacatt tctataattg gcagcggtat tccagttata tcagccaatg 10921 ctgcttcttt catttccttg aatctggaac atgaccagat ggtaacttga agtacccggt 10981 caagtttttt ttctggtttc tccttttttt cttttctttt ttcttttctt ttcttttctt 11041 tctttttttt tttttttgag atggagtctt gctctgtcat ccaggctgga gtgcagtgga 11101 gcggtcttgg ctcactgcaa cctctgcctc ccgggttcaa gcgattctcc tgcctcagcc 11161 tcctgattag ctgggattac aggcacgcgc caccacaccc aactaatttt cgtatttagt 11221 agagatgcgg ttgcaccatg ttgatcaggt tggtctcgaa ctccagacct caaatgttcc 11281 actcgcctct gtctcccaaa gtgttgggaa tacgggtgtg aaccatcacg cccagccatg 11341 gtttctcctt tgtataagct actattttca cctttgcaac taataagcag tccaggaaga 11401 gatcttgaga ccatgcaaac acacctctcc tcctcaaact gtacctcata aaatatccga 11461 attcgtcttc 
tctgtgtgta ggggcttggg cctggcctca gggctggggt gagagcctat 11521 ggctctatcc aagtcactgg tggtttccag acccccaggc aatccctgag ctggaaggca 11581 ttgaaatcct caactcaagt gccgtgctgg tcaagtggcg gccggtggac ctggcccagg 11641 tcaagggcca cctccgcgga tacaatgtaa gggttgaagg catgggggcc caggagggtg 11701 atcactgagg taaagcgaca ggatggtgag ggggctttgc cacctgccag ccgctctcct 11761 ggaagacttg ggttgagagg agggtcccca ttaagggtgg tgggggtgtg cgctgggggc 11821 cctgctgggt gccttctgcc cctgcgaggt ctcctgttcg cttgtgcagg tgacgtactg 11881 gagggagggc agtcagagga agcacagcaa gagacatatc cacaaagacc atgtggtggt 11941 gcccgccaac accaccagtg tcatcctcag tggcttgcgg ccctatagct cctaccacct 12001 ggaggtgcag gcctttaacg ggcgaggatc ggggcccgcc agcgagttca ccttcagcac 12061 cccagaggga ggtgagtcct gcaccccacg cctcatcccc tgccctcctc cccagacctg 12121 ggcgaggacc tacctgccac tcggttctgt gccctgcagt gcctggccac cccgaggcgt 12181 tgcacctgga gtgccagtcg aacaccagcc tgctgctgcg ctggcagccc ccactcagcc 12241 acaacggcgt gctcaccggc tacgtgctct cctaccaccc ccgtgcgtgc gccgccccag 12301 cagggaaggg aggtggaggg gccacgggga gggggcagag ctgcagccac agccaacccc 12361 tgtctgtccc cacagtggat gaggggggca aggggcaact gtccttcaac cttcgggacc 12421 ccgaacttcg gacacacaac ctgaccgatc tcagccccca cctgcggtac cgcttccagc 12481 ttcaggccac caccaaagag ggccctggtg aagccatcgt acgggaagga ggcactatgg 12541 ccttgtctgg taagctggag gaatgacctg ccaacaggga ggccccggcc agccgggtcc 12601 aggagggagg gccttagact tcctggcagc tgccacactc tcctcgttcc cctgcatccc 12661 cccagggatc tcagattttg gcaacatctc agccacagcg ggtgaaaact acagtgtcgt 12721 ctcctgggtc cccaaggagg gccagtgcaa cttcaggttc catatcttgt tcaaagcctt 12781 gggaggtaag cgtgaaaggg ggccttgggg cgtggtgggg aagggggctc tgggccagag 12841 gggttgcctg gcactccgac tcacccctgc tgccaccctc tctccctggc agaagagaag 12901 ggtggggctt ccctttcgcc acagtatgtc agctacaacc agagctccta cacgcagtgg 12961 gacctgcagc ctgacactga ctacgagatc cacttgttta aggagaggat gttccggcac 13021 caaatggctg tgaagaccaa tggcacaggt gaggcgccgg gggcccgcca tcacctgcca 13081 gggaagcatg ggggctgcac aattcatgtc cttctgtccc ctgaatggcc actactgccc 
13141 aggctccctc cacggctcct gtctgcttct cctcccagaa tctgggtgtc cccgaatcag 13201 ccccctttca taccctttcc gcagatgccc ttttccctgt ccctgtcact ccttagcccc 13261 ccagagggtc ccaactttaa gagcatactt cagggcctca ggtcgggttc tggcttgggc 13321 ggcagcacgg ggggctggtg tctcacctca ggccgcgtga ggctccctcc tgctggcttc 13381 gccactgagg gctggttcat cggctttgtg agtgccatca tcctcctgct cctcgtcctg 13441 ctcatcctct gcttcatcaa gcgcagcaag ggcggcaaat actcaggtag cctctgagcc 13501 ccacgtgacg ggggtggagc cgagggcaga gaggagcacc tcgccccttc cacccttctg 13561 caaggcctcc tggattcccc gagctcccca caagatgaca ggcgcatggt gggggcaggc 13621 agggggatca tcctcccagg ccggggttgg ggcaacaggc aagtggtggc tggaccgcag 13681 ggggtgagcg gtttctttca aggcctcttg cctctgggcg ctgttgagac agagtgctgg 13741 gggtgcgggg cgtagagggg cctggagccc cgggtcaggc tggggcggga gaagaagctg 13801 tccctctctg tgtctccagt gaaggataag gaggacaccc aggtggactc tgaggcccga 13861 ccgatgaaag atgagacctt cggcgagtac aggtgagccg gggcaggagt gggtgctggc 13921 actcagctcc accctggcct tcacatctca ccccctctct ctcttctctg tgctgccggg 13981 tgacttcagg tccctggaga ggtaaggcag gggcggtggg gaggggccaa cagcaaggtc 14041 tccctataga caatgtgcgc ctggtgtgcc attgtcccct ggctgccggg gctctgacag 14101 gactctcact tcccagcccc caggcccggc agctcctcat ttcagggaga ggccaacacg 14161 ccccagcccc tcctctccac acagatacct tcactgctcc cctctcccca agggaggccc 14221 cagacagctt cccagacagg aaccaggccc cataatcccc tggccgagcg cacattttct 14281 ccctcacagt ccccaggatc tggggaatgc ccaaagatga cagctccaga cctgcctgcc 14341 agggtggcca gagctgctct ttctgagcgc attctaaaca aatggaaggc aggcgggcgt 14401 ctgcctcacc cccagccagg ctcctgcagg ggctcggcag tgctctcact cgcacctgcc 14461 ctgtgcctct gcagtgacaa cgaggagaag gcctttggca gcagccagcc atcgctcaac 14521 ggggacatca agcccctggg cagtgacgac agcctggccg attatggggg cagcgtggat 14581 gttcagttca acgaggatgg ttcgttcatt ggccagtaca gtggcaagaa ggagaaggag 14641 gcggcagggg gcaatgacag ctcaggggcc acttccccca tcaaccctgc cgtggcccta 14701 gaatagtgga gtccaggaca ggagatgctg tgcccctggc cttgggatcc aggcccctcc 14761 ctctccagca ggcccatggg aggctggagt tggggcagag 
gagaacttgc tgtctcggat 14821 ccccttccta ccacccggtc cccactttat tgccaaaacc cagctgcacc ccttcctggg 14881 cacacgctgc tctgccccag cttgggcaga tctcccacat gccaggggcc tttgggtgct 14941 gttttgccag cccatttggg cagagaggct gtggtttggg ggagaagaag taggggtggc 15001 ccgaaagggt ctccgaaatg ctgtctttct tgctccctga ctgggggcag acatggtggg 15061 gtctcctcag gaccagggtt ggcaccttcc ccctccccca gccactcccc agccagcctg 15121 gctgggactg ggaacagaac tcggtgtccc caccatctgc tgtcttttct ttgccatctc 15181 tgctccaacc gggatgggag ccgggcaaac tggccgcggg ggcaggggag gccatctgga 15241 gagcccagag tccccccact cccagcatcg cactctggca gcaccgcctc ttcccgccgc 15301 ccagcccacc ccatggccgg ctttcaggag ctccatacac acgctgcctt cggtacccac 15361 cacacaacat ccaagtggcc tccgtcacta cctggctggg gggcgggcac acctcctccc 15421 actgtccact ggacggcctc ttccaggcgg ccccaccccc caggacccct tagcagccct 15481 gccctcccta tcgtctgaac agttgtcttc ctcagcctcc tcccgccccc accttgggaa 15541 tgtaaataca ccgtgacttt gaaagtttgt acccctgtcc ttccctttac gccactagtg 15601 tgtaggcaga tgtctgagtc cctaggtggt ttctaggatt gatagcaatt agctttgatg 15661 aacccatccc aggaaaaata aaaacagaca aaaaaaaagg aaagattggt tctcccagca 15721 ctgctcagca gccacagcct ccctgtatgc ctgtgcttgg tctactgata agccctctac 15781 aaaaaaacaa aagtatatat atatatgtac ataatatcag aattataaca ggcaaataaa 15841 acctgaaaat caagagcacc ttttgttatt tcgcggaaca cttacttccc ctgtccacgt 15901 cacactgtgg gaggtgctag gagcgacctg ccaggcgggc ccgggtgtgg cccagccaca 15961 cagtcctgat gcttccctgc cccttcccag agcccgggcc ttccacccac cctgccaggc 16021 ctggctgttg gctgccttgg gtgttgctgt cacattgttg ctggggctca ggatcctcgc 16081 ctctgctggc ctcctgagac caagcttagc tgtcaccatg gccgggccct tcactgctca 16141 aacatgagag tagagtgtgg ccagcggcag gagcctcttc ccctgtctca agggacacag 16201 ctccatggct cagtggaaga ctcatgatgt gccccggtgg catcgttctc tctggggtac 16261 ccagccctct aaaccagtac aaagcacc // LOCUS HSNFLG 4682 bp DNA PRI 13-FEB-1997 DEFINITION Human gene for neurofilament subunit NF-L. ACCESSION X05608 S42443 NID g1495072 KEYWORDS neurofilament; NF-L gene. SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4542) AUTHORS Julien,J.P., Grosveld,F., Yazdanbaksh,K., Flavell,D., Meijer,D. and Mushynski,W. TITLE The structure of a human neurofilament gene (NF-L): a unique exon-intron organization in the intermediate filament gene family JOURNAL Biochim. Biophys. Acta 909 (1), 10-20 (1987) MEDLINE 87214213 REFERENCE 2 (bases 1 to 4682) AUTHORS Beaudet,L., Charron,G. and Julien,J.P. TITLE Origin of the two mRNA species for the human neurofilament light gene JOURNAL Biochem. Cell Biol. 70 (5), 279-284 (1992) MEDLINE 92360234 REFERENCE 3 (bases 1 to 4542) AUTHORS Charron,G., Guy,L.G., Bazinet,M. and Julien,J.P. TITLE Multiple neuron-specific enhancers in the gene coding for the human neurofilament light chain JOURNAL J. Biol. Chem. 270 (51), 30604-30610 (1995) MEDLINE 96107220 COMMENT Data kindly reviewed (15.5.88 ) by Julien J P. FEATURES Location/Qualifiers source 1..4682 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHNFL" misc_binding 164..173 /bound_moiety="sp1" CAAT_signal 239..246 /note="putative" TATA_signal 273..278 /note="putative" exon 300..1442 /note="putative" /number=1 CDS join(396..1442,2625..2749,3135..3454,3978..4120) /codon_start=1 /product="NF-L" /db_xref="PID:e259346" /db_xref="PID:g1279504" /translation="MSSFSYEPYYSTSYKRRYVETPRVHISVRSGYSTARSAYSSYSA PVSSSLSVRRSYSSSSGSLMPSLENLDLSQVAAISNDLKSIRTQEKAQLQDLNDRFAS FIERVHELEQQNKVLEAELLVLRQKHSEPSRFRALYEQEIRDLRLAAEDATTNEKQAL RGEREEGLEETLRNLQARYEEEVLSREDAEGRLMERRKGADEAALARAELEKRIDSLM DEISFLKKVHEEEIAELQAQIQYAQISVEMDVTKPDLSAALKDIRAQYEKLAAKNMQN AEEWFKSRFTVLTESAAKNTDAVRAAKDEVSESRRLLKAKTLEIEACRGMNEALEKQL QELEDKQNADISAMQDTINKLENELRTTKSEMARYLKEYQDLLNVKMALDIEIAAYRK LLEGEETRLSFTSVGSITSGYSQSSQVFGRSAYGGLQTSSYLMSTRSFPSYYTSHVQE EQTEVEETIEASKAEEAKDEPPSEGEAEEEEKDKEEAEEEEAAEEEEAAKEESEEAKE EEEGGEGEEGEETKEAEEEEKKVEGAGEEQAAKKKD" intron 1443..2624 /number=1 exon 
2625..2749 /number=2 intron 2750..3134 /number=2 exon 3135..3454 /number=3 intron 3455..3977 /number=3 exon 3977..>4407 /number=4 polyA_signal 4407..4415 BASE COUNT 1288 a 1047 c 1180 g 1167 t ORIGIN 1 aaggatccaa gtgtcacggg gtctgggcaa tgcaggacgg gaggggctgc gtgagtgagt 61 acagaaggga aatgagtgag ggggcatggg atctcagaga aaatcaggga cctctgagca 121 aagtggaaag gacgaccgcc gcagctcctc gggccgtagc tcgaccccgc cttccctttt 181 ccgcagaatc ctcgccttgg ctgcagcagc gcgctgcccc cactggccgg cgtgccgtga 241 tcgatcgcag gctgcgtcag gacctcccgg cgtataaata ggggtggcag aacggcgccg 301 agccgcacac agccatccat cctccccctt ccctctctcc cctgtcctct ctctccgggc 361 tcccaccgcc gccggggagc accggccgcc aaccaatgag ttccttcagc tacgagccgt 421 actactcgac ctcctacaag cggcgctacg tggagacgcc ccgggtgcat atcagcgtgc 481 gcagcggcta cagcaccgca cgctcagctt actcaagcta ctcggcgccg gtgtcttcct 541 cgctgtccgt gcgccgcagc tactcctcca gctctggatc gttgatgccc agtctggaga 601 acctcgacct gagccaggta gccgccatca gcaacgacct caagtccatc cgcacgcagg 661 agaaggcgca gctccaggac ctcaatgacc gcttcgccag cttcatcgag cgcgtgcacg 721 agctggagca gcagaacaag gtcctggaag ccgagctgct ggtgctgcgc cagaagcact 781 ccgagccatc ccgcttccgg gcgctgtacg agcaggagat ccgcgacctg cgcctagcgg 841 cggaagatgc caccaccaac gagaagcaag cgctccgagg cgagcgcgaa gaagggctgg 901 aggagaccct gcgcaacctg caggcgcgct atgaagagga ggtgctgagc cgcgaggacg 961 ccgagggccg gctgatggaa cgccgcaaag gcgccgacga ggcggcgctc gctcgcgccg 1021 agctcgagaa gcgcatcgac agcttgatgg acgaaatctc ttttctgaag aaagtgcacg 1081 aagaggagat cgccgaactg caggcgcaga tccagtacgc gcagatctcc gtggagatgg 1141 acgtgaccaa gcccgacctt tccgccgcgc tcaaggacat ccgcgcgcag tacgagaagc 1201 tggccgccaa gaacatgcag aacgctgagg aatggttcaa gagccgcttc acggtgctga 1261 ccgagagcgc cgccaagaac accgacgccg tgcgcgccgc caaggacgag gtgtcggaga 1321 gccgtcgtct gctcaaggcc aagaccctgg aaatcgaagc atgccggggc atgaatgaag 1381 cgctggagaa gcagctgcag gagctggagg acaagcagaa cgccgacatc agcgctatgc 1441 aggtgcggca cggccagaaa cacagggggg cggggaactc gagcaagggg gggagttggt 1501 gcgcccagaa agcgaaacca ggggtggtgc 
ggctgcccag ctcttaggga tagggcttgg 1561 ctccttggcc actgtgtgga ggggtggggc tcttgagggg cgtgagtgcg ggcgccactg 1621 tagtccggga gtgactgctc cgcgtgctgc accggcgttc cgcattaaag ctgcccgacc 1681 cttgttgggt ggggagggga agacgtggga attgggcgtt gcctccgact gcagtgagat 1741 cagctctcta ctgacctgcg ttgaccacga gttactttgc agcgactatc ggatcgtcta 1801 gttaataaat agtacgagtg aactaactct caattaattc tgaaggatta ctgtgaccag 1861 catgctttat gactagtttt accaaccacc tcccttcctt tatttagtag gtagacagga 1921 aaatagtcaa cattgtttta ggtagttaac tagtgatgtt catagtaaac catttccttt 1981 tacctttttt ttttcttttt ttctttatgt gtaaaatctt ctacaacatt tctgtttaaa 2041 catctccatc ttctggggag tagaaaaaat acaattttaa aaagatctcc attttaaaac 2101 atctccgtct tctggggagt agaaattttt ttcttcttct ggggagtaga aaaaataatt 2161 tagatacata ggaaatattt catagaaaat aatttttttc ttttttttgt ttacatctgg 2221 tattttcttc tcataaagaa aggcattagt ttcctggcat gtaacccagc taaagaagag 2281 taatcagtga atgagagaca cagtttttct atcaacttag tctgtttttc tatcaactta 2341 gtctgtttgc atgcatttta tgatgatcat taaacagtat taagtaaaga aacagaagaa 2401 cagaattttc gtccatcttt tttttcatct caggcttcat gaacttgggt attttaggca 2461 tgaaggtttt tcaaaagata caggaagtta ttctaggaga gattttatca aagtgtgcac 2521 cttgatttta atcgaaacta ggcctttgca actacactac agtaaaataa tagaagggat 2581 ttatgctcgg attttttttt tgttttattt ttgtcttcaa acaggacacg atcaacaaat 2641 tagaaaatga attgaggacc acaaagagtg aaatggcacg atacctaaaa gaataccaag 2701 acctcctcaa cgtgaagatg gctttggata ttgagattgc tgcttacagg tgaaaataga 2761 ggggcaaaga cagcagccat taaaccttag gaagaaaatc agatcccatt taaagttatg 2821 ttggatcaga aaccttcaat aatagtcctt ttgaaataat gaagtgttag tttttggctt 2881 cttccaagaa gagggtattt agatatataa gaatttaacc ctgtaattag agtcctgttt 2941 ttatcttgtc attacacttt aaatctaata ggagtgattt atttatattt tttctggtct 3001 ccatcaaaag atccccaggc attaagtatt gataaatccc agccctgctc ctgcttgcct 3061 ttgtgtttag ggtactcaga gcaagttgtg aaacacaggt gttttttaac ctcaccttgc 3121 acctgcatcc ccaggaaact cttggaaggc gaggagaccc gactcagttt caccagcgtg 3181 ggaagcataa ccagtggcta ctcccagagc tcccaggtct 
ttggccgatc tgcctacggc 3241 ggtttacaga ccagctccta tctgatgtcc acccgctcct tcccgtccta ctacaccagc 3301 catgtccaag aggagcagac cgaagtggag gaaaccattg aggcgtctaa ggctgaggaa 3361 gccaaggatg agcccccctc tgaaggagaa gccgaggagg aggagaagga caaggaagag 3421 gccgaggaag aggaggcagc tgaagaggaa gaaggtatga taagaaaaaa cccctgcaac 3481 ttcaagtgta aactgggtgt ggagatttgt taggaggtgg ataagacaaa tgaagccttg 3541 ctcatttatt catatatgac attagaatca taaataaatt ttctgtttgt ttagcaaaac 3601 tttcctaagg catctactct gaatgaggtg attggtcaaa attttcattt tttaatataa 3661 tcatttaaca cagcaggttg gtgtcctaaa gaacaaaaat agataccaga cacataatga 3721 aagaaatatt gaggttaagt cttggagagg agcagagctt cccataccta gaagtgatct 3781 cattcgattt aaatatgtgt tcagtggcaa attattcatg gcaagctttg tctgttacat 3841 gtgcttttgg agagagtgga gctgggaggt tttggtagca ttctgacagt tgtgtttgca 3901 aataaaacct ttgcagacat gttttgactg gacttaccct ggatttgcat tttgtacatt 3961 ttctttttat gttaaagctg ccaaggaaga gtctgaagaa gcaaaagaag aagaagaagg 4021 aggtgaaggt gaagaaggag aggaaaccaa agaagctgaa gaggaggaga agaaagttga 4081 aggtgctggg gaggaacaag cagctaagaa gaaagattga acccccattt ccttaattat 4141 ttcaggaata attctcccga aatcaggtca accccatcac caaccaacca accagttgag 4201 ttccagattc tatgtgaatt aaaaagtcaa tatatgtata attctgagat gacttaggtt 4261 ggacattcaa tgttgtgcta tgaatttcct ctttatgcag agtatctgtt tgcttgcaga 4321 gtggctttcg gcttgctgcc agcctgtgca tggtccacgc ttatgagttc aggatctacg 4381 gcaatgtgaa tcattcagat gtttacaata aaaaacacca catgagtaaa tgaattcact 4441 aatgttaatg ttaaacttca tggaaaagta gtcctttgaa ccttcggtgg ttagcaatta 4501 aagaccctga gttatgtgca ataaatagta aataaagtta taccgaatga tgtatttttt 4561 gccgtggttg ttacctaatt aaaatacctt aaagatggca ccaatataaa gtgtgtgcca 4621 gtgaactatt gacctccaat tttttaaaaa gccgaaattt taacaattac caatactttt 4681 tt // LOCUS HSNFM 6236 bp DNA PRI 24-APR-1993 DEFINITION Human gene for neurofilament subunit M (NF-M). ACCESSION Y00067 NID g35045 KEYWORDS neurofilament subunit M. SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6236) AUTHORS Lazzarini,R.A. TITLE Direct Submission JOURNAL Submitted (29-APR-1987) Dr. Robert A. Lazzarini, Laboratory of Molecular Genetics, IRP, NINCDS, NIH, Building 36, Room 4AO1, 9000 Rockville Pike, Bethesda, Maryland 20892 REFERENCE 2 (bases 1 to 6236) AUTHORS Myers,M.W., Lazzarini,R.A., Lee,V.M., Schlaepfer,W.W. and Nelson,D.L. TITLE The human mid-size neurofilament subunit: a repeated protein sequence and the relationship of its gene to the intermediate filament gene family JOURNAL EMBO J. 6 (6), 1617-1626 (1987) MEDLINE 87275853 FEATURES Location/Qualifiers source 1..6236 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" TATA_signal 678..683 exon <742..1821 /number=1 mRNA join(<742..1821,2550..2674,4006..6038) /product="NF-M" CDS join(742..1821,2550..2674,4006..5551) /codon_start=1 /product="NF-M" /db_xref="PID:g35046" /db_xref="SWISS-PROT:P07197" /translation="MSYTLDSLGNPSAYRRVTETRSSFSRVSGSPSSGFRSQSWSRGS PSTVSSSYKRSMLAPRLAYSSAMLSSAESSLDFSQSSSLLNGGSGPGGDYKLSRSNEK EQLQGLNDRFAGYIEKVHYLEQQNKEIEAEIQALRQKQASHAQLGDAYDQEIRELRAT LEMVNHEKAQVQLDSDHLEEDIHRLKERFEEEARLRDDTEAAIRALRKDIEEASLVKV ELDKKVQSLQDEVAFLRSNHEEEVADLLAQIQASHITVERKDYLKTDISTALKEIRSQ LESHSDQNMHQAEEWFKCRYAKLTEAAEQNKEAIRSAKEEIAEYRRQLQSKSIELESV RGTKESLERQLSDIEERHNHDLSSYQDTIQQLENELRGTKWEMARHLREYQDLLNVKM ALDIEIAAYRKLLEGEETRFSTFAGSITGPLYTHRPPITISSKIQKTKVEAPKLKVQH KFVEEIIEETKVEDEKSEMEEALTAITEELAASMKEEKKEAAEEKEEEPEAEEEEVAA KKSPVKATAPEVKEEEGEKEEEEGQEEEEEEDEGAKSDQAEEGGSEKEGSSEKEEGEQ EEGETEAEAEGEEAEAKEEKKVEEKSEEVATKEELVADAKVEKPEKAKSPVPKSPVEE KGKSPVPKSPVEEKGKSPVPKSPVEEKGKSPVPKSPVEEKGKSPVSKSPVEEKAKSPV PKSPVEEAKSKAEVGKGEQKEEEEKEVKEAPKEEKVEKKEEKPKDVPEKKKAESPVKE EAVAEVVTITKSVKVHLEKETKEEGKPLQQEKEKEKAGGEGGSEEEGSDKGAKGSRKE DIAVNGEVEGKEEVEQETKEKGSGREEEKGVVTNGLDLSPADEKKGGDKSEEKVVVTK 
TVEKITSEGGDGATKYITKSVTVTQKVEEHEETFEEKLVSTKKVEKVTSHAIVKEVTQ SD" intron 1822..2549 /number=1 exon 2550..2674 /number=2 intron 2675..4005 /number=2 exon 4006..6038 /number=3 polyA_signal 5849..5854 polyA_signal 6015..6020 polyA_site 6038 /note="polyA site" BASE COUNT 1851 a 1317 c 1779 g 1289 t ORIGIN 1 cagctgcttt aagacaaggg gtgggggaag gggagggagg caagaaaaga tgagggtggg 61 ggaggggaaa agagggaatg caaggggaag gagggaggag acggggagaa ggaaagattg 121 gaagaaaagg atctccgagg aaggggctga gagaagggca gggtgaactg gactaaaggc 181 cagagtagga aggagaagag gggccaaaaa agaaggggat gaaattaagc acagaagatg 241 ggtaaagaaa aaagtatcag ggaaagggca aaataagaga aagccttgag gataagaggg 301 tagaaggcta aagaacaagg ggaccacggg gtcggggaag cgctgcctga acggcgggac 361 agtgacaaaa gaaagggcgc tggcgatatt ccgaccaagg gaaacgcaat cgggaggtga 421 gaaatcggga ggtgagaaat ggaaagaagg cgaatccgcg gctacaagta gcctgggact 481 gaaaggggac ctgggggagg ggctgggccc agggcagaaa agtccaggtt cccatgcggc 541 ctgggcccac gtggagcggg cgctgaatca ccgttcagcc gcccccctcc cctcctcccc 601 gaccggtgcc cgcagtcccc gcctcctcgg ccgccgcctc cacggggcgg gccctggccc 661 gggaccagcg ccgcggctat aaatgggctg cggcgaggcc ggcagaacgc tgtgacagcc 721 acacgcccca aggcctccaa gatgagctac acgttggact cgctgggcaa cccgtccgcc 781 taccggcggg taaccgagac ccgctcgagc ttcagccgcg tcagcggctc cccgtccagt 841 ggcttccgct cgcagtcgtg gtcccgcggc tcgcccagca ccgtgtcctc ctcctataag 901 cgcagcatgc tcgccccgcg cctcgcttac agctcggcca tgctcagctc cgccgagagc 961 agccttgact tcagccagtc ctcgtccctg ctcaacggcg gctccggacc cggcggcgac 1021 tacaagctgt cccgctccaa cgagaaggag cagctgcagg ggctgaacga ccgctttgcc 1081 ggctacatag agaaggtgca ctacctggag cagcagaata aggagattga ggcggagatc 1141 caggcgctgc ggcagaagca ggcctcgcac gcccagctgg gcgacgcgta cgaccaggag 1201 atccgcgagc tgcgcgccac cctggagatg gtgaaccacg agaaggctca ggtgcagctg 1261 gactcggacc acctggagga agacatccac cggctcaagg agcgctttga ggaggaggcg 1321 cggttgcggg acgacactga ggcggccatc cgggcgctgc gcaaagacat cgaggaggcg 1381 tcgctggtca aggtggagct ggacaagaag gtgcagtcgc tgcaggatga ggtggccttc 1441 ctgcggagca 
accacgagga ggaggtggcc gaccttctgg cccagatcca ggcatcgcac 1501 atcacggtgg agcgcaaaga ctacctgaag acagacatct cgacggcgct gaaggaaatc 1561 cgctcccagc tcgaaagcca ctcagaccag aatatgcacc aggccgaaga gtggttcaaa 1621 tgccgctacg ccaagctcac cgaggcggcc gagcagaaca aggaggccat ccgctccgcc 1681 aaggaagaga tcgccgagta ccggcgccag ctgcagtcca agagcatcga gctagagtcg 1741 gtgcgcggca ccaaggagtc cctggagcgg cagctcagcg acatcgagga gcgccacaac 1801 cacgacctca gcagctacca ggtaggaacc gcggcctcgg ccagcctcgg ccacggccac 1861 gccgcgcgcc cccgacactt gggctcgtgc ccaggcgccc tctccgccgc gctccctggt 1921 ggccgctcgc tagagcacgc gcgccgcaga cctagggtat ttgcggatca gcgtcctcgc 1981 ccatctcatc ctccacactc cgcccccacc cacctgcccc agctgctaag ggtcttgacc 2041 tttttcagaa acgtgcatct tttcccagtt ctaattttgc acgcttgcac gtttaaagca 2101 ggagggatga attcggtagt ggataaatca gcaactttag gatagcttat gcagaaacgc 2161 gtgtattctc tacttttccg gcagtgatcg gaagagctct caaaattggc ttcagccaaa 2221 gggctcagat gggaatggcc aggtcagcca tggagtttcc ccatgcatgt ttgtgtcctg 2281 ttgagacgtg ttctaagtcc actggtctcc gtgcgtgatg tgcccaggaa gtgtcctatt 2341 gtcttactga tcttgtatct tcatttgaga atcgcttaga tttaaaagaa aaagggggtg 2401 ggacgggggg ctgggagtca ggtgtcagcg aggtttgcag aagtggaggg agacgggagg 2461 aggccagggg gaaggggtag caagtggttt gcgaaggaag ttgctgtttg caaggatgag 2521 tctggggaga ttctctgtgt ctgtttcagg acaccatcca gcagctggaa aatgagcttc 2581 ggggcacaaa gtgggaaatg gctcgtcatt tgcgcgaata ccaggacctc ctcaacgtca 2641 agatggctct ggatatagaa atcgctgcgt acaggtacga tgcttactac gtgcgtggcc 2701 ggaacactaa ccgcagtgca gaggctgttc cggcagagct tccaccactt aagttaaagc 2761 aggcagggtg caggcatcaa ctcagcacct ggttatcttg cttacttaaa aagaaattat 2821 tctaaagaat tgcaagtgta gttttatctc tttttatgca gctttaaaag aatgaatact 2881 agtagaaaca aaaggttttt gaattacaca aaggaggtgc agattaatct caatgcacat 2941 gcttaaactt tttatggaaa aatgttttca aatgctggaa gcatgaacag agttttggtt 3001 tctaatattt catctagtgg tttcagcttt tcaaatgtat aatgtcaagg acaaacacca 3061 ggacgttcta tttctctgtt tctctgttat atagcttact attgccatca tctggctgag 3121 aatagatata gaatgataga 
atatagatat agttctttta tatatgtaga taatttatat 3181 gtattatatt ttatgctaga ctgtagtata aattatatca atatatcatg tatgatatta 3241 atctagatct atagatacac atatgtgcat atgcatataa atctagatat atagacacaa 3301 atatatatga tcgttttata gatagtgaga taggttatag gtctattaac tgaagtgacc 3361 ttgctgttga gtaagcgcaa aggacaaaat cgttgattaa aatttttctg ctaccaataa 3421 ggtagttata atataacgag aataaattgc atttacagag ctatctctct tttcaggaaa 3481 gctgaataac tacattaaat agacacttta tgataaaaat tatcaacaaa tttataactc 3541 gatacacctg aaaatctaaa cgtttaagaa agtgactact ctcagaaagg ctgtttggct 3601 ttggagtttg ggggcgtttt gtttatggct tcttgttttt ttgtttttgt ttttgttttt 3661 tgctatttgg ccactaacaa gtttttcagc atattcatgt tgtacctaat ggatctctac 3721 tgcagggcca agacttagta gctgggtgtg gttagtggac tattgggcaa ggttagtcat 3781 tgtagggggc aactgtctgg cagtccagga gaatctttct ctgtcactga gtataatgta 3841 atatgccagt aagtgatagc aggtattata gtgaattcat agaatattct acttatgtaa 3901 ttctatttat tcaaaggtag ctaccacaat acccagaatg taatgaagct cagaaggcct 3961 agtgaaattt ttactatgtc ttatgttctt ggattttctc cttagaaaac tcctggaggg 4021 tgaagagact agatttagca catttgcagg aagcatcact gggccactgt atacacaccg 4081 acccccaatc acaatatcca gtaagattca gaaaaccaag gtggaagctc ccaagcttaa 4141 ggtccaacac aaatttgtcg aggagatcat agaggaaacc aaagtggagg atgagaagtc 4201 agaaatggaa gaggccctga cagccattac agaggaattg gccgcttcca tgaaggaaga 4261 gaagaaagaa gcagcagaag aaaaggaaga ggaacccgaa gctgaagaag aagaagtagc 4321 tgccaaaaag tctccagtga aagcaactgc acctgaagtt aaagaagagg aaggggaaaa 4381 ggaggaagaa gaaggccagg aagaagagga ggaagaagat gagggagcta agtcagacca 4441 agccgaagag ggaggatccg agaaggaagg ctctagtgaa aaagaggaag gtgagcagga 4501 agaaggagaa acagaagctg aagctgaagg agaggaagcc gaagctaaag aggaaaagaa 4561 agtggaggaa aagagtgagg aagtggctac caaggaggag ctggtggcag atgccaaggt 4621 ggaaaagcca gaaaaagcca agtctcctgt gccaaaatca ccagtggaag agaaaggcaa 4681 gtctcctgtg cccaagtcac cagtggaaga gaaaggcaag tctcctgtgc ccaagtcacc 4741 agtggaagag aaaggcaagt ctcctgtgcc gaaatcacca gtggaagaga aaggcaagtc 4801 tcctgtgtca aaatcaccag tggaagagaa 
agccaaatct cctgtgccaa aatcaccagt 4861 ggaagaggca aagtcaaaag cagaagtggg gaaaggtgaa cagaaagagg aagaagaaaa 4921 ggaagtcaag gaagctccca aggaagagaa ggtagagaaa aaggaagaga aaccaaagga 4981 tgtgccagag aagaagaaag ctgagtcccc tgtaaaggag gaagctgtgg cagaggtggt 5041 caccatcacc aaatcggtaa aggtgcactt ggagaaagag accaaagaag aggggaagcc 5101 actgcagcag gagaaagaga aggagaaagc gggaggagag ggaggaagtg aggaggaagg 5161 gagtgataaa ggtgccaagg gatccaggaa ggaagacata gctgtcaatg gggaggtaga 5221 aggaaaagag gaggtagagc aggagaccaa ggaaaaaggc agtgggaggg aagaggagaa 5281 aggcgttgtc accaatggcc tagacttgag cccagcagat gaaaagaagg ggggtgataa 5341 aagtgaggag aaagtggtgg tgaccaaaac ggtagaaaaa atcaccagtg aggggggaga 5401 tggtgctacc aaatacatca ctaaatctgt aaccgtcact caaaaggttg aagagcatga 5461 agagaccttt gaggagaaac tagtgtctac taaaaaggta gaaaaagtca cttcacacgc 5521 catagtaaag gaagtcaccc agagtgacta agatttgagt ccattgcaaa aggttaagcc 5581 atatgacaat ttcaaaatgc atgtgattgg cagcttcaaa acagaacggg ttctcccatg 5641 ggggctccag acattgtatt ttactttgtg caatatgagg ggactgcatg caagctcagg 5701 gtgctccctc ctcagtcttt gggggattca aatgcatgat attgtatgta cctgggaaat 5761 ttgccgattt cctaagctgt tggaaggggg tcacttaagg ggggatgtct tgagatgtat 5821 tatgcaaagt accaactgag ccaaaaacaa taaatgaaac acagaactca gccttaagaa 5881 agctatatat gaataattat gtttacctca ctggtgcatt taaaatggac ttttgttcat 5941 gggagaacct cgttgacatg cacagtttgc aatcttatgt tgatcgatgt taaacgtcac 6001 agcagtactt gctcaataaa ggtcatattg gaaacatagt caattgctga gtcttatgtc 6061 atttctcttt ttctaatttt tatttattta tttttattta gagatggggt cttgctatgt 6121 ggcctcaagc agtcctccca cctcagccac ccaaagtgct gggattacag gcatgagcca 6181 ccacgcccag cctgttatgc catttcaaag tgaaatctcc actacctgaa gcttgc // LOCUS HSNGALGEN 5869 bp DNA PRI 27-OCT-1997 DEFINITION H.sapiens NGAL gene. ACCESSION X99133 NID g1657330 KEYWORDS NGAL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5869) AUTHORS Cowland,J.B. and Borregaard,N. 
TITLE Molecular characterization and pattern of tissue expression of the gene for neutrophil gelatinase-associated lipocalin from humans JOURNAL Genomics 45 (1), 17-23 (1997) MEDLINE 97480711 REFERENCE 2 (bases 1 to 5869) AUTHORS Cowland,J.B. TITLE Direct Submission JOURNAL Submitted (06-JUL-1996) J.B. Cowland, Granulocyte Research Laboratory, Dept of Hematology, National Univ. Hosp., Rigshospitalet L-4041, 9 Blegdamsvej, DK-2100 Copenhagen, DENMARK FEATURES Location/Qualifiers source 1..5869 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" TATA_signal 1663..1669 exon 1696..1902 /number=1 misc_feature 1696 /note="CAP-site" prim_transcript 1696..5691 gene 1765..5358 /gene="NGAL" CDS join(1765..1902,2476..2612,3877..3956,4145..4264, 4422..4523,5339..5358) /gene="NGAL" /codon_start=1 /db_xref="PID:e256056" /db_xref="PID:g1657331" /db_xref="SWISS-PROT:P80188" /translation="MPLGLLWLGLALLGALHAQAQDSTSDLIPAPPLSKVPLQQNFQD NQFQGKWYVVGLAGNAILREDKDPQKMYATIYELKEDKSYNVTSVLFRKKKCDYWIRT FVPGCQPGEFTLGNIKSYPGLTSYLVRVVSTNYNQHAMVFFKKVSQNREYFKITLYGR TKELTSELKENFIRFSKYLGLPENHIVFPVPIDQCIDG" intron 1903..2475 /gene="NGAL" /number=1 exon 2476..2612 /gene="NGAL" /number=2 intron 2613..3876 /gene="NGAL" /number=2 exon 3877..3956 /gene="NGAL" /number=3 intron 3957..4144 /gene="NGAL" /number=3 exon 4145..4264 /gene="NGAL" /number=4 intron 4265..4421 /gene="NGAL" /number=4 exon 4422..4523 /gene="NGAL" /number=5 intron 4524..5338 /gene="NGAL" /number=5 exon 5339..5365 /number=6 intron 5366..5549 /number=6 exon 5550..5691 /number=7 polyA_signal 5673..5678 BASE COUNT 1207 a 1782 c 1766 g 1114 t ORIGIN 1 ctcgaggatc tcggctcact gcaacctccg cctcccaggt tcaagctgtt cttctgcctc 61 agcctcccga gtagctggga ttacaggcgc ctgccaccat gccctgctaa tttttgtatt 121 tttagtagag atggggtttc accgtgttgg ccagactggt ctcgaactcc tgacctcgtg 181 atccacccgc ctcagcctcc caaatgctgg gattacagat gtgagccacc gcacccggcc 241 tggcagagga tactttttaa ggtcaaagac agtagcagag gtggagttcc tgggaacagg 301 gtcatgaggg gaagaggggg 
ttcggaggga gcgagtagcc actggctacc tctagaaagg 361 gaaggctttg gtgcaacatc gttcccctgc agttttactc atctttgctt cctgcccttt 421 catcatccaa tcgggcaggc aggacagggc ctgagggggc agggatccag tgggtgcctc 481 tctagactaa ccccagctca ggactcccag agccccttcc ctgaggccct gctgccccca 541 agcccagatt ggggatccca agcagcacgt aggcagagcc agtgaggtcc ccgttagtcc 601 cattgaaagc tctaaaacca gcgaaccctc agtccagcct caggtcaggc atccaggacg 661 gcctcagccc tcatgggtga gccatctctg cggacactgc acagggccta cgatccatcg 721 ctgcctcccg aggatgccag ccaggccccc gttgagataa ctgcttccct gctggacaag 781 gctgggacca gccatctcgg tgacagttcc agaacccctg gcctgggctg ctgggttcaa 841 tggaaaaagg ctgtgactag agtcaggggg atggtctcag tgacctcaag gataaggcca 901 gatccttgca ctgtcagtga cccaaagcaa caggtgtcca gagcagcagt gtggcgcctt 961 cacgccccca cacatcagcc caactcaccc aggacaggga ctgtagcctc agcactcaac 1021 ccatgtgccc tgtgtggggt ctcttcccac tgcactcaca ggagaggaag ggtccctcag 1081 gggtccactg gggtcccctc ctgcaaatgg ggcaaggaga ggggcaaggg gctgtctcaa 1141 ggcccctgga gcacatgcag gtcctggact ggggctcctg ggagggccat gattctgggc 1201 tccatgagtt cagagcagac gccttgtttt tccttgtcca ctgtcagcca ccccaccctt 1261 ccctgaccct taaaagaacc aggaaacagc acatgatctg ttggaaggag gcattcattc 1321 tttcctttct gtgggtgtgg ggaggggacc acagggcaca taccccaccc tgggatccag 1381 ctgagcaggg gggtcagaga tgacagctct tccggctcac aggccaccgg cccacataca 1441 gggcaatcag aagaaagaaa cagcacaagg aaggcacaga gggagtcgtt gtccctgcca 1501 gaggtgcagc actccgggaa tgtccctcac tctccccgtc cctctgtctt gcccaatcct 1561 gaccaggtgc agaaatcttg ccaagtgttt ccgcaggagt tgctggcaat tgcctcacat 1621 tcctggcctt ggcaaagaat gaatcaaccc accctagatc ccataaatag ggccacccag 1681 gtgagcctct cactcgccac ctcctcttcc acccctgcca ggcccagcag ccaccacagc 1741 gcctgcttcc tcggccctga aatcatgccc ctaggtctcc tgtggctggg cctagccctg 1801 ttgggggctc tgcatgccca ggcccaggac tccacctcag acctgatccc agccccacct 1861 ctgagcaagg tccctctgca gcagaacttc caggacaacc aagtaagggg ccaagagggg 1921 cacctgcagg cagggcctgg ggaagagtgg gagcagaggg gaggagaggt gaagagactc 1981 aggaagagcg ttgggcagga cttaggagtc cagggtccag 
gtttcagctc actctgtgcc 2041 accagggtcc cctggtggaa accatgcccc ttccccccat ccccaccccc tctcagcctg 2101 aacagactcc cccaggtcca catcccctct cccataaccc ccattgtcca aagaaggtgg 2161 gagcactttt agtccccctg cacagatgag gaaactgagg ctcaggaagg cccaccagcc 2221 acatgcctcc tccagtgagg aggtcaccct cctccctgcc agactcagaa ccgcctcttc 2281 ccccaggact cccttctgga ctgatggcct cctgctcctg ccccttcacc agtgcaggcc 2341 cagcctgggc cctgctgccc agctagaggg gctcatggtt ccaagctggg cggcccagag 2401 gtgccacagg gacagagctg gaggggtggc tcctagggcc attcctgggt tgtgcctctt 2461 atcagtccct tgcagttcca ggggaagtgg tatgtggtag gcctggcagg gaatgcaatt 2521 ctcagagaag acaaagaccc gcaaaagatg tatgccacca tctatgagct gaaagaagac 2581 aagagctaca atgtcacctc cgtcctgttt aggtgagggc cgacatctcc tgggggtgtg 2641 agagtcagac tgacgtcaca ggcaagggat ggccaaagct gagggatcct gtcgttcacc 2701 tcgctgttct gcccggaatt catctgtgtt catccttcct ctgttcctta gagcaacgtt 2761 tatagcacat ttccatgcag acacacagac agtggtgggg atggacatgc acagtcgtta 2821 gaaaacaaga cggagagagg aggggtgcct gggagcggga ggaggggaca ctggatccag 2881 cctggaaccc cacccagtgc cttcatggaa ggcttccagg gaggtggcct taaaagagcc 2941 aacctgcttc aaaaggaaat gtggggtgtt cccggcaggg gctggagtca gagagagccc 3001 ccccttcagg aaggagcaag ccatcgcagg gtcaccctga gcagagctgc tgagcagcct 3061 ggaggggcag gtggccacgc tagcacctag cacggtcctc aggccccgcc ccagcggatc 3121 tgctgcggag tggcttagag cagggctctt gggccgcagg gtggggagac ttggtggggt 3181 gcagcctagg gggtcgggag accagcgaaa gtgaagcggg gccgtcacag gtgtgagaga 3241 acaggcgcag ggtgaagagg cagggagcca gggatcagcc gcccccagtg ggtttctgac 3301 tctggcagct gagtggattg ggattggggc atttgtggag caaggagcag aatacagaca 3361 ggttggggag ctcagccctg gggtgccagg ggatgggaag tgggaggact caaggatggg 3421 gtcaggtttg acccgagagc taggggaacg gctggcatgg agcagactgg aagtaccgag 3481 gtggatcccc gggagagggt ataggaaggg aagcagcaag ctgagtgcag gggagaaatg 3541 cagggtttcc tgtgtgttgg gtggcggcgg gggtgaaagc cacccaggga ggcagccaaa 3601 ggaaagaagg acatcgggtg ctggagggtc tgagtggggt ccaggggccc caggcaggcc 3661 aggagggaca gcctggtgtc agctcaggga gaaggcccag gcccatctcg 
gctgggtggg 3721 gtagggcccc tccaggtagt gggggatgag ctgtcacggg ttgggccgga ctgagagcaa 3781 cagaaccctg ctgctgccct ggccccacct tgtccagcac aggaggccca agcctgggtt 3841 gtctcccctc tcacccaccc atctctccct cccaaggaaa aagaagtgtg actactggat 3901 caggactttt gttccaggtt gccagcccgg cgagttcacg ctgggcaaca ttaagagtga 3961 gtcttgagtg aggtggggca ctgagttggg gctccgggga gctgggtagg ggcacagacc 4021 ttcctgcccc tccacacaga tgtgttgtat ggggagaagc ccacgttgat gggctgggga 4081 gggaggggac agctccctcc tcccatccag ggcagggctg acccctcacc gtccacgcct 4141 gcaggttacc ctggattaac gagttacctc gtccgagtgg tgagcaccaa ctacaaccag 4201 catgctatgg tgttcttcaa gaaagtttct caaaacaggg agtacttcaa gatcaccctc 4261 tacggtgggt cctctcccat cccctcgggg actggctcct gatcacactt agtgggaggg 4321 gaggccggtc ccccatgagg aagggatctg aggcctcatc tactcattca acgatattta 4381 tgtggtgtct gccggccact cactggccat cttggtcaca gggagaacca aggagctgac 4441 ttcggaacta aaggagaact tcatccgctt ctccaaatat ctgggcctcc ctgaaaacca 4501 catcgtcttc cctgtcccaa tcggtaatgg ccagtctgga tgaggggacg gggacatggg 4561 gactgttcag gcaggatgct tccctaccag ggatcaggga gaggagggac tccgtcctca 4621 gcttcagtca ctggagcagt ggatggtcca ggagctcctt ggaagccact ctgggcccag 4681 gaagactgtg ccccacccca gggtctatgg gactcccagg gacccaggcc gcaagtgctc 4741 tttcctggca gtttagcccg ggtctgccca gacaaggatt tcaggcccag gcctgagtat 4801 ccatttctca gtctcactgg cctgacacct ctggccaccc tcccaggccc ccttgttctg 4861 ggccatctcc cccgaccctc ccaggcctcg tcaccctggg ttttgctgtc ctggctgtcc 4921 tctctcccct ggggacttgc tcaccactga cttgggagct gtccttgact ccagggagcc 4981 tggcttgggc aggaggctcc agccaggcca ttcagagagc cactggcctc ccccaggctg 5041 agagactgcc tggactggta aacaggcagg agacctgggt gcccgaggag cctgggagct 5101 gggcctcact cagggcagcc cctccccagg cctttctccc acatcccctg ccctgccatc 5161 cacccctctg ttgccccatc tctgaaagga acccccatat cttctgcagc tgggccaggt 5221 ggggcagggg ctgcccaggg gcagtgcaga ggacctggca gtcagggatc acacacacac 5281 actcatacac gcacacacac acacagctgc ctgttctgac ggactttctc cctaacagac 5341 cagtgtatcg acggctgagt gcacagtgag tgtggctggg cggctgcgag ggggcttgtg 
5401 ggaggccagg gtgcagtggg ctgggggtct tgggcctgcc tttgctcatc cccctgcccc 5461 ccagcactgc tgctgtcttt attctgctgt ccccatctcg ggtgcctccc atttccccac 5521 ccatcaccct catatccacc tctgtccagg gtgccgccag ctgccgcacc agcccgaaca 5581 ccattgaggg agctgggaga ccctccccac agtgccaccc atgcagctgc tccccaggcc 5641 accccgctga tggagcccca ccttgtctgc taaataaaca tgtgccctca ggcctctgag 5701 tctacactgt ttgacccctg ggccttcgag gaaggggagg ggcgggaggc tcccactggc 5761 atcactctca gggtctgcac ccccaggatg gagcctagcg aacccagcct gggtgttagg 5821 gctgcagagt gaagacacaa gcccctggtc atcaccagca gctttgtgg // LOCUS HSODCG 9043 bp DNA PRI 24-APR-1993 DEFINITION Human gene for ornithine decarboxylase ODC (EC 4.1.1.17). ACCESSION X16277 NID g35137 KEYWORDS Alu repetitive sequence; ornithine decarboxylase; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9043) AUTHORS Steeg van,H. TITLE Direct Submission JOURNAL Submitted (21-AUG-1989) Steeg van H., Institute of Public Health and Environmental Protection, P O Box 1, 3720 BA Bulthoven, Netherland REFERENCE 2 (bases 1 to 9043) AUTHORS van Steeg,H., van Oostrom,C.T., Martens,J.W., van Kreyl,C., Schepens,J. and Wieringa,B. TITLE Nucleotide sequence of the human ornithine decarboxylase gene JOURNAL Nucleic Acids Res. 17 (21), 8855-8856 (1989) MEDLINE 90067851 FEATURES Location/Qualifiers source 1..9043 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="CML cells." 
TATA_signal 763..767 mRNA join(795..1001,3858..3967,4073..4191,4475..4648, 4855..5027,5286..5420,5551..5632,5809..5892,6948..7110, 7193..7305,7399..7613,8254..8740) prim_transcript 795..8740 exon 795..1001 /number=1 intron 1002..3857 /number=1 misc_feature 2682..2980 /note="Alu repeat I" misc_feature 3211..3477 /note="Alu repeat II" exon 3858..3967 /number=2 intron 3968..4072 /number=2 exon 4073..4191 /number=3 CDS join(4090..4191,4475..4648,4855..5027,5286..5420, 5551..5632,5809..5892,6948..7110,7193..7305,7399..7613, 8254..8398) /EC_number="4.1.1.17" /codon_start=1 /product="ornithine decarboxylase (ODC)" /db_xref="PID:g296667" /db_xref="SWISS-PROT:P11926" /translation="MNNFGNEEFDCHFLDEGFTAKDILDQKINEVSSSDDKDAFYVAD LGDILKKHLRWLKALPRVTPFYAVKCNDSKAIVKTLAATGTGFDCASKTEIQLVQSLG VPPERIIYANPCKQVSQIKYAANNGVQMMTFDSEVELMKVARAHPKAKLVLRIATDDS KAVCRLSVKFGATLRTSRLLLERAKELNIDVVGVSFHVGSGCTDPETFVQAISDARCV FDMGAEVGFSMYLLDIGGGFPGSEDVKLKFEEITGVINPALDKYFPSDSGVRIIAEPG RYYVASAFTLAVNIIAKKIVLKEQTGSDDEDESSEQTFMYYVNDGVYGSFNCILYDHA HVKPLLQKRPKPDEKYYSSSIWGPTCDGLDRIVERCDLPEMHVGDWMLFENMGAYTVA AASTFNGFQRPTIYYVMSGPAWQLMQQFQNPDFPPEVEEQDASTLPVSCAWESGMKRH RAACASASINV" intron 4192..4474 /number=3 exon 4475..4648 /number=4 intron 4649..4854 /number=4 exon 4855..5027 /number=5 intron 5028..5285 /number=5 exon 5286..5420 /number=6 intron 5421..5550 /number=6 exon 5551..5632 /number=7 intron 5633..5808 /number=7 exon 5809..5892 /number=8 intron 5893..6947 /number=8 misc_feature 6123..6410 /note="Alu repeat III" exon 6948..7110 /number=9 intron 7111..7192 /number=9 exon 7193..7305 /number=10 intron 7306..7398 /number=10 exon 7399..7613 /number=11 intron 7614..8253 /number=11 exon 8254..8740 /number=12 BASE COUNT 2280 a 1970 c 2372 g 2421 t ORIGIN 1 ggatccgggt cccctcacgc tcctggctga gtccctggct tcacagggga aactacctcc 61 gcaggccagg acccatctag ttacaggata cctcgatgtt acaaagacga ggcttccagc 121 gcgggggcgt ggaggcggct gccagccctg cccgcagcgt gctggcgacc cccgggacgc 181 cccttccctc ccgcgcctct gctccctagc tggtgggagc 
agagcgcacc gggatcactt 241 ccaggtccct tgcaccggag gaatgggcgg cagcagggtc cggagtcggc ccggcggggc 301 ccacgtggcc agcacatcgg tcctccgctc gcgatttccc ttttccgctc tcgggcacga 361 ggtactgaac gccaggtgga agcacagctg tgcagctaca ggctctgccg ttcagctgcc 421 gcgggccggg gccggggcct gcggcgtcgt gcgcgtgcgc ggaccagttc caggcgggcg 481 agaccgccgc agggcggggc ggggcgaggc ggccgcaggg cggggagggc ggggagaggc 541 ggccgcaggg cggggagggc ggggcgcgaa gccgggggcg ggggccacgc gtggggcagg 601 cggtgctcgg ctcggctgac gtcggcccgc cggcgcccca ccagctccgc gcgggcccgg 661 gttggccacc gccgggcccc cgcccctccc ccggccgtgt cccggccgga accgatcgtg 721 gctggtttga gctggtgcgt ctccatggcg acccgccggt gctataagta gggagcggcg 781 tgccgtgggg ctttgtcagt ccctcctgta gccgccgccg ccgccgcccg ccgcccctct 841 gccagcagct ccggcgccac ctcgggccgg cgtctccggc gggcgggagc caggcgctga 901 cgggcgcggc gggggcggcc gagcgctcct gcggctgcga ctcaggctcc ggcgtctgcg 961 cttccccatg gggctggcct gcggcgcctg ggcgctctga ggtgagggac tccccggccg 1021 cggaggaagg gagggagcga gggcgggagc cggggcgggc tgcgggcccc gggccccggg 1081 cacgtgtgcg gcgcgcctcg ccggcctgca gagacacgtg gtcgccgagc gggccacgac 1141 cttgaggcgc cgcttcctcc cggcccgggg ttctcccgcg gctggataag ggtgatccgg 1201 gcgcctcgtt ctgcccccgt cttcacagct cggggctgga ggggcctagg ggagacccac 1261 ccggagaccc tgcggccccg cgccggcctc tttcccaacc cttcggcggc cgcgcgctgg 1321 ccggggagcc gttggggagg ccctggcggc cgcgcagcag gtgcaggggc gcagagcctg 1381 ggctcgcctt ggtacagacg agcggccccg gccttggcgc cttcagtttc cttccagttt 1441 ttattttcgc tgtgtctaca gagcagatga caccaatttg gaaacccgcg agagtgggta 1501 gagctaagat agtcttgctg tagtagctgt gatattagat gctcggccat gacttagagg 1561 tgtttattta aggactgtga atgactcggt gatttcggaa aagcttggct tagatgaacg 1621 gacatacaca ggggagacag ccctaaggtt tgcagaaaag gctgattgtg ctgtttgcga 1681 agtcgaaata attggtgaaa gtgtagaagg cagaacctct caggaatgtc tggggaggac 1741 aaagaatgtg ttggctgact ttgtttaaac ataaaattgg gcagacttta attgatttgt 1801 gaaatttttt tcaaagtttg tttgaattag cccctatctc ttctaacatt atcctcttgt 1861 gctaattgat tgaccatttt aaataactta gctgttacag aaagaccgaa aggtgttctt 
1921 cagtaaaata tattcaagta agttacttaa gtaacgcctt aaaagataca gaaaagcaaa 1981 aaagtattgg cgtattaaaa agaaatcaaa actttccaag tttaggcctg aacattgcct 2041 taaaaatatt taataaggcc tcaaatgacc cagtccgaga ctgcatgagc ctatttatta 2101 ttaaattgta aatattcttc atataaacaa aaatatataa ccatgtctgt aacaaaaatg 2161 gttttgctag cgttgttact ctcttccctt ctccgagggg tgatttaggc aacttcggag 2221 gttgacaatg ccaagcagtc acaatagata gagctttaaa gcaaattcta tgcatgggtt 2281 tggatttatg acaggcccgt caccctgggc ctgtcatagt accccatgcc agagcaaact 2341 gtgtccccga accattgcct ggcctctgtg cccgtaggct gctggcactg aagtgggttg 2401 cacagtggaa aagaagaaag ctctacctgg cagaaatttt taaaggttaa aataaataat 2461 tttaagaaag ctggttcaca aggtgccaca tttgatgaaa gcaaaataca gtggctttta 2521 ttgttactag agtgatgttc ttgcttgttt ttcttttttg gtgaagttag ccccaaatta 2581 ttctcatagc taagcaaata cgagagtgac tgtaaggaca gttggcattc ccggaattgc 2641 taaacttggt aggcaacgct ggtttaagaa tactgagttc tagccgggcg tggtggctca 2701 cgcctgtaat cccaacactt tgggaggctg aggcaggcgg atcacctgag gtcgggagtt 2761 ggagaccagc ctgactaaca tggagaaacg ccatctccac taaaaatata aaattagcca 2821 ggccccgggt gtggtggcac atgccggtaa tcccagctac tcgggagact gaggcaggag 2881 aatcgcttga acccaggagg cggaggttga ggtgagccga gatcatgcca ttgcactcca 2941 gcctgggcaa caagagtaaa actctgtctc aaaaaaaaaa aaaaaaaaat actgaattct 3001 gatcaggtaa cagcaactgt aatacaatgt gataagttga cttgaagatt acagttttta 3061 agaagtatat acccagctaa tacatgaaaa ttaactcgta aaatctcaaa tgctccagac 3121 atttccatga tgcctgttgg tcagtaaaaa tcattctaag acttagtgga agtaggaaat 3181 gtttgtatgg ctgtgtataa aggctataat gtaatcccag cactttggaa gaccgaggcg 3241 ggtggatcac ctggggtcag gagtttgaga cccacctgga caacgtggtg aaatcctgtc 3301 tctactaaaa acacaaaaat tagccgggca tggtggcagg cgcctgtaat cccagctgct 3361 ggggaggctg aggcaggaga atcgcttgaa cccgggaggc agaggttgca gtgagccaag 3421 attgcaccgc tgcactccag cctgggtgac agcgtgagac tctgtctcaa aaaaaataaa 3481 aaagtctata atgctatttt aagtttctaa ggaactgaaa ctgctctgaa ataaatcaga 3541 ccattataag acttttttcc atatcagtga gctaagtgca gataagcttc tgaaacttgc 3601 
atgctagatt tttttggtac aaatatttga aatgcttagt gtgctgcctt ggaaaaacct 3661 ggtatttttt gttgtgtcct tatactgcca aggtttatgg aatcatgtac cttatgccta 3721 gtaataatta ggatgaccag gccagtgagt ggttcatatc cggggcatga ttagctctgc 3781 gtgtgctcag ccagtgcccc atcttcaact cgatgtgttc ctaaggtaga cagcaaattc 3841 cctattttat ttctcagatt gtcactgctg ttccaagggc acacgcagag ggatttggaa 3901 ttcctggaga gttgcctttg tgagaagctg gaaatatttc tttcaattcc atctcttagt 3961 tttccatgta agtattcagt ttacatttat gttgcaggtt aatcttaaga attgtattgc 4021 taaggcttct aagtgaattt ctccactcta tttgcatttt gttgcatttc agaggaacat 4081 caagaaatca tgaacaactt tggtaatgaa gagtttgact gccacttcct cgatgaaggt 4141 tttactgcca aggacattct ggaccagaaa attaatgaag tttcttcttc tgtaagtata 4201 tgaggcccat gctggcagtg cagctgagag tgccaggcaa gtggaaaact ttggcaaggt 4261 ctaaggaaga gcaatgaggc ttacatgtct tgttatggaa tgtagaaatt aattcactgg 4321 tggtaaatta atagtgataa tggtgatact catatcagtg gctagactca aaagagcagg 4381 attcattgtg actgatggga atgaaggtcg ctggctattg gtgtggtgtg tggtgaggct 4441 gctagtgagt cacctgtgac cactcttgtt tcaggatgat aaggatgcct tctatgtggc 4501 agacctggga gacattctaa agaaacatct gaggtggtta aaagctctcc ctcgtgtcac 4561 ccccttttat gcagtcaaat gtaatgatag caaagccatc gtgaagaccc ttgctgctac 4621 cgggacagga tttgactgtg ctagcaaggt aagcgatagc agcaggcctc aaaagcgttg 4681 tataaaatgg gcctggtatt ccccacgagg cagatacaag ttgtgttttt tgggcaataa 4741 atgctcacta aaggcaaatg gggcgggggg gtacatgaca acttcccatg cttttctgtt 4801 tattccacgt gttaagccac atatggatag catgacacca ctcttctttt tcagactgaa 4861 atacagttgg tgcagagtct gggggtgcct ccagagagga ttatctatgc aaatccttgt 4921 aaacaagtat ctcaaattaa gtatgctgct aataatggag tccagatgat gacttttgat 4981 agtgaagttg agttgatgaa agttgccaga gcacatccca aagcaaagtg agttattccc 5041 ccatctgagg gcaagatcgg gagcataaga tatgtggatt cttatcaaac aaacttaaat 5101 ttctgattat tatatttcta tactttagta gaaagtagtt gaaaccccca ttgagtcatg 5161 aagcctggga ctcaaactac agaatatatc agcgacagta tttagaacag gattgttttt 5221 attttaattg tggctataag tgaacatcta tcatgagaca tttgctgcac tttccttgct 5281 tgtaggttgg 
ttttgcggat tgccactgat gattccaaag cagtctgtcg tctcagtgtg 5341 aaattcggtg ccacgctcag aaccagcagg ctccttttgg aacgggcgaa agagctaaat 5401 atcgatgttg ttggtgtcag gtgagatttt ggtgggatag ctagaggtca agacattgaa 5461 cagtttgagt tttacaggct ttctcctagt gtttgctatt attttaagaa atactaagac 5521 acagtgtctc gtctctttat tttaccccag cttccatgta ggaagcggct gtaccgatcc 5581 tgagaccttc gtgcaggcaa tctctgatgc ccgctgtgtt tttgacatgg gggtgagtat 5641 acgtgaccct gttagggaag ggcgggacac aactgacaat aactagtctt aattctagag 5701 ttaacttttt atggcagttg gttctgtatt acatgggttt cagcctatct gctgcataca 5761 tttttgttat tagctgtgga tctggctgac ttattttctt gattctaggc tgaggttggt 5821 ttcagcatgt atctgcttga tattggcggt ggctttcctg gatctgagga tgtgaaactt 5881 aaatttgaag aggtaattta gaacaaaact gtaatactca gtagccgttc taataaattc 5941 ctttttggaa tatttcaaaa tttaagtgtc ttaactaata ccacaatggg ctgaagtgtc 6001 ttggtgtgat attttgagtg atttctttgt gctgtctgac attacacttg ataccatttg 6061 gttttctaaa gtgtgaatca gctttcccag aagtcttgga taattggtta cattggaaat 6121 catggctcac acctgtaatc cagcacttgg ggaggccaag gtggtaggat cacttgagcc 6181 caggagtttg agaccagcct gggcaacaca gtgagacccc atctctacaa aaaaaatttt 6241 aaaattagcc tggtgtggtg gcgggcacct gtaatcccag ctacttggaa ggctgaggtg 6301 ggaggatcac ttgagcccag gaggttgagg ctgcagtgag ccatgatcat gccactgcac 6361 tcagcctggg ctacagagtg agaccctgtc tcaaaaaaaa aaaagaaaaa gcatgttgct 6421 gtgggcttcc tagagaatat gctgactgta gcacatcatc accccaaatg tgctttgcta 6481 gacctatgct tcctctcctt aaaatacttg aaatgtttag tcacttagga agttaagcca 6541 ttatattggt gcttgaattt ataaaataca tccacatggt ttgttaaaat catgacgtag 6601 gcagaatagg atttttatcc tgttggcatg tatttgttaa aatgttttga catcttgatg 6661 ccttcctagg tagtagttag ttgcgtactg ttctttgata aaaatcatac ccataacatc 6721 ctaaaggaga tagggtgcct ggaggggaat gaaaacgagc cacctgggat atgtagcctg 6781 gttttcaggg agatgttgat gtttttttgc ttttgttact ttaatgataa acctgtctgt 6841 tgatgcctgg tctcatgatg tcatgtcaca aggccctgtg atgttactcc cccatgtgaa 6901 tttcccacaa tgaaggctgc tctttctttt ctgtttcact ctcttagatc accggcgtaa 6961 tcaacccagc gttggacaaa 
tactttccgt cagactctgg agtgagaatc atagctgagc 7021 ccggcagata ctatgttgca tcagctttca cgcttgcagt taatatcatt gccaagaaaa 7081 ttgtattaaa ggaacagacg ggctctgatg gtatgtataa aggacgaatc acttcatgta 7141 taactgaaag ctgatgcaaa aagtcattaa gattgttgat ctgcctttct agacgaagat 7201 gagtcgagtg agcagacctt tatgtattat gtgaatgatg gcgtctatgg atcatttaat 7261 tgcatactct atgaccacgc acatgtaaag ccccttctgc aaaaggtaat ttctgagcat 7321 actgtataaa acaattaaga ggactggtca caacacgtgt aattaagtag tacttcctct 7381 ctccgtctct ttatatagag acctaaacca gatgagaagt attattcatc cagcatatgg 7441 ggaccaacat gtgatggcct cgatcggatt gttgagcgct gtgacctgcc tgaaatgcat 7501 gtgggtgatt ggatgctctt tgaaaacatg ggcgcttaca ctgttgctgc tgcctctacg 7561 ttcaatggct tccagaggcc gacgatctac tatgtgatgt cagggcctgc gtggtaagta 7621 agccatgcat gttgatggtg ctgccaagaa taggcacctt cttggatgtg tgcttcttgt 7681 ctagacgaat aagaaattgt cttgcctaag attaaatata tatggatatt tttcctaaga 7741 aaagttttag aaaagactga tgagtgtatt tctatgtaat tggaatatat ttaagttcat 7801 gccatgtgtc ttgtggtttc cttattacca aaacggtgac tgaagaaacg cttgctttag 7861 aaatacattg aattggccag gtgtgctggc tcacacctga aatcacaaca cattgggagg 7921 ccaaggcaga aggatcactt gagcccagga gttcgagcct gggcaacata gtgagaccct 7981 gtctctacaa aaaattaaaa aattagttgg ccatggtagt gggcgcctgt agtcccagct 8041 gcttggctaa ggtgagaggt ttgcttgagc ctgggaggtt gaggctgcgg tgagctatga 8101 tagcaccatt gtattccagc ctgagtaaca gagaaagacc ctgtctcaga aaaaaaaaaa 8161 atacattgaa ttgtttcctg atgggaagta aatactctca tgcccagtta ggagtgagtc 8221 agggttttta atatgccact ttttctttct caggcaactc atgcagcaat tccagaaccc 8281 cgacttccca cccgaagtag aggaacagga tgccagcacc ctgcctgtgt cttgtgcctg 8341 ggagagtggg atgaaacgcc acagagcagc ctgtgcttcg gctagtatta atgtgtagat 8401 agcactctgg tagctgttaa ctgcaagttt agcttgaatt aagggatttg gggggaccat 8461 gtaacttaat tactgctagt tttgaaatgt ctttgtaaga gtagggtcgc catgatgcag 8521 ccatatggaa gactaggata tgggtcacac ttatctgtgt tcctatggaa actatttgaa 8581 tatttgtttt atatggattt ttattcactc ttcagacacg ctactcaaga gtgcccctca 8641 gctgctgaac aagcatttgt agcttgtaca 
atggcagaat gggccaaaag cttagtgttg 8701 tgacctgttt ttaaaataaa gtatcttgaa ataattaggc attgggacgt ttttatggtg 8761 tgttcattcc agacagttca cgaatcccgt atagctcgct ctgattctca gagaacaatg 8821 agtgggtcca cccacacaca ggtaggagga caggtgagac ggaagcccca tcctcccatg 8881 tggacggtgc acatctgctc agcccacccc acatgtccag agttggctgc aaactccttg 8941 tccagagcct ctggtggtgg gacctactta agtctgacgg acctgtcctg tccaggccag 9001 tgcccaggga aggtgtggga ggccctttga gcctggcctg cag // LOCUS HSODF2 1430 bp DNA PRI 21-APR-1994 DEFINITION H.sapiens ODF2 (allele 2) gene for outer dense fiber protein. ACCESSION X74614 NID g474425 KEYWORDS outer dense fiber protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1430) AUTHORS Hofferbert,S. TITLE Direct Submission JOURNAL Submitted (12-AUG-1993) S. Hofferbert, Ins. fuer Humangenetik, Gosslerstrasse 12 D, 37073 Goettingen, FRG REFERENCE 2 (bases 1 to 1430) AUTHORS Hofferbert,S., Burfeind,P., Hoyer-Fender,S., Lange,R., Haidl,G. and Engel,W. TITLE A homozygous deletion of 27 basepairs in the coding region of the human outer dense fiber protein gene does not result in a pathologic phenotype JOURNAL Hum. Mol. Genet. 
20, 2167-2170 (1993) FEATURES Location/Qualifiers source 1..1430 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q22" gene 280..1275 /gene="ODF2 (allele 2)" exon 280..599 /gene="ODF2 (allele 2)" /number=1 CDS join(280..599,843..1275) /gene="ODF2 (allele 2)" /codon_start=1 /product="outer dense fiber protein" /db_xref="PID:g474426" /translation="MAALSCLLDSVRRDIKKVDRELRQLRCIDEFSTRCLCDLYMHPY CCCDLHPYPYCLCYSKRSRSCGLCDLYPCCLCDYKLYCLRPSLRSLERKAIRAIEDEK RELAKLRRTTNRILASSCCSSNILGSVNVCGFEPDQVKVRVKDGKVCVSAERENRYDC LGSKKYSYMNICKEFSLPPCVDEKDVTYSYGLGSCVKIESPCYPCTSPCSPCSPCNPC NPCSPCNPCSPYDPCNPCYPCGSRFSCRKMIL" intron 600..842 /gene="ODF2 (allele 2)" /number=1 exon 843..1275 /gene="ODF2 (allele 2)" /number=2 polyA_signal 1390..1395 BASE COUNT 394 a 315 c 332 g 389 t ORIGIN 1 aaggtgagtg aaatctcaac acgagtatgg ttctgagagt agctctgtaa ctctgaggat 61 ggtctctgga gaccatgact gtgtacagtt cacacggtaa ccagaagact atgacatact 121 tcagaaggtg gtgaggtcat agaacacaag ctttaaagta agtgaatcat gtgtgcctca 181 tttattttta aaagcaactt ctgagaaggg cttagaacaa attttttccc tgagtgccat 241 ttcccaaagg tactcacaga acaatcaggt gtgaccataa tggctgcact gagttgtctc 301 ttggacagtg tcagaaggga cataaagaag gtggacagag aactaaggca actgagatgc 361 atcgacgaat ttagcacacg gtgcctgtgc gacttgtata tgcaccccta ttgctgctgt 421 gacttgcacc catatccgta ctgcttgtgc tattccaagc gatcacgctc ttgcggcctg 481 tgtgatctct acccatgttg cctgtgtgat tataagcttt actgtctgcg accatctctc 541 agaagtttgg agaggaaagc catcagagcc atagaagatg agaagcgaga gcttgccaag 601 taaaataact tatttttaaa tttttatagt cggtatatta gccttataag ttggaataag 661 gaaaaatatg tgacaatcaa tagttgaaca aagattaaaa gggttcagga taatgaccct 721 ggtgacttag gaaagattta aggcttgcta ttcaaagcta gaaatcaaat attgcttctc 781 aagctgtgtc tggattctga ggtctgagct ccctcccacc ctatcccttt tctcaattcc 841 agactgagaa gaacaacaaa tagaattctg gcttcctcct gctgtagcag taacatttta 901 ggatcggtga atgtatgcgg ttttgaaccc gatcaagtca aagttcgagt gaaggatgga 961 aaggtatgtg tgtcggctga gcgggagaac aggtacgact gccttggatc gaaaaagtac 1021 agctacatga 
acatctgcaa agagttcagc ttgccgccct gtgtggatga gaaggatgta 1081 acatactcct atgggctcgg cagctgtgtc aagatcgagt ctccttgcta cccttgcact 1141 tctccttgca gcccgtgcag cccgtgcaac ccctgcaacc cctgcagccc ctgcaacccg 1201 tgcagcccat atgatccttg caacccgtgt tatccctgtg gaagccgatt ttcctgtagg 1261 aagatgattt tgtaaagtgc gcataggaac ccattactta atagaagtca gttactccag 1321 ccaggcagct ctcccaatgt ttctcctctc cttcccatgg cccctgttgt tgaagtacgt 1381 aggaaactga atacataact gcaatctgct ggtgttgtct gaaagtcttt // LOCUS HSP53G 20303 bp DNA PRI 25-JUN-1997 DEFINITION Human p53 gene for transformation related protein p53 (also called transformation-associated protein p53, cellular tumor antigen p53, and non-viral tumour antigen p53). ACCESSION X54156 NID g35213 KEYWORDS anti-oncogene; cell cycle control; growth suppressor; heat shock protein 70; oncogene; p53 cellular tumour antigen; p53 gene; phosphoprotein; transforming capacity; tumor antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 20303) AUTHORS Chumakov,P.M. TITLE Direct Submission JOURNAL Submitted (02-AUG-1990) Chumakov P.M., Engelhardt Inst. of Molecular Biology, Academy of Science of the USSR, Vavilov St. 32, 117984 Moscow, USSR REFERENCE 2 (bases 1 to 20303) AUTHORS Chumakov,P.M., Almazov,V.P. and Jenkins,J.R. JOURNAL Unpublished REFERENCE 3 (bases 1 to 20303) AUTHORS Futreal,P.A., Barrett,J.C. and Wiseman,R.W. TITLE An Alu polymorphism intragenic to the TP53 gene JOURNAL Nucleic Acids Res. 19 (24), 6977 (1991) MEDLINE 92107726 COMMENT See also entries K03199, M14690, M14695, X01405, X02469, M22881-4, M22887-8, M22894-8. See also Mol. Cell. Biol. 6:1379-1385(1986); and Mol. Cell. Biol. 7:961-963(1987). 
FEATURES             Location/Qualifiers
     source          1..20303
                     /organism="Homo sapiens"
                     /strain="caucasian"
                     /db_xref="taxon:9606"
                     /chromosome="17"
                     /map="p13"
     mRNA            join(843..949,11689..11790,11906..11927,12021..12299,
                     13055..13238,13320..13432,14000..14109,14452..14588,
                     14681..14754,17572..17678,18599..19876)
                     /gene="p53"
     prim_transcript 843..19876
                     /gene="p53"
     exon            843..949
                     /gene="p53"
                     /number=1
     gene            843..19876
                     /gene="p53"
     intron          950..11688
                     /gene="p53"
                     /number=1
     repeat_unit     2581..2587
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     2588..2877
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     2890..2896
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     3915..3929
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     3950..4223
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     4224..4238
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     4319..4327
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     4328..4603
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     4631..4639
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     4786..5574
                     /gene="p53"
                     /note="rearranged cluster"
                     /rpt_family="ALU"
     repeat_unit     5802..5811
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     5812..6100
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     6127..6136
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     6221..6236
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     6237..6517
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     6531..6546
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     6548..7812
                     /gene="p53"
                     /note="rearranged cluster"
                     /rpt_family="ALU"
     repeat_unit     8703..8982
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     9087..9098
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     9099..9377
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     9391..9402
                     /gene="p53"
                     /note="3' ALU-flanking"
     repeat_unit     9513..10332
                     /gene="p53"
                     /note="rearranged cluster"
                     /rpt_family="ALU"
     repeat_unit     11065..11069
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     11070..11357
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     11374..11378
                     /gene="p53"
                     /note="3' ALU-flanking"
     exon            11689..11790
                     /gene="p53"
                     /number=2
     CDS             join(11717..11790,11906..11927,12021..12299,13055..13238,
                     13320..13432,14000..14109,14452..14588,14681..14754,
                     17572..17678,18599..18680)
                     /gene="p53"
                     /codon_start=1
                     /product="protein p53"
                     /db_xref="PID:g35214"
                     /translation="MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLM
                     LSPDDIEQWFTEDPGPDEAPRMPEAAPRVAPAPAAPTPAAPAPAPSWPLSSSVPSQKT
                     YQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAM
                     AIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSVVV
                     PYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCA
                     CPGRDRRTEEENLRKKGEPHHELPPGSTKRALPNNTSSSPQPKKKPLDGEYFTLQIRG
                     RERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDS
                     D"
     intron          11791..11905
                     /gene="p53"
                     /number=2
     exon            11906..11927
                     /gene="p53"
                     /number=3
     intron          11928..12020
                     /gene="p53"
                     /number=3
     exon            12021..12299
                     /gene="p53"
                     /number=4
     intron          12300..13054
                     /gene="p53"
                     /number=4
     repeat_unit     12588..12597
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     12598..12882
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     12901..12910
                     /gene="p53"
                     /note="3' ALU-flanking"
     exon            13055..13238
                     /gene="p53"
                     /number=5
     intron          13239..13319
                     /gene="p53"
                     /number=5
     exon            13320..13432
                     /gene="p53"
                     /number=6
     intron          13433..13999
                     /gene="p53"
                     /number=6
     repeat_unit     13617..13630
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     13631..13913
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     13930..13943
                     /gene="p53"
                     /note="3' ALU-flanking"
     exon            14000..14109
                     /gene="p53"
                     /number=7
     intron          14110..14451
                     /gene="p53"
                     /number=7
     exon            14452..14588
                     /gene="p53"
                     /number=8
     intron          14589..14680
                     /gene="p53"
                     /number=8
     exon            14681..14754
                     /gene="p53"
                     /number=9
     intron          14755..17571
                     /gene="p53"
                     /number=9
     repeat_unit     15171..16491
                     /gene="p53"
                     /note="rearranged cluster"
                     /rpt_family="ALU"
     repeat_unit     16633..17397
                     /gene="p53"
                     /note="rearranged cluster"
                     /rpt_family="ALU"
     exon            17572..17678
                     /gene="p53"
                     /number=10
     intron          17679..18598
                     /gene="p53"
                     /number=10
     repeat_unit     18076..18091
                     /gene="p53"
                     /note="5'-ALU flanking"
     repeat_unit     18092..18371
                     /gene="p53"
                     /rpt_family="ALU"
     repeat_unit     18389..18404
                     /gene="p53"
/note="3' ALU-flanking" exon 18599..19876 /gene="p53" /number=11 repeat_unit 19424..19431 /gene="p53" /note="3' ALU-flanking" repeat_unit 19431..19752 /gene="p53" /rpt_family="ALU" repeat_unit 19753..19760 /gene="p53" /note="5'-ALU flanking" BASE COUNT 5246 a 4970 c 5107 g 4980 t ORIGIN 1 ttcccatcaa gccctagggc tcctcgtggc tgctgggagt tgtagtctga acgcttctat 61 cttggcgaga agcgcctacg ctccccctac cgagtcccgc ggtaattctt aaagcacctg 121 caccgccccc ccgccgcctg cagagggcgc agcaggtctt gcacctcttc tgcatctcat 181 tctccaggct tcagacctgt ctccctcatt caaaaaatat ttattatcga gctcttactt 241 gctacccagc actgatatag gcactcagga atacaacaat gaataagata gtagaaaaat 301 tctatatcct cataaggctt acgtttccat gtactgaaag caatgaacaa ataaatctta 361 tcagagtgat aagggttgtg aaggagatta aataagatgg tgtgatataa agtatctggg 421 agaaaacgtt agggtgtgga tattacggaa agccttccta aaaaatgaca tttaactgat 481 gagaagaaag gatccagctg agagcaaacg caaaagcttt cttccttcca cccttcatat 541 ttgacacaat gcaggattcc tccaaaatga tttccaccaa ttctgccctc acagctctgg 601 cttgcagaat tttccacccc aaaatgttag tatctacggc accaggtcgg cgagaatcct 661 gactctgcac cctcctcccc aactccattt cctttgcttc ctccggcagg cggattactt 721 gcccttactt gtcatggcga ctgtccagct ttgtgccagg agcctcgcag gggttgatgg 781 gattggggtt ttcccctccc atgtgctcaa gactggcgct aaaagttttg agcttctcaa 841 aagtctagag ccaccgtcca gggagcaggt agctgctggg ctccggggac actttgcgtt 901 cgggctggga gcgtgctttc cacgacggtg acacgcttcc ctggattggg taagctcctg 961 actgaacttg atgagtcctc tctgagtcac gggctctcgg ctccgtgtat tttcagctcg 1021 ggaaaatcgc tggggctggg ggtggggcag tggggactta gcgagtttgg gggtgagtgg 1081 gatggaagct tggctagagg gatcatcata ggagttgcat tgttgggaga cctgggtgta 1141 gatgatgggg atgttaggac catccgaact caaagttgaa cgcctaggca gaggagtgga 1201 gctttgggga accttgagcc ggcctaaagc gtacttcttt gcacatccac ccggtgctgg 1261 gcgtagggaa tccctgaaat aaaagatgca caaagcattg aggtctgaga cttttggatc 1321 tcgaaacatt gagaactcat agctgtatat tttagagccc atggcatcct agtgaaaact 1381 ggggctccat tccgaaatga tcatttgggg gtgatccggg gagcccaagc tgctaaggtc 1441 ccacaacttc cggacctttg 
tccttcctgg agcgatcttt ccaggcagcc cccggctccg 1501 ctagatggag aaaatccaat tgaaggctgt cagtcgtgga agtgagaagt gctaaaccag 1561 gggtttgccc gccaggccga ggaggaccgt cgcaatctga gaggcccggc agccctgtta 1621 ttgtttggct ccacatttac atttctgcct cttgcagcag catttccggt ttctttttgc 1681 cggagcagct cactattcac ccgatgagag gggaggagag agagagaaaa tgtcctttag 1741 gccggttcct cttacttggc agagggaggc tgctattctc cgcctgcatt tctttttctg 1801 gattacttag ttatggcctt tgcaaaggca ggggtatttg ttttgatgca aacctcaatc 1861 cctccccttc tttgaatggt gtgccccacc ccccgggtcg cctgcaacct aggcggacgc 1921 taccatggcg tagacaggga gggaaagaag tgtgcagaag gcaagcccgg aggcactttc 1981 aagaatgagc atatctcatc ttcccggaga aaaaaaaaaa agaatggtac gtctgagaat 2041 gaaattttga aagagtgcaa tgatgggtcg tttgataatt tgtcgggaaa aacaatctac 2101 ctgttatcta gctttgggct aggccattcc agttccagac gcaggctgaa cgtcgtgaag 2161 cggaaggggc gggcccgcag gcgtccgtgt ggtcctccgt gcagccctcg gcccgagccg 2221 gttcttcctg gtaggaggcg gaactcgaat tcatttctcc cgctgcccca tctcttagct 2281 cgcggttgtt tcattccgca gtttcttccc atgcacctgc cgcgtaccgg ccactttgtg 2341 ccgtacttac gtcatctttt tcctaaatcg aggtggcatt tacacacagc gccagtgcac 2401 acagcaagtg cacaggaaga tgagttttgg cccctaaccg ctccgtgatg cctaccaagt 2461 cacagaccct tttcatcgtc ccagaaacgt ttcatcacgt ctcttcccag tcgattcccg 2521 accccacctt tattttgatc tccataacca ttttgcctgt tggagaactt catatagaat 2581 ggaatcagga tgggcgctgt ggctcacgcc tgcactttgg ctcacgcctg cactttggga 2641 ggccgaggcg ggcggattac ttgaggatag gagttccaga ccagcgtggc caacgtggtg 2701 aatccccgtc tctactaaaa aatacaaaaa ttagctgggc gtggtgggtg cctgtaatcc 2761 cagctattcg ggagggtgag gcaggagaat cgcttgaacc cgggaggcag aggttgcagt 2821 gagccaagat cgtgccacta cactccagcc tgggcgacaa gaacgaaact ccgtctcaaa 2881 aaaaaggggg gaatcataca ttatgtgctc atttttgtcg ggcttctgtc cttcaatgta 2941 ctgtctgaca ttcgttcatg ttgtatatat cagtattttg ctccttttca tttagtatag 3001 tccatcgatt gtatatccgt ccttttgatg gccttttgag ttgtttccca tttgcggtta 3061 tgaaataaag ctgctataaa cattcttgta caattctttt tgtgatcata tgttttcgtg 3121 tttcttggag aaatacttag gaggggaatt 
gtggaggaag taaaaagtag ctgtattttg 3181 aactttttca gaagctctga gttttccaga gcggttgtac cattttacac tccaactagc 3241 aaggtatggg agttattatg gttgtgccac agccttccgg acattaggta tgtcagtctt 3301 tctaatgtgg tatatccttg tggttgtaat ttacagttct ctattgacta aggatgttca 3361 gcattttttc atgtgcctat tggccattcg tattttgttt gtaaagtagc tcttcgagtc 3421 ttttacctgt tattttggtt tttttgtttg tttttattgt tcagttgtgg gactgcttta 3481 tactttctgg atacaagtcc ttatcagatc catgagtcgt gaatgttttc ttctgatctg 3541 ttgcgggcct atttgtttgc tttacagagt ttacagaatc ttaagaggag tggattaatc 3601 ttttttatgt tcagtatttg ccttgtcctg tttaggacat cttttttttt ttttttaacc 3661 ccagggtcat gaagatataa tcttacattt tcttttagga cctttatggt ggtaagtttt 3721 acagtaaggt ccttaagcca ttaattaatt cttaaaatta attgtttatg gtgtgaggtg 3781 taggagtcag tctctggtat ctttcctgta tggaaatcca gttattctgt ctccacttgt 3841 tgaaataggc ttcctttctc tactgaatgc ttttaatttt aattatttta cagttggagt 3901 atagggctac cattttagtg ctattttctt tttttctttg ttaatttttg agacagggac 3961 tcacactgtt gcccaggcta gagtacaatg gcacaatcaa ggcttactgc agcctcgaac 4021 ccctgggctc aagcagtcct ctagcagcct cacgagtagc tgggattact ccaccacacc 4081 cagctaacta ttttattttt ttgtattgac aggatctcac tatgttgccc aggctggtct 4141 caaactgctg gcctcaagct ttcatcccat ctcggcctcc caaagtgctg ggattacagg 4201 tgtgagccac catgcctgac ctcttagtgc tattttctat ttatctcctc tgttctctgc 4261 tctctttaaa cgttggagga agaaacagta cccatcttac acaaactctt cagaaaacag 4321 aggaacagac tgggcgcggt ggctcatacc tgtaatctca gcactttggt acgctgaggc 4381 aggggatcat ttgaggtcgg gagttcgaga ccagcctggc caacacggcg aaaccccatc 4441 tctactaaaa tacaaaagta gctaggcgtg caccatacct gtaatgccag ttactcagga 4501 ggctgaggca caagaatccc ttgaacctgg gaagcggagg ttgcagtgag ccgagattgc 4561 gccactgcac tccagcctgg gcaacagagt gagaccctgt ctcagaaaaa aaaagaaaga 4621 aagaaaaaat agaggaatat ttcccaactt gttttcgaag ccaggataat cctggtacca 4681 aaaccaaaca aggacattat aagaaaagaa aatatagacc aatattcctg ttagcataga 4741 catgcaacag ctaaccaatt ttagcaaacc aaacctggta atatagaaaa aaggataaat 4801 aggccagtcg cggtggctca cgcctgtaat cccagcactt 
tgggaggctg aggcaggcag 4861 atcacttgag gtcaggagtt tgagaccagc ctgaccaaca tggtgaaacc ccgtttctaa 4921 taaaaataca aaaatcaggc tgggcacggt ggctcacgcc tgtaatccca gcactttggg 4981 aggccgaggt gggcagatca cgaggtcagg agttcaagac cagcctgacc aatgtggtga 5041 aacgccatct ctactaaaaa tacgaaaatc agccggtgtg gtggcacctg cctgtaatcc 5101 cagctactca ggaggctgag gcagaattgc ttgaacccgg gaggcagagg ttgcagtgag 5161 ccaagatcgt gccactgcac tccagcctgg gcgacagagc aagacttcat ctcaaaaaaa 5221 aaaaaaatta gctgggcatg gtggtgggca cctgaaatcc cagctactcg ggagtctgag 5281 gcaggagaat cgcttgaacc caggaggcag aagttgcact gagctgggat cacaccattg 5341 cactccagcc tgggcaacag agtgagactc catctcaaaa aaagaaaaag aaaaaggata 5401 aatacattct aaccaaataa tgtttatctc atgattgtag ctgattcaac attcaaaaat 5461 tggcctggtg cagtagctca ggcctgtaat cccaacattt taggaggctg aggcaggaag 5521 atctcttgag cccaggattt caagaccagc ctgggcaaca tagtcagact ggtctttact 5581 ggggggaaaa aaatcagtct gtgtaattca ccacattaac aaagggaaac ataaaaaccc 5641 tatgatcatt tcaacagatg tagcaaaagc agttaatgat atcaacacat atgcatgatt 5701 acaaaccaac caacctccta gcaaactagg gaaaggaaac ttaactagtt tgataacagg 5761 gcgtccacag tcggagttcc actagcagca tacataatgg tagaaaactc agtgctgctg 5821 ggggcggtgg ctcacgcctg taatgccagc gctttgggag gcctaggcgg gcggatcacg 5881 aggtcaggag atcgagactg tcctgactag catgctgaaa ccccgtctct actaaaaata 5941 caaaaacaaa aaattagccg ggcatggtgg cgggcgccta tagtgccagc tactcgggag 6001 gctgaggcga gagaatggcg tgaacccggg aggcggagct tgcagagcct agatcgtgcc 6061 actgcactcc agcctgggtg acagagtgag acttcgtctc aaaaaaaaaa aaaaaaaaaa 6121 aagaaaagaa aactcaacgc tttttcctct aagatcagga actagaaaag gatttgactc 6181 tcacaacgtt gataccatac tggaggtttt aaccaggcaa gaaaaagaaa taatgagggc 6241 cgggtgcggt ggctcaggcc tgtaatccca gcactttggg aagccgagac gggtggatca 6301 cgaggtcagg agatcgagcc atcctggcta acacggtgaa accctgtctc tactaaatat 6361 acaaaaaatt agccgggcgt ggtggcgggc gcctgtagtc ccagctactc gggaggctga 6421 ggcaggagaa tggcgtgaac tcagggggcg gagcttgcag tgagctgaga tcgagccact 6481 gcactccagc ctgggcgaca gagcaagact gtgtctcaaa aaaaaaaaaa 
gaaaaagaaa 6541 taatgattag tggcccgatg tctcacgcca gtaatcccag cactttggga ggccgaggtg 6601 ggcagatcac ctgaggtctg gagttggaga ccagcctgac aaagatggtg aaacctcgtc 6661 tctattaaaa tattaaaaaa atagccaggc gttggccggg tacagtggct catgcctgta 6721 accccagcac tttgggaggc cgaggtgggt ggatcacctg aggtcaggag ttcaacacca 6781 gcctggccaa catggtgaaa ccccatctct actaaaaata caaaattagc cgggcgtagt 6841 ggcgggcgcc tgtaatccca gctacttggg aggcttaggc aggagaatcg cttgaacctg 6901 ggaggcggag gttgtagtga gccgagattg caccattgca ctccagcctg ggtgacaaaa 6961 gcaaaaactc cgtctcaaaa aaaaaagaat tagccagggg tagtggtgaa cgcctgtagt 7021 cccagctact caggaggcag aggcaggaga atcacttgaa ccccggaggc agaggttgca 7081 gtgagccgag attgtcccat tgcactccag cctaggcgag aagagcaaaa ttccatgtca 7141 aaaaaaaaaa aaaaaaagga aagaaaaaaa ataacgatta gaaaggaaga aatcaaacac 7201 attcacagcc agtatgattc tatacatacc atggtcctaa tggggccagg cgtggtggct 7261 catgctgtaa tcctagcact tttaggaggc tgaggcaggt ggcttccctg ggaccagctg 7321 gccaacatgg tgaaacccca actctaataa aaatacaaaa aatcagccag gcgtggtgag 7381 ggcacctcta atcccagcta ctcaggaggc tgaggcagga gaattgcttg gacctgggag 7441 gcagaggttg cagtgagccg agatcgcgct attgcactcc agcctgggca acaagagtga 7501 aactccggca gggtgtggtc ttacgcctgt aatcccagca cttcgggagg ctgagccagg 7561 ccgatcacct gaggtcagga gtttgagacc aacctaacat ggtgaaaccc cgtctctact 7621 aaaaatacaa gaattagctg ggtgtagtgg tgggcgcctg taatcccagc tacttgggag 7681 gctgagacag aagaattgct tgaacccagg aggtggaggt tgcagtgagc tgagatcatg 7741 ccattgcaca ccacgccggg caacagagcg agattccgtc tcaaaaaaaa aaaaaagatg 7801 aaactctatc tcaaaaaaaa aaaaaagtcc taatggaaaa tccataaaaa gctaccaaaa 7861 ctaataaata aatatagcag ggttgcaggt tacagggcaa tatagttatc cctctatctg 7921 taggggcttg gttctgggac tcctcacaca ccaaacccac agatgtctaa gtcccatata 7981 taagacggaa tagtatttaa cctacacata tcctcccata tagtttaaat tatctagatt 8041 acttacatta cccccataca atgaaaatgc taatgtacat gcaagtatgt atgtaagtac 8101 ttgtactata ttgtttaggg aatcactgga cagataggcc ttcaagactg ataccagcag 8161 ccactgttaa gattctggtc aggcctgccc ctgtttgggg tctcagttga tctcattgcc 
8221 ttcccaccca gccaagggca cctgcatttc tcttggctcc ctggccattt ggaaggccta 8281 gttcagcctg gcacatttgt atcctggccc actgatgctg gtacccctgg gaaggtcctg 8341 ctctgaaaaa cacggagatt ttagttgcta ctgaagattt gagagataaa gacagggaga 8401 cctgtctgta gacctgtgtc cctccaagtg ggattgagac tttgggcccc ccatttcagg 8461 acagcacctc ctggcctgtt gactgaatag atccctgaag gaggtgtagt tgcattttag 8521 gagtgggggt gggagcagta ccactgatcc gcactaacaa tcacacagtt ctctctagaa 8581 taataatata gaacaagtga aatagaacaa ttgcagaaag agctaacctt tgttgagctc 8641 ttactgtgtg cccagcactt tcctcaactc tacatttccc ataatacata gagtactagg 8701 taggcggggc ttgggggctc acgcctgtaa tcccagcact ttaggaggcc aaggggggtg 8761 gatcacctga ggtcgggagt tcaagaccag cctgactaac atggtgaaac cccgtctcta 8821 ctagaagtac aaaattagcc aggtgtggtg gcacatgctt gtagtcctag ctactcagca 8881 ggctgaggca ggagaatcat ttgaatccgg gaggaggttg cagtaagcgg agatagtgcc 8941 actgtactcc agcctgggca ataagagctg agactccgtc tcaaaataaa ataaaataaa 9001 ataaaataaa ataaaataaa ataaaaaaag aaaagagcct gccattaaag gagctgtttg 9061 gtaggggatg ttttgtcagt gcaaacaaca gaaaagtggg ctgggcacag tggttcatgc 9121 ctgtaatccc agcactttgg gaggccaagg cgggcggatc acctgaagtt gggagttcaa 9181 gaccagcctg accaatatgg agaaaccccg tctctactaa aaatacaaaa ttagccgggc 9241 gcagtggccg atgcctgtaa tcccagctac tcgggaggct gaggcaggag aatcgcttga 9301 acctgggagg cagaggttgc ggtgagccga gatcgcacca ttgcactcca gcctggacga 9361 gagcaaaact ctgtctcaaa aaaaaaaaaa aacagaaaag tgtaacaaac acttacagta 9421 ggcatgtttc ttagcaaatc tgatgacaaa tttggcataa agaaagagag catccctgaa 9481 aaaaaaaaaa agaaaaagaa agagagcatc ctgcctgggc aacatagtga aaccctgcct 9541 ctacaaaaaa actcaaaaat tggccgggtg cagtggctca cacctgtaat cccagcactt 9601 tgggagtcgg aggcgggagg atcacctgag gtcaggagtt cgaaaccagc ctggccaaca 9661 tggcaaaacc ccatctctac taaaaataca aaaaattaat caggcgcatt ggtgggcgcc 9721 tgtaatccca gctactcagg aagttgaggc aagaggatcg cttgatactg ggaggtggag 9781 gttacagtga gtcgagatca caccactgca ctctagcctg ggtgacaggg cgagactccg 9841 tctccaaaaa aaaaaagaaa aagaaaaaga ctaaaaaatt agccaggcag gcctctgtgg 9901 
tcccagctac ttgggaggct gaggcaggag aatcactgag cccaggagtg gcaggctgta 9961 gtgagccatg attgcaccac tgtaccctag cttgggcttc aaagcaagac cctgcctcaa 10021 aagaaaaaag aaagaaagaa agaacatggc gggccaggca cagtggctca cacctgtaat 10081 cccagcgctt tgagaggccg aggcaggtgg atcacaaggt caggagttcc acaccagcct 10141 ggccaacatg gtgaaaccct gtctctacta aaaatacaaa aaatcagcag gcagggtggt 10201 aggggcctgt aatcccagct actcgggagg ctgaggcagg agaattgctt gaaaccagaa 10261 ggcagaggtt gcagtgagcc tagactgcac cactgcactc cagcctgggc gaaaagagcc 10321 aaactccatc tcaaaaaaca aacaaaaaaa caaaacaaaa aaaaacatgg cagcctttga 10381 aagcttgtct gggagaaggt gcgatgatgg ttgcataact tcgtgcaaga tgctggtcca 10441 cacaggggct gccccttgct ctttctcgct ctcttaacct ctcatataac aggcttgtgt 10501 gttatgcaca tttattgagc ccaagcaggt gcaaggcatt gtgatctaat actttggtca 10561 gcaagacaac aagatagatc actgccctgc ccttaggaag tgtatatgct attagaggaa 10621 acagataaaa taaacaagga aaagtatcag acaatgtaag tgctatgaga atgcaaatga 10681 ggtgatgtga attaaaatag gatgacttaa gtctgcacgg aaggccccta cccccatgtt 10741 cctggctagc caaggaacca ccagttgatt agcagagaag ggcagcccgt ctagctagag 10801 cttttgggga agagggagtg gttgttaaga gatgagatta aagaagccga gacgggccct 10861 tcgtgagggg gggttgtaat gcagggctga ggagtgtccg aagagaatgg gcaggtgagc 10921 ggtgagacag ttgttcttcc agaagctttg cagtgaaagg aatcaaagaa atggagccgt 10981 gtatcaggtg gggaagggtg ggggccaagg gggtgtcctt ccccatacag agattgcagg 11041 ctgagaatga ctatatcctt gttaacagga ggtgggagca gggcacggta gctcacacct 11101 gtaatcttgg cactttagga ggcggaggcg ggccgatcac ctgaagtaag gagttcgaga 11161 ccagcctggc caacatgcaa agccctgtct ctactaaaaa tacaaaaatt agctgggtgt 11221 ggtggtactc gcctgtaatc ccagctactc gggagactga ggcaggagaa tggcttgaac 11281 ccggaaggta gaggttgcag tgagctgaga tcatgccact gtgctccagc ctaggtgaca 11341 gagagagact ccatctcaaa aaaaaaaaaa aatacaggaa gggagttggg aatagggtgc 11401 acatttagga agtcttgggg atttagtggt gggaaggttg gaagtccctc tctgattgtc 11461 ttttcctcaa agaagtgcat ggctggtgtg gggtggggca ggagtgcttg ggttgtggtg 11521 aaacattgga agagagaatg tgaagcagcc attcttttcc tgctccacag 
gaagccgagc 11581 tgtctcagac actggcatgg tgttggggga ggggcctcct cctctgcagg cccaggtgac 11641 ccagggttgg aagcgtctca tgctggatcc ccacttttcc tcttgcagca gccagactgc 11701 cttccgggtc actgccatgg aggagccgca gtcagatcct agcgtcgagc cccctctgag 11761 tcaggaaaca ttttcagacc tatggaaact gtgagtggat ccattggaag ggcaggccac 11821 caccccgacc ccaaccccag ccccctagca gagacctgtg ggaagcgaaa attcatggga 11881 ctgactttct gctcttgtct ttcagacttc ctgaaaacaa cgttctggta aggacaaggg 11941 ttgggctggg acctggaggg ctgggggggc tggggggctg aggacctggt cctctgactg 12001 ctcttttcac ccatctacag tcccccttgc cgtcccaagc aatggatgat ttgatgctgt 12061 ccccggacga tattgaacaa tggttcactg aagacccagg tccagatgaa gctcccagaa 12121 tgccagaggc tgctccccgc gtggcccctg caccagcagc tcctacaccg gcggcccctg 12181 caccagcccc ctcctggccc ctgtcatctt ctgtcccttc ccagaaaacc taccagggca 12241 gctacggttt ccgtctgggc ttcttgcatt ctgggacagc caagtctgtg acttgcacgg 12301 tcagttgccc tgaggggctg gcttccatga gacttcaatg cctggccgta tccccctgca 12361 tttcttttgt ttggaacttt gggattcctc ttcaccctta ggcttcctgt cagtgttttt 12421 ttatagttta cccacttaat gtgtgatctc tgactcctgt cccaaagttg aatattcccc 12481 ccttgaattt gggcttttat ccatcccatc acaccctcag catctctcct ggggatgcag 12541 aacttttctt tttcttcatc cacgtgtatt ccttggcttt tgaaaataag ctcctgacca 12601 ggcttggtgg ctcacacctg caatcccagc actctcaaag aggccaaggc aggcagatca 12661 cctgagcccc aggagttcaa gaccagcctg ggtaacatga tgaaacctcg tctctacaaa 12721 aaaatacaaa aaattagcca ggcatggtgg tgcacaccta tagtcccagc cactcaggag 12781 gctgaggtgg gaagatcact tgaggccagg agatggaggc tgcagtgagc tgtgatcaca 12841 ccactgtgct ccagcctgag tgacagagca agaccctatc tcaaaaaaaa aaaaaaagaa 12901 aagctcctga ggtgtagacg ccaactctct ctagctcgct agtgggttgc aggaggtgct 12961 tacacatgtt tgtttctttg ctgccgtgtt ccagttgctt tatctgttca cttgtgccct 13021 gactttcaac tctgtctcct tcctcttcct acagtactcc cctgccctca acaagatgtt 13081 ttgccaactg gccaagacct gccctgtgca gctgtgggtt gattccacac ccccgcccgg 13141 cacccgcgtc cgcgccatgg ccatctacaa gcagtcacag cacatgacgg aggttgtgag 13201 gcgctgcccc caccatgagc gctgctcaga 
tagcgatggt gagcagctgg ggctggagag 13261 acgacagggc tggttgccca gggtccccag gcctctgatt cctcactgat tgctcttagg 13321 tctggcccct cctcagcatc ttatccgagt ggaaggaaat ttgcgtgtgg agtatttgga 13381 tgacagaaac acttttcgac atagtgtggt ggtgccctat gagccgcctg aggtctggtt 13441 tgcaactggg gtctctggga ggaggggtta agggtggttg tcagtggccc tccgggtgag 13501 cagtaggggg gctttctcct gctgcttatt tgacctccct ataaccccat gagatgtgca 13561 aagtaaatgg gtttaactat tgcacagttg aaaaaactga agcttacgag gctaagggcc 13621 tcccctgctt ggctgggcgc agtggctcat gcctgtaatc ccagcacttt gggaggccaa 13681 ggcaggcgga tcacgaggtt gggagatcga gaccatcctg gctaacggtg aaaccccgtc 13741 tctactgaaa aatacaaaaa aaaattagcc gggcgtggtg ctgggcacct gtagtcccag 13801 ctactcggga ggctgaggaa ggagaatggc gtgaacctgg gcggtggagc ttgcagtgag 13861 ctgagatcac gccactgcac tccagcctgg gcgacagagc gagattccat ctcaaaaaaa 13921 aaaaaaaaag gcctcccctg cttgccacag gtctccccaa ggcgcactgg cctcatcttg 13981 ggcctgtgtt atctcctagg ttggctctga ctgtaccacc atccactaca actacatgtg 14041 taacagttcc tgcatgggcg gcatgaaccg gaggcccatc ctcaccatca tcacactgga 14101 agactccagg tcaggagcca cttgccaccc tgcacactgg cctgctgtgc cccagcctct 14161 gcttgccgct gacccctggg cccacctctt accgatttct tccatactac tacccatcca 14221 cctctcatca catttccggc gggaatctcc ttactgctcc cactcagttt ccttttctct 14281 ggctttggga cctcttaacc tgtggcttct cctcccacct cctggagctg gagcttaggc 14341 tccagaaagg acaagggtgg ttgggagtag atggagcctg gttttttaaa tgggacaggt 14401 aggacctgat ttccttactg cctcttgctt ctcttttcct atcctgagta gtggtaatct 14461 actgggacgg aacagctttg aggtgcgtgt ttgtgcctgt cctgggagag accggcgcac 14521 agaggaagag aatctccgca agaaagggga gcctcaccac gagctgcccc cagggagcac 14581 taagcgaggt aagcaagcag gacaagaagc ggtggaggag accaagggtg cagttatgcc 14641 tcagattcac ttttatcacc tttccttgcc tctttcctag cactgcccaa caacaccagc 14701 tcctctcccc agccaaagaa gaaaccactg gatggagaat atttcaccct tcaggtacta 14761 agtcttggga cctcttatca agtggaaagt ttccagtcta acactcaaaa tgccgttttc 14821 ttcttgactg ttttacctgc aattggggca tttgccatca gggggcagtg atgcctcaaa 14881 gacaatggct 
cctggttgta gctaactaac ttcagaacac caacttatac cataatatat 14941 attttaaagg accagaccag ctttcaaaaa gaaaatagtt aaagagagca tgaaaatggt 15001 tctatgactt tgcctgatac agatgctact tgacttacga tggagttact tctgataact 15061 cgtcgtaagt tgaaatattg aaatattgta agttgaaaat ggatttaata cacctaatct 15121 aaggaacatc atagcttagc ctagcctgct tttttttttt tttttttttt ggagacagag 15181 tctcactctg ctacccaggc tggagtgcag tggcgggatc tcggctcact gcaacctccg 15241 ccttctgggt tcaagcgatt ctcctgcctc agcccactga gtagctggga ttacaggcac 15301 ctgccccgac gcccagctaa ttttttgtta tttatttcct ttttttttag tagagataga 15361 atttcaccat gttggccagg ctagtctcga actcctgacc ttgtgatctg cctgccttgg 15421 cctcccaaag tgctgggatt acaggcgtga gccaccgcac ctggcctgcc tagcctactt 15481 ttattttatt tttaatggag acagcatctt gctctgttgc ccaggctgga ttacagtgat 15541 gtgatcatag ctcattatac cctcctgggc tcaagcaatc cccctaactc tgcctcccca 15601 gtagctagga ccacaggcat acaccaccat acccagctaa tttttaaaat tttttgtaga 15661 tagatagagt ctcactatgt tgcccaggct ggtctctagc ctactttttt gagacaaggt 15721 cttgctctgt cacccaggct ggatagagtg cagtagtgca gtcacagctc actgcagcct 15781 ccacctccca ggctccatcc atcctcccag ctcagcctcc caagttgctt caactacagg 15841 cctgcaccac catgcctggc taatttttat ttatttattt ttattttatt ttattttatt 15901 ttttgagact cagtctcact ctgtcgcctt aggctggagt gcagtggcat gatctcggct 15961 cactgctaac ctctgcctcc tgggtttcaa gtgattctcc tgcctcagcc tcccgaatag 16021 ctaggactac aagcgcctgc taccacgccc ggctaatttg tgtattttta gtagagacag 16081 ggtttcacca tgttggccag gctggtctcg aacttctgac catgtgatcg ccgcctcggc 16141 ctcccaaagt gctgggatta caggtgtgag ccaccacgcc cggctaattt ttatttattt 16201 atttaaagac agagtctcac tctgtcactc aggctagagt gcagtggcac catctcagct 16261 cactgcagcc ttgacctccc tgggctccgg tgatttcacc ctcccaagta gctaggacta 16321 caggcacatg ccacgacacc cagctaattt tttattttct gtgaagtcaa ggtcttgcta 16381 cgttgcccat gctggtatca aacccctggg ctcaatcaat ccttccacct cagcctcccc 16441 aagtattggg gttacaggca tgagctacca cactcagccc tagcctactt gaaacgtgtt 16501 cagagcattt aagttaccct acagttgggc aaagtcatct aacacaaagc cctttttata 
16561 gtaataaaat gttgtatatc tcatgtgatt tattagatat tgttactaaa agtgagaaac 16621 agcatggttg catgaaagga ggcacagtcg aagccaggca cagcctgggc gcagagcgag 16681 actcaaaaaa agaaaaggcc aggcgcactc tcacgcctgt aatcccagca tttcgggagg 16741 ctgaggcggg tggatcacct gaggtcagga gttcaagacc agcctagcca acatggtgaa 16801 accccgtctc tactaaaata caaaaattaa ccgggcgtga tggcaggtgc ctgtaatccc 16861 agctacttgg gaggctgagg caggagaatc gcttgaacca ggaggcggag gttgcaggga 16921 gccaagacgg cgccactgca ctccagcctg ggcgatagag tgagactccg tctcagaaaa 16981 aaaagaaaag aaacgaggca cagtcgcatg cacatgtagt cccagttact tgagaggcta 17041 aggcaggagg atctcttgag cccaagagtt tgagtccagc ctgaacaaca tagcaagaca 17101 tcatctctaa aatttaaaaa agggccgggc acagtggctc acacctgtaa tcccagcact 17161 ttgggaggtg gaggtgggta gatcacctga cgtcaggagt tggaaaccag cctggctaac 17221 atggtgaagc cccatctcta ctaaaaacac aaaaattagc cagtgtgaga cacgttgagt 17281 ccacgtactc ggaggctgag gcacaagaat cacttgaacc ccagaggcgg agattcgaat 17341 cagccaagat tgcaccattg cactcccgcc tgggcgacga gagtgagacc ccatctcaaa 17401 ataaataaat aaatattttt aaaagtcagc tgtataggta cttgaagtgc agtttctact 17461 aaatcgatgt tgcttttgat ccgtcataaa gtcaaacaat tgtaacttga accatctttt 17521 aactcaggta ctgtgaatat acttacttct ccccctcctc tgttgctgca gatccgtggg 17581 cgtgagcgct tcgagatgtt ccgagagctg aatgaggcct tggaactcaa ggatgcccag 17641 gctgggaagg agccaggggg gagcagggct cactccaggt gagtgacctc agccccttac 17701 tggccctact cccctgcctt cctaggttgg aaagccatag gattccattc tcatcctgcc 17761 ttcatggtca aaggcagctg accccatctc attgggtccc agccctgcac agacattttt 17821 ttagtcttcc tccggttgaa tcctataacc acattcttgc ctccacgtag tatccacaga 17881 acatccaaac ccagggacga gtgtggatac ttctttgcca ttctccgcca actccccagc 17941 ccagagctgg agggtctcaa ggggcctaat aattgtgtaa tactgaatac agccagagtt 18001 tcaggtcata tactcagccc tgccatgcac cggcaggtcc taggtgaccc ccgtcaaact 18061 cagtttcctt atatataaaa tggggtaagg gggccgggcg cagtggctca cgaatcccac 18121 actctgggag gccaaggcga gtggatcacc tgaggtcggg agtttgagcc cagcctgacc 18181 aacatggaga aaccccatct ctactaaaaa tacaaaagta 
gccgggcgtg gtgatgcatg 18241 cctgtaatcc cagctaccta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 18301 gcagaggttg cggtgagctg agatctcacc attacactcc agcctgggca acaagagtga 18361 aactccgtct caaaaaagta taataaagta aaatggggta agggaagatt acgagactaa 18421 tacacactaa tactctgagg tgctcagtaa acatatttgc atggggtgtg gccaccatct 18481 tgatttgaat tcccgttgtc ccagccttag gcccttcaaa gcattggtca gggaaaaggg 18541 gcacagaccc tctcactcat gtgatgtcat ctctcctccc tgcttctgtc tcctacagcc 18601 acctgaagtc caaaaagggt cagtctacct cccgccataa aaaactcatg ttcaagacag 18661 aagggcctga ctcagactga cattctccac ttcttgttcc ccactgacag cctccctccc 18721 ccatctctcc ctcccctgcc attttgggtt ttgggtcttt gaacccttgc ttgcaatagg 18781 tgtgcgtcag aagcacccag gacttccatt tgctttgtcc cggggctcca ctgaacaagt 18841 tggcctgcac tggtgttttg ttgtggggag gaggatgggg agtaggacat accagcttag 18901 attttaaggt ttttactgtg agggatgttt gggagatgta agaaatgttc ttgcagttaa 18961 gggttagttt acaatcagcc acattctagg taggtagggg cccacttcac cgtactaacc 19021 agggaagctg tccctcatgt tgaattttct ctaacttcaa ggcccatatc tgtgaaatgc 19081 tggcatttgc acctacctca cagagtgcat tgtgagggtt aatgaaataa tgtacatctg 19141 gccttgaaac caccttttat tacatggggt ctaaaacttg acccccttga gggtgcctgt 19201 tccctctccc tctccctgtt ggctggtggg ttggtagttt ctacagttgg gcagctggtt 19261 aggtagaggg agttgtcaag tcttgctggc ccagccaaac cctgtctgac aacctcttgg 19321 tcgaccttag tacctaaaag gaaatctcac cccatcccac accctggagg atttcatctc 19381 ttgtatatga tgatctggat ccaccaagac ttgttttatg ctcagggtca atttcttttt 19441 tctttttttt tttttttttt ctttttcttt gagactgggt ctcgctttgt tgcccaggct 19501 ggagtggagt ggcgtgatct tggcttactg cagcctttgc ctccccggct cgagcagtcc 19561 tgcctcagcc tccggagtag ctgggaccac aggttcatgc caccatggcc agccaacttt 19621 tgcatgtttt gtagagatgg ggtctcacag tgttgcccag gctggtctca aactcctggg 19681 ctcaggcgat ccacctgtct cagcctccca gagtgctggg attacaattg tgagccacca 19741 cgtggagctg gaagggtcaa catcttttac attctgcaag cacatctgca ttttcacccc 19801 acccttcccc tccttctccc tttttatatc ccatttttat atcgatctct tattttacaa 19861 taaaactttg ctgccacctg 
tgtgtctgag gggtgaacgc cagtgcaggc tactggggtc 19921 agcaggtgca ggggtgagtg aggaggtgct gggaagcagc cacctgagtc tgcaatgagt 19981 gtggactggg gggcccagtg cccgggttcc gggaggggaa caaaggctgg agactgggtc 20041 agtctgcggg ctgcatgaca acaagggagg gggtggctcc attcataact caggaaccaa 20101 ccgtccctcc tcccctccgc ccacggctgg cacaaggttc tctgcctccc ctgcttctag 20161 gattgggctg cttccccctc ggcagcctct caccaaggat tacgggattt aaatgtcgtg 20221 atttacgaag gctgagcctc cagggtggcc atcttcgtcc atcagaagtg gcaggatacc 20281 tgggttccaa gggaacaggg tgg // LOCUS HSPACAP 17041 bp DNA PRI 19-DEC-1995 DEFINITION H.sapiens gene PACAP for pituitary adenylate cyclase activating polypeptide. ACCESSION X60435 NID g35229 KEYWORDS PACAP gene; pituitary adenylate cyclase activating polypeptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17041) AUTHORS Hosoya,M. TITLE Direct Submission JOURNAL Submitted (17-SEP-1991) M. Hosoya, Tsukuba Research Labs, Takeda Chemical Industries-Ltd, 10 Wadai, Tsukuba-shi, Ibaraki-ken 300-42, JAPAN REFERENCE 2 (bases 1 to 17041) AUTHORS Hosoya,M., Kimura,C., Ogi,K., Ohkubo,S., Miyamoto,Y., Kugoh,H., Shimizu,M., Onda,H., Oshimura,M., Arimura,A. and Fujino,M. TITLE Structure of the human pituitary adenylate cyclase activating polypeptide (PACAP) gene JOURNAL Biochim. Biophys. Acta 1129 (2), 199-206 (1992) MEDLINE 92110383 REFERENCE 3 (bases 1 to 17041) AUTHORS Weber,B.H., Riess,O., Graham,R. and Hayden,M.R. TITLE (CA)n-dinucleotide repeat polymorphism at the D4S227 locus JOURNAL Hum. Mol. Genet. 
2, 827-827 (1993) FEATURES Location/Qualifiers source 1..17041 /organism="Homo sapiens" /db_xref="taxon:9606" intron <1..7538 /number=1 exon 7539..7649 /number=2 CDS join(7540..7649,9813..9944,10420..10518,11601..11790) /codon_start=1 /product="PACAP precursor" /db_xref="PID:e214691" /db_xref="PID:g1132550" /translation="MTMCSGARLALLVYGIIMHSSVYSSPAAAGLRFPGIRPEEEAYG EDGNPLPDFGGSEPPGAGSPASAPRAAAAWYRPAGRRDVAHGILNEAYRKVLDQLSAG KHLQSLVARGVGGSLGGGAGDDAEPLSKRHSDGIFTDSYSRYRKQMAVKKYLAAVLGK RYKQRVKNKGRRIAYL" sig_peptide 7540..7611 intron 7650..9812 /number=2 exon 9813..9944 /number=3 intron 9945..10419 /number=3 exon 10420..10518 /number=4 mat_peptide 10421..10507 /product="PRP" intron 10519..11600 /number=4 exon 11601..13197 /number=5 mat_peptide 11653..11769 /product="PACAP" polyA_signal 13177..13182 BASE COUNT 4465 a 3817 c 4125 g 4634 t ORIGIN 1 gatcacgagg tcacgagatc gagaccatcc tggccaacat ggtgaaaccc catctctact 61 aaaaatacaa aaaatagctg ggcatggtgg cccatgcctg tagtcccagc tactcgggag 121 gctgaggcag gagaattgct taaacccggg aagcggaggt tgcagtgagc caagatcgca 181 ccactgcctc ccagcctggt gatggagtga gactccatct caaaaaaaaa aaaaaaaaat 241 tcctagagaa aataaatatg ccagtataac atattatagt cattaagact gtctggagtc 301 attgagactt gaatctgaag ttcagcatta caatgtagca gctgtgtaac tttggataag 361 gtacctgagc tcttttagtc ccgatttctt gtctgtaaaa tggaggtaat aacagtgcct 421 acaaagaagt ttgttgtgag ggaaaggaaa taagtagtca agcacttagc ccaggaagtg 481 ttcattaaac agttgttgct gttgctgtta ttcactggtg aataacaaaa ccatacagtc 541 cctttggaag gaaggattta aaataattta aacaaataat cactaaaaat ttcaaccagt 601 acatttatga caaatgtaat agtattccaa gcagatgata gtttttaaaa atttatgcct 661 gtgtatattt gggtagagac aaaggattat ttaaaaagta ttttcaggtt gggcacactg 721 gctcatcctt gcaacccctg cactttggga ggcttgagtc caggagtttg tgaccagcct 781 gggcaacaca gggagacctt gtctctgcaa agtaataata ataacaataa taataaataa 841 aattagccag gcatggtggt ggcatgtgcc tgtagtccca ggtattcaag aggttgaggc 901 aggagtctca cttaagccca agagtttgag gttgctgtga gctatgaatg cactactgca 961 ctccagcctg ggtgaccgag 
gaagactcag taaaaacaaa acaaaacaaa caaacaaaca 1021 aaaactgcaa agccgtgatt accatacagt gctagtaata atgataataa aaacaaaggc 1081 tccaaaatta ttacatgtaa acctatattc acaggtagat aatgccaagc ctgagcccaa 1141 agggaagagg gatgctaggg gctcacagag gaagaccttc ttttgatttt acacacaaaa 1201 accttgtttt attttgcata tagattcccc ttccaacatt ttctgaagtg gtctcaaaag 1261 cttatttagg atattggttt tcctggatcc tgtccaagct tttccttctg catttaagcc 1321 ttatgtcagc agaggattta gactaagtag agagaagact ttcttctcct tggcttttta 1381 gaggaggtgc tctggaattt ggaagatgct acacagtgaa gtctgggata tacatttttg 1441 gtacccagta aactcatgtt cagagaataa ggccttagta gagccatata tgtgagatat 1501 ttggaatcat ccaaacctgt agcaaaggtt agatcctggt ttttgtttct cagatgtctg 1561 ctttatatct caaaacaaca agcaaaattt cttggctctg tgaatatgca attctgttct 1621 tagattagag agaagcttta ctcatgtggt ttggaaggct tcctttccta gttgttttca 1681 gtgtgtgaga agcactacat tttgaaggtc agagaagtca tgacacatta taggtaagct 1741 catcagcttc ttacttcaca gtgagttctg aaaggcatga tgcatgcagt ccagtaagtg 1801 atggtcatga tgttctggtt cagaacattt gggtttccct acaggtgtaa tcggtatgaa 1861 gtgagtcatt agtcattgca ttttctggaa aagtcggtag agaaaagttt aagtgaaatg 1921 taatacagtg ttataattca cttttgtcac tcacaggaga ggatgatgtt ttgcatcagg 1981 cttgtttact gaaaaagctt attatagcct ggttcttatg ctaagtactg gctaaaaaag 2041 aatagaatgt gccaggcacg gtggctcacg cctgtaatcc cattactttg aaaggctgag 2101 gcaggtggat cacaaggtca ggagttcgag accagcctgg cgaacacggt gaaactctgt 2161 ctgtactaaa aatacaaaaa ttagcagggt gtggtggtgg gtgcctgtag tcccagctgc 2221 ttgggaggct gagacaggag aatcacttga acctgggagg cggaggtggc agtgagccga 2281 gattgtgcca ctgcactcca gcctggggga cagagcgaga cttcatctca aaaaaaaaaa 2341 aaaaaaatgc acaacttttc caatcttgat ttagattatt tatactagga atgtgtgagg 2401 atgccttgaa caaacatgtc cttttatatg gttaagaaaa tcaaatgttg taggatatag 2461 agatagtggt agtaaaaggt aatggtgtag agatatgtac ctaaggaaag agaatgtcat 2521 ggagaaaccc tgggtactat gggtgactga gccaaaaaga aagtagtgga aatcttatct 2581 aagtgaccag aagccgcact tcactgggct gcttaaaggc aaaaatactt ttagctcacc 2641 actgatttgc aatatgggga tggaggggag 
cagtgtttaa aagatgctga agattcccat 2701 gcaatatagg gaaagccaat ttcccaagtg gtgatggtcc aaaggcagga acctggcaca 2761 gacacaacag tacaaacact atagtatttg ctaatgttgt gaagccatct gcaattcaaa 2821 ctcccagtat atatactaga catattcctc ctgtttgaat acaaaaaccc acttcctcaa 2881 aggctagagt tctttaaatt gaatgttaat tcaaggttca aggattaacc cttcaacaaa 2941 ggcggatgtg ttagccacca ggaaaaacaa ttcggggaag ggttagtttg acttttagct 3001 attatttatc atttcttacc caaacttgtt ttcacatctg aaggaccaac aggataaaag 3061 ttgatacatt agggacttga agttcagagt attattaaat catttccaac aaatatatat 3121 aaacagcgtc ttccgggcag gtcagggctt agctcaagtc actttcagtt gctgtgcctc 3181 agggaggatg ctggttagac ctcccactga aagatttcca ttgttcttct aacttttcta 3241 gccaaacatg attccagtta atgtaacaat ctcatagcct ggaaagaaac tgccagcctg 3301 ggaaatctac ttttctggcc tgggaagtat tctggtgagc actgagggaa gggagtaggg 3361 gtgttggaaa gaagacttga aattcctttg tgttatctgt aaatagaacg ttctaactct 3421 ttggtctctt cttcctcctc tcccccaacc ccctctttca tcattttcaa ttatatatag 3481 gaggttggaa agtttctctt gagctcttaa ccccagtcac ctaaataccc tttgtgaggg 3541 aaactgggta agaacaatta aagtggaagg ctctcctacc ctggtcttgc tcttccccaa 3601 ttctcctcta gctcctcctc cctttatctc tctctcttct cataaaagtg ctttagttga 3661 ggcttcctag gattcaccct ccagctccta tctgcacttg aagccaggct ggggtctgca 3721 cttgcaatta gtatgtctgt tggactgggc cacggtatcc cacctggcca ctgccgcatg 3781 cctcctcagt gcatgccggg gctcctggtc tctctagcct ggggctttgg gctgacaagt 3841 cccctcttcc ttgcagctcc ctcaaagtcc cagacacaaa ggcctccagg atgctctgtt 3901 aatgcttgac tggagccttc caagattaga atcaaagggg catttggggg tagttttggt 3961 ctttgagact tcagtcatcc catattccct ctacccaata gaaagcagaa ggggcctata 4021 ctctcatcta gcagcttcta gttcctccta tttattggcc ttttcccttg gcccagggcc 4081 aaggccagat tttcatgaat aggaaagctc tcctgcagag agatgtcaaa catgcccagc 4141 tagacaatgg ccatgagcaa caaaagatct gtgggtgatc ctgtaggagt ttgattcccc 4201 caggctgctg tgggcagggc tgtgtgggtc tttagattgt gttgagcaat tggtgggtct 4261 aggaagcttg attttctgga gttcagtgac cttgaatctc aagttctctg tttagtcttt 4321 cactctcgtg aatgggttca ggtctgaagg cctgtaattt 
ttgggtgctg agccccgagt 4381 tctgagctga gagtatctaa gctgagaaga gcacggggtc acagccttgg taagtcagag 4441 gcacagttca gcctctgttg gcccttggag ccagcagtta gttgtcctcc ccagatagtt 4501 aatcgtgttt gtggttttcc ccctttaatg ggccgtgaag tcagcaaact ccccacatgg 4561 tggctccctt tactaaagtt gagtagtgaa ttgtacaagg agtcttggaa attttcaaat 4621 atttctccag attgaactca actcagaatt cgtgtgggag gaaagagtaa ggattttaat 4681 ggggttcacc tttgacgtga agcaaggcgg aagacaggaa agccacagtg gggaatagct 4741 ttgggcgctt tagtaagaaa gacatctctg cttgattatc tggtagtgtt caccgcaggc 4801 tctttgtgtg cgagcttgct gcagaggcag agctgaacac ggaaaacacg catgtaaaca 4861 gtccaacata aatcagtgag catatgtatc agagaaaaag agacatattc ccatgtagat 4921 gtggtttgaa gctttattga aggcagacaa tctgaagcac ggccaggaat aaactaagag 4981 gaagcagaca gttttcggtc attcatggcc aggaataaac aaaacttagt ttttttttta 5041 agaaggagga agtattaaat ctcaaataag agcaggaaca gcatttagaa ggagaaatat 5101 aatatcttgg aaaaagcaag cagaactaat atagcttatt taaatgtgag atccaaatcg 5161 tagtaacagg aaaccctccc actaaactgg aatttcccct aatttttgtg taagatccaa 5221 ataattaaaa tgcactctaa tggttattga tggctctatt ttctttcttt ctttcttctt 5281 cttctttttt ttttttttaa agaaatagac ctgagttctc ttactggagt agaaatatat 5341 gaaccttctt caattaccca ggaaattgga agcctctggg tggatatggt cttcccttat 5401 tgctttcctc ttcccacatc attttcagtt aaaaaaatta actgtttcca gcagagggat 5461 tcctgttaga aaccttcatc aggtgaactt gtactgggaa ccctcatgct ttcccagtct 5521 gtctgtgtct cccaaacaga gctgaagttg taaacaaagt ggaaaaacat atttctcacc 5581 ccaaaattct taaaatttca cttcttgtgg aaaacacaat ttcacaacat caatttttaa 5641 aatctgtaag agccacagaa ggtgtgaaag tagccaaaca gccggtctag aaatccaaaa 5701 gccaggacta acgggggaca gaatgctttt tcctcaaatc caggcaggga tggggagcat 5761 tctcagcatt agggcattta tggacgctac aaggggaaag gatgtatctg aacggtgggg 5821 gtgatttagc gatgaatcgc cacgttaata gcactactgc caaatcttca aatttagagg 5881 ctctggtgaa aaattaaacc ggtggcaatt ttcaacgttt gtagcatctg ttaccctaca 5941 cttcagcacc cggagtctgg acagctccgc aggccgcgct ccggaggcag catgagctct 6001 catcaatcta ctcatagccc tactgtcaac ggcagccaga ctcaggagag 
atttactgaa 6061 aatcctccaa gacttccctt taaaaacaaa acgacttcca catttaatgg tctatctgaa 6121 agaacatacg caagaaatta ggagatctaa attaaattta ttaataggag agcttgatga 6181 tgcttaattc cagagaccag agctccgatt ggtgaggctt gatgaaaaag taaagagaaa 6241 tcgtaattgt atagttaaaa acataacttt tgtcatcctc aaaattctaa aaattcttta 6301 cctgtccttg ggaaatgggt gaaattgaaa accatcaaaa caattggact tcttaaaaat 6361 tggattgtat gagtgaaagg tgtttatgag aagtcgatga ctccggatct tatcatccaa 6421 gaggacagca cagaatagtt aatatgttcc ttgagggact aggatgctga cgtctttttc 6481 tgatacccga tcattacgtg actgagaaaa aaaaaaagga agtcatttca tgaataaaaa 6541 tcggagcgca acagtgcaac aaaatattct gtacttaaag gcaacaggca ggcagatgtt 6601 gacaaagagg gctctccaaa aaccatgttc ggatagattt ttgcgaactg cacagataaa 6661 taggagcaga aggccggtca cctctgtaac cagcggtagc agcagcagaa gccgcagctt 6721 cagaggcagc cggagagacc tcggagcaga gaaggcgccg ccgaccctcg cggctgcctg 6781 gcccgcggct cctacaaagg cgggctagcc gcccgccctc tcccttgcct tcctcccctt 6841 cttttctgac tttccctctt tcccttaatc gcctgcttct tcctccgggt ggacttacgg 6901 ccaccttgct cctccgcgct tcacctcatc gccccctctt tctttcttct gcctctctct 6961 ctgcgccccc ttctctccgt gtcacgctcc ctcctggttc tgcgcgtcta caaacttttg 7021 agcagaacac gagcctcggc aaacgagtcc cgcagctcct cctgctgctc ccgctggttc 7081 ctgcggcttc tgctcagaca ccaacgccag acggcgatgc ctctcgggtg gtgactccag 7141 cgcaggaact tgaagaagcg ctttgcccgc cgtcctacct ggcagctctc ctggcagcgg 7201 gaggagttga agggtaaggg agggaaaatc ttaccaaagc gaccggctca ctcgactgct 7261 gattctttcg cttggcgtcg cgtcagggga gttagctttc cttcagccgg gtctggctag 7321 ttattgggcg ccgggtagat gcatatatat atattttttt ctaactatag caagcaagaa 7381 gtggcagggc gcgcaccggc tgtcgccaag tgctgttcaa ctcagggagc cggggcttcg 7441 ctccgtccct cccccggctt ccagagcttt ttggggttgg agggtgggag gccaggggcg 7501 ttctcacagc tgtgtgtcct ctttcccatc ctgcgcagaa tgaccatgtg tagcggagcg 7561 aggctggccc tgctggtcta tgggataatc atgcacagca gcgtctacag ctcacctgcc 7621 gccgccggac tccggttccc cgggatcagg taggtgctgg ctgcctggcc caagcaggag 7681 ctggggctcc ccaggcacag acgcttcctc acggtctcct tcctgcagtc ctttgggtcc 
7741 agactactag catcgccctc tgcgcccccg gtgcgcctcc gccagcctcg gctggacagc 7801 gggtccccat tctagccgag ggtctggcag gctccgcgac tgctcggacg cctcccccag 7861 ccctaggcag ctcagggtcc cgggtagagc cagtgagctt ctggccgctg gagaaccccc 7921 cctcccccaa cccggcccac aggatggggg cagggcacgg cccctagctt ggtttctttt 7981 acctattctt gggacgagtt aggagaactt cagctctgga gcctggccgg gggttgagcg 8041 tgaagctccc tcggactttg ctttgttact gcttgttctg gactatccgg gtggggtctc 8101 tctctctcct ccaccctttc ttttcatttc attccaattc tttcccctga agagctttct 8161 ttcaagtgat ccgtgttcca actgcatttt gaatcccagg ctgtcttggg gggcgtgcgg 8221 tggggagggt gttggcccgg tgtgattgag gaaaagcgac ttaagagagg gaagaacaag 8281 gacgagactg cgaaggaggg ggaaaaacag gcgcaaagga ggaggaaggg aaagccagca 8341 ggcaggcagg accgggagag cagccctgcc tggcccggga tggaggaacc ttggcttttt 8401 tcttaacccc gggtttctaa cccgcaggcg cggcccaggt tcccggaggc agccccagag 8461 tcgcgggccg atgtgccagg ctgtggatga gccccgggta ggggagggtt cgtaccagcg 8521 gcgcctgggg cagcgaggag cgcgcgttct gcctgcgaag ctgccttctc cgagccccgc 8581 ccaggaacat tagctctggg gggccgctga tcattgattt ggacggagag atgggttctg 8641 ggttctgtat taggattcca gcatctgggc tcgaggcagg gcaatatcca gaaagacccc 8701 agggttcggg gtacccgggc cagggctgag gcgcatcgcc gagcaaaggc tgggtgcgag 8761 gcgtgcggaa tgatgcgctt gccttgcccg ggcctctcca aggatggaga aaaggcgagt 8821 gaagtagcga agtacgactc caaccccgcc cagagagtgc tactagcgct ggctgcacgc 8881 caagtctctc caggggtcca aagcgagagg gatttgtttt aacccatctc tacccgtcct 8941 gtgtcaagaa cggaggctgt agagggcgac tgcgaagtcg ccaggcactc gctggatctc 9001 ggtccccctc ctcgtgctct ggggttgaga tggggcaccg ccatcgataa cagatcagcg 9061 cgaactattc gtttagtggc cttaaaacac cctggtttca ccctcagcta ttttcaagtt 9121 cccgtgtgcc tggcactttc tccgtgcgag aagcaccgga gggtgcggac gcgccacagt 9181 ctgagccgcc gccgaactgg ctaagtttag gggcatttat tattcatgtt cctgccagat 9241 cctcgcctgc ccaaaataga aaccgaggtt ctccgtgacc tacatctgct cggggaaggg 9301 ctcccctggg ctcggaggct ggggtggggg tggctgagga gttggccccc gcacgcccca 9361 cgcatcctct cctttgcttt ctgggcctcc ccattcgggt cttcgcgtgg gtcagcgccc 9421 
ggtctcccag ggcctttctc gtccccgccc gttgctgctt tggggaggct cgggagccag 9481 gcggggaggg gggcggtcct tttccgtaga caggtgtgcg cgatcggcgg agacgcctcg 9541 gtttcccagc gcttgttgag gccgtggccc gcaggacgac cctttacccg cgaagggggg 9601 gtgggcggga ccgcccggcg gggtaggagt ggttgggtgt cgttgcctcc tccttacctc 9661 tgctcccacc cccagtcctg ggagaagaga caattctcag cggaggactt ttatcacctg 9721 tgaaaatccg cgcgagcccc ttactttgga tcctcgccga gctggggagg aacttgcact 9781 gaccacacct tctgtccccg gccaccccgc aggccagagg aagaggcgta cggcgaggac 9841 ggaaacccgc tgccagactt cggtggctcg gagccgccgg gcgcagggag ccccgcctcc 9901 gcgccgcgcg ccgccgccgc ctggtaccgc ccggccggga gaaggtgaga ttcgcgcggc 9961 ctcgcgcaca cccgcggctg ggagctcggg actgcggtga cgggaggggc agtgtggtga 10021 cccacccagg attttttttt tttttcccgt gaaagtcctc aagcctgtcc tctccctggc 10081 ccgatcctat tgcagcgaca gaaaatcagc agcgggcggg tctgtgtgga cctgagggcc 10141 gcgtggggac cgaggggggc tgtggcccaa agagtggcag tgagtggcgt caaggaaccc 10201 acactccgca tctgccactc ctagagccgg gactagctcc cgatcctagc agttgctctc 10261 gagatcatcc cgggagttat tggcgagttc tgggcctctg gaggtttccc tgtcagcctc 10321 cccggccgcc gagggggcgc gcgcccaaca agggggtctc tagcggccac ctggggacag 10381 aaacagtgac cctgggcgcg cactttgcct ccccgttaga gatgtcgccc acgggatcct 10441 taacgaggcc taccgcaaag tgctggacca gctgtccgcc gggaagcacc tgcagtcgct 10501 cgtggcccgg ggcgtggggt aagagtttgt ggaaggatta acctgcgcgc gccggggtgg 10561 gtgcctgtgc ggggcgcgcg gggcgggcgg cggtgggtgc ccgtgggggc cagggtgagt 10621 ctgcgcccct gggtctgggg tgggcatccg ccacgggtcg cagttggaga ttttgaagtg 10681 gcactttaaa tttgcccaga gagctctgga agaggcaaaa agggaacgcg agccagggag 10741 tttgatccgt tttgaatgaa aagaaagaga aaccaaacca aacctctcag tcatccaaaa 10801 ccttcaggct tccagggagg ttttgctata attttctcta agcatgactg tttctggggg 10861 aggggaaagg ggtggttgta tttactgaaa attcaaatcg aaataataaa tggccaaatg 10921 tggacactta tggacccaaa cagttttgct cacgccagag aaactgagag cacagggctt 10981 gcgtgaagcc tatctcggca gaaggcaaca ttctaataaa gcccgtggga aaacagatta 11041 cattttcgcc atgaataagt catgcagtga aaaatattgc ctacagcctg 
tcgacttata 11101 ttattatcac gtttttcaac tcggcgtgag gagggagagg agtgttcata tttgactagg 11161 aattgcagga tcgatgcaaa ctccagggca gcagccagac tggcatatgt agggctctcc 11221 ggttactttc tctgtatgtc gcgggtgaga ggaacagcga ggacaattta gcgcaaacac 11281 acgaagggtc ggatctcaag ggggcagcgc tgggagaaag gttaggcttg aagcgcgcgt 11341 cgcctgcccg gatcttatcc cgggccccct ccgcagggtt tggtgccagg agatcctgcg 11401 tggggagggg ggcatcgagg ggctgccgtc tcggccctcc ccacggctgc ttccaggcag 11461 aggcgggcga cgcggtgggc agtgcgagcc ccgggccctc cccgaaggct cccgcgtggg 11521 gtggggcccg cctgctcccc gcggcgattg aacctgtgtc tcccgccccg ccaccctctt 11581 cccgacccct ttgcttgcag tgggagcctc ggcggcggcg cgggggacga cgcggagccg 11641 ctctccaagc gccactcgga cgggatcttc acggacagct acagccgcta ccggaaacaa 11701 atggctgtca agaaatactt ggcggccgtc ctagggaaga ggtataaaca aagggttaaa 11761 aacaaaggac gccgaatagc ttatttgtag cgatgggtta ccagctaccc tgtgtataca 11821 gccctgacgc aatgaaaagt cgttttccaa actgactcaa cagtcatcgc tcgtgtgttc 11881 tatccaaaca tgtatttatg taatgaagta aagccattaa atgaatattt tgataataat 11941 attgtttttc tttctacaaa gcactagaga atgcacagat atactttgtg gaccaattat 12001 tgatatatat tataaatata tataaagaat atatatatat atatatatat aaagtataga 12061 gagaagttca tacaaagcgt gcacaaggat tgaaaattcg cccgagctgt ttatgttttt 12121 ataaaaataa atagaaaagt agacaatcat tgttttgaat attactccta tttttgtaaa 12181 ctggaattaa aaggatagta tttttatcca tgacaggcct gaagatatta ctacttacca 12241 tttgctactg tacataaaca atgatgccct gctccaggga gattttgagg taaagatatg 12301 gagaattgct gaagggcatt ctttcccagt gagtctctgg ggcaggctgc ttcaatccca 12361 gcctaactca actgggctct gtccccctgg ttgggtggca attccaatat ttctgctttc 12421 tttgattctc cttttatgtg tagttgtctc tcttcagact ctcagcccag aagaaaattc 12481 tcctgataaa acaacagctc gatccaaatt gtgcttctcc ccagaattca cgcctctccc 12541 taggagaaga gttgaggaac tgtacagaaa agggcggctt cgttagaccg ctctcttttc 12601 tgtacttcct gagtggccag ggaatctaat atccccaaat tagggcaatt ggaacaaagt 12661 gaaggacata gaggtatatt ggaagaggca gagcctgagg tggtaggagg acgaccctgg 12721 aaatggactg gtttgagatt gccccaggtc 
tgggaagctg agggcaaatc cagtcccagt 12781 ggtcctgact ttgggcgctg ggtattggaa atggatgcaa agtacaatgt gtttttctcc 12841 agtgctgtcc atgcttctca tcttgtgaaa tggccaggat cctctccttt gaaacctgct 12901 ctgtaggagc tacccttttc ctttgtggtt ttatggagac ctctccttcc taccctcctg 12961 cactgtttaa gtactgttta ccatttttca ttcacttctc ttaaacttgt gaatgcttct 13021 cacttttttt tttgtttgat gcaggcactt attgtaaatt ttagaaaccc ctctgtagcc 13081 actagtaagt aattatgcac taaatatgaa ccctttgttt cttgtttatt gagtttgtag 13141 gtaaaatgta tttttctaca ttattgctta ttgcttagta aaatttattt cataaaacca 13201 acctttgtca tattagaatg tgtagtgttc acatgttgct cagttttgct aactgataaa 13261 tcatttaatc ctcttcttca tatgtatgag tactatctta tatctgtggt caagagtgag 13321 gtaagcaagc tccaacagac cctgagaacc tacgcttgta tcctttcttt ggctaaagaa 13381 agcatgtctg tttcctgtca attctttgaa catacagagt aatctttata aacaaaagaa 13441 ccttcaccca gcaatcagat cgagcagcaa cagacaaacc agccagccaa tctcccaaat 13501 ttcaggcaca agtttatttt tttttttttt atgttttgaa aaaagaagat gaagaagaag 13561 aaaaaaaaaa gaacaaggaa agattaaacg ttagcttgta aagtttaaag gacctttcct 13621 tttcctttac ggatttgatc agtatgaagt cataaatcaa agaaaacaga attggatttg 13681 cattcccagg cgggatggat gctgccagga gatcacattg caaatagtga aaacagaggc 13741 attcggtcta tgcctgagtc ctgtgtatag gatcaatctt cctttaattc cgcagtctcc 13801 tcaggcaatg tgacacggga tgcagtttgc agctttagtg cctttcttcg ccttttaaat 13861 tgccacgaat cacagatggc tatttagtgg ccctacaatg ctgcaacaca tcagcttgca 13921 ttttagtctt aattatttgt ttcttggata atgggcagag ttttctgtat ttgtatcagc 13981 tgttagtggt gaaatagggc tctagttaac cttttattta tgaagtctaa tttagtgttc 14041 ccgtggctag ttgcaagcat tttacagtga tcacccagtt taatcttttg tatacttttt 14101 agaaatgcca agagccttac taaactgaag cagatttatg atatagtgat aatttaggta 14161 gatgttagtc ttgaagctct tattttgtgt gcaactgatt ataaaaacac cttaaccaag 14221 tattattaca cacatgatat ctataactag gactttgata actgttatat aaagtgtgta 14281 aaatttgtat gaataaattt ttgtaaacaa tgcaacttgg tctaatgttt gggaaaaaag 14341 acattcagga aataattact ttaaaatctc ttaaagtatt atatttcttt agcaaccata 14401 agattttttt 
acgtctggaa tatatatcta tctaagcacc cttgtatttt catgaactgc 14461 actttaataa ttgatgggca actggattct gctaaaaatt taaagtagct actcagatgg 14521 agatgcctaa gaaggtttta agctcataaa caggcatgat gttgcaacat tataagacac 14581 acaatttaga ttaatttcca tcccctagtg tgtatatact ttgctcaata ttcagaaagt 14641 tactaggtag tagtgggaga caatgctgga gcattagtta cacatctaaa atagcaatct 14701 aacattgttc ttttattttt tattttagtg gccaggtctc actatgttgc ccaggctggt 14761 cttgctcaag cgatcctccc acctcagcct cccaaagtgc tgggattaca ggcgtgagcc 14821 accacaccca gcctaaaata gggatctaac attgttctta tacaagtaac tctgcagact 14881 aaacttgtct tgataaaatt ttgtataaaa tgatctaaat aatcagtttt tggaggtttt 14941 aaaatgtatt tagagacata caaactactg tctctgatta aaatgcttta ggtagaagga 15001 acgtgaacat gagtaagtaa agagttaatt agatgccttt aaaagaaaat gtactttgaa 15061 gtccaggaag aaacacaaga agtcatttgt ggattgtatg ctttcttagt tcatatttac 15121 aaactttagg gcaaagcttt catacgaaat tccttcaaat tccgtagtgg tgtgtgtttt 15181 ggcacctttg actatttctg gcttagaaaa tgtatagaaa gtcacacaat aattgacata 15241 ccatttaatt taaaatgcca gggtttcatc ctaaaaatta atggtctcaa ttagtaaatc 15301 aataaatatg ttacgataga attaaaggat gagtgaggat tctaaaatta tcttcagaat 15361 ttagctcagt atttaaggcc tagaatcaaa tagggaggag cccacagtct agaaatcccg 15421 tttgtagtca atgaaaaaat gaatccagta cagttcatat ttgcatttga ttttattgga 15481 taaggaattt ttcttctcca tctttaactg cccctctttg tccttgaaga catagtgtgg 15541 tagatgaaaa aatgaagaag actttattcg gttggggcta ggctaatgac ttgtcaagaa 15601 cataaagata aaccccagac ttggctgact tcaagtgaat ttcatgtatt tagcaacttg 15661 ccatattatc ttcggtgata actcaaatta catcttttta aaggcagact tgatacatat 15721 gggtattcaa gaagctgtaa taggtgcctt aatgctgtta gggcggagaa cacacttatt 15781 caatacaatg cacacttatt gaacacatga aatgggccac cgcactggtc gagagagctt 15841 gactaacacc tgggagctca ggtagtattt ttttcagaat gtttttctga aatgtgatca 15901 tctttggggc ggggggagct taaaaatgca aaattgtagg gctctgccag atccagtgaa 15961 tctgcatttt aaataaaaac cctagattac tgtgcactaa aatttgagac tgcctagatt 16021 tagaattggt gacatatgta ggcatacata tgtgctggtc tgaatgtttt tttcatatga 
16081 aataagcaaa aggtcatgtt acctgcatac agtaataaat acataactgt gccatattct 16141 tccaagatat ctggtcatta agctctttga caatttcagt atttccttta ggtcactaaa 16201 actactagtt agcattattt tacttgtaca gtctggttgg acctctccta caggagcttg 16261 tggaaggaga gtgatcctct aagttgggtc caaaatattc aatcacagga ctaagagatt 16321 atggctataa tgaggagaac ttgtgcagct agctagccat aattctgggg atccagaagt 16381 caacttccag ttgcattata tcccaatttg gtttgaatgt atttactgct ccccaactgt 16441 ttacatgatg gtttctcttg gatggctcac tatgaccttc aacccaaccc tactgttcac 16501 atgatcacaa gattggaagc caagatcaag tcatccctct tctcttttgt tgccactctt 16561 ttttgtagaa gggagatgcc agctgcccct gctgctgcag attgcatcac tgctggattc 16621 ttacattggt ttgtagttgg tcatcctggt cacttcccct gcaaccacat agttttagct 16681 ccatcttagt atcatgtccc tctgatcaga tgttccagag tagcttccat tgtcaaaggg 16741 ttaaagggtt taaggtaatc agtagtcaat tttacctctc tgttcttcaa cacgatcctt 16801 cctctttttt ttgttttgag aaagggtctt actctgttgc ccaggctgga gtgcagtggc 16861 acgatctcgg ctcactgcaa cctctgcctc ccgggtttaa gcgattctcc tgtctcaacc 16921 tcccgagtag ctgggattac aggtgcatgc caacgcgccc ggctaatttt tgtattttta 16981 gtagagacgg ggtttcactg tgttggccag gctggtcttg gactcttgtc cccaaatgat 17041 c // LOCUS HSPAT133 3720 bp DNA PRI 23-APR-1993 DEFINITION H.sapiens zinc finger gene pAT133. ACCESSION X69438 NID g38423 KEYWORDS zinc finger gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3720) AUTHORS Zipfel,P.F. TITLE Direct Submission JOURNAL Submitted (19-NOV-1992) P.F. Zipfel, B.Nocht Institute Tropical Medicine, Freie und Hansestadt Hamburg, Molecular Biology, Bernhard-Nocht-Strasse 74, 2000 Hamburg 36, FRG REFERENCE 2 (bases 1 to 3720) AUTHORS Holst,C., Skerka,C., Lichter,P., Bialonski,A. and Zipfel,P.F. TITLE Genomic organization, chromosomal localization and promoter function of the human zinc-finger gene pAT133 JOURNAL Hum. Mol. Genet. 
2 (4), 367-372 (1993) MEDLINE 93278383 FEATURES Location/Qualifiers source 1..3720 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda fix" /chromosome="2" /map="2p13" TATA_signal 1050..1056 exon 1080..1446 /gene="pAT133" /number=1 mRNA join(1080..1446,1847..3690) /gene="pAT133" prim_transcript 1080..3690 /gene="pAT133" gene 1080..3690 /gene="pAT133" CDS join(1311..1446,1847..3171) /gene="pAT133" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g38424" /db_xref="SWISS-PROT:Q05215" /translation="MLHLSEFSEPDALLVKSTEGCCAEPSAELPRLPARDAPAATGYP GAGDFLSWALNSCGASGDLADSCFLEGPAPTPPPGLSYSGSFFIQAVPEHPHDPEALF NLMSGVLGLAPFPGPEAAASRSPLDAPFPAGSDALLPGPPDLYSPDLGAAPFPEAFWE ASPCAGAPSQCLYEPQLSPPDVKPGLRRPPASPALDAVSAFKGPYAPWELLSVGAPGN CGSQGDYQAAPEARFPVIGTKIEDLLSISCPAELPAVPANRLYPSGAYDAFPLAPGDL GEGAEGLPGLLTPPSGEGGSSGDGGEFLASTQPQLSPLGLRSAAAADFPKPLVADIPG SSGVAAPPVPPPPPTPFPQAKARRKGRRGGKCSTRCFCPRPHAKAFACPVESCVRSFA RSDELNRHLRIHTGHKPFQCRICLRNFSRSDHLTSHVRTHTGEKPFACDVCGRRFARS DEKKRHSKVHLKQKARAEERLKGLGFYSLGLSFASL" intron 1447..1846 /gene="pAT133" /number=1 exon 1847..3690 /gene="pAT133" /number=2 polyA_signal 3671..3677 /gene="pAT133" BASE COUNT 724 a 1165 c 1063 g 768 t ORIGIN 1 tgcagttgtc tatcaaaatt ttacatatac tttggcacac aattacactc gttggaattt 61 atctcacaga cacattcaca tttatatagt caaggatgtt gactccaaca acattggtat 121 aagtaaaaag ttggaaactg cctacatgct catggatagg ggataaatcg actaactcaa 181 ggtcctccct tacaatggag tactatacag ccttgaaaaa gaatgacaag atctttatgt 241 caggacatgg cacaatctct aagttatttc aagtgaaaaa ataataataa taataatatt 301 gcagaacaat gtgtatggtt ggatatacat aaattcttct gaaagaatac ccaagaaact 361 taaatgtggc tacctcagga gctaggaatg aggtaggtgg agaggaggga cttttacttt 421 tcaatttaga atataccctg ttgtgctagt tgactttcct cctactgtta acaagtatta 481 cagtcctttt taatttaaat tatttttcct agcacaaatt taaaaaaaaa aaacttaaaa 541 tggaaaaaca cacaggcgtg agcaaaagaa atgaatggca ataatctctg tggtttactg 601 agtgcgcctg gacatttatg tgctctcagt ttctctccac aggaaatgca caggtgagaa 661 actgacgtta agggggactg agtgtcaagc 
tagttagtgg cagagggcag attcaaaccc 721 aacacggtcc tcccctgctg cccctcggcc tctgcctcca ggtgggaagc gcatctaccg 781 gacggtcggc ccggtgaggc gcagcgcccc agactggcgc atccgcggcc ccagcgctcc 841 acgcctgggg agcgcgcgcg cacgcagcgg cgcgagcctg gcggcggcgg cgacaacaac 901 aacgtcacag ctcgagcttt ccttttcgga gtccccggca cacatcctgt gtccatgttt 961 gggcatttac gtcacggcgg cagggccggg gcctcccaaa atggcagtgg cccggggagt 1021 cggaagcccg gagccagcgc cgccgcagct atataagtgg gggggctgtg ggctggggga 1081 gcccggcagc gctttggaga ggcgaggagc cgccgcccga ggccggtgcg ggcgagcgag 1141 ggcgccgcgg ctccccgact cctttcccag aggtgagtgc ccgaagccag gagcccggcg 1201 cccataggtc tgtgcgctgc ggggaacccc taccgccagc ctccccgcca cccgcgcgcc 1261 cccaagccca gcgggcgagg ccccgggcgc cccacagccg gcgccgcgcc atgctccacc 1321 ttagcgagtt ttccgaaccc gacgcgctcc tcgtcaagtc cactgaaggc tgttgcgccg 1381 aacccagcgc tgaattgccc cggctgcctg ccagggacgc tcccgcggcc accggctacc 1441 ctggaggtaa ggagggcgag caggggtgct cagacgacga cggcgcagcg cgggggcgca 1501 ccatacctga gaaccaggag ggactgggac attggagcta tgagaatagg ggcgatggga 1561 agcttaggag tcctggggtg cgcaccccca ttcccaccca cactgggccg ccagcgcctc 1621 cgcaggaacc tgtgcggtta tcggagcgcc cttttgcctg tgcacgtgtg ttttgcgtgt 1681 gcatgtttct atgtgctcct tgggacgtat gcgggcccct gctgaatcag aatgtgcaaa 1741 aggcactttg tgtatatccg tgggcaccaa gagttttgta ggtaggggct gtggactcag 1801 gtgcaccctt tgatgtgccc agagctgatt cctgctccct cctcagcagg cgacttcttg 1861 agctgggctt tgaacagctg cggcgcaagt ggggacttag ccgactcctg cttcctggag 1921 gggcctgcgc ccacaccccc tcccggcctc agctacagcg gtagcttctt cattcaggca 1981 gtgcccgaac acccgcacga cccggaggca ctcttcaacc tcatgtcggg cgtcttaggc 2041 ctggcaccct tccccggtcc agaggcagca gcgtccagat ccccgctgga tgcccctttt 2101 cctgcggggt ccgatgcctt gctgccgggt ccgccggacc tttactcccc ggatctgggc 2161 gctgcccctt tcccagaggc gttctgggag gcctcgcctt gcgcgggtgc cccctcgcag 2221 tgcctgtatg agcctcagct ctccccgccc gacgtcaagc ccggcctccg gcggcctccc 2281 gcctcgccag cgctggacgc tgtctctgcc ttcaagggtc cctacgcgcc ctgggagctg 2341 ctttctgtgg gggccccagg gaactgtggg tcacagggag 
actaccaggc cgccccggag 2401 gctcgttttc ccgtaatagg gaccaagatt gaggacttgc tgtccatcag ctgccctgcg 2461 gaactgccgg ccgtcccagc caacagactc tatcccagcg gggcctatga cgctttcccg 2521 ctggccccgg gtgacttagg ggagggggct gagggcctcc ctgggctcct gacccctcct 2581 agtggggagg gagggagtag cggcgacggc ggagagtttc tggccagtac gcagcctcag 2641 ctttccccgc tgggccttcg cagcgccgcc gcggcggact tccctaaacc tctggtggcg 2701 gacatccctg gaagcagtgg cgtggctgca ccacccgtgc cgccgccgcc gcccacccct 2761 ttcccccagg ccaaggcgcg acgcaagggg cgccgcggcg gcaaatgcag cacgcgctgc 2821 ttctgcccgc ggccgcacgc caaggccttc gcttgcccgg tggagagttg tgtgcggagc 2881 tttgcgcgct ccgacgagct caatcgccac ctgcgcatcc acacgggcca caaacccttc 2941 cagtgccgca tctgcctccg caacttcagc cgcagcgacc acctcaccag ccacgtgcgc 3001 acccacaccg gcgagaagcc ttttgcttgc gacgtgtgcg gccgccgctt cgcgcgcagc 3061 gatgagaaga aacggcacag caaggtgcac ctcaagcaga aggcgcgcgc cgaggagcgg 3121 ctcaagggcc tcggctttta ctcgctgggc ctctccttcg cttctctctg agcaagagat 3181 gggtttatgg gttggggcgc cgccgttcgg cgcgcacgag ttccgggccg ttcccctccc 3241 cgctcttctt ccaactcctc ctcgcacgcc cgagggccgg cctccgtccc gcttccagtt 3301 tccttgaagc gcccgccgca cacgccctat tcagcaccag ctcgccggac agttcccgcg 3361 gtccaggcgc tgtcaccctt gtcagccgcg ctttggggga agtcttctga gaccacccag 3421 tgaataggca ctaccctggg attcaagaca gtcttttgta actgcacacg ccccacgcct 3481 tcctctataa cccccagaga caggctgggg caggccaagg cggtctcgcg cgggactttg 3541 tacagcagtg tcttatccag cagccgattg gatgtaacgt tttgctttgg gttttttttc 3601 cttttgttgt tgttaatttt tgtaaagcag acgctactct caagcagttg acaaaactgt 3661 ttatttttgc aattaaaatt attgtgctaa aagcttactg aatctgccat gtaagctcct // LOCUS HSPCK1 10047 bp DNA PRI 02-DEC-1997 DEFINITION H.sapiens gene encoding mitochondrial phosphoenolpyruvate carboxykinase. ACCESSION Y11484 NID g2661751 KEYWORDS PCK1 gene; phosphoenolpyruvate carboxykinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 10047) AUTHORS Modaressi,S., Brechtel,K. and Jungermann,K. JOURNAL Unpublished REFERENCE 2 (bases 1 to 10047) AUTHORS Modaressi,S. TITLE Direct Submission JOURNAL Submitted (24-FEB-1997) S. Modaressi, University of Goettingen, Biochemie und Molekulare Zellbiologie, Humboldtallee 23, 37073 Goettingen, FRG FEATURES Location/Qualifiers source 1..10047 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /cell_type="lymphocyte" /map="q11.2-12/14" mRNA join(<1..73,2527..2772,3835..4019,4107..4310,4680..4867, 5187..5349,5625..5843,8381..8518,8788..8883,9138..9733) /gene="PCK1" gene 1..9733 /gene="PCK1" exon <1..73 /gene="PCK1" /number=1 CDS join(45..73,2527..2772,3835..4019,4107..4310,4680..4867, 5187..5349,5625..5843,8381..8518,8788..8883,9138..9592) /gene="PCK1" /EC_number="4.1.1.32" /codon_start=1 /product="phosphoenolpyruvate carboxykinase (GTP)" /db_xref="PID:e1198354" /db_xref="PID:g2661752" /translation="MAALYRPGLRLNWHGLSPLGWPSCRSIQTLRVLSGDLGQLPTGI RDFVEHSARLCQPEGIHICDGTEAENTATLTLLEQQGLIRKLPKYNNCWLARTDPKDV ARVESKTVIVTPSQRDTVPLPPGGARGQLGNWMSPADFQRAVDERFPGCMQGRTMYVL PFSMGPVGSPLSRIGVQLTDSAYVVASMRIMTRLGTPVLQALGDGDFVKCLHSVGQPL TGQGEPVSQWPCNPEKTLIGHVPDQREIISFGSGYGGNSLLGKKCFALRIASRLARDE GWLAEHMLILGITSPAGKKALCAAAFPSACGKTNLAMMRPALPGWKVECVGDDIAWMR FDSEGRLRAINPENGFFGVAPGTSATTNPNAMATIQSNTIFTNVAETSDGGVYWEGID QPLPPGVTVTSWLGKPWKPGDKEPCAHPNSRFCAPARQCPIMDPAWEAPEGVPIDAII FGGRRPKGVPLVYEAFNWRHGVFVGRAMRSESTAAAEHKGKIIMHDPFAMRPFFGYNF GHYLEHWLSMEGRKGAQLPRIFHVNWFRRDEAGHFLWPGFGENARVLDWICRRLEGED SARETPIGLVPKEGALDLSGLRAIDTTQLFSLPKDFWEQEVRDIRSYLTEQVNQDLPK EVLAELEALERRVHKM" intron 74..2526 /gene="PCK1" /number=1 exon 2527..2772 /gene="PCK1" /number=2 intron 2773..3834 /gene="PCK1" /number=2 exon 3835..4019 /gene="PCK1" /number=3 intron 4020..4106 /gene="PCK1" /number=3 exon 4107..4310 /gene="PCK1" /number=4 intron 4311..4679 /gene="PCK1" /number=4 exon 4680..4867 /gene="PCK1" /number=5 intron 4868..5186 /gene="PCK1" /number=5 exon 5187..5349 /gene="PCK1" 
/number=6 intron 5350..5624 /gene="PCK1" /number=6 exon 5625..5843 /gene="PCK1" /number=7 intron 5844..8380 /gene="PCK1" /number=7 exon 8381..8518 /gene="PCK1" /number=8 intron 8519..8787 /gene="PCK1" /number=8 exon 8788..8883 /gene="PCK1" /number=9 intron 8884..9137 /gene="PCK1" /number=9 exon 9138..9592 /gene="PCK1" /number=10 BASE COUNT 2337 a 2849 c 2494 g 2367 t ORIGIN 1 ctccgctcgg ttcctggcca ccccgcagcc cctgcccagg tgccatggcc gcattgtacc 61 gccctggcct gcggtgagtg acccccggcc cggggcccac ccgcaccttc cgctgcgctc 121 gccccctcgg ggctgccagt ggcgctctcc tgctctcagc ctccgccagg tttcccatcc 181 taggcggagg cgggcagggg cgactgctgt gggtccagcc tcccgcgccg cgcgtctctt 241 gggagggcag ccggccggtg ctcctcgttt ccgcctgcac ctccccttct ctgcctcgct 301 cgcctctgac cgcgcgatct ctatctgcca ctctcagaac ttcctctctc tcctcgctcc 361 tctctgctga gccaggtctc cgcatatcct cctttccttc ccaaatacct ccctcggacc 421 tctaacgggc tctcagccag cgccccaggg tacttcgaga ggcagcaggg ccctggggac 481 aagggtacgt gagccccggg agactaagct cagagccccc taaagaaggt ggaaggttaa 541 atatccattc ccggcctctc ccggactgga aggactggaa cctggcggga agtccagagc 601 agcccgaggg acctgggccc aggggaggga ggcaagcaag gtgggaggag ggcgccaagt 661 tgccttcgtt tcttacatag ctggcttctt cctccgtcca ggcctggagc ccccaggctc 721 gtcctgtttg tctgcctgtc ctcttagtct cctatttatt ctctgaggcc tctcttctca 781 gcttttgtcc cagagtcgga agtgacccac atctgtcgca cagcccgttc cacttgggca 841 gcccttgtgg gtggtctctg aaggaaacgt cccacttaga gggctgcaag agggtgtggg 901 ggcttcacaa gagataacgt gagccaggct ccagggagag agaggctgtc ctcaagactg 961 tgtgcttgaa aactgatgct cacggagaac ttccctctga ggcaggaaca gacccaggtc 1021 ccagtagccc tcctcccctg cccctggggc cacactgatc atctatcctg ctttagcgga 1081 aaccacccca gcttctaccc cagacagact caagctcccg tatccatgct ctgagctttc 1141 ttccttcccc aggctaacac cctctgagtc tgagctgcca gcaagctgct gttccaccct 1201 cccaccaaca ccaaagctct ctaggcatgt ggcctctagg aagaagagcc aggggaagca 1261 cggggtcacg tggtcctggg tgtgggggca gtttctgatg ggcgaggcct tgatagagga 1321 ggagagtaac atccccttca tggtctttgc tctctcgggt ttactccacc ttgagtccag 1381 
gccaatcaga gcagacgttg cttctctggc tcccagggcc atgagaggac agacaacagg 1441 acgctgacct cctgagaatt aagcccatga accccagcca gtgacactca ttccccagtg 1501 gtcaaccttc cgcagagttc agaaatactt acccgagggc aacattttat gcaaccattg 1561 ttggtccaag tgggcagcag cagatcaggg cctggaagcc cagcatccag tcacctattc 1621 tctgtgcaag agccctcatc tagaaacctg gcactggaaa gactgtgacc tttgcttggg 1681 gctttcatag tcttacagca tacacaccag aaggaaagaa taaacacagc tgccatttta 1741 atttataaaa aactatactt gaaaatggaa ataaaatgga tgaggcttca aataccagac 1801 atatgaaatt gtcacctggg cccaacttct tgtcttgaca cttgggccaa aggcccctac 1861 tcatttcttt tttttttttt tttttttttt tgagacagag tcttgctctg tcgcccaggt 1921 tggagtgcag tgtcccgatc tcggctcact gcaacctcca cctcccgggt tcaagcgatt 1981 ctcctgcctc agcctcctga gtagctggga ctagaggcac acgccaccac atccggctaa 2041 tttttatatt tttagtagat ggggtttcac catgttgccc agtatggtct tgatctcctg 2101 acctcatgat ccacctgcct cagcctccca aagtgctggg attacaggca tgagccaccg 2161 tgcccagcca tctccctact tatttctaaa cgttgattaa acagttaaac atgggcatgg 2221 gctacacagg cagtaacatc agcatgccta catgtatact tccacacagc caggcatgtg 2281 ccttttttgc tcatttggcc atatctgtcc ctcgctgagc aggagacacc ctcctcaagc 2341 ctcataaagg ctacaagata catgtgtcct gaacacatcc cacacaccaa ctgcaacctg 2401 ctcttcatgg tccctgcatg cagacatgtt ttagcaggct gcagcccaag ctttctgcct 2461 ctccaccacc tgccttgtcc actctcgatg acagcaacta gctcattgcc tctgtttctc 2521 ctataggctt aactggcatg ggctgagccc cttgggctgg ccatcatgcc gtagcatcca 2581 gaccctgcga gtgcttagtg gagatctggg ccagcttccc actggcattc gagattttgt 2641 agagcacagt gcccgcctgt gccaaccaga gggcatccac atctgtgatg gaactgaggc 2701 tgagaatact gccacactga ccctgctgga gcagcagggc ctcatccgaa agctccccaa 2761 gtacaataac tggtaagcct tgggctccac aacctgcagg ataggtgcac tgaggccact 2821 ttgggttcac caaggcaaaa tcaacttaac tagaacatcc caatggaatg aacaagaatg 2881 agagctttgg ggtaaacaga cccagaaact gggatttgct tacgcctata atcccagcac 2941 tttgagaggc caaggcgggt ggatcaccag gtgtcgggag tttgggacca gcctgaccaa 3001 catggagaaa ccccgtctct actgaaaaaa aaaaaaaaaa atacaaaaat tagcccagca 3061 tggcagcgca 
tgcctgtaat cccagctact tgggagggtg aggcaggaga atcacttaaa 3121 cccaagaggc ggaggttgca gtgagccaag atcgcgtcat tgcactccag cctgggcaat 3181 aagagcgaaa ctctgcctca aaaaaaaaaa gaaagaaact gggatttttt tttttttgag 3241 atggagtctc actgtatcac ccaggctgga gtgcagtggc atgatttcag ctcacaacaa 3301 cctctgcctc cggggaattg tctcaagcaa ttctcctgcc tcagcctccg gagtagctgg 3361 gattacaagc atgcgccacc acacccagct gatttttgta tttttagtag agacagggtt 3421 tcaccatgtt ggccaggttg gtctcaaact cctaacctca agtgatccac ccacctcagc 3481 ctcccaaagt gctgggatta caggcatgag ccactgcgcc cagcctagtt tgtagtttgt 3541 attttatttt aatgttaaat gaagaagctg atataaataa gatcctttgc tttttttttt 3601 tttcctcacc agttcaggga gcttttgcca ggggcagaga cccccagagg gctgggacct 3661 tggggaacac cccttagatg ggacaaagcc tggaggaagg gactgagatg tgattgggtg 3721 gggaaaaata aggccaacag aagacctgga gtcaaagttg gacttgaaaa agtgggtcta 3781 gggacaaggg aaacctgctg gccaccatct tcctgacaat cccctctccc ccagctggct 3841 ggcccgcaca gaccccaagg atgtggcacg agtagagagc aagacggtga ttgtaactcc 3901 ttctcagcgg gacacggtac cactcccgcc tggtggggcc cgtgggcagc tgggcaactg 3961 gatgtcccca gctgatttcc agcgagctgt ggatgagagg tttccaggct gcatgcaggg 4021 taaccagggc aggggcacag tggcaagggc acggaagatg tgaacaggtt tggaaccctt 4081 catccagggg atgccttcct ccacaggccg caccatgtat gtgcttccat tcagcatggg 4141 tcctgtgggc tccccgctgt cccgcatcgg ggtgcagctc actgactcgg cctatgtggt 4201 ggcaagcatg cgtattatga cccgactggg gacacctgtg cttcaggccc tgggagatgg 4261 tgactttgtc aagtgtctgc actccgtggg ccagcccctg acaggacaag gtaagcacct 4321 gctctgcccc aaggggaaca cagaggcctt cttgtactca gaggaaatcc caaatcctac 4381 ctctccacag accctaagaa cctgtcctct ctggcaacct aattcccaag atccagagca 4441 gcagtcccag cagagggata aggctgtgtt tgcagagcac tttgcactag gttgagaaaa 4501 atccgtgtcc aagaataggg gcatggaagc tgatggttat tatgaggtgg ggggcttcag 4561 ccacctcttg gtgctgctac tgctcccaag tgtctctcct gccaatccct gatccctctg 4621 gccccgacac cccagttcct gatgctgctg ccagcagccc atgaccccat tgtccccagg 4681 ggagccagtg agccagtggc cgtgcaaccc agagaaaacc ctgattggcc acgtgcccga 4741 ccagcgggag atcatctcct 
tcggcagcgg ctatggtggc aactccctgc tgggcaagaa 4801 gtgctttgcc ctacgcatcg cctctcggct ggcccgggat gagggctggc tggcagagca 4861 catgctggtg agggcctggt gagaagcagg gcagctgccg gggacagggc aggggtgggg 4921 cctggccagt ctgcctcagc ctcacctccc tctgccaggt gccaggctgg tgggcgggga 4981 ctctacttga aggcccaaag ctttggcctg agcgtgctga atgttgaggt ttcccctgcc 5041 actaacccag gcctgatggc agggcaatca cttatatagt taataaacat tggtcctccc 5101 tattagaccc tagctgccct tccccatgca gaccatgccc tgacttttgg tgacctcttt 5161 cttattccct ctctcccaat gcacagatcc tgggcatcac cagccctgca gggaagaagg 5221 cgctatgtgc agccgccttc cctagtgcct gtggcaagac caacctggct atgatgcggc 5281 ctgcactgcc aggctggaaa gtggagtgtg tgggggatga tattgcttgg atgaggtttg 5341 acagtgaagg tgagggactc tcagatcata ctcttggttc tggctcttgt cagagcctcg 5401 ggtctcctct ctagtgttca caatgacttt gtcagtgaga aagtttcctg aacacccaac 5461 cctactggat tcctctggca gcccagccac ccgagagaca gcctttcctt catcagatct 5521 tgggtccatc tctcaggaca ggggtgggtg gtgcaggacc ttctttggtc ttacatctca 5581 agttttcctt atttaatcct tcctttcttc acttctccta acaggtcgac tccgggccat 5641 caaccctgag aacggcttct ttggggttgc ccctggtacc tctgccacca ccaatcccaa 5701 cgccatggct acaatccaga gtaacactat ttttaccaat gtggctgaga ccagtgatgg 5761 tggcgtgtac tgggagggca ttgaccagcc tcttccacct ggtgttactg tgacctcctg 5821 gctgggcaaa ccctggaaac ctggtatgtg cggtggggaa ggtgtggcac agcctccagg 5881 cctcagcacc ttaatggtgg aaaagctttc tccacaacct ccaaccatct tctaggactg 5941 ccaggaggca cagaagtcat gaacgtttgc agtttccagt cccaggcaaa atctcagttc 6001 atgtcccaac tccaccagtc actggttttg tgatctggct aagttgctca acttccctaa 6061 gctttagttt ccacatcagt tgaatgaggg tagttgtgat agtacctatc tcatgagatt 6121 gttggaggat taaatagtgc ataaaaaggg tttagcacac tgacaaatac acagtaaatt 6181 ctcaataata aatacaggct ggattttttt ttaatgaaag gaaaaggaag gacttttgaa 6241 cattcttaca gaaggtattg ggctccaagc actatccata aagtttggcc cattaggaaa 6301 agaggaaagc tgcctcctct gctccaactc tcctcctgcc acttggctcc cactgtcccc 6361 tgtataacaa ccactgtcta aaggtcagta ttgttaccgt cacccttccc ctgtccctcc 6421 aaagcattca ccccaatcct tcctacaaac 
aaaatcaggt cagtgcttga gtctttccca 6481 gaagctagtt tctgaatcct gtcattaccc tgggcgcctg ggagtcccac ctctccctca 6541 gccctgcact ctggaccttc agtattcttt ccatggcctt ctgcagtcag gcagtccaga 6601 caccaagagg caggggcaaa gaagagcatg ggaggggagg ctggccttgt agtcactgaa 6661 gcctatattc aggtttgcca ggctggccta gcagtcaccc tccttgcttc atctaatcac 6721 cctttatttt tactaacacc atcattaagc ccccctcagc cttcccaccc aactgagaaa 6781 tccaagaaac tttcatcttt ccccacaggc tagttcccca accctttcat catctccaga 6841 tttgggggca taactagggc atcttgtccc cagcttcaat tcccagaata ataccctgtg 6901 ttaggattct gcactgggtg ctgaagaagg atggctctta tctgcaatgg cgggcagaag 6961 ctggcggatg ggagagggtg gggattttgg ccccgtggtt ccccactccc caggtctgac 7021 cagcaacctc cagcagagaa ggcaccatgt ccactcaggg ggcacacagt ggtgcttcat 7081 acatgtgcca ctgacttagt cccaaccccc ctccaggaca cctgaaggtg ccaagtgtga 7141 cctgggctcc tgaggttatc cctacccatg tgatatccct atctctattt tttccagccc 7201 tatcacttca tcaggtctaa acagggcagg gaaatcacca acatgttgtt agctttaaaa 7261 tcaattcctt gcagggcaca gtgactcaca tctgtaatcc cagcactttg ggaggccgag 7321 gcagttggat cacctgaggt caggagttcg agaccagcct ggcaacatgg caaaaccccg 7381 tctccaataa aatacaaaaa ttagccaggc atggtggctc atgcctgtaa tcccagctac 7441 tcaggaggca ggaaaattgc ttgatcccag gaggcagagg ttgcagtgag acaagatcat 7501 gccactgcac tccagcctgg tgacagagtg agactccgtc tcataactaa attaattaag 7561 taaataaaat caggccaggc gcagtggctc atgcctgtaa tcctactact ttgggaggcc 7621 aaggtgggca gatcagttaa ggtcaggagt ttgaaaccag cctggccaac atggtgaaac 7681 cccatctcta ttaaaaatac aaaaaaatca gccaggcatg gtggtgggtg cctgtaatcc 7741 cagctactgg ggaggctgag gtaggagaat tgtttgaacc tgggaggcgg aggttgcagt 7801 aagccaagat tgcaccactg cactacagcc tgggcaacag agcaagactc tgcctcgaaa 7861 taaaaagata aaataaattc ctatttgcat ttggataact taggagaacc tgtcttcccc 7921 ggtttgctga cggaaagtca attgtctgaa gtactaagct gacattctca gtttttgctt 7981 taggtttggg tattcattta aataataatc tcacaaataa tgaaatagtt tctgggggaa 8041 aaattattat aaccttatgc ccatatctaa ccccattccc ttgagccctg gtcagtgcca 8101 agtgccagta gcttggcaca aacattagtg ccctgccaaa 
ccccaattcc tctcccactc 8161 ttttctcaca tagctcagct ggccgcacct tcatggctaa acaacctgag ctcttggaga 8221 tgccctggct cccctctctc tgctccttat cacacaaggt tctaggcagc tgatgaggca 8281 aaaaaaaaaa aagaaccctg caagaatgtg tgcccatgta tgtgtgtgtt gggggtcgac 8341 atgaccttgg aaataatagt gtttgtattt cctctgccag gtgacaagga gccctgtgca 8401 catcccaact ctcgattttg tgccccggct cgccagtgcc ccatcatgga cccagcctgg 8461 gaggccccag agggtgtccc cattgacgcc atcatctttg gtggccgcag acccaaaggt 8521 aaacaacata tgagctccat gttcttggca aaagggctat ctctgtatta gggcctacct 8581 ccctccctct gatccagagc ctcagcctgg atctcacctt tctccagagt tctcccctgg 8641 tgaatgcaaa cttgggagga ggcaaagggt ctgaaaatgg gatagccgag gtcttaggag 8701 agagagtacc agtcaagctc accagaaggg ctggagttag ggtccaaaga aaagggctgc 8761 ctgtgactct gttcattggt gatctagggg tacccctggt atacgaggcc ttcaactggc 8821 gtcatggggt gtttgtgggc agagccatgc gctctgagtc cactgctgca gcagaacaca 8881 aaggtgagca ccctcaccat tcctccctct cctgtgtgtg cacacagcac gtcctctctc 8941 ccttcctgag ccagaccttc cttttgtcca cccctggagt ctgatatggc cccacctctt 9001 cccacttcta tcttttcccc atccctgaag atattcagaa ccataagcct ttcacagctt 9061 cctccaactg gatgcagggt gcccttccct accccagtga gaaggaagat tccttaccca 9121 tcttgcttcc cccccaggga agatcatcat gcacgaccca tttgccatgc ggcccttttt 9181 tggctacaac ttcgggcact acctggaaca ctggctgagc atggaagggc gcaagggggc 9241 ccagctgccc cgtatcttcc atgtcaactg gttccggcgt gacgaggcag ggcacttcct 9301 gtggccaggc tttggggaga atgctcgggt gctagactgg atctgccggc ggttagaggg 9361 ggaggacagt gcccgagaga cacccattgg gctggtgcca aaggaaggag ccttggatct 9421 cagcggcctc agagctatag acaccactca gctgttctcc ctccccaagg acttctggga 9481 acaggaggtt cgtgacattc ggagctacct gacagagcag gtcaaccagg atctgcccaa 9541 agaggtgttg gctgagcttg aggccctgga gagacgtgtg cacaaaatgt gacctgaggc 9601 cctagtctag caagaggaca tagcaccctc atctgggaat agggaaggca ccttgcagaa 9661 aatatgagca atttgatatt aactaacatc ttcaatgtgc catagacctt cccacaaaga 9721 ctgtccaata ataagagatg cttatctatt ttacacaaga tttgtgctgt tttcatttcc 9781 cacctatctt cacaggcttc cctctaacac ctgtctcaca atcatcttct 
tccagcccct 9841 agaagaagca cagcctggca caatcaaaga tctgttttac aggtagctct agcactgggt 9901 cacagacata ggaattgctg ggagaaggca ctatccactc tatgtcctga gttcttaaaa 9961 aaaaaaaaat ggtgaggctg ggtgtggtgg ttcacgcctg taatcccagc actttgggag 10021 ggtgaggggc acagatcacg aggtcag // LOCUS HSPEX 242825 bp DNA PRI 22-SEP-1997 DEFINITION H.sapiens PEX gene. ACCESSION Y10196 NID g1834504 KEYWORDS PEX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 242825) AUTHORS Francis,F., Strom,T.M., Hennig,S., Boeddrich,A., Lorenz,B., Brandau,O., Mohnike,K.L., Cagnoli,M., Steffens,C., Klages,S., Borzym,K., Pohl,T., Oudet,C., Econs,M.J., Rowe,P.S.N., Reinhardt,R., Meitinger,T. and Lehrach,H. TITLE Genomic organisation of the human PEX gene mutated in X-linked dominant hyposphosphatemic rickets JOURNAL Unpublished REFERENCE 2 (bases 1 to 242825) AUTHORS Hennig,S. TITLE Direct Submission JOURNAL Submitted (20-DEC-1996) S. Hennig, Max-Planck Institut fuer Molekulare, Genetik, Ihnestrasse 73, D- 14195 Berlin, FRG REFERENCE 3 (bases 1 to 242825) AUTHORS Francis,F., Strom,T.M., Hennig,S., Boeddrich,A., Lorenz,B., Brandau,O., Mohnike,K.L., Cagnoli,M., Steffens,C., Klages,S., Borzym,K., Pohl,T., Oudet,C., Econs,M.J., Rowe,P.S., Reinhardt,R., Meitinger,T. and Lehrach,H. TITLE Genomic organization of the human PEX gene mutated in X-linked dominant hypophosphatemic rickets JOURNAL Genome Res. 7 (6), 573-585 (1997) MEDLINE 97343325 COMMENT Source of the molecule sequenced : 1..41742 ICRFc104A0717 cosmid 41743..48745 LLXU24M23 cosmid 48746..86185 ICRFc104C0161 cosmid 86186..109335 ICRFc104C05100 cosmid 109336..149529 ICRFc104D1056 cosmid 149530..170606 ICRFc104D0142 cosmid 170607..178656 LLXU62D02 cosmid 178657..218362 ICRFc104H0865 cosmid 218363..242825 ICRFc104A0563 cosmid. 
FEATURES Location/Qualifiers source 1..242825 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="p22.1" repeat_region complement(365..507) /rpt_family="MIR" repeat_region complement(1650..1952) /rpt_family="AluY" repeat_region 2529..2601 /rpt_family="MER5A" repeat_region 2605..2906 /rpt_family="AluSx" repeat_region 2913..2982 /rpt_family="MER5A" repeat_region complement(2983..3286) /rpt_family="AluJb" repeat_region 3288..3349 /rpt_family="MER5A" repeat_region complement(4041..4866) /rpt_family="LTR1" repeat_region 4981..5086 /rpt_family="MIR" repeat_region 5589..5674 /rpt_family="MIR" repeat_region complement(6876..7150) /rpt_family="AluSc" repeat_region 7355..7652 /rpt_family="AluJb" repeat_region complement(8133..8321) /rpt_family="L1MA1" repeat_region complement(9030..9289) /rpt_family="AluSq" repeat_region complement(9289..9584) /rpt_family="AluSx" repeat_region complement(9612..9905) /rpt_family="AluSg" repeat_region complement(10721..11023) /rpt_family="AluSx" gene 12024..226970 /gene="PEX" exon <12024..12141 /gene="PEX" /number=1 CDS join(12024..12141,17487..17555,26068..26229,55406..55492, 56494..56720,69447..69515,73001..73117,75973..76056, 78024..78169,90485..90578,93476..93604,112540..112641, 147329..147406,157290..157393,169461..169519, 191921..191975,198053..198120,200630..200760, 205460..205525,206524..206628,224350..224426, 226868..226970) /gene="PEX" /codon_start=1 /db_xref="PID:e290125" /db_xref="PID:g1834505" /translation="MEAETGSSVETGKKANRGTRIALVVFVGGTLVLGTILFLVSQGL LSLQAKQEYCLKPECIEAAAAILSKVNLSVDPCDNFFRFACDGWISNNPIPEDMPSYG VYPWLRHNVDLKLKELLEKSISRRRDTEAIQKAKILYSSCMNEKAIEKADAKPLLHIL RHSPFRWPVLESNIGPEGVWSERKFSLLQTLATFRGQYSNSVFIRLYVSPDDKASNEH ILKLDQATLSLAVREDYLDNSTEAKSYRDALYKFMVDTAVLLGANSSRAEHDMKSVLR LEIKIAEIMIPHENRTSEAMYNKMNISELSAMIPQFDWLGYIKKVIDTRLYPHLKDIS PSENVVVRVPQYFKDLFRILGSERKKTIANYLVWRMVYSRIPNLSRRFQYRWLEFSRV IQGTTTLLPQWDKCVNFIESALPYVVGKMFVDVYFQEDKKEMMEELVEGVRWAFIDML EKENEWMDAGTKRKAKEKARAVLAKVGYPEFIMNDTHVNEDLKAIKFSEADYFGNVLQ 
TRKYLAQSDFFWLRKAVPKTEWFTNPTTVNAFYSASTNQIRFPAGELQKPFFWGTEYP RSLSYGAIGVIVGHEFTHGFDNNGRKYDKNGNLDPWWSTESEEKFKEKTKCMINQYSN YYWKKAGLNVKGKRTLGENIADNGGLREAFRAYRKWINDRRQGLEEPLLPGITFTNNQ LFFLSYAHVRCNSYRPEAAREQVQIGAHSPPQFRVNGAISNFEEFQKAFNCPPNSTMN RGMDSCRLW" intron 12142..17486 /gene="PEX" /number=1 repeat_region 13717..13859 /rpt_family="MER5A" repeat_region complement(15076..15377) /rpt_family="AluSx" repeat_region 15834..16134 /rpt_family="AluY" repeat_region complement(17060..17238) /rpt_family="MER5A" exon 17487..17555 /gene="PEX" /number=2 intron 17556..26067 /gene="PEX" /number=2 repeat_region complement(17995..18285) /rpt_family="AluSx" repeat_region 18803..18971 /rpt_family="AluSp" repeat_region complement(19264..19325) /rpt_family="MIR2" repeat_region 19380..19497 /rpt_family="FLAM_A" repeat_region complement(20326..20512) /rpt_family="MIR" repeat_region 21362..21517 /rpt_family="MIR" repeat_region 21521..21852 /rpt_family="AluSx" repeat_region complement(22406..22593) /rpt_family="MIR" repeat_region 23448..23748 /rpt_family="AluY" repeat_region complement(23882..24014) /rpt_family="AluJb" repeat_region complement(25271..25417) /rpt_family="MLT1G" repeat_region complement(25583..25884) /rpt_family="AluSx" exon 26068..26229 /gene="PEX" /number=3 intron 26230..55405 /gene="PEX" /number=3 repeat_region complement(28133..28431) /rpt_family="AluSq" repeat_region complement(28436..28545) /rpt_family="L1PA7" repeat_region 28609..28894 /rpt_family="AluSq" repeat_region 29314..29608 /rpt_family="AluSx" repeat_region 30268..30566 /rpt_family="AluSq" repeat_region complement(30597..30731) /rpt_family="MER42" repeat_region complement(30598..30700) /rpt_family="MER42" repeat_region complement(31244..31545) /rpt_family="AluSx" repeat_region 31650..31756 /rpt_family="MIR" repeat_region 31766..31970 /rpt_family="L1PB2" repeat_region 31983..32111 /rpt_family="MIR" repeat_region complement(33150..33452) /rpt_family="AluSq" repeat_region 33910..34212 /rpt_family="AluSq" repeat_region 
34252..34396 /rpt_family="L1MA4A" repeat_region complement(34582..34875) /rpt_family="AluSx" repeat_region 35537..35719 /rpt_family="AluSq" repeat_region 35720..35798 /rpt_family="AluSx" repeat_region 35904..36229 /rpt_family="MER33" repeat_region 36340..36626 /rpt_family="AluSx" repeat_region 36744..36868 /rpt_family="MIR2" repeat_region 37224..37289 /rpt_family="MIR" repeat_region 37395..37432 /rpt_family="MIR2" repeat_region complement(37824..38122) /rpt_family="AluSx" repeat_region 39028..39190 /rpt_family="AluJo" repeat_region complement(40658..40750) /rpt_family="MIR" repeat_region complement(40796..40849) /rpt_family="MIR" repeat_region complement(40850..41184) /rpt_family="THE1B" repeat_region complement(41209..41261) /rpt_family="THE1B" repeat_region complement(41264..41297) /rpt_family="MIR" repeat_region complement(41324..41628) /rpt_family="MER33" repeat_region complement(41649..41773) /rpt_family="MER5B" repeat_region complement(41779..42077) /rpt_family="AluSx" repeat_region complement(43164..43558) /rpt_family="L1PA15" repeat_region complement(43553..43610) /rpt_family="MER5A" repeat_region complement(44448..44741) /rpt_family="AluSc" repeat_region 45980..46279 /rpt_family="AluJo" repeat_region 46696..46842 /rpt_family="MER5A" repeat_region complement(46845..47155) /rpt_family="AluSq" repeat_region 47156..47206 /rpt_family="MER5A" repeat_region 47629..47745 /rpt_family="MIR" repeat_region 47867..48168 /rpt_family="AluS" repeat_region 49416..49708 /rpt_family="AluJo" repeat_region 50836..50962 /rpt_family="AluSq" repeat_region 50963..51264 /rpt_family="AluSx" repeat_region 51268..51448 /rpt_family="AluSx" repeat_region 51631..51738 /rpt_family="L1ME3" repeat_region complement(52266..52495) /rpt_family="MIR" repeat_region complement(52661..52955) /rpt_family="AluSg" repeat_region 53815..54109 /rpt_family="AluSx" exon 55406..55492 /gene="PEX" /number=4 intron 55493..56493 /gene="PEX" /number=4 exon 56494..56720 /gene="PEX" /number=5 intron 56721..69446 
/gene="PEX" /number=5 repeat_region 58823..58976 /rpt_family="MER42c" repeat_region complement(61147..61236) /rpt_family="L1MA4" repeat_region complement(61230..61626) /rpt_family="L1PA3" repeat_region complement(61477..61801) /rpt_family="L1" repeat_region 61796..62288 /rpt_family="L1" repeat_region complement(62287..62685) /rpt_family="L1" repeat_region complement(62695..63148) /rpt_family="L1" repeat_region complement(63180..63303) /rpt_family="FLAM_A" repeat_region complement(63318..64306) /rpt_family="L1" repeat_region 64311..64611 /rpt_family="AluSq" repeat_region complement(64612..65016) /rpt_family="L1" repeat_region complement(66917..67180) /rpt_family="L1ME1" repeat_region complement(67129..67719) /rpt_family="L1MD2" repeat_region complement(67572..68043) /rpt_family="L1" repeat_region complement(68044..68339) /rpt_family="AluSx" repeat_region complement(68437..68733) /rpt_family="AluSq" exon 69447..69515 /gene="PEX" /number=6 intron 69516..73000 /gene="PEX" /number=6 repeat_region 69868..70005 /rpt_family="MER5B" repeat_region complement(70738..71092) /rpt_family="THE1B" repeat_region complement(71391..71690) /rpt_family="AluSx" repeat_region complement(71766..72067) /rpt_family="AluSg" repeat_region complement(72094..72164) /rpt_family="MIR2" exon 73001..73117 /gene="PEX" /number=7 intron 73118..75972 /gene="PEX" /number=7 exon 75973..76056 /gene="PEX" /number=8 intron 76057..78023 /gene="PEX" /number=8 repeat_region complement(76785..77074) /rpt_family="AluSx" repeat_region 77523..77833 /rpt_family="AluSx" exon 78024..78169 /gene="PEX" /number=9 intron 78170..90484 /gene="PEX" /number=9 repeat_region 79205..79357 /rpt_family="MER5A" repeat_region 79908..80213 /rpt_family="AluSx" repeat_region complement(80981..81514) /rpt_family="L1MA2" repeat_region complement(81986..82106) /rpt_family="MIR" repeat_region complement(82333..83179) /rpt_family="L1PB2" repeat_region 84127..84225 /rpt_family="MER5A" repeat_region complement(84236..84282) 
/rpt_family="MER5B" repeat_region complement(84352..84534) /rpt_family="MER" repeat_region 84692..84799 /rpt_family="MIR" repeat_region 84840..84954 /rpt_family="MER46" repeat_region 85507..85814 /rpt_family="AluJo" repeat_region complement(85916..86065) /rpt_family="MER5B" repeat_region 86086..86185 /rpt_family="MIR" repeat_region 86858..86922 /rpt_family="MIR2" repeat_region 87342..87533 /rpt_family="L1PB2" repeat_region 87687..87977 /rpt_family="AluSx" repeat_region complement(88939..89053) /rpt_family="MIR" repeat_region complement(89156..89423) /rpt_family="MER6" repeat_region complement(89483..89632) /rpt_family="L1PA5" repeat_region 89630..89797 /rpt_family="L1PA4" repeat_region complement(89808..90153) /rpt_family="MER6" exon 90485..90578 /gene="PEX" /number=10 intron 90579..93475 /gene="PEX" /number=10 repeat_region complement(91165..91297) /rpt_family="FLAM_A" repeat_region 91643..91939 /rpt_family="AluSx" repeat_region complement(92966..93267) /rpt_family="AluSx" exon 93476..93604 /gene="PEX" /number=11 intron 93605..112539 /gene="PEX" /number=11 repeat_region complement(94004..94063) /rpt_family="L1MA9" repeat_region 94064..94362 /rpt_family="AluSx" repeat_region complement(94363..95337) /rpt_family="L1MA9" repeat_region complement(95730..96357) /rpt_family="PABL" repeat_region 97163..97200 /rpt_family="MER5A" repeat_region complement(97221..97313) /rpt_family="MER5A" repeat_region complement(98085..98167) /rpt_family="AluSg" repeat_region complement(98609..98914) /rpt_family="AluSx" repeat_region complement(99106..99319) /rpt_family="MIR" repeat_region complement(100144..100401) /rpt_family="MER3" repeat_region complement(102006..102296) /rpt_family="AluJ" repeat_region 102399..102522 /rpt_family="MIR" repeat_region complement(102847..103147) /rpt_family="AluY" repeat_region complement(105887..106188) /rpt_family="AluS" repeat_region complement(107074..107379) /rpt_family="AluS" repeat_region 107831..108388 /rpt_family="MER44B" repeat_region 
109275..109585 /rpt_family="AluSx" repeat_region 109801..109886 /rpt_family="MIR" repeat_region 109906..110030 /rpt_family="MER5B" repeat_region complement(110057..110356) /rpt_family="AluSx" repeat_region 110359..110418 /rpt_family="MER5B" repeat_region complement(111116..111424) /rpt_family="AluJo" repeat_region complement(111898..112201) /rpt_family="AluJb" repeat_region complement(112202..112337) /rpt_family="AluJb" exon 112540..112641 /gene="PEX" /number=12 intron 112642..147328 /gene="PEX" /number=12 repeat_region 113326..113629 /rpt_family="AluSx" repeat_region complement(113768..113921) /rpt_family="MER5A" repeat_region 116630..116793 /rpt_family="MER5A" repeat_region 118017..118252 /rpt_family="MIR" repeat_region complement(118496..118649) /rpt_family="AluJo" repeat_region complement(118653..118847) /rpt_family="AluY" repeat_region complement(118967..119143) /rpt_family="MER5A" repeat_region complement(119497..119797) /rpt_family="AluSx" repeat_region complement(119872..120081) /rpt_family="MIR" repeat_region 120082..120266 /rpt_family="MER5A" repeat_region 120986..121286 /rpt_family="AluSx" repeat_region complement(122139..122200) /rpt_family="L1MB5" repeat_region 122185..122456 /rpt_family="L1MA4A" repeat_region complement(122459..122771) /rpt_family="L1MB5" repeat_region 123043..123196 /rpt_family="MIR" repeat_region complement(124022..124170) /rpt_family="MER42c" repeat_region 124208..124342 /rpt_family="MER5B" repeat_region 124346..124649 /rpt_family="AluSx" repeat_region 124654..124693 /rpt_family="MER5B" repeat_region complement(124955..125138) /rpt_family="MER42c" repeat_region 125554..125853 /rpt_family="AluSx" repeat_region complement(127031..127203) /rpt_family="MLT1F" repeat_region complement(127370..127783) /rpt_family="L1MA7" repeat_region complement(127642..127844) /rpt_family="L1ME1" repeat_region 128482..128786 /rpt_family="AluSq" repeat_region complement(129169..129214) /rpt_family="MER5B" repeat_region 129260..129293 /rpt_family="MER5A" 
repeat_region complement(129288..129392) /rpt_family="MER5A" repeat_region 131173..131352 /rpt_family="MIR" repeat_region 131596..131916 /rpt_family="L1ME3" repeat_region complement(132011..132300) /rpt_family="AluJo" repeat_region 133167..133204 /rpt_family="L1MA1" repeat_region 133457..133672 /rpt_family="MER20" repeat_region 133694..133796 /rpt_family="MER5A" repeat_region complement(133945..134228) /rpt_family="AluSx" repeat_region complement(134260..134303) /rpt_family="MIR" repeat_region 135040..135172 /rpt_family="MER34" repeat_region complement(135890..136198) /rpt_family="AluSx" repeat_region 136228..136357 /rpt_family="AluSx" repeat_region 138386..138537 /rpt_family="BC200" repeat_region complement(138889..138940) /rpt_family="MIR" repeat_region complement(139100..139283) /rpt_family="THE1B" repeat_region 139284..139586 /rpt_family="AluSq" repeat_region complement(139597..139789) /rpt_family="THE1B" repeat_region 139870..140163 /rpt_family="AluSx" repeat_region 140279..140458 /rpt_family="AluSg" repeat_region 141969..142168 /rpt_family="MER20" repeat_region complement(142451..142489) /rpt_family="MIR" repeat_region complement(145548..146498) /rpt_family="L1MA8" repeat_region complement(146507..146811) /rpt_family="AluJo" repeat_region complement(146812..146973) /rpt_family="L1" exon 147329..147406 /gene="PEX" /number=13 intron 147407..157289 /gene="PEX" /number=13 repeat_region complement(148431..148610) /rpt_family="IR" repeat_region 148747..148968 /rpt_family="MIR" repeat_region 149348..149600 /rpt_family="MIR" repeat_region complement(150094..150450) /rpt_family="THE1C" repeat_region 152518..152597 /rpt_family="MADE1" repeat_region complement(154366..154667) /rpt_family="AluSx" repeat_region 155128..155420 /rpt_family="AluSx" repeat_region complement(155812..155917) /rpt_family="L1ME3A" exon 157290..157393 /gene="PEX" /number=14 intron 157394..169460 /gene="PEX" /number=15 repeat_region complement(157998..158115) /rpt_family="MIR" repeat_region 
complement(158348..158689) /rpt_family="L1PB1" repeat_region complement(159507..160447) /rpt_family="L1MA3" repeat_region complement(160412..161364) /rpt_family="L1" repeat_region complement(164655..164764) /rpt_family="MLT1F" repeat_region complement(164774..165072) /rpt_family="AluSx" repeat_region complement(165079..165135) /rpt_family="MLT1F" repeat_region complement(165303..165460) /rpt_family="MLT1G" repeat_region complement(166064..166342) /rpt_family="MLT1E" repeat_region complement(166379..166653) /rpt_family="MLT1D" repeat_region 166956..167065 /rpt_family="MIR" repeat_region complement(167299..167373) /rpt_family="AluJo" repeat_region complement(167635..168370) /rpt_family="L1MA3" repeat_region 168473..168606 /rpt_family="AluSq" exon 169461..169519 /gene="PEX" /number=15 intron 169520..191920 /gene="PEX" /number=15 repeat_region complement(170036..170344) /rpt_family="AluSx" repeat_region 170365..170515 /rpt_family="L1ME1" repeat_region 170549..170730 /rpt_family="MER5A" repeat_region 170754..170926 /rpt_family="L1ME1" repeat_region complement(172516..172966) /rpt_family="L1ME3" repeat_region 174329..174628 /rpt_family="AluSx" repeat_region 174678..174908 /rpt_family="L1ME3" repeat_region 174912..175210 /rpt_family="AluJo" repeat_region 176286..176760 /rpt_family="MLT1F" repeat_region 176824..177015 /rpt_family="MIR" repeat_region 177258..177351 /rpt_family="MIR" repeat_region complement(177815..178177) /rpt_family="THE1B" repeat_region complement(178388..178888) /rpt_family="MER44C" repeat_region complement(178910..178988) /rpt_family="MER44A" repeat_region complement(179084..179261) /rpt_family="MIR" repeat_region 179344..179643 /rpt_family="AluSx" repeat_region complement(180194..180499) /rpt_family="AluSx" repeat_region complement(181257..181526) /rpt_family="L1ME3" repeat_region 181873..182044 /rpt_family="MER5A" repeat_region complement(182944..183108) /rpt_family="MER5B" repeat_region complement(183178..183279) /rpt_family="MER5A" repeat_region
complement(185372..185512) /rpt_family="MIR2" repeat_region complement(186762..186904) /rpt_family="MER33" repeat_region complement(186915..187082) /rpt_family="L1PA6" repeat_region complement(187432..187554) /rpt_family="L1MB6" repeat_region 187585..187765 /rpt_family="MER5A" repeat_region 190230..190285 /rpt_family="MER45" repeat_region complement(190706..191059) /rpt_family="THE1B" repeat_region 191487..191601 /rpt_family="MER45" exon 191921..191975 /gene="PEX" /number=16 intron 191976..198052 /gene="PEX" /number=16 repeat_region complement(193014..193499) /rpt_family="MLT1D" repeat_region complement(194702..194786) /rpt_family="L1MA2" repeat_region complement(195534..195836) /rpt_family="AluSp" repeat_region complement(197243..197380) /rpt_family="MLT1G" exon 198053..198120 /gene="PEX" /number=17 intron 198121..200629 /gene="PEX" /number=17 repeat_region 198197..198295 /rpt_family="U4B" repeat_region complement(198635..198933) /rpt_family="AluSc" repeat_region 199644..199720 /rpt_family="MIR2" exon 200630..200760 /gene="PEX" /number=18 intron 200761..205459 /gene="PEX" /number=18 repeat_region 200835..200950 /rpt_family="MER5A" repeat_region 201376..201599 /rpt_family="MIR" repeat_region 202480..202764 /rpt_family="AluJo" repeat_region 202871..202928 /rpt_family="MIR" repeat_region complement(203012..203299) /rpt_family="AluSx" repeat_region complement(203900..204340) /rpt_family="MLT1D" repeat_region 203970..203994 /rpt_type=TANDEM repeat_region complement(204411..204476) /rpt_family="MLT1D" exon 205460..205525 /gene="PEX" /number=19 intron 205526..206523 /gene="PEX" /number=19 exon 206524..206628 /gene="PEX" /number=20 intron 206629..224349 /gene="PEX" /number=20 repeat_region complement(207758..208657) /rpt_family="L1PA7" repeat_region complement(208513..213376) /rpt_family="L1" repeat_region complement(213455..213688) /rpt_family="L1" repeat_region complement(213997..214317) /rpt_family="L1" repeat_region complement(214377..214731) /rpt_family="MLT1A1" 
repeat_region 217032..217158 /rpt_family="AluSq" repeat_region 217160..217736 /rpt_family="L1" repeat_region 217837..222706 /rpt_family="L1" repeat_region 222562..223454 /rpt_family="L1PA2" repeat_region 223457..223640 /rpt_family="AluSx" exon 224350..224426 /gene="PEX" /number=21 intron 224427..226867 /gene="PEX" /number=21 repeat_region 226281..226588 /rpt_family="L1ME3" exon 226868..>226970 /gene="PEX" /number=22 repeat_region 231337..231636 /rpt_family="AluY" repeat_region complement(233426..233550) /rpt_family="MIR" repeat_region complement(233820..234020) /rpt_family="MIR" repeat_region complement(234303..234767) /rpt_family="L1MA8" repeat_region complement(234769..235068) /rpt_family="AluY" repeat_region complement(235069..235405) /rpt_family="L1MA4A" repeat_region 236866..237176 /rpt_family="AluJo" repeat_region 237215..237287 /rpt_family="MIR2" repeat_region complement(237290..237429) /rpt_family="MIR2" repeat_region complement(238181..238479) /rpt_family="AluSx" repeat_region 238979..239277 /rpt_family="AluJb" repeat_region 239402..239886 /rpt_family="L1PA7" repeat_region 240417..241212 /rpt_family="L1ME3A" BASE COUNT 68803 a 47704 c 50274 g 76044 t ORIGIN 1 cccacctcgg cctcccaaag ctgatgtttt cttaaattgt ggtaaaaaac acttcccttc 61 cctacctctt ctctcctctt ctccccccaa ctcctctctt ctcccatccc ctctcctttc 121 atttcctctt ctttccatta tttcttcatc cctccctctt ttcctccccc agtaagtctg 181 gactgaatac ctcccatagg catagacagc cccaaataag aattttaaaa cacaaacaat 241 ggagacatct ataaggcata tctatgggtg gcaaacatga aatattaaga aacagaataa 301 taataatgcc tgccacttca tcagaaacca ttttaagcac ttcataacta ttataatcat 361 agtttattat tatctcccat tgccagatga caaaattgag accctgagag attaggcaat 421 tttttcaagg tcacatagca gtaagtagca atgccaggaa tccaatccag ggagtttgtt 481 cccaaatacc aggtatttaa ccactgtcac actgcactat cctgcctggc agcgtggaaa 541 catgtatcaa cgtgaatcaa cgtatatcaa cgtgaatggt gcagttagca gttacaagaa 601 aaagagaagg gagctctgtt agtgttatgt gaggaaagca gaaaccactc taggtatttc 661 aaacaagtgt atttaatgta gggaattggt ttcaaaggta 
gtggaaaggc agaaggaaca 721 aaggagagga tgttacccag agttcagtga ctgcaaaagg ccgctactgc ccttaagact 781 agagaagcaa acagggaaaa ggaatgtggc ctaaagtcca catcacactc acctcggagg 841 ctgctgctgc tactgctgag tgggactcta ggagccacaa tcccattgat gctgccagag 901 tctgcagtca tctgcagcca tgtgcccctg cagctgtcac atttgcggtc tttatgtagg 961 atcccggagt ttcctcctgt ggccctggat aaacccagga tcagtaaaga aaaaataaac 1021 caggcttcta cattcctcct gtttttgaat ttcctattaa tgccttccct tgacagagtc 1081 taacaggaac tcagctggca agggagcctg gaaaatgtag ttcgtgggct tccagctgcc 1141 aaccatacac tgaacacact agaagatcac agtggggctg aaagccaaag cgtattctac 1201 atgctcagtg ttgcataaga aaacgtggat acagggaaga aaaaaacatt taaccgaatc 1261 taagattcaa tcgattgtaa gattgtaaga tgtatcataa taccaccata ataataccat 1321 cgagaaagaa aaataccgtc aaattatgcc acatcaccac ttacgaggta cattctactt 1381 ctagaaatgt taaagcttga aaaaaagtcc ctgttaaaat tgataagata cagtaaatgt 1441 ctgatctcaa cttacatgta gtctaatggg gcagacttct gcagagacta ttgtaataca 1501 acctgacatc ctgtgaccct gtgaccttac taagaagagg ggtctgggaa gattttctcg 1561 aggaggtaaa atctgagctg agtcttaaaa gatgagcccg agtttcctaa agaaatggtt 1621 ttccaggcaa aaggagaaat gttttctttt tttttttaat tttttttttt tgagatggag 1681 tctcgctctg acgcccaggc tggagtgtag tggcgcaatc ttggctcact gccagctccg 1741 cctcccgggt tcacgccatt ctcctgcctc agcctcccga gtagctggga ctacaggtgc 1801 ccgccaccac gcccggctaa ttttttgtat ttttagtaga gacggggttt cacggtgttt 1861 tccaggatgg tctctatctc ctgacctcgt gatccgcccg tctcggcctc ccaaagtgct 1921 gggattagag gcctgagcca ccacgcccgg ccgagaaata ttttcaaagc gcaaaaggct 1981 gggtaaagtt tagtgcattt gagaacttga aggggtttag aatgactggt gtgtagggtg 2041 taaattgaga agccgtgcac aggtgagcct ggaggattgc gtgggctcaa tcatgcagga 2101 actcaatgtg aagtattttt acattttcct aaaaactgac agcagccact gaaggatttt 2161 aagaacatca aaagtccatt tgaaatttga ctcaggactg caatttagtt ggttcatacc 2221 agtgtggctg ctgctgttgt gaaatagtga aatatctcct tactagctgt ggatggctgc 2281 ccctctggac cccctgcagc ctcctccaat ccaccccctc tcctagaacc ctccagatcc 2341 agcaactaag atagtacagt aggggccacc gaaactggcc ccagtcccct 
ggtcagtgcc 2401 atgccatggt ggttgttaaa tattttgaat ggcatcactg ttttcactac ataagtttaa 2461 tgtttagttt tgttggttct tcagtccact ttaccacatt gcagagagat ttaatggttg 2521 ccctacacca gtggttctta caatgtggtc cccagactag cagcatcagc atctcccaag 2581 agctggttag aaattaaaat ttcaggccgg gtgcagtggc tcacgcctgt aatcccagca 2641 ctttgggagg ccgaagcagg tggatcacca gaggtcagga gttcaagacc aacctggcca 2701 acatggtgaa accccgtctc tcctaaaaat acaaaaatta cccagacgtg atggtgggcg 2761 cctgtaatca cagctacttg ggaggctgag gaaagagaat cgcttgaacc caggaggcag 2821 aggttgcagt gagccgagat tgtggcactg cactccagcc tgggtgacag agcaagactc 2881 cgtctcaaaa aataaataaa taaaaaataa aaagaaattt aaatttcagc tcatctcaga 2941 cctacagaat cagaaactgt agggatggag cctagcaatc tgttttttaa tttaatttaa 3001 ttttgagaca gggtctcatt ctgtcaccca ggctggagtg cagtggcaca atcaaggctc 3061 actgcagcct cgacctcccg ggctcaggca gtcctcccac ctcagcctcc tgagcagctg 3121 ggactacagg agctcaccac cacacccagc tagtttttgt attttttgta gagatggggc 3181 ttcatcatgt tgcttaggct ggtctcaaac tcttggtctc aagccatcta cctgcctcgg 3241 cctcccagag tgctaggatt acaggtgtga gccaccatgc ctggccacaa tctgtgtttt 3301 aaccagccct ccgggagatt ctggtgccca ctcaaatttg agacctgctt ctctaagccc 3361 cttgtcctca taaacatcta ggtaagccat ccttgaacaa ttggttgtta ttaatcaagg 3421 gcatacaaac ttctataggt taatagtcta accaaggagt agggtgagtg tgaccagttt 3481 gaggatacag caagtaaacg ctagttggat gaatagttat ttcattttaa aattgttatg 3541 cagtggagtg ctttggaaaa ggtattaata ttgtgggttt gggagctgta tcaatcatgt 3601 tttttatttt ctatatttta tgatgtgttg acatcttggt gccttgcaga cccagggaag 3661 gtctgtccct cccagggtta gctaattcct agagatagta actgacttgc ctgtgagcat 3721 gcctttgata tgcaaatcaa ccaatccacg ttcatatacc ccccacttcc ttttaccaga 3781 ctcctgcagt caaggacact atacccctac ctttccaccc taaatcatcc cagggctagg 3841 tacgagagtg ctagagacca cctctaggga ccagagcctg ctgaaattat tgatgctatc 3901 cacccaatcc taaacctgct cagctgctta ccctgccttg cccattcctt tctttgaaaa 3961 ccacaataaa gtgcctatgg ttttcccttg ctccttctgc tccttgatgc tcccacatat 4021 ggccctgtgt ggcaaggcat tgaaatgaga gagttccctg attcccctcg cagggtatga 
4081 gacagtgttg tggctcgcct gtttggttgc cccccgcagc tcagacccct tacgggaggg 4141 ggagcatgca gatgagcagg tgcaggaacc ggggtgagca cttttgggct ctggccccat 4201 ggcagcatct atgggtgggt gtttgtgact cccgaagccc aagtgggcat gtgttacagt 4261 gtgctccttt agctttgcca tctgtagatg gcttgtgtgt taatcagctc aatggaccct 4321 gtgccttatc acaagggcag ggggcgagtg tgacagcctt ctgtatccca agctcttcct 4381 cagtgtccca aaagaatcgg atcacatgtg ggcttgaagg atgagtgcag ggttttactg 4441 agaggtggag gtggctctca gcgaaatgga tgcggagcca gaagggggga tggagtggga 4501 aggtggtctt cccctggagt cggggtgcct agcaaccaga ctcttctttg actgcccctg 4561 gccgaactcc ccttggcgtc cagacgtccc tcctcttctc tctctgctgc attgttctgc 4621 cgtcactggt cagctggtcc gcttgcctcc tcctctcttt gcttggggtt ctcttctgga 4681 gcttggggtt cagggtttat atggggccag ggtagggggt gtggtggacc aaaaggcaac 4741 tttttgggcg tgaaaacaga aatgcctgtc ctcattaggg cctctggtct tcatgcttga 4801 gggcggggac tttgccaggg aaccgccctc ttctacccag tatttccctg tctcttgtcc 4861 gtatcagcgt gccctctcat cttccaaact gtgagtaata aacccttttc aagggctgtc 4921 tccatgtttg ttgtcttaac cataccagat taaagcaaat tccaagtgca tttcaacaca 4981 ggagccacat cgatttgagt tccaactggg ctctgtcatt tacccatcac gtgaccttcg 5041 gaatattact tatcctcagc tggctccagt gtccttatct gcaaaagatt cctgtattgt 5101 aggactgctc taaggactga tgataataca cataaggtga gcaggcactg aggaaatagt 5161 agctgtgtgg gaaggaggtg agaaaaaatc aatggataaa atccccttta aggctcagag 5221 atgaacaagt agaacacatg gatagatagc ctctcagatg cgtatctcct tcctaatgtt 5281 caattactag cagcaacaca tcaggagata gttcatgtac tgctattttt gttcctcaac 5341 tgaaaaatgt accccccacc ccccaaatta aatgatggct gctgttcctg ttaaggaaat 5401 aaagcctgat aattactctt ttaaacaaga gaattaaagg aagcatagtt tatcgctttg 5461 cctctgctca agtgtcactt tgagatgcta cacctgcctt cactaattac ccgataagca 5521 aacttatcaa gaaccacagc agagagacaa gaggttggga tttgaaacca ttctggcctt 5581 aagacaaata ctggccctgt gacctttcgc aaatcactta atagcttcaa acttcagttt 5641 ccttatctgt aaaaggagat aagaacacct acctaaggaa aatgtttaaa aagatcagat 5701 acaatgtaag taatgtatat cgttagtgaa aggcctggga tttgaactcc tccagtctga 5761 
aggacctgcc aataagtaac agagcccacc acaatcccca gatctgagct ttcactattt 5821 attttaatag ctcttctctc cctctcctcc cttccttctt ctttcctccc tcttccttcc 5881 aatggatgga tgaaaacatt tttcagaatt tgaatcactt ccttctttct cactaaaatt 5941 atggcccgca tgggatttta ctgcttgaaa cacaggaccc tgcacatcca aataatgacc 6001 agatgcactc atgtctaaca taaagacagc ttcagctgtc atcccggcca gttagcagct 6061 tgggctgttg ttgctggatt ccgttacctc ggataaatca ctgtacctgt gagtcctgca 6121 gggagagcag taaatgccca ggcaagtccc ttcttggggt ggggctcggt ccatctggaa 6181 gccttcctct tgtcctgcaa gctgcccatt ctcgcaagca ctccttctgg aagtcttccc 6241 aatgaatcta ccgctctggg atttctgctt ctctgggcac tgactgccct tatggtcagg 6301 cctgcgtgct ttagggtcat gtggccaagt tgggtttcct atacagtatt ttcatccctg 6361 catcaagttg accagtccag aacctccggc atgggttctt ggtcttgctt cataagggtt 6421 tgcacagggc taggcatctg agaaatgctc aagaaacttc cgttgattca ttgattgatc 6481 cagtgaatgt ccaaggtagt cctgaatgtt cttaattgaa ataattgggt gggtgtagag 6541 caggctatta acagacttac cttgcataat gacttttcaa tgaactaatt ttcacatctc 6601 tagaatagtt gtaaaaattt ttagaataac gccccctccc cttttttttc cagaataatg 6661 acttctcact cccaaacaca ctaaatcgta tgctaaatca taatattgaa atcatagttt 6721 caatattaca gtttcaggca aagacacatt caccagtttc cctgacacaa cagtttctgt 6781 tcaaagaatc tcgaatggaa caattttgag ttacaggtca agtgtggata aacattttat 6841 tttgtatgcc atgtagatgg ggtgaagagg tagcattttt tttttttttt tgtcatagtc 6901 ttactctgtc accaggctgg agtgcagtgg tgtaatctca gctcactgca acttccgcct 6961 cccaggttca agggattctc ctgcctcagc ctcccgagta gctgggacta caggcgcgca 7021 ccaccacacc tggctaatgt ttgtattttt agtagaggtt tcaccctggt cttgatctcc 7081 tgagctcatg atccacccac cctggcctcc caaagtgctg ggattacagg tgtgagccgc 7141 cgctcccggc tagaggtagc atcttttact actctttagt gcagaggttg ctaaactttt 7201 cctgtcaaga gccagagagt aaatatccta agtttcaaag gccaaaatgt ctctgttgca 7261 aaactactta actttgctct tgcagcatga aagcaactat gcacaatcta caaacaaata 7321 ggtgtgacct tgttccaata aaactttatt tacaggccgg gcatgatggt tcacacctgt 7381 aatcccagca ctttgagagg tcgaggcatg tggatcgctt gagcccagga gttcgagacc 7441 agcttgggga 
acatggcaaa aatccatctt tactaaaaat acaagaatta gctgggcgtg 7501 gtggcgcacc acctgtagtc ccagctattc gagaggatga ggcaggagaa tcgcttgaac 7561 ctgggaggtg gaggttgcag tgagccgaga ttgcgccact gcactccagc ctgggcaaca 7621 gagcgagacc ctgtctcaaa acaaaacaaa aaggtgaaaa acaaaaaaag ctttatttac 7681 aaaaacaggc agtgggccaa atttggccca tgtactcttg tgtacagacc cctgctttag 7741 tcctgtgagc acctattatt tccctgtgct ttgaacttct ttattttgcc ctaaagggtt 7801 ttttttcttt tctttttttt ttccgtgtta ctttaagtgt ggtttatttt gagggcaaga 7861 taatcatcat ctcctaaaag gctgaaaaat taatgcagaa aatggcacag aatgaacaat 7921 gtaccttgta atgaatgatt ctcccacatc ctttcctctg tcagtcccgc cactctcaag 7981 ttttccttta ttcctctggt gaataaaaga aaccttcatt acattacatt acacaaatta 8041 tgcttatttt caatattttg acagttctat attactctta cagacttgtt ttcattgttc 8101 aagtaataca ttttaaaatt acttttttta atttttaatt tttgtgggta catattaggt 8161 gtatatattt gtgggtacat gagttgtttc gatacaggca tgcaatgtga aataatcaca 8221 tcgtggtgaa tgtggtatcc atcccctcaa gcatttatcc ttagtgttac aaacaatcca 8281 atggcagtct ttcagttatt tttaaattta caattaagtt aaatgaatac atttttctat 8341 ctgtgtttgc atatcttagt ttgggataga atgcatcata attttcctat tacaattaat 8401 gaaagtattt tttttggttt aatggctttt cactcagcag ttgaccgttc aggaatgaat 8461 tcctgtcggt aagtgatggg taggtataac ttcagggaat ccccaaattg ccctcacctc 8521 tgggggaatt gcaagttcaa ggcaggtgaa ggctactcca gctaggaact cacggccact 8581 attcaatgtc agtggggaca ctgaaagttg tgaggagatt gagaggagga aggaaagagg 8641 aaagaagtga gtcctggtgt cgtgcttggg tgactgtcct ctcgctgtgc acaggaagct 8701 ttgtcaagca gcagacatgc catgtatggc aatgatccca tcccccatct atccatttgt 8761 gattttccca tgccacctga ggccattcct tggtctcagt gtccttattt gtgaaacaga 8821 gttaaaatac ctttgctatc aagtatgtag atttaccatg aggttaaagc gaatgtgcta 8881 tgcctatatg aagtatcact tttagttgag cattgagaag gaataattag ggccttgaaa 8941 gatgacaccc ttcctccccc cgcagacatt agcatgtgag gttcctgaag ctgaagctgg 9001 ttgtttgttg tttgttgttg ttgttgtcgt tgttgttgtt ttgagatggg gtctcgctct 9061 gttgcccagg ctggagtgca gtactgcgat ctcagctcac tgcaacctcc gcctcccggg 9121 ttccagtgat tctccgattc 
tcctgcctca gcctcccgag tagctggcat tacaggcatc 9181 caccaccacc tccggctaat ttttgtactt tcagtagagt cggggtttcg ccatgttggc 9241 caggctgatc tcgaactcct gacctcaggt gatccaccca ccttggcctt ttggtttttt 9301 tgagacacag tttcattctg tcacccaggc tggagtgcag tggctcaatc tcagctcact 9361 gcagcctcca cttcccaggt ccaagcgatt tttgtgcctc agcttccccg agtagctggg 9421 attacagtca cgtgccacca cacctggcta atttttgtat ttttagtaga gacagggttt 9481 cgccacattg gccaggcaga tctcgaactc ctgacttaga gtgatctgcc cacctcagcc 9541 tcccaaagtg ttgggattac aggtgtgagc caccacttct ggccccgaag ttggtttttt 9601 gttttgtttt gttttgtatt tgtttttttc cagacagtct tgctctgccg cccagactag 9661 agttcagtgg catgatctca gctcgctgca acccgcctcc tgagttcaag caactctcct 9721 gcctcagtct cctgagtagc tgggattaca ggcgcctgct accacgcccg gctatttttt 9781 gtatttttag tagagacagg gtttcaccat gttggccagg ctggtctcga actcctgacc 9841 tcgtgatcca cccgccttgg cctcccaaag tactgggatt acaggcgtga gccactgcgc 9901 ccagctgaag ctggttgtta agagtaagtg gctagggcta acttgtatga tttttattgc 9961 tctccttttc actgtgaagg gccaggaaat gtcgatgttg gagctgcaac attgaacatg 10021 gttcctgact ttgtggaact catggtctaa gatgtgttct agagtgaggt ccaaaaactc 10081 ttgggagaat cccatactag gggccaaagt cctgctaggg tgtgttatgg aataacagca 10141 acaaaataat gtcaacagcc aatgagcaca ggataactac aacacaccat gatgtcataa 10201 cttggacaaa ggcttgtgac cccctgactt tggcgcaaca tgaaatgagc agaaagtaag 10261 gtatattatg gaataagccc caaaacttga gagaatccca tgctaagggc ttaagccctg 10321 accaagacaa gccagacagg accatgaatg tgtggccatg ttatactgag ctcatcagca 10381 cctgctgagc tcctgtgtca cctctaaagc catgcgcaca agttttgcat tgagataatt 10441 gctctactgg tttctagata tagttgaatt ctccaccccc actcccactg ccctatgacc 10501 tccagcaaaa caaaagcagg taggtagtaa ttgtgttgtg agttaacgat atctatatct 10561 ttggataata aaatctatat catttcagtg gtagcattgg aatatcagtt gcccttcccc 10621 atgaagaaag cacattgtta tcgtttaaat cattatcata atcaaaaata tcaaagaccc 10681 agtcattcat cagtctttcc agtgaagtca actgcctttg ttttttgttt gtttattttt 10741 gagacagggt ctctctctct gtcacccagg ttagagtgca gtggcgcgat ctggctcact 10801 gcaacctcca 
cctcccaggc tcaagcgatt ctcccacctc tgcctcctga gtagctggga 10861 atacaggtgc tcaccaacat gcccggctaa tttttgtgtt tttagtacaa acaaggtttc 10921 gccttgttgg ccaggctggt ctcgaactcc tggcctcaag cgagccccct gcctcagcct 10981 cccaaagtgc tgggattaca ggcgtgggcc actacgcccg gcctgaaacc aattgccttt 11041 gaattcatcc tggggaatct gctgaagtgg acagtttatg taaacacaaa aacatcctaa 11101 tgaatggctg cctcagtagg agtggcagac aaaacacatg gtgcactttt catcaaaaac 11161 acacaaggaa aaaatttttc atcatatctt ttgtagttgc taataaactt aattcacgca 11221 cttgtgtttt tttgtgggtt tttttttttg tacacttctt agtaccacaa agaggaagtg 11281 ctttttctcc ttcctttaag aaaaagttcc agttcaaacc atcagcctct gtaggttgtc 11341 ttaattgcta ctttgcactg cactggacta tgactgattt ttggcacatg ctatcataca 11401 caaagttcct gaaataaaat gctccaggat atcctaggaa ctcttagaag gctttaaaaa 11461 gggactcaca cactgaaaga atatctttga tgaagacaat tcaggcaagc agaatgattc 11521 ttgcaacaga attacatgat taattgagat cttgaagtgg gtccggtgaa tcctggccac 11581 ctaacttatc atgatttggg ggagtttcac gagaatccag ttttgataaa acaattgttt 11641 ttttcctccc caagtgacta tacatttaaa tagctaaaac atctgttcag caacatagta 11701 aaacatatat actcggaacg cttgagagaa gagcctgcca aacagggact ttgctgaggg 11761 agagcaccaa gataaagcaa cactgtttgt tttgtctagt cagggggaaa gccaaggcaa 11821 ccaatatttt ggtttttata attttcattt gtgaagaatt atttgagaaa gggtggcgag 11881 gggagatttc ctgacggcag tttcttaagc tgtccattag tagaagagca agaaagcctt 11941 ggatgtcaac gcctcgctct tgagaccagc caccaaacca cgaaaagtga ctttcttctc 12001 gtgtgctctc tacggccctt ctgatggaag cagaaacagg gagcagcgtg gagactggaa 12061 agaaggccaa cagaggcact cgaattgccc tggtcgtgtt tgtcggtggc accctagttc 12121 tgggcacgat cctctttcta ggtaagtgga gtgcaggagg ccgggggaac tgggtgtgtg 12181 tcgagaagtt tccttgtgct ttattgtagc aaaacttgtc ataacacagg tatagggctg 12241 tttgcctgcg ttcataggtg gtgtatcttt agtttctatc aaatatgcaa atgataaaga 12301 aggctttttt aaggagccta aataaataaa tacacccaag taccaaatgt atagtccaga 12361 aggcaaagtg ggtgtaagtg gcttcgagtg gctggctgat tctgttgcat tgggtggaaa 12421 attttttatg tggtacattt agcaaaatga gtgtttgtga attcatagtc acatagtttc 
12481 atttatattt tgaggaaaga ctttacattg tgatgaaaac catgctggag tgtctttcca 12541 tttgactttt attaatgttt gcttgaacga atcatgaaat ttaattcaca aaacacaaat 12601 cagtccttta aagtgtgctt ttaatgatat tcttcaattc ctttcttatc cagtgactct 12661 ggtgagatgc aatttgattt acaaattaat atgaaggaaa acagaggttt tcccccaaga 12721 tttgaaagat attgtgattc agttatttta atgactgttg gtgagaatga tgtacccaga 12781 gagtatatag gatgatagaa gtgtattccc atgaaaaaga atcaaatatc ttggaaagga 12841 gtgagggatt ttctaaaatt ctaggctttc tgaacccttg actattaccg ttttaatttt 12901 tctcatacaa cccataatac ttcattaact tgaacgttac ataggaacga ggagattcta 12961 ggtgcagttt gaaatgtagc aaacctgcag gaggccatgt ttattttggc aggggtaaac 13021 cagtaattaa actttaaaag agttagggtt gtgtggttga aaactgctaa aatgagcaaa 13081 ccatctttcc acgctttctg tggctgaaga tttaggaaat tagatgggtt tattgctttc 13141 tgataaagcg aaatctccta ctcctgcaac tgaatctgtg gctcttgaaa actcttgtaa 13201 gatttctcag cacttagggt ttaaacaagg gtcctctgct gaaattccaa acaagaggaa 13261 aaattctatg ggcatgccaa aagactacag gctacgtgca acgtgactgc aaggttaggt 13321 tttgcaaatg ctcccaaact taaaaatccc aataattatt gtttatgtaa aaatatgtac 13381 tctttttaac gatattacca ctgcacgaaa cttttacaaa actctacttt tatgattacc 13441 tcctgaggcc gtggcatatc ggttgaaaga ttctccgcgg cagcaaatct ttgttctttc 13501 agagaaaatt attctcagga aatagtgccg atgtttttag agcaaagtaa aattcatcaa 13561 gccagacgct actattatgg ataaggaggc ctgcataaaa tatgaaagag aactttccag 13621 cgtgcccaat aacctggctc tcaccttgac tctctaagag gagctcaaat gatgttagag 13681 aataagggca aactgaggct gtggcctcta caactttgct tctcaaaatg gggtctgtgg 13741 accagcagca ttagcatcac tggggagctt gagctccacc ccagacccgc tgattcacaa 13801 tctgcatttt cacaagaccc ccaggggact gtttgcaaat gcaggtttga gaagcattgc 13861 ttcaggtgac catatctatt tggatgtata cattcaggga cacttctttt tgtcaaatat 13921 ttctgcacta ttttagaatc acccccttac cccctctccc cgtccctctc ctcaactctg 13981 ccacccacct ccactctgca cccaggcaaa gtgcttgtct gaaaagctcc agggacctgc 14041 ctggcaatgc tcttcatgac aatattctag ttagggtcgt cgtctcttca aggggcttcc 14101 tgcttgactt tgtggtatat ttctctgcct gtccacattc 
acttacatgc ctgcaatttc 14161 tagcatccct tcgtacctca tctttgtatc tgaggctctg ggataatctc tcctcgcaac 14221 cttagtgctt ctacagtaag agtgattgcc agtggtcttg cagtggcttc tgagaagcta 14281 cagagcacca gaatggaaaa cctttcttct tctccttccc ctctctttcc tccccacaat 14341 ctgtttctca gatgcactgg ctggcaaact ttttctgcca agagccagat agatagtgaa 14401 tatatttggc tctgtgggcc atacggcctc ttgcaactac tcaattctgc tgttgttgca 14461 caaaagcatc catagacaat atgtacacaa atgggcatgg ctgtgtgcca agaaaatctg 14521 atttctagac actgaaattg gaatgttata tacttttcat atgtcatgaa atattatttt 14581 cctttttatt tttcctgacc atgaaaaaaa tgtaaaagcc attcatagct catgggctgt 14641 acagaaacag gcaatggatt tgacctacag gccatagttt gctgacccct gctttctggc 14701 cattcttatg ctcttgatct gtggatgact ccactgcatc ctaagggcat ttcctctggt 14761 tatagagaaa tctacaatga gaaaccaaac tgctatttac atgtattttt taaagctcct 14821 aaccattcaa aagtaacatc ttaattttta tattaaaaat attttcttta ttgtctctat 14881 ttcattcact tacaccttca cattatctgg gcttattgat cctcatttat ttggagaatt 14941 ttaaaataac cccatccatg gagataatgc tcttccttga ggggttgggt ctgtcagtgt 15001 cccaattaat taattataca cacacacaca cacacacaca cacacacacg tacatatata 15061 tatatatata tatttttttt tttttttttt tttttgagac ggagtctcac tctgtcaccc 15121 aggctggagt gcagtggtgc gatcttggct cgctacaacc tccgcctcct gggttcaagc 15181 aattatcctg cctcagcctc ctgagtagct gggattacag gcgtgcacca ccacgcccag 15241 ctaagatttg tatttttagt agagatgggg tttcaccatg ttggccaggc tggtctcgaa 15301 ctcctgacct caggtgatcc gcccacctcg gccacccaaa gtgctgggat tgcaggcctg 15361 agccaccgtg cctggcctaa ttagttatat ttaaaacctt ccttctgctc atcgttagat 15421 aaagatacct gattttactt cttcctttga ggttttgggt caagaattat ctctcatgaa 15481 aggaataagt ctttaacgat tttattagct attttgtttt ttaaaataag gctaggatca 15541 agaaggtatt gtccagaatt ataaaaatat tatttacaat aaagcttatt tttgcattat 15601 ataaaagaac acatgctcac taaagaacat taggtcaaca tataaaaatt gactgatggg 15661 gaaaattccc cctgaggttc tgccacccag gcatattgga atattgcaca ctaatcttta 15721 tgctgcccat atatattttt ttacataact tttttttttt ttttgcttta tacagaaatc 15781 aaaacacacc tttttgctaa 
gaaacttatt ttcttaaaaa caggactgag atgggccagg 15841 cgcagtggct cacacctgta atcccagcac tttgggaggt cgaggcgggc ggatcatgag 15901 gttaggagat cgagaccatc ctggctaaca cggtgaaacc ccatctccac taaaaaatac 15961 aaaaaaatta gccgggcgtg gtggcgggtg cctgtagtcc cagctactcg ggaggctgag 16021 gcaggagaat ggcgtgaacc cgggaggcag agcttgcagt gagccaagat caccccactg 16081 cactccagcc tgggtgacag agtgagactc ttgtctcgaa aaaaaaaaaa aaaataggac 16141 tgagatgttt catcggcagt tttcttgaat ctctaatgta acacaatgca atgcaaagag 16201 gctgtttgga ggatggcagc agagaaaaag aacaaaggaa aagcaggctc catttagaga 16261 gtgggttaat aaataataaa aactcgccac actgggtagt aatctacagg tcttccatct 16321 ctagcagaaa tataggtact gccgagcttt ctgccacaga gggccttggt ccccattcac 16381 ctcccaccat ccctctcccc atggccacat ctccaaatga ttcattgttt ctgctcacag 16441 agcacagcaa gaatgaattc cccagaattt cctcagctac tacaaatact tttgaaatct 16501 taaactaaca tctgctaatg cccaggtgct acctggggtt gacaaattgg gcacccttct 16561 gtacttctgc agacacctat ttagataatg ttgccaggca gatggagata ttaagccatt 16621 ggtgaaaagc tatttcaatt ttagactttt ttttttttgg agtgggggaa gggaggaaat 16681 agctggaagc ttggtccaca aagttgcaaa tgtctggtag ctaagctggg cctggtgggc 16741 ggagagtttg tggggttcag ggcaggcgga caatgactgt tgctatcaat aggtgattga 16801 gatatgttca gtatattctt tacccctacc ccttttttgg tccgtatctg tatggatggt 16861 ttagataaga tccagaactc taaaagccaa acatgctagc tttctatttc cactctgatt 16921 ggctccaaat gacatagttt aaaagtttca attttcccat cttgaacatg ggcatggagt 16981 tataaaccaa tagagtgaaa gtatgagact taaaccactg ataattggag gctgaagcga 17041 ctttagagag tgtctcgccc agtggtgctc atatttgagc atgtatcgga atcacctggg 17101 gggcttgtta aaccgcagat ggctgggccc cacccccagg gtttctgagt caagatcttg 17161 gagtggcttt ctaacaagct tccaggtgca gcagcagcag ctgctgctgc tctagagacc 17221 acactcttag gagctctggt ctagttatac tcacttatct tacgtacgag agaactgagt 17281 ttcaaagcaa ccaagagtct taccaaggtt atttagctag cagctagagt ctggtttcgt 17341 gacattgaac caactgggtt ttggaatacc gtgtcaacac tgagaaggat gaactgatat 17401 caaatatctt gcgtatgttt ccgagggtgg tttaccggat ggtggtctgc tacaactcag 17461 
ccatttattg tggtctgttt tttcagtgag tcaaggtctc ttaagtctcc aagctaaaca 17521 ggagtactgc ctgaagccag aatgcatcga agcgggtaag tcacagtttt ccatcctgtg 17581 tcaagttata attatggtac cttgggaagt ggaagagaag acaggtggta tttttagtat 17641 tctgtttgct tgaggtttac caagtacgaa atgcaatttt gaagaaattt taagtttatt 17701 ttcaagtttt ggtgtgaaac agtggagctt aggtttcctg ttttcttttt ccttgcaaaa 17761 actgtggtat tctaagcgtt gttacaagga aatcaggaag gggcttggaa ggctgaatgg 17821 cctctcttaa gacccacagg cagagaacat tggtcttcct atggctccgc tggctatgaa 17881 ttcctcttta ccttaagcag ttgcctatct tctggtctgt gggtctgact tgcctgcctc 17941 taagcaaggc tcctttgcaa ataaatccac gcttccattg tcagtgggag tcccttttct 18001 cttttttaaa aattgagacg gagtcttgct ctgtcgccca ggctggcacg atctcagctc 18061 actgcatcct ccacctcctg ggctcaagtg attctcctgc ctcagcctcc ctagtagctg 18121 ggattacaag tgcccaccac cacgcccggc tgatttttgt atttctagta gagacggagt 18181 ttcaccttgt tggccacact gatctcgaac tcctgacctc aagtgatcca cccgccttgg 18241 cctcccaaag tgctgggatt acaggcaaga gccactgtgc ccggctggga atctcttttt 18301 tctgtcacac gttgcaagtc aaacttctct tgaggtgacc aggctgtgta tcagtggata 18361 tcccagggtg aagatgagtg tttaaagtgg ccgcagtttt gccacagtgt cttatcccat 18421 caaactcatg gctggagttg ccccaaaagc agcaacaggg aaattcccat gagggaatat 18481 cccatcccga gatggggcca gttgggattc caaagaaaga agcactaaat aaatgccagg 18541 gcggccggtc agtccaaagc atttattagt gcaacttatg ttgaaagtgc tgcagtgtat 18601 cctcaagaca gtgagagaaa aggggtgttc tacttagggc attagggtgt ggagactata 18661 tgcgggttta aggaatttgg ctcagagccg gtactagttt ctttcagtgt tttgtgcatc 18721 aacctagata cctttatcag tgcctgggaa tgttcaaggg tccagtctgg acctaagcct 18781 gctgggaaaa accagcagct ggaattagcc gggcatggtg gctcatgcct gtaatctcag 18841 ttactcggga ggctgaggca tgagaattgc ttgagcccag gaggcggagg ctgcagtgag 18901 ctgagattgc accattgcac tccagcctcg gtgacagagg gaacttctgt ctcaaaaaaa 18961 acaaaacaaa acaaacctgc aggtggctgg gtcacagagt ggtcagggga ttgtgtaatt 19021 tttggttagg acaccgaaag cggcggtggg gaggtgaggg gaaacctggg ggtcccacac 19081 acagctatta gatttccctc ctaaatcagt gttagggcct agatttcagg 
gagccagatt 19141 ctagccctca ccctcattct ttaagtgggc agggacactt ttttcctttt tccaaaagga 19201 aagtcatcaa acaagcatgt ggccctttca ctttcctctt ggccttaaaa ggttaatgaa 19261 caaattcacc aaatatttgt tgcacatgag gtactgagtt aggttttgga aatagagcag 19321 tgaaccaggt atgggaagaa agacaataag gcaattaata caatgagaaa attatgaaac 19381 tgggtgtggt gacacgtgcc tttagttcca gccacatggg aggctgaggc ttgagccctg 19441 gagttggagg ccagcctggg caaattagtg agaccccatc tctgaaagaa agaaagagga 19501 atatcagtca gtggaaaatg ttatagagaa aattaaaaaa aagatagaac aatgtgatag 19561 caagtgacag gatggctaga ttgaaaagcc ggaggacatg accttggagg tgacctttga 19621 atgatgagga gggaactgct gtgtgaagac tgaggcaagg gctttccagg tggggggaac 19681 agatggggca aagggtgcat agaagcatga acttggtgga cttgaggaac gggaggagct 19741 tgctaggtga ggtgcaagtg atgaggagtc aatgggggac ccaagtgcca gtgagtttgg 19801 gctttcctta agtgtgctcg aaagcctctg gaggctttta tgaaagagaa tacatcttag 19861 gagcatttca aaagatgtct cggtggaggg ctcagttaga gggagtcttt cttgatggag 19921 aagagaccat agggagcaag aatagaagaa ggtgactcaa gtaggtttat ttctgtacgc 19981 caggtgacag acgatgctgt cttgaattca agagatggag agggagaatg ggggaaggtt 20041 taggaggtag agggcctatt gaaggattcg atgcaggctg tgaacaaaag cagacaattg 20101 tgggtgacgc ttagattttt ggtgatggtt gggcagtggt gtttattgag atgggaaaca 20161 cttaggtgaa agagcttgta aattcggaga ccttatacaa aggtagtggc ttagataatt 20221 atgttagtag agttcttttt ttgttccatt cttaagtgat ctctctctct ctctctctct 20281 ctctatatat atatatatat atatatatat atatatatat tttaaccagg ctgtgtgttg 20341 agtgccttac aggcaccatc tcataaaacc ttcataagac ccctatgaag tcagtgctat 20401 caccagcctc actgacagat gaggaaatgg aggcttctgg aggctaaata actgcccagg 20461 gtctcatcct gagttagtag cagagctgga cttcgtgtcc atgtctggct cctttccgtc 20521 ttttccttgg tgatatgggg ccatgccagg caactctttt tggccctgga actatctgag 20581 aactcccagg ccctttcctg ctccaggctt gagtagttga cctgaggttt tctccagtgg 20641 tgagagattg tcatggagtg gggtgagaga gcaggcggaa ctatcaggac aggttttaga 20701 gtttatgtca cctggtgctt tggctgtggt ggggttcttg cagatgtgtg aaatgggaat 20761 ttttgagctc ataaattttc aacttgagtt 
tagaacactt ttttgtatgg aaggccactt 20821 atttcatgag ttttagttct gtattatgaa tcataataat gcaagactaa aacttacaga 20881 ctgacattca aagatgttgc cttctgccag ctagtgtgtt ctgccttgct gtcctgtttt 20941 taattttcac attggcgaat ggctttctaa tgttgaaaaa aaatattgat tttgctgttt 21001 ctggagtgaa aggctcacca aagacccagt tctctttagt gtaacgtgag cctcagtgtg 21061 cccaggggtc tgcagaacag ctgtgagtac acatggcagg agaattacgg ccaagccagg 21121 gccttaaaca agaggctgtg cgtgcgtggg cgtgcatgct gaaagttcag aagactagag 21181 tgtttgatcg ccttctggac catttctgag caggtgccaa ccacttctga gatgcttcct 21241 ctccataact ctcctcttct ctgtgtcctc ttattttaat ccctctttca aaataagagc 21301 tgtgcaaaga atgcaaagaa acaaggaaag ggcatgcatg ctcatgatca cagataaggg 21361 cagacaggct gactgggttt gaagcatgat ttagctattt acaagccaca tgatcctggg 21421 ctaaatatat agccacgcta agcctcagtc tcaccaagtg tgaaatggga ataacaggtt 21481 ttacttatac tagtcgtgag aataaaagaa aataataatg gactgggcac ggtggctctc 21541 gcctgccacc ccagcacttt gggacgctga ggcggacaga tcacttcagg ccaggagttc 21601 aagaccagtc tggccaacat ggtgaaactc catgtctact aaaaatacaa aaattagcca 21661 ggcgtggtgg cgcaagtctg taatcccagc tacctgggag gccgaggcag gacaatcact 21721 tgaacctggg aggtggaggt tgcagtgagc cgagattgca ggttgaatct gcacttcaac 21781 cccaaggggg aggttgcagt tcagcctggg caacatagtg agactccgcc tcaaaaaaaa 21841 aaaattagaa aaagaaaata ataatggtag tagcagtaat ttaggttcac aatgctgatt 21901 cttaacccca aacccagaaa acgatgtaac tctcagttgg caagatgggg gcctgatctc 21961 aattcctttg atggcaaaat tggatcttaa ctcatgtgag gctgcttcag atttcattta 22021 tcctacttac aagaatatgc atatgtttcc ctatagaaac attaatgcac ttgcttcagg 22081 cctgccactc cagccattgc tgaagtatta catgatctgt ggttatatgc aatgtattta 22141 ttttctaaaa tctgaaaaat ctatgagctc tgaaacatat ctaacctcaa gagtttcaga 22201 taagtgattg tggaattaca gttcccagat ttagctaact acatacatac gtacatacag 22261 gatgcttagt gaaatttgat tgcagataca caatgactac tttttctggt gtatgtctca 22321 tggactattt ggtacctact tattctaaaa ggtggttgtt tatcagaaat tcagacttca 22381 ctgggcatca tgcatttata tggtaatgtg cccaacacta ttctaagcac tttacatata 22441 tttaatttgc 
ataatgattc catggggtgc tattattatc cccattttca ggtgaggaaa 22501 ctaagacaca ggttaaatca cctgcctaag gtgtcttagc tgctgcatgg tgtttctctc 22561 attcaaacac aagtggtctg actagagttt gtgacgatac catattgcta aaaaagggct 22621 tagcgtaggt cctggcaaaa aggaaatact caatattggt gattttatta ttgttaatta 22681 ttattattac tactctctcc cttcatcctt gattaaatct aaaaacagag gctggtattt 22741 agggaggatg cctccacctc taccacacaa acagcaactg gtgactgtgt gacaattaac 22801 cttatatata gtgacttgga agagtacaca gcctgagctt tcacagattt ccttcatcat 22861 catatttctt ttattaaaac aagactctgg gttttctcca tggagctgag tagggggaaa 22921 ggcgttttgg tgtcttgaca actatggtcc cttgcaattt gaagtcacag ttcccctgca 22981 gaaagtgcag ccctgtgccc cgctgtgaat tgtgaatgat gaattgggag gctgcaacta 23041 gagttttctc actcagcagg cttctttcca aattcaatgt gagtgatgta tttcccaact 23101 tttgtatcag ggccaaggtc agagctgacc ttgcagtttt tctaggtacc tgcaagaata 23161 ctggcgagat ggagaggata catttgaggg ttgggggcag ggaaagtttg agaaggaggc 23221 tggaaaacag tgtactcatg tgggtaaaaa tgcgtgtgag ggctcccctg tgtgactgta 23281 aatgttgtgt gggactgaga cgtgtgatcc atgtctctga ggtgtcacat agcagattca 23341 ttctgtggct gcatacagta actataaata ccaaccaaaa gaaagatttt gttatgtcct 23401 agcacatttg cacattaaat attaatgata aaatagtagc agtaagtggc cgggcacggt 23461 ggctcacgcc tgtaatccca gcactttggg aggccaaggt gggtggatca tgagatcagg 23521 agatcgagac catcctggct aatgtggtga aacctgtctc tactaaaaat acaaaaaaaa 23581 attagccggg cgtggtggcg ggtgcctgta gtcccagcta cttgggaggc agaggcagca 23641 gaatggtgtg aacctgggag gcagaatttg cagtgagctg agatcacgcc actgcactcc 23701 agcctgggcg acagagcaag actctgtctc aaaaaataaa taaataaata aataaataat 23761 aaaaaataaa aaagtagcag taattaaaaa tgtaattaga gacaactgat gctagttctg 23821 aagcacgaaa atgttaagcc aatttgaaat ttatcaaact tctaagtctt tatttttatt 23881 tattttttta tttttttaag agacggggtt ttgccatgtt gcccaggctg gtctcgaact 23941 cctaggctca agtgatctgc ccacctcaga ctcccaaaat gctgggatta caggcatgag 24001 ccacagtgtt tggcttctaa accaattttt tgctagtgtt ttttttttgt gtgtgtgtgt 24061 gtgttgttgg gggaggtttg aaaatggtgg ggattttgag gccctataga aattagaatg 
24121 tttttatatt attcacttca ttgaagacaa aaggctgttt gaaatttttg tagaaagtct 24181 aggctggaga aatatctagc cttattttta aacctggagg aaccacacat gtaaccaaac 24241 tgtaagatat gtatctaaat tgacacagat tgtacttttt tggggcagtt cattcatatt 24301 tagctgaaat ctctttgggt tttcattact tcaaacaaaa atatattctg gcattattta 24361 aagtatgaaa attgcccaga atgtcatctt ttcaatgtca ttaaaatatt gtctggtttt 24421 ttaaaaaaat caattggtac ataactttgt gctagaatgg tagccagcaa ttgcatggca 24481 gctggaagag cattggagtt caatattggg aattctcatt ttgtaatgaa gggttttctt 24541 ttaaagttca tggttacata tttatgcaca catctcctac ctgtgggtta tcttctatat 24601 gtaattgtgt atgggcttat gaaagaggaa aaaaatggta gatcagtcta ttgtgtcttt 24661 ctgttaactt atacacatac tcagttctac tgctagacct gggtccctca tttagaacac 24721 cctactttgg ccctcctctg gccatgcctc tccttcagcc aaggcgtctg tggagccaag 24781 taggcagccc tctccttcta caccccagaa ttccctgcct gatggcccca agctcccttc 24841 caggatctct gtgagtctct tctggggctt catttctctg agagttacac tgaatatcaa 24901 ggagctcaag aatttgaaag tattttgtaa atataacttt taaatattta gatagatgac 24961 atatgaactt ctgtttccat tagccttcac tgtgaggcaa accacccaaa ccatagaggc 25021 ctggaacaaa aatgattgga tatttctcat gattttgtgg atttggtgga tggtttctct 25081 gccgattttt gcctaggctc atttgtgcca ctgcattcag ctagatgggc gcctggactg 25141 gaaggcgaat gatggcctca gtcaatgcct ggcagttgtg ctggcttggt tcttctctat 25201 gtgtcctctt gtcctccagg aggctgcttc aaagggcagc taaagggctc cattcaaagg 25261 gagttaatat ggaaggtgca aggcctctta aggcttagct ttgaaactca cagaaagtta 25321 cttctgccaa ttctgttggt caaagcaagt gacaaggtca gtccaaattc caggcagggg 25381 aaatagactc taccacttga tgggaagagc agcaaggtca ctttgcagaa ggacaagcag 25441 gatagaagaa atagtcacag ttatctttgc aaacaattga ccacacgctc tatttgtatt 25501 cttgtcccat accccacaga tgttagggat ggccctgcaa aaaccagtga tgataatacc 25561 accaaacttt tatccattcc cttttttttt tttttttttt ttgagagaca ggttcactct 25621 gtcaccccag gctggactgc agtggtgcga tctcagctca ctgcaacctc tgcctcctag 25681 gtttaagtaa ttctcctgcc tcagcctctg agtagctggg attacaggca tgcatcacca 25741 tatctggcta atttttgtat ttttagtaga gatgggtttt 
caccatgttg gccagactgg 25801 tctcaaactc ctgacctgag gtggtccacc tgcctcggcc tcctaaagtg ctgggattac 25861 aggcatgagc ccttgcaccc ggcctatcca ttcactttga atgtagtcat ttcatatagt 25921 gaggtagatt cagagagttt atttaagggg aaatataaat gccaaggctt ggaaactggt 25981 tgatatggaa ttgttgttta atttggaaaa gaacagtata caacattcag tgcttgtcat 26041 taatcctatg attttctttc taaatagctg ctgccatctt aagtaaagta aatctgtctg 26101 tggatccttg tgataatttc ttccggttcg cttgtgatgg ctggataagc aataatccaa 26161 ttcccgaaga tatgccaagc tatggggttt atccttggct gagacataat gttgacctca 26221 agttgaaggg taagtttcta ctggggtttg gtgatacact ttatagagag caatatttga 26281 caggaaaaac atagtttggg atttgaagca tgacttctca ctttttaact accatgctaa 26341 attcatttcc aagaattttt aaaatgaaag attgaaagga aagtttgcag aattctggac 26401 tcccagatct caacataaga ctaatgtagg ttgttatgtt catgtttggt aatagtaaat 26461 tctactgcca aaggataaag agcatttgat tgagattgga agcaaaggaa tagttaatga 26521 aaattatatc aatatactca tttgttgtat tcagagtaaa cttcatttct tgctcattgt 26581 gtcctggatt ttcatggctt acaaaatgta cttaacacct catcatcttt tgacatcact 26641 tttgagaact ctggatgacc ttgtggttgt catattgttc ttgttaactt ctaagttggc 26701 tctagggaga ttcacttttc tcccatggcc actctagcaa tgacaagctc acacacacat 26761 actcacacgt ggatgcatag aaacatacat atgttctcaa atacattccc ctgccccccg 26821 ccaaagttgg aaatgtctgt tattggagga tattctgtta gcattggatc taatttttat 26881 gctaagctat tggagaaggc cacctcagga gggaagtagg aagatttcaa agccagataa 26941 ggtaattatt ttgagaaggc aaaaatagta tggttttttt ttttggctct ttttgacaac 27001 tgtgagttac accatgtaac ttgcattcag taacacctac aaagaactcc tgagaaatta 27061 taaaaatcat tttcccttcc catttacatt ttcccaagat tttaaataca tgaaaattaa 27121 atatgtatca ttttacaata agtaaaactc ttggcaaaat aaataattag atacctattg 27181 tcaattgtaa ggaaaaatgt acttaaattt cacaactaga tataaatcac atggacttat 27241 attttctatt aggagaccca aatttagtct tggaaaatta taaatgggaa tttacaagga 27301 tgtcagggtt ttctcctaac ccaaaaggaa aattctcata ctataaaatc ataaataaga 27361 ccttgaggag tgagtttact aatacttttt tccctgtaga acattaattc atgttggtct 27421 ttgatttacc attcttgtcc 
atttcctatt tcccttctct ggcttttctt aatagacaga 27481 tgtgtacaca cacatgtgca cacacacaca cacgcgcgca cagtcatgca tacagaaaca 27541 ttactgaaag ataagaaaac tacaagattt ccaggcatat gctccttttg catttgttat 27601 aacccaaaga tttctatatt caggacatga agtggttaac gataacttaa ataactttcc 27661 aggggtgtct ggaattttaa cataaaagtt aaaatcagtg tcaatatgct ttgtttgagg 27721 agggtactgt ctacaggctc cttcaaaatc aatagaaatg caagtaaatt gttaagcaca 27781 gggtttttgt atttgctagc ttccttttaa aaatcaaaaa ttttatttct cagctccatt 27841 gaagaaataa aaaaagagca aaaattagaa gaaaggtcgc tcataaatcc gtcacaagaa 27901 ataactgcta ataaagcaac tggggatatt tctttccagg ttttgttgca tgtgtgtgtc 27961 catatgcata acttcatatg cacacctata tattgtaaaa attagctgat attatacata 28021 ctactctaac ttaattttta aaattagttg gtattataca tactactctg taacttaaaa 28081 ctatgcattc ctcatagaca tagattatgt cagtagataa cttttctttt ctttcctttt 28141 ctcttttctt ttctttcctc ttgtcaccca ggctggagtg caatggcacg atctcggctc 28201 actgcaacct ccgcctcctg ggttcaaacg agcgattctc ctgcctcagc ctcccgagta 28261 gctgggatta caggcacttg ccaccacgcc tggctaattt ttgtattttt tagtagagat 28321 ggggtttcac catgttggtc aggctagtct cgaactcctg acctcaagtg attggcctgc 28381 ctcagcctcc caaagtgctg gggttacagg catgagccac cgcgcctggc caacttttct 28441 tattataaat aacactgtaa taaacatcct tgtgtacaca tctttatgtc cttatctaaa 28501 tgtatcctta ggataaattt ccagaaatgg aattattggg ttgaaagagt actctaaatt 28561 tcatttttga tgcatatagc caaattgcct tttagaaagt ttatttcagc tgggtacggt 28621 ggctcatgcc tgtaattcca gcactttggg aggccgaggc aggtggatta tctgaggtca 28681 cgagttcaag accagcctaa catagcgaaa ccccggctct actaaaaata aaaaaattag 28741 ccaggcatgg tggcacctgt aatcccagct actcagaagg ctgaggcagg tgaatcactt 28801 gaaccaggag ggagagattg cagtgagcca agatcatgcc attgcactcc agccggggca 28861 acaagaatga aactctgtct caaaaaaaaa aaaagtttgt ttctaggtct ttatattttt 28921 ttgatgtttt aaaggataac tttttcccat caactgcttt ctgtggtata aagtagaagt 28981 catcaaacca ttgccagtag gcaaaatcca gtccactgcc tggaagtttt attggaacac 29041 agccacatcc attcgcttat gcattattaa tggctgcttt tgcactccaa tggtacagtc 29101 
gaatagttgc aatggaaact ggatggccca aaaagcctaa aatatttact ctttgacctt 29161 tatagacagt ttgccaatcc ctgatgccta gtataaagta aagctttgaa tgtatgtatt 29221 ggattcaatt agaactgtct tattagttga tagctccttg gtagcttctt tgaatttctt 29281 agatagttaa taaagttaag aaataccaga acaggctgag cgtggtggct aacacctgta 29341 atcccagcac tttggaagaa caaggcgggt ggatcacctt caggtcagaa tttgagacca 29401 gcctggccaa catggtgaaa cctcgtctct accaaaaata caaaaaaaat tagccacgca 29461 tggtggaaat cccagctcct caggaggctg aggcaggaga actgcttgaa cccaggaggc 29521 ggaggttgca gtgagctgag attgcaccac tgcactccag tctgggcgac agagtgagac 29581 ttcatctcag gaaaaaaaaa aaaaaaaaaa acagaacggt gttgaaataa tagcaatgat 29641 aatgggtgta attttcttat gccttgcttg ggtggtaatg cctccagtat ttcccctgaa 29701 aacgtacact ttttttgttt gtttgtttta gatagttatt cttttttgtg ttctgttgtt 29761 cctagtttgc aaaaagctgc tgctcaagaa aaatgcctct ttgccattta tcaagagact 29821 cagatggagt atctctttta acttatcttg atttacccat gctgaactaa acttataatt 29881 ttgaaataaa tcctgcctat taatggtttg tattttttaa gacattacca gaattgaaga 29941 agctagcatt ttgtttcaca catttgtctc tatatttatg attcatattg gtttgtgatt 30001 gttttgtgtg ctaggtttgt cgaattctgt tattaagatt gtgctggcct tgtcaaacaa 30061 agtgggaggc ttcctatatg tgtttctatg ctctgggaaa gacataattc aataaatccc 30121 aagttactct atgacaagca ctgtaaaggc caaattcaca aactgacatt agggttcttg 30181 cttcatatgt caagaaggag agcaaatggt ggaaaaggat ctcttaaaat tagtattttt 30241 taacaaaaat aggtaaaata taaaatagct ggacgcagtg gctcttgcct gtaatcccac 30301 cactttggga ggctgaggca ggtggattgc ttgaggtcag gagttcaaga ccagcctggc 30361 caatgtggcg aaaccctgtt tttcctaaaa atacaaaaat cagctgggtg tggtggcagg 30421 tgcctgtaat cccagctact taggaggctg aggcaggaga atcgcttgaa cccaagaggc 30481 ggaggttgca ttgagccaag atggtgccag tgcactccag tctgagtgac agagtaaaat 30541 tctatctcaa aaaaaacaaa aaacaaccgc ataaaatatc tattagattt aaaaactttt 30601 tattttgaga taattcaaac tttgataagt tgaacaaata gcacaaggaa cttctatgta 30661 ccctttaccc agattccctg attattacca tttactacat ttgttctatt atatatatac 30721 tttcattttc catctgtaaa tatatatata gtgtatcatc ctctacatat 
atacttatat 30781 acttttatac acatgggcag atagtatatg ataaaataat atatctataa atgtagagat 30841 aaatatcgat ataaatacag atacatagat ttttttctga actctgcaag tatgttacag 30901 atgattagat attcttaaag tgtatagagg ttttcaaatg tcaaatgtgc taaaaatctt 30961 atttcttttc ctgaaacact gaatctttaa attctatcta tatcttttaa tatattctat 31021 atctatctat ttatgtgtct gtcagtctag ctacctagtc atccaactat ctattcattt 31081 atcattctat ccaggtaata caggctcatc gtagaccaac cacaaataaa cagcaaaaga 31141 agcatacttc accacccagc aataacaact ttttaaatgc aaatataggt ataaaattat 31201 acatatcttc taagggatca tataatacat tctatttaat taatttattt acttatttat 31261 ttatgacaga gccttgccct gttgcccagg ctggaagtgc agtagcacga tttcagctca 31321 ctgcaacctc tgccttttgg gttcaagtgt ttctcatgcc tcagcctccc aagtagctgg 31381 gattacacgc atgcgccacc atggccagat catttttata tttttaatag agacagggtt 31441 tcgccatgtt tgccaggcta gtctcaaatt cctggtcaca ggtgatccgc ctgcctcagc 31501 ctccccaagt gctgggatta caggtgtgag ctaccgtact cagccggata tactataatt 31561 tacttaatca atgcctgtgt tatttaggtt attttttatg ttttcttcat tgtaagccac 31621 ttggaacatg agtttagaaa catcaagagc tggatttgaa tcctgattcc ctttcattta 31681 ccctctatat cactttgggc caattactga tctcacctag ttcagttttc ttatctttaa 31741 aatggagata agaatataca tatcagagta tgcaaaggca tacagagtgg tgtaatagac 31801 cttggactca aaggggggag ggtgggagaa gggtgaggaa taaaagaact acaggttgtg 31861 tacaaaatac actacttggt gacaggtaca ctaaaatctc agacctcacc attatccagt 31921 taacctacat aaccaaaaac tacttgtacc ctaaaagcta ttgaaataaa aaatatatat 31981 attttgaaaa aaggggaaaa aaaggaatac ccatatcata gggattttgt gaagattaaa 32041 ggagatgatt tatataaatt acccagtcca gttcctggca catagcaggc cttccgtaaa 32101 tatttgtcct ttgttcctcg ctttgtttca aagatagttc taagattcat gaagtaatgt 32161 ctatcaaatg ctttaagttt tattctggta ttatcctttg tgtaaatatc atgtgctttc 32221 aaaaatattg aatactgtca agatattttt gctcactctt gataaaaaga gagaacatcc 32281 caaaataaaa cccaatacta ccaatcgaaa gaataccatt ttcttaaggc tgatagaaaa 32341 cacacagata cggctgtcca atttgtgaaa taaaatggac ttgacagacc tggctgatga 32401 aaaattgccc tctactgaag tgagcaacct 
ggctttgtga atttttttct ttaaatggca 32461 tataagactc agattccgac ttgggaaata aaattacttc ctattctctc caagaagtga 32521 gattttttaa aaagccgaat ataatcatat gtttgtctta agatccacat tctataaacc 32581 ccccaaattt gtcttcatgt gaccatgaag aagacacagc atgcattgct tatgaatgag 32641 cagatggaaa tttagcaaat gagcacttat gaaacatgag gcaatgaatg agatagaaca 32701 cttatgtcac acagaagaca gatgactcaa ggtacggcca gatcatctct attacggaat 32761 agctatcttt gccaatattt tgtgaagaaa tgcaatatat tattttcaag cccagcttca 32821 gcagaacaat aagatgaagt agggactctg cattaatgaa cagaaagaga tgattgtgat 32881 tcatagacca tttatttgta aataccttct ggctgagatt caaatagttc tcgtaattaa 32941 acaccacccc ctccccaaat aatctttttt tctgtaaatg atcctagcca aatcatcttt 33001 tgcagacaat ggaaatctga aaacttcagt tcctataaag tcatttagtg attaggaagt 33061 tcagattatt gaaaacagtg aattgtcatt ttcctcatgt tgtatgtata gtagagcttt 33121 gacttcataa agttcttaat tttttaaaat tattttatta ttattttttg agatggagtt 33181 tcactcttgt tgcccaggct ggagtacagt ggcaccatct cacctcactg caacctccac 33241 ttcccaggtt caggtgattc tcctgccttg gcctccctag tagctgggat tacaggcgcc 33301 tgccaccatg cccagctaat ttttgtattt ttagtagaga cagagtttcc tcatgtcggc 33361 caggctggtc ttgaactctt gacctcaggt gatccgccca cctcagcctc ccaaagtgct 33421 gggattacag gtgtgagcca ccgcacccgg ccacttcatg aagttctttg aaacaagtaa 33481 tatcagacta gcaaatgtgg ggatgcagtg gggaagtgct aaggaccaag catccattgt 33541 ggtgggcttt ttattactag tgaaaactca gcttctgtaa ttttgtgtga gtctcctatg 33601 taaactgggt gcaccccttc agagaagatt cccatttttc tcagatcatg ctggctcttt 33661 ttcacgtggc cacatagaaa gcaacaaatt gccaagttac aaatatatca ccacagtcct 33721 atatgtgcaa aggggtgctg catgaagatt tgggcctgta ttgaagactt ttggaatgtg 33781 agattaaata attttacttg aaaagtttgt tgttattcat agttttcctg ctctgatata 33841 ccgtttgagt attggtaggt gtgcgtgctg catggagtgt ggggagaagg taagaagttg 33901 tcagattagg gctgggcgtg gtgactcaca cctgtagtcc cagcactttg ggaagccgag 33961 gcggttgggt cacttgagat caggagttca agaccagcct ggccaacatg gtgaaacccc 34021 atctctacta aaaacagaaa aaattagctg tgcatggtgg cgggcgcctg taatcccagc 34081 tacctcagag 
gctgaggcag gagaattgca tgaacctggg aggcagaggt tgcagtgagc 34141 caagatagca ccactgcact ccagtccggg caacagagag agactccagc tcaaaaaaaa 34201 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaacagacct agtgctagat atgaaatatt 34261 cccaacacac aaaataatga taaatgtttg aagcaatgga tatcccaatt acccagattt 34321 gatcattaca cattgtattc ttgtatcaga taatcacatg taccacataa atatgtaaac 34381 tataatgtat ccataatcat tgaaaattgg aaaactggac aaatatagaa attttttttt 34441 ctgaaggctc tgtggaagat gtgaatgtat ctctaggata tgttctcttt ctaaatagct 34501 aaatgaacta aagaggccat cagagtgact actgtgtgta aaagcttgtg ttttggtagt 34561 tctgacagtc agcctgtcag gttttctttt tttttttgag gcggagtctc gctctgtcac 34621 ccaggctaga gtgcactggc acgatctcgg ctcaccgcaa cctctgcctc ccgggttcaa 34681 gcaattctcc tgcctcagct tcctgagtag ctgggattat aggcatgtgc cacatgccca 34741 gctaattttt gtatttttag tagagacggt gttttgccgt gttggccagg ctggtcttaa 34801 actccgatct cagttgatcc acctgcctct gcctcccaaa gtgctgggat tacaggtgtg 34861 agccaccgct cccagacagg ttttcttctt atgtgtgctt ttccaccctc agtgcatgtc 34921 cttgtcttac tttttgactc ttgtctgctc taagccttcc attttaagat ctgatctctt 34981 cccctattgc tttaatgtag ctcctctgtc attagcggtt tctactacaa ataagttggg 35041 tatgcttggt aagaaccagc tctgttcatt ccattcattc taagttgtat gatattcagc 35101 tggcgggaag ttcaccactt caaatagatt gaaatgcaca acattttgat tgtactactc 35161 tgtgaaaata gtcttattta ccaagcattt acttttattc ctggcactaa ttgctttcaa 35221 tcgtaagtaa cagacagccc aactcaaact ggcttaaaca ataaaatatg ctgattcatg 35281 taaccaaaaa gcctagtaat tatgtagact tcaggtaagg gctgatccag caactgaacc 35341 atgtcattaa ggatcacatt ttttttttct ttctctctct actctgacat ccaccatgtc 35401 gtatttcctt ctaagtctgt ttggtcatga gattgttgct aataaccaga atttatactt 35461 cctcattcat atctagcaag aggcattgtg ttagacacaa gggatacaga agtacaaaga 35521 ttaataaggt cccctaggct gggcggggtg gctcatgcct gtaatgctag cactttggga 35581 gcctgaggtg ggtggatcac ctgaggtcag gagctcaaga ccagcctgga caatatggtg 35641 aaaccctgtc tctactaaaa atacaaaagt tagtctggcg tggtggcggg agcctgtaat 35701 cccagctact gggaggttgc agtgagccga aatcgctcca ctgcactcca gcctgcgtga 
35761 cagagcgaga ctctgtctca aaataataat aataataata ataataataa taataataat 35821 aatgtcccct agactatcct catggaaatt acaatctgca tcttatgagt gttcccatag 35881 gtattgtaat caatactcta gaactgtcct gctcagcatg gtagccgcta gccactgtgg 35941 cttttgggta cctgaaaata tggctaggtg gaattgaggt gtgatgtaag tgtaaaatat 36001 acaccaaatt ttaaatactt cattttaaaa aatgaaagca ctctcattaa ttgatagttt 36061 catttataat cagtaattgc atgtagaaat aatattttga gttatattaa ggcaaatagg 36121 atataccatt aaacatctca cctgtttctt tttacttatt taaatttagc tacaagaaaa 36181 ttaaaaatta cgtatgtggt ttgggtaata tttctaccag acagcattgc tctagagccc 36241 tgaacaggct attagcaagg tgaaatatct gggaataact caaatggatt tcagtgctag 36301 agaaagctca gagaacacct ttatttaaaa tagtcctctg gctgggtgcc gtggctcacg 36361 gctgtaatcc tagcactttg agaggtcaag gtgggcagat cacttgaagc caggagtttg 36421 agaccagcct ggccaacaga ataaaaaaca acaacaaaaa ttagccagac ctggtggcat 36481 gtatctgtag tcccagctac tcaggagctg aggtgggagg atcacttgag ctggggaggc 36541 caaggttgca gtgagccaag actgcacaac tgcacttcat ccagggcaac agagcgagac 36601 tcgatctcaa agaaaaaaaa aagaaagtcc cctacccaat ctccaatccc tttgtatttg 36661 tgtcctttgt ggtccttata accatctgaa atgatgtcat ttgtgtacat gcttgctgtt 36721 cttctcactt acacccatcc ccgctcctgt agatgttaaa ctccatgaga accagcccct 36781 tgtccatttt gggtgtcact gtgaccgcag caccaagcat agtgcctgcc ataggaggtg 36841 ctcagtaaat acttcctaaa gaaaggaaga aaagaatgtg ccctctgtgg ttggctgagt 36901 gcagttgtgg gggcagctga ctgggtaggg aggagtggct ttttcttctc cgtttcactc 36961 catctcctcc tgtgcccaca gtcccagaag actcttgaga tgcaactggg gcttgcccct 37021 gaaaggaggg gaagcaaaat catcagtcat tttccatttg tgttctgggc ctcaggaagg 37081 caggacaggg acgtgggcaa tgagacaggc ctgggccctt ctaagactcc ctgttggtgg 37141 gatctgatgc aagttacaga caagaaaaat aacttgcagg tggttcaagt ttgggttctg 37201 agaaaagaaa aaaaaaaagg gattctgcca cttatcagct gtgtggccat gggcaaacta 37261 cttctttctg agcctccgtt tcttatctgg tgatactgtt ggacaaatcc aatgagatgc 37321 gtagaaagtc ttctacacat ggttcaataa atggaatccg ttgctactgt taatttcaca 37381 aggattaatg ttttaaaagc acttggcaca tagtagatgc 
ttaagaaatg ttctgcagcc 37441 aaataaaaat aacttagaag aagaccttga catcactatt aaaataaaat attgtgatct 37501 agtttttttt ctcaaactcc ccacatgcct gtaaataagg aagctggttc tgaaagaggc 37561 aaggctggag ataaaaaaca gcagatgagg acatttgtac gtattcccaa cttgcattgc 37621 cagaccttgg tcatgaccct aaaccctgga cggttttgat aatctaattg tcagataaca 37681 cattcattgg aggccatgtg atttttaaat cactcccaca ggaagagaaa gaagaggaaa 37741 agggaattgg taaaaataag aataactatc ctcttacgcc gctctagagc ttctctggat 37801 gccacttaaa aaaattctac caatttccct tcttttttcg gacatggagt ctcactctgt 37861 cgccgaggct ggagtgcagc agcacgatct cggctcactg caacctctgc ctccccgatt 37921 caaatgattc tcctgcctca gcctcccgag tagctgggat tacaggcgcc tgccaccacg 37981 cctggctaat ttttatattt ttagtagaga cagggtttca acatgttggc caggctggtc 38041 ttgaacacct gacctcaggt tttccacctg cctcggcttc ccaaagtgtt gggattacag 38101 gtgtgagcca gcgtgcacag ccctcatttc cctttgttct ctcttctttc ttatccctag 38161 gggatcctta acttgagtcc acagtcttcc cacaagaggg ctgtcaatac aattcaggta 38221 cttggatggg aaaaaattgc acctttgttt tcactaacct ctcacagaat tttagcattt 38281 tctttaatta tgaatgtagg caacaaaccc tggtactatt agcagtgcct gtgactttgt 38341 taccaacagc aatcatgatt atttttctac cttattgcag atgttgcagg tatcttgaaa 38401 tgttatgtac acgtatcact acttcaaaat tatagtagtt gtcagactca aagttagggc 38461 ttggtattta atgaattaat caaatagcac atatatatat ttcacacagt aaaaaaagtt 38521 gtgctgtgtt ttgaaatacc tgtttttctt tgtaagccta tgtttcttat tttatgtact 38581 tataaacatt attctcagaa tggtccatag gtttcaacag actgctcaag agatgtaggc 38641 agaaaaaacc attaagaacc ccagtgtatg taaggattgc tagagcaaga ttcaagaagg 38701 cagttttatt tagctccagc ctcttgatcc atcttaattc aggtgcttgg agtggcattg 38761 actattacgt ggcctcatgg gtccactcat gcccaaatct ttgagtactt cactgtctag 38821 actcccgggg taatgagcat gttactgaca gttaatatgt gtttatgagg atattagatg 38881 aaactggagt ttgtaccagt accaaccctt gaactgatta aatgcagtct ttatgattgt 38941 ctatgccttg gaacctgtat attttcaggt gaaattacta aggcttgcca tgtgcttatc 39001 atgaaagcag ggcattaaag gtcaattctg ggcctggcag tgtacatctg tagtcccagc 39061 tactcaggaa gttggagagg 
gaagaatgct tgagcccagg aattcaaggc tgtagtaagc 39121 taagattgtg ccactgcact ccagcctggg tgagagagca aagccctgtc tcaaaaaaaa 39181 aaaaaaaaaa gtcaattctt attcgaagtg atatagcttt tatgaacagt gcaaatgatt 39241 ggaaatagta atacaccaag tagagtgttt atccataagc agatactaca tggtgtacaa 39301 cagagctagc ttgttattca atgttgaaga caggggctgt tgattgaact tagaagggtg 39361 gggagtaagg gagtagggga actgcaaacc tggaggtgct gggagctaag tcctaagggt 39421 cagatccaat atatatagct tggctttgtt ggaaggaagg aatgaacagg gcctttggtg 39481 agagatgaca ggagataaga gatgttgaaa tccattggtg agatttttga cttgcagcag 39541 ggaatgctaa ggtttctgag aagtggagca atagcctgca aaatgctatg catgggaata 39601 ttggctgaaa aaaatgaatg ggaaagcaga caagggggca tttttgtagc ataaatattg 39661 ccacacatta tgaaaagttt cctgtgagtt agcttgaatg ttgtaggagg aaagaaacat 39721 ttcactttag aagattgtgg gctaaattta ggttcctttg actaaatata agttcctcag 39781 catgactaat aacacaggat atgtgcatcc tgagaaagac gtatacaccc acaatgacac 39841 ttttccacct gcccacaccc aggtcctgtt ttacaaggtt ggtatgttta agtgtatttt 39901 actttgcagt tgttcagaag gattgcatat ttttgaaaaa ctacttcctg aaagaggatg 39961 actgtttgga aaggaaagaa aaatcctagc tggctttggt ggtaacagtt gggaggcata 40021 cccacgtgag ctttggtggt aacaattcct gaagcttctc caagggtcca cgtgttcctg 40081 ctgttggttg ccagaagcca gaataataat ggagctgaga atatgggatt tttgtgagaa 40141 attccagggg cattcagatt tcacagcaga agcagatgac tcagatgtcc tgcgccttag 40201 agagatgcgt gaatccctta cctttaccat actctgaatt ggtcaacggc agaatggaag 40261 ccctacaaaa attgaaaagt tggttgcatg ctaacattca tagtggaact gagttgaaaa 40321 ttttatcctt gtatttccag actctctgat tagattgttt tagctatgct ctcaccccca 40381 attctgagaa cttttcaact gtgactgagg tgtggatggg tcaagcatct ttacaatgtg 40441 acaaaatccc catttctcct ctgatttact actaatgggg agataattca ttactgtgaa 40501 gtgagccttt aatagattaa aaaagttgta agaatatgtc acatagggca ggacctaaca 40561 gggctgggca ctttattgag gtgaccttat ttaataatat cattagttat catttattgg 40621 actctgaact gagctcccaa cacggaagat gtaagattta tctgcatttt acagatgaga 40681 agacagaggc accaagcagt ataactagcc caaggtcaca cggctgatat gcaatggagc 40741 
caggattaga tttgggccat ttgaccttta ctgcctggta aaagccctga aaactaggaa 40801 ggtgatatta ttcctgtttt acgattgggc aaactgtggc ttagagaggt gtattagtct 40861 gttttcacac tgctataaag aaatacctgt gactgggtaa cttgtaaagg aaagaggttt 40921 aattgactca cagttctgca tggctaggga ggcctcagga aactgaaaat catggcggaa 40981 ggggaaggag gcatgtctta catggcagca ggtgagagag agtgagcaag agcagggcaa 41041 actgccttat aaaaccatca aatctcttga gaactcattc actatcacga gaacagcatg 41101 ggggaagaat gcccccatga tctaatcacc tcccatgacg ttcttccctc aacacgtggg 41161 gattatgggg attacaattt gagaattctg ccactgaacc acccatgcgg ggattagaat 41221 ttgagatgag atttgggtgg agacagagcc aaaccataac aacaggttaa gtaatttacc 41281 catattcaca caattagaga cctataaaga taccttccag agtagtattg tccgatagaa 41341 atataatgtg agtcacaaat gcaagtcaca tgtgatttta tgttttctag gagccacact 41401 ttaaaaagta aacgtgtcag cagttgggat gatatatttt atctacccca atatatccaa 41461 aatactgtta tttctacatg taatcaatgt agaaaactta cccatgaggc attttctctt 41521 ttttctttct tactaagtct ttcaaatgtg gtgcatatct taaacttgca acccttctcc 41581 atttggacca gccacattgc aatacccagt agcctcgtgg gactactgaa aagtacaatt 41641 ctagagtggt ggttctcaac ccttgctgtg cagtggaatt actagggagc ttataaaaca 41701 cagatgcctg aggcccattc tcagagattc tcatggaatt gatctggggt gctgcttggg 41761 tgttaggatt tttgtttgtt tgtttgcttg ttttttgaga cagagtcttg ctctgttgcc 41821 caggctgtag tgcagtggtg tgatctcggc tcactgcaac ctctgcctcc caggttcaag 41881 cgattctcgt ccctcaacct cccgagtagc tgggattaca ggtgtgtgcc accatgcctg 41941 gctaattttt gtatttttag taaagacggg gtttcaccat gttggacagg ctggtctcga 42001 acccctgacc tcaagtgatc cgcctacctt ggcctcctaa agtgctggga ttacaggcgt 42061 gagccactgc acctggctag atgtgaggat ttttaacgac ctccctcccg accaggcagt 42121 gcccaaacta actgagtaga aaactgtagc ctttgtacat tagagaagtt tctgtgagtt 42181 ttctgagatg agaaccattc agaatgactg ctctggcagt atgctttgct agttgacatt 42241 aatgaatgcc aagcaatgcc caactaattt attaatttct ctccatttac atttcccaac 42301 atgttgacca agtctccaga ataatataag aaaagagatt ttctttctct tgctgattta 42361 cacattggtc tggatacata ccagtgaaag taaattcaga cataaaattc 
tactctttta 42421 attgaggtca aaggtgttgg tgtcagcctg gaatctctga gctgtggaga agaggaagtt 42481 ggggtttgaa agctgcaggt ggctgccaga atgagacaag tctagtgggg aatgcgtggg 42541 aggtgctggg tgaagaaagg aggcaattgt gtttcgcctg aggtctgaag gctgcatagc 42601 ccgctgtgtc tcaggacgtt cacaaagctg ccctcaccag aagcgtcgcg tccaccttcc 42661 cactggacca aatttttgat tctgcccctg aaccctggat atgaaaccca atactgaaga 42721 attaaaatgt ctcatctcac ccggaggttc agttgtcctt aaaatgcttt tggattcatt 42781 tggaataatt tctatattgg cttctttcat ttaaatccaa atatagcttc agtatttctg 42841 acatatttat tactttccag aatggaaata ctagttcttc tgaagtggta gctatttagt 42901 tatagtcaat tttgttagtt atgtttattt gacttcaaat taagccagag aatatgcaaa 42961 cactgcagtg tcttccacag ccttaccaaa atgaatacca tatttcatca aatcaaaaat 43021 attgtcaact gtaagatgca ccattgtttc acatgagcta agaaagagta agctattaca 43081 tgacgctttt ttatcacttg gaattttatt ttgtgttcat tgaagttctt ttagcctgac 43141 ttagacataa ctgtttaaaa atattttaac ttttattata ggtttagggg tacctatgca 43201 ggtttgttac gtaggtaaac ttgtgacttg ggggtttggt gtacagatta tttagtcact 43261 agacactaag catagtacct gatgtttgtt ttttttctga acctctcccc tcttcccacc 43321 ctccaccctc aagtaggccc cagtatctgt tgttcccctc tttctgtcca tgtgttctct 43381 cattatttag ctcgcattta taagtgagag catgcagtat ttggttttct gttcctgcat 43441 tagtttgttg agaataatgg cctccagctc catctatgtt cctgcaaagg atgatctcat 43501 tcttttttat ggctgcatag tattccgtgg tatgtgtacc acgttttctt tatgtagttc 43561 cagaaagttc tcaggcgatg cccatgctgc tggtttgggg accacacatt caatagctag 43621 gctttagtgt gtctaaggac atagtatgtc actgaagaaa ttcagtacct ctccagggta 43681 catggatgtt ttaatttttc ttccatttgt attattattt tgatgaaaat gtatatattg 43741 caaatttgtt tgccttgcat tgggtaaatg caaaggaaga aagcaggaat tttgtatgtc 43801 tgatcttatg gtacagggag agagattttt ttccccttaa tttgtttcat gaaagatttg 43861 ttcatttcct tttcctttgc tttctttatc caacttggga aacttgaatt tattaggaaa 43921 tctgcatgta attgcagatt aataacctac ccattaatac cacattagtt catcatgatt 43981 aatagcatct atgtgaatac ttaatgaatt tgtaatgatg gactatttgc tttaaagcgt 44041 caagagaaca ttccatgtaa aaaggcacta 
cccgtaacag gaaaattaat tatatgctgg 44101 ttggataata gcttgtcaca aagaagatat gtttgtttgg aaagcctcat tggtattgat 44161 atttgcagtg agaggtactt tttttaaaac attcatgaac cttttaattt gtaaaatgag 44221 aaggaaattt tattttcatg aacagttaaa ataggtgtag ggactgatat ttatgaaaga 44281 gagcagcaca aacacaaata tgtgctaata tactcataag tttacctatg tcactaggca 44341 gaaagacttc ctatagtggc catttagatt aagttccgaa ttgaactgct tctactaccc 44401 actggtctat cctaaattca caagttcact tctttaaggc tcaagcattt tttttttctt 44461 ttgagacaga gtcttgctct ttcaccaggc tggagtgctg tggcatgatc tcggctcact 44521 gcaagctccg actccctggt tcaagcgatt ctcctgcctc agcctcccaa gtagctggga 44581 ttacaggcat gtgccaccac gcccagataa tttttgtatt tttagtagag atggggtttc 44641 actatgttgg ccaggatggt cttgaactcc tgacctcgtg atccgcctgc ctcggcctcc 44701 caaattgctg ggattacaga cgtgagccac cgcgcccggc cggcccaagc atttcttgat 44761 actcatttta tttcttcaga gaatcatttg ttttctctac tgtcttcact ctaattctgt 44821 gatgtcagca gtcatcattg tttatttctg tctttgccct atatctgagg aagtgaaggt 44881 caagaagatt atgtgacttg cccaaaccaa tccagccata taatttgttc gccacagatt 44941 atgggtctct catttcaaat gtctccagtt ttaaccccgg cggaaagagt tgcaagcgca 45001 gagactctgt ccagctggtg cctttgagga aacccaagaa gcccagtgtg attggagctc 45061 tgagttggaa gggaagaggc ataaatgata aaactgagga gaaaggcggg ggccagagca 45121 tacaggcttt gtaggccgtg taaagatttt agactctatc ctagggcaat gggaaggcgt 45181 ttgtaaatgt catgttgagg gatgtaatga gcgatatgtt ttgccagagc ttgctgctgt 45241 ctgtggagag tggcttggag gaggtagctg acaaatggcg accaattcat gctttacagg 45301 ccccatcatg ttgatctatt ttctagtctg gtatccaaac gaaggtggag tttgacttcc 45361 ttgttacaga aaataccaag gcatgattgt ttatcctttt ctctattttc tccttaggtc 45421 aacatgaaag atcattgctt cttctttgtt gcccttgatt tctttgaact taggaaggta 45481 ttttgtcaag cacccccact catggggcaa aatctgccag ggaagtcatg gcagctactg 45541 gtgggcatgg tagttagtac tcgttctacc agcacttttg cccctttggg ttcactgctt 45601 gtgccaaagg tgctgttgct tccccccttt tgggggagag gattctggat agagaaggaa 45661 agaaagcctt acccctaatc actcactgtc ggaaacccag ttctggagga gcagtctccc 45721 tcgaccacac 
attgtggttt gaacagccag gaatagaacc cctccttctc tcacctcctc 45781 atagaaggtg ggagtgaaaa caggggttag ttgggaagtg gctttttaaa cctatgttta 45841 cagtgttttg atgtcacttt catagacgag ggaggaattg tctgaagctc agatttttta 45901 ggatttcagg gatgtgatgt ttatctctct cttcctctct tgaggatgtg gtataaactg 45961 gaattgaaaa tagatacccg gctgggcgtg gtggctcatg catgtaatcc caacacgttg 46021 ggaggttgag gtgggtggat tgcttgagcc caggagttca agaccagcct gggcagtatg 46081 gcgaaacctc atctctacta aaaataaaaa ttaaccaggc actgaggcac atgcctgtgg 46141 tcccagctac tcagaaggct aaggcaggag aattgcttga gcccaagagg cggaagttgc 46201 agtgagccaa gatcgcgcca ctgcactcca gcccaggcaa cagagtgagg ccctgtctcg 46261 aaaaaaaaaa aaagaaagaa ataccctgta tggtattctt tctttcactt tgagagtctt 46321 tctggcatag tgctaatatt tttttccact cttgaatctg tgaatcaacc ctcacatcaa 46381 acccctattg cattttatgt ttaagaaggg aactttttaa agctgaggtt ggcagaactg 46441 agaaacttag acctaaatgg gcctatgcaa ggagggtgtt tccaaaacaa aacaaagtgt 46501 ttagtgtcct ccttgaagaa tgttgacact gagttaaaat ggagctgcag agtgaggaaa 46561 tcagtaaaga caggacaaaa gttccttttt tgatttgagg tagggtagga aaggcagagg 46621 tgggattctg ggctggtgat gtttccaggg attgcaggca tcaagaaaga ccggtgagtg 46681 ctgttggttt aaagtagttg ttctcaaagt gtattccccc gatctgcagc atcaacatta 46741 cctggcagtt tgttagtaat gcaaaatcct gggtcccacc ccagacctac tgaattagac 46801 actcaagtta aaacacagcc atttgtgttt taacaagctc ccattttttt ttttttttta 46861 aatttttgag acggagtttc actcttgttg cctaggctgg agtgcaatgg cacggtcttg 46921 gctcactgca acctcaacct tccaggttca agtgattctc ctgactcagc ctcccgagta 46981 gctaggatta caggcatcca ccaccatgcc cagctaattt tttttttgta attttagtag 47041 agatggggtt ttaccatgtt ggccaggctg gtctcgaact cctgacctca ggtgatccgc 47101 ctgcctctac ctcccacagt gccgagatta caggcgtgag ccaccgcgcc tggcctaaca 47161 agctctcttt attattccaa tgttcactca agtttagaaa ccactggctt agactagagc 47221 aaaatgtgag gtcaagcctt gaggaccagc caactttgtt ctgttctaac ctcaccaggc 47281 ctcctacctc tggttaggtg ccccaggagt ctgtgtccct ggtcaaaaat agggtgacag 47341 ggcaaaggga gtagatggct catcacaaag acattgcacc actgcaaaaa gaagttatga 
47401 tgtaaccagg gaaggtcttt ccttcagtta tggtggagag atctgtcagg gaattccatg 47461 agggcatagc tgcctctcat ttagccactg ggggcccatt ctctcacagg aaagcaactt 47521 gcagaatttt tctgggtatc tgatgtttga attaacagga gtcagagacc tgagtccaaa 47581 tcctagctct aacctttact aactgagtct ccttggacaa gttcttatag ttttcttgcc 47641 tgcacaatgg agttatgata tatgtttctt aggattgctg ggcaggttaa ttgaggttat 47701 gtattaatac aatacatctg gcacagagca ggtgcttaaa aaatgaacac tgtcgttact 47761 ggtccttgga tgtcttagga ttgttcagga acgaccattg cagggagtat ataaacatac 47821 acggcttttt ccttgtaacg ttaagatgga taaaatagga ggcagaggcc aggtgtgctg 47881 gctcacatct gtaaccccag cactttggga ggccgaggca ggcagatcac ttgaggtcag 47941 gagtttgaga ccagcctggc caatgtggtg aaaccccgac tctactaaaa acatacaaaa 48001 aaattagctg ggcgtggtag cacgtgcctg taatcccagc tattcatgag ggtgaggcaa 48061 agaatcgctt gaacccggga ggcagaggtt gcagtgagct gagatcatgc cactgcactc 48121 cagcctgggt gacagagtga gactgtctca aataggaggg agaaaaaata taaataatta 48181 gagttagtag taaaaatgct atacgaaaaa actgaggata gatttatttt aaacctcaag 48241 tttggcacag accttacatg gaattatctg ctaagataca aaatatctaa ataggaaagc 48301 tatttatatt gagttaaaac ataagattgg cagattcatt gtaatgctat gctatcaaag 48361 aataatgcta aaattgaaat tattttgaag tataaccttg ataataagta taattcttat 48421 aaattcactt ataagtgaac ataataaaag agttatcttc tgttagagtt ttgtagtatc 48481 tggcaatttt atataattaa tttcattatg gacaaggagg tcttttgtgt taacagggac 48541 atttttgtgt tatttggtaa cccatgattt agaatgattt agaggatttt aatcccctta 48601 aaatcatctc tgcacctttt tcttaagctg atgctaacca acaattatca tttataagat 48661 cataaaaaat gaacagagaa aataattctg acagcttcta tccctatatt taggtgggag 48721 ttttctagag gaatgagaaa gatcagctcc tattggcttt tcagtatgtc atacagattc 48781 agttaatccc atgcttgact ggccgtgagg ctgaggaact gccaagctga gtgtcacaca 48841 attcgtgtga aaatatatga gagttacggg atccacggac catgacttag ttaacttgaa 48901 ttttattttt taatgagaat aacaataagc tttgcaagct tatttagagt ctgcagcttt 48961 aagtgcataa gggtttaata cctttaatac ccatgcagtg tgagttagaa aaatgaaaat 49021 tttcagtata gtaaataaac tgaggagcct ggaaaccaaa 
ttgaaatctg tagcctagaa 49081 aatcaacaag agctaaaaat agtaggatta gaggcagcat tcaaccatag accatccaaa 49141 ggtagttttt aagtaccacc aaaggagaaa gcaggccatt ctataataag ttttaaaaaa 49201 aatctaggtc tagagaacat tctgttcatt atgtagatgt ggaagcttaa agtatagatt 49261 tggttttcca gagttttaat ttttcatgtt ctttttttca ccagaatcag agggcactcc 49321 ctctgatttc caaaagagaa atgttaccac aatctgcatg aaattgagaa tttaccttgc 49381 tgacactgac gtgaagattt tagaatgcag ttgctggcca ggtgtagtgg ctcatgcctg 49441 taatcccagc actatgggag gctgagttag gaggatcact tgagactagg aattgaagac 49501 cagcctgggc aatatagcga gaccccatct ctacaaaaat aaaaataaaa caattagcca 49561 cggtgtggtg gtgtgcacct gtagtcctag ctacttggga ggctgaggtg agaggatggc 49621 ttgggcccag gaggttgagg attcagtgag ctgtgattat gcctctgttc tccagtctgg 49681 gtgacagcac aagaccctgt ctctaaaagt acccattaag caaatcaagc aagtggctcc 49741 aggattcact ggtctcttag caaagagcca gaggccccct ttcagaatag gcacatggaa 49801 atattgcagg gtcaggagaa agaacccaaa ggagggtgtt tttaagtgtt cctgtgctcc 49861 tgagccactt cattcaaatg tcagctcaat caaggttctt ggtaggtttt tgctcatagg 49921 cctaaaactt tgcatgtgtg gagtttttga aaaatgtttt aggtggatat gtcttagata 49981 ttgtagaaac tttaaatgat tggcctgagc tgcagatggc tgttatgaac tcttgactaa 50041 gcatggtact gtaaaatcct caagggcctc atccttcctt tggaatttcc tttttgtatt 50101 ctagagcact caataaatgt cctgtctgag cagcagaatt gaattgagtg aggattgata 50161 gtcctctctc agcttccgag cagaactaag tatttttttt ctccccagtg tagaagagca 50221 gggcattggg aatggatgat aaagctgaga caattgatga aaataaaact taggtaccct 50281 taaaactggc gttcatgaaa acagatgaat gaagtcttgg atttcattgt atgatatagg 50341 gactgagatg tgaaaaaaag agatggtgtt gttagtatga tcagggcatt gctgattgta 50401 aaaagaaata ataaaagtag tcagtccagc ttctcatagt tctttttcaa ttggatgaaa 50461 gtctctctca gactttcctt gtcaggaatt caactcagcg aacattcact aagcaccagc 50521 catgccaagc tctgggtcct gggtgttcaa agatgaatgt gacccagccc ctgcccaaaa 50581 gagtattcaa tctaatggag catagccatt gttaatgcca atgttttata ccaaaagtat 50641 ttcttggtga tattatttga ggcccaatga ccaaaaatat ctttattatt aaaattctca 50701 cgtttcatcc cctttgaggc 
taggcttggg ttacaaaggc atggttgtcc acaccatggt 50761 caaaaacaaa ttttagatgg actgtagatc taaatgtgaa acaataaaga ttttagaaga 50821 aaatgtaaac tgtcaccggg cgcggtggct catgcctgta atcccagcac tttgggaggc 50881 cgaggcgggt ggatcacctg aggtcaggag ttcgagatca gcctgaccaa catggtgaaa 50941 ccccgtctct actaaaaata caggctgggc gcggtggctc atgcctgtaa tcccagcact 51001 ttgggaggct gaggcgggtg gatcacctga agtcaggagt tcgagaccag cctggctaac 51061 atggtgaaac cccacctcta ctaataatac aaaaagttag ctgggcatgg tggcgcatgc 51121 ctgtagtctc agctacttgt gaggatgagt caggagaatc gcttgaacct gggaggcgga 51181 ggttgcagtg agctgagatc ctgccactgc actccagcct ggcgacagag cgagactcca 51241 tctcaaaaca aacaaacaca caaaaacaaa aaacaaaaaa tagctgggca tggtgacatg 51301 tgtctgtaat cccagctact tgagaggctg aggtaggaga atcgcgtgaa cctgggaggt 51361 ggagattgcc gtgagccgag atcaagccac tgcactccag cctgggcggt agagcaagac 51421 tccatctcaa aaaaaaagaa aaataaaata aaatgtagac tatcttcatg agtttgggat 51481 aggcagagat ttcttgaacg ggacactaaa tatggtaacc atggaggaga aaaatggaga 51541 agctgaacta aaataatttc agtttattac aagataccat taaaaatgaa gtctatgtaa 51601 tgggatattt tataatatat tcaaaaagga ctcgtacgta gaatatagaa agcacttcta 51661 caaatcaata acaaaaaggt agattgctct ataaatatgt gatatagact tgaataagta 51721 ttttaccata gaggatgtta attttcttgc ttggggttag tgattggaag ggagcacaag 51781 gggcttttgg ggttccagta ataatgctgg ttacttgagc gtgttaactt tgtgaacatt 51841 cttcacatta tgatctgtgt acttttctgt atgttgtatt atttgtgtac ttttctgtac 51901 tttcattgga aaaatttaca taaaaaaaga aaatgctttg ctgtccagga tgaagaatgt 51961 gagcagaata tttttggagg ttaacactga ttgcaaaata tgtcttccca tttagaattg 52021 cctctactat agaagcagag gaaatgttga gagcatgggg ctttgcaagg tggctatagc 52081 aggtgcagtt tgtgaaatag tcctacacac agcctggcag tgtcttaggt tagcataagt 52141 cctttggcca acaatatcct gagaagcaag caggagaaca aacacagtta tggaggaagc 52201 actagaaaca agcttgatta cttgacagtt ttgtataggg cgaattctgc ctgtaatcct 52261 gtttaagtca ccacttactg atccccgtta tcatgtgccc aggcaccaag atgggggctt 52321 tctgtgtatt aaaaatttta aatcctcagc acaatcctgg gaagcatgta ctattattat 52381 
ttgactttac agataaagaa actggggctt agagggatta aataactcgc tcgaattcca 52441 gctagcaagt tgcagcaagc tgcaactgga acccatgtct gtgtgattcc aaagcacccc 52501 cttcccacat gtctattatg ttgcctttaa aaaaaagtta tttgatggag catagtgaga 52561 agaaaaatac ttaatatgta cttaatgttc aaatttgatg atcagaagaa cttctcaaac 52621 cacttttaaa gtctgaggtt tgctgtgcta tagttttttt tttttttttt tttttttttt 52681 ggaaacacac tttgtcgccc acgctggagt gtagtggcga gatcttggct cactgcaacc 52741 tctgcctcct gggttcaagc gattctcctg cctcagcctc ccgagtagct gggactacag 52801 gtgtgcgcca ccatacccgg ctaatttttg tattattagt agaggtgggg tttcaccaca 52861 ttggccaggc tggtctcaaa ctcctgacct tgtgatctgc ccgcctcggc ctcccaaagt 52921 gctgggattg caggcatgag ccaccgcgcc tggcctgcag ttttatgaac tgctattgtc 52981 ctttgcagtt gtaaggtggt aagaaatacc tgggtggggt ggaggtgggg attgagaaat 53041 cctccaagaa acaacacagg ctatgattgg aatgcccctg tatgggtgac ccatgagcct 53101 ttctcaatgc tttggggaca ttttaataac ctgagtaatt acctgtggta cgcaggcttt 53161 gttttctact cattaataac tctccaggct acattttatt cttcttactg ctaattcaca 53221 tgcagagata cataaaatgg atgctgtcaa ctgattgcca catcactgtc tgcatatcaa 53281 tgctctcctg tctgcttggg ctcagagaga aggaaagaag agaaggaaaa gggtatggca 53341 aggaacttgt gatagccgaa tctgttttaa aagccaaatt ctggtacaga aaccagaatt 53401 ttgacatata caaattattg atatttgtgc atggttagtg ttttgggatt gccctattta 53461 gcatctgctg atcctctttt tccctccctg tgatagatag catgccttgg aagcagacac 53521 aggtaaatat ttaatgcttt tagttttttt tgctcctgtg gcttgtttgg cccttagggc 53581 attcatgttt tttagcctgt ggagtgctta tagcatcaga acatcatcac aaccctaaat 53641 gaaatttgga ttatttactt ctaagtatgt ttcccaaata gtctaatgac tgtgtaagca 53701 atatagtgga agggtacagg agccgatttg aatggaaata tatgtggata tggctttaga 53761 tctttctgcg tattaaaaaa aaatctcctt taaaaaatcc ttggggacct ggcacggtgg 53821 cagtggctca tgcctgtaat cccagcactt tgggaggcca aggcaggtgg atcacctgag 53881 gtcaggcgtt tgagaccagc ctggccgaca tggtgaaact ccatctctac taaaaataca 53941 aaaaattagc cgagtgtggt ggtgtgcacc tgtaatctca gctactcggg aggctgagac 54001 aggagaattg cttgaacacg ggaggcagag gctgcagtga gctgagatcc 
tgccactgca 54061 ctccagcttg ggtgacagag tgatactctg tttcaaaaaa aaaaaaaaat ccttggggct 54121 agttatcttg attttattat tatgatgata cacaagaaag aaaacaccaa acaagctgaa 54181 ataattgaaa atactcacat ccaacccaat ttaatttatt tcactggaat ctatctagtt 54241 atatggactt gttttaggta caggtactta tttttaaaaa actagacagg agctctgggt 54301 tgcttttttt tttttttttt ttttgaactg tagaacttct ggtttagaag taagattcct 54361 cttcaggaga tatacaatta tttattaact taattagtgc cgtacatttt gtcttgaagc 54421 tattcagagt attataggaa catctgttgt actgtgatta tgatggctgc taggctgttt 54481 ccccaaatgc cttgacaagg cagtttgaca tagttcggtg aaggcagtgc cctggtgctg 54541 cagtcaccat agggattttt catcttggct tcatgttggt agattgatac tctcaagtac 54601 gttcaaggtt ctctttttcg ggagagggtt gttttcattg agtcctggct ttcctctttc 54661 cttatttaat gaaatagtga caactatcat tttctacctt caaacattat tattagtgat 54721 tttataaata gaaaattatt ttccaggtgg gcctatgtct gtgctggcaa ccattgtgta 54781 cacacggact ctcttagatg tttaggactt tgcatgtgta aacgggcaag ttttcaggca 54841 attcgggtag gttttccaat cttgtctttc catttcctgt ctaaagaact tcttccatgt 54901 ccacagcatg tttattcaac aatggataaa acttctaatg ggaaatttct tccgggtgac 54961 ctccttgaga ggctgggctg tggagaaatc ttaagaaaca caaccctgat gaaattggct 55021 ccgttgagaa tattacttgg tccaggaaca agggacagaa ttcagttatt tttggttctt 55081 gaataaatga attctgtttc cttttgtgta ggtggcagaa gaaattaacc cctgtagaac 55141 atccattctc agttggaaat taaatttttg gtttcttttg agttaggagt accctggata 55201 agagaatgta aaactgtttt tatttgaacc tcatgcaact tggaatctgt tctgttgtct 55261 gcatatcttt gatagatttg tcttttccag ggttgtctgt ctttgtattt aattttctgg 55321 aggttggaat tgtgattatc aaatgacttc caacttggca ccatatgtgg gtggatacta 55381 atgaatcctt tcttaacctt cccagaactt ttggagaaat caatcagtag aaggcgggac 55441 accgaagcca tacagaaagc caaaatcctt tattcatcct gcatgaatga gagtgagtga 55501 tgaagaaaac taaataaaat atttaccatc cctatccttt agagttctat taatgtttta 55561 atatattgtt tgaggaagaa gagaactact aagaattcta ttaatgttta aaggtgttaa 55621 aggaatgatt gtgaaagact ggattttgac tagtcccatc ttagttaatg ttgttactgg 55681 cttggtggag aaaatcttat ttatatgagt 
tggattcttt gggtatcatt tactccctga 55741 gaaacttaga caaagtcagc agggcaaact ccactgtatc tggtagagaa tccagaagcc 55801 aggccaagtc tggcttgaga cagagactgg gagggagtgc acctactgga gcctagagaa 55861 atccagattc ccaggcatgc caggttcctg tgtcagccta acaggcatct ttgttgggtc 55921 cttagccctg tctcaatatc ttgggtctga ttggaatcag aacagatact ggccacttga 55981 cctggtctgc cttctcaagc ccaccacagg aagaagccag tgcctgtggg aagtgtgtca 56041 actggtctag gatttggggt atcatgtggt gagctagaaa agagcaataa aagacaggaa 56101 tctgaagcat cacttcttct tcattgggta gtgattggct ttattgaggc attcctttag 56161 ttagggttgt aacttagtcc caccaagtat tattcattgg ctgggttcca tgtagagaat 56221 ccatgtttat tgtagtgaat atccagcagt ttccctggtc agctctcctg gcaagaacac 56281 taggtataga cagtcacatt gaagcgtgga tcgtaggtag ttctgtctct gtgattatgc 56341 tcatcccacc ccacctcttt tacctactcc ctaacagctg aaattttaag ttggagcatt 56401 atgggaattt tgtcttgaga agatgcactt aaattctagt gtgctgatcc agtttgcaat 56461 tctaactttc tcttcatatc tgctcccttt cagaagcgat tgaaaaagca gatgccaagc 56521 cactgctaca catcctacgg cattcacctt tccgctggcc cgtgcttgaa tctaatattg 56581 gccctgaagg ggtttggtca gagagaaagt tcagccttct gcagacactt gcaacgtttc 56641 gtggtcaata cagcaattct gtgttcatcc gtttgtatgt gtcccctgat gacaaagcat 56701 ccaatgaaca tatcttgaag gtataatgag gacccattca tcttctttgc tcagtcctag 56761 attagccttt tggggtgcca tcctggggaa agagactcat gctgccttag tgaaataata 56821 acataaaaca cttttattga atagtagaat ccagaatatt cccttctccc ggctttgcag 56881 aatgtagacc attaccatac aagttttgta agccttttgg aaaactagtg aattagtttc 56941 aagtgatctg tgatctggcc ctcatatgcc tactcagagc aagcctaggg ttcctaaatt 57001 ccagtgggaa ggggactgca ggtctcctgt tgttagcaga gattttctga gccctgctct 57061 tgtccttgcc tcctcactcc cttgatagag gagaaaacac gtgccatgat tttttgttcc 57121 tctttcacta acattttctt gtgcacacac ctaaatttta gttatcaact tcactctgaa 57181 tctctcctcc ctcctttcca aagatacagc aaaacttgtt ctgttccaag tgcttggttg 57241 cgttttctgt tggaagacta accatgtctc ccataaagca atgggccaca tggacatacc 57301 ctgaggcatt tcacactggt aaaatcagga ggctgtcaga aaactaccaa cagctaaaga 57361 ctcaccaatg 
aaagagctat gcaagaattc aaataacatt cgtcaaatag ttttgtgctt 57421 tgatcctgct gggttttgtt ttgttttgct tccacagtta ctttgactaa taatgaaaat 57481 gttattgcat gactgatttt ctccccaatt ctgtccttag tcaaatcaga aaatattaat 57541 gaaatattga tattcaaatt gtagacatgt attgccctca cctcatgcat tttaaccttc 57601 taaaatccaa atcttaaagt tcatcgtttg attgaagctg gtcatcgttt gatcagagat 57661 tggcacattt tttgggtaaa gggccagata atacatattt tagggtttgc aggccgtctg 57721 gtctctgttg caactactta attctgccat tgtgtctggg aaagcaactg cagacaacaa 57781 aggtatggct gtatggcggg ctggcccagt ggctgtagtt ttctcacctc taggttagac 57841 aatggaattc tcagtaaaaa tctttggtca ttgaaaggac tcccctctac tcaggaggaa 57901 attctgagtt ccagagacaa ctgtgtttct atgtttggtg atgaaatgag ccatcttcta 57961 atgctatcag gataagatgt gctatgtaag acacaatgat gatgacgacc caactgatct 58021 ctgtactgtg ttgatatatc actcattttt tgaaactagg ttttaagact atcattatca 58081 gttccagaac tattatgatg ttctttcaaa caccttaagt tggtgttttg aagtaagctg 58141 aaggctgttt tccaaactta agtttaagtt tatacattct attgagtgat agaagatctt 58201 gattcacaca cagatgaata attgcaaacg gtagccattt tattatcttt ctcacctatt 58261 tctggatcta ttcaattgaa tttattgctt taaacttttt attattaaaa tgtggtacaa 58321 aattaaagaa tagaaaaata aactcccatg tatccatcaa cccagcttca acaactctca 58381 accttctgat gttatgtttc tttctgcagt tatgttttaa tctagagccc tccttttttg 58441 ggaggtggct agaacatttt atagcaaatt aaaaatatat cattgcatag taaatacttc 58501 agtgcttagc tagtggaaag gactattatc acaaccaaaa agataatcat aattccataa 58561 ttaacatcta ataactagtt aattttcaat ttttctgatt gtcttaaaca tgtccttaaa 58621 aaaaaaaaca gttggtcttt gattcaatct ccaaataaaa atcacaggtg aacttagttg 58681 atatgtctgt tttactccac aagggatccc cctatttttg ttttttaaac cacatttatt 58741 tgttgaagaa actggagtat ttgtcctgta aactgtctta cattctgaat ttgactgttt 58801 tcagggtaat ctaatagctg atgaaagttc ttgtttatag gagaatcaca gcttaaaaat 58861 gcagaagaaa taataggaag ctatctattt aaaaaaaaat ccctaaggaa aggattgatc 58921 taggcaatta tcatcaatgg ctgctaaaac cattaggtaa atggttgatt gggaaattca 58981 tgatatcaga ccgatatcac ctgaacccta atatcaatga tagcattgct aaaagtgggg 
59041 caaccagata ttgtgtgcct cctggtatca gtaggaagtg cacactatca ccagtgatgt 59101 tgtcgtgccc ccacaattga gcctctagtt ttaagaggag cttacagaaa acatggggaa 59161 tagagaaaca tggtaaccaa tttttaatag cttgccttgt atacttaata ttcaattaaa 59221 ctgatcctag aaacactaaa atgtcaatga tagaatactt tccaaatgta aggaggcacc 59281 catgtactta ttattgctct cttttgtcaa gatggcagct tgttccaaac tggtgtttga 59341 acattgtttg gcatttgagt gcggcaaacg tttgcttaag ttctgttcct gtttgtacct 59401 ttttccatga ctgaatattc acacatctga actaattgcc ttccctcaga tataatataa 59461 ttatagtctt cctgatgaca catttacaag agttcgagca tgtatctgga ggtggtagga 59521 gaagttttat tctgagagct gcatgaaacc cattgaacct gataacaaaa attagcgtac 59581 tggtggctgc caaagttgta gaatttgcaa gtgttgcttt tttattcatt taaaaatagt 59641 taaaactata aaagacaaaa tctaattgga tgaacatttg aagccccccc cacttatagg 59701 ttttgtggaa catagtttga aaactactgg tatagttaaa aaacaaacaa aaaaaataat 59761 aggctttggt atttaaaaga ttgaggtttg cagtttagct ttgtgatgta ctcaccgaat 59821 atcctcgtgc aaattacttc acctcgctga tttcactttt cccatgtgta cggaagggtg 59881 aattagaatt tacttttaat cgtcattgac gttcaaatga attgactttt gtgacagttt 59941 tcactgtaga tactggtcca cagtagcaca cgacacaagg ttccgaacct actcttcaag 60001 catgacccag caactggtca ggaccatttg gagaatgttt ttctttgaca tgtggtaccc 60061 gatatgcctc tcagagctca aactcagtag tgcaaggaat aggaattttg tttactcaac 60121 ctaatatttg acttttagct cccacccaac cttttggtgg gaaactgaac cttagtctag 60181 gcatttacct cccctcttgg tggccccatt tctcggtctt ttcctagggt attgggaaac 60241 tgcaataacc agttatctgg tttctgagaa gaaacccact atggaaccat gtagcctctt 60301 caagtgtgcc cctctgatgc tgaggtcctg cgactcccat tccttctcag atcctgccat 60361 gttttggggt gatcagttcc tctgctcatg agaattcttg acttcttttg ctgcttgaaa 60421 gatcctcttt ctgctctggg agaaatccct cttacctgaa actctcttcc ctcaatgtcg 60481 ttaggcttcc ttttccatcg aattcaattt tcttcggtgt gcttagcttt tctaatccca 60541 caaacccaca ggaataagag aagggccttt tgggcaggca cagcactgtg ggcatccttt 60601 tgggtgagca gatggtgcct ttgggtgaag agacaggcac cctttctaag ggttagatat 60661 atccccatgc caaagacatg cagtttggcc agcaggggtc 
gctctaatct acacagcgta 60721 cttctgcaaa gctcccctgt ggcaaaatga tacagggttt gctacaggct taaatgaggt 60781 aaaggaattc caaaaataaa gaaaacaaca gaagttagtg accaaaaaca ttatcttgct 60841 gcacacgttt ggatttattc tatgtcagga gatagtatga ggatttaagg gacaataaaa 60901 gacgtgagta tatgtgatcc tgaaagagta tttggaaggc caaggatgtc ccagtatgta 60961 ttactagtta atgaagacaa caacttaaag tcatcatctc acaacagcct gcatttttct 61021 ggctctcagc aaaactcgta aaaaatggca aggagaaact ccaagaaaat ttgaaaagtt 61081 tcattctctt tccttttcct ttttttctca ggaactgaca gaacatagaa ttttatttta 61141 ttattgttat tttttattga tacgtaacta cacattttca tggtacatgt gataatctga 61201 tacattcata taatcaaatc atggtaactg ggatatatcc aataatggga ttgctgggtc 61261 aaatggtagt tctgttttaa gttttttcag aaatcaccaa actgctttcc acagtggctg 61321 gactaattta cattcccacc agcagtgtat aagagtttct ttttccttat atccatgcca 61381 acatctatta tttattgact ttttgataat aaccattctg actggtgtga gatggtatct 61441 cattgtggct ttgattcaca tttctctacc gattagtgat ggtgagcttt ttttcatatg 61501 cttgttggtc acacgtatgt cttcttttga aaagtgtctg tttatgtcct ttgtctactt 61561 tttaatgggg ttgtttggtt tttgcttata aatttgttta tgttccttgt aaattctgga 61621 tattagacct ttgtcggatg cacagtttcc aaatactttc tgccgttctg tgggttgtct 61681 gtttattctg ttgatagctt cttttgctgt gcagaagctc tttggtttaa ttaagtcccg 61741 tttgtcattt tttgtttttg ttgcaattgt tttgggcatc ttcatcatgg aatcttgccc 61801 aaagtaattt acagattcaa tgctatttct atcaaactac caatgacatt gttcacagaa 61861 ttagaaaaag gctattctaa aattcatatg gaatcaaaac agagcccgaa tagccaaagc 61921 aatcctaagc aaaaagaaca aagctggagg catcacatta cccgatttca aactatagta 61981 taaggctgta gtaacaaaaa cagtgtggcc ggtacaacta cggatacata gatcaatgga 62041 agcagcccag aaataaagct gcacaccaac aaccatctga tctttgataa agtctacaaa 62101 aacaagcaat gggtaaagga ctgtggtgct gggataactg gctagccata tgcagaagat 62161 tgaaactgga cccatccctt ttatcatata caaaaatcaa ctcaagatgg attaaagatg 62221 taaatgtaaa acctaaaact gtgaaaaccc ttgaagaaaa cctaggaaat accattccag 62281 acataggccc tgtactatat tttgaagtca ggtaatgtga ttctccagct ttgttatttt 62341 tgcccaggat cgctttggct 
atttggggtc tttcgtggtt ccatataaaa tttaggattt 62401 gttttttcta ttttgtgagg aatgtcaatg gtattttgat agaggttgtg ttgaatctgt 62461 aaattgcttt cagtggtatt gtcattttaa caatattcta ctaattcata agcatggagt 62521 atccttccat ttttttgtgt ccttttcaac ttctttcatc aacattttat agttttcttt 62581 acatgaatct tttatttcat gggttaaatt gattcctagg tgttttatat tgtaaatgat 62641 attgtttatt gatttctttt tcagattatt tgcttttggt gtatacaatg ctgttgattt 62701 tgtatactcg aactttacta catttgttga tcagttctaa tagttttttg gtagagcctt 62761 taggtttatc caaatataag atcatgttat ctctgtacaa ggctaatttg acttcttcct 62821 ttccagtttg gatgcccttt atttatttct cttgcctaat tgctctggcc aggacttcta 62881 gtattgcgct gaataaaaga ggacaaaaca ggcatccttg tcttgttcta catcttaaaa 62941 gaaaggcttt taatttttca ccattcggta tgatgttggt tatgtgtttg tcagatatgg 63001 cttttacttt ttgaggtgtg ttccttctgt acccagtttg ttgagggttt ttgtcataaa 63061 gggatgttga attttattga atgcttttac agaatgtatt agaatgatta tgtgattttt 63121 gttcctggtt ctgttaatgt gatgtgttta tatttatgta cttgaatttg gattttccat 63181 ttaatttatt ttttttgaga tgggatctca ctatgctgcc caggctggac ttgggttcaa 63241 gtgatcatct agcctcagcc tcctgagtag ctggtactac aggtgcatgc cactgcaccc 63301 agcatatcat gtctattgat ttgtgtatgt tgaactatcc tggcatccct ggattaatcc 63361 cactcaatca tagtgaatga tctttttcat gtattgttga attaggttag ctagtatttt 63421 gttgataatt tttgcatcta tgttcctaag tcatattggc tggtagtttt cttttctcct 63481 tgtgtccttg tgcagttttg gtatcggggt aatgctggac tcatagaatg agtttggaag 63541 tattcccttc tcttcagttt tttttttttt ttttttggaa gagtttgaat ataatttgta 63601 ttgcttcttt aaatgttttg tagaatttgg tagtgaagac atcaagtcct gggcttttct 63661 ttgatgggag accttttatt atggcttcaa tcttgctact aggtattagt ttgttgaggt 63721 tttctacttc ttcatggttc aatgttggta ggttgcatat gtccaggaat ttattcattt 63781 cttctagatg ttccactttg ttggcatata gctgttcata atagtcttta taattctttg 63841 tatttctgtg gtctcagttg tgtcttcttt ttcatttctg attttattta tttgtgtctt 63901 ctctctttta ttgttagtct agctaaagat ttgttgattt tcaaaaaaaa agccaacttt 63961 tttcattaat tttttttggt gtttttatag tctacatttc atttatttct attttaatct 64021 
ttattatctc tgtctactaa tttggggttt ggtttgttct tatttttcta gttccttgag 64081 ttgtatcatt agattgttta tgaattgtta tttgaagtct ttcaaccttt ttgatgtagg 64141 cttttattgc tataaaattc cttttcagta ctcctttggc tgtgtttcat agattttaat 64201 atgttgtaat ttcattttca tttgtttaaa gaaattttta aatttcattt ttattttttt 64261 cattgaccca ttggtcattc aggagcatat tgtttaattt ccacgtgttt ggccaggcat 64321 ggtggctcat gcctgtaatc ccagcactat gggaggtcga ggcaggtgga tcacctgagg 64381 tcaggggttt gagaccagcc tgaccaacat ggtgaaactc tgtctctact aaaaatacaa 64441 aaattagcca ggcatggtgt cgggtacctg taatcccacc tactcgggag gctgaggtgg 64501 gagaattgct tgaacctggg aggtggaggt tgcagtgagc tgagatcatg ccactgcact 64561 ccagcctggg caacaaagcg agactccatc tcaaaaaaaa aaaaaaaaag attccatgtg 64621 tttgtgtatt ttccaagttt cctcttatta ttgattttta gttccattgt ggtcagaaaa 64681 gataactgat atgatttctg cttctttgaa tttgtggaga cttactttgt ggcttaaaat 64741 atgatctatc ctggagaatg ttccacatgg tgataaaaag aatatgtatt ctgcagcaat 64801 tgggtgaaat gttctgtaaa tgtcagttag gcctatttgg cttagtgtct agtttaacgc 64861 tgatgtttct ttgttgattg tctagatgat ctatccatta gagtggagtg ttgaagttcc 64921 ctatcattac tgtattgcag tctccctctc cctttagatc tgttaatatt tgctttatat 64981 acttaggtgc tccagtgttg ggtgcatagg tatttataat tgagataggc agtatattgc 65041 tatatctcca ctaggtcctt gccttttttt ctggcactaa atggcacctt aagcctacgt 65101 ttgctttgac tctagcccaa gatcagagca ctcccagtcc agatataggg gaggtcctaa 65161 agaggatatc ctggcagtgt ggaaaagctg gctggcggct agtgtacagg tgacctttgg 65221 aatgaacctc ctatagcata atgctgctga acagccactc tgatttggtg tctcctttgg 65281 ccgagttacg gagcagagtt tccagggctg ggtatggtag tcccacctcc cctctttgtc 65341 tcttgctgtc ttcagaaata tatctccctt caggcactca tgatgccttc tgtgggttaa 65401 ggtatgggca ggtcttctgt caaagaaccc aagatggtca ggaagctggt tgtccacctt 65461 gatctcactt tttccagtgt agaaactatg agtcagggaa aatttctgca ggcttgttgc 65521 tggacagatt ggttggaagg gcatcataga gaaattttat tatcctttgt ctgcttggag 65581 ttttttcact tctctgtggc ctcagcaact gattcatcct tatatttgag ttctgggata 65641 ttgctggtga gaatcttagt gctgtatatt tgtttttagt ttcctgtgta 
tatgttgggg 65701 tggggaggga agccagcttc tgtctccacc accattttga aacctgaagt ccagactcca 65761 tttcctacct caaaatttgt aagctctttc ttcttgtatt actactttgt cttctaactt 65821 gaggtttatt atcattggaa tgccagttgt aattggagaa aacatttata ttttcagtta 65881 aatgtgttgg atttcctaaa aaacatactt tattttctgc ttacagtatt tagatcagtt 65941 gtaaatgcaa ggatttaaaa aatctcaacg cagtgtcctg attgcttcat gagaaagtct 66001 tttgcctcaa gagtttgtat gtttctaatg agatttattt gactgacctt attcaatttg 66061 ttggttggtg tgtttaaaat gttccctaga aaccataaca atctgaaagt ttataaaagg 66121 tctggtaacc tcacttgaat tatctaaact tttacaataa gtggatttaa ctttatttca 66181 tgagagggaa actttggctt tcttgataat atctacacag tttcctttcc tatgagactc 66241 aagctcaaaa gattcttcaa gtggtccaaa tttaattgaa agccaaagtg tcttaatcat 66301 aatattgtgt ttcatgatcg tggcttgctt aatttttgga agaaatagtg gcagattggg 66361 aattgcagta ggacgaagcc aggaaggctt gtgactttgt tgtttcaggc caatgagaga 66421 gggttgaaca aaataatttg aggattttag cagtatacac agacaagatg ggtgttttga 66481 aagtgccttt tttcccctga aacaagagtt atgtttttcc tggaattttt cataatatca 66541 ctaattctta acgtggcctt tgtaaaattt gttgtatatt aagggcagtt tcatggaaaa 66601 ctgtaatctt atacaaattt atgtatttaa gattatagta agacgaaaaa gaagctggag 66661 gagatgggtc aaaacagttt aaagtcatat aaactaatgg tatgggcatg taggattcag 66721 tttttattca gatattacta atttttaatt tgttgtttag attttgtttc ttattatttt 66781 ggcttaatgt ttaatgtgtg aattagagat ttaatttgga aaaatctact cttcttgaga 66841 ttcattttat ttatctttag tcagttgctg atattcaaga aagaacaaac cacagattta 66901 gagaaggtgc tggaaatttt tttaaagtat cttaaaattt acataaattc attttttgtg 66961 atgtacatgt ctaagttttg acaaatgcat agagtcatgt aaccaccact gtaatcaaga 67021 aatagaatag tcccatcatc cacccccaaa tacctcatgt tatccctttg ctaagtcaag 67081 ccctctcctc atgcccaaac cctgttagcc atcactttgt tccctgttcc tgtaaagttt 67141 ggctttttca ggatgtaagt aaaattcacc catgttgtat cagttcattc tcttttattg 67201 ttgagtagta ttccattgta taccacagtt tatgctttca tgaattggag gacatttggg 67261 ttgtttccag ttttgggtga tattgaataa aattacatac aggtttttgt gtgactttaa 67321 atttttttcc tgggtaaata cctagaagta 
ggattgctgg ctcttgtgta aatgtatgtt 67381 gaactttata agaaactgta aaattggttt ccttccaaag tggctgtgcc attttgcatt 67441 cttaccagta atacatgagt gatccaactt ttccacgtcc ccaccagcag ttgtcagatt 67501 tttttttcca ttttaacaga tctatggtag tgttttattg tgggtttaat ttacattttc 67561 ctaatgacta atgatgttgg gcatattttc gtgttcttat tttctatctt catatgttct 67621 ttgatgaagt gtctgttaga atcgtttgct caattttaaa ttgagttgtt tcttttctta 67681 tggttgagtt ttgaggaatc tttatgtatt ctagatacac gtcctttgtc atatatgtga 67741 ttttcagata ttttccccat tctgtggctt gtctttttat tctgttaagt ttctttcaag 67801 agcaaatttg ttcaatgtcg ataaagtcta atttattttt tcttttaagg attgtgattt 67861 tattgtcatg tctaaaaact ctttgtctaa cccctagtaa caaaaatttt tcccaatatt 67921 ttttccagaa gttttatagt tttaggtttt acatttaggg ctatgattca ttttcagtta 67981 atttttgact aaggtgtgag gtatgtgtaa aggtttattt tctttttgct tatggatgcc 68041 cagtttttct tttctttgag atggagtcac actctgtcac ccaggctgga gtgcagtggc 68101 gccatcccag cttgctgcaa cctccgcctc ctgggttcaa gtgattctcc tgcctcagct 68161 tcccaagtag ctagaattac aggtgcgtgc cgccacacct ggctcatttt tgtattttta 68221 gtagagactg ggttttaccg tgttggccag gctggccttg aattcctgac ctcaagtgat 68281 ctgcctgcct tggcctccca aagtgctggg attacaggca tgagccacca cacaaggcct 68341 ggatgtccag tttttctaac acgctctttg aaaagactgt cttctttctt ttgtttacct 68401 ttaaggtgct gaaagttttt tttgtttttt ttttgttttt tttgtttttt tttcctgaga 68461 cagagttttg ctctgttgcc caggctggag tgtagtggcg tgatctctgc tcactgcagc 68521 ctccatctcc caggttcaag caattctcct gcctcagcct cctgagtagc tgggattaca 68581 gccacacgct accatacctg gctaattttt atattcttag tagagacggg gtttcaccac 68641 gttggccagc ctggtctcaa actcctgacc tcaagtgacc cacccacctt ggcctcccaa 68701 agtgctggga ttacaggtgt gagccaccac accagcctaa ggtgctgaaa gttttaattc 68761 aagaatttat actatacttt tctatcactg taattctact tatttagttg tctggttatt 68821 atttgtgttt ttatcactta tcctaaacct tccagagagg gttctaacat ttgcgaaggt 68881 ttctcaaaac aaaattatta tttgattcca ttcaaataaa aaaaaatcaa tgccatatgt 68941 tattttagag ataaggagtt ggaaacgtct gattaatcaa ctctatacag attggcattt 69001 tgcaaaacac 
tgcaatgaga aaaatactgg aactggtgat aggagacaac gattttagtc 69061 tgagctctgc ccaatcatgt taccttggga aagtcacaac aattatctgg cttttgtttc 69121 tcctcccatt gaataaggga gaatagctaa taatctctaa gatctattcc aactgaaata 69181 gccttgttgt tattcagtgt tgataggacc tcaaaaggtt tagaaaacat aaattagaaa 69241 agcaggggca ggtctctgat agtttattat atctgtcatt gccaaagtgc ctatttgcat 69301 gtaattctga gtgtactgaa ttgtttttgt aatggtgaca tgagctcaaa atatggctgg 69361 gatgcagacg atttcatcac tcttgttaac atggtacgta agaatttaca gtgctagcag 69421 aatgctttct gtttttgttt ttacagctgg accaagcaac actctccctg gccgtgaggg 69481 aagactacct tgataacagt acagaagcca agtctgtaag ttttactcat attcaactat 69541 gtgccttacc aggctgctgt caggcccatt agttctaagg ccgctgcttc tttattggaa 69601 atgagcagtg tggctattca atgaccatat tcccaatgca ggaagacttt tattcaggca 69661 gataaattcc actccctgtc tttctagaag tctgctcatc aaggctgtac aagtctcagt 69721 actcaatgcc cttatctagc ttgggagaca gcactcccaa atgaagtaac aaagaaatga 69781 gcttgcagtc attcattcct cagaagtgca gagagctcac tatcaattgc agaaataaag 69841 cgtcagaaga gatacccagg agacagatgg ttcccaaatt gtactgcttt agaatcactt 69901 gggaggcttt cattaaatgt tgcttcttgg gtcacagccc tggagagtct gacttagact 69961 tgaggtaggg tctgagaatc tatattttta acaggtgttc caggttaatg gaggtttaat 70021 aagaatcaga aaggatcaat caagttagac ttcttggatt ggcttagaaa tctagcagtc 70081 tcctggatat tggaaagagc tggttcattt ctgacacttt caatttattt tggaaacttt 70141 attgaacttc caaaggagcc tcagacctag gtcccctggg tctatgccga tactaacttc 70201 tatacataga aggctctgga gggaagctaa ctaagatgct gggcagcttt cttgctgagg 70261 acactcacca aagagggttg gcaagttttc cttcctaaaa gtggacagaa tgaacatttg 70321 ccaatgtggg gggagggtgt acccctggta ttgatggagt ctcagcaacc aggctcaaaa 70381 tcgtctcccc ctataaccta tctgtcaagt cctaatgatt ctaccttcaa aatatctctt 70441 ccatccactt ggtccctctt ttctatcctg actgctatag gtcagattct tgttaactct 70501 tgccttgact acagcaagta gccttttaac tgttactgct tctccttcat cacattgcct 70561 ccttagcacc agttgacaag ttctgttctc aaagcactgc tcctagcatg ccttcgctcc 70621 caagacaggg tctaaattct ttagagggct ttcaaggctg cctcatgact ttatcccaag 
70681 tcatctttcc tatctcatct ggatttgttc ttcccccaac aaaagcagct tgcccactgt 70741 attagtccat tctttttctc ctaataaaga catacctgag acttggtgat ttataaagga 70801 aagaggttta atggactcac agttccatgt ggctggggag gcctcacaat cgtggcagaa 70861 ggtgaatgag gagcaaagtc acgtcttaca tggtgtcagg caagagagct tgtgcagggg 70921 aactcccatt tataaagcca tcagatctca tgagacttac tcactaccat gagaatatgt 70981 gggggaaacc acccccatga ttcaattata tccacctggc cctacccttg acatgtgggg 71041 attattacaa ttcaaggtga gatttgggtg gggacacagc caaaccatat cacccacctc 71101 acaggaccct tcactgcgtt ggaaccagtc tccctcaaat gctatgtccc cagtttctcc 71161 gggctagatt tttctctcag atcttcagtt tccaagcgtg tagcagcttt attaaggaag 71221 ttattttgat gtctttaatg ttacagttca taattacatg tttatctccc tcactgtttg 71281 ggaaccagac acagcattat tatttttttt ctgtaccacc ctctcctaac atatgcacag 71341 agggcctacc aagccacctt gtacatttga gtaggtactc aataaatatg ttttttggtt 71401 tttcttttta gatagagtct cactgtgttg cccctgctga agtgcggtgg cacaatctcg 71461 actcattgca acctctgcct cctggattca agtgattctc ccacctcagc ctcccgagta 71521 gctgggatta tgggcatgca ccaccatgcc gggctaattt ttgtattttt agtagagaca 71581 gggtttcacc atgttggcca ggctggtccc aaactcctgg cctcaagtga tctgcccacc 71641 tcggcctccc aaaatgttgg aattacaagt gtgagccacc acctccggcc aataaatatg 71701 ttttgaattg aattaggaat agcaaaatat tatttctatt taagacttct tttcttttct 71761 ttctttcttt tttttttttt tttttgagat ggaatttccc tctgttgccc aggctggaat 71821 gcagtggcat aatctcggct cactgcaacc tccgcctcct gggttcaagc gattctgcta 71881 cctcagcctc ccgaatagct gagactacag gcgtgcgcca ccacgcccag ctagtttttt 71941 tgtattttta gtagagacgg ggtttcacca tgttggccag gctggtcttg aactcctgac 72001 ctcgtgatcc acctgcctca gcctcccaaa gtgctgtgat ttcaggcgtg agccaccacg 72061 cccagcctat ttaagactct tttcaaagct tgcttactga ttcaacaaag atttattaaa 72121 ggcctcctgt atgctaagta ctgtataggt cctaaggtca tacatatagt aagaatagct 72181 tacagagtca aggtataagt tagtaaatct tattcccaat gcatgagaag gaggtcattt 72241 aatacctgat gcgttgagat ctcacaaata tatctgaaat gagaggtggg cctaaggcag 72301 ttctttaatt cccattaagt aaggtcaccc ctgccccacc 
caatgacata tgattatccc 72361 caaaatgggt ttatgagtat gtaagaagga aattctatat ttgctatttg ttaggtttcc 72421 tatcatcgtt cctctctttt ctcctttttt tcctgagcta gggagcagga ctttgtgaaa 72481 actgtaaaaa cctgaaaaga cacagggctg ctgtcactgg tcttttattt cttagtgtga 72541 aggtgggtaa attgagttgc aagagaataa atcacacctc ccaccattct ttatggatca 72601 ggcctcctgc ttttagaaac agttctttcc cattgtttta ttaaagcctc acttgtctgg 72661 aaagaggaag tctgaccttt aactttctat tcttgtctca tttggagtta tttcatgtat 72721 ccataatatc gacggttgcc agaaaactag ttctcttgtt gcttgatggc tggacatctc 72781 tgtctgtcat ggaaacagta ctcaagtagc cgtcactctt tttttctccc aaagtgttat 72841 atgggtgcct ggtattgcat aatgcctgca ccatactttg agtacagagt gaaaataaaa 72901 gacatttatt tctacctgct attttctgct cttccatgtc tctcaaacat atttcctaag 72961 aagctattgt catggtcact ttgttcttta tttcttacag tatcgggatg ccctttacaa 73021 gttcatggtg gatactgccg tgcttttagg agctaacagt tccagagcag agcatgacat 73081 gaagtcagtg ctcagattgg aaattaagat agctgaggta agtcttcact gaaaatctct 73141 ttctttcctt tactttcttt tcttttcctt tactttcttt tcctttcctt ttattaaagg 73201 cagtatatcc tgaatgaaaa tataaattgc aaacaaataa tagcgttttg tgtcattgcc 73261 cattgggccc aaggtagaac taatagcatc ccctctagga agtaacaagg actgtgacag 73321 aaaagctcct cattccttcc acctccctgg tttaagccca ttcagtgtcc atacgttgct 73381 cagctgtggc caaagaatag cagcagagag accctctgac cgtttttcct ggagaaactc 73441 atgtttggct taaatagaag atagatgttt aataaatcct tagtaatctg gcttgaatgg 73501 aaggttgctt ttaatccagc ccatgtttta tgctatcccc actcagagtc attctggctt 73561 aagttctggg ctcagggaaa tctgtccacc aatactggat ataaaagttg gtatagaaat 73621 aggtaggaca tgctggtaag catttttaaa cgacctttct ggtattactt tgtacctgcc 73681 cctgtttttc attattttct tgccctactt taatgaatat cattgtgtca tatacatgac 73741 acagaactac atgtctatgt aaattacctt aacttcttgt tggctcattg tagagacaaa 73801 aagatgtaaa aacaaacttg tgggcaaaat aatgagaact cctaaatggt tagctttgtt 73861 tatggaataa cttttttcta gaagaaagaa agtgtaaaaa tctctctata tgttcctagg 73921 ccaagaggga ataaatgtgg taaaatctgc ttctggggaa accctgtagt gaaaaaaatg 73981 atacttttta ccaagaaagt 
tttgcgtatg tgttgggaat cttttacaag agaagttggt 74041 aaaaatctcg agatgaataa gagtccaaat gttgattggt tttgtttggc gaggtggtgg 74101 tggtgtgttg aggtatggag tagggatgaa ggataagggt aaggaggaat tggtctctct 74161 ctctttgccg tggtgatgtt cgtgtatagc tctctccaaa ctcctacttg cagtgcaaaa 74221 agttcattcc tgcatgtcac ctttgaagac tgtgaagact gctgcagatg tttcgaagtc 74281 caggactgtg cctctgagcc atgtctagag tctgtctgaa ttctctttcc aggcacatag 74341 gaggcccagg gctctcattt ggactaaaac tctgaaacag cagggaaggc cttgccctga 74401 gccagggatt tggaccaaaa agggaactaa aaacagctgg gctgaggcaa gactgggaat 74461 tgtcaaaagg ctaaatgaag tgacaaatct attgtgtgga ggggagtggc tggcagagag 74521 ctgaaaacag ctctgaggcc aaagggctcc aacttggggc tgtgagactc cgagggcgga 74581 ggagggattg gtggaagagg agtgtggata tcctctgagg caatgattca caccgagcca 74641 ggaggctgtt tgtaagttac atgtggtctc tcctgtggag ggtgaagaaa gggaagctaa 74701 ggacactgaa gcatttagca aagaggggcc agacgtctgc cagtcagccc tctgtaattt 74761 ttaccttgaa aaatcactgc tagtaatgct cacttccata tcttacacta cacatcacta 74821 ggtactttta aaattagtgg gatacatagt cctatggaaa cttctgaagc tgggaagaca 74881 agacagtcca gcccagccaa atacgtcctt gtaattacaa ccagggaagc atatacttaa 74941 ccaatccagc tagagagatg tgtgcacaag gagaggtttt attgatggag tagcccttgt 75001 gattggaacc aagtggagtg taattggaaa ggcttgctag atgcagtgag tcttgggcag 75061 tcctgagtca tagttcaaac catggaggtg gtgatggtgg ataggagatg caagagtcgg 75121 agagtcatgt gtgcagtggg aaggtggcaa gtttaatgca agggaattag atgatggaag 75181 gcccgaaggc aaagtcatga cgtgattgag gggtggatat gtgagctact gctggtcaat 75241 ggaagtgatt aagatgatcc ttgaggaaga taatcattgc tgctgttctt aggttaaaca 75301 tagtggaggc tagaagctca gatttggtgt tggccaaatg gccaagagca tgagtattag 75361 gaattgaaag agtatagagc cagggctaga gagaggcaag ttttgaaatg tgtgtccagg 75421 cggtgactaa atccataccc ccagcaatgg ggctagggaa atctcaaggc caaagagaca 75481 agaggggaaa tgagggaatg agctgttgac aaaatgatca cggggaacag tggcatccag 75541 atagtcttgt gttcatcagt gaaaatttgc ccctagaggt gaagggatgg tctgtggttc 75601 atgaaatatt gaaatcatta tctggagacc acaggtgaca atgggatgtt tacctcaatg 75661 
tccaacttaa gtttgattgt caaaattgag ccagcaatag attttgccca tccacattat 75721 agattgggag ggagatttgg ggtagttctt tttgtactgg ggacaatatt tcacacagag 75781 gaggatatga aagtttattt tcttaatcct gcctaaagtg atgtcttttt ttctcttccc 75841 cgagttgggg accacaccaa agccttgaaa aactttataa tactatgcag atgttttggc 75901 acatgtagga agtaatcata cagtaagaaa tggttcactt gaaaaaaaat aacaaaaatc 75961 tcttttcaac agataatgat tccacatgaa aaccgaacca gcgaggccat gtacaacaaa 76021 atgaacattt ctgaactgag tgctatgatt ccccaggttg gtgaaaacta tccagaaaac 76081 tttctctcaa ctcatcattt tagaagggat tggatttcat ctctgtgtaa atttgtgtct 76141 tgtaaaacct gctctcagtc ttggttatca tcatctttgt caacaatcac ataaaatatc 76201 ccttgtctgt gtattgggat gccttctgtt acagggaaca aaacatcttt ccaacagtgg 76261 cttagaaact taaagcctat attctcacat gacaaaagtc tagacatagg taattgttgg 76321 cattggctca gctgtttaac agtgttgtca aaggcccaag ttctttctat ctttccatcc 76381 tactgtgttt agtgtgttgg ctttttgact tcatgtttga tgactcatgg ttccaaggta 76441 tccatcacag ctccatatat cacatctgct gtcagagcag gaagaaggtg aaagggccag 76501 tgatgtgtct aactctttta ctaggaaaaa caaacaagcc tttccagaac atctccagat 76561 gtcttccctt tatttcttat tgagtaggat tgggtcacat ggccactcct aaccacaggg 76621 gaggttgggc aagtgagtaa ctggttttcc gtttcaacag tggacacagg caaggaagaa 76681 aaggttggga aaagactttt agagtagcca atctaaaaaa caccatatgg tgttttaggt 76741 aggataggga tgctctgaag aatggaggca tgttactttc tgtgttttta tttgtttttt 76801 tttgagacac agtcttgctg tgtcatccag gctggggtgc agtggtgtga tcttggctca 76861 ttgcaacctc tgcctcccgg gttcaagtga ttctcctgcc ttagcctctc gagtagctgg 76921 gattacaggt gccggccacc acacctagct aatttttgta tttttgatag agatgggatt 76981 ggccaggctg gtctcaaact cctgacctca aatgatccac ctgcctcggc ctcccaaagt 77041 gctgggatta caggtgtgag ctaccatgcc tggctgcatt tactttctgt tgctgcttat 77101 ttcctcttgt gttttttttt tttttttttt aaaaactgtg tcacagagac aaattatgta 77161 ctggggcatc tgtttctgtg tgtatgtgag agtgagggag agtacaggac agagaaaggg 77221 agggagggag agagaaaatg taccgtatag aggaagaatt tagaaggagg aagagaaatg 77281 caggagagaa ggagggaaga gtatgggagg aagaaagagt atagaataga 
ggaagagtac 77341 tggacaggga gagagcagaa tggggagaga aagacagagg ggagagttca ggaaggagga 77401 agagtactgg agggggccaa gggagtgagg gtgcatatta ctgtgttagt gtgtgtgtgg 77461 gtgtcgtcca gatgtagatg gtacagaaca catgctccat ttgtgattaa gacaataggt 77521 caggccgagc atggtggctc atacctgtaa tcccagcaca ttgggaggcc aaggtgggtg 77581 gatcacttga agtcaggagt tcgagaccag cctggccaac atggtgaaat tccacctcta 77641 ctaaaaatac aaaaaaaaaa aaaaaattag ccaagtgtgg tggcaggtgc ctgtaatccc 77701 agctactcag gaggctgagg caggaaaatt gcttgaaccc aggaggcaga ggtaacagtg 77761 agccaagatc atgccactgc actccagcct gggtgataga gcgagaatgt ctcaaaaaaa 77821 aaaaaaaaaa aaaaaaaaaa aaaaagacaa taggtctgga tggcaatgat caggagttta 77881 ttttctagac ttgagtagtt gcatcttgaa ttattaggtg tgaaatcaaa agaaatgaat 77941 gatgctcaat ctactctatc tgcctttctt acttgctacc taaccgagat tctctcattc 78001 tgttttgttc tctctcccct cagttcgact ggctgggcta catcaagaag gtcattgaca 78061 ccagactcta cccccatctg aaagacatca gcccctccga gaatgtggtg gtccgcgtcc 78121 cgcagtactt taaagatttg tttaggatat tagggtctga gagaaagaag taagaacttt 78181 cacatgaatt ttactgtgac ttttgtgttt tctttaaatt ataagggctt cccttctcac 78241 atcctttgaa aactgttttt aaagaatgtt acatcaaaag agtcataggg aaaatcccgg 78301 tgtcctttgt gatgaactgg ccaaaagcac tgtctggtgt aatttacttg ctgcaatgat 78361 aacaaatgtt tttgcccttt atttctggtc tttctccatg ctgttctctt tgagatccta 78421 aattgtttat gagcttgggg agctagtcat ataatgatgt tattttctaa aatgtctgtg 78481 tgaacatctt tttttttccc cagttcattc tcaatctttt gctgcttttt tttttttttc 78541 ctggttcctt tttaaccatt aagtcacttt ctggcaataa actaaagtaa ctgtattgag 78601 aagacacaag accaccttga gagtggtaga aaaattgcac ggataaaaca actttaggaa 78661 ctgccaggtt tttcttattt taaagaatgt tttagaattt gttgctttgc tagtttattg 78721 ctacaaagaa aaatcttcca atattccagg aaggagaatt ttacaggttc cttaaggcat 78781 tcagctttga ctacagaaaa tgaatcacag agcttacaaa atgaatcaac gagtttacaa 78841 aacactttac tgagttaatt gctgggccat gattaaaatc acattttctt atttggactc 78901 agacaaggct atgttgccag aagttacgtt ttttaaaaaa ggcacaacag tccttggaag 78961 aaaatgtagc atttatcccc tacggaaaca 
cttagacacc tgagaggagc tgctgctctt 79021 tttaagatac tttctgatga taaaagtaat ctgtgttcac tgaggaaaac atacagaaaa 79081 aggagagaag aaacaattaa tttacttaaa attctatcat ttagagataa gccaggattc 79141 tttgaaaatg ttctaaatgg aaattgaatc tagaggtacg taattatatt gcagggtaga 79201 gccttgctcc atggaccagg aatgtcagca gcatcacctc gggtgcttgt taaaaatgca 79261 gattccaagg ccctcctcca ttgagtctga tccagtagag ctggagggga ggccaagtaa 79321 tctttatttt aaacaagtac ctcagaagaa tcttatgatt aacttttctt aggggagggg 79381 aaggggatca ctgtattaga aaatttcttt gtaagtactt cctagctctc aaatactata 79441 aaatatgacc caaacgattt tctactcata tgccattgag ttacggagca cattttaaat 79501 tttggtggta atttccattg aaaggtggag tttggcctgc tttagattca tccttcacct 79561 cgtgagtcag tttcttctag tatctgtgtt aaaagagtag gtacttctga acactgattg 79621 attttacttc caaaaaagcc taatttacaa tcaggggata gattgggatt tggtttactg 79681 tattagaaaa agcatctcta tcgtgatgat tttacttggt aattgcagga cctttcataa 79741 caatggaaaa gagcatgatt ggagggagat tttactctga ttcttgtcaa aatgataact 79801 tggacaaata taaacgtaga gaacaacagt agaaaaaaca tagatggaaa agtcacctta 79861 gatgttatgt acacatacaa cttcatttaa aaataaggac aaaaattggc caggcgcggt 79921 ggctcacacc tgtaatccca gcactttggg aggctgaggt gggcagatca cttgaggcca 79981 ggagttcaag accaacctgg ccaacatggt gaaaccccgt ctctacttaa aaaaaataca 80041 aaatttagct gggcgtggtg gctggcacct gtaatcccag cgacttggga agctgaggca 80101 ggagaatcgc ttgaacccag gaggtggagg ttgcagtgag ccaaggtcgc accactgcac 80161 tccagcctgg gtgacatagc gagactctgt cttaaaaaat aaataaataa aaataaagcc 80221 aaatcccaaa tctctttcac acagtttatc cctcaaccac cagtttataa aaagcaactt 80281 gtgttctgcg tattcttgtg agatttcaat acaacgtaca catcatgaga agtctgacag 80341 cattttattt gcagactatg gtgatgtttt ttcattgatg atcgtgttta atcaccactc 80401 gtgtagcatt acagggctac ttgaggaaaa gtcttcttgg gttcccataa ttgccatatt 80461 ggtcaggata tgccagttgc tataaccagc acctctaaac actcattggc taaattgtac 80521 cctctgctca tggaacagtc tgaacatggg tgtccctgta agagagcttt cttctaaata 80581 gtgacttagg aagccagact cctttcatct tatggctctg ccctcccttg gatgtttggc 80641 atcctctcta 
ttcagctgga ggatggagaa acagagaggg tggaggattg tataggtttt 80701 ggtgggaagg cctagaagtg gggtatgtca cttctaccca taatccataa gccagaactc 80761 agtcttgtga ctcatctagc tgtaggggag actgggaaat atagtttagc tatataccca 80821 gaaggaaaag gagttggagt ttccctagac atgcctttat caaagagcct atctcatttt 80881 cttgaaatga ccacttttct ctcaatggca aatacttccc tgctccctac ctctatgaca 80941 ctgtgagcac ctcatggcag gcactctttt ttaattttta atttttgtgg gcacaggaca 81001 tattttgata caggaatgta atgcataata atcacatcaa ggtaaatggg gtatctatcc 81061 cctcaagcat ttgtcctttg tgttacaagc aatccaatta tactgtttta tttatttaaa 81121 aatgtacaat tacactattt ttgactatag ttacctgttg tgctagcaac tactaggttt 81181 tattcattct ttctattttt ttgtacccat taaccatccc tccttccctc cttcccctct 81241 gctatccttc ccaacctcca gtaaccaccc ttctattctc tatctctatg aattcaatta 81301 ttttaattgt tagctcccac agataagtga gaacatgtga agtttgtctt tctgtgccag 81361 gttatttcac ttaacattat gatctccagt tccactcatg tttctggaaa tgacaggatc 81421 tcattctttt ttatggctga atggtcctcc gttgtgtata tgtaccacat ttttaaaaat 81481 ccatttatct gttgatggac acttaggttg cttctggcag gctgtcttgt gattgctttt 81541 ttaagtgttg tgacctagca cagtgcttct catgtagtag acactttaaa aattctagaa 81601 gttaaccaga taccccatca ccaccacaag gaggtgacat atgaatggaa ccccaaagga 81661 taaagaggaa ttttctaggt agggaagaat agagaaaagg ggatggtaag catcttcagt 81721 ggtggaacca atgtgaactt gggtgttgcc tgcaaggcag ggggctagag agctgagagg 81781 ccagcttggg ctcaggatat gtagaggaaa agcatcagat gaatgttctg aggatccccc 81841 cacctcctac ctttcccttg gaggcaccat tgaagaaaaa gccagccaag ccttctcaca 81901 ttgggcatct ctgctgtgcc tctgtttcga gaatctgcct ataaaaatca tatacaggta 81961 atggtgatgt taaccaccac tatggttaac atctttgatc tttacaacaa ttctatgcgt 82021 tgttcccact ttacagatga ggaagctgag gttaagtgac ttggccaaga tcacacctct 82081 aagaggaaac agagctgaga ttctaaatgg ggtcaccctc atttctatgt cttgtctctt 82141 tttttccata aatgtcttgg tgtgaaaaga gatacttttc agatgcttca ctgctcattt 82201 tggagatgct agcgtattgt accctcagct agttagatat tgctagcttt ggcaattttc 82261 ctgcaatgtt cagcagttca aagccatcct gtgttggtgt cagctgttca tttcttattt 
82321 tttattttta tttttatttc aataattttg ggggaacagg tggtgtttgg ttgcatggaa 82381 aaattcttta gtggtgattt ctgagatttt ggtgtaccca tcacctgagc agtgtacatt 82441 gtacccaata tgtagtcttt tattgcttac cccctcccac tctccccctg agtccccaaa 82501 gtccattata tcattcttat gcctttgcat tctcgtagct tacctcctgc ttataagtga 82561 gaacatgcga tatatgtttt tccattcctg agttacttca cttagaataa tcatcttcag 82621 ctctatccag gctgctgcga atgccattat ttccttcctg tttatggctg agtagtattc 82681 catggtgtat atataccaca ttttctttat ccattcgttg gttgatgggc atttatgttg 82741 gttccatatt tttgcgattg caaattgtgc tgctgctata aacatgcatg agctagtatc 82801 ttttttgtat aatgacttct tttcctctgg gtagataccc agtagtggta ttgctggatc 82861 aaacggtagt tctatttagt tctttaagga atctccatac tgttttccat agtggttgta 82921 ctagtttaca ttcccaccag cagtgtaaaa gtgttccctt ttcaccacat ccatgccaac 82981 atctgttatt tttttgattt ttgattatgg ccattctcac aggagtgagg tggtatctca 83041 ttgtggtttg gcacttccct gataattagt gatgttgagc atttttcata tgtttgttgg 83101 ccctttgtat atcttctttt gagaattgtc tattcatgtc ttttgcccac tttttgatgg 83161 gattgtttgt ttttttcttt cagctgctca tttctcaatt gctgcttctc ccttcttata 83221 ttactcccac tgccagtcta ggggattagg cctctttctc tgactgagtg gctgtgggga 83281 cagtggcact gactctttgg taggtggtgc atggtacaga attagagcaa tgagatcctc 83341 cagtggtggt ggtgccaaac cacaccagcc cttgcagtcc aagggggtag gtggccaggg 83401 agggaccctt ggaggaatga gctggaggct gcctgtgggg aacaagagct ggtgtttgag 83461 gtggaaaggg gggtggggta aggaggtagt gatgatccct catgttatgt gcatctcttc 83521 ctaactacat gttcggtgat ctcatgttga tagtttcaat tcaactatgg gggattttga 83581 caccatggaa attggcaaat gtaacaaatc atttttcttt ttcttgacaa ggcagttggc 83641 aacctttact agcacatcac tgggtgaggg gcactataga cttctctccc tttctgtccc 83701 cctcaattct cctcctcact cctgactcga ggctgcctca gacacttccc atctactttt 83761 atctgaggag taagagtggg aggaggagag gccaaggaag gttggcatca ggaggagctg 83821 acgtgatttc ctcttgcagg ccagggctgg atccatggag atcatctgcg ccaggactgt 83881 tttcttttgt tgttgttgtt gttttgtttt ttagcagact ggcctagagc aaacccttgg 83941 agttttgtca aggaccctgt aagtattcct gtctttagtt 
ttattttcat ataaagcaag 84001 tacaatgtac attaaagtaa gatgattctt gttgcatgtt cccaaaggat gacctggaaa 84061 atgtgctgtt atgtggacct ctcctggggg ccagctcttc tgagagtgat ctagggtctt 84121 gctagccaaa gaatggccta cagatgagca gcagagacat tactggggag tttgtcagaa 84181 atgcagactc tcgggctctg ccccagacct actgaatcag aggctgaaat tgaataagat 84241 ccccagggga ctcatatata cattcaagtt taagaagcac tgatgtggga tttaggaagt 84301 cctagttgag ctgtgtgtgc agaacgatgc ggagatggag ttttcctaga ccagctgttt 84361 tcaaacctga cacgcctcag aatcccctgg agggtttgct aaaatacagt ctgctggccc 84421 cccaccccac agattctgat tcagcaggtc tggcggtaag gcctaggaat tttctaacaa 84481 gttctcagat gacgctgaag ctgctggctc agggacacac tttcagaatc actgccttac 84541 ataagcctct aggcagcctg taggaagatc tctggctttg agaaccttta agaagagcct 84601 ctgtggctca cttcagatag tatgaccaag acgagcaaat gcaagttctg gaatctgaaa 84661 acccagatgt tagtcctgga tttgacacat tcttggaagt accttgacct cttacctttg 84721 ttttcttcac ctgaaaataa ggataataat aataatcacc tcacagcact gttagaaata 84781 tcaaagaagg tattttatga tctcttaaat gtgatacaaa tgtaagctgc cactacaatt 84841 tgagtatccc ttatccaaaa tacttgggat tggaagtgtt tctgattttg gatttattca 84901 gattttggaa tatttgcatt ataattacca attgagcatc ccaaatctaa aaattaaaag 84961 aatcttccaa gtcagaggct ttaggaaatg atttcatgtt gctctctttg tggtgggtgc 85021 tcttttgaga tgctttgcct ttctctgttt gatttattgc cacttgggga aattaccata 85081 tggctgaaaa tgtgtttttt tgggaaaaaa ggatattatg tgttattttc aaggactgtt 85141 gctgactgtg atttttgctt tttgatatat ctgagcatta ggtgacagtg attcagtcat 85201 tatggttata tctgttcctg gcgtggtaat attagcactc caaaactgtg ctccatagca 85261 ttctgtcagt gtatttataa cacactgaag aaactcatca gtgccattgt ctcgctgtgt 85321 gaacttctca gcattacagc ttccagcttg cagaggtctg gttgttatgc tatcaccttt 85381 gtaatgcctt gtccaggaag tcctcgtaat tctttttgtg cccacagatt tctgtgaagc 85441 atctttgttg gagtgttttg gggtctctgt gactggccag gaaatcttag ctaaattaga 85501 ggctttggcc aggcgtggtg gctcacgcct gtaatcccag caacttggga ggccgagttg 85561 ggcggattgc ttgagtccag gagtttgaga ccagcttggc aaaatcatga aaccctgtct 85621 ctaccaaaaa tacaaaaaaa 
aatgtgctga gcacagtggc gcatgcctgt agtcccagct 85681 acttgggagg ctgaggcagg aggttacctt gagcctagga gatcgaggct gtagtgagct 85741 gagattgtgc cattacactc cagcctgggt gacaggagtg agtgaaactc tgtctcaaaa 85801 aaaaaaaaaa aaaaccaaat agaggcttta attcatcact atctggcata gggaaaagca 85861 tgcaggcttt agagttagac acacctgggt ctggtccggc ctccattgct tattacagtg 85921 gttcttcaac ttggcctcac attggactta gctgaggaga tttataagca atatttatgt 85981 tgggtcccac tttagagatt gtgacttaat tggcctgggg tggcctaggc atgtggactt 86041 ttaacaactt ccccaggtga ctctactagt ttgagactca ctgaatatgt gatcgtgagt 86101 ttacttaatg gttctaagcc tgaagtctcc tcatatgtaa aatgggtata ataacatctc 86161 tcttacagca gcactgtgag gatcataatt aatgtgagat gcctaaatgg gatgatctca 86221 ggggagctgg aagggcccct gcttctcaaa ccccagtata ctttgccata ctacatcttc 86281 caccaccacc attgggcccc ctaacaactt ccctagttat tctggctcat caccacccct 86341 ttgagggaat atcttttgaa ctgatgaaag gacatctgct ggctgttgac ctccttctcg 86401 tctccaccta tggatccagc atggttcaca tgcctggtaa gaggtatttt agagacaaag 86461 aacataaagt ccttgcacta ggaccaagat cggcccatta atgtgctaga caccatgacc 86521 tggtcgctgt ctgcatctcc ccagtgtggc tggcctttct gcccacttcc ttgagtgatg 86581 atcctgttca ccatcctctc caacatatca gccgattcca ccattattta gcctgtccag 86641 agttcacgtt cctgtttgca acaccaaagc cttttaacct ttcggcatgc ttctctcatg 86701 aagcctactc ttaaccctca gtttgagatg tactatcctt acagtgaagt gtctggcaca 86761 tacttttggg tgctagtgtc taccaaatca ttatttatgc acttgcctca tctcccctgc 86821 tcagctaaaa gcttcaggaa atcagggact atccatctcc ccagcacaga gaactgttag 86881 tatatatgtg ggtgctcagg atgtgttgat tgagtgaatg aaagaaagac ccttatgtac 86941 tgacagtgtg tcccttcaat taaacaaccc aattacatga acaagctgaa atctgaagtc 87001 ctgtgttccc agctggactt tgagttctct aactgtaggt gctgggttct tacttcttta 87061 cgttcctcat ggcatctggc acaatacttg gccaacaata agtgctcagt gagcacttct 87121 tgtcctatta aggagtggag atgtgaatgc aaaagagcca aaggaatgac cccaaaccac 87181 ttagtggcct caacagagat gggcaagggc tggactttgc tgattagtgc agtggtgtct 87241 taggggaatg cgcatatcct aacaagatcc attgggtctg agaagaacat tttagaactt 87301 
ctatatcttt atttaaattt aaaaataatt aggctttata cagagtggta taatggactt 87361 tggagactca gaagcaggga gggtaggaag gggatgaggc ataaaaaaac tatagattgg 87421 atccagtgta cattacttgg gtgatgggtg cactaacatc tcagacttca ccactataca 87481 atttatccat ataaccaaaa accatttaca tccccaaaag ctattgaaat aaaaaaactt 87541 tataaagaaa aataattaca ctttattgag ttctatattg atggctatgg acccttcatt 87601 tgatctgtat ggaacatgtg agaggggcac tgtcaccctc actggcttgt tttcattcaa 87661 tatattaaaa ataatctcag ctgatacggt ggctcacgcc tgtaatccca gcactttggg 87721 aggctgaggc gggaggatca cctgaggtca ggagattgag accagcctga ccaacatggt 87781 gaaaccccat cagtactaaa aatatagaaa ttagccagtc gtggtggcgg gcatgtgtaa 87841 tcccagctac ttgggaggct gaggcaggag aattgcttga acctaggagg cagaggttgc 87901 agtgagtgga gatcgcgcca ttgcactcca gcttgggcga cagagcaaga ctccatcttc 87961 aaaaaaaaaa aaaaaaatct tgagcggttt acatatagct gtggatttat gaattttatt 88021 atctagttta tctagtttac tacttacaaa gtagacaagt ggcttaaaaa ttcttgcagg 88081 aaacaggaat tgaaatgatt attgatgatg taagcacaag agaacagcta gaaattggca 88141 gaagtgaccc ttctcccagg ctggtatgcc atgttggtta gccagatgac aaaagaaata 88201 caatggtggt ctaacaagaa caaggacaat ggcaatctga caaagacatg tcatacattt 88261 ttgaaatagt caagaagact ctaagaaatg tggatttata ttcattgtca ttgttgtatt 88321 aaccaaatcc tatatggaca ctgtgtcttg aagaattaac ccaatggtgt ttgaagaatt 88381 aatgactttt gaagatttta aatttcattt tacctcatct tttgaaattt ctgtttttgt 88441 atttgttaaa aatattccta atatattagt ctattagtat atctctatag aaattaaaca 88501 tccatgtatt gggtgtgtat gctgaaaaat attttactag taggagcgca tgtccactgt 88561 gtcagaatgt catacaaatg agagtgtagt ttgttcttga tgggctttta gcatcatcat 88621 gtcatattgc tctcatgtga ttggctcacc agaaagccat ttcaggaatg tgtgttatta 88681 gtcaccatgg taatggcaca gctgggaggg gaatggatca ttaaagatcc atcatcacta 88741 ttgagttaga aattaactga gatcacacat tcttctgagg tcaggtcatc ttttgcttat 88801 agaaaaaggc aacccttaaa atttaacaga tttttaaaga tatgtattat aggcattaca 88861 tgagcttatg aggttgttca tacatttgga agtgaaagac aaaaccctag aaattgacta 88921 tttgtcaagc attttgttat ttatttgaac cctgcaacaa ctctgtaagg 
tagggatcat 88981 cctcattgta tgaaaaataa agataaaatc acagacctta agtgacttgt cgaaactcac 89041 atatttagta agtatagaat gagggtgcaa aattagatga ttggtattag ccaatatttt 89101 gctgttagtt tccaaaggca aagaatacat aggcattgta tcaagtcacc taatccagta 89161 agtcctcact taacatcatc gatagcttct tggaaactgt gactttaagc aaaatgacag 89221 ataatgaaat caattttacc ataggctaat tgatgtaaac aagagttaag ttcccatggc 89281 atattcctgc tcacaaaaac atcaccaaac ttctaaataa agaccaaaac acttctaata 89341 ttgaacattg aatacatgtg agctatgcgt acatttaaga aagacaaata aaagcaagta 89401 agataatttt ttttgatctt tggaacccat tcaatgtttt tgtagacata ttcttttttt 89461 tatccttgtt ttcatttttt aaattatact ttaagttcta gggtacatgt gtacaacgtg 89521 caggtttgtt acatatgtat acatgtgcca tgttggtgtg ctgcacccat taactcaact 89581 cgtcatttac attaggtata tctcctaatg ctatccctcc cccctccctc tatgcagcca 89641 taaaaaatga tgagttcatg tcctttgtag ggacatggat gaagctggaa accatcattc 89701 tcagcaaact atcacaagga caaaaaacca aacaccgcat gttctcactc gtaggtggga 89761 attgaacaat gagaacactt ggacacagga aggggaaagt aagataatta ttaacccagt 89821 tattctagtt cagggttgca ggtgctggag cctatctagg cagctcaggg caccaggcag 89881 gatccagccc tggctaggat gccatcctgt cacagggtgc agtagctcac catgctcact 89941 cacactggga atgtgtagac acgccgatta acctagcagg cacatctttg ggatgtggga 90001 ggaaaccaca gcacccagag aaaacctgtg cagacttgag gagaatgtgc aaactctaca 90061 cagacagtga ccctggccca gaattggttt tttccttatc aactgtataa caaaaggatg 90121 ttgaatgaaa tgatgttatt cgaggatctg ctgtaatttg aaggtactca caatttccag 90181 acacgaaagg ctgtgtaaca gtggacttta taccctgaga gtgaaagtaa aatgttcact 90241 ctgagggctg gagctggaat ctttccttca tctatctctg ttaggtggtg aaaagcaagc 90301 caattttggt tgtgcttatt gacattgtat catcttatgt catttctcaa gtgaaagttc 90361 tgcatttttg tatgagtaag aggtccctcg atggagcttt gccaactgtt tctctcaaga 90421 tgttctgcag agcatcagat attgacctaa aatacaataa atgggcatct ctctctgtta 90481 acaggaccat tgccaactat ttggtgtgga gaatggttta ttccagaatt ccaaacctta 90541 gcaggcgctt tcagtataga tggctggaat tctcaagggt aagtttaaga agattgcagt 90601 gttacatcgc tggataactt tacatgctgg 
aatagtccat atattaactg atccagtaaa 90661 aatcatcttt agggattaga cagggggagt ttgtagccca agctctattg tttttgtgtg 90721 tgtgtgtgtg ccttatttat ttttatttag gtataaatta catatggtaa ggtacactca 90781 agttttactt tgatgggtta tataaagata tttagagaaa tacttgaggc cacacattgt 90841 tttcaaactt ccccgtctta ttagtgtatt cccattttcc cccagagttt ttaactgggt 90901 cactgtaatt atagttttaa catttctttg tgatatattt gtaactatct tttattctca 90961 gtattctgaa tatttaaaat gtgcatgata gagatattgg atggtagaac ttcattttca 91021 atttggaaat ggaattttgt cttaaacttt gcctatacct tccatctttg aatctatcct 91081 ctgctaggtc caagaggctc caatttagtt ttcttgctca actgtctgag atggttttcc 91141 tccacctgct cacctggctc tttatttatt tatttttatt agagatgggg tcttgccatg 91201 ttgcccaggc ttgtcttgaa ctccagagct caagtgatcc tctcacctca gcttcccaaa 91261 gtgctgggat taaaggcacg agccactgtg ccgggcctca cctggctctt taacctttgc 91321 ttctctccca cccttgctcg tctcccagtt tgtacccatg ttactctcac ctttgttttt 91381 gaggcttatg tctgactaat tctcctacca ctagaggccg taaggatttt gtatctttga 91441 aacagttttt atttttctat caatttaaag ttaagtgaat ggtgaatctt ataagctatg 91501 atttcattca agaggcattt tagttattag aggaactaat aagtatagaa attcttagtg 91561 ggccattact ctggaagagt agtgcaatta ttttggatat gctttgaccc tataaagtca 91621 ctagcatata aagttacttt agggctgggt gcagtggctc acacctgtaa tcccagcact 91681 ttgggaggcc aaggcaggca aattgaggtc aggagtttga gaccagcctg gccaacgtgg 91741 tgaaacctcg tctctacaaa aaatacaaaa attagccggg catggtgtca cgcacctgtg 91801 gtcccagcta ctcggggggc tgaggcagga gaattgcttg aatccgggag gcggaggcta 91861 cagtgaactg agatgatggc accaatgcac tccagcctgg gcaacagagc aagactccgt 91921 ctcaaaaaaa agaaaaaaag tcttttttac agagaacagc ataaaacaac tatatatata 91981 tttaggtcat attgatgggt taagatacaa aacaagtagg ttttgtgtgt gtgtgtgtgt 92041 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtttgtgtgt atttttaact gctacgaatc 92101 acataaaaga tgatgcccag ctgctttcca cccctactgt gaagagaatg agagaggtgt 92161 aagaagcact gtgtgaacat ttaaggttag gtatgagggg atgtttcttg agcacgaagt 92221 gctaaatata atattgaatt gccgaagaag gttgtaggat ctctttctct agaggtttta 92281 aaaatagatt 
catcgcttac ctctgagaga ggttaccaga aatcagagga agaatagatg 92341 aaattaagcc ttaatcccat tcctcaaaac aacctgggtc taacattgtg caaaggacac 92401 ttctgaccac cattctcagg gcctcccttc ctccctgcct gcacccccag cttccaaggg 92461 caacttacac tatatttatt tctctgcttt tattcccaag agtaaacctc tgacccaggt 92521 cctgcttggg aaacttctta cttgtttgta aatgttttct tgcaatggta taattgggac 92581 agatgggcaa agcatttaca tctgtatttt gcttgtttat cagtgtgttt tttgctgtga 92641 cacaatgtaa atatgctccc tgtgttgttc atggtgggcc atgttcttct ggtctgcctt 92701 gttgagtgtg ctactcggag caaggaaacc ctcactgggc aggaactttg atcacccact 92761 gattctagac cttcttagtg gaactttagt tctcaggaga tattaaatat tatctagaca 92821 cttggatagc atcctctcat gtccctgcta actgcagaaa agtgatacca ttgtccagat 92881 tctttgttta atggggaaaa ttgggaagaa aggatacata tattatttta actgttaaat 92941 cactttcttt tcttcttctt cttttttttt tttttttttt ttttccagac agagtctcac 93001 tctattgccc aggctggagt gcagtggccc aatctcagct cactgcaacc accgcttcct 93061 gggttcacac aattctccca cctcagcctc ccaagtagct gggattacag gcatgcacca 93121 ccgtgcccgg ctaatttttg tgtttttagt aagcacgggg tttcaccatg ttggccaggc 93181 tggtctcgag ctcctgacct caggtgatct gctcacctgg gcttcttaaa gtgctgggat 93241 tacagatgtg agccactttg accggcctaa aacattttca tttagtttga aatataatgg 93301 ggacaagcag aatttttttc ttcaactatt ttacctttgc ctctactatc tgatgcatat 93361 atatgggtta gggtgtgcag tgttttgtag actgttttct tcaggttgtt tgaattgttt 93421 tcagccatgg gttttatcca aatgaagttt aatctggatc aattatctcc cacaggtaat 93481 ccaggggacc acaactttgc tgcctcaatg ggacaaatgt gtaaacttta ttgaaagtgc 93541 cctcccttat gttgttggaa agatgtttgt agatgtgtac ttccaggaag ataagaagga 93601 aatggtaagt ggtactcccc agctagcaaa aaataatggc aatttagcca gatctgacaa 93661 gggtatattc aacaggctaa tgtcagcctt ccaggtggta gtggcctgtg ggtattgtct 93721 tcagattctt tcaacttcta agtgaaatag gaagaaaggt catcaaacga gagtgaggat 93781 gagttagcag gtattgaagg ttagaggaga gagaaggtgt gtagttttcc aggggagtgg 93841 gagagtgaat aaactgaggg aatgtagtat agctgcttga gggcaggcat gtccagatgt 93901 aggtgaggtt tgaggtattg aatttaaaga gggactcatc actgtggcaa cgtgttcttc 
93961 tccatctgtt tcaactgcat tggagaaaag ggtgtatttt tttttctttc agctttattg 94021 agatgcattt gactaataac aattgtatat attcgaagta tacggccggg cgcagtggct 94081 cacacctgta atcctagtac tttgagaggc taagccgggc agatcacttg aggtcaggag 94141 tttgaggcca gcctggccaa catggtaaaa ccccatctct actaaaaaaa atacaaaatt 94201 tatctgggcg tggtggcagg cacctgtaat cccagctatt tgggaggctg aggcaggaga 94261 attgcttgaa ccccggaggc ggaggttgca gtgagccgag atcgcgccac tgcccttcag 94321 ccaaggcgac agagcgagac tctgtctcaa aacaaacaca aagtatacaa gatgatgatt 94381 tgacatacat tgtgaaatgg ttgccacaat caaatgaaca catctgttgt cacacttatt 94441 taccattgtg tgtgtggtgg tgggggatga ggacacttaa gatttactgt cttagcaaat 94501 tcaagtatac aatacattat tagtaactat agtcaccatg ccatatttta gatccccata 94561 acttattcat cttagaacta gaagtttgta ccctttgatc aatatctccc catttccctg 94621 cccccagccc ctggcaacta ctgttctact ctctgcttct atattttaac ttttataaag 94681 attctacata taagtgagat catataatat ttgtctttct gtgcctggct tatttcactt 94741 agcataatgt cctctggctt tatctatgct gttgcaaaag gcaggatttt cttcatttct 94801 tcatttttat ggccaaatat tccattgtgt ggtacacaca tgcacacatc gcagtttcct 94861 tatgtattta tccattgaag ggtatttagg tttattccat atcttggcta ttatgaataa 94921 tgctgcaatg aacatgaagg tgcagatctt tcttccatat actgatttta cttactttgg 94981 atatataccc agtagtggga tggctggatt gaatggtagt cctattttta tttttttgaa 95041 gattttccat gctattttcc ataatggctg taccagttta cattcttacc aacagtatat 95101 gagcgtttcc tttttttcca ggaatatttt gccaacactt acctcttaaa tttttgatca 95161 tagccatcct aacaggtagt gaagtgctat ttcattatgg tttgatttgc atttccctga 95221 tgattagtga tgaaaacctt ttcataaacc tcctgtttat gtgtcgtctt tggaaaaatg 95281 tctattcaga tcttttgcct attttttaat cagattattc gttttttcac tattgagagg 95341 gtgtattttg aattagaaac atggatttgg ccaagttttt tgtctgcatt ttatttttga 95401 tgaattatat tttacaaatg tcatgaagta aaagatccag gcatgattaa taataatccg 95461 tgtagtcaat aagatacaaa tatatgagca gcttggatgc aggtggtcaa taaacactga 95521 attaaagggc aaagacattg taatgtcatg attgttcaaa tgattggtat cacgtagtga 95581 tacataataa taccaaatgg ttggtattat gtatcagtac 
atagtcatta ttgatacata 95641 atacatgatt atgtattatg cactgaagtt tgctttgcaa taccatatct aagtacattt 95701 gtggcaggcc aggtctcact aacacaggcc tccatcacaa ctgtcccagc acgtactgag 95761 tagctaggtt aaacattaaa agctgattga accactgccc atatacaaag cctggaatgt 95821 aacaaagagc ccaccaagag ttttgcctag gcttttcctg gatcttgaag catgacaaga 95881 taatgaatga attcttaaca ggaccctttt aggattaaac aagttttatt gggggtctga 95941 agaaactcca caggcctcca caaacaagtt tactggggtc taaaggaact ccccaaacct 96001 ttatgattta gcaggagaca agataagggt aattacctca gtacctagac ccatttagat 96061 taagtaaact tactgaggct ccagaagaag gtcttcaaga ctcacaccta gttatagatt 96121 aaaagaagtt aatcatttat gtctttcgac gattgcacac tttcacgtag acatatagct 96181 tagaaggtat atgagctcta gaaaactttg taactttgag ttggtctggt gataatttcc 96241 aggctttctc cctgtaacca gtggcagaaa ataaaactct gttcttcccc agttcatttg 96301 catctcgtta ttgggccgtg agaaatagca gcctgaccct cagtttggtc tgggaacaaa 96361 ttgacctatg ttaacttgtt tttactgtct aactatgaga ctttatgttt gaacctgaaa 96421 gttcccttgc aaataattcc actgaggaat taatggctta tttcaaaata ctcagtattc 96481 tcattgcgtg ttttcttttg ctctttggtc agagtaaata gattgagatc ttatcagata 96541 atcacagtag taatattata atctctggga aggctataaa aacaattaaa ggataatgaa 96601 agatttaaaa aaattatagc aggttttttg gcagtagaag aattttaact aatatgaaaa 96661 gtatgtggat taaacttgtt tttctcagtt ttggtgtgaa agtccctttt ccttgtgtca 96721 gggtatgaca ttctaaaata aagttaattt tggcatcttc ccccttattt tgatgtacgt 96781 gggggtagtt tggtagttag gcagatactc atctctttaa cgtaaaattg tttgcctccc 96841 ttgtgctgaa gctgtttgcc tcagggaata cttaggttgg aaacttataa gagaaatatc 96901 tggcctttta tagccttttt ttttacaggg ctcccatcca agaaccacct ggaaccactg 96961 atttacttga aaaacattca gtattattag ctacagtcta aagaaaaaga aaaacagaaa 97021 gatccccaaa ggggcaaatt ccagagaccc taactcagat ggaaggctgc cattgtttat 97081 ttaccaaaaa agtttgttaa gcaaatgaat acagtcagaa gcttaaactt caggtatgtc 97141 tcaaatttac atgtgtacat aaatcacgtg agcatctgct tgttaaaaat gcagattctc 97201 attgcatttt aacaaggttt ttttttttta agttgtgttt ggggtaggac ccaagaattt 97261 gcctaacaag gttccaggtg 
cgactgctgc tgctggtcgt ggaccacact ttgcgcagca 97321 aggtcttctc atgattcatt tgtttgtaaa cagcaccaaa aaaaaaaaaa aaaaaaaaaa 97381 aaagcctcgg tgactcacaa gagcaggcca ggccagttga catgaatagc agtactttat 97441 ttaccaaaac cggttacatt agtgcaagta actgctttaa ttttcagatc attgaaaatt 97501 tctgtgattc agattggttt aaatggactg tgaaaaaccc tgcgttagga cctccaggct 97561 ttatcttttg aaaactgata cctaggtttt actaagagac tcccgcccca cttctcctcc 97621 gtgtctgttt tgagaacctc ttgatgacag agttggccca actgcctgtg gttggtaccc 97681 tctcttctaa agagtgagtg ggagagagca gttcacattg aggtttccta cctatcagtg 97741 ttctcctcct tttcactggg gaaaacacat aattctatat tactgtttca ctaagacttc 97801 ctacatccaa ctgtgcattt ggacacaccg atacaccaac gttctcatac tcattcacac 97861 acacaaccat gggaatttag caaattgaat tagaaatgag aagaatagca tgtagaaaag 97921 gagagtcctt caacagaaag ccaccaaaag aggtcatata gtacaatccc ttgcctccca 97981 gttagcagga cagagcctta actcgtgaga ttggctgcaa gtgttgcttt gtctttgcag 98041 tccaaagtag cactacccaa ttgaactgga atatgagcca catatttttt ttgagacagg 98101 gtctcactct gttgcccagg ctggagtgca gtggcgtgat ctcggctcac tgccacctcc 98161 gcctccctgt gttgggatta caggcatgag ccactacccc tggcctccat tgacttttat 98221 attaacattt tctccctcct tacctccctt tcttccccca ttccccagaa ctaaaaatca 98281 taactttgtc tatcagagga cttctttaat tttttatctt atttagatga agattattta 98341 taattttaaa acattccatt agctaagcat ttaatattac ttttgtggaa agttacatga 98401 aagtgggact cccctagaga attgatgata tatcatcact gtaaatagag atgatgctct 98461 cgttccactg tgccaagccc tgatccatat gctcatgacc taaatggcag gagagaggat 98521 tagttgtgtt caggaaggat gtctagagtt atcacttgct aattttcccc tttatttatt 98581 tttatttttt aatactcttg tctttttctt tttctttttt ttttttttga gacagagtct 98641 tgctccatca cccaggctga agtgcactgg catgatcttg gctcattgca acctctgcct 98701 cctgggttca agccattctt gtgcctcagc ctcatgagta gctgggatta caggtgtgca 98761 ccaccaggcc cagctaattt ttgtattttt tttgggtaga gatgtggttt cctcatgttg 98821 gccaagctgg tcttgaactc ctgactttaa gtgatctccc tgcctcagcc tcccaaggtg 98881 ctgggattat agacgtgagc caccacgcct ggccttgttt gattattatt attattgctt 98941 
atatctgtaa tgtataagta agtgccccag gtagatactc atgtccatgg agggcaggtg 99001 aaggttccaa cgaaggccag ggaatgaagg gattgtcacc atgggcctag ttgtgactgt 99061 gatggtcagg acatattggt tttaaaatgt ggtgatgata ataacagtag cagttagctt 99121 tcactgaggt ttaactagct gtctggtatt cttttaagtg ctttcagtgt gtgaactcac 99181 ttatcctcac aatgaccctg ttactgctgt tagtgccctc attttccaga tgaggaaaca 99241 ggctcagaga gttgaagtaa attctctgga gtcatatagc ttgaaaatgg cttaggtagg 99301 gttcaagccc agacaatcta catggaaacc agtgttccat tcccttgctt atctctgaag 99361 gcacttgaat ctttgccagt tttacgaata tgaggagacc agaaccttct tagtggctta 99421 tcttataatt acccttgtta gcaaattatt ctaatattaa gatcctaaaa gggcatacat 99481 ttcatatttt ttaacatttc agaggtagga aacagttgga ataggtttgg cttccagtat 99541 ttcaaaattt taaatataaa tatagtggaa acctaattca tatccaacca gttgacatca 99601 catttttgta ttcatataat tctcaggaga aagacggaaa tgataagaag caagctacta 99661 ttgatttaat ctaagaataa ccttgcaggt atatctttgg ctcagtctac acactgagct 99721 caatttctcg tattttgtac atacgtcatt ctaaggagtg atccctgatg tttgcagtgg 99781 taacccaaag gctctgtaag atttattatc aaagtttaaa aaaaaatgtt taccattctc 99841 tactaaggcg gaaatgtgaa tatatacatc aaagaaattt tttttctatt gtgttgatgg 99901 gaaaaaaggc cttgttgctg taactctagt ttctagaaaa ttcagtaaag ctgaagttcc 99961 catgtgaaca ttttggctgg gctaagttct tctgtataaa ccatgtcttg gtttgtttca 100021 ttgtagacac tatgtcccca aactgcttat ctttattcat atatctcatc tctgccagta 100081 ggtatttttt gctgctccct caaattttct gcatttttct tatagagtaa accctgctaa 100141 agcaaataat tattttgaca cttgttaaaa tagtaatgaa gactattcag gactattgca 100201 agaggtgtca agtctattgc aacaggggag aaagattagg cccaactcta aatacagcaa 100261 agatagctgg agatttatag ccagtgagga gaatgagggg gtcagtggat ggaaaattat 100321 gaagaggaga catcacgagc agggaaattc ttgttaaacc aacttgacag gattcttgtg 100381 gaaggcaggc caaggtgatc aggatgggga ttctttctaa agtgacttag caggattctt 100441 gctacaactg gactagacag gcttgtcgag gacaggatgg gggtgggagg ctgggaggag 100501 ggtaagggca agacgtactt gagaagaggg ctcagaggag cctgactaaa ctttggtcaa 100561 ggagagagtc tttgtcagct ctcagatcaa agctaaagtc 
actgatctcc tgtcatatat 100621 taaaacattc tatatgcaga aggttaatct agttaaagca gagcaaatct attacacttc 100681 agattcagat tttattaagt gaagaactaa aacttgactc cctatgtaga acaattcact 100741 aatttctcct gcactctagc agctattcta tttctcttta aagacattgt tcatgtcccc 100801 aagtaatgcc atggatgccc taaaatatct tccatttttt ctgagctgct ctctgcctga 100861 gcaattccag tagtaattct tatcctaaga aaagaattct cctcccaccc catactgcta 100921 catctggtgt tgaagactta aatacccacg tgggtccaat tgcagtggcg gagtttgaac 100981 ttgcaacctc aatacactag aagttcacct cattagtgag ctaagataat tgaattgatg 101041 tactgccatt aatattcttt aagttctcca attccctcgc tcactgtttc tctcagtggt 101101 tgagtaaatg agatgtaact ataacgttga tggcagaagt cacatgaaac tcagtgtcat 101161 gcattatagc attgccctta tactgaggag tttttcgttc cttgcaaatt cagctgctca 101221 caaaggttgt tctcagtaaa gccactgtac agataaagac aatggtccct tcatcacaaa 101281 ttgtgaaaga aaggatattc tctttggcta ctaaatgccc gtttctgttt gtctagtaag 101341 tggtgttggt tattattatc aagatcactt tccagagaaa ctttgctgaa gacctgtcca 101401 aattaaggca aaccatgatg atttggtttt ttaagggctt tttaatatat ttataaaata 101461 ttgacctaat atagaaatta catctgattt tctgcaaagt tgcattgata gctcatagcc 101521 cagctctgct cttaccaaca tgggcaaatc agttaacttc tttgagcttg ttttcctgtt 101581 ataaatgatg caagagtttc tgttgctttg gtaatgctgt gtaacaatca atcacaaagc 101641 ctcactggca tttgacaata agcagttact gctgatgcat ctggagtcag tggtgggccc 101701 ccatgcagct ctgttgattt tggccaggct ggcttgaatt tctgggggtt ggctgatcaa 101761 ggttggcttt ggcagaggtg attggaatga ctcggctctg cttcatgtgt cctgcttcct 101821 tcagccggct agcctgtgca tgttctcatg gtgatggcag aagttcaagc aagcaagtag 101881 aaacatgtga ggtctcctga gacctaggct caggacaggc ataatgtaat atctgcattt 101941 tgttggccaa agcaagtcac aaggccggtc cagatttaca gactagggag atagggtctg 102001 catttttttt tttttttttt tttttgagac aaggtctggc tctgtccccc aggctggagt 102061 gcagtggtgc gatctcggct cactgcagcc tctgcctccc aggctcaagc catcttccca 102121 ctttagcctg cgagtagctg ggactacagg tgtgcgccat gatgcccggc taatttttgt 102181 attttttgta gagatggggc tttggcgtgt tgtccaggct ggtcttggac tcgtgagctt 102241 
aagcgatctg cccgccttgg cttcccaaag tgttaggatt acaggtgtga gccacctgta 102301 actctgcatt tttaatgaga ggatctgcaa agtcacacag taaggggtgt ggataaatga 102361 aggagtatgg cattaaagct gtcagtgctg tgtatcacaa atagggataa aagtatacct 102421 ccttctttca tttgaggtga gagttaaatg atataatcta tgtagaaggc ctagcataat 102481 acctggtaca caaaaagtgc tcagtgaatg tcatctactc ttattgctaa tattgccatg 102541 cctaatgtta ataattaaga tagaatttat aatatcaact tcctggcaca gctgtggcaa 102601 cttctgcatt tagattgcct ctttttaatt tatcatagag tgagggaggt gcctgggggg 102661 ttttgctgtc gctactcctc catacatgcc tagtatagta gctttcattg gaataaagat 102721 ccttttcatg ttagtgccct ggccacattc cagaacccac cctatcattt aacgccaagt 102781 ctggctcata caagagtctc atgaaattac tcatgaaatt actcgaggag ccaaatataa 102841 tttctttttc tttttttttt ttttttgaga ccaagtctcg ccctgttgcc caggctggag 102901 tgcagtggtg cgatcttggc tcactgcaag ctctgcctcc cgggttcacg ccattttccc 102961 gcctcagcct cccaagtagc tgggaccaca ggtgcatgcc acctcgcctg gctaattttt 103021 tgtattttta gtagagatgg ggtttcactg tgttagccag gatggtctcg atctcctgac 103081 ctcgtgatct gccctcctca gcgtcccaaa gtgctgggat tacaggcatg agccaccatg 103141 cccggccatg taatttctag aataggtcca gatttcataa ccctctgcag gctaaatctt 103201 gttacaacaa atatcatccg gtcccaatcg tgacagctca tttatgcaat cttcggtcaa 103261 gtaacatcca gcaaaacaag taatttagat agctcattgg gaattctttt gaaataagta 103321 tgaacactta ctcttgtcct tgtctttgag tgtgctgcca cgacgatgac acattggctc 103381 acctttcttg tttgttttct ccatttctgg ctagcattgg gcagtgttgc ctttgccaca 103441 tattatcagc cacagtcttg tcaagcagat ctggtcagac ctatggaatg tgaagaccct 103501 ttattagaaa ttcatttacc taaagaaaac atttatgcct tccaacagat ctgaatgttg 103561 gaagtgacca gacatagtta cttgctgggg agaatttacc cacagccaaa ggctagacta 103621 tttgagactg attgacttgt aaatattatt ctggtttctg atccatcctc tcttaaagtt 103681 ggaaagtgga aaacaggaaa agatgagtta taaaattaga acacagccta tatgttcata 103741 tccagcaaat agaaatataa atatctacct gagtcttatg aatatatacc tttagataaa 103801 aggacatgag aaactcctcc tgtaccacca gaataaccca ctttatagca ccaaataatt 103861 tgattttttc ttcttttgcc 
aaaatgttta atagctatgt aacactgcta atcctgacaa 103921 actgatatga tttattcact cacttggctg ccagaaacct cagcattttc taactatcct 103981 tagttcttct ggtttaacca catgtcccac ttcaatatag gtcacttttt acttattttg 104041 ttgccattat tttttatcag gagccaatcc aagggcattt ttataaatag ttatagtata 104101 agtaattaat taatgaatat ttcatattta ttggtgcatt catttattta atgtctgtgt 104161 ttctagtaaa gtgacaattg ccagaaagaa aattttaaga gtccccctga gattaaagcc 104221 tatttcttca gggttatcca cataaattat taacaacgat ttccattttt cacattccac 104281 caatcagtgg ttacaactca agaaaacttg ccctgttcta gagcaatgag aagactggaa 104341 acattttaat ctcatggttt ttttccagat attataactt ctaagtatgt taaagaggaa 104401 agcctactca acattatgcc ttttcgttaa gaattttatt aacattaaat tagtttaagg 104461 ttaatgccca taatcaacct tactgatttt ttttcttttt cctttctttt ctttttcaga 104521 actcctagcc tataattatt cctactagat atgtactata tttttaaacc attccaagct 104581 gtttgtcatg taccatgttg acatcaccta tcttagaatg gactaggcta acttacagag 104641 tcacttgatt tccactaaag tttgtgtttg gtagtaggct acagtcattc tggagacatt 104701 tgctgtgaac cttgtcccct aatataacga tggattttct attcactgat aagttctact 104761 catggattca ggctttgaat tactaaatca gagtaggtct ctcttttcaa aaatgacttc 104821 acctaaaaag ggtaccatgc attttttttt tccctttttc agctttcatg ttgttaaatc 104881 tcatgatcat atctgcacag caccatattt tctatgagtg agtttgttaa tattttctct 104941 ttatactatt ggaataactg ttatctgtta gcatatatga aagatatttg attcaatttc 105001 aactctgcta ggtgccaatt gaaggtttca aaaagcataa attgatcagc ttgtatttac 105061 tgagcatctg ctatatttta gtattaggta tgaatgtgag atatggactc gtgaaaaatc 105121 atatcacatg agatatgaaa tcataaaact aacaaaattt taaattagca gaatactttt 105181 ttgtttgctt acatattcat gatctcattt gatctaaatc atggttcctg ttctcaaagc 105241 ttatgggtct tgttagggag accaaataaa tataaatgaa aagacaacaa ttaaacacac 105301 acctatgtat tattctgatt ttaagcctca caggaactca gagagggaat tgaaagtgtg 105361 ggtttaagtg tttgggattc atgggtatga gataggattt gagttgcctt gtgggatttc 105421 tttattatca gggacaaagc tttgaagaaa actacaaccg atcatggggc aaactaatcc 105481 ctccttaaag caatgttggc atatgtccct tcaaatgtat 
taatagaacc atcatgactc 105541 atttatgcaa agtacagaat tcttatgtct atgtgatggg aggaagggta aggagagggc 105601 aagaggagca ggagagataa tttataattt gctaagatag gagcatctca ttaaagtaaa 105661 gccctggatt gaagggtaac aagtgcctat gcatatgtga ctcaccaatg tctggtgtct 105721 gatgtagaac tctatgggca aaatcctggg gtgcaaagaa tgaaaaagga agaacctatt 105781 tttagataac actggttgag gaagagaact aagaactgcc catagatcaa ccaaatgaac 105841 tgtattttta tagtgtggat ccataaaatc tgaaatagtt gttttttttt tttttttttt 105901 ttttttgaga cagagtcttg ctctgtcacc caggctcgag tgcagtgggg tgatctcagc 105961 tcactgcacc ctccacctcc cgggttcaag cgattcttgt gcctcagcct ctcgagtagc 106021 tgagattaca gttgcccgcc accatgcctg gctaattttt gtatttttag tagagacagg 106081 gtttcaccat gttggccagg ctggtcttga gctcctgacc tcaggtgatt cgcccacctc 106141 ggcctcccaa agtgctggga ttacaggtgg gagccattgc gccctgcccc tgaaataaat 106201 agatatttaa caaaacgttg ctttgcctag agggctctta ccctgaatat ggcagatttt 106261 accaaaataa cacttgagtt aatttttata tgttatattt tgacctttga ctggctctaa 106321 ataaatattt cctatggata acttcacctt aagacttcca ctttaaggga tttctattaa 106381 tgttatatgc tgcctggttg aggtgtgagg tccaagtttg gtgaagaaat actatttctc 106441 tatttttttc ttattttaaa attacagcag gtttttgact tgactgaagg caatgttatt 106501 atgtgatgct gtagagtagt ctaaagataa aaatcagaag tgttcagtga tttatcttcc 106561 tagttccagc atgctaaaat aattatttct cttatttgca ttgacatgtc aaccaatgaa 106621 attaatcacc ttcaaagatt ttccttgttc ttgaagtctg ctggggtagt tcaacaccat 106681 ttcacagaaa atgttataca ataaatctgt aaagtctgcc aaaaaaaaaa aaattgagaa 106741 cctgaggtgg gggatggtgt ggggagtgga ggtgccagag aagacaaggg caaagagcag 106801 ctgaggctca tgggaaaaaa aaagtgattt acccgcgttt tgacaaatgt gagagggcca 106861 caaagagtga gttaccgaag tgtggattca gggaagacag aggtggtggg ggcaggggag 106921 gaggtggaag aaaaggcaga ggagggagga actggcatgt gaataaacta tttgtcattt 106981 ttcactctgg cagtgggctg aaatgttttt gcctcttttt attttcattt agaatgccaa 107041 tcaatgggtt acagacaaag agctttgaaa accttttctt atttatttat ttagagacgg 107101 aatcttgctc tgtcgcccag gctggagtgc tgtggcgcga tcttggctca ctgcaacctc 107161 
cgcctcgcgg gttcaagcaa ttatcctgtc tcagcctccc aagttgctga ggttacaggc 107221 atgtgccacc acacctggct aatttctttt tttgtatcta gtagagacgg ggtttcacca 107281 tgttggtcag gctggtcttg aactccagac ctcaggtgat ccaaccgcct cagcctccca 107341 aagtgctggg attacaggcg tgagccactg cacccgccct gaaaaccttt tcaaaatgca 107401 ttaaatttta atgaatttga tagccatttt aatgcctggg agatgctgga cagggagggt 107461 cctatgaaac caagctgctt agagagaagt atctggaaac actcagcccc attttaataa 107521 tagccacata ctccataaaa atacctcaaa gagaacagga gcaaaatgac cttaagattc 107581 atcataggaa tccaaaaatt ccaaaatttg ctgaagatct ttttagcttg gagagtgctt 107641 atttttttga taatttattt taaaacctaa taaaatgttt tgcttttgta caccttgtcc 107701 caaataaata ttttctttgg acatattcaa ctttttatta tttattgggt gctttctatt 107761 attgctacat cctggccagg gtattgagag ggaacctttt attgtcctct ttgtccctat 107821 ctgtaaatat agtaatcccc ccttatctga gacagatact ttccaagacc ccccccaacc 107881 cccgcccagt ggatgagact ggggatagta ctaaactcta tatatactat gtttttccta 107941 tgcatatata actatcataa ggtttatttt ataaagtagg cacagtaaga gattaacacc 108001 aataactaat aataaaatag aacagttata atgatatact gtaataaaag ttatgtgaat 108061 gtggtctctc tttctctcaa agtatcttat ggtattgaac tcatgcatgt acttcctatg 108121 atctgtctga taaccaagat agctagcaag tgactattgg gtgcgtatat acagggtgga 108181 gatgctggac aaacgaaggg tgattcacat cccgggccag acaggctggg atggcatgaa 108241 acttcatcac actaatcaga atggtgtgca atttaaaatt tgtgaattgt ttatttctgg 108301 aattttccat ttaatatttt cagaacacag ttgcccatag gtaactgaaa ctatggaaag 108361 caaaatcgtg aataaggggg tactactgta cttctcttcc tagtaatatg ttcatgctgc 108421 caggatgcac atagccatgt gcgaagaaat ataatttttt agataaacct ccctttaagc 108481 agaaggtctc aatcctattt catggagaaa agacccacgt ttatactagg actctctcaa 108541 ccttctacat ctaatcttta gattcttctg catgcctgcc tatcatttta atcttctctc 108601 ctctcataag tctgtctcat atctaaggtg aatccttcca tatagtctct ggatcccatc 108661 acctcctttg ccctctgaga caccactcat ggcttgcttt gtcccatatt ttcagtctct 108721 tccttttcgc tggatccttt ctattggtat ttaaacatag tcaaatctct cagcttttca 108781 acaacaacaa aaacagtagg 
aacagcatta gcaagcatat gcatgaaacc caaactccaa 108841 accctcagac ccttccctta accctacctc tacttctctc ctatctttgc tttttctcct 108901 tcatagacaa actcaattaa tttgtccaga cttgctgtct tcgcttccca cacactcttc 108961 atcccactag tttggcttct gtccccatcc ctctttttct tccaattttc tcagtcttat 109021 tgaatgacgg tgtttgccaa tatctttctg aagcctagga atgaatctga ctctctctac 109081 gttcctcact ttcccacata aaatgagtca cctgtttcca ttggtttggt ctctaaatct 109141 ctcctttgag tccacccact tctttccctc cccattgccc ctccctaatt tgggaagata 109201 taatttcttt ttctctgtga acgtgatggg ccagccaatg catgttgcta catgatgagt 109261 ccctaaagac ttagggccag gtgtggtggc tcacgcctgt aatcccagca ctttgggaga 109321 ctgaggcggg tggatcacct gagctcggag gttcgagact agcctgggca atatggtgaa 109381 agcccgtctc tactaaaaat acacaaaaat tagctgggca tggtggtgca tgtctgtgat 109441 cccagctact taggaggctg aggcagaaga atcgcttgaa cctgggaggt ggaggttaca 109501 gtgagcagtg agccaagatc atgccactgc actccagcct gggcgacaca gtcagactca 109561 ttctcaaaaa aaaaaaaaaa aaaaaaaaac ttaggcatgc tgataggcaa aagatctagt 109621 ccccggagat ttccttgtca ttttaacatc ctcctgatgt gggttttctg tttgaaattg 109681 tgacttatgc atctctcagt tgattccctt tcactccctg gcatttctct ctgattttgg 109741 gagggatcct ctgttgttac tagaggctct gagctctcca ctgttatttt agctctttgg 109801 tcaggtcctg gctctggtcc tcattagctg tgtcagcttg agcagatcat agactccttg 109861 acccttaatt tccttatctg tcaaataaat ggtttcccct ggtctagtgg gtctcagccc 109921 tgactgcaca ttagaattac ctggggaact ttttaaaccc tcccaatgcc caggctggac 109981 ctcagaacaa tgatactagc atcccgatgc ccacgacgca ggatttttta ttttatttag 110041 aaattttttg tcagtattat tttttttttc ttttgagact gagtctcgct ctgtcaccca 110101 gactggagtg cagtggtgta atctcagctc actgtgacct ctgcctcccg agttccagtg 110161 attctcctgc ctcagcctcc tgagtagctg ggattacagg cgcctgctac tacacccagc 110221 taattttttg tatttttagt agagacgggg tttcaccatg ttggccaggc tggtctcgaa 110281 ctcctgacct caagtaatct gcccgcctca gcctcccaaa gtgctgggat tacaggtatg 110341 agcccctgcg gccagctctc agtatttttt aaagttcctc caaagattct aatgggcaga 110401 caaagttaag aaccaccgac ttaatgattt ttaaaggacc 
ttttgctctt tgctcttcta 110461 gcagtttgtg ttttatccca agttgaagtt tctcttcttt taatgttttt accaagttgg 110521 gaagaaatag ctgtgaagct cccccgaccc ccttccatct gctgtttgtt cttatttaca 110581 tatgatggaa cccagcccac tgatttatag ctctggttcc ctcacaactg ttattgaact 110641 gaaaacatag ctgaaaactt attaaacaga aaaaacttat tttgagcttc atgctgacaa 110701 tttattatag aaccaaggaa acatttcaac gtctttcctt ctagagttaa tgttaacaat 110761 tgatttacaa ccccatgctc tcaaatgaaa aagttcattc tggggccaaa cccagtgttt 110821 tttttttctt gggtgtccca ttgagaaagc caaagcatct ctgcaggaga actggatttc 110881 gaggggtttg tatccatgaa caggcagggc ctcctcacat tgctgccggt ggcctctaat 110941 gtgacgtgtc agtagcccct ttcctccagt cctctcttct ggtccacaga caccggcagc 111001 aaagccaggc acgtacatcc tcactatggg tgtggcctca ctgttcaaat gctttgcgat 111061 aggtctttca gaagtccacc tttcggtagc agcccccttt agacctacca ttttatttta 111121 ttattatttt tttagagaca aggtctcact ctgtcaccca ggctgaagta tagtggtaca 111181 gtcatagctt actgcagcct caaactcctg agctcaagtg atcttcctgc ctcagcctcc 111241 tgagtaccta ggactacagg tgcaggcgcc accatgcctg gcctttaaaa aataattttg 111301 tttttataga gatgaggtcc tcttgtgttg cccaggctgg tctcgaactc ctggcctcaa 111361 gcaatcctcc tgcttcggcc ttccaaagtc ctgagattat aggggtgagc cactgcccct 111421 ggcctatacc taccctttta aattggaaat aacaagactt ttctgagctg ccaaggagta 111481 gagaggagag taggaagatg atatagaaga gccgaacatt tagtttaacc cacggaacta 111541 aatctggaaa gaactggggt cctgacataa gtgtgattca tattggtgaa gtgtcctttg 111601 gggtgctaag tctccctgcc cgtttctctg tgttaacagt tatctctttc aactgttcac 111661 tttcttctgc aatacctatc tgaaatgcac agcccatgcc tttcctgtct ctctcagtcc 111721 tttattaaat gattaaaatt agttggcacc ttgataagga tgaactagtt tagtagctac 111781 agacaggtaa gttattggtt tgtcataatc tatggtgaat tcctatccct gtatacccgt 111841 ataattcatt ttatagctgg gccaccacat tattctcatg gaaagaaatg gttttttttt 111901 tttaatgttt tttatttttt ggagacaggg tcttgctttg ttgcccaggc tggagtacag 111961 tggtgcagtc atggctcact gcaacctcca cctcttgggc tcaagcaatc ctctcacttc 112021 agcctcctga gtagctggga ctaaggcatt tgcctctaca ccaagctaat ttttatattt 112081 
tttgtagaga tggggttttg ccatgttgcc catgttggtc ttgaactcct gggctcaagg 112141 gatctgctca tctcagcctc ccaaagtcct gggattacag gcatgagcca ccatgcccag 112201 ctaattttta tatttttggt agagatgggg tttcaccatg ttgtccgcgc tggtctcaaa 112261 ctcctgggct caagtgatcc acctgccttg gcctcccaaa ttgttgggat tacaggcatg 112321 agccaccatg ctgggcctgg ttagttatct tttagtggtg gaaaaaaggg aattttcctg 112381 gaaaaagaag tgttgccaga gcatggagtc aagctgaaag aaccccttac ttaccatgtt 112441 gagtaaagag tcatttctca tgcaagccaa ttttgttcca gaaaacctcg actgaagctt 112501 cttggctttg acgttccctc tttgggtatt ttcttgtaga tggaggaatt ggttgagggc 112561 gttcgctggg cctttattga catgctagag aaagaaaatg agtggatgga tgcaggaacg 112621 aaaaggaaag ccaaagaaaa ggtaaggatt ccttttgatg aaaaaaaaat aagacttctg 112681 gtttaatgga tgttcatgct tgacattgac tgtgtagttg gtttttatat tgactgaact 112741 gttgactctg aatgacccca gagttagctg tctgcttttg gatgaataca tagatttttg 112801 tttcaccagg aggtaagaga aaataatacc tgaatttttc gaactgcatt ttcaaatacc 112861 ttgtaattcg gcctttcctt tctcctaatt gctgtctgta agaggcacat ggcttgctgc 112921 ctggtgaatt gtgtcataaa ccatcactgg atatggcaga atgcagagca gtggggagtc 112981 acagggagca gctggaacaa gagcccctaa gacaccaaaa cgaagaaaac ccatattggc 113041 cacatgtcct tctgctctgt gcatgatgca tgcaaatgtc ctggggcaaa tcctttcctg 113101 taaccaagca tatctccttc ccgacccatc atttggcaca aatgacccaa atagtatact 113161 gataaactgc agaccagtgt tctaccattg cttaggacta caaacgtatt tcctaataga 113221 agtggacact tgctgggcct tgtaaaattt tctgatacta tgcgacacga taagggaaca 113281 aaggagaaaa gtattgacag ccatgctgtt taaaagtaga aaacgggctg ggtgcagtgg 113341 ctcacgcctg taataccagg actttgggag gctgaggcag gtggatcact tgaggtcagg 113401 agtttgagat cagcctggcc aacatggcaa aaccccatct ctgctaaaaa tacaaaaaaa 113461 ttagtcgggt gtggtggcat gcatctgtag tcccagctac tcgggaggct gaggcaggag 113521 aatcgcttga gcctgggagg cggaggttgc agtgagttga gatcgtgcca ctgcactcca 113581 gcctgggtga cagagcgaga ctccatctct gaataaatga atgaatgaat gggatccatt 113641 ttgaattcca gagattctga tagagctgga attcaagtgg gtggagcata cctgctagac 113701 tcttgatgga tgcttctctg 
cagctgagca tttcttgtat ctagaggtat ttttgaaagc 113761 ttcctgagta taacaatcac ctgggttgcc tggacgctct ggctttctgg gccccttccc 113821 agcaaggcct gggttaatgg atcatggatg caggctgaac agctgtcact ttagccagtc 113881 tccgggtgat tgttatcact aggcaggtct ggggaaacac tgcctggagt catactgagt 113941 catactgagc acaaatattt ttgcctcagc gttctgggaa tgcttaacct tgttgaaatg 114001 agaacacgta gagttttagc aaggatattt tctagtgctt taaactcatg gtactggaga 114061 tggataaaca ttgccataga tacttttgtc agtagatgtg aaaaggcgtt tgttcagtgt 114121 ttataatgaa ggaagaactg ggatggtggg tggcaaggca gcgttggagg aagacaaaga 114181 gtaggtttga ttttttgttt ccccttcaaa aatctgtcaa agctgtcctc agtcacataa 114241 ggcacaggct gatgaaggca agagcacatg ggctgtgatg ggaaactctc cagatggtag 114301 acaactttca acaccaacat attgttgttt tatgaaggct tcatttattt gctgagctag 114361 cttgtggtat gtggaacagg atgtgtcttg ccctgggcag ttgagcgctt agtgtattaa 114421 gggctcttta tcctcgcctg accccacatg aaccgaagag gtgcttgcat gaattagtac 114481 aggaccacca gggtggaggg tggggtggag tgtgtgtgta tgtataggtg ttgggggtaa 114541 tgtgttatga atagctggat gccaggggaa tgttctggga agtgcagtca gttctttgag 114601 cagtgtggat tgcagttgcc tcagggagct cttctgagtg gcataaggga gtgggtgcct 114661 cacagttggc ccttctcggc aagtggccca tgggtccttc ttgaagcagg actattctca 114721 gttcttctaa tattagacat gttcctgttt gcctcccgag ttgctctcca gtcttctatt 114781 ttatttcaat gagagctgtc agtttttctt ctaaaagcgt tctttgaaca ctctgctttc 114841 tttgagctga gttcaacctg ccaggaactc tgctagttcc acgacaggta caaatatcaa 114901 aggcaaattg gatctcacaa agttccccga ctgtatcttg catttcagag ttcagagaag 114961 tttctttaac gaaatgcaag aaagatggat gttgttgaag caatttcagc agttctttga 115021 agtgggtcct ttcctggacc ccagaagatg taaatttttt tttttttttt cccctcaaaa 115081 ttaaatgaac atgcagttct taaaatgacc tctgggtgga gctcgtggta ctcacaaaga 115141 acactagcgg cgttgaaagc ttttatgcag ctttctactc cggcgttgtt tttgaaaaaa 115201 aaaaaatttc ttgccctttt tctctgctgt cctcgatttg attgcctttt aatagtcaac 115261 aagaaaggag agactttgcc taataattcc ttgaatctta gatggaaatc tcccactatc 115321 ttagggtttt gtctttctga tactgagtag cccctgcatt 
aagaaaggat ttgcctgcat 115381 ttgaagtgga ggtgaataac tcgcctgctc ttcctggtat gacttggcac tgttctttgg 115441 gcatgtgttg tatgcagtgt atcacccagg cggaagattt ctcatcaata atagtgccca 115501 cactgggcca atgctgagtc acttggcaac ccattgggaa agctggaaaa atgttctgat 115561 ttgtgaaagg gagaagcttt tctttccctc accatgaagc tctggaccgc agctaattac 115621 agagcctcct aagaagtgtt aacctcagca ttccatagtg accgacctcc tgaaccgggc 115681 aggtacagaa agctgcagtg ttctgaggcc taataccttg gcttttgtag caatgacctc 115741 gtctcaaaat ggaagtgtgt ggatgtggct ttgttgtact ctttgctagg ggtgttcact 115801 gtaaaagata aatgaggatg cctctgctcc cgtgaagggt aaatttgttt tctctgtctc 115861 cctgcctttt cctgctcatc tcttaagagg tcacgagtag tttttgtatt gtgaaactga 115921 tggtgtaggt ttcaagtgga aatttagtat cgtgaaataa gcaccgcttt ttgtttagga 115981 ttctctgttc cctgcaagca tgtgctctta ttgtaaataa gcagcaaagg acagggcatc 116041 tgttattttt ctggtgacca cagctgttga atgagtgtca ggtgataaaa taatgatttt 116101 aggggtgtct tctggaggat ttgcccttgc cagagtagaa gaggaggatc tcaaggtcgc 116161 tacatcagga ttctgactga gacctggcag gagagcgtca gatcaggagg agcaacttcc 116221 tgagcttggc ttatctccag ctggatttga ctatatagcc aaaaagctgg atcccttcgg 116281 aattatttgt ttgtttgtgc tctttgcccc tgggacctaa gtttgtcagg tcacatccaa 116341 agaaaaatag caggatggga ctcacaggga ggtggcatcc taatgatgct ctgagggagt 116401 ggccttaacc ttggccacag caggtgcaga gaagtcggct gaatggtcct aaatttattg 116461 gcatgtgact ctcacacaca ggctttgagt gaggtggtga gagttttctt gcagagggcg 116521 aaacagaggt gtagcccact tctttggctt tgtctctaag ctctaccagg cctaggtact 116581 ggacttctcc caggtatgcg gtcagggctc tgcttatatc cggagcctgg tggttctcaa 116641 ggtgtggttc cagaccagta gcagcagtgg tccctgggaa ctcattagag atgcaaattt 116701 tcacatcccc acctgctgag ctctgggagt agggctcagt gccttgtgtt ttacctggtg 116761 attctgctgc gtgaaaaggg ttgggaaccc ctgtcttgtc cttaccactt cacagcacac 116821 ctgcctgact tccaactgca gcctcctgca ttttggccaa gagcattctc tccctgctac 116881 agcctgcttt gctactcata aagagggccc aaaggacggg gaatgaccac accttcctca 116941 gaagtcctca cccaacagtt gatgggagtt gctgtgtgtg gcccccgctc cctcatccct 117001 
ctagtgcgtg ttccacatgg gctctgggag cttttccaat gggctagagc tattatacta 117061 gctaagatac tcgttgctga ctgcctttct ttctctacat atttcctctt tctcctacca 117121 gtgtttcttt tgctgcccaa atgaactgct cgcacttgaa tccttgtctt agagtctgtt 117181 tctgggggat tcccaactag gacacagccc ctgggggtat acgctttccc agtgcttcct 117241 tgcctgtaag gcaacagcac atttcaccga gtctggttgc tagcactgct tcagcagact 117301 caatgccaca aacatttcta gcacaccaac agcatgccag gtgctgagcc gggattcgga 117361 ctcagagatg agcagccaca gtcccttttc tgcgggggcc ttcaggctcc taggagagaa 117421 caggaggcca tggatgcagg gcagaccagg agaagtgtta ggtaacgtga tggagacgaa 117481 aggggtcctg ggagtctagg ggagctcaaa catgacatgt gagggtgcct tacttgggaa 117541 gtgcctactc tgcccctttc tctcttcctc tgtggtgttc ccacaccctg tcttgcagcc 117601 ttgtggcaag tcttcctctt ccttccttac taatcccaac tctgcatctg gctgtcactt 117661 ccccttgata ccttcactcc tgcttcacct cactgtgcat tttaaattgt ttcgtaggag 117721 aaaagaaaca gaagggcttt catcaaactg tctcctgccc tttccagtta cagattctag 117781 cacatttgca taaccgctca agaatgcagt caccctgcgt ctggcatcca ctccagggct 117841 caccagggac agtgactggt agaccagcat ggaaaccgcg ggctgtggct ccctcccatg 117901 tgccaccttc ccgagcttgt cccttctctt tggctctatg tctctctcac acacatgcag 117961 gccaagatgg cggccttcat acttgtatct ctttcctctc actggcctca acaggaagta 118021 tggtatagag gtcaggagta tagacaggtg agagactgcc tagtttccaa gccaacttct 118081 acaccttatt gactgtatga ccctttggaa attacttaac ttcttccacc ctcagttttc 118141 acatccataa actggggaga ttggtccctg tctcagaaga tttgttatga ggattaaaag 118201 agccactgtg cctaaagtac ttataagaat gcctggagca tgagaaaggc tccccaagtt 118261 gtcagccgtt tggtccaaac tgctgtgcct ctgggagggg aggctttggc tggaccagac 118321 ggtgagagga gggatatttg tgagcagccc ctgctgagag ggtgttggag gggatgcatt 118381 tcttttctct ggattttgga ttttgtggtt ttgaacaaat tcagtattgt cgttgtgata 118441 tatattttaa acacaaaagt aaatatctgg tataagtggg agtttgatgc atatatctat 118501 ttattttttc gagttaaagt ctcgctctgt tgcccaggct ggagtgcagt gttacaatca 118561 tagctcactg cagcctgaaa ctcctgggct caagcgatcc acccacctta gcctcctgag 118621 tatctggaac tataggtgtg 
tgcccccacc caccttagcc ttctgagtac ctggaactat 118681 aggtgtgtgc ccccacatcc agctaagtgg ttttcttaat tttttttttt ttttttggta 118741 gagatgggat ctcactgtgt tgcccaggct ggtctcaaac tcttggcctc aaccgatccg 118801 ccagctacag cctcccaaaa tgctgggatt acagtgtgag ccactgccac acccagccaa 118861 catatattta ttaaaggcat agtacatgtg atatcttacc ggagtgattt tccatgctct 118921 agagcctcat gtaaatgtca tttaggggga tgctttaggt cattcattct cacacttgag 118981 catgcatcag tgccatctga agggcttgtt aaaacaccaa ttgccagccc cacccctaga 119041 atttctgatt cagtagtgct aaggtggagc ctggatattt gcatttctga caagttctca 119101 ggcaattctg atgctgctgg tctggggcct gtgctttgag agctctaggt gattacaagt 119161 gaaaatgtcc ttatgccaag cctgtgttaa tagaagggta taatgttcat taagtgattg 119221 tcttacctct tactggagtc cccttttcac catttctcgc aggtgggtcc cctgagtgag 119281 tactaggaca cccagaaccc agccaggcca cctactctac tgttgctgtc tctgctaaag 119341 agccctgctt gagagagatc caaaggctct ttcttggtca tttctgctcc ttgactccac 119401 atctgtcccc aggaactacc caggttaagt cttccacata aaagcccttc caacgtttga 119461 agatgtttaa atgtttttct tttttcttta ttttatttta tttatttatt tttttcgaga 119521 cagagtcttg ctctgtggcc cagactagag tgcagtggtg caatcttggc tcactgcaac 119581 ccctgcctcc cgggttcaag tgattctcct gtctcagcct cccgagtagc tgggaccaca 119641 ggcatgcgcc accatgcccg gctaattttt gtatttttag tagagatggg gttttgctat 119701 gttggccggg ctggtcttga gttcctgacc tcaagtgatc cgcctgcctc agcctcccaa 119761 agtgctggga ttacaggcat gagccaccgt gcctggctgt tttttctatt tcaaataaaa 119821 cacctttagt gtttctcaca ttaaattaac caacattaca gtcagactgg atgttctaaa 119881 tgcttcacaa atattagctc tctcaatgcc ataatatccg tatgaggtag gtactttttt 119941 ttttttttta atacagatga ggaaactgaa gcacagagag tgtgagcaca cttacccaag 120001 gtcacacagc tagtaggtag aagagctggg attccctccc tagaggtctg gtttcacagt 120061 ctgtgctttt atccgcaatg ccagtgggtc tcaaagtgtg gtccccagtc cagaagcatc 120121 gccatcactc aggactagct aaaaatgcag attcttggct ctatcccaca tctatagaat 120181 cagcaactct gggagaggcc ctaggcatct gtactttaat cagtcctgct ggtgattctg 120241 atgcagccca agtatgagaa ccattgctct aaggtgtggt 
tctgatgcct catcctgctg 120301 gctgctctcc agttttcaat atctttttta gaagcttaag ttcaaggctg tgcacaatat 120361 tctaggtata gtgtaactca cacagcacaa tgggaatatt attactgcct ttgtcttgca 120421 tacaaattat attaagaaat ccaaagttta cattatggct atcttttccc ctcagccatt 120481 cactgttggc tcataacgaa tttctagtca gttaatttct aggtcttttt ttccccgctc 120541 acaggaactg ttgttaagcc ctgttgctcc agtcctaaac ttggcattaa aaaacaaaca 120601 aaaaaaaaaa ccctcatccc agaacttcac atttattctt attaaaatcc atttagttgt 120661 tttggttcat ggctctggct ttcaaaaagt atatttctgg acctcctaca tgtttattat 120721 tgttttaaaa atctatgctc atttaaaaaa tctggaaaat aaagcacaac agtagaaaga 120781 caaaaaaaag ttctagttcc atcatctaaa cataactgct gctaacattt tgatgtatca 120841 ctttgacttt tttcccatac ataggctttg aaaggcattc ttgaacaatg atgtttataa 120901 aaatttgagt tcttttttct cttaacattt attataagca tttcttgtat tattgcatat 120961 tctttatgca agaatattca ctgttggctg ggcacagtgg cccacacttg taatcccagc 121021 actttgggag gccgaggcgg gtggatcgct agaggtcagg agtttgagac cagcctggcc 121081 aacatggtga aaccctgtct ctactaaaaa tacaaaaaat tagccgggtg tggtggcagg 121141 cgcctgtaat cccagctact tgggaggctg aagcaggaga atcgcttgaa cccggaaggc 121201 ggaggttgca gtgagccaag attacgccgc tgcactccag cctgggcgac agagtgagat 121261 tctatctctc acacacacaa aaagaatatt cactactatg acatattctt ctagtggtgg 121321 cacaggaggt actacttaaa catttctcta tagttgcatg tttaggttgt gctgttatag 121381 ttattactat gatacacatc ttttgcataa tgccttctcc ctaaatgtag gactaattcc 121441 ttagaatggc atctcagaca acctttgaaa aggtcaactc tttccttcag tatatcaact 121501 actgtttcta tgtcgttcat aacttgataa gtttgttttt tagttttatt attttggggt 121561 tattatgaat ggacatgctc aaaaaaaggg aattggaatc agccaatttg ttcatccttt 121621 ggcaatgccc ttccagattt atctattgaa caaaaatatt cagggagaaa cttctaagtg 121681 gtgagtcttt tgttaggtgc cagtcaattc actaactaat agatttttgg tctagttgtt 121741 ctactttgga caaactccat ggaactatga tttaaattct cccttttgcc tgcaaggaga 121801 ccttgagata tttagcaatg cgttgatatc atccagttat actgtattta gggcattctt 121861 taatttacca atgtattgtt ctgtttaaaa tgtggaaacc tgaggaaatg aacaaagctt 121921 
cactgggttc tcatttttct gaagttccat gttacctaag tgcagattat atcattctgt 121981 gaaaacagcc agctgttcag gcaaccaaat cagaaatctc agtctttcct cttagccaac 122041 ttgtaagcac aaatcgtgtt gcttctttct gccagctctc aaatttatac catttctgtt 122101 ttagttattc ccctccctta gcttctcatc cttaaacatt ttttttaatt gaggtaaaat 122161 tcacctaaca taaaactaac cattttaaag tgaacaattc tcaattagat agaagaaata 122221 agctctcatg ctcaatagca gattatagtg actacagtta gcaacagtat attgtatatt 122281 ccaaagcggc tagaagaaag gacttgaaat gttaccaaca catagaaata atgaatattc 122341 acagtgatgg gcactccaaa taccctgact tgatcattac acattctatg caggtaaaaa 122401 ctactcacat gtacccataa atatgtacaa atattatgca tcaataaaag aaaagaaatg 122461 aagtgagcaa ttcagtggca cttagaacat tcacagtgtt gtgtgatcac cacctctatc 122521 cagttccaaa acatttttgt aactcccaaa ggacactttg tacccattaa gcagtcactc 122581 tccattcccc actcccccag ctcctgacaa ctaccaatct gctttttgtt tctatggatt 122641 catgtatttt ggatatttcc tgtgaatgag atcacgcacc aatgcgattt ttgtgactgg 122701 cttctttcac ttagccttat gttttcgaga ctcatccact ttgtagcatg tatcttctca 122761 ttctgtttaa ttggcgtccc cattgatagt cttggaaaag gaatagcaaa ataaacccct 122821 gtagcatttc tgatagtgag atctttcctg tcttctctga actttttgca atagcatcgt 122881 agtgtttatt catagctgaa atactatccc ccagctatat attactgaat tgaagcaact 122941 cctgtaggga aagaagtatg tctgactact ttatttttgt attttcagaa cctaatgtgg 123001 atctgacact ccattctttg accactgggc agaacttagt aagaaccatg gctttaccaa 123061 acactagcta tgggacctca ggcaagatgg gcagttttgg gggtctcagt ttcctcactc 123121 atgaaatggg gacaaatact tttgtcccta tttttactga gtggaacaat taatatatgt 123181 gcagtggcta gcacagagta agtgctcagt attagtaaga tgacatgttc attttttttt 123241 cccaagtttc ctctaactgc ctccctcact aaataacatg ctccattata gtgtcttaaa 123301 attgggaatg aaaaagtgat gccagttaca taattacatt gttttctttg gggcagaggt 123361 acaaaataga attttattaa actttgcctc taatatttaa aaatattttt atggaaattt 123421 aaaaatgtac gcaaaaatgt acataactaa tctccgtata cttatcaccc agactcaaca 123481 gttagtaata ttttatcatg ctgatttcat ctactcccct acttgttttt atgcttgaga 123541 attttaaagc taattccatc 
tatgaaatca ttttactatg attccatctc tgacaccatt 123601 ttactataaa tatttcatta tgtatctcca acatataagg aattttaaaa acctaactat 123661 aataccatta acatacttga caatattaac attaattcct tattagtatc ctgtctgtgt 123721 gcagtttcct ataattatct caaaattgtt tgctttcttt tttttgaaaa aaaaaaaatg 123781 gttgatttga atcagtatcc agatgaggtc tgcatattgc agtcaatagg tacatctttt 123841 aaagcttttt aaattttaag agtaccgcct ctccatccat gaaatttatt tatgaaataa 123901 attgagtcat ttgtctggta gaatttttaa caatcttgtt ttgctagtta tttcttaatg 123961 ttgtctttta acatgttgtt ctgtctctta cattttctgt aattggttct atctagaggt 124021 ttgattagat tcaggttcaa ttttctttgg ataagaagac ttcataggca gtgctgtgag 124081 cttcctgttg catctatcaa gaggctcaca atgccacgtt gtcccacttg caaagcgaag 124141 ggcgatccgt gtgttcaggt gttgtcagcc tgatcctttc atttgaactt tcttaccatt 124201 cctcatgcag cagtcctgtt tcttggctgt acattagaat cacctgggaa gctttttaaa 124261 aatcctgagg ctcaggccac accccatatg aagatcagac tctctgggag tgggacacag 124321 gcattttcta aagttaccca ggaggggccc ggcgcagagg cttacgcctg taaacgcatc 124381 actttgggag gccgaggtgg gtagatcact tgaggtcagg agtttgagac cagaccagcc 124441 tggccgacat ggtgaaaccc catctttact aaaagtacaa aaattagcca ggcatggtgt 124501 catgcacgtg taatcccagc tacttgggag gctaagacag gagaattgct cgaatccagg 124561 aggtggaggt tgcagtgagc caagatcatg ccactgcact ccagcctggg tgatagagca 124621 agactctgtc tcaaaaaaaa aaaaaaaaag ttacccagga gatacaaatg tgcagccagg 124681 cttgaaacca ctgacttaat tgtttcagtg gccattgata atatttcatt gattcgtgat 124741 tccattcggt gttgcaatgt ttgattttca aattctatca ttctttttgc atttattagc 124801 taggctgttt ttataaaaaa ggattttccc tcatcaacca cttgattact ctgacatata 124861 gtttatacag gaaagacagc ataaatggct atctttctct tgattaattt tcacaagttg 124921 gtgccctaag cacttccaga ggtgccaaaa tagttttctt tttagtatca ccaacttagg 124981 gaatcttatg tatttgatgt gtttaaattt gttatgatca tcattcactg aagctcagat 125041 tgtcccatct tggccaatgt ggggctcttc ctttagtgaa ttttgacatg accctagttg 125101 tttttgaaag catctttgcc tcctggcaca agatattctc tttaatgttt aacctctatt 125161 taatcataag ctgttttgaa caatttgtaa atcagtgcct 
aaattagttc tcccaccaat 125221 gaacagtata ttgatttttt tctgatgggg aaagggtagc accttactaa cttgaaatac 125281 agcattttat taatgagcaa atttgcagaa atatctgtta gagactgatg taaattatct 125341 tttgtaaggg gcagtagctt atatcaatag aaacacacgt aatatgtact ccatggtttt 125401 tgaaaacctc tgcagtgttg cattttcagt gttgatcatc aagaatacaa atatataacc 125461 atgtcataca ttttaaaaac aaataatgct agtgttgagg aatataacta atttcaaatt 125521 tgtggaacag tgggactata ttaaaagatg agtctgggtg tggtggctca catctgtaat 125581 ctcagcactt taggaggctg aggcaggcgg atcacttgag gccagaagtt tgagaccagc 125641 ctggccaacg tggcgaaatc ccatctctac taaaaataca aaaattagct ggacgtgttg 125701 gcacgcacct gtaatcccag ctatttggga ggctgaggca ggagaatcgc ttgaacccag 125761 gaggcagagg ttgcagtgag ccaagattgt gccactgcac tccagcctgg gcaagagagt 125821 gagactctgt ctccaaaaaa taaataaata aaataaaatt aattaattaa ttaattaaaa 125881 aaataaaaga cgaaacttgg aatagtaaat acttaaaaat tctaacaata tcaacaggta 125941 gtgatagttt aaacatgtac tgtaaaagtg tttggttttt aagggtatgt aagagactgg 126001 ttgtctgtgg gactttgaaa gcagttggga aatgccacaa tagcctcagt ataaaattct 126061 agagacccaa agcggagaga gatgcctctt aagaattatg gaggatattt attactttta 126121 ttcatgggat gggtggtggt gatttaatgc ctgtctcttg tttcttttgt ttttttttgt 126181 tatccaaatc caaatccaaa ttatatgagg aattcactta aaaatatacg tttagtttgg 126241 tttaataaaa ataatcagtt ttgatgtatt gacttttaat ttcatggggg aaagagtcca 126301 ttaggaatgt tcaggctttc tgtgtatatt ttgtgttgct cattctcttt ggcccaagat 126361 gaaactaata gccacaatgt gtctgctttg gtgctgtctc tagggttgtg tctgctgctg 126421 tcagctgcac ccttgttatc tggctgctgg tcctgctcag tcagttgtac aggaatgtca 126481 tttccattaa taaccccagg gccggacagg cagatagtgg aaaatgggtg aagagagaga 126541 aattgtattt aacccagtgt tctgagttgt catttttgag accctagcat agctgggtaa 126601 aggcaagcag agggcatgac agcaaatcta tttctggcag aatctcatat acaaactgga 126661 gaaacagcct gataccaaga gaagaaaggg tgaaatatac tgggaaaaga gggaaagatt 126721 tggggttgga aaagtcagag gaaacaatgt taggaagaaa taacatgatc aagtattctg 126781 atagaggtgc tgacaacttt gcaaaataaa tttaattttg attgtaaatg ccatcatttg 126841 
tggtggttca aatgaacaaa attccttttt aaaaaatgaa atttatgtaa ggattgctca 126901 tagaagattt tggagttttg tgctatgttt tttttttttt ggttctggat cagtgaaagg 126961 cccgaggaac agagtaggtt ttattttttt ttccatctag gtagaatgac tcagttactg 127021 tagttgtctg tgttttggtt atttattgct gcataacaaa ttaccccaaa acttgctggc 127081 ttaaaacaag tattttatta tttcctatga ttctgtgggt tggctgggct cagctgggtg 127141 gctcctctgc tggtttcacc tggagtccct taggcagttg cagtcagatg gtggctgtgg 127201 ctggacgtcc aaggtggctt cttcacacat gtgtctggca cctcagtgtt cctcatgtgg 127261 ctttctcaac agggtgctgt ggtactccta cagggtagct aagagtgcca agaagagaaa 127321 gcagaagctg ccggtcctct taatgtctgt tctcagaagt ctcagaatat aattgacaaa 127381 taataattgt atatatattt ttggggtaca gtgtgatgtt ttgatgtatg tatacattat 127441 agaaagattc aatcaaacta cttaacatat ccatcacttc acaaacttat ttttttggta 127501 agaacattaa aaatctatta ttttagcaat tttgaaatat ataataaatt attattaact 127561 gtggccacca tgcagtacaa ttggttacta aaacttattc ttccagtcta attgaaattt 127621 tgtgcccttt aatcaaaatc tcccctttcc ctatctctcc ccctccctcc agcctttggt 127681 aactactttc tactctctgt ttctatgaga acaacttttt aagattccac atataagtga 127741 gattatacag tatttttctt tctgtgtttt gcttacttag catttttttt ctatttttct 127801 gtttggttga ttaggctatt tttcattatt gactgaatta tttttttctg tttagctcaa 127861 gaccattgta tacatcgact cacagaagta tttgagtaag cctaatcatt gttgaactgt 127921 gatggggaaa ggattcagcc ttgggacacg tgacctttgt gtacaggtgt aagaaattcc 127981 tactaggttt tagggcccag aactggacta ggccatgagg gacccaaccc atcctagtag 128041 gcaaatcatt ttccagtttt atatttgagt tcattagtaa ataagtttac tttccattat 128101 agacaccccc tccactgact tatggggtcc tgtacgccat aaccaaagac tcagtgttgg 128161 aaaggtgatg ctccactctg tgtctttgtt tagctaaggt gaaggggaca ggtctgccac 128221 cctggtatat tctggtattc tctcattggc agtcttagtg aaggctcatg agagcttcct 128281 cacagaggtt tgtaaggatt agaactgagt ggaatagacc tgttccaggg cagagtgaag 128341 ccgaggacaa cagtgagatt gtgtccttca gggccacaga ccagaacctt tactcaactc 128401 tgatgttaag ctataaagct agtttacaag tttatggaac aattcagctt gttcttccag 128461 gctcttgaaa atgaaggtta 
aggctgggcc cgtggctcat gcctgtaatc ctgtaatccc 128521 agcactttgg gaggccaagg cgggtggatc acttgaggtc aggagttcga gaccagcctg 128581 gccaacatgg caaaaccctg tctctactaa aaatacaaaa attagccagg cgtggtggtg 128641 cctgcctgta attccagcta ctcgggaggc tgaggcagga gaatcgcttg aacctgggag 128701 gcggaggttg cagtgagccg agatcatgcc attgcactcc agcctgggcg acaagagcga 128761 gactccgtct caaaaaagaa aatgaaggtt aagttttggc aaagtgattt gtatagatct 128821 gtaatacatc atgttctaat gaggtccaat gagttctaag taatttctga tttccattaa 128881 aaatgattca tgaaaaaata agaatctcat cagagagaga gagagaccac ttttgaaatg 128941 aggaaaaatg acagaataga aaaggagaaa gtctcttttc tctcttcttc cccaagcccc 129001 cctcatcact ggttacttat ttgagcatcc tttatcagca tctaatgaga atgagaacat 129061 agtgggatgc catgtgacct tttgagtagg aattggaagc tcttatgaag tttccagcct 129121 gatgtcactt cagatccagg tcattaatat ttaaaaaaat aaaaaaaata aaaactctta 129181 ggtgattcga atgagcagcc aagtttgaga agcagagccc tactggagca atgcttctgc 129241 aacttgaatg tacatgcaaa tctcctgggg acttggttaa aaatgcagat tcttaataag 129301 tttggagggg agcctgagag tctgcatttc taacaagctc ccaatggagg ccgatgctgc 129361 tggacaatgg cctatggacc atactttgag aatatagagc agggggcagc aaacttttca 129421 catatgaagg cctgagagta aatatattgg gctttgcagg tcatgtggtc tctgttgcag 129481 caacccaaat atacctgtag tacagagcag caataggcaa taggcaaaca aatgggtgtg 129541 gctgcgtgcc aataaaactt tatttacaac accaggcagt atcctgaatt tgtcttacag 129601 gccgtgagcc aataaaactt tatttataac accaggcagt atcctgaatt tattttacag 129661 gctgtgactt gccaacttgg tctagagctt tccctgtctc aagtttttga gccgtatctt 129721 gcttcttccc tgttttgttg ggctgaatct taggtaagaa ttaaatggtt ttgaaagagg 129781 ttactcaatg gctttgttga actgatgaat aattgatgat gacatttagg ctctgtttac 129841 aaacagcccc aattgtccct tggttgtggc tcattgcttt gccttgaata taggtggcct 129901 ttctgatttc tctccctaga ctgcctcttc agcaatggat cgaaacagag ctagggttta 129961 ttctaaaatg tggcccacgg gcatctcaga ataaataacc aggtctgaac tgtagaggca 130021 gagggtggct tcaggtcact tttccttgac ctgcatttta caactcatct ctggtgtttt 130081 gtctaaggcg acagaggctg gtattttggc tatctgagag 
gtagagagca gagaggttcc 130141 tggacatgta cattttgtgt tgagataggt catgccccag atatggctat taatggtagc 130201 agagagttaa atgttgctga taatatattc taataagtag ataatacaga accaagttgt 130261 cagagttttg gggggaccca aatttggaac caaggactca aagctcagaa gggttccatc 130321 ttgtgggctt caggagtaag gtgcaagatt cctggtctgc tgctcatggt aaatttcctg 130381 catttcaaac cccaaagtca gttaggccta agcttcagga aagttttcag tttaattggt 130441 tcaaaactga aacagactct catctatctt tatgacagtg ctaggattcc taagggagag 130501 gtgttactcc tcttctgaaa tgggaggatg tgggccagga atccttgggc cagggatgct 130561 tatggcagac tccatttctc agcatggttt tggctgggaa tttcctctat gagattgacc 130621 cttctctccc cgaggtgcca ttttctggaa taccaccttc tctctgctgg taaaatctaa 130681 tttaccttag tgccttgact ggtctgcaaa ataaatgttc tgacgtttcc tgacaggaga 130741 gcctgtcctc tctgctttct gcttctttat cattcttgta ctttcttgcc ttgccttttc 130801 cattttgatc ttctctcttg gccaagtctt gtcttccggg ggctctcagt caaagtttag 130861 tcccaccgat gtccagaaca gaaagctcag tggtattatt tttagagagg gagggagaac 130921 ctcttaaaag catacttaaa attctatctg tgtgtgtctt ttcatcctta catctgagaa 130981 tattatttgt cttggcatgt tgaatatggt tactaatgca gatgtggtct gtgccttaca 131041 tgttattcag tttgatttat aaaaaatgaa taattatata atagaatatt taatcactca 131101 gaagataata tatgttggga aaacatgaaa tttgaagtca gattaaggct aaattaatct 131161 aggtgagaag ttggttttgg ccctcattgc ctgtgtagac ttggggtgag ttgcgctcat 131221 tttctgagtc ttagcttatg agctgtgaaa tggggatgat ctgtacctac cttgtacagt 131281 tagggtagga tttggatctg atgatgaagt aaagtactta gcacagtacc tggtttatag 131341 taagggttta atggagagca ttacttttgt tatttaaaaa attatgtcta ttgtttctgt 131401 ttctgtttag ctctgagatc tcctcctctt tgaaaatgtc ctttccacaa ggcccttccc 131461 ttcccatcca ttccatctga ccctctacct tcttctgttg ttttttctct tacatcgcat 131521 tctctcccta tgaccaacta tgaactcctg aaggaaagaa atccagttta ttaaactgtg 131581 tgtcttcact ccgtctaaac acaacaacat ggacaaattt cacagatgtg ttgttgagtg 131641 gaagaagcca gatacaaaag agtacacact ttttttgaag ttcaagaaca ggaaaaagta 131701 atttatggag acagagatca ggatagcggt tcttttgaag gtggaagggt tattcactgg 131761 
ggaggtaccc aggggagtct tcaggatgct ataaatgttt taaatcttgt ctcaggccta 131821 ctaacacaga tgtgtgtatg ttttaaaaaa tatttgattg gtatacttag tatttgtgca 131881 ctttgtacga tatagtgcag taaaaatttt aaaagatgtt tcccagtatg cagggcttat 131941 ttgcctttta gtgatcattc agtaaaaggt tgaatattta atggatgtca tgatgaagta 132001 aatgatcaac tttttttttt ttttttgaga cagggtcttg ctctgtcacc taggctggag 132061 tgcagtggca ccatcatagc tcactgcagc ctcaatctcc tgggctcaag agatagtcct 132121 gcctcagctt cctgagtaat tgggactata ggcatgcacc acaatgccaa actaatttat 132181 ttttttgtaa agatggtgtc tccctatgtt gcccaggctg gtctcgaact cctgggctca 132241 agcccccacc ttggcctccc aaaatgctga gattacagct gtgagccacc gtacctggcc 132301 ctaagtggtt aatcttaatc accatgtgac caagaccaaa ttttaaggaa gatgtagcat 132361 acattttgga aaacatggta gacataattc ataatttata gccagaattc acgctgttgt 132421 tcatggcgct acttggatct tcacatagct gtttcagaaa atatagtaaa tgcgaacggc 132481 aggacttgca atttcagaca gtttgtgcag ttggcaggac agagaacaca gagtggggat 132541 acggtcctgt cagattggtg gaaaacctca tttttaattg attgtctatt gcatctatgc 132601 ccctcttttc ccaccctccc tttgtccttt tatcaattct acttggaaat aaaatctgca 132661 actctaatag atgctactga cactcgtgca gcaattctac cagatggcat tcgaatctcg 132721 ggtgttctat gaaaaatgaa aagaccaaac acatgcacaa ataaatgaag gttgaattaa 132781 tctttataat taaacggaat attaccctgc tatgactgat ttttgctttc aaactggatt 132841 gcgtgctaat ttttcagtga actgtaattg tattaacatg gcttttttga tgcctgaaag 132901 tacttaaaaa tagattttgc gaagttactg agagaaaagt tattagaact tttaatctaa 132961 acatctctaa gatcctgagg tatatttata tcacgactta cagctgcaaa gcattttcca 133021 atagtttttt attctttgtc ccttatcatc ctgggattta ggtggcactg gccagaaatt 133081 gggttgaatt gcaagtacct gaggcacgtt agaaaattga tgtcctactg tgtacccaca 133141 aaaattaaaa ataaaaatta gaaaacaaat atgtacaact gctatgtacc cacaaaaatt 133201 aaaaattaaa aaaatgagtg ggagggaagg tggtagagtt ttctacataa agtgcaccta 133261 ctagcttagt agtggtgact attagtgaaa tccctaagga ttattgaagc agaggatgag 133321 tcccctactt atgaactgat aacacagtgt tgggggttct cagaagtggg tccagaagcc 133381 ttggtaccat ggcaagtacc 
atgatctaca gcagagagta tctcaaattt cagcaggcat 133441 gagactcccc tatagtagtg gttttcaacc cagagtgatg ttacacacct ggagacattt 133501 ggcaatgtct ggagatattt ttgatcacaa ctggggggac agagggtgtt actggcattt 133561 agtgggtagc agccaggaat gctgctaaat atcctacaat ccacagaaca gtccccagag 133621 ataagaatga ttctttccca aatgtcaata gtgccaaggt tgagaaaccc tgccttatgg 133681 taaggctcat tataatgcag attcctgggt cccagttcag aagcattcta actcaagaga 133741 tgtagcattg tcccacaaat ttgtgttttc aacaagctcc cacgtgacac agatgctgcc 133801 agtccaaaga caaagctttg ggaagcccta agctagacga tgccagtgtc aaaggaacaa 133861 gccatatatg tgggcactgg ccctatgaca ttcccttctt ttagaaaaat gatatgaaaa 133921 ttcccacttg cattgattga ttgattgaga cggggtctcg ctccgttgcc caggctcgag 133981 tgcagtggca tgatatcggc tcactgcaac ctccgcctcc cgcatttaag caattcgcct 134041 gcctcagcct cccacgtagc tgggattaca ggcacacgcc accatgccta gctaattttt 134101 gtatttttag tagagacgag gtttcaccat attggtcggg ctagtttcaa acttctgacc 134161 gcaggcaatc cacccgcctc gacctcccaa agtgctggga ttacaggcgt gagccatcgt 134221 gcccagccca cacttgtcta atttaataat tcaccaaagt tatctcattt gatctctccc 134281 caccagttta tgagggaggt acttcacaag gtttttaggt ggtaaagaat tgttcatata 134341 agagctaacc ttaaaggaaa gaatgacaac aattcccact taaactgttg atgagacaag 134401 gggatatgtt tatctaggtt aatctccaac ttgcctcctg tttaaataag taatttaaca 134461 actggaggag gattaagtag ctgattaaat ttctctgaga aacgttaggt gttagggcct 134521 tggtatttat tgcaggtgtg tttctaagcc ctttcaacct tgatatttga aaatgttttt 134581 aaataatttg taagtatttt ttcccattcc ccaaaaaatg cattgattta tttcatagta 134641 tttattccca ttttggttaa tatttctata ctggctgtgt tttaaatgta gatatttacc 134701 tcaacaaaat atgaatatta cagctaatta tagaattatt taaatagtta ttatttaatt 134761 atatatttct atcttgatga aactttccga aaagattaac ctttgtgcct tatttttccc 134821 ttgctatctt tactattggt tttgaaaaat ggggaaacct cttgtgtaga agaggggttg 134881 gcaaaccatt tctgtaaaga gacaaatggt aaacatctta ggcattgtgt gggctatatg 134941 gcttctgttg caactactca aactccactg ctgtagcatg aaagtggcag taaacaatac 135001 ttaaatgaac gtggctgtcg tcagagagaa aaaagcaact 
tttcctttac cactttaggt 135061 ttaatgtcag ggcctgtgaa ttaaactata aaagatggat taacaagaga aaagcaaagt 135121 ttattcacta acctgcatga gagcttacag aaaacacagc ccagagaaac agaattgggg 135181 ggttacatat tgtcttaaga aggggtaggg ttaggatttt aagggacaat aaatggtggg 135241 aaatgactga gaaatatatg aggggaccta acagaaagtc aggcttgctt tagtaaggtc 135301 tgtttatgta agttctcatc ctggcgccaa ctttttatcc ccggtgagag gagttatttt 135361 cccccggtgc aagaagtgtc tcttaaaagg gggatttacg acagttgaat tcttttggaa 135421 gtctctgctt gtaggccgat aagggaagca tggagaaagc ctctttctgc ttgtcttgat 135481 tctcagatac cttcagctca aaatagtcct ttgctacagt agcatattcc ggaccaataa 135541 aactttcttt acaaaaaaac atgtggtagg ctggcttggg ctgtagtttg ctgacctctg 135601 atacagagcc ctgtgagtga gacagagaca aacataattt attttggctc atataaaagc 135661 ctaagtgtga tataggattg ttcccactct tgggctgtta tcattttggt tatgtctgga 135721 taaatgggct tagctaattg aggcttaggg tattcatttg actaggtcta gcagctgttc 135781 tatttgcaaa ctttgtgagc agtttaacca catataatcg aaaccaataa aaacattgat 135841 ttaaaaattt atgatttggc cactcttctg cccagtttct tttcatcttt ttttttttaa 135901 tttaattttt atcatttggg acagggtctc actctgttgc ccaggctgga gtgcagtggc 135961 acaatcttgg ctcactgcaa cctctgcctc ccaagttcaa gtgatccttc tgcctcggcc 136021 tcccaggtag ctgggatttc aggtgcaagc caccacgccc agctaatttt tgtattttta 136081 gtagagacag ggttttacca tgttggccag gctggtctgg aactcctgac ctcaagtgat 136141 acacctgcct gagcctccca aagtgctgga ataacaggtg tgagccacgg tgcccagctc 136201 ctgcccagtt ttaaaggaaa taaatgagcc gggtgtggtg gctcacacct gccatcccag 136261 cactttgaga ggcagaggtg agcggattgc ttgagtccag gagtttgaga caagcctgga 136321 caacacggtg aaaccctgtc tatattaaac aaacaaacaa ataagggaaa taaatgagag 136381 ttgcctctat ttttaaggta atattgctac cactctaagg acgatgggtg tgttgtttta 136441 ttaattaatc ataaggtggc tttggcttct aaaatctatt gctactacct gctccatctg 136501 tttaccaaga aaactcaaat attactttct ttctttaact ctgttcagta cagggctcac 136561 ccaagtaaga ggagcatcac aaagacacca gggaaatgtt tagtgccagt atggcaaaat 136621 taagatttac ttctgccaga gtagtttcct tttgcagaca gaagaaactg cccacctcca 136681 
aaatatattg agttggtgtt ttaggagaga gcatggatgg tgcttgttta taatttggaa 136741 catgccttcc tagaatgttg tctatcttag agagtaatga gggttttctg ctatactggg 136801 gcattattca catggggttg agggtggatg acctgaaatc tgatctttcc tgtcttttac 136861 ttcttgtaca attagaactg tgatttgcac ttagagagcc ttctgattca tggaagtgct 136921 tatctatctt ctacccctgt atgatgcatc tgcttgtgga ccgcacccaa aagcaatttg 136981 cagaattaaa gtttttgaag actcataata aatcagttca attttatcaa gagaataaaa 137041 tttgattcat cagataactg cattgttatt atttgtggga tcctgataga cccagaggat 137101 attaaagcct gttcccctga ctaattttca tcaataccat tagctccgct tagggtaata 137161 tcttcagcta aggcatctat actagtcaat catttgtttg gcattatgga aactgccatt 137221 gttttgaatg atagtgtaat tcatgtacag tctcctgaac ttctgggtgc ttaaagctca 137281 gaattttgat tgatgtttaa tagttattac agaactcatc agagaaaatg aggccttggc 137341 atgattgagc ctgactgaag ctttcaaagc cactgcaagc ttaggaaagt tgccctataa 137401 attttttatg aaagggcagt gatcagttgt tccatttttt tcttaagttg gttaaattaa 137461 taaaaaagaa aaaatatgct gagacttaac cacgtatact aaacatacag ctatttaatt 137521 aagtgattaa agtggtctct ttgatgtgat acatgcaaga catttgagaa tttatctgac 137581 tctcctaaat agccaagtca gtggggatta aaataaatga gctttctgtc ttcctttctg 137641 tagatcagct taatagtaag tggctttgaa aattcaggag taaagcagtc aacctagcca 137701 atctttaaat cagttcagtc acatatataa aaagtagatg ccatttcaga gctcacagct 137761 aaataaagca gaattatgcc tgattctgtt cttcccacca gcattatcaa gttgtagtat 137821 acagtattga aaattctggg aaggtggggg tgatttgaga gtttaaatca tgttgtatat 137881 taggttttca atgttcctca tttttctgag ttcatttgtg atagctgttg tttctctctt 137941 caaataataa tctttacaaa ctctctgttg tctccatgga aattaattgt gatttttaaa 138001 gctctagcac taagacttta gggcagggag tcaactccca agcaactgag atctgtcgca 138061 tgtaaatggt agtaatagca aagagcaaag caaaaacaaa taaaaatctt gatagctaag 138121 ccgtgcgtgc acgtgcacac acatgcacac agatgatgct gttgagcatg gatcagtttg 138181 gaagcattgt taagcactta ctgtgtacta gcccctgtgt catgaattga ggcaactcag 138241 cgaataacct agctttattt aattatctgt gttggcttag taagttatgg cccttgggcc 138301 atatttggca ttctgcctgt 
ttttgtgtga accacaagtt atgaatggat ttcacatttt 138361 ttaaatagtt ttaaaaaatc agcttggcac aatggctcat gcctggaacc ccagtactta 138421 gggagggaga tgtgggagga ttgcttgagc ccaggagttt gagacctgtc tgggcaacgt 138481 agtgagacct cgttctccac aaaaaggaaa acaacaacaa caacaacaac aacacaaaag 138541 gacaaaaaaa cacaccaagt gtaaacaacc aaaagaagaa tatattatga tacatgaaag 138601 ttgtatgaga ttcaaatttc agtgtctata aagttttatt ggaacagagt cacagccatt 138661 catttatgtg ttgttctagc ttgtttctgc tttctcaggt acaatcacag agctgagtag 138721 ttgtgcagag gccatatgga ttgcaaagcc taaaatattt actatttggc actttacaga 138781 aaaagtttgg ttattcctga tatgggtaag gcaaacttca taattccttt attatctgaa 138841 tacttctccc aactctctat tttactttat tggtattaga actaaatgta tccatattct 138901 acagacgagg aacctgagga acagagagat tcagaaaatt cagctattgc taatacctta 138961 tgtgatgatg ataacttgtt atagaatggt tttcctcagc tttttaaata aaaatgcagt 139021 taagtaatcg ataatattac gtttgtatgt tctttcagca ttaaggagag tcctgagcct 139081 tgattttatg ttcagtatgt gtattagtct gttctcatgc tgctaataaa gacatacctg 139141 ggactgggta atttataaag aaagaggttt agttgactca cagttccaca tggctgggga 139201 ggcctcacaa tcatggcgga aggcaaagga agagcaaagt catatcttgc atagcggcag 139261 gcaagcaaga gaaagcatgt gcaggctggg cacagtgtct cacgcctgta atcccagcac 139321 tttgggaggc tgaggcaggt ggatcacctg aggtcaggag ttcgagacca gcctggccaa 139381 tatggtaaaa ccccatctcc actaaaaata caaaaaatta gctggacatg gtgtgggcac 139441 ctgacatccc agctacttgg gacgctgagg caggagaatc acttgaaccc aggaggtgga 139501 ggttgcagtg agctgagatt gtggcattgc acttcagcct gggcaacaag agtgaaactc 139561 cgtctcaaaa aaaaaaaaaa aagaaaagaa aaaagaaagc aagcatgtgc agaggaactc 139621 cccttcataa aaccatcaga tctcctgaga cttactcact atcaagagaa cagcacagga 139681 aagacccacc ctcatgaatc aattacctcc cactgggtct ctcccatgac gtgggaatta 139741 tgggagctac aatttgagat ttgggtgggg atacagccaa aacatatcag taagtatttg 139801 ctgattgatt gaaatagctt acataaaata tatagttttg tttgccacca agaattaatt 139861 gcattggaag ctgagtgtgg tggctcacac ctgtaatccc agcactttgg gaggctgagg 139921 tggacatatc atttgaggtc aggagttcga gaccagcctg 
gccaacatga tgaaactccg 139981 tctctactaa aaaatacaaa aattagccgg gaatggtggt gcatgcctgt aatcccagct 140041 actcggtggg aggctgaagc agaagatgct tgaacccgag aggtggaggt tgcagtgggc 140101 tgagattgtg ccactacact ccaacctggt tgacagagcg aggctccacc tcaaaaaaaa 140161 aaattaattg caatggattt tatatcttgc ttaattttcc agaatgtttt atatgttctc 140221 acaaaatatc tcattttctc cccttttttc tctcctgtgg taagatcagt taatattgaa 140281 atatacctta atgttagctg ggcatggtgg cacacgcctg tgatcccagc tacttggtag 140341 gctgaggcac aagaattact tgaacctggg acatggaggt tgcaatgagc cgagatcgtg 140401 ctactgcact ccagcctgga tgacaccgtg agactgtctc aagacaacaa caacaaaacc 140461 cagggaaacg taccttggtg tgaacagcac tgaatgatgt ttatgattgt atgttaatcc 140521 tcttatattt aaaggttaaa tgtaaaatat agccttttat attttctgat actcttgatg 140581 acattgatca tagtgggttt atcaaattat tcattatatg aattaatatt tattgagtat 140641 tggtcacttg ctatgttacc tgtggtaaca tggtgattag gtggaatagg ctattggttt 140701 aatttgaata gtattagaag attctctcag ctacaaaacc tctcatttaa ttgtatttat 140761 tcctttggct aataaactac tcttttaaat aaaagtattt ctccagagta aatatattag 140821 attgctatta gtgacttaca ttctattata aactagtttg aggccccagt ggcagctagt 140881 ggattagcaa ccttgaagaa ttaataattt tgatggatga gaagggaagg gggagatatc 140941 tttggagcac agagtattgc taaggaaagg ccaactaggg tacgatggta ttgcaagtgg 141001 tatgaacaca taggacctgc taggtgttga aaggaacaga taagagaaag agaaagagga 141061 ggaaggcaga aaccaagatg gcaagtgcta aaagcaacct gtatctttga tcttttctag 141121 ggttagattt ggtagtgttt cttgtgtctg ttttcagagg tgtgccaata gtggctctca 141181 aaagagtcct ccaacttttg gggggctgca aatagcctgg attcttttga ctcatatgca 141241 gagtctggct ttgctaggaa aatctcctac catgctccaa gaagcttgta catttgtctt 141301 ccattgataa gagggttgcc tcaatagctt ttctccctaa gcatttcttt tctcgatatt 141361 tttcctctca ccacaagtgt ggccttctga aactggacct atgactaaaa gagtgacatt 141421 atttctcagg ggtttcaggt gtttggttag gggtataaga tgacctggac tcttctggaa 141481 gacctccaaa cactacacat cacagttact ccaaatcctt ggaaaaagtc acgtttctac 141541 tttcttggtg tagtctacaa ctaacatttg ctcatcatgt actctggacc aagggaaaaa 141601 
agtgttttac atgcaatgtc atcctgaact cttgcagaca atgctatgaa ataggtacta 141661 atttttagta atttttttct caacatctga tgtaaaggag agtgtgttag gattacatta 141721 gctaatggaa aacatgacac ataatgcctc aaacaaataa gggctttatt ttctgtatgt 141781 catgaaggga aaagagggag gcagtccagg tctaatatgc tggccttcga atgccatcag 141841 cttttcatcc caccatcttc agcatgtaga ttttttcctc atatgtgttg ccttgtgatt 141901 actagatggc tgctcggctt ctggcattgc atcagcattc cagaaaggaa gaagaggaaa 141961 ggctaaagca gtagttctca gctggaggtg attttgccac caggggcaca tttggcaata 142021 tatggagata tttttggttg tcacaacttg cggagggggg catgggttgc tcctggtatc 142081 taatttgtag aagccaaaga tgctgctaaa catcctacaa tttataggac aaaccccaga 142141 gcaagaatta tctggcccaa agtgttgaga aaccctggac tcgaggcaga aagcatatgc 142201 tgatgaatct gtcccctttt aaaatggatc attggaagtg caacaagaag tttttcttaa 142261 atgtcattgg cccgaatctc atcctgtagc tgactccagt tggaaaggaa gctgggcaac 142321 acagctattt tgttgttgtt gttgttgttg ttgttgagtc cattgtcatc ctgagcagaa 142381 ttggagtcct gacagtaaga gaagaaaggg agaatagtta ttaggtagac agtaggcggt 142441 gtttatttca aaaagttaag tactctgcct aaggtcacag aggtagtgaa gatcagcagc 142501 tgcgtaaacc tcctctattg aggaacggat gtccagctgt caagtgcagc ttctctctac 142561 atgtctgtat gaataaagca tgaaaatgca tgcactgggt atggaaagcc ctggagtttt 142621 tgatcagcca caaacagcct aggaaatatt tcttgcagga tataatatgc aatctgttaa 142681 atatttatat cactgctctc tcagcattta cagtgggact tgcctttcct ttttgtccat 142741 gtaaccaata gggtaattaa tctcagacct gactcactaa tactttgccc ttcaagccat 142801 caaactattg gtgcttcact tctcccattt tgaaaaatta aaaagtgatg tgattactat 142861 ttgccttact tagggttcat tcaggaaacc agacactctt ctaagtatat caaccagcaa 142921 gacatttaat acagggaatt ggaggcttag gtaacctttc agagggctgg ggagtgaagg 142981 gcagagtctg tcaatgatct ctgctgcttg tacaccaaag aggtctttct caggagaatg 143041 cctggaagtt gctggaaaag ggctgtatct gctgtctgca gatgcccaca tatctgtata 143101 aggctaccaa cagcaaatga atggcatctc ctcttctacc tttggaattg cctatgagtg 143161 cttctcattg gcagtctaac ctggagccat gggatattgc aagatgtatt tctgaagctt 143221 ctcagggaaa aatgtagaat 
ggggtggggt aggatgagga ggaattgaca acgaagtctg 143281 tggcagaaaa aaacatttga atctgaatag aggaatacat atatagcata ttttaggaag 143341 aagaacactc tcttcagatg ttggcaatgt tgctactagt attattacct tattctaggt 143401 atgtatcttt gggaggtaaa tagggattct ttgtatgtgt tctgattatc ttcaagtcat 143461 gatgatgctc aataaataat aaaaaaggat tcaataacct gcatagatgg tgcagaaggt 143521 aatatattct gtgaggctct gtagtgcagt gggggtccca gagatgaggt ttaaaaaaca 143581 atgcataatt caccagcatg ctctggaaag accagtcttc agaaaacaat tgtggatgac 143641 tcagcttgtc cgttcgttcg ttcttttctt ttcttctttt cttttctctc tctctctttt 143701 tctcactttc tctccaaaac tatctggaat tctttacttg ctatagttat gcatatgact 143761 tatagcgagc tgataaaata tagtgcaaaa ttaaatacaa tgccggaaat aatagaataa 143821 tctcagaatg aggtaaccta tagatgtatg acatagtttg gtgttggggg tagagttgta 143881 acctctagtt atctgctatc tgggtcacat caagctttta gtgtgtgtaa ctcacagagt 143941 tttgaaggtg taatttgatc tgtgctatac ttgagtctgt gaaacagcaa aacccattat 144001 gactatgtcc tacagccctt gtaaaatcct aagtcttgag cttcaggatt ctgctgatat 144061 ttaatatctg aattatttca aggaaggtaa ggaaagtgta ctatcttact atgtttattt 144121 cttgagtgag gaaaattgaa agggactaaa ggaaagaact cgttgccaag agcagtgtcc 144181 agaggtcact ggttttgctt tgtggctttg ctggaataaa accaacgtgt ttggggttgc 144241 ctaggtctta tgttgggtca tctggaggca gatctggaga tgagaattcc tgtgtaagtg 144301 attcattaaa caaatactcc caagagaaat cagtaaggga atgggagaag caagacaggg 144361 agggatgaga gtccaatttc aggagacatc tcagaactag tctgatcctg cagggaactt 144421 tggaagtaca aattatacct cagagtttgt cctgaattga ggcaaggggg ctgggctttc 144481 atattgccgc ttagggctgc ttgggtggag gcagaggcgg aggcggaggt gcaggtgcag 144541 gaaaagtggg tagaaggcaa ataaaccccc aggaactagc tatttttgat tgcaggcaag 144601 gagctctagt agttcctagg gcaggccttc agaaagtctc aggtgttggc cattagaaaa 144661 gaaaaaacat agatgctgga gggatgtaca gaagttataa aagggatccc agggaatctg 144721 gtcagagccc tgacagtgta cttgtaagct tctcttttgc ttttatgaaa cccagaatct 144781 ggagaaggga gagagaaatt ttcacatgga actgtaaagt ttctctaaag caagggttga 144841 cgaacttttt ctagaaaggg ccagataaaa tatttgaggc 
tttgtgggcc acagggtgtc 144901 tgacccaatt attcaactct acaactgtac tgcagaagca gccatagttg acaggtaaac 144961 aaatgagtgt ggctatgttc caataaaact ttatttataa aaacaggggt agggtggctg 145021 gatttggccc agggaccata ttttgccaat cgctgctctt gaggttcttt atatgtgcct 145081 aagtttgttt ttcctttatt tattccctat ctgttataaa acacagagga taattagtac 145141 ctttgtctat ctaaatgatg gttatgacat tatgatgtat cttaaattat tctagatcac 145201 tgcctccaat ctttctggca tcctggccat ctttcacaat gttgcaaaga taatcttcct 145261 gaagcatgac tcatctcctt ttattcccat gctaaacata tttagctctc tcttagctgt 145321 aaaataaaat ataggcttgt taacatggca ttcaggaatc cctgtgatct gaccccaggc 145381 caattttcct gcattatttt ctgttctcca tcctcatgta cttatagtcc agctacatta 145441 aattactcgc cctctctcca atatgccata tcgtgtgtgt ttgctatctc cttgtagttt 145501 ttttcatttt tatgtattta aactgactat atactgtcaa aattttatat atatatatat 145561 gatgtacaac atgatgtttt gatacatgta tacattagaa aatgggtaaa ttaacatatc 145621 catcacctca tatatttatc atttttgtgg taagaacatt taaaatctac tcccttagcc 145681 attttctagt gtacaataca ttgatattaa ctatagtcac tatgttgtac aatacatctc 145741 ctaaacttat tcctcttatc taactgaaat tttgtatctt ttgaccaact ccctaacccc 145801 attcccccac ccctggtaac caccatccta ctgtactctg ttatgagttt gactgcttta 145861 aatttcgcat gaatgagatc atgcagtatt tgccttttgg tgcctggctt atttcactta 145921 gcatgaagtc ctacagattc atccacgttg tcacaaatga caggattttc ttctttttta 145981 aagctgaata gtattccatt gtgtatatat actacatttt atattccatt gtgtacatat 146041 actacatttt taatccattc ctctgttaat agacatttag gttaattcca tatcttggct 146101 attttgaata gtgcttcaat gaacatggag gcactctctt tagcgtactg atttcatttc 146161 ctttggatat atccccagaa gtaggattac tgctttgtat ggtagttaca tttttaattt 146221 ttagaggaac ctccatagtg tttttcatag tgactgtacc aaattatatt gccaccaaca 146281 gtgctcaaga gttccctttt ctccacaccc ttaccaacac ttgttatctc ttgtcttgtt 146341 gatagtcgcc attctaactg gtgataggta atatctcatt gtgtttttaa tttttatttc 146401 cctgatgagt aatgatgtgg atcattaaaa aaatctgttg acaatttgca tgtcttcttt 146461 tgagaaatgt attttcacgt tattttccct ttttaatttt ttaatttctt tttttttttt 146521 
tttttggaga cagggtcttg ttctgtcacc caggctggag tgcagtagtg tgatcatagt 146581 tcactgcagc ctctacctcc caggctcaag tgatcctcct gcctcaacct cccaagtaat 146641 tggaactata ggcaggtgcc accatgcccg gcttttttta aaatttattt ttgtagagac 146701 agggtcccgc tgtgttgccc aggctggtct tgaattcctg gcctcaagca atcctcctgc 146761 ctcagcctcc caaagctctg ggattacagg catgaacaac tgtgcccgac ctcctttccc 146821 cattttttaa ccaggtgatt tgctttcttg atattgagtt gagttttaaa tatattttgg 146881 atattaaccc tttatcagat gtatggtttg cacatatttt tgcctctgtc tttgcccctg 146941 tttcttctcc tggaatgttc ttctagctct acattttgaa atcctaacca ttctcttaag 147001 actcgattca gtcaccttct ccattaagca ttttctccat gcagaaacag tctctcctct 147061 ggactttgat aacaatattg tctatttcta ttaaattttt tattttgttt acattgtact 147121 agacagttat ttgctacatt cactgcattt ttagtatact gaggtctctc aatatatatc 147181 tatatatttt tgcccttcac agtggcttgc atatagtaga tattcagtaa atggagtttt 147241 cagatgaagg gcgcatttct acatcttgtg attattaatg atttaagtgc tgaaactctg 147301 acattatttt tctttttcct ttttgtaggc gagagctgtt ttggcaaaag ttggctatcc 147361 agagtttata atgaatgata ctcatgttaa tgaagacctc aaagctgtaa gtgctaaatt 147421 tactgtactt ttttttttct ggcaagtttt actggccttg tgcctttcaa ctaacccaag 147481 tcctagcaat taaaactggt gatgccagaa aacaccaact tttgattaat tccttggcta 147541 gatatgagcc tgaatatgcc aggcagatca caagatttag ggcagattca aaataaagaa 147601 gtgtcagaaa cgatgcgtag ctgagtgggt accaaaaaac aaatggactt gggtttggcc 147661 taatttgatc ataaaagaaa gaattaagca aattgagtat gttacaatta gtgggcatgt 147721 tctaatggca aatgcatcat gtttgttgtt aaatactttt tatttgcttc attcgtatta 147781 aatgaatagt aataaatctg cctgctggga gttaacttca ttaaacttta tgagtgtttt 147841 ggcaaattaa atgcatatac atacgactct ataatttatt gtgaattgga ttgtgtattt 147901 gcagtaagtt actgatgaaa aatggtcata attctttttt atttcaatat ttttggacat 147961 gtattttcca gtaggtgtat taattacagg cctttctatt cacaagcact gttatttcct 148021 ctcctacaaa tagttctgat aagtgatttt tcttcctcat ttaaaaaaac ctgtttttac 148081 tattcactag attttcttga aaatgtagtc ttttgaaatt ctttaagtaa ctatagttct 148141 gaggtatagc tctttcctga 
aattgtggtt caacaagtgg catatctgaa aacgatccca 148201 gctttgtaga tgagcatcaa aattttacaa gactattcaa aagtgagaga aagcattaat 148261 tagaatcttc ctccattggg ctctaagata tagcctccca aaatattgat acagttgctc 148321 tggaagaaaa taatgatgcc tcatagcatc ctaaaactca gtctatctta ttatacactg 148381 ttttgatgat gctgaaaatc ccattattat tatcgtaact aaaaatattg aagcacttga 148441 catattcatg tacattatct tttaattcct tcagtgattc tgtgaaatag atgatgttca 148501 ttcccatttt gcagaaaagg acatagagac tcaataattt aagtgagctc ctcagttcca 148561 cacagacttt cagagccagg actgatatct gggctcagga agctctgcct gtaatcagac 148621 aagggattct tgagggtcct ttttaggcag ctatgctgga ggaacagagg catcagggtg 148681 gagacttggt gggggcagtg tgaccagggc tgttggaaga agagcagagg gaaggccatg 148741 ttatcaaatg gttaagagca caggctccag tatcatgggc ctgggtttga attctgcttc 148801 ttgtacttac tagttatttc atcctgtgta ggttacttaa cttcggttag cctccctttg 148861 gttatctggc tactggaaat ataataacca agtcttagta ttgttgggaa gattagatga 148921 gactgtataa atgtaaaagc acttggcata atactctggc tcagagtata gcctacttcc 148981 tataaagggt agctttgcgg atcaatatgg accatttgtg gtccattttc aaatcagcca 149041 tataaaatat tccatttcat tgatccttaa tagaaataga ctcaagcgtg aatttctacc 149101 aattttggaa atatatctgt atcagccagg aaagcaaaaa gcacagtagg tattttaaac 149161 agggaattta atacaggaaa tgggttggaa ggcaagaaaa gggaaaatag ataaaccaca 149221 ggttaaactg ctgttgctac taagaagagt atcatcaggg cacaaagtac cttagcagga 149281 gaacctgggt aactccatac tattggtata tattctgagt cagcatgtgg atgagaaggt 149341 ttgagagaca gtttgatgta gaggttataa atagggaccg tggagtcaaa gtgcctggag 149401 taaaaccccc agtctaccat gtaccacctt gttacctagg gtaggttatg tatcctctct 149461 gtgcctttct tcatctgaag aatgaaggtc ataccaatcc ctcccatata ggatttctgg 149521 gaggatcaaa taagttaaca tatttgaagt gtttagaatg aaccctaaca tacaatcagc 149581 acaatacatg tgttagctat gacatcctca attcacactt gtggtagtga actattttgg 149641 tttgaaaaag tcctcatgaa ttaactcatg cactgtccag gtccctcaaa ttgatttaaa 149701 agggtataca gatggagcag tctcaagctt taatcgaata aggtagaact taaggcttac 149761 tcacaatatt tagttgatca agttttgttg gcattctgtt 
gctaagttca gagccccaaa 149821 tttaaggacc cattgaatgc aatatctttt gtgaaaagaa accagatgcc aaacaggatg 149881 ccatgttggc tgtgcttctt tgtttctctg gggcctgaac tgaagacatt acacaccaag 149941 atttttaatc acttctttga agaaatggct ttcatctgac tcattgccaa agcagacaca 150001 attacgccca agagttttaa cagtttaaaa acctgacagt gaaaaaaatt ccaggtacat 150061 gaagtaaaaa gacatttgta ttcagaaacc gattgtttta gtcaatttac acggtgctat 150121 aaagaaatac ccgagactgg ataatttata gaggaaagag gtttaattga ctcacagttc 150181 cacacggctg gggaggcctc aggaaaccta caatcatggc ggaaagtgaa ggggaaacag 150241 gcaccttctt cacaaggcgg caggagggaa gagtgagagt tcaggaaaaa actgccactt 150301 ttaaaaccat cagatctcat gagaactcac tatcacaaga acagcatggg ggaaactgcc 150361 cccatgattc aatcacctcc ctcccttgac atatggggat tacaagtccc tcccttgaca 150421 cgtggggatt acaattcgag atgagatttg tgctctctca ctttctctcc ccctcaagcc 150481 ccgccctttt ctttttctga cagtgacttc agtccctcct ccaaacctca gctctggagt 150541 ggcaggcagt tctcctctac tctcctaagt gatggtacca tcacctcctc ccttctgtga 150601 ccctagacac aggagtcgca gctgtgtact cctgtgactc tcacagagcc aaatcatatc 150661 actgatagag aaaatgaata aaacccaccc taaaaatagg taaaggatga tatgatagat 150721 tgtaatattg tcactgaggg caatagcaaa aagactgaag tattcctggt acacaatgag 150781 actgttttca gttttaagaa acatgtagat aaggactgct ataggaccta tgagaaattt 150841 ctggtaatac gagctctgag catgagaggc tgttacaaat gaaaaacagc atgctgcatt 150901 tgagacaggt ggagatatga ttcatcaggc acaaatgcaa aaggaaagag gagcgagatt 150961 acctgatgga tcagacgaag gtcaaggaga aatgttgaaa attgaagttg ggctcatgtg 151021 gaaataaaag caaatgagat agaattaaga ggtaaatatg caaaggaatc agaaattttt 151081 taaggatgac aaggtaaact gaggtactca gcaaagcatt tccgaacaac aatcttgtag 151141 tttttgaaaa gaacaggaag aaataaaatg ccctttattt gaacccccat caacatgcag 151201 gaaagcctgt tagcctaaaa aggtaggcat gagcccattt tcaaatcagc ctaataaagt 151261 attccatttc attgattctt aatagaaata gattcaaaag tgagtctctc ccaattttgg 151321 aaatatttca gtattagcca ggaaagcaaa aagcacacta ggtattttaa acaaggaatt 151381 taatacatga aatatgttgg aaggcaagaa gagggaaaaa tgagataaac cacagattag 151441 
taactttaga aagcagctgc gactcctgtg tctagggtca cagaagggag gaggtgatgg 151501 taccgtcact taggagagta gaggagaact gccactccag agctgagctt tgcaggagga 151561 actaaagatt cttccagagt tattgtcaga aaaagaaaag ggagggagta gagggggaga 151621 gagagtgaga gacagcaaca cagcaccctt ttcccttctg caacccacta atttcctgta 151681 gtgcctccta tttgcaggaa gctaattggc aaaggagctt gagaaacagt tttcagattc 151741 ccagtcccca gcattacgga acagagtgaa agggtaggca tgtagctgaa aaactacagg 151801 taaataactg gcactattaa ggacgttact aaggaccaaa gtcagaatca tggctggcct 151861 ccatttactt aggagaattg gaaatgagaa aattttcaga ggccaaagtt tctggtgagt 151921 gatacacaca tatatagaca taaatgtgca tatagggata tgcagatatg tgaacctatg 151981 taagattctc tgtggtgggt ctggtctact ttggccagtg aaacactagt gtgtgtgtgt 152041 agtgtgtgtg tgtgtgtgtg tgtacatgac agagagtgcc actctgcaag agtgaggtga 152101 tattcagatg gaatgctaga atgctgcaga aaaagaggtt gttaaaatat taaaataatt 152161 ctttactatt tgttaaaaaa cacaaaacat tgcgctcagc tctttcaagg gacccctggc 152221 catcatggca ctggagagct aagggtgggc attggaggtg ggacaccctg gtgcccccca 152281 accttccagc caggcctctg taactctctc acattggcct tcatccagac tgttccactg 152341 gtcagagcag accaagacca gccccagtgt ggagatggga catctggttc cagtgctagt 152401 gtgctggctg gaactccctc agacccacct tcacatagct gtttgtagga attgcatcct 152461 ccatggtaca ctgtatatgt ggattattaa acattttgag catcacctca gtatatatta 152521 ggttggtgca aaagtaattg tggtttttgc cattactatt aatgccaaaa actgcaatta 152581 cttttgcacc aacataataa gtatgtctac atgtatagac acacattact tttgggcaaa 152641 gccatctaaa cacaccaatc ttgcatgttg ctttgttgtt tgaataaggg gacatgtaga 152701 tataaaatac ggattccatt aagtattcag aaaatccttt gggtgcctaa cataaagtct 152761 ttctaataga tatttggata tatctctgag tatctttaaa tggatttaaa ttaaatgaaa 152821 ggtatcctag aatttacact tcctactggc acagtaagga aagtaccaca attagtgctt 152881 cagtattttt catctgccta ttcctacatg taggccgaag actatactac agatcatttg 152941 cattcctatg tactgcctta taaacatgta cttatcctac agagaagctg aacacatttc 153001 cttgttctgc gcttattaca ggttctccat ctttggggtg gtggtggtgg taataacttt 153061 attcttccat ttagtctgtg 
gatctgggat aggatattca agaagagaaa tgtaggctct 153121 cctgaatttt atgatcggga cagctgttcc tgggaaatga cacaagaatc ccaattaatt 153181 ccccaaaggt cacgcaattg aaaagtcaag agatggaatt gattgaaagt gatcatcaac 153241 ttgacctaga ctttttccag taactgaaag gacaagaggc ttttgagcca agtattgtct 153301 cttgccaaat ttaatttttt cattttggag attacaagcg gctttaaaat attcatgcga 153361 attcattctt gtttctactt ctagaaaaaa tagaactttc tcaatagttg ctgggtttca 153421 attctttttc tcatccaagg gccttttgaa atatttaaaa ttatcaaaat attattacaa 153481 ttagcaagtc ttctaaaagt aaaaactgat cccactgata ttatttttta aaagttaata 153541 tttttgaagg aaattcttct gagaaaattt cccagaaagt gttctgagcc caagaacata 153601 tctgaatttt ctatcagtta taatcatatt ctgaaggttt tggcagaaaa tttgggaact 153661 agagaaaacg cactcataat attgctacat aatatatact tctgttacag tttattaatt 153721 tgtttttctc tatttctgta ctgccgtaat tgatgtttac atataatttg gtacaccaac 153781 ctttacagta acatattgtt ttgctactgt ctttatagcc tttgttacca tgattttaac 153841 tgctgtatag tatttcaagt aagttgtatc atgattgatt taatcactgc ttctactccc 153901 ttcctcaatc tcactttgtc cttgaatcac tggttctctt attaagacca acgaaaataa 153961 cattggagtc agagccaaat gacttggacg gtgtagcttt gccaaattcc gtaatctcct 154021 tagccctgtt tcccatctat catatgtgaa taacagtgac tggcccctgc ctacccccaa 154081 acacttaacc tgtctcaagg aaaaacgtaa gacctgtcct ctattacttg tgatttccat 154141 taaataaatg gaatagatca attaaaccag gctaagaaag aagaaatatt ctcaacttaa 154201 taatgaaaac aatttttaaa ttatctttct aaacataaat aagctatggc aaattgtaat 154261 taattacctc tgagttttgt ttcccccaat tattttcagg gttctggtag gtccttactt 154321 atctaatgta tttttagact tgggaccctt ttcttcttta caacgttttt ttttttttta 154381 atttttaggc agagtctccc tctgttgccc aggctggagt gcagtggcac aatctcggct 154441 cactgcaacc tccacctccc aagttcaagt gattctcctg cctcagcctc ccgagtagct 154501 gggattacag gcatgtgcca ccacacctgg ttaattttca tatttttaga agagacggga 154561 tttcaccatg ttggccaggc tggcctcgaa ctcctgacct caagttatct gccagcctcg 154621 gcctcccaaa gtgttgggat tacaggcatg agtcactgtg cccagccctt tagaaagttt 154681 tgagatggct ttctgtttaa tgtctgtaga ccaacctgaa 
tatcaaggga tttttttttt 154741 tcagatggaa cagtaatgaa agtatcacaa ttagcaagtt ttctaaaagt aaaaactgac 154801 cccactgata ttatttttaa aaagttaata tttttgaagg aaattcctct gagagaattt 154861 cccagaatgt gttctgaacc caagaacata cctgaatttt ctatcagtta taatcatatt 154921 ctgaaggttt tggcagagct gtcttggaaa caaaagctga aaaaggcaaa gggttgaaca 154981 gtggacacat cttttaacca tatgatctgg gaatgcctta ggattgctag taaggttgct 155041 ttctataaca aggtgcttac tctgtttctt ctgaggctaa aagaaaacct ttatcttaaa 155101 atataatcaa ttataagatt ttggccagcg cggtggccca cacctgtaat cccagcactt 155161 tgggaggcca aggtgggcgg atcacttgag tccaggaatt caaggccagc ctggccaaca 155221 tggcaagacc gcatctctac taaaaatata aaaattagct gggcatggtg gtgcatgtct 155281 gtaatcccag ctactcggga ggctgaggca caagaatcgc atgaacctgg gaggtagagg 155341 ttgcagtgag ctgagattgc actgctgcac tctagcctgg gtgacagagc gagactgtct 155401 caaaaaaaaa aaaaaaaaaa tatatatata tatatatata tatgtatgta tatatgattt 155461 ctttttatta tgaagcattt caaatgttgg aacatgagta aaataatttg aacacatcta 155521 ttttaacaca tagtaaacat attgccatat tagctttgta catgcacaca cacattttaa 155581 atgaacagat gcatttgaaa ttttgtctgt aatcttcctt gatctaatct ccttcctctg 155641 cgtctatctc catctctgca tataaacact accttgagtt cagtcttatt attgttattc 155701 ccatgccttt ctttatgctt ttgctacata ggtacgcatc catcaacagc atacagtgtt 155761 gttttgcatg tttaaaaact ttaaatttcc tcatagatac aacttacctt attcactctg 155821 cataattttt aagattattt catgttcatg catacagaac aatcatgtaa actgttgtat 155881 tgccttccat tatgtgaata taccacaatt tgtttatcag cccctcttcc ctgagcaagc 155941 agtttgaggg acaatttaaa gtcatagtgt tataatttgt tggccaagca gaacacagag 156001 ggtgccagga atcaatactt ttatttctgc ctttctccat gtgccttcag ttattttact 156061 ccttccttta tattttggct tggtctcctg gaaagattgt taagatgact ccttaacaaa 156121 aacttctcca cggacatgag aacccactct gtgttaggga ggagctaaat gtgagccaga 156181 cattggtcaa caagtaggtg actgtcgagc ctattaccat caaattgcat atatacattc 156241 gtatctataa tgtaatgcct tgtgggtttc atatacacga gtattttact ttttgctttt 156301 aaaagactat taaagttata aaaattacag tttataagaa tttggttacc cttggttatc 156361 
tccttattca aatgtgtatt tttcttaatt tccatagtaa tatatttaaa aacccaagca 156421 ttggaggtca ctacaaacga ttgcatgaac agctccacag atcttattta caataaatag 156481 ctaggttagg tattgctagt tttgtagatg gagtgcctat agaatggaaa ccgtctgcca 156541 ttctccttcc aaaacaagga aaggaactaa aaaagcaatt atgtctgttt ccttgtctta 156601 atagacaaca ttgttgtgag ctgctaccag ctttctgtgt tcccatgcaa gctataatct 156661 atgatttgcc tctgtaattt cattaatatt tagtgaccag agactttttg agtgttaaca 156721 gatacattct gttaataatt attctggtga cttcattgtg cttatggtgg aaaataggtc 156781 tctgaatcac atttattgtt ttcctttacg aacaactacc aattatttga ctaaatcatt 156841 gtctataagc taaatgagtt ttcagccaac taaactgcat gttactgaaa aaacttacgt 156901 gacaaattaa caaagtacac attttaaaaa agatcaagga ctcgtgagcc aagaaatttc 156961 aaaccaaaga taatatggtt tataaaggtt gaagtagatt tttcaattag aaagtaaatg 157021 gaacgcattg taaatgactg cagcttctct tgttgtgttt ttactgtgat ccagaaatga 157081 ctatattgat ttttgaagtt tttacgttaa caaaatcact gagactaaat ttcatggctt 157141 tgtgacttct gggtcaactg ataacagctg agagaggctg tgactgatgc agcttctctg 157201 ctagttgctc cttcctatgc tgaagtattt ttttagagcc atcttttatc tttacttaga 157261 acaatgatgt tgtggtttgt tttattcaga tcaagttttc agaagccgac tactttggca 157321 acgtcctaca aactcgcaag tatttagcac agtctgattt cttctggcta agaaaagccg 157381 ttccaaaaac agagtgagta ttaaacaaaa aaagttaaat agataaatac attggtgaga 157441 agcggagtct ctttagatta aactttgtaa ttcccctcaa gtcagagtag ctggcttgcc 157501 agttaacttg ttttagcaaa atctgtgtca tgttttgcca actcttattt tttatattac 157561 gtgattcaga tcttaagtct aattacaggt cccccatttc tagcatcaac acatatcaag 157621 agaaggctta ttagaatgca aacattaaag ttatgtctag atctagttat gaatatatag 157681 agagggaata atttctctgg atttttacat tcatgtaata acttttgaag accttaaaaa 157741 agaataaata cttaaactat gattagaaat gtcaaagtaa atgtatagaa gcaactagaa 157801 aatatttgtt gaggtaatgc ctatcaaaga actgatagct atattgtttt tatctaatag 157861 cagttatgat ttatttgcct tcctctcagc tgcctatagg agtttttcct acagatagga 157921 gagtttcctt aggaactttt aggcccagag cccaccagct acgatttcat cccttagcaa 157981 aatgtcagtg atgaatgtag 
cagctaacaa ttattatttg tttacattgt accagaaagt 158041 gctttacaat ttttaattca ttttatcctc atacgaattc tgtgtggtat aggtactgtt 158101 atccccattt tacaggaaaa taaaaggctt acaactgaga aattcttcag cttaggtttg 158161 gatcacattt tgttctgtct gctggtgctt ttgattcctg ggactggatt ctgttctgct 158221 aaggcccact tctccaatgc ttttttttct ctctgttctc cagattggat aatttctatt 158281 ggtctggatt tcaactccag aatttctttt tttaatttaa ttgtattatt atttttttaa 158341 cttatttttt atttcaatag attttggggg aacagatggt ggttggttac atgaataagt 158401 tctttagtgg tgatttctga gatttcagtg cacccatcac acgagcagtg tacatgtata 158461 ctgtatcagt gtgtagtctt ttattcctca cccccctccc actctttccc cctaagtctc 158521 taaagtccac tgtatcattc ttatgccttt gtgtcctcat agcttagctc cctcttatga 158581 gtaagaacat acaatgtgtg gttttccatt cctaagttac ataggataat ggtctccaat 158641 tccatccagg ttgctgtgaa tgccattatt tcattccttt ttatggctga caacttcgga 158701 atttccgttt gttttttgta gtttctattt ctctgttgtg attctttgtt aagtcattgt 158761 catcatattt tctattaatt cttcaaataa agtttatttt aattctttga agtcatctat 158821 aatagtcact ttcaagtctt tgctaaatct aatatgtggg ctcaaactaa ctagagttag 158881 tttctattga ttgccttttt attaaattaa ttaatttttt tttacagttt ttcctttgct 158941 taacattttt tgttgttgtt gctaatgaca gtttagatag catgttgtag caactctgga 159001 ttctgattta tttttctgag gctttttttt tttaaaatca aaagtaactg tcctaactta 159061 tactgccgaa tagatcttct ctgtggtatg tagctgtcga tgtctccact cagtttttta 159121 actcttattt gtagttatta gcctcacttc ctgctggttg tccctgtgtc tacatagctt 159181 agtgctcaat cagtgattag gtcagagctt gtgcttaaac accttgagtc catagggctt 159241 ctaccctctg ccagttgctc tgtgtgtggg ttgggagcac cttcaaagtt atagccagtt 159301 ctcagtccac cttcaatttc accttccacc aggccctcac tggtcttccc tgcacatgtg 159361 cctagtttcc cagtcagcca gggatgtgtg gagtgcttat ttggcccttc tattactatt 159421 tcagttatag gatctctctt ttaaatttct gactgggttt tcctcgttag ttttctacta 159481 agttggccat tcaaaaatta aaaaaaaaat gttgtgggta tgtagtaggt gtacgtattt 159541 atgaggtaca tgagatattt caatacaggc ataaaatgtg taataatcac atcagagtaa 159601 atggggtatc catagcctca agcatttatc ctttctttat 
gttacagaca atccaattat 159661 acagttttag ttatttttaa atctataaaa agttgttgac tgcagtcacc ctgttatgct 159721 atcaaataga tcttattcat tctatctaag tatatttttg tacccattaa ccatcccctc 159781 gacccagtac ccttctcagc ctcattctac tctctatctc catgggttca attgttttca 159841 tttttagctc ccacaaataa gtgagaacat gtgaagtttt ctttctgggg ctggcttatt 159901 tcacttagta ttgtctagtt tcatccatgt agttgtaaat gacaggattt cattcttttt 159961 tatggcttaa ttgtactcca ttgtatatat ataccacatt ttctttatcc atttgtctgt 160021 tgatgaacac ataggttgct tccaaatctt ggctattgtg aacagttcta caataaacac 160081 gggagtgcag atgttccttc gatgtactga ttttctttct tttaggtata tacccagcag 160141 tgggattggt ggactgtaag gtagctctat ttttagttgt tcgaggatcc tccaaactgt 160201 tctccatagt ggttgtacta atttacattc ccagtaacaa agtacaaggg ttctcttttc 160261 tcaacaacct caccagaatt tgtttttgcc tgtcttttga ataaaagcca ttttaactgg 160321 ggtaagatga tatctcattg taatttttga tttgcatttc tctgataatc attcatgttg 160381 tgcacctttt catatacctg tttgcctttt gtatgccttc atttgagaaa tgtctattca 160441 gatctttgtg tctatgtgtc tgtttttatg ccaatatcat gccattttgg ttactatagc 160501 tctgtagtat aatttgaagt taagtaatgt gattcttcca gttttgttgt ttttgctcag 160561 gataacattg gctattctgg atcttttgtg gttccatata aattttagga atttttttct 160621 atttctttga agaatgtcat tgatattttg atggggatta cagtgaatct gtagattgct 160681 tttggtagta tgggcgtttt cacagtattg attcttctaa tccattaaca tggagtatct 160741 acttttttgt gtcctcttca atttctcgca tcaatgtttt tatagttttc attgtggaga 160801 cctttcactt ctttagttaa gtttattcct aggtatttta ttttattcgt agctattgta 160861 aatgagatta tcttcttgac ttcttttaca gatcgctttc tcttagaata tagaaatgct 160921 actgatattt gtatgttgat tttgtatcct gtaactttac tcagtttatc attttaatag 160981 tttttttggt agaaccttta ggtttttcca aatataagat catattgtct acaaacaagg 161041 gtaatttgac ttctttcttt ctaatttgga tgccccttat ttccttctgt tgtctgattg 161101 ctctagctag gacttctagt actatgttga ataatagtgg taaaagtgga catccttgtc 161161 ttgtctagat cttagaggaa aggctttcag tttttcccca ttcagtgtga tactaactgt 161221 ggctctgtca tatatgacta ttactgtgtt taggtatatt ccttctatat ccggtttttc 161281 
gagagttttt atcctgaaac gatgttgaat tttattaaat gcttctttag catcagttta 161341 catgatcata tgccccgtgt ttctttccac tgtgatagga cagcactgag ttccagtgca 161401 aagtcccaca atcaccgtac tctccttccc ccaagcgcag aggtttttct ctctgtgccc 161461 atgtggctgc tgctggggaa tgagggaagg acggtatagg caattcaaga ctggctttcc 161521 taccctcttc agtgcctctt tccttgatgt gatgttaaaa ccaggtattg tgattgttca 161581 cctggatttt ggttcttatg aagatgcttt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgg 161641 ctagttgttc aatttggtgt tcctgcaggg aggacacttg ctggaggttt ctacagcctt 161701 cttgctctgc ttcatcctct atgttggcca tttttatctg acaaagctgt ggcttttcac 161761 tgccaccccc agttgagtct acccactggg gcagcaaagc tgctgatttc ccagccagcc 161821 tttttctgac aaaactaagg tctcactgag ctaggaggga caggagtagt ctcaggcaaa 161881 aaggccatag tgcctgctct acttaatgcc aagttctagc agtttttaaa tattaaattt 161941 tgtttaattt attgctcccc cttgatgaat tcccagattc ctaaaatggt tatttgttga 162001 caatgttgtc cagttttgta cttggttttt gtagaaatga tttgtcaacc ttctcatgcc 162061 attatacctg caagcctggc ttccatggat tacaattttt tttttcatgg cccacttttt 162121 aaccttggct aatattctga tttctagatc tttgactggc tgtttttatg ctgcctagtc 162181 ttttgctact ctccctttca agtccctgtt ctgtcctggc ttctagaaac tctgagcccc 162241 tgtgctttcc cttggcttaa gtggtgatta ccacttctcc ctcttatgtt tccatccggc 162301 catccctgct ctctgccctt caaaccttcc tataatatgc acctgaaaaa ttcattggag 162361 gttactgact ttaagaggta ggcctaaagc tgaatctctc ccctacctaa ccctgactcc 162421 agtcaaaagc acaaaggaaa acagctttag aactgctttg aaaaatgata gataatggtt 162481 taaagccatt tgctccactg tctttggttc tttcctctca tcacgttatt tctcagtcca 162541 cctatcctgt ttcattccca gtctaaatta ttcaatagtt ttagatttag gcttcttaaa 162601 tccaatactc aaatctgatg actttggcaa atgatttcca gtatccctat gaccatctac 162661 atgtaatacg gtagtcacag atctatataa tcttgccctt caatatttgc ttcagatcag 162721 ttccaactct tttttaaaaa aaaaaatctg tggaaaattg aagggtttca ccaatttttt 162781 ggtgaatttt cagggtcttc tcaggggctc ctgcatattg gcaatacagt cagtctaatt 162841 gtttgatttc aaaatgtcat tattcattta gttgtgtaga aaggcctgga ttagtaattg 162901 aaaaactcct aaatgtgtta 
tctttaacat cgtcctagga tactgatgaa ctttcagatt 162961 ctaatggtct taactatttc taatatttta ttcatgcctt agaaatatac tggctttgca 163021 ggcttaggca aactgagatt tgtgtggttt tcttttctgt cattgaaaaa caaatagctc 163081 tttcacaagg gtattaaatt ggttaacttt tgggaaaagt tctccagaaa tggaaatcag 163141 tgcaatgctt gtaattcttg tttcctcaga ctatatgtct atttcaggga aataacattt 163201 agaaatttaa ggtcttcttt gcatagtaaa gggatatgag taaattcctt ttccttcttg 163261 tttttttttt ttaaaggaaa catctggctg gaccttttca ggcattgtag agacttttag 163321 cctcatgata ctcattggct tgaatggaca tctttcaggt ctgaaaattc atgagttaat 163381 cgagtgggga tcattctgga gtagctctac tacactagtt tactttgaac agcatcatga 163441 ttatattttc acaaagccct gaaataacgt tttgcttctt tgtgggtgta ggagataaag 163501 accctttatc ttttattctc atggaatcca ttttagctgc actaatgtgc attaagaagt 163561 ttctaaggcg atgggaaatg tcagcaagaa tgaaattaat tgggtggtca tccctgaagg 163621 tacatattta ggggttggtt cacagaattg tcacttcatc agcttgctta aaaatagagg 163681 atttacttac tgctcggctt gagttgaaag taatttcttt ccctcaggat caccttcttt 163741 gacagaaatt attttagccc gcctcacttt taactcactc tcattctact gagaccaaac 163801 acgtctgaat tctctatctt ttgaaaaata tgtaaaaatg cattaaatgt ctttataaat 163861 tatgttcctg cccagaactt ttgtttgcaa aagcattatc tatagcatgg aatgtgagtg 163921 atataatctg caggatatta ataagaaagc aataaataca ctattgcttt tccattccat 163981 attccctaga ctatctgtag gagttgataa gaaaaatgta aaactttagc ctgatgtata 164041 tctattttat atggtccaaa gaaataggtg taccaatgtc tttttcttca gattcttgta 164101 ttttggtatt atattccttt tatgtatctc attctccagt ataatttcat ccccattatt 164161 gtgggatttg aaatttctat gagacctctc cttaccttat ccaccaaatc ctttctcaaa 164221 cacagactgc agaagcaccc taaagacata gtaagaagaa tttaaagcac ttgaatgata 164281 ttctctgttt attgaagtgg atgtatacat gcaataaatt cattttaatg gatttagtga 164341 tcttctgttc agcacttacc atttggacaa cagtcaaagc ccataagcaa cttaaaatct 164401 tgttgggaca taagctgtgc aaactcagat gccttcaggg gccagtcagt taacaaatgg 164461 ttgacctaca tgggtgtaaa caacagggag tggtagggac tgtagaaaac tagtaattac 164521 acacaccact tgaaggaatt ccaattcaat agtttgttaa 
gccattgtgt tgggcaagca 164581 aaacagtttt ctgggaagat tcagcccatt gggttggact tagtctgtgg atagccaact 164641 tttcagcatt ccatttatct attgctgtgt gataaacctc ccccaaattt agagtactga 164701 aacaacagtc attatttttc tctctcatgg ttctgggggt cactaggctc agctgcatgg 164761 ttctcgtttg tggttttttt tttttttttt ggacacagag tctcactctg tcaccaggct 164821 ggagtacatt ggcgtgatct cggctcactg caacctccgt ctcccaggtt caagcgattc 164881 tcctgcctca gcctcctgag tagctgagat tacaggtgcc cgccaccacg ctgagctaat 164941 ttttgtagtt ttagtacaga cagcatttca ccatgttggc caggctggtt tcgaactcct 165001 gacttcaggt gatctgccca cctcagcctc ccaaagtgct gggattacag gcatgagcca 165061 ctgcacctga ccatttgtgg tctttcatgt agttacagtc agatagtggc tggggctaga 165121 gtcatatgga aggctacctc tctcacatgt ctaatggttg atgctgactg ttgactagaa 165181 catctacata tggcctcttc atgtatgttg ggtttcctta cagcatgttg attgatatag 165241 aaagttcaaa gagcagcttt tctgagaaag aagaagggag ggagggtgtg tttgtatgca 165301 tcagggggaa gttgtatcac ctttaattac ctagccttgg aagtcacact gtcatttctg 165361 ctgcgtatta ttctttagaa gtgagtcact aaggctaacc cataatcaag gggagaggaa 165421 tttagattcc atctcttaat gggaggagtg taaaaaagtt cacaaacaca tattaaaact 165481 agcacatgta gccttacatt agagaatagt aaaactaaat atataataaa cttggatgta 165541 atacgggaga tatacaaaat gtttaacagc tggtttggca tgggccctga acaaatagaa 165601 gaaataacag catcaacaac tggtaggatc tactggtgca gctgcttaat atcagctctg 165661 ggtattatca agtctccata cgtgccctag agtcaataag agccctggga ggtcaggaat 165721 ggtaagatca tgtgtctgaa ataagatgtt ttcgttattc ttgttgaagc ttcagaagcc 165781 ccattaaggt ctagtttcag agactggttg tttggtttgc tttgtctacc tttgtctacc 165841 agcagttagg aacattttcc ctgcatggcc cttacctgaa tattttaaaa gtttgttatt 165901 ttaaagagag gggagttggg aagaacaaaa agagaaggga caagctctag gaatacacgt 165961 tatcttttgt atgtattatg attcattcca gtgttattta gaagccaagg cacccacctg 166021 gattttcccc tcttgaccta caaatgttgg gttctttgga tagtgcatta gtttcctatt 166081 gctactgtaa caaactccta caacctaagc agcttaaaac agcacagatt tattatctca 166141 cagttctcta gctcagaagt ctagggtagc ttggttgttt tctctgcttt ggatttcata 166201 
aggctgaaat cagagtgtca gccggctggg ctcttatggg gaggttctgg gaataatccc 166261 cttccaaggg gattcaagtt gttggcagaa tccagttctt tctgattgta ggggtaaggt 166321 ctttacttcc ttgctggcta tcgcctggaa gccactcttt gttctcagag ggatttatgc 166381 agtccttgcc cattgcccct acatcttaga gccagcagcg tgtcaaatcc ttcttaccct 166441 tcctcctctc cgacttcctc ctctgttgca tctctctgat tccagccaaa caaggttgtc 166501 tgcttttaag ggcttgtgat tagattgagc ctacctgaat aatccaggat aatctctcta 166561 ttttaaggtc cttaagctta attacacctg caaagtcctt cttactatgt aatgtaacat 166621 gttcatagga tccagggatt aggacttggc cataatttta cctactacag gtagccaagt 166681 taggtcaata gaggtgcttt ccccttcact tgtttcctca ggaaagtagg ccaaccagca 166741 agtttatgtt aaacactagg gcttaattga aataccctaa atttgaagga aaagaatatg 166801 cattttatgg aaatatgtct aggtttcggt tttgaagaga gaagaggtgc tctttcttga 166861 tttccagtaa tgtctacagt ataatataat gcttaggcga actggtttga agtcagacag 166921 accaggcctc gtacaagcac ataatgctca taaatctcat tatcctcgtc tgtgaaatgg 166981 gctaataatt cctatgttgc catggggaaa aactgatgaa tagaaggtgc ttaggacata 167041 gtaagtgctc aatgaatgtt agctaaatat aatgataatg ataacagaat gacatattca 167101 tagacttctt tgtagctggg gagaaggtat gagtttacca actagtatca ataattgggt 167161 ttcctattag tttcatatta acctgatata aattaatttg ttagtctgac ataattatcc 167221 tattagacgt aaatgagtaa tacaattatt ctagaaaggg atttatacaa gttttttttt 167281 tctttttaat ttttttagag actggaactt ctgggctcag gcagtcctcc tgcctcagcc 167341 tcccaagtag ctggaactat atttgcatgc cactgctggg cctaggagtt tatttttagc 167401 aattttgctt aagatgaatt gcttggtttt aatcataaca agcagtcact gaaactcaca 167461 accatctaag tcagggtttg gccactatgg cctgtgggcc aaatctcacc tgctgcctgt 167521 ttttgtgtgg cccatgagct aagaataatt tttagagata aatgtttgca ggtggtttga 167581 tgatggggaa gccctaactt tgttctgttt tttgtttttg ttttttaatt aaaataactt 167641 ttgttggtac gtagtagttg taaatattta tggggtacat gagatatttt gatacaggcc 167701 tgcaatgcat aatcatcaca tcatggaaaa tgggagtatc catcccctca agcatttatc 167761 ctttgtgtta caaacaatcc gcttatatta ttatagttat cttaaaatgt acaattacta 167821 ttgactatag tcaccctgtt 
gtgctatcaa atactaggtc ttattcattc tttctaacta 167881 tacatttttt gtacccatta cccatcccca cttctccccc attcccctac cgcccttccc 167941 aacctctggt aaccatcctt ctactctcta tctccacgag ttcaatcgtt ttggttttta 168001 gatcccacaa atacgtgaga tctcacaaat acgtgagaac atgcaatgtt tgtctttctg 168061 tgcctggctt atttcactta acataatgac ctccagttcc atccatgtgt tgcaaatgac 168121 aggatatcat tctttttatg gctgaatagt actccattgt gtatatgtac cacattttct 168181 ttatccattc acctgttggt ggacacttag gctgcttcca aatcttggct attgtgaaca 168241 gtgctgcaac aaacatggga gtgcagatat ctctttgatt attttctttc ctttgagtat 168301 atacccagca gtggggctgc tggatcatat ggtagctcca tttttagttc tttgaggaac 168361 ctccaaattg gagaagcata actttgaacc ccaattaagc aaaaggttat ctcatcccaa 168421 aataattccg ttattctcat taataggcat atattagaaa taattgtact cagctgggca 168481 tggtggctca cgcctgtagt cccagaactt tgggaggctg aagtgggtgg atcacttgag 168541 gtcaggagtt tgagaccagc ctggccaaca tgccgaaacc ctgtctctga aaaaaaaaag 168601 agaaataatt gtacttgatt actattgtta ttttttggat ttcatcaata aaatttgtgg 168661 acagtttgct cttaagtacc tacgtaatat ccttgatttt gcctctttgc ctgtaaagcc 168721 taaaatattt atgatctcac tctttacaga aaaaaatttg ctgactgctg atcccagtaa 168781 cctattgtgc atttaaggtt tgggtgtaga ctccttttag atagaaggga tcctttagat 168841 gtcctaatgc aatgccttaa ttttatagct aaaaccaaac cccaggaagg taggtgactc 168901 ttcccacaca attcagctag ttagtgatag aggcagtaca ctttccactg gagcaggcag 168961 accctctggg tggcatagtt ggactttatt catctttctt taaatggacc tctgtgttgt 169021 caccaatcca gaacttttaa atggacaggc cagaggtaat gcagcataga aacccaattg 169081 tccattcaac ccattgaata gggcctggca aatatatagt tgctgtgttt ccatttgcag 169141 ggacagccct ttagattcta gaatttctct ttaaaacttt ttcaaaagca ctggtgtgct 169201 gacttttttt ctgttaagtt agtaacaata aataactggg catgttaata tattttcatt 169261 gagccatgtt cagtttagga tgagattgtt cattgagcag tacagtagcc ccaaactgag 169321 ggaataattt cccctcatgt ccaacatccc cattgttcca tgttatctgc taataaaccc 169381 agccatgctg tgtttgtctt tgcttccctc ctgcctgtat aatgatgatt gctctctggc 169441 atttgtttct ttttctacag gtggtttaca aatccgacga 
ctgtcaatgc cttctacagt 169501 gcatccacca accagatccg tgagtacggg ttccttgtct ccttggtaac ctggtataaa 169561 atgcaaagaa cttttccatg actatgatgg agggtaagac atttcttatt tcccccaaaa 169621 gtggcagggt atgctggtga aggaaggttg ttctcctacc tatatgccaa attttcctca 169681 ctctcttcac cttaccagtt ttttttaaag ctttatgtca ccactgattg aaaaaaagtc 169741 aactacagta aaccaaatta aaccataatg aagtttgcca tgaaaaggag caagtttcca 169801 ctttcctttc ataatggcct tactgatttt gtccatgcca ttttattcat tcctaggatt 169861 ggtggggaaa acagcaggta actgagcaga ttgtagaacc tgagcggacc atctgtatgt 169921 aagaaagaag cttggaaata ttcttattaa cgctaggaag ttatgtcttg aggttttcta 169981 cttacatgta tcaggtatat gtaattaaaa ataccatgat aaatttaaaa aattattatt 170041 tttattattt tttaaagaca gagtctccct ctgtcaccta ggctggagtt cagtggtacg 170101 atcctggctc actgcaacct ctgcctccca ggttcaaatt attcttgtgc ctcagcctcc 170161 tgagtagcta ggagtacagg tgtgcaccac cacgcccagc taaatttttt ttatttgtat 170221 ttttagcaga gatggggttt tgacatgttg actaggctgg tctcaaactc ctggcctcaa 170281 gtgatctgcc caccttggcc ttccaaagtg ctgggattac aggcgtgagc cactgcacct 170341 ggccttaaaa aattattaag atctgaatgg cacaaataaa aagtctgaca gcttccagcc 170401 ttggcaataa ggacacatac tcactgctgg tggaaagata gatgggctca tccactttgc 170461 agggcaattt gacaatatct caaaaagctg aagttgtgca tgcataccta tgaccatgat 170521 gtgtgctcct ggaatgtgct ctaatgttgt gagtcttgaa atgtggtcct tggacctgca 170581 gtaatagcat catgggggat cttgttagaa atgcaaattc ccaggcccca ccttaggtct 170641 attgatcaaa ctttggatgt ggagctcaga aatctggtgc aataaaacct gcaggtgatt 170701 gtaatgtatg ctaaagtttg ggaacagctg ccctagagaa ctcttgcaca aggcataaag 170761 aagcccatgc aaaaatgctc ataacagtat tatttataat agtagaaaaa gcaaaataac 170821 ctacatattc ctctgcagga agatgagtca catgtggcat attcacacaa aggaatacca 170881 tacagcagtg aaaaatgagc aacatatatt aacatggatg aatttcacaa gtaaaacaac 170941 ttgggaaaat atatcaagta tgatactata taaataaagt tgaaaatcat gcagaacaat 171001 tctacacatg gtttggaggt ccatccattt gtagtgtaag tgtaaagaca ttcatgagaa 171061 tgacaaacac caaattccgg ttggtggtgg cttgtcttgg ggagggtggg ccactgttgc 171121 
tcaaatcctt gctggtttct cctcatctcc catatctctt aatgatggca tgccccaagg 171181 ctcagtctgc accttttctg ttctcttctc tccatctttg ctcacttgct tggtgataac 171241 atctaatctc atggttttaa tcacatttct acagtgttga atgcagggga tggaatttat 171301 cacccttgtt cataccacta ctatctccca cctggattat ggtaatagcc tcctcattgg 171361 cctccctgct tttgttctta ccccccttca gtctattcta aatataatat tgtagcgatt 171421 ttaagacgtt ttacaatcac atcgtgtcac ttctttgctc aaaagctttt atcagcatcc 171481 catctcactc agagaaagag ccaaattatt tataatggcc ttaaggtctg ttatctttct 171541 gcctgtacct cctgctactc tcccccttcc tccctccttc tgttccggcc acactggcct 171601 ccttcgtgtt tatagagcac acctgacacg ctctgctctg ggagcattta tacctggggt 171661 tggctttctc ttggaaaact tccccaagtg acctgcctgg tttatgctcc catttccttc 171721 aaatctttat tcaaaggcca ggccacctcc ttagggaggc atcctgcctt ccctctggcc 171781 agttggccag tctgccgtgg gtcctccata ctggccgttg ttcctacagg actctggtca 171841 gaggacagcc ggaaggctgt ggctgctcac taccagctgg cacatcttct tagagccgcc 171901 ctcgtgtggc cactgttagg aatggcagtt tctgcccgac ttcggtctga gagagaaatg 171961 aacagaaaga cgaaagacaa gcagatggcg ggattttttt atttatacat taatttacat 172021 aaccaagtaa gttgttaaat attttgaatg tcacttctta ctgtataaat attcattatc 172081 acgaaactac cattgctttc ttaaggccct aaatgccaac ttgagaacta agttgcatat 172141 gttatatggg aacttatgat atttgagtta ttaattataa ctaatactta taataattta 172201 ttattaatta taatcacatt taggacattc tattgggaaa tagaaattgt ttagattatt 172261 cacttaaaaa cgttttattg tctaatattt ggtacatgca aaagaacatg taacattttg 172321 gcacatataa gttataaaga atatatgaaa acatgacgaa attaatatgt gcctacgacg 172381 ccctcacccc atcaggaacc atccctccct ctcaccacag aagtaacact cttctgattt 172441 gagggtttat catttttgtt ttgttttttt aaaataaaac tttatttata taggtatccc 172501 taaagaatat gttgtttatt tttacttgtt tcttagtata ataaaaatgg tatcttactg 172561 tatgcaatct taaacaactg cgatttttca gtcaacatgt ttttaggatt tgttcgtgta 172621 gcatgtggtg gtaggttgtt ttcatgttca ctgctgtata atatttcatg actgtacccc 172681 cagattgttt attcttctgt tgtccagcct atgagtcccc acacccaggt ttttgcttat 172741 gaacagtgct gctctgcata 
ttcttgtgaa tgtctcttgg aacaagtgtc tgagggtata 172801 catccaggag tggaattatt ggctgatagg aagaagttat ctaccagata atatcaaatg 172861 ttttccaaag ctggtgttcc aaagtacaga agtgtttctg ttgttcctca ttttcatcaa 172921 ctcttggcat tggtagattt aattttcacc agtctagcga gtgtaagaga tcagctctga 172981 tgcaagaaaa gccgacgttt ccgcattcac tttaaaatgt cactgaaggc ttactgctcc 173041 ttggggtact taagtgtgtc taataatagt tccatctcca gggttcaccc ctgctcagcc 173101 tttttcttgc ctgggaaggg ctaatcctat cagaggttta ctgaggaaga aaccatttca 173161 gggtaaccca atggtttcat gaaagctcat gggcagaggg ttttttttaa aatgaacaat 173221 ttgaaacagt attttaatta agagattcaa gctgtttctg ttttcctaat gaccagtgct 173281 tgctctcatt gtaaattaca ttcccaggca agatgacaag gaatcctttg ggtgactttt 173341 ttttcccctc tctcctttct gattttctgg actacaaatt aagacaggtg gagtttcttc 173401 ttacacactt ccccaccggg tttcctggga cttcccccac cctacaactg ccttcttttt 173461 gcttcgctga agaagtgaaa aattgaggtg attctctaaa atgaacatct ttattgatta 173521 ggggctgatt ctgacagcat gtggccatct gtttaattgg gttttacatc taagatgtaa 173581 ctatacttag taacgtgatt tatgacaatt tcattcagtt cccattatca ttttctgatt 173641 tatgctactt aaatcttctt ctgcacgcaa tagagacttg aatagaaaac agaaaaacct 173701 gtagttgttt tttctttttc aaatccatgc ctttgtgtgg aaccctatga tttgtggcca 173761 tctttcaatg ttagattttt acagggtggt tacaacttct tatttttctt cccttgtcaa 173821 gctgggtagc aacgtctcta gacatggcat gtagtgtgaa attataatga tttaaatctg 173881 ctgtatgagg agttaaactc agtagctgtc ataaaattta attatatttg aataaggtgg 173941 ttaaagagga aaacacaaat ctgaaatgat tggtgcttaa ccctaacttt gtgataactg 174001 cgacactgat cttgctgctt ttcagagtgg caaggtgaga acatgaatca gagatctagt 174061 ttgtatggcc ttagtcgagg tttttatcct tccagacccc tattgtcctg tctgccacat 174121 gttgaggcta tctagcccat agggttttag gggaagattt aatgagacga atttgtaaag 174181 tgccagcaca ttgccaagct cagggcttgt atatctaaac ctggcatgcc aggaatccct 174241 gcgataacca ttgtgtcaca agacaaacag caaaatataa agatcggcag gggggattat 174301 ttcaaacttt tgccctcaag agatcttagg ccgggcatgg tggctcacgc ctgtaatccc 174361 agcactttgg gaggctgagg caggcggatc acctgaggtt 
gggagttcga gaccagcctg 174421 gccgacgtgg tgaaacctca tctctaccaa aaatataaaa aattagccag gcttggtggc 174481 aggcgcctgt aatcctagct gctcgggagg ctgaggcagg agaatcactt gaacccaaaa 174541 ggcggaggtt gcagtgagcc gagattgtgc cagtgcacta cagcctgggc gacagagtga 174601 gactctgtct caaaaaaaaa aaaagagatt ttattataat gttccacatt ataatatgta 174661 tgcatgcaac tgactaaatg ctgagtaaaa gaagccaggt gtaagactaa atactgttct 174721 gtttcatgtg tataaagttc aggaacaggc aaatacatct atatgataga agttaaaata 174781 atgactacct ttgttggggg tattgactga gaggaggcat gtgacacatt ctttatcttg 174841 atctcagatg tggttacaca ggtgatacaa gagtaaaaac tcaccgtgtt ctatcattaa 174901 aatttgtgtc tggctgggtg tggtggctca cgcttgtaat cccagcactt tgggaggcca 174961 gggtgggagg attgctttaa cccaggagtt cgagactagc ctgggccata tagcaagacc 175021 ctgtgtctac aaaaaacaac aaaaaattag ccaggtgtgg tggcatgcac ttgtagtccc 175081 agctactctg gaggctaagg tgggaggatt gcttgagccc aggggtttga gacagcagtg 175141 agctatgatt atgccactgt actccagcct gggtgacaga gcaagatcct gtctttaaca 175201 acaacaaaaa tgtatgcttt tactgtatgt taagtgacac ctcaaaataa ataggaacac 175261 tgctcctttt ctccataaaa atgtttggaa cagcgtatag tatgcatgga catctggagc 175321 taataatagg ttgcctcagg ctaagtctct agatgtgtag accagggatt ggcaaactat 175381 ttctgtaaat tgccagatag taatattttt ggctttgcaa gtcatagtgt ctttgttgca 175441 atttctcaac tctgccatgg cagccataga cagttcataa accaatgaac ctggctgtgt 175501 tccagtaaaa ctttatttac aaaacaggtg gcaatattca ggtagtggta gtggtgggaa 175561 gagtgaacag ctcatatttg gttttcattt ctactaacat ggattggtct ttagaagaaa 175621 attcatttta agtattcact atcaatgaat atttgaaaag cgcctcattt aagtatttgg 175681 tttttatttt catacatctc aaatcaaatt aaagaagtat cttaaacaga taagatcttg 175741 gcttgaaaac aagcatggcc taagggctaa aaccaatgat tttattatct ttgctgttat 175801 acacaaggga tttttttccc ctcagtttaa tcacagatat tctgctaaag aagataagcc 175861 tttgtccaga ggctgggctt atgggggcat cgaattatgg aattaacctg gaaatcaaac 175921 atagatttgt tttgctgtta aggaaataac tgtgtttcaa actgggagaa tttctcatac 175981 ttcactggct ctgcctgaaa ccataaagta aaagctaagt tagcaaatac tcacctgtgg 176041 
cttctcttgg aatttaatga aattaaatct ttgaatccac ttgatgttcc atctggggac 176101 atgagaatat aagactaata tatagaattt gggaatagtc ttccattgtt tttagagcta 176161 gaaagaaact taagtggtca cctcatccaa atttctcact acctaaaaga gatcctgaca 176221 ctcagagcaa ttcttagtaa ttttttttcc acttattcac ttgattttca agtgaatttt 176281 caaggcccca atgattttca cttcccggta ttcattcccc tgtctagtcc cctgccacac 176341 tgagtaaggc ttaccactgt aactaatagg atgctatgga aattctagat tatgacttct 176401 gaggctgggt cataaaaaac attgtggctt ttgttttttg gataatgttc actctggggg 176461 aagccaacca ccacactgtg agggcattca gggagctcac acagagatcc acatgtgaag 176521 gaactggggc cccctgctca ttaggagtac catgtgagtg cgccatcttg gaagtagatc 176581 cccagtgcag ccaagctttc agatgaccgt ggcccaggct gacatcttgg ctgcaacctc 176641 acgagacact gatacagaac cacccaaata agctgtttcc aaactcctga cccagagaaa 176701 tggggagata gcaactatgt ctggggatga tttgttacac agcaagaggt aactaatata 176761 gctattagag catgcccagg tactaactga gtttggtagg agggtatata actttgtgtg 176821 gtttagtggt taagggtgtg agctctgaag tatgacagtt tgattttgaa tcctggcttg 176881 gccactaata acttgcaacc ttgggcaagt tacttaacat ctttaagcct cagtttaatt 176941 atctgtaaaa tggagctaat agtagtaacc atcctatagg gtttttgtga agattaaatg 177001 atactatata tttaatatct aatgttatct agtacatttt ataatctaat attagataca 177061 taatattaat atgtaatatt aaatattaat tatatctaat attaatatac atttatagtt 177121 atatattata aattatacat ataatatatt atataatatt tatataatat atatcaatat 177181 gtaatatata agtatatatt aaattatata tacttatata tatgttaata ttgtatatat 177241 aatatcattt aatcttaacc ctataggatg gttactacta ttatatatat atataaaatg 177301 cttggtatag agtgtggttc atagtaggca cccaataaac attagctatt agtccttgct 177361 tttgaggagc tgggagagag agacttgtta ccaggtcatt tagaagacat catggtgagt 177421 gtgcaacaga agggtgtgca gtgcattctg ggagccacag tggaggggca attaagcctg 177481 ggggagtcaa gattggattc cctatgaaac tgatgtttga gtggactttg gaaagaaaaa 177541 gaaatttctc tgggcacaga agtgagatta gggcatttag gaagaagcac tgttgtttgg 177601 tgcaaaggcg cagatatggc agaacatagc acaggaacag actgtgagat ggccacaaat 177661 gctggggcac agcttgcact 
ggtggaagag acagcaaagg gagtgatggg agatgataag 177721 tggaaggaag gctttgtggg tcatgtcaaa gaacaggcca gagggagcta gtatagtttt 177781 tttgcgggga ggcaaataac ttaagtttta cttctgtatt ggtctgtttt cacactgcta 177841 gtaaagacgt acccaagact gggtaattta taaaggaaag aggtttaatg gactcacagt 177901 tccacatggc tgggaggcct cacaattatg gcggacggcg aaggagcagc aaaggcatgt 177961 cttacatggt ggcaggcaag agagtgtatg caggggaact cccatttata aaaccatcag 178021 atctcatgag acttattcac tatcatgaga acagcatggg aacaacctgc ccctatgatt 178081 taattacctc ccactgggtc cctcctacaa catgtgggaa ttatgggagc tacaattcaa 178141 gatgagcttt gggtgggaac acagcctaac catatcaact tccttaatcc tagtattaaa 178201 gatcatggtc tggagaggga cagattgcag ccactaagac cagttaggag ttttgcaaat 178261 ggctcaagag aaggacgggg agggcctaaa cttaggcaat gacaggggtg aggaggagta 178321 ggaagtttat aaatcatatc ccaggagata attgtcaaat aactgcataa tgataataca 178381 gatagcacag tgatcctccc ttatccatgg ttttgctttt cattgtttca gttacctatg 178441 ggcaaccaag ttctgaaaat attcagtgga aaatgccaga agtaaacagt tcctaagttt 178501 taagtcgtgt gccattctga gtagcatgat gaaatctctt gctgtccagc tctgtgccac 178561 ctgggacgtg aatcatctct ttgtccagtg tatccatgtt gtatatgcta cttgcctgtt 178621 agtcacttag tagctgtctc agttaccaga tcaactgtca cagtgcttgt gcccaagtaa 178681 cccttatgtt acttaataat ggccccaaag cataagagta gtgatgctgg caattcaggt 178741 atgccaaaga gaagcttcct ttcagtgcta aggtgaaagt tctccactta aggaaattta 178801 aaaagaaatc atatactgag gttgctaaga tctacagtaa gaacgaatct tctatacatg 178861 aagttgtgga ggaaaaagaa atttgtgcat tgtgtgcgtg tgtgtattag tttggtacta 178921 tctgaggttt caggtgtcca ccaggggtct tggaacatat ctcccaagga caagagcggg 178981 gactactgta cttttctaac ttttccacta gggctttaca ggacttccat caaatggaga 179041 ttataactgt tctgtgcagc taggctttgc tgaggaacca ttaatatgcc aaccattatg 179101 ccaggtactt taatgtatgt tatctctaat cttcacaaca gttctataaa cttggtatta 179161 tgggctcatt ttacagaaga agaaactgaa gctcagagag gtgatgcaac ttgcaaaaga 179221 ctgcaaaact agcaaattgc agagctggga gtataaccca gcttatctga ttgcatcctg 179281 tttatcgtca cactacactg ttcaaaactg gataggaaat 
gtgagaatta cagtcacgtg 179341 ataggctggg tgcagtggct cacgcctgta atcccagcac tttgggaggc tgaggtgggt 179401 ggatcacttg aggccaggag tttaagacca gcctggccaa cgtggtgaaa ccctgtctct 179461 actgaaacta caaaaattag ctgagcatgg tggctggtgc ctgtaatccc agctactcgg 179521 gaggctgagg cacgagaatc acttgaaccc gggaggcgga ggttgcagtg agctgagatc 179581 gtgccactgt actccagtct gggtgacaga gtgagactct gtctcaaaaa taaaaataaa 179641 aaataaaata aaaaatagcc atgagatgat atacaggaac acactctgaa aaggttaagg 179701 gtatgatata aatgtgttag agtaatcatt attatatcac cccctcaccc atgctgctcg 179761 ttcactggtt gggggattgt ttctcttctg gtatggaaac aagatattat agaatgtaag 179821 gcagccagga ggtgagagtg agaagtttag atggtcccca agtcctactg atttcatgtc 179881 caaaagggtt gtggcatcaa gcagttcatt attgcctctt ctaacgatgc tgttctctgc 179941 aagctttttg gccaagcctg tgatctcctt gggtctttag cttatgggaa gagtggagga 180001 ggggttattc tgtaagctag cttaaaagga gtatttagat atactccccc aaaaggcttt 180061 actttccaaa gtttacactg agattttaaa agttttgtta agtgagctga agttaagtca 180121 tgaacaaact ttgaatagtt tgtgaaattg acagatgatg cacttagcaa gtctgtggaa 180181 cacctgctac aacttttttg tttttgttgt tgtgacagag tctcactctg tcacccaggc 180241 tggagtgcag tggcccgatc ttggcttact gcaaccttca cctcctgggt tcaagcgatt 180301 cccctgcctc agcctcctga gcagctggga ctacaggcgc tcaccaccac gcctggctat 180361 ttttttaaaa tatatatttt tattagagac aggagtttgc catgttggcc aggctggtct 180421 cgaactcttg acctcaagtg atccacctgc ctcggcctcc caaagttctg cgattacagg 180481 tgtgagccac catgcccaga cctgctacac cgattttaag gcattcttgt cctggtctag 180541 ttgaggaatt tggaggtcct gatacagacc cccagtttct gaagcctaac caaggaggca 180601 cctgagaatg gtgttctggg tagtatcctg ggaggacaaa actagcttca tctggaatgc 180661 tctgcagtct catgggattc ctgttaccat ggtaagtccc agaaccaatg cctgtaagga 180721 tgcaatttgc cgcttgcagg gtatttctaa atgctgtttt cagacatagt gcactggtca 180781 ggttggctgc agggctggta gtaggaaagc atgaaatgat tatcagagtc aacctgatta 180841 aagacaggat gtaagcaagc cagggtgagg tgagcatagg gacctaaaaa ccagccatgg 180901 gaacaggaga caggatgtca gggagaaatc tcattaattg accttgatgg tcacagtctc 180961 
atttgtcttc tctttaaaaa atcaattgtt ttttaccctt atttacaaaa ggagttcata 181021 tttattgtag aaaatctaga aaaattttaa gtgaactaaa aatttaaaaa gttaatatca 181081 tttaaagata gtcatggtta atacctttgt tgaataccac tgagatgtat gtttatctac 181141 aaatatttta cagtttaaat taactagact catccaacag tactactttt tccccactga 181201 acattgtaac atgaatattt tttccattta ttacatattc ttccaagtca tcgagtgtag 181261 tgttcaattg agaaaacacc ttaatttatt taattacttc ccaatggaca tttaggctgt 181321 ttccctattg ttaaagaacc ctctgtatta ataagttttg tggctttctg tctaaatctg 181381 agaccatttc tttagagtaa attcctagaa gtggaggagt tgctgtatca aatggtatgc 181441 cccatttgag ggttttgaaa tgcatcttac ccaacttctc tcaagaaagc agtacaaatt 181501 tgtatttcca gtaacaacaa atgagatggt ttaaccttta atgagttgca tcttaaagat 181561 actgttggag gaaatgaata gagagactca gttttttgtc taactgatat agggaaaata 181621 tcagtcacat ttcattcttg agaccattat tgcataacca gatatggttc agtgaaccca 181681 cataagcatt tgttgagtac ctaccatgct cccaggcctg ggctgaatat ccctgctgct 181741 agtgtcagtt tcaaatactt agtacagcaa tcaataaatg agaggacctg ttggaatatg 181801 agtagaatat gagacaccga gctttatagt ggaaggaaat tagtgttttc tttgccagat 181861 gttaaactat agcagtggtt ctcaaagtgc ggtccctgga tcatcagcat cgcttgtgat 181921 agatggggat tatctggccc tacctcctga atcagagact ctgggatttg ggcccagcaa 181981 tttatgtttt aacaaggtct taaccaccaa acttctgcaa gatgccaaag attgaggacc 182041 gctgagctat aaaaaggcag caaatgcaga agaggtcaga cttgttgggt aaaggtgggg 182101 tgactaactt ctcctggttt gcctgggacc atcccagttt tagcacagaa agtctcgcat 182161 cccaggagcc ccttctgtcc cagctagaca gggatggttg gtcaccctgg taaagggtgg 182221 caggggtgaa gggaggttct ctttgttgca gcctaaaggt aattatggca gaaaaagaat 182281 agaagggcca gcagagaagg tttcaagatg gagggcagat ttcagaagct gctttctgct 182341 tgtcagctct aggctgatag atgtggtggc aggaggttac ctgcaggaag ccagcagatg 182401 gctaaggcat cttgctgaat cccttgacag gagtcacagc tgttagaact tgatactttc 182461 tggaaagggg aaggttggca ttcaccaaac tactgcaaga tgccaacttg aaaaaaaagc 182521 catgattaat cttagcagtg atgacaatga aatcatttta cagcaaaaac cataaccaaa 182581 agtgctttgt aattgcagca 
atttccatgt cctgactgat ggccccctag attctagaat 182641 ggtgtcgggc ttatctgatt acaagaacta atcctgaata agatgtaaca ttcattctta 182701 gagtaagcat tcatcccgga ttttctaaag caggtgcaat ttaaactctt tttgttcttt 182761 actacagtca ataatgtata aggcatgtaa agtttcagtt ttgaaagtat agtgtgcaaa 182821 attttaataa aagtaatcat acgttacatt gttacatgca tggtgttgat gacagaaaga 182881 caggagggct gatttataga tcagttaaac ataagaaatt ccttattttt cctaaaccaa 182941 caattctgaa cctgttttca tagtcgaatc acctggggag cttttaaaaa atgcttatgt 183001 tcagggttcc atcccagaga tggtgactta ataggtttga ggcaggggct tggacataga 183061 tattaaacaa gcaaaattcc ccaggtgatg tgcatttggg gttgagaata ctgtacttac 183121 cctgatataa accaatgggt cccaacttta gcatgtacca gaatagcctg aagtgctatt 183181 cagtaggtct agggaggtgg gggtgcaaac atttgcacct ctaaggagtg ccctggtgat 183241 gctgatgctg ctggtttgca gaccaagatt tgatcaccag tttcctaaac cattcctact 183301 ccaaacccca gataatcgta caaaccacat ctgtcaccca ttagctagac atgtgaataa 183361 ataactccct ctcaaaacac ttctgcacat ctactcccac gtatccagta tttcttttct 183421 ctactgagtt taaaacatat ttcaattaaa aagcatactt tagaaagctt ggtataggtt 183481 taccttcctc tttaggtctt gcaaaatagg attatttgtc gagtggtatc ctttctccct 183541 cactagattg taggttctct gaggacattt gtttgttcga aggttgccca tgtggcttag 183601 gacatcatag gtgctcagta aatgttagta gtcagtggtt atgcatagaa attgcatcat 183661 cataaccaaa cccacctatt cccagtagtt tttataatgg gatctaaata gactgacact 183721 ccaaaattag gatcataaaa cgtattcact tagagaagtc gaagctattg cacaggacaa 183781 gtaaaataat caaagaagtt agcagaaggt aactacaagg gaatttcacc aagaaggaca 183841 ataatgtaga taagatgata tatcagcaat agccctgaaa tgtggactga tacatagagt 183901 agtttagtaa tataagcaca cacttaaggc ttttgaaaaa aaaataactg tcacaacttg 183961 tactggagcc acattaatac atcaaaaaat ccatgcaaat gaccaaatat cccattgttt 184021 gacaaaacta ataatgcaac agaatgtatt gacatagtcc accccttaac aaaactatca 184081 caaacatcca cactgaaaga aaacagacac aggtacacac ttaatgaaaa aacccaagta 184141 tatttctgga gaaagcagct acagcaaata tggaaacaga tttagttcaa taaacagctg 184201 aatgtgtatg ctggtccaca catcctcaag cccattgaat 
tttcctttgg aattttccct 184261 tggtcacctt tttttggatt gaatttactt tgtcatttgc ccctggctct gtcacttctg 184321 tccagaggct gctgcaggga tcctaaatga acaaattcaa tgatgcttat tccttcagag 184381 gagtcaaaag agatgttgaa tttaagaatg aaatgctcct gaaggtagat gagttcaagt 184441 tgtcttgcat tgtgttacta cacttacaaa agtttgctat agagaaagga gaaaagcaca 184501 tggtctgggt ttctgaagac tggcagtgaa agttggaggc ttcctatttt gtaaaatctg 184561 accaaacatg gaaataacag gaaaagatgt ctgagtctgg ttcatagttg tctaaaatca 184621 tcagcattta taagttatta acacatcaat attaaaaatt aatattaaat tattaattta 184681 ttaatacttt attaatatta aattattatt attaatattt aaaaaacttc caaataacat 184741 acaattttgg aaggtaatgt tttaaaaata ttttctgtat gataactttt ctgaaccata 184801 cacttatgaa tatggaactt aaaattaatg ctgcagtaaa gtgaacgcaa caatgaatgt 184861 agggaaaatg gattaaatta gttcaatatt ctgcaggccg aaggcctttg atcaatattt 184921 tgtctaattt tctctttcca gcatatatca caaacataaa aaggaagaat attccacttt 184981 tattgggggt cagcagggag ggctaagagg atacaggagt gatggtattt gcataaaact 185041 ggctggacta tttagcttgt taatatctgt attgtgattg aaagtttatt ggaaaaatgt 185101 gctgatctca aaactacact agttcggcaa tgatagcaag agaatggaac caacaaattc 185161 tttcatctag aaggaaaatg aaaaccagaa cttgaataat gggatgtatc ttagatctgg 185221 tactgaatgt ggaaaattca agaagtaata cattggtact actaatagta tcatgtaagg 185281 ttttaaactt tgtctaacaa gatgtactca taatgatagg taatgcttca accccagtca 185341 tccctctgag tttcacaaga tacagaatca gttaattcaa caagtattta ttgagtgctt 185401 accatgaacc aggtattgtt ctagctctga agacatcacc attaaagaat agtaaaaaat 185461 aaaaatccct tttgtcatgg aacttgcatt ctagttgggg aaagacagaa aacgcataca 185521 aaataagtca aatgtaaact gtcttagatg gtggtaagtg ttacgatgag ttacaaagca 185581 agaaagtggg tagaataaca tcctgcccct taaaaaagca ctagggaagg ggttaatgaa 185641 aagatgatct tagcctacat acctgaagcg agccatacag atagtggcgg aaagtaggtt 185701 ctaggttagg agaagagagg cgtagggcaa aaggaaaatt attccctttt tgcatgtcct 185761 tctatacagt gtgaatttgc taggcctgaa tatatttatt tatttgtttt cttgaggtct 185821 gagctgtgga ataatgtgct aaatttgttt tcttctctat acttgtttgt actaaaaaaa 185881 
actctaatta atgcagctta caataatgtg tactgaaacc cagacacatt gaggcattta 185941 acttttctgg gaaagctgta gctaccgtca gggtatgatt tatgactata aaagtcaaat 186001 agttgagcct ggggtaatca cccactgaat gtgattggtt ctttggcaat tgtgtcctaa 186061 acaaaagcac tctgtctttt atggtcagca ttttgctaag ctccaactag gttactccat 186121 atttactgat aggagaattg taatgagaaa caccagtaaa aaggagaggt tgaccccagt 186181 agaaaaccaa aagcaggttc ttgtggcttt cccaggtgca gatttgaaaa agacagtatt 186241 actgataaat aattaggtta ttacaaagac caatattttc ttttgtcaga attttaatca 186301 ttgaagtggt ataaccacaa tttgtatttt tgcctgaaat aggagtggaa tttgctggtg 186361 ttaacaaatg caagaacaaa ataatactat ataagcttag ttaactgttt ttcaatagcc 186421 tttagaagaa ttttctgtaa tacacacaca cttctttttt tttgagatgc taaaagaaat 186481 tttaataatt gttacataac atactgtgta gctatttgat gaaatctacc tctgtgtaat 186541 aattattaaa gtaagtcttg aaatgtatga taagaatata ttagtttttt gggtacaatt 186601 tccatgaaaa aaataagaaa acaaataaat aactaaaata agaaatagga ggttttttat 186661 acatactgtt gtcatcatac aaaattactc tattaactaa atttacctga cattgtcacc 186721 gttaaaagca aaatcaagac aaaactaatt ctcaaattta tttaatttta gtaatatatt 186781 ttatttaacc ctatataccc aaaatatttt cattctaata tgtaatcaat atacaaaaat 186841 tattaaagag atgttttatg ttcttttttc atactaaggc ttgaaaatcc agggggtatt 186901 ttatttattt ttatttagtg tactttaagt tctagggtac atgtgcacaa tgtgcaggtt 186961 tgttacatag gtatacatgt gccgtgttgg tttgctgcac ccatcaactc gtcatttaca 187021 tttggtattt ctcctaatgc tatccctccc ctagcccccc accccccgac aggcccctgt 187081 gtacacacac acttatttct aaaagaatta attttaatta cacaagttta ttatctttgg 187141 ataatcaaaa ggtaaaacat tacgtttatg tgtagagccc tttttacgag ccaccacccc 187201 aaatcctgat tttcttcaaa aggtaaccac tgtgatcact ttgataggta ttttacagac 187261 ttttaaggaa tttacatatg tatacacata tacacacaat acatgtatat gctcacactg 187321 gtaggatagt tatgtaagaa gataaatggt atttatatgt ctatatctgt accttgcttt 187381 atgccaacaa catgtcttgg ggattatttc ccatgctgga attttaagac ctacctcttt 187441 cttttcactt actgcatagc agtccacaat atggttatac cataattaat tggcctctta 187501 tttggacatt tagcttgttg 
cacacttctc aactacaagc catgcagcaa tgaattctgt 187561 agtgcacttg tgcaagtgat caggcagtgg ttctcacggt ggcacccaga ccagccgcat 187621 ctgcatcgcc tgagaacttg ctagatgcaa attcttagcc ctaccctaga cttattgact 187681 cataaactga ggatggggcc cagcaatctg tagtttaaca agccctctgg tgaatctgat 187741 gcattataca atttgaaaac cactggcagg tatgtctgtt tctacagaca cacatgtacg 187801 gatacagtca gttctataat gtaacatatg tgttcttaaa caccactgtg ctatgcaaaa 187861 gtatgtaata aaaggcctcg gggcttttga gaaaaatgag gttggggcac aacagttaaa 187921 aacttcatca gtgacacgtt aaaaacaaaa gctgggagac caataaaaat gatagtgcgg 187981 ttttatacat gttaaatggg taaatacaca aacgctacag taaatatggc accttccctt 188041 gaaaagacca gaagtttgct tgtgtaagtt ggtattggag gggttgaccc ttgtgagttt 188101 ttgggtagtg gcggaaggag ggttacatga aattggaagg taagttttaa cagcagatgt 188161 ggtgagtgtg gctcctgccg cacgtggtga actgaggcag ttggtcatgt tgaggtgtgt 188221 gtgtgtgtgc attttgggta ttcctatatg gctcagtcca tttgggtgca tttttctgct 188281 tttccctagc atttcttgtg gataagcaaa tgcgaaattg gagttatgtc caaattttcc 188341 cccagtgtat caatttcctt ggaataaatt tgccttttcg atacaggtgt cacagctgta 188401 tgtatggatg cttattcttt tgcgtacatt acacaaaata actccataca ctgtactgaa 188461 tgctgttttt cacttaatat ttcttagagc tttttccata tcgccacata gttactcgtt 188521 ttttttttta atggcattta aagcattaag agatgactaa attgatctcc aaaatgcttg 188581 caacaaccta cactcccatc aacatgcttc ctacaccttg gccaatatta cctattatca 188641 catttattaa cttttgccaa ccactagtta aacgtctctc tccttcctct ccctccctct 188701 cctccctctc ctccctctcc ctctcttccc tctgctccct ctgctccctc tgctccctct 188761 gctccctctc ctccctctcc tccctctcct ccctctcctc cctctctctc tccctctccc 188821 tctccctctc ctccctctgt ctcctccccc acctccacag agacatttac actgtgagga 188881 atgcctcccg taagtacact gatagaaaaa ggaactttaa aagtgtgaaa caggtggcga 188941 ggtggtgttg aaatggtaaa aaaaaaaaaa aataaataaa taaaaataaa aaggaaataa 189001 aatggatatg aaagaataca taaaaaccta ttaaaaaggc agaaaaaagg aattcccttt 189061 ctgcttattt ttcacttcta cctttgctaa atacaactta tccactttat gagataagat 189121 attgagttga ccctgtagga ttccatctat cactctagct 
ggttgacatc tccaaaaatg 189181 cagcttacac tattaatcca ttcacttcag caaaacatgc ctaagggcat cagattcttc 189241 tgtttatcag tattgggttc cttggcccca acaagctcag cagatacagc tgaacagagt 189301 gatgtgtggt ggcttcaact cttgagtctg gggcttctaa cggcaaaagt gacagacata 189361 gaggctcatg caggtgtcag atgttttcct gtaaagaggt tggcaatgga cgtaaatgct 189421 tccagctggg cacattacca ggcaggataa gccataatga tttaataatg aaatcacatt 189481 ggtttatgat cattgttctt attaggaaat atcattgctt tttgatgatt tattatgtga 189541 aactgagttt tcatataaac actgtgattt gcatttattt gtaatatata tttgcatatg 189601 tatatatatc tgagtgagaa tgacttacta aaagtaatca ccaaaaaaag aggatggaat 189661 attgatgtaa atatgccaca aatatttaag gaaatgtaga aattatgaag attattatga 189721 tgttgccact tcaaaataat attagatgca ttttccttta tgtgaaactg acagagaact 189781 agtagctgga tccatctctg ttttgcaatt aggcttttgc acctaatcac aaactacgat 189841 attattaata aaatataaag aatatatata agggttatat agaaccacca tttcccttct 189901 aaattaagga cttgctttcc tgtcctcatt tctcctcatg tgttctgaca tggaggagag 189961 agagaaaaaa aaaagttaag aagagggagg aagaacattt tgacttatga aagagctttc 190021 agtctcatgg ggcctggaac agttgctctg gagcagtgaa ggacactaca gaaggcttct 190081 ctctctcttt ttggttatga ttctttgctc tatcagctgt gatgacttgt aagtacagtt 190141 cctaggatca acccagggtg caaaatttag tggtgagttg caagggcaaa gcaaagcatg 190201 tctcccactc atctgcactc aatgtcaggt acataatttg ggaagcccag tgcaaaataa 190261 aaatgtgggg tctcttgttc caaaagcagg aaaaaatgcc attgaggggg ctaaaatata 190321 aagccttgcc ctatctttca tgggctttct ctttcaactt gtcatggtat tgttttattt 190381 gctattcagt gctactataa gaaaaattaa agacgtaagt cattagcctg agtgttactt 190441 ttcatctcaa tattgtgcaa tgtcagtttt aaatgcaaac ataagagcat ttaacttgga 190501 tgtggaatca ccaaaattac atagtttgta ttttgtagct tgtatgtgcg tatgtatttt 190561 gttcttacca gcatggagga aatgctgaac aaaactcaac tgtttttatt ttacttctta 190621 atacatgtac attctaccaa tactctctac ctttggttta ctgatgaata aagaaggact 190681 aaaaggaaaa gaatctatag gttattgtat aagtctgttc tcacactgct aataaagaca 190741 tacctgagac tgggtaattt ataaaggtaa gaggtttaat ggacttacag ttccacatgg 190801 
ctgaggaggc ctcacaatta tcgtggaaga taaaagagga gcaaagtcat gttttacatg 190861 gcggcaggca agagagcttg tacaggggaa ctccccttta taaaaccatt ggatctcatg 190921 agacttattc actaccatga gaacagtatg ggggaaaccg cgcccatgat tcaattatct 190981 ccacctggcc ccacccttga cacatgggga ctattacaat tcaaggtgag atttgagtgg 191041 ggacacagcc aaaccatata attcaccctt tctctgtcct tttctgtcat cgttttcagt 191101 gtaggtgttt ggccaataca gggaagtaac aggtataaga aaggttatga caggattcct 191161 tggtcattca tgtttcttag aatgccacta ccttcttttg gtggttgaag caaattctgg 191221 tttgaatgga aagtatggcc tcttagggct atcagagctt actcagttct agatgtatcg 191281 tatttacctt gtatttgttt ggcatctcgc tgaactccca tgcactatgg gcctgatgga 191341 gtcctgtgct cctggggcat tgtgaatgct gtgtgtaaat ggggctccta ggaacaacag 191401 acacatattg tgcacatttt ctctgctcac acacacatgt tccattgtcc cctcatgctc 191461 cactcaaaaa atacaagttc aaagataaaa atcattaaga atttcaagat agtggcagga 191521 gaagattaaa ctaagtgcag gacccttctc tctgtggggc ctgtgtgact acacaggttg 191581 cacacccacg aagccggccc tttctatatc aggcaaagct gctggacagc ctgaagcctc 191641 atgagcatgc catgtatgaa ggagcccagc ttacagcagg ctggatattg ccaatagtgt 191701 accagtgcaa aatggtttcc ctgaaacctt taatacactt acttatttca aactcagttg 191761 aattgatatc attgagacag ctagatctct tagagggctc ccagtgccag gaggagtgcc 191821 tttcagatgg gtctttgaaa cctcagtgtt atagctagaa ccaggtactc atcattgaat 191881 caatctctct atatctctta acattttttc cttctcatag gatttccagc aggagagctc 191941 cagaagcctt tcttttgggg aacagaatat cctcggtgag taaatgagta cagaaaccag 192001 ttactgacca attaggaaga acatgttgct ttggtagagg acagaagaaa caaagtcaaa 192061 ggttattcag cagaaagaag ccatggaatc tggcgtgggt cgcctgtact cttatcattt 192121 taggtatttg ttttctgatc cttgttacca gggtgtatgg ctggacagga aagccatctt 192181 acacataggt ggctttcatt tttatgcatt tattctctta cccattaact catactacca 192241 gtatttggac attgaaggat tgttgggatt tgaattgatg ggatgagtct aaaaccattc 192301 caggtagaag aaacagtgtg agataaggag ggaaaagtgc ttctataatg gcagagtgta 192361 gtttggctag cccaaatggt atatgaaagg aagtagtgga aaagaagatt gaggtttgat 192421 attggagggc tctgcatggc 
aaataaggag cttggatact aataggcaat agaaaatatt 192481 gaactttttg agcaagttag taagataatc gaatttatct ttagaaatat aacctgtgta 192541 gctacagaat aagtcaaaat tggtagtgta tggacttagg gataccagtt taaaggtcat 192601 tttaatacca acagattgta gactcatgta attcagaatt tccatacata gtagagattc 192661 tgataatatt tgatgtcaat aatatcacca tttctattat atacttttta cttttgtgtt 192721 ttaatgcttg gaaaatttta gtcttgattc tatacattga gtctcttgtg ttttggtttg 192781 gatatatgac aatttaaaaa tatgtaggtt taaaagatac agattcccat aaatatatca 192841 ataagagaaa tatgttggca atgttttcat ttggcctttg ggctttcccg gtagcagacc 192901 ttgtggtaag tattcaagtg caattagttc tttcgggagg tggtacaagg aaacaccagt 192961 aggggagtgg agaattgaga cagtgaggga aataagccaa atgaattatg tcctgtgagt 193021 cttttatggc tgccaaaatg aattactaca aacttagtgg cttaaaatga cataaactta 193081 taagcttata gttctgtagg tcagaagtcc aaactggttc tcactgggct aaaatcaagt 193141 tgtcagcagg gctactttcc tttctggaag ctctagagat gaatctgttt tcttgccttt 193201 tctagtttct agagggggcc cacattcctt ggcttatgac ctcttcctcc atctttaaag 193261 ttccatagtc acatctcctt ctgactctca tcttctgtct ctcccttcta cttataagga 193321 cccttgtgat tacactggta ccattcagat aattcaggat aatcttcctg ttttaagttc 193381 aactgattag caaccttaat tccgtctgca gcccctttac tgtgtaacat aacatattca 193441 caggttccaa agattagcaa tatggacatc ttttgtaggc cattattcca cctaccacat 193501 gtgtgttaca aagccagtta tcaccatggg caactggagc tcagtcttgc tggggaactg 193561 taaaacatgc ttcagagtga tcccatctca gggacatggg agctggggta tttatctgtc 193621 ctccaactca ccttccatca ttgattgagg gctcctacca ggggtacctg gcagtgccca 193681 tcctaggcca agagggctct agtggctgga gaaagtcttc aggcaactag tttcaggtgc 193741 tgacatttga aggctgtcag attggtgtgc tcaaaaatag tgagggctac gaggacgtgg 193801 gcagccatcg ataaagctta ctacagggaa gcaccatcgt tagagtaaat aaaaattgca 193861 ggttcttatt taaccaaatc cattaataat atcccaggcc tcagctccat agttttcttt 193921 ggtgttcaga tgtttggact caatttgccc tgcttttaag tttagaaata tatatatatc 193981 taaaaaatta aaggtttagg ataaatattc tttaagctcc cttcaaacta aaaaagtcct 194041 ataattcttt gattctgctt actgttaaca atttaactga 
gcttctatta ttgtggttat 194101 atagctggag actaggaaat agtctggttg gatttctgaa gttattgatg aagtaatttg 194161 tttattgggt ctgtgagcat gttagtgtct atttgtttgt accttaggct aggtcatagt 194221 taatacttga gtgtgaaaga gagttttttt ttcctaaaga cttgtttgtc ctcatccact 194281 taatgaatag ctactacata tcatttgggt actcagcatt gggaatatga agatgctaaa 194341 gacatgggag tcatggtcca gtgagatggc ttcccatatg ttcatctagt ctgtattaat 194401 aaatggatag cctccagaat caagggagtg ggtatccttt ctgctcacag tggtcagctt 194461 agtcctcacc tgagtgttga gtctaattat ggggcatgaa caagctgaat tgtttccaga 194521 agagagaatc aggactgtaa atggttgaag acaacgtttc ccaaagtgtg tgttccatga 194581 aatagtaata gaggtaagac atcctcccct cccttcagtt ttggggaatc ataataatta 194641 tcaccataaa aatttcctag aagctctcca agaaagaaac ctatttagat gtgtttaaca 194701 caactacctt ttttgtaccc attaaccatc cccacttctg accccagccc cccaccaccc 194761 ttcccaggct ctggtaacca tccttcgaaa tgcagaggtt cccaaactcc ccggtgtttt 194821 gggaaatgct gatctagaat tcatgactga agaataagag ttgaaagaac ttaataacac 194881 ttgaagtgga acctcataac tgtctgcata tttttgaagg aaagttaagg tggtagaacg 194941 acttattata ttcaggagat tcagcaagtg ttcagttaga ggctacacta gaccaataga 195001 tgggagccat agaaaagaca tttcagatca gcattatgga gagctattac gtttagaccc 195061 gcccaaagag gaaatgggtt accttagaag gggatgagct cttggtcatt ggaaataccc 195121 aagcagggta ttataaccat ttgaatgggt ttttctagag ggggcccatt cctctagtag 195181 gaagttggag tagatgacct ctaagccatt tccaactcag aaatttttat gaatttatga 195241 ttcaggcata ggacagccct tacttgataa cagtgtgaga tcatgattaa gctagacagc 195301 agtatagttg ttttgttgtt gacagcaaat ccggtgcata tgttaaaccc actgagtaat 195361 tagaacaaaa gagaaactga gtaggtagta aaagtggtgt cttctccatt ttatttttgg 195421 aagactagat cttcagtgca agaaatcttc cagtttctgg ccagccagtc aaggttgcat 195481 tttaaaaggg cgagtttatt tttttatgct tatttattta tttatttatt tatttattta 195541 tttatttatt tatgagacag agttttgctc ttgttgccca ggctggagtg caaaggcatg 195601 atctcagctc actgcaacct ccgcctcccg ggttcaagtg attttcctcc ctcagtctcc 195661 caagtagctg ggattacagg catgcgccac catgcctggc taattttgta ttttttaata 195721 
gacatggggt ttctccatgt tggtcaggct ggtctcgaac tcccgacccc aggtaatcca 195781 cccgcctcgg cctcccaaag tgctgggatt acaggcgtga gccaccacgc ctggccaatg 195841 acatgctaat ttaaatactc ttcattgata aaatagagga catgaatcta atattatgaa 195901 atattattat tgtcaggctt agtcatataa aattactttt tctttgtctc ttgtccaaag 195961 gattgtgatt aagtaactga gaattgaaac ttaaaatgat tcaatcattt tctgaattct 196021 ctattgcctt ttgcaaatag gaagcttgtt ggagcagccc tgcttaaagg acatttattg 196081 cacagcaatt tctataagct cccattgttc atgccttcaa ttcatccagt caacagcatt 196141 gactgaggga tgtagaggac caagtactta ttctcagagc tgagagacct ggaaaaacta 196201 aatgttgcaa tctgcacact gatagctact gccgcaaaaa ttaacactat gtaaaattac 196261 aaatttgtcc cacactgcta tgatagctct atgattgaaa tgaataattt cactcttact 196321 tctttaatga aggtttgcaa tgatggaaaa acagtaggaa gcattttttc aaatgcaaat 196381 taccttacag gaaagaaata acaaaggaaa gcattgccac caaggggaaa aattaaatct 196441 tacaccactt ccttgggtgt agttgagaca aacagttagc atggtaagga tcatttatgt 196501 cctgtaataa catattactt taaaaatttc cattccttcc agttatgggt cctatctatt 196561 tgctgggtga ctttgatgtt gcatgtcagg aactaactgt gagatatttt tagcataatt 196621 gcaaatctca tgaagaatct tcacaataag gattctaagt ggtgtgtaat tctcatccac 196681 aggttgaatg actaaattaa gatacataca gatctgtcac ttgcatgaga tggtaaagtt 196741 aactaatcag ttaaatcatt gttactttct tacgctgtac aaggcataga ggatgtagca 196801 atgaacaagt agataagaga gtggcctcaa tagaacttac atttgagaga gaaaatccca 196861 gacatacata tggaaacccc attattgatt tcttcatata ttcttaatat gctttgctat 196921 gtaacaaagc accaccaagc aaccatttat tagctcatga ttctgtaatt tgggcaggac 196981 tgagctgaga cagctgctct gtatggtgct ggccggggtc gctcaactag ggctggactc 197041 acagacaggt gagtgaaggc gacctcactc acctgtctgt gccccagttg ggctggctct 197101 cgtggctggg gcatggcagg gcctctctct cttcctttgg agtctctcaa cttctggggt 197161 ttctcgcttc ttcatttggc ctcattccct ctagctgtgt agttgaactt ctttgcaagt 197221 gacttccaag aaggtggaaa atggaagcta ctaagtttct taaggcctag acccagaagt 197281 catacctgac cccttccact gcattctgtt ggtcatagca agtcccaagg ccagtccaaa 197341 tttaagaaga gggaaataga 
ttgtctcttg atgggaggag gggctatgtg cattcagggg 197401 tagggagaat tgttggtgtc cccttttata gataatctac cacatgcttc ttcagaggtg 197461 aagagaaata atgtggcacc tttttcaaat ctttgctatt cagttgctat cttgtctaca 197521 acagaaagag gaacgtgact acctttgaaa atagagtgat cccttttaat ctctactcta 197581 ttcctttttt tatgactaca tttggaaaac aatttaactt ctaagactat agtgactttt 197641 caacaagttt ccagataaaa tatacatgat atatttactg acaggatctt ttcaagccct 197701 gttctaggtg ttcataaata tatagctata gcagataaaa aaatcatagc aatcttaagg 197761 ctgaaaggga ctttagaaat catctctttc acatccctaa tttcagagag tagggcacta 197821 aggttcatat gtggtaataa tgacaatgtt tgttagaatt tatattttct ttgtctattt 197881 caattggaag gtatgtttag attttttaca gctttatggg tatggttatt cgattgttat 197941 taccatggtt actagaaagc agtttatctt ggctttccat tgaaaaaatt aaaggattat 198001 gctctgagat tcatgcttgt aattgtctcc aaattatgta ttaatgccat agatctctga 198061 gttatggtgc tataggagta attgtcggac atgaatttac acatggattt gataataatg 198121 gtaagtaccg gttcatttta taagctgctg cttttataat aatgttgact atatggatgt 198181 taattgttat gattatcttt gcatagcagc agcatcatag cttgaagatt atccaaagca 198241 atatgattgc tgattgaaaa cttttcaccg tgtcttgctg tgatggcttg caataagtaa 198301 acaattgttc ccattggtga tcggggtaga ctttaaaagg cagccttgta atttcaataa 198361 aattagcttt gcatttagat caggattgct gtagctgcca caatagttac agagattgat 198421 ttatagcttc tccctttgtc atttattcat tgtcttgaga tattctccct ggaggagcaa 198481 gctctgctcc aacgatgatt atacatgctg ccaagttagg actcgttgag ttaggactct 198541 ggagcttggc tgactggttc ccttttacga tttcgaagat tatgcaatgc ttaatctata 198601 tttttctcta tactggctta tattctgatt attattatta ttattatcat tattgagacg 198661 gagtctagct ctgtcgccag gctggagtgc agtggcacga tctcggctca ctgcaacctc 198721 ggcctcccgt gttcaagtga ttctcctgcc tcagcctccc gagtagctgg gattacgggc 198781 acgtgccact atgcccagct aatttttgta tttttaatag agacggggtt tcaccacatt 198841 agccaggatg gtctcgatct cctgacctcg tgatctgccc acctcagctt cccaaagtgc 198901 tgggattaca ggcatgagct accgcgcctg gcctattctg attattttaa agtctatact 198961 ttcctgataa tttatataat aggatatgct ttgtatcccg 
atgacattta agcaaaaaga 199021 aaaagcagaa tatgacataa actaaagaaa tggcttcctt tgagagtcct catcatcatt 199081 aatcagcact cttttctcct ttaactttgg agtattatcg gaaggccaca tgaaatctct 199141 gaaacttacg gattagtagc aaattactat cacatccata cccacacagt gatttacagg 199201 cagtctcagg aaccagctgc cctaaggcaa cgttatatag aaggggacct ctttcttcca 199261 tccctgtagt atctccaaca tccttcctgg atgcctaccc cagggccaga gacacttagg 199321 ctcttctaga aaagtcttcc tgtactccca ccagcccttc cttaatcacc tagatttttt 199381 gctcccctgg tcttcccttc tctgttaaaa tgtggctgac ttgtagttat aaatgtggct 199441 gacttgtagt tatctgactc tgcagagcta gactttctgc ctttgggtgt ttggtttatc 199501 tgagatgaga gtcttagcta agaaaaaaaa aagctaatag tcctttcttc gttctggacc 199561 tctcttgata ggaaatttaa gtacagttcc tccccagaac cttctggagt ctttctcaaa 199621 ggactgaatc taatggtggc aatttcactc tgtaaccaat gcttagaaca gtgcattgca 199681 catataagta catgcccggt acctatttaa tgattggttg gttagtgggt agaggacagt 199741 gactccatac tatcccaggg ggtggtgtta gtagtatgta gccttgctga agagttttgg 199801 attgttccaa ttaacaactt tacttacggg acttttttgt aagctctctg tataattaaa 199861 tttaaacgat gcctctgatt ctcatagtaa ccctgaatat agaagacagt ctagcagcat 199921 ccaaattttg tttcaaaggc tgtagagaaa atgaagagtg agcacatagc aagaaagcat 199981 gtgtaatctt tcttcccatt gtgttttgga gattttcatt cttattccaa gttttctttt 200041 tagagcaatc ttccctgtct aagcaggaca gatgggcctt ccgtccacac tagtggctgt 200101 ggaatgaatc ataacagagg ttttatgtgt ctttcagctc ctggcaactt ggaggaaaca 200161 ggttcctata gggctgagtc agggtctatg atcgcacagt atggatgact cactggggct 200221 attcctatgg atcatttgga atttcactgg tagtaatgct ctgattgcct ccatgggctt 200281 tctgaatgaa gccattttat ttcataatcc agtgaacact ttgggtcaaa tcaggtgaag 200341 acacagaatc caagattgaa taagactagt atatgtaata aattgatgaa tctcattttt 200401 aaggtgctct ttgttcctta gaaggaaaga acttcctcaa ttgtctttat cctttttgcc 200461 ctttttacaa ttaacctgtg acattggaga gaatgcagga gctgatctca ggaagagtgt 200521 tccctgctgt tatgactgct ttttgaaggc ttgtcgaggt gagggaaagg aaagatgaat 200581 ttagtttcca taaataacac aaatgtcttc gaacttttcc ttttgctagg tagaaaatat 200641 
gataaaaatg gaaacctgga tccttggtgg tctactgaat cagaagaaaa gtttaaggaa 200701 aaaacaaaat gcatgattaa ccagtatagc aactattatt ggaagaaagc tggcttaaat 200761 gtgagtacaa ctgtggctaa ggggggcacc ttgtggttca tttttcctca tttggacatt 200821 agcttgcttt aaaccattgg ttctcaaagc gtgtttacaa gatcaacagc atctgtcttc 200881 cttgggaaca ttcttgagtc ctaccccata cctgctgaat cagaatcttt ggggttgggg 200941 cctggtcatc ccttaaccag gcatgaactt tttcctgata tattactagg gctggaaaaa 201001 tacctaaatc cttgctatgc ctcaaatcta ttatacttgt agatgaattt aggtaaaaat 201061 ccatgtgctt cttttaaagg gaatgtttat aaacactgac ttggttctcc tggttcttaa 201121 accatgctca gcccaagtga gaccactgct cattcaggca gagaaaccag agtctgtatc 201181 cttgaatggt gtctagctct tttatcagtg tcctggacag ccatttctct gcctatatat 201241 ttttaaatat ctacaactgt gacatgagaa gacctaaaat gggtgcttct atgtatctct 201301 atatagtgca ggggttttca cactgtgaaa attcatctgg agtatatgtt gcttcaagaa 201361 aagcatagca gaaaggggtt ctagagtcag acatgggtcc aaatctaggt tcaactagtt 201421 ttatgaactt gggcaagtta cttaatgttc ctgagtctca gcatcttcat ctgtgaaacg 201481 ggtcaatgtt tctgaactca tggatttgtt gtaagaatta agtgagcaaa gcaatgtgga 201541 gcactagcac aaggcctgac ataaagcaag tgctcaataa aagatcgtta gttaatattc 201601 ctgatccttc gtgacatact cattagtacc ctatcccagg aaaaaacaaa gccaaagatt 201661 tttgggttaa aaacccccca aaaaacaata gaacagcttt cttggaggaa gggaatttat 201721 atttcctgat gattagtata tacaggcatg aagatgaaac ttttcactta attgaaacaa 201781 atgtatcccc atactagtcc tgagaggtat agattattac taagtgtaca ggttttacgt 201841 atgaggttca aaggaattaa actgaggttc aaagagatta aaatccacgg tagagtctag 201901 attagagttt aagtaagtct gtacgactcc aaagtccata ctgcttcccc tatatcatgc 201961 tgaggttctt attttaagtt cattttatat ctgggtgatg tgtgggtgga ctgctcttca 202021 ccattcattt tggccaaagt gtcagagatt gaaacttaat tctgacaaat gtgtaccttt 202081 gtcacaagga cgatagaaag ggcatgcctg tcttagcacc aatccactta accatcagaa 202141 aacagcttca gtttcattcc cacaggagag tgggtggctt cctttgtgga tataagggcc 202201 atgctttatt atttttctgg cttcatttat aggagccgtt cctttgtgcc tgaatcttgc 202261 tttgtagtga cttcggggga 
gctcagatgg ggcactacag atgcagactg tcagcagctg 202321 ttgagtcttt ttttgttgtt gttccttttc gtagctgttg agtcttggga gtccaaacct 202381 agggccacgt ggtctactga ggcagaccag agttcaagga gccagagggg tgaggctgct 202441 ccaaaaggct ctaaattaga tataaagctt agagtgaatg gctgggcatg gtgactcatg 202501 cctgtattct aaacactttg gaaggccaag gtgggaggat cacttgaacc taggattttg 202561 ataccagcct gggaaacata cggagaccct gtctctacag aaaatttttt aaaaaattag 202621 ccgggtgtag tggtgcatgc ctgtagtccc agctactcag gaggctgagg tggaagaatt 202681 gttggaaccc tatagtgaga tatgatcaca ccactgcact ccagtgtggg tgaaagagag 202741 agaccctgtc tcaaaacaca caaacacaca cgaaaccctc agagtaaaaa ggaaggagat 202801 taaatttttt aagcaggagt ggatataatc ataatcttca agagccaaga ggagttggga 202861 cttgagaaag tagcatatta ggaagcaaag gctctggaga cagaagactg cttgggttcg 202921 aatcccagtg acatgcgagc agtttacttc atcactctgt cttaggttca ctagaagcag 202981 agcccaagaa ggggttcctt aggtcattga cttttttgag acagcgtctc gctgagtcac 203041 ccaggctgga gtgcggtggt gtgatctcgg ctcactacaa cctctgcctc cctggttcaa 203101 gtgattctcc tgcctcagcc tccccagtag ctgcgattac aggtgccagc caccacgcct 203161 ggctgatttt tgtattttta atagatacgg ggtttcttca tgttggccag tctggtctca 203221 aacccctgac ctcaggcaat ctgcccgccc cagcctcccg aagtgctggg attacaggtg 203281 tgagccaccg cacccagccc aggtcagtga tttattcaga aagtcgtttc aggaaaaacc 203341 tataatggaa tgagggaaac ctattggttt caggaaatgt ttagcctcac cctgattcca 203401 cagaggattc tggagcacga actgcaccac agaaattgtt cagcccaggt actgggttgt 203461 tcaaactgca catcagtcaa ttactggaag gtgggttata acctcccagc ggtctccaga 203521 tgaggtgact cccataagct aaggttgatc tgagctgtta gcagttgtgg gaatgggtgc 203581 acagtttagg cagaaaaggg gatctggtgg gatatcgaca tctgctacac tctcggagct 203641 ttagatcctc atttgttgag caagggtggt ggtaattagc acttacttac ctcatacgac 203701 tgtggataac aaagaatata cgtaaagcag gaatggtagc ttttgtgtac tgaagctagg 203761 agggtcctga gtgattacag tgaataatat tttccaaaga tttgttttta ttcattgata 203821 ttgcttgata cctgggtgac tgggctagct taaaagtaaa gtgttggtaa gatccatggc 203881 ttgaaagtta aaagcaaggt atattagttt tctgttgctt 
gctataacaa gttactataa 203941 acttagtagc ttcaaataac ataaattatc atacagcgct gtatgattta ttatcataca 204001 gctctgtatg tcagaagtct gacatggatc tcagtggact aaaatcaaag tgtcagcagg 204061 gccacattcc tttgtagagg ttctagggga gtatctgttt cctcgccttt tctagctttt 204121 agaggctgcc cacattcctt agctcatggc cccttcgttt ttcaaagcca gcactggccg 204181 gtggagtctt ctcatgttgc atcactctga cactcgctct cctgcctccc ttttcctctt 204241 gtaagaactt ttgtaattac actgggtcca cctggacaat ccaggataaa gtctgcatct 204301 caggatcctt aatcacatca taaaagtctc ctttgccatg ctttattttt ccagcttcat 204361 tttgtgggga ccattcctct gtgcctgaat ctttctttct tcgcagtgaa taaggtaaca 204421 tattcacaga ttccagggat caggacgcag acatctttgg gaggccgtta tcctgcgtgt 204481 tatacaagag ttgaatgact tttctgtaaa gggccagata gtaaagaatt gaggctttgt 204541 ggggcgtatg gtctctgtta caatatttaa ttctgccact gtagcaagag agcaaataca 204601 gactatatgc atgggcatgg ctgtgtccca ataaaacttg atttatgaac accaaaattt 204661 gaatttcata taagtaattt tcatgtgcca ccacaatatt attcttcttt tgatttttcc 204721 ccagttgttg aagtatgtaa catccagtct taccttgaag cctgtacaaa aacaagcagc 204781 aggttggatt tggcgtgtgg gctgtattag tttagaatat ttgtgtccta gctcatttaa 204841 agttttgaga aacaatcttg aatattggga ataatttttt ttaccatgct ggctggttgt 204901 catacatatc gcagacctcg accttgtgat ctgcactact tctgtttgat ttttcttccc 204961 taggagaacc actttccttt taattttcca tggaaattgc tgaaggtaaa atgatgatag 205021 ttggtcttga aatcttgttc ctgtgtggat ggtaaatttt tggaaacaca ggcacaatga 205081 gaagaatact caaattatgt gagttaatca gagaaagtaa cctgtgtatt gacattcctg 205141 ttttttccca ttaaaccaga gtcactcaga atattgttaa cattttattg gagctggcta 205201 gattggttag aatgaagagg taggtgaaaa agaagtgtgt agcataatga aatagaaagt 205261 attacattcc agagcacctt gctgaggata gtttgccatc tttcttacaa tatattatgc 205321 ctacctctga tttattgaat gtgattttct ctttaaaata tgatactttt cttgctataa 205381 tattgatgcc tcttgctgaa tgatagttga ccgtgaaaca cgcattcatt tttttttttt 205441 ccttttttct ttctgttagg tcaaggggaa gaggaccctg ggagaaaata ttgctgataa 205501 tggaggcctg cgggaagctt ttagggtatg cgctgctaca tttaccgtgg ttctaaaaat 205561 
caagccataa aacaccatat gggggtacct caattcatac catagccatg ttcttgagga 205621 gttagttagt tagatattat ggaatttctg taaatagaag tgtgtctccc cattgaccct 205681 attacataaa gagaactata tttctttttt ctttttgggg tgtgaagtca ccctaaagaa 205741 taatatttgt tattaagaat catgtattta aagaatagct tttagaaaat acattcacca 205801 tcagatcgaa gcatgctaca gcaatctcct tggggctagg ttcatttgtt ttccttgagt 205861 acaggtcttt ctgtagtcct cctataaatt tctcttaaaa tatcaaactt ctaaattatg 205921 accagagggt agtggttctc atatactggg ccaagtgata gttcataata acgttttcat 205981 catccactag tttgtggtca aataagaagc ataaagataa tacagaaaga tttcataaag 206041 ctaaatgcat taaatttaat gactagactg ttattcctgg ttatatcctt cctatattct 206101 agttgaaata tactttattt tataaaatga gggcactaga agatgatagg ttttctctta 206161 cgtgtcaaaa tgaaaaaagg taatatgcgg accccaaaat gtattgccat aggggccttg 206221 ccattgtgtt agatttgttt gtcccctgct aggaatggaa agaacagctt tttttctcca 206281 agagttattt aacatctgaa tgctttggtc cagctgcttt ggtatacatt gagcataagg 206341 cctttttgca ggtgtacctg cctcactggt aagcaacagg acatggactt gtgtttctta 206401 agaaaatttt gacatgatgc attttgcacg tggcaggagt atatatttgc tattcctgtg 206461 ctagctgagc aaagagaaaa acccaccgtt gcagagactc atctcttctt cttctctcac 206521 caggcttaca ggaaatggat aaatgacaga aggcagggac ttgaggagcc tcttctacca 206581 ggcatcacat tcaccaacaa ccagctcttc ttcctgagtt atgctcatgt gagtagactg 206641 aggaaggggc atcagggatg agatgcagga cttgagtttg ctcccttgat caaagaaaga 206701 tttcaaacaa aagtttaatg gcatgacttt gattttcagc aaaaatgtta tcttgattat 206761 tttcagggta aatctcatct tcttaatgtt ctgtagtaca taattaatta actgattgaa 206821 tcaaggttta ttagtatttt ttaaatcaaa tgggttagca gccctgtggt aattcatttg 206881 atagttcctc aatgtatctt gacttcttaa tgtgtttgat actatttatt tgtaaccatt 206941 tctagtaact tgctctttgg ctttcatggg cctgctcttt cctatattgc tcctcctttg 207001 cctcatttgc tgactttatg ccaccactct cagcctgtgg gggaccctca gggtcatgtt 207061 cccaactgtc taatgcatat gcagagaccc ctagagctga aaaagaacaa aggcaaggag 207121 agtcatgtca ctcttttgtg agtttaatag gctcttggag cagaaattga aaggaagatg 207181 tacacgaaat aaagcacagt 
tccgagcctg agtctctgtc taatgaggca agtgccaaat 207241 ctttattagc tattatactg gtttatgtca taaacatacg ggcagccttt tcaaagaagt 207301 cagggatagg tcaataaagg ttctagtgaa tggcctcacc gtgggcaaat gtgtatgaat 207361 atgctcaaag ctagacaaaa attttcatca gtacaacctt ttgttcctaa gtctgtcttt 207421 gctttcactg ctaagtaagc tcattctcac acgagtcttc tataattcac tctggtcaca 207481 ggattcttgc ttgcctgcgg tcttttattt cttaaagcta ataaaaaaca ctaaataaat 207541 tattatttag caacctaatg tcaacaaaac tcaactctag tcatccacaa catgactttc 207601 cttttacttt tcctgtttct atcatcacat cccatcttta gattttcctg gttagaaagc 207661 ctggattctt aggtcttatt tgatgcatct gtcctcaatt ttacaacatg tattccagaa 207721 gccagatgtt tcatacttta ctttacttta aaaaaaaatt atactttaag ttctggggta 207781 cacgtgcaga acgtgcagtt ttgttacata ggtatacatg tgccatggta gtttgctgca 207841 cccatcaacc cgtcacctac attaggtatt tttcctaatg ctatagctct tctagccccc 207901 aaccaaccct ctgacaggcc ctggtgtgtg atgttcccct ctctgtgtct atgtgttttc 207961 attgttcgac tcccactcat gagtgagaac atgcggggtt tggttttctg ttcttgtgat 208021 agtttgctga gaatgacggt ttccagattc atccatgtcc ctgcaaagaa catgagctca 208081 tcctttttta tggctacaca gtatgccatg gtgtatatgt gccacatttt ctttatccag 208141 tctagtctat cactgatgga catttgggtt ggtgccaagt ctttgccatt gtgaatagtg 208201 ccacaataaa cataagtgtg catggcatct ttctttatag tagcatggtt tataatcctt 208261 tgggtatata cccagtaatg ggatggctgg gtcaaatggc atttctggtt ctagatcctt 208321 gaggaatcgc cacactgtct tccacaatgg ttgaactaat ttacactccc accaacagtg 208381 taaaagtgct cctatttctc tacatcctct acagcatctg ttgtttcctg actttttaat 208441 gatcaccatt ctaacaggca tgagatggta tctcatagtg gttttgattt gcatttctct 208501 aatgaccagt gatgataatg agcatttttc atatgtttgt tggctgcata aatgtcttct 208561 tttgagaaat gtctgttcgt agcctttgcc cactttttga tgaggttgtt tttttcttgt 208621 aaatttgtat aagttctttg tagattctcg atactagccc tttgtcaaat ggatagattg 208681 caaaaatttt ctcccattct ttaggatgcc tcttcactct ggtggtagtt tctttggctg 208741 tgcagaagct ctttagttta attagattcc atttgtcaat tttggctttt gttgccattg 208801 cttttggtgt tttggacatg aagtctttgc ccatgcctat 
gttctgaatg gtattgccca 208861 ggttttcttc taggattttt atggctttag gtcttaagtt tgtttaagtc tttaatctat 208921 cttgagttaa tttttgtata aggtgtaagg aagggatcca gtttgttttc tgcatatggc 208981 tagccagttt tcccaacatc atttattaaa tagggaatcc tttccccatt gcttgttttt 209041 ggcaggtttg tcaaagatca gatggttgta gatgtgtggt gttatttctg agggctctgt 209101 tctgttccat tggtctatat atctgttttg gtaccagcac catgctgttt tggatactgc 209161 agtcttgtag taaagtttaa agtcaggtag tgtgatgcct ccagctttgt tctttttgct 209221 taggattgtt ttggctatat gggctctttt tttttttttt tttttttttt tttttttttg 209281 gttccatatg aagtttaaag tagttttttc cattctgtga agaaagaaag tcaatggtag 209341 cttgatgggg atagcaatga atctctctct aaattacttt gggcagtatg gccattttca 209401 cgatattgat tcttcctatc catgagcatg gaatgttttt ccattggttt gtgttctctc 209461 ttatttcctt gagcagtggt ttgtagttat ccttgaagag gtcctttaca tcccttgtaa 209521 gttgtattcc taggtggttt attctcttag tggcaattgt gaatgggagt tgactcatga 209581 tttggctctc tgtttgtctg tgattggtgt ataggaatgc ttgtgatttt tgcccatcga 209641 ttttgtatcc tgagactttg ccgaagctgc ttatcagctt aaggagattt tgggctgaga 209701 cgatggggtt ttcttaatat acaatcatgt catctgcaaa cagagacagt ttgacttcct 209761 cttttcctat ttcaataccc tttatttctt tctcttgcct gactgccctg gccagaactt 209821 ccaatactat gttgaatagg agtggtgaga gagggcatcc ttgtctggtg ctggtttcca 209881 aagggaatgc ttccagtttt tgcccattca gtatgatatt agctgtgggt ttgtcataaa 209941 tagctcttcc aattttgaga tacgtgccat caatacctag tttactgaga gtttttagca 210001 tcaaggggtg ttgaattttg tcaaaggcct tttctgcatc tattgagata atcatgtggt 210061 ttttgtcatt ggttctgttt atgtttattg atttgcatat gttaaaacag ccttgcatcc 210121 catgtatgaa gccgacttga tcatgtgatg gataagcttt ttgatgtgct gctggatttg 210181 gtttgcaagt attttatcgg gtattttcgc atagatgttc atcagggata ttggcctgaa 210241 agttttcttt ttttttgttg tgtctctgcc aggttttggt atcaggatca tgctggcctc 210301 ataaaatgag ttagggagga ttccctcttc ttctgtggtt tggaatagct tcagaaggaa 210361 tggtaccagc tcctctttgt acctctggta gaattcggct gtgaatctgt ctggtcctca 210421 actttttttg gttggtaggc tatcaattgc tgcctcaatt tcagaacttg ttattggtct 210481 
attcagggat tcaacttcta cctggtttag tcttgggagg gtgtatgcgt ccaggaattt 210541 atccatttct tctagatttt ctagtttatt tgcatagagg tgtttatagt attctctgat 210601 ggtagtttga atttctgtgg gattcttggt gatatcccct ttatcatttt ttattgcgac 210661 tatttgattc ttctctccct tattagtctt tcttgctagt ggtctatctg tgttgttgat 210721 cttttcaaaa aaccagctcc tggattcatt gattttttga agggtttttt gtgtctctat 210781 ctccttcagt tctgctctga tgttagttat ttcctgtttt ctgctagctt ttgaatttgt 210841 ttgctcttgc ttctctggtt cttttcattg tgatgttagg gtgtcaattt tagatctttc 210901 ctgctttctc ttgtgtgcat ttagtgctat aaatttctct ctacacactg atttaaatgt 210961 gtcccagaga ttctggtaca ttgtgtcttt gttctcactg gtttcaaaga acatctttat 211021 ttctgccttc attttgttat gtacccagta gtcattcagg agcaggttgt tcagtttcca 211081 tgtagctgtg tggttttgag tgagtttctt aatcctgagt tctaatttga ttgcactgag 211141 gtctgagaga ctgtttgtta tgattttctt tctttctttt gcatttgctg agaagtgttt 211201 tacttccaat tatgtgatca attttacaat aagagcgatg aggtgctgag aatatatatt 211261 ctattgattt ggggtggaga gttctgtaga tgcctattag gtccacttgg tccagagctg 211321 agttcaagtt ctggatatcc ttgttaatta attttctgtc catttatcta atattgacag 211381 tggggcgtaa agtctcccac gattattgtg tcggagtcta agtctctttg taggtctcta 211441 aggacttgct tgatgaatct gggtgttcct gtatagggtg catatatatt taggatagtt 211501 agctcttctt gttgcattga cccctttacg attatgtaat gctcttcttt gtctcttttg 211561 atctttgttc atttaaagtc tgttttatca gagattagga ttgccacttc tgcttttttt 211621 tttttttttt tttttttttt tttgctttcc atttgcttgg taaatattcc tccatccctt 211681 tatttatttt gagcctatgt gtgtctttct ttgcatgtga gatgggtttc ctgaatacag 211741 cacattgata gttcttgact ctatccaatt tgccagtctg tgtcttttaa ttggggcatt 211801 tagcccattt acatttaagg ttaatattgt tacgtgtgaa tttgatcccg tcattatgct 211861 gctacctgga tattttaccc attagttgat gcagtttctt catagtgttg atggtctttc 211921 tttacaattt ggtgtttttc agtggctggt gccagttgtt cctttctgtg ttcagtgctt 211981 ctttgaggag ctcttgtaag gaaggcctgg tggtgacaaa atctctcagc ctttgcttgt 212041 ctgtaaagga ttttatttct cctttgctta tgaagcttag tttggctgga tatgaaattg 212101 tgggttgaaa attcttttct 
tttctttgag attgttgaat attggccctc actctctcct 212161 ggcttgtaga gtttgtgctg agagagatcc actgtttgtt agcctgatgg gcatcccttt 212221 gtgggtaacc cgacctttct ctctggctcc ccttaacttt ttttcctcca tttcaacctt 212281 ggtgaatctg acgattatgt gtcttggggt tgctcttctc gaggagtatc ttagtggtgt 212341 tttctgtatt tcctgaattt gactgtgggc ctgtcttgct agggagggga agttctcctg 212401 gataatatcc tgaagagtgt tttccaactt ggttccattc tccccgtcac tttcaggtac 212461 accaatcaaa cgtagatttg gtcttttcac atagtcccat atttcttgga ggctttgttc 212521 atttcatttc actccttctt ctctaatctt gtcttctcgc tttatttcac tgagttgatc 212581 ttcaatctct gatatccttt cttccgcttg atcaattcgg ctattgatac ttgtgtatgc 212641 ttcacaaagt tctcatgctg tgttttttag ctccatcagg tcatttgtgt tctcctctaa 212701 actggttatt ctagctagca attcatctaa cctgttttca aggttcttag cttccttgca 212761 ttgggttaga acaatgctcc tttagctcgg agaagtttgt tatcacccac cttctgaagc 212821 ctacttctgt caattcgtca aaatcattct ccgtccagtt ttgttccctt gctggcgagg 212881 tgttgtgatc ctttggagga gaggaggtat tttggttttt gtaactttta gcctctttgt 212941 gctggtttct ccccatcttt gtggatttat ctacctttgg tctttgatgt tggtgacctt 213001 cggatggggt attttagtag acgtgctgtt cctttctgtt tgtttgttag ttttccttct 213061 tacaaacagg ctcctctgct gcaggtctgc tggaggtgca ctccagactc tgtttacctg 213121 ggtatcacca gcagaggctg cagaacagca aaaattgctg cctgttcttc cctccggaag 213181 ctttgcccca gaggggcacc tgccagatgc cagccagagc tctcctgtat gagatgtttg 213241 tcggccccta ccgggaggtg tctcccagtc aggatacacg ggggtcaggt acccacttga 213301 ggaggcagtc tgacccttag cagagctcga ccactgcgct ggctgtcagg cagggacgtt 213361 taagtctgct gaagctgtgc ccacagctgc cccttccccc aagtgctctg tcctagggag 213421 atgggagttt tatctataat tccctgactg gggctgttgc cttttttttt tagagatgcc 213481 ctgcccagag aggtgacatc tggcagtctg gccacagtgg ccgtgctgag ctgtggtggg 213541 ctcctcccag ttcaaacttc ctggtggctt tgtttacact gtgagggtaa aaccacctac 213601 tgaagcctca gcattgggga acgccccttc cccccccaag ctcaagtgtc ccaggtcaat 213661 ctcagactgc tgctgtactg gcagcgagaa tttcaagcca gtggatctta gcttgctggg 213721 ctctgtgggg gtaggacccg ccaagccaga ccacttggct 
ccctggcttt agaacccctt 213781 ctgacgggag tgaacggttc tgtcttgctg gcgttcccag caccagtggg gtatcaaaaa 213841 aaaaaaaaac cacacacaca cacacagaaa aacaacaaca atgaaacaac aaaaaactct 213901 tgcagctagt tgggtgtctg cccaaatggc cacccagttt tgtgctggaa acccagagcc 213961 ctggtggggt aggcactgga gggaatctgg aatctcctcc tgttctgcag gttgcgaaga 214021 ccgtggggaa agtgcagtat ctggactgga gtgcacggtt cctcaggctc agtccctcat 214081 ggcttccctt gggtagggga gaaaattccc cgaccccttg cacttcctgg gtgaggtgac 214141 accagaccct gcttcggctt gccctctgtg gcctgcaccc actgtccaac cagtcccaat 214201 gagatgaact gggtacctca gttggaaatg cagaaatcac ccacttactt tctgcattga 214261 tttggcattg atttcgctgg gagctgcagt ccagtgctgt tcctatttgg ccatcttcta 214321 gttcttctcg atgtttcata cttttaagtg cttttcagat ttgacatctt ctctcctgtt 214381 ttagacagtt tgggctgcta taacaaagtg tcatagacta ggtggcttat gaacaacaga 214441 aatttatcac agttctggag gctggaagtc tgcgatcaga gtgctagtgg gatcgagttt 214501 tggtgagggc tttcttctgg gttgcacact gccaatttct ctctgtgtcc tcacatggtg 214561 gaaagagcta gctagctctc tggcctcttc ttataagggt actaatccca ttcctgaggg 214621 ctccaccctc atgatttaat tgcctcccaa aggcccaact cctaatgcca tcacattgat 214681 cagggcttca acataagaat tttgggggat agaaacattt agtccatagc atccccatta 214741 ccctatctta tcatatatag ttagttatca tatataggca taatcttaat aaatatcctc 214801 tactctggat tatcccaaca gccttatgga atgttaatct gacttcagca tttttttttt 214861 taaccactgt tatacatacc attggaatac cacttttatt tcattaccaa cctgctgaaa 214921 gaactacagt ggctcccaaa tgcctaccac attgaagcca aatacctcca tctggcttcc 214981 caaattttca gtatgatcca actccactca atccaattca atttcccata actaactcca 215041 ctgggtgaga aaaaactcag ccttagtggg gcaagtatcc tcacttctac cctacaatgt 215101 catgcccatt actgtctctt tggctttgct tctgattctc cttctataat gcaacccttc 215161 ctactcttcc ccttcagatg attattcaat ctaagcttca tccttcagga tccatgttga 215221 gttccactcc ctccacatct tccataaact cttccccaag gaccccagtc cttattattt 215281 tcttttatct tcaactcctt ttatctttct tactttcatt accatgcagt tgagtactta 215341 actgctctcc aaatgttcaa gaattttagt tttatctcca cagctaatct tctggatcct 215401 
tacacttcct atctccatag cacccagcaa agatgctggg tatattgaag acagttgata 215461 tttttagatc aattgcttga aatgactaaa actaacagtt caactaagga gactcccttt 215521 tactgggtca gatatataaa ataagtatac aaaaatagaa ctttctagag ttgggattgg 215581 caaactaggg ccaaatatgg cctgctgcct gtttttgtaa ataatgtttt attggaaagc 215641 agccacattc cttcatttat tcatgatctg tagctggttt tgtactacag tggcagagtt 215701 gaataattga aacagggacc ttatagtctt caaagcctaa aatctttacc atctgtctct 215761 tttcagaaaa agtttgctaa cccctgctct agaggacaaa gtatagtgat ctatcaatca 215821 gttcttacta tttacatcct agcccacaat atctctttgt agataaaata aactaattga 215881 ttgatagtga agttacttat aattctgaaa tgtttccaga tataatacca aactaaacta 215941 tagctgactg atttattgta aactaaagag atagatagat agatttaggg gggaaaagtg 216001 gaaagctatt ttatatctta tatcactagt gagaaattta tcttggagct tttcccactg 216061 acagaatgaa gagatttttc agtcttttgc actttaaagt aagtctgctt ataaggagta 216121 aaattacttt ccctcctggg aaggtactaa gaaaacagta gcaggcataa tgtgttttca 216181 gaagtctctt aagtgatata ttaagcaaag acaatactat attagagtat aagaatatag 216241 gaagaaataa gaaatgaggt aaaatcaaat gccagaaaca acttcatatt agccagcttc 216301 tgaggaaatc aagggatcgc attacaaaac atcatcttca aagtcaaatt cagaaccact 216361 gtgactggtg gtttatcttc atcaaaagca acaacatcat atcacctttc ctgctttcta 216421 tgtccctgta gttgctgtga cctctcattt ttctcagata ttcttctaat ccttctactt 216481 ctttccattc tattttctac ttatttagtg gacatccatt gagaaccttc ctggtatcta 216541 taaataatct aactagcctc taatttcttc cttccactat acacccctgc cagataaatc 216601 tttttcaaaa tctgggtttt attattttcc tactcaagaa tgacttttaa atatagttta 216661 aacgctctga ccaactgctc aggacttcag aaatctgaca ctgtccttcc tctcccactg 216721 ctgcccgtta ggcatcatat actccagcct agctgaatgt ctggttctct ttcccctcaa 216781 acatgtcttt gttcccattt ccccctccac ttagaatatt tttttccata tttatgtgtc 216841 ctagttcaat tcacacttga agaccatatc atacctcaag tcatatggca agtcaggatt 216901 aaagtaagta agttatgctg cagaaacaac acgacaaagt ttatttcttg ttcactctcc 216961 atgtccaaca tgagtcagca gtggggctct tgtagttttt catcaagaag ttgggttttg 217021 ggccgggcat ggcctgtaat 
cccagcactt tgggaggccg aggcgggtgg attgcctgag 217081 gtgaggagtt agttagtttg agaccagcct ggccaacatg gcaaaacccc atctctacta 217141 aaaaaataca aaaattaggc aagatggctt aataggaaca gctctggtct gcagctccca 217201 gcaggattga cacagaaggc gggtgatttc tgcatttcca actgaggtac ccggttcatc 217261 tcactgggac tggttggaca gtgggtgcag cccacggagg gtgagccaaa gcagggtggg 217321 acgtcacttc acccgggaag cacaaggggt tgggggattt cctgttccta gccaagggaa 217381 gccatgagag actttaccag gaggaacagt gcactccagc ccagctactg cgcatttccc 217441 acagtcttcg caactggcag accaggagat tccttacagt accgggctcg gtgcatccca 217501 cccccatgga gcccagcaag ctgagatcca ctggcttgaa attgtcactg ctagcacagc 217561 agtctgaagt tgacctggga tgcttaagct tggagttggg aggggcatcc gccattgctg 217621 aggcttgagt aggctgtttt accctcacag tgtaaacaaa gccactggga agtttgaact 217681 aggtagagcc cacagcagtt cagcaaggcc actgtgacca gactgcctct ctaggcagag 217741 catctctgaa aagcagcagc cccagtcagg gaattatagg taaaaccccc atctccctgg 217801 gacagagcat ctaggggaag gggcagctgt gggcacagct tcagcagact taaacgtccc 217861 tgcctgacag ccagcacagc ggtcgagctc tgctaagggt cagactgcct cctcaagtgg 217921 gtcccggacc cccgtgtatc ctggctggga gatacctccc agtaagggct gacaaacatc 217981 tcatacagga gagctctcac tagcatctgg caggtgcccc tctgtggaca aagcttccag 218041 aggaaggaac aggcagcaat ctttgctctt ctgtgtagac ttcactggtg atacctagcc 218101 aaacagggtc tggagaggac ctccagcaaa ctccagcaga cctacagcag aggagccctg 218161 agttagaagg aaaactgaca aacaaacaga aaggaatagt agcaacatca acaaaaagga 218221 tgtccgatca gagaccatat ccgaaggtca ccagcatcaa agaccaaagg tagataaatc 218281 cacgaggatg gggagaaacc agtgcaaaaa ggctgaaaac tccaaaaccc agaacgcctc 218341 ttctccttca aaggatcaca actcctcgcc agcaagggaa caaaactgga cggagaaaga 218401 atgattttga tgaattgaca gaagtaggct tcagaaggcg ggtgacaaca gactcctccg 218461 aggtaaagga gcatgttcta acccattgca aggaagctaa gaaccttgaa aacaggttag 218521 aggaactgct aactagaata accagtttag aggagaacat aaatgacttg atggagctga 218581 aaaacacagc acgagaactt tgtgaagcat acacaagtat caatagccga actgatcaag 218641 cggaagaaag gatatcagag attgaagatc aactcaatga 
aataaagtga gaagacaaga 218701 ttagagaaaa aagagttaaa agaaatgaac aaagcctcca agaaatatgg gactctgtga 218761 aaagaccaaa tctacatttg atgggtgtac ctgaaagtga cggggagaat ggaaccaagt 218821 tggaaaacac tcttcaggat attatccagg agaacgtccc caacctaaca aggcaggcca 218881 acattcaaat tcaggaaata cagagaacac cgcaaaaata ttcctcgaaa agagcaaccc 218941 aagacatata atcatcagat ttaccaaggt tgaaatgaag gaaaaaaggt taaggggagc 219001 cagagagaaa ggtcgggtta cccacaaagg gatgcccatc agactaacaa aggatatttc 219061 ggcagaaacc ctacaagcca gaagagagag ggggcaagta ttcaacaatt tcaaagaaaa 219121 gaattttcaa cccacaattt catatccagc caaactaagc ttcataagca aaggagaaat 219181 aaaatccttt acagacaagc aaaggctgag agattttgtc accaccaggc ctgccttaca 219241 agagctccgc aaagaagcac tgaacacaga aaggaacaac tggtaccagc cactgaaaaa 219301 caccaaattg taaagaccat caacactatg aagaaactgc atcaactaat gggcaaaata 219361 tccaggtagc agcataatga caggatcaaa ttcacacata acaatattga ccttaaatgt 219421 aaaggggcta aatgccccaa ttaaaagaca cagactggca aattggataa agagtcaaga 219481 cccatcagtg tgctgtattc aggaaaccca tctcacttgc agagacacac atagactcaa 219541 aataaaggga tggaggaaga tctaccaagc aaatggaaag caaaaaacaa aaacaaacaa 219601 aaaaaaaaca gaggttgcaa tcctagtctc tgataaaaca gactttaatc caacaaagat 219661 caaaagagac aaagaaagaa gagcattaca taatggtaaa ggggtcaatg cagcatgaag 219721 agctaactat cctaaatata tatgcaccca atacaggagt acccagattc ataaagcaag 219781 ttctcagaga cctacaaaga gacttagatt cccaaacagt aacagtggga gactttaaca 219841 ccccactgtc aacactactg ccaacactag acagattgag acaaaattaa ttaacaagga 219901 tatccagggc ttgaactcag ctctggacca agtggaccta acaggcatct acagaactgt 219961 ccatcccaaa tcaaatcaac agaatataca ttcttctcag caccacatca cacttattcc 220021 aaaattgacc acatagttgg aggtaaaaca cttctcagca aatgcaaaag aatgaaaatc 220081 ataacaaaca gtctctcaga ccacagtgca aacaaattag aactcaggat taagaaactc 220141 actcaaaact ccccaactac atggaaactg aacaacctgc acctgaatga ctattaggta 220201 aataacaaaa tgaaggcaga aataaagatg ttctttgaaa ctaacgagaa caaagacaca 220261 acgtacctga atctctggga cacatttaaa gcagtgtgta gaaagaaatt tataacacta 220321 
aatgcccaca agagaaagca ggaaagatct aaaattgaca ccctaacatc acaattaaaa 220381 gtactagaga agcaagagca aacaaattca aaagccagca gaaaacaaga aataactaag 220441 atcagagcag aacggaagga gatagagaca tgaaaaaccc ttcaaaacat caatggatcc 220501 aggagctggc ttttcaaaaa gatcaacaaa atagaccact agccagacta ataaagaata 220561 aaagagggaa gaatgaaata gacgcaataa aaaatgataa aggcgacatc accaccgatc 220621 ccacagaaat acagtctacc atcagagaat actataaaca cctctatgca aataaactag 220681 aaaatctaga agaaatggat aaattcctgg gcacatacac cctcccaaga ctaaaccagg 220741 cagaagttga atccctgaat agaccaataa caagttctga aattgaggta gcaattaata 220801 gcctactaac caaaaaaaag ttgttgagga ccagacaggt tcacagccga attcgaccag 220861 aggtacaaag aggagctggc accattcctt ctgaaactat tccaaacaat agaaagagag 220921 ggaacccttc ctaactcatt ttatgaggcc agcgtgatcc tgataccaaa acctggcaga 220981 gacacaacaa aaaaagagaa ttttagacca atatccctga tgaacatcaa ggcaaaaatc 221041 ctcagtaaga tattggcaaa ctgaatccag tggcacatca aaaagcttat ccaccacaac 221101 caagttggct tcatacctgg gatgcaaggc tggttcaaca tatgcaaatc aataaacata 221161 atccatcaca tacacagaac caatgacaaa aaccacacga ttatttcaat agatgcagaa 221221 aaggcatttg acaaaattca acaccccttg atgctaaaaa ctctcaataa actaggtatc 221281 gatggaacgt atctcaaaat aataagagct atttatgaca aacccacagc caatatcata 221341 ctgattgggc aaaaactgga agcattccct ttgaaaacca gcacaagaca aggatgccct 221401 ctctcaccac tcctattcaa catagtgttg gaagttctgg ccagggcaat caggcaagag 221461 aaataaaggg tattcaaata ggaagaaagg aagtcagatt gtctctgttt gcagatgaca 221521 tgatggattg tatattaaga aaaccccatc gtctcagccc aaaatctcct taagctgata 221581 agcaacttca gcaaagtctc aggatacaaa atcaatgtgc aaaaatccca agcattccta 221641 tacaccaata acagacaaac agagagccaa atcatgagtc aactcccatt cacaattgcc 221701 actaagagaa taaaatacct aggaatacaa cttacaaggg atgtgaagga cctcttcaag 221761 gagaactaca aaccactgct caacaaaata aaaagaggac acaaacaaat ggaagaacat 221821 tccatgctca tggataggaa gaatcaatat cgtgaaaatg gccatactgc tcaaggtaat 221881 ctatagattc aatgccaacc ccatcaagct accaatgact ttcttcacag aattggaaaa 221941 aacttacttt aaacttcata 
tggaaccaaa aaggagccca tataaccaag acagtcctgg 222001 gcaagaagaa caaagctgga ggcatcacac tacctgactt taaactttac tgcaaggctg 222061 cggtaaccaa aacagcatgg tactggtacc aaaacagata tgtagaccag tggaatagaa 222121 cagagccctc agaaataaca ccacacatct accaccatct gatctttgac aaacctgaca 222181 aaaacaagaa atggggaaaa gattctgtat ttaataaatg gtgttgggaa aactggctcg 222241 ccatatgcag aaaactgaac ctggacccct tccttacacc ttaaacaaaa aatcaactca 222301 agatggacca aagacttaaa tgtaagacct aggaccataa aaatcctaga agaaaacctg 222361 ggcaatacca ttcaggacat aggcatgggc aaggacttta tgtctaaaac accaaaatcg 222421 atggcaacaa aagccaaaat tgacaaatgg aatctaatta agctaaagag cttctgcaca 222481 gcaaaagaaa ctatcatcag agtgaacagg caacctagag gatgggagaa actttttgca 222541 atctacccat ctgacaaagg gctaatatcc agaatctaca aaggacttac acaaatttac 222601 aagaaaaaaa caaccccatg aaaaagtgag tgaagggtat gaacagacac ttcttaaaag 222661 aagacattta tgcagccaac aaacatatga aaaatgctca ttttcactgg tcattagaga 222721 aatgcaaatc aatcaaaacc actatgagat accatctcat gccagtgaga atggcgatca 222781 ttggaaagcc aggaaacaac agatgctgga gagaatgtgg agaaatagga acgcttttac 222841 attgttggtg ggagcgtaaa ctagttcaac cattgtggaa gacagtgtgg cgattcctca 222901 aggatctaga accagaaata ccatttgacc cagcaatccc attactgggt atataccgaa 222961 ataattttaa atcactctac tataaagaca ccatgcatac gtatgtttat tgcggcactg 223021 ttcacaatag caaagactgg gaaccaaccc aaatgcccat cagtgatgga ctggataaag 223081 aaaatgtggc acatatacac catggaatac tatgcagcca taaaaaagaa tgagctcatg 223141 ttctttgcag ggacatggat gaagctggaa gctatcattc tcagcaaact aacacaggaa 223201 cagaaaacca aacaccacat gttctcactc ataaatggga gttgaacaat gagaacacat 223261 ggacacaggg aggggaacat cacacaccag ggcctgtcca gggggtcggg ggctaaggga 223321 gggatagcat taggagaaat gcctaatgta gatgacgggt tgacgggtcc agtaaaccat 223381 catggcacat gtatacctat gtaagaagct tccacgttct gcacatgtac cccagaattt 223441 aaagtataat aaaaaaaaaa atacaaaaat tagctgggcg tggcggtgtg cacctgtaat 223501 cccaggtact tgggaggctg aggcaggaga atcgcttgaa cccaggaggt ggaggttgca 223561 gtgagccaag attgcgccat tgcactccag cctgggtgac 
ggagcgagac tccatgtcaa 223621 accaaacaaa caaacaaaaa acttgggttt tagagtctat atcttggcat gtgcttctgt 223681 ggataccaca ggagtagaaa agacatacga ggatatgtgc actggccgtt agtgtagcat 223741 cttaaattgt tcttaaggct cttaaaactt ctgtcaggtt ttaaagcttc tgccaggaag 223801 taacatatat tacttctgct tctgctcaca cttcattggc cagaacaagc catatcgcca 223861 tgcccaactt caaaggagat ggagaagtgt acttcagtcc taggcccaga ctagggtgaa 223921 gcgagcaagg cactcagggg cagactacta acttctcatc accatcctca ataatgggtg 223981 gctgactcat tttattgatt tgaatgagta tggttaaatt tgagaaggac tatttcaaac 224041 attaaattca ttgaatgaat acactggtcg attcctgggc acatatacga ttctcttgat 224101 ttaaagcact tgtctggaaa cctttctcac ttactcaaat tgtgaaatta ttcacattga 224161 taaaaccttc tactctgggg atcagaataa tagatgttat tacagatgac ggttttattt 224221 gatcttctaa aactgtttgt catccattgg ttctaaaatt gttcctcagt ataatttgga 224281 gcagttaaaa cagcagaaaa tacataattg gaatgaaagc tcatttgttg ggatgctttt 224341 ctcttctagg tgaggtgcaa ttcctacaga ccagaagctg cccgagaaca agtccaaatt 224401 ggtgctcaca gtccccctca gtttaggtaa atgggcaaat gggtgacggc agtttttaac 224461 tgtacatctc cccccttcct cccccaccca gccatccaag ggctctacca gataggtgat 224521 ctgaaaaaaa tcccacattg tggacgtgtg atttccattt tgcagccaaa aaatatctgg 224581 ccacacggaa actaaatatc ttctttcatt agtagtaagc tggtttagtc agcactctct 224641 cacctgaatt tctgctgatg gtacctgtac atatctagtc tagtgctaca tcaagtgcca 224701 gtgagactat acaagactcc agaggtaatt cggaatccga atcatagctt tagtattaca 224761 tcccattatc ttctagtttt tgaaagagtc tcagtaaact gcattattgg acttactcaa 224821 ctgcttactt ttttgtccaa aaaaatagcc tttattgtct taggaaagga tgagatttga 224881 tgtactctgc cttaaaacac ctagcgctct ggtttatttt gttttaaaga atatcccctt 224941 taatacctct tgttctttca cacagtcaac cagcaaacat gaatgaatgg ccactttcta 225001 aatagctctg aacacaaagg aggatacaaa gtattattca gcaattataa tttatatcta 225061 gagttagaga gtctacattt tggctttcag gtgggatcat attttgaaac ggtagcttat 225121 acaaaaaagg aaaaggatta tgaattgcag tcttatttgt cagaatgtat aaaacattaa 225181 gtacaaataa aattaaaagt taatattact caccctgtac tcggtcgtac aggttgacag 225241 
aaattttcta tagtaccgga aagataagta tgatgagaaa cccaaaaacc ataataattt 225301 atatgtattc attctttgtt tagggcatta tatctggata tctttatatt taatgtttaa 225361 acagatccat tagcatattg caagaaagcc atctaaaatt tcttctatat gttaaaatct 225421 tattcttaag atcttgcctt agaaatgctt cagaaattga ctataaatta cccactagca 225481 taaaaatcta atcactatgt tggaaatttc tcccccactg ggctaaactg tcatgaaggc 225541 tgacggtctc gaacgtacta aaatgtgttc ccactgacca atggtgcaga tgaactcaca 225601 cagtagatta attactcaac agataagttt tcagaaactt atttccttta actagttgcc 225661 tgtaatgttg aattttacag tgcatgctta ctaattatca actgtattct atctttaaca 225721 cttgcctagg gtgaatcaca ctatgaaata tgcatcccta cccatatatt cagctaaaac 225781 aacaagaggt tatgaggatt ttttggatac agtccattac atttatgcaa atgcccgtag 225841 gtaacaaagt cattagggaa gaaatagtca gggactgaga atgctgagtc tctagtcaac 225901 cctttgaaca cggagttgag taagaaatag aaggggggaa gttttttgtg tgtgtgtgtg 225961 ctggcttgtt actgaggtct gcagtaacta tagattttga gtgcagcaga aatcacacaa 226021 tggcaatacc acttccaaaa ggcagctaca atcaaagtgc tgagtaggaa tccagtcttt 226081 gctgtagttg gaagcctgta agaatgtgac atattggagc taactccagt gaagaattat 226141 cagtaataac tagataccat tcattaattg actgattgtt gcattaatca aattctgggc 226201 aactgtgagc ttcccatacc taacactaat tcagagaaaa tgttttgcag attgaatgaa 226261 ttctttaaaa atagctggaa atgcaacaac atggatgact cttatagaca ttatattcag 226321 tgaaagaaat caaagaccaa acagtacata ccatggattc cattcacatg aagttcaaaa 226381 tcaggcaaag ctatggcaat agaagtcaga atagtaggtt atcttgggag atgctgttgg 226441 tatggaacag gcaggagcct tctgggtgct ggaatgtctt gatctggcag atagtaatat 226501 gggcatatac acacaaaaac tggcaagtcc tacgtttaat attttgcata ctttatgtca 226561 gttatacttt ccataaagtt tttaaaaagt tagaaaaact aatattggtg atgatacagt 226621 tttaaaagaa ttttgaagac tgttcgaaga gtaatagggg catgtgcttg gcattcagga 226681 gatgctggat ctagtctgtt aattttttct ggaaatttac ttaatcattt tgggctttag 226741 ttgtctccct gtaaacagat taaaagaatg ccaaccttct ttctagcaat attctgctac 226801 agttttatac agaacctgtt gatgtgcaag aattatatga catatgcttt gacatatcgt 226861 ttttcagggt caatggtgca 
attagtaact ttgaagaatt ccagaaagct tttaactgtc 226921 cacccaattc cacgatgaac agaggcatgg actcctgccg actctggtag ctgggacgct 226981 ggtttatggc atcctgagac agttgcacag tgccagcgga ggctgcactg agccttcatc 227041 gcccattgct ttaggcctgg agactttcat ttttagtgca ttttcattat ttgggtaggt 227101 gacctgcttg gatctagaca gcatctgttc aaagttgtag ggcttataaa gtggaatata 227161 agaatgaact aagtatgttt ctttagaaaa tcaaaccaac aaaaataaat ccctaggcta 227221 cttttgttaa aatgctatct gctaaattgt tgctattgtc cattttaggg tgttggccac 227281 agttatgtgg tacatagcat ttcaatctat agctttgtgg gaagcctaaa ttgatgtatc 227341 ctttaaaagt tcaatctatt aaacttgtag cctctcaatg atgaagacat gtgcatgaat 227401 acctgggagg gcttgctgct ctgtggaatg cagtgcacgt ttttatggac agtaattaca 227461 cagcattcct caactgtaag gtagttaaat tagaaatatg tttgatctcc attttacact 227521 ttttgatgaa gttcttgcct tcagctttgg agtctgttgg aatgaaactg tgtctcacag 227581 cttgcatctg atgtagcaga ttgccctgtt caagtcagca tgaacaaagc tcgcagaaag 227641 gctactaggt aacggatttg agtttattca gtagaaaaaa agggaatgag gacatatctt 227701 taggtcccag aagcagcaaa acctcctagt tacataacag cagggaaact aggagatgaa 227761 atctccaaaa ttttgtccag tctgtatgaa ctgcagtgat tgtctgtctg ctagacactt 227821 ttttttagcc taattcttgt tcgtttcatt accttcatat gtgtgcattg tactggtttt 227881 gaaatgagca gacattgaga aaacaatttt ctgattcccc attaggaaca tatgattttc 227941 atttcccttt agtgcaaagg aacaaacatc acattgagga gggagaagtg tacccttctt 228001 tttcagctta atgttacctc tctttaaaaa aaaaacattt ttgaaaaatt tcttgactga 228061 tgacattcta gaatttattg attcaattta aaattccctg catagtccaa ataaacataa 228121 tttttactct tcaagtcatc taaagagttg tgagaaagag cagtactatg tgttttatcc 228181 ttagagcatt atccttctaa gtgaactcct ttcttattac tgagcgtgtg tgtgtgtgtg 228241 tgtgtgtgtg tgtgtgtagg ataaaattgt aatggcctta aagttgttac ctggcatttt 228301 tggtttctta aaataggcat ccccctactc cctttacatt attcattaga catgaggtgc 228361 tgggttaagt gggccaatag gaaaatctca aaagcaaaat gacaaattca gattaaccac 228421 aggaagccat tgtgtgtccc tgtgatttgt gattctttta aaaaaaaaaa aaatatatat 228481 atatatatat atatatatat atatgtatat ctacctaagt 
gtgtgccatg tccttttgta 228541 gccatgggaa cgtttgaatg gagaaaaatg gaggggcacc atcatgagga aacaattata 228601 ctgcattagt cagaggagag cgattttccc tttgacttct tcctctctga atggataaac 228661 ttgtgaaggg cagctacctt gtcctcctct taacctggct taatgtgacc ttagtttgta 228721 tttccaaaaa caagcagtat taggttggaa ttgtatgtgt cttttctctg gaggaagcat 228781 tctgacttgg tttagacata taagatactc tgttttattc ctggacttgg gtgtttaaaa 228841 tcagctgagg tttatcatgc ttatatgaat agtgtttccc ttctagattt cctttataaa 228901 ctccaaataa acacattcct gcaggtcaga gaaagtgttt gataaaagca tagaattatg 228961 gctttggaaa gacaaactaa gtcacttgtt ggtttagcaa aactggttgg tgattagttg 229021 gggccctcac agtttctaga tgagaagctt accttatcct gattaacttt acttgctgta 229081 actgttatac cgaaaattgt aggtaggatg tccattactc aaacaaagca gaacaaaaca 229141 acaaaccaaa agcagcaata ccaaatcatc actactagct agaacaatga gaaggattga 229201 atttccaaat acatgcttat gtttaactca actcttccaa ttacattctc attgtccttt 229261 ctcttggata ccgattgcta agataaacag acactgatta agtggtagaa agagaaaggt 229321 aaaatgcaac tggaagggaa agaaaaaagt agggagaagc tggcctatgg atgtggcttt 229381 gaagagaaga caggattttg gcattccttg ctaagcacgt tcaatggagg agagccatga 229441 agttttacaa aaataattat ggtaaattaa aaggatatta gatattggat ctcagactag 229501 ggctgctata atacagttct ctgattcgca caccagatgt tctgccagcg aactgttgtc 229561 tgggcattgg aaggagttag gttttttgtg tttgtgaaaa tctcctggat tagtaaaaaa 229621 attttctaga tataaagtgg ttttatttgg gctcatgacc attgcccaaa gtgtactgag 229681 cttttcacta agtgctcagc attggcctcc tgttctcatt tattcatttt atgagatgga 229741 cttggacatc actaatgcaa acacaggatt cataaattat tctgaagttt atttagtgcc 229801 ttgcctcaat ggagcttaaa gcatagcaca taaattttat cttatttagc cctggagcat 229861 cactaagaag cagctaggaa ctatggcttg ccagacagat tatgcttgtg ccagagaagt 229921 ggagaagcct gaaatccctt gtcaagtatt ccacaggaag gaaaggtagt ctttgggctt 229981 tggtctactg acctgtggat taaactctat cccttagatt atcccatttc taaatacatt 230041 tattcaatca gtatttctat aatcacttag catttgatag gcttcttcct tgatacattt 230101 aacataacgc ctcagttgca tattacatta tggaagaata tacgattgct tcatataaca 230161 
acattctcat caccagtgac agtgtcctac tggtttgaag aaagatattc ttggattatg 230221 tgttttatat ctaaatggtt attctgagac tggttgtatt ttcttctcct actcgtgaaa 230281 cgatgtgaac aagcatttta aggacaataa aaagttagct ggttgaaggc tttctgtgtc 230341 tttctttttt aaaaatatta aaatctatta gatctgttaa catattccgt cctgttttac 230401 acaacatttg ccaaaaaata gtgtacaata actatactta ctagctttct tggcttagaa 230461 aagtgagatt tgaatgcagt aactttctcc atagacaaga aggtatttaa gaggttgtca 230521 ttatattttc ttccagaatg taagtcagca gatacggttt attgcctttg taataatatt 230581 tctatagaga aagtgatgcc tctgaggaaa ttatcagcga tgacagggga tagggctgat 230641 ctcacaaaag tgacaatgaa catcacagtt actatcaaat caaacaatct ctgtgttatc 230701 atcattcttt aacaatccag tcaatttata gttattcatt cagcacacag taactgagca 230761 cccattatat ttaaatcacc aaactagcta tatttaggtg gataatatac agtgtctggg 230821 cttcagtgaa cagtctggtc taacatggga gatggatata taaactgttt caattaccct 230881 actgagtgat tatgatatat aagtaaatca ggttgtggtt tggaacacag agaaagaatt 230941 cttcaacttt tgtctggtgg atactattgg tgtttcctct gcgcagatcc cagcaaattc 231001 cttttactgg gtggccttat ctcatcttcg gtctgctttt gtttctaatt ggctgtggct 231061 ctctaggatg gctgccctca tgctactgga gcctctttgc ccaatgcact ttcagagaac 231121 tggaagtgcc tcgtgaccct tagacagtga ctgagtggta tgggactccg catgcctgac 231181 tacctcccgg agttcagaca aatccagagg catacaatag aatttatact gatttctctg 231241 tggtgtcttt cttttagcat tatgctgaca gtgcaaattg agtcccctgt tgattaaaca 231301 cctttctggc actgatatgt tctaagatct ggaaaaggcc gggtgcagtg gcttacgcct 231361 gtaatcccag cactttggga ggccgaggtg ggtggatcac gaggtcagga gatcgagacc 231421 atcttggcta acacagtgaa accccgtctc tactaaaaat acaaaaaatt agccgggcgt 231481 ggtggcggat gcctgtagtc ccagctactt gggaggctga ggcaggagaa tggcgtgaac 231541 ccgggaggcg gagcttgcag tgagtcaaga tcacgccact gcactccagc ctgggcgaca 231601 gagcgagact ccgtctcaaa aaaaaaaaaa aaaaaagatc tggaaaagta atggtttgta 231661 ggatcctagg tcagcccttc tgtgtagtag gtcatatcat catcatgtac aggaaatgga 231721 catcagatgg cacagactgt gggtaataag tagctgagta tggcctgttt gtgcccttct 231781 tggagagttt atcctagttt 
ctgagctgat ccatgaaatg ggaatttttg tgtataggga 231841 agtggcacaa ggaaagatgc atataaatga cagtgttaca tatgtatcta agtggacctg 231901 atcaaaattt gcaagaggct gtttccaatt aggtttaaga tatttgcatg tgcctcaagt 231961 tcactcgctt ttttttatta agatcacact agctgtagaa acagataatg gcttaaccca 232021 atttatttct tgttttcatt acatacataa tggatattaa ttatcagtga gtttctctcc 232081 tctaactggt gactcaggga cccagcctcc ttccatatta tggttctgcc attttcaaca 232141 catgatttaa tgtatgctca tctataagaa gccagtggag ggaaaaaggc catggaagat 232201 actgagtgga agatttgcaa ttgtccttga agttgtctac attacttctg ttcatattcc 232261 attggccaaa tctaactgca ggggaggctg ggacatgtag tctagctgtg tgcccagaat 232321 gaagagcaaa tgggtttagt tctcagggag cattcccagg cacaccttat aaagtgacgt 232381 atggggcagt aactttgagt tcttctgaat ttagaaaatt gcttttgagc acggcacaat 232441 gaaaactgaa aatgccattg gtagcctgca gtttattctg taggagtctt tttaattaac 232501 agcttaattg gataatttaa tccccgttag aattggagat atgcaaatga aaatgagata 232561 ctccctatga tctgacaata ctgtgcttta ataaacacac caaagtgagc ctggaaatga 232621 gcaccacaaa agcattacca tgatttgtga tcagtgaaag aaaacctgat ctataatttt 232681 aatcattcct aaatggttta gatacatgtt tcatttgaat gtgtattttg ttttgttaat 232741 tggttcaatt agaaaggtaa ctgaacataa gaatctgtgg caaagccatg aatttataat 232801 atgttctaag cttttaaaat atataccatt gagaaagtaa ctaaaatacc ctacaaaata 232861 ctgttgaaag tgcagtattt tccagtattt tcagataatt tatatacaat agtttttact 232921 tctccattga ttatgagatg aaaaatacta cttaccatat tcttgttata tgctttctcc 232981 acttgaatga gaaattgagg ctcagagtcc aagaaatgaa gatgagtgtc catgtacaag 233041 ccacaaaagt gctgttctaa ttaattctgg ccggagactc cactttacag agaaagaaaa 233101 actgagtaag ttaaaatccc agtgtaaaaa gctacctaaa atattgagta tcattttccc 233161 tgccagtttg gctgaaatgt ctgcaaatgt cctagacaaa aaaagtggta tagtttacca 233221 tttggaagta cactgtgtta atttccaaat tgatttagaa gtcagtggaa tcccagtact 233281 tggataacag tcagacagcc ctcccacccc agtgacattt gatgaatgtc aaaaatagcc 233341 caaagcatat ttattccaaa gttgaccaga ttttcaattg gcatttgtat tttttttttt 233401 ttggcctaat aataatcgta gtgaaaatgg cttctgtttg 
ttgacaccca ctatgtccta 233461 ggcactctgc tggggcatgc ctgctttatg gccagtcctc acaacaatct tgcacatagt 233521 tgttgctatt ccaacttcac aggtgagaaa cacgtgactt gtcaagatca cattagtaac 233581 tgcaagagcc agaaacagtc agttttgtct aacttcagag cccattctcg aatgttctcc 233641 tgggatatag aacaatgtta acatgattgt aacctgtgac ttactgaaga cttcaggtta 233701 tacttctata gtaatctgtt agaaaataat gtaaagatag gagtcagttt acaataattt 233761 atacttactg gtagtttcaa gtccttcaga aaggattcta gatactgctc tttactgata 233821 atgcttgtta ccatttatta tctataacgt gccaaacatg tactaaatac cttatttata 233881 ttatctaatt taattatcac aacaactggg atacatatta ttatctctac ttcacaggta 233941 agaaacctga agcttggaga ggttaatgac ttcattcatg atcatttagc tagcagctgg 234001 cagcatggat ttgaatccag tttggtctaa gtgctttctt aggcaaacat taactatggt 234061 aactcttaag tcctggatgt ccaggctggg gcatcccctc agttcagctg gcagacctcc 234121 ctgaaatcta cctgtgatca gccccatgat taccctcacc ctgtctgttc agtccctgaa 234181 accgaaaagt ttgcaggtta ctttctccag acaacaaaat ctttgtggcc ctgcctatgg 234241 gtagttgaga gactttcaag cagtcacctt ggttttgacc ctatgatgca ttctttttaa 234301 aattttttta agatttgaca acactgtgtg tatttatcat gtacaacacg atgttctgaa 234361 gtatatgtac acattatgga atggttaaat ctcactaatt aacaattgca ttaccgcatg 234421 cagttatttt ttgtggggaa agcacttaac attcactctc tctgcatttt tcaagaacat 234481 aataaatcat cattaactac agtcaccttg ctgcacaata gatcttttga attaattcct 234541 tccatctaac tgtaattatg tatcctttga acaacctctc cctatcttcc ccacccctca 234601 gtctctggca gccaccattc tactctctac gtctatgaga acaacttttg agatttcaca 234661 ggagtgacat catgcagtat ttatcttttt gttgtctggc ttatttcact taacataatg 234721 tgctccaggt tcatccatgt tgtcacatgt aacagaattt cattattatt attattatta 234781 ttattttgag acggagtctc gctctgtcgc ccaggctgga gtgcagtggc gcgatctcgg 234841 ctcactgcaa gctccgcctc ccgggttcag gccattctcc tgcctcagcc tcccgagtag 234901 ctgggactac aggcgcccac agccacgccc ggctaatttt ttgtattttt agtagagacg 234961 gggtttcacc gtgttagcca ggatggtctc gatcttctga cctcgtgatc cgcccgcctc 235021 ggcctcccga agtgctggga ttacaggcgt gagccaccgc gcccggccca gaatttcatt 235081 
attttttatg gctaaataat ttgcccttgt gtatatatac cacattcttc ctcccttgat 235141 ggacgctaag gttgattcta cctcttggct attgtgaaca ctactgcaat aaacatggga 235201 gtgcagatat ctcttcaaca tactcatctc atttcctttg ggtatatact cagtagtggg 235261 atttctggat catatggtag ttctagtttt aatattttaa ggaacttcca tactgttttc 235321 cataatggct gtactaattt acattcccac caacagtgtg taagggttcc cttttctcca 235381 tatccttgcc agcacttgta gctttatcat gcattctgac atttgctgtt acctagagct 235441 tccaaatcct gaggcattct ggtattctac agagtatttc agtttgcttc tagtaggggg 235501 gtctaatatt tttgtctatg taggtttata cctttcaaaa gatatatttt tagctttttg 235561 aagtgagagg gtgagcagat ctgcatccga gtctcctaca ttcaactgga agtctacatt 235621 taatttttta aatggggata tttaataaaa atttatttgg aaagaaatgc tgctatagat 235681 acagttctat atggaattat tgcctttccc cggtaatgtc tgtaactaaa gagtctaaag 235741 tttcctttaa actgttgaat actaggaggt tttattttac acagaaaccc atcctatttt 235801 ttcttagtta atttaataaa ataaacacac acgaacattt ggtatttttc atttctccta 235861 aagacataga acttagcctt tggagaattc atcagagatt gaaggaacct atctctgtgt 235921 accttccgaa gttgcctaga gtggatttgt cacaaatgct gtggaggaaa tttttacata 235981 tcaacaaggg ctaagctcgc tccttggaga ttcacatttc aactctcatt tgcacacaca 236041 tagagtctct ggtgcagatt gagtttgagt taatgcacat agcttgttgg gaagaaatca 236101 atcaaaaacc tttttaagga taaatggttc tttaggaaca gaaagttcag tatctcaatt 236161 ttattataag cattgaataa aaaccatttt aaaacatgca gcatattaaa actacaaaag 236221 gcgctaataa taaaggtatt ccctcaccta ggtcagttgg tgatatgtac taagtgatat 236281 ttagaataag gaaaaaaatc aactgtcatt taatgtctgc ctacaacaca tgaggcagag 236341 tgctagttgt catattgcaa tttgagtaat ccaggactct ggtgacttta gcccaaaact 236401 tcacgaacag tattataaca gccttgtcat tcctacgctt aagaaacctt tagcgatgtt 236461 tttcatggtg tttacaataa catcgaaaca actttccctg acttaaaggg cctacaggac 236521 ctggctctct gtttctgctg tcacttcttc atagcctttt tctagctcca ttggtggtct 236581 ttctgtttct gtaaatttac taaactttct cctaccttct cagtctcagg gcctttgctg 236641 ctttgcctga aatgcttttt gtatgtcttt cacataactt cattgaaatt aggcccctta 236701 actatgtagt ctatcacctc 
acctgcctca caccttcata gatcttacca ctgttgctag 236761 ttttctcctt ccttcctccc tccctccttc cctctctctc ttctttccta ttgtttgtct 236821 tcctccactg tcttgtaagt cccatgatat taaaactttc tccggggcta ggcatgttga 236881 cccgtacctg taatcccagc actttgggag gccgaggtgg gagaattgct tgaggccagg 236941 agtttgagac cagactgggc aatatactga gacccagctt ctaaaaaaaa aaaaaaaaaa 237001 aaaaattagc ttggcattgt ggagcatacc tgtagtccta gctactcagg aagctgaagc 237061 tggaggatag cttgagatca ggaggttgaa gctgcagtga ggtatgattg tgccactgca 237121 ctccagcctg gatgacacag tgagaccccg tctctaaacc aaaacaaaac aaaaaaaata 237181 agcctttctt ttgtttatta ctatatttct agcatcacca ctatatttcc agcactcaga 237241 acagggtctg aaaaataata ggcccttgaa tacacattta gtaaatgtgt tctttcattt 237301 cacccataaa tattgatcat ttactacgtg cttggcactc tttaggggct gctgaggata 237361 ttgcagtaaa ccaaacagat agaaggtctt gcttccatag agcttatact ctaggagaca 237421 gacaataaaa taaataagtg aaacatgata tagtagatgg taatacttgt tatggaggag 237481 agaagaggat ataatatgtg gatgtcagag tgttgcaaat ttaaatagca cggtcatgaa 237541 gatgtttgag caagaaccta tcttggagaa gagccttctt ggcagaggca ggagtgagtc 237601 caaagactta gtggtaggag catcccagga catttgtaga agagcaagga aactagtgtg 237661 gctggagcag agcgagcaag gaagagtgca gaatgagatg atgtcaggat gaaaacaaca 237721 acattcagct tgtctatggt catataggct gctgcaagaa ctcttcttct cactctgtgt 237781 gagatgggaa gccactggag aattttgagc agaaaagtga caagatccaa cctatgattt 237841 ccatgatgac tctggctgtg ttgagaatag accgtggctg aacaggggaa gaagcatgaa 237901 gaccagttag gaagtggttg ccatcatccc agcaagaaat gatggtggcc gtgtcgagga 237961 gaggagcagg ggaggaaatg agtgaatgaa cgataataaa atttaagaaa gaaacgctcc 238021 atcatcctgt tactttgaaa tttccgccta tttcttttgt atttgtgttt ttccatagcc 238081 ccatgtataa atatatggca taatttttat tcagaggata atatgatttt gaatcttttt 238141 catctaatag catcggcctg cgtccatagg tgcttcaaaa tctttttttt tttttttttg 238201 agatggagtc tcactctgtt gcccaggctg gatacagtgg tgcgatctcg tctcactgca 238261 accttcgcct tccgggttca agtgattctc ttgcctcacc ctcctgagta gctgggatta 238321 tgggcgcaag ccaccacgcc cggctaattt ttgtattttt 
agtagagatg gggtttcacc 238381 atgttggcca ggctggtctg gaactcctga cctcaagtga tccacccacc tgggcctccc 238441 aaggtgctgg gattacaggc atgagccacc acacctggca aaatcttcca tttttaatgg 238501 cagaatgata ttctttagag tttctgtgac ataatttgtt caattatttc tatttagaaa 238561 tttattattg atttcattta tcactattgt aaataataga gattaaaatc ttgatatgta 238621 gatttatttt ctttcctata ggttgtttcc ttaaaataaa tacttaagag taaatttact 238681 tcatcaaagg atgcaggtat atgctgttgt ggcttttgaa atgcactaac aaattgcgtt 238741 ctaacaaggt tgtatgcatg cgggtaccta ccggttttac aatagaactg gatcagaagt 238801 aggtcctgct cgtaagaaag ttacaatata atgtgggaag aagatgtgta aaccatcagc 238861 tgtaataaaa agcacaatga agaacagtgt aagaaggtac cacatttaga aaaaccttat 238921 cagctttagg tgttattaat ttcattttta gtaacttttg cattcattaa ataagtaagg 238981 ctgaatgcaa tggctcacgc ctgtaatccc agcactttga gaggccgagg caggtggatt 239041 gcttgatccc aggagttcaa gaccagcctg ggcaacatgg tgaaacccca tctctaccaa 239101 aaaacataca aattagcagg gcatagaatg tgcacctgta gtcccagcta ctggagaggc 239161 tgaggaggga ggattacttg agcccggggg gttgagactg cagtgagcca tgattgcgca 239221 actgcactcc atcctgagtg acagtgagaa cctgaagtaa gtaagtaaat aagtaaataa 239281 ataaagcgtg actggttgtt taattttacc aatttgcttt tttgttttta cctctcttga 239341 tataaattac ccacatatca aggcccttca cccattaggc tattaagatt ttgatttttt 239401 ctcattctcc taaaaagaca catgcacacg tatgtttatt gcagcactac tcgcaacagc 239461 aaagacttgg aagcaaccca aatgtccatc agtgatagac tggataaaga aaatgtggca 239521 catatacacc atggaacact atgcagccat aaaaaggatg aattaatgtc ctttgcaggg 239581 acatggatga aactggaaac catcattctc agcaaactaa cacaagaaca gaaaatcaaa 239641 cgccacatgt tctcactcat aggtgggagg tgaacgatga gaacacaggg gcacaggcac 239701 aggcagggaa acatcgtgca ccagggcctg tcgggggtgg ggggctaggg gagggataac 239761 attaggagaa agagctaatg tagatgacgg gttgatgggt gtagcaaacc accatggcac 239821 gtgtatacct atgtaacaaa cctgcaagtt ctgcacatgt atcccagaac ttaaagtata 239881 ataataaaaa agattttgat ttttttctta tcaattaggt aagtttttgt tgcttattat 239941 tcatagacac ttacttagct agaaaacaat tctgctactc tctgatcctc atctttgtac 240001 
atttgtaagt agtggcgttc ctttgtaagg taatgtagat gggcaaaata gttcacataa 240061 tttgtcttac tgaaataaac tttatacaga gaagtacaaa agagttaaca caaatttagg 240121 aggtatacaa tgttaagtgt acatcagtta accgaacaaa tatttattga agcacttaat 240181 agatattatt ctagggttgg gaatatgatg aataaggttg tagtttccaa ttgtttaaca 240241 atttgaaaat tactcataag gaatagtaga tatataaata aatttcagca gtgtaatata 240301 taattgaata gaggtgaaca cagagttctt tggaaataaa actggataaa agtatttata 240361 gagctaagca atgaaaaaat tctaatggcc tcagctgtga aataaaaata atacaaaatg 240421 aataaggcag acaactcaat agaaaattgg gcagacgact tgaactaacc acaaaagagg 240481 atattccaaa ttgccaacaa atagatggaa aggtgcttaa cttcactagt catgaaggaa 240541 atgcaaattt aagccacaac atgattcaac tatttacccc ctcccctcta gaatggctaa 240601 aatcaaaaag catttagaca agcagaactc tcatagactg ctggagggag tatacatttg 240661 tatggccacc ttggaaaact atttaacagt atcttctaaa gctgaatata cttgtatcca 240721 atgacccagc aatccactcc taggtattaa gttgtcatct tagtaggtaa acaatggttg 240781 aatattggca attttatatt gcttaaccta acgtataccc agcagaaatg cacacctatg 240841 ttcaccaaaa gatgtggaag aatttcatag aagcactata agagtcccaa aatagaaagt 240901 atccaaatgc ccatcaacaa cagagaaggt aaataatcga atactgatct acaataacaa 240961 gaacaaacaa actgcaacta catgcaacaa gatgagtgtt tctcacaaac ataagttgga 241021 tgaaaaaagc caaacacaag agagaacata cttcatgatt ctacttacat aaaatataaa 241081 aatatgcaaa acgaatctgt tagaagtcag aggagtggtt actccttggg ggttaggaac 241141 tggaagggtt cacaaggtgg ggcttttgaa gttctggtaa tgttgggttt cttgatctgg 241201 atgcttttta cattaatgat ttgtgcactt ttgtatatgt atgtgtattg catcaaaaag 241261 ttttaaaaaa ataacattaa aataaatata agaaagtcca agtaagtcct cattttttcc 241321 ccacaacaat cactttgatg cttttctagg gaaatactga atatcaaatt cttttaaaaa 241381 gcttttctcc cctcattttt ggagcccttt ttttgctggc aatggtaggt aatcctgtac 241441 ttgttgtagg aacacaggta agagactaaa ctgggtgtgg ggtcctggct gggaagcttt 241501 cctggaagag atgaagctgg acctgagttt tgtagcatcc atcagagaca gccagggaag 241561 caagtggatg gcagacagtt ccagggagag aggccaggct actcgaaagc gaaaaggtat 241621 ggtagggaaa gaaagagaac 
ccactgagag ggagtggaca gagcttgctg gagctagagc 241681 ttgaagagta gctgaggagg ggttaatgat tgataagcaa gagagtttgg caaaggtcag 241741 acagagaaag atcttaaagg gtcatgatac tgagcttgga cttaatcttg taggcatctg 241801 ggaaccactg accaatttaa atcaaacagg tgacacaatt aagtttctac ttaataaaga 241861 gtaagatggt gatggtgtga gaaatacttg aagagggtta agaagactgg agcctggggg 241921 accagttagg aggccattgc agtaatccat gggagatacg gcaggctggt ggtatgggat 241981 ggagaagtag ggatagacat gagagctatt gaggagacag ttgttctgac aggactcagt 242041 aattgatggg tttgaaattg acatgtaggc atggtacttt tttttctagt aaaaaaaaat 242101 tggtagaacc ttaattatga taaatgataa aacacatgtg cctttgtcct tattcccccc 242161 tttaacacct ttttacaggt gtatacatca aaggatgttg ccatgacgaa cgagagaacg 242221 cttaccaatg atttcccatt gctgcttgga atgaaaaccc aatgacacat ttcaaaattt 242281 tgcagtggta aatcatgtag gtcatcttat tgcactacca tgcaaacttc ttgattttgc 242341 caaatcagtc agttgagagg aaaaaggaaa aaagtgttac ctatatatac tgtcatatac 242401 tattgtaaac aaaaaacaat ccgtatatag gtttatatga ttatacagta caaaactatg 242461 tacacacatg aacaaacata catatatgta tatacacact agaagcttaa acagctatgt 242521 gacacttgag atttttatat acagacttta ccagggtata caaaaaataa cataaatttt 242581 tacttaattt tccctgtctt tttaatggtt ccatttgggt gggggaggat ccagaggatt 242641 ttttcctcac tttaattacc tcattgcatt tccttctttt cgattttggt ttttataagc 242701 gggaacactg tgttaatcca ttctaattaa aacccagcca tgcaaccctt cagttaaata 242761 aacaattttt gagtctgcct aggatatcat ttagtcagaa attgtctcta tggctttgga 242821 aatga // LOCUS HSPLAPL 1282 bp DNA PRI 26-APR-1993 DEFINITION Human placental alkaline phosphatase-like gene 5' region. ACCESSION X07247 NID g35509 KEYWORDS alkaline phosphatase-like gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1282) AUTHORS Kam,W. 
TITLE Direct Submission JOURNAL Submitted (25-MAR-1988) Kam W., University of California, Box 0724, UCSF, San Francisco, CA 94143, USA REFERENCE 2 (bases 20 to 1187) AUTHORS Shen,L.P., Liu,H., Kan,Y.W. and Kam,W. TITLE 5' nucleotide sequence of a putative human placental alkaline phosphatase-like gene JOURNAL Nucleic Acids Res. 16 (12), 5694 (1988) MEDLINE 88262578 FEATURES Location/Qualifiers source 1..1282 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="sperm cells" /clone_lib="EMBL3" CDS join(336..402,482..598,711..826,1017..1250) /codon_start=1 /product="PLAP-like protein" /db_xref="PID:g35510" /db_xref="SWISS-PROT:P10696" /translation="MQGPWVLLLLGLRLQLSLGIIPVEEENPDFWNRQAAEALGAAKK LQPAQTAAKNLIIFLGDGMGVSTVTAARILKGQKKDKLGPETFLAMDRFPYVALSKTY SVDKHVPDSGATATAYLCGVKGNFQTIGLSAAARFNQCNTTRGNEVISVMNRAKKRGD VGPRVGSGPGDRPLSHI" exon <336..402 /number=1 intron 403..481 /number=1 exon 482..598 /number=2 intron 599..710 /number=2 exon 711..826 /number=3 intron 827..1016 /number=3 exon 1017..>1250 /number=4 BASE COUNT 272 a 401 c 377 g 232 t ORIGIN 1 gagtacaggt gtcttgtcac acagtgagca gggttgggga ggccctcggc ggagatcgca 61 cactcgacta tacccaaaat cccacccttc cctgggacac ctggtcccac cctaagctgc 121 ctttctcaga ccccagcccc agcccagccc agccacaccc tgccactccc ttcagccagt 181 gtggcttcag gtcaagaggc tgggcggggt caaggtggta acaaggggag gggccaggac 241 acagttttcc ctgatttaaa cccaggcagc ctggagtgca gctcatactc catacctgga 301 tttccgcctc gccgctctcc cgacagcttc cagacatgca ggggccctgg gtgctgctcc 361 tgctgggcct gaggctacag ctctccctgg gcatcatccc aggtaatgag gctcccccag 421 ctgcccctac agggcacccc ccagcccagg ctgacctgat ctttgctctc cccctggcca 481 gttgaggagg agaacccgga cttctggaac cgccaggcag ccgaggccct gggtgccgcc 541 aagaagctgc agcctgcaca gacagccgcc aagaacctca tcatcttcct gggtgacggt 601 gagtgagcca ggccttccag ccccgcagcc ctcacagccc cggcgcccgg accctcagtg 661 gttccaggac agccctgggg agcaagcctc acacacttct gctccttcag ggatgggggt 721 gtctacggtg acagctgcca ggatcctaaa agggcagaag aaggacaaac tggggcctga 781 gaccttcctg gccatggacc 
gcttcccgta cgtggctctg tccaaggtaa gtgctgggct 841 accttagagt cctccaagca gagaagggga atctggctat ggagtgtggt aggagggagg 901 gaccctaaac agctggggct ccaataagga gctggaggca gttggaatcc cagaggacag 961 agatcagggt cttgtttgtc gtccccagag aagagctcag agtgtctctg tcccagacat 1021 acagtgtaga caagcatgtg ccagacagtg gagccacagc cacggcctac ctgtgcgggg 1081 tcaagggcaa cttccagacc attggcttga gtgcagccgc ccgctttaac cagtgcaaca 1141 cgacacgcgg caacgaggtc atctccgtga tgaatcgggc caagaaacga ggtgacgtgg 1201 ggcctcgtgt ggggtcaggg ccaggtgaca gacctctatc tcatatctga cctctatacc 1261 tcaggaatct gtggagtggt ac // LOCUS HSPNMTB 3799 bp DNA PRI 24-APR-1993 DEFINITION Human gene for phenylethanolamine N-methylase (PNMT) (EC 2.1.1.28). ACCESSION X52730 NID g35560 KEYWORDS methylase; phenylethanolamide N-methylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3799) AUTHORS Nagatsu,T. TITLE Direct Submission JOURNAL Submitted (19-APR-1990) Nagatsu T., Department of Biochemistry, Nagoya University School of Medicine, Nagoya 466, Japan REFERENCE 2 (bases 1 to 3799) AUTHORS Sasaoka,T., Kaneda,N., Kurosawa,Y., Fujita,K. and Nagatsu,T. TITLE Human phenylethanolamine n-methyltranserase gene: existence of two types of mRNA with different transcription initiation sites JOURNAL Neurochem. Int. 15, 555-565 (1989) COMMENT See also for mRNA sequence and for overlapping genomic sequence. Data kindly reviewed (20-AUG-1990) by Nagatsu T. FEATURES Location/Qualifiers source 1..3799 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="ghPNMT1101" /chromosome="17" misc_feature 814..829 /note="pot. glucocorticoid response element" misc_feature 998..1012 /note="pot. glucocorticoid response element" protein_bind 1027..1037 /note="pot. Sp1 binding site" /bound_moiety="Sp1" protein_bind 1395..1404 /note="pot. Sp1 binding site" /bound_moiety="Sp1" protein_bind 1407..1416 /note="pot. 
Sp1 binding site" /bound_moiety="Sp1" TATA_signal 1640..1648 exon 1670..1894 /number=1 mRNA join(1670..1894,2846..3053,3168..3688) CDS join(1693..1894,2846..3053,3168..3606) /EC_number="2.1.1.28" /codon_start=1 /product="phenylethanolamine n-methyltransferase" /db_xref="PID:g296668" /db_xref="SWISS-PROT:P11086" /translation="MSGADRSPNAGAAPDSAPGQAAVASAYQRFEPRAYLRNNYAPPR GDLCNPNGVGPWKLRCLAQTFATGEVSGRTLIDIGSGPTVYQLLSACSHFEDITMTDF LEVNRQELGRWLQEEPGAFNWSMYSQHACLIEGKGECWQDKERQLRARVKRVLPIDVH QPQPLGAGSPAPLPADALVSAFCLEAVSPDLASFQRALDHITTLLRPGGHLLLIGALE ESWYLAGEARLTVVPVSEEEVREALVRSGYKVRDLRTYIMPAHLQTGVDDVKGVFFAW AQKVGL" intron 1895..2845 /number=1 misc_feature 2243..2597 /note="Alu sequence" misc_feature 2392..2406 /note="pot. glucocorticoid response element" exon 2846..3053 /number=2 intron 3054..3167 /number=2 mRNA 3168..3688 /note="exon 3" polyA_signal 3671..3676 BASE COUNT 733 a 1024 c 1363 g 679 t ORIGIN 1 ctggcactgg gtggtaacca gcaagccagc tggcatccgc atccagggtt tgtttcaatg 61 atgtctcgtg gagaatatgg aggggctggt gccaggactg tccttggctt tgcctcgggg 121 tgtgaacggg gtcagtgacc tctaaaacta acctgcctct cagttctgaa tccagacaga 181 atcaatcctc agctgtgtct cgctccacac cccctgccct ggaagccagg gaaggttgga 241 ggtgctaggg ggtcaggctc ccctctgtga cccctgcagc tgttgtggtg actcatgtcc 301 caacctagct gcctctccca aggagacttt cccctgggac aagggggagg gaatggcatg 361 gaggaggccc acatcaagcg gggccaggaa cccacggtgg caggagctgg gctggtgacc 421 tacccagggc agaagggccc gggactcatc cagaggggaa ggaaggggtc ttcaggaaga 481 ccacggagat gccacaggca gaattggctt cccatctggg agataggtgg ggagaccctg 541 gcattttgac agccagaacc tggggtgctg agcagaatct tcatgcctgg cctggccgcc 601 ttcggaggga agctggaggg ttgggtgcga gaggagtggg gtcagagccc ctacatccgc 661 aggaccccaa atcggctggg ccccaaggcc cggactgcgc tccccggtgg ccccggcggc 721 cctccgcgaa tgcgtcctgc ccctcccctg cccaagccct ctgccctcac ccgggtccgg 781 cgccgccccc gaagtggcgg gaacaacccg aacccgaacc ttctgtcctc gggagccccc 841 agataagcgg ctgggaaccc gcggggcccg caggggaggc ccggctgttc cgcccgctaa 901 gtgcattagc acagctcacc tcccctatcg 
cgcctgccat cggacgggca gtgccgcgcc 961 ctgctctggg gcccccggag cgaccacagc ggaggccgga acggactgtc ctttctgggg 1021 cggggtgggg agggggtgtc gctggagggc ccggtggcat agcaacggac gagagaggcc 1081 tggaggaggg gcggggaggg ggagttgtgt ggcagttcta agggaagggt gggtgctggg 1141 acgggtgtcc gggagggagg ggagcctggc ggggtctggg gcctcgtcgc ggagggcgct 1201 gcgaggggga aactggggaa agggcctaat tccccagtct ccacctcgaa tcaggaaaga 1261 gaaggggcgg gctgctgggc aaaagaggtg aatggctgcg gggggctgga gaagagagat 1321 gggaggggcc ggccggcggg ggtgaggggg tctaaagatt gtgggggtga ggaactgagg 1381 gtggggggcg cccagaggcg ggactcgggg cggggcaggc gaggcggagg gcgagggctg 1441 cgggagcaag tacggagccg ggggtgtggg ggacgattgc cgctgcagcc gccgccccac 1501 tcacctccgg tgtgtctgca gcccggacac taagggagat ggatgaatgg gtggggagga 1561 tgcggcgcac atggccccgg gcggctcggc ggtcagctgc cgcccccaca gcggaccggt 1621 cggggcgggg gtcgggcggt agaaaaaagg gccgcgaggc gagcggggca ctgggcggac 1681 cgcggcggca gcatgagcgg cgcagaccgt agccccaatg cgggcgcagc ccctgactcg 1741 gccccgggcc aggcggcggt ggcttcggcc taccagcgct tcgagccgcg cgcctacctc 1801 cgcaacaact acgcgccccc tcgcggggac ctgtgcaacc cgaacggcgt cgggccgtgg 1861 aagctgcgct gcttggcgca gaccttcgcc accggtgagc gggggaaact gaggcacgag 1921 ggacaagagg tcgtcgggga gtgaaagcag gcgcagggaa ataaaaagaa ggaaagggag 1981 acagaccagg cgcctaacag atggggacca agaaacaaga gatagctgag aggtgcaaac 2041 agaagagaaa aaggagcaac atcccttagg agaggggcag aggagagaga ggtggagaga 2101 gggggcggag agtgctcaga attgagagct aaggtggggg atgcaggaca gactgaggtg 2161 gagatgcata ggaggaaatg gaggcagatg tgggacaggg gtgagaaact ccaggatttc 2221 ctcgctgagc ctggctggta ggtatagttg ttttctttct ttttctttat tttattttca 2281 tttatttact tatttttatt ttttatttgt tttgagacgg agtttcgctc ttgttgccca 2341 ggctggagta caatggcgcc atctcggctc actgcaacct ccgcctcccc gggttcaagc 2401 gattctcttg cctcagcttc cctagtagct gggattacag gcatgcgccc ccatgcctgg 2461 ctaatttatt tgtattttta gtagagacgg gacttctcca tgttggtcag gctggtctcg 2521 aactcccaac cttaggatcc acccaccccg gcctcccaaa gtgctgggat tacaggtgtg 2581 agccactgcg cccggccagt aggtatagtc ttctagatgt 
gaaacctgag tctcagagcg 2641 gtgaagttcc cttccgaagg gcagcccatg ttggagctgg gttcagtcta actctggggc 2701 caatgctttt tccagatgga gacacatttg cagaggagaa ggaagaacta gagagaggca 2761 gggagatgca ggggagggaa gggtaaggag gcaggggctg cctgggctgg ctggcaccag 2821 gaccctcttc ctctgccctg cccaggtgaa gtgtccggac gcaccctcat cgacattggt 2881 tcaggcccca ccgtgtacca gctgctcagt gcctgcagcc actttgagga catcaccatg 2941 acagatttcc tggaggtcaa ccgccaggag ctggggcgct ggctgcagga ggagccgggg 3001 gccttcaact ggagcatgta cagccaacat gcctgcctca ttgagggcaa ggggtaagga 3061 ctggggggtg agggttgggg aggaggcttc ccatagagtg gctggttggg gcaacagagg 3121 cctgagcgta gaacagcctt gagccctgcc ttgtgcctcc tgcacaggga atgctggcag 3181 gataaggagc gccagctgcg agccagggtg aaacgggtcc tgcccatcga cgtgcaccag 3241 ccccagcccc tgggtgctgg gagcccagct cccctgcctg ctgacgccct ggtctctgcc 3301 ttctgcttgg aggctgtgag cccagatctt gccagctttc agcgggccct ggaccacatc 3361 accacgctgc tgaggcctgg ggggcacctc ctcctcatcg gggccctgga ggagtcgtgg 3421 tacctggctg gggaggccag gctgacggtg gtgccagtgt ctgaggagga ggtgagggag 3481 gccctggtgc gtagtggcta caaggtccgg gacctccgca cctatatcat gcctgcccac 3541 cttcagacag gcgtagatga tgtcaagggc gtcttcttcg cctgggctca gaaggttggg 3601 ctgtgagggc tgtacctggt gccctgtggc ccccacccac ctggattccc tgttctttga 3661 agtggcacct aataaagaaa taataccctg ccgctgcggt cagtgctgtg tgtggctctc 3721 ctgggaagca gcaagggccc agagatctga gtgtccgggt aggggagaca ttcaccctag 3781 gctttttttc cagaagctt // LOCUS HSPPTII 3173 bp DNA PRI 15-MAY-1997 DEFINITION H.sapiens gene encoding phenylpyruvate tautomerase II. ACCESSION Y11151 NID g2104580 KEYWORDS phenylpyruvate tautomerase II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3173) AUTHORS Rorsman,H. JOURNAL Unpublished REFERENCE 2 (bases 1 to 3173) AUTHORS Rorsman,H. TITLE Direct Submission JOURNAL Submitted (07-FEB-1997) H. 
Rorsman, Hamtstalle 32, Department Of Dermatology, University Hospital, S-221 85 Lund, SWEDEN FEATURES Location/Qualifiers source 1..3173 /organism="Homo sapiens" /db_xref="taxon:9606" exon <1..234 /number=1 CDS join(127..234,598..773,2918..2990) /EC_number="5.3.2.1" /codon_start=1 /product="phenylpyruvate tautomerase II" /db_xref="PID:e311354" /db_xref="PID:g2104581" /translation="MPFLELDTNLPANRVPAGLEKRLCAAAASILGKPADRVNVTVRP GLAMALSGSTEPCAQLSISSIGVVGTAEDNRSHSAHFFEFLTKELALGQDRILIRFFP LESWQIGKIGTVMTFL" intron 235..597 /number=1 exon 598..773 /number=2 intron 774..2917 /number=2 exon 2918..>3173 /number=3 BASE COUNT 698 a 832 c 843 g 773 t 27 others ORIGIN 1 gatcccggtg ccagggaccc tgcccagttc caggcgtcgc ctaacccaga aacgactggg 61 cgccgcgtcc tggaaaggcc ccagcgcacg gacatctgag gagctgtttc cgttcctctg 121 cccgccatgc cgttcctgga gctggacacg aatttgcccg ccaaccgagt gcccgcgggg 181 ctggagaaac gactctgcgc cgccgctgcc tccatcctgg gcaaacctgc ggacgtaagc 241 gtgggccggg cagcacgggg cgaggggagg ttggtgggcc aggggtccgg ccctgtccct 301 gctccgcctc cccgacagtg accccgaatc ttttccccag ggaccactcc ccactccttt 361 cctcacgcca agctctgact ttccgtgctc cacgatcccg cggctccccc tccgcacgtc 421 tttcccttgt cgccctcccc agtcatgacc cgggtgtgac cttcagggac cgcggcccgt 481 atcgggatcc ctgccccgcg aacactgcgc gtttcggctt tcgcgcgctc gggtcccgtc 541 cccagaggta gcccggccgg ctccaacttc gggcaaaact tttcatgtcc ccctcagcgc 601 gtgaacgtga cggtacggcc gggcctggcc atggcgctga gcgggtccac cgagccctgc 661 gcgcagctgt ccatctcctc catcggcgta gtgggcaccg cggaggacaa ccgcagccac 721 agcgcccact tctttgagtt tctcaccaag gagctagccc tgggccagga ccggtgcgta 781 ggggtagtag gggatccatg tgggactgcc gcagactgga gccactgatc ctgcctcagg 841 gggaaaaacc catttcttgc cctgcccagt aaggacacat cagggtctgg agctttgggg 901 ccccctgacc ccttaggttc ctgctgttag gaccatcttc aaagtgcgag caggattgaa 961 tgaatttctg gctctgctcc tcagtgtgta agtctgtgaa ccgggaaggc tctcttttaa 1021 cacccccggg gcantgcaag ggtcatgtgg gattgtctgt gtgctgtacc tgccttggca 1081 cctgacaggg taggtacacg tggctgaagt gtgattttct agaacttttc caggctggtc 1141 
agaaggaatt ctgggtatgt tctgaagtta cgtattttgg acctgtgtcc cagccaggtt 1201 ccaggtgaag ttcacgggag actcacagag tagtgaaaga ccattggcct ggatgtctag 1261 acatctgctt tctgggtcct gcatagctgg gggaccccag acaaacttgg aaatgaacca 1321 tctccagttt ggcaacctcc tcttcctgtt gaatacaggg gaaaagacct ccctcccccc 1381 acaagaacgt tctgcaaccc caacctggcg ttctnctngt ngaccgagtt aaagtttcct 1441 cttgggtaaa agatattctt gagccacatc catgtctagg aggaagtaag ggcatgagaa 1501 gcttgaaagg acactgtcca ggcacggtgg ctcatgcctg taatcccagc actttgggag 1561 accaaggcgg gaggttcatt tgacccagga gttgtagacc agtctgggta acatagtgag 1621 atgccatccc ccaaaacagt tttaaaaatt aaccggacat ggtggtgtgc acatgtagtc 1681 tcacttagtt ggcaggctga agtgggagga tggtttcagc ccaggaggtt gaggctgcag 1741 tgagctatga ttgcgcattg cactccatcc tgtgtgacag tgagaccttg gctcacaaaa 1801 aaagaccttc tgaaatggag cctttgttan tccatgaagg tcgatgaaga atggctgatc 1861 ctgctgggtc ctccctcaag ctacagaaga ataatacagt cancccctgt atcactgggt 1921 tacnctctgt tgattcaaca cctngattga atnttaaaga aaaaattgat attgctctat 1981 actgaaatac tttaanttta tccctntctt atccccaaca anccnattac tattcttctt 2041 antagtttta attggtatta taagtaaccc caggggttat ttaaagtata tggaggaggg 2101 atgtgtgtag ggtaacatgc aaatattaac accattttat gtaagngaag ggagcngaac 2161 gtagattgtg nttttcctcn ggggggtgtn gaacaaatat ccnacggata ctcagggggg 2221 actgttacta ttacaacaag tggtgtaaan aataccatng gtaggtcttt tacaggtaat 2281 taacaatcat gatcccattt gatcctaaca atccctatcc antctccctt tgctcagaca 2341 agaacatgct atccccatga ggtagataat ctccattatg ccaattttat gatgagaaga 2401 caggctgttc ataactcccg cccgccccaa ccaggctgtc ccaagttntg gttttcctca 2461 ctggaaactt gctgttacct ctgagagggg catgttcact tatggcccag acattagtgg 2521 cggggttggt ggggtggggg gtcaccctgt gctcagcctt tgagaaaaca ggtggctgag 2581 gtactgtgcc cttagggagc ctgcaattag gaggcagggt acccctcagt acacaaactg 2641 atggagatga tagagtgtac ggcacacaca ccgaggctac tgtattgtgt gtctgtccta 2701 nctgggtcat gatgtgcggc tacctcccac acctgagtgt cccactgccc tgctgggggt 2761 tggggaaaat cattattggg atgagcaggt ttgcnaaatg cctggtggac tgaggcaggc 2821 tgtcctgagt 
acccacagtg ggagtacctg gcaggggtct tgcaagatgt ggtattancc 2881 aaggagctga tgatatcttt ttctctgtct cctgaaggat acttatccgc tttttcccct 2941 tggagtcctg gcagattggc aagataggga cggtcatgac ttttttatga ttgggcacgg 3001 agggatccag ggcatctgtg aactggctgc ttcttccaga gagatctctt ggcagagtga 3061 gggcctggag ataaccagct ttggattatc ccgcatgcaa cattcctgtg atcacataat 3121 cctcttcttc atcctcatat gaaataaatg aagagagctt cctcattcaa aaa // LOCUS HSPR264SC 4251 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens PR264 gene. ACCESSION X75755 NID g455418 KEYWORDS PR264 gene; SC35 gene; splicing factor SC35. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4251) AUTHORS Sureau,A. and Perbal,B. TITLE Several mRNAs with variable 3' untranslated regions and different stability encode the human PR264/SC35 splicing factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (3), 932-936 (1994) MEDLINE 94134745 REFERENCE 2 (bases 1 to 4251) AUTHORS Perbal,B. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) B. Perbal, Inst. Curie, Centre Univ. 
Batiment 110, 91405 Orsay Cedex, FRANCE FEATURES Location/Qualifiers source 1..4251 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="normal fibroblasts" /clone_lib="genomic" /clone="E830" /chromosome="17" /map="q25" gene 961..2216 /gene="PR264/SC35" CAAT_signal 961..964 /gene="PR264/SC35" TATA_signal 1014..1019 /gene="PR264/SC35" mRNA join(1108..1578,1913..2223,2502..2605,3204..4233) /gene="HPR4" gene 1108..4233 /gene="HPR4" mRNA join(1114..1578,1913..2223,2502..2605,3204..4245) /gene="HPR6" gene 1114..4245 /gene="HPR6" mRNA join(1119..1578,1913..3301,3800..4233) /gene="HPR5" gene 1119..4233 /gene="HPR5" CDS join(1217..1578,1913..2216) /gene="PR264/SC35" /codon_start=1 /db_xref="PID:g455419" /db_xref="SWISS-PROT:Q01130" /translation="MSYGRPPPDVEGMTSLKVDNLTYRTSPDTLRRVFEKYGRVGDVY IPRDRYTKESRGFAFVRFHDKRDAEDAMDAMDGAVLDGRELRVQMARYGRPPDSHHSR RGPPPRRYGGGGYGRRSRSPRRRRRSRSRSRSRSRSRSRSRYSRSKSRSRTRSRSRST SKSRSARRSKSKSSSVSRSRSRSRSRSRSRSPPPVSKRESKSRSRSKSPPKSPEEEGA VSS" polyA_signal 3503..3509 /gene="HPR5" /note="polyA1" polyA_signal 4211..4217 /gene="HPR5" /note="polyA2" BASE COUNT 962 a 1060 c 1197 g 1032 t ORIGIN 1 aagcttttta gattccgggg acattttggc tcagctctgc cggaggcagc agcccagggc 61 agtcagctct cctcggggcg aagccactga caatcctgga gaaagaaatg agatggtgag 121 tggagaaggc agatgtagga gccaggcagc caagctgtca acttccagtt cctggatcgg 181 aaccccaggt ttagggcgaa gttctcccca ccggcagtta ggatactcct aagaagcgat 241 ccgagtacgg aagagcacaa cccccggaga agcgccaaga atttatccgc tccaggttga 301 ggggcccacg aaggggatta aagggccaag tgccagccgc agcagccggc gcggccccgc 361 cctcccagcc tgactcactg cggccaggcg accgggtcac ctgaccgcag ccgcggcgcc 421 ggccgctcgg cccgctggga aacgtagtcc cgggtgacct ggcacgcgca agcgctcggc 481 ggggagtccc caccgccagg acgcagagcc gacggagctc tcgggctctg gcgaaagggg 541 gttgggaagc ggactccggc gaagagcagt caggaacggc tggacggggc cgagagacga 601 cgcagcggag tctgaggggg ccggggtcac agggagaggc aaatgagggc agaagcactc 661 ttcagacgga gacggggctg acggctgagc aaaaaagggg cgtgaacttg gagtattggc 721 atgaaaaaaa ggtgggggcc 
cagatgggga gcacctcctc ttcctcctgc ccaatcgcga 781 tcgtccggcc tcccaggcgg aaaaagcgtt tcgcgggctt tccaactgcc cgctaattcc 841 gctccccctc cctccagccg tgactccgcg ctttttggcc cgcccgccgg gctgtgcgca 901 ggcgcttcgg gtaggggcgg ggcgcgcggc agggtcgtta cgaagcgggg cgcggtgggc 961 caatcagaag gtttcatttc cgggtggcgc gggcgccatt ttgtgaggag cgatataaac 1021 gggcgcagag gccggctgcc cgcccagttg ttacttaggt gcgctagcct gcggagcccg 1081 tccgtgctgt tctgcggcaa ggcctttccc agtgtcccca cgcggaaggc aactgcctga 1141 gaggcgcggc gtcgcaccgc ccagagctga ggaagccggc gccagttcgc ggggctccgg 1201 gccgccactc agagctatga gctacggccg cccccctccc gatgtggagg gtatgacctc 1261 cctcaaggtg gacaacctga cctaccgcac ctcgcccgac acgctgaggc gcgtcttcga 1321 gaagtacggg cgcgtcggcg acgtgtacat cccgcgggat cgctacacca aggagtcccg 1381 cggcttcgcc ttcgttcgct ttcacgacaa gcgcgacgct gaggacgcta tggatgccat 1441 ggacggggcc gtgctggacg gccgcgagct gcgggtgcaa atggcgcgct acggccgccc 1501 cccggactca caccacagcc gccggggacc gccaccccgc aggtacgggg gcggtggcta 1561 cggacgccgg agccgcaggt aaacggggct gaggggaccg cgggaggcgg ggcggggcgc 1621 gcgggaggcc cgggcgacct cacaaaggtc cgcggcgaag cacgtggtgc gggcccggac 1681 ggggcggggg tgcacgccgc gtctcgcgac cctccggcca ccccgcgagc ttccgccgtc 1741 tgcgacccgg gagtggccgg ggtgtgggcg gcgcggggcg gaggaccccg cctcgcgact 1801 ggggaaatgg cgtctggcgg cgagataatg gcggcctggg cgggagcgcg cgggcggggc 1861 cggccccgct gcctggaatt aaccccgctg tgcttgctcg tcccgcccgc agccctaggc 1921 ggcgtcgccg cagccgatcc cggagtcgga gccgttccag gtctcgcagc cgatctcgct 1981 acagccgctc gaagtctcgg tcccgcactc gttctcgatc tcggtcgacc tccaagtcca 2041 gatccgcacg aaggtccaag tccaagtcct cgtcggtctc cagatctcgt tcgcggtcca 2101 ggtcccggtc tcggtccagg agtcctcccc cagtgtccaa gagggaatcc aaatccaggt 2161 cgcgatcgaa gagtcccccc aagtctcctg aagaggaagg agcggtgtcc tcttaagaaa 2221 atggtaatgt ctgggaatcc gagacacata accctaattc ataaatggga tttggggtag 2281 gtctttttga gtcgtgttaa tgtaagaatg actcctatca ttaggagtgc tgctcggagg 2341 ttactcacct ttgggagtaa tactgaagag aggggtctgc agaaaggatg tgtatgaagc 2401 ttagataata atggctgttt cgtaaactgt 
ttgagaccta ttaatgaaaa tgactatttc 2461 ttgctgtttt tatccaacgt ctgcattttc cccctttaaa gctgcggtct cctgtttgat 2521 aaaagaatat tggccagtat tgcagatttt aactgatttg gctgatcctc cagggaccag 2581 tttctgtggg cgtgtattgg agcaggtttg tctttaaatg ttaaagatgc actatcctct 2641 tagagaaaca atcagttcaa ctattgttgt actgactggg acttcatatt ctaatggatg 2701 tggcaaaaga attgcaataa gaagcagtga acatttggaa ccccaaaaga aagttacagg 2761 tattgcactg ggtggggaaa ggatagtgtg tctttaactc ttaaattgtt tggtcctatt 2821 ttaaaaggaa gggccctaag tagctcagat attaaagtag tatctcaatt accaaatgtt 2881 cattgaacat tacttaatga aatatagacc aatctctgac tcgagttgtt ttgtttggat 2941 acagcccttt tttttctttt ttttccttcc ccttaccttt cttcaccttg gttatttggc 3001 caggaatacg taaattcaaa cttgtacatg ctgatggtag cctttgtgaa attttcctaa 3061 ttgggccttt taaaaacata ggctgggtgg aacatttctg taccctactg gtttgaccag 3121 agccttagta agtacgtgcc tgaaactgaa accatgtgca ctttaatgga aggtaagctg 3181 aacttctttc ttttcaaacc tagatgtatc ggcaagcagt gtaaacggag gacttgggga 3241 aaaaggacca catagtccat cgaagaagag tccttggaac aagcaactgg ctattgaaaa 3301 ggttattttg taacatttgt ctaacttttt acttgtttaa gctttgcctc agttggcaaa 3361 cttcatttta tgtgccattt tgttgctgtt attcaaattt cttgtaattt agtgaggtga 3421 acgacttcag atttcattat tggatttgga tatttgaggt aaaatttcat tttgttatat 3481 agtgctgact ttttttgttt gaaattaaac agattggtaa cctaatttgt ggcctcctga 3541 cttttaagga aaacgtgtgc agccattaca cacagcctaa agctgtcaag agattgactc 3601 ggcattgcct tcattcctta aaattaaaaa cctacaaaag ttggtgtaaa tttgtatatg 3661 ttatttacct tcagatctaa atggtaatct gaacccaaat ttgtataaag acttttcagg 3721 tgaaaagact tgattttttg aaaggattgt ttatcaaaca caattctaat ctcttctctt 3781 atgtattttt gtgcactagg cgcagttgtg tagcagttga gtaatgctgg ttagctgtta 3841 aggtggcgtg ttgcagtgca gagtgcttgg ctgtttcctg ttttctcccg attgctcctg 3901 tgtaaagatg ccttgtcgtg cagaaacaaa tggctgtcca gtttattaaa atgcctgaca 3961 actgcacttc cagtcacccg ggccttgcat ataaataacg gagcatacag tgagcacatc 4021 tagctgatga taaatacacc tttttttccc tcttccccct aaaaatggta aatctgatca 4081 tatctacatg tatgaactta acatggaaaa tgttaaggaa 
gcaaatggtt gtaactttgt 4141 aagtacttat aacatggtgt atctttttgc ttatgaatat tctgtattat aaccattgtt 4201 tctgtagttt aattaaaaca ttttcttggt gttagctttt ctcagaatac g // LOCUS HSPRG1 1864 bp DNA PRI 23-SEP-1997 DEFINITION H.sapiens PRG1 gene. ACCESSION X96438 NID g2440072 KEYWORDS PRG1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1864) AUTHORS Trauzold,A. TITLE Direct Submission JOURNAL Submitted (05-MAR-1996) A. Trauzold, Laboratory of Molecular Gastroenterology, 1st Dept.of Medicine, University of Kiel, Schittenhelmstrasse 12, Kiel, D-24105, FRG REMARK revised by [4] REFERENCE 2 (bases 1 to 508) AUTHORS Schaefer,H., Trauzold,A., Lettau,P., Kalthoff,H., Foelsch,U.R. and Schmidt,W.E. TITLE cDNA cloning and sequencing of a novel human early response gene and characterization of its expression in pancreatic carcinoma cells JOURNAL Gastroenterology In press REFERENCE 3 (bases 1 to 1864) AUTHORS Schaefer,H. 
TITLE Direct Submission JOURNAL Submitted (23-SEP-1997) H.Schaefer, Trauzold, Laboratory of Molecular Gastroenterology, 1st Dept.of Medicine, University of Kiel, Schittenhelmstrasse 12, Kiel, D-24105, FRG FEATURES Location/Qualifiers source 1..1864 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="818-4" promoter 1..562 /gene="PRG1" gene 1..1864 /gene="PRG1" mRNA join(563..801,914..1864) /gene="PRG1" exon 563..801 /gene="PRG1" /number=1 CDS join(592..801,914..1174) /gene="PRG1" /codon_start=1 /db_xref="PID:e350480" /db_xref="PID:g2440073" /translation="MCHSRSCHPTMTILQAPTPAPSTIPGPRRGSGPEIFTFDPLPEP AAAPAGRPSASRGHRKRSRRVLYPRVVRRQLPVEEPNPAKRLLFLLLTIVFCQILMAE EGVPAPLPPEDAPNAASLAPTPVSPVLEPFNLTSEPSDYALDLSTFLQQHPAAF" intron 802..913 /gene="PRG1" /number=1 exon 914..1864 /gene="PRG1" /number=2 BASE COUNT 395 a 571 c 519 g 379 t ORIGIN 1 ctactagaag aaggacggag ggagcgaggg agatcacgag gtcaggaaat cgagaccacc 61 ctggccaaaa gggtgaaacc ccgtctctac taaaaataca aaaattagct gggcgtggtg 121 gcgcgtgcct gtattcccag ctactcagga agctgaggca ggaaaatcgc ttgaaccagg 181 gagtcagggg ttgcagtgag ccgagatcgc gcctctggat tccagcctgg cgaccgaacg 241 agactgctcc atctccaaaa aaaaaaaaaa aggcctgtga gggatcctgt ggctaaagtg 301 agcccctctc caggtgccac atgcctcgac atgtgcctgc agcccgggat ctcacccacc 361 cccactcacg actcacacac tcacaacgtg cagttgggcg cctaggattg tgcatgttca 421 agtctccacc cactcccttt gttaatcgtc ggaatttcca gcccgctgct gccaaccgct 481 ccccagctgc gggaggagga gttagaagga cccgcccaat tttcaggagc acataaatta 541 cctctgccgg cagccgaccc tcacttggcc ttacactccg ctcggctcac catgtgtcac 601 tctcgcagct gccacccgac catgaccatc ctgcaggccc cgaccccggc cccctccacc 661 atcccgggac cccggcgggg ctccggtcct gagatcttca ccttcgaccc tctcccggag 721 cccgcagcgg cccctgccgg gcgccccagc gcctctcgcg ggcaccgaaa gcgcagccgc 781 agggttctct accctcgagt ggtgagtatc gccgaagtgg gcattcgcgg ggtgcgctgc 841 cctggagtca ctggggaacg acccgactcc agaggcctcg acctgacctg tctcctgttt 901 tgtctcccct taggtccggc gccagctgcc agtcgaggaa ccgaacccag ccaaaaggct 961 tctctttctg ctgctcacca 
tcgtcttctg ccagatcctg atggctgaag agggtgtgcc 1021 ggcgcccctg cctccagagg acgcccctaa cgccgcatcc ctggcgccca cccctgtgtc 1081 ccccgtcctc gagcccttta atctgacttc ggagccctcg gactacgctc tggacctcag 1141 cactttcctc cagcaacacc cggccgcctt ctaactggac tccccgcact ccccaaaaag 1201 aatccgaaaa accacaaaga aacaccaggc gtacctggtt gcgcgagagc gtatccccaa 1261 ctgggacttc cgaggcaact tgaactcaga acactacagc ggagacgcca cccggtgctt 1321 gagcgggacc gaggcgcaca gagaccgagg cgcatagaga ccgagcacag cccagctggg 1381 ctaggcccgg tgggaaggag agcgtcgtta atttatttct tattgctcct aattaatatt 1441 tatatgtatt tatgtacgtc ctcctaggtg atggagatgt gtacgtaata tttattttaa 1501 cttatgcaag ggtgtgagat gttccccctg ctgtaaatgc aggtctcttg gtatttattg 1561 agctttgtgg gtctggtgga agcaggacac ctggaactgc ggcaaagtag gagaagaaat 1621 ggggaggact cgggtggggg aggacgtccc ggctgggatg aagtctggtg gtgggtcgta 1681 agtttaggag gtgactgcat cctccagcat ctcaactccg tctgtctact gtgtgagact 1741 tcggcggacc attaggaatg agatccgtga gatccttcca tcttcttgaa gtcgccttta 1801 gggtggctgc gaggtagagg gttgggggtt ggtgggctgt cacggagcga ctgtcgagat 1861 cgcc // LOCUS HSPROPG 8222 bp DNA PRI 30-JUN-1993 DEFINITION H.sapiens gene for properdin. ACCESSION X70872 S46256 NID g35679 KEYWORDS activator; complement; properdin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8222) AUTHORS Nolan,K.F. TITLE Direct Submission JOURNAL Submitted (15-DEC-1992) K.F. Nolan, MRC Immunochemistry Unit, Dept of Biochemistry, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK REFERENCE 2 (bases 1 to 8222) AUTHORS Nolan,K.F., Kaluz,S., Higgins,J.M., Goundis,D. and Reid,K.B. TITLE Characterization of the human properdin gene JOURNAL Biochem. J. 287 (Pt 1), 291-297 (1992) MEDLINE 93038568 COMMENT X70872 and X57748 are related sequences (genomic vs. cDNA). 
FEATURES Location/Qualifiers source 1..8222 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="GM1416B (PLUS ?)" /clone_lib="COS4X PLUS ?" /clone="COS4XP, G3" /chromosome="X" /map="Xp11.23-Xp11.3" repeat_unit complement(26..86) /rpt_family="ALU-like" repeat_unit 491..777 /rpt_family="ALU-like" repeat_unit 1053..1343 /rpt_family="ALU-like" exon 1745..1921 /number=1 mRNA join(1745..1921,2141..2281,2376..2526,3774..3949, 4410..4580,4719..4910,5105..5278,5532..5723,5882..5993, 7610..>7824) /note="alternative" mRNA join(1749..1921,2141..2281,2376..2526,3774..3949, 4410..4580,4719..4910,5105..5278,5532..5723,5882..5993, 7610..>7824) /note="alternative" intron 1922..2140 /number=1 exon 2141..2281 /number=2 CDS join(2206..2281,2376..2526,3774..3949,4410..4580, 4719..4910,5105..5278,5532..5723,5882..5993,7610..7775) /codon_start=1 /product="properdin" /db_xref="PID:g35680" /db_xref="SWISS-PROT:P27918" /translation="MITEGAQAPRLLLPPLLLLLTLPATGSDPVLCFTQYEESSGKCK GLLGGGVSVEDCCLNTAFAYQKRSGGLCQPCRSPRWSLWSTWAPCSVTCSEGSQLRYR RCVGWNGQCSGKVAPGTLEWQLQACEDQQCCPEMGGWSGWGPWEPCSVTCSKGTRTRR RACNHPAPKCGGHCPGQAQESEACDTQQVCPTHGAWATWGPWTPCSASCHGGPHEPKE TRSRKCSAPEPSQKPPGKPCPGLAYEQRRCTGLPPCPVAGGWGPWGPVSPCPVTCGLG QTMEQRTCNHPVPQHGGPFCAGDATRTHICNTAVPCPVDGEWDSWGEWSPCIRRNMKS ISCQEIPGQQSRGRTCRGRKFDGHRCAGQQQDIRHCYSIQHCPLKGSWSEWSTWGLCM PPCGPNPTRARQRLCTPLLPKYPPTVSMVEGQGEKNVTFWGRPLPRCEELQGQKLVVE EKRPCLHVPACKDPEEEEL" intron 2282..2375 /number=2 exon 2376..2526 /number=3 mat_peptide join(2381..2526,3774..3949,4410..4580,4719..4910, 5105..5278,5532..5723,5882..5993,7610..7772) /product="properdin" intron 2527..3773 /number=3 exon 3774..3949 /number=4 intron 3950..4409 /number=4 exon 4410..4580 /number=5 intron 4581..4718 /number=5 exon 4719..4910 /number=6 intron 4911..5104 /number=6 exon 5105..5278 /number=7 intron 5279..5531 /number=7 exon 5532..5723 /number=8 intron 5724..5881 /number=8 exon 5882..5993 /number=9 intron 5994..7609 /number=9 repeat_unit 7329..7485 /rpt_family="ALU-like" exon 
7610..>7824 /number=10 polyA_signal 7819..7824 BASE COUNT 1920 a 2184 c 2246 g 1872 t ORIGIN 1 gtcgacaagc ttatcgatac agctggatcc acctgcctca atctcccaaa gtgttgggat 61 tacaggcgtg agcaccatgc ctggccagtt tcaggtctct tggagagtgt tttcacagga 121 taacacagca atgattgtct tgaagtcgta gaatcttaat aagactcctt tgttcaaagc 181 acattttcaa ggatagcctt atgttatcag agaacagagt atgtctgtaa tggatacaga 241 tattaacaca tactttactg cccataaaac aagtgttcct gaagaacttc tgttttaaca 301 gctgaccaaa ttattacatt tgaatttgca gaattcctct gcctccctga gagcagagga 361 aattaggcat tgccttccta tctttcttgt gcttcttttt gttattaaaa gcaaaataaa 421 aattttaaag tgggccctgg agtttcatta gggtttataa aaaaaaaata aaaaaataaa 481 aaaaaaagaa gggctggtgt gatggctcat gcctgtaatc ccagcacttt gggaggctga 541 ggttgggaga aaatattgaa gccaggagtt tgagaccagc ctgggcaaca caacgagacc 601 ccatctatac agaaaataaa aacattagtt gtgtggtggt gtgtgcctgt agtcccagct 661 acttggggac gctaaggtgg gaaaatcact tgagcccagg agtttgaggc tgcaatgagc 721 tatgatcaga ccactgcact ccagcccagg tgacagcaag actctgtctc tttaaaacaa 781 aacaaacaaa aaaacaggtt tggaagctac tgggctagag agtgacaggg atgccagaca 841 agggtatggg gaaggcctct gtgaggaggg gcaacatttg ttatccatct tctcgagggc 901 caggattttt gtctgtccta ttctctgctc taatgctaag aaccgcatct ggcatccagt 961 aggtgctcaa taaatgttga ataaatttat caaacacaga gcacctacta tatgccaggc 1021 actgttctaa agactttcca aaagcttaac tcctggccgg gcacggtgac tcacgcctgt 1081 aatcctagca ctttgggagg ccaaggcagg tggataacct gaggtcagga gttcgagacc 1141 agcctgccca acatggtgaa accctgtctc cactaaaaat acaaaaatta gctggacatg 1201 gtggtacacg gctgtaatcc cagctactca ggaggctgag gcaggagaat tggttgaaat 1261 cgggaggcag aggttgcagt gagccgagat cgtgacactg cacgccagcc tgggcgacgg 1321 agtgagactc catctcgaca aaacaaacaa aagcttaact cgcaatcttt gcaacacacc 1381 ctacctatgt tgtaggtgct gtctgtgagc acattttaca gaggggaaac aggctccaga 1441 aggggggtgt tatctgcctg agggttctct gatggaatat ggcccagctt attccctcct 1501 aatcacccca gaccctggtg ttgtccccag ctgggcccat ggaacaatgg atagacccca 1561 ttcccctcct gaatccctcc ccaggctcct gccacttccc tttctcactt cccttgtgct 1621 
gacctccccg accacaagct gaccacaggg aacctctcag gaaaggaggg ccgaggttaa 1681 gggagggtgg aaggtggagc aactgactcg atgctccctc cacccccacg agcaggaaca 1741 tactgagcac cgcacactca cttcaccctg gttcaacacc cccacgaggt tgaccccgtc 1801 attatgttag agattatgca ttttccacat agggaaactg aggctcaggg gtgttaagtg 1861 actcacccaa ggtcacacgg ctaggaagtt gctgcacgct cctatgctcc atttcctctg 1921 ggtgagaatt tgtcctagga gcttctattg aacactgtga ggttgctggg aggagcgcga 1981 ggtgctgggg atatagtaac aaacagaaca agaccaagag taagacagga aatggggagg 2041 ggagagtaga gaaggaggca gaggggacag ggtggggtca aagcaggatg ccctcccagt 2101 tcctcctgcc tctaggttcc tctttccctg ctgattccag gagcctatca acccagataa 2161 agcgggacct cctctctggt agaggtgcag ggggcagtac tcaacatgat cacagaggga 2221 gcgcaggccc ctcgattgtt gctgccgccg ctgctcctgc tgctcaccct gccagccaca 2281 ggtgaggggg tgagggccca gctggcccca ggggtgtcca ggcccaccac atccctgacc 2341 gagccccacc cacccatccc ctggccctgt tacaggctca gaccccgtgc tctgcttcac 2401 ccagtatgaa gaatcctccg gcaagtgcaa gggcctcctg gggggtggtg tcagcgtgga 2461 agactgctgt ctcaacactg cctttgccta ccagaaacgt agtggtgggc tctgtcagcc 2521 ttgcaggtta gggggagcct ggggtggggg atggggggga gggaaggact gtctgtgggg 2581 gtcacccaaa actgggcact ttctttctct gggacttcca agggtggccc tgaggccctg 2641 atcgtgacca tacctcaggg tttctcaacc ttgttactat tgacatttgg ggctggaaat 2701 tcttggctgg gggaggggat tgcttacctg tgtattgtgg gatgttttgt aacaccccta 2761 gccttccact agatgccagt agcaattctc cccaccagct gtgacaacca aacatgtcag 2821 cagacattgc caaatgtccc ctgggggtca aaatcacccc tgattgagaa ccactgacat 2881 agctcacaca tattgagcac ctcctgcaca ctggaagcat tgcaggtggg gtgcagcttg 2941 gtatttccct aattttgatg acagctcatg tttattgagt gcttactgta tatggacaca 3001 ctgtgggggc gggaaagttt ggggaattca aagggcagct ctaggcccct aattatgatg 3061 acacttcaca ttcattgagc actccctgta taccagctgc atggcagggt tgggagatga 3121 gcttgaggat ttctgagggt agctctagtc tcctgatcat gatttccagc ttgcatggat 3181 tgagccctta ccgtatgcca gacctgtttt ctacttgttt actatatatc ccagatacac 3241 cgtgggcatg gggagcttgg ggatttttga gggcagctct gggccccaag tcatgtcaac 3301 agttcatcac 
atctatttgg catgaacagc atacctaggg ctatgcagtg ggttggggat 3361 agatctgggg ttatttctga ggaggcctct gaggtcatag ggatgatggc tcaggtttat 3421 taagcacttt ctatttacct actaccgtgt agtgtggtga tctggtgggt ttctgacagc 3481 agcccttggt ttctagtcct tacctttgct tactgcggac ctggtgttgg ccatgccagc 3541 cttccaatgg taactccaga tgcctgattg tgaaggtaaa tgacattatc aagcacttat 3601 tgtatatcca ctgcttccca tcagtatccc aatatcccta aaaggtaagg acttatcaat 3661 ccatcgcaca cgcacttgcc tagcaaccct tcgtgtctgc ccacactctg gagtcccaca 3721 tgccattctt gttagctcat gccaggatgg gatgtgtgtg ctcttctcca caggtcccca 3781 cgatggtccc tgtggtccac atgggccccc tgttcggtga cgtgctctga gggctcccag 3841 ctgcggtacc ggcgctgtgt gggctggaat gggcagtgct ctggaaaggt ggcacctggg 3901 accctggagt ggcagctcca ggcctgtgag gaccagcagt gctgtcctgg tgaggaggat 3961 gagcaaggcg ggcaagcact ccctgtcacc cagcactggt gggaacgatc ggtacagagg 4021 tcctcagagt gggcccgtcc ccagtgtttg taaggaccta aggaggagag tggtgggagg 4081 tgaggtcaga gaggtgagga gggtccattg cctgggggcc acagagagga cgttggcttt 4141 tcatctgagt aatttggact gtgggcagag gaggaacgtg atctgactgg ggttgggagg 4201 agaacagact gtgcacggtg agggtgagag gggaagcgga ggcacccatg tgaaggttgc 4261 tgcggcaggc ccggtgagcg gggatggtgg gcccagaatg gtggcagagg atataaggac 4321 aggattagct catgtggggc actcaaggaa gcaaagagcc ctgaatgaca ccccacctcc 4381 atcccattac cctctcttct ctcccacaga gatgggcggc tggtctggct gggggccctg 4441 ggagccttgc tctgtcacct gctccaaagg gacccggacc cgcaggcgag cctgtaatca 4501 ccctgctccc aagtgtgggg gccactgccc aggacaggca caggaatcag aggcctgtga 4561 cacccagcag gtctgcccca gtgagtgagg gaagccacag gatgctgatg ggtgcaccca 4621 gcgtgggtct gcggggttac agcagggaac tgccaggctg agaggacttt agcatgcatg 4681 accacacctg catccctcca cctcccaccc caccacagca cacggggcct gggccacctg 4741 gggcccctgg accccctgct cagcctcctg ccacggtgga ccccacgaac ctaaggagac 4801 acgaagccgc aagtgttctg cacctgagcc ctcccagaaa cctcctggga agccctgccc 4861 ggggctagcc tacgagcagc ggaggtgcac cggcctgcca ccctgcccag gtacaccagg 4921 atgaggctgc tgatgttgct ggtggggctc ttgattgagg gcagagatgc tctgccattt 4981 ctagagttcc taggccatgt 
gaccatgcag cccttcattt ggggacagtc tgcttcaagg 5041 gtctaggggc tgagaggaag gattgaggag gcctttctcc tcactccctt tcctcctcca 5101 acagtggctg ggggctgggg gccttggggc cctgtgagcc cctgccctgt gacctgtggc 5161 ctgggccaga ccatggaaca acggacgtgc aatcaccctg tgccccagca tgggggcccc 5221 ttctgtgctg gcgatgccac ccggacccac atctgcaaca cagctgtgcc ctgccctggt 5281 cagcatctca gggttcacga tttgcatgcc taagtccccc ttgcccttct ctgctgccca 5341 gctcctgctg ctgaggcctt gttcctgtct ctgattcctc ctttcctggc cctgatacct 5401 tgtttccacc cctggtcccc cattcacaca tctgatcacc tctactccct cctaccgccc 5461 tcattccttc ctctgaaccc ccttgctgat tccctgcttt ggtccaatcc cctgttgccc 5521 tgtctctgca gtggatgggg agtgggactc gtggggggag tggagcccct gtatccgacg 5581 gaacatgaag tccatcagct gtcaagaaat cccgggccag cagtcacgcg ggaggacctg 5641 caggggccgc aagtttgacg gacatcgatg tgccgggcaa cagcaggata tccggcactg 5701 ctacagcatc cagcactgcc cctgtgagtg tcccacagac tgtgctctga gggtggggtg 5761 gcccatgcag gataaggggt ttccaactct ctgctgtgga cctcacgtct ctgcagcctc 5821 cctctcactt tcccaccaag acctccagtt ctgactctgt gacccctacc cctcattgca 5881 gtgaaaggat catggtcaga gtggagtacc tgggggctgt gcatgccccc ctgtggacct 5941 aatcctaccc gtgcccgcca gcgcctctgc acacccttgc tccccaagta cccgtgagtg 6001 agagggcaaa gtatgcctgg gagggggtca ttatattaca ggcgggagag tcggctaccc 6061 caggaaacca gggtttccac tgctgaatct aaggtatctg ccttgccgag tgccacctcc 6121 cagctgcagg aagtggggag aaactaagaa gatgagaaag aagttcttgg aggataattc 6181 gtagcctctg tgcccctcag tcacttgttc agtcactcat ccacttagtc cttcatgtgt 6241 cattgactcc gaccttcagt tacccatgct tttagctgcc atcactcagg cctccattca 6301 ctgcctgcct cccacagtca ctcatgcctc catccacttt ctcagcatct cactgagcca 6361 aggcactcca aggctttttc agcaacagtt gcagatacac acagcccaga tcctgtggat 6421 cctccaaggg acccagctga gcaagccctg acaggaggca tatgaggaac agctggtgca 6481 tgcaacaggt gctgccacct caacctaggt ccaggaagct gactgggcct cccagggcag 6541 gtaggggttg ggaaaggcat tcctggcatc aagactgtga acggagcctt ggagctaggt 6601 gattagccgc tataagtgtt tccaggtgcc agaggggccc aggtgttaga gagggttaaa 6661 atgttcatcc cttttcttcc ttccacatac 
atcaagctac tagattaagt gaactattcc 6721 cagctcgtgg aagggctgcc atctgggaag ggatctcccg gctgaggggg ttagagtggg 6781 gcctgaagat tagaggtctt gaatggaggc acaaaggact tgagtgagcg agagactgag 6841 tggaggaaga agcaattgag agaactggga tgggagtgaa tggagggctt tcctagtggg 6901 acaaataacc caggtaaaga cctttctgga cagggactga gcttgaataa agtggaggag 6961 acagagagag agattggctt ggggcaggga tttgggccca acaataaggc cttgaatgtc 7021 aggacaaggc attgggtttt ttcctggagg cagtggaaag agatataaag ggtatatatt 7081 tcggaggtaa aagaaagtag ctcatgccaa caagtaacat tcaccagcat ttattgagca 7141 cttgctgtgt gccaggcact gctctgagca ctttacctga attaactcat ttatcctcat 7201 tcaaccccgt gaggtgagta ttaatattac tattcctacc attctcattt tgcagatgag 7261 ggaaaccgag gcacagagaa gctaaagaaa ttgcccaata tcctatagct aaaaagtagt 7321 gttgggaggc caggcacagt ggctcacgcc tgtaatccca gcaacacttt gggaggctga 7381 ggcaggagca tcacttgagc ccaggagatt gaggctgcag tgagctatga ttgagccact 7441 gcactgcagc ctgggcaaca gagtgagacc ctgcctcaaa aaaaaaaaag gaaaaaaaaa 7501 aaaaaaaaaa gtagccctcg gcatatagca ggcactcaat aagaattgaa tgcattcttg 7561 ccttccctga gattctccct tccgttcctc cccaccccta atgcctcagg cccaccgttt 7621 ccatggtcga aggtcagggc gagaagaacg tgaccttctg ggggagaccg ctgccacggt 7681 gtgaggagct acaagggcag aagctggtgg tggaggagaa acgaccatgt ctacacgtgc 7741 ctgcttgcaa agaccctgag gaagaggaac tctaacactt ctctcctcca ctctgagccc 7801 cctgaccttc caaacctcaa taaactagcc tcttcgagtt cgtctacgat tccttaaagg 7861 aggaaaaaca acctatcccc ttccccaaaa gtagggtgat agcatctcat agggcaaaca 7921 gcacagcacg ggccaagaca cagcagcctg ttgcggaaaa cagtattttc cactgtggtc 7981 gcaggttggc aggagtgaaa gcgagtgtgc agtcccagcc tctccccgcc gctgccataa 8041 cccaatgtct gggccctgcg cagccccctt agagcctcag ggccgcccgg cacgttcctc 8101 tgcagagctc cgcgttacgg gtttcctgat tggctacttg tcctggcagg cacttcttga 8161 ttggccggcg tgcttgttcg ttgtctcccc gtctcctgga cctccctaga ttcccaccag 8221 cg // LOCUS HSPSAG 5873 bp DNA PRI 24-APR-1993 DEFINITION Human DNA for prostate specific antigen (PSA). 
ACCESSION X14810 NID g35732 KEYWORDS Alu repetitive sequence; prostate specific antigen; serine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5873) AUTHORS Klobeck,H.G., Combriato,G., Schulz,P., Arbusow,V. and Fittler,F. TITLE Genomic sequence of human prostate specific antigen (PSA) JOURNAL Nucleic Acids Res. 17 (10), 3981 (1989) MEDLINE 89282407 COMMENT See also acc# x05332 and x07730. Data kindly reviewed (19th June 1989) by Klobeck H.-G. FEATURES Location/Qualifiers source 1..5873 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoid cell line GM 607" TATA_signal 332..338 misc_feature 355..365 /note="transcriptional start region" exon <401..446 /number=1 mRNA join(<401..446,1688..1847,3477..3763,3907..4043, 5413..>5568) CDS join(401..446,1688..1847,3477..3763,3907..4043,5413..5568) /codon_start=1 /product="prostate specific antigen (PSA)" /db_xref="PID:g296671" /db_xref="SWISS-PROT:P07288" /translation="MWVPVVFLTLSVTWIGAAPLILSRIVGGWECEKHSQPWQVLVAS RGRAVCGGVLVHPQWVLTAAHCIRNKSVILLGRHSLFHPEDTGQVFQVSHSFPHPLYD MSLLKNRFLRPGDDSSHDLMLLRLSEPAELTDAVKVMDLPTQEPALGTTCYASGWGSI EPEEFLTPKKLQCVDLHVISNDVCAQVHPQKVTKFMLCAGRWTGGKSTCSGDSGGPLV CNGVLQGITSWGSEPCALPERPSLYTKVVHYRKWIKDTIVANP" sig_peptide join(401..446,1688..1713) intron 447..1687 /number=1 exon 1688..1847 /number=2 mat_peptide join(1714..1847,3477..3763,3907..4043,5413..5565) /product="prostate specific antigen (PSA)" intron 1848..3476 /number=2 misc_feature 2311..2663 /note="Alu repetitive sequence" exon 3477..3763 /number=3 intron 3764..3906 /number=3 exon 3907..4043 /number=4 intron 4044..5412 /number=4 exon 5413..>5568 /number=5 BASE COUNT 1186 a 1778 c 1503 g 1406 t ORIGIN 1 ggtgtcttag gcacactggt cttggagtgc aaaggatcta ggcacgtgag gctttgtatg 61 aagaatcggg gatcgtaccc accccctgtt tctgtttcat cctgggcatg tctcctctgc 121 ctttgtcccc tagatgaagt ctccatgagc tacaagggcc 
tggtgcatcc agggtgatct 181 agtaattgca gaacagcaag tgctagctct ccctcccctt ccacagctct gggtgtggga 241 gggggttgtc cagcctccag cagcatgggg agggccttgg tcagcctctg ggtgccagca 301 gggcaggggc ggagtcctgg ggaatgaagg ttttataggg ctcctggggg aggctcccca 361 gccccaagct taccacctgc acccggagag ctgtgtcacc atgtgggtcc cggttgtctt 421 cctcaccctg tccgtgacgt ggattggtga gaggggccat ggttgggggg atgcaggaga 481 gggagccagc cctgactgtc aagctgaggc tctttccccc ccaacccagc accccagccc 541 agacagggag ctgggctctt ttctgtctct cccagcccca cttcaagccc atacccccag 601 tcccctccat attgcaacag tcctcactcc cacaccaggt ccccgctccc tcccacttac 661 cccagaactt tcttcccatt tgcccagcca gctccctgct cccagctgct ttactaaagg 721 ggaagttcct gggcatctcc gtgtttctct ttgtggggct caaaacctcc aaggacctct 781 ctcaatgcca ttggttcctt ggaccgtatc actggtccat ctcctgagcc cctcaatcct 841 atcacagtct actgactttt cccattcagc tgtgagtgtc caaccctatc ccagagacct 901 tgatgcttgg cctcccaatc ttgccctagg atacccagat gccaaccaga cacctccttc 961 tttcctagcc aggctatctg gcctgagaca acaaatgggt ccctcagtct ggcaatggga 1021 ctctgagaac tcctcattcc ctgactctta gccccagact cttcattcag tggcccacat 1081 tttccttagg aaaaacatga gcatccccag ccacaactgc cagctctctg agtccccaaa 1141 tctgcatcct tttcaaaacc taaaaacaaa aagaaaaaca aataaaacaa aaccaactca 1201 gaccagaact gttttctcaa cctgggactt cctaaacttt ccaaaacctt cctcttccag 1261 caactgaacc tcgccataag gcacttatcc ctggttccta gcacccctta tcccctcaga 1321 atccacaact tgtaccaagt ttcccttctc ccagtccaag accccaaatc accacaaagg 1381 acccaatccc cagactcaag atatggtctg ggcgctgtct tgtgtctcct accctgatcc 1441 ctgggttcaa ctctgctccc agagcatgaa gcctctccac cagcaccagc caccaacctg 1501 caaacctagg gaagattgac agaattccca gcctttccca gctccccctg cccatgtccc 1561 aggactccca gccttggttc tctgcccccg tgtcttttca aacccacatc ctaaatccat 1621 ctcctatccg agtcccccag ttccccctgt caaccctgat tcccctgatc tagcaccccc 1681 tctgcaggcg ctgcgcccct catcctgtct cggattgtgg gaggctggga gtgcgagaag 1741 cattcccaac cctggcaggt gcttgtggcc tctcgtggca gggcagtctg cggcggtgtt 1801 ctggtgcacc cccagtgggt cctcacagct gcccactgca tcaggaagtg agtaggggcc 
1861 tggggtctgg ggagcaggtg tctgtgtccc agaggaataa cagctgggca ttttccccag 1921 gataacctct aaggccagcc ttgggactgg gggagagagg gaaagttctg gttcaggtca 1981 catggggagg cagggttggg gctggaccac cctccccatg gctgcctggg tctccatctg 2041 tgtccctcta tgtctctttg tgtcgctttc attatgtctc ttggtaactg gcttcggttg 2101 tgtctctccg tgtgactatt ttgttctctc tctccctctc ttctctgtct tcagtctcca 2161 tatctccccc tctctctgtc cttctctggt ccctctctag ccagtgtgtc tcaccctgta 2221 tctctctgcc aggctctgtc tctcggtctc tgtctcacct gtgccttctc cctactgaac 2281 acacgcacgg gatgggcctg ggggaccctg agaaaaggaa gggctttggc tgggcgcggt 2341 ggctcacacc tgtaatccca gcactttggg aggccaaggc aggtagatca cctgaggtca 2401 ggagttcgag accagcctgg ccaactggtg aaaccccatc tctactaaaa atacaaaaaa 2461 ttagccaggc gtggtggcgc atgcctgtag tcccagctac tcaggagctg agggaggaga 2521 attgcattga acctggaggt tgaggttgca gtgagccgag accgtgccac tgcactccag 2581 cctgggtgac agagtgagac tccgcctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga 2641 aaagaaaaga aaagaaaagg aagtgtttta tccctgatgt gtgtgggtat gagggtatga 2701 gagggcccct ctcactccat tccttctcca ggacatccct ccactcttgg gagacacaga 2761 gaagggctgg ttccagctgg agctgggagg ggcaattgag ggaggaggaa ggagaagggg 2821 gaaggaaaac agggtatggg ggaaaggacc ctggggagcg aagtggagga tacaaccttg 2881 ggcctgcagg caggctacct acccacttgg aaacccacgc caaagccgca tctacagctg 2941 agccactctg aggcctcccc tccccggcgg tccccactca gctccaaagt ctctctccct 3001 tttctctccc acactttatc atcccccgga ttcctctcta cttggttctc attcttcctt 3061 tgacttcctg cttccctttc tcattcatct gtttctcact ttctgcctgg ttttgttctt 3121 ctctctctct ttctctggcc catgtctgtt tctctatgtt tctgtctttt ctttctcatc 3181 ctgtgtattt tcggctcacc ttgtttgtca ctgttctccc ctctgccctt tcattctctc 3241 tgccctttta ccctcttcct tttcccttgg ttctctcagt tctgtatctg cccttcaccc 3301 tctcacactg ctgtttccca actcgttgtc tgtattttgg cctgaactgt gtcttcccaa 3361 ccctgtgttt tctcactgtt tctttttctc ttttggagcc tcctccttgc tcctctgtcc 3421 cttctctctt tccttatcat cctcgctcct cattcctgcg tctgcttcct ccccagcaaa 3481 agcgtgatct tgctgggtcg gcacagcctg tttcatcctg aagacacagg ccaggtattt 3541 
caggtcagcc acagcttccc acacccgctc tacgatatga gcctcctgaa gaatcgattc 3601 ctcaggccag gtgatgactc cagccacgac ctcatgctgc tccgcctgtc agagcctgcc 3661 gagctcacgg atgctgtgaa ggtcatggac ctgcccaccc aggagccagc actggggacc 3721 acctgctacg cctcaggctg gggcagcatt gaaccagagg agtgtacgcc tgggccagat 3781 ggtgcagccg ggagcccaga tgcctgggtc tgagggagga ggggacagga ctcctgggtc 3841 tgagggagga gggccaagga accaggtggg gtccagccca caacagtgtt tttgcctggc 3901 ccgtagtctt gaccccaaag aaacttcagt gtgtggacct ccatgttatt tccaatgacg 3961 tgtgtgcgca agttcaccct cagaaggtga ccaagttcat gctgtgtgct ggacgctgga 4021 cagggggcaa aagcacctgc tcggtgagtc atccctactc ccaagatctt gagggaaagg 4081 tgagtgggac cttaattctg ggctggggtc tagaagccaa caaggcgtct gcctcccctg 4141 ctccccagct gtagccatgc cacctccccg tgtctcatct cattccctcc ttccctcttc 4201 tttgactccc tcaaggcaat aggttattct tacagcacaa ctcatctgtt cctgcgttca 4261 gcacacggtt actaggcacc tgctatgcac ccagcactgc cctagagcct gggacatagc 4321 agtgaacaga cagagagcag cccctccctt ctgtagcccc caagccagtg aggggcacag 4381 gcaggaacag ggaccacaac acagaaaagc tggagggtgt caggaggtga tcaggctctc 4441 ggggagggag aaggggtggg gagtgtgact gggaggagac atcctgcaga aggtgggagt 4501 gagcaaacac ctgcgcaggg gaggggaggg cctgcggcac ctgggggagc agagggaaca 4561 gcatctggcc aggcctggga ggaggggcct agagggcgtc aggagcagag aggaggttgc 4621 ctggctggag tgaaggatcg gggcagggtg cgagagggaa caaaggaccc ctcctgcagg 4681 gcctcacctg ggccacagga ggacactgct tttcctctga ggagtcagga actgtggatg 4741 gtgctggaca gaagcaggac agggcctggc tcaggtgtcc agaggctgcg ctggcctcct 4801 atgggatcag actgcaggga gggagggcag cagggatgtg gagggagtga tgatggggct 4861 gacctggggg tggctccagg cattgtcccc acctgggccc ttacccagcc tccctcacag 4921 gctcctggcc ctcagtctct cccctccact ccattctcca cctacccaca gtgggtcatt 4981 ctgatcaccg aactgaccat gccagccctg ccgatggtcc tccatggctc cctagtgccc 5041 tggagaggag gtgtctagtc agagagtagt cctggaaggt ggcctctgtg aggagccacg 5101 gggacagcat cctgcagatg gtcctggccc ttgtcccacc gacctgtcta caaggactgt 5161 cctcgtggac cctcccctct gcacaggagc tggaccctga agtcccttcc taccggccag 5221 gactggagcc 
cctacccctc tgttggaatc cctgcccacc ttcttctgga agtcggctct 5281 ggagacattt ctctcttctt ccaaagctgg gaactgctat ctgttatctg cctgtccagg 5341 tctgaaagat aggattgccc aggcagaaac tgggactgac ctatctcact ctctccctgc 5401 ttttaccctt agggtgattc tgggggccca cttgtctgta atggtgtgct tcaaggtatc 5461 acgtcatggg gcagtgaacc atgtgccctg cccgaaaggc cttccctgta caccaaggtg 5521 gtgcattacc ggaagtggat caaggacacc atcgtggcca acccctgagc acccctatca 5581 agtccctatt gtagtaaact tggaaccttg gaaatgacca ggccaagact caagcctccc 5641 cagttctact gacctttgtc cttaggtgtg aggtccaggg ttgctaggaa aagaaatcag 5701 cagacacagg tgtagaccag agtgtttctt aaatggtgta attttgtcct ctctgtgtcc 5761 tggggaatac tggccatgcc tggagacata tcactcaatt tctctgagga cacagttagg 5821 atggggtgtc tgtgttattt gtgggataca gagatgaaag aggggtggga tcc // LOCUS HSPXFGEN 11069 bp DNA PRI 27-OCT-1997 DEFINITION H.sapiens PxF gene. ACCESSION Y09048 NID g2570022 KEYWORDS Alu repetitive element; PxF protein; Sp1 binding site. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11069) AUTHORS Kammerer,S. TITLE Direct Submission JOURNAL Submitted (22-OCT-1996) S. Kammerer, Kinderkrankenhaus, Labor f. Molekulare Biologie, Lindwurmstr. 4, D- 80337 Muenchen, FRG REFERENCE 2 (bases 1 to 11069) AUTHORS Kammerer,S., Arnold,N., Gutensohn,W., Mewes,H.W., Kunau,W.H., Hofler,G., Roscher,A.A. and Braun,A. 
TITLE Genomic organization and molecular characterization of a gene encoding HsPXF, a human peroxisomal farnesylated protein JOURNAL Genomics 45 (1), 200-210 (1997) MEDLINE 97480732 FEATURES Location/Qualifiers source 1..11069 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /cell_type="leucocyte" /clone_lib="EMBL3" /clone="lambdaL15" /map="q22" repeat_region 342..621 /note="Alu repetitive element" /rpt_family="Alu" repeat_region 1832..2118 /note="Alu repetitive element" /rpt_family="Alu" misc_binding 2477..2482 /note="putative Sp1 binding site" /bound_moiety="Sp1" misc_binding 2491..2496 /note="putative Sp1 binding site" /bound_moiety="Sp1" exon 2543..2622 /number=1 gene 2553..8116 /gene="PxF" CDS join(2553..2622,4032..4141,4561..4726,5168..5253, 5470..5631,7421..7597,7822..7866,8033..8116) /gene="PxF" /codon_start=1 /product="PxF protein" /db_xref="PID:e286546" /db_xref="PID:g2570023" /translation="MAAAEEGCSVGAEADRELEELLESALDDFDKAKPSPAPPSTTTA PDASGPQKRSPGDTAKDALFASQEKFFQELFDSELASQATAEFEKAMKELAEEEPHLV EQFQKLSEAAGRVGSDMTSQQEFTSCLKETLSGLAKNATDLQNSSMSEEELTKAMEGL GMDEGDGEGNILPIMQSIMQNLLSKDVLYPSLKEITEKYPEWLQSHRESLPPEQFEKY QEQHSVMCKICEQFEAETPTDSETTQKARFEMVLDLMQQLQDLGHPPKELAGEMPPGL NFDLDALNLSGPPGASGEQCLIM" exon 4032..4141 /gene="PxF" /number=2 exon 4561..4726 /gene="PxF" /number=3 exon 5168..5253 /gene="PxF" /number=4 exon 5470..5631 /gene="PxF" /number=5 repeat_region 5965..6245 /note="Alu repetitive element" /rpt_family="Alu" repeat_region 6370..6646 /note="Alu repetitive element" /rpt_family="Alu" repeat_region 6849..7133 /note="Alu repetitive element" /rpt_family="Alu" exon 7421..7597 /gene="PxF" /number=6 exon 7822..7866 /gene="PxF" /number=7 exon 8033..>11069 /number=8 misc_feature 8084 /gene="PxF" /note="cryptic 3' splice site (exon 8)" polyA_signal 9177..9182 polyA_signal 9438..9443 polyA_signal 10774..10779 polyA_signal 10820..10825 polyA_signal 10832..10837 BASE COUNT 2987 a 2377 c 2656 g 3049 t ORIGIN 1 tacaagatgt gaaataattg tgactcccag gcaagaatta 
gaatttgttc gataaactgt 61 ggagaaccat taaatcctta aggaacagag tgatatgaaa gcgttataaa atactgttat 121 tcaacagtat tcgctataca ttagaagggc agaaggagca gtcagtaggt cagtaaatac 181 tgagatgaga tgttaggaat attaggagta aaaatagaaa agcagaaaga agtttaagaa 241 ataatgcaaa gaaagaatca tgcttgtttt tgactccatt ttaattatca tctattatct 301 tttaattaac atgcttatat tacattagtt taaagccaat tgaccaagtg tagtggctca 361 catctataat cccagtactt tgggaggcca aggtgggcat atcacttgag gtcaggagtt 421 aaagaccagc ctggccaata tggtgaaaca ctgtctctac taaaaataca aaaattagtt 481 gggcatggtg gcgcgagcct gcagacccag ctactcaggt ggctgagaca ggagaatcac 541 ttgaacccgg gaggtggagg ctgcagtgag ccaaaattgc accactgcac tccagcctgg 601 gcaacagtga aactctgtct caaaaaataa aaccaataga gatgcctgtt gatagacaag 661 ttttcttttt ttgacgtgct tgttacaact atagccatat tggagataca cgtgtaagca 721 gtttgaatga gaaagtctgg aataaattat gttgcaggtg ttggaaaaca ggtttatcag 781 gaaaggattt gattacagaa gtttagaatt gtaaaggact tcaaaagtga ttagtggcag 841 aattggggct ggaatttagg ctttctaact cacagttcat aatattttga tccaaaacac 901 actacttgtg gttggaagaa aacaggtttt gcagatatga ttttacagtc tgtgccctat 961 ctgtgaagga catagtaagt gcttgatctg gtgcttgatg cagtactcaa atgtgtgttt 1021 aggaagctaa gaatgaaagt gatgaggtga cttaacacag gacttttttt catttaaaat 1081 tcttaccttg tatcatactg tgtgtatatc tcatcatacc tccggctcac ccccacttag 1141 ctaccttaca gcagggggtg gtcctgattt atctctctgt cctcactgtg cccagaacaa 1201 gaccttaata ataggtgctc tctaagtgcg aaaacagatt tgtatgtgaa acagttgcta 1261 cattctacat ctgaatagaa caattccaaa ggaggaaagg gacgtctaga gttacctcta 1321 tctgagcatt ccccgtgttc actcagttgg aggttgggtt ctgtgtggag aaaaggagat 1381 tgttcaggga aagtaagtct attactactg cattcgaatt gccctgctca gtccaaggta 1441 ccagaggctc tcttaataac attgagttct aagttcctgg ccggccctgt gtcctttaat 1501 caaacagttc taatagataa taggaaatga gagatgccac tagattgggc agacagatgt 1561 gcctgattag ggactgcgag agcaaagagt ggccaatgaa tgcctactaa gttttaaacc 1621 ctagcctggg agttgggctt tacagcagtg aagacaacag acgaaaatct ttgcgttcat 1681 ggtgtttaaa ttctgcaggg tggggagcgg agaataaaaa agtacggtag atagtatgtc 1741 
agatactgtt cagtgttagg aagaaaagga acgggaatgg ccaaggaaag cgtttcagaa 1801 aagataatat ttgagggaaa caatgaagtg aagccgggcg cagtggctca cgcctgtaat 1861 cccagcactt cgggaggccg aggtgggcgg atcagctgag gtccggagtt cgagaccagc 1921 ctggccaacc agcctggcca acatgttaaa accccgtctc tacaaaaatt agtcggacgt 1981 ggtgtcgcgc gcctgtcatc ccagctacac gggaggctga ggcaggaaga atcgcttgaa 2041 cccgggaggc caaggctgca gtgagccgag atggcgccac tgcattccag cgtaggcgac 2101 agagcaagat tccttctcaa aaaaaaataa aaaaaaaaaa atgaagtgag agaaagtagt 2161 atgttatgtg ccataattgt catatcaatt atatatgttt agattctggt tatttgtagg 2221 cagtttgtct caatctggcc tactcacagc tcaggaaaaa ataatgaaag agcttcgcca 2281 ggtaatttgg gaagttgaaa gaaaggagtc cgggcctcta aaaccagaca gaaaataacc 2341 ctatcttatc cctaattctg tttcacagga gaaaagggaa aaatacccaa aacacaaaca 2401 ctcaaagctt cctggccctt taaaaatggc ggttccgcac tggtcaacaa taccattggt 2461 tacccgaggc agaagagggc gggagcgaag gggtggaggg cgtgtcggca ccgaggaggt 2521 cccgcctcct acggcaagtc ggaggtagca agatggccgc cgctgaggaa ggctgtagtg 2581 tcggggccga agcggacagg gaattggagg agcttctgga aagtaagagc ccatagtggg 2641 aaaggcccga agaggcacgt gtgaggaccc attctggggg ttaagtgaac ctgagatatt 2701 agggacgggc aggagaggct ctccaaaggt taaatggggg cggagagtct gacccccgag 2761 ccccccaaaa atctctgggt catcctgccc caccccggcg actggagctg gggagctcac 2821 ctggggtccc tcagtggtcc ggatggtgga tgtttccgca gacggcgttt catgatacct 2881 cttcaaactc ctctttaggt tactgagttc ttaaagttcg actccagcag cagtagcagc 2941 agcagcagca tttggttctt gctgggtagc gaactacgca ctcactgatg cttacattaa 3001 ctcctagcaa gctcgtggcc ggtccttgtg tgcctcgtcc tgtcttcctt aacacgaaga 3061 cacttttctc ctcctagtcc ttttctcttt ctctcgttag gcccctcaac agttctttca 3121 tcttgtcagt agctcaagcg gactcctttc ccttattttg agaaactgag gatgtggcga 3181 tttcagggga aagcagttca ccctcctgaa agacggggga tgttttctag cattactctc 3241 cactcccctg ttgacttcct gcccagagta gggccagtgc tctctcccta ggcctggcga 3301 acagaatggt ggtgaggaca caggcttttc agggcagagg tgaatttgta atcgcttgct 3361 caatcacgga gatgcgggca ggcagaaaaa tgaacagaaa ccctgaggag tgccaagtga 3421 ctgtgtaaat 
tagtgtacgg ttatctgcag agatcaggcc aagggagctc tagaaggttg 3481 ttggggagca ctaaactgga gaataagaga agatagttta atgccgtacc atggagtctg 3541 gtatgtagtc cctcagtctg tgctggtacc agtaccaaca ggaagctggg tgggatgtcc 3601 tgtgctttct caccaagtta ttccttttag agctccagag tgagttcaga gggatgagga 3661 catctgtgtt ctctactact gttgggtatt tgtatgggat ttctagggaa tcgttagggt 3721 gtggcaaggg agacactata gatggagatg tctctgcaga actttttaat cctctcctcc 3781 cagctctgag atggacattt gttccacctt aagggatttc tttgcctttg ctcacaataa 3841 attaggtaat ccagaaggaa gtaggaggct cattgagggt ggtctgggga ggtgggggac 3901 tcaaggagga aaattgagga aagttggcaa tttgaaaaat catattccta atgctttgcc 3961 cagaaaccac ccagccttgc ttgcattctc atctgtcgcc aacacacaca ccatgtctct 4021 tgtctctgaa ggtgctcttg atgatttcga taaagccaaa ccctccccag cacccccttc 4081 taccaccacg gcccctgatg cttcggggcc ccagaagaga tcgccaggag acactgccaa 4141 agtataaatt ccaccccctt aagggatggg gaggggatgg gcatgcagcc ctgagcagag 4201 gttgggcagg cccctatggg cagaggggaa tgtgggaggg taccttggcc tcacattggt 4261 gagtgctctt gccagcctca gtccttgtga aataaggact tgaggttgag tgggggaatc 4321 tcagagttcc ttgaaaaaat agtaggttta aatggttgga ttggtatgcc atcccatacc 4381 ctccttcaat gactagatac cttttttgct atcctcatta ggagcagtag ttagaatgtc 4441 cagtggaatt gggtaggagg gaggtgcatt tcccatagcc actgcctgtg gcttctatca 4501 gaggccaggt gcagtgtagt cctaagtaag agacgcattt cagccatcct tcccccgcag 4561 gatgccctct tcgcttccca agagaagttt ttccaggaac tattcgacag tgaactggct 4621 tcccaagcca ctgcggagtt cgagaaggca atgaaggagt tggctgagga agaaccccac 4681 ctggtggagc agttccaaaa gctctcagag gctgcaggga gagtgggtga ggagacccca 4741 gatacatgga aatatctgaa cagactctta agtctttgtt gcctgaaaac accagaattc 4801 ttagtgaagt tgatagcaat catgatgtca cagccaagga acgggtaaag gcaatgagag 4861 aaaggatggg aatccctgtt aatgaaaagg ggtgagatag gttggaaagt aatagaaaac 4921 attattcaaa actgaccatt tgaaaggtca cagtgttatt gtgggtatca tcatggagaa 4981 taaactgatg cagggtccgt gtactaaatt agatttatga aaagactatg aagataacca 5041 tagggttttc caagattggg agtgataatg attcactagt aaatgcttat atccaagctg 5101 ttaaccagtt cagtctaggc 
aaggaaacaa actcctaaat gacgctggct tttttaatct 5161 ctcctaggca gtgatatgac ctcccaacaa gaattcactt cttgcctaaa ggaaacacta 5221 agtggattag ccaaaaatgc cactgacctt caggtgagga gaaagtagta ggggcccaca 5281 aggggtccac tcccagacat ttctccagca gagtctcaaa agacagatgc tttgagtatc 5341 atcccaagtc tgttgctgtc ttattgtttg ttacaggcaa gtttagatag tttatctttg 5401 tcacctgaga gagtggtgag catcccaaga aggtagtata cctgctcacc tcttacttgt 5461 cctatctaga actccagcat gtcggaagaa gagctgacca aggccatgga ggggctaggc 5521 atggacgaag gggatgggga agggaacatc ctccccatca tgcagagtat tatgcagaac 5581 ctactctcca aggatgtgct gtacccatca ctgaaggaga tcacagaaaa ggtttgtgaa 5641 cttccatttc tgctttttct cttcattctc catcacatca actgatgtgg gctattttgt 5701 ctactcagag cctgcagaaa gagtaactcg agtgtgcttt ttttccttct ttatttattt 5761 gaggcaggga gagttggggg ttggggagag ttgtgtctct gagagaataa agctgtcaca 5821 cggtatgttg gggtataaga ggtataacgt ttggggtcta aggggaaatc cttcagttct 5881 agttttctat aaactggttc aagactctgg gattgttgtc tgatcaatct tgcatttctg 5941 acttcatttc tccagcaaga atttggctgg gcatggtggc tcatgctgta atccagcacc 6001 ttgggaggcc gaggcgggtg gatcacctga cctgaggtca ggagtccgag accagcctgg 6061 ccaacatgtg aaaccccgtc tgtagtaaaa atacaaaaat tatcctggcg tagtggctca 6121 cgcctgtgat cgcaactact caggaggcag caggagaatc acttgaacct gggaggcaga 6181 ggttgcagcg agctgagatt gtgccactgt actccggcct gggtgacaga gcaagactcc 6241 atctgaaaaa gaaaaagaat taactgagga tttacttact ggctttgtga tgtgaacagt 6301 gtaagcgtag tttaaatgat ggtccctgcc ataaaagagc ttatatgtgt ggttgtatca 6361 ttttgttttg agacagggtc tagctccagt tggagcacag tggcacgatc ttggctcacc 6421 acagcctccc catcccaggc tcaagtgttt ctcccatctc agcctcctga gtggctggga 6481 ctacaggtgt gtgccaccat gcttggccaa tgttttcaaa attttttgta gagttgagct 6541 ctcactatat tgctcaggct ggtctcaaac tcctgagctc aagtgatcct cccacctcag 6601 cctcccaaaa tgctgggatt acaagtgtga gccaccacac ctggctaaag agtgtagttt 6661 tactggagag ataaaataca ggaatagaag ctttcagtga catgggaaat gtgggtaagt 6721 gtgtaatgaa tagtacagac cacgagtcag caggccaaga cctgggtgga caaatccaac 6781 tctctagcaa ataatggttt ttacattttt 
aaggtcattt gaaaaaaact gcaaagaaaa 6841 tgtgcaatag ccgagcgtag tggctcactc ctatagtccc agcgctttgg gaggccacgg 6901 tggacagatt gcttgagccc aggagtttaa gaccagcttg agcaacatga tgaaaccctg 6961 tctctacaaa atacaaaaaa ttagccagtc acagtggtgc acacttgtag acccagctac 7021 ttggggaggc tcaggtggga ggattggctt gagcctgggg agatcgaggc tgcagtgagt 7081 ggtgatcaca ccatcacacg tccagcctgg gtgacagagt aaaactctgt ctcgaaaaaa 7141 agaaaaaaaa aacacaatag agaccatatg tggcccttga agcctaaaat attcactgtc 7201 tggcccttta cagaagctta ccaatctctg gtgaccataa atgctaacat ggccaggtga 7261 tctggttgga gtcagtgccg cttttcatga tgaacttact tatttagctt ttcgtttttc 7321 tacgtccctt tccctttcac attttaatca cattgttctg tttacccatc catatccatt 7381 gttcacttaa ttcttttcca cccattctct cttttcttag tatccagaat ggttgcagag 7441 tcatcgggaa tctctacctc cagagcagtt tgaaaaatat caggagcagc acagcgtcat 7501 gtgcaaaata tgtgagcagt ttgaggcaga gacccccaca gacagtgaaa ccactcaaaa 7561 ggctcgtttt gagatggtgc tggatcttat gcagcaggta aggcctcttg ccttctgccc 7621 actggtgctg tccacttagg tgttcagtga aggaagtatc tgggaaggat gcaatactgc 7681 tagacctagg tggaggaggg atcctctctt gtgatctctc tcagaagaaa tagagagatc 7741 attcattaga ttcctcttct ggatcctctg ttgttttctg ggcttttctc tatttcatgt 7801 tccattttcc ttattttcta gctacaagat ttaggccatc ctccaaaaga gctggctgga 7861 gagatggtga gtatgtctta tttttccaga ttttaatttg agtccccaat ttccagaatt 7921 ctttaacctc tcaggattcc acctaaggtg aaaattgtgg ctactgggga tggatcttca 7981 cgtctgtcct aaaaaatgag caactatctg agtcccttca tctcttcccc agcctcctgg 8041 cctcaacttt gacctggatg ccctcaatct ttcgggccca ccaggtgcca gtggtgaaca 8101 gtgtctgatc atgtgaaaca caacacgttt tcctctctga gtcccagcta tggggaacat 8161 ctggagtcag cagaaccatt gggacctgag gcaggagtgt cacctgcggg agaagtctgc 8221 ccgctgccct ctgtcatccc attcaagatt gtgccatacc agctgaggtt tttcctctgt 8281 ctctctagga atagggtctg tttcacaggc catttctgtg aaccctactc cattgtggtt 8341 tctgccacta tcaaagttcc agctacctgc aaggtgaagg aaggcatccc ttttggggca 8401 tgcactttct ttcctttctc aaaataatgt tatatgtggc cacactgatg ttcaccttta 8461 cgtccagggt ctttgtgcct tgtctctact ccctctcttg 
gatctgggga ggaggggcag 8521 agacctggga ctctgtattt ctatagttct cctggcagag cctttgagaa tggggagaaa 8581 cagcctgggc tggggctaca ggtctgtcac tatgctctct tgccttcaga cagaccattc 8641 tgaattctct aaagggaaag ggcttttgca tctaatcaca atagagttga aagagaggcc 8701 ttaggattct cctctctcta ggtgctgagc cctcacctcc ctgttccagg ctgagaactc 8761 aaatggttac cctgcttctt cctacaatgc tgtgtgatat gggtgaaccc agcccctgac 8821 cttcctctat cccctgccca tcctcccttt tacctcctct cttttttaaa cacctgttta 8881 tcccaacctt tttgagctca agctgtgata aagaagggcc catcctattt cccctcatct 8941 agtccattta cgattctcac tgactccccg tcttcctggc agacacaaat aaacccagtg 9001 tcaggtctag gaaattaatg gctattcttc cccagataca ttctggctta tttgagatac 9061 atgattctct tagaatcctg tcccttggtt caggaaagta gcttggaaaa ggagtagggg 9121 tatagcttgg gtcccttttc ctgcaaggcc ccatggggca gaatataata aatattctga 9181 gtgaggagtg tggtcttttt ctgatcttcc tcagcttccg taagttgcag agtgaggtat 9241 attaggagac tagttctaca caatattgta atgctgggtt ccatcaacac ccaccttcca 9301 caactcagtc tgcacctcag ttggcaaagg agactggatg gccatctttc ctcatgttcc 9361 cttgagtatt tcaatgtaga aagcccttca agtggtatta tattttaacc ttttacatta 9421 ttgttattaa tgttagtaat atattgttat gttttctaaa ttatttttct ttaagctgac 9481 gtggcttttt ttctgtggct cccagtgggt ctacggacct tggctgacat atgttggtag 9541 gtactctggt cagctcagct ggctgtcctg gttcactcag aagataagtc tctccaaagc 9601 aaattcacat gcattatgag tcgctttgag cttctgacat gtcacttgcc ccgaggttaa 9661 aacttttcac cccttgaaga ccttacatgt tttatggtat tggtgaggaa ggaaatgttc 9721 tcaaggtctc aggctatttg ggaaattcca actcctatac cttaccagag catggaagag 9781 cccagatctg aatgtaaaac gtctctgttc tgccagagat ggaaaaaata caggtatact 9841 tgtgatatag tcatggggct tcagtgtcac tattttctcc ttaaagctcc agccaaaaac 9901 tggacaagga tagagaggag gagggaagaa caaaagagcc cttctctatg aaccttgtgc 9961 cttctgtcct accagttttc ttttacagat tctcacttct gctagcctag ccagggctta 10021 ctccaggaat ctaaatagat gccctagtcc actttatctt tgttcccaag gcactcattt 10081 ttattttgat tttgattgaa tgtgagcagg ttgacctcag gtcacacttt gttccaaaaa 10141 cttttggaat tattccagga cttgtggtgg agttatggta 
ctctagggca gtctttctca 10201 aactatgtat ggtaaaggac caggtttttt gttttccagt ccttcactta tcaatatgca 10261 ttcctattgc cgatgacagg tatggagttc acactgtgtg ctgccgaccc ggcaagtttg 10321 acagcaccca aactggccag actgttctgt aggttaagtc cattgatcat gtacttggat 10381 atcacagcaa cattgaaatg ctaaaaagtt tttaaacact ctcaatttct aattcaccat 10441 gtcacagact ggtgaaaaaa aaaaaaaggt gttcactgac cagcacaagt ctgcagatca 10501 tctttgagta gcactgtttt ggggccctcg gtctctctga agaccctagc agaactgata 10561 cctacctgta tctcttgttc tctcctattt gagtttcact tccagagaac ttgttcttca 10621 gcaagaatgt gtcactagta aggacatctc tagcatttct ctagccttcc ttttctgctg 10681 ctcaaaaata atcgttacaa agcttaggtt taagctgtat atgaaatatt tatgcgactc 10741 tcaaacttta aaggagttgc tcctttgttc caaaattaaa tgtgttagat aaatttgtga 10801 ttgtatgggt ggcttcatga attaagaatt gaattaatac agactttttg ataattggga 10861 cattttcctg tgactctgga aggtcttgtc aaagccccat gaggcagtta taatatgatg 10921 ttaagatggc tttggggagc taggagaaaa cattggccca tattgaatga catggagacg 10981 gcagaccata caaaaaagga ccaaaagact tagaggaaat gggtatgcaa gaatgaatgt 11041 tatataacac tggtaaaccc accagctga // LOCUS HSQC8B6 21480 bp DNA PRI 12-DEC-1995 DEFINITION Human DNA sequence from cosmid QC8B6, on chromosome Xq28, containing red opsin gene. ACCESSION Z68193 NID g1122283 KEYWORDS Red opsin; Xq28. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 21480) AUTHORS Buck,D. TITLE Direct Submission JOURNAL Submitted (06-DEC-1995) Sanger Centre, Hinxton, Cambridgeshire, CB10 1RQ, England. E-mail enquires: humpub@sanger.ac.uk COMMENT The true left end of clone QC8B6 is at 1 in this sequence. QC8B6 is from a human chromosome X-specific cosmid library. 
FEATURES Location/Qualifiers source 1..21480 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq28" /clone="QC8B6" misc_feature 1..5586 /note="match: S44029 red visual pigment 5' region" repeat_region 2966..3145 /note="MER12 element fragment" repeat_region 2979..3143 /note="MER12 element fragment" repeat_region 2981..3145 /note="MER12 element fragment" repeat_region 3242..3322 /note="L1 element fragment" repeat_region 3351..3592 /partial /note="Alu repeat: matches 255..1 of consensus" repeat_region 3613..3802 /note="L1 element fragment" repeat_region 3961..4223 /note="L1 element fragment" CDS join(5584..5695,11954..12250,14239..14407,15875..16040, 17595..17834,20117..20227) /codon_start=1 /product="Red Opsin" /db_xref="PID:e213786" /db_xref="PID:g1122284" /translation="MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPN YHIAPRWVYHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAE TVIASTISIVNQVSGYFVLGHPMCVLEGYTVSLCGITGLWSLAIISWERWLVVCKPFG NVRFDAKLAIVGIAFSWIWSAVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQS YMIVLMVTCCIIPLAIIMLCYLQVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMIFA YCVCWGPYTFFACFAAANPGYAFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCIL QLFGKKVDDGSELSSASKTEVSSVSSVSPA" repeat_region 6483..6789 /note="Alu repeat: matches 1..308 of consensus" repeat_region 7924..8231 /note="Alu repeat: matches 308..1 of consensus" repeat_region 9271..9459 /partial /note="Alu repeat: matches 24..228 of consensus" repeat_region 9460..9547 /partial /note="Alu repeat: matches 210..298 of consensus" repeat_region 9550..9869 /note="Alu repeat: matches 308..1 of consensus" repeat_region 10223..10538 /note="Alu repeat: matches 1..308 of consensus" repeat_region 10545..10876 /note="Alu repeat: matches 1..308 of consensus" repeat_region 11104..11204 /partial /note="Alu repeat: matches 102..1 of consensus" repeat_region 11224..11295 /partial /note="Alu repeat: matches 222..151 of consensus" repeat_region 13563..13851 /note="Alu repeat: matches 1..308 of consensus" repeat_region 15267..15564 /note="Alu repeat: 
matches 308..1 of consensus" misc_feature 18720..18928 /note="Match: STS HSSWX140" misc_feature 18721..18867 /note="Match: STS HSSWX141" BASE COUNT 4876 a 5936 c 5574 g 5094 t ORIGIN 1 ttagtcagga tctgtgaagg atgtcaatgg ctgtgaggcg agggcccctt gccagtcccc 61 atgttcgtta tttatggatg cctgatgccc acaacctcga cttctcagct gagggcaact 121 cctcctgtcc tctcgctgct tccacctaag ccaccaggaa atcctgttga ctccacttac 181 aaaagctctc cagagtctgc tacctcttac caccctactg tcgcctccca gacctggtct 241 ccccgcttct gccctggccc ctacagctcg cagacagctg ccagaggagt ccatccgaat 301 caacttcaga ttgcgtcact ttttctttca aaatccgaca aagctcctca ttccattcag 361 agtgagaagg gccccgccat gcagcctttc ctcctcctat tcaccaaccc cactctcgct 421 ccagccatct tgatatcctt gctggtcctg gagcaggcca gccacgctct tgccccaggg 481 cctttgcact tgctgcccct ctcctgtgat gccctcctgg atctctgcat ggctccctcc 541 tgccctccct ccctcgagtc tttgctcaca tgtccccttc tcagaggggc tcaccctggt 601 gcccttcgaa agtgggcatc cactcccttc ccaccctggc acccgcaccc tcctgtcttc 661 ctttttctgt tctccatcct actcatctcc cccaactaga aaggcagctg cagctgcgga 721 gcacgggatc tccatctgca gctgcatccc gggacctaga acaggtacaa aagtacctaa 781 taagtaccca ctgaatgaat aattggctga gttttctgta ccaaagactg agctaagact 841 ctttaaaagg atgatcttat ttaagcctta cagcaagtaa atgttatccc catcttcctg 901 atgaggacac agtgaccacc acgctcaagg acacagaggg tggacgtgcc acattcacac 961 tctgtgactt agagcggctg gacgggcagg gacttaggag gcctacagca gccagggtga 1021 gattatgagg ctgagctgag aatatcaaga ctgtaccgag tagggggcct tggcaagtgt 1081 ggagagcccg gcagctgggg cagagggcgg agtacggtgt gcgtttacgg acctcttcaa 1141 acgaggtagg aaggtcagaa gtcaaaaagg gaacaaatga tgtttaacca cacaaaaatg 1201 aaaatccaat ggttggatat ccattccaaa tacacaaagg caacggataa gtgatccggg 1261 ccaggcacag aaggccatgc acccgtagga ttgcactcag agctcccaaa tgcataggaa 1321 tagaagggtg ggtgcaggag gctgaggggt ggggaaaggg catgggtgtt tcatgaggac 1381 agagcttccg tttcatgcaa tgaaaagagt ttggagacgg atggtggtga ctggactata 1441 cacttacaca cggtagcgat ggtacacttt gtattatgta tattttacca cgatcttttt 1501 aaagtgtcaa aggcaaatgg ccaaatggtt ccttgtccta tagctgtagc 
agccatcggc 1561 tgttagtgac aaagcccctg agtcaagatg acagcagccc ccataactcc taatcggctc 1621 tcccgcgtgg agtcatttag gagtagtcgc attagagaca agtccaacat ctaatcttcc 1681 accctggcca gggccccagc tggcagcgag ggtgggagac tccgggcaga gcagagggcg 1741 ctgacattgg ggcccggcct ggcttgggtc cctctggcct ttccccaggg gccctctttc 1801 cttggggctt tcttgggccg ccactgctcc cgctcctctc cccccatccc accccctcac 1861 cccctcgttc ttcatatcct tctctagtgc tccctccact ttcatccacc cttctgcaag 1921 agtgtgggac cacaaatgag ttttcacctg gcctggggac acacgtgccc ccacaggtgc 1981 tgagtgactt tctaggacag taatctgctt taggctaaaa tgggacttga tcttctgtta 2041 gccctaatca tcaattagca gagccggtga aggtgcagaa cctaccgcct ttccaggcct 2101 cctcccacct ctgccacctc cactctcctt cctgggatgt gggggctggc acacgtgtgg 2161 cccagggcat tggtgggatt gcactgagct gggtcattag cgtaatcctg gacaagggca 2221 gacagggcga gcggagggcc agctccgggg ctcaggcaag gctgggggct tcccccagac 2281 accccactcc tcctctgctg gacccccact tcatagggca cttcgtgttc tcaaagggct 2341 tccaaatagc atggtggcct tggatgccca gggaagcctc agagttgctt atctccctct 2401 agacagaagg ggaatctcgg tcaagaggga gaggtcgccc tgttcaaggc cacccagcca 2461 gctcatggcg gtaatgggac aaggctggcc agccatccca ccctcagaag ggacccggtg 2521 gggcaggtga tctcagagga ggctcacttc tgggtctcac attcttggat cctcggatcc 2581 tctgactctg gtggggacag gcagaccaag ctctcttgga cccgggaaga gggacccttg 2641 gaagtcactt gggattgagt tctagagtct tgacactgtt tcagcagatc tatactttga 2701 acccacctca ggcatctcat ccacagaaca gggacagtga ccattccatc ttgccaagga 2761 gtgcggggca cacaccatgc tgctggcagc cagggtggaa agtcaagggg tcccagctga 2821 agcactgcca cagaggcagg atgcagtccg agaggggact tcagagcagg ggcccaaggc 2881 caaggccatc agggaggtct ttctgcagga gggacctctt taggagttag gccctaaaaa 2941 caaataagag gaagaaaggc acttggtggt ttttagtcta ttcacggtgc tgtgcagcca 3001 tcaccaccat ccagccgcag aacttgttca tcttcccaaa ctgaagctct ggccccgttc 3061 aaccccaact ccccaccccc cagcccctgg cacccaccct tccactgtct gtctctatgg 3121 atttgaccac tctaggccct tcatataagt ggaattttac agtatttacc ctcctgtgac 3181 tggctcattt cactgagcaa tgtcctcaag ggtcatccat gttggagcat gtgtcaaact 
3241 tcatttcttt ttaagggtga atcatattcc atcgtctgta tagaccacaa tttgtgtagc 3301 taatcatccg ttgatgtttg ttgatttgtt tgtttgtttt gagacagtct tgccatcaga 3361 gctcactgca ggctcaacct cccaggctca agaaatcctc ccgcctctct acctcccaag 3421 tagctgtgac tacaggcacc tgcgactgtg ccctgctaat ttttgtattt tttgtagaga 3481 cagggtctca ctatgttgcc caggctagtc tcgaactcct gggctcaagt gatcctctca 3541 cctcggcctc ccaaagaact gggattacag gcatgagcta ccatgtccag cccaattgcc 3601 cattgatggg cacgggttgc ttccatgttt cagctgttgt gaatcacgct gctgtgaaca 3661 tgcgtgtgca aacagccctt ccagaccctg ccttccattc ctctgggcct atacccagca 3721 gtgcggttgc tgggtcctat gggaattcta cgtttaactt ttggaggagc tgccaaactg 3781 ttttccacag tggctgcgcc atcacaatcc aattttagga catttttatc acccataaag 3841 cactccctgt acccattaag aagtcatcct ccatttccct ccctcccctg tcctggcacc 3901 cattcctctg ctttgtgtgt ctctggattg ccctatctaa gcatttcaca gagatggagc 3961 catgcgctcc gtggtctttt gtgtctggct tcgctcactg agcatgctgt tctccaggtc 4021 catccacgtt gtagcgtagg tcagcccttc attccctgtt atggccaaat gatactccat 4081 tgtacagaca ggccactttt tacccactca tctgcttctg gacttttggg ttgcttctac 4141 catgtggcct gttgggaaca gtgctctgtt tgtattcatg tacgggtttt tgtgtggaca 4201 cacattttca agtctcttgg gtacacaggt gtagaagtgc cagagttggg gaaaaagctc 4261 acctttctag gctgtgaatg ggccctggca agtctgtggc caggactcgt cttctcttcc 4321 acatggggcc cctagcttgg cacctagcac gtggcaggca gcgacagatg ttaaaagcca 4381 ttcttgctat gggtagccag gctggggctc catgcagccc tggccttcag cttggcagcc 4441 agggccccct tgtgcctgca gcagaagcca tgctgccagg agtgtaagtg tgagccagga 4501 atgctggaga atcgtggctc tgagaacagg gacaagaggc cacaagctca cgccttggct 4561 ttcctaagct taaggaataa acccaaaagg aggtacctgg aaggagctgg atttggggac 4621 tgaggagctg ggagctgatg gaagccgtga aaggggatgt gctcctgggg aggcgctggg 4681 gcgggtgggc cgtggagggg acagggcccg ttggttggaa actgaggcga ggctacggag 4741 ttgggcacta acaggtcatc cgtgcccctg cgaagcgtgg ggacacaggg acagcagaga 4801 tggcctgtct ggacactctg tcgacggggg gcctgtggtt ggtgaagccc aaggcaaggc 4861 tgtgaactca gggcaaggga gacgtgagca ggcgctgccg tgggctgatg tgggcactgc 4921 
atgtgcaccc tggcggccaa aggacctaca gctcatgggg ggcaaggggg aggagggaag 4981 ccaacagcag gatgtgcgca gtcagtctgc cccccctaca ctggaggagg agccccccgg 5041 cacaaatctc gcccgtttgg gcccacggac atggctggcc tcgcaaggag gatccggttc 5101 caggcctcgg ccctaaatag tctccctggg ctttcaagag aaccacatga gaaaggagga 5161 ttcgggctct gagcagtttc accacccacc ccccagtctg caaatcctga cccgtgggtc 5221 cacctgcccc aaaggcggac gcaggacagt agaagggaac agagaacaca taaacacaga 5281 gagggccaca gcggctccca cagtcaccgc caccttcctg gcggggatgg gtggggcgtc 5341 tgagtttggt tcccagcaaa tccctctgag ccgcccttgc gggctcgcct caggagcagg 5401 ggagcaagag gtgggaggag gaggtctaag tcccaggccc aattaagaga tcaggtagtg 5461 tagggtttgg gagcttttaa ggtgaagagg cccgggctga tcccacaggc cagtataaag 5521 cgccgtgacc ctcaggtgat gcgccagggc cggctgccgt cggggacagg gctttccata 5581 gccatggccc agcagtggag cctccaaagg ctcgcaggcc gccatccgca ggacagctat 5641 gaggacagca cccagtccag catcttcacc tacaccaaca gcaactccac cagaggtgag 5701 ccagcaggcc cgtggaggct gggtggctgc actgggggcc accggccacc cacctgcccc 5761 gcccaaggga atctctcttc tgcacgtccc caccagcaga gaaggctttc tcccatagct 5821 tttctgatga catgaattgg ggggtcctct ccaaatctag aaggacacca taatatcgaa 5881 tatgcattct caagccacac aggcttccca gcccctttga gaatccgagg ccggggaaga 5941 gtttatgtgc tctttctttg tggcccgtag atgagtgtgt tcactgctag cgaatgacct 6001 ctcattccac ggagtccctc agcttcctgg ggaagagctg ggtctgtctt tacatttgaa 6061 gccgaaagag gcaacatact gacacaccca agggaggcgg gagggtgggg aagacagcag 6121 cagagggcaa gaaacttcta gaacttcagg gtcggcaaag cctgtagcag tcattttgtc 6181 aaactccatg atggggccac ttggcttttg gctgcacacc tctgggggaa gaggctgcat 6241 tggcgcccag ggccatcttt ccattcggag ccgtcctggg agagagggct caggcccaac 6301 agaaagctga aagctctcat cagggcagcc cgagtcctgc cattgggagt tgcccaatcc 6361 gaaagttttg cacgcaggcc ctcaaagaag ctgaggacac cagtgaccgc cccactcctg 6421 gccctctccc caggtccctc ctccaaacca aattcctttg gtgccttcaa gaacatcgtg 6481 caggccgggc acagtggctc acgcctgtaa tcccagcact ttgggaggca gaggcgggca 6541 gatgacgagg tcaacagata gagatcatcc tggccaacat actgaaacct catgtctact 6601 aaaaatgcaa 
aaattagctg ggcttggtgg cgcatgcctg tagtcccagc tactcaggag 6661 gctgaggcag gagaatcgct tgaacccggg aggtggaggt tgcagtgagc cgagatcacg 6721 ccactatgct ccagcctggc cacagagtga gactcttgtc tcaaaacaaa acaaaacaaa 6781 aaacaacaac atcgtgcagg ctgtggtttc cagaagccac gccagctcct tgattgccaa 6841 taaacatccc gctgtggggt ggccaggacc gagtgccaat tagtgacaga gtgcccagac 6901 caaaccggat gaggatcttg cagttgacct caacatgact gtgcccagaa tttccttggt 6961 ggcaatgtca acagtctctt cctagatgcc cccagacttc atcaatgcat gatgcttcag 7021 tgcactcttt tcaaatgtcg gggtgggttt tttttttttc cacaaaactt caagcatcta 7081 ctaaagtaga gggaggagtg taatgaactc cggtacccat cactcagctt ccacggtttc 7141 atctcatttc atctgtgacc cctccactac cctttcttcc tgattcttgg aagcaaatcc 7201 aagacatcac acccttccct ctgtaaatct ttactatgtt cctctaggag aaaagggctc 7261 ttctcaatac ataaccacaa gtcatcatca caccgacaag tgtaacagta tttcctgaat 7321 agcttcaaat atcctagtag tgttcaaaaa atgtcatacg tattttcagt ctgcttgaat 7381 cagggctcaa ataaggtcca cacattcaga ttgactgata tgccttttga ctacctttga 7441 atctagaggt tccctttcta tctccctgca atttatttgt ggaagcaagc aagtcgttca 7501 tgacgtagcc taacaggccc ctctgacgtt gttcattatg atttttctgt aaattggtag 7561 ttgatctgag gatctggcca gaggtaggtt ggatttgttg gtgtgttttg gcaaggagag 7621 tgtctctttt ctggggtgtt ggcagctact gaaactcaat gcccagacca attaaaccac 7681 tggggatgga aaatgacggc attcggacac cttaccctgc cttcacctat tggtgaccaa 7741 aaccttaaca tcttcacagg tcttcttacc ctgagggtat atgccactag gttgtgtagt 7801 aaaccggtgt gtttccagtc ccttagaata gtccctctct aagtgatatg ccactcagtg 7861 gatatgcatt tagcttcatt tcttttgttg ctgattttca gagattgctc tgtaaattta 7921 aacttttatt ttactttatt ttattttttc gagacagtct tactctgtcg cccaggctgg 7981 agtgcagtgg cgcgatctca gctcactgca acttccgcct cctcggttcg agcgattctc 8041 ctgcctcagc ctcccgagga gctgggacta caggtgcccg ccaccacgcc cagctaattt 8101 tttttatttt tagtagagac agggtttcac catgttggcc aggctggtct ccaactgctg 8161 acctcaagtc gtccgcccac ctcggcttcc caaagtgctg ggattacagg tgtgagccac 8221 cgcccccggc cacttaaatt ttgttttata attatgtaat aaaacagtta aaagtctcaa 8281 attaaaatct agaaaagaag 
gtgtatttga agaagtctgg cttctctgcg ccaccaccga 8341 ccgccccttc cctacctgcc tgtatttcct cgaatcactt tgcctgggag ttgactttga 8401 ttctcttgct cattgcttca tgaaattcag ttccagaact ttcaggaggg aggggtaggc 8461 catgacacca gctctagtta cactggtggc agctcctgtc ccctccccca ctgctgctgg 8521 gacctgttct ctcctttgcc cccttgtccc tgcactgccc aatttggacc gcaagggttg 8581 ccagggaagg gcactggctg ccttgttttc agaggtcgta gcacctagat tgctccagcc 8641 ccttgcactt gcctgcaggc cagagtgtcc caaaccctcc cagtctcagc tgctcttccc 8701 cagttcaccc aaggtacttc ccagggaaga gctgccgaca gtttgggggt tctctgttct 8761 taggtccatc agcaacccca ttgctcccct ctgcttcctt ctgcacggag actgacgcca 8821 tgcaggtctt caattgtcaa tggtctgtcc ctgctgctca tactgggggt tcctggggag 8881 ccagtgccag gtatcgggat tgcagacatt gtctgtgggt ttccagaagc tccttgtgtt 8941 aggaacatat ggggcccgtg cacagagggc agcagaggcc ttgtgggatc cagctgtgct 9001 aggggtgaga tttatctgtc tctcctggcc atagccagga aatccccatt tttcttaagc 9061 tagcttgagt tgggcttttc taacacacag ctaaagaatc tcttgataaa ccttgggact 9121 ctccatgagg ccttatatgg cagcaggtct gtggcttgca atcccttcaa gtaatctgcc 9181 aaaaacaatg ttatgacgaa ggtccttcca acacaaaagg tgtagagccc tagcaaactc 9241 ctacagaaga aaaaggagaa ataattcgtt tgtagtccca gctacttggg aggccaaggt 9301 gggaggatca cttgaagtca ggagttcgag accagcctag gcaacatagc cagaccccat 9361 ctctacagaa ataaaaaaaa ttgccattgt ggtaatgcac ggcttgtagt cccagatact 9421 cgagaggctg aggcaggagg atcgcttgag cccaggagga tcgcttgagc ccaggagttc 9481 cacgttgcag tgagctatga ttgtgccact gtactccagc ctgggtgcca gagccaggct 9541 ctatctctat ttggtgttgt tgttgttgct gttgttgttt ttgagacgga gtcttgttct 9601 gtcaccccgg ctggagtgca gtggcgtgat ctcagctcac cgcaacctct gcctcctggg 9661 ttcaagtgat tctcccgcct cagcctcctg agtagctggg actacaggcg cccaccacca 9721 tacctagcta attttttttt cttttgtatt tttagtagag acggggtttc accatggcca 9781 ggctggtttt gaactcctga cctcaagtga tccaccccct cggcctccca aagtgctggg 9841 attacaggcg tgagccaccg ctcccagccg actgtatctc taaatatata ataatcataa 9901 tcataatcag gacagccgtc atattggatt agggcccacc ctaatgacct catttaaact 9961 tggtcatttc tgtaaagacc ctatctccga 
aaacggtcac attctgaggt attggggtta 10021 ggactccaac atatgaattt gggtggggac acaattcaac tcataacaca ttcattgatt 10081 aatttcctca ttcatttatt ttctgagcat ctattgtgtg ctggacactc tgtgaggatg 10141 aatgagtcaa agtctctggt gagaaagaca agacttgtac ttgtgtctca gtactggctg 10201 ctataagaaa ttaccaggct gtggccgggc gcagtggctc acgcctgtaa tcccagcact 10261 ttgggaggcc gaggcaggcg gatcacgagg tcaggagatt gagatcatcc tagctaacat 10321 ggtgaagccc cgtctctact aaaaatagaa aaaattagcc aggcgtggtg gcgggcgcct 10381 gtagtcccag ctactcagga ggctgaagcg ggagaatggc gtgaacctgg gaggcggagc 10441 ttgcagtgag ccgagatcgt gccactgcac tccagcctgg gtgacagagc aagactccgt 10501 ctcaaaaaaa aagaaaaaaa aaagaaaaga aaaataaatt accaggctgt gcatggtggc 10561 tcatgcctgc aatcccagta ctttgggagg acaaggcagg aggatctctt gaggtcagga 10621 gtttgagacc agcctggtca acatggtgaa accccatgtc tactaaaagc acaaaaatta 10681 gctgggtgtg gtggtgggtg cctgtaaatc ccagctacta ctccggagct gaggcaggag 10741 aattgcttga acccaggagg cggaggttgc agtgacccga gatcacatca ctgcactcca 10801 gcctgggtga cagagcaaga ctttgtccca aaaaaaaaaa aaagaaaaga aagaaaggaa 10861 aagaataaaa gagaaattac catagattgg gtggctttta aatgataaat ttatttctca 10921 cagctctgga ggctggaagt cagggtgcta gcgtggtggg ctctggcgag gaccctcttc 10981 ctgactgcag attgccaaca actcattgta tcctcacatg gaagaaagag agctagagag 11041 cactctaggg actctttttc ttgtttgttt taattaaaaa aaaatttttt ttacatgggc 11101 atgccatgtt gcccaggttg gatttgaact cctgggctca agcaaccctc cagcctcagc 11161 ctcccaaagt gctgggatta caggcatgag ccaccattcc cagctaattt gggctgttcc 11221 caaaggctca agtgatcctc ccacgttggc ctcctgagta gctggggcta caggcgtgag 11281 ccaccatgcc cagcttctag gacctctttt ataagggcac taatcccatt catgagggcc 11341 ccactcactc tgcacacatg acctaaatga cctgccaaag gccccacctc ctaataccat 11401 caccttgggg gttgggattt caacacagaa atttatgggg ggcacgtaca ttcagatcat 11461 catgaacagt aactcctatg tgtgacagaa ggtgacagag gtgggtagtg gtcttcccct 11521 caagggggtg agttgccact agctggggaa tcttctggaa ggcaaatgca tatgagctgg 11581 gctttacagg aggcaagcgt ttctctatgg aagggcagag gactgtggga ggtaggaggt 11641 ggggctgggg 
caaagggaag aggggagcag ggaagtgggg tgactgcaca ctgggagtgg 11701 ggaatcagat ggaggagacg atgaggagtt ctgttaagtt caagatgcca gtgccagtga 11761 ccagcgggcg atggtctctg gcttgaggga caggatggag gggagactgt ctgaggatgg 11821 acaaagctgg agggaaacag ccaattgcaa aggcaggagg gcggaagggg gaggggagag 11881 gtgggatcag cactggtata gacaggcggt gctgcagccc agctcctctc tctcctctgc 11941 ctcctgccct caggcccctt cgaaggcccg aattaccaca tcgctcccag atgggtgtac 12001 cacctcacca gtgtctggat gatctttgtg gtcactgcat ccgtcttcac aaatgggctt 12061 gtgctggcgg ccaccatgaa gttcaagaag ctgcgccacc cgctgaactg gatcctggtg 12121 aacctggcgg tcgctgacct agcagagacc gtcatcgcca gcactatcag cattgtgaac 12181 caggtctctg gctacttcgt gctgggccac cctatgtgtg tcctggaggg ctacaccgtc 12241 tccctgtgtg gtaagccagt cggggcccag gctcggcgga aaccactcat tcaccctgca 12301 agctcctcca gccacctcat gatgatcggg gcccagctgc tcctgtaggc ctgtctccct 12361 ccacatctgc gcctcacatc catatactga agggttctgg aggcttccat ctgaacactc 12421 acattaaatt cagctccctt gagtcaaaca taccctgagt tcctactctt gagtcaggct 12481 ctgcccgggg acagccagtt tggagctgtg gggctggtgt gggaggagac agatacagag 12541 ctagacaacc ccagaacagt aggggggcgg ggactctggg caccctggac agaactcccc 12601 tgcaattagg gatgcctgct ctttcagctc gccagcatct gcttttcccg gaggagacac 12661 aattcccaga tcctctcccc atccccatca ctaatatctc tgtgggccac tattccgctc 12721 aggtcaggag acagtggccg agaggtacta gcgtgccagg ctctgtgcta aggagggggc 12781 cctatagcca gacggcaacc acacagtacc atcatcagtc ctctcagaca agaaggggcc 12841 tggggcaggt ggtggaggag cggctgggag cagtttgtgg ttcgagtgga tagagtacca 12901 ccaagcagcc gtggctgctg gacacgaggt gggcaggccc aggtctcaga ggcctcagac 12961 gtcatgccca ggagctggga ctttctttca ggaggaggag accccacatc cagcagcagc 13021 agctcctgct cttgcctccc caccactctt agcagcctcc ccaaccccac cccgttaact 13081 gcctcaaatt gtacccacga tggcccagac cagagagggt gcttgtccaa gtcccggcac 13141 taccccgata gtgtagaagg ggagccaagg gaaggtcagg cagagaaggt ccatccccag 13201 gtccgagtgc tctctgcagc aggcatggcc tcggtggtca cacgaccctt cccgagtgcc 13261 ccccctgcat ctccgcccac gtctgtctcc gtttctgcca tggtctcccg ctcacccttg 
13321 cctctgctca tggtctgttc ttgggtcagt caggtgccaa gcagccagca cttccccacc 13381 acttttggtc cacggatgcc cttggccatc tgggaagcct gtggacccca tctcaggaga 13441 atttttgcaa acgcataaaa tgagacccat aggattacaa aggcagcaaa ttatactgaa 13501 atacagttat caaagtatta aacattcatc agtaacatag tctttagtta aaagcattta 13561 ctggccaggc tcatacctgt aatcccagca ctttgggagg ctgaggtggg aggactgctt 13621 gcctccaaga gtttgagacc agcctgggca acatagtgag acctcttctc tacaacaaat 13681 aaaaacagct gggcgtggtg gcacaccagt agtcccagct actcaggagg ctcaggcggg 13741 aggatcgctt gagctctgga ggtcaaggct gcagtgagct atgatggcac cactgcactc 13801 agcctgggca acagagtgag attctgtctc aaaaagtaaa taaaaataaa agcatgtgtt 13861 aaacgtatta gtgacaccac tcagtattaa ggtattaaat aacaggatcc cgcctgacaa 13921 ccactgttat ttcagagtag tgatgaacat aagtggtatt cgaactctct gccacctcta 13981 tgaattgaca ggaaaacatc tgtgacctct cttgctgacc gagtcacggg tactgctaat 14041 actgccacgt tcataatgga aggaaatgcc cagtgtctgt tcgaggttgg tggaaagaaa 14101 gatgtcgttt tttccacctc agtccgtgga gccctgaatt ctgtgtgcag acgtttgggg 14161 tctaagcagg acagtgggaa gctttgcttc ccacctttgc tttggctcaa agccctcatc 14221 tgtctgctct ccccataggg atcacaggtc tctggtctct ggccatcatt tcctgggaga 14281 ggtggctggt ggtgtgcaag ccctttggca atgtgagatt tgatgccaag ctggccatcg 14341 tgggcattgc cttctcctgg atctggtctg ctgtgtggac agccccgccc atctttggtt 14401 ggagcaggta agggtgcgag gacgcaagat ggagtgggca gggtcagact ctgtgacctt 14461 aaggcaaatc acttcctttc tctgggcccc tctgagcgtg caatgtctat caatgtatga 14521 atgtggctgc aacataggaa aggctctgtg gtccccgaac ctctggaaac atatttatcc 14581 caagcacgat caggtcacag gcgcacacgg agctcaggcc atcagcacag ctgtcagtga 14641 acgcatagcg tgtttgcatt ccaggtctct ttcttgcaca cgctgccgca ccacgccccc 14701 cacctttcag aggctgcttg ggtcatagat ccacctgggc ctacagagca catgtcctgg 14761 ccaggccaag caagtggctc aaatgtttga ttggagtgga ctgggtggga cagcatttca 14821 ctgttttatc gacaagctcg tgaataagtt ctcgtggtgt ttggagaggg aatgttcttt 14881 cctcgagaac gttccacaat tctaggaaac aaaccttgtg gaagcctgtc tctgtctccc 14941 gccctcctca tgccgccatg ccccacacag ctgcccgtta 
tcaaacatgt gtggtgagct 15001 gaccctggtg gaggctctcc cgcgggttat ctcatttaat cctccaggcc actaagtgag 15061 cagggccctt tatttcagtc atggcctagc tgacctcaga taaaagactc agctcttcat 15121 gggtgttctc agaaggtcag ggcaagaagg aacctcacaa tccctttgta aagaagggga 15181 gtgattggga agatgaaaat gtcctggaag cagatagtgg agatggttgc acagcattgt 15241 gaatgtacca aaggtcacaa tggtactttt ttcttttttt gagacagggt ctcactctgt 15301 cactcaggct ggcacagtgc agtggtgtaa ttatggctca ctgcagcctc cacctcctgg 15361 gctcaagtga tcctcctacc tcagcctcct gaggagctgg gcctacaggt gcaccacttc 15421 acccagctaa ttttttttat tttttgtaga gacaagatct cactatgtga tccaggctag 15481 tcttgaactc ctgggctcga gcaatcctcc tacctctgcc tccaaatgtg ctgggactat 15541 aggcgtgagc cattgtgcct ggcctataat ggtacatttt atgtgatgtg tattttacca 15601 caattcaaaa agaagaaagg catgacatct aaaaatggac aaggattaac caaaatccta 15661 cccaacggtt ttgttttggg ttgatgaaaa tgttctggaa gcagaggtgg tgactgccac 15721 agaattgatc acttcaaatt gggtaatctc atgcaacatg aatttcacct caatttaaaa 15781 aaacaaaccc cacccgagtt agcaccgtgc ctgggccggg ggtcctgggt caccccaccc 15841 tgcatcagga ctggctgccg gcccttctct ccaggtactg gccccacggc ctgaagactt 15901 catgcggccc agacgtgttc agcggcagct cgtaccccgg ggtgcagtct tacatgattg 15961 tcctcatggt cacctgctgc atcatcccac tcgctatcat catgctctgc tacctccaag 16021 tgtggctggc catccgagcg gtaagccccc cgattcctcc tggcctcacc cgcctcctgc 16081 ccctaagctg ctctgccctc aaatgagtcc actgagactc ctaaactatt tttccaaaaa 16141 tccttagaga agaggatttt acccctataa gaaaatatta agatccagcg atgagaatca 16201 ggtgattcct ttgggactgt accagtggct gcaggttcag ccccagcccc gttgtcctca 16261 gctctgtgag acgggaaagc actgccactc cctccctgga ggagtccact aagggaacag 16321 aggtgtgcct tgccccgacc ctggacagtt ctccccgggg tggaaaggct gcctttccca 16381 cagagtagag tggagcagcc acatcagcaa atgacacctg caaatcaagg cgtgttttta 16441 tgaggctgcc accggagtac ccttgtcctt ttcataggct gtggggccga ccaaggagtg 16501 gacccgagag tgccatttgc ccccctgacc cactctccac ctccatgtct ggccctctgc 16561 cctgggaagc tgatcctgtc cacagccgtc accccccacc cctagactag gctaccactg 16621 ggagcccttc aggaagtcag 
agcaagggag gagagccagg ctggttcttt tctgttagca 16681 gtgggagccc tttcagggtg ctggctttcc tatatgaagc tgcctgtgcc cacaattgga 16741 tgggcatgcc tgccaagctc tctctagagg agtctgtgag cctgtgaaag gccccctcac 16801 cccgtcacct tggggtgaag gctcccacag gtacccaacc atggcttcgg ctgtattagt 16861 ctgggatggt agagccccag ctccacaatg tggccccagc tctgctgtct cagccatccc 16921 tgcattccag ccctcacact ccctctctca tccccactca tctgcctgcc gccagtccct 16981 catccctggc aggtggtggc tggcctctgg cctcccccac agtgcctctg cctggaggcc 17041 attcgtctcc ttcctcccag caggcatgaa ggagccaccc caccaaagct gccctcagct 17101 gcctcaccgt gagtccaggg caggatttag tccacagagt ggccaacctg gcctaggaag 17161 cctgagggaa gtgtatgcat tgctctgaca ctcccatcgc gcaccccgcc agccactgct 17221 tttgcctccc ccgccatctc caccttgtta actccttcat tctccacgcc cagtcatcaa 17281 tcaaatcagg cctccatgct caggcctgag cgcaggacag gacagtctgt taagggatca 17341 ggtgaagcaa aggagcttgt tagatccagc tctggggtca tcttaggcca cacctagctg 17401 catgccacct ccaattctag aactccccca gggccagcct gaggcagcca tgtctgcctg 17461 gggccggctg tgctccactc agggctggaa gatggctgct gggctcctct cctcctcccc 17521 acaactccct atgcctgggt cacctgcctc ttgctgccct ccaacccccg actcactatc 17581 cctgtctccc ttaggtggca aagcagcaga aagagtctga atccacccag aaggcagaga 17641 aggaagtgac gcgcatggtg gtggtgatga tctttgcgta ctgcgtctgc tggggaccct 17701 acaccttctt cgcatgcttt gctgctgcca accctggtta cgccttccac cctttgatgg 17761 ctgccctgcc ggcctacttt gccaaaagtg ccactatcta caaccccgtt atctatgtct 17821 ttatgaaccg gcaggtaagc aacaccatca gcagatccca ctcaaaatac cgtgtgccct 17881 agaagggtgc agtgatggcc ccacctggaa tcatgtctct gataagaagc ccgcggagca 17941 tctgggggac cctccaggga aatgaccggg aaaggctcag cgtgtgaccc agccccagcc 18001 agagctccag ctggccctta gcagaaggct taggtgtgcc ctctggaatc ctttatagtc 18061 tcggcctgag ggtggcattt cccaaagcgt ctgtgtgccg tgcgctcttc ccttccggtg 18121 gccctagaac tatggctgcc gagcttcagg ggctctcctg gcgttcagac gctctaggag 18181 ttggtgagcc ctaggtacat ccaccctagg tgtgcccctc ttctgttcag actcgaccct 18241 tctcaacctt catctctcca ttttcaaacc gtaacctctg gaatttgtct tcctataaga 18301 
acaaaagccg gccctccttg gctacactga ccaagagttc aagagctttc acgagtttgt 18361 gggttagttc aggggggacg tgctgtggtc ctgcccagag gcagcctcct tagctggcat 18421 attgggcctc agcagcaagc tgctcacaca cctaaatccc cccacctcct gcaggttaca 18481 ggcttcatta aagcgcagct gtgatgtgac ttgatggtgg ccagaaaggt gtgcagaggc 18541 ctcccatttc accaggccca gtccatccct tccactgggc tcttccttgc ttctccatct 18601 tagagccact caatggctcc agcccctttg gctcagcttt gactcacaca agccaagtct 18661 gcagagttca ttaagggttc attctctctg gtaactttta aatagtaagt aggaccaggc 18721 ctgcagtgga tttccgggaa ctcgctgtag cacactgatg cccagagtgt agttctatcc 18781 ctgacccctg tttcctgact ttcatgagga tcttttttag gtttctggaa tcctaaacta 18841 tcttgccaag tactgtcttt actggattat ttccattctc ctttccagaa ctccccctgg 18901 acagggggag acagatgtct gcacttctgg acctcaccag gcctcgaact ttgcttttac 18961 cctttccaca taattatcct gtcctgccac attctgagag aattttctgg aacgcagttc 19021 catgaagaca gcaaattttg ctcaggacag agtctggcac acagtgggtg ctcaagcagc 19081 agctgctgaa tggattcctc agccctatct cccagctctt cagccgagct gattctgctg 19141 tttgtcccgt ttcttatgtt attaatttca accattatat tttttatttt tgagagtttt 19201 gatgatagag ggagttagag ctagtcaaga gtaggcctga aatatttaga aaatgccttt 19261 ggtctgggtc ctcaaagcat tgtggttact tcagggatga cacaggacat gatttgagac 19321 attcatatgg cccagatctc tttggggtga agcagcaaag acagacccct cctggtaccg 19381 gaagacgctt ggctggagag atgaggtagg ggctagattg tcattaccta ggcctcacct 19441 tgccccagat ccatggactg gaaaaaacat gacaaccaca tgccttttca ttaatattcc 19501 tccgagccgc tcaccagaca gtctggggac aggtcaccac tgccccttag ctgtcactgt 19561 ggatgagtgt catggggctg ccgtcacaaa ctaccacaaa ctcagtggct tcaaaccaca 19621 gaaatggatt ctctcagggt tctggaaatc ttgagtctga aatcagggtg ttggcaaatg 19681 gaaaggttcc ctatggaggc cgggagggag aagcagctgc agggctgccg gcagtctttg 19741 gcgttccttg actccaaggt gtgtcacccc agtctctgcc ttcatcttca cgtggccttc 19801 ttccctctgt ctgcgtgtcc gtgtccaagc gttccttttc ttatcaggac accagtcatt 19861 cgattagggc ccaccctgct ccagtgtgac ctcatcttaa cctgaacaca tcttttgggg 19921 gacccacttc aacccagtgt agtcaccatc aactgctaag tcagatgaca 
tccccgcgtg 19981 tgagggagaa ataatccaag ccttcctcca tcccccatgg gattcggaat gggtgaaggg 20041 aaggctcggg cacgtacatt cagcacagtg ctccaccctt ccctgctctg ctcaataacg 20101 ctttctgtcc ttccagtttc gaaactgcat cttgcagctt ttcgggaaga aggttgacga 20161 tggctctgaa ctctccagcg cctccaaaac ggaggtctca tctgtgtcct cggtatcgcc 20221 tgcatgaggt ctgcctccta cccatcccgc ccaccggggc tttggccacc tctcctttcc 20281 ccctccttct ccatccctgt aaaataaatg taatttatct ttgccaaaac caacaaagtc 20341 acagaggctt tcactgcagt gtgggacgac ctgagcctct gcgtgtgcag gcactgggtc 20401 tcgagagggt gcaaggggga taaagaggag agagcgcttc atagacttta agttttcccg 20461 agcctcatgt ctaccgatgg cgtgaaagga tcctggcaaa acagaagtgt gaggcaggtg 20521 ggcgtctata tccatttcac caggctggtg gttacataat cggcaagcaa gagctgtgga 20581 ggggcttgct ggatgccctc agcacccagg aggagggagg gagctagcaa gctaaggcag 20641 gtggccctcc tggcccctta aggtccatct gctggaggcc cagagtcctt ggagtacagt 20701 ctacacctgg aggggaccca ttcctgccag tctgtggcag ggatggcgcg ccacctctgc 20761 caggccagga ccccaagccc gatcagcatc agcatggtgc aggtgcacag gcgtgagctg 20821 atcagtgacg aggggcaggc acacaaggtg gagacaaaga ccaagaggac ggttgccagt 20881 gagaggcgcg gactcaggaa cttgaacaac atctgcgggg gacggctttg gaggtgctcc 20941 gctgcctcca gttgggtgac ttgctgtagc atctccagct tggatattcg gctcttgaag 21001 gtctccgtga tctcctgcag gagacgaaaa tgcacgcacc agaagtcagc acagagttgt 21061 ggtcgtttat tgagttctta ggggtgagca gaaagcactg tggagtgggt attcgaggag 21121 ggaagcagag agcctagagc acattcaggg cagaggggag ggcgcaggct ctccagcaac 21181 agggaaagct tcatctgacc cggctgcact cccccatcca ctgtctcccg aagctgagga 21241 cctggtcaag acacagctac ccagggacgg gggtgggcgc tatgggaatg gaaaagtgag 21301 gagagggaag ccaggtctaa ggaggggttc tgagagggcg ctccctacac ctgcagccgc 21361 agcagaagca gctccacccc agatctcccg agtcagaggc tcacgggtga gcactgcagc 21421 accagagtgg caaaagcagc taagccagat ggtgggaagc ggagcgtgag tgtaaagatc // LOCUS HSRA36 32269 bp DNA PRI 13-MAR-1996 DEFINITION Human DNA sequence from cosmid RA36 from a contig from the tip of the short arm of chromosome 16, spanning 2Mb of 16p13.3. 
            Contains 3-methyl-adenine DNA glycosylase, ESTs and CpG island.
ACCESSION   Z69720
NID         g1204116
KEYWORDS    16p13.3; 3-methyl-adenine DNA glycosylase; CpG island.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 32269)
  AUTHORS   Ainscough,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-FEB-1996) Sanger Centre, Hinxton, Cambridgeshire,
            CB10 1RQ, England. E-mail enquires regarding this sequence:
            humquery@sanger.ac.uk
COMMENT     IMPORTANT: This sequence is not the entire insert of clone RA36.
            It may be shorter because we only sequence overlapping sections
            once, or longer because we arrange for a small overlap between
            neighbouring submissions.
            The true left end of clone RA36 is at 1 in this sequence.
            The true right end of clone RA36 is at 32269 in this sequence.
            The true right end of clone NFG9 is at 107.
            The true left end of clone GG4 is at 16638.
            RA36 is from a 280kb clone contig extending from the telomere of
            16p. Higgs D.R., Flint J. unpublished. MRC Molecular Haematology
            Unit, Institute of Molecular Medicine, Oxford.
            RA36 is from the library CV007K. Choo et al.,(1986) Gene 46.
            277-286.
FEATURES             Location/Qualifiers
     source          1..32269
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="16"
                     /map="16p13.3"
                     /clone="RA36"
                     /clone_lib="CV007K"
     misc_feature    1059..1914
                     /note="putative CpG island"
     repeat_region   3758..4068
                     /note="LTR7 element fragment"
     misc_feature    6490..7672
                     /note="putative CpG island"
     CDS             join(8333..8371,8465..8740,12092..12296,14441..14817)
                     /codon_start=1
                     /product="3-methyl-adenine DNA glycosylase"
                     /db_xref="PID:e224269"
                     /db_xref="PID:g1212953"
                     /translation="MVTPALQMKKPKQFCRRMGQKKQRPARAGQPHSSSDAAQAPAEQ
                     PHSSSDAAQAPCPRERCLGPPTTPGPYRSIYFSSPKGHLTRLGLEFFDQPAVPLARAF
                     LGQVLVRRLPNGTELRGRIVETEAYLGPEDEAAHSRGGRQTPRNRGMFMKPGTLYVYI
                     IYGMYFCMNISSQGDGACVLLRALEPLEGLETMRQLRSTLRKGTASRVLKDRELCSGP
                     SKLCQALAINKSFDQRDLAQDEAVWLERGPLEPSEPAVVAAARVGVGHAGEWARKPLR
                     FYVRGSPWVSVVDRVAEQDTQA"
     repeat_region   8507..8590
                     /note="2 copies of 42 mer 98 % conserved"
     repeat_region   9569..9858
                     /note="Alu repeat: matches 1..308 of consensus"
     repeat_region   10586..11326
                     /note="13 copies of 57 mer 91 % conserved"
     repeat_region   12499..12790
                     /note="Alu repeat: matches 1..308 of consensus"
     misc_feature    13296..13639
                     /note="expressed region - matched by multiple ESTs"
     misc_feature    14841..15914
                     /note="expressed region - matched by multiple ESTs"
     repeat_region   16280..16576
                     /note="Alu repeat: matches 308..1 of consensus"
     repeat_region   16582..16667
                     /partial
                     /note="Alu repeat: matches 308..223 of consensus"
     repeat_region   16688..16908
                     /partial
                     /note="Alu repeat: matches 237..1 of consensus"
     repeat_region   17269..17437
                     /partial
                     /note="Alu repeat: matches 144..308 of consensus"
     misc_feature    17737..17929
                     /note="expressed region - matched by multiple ESTs"
     misc_feature    18752..18943
                     /note="expressed region - matched by multiple ESTs"
     repeat_region   19888..20127
                     /partial
                     /note="Alu repeat: matches 52..308 of consensus"
     repeat_region   20136..20392
                     /partial
                     /note="Alu repeat: matches 1..273 of consensus"
     misc_feature    21635..21764
                     /note="expressed region - matched by multiple ESTs"
     misc_feature
22256..22370 /note="expressed region - matched by multiple ESTs" repeat_region 22612..22899 /note="Alu repeat: matches 308..1 of consensus" repeat_region 23507..23749 /partial /note="Alu repeat: matches 1..256 of consensus" repeat_region 24182..24294 /note="MLT1D element fragment" repeat_region 24390..24423 /note="17 copies of 2 mer 100 % conserved" repeat_region 24621..24908 /note="Alu repeat: matches 308..1 of consensus" repeat_region 25156..25333 /note="2 copies of 89 mer 85 % conserved" repeat_region 25569..25751 /partial /note="Alu repeat: matches 1..196 of consensus" repeat_region 25772..25833 /partial /note="Alu repeat: matches 246..308 of consensus" repeat_region 26302..26526 /partial /note="Alu repeat: matches 308..72 of consensus" repeat_region 26528..26562 /partial /note="Alu repeat: matches 35..1 of consensus" misc_feature 27182..27292 /note="expressed region - matched by multiple ESTs" repeat_region 27444..27487 /note="4 copies of 11 mer 93 % conserved" repeat_region 27707..27985 /note="Alu repeat: matches 13..308 of consensus" repeat_region 28403..28684 /note="Alu repeat: matches 1..308 of consensus" repeat_region 28717..28972 /partial /note="Alu repeat: matches 38..298 of consensus" repeat_region 28999..29044 /note="23 copies of 2 mer 91 % conserved" repeat_region 29727..30019 /note="Alu repeat: matches 1..308 of consensus" repeat_region 32191..32264 /partial /note="Alu repeat: matches 308..232 of consensus" BASE COUNT 7156 a 9756 c 9171 g 6186 t ORIGIN 1 tcctggcaca gacactgagg tgggggaact ggagccggtg tggcggaggc cctcacagcc 61 aagagcaact gggggtgccc tgggcaggga ctgtagctgg gaagatccta ggggagggcc 121 tgatggtccc tgaaaggcag gggagacctg gacgactccc cctaaaccaa gagcgcaccc 181 tagaaagtca gctccccaag gtggggaccg gccagctcat gaggggtgct ggttgagggg 241 cctgcttgtg caacggaggc cacccagcca gtgaaagcta cttggaaggg ctgggaaagg 301 gtctgaggcc ccttcaatct ggcaccacca gactcccatg cactccattt gcccctgccc 361 cattctcagg aaaaggggcc cggcttgccc agaatgaccc ctggtggcag ggggctccgc 421 ctcagggacc 
ccttgagccc cagccagctg cttggtcccc agcttctgcc cacccttggg 481 gtcggggcta ggaggttggg gttagctcca caggggtgag gagaaccgag cctgggactg 541 cccaggcatc tagctgggtc aaacctgaaa agccagtttc atccttgaat cttctcccgg 601 tctccaacag gaagtcccgt tggcgggtgg gagttcccag gataggagaa gggcctccag 661 acctaggccc cacccaccgg gacagacaca cacattcacc atctcaagcg gcacacacag 721 aagaaactct ggtcaggacc acctcggtcg gaagcaggcc ctggggggcc agggaagatc 781 ccctgcccat tccccgccca gccccacaaa catgccccca gcatgacaga caaccaggcc 841 ctctgccaat gccctgagca tctggggtct ggctgtcacc ccagccggga gaaggcctgt 901 ccccgttggc ctctgcgcac agagtgagag tctgtcttcg ggagtcgctc ccagcccacc 961 ctggagcttc gaagtgtaca gatgggggag tcagtgtcgt ggggcagaag gcggagtcag 1021 tcccccaccc ccagctccac ggctcctgct attcccatcg gtgagggcag ccgggacagg 1081 gaccatcccg ccctgggctg gagtttcagc tcggagacac cacccgccgg tcgtggccgg 1141 gaaggaaaat gcagcggggg gagttcagag gaacctgggg cacccccgga ctctagactc 1201 ccgctcctgg gaccctcagc tgttgggcgg gggctggagc cgagctgtcc cgcccggcca 1261 ggccgctggg gctgctaatc cggccacggg ctatttttgc ggcgcgccgg gctaggaatc 1321 cggggctggg ccgcctccgc gggcgactcg ggaaactcca ggaagaaaag cccggccgcc 1381 ccgggggggc ctcgccgcga cccctccccg gccccggccc cggccccgac acggccccgg 1441 ccgctctgag cagctccggc cccgggtgct gaggactcgc cccgccctcc ggacgcagcc 1501 ccggaacccg cccgcccccg gagccctgcg ctcccggccg cgctcactca ctgccgccgc 1561 cgggggctct ggggggtcct gagggcgccg gggaggaggc tgccgccgct ggccgggagg 1621 gcccgcgccg agtccccgcc cgcccgccgg tccggcccgc ccgggaatgc cctggagcga 1681 ggggcgtgcg cccgggggcg ggcccggccg gagcggggcg gtcgcgctga ctcagagctc 1741 aggaatgcgc gccgcccgcc tgcccggccc aagtcacctc ctcccggaag gccgccgggc 1801 tcccttctcc ccagggtccg cctgcgggga cacctggggc tggggtctcc agcgcctcgg 1861 cccgacggag gacctggtgg gagcggaccg cagtgtcagc gctgggcggc tccgcctctc 1921 cctggctcca gattgaggct tgagctgtcc tgccctgatt tgtgctgaga agagggtgag 1981 gagggagagc aggcacgggg taaagggcgg aggatgggtg tgtatccctg gctcccctaa 2041 acccgacctc ctccagaggc ccacgctgac aagtcctgta tcccacactc acccaatccc 2101 actaagcact cacattccct 
cacacacatt catgtgcacc gatcactcac aatcatacac 2161 acacgtcccc acatccttcc acttcatgcg ttcattcgcc tgcacatgca cgtgctcaca 2221 tgcacccacc tactcatgta catctttcat gggtacatac acgtgcacac acacacgtgt 2281 acacaccgat gcacacactc agttcaggac cacagtcatt acccctcccc ctgccctgtg 2341 cccctacccc cagaagcccc gctccaccag caggcactca ggccaggctg caccggcccc 2401 acccgaagcc actcagcaga ggcagcgcag tgtctggtgt ccactctggg ctcttctcac 2461 tgtctcagag ctcagcggcc accccggggt gggctcatat ccctgcacaa tctgggctgg 2521 actcaaaatg gcactctgct ttccgctggg gctgctggtg agagtttccc agcatttctc 2581 tcctgacact gggtgggctg accctcccag ggtgggggac aggggtgcca acagcagagt 2641 acgggcaggg ctgtccagga gctctgagtg gtgggtggcc aaggcttcct ctggatagag 2701 ggtgcaggga tcccttcgtc ccatgggtcc ctaaggaccc ctctacaggg tgccccccca 2761 tgcctcggac aaatgtgtct tcaggccctg tgctggtcac aaggtgctgc tgcttcagtg 2821 ccatcctctg ggtctgggtg gcaccagccc tctccaaact gagctgttgc acttgagggc 2881 ccgacgcaga caatggcctg ctccttggct agacagaaaa ggggagaatt tgaactaaag 2941 gtgctggtac acttaccggt agatcccaga gcctgggcaa gattcccctg cccatatcct 3001 tgtccatgcc acgggatagg gcggtgcctg cacataatgg gtgctctctg aacacaagca 3061 atatcaacac aattctcatg tagaaggaga caatggaagc tttggagggg gtggagggta 3121 actccaccag gggatgactg ccagtgcatc acggacaccg ggtgagaggg agcctgcccc 3181 agtgccaaca ctcagtgctc cgatagcctg cacgtgccac ccctcccatg ccaggctcac 3241 agccagagac aggacagtct ccatgcttag cattattgac ctgccaggga taagcaagcc 3301 ctatcgggcc tggatgccca gactaggctg gcagggccag aagtggtgtc ctctctggaa 3361 agctggggac tgggcttgcc tccaaagcat gccacagctg ggcatctggt gtcagcaggt 3421 cttcctcctg cagtggactc tgtcagaccc cacctagacc cgctccaaga cccagacact 3481 catcgccagc tgccaggagc accagtcaca gctggctcag agctgaaacc ccccgcaccc 3541 aggagtcacc ttcagtgggg agcatgcccc agtctgcctc tgcaggcaaa tccccagctc 3601 agagcctgtg tccaggaatc cccaggctct cagcccaccc tggcctcctg agtgaggtca 3661 tcagaaggat gatcagaggg gggtcgatac acgtgtgctc agtgtcaggc ctctgagccc 3721 aagcctgcac gtatacatcc agatgaagca agtgaagaat cacaaaagaa gtgaaaatgg 3781 ccggttcctg ccttaactga tgacattacc 
ttgtgaaatt ccttctcctg gctcagaagc 3841 tcccccactg aggcaccttg tgacccccac tcctccccgc cacagaacaa ccccctttga 3901 ctgtaatttt ccactgcccg cccaaaccct ataaaacggt cccaccccat ctcccttccc 3961 tgactctctt ttcttcggac tcagcccgcc tgcacccagg tgaaataaac agccttgttg 4021 ctcacacaaa gcctgcttgg tggtctcttc acacggacgc gcacgaaact caggattgta 4081 cccagtccac atagacaagt gtgtgcacat atgtgtgctc tgcctcctgc ctgccctact 4141 actctctgtc ctggtggagg agccccctat acggaaaccc agctcccggc ccctctcttc 4201 ccgcgtcatg caccctggag aaatggaaat cagctgcagt tgcccagcac tcccggggct 4261 ggtgggcagg cgtctgagaa tgggttaggg ttagggtcta gaccaaaccc agggtttgtg 4321 gccctcacac caggacacga gggaatggga gccctgacca gtaagaggag tgttcccaac 4381 agctacccca gaatcctctg ctaccccaga atcctcagct cagcttctag gccacaagta 4441 atcacccaag gggtcagacc ctgggctttg gggagtccct ggaaggatca gaagaaaccg 4501 cccggggagc agggctccag ggatgggccc atgaccagct cctggctatt tggggaagag 4561 gcagaagggg gccaggccag ccagtgtcct gaagtcatag gctatttctg aggcagctgc 4621 cccctcccca tatcccaggg tttggcaagg ggttcccata agcacctggg cccagggcct 4681 ggcagcacag ccccaggcag ggaggatgta gctcacacac acccaacctc ggcctgccca 4741 gtggctgcct tctctgccca aggaagcttt ggccctccag ggctcatcag acacagatgg 4801 ccgcagagac cagtcaagag gccagcagcc accctgctgg gaagctgcca gctcaaagct 4861 gtgtagagga gtccatgggc tctgacctga cacccacctc aacagctgtc agcacccccc 4921 tccatcagag agaagacgga gccagtgtgc tgggcaccct ggggtcataa ggctgatact 4981 agaaccctgg cccctgtagc caatacagcc acatctctca aagacttgtt tataaaggct 5041 ccagcgactg atggccacac ctcctacagc tggaggcagc tccagcccct ccacagcctg 5101 tggcctattg tggaggtcgg gggtcagggc tgctcacccc agcaccaata ctagccttca 5161 ctcctttcaa agcctcaagg gtgttctcat ctccatcttc cactgatatc cttgcctcct 5221 ccaacaagcc tgttccagcc cagcagtggg acaaactggg gacagaggaa gccccaggcc 5281 ccctcctctc acctggcacc actttctctt gtcctggacc agcccgccag cccactctgc 5341 aggccacagt tggaactctc atcccaccgc ttccagcctg tggtgggagc atgagaccac 5401 aaaccaggaa gtgggggggg ggagtgccca gagattgcgc aacccccagc actcagtcca 5461 gaacatctgg agaagttggt tgctggcgcc ccggtgccgg 
cctctccacc tcctggggca 5521 ggggccaatc tgctctccaa gtgattctct ttcctcataa gaaaggtggg gataaaaaca 5581 cccctctggg agcctctgga ctccccgtga gttctgcaag gaccatgtat cacttgtaac 5641 atcagaataa atataagtac aaagtaacac ccatcctctc cctgcccatc aagaggacaa 5701 aggctcaact ggggcctcag tttccctact agactgaggg ggagcactcg gggggatggg 5761 gggctgttcc cacaggaagg agagactcat tgtttgggag gcatggctgc agataagcag 5821 ggagggtgtg gaatcacatc cagggctgct ttataaacac ggattgatga cacattatgc 5881 ccccaccccg ggaagcggca cacagtgcta caaacagctg gtggcttcca gccggctgat 5941 ggcccctccc tccccaggaa gtcctggctg gggcccatgg gggtgggacg gctggtcaga 6001 ggaggagagg cagttggaga ccaggtcctg ccctgttagc ctttctgtag gaccaggtgt 6061 cacactcatc tgagggctcc agacacatgg cctgtgcgtc tgtcctgccc agcgtctccc 6121 acctgacgcg gccaggtgcc cagttgctct tcaaggcctt ttcaggtgct cctttccagg 6181 accagggtca cagaagtcaa agctcaggaa aagcccctcg agggtttttg tgcggcagag 6241 gtgggttgtg gggtgggatt gtgcctgcca cagtggaggg gccctgcaga cccagataaa 6301 ccttcaagtg gccagaagcg ggggatggct ctgctgggtg ctggggctgc catgggccgt 6361 gggagccagc agtgtgccca gctccctcag ggcccgtccc ctaggccctt ccgtccactg 6421 ggccaagcac cgtccctgcc cctccctagg ggcatggatc tgacttgaga ggttgtgaga 6481 gcttacaggc gctgggccgt cggggaggcc tcagaagcgt aggacggctg cgcactgccg 6541 ggccgtgttc agccctggtc tggcctcggc ctctagagga ggctgcctgc gctccagcag 6601 gcccaaccca gaacgtgggc gagctccctt cagcatccct gggcggaaag agggatgggg 6661 gctctgctgc agaggcagaa tccgcgccgc tccctccttc cttcccccga ccagcctgtg 6721 acaaccccgg ccaggggcgg gggcctccgc acaagcctgg cgtccacttc ctggataagg 6781 actccccggc ccactccgga ccagggctgg ggcggcctcc caggcgctca ctccgctggc 6841 accccaccgg aaaacacgtc tgcggcccgc cccctccccc aaagcacgac cactccgccc 6901 gggcccctcg aggatccact caggttcacg acgggcccgt cctctcggtg gtctgaccac 6961 cggctggtgg agtgggctct ggggccgcca ggcgaccagg gcgcaggcgg gggcggacag 7021 ctcattggga ggggcgccgg ggcacagtgc ggggctcgcc ccacccccag gtgccccttc 7081 cccgctctcg cctcgcaggc accgcatcgg gcccgggaat cggtccggac ctggcggtgg 7141 gcgctgggaa gaggatccac ctccacgtgg cccgccccgc cccgggggcg 
cagccagttc 7201 ccggcgctca ctgcccccct tctcccggct tccgtcccct tctgcgcagg cgccgctccg 7261 ccccggtcct aggggtgctt ccgtggtcgg cggctgctgg gctccgcgcc ggggtccgag 7321 tcccacgaag ccccggcccg agccgccgga tgcccgcgcg cagcggggcc caggtgagcg 7381 cgcgcctcgg ccgccccgcg gaacagacgc gcccaccccc aggcgcagca gcgagcgcgg 7441 ccgcgggagc gggagtgccg gggacgggcg tagcgcccac cgccccgagg gttcggggca 7501 gagccagagc ataggccaag ggccaagctc gggccgagag cagtggccgc agcgcccggg 7561 ggctgaaccc acggcgcgct ggcagcgcgg gccgagctgc ggagacggtc acgtcagcgt 7621 ccgttccagg ccgactggca gtctccgttc tacattaacg tcagcactcc cgttaaaaat 7681 aatgcatctc tcccatgcca ggaggactta ggtgctgcta aagaccagcc ctccgggtgc 7741 tgccaggccg gcgctcaccc gccaccttca tcttcccttc tcctttgccc caggacagcc 7801 gaggatgtgt ggttaggttc cccctaccca tggggaggcc agaggtggga ggctggcggc 7861 ctgctcggtc tcagcagacc ctcctagtcc ctcaggagac cttgcctttg ccccacttgc 7921 tcgttatcca gcctgggcca tgaagcagag gacagttagg gaccctgagc acgcggtggt 7981 caccccggtg ctcacccctc cctgtgtgtc cgaccttggc cctgctaaga tcctgtgttt 8041 tgaattctgg caagggttgg atgaaagggc agggctccag aaaccagctc agacgtttgc 8101 ttgggacctg catgatgagt gggaatcgga gggcaccagc cctgctgtcc caggctcagg 8161 cccccatctg ctccccaggt catgcagcct gggcccccat gccgtgcagc tcgcacatat 8221 gtggggcaga gcagccaccc tgcccccagc agcagccgtc catcgtcaga cgtgatcatt 8281 tcctgaggcc tcgagtgtgt cagggtgttt gtgcctcata acaacccaca ggatggtcac 8341 ccccgctttg cagatgaaga aaccaaagca ggtggtcaga tccagtcctt gcacttcctg 8401 agcctgacct taccacacag ctgtctccta ttcggatgct tatttatttt ttttcccatt 8461 acagttttgc cgacggatgg ggcaaaagaa gcagcgacca gctagagcag ggcagccaca 8521 cagctcgtcc gacgcagccc aggcacctgc agagcagcca cacagctcgt ccgatgcagc 8581 ccaggcacct tgccccaggg agcgctgctt gggaccgccc accactccgg gcccataccg 8641 cagcatctat ttctcaagcc caaagggcca ccttacccga ctggggttgg agttcttcga 8701 ccagccggca gtccccctgg cccgggcatt tctgggacag gtaatgtgaa catagcagcg 8761 agggccctcc tgaagtgcct gtcactgcag ttgtccctga acccctgtcc gctgcctgct 8821 ggattggccc tcagttagtc ctccttgtcc tcatgcattt agcggctttg ctcacactgg 
8881 tccctgctta gcttcatctc agttgcctga ggtcatctgg gaagaatgta gagattgtca 8941 gtgctgcttg cccagggtgg agcaggggtg ccactgtctg cccccggctc tgggcatcta 9001 gggcaggagc acttgcaccg ttagccttcc tcaccggcaa tccacctcag tttttccaca 9061 cgtgggaact gtggctgtgg ccagtgtcgg gctgcccctt cccgctgtag tggctcacag 9121 tgcagtgctg gcttcccagg gcccctgcag cttaatgtag gtcaaggatc tgggctctgc 9181 caatcagact tggagtaaga agcagggact gcccaggtcc catcctggtg gggcctacgg 9241 tgcagcctgt ccccacgagc agatgctcac acagccgtgg catcttggct ggatctgttg 9301 ggagcgaggt ggtggcggtg gtcctcgatg ccaaagcccc aggcctgcct ttctggttct 9361 ccagaggctc caaggccttc cctttgtcct ttaatgagcc tacttttagg ttataatcac 9421 caagaagaac cagaagagaa ataccagtgc caggagttgg gtgctaaagg agcagaccct 9481 agaatgtggg ctcgacctca tggagaacag agaacattct gtccccatgc ggatccttct 9541 aaaagcacct taagtaatgg gcatttgtgg ccaggtgtgg tggctcatgc cggtaatccc 9601 agcactttgg gaggcggagg tgggtggatc acgaggtcag gagtttgaga ccagcctgac 9661 caacatggtg aaaccccatc tctactaaaa atacaaaaat tagctgggtg tgatgacggg 9721 cgcctctaat ctcagctact tgggaggctg aggcaggaga atcacttgaa cccgggaggc 9781 agaggttgca gagagctgag atcacgccat tgcactccgg cctgggcgac agggcgagac 9841 tccgtctcaa aaataaaaaa aaaatacaat gggcatttgt gatacaacaa aacagcacgt 9901 gtggcaggtg cgtcgtagga acagggcaag ggcgcagcat ggcatcaggg accacaatac 9961 gtttgtctct attcagcatc cttgtcaggc cacagtcatg cccacttgag cccattggtg 10021 gacaagatga tgtccagcag agaagagggt gtctcaccct gtgtcccttg gatcagggac 10081 agccctctgc cctctccata acctcctagt ggatgccctt gcagacccct cagccaggct 10141 tgggtcatgc actcatccct aaaagggtca ctgcagtgag ttgccatgaa atcccaggcc 10201 actctctggg ctgaggaggg tgtggtctcc cccaggaggc tctgccaaca aggaagaggg 10261 gtgtgtgttg gccatggggg tggggagggg tgagaggccc cagtatctgc cactggtgga 10321 aatggtatca gtgaaatcct gatcagaaat ttgggatgga aaaggaaaca atgagaaaag 10381 ggctgtggga acctcctctc cagagtggta gttgggcccc cccagtgtgc ccccatggag 10441 gcacaggagt gtgcagagtg aagcccctgg aggggcattc cctctaagct ccagggaaga 10501 tggaacaggg ccaccctttc tcccctggac atggcctggc caagcctggc ctgttcactt 
10561 ctgtcctcat ggctctggat gcctaaggat ttctgggctt cccgcccagg cactgacccc 10621 ttcttcccac caccctatct ccaggaaggc ctcctgaatg ctccccacac tgaccccttc 10681 ttcccaccac cctatctccg ggaaggcctc ctgaatgctc cccacactga ccccttcttc 10741 ccaccaccct atctccggga aggcctcctg aatgctcccc acactgaccc cttcttccca 10801 ccaccctacc tccgggaagc cctcctgaat gctcccggcg ctgacccctt cttcccacca 10861 ccctatctcc gggaaggcct cctgaatgct cccggcgctg accccttctt cccaccaccc 10921 tacctccggg aaggcctcct gaatgctccc cgcgctgacc ccttcttccc accaccctac 10981 ctccaggaag ccctcctgaa tgctccccgc gctgacccct tcttcccacc accctatctc 11041 cgggaagccc tcctgaatgc tccccacgct ggccccttct tcccaccacc ctacctccgg 11101 gaaggcctcc tgaatgctcc ccacgctggc cccttcttcc caccacccta cctccgggaa 11161 gccctcctga atgctcccca cgctggcccc ttcttcccac caccctatct ccgggaagcc 11221 ctcctgaatg ctccccgcgc tgaccccttc ttcccaccac cctatctccg ggaagccctc 11281 ctgaatgctc ccccggcgag catctaagct ccagtgcccc tcctccccag cccgtgatac 11341 tgaccatggc ctttggcacc acagggcttc gtgggctgag gtgaaggtag cctggctgtc 11401 cagctgacct atctggccta aaaacgagag gtgcagggga tgtggttggt ccccagtgct 11461 ctgctgttaa ctctggggct ccgagcagtg acacaactgt tggccgttac caggaccatg 11521 gtccaggctg gggcaccgtg gggcagctgg gcctactgct cagaaagccg gagggtggga 11581 gctgagtatg agggtccctt gtggccacct cagcagccca ggtttagggg acttcacggg 11641 ctctgggatt gcccaggtag ctggtgtggg gacatccttg gggtcctgtg ccttccgtga 11701 tgcctggcct cgtgctaaat gctgatggtt ttctcaaggg cacttgggaa aggaggaatc 11761 tggggaatgg agggtcctgc tgagaggact tgacctgcaa ggtgaatcca tgactgccgg 11821 ttcccagccc agcctggctg tggcctgagg tatcttcggc ctgcggtctg gttgtctagc 11881 agggctggag gagacaactg gtgccctcaa tagccacatt cagcctcagc aggagcaaga 11941 gaagaggcag gatggagacg gggcaaggtc cctggctaga cctgggcggc tgacacaccc 12001 tgagctgggg agatgaggtc caggctgggc actgttaggg tgagtggacc cccatccctt 12061 cccgccctga gcagaaactg actgcccaca ggtcctagtc cggcgacttc ctaatggcac 12121 agaactccga ggccgcatcg tggagaccga ggcatacctg gggccagagg atgaagccgc 12181 ccactcaagg ggtggccggc agaccccccg caaccgaggc 
atgttcatga agccggggac 12241 cctgtacgtg tacatcattt acggcatgta cttctgcatg aacatctcca gccagggtga 12301 gcagtgctgg ggcacggggg gttggagggc cggcagagcc ctgtccgcta gcagccaggg 12361 gaccactagg aggactgagg tggggccagc atttgagcga ggaggtgcca cttccaggcc 12421 aggccagggt ttcgggagct ctggctacgc tgacggggca gggctggtta ggttaaaggg 12481 gtagaaaggg ccgggtgggg ccgggcgcgg tggctcacgc ctgtcatccc agcacttcgg 12541 gaggccgagg cgggcggatc acctgaggtc aggagttcga caccagcctg gccaacacgg 12601 tgaaactctg tctctaccaa aaatacaaaa attagctggg tgtggtagtg ggctcctgta 12661 atcccagcta ctagggaggc ggaggcagga gaatcgcttg aacccgggag gcggaggttg 12721 cagggagcca agatcgcgcc attgcactcc agcctgggtg acagagcgag actccatctc 12781 aaaaaaaaaa aaaaaggaaa ggaaaagaaa gggttggttg gagtggaggc agaaagaagg 12841 ccgaggacac agtgtcgcct gcccttgacg gaggcagcac atgaggatga gaagctgatt 12901 ggagaagagg atgactgcag tgctaagagc agcgtggtca ggttgccaag gatggagcag 12961 tgggcacagc agggggactt agggtcggcg gaggagtcgg tgaggaaagg gaggtttggc 13021 aggaagtgat caaaggggtc atgtttttgt caggatgtgg gacttggatg tgttctgtgt 13081 gaaggagcca gggcacgggg ctgtggtgat gagggcggcc aggctttgac tcatttgcag 13141 gcggctctgt gggggctcag tgagacaacg aggggcgtgt gccctgcacc cacagggatg 13201 tagagggtcc tgctcctccc tactgaggtg ggtcagggtg ggcagcaggc accccacctg 13261 gtgagctgga agcagcgtgg gaatcacaga atggacggga acttaaaggc tttgcttggc 13321 ctggatttta tcttgaaata cttttgacag ctggctggtt gagggtatct gctcacagga 13381 acgccgcatt tgctggcttt gtccactagt gctcgcccct ggctgctgat gcggagcctc 13441 acgtggccgc agcccaagag tagggactgg cttggccacc tccaggctaa gcttcggact 13501 cccaggtggc tgggagggcc aggggtgcac aggtgcatca gagcaggtgc tgccttgctg 13561 gagggccagg gctcttctgg ccagggtcca ggtcatcatt gtccccagcc aggaatccaa 13621 ggggcctttc caaacctgca gggcagaggg aattcgggta tctgtgcttg agtgagcccc 13681 tgggcccagg agccttcgct tgctgtctct gtttctcaag gggcctggcc tggtgaggga 13741 gggggctagg ctggaggagg gatcccaagg gaggtgaggg ggctttgtca gcctcctcct 13801 gccctgcctg tgcagggtgt tgcagtcagt ccttccactg agtcattgca tgggctctcc 13861 caacatccgg tgcacactgg 
cagctgctct aagccaactc ctagccccca ccacttgacc 13921 aacacaaaca ctgagtgggt gaggcagaag gggagcgctg gggcctggct aggccaaggc 13981 ttcctgcttc ctggctgaat gatcgcaccc gaggactggc tctctggagc ttcctttgct 14041 ggctttatag ctgctgccag tcacaagacc aggggaagcc aggtggaaag gaactgatac 14101 ccagcatttg tcatgtgttt ttaacagtct ggctttgtgg gggcggccac agtgggggag 14161 gccctgcctg gtggtggaag ccagaggtgc ccacaggagg cacacctcat ggtgcaggct 14221 tggaggatgg caaggtaggc agaggggtct ggacacagtg aggtgcagcc ccctcccacc 14281 aggtcagacc caggagatgg tgcaggtgca cagagcaggt ccctggccca ggcaggaagg 14341 cagctgcacc ctccctgcag cacaggatgt ctggatgtgt actagggcag agaggacagg 14401 agcctaggga ggctccactt ccaaactgtc cgtcccacag gggacggggc ttgcgtcttg 14461 ctgcgagcac tggagcccct ggaaggtctg gagaccatgc gtcagcttcg cagcaccctc 14521 cggaaaggca ccgccagccg tgtcctcaag gaccgcgagc tctgcagtgg cccctccaag 14581 ctgtgccagg ccctggccat caacaagagc tttgaccaga gggacctggc acaggatgaa 14641 gctgtatggc tggagcgtgg tcccctggag cccagtgagc cggctgtagt ggcagcagcc 14701 cgggtgggcg tcggccatgc aggggagtgg gcccggaaac ccctccgctt ctatgtccgg 14761 ggcagcccct gggtcagtgt ggtcgacaga gtggctgagc aggacacaca ggcctgagca 14821 aagggcctgc ccagacaaga ttttttaatt gtttaaaaac cgaataaatg ttttatttct 14881 agaaaactgt gccttagcca gagctcctct aggtgatcaa cccatgtctg gagctagctc 14941 ttcctccagg acacgagagc tgggggcctg agtacgtagc gccaggcccg gtgtggatgc 15001 tggggagaat catcagtgtg ggagccgaaa gcccccgagg gtggggtcct gcacagtggg 15061 ccatgcctcc accagcaaga tgtgcacagg tgacagggct tctccagcct agcagggcca 15121 gcccaggccc tcgtgcccca gatggtcagg accaggtcac agcttggcta tgagcctgtt 15181 tgcggcttct gtggactgtg gtgaggactg ggccaggaaa ggctcagggt agcctgggag 15241 gaagaagcgc atggcagaca gaggtgctgg ggagggggcc acagggcact tcacaaatag 15301 aaggctgtca gagagacagg gacaggccac acaagtgttt ctgcacattc ttcagggtgg 15361 ccacagactg gggggtccaa ggagcaggtg tagggacaga aggagggtct gagaaacgca 15421 cagcccacat gggccttgaa ggatgcggcc tcacccagag acaggagtcc tggcaggccc 15481 ccctccagcg tggagatgcc tacgcgtgcg gcaaggactg gagggaagcg taggaacaca 15541 
gagggcagca gccccacagc ggaaccacca ggggcaagga cagcggggct ctgcaggctt 15601 cactgggcca cggccagccc gcatccaccc aatgccaggc ctcagggcca agagggctca 15661 gcctcagcac ggggggagcc ctggggtggg gagacgcgag cgcccacctg cgcaccccag 15721 cagccttccg ccctccgcct gggctcaggg gagcagagcc tggaagacgg caatgacagg 15781 gtcctcgtgg gtggtcacca ccagcacgct gcggaacttg tcaaacagca tgagcagctg 15841 ggagcgccgc gtgttctcgt tgtacataat ctcctccagg tggtggcggc cgcggaagta 15901 gtgaaggagc ctggaaggga tgggtgggtg tgagcccaac ctgacaccag cccccagagg 15961 cctctgctga agagccactg ctgggaatca gctctgagct gcccacaggc ctgaacagag 16021 ctggtggtga aggccaggga ggcagccacc acagcccccc aacaagggtg ggcaggcctc 16081 ctggacccca tgcccaccac ggtcccgctg accaccaggt gggcggagtg ggttcaggac 16141 ggcagacggc tgttcaaacc cagaggtgcc caagcctgcg tcctgatgtt gggaccaggg 16201 ttctgctggt ggcttctttt tcgtgctaca gtgagtccca cgtagctctg gagccaaatg 16261 gtcactgctt ttttttttcc tttttttgag agacggagtc tcactcagtc acccaggcta 16321 gattgcagtg gctcgatctc agctcactgc aacctcccgc ctcccagatt caagcgattc 16381 tcctgcgtca gcttcccagg taagctggga ctacaggagt gcgctaccac acctggctaa 16441 tttctctctt ttttagtaga gacagggttt cattatatgt tggccaggct ggtctcgaac 16501 tcctgaactc aggtgatcca cccacctcag cctcccaaag tgctgagatt ataaccgtga 16561 gccactgtgc ccagcctgct tttttttttt tgagacggag tctcgctctg ctgcccaggc 16621 tggaggtgca gtggcgcgat ctgggctcac tgcaactccc gccttcccgc cattctcctc 16681 ctcccgccaa gctccgcctc ccgccttcag gccattctcc tgcctcagcc tccccagtag 16741 ctgggactac aggcgcccgc caccacaccc agctaatttt ttgtattttt tagtagagac 16801 ggggtttcac catgttagcc aggatgatct cgatctcctg acctcgtgat ccacccacct 16861 cagcctccca aagtgctggg attacaggca tgagccaccg cgcctgactt tttttttttt 16921 tttttttaat gtcaccatca tttggaaggt cactgcttta gggtgctttc tgcaactcag 16981 gctggcacag aggaggctag gatgcaggtg agacctggac cagctgctca gaagagttgc 17041 aagtggttct ctcgagggcc gagtgtggtt ctcaaccgga aactcagtcc ctgacaaccc 17101 gaccactccc gggaaactat gcccctgcac tgtcccccaa gtcagggtgg ggctgacaga 17161 gaccactgag ccacatcacc tttaacatgg gggaccctgg atgcccgcag 
ggcttacagt 17221 cctgggcaaa cccctgccaa gtcccatgag ggtgagaact tggcacttaa attagccggg 17281 cgtggtggcg ggcgcctgta gtcccagcta ctcaggaggc tgaggcagga gaatggcgtg 17341 aacccgggaa gcggagcttg cagtgagccg agattgcgcc actgcagtcc gcagtccggc 17401 ctgggtgaca gagcgagact ccgtctcaaa aaaaaaaaaa aaaaaaaaga acttggcact 17461 gcctctcaag ggaaaaggaa catctaagaa gtactggaga gagaagagtc cgaaaaatag 17521 gcatgccagg cccatggccc ggcgtcttgg tgacaagccg aaccctcagg cctgggcagt 17581 ggcacctgcc cctacacacc tgaatgcaga gactccatct cgaacagggc cccatctgtt 17641 ctcagtggat ttggtggttc tgttgcgtac gtccctatcc actttctgcc ctgaccctgc 17701 aactggcccc agtcccaacg ggtcaaccta cacccacctg gcaaacatgc ggaggtcctc 17761 agggttctgg gctgcgggta cactgaggat ggctgcgcgt tcatgctccg acagctggcc 17821 agcaggttct ccgtcatcct ctggttcagt ggcgagtccc cgctgggaag tagctctgcg 17881 ctggagttgt ccatgctggg gctggtgagg gtcatgtcat cgctgctggc tgtgggggac 17941 atgggtcagg gtgaccaagt ggaattccag gacacccagg ggaacagcga gggcagagcc 18001 agcagccagc ctccaatgac acccctaact cctacaaggg ttagggtaca tctgcctgca 18061 agcattcgct gctgccccac ctcctgggtg tgacacgccc ctcatacggc tgacttggga 18121 aacgggtagc ctcccctctg ggggcaacct caagggacct gaacagaggt gcgatgtgtg 18181 ccccacaaac accccaaggt gggcctgggc atcctgtccc tcacccagga tggaggacaa 18241 agcaccctgg gcaggtgggg tcctcacatg aagcatgaag cctacagcca gacctcagcc 18301 aaggggggac tttcaaagtc ttccaccccc ttccttcccc aagcctgggg accggggacc 18361 tgggacctgg caacaggctg ctgccctcaa gccacactgg ccatgacggc tcttctgggg 18421 cgcagaggtg tcagctgtcc tggtagtggt atgagatcct gctctgctcc caggctgggg 18481 cagagattcc agcactgatt tgctgggctg ggagctgccc ctaattctag caagcctgct 18541 gaagagcccg gagacagagc tgccacctca gagtcacggt ggtgctagga gacaggagag 18601 gtgtctccac gggggacacg aggcacgtgc atgtgcgtgt gcagctgtgt ggaggcccct 18661 catcactcac agcaccaccc acgttgagcc tcctggatgc cacccccaag cgtctcccct 18721 gactaccaca gcggtgccct gggggtccta cttggggagc caaagctgag ggcgttgggc 18781 gtgctgaggc tgcgaccgcc gacccgggca gtgaagggga cgtcgtcctc tcgcggacgg 18841 ggctcctcct cgctgggtga ggccatcagg 
cagacatagg tgtgcagctg gatgagaagc 18901 cggcgctgca gcatccacac caccatctgg atgagctggg tctgcgggtg gcagcaggtg 18961 aggctggtcc ccctccccac tccaacttcc tcctgggtga tcagggagcc atggccctag 19021 acatggccac agcacgctcc ctgtgctgac gcacaggcag aaaggccctc ttctcagaaa 19081 gggagccccc agctgccagc cagctctgca aggcccgctc accctcctgg gaagggagag 19141 gggccagggg aacatctgga tcacacctgc acaggctccc accacacacc ggggccaccc 19201 agcatctggc tgaggagcag gacctgagac gtggaggccc ctgtgactgc ttcgctgagc 19261 aaacccccca ccctgtgctg ctgcagctgc ccctctgcct cctctcttct ttggcccact 19321 ggccctggag ttggacccag ccagcaagtc cacctccaca caagcccaga cactgtttaa 19381 aaagaggctg tgggcaggta ccctgagagc agaaagtcct ctccttggat ctacctgaaa 19441 gtccaccctg agcacaagag tggatccaca agaaagctga ggccagtgac cctgggcttg 19501 ggggtggagc accccacccc cacccccacc gtgagaaggc acgtcctggc caggccctcc 19561 aatgtctctg ggttagcagg tggggctggc agtgaggagc agtgaggagc cttgtgccag 19621 gtagagtcca gtccccatga gaaagtactg cccagactca ggcgttgaac agtaaggagt 19681 gtgagacaca tcagtgtcct cagatacaca tctgtctact catctctggg agggtcaaca 19741 aggagctgct ggccagggtc acctcctggg agggggctgg ggaactgggg atgggaatgg 19801 gggatatatc ccaggcaagt aagaagactt ggtcaaaagg actgcaagat atccagactt 19861 cacatgaatt taagaaataa aacagacgcg ggtggatcac ttgagctcag gagttcgaga 19921 ccagccttgc caacacagtg aaacctcatc tctactaaag atacaaaaat tagccgggtg 19981 tggtggcaca cacctgtaat cccagctact tgggaggctg aggcaggaga atcacttgaa 20041 cctgggaggc ggaggttgca gtgagccaag atcacgccac tgtacttcag cctggacaac 20101 aaagcaagtt tctatatcaa aacaataaaa aaaaagctgg gcgtagtggc tcatgcctgt 20161 aatcccagca ctttgagagg atgaggtggg tggatcacct gaggtcagga gttagagacc 20221 accctggtca acacggtgaa accccgtctc tactaaaaat acaaaaaatt agccaggcgt 20281 ggtggcgggc acctgtaaac ccagctactc gggagggtga ggcaggagat tgcttgaacc 20341 cgggaggcaa aggttgcagt gagccgagat catgccattg cactccagcc tgacgtgcaa 20401 tcaaaattaa ataaaatcaa taaaccaaaa ataaaacaga ggggcttccc agcactggag 20461 cccaggcaga acctggctga cctaggcccc ttgacaaggc ctggagcaac cactcagtca 20521 tggcctggaa 
aagccactcc gttatctaga ccacaggagg cctctgagaa gggagcacca 20581 gcagctcctc cctaagcact gccccaagag caggggtgca gagagtgcct tttattcagc 20641 tggtacaacc tggctcagct cccagaaacc aagggctctg tgaaagaccg cactgagagc 20701 tctgtcaggt ttagcgtcta acggagctga aaagccagaa tcgttctcat tcgtaatttg 20761 agaacggaaa gggtttaact ttaaggactg gttttgccta atagatctgg cacttactca 20821 gatacttctg gagacaaaga cttgagaaca cccagctcgt ctgtctccaa caccgtatcc 20881 tatctggccc tgctctgagt ctctgcccag aactcagtgg aggggaggat ggcaaagcca 20941 cctcagcaca aggctatgac acctgcagcc cacaggggaa gccactggcc ctgccacaga 21001 tgcaggtgca ccagttcatc ccacagacct gcaggaggcc tcctatcccc cagccactcc 21061 tagctcccaa cctgcctgcc cgttgccctg atccaggccc actagggtcc tctctaggct 21121 ctcttgaatg aggagattct ggggaccagt gaccccagaa ctcataccca cagaccctgt 21181 ggccacctga accatggcag ggagtgaaga cagaagcaga actctcagag ggtcaggcct 21241 ggcccagtga gaacaaaact gcctttgcac cctcatctgt aggcagtccc cagtcccaaa 21301 ggggtacaag gcaggtctgt aggcccagaa aaggtaagca ctctcctgct taactcggtt 21361 ctaacactaa aaactgcctg ccacagccag ggtgggctct atggggaggg aacaactgct 21421 caagtgttgg gtcactggcc aagcatctgc aggtggcccg gcagctgact tttgggtcct 21481 gctgggcacg tagcacttgc acggcaagcc cccaaaacca aggagaagca ggtggcagtg 21541 ctgagggctc tcagatcaga accaacagcc aggcaggtag tgcctatcca ctacacacct 21601 gccactctgg gctccttgag cttctgtgac tcacctcctg cacagcgggg gccaggggat 21661 tcctaaattc tgacaaggag accggcaagg agaacttggc aagaacggac ggcaggtcat 21721 gagatgggaa ctggtgggag aactgctcgg ccagcgggga gtacctgcag gcaggtgagc 21781 atgccagtga gtgcagcaga cccccccacc gtgttctcaa cactggcctc agtgggcagc 21841 agagaagcaa agtgtggcag gtgggtcagg acagtgcgat gagggcagaa cggtatgagg 21901 ccaggcaggc ctccaggtgg acagagcctg ggcacttccc cttgagcagc agcaggagcc 21961 atgggcacag ttcatcctgc atctcacctc tgatccctca aggtggatta gaagagccag 22021 cctggaggag ggggccaggc atatggggga cacaggagag ccacagagca cagagccaga 22081 ccagaattgg gacctgggta tgctagtggc caggagttag gctgaagtga gagcctggcc 22141 ttggggcttc cacagccttt ggtgcctggc caacacagag aacagcatct ggagggccct 
22201 gccagcctca ccccttccca ctcggggctc caggctctgg cagccagcag cactcacaga 22261 catacgctgg cattgggaga cagcatgtag acgttgttct cacacagcgg gtagatgatg 22321 atggccttgc cccagtacac cagatgagct gcaagctgga aaacctgcag gcaaaaggga 22381 agctgcaagg accatgcctg aggctggccc accgggagct gtcagcagct ctcagcctgg 22441 gtggccatgg cctaccagcc tggagaagct ggccccagcc tcatgcagaa cacacctggt 22501 ggctatggcc cgagacttcc tccctaacgt ggaatagcac cttcgcaacc tctcactagg 22561 atgtggggca cagccagcac acagcacttt ctgtgagcaa tggttttttt tttttttttt 22621 ttggagacag tcttgctctg tcgcccaggc tggagtgcag tggcacaatc tcagctcact 22681 gcaacctctt gcctcccagg ttcaagcgat tctcctgcct cagcctcctg agtagctggg 22741 attacaggca catgccacca cgcccagcta atttttgtat ttttagtaga gacagggttt 22801 caccatgttg gccaggatgg tctcgatctg acctcgtgat ccacccacct tggccttcca 22861 aagtgctggg attacaggca tgagccaccg cacccggcca caatggcctt tttgtcactg 22921 ctccctgtgc tgtttggtgg cactgttttc ccatcctatg ctgtcttgac tcccactgaa 22981 gaatggtggt ctcaggccac aagcatggga atgtctgggg tgctgggaac acagtgctgg 23041 cccagctggg cagggggctt agctggatgt cctggcaggg gcacagccct cttatacccc 23101 ctaaaactcc agcatggcct tgcccgatga tgggtgtgga gtcactcagc atgccacccc 23161 tacataccca aaggccccag gtctgcccca tgtgccccga ctcccacaga gacccccctc 23221 ccagctttgc tcttgtggac ccttgaacat caggaaacac aaacacaacc caaacccctt 23281 ccagaggaaa gctcaggcct gaggtctgag agcacagcac cccagcactt cacgaaaagg 23341 tgtgtgaaat cctaagctct cgcacttctg cctctgcatc agcccaggac accactccac 23401 cctcttggcg ggtaggcctc tgtggaggtg agtcacccct gcctggacac cgtgctgccc 23461 tgctgctgag gacacttata cctcagctca gaaaacaggg cctccagact gagtgagtgg 23521 ctcatgcctg tagtcccagc attttgggag ggcaaggcag gaggatcgcc tgaacctcgg 23581 agttcgagac caacctgggc aatatagtga gaccctcatc tctgccaaaa atttttattt 23641 aattagccag gcactgcggc acatgcctgt agtcccagct acttggaggc tgagacagga 23701 ggattgcttg cacctgggag gtgggggttg cagtgagcca agattgtgcg agtagtgccc 23761 ccaatggctc cgccaacccc gacatggcag gtttcagctc catcaccgcc ctgttccctc 23821 accccacgtg ccactcgcat gtctctctca aaaagacacc 
cccactttcc cccagctctg 23881 ctcagcatag tggtctccac ctcctgcctc tccctgcagc agcctctcca agatgctgtc 23941 cattctggaa attgaaaatc caaaatcagt acttaagggc aagcataagg tgccagcagg 24001 accaggctcc ctctggaagc tccagaagag aattcgtcct gcctcttcaa cggccagtgt 24061 ctcttggctt gcggtcacgt cactccactc tgtctctgtg gccacatcgc tgcctctttt 24121 tatcagatct tcctctgcct ccctctgata agggccttag agattacatt tagagctcaa 24181 caggatgatc ttcccatctc aagatcttct acttagtaac atctccaaag tcccacatga 24241 gttcacattc atagattttg gggattagga tgtgtatatc tgtgggggcc attattcagc 24301 caatcacaac gaatatacag cacgatccta attttcataa tacaaaataa aatgtttgtg 24361 tgtgtatata tatatatata tatatatata cacacacaca cacacacaca cacacacaca 24421 cacaataaaa tgtatatata cagataaaaa acctggaggg atctaggtta cctgtgggat 24481 atgggactgt gggaactttc tgtgtctata atttgtgtag tgtttgaagt cttgtaacat 24541 actaatttta tcatgacaca cccaaaattt acttgtagat tctctctgga ttgatataaa 24601 catgaagaat tttattttat tttatttttt gagactgggt cttgctctgt tgcccagact 24661 gcagtgtgta aaatcacaat tcactgcagc ctcgaacacc agggctcaag tgatcctccc 24721 acctcagcct cctgagtagc tgggaccatg ggcacacacc accatgccca gctaattatt 24781 taattttctg taaagacagg gtcttcctat gttgcctagg ctggtcgtga cctcctgggc 24841 ttaggcaatc ctcctgcctc atcctcccaa agtgctggga ttacaggcat gagccactgc 24901 acccagccac atgagggact ttaaactatc tggtgacatt cacacctgag gcaggtgctc 24961 cccagggggc tatctccgga aacagagtga cccaccaggg ctgggcccca ggggcttccg 25021 caccgccaca ccagcagcaa cgccctgctg acctgcagca gggacagcag ggaaggggta 25081 cagacagaag aaaacaatga caaacagcct tccctctccc cagtcaaggc cacggcaggg 25141 aagtgaaggg gacatcaaaa ggagaccaaa gctttgtcca aggagactct gagagcaagg 25201 gtgagaggca cactgtccac acaaactcct gccagacaac ctcagaagag ggcatcaaag 25261 ctttgtccaa ggagactctg acagcaagga tgagaggcac agtgtccagc acaaactcct 25321 gccagacagc ctcaggaaag ccaggcccag ctcctttcaa aagctcaaca caggaccagt 25381 gaacagccac aggcatttac catgtggtcg gtgctcatca ggacacggag ggtcactgcc 25441 tccctcactc tgcagggttg tatggagtgg tgagttcttt taaaattctc tgtctcctga 25501 atgctatgaa ggtgggcagg 
acatggtgtt tactgagtcc tgtccagccc atgaagtcca 25561 aaggcacagc tgggcatggt ggctcatgcc tgtaatacca gccctttagg aggccaaggt 25621 gcgatgatca cttgagccca ggagtatgag gccagccagg gcaacatagg gaaaccccgt 25681 ctctaccaaa aaaaaaaaaa aaaaaaaaat attagccagg cgcacacctg tagtcccagc 25741 tacttgggag gatgcttgag cctgggtgga tcatgatggc accaccacat gccagcctgg 25801 gagacagagc aagaacctgt ctcagaaaag aaaaaagaaa tacaagggca caatttggta 25861 agtgctctca cagaggatag ataaagggtc ctgagataca gaggcgggag acttactctc 25921 aaagacagaa tccaaaaagg ctgctttcgg ggaaaagtgg cctgagagtc aggccttgtt 25981 ggacaagttc ccctaacgtg ctaagcagag acctgtttca caggccaaaa gagcatggaa 26041 gctcagccag agggacccag gcgggcccac ctcacgcctg tgttcctcgg cagcctctgc 26101 tgccccctgg tggccatgct gtgcacaccg ccaccagacg agggcgccac ccacaggcac 26161 cacctgctcc caccctcagg cagtatttca atggtcccgt ggctctgggc cacgcccaca 26221 tgactagcag ccacagctga tcccatggcc tgttggccac cccatcagca gcccctgctt 26281 ctgatttctc cccgtgcttt tttttttttt ggaaacagag tcgcccaggc tggagtacag 26341 tggcacaatc tcagctaact gcaacctctg cctcccaggc tcaattgatc ctcacacctc 26401 atacccccaa gtagctggga ctacaggtgc acgccaccac acccagctat tttttttttt 26461 tttttttttg tattttttgt agagacgggg ttttgccatg ttgctcaggc tggtctcaaa 26521 gtcctgggct aggatgacag gcatgagcca ccgtgcccag ccttgcctgc gcttttcctc 26581 ccacactcca accagatccg cagacgaggc tgaggggccc tccatctaat cagaaagggc 26641 agccctctgc atggacaggc ccataagcat gtggctcagc aaggacgtgg catcttgtac 26701 agagtgcaca ggccaacaca tgtgcacaca caggggcgta aggccaccag gttaggtctc 26761 tgttctgcac atgggagggc cccgcatcca agcttcctgc ccttagttcc cgtatgttgt 26821 tcatgcacca ctgtgagacc cccatggccc tgcccgctgc tgccctgccc accccctagc 26881 tctggtgcct caacctccag gactgcaagc accccagtcc tttctctcac agactcgcct 26941 tacctctctg agacagaaga acaagtcagt cccgtggctc actcacaaag ccagccctga 27001 acccaggtct gactcagatg ccaaaacctc cagccatccc agggactggc cccaccccat 27061 ggtgggctgt gccgcggctc tcagctggac acagcccctg tggatgtact gtggcaggcg 27121 gcctgccttt cctgatgccc cagcaccgcc acatgccacc catctgggcc tccagagcta 27181 
tacctgcagc aaggccaggt ccgcatcttg ggctagctgc tgcaggttct tcacagcaga 27241 tgtggtcttg atcacccgca ctagggcagg ggagcagtca ataggaagct cacccagcaa 27301 ggacttctca tcactgagca gcagcagggc atggtagggg ctgcaaaaca atcacctgtc 27361 acggaacaca cgaagtgcag gaaccctgca ccgggtatgc accaggtcct gcaccaggta 27421 tgcaccgggt atgcaccagg tcctgcacca ggtatgcacc gggtatgcac caggtatgca 27481 cctgggactc aaacacacgg aacatacgaa gtgcagggaa ccctgcacca ggtatgcacc 27541 aggtcctgca cgaggtcctg caccgggtgt gcatcaggta tgcacctggg acacaaacag 27601 gcagctggag cctcctggcc cttaactgcg gagagaaggg ccaagagcgg gatttgagaa 27661 aagggcagtg gtctgtttgg aggccttcac aaaatctgcc aaggactggc tcatgcctat 27721 aatcccggca ctctgggagg ccaaggcagg caggtcacgt gaggtcagga gttcgaggcc 27781 agcctggcca acatggtgaa accccatctc tactaaaaat aaaaaaatta tccgggtgtg 27841 gttgcaccca cctggggtcc cagctactcg gcaggctgag gcaggagaat cacttgagcc 27901 tgggaggcgg agcttgcagt gagccgagat tgcgccattg cactccagcc tggcgacaga 27961 gtgagactct gtctcaaaaa caacaaaaac aaaacctgca aggattatgt caccctcgga 28021 aaaaaacacg acaaaagtgg ggaaggctcg aagactggac tctgtatccc acctcagaga 28081 cgcccagcaa acatctgttg attgaagaca tgaatgaatc tttcctaagg aaattgtccc 28141 aaatatggga aaagcaaaat cctcctcaca gactagggac caaagtacat taaaatatgc 28201 ttagctagga ttgaaagcaa agagcacgga ctggggaggg tggaactagg cagcgctttg 28261 ctttctgtca tttccaaatt ttcagcaatg tgcccagata tatttttata attaaaaaat 28321 gttcaaggta ttgggggact tgggaacagg tggtgccaga agactgggct cagttctttc 28381 cttgcagaaa atcaattaaa ttgccagggc atggtggcta acacctgtaa cccagcactt 28441 tggcaggccg aggcgggtgg atcacgaggt caggagttca agacgagcct ggccaacatg 28501 gtgaaacccc atctctacta aaaatacaaa aattagccag gtgtagtggt gtgcacccat 28561 aatcccagct actcaggggg ctgaggcagg agaatcgctt gaacccggga gacagaggct 28621 tcactgagtg gagactgtgc cactgcactt gagccccgag caagactaca tctcaaaaaa 28681 aaaaaaaaaa aagtcaatta aattaagaaa gtatattttg ggaggctgag atgagcgatg 28741 agtggatcac ctgaggtcag gagttcgaga ccagtctggc caacatggtg aaaccctgtc 28801 tctactaaaa gtaccaaaaa aattagccag gcatggcagt gggcgcctgt 
aatcccaact 28861 actcgggagc ctgaggcagg agaatcactt gaacttggga ggtggacgta gtggtgagcc 28921 gggatcgcac tactgaaatc cagcctgggc gataagatca aaactctgtc tctctctcac 28981 acacacacac acccccgcaa aaaaaaaaaa aaaaaagaaa aagaaaaaaa aaagaaaaag 29041 aaaataactg agatcccatg agtgctgact gaaatgcaaa agactggggt agggcaacac 29101 cattaagatg aagtccctcc agccctccgg cctccccaag gacactcagc cccacatcca 29161 ggctgccctg cccaagccta aggtggtgcc agaagggccc aggagaaacc tgggattggt 29221 catttctgga cacaactgga tacaaaatta tgtgttttaa cttctataaa cggcagagcc 29281 ccacctgcaa gcttccgcag tgggaagaag aaaaagagaa attttggaac aaagtaagtg 29341 gtaaaatctc aggcagaagc taaagggaag ggccctggca gggagtgagc acgtggggct 29401 gggcacttac cggatggctt tcaggctccg ttcgatggcc tctgggggga tcagactgga 29461 ggccgcatag tggatcttgt ggggcaggca gaagctcacc tccagccagc tgttgatgtg 29521 aagccgaact acgcccgacg tgcacaggct gcagagagtg ggcgctgtta cccgttcaca 29581 taaactttct aaccatgcac acagatcaga aaacaccctg ctcaatggtg ctgattcccc 29641 tgctgcttag gggaaactgc aggtggagac ccgggcagga gagcatctct taagatcaca 29701 ttatgtttaa gaaaagaaaa tatgggggcc gggcacggtg gctcatgcct gtaatcccag 29761 cactttggga ggccaaggtg ggcggatcac gaggtcagca gttcgagacc agcctggcta 29821 ataacacggt gaaaccccgt ctctactaaa aatacaaaaa ttagccaggt gtggtggcgg 29881 gcgcctgtag tcccagctac tcgggaggct gaggcaggag aatcacttga acccaggagg 29941 tggaggttgc agtgagctaa catcacgcca ctgcactcca gcctgggcaa caaggcaaga 30001 ctctgtctca aaaaaaaaaa aaaaaaaaag gaagaagaag aagaaagaaa aagagaaaca 30061 tatttctggc tagccctgga atctgtagtc caaccagact gccagggctt gagagaagcc 30121 gtaccctgtc ctggggactc agtgcactag aggcactcta actcaggccc ctgcccgtcc 30181 agacccctgg cagacaccca acacccagag tacatcctcc ctgtgaatgg gcctgccctg 30241 cacacctcat gaagacccat cacaggtgca cgtccaggca gaacagtggc tgtctcagag 30301 ggcgaggagg gcagggaagg aatgccaagt cccccagagt gttccagtgg tgagaagcac 30361 acagtgccca cctccctgca gtctgcagat gcccacagca cctcagttgc agtgcatgtg 30421 gataccaagt ggtcaccaca agccagccgt gctagccagc cccaacactt cactacgacg 30481 gaaaaccttt cactccacat tctatcagta 
agggtaactt ggttctttcg ggtcccaaag 30541 cttgctcagg ccaaacatag caacaccatc ccagtttcct tatttctttc acagtccact 30601 ttaaaccatc agcaaatccc atctgctcca cctgcaggga tgtccaaaat ccaggccttt 30661 tcagcatcac caggcccaag ccacccagag cactggcccc tctgactacc gctccccaca 30721 ctcccacctg ggcagtccca gccatgggcc tgcaacaccc acgttcctgt ggcctcactc 30781 cctgtgccca gtcactcagg tcccgctgtg ctcagggccc tggcatttgc ccttgcccac 30841 aaactctctg cagacgcccc aaggctcctt tactccatcg gtctccactc aagtgtcaca 30901 ttctcagtga agcacccctg accccagccc cattgacact ccaacccctg acccgctgcc 30961 taatgctgtc tccccaacca gaactcactc ctgaggaggg gggcttgtgg gtttcattct 31021 ctgctctcta cacattgtaa aacgtgtcag gcacataggc acggagctgc atttcctgaa 31081 tgaagccaag gaacacaccg ccctccctgc atgcactgtc tttcctagga agccctgccc 31141 caagaacagg ctccacagtg gggatgcagc ccacgtcatc ccctcgggga accacacaaa 31201 acaaggaagg agttgttttc accaggagct tctaaaactc cctgatggag aagccagggc 31261 gccacccaac tccaccgctc accacaaagc gcccttgcct taagtaagag ccaacagcac 31321 ttgttctatg acacagagga gatgcaactg ctcaagcaaa gctgtctgca gctggccttc 31381 accgcatccc cggaggctcc aggcaacagg ggggatctcc acctactctt gtcaaggcct 31441 ggctactcaa agtggccggc accttccttg tgtcctgcac agggaacgca gcctgtgaag 31501 cctcacaatg acaccagttc taaaatagga gcaaaacaca cacaagttct tctccaaact 31561 gcccatttct acagcacctc cccgcatcat gtttcctctg ttcactgccc cactgcccag 31621 gagagcagcc aactagtgct tttcccatca ccccctgacc ctattctgct attctcagct 31681 cccagttggc gggacaggcc aacacaccca catccggcta agtccgggag acttggccag 31741 acttctggct gccccaggcc agaaggtaca aacggcccct cggacgtctc ttaggacacg 31801 tcaatacggg gtgctcagag aacaccatgg ccccaaggtc ccagaggagg gcagcagtct 31861 cagcttcttc cagaactgca tgctaaccaa gtatgtcggg gcagacagag gctcccagaa 31921 ccttacctct ctttggtcag ggtctttgca gggtgaaaaa ctcgtatcaa accctgagaa 31981 atgtcccagc cgctaaggaa ttaaagatca caccaggaga gatttgtcca ggggatccag 32041 gttactcaac ttgagtaagc aatagttagg atcttggcat ccgtatgcga cagggaagaa 32101 agggcaagcg agaggtggcc ccatgcactg tgctgtcacc atctgggtca gaaaggaggg 32161 ccactggaaa 
taaaaggtct attttttttt tcttttttct gagacaaggt ctcactgttg 32221 ctcaggctgg agtgcagtgg cgccatctcc gctcactgca gccttgatc // LOCUS HSRAS1 6453 bp DNA PRI 03-JAN-1991 DEFINITION Human germ line gene homologous to bladder carcinoma oncogene T24 (Gene code c-Ha-ras-1) with four exons. ACCESSION V00574 J00206 J00276 J00277 K00954 NID g35886 KEYWORDS germ line; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6453) AUTHORS Capon,D.J., Chen,E.Y., Levinson,A.D., Seeburg,P.H. and Goeddel,D.V. TITLE Complete nucleotide sequences of the T24 human bladder carcinoma oncogene and its normal homologue JOURNAL Nature 302 (5903), 33-37 (1983) MEDLINE 83141783 COMMENT The paper lists the oncogene, with the differences to the wild type gene, here the sequence of the wild type gene c-Ha-ras-1 is given. FEATURES Location/Qualifiers source 1..6453 /organism="Homo sapiens" /db_xref="taxon:9606" mutation 678 /note="G is GCGGCGG in oncogene T24" gene 1664..3347 /gene="c-Ha-ras-1" CDS join(1664..1774,2042..2220,2374..2533,3231..3350) /note="c-Ha-ras-1" /codon_start=1 /db_xref="PID:g35887" /db_xref="SWISS-PROT:P01112" /translation="MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQV VIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKR VKDSDDVPMVLVGNKCDLAARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLV REIRQHKLRKLNPPDESGPGCMSCKCVLS" mutation 1692 /gene="c-Ha-ras-1" /note="G is T in oncogene T24" mutation 1783 /gene="c-Ha-ras-1" /note="A is G in oncogene T24" mutation 2707 /gene="c-Ha-ras-1" /note="A is G in oncogene T24" misc_feature 3722..3727 /note="other site (polyA signal)" polyA_site 3744 /note="polyA addition site" BASE COUNT 946 a 2287 c 2113 g 1107 t ORIGIN 1 ggatcccagc ctttccccag cccgtagccc cgggacctcc gcggtgggcg gcgccgcgct 61 gccggcgcag ggagggcctc tggtgcaccg gcaccgctga gtcgggttct ctcgccggcc 121 tgttcccggg agagcccggg gccctgctcg gagatgccgc cccgggcccc cagacaccgg 181 
ctccctggcc ttcctcgagc aaccccgagc tcggctccgg tctccagcca agcccaaccc 241 cgagaggccg cggccctact ggctccgcct cccgcgttgc tcccggaagc cccgcccgac 301 cgcggctcct gacagacggg ccgctcagcc aaccggggtg gggcggggcc cgatggcgcg 361 cagccaatgg taggccgcgc ctggcagacg gacgggcgcg gggcggggcg tgcgcaggcc 421 cgcccgagtc tccgccgccc gtgccctgcg cccgcaaccc gagccgcacc cgccgcggac 481 ggagcccatg cgcggggcga accgcgcgcc cccgcccccg ccccgccccg gcctcggccc 541 cggccctggc cccgggggca gtcgcgcctg tgaacggtga gtgcgggcag ggatcggccg 601 ggccgcgcgc cctcctcgcc cccaggcggc agcaatacgc gcggcgcggg ccgggggcgc 661 ggggccggcg ggcgtaagcg gcggcggcgg cggcgggtgg gtggggccgg gcggggcccg 721 cgggcacagg tgagcgggcg tcgggggctg cggcgggcgg gggccccttc ctccctgggg 781 cctgcgggaa tccgggcccc acccgtggcc tcgcgctggg cacggtcccc acgccggcgt 841 acccgggagc ctcgggcccg gcgccctcac acccgggggc gtctgggagg aggcggccgc 901 ggccacggca cgcccgggca cccccgattc agcatcacag gtcgcggacc aggccggggg 961 cctcagcccc agtgcctttt ccctctccgg gtctcccgcg ccgcttctcg gccccttcct 1021 gtcgctcagt ccctgcttcc caggagctcc tctgtcttct ccagctttct gtggctgaaa 1081 gatgcccccg gttccccgcc gggggtgcgg ggcgctgccc gggtctgccc tcccctcggc 1141 ggcgcctagt acgcagtagg cgctcagcaa atacttgtcg gaggcaccag cgccgcgggg 1201 cctgcaggct ggcactagcc tgcccgggca cgccgtggcg cgctccgccg tggccagacc 1261 tgttctggag gacggtaacc tcagccctcg ggcgcctccc tttagccttt ctgccgaccc 1321 agcagcttct aatttgggtg cgtggttgag agcgctcagc tgtcagccct gcctttgagg 1381 gctgggtccc ttttcccatc actgggtcat taagagcaag tgggggcgag gcgacagccc 1441 tcccgcacgc tgggttgcag ctgcacaggt aggcacgctg cagtccttgc tgcctggcgt 1501 tggggcccag ggaccgctgt gggtttgccc ttcagatggc cctgccagca gctgccctgt 1561 ggggcctggg gctgggcctg ggcctggctg agcagggccc tccttggcag gtggggcagg 1621 agaccctgta ggaggacccc gggccgcagg cccctgagga gcgatgacgg aatataagct 1681 ggtggtggtg ggcgccggcg gtgtgggcaa gagtgcgctg accatccagc tgatccagaa 1741 ccattttgtg gacgaatacg accccactat agaggtgagc ctggcgccac cgtccaggtg 1801 ccagcagctg ctgcgggcga gcccaggaca cagccaggat agggctggct gcagcccctg 1861 gtcccctgca tggtgctgtg 
gccctgtctc ctgcttcctc tagaggaggg gagtccctcg 1921 tctcagcacc ccaggagagg agggggcatg aggggcatga gaggtaccag ggagaggctg 1981 gctgtgtgaa ctccccccac ggaaggtcct gagggggtcc ctgagccctg tcctcctgca 2041 ggattcctac cggaagcagg tggtcattga tggggagacg tgcctgttgg acatcctgga 2101 taccgccggc caggaggagt acagcgccat gcgggaccag tacatgcgca ccggggaggg 2161 cttcctgtgt gtgtttgcca tcaacaacac caagtctttt gaggacatcc accagtacag 2221 gtgaaccccg tgaggctggc ccgggagccc acgccgcaca ggtggggcca ggccggctgc 2281 gtccaggcag gggcctcctg tcctctctgc gcatgtcctg gatgccgctg cgcctgcagc 2341 ccccgtagcc agctctcgct ttccacctct cagggagcag atcaaacggg tgaaggactc 2401 ggatgacgtg cccatggtgc tggtggggaa caagtgtgac ctggctgcac gcactgtgga 2461 atctcggcag gctcaggacc tcgcccgaag ctacggcatc ccctacatcg agacctcggc 2521 caagacccgg caggtgaggc agctctccac cccacagcta gccagggacc cgccccgccc 2581 cgccccagcc agggagcagc actcactgac cctctccctt gacacagggc agccgctctg 2641 gctctagctc cagctccggg accctctggg accccccggg acccatgtga cccagcggcc 2701 cctcgcgctg taagtctccc gggacggcag ggcagtgagg gaggcgaggg ccggggtctg 2761 ggctcacgcc ctgcagtcct gggccgacac agctccgggg aaggcggagg tccttgggga 2821 gagctgccct gagccaggcc ggagcggtga ccctggggcc cggcccctct tgtccccaga 2881 gtgtcccacg ggcacctgtt ggttctgagt cttagtgggg ctactgggga cacgggccgt 2941 agctgagtcg agagctgggt gcagggtggt caaaccctgg ccagacctgg agttcaggag 3001 ggccccgggc caccctgacc tttgaggggc tgctgtagca tgatgcgggt ggccctgggc 3061 acttcgagat ggccagagtc cagcttcccg tgtgtgtggt gggcctgggg aagtggctgg 3121 tggagtcggg agcttcgggc caggcaaggc ttgatcccac agcagggagc ccctcaccca 3181 ggcaggcggc cacaggccgg tccctcctga tcccatccct cctttcccag ggagtggagg 3241 atgccttcta cacgttggtg cgtgagatcc ggcagcacaa gctgcggaag ctgaaccctc 3301 ctgatgagag tggccccggc tgcatgagct gcaagtgtgt gctctcctga cgcaggtgag 3361 ggggactccc agggcggccg ccacgcccac cggatgaccc cggctccccg cccctgccgg 3421 tctcctggcc tgcggtcagc agcctccctt gtgccccgcc cagcacaagc tcaggacatg 3481 gaggtgccgg atgcaggaag gaggtgcaga cggaaggagg aggaaggaag gacggaagca 3541 aggaaggaag gaagggctgc tggagcccag 
tcaccccggg accgtgggcc gaggtgactg 3601 cagaccctcc cagggaggct gtgcacagac tgtcttgaac atcccaaatg ccaccggaac 3661 cccagccctt agctcccctc ccaggcctct gtgggccctt gtcgggcaca gatgggatca 3721 cagtaaatta ttggatggtc ttgatcttgg ttttcggctg agggtgggac acggtgcgcg 3781 tgtggcctgg catgaggtat gtcggaacct caggcctgtc cagccctggg ctctccatag 3841 cctttgggag ggggaggttg ggagaggccg gtcaggggtc tgggctgtgg tgctctctcc 3901 tcccgcctgc cccagtgtcc acggcttctg gcagagagct ctggacaagc aggcagatca 3961 taaggacaga gagcttactg tgcttctacc aactaggagg gcgtcctggt cctccagagg 4021 gaggtggttt caggggttgg ggatctgtgc cggtggctct ggtctctgct gggagccttc 4081 ttggcggtga gaggcatcac ctttcctgac ttgctcccag cgtgaaatgc acctgccaag 4141 aatggcagac atagggaccc cgcctcctgg gccttcacat gcccagtttt cttcggctct 4201 gtggcctgaa gcggtctgtg gaccttggaa gtagggctcc agcaccgact ggcctcaggc 4261 ctctgcctca ttggtggtcg ggtagcggcc agtagggcgt gggagcctgg ccatccctgc 4321 ctcctggagt ggacgaggtt ggcagctggt ccgtctgctc ctgccccact ctcccccgcc 4381 cctgccctca ccctaccctt gccccacgcc tgcctcatgg ctggttgctc ttggagcctg 4441 gtagtgtcac tggctcagcc ttgctgggta tacacaggct ctgccaccca ctctgctcca 4501 aggggcttgc cctgccttgg gccaagttct aggtctggcc acagccacag acagctcagt 4561 cccctgtgtg gtcatcctgg cttctgctgg gggcccacag cgcccctggt gcccctcccc 4621 tcccagggcc cgggttgagg ctgggccagg ccctctggga cggggacttg tgccctgtca 4681 gggttcccta tccctgaggt tgggggagag ctagcagggc atgccgctgg ctggccaggg 4741 ctgcagggac actccccctt ttgtccaggg aataccacac tcgcccttct ctccagcgaa 4801 caccacactc gcccttctct ccaggggacg ccacactccc ccttctgtcc aggggacgcc 4861 acactccccc ttctctccag gggacgccac actcgccctt ctctccaggg gacgccacac 4921 tcgcccttct ctccagggga cgccacactc gcccttctgt ccaggggacg ccacactcgc 4981 ccttctctcc aggggacgcc acactcgccc ttctctccag gggacgccac actccccctt 5041 ctgtccaggg gacgccacac tcccccttct ctccagggga cgccacactc ccccttctct 5101 ccaggggacg ccacactcgc ccttctctcc aggggacgcc acactccccc ttctgtccag 5161 gggacgccac actcgccctt ctctccaggg gacgccacac tcgcccttct ctccagggga 5221 cgccacactc ccccttctct ccaggggacg ccacactccc 
ccttctctcc aggggacgcc 5281 acactccccc ttctgtccag gggacgccac actcgccctt ctctccaggg gacgccacac 5341 tcccccttct ctccagggga cgccacactc ccccttctct ccaggggacg ccacactccc 5401 ccttctgtcc aggggacgcc acactcgccc ttctctccag gggacgccac actcgccctt 5461 ctctccaggg gacgccacac tcgcccttct ctccagggga cgccacactt gcccttctgt 5521 ccagggaatg ccacactccc ccttctcccc agcagcctcc gagtgaccag cttccccatc 5581 gatagacttc ccgaggccag gagccctcta gggctgccgg gtgccaccct ggctccttcc 5641 acaccgtgct ggtcactgcc tgctgggggc gtcagatgca ggtgaccctg tgcaggaggt 5701 atctctggac ctgcctcttg gtcattacgg ggctgggcag ggcctggtat cagggccccg 5761 ctggggttgc agggctgggc ctgtgctgtg gtcctggggt gtccaggaca gacgtggagg 5821 ggtcagggcc cagcacccct gctccatgct gaactgtggg aagcatccag gtccctgggt 5881 ggcttcaaca ggagttccag cacgggaacc actggacaac ctggggtgtg tcctgatctg 5941 gggacaggcc agccacaccc cgagtcctag ggactccaga gagcagccca ctgccctggg 6001 ctccacggaa gccccctcat gccgctaggc cttggcctcg gggacagccc agctaggcca 6061 gtgtgtggca ggaccaggcc cccatgtggg agctgacccc ttgggattct ggagctgtgc 6121 tgatgggcag gggagagcca gctcctcccc ttgagggagg gtcttgatgc ctggggttac 6181 ccgcagaggc ctgggtgccg ggacgctccc cggtttggct gaaaggaaag cagatgtggt 6241 cagcttctcc actgagccca tctggtcttc ccggggctgg gccccataga tctgggtccc 6301 tgtgtggccc ccctggtctg atgccgagga tacccctgca aactgccaat cccagaggac 6361 aagactggga agtccctgca gggagagccc atccccgcac cctgacccac aagagggact 6421 cctgctgccc accaggcatc cctccaggga tcc // LOCUS HSREP10 8874 bp DNA PRI 20-MAY-1992 DEFINITION Human beta-tubulin gene (5-beta) with ten Alu family members. ACCESSION X00734 NID g35958 KEYWORDS Alu repetitive sequence; tubulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8874) AUTHORS Lee,M.G., Loomis,C. and Cowan,N.J. TITLE Sequence of an expressed human beta-tubulin gene containing ten Alu family members JOURNAL Nucleic Acids Res. 
12 (14), 5823-5836 (1984) MEDLINE 84272256 FEATURES Location/Qualifiers source 1..8874 /organism="Homo sapiens" /db_xref="taxon:9606" CDS join(699..755,1770..1878,1995..2105,6932..7989) /codon_start=1 /product="tubulin 5-beta" /db_xref="PID:g35959" /db_xref="SWISS-PROT:P04350" /translation="MREIVHLQAGQCGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQL ERINVYYNEATGGNYVPRAVLVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWA KGHYTEGAELVDAVLDVVRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKMREEFP DRIMNTFSVVPSPKVSDTVVEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLT TPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPAFAPLTSR GSQQYRGLTVPELTQQMFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLSVQ SKNSSYFVEWIPNNVKTAVCDIPPRGLKMAVTFIGNSTAIQELFKRISEQFTAMFRRK AFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATAEQGEFEEEAEEEVA" intron 756..1769 /note="intron I" intron 1879..1994 /note="intron II" intron 2106..6931 /note="intron III" repeat_region 2613..2617 /note="direct repeat" misc_feature 2618..2922 /note="Alu sequence A" repeat_region 2923..2927 /note="direct repeat" repeat_region 2933..2945 /note="direct repeat" misc_feature 2946..3246 /note="Alu sequence B" repeat_region 3247..3259 /note="direct repeat" repeat_region 3285..3288 /note="direct repeat" misc_feature 3289..3583 /note="Alu sequence C" repeat_region 3584..3587 /note="direct repeat" repeat_region 3833..3840 /note="direct repeat" misc_feature 3841..4117 /note="Alu sequence D" repeat_region 4118..4124 /note="direct repeat" repeat_region 4925..4931 /note="direct repeat" misc_feature 4932..5241 /note="Alu sequence E" repeat_region 5242..5249 /note="direct repeat" misc_feature 5250..5515 /note="Alu sequence F" repeat_region 5516..5523 /note="direct repeat" misc_feature 5602..5903 /note="Alu sequence G" misc_feature 5922..6272 /note="Alu sequence H" misc_feature 6323..6451 /note="Alu sequence I" repeat_region 6429..6440 /note="direct repeat" misc_feature 6441..6806 /note="Alu sequence J" repeat_region 6807..6818 /note="direct repeat" misc_feature 8812..8817 /note="polyadenylation signal" 
misc_feature 8816..8821 /note="polyadenylation signal" BASE COUNT 1900 a 2617 c 2370 g 1987 t ORIGIN 1 aatgccagta gaggtgtaca ggggagccac cagcatctgc caccaatcat gggaggaaaa 61 tcgtggtttt actatgcagc ttcaggacaa gccctccagg aacctaccgc cccccactcg 121 caaaccagat cgccgctccg gagcccccag cccactggga gggctcgcgg ttactgcaag 181 gattctggcg aatgaggtct aaatagaatg agaagggggg ctgcggaccg agaaactgag 241 gcaaggcctg gggcgggagg gtggaggcta agggggaggg gcaggacttc tcagatgcct 301 gtttaccagg cgctcccctc ccttcctccg cccaatttct gcgggctggc caagctggct 361 gggggctggg acgggggaga ctgcactggt ggctccctaa ggggcctcac ccccaaggca 421 ggggtcggag cacgtggcgt cccggacttc tctgggaggc cccctgggga gggaggtccc 481 tgcacactcc ccccacccca gccccctcat atccggggcg ggaccaaggg gcgggacgcg 541 ggagggtgac cggccgtcga cgtgcccgcc gctcatatag cggctcccgg gggcgcaggg 601 accgtgtccg ccgtctccgc cgcatcttcc acccgtcgcc gccgccgcag ctccccgcgc 661 tcgtgccacg ccgccgcgtc caccctcagc gccagcccat gcgggagatc gtgcacctgc 721 aggccggcca gtgcggcaac cagatcgggg ccaaggtgct cggggaacgg ggcccgggga 781 ggcaagttgg cggccccgga gcaccgcggg tcggtccccg gagcaccccg caacctggga 841 gtctttggaa aggaatcgag gtccccagga ggccacggcg cccagggcct tgggggaagg 901 gtcgcggggt ggggtggcag ggccagactg ggaacaaaga ggctgcggag agtgggagcc 961 agccttctgt ccgcctgtca cagctggccc acagctgggc cactgtcccc tcctcacgca 1021 caaagaggcc ccggtgcctc ccccggctgc aggaggaaat tatgtcccca gagcggtgct 1081 ggtggacctg gacccggcac catggactct gtccgttctg gccccttcgg tcagatcttt 1141 cggccggaca acttcgtgtt tggtgagttc cccagcaggg agccagaggc tggaaaactc 1201 cttattcctg ctaacaccac agagtcaaca gccaccaccg gggcccacag accagagatg 1261 gggtggtgga cagggcacag gcacagcctg catccagggc tgtgatgagg ggatgctcag 1321 gcctaggagg gcggatacag agaaggattc agggaagtct tcctggagga ggtggcattt 1381 taattcaacg gtctgaaccg gaaccagact gcctgggtgc acatcccaat tctgccactt 1441 catgcattca acaaacagtg agcactcact gcataccagg ccctgttcta gaagctggag 1501 atccaggtaa gaatcaaaca gccgggggga tccgtcgacc tgcagcctgg gtgcgctggc 1561 caagccggcc ccgctgggtg ctgactcggc tgcacccgcc cccaccgctc cttcttgctg 
1621 cctatctgcc ccgacacatg cctctaggtc ccccgttcac ttgctggggg agaggtggca 1681 gctagggagc tagttggggg ctgggggcag tcctcggctt ggccaccggc ttcccctctc 1741 tcaactcttc cgaccacctc tctctctagt tttgggaggt tatcagtgac gaacatggca 1801 tcgaccccac aggcacatac catggggaca gtgacctgca actggagagg atcaacgtgt 1861 actacaacga ggccacaggt agggcgaagg aggcacttcc aggtgggaag gggcagctgg 1921 gagggaacag gatgctaccg ggcgcctgtg gttcctggca cgcatcgaac tgccctccat 1981 ctgtttccct gcaggaggaa attatgtccc cagagcggtg ctggtggacc tggaacccgg 2041 caccatggac tctgtccgtt ctggcccctt cggtcagatc tttcggccgg acaacttcgt 2101 gtttggtgag tcccccagca gggagccaga ggctggaaaa cctccttatt cctgctaaca 2161 ccacagagtc aacagccacc accggggccc acagaccaga gatggggtgg acagggcacc 2221 aggcaccagc ctgcatccag ggctgtgatg aggggatgct caggccatgt gagagcctga 2281 ggagggcgga tacagagaag gattcaggga agtcttcctg gaggaggtgg cattttaatt 2341 caacggtctg aaccggaacc agactgcctg ggtgcatccc aattctgcca cttcaatgca 2401 ttcaacaaac agtgagcact cactgcatac caggccctgt tctagatcag gtaagaatca 2461 aacagccgga gctccctctc tgccccagtg cagctcacat tctggtagaa ggtgataggc 2521 cacaaaggag ataaataagt aaagctctgc caggggttga ttcagatgga taacttagga 2581 tgggctcctc agcctttctg ggcctcaaaa aatcctcttt tttttttttt ttttcagaca 2641 gggtctgtct gtcctgctgt cacccaggct agagtgcagt ggcatgatca cggctcactg 2701 tcacctccac ctctgggcac acgtgatcct cttgcctcag tctcccaagt agctggacct 2761 acaggcatgc accaccacac ccagctaatt ttttagtttt ctttgtagag aggtgggtct 2821 ctctatgttg cccatgctgg tctcaaactc cgggactcaa acgatccacc caactcagct 2881 cccaaagtgc tggattacaa gcatgagcca ccttgcccgg cctcctcgtc tttaaaacag 2941 ggatttcact gggcatggtg gttcatgcct ataaacccag gactttggga ggctgagtca 3001 ggaagagagc ttgaggtcag gagttcgaga ccagcctggg caccatggtg agaccctgtc 3061 tctacaaaaa gaaaaaaatt ttaattagct gagtgaggtg gtgtgcacct gtagtcacag 3121 ctactcgtga ggctaaggta ggataatcac ttgagcctgg gaggttgagg ctgccatgag 3181 ctatgatggt gcccatgcca tccagcctga gcgatggtgt gagacccatc tctaaaaaag 3241 atcaaataaa acagggattc taaaatgcta ttagaattgt gaggctcagg gtgggcacgg 3301 
tggctcacac ctgtaatccc agcactttgg gaggccgagg cgggtggatc acctgaggtc 3361 gggagtttga gaccagcctg actaacatgg agaaacccca tctctactaa aaataaaaaa 3421 ttagcccagg catggtggca catgctgtaa tcccagctac tcgggaggct gaggcaagag 3481 aattgcttga acggcgagag gttgtggtga gccaagatca tgacagtgca ccccagcgtg 3541 ggcaacaaga gtgaaacttc gtctccaaaa aaaaaaaaaa attctcaagc tttcctggct 3601 ccagctggct gcatgtcccc accctggctc cacccgcctc caatactccc agcatgtaag 3661 acgtgctaaa tgcaggacct gtcctgcctc tctgaatctt ttccgcaact cgcctcttac 3721 ttccatttta caggtgagaa aacagaggca cagagacgtt taggtcactt gcagaaagtc 3781 acactgcaag ccactggggc tgtcaggagc cagaggagag ggtctgacta ctaggtccac 3841 gctttttttt ttttttgaga cggagtctcc gttgcccagg ctggagtgca gtgacgcaac 3901 ctccacctcc cagattcaag cgattctcct gcctcacgag tagctgggac tacaggcatg 3961 catcaccaca cctagctaat tttttgtatt tttaatagtg acggggtttc gccatgttgg 4021 ccaggctggt ctcctattcc tgacctctca ggtgactctg cctgcctctg cttcccaaag 4081 tgctgggatt acaggcatga gcaacgcgcc tggtctgagc ccactacttt ttaatgctcc 4141 tcccaccccc tggccaccca gaggccatag aagtctcttg gcccaactgt atagtccctc 4201 tggtcctcct gaaatcactc acaccatttg gccaatctca tttcatcttg atttcaccca 4261 gccatttccc aaggacctat gatggccacg gccacaaaac tgcaatcatt aagtgttttg 4321 aaagtcttcc aacagatcag aaaggggaag atcacttgtc caaggtcaca cagtttgtct 4381 tgtcagtccc gagaatctgc agtctaaacc accaggctcg ggaggaggag ggagggggtg 4441 accagagtgg ccttctctga ggatgcaggc atttgagctg agacctgaag gaagtgagga 4501 aaaagtcagg tagaatacct ggggggtttg cattctaagc agccaagcca ctagtgcaaa 4561 ggccccaggg caggaccaca cctggaaagc tggagtaaca gctgcggctg aagcagagtg 4621 agggagggga tggggcaggt cggcaggcct tgtgcctgca gggaggattt gggctttgac 4681 cccaaggtag gtgggagcca tggagggctg taggcagagg agggacggga cctgacttac 4741 ttgctctcag gtgtcctctc gtgggtactg tggagaggac agactgggtg actaagggta 4801 ggagccaggg gaccaggaca gagggaattg aacttatcca ggagggtgat gctggagctg 4861 aacctaaggt ggagacagat gaggggtgaa agaaaacgtt agcttgttca cagtaatacc 4921 ggccaatatt ctttttttta ttccattttt tttttttttt gagacaaagt ctcactctgt 4981 cgcccaggct 
ggagtgcagt ggcgcgatct cggctcactg caagctccac ctcctgggtt 5041 cacgccattc tcctgcctca gcctcccgag tagctgagac tacaggcacc caccaccaca 5101 tccagctaat tttttctatt tttagtagag acggggcctc accgtgttag ccgggatagt 5161 cttgatctcc tgacctcgct atccgcccgc ctcggcctcc caaagtgctg ggtttacaag 5221 cgtgagccac cacgcctggc caatattctt cttttttttt tttttttttt tttgagacgg 5281 agcttgctgt cgcccaggct ggatgcagtg gcacgatctc ggctcactgc aggctcagct 5341 cccgagtagc tgggactaca ggcgcccgca cctcgcccgg ctaatttttt gtattttcag 5401 tagagacagg gtttcactgt gttagccagg atggtctcga tctcctaacc tcgcaatctg 5461 cccgcctcgg cctccctaag tgctggatta caggcatgag ccactgcgcc tggccaatat 5521 tctaatactg attcttttct ccgggagctt gcattccaca gtgagcttgc agacaataaa 5581 catgattaac aagaaaacac aggccaagca tggctgcacg cctgtaatcc cagcacttta 5641 ggagccatgg caggtgtatg ctccagctca ggagttcaag aacagcctgg gcaacatgtc 5701 gaaaccccac ctctacaaaa aaaagataca aaaattagcc agatctggta gcataagcct 5761 gtagtcccag ctactcaaaa ggctggggca ggaggatcac ttgacctggg aagtcaaggc 5821 tcagtgagcc atgattgcac ctctgtactc cagcctgggc aacataccaa gaccctgcct 5881 gaaaatgata ttattattaa taatgatgtt aaatattaat ctttttattg ttattattta 5941 tttgttttag agacagggcc ttgctctgtc tcccaggctg caatggtgtg atcatagctc 6001 actgcagcct ccaactcctg ggctcaagta agtctcccac ctgagccccc ctagtaactg 6061 gtactacagg tgtgccacca ccatgactgg ctaatttata tatatatata tatatatttt 6121 tttttttttt tttttttttt tttttttttt gttagagaca gggcctcact agggccgcca 6181 ggctaagtct ccaagctgct gggctcaaga acctcctccc acctctgcct cccaaagtgt 6241 taggattaca ggcatgagca ctgtgcccag tcttggtttt tttttaggca gggggcgggg 6301 gagtgttttg tgtgtatgtg tgtttttgag atggagtctc actccgtcac ccaggctgga 6361 atgcagtggc tcaatcttgg ctccctgcag cctcctcctc ccaggttcaa gcaattctct 6421 tgcctcagcc tcccgagtag ccagggacta cgatcgctca ccagcacact ggctataatt 6481 gttattattt attattatta ttattattat tattattatt ttgagacgga gctcgctctg 6541 ttgcccaggc tgcagtgcag tggcacaatg tcggctcact gccaagcttc cgcctcctgg 6601 gttcacgcca ttctcctgcc tagcctcccg agtagctagg actacaggcg cccgcaccac 6661 tcccggctaa ttttttttgt 
aattttagta gagacggggt ttcaccgtgt tagccaggtt 6721 ggtctcgatc tcctgacctc gtgatccgcc cacctcggcc tcccaaagtg ctgggattac 6781 aggtgtgagc caccgcaccc ggctatttgt tattattttt gatagttact agtattcaac 6841 tactagaggc cctccaggtg gtggagagac agagtcaagg gttggggctc ccccataact 6901 gcatgtgtgt cccctcccct tccctctcta ggccaatccg gagccggcaa caactgggca 6961 aaggggcact acacggaggg cgcagagctg gtggacgctg tcctggacgt agtccggaag 7021 gaggccgaga gctgcgactg ccttcagggc ttccagctga cccactcgct ggggggtggg 7081 acggggtccg gaatgggcac gctgctcatc agtaagatgc gcgaggagtt cccagaccgc 7141 atcatgaaca ccttcagcgt ggtgccctcg cccaaagtgt cagacacggt ggtggagccc 7201 tacaatgcca cgctgtctgt gcaccagctg gtggagaata cggatgagac ctactgcatc 7261 gacaacgagg cactctacga catctgtttc cgcaccctca agctgaccac ccccacctac 7321 ggggacctca accacctggt gtcggccacc atgagcgggg tcaccacctg cctgcgcttc 7381 ccgggccagc tgaacgccga cctgcgcaag ctggccgtca acatggtccc ctttcctcgc 7441 ctgcacttct tcatgcccgc gttcgcaccc ctgaccagcc ggggcagcca gcagtaccgg 7501 ggcctgacgg tgcccgagct cacccagcag atgttcgatg ccaagaacat gatggcggcg 7561 tgcgacccgc gccacggccg ctacctgacc gtggccgccg tgttccgggg ccgcatgtcc 7621 atgaaggagg tggacgagca gatgctgagc gtgcagagca agaacagcag ctacttcgtg 7681 gagtggatcc ccaacaacgt gaagacggcc gtgtgcgaca tcccgccccg cggcctgaag 7741 atggccgtga ccttcatcgg caacagcacg gccatccagg agctgttcaa gcgcatctcc 7801 gagcagttca cggccatgtt ccggcgcaag gccttcttgc actggtacac gggcgagggc 7861 atggacgaga tggagttcac cgaggccgag agcaacatga atgacctggt atctgagtac 7921 cagcagtacc aggacgccac ggccgagcag ggcgagttcg aggaggaggc ggaggaggag 7981 gtggcctagg ctgctcccat cgcttcccac ctgtcccctc gaggcttctg acctttgatc 8041 cgctaggccc cccatctctg aaccctagag cccccgcttt ccctccaagg ctgactcccc 8101 cctgacccta acaattacct ttggagctcg ctttacctct ggctacttca tctccgaccc 8161 tggctcccct ttgagcccta atttatcttt aacccccttg agctcttcca accttgacat 8221 tcccaggagg agccccgctt caccccttct cactctggaa accgcacctt taactttgca 8281 gaccttcctt cacccctgac ttctgcttca cctttgacct ctgcccccca tgaatcccat 8341 tttacctcta gacctataag ttctggttta 
tgtttgaccc ctccctctga gctgcacttc 8401 accgctgacc ttgcctcacc tttaaccccc cacctgagcc ccagctccta cctctgaccc 8461 caacttctct ttgaatctct gaatcccctc tgactccaac ttctctttca ccctctatga 8521 gtcccatttt acttctacac ctgcaaagtc ctggtttata ttggacccct ccctccgagc 8581 tgcagttcac ctttgacctt gcctcacctt tcacccccca ccccccacag cgtcagctcc 8641 tacctctgac cccagcttct ctctggttcc cacaggcccc atgcatcctc cctgcctcac 8701 tcccctcagc ccctgccgac cttagcttat ctgggagaga aacaaggcct ggtgcctgtg 8761 aggaagagag gtcaccccta ccctccctcc ccgcttgcct gcctcaccct caataaataa 8821 attaaatgtt gtcatggatg ttctgccgaa tccctctttc ctctcttaca gcaa // LOCUS HSRING3GE 14561 bp DNA PRI 17-SEP-1996 DEFINITION H.sapiens RING3 gene. ACCESSION X96670 NID g1370114 KEYWORDS kinase; RING3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14561) AUTHORS Thorpe,K.L., Abdulla,S., Kaufman,J., Trowsdale,J. and Beck,S. TITLE Phylogeny and structure of the RING3 gene JOURNAL Immunogenetics 44 (5), 391-396 (1996) MEDLINE 96376536 REFERENCE 2 (bases 1 to 14561) AUTHORS Beck,S. TITLE Direct Submission JOURNAL Submitted (15-MAR-1996) S. 
Beck, The Sanger Centre, Hinxton Hall, Hinxton, Cambridge, CB10 1RQ, UK FEATURES Location/Qualifiers source 1..14561 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cosmid o27" /chromosome="6" /map="p21.3" promoter 205..228 /note="SP1 motif" exon 236..1276 /number=1 mRNA join(236..1276,2811..3114,3729..3866,4375..4513, 4595..4809,4907..5281,5787..5915,6102..6350,6471..6733, 8171..8475,8685..8807,8925..9632) promoter 346..352 /note="CREB motif" promoter 1159..1168 /note="Krueppel motif" intron 1277..2810 /number=1 promoter 1486..1502 /note="CdxA motif" promoter 1599..1604 /note="NIT2 motif" promoter 1699..1708 /note="NFkB motif" promoter 2215..2221 /note="SRY" promoter 2600..2605 /note="NIT2 motif" promoter 2618..2624 /note="CdxA motif" exon 2811..3114 /number=2 gene 2923..9061 /gene="RING3" CDS join(2923..3114,3729..3866,4375..4513,4595..4809, 4907..5281,5787..5915,6102..6350,6471..6733,8171..8475, 8685..8807,8925..9061) /gene="RING3" /codon_start=1 /product="kinase" /db_xref="PID:e243292" /db_xref="PID:g1370115" /translation="MASVPALQLTPANPPPPEVSNPKKPGRVTNQLQYLHKVVMKALW KHQFAWPFRQPVDAVKLGLPDYHKIIKQPMDMGTIKRRLENNYYWAASECMQDFNTMF TNCYIYNKPTDDIVLMAQTLEKIFLQKVASMPQEEQELVVTIPKNSHKKGAKLAALQG SVTSAHQVPAVSSVSHTALYTPPPEIPTTVFNIPHPSVISSPLLKSLHSAGPPLLAVT AAPPAQPLAKKKGVKRKADTTTPTPTAILAPGSPASPPGSLEPKAARLPPMRRESGRP IKPPRKDLPDSQQQHQSSKKGKLSEQLKHCNGILKELLSKKHAAYAWPFYKPVDASAL GLHDYHDIIKHPMDLSTVKRKMENRDYRDAQEFAADVRLMFSNCYKYNPPDHDVVAMA RKLQDVFEFRYAKMPDEPLEPGPLPVSTAMPPGLAKSSSESSSEESSSESSSEEEEEE DEEDEEEEESESSDSEEERAHRLAELQEQLRAVHEQLAALSQGPISKPKRKREKKEKK KKRKAEKHRGRAGADEDDKGPRAPRPPQPKKSKKASGSGGGSAALGPSGFGPSGGSGT KLPKKATKTAPPALPTGYDSEEEEESRPMSYDEKRQLSLDINKLPGEKLGRVVHIIQA REPSLRDSNPEEIEIDFETLKPSTLRELERYVLSCLRKKPRKPYTIKKPVGKTKEELA LEKKRELEKRLQDVSGQLNSTKKPPKKANEKTESSSAQQVAVSRLSASSSSSDSSSSS SSSSSSDTSDSDSG" intron 3115..3728 /gene="RING3" /number=2 repeat_region 3630..3660 /rpt_family="MVR" /rpt_unit=GT exon 3729..3866 /gene="RING3" /number=3 intron 3867..4374 /gene="RING3" /number=3 
exon 4375..4513 /gene="RING3" /number=4 intron 4514..4594 /gene="RING3" /number=4 exon 4595..4809 /gene="RING3" /number=5 intron 4810..4906 /gene="RING3" /number=5 exon 4907..5281 /gene="RING3" /number=6 intron 5282..5786 /gene="RING3" /number=6 exon 5787..5915 /gene="RING3" /number=7 intron 5916..6101 /gene="RING3" /number=7 exon 6102..6350 /gene="RING3" /number=8 intron 6351..6470 /gene="RING3" /number=8 exon 6471..6733 /gene="RING3" /number=9 intron 6734..8170 /gene="RING3" /number=9 repeat_region complement(7388..7710) /note="Sx" /rpt_family="alu" exon 8171..8475 /gene="RING3" /number=10 intron 8476..8684 /gene="RING3" /number=10 exon 8685..8807 /gene="RING3" /number=11 intron 8808..8924 /gene="RING3" /number=11 exon 8925..9632 /number=12 polyA_signal 9633..9638 repeat_region 10086..10332 /note="sb0" /rpt_family="alu" repeat_region 13548..13721 /note="J" /rpt_family="alu" BASE COUNT 3579 a 3178 c 3712 g 4092 t ORIGIN 1 tccacacccc tggcggttcc aggcaggcta cgccacgcga cccctcccgt ttccctgctt 61 tggccaatgg aggagctacg aatggcacga cctgctcgag cttggcagtc tccagttggg 121 ctgtgcatgg aagcttggga agactttgtt ggaaggggag gcggggagag agtgctggag 181 gctctggggc gatggcttcc gcacctcttc caaccaccct ctttccctgg agtcggcgga 241 ccacagctca gccaattggc ttggagatgt ggcgggttgc cacttccctg tgggtctctg 301 cggcactctt ctgcctggtg actgacacct tggaaatgaa gtttatgacg tcatcgttgc 361 ggctggccaa tagaaaaagc tcccgcggag aggtgttcct tccccttcga ctcagcttct 421 tcacccgcgt gagcgagcgc gcgcgcgcgg agggggtggg gaaaatctca agcagggtgg 481 cgcgcatgag cggcgaagct cctcctcccc gcctatatat aaagggctgg cgcggggctc 541 ggcggcgcca tttcgtgctg gagtggagca gcctctagaa cgagctggag gattctgcct 601 accgatacag agccttcgag tcgtccgggg ccgccattac aatccacctc catccgcttg 661 gaaatggcct tcgtcccggc ctatgactgg tcccagcggg cagtacagac cccctagaag 721 cccctggagc tccccttttt cgggccccgc ccaatcctcg gagtctgtcc accccctcta 781 ctccgccctc aagaggattt caaagatgga ggcggcggct ccctaaacca cttttcgtgt 841 tcatccgcct ccatccgaga tcgaaacggg acctcgtcgg ccccgtaggg gcccgacaag 901 aagagggaat 
ccctgcagac caacagcggg ctatattgac gacggtgtct gagatcgggg 961 accgtctttt gaagagtcag tccctcctta gttgcccgcc tcagctgagg ccgccgccat 1021 tttcttgctg tccgccgtct gcagagcgcg ccaagctgcc cggagctctc cgagaggccc 1081 caaagagact gctttcgtgc cggccaggca gggggtttgt cgcctggagg cccaagagga 1141 acggcctccc cccaacttag cgggttatgc tggaccgggc ggtgagggga accgaggcca 1201 cccggacttt ccgcggctga gggcagcgcc ggttccttgc ggtcaagatg ctgcaaaacg 1261 tgactcccca caataagtac gtttccgcga gccgcgtgtg ggaaggggat gttgcagggc 1321 ggcggcacag gggtgtgggg cgccgtgttg ggagtactga gcggccccgg cgcgctgctg 1381 ttgcggcgca gctgtcgact cggtcgcgcg gagggaattg agcgacggtt ttggaacggt 1441 ggtggcggct cggctactgc tcgtggaggg gaatacaggt tgtcaattta tacgctatta 1501 atgccgccgt ggcccagtct taaccgagtc aggcagagct agtttgacgg tggagtggag 1561 tgaggttgaa cagcaggttt ggcgtttggt gggtctggta tctagcggcg gtctgttagc 1621 cttttagggg ggattcacgg acacctctag cgccctgtag ggttgccatg gtgacggagc 1681 gcttaaggga ctggcaacgg ggattcccag agaagggtaa agggatcact ctcccgtgtg 1741 tgcaggttcc taatgcccag ggcatgtcat taaatctttt gctttctttg ggtgggtggg 1801 ttgtgtgtgg tgtttgttgg tgcagggatt gttttttcct aacattaaaa gtttgattca 1861 gggcaggagg gtagagctaa ggttcctagt tcagctctgc gatgtaaaca atgagattcc 1921 catatgatgt tttaattctt aggtggtagg aaagactgat cggaggagca ccagagggac 1981 tgtaaatgaa ccactgttag cgtttggtgt ccggagttgg tgctacaggg ggaactggta 2041 gtggaatcgt gttgtgtagt gggtgggtgg aagggggcta tcacttggtg accttgactg 2101 ttttgtacgg ctttttgact tccttggagt gaggagactc tgatttggtg cgaataattt 2161 tgagggcctg gaagttacgg gctgtgaagt ctgacaaatt cttccttgtc tgaatttgtt 2221 tttaagttga tatggttctt cctctgggtt tctagtctat gttctgttgt ggcgtgaact 2281 acccagacct tgtggaagat ggtgctctct cttctatcta ggtggattat tctgtgtctt 2341 atcagcattt tatggaattt tttatagcca taatttgttc ttttcctcct taccggcgct 2401 caaccaccat ggcaaccacc aaacccctag tgaggaggaa gcttggggtt tgagtttctt 2461 aactccaccc attttgctta atccccatcc ccatagggct gtagttctga gatgtcgtgc 2521 cttgtcagaa acaatttggg agttttttaa aatatgaaaa agaacagata gagcctatca 2581 gacttaagaa ggtgggatct 
agatagtata ctaaaaatat taataaaagg aaggcggggc 2641 cagcaataaa agctccacag attgtttgga tattgtttct gcttaagaag cacttggcat 2701 aagcttaacc acctcactag ggccagcacc tggattcatc agactattgt gcagatgcac 2761 tttttcctca tttggacgat attgccctaa ttttgttccc atctttacag gctccctggg 2821 gaagggaatg cagggttgct ggggctgggc ccagaagcag cagcaccagg gaaaaggatt 2881 cgaaaaccct ctctcttgta tgagggcttt gagagcccca caatggcttc ggtgcctgct 2941 ttgcaactta cccctgccaa cccaccaccc ccggaggtgt ccaatcccaa aaagccagga 3001 cgagttacca accagctgca atacctacac aaggtagtga tgaaggctct gtggaaacat 3061 cagttcgcat ggccattccg gcagcctgtg gatgctgtca aactgggtct accggtgagt 3121 agagacattg gagccgggga ggtgtgggat gagcaagaat gcgtgtgaat gggggtggtc 3181 tgcctagtgt agatgctgcg gcccctaggg agttcccatt tctcccctgt agggcagtta 3241 gctaccagat ttctgggtat cttggtcctt tgtgattgat ccgaccgctt gctgtaacta 3301 tcttggcatc tttccttgtg ccctccatgt gtccttcctt aacttttgtg ccctggctcc 3361 attttacaga ttcccacctc gggttgggag aggaccacgg tggccaaaat tcttagcttc 3421 ttcctttccc tcatgcagcc catggatagc cagccccaga ggtaatgtca caggatggga 3481 agtttccaga gtgggtggga ggtgggtggt tagagaaagg cagcaggggc ctccctgtgg 3541 atgtcaagaa tcttttttat ttatttattt attttgtccc acagtttaat tggggccgca 3601 gtttaactgt tcctttgatg catagggggt gtgtgtgtgt gtgtgtgtgt gtgtgagagt 3661 cggggatcgg tagtctccct ataagcattt atttttctgt ggttctgacc taacatttct 3721 ttatttagga ttatcacaaa attataaaac agcctatgga catgggtact attaagagga 3781 gacttgaaaa caattattat tgggctgctt cagagtgtat gcaagatttt aataccatgt 3841 tcaccaactg ttacatttac aacaaggtga gtttttctgt gtgttcattt agtaggtggg 3901 gagaaacagt aatttctatt attgctggat atgttgtcta cataaagttt aaatcctttg 3961 ctactgaagg tgttatccag gtagggtagt cggagtctta aaaacctgac tctagatggt 4021 actattgaac acagtgatgt gacttcagag ctctagttga aggttattta gaacacttca 4081 tacttggggg tggtggtcct gtttcttaga aatcaccaga gacctgagta gaccagggat 4141 ctgttttctt gtcagctctc aagttttttc ttctttcgaa ttttgggaga cagttaggag 4201 aaagtggaaa ttagtagtgg cctggagtag aaattttctt taagatttga tgacaagatg 4261 actggtgggg gtatggtaat ggcctagggc 
ctgaatgcct ctgagaaaga tggtgtgtat 4321 ctatcttctg ttggcatttt ttaactttct ttattgctgt ctgtgttctc atagcccact 4381 gatgatattg tcctaatggc acaaacgctg gaaaagatat tcctacagaa ggttgcatca 4441 atgccacaag aagaacaaga gctggtagtg accatcccta agaacagcca caagaagggg 4501 gccaagttgg caggtaggaa gagtgggagt tttgcaaatg gacaacttaa agatggggaa 4561 gagaatcaaa ctacactttt ttcctttttt ctagcgctcc agggcagtgt taccagtgcc 4621 catcaggtgc ctgccgtctc ttctgtgtca cacacagccc tgtatactcc tccacctgag 4681 atacctacca ctgtcttcaa cattccccac ccatcagtca tttcctctcc acttctcaag 4741 tccttgcact ctgctggacc cccgctcctt gctgttactg cagctcctcc agcccagccc 4801 cttgccaagg tatgatctgt ggatttcctc tgggcagcag ggaggcaagg gtcttaagta 4861 aagtgggctt ggagtgacag gttccctatc ttgtttcttt ctgcagaaaa aaggcgtaaa 4921 gcggaaagca gatactacca cccctacacc tacagccatc ttggctcctg gttctccagc 4981 tagccctcct gggagtcttg agcctaaggc agcacggctt ccccctatgc gtagagagag 5041 tggtcgcccc atcaagcccc cacgcaaaga cttgcctgac tctcagcaac aacaccagag 5101 ctctaagaaa ggaaagcttt cagaacagtt aaaacattgc aatggcattt tgaaggagtt 5161 actctctaag aagcatgctg cctatgcttg gcctttctat aaaccagtgg atgcttctgc 5221 acttggcctg catgactacc atgacatcat taagcacccc atggacctca gcactgtcaa 5281 ggtacccact gcatggggca gatgggatgc tcaagcagtg atgggagcct aggtgcaaaa 5341 caataagtct ccttatgtgg gcacacagca gtctttggtt cttggcattt tacttttata 5401 aaataatagt ggaacagaag gtctggtgtt ttgagaattt gtatttcttg gagtttgaaa 5461 cagtagggtg gggtttcttt gtcttgagaa aaatactgtc tataattaag tactaatgtg 5521 gcagtgttgg gttaaggaag ttatagggtg gaaagacagg cataggccac ctctctgtca 5581 cttagaaatg atttcttttt ctagacataa atatttcttc aacccaccca aattcctttg 5641 acttcaaact tgaaccccag ggcacagatc cttaaggtca tccccactgt gctctcaaga 5701 gagggctctt cttgtggtgt ctggggttgg cagggaaagg tgagtcttcc tgcctgtgca 5761 gcttctgatg ctgcctcctt ctgcagcgga agatggagaa ccgtgattac cgggatgcac 5821 aggagtttgc tgctgatgta cggcttatgt tctccaactg ctataagtac aatcccccag 5881 atcacgatgt tgtggcaatg gcacgaaagc tacaggtgag tggaaaggtt ggagtttgaa 5941 aaataaatgg tatggggagt tattttgtca tgtgtgctgc 
atagcctcaa cgtgagggtc 6001 tcactgttct gtacagttgt aaattggagc tatatcactt ggtggctggg tatgtagggc 6061 actgtttatc agcatagttt tgagtttgtg cctctttcta ggatgtattt gagttccgtt 6121 atgccaagat gccagatgaa ccactagaac cagggccttt accagtctct actgccatgc 6181 cccctggctt ggccaaatcg tcttcagagt cctccagtga ggaaagtagc agtgagagct 6241 cctctgagga agaggaggag gaagatgagg aggacgagga ggaagaagag agtgaaagct 6301 cagactcaga ggaagaaagg gctcatcgct tagcagaact acaggaacag gtattttgtc 6361 actcttgaaa gtttttattg ggtaagaggt tcatgccctt tgtcctcatt ttttcttctt 6421 gttattttat ctttatttac tttttccact tcatgttttt tttcctttag cttcgggcag 6481 tacatgaaca actggctgct ctgtcccagg gtccaatatc caagcccaag aggaaaagag 6541 agaaaaaaga gaaaaagaag aaacggaagg cagagaagca tcgaggccga gctggggccg 6601 atgaagatga caaggggcct agggcacccc gcccacctca acctaagaag tccaagaaag 6661 caagtggcag tgggggtggc agtgctgctt taggcccttc cggctttgga ccttctggag 6721 gaagtggcac caagtgagtt agagtaggaa gcagagacta gtttggctat ttctgtctct 6781 ctgggggatg ccatctctct ttgcaaagat aattctaaat ggccagttaa cagatacaat 6841 aggctttgag cagtggtccc caaccttttt ggcaccaggg accagtttcg tggaacacag 6901 attttaccac agacagggtt tgaggggatg gtttttggga tgaaactgtt ccacctcaga 6961 tcattgggcc attggattcc cataaggagc atgcagcctg gatatgtacc atgcgcactt 7021 cacagtaggg ttcatgcttc tatgagaatc taatgcttct gctgatgtga caggcagtga 7081 tgcccacatg ccggctgttc acctcctgcg tagcccagta acaggccacg gactggtact 7141 ggtctggggg ttgggacccc tggctttggg agtcagggtg tttcacagct actctgacag 7201 tgaactcaaa gtagccataa actagaaaca tgaagatggc tgtgttccaa aaagacttta 7261 tttgcaaaga cacgtggcga tcagatttgt tctctgggcc atatagtttg cctgttgctc 7321 taaatcaatg agtctagact tgtttttcat ggcgtagtag ttttgggttt tttggtgtgg 7381 ttttgtgttt ttttttgttt gttttttttt tttttttttt ttttttaaag actccaggct 7441 ggagtgcagt ggcgtgatct cggcttactg caacctccac ttctcgggtt caagcgattc 7501 tcctgcctca gcctcccaag tagccagaat tacaggcatg cgccaccacg cccagctaat 7561 ttttgtattt ttagtgcgca gctagtttat gtagttttag tggagacggg gtttcgccat 7621 gttgggcagg ctggtcttga actcctgacc tcaagtgatc tgcccgcctt 
ggcctcccaa 7681 agtgctggga ttacaaatct gagccactgc agctggccca tggtgtagtt tggtagtgtt 7741 taagggagca gaaagaccca tgtcagtata cctaaacagg tataccttgt tttattgtgc 7801 ttcactttac ggagtttttt ttagatacta cttttttttt tagttgaaga tttgtgacaa 7861 ccctgtgtgg agcaagtctt tcaacagttt ttccaacatg tttgtgtgtc acatttttag 7921 taatattttt tcattaaggt atgtacgtac attgtctttt taaagacatg ttattgccta 7981 cttacagtca agagcaaaat gctctgtttc actatacagt gtcccagtag cccacctctt 8041 acttggccat tgaatggaaa aacagaagct ccactctggg caggaaatag gatcactgaa 8101 ttataacagt gggaacatac tggaagaggt taatgaagct tcttttgctg acaactcttt 8161 ttgcccttag gctccccaaa aaggccacaa agacagcccc acctgccctg cctacaggtt 8221 atgattcaga ggaggaggaa gagagcaggc ccatgagtta cgatgagaag cggcagctga 8281 gcctggacat caacaaatta cctggggaga agctgggccg agttgtgcat ataatccaag 8341 ccagggagcc ctctttacgt gattcaaacc cagaagagat tgagattgat tttgaaacac 8401 tcaagccatc cacacttaga gagcttgagc gctatgtcct ttcctgccta cgtaagaaac 8461 cccggaagcc ctacagtacg tatgaaatga ggttcatctc atggttctga ggacagttga 8521 ggaaagatgg tggggtctgt ttgcattcag gattgtcagc tcccaggata atgggatgtg 8581 ttggttggca gctgacgttc aagaagggaa cttgggaacc ttaggggccc ataataagat 8641 gcttggggca atcttaatgt atcctgataa atttctttca ttagccatta agaagcctgt 8701 gggaaagaca aaggaggaac tggctttgga gaaaaagcgg gaattagaaa agcggttaca 8761 agatgtcagc ggacagctca attctactaa aaagcccccc aagaaaggtg agtatatact 8821 ttcatgccac tacagattga ctccatcctg ccttcttgac tgtcttttat tgacaaatga 8881 agattcagac ttgaacgtct ttaactttcg aatttgttct gcagcgaatg agaaaacaga 8941 gtcatcctct gcacagcaag tagcagtgtc acgccttagc gcttccagct ccagctcaga 9001 ttccagctcc tcctcttcct cgtcgtcgtc ttcagacacc agtgattcag actcaggcta 9061 aggggtcagg ccagatgggg caggaaggct ccgcaggacc ggacccctag accaccctgc 9121 cccacctgcc ccttccccct ttgctgtgac acttcttcat ctcacccccc cctgcccccc 9181 tctaggagag ctggctctgc agtgggggag ggatgcaggg acatttactg aaggagggac 9241 atggacaaaa caacattgaa ttcccagccc cattggggag tgatctcttg gacacagagc 9301 ccccattcaa aatggggcag ggcaagggtg ggagtgtgca aagccctgat ctggagttac 
9361 ctgaggccat agctgcccta ttcacttcta agggccctgt tttgagattg tttgttctaa 9421 tttattttaa gctaggtaag gctgggggga gggtggggcc gtggtcccct cagcctccat 9481 ggggagggaa gaagggggag ctcttttttt acgttgattt ttttttttct actctgtttt 9541 ccctttttcc ttccgctcca tttggggccc tgggggtttc agtcatctcc ccatttggtc 9601 ccctggactg tctttgttga ttctaacttg taaataaaga aaatattatt caagttttga 9661 gttaccttaa tatttgcttt tgtagtgttt caaaaggaac atcataagaa ttgtcttgat 9721 aattttgagg gaaatattac tgcagtgaga aaaggcaata gctaacctat aattggattg 9781 tcttaatttt taaaccagta ggcttttgct gtgtttttaa taaagtaaat atgacttttg 9841 taaattgagt ccttagaagt aatctttagg tctacaattt gctcttgttt aaatgaaaaa 9901 tagtactgtg gctcattcat gctttaacca agaactcaaa attttgaggt aggctttagg 9961 tttttccctg tggcactgga tgtgtgaatt ttctcctgag cagacttaaa atatgagaaa 10021 agggtgggag gtagccgaac ataagtactt tatgcattga gtttattgcc ttttaaaagg 10081 aaattggcct gtaatcccag cactttggga ggccgaggcg ggcagatcac gaggtcagga 10141 gatcgagacc atggtgaaac cctgtctact aaaaaaaaat tagctgggcg aggtggcggg 10201 tacctgtagt cccagctact cgggaggctg aggcagcaga atggcgtgaa ctcgggaggc 10261 ggggttcagt gagccgagat cgcgccactg cactccagcc tgggtggtag acactccgtc 10321 tcaaaaaaaa aagtaattgg gcctactaca ttgttaaaca ttgttaaatt ttgctgccat 10381 ggtcacacac aaatttacag atagtttatt agtagaatac taaagagtat tccaacgatt 10441 aaatcacaaa actgtgcttt ctgcataccc ccttgtcttg ctaaggggag agaagggttg 10501 tataaaaagt ttagggggtt gggatgtttg cattctggaa tttggggctt taatactgga 10561 aaagtgagac atttgcttag tatagtgtac catagtagga aacctggata gagacatgga 10621 aattagaatc aggaatgtag taaagcaaat ggtttatttt gctgtaaatg acaccacaaa 10681 ctaagtgtag ggcaacacca caaactaaat gtaggaagca ataaatttta ctagtgatgc 10741 tcagccctct ttaggaattc cggctaaact ggggcttgag caacaatttt caaaagctcg 10801 ggagatggta ataaaaaatt aggtttgtga accacctgct actgtttgcc aagcacttag 10861 agggaaacaa acccttgttt gggctttctt gctaacttgt gtgcaccagt gaaagctctt 10921 gagctccctt tgagctctgg ttcccttttg agaataacag atgttgagga ttcgtaagta 10981 cttaatagag acgcgttggg caataggtga tgagatacaa attaaagttc 
tgaaaatcgg 11041 agtaaataga tttaagctaa gtgcatgtct atgtcaagga ttacatctca tttcagaggg 11101 aattgaagga tttagttgga ttagttttgg gacaaaatat agaatatttt gactgcagcc 11161 ccttctgctc atgtactttt aaggtttgtt ttctgtagtt cggaaaaata aaagtttcaa 11221 cctgacattg gaggcccctg agtacttaat tccctgtaaa tggaacccag accgccctaa 11281 atgctttaag agagagaagg gctggctgac acaggggctt cagacctgcc ttaaaccaat 11341 tggactagtc tcttaattga ctttagtttg aacttatttc aagcctgtct cacttaggga 11401 ttgtaattgt ttcaggagtt tggttgagtt ccatcttggt tgccaaagga ctttattcca 11461 aaatagcagt ctccagcaca actcaaagga ctagtggagt cctgtgggca ttatttcccc 11521 ctatgctcct ctcagctctt ggaatcatgg gttctattgc tgctgctttt tccctctccc 11581 ctcatgctgc catttactgc cttttattgc gtccatagga agccttttgt tgggttgtgg 11641 gggaaggtga gagtcggtct tatttactca gtcacacatt tattgagttc ccttgattgc 11701 cttttcagca aactgttagg cctgtagacc tggacgttgc caagccagag ggtataaggt 11761 gaagataaga caaggtctta tcatggagtt tgcttagcac gaatagggtg caattattaa 11821 ccatcatggt gtggtaatta tactgattga atccaagata taggcatgac ttggtcttca 11881 cagaccatcc ttattgtgca cccatatgtg gaccccagtt ccagcctgcc tgtaaccttc 11941 cccaaagtcc tgctcttagg ttcactcggg actaccttga ttggaaggcc ctgcttccat 12001 gagtgaccag catgaaggac tcccaggaca aggaccagag gtgactggtg ttactctggg 12061 gcttagtgca agactggagg aagattttcc tgagcaatta gagagtggct gtaatggaga 12121 gctgggaagg agatgtggga atggtagcat ggaactaatg tgttgtcacc atgatttcat 12181 tttttcctgg gtcatcaccc taaagatact cacaaaatcc cactagctgg tctccagcac 12241 tgaatgagac gacagcttct tgctctcaat ttatattcta ggatcgggga aaggggcagc 12301 attagacaac tagtcacacg atttttaaca aaaataagta attctaacag gacaatgtgc 12361 agaggtctag caagaattta gaaaagaggg ttctctggtc taagctgaga gctattcaga 12421 tatctcagaa cgatgagttc cttcctgttt gtaaggggtg gatgggtggg agcagggagt 12481 agtgaacatt gccagcagaa gaaacagcaa ctgagaaggt aactaggtga tgctgaagca 12541 gcaaggatga aggtgagtgc tttgagatgc tttgaattta catctcaaag actaatagct 12601 aacatttatt gagcacttac tgtgtcccat gcactgtgct aaataaaaac ttaccctgta 12661 aactcatttg gtcctcacta taatcctgtg 
aggtacatct tgtctgcatt taacagataa 12721 agaaatgagg cacagagaga ttagtcaatt tgcccaacat caccacattg tcagtgagca 12781 ctggaggtgg gttttgaaac caggcgatct ggcttcaggg tccacattta taactactgc 12841 actagacttc cagtgctgtg ggccagtagg gagctaggag atagacatgc tttaaggaaa 12901 ttgcactagg gacaatcatg ttagggttgg tggggtgttg agagtggata gtttagaaag 12961 caattgcagc agttaatccc agccagtgag aatagtggtg tagaagagaa aggcagcaat 13021 gtagatgaag aaatgtagat agatttgaga gctatgtagg agttaaaata gatgggactt 13081 attctcaaaa ggttcttgtc tccacacatt aaaggaacag cagagcacac gtgcacttgc 13141 agacatacag ctctcagagt ccctagcaca gagctttcca atagtggaag cttgatattt 13201 tgttgactaa aggagtgcca cctggctgac tggcagattt gaatggagct ccctccagca 13261 tggcttgcca aaggggcagg ggttctgagg ctcttgtaga ctgccactga aggtatttgg 13321 cgccacctgt tgggcgtgtt acccacgatt gcctcctgaa tctattgtca tttttgtgtc 13381 ctgcccccga ggttaggtgg ttcttccctt cttactttcc taaacttcaa ctcttaaaat 13441 gtgagccttc atttttatga cccagagggt cacaaaagaa ggagattagg cctttttagt 13501 ccttatctcc ctttacctct gaagcatccc atggggcttc cctagcattt tattttatat 13561 tgatttattt atttattttt aagaaagagg atttctgtca cccaggctgg agtgcagtgg 13621 tgtgatcata gctcactgta gcttcgacct tctgggctca agtaatgctc ttgcctcagc 13681 ctcctgagta gctgggatta caggcatgag ccaccgcacc ctgcatctcc tgacttcttt 13741 gacttgacac cacttgttcc tagctctcat acttttcaga cagccatttc ttagtctgct 13801 tctttaaagt ctctgcctct gcctcccata atatattgtc ccttatgaca cctttctttg 13861 tcctcttttt atttgcacag tatttttaag caagtttacc cattcctctg gaggcacctc 13921 ccacccataa aatctgtgtc tctgtttaga ccttgtatta gtccattctc acatggctat 13981 gaggaaatac ctgagactag gtgatttata aaggaaagag gtttaattga ctcacagttc 14041 tgcatggctg gggaggcctc aggaaactta caatcatggc ggaagacacc tcttcacagg 14101 gtggcaggag aatgagtgcc aaagtgaagt ggggaagccc cttataaaac cataagatct 14161 cgtgagaact cactatcacg agaacagcat gggggaaact gcccccatat caaaccttga 14221 gtcaaagttc caagtgccta atgcaatgcc taatggcaag gtcccataag ctcgtgcaca 14281 catttttaga aaagattcta cctttttccc caaacatgtc cctcgcagtg agttctctaa 14341 gtcagcagtc 
cccagctttt ttggcaccaa ggactggttt tgtggaagac aacttttcta 14401 tggatggagg cggggagaat ggtttccaga tcaaactgtt ctacttcaga tcaccaggca 14461 ttagattctc aaaaggagtt tgcaaccttc cctcgcatgc tcagttcaca atagggtttg 14521 cagtcctgtg agaatctaat gccgctgctg atcccgtcga c // LOCUS HSRODPDE 6470 bp DNA PRI 09-MAR-1995 DEFINITION H.sapiens rod cG-PDE G gene for 3', 5'-cyclic nucleotide phosphodiesterase. ACCESSION X62025 U13894 NID g36107 KEYWORDS 3',5'-cyclic-nucleotide phosphodiesterase; cyclic nucleotide phosphodiesterase; rod phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6470) AUTHORS Piriev,N.I., Purishko,V.A., Khramtsov,N.V. and Lipkin,V.M. TITLE The organization of the gamma-subunit gene of human photoreceptor cyclic GMP phosphodiesterase JOURNAL Dokl. Akad. Nauk SSSR 315 (1), 229-231 (1990) MEDLINE 91266687 REFERENCE 2 (bases 1 to 6470) AUTHORS Piriev,N.I. TITLE Direct Submission JOURNAL Submitted (22-APR-1992) N.I. Piriev, M M Shemyakin Institut of, Bioorganic Chemistry, Academy of Sciences, Moscow, USSR REFERENCE 3 (bases 1 to 6470) AUTHORS Piriev,N.I., Khramtsov,N.V. and Lipkin,V.M. 
TITLE Cloning and characterization of the gene encoding the cGMP-phosphodiesterase gamma-subunit of human rod photoreceptor cells JOURNAL Gene 151 (1-2), 297-301 (1994) MEDLINE 95129879 FEATURES Location/Qualifiers source 1..6470 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retinal" /cell_type="rod photoreceptor" /clone_lib="EMBL 3 genomic" CAAT_signal 138..141 TATA_signal 190..194 exon 209..252 /gene="rod cG-PDE G" /number=1 mRNA join(209..252,3339..3543,5006..5046,5538..6470) /gene="rod cG-PDE G" gene 209..6470 /gene="rod cG-PDE G" intron 253..3338 /gene="rod cG-PDE G" /number=1 exon 3339..3543 /gene="rod cG-PDE G" /number=2 CDS join(3398..3543,5006..5046,5538..5614) /gene="rod cG-PDE G" /EC_number="3.1.4.17" /codon_start=1 /product="3',5'-cyclic-nucleotide phosphodiesterase" /db_xref="PID:g36108" /db_xref="SWISS-PROT:P18545" /translation="MNLEPPKAEFRSATRVAGGPVTPRKGPPKFKQRQTRQFKSKPPK KGVQGFGDDIPGMEGLGTDITVICPWEAFNHLELHELAQYGII" intron 3544..5005 /gene="rod cG-PDE G" /number=2 exon 5006..5046 /gene="rod cG-PDE G" /number=3 intron 5047..5537 /gene="rod cG-PDE G" /number=3 exon 5538..6470 /gene="rod cG-PDE G" /number=4 BASE COUNT 1454 a 1929 c 1967 g 1120 t ORIGIN 1 ctggctgttc agggtcacac ttctctccct gtcccggccc catctgctgg gtgtccttcc 61 ttggagccct tgacagcacc tgagcccctc ctctccatcc cctgaacctt ggtggttgtg 121 ccccccctcg gggccaacaa tctcattagg gtggcactga gacgcctaat cagctcatga 181 agcagcgaga taaaggccgg ggctggcacc cagcactcac agcacagccc cctgagaccc 241 gccctgcact tggtgagtag tggccaaccc cgctgtcttt cctggcagcc actttccccc 301 agtgagtcag tgggggtctg ctcatcaccc aaggggcaga gggagctgac aggcttttac 361 caccgtggct gaggtctgca gatgcctacc cgccaccctg ttattcagag gggactccca 421 gccaacctcc tcctctgctc ccccatggta accagtgctt ccaccatccc aggccagtga 481 tggtctggga acctcaggcc acgtgggctc agctgacctg agtgacgcca gtcacaggct 541 gggtcgggct agggcagcgc aggcagcggt ttccacatgg atggcatctg atggttggag 601 ctccatcagg gcccagagct gcccaggaca gcccatgtcc cagggctgga gaagcagctg 661 gacggggcct gtgctatcta 
ggatatcagg attgggtgcc atgggatctg agcagctggg 721 accaccgggc tcctgagtgc tggttttgca caggctggtg ctggtcccag tcccgggaac 781 agaggctgaa ccttagtgag ggaaggtgcc atggcagggt ccaccccaat ccggtgccca 841 gcaccccttg cagagcccgc gtccccaaaa tgggccacct gcccttgctt tctgatcact 901 ctcacgcctg aagctcggca aacagctggt agtagcccag ggctggagtc aggaatccca 961 gccaaaaaga ccaggggcag gaaaggggac cccaagaggt ggtccctggg gcatccagtg 1021 cgggcaagca gcccctcctg tctgtgtcat gtgcctcctt ccagaacttt ccggcagcat 1081 gtgcagctga cagcagctcg atgggagatt aggaattctt cctctgctgt taataggctt 1141 aggggaagct gaactgccct tggcccaggt cactggggag aggctgtagg gagaggagga 1201 ggagaggcca ccctgtgcag cagtgcaggg acactgatct cgtccagagc ggccgaggcc 1261 ttcagccacg ccacgccacg ccaggcgggt gaccctgctc cccgagctga ctccagacac 1321 agagagggcc tgtgattccc ccagcagggg gaacacatgc ccggcaccct ccgtccccaa 1381 gccccagctt gtccctctat gaagtggggt tggggaacgc agggcttcac agctcagcac 1441 caagtctctg agacaggccc ctttcccagg ggcgccgtgc ctggggaccc gcccccattt 1501 tgtgggtcat tctccatcag cactgcgggg tgcccgtcat ggtgctgcgg ggctcaggga 1561 tgcctccgtg agtgggtgaa gggaactgac agcaccccaa ctgcaaagca cagacaagag 1621 gcagaaagca ataaacccat caataagaat cagaggggtt ggccgggcgc agtggctcac 1681 gcctgtaatc ccagcacttt gggaggccaa ggtgggtgga tcacgaggtc aggagatcga 1741 gaccatcctg gctaacatgg tgaaactcca tctctactaa aaaatacaaa aaaaaaaaat 1801 aaattagccg ggcgtggtgt cgggcgcctg tagtcccagc tactggggag gctgaggcag 1861 gagaatgacg tgaacccagg aggtggagct tgcagtgagc tgagatcgtt gccactgcac 1921 tctagcctgg gcaacagagc gagactccgt ctaaaaaaaa aaaaaaggtg agattccagc 1981 cacagctctg gaataggatc ctaggggcta ctttggcagg tattaaaact tttttaggcc 2041 gggcgcggtg gctcacgcct gtaatccctg cactttggga ggccgaggcg ggtggatcac 2101 aaggtcagga gtttgagacc agcctgacca agatggtgaa accccatctg tactaaaaac 2161 acaaagaaaa attagctggg tgtggtggcg catgcctgta atctgagcta ctccggaggc 2221 tgaggcagga gaattacttg aacccgggag gcagaagttg cagtgagcca agatctcgcc 2281 actgcactcc aacctgggtg acagagcaag actctgtctc aaaaaaaaaa aaaaaaaaga 2341 atcaggggca acagcccggg agacgcccac 
cagctcggaa tcagggctaa ctcgacgcgt 2401 gatgagaagc ggcaaatcag aacctaagga cacagtggct gcagggcggg ggagacctcc 2461 ctgagcggga cttgagatgg aggcagccca actaaccatc aggaagggct gctcagaggc 2521 tcctctgctg ggcctggcag gcccattccc agcaggagcc acagaggact ttcatcagag 2581 tagcagctgg gccgggtgcg gtggctcacg cctgtaatac cagcactttg ggaggctgag 2641 gcgggtggat catgaggtca ggagttcgag acaacctcgt caacatggtg aaaccccgtc 2701 tctactaaaa atgcaaaaat gcaaagatta gccgggcatg gtggcagacg ccggtaatcc 2761 cagctgctgg ggaggctgag gcaggagaat cacttgaacc cgggaggcgg aggttcgtca 2821 gagccgagat agtgccactg cacccagccc aggcaacaag agcaaaactc tgtctcaaaa 2881 acaaaacaaa acaaaaaaaa cccagagtag cagttaaatg ccgacgggag tcacgcttcc 2941 ctggggtggg gaacgcaggg gaaggggcag gaagttgtca gtggtcaccc acagggaggt 3001 ggtgaggact ccaggctcag gacctgccac aggttgggag gctaggtttg ctgtggattg 3061 tgggccgggg cgagggaggc caggacaggg gggagtccgg agctgtgtgt ggccagcgtt 3121 gcaggggggc atccactggc aaggggaatg ggagcgtgcg ggtggaccca agcaaagctg 3181 gcaggctggt taccccaggg cggattgcag gggctgctgt ctggatgacg ctggagctgg 3241 cggggctggg tctcagcccc tccaccggtg ggttgagacc cccacaaggg ttggaagcag 3301 gagggctgac tgggacccgg ggcctggtct gcccccagac cgcagcagga gggagtccag 3361 gagccaaggt tgccgcggtg tctccgtcag cctcaccatg aacctggaac cgcccaaggc 3421 tgagttccgg tcagccacca gggtggccgg gggacctgtc acccccagga aagggccccc 3481 taaatttaag cagcgacaga ccaggcagtt caagagcaag cccccaaaga aaggcgttca 3541 agggtaagca gagctggggg atggggcctc tgaaagggag gaggcggtcc tgaggctgca 3601 caggaagggg agggcggggc agagccaggg ccacttcact gggtgagtgg tcccagcctc 3661 acagctggga aatagggcgg tgccgagggc tgagcaggga tggaggaggg ggaaggggaa 3721 ttgggggctc ctgtcacccg ctccagatgc tactgtcaac catgcttgaa atacagccac 3781 tgccggccag gcgcggtggc tcagcgcttg ggaggctgag gtgggcagat cacctgaggt 3841 cgcgcccggc ctaaaaaaag ttttaatacc tgccaaagta gccccctagg atcgcgagtt 3901 tgagaccagc ctgacaaaca tggagaaacc tgtctctact aaaaatacaa atttagccag 3961 gcgtggtggt gcacgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 4021 gaatccagga ggcagaggtt gtggtgagct gagatcgcgc 
cattgcactc aggctgggca 4081 acaagaagaa caaaactccg tctcaaaaaa acaaaagccg ggggcggtgg ctcatgcctg 4141 taatcccagc actttgggag gccgaggcgg gcggatcacg aggtcaggag atcgagacca 4201 ttctggctaa gaaggtgaaa ccctgtctct actaaaaaat acaaaaaatt agccgggcgt 4261 ggtggcgggc gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggtgtgaac 4321 ccgggaggca gagcttgcgg tgagccgaga tcacgccact gcactccagc ctgggcgaca 4381 gagtgagact ccgccaaata aataaaaaaa atacaaccac tgccacccag ggacagccaa 4441 tggcacaatg caccgaggag tgtccactca cacatgcgca cacacgggca tctcgctccc 4501 atcagtgcac cgaggagtgt ccactcactc gtgtgcgcac acaggcaccg cgctcccatc 4561 aacgcaccga ggagtgtcca ctcacacgtg tgcgcacaca cagggcatga cactcttatg 4621 cacaaacacg ctcacacctc cacatatgca cgtgaggcac actcacagac aagccacctt 4681 cacaaatcta cactcagacc atgcaccaac cacgcacccc cccgtactcc cccagggcac 4741 gcacctgccc agcatcctgg gacttgcacc tccttttcta ggcctcagcc ccatgatgct 4801 ctgggcagca gactcacctt gggtctgtga gggaagcctc ctcagatgcg ccaagccctg 4861 ggaggttggg gaggaggcgg gtctcgtccc actgggctcc cagccaaaca cctcatctcc 4921 taggaaggct agagatgacc tcggcctcag gctgactgag ccaggactgg gcccggccat 4981 gtccctgagt gctctgtcct tgcaggtttg gggacgacat ccctggaatg gaaggcctgg 5041 gaacaggtat gacggctgcc agtaccaacc cgaggtctgc ccaggtcccc ccaggcacac 5101 ccagcacgcc cggggcccat tgcagcccac ctcggccccc acctcccctg cttttcactt 5161 agggaccccc agcctgtcca tcacttcttc ccccaacact gagctccaga tccactggat 5221 ctagagggtg gggctgtccc catttgggac actttgcacc aggtgagggg aggggtttgg 5281 gaaagaacag aacccctcgc tgcctcctcc ctcagcaagc cccaccctcc tctggttccc 5341 tcccctggac ctgctgcacc caccctcctt ccccacagcc agtgtgtccc cagaggaggg 5401 ccccactcca gcgtccgcac gtccccgctc atcacactgt agggctacac ctgggctgtg 5461 ggaaacaggg ccacagggga tgggggggtc cctaggccgt gggatcctct ctgatccgtg 5521 gcccgtttct cccacagaca tcacagtcat ctgcccttgg gaggccttca accacctgga 5581 gctgcacgag ctggcccaat atggcatcat ctagcacgag gccctgctga agtccagacc 5641 ctccccctcc tgcccactgt gctctaaacc ctgctcagga ttcctgttga ggagatgcct 5701 ccctagccca gatggcacct ggacaccagg atgggactgc aacctcaggt 
ctccccctac 5761 atattaatac cagtcaccag gagcccacca cctccctcta ggatgccccc tcagggtggc 5821 caggccctgc tcaacatctg gagatacagg cccacccctc agtcctgccc acagagaggc 5881 ttggtcggtc tccactccca gggagaacgg gaagtggacc ccagcccggg agcctgctgg 5941 accccagatc gtcccctcct cccagctgga aagctagggc aggtctcccc agagtgcttc 6001 tgcaccccag cccctgtcct gcctgtaagg ggatacagag aagctccccg tctctgcatc 6061 ccttcccagg ggggtgccct tagtttggac atgctgggta gcaggactcc agggcgtgca 6121 cggtgagcag atgaggcccc aagctcatca caccaggggg ccatccttct caatacagcc 6181 tgcccttgca gtccctattt caaaataaaa ttagtgtgtc cttgcctgtc tgtggcatat 6241 ttgtgcatgg gacagggcct gtggtggggt cacttgagtt gagccggagc acatggggga 6301 aggggtgggc agtggggcag gaggggctgg cctgcctgag tgggaagcac gcccttgggc 6361 tcacactctc actgagcctt gctggtggcc ccaagacacg caatgtcagg ccatggctgg 6421 gtcaaggcag aaggccttgg tgcacaccca ggggccatag tcaacgaagc // LOCUS HSRPII145 2093 bp DNA PRI 07-DEC-1993 DEFINITION H.sapiens gene for RNA polymerase II 14.5 kDa subunit. ACCESSION Z23102 NID g397149 KEYWORDS RNA polymerase II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2093) AUTHORS Acker,J., Wintzerith,M., Vigneron,M. and Kedinger,C. TITLE Structure of the gene encoding the 14.5 kDa subunit of human RNA polymerase II JOURNAL Nucleic Acids Res. 21 (23), 5345-5350 (1993) MEDLINE 94089382 REFERENCE 2 (bases 1 to 2093) AUTHORS Kedinger,C. 
TITLE Direct Submission JOURNAL Submitted (24-JUN-1993) claude Kedinger, U184INSERM-LGME UPR6520 CNRS-Inst.Chimie Biol.ULP, Strasbourg, 11, rue Humann, Strasbourg, Alsace, 67085, France FEATURES Location/Qualifiers source 1..2093 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cells" promoter 1..372 exon 373..448 /number=1 mRNA join(373..448,533..587,834..907,996..1070,1181..1232, 1430..>1530) CDS join(390..448,533..587,834..907,996..1070,1181..1232, 1430..1492) /codon_start=1 /product="RNA Polymerase II subunit 14.5 kD" /db_xref="PID:g397150" /db_xref="SWISS-PROT:P36954" /translation="MEPDGTYEPGFVGIRFCQECNNMLYPKEDKENRILLYACRNCDY QQEADNSCIYVNKITHEVDELTQIIADVSQDPTLPRTEDHPCQKCGHKEAVFFQSHSA RAEDAMRLYYVCTAPHCGHRWTE" intron 449..532 /number=1 exon 533..587 /number=2 intron 588..833 /number=2 exon 834..907 /number=3 intron 908..995 /number=3 misc_feature 930 /note="That base was not found in an independent allele. The intron 3 can therefore be shorter by one base pair." exon 996..1070 /number=4 intron 1071..1180 /number=4 exon 1181..1232 /number=5 intron 1233..1429 /number=5 exon 1430..>1530 /number=6 BASE COUNT 386 a 653 c 627 g 427 t ORIGIN 1 gagctcgccc agtccgcgaa gggtaaactc gcgcatgcgc actggccttc aaccaagcca 61 agcgccttgg cttctgtcct ggatgctcag tgttttgcgg tttacgatgc gcatcgtttt 121 cccggctctg gcaccagatc catactattc tgtaaactga ggcggcgatg cccaaaaatt 181 ggaagcaatt actctaagat atcaaaagat tgaagcaagc catgaaacct ctgcgcggag 241 agagccgagc ccgaggcggc ggggagccag caagtgtgcg cagctccgcc ctcttttggc 301 gaagcaccta ggcctggccc ctcccgcgac ctgtagcgcg gcggagcaag cgcagaaggc 361 tgggagggct gcgcgggctg cgcgtcgcca tggagcccga cgggacttac gagccgggct 421 tcgtgggtat tcgcttctgc caggaatggt gaggcggagg gagcgggtga tgttagaagg 481 cgggcgagga atggctccag gactaacccc tggcacctcc ccctccccac agtaacaaca 541 tgctgtaccc caaggaagac aaggagaacc gcattctgct ctacgcggtg agcgccgggg 601 tcttccccgg caggtccacc tgttcccagc gtggccccgc cctgccccct gcctctgtga 661 tcggactccc tgttcgccct ccgctgcacc tcaagtgatc ctgactctca 
tgtcctgccc 721 atccccaaag agagtcctgt cgtcatatcc tccgggatca ccctgccctc tcctcccaat 781 cttgtctgga atccttccgt gacccccttt cccccacact cgccctctcg tagtgccgga 841 actgtgatta ccagcaggag gccgacaaca gctgcatcta tgtcaacaag atcacgcacg 901 aagtggagtg agtggcgggc cctgagctgg ggggcggggg tgttggctct ggaggctggg 961 tctgagcgta attttgcacc cccgcgtccc tgcagcgaac tgacccagat tatcgccgac 1021 gtgtcccagg accccacgtt gccgcggacc gaggaccacc cgtgccaaaa gtgagcgttc 1081 agctggtact ggggaggtaa agtgaggctc gtctcctcac tgggcgaagg gaggagggaa 1141 gggtgtgaag cctccggtga ccgagcccct accctctcag gtgcggccac aaggaggctg 1201 tgttcttcca gtcacacagt gcgcgggccg aggtaagtgg cgcgagggct ttctaagtat 1261 ctggggcttg tttggtcctt tctgggggcg gggatatcaa tgtgggtgac cctgcgcttc 1321 ttggaactct gccgggccct tacctattgg tgaagttctc ttgcctgttc ttttggggta 1381 aggcgctgct gggtccaaat ccttcctaac catgcatcat ttctgccagg acgccatgcg 1441 cctttactac gtgtgcacag ccccacactg cggccaccgc tggaccgagt gacctcctct 1501 ctcccccgag tgtaataaac accagattcc atgcgtgagt tttctgtgtt tatttgcctc 1561 gtggggagga cgctggcaca ccgtatcagg ctgcgcggtg cagggcgcgg tgctgtgcgt 1621 aggtgtcggg ccgggagctg gtgaagccgc agtcctcgca cacgtgcagc ttctcgcggc 1681 gctcacggta agcgtagctg gcccggcctg tccatgcacc ttagcaaggt gagcttccag 1741 ggagcagcgc tgcgtaaacg ctttcccgca agcactggag cggaagggcc ggatccctgc 1801 acagatgggc gcagagggga gggctggaca gagagggcaa tgggctctgg actaaagcgc 1861 ccccaccctg ctttccagtg gcgagcattt ctggtaccta ttccaagctc tacagactct 1921 tccgcccatg acccaccatc tgctaccgac acccagtgga gacatgggac agacacctgt 1981 gtctccactc tcccacacca ggatgctccc tcccctgggg gatggagagg gaggccttcc 2041 cttgcctcgc gatctggagc ctgaatcgta taaccggcat ccgtctacac tct // LOCUS HSRPS3AGE 5529 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens RPS3a gene. ACCESSION X87373 NID g854178 KEYWORDS human ribosomal protein S3a; RPS3a gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 5529) AUTHORS Nolte,D., Taimor,G., Kalff-Suske,M. and Seifart,K.H. TITLE The human S3a ribosomal protein: sequence, location and cell-free transcription of the functional gene JOURNAL Gene 169 (2), 179-185 (1996) MEDLINE 96194798 REFERENCE 2 (bases 1 to 5529) AUTHORS Nolte,D. TITLE Direct Submission JOURNAL Submitted (23-MAY-1995) D. Nolte, Institut fuer Molekularbiologie &, Tumorforschung, Philipps-Univ. Marburg, Lahnstr. 3, 35037 Marburg, FRG FEATURES Location/Qualifiers source 1..5529 /organism="Homo sapiens" /note="single copy" /db_xref="taxon:9606" /cell_type="lymphoblast" /clone_lib="EMBL3/human genomic lymphoblast DNA" /clone="16/2" /chromosome="4" exon 362..448 /number=1 mRNA join(362..448,1212..1315,1702..1889,3598..3806,4911..5020, 5209..5374) gene 387..5330 /gene="hRPS3a" CDS join(387..448,1212..1315,1702..1889,3598..3806,4911..5020, 5209..5330) /gene="hRPS3a" /codon_start=1 /product="ribosomal protein S3a" /db_xref="PID:g854179" /db_xref="SWISS-PROT:P49241" /translation="MAVGKNKRLTKGGKKGAKKKVVDPFSKKDWYDVKAPAMFNIRNI GKTLVTRTQGTKIASDGLKGRVFEVSLADLQNDEVAFRKFKLITEDVQGKNCLTNFHG MDLTRDKMCSMVKKWQTMIEAHVDVKTTDGYLLRLFCVGFTKKRNNQIRKTSYAQHQQ VRQIRKKMMEIMTREVQTNDLKEVVNKLIPDSIGKDIEKACQSIYPLHDVFVRKVKML KKPKFELGKLMELHGEGSSSGKATGDETGAKVERADGYEPPVQESV" intron 449..1211 /gene="hRPS3a" /number=1 exon 1212..1315 /gene="hRPS3a" /number=2 intron 1316..1701 /gene="hRPS3a" /number=2 exon 1702..1889 /gene="hRPS3a" /number=3 intron 1890..3597 /gene="hRPS3a" /number=3 exon 3598..3806 /gene="hRPS3a" /number=4 intron 3807..4910 /gene="hRPS3a" /number=4 exon 4911..5020 /gene="hRPS3a" /number=5 intron 5021..5208 /gene="hRPS3a" /number=5 exon 5209..5374 /number=6 BASE COUNT 1484 a 1003 c 1327 g 1715 t ORIGIN 1 aatgctctca aaaagcaagg tttgggggtt agacggtgag ttcagctttg aacgatgttg 61 agtgggcgtc tcgttttaca gaaacttact tcccgacgtc acagccacag aaaccacagg 121 taaccagacg cctgcacgtc aggccaccga cgccttggtc tacacagcgc acgctccagt 181 gagcccgcgg catgccgcgc ggcctcccca cacacgatgg 
cgatgtcgca accacagtag 241 aaagagtggc gctctctact gtcccggaat tcttgaggcc ggaagaataa cagggcaaag 301 gtcacgtaga cggcgcgccc cgcccccgta cgcctaagtt ctcgcgcgac tcccacttcc 361 gcccttttgg ctctctgacc agcaccatgg cggttggcaa gaacaagcgc cttacgaaag 421 gcggcaaaaa gggagccaag aagaaagtgt aagtcgcgac tgtcgtggcg tcttgctttt 481 tgggggtctg ctggaatcgg cgggctggtc ctagatcgcg gcgtaggcat gcggaggatt 541 ccagtcggat tgtgcgggcc gtggcgggtt ggtgcaccca ggaacgcggc ctggagggtc 601 ggtgttgagt agcggcaggt cggctgcctt tcccaggcct tctggtcctt gcgcgtcgcg 661 tttgacggct tgccggcgcg actcggctgg aatgttagtt ttgtgggctc acgagctgcc 721 tcaattctgc gagtgtttga tggcactgcg acagcaggag ttgaggaaag cacccttcta 781 tccattcagg acccgtcaat ctcgcctttg cttggtcccg ctggaatctg cgagctcctg 841 ctaagctttg gggccatgca gtctcttagt taccagtaca tgccattacg tgccagcaaa 901 gttaatgaga ttgcatgtta ctcgtgcctg tgtagtggaa ggaccgatgc atgttattct 961 aagtgcctgt gtagtggatg gaccgatgag ggtgggaagt ggacataacc tttcaaagtc 1021 tgatgcatgc agtgaaacgt acagtgggaa gagctagatt tatgtttgtt cccctcacct 1081 cactgtgaga tatactttgg tatcaaattg ccggttagct ttttaaaata ttaatgggaa 1141 ataccattaa cagtgaaaaa tacagtaaga attgatctgc aatggaactt aagcatcttc 1201 ttaaaatcca gggttgatcc attttctaag aaagattggt atgatgtgaa agcacctgct 1261 atgttcaata taagaaatat tggaaagacg ctcgtcacca ggacccaagg aaccagtaag 1321 tagcttattc ttggtttgta ttttccttaa gttggcgctt gtatattgta gcaggcatct 1381 tggtctgttg ttggtcctgt ggtgggagac tttaaggaaa tgtcagctct tggttgcagc 1441 cttgcgacct gtgggtatca tttttagtac agcacttgga gagaagcaag atggtgcttt 1501 ttgaattaag aaaatgtgcc atgggtagtt ctgagcttca aataagtact gttgataagt 1561 cactggtgtc tggatagatt tgataacaaa ggttaaggtc tttataattt accattaact 1621 taatttctac cctcagttta caggcttgac tttgcttata tggttcctaa atgttaagat 1681 acttgttttt ttcttttgta gaaattgcat ctgatggtct caagggtcgt gtgtttgaag 1741 tgagtcttgc tgatttgcag aatgatgaag ttgcatttag aaaattcaag ctgattactg 1801 aagatgttca gggtaaaaac tgcctgacta acttccatgg catggatctt acccgtgaca 1861 aaatgtgttc catggtcaaa aaatggcagg tgagacccaa agttccttag agtggttgag 
1921 ctatgggtgt ttgaccaagg atagcatggt ttgatgatcg gaaggagcaa gttcatttta 1981 tttatttatt gagaccagtg aaatgtccac aaagttcatt ttaaattttt gttttcttgc 2041 cgttgttgag tgctaactct ggggttgcca caccatattt tacttactat ttaaatctaa 2101 gatttaatag tggtgtatca tcaaagtcag attttcttgc tgcatagatt agtggttttt 2161 tgaggaacgt ttaggtgcag gtgcaggtgg gcggttttgc ctgtcttaag gatgtttttt 2221 gacctcagaa ttgagtgaat gttgattaac tgtggcagta gagcataatt gtggcagagc 2281 tctggagtga ggttaaaatc tgggctccat ttgcctattg tgttctcgtt ggaaagtaac 2341 ttgttttcta agcctcggtt tctttatttg taaagtgcag ataagagtgt cattttattg 2401 aattgtgagg atgaaatgaa aatatttcag atgtgtctgg ctaactatcc agttggaata 2461 ttttgtggga gttttttgtt tttggagatt gagtctctta tcgtccgggc tggagtgcag 2521 tggcgccatc tctgctcact gcaacctcct cctcctgggt tcaagggatt ctcatgcctc 2581 agcctcccaa gtagctggga ttacagcacc cgccaccacg cccagctaat ttttgtattt 2641 ttagtagaga caggtttcac catattggcc aggctggtct cgaactcctg acttcaagtg 2701 atctgcccgc ctcagcctct caaagtgctg ggattacagg tgtgagccac tgcacctggc 2761 acagttcgaa tattttgtat gtgggaatga atgatgacaa aatgtttcag tcccaaatga 2821 tacatactga ttataccatt atatttatcc tgacattcct ctaaggcttt atggtttaca 2881 tttctagaaa tattttaagg tcccttagct gtaaattgta tttgattagt ttttttttat 2941 actaaggtat aaaatttaca aaatgcaaac aaggtaaatc agttttcatc actacccaag 3001 atgttttttg ttttttcaat ttaggcaata atatgaccac taactagggg tttttaatta 3061 tattttagct ctgaaattgg gtatttaatc atcagcatgg ctttagtttg gaacataaaa 3121 gacgtctttc aaagtgtgca tttagagtta tagcagtcac ctatttacaa cacagttttt 3181 tgaaaaatga cataatcact gtagatgttc ataccataga ataccatgga tcctctcaga 3241 cctattttcc tctggtgtag ttttacactt gagggggtat gagctgttta aaggggtata 3301 tgtatatgca atggatgtgg gtttggatgc cgtcaagata gtggtagtgt taatggaggg 3361 tagaaaggag gagggagggc agttgagagg acctggttat ttaattgggt aagaatttgg 3421 gggtaacttg gtaagttcaa ggaagttaat tctgtaaact taaagagatg ttacttatgt 3481 taggatactt tatatttaat aatgagtgtt tgacaatcca gctttttaaa attagaactt 3541 gagggataaa tggataacgt aaaacctgtt aaatctgacg gtatcttttt tctccagaca 3601 
atgattgaag ctcacgttga tgtcaagact accgatggtt acttgcttcg tctgttctgt 3661 gttggtttta ctaaaaaacg caacaatcag atacggaaga cctcttatgc tcagcaccaa 3721 caggtccgcc aaatccggaa gaagatgatg gaaatcatga cccgagaggt gcagacaaat 3781 gacttgaaag aagtggtcaa taaattgtaa gtgtttcttt gcttcctcac acaacacaac 3841 cttgatgatt ggattattcc tgagatgaga gaacgcatat gagacaaggt aaaggtctgt 3901 ggaaatcctg tctgtgaatc cttctagcta tatctcttta agtgaaagag tgttaagtac 3961 tcagtaaata tgattattat tactattatt atttgagtca gagtcttgct ctgttgccca 4021 ggctcgagtg cagtattgtg atcctccttg gctcactgta accactgctt cctgggttca 4081 agcagttctt gagcctcagc ctcctgagta tctgggaata caggggactg ccaccatacc 4141 cagctaattt ttttaaattt ttagtagaga tggggtttca tcatgttggc caggctggtc 4201 ttgaactcct gacttcaggt gatctgccag tactctgtaa atgataacag ttttttcgtg 4261 tttatttatt ttgaatgaag ctgtctcaca gtagatggag ttgaaggaca ggaaatgttt 4321 ttcccctagt tggaaaatac actgaataag ttgagtgggg tgggatgtgc ctggagtccc 4381 agctactcag gaggctgagg tggtaggatt gtttgagccc aggagtttga ggccagcctg 4441 ggcaatatag ggagaccctg tcccaaaaaa taaaaaatat acgtatatat atatacacac 4501 acaaagaaaa aatacactga atagacaaaa cctttcatga ttaatgatgc acgggaataa 4561 gtgatgaaaa aagtttcggt cccagatgat ggccagtgat aacaacattt ttctgatgtt 4621 cccatgcaat atacagttag ctaagagggt gtaatggaaa aagcataagg cttggactca 4681 gaagactcta ctaactttgc cactagctag ctatgtaatt cagatcatct atcctttaca 4741 tgtgaaaggt aaataatggc ttatcttaac aggaggattt atgcaggtta aatgaggtag 4801 gtgttatgtg taggtttatt ccaaggcttc tctactttta aaggaaatgg cttatcttat 4861 atctgagaac taggactttt agaaaaaaat ttactgttac tggtttgcag gattccagac 4921 agcattggaa aagacataga aaaggcttgc caatctattt atcctctcca tgatgtcttc 4981 gttagaaaag taaaaatgct gaagaagccc aagtttgaat gtaagtgaga aatcacatga 5041 ttcctgtagg gccaaataca ttgtttttgg gtggaggagg agtgtggggc catatcatgg 5101 ccttcttttt cttcctgtca tgcttgcata gtagtgatga ccattatttc aagatatact 5161 aacagttttt tggttttttt tttttttttt tttttttttg ccttttagtg ggaaagctca 5221 tggagcttca tggtgaaggc agtagttctg gaaaagccac tggggacgag acaggtgcta 5281 aagttgaacg 
agctgatgga tatgaaccac cagtccaaga atctgtttaa agttcagact 5341 tcaaatagtg gcaaataaaa agtgctattt gtgatggttt gcttctgaac attctttttt 5401 taaaaataat ctgacagctt ggtggattag acagtaaatt tgacagctgg taaacttttc 5461 tgacccagac aaattggata taaacatact actatgcacc tttactgtga agctgaaatg 5521 gggcggaga // LOCUS HSRPS7 7513 bp DNA PRI 18-MAY-1996 DEFINITION H.sapiens gene for ribosomal protein S7. ACCESSION Z25749 NID g550116 KEYWORDS ribosomal protein; ribosomal protein S7; rps7 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6544) AUTHORS Annilo,T., Laan,M., Stahl,J. and Metspalu,A. TITLE The human ribosomal protein S7-encoding gene: isolation, structure and localization in 2p25 JOURNAL Gene 165 (2), 297-302 (1995) MEDLINE 96096538 REFERENCE 2 (bases 1 to 7513) AUTHORS Annilo,T. TITLE Direct Submission JOURNAL Submitted (18-AUG-1993) Annilo T., Tartu University, Biotechnology, Jakobi 2, Tartu, Estonia, EE2400 REMARK revised by [3] MAT REFERENCE 3 (bases 1 to 7513) AUTHORS Annilo,T. TITLE Direct Submission JOURNAL Submitted (08-SEP-1994) Annilo T., Tartu University, Biotechnology, Jakobi 2, Tartu, Estonia, EE2400 REMARK revised by [4] MAT COMMENT Related sequence: M77233. 
FEATURES Location/Qualifiers source 1..7513 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Lymphocyte" /clone_lib="human lymphocyte library (Stratagene)" /chromosome="2" /map="2p25" mRNA join(1759..1830,2082..2165,2297..2368,2964..3107, 4189..4253,6583..6733,7277..7375) /gene="rpS7" gene 1759..7375 /gene="rpS7" exon 1759..1830 /gene="rpS7" /number=1 intron 1831..2081 /gene="rpS7" /number=1 exon 2082..2165 /gene="rpS7" /number=2 CDS join(2091..2165,2297..2368,2964..3107,4189..4253, 6583..6733,7277..7354) /gene="rpS7" /codon_start=1 /product="ribosomal protein S7" /db_xref="PID:g551251" /db_xref="SWISS-PROT:P23821" /translation="MFSSSAKIVKPNGEKPDEFESGISQALLELEMNSDLKAQLRELN ITAAKEIEVGGGRKAIIIFVPVPQLKSFQKIQVRLVRELEKKFSGKHVVFIAQRRILP KPTRKSRTKNKQKRPRSRTLTAVHDAILEDLVFPSEIVGKRIRVKLDGSRLIKVHLDK AQQNNVEHKVETFSGVYKKLTGKDVNFEFPEFQL" intron 2166..2296 /gene="rpS7" /number=2 exon 2297..2368 /gene="rpS7" /number=3 intron 2369..2963 /gene="rpS7" /number=3 exon 2964..3107 /gene="rpS7" /number=4 intron 3108..4188 /gene="rpS7" /number=4 exon 4189..4253 /gene="rpS7" /number=5 intron 4254..6582 /gene="rpS7" /number=5 exon 6583..6733 /gene="rpS7" /number=6 intron 6734..7276 /gene="rpS7" /number=6 exon 7277..7375 /gene="rpS7" /number=7 3'UTR 7376..7513 BASE COUNT 1885 a 1506 c 1904 g 2218 t ORIGIN 1 gaattcggat cccacagctt gtaagatgca gagccaggat tcagacctaa acctgtatgt 61 ctccaggttc tagtttactg cagagttggc ctttaggatg ttcatgacca tcaggtgaag 121 acaataaagt tacagatgct atgattagtg tttatgcagg ctatagtaag ggtataaaag 181 taacatattt aggaaaatgt ttgagaagcc tttggttttc tttagagtac agcccatctt 241 tttgtatctt gagttttcta cctgtaggtt tgcacctcag ttggtatgtg cattggggga 301 tccaagacaa ccctcaggtt taatcattca ctagaaggac tcaactgaga aaagcagcta 361 tactcacagt tatgttacat tatagtaaag aatactcaaa gcggtaagga aaaggtactg 421 agttcagggc agagtccaag agagaccagg cacaagcttc cagttgttct ctgccagtgg 481 agttttatgg acaacactta attctgtcgg cattggtgag tgacagtgca tatagaatat 541 tgtccaccag ggaagttcat ccacgtcttg gtgtccagaa ttttgttggg 
ggtcgattct 601 gtagtcatgg aacacctgca tgactgacct tagttaatca gtctttagcc tccagaaatc 661 aaactacatg atccagggcc ccaccatata acattgttag cctaaactat ctggcgtggc 721 ccaatgtctg gtaaacagcc actcatatca ggcaggacat tccaaaggcc tagagatgat 781 ctcttagaag ttggttgaag gccagctctt tatttgaaat gtgcagggtt tatttgaaat 841 gcgcagggtt tgaacatcaa cactttaatg cacagtatca ttgataaatc tttaaaccac 901 agggatcctg gggctcttcc gtgtctcagt ctaatgtcat tcgttccagg attcccgact 961 tcttaaagca taaataatcc ctccccaccc tctcattgta ctgttatgta agttattaca 1021 atatgtcatt atatatttag tcatactgct ttaggtaatg tcttctccac tgaactgtaa 1081 gctccatgag gcccgggggt cagtcggttt tacttaataa ttagcaccta gtacagtact 1141 agcatagaat gaaggcctcg caattttttt taaatttatt tttagacagg gtcttgcgtg 1201 tcgcccaggc tggagtgcag tggtgcaacc tcggctcacg gcagcctcga ctttcggctc 1261 cagcgatcct cccgggtcgg ctccggggta gctgggactg caggcgcgca ccaccatgaa 1321 tggctaattt tttttttttt ttttttgtag acaggggttt ccccatgttg cccaggctgg 1381 ttcctgagct caagtgacct cctgcctcgg cctcccaaag tgctgggatt acaggcgtga 1441 cgtcagcgcc cagccaagtt agcctttttt aaacgtcctg tctccggagg ttgccgaagt 1501 tggttttctt cggcctcctt ctctctccca ggcccagggc tgggacgagg ccggttcccg 1561 cctgcaacct gcactgaaga cgggaacctt gggagccggt accggaacgc tcggaaaccg 1621 caccaaagta cgaatcctag gccggaaaag cgttaccaag acactcgtcc ccagagccgc 1681 ttgctgggac tctctagcct cctaccgctt ctcagtgatg ttccggtttg gcccctcctc 1741 cgcgccctgt ttccgcctct tgcttcggac gccggatttt gacgtgctct cgcgagattt 1801 gggtctcttc ctaagccggg gctcggcaag gtaggttggc ggcctgctct ccgacagaac 1861 ttttcttctt gggttgagga aaacgccctt ttggagtcag gccctggagg ggcgagcctt 1921 gctcacaggg tggggataca gccgattacc cgccctgtgc tttccgatgg cttctgcggg 1981 gcgagcgggg cctggccggg gggtgcgggc gggagggcga gccagcggcg cctgcagccc 2041 gggccgcgta acgctgaccg ctgtgccttc agttctccca ggagaaagcc atgttcagtt 2101 cgagcgccaa gatcgtgaag cccaatggcg agaagccgga cgagttcgag tccggcatct 2161 cccaggtgag agggtttccc tggggtctgg ggtgggggga ggcgccccgc ccggggggga 2221 ggcggccgcg cgtgtgttgg gcccggggtg ctcggacgcg cgctcagggt cggtcctgct 2281 
gttcgttgct tcttaggctc ttctggagct ggagatgaac tcggacctca aggctcagct 2341 cagggagctg aatattacgg cagctaaggt aagctggcgc tccctcggct gggagggagg 2401 ttgccccgcg tctccccgcc cagtggccgg gagggttgga cctgggttac ggttgatgat 2461 gacttgcccc cctggccgcc taacctgaac tcaggttcgg ccgtgttgtt tgggagtgat 2521 accgcccagg tgcgcggagt gggcttgcag gtaccctgcg gcttaagctg tccacatgcg 2581 ggcggtggag aaacgtggag tggggagggc ctgggcattt ggaccttatt tgcccttact 2641 tagttataga tacggcaaag agctgtgggg agctcaggat gagattctct ctacccttag 2701 cccggagttg ccgcaagtgg gggtgcagaa gtggtagtga ttgcctcctg gcctgaacag 2761 tctcccttcc tggaataagg aaatgtgaag gttggagaga gatgagaaca tttccgaagg 2821 atgcatttat gattaactca aaactagtat tagtttgtag tgcagctacg gtgttagtga 2881 taaggtcttc ttatcctcta atttgaccac aacgtttact ttctgaagca gttaacacag 2941 tggctttttg tttttttctt taggaaattg aagttggtgg tggtcggaaa gctatcataa 3001 tctttgttcc cgttcctcaa ctgaaatctt tccagaaaat ccaagtccgc ctagtacgcg 3061 aattggagaa aaagttcagt gggaagcatg tcgtctttat cgctcaggta tctgttctac 3121 tgttgcagca cgtttctgtt tgtgaatttt gctaaaattg cttgtattta gactgcattg 3181 gtagttggag tcatgaaaac aatcctttta tgaatccaat tgggtagaaa gataaagtac 3241 aggcagagcg ccgtggctga ggcctgtaac cccagcacgg gggggtggag gcgggtggat 3301 cactgtagag accagcctgg gcaacatggt aagaccctgt ctacaaaaaa tagaaaaaat 3361 aagttttaaa aagaaagata aagtacgttc tttaattaag agtcgaaaca agaagtctgt 3421 aaagtgacag actgctgtgt tttgagttat gaaaatgatt tcccatattt aaatttccac 3481 aagtctagtg ggttgcttac atagatgatc aaactaagaa acctgttaca ggccgggccc 3541 tgtggctcac tcctgtaatc cccgcccttt gggaggctga ggcaggcgga tcacgaggtc 3601 aggagatcga gaccatcctg gctgagtggg tgaaaccccg tctgtactaa aaaatacaaa 3661 aaattagctg ggcgtggtgg tgggcgcctg tagtcccagc tactcgggag gctgaggcag 3721 gagaatcgcg tgaacccggg aggtggagct tgcagtgagc cgagatcccc gccactgaac 3781 ttcagcctgg gcgacagagc gagactccat ctcaaaaaaa gaagaaacct gttacagttc 3841 ggatggggga ttggtgcaaa ctgaagtcta tcgggaatga agtgccagga ttgatgtctg 3901 gaataaaagt agtattctgg gtcataggca tgggaaaggc gcatctggga gattttgtcc 3961 actggcgtat 
tagtagtggc tgtggattct gaatgattta ttcaagaatc aggaagtaac 4021 tccatagaag ggtttgctca gtcaattgtt cgtctaagtt gtttagcctt ctcgaacttt 4081 gaacttaccc tgccattctt cttgctttta aagcagtatg gcagttacag ctttttgtca 4141 atttaaagtc ttttttcatt ttgttacatg ataattttta ccttacagag gagaattctg 4201 cctaagccaa ctcgaaaaag ccgtacaaaa aataagcaaa agcgtcccag gaggtgagta 4261 ttttagtagt ttcagaaatg tgtgtacccc tcttattaac aactcttaat ttgtttaagt 4321 tgtagtttat gaaaacagat gttcaagtgg gaatttttga gtagcagcat ttggtttcct 4381 gtatagttgc acatgatacg ttttagaatt ctagatgtaa aacatacatg tattcatgta 4441 gccattgttt gcctactagg tcagtgctgt cacagatgag atgattcctc gattattggc 4501 acaggatttc ctccgattat ttgtgcacag gctcggaatg atgagaagga aatcattact 4561 ttaattctgt gatatagcat gctagacaca gttgacatgg gcttggattt atctttgatt 4621 tttatttttg ctgtggattg gaggtaaaaa aatgtaggtg gattttggtg ataacatcac 4681 aggttaaaat atcactgcag ttgaagatga ctagatggaa atgtttatac ttgattgact 4741 tacaggttat ctggcataaa tctcaatatg agtgtctctg acagtctttg tatttgttag 4801 tctctgatta gtatgttgtt tacaaatgtg aactattaaa tccctgtgga ctcttcccta 4861 aggatgagtt gaagaaaaaa aaaatcctag tgttgcctgt gccctctaac atgtttaatt 4921 tgtattgtta ctctcagtgg atgaggaaac tgatccaagg ctgggataat ttgctgattg 4981 cagctagtac tagtagtgct aagaatgttt tttttttaac tcggtcttgt tctgaaaacc 5041 actagaacac agtgttggag tctaatgtgt tattgtgtcg agctgtggta tgtggaaagg 5101 taggtgccag catttctaga acttgaattc tatggcttag tactttgaaa aacttttaaa 5161 ggcagagtgg cataagaatt taggattttt tggccctaaa tcatgacact taaagagggt 5221 tttggaaact cagttctgag aacctcagtt aggaaatgag tataagaaca gtcccagagg 5281 actgtccctc aaatccatag ctttagtttc cctcagtaca aacctctgtt ttccaaattt 5341 tctcagagaa tggcttcagt gtttctagtg ggactgtaca ttgaaagcct tcacactaag 5401 cctagctggt aggtcatagt gattctacca cttttccaat ccggcttcct gctttcagat 5461 ttatttttcc ctattcagta gagctatctt tatctaactc aggttaaatg ctgaaattga 5521 tagtttttgg gtatagcagg gtttttcctc taaggccctc cagtccctgt ggtttgtgat 5581 gattgagtta tggtgccatt gctccccatt ttaaaagatt gtagatgaca ttggaatcta 5641 atttccgtat ttgggtgctt 
tcctagtatt gtaagaatat gcttaattaa tgagtcgcaa 5701 tgaataatcc gtttgtcttg atttgctttc taaaggtgat gagggttttg gaaatggact 5761 ttgttcaggc ttttgccctt tgtggctgtg acatattaag atgttggcag taaaattgag 5821 agccaccaga agatgttaac tgtgtatatg ggcgctgcca gataggaaat ccaaatgagt 5881 agactatagt gttattgtta gctactatgg ggtgagacgt agacagataa ccatatgtaa 5941 ggcagctccc taaatgctga aacaaagaac cgtctcttga cttactcctg atgtctcatg 6001 gtacatggta gctgccccaa aagcttgcaa ttaatttgaa acttctcatt tgttaggtgg 6061 caggtaggta ggttttgaac ttggtagtga tattcagacg aggagcaaga ttccactggc 6121 agttctgtga tgctaagtaa atgatgtgat catatccgtg ctttagaaag attatcttga 6181 cattggaagg gggtggtttt ggagatgagc aagagtgtta aaggttattg tagtgattta 6241 gttgagaaat aatacaagtg aaaataaaga tgtataaaag tcctctaatc tcacattccc 6301 tagagatggt cactcattgc cttggtgagt ttgtagttta tcccagagta acatttaaaa 6361 ttgactgagc ttttgaaata aatcagtaag ctattaccct attctaggta ggcaaaggct 6421 ttagtttggt acgtgaaata taacaagctt atttgaaaat gagtgttaat ttgacttcag 6481 gaacctggga tttgcatttt cattttgact taaagaggtg ccctctggag ttgcccagag 6541 ggggcgttgc taggtggctt accttttaaa ctattctttt agccgtactc tgacagctgt 6601 gcacgatgcc atccttgagg acttggtctt cccaagcgaa attgtgggca agagaatccg 6661 cgtcaaacta gatggcagcc ggctcataaa ggttcatttg gacaaagcac agcagaacaa 6721 tgtggaacac aaggtaatag gtcaacattt atcatggaaa ggttcagcca cagtgagagt 6781 ggattttagt gtaaccagtc tccatgcgcc aaccatagca atgactgtag taaactcaga 6841 ctagttcttt tacccgcact tcagcctgct ctcctttgga ttatgtgtca ggtgttaaat 6901 ggagatgttt caagaattga acacttgaaa ttctctgtac cttttggaag tagactcttt 6961 ctgtggtctt ttagttaggc tgtatattct tggtagttag gggtgggtgg tgatgggatc 7021 agtgtcttgg gcccacataa ccatgtgggt gactgctggg gtacccctga tggcttcccc 7081 ggtgcagtgg tgtacagttc tgtcccacag cattggagaa gagcttgtcc ccggtcgtga 7141 agactgctgc agacatgttg tgtgtactta gttgctgagg agaaaaacaa tacagggcaa 7201 caatttcaca aactattagg ttttaagctg agtgtgtatt tcaaagttct gtgatgaatt 7261 ctttcttttc ttgtaggttg aaactttttc tggtgtctat aagaagctca cgggcaagga 7321 tgttaatttt gaattcccag agtttcaatt 
gtaaacaaaa atgactaaat aaaaagtata 7381 tattcacagt actctgtttc agttatgttt ttcaaaattc caaattcacg gatgcgcagc 7441 tgtcttcatt atcagtggcg tcctgtgtgg gccagaggat ttcagtagga ggggtgtctg 7501 tgccagaaag ctt // LOCUS HSRPS8 3530 bp DNA PRI 22-JUL-1992 DEFINITION H.sapiens rpS8 gene for ribosomal protein S8. ACCESSION X67247 NID g36149 KEYWORDS ribosomal protein; ribosomal protein S8; ribosomal protein small subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3530) AUTHORS Fried,M. TITLE Direct Submission JOURNAL Submitted (29-JUN-1992) M. Fried, Imperial Cancer Research Fund, P O Box 123, Lincoln's Inn Fields, London WC2A 3PX, UK REFERENCE 2 (bases 1 to 3530) AUTHORS Davies,B. and Fried,M. TITLE The stucture of the human intron-containing 58... JOURNAL Unpublished FEATURES Location/Qualifiers source 1..3530 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="male leukocytes" /chromosome="1" /sex="Male" /map="1p32-34.1" mRNA join(228..254,685..791,1326..1425,2263..2438,2649..2778, 3225..3389) /gene="rpS8" prim_transcript 228..3389 /gene="rpS8" exon 228..254 /gene="rpS8" /number=1 gene 228..3389 /gene="rpS8" CDS join(251..254,685..791,1326..1425,2263..2438,2649..2778, 3225..3334) /gene="rpS8" /codon_start=1 /product="ribosomal protein S8" /db_xref="PID:g36150" /db_xref="SWISS-PROT:P09058" /translation="MGISRDNWHKRRKTGGKRKPYHKKRKYELGRPAANTKIGPRRIH TVRVRGGNKKYRALRLDVGNFSWGSECCTRKTRIIDVVYNASNNELVRTKTLVKNCIV LIDSTPYRQWYESHYALPLGRKKGAKLTPEEEEILNKKRSKKIQKKYDERKKNAKISS LLEEQFQQGKLLACIASRPGQCGRADGYVLEGKELEFYLRKIKARKGK" intron 255..684 /gene="rpS8" /number=1 exon 685..791 /gene="rpS8" /number=2 intron 792..1325 /gene="rpS8" /number=2 exon 1326..1425 /gene="rpS8" /number=3 intron 1426..2262 /gene="rpS8" /number=3 misc_feature 1492..1614 /gene="rpS8" /note="ALU homology" misc_feature 1737..2017 /gene="rpS8" /note="ALU homology" exon 2263..2438 
/gene="rpS8" /number=4 intron 2439..2648 /gene="rpS8" /number=4 exon 2649..2778 /gene="rpS8" /number=5 intron 2779..3224 /gene="rpS8" /number=5 exon 3225..3389 /gene="rpS8" /number=6 polyA_signal 3360..3365 /gene="rpS8" BASE COUNT 736 a 902 c 1044 g 848 t ORIGIN 1 agatctgccg acgccgcaga cccccttccc ccagcccagg gtcggaggcg agtcgccggg 61 ggtcggcaga ggtcagcagc ctgcgagacc ctcccaggct cgtggcccta ggcgcggcag 121 ctgacacgta agtctcgtcg cccgacgttg tttcggcgct cagaaacaac gtaaagtaaa 181 ggggcggggc agcgttttac aaaccgaacc gtgaatcttt gcggtttctc tttccagcca 241 gcgccgagcg atgggtgagt gtgctctgca ttgaggcggg tgaagggagg ttgagctcaa 301 tcaggcccca tcctgcttca cagcctactg aggagtccag acgccccgac ccccagccag 361 ggacagccac gaggagtggt ggccctcggg tttcggccgc ccagcccgtc tctgggggct 421 gcgaaggctc tgggctccgg gttccgggcg cgggctgcgg gctccgacgg gcgccaacat 481 ggctgccgcg aggaggaggc cccggcctgg ccgcacgtgt atgatgacaa ctcggtaatg 541 ctgcatactc ccgagtgcgc ggtggggaag ccaaccttgg agagctgagc gtgcgaccgg 601 ccggcgcggg ggtctccggg agctggcgag tcgctagcac cgagtcacag tggctcaagc 661 ttccttcccc gcttccacat gcaggcatct ctcgggacaa ctggcacaag cgccgcaaaa 721 ccgggggcaa gagaaagccc taccacaaga agcggaagta tgagttgggg cgcccagctg 781 ccaacaccaa ggtgggtgcg agcgtgggcc tgtccgcctg ggaggtcgcc ttcccccgct 841 ctccagcgtg ctcgggtctt tccgtgtgac agttgtgcgt tctttcttgg gctctgattt 901 ctctgtagta gggcctgggt gtcctgtccc cctttagctg ttggataagt aagtaccaaa 961 gagggaatgc tccctcaggc ccccaaacct ggagcttagg atttcagaga gaaggatact 1021 gtgtggggac ttgagcttct ggagagggtg gcgcctcgcg tcatagtcag aagcctagtc 1081 ttgttttttt tacagaggct taattttcag catttggggt caggctttcc tcttggaggc 1141 aagtagggtg atgaaaaaga atccttaggc gtggttgtgg ccgtcttggt cacctgtgtg 1201 ccacttgcca atgcaaggac ttgtcatagt tacactgact gttgcctcct cccggcccag 1261 gttcctctcc ctcacttgcc ttgctctcct tggtaaccta gttcctgtaa ccttgtgttt 1321 tccagattgg cccccgccgc atccacacag tccgtgtgcg gggaggtaac aagaaatacc 1381 gtgccctgag gttggacgtg gggaatttct cctggggctc agagtgtgag tgaggccctt 1441 tgggagtggg tgggaaaacg cacctaaacg gtcttaagat 
tcaccaagtg ggcctggcgc 1501 ggtggctcac gcctataagc ccagcacgtt gggaggccga ggcgggcgga tcacctgagg 1561 tcaggagttc gagtccagcc tgggcaacag agcgagactc tcagtaaaaa aaaagattca 1621 agtgctttta gcaagtagtc tgtggcttag accaaggcat ttgaagtttc tccttgctga 1681 aagatcttaa ggtagctggg gagttctctc cacccaggct gtcctgctcc aatctctttt 1741 tttgagattg ggtctctgtt gctcaggctg gagtgccagt ggcgtgatct tggttcactg 1801 cagcctccgc ctcctgggtt gaagtgattc tcctgcctca gcctcccaag aagctgggat 1861 tacaggcgtg tgccatcaca cccggctgct tctgtatttt tagtagagac ggggtttcac 1921 catgttggcc aggctggtct caaattcttg gctttaagtg atccacccgc cttgacctcc 1981 caaagtgctg gggttacggg cgtgagtcac cgtgcccgac ctgctctgat ctttctgaac 2041 tctgcagcct gagagattgg ggctggtaaa gactgcgggt tgccaaacat aactagaaac 2101 gtgggttagg gggttgtgga agacgaactg atgccccatg gcttgtaaag gctgagagtt 2161 cccttatttc tccttaaggc ctcatggggc tgaagaacct ggaaggaagt gtgccaggcc 2221 attgtcctca gtttacttat gccaaatttc tcctgtgagc aggttgtact cgtaaaacaa 2281 ggatcatcga tgttgtctac aatgcatcta ataacgagct ggttcgtacc aagaccctgg 2341 tgaagaattg catcgtgctc atcgacagca caccgtaccg acagtggtac gagtcccact 2401 atgcgctgcc cctgggccgc aagaagggag ccaagctggt gcgtgttact tccctgtagg 2461 ggttgtgggg agggcagcct gactccagcc ttctcgtgat gaaaactctg tccagttctg 2521 ctactgaagg gagagagatg agagcctttt aggctgagga aggccagcac tggggtgtgc 2581 agggttcgag aaagctccca gggcctgcct tccttccctg agctcatata tttgtatccc 2641 cttttcagac tcctgaggaa gaagagattt taaacaaaaa acgatctaaa aaaattcaga 2701 agaaatatga tgaaaggaaa aagaatgcca aaatcagcag tctcctggag gagcagttcc 2761 agcagggcaa gcttcttggt gagaaggctg ttgtgttgga ggtggggagt cgcagagatt 2821 gagtgtgccg aggcactttt cccttgtctc agttcctttg actgccagcc atgcagtcta 2881 aagggttcac tgataacagg ctgcgagcac aaaggggaac gtttggtcac cctattcgta 2941 tgaagctgaa atgggaagca ttgggtagaa gagtctgcat aggcccgtgc ttggagtctt 3001 tgtatttggg gaagtctctg cccaggctga gggggctgtc tcagtgatga aaactttgtc 3061 cagttctgct actgacagta agtgaagata aagtgtgtct gaggagacag ctggcttcat 3121 gcttgccccc agggtacctg aacccacaga gattcttaag cgggtggaga 
ggtttgggta 3181 gggccacctt gtcgttgtgc taaggatcac ctactctctt gcagcgtgca tcgcttcaag 3241 gccgggacag tgtggccgag cagatggcta tgtgctagag ggcaaagagt tggagttcta 3301 tcttaggaaa atcaaggccc gcaaaggcaa ataaatcctt gttttgtctt cacccatgta 3361 ataaaggtgt ttattgtttt gttcccacat ttatgttgcc tgaatatatg actgttttct 3421 ctgctttatt tccttgccct gcaaaactga tctgggtggg tggctgcaac cccttgcctt 3481 aacctctgcc tcctactgtc ctgagccagg ctcaccacac tgtaaagtcc // LOCUS HSRSGCG 12810 bp DNA PRI 06-DEC-1997 DEFINITION Homo sapiens gene encoding retina-specific guanylyl cyclase. ACCESSION AJ222657 NID g2695889 KEYWORDS GUC2D gene; retGC gene; retina-specific guanylyl cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12810) AUTHORS Perrault,I., Rozet,J.M., Calvas,P., Gerber,S., Camuzat,A., Dollfus,H., Chatelin,S., Souied,E., Ghazi,I., Leowski,C., Bonnemaison,M., Le Paslier,D., Frezal,J., Dufier,J.L., Pittler,S., Munnich,A. and Kaplan,J. TITLE Retinal-specific guanylate cyclase gene mutations in Leber's congenital amaurosis JOURNAL Nature Genet. 14 (4), 461-464 (1996) MEDLINE 97099458 REFERENCE 2 (bases 1 to 12810) AUTHORS Perrault,I. 
TITLE Direct Submission JOURNAL Submitted (03-DEC-1997) Perrault I., INSERM U393, Hopital des Enfants Malades, 149 rue de Sevres, Paris, 75015, FRANCE FEATURES Location/Qualifiers source 1..12810 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="p13.1" exon 1..64 /gene="retGC" /number=1 gene 1..12810 /gene="retGC" mRNA join(1..64,369..1098,1182..1486,3693..4044,4390..4474, 4755..4857,5260..5361,6833..6913,8101..8307,8407..8563, 9061..9210,9837..9985,10557..10720,10815..11007, 11284..11458,11699..11797,11883..11977,12161..12246, 12400..12511,12599..12810) /gene="retGC" intron 65..368 /gene="retGC" /number=1 exon 369..1098 /gene="retGC" /number=2 CDS join(378..1098,1182..1486,3693..4044,4390..4474, 4755..4857,5260..5361,6833..6913,8101..8307,8407..8563, 9061..9210,9837..9985,10557..10720,10815..11007, 11284..11458,11699..11797,11883..11977,12161..12246, 12400..12487) /gene="retGC" /function="converts GTP to cGMP in the retina" /codon_start=1 /product="guanylyl cyclase" /db_xref="PID:e1203975" /db_xref="PID:g2695890" /translation="MTACARRAGGLPDPGLCGPAWWAPSLPRLPRALPRLPLLLLLLL LQPPALSAVFTVGVLGPWACDPIFSRARPDLAARLAAARLNRDPGLAGGPRFEVALLP EPCRTPGSLGAVSSALARVSGLVGPVNPAACRPAELLAEEAGIALVPWGCPWTQAEGT TAPAVTPAADALYALLRAFGWARVALVTAPQDLWVEAGRSLSTALRARGLPVASVTSM EPLDLSGAREALRKVRDGPRVTAVIMVMHSVLLGGEEQRYLLEAAEELGLTDGSLVFL PFDTIHYALSPGPEALAALANSSQLRRAHDAVLTLTRHCPSEGSVLDSLRRAQERREL PSDLNLQQVSPLFGTIYDAVFLLARGVAEARAAAGGRWVSGAAVARHIRDAQVPGFCG DLGGDEEPPFVLLDTDAAGDRLFATYMLDPARGSFLSAGTRMHFPRGGSAPGPDPSCW FDPNNICGGGLEPGLVFLGFLLVVGMGLAGAFLAHYVRHRLLHMQMVSGPNKIILTVD DITFLHPHGGTSRKVAQGSRSSLGARSMSDIRSGPSQHLDSPNIGVYEGDRVWLKKFP GDQHIAIRPATKTAFSKLQELRHENVALYLGLFLARGAEGPAALWEGNLAVVSEHCTR GSLQDLLAQREIKLDWMFKSSLLLDLIKGIRYLHHRGVAHGRLKSRNCIVDGRFVLKI TDHGHGRLLEAQKVLPEPPRAEDQLWTAPELLRDPALERRGTLAGDVFSLAIIMQEVV CRSAPYAMLELTPEEVVQRVRSPPPLCRPLVSMDQAPVECILLMKQCWAEQPELRPSM DHTFDLFKNINKGRKTNIIDSMLRMLEQYSSNLEDLIRERTEELELEKQKTDRLLTQM LPPSVAEALKTGTPVEPEYFEQVTLYFSDIVGFTTISAMSEPIEVVDLLNDLYTLFDA 
IIGSHDVYKVETIGDAYMVASGLPQRNGQRHAAEIANMSLDILSAVGTFRMRHMPEVP VRIRIGLHSGPCVAGVVGLTMPRYCLFGDTVNTASRMESTGLPYRIHVNLSTVGILRA LDSGYQVELRGRTELKGKGAEDTFWLVGRRGFNKPIPKPPDLQPGSSNHGISLQEIPP ERRRKLEKARPGQFS" intron 1099..1181 /gene="retGC" /number=2 exon 1182..1486 /gene="retGC" /number=3 intron 1487..3692 /gene="retGC" /number=3 exon 3693..4044 /gene="retGC" /number=4 intron 4045..4389 /gene="retGC" /number=4 exon 4390..4474 /gene="retGC" /number=5 intron 4475..4754 /gene="retGC" /number=5 exon 4755..4857 /gene="retGC" /number=6 intron 4858..5259 /gene="retGC" /number=6 exon 5260..5361 /gene="retGC" /number=7 intron 5362..6832 /gene="retGC" /number=7 exon 6833..6913 /gene="retGC" /number=8 intron 6914..8100 /gene="retGC" /number=8 exon 8101..8307 /gene="retGC" /number=9 intron 8308..8406 /gene="retGC" /number=9 exon 8407..8563 /gene="retGC" /number=10 intron 8564..9060 /gene="retGC" /number=10 exon 9061..9210 /gene="retGC" /number=11 intron 9211..9836 /gene="retGC" /number=11 exon 9837..9985 /gene="retGC" /number=12 intron 9986..10556 /gene="retGC" /number=12 exon 10557..10720 /gene="retGC" /number=13 intron 10721..10814 /gene="retGC" /number=13 exon 10815..11007 /gene="retGC" /number=14 intron 11008..11283 /gene="retGC" /number=14 exon 11284..11458 /gene="retGC" /number=15 intron 11459..11698 /gene="retGC" /number=15 exon 11699..11797 /gene="retGC" /number=16 intron 11798..11882 /gene="retGC" /number=16 exon 11883..11977 /gene="retGC" /number=17 intron 11978..12160 /gene="retGC" /number=17 exon 12161..12246 /gene="retGC" /number=18 intron 12247..12399 /gene="retGC" /number=18 exon 12400..12511 /gene="retGC" /number=19 intron 12512..12598 /gene="retGC" /number=19 exon 12599..12810 /gene="retGC" /number=20 BASE COUNT 2712 a 3673 c 3655 g 2756 t 14 others ORIGIN 1 cccacagggg gaccggccct gtgacccctc accggggccg tgggcccgag ccccggactt 61 ccctgtaagt gtcagaggcc cctccgctgg gatagggtcg gtctgagggc gcaggcgagt 121 ccctgctgac ccctgacgcc tccgacgggg ggaggggcag gccgggtggg agcgggaagc 181 cggggcggca 
gaagggggct tcggggcggt gtccttggcc ccagttagtc ttcccagcct 241 ccggaggggg cggtagcagc agaatcatcc catgggttac tcgggcttgg agaaactcgg 301 ggttacgggg agaaccctag gggaggccgg ggtctcagtc gctcagcctg ctccgtctgt 361 gttcgcagaa gccggcaatg accgcctgcg cccgccgagc gggtgggctt ccggaccccg 421 ggctctgcgg tcccgcgtgg tgggctccgt ccctgccccg cctcccccgg gccctgcccc 481 ggctcccgct cctgctgctc ctgcttctgc tgcagccccc cgccctctcc gccgtgttca 541 cggtgggggt cctgggcccc tgggcttgcg accccatctt ctctcgggct cgcccggacc 601 tggccgcccg cctggccgcc gcccgcctga accgcgaccc cggcctggca ggcggtcccc 661 gcttcgaggt agcgctgctg cccgagcctt gccggacgcc gggctcgctg ggggccgtgt 721 cctccgcgct ggcccgcgtg tcgggcctcg tgggtccggt gaaccctgcg gcctgccggc 781 cagccgagct gctcgccgaa gaagccggga tcgcgctggt gccctggggc tgcccctgga 841 cgcaggcgga gggcaccacg gcccctgccg tgacccccgc cgcggatgcc ctctacgccc 901 tgcttcgcgc attcggctgg gcgcgcgtgg ccctggtcac cgccccccag gacctgtggg 961 tggaggcggg acgctcactg tccacggcac tcagggcccg gggcctgcct gtcgcctccg 1021 tgacttccat ggagcccttg gacctgtctg gagcccggga ggccctgagg aaggttcggg 1081 acgggcccag ggtcacaggt aggctccctt gcagggtgcg aggaggtcgg ctggtcctgc 1141 cggcagccgg acggcgccgc gagccaagcc tctgtccgca gcagtgatca tggtgatgca 1201 ctcggtgctg ctgggtggcg aggagcagcg ctacctcctg gaggccgcag aggagctggg 1261 cctgaccgat ggctccctgg tcttcctgcc cttcgacacg atccactacg ccttgtcccc 1321 aggcccggag gccttggccg cactcgccaa cagctcccag cttcgcaggg cccacgatgc 1381 cgtgctcacc ctcacgcgcc actgtccctc tgaaggcagc gtgctggaca gcctgcgcag 1441 ggctcaagag cgccgcgagc tgccctctga cctcaatctg cagcaggtag acggtcccgg 1501 gaggagggaa gaaggcaagg gagaggggag aggacagcca aagcagggag aggaggatgc 1561 agccaatgga gaaagaacca ctggcagctc tagctgtttg gagtttcctg ggtaatgtag 1621 aggctctggc cccacttggg gtctcttagt gtacgaggat tgcctctaga gagtaggcaa 1681 ttcgacctct caagtccaac atcaagcagt aaactctaat gaactgtaaa ctgcaaaacc 1741 ttttgcaatt ggcttcaaag taaacccgtt tccaccacca catactccca gctgtctaaa 1801 ataaatactt ggtgttcttg aaagcaggaa ggagggagga tgccaatcca accttcatcc 1861 ctagttcaga ggtatttcct ctggggcctc 
atctggtggt cttagctttt caagggccta 1921 gacaccttgc aagaagcaga aagttccctg ggtggattca ggggcaatcc taaccagatc 1981 catagtgtat ttccaacatc cagctaatac ctagggatcc ccaagtcccc agaatcccag 2041 ttcagatcat ctaactttac tcccgccaca aagggggact agaaaccact atgagaaggg 2101 aatttagatg tcaacgggca gtgaattaac caccacaaag gtccaactca caggcggctg 2161 catgagagaa gggtcgtagg ggttgcaatt tcccatcgat gcccctagat tcttccaaga 2221 ggattggagg tgtccaatgg aggggggagg ggtgcagtga tgggaatagt cagaagagaa 2281 tacaagatag gaatcataat gagatgggat ggagggaggg tcggtcagtg gagagaggga 2341 ggttgagaaa gtagagagag aaaagccctt cagctctcag taagtaccac agcagtcctc 2401 tccactgccc aaactgcgca agcctctgag cgtgtgagtc acccttgatt ccttgaattc 2461 ttcgccccca cttccaggca tcaagccctg ttactgttac ttcctaaaga tcattcgaat 2521 tcatccgctt tctccatctt tactgtcact gccctgattc aggccacagc acccccagcc 2581 caaatcccaa aacataatct ctcagatggt ctctgcttca acactcacct gcccaacacc 2641 tttctccaga tcaagcttca gaaatctgac agatctcacc ctgtcatgcc cctgttcaaa 2701 gcctcacagt ggcttctgtt gccctggata gatgcaggcc tctccacatg gcttaccagg 2761 tcacctgtct ttctctctct tccacctccc atgaacttca gccagaccaa gctactctgg 2821 gttcttggaa tgtgtccttc tcttttgcct ctgagcttct gagtgtgctg tttcctctct 2881 cttgtccctg cctcccctgg ataattctga ctcatcccct gaaggatggt cacttctaac 2941 agctccttct ctgtccacag cctagggtgg ggacacccat tcctgtctct gcagctccct 3001 cagtttcccc tacttttact cattgcactg cctgaacacc cctgtctctt tggcaagaag 3061 aaaggttcct tgagacgggg tctgtcttgt acactgatac atctttagtg ctctgcacag 3121 tgcctggctc tctgagagat ggcaactggg aataatagag aagcctgata agatcgggtg 3181 gtaacaatag agaaggtttt ttgtttgtct tttgttcagt tcttttgttt gtttgtttgt 3241 ttttttaagg aacaacagaa aatagaagcc aaccaaaaga taggtgagaa ggcttttagg 3301 tgaatggtag gtgatgaggt taaaactata tgctgggcaa ccaacaggaa gaagagagga 3361 cctacaggtg atggagaatt agaatcaacc aatggtcaga gaggaggctg agtcctgaaa 3421 agaggaacca gttagtgggg agagcaggtg tgacagccaa taggagaggg aggaagagag 3481 acccagtggg gagggaggaa gagagagcca atggggaggg aggaagagag ccaatgggga 3541 gggaggaaga gatagccaat ggggagggat cctgggaaca 
actaactgga aggtggcaac 3601 agtggatacc ctgggcttga caggcagtga aagaatctgg tctgtctgtg ggctgtgacc 3661 ccgacctctg agcccctact ctccttctcc aggtctcccc actctttggc accatctatg 3721 acgcggtctt cttgctggca aggggcgtgg cagaagcgcg ggctgccgca ggtggcagat 3781 gggtgtccgg agcagctgtg gcccgccaca tccgggatgc gcaggtccct ggcttctgcg 3841 gggacctagg aggagacgag gagcccccat tcgtgctgct agacacggac gcggcgggag 3901 accggctttt tgccacatac atgctggatc ctgcccgggg ctccttcctc tccgccggta 3961 cccggatgca cttcccgcgt gggggatcag cacccggacc tgacccctcg tgctggttcg 4021 atccaaacaa catctgcggt ggaggtgagg gcgagcaccc cagtccccac tgagacaatc 4081 gccatggacc atccacaaag tgatgaaaga gagtggactc ctatcctgta atccgtcttc 4141 gatgcccttc taggccctct cccagcctct ggcttgcaca ggacccctct cttgctgagc 4201 tccaaagccc tcttggaaat tttctatcat tcccagcctc tcccctttga ggaactatga 4261 cctaccccta gagcctctct gggcccccat ccccctttcc tggaggggcc agcatgtggc 4321 atgcctccct agagaagagg cctcccctgg catcgctcct cagtatacct cctgtcactg 4381 tcccttcagg actggagccg ggcctcgtct ttcttggctt cctcctggtg gttgggatgg 4441 ggctggctgg ggccttcctg gcccattatg tgaggtgagt agtggaatga ggtaagtagg 4501 aagtgagctt gtgccagaaa tggaagtctg cagcaaaaac agcaggctgg gttcactcag 4561 gagtagagga ggagattttt cttggggtga gggtgttctg gtgggctggt agagtcccag 4621 gggatgtgtg ctttggggat ggctgccctc caggacccct cccctgagcc aacgttatcc 4681 ctcccccgca tcccctgctg gtctcttctg acggaacttg gtgcccttgg tggaggtgac 4741 tctttctcca ccaggcaccg gctacttcac atgcaaatgg tctccggccc caacaagatc 4801 atcctgaccg tggacgacat cacctttctc cacccacatg ggggcacctc tcgaaaggtg 4861 ggggaggcag agaggcagga gccagttgtc ttctttccgt aaatttggtt ccttccctgg 4921 gccagtcccg accccagctc cctacttggg aagcctgatt tctaccccag ttctgtccca 4981 cgtctgaagt ctagggatca gcaccctcat ctgtgcaatg agggtaacag tacctgtcct 5041 gactactttc ctggcgggga ggtgaagctc caacaagaaa atgaatgttt gctttgtaaa 5101 ctgtaaagcg tgcacttggg gagtaatgtc attatcacct tccctcattg agattccttc 5161 gcctcccatc tttaaatccc caaaactcag cctgacctca acccaggact ctgacaccag 5221 aatatatttt gacctcttgc attgacctct acctgctagg tggcccaggg 
gagtcgatca 5281 agtctgggtg cccgcagcat gtcagacatt cgcagcggcc ccagccaaca cttggacagc 5341 cccaacattg gtgtctatga ggtgagcctg accccagcca gacagagaga cagtggggga 5401 agaatgctca ggccctgggc agaggggagg cgcactctca ggaggacatg tagtcatgtc 5461 aaatcctgca ggctccagga gatgggggat gggagcccag acaggatcta gggaaaggtc 5521 atggatccct caaagggaag cacctaggaa tcttccctcc ccaaggattt ttctctagcc 5581 actatctggg taagggctcc tcaaaagcta accacagtga cccagagact ggaggctgga 5641 gctgccaggg agcatgctgg gaccagcagg taacatggtg tggccactga gaccatctcc 5701 ggcctggtca gagcctctgg ccctgcccct cagcattcca cccaattcag accagactca 5761 ggaatagcaa cccctagagg gggaaaagaa agtcacaaaa aagtattagg acattaactt 5821 tggggctggt aaatctgaat gcggtcaagt ggaggggaga tgatgagaac agattgagag 5881 agagctgaga gtgctgagca aatcaagaga gtagaaattt agagctcctg ggcttcaggt 5941 catgggaagg aatgcagagg ctcctatcat gtcacctaga gatcatcatg gccaattctc 6001 agaggaggca ctattgtcat ttggggcaga ttgtcctaaa ttgtatggga ctactctgtg 6061 ccttgaagaa tatttagtgt ttcttgtctc tggccctgtc cactaatgca gctcctagtt 6121 tttgtgacag ttaaaacttt ccccacattt atttccaaat gctccttagg ggtcagtact 6181 aaaccccaat tcccttctaa attcccttct agtccaaatc ccttataaac aagacagggc 6241 cacctgcata attagcgaga tccagggcaa gatagaaatg aggggctcct cgttcattca 6301 tgatgatggc aggaggtcat tgaaccaagc ttgggccctt cggagcacga ggccctgtgt 6361 gactgcacag gtcccatgcc tgtgaaccgg ccctgccaac agttgaggat gaggtccagc 6421 atcaggaggg gagctgtcta tagttcagtt agttttcaat aaagcctgat ctacccagat 6481 tttttactcc cagaacatgt tctctcatgg cagtggtggt aatgagggtt gtgatgtcga 6541 tgacacttgt ggttgtatta atgcacttaa taggactatt ggtgggcaca gctgggttgg 6601 tgaggacagc catccatggc ttcagaatct ctgtcatcca ggggtgggcc ctctcccaga 6661 tggctgtgaa gtggatgggc atacatcaaa ccccttggtt caatgcattt tgtcacatgg 6721 aagatgcatt ctgggacagt gagccaatgg aaatgagggg gaggggttct agggctcccc 6781 atcgtgggat tttaagagac tgagttccct acccccatcc tctttgctgc agggagacag 6841 ggtttggctg aagaaattcc caggggatca gcacatagct atccgcccag caaccaagac 6901 ggccttctcc aaggtgagac ttgggcctgt gatggggcct aggtggccat cggtttctcc 
6961 ctccttgcct tctctttgca gctgggtaca gaggtaggtc tcagttaagt gtcccctctg 7021 gctgagtgtg gtggctcata cgtgtaatcc tggcactttg agaggcagat gtgagggagg 7081 atcacctgag gccaggagtt caagaccagc ctgggcaaca taaagaaacc ctatctctcc 7141 aaaaatttta aaattagctg ggtgtggcgt tacatgcctg tagtcccagc tactcttcag 7201 agggtgagga gggaggatta tttgagcctg ggagttggag gttaccgtgg gctgtgatca 7261 tactactgca ctccagcctg ggtgacagag caagtacctg tcaaaattaa gaaagaaaaa 7321 aaaaaacnnn nnnnaaaaaa aaaatacata aacaaaagaa aagaaagaaa gtggcaccgc 7381 tcctccctgg ctctcccagc aaaggctgtt aaatcaggag aggatggtgg ttccaggagc 7441 ttctccttca tgagctgctt cccttaagac ccaggcagtt cagagtttta tggctggggc 7501 caggtgtggt ggctcaggcc tataatccca gcactttggg aggctgaggt gggaggatca 7561 tttgaagcca ggagttcaag accagcctgg gcaacatagc aaggctccca tctctacaga 7621 aaaaaaattt taaatgttat ctaggcatgg tggcacaagc ctgtagtccc agctattcag 7681 gaggctgagg taggaggatc acttgaacca aggaatgaaa ggtagcagtg atttatgatc 7741 atgccgctgc actccagtct gggcaacaga gtgagacctc gtgtctaaaa aaaaaggctt 7801 atgatcaata aatccgaacc tgcctgggtc ctgatcacca atgcaaagtt gtcttttcat 7861 tcacagcatt aggctaaacc atactcagta tccaaacagt ggcctttgac tatattgttt 7921 tttccaaaaa taggactatg tgtagaagag agcccccgta catactttat caaccatttc 7981 atccaccatt tgtaaaaatc tcatcttctg ggtctggata ctcaaaaaca gatcttgatt 8041 aacagcccct tccccacatt gccctgggca gaaaatgcaa gtcaactctc cccctctcag 8101 ctccaggagc tccggcatga gaacgtggcc ctctacctgg ggcttttcct ggctcgggga 8161 gcagaaggcc ctgcggccct ctgggagggc aacctggctg tggtctcaga gcactgcacg 8221 cggggctctc ttcaggacct cctcgctcag agagaaataa agctggactg gatgttcaag 8281 tcctccctcc tgctggacct tatcaaggtg tgtgtctggg ggtggtgggg tgacgtcctg 8341 ggggcaggga tggggagcaa gggaaccaag caggctgagg ctgcctctta ccctacccat 8401 tccaagggaa taaggtatct gcaccatcga ggcgtggctc atgggcggct gaagtcacgg 8461 aactgcatag tggatggcag attcgtactc aagatcactg accacggcca cgggagactg 8521 ctggaagcac agaaggtgct accggagcct cccagagcgg agggtaagag tcccctgtgc 8581 agacgaggat ccaccgggat ccccattatt tcaagggctt ctcccccgct tcctccctac 8641 
ctctgccctc gcactctctt catcccacat ccaccagaca caattcctgc tacagaaaag 8701 atcttgtggc ctctgagagg gtgggctctg tgacttcgga gacggggctg ctgggggcgg 8761 gacttgtacc tgagctgcct gcagcagggt ttgctctgat tacaagtttg cctgagggtg 8821 gggcttgtgc ccaagagacg gagccttccc tgggcaccac cttttctgaa gggcaaggcc 8881 tatttgccag gctttctctg agatggctcc tagagatagt tgcagggctg gtctcaggtt 8941 gcagggtctc agaccggtct cagggctgca gggttggtgg tgtctgggtg ccaacctggg 9001 ctttctggtg agggtgggag tctttcccca gcggcgcctc agccccttcc ccatccccag 9061 accagctgtg gacagccccg gagctgctta gggacccagc cctggagcgc cggggaacgc 9121 tggccggcga cgtctttagc ttggccatca tcatgcaaga agtagtgtgc cgcagtgccc 9181 cttatgccat gctggagctc actcccgagg gtaaggctgc cctgtgcgtg gagttcggcc 9241 cacagggcac cctgcagtta gaaaagagcc agcctcactc tttcctctaa agcaaagccc 9301 agtgatgaaa ctcaattata cggaggcccc cttaaagctg gcatctgcag gtctgggtgc 9361 agaaagccgt gcatggccag ggtggggagc gtggttcatt aggtcccaga ccacaacagc 9421 ttcctctttc ttgatgctgg aaccaaactg tttccacaac tgacagaaca gactcctctc 9481 tgttctcagg ggtccctggg aggagcaggg gagggggagt gggtgcatcc cttctgcaca 9541 ggactctgag caaactactt gatcacctat ccctcacttg tcttacatac aatatgttag 9601 tttctttgcc tatgtcacct cttactgacc cccagagttc gaggtcctct tgttcctcct 9661 agcaaccccc ttccacacta tactctccct ccacacacac acactgaacc tctgatgtaa 9721 agaaacccct gccaggcacc ccctcccaca tcttggtctt caacagtcag gccagggtca 9781 gaggcagcct ttgtgttctg ggggcactcc ccctcactgt cccctcatgc ctccagaagt 9841 ggtgcagagg gtgcggagcc cccctccact gtgtcggccc ttggtgtcca tggaccaggc 9901 acctgtcgag tgtatcctcc tgatgaagca gtgctgggca gagcagccgg aacttcggcc 9961 ctccatggac cacaccttcg acctggtcag gggctgggag tgggcaagga ctgggctggc 10021 ctctgggatc ccaaatgctt gtcagcaacc tgagacagct gcagacaggc aggctggcag 10081 gacctctggc cttccaggct acctcctaag gatagcctga agactcggag tttgggggca 10141 gaattggaat gggggctgtg gaggcttttg gagtgggaga tagagttctg tctgggtggg 10201 aggaatattc aattcaattc aaataacact gattgagaac caagtatgtg cttggcctgc 10261 tgtgacagaa agacccttgg cctgggagcc caacgattgg gctggctcca gtgccctgtc 10321 
aattactagc tgagaacaac tgacctctgg gaaccctcat ttcccacgtg cctcctaatc 10381 gtgtctgaaa acacagtgcc cagcaccccg gggtgcttga tgaatagtag atgaatggtg 10441 gcagcggggt tggggttcag agtgaacagc cccatgagag ggcccatgag gggggcataa 10501 agagggcatg gcaacccagg tcttcagcag ctttaccagc ttccttctac tgctagttca 10561 agaacatcaa caagggccgg aagacgaaca tcattgactc gatgcttcgg atgctggagc 10621 agtactctag taacctggag gatctgatcc gggagcgcac ggaggagctg gagctggaaa 10681 agcagaagac agaccggctg cttacacaga tgctgcctcc gtgggtgcca gtgggaaggg 10741 gtgggctggg agggcagctg gagcccagcc aggtagagtg gcccccaggt gacctcactg 10801 cctgccatcc ctaggtctgt ggctgaggcc ttgaagacgg ggacaccagt ggagcccgag 10861 tactttgagc aagtgacact gtactttagt gacattgtgg gcttcaccac catctctgcc 10921 atgagtgagc ccattgaggt tgtggacctg ctcaacgatc tctacacact ctttgatgcc 10981 atcattggtt cccacgatgt ctacaaggtg cagtgtgtag gggacaagcc ctcctgacct 11041 tcaattcagc ttcaccagcc tccagcccag cccttcctgc gcagccccta gcctacctgc 11101 ccaatcaatc ttctttccca gacctcccgt cccttattta ttcctccagt ccccagctca 11161 gtccttccac tagcaacctg gttctgcact aaccccaggt gggcccggtg acaagaggca 11221 atcgcttcgt gtactcgggg ggaatgctca aaagaaaatt cacacaactc cttcttcccc 11281 caggtggaga caatagggga cgcctatatg gtggcctcgg ggctgcccca gcggaatggg 11341 cagcgacacg cggcagagat cgccaacatg tcactggaca tcctcagtgc cgtgggcact 11401 ttccgcatgc gccatatgcc tgaggttccc gtgcgcatcc gcataggcct gcactcgggt 11461 aactcccggg tcttcccagg ctccagccca tctccctctt tagggcctgg ccccagattt 11521 cctgtagagg aggcaactca tggagcgggg agtggggctt accttgaaga ggatgcactt 11581 aacaaggctt atttgggggg ctggtggaga taatgggtgc gaagatcccc cgaggcccta 11641 cctaggtgca gcccagggcc ggccctgcta gccccgccga cccccagcat ctccacaggt 11701 ccatgcgtgg caggcgtggt gggcctcacc atgccgcggt actgcctgtt tggggacacg 11761 gtcaacaccg cctcgcgcat ggagtccacc gggctgcgtg agtgtgacgg ggacaagacg 11821 gggaggtggg agggggacac gggaggtgag tcccgagctc acggcgtccc ccaccgccac 11881 agcttaccgc atccacgtga acttgagcac tgtggggatt ctccgtgctc tggactcggg 11941 ctaccaggtg gagctgcgag gccgcacgga gctgaaggtg aggcagggcc 
ccaaccccac 12001 ccggaggccc cgccctgtcc tgaggcaccg cccatcccgg gccgcggctg caaacctcag 12061 ctcacccttc tgacctggtc tcccagttcc acgcagactc gagatccccc acccctcacc 12121 ccagtccgcc taagtccttc cctctcccat gtctccccag ggcaagggcg ccgaggacac 12181 tttctggcta gtgggcagac gcggcttcaa caagcccatc cccaaaccgc ctgacctgca 12241 accggggtga ggggccggcc tccggcggca gggcgaggga cgagggaccc ctgcctcctg 12301 ctctgtgtct gaccccccgc gcgcgaggca gcgatgacgt gggccctgcc ctcccacgcc 12361 ccattcccct tccctgaggc caccgccccc tccttgcagg tccagcaacc acggcatcag 12421 cctgcaggag atcccacccg agcggcgacg gaagctggag aaggcgcggc cgggccagtt 12481 ctcttgagaa gtgaggcccg gccccggaca ggtactgccc cctcagcccc aaccccagct 12541 gccgcgtccc ctctgctgcc tgcagaacgt cccacccaag ccaggtgtcc cnnnnnnngg 12601 tctgggccct gctccctgtc ccatctgcag tggaccccag gcacccccct ttgaggaggt 12661 ggggtgaact gctccttggc agggatttgt gacactgcat tgctgggctg tgttcctcgg 12721 gctcttctgg accttgcacc gtggatacca ggccatgtgc catggtattt gggtcctggg 12781 agggtgggtg aaataaaggc atactgtctt // LOCUS HSS100A2 8670 bp DNA PRI 01-MAY-1997 DEFINITION H.sapiens S100A2 gene, exon 1, 2 and 3. ACCESSION Y07755 NID g2065174 KEYWORDS S100A2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8670) AUTHORS Wicki,R., Schafer,B.W. and Heizmann,C.W. TITLE Transcriptional analysis of the human S100A2 gene in normal and tumorigenic breast epithelial cell lines JOURNAL Unpublished REFERENCE 2 (bases 1 to 8670) AUTHORS Wicki,R. TITLE Direct Submission JOURNAL Submitted (09-AUG-1996) R. 
Wicki, Department Of Pediatrics, Division Of Clinical Chemistry, Steinwiesstrasse 75, Ch-8032 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..8670 /organism="Homo sapiens" /isolate="lambda 1R" /db_xref="taxon:9606" /chromosome="1" /map="q21" TATA_signal 4614..4619 exon 4657..4683 /note="untranslated" /number=1 intron 4684..6307 /number=1 exon 6308..6461 /number=2 gene 6321..8576 /gene="S100A2" CDS join(6321..6461,8424..8576) /gene="S100A2" /codon_start=1 /db_xref="PID:e307988" /db_xref="PID:g2065175" /translation="MCSSLEQALAVLVTTFHKYSCQEGDKFKLSKGEMKELLHKELPS FVGEKVDEEGLKKLMGSLDENSDQQVDFQEYAVFLALITVMCNDFFQGCPDRP" intron 6462..8423 /gene="S100A2" /number=2 exon 8424..>8670 /number=3 polyA_signal 8648..8653 BASE COUNT 1808 a 2218 c 2452 g 2192 t ORIGIN 1 gagctcaaga gttcaagacc cgtctgggca agatggcaaa actccatcac cacaaaagat 61 gcaaaaagat gcgcacagtg gcgcacacct atagccccag ttactgagga ggttaatgtg 121 ggaggatcac atgaggctgc agtgagctgt gatggtgcca ctgtactcca gccttggcga 181 cagtgagtct atgtctcaaa taagtaagta aacaaaaatt aaaaagaatc cagtccacag 241 ggcatttgaa ggcaagagga aaagatgcca gaatcagaga tggggagaag atgggcttca 301 cgcacctgct gaggttgaga aatgagacag ataggctgag tgtggggtgg agagaggatg 361 ggcagagaga ctgaggctgg tctgaatgga aatgaaatgt tagggctctc agggttatcg 421 gggaataatt ggagcttcta ggaaaggttt aacgttgtga ccacctgtgt gcgtcatgcc 481 tccccacccc ttactaattg tgtgaatttg gcagactttg agtctcagtg ttctcctctg 541 tgaagtgggg tcatcttatt ccaactcctg ggattgttgt gtgaattaaa tggggtaatg 601 tacggagagc acctgacgca cagcgagtgc ttcaaaattt cagtctgcac cccccagcaa 661 aggatatgca cacgcccatt gtgagtgaca aatccaggat gacctgaacc caatgtgata 721 acgtgggtcc tcgcatgctg gtcatgctgc cgggagacac ttatggatcc aattagtaca 781 acaggggaaa taaattattt aatgcatttt gctaagacag aatacctcag aacttatttt 841 gtggggtggg gcataataaa gggggtcctt ctgctgaaaa cgtttaagct caggttcgtg 901 gcaccactca accaaggtcg acagtcacac agtaagccag aggcaatgtc aggacttaaa 961 ctaaacctgt ggcccccaca atgaggccat ttctctttcc cctgaacggc ctggggaaag 1021 ggggtgggtg ggcagaactt ggcagtggcc 
aatccctcac ttctgtcccc tggttttctc 1081 ctgcccttat ctctaggctt gcattgattg attgattgag acagggtctt gctctgtcgt 1141 ccaggctgga gtgcagtggc acgatcatgg ctcactgcag cctcaaactc ctaggctcaa 1201 gtggtctttc cgcctcctat ctcccgagta cccatatccc taggctttta aaatggcttc 1261 caggtatctg gctgccgtct cagacatcca cctgggcttc tgggcaggga ctgtccggga 1321 aacctcatct atgtgaagca ggtgtgggtg taggaaggcc gcttggaaat gaatcagcac 1381 tgtctcctgt ttgagtcgta agcagggcgc cagagggtct ggcggacaag aaagggagga 1441 tgacaggagg ccggcactgc aatgacacgc cttagccacc agagggcacg aagcagctgg 1501 gcaaaatccc gcggggcccc tggtggaaaa tttctggcac ctggagcccg gagatggggt 1561 ggacggaatg tgaggaccca gcttcctgag gctgggccgg ggcagagtca ctgctttgga 1621 tgtccgcagg gcctgcttgt gtcttgacta ctctgccttt gtagacagct ggagaatgtg 1681 agagtgggat tgggatcgga ctctagggcc attccgtaca actctcctgc cctgccgtgg 1741 gggagggagt tgcccaaggt tacgcagcaa gttagtggca aatgaatacg attatcacca 1801 gtctcaggta tatggccatt tgatgggcgc agtcgcagcc tcagttcctg agacagagac 1861 acctgattaa ggacaggcct tcaggagctg accctagtga cccgcggctc tgctgctgtc 1921 tctgtttttc tccctggctt ttccatctga ctgactcttt gtcttcttcg tctgcctgcc 1981 tgtctccgtc tctgcccgct ggggggtttg ctcaactccc tcactgggtc ctgggagccg 2041 cagtttcctg ctgtcactcc tcagggattt gtagctctct gaagctcttt tccgacccgt 2101 tgtctcggtt ccactcttgg gatccagagg agaggtgatt atttcgtagc atagtcagtg 2161 gtgtgatttc acggggtgag aaggactccc ttgctcctaa gcactcctcc agtgacccct 2221 gttgccatgt ggtagccgta agcactggtt ggcacctggt gtgggcgaga cccttacctc 2281 atgcagaaat gagtaagact ggtgagctca ctatgtgggg tgaggctgag agaaaacaag 2341 tacacaggtg attcagtcaa aatcagaatt ctctaagtac acacgaaaag ggcaaaaggg 2401 gcgctttgta caggacagaa caggtagaca ctgaatccgg ttgggccctg ggaaggctcc 2461 ctgcagtggc ctttgaaggg ggggttggat ttcagcagga tagagggcat gggcatgtgt 2521 gggcacgttc tgaacagagg ggtcagcgca agccgagggt cttggccaca ctagttgcat 2581 gtgccggtgt gtttaaggga cacgcagcag caggccgagt ctggagcgcc tcactgccag 2641 gctttttaaa aatttttaat tttaatttaa ttttatttta tttttacttt aagttctggc 2701 atacatgtgc agaatgtggt ttgttacata ggtatacatg 
tgccatggtg gtttgctgca 2761 cctatcaacc catcatctag gttttaagcc ccgcatgcat caggtattag tcctaatgct 2821 ctccctcccc ttgcccccat ccttctcccc gcaactgccc acaggccctg gtatgtggtg 2881 ttcccttccc tgtgtccata tgttctcatt gttcaactcc cacttatgag tgagaacata 2941 ccgcctggct ttaagggaca gccatgggga tgcactgcag tttctgagca gggaaggccc 3001 tgtggaggcc cttagttaaa aggaaagaat ggctgtgaaa atcgatgcat tgcgctccct 3061 tgtccctcac cctcagtgtg aagggttttt attccgagtt ctacttgaag taggcctcga 3121 tgggaagaca agtagcatga ggggttcaag tactgagggg agcaagggac actcggtggc 3181 tgtgccaagg tgtagaagag gacactgggg gccccaagac ctgacttcat gtacactgct 3241 caggctggcc cccaagtcac acggtgaccg ctaggaaggg accagcctgt tctcagtctg 3301 atcctacagc catgtcatta tccaaagctc ctcctggcag ggcctgtttg gggtctctgt 3361 gccagtgctt tccctgccag gctgggctgg ggcttccacc tactgctctg ggactgctgc 3421 tgccctggcc ctgggggagg agggtgtgcc gctgagtcac tgcctgggca tctgggcctg 3481 gaacctcggg tgagtcactt agggctgagg tagaggggct gggggagggg aagaagctac 3541 tcgacagctg gagcagggag gggagctggg gccacaggaa gggcggtgcc ctgatgccca 3601 gacgggccgg gatagacaaa gggccaagga ggaaggggcc ctgggagggg gcagccctcc 3661 cttgggctgg ggtctgaatg gcacagtgtt tgcctttctc cgggtctggg gaggacatgt 3721 gtgtgggggg cagtgagaga gggctgtggc tgagggctgt gcttcaggcc tggattctgg 3781 cttgggaagc tgtccagctg gtgttttcag ccttgggtag ggatgtaccc ctacccaccc 3841 acccagccct caagctggag aagaggaggc caaagttttc ctgttcagcc tttaactact 3901 cgggacttcc ttatgctccc cacagactgt ggcccagccc aactgcggct gtgtgtagag 3961 caaccccatt tctcactgct tccccatcct tccagacacc ttcctacaca gagggacctt 4021 cccaggtatt tctaagcaca cttagttacc tcattacctc attaagaggt attctggtgc 4081 tggccattaa aagtcactcc acttcatcca tgccctgaag tcagtcctgt ccttctcctc 4141 ctgatgtccc ccagctgcct cctctggccc ccagcttcct aaggtggccc caggttgctt 4201 ctctctcaca cacacgggcg catgtatgta cacgagcact ggaccatgaa gtctcagcgt 4261 gtgctcacag cctctcacac aggagtgggc tgtgactcac aggcatgtca tgagaatgag 4321 gcctggcacc agtctccagg ccccagagca ggggttgcct cccctcaccc cggtccagga 4381 tgcccagtcc ccacgacacc tcccacttcc cactgtggcc tgggtgggct 
caggggctgc 4441 ccttgacctg gcctagagcc ctcccccagc tggtggtgga gctggcactc tctgggaggg 4501 agggggctgg gagggaatga gtgggaatgg caagaggcca gggtttggtg ggatcaggtt 4561 gaggcaggtt tggtttcctt aaaatgccaa gttgggggcc agtggggccc acatataaat 4621 cctcaccctg ggagcctggc tgccttgctc tccttcctgg gtctgtctct gccacctggt 4681 ctggtgagta cctctgtcct gctgagggca gggtggggag gatccccgtg ggtctctgtc 4741 tttgtctcca cagttctctc attccagctt ccctggtggg atcaacctgg gcctctctgg 4801 gccttccccc ttggaagaac tctctgtgaa gtgctgaagt gttgactgaa gggttttttt 4861 tttttttttt tttttttgag atggagtctc gctctgtcgc ccaggctgga gtacagtggt 4921 gtgatctcag ctcactgcaa actccccctc ccaggttcac gccatttccc tgcctcagcc 4981 tcccgagtag ctgggactgc aggcgcccac caccatgccc ggctaatttt tttgtatttt 5041 tagtagagat ggggtttcac catgttagcc aggatggtct cgatctcctg atctcgtgat 5101 ccacccatct cggcctccca aagtgctggg attacaggag taagccaccg cgcccggccg 5161 actgaagggt ttttctccag gttcctctgt gaggtctcag tgcaggggtt gctctgaggc 5221 cctcccctgg atatctcagt ctaggggccc ttctttgggg gtctaggcct aggagcagga 5281 ggtgtgcatg tgggcgttgc tgcaaaaaga atcctgagat tttttttttt tttttttttt 5341 ttgcaaagtc ctggattcta gcaggactaa ggtgcaagag gcaggggtct caagactctg 5401 cctgggtcat ggccccaagc agcaaagctc tgccccctgc ctcggtgaag gcagggctgg 5461 catgatgggc ccagggcatg ccctgcctct ggcatagctc ctctggcctc accctgaaac 5521 ctgcctaacc tttccaggct ggtctgagta ttctcagagg ccttgccgct gaggtctgtc 5581 ccatcctgat cccaaggcaa tgaacatttc atatctttaa ttctaattcc aacaggatcc 5641 ttcctggtgg agagaatgtt aagttgcccc caccctatcc atgcccctgt ctgcctagag 5701 gctcaggggc cttcagggtg aggggagaca cattccccac cctctgggag ctcctagtct 5761 gagagaggaa acactcctgc ccaagggagc ttccagttag atggcagaga gagatgcctc 5821 tggcttcagg agtcccgagt ctaaggaggg aaacgactcc ttcagggagc ttcctgctcc 5881 taggctgtag ccatggctcc tgccagactg cacaggagcc cccatctgcc agccggtgca 5941 tgtggccctg ctccccagag cctgcgcaga tgccatcaaa atgggactct ggtcaccctg 6001 tcatttccct tctggcagac actaaaatgg ggagccctgc cctcaggggg gtgtcccaag 6061 tgccatcaga ggaggcttgg tgactcccag acacaaggga agctttagcg tctgccctca 
6121 gggtgagatg gaggtatccc tccggcctca gggaaccaca gtctgagggg agatgcagcc 6181 cctgccttcc cattcagaga ggggttttgt gaggtggctt gggggcatag ggcagaagtg 6241 gatcctacag gctgagctaa ggccccaaga gcctcagcag tgtacccatc acctggcacc 6301 tctgcagcca cagatccatg atgtgcagtt ctctggagca ggcgctggct gtgctggtca 6361 ctaccttcca caagtactcc tgccaagagg gcgacaagtt caagctgagt aagggggaaa 6421 tgaaggaact tctgcacaag gagctgccca gctttgtggg ggtgagtggc acaggcctgt 6481 gggggaggtc ctggtgtgag tgtgggggtg caggttaaat ctctccccca gttccgggtg 6541 cctgtcgatg caggtgccag ggtggggccc agcccctccc cactttagct tcatggctcc 6601 actggagtgg aaatgaggcc cgagtgggag tgcttaatta atggctgttt cctgcaacat 6661 tccagagaac catgtgctgt gagggccttc cgagtccatc tgtttaatcc tgtcattgga 6721 acttgagaaa ccagagccca gaagggaaaa gtgattgtcc caagatcaca cagcactggc 6781 acgttctctc tctctctttt cttttctttt tttttttttg agatggagtt tccctcttgt 6841 tgcccaggct ggagtgcaat ggcacgatct cggctcactg caacctctgc ctccaggggt 6901 caagcaattc tcctgtctca gcctcctgag tagctgggac tacaggcgca tcccactacg 6961 cccagctaat ttttgtattt ttagtagaga cagggtttca ccatattggc caggctggtc 7021 tcgaactcct gacctcgtga tctacctgcc tcggcttccc aaagtgattt ttgtattttt 7081 agtagagacg gggtttcatc atattggtca ggctggtctc gaactcctga cctcaggtga 7141 tctgccctcc tcggcctctg aaagtgctgg gcttacaggc gtgagcaccg tgcccggact 7201 cctttttttt tttttttttt ttgtggtggg gggacaagat ctcactctgt cacccaggct 7261 ggatcatagc tcactgtaat ctcgaactcc tgggctcaag caatcctccc aagtagttgg 7321 aactacagga gtattgtcac catgcctggc caatttttat tttttgtaga gatggagtct 7381 tgctatgttg tccaggctgg gcttgaactc ctgggttcaa gcaatcctcc cacctcggcc 7441 tcccaaagta ttggaattac agatgtgagc cactgtgctt gacctctttc catttttata 7501 tgccaaacta agaaagtatg ttagggatag aaaagccctg ctcagatata tagtctggga 7561 cattttgtgg agaaatgcat cgaccttcaa tttgtccctc accctcccta tactgactca 7621 ttggtgattc ccaaagttag gtgtcaggct ttgaacacat gaggcaggtc cttctttcct 7681 tggtttaatt ttgtttttgt ggctggttaa atttttctaa ttatttcggc tagtattaaa 7741 aaagtgtttt tcagctgggt gcagtggcct atgcctgtaa tccccacagt gtgggaggct 7801 
aaggcaggag gatctcttaa gcccaggagt tcgaccagcc tgggcaacat agcaagactc 7861 catctctaca aaaataaaaa taaaaattgg ccaggcatgg tggcatacgc ttgtagtccc 7921 agctacttgg gaggctaaag gtgggaggat tgctggagcc caggaggttg aggctgcagt 7981 gagttgtgat tgtgccactg cactccaacc tgggctaaca gagcaagacc ttgtcttaaa 8041 aaataaaaag tgttcttttc tgaatctacc tggctggtgt tggggagcag caacttcggt 8101 ttcctcatca gcagaatggg gtgatgatac ctacctcgct gggctcctgt gggattcgag 8161 ctgatgcatg ctcagaggag catccagtgt cctccctgtg tccaggagga gggcacactg 8221 gagatgctca ccaatgagta tctgtctctc tccttactca ctgggccctc ttggtagctc 8281 ccagagcctc ctgcccacct tatacccagc tgcccagtgg ggagggagag ctggaaccaa 8341 cctgaatgtg tgagggtctg ggtgtttggt ggagctgggg ttggggctgg cttggtgatg 8401 agtgtatttc ctgtcacttt caggagaaag tggatgagga ggggctgaag aagctgatgg 8461 gcagcctgga tgagaacagt gaccagcagg tggacttcca ggagtatgct gttttcctgg 8521 cactcatcac tgtcatgtgc aatgacttct tccagggctg cccagaccga ccctgaagca 8581 gaactcttga cttcctgcca tggatctctt gggcccagga ctgttgatgc ctttgagttt 8641 tgtattcaat aaactttttt tgtctgttga // LOCUS HSSAA1A 6943 bp DNA PRI 14-JAN-1991 DEFINITION Human serum amyloid A (GSAA1) gene, complete cds. ACCESSION X13895 NID g36305 KEYWORDS serum amyloid A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6943) AUTHORS Sack,G.H. Jr. and Talbot,C.C. Jr. TITLE The human serum amyloid A (SAA)-encoding gene GSAA1: nucleotide sequence and possible autocrine-collagenase-inducer function JOURNAL Gene 84 (2), 509-515 (1989) MEDLINE 90128298 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.H.Sack, 03-JAN-1990. 
FEATURES Location/Qualifiers source 1..6943 /organism="Homo sapiens" /db_xref="taxon:9606" misc_signal 1797..1804 /note="TATA box" prim_transcript 1812..>6485 /note="GSAA1 mRNA and introns" CDS join(1834..1844,3175..3269,5908..6046,6320..6458) /note="serum amyloid A" /codon_start=1 /db_xref="PID:g36306" /db_xref="SWISS-PROT:P22614" /translation="MFPSRMKLSTGIIFCSLVLGVSSQGWLTFLKAAGQGAKDMWRAY SDMKEANYKKSDKYFHARGNYDAVQRGPGGVWATEVISDARENVQRLTGDHAEDSLAG QATNKWGQSGKDPNHFRPAGLPEKY" intron 1845..3174 /note="GSAA1 intron A" intron 3270..5907 /note="GSAA1 intron B" intron 6047..6319 /note="GSAA1 intron C" misc_signal 6568..6574 /note="pot. poly-A signal" misc_signal 6580..6585 /note="pot. poly-A signal" BASE COUNT 1979 a 1491 c 1483 g 1989 t 1 others ORIGIN 1 gaattcctga cctcaggtga tccaccgcct tggcctcccg aagtgctggg attacaggcg 61 tgagcccccg cacccagccc tgttttctta attactatat tttgcaagaa tcaatattat 121 tatctttttt tttttttttt gagatggagt ctcactctgt cacccacgct ggagtgcagt 181 ggcacaatca cggctcactg caacctccac ctcctggatt caagcaattc tcctgcttca 241 gcctcccaag tagctcggac tacaagcggt gcgcgacact ccagtgctaa ttttgctttt 301 tgtattttta cgtagagatg gtggtttcac tttttgcaaa gctgctcttg aactcctgac 361 ctcgtgatcc acccacctgg gccttcaaaa ttgtttggat tacaggcgtg agccactgcg 421 cccggccaat gttattatct tcaaagaaaa attaggaatg tctttgttct ccaaatattg 481 ggatatttgc acactcccaa gtctgggtct gtttagtaaa cattacgaat ttgttccctt 541 aactgtaaac atctagaggt taggaatgcc taactttctg agcacgcagc ctagcaatct 601 cagcctcatt ttcctagccc tcatgccaaa tagagtcgct ctggttcaaa cgtctctgac 661 actatgtgaa aataaacctt ggtccttata actgtagttc taattggtaa tcttacttac 721 taatgtctta ataataggtt gccttggtaa caatattgta attaatattt ctgtctcctg 781 gaaaatataa attattatgc gttttcttgg ttaaggtgga agctctaaaa ttaattatag 841 cttacagtcg tgccagaatt accagtcctg tcagcattaa tcattaatct acacacctac 901 aggagttgta tttttctgca tccttcctat atctactaca cctggtaact aggattcatt 961 acatgtgaat tttttgaggg tagtgatttt gtatcattat tttcttcatt ttatatttag 1021 ggaatataaa gttattcccg ctggtcttct aagactgggg 
cttaggtctg agatgtcatt 1081 cttagtcatg atacctgtgg acttttgggc ttttctgatt tttgtttttc taattcctaa 1141 tagaattaag ccttagtatt ctcctcttct acttaaatta attgtaactt aaaagaatgg 1201 aggtgtcttg ggcctctcaa tcagcctggt cacacaaggt ttcagagtag tgctaccatc 1261 tgatttactg ttcaatccag gccattgtag acaatgaaaa gatgtgccac tattaagctt 1321 ggacaataga cactgatctc atgcaaacca gcacacgtag tcacttgtaa gagactgttg 1381 gaaaatccct ttaaggcctg gggactctga tatcatgaca ggaaatcttt ctttttcagg 1441 agaaacaagt gaaaagaact gactgccgtt attgtctatt attctcagct tggcaccata 1501 agaaactgcg tgtccttccc ggaaggcaga atacattaag taacaaagga agaaaacagg 1561 caaggcttgt caaagtccct gtgaggagtt cctcttggta tctagaaatc atactcgctt 1621 tcttgctctt tttcctatga gtcagtgtgt cactttgtca cccaggctag agtgcagtgg 1681 tgtgatcatg gctcaccata gcctccaact cctgggctca agcagtcctc ctgcctcagc 1741 ctcctgagta gctgggacca caggtgcata ctaccacacc ccgctaattt aaaaaatata 1801 tatacagata tatagtggag atggggtctt gctatgtttc ccaggttggt cttgagttcc 1861 tggcctcgag tgatccttct gcctcagcct cccaaagtgc tggtattata agcatgagcc 1921 agtgtctcca cgccacttgc tttctaatac ttctgtatta gtccgttctc atactgccat 1981 aaagaaatga ctgaaactgg gtactttata aagaaaagag gttatattgg ttcatggttc 2041 tttagtctgt acaggaagca tagtggcttc tgcttctggg gaagcctcag gaaactttca 2101 atatggcaaa ggcagagggg gagtgaggca tctcacatgg tgggagcaag gacaaaagag 2161 agtgaggggg gaggtactgc acacttttaa ataaccagat ctcaaaagaa ttcactcact 2221 atcacgacga cagtaccaag gggggtgctg ttaaaccttt catgagaaac agccttcctg 2281 attcaatcac ctccctctag gctccactcc caacattggg aattacaatt gaacatgaaa 2341 tttgggtggg gggtttcaga tccaaaccat atcaacttct atcaaagaag agaatgacat 2401 tgaataaaac gagaaagtag atttccagca ttgaatggtt ccaaagggag taaaaataat 2461 tgtgacagga tgtgatcatt ggccctcaat ttctgcctga agaagagtag gaaaaaggag 2521 gtctggacat cacatgccct gctttatgaa gcagctcctc caagtacacc atgaccagtg 2581 gctgtggatg gttgacaact cccctcctct tccccctctt ctactgtcta ctcctgggac 2641 caagtgagcc acaccagctc agatactaca ctgaccacag ggaatcccac cttttccaag 2701 gaatggaagt tgtgtaggga atattcaaat gttgcttagc attgccttag 
ataagaacca 2761 aagggacagg gaaatcctct gacagctatc tgccttataa ctttcatttt actgtgccta 2821 aaatatgctc agaacccaga aagaggcata attcctaatt ttggcaggct ctaatctaaa 2881 ataatgattc tcaaacatgg tgtgactttt gtctatttgc tttatcctgg gtcactgctc 2941 ctcttctgtc agatactggg attccaatga gacaaatgga aatggagacg tagaccctct 3001 gaccttctat cttttatcta tacacataca cctgtgtgtg tgtgtgtgtg tgtgtgtgtg 3061 tgtaaaaccg agtgggtttt tttcttggaa tgaaagaatg gactaacatt acaaaaaata 3121 aaaacttgaa acagaatgtg tattatcctt ggttgtgttt ccttggccct gcagcaggat 3181 gaagctctcc actggcatca ttttctgctc cctggtcctg ggtgtcagca gccaaggatg 3241 gttaacattc ctcaaggcag ctggccaagg tgaggtccac aggatagggg gcaggaggct 3301 gcttctggct gcccccagga tgcagctgag cagaggccac atccccactg ggcaaaggtg 3361 ctagtgatgc cacagatgga tagagaaggg gcatggtttt tactaagcgt ggttcctcat 3421 gcttttctgg acagctttga cactcttcta tgaggatcct ccagccgagg tcgcataagg 3481 tgtgagctgc ctcttttcag caggaccatg agagagatgt ggagttgagg ggtgcatgtt 3541 cccataatac cggtggggct ctactgcccc ctagtgggaa atctgggaca gttcatgtct 3601 atgtctcctg ggaagccagg aagcaggtgg atcaaaagtg tgaggcgagt ccatggggaa 3661 gctgaacgga gccaaccgtc cccataaaaa caaccaagct tagctgagat tttaatacgt 3721 actaggcact gtttaaatgt actaatgaat tggtttccat catttagtcc tatgatgcaa 3781 gcagcattat cccttaacag agaagctaac acacacacac acacacacac acacacacac 3841 actaacacac acacacacac acacacacac acaaacccca agatacgtaa agaagttcca 3901 aagcagagca ggattaaccc aggcagtctt gctctgcaga acttgctctt aatcaaggta 3961 ctctgctgct ttcaaaacaa gagtttcgga tttgtgaaca catagctcat cctttatcta 4021 agaaatggca aataggatgt ggtgcctttg gaaggtaagt ctagctccac ttatccccag 4081 taaaacctac agtgaattac cttgatggtg gttctactgg ggcttatata tggccaggaa 4141 actgcatgca agagaaatat accccgaggg ctgggcagat cacctgaggt caagagttcg 4201 agaccagcct ggccaacatg gcgaaatcct gtctctacta aaaatacaga aattagccgg 4261 gtcgtcgtgg catgcgccta taatcccagc ctctcgggag gctgagggag aagaattgct 4321 tgaactcagg aggcagaggt tgcagtgagc tgtgatcaca ccactgcact ccagcctagg 4381 agacagagca agactccatc tagagagaca gagagagaga gagagggaga aatatacccc 
4441 actagccata ataaagtggc aaaattttgt tttcagaatg cagtatttta aatttcaggt 4501 attattattt ttctgagtct ctgaaaaatg gttttaagga tttgctttta atcctattta 4561 catgttcaca cactcaacta caaatatctt tcattcctta ggttaatatt tttcaaaggg 4621 ttgttctggg accacttgcg tgagaatcac ctggattctg ggatgcttgt gaatgaatga 4681 agatccggga catgccatgc cctgcccagc agcacagtct cttgggacag agcctagaaa 4741 tcttgccttt gctaagtacc tcggtagatt tttatgcaca gcaaaggttg agaaccacta 4801 cctcttgttt tgctgctgaa agtgataaaa tgtgccagga attttggaag tacttattaa 4861 gccaatctga acatcaagga gccatttaag tcagtaactc agaggaataa gtagagtaaa 4921 aatgtcataa actctcaata aaagcaatca atttaacacc aggagtaata aatgcataaa 4981 atgaagatga gttatctaat agagaaatta tataagccat gattataact ctatatttga 5041 gttccccctt ttccgtaatc agttaatttt ctaaaaaatc ttcgtcacct taattcttag 5101 cttgatcaag atccattcag tccgtaactc cctgctcctc atcttagttt agcccttctt 5161 ttttcttatg ccacctttcc taaggaccag agaagtgaaa tgataatata ttggccacct 5221 acaatgttct agacatcata catgtatttt ctctgctctt ctgcataatc actgtgaggc 5281 aggcaatact cctccatttc attggggagg acattgaggt tctgaactag tgggtcagtt 5341 gtcctttttc tgaatttgat tacccagtag tataaagctt tcttagataa ctcaccttta 5401 tcacttgctg actgaattct gacagatgtc agtttcctaa ttatagcctg gacatttaag 5461 atgtattcag gaccaagttg ttctattact acaggcatga atttttattg actaggttag 5521 gaccccatat gtctgcagct ccctcagaat cccctgtgtt ctcacaccag ggaactgagg 5581 gtttcctggg tccttccagg tagaagttca ttgtacaatg aaacatccct taaggaccat 5641 ttcatctctt ctttaggtgc atcacacatg gttaaaacaa agtaataaca gaacttagaa 5701 tggaatcaaa cagaatgaaa cttacaccaa gtacaattct cattacatta acacagagaa 5761 gtgaaaagta gaagaatatt tatttcaagc caatataatt tccaagggct ttgttgaagg 5821 ctgaatcttc gggaggaaag tagtgagaga aactgttcat tcctctattt cccagtatat 5881 aattgtttga tcatttcttc cttccagggg caaaagacat gtggagagcc tactctgaca 5941 tgaaagaagc caattacaaa aaatcagaca aatacttcca tgctcggggg aactatgatg 6001 ctgtacaaag gggccctggg ggtgtctggg ctacagaagt gatcaggtaa tgcacattcc 6061 tgatgttgcc aggaatgcgt gagcagagct tggactgcct tggacagtca ggagagaggt 6121 
aagctccttg cagagaagtt agagctgcag cctaggtgcc agcctgctga gtggagtgcc 6181 aggagggtca ttgtttcacc cctccttcct tggccttcct gggcttctcc cagagtcctc 6241 ccttggaaag catataatgg gaaggtgggc tgttgctcac tggcctggtg attaatctcc 6301 ttgcttgcct ggactacagc gatgccagag agaacgtcca gagactcaca ggagaccatg 6361 cagaggattc gctggctggc caggctacca acaaatgggg ccagagtggc aaagacccca 6421 atcacttccg acctgctggc ctgccagaga aatactgagc ttcttttcaa tctgctctga 6481 ggagacctgc tgtgaccctg agggcaggga catttgttga cctacagtta cttgaattct 6541 atatccctag tacttgatat agaacacata aaaatgctta ataaatgctt gtgaaatcca 6601 gtttgttatt ggaatctgga agcagaatat gacagtcttc ctgggatcat gggcctgttt 6661 agtaccatag ggatgaccaa taaacatcac tgttttattt tttaaaacat aaagcactaa 6721 tgcacaatag tgggaattgg ggagaaaact atatctatac atggaccaca ttgtatggac 6781 ataatatgga ccaactataa gctataatta tatggttata aatagaaaga taataagata 6841 atctatgaca cttagtaatg agctaaataa ataaatacat atagaataat aatacgctag 6901 ngatggaatt agatagctta tattaggtat tagaagcagt tga // LOCUS HSSGK 5718 bp DNA PRI 27-OCT-1997 DEFINITION Homo sapiens sgk gene. ACCESSION AJ000512 NID g2463200 KEYWORDS serine/threonine protein kinase; sgk gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5718) AUTHORS Waldegger,S. TITLE Direct Submission JOURNAL Submitted (22-JUL-1997) Waldegger S., Physiology I, University of Tuebingen, Gmelinstr. 5, Tuebingen, D-72076, GERMANY REFERENCE 2 (bases 1 to 5718) AUTHORS Waldegger,S., Erdel,M., Nagl,U.O., Barth,P., Raber,G., Steuer,S., Utermann,G., Paulmichl,M. and Lang,F. 
TITLE Genomic organization and chromosomal localization of the human SGK serine/threonine protein kinase gene JOURNAL Unpublished FEATURES Location/Qualifiers source 1..5718 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /sex="Male" /cell_line="Bristol8" /map="q23" exon 36..155 /gene="sgk" /number=1 mRNA join(36..155,303..378,806..881,1317..1421,1526..1609, 1725..1856,2106..2218,2560..2683,3141..3236,3652..3807, 3915..4004,4349..5526) /gene="sgk" /product="serine/threonine protein kinase" gene 36..5526 /gene="sgk" CDS join(80..155,303..378,806..881,1317..1421,1526..1609, 1725..1856,2106..2218,2560..2683,3141..3236,3652..3807, 3915..4004,4349..4516) /gene="sgk" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:e1154172" /db_xref="PID:g2463201" /translation="MTVKTEAAKGTLTYSRMRGMVAILIAFMKQRRMGLNDFIQKIAN NSYACKHPEVQSILKISQPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLK VIGKGSFGKVLLARHKAEEVFYAVKVLQKKAILKKKEEKHIMSERNVLLKNVKHPFLV GLHFSFQTADKLYFVLDYINGGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNI VYRDLKPENILLDSQGHIVLTDFGLCKENIEHNSTTSTFCGTPEYLAPEVLHKQPYDR TVDWWCLGAVLYEMLYGLPPFYSRNTAEMYDNILNKPLQLKPNITNSARHLLEGLLQK DRTKRLGAKDDFMEIKSHVFFSLINWDDLINKKITPPFNPNVSGPNELRHFDPEFTEE PVPNSIGKSPDSVLVTASVKEAAEAFLGFSYAPPTDSFL" intron 156..302 /gene="sgk" /number=1 exon 303..378 /gene="sgk" /number=2 intron 379..805 /gene="sgk" /number=2 exon 806..881 /gene="sgk" /number=3 intron 882..1316 /gene="sgk" /number=3 exon 1317..1421 /gene="sgk" /number=4 intron 1422..1525 /gene="sgk" /number=4 exon 1526..1609 /gene="sgk" /number=5 intron 1610..1724 /gene="sgk" /number=5 exon 1725..1856 /gene="sgk" /number=6 intron 1857..2105 /gene="sgk" /number=6 exon 2106..2218 /gene="sgk" /number=7 intron 2219..2559 /gene="sgk" /number=7 exon 2560..2683 /gene="sgk" /number=8 intron 2684..3140 /gene="sgk" /number=8 exon 3141..3236 /gene="sgk" /number=9 intron 3237..3651 /gene="sgk" /number=9 exon 3652..3807 /gene="sgk" /number=10 intron 3808..3914 /gene="sgk" /number=10 exon 3915..4004 
/gene="sgk" /number=11 intron 4005..4348 /gene="sgk" /number=11 exon 4349..5526 /gene="sgk" /number=12 BASE COUNT 1518 a 1145 c 1219 g 1836 t ORIGIN 1 ggccgagcgc gcggcctggc gcacgatacg ccgagccggt ctttgagcgc taacgtcttt 61 ctgtctcccc gcggtggtga tgacggtgaa aactgaggct gctaagggca ccctcactta 121 ctccaggatg aggggcatgg tggcaattct catcggtgag tgcaggaatc ttgcgggact 181 tctgctccag gagacgcaaa gtggaaattt tttgaaagtc ccggatcaga ttagtgtgtg 241 tggcgccggg acgttatgaa gccgtctaaa cgtttcttta tttctcctcc ttctatccac 301 agctttcatg aagcagagga ggatgggtct gaacgacttt attcagaaga ttgccaataa 361 ctcctatgca tgcaaacagt aagttcagac cggattgagg aaataactag tatagtttga 421 atttgccagc ggtaaacatt ctcatcacgg cgtttatcgg gaaggcgaag acttcttctg 481 gggtggggat ctcatttctc cttaaattct aatatatttg acacatttta aacattaaag 541 ttaatttgct gatttggctt gaactggaga tgtaagataa atggttcgtg ttggccgaat 601 tcacgctttc tccatgagca acaatcctta tttctgtatt taatggggtt tattattttc 661 tttaactgac taatgtattg gggtattttc agtttaaaca gtgaattatc gggtagaagt 721 cggtagagcc agaaactcac ttttgatgtt ggtgtgcccc ctagtggcga gctggattct 781 aaatcgtgcc ctttattccc tgcagccctg aagttcagtc catcttgaag atctcccaac 841 ctcaggagcc tgagcttatg aatgccaacc cttctcctcc agtaagtttt tgtatgtgcc 901 gtgcatctgt ggagaactgt aagggagtca gttagtattc ctacattaat ggattaaaat 961 agcatttcta gaaattagta tcaaggcagg aatgcttcat tatgcataac agtgatataa 1021 atatttaagt attgagtcag agtattattt ttattttttt cctgggcata ttttacctca 1081 agtggttatt ttaaaaggca tatttcataa aaaggtttta tctgtctgaa acaacatgac 1141 tgtgtgcagt ttccatactc atttgaaatg tgatgaaatg tagttttgaa tgtttataga 1201 tgtatggtca tttgcatcag tcatttgtag atgtaacatt ttctacatcg tttatgttat 1261 agatgtcttc ctttgaagca atggtattaa aagaaattct tttttttttt ttctagccaa 1321 gtccttctca gcaaatcaac cttggcccgt cgtccaatcc tcatgctaaa ccatctgact 1381 ttcacttctt gaaagtgatc ggaaagggca gttttggaaa ggtaatttca aatctgaaga 1441 tcttttggta cacttccttc atgtcctctt ttatattctc cctggatgag gatcgaaaaa 1501 tgattttttt aaattgaaat ttcaggttct tctagcaaga cacaaggcag aagaagtgtt 1561 ctatgcagtc 
aaagttttac agaagaaagc aatcctgaaa aagaaagagg tgagatgtgc 1621 ttgatggggc tggcattggc ggtagacact ccttgaataa tcttgattct ggaatgttgg 1681 tgccagttga acatgccact aaatctgaat cgtcattttc ctaggagaag catattatgt 1741 cggagcggaa tgttctgttg aagaatgtga agcacccttt cctggtgggc cttcacttct 1801 ctttccagac tgctgacaaa ttgtactttg tcctagacta cattaatggt ggagaggtga 1861 gcagggggga tagaagtcaa ctcttagtgt ctctgcacag cctgctttgt tttagtttga 1921 gaaaaaagtt ttcaaagatt tttggtgggg agaatgttac cagaattagc atttccttca 1981 acctgtcagg ttatagttaa tagattactt ggggccactt cctgcagttg ttcttttgct 2041 gtgtatgtca aaactaatta aattacattg cgcaacccag aatgactttg ttctgtctcc 2101 tgcagttgtt ctaccatctc cagagggaac gctgcttcct ggaaccacgg gctcgtttct 2161 atgctgctga aatagccagt gccttgggct acctgcattc actgaacatc gtttataggt 2221 aagcctgaga gctcttcagg ctaccagttt tggtataaag gagacgtagc actggctgtt 2281 tcatagggcc ttaaaataat ttgtgtttat ttgcaacttg gttcgctaaa accagatccc 2341 ctagcacgtg agctggcttg acttaagtgc caagggggaa cagccaagta ggattgtgcc 2401 taatccagaa tagatgagca gaacaagggc tccttttttc ttcactacac aactacagtg 2461 aacctaaatg cctctaatac cttagcaatt atctttaaga ggatatctta tgaagtgaaa 2521 ttaacttgtg caactacttt tctattcact tttttacaga gacttaaaac cagagaatat 2581 tttgctagat tcacagggac acattgtcct tactgatttc ggactctgca aggagaacat 2641 tgaacacaac agcacaacat ccaccttctg tggcacgccg gaggtaggcg ctgtcttggt 2701 ttggtgcctg gtttaccccc gccttccaag agagagatgt acaatcatgc acttaactac 2761 caaaaagagt aaactcctct cagagacttc ttaatacagt tcagtgcaaa taaaatacat 2821 ttgctgtttg atgtagcatg agaaatccca agtccttctg ttcctttact gaaaagtagc 2881 tgtttgtaag taagatctgc atcataaaaa ctttctaatc ctaagtaaga gatatcaagt 2941 gccagcagtt tcctaaatgt cagtacacat aggtagccag tcaccctcaa aaagtccagc 3001 agttttatca ggaaggaatc taaagatatc tatcttccaa gctggctctg ggtctctcag 3061 ctttttcaaa ctaaatgtgt ggtcgtggga ttgcttgctt tcgcaggttc taaacgctgt 3121 ttccctggtc tgtttttcag tatctcgcac ctgaggtgct tcataagcag ccttatgaca 3181 ggactgtgga ctggtggtgc ctgggagctg tcttgtatga gatgctgtat ggcctggtga 3241 gtggcacatt gggaaccact 
ggaacactgc ctgctcccta caatattgcc ttcacacagc 3301 aaaagcagct aagaggcata ttggttattt tatagttcat aagaataatc acttacctgg 3361 ttcttttgtg catttcacat tttactagat aggaccacat tgaacctgtg tggtggtgaa 3421 aaactaccac ttattaacat ctacccccta ccctccacac acacacacac aaacacacac 3481 acgggttgca aagtagacac ttaaatagca agggaaaaga aagcattgag gtggggagag 3541 tttctcaaat cgagcctaat atttattgcc gtttatatct ttttctctac tggtaatgtg 3601 tgccatatga aacttccaat taagtctaaa gtaattttcc ccttctttca gccgcctttt 3661 tatagccgaa acacagctga aatgtacgac aacattctga acaagcctct ccagctgaaa 3721 ccaaatatta caaattccgc aagacacctc ctggagggcc tcctgcagaa ggacaggaca 3781 aagcggctcg gggccaagga tgacttcgtg agtgatgttt tcctgtcctc ctgggccggc 3841 cgggacgtgc actagacctc cctgccctta ttgaatgcac ctgtctaaat taatcttggg 3901 tttcttatca acagatggag attaagagtc atgtcttctt ctccttaatt aactgggatg 3961 atctcattaa taagaagatt actccccctt ttaacccaaa tgtggtgagt atctgtctct 4021 cttctaagta tagagaagcc aagcgattta ttttaattca gaattgtctg ggggagggtt 4081 ggaaggaata cattggcaga tgttttctcc ataaacctgt tattttacct acatagacac 4141 atttatcaat tcgaagcacc aaaaggcaac aagtgaacat tattcttatg tttaactgtg 4201 tgtagccttt tgagattttg tgcttgaagt gggtgattat ggaagttgat ataagactta 4261 aacttggtat ttaaagcctg gtcaagattt ccctgtcctg tgtctagtgt gagttcttga 4321 caagagtgtt tttcccttcc cgtcacagag tgggcccaac gagctacggc actttgaccc 4381 cgagtttacc gaagagcctg tccccaactc cattggcaag tcccctgaca gcgtcctcgt 4441 cacagccagc gtcaaggaag ctgccgaggc tttcctaggc ttttcctatg cgcctcccac 4501 ggactctttc ctctgaaccc tgttagggct tggttttaaa ggattttatg tgtgtttccg 4561 aatgttttag ttagcctttt ggtggagccg ccagctgaca ggacatctta caagagaatt 4621 tgcacatctc tggaagctta gcaatcttat tgcacactgt tcgctggaat tttttgaaga 4681 gcacattctc ctcagtgagc tcatgaggtt ttcattttta ttcttccttc caacgtggtg 4741 ctatctctga aacgagcgtt agagtgccgc cttagacgga ggcaggagtt tcgttagaaa 4801 gcggacctgt tctaaaaaag gtctcctgca gatctgtctg ggctgtgatg acgaatatta 4861 tgaaatgtgc cttttctgaa gagattgtgt tagctccaaa gcttttccta tcgcagtgtt 4921 tcagttcttt attttccctt gtggatatgc 
tgtgtgaacc gtcgtgtgag tgtggtatgc 4981 ctgatcacag atggattttg ttataagcat caatgtgaca cttgcaggac actacaacgt 5041 gggacattgt ttgtttcttc catatttgga agataaattt atgtgtagac ttttttgtaa 5101 gatacggtta ataactaaaa tttattgaaa tggtcttgca atgactcgta ttcagatgcc 5161 taaagaaagc attgctgcta caaatatttc tatttttaga aagggttttt atggaccaat 5221 gccccagttg tcagtcagag ccgttggtgt ttttcattgt ttaaaatgtc acctgtaaaa 5281 tgggcattat ttatgttttt ttttttgcat tcctgataat tgtatgtatt gtataaagaa 5341 cgtctgtaca ttgggttata acactagtat atttaaactt acaggcttat ttgtaatgta 5401 aaccaccatt ttaatgtact gtaattaaca tggttataat acgtacaatc cttccctcat 5461 cccatcacac aacttttttt gtgtgtgata aactgatttt ggtttgcaat aaaaccttga 5521 aaaatattta catatattgt gtcatgtgtt attttgtata ttttggttaa gggggtaatc 5581 atgggttagt ttaaaattga aaaccatgaa aatcctgctg taatttcctg cttagtggtt 5641 tgctccaaca gcagtggttt ctgactccag ggagtatagg atggcttaag ccaccacgtc 5701 caggccttta gcagcatt // LOCUS HSSHBG 3810 bp DNA PRI 24-APR-1993 DEFINITION Human gene for sex hormone-binding globulin (SHBG). ACCESSION X16349 NID g36442 KEYWORDS androgen binding protein; plasma protein; sex hormone-binding globulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3810) AUTHORS Gershagen,S. TITLE Direct Submission JOURNAL Submitted (25-AUG-1989) Gershagen S., Department of Clinical Chemistry, University of Lund, Malmo General Hospital, S-214 01 Malmo, Sweden REFERENCE 2 (bases 1 to 3810) AUTHORS Gershagen,S., Lundwall,A. and Fernlund,P. TITLE Characterization of the human sex hormone binding globulin (SHBG) gene and demonstration of two transcripts in both liver and testis JOURNAL Nucleic Acids Res. 17 (22), 9245-9258 (1989) MEDLINE 90067924 COMMENT see X05403,X05792 and X05885 for sex hormone-binding globulin human mRNA's; see also X16350 for alternative exon 1 for sex hormone-binding related protein. 
FEATURES Location/Qualifiers source 1..3810 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="SHBGq23" /tissue_type="leucocytes" /clone_lib="cosmid lambda-loric" CDS join(361..471,604..695,868..1057,1389..1550,1778..1937, 2068..2204,2940..3147,3392..3540) /codon_start=1 /product="sex hormone-binding globulin (SHBG)" /db_xref="PID:g296673" /db_xref="SWISS-PROT:P04278" /translation="MESRGPLATSRLLLLLLLLLLRHTRQGWALRPVLPTQSAHDPPA VHLSNGPGQEPIAVMTFDLTKITKTSSSFEVRTWDPEGVIFYGDTNPKDDWFMLGLRD GRPEIQLHNHWAQLTVGAGPRLDDGRWHQVEVKMEGDSVLLEVDGEEVLRLRQVSGPL TSKRHPIMRIALGGLLFPASNLRLPLVPALDGCLRRDSWLDKQAEISASAPTSLRSCD VESNPGIFLPPGTQAEFNLRDIPQPHAEPWAFSLDLGLKQAAGSGHLLALGTPENPSW LSLHLQDQKVVLSSGSGPGLDLPLVLGLPLQLKLSMSRVVLSQGSKMKALALPPLGLA PLLNLWAKPQGRLFLGALPGEDSSTSFCLNGLWAQGQRLDVDQALNRSHEIWTHSCPQ SPGNGTDASH" exon <361..471 /number=1 sig_peptide 361..447 mat_peptide join(448..471,604..695,868..1057,1389..1550,1778..1937, 2068..2204,2940..3147,3392..3537) /product="sex hormone-binding globulin (SHBG)" intron 472..603 /number=1 exon 604..695 /number=2 intron 696..867 /number=2 exon 868..1057 /number=3 intron 1058..1388 /number=3 exon 1389..1550 /number=4 intron 1551..1777 /number=4 exon 1778..1937 /number=5 intron 1938..2067 /number=5 exon 2068..2204 /number=6 intron 2205..2939 /number=6 misc_feature 2344..2651 /note="pot.Alu repeat" exon 2940..3147 /number=7 intron 3148..3391 /number=7 exon 3392..>3540 /number=8 BASE COUNT 878 a 1029 c 1021 g 882 t ORIGIN 1 tctagacctc aggcctgtga atgcaggctc ccccgagtgg acagaaatct tggaggacct 61 agatcaggcc ctagaggagg agaggggaga tggaatatcc tctcccagtt cagaaacttt 121 ctcggcagtg gaggatgata gtggagggga ctctgtcctt caccccattg atccccagag 181 gggtgatagc tgagtcttgt gactgggccc ctgggcaggg gtcaagggtc agtgcccctg 241 tttcctttac cccctcctcc ccgggcaacc tttaaccctc caccgcccac acacaaggct 301 gcctgcctct acacattctc ccaagagttg tctgagccgc cgagtggaca gtggctgatt 361 atggagagca gaggcccact ggctacctcg cgcctgctgc tgttgctgct gttgctacta 421 ctgcgtcaca cccgccaggg atgggccctg agacctgttc 
tccccaccca ggtgcaggag 481 cgggacaggg cactcagctc atgcagtctt cccttctctc ctctggccct gtagcagggc 541 ctctccctct gtctgtctct gacatgtccc tactcagctt tgtttgtttt ctctttctga 601 tagagtgccc acgaccctcc ggctgtccac ctcagcaatg gcccaggaca agagcctatc 661 gctgtcatga cctttgacct caccaagatc acaaagtatg gggttggcct agcccttgac 721 ccagtcccct ggttctgccc tctctccatc agctcttctc ttttccctgt cttcctttcc 781 ttatctgtga acaccatctc ccccaaaccc acactggttc tcaaaggaca catgacatac 841 acaatctttc cttctgtgtc cttccagaac ctcctcctcc tttgaggttc gaacctggga 901 cccagaggga gtgatttttt atggggatac caaccctaag gatgactggt ttatgctggg 961 acttcgagac ggcaggcctg agatccaact gcacaatcac tgggcccagc ttacggtggg 1021 tgctggacca cggctggatg atgggagatg gcaccaggta agctagctct ggtcctcagg 1081 ggagggatgt ctggagctgg tctgaggaaa gggaacaaaa ccaagttatt gggcatccct 1141 ttaccactgt catctcgttt aatccacacg aacccccaca aagtagctat tcttggcccc 1201 atcttttctg atgggaattc ctaaggctca gtcagtatat aagtgacaag agctgagtga 1261 cccaaggcca aggatgctag ctgcttcttt aaggcatgtt ctttccacta tagtactagg 1321 ctgcctcaca ggaaggtggc agaaacagat cccaggggcc tctgattttg cttcccacct 1381 tcctgcaggt ggaagtcaag atggaggggg actctgtgct gctggaggtg gatggggagg 1441 aggtgctgcg cctgagacag gtctctgggc ccctgaccag caaacgccat cccatcatga 1501 ggattgcgct tggggggctg ctcttccccg cttccaacct tcggttgccg gtaactacac 1561 cccaggggtg gaaccctagc caagacttgg taaagcactg ctgggtggct ggccgtggga 1621 atctaagtcc acacttttag ggagaaggga agggttgaga gctgcaaggg ggaggccaaa 1681 tgctcagagg ggagtcaact gagggcaggg aggtcgggac tgcgcctccg atgccctgat 1741 ttctacatcc ccgtatctta tctctgtcac actccagctg gttcctgccc tggatggctg 1801 cctgcgccgg gattcctggc tggacaaaca ggccgagatc tcagcatctg cccccactag 1861 cctcagaagc tgtgatgtag aatcaaatcc cgggatattt ctccctccag ggactcaggc 1921 agaattcaat ctccgaggta gatttcctcg gagtctattt ttcccaccct ggccagctca 1981 gcctgcctct gtccccctct accactggcc cctttcctcc ttgagacccc agctttgagg 2041 cctcaggata atcatttctc cccacagaca ttccccagcc tcatgcagag ccctgggcct 2101 tctctttgga cctgggactc aagcaggcag caggctcagg ccacctcctt 
gctcttggga 2161 caccagagaa cccatcttgg ctcagtctcc acctccaaga tcaagtaaag ggggacagtg 2221 gggcattgct gtattcagtg gagcctggag caatgaggaa gagggagtcc aacatgtcaa 2281 tattaggaag gtttccagcc cagggaacat aacaagactg gctccacaga attgtttttc 2341 attaataatt agccaggcat ggtggtggtg cttgcctgta atcccaggtg ctggaggcca 2401 agaccagagg atcacttgag gccaggagtt tgacaccagc ctgggcaaca tagcagagac 2461 ctctgtctaa aaaaaaaaaa aaattagcca ggcatggtag cacatgtctg ctgccctagc 2521 tatttaggag cctgaggcag gaggttcact tgagcccagg agtttgaagc tgcagtgagc 2581 tatgatgtgc cactgcactc tgacctgggc cacagtgaga ccctgtctca aaaaaataaa 2641 aataaaaata aggcttatgg atggcactca ggtgggtggt aggggcgagg gacatatctt 2701 gaagctcccc acagcaagca aacagttttg acttagactg catatttact tggggcaggt 2761 gtggtttcaa aaagggtcag ccaaaaaaaa ttggggcagg atttaagtgg tgagaatggc 2821 cagtaggtgg aggcatagcg aagaggcaga attaaggcag ctaggggtga ggccacagcg 2881 agtaggcccg gctcattctt ccctctctct ctaccgtccc tttcccacac actctgcaga 2941 aggtggtgtt gtcttctggg tcggggccag ggctggatct gcccctggtc ttgggactcc 3001 ctcttcagct gaagctgagt atgtccaggg tggtcttgag ccaagggtcg aagatgaagg 3061 cccttgccct gcctccctta ggcctggctc ccctccttaa cctctgggcc aagcctcaag 3121 ggcgtctctt cctgggggct ttaccaggta agagagaatg atgttcaagt tcatgagcac 3181 aacattggaa acagctcaag ggaggcggca cattttgagg ggaaggaaac ctctgggagg 3241 gaagaagaat aggccacaag aagaagatat gggggcagtg gaaggtagtg cttttgcaaa 3301 ctcaggttgg aggagtggaa aagtggggag aagattctgg atccgagcca ccttaatgct 3361 ctaatgccac ctttgcacta cctccctcta ggagaagact cttccacctc tttttgcctg 3421 aatggccttt gggcacaagg tcagaggctg gatgtggacc aggccctgaa cagaagccat 3481 gagatctgga ctcacagctg cccccagagc ccaggcaatg gcactgacgc ttcccattaa 3541 agctccacct aagaaccccc tttgaaagtt actgattatt catttaattc aacaaatatt 3601 cactgtgcac tagcaatgta ccaggcactg tgccaagtat tgagttgtct taatgagcaa 3661 aaacactctg gttcctaccc tcttggtgcc cacgtcccat agggaagcag acattccatc 3721 aaaggctaac taataagtgg atagttggaa gcactgataa agaagaattg gagagttgtg 3781 aaaacatgga gactggccgg gcgtggtggc // LOCUS HSSLIPG 2657 bp DNA PRI 
30-MAR-1995 DEFINITION Human SLPI gene for secretory leukocyte protease inhibitor. ACCESSION X04502 NID g36485 KEYWORDS elastase inhibitor; protease inhibitor; secretory leucocyte protease inhibitor; trypsin inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2657) AUTHORS Stetler,G., Brewer,M.T. and Thompson,R.C. TITLE Isolation and sequence of a human gene encoding a potent inhibitor of leukocyte proteases JOURNAL Nucleic Acids Res. 14 (20), 7883-7896 (1986) MEDLINE 87040761 REFERENCE 2 (bases 1 to 2657) AUTHORS Rogaev,E.I., Keryanov,S.A. and Malyako,Y.K. TITLE Dinucleotide repeat polymorphisms at the P1, HBE1 and MYH7 loci JOURNAL Hum. Mol. Genet. 1 (4), 285 (1992) MEDLINE 93265040 FEATURES Location/Qualifiers source 1..2657 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 181..184 /note="put. CAAT-box" promoter 242..247 /note="put. TATA-box" mRNA 276..375 /note="put. exon 1" precursor_RNA 276..2574 /note="put. 
primary transcript of SLPI" CDS join(291..375,1092..1250,1668..1817,2397..2401) /codon_start=1 /product="secretory leukocyte protease inhibitor (SLPI)" /db_xref="PID:g758101" /db_xref="SWISS-PROT:P03973" /translation="MKSSGLFPFLVLLALGTLAPWAVEGSGKSFKAGVCPPKKSAQCL RYKKPECQSDWQCPGKKRCCPDTCGIKCLDPVDTPNPTRRKPGKCPVTYGQCLMLNPP NFCEMDGQCKRDLKCCMGMCGKSCVSPVKA" sig_peptide 291..365 /note="signal peptide (AA -25 to -1)" mat_peptide join(366..375,1092..1250,1668..1817,2397..2398) /gene="SLPI" gene 366..2398 /gene="SLPI" intron 376..1091 /gene="SLPI" /note="intron I" mRNA 1092..1250 /gene="SLPI" /note="exon 2" intron 1251..1667 /gene="SLPI" /note="intron II" mRNA 1667..1817 /gene="SLPI" /note="exon 3" intron 1818..2396 /gene="SLPI" /note="intron III" mRNA 2397..2574 /note="exon 4" misc_feature 2549..2554 /note="polyA signal" polyA_site 2574 /note="polyA site" BASE COUNT 545 a 649 c 704 g 759 t ORIGIN 1 agggctctag atggacactg agacggcctc tttaatcaac caacttccca ggccaatctc 61 ttccctttct tttctcgata gttgctgtgt ttggcctcat agccttacct ggcataggaa 121 agataaacaa tctccttggt gtcaggattt ctggtttttg gttagggttt cctgcttatg 181 caatagtagc tgggagaggc ccgaaagaat tctggtgggg ccacacccac tggtgaaaga 241 ataaatagtg aggtttggca ttggccatca gagtcactcc tgccttcacc atgaagtcca 301 gcggcctctt ccccttcctg gtgctgcttg ccctgggaac tctggcacct tgggctgtgg 361 aaggctctgg aaagtgtaag ttggagtcac tgtctaatct gggctgcagg gtcagaggtg 421 gggtctcctt gtggtgtggg tgtgtcccct tctgtaggct ctgatccctc agcttagttt 481 cgggagacct ccctgagggt ggaatacatg tctggctgag ctccaaggtt tgtgtgacag 541 tttgagcttc tggaaatgct tcctctatgc agccatgctg tcagcccagg tcccactctc 601 tctctctctc tctctctctc tctctctcat actccgcctt cttcttcacc ttgctgcgac 661 tctcaaatca ttagtttctg actctgcttc cgttgtgtct ttgcttctgc tattttgtct 721 ctgtgcttct cgcttgggat ttagctctca acttctctca cactggttct atttatcttt 781 gtttacctct ctccatctcc atcactccca gccttcctct ctgcctttgt gtagccttgt 841 tttgctcttg ggtggaggtc ttgactagaa gcctgctgcc cttttcttgg gtgtgaaacg 901 tcccctgtcc atttgtctaa tttaatcaag cccatcaata 
cacctggaga tcaggcaggc 961 atgacctttg ggctttgtgg acagctactg aggtaagggt ctctccccct caaaagtggt 1021 gctttgttca ggaggcatga tgggtcctca gtacccagcc tcctcctacc tcttgacttt 1081 ctcttcaaaa gccttcaaag ctggagtctg tcctcctaag aaatctgccc agtgccttag 1141 atacaagaaa cctgagtgcc agagtgactg gcagtgtcca gggaagaaga gatgttgtcc 1201 tgacacttgt ggcatcaaat gcctggatcc tgttgacacc ccaaacccaa gtaagcaggt 1261 cggggaactg ggtagagaga tagcctgggg acacagcatt agagggacgg aactgggtga 1321 tgggtcctgc caggcctcct tgtcaatgcc gtagtgagtc acagtgccct aagagaagta 1381 gccagctggt gaagcagcgg gcatttagat agccaggtag ttggaagcct cccacctagt 1441 cagcactggg cggctggcac ctgcataatg gggggcctga agttctagga gagccaggtg 1501 ctatgtttgg gggccgcctt agggagaagg tggtggtgat agaggtgggg aggggatgat 1561 cccccctgct gaagctggac gaggggctca ctctaaaaag tggggatggg aggggttgta 1621 taaagtacaa ggcctctgac cggtagcctc actctcaccc aacccagcaa ggaggaagcc 1681 tgggaagtgc ccagtgactt atggccaatg tttgatgctt aaccccccca atttctgtga 1741 gatggatggc cagtgcaagc gtgacttgaa gtgttgcatg ggcatgtgtg ggaaatcctg 1801 cgtttcccct gtgaaaggta agcaggggac gagggcacac tgagctccct cagccctctc 1861 agcctcaacc ctctggaggc ccaggcatat gggcaggggg actcctgaac cctactccaa 1921 gcacagcctc tgtctgactc ccttgtcctt caagagaact gttctccagg tctcagggcc 1981 aggatttcca taggagtcgc ctgtggcttt gattctattc tagtgtctct gggtgggggt 2041 cctgggcaag tgtctttctg agtctagttt ctttatcggt aaaatgtaca taatgagatg 2101 aaagtgctct gcaaagacct atgtgcacta agaattatta ttcaggtgtt tccatcatgt 2161 tttctgaggt gaaatcacaa aggatcagtg gagtttgagg attatctagt tcaatgcttt 2221 gagtttagag ttttacgtga aaatgagact tgtctcctga cactaagtct ctctcaacta 2281 tagcgctatc ttgctatttt ctctatctca gaaggatcct tgggcaggag gaaggatgtg 2341 gatatatgat ttggctggtt tctatgctga agctctgatc tgattttctc tcacagcttg 2401 attcctgcca tatggaggag gctctggagt cctgctctgt gtggtccagg tcctttccac 2461 cctgagactt ggctccacca ctgatatcct cctttgggga aaggcttggc acacagcagg 2521 ctttcaagaa gtgccagttg atcgaatgaa taaataaacg agcctatttc tctttgcaca 2581 tcctgcttct gtgattcgtt ggggggatga gtggggtggg acgtgtgagg 
gaagatctag 2641 tatgaggcct ccttcct // LOCUS HSSSPN1AG 4096 bp DNA PRI 07-DEC-1995 DEFINITION H.sapiens gene for spermidine/spermine N1-acetyltransferase. ACCESSION Z14136 NID g36606 KEYWORDS spermidine/spermine N1-acetyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4096) AUTHORS Xiao,L., Celano,P., Mank,A.R., Griffin,C., Jabs,E.W., Hawkins,A.L. and Casero,R.A. Jr. TITLE Structure of the human spermidine/spermine N1-acetyltransferase gene (exon/intron gene organization and localization to Xp22.1) JOURNAL Biochem. Biophys. Res. Commun. 187 (3), 1493-1502 (1992) MEDLINE 93038627 REFERENCE 2 (bases 1 to 4096) AUTHORS Casero,R.A. TITLE Direct Submission JOURNAL Submitted (30-JUL-1992) Casero R. A., The Johns Hopkins University School of Medicine, The Oncology Center Laboratories, 424 North Bond Street, Baltimore, Maryland, USA, 21231 FEATURES Location/Qualifiers source 1..4096 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Lung carcinoma" /cell_line="NCI H157" /chromosome="X" mRNA join(1098..1342,1583..1634,1725..1808,3258..3359, 3450..3490,3614..4096) exon 1098..1342 /number=1 gene 1277..3784 /gene="spermidine/spermine N1-acetyltransferase" CDS join(1277..1342,1583..1634,1725..1808,3258..3359, 3450..3490,3614..3784) /gene="spermidine/spermine N1-acetyltransferase" /codon_start=1 /db_xref="PID:g36607" /db_xref="SWISS-PROT:P21673" /translation="MAKFVIRPATAADCSDILRLIKELAEYEYMEEQVILTEKDLLED GFGEHPFYHCLVAEVPKEHWTPEGHSIVGFAMYYFTYDPWIGKLLYLEDFFVMSDYRG FGIGSEILKNLSQVAMRCRCSSMHFLVAEWNEPSINFYKRRGASDLSSEEGWRLFKID KEYLLKMATEE" intron 1343..1582 /gene="spermidine/spermine N1-acetyltransferase" /number=1 exon 1583..1634 /gene="spermidine/spermine N1-acetyltransferase" /number=2 intron 1635..1724 /gene="spermidine/spermine N1-acetyltransferase" /number=2 exon 1725..1808 /gene="spermidine/spermine N1-acetyltransferase" /number=3 
intron 1809..3257 /gene="spermidine/spermine N1-acetyltransferase" /number=3 exon 3258..3359 /gene="spermidine/spermine N1-acetyltransferase" /number=4 intron 3360..3449 /gene="spermidine/spermine N1-acetyltransferase" /number=4 exon 3450..3490 /gene="spermidine/spermine N1-acetyltransferase" /number=5 intron 3491..3613 /gene="spermidine/spermine N1-acetyltransferase" /number=5 exon 3614..4096 /number=6 BASE COUNT 1142 a 854 c 971 g 1129 t ORIGIN 1 gaatcaagta gggattacaa caactttcaa cagcaaccga agtcagaaga cagataccat 61 caatgtgctg aaagaggaaa acttctgaac ctcaaaatct ataaactatg aaataaaccc 121 ttcaagaatt ttaagatggg ctggtccaaa ggtagtgagt tatctcagtt gattgttcac 181 aatcagttac acatcaaact ccttatttta ctctgactac tgcacttgac tagtctaaaa 241 aaagaattat aagatgaatt aaagacacgt tcagataagc aaaactgaga acgtttggca 301 aaagcagatc tttactaaag gaaattatga agaatgtacc tcgggcagaa ggaaaagtgt 361 caagatcaag tgtcaaggaa gattagagat ttaagaaggt ataagaaggt tagaaaagta 421 agaaaaaaat atgccgtcaa ctctaaaaac tacaagtata aagaaaaaag attattggga 481 aatccaaata acctacctta gaaattctaa agatatacga gctcacattc cttggtcata 541 agagcagtgt catcattaga catcaggtcg cctctggaaa attccattgt gagtttaaaa 601 ggcaaatgac gtctataatt ctaaaactag ttctcccacc gaaactctag aaaggatctt 661 gggatctgca gggtctcccg gatcacactt tgagaactgc tgctttagtc ttcctcccat 721 ctcctaaggc cagagctcct gagtttgctt cccactggcc aaggagaaaa gagcaaggtc 781 acttgtcggg gggctgcaga gggaattacc ttctttcatt tgcaaatgtt actgggggac 841 acaccggctc ccagtagggt ttccgccaag gctccgcgaa acgccactag agggcgccgc 901 tagcgaatcc cacagcgcgc cccgctgccc ccacttttgt cctccgggtt cacacgggcg 961 cccggaagag agggtggtgc ctgggagagg aaacgatgcg ccccggggcc gcggcctcat 1021 tggatgagag gtcccacctc acggcccgag gcggggcttc tttgcgctta aaagccgagc 1081 cgggccaatg ttcaaatgcg cagctcttag tcgcgggccg actggtgttt atccgtcact 1141 cgccgaggtt ccttgggtca tggtgccagc ctgactgaga agaggacgct cccgggagac 1201 gaatgaggaa ccacctcctc ctactgttca agtacagggg cctggtccgc aaagggaaga 1261 aaagcaaaag acgaaaatgg ctaaattcgt gatccgccca gccactgccg ccgactgcag 1321 
tgacatactg cggctgatca aggtagcgga gagccagagc tcctcccggg gcgtggggtg 1381 gaggtggctc cgtttcccag ggtggggtga ttcacgtctt gaggtctcgg ccggtgcgag 1441 agcccagcct gttccccttt ctgcgcccat ccccggctcg gccgccaccc cgcccgcgcg 1501 ccccattccc ttcccctgag ctggccgggc cggagcctca ttttgttttc tacttttttc 1561 tctcctatgt gcatcccccc aggagctggc tgaatatgaa tacatggaag aacaagtaat 1621 cttaactgaa aaaggtaatt caacagtggc gggacggggg acaagcgttc ggtgcctgtc 1681 accttcgcca aggccattac cgcccctgct cccccttctt gcagatctgc tagaagatgg 1741 ttttggagag cacccctttt accactgcct ggttgcagaa gtgccgaaag agcactggac 1801 tccggaaggt aacccctcgc cctttccaga agccagagag accaagtgtt atgtaagaag 1861 tagtgtcggc tgtgtagaac cactgactac acaggccgaa gttactgaga acttggacag 1921 aaaaaatagc cagcaagtgt tcaaactact gaggaaaaaa aaaatttaga tatgctgcac 1981 ttaagaatac tagggcaggt taaaagagct gtttaagtta agtatcagag tgctgtggag 2041 actcggaagt gtttaagctg cttaagtaag tataagtgct gtggagaccc ggaagagttg 2101 gatataatgt catttgttgt aattcagttt cataaaatgg ttcttgtttg accctaacgt 2161 aacagttttt gtaattgtgt taaatcacac ttttttcttt aatttgtccc aatcttcagg 2221 ttacagtctc tagcttcgcc atgtacatgg cccttccgtg tacatggatg ggcggggagg 2281 taactaaaag atcctttaca caataaagta gatgatcatg ataaatgagg taaggtccta 2341 ttatcacaca cttcaaacac ggtagatcag aaacccacta tgatactcgc tggctgtctg 2401 tttgctaagg aatataaaat ggctagaaag tttaatttga aacctttgcc tccatttgga 2461 atagtagaca ccagttaaga gggtgtcaga tgcctttttt tttttttttt tttttttggc 2521 tggtccctgt tgattggtca gaagacagct cagctaaaag gggaagttgt ctgggtggtt 2581 gctttttttc tgacgtctgt tcctcaggct ggaagaaatg agcagaaaac aagggatgag 2641 tactttttag agtatgtgca tgttacgtaa tacctgtttc tgggcaatgc tgcttcttct 2701 gactcaacaa atggggagag caaattgaaa atgcgtaaat tggaaggcaa gttctgaaat 2761 taaacgttgt actttggcct gatgttctga cctttaagga agcaagagtt tgtaaacttc 2821 caaatattta ctattctgaa ctgccgtgta aacctgacgt attcccaagt cgacatacca 2881 gtataccaat aggatgtgaa taatgtgtgt gttgagttta aaaccatagc agttttgctc 2941 tggcaagtaa tgaaagcgtt ctcgcttcct gagtgtgagc tccagcagac tgcagagtgg 3001 ccagtccaca 
gttgtagcct gacttcagtg agttctgatg tgtgcttttt gcaaatacat 3061 gttctcagaa cagtgagatc atccagcagt ggcctggact gcactcacat aaaaatcatg 3121 agacagccat ggctacttgt ttctgtaata catgcaatgt gtgtttttta aaacctatga 3181 taggcctccg attctgcagc tgcaactttt atggaatgtt ttccttctcc acatctcatg 3241 tgatgctctt attacaggac acagcattgt tggttttgcc atgtactatt ttacctatga 3301 cccgtggatt ggcaagttat tgtatcttga ggacttcttc gtgatgagtg attatagagg 3361 tacgattgag ttcgtagcag agggtctgaa gagagttcag agttataaat gcttacaatg 3421 actttttaaa ttgtactctt tctttttagg ctttggcata ggatcagaaa ttctgaagaa 3481 tctaagccag gtatgtctta gattttgttc caaatttgta agtttactgg attattttaa 3541 tgatggaata aaaattgggt cttgagagca ggctgaaatg tcactgagtg tgtgttttac 3601 tctctcataa taggttgcaa tgaggtgtcg ctgcagcagc atgcacttct tggtagcaga 3661 atggaatgaa ccatccatca acttctataa aagaagaggt gcttctgatc tgtccagtga 3721 agagggttgg agactgttca agatcgacaa ggagtacttg ctaaaaatgg caacagagga 3781 gtgaggagtg ctgctgtaga tgacaacctc cattctattt tagaataaat tcccaacttc 3841 tcttgctttc tatgctgttt gtagtgaaat aatagaatga gcacccattc caaagcttta 3901 ttaccagtgg cgttgttgca tgtttgaaat gaggtctgtt taaagtggca atctcagatg 3961 cagtttggag agtcagatct ttctcctcga atatctttcg ataaacaaca aggtggtgtg 4021 atcttaatat atttgaaaaa aacttcattc tcgtgagtca tttaaatgtg tacaatgtac 4081 acactggtac ttagag // LOCUS HSTCRT3D 4186 bp DNA PRI 27-MAR-1995 DEFINITION Human T-cell antigen receptor gene T3-delta. ACCESSION X03934 NID g37037 KEYWORDS Alu repetitive sequence; delta chain; T-cell receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4186) AUTHORS Tunnacliffe,A., Sims,J.E. and Rabbitts,T.H. TITLE T3 delta pre-mRNA is transcribed from a non-TATA promoter and is alternatively spliced in human T cells JOURNAL EMBO J. 5 (6), 1245-1252 (1986) MEDLINE 86274627 COMMENT Data kindly reviewed (12-SEP-1986) by T. 
Rabbitts Two forms of T3-delta mRNA are made by alternative splicing in T cells, the shorter mRNA lacking third exon sequences. FEATURES Location/Qualifiers source 1..4186 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 274..284 /note="T-box; pot. regulatory function" promoter 303..317 /note="pot. promoter" misc_feature 338..443 /note="pot. leader (5' UT region)" mRNA 338..498 /note="pot. exon 1" mRNA 346..498 /note="alternate exon 1" misc_feature 346..443 /note="alternate leader (5' UT region)" CDS join(444..498,2458..2676,3145..3276,3561..3604,3828..3893) /note="put. extracellular domain of T3 delta" /codon_start=1 /product="T3 delta protein" /db_xref="PID:g755754" /db_xref="SWISS-PROT:P04234" /translation="MEHSTFLSGLVLATLLSQVSPFKIPIEELEDRVFVNCNTSITWV EGTVGTLLSDITRLDLGKRILDPRGIYRCNGTDIYKDKESTVQVHYRMCQSCVELDPA TVAGIIVTDVIATLLLALGVFCFAGHETGRLSGAADTQALLRNDQVYQPLRDRDDAQY SHLGGNWARNK" sig_peptide join(444..498,2458..2465) intron 499..2457 /note="intron I" misc_feature 581..875 /note="Alu repeat" mRNA 2458..2676 /note="exon 2" mat_peptide join(2466..2676,3145..3276,3561..3604,3828..3890) /note="put. extracellular domain of T3 delta" /product="mature T3 delta protein" intron 2677..3144 /note="intron II" mRNA 3145..3276 /note="exon 3 (alt. 
spliced)" misc_feature 3170..3251 /note="hydrophobic domain of T3 delta (aa 80-106)" intron 3277..3560 /note="intron III" mRNA 3561..3604 /note="exon 4" intron 3605..3827 /note="intron IV" mRNA 3828..3890 /note="exon 5" misc_feature 3957..3962 /note="polyadenylation signal" BASE COUNT 942 a 982 c 1147 g 1115 t ORIGIN 1 agatctggtg tccctggctg cctagtgaag ggtcctgaga aagatcagcc tccatgagaa 61 atctagctgc tacggcttgc gctatggggc cgacggcttc tctcaagggg cttcgagatg 121 tggcagtgtt taggttgtgt gtaaatgtgg ttgcattgtc aatagggacg ctaaagttca 181 ggccaccttt tccatattct ctgccagctc cctgctcaga gatagagcaa tttacaccgc 241 ttccttccta ccctacccct agcccacccc cactctgaaa atttcccacc atcaacggca 301 gaaagcagag aagcagacat cttctagttc ctcccccact ctcctctttc cggtacctgt 361 gagtcagcta ggggagggca gctctcaccc aggctgatag ttcggtgacc tggctttatc 421 tactggatga gttccgctgg gagatggaac atagcacgtt tctctctggc ctggtactgg 481 ctacccttct ctcgcaaggt aaggctactc caggtgggtg ggggaaggga cctgagaggg 541 acattactga tgggagtgag gcccacttga aagtgtttct tgccaggcac ggtggctcac 601 accagtaatc ccagagcttt gggaggccga agtgggagga tcctttgagg ccaggagttt 661 gagaccagtc tgagcaacat gatgagactc tatctttaca aaaaacaaaa gaattagcca 721 ggcatggtgg tgcacaactg tagtcccagc tatttgagag gctgaggcag gagggtcact 781 tgagcccggg atttcaaggc ttcagtgagc tgtgatcatg ccactgccct ccaacctagg 841 caacctgtga gaccttatct caaaaaaaaa aaaaagtgct tctcggtgct agcttcagtt 901 caccaggcac cccaggggtg tcaggagggc aggggcagga tgtagcatag agtctgggaa 961 acaggctctt ggctttgact gactccagcc atgggtgcat gtgggcctag aatgtctctc 1021 agaaggcctt aggccagaga tctgcagaaa atagctcagt acgtatgtac ctgaatctgg 1081 gactggccat gggagcacag cttcctgcac tactcctttt aggcctgatg gcctctggaa 1141 tcaggactct tgtttgtttg gccatgatta tgtcatccct ggaggcatgt cctagcactt 1201 atgtgaatgc tttaagagaa gttaacttta ggttgagcag gacggaggga agatttgagg 1261 ggttgttaca tggtaggatt ttaactcatc tcagtttgcc aatttcagtt ttaaaatgga 1321 aagtcccatg tccgaggaac cccttcaaca ctaagcaaac tgggtccaat ggtcacctaa 1381 tagttggctg cactagggca ctcaggattt ccactggaga cagatccctg 
aagaggacgc 1441 gactgtagga cgtgaagcat ttgatgtgca tgtgtctccc tgaggataca tgcatatact 1501 tccaacctgc ttcagaagtg tgagcaacag ctgtttcagg gttttctaga agatgctctt 1561 gtatccaaat ttgggactta gtgtgaagcc caggatgaat gaatgtagcc tccacttttc 1621 cctgtgtttg gaggcataga ctgtagtgcc tctgtggcct gtggagatgt gctcaggctg 1681 tcagaggaac tcactgcatc tggaatgttg ctgcactcac ctccagccct aaaagtctcc 1741 cacaagactt gacttgtgag gagctctagg gatggattca ataaaatttt tttcctgttc 1801 tcctccttct aaccaagggt ggaaaactgt acatgggatt tgccctggtc ccgttctgga 1861 ccacttggct gataattcag gggctggtaa accagctggt ctccacccac agccatccct 1921 ggccctggga aaggcttcag tgttgagagc cccgctttct tggttgtcaa tggagcaggg 1981 gcaatgagga aggatgagga taagaagctg aagtgggagt gttgtagatg gctgggaagt 2041 ggaattagat gtttccttga agaaaaaatg cagacatttc caatcgagga atgttcttct 2101 gtctttccac atcttccagg tgcttcagtg gtccgtgctc cccctactcc tcatcagtgg 2161 tctggattca tttccaggaa ggtgcttggg tcaatcctgc tctgaggggt cctcttttgc 2221 accaaggccc caagacctgt cctctgcttg aatggcatcc tcatgtcccc cttgacccct 2281 gcagttggtg gggaactttc tgaaggcttt ttcttgtgac ttctgtctct cttgatggaa 2341 gccataaact gtcctggacc aggaaaatgg tctctgttaa gtatgagctt ccgcagaaca 2401 aagggcttgg tgcagatcaa agagctgtct ccaactgtga tattttttcc cctttagtga 2461 gccccttcaa gatacctata gaggaacttg aggacagagt gtttgtgaat tgcaatacca 2521 gcatcacatg ggtagaggga acggtgggaa cactgctctc agacattaca agactggacc 2581 tgggaaaacg catcctggac ccacgaggaa tatataggtg taatgggaca gatatataca 2641 aggacaaaga atctaccgtg caagttcatt atcgaagtac gtgcttcctg aaccctttgg 2701 gttggaatgg atagggcttc tggatgtgag aactttctgg ctagagaggg atatggtgaa 2761 cccatctgtt ctctaaacaa agtgaggtca ctattgtaat ggtacaagcc agatttgggg 2821 accttcaaag tggaagagag agcttaactc agcaagacag agcagtaggg gtcatgttac 2881 tgcagccttg gttgggtgag aggtttggcc caccaccttg ttcaaggtcc caaaagctga 2941 gattacttct tggctcttcc cagggattcc ttaggatgca gggtggagga aggtgtggca 3001 agcccatcac caggcctttc tcccagggtc ttattgaagt ttgggttttg gcataagaga 3061 tctaagggtt gccctaggag gcagaggatg gttccctgat cttaaaggct ctcaacctct 
3121 cctctctcct cccttccccc acagtgtgcc agagctgtgt ggagctggat ccagccaccg 3181 tggctggcat cattgtcact gatgtcattg ccactctgct ccttgctttg ggagtcttct 3241 gctttgctgg acatgagact ggaaggctgt ctgggggtta gtggaagagc agagcatgag 3301 agtgtgtttg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtaccag tgagcttatg 3361 caccagctaa ggaatggggc tggtggtagg tttagtctca gctgggagtt actcttgcaa 3421 tctcctaaag actcctagaa ctttgactgt tgaaagaact tgtgtgttca tatcactcat 3481 gcagacttct gagggtgtgg gagggtggat ctcacagtcc catctgctag gccattgatg 3541 tctctctctg gttcttctag ctgccgacac acaagctctg ttgaggaatg accaggtcta 3601 tcaggtgagc gttgagggga aggaggcagg aatgaaggga gggtaagtgg ggatagagag 3661 gctgacactg aatgctgttt gcacgtggga agggtcctac tggggagttc attgctgggt 3721 gtgactggag aggtcaggca ggagctctca tcgtcagggc ctgctggggc ccttccttga 3781 agagttctaa ccctgcctcc tgaccgtttg ctcttctcct ctcttagccc ctccgagatc 3841 gagatgatgc tcagtacagc caccttggag gaaactgggc tcggaacaag tgaacctgag 3901 actggtggct tctagaagca gccattacca actgtacctt cccttcttgc tcagccaata 3961 aatatatcct ctttcactca gcaggtgcct gggcttctta aggctcctgg gcaaggcgtg 4021 ggagttgtcc tgacttgctt gggatctcgc cctcctacct acctgtttct tccttcatct 4081 ccttccttcc tctgcctcac acagacagtg tgttgggcag tctctcgcca catcctggct 4141 gtctggtgtt tctagcccac agggctccct gggtgaaggg tctgcg // LOCUS HSTNFB 3037 bp DNA PRI 23-JUN-1993 DEFINITION Human gene for lymphotoxin (TNF-beta). ACCESSION X02911 NID g37215 KEYWORDS lymphotoxin; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3037) AUTHORS Nedwin,G.E., Naylor,S.L., Sakaguchi,A.Y., Smith,D., Jarrett-Nedwin,J., Pennica,D., Goeddel,D.V. and Gray,P.W. TITLE Human lymphotoxin and tumor necrosis factor genes: structure, homology and chromosomal localization JOURNAL Nucleic Acids Res. 
13 (17), 6361-6373 (1985) MEDLINE 86016093 FEATURES Location/Qualifiers source 1..3037 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 791..796 misc_feature 817..820 /note="transcription initiation region" CDS join(1276..1374,1461..1566,1814..2226) /codon_start=1 /product="lymphotoxin (TNF-beta)" /db_xref="PID:g312411" /db_xref="SWISS-PROT:P01374" /translation="MTPPERLFLPRVCGTTLHLLLLGLLLVLLPGAQGLPGVGLTPSA AQTARQHPKMHLAHSNLKPAAHLIGDPSKQNSLLWRANTDRAFLQDGFSLSNNSLLVP TSGIYFVYSQVVFSGKAYSPKATSSPLYLAHEVQLFSSQYPFHVPLLSSQKMVYPGLQ EPWLHSMYHGAAFQLTQGDQLSTHTDGIPHLVLSPSTVFFGAFAL" exon <1276..1374 /number=1 sig_peptide join(1276..1374,1461..1463) intron 1375..1460 /number=1 intron 1375..1460 exon 1461..1566 /number=2 mat_peptide join(1464..1566,1814..2223) /product="lymphotoxin (TNF-beta)" intron 1567..1813 /number=2 exon 1814..2854 /number=3 polyA_signal 2836..2841 polyA_site 2854 BASE COUNT 648 a 953 c 733 g 703 t ORIGIN 1 gaattctcga aacttccttt gtagaaaact ttggaaggtg tctgccacat tgatcctgga 61 atgtgtgttt atttggggtt atataaatct gttctgtgga agccacctga agtcaggaag 121 agatggaggg catccttcag gagtgagatg agacctcatc atacttgact gtccagcatc 181 atctctgagt gaggggacca aaaaatttat cttccaaact aggacacttt caagagtgga 241 agggggatcc attaatattt tcacctggac aagaggcaaa caccagaatg tccccgatga 301 aggggatata taatggacct tcttgatgtg aaacctgcca gatgggctgg aaagtccgta 361 tactgggaca agtatgattt gagttgtttg ggacaaggac aggggtacaa gagaaggaaa 421 tgggcaaaga gagaagcctg tactcagcca agggtgcaga gatgttatat atgattgctc 481 ttcagggaac cgggcctcca gctcacaccc cagctgctca accacctcct ctctgaattg 541 actgtccctt ctttggaact ctaggcctga ccccactccc tggccctccc agcccacgat 601 tcccctgacc cgactccctt tcccagaact cagtcgcctg aacccccagc ctgtggttct 661 ctcctaggcc tcagcctttc ctgcctttga ctgaaacagc agtatcttct aagccctggg 721 ggcttccccg ggccccagcc ccgacctaga acccgcccgc tgcctgccac gctgccactg 781 ccgcttcctc tataaaggga cctgagcgtc cgggcccagg ggctccgcac agcaggtgag 841 gctctcctgc cccatctcct tgggctgccc gtgcttcgtg ctttggacta ccgcccagca 901 gtgtcctgcc 
ctctgcctgg gcctcggtcc ctcctgcacc tgctgcctgg atccccggcc 961 tgcctgggcc tgggccttgg tgggtttggt tttggtttcc ttctctgtct ctgactctcc 1021 atctgtcagt ctcattgtct ctgtcacaca ttctctgttt ctgccatggt tcctctctgt 1081 tcccttcctg tctctctctg tctccctctg ctcaccttgg ggtttctctg actgcatctt 1141 gtccccttct ctgtcgatct ctctctcggg ggtcgggggg tgctgtctcc cagggcggga 1201 ggtctgtctt ccgccgcgtg ccccgccccg ctcactgtct ctctctctct ctctctttct 1261 ctgcaggttc tccccatgac accacctgaa cgtctcttcc tcccaagggt gtgtggcacc 1321 accctacacc tcctccttct ggggctgctg ctggttctgc tgcctggggc ccaggtgagg 1381 cagcaggaga atgggggctg ctggggtggc tcagccaaac cttgagccct agagcccccc 1441 tcaactctgt tctcccctag gggctccctg gtgttggcct cacaccttca gctgcccaga 1501 ctgcccgtca gcaccccaag atgcatcttg cccacagcaa cctcaaacct gctgctcacc 1561 tcattggtaa acatccacct gacctcccag acatgtcccc accagctctc ctcctacccc 1621 tgcctcagga acccaagcat ccacccctct cccccaactt cccccacgct aaaaaaaaca 1681 gagggagccc actcctatgc ctccccctgc catcccccag gaactcagtt gttcagtgcc 1741 cacttcctca gggattgaga cctctgatcc agacccctga tctcccaccc ccatccccta 1801 tggctcttcc taggagaccc cagcaagcag aactcactgc tctggagagc aaacacggac 1861 cgtgccttcc tccaggatgg tttctccttg agcaacaatt ctctcctggt ccccaccagt 1921 ggcatctact tcgtctactc ccaggtggtc ttctctggga aagcctactc tcccaaggcc 1981 acctcctccc cactctacct ggcccatgag gtccagctct tctcctccca gtaccccttc 2041 catgtgcctc tcctcagctc ccagaagatg gtgtatccag ggctgcagga accctggctg 2101 cactcgatgt accacggggc tgcgttccag ctcacccagg gagaccagct atccacccac 2161 acagatggca tcccccacct agtcctcagc cctagtactg tcttctttgg agccttcgct 2221 ctgtagaact tggaaaaatc cagaaagaaa aaataattga tttcaagacc ttctccccat 2281 tctgcctcca ttctgaccat ttcaggggtc gtcaccacct ctcctttggc cattccaaca 2341 gctcaagtct tccctgatca agtcaccgga gctttcaaag aaggaattct aggcatccca 2401 ggggaccaca cctccctgaa ccatccctga tgtctgtctg gctgaggatt tcaagcctgc 2461 ctaggaattc ccagcccaaa gctgttggtc ttgtccacca gctaggtggg gcctagatcc 2521 acacacagag gaagagcagg cacatggagg agcttggggg atgactagag gcagggaggg 2581 gactatttat gaaggcaaaa 
aaattaaatt atttatttat ggaggatgga gagaggggaa 2641 taatagaaga acatccaagg agaaacagag acaggcccaa gagatgaaga gtgagagggc 2701 atgcgcacaa ggctgaccaa gagagaaaga agtaggcatg agggatcaca gggccccaga 2761 aggcagggaa aggctctgaa agccagctgc cgaccagagc cccacacgga ggcatctgca 2821 ccctcgatga agcccaataa acctcttttc tctgaaatgc tgtctgcttg tgtgtgtgtg 2881 tctgggagtg agaacttccc agtctatcta aggaatggag ggagggacag agggctcaaa 2941 gggagcaaga gctgtgggga gaacaaaagg ataagggctc agagagcttc agggatatgt 3001 gatggactca ccaggtgagg ccgccagact gctgcag // LOCUS HSTPI1G 5005 bp DNA PRI 24-MAR-1997 DEFINITION H.sapiens TPI1 gene for triosephosphate isomerase. ACCESSION X69723 NID g1906326 KEYWORDS triosephosphate isomerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5005) AUTHORS Maquat,L.E. TITLE Direct Submission JOURNAL Submitted (07-DEC-1992) L.E. Maquat, Roswell Park Cancer Inst., Dept. of Human Genetics, Elm and Carlton Streets, Buffalo, NY 14263, USA REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 5005) AUTHORS Maquat,L.E. TITLE Direct Submission JOURNAL Submitted (13-OCT-1994) L.E. Maquat, Roswell Park Cancer Inst., Dept. of Human Genetics, Elm and Carlton Streets, Buffalo, NY 14263, USA REMARK revised by [6] REFERENCE 3 (bases 1 to 5000) AUTHORS Nesic,D., Cheng,J. and Maquat,L.E. TITLE Sequences within the last intron function in RNA 3'-end formation in cultured cells JOURNAL Mol. Cell. Biol. 13 (6), 3359-3369 (1993) MEDLINE 93268285 REFERENCE 4 (bases 1 to 5005) AUTHORS Cohen-Solal,M. TITLE Direct Submission JOURNAL Submitted (08-JUL-1996) M. Cohen-Solal, Unite INSERM U91, Hopital Henri Mondor, 94010 Creteil, France REMARK revised by [9] REFERENCE 5 (bases 1 to 5005) AUTHORS Brown,J.R., Daar,I.O., Krug,J.R. and Maquat,L.E. TITLE Characterization of the functional gene and several processed pseudogenes in the human triosephosphate isomerase gene family JOURNAL Mol. 
Cell. Biol. 5 (7), 1694-1706 (1985) MEDLINE 85267686 REFERENCE 6 (bases 1 to 5005) AUTHORS Boyer,T.G., Krug,J.R. and Maquat,L.E. TITLE Transcriptional regulatory sequences of the housekeeping gene for human triosephosphate isomerase JOURNAL J. Biol. Chem. 264 (9), 5177-5187 (1989) MEDLINE 89174806 REFERENCE 7 (bases 1 to 5005) AUTHORS Cohen-Solal,M. TITLE Direct Submission JOURNAL Submitted (14-FEB-1997) M. Cohen-Solal, Unite INSERM U91, Hopital Henri Mondor, 94010 Creteil, France FEATURES Location/Qualifiers source 1..5005 /organism="Homo sapiens" /note="Allele: hTPI-8B" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" variation 200 /note="polymorphism" /replace="c" mRNA join(597..749,1916..2039,2151..2235,2310..2442,2740..2825, 3101..3188,3317..3880) /gene="TPI1" gene 597..3880 /gene="TPI1" prim_transcript 597..3880 /gene="TPI1" exon 597..749 /gene="TPI1" /number=1 CDS join(635..749,1916..2039,2151..2235,2310..2442,2740..2825, 3101..3188,3317..3435) /gene="TPI1" /EC_number="5.3.1.1" /codon_start=1 /product="triosephosphate isomerase" /db_xref="PID:g37247" /db_xref="SWISS-PROT:P00938" /translation="MAPSRKFFVGGNWKMNGRKQSLGELIGTLNAAKVPADTEVVCAP PTAYIDFARQKLDPKIAVAAQNCYKVTNGAFTGEISPGMIKDCGATWVVLGHSERRHV FGESDELIGQKVAHALAEGLGVIACIGEKLDEREAGITEKVVFEQTKVIADNVKDWSK VVLAYEPVWAIGTGKTATPQQAQEVHEKLRGWLKSNVSDAVAQSTRIIYGGSVTGATC KELASQPDVDGFLVGGASLKPEFVDIINAKQ" intron 750..1915 /gene="TPI1" /number=1 exon 1916..2039 /gene="TPI1" /number=2 intron 2040..2150 /gene="TPI1" /number=2 exon 2151..2235 /gene="TPI1" /number=3 intron 2236..2309 /gene="TPI1" /number=3 exon 2310..2442 /gene="TPI1" /number=4 intron 2443..2739 /gene="TPI1" /number=4 exon 2740..2825 /gene="TPI1" /number=5 intron 2826..3100 /gene="TPI1" /number=5 variation 2898 /gene="TPI1" /note="polymorphism" /replace="g" exon 3101..3188 /gene="TPI1" /number=6 intron 3189..3316 /gene="TPI1" /number=6 exon 3317..3880 /gene="TPI1" /number=7 BASE COUNT 1013 a 1415 c 1535 g 1042 t ORIGIN 1 ctgcagttcc tgccaggcct 
tgccagccgg ggcgagggtt gggatgatcc tggcggccta 61 tgcctgtgtg ggctgcccct cccgctgtga accctgcatt tgtcccgcaa gttttcactc 121 aggtagactc cctgggtaca agggtgcctg ctcagcagtc gggcatgagc tgctccgatg 181 ggcgaaggag gttgtctatt ccacagttgg agaggggccc tctctgcccc agtgggcgat 241 ctgggctacg gccaagttgc caccagctag ttccgcttga aaaccacttc tggccccgtg 301 ggggactcaa gtcgccaagc gagggttccc ctgagcgccg gagctcacag gtctcgcctt 361 gtcccgaaag ccccgcaatc gaggcggagg cgaccgagcc cccgactctc ctagaacgtt 421 gccacaagaa gggggaacgt cggaacagtg catcatcggg cggcggccgg ggcggcggca 481 ggagggcggg cggggggcag ggctccgggg gactgggcgg gccatggcgg aggacggcga 541 ggaggcggag ttccacttcg cggcgctcta tataagtggg cagtggccgc gactgcgcgc 601 agacactgac cttcagcgcc tcggctccag cgccatggcg ccctccagga agttcttcgt 661 tgggggaaac tggaagatga acgggcggaa gcagagtctg ggggagctca tcggcactct 721 gaacgcggcc aaggtgccgg ccgacaccgg taagccctcg ccgaggaggg gtctggccgg 781 gccggggccg ggccggggca ggagtggcag cgcctctccc gaggcccgag gtccgggccg 841 gtatccgcgc ggacctgatg cagggctgtg ggacgagggc cgctggggtc cgggcagggg 901 cctcgcagcc gcagccccgt cggtgcgtcg agggggcagg gcggagcaca tgatgcccct 961 tggactacgg ggcaggtaag gacgttttgg gtctcctgga ggaaggcggc cccggggcgc 1021 gcactggctg tgcccgccag gcgacggggt taggagccga gcccgaggct ctgcgggaga 1081 ccgggggagg ctgggccgcg tgggcttccg ctccctgccc tggcctccgc gtgcgcgccg 1141 ccgcacgtag ccccagactc ctccccctcc tcgccggcgt cgtcccgcgc cgagctgctg 1201 ctgccctgag cccccagatc tgaacccctt cccttcggca acctgagcga ctcccgcctt 1261 ccacggaagg gaccgagccc gtgccaaaca ggctgagcga tttgggagtg aggagccatc 1321 ctaccgcttt ccccaacctg gaaacagcaa agcgcaaggc ctctgagtca gttaggtctc 1381 tgccacccac gggcaaagga tgctctcctc catcctcctt cctccctcca ccgaaatcgg 1441 agagccgcgg gcctgatcca aagaggcatc cccttctcgt tcattcccca gaggcctcaa 1501 tacaaacccc aggagttggc ccctctcctt ttgctacaaa tccttgcctt gcaaagggga 1561 ggtgaggatg ggctatttta gaagggaagc agggttgctc cctggagaat gctgagtctg 1621 tgaggtgcct atgccgagaa tagctcgagg aaattggagc cccagctgtt aaaagagcag 1681 agggcagggt gagggccgtg gctctcaggg gtatctggaa 
ggctcttcga gttgagtgca 1741 gacccagcct tgggctggaa aatggacaaa ggtcatcttg ctggggtgaa aagggggaga 1801 gcagaaccaa gaagaagagg gtgagggctg gggggctcca gggcactggt taggaattgt 1861 ggggaatgaa ggctttcttt agtctcatcc ccctgtggta ccatcttgtc ctcagaggtg 1921 gtttgtgctc cccctactgc ctatatcgac ttcgcccggc agaagctaga tcccaagatt 1981 gctgtggctg cgcagaactg ctacaaagtg actaatgggg cttttactgg ggagatcagg 2041 tgagatcgag gtggagaggg gtgtgtggga cccttccctc actttcctcg ttgaggggaa 2101 agccacaggg tgggctccct gctgaacctt ggcttcatct cttcctttag ccctggcatg 2161 atcaaagact gcggagccac gtgggtggtc ctggggcact cagagagaag gcatgtcttt 2221 ggggagtcag atgaggttag tagccaagag agaagataag ggatgtcttt ttccaagaag 2281 gatgtctcac caagtctgtt tctcaacagc tgattgggca gaaagtggcc catgctctgg 2341 cagagggact cggagtaatc gcctgcattg gggagaagct agatgaaagg gaagctggca 2401 tcactgagaa ggttgttttc gagcagacaa aggtcatcgc aggtatctct ggagaaaggg 2461 acctttgagc ctatccaggg ccacagagac tcagagggta gggtcaggcc ctggagcctg 2521 tcttggtccc catgctgatc cagaaaagga aaaaggggag ggggagtgac aatctttgct 2581 tggggcctat gacttctcca gccccaaggt agatgccacc tggaaatccc ccaatgtcca 2641 ctagggggca gtaggccacc gttcttcgta ctccggagaa cctggctgga gagctctttc 2701 ttgttcaccc ttccctccat ctgtatctct gccctgcaga taacgtgaag gactggagca 2761 aggtcgtcct ggcctatgag cctgtgtggg ccattggtac tggcaagact gcaacacccc 2821 aacaggtaac cgggcccagg agccctgccc tcatcccagc ctgcctcaat aggtttggac 2881 agacacagcc cacatggagc aaccccttat ttcaaagaca cagagacctt gaacccagag 2941 acagtgactt gtccaagggc atccagtcca gggcctggct tggatcagag ccctggtact 3001 ctgactcagt cagaaaccac actaagtgtc cactggtgcc agtgattttt cctcttagag 3061 aggcagaaaa ggtcttactt aggccagctt cttgttctag gcccaggaag tacacgagaa 3121 gctccgagga tggctgaagt ccaacgtctc tgatgcggtg gctcagagca cccgtatcat 3181 ttatggaggt gagtggcttt ggttcccggc tgaggtggag tgggctgagg actagactga 3241 gccctcggac atggaggtgg ggatggggca gactcatccc attcttgacc aagcccttgt 3301 tctgctccct tcccaggctc tgtgactggg gcaacctgca aggagctggc cagccagcct 3361 gatgtggatg gcttccttgt gggtggtgct tccctcaagc ccgaattcgt 
ggacatcatc 3421 aatgccaaac aatgagcccc atccatcttc cctacccttc ctgccaagcc agggactaag 3481 cagcccagaa gcccagtaac tgccctttcc ctgcatatgc ttctgatggt gtcatctgct 3541 ccttcctgtg gcctcatcca aactgtatct tcctttactg tttatatctt caccctgtaa 3601 tggttgggac caggccaatc ccttctccac ttactataat ggttggaact aaacgtcacc 3661 aaggtggctt ctccttggct gagagatgga aggcgtggtg ggatttgctc ctgggttccc 3721 taggccctag tgagggcaga agagaaacca tcctctccct tcttacaccg tgaggccaag 3781 atcccctcag aaggcaggag tgctgccctc tcccatggtg cccgtgcctc tgtgctgtgt 3841 atgtgaacca cccatgtgag ggaataaacc tggcactagg tcttgtggtt tgtctgcctt 3901 cactggactt gcccagataa tcttcctttt tgaggcagct atataaatga tcatttgtgc 3961 aagaaaaaaa aaaaaacaag aacaggtttc tataacaaca tctcttacta tttttacttg 4021 aaaaaatgtt ttgcgtagca gactgtcata gccttgaacg ccggctccct ttcttcctcc 4081 ctccaagtgg ctctggggct gttgatttcc gcagagcttg ggttggggta gggctcagcc 4141 tcaccagctt tcagcagctg gtctaggcca gcagtgcctc cccacctccc caagggaggg 4201 tggtggcaag acctcagcac agtctgtggt atcacaggct cactggtaga gcagtagcgc 4261 ttcatgcagg gggcaagggc agggcagaca cctggccgag cggtatcccc aggttgtggc 4321 gcacacacag gcggctcagg tgcagaaggg agtgtggctc cgctgggaga gagaaggagg 4381 ggaatgtaag tatgggtgca gccaccagcc agatgtcctc aaactacggg gtcctcatca 4441 gatgcctttc tgctttcctg cttcgagtgt gcccacctgg ctgaaagggg aatttgagat 4501 acccggaagt tctgcctccc agataagatt tcacacatcc ctagtcagag ctgggggtga 4561 agagctggct aaggccctct aaacaacagg ccaaggtggc tctgacagtg gtggagctgg 4621 cccaggcttt gactccagag gcttgggagc tggggctgag gtgaggaggg atggccctcc 4681 actctacagc ccaacacaac tgcagagagc agctccaagc cctggaccca gtcagttcct 4741 ggggaggctc ctcccctgct gccccaccct aaggcctgcc tcctccactg ctctcctcct 4801 ccctggtgcc cagggcccca gtgtctccat cctgaggtgt ggctgaggaa ggaagtaggt 4861 atgtggcaca gagacaggtt agagcccagg gaatccggta tacagcctgg gtacctcgtc 4921 tgcccatcct tcttttggac ctgtacatca aacccagtac ctaaccgttt gcacctcttg 4981 cctaggggtg attactcctg aattc // LOCUS HSTUBAG 4087 bp DNA PRI 26-APR-1993 DEFINITION Human gene for alpha-tubulin (b alpha 1). 
ACCESSION X01703 NID g37491 KEYWORDS alpha-tubulin; direct repeat; tubulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4087) AUTHORS Hall,J.L. and Cowan,N.J. TITLE Structural features and restricted expression of a human alpha-tubulin gene JOURNAL Nucleic Acids Res. 13 (1), 207-223 (1985) MEDLINE 85215466 FEATURES Location/Qualifiers source 1..4087 /organism="Homo sapiens" /db_xref="taxon:9606" exon 320..535 /number=1 misc_feature 320 /note="put. cap site" misc_feature 343..371 /note="pot. regulatory sequence" CDS join(533..535,2064..2286,2435..2583,2888..3868) /codon_start=1 /product="alpha-tubulin" /db_xref="PID:g37492" /db_xref="SWISS-PROT:P04687" /translation="MRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTIGG GDDSFNTFFSETGAGKHVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLITGKEDAANN YARGHYTIGKEIIDLVLDRIRKLADQCTRLQGFLVFHSFGGGTGSGFTSLLMERLSVD YGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLD IERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVI SAEKAYHEQLSVAEITNACFEPANQMVKCDPGHGKYMACCLLYRGDVVPKDVNAAIAT IKTKRTIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARLD HKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVHSVEGEGEEEGEE Y" intron 536..2063 /number=1 repeat_unit 801..810 /note="imp. direct repeat 1" repeat_unit 811..836 /note="imp. direct repeat 2" repeat_region 837..872 /note="18 GT repeats" repeat_unit 877..901 /note="imp. direct repeat 2'" repeat_region 902..925 /note="12 GT repeats" repeat_unit 937..946 /note="imp. 
direct repeat 1'" exon 2064..2286 /number=2 intron 2287..2434 /number=2 exon 2435..2583 /number=3 intron 2584..2887 /number=3 exon 2888..>4063 /number=4 polyA_signal 4058..4063 BASE COUNT 1014 a 942 c 1015 g 1116 t ORIGIN 1 gaattcatgc cgttgggtgg agtcagcgcc cccaggctct acttggaaaa cctttaagct 61 cttttctttc gtaagctctc tgggcgaggg tggtggtatg ttttgtgagg tttagcttag 121 ccccaaatcc tcaagccccg ccgccgccgc tagtgcggtg caggaaccgg gccagtactg 181 cgcccaggga cagagcgctg gggaggaaca aaggcggcgc taggctgtgt tatccgagag 241 atctttcggg ggccgcgggc agcccgtcct gccgcgaccg agggtctggg cgtcccggct 301 gggccccgtg tctgtgcgca cggtttcgct gatgctgagg ggccactttc tgtctcgcgt 361 tgttctctgg ggaccgggag aggaggaggc acccaaaaag agcgggggcg ttgggcgagc 421 tcgggggacg tgggaggggg aacgggaaca aagcgcagcc tagggttagc gtgggaagac 481 cctccgcggt ctttggcgtt ttggaaagat acccacacat tcccgggaaa acatggtgag 541 tttctgcccg gagcccccgg agcgggtgtc agggcggcga ggggcggggt tgtttgtttc 601 tggcttctat ggcgttggag ccactgggcg cggttcgcct cactgaacct cttctgtcag 661 gagctgactg aaaaaaaaac aaaaaaacct ttcatcattg cggaactgta ggctccaaaa 721 gggttttctt cactattata agttagatga cttttttttt tcttgagcaa aatcataatt 781 cacttcacaa gctctttaat gtctggtctg gggacgccct gccctgaccg actgaagtgt 841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtctgtggga cgcctgccct gaccgactga 901 agtgtgtgtg tgtgtgtgtg tgtgtctgtc tgtctcgtct ggactgcaca gttcagcgag 961 ggagaaaggc ccactttgtg agggtaccga tggtcaggac ccagggaaac gcccttcccc 1021 gccgcccccc cgccccgccc ccaccacatt cagcgaatag acaattgaaa gtggtagccc 1081 taaagaccac agagaagaaa acctctattg gatgcaaaga atatgaatat tatgtgatgg 1141 gtagagaatc tcaggatgaa aatactattt tgttgtttta aataaatatt tcattatcct 1201 tccactgggc ttttattctt tggtaccttt tcatgtgatg cttgtttcta acttaggaac 1261 ttttgtgtgt gtgtgtgtaa gatacggata atttttcagc ttttacagtg gagaagatct 1321 ggaaaaaggt ttttttttaa aaaaaaaagt aatgaaattg ctacagacaa agaagaatta 1381 tactccgctt cccgttgtcc cccgttccag tgcatcttaa ttaattcatt tcaattcagg 1441 cacatggtcc cgggcggtca gaggaggaaa actggcaaaa cagcacaatg agatcatgta 1501 ggcagctgct 
ggaaatagag cttgctctgt taaataatgt agcagacagt acaggctagc 1561 accaggcaca cagcaaatac agcaatgcag caatgcagaa ggcagacctt gtctaaactc 1621 ctagtattga tggattctgc agtacagatg tccggattat aatatcaagt cctattcaga 1681 ggaaactttc atctttattt aaaagggaag aaagcagtaa aattaatccc aattaagtca 1741 taattggatt tacttcattt tacaaatttg tgctttgaat actgaatagc ttttaaatat 1801 gaaaatcttc tattcaagac tggtagtagg ccaatggctg gatacccgtg ctgacagggc 1861 caaggcgaca atcattattc agaccacacc catatgcagc atttgtagca ggtgattttc 1921 cttaaatctt tgtatcgtgc tggggatatg acctcaaata atttagaaaa atatctgtat 1981 attattagaa atattttgaa atttcctata atttaaatgc taatacacct taattttaca 2041 tttttcactt ttcctcccca cagcgtgagt gcatctccat ccacgttggc caggctggtg 2101 tccagattgg caatgcctgc tgggagctct actgcctgga acacggcatc cagcccgatg 2161 gccagatgcc aagtgacaag accattgggg gaggagatga ttccttcaac accttcttca 2221 gtgagacggg ggctggcaag catgtgcccc gggcagtgtt tgtagacttg gaacccacag 2281 tcattggtga gttgacctca gtaacccaag tgagatccca gggtgctgga caggaggtct 2341 gtcctggggg gctccgctgg tcactcaccc actctcctcc cgctcctcgt cccctcctcc 2401 tcctccccct gctcctcccc catcatgtct ccagatgaag ttcgcactgg cacctaccgc 2461 cagctcttcc accctgagca actcatcaca ggcaaagaag atgctgccaa taactatgcc 2521 cgagggcact acaccattgg caaggagatc attgacctcg tgttggaccg aattcgcaag 2581 ctggtatgtt tcttttcaag aataaagtaa attaatgagc ctaaagaaca cttttgaaat 2641 aatgcttttt ttttcaaaca cagaattgaa ctgttatttt aataaagagt ggaatgagtc 2701 attctttggg gtttttaaaa ttcagttaaa atgaactatt tgatgtcatt ttgtaaatgt 2761 taatgagaat tttttaaaag catttgtcaa ataagatcta agtcctggag atgtatgaaa 2821 gtgaaatata ttactatgat gtactacaag ataaactaac ctttcctctg tcctctcttt 2881 tgtataggcc gaccagtgca cgcgtctcca gggcttcttg gttttccaca gctttggtgg 2941 gggaactggt tctgggttca cctcgctgct catggaacgt ctctcagttg attatggcaa 3001 gaagtccaag ctggagttct ctatttaccc ggcgccccag gtttccacag ctgtagttga 3061 gccctacaac tccatcctca ccacccacac caccctggag cactctgatt gtgccttcat 3121 ggtagacaat gaggccatct atgacatctg tcgtagaaac ctcgatattg agcgtccaac 3181 ctatactaac ctgaataggt 
taataggtca aattgtgtcc tccatcactg cttccctgag 3241 atttgatgga gccctgaatg ttgacctgac agaattccag accaacctgg tgccctatcc 3301 ccgcatccac ttccctctgg ccacatatgc ccctgtcatc tctgctgaga aagcctacca 3361 tgaacagctt tctgtagcag agatcaccaa tgcttgcttt gagccagcca accagatggt 3421 gaaatgtgac cctggccatg gtaaatacat ggcttgctgc ctgttgtacc gtggtgacgt 3481 ggttcccaaa gatgtcaatg ctgccattgc caccatcaag accaagcgta ccatccagtt 3541 tgtggattgg tgccccactg gcttcaaggt tggcatcaac taccagcctc ccactgtggt 3601 gcctggtgga gacctggcca aggtacagag agctgtgtgc atgctgagca acaccacagc 3661 cattgctgag gcctgggctc gcctggacca caagtttgac ctgatgtatg ccaaacgtgc 3721 ctttgttcac tggtacgttg gggaggggat ggaggaaggt gagttttcag aggcccgtga 3781 ggacatggct gcccttgaga aggattatga ggaggttggt gtgcattctg ttgaaggaga 3841 gggtgaggaa gaaggagagg aatactaaag ttaaaacgtc acaaaggtgc tgcttttaca 3901 gggaagctta ttctgtttta aacattgaaa atgttgtggt ctgatcagtt aatttgtatg 3961 tagcagtgta tgctctcata tcaattactg acctatgctc taaaacatga atgcctttgt 4021 tacagaccca agctgtccat ttctgtgatg ggttttgaat aaagtattcc ctgtcttaaa 4081 tgaattc // LOCUS HSU01102 4995 bp DNA PRI 25-MAY-1994 DEFINITION Human lung Clara cells 10 kda secretory protein (CC10) gene, satellite and Alu repeat sequences, complete cds. ACCESSION U01102 NID g457934 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4995) AUTHORS Hay,J.G., Danel,C., Chu,C. and Crystal,R.G. TITLE High level expression of the human CC10 gene in human airway epithelium and subchromosomal localization based on a polymorphic microsatellite and human-specific Alu insertion suggest a possible linkage JOURNAL Unpublished REFERENCE 2 (bases 1 to 4995) AUTHORS Hay,J.G. TITLE Direct Submission JOURNAL Submitted (29-AUG-1993) J.G. 
Hay, National Heart Lung and Blood Institute, Pulmonary Branch, Bethesda, MD, USA, 20892 FEATURES Location/Qualifiers source 1..4995 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pPB371" /clone_lib="ATCC 57726" /map="D11S18 - D11S97" exon 550..667 gene 613..4892 /gene="CC10" CDS join(613..667,3708..3895,4860..4892) /gene="CC10" /codon_start=1 /product="Clara cells 10 kda secretory protein" /db_xref="PID:g457935" /translation="MKLAVTLTLVTLALCCSSASAEICPSFQRVIETLLMDTPSSYEA AMELFSPDQDMREAGAQLKKLVDTLPQKPRESIIKLMEKIAQSSLCN" intron 668..3707 /gene="CC10" repeat_unit 1825..2101 /gene="CC10" /rpt_family="Alu" repeat_unit 2159..2431 /gene="CC10" /rpt_family="Alu" satellite 2948..2962 /gene="CC10" /note="Repeat sequence CTTTT, repeated 3 or 4 times, flanking an Alu repeat" /rpt_type=tandem satellite 2980..3011 /gene="CC10" /note="TTGC is repeated sequence, repeated 8,9 or 10 times. flanking an Alu repeat." /rpt_type=tandem repeat_unit 3004..3294 /gene="CC10" /rpt_family="Alu" exon 3708..3895 /gene="CC10" intron 3896..4859 /gene="CC10" repeat_unit 3941..4232 /gene="CC10" /note="Human specific Alu repeat. Polymorphic, only present in 3% of chromosomes." 
/rpt_family="Alu" repeat_unit 4386..4674 /gene="CC10" /rpt_family="Alu" exon 4860..4995 BASE COUNT 1232 a 1296 c 1226 g 1241 t ORIGIN 1 aattctggaa gggtcccttt tattcaactg cttcaatcca ggggcccccg aagtctgacc 61 acagcaatgc tccaaaccat gtgtctttcc tggcttaagg ttcagtcgcc ctcctcagag 121 gggagcctat gaaagagccc agtggagtgt cagggtcctg agtcctagtc ctagtcctgt 181 ccctgccact tgtgagggaa cttgggcctc agtttctcca ggtgggctcc acaattgctt 241 ctcttgatct ggactgcccc agtgcccagg ttcagtgagt gacacaggca gctgggtttc 301 cacatcctct gacttgggtt cccttcactg cctccaggca ggctcggccc tccaccccaa 361 gtggcccatt gtgtgagctc agtttcagtg gggacagaaa ctgggttgag aaaagggaat 421 atttacctat cccaccaagc caatgccaag taaatagtgc agtatcttat gtagagccct 481 tgccctgccc ttccccatct gggtgctgct gcctagagca tataaaaggc accttgctgg 541 gcatgtctca tactagccca ccagactcag agacggaacc agagacaggc cagagcatcc 601 ccctcctcca ccatgaaact cgctgtcacc ctcaccctgg tcacactggc tctctgctgc 661 agctccggtg agtgctcaga gacccttccc tccctcctgg acttaggaac tctcaggacc 721 ccccagttct gctcagaaga aggagtgagc tgcccattcc tgctctggag ctgctgggag 781 gacctgggca tgctgagtct cagaaaactg ggtctggtga gcaagctcat cttggaaact 841 tggagagagc ccaggctgta aggaagccta aaaagggtcc catcttctat atcaacaacc 901 ctcagaatcc cagggaatgg aatagcctgg agggaggagt ggagaatacc ccataaagat 961 gagtactcca gcataggaat aatgaggccc tcatcccaga tctggacaga ctccaagatt 1021 ctgagacctt ggtgcagcct ccaagtctgg ggtctccact ccatctggca gctgaagtca 1081 ctggaagggg agctgcaggg actcgtgacc ccaaaagaaa cccaacccaa gacaaggtct 1141 ctcattctgg gcacagagga atatccagaa agagagcttc cctttgggaa ctgccaaccc 1201 agagtgaagt tttctaaaca tttccgtcct ctgcaaaagg gattaggagt ctctgagtag 1261 ttgctgctgt cactaaaagg aaaagaactg tggggggaag aggggcaaaa agagagacgg 1321 agagaggggg agaaaggaag gaaagaagga tcacagctct ctccaagatc ccccgtcttt 1381 ggggaactgg gttatctaac tctgtttttc actctgcgtc agcctcttcc atctcactga 1441 aaatgctgtt gttatttttt aataaacaaa ctccaattaa ttcacttgga aagcttcaca 1501 acacccatgg agataagttt ttatgaccct ggggagttag aaaacccaaa ccaagaagca 1561 gtaggaacaa ctatttgcag agaggtttat 
ttgtttttca gagaaaatga catcattttg 1621 gactgaaatg tgtattaatt agaagatctc agtgctgtct gcgtacagag gtgggtggct 1681 gagcaagata ggactgcaac atattaaggg gtgggtcaga gatcatttgt ctattgtgtg 1741 cactgcatac atatttaaca cttctcacac atgtgccaat cactgtcacc ctttcaataa 1801 tatctctttt cattcttttt tttttttttt tttgagacag agtctcgctc tgttgccagg 1861 ctgggtgcag tggcgcgatc tcagctcact gcaacctccg cctcccgggt tcaagcgatt 1921 ctcctgcctc agcctccagt aggtgggtta caggcacgca ccactgcacc cagctaattt 1981 ttgtattttt agtagagaca gggtttcaca catgttggcc agatggtctc catctcttga 2041 cctctggatc caccacctag cctcccaagt gctgggttag cgtgagccac catgcctggc 2101 ctctctttta ttcttacaac aaccctatga agtaggatat tgggccaggc acggtctgca 2161 cgcctgtaat cccagcaatt tgggaggccg agtgggtaga tcacttgagg tcaggagttc 2221 aggaccaacc tggccaacat ggtgaaacct tgtctctatt aaaaatacaa aaattagcca 2281 ggcatggtgg cgcatgcctg tagtcccagc tacttgggag gccgaggcag gagaatcact 2341 tgaacctggg aggcagaggt tgcagtcagc cgagatggca tcactgcact ccagcctggg 2401 caatagagcg agactccgtc tgaaaacaaa taaataaata aaaataaaat ttaaattaaa 2461 ttaaaattta aaaaaaataa aaaataaaat gaagtaggga tattgttccc attttacaga 2521 tgagaaaact gagctacaga aacacagagt gacttgcctg gtacacagta agttaccacc 2581 attcaaggac ctaagttctg gagagggtct gacttggagt ggcaatttct agtgaggccc 2641 tagagtcaga ggagggaagg caaatttgtt cagaaggcag agaattcaag gaaaagggat 2701 ttgagactca ctgggaagat ggaggcaagc agtgggtaga aaatggtgac tttcccccat 2761 gttcctggtt gtaaggacct gagaagaaaa cagagtctgg aagctctgtg ttgaagggaa 2821 tgaagtggta caagtggctg ctctgtccat gagctgagtg tgccacaggg cccggtgtgc 2881 acatgtgcac acctcttccc ggccaggttc gggggcccat gtttggctgg tacaatctca 2941 atggcttctt ttcttttctt ttcttctttt tcttttctct tgcttgcttg cttgcttgct 3001 tgcttgcttg ctttttgaga cagaatctcg ctctgttgcc caggctggag tgcagtgacg 3061 agatctcagc tcactgcaac tttgcttcct ggattcaagt gattctcctg cctcagcctc 3121 ctgagtagct aggttacggg tgcccagaac cacgcccggc taattttttg tatttttagt 3181 agagacgggg tttcaccatg ttggccaggc tggtctcgaa ctcctgacct cgtgatccgc 3241 ctgcctcggc tcccaaagtg ctgggattac aggtgtgagc 
caccgtgcct ggcttacaat 3301 cgcttttttc ctgccagagc ctgaatttgt cacatgcccc cagtgaagca tggctcaggg 3361 catctctaac cctgatgaga ggcttgtttc tggtgggaaa taaaaccctc agtggcctct 3421 tcccagcctc cacactgcat taaaaaatca ggccagcagc ttctatgatc aatactctgc 3481 cttgatctcc aacagaaaga aaaacggcac ttgctcacct caacccaaga agtctaagga 3541 agactcgggc aatccacaaa tcttacactc tagtccatcg atgaaaaggc tgctatctct 3601 cgctgatggg cctggctgtt tgcatctggg cagacccagc cagccagagg gctagccagc 3661 ttggaaaggg gcctggagac atgtgccttc tctcctctga gttgcagctt ctgcagagat 3721 ctgcccgagc tttcagcgtg tcatcgaaac cctcctcatg gacacaccct ccagttatga 3781 ggctgccatg gaacttttca gccctgatca agacatgagg gaggcagggg ctcagctgaa 3841 gaagctggtg gacaccctcc cccaaaagcc cagagaaagc atcattaagc tcatggtaac 3901 cagcaccttt cacgtcacac tggttagaag tggcttcccc ggccgggcgc ggtggctcac 3961 gcctgtaatc ccagcacttt gggaggccga ggcgggcgga tcacgaggtc gggagatcga 4021 ggccatcccg gctaaaacgg tgaaaccccg tctctactaa aaatacaaaa aaattagccg 4081 ggcgtagtgg cgggcgcctg tagtcccagc tacttgggca ggctgaggca ggagaatggc 4141 gtgaacccgg gaggcggagc ttgcagtgag ccgagatccc gccactgcac tccagcctgg 4201 gcgacagagc gagactccgt ctcaaaaaaa aaaaaaaaaa aaacagaagt ggcttcccca 4261 agtggggctg caggattgcc ccagttttca gacctgtttc taatccagag aggagagtca 4321 cagtgccact gtccccaggc aggcagcaca gtgatctttc tagacatctc cttctttttt 4381 tttttttttt ttttgagaca gagtctcgct ctgtcgccca gactagggtg caatagcacg 4441 atcttggctt actgcaacct ccacctccca ggttcaagcg atctccggcc tcagcctctt 4501 gagtagctgg gattacaggc acccaccatc atgccgagct aatttctgta tttttgtaga 4561 gatggggttt caccgtgttt gccaggctgg tctcgaactc ctgacctcag gtgatccacc 4621 cgcctcagcc tcccaaagtg ctggcattaa aggcgtgagc caccacgccc agcctcccct 4681 tactattttg taagaggctt ttgagaaaca atccaagccc ttactacctt agttcctcct 4741 agagttgact gcacctctcg gttaatgttg aagtttctgt ggctcgtcat ctctgcctaa 4801 ctatgcaatt cattcactgt tgtattgggt ttttctgttt ctttgtctat ttgttttagg 4861 aaaaaatagc ccaaagctca ctgtgtaatt agcatttaga agctgaagat ccccaactgc 4921 tccagcctct gccgctgcca tgctttgagt ccacgcccac cagccttgct 
ctcttcaata 4981 aaccacaagc atctc // LOCUS HSU01882 13270 bp DNA PRI 26-NOV-1997 DEFINITION Homo sapiens SS-A/Ro autoantigen 52 kda component gene, complete cds. ACCESSION U01882 NID g499209 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13270) AUTHORS Chan,E.K., Hamel,J.C., Buyon,J.P. and Tan,E.M. TITLE Molecular definition and sequence motifs of the 52-kD component of human SS-A/Ro autoantigen JOURNAL J. Clin. Invest. 87 (1), 68-76 (1991) MEDLINE 91086480 REFERENCE 2 (bases 1 to 13270) AUTHORS Chan,E.K., Di Donato,F., Hamel,J.C., Tseng,C.E. and Buyon,J.P. TITLE 52-kD SS-A/Ro: genomic structure and identification of an alternatively spliced transcript encoding a novel leucine zipper-minus autoantigen expressed in fetal and adult heart JOURNAL J. Exp. Med. 182 (4), 983-992 (1995) MEDLINE 96018798 REFERENCE 3 (bases 1 to 13270) AUTHORS Chan,E.K. TITLE Direct Submission JOURNAL Submitted (16-SEP-1993) Edward K. 
Chan, Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..13270 /organism="Homo sapiens" /specific_host="Homo sapiens" /db_xref="taxon:9606" /clone="clones 33 and 5" /clone_lib="lambda FIXII library (Stratagene #946203)" /germline /sex="male" /tissue_type="placenta" repeat_region complement(30..1050) /standard_name="long interspersed repetitive element" /citation=[2] /rpt_family="L1" repeat_region 1460..2570 /standard_name="long interspersed repetitive element" /citation=[2] /rpt_family="L1" repeat_region 2380..2540 /citation=[2] /rpt_family="MER13" repeat_region 2510..3568 /standard_name="long interspersed repetitive element" /citation=[2] /rpt_family="L1" repeat_region complement(3718..3863) /citation=[2] /rpt_family="MER14" exon 4553..4606 /citation=[2] /function="5'untranslated region" /number=1 repeat_region complement(5663..5954) /standard_name="Alu sequence" /citation=[2] /rpt_family="Alu family, subfamily Sx" repeat_region 6042..7175 /citation=[2] /rpt_family="MER14" repeat_region 6203..6371 /citation=[2] /rpt_family="MER12" repeat_region complement(6281..7721) /standard_name="long interspersed repetitive element" /citation=[2] /rpt_family="L1" exon 7780..8236 /citation=[2] /citation=[1] /function="contains translation start codon at 7829" /number=2 CDS join(7829..8236,8489..8584,9708..9938,11245..11267, 11978..12078,12382..12950) /codon_start=1 /product="52 kda component of SS-A/Ro autoantigen" /db_xref="PID:g665918" /translation="MASAARLTMMWEEVTCPICLDPFVEPVSIECGHSFCQECISQVG KGGGSVCPVCRQRFLLKNLRPNRQLANMVNNLKEISQEAREGTQGERCAVHGERLHLF CEKDGKALCWVCAQSRKHRDHAMVPLEEAAQEYQEKLQVALGELRRKQELAEKLEVEI AIKRADWKKTVETQKSRIHAEFVQQKNFLVEEEQRQLQELEKDEREQLRILGEKEAKL AQQSQALQELISELDRRCHSSALELLQEVIIVLERSESWNLKDLDITSPELRSVCHVP GLKKMLRTCAVHITLDPDTANPWLILSEDRRQVRLGDTQQSIPGNEERFDSYPMVLGA QHFHSGKHYWEVDVTGKEAWDLGVCRDSVRRKGHFLLSSKSGFWTIWLWNKQKYEAGT 
YPQTPLHLQVPPCQVGIFLDYEAGMVSFYNITDHGSLIYSFSECAFTGPLRPFFSPGF NDGGKNTAPLTLCPLNIGSQGSTDY" exon 8489..8584 /citation=[2] /citation=[1] /number=3 repeat_region 9543..9588 /citation=[2] /rpt_type=other /rpt_family="TG dinucleotide repeat" exon 9708..9938 /citation=[2] /citation=[1] /function="contains coiled-coil/leucine zipper domain" /number=4 repeat_region complement(10395..10660) /standard_name="Alu sequence" /citation=[2] /rpt_family="Alu family, subfamily Sp" exon 11245..11267 /citation=[2] /citation=[1] /number=5 exon 11978..12078 /citation=[2] /citation=[1] /number=6 exon 12382..13270 /citation=[2] /citation=[1] /function="contains rfp-like domain and stop codon at 12948" /number=7 BASE COUNT 3641 a 2997 c 3011 g 3621 t ORIGIN 1 tcaattcggc tattgattct tgtgtatgct tcatgaagtt ctcatgctat gtttttcaac 61 tccatcaggt catttatctt cttctctaaa ctgatgattc tagttagcaa ttcctctaac 121 cttttttcaa gtttcttagc ttccttgcat tgggttagaa catgctcctt tagcacagag 181 gagtttctta ttacccacct tctgaagcct acttctgtca attcatcaaa ctcattttcc 241 atccagtttt gttcccttgc tggcaaggag ttgtgattct ttgcaggaga agaggtgtgc 301 tgttttttgg aattttcagc ctttttgcac tggtttttcc tcatcttcat ggatttatct 361 acctttggtc tttgatatcg gtgacctctg gatggggttt ttgtgtggac attctttttg 421 ttgatgttga cgctattgct ttctgtttgt tagttttcct tctaacaggt ccctctgcta 481 caggtctgct ggaggtccac tccagaccct gtttgcctgg gtatcaccag cagaggctgc 541 agaacagcaa agattgctgc ctgttccttc ctctggaagc tttgtcccag aggggcaccc 601 gtcagatgcc aaccagagct ctcctgtatg aggtgtctgt tgaccctcct gggaagtgtc 661 tcccagtctg gaggcacagg ggtcagggca cccacttgag gaggcagtct gtgccttatc 721 agagctcaag cactgtcctg ggagatccac tgctctcttc agagctggca agcaagaatg 781 tttaaggctg ttgaagctgt gctcaccgcc accccttccc ccagatgctc tgtcccaggg 841 agatgggagt tttagctata agcccctgac tggggctgct gcctttcttt caagatgtca 901 tgcccagaga ggaggaatct agagaggcag tctggctaca gcaggtttgc gaagctttgg 961 tgggctccac ccaattcgaa cttcccagca gttttgttta cactgtgagg ggaaaaccgc 1021 ctactcaagc ctcagtaatg gtggatgccc ctcactccac caagctcaag catctcaggt 
1081 caatttcaga ttgctgcact ggcagcaaga atttcaaacc catggatctt agcttgctgg 1141 gctccatggg ggtgggatcc actgagctag accacttggc tccctggctt cagccccctt 1201 tccaagggag tgaacagttc tgtctcgctg gtgttccaga cgccactggg gcatgaaaaa 1261 tcctgccgct agctcagtgt ctgcccaaac agccacccag ttttgtgctt caaacccagg 1321 gtcctggtgg tgtaggcatc caagggaatc tcctggtccg tgggttgcat agaccatggg 1381 aaaagcatag tatctgggct ggatagcacc attccatcag gtcctttaag gacttctctg 1441 cattggttat tctgacctaa aaccacaaaa accctagaag aaaacctagg caataccatt 1501 caggacatag gcatgggcaa ggactacatg tcttaaccac caaaagcaat ggcaacaaaa 1561 gacaaaattg acaaatggga tctaattaaa gagcttctgc acagcaaaag aaactaccat 1621 cagagtgaat gggcaaccta tagaatagga gaaaaatttt gcaatttact catctgacaa 1681 agggctaata tccagaatct acaaagatct caaacaaatt tacaagaaaa aaacaccccc 1741 atcaaaaagt gggcaaagga tatgaacaga cacttctcaa aagaagatat ttatgcagcc 1801 aacagacaca tggaaaaatg ctcatcatca ctggccatca gagaaatgca aatcaaaacc 1861 acaatgagat actatctcac agcagttaga atggcgatca ttaaaaagtc aggaaacaat 1921 gggtgataga caggatatgg agaaacagga gcacttttac actgttggtg ggactgtcaa 1981 cgagttcatc cattgtggaa gacagtatgg tgattcctca aggatctaga actagaaata 2041 ccatttgacc cagccatccc attactgggt atatatccaa aagattataa atcatactga 2101 tataaagaca catgcacaca tttgtttatt gtggcactat tcacaatagc aaagacttgg 2161 aaccaaccca aatgtccaac aatgatagac tggatgaaga aaatgtggca catatacacc 2221 atggaatgct atgcagccat aaaaaaggat gagctcatgt cctttgtagg gacatggatg 2281 aaagtggaaa ccatcattct cagaaaacta ttgcaaggac aagaaaacca aacaccgcat 2341 gttctcactc ataggtggga attgaacaat gagaacacat ggacacagga aggggaacat 2401 cacacactga ggcctgtcct ggggtggggg gaggggggag ggatagcatc aggagatata 2461 tccaatgtaa atgagttaat gggtgcagca caccaacatg gcacgtgtat acatatgtaa 2521 caaacctgca tgttgtgcac atgtacccta gaacttaaag tataatttaa aaaaaaatct 2581 aaggcctcaa actatgaaac tactacaaga aaatattggg gaaactctcc aggacattgg 2641 actgggcaaa aatttctttc aaatattcca taagcacagg caaccaaagc aaaactggac 2701 aaatgagatc acatcaagtt aaaaagcttc tgcacagcaa aggaaacaat gaagtagaga 2761 
gataacccac agaattggag taaatatttg caaactatcc atctgacaag ggattaataa 2821 ccagaatata caaagaactc aaacaactct ataggaaaaa aatataataa tctgattcaa 2881 aaatggacaa acgatctgaa tagacattcc tcagaagaag acatacaagt ggcaaatgga 2941 tatatgaaaa tgtgttcaac accactgatc atcagagaaa tgcaaatcaa aactacgaga 3001 tattatctca cctcagttaa aatgtctttt aaccaaaaga aaggcaataa caaatgctgg 3061 cgaggatatg tagaaaaggg gaccctcata cactgtcaat ggggatggaa attagtacaa 3121 acactatggc aaacaatttg gaggttcctc ccccaaaaaa aaaaaactga aaatagagct 3181 accatgtaat ctagcaatcc tactggatac tgctaggtat atacccagaa gaaatactgt 3241 taggtatata tccaagagaa aggaaatcag tatatcaaag agatatctgc actcccatgt 3301 ttatcgcagc actattcaca atagccaaga tttggaagca acgtaagtgt ccatcaacag 3361 ataaatggat aaggaaagtg tggtacacat acacaatgga gcactattca gccataaaaa 3421 gaaggagaac ctttcatttg ctacaagatg gatggaattg gggattatta tgttaagtaa 3481 aataaaccaa acacaggcaa acaaacaaat tgttatgttc tcacttattt gtgggagtta 3541 aaaattaaaa caatccaact catgaagata gagaggatga tggttaccaa aggctgggaa 3601 gggtagtggg gaatgattaa tgggttcaaa aatacatgag agagaatgaa taagatctag 3661 tattttatag cataacaagg tgaatacagt caatgataat ttatcatttt aaaataacta 3721 aaagcatata actggattgt atataacaca aatgataagt gcttgaggtg atggatagat 3781 accccactta ccctgatatg attattacgg attgtatgct ctacctgtat ccaaatattt 3841 catgtacccc acaaatatat acacctacta tagacacaca agattgaaaa ttaaaaaaat 3901 tactttggta aattttatgt tctatatttt tacgattaaa aataaaaata atataacctg 3961 agaatgtccc tagctcagtg acaagaacac agttggttgt caaatgtcaa tttccttctc 4021 accactcact ggtgggacac tcaccttggg gtcatgcgaa ataaatcact cacctgcagg 4081 cgttaaaaaa tattagtcac atcagtcaca tgaaaataat tagcttctgg aagtaagcag 4141 atggtggcac gcgaggaaga gcaccgtcca ctcttctctg catcaaaact tccttaaaag 4201 catcttccag acatggacat agcgtctagg tgtggagtga tgaccagtgt caaagaaagc 4261 agcccggact cccccgtctt gtagatattt taccaaatca gataatgaag acatgggagg 4321 gagcgcgcaa ccaggaccac gggctactga gtttccactc cttcccagtc tcagagaagc 4381 ctcctgggca tgcagagcct ctgaagcctc ccccagctca actctgactt ctcttctcag 4441 ccgcatttct 
ccacacaccc tttcaaggct ctgctcctcc cactgctcct ccctcactcc 4501 tcccacggct cctcctcctc cctccccttt cctctcagac ttgcttctga gcggaaactg 4561 aaagtgaaat agggagctgg ctaccagcgt tgagtcccct gtaaaggtga gtgaacccca 4621 ctagatcaag ggacccctgc cctcctaggg gacctctctc ccctgcccca gggttgaaga 4681 gccccttcca gtctccatcc ttgagccggt cccatttgcc ccttctccca ccagggatcc 4741 ctcaacccaa gaccagctgc gcttccggtg ctggccccct gcttgggagg aggctggggt 4801 gtggacctga ctgccgagtt acccagaagg ggaaggggcc cggctgcccc ctacctggga 4861 ggaggctggg ttgtggacgt gactgtggag ttacccagaa ggggaagggg cccggctgcc 4921 cccatgctgg cacctgacct aggatgacgg ctgggtgtgg ctgggatagc caggactaca 4981 gcctagccag gagggtaggg agcccagtgt cggggaggtc tgcaattctg gagactccca 5041 ggtccaaaag tcctccagag gtgagagggg caggctgttt aggatgttca accagaaatg 5101 tataggcctt cagaattatt ctttgtccta gaataagtgt tgtcaggaac taaactaagt 5161 ttggagcagc tgagaaaaga gcccagggag tgctggccac ggttcctgtt actctgtgtg 5221 tatacataca tatacaccta catacatata tgcaactgac ttccgccgtt ataggaggga 5281 ttaatgaaga tatattcaaa gcatgacgta tctctggcac atggaaggta catggtaaaa 5341 agtagctata atttctcagc aacatggagg cctgttggta cccgacagga gatggatgag 5401 cactgagggt ttaagcagga atggacatga tgagagtaga actgtcaaca gctttatgag 5461 gtccttgcca attttcattc cctcttttag ggactcagtt tctccttccg tctcactgga 5521 tcaatggacc aaacaatgga ggagttcttg ttattcagaa caggaataaa cttattcaac 5581 aaactgtttg atcggtgcaa aagtaattac tgtttttgac attaattgtt ttgccattaa 5641 ttttatgcct tgttttgttt tgttttgttt gagagaggga gtctcgccgt gtcgcccagg 5701 ctggagtgca atggcacaat cttggctcac tgcaacctct gcctcccagg ttcaagcaat 5761 tctcctgcct cagcctccca agtagctggg attacaggtg cccaccacca ctcccagcta 5821 atttgtgtat ttttagtaga gacagggttt taccatattg gtcaggctgg tcttgaactc 5881 ctgacctaag gtgatccacc cacctcggcc tcccaaagtg ctgggattac aggtgtgagc 5941 caccgtgcct gacccattat ttttattctt attttttatt ttcctttgca tcattttgtt 6001 tttctttttt tagtatattt ttaatttttg tgggcacata gtaggtatat ttatgggtat 6061 atggaatatt gtgatacagg tatacaatgt gtaataatca catcagggta agtggggcat 6121 ctatcacctc aagcattgat 
catctgtaat acaaactatg caattatact cttattttta 6181 atgtacaagt aaattattgg ctatagtcac cctgttgtgc tatcaaatac tagatctaat 6241 tcactccact tttgtaccca ttaaccatcc catcttctcc cccacacaca gaccctccct 6301 attactcttc ccagcctctg gtaaccatct ttctattatc taactccatg agttcaattg 6361 ttttaatttt tagcttctac aaatgagtga gaccatacaa tgtttgtctt tctgtgcctg 6421 gcttatttca cctaacataa tgacctccag ttccattcat gttgttgtga atgacaggat 6481 ctcattatat tttatgactg aacagtactc cattgtgtat acataccaca ttttctttat 6541 ccatatatct gttgatggac tcttagattg cttccaaatc ttggctgttg tgaaacgtgc 6601 tgcagtaaac atggaagtgt ggatatctct ttgatatact gatttccttt ctttggggta 6661 tatgctagca gtgggattga tggatcatat ggtagctcta tcttcagtat tttgaagaac 6721 ctccaaactg ttctctgtag tgattgtact aatttacctt cccaccaaca gcatatgagt 6781 gtaccctttt ctccacaccc tcaccagcat ttgttatagc ctgtcttttg gagaaaagac 6841 attttaactg agatgagatt atatctcatt gtagttttga tttgcatttt tctgataatc 6901 aatgatgttg agcaactgtt catatgcctg tttgtcattt gtatgacttc ttttgagaaa 6961 tgtctgttca gatcttttgt ccatttttga gtcagattat tatttttttt cctatagagt 7021 tgtttgagct ccttatatat tctggttatt catcccttgt cagatggata gtttacaaat 7081 atttactcca attctgtggg ctgtctcttc actttgtgga ttgtttcttt tgctgtgcag 7141 caccttttca acatgatgtg atcccatttg tccaagtttg ctttggctgc ctgtgcttgt 7201 gggatatttg aaagaaatct ttgccagtcc aatgtcctag agagttttcc caatgttttc 7261 ttgtaatagt ttcatagttt gaggccttag atttaagtct ttaatccatt ttgatttgat 7321 ttctgtatac ggtgagacat aggagtgtag tttcattctt ctgcatgtgg atatccagtt 7381 ttcccagcag catttattga agagactgtc ctttcctcac tgtatgttct tggtagcttt 7441 gttgaaaatg tgctcactgt ggatgtacag atttgtttct gggttcttta ttctactggt 7501 ctatgtgtct gtttttatgc cagtaccatg ctgtttgggt tactttagct ctacagaata 7561 atttgacgtc agatgatgtg attcctccag ttttgttctt tttgctcagg atagctttgg 7621 ctattctgag tcttttgtga ttccatataa attttaggat tgttttttcc tatttttgtg 7681 aagaatgtca ttggtatttt gatagggatt atgttgactt tttgtttttc ttacacagac 7741 tttctcatat tctcttttcc ctttcttctt ctctcccagc caaaccccct aaaggtctcc 7801 acactgctgt ttaacggcac acttgacaat 
ggcttcagca gcacgcttga caatgatgtg 7861 ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga gcatcgagtg 7921 tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg gcagcgtctg 7981 tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac agctagccaa 8041 catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac agggggaacg 8101 gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga aggccctttg 8161 ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc ttgaggaggc 8221 tgcacaggag taccaggtga ggcctaagag acacctggtg agtgcttcgt tttcagagca 8281 gggaatggga gaggaccacc tctggattgg aggggttaga gaaaggaggg gtttacctct 8341 ctccatgtct aatgtaggag gagaattata agttaaaccc aacctcattc cccaggcgta 8401 ggagatatgc atgagaaaat gctgcaagga ttaccccacc ctattacttg gtttgcatgg 8461 gggacgaata agctgtcttt ctctgcagga gaagctccag gtggcattag gggaactgag 8521 aagaaagcag gagttggctg agaagttgga agtggaaatt gcaataaaga gagcagactg 8581 gaaggtaaga atgacatcct gaaggagatc ttaggctgga aggctgggca gggtccagag 8641 tcctcaactc acagccttaa ccgtagctgt gcctctctga gcagtgatag agttggtccg 8701 tctcactctc cagaaaccag ctgctctcta ggtttacaaa tagaggggaa taaaaatgat 8761 gcaggttcaa aggggagctc cccatcaggc ctgaagatca agaatcctgg aagaaggtgc 8821 tgagaggatc tagctatacc ctaaagtttc gagctcctct gagtctagtc tcagaggtgt 8881 gatggcgtca gcacctggat gtgtgagaca aatatcctag tagaaaccca actgctagct 8941 aagcctttca ggaaatgaca agggctgtga ggaataaact taaggaaacc atctgaccgg 9001 gtggttactt gaccatgacc agatctctgg tccaaaatat aatgaagaat caatgtttgt 9061 tctgcactga tgatttccta ggagaggtgg ggcatgtcag ggatgagagg ggagaagaag 9121 gagcccatcc ccagtagatc acagaagagg aagatttggg attgaggtag ggtgtgagtt 9181 gggttgtggt gacaaaacag accctcatgt gtcatcgagt gaccttccag ttactaacag 9241 gagaggaaaa gcaaagaatg tccccaagcc ccctttccca ggacatagag gtggtggcgt 9301 gagcctggct ttgtgaggtt gttggaggtg gggacaggtg atttttttct cagtaactca 9361 ctgagacagg aagacacagg ggcacatttg agtggtgact gggcttggaa gagaggttgg 9421 aaaaagtccc caatgccata tattcgtgta gctgtcgttg taggccagct acagtgtgcc 9481 ctcaccaagt ccattcccct cactgtccca gaaacctcaa 
ccacctacct cccaagcaaa 9541 actgtgtgtg tgtgtgtgtg cgcgcgtgtg tgtgtgcatg tgtgtgtgtt ttcccagaca 9601 gcacagagtg gagtctcttg gggataggaa taggtaccaa tacctccatt cccaaagctt 9661 gggagatttt cataaaaacc acctctccct ttcccttgac actaaagaaa acagtggaaa 9721 cacagaaatc taggattcac gcagagtttg tgcagcaaaa aaacttcctg gttgaagaag 9781 aacagaggca gctgcaggag ctggagaagg atgagaggga gcagctgaga atcctggggg 9841 agaaagaggc caagctggcc cagcagagcc aggccctaca ggagctcatc tcagagctag 9901 atcgaaggtg ccacagctca gcactggaac tgctgcaggt gagacaggga ggggttccct 9961 tctacaattc agggaataac tgaaaaagac cagagctatc tggaactgcc atttgaataa 10021 atggacacct ggtccctgga atgttcttaa atgaagtaga attcaaaccc atgaaaggtt 10081 cacgttcaag acgcaaagta aagatgaaaa taagtacagg acctttttaa aaatcaattt 10141 atgttgcact tagaatcatc ttgcatccac caatgatatg tttgcctcac tgtaggaaac 10201 tatggactag atggtctcac aagtcccttc tgttctaaca ttctacagct cttttcccac 10261 acctgttcag tgtcacgcag agtggtcatg agccattgct ttcaggtttg tgcagaaaac 10321 ctaattgtat tgggttgact tgacaggctc tcttctgccc aacattgaac caggctttct 10381 atttatttat ttatttattt atgagacaga gtttcactct tgttgcccag gctggagtgc 10441 aatggtgtga tcttggctca ctgcaacctc cgcctcccgg gttcaagcga ttctcctgcc 10501 tcagcctcac atcaccacac ctggctaatt ttgcatccct agtagagatg ggatttctcc 10561 atgttggtca ggctggtctc aaactcccaa ccgtaggtga tctgcccccc tcagcctccc 10621 aaagtgctgg gattacagga gtgagccact acacctgtct gtaaaccagg ctttccaaca 10681 cacttgtgac tgagtctagg agcctgctag ctcccatgta atggcgccta atccctacga 10741 caaaaatatt ctcagtgatt tggtttcaca tacaggaaga ggcaacatta agtacccaag 10801 ggagctttaa ctaaccaacc tttccttccc aaacaatgct agcccacaag tggaatagag 10861 tggtttctgg acacctgaac tcggccattc tctgcccgtg attgtttctg ggcttaagcc 10921 aagctgggct atgtgccctc tctgccctcc tttctttctc ccctggggaa aaaaaaaaaa 10981 aaactggtct ttgagtcaga gtcctgaaac tgataaccct tccctagcca taggtcacat 11041 ccatgctggg cttctggtta cctcacaagc agccacacat cctttctgct ttgagctggt 11101 cagaatcatc tctggggcct aggaccagag gaaggagagg aacagtgccc cagggaccaa 11161 agatggatca catcacccac agggctgatg 
tctcagtggc gatccagcaa tccagaactt 11221 actttctgtc tcttttctcc tcaggaggtg ataattgtcc tggaaaggta aggaggagtt 11281 ttctttgtta gaagaggggc cagcagaaag catatatgcc tatgacaaag gtgattgaat 11341 tagaagtaca tgcactgaag ttttccccat ctggcttcct attctcaaaa acttcattct 11401 ctataaaata ttttctggtc tcagaaagct aataaatgga catctcaatt ggtgattgcc 11461 cacagtagtt ggatcataac atcttcacac agtaaagtcc tgccaaacag ggaaatcctg 11521 ggaagttctg ggtcttcccg gaattcagac tactgggaca tggcctgctg ggactccttt 11581 tctcagtgga atctaaccag ttacctccca aaaagagcca tgagtaagac ccacagtgtg 11641 ggcaagactc cttgaggttc cttgcagaag tgattcagcc tgtctcagat tgcttgccta 11701 taaaatgaaa ttgaatgcat ttgccaggtg acctcaggct tagacaggag agacagactg 11761 tccacatgct gcagagcctc ctataaccag gaactcctgg gtagaaacta aatcctgcat 11821 tgtttgcctc tcacacccac tcctggagac ttgaacacca gaggtctttg taaatttgag 11881 ttgaattaaa ttatcggagt cctccacaat ggaggttata gtgttagggt ttgggatagg 11941 agtaggagac aggagtctca aactctcttt cccccaggag tgagtcctgg aacctgaagg 12001 acctggatat tacctctcca gaactcagga gtgtgtgcca tgtgccaggg ctgaagaaga 12061 tgctgaggac atgtgcaggt gaggcaagtt ctagttttgc gggggataat ggggtgcaga 12121 gtagatccca gggtcaggga gcctggatgg caacttggag gagagatggc aggtcagagc 12181 agggggaaca gagatggagg taaggaagat ggtttcttca gaggtcagga ccaaggccag 12241 aactggctga tggtcatttc ctcacacagg gaggttcacc cctcatgctt accctggagt 12301 ttacacaaaa tccccccacc acaggcacag acttagtgaa ctccccccat gcaaggcctg 12361 actgtggtcc tctctctgca gtccacatca ctctggatcc agacacagcc aatccgtggc 12421 tgatactttc agaagatcgg agacaagtga ggcttggaga cacccagcag agcatacctg 12481 gaaatgaaga gagatttgat agttatccta tggtcctggg tgcccagcac tttcactctg 12541 gaaaacatta ctgggaggta gatgtgacag gaaaggaggc ctgggacctg ggtgtctgca 12601 gagactctgt gcgcaggaag gggcactttt tgcttagttc caagagtggc ttctggacaa 12661 tttggttgtg gaacaaacaa aaatatgagg ctggcaccta cccccagact cccctccacc 12721 ttcaggtgcc tccatgccaa gttgggattt tcctggacta tgaggctggc atggtctcct 12781 tctacaacat cactgaccat ggctccctca tctactcctt ctctgaatgt gcctttacag 12841 gacctctgcg 
gcccttcttc agtcctggtt tcaatgatgg aggaaaaaac acagcccctc 12901 taaccctctg tccactgaat attggatcac aaggatccac tgactattga tggctttctc 12961 tggacactgc cactctcccc attggcaccg cttctcagcc acaaaccctg cctcttttcc 13021 ccatgaactc tgaaccacct ttgtctctgc agaggcatcc ggatcccagc aagcgagctt 13081 tagcagggaa gtcacttcac catcaacatt cctgccccag atggctttgt gattccctcc 13141 agtgaagcag cctccttata tttggcccaa actcatcttg atcaaccaaa aacatgtttc 13201 tgccttcttt atgggactta agtttttttt ttctcctctc catctctagg atgtcgtctt 13261 tggtgagatc // LOCUS HSU04636 9453 bp DNA PRI 21-DEC-1994 DEFINITION Human cyclooxygenase-2 (hCox-2) gene, complete cds. ACCESSION U04636 NID g496975 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9431) AUTHORS Appleby,S.B., Ristimaki,A., Neilson,K., Narko,K. and Hla,T. TITLE Structure of the human cyclo-oxygenase-2 gene JOURNAL Biochem. J. 302, 723-727 (1994) MEDLINE 95031910 REFERENCE 2 (bases 1 to 9453) AUTHORS Hla,T. 
TITLE Direct Submission JOURNAL Submitted (21-DEC-1993) Timothy Hla, Molecular Biology, Holland Laboratory, American Red Cross, 15601 Crabbs Branch Way, Rockville, MD 20855, USA FEATURES Location/Qualifiers source 1..9453 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23B and 9C" /clone_lib="EMBL3 genomic library" /tissue_type="placenta" /cell_type="endothelium" mRNA join(832..1017,1818..1934,2055..2198,2852..2995, 3426..3607,4340..4423,4543..4789,5072..5358,5860..6007, 6494..9419) 5'UTR 832..965 gene join(966..1017,1818..1934,2055..2198,2852..2995, 3426..3607,4340..4423,4543..4789,5072..5358,5860..6007, 6494..6903) /gene="hCox-2" CDS join(966..1017,1818..1934,2055..2198,2852..2995, 3426..3607,4340..4423,4543..4789,5072..5358,5860..6007, 6494..6903) /gene="hCox-2" /EC_number="1.14.99.1" /codon_start=1 /product="cyclooxygenase-2" /db_xref="PID:g496976" /translation="MLARALLLCAVLALSHTANPCCSHPCQNRGVCMSVGFDQYKCDC TRTGFYGENCSTPEFLTRIKLFLKPTPNTVHYILTHFKGFWNVVNNIPFLRNAIMSYV LTSRSHLIDSPPTYNADYGYKSWEAFSNLSYYTRALPPVPDDCPTPLGVKGKKQLPDS NEIVEKLLLRRKFIPDPQGSNMMFAFFAQHFTHQFFKTDHKRGPAFTNGLGHGVDLNH IYGETLARQRKLRLFKDGKMKYQIIDGEMYPPTVKDTQAEMIYPPQVPEHLRFAVGQE VFGLVPGLMMYATIWLREHNRVCDVLKQEHPEWGDEQLFQTSRLILIGETIKIVIEDY VQHLSGYHFKLKFDPELLFNKQFQYQNRIAAEFNTLYHWHPLLPDTFQIHDQKYNYQQ FIYNNSILLEHGITQFVESFTRQIAGRVAGGRNVPPAVQKVSQASIDQSRQMKYQSFN EYRKRFMLKPYESFEELTGEKEMSAELEALYGDIDAVELYPALLVEKPRPDAIFGETM VEVGAPFSLKGLMGNVICSPAYWKPSTFGGEVGFQIINTASIQSLICNNVKGCPFTSF SVPDPELIKTVTINASSSRSGLDDINPTVLLKERSTEL" 3'UTR 6904..9419 polyA_signal 8830..8835 polyA_signal 9142..9147 polyA_signal 9389..9394 polyA_site 9419 BASE COUNT 2937 a 1716 c 1710 g 3090 t ORIGIN 1 gagctcacat taactattta cagggtaact gcttaggacc agtattatga ggagaattta 61 cctttcccgc ctctctttcc aagaaacaag gagggggtga aggtacggag aacagtattt 121 cttctgttga aagcaactta gctacaaaga taaattacag ctatgtacac tgaaggtagc 181 tatttcattc cacaaaataa gagtttttta aaaagctatg tatgtatgtg ctgcatatag 241 agcagatata cagcctatta agcgtcgtca ctaaaacata aaacatgtca gcctttctta 
301 accttactcg ccccagtctg tcccgacgtg acttcctcga ccctctaaag acgtacagac 361 cagacacggc ggcggcggcg ggagagggga ttccctgcgc ccccggacct cagggccgct 421 cagattcctg gagaggaagc caagtgtcct tctgccctcc cccggtatcc catccaaggc 481 gatcagtcca gaactggctc tcggaagcgc tcgggcaaag actgcgaaga agaaaagaca 541 tctggcggaa acctgtgcgc ctggggcggt ggaactcggg gaggagaggg agggatcaga 601 caggagagtg gggactaccc cctctgctcc caaattgggg cagcttcctg ggtttccgat 661 tttctcattt ccgtgggtaa aaaaccctgc ccccaccggg cttacgcaat ttttttaagg 721 ggagaggagg gaaaaaattt gtgggggggt acgaaaaggc ggaaagaaac agtcattcac 781 atgggcttgg ttttcagtct tataaaaagg aaggttctct cggttagcga ccaattgtca 841 tacgacttgc agtgagcgtc aggagcacgt ccaggaactc ctcagcagcg cctccttcag 901 ctccacagcc agacgccctc agacagcaaa gcctaccccc gcgccgcgcc ctgcccgccg 961 ctcggatgct cgcccgcgcc ctgctgctgt gcgcggtcct ggcgctcagc catacaggtg 1021 agtacctggc gccgcgcacc ggggactccg gttccacgca cccgggcaga gtttccgctc 1081 tgacctcctg ggtctatccc agtactccga cttctctccg aatagagaag ctacgtgact 1141 tgggaaagag cttggaccgc tagagtccga aagaactccg tggatattcc agctttccca 1201 caagcactga tcattatgag ccagttactt aaccgatctg agacactctc acctcctaaa 1261 tagggataga tgatactaat ttgcaggttg tcattatgat aagacaggat ctgatcaata 1321 tatgtgaatt gtttatattt ggaacctttt tattgagtgg aagaagttgt tttaaatatt 1381 ctagtcagtt ctttcctgct cccaggaaag cccggattat gttttaagat aagcaaaatg 1441 tcttaaaagt aagctgtttt actttgaatt tttccctaaa tgttgattag tgtactagat 1501 ccattttaat ttggaaagtg aagtgctact tatttgaact tcttaaaaat gctaatttta 1561 acatctaaag agttaactaa gaaaagctta gtaacatgat gtaccaagtt gaatatgctg 1621 ttatccttat ttagaataga aaattggtat ttctacgttt tatccattct aaggcaggtt 1681 aaaaaattgt atttccatga ctacctatat atttcttgaa tttattattg taaagttgat 1741 tcatagtcaa acaattaaat gtttaaatta agattaagac actagagaat gatttatttg 1801 ctgtccttta attgcagcaa atccttgctg ttcccaccca tgtcaaaacc gaggtgtatg 1861 tatgagtgtg ggatttgacc agtataagtg cgattgtacc cggacaggat tctatggaga 1921 aaactgctca acacgtaagt ttgtcctttg gttgcctcat taggagtggg gctggataca 1981 gttatcattg 
tatagatttg tgtcttataa tgagtcccat taatttctcc ctccctttct 2041 tcgtcttctt gcagcggaat ttttgacaag aataaaatta tttctgaaac ccactccaaa 2101 cacagtgcac tacatactta cccacttcaa gggattttgg aacgttgtga ataacattcc 2161 cttccttcga aatgcaatta tgagttatgt cttgacatgt aagtacaagt gtctttctaa 2221 ggtttttagc cttctcaaag aaaaatatgc tttataatac tgtaagccta atctaaaaac 2281 atatttccaa gcttatcaaa aagactttaa gatagctttt aagtttgcct tccatcttaa 2341 tcgccaaaaa tattgacatt tagtcccatc cagtttatac agtctgctca caactctgta 2401 tacctcttct aacctttact gtttggtcag tttgtggagg tagcatggtc cagctgttta 2461 ttgaatgccc atgggccaca gaattgttct gaacatgtag cacccattaa aataaatttg 2521 gatttggatc agcaagaaaa taactttcca tgattctaaa gtgggtgcca tactcagcca 2581 ttcctttcat aggcctcttg gatagtgagc agatggctac ctgaaaaatc aatattgcca 2641 gattataatg tgcagagtat atgtatttta ttaaagatgt atttcaagtg gccattagac 2701 tataaagtgt agttgtttaa aaatagattt tttttatttt ggagttacat tcaacctcag 2761 gtgccacttt ccacatttta caataaaaat aatggttgat ttacttaaca aatgagaata 2821 aataaaacat ttttttcttt gaaaatttca gccagatcac atttgattga cagtccacca 2881 acttacaatg ctgactatgg ctacaaaagc tgggaagcct tctctaacct ctcctattat 2941 actagagccc ttcctcctgt gcctgatgat tgcccgactc ccttgggtgt caaaggtgag 3001 taagaagaat ccattagaga tgtattaact ataagacggg ctgcattgct gccaaaaaaa 3061 aaaattgacc ttagactacc atttatttat taacaaaagc agtttttact tttagcatgg 3121 ttatctatgg gtatttttta aagtatgagt ctatataaac tattatgtaa aagcaaatga 3181 gcgtcttggt ataatgtctt aatattttca aattatttct ttagaaatga aataattcta 3241 attaaaatag ataaaatcat tcagtaagaa gttgttccac catatcttag aactgttgtt 3301 tatattatga tcctattcac aattgtaatt ctcatataaa tgaagaattc ttggtagatt 3361 gacagtcacc atctcctttc ttgaatacat agatggattc ttaccttagc tttctcattt 3421 ttcaggtaaa aagcagcttc ctgattcaaa tgagattgtg gaaaaattgc ttctaagaag 3481 aaagttcatc cctgatcccc agggctcaaa catgatgttt gcattctttg cccagcactt 3541 cacgcatcag tttttcaaga cagatcataa gcgagggcca gctttcacca acgggctggg 3601 ccatggggta agatagagtt aatatcttag agttagtaaa attataccaa atcatagtca 3661 agggctaaca ttaaaggaga 
tatacagata gatagatcca aataacttat ccactttttt 3721 taaaaagaag tcttatctat aaaaacctta aaggaatttt ccatttactt cactggtcta 3781 gtaaaattat acacacacac agacatgcac acacatatat aaacattcac acacatacat 3841 atgtacaggt attgttattt gtaatttgac ccttgtattt tttagtttaa aatgttagta 3901 ctgcaaaatg ttatgtcctc aaaaacacat tgtaccatga ttatgccgct ttcaatattg 3961 taaagtgagg tttttgccgc attattattt tttggatttc aatagcatag cttcaagtta 4021 ttcgtaagaa ttttttataa ataatacatt tttatacttt tttataatta ccatatcatc 4081 atagtgaagt atataatata tatgatataa gctcaatata gtatattaat tccgttaaac 4141 acaaagacat atcagtttgt agctttggtg gataaacaaa ttaatttagc aattcatggc 4201 tatgaaaaat gtatatttta tttaaaaatt ttaaagaaag ctaaatgatc aaattattta 4261 atgatgaatt atatgataga cactttatat aagaaaaact tcaacagcaa caaattaaaa 4321 ttttttcatc attttctagg tggacttaaa tcatatttac ggtgaaactc tggctagaca 4381 gcgtaaactg cgccttttca aggatggaaa aatgaaatat caggtatgct tcctttgact 4441 attaagactt agttattacc gcttataccc atattttaaa atccctaaaa atgtgttcct 4501 taacttttta actgatgttt atttatttat ttattttttt agataattga tggagagatg 4561 tatcctccca cagtcaaaga tactcaggca gagatgatct accctcctca agtccctgag 4621 catctacggt ttgctgtggg gcaggaggtc tttggtctgg tgcctggtct gatgatgtat 4681 gccacaatct ggctgcggga acacaacaga gtatgcgatg tgcttaaaca ggagcatcct 4741 gaatggggtg atgagcagtt gttccagaca agcaggctaa tactgatagg taaacaagaa 4801 aatgatttat ataaaaccct cttccccagg gaaaattagt gtgctatctt tgttatgttt 4861 tgagtaaatg acaagatgtg gtaaatgaaa actcacacat tctatataca ttaaatatgt 4921 aagcatgact gataaaatag ctatcttttg atactgacaa ggaagaaaac agaaatgaag 4981 gaatagcaaa ttttaaaaat tgcattccag ttgcttgaaa gcttgtgatc agatgcaata 5041 aatgttttta ttatttattt tgtgcaaata ggagagacta ttaagattgt gattgaagat 5101 tatgtgcaac acttgagtgg ctatcacttc aaactgaaat ttgacccaga actacttttc 5161 aacaaacaat tccagtacca aaatcgtatt gctgctgaat ttaacaccct ctatcactgg 5221 catccccttc tgcctgacac ctttcaaatt catgaccaga aatacaacta tcaacagttt 5281 atctacaaca actctatatt gctggaacat ggaattaccc agtttgttga atcattcacc 5341 aggcaaattg ctggcagggt aagcattatt 
attgaaaacc aaaacaaaag actagtcagt 5401 aactttagaa tttctgccac ggaaattatt tttcttaaac ttactaaaag agtagttagt 5461 tatattgcta gtaaaattat tttattgata taagaagcct aactttgttt gaaaagtcta 5521 aacttttagt ctagtctaca gttgtcagac aaatagcaaa ttgtacccct accttaaaaa 5581 tattttcaaa aagtatctat aatcttatag gaataaatat tttaggcttg aatactagtg 5641 ttatttttga aatgtaaaaa ggcaaattag ttctaggctg gtgtcccatt gaattttaag 5701 cagagctcct gttgaaatgt aggtaagcat ctttccagca aataaaaatt gtctccgctg 5761 ggagtttcag ttttacctga tttgtaccta aggcaagctg aatacaaaca gtaaatatgc 5821 ctaaaattct tgttttacaa ctaattttac tttccacagg ttgctggtgg taggaatgtt 5881 ccacccgcag tacagaaagt atcacaggct tccattgacc agagcaggca gatgaaatac 5941 cagtctttta atgagtaccg caaacgcttt atgctgaagc cctatgaatc atttgaagaa 6001 cttacaggta agaaacagtt tctaaacttc ttcgtttttt gtttgtttgt ttgtttttgt 6061 tgtttttggt tttcttttcg agatggagcc gccctctgtc acccaggctg gagtgcagtg 6121 gcgccatctc ggctcactgc aacctccgcc tcctgggttc aagcaattct cctgcctcaa 6181 cttcctgagt agctgggact acaggctcac gtcgcacgca tggataattt tttgtatttt 6241 cagtatagac ggggtttcac cgtgttagcc aggctggtct caaactcctg acctagtgat 6301 ccgccggctt cggcctcccg aagtgctggg attacaggcg tgagccaccg cgcctggccc 6361 ctaaacttct taaaagaatc aggggtcaaa tggaaacaga gaagttggca gcaaattgag 6421 caaaagaatc aaactgtttt ttattttgtg aagtttgaca ttggttgtat ctctgtcttc 6481 atcgccttca caggagaaaa ggaaatgtct gcagagttgg aagcactcta tggtgacatc 6541 gatgctgtgg agctgtatcc tgcccttctg gtagaaaagc ctcggccaga tgccatcttt 6601 ggtgaaacca tggtagaagt tggagcacca ttctccttga aaggacttat gggtaatgtt 6661 atatgttctc ctgcctactg gaagccaagc acttttggtg gagaagtggg ttttcaaatc 6721 atcaacactg cctcaattca gtctctcatc tgcaataacg tgaagggctg tccctttact 6781 tcattcagtg ttccagatcc agagctcatt aaaacagtca ccatcaatgc aagttcttcc 6841 cgctccggac tagatgatat caatcccaca gtactactaa aagaacgttc gactgaactg 6901 tagaagtcta atgatcatat ttatttattt atatgaacca tgtctattaa tttaattatt 6961 taataatatt tatattaaac tccttatgtt acttaacatc ttctgtaaca gaagtcagta 7021 ctcctgttgc ggagaaagga gtcatacttg tgaagacttt 
tatgtcacta ctctaaagat 7081 tttgctgttg ctgttaagtt tggaaaacag tttttattct gttttataaa ccagagagaa 7141 atgagttttg acgtcttttt acttgaattt caacttatat tataagaacg aaagtaaaga 7201 tgtttgaata cttaaacact atcacaagat ggcaaaatgc tgaaagtttt tacactgtcg 7261 atgtttccaa tgcatcttcc atgatgcatt agaagtaact aatgtttgaa attttaaagt 7321 acttttggtt atttttctgt catcaaacaa aaacaggtat cagtgcatta ttaaatgaat 7381 atttaaatta gacattacca gtaatttcat gtctactttt taaaatcagc aatgaaacaa 7441 taatttgaaa tttctaaatt catagggtag aatcacctgt aaaagcttgt ttgatttctt 7501 aaagttatta aacttgtaca tataccaaaa agaagctgtc ttggatttaa atctgtaaaa 7561 tcagatgaaa ttttactaca attgcttgtt aaaatatttt ataagtgatg ttcctttttc 7621 accaagagta taaacctttt tagtgtgact gttaaaactt ccttttaaat caaaatgcca 7681 aatttattaa ggtggtggag ccactgcagt gttatctcaa aataagaata ttttgttgag 7741 atattccaga atttgtttat atggctggta acatgtaaaa tctatatcag caaaagggtc 7801 tacctttaaa ataagcaata acaaagaaga aaaccaaatt attgttcaaa tttaggttta 7861 aacttttgaa gcaaactttt ttttatcctt gtgcactgca ggcctggtac tcagattttg 7921 ctatgaggtt aatgaagtac caagctgtgc ttgaataacg atatgttttc tcagattttc 7981 tgttgtacag tttaatttag cagtccatat cacattgcaa aagtagcaat gacctcataa 8041 aatacctctt caaaatgctt aaattcattt cacacattaa ttttatctca gtcttgaagc 8101 caattcagta ggtgcattgg aatcaagcct ggctacctgc atgctgttcc ttttcttttc 8161 ttcttttagc cattttgcta agagacacag tcttctcatc acttcgtttc tcctattttg 8221 ttttactagt tttaagatca gagttcactt tctttggact ctgcctatat tttcttacct 8281 gaacttttgc aagttttcag gtaaacctca gctcaggact gctatttagc tcctcttaag 8341 aagattaaaa gagaaaaaaa aaggcccttt taaaaatagt atacacttat tttaagtgaa 8401 aagcagagaa ttttatttat agctaatttt agctatctgt aaccaagatg gatgcaaaga 8461 ggctagtgcc tcagagagaa ctgtacgggg tttgtgactg gaaaaagtta cgttcccatt 8521 ctaattaatg ccctttctta tttaaaaaca aaaccaaatg atatctaagt agttctcagc 8581 aataataata atgacgataa tacttctttt ccacatctca ttgtcactga catttaatgg 8641 tactgtatat tacttaattt attgaagatt attatttatg tcttattagg acactatggt 8701 tataaactgt gtttaagcct acaatcattg attttttttt gttatgtcac 
aatcagtata 8761 ttttctttgg ggttacctct ctgaatatta tgtaaacaat ccaaagaaat gattgtatta 8821 agatttgtga ataaattttt agaaatctga ttggcatatt gagatattta aggttgaatg 8881 tttgtcctta ggataggcct atgtgctagc ccacaaagaa tattgtctca ttagcctgaa 8941 tgtgccataa gactgacctt ttaaaatgtt ttgagggatc tgtggatgct tcgttaattt 9001 gttcagccac aatttattga gaaaatattc tgtgtcaagc actgtgggtt ttaatatttt 9061 taaatcaaac gctgattaca gataatagta tttatataaa taattgaaaa aaattttctt 9121 ttgggaagag ggagaaaatg aaataaatat cattaaagat aactcaggag aatcttcttt 9181 acaattttac gtttagaatg tttaaggtta agaaagaaat agtcaatatg cttgtataaa 9241 acactgttca ctgttttttt taaaaaaaaa acttgatttg ttattaacat tgatctgctg 9301 acaaaacctg ggaatttggg ttgtgtatgc gaatgtttca gtgcctcaga caaatgtgta 9361 tttaacttat gtaaaagata agtctggaaa taaatgtctg tttatttttg tactatttaa 9421 aaaaaaaaaa aaaaatcgat gtcgactcga gtc // LOCUS HSU05259 5670 bp DNA PRI 04-AUG-1994 DEFINITION Human MB-1 gene, complete cds. ACCESSION U05259 NID g452561 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5641) AUTHORS Ha,H., Barnoski,B.L., Sun,L., Emanuel,B.S. and Burrows,P.D. TITLE Structure, chromosomal localization, and methylation pattern of the human mb-1 gene JOURNAL J. Immunol. 152 (12), 5749-5757 (1994) MEDLINE 94267180 REFERENCE 2 (bases 670 to 781; 2385 to 2684; 2926 to 3044; 4059 to 4128; 4256 to 4758) AUTHORS Ha,H.J., Kubagawa,H. and Burrows,P.D. TITLE Molecular cloning and expression pattern of a human gene homologous to the murine mb-1 gene JOURNAL J. Immunol. 148 (5), 1526-1531 (1992) MEDLINE 92166394 REFERENCE 3 (bases 1 to 5670) AUTHORS Burrows,P.D. TITLE Direct Submission JOURNAL Submitted (20-JAN-1994) Peter D. 
Burrows, University of Alabama at Birmingham, Microbiology, UAB Station, Birmingham, AL 35294, USA FEATURES Location/Qualifiers source 1..5670 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HMB1" /clone_lib="946205-Stratagene, La Jolla, CA" /sex="male" /tissue_type="placenta" /dev_stage="fetus" /map="19q13.2-13.3" /chromosome="19" promoter 1..666 protein_bind 287 /bound_moiety="early B cell factor (EBF)" protein_bind 476 /bound_moiety="early B cell factor (EBF)" protein_bind 612 /bound_moiety="SP1" protein_bind 621 /bound_moiety="Ets-1" protein_bind 628 /bound_moiety="SP1" mRNA join(<667..781,2385..2684,2926..3044,4059..4127, 4255..4758) exon <667..781 /number=1 gene join(703..781,2385..2684,2926..3044,4059..4127,4255..4368) /gene="MB-1" CDS join(703..781,2385..2684,2926..3044,4059..4127,4255..4368) /gene="MB-1" /codon_start=1 /db_xref="PID:g521105" /translation="MPGGPGVLQALPATIFLLFLLSAVYLGPGCQALWMHKVPASLMV SLGEDAHFQCPHNSSNNANVTWWRVLHGNYTWPPEFLGPGEDPNGTLIIQNVNKSHGG IYVCRVQEGNESYQQSCGTYLRVRQPPPRPFLDMGEGTKNRIITAEGIILLFCAVVPG TLLLFRKRWQNEKLGLDAGDEYEDENLYEGLNLDDCSMYEDISRGLQGTYQDVGSLNI GDVQLEKP" intron 782..2384 /number=1 exon 2385..2684 /gene="MB-1" /number=2 intron 2685..2925 /number=2 exon 2926..3044 /gene="MB-1" /number=3 intron 3045..4058 /number=3 exon 4059..4127 /gene="MB-1" /number=4 intron 4128..4254 /number=4 exon 4255..4758 /number=5 BASE COUNT 1267 a 1630 c 1616 g 1157 t ORIGIN 1 cagtaaagag ctgatcatgg ttctcactcc ttgaatacca ggaacaccat ctcgtatcac 61 ataatgagac agggagacat tctggtcctc atctcacaga tgaaaaatgt caagcttcga 121 aggatcaaag tgcccaccta gtcacacggg tagtcagcca caggtcagcc tgccttattt 181 attcttcatg agtatttata gtgactaaca tttactgggc gcctactgtg ggccatttct 241 gtgcatgtga caaccccttt aagtccttgt ttctaatccc aagaagcaag gaaatggggt 301 cagggaaggg acaaggtttg cccaagtcca ggcaggggga gaggtcaagc tcagaaccat 361 cacctgccca tgacacatgc ccaggactca ggttccctag gcttccttcc aaaggctcag 421 cagtgacgag ccagcccttg aaccagcctc ttcccccacc caagcagcca cctctcaggg 481 gaattgtggc caccacaggt
gcagggagca gtttctctcc actcacagcc tgaagcatac 541 ccggcagggg ctgtccccag gcccaacaag caaagggccc agtagcgagg gccactggag 601 cccatctccg gggggctggg caggaagtag ggtggggttt ggggtaggga tctggtaccc 661 tgggactgct gcaactcaaa ctaaccaacc cactgggaga agatgcctgg gggtccagga 721 gtcctccaag ctctgcctgc caccatcttc ctcctcttcc tgctgtctgc tgtctacctg 781 ggtatgtggc caaagggcag gaactggcgg gaggtggggg aagctgtgga ggctgcagag 841 agggcacagg cagagggaag ggggctcagg gaaaggggaa gaggaggcag aggatagggg 901 acccagggaa gatgcctata gaaatcgtat ctgtgccaag atgggccaag gtggggctgg 961 agggagccca gcagaggaga aggggcgtcc acagtctcac acagggaggc aggagcaaga 1021 gtcacctccc ccacctcctg ttccccacag gccaaaataa ggaactaaag ttgctcttga 1081 ctgagcacca gggctggggg caggaagggg acttaggggt agcagcattc agcgtctgtc 1141 aaggggagaa aaagctttct ctgccttaaa cctcaggtgc ctctctctgt tgggagtccc 1201 ttctcagcac tgggggaatg ggtgtctcat ggactccccc tcacctgctc aaggacagct 1261 ggcaggggct gtggcacgct aacccaggag ttcagagaaa aaggttcccc acccaaggga 1321 cactgggagc aaggattgga gttcacgtct gagtcttaag cccgtgacga tgagggtgct 1381 cggcccctct ccccatctct tcctccttct ctcttcctca cctccctcct ccacctacct 1441 ccagaagagg ggactggcca tgtgggaggc ctggctgaga gctggggctt cccagaggag 1501 cccggattgg acactgcagc cagcctgagc cgcctcgtct cactcagaga cacccccagt 1561 ctccaccccg ctctgagccc cttcaatcac cagcagccca gcccaaggac tgaactcacc 1621 cctgacccct aggttgacac atacaactta cagagaatga gggccaggca cagggtcaca 1681 ggccagggca ggccacagac tggctcctca gcccaggcag ggagaggcca gggagccaag 1741 agtttgaacc cagtgccact cctgactgcc tggtgatgct ggcaacccgc ctgccctccc 1801 agagcctcag ccatccctcc tgtaaaatgg ggctaaggag agaacctact tctagggttc 1861 tgtgaatgat tacacaagaa aaagcgccag gtgctgggcc tggctgaggc tggggtgcaa 1921 aaatggaccg ggaaggctgc gggaggaggg gacgcctgca ctgcttctgg aaggagctgt 1981 ctggacagcg tcctccagtg cctggaacaa acatccaaaa tccagagagt tcacagggcc 2041 agagtacaaa gtgggtatgc gggaggggga caagagatgg cgctgcagag gtgagaaggg 2101 cctcccaggg gtcttaccat cccagggagt ctcattctcc tctcccagga tatcctcacc 2161 caccccaacc aggtatgtcc tctctccttc 
ccaggggctt cttcactttc ccgcatcccc 2221 cctctcccca ggatgtatca gcccctgtca ggggctctct ctctccctcc ccacccagga 2281 gagtcctcac cctcttccca ggagtgctgg aactgcaggg gccagggctg gggaaatgtg 2341 tcaccatccc cagtccctga cccacccacc ctgtctctcc acaggccctg ggtgccaggc 2401 cctgtggatg cacaaggtcc cagcatcatt gatggtgagc ctgggggaag acgcccactt 2461 ccaatgcccg cacaatagca gcaacaacgc caacgtcacc tggtggcgcg tcctccatgg 2521 caactacacg tggccccctg agttcttggg cccgggcgag gaccccaatg gtacgctgat 2581 catccagaat gtgaacaaga gccatggggg catatacgtg tgccgggtcc aggagggcaa 2641 cgagtcatac cagcagtcct gcggcaccta cctccgcgtg cgccgtgagt ggcccagccc 2701 tggcccctac tcccactgtc ccgctgggca ctcggtttat ctttgaagtg gggatagagc 2761 cagtaccttc aatgtgggtt tcaaaccggc ttggacagag ggacggacat tctcctctgc 2821 agagtggggt ctctgggggg tctggggcct tgcaggaggt gggcggggca ggaggctagg 2881 gagggcaaga ggggccaggg ctctgagcat actacctcct tgcagagccg ccccccaggc 2941 ccttcctgga catgggggag ggcaccaaga accgaatcat cacagccgag gggatcatcc 3001 tcctgttctg cgcggtggtg cctgggacgc tgctgctgtt cagggtgagc cccctcggac 3061 ctctgagtca gccgggcgag ggctgggcga gggaccccca atacccaggt agccctctag 3121 agcctgaggt tccccatcca aaacttggga gaatgaaagc acccaccata taggggctgt 3181 gggagttaaa tgaatgaata taaagaaggg acttgaactg gcgctgagcc agggggtgtc 3241 ttcaatttag tttccccttc tggctgtcct cacgtccacc tccccccaag aagagtctca 3301 ttcttctccc taagagtgtc ctcactcccc tcctgccctc acccaggagt gtggttaccc 3361 tccaggtgta gccaagacca gggaaggtgg ggcctggtcc tcccagaatc tctgatctgt 3421 accagcctct ccttaggcac tacagaaagt gtgactgtta ttgttattat tcatggagaa 3481 tagtaaggga gtggagactc aagaagacgc tagcttggga ggccgaggca agaggatcac 3541 ttgaggccac aagtttgaga ccagcctggg caacacagca agaccctatc tctacacaca 3601 caaagtctta aaaaaaaatt agccaggcat ggtggcacac acctatagtc ccagctactc 3661 aggaggctga ggtgggagga ttgcttgagc ccaggagttc aaggctgcag tgaaatatga 3721 tgttgccaca gcacttccag cctgggcaac tgagggagaa gaaagagaga gagagagaaa 3781 gagagagaga gagagagaga gagagagagg gccagaggga gggagggaag gaagggaagg 3841 gaagggaagg aaggaaggaa ggaaggaagg aaggaaggaa 
ggaaggaagg aaggagaaca 3901 ctggttgtag actcagagag aactgttaca taaccagtat gtggccttgg ggacatctct 3961 taccctttct ggaaaagtac ttcctggcat ccaggagggt ctgaaagata ttcacctccc 4021 cctgctcact gaggcaccca ccccacccac ccctacagaa acgatggcag aacgagaagc 4081 tcgggttgga tgccggggat gaatatgaag atgaaaacct ttatgaagtg agtgaagggt 4141 ggggatgggg taggggcagt tgtgttaggg gtgggggtgt tcctctgggg gtggctgggg 4201 gcagggaccc caggtgtcag ggtgctgatg ttcgctgcct catttccatc ccagggcctg 4261 aacctggacg actgctccat gtatgaggac atctcccggg gcctccaggg cacctaccag 4321 gatgtgggca gcctcaacat aggagatgtc cagctggaga agccgtgaca cccctactcc 4381 tgccaggctg cccccgcctg ctgtgcaccc agctccagtg tctcagctca cttccctggg 4441 acattctcct ttcagccctt ctgggggctt ccttagtcat attcccccag tggggggtgg 4501 gagggtaacc tcactcttct ccaggccagg cctccttgga ctcccctggg ggtgtcccac 4561 tcttcttccc tctaaactgc cccacctcct aacctaatcc ccacgccccg ctgcctttcc 4621 caggctcccc tcacccagcg ggtaatgagc ccttaatcgc tgcctctagg ggagctgatt 4681 gtagcagcct cgttagtgtc accccctcct ccctgatctg tcagggccac ttagtgataa 4741 taaattcttc ccaactgcag accttggcag gagtcgtgga tcttacggaa accgcctctc 4801 ccacatgtcc ctcagaccca ggagtcccag cccaagccac tccctgccca ctccccacac 4861 ctgggaccag gtagccagtc tgggctgccc tcctgggaga acaagatgtc tcttgggaag 4921 gtccccagac caactgaagg actggtttgg ccctctttgc agggctcacc ctagggtcat 4981 atccttaagc aagaaggggg acaagatcaa gatggctgtg gctaccaaat tacttacact 5041 tttttttttt tttttttttt ttgagacaga gtctcactct gttgcccagg ttggagtgct 5101 cactgcaacc tctgcctcct gggttcaagc gattctcctg cctcagcctc ccaagtagct 5161 gggattacag gcgcctgcca ccatgcccag ctaatttttg tatttttagt agaggcgggg 5221 tttcaccatg ttggccagac tggtcttgaa cctctgacct caggtgatcc acctaccttg 5281 gcctcccaaa gtgctgggat tacaggcgcc tgcaccacgc ccggccgctt tttttttttt 5341 tttttttttt tgaggcagag tctgactctg tcgcccaggc tggagtgcag tggcgcaatc 5401 tcagcccact tcaacctcca cctcaagcga ttcttgtacc tcaggctccc gagtagctgg 5461 gatgacaggc atgtgctacc acacccagct aatttttgtt ttagtagaga cagggtttcg 5521 ccatgttggc cgggctggtc tcgaactcct gacctcaagt gatccaccat 
gcttcggcct 5581 cccaaagtgc tgggattaca gcatgagcca ctttatgcgt atttaagcct tggaaacaca 5641 gggactatct tgtggattgg ggctagtaca // LOCUS HSU07807 4839 bp DNA PRI 28-JUL-1994 DEFINITION Human metallothionein IV (MTIV) gene, complete cds. ACCESSION U07807 NID g466264 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4839) AUTHORS Quaife,C.J., Findley,S.D., Erickson,J.C., Froelick,G.J., Kelly,E.J., Zambrowicz,B.P. and Palmiter,R.D. TITLE Induction of a new metallothionein isoform (MT-IV) occurs during differentiation of stratified squamous epithelia JOURNAL Biochemistry 33, 7250-7259 (1994) MEDLINE 94271779 REFERENCE 2 (bases 1 to 4839) AUTHORS Findley,S.D. TITLE Direct Submission JOURNAL Submitted (17-MAR-1994) Seth D. Findley, Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..4839 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 641..646 gene join(726..756,3320..3385,4404..4495) /gene="MTIV" CDS join(726..756,3320..3385,4404..4495) /gene="MTIV" /codon_start=1 /evidence=experimental /product="metallothionein IV" /db_xref="PID:g516535" /translation="MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAK CARGCICKGGSDKCSCCP" polyA_site 4655..4659 BASE COUNT 1361 a 1088 c 1323 g 1067 t ORIGIN 1 tgaaaagtag tcagctgccc ataccaaaga ttggggttat aatggagatg aagaggtggt 61 taaatctctc ccaagtcccc tcttgctgct tctgctttgt gtgtggtcta gaaggaaagg 121 ttgcatgttc tggatgggca caactccctg attttccacc ctctcactca tggggagggt 181 gcagtcagta gaggttcttg gtgggaactt ctaaatctca gggccaagga tagaggcccc 241 tgcctgccca tagtaagggc ttttacagtc aaccagggag ataggatgag gccccgagcc 301 gaggcaggag gagcccctgt gaaggaggga acagaaatgg cagataatag aatgcatagg 361 acttggagac agggtggacg tagctgccaa cctgagctcc atgtgtgcat ttcagcctga 421 gcctggttcc ctgaaacctt gccaggcggg caggtgaggt gggtgcagcc acctgcaccc 481 ctggtgaaaa gccacccaaa 
tgaccgtact gctcggcctc tcctctgccc tcctccgcct 541 gctcttccca actcccagcc caaggcatct ggctgggccg tgtgcaggac gtctggcccc 601 agggagaggt ggggtgagct gcagactaca cccacatggc tataaatggg gagcctctgg 661 ctgctgctca ctcagcctcc cttccccagc cgtgacagca ctggagcctt tcggacacct 721 ggaccatgga ccccagggaa tgtgtctgca tgtctggtga gtaaagaagc cctccctggg 781 gtctgggaac cttggaccag cttcctacag ggagcctgca ggtccctgat gaaaacttct 841 cttccctcta attaggagcc accaatgggg ttcgtcctgg gagccgacag gtggactggc 901 cacctgtggg gcagctccca ctctctgttg gctcccagcc tgattttcta gaatcgatat 961 cctagctcag ggaatggctg agagggggac ttgcaaactt tggtctttct agcgttggcc 1021 cctaagggca gtcaccaggg tgaggagaag agctctaaac cagaatctgg caacctggtt 1081 tgggtccatt gctgcggtcc cctccctggc gcttattctt tctggtcctc agtttcctca 1141 cctgtagact gaggttcctt attcccagcc aaccacatcc cagagctgct atgaaagtgc 1201 taaaaccatt aagcacctgc agatgtcagg ggagttgtgg actcccctaa cttatgtgga 1261 ctcttgagtc tctgaaactg cttaattcca cggagcatgt tagccagaag gagagaaatc 1321 agggttcaaa gccatcctgg gcttttacaa tcatggaaat ggtatgaaga agtctcacaa 1381 gtctgggtgg tgtggctcac aactgtagtc ccagcacttt gaggctgagg caggagaatc 1441 acttgaggcc aggagttcaa taccagcctg ggcaacatag cgagactcca tgtctacaaa 1501 ttttttttaa acttagctgg gcgtagtagg gtgcacctta gtcccagcta ctcaggacgc 1561 tgagaaggga ggatcacttg agcccaggag ttcgaggctg cagcaaacta ggaatgcacc 1621 actgcactcc agcctaggta atagagcaag accctttctc aaaaaaaaaa gaagaaaaga 1681 aaaagaaaga aagaaaatga agtctcacaa gactcccttg tccttaacag gatgtctctg 1741 taccaaaaac ttgcaaaaga ggcagaagct tccagcattt tgaaactcaa aatgaaaaat 1801 gggataattt ttttaagtgt agatgatcat tatgatataa aaacaaggtg ctggctccca 1861 tctgtaatcc tggcacttgg ggagtccaag gcaggatgat cacttatgcc caaggaattt 1921 gagaccagcc tgggcaatat aacaaggccc tgtttctaca aaaactttaa acaattagcc 1981 aggtgtggtg gtgcgtgcct gtggtcccag ctactcagga agctgaggca agagcttgag 2041 gctacagtga gctgtgttcc caccatggtg ctccagcctg ggtgacaggg caagaccctg 2101 tcaaaagaaa ggaagaaaga acggaaggaa agaaggaaag aaacaaggag agagaaaggg 2161 agaaagggag aaaggaagag agagagagag 
aaagagagaa agagaaaggg agaaaggggg 2221 agagagagag agaaagagaa aagagagaga aagacaggga gaaagagaaa aagaaaagaa 2281 agaaagaaag agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa ggaaggaagg 2341 aaggaaggaa ggaaggaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 2401 aaagaaagaa agaaagaaag aaataaatga gggagagagg aaggaaggaa gaaaggaaag 2461 gaagaaacaa acaaaacaag gtgtggctct gtggctcttg ttttgagttc actgacgttc 2521 cttgaccttt cttaaaccat tcacagtttt catcttgact tgtgaaggtg acacgggtgt 2581 gaaaggggtt tttgtttgtt tgtttgtttt aaggcagagt ctcactccat cacccaggct 2641 ggagtgcaga ggtgtgatct cggctcactg caacctccat ctcccaagtt caagagattc 2701 tcgtgtctca gcctccctag tagctgagat tacaggtgtg tgccaccacg cctggctaat 2761 tttttttttt ctcgtagaga tggggtttct ccatgttggc caagctgatc tcaaactcca 2821 ggcctcatgt gattcacctg tcttggcctc ccaaagtgct gggattacag acgtgagcca 2881 ctgtgcctag ccaagagggt gtgaaaggct ttgatagttc ttagaggaac cttgggcttg 2941 gaaacgctgg cctcgtctgt tggttctcat tgtacataag aattatttgg gttgcttttc 3001 aaaaatatgg actccatttc cccaccaaga cttatgaatc agaacatctc aagggttggt 3061 cccaggaatc tgcattttca cagttatccc aggcatgatt gatgcagcca tgccagttcc 3121 catccatggc ctgtgactgt ggactattgc agagtccttg acactgggtc gatgctcagc 3181 agccttgtca ccctcttgtc cagccagtcc ccaaggatat agcccaggag ccttgccact 3241 ctgacagtcg gcatcatgcc ttatgttccc cactccatct ccatcctcac gtttgtggtg 3301 gctctgtcct gtcttctagg aggaatctgc atgtgtggag acaactgcaa atgcacaacc 3361 tgcaactgta aaacatgtcg gaagagtgag tatggtgact gggggcacca tgggctggga 3421 gttagaaaag tccaatccag gccaggtgca gtggctcacg tctgtaatcc cagcactttg 3481 ggaggccgag gcgggtggat tacctgaggt caggagttca agaccagcct ggccaacatg 3541 gtgaaacccc atctctacta aaaatacaat aattagctgg gcatggtggt gggcacctgt 3601 aatcccagct actcaggagg ctgaggcagg agaattgctt gaacccagga ggcagaggtt 3661 gcagtgagca gagatcacac cattgcactc cagccttggt gacaagagtg aaactcagtc 3721 aaaaaagaaa aggaaggaag gaagaaagga aggaaggaag gaaggaagag agagagagag 3781 agagaaagaa agagagagag agagagagag agagagagag agagaaagaa agaaagaaag 3841 aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 
aaagaaagaa agaaagaaag 3901 atcccatccc caccctcaat cctccagcca tggtggattg gggcttgtgg ggagggggtc 3961 tagatgcctt ttacccatct gggagagatc tgaaatgaga actatgtaga gacctatggc 4021 aggtctctac ccagggataa gggcatccac agtgtttgga gcaggaagag gactgagggg 4081 aagttctgtc ctgcccagtg gtaacaggga tagagcactg gcctggagtc atgagcccta 4141 ggttctagtc ttgcttttgc cactgtctca tgaccttggc aaaccccctc ccttctgtgg 4201 gcttctgatg gactctagat ggtccctggg atcccttcca gttctaacat gcctcaacct 4261 cctatctcag catttttgct aatgtggatt gaaacctcat gcacacaccc ctcttttcct 4321 atttcctcta agtagtggga ggggcagtgg tgagatggaa gtgttgaccc acagcggatc 4381 tgcgcatctc ctgactcttt caggctgctg tccctgctgc cccccgggct gtgccaaatg 4441 tgcccggggc tgcatctgca aaggaggctc agacaagtgc agctgctgcc catgaaagcc 4501 atccatcgtg cccacccctt ccaaggagag aaacctggga agtgtctgta cagtgcatga 4561 atcgagaagg tggaataatt gtacaatagg ttgtgctttt tatatatttg cccaaatgtg 4621 gtgttggtca cattcatgta aagtacttgg ggcaataaag ttttcactct tggttgcctg 4681 gtggctcaga gcattgtttt accctaaccc agcaggacca aaaggtgctg cttctgttgc 4741 agatcctgta accaccaccc acatgtacct gtgacaggtg gctgcctact gcaaggtcct 4801 atgactctgc ctgggggctt tctcctcctg cacaagctt // LOCUS HSU07983 1426 bp DNA PRI 30-AUG-1994 DEFINITION Human platelet membrane glycoprotein Ib beta (GPIb beta) gene, complete cds. ACCESSION U07983 NID g507450 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Yagi,M., Edelhoff,S., Disteche,C.M. and Roth,G.J. TITLE Structural characterization and chromosomal location of the gene encoding human platelet glycoprotein Ib beta JOURNAL J. Biol. Chem. 269 (26), 17424-17427 (1994) MEDLINE 94292494 REFERENCE 2 (bases 1 to 1426) AUTHORS Yagi,M. 
TITLE Direct Submission JOURNAL Submitted (24-MAR-1994) Mayumi Yagi, Seattle Veterans Affairs Medical Center, Research, 1660 South Columbian Way, Seattle, WA 98108, USA FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene Number 961200" /sex="male" /tissue_type="brain" /dev_stage="fetus" exon 353..389 gene join(380..389,664..1274) /gene="GPIb beta" CDS join(380..389,664..1274) /gene="GPIb beta" /codon_start=1 /evidence=experimental /product="platelet membrane glycoprotein Ib beta" /db_xref="PID:g507451" /translation="MGSGPRGALSLLLLLLAPPSRPAAGCPAPCSCAGTLVDCGRRGL TWASLPTAFPVDTTELVLTGNNLTALPPGLLDALPALRTAHLGANPWRCDCRLVPLRA WLAGRPERAPYRDLRCVAPPALRGRLLPYLAEDELRAACAPGPLCWGALAAQLALLGL GLLHALLLVLLLCRLRRLRARARARAAARLSLTDPLVAERAGTDES" exon 664..1426 mat_peptide 729..1271 /gene="GPIb beta" /function="subunit of von Willebrand factor receptor" /product="platelet membrane glycoprotein Ib beta" BASE COUNT 157 a 514 c 517 g 238 t ORIGIN 1 ggatcctgag cctaggcctc ccgatgttcc cacccgcatg atcccttccc gccacacgat 61 gctccgtttt cttccgttgt gaatgccgcg tcctgtcctg gtgacaggag aacaatgttg 121 gtgaacgtcg cagcgggtgt ccgagtgctc cgtgtgcccc tgagagcggg tgggagcgga 181 agcctgagcg gcctgcggcc tccggcgata gtgtgctatc tgccgctgca gcgcgcgtcc 241 gcgcggcctc tgggctattt ctggccaggc cgcagcactg tggtcggtgc gggcgtggca 301 ggggcggggc ggccttatcg ctcggctctc ccgcctacgc ctcccgctgc agagtaagcc 361 gggctgccgt cttctcgcca tgggctccgg tgagtctgga gtccggtcgg gcccccggct 421 gctccctagg ccgacccggg ttgagaggag ctctggtcgt ttggctgcag ctgggagaga 481 cttgggtcag acttagaggg gacttccagc cggcgtgcgg ggtggtcagg gtggagaggc 541 tggcgggcta ccgggacgcc gggcatcagg ggctggatgg agccgggccg gcagtctggg 601 tactcagaga tgtcgcccag gtgcccgccg accgctcggc ttactgcggc gcttcccttg 661 cagggccgcg cggggcgctg agcttactgc tcctgctgct ggccccgccg agccgcccgg 721 ccgcaggttg cccggcgccc tgtagctgcg cggggacgct cgtggactgc gggcgccgcg 781 ggctgacttg ggcctcgctg ccgaccgcct tccctgtcga cacaaccgag ctggtgctga 841 ccggcaacaa cctgacggcg 
ctgccgccgg ggctgctgga cgcgctgccc gcgctgcgca 901 ccgcacacct gggcgccaac ccctggcgct gcgactgccg ccttgtgccg ctgcgcgcct 961 ggctggccgg ccgccccgag cgtgcgccct accgcgacct gcgttgcgtg gcgcccccag 1021 cgctgcgcgg ccgcctgctg ccctatctgg ccgaggacga gctgcgcgcc gcttgcgctc 1081 ccggcccgct ctgctggggg gcgctggcgg cgcagcttgc gctgctgggc cttgggctgc 1141 tgcacgcgtt gctgctggtg ctgctgctgt gccgcctgcg gaggctgcgg gcccgggccc 1201 gcgctcgcgc cgcagcccgg ctgtcgctga ccgacccgct ggtggccgag cgagccggaa 1261 ccgacgagtc ctgaggagag aaccggtgcg tcctgaggag agaaccggcg ctgggcaaca 1321 cgggcctgca aactcgacag gaccctgccc gaggggccct cgcgccaacc tggaccggtc 1381 cccgcctcct ccgctgccca atctctcaga cccaccccac ctgcag // LOCUS HSU08198 2344 bp DNA PRI 27-MAY-1994 DEFINITION Human complement C8 gamma subunit precursor (C8G) gene, complete cds. ACCESSION U08198 NID g494945 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2344) AUTHORS Kaufman,K.M. and Sodetz,J.M. TITLE Genomic structure of the human complement protein C8 gamma: homology to the lipocalin gene family JOURNAL Biochemistry 33, 5162-5166 (1994) MEDLINE 94227046 REFERENCE 2 (bases 1 to 2344) AUTHORS Sodetz,J.M. TITLE Direct Submission JOURNAL Submitted (30-MAR-1994) James M. 
Sodetz, Chemistry and Biochemistry, University of South Carolina, Columbia, SC 29208, USA FEATURES Location/Qualifiers source 1..2344 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q" /cell_type="lymphocyte" /tissue_type="blood" mRNA join(370..582,758..894,1053..1123,1208..1315,1587..1688, 1772..1810,1890..2097) /gene="C8G" gene 370..2097 /gene="C8G" CDS join(445..582,758..894,1053..1123,1208..1315,1587..1688, 1772..1810,1890..1903) /gene="C8G" /codon_start=1 /product="complement C8 gamma subunit precursor" /db_xref="PID:g494946" /translation="MLPPGTATLLTLLLAAGSLGQKPQRPRRPASPISTIQPKANFDA QQFAGTWLLVAVGSACRFLQEQGHRAEATTLHVAPQGTAMAVSTFRKLDGICWQVRQL YGDTGVLGRFLLQARGARGAVHVVVAETDYQSFAVLYLERAGQLSVKLYARSLPVSDS VLSGFEQRVQEAHLTEDQIFYFPKYGFCEAADQFHVLDEVRR" mat_peptide join(505..582,758..894,1053..1123,1208..1315,1587..1688, 1772..1810,1890..1900) /gene="C8G" /product="complement C8 gamma subunit" BASE COUNT 400 a 730 c 778 g 436 t ORIGIN 1 agcgggcggc ggtcgtgggc ggggttgcag gcgaggctca acgaacgctg gtctgaccgt 61 cggcgctccc tgttgccggg ccctgagcaa gtggcttcat gaaccccgtg acgttggcca 121 tggagataag accactgggt gatggtttaa ggaagataac gtgtaaaggg ctaaggactg 181 tcggtggaaa tcaggggtgc aggagaaatg gataaacagc cagaggtcaa ctcggacttt 241 gtacatagga catggtgcca ggccctgcca ggaagtgcag atcgaagcta ggctcacgag 301 gaggctggag gtggggggtg gggaggcaac ggatggacat ggacttcctg ggctgggctc 361 tgtgacagca gagtagactc tgtcctggga cttggtggtg ctacccttgg cctcccacag 421 tcctgccacc ctgctgccgc caccatgctg ccccctggga ctgcgaccct cttgactctg 481 ctcctggcag ctggctcgct gggccagaag cctcagaggc cacgccggcc cgcatccccc 541 atcagcacca tccagcccaa ggccaatttt gatgcgcagc aggtagaagt tggggggggt 601 agagggaggc aggtagaagt tgtgggaggg gtagagggag acaggtagaa gttgttgcgg 661 gggagaggga agcaggtgaa gttgtggggg gtgtagaggg aagcaggtga ggggccctcc 721 cacagtgccc tcgagttctc ccatggtctg cccccagttt gcagggacct ggctccttgt 781 ggctgtgggc tccgcttgcc gtttcctgca ggagcagggc caccgggccg aggccaccac 841 actgcatgtg gctccccagg gcacagccat ggctgtcagt 
accttccgaa agctgtgagt 901 cccagagcag ccctgcaccc taaccccaac cctcctctca gcccccggac ttcagccctg 961 ctctggcccc tgaccccacc ccggctgtgg cctggactag gattcctggt tggggtctcc 1021 cagcctgtgg tgcctcctcc ccgccccccc agggatggga tctgctggca ggtgcgccag 1081 ctctatggag acacaggggt cctcggccgc ttcctgcttc aaggtgaggc aggggctgca 1141 ggtcatgtgg gtgggggatg acgcagccac tgtggctctc tgacatggct actgtggctc 1201 tgcccagccc gaggcgcccg aggggctgtg cacgtggttg tcgctgagac cgactaccag 1261 agtttcgctg tcctgtacct ggagcgggcg gggcagctgt cagtgaagct ctacggtatg 1321 tgggggccag cctctgtgac caggcaggcg ctcaagctct gcacactcac tgggccaccc 1381 cgaggggctg ggtgagccat ggggacacac ttcctttctc ccatcctgat cctcctgcta 1441 agcaggggcc cagggagtag tgacagacag gcctggtgtg ggagcaggga ggagggcccc 1501 gaggggcagg ggacacacag accccgttcc cagagccctc cacgccgcct ggtgccagga 1561 ccccaggaac cctgtctgcc ctgcagcccg ctcgctccct gtgagcgact cggtcctgag 1621 tgggtttgag cagcgggtcc aggaggccca cctgactgag gaccagatct tctacttccc 1681 caagtacggt gagtgtcccc agcaggtccc cagctcagcc acccccactc tctggctgat 1741 gtccagcctg acccctgcct tggcgcccca ggcttctgcg aggctgcaga ccagttccac 1801 gtcctggacg gtgagtgcac agcgggggca agcatggcgg cgtggtgagg ggggccactc 1861 gcaccggctg agtctcgtct ctgctgcaga agtgaggagg tgaggccggc acacagctcc 1921 agtgctgaga agtcagtgcc ccgagagacg accccaccag tggggtgccc gctgcctgtc 1981 ctccgtgaaa ccagcctcag atcagggccc tgccacccag ggcaggggat cttctgccgg 2041 ctgccccaga ggacagtggg tggagtggta cctacttatt aaatgtctca gacccctctc 2101 tgactcttct gtccactctg gaccggcgcc agtaccacca aggccctctc tgcccccacc 2161 ccgcctcttt aaaagcccgg cgctccctgt tggctggagt ccacgcaggg tcactgggcc 2221 gatttcggct cttgggattt gggaggggag atcctctctg gcatatgcca tcttgtgccc 2281 tgctggacct gggggcgtcc acgtcactcc aaggctgctc ttgcctgggc catgcctgca 2341 gccc // LOCUS HSU09954 5542 bp DNA PRI 30-MAY-1996 DEFINITION Human ribosomal protein L9 gene, 5' region and complete cds. ACCESSION U09954 NID g607792 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5542) AUTHORS Mazuruk,K., Schoen,T.J., Chader,G.J., Iwata,T. and Rodriguez,I.R. TITLE Structural organization and chromosomal localization of the human ribosomal protein L9 gene JOURNAL Biochim. Biophys. Acta, Gene Struct. Expr. 1305 (3), 151-162 (1996) MEDLINE 96180319 REFERENCE 2 (bases 1 to 5542) AUTHORS Rodriguez,I.R. TITLE Direct Submission JOURNAL Submitted (23-MAY-1994) Ignacio R. Rodriguez, National Eye Institute, National Institues of Health, LRCMB, 9000 Rockville Pike, Bldg. 6 Rm. 304, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..5542 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="P-1 clone p1180, and subclones pL9H-1, pL9H-2, pL9H-3" /clone_lib="Human genomic P-1 library (Genome Systems, St. Louis, MO)" /chromosome="4" /map="4p13" promoter 1..573 /note="5' flanking region, putative promoter" misc_signal complement(43..47) /note="CACCC box" protein_bind complement(167..173) /bound_moiety="Sp1" misc_signal 292..297 /note="CACCC box" protein_bind 317..322 /bound_moiety="Sp1" misc_signal 326..330 /note="CACCC box" misc_signal complement(340..344) /note="CACCC box" exon 574..600 /number=1 intron 601..1051 /note="intron A" exon 1052..1098 /number=2 CDS join(1053..1098,1183..1298,1811..1906,2954..3086, 4548..4628,4843..4949) /codon_start=1 /product="ribosomal protein L9" /db_xref="PID:g607793" /translation="MKTILSNQTVEIPENVDITLKGRTVIVKGPRGTLRRDFNHINVE LSLLGKKKKRLRVDKWWGNRKELATVRTICSHVQNMIKGVTLGFRYKMRSVYAHFPIN VVIQENGSLVEIRNFLGEKYIRRVRMRPGVACSVSQAQKDELILEGNDIELVSNSAAL IQQATTVKNKDIRKFLDGIYVSEKGTVQQADE" intron 1099..1298 /note="intron B" exon 1183..1298 /number=3 intron 1299..1810 /note="intron C" exon 1811..1906 /number=4 intron 1907..2953 /note="intron D" repeat_region 2104..2249 /rpt_family="Alu repeat" exon 2954..3086 /number=5 intron 3087..4547 /note="intron E" 
repeat_region 3962..4048 /rpt_family="Alu repeat" repeat_region 4206..4433 /rpt_family="Alu repeat" exon 4548..4628 /number=6 intron 4629..4842 /note="intron F" exon 4843..4959 /number=7 intron 4960..5266 /note="intron G" exon 5267..5359 /number=8 polyA_signal 5333..5338 misc_feature 5360..5542 /note="3' flanking region" BASE COUNT 1471 a 1110 c 1230 g 1731 t ORIGIN 1 tgtctcacac acagtaagag tccaagacag aatagctatg tggggtgctc actgcagtgc 61 tccataaata tagggtacaa accactctaa gcagaaaata agtcctcggt gctggcccag 121 agagacttgc aaacaaactg atgccagaga ctctatatca tggggctggc gggataatta 181 catgagatta cacgaaaaac gtttaaaact gccgttggct agtagtaaat gttcagtaaa 241 tgcactggcc tttattattg ggcatgggcg ctctgaaggg ataggatccc cccaccccac 301 gcccaaaccc gtttgccccg ccgctcaccc ggggccccag ggtgcgggct gcatccccgc 361 agcgtagaga catttcctct ttaggactag cgcagtgcag gggttgaggg atcgctaact 421 cccgggaaag gacaggtcga aattacgtcg ctccatttat gacaagcagc caagctgctc 481 cacgattggc tgctttagcc ttacgtgcgt cgcgcactca tgacgcaata cgcaggcccc 541 ttacgcgata caagtacgta atgacgacag acgttctttc tttgctgcgt ctactgcgag 601 gtaaggatgt ttctgtgctc gtgggggtac tgggaatctg aaaccatctg ctctgggaaa 661 ggctgcggcg cggctcccac tctcggaacc ttgtcctttt tgtcccccag ctcggcaagc 721 gccatatgag cctggcggcg ccagatgcga atcctgttgt gggctttttg gcctattccc 781 gcccctcagt cttgccggga tggcaccgcc cgcataggac ttccagggtt tgggctgagt 841 gggagttcga ctgctggccc tcgtaattct cgctttgggg ctgctccttc caggctggga 901 cacactgggg cccgctgtcg gtctcccgtc ctccgacatc ttgtctggaa cttccgcctg 961 gcagtctcca gtaggagtgg agctctgtgc ggcgtagttt ggtggaaaaa cgggccttgc 1021 gtcggcctca cccccagtgt ttgtgtttca gaatgaagac tattctcagc aatcagactg 1081 tcgagattcc agaaaatggt atgagacttg atgtctttta cttacatctt tactgcacgt 1141 tccaagcgtt gtgtggcctg acgagtgtgt tctctcttct agtcgacatt actctgaagg 1201 gacgcacagt tatcgtgaag ggccccagag gaaccctgcg gagggacttc aatcacatca 1261 atgtagaact cagccttctt ggaaagaaaa aaaagagggt gagggttttt cttctgataa 1321 ttcagttgct cgttggaagg tggtttataa cagcatcaca tttggttgct ttaacaattt 1381 tcgaattgag 
gttcagtgct tcagctttaa atgttgctgt ccagaatacc agcctgttca 1441 gtgaaattga aaaaccagat tctgaacaca gccaattata taaagctcaa atgtaatagt 1501 ggtcatcagt gacacgtctt aatggtgtta gaagtttggt ttattttcct agtgattgct 1561 ttgtaaaaat cacaggaacc ggcagtaggt catttaaacc tgatgattta attttccaga 1621 agggaagttt gttgcaactt ctacaaagct taactgttgg gaggttttat aggtggttgg 1681 atttttccag ttgaaaacca tgtgctttga taagggttaa ttctagaatc ttaaagtaat 1741 tcggtaagtg gtctgttgtc ttttaagcct gcatatttct ggaaagaata cattttttgt 1801 tctgttacag ctccgggttg acaaatggtg gggtaacaga aaggaactgg ctaccgttcg 1861 gactatttgt agtcatgtac agaacatgat caagggtgtt acactggtaa gcagatgtat 1921 cagacttcct tgttttggaa agggaggttt ctcaaacctg tgctttatac agcgtaagtg 1981 tctctttcag agaatgggtc atgtatttta tttggtgtaa tacgcaagaa cttggcaaaa 2041 ataggcatct gtgggtcttt aagcgataga gctgagagca aagaccagat gggtacaaca 2101 tgtttgtttt tttctgttgt tgggtttttt ttttttttaa tcaaggctgg ggtctcgcta 2161 ttttgtccag gctgagctca agtgatcctc ccaccttggc ctcccatagt actgggattg 2221 caggcatgag ccaccatgca tggctagatg ggtacaactt ttaaaaagta ggctaatgat 2281 gaattttcat accttccgaa ttctttaaga taggaaatct acagttgcag tactagttgg 2341 tagtcaaatc tcaccatcag gaccccttgt tgggatagaa aaagatgtag tttctgagtt 2401 ttctgtctgc tgccataatg tatgtcatta ttctttccaa cccaccgtct ctcccctcaa 2461 gataaaaagt tcatgctgaa atggttttct ttgcattata ccttctagaa tgcacctgat 2521 aatggcaccc tgtacatcta ttataaatct gttaagattc ccccatctac ccttgctctt 2581 agccttaagt tccttgaagg cagggggcct gcctttggct tggtggtctc aattcctgcc 2641 atgtgctgta ccttagtaac tcagtaaatg ttagtctaat gtctaagatc agggctttgg 2701 ggttctcatt aaaagaaggc agcattgttc attgagggca taagagctag tgaaataaag 2761 cctccgcaag gagctttaca gtccactgaa tagacagaac aatgtttata aggcttggtt 2821 caggtgacag ttattagaaa atcttggcag tgagtgttac gttttaaaat aagtataagc 2881 atcttagaaa attttaaaac tattaaaaat attcctcagc attgctaata gcttgcttat 2941 ttatttaaca tagggcttcc gttacaagat gaggtctgtg tatgctcact tccccatcaa 3001 cgttgttatc caggagaatg ggtctcttgt tgaaatccga aatttcttgg gtgaaaaata 3061 catccgcagg gttcggatga 
gaccaggtat gtgcataccc tcagacagaa ttaagaaatt 3121 tttccttcat atttttttcc tccttttcca ctcccaactt ccatacttta gttggctgtt 3181 gttttgtgtt tgtgctgagt aaaaccagat ttgttttaga attatctctt gattacactt 3241 acaactgttg gtgctactta agggtttata tcattatgtc gattacaatt tcagaaagtt 3301 gatacattgg agtagcttgc agtggtggtt gtctttaact tggcaaatga aatcttttaa 3361 aaagacttta aatttacaaa aggtgtatat aatggtgtgt atttatggag cacaatgtga 3421 tgtttcaaca catgcatatt ctgtggaatg actaaatcag gctaattaac acatccatca 3481 ctccatttat ttcttcgtgg taagaacctt taaaatctaa agagaaaact acatgagtag 3541 tggttaatag cctagcgtaa atttcagctc tcactgtcca ccttgggcat gtcatctctc 3601 tgcagcttaa aattgcaaac atctttgatt aaatgagtta aagttgaaaa acaacaatgc 3661 atgcgtaata catagtaaac acggtagctt taaaaaaaat ttaaagctta gtgcttttta 3721 tttgaggcag ggtcttgctc tgttgcccag gctggagtac ggtggcgtga tctcagctca 3781 ttgcatcctc caccttcgtg ctcaggtgtt tctcccacct cagcctcccc actagctggc 3841 actgcaggtg cctgccacca catccagctg atttttgatt ttttgtagag actgtttcgc 3901 cacgttgccc aggctgattt caaggaatgc tatggtgcct ggccccagct aatttttaaa 3961 ttttttgtag aggcagggtc ttgctatgtt gtccaggctg gtcttgaact tgcaggctca 4021 agcagttctt cccccttggc ctcttaaaac atacctggct cggcctcttg atgcattttg 4081 attccattgg ggtggggagg ggtgggaaga agggagattc ttagtatcaa aataatggct 4141 gtccagaaat actctgatac tagctatggt cagcaacatt taatgaaaac ccttatgtaa 4201 aaataaaccc ctgcctcctg gcttcaagcg attctcctgc ctcagcctcc tgagtagctg 4261 ggagaatagg cacgtaccac cacacccagc taattttttg tatttttact agagatgggt 4321 ttcacagtgt tagccaggat ggtttcgatc tcctgacctc atgatccgcc cgcctcggcc 4381 tcccaaagtg ctgagattac aggtgtgagc cactgtgccc ggcctcaaaa tcttaagaaa 4441 aggttctttt ggtgcatgga gttttacatg gaataagtta gtgcctctgc aatttaaata 4501 ttttttacac agatttgatg ctgtgcaaat gccctctccc cttttaggtg ttgcttgttc 4561 agtatctcaa gcccagaaag atgaattaat ccttgaagga aatgacattg agcttgtttc 4621 aaattcaggt ttgtatgttt actatgtcta actatgccta caagattttg tctaaatctt 4681 gtttaaaata cagtattttg ctaattttta tttgattagc tttttaaatt gtgaaaataa 4741 aaaggtcaca ttctgaaatt ttaagtacag 
cacataatga aatacttcct ttttaagatg 4801 ggttaggctg atgtattaat tgctttatct tcactcctat agcggctttg attcagcaag 4861 ccacaacagt taaaaacaag gatatcagga aatttttgga tggtatctat gtctctgaaa 4921 aaggaactgt tcagcaggct gatgaataag atctaagagg taagttctta cagtgtctta 4981 agttttatta ctgctctagt tgaatgaagc tcttaattta ctaaaagact taaactgaat 5041 ttgtagtaat agaaaatgga gagttaagca ttctggaaca cagtaaattt gacagaataa 5101 aagatggaat tggcatgcat tttactacgt atagtttact atgaatcagt aagtcatagt 5161 cttgaggtat gttttagatt atttagaata gtacagaaaa aagattagta ctttcctggg 5221 caagtgtatt gatggggatt tacagatttt ttttcttttt ttacagttac ctggctacag 5281 aaagaagatg ccagatgaca cttaagacct acttgtgata tttaaatgat gcaataaaag 5341 actattgatt tggaccttct tcttaaaccg gttatccttt ttagctagtt tttttccctc 5401 gtggaacaag gagctttaat ctggttcttg aaggaatagc aaggatatat tttaggcaag 5461 aggaatattg taccagtgac aggcaaacat gaggccattc atcattgcat atagaacttt 5521 aggaataccc tgaaagtgtc ta // LOCUS HSU10307 4740 bp DNA PRI 21-DEC-1995 DEFINITION Human interleukin 13 (IL13) gene, complete cds. ACCESSION U10307 NID g505626 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4740) AUTHORS Smirnov,D.V., Smirnova,M.G., Korobko,V.G. and Frolova,E.I. TITLE Tandem arrangement of human genes for interleukin-4 and interleukin-13: resemblance in their organization JOURNAL Gene 155 (2), 277-281 (1995) MEDLINE 95237624 REFERENCE 2 (bases 1 to 4740) AUTHORS Smirnov,D.V. TITLE Direct Submission JOURNAL Submitted (02-JUN-1994) Dmitry V. 
Smirnov, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, ul.Miklukho-Maklaya 16/10, Moscow V-437, 117871 GSP7, Russia FEATURES Location/Qualifiers source 1..4740 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human genome library" /cell_type="leukocyte" /tissue_type="blood" gene 2124..4176 /gene="IL13" CDS join(2124..2255,3312..3365,3618..3722,4072..4176) /gene="IL13" /codon_start=1 /product="interleukin 13" /db_xref="PID:g1127548" /translation="MALLLTTVIALTCLGGFASPGPVPPSTALRELIEELVNITQNQK RPLCNGSMVWSINLTAGMYCAALESLINVSGCSAIEKTQRMLSGFCPHKVSAGFSSLH VRDTKIEVAQFVKDLLLHLKKLFREGRFN" BASE COUNT 1023 a 1444 c 1305 g 968 t ORIGIN 1 ggatccccgc tgacaatcta gaaacaagca acaggaccct ctgatgtagc catctgtgcc 61 gcgcctctcc gcaccgcccg ccacgccttg gtccctggag accaccctcc agggcagggg 121 ctgccgctcg gccgggcccg cggggtccct cggcctgaca tggccggtgc tggagcggca 181 cgtgcgcgcc tcggcccctc ggccgctccc gcccctcgcc ggtgcgcacc ggcgctcggg 241 gagccgctgg cccgggtgtc cagccggccc ttgccctgcc tggcgctcgg accgccacct 301 ttgccgcccc ctcgccagcc tccgcagctt ccagactggc cggtctgcgc gcccacccct 361 gcctcccgga ccggccaccg ccgaggcgcg gaggagggcc cggccgcgca gatcccgctt 421 atcggcccat ctcccgttac ataaggccac ccccctatct ccgcgggcca tcgccgccgc 481 aaccgccgcg ccagcgcctt ctcccacgcg cgggggcgcc cctgcccacc gctcccggca 541 gggcttttgg tggccatggg ggataagggg cgttgactca cccgggcggg gctccgggag 601 ttgcacagac caaggtagtt ccccgctcct tcccccatca cggagaccct gtgggagatg 661 ccgtgggccc tctactacag attaggaaac aggcccgtag aggggtcaca cggccaagta 721 gcggcactcc aggcactggg ggccctcgag gggaaggggc agacttctgg gagtcagagc 781 cagcagctgg gctgggaagc ttcgagtgtg gacagagagg gtgggaatga cgttccctgt 841 gggaagagag ggtggcaagc ctgggatgcc tctgagcggg aatccagcat gccttgtgag 901 gagggtcaca agcacaccct tgtgaggagg ttgagcccca tcgaggacag gacggaggga 961 gcctgagcag gcagagaggg ggcctgggga ggcgctggtt cggggaggaa gtgggtaggg 1021 gagaaatctt gacatcaaca cccaacaggc aaatgccgtg gcctctgctg tgggggtttc 1081 tggaggactt ctaggaaaac gagggaagag caggaaaagg cgacatgctg cagagactgg 1141 tgagcaaagg 
ggatcacccc aagccccagt ggcactagga acacttacaa tctctgacct 1201 ggactaaggc tgccagctgg cccagttaag agtttcccag aaggatggcc catacacttt 1261 aaattaaagg ggccagacac gtgcacacta cttccagcca ctctggaagc tgaggtgggg 1321 ggatcgcttg agtctgggag ttggaggcca gcctaggcag gcaacatagt gagaccccat 1381 ctccaaaaaa acaaaacaaa acaaaacaaa aaaacaccaa aaaagctccc agaaagacct 1441 ctgaatcttt ctggatctct cagtggagac cttggaaatc tgaactttga caatccctct 1501 cacagtgggg ccaaggagga attaggcaag ccaaaagaag tgaactttac tcttctattg 1561 cctgtttgaa ttttgtatcc aagcaagtgt tacttaagta atttaagaga ctggttcatc 1621 gaaaaaataa aactccccaa attcccatag ctggtagact gtggtcacag ccacagtgca 1681 ctaagactat ctgctcagca cttctggtga cccaaaaggg tctgaggaca ggagctcaga 1741 gttgggtcag ctgtccaggt actcagggtt gtcacaggca aaactgctgg aactcagggc 1801 agcattgcaa atgccacgcc gctctcaggg ccccttgcct gccgctggaa ttaaacccac 1861 ccagatcttg gaaactctgc cctggaccct tctcaataag tccatgagaa atcaaactct 1921 ttcctttatg cgacactgga ttttccacaa agtaaaatca agatgagtaa agatgtggtt 1981 tctagatagt gcctgaaaaa gcagagacca tggtgtcagg cgtcaccact tgggcctata 2041 aaagctgcca caagacgcca aggccacaag ccacccagcc tatgcatccg ctcctcaatc 2101 ctctcctgtt ggcactgggc ctcatggcgc ttttgttgac cacggtcatt gctctcactt 2161 gccttggcgg ctttgcctcc ccaggccctg tgcctccctc tacagccctc agggagctca 2221 ttgaggagct ggtcaacatc acccagaacc agaaggtgag tgtcggctag ccagggtcct 2281 agctatgagg gctccagggt gggtgattcc caagatgagg tcatgagcag gctgggcctg 2341 gtcctaagat gcctgtaggt caggaaaaat ctccatggac caaggcccgg cccagccatg 2401 agggagagag gagctgggct ggggggctca gcactgtgga tggacctatg gaggtgtctg 2461 gcagactccc cagggactac ctgctctcct ggcctggcct tgtctgccac tgccagctcc 2521 tactcagcca ttcctgaaca gaggacagca gagaagggtc cagcaccctc ccagaaccat 2581 gtggcatttg ccaactggat tttgaccata acaatgcagc cattctcccc agcaccatca 2641 taggcccgcc cttacaggag gattcgttag tagagtcgct ccttgcccca ctagtaacag 2701 ctcacatgtc ttgagcactg cttacaccag gcctggtgca cgtgctttat gtgtcatttc 2761 atcactgcca gccacctcaa gaggcaggta cgatgaaccc attctgctaa ggttcgtgag 2821 gttaagtgac agaggctgga 
ttcaagccag gcctggccaa caccagagtg tccatgctcc 2881 taactgcagt gttccctcac catcagaagg cagggcattt aatacaccag atccccaccg 2941 cctcccatct gatttgtctt ggtcaacatg gcccaggcca ctcctacttc actcgtcccc 3001 accctgaccc ttcccgcagg cccctgtcct cctgccctga ctatggcaag ccttgcatgc 3061 agcttgtccc ttactagtgg tgtcaatttt tttctctcag ctccaagacc ctaaacagtg 3121 ggacctcacc cctatgcctg ctgttcaaag cagaaaacga agctcaggaa tgctgagggg 3181 ctgccaggcc tgcctctgtg ccacaccagg gatgcttgtg gggcctgtgc tggggcagac 3241 ctggcctggg ctgccagggc aggcccacaa cccctgccag cactctgctc actgtcactt 3301 tgctcccaca gcgtccgctc tgcaatggca gcatggtatg gagcatcaac ctgacagctg 3361 gcatggtaag gacctttggg tgcagggagg atggggcaga ggctccaggc cttgggctta 3421 tcttctctga gcctcccttc catggctggg gttccaagca agcttcaagt gctctcctcc 3481 ctcccgccat aatctggccc cttcccgccc accacccaga ctcacctgcg ccaggcatct 3541 cagccccatc ttcctgcaga ctcacaaaag gcagctgccc aagcagggcc tgacccctcg 3601 gtgtcccctc cccacagtac tgtgcagccc tggaatccct gatcaacgtg tcaggctgca 3661 gtgccatcga gaagacccag aggatgctga gcggattctg cccgcacaag gtctcagctg 3721 gggtaaggca tcccccaccc tctcacaccc accctgcacc ccctcctgcc aaccctgggc 3781 tcgctgaagg gaagctggct gaatatccat ggtgtgtgtc cacccagggg tggggccatt 3841 gtggcagcag ggacgtggcc ttcgggattt acaggatctg ggctcaaggg ctcctaactc 3901 ctacctgggc ctcaatttcc acatctgtac agtagaggta ctaacagtac ccacctcatg 3961 gggacttccg tgaggactga atgagacagt ccctggaaag cccctggttt gtgcgagtcg 4021 tcccggcctc tggcgttcta ctcacgtgct gacctctttg tcctgcagca gttttccagc 4081 ttgcatgtcc gagacaccaa aatcgaggtg gcccagtttg taaaggacct gctcttacat 4141 ttaaagaaac tttttcgcga gggacggttc aactgaaact tcgaaagcat cattatttgc 4201 agagacagga cctgactatt gaagttgcag attcattttt ctttctgatg tcaaaaatgt 4261 cttgggtagg cgggaaggag ggttagggag gggtaaaatt ccttagctta gacctcagcc 4321 tgtgctgccc gtcttcagcc tagccgacct cagccttccc cttgcccagg gctcagcctg 4381 gtgggcctcc tctgtccagg gccctgagct cggtggaccc agggatgaca tgtccctaca 4441 cccctcccct gccctagagc acactgtagc attacagtgg gtgcccccct tgccagacat 4501 gtggtgggac agggacccac ttcacacaca 
ggcaactgag gcagacagca gctcaggcac 4561 acttcttctt ggtcttattt attattgtgt gttatttaaa tgagtgtgtt tgtcaccgtt 4621 ggggattggg gaagactgtg gctgctggca cttggagcca agggttcaga gactcagggc 4681 cccagcacta aagcagtgga ccccaggagt ccctggtaat aagtactgtg tacagaattc // LOCUS HSU12421 4258 bp DNA PRI 14-DEC-1995 DEFINITION Human mitochondrial benzodiazepine receptor (MBR) gene, complete cds. ACCESSION U12421 NID g529945 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 2080 to 3683) AUTHORS Yakovlev,A.G., Ruffo,M., Jurka,J. and Krueger,K.E. TITLE Comparison of repetitive elements in the third intron of human and rodent mitochondrial benzodiazepine receptor-encoding genes JOURNAL Gene 155 (2), 201-205 (1995) MEDLINE 95237610 REFERENCE 2 (bases 1 to 4258) AUTHORS Krueger,K.E. TITLE Direct Submission JOURNAL Submitted (19-JUL-1994) Karl E. Krueger, Dept. of Cell Biology, Georgetown University School of Medicine, Washington, DC 20007, USA FEATURES Location/Qualifiers source 1..4258 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q13.3" intron <1..99 /number=1 /evidence=experimental exon 100..310 /number=2 /evidence=experimental /product="mitochondrial benzodiazepine receptor" 5'UTR 100..128 gene 129..3872 /gene="MBR" CDS join(129..310,1941..2079,3684..3872) /gene="MBR" /codon_start=1 /evidence=experimental /product="mitochondrial benzodiazepine receptor" /db_xref="PID:g529946" /translation="MAPPWVPAMGFTLAPSLGCFVGSRFVHGEGLRWYAGLQKPSWHP PHWVLGPVWGTLYSAMGYGSYLVWKELGGFTEKAVVPLGLYTGQLALNWAWPPIFFGA RQMGWALVDLLLVSGAAAATTVAWYQVSPLAARLLYPYLAWLAFTTTLNYCVWRDNHG WRGGRRLPE" intron 311..1940 /gene="MBR" /number=2 /evidence=experimental exon 1941..2079 /gene="MBR" /number=3 /evidence=experimental /product="mitochondrial benzodiazepine receptor" intron 2080..3683 /gene="MBR" /number=3 /evidence=experimental repeat_region 2401..3306 /note="4 Alus" /rpt_family="Alu" 
repeat_region 3497..3581 /rpt_family="MIR" exon 3684..4122 /number=4 /evidence=experimental /product="mitochondrial benzodiazepine receptor" 3'UTR 3873..4122 BASE COUNT 794 a 1246 c 1221 g 997 t ORIGIN 1 gatctgtggg tgacaggcct tccggggatg ctggaggaga cacgggcctg accccatggc 61 ctcaggaatg ccctcacgca gccctgtctt ctctttcaga gctcccctga acagcagctg 121 cagcagccat ggccccgccc tgggtgcccg ccatgggctt cacgctggcg cccagcctgg 181 ggtgcttcgt gggctcccgc tttgtccacg gcgagggtct ccgctggtac gccggcctgc 241 agaagccctc gtggcacccg ccccactggg tgctgggccc tgtctggggc acgctctact 301 cagccatggg gtaggtgggc gtgcactggt cctggggata agcctggccc tttgcaaggg 361 gaggccaggc caggacagag ggtcttctcc aggcgggcca tggaccatgg catggtttcc 421 cctgccccat ttggcaagcc agggtgggga agctgtgggc ctgtcgattg cacaactgaa 481 cttctcctcc tgggcccctg ggggatgggt cagatgctca ccccaccctg gcatccccgc 541 catgttcaca gctccaggca gaggccggct tagtgtgtga aggttggtct tgggcagagg 601 catggacgct aagattaaca acggcagcaa cagccccaag agcagctcct ttgttgaact 661 cctgcccagt gccaggctcg tctcattcat actcaacgac ccccataata agaggcctcc 721 caaggacaca gggcttgcaa gtgtggaccc cccacagccc cttcaggctc agtctggcct 781 gagtcccctg ggaatagacc aggggaattc cacccggaaa ggacgtgctg ggaggggcag 841 ggcgcctctt ggtggctgta gcggggacca cacccaggag gcagggaatt gactgccttc 901 tgaaatgttc cctgcatcct ggttctgtgc cagcgcagtg cagggtgggg acgcaggtgc 961 caggcactct agggggcgga tagcagaggc caggccaggt gcccggctct gatgggaata 1021 aggccctgga acttgggtgt cacgtgggcc tcacccctcc tggcccaggc cctgcccagc 1081 cagggtcctg cacttccctg gctccagtgt ccccaccctg cttggccttc ctacaccacc 1141 cagtcccagc ctctcttcgg gggaaatttt ggggctccct gtggctctgg ataaggcgaa 1201 cccctcatta gtaggtgaca gccggggctc tggcctcgtg accagacttt atccactgat 1261 ggcctgccaa gagatctcag tcacttcacc gcctcagtct cctcactgta aagggggtgc 1321 tgattactgc ctgccttaat ggcatcattc atgagactga aatgagtgaa tgcttgggaa 1381 gtggtagggc ctggctcctg tgagtgctgg agggcatcgg ctgctgttac tggtggtgtt 1441 atcgttgccc tgtgttaaca gctcccatga tcaggcctcc ccctggacct caccctgcag 1501 tcacagactg actcagggtc tcacatctgc 
ctcaaagcct ttgcatgtcc tgtttcctcc 1561 cctggaaaac tcctattcat catttaaggc ccaggccaaa tgtcccatcc tctatgcagt 1621 gccctggcta ggttaagagc ctcctctcta cctattcaca ccatgtgctt ccctttcata 1681 gcagtgacag tgataataat ggcaggtatt tactgagtgt cctaaatgct gggtatcgca 1741 taagcactgc caatgtgcta actagaatct ctacagcaac cctatgaaaa atgaaggccc 1801 agacaggtca aggcaacttc cctgagatca cacagcagtc agtgtcaggt cacgtatgaa 1861 ccatcactgt gccactcgga ggtgggcaac gctcctggcc ttgttcctaa tggtgctctg 1921 aactgcggcc tctgtttcag gtacggctcc tacctggtct ggaaagagct gggaggcttc 1981 acagagaagg ctgtggttcc cctgggcctc tacactgggc agctggccct gaactgggca 2041 tggcccccca tcttctttgg tgcccgacaa atgggctggg taagtgtggc cacagcatgt 2101 gtccctgatc cctggatccg acccttggag gacgtggggc atcacatatg acactgggtc 2161 agtgtctata ggcggggcca ggggagacaa aaggccatgt ctctctagct gtaacagccc 2221 acaccctgga gcctcccctc tactgactcc tccggtgagg gaagctatta aagcagaagg 2281 ggttgcaggg gtgggtttgg gggccactgt gtaggaaaac ccacacaagc ctggtactgt 2341 gtgggcaggg tcactggccc ctctacataa cagcttcttc tttttttttt ttgagacaga 2401 gtctcactct ctcacccagg ctggagtgca gtggtgagat ctcggcccac tgcaacctcc 2461 acctcctggg ttcaagtgat tcccttgcct cagcctcccg agtagctggg actacggtgt 2521 gcaccaccac tcctggctaa tttttgtatt tgtatttgta tttttttagt agagatgggg 2581 tttcactatg ttggccaggc tggtcttgaa ctcctgacct caggtgatct acccgccttg 2641 gcctctcaaa gtgctgggat tacaggtgta agccaccgcg cccggctgac tgcttctttt 2701 ttaaagtttt aaactttttt aagagatgaa gactcgctgt gttgcccagg gcagactcga 2761 cttcctgggc tcaaacgatt gattctcccg cctttacctc ttgagtagct gggactccag 2821 gtcacgtcac tgtgcccggc agcctttttt ttttggagac agagtcttac ctgttgccca 2881 ggctggagtg cagtggcatg atctcggctc actgcagact ccacctccca ggttcaagtg 2941 attctcctgt ctcagcctcc cgagtagctg ggattacagg catgcagcac catgtccagc 3001 taattttctg tatttttttt tttttgagac ggagtcgctc tgttgcccag gctggagtgc 3061 agtggcgtga tctcagctca ctgcaacctc cgtctcccag gttcaagcaa ttctcctgcc 3121 tcagcctcct gagtagctgg gactacaggc acgtgctaca cgcctggcta atttttttat 3181 ttttagtaga gacggggttt taccatattg gtcaggttgg 
tcttgaactc ctgacctgca 3241 ggtgatccac ccgtctcagc ctcccaaagt gctgggattc acaggcgtga gccactgcac 3301 ctggacaaat agcaagtttt tgtttgggcc tagtagtaat gctgggaggt cagcatcgct 3361 ctcagtggag gctctaagct caggggaagg acgtgcaaga cctcctaagc caccaccgca 3421 cccttgcaac aaaccgagtt cacttcaggc atgtcatgtg caccccacac aaggcatggg 3481 tcaggtggca tactgttccc attttacaga tgaggaaact gaggctgcga tgggggaggg 3541 gcttggccag gtcactcaag ggtggagtgg gggtgagtga ggctcctgac tcccaaatcc 3601 agtgggagtt gggcagtggg acaggcactt gggtgaacgc ggtgcctcag gcctccccat 3661 cctccgtccc ccaatctctg caggccttgg tggatctcct gctggtcagt ggggcggcgg 3721 cagccactac cgtggcctgg taccaggtga gcccgctggc cgcccgcctg ctctacccct 3781 acctggcctg gctggccttc acgaccacac tcaactactg cgtatggcgg gacaaccatg 3841 gctggcgtgg gggacggcgg ctgccagagt gagtgcccgg cccaccaggg actgcagctg 3901 caccagcagg tgccatcacg cttgtgatgt ggtggccgtc acgctttcat gaccactggg 3961 cctgctagtc tgtcagggcc ttggcccagg ggtcagcaga gcttcagagg tggccccact 4021 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa 4081 catggaattt tataagctga ataaagtttt tgacttcctt taccatggcc tttttgcttg 4141 ggtgggaccc ctggccacag gaagaggggg agctggggct gcactggagc tcgtctctgc 4201 aggatctgcc tgggctgcct tctccagaca gggctggatc cagctggggc tgccccac // LOCUS HSU12709 2180 bp DNA PRI 28-FEB-1995 DEFINITION Human neutrophil-activating ENA-78 prepeptide gene, complete cds. ACCESSION U12709 NID g684921 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2180) AUTHORS Chang,M.S., McNinch,J., Basu,R. and Simonet,S. TITLE Cloning and characterization of the human neutrophil-activating peptide (ENA-78) gene JOURNAL J. Biol. Chem. 269 (41), 25277-25282 (1994) MEDLINE 95014315 REFERENCE 2 (bases 1 to 2180) AUTHORS Chang,M.-S. 
TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) Ming-Shi Chang, Developmental Biology, Amgen, Inc., 1840 Dehavilland Dr., Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..2180 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q13-q21" exon 289..493 /number=1 CDS join(385..493,628..760,871..954,1311..1329) /note="neutrophil-activating peptide" /codon_start=1 /product="ENA-78 prepeptide" /db_xref="PID:g684922" /translation="MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVL RELRCVCLQTTQGVHPKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKV IQKILDGGNKEN" exon 628..760 /number=2 exon 871..954 /number=3 exon 1311..>1329 /number=4 BASE COUNT 544 a 478 c 467 g 691 t ORIGIN 1 ccatgaggcc ctgctgaatc ctgctgaata gctactccct tctagctgga gccacagctc 61 cctccaccgc ggagcagggt tacaacgtcc ctctcggtag aggtgcacgc agctcctcct 121 ggccaccctc cccaccagtt cccattgtct ggcccccctc ccccaacctc ttctttccac 181 actgccccat gagttcaggg aatttcccca gcatcccaaa gcttgagttt cctgtcagtg 241 gggagagatg agtgtagata aaaggagtgc agaaggcacg aggaagccac agtgctccgg 301 atcctccaat cttcgctcct ccaatctccg ctcctccacc cagttcagga acccgcgacc 361 gctcgcagcg ctctcttgac cactatgagc ctcctgtcca gccgcgcggc ccgtgtcccc 421 ggtccttcga gctccttgtg cgcgctgttg gtgctgctgc tgctgctgac gcagccaggg 481 cccatcgcca gcggtgagag cgcatggcgc gcgggacgca ctcgcactcg ggcacagagg 541 tgcatcccag cctctgcggg gtcgctgcgt tccagggaac tctcccagca acctgcccta 601 taaagggtgt ctctctttct tccccagctg gtcctgccgc tgctgtgttg agagagctgc 661 gttgcgtttg tttacagacc acgcaaggag ttcatcccaa aatgatcagt aatctgcaag 721 tgttcgccat aggcccacag tgctccaagg tggaagtggt gtaagttctg tgctgctgtg 781 tccgctgtga ccttggcaag agagaaatcc cgcagcctgg gtcttcaacc ttggtatctc 841 atgagtgtat cttctttttc tttccttcag agcctccctg aagaacggga aggaaatttg 901 tcttgatcca gaagcccctt ttctaaagaa agtcatccag aaaattttgg acgggtactt 961 gtcactttga tctttgtggt ttctaaatct gatctaggga gaccatagac ttcacaaggt 1021 ctttattctc tgtacgattt aagtaacact tttcatgttt agaattaaaa ggttgttgaa 1081 ttgggaaagt ttttctggat 
tgtcctggga aaatatacca atcttacatg taattacttg 1141 agcaattaca cacagcttgt cactaagtta tgttttttgt ttacccattg cttttattga 1201 tttttgtatt ctcctttttt accaaacatc ataaacgctg agttttgaca agggtggagt 1261 agaaaggagt gtgaaaaatg gttaaactaa tataacattt ttctcaacag tggaaacaag 1321 gaaaactgat taagagaaat gagcacgcat ggaaaagttt cccagtcttc agcagagaag 1381 ttttctggag gtctctgaac ccagggaaga caagaaggaa agattttgtt gttgtttgtt 1441 tatttgtttt tccagtagtt agctttcttc ctggattcct cactttgaag agtgtgagga 1501 aaacctatgt ttgccgctta agctttcagc tcagctaatg aagtgtttag catagtacct 1561 ctgctatttg ctgttatttt atctgctatg ctattgaagt tttggcaatt gactatagtg 1621 tgagccagga atcactggct gttaatcttt caaagtgtct tgaattgtag gtgactatta 1681 tatttccaag aaatattcct taagatatta actgagaagg ctgtggattt aatgtggaaa 1741 tgatgtttca taagaattct gttgatggaa atacactgtt atcttcactt ttataagaaa 1801 taggaaatat tttaatgttt cttggggaat atgttagaga atttccttac tcttgattgt 1861 gggatactat ttaattattt cactttagaa agctgagtgt ttcacacctt atctatgtag 1921 aatatatttc cttattcaga atttctaaaa gtttaagttc tatgagggct aatatcttat 1981 cttcctataa ttttagacat tctttatctt tttagtatgg caaactgcca tcatttactt 2041 ttaaactttg attttatatg ctatttatta agtattttat taggagtacc ataattctgg 2101 tagctaaata tatattttag atagatgaag aagctagaaa acaggcaaat tcctgactgc 2161 tagtttatat agaaatgtat // LOCUS HSU16720 8868 bp DNA PRI 28-OCT-1995 DEFINITION Human interleukin 10 (IL10) gene, complete cds. ACCESSION U16720 NID g1041812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8868) AUTHORS Sanjanwala,B. and de Waal-Malefyt,R. TITLE The Structure of the Human IL-10 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 8868) AUTHORS Sanjanwala,B. 
TITLE Direct Submission JOURNAL Submitted (28-OCT-1994) Bharati Sanjanwala, Human Immunology, DNAX Institution, 901 California Ave., Palo Alto, CA 94035, USA FEATURES Location/Qualifiers source 1..8868 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" repeat_region 1144..1447 /rpt_type=dispersed /rpt_family="Alu" mRNA join(<4057..4221,5088..5147,5438..5590,6601..6666, 7742..8868) gene join(4057..4221,5088..5147,5438..5590,6601..6666, 7742..7834) /gene="IL10" CDS join(4057..4221,5088..5147,5438..5590,6601..6666, 7742..7834) /gene="IL10" /codon_start=1 /product="interleukin 10" /db_xref="PID:g1041813" /translation="MHSSALLCCLVLLTGVRASPGQGTQSENSCTHFPGNLPNMLRDL RDAFSRVKTFFQMKDQLDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQAENQD PDIKAHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQVKNAFNKLQEKGIYKAMSE FDIFINYIEAYMTMKIRN" repeat_unit 8427..8440 repeat_region 8441..8741 /rpt_type=dispersed /rpt_family="Alu" repeat_unit 8742..8755 BASE COUNT 2425 a 2137 c 2075 g 2231 t ORIGIN 1 ccctccaaaa tctatttgca taagcacaca cacacacaca cacacacaca ccccagcagt 61 tcttgcctgc ccagattcct ctgcagctaa agtgatgaaa cttactgggc ggagcttcct 121 aaaaagatta ttagggtctc ctgggttggt gtgcctttaa acctttggac tttaccacct 181 cctatctctc ctatctcctt gcaacaaagg ttaggagaac aagaatgcac aaaaaacggg 241 tcctggatga catctgagtg cctgctttgg gcttcttgat gagtgagaca gaaaataaaa 301 tacaaccccc tcttttaaaa gccatgctta ctcaggtttt ccttcatttg cagctaaata 361 cagaaatgag agaatatttt ggagcaggga tggaagaaga gaggtattcc ccttcccaca 421 accttctgat ttcccactac atcccccact ggaaaaattc atttaaaatc agtataataa 481 gcatttgatt agatgcctac tatgcatctg ggcttgaggg caaactggac tcagcctttt 541 ggcctcaaga agctcacagt gtgagagtgg catttgtgtc ctcttaaatt cacaggacta 601 aattgtccca ggctacattc tatccatcca taggtgcctg ccttctcact tccctctctt 661 catggctctt gccttgtagg aaaatccaaa cccaaatgtg gtgacatgtg agtgttggca 721 ttcatgtctc agacatgacc tatgggcttg ggacttttcc ccgtgtaccc cacgtgactt 781 ttcacgatga acaggtatct ccaaaaactt cgagaaatag gagtcctgtt tgtgtgttct 841 tgttgctttg tcaatatata gagagcacag ggtcatctta 
taattctaaa aatgttcatt 901 atctatctct tcgacagaaa tactatgaga catacttgat taggagaagc cgttatctcc 961 atatgctaaa tgaggacttg caccagggaa cttgcccatg gttctctcca accacttaaa 1021 ttctgaaatt ttgaaatgag agtggacagt aatttcaaat caatggggaa agaatcaaat 1081 cttcagcaaa tggcttgaga taattagcta cacatttcag aacaaataaa gaagtcagat 1141 ccgggccggg cacagtggct catgctgtaa tctcagcact ctgggaggcc aaggcgggcg 1201 gatcataagg tcaggagatc gagaccatcc tggttaacac agtgaaaccc cgtctctaat 1261 aaaaatacaa aagaaaataa aaaaacttag ccgggcgtgg tgccagcgcc tgtagtccca 1321 gctactcggg agcgtgaggc aggagaatgg cttgaactcg ggaggcagag cttgcagtga 1381 gctgagatca tgccactgca ctccagcctg ggcaacagag cgagactctg tctcaaaaaa 1441 aaaaaaagct agtcagatcc taacctcaac cctatttaac agattataga tgaagaaggt 1501 acaaatggct tttacatacc tcccttctcc ctgacatttt gtatgtgtgt gtgtgtgtat 1561 ttacacacac atctcatata aggaaattga agggaggctg cctgcatccc tgagtcactc 1621 tccctctcct tctgaatgct tacctgtgcc cagaccacct ccttagcctc gcaccctcca 1681 ggcttacagg gcactcttct atgcccatcc caagtatagc tgataccttc caagggccag 1741 acttggtgct aagtaccaag tacgcaaaga ttaataaaac aatgtcctgt ttcagggagc 1801 tcaaagctga ttcggcaggg catggtgtgt acatgaatga taaccacgta gggttgcagg 1861 tttcctagtg aggtaagcac aaggcaagat gggaaacaaa ggaaggaggg gttcacagcc 1921 tcacccagag tccagaaccc ctggcctgcc tggtgcccat gctgagtcca cttctggaac 1981 acccagctca gagagggggt tagacctgca ggctaacaca gacacagccc agaaaaccca 2041 ggagccgagg gggaaggaga aaggtgcaag aaggggaaac ccaggtcctg gtccccttct 2101 ctctgcttcc tggcagcaga actcagacag aacccttaag ccagtctaag tctggcagga 2161 ccagtaagtt ctgagttagc tccatactag tttctagcag gctctttctc acttcctgat 2221 tcttaggttt ctacattgac actccctgaa gagttgggaa gagacaccac agtcccctga 2281 ccctgatcca taggtcacac agcagggaca tccacagggt gacgtgggcc ctctcatccc 2341 tccctcccac tcacttcacg ctggctgggc cccaaggtgt ttgcacccct tgcagtgagt 2401 gaccttctct agtgcagcaa gctcagaacc tgctgccact ggagttgtcc cattgctgat 2461 gcagaaaggt gaagaactag cagaacactg gaaatgccct ccatctgggt ccatggctac 2521 ttaagctcaa tgctccctgg caggcaggag gacaggtgct attgccctgt 
tgggacagat 2581 gaaaaacaga cacagggagg atgagtgatt tgccctgact atagagtggc agggccaagc 2641 agagcccagg cctcctgcac ctaggtcaat gttcctccca gttacagtct aaactggaat 2701 gcaggcaaag cccctgtgga aggggaaggt gaaggctcaa tcaaaggatc cccagagact 2761 ttccagatat ctgaagaagt cctgatgtca ctgccccggt ccttccccag gtagagcaac 2821 actcctcgct gcaacccaac tggctcccct taccttctac acacacacac acacacacac 2881 acacacacac acacacacac acacaaatcc aagacaacac tactaaggct tctttgggag 2941 ggggaagtag ggataggtaa gaggaaagta agggacctcc tatccagcct ccatggaatc 3001 ctgacttctt ttccttgtta tttcaacttc ttccacccca tcttttaaac tttagactcc 3061 agccacagaa gcttacaact aaaagaaact ctaaggccaa tttaatccaa ggtttcattc 3121 tatgtgctgg agatggtgta cagtagggtg aggaaaccaa attctcagtt agcactggtg 3181 tacccttgta caggtgatgt aacatctctg tgcctcagtt tgctcactat aaaatagaga 3241 cggtaggggt catggtgagc actacctgac tagcatataa gaagctttca gcaagtgcag 3301 actactctta cccacttccc ccaagcacag ttggggtggg ggacagctga agaggtggaa 3361 acatgtgcct gagaatccta atgaaatcgg ggtaaaggag cctggaacac atcctgtgac 3421 cccgcctgtc ctgtaggaag ccagtctctg gaaagtaaaa tggaagggct gcttgggaac 3481 tttgaggata tttagcccac cccctcattt ttacttgggg aaactaaggc ccagagacct 3541 aaggtgactg cctaagttag caaggagaag tcttgggtat tcatcccagg ttggggggac 3601 ccaattattt ctcaatccca ttgtattctg gaatgggcaa tttgtccacg tcactgtgac 3661 ctaggaacac gcgaatgaga acccacagct gagggcctct gcggacagaa cagctgttct 3721 ccccaggaaa tcaacttttt ttaattgaga agctaaaaaa ttattctaag agaggtagcc 3781 catcctaaaa atagctgtaa tgcagaagtt catgttcaac caatcatttt tgcttacgat 3841 gcaaaaattg aaaactaagt ttattagaga ggttagagaa ggaggagctc taagcagaaa 3901 aaatcctgtg ccgggaaacc ttgattgtgg ctttttaatg aatgaagagg cctccctgag 3961 cttacaatat aaaaggggga cagagaggtg aaggtctaca catcaggggg ttgctcttgc 4021 aaaaccaaac cacaagacag acttgcaaaa gaaggcatgc acagctcagc actgctctgt 4081 tgcctggtcc tcctgactgg ggtgagggcc agcccaggcc agggcaccca gtctgagaac 4141 agctgcaccc acttcccagg caacctgcct aacatgcttc gagatctccg agatgccttc 4201 agcagagtga agactttctt tgtgagtatg attccttcct gtcctttctc tcttcctggg 
4261 actgcctgaa ctagacattc tcctggaact ataagaaccc tcctcctgcg cctccacctc 4321 catccccaac acctattccc ccaaacttaa attcttaaga agaaatccta gatcaagcca 4381 tgggttggtc agttaagcta agccagatag atacagtaaa tgtcaggaca cacctgcctt 4441 ataaagtaaa tgcgttcttt ctcgtgctga gaaacttata acgcactcct gctgcgcgcc 4501 tatatcattt attggctagg agaagtaaag aaaggtctga tgtcgaggtg aagatgctcc 4561 ccagtccttg cagcaaggga aatttaaatt gcctctgctt agagcgtttc cagcctgaaa 4621 gaccagtggt ttagggaagc actctaccat gagggaaacc tgcattagaa ggagcttctt 4681 aaatccctgg gatctttcca agctaaactg agtgtctaca gtggggagaa agaaaagcag 4741 agaacaggac atgaggggct caaggccccg aagggttgac ataggtgtcc cttaaagcct 4801 aatgtacgtc cgcagaaaga agaccaggac tgagtcaagc ttctgctttc ccttgaaaat 4861 caggccagat ttttaaaata acttgactct agaggaggag gactgattta agtgatcgtg 4921 tcccatactg ttgaatcctc tgtttttaaa ctcccctttt gtattatatt tggccagagc 4981 caatttgtat taaaaaaaaa aaaatctcta aatgaaaggg catcaaaaat accgcatttc 5041 agttatttcc ccaaacctaa agttcattct cctttttctt cctgcagcaa atgaaggatc 5101 agctggacaa cttgttgtta aaggagtcct tgctggagga ctttaaggtg agagcagggg 5161 cgggggtgct gggggagtgt gcagcatgat taagggaagg gaggctctgc ttcctgattg 5221 tgcagggaat tgggtttgtt tccttggctt gaaaggagaa gtgggaagat gttaactcag 5281 cacatcagca gcagagggtt tacaaagggc tcagtcttcg ggggaggctt ctggtaagga 5341 ggatcgcatg aacaagctgt cctcttaagc tagttgcagc agccctcctc ccagccacct 5401 ccgccaatct ctcactcacc ttcggctcct gccccagggt tacctgggtt gccaagcctt 5461 gtctgagatg atccagtttt acctggagga ggtgatgccc caagctgaga accaagaccc 5521 agacatcaag gcgcatgtga actccctggg ggagaacctg aagaccctca ggctgaggct 5581 acggcgctgt gtaagtagca gatcagttct ttcccttgca gctgccccca aaataccatc 5641 tcctacagac cagcagggac actcacatcc acagacacag caaagagaca cagctgcaag 5701 cgatcgtgta aatgaggaaa gactcctgag tcatagtctc ttctcatttc tctttgagca 5761 ggcgttgggg gtggctgcta ggcatttaca tgtgaaattt gcaaacagct tcctgttatt 5821 tgtgagtcat ttgtgggtta ttaactactc ccctctctct tcataaaagg agcccagagc 5881 ttcagtcagg cctccactgc ctctttgtac tagacctggg cggggagcta aggttcccaa 5941 
agcagaggga aacatcattc acctctttta atctcaatgt ttgaaagcaa agctctaaga 6001 agggcccaat tgactgacag gatttcccct ggcattttag aagggacaag ggggctattc 6061 atccccaggc tagtgtctat gagtaattcc tccaggaatt tatttctcca actgaaatga 6121 tgccgtcact actaatggtt tcccctgttc tgtcaccaat attggaaaat cagttggtgt 6181 ctatttgtag gacaaggcta tgtgaagggt ttggtcccag tagcttccct cctcagatgc 6241 ttagttagtg ttcctccggt ggctgtgact gacggggggg agaacaggag agagaggcag 6301 aaaaggacag gctgaagaat gcctcgctca gcactgcagg agatactgta gagttctggg 6361 ggaggaagga atcccaagac cctgggttgt catccaagcc ttgcaaacat cttggagtga 6421 gtcctggaga aatacattta actcccaggg ccatggaagc agggctcagt tctctctccc 6481 agctgtgagg cgaggatttg gataaatctg gcctcctcat gatgcaccag cttgtcccta 6541 agcgtgatgg acatggagct ggaagccagg atcaccaaca ctttctcttt tcttccacag 6601 catcgatttc ttccctgtga aaacaagagc aaggccgtgg agcaggtgaa gaatgccttt 6661 aataaggtag agagggtctc agagcacaac ccatgcccac tccccaaccc caaagcatgg 6721 aaggtggtgg gactcaatag gccccattct tcattgagag agtgtgggaa cctacaatgg 6781 tatgacctct cagccattag gagctgctgc cttgattgta tttgttttct gttaagttgt 6841 ctttgggggt tctaaatgac tgctcgcttg cctttgcagg cttgcgggtc agggctggcc 6901 gcccaggtga acacagatga gctgcatgct ggggagagtg acaaaggaaa cagaaagtac 6961 agaaagtagc ttgttgggaa tctagtctga acccacacgt gcaggaagct ggcacattaa 7021 atgtgcacat tacaaataca cctgggggtg cagcccagat ctcccctagg acctcagaat 7081 gagcaggaag ctggattgct cacttaacct ggagttggtt caagcccgct ttccatctgc 7141 ccttcgcacc tgcggaggtg cctgagaatg tcagtttccc aaacgaaatg gggtttcaca 7201 cttccaactg tgcgtgaact ttttcagtct gatttcccag aaaccgtgcg gcctatgtcc 7261 tcctcgtggg ctggggacag acactgcaca gagtgccaac atcagggggt gtgaatttct 7321 catagtaggt cagggcggca gggagggcct gctcagtgtg ttggtgggag aacacagaca 7381 tttaaaaggc tccctcctct cctctcaccg tcttgctttc gaagcgcttc ctctaatgtc 7441 ttttcatcaa actctgcata atcatcatgt gaatacgtga cctttaaaat tgttgaaaag 7501 gcatcatttt gaagacagtg ctttgcaaaa tgaatgctac cccaattgct agggggaggc 7561 ctggaggaga tgaaaggtca atgcacagcc tttcccaagg cagctaggcc tatcctctgg 7621 tttacttccc 
agcgtgaggg agaacaagca acctctgcac tcaaggtcat gcccatccat 7681 gagcatgagg gaggggagcc tatttagtcc ccagaaagga ttttaactgt atgtttctta 7741 gctccaagag aaaggcatct acaaagccat gagtgagttt gacatcttca tcaactacat 7801 agaagcctac atgacaatga agatacgaaa ctgagacatc agggtggcga ctctatagac 7861 tctaggacat aaattagagg tctccaaaat cggatctggg gctctgggat agctgaccca 7921 gccccttgag aaaccttatt gtacctctct tatagaatat ttattacctc tgatacctca 7981 acccccattt ctatttattt actgagcttc tctgtgaacg atttagaaag aagcccaata 8041 ttataatttt tttcaatatt tattattttc acctgttttt aagctgtttc catagggtga 8101 cacactatgg tatttgagtg ttttaagata aattataagt tacataaggg aggaaaaaaa 8161 atgttctttg gggagccaac agaagcttcc attccaagcc tgaccacgct ttctagctgt 8221 tgagctgttt tccctgacct ccctctaatt tatcttgtct ctgggcttgg ggcttcctaa 8281 ctgctacaaa tactcttagg aagagaaacc agggagcccc tttgatgatt aattcacctt 8341 ccagtgtctc ggagggattc ccctaacctc attccccaac cacttcattc ttgaaagctg 8401 tggccagctt gttatttata acaacctaaa tttggttcta ggccgggcgc ggtggctcac 8461 gcctgtaatc ccagcacttt gggaggctga ggcgggtgga tcacttgagg tcaggagttc 8521 ctaaccagcc tggtcaacat ggtgaaaccc cgtctctact aaaaatacaa aaattagccg 8581 ggcatggtgg cgcgcacctg taatcccagc tacttgggag gctgaggcaa gagaattgct 8641 tgaacccagg agatggaagt tgcagtgagc tgatatcatg cccctgtact ccagcctggg 8701 tgacagagca agactctgtc tcaaaaaaat aaaaataaaa ataaatttgg ttctaataga 8761 actcagtttt aactagaatt tattcaattc ctctgggaat gttacattgt ttgtctgtct 8821 tcatagcaga ttttaatttt gaataaataa atgtatctta ttcacatc // LOCUS HSU17969 4862 bp DNA PRI 09-FEB-1996 DEFINITION Human initiation factor eIF-5A gene, complete cds. ACCESSION U17969 NID g602244 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Koettnitz,K., Wohl,T., Kappel,B., Lottspeich,F., Hauber,J. and Bevec,D. 
TITLE Identification of a new member of the human eIF-5A gene family JOURNAL Gene 159 (2), 283-284 (1995) MEDLINE 95347615 REFERENCE 2 (bases 1 to 4862) AUTHORS Werner,F. TITLE Direct Submission JOURNAL Submitted (01-DEC-1994) Fred-Jochen Werner, Sandoz Research Institute, Mrg-Ir, Brunnerstrasse 59, Vienna A-1235, Austria FEATURES Location/Qualifiers source 1..4862 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="placenta" exon 1..125 /number=1 intron 126..1974 /number=1 exon 1975..2159 /number=2 CDS join(1995..2159,3372..3476,3704..3835,3932..3994) /codon_start=1 /function="initiation factor" /evidence=experimental /product="eIF-5A" /db_xref="PID:g602245" /translation="MADDLDFETGDAGASATFPMQCSALRKNGFVVLKGRPCKIVEMS TSKTGKHGHAKVHLVGIDIFTGKKYEDICPSTHNMDVPNIKRNDFQLIGIQDGYLSLL QDSGEVREDLRLPEGDLGKEIEQKYDCGEEILITVLSAMTEEAAVAIKAMAK" intron 2160..3371 /number=2 exon 3372..3476 /number=3 intron 3477..3703 /number=3 exon 3704..3835 /number=4 intron 3836..3931 /number=4 exon 3932..4005 /number=5 intron 4006..4174 /number=5 exon 4175..4862 /number=6 BASE COUNT 903 a 1318 c 1313 g 1328 t ORIGIN 1 ctgcgtacta agacccgtgt gcagcagcgg cggcggcggt agaggcggcg gcggcggcgg 61 cacggctcgg aggcagcggt tgggctcgcg gcgaggacgg ggtcgagtca gtgcgttcgc 121 gcgaggtgag agcgggcagg gcgcgtgtgc ggtaccttgg cttgggcccg ggagaagatg 181 gaactgcgcg ggggtcgggg agggggcaga ggggagcggc ggccgggatc gggttcggcc 241 agagagcggc gcgaggtgag gagaggcggg gggccccggt tggggggcgg ggtgacggtt 301 ggggttggcc cggtgacgtt ggagcgacgc aggggggggg acggcgagcc gcgaggcagc 361 cacgcgggag gacgccaggc gccaacgccg gggcgagggc cagggccaga tcgcgccgca 421 ccggggcaag ggtacctggg ccgttgagga cccggctcag acgcggccac gtgagggtgg 481 gggagggggc tattcggaga gagggcatga tggggggggt tggcggacgg ccggggggag 541 gggaggcggc gtggctccgg cctggcgcag tctctgagga gggggcggcc gcggcaccgg 601 aagtgccccc cacgggaggc tgccctcggg gtcgcaggcc gcatggccaa gcgtggaccg 661 gggccgcatt ggcagggcgg ggacccctcc ccccaggtcc cggccaccgg aagcccggaa 721 cggccacatg gtcggcagga gccggcagcc 
cctagcgcgg ggagctagtc ctcgcggggg 781 ttccgagggg gcctgagatt ttgaggagtg gaagccggag gctcgggtcc taatcacccc 841 ggagggcctg ctgcaggcat atgttagcgc ttcccaacct caagggcccc aggagctttc 901 ctgaaaaacc tcctgtgaag tgtggcggtc agcaggtgta agtggcaccc cttcgacctg 961 ggaggccgga gaaatgggga aacagggcgt ttggacagga gcccaacttt tctgtcggcc 1021 tgggtggaca gcgcctaaaa gcctcaggca gatgagtttc ccaccgtgat aggcgttctt 1081 caccttgtca ctgtcttcct gttgtaactg gtttgggagt tccacggttt cgaagtctta 1141 actctactag cctagtatga ttttctacta ctcactacta ccaacctttt gctttcattt 1201 tattgtcatt caccaccccc ctcctctccc actttctact gactgtactg gcccggcctg 1261 aaaatgccga caggacagat gcatcctggg aaaagtggga gggccttcct ggtgtggctg 1321 ggttcctggg ttctcttggg gaaccaagat cgaggtgttt ggcaggggga tcatttcgga 1381 aacaactttg ctttctcccc cttcccttgt tgcaagtgca ttttggcgag gcgggaatcc 1441 cccaaaaagg ccttgtggtt tcctgtagca gtgactggtt tgtgtgtgga tggggggggg 1501 cagtgtaaca gatgggtggg gccttgtctc cgtctccacc ctcccccccc ccaattagcc 1561 ctgccctttt gctggattat agctgctcac ctaattatag ccctggctat atgtggggaa 1621 gaggggtggt tggtttcagt ggcaagtaac acctgagtcc tccccactcc caggaagtag 1681 gccttttctc atctctgaga tttgacattc cagctcccta gagccttcct ttcccaagcc 1741 accactttag ctttgattta tgggggagaa accagctatt ccaccctccc atacacacac 1801 cagttcttga gatatttgag cccagtcagt attttaagtt caggaacttt gactttattt 1861 taggtgggta atggaataaa gagttgggaa aatgtttctg gagagaaatg ggatagattg 1921 gaggtcctgt tgacctccat gcttattcac ttcagttctc ttcttggctc ttagttggat 1981 cgaagcctct taaaatggca gatgacttgg acttcgagac aggagatgca ggggcctcag 2041 ccaccttccc aatgcagtgc tcagcattac gtaagaatgg ctttgtggtg ctcaaaggcc 2101 ggccatgtaa gatcgtcgag atgtctactt cgaagactgg caagcacggc cacgccaagg 2161 ttagaatttc acctccgcat cttgccttcc ccatgcctcc agtatctttg gcccctccca 2221 gcagcaagct gccaaccgtc ctgactttcc tttaactctc cttttactgt ctgattgttt 2281 ttgtttagcg ctagcctata aaattagaca actaaacctg ctcccagtct tcagcctctt 2341 ctgtggttac ccttctgtcc ccagcaatat tctctggcag tccctccgtt tctccagctt 2401 tgtgccaatg tgttgaatta ataggcctta atgctctagc 
tattcagttg ctccttcctt 2461 attcttggtt ttcacttttt ctttcttccc ttgagccgtt gtaagtggtt gcttccatca 2521 cgttgcccag ggtttctcca attctagcct tcccctggag ggactcctgc gtcaattctg 2581 agctttcagg tcagcttgct gcttcgttcc ccagttctag ctttcccttt ggaactctga 2641 cgcaggatca aactgcccct acctgcccca tcccactctc cttgggttcc ttgagcctcc 2701 atgctttcat gggttttaac attcttgata gcacttccct catcacctca gactttcctg 2761 tctctttaga atttcaacct gattacgttc ttgttgaatt tcttgtggtg tttttcctta 2821 atcagttttt ccatctttga cttgacatga tcttggttat tctctagtgt ctggatccct 2881 ctctgtgctc ttctggcttc cttattttct agctacatag ttgtttctga tctccctgga 2941 agtccttttc ttcagctctt tcacatttct tagtttctta tcctttccct tctaccttcc 3001 ttattcccca gtttctcagc tccttctcct ggtgtccgga aaactcttcg aaacttttac 3061 tcagactcct cccactcccc tagactctta gcatcttatc ttttatcaag gtaactgtct 3121 tctccaagcc tgccatgcag ttttctattg tgctgtcaac cccaatctca acggtggttt 3181 tttagcctct gttttttttc tttaagaggc tttactgttt cttgagttga ctggttcaat 3241 tccaacactc ccagtctttg ccccaacaac aatgtaattc tgacttccct catcaggcaa 3301 ggtcaatttg agcctttatt tcctccgttt tagagtttgg ttgggtttct ctttgtgatg 3361 catacataca ggtccatctg gttggtattg acatctttac tgggaagaaa tatgaagata 3421 tctgcccgtc aactcataat atggatgtcc ccaacatcaa aaggaatgac ttccaggtat 3481 gtagatggtc tggatgagga tgggttagcg gtttatggtg gagggagggg ttgggggata 3541 gggactgacc aggagagatt gtgctgggag agaggaggga aatggcagga gagggtgttt 3601 ggtattagtt tgctatcagt ccagtttgtc tatcagagcc tttactgtca tgtcagcctc 3661 cctgggccta ctaccttcag cctccttccc tatctgcccc cagctgattg gcatccagga 3721 tgggtaccta tcactgctcc aggacagcgg ggaggtacga gaggaccttc gtctccctga 3781 gggagacctt ggcaaggaga ttgagcagaa gtacgactgt ggagaagaga tcctggtatg 3841 gtgcctccct ccctgctttt gtgctcagct ttgttctgta cgtttcttcc tgagctcaga 3901 catctcttgg ctatccctct tgcttctcca gatcacggtg ctgtctgcca tgacagagga 3961 ggcagctgtt gcaatcaagg ccatggcaaa ataactggct cccaggtgag tgtgacaaat 4021 ccctcactgt ccccttcaca ttttgttgtc ctcagcagag ctgctgtagt cttaattgtt 4081 tcaggcttct tcctgcccct agcctggctc tgtcctccct atccttcctg 
ttgaaggtat 4141 tatcctgtct tactaatttc tctctcctac ctaggatggc ggtggtggca gcagtgatcc 4201 tctgaacctg cagaggcccc ctcccccagc ctggcctggc tctggcctgg tcctaggttg 4261 gactcctcct acacaattta tttgacgttt tattttggtt ttccccaccc cctcaatctg 4321 tcagggagcc cctacccttc acctagctcc cttggccagg agcgagcaaa gccgtggcct 4381 tggtgaagct gccctcctct tctcccctca cactacagcc ctggtggggg agaagggggt 4441 gggtgctgct tgtggtttag tttttttttt tttttttaat tcaatctgga atcagaaagc 4501 agtggattct ggaaaatggt ccttgtgccc tccccactca tccttggtct ggtcccctgt 4561 tgcccatagc cctttaccct gagcaccacc caacagactg ggaaccagcc ccctcgcctg 4621 cctgtgtctc tctccaaacc cctttagatg gggaccccag aggaggagag gggaggggac 4681 ctgccccctc ctcaggcatc tggaaggccc tgccccatgg gctttaccct tccctgaggg 4741 ctctctcccc gacacatttg ttaaaatcaa acctgaataa aactacaagt ttaatatgaa 4801 gcccccactc agctgcttat ttgaattaaa tgggtgtatt tggaagccac aaaaagactg 4861 aa // LOCUS HSU19765 6913 bp DNA PRI 09-MAR-1996 DEFINITION Human nucleic acid binding protein gene, complete cds. ACCESSION U19765 NID g790570 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6913) AUTHORS Flink,I.L. and Morkin,E. TITLE Organization of the gene encoding cellular nucleic acid-binding protein JOURNAL Gene 163 (2), 279-282 (1995) MEDLINE 96011648 REFERENCE 2 (bases 1 to 6913) AUTHORS Flink,I.L. TITLE Direct Submission JOURNAL Submitted (11-JAN-1995) Irwin L. Flink, Internal Medicine, Room 6309, Univerisity of Arizona, University Heart Center, 1501 N. 
Campbell Ave., Tucson, AZ 85724, USA FEATURES Location/Qualifiers source 1..6913 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda 911; lambda 10A5" /sex="male" /tissue_type="placenta; peripheral blood" /dev_stage="newborn" exon 202..1036 /number=1 exon 4393..4530 /number=2 CDS join(4407..4530,4712..4804,4974..5172,5679..5796) /codon_start=1 /product="nucleic acid binding protein" /db_xref="PID:g790571" /translation="MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQ FVSSSLPDICYRCGESGHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCG KPGHLARDCDHADEQKCYSCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYR CGESGHLARECTIEATA" exon 4712..4804 /number=3 exon 4974..5172 /number=4 exon 5679..6913 /number=5 BASE COUNT 1786 a 1478 c 1670 g 1979 t ORIGIN 1 aaaaggaaaa taaatcatag tgttgaactg gcaggtttca ctgagacgaa acttgggact 61 cttccttttt tttttgtttc gaataaagcc attctagaat gagaccaaat tctaaaatat 121 tttatagtta acagtttaaa ttgggtttaa tcttgacaag actatctagg gctatataca 181 caaatctctt ttggagaaaa taccacaact aaactgaagt ctattcctga atatgacaga 241 ccaggtcaaa tggttatcct tgccctcccg ggggatgtca ctcataaacg tgccaaaagt 301 cacagtctag gccccattac cttacatgct catgaccttc ccagggaggc ccctcgccct 361 taccaggcac tttcatcttg ggaagacaca tcaagtcctg gggcggagaa agcagcaagg 421 cctttccccg gctcacaaaa attaatacaa atctcagagg ctgcatccca cagccgtgac 481 caccgtgact tgcatcccct tttctgcaaa cttaaatgtt atctagaaat cgggcctggc 541 tctgaaagcc aagggcctgg caggagcccg agaaagggga gaaactttct gcggccccaa 601 gctaatggca gtcactgcac cgagacccgt cccctggcat ccctttgctc cagctggcca 661 agacagacca ccaaggtcag ccagatttcc acccagtctg gccgggcccg gacccagctg 721 ggaatgaacc gagaagcacc gggacccgga tcccggcgtg aaaggccgcg tgcggggcac 781 ggcgggaaaa gacgctgcgc gcagaaacac ccgccccgcg ccgcgctcta gtgggcggcc 841 ctgccgcggg cggctctgat tgccgcgtgc ggcgaggcgg aggagagccg tgcgcagcgg 901 cgtatgtggg accgtgtgca gacccgcgtg tggcgcagtc aaggaccctc aaaataaaca 961 gcctctacct tgcgagccgt cttccccagg cctgcgtccg aatctccgcc gctgcgggcc 1021 cgctccgacg cggaaggtga gggctggggg aggggaccgg cgctgacgga gccgcagtgc 
1081 ggggtcgggt ctgtggcgga cagagagagg gtagggagag gcgaggtggc gatggcggcc 1141 gcactttggc ctgcgcctct gctgcgtcag gcgggaagct cggctgctgc cgccgcctcg 1201 gacccgggtt tctggcgcac cgctgtcgga cgacacttct gtcctttctt cgtcctggaa 1261 agctgggtcg ccgagcatgc gggtctttcg gcgccacggc cacaccccag gccgcaggct 1321 tagggcagag gaggcccgcc cgtgcgccct tggggccgag gccctgacgc ttcgagggtc 1381 gcggaatgag ggaccgaggg tggatttggc gggaactcac tggaaggagt ccgtgtggtg 1441 gggaaaggct cccggctgcg gatgaagggg ggatggggtt gggtatagtc cgtgcaggcc 1501 atgtgctggg gttcgtgcgc ctggcgggcc atgtgccaag ggttttgggg cttagaaagg 1561 gttcctaggc cgggcgcggt ggctcacgct gtaatccagc actttgagag tcccaggcgg 1621 gcggatcacg aggtcaggag ttcgagacca gcctgaccaa tatggtgaag tggtctgtac 1681 gtctgtacta aaaataaaaa attagccggg catggtggcg ggcgcatgta gtcccagcag 1741 ctcgggaggc tggacaggag aatcgcgtga accccggagg ccgaggttgt ggtgagccga 1801 gatcgcgcca ctacactcca gcatgggcaa cagagagaga ctccgtctta aaaaaacaaa 1861 caaacaaaca aacaacaaca agggttcctg aagaagcctt tgtgtttgga gtggcgagac 1921 tgctggaaga cttgggagct tttagagttt ccatactccc tatccttgat agttttccga 1981 ttcttgaatt tttatcgtca tttaaatact aagttgcttg tgttacatta ccattcccaa 2041 aagggggctg atggggctca cattccaaga gttaacacta tttaagttgc tgggatcctt 2101 taaaagcgcc attaccagaa aaaacacgat tttgtcaaac ctccaaaacc acagcagcgg 2161 gcggtagtct gcatcatttc ttggattaat gaaacagatg taattacaaa cgagacacga 2221 aattcaacta gctcccctcc atctagattt ttccatatcg tgagaacctg ttttagaatg 2281 tcatgatggt ccacatttgg gtttaggtgt tgatgttatt atgggtaagc cttgtgcttg 2341 ttcccacatg ttaaccatat ggcctcagcc acagggcact tccaaaggaa gtgactgttt 2401 ctggtcttgg gggtcttgta aaaagagaac attgctcagt aatcgtctgt gattttagct 2461 agtgtgtttc aggcattatt cagaaggact caggtgagat aagccaaaac tgaatttgtt 2521 ttttgtcttt ctcaaagtga aggaggtcta atgaatatcc ccatcttgct tttaaattac 2581 atttttaaaa gtagattttt ccccctttcc tattgtttga cccaatttgg agtgaaacgt 2641 aaccagttac tatttccatt cgaatttaaa ttagcaattt tatgttattt gtttgttcaa 2701 gcagtataac tggagtgtag agctttgagg gtttcaaaaa gataagagat atagtcctta 2761 
tctcctgggc ttccccctcc cccctcctaa atagttttaa atgcttctaa tgagttactc 2821 tggttaagga taatcaaaca cctgtaactg ccaggatcct aggtacatgc tgtttttagt 2881 ttgttgagcc tgattcttgt ctacaagagt tctttgtgta ttggaatata aaaggaataa 2941 tttattacat tcccaagggc agaattaaag acttaagttt tccgatttca tctcttgata 3001 agtttttctt taaaaaaata acagtttgtg tttttctgag gaaccaaagg tcctcttttt 3061 tttcatattg gtaacaggag aggtaatgta tttcagatgg tgcagtctgt aaaatatttt 3121 gaaccaaatc agtggaagac caggggtttt tctttttttt tttttttttt ttttggagac 3181 ggagtctcac tctgtcgccc aagctggagt gaagtggcgc gatctcggtt cactgcgacc 3241 tccgcctccc ggattaagcg attctcctgc ctcagcctcc gaagtagctg ggattacagg 3301 cgcccgccgc cacacccagc tagtttttgt attttagtga cagacggggt ttcaccatgt 3361 tggccaggct ggtctcgaac tcctgacctt gtgatccgac tccctcggcc tctcaaagtg 3421 ctaggattac aggcaggagc caccgcgcct gccaggtttt tcctaaactg catttgaaca 3481 tctggaacag gcagggagat gtctttttta aagtataaat gtgttttgtt acatgattta 3541 tgacaattct acttgtcttt tttttttttt tttttttttg gtgacagagt ctttctctgt 3601 cggccaggct ggaatggagt ggcacagtct cggctcacag cagcctccat ctcccgggct 3661 caagcaattc tcctgcctca gcctcccaag tagctgggat tacaggcgtg tgccaccacg 3721 cccggctaat ttttgtattt ttagtagaga cggggtttcg ccatgttggc cagactggtc 3781 tcaaactcct gacctcagat gggccacccg cttcagcctc ccaaagtgct aggattacag 3841 gcatgatcca ccgtgcccag ccactaccaa ttatttctct taatggattt tcattgaccc 3901 taaccctgta aattccatca cttttatcaa ggtgtatatt ataataagtc tataataccc 3961 aatcatgtag ttgtgtgatt attttatttt tttgagacag agtctcaatg ttgcccaggc 4021 ctggagtaca gtggcaccat ctcagctcac tgtaagctcc gcctcctggg gttcacacca 4081 ttctcctgcc tcagcctccc agtagctggg attacaggcg cctgccactt caccgggccg 4141 gctaattttg tatttttagt agagacgagg tttctccatg ttggtcaaga tggtcttcga 4201 attcatgatc cacccacctc ggcctcccaa agtgatggga ttactggcgt gagccaccat 4261 gcccagctat tttttttaac caatatatta gctagctttt ttccccagaa taattttccc 4321 caaaacattt aatagagaat aaaagaaaag aactttcagt ggtttaatgc tgttactttt 4381 aatatttcaa agatctgact gcagccatga gcagcaatga gtgcttcaag tgtggacgat 4441 ctggccactg 
ggcccgggaa tgtcctactg gtggaggccg tggtcgtgga atgagaagcc 4501 gtggcagagg tggttttacc tcggatagag gtattttgtc gaatagaaaa atttgaagta 4561 cttcagtatt tgttagtatc aagactggtc tgactagccg aattctttgt ttttgcgtat 4621 tttgtcgaat agaaaaattt gaagtacttc agtatttgtt agtatcaaga ctggtctgac 4681 tagccgaatt ctttgttttt gctcaaaaca ggtttccagt ttgtttcctc gtctcttcca 4741 gatatttgtt atcgctgtgg tgagtctggt catcttgcca aggattgtga tcttcaggag 4801 gatggtaagt atttaacact tccttttcat acccctctag agcttggaga ggtgagcaca 4861 tgcaactgtg tatagcattt ccacctttga ggttttgtat tgtataagtt aaaacgtaac 4921 actttgtaaa ggtattatag tacatggcct gtttcttttc cttattgttg aagcctgcta 4981 taactgcggt agaggtggcc acattgccaa ggactgcaag gagcccaaga gagagcgaga 5041 gcaatgctgc tacaactgtg gcaaaccagg ccatctggct cgtgactgcg accatgcaga 5101 tgagcagaaa tgctattctt gtggagaatt cggacacatt caaaaagact gcaccaaagt 5161 gaagtgctat aggtaaggtg tcagaatgtt gttagaagaa aactcattgc agagattctt 5221 ccagagatga attagctata aatggaaggg ccttagtaat tcagtgaaac ttagctgtga 5281 ccagataaga ccaattttca gcatatgtaa ctggcagtct atctgtatat aattctgtat 5341 tctgccctga tatcctgtgg cttatggtac ctgggcagtt ttcacaactg gactttttta 5401 atatataaaa gtaagagtgt tataatttga aacttccaga gacttcatag aaagctctgt 5461 aatatacata aatcttttat catgtaacca gaaatctttg cctgtttgtg acatgtaagt 5521 gtataatttg ataaatgttg ttgtgtacat atctgtgaaa ccttaggggt taattgcatg 5581 aaaacaaaga tcaggcgttt tgttctgcat ggtgactgtt gctttggtag acagtttttt 5641 ctgaggccca ttgtgaaaac ttttaatttc ttttttaggt gtggtgaaac tggtcatgta 5701 gccatcaact gcagcaagac aagtgaagtc aactgttacc gctgtggcga gtcagggcac 5761 cttgcacggg aatgcacaat tgaggctaca gcctaattat tttcctttgt cgcccctcct 5821 ttttctgatt gatggttgta ttattttctc tgaatcctct tcactggcca aaggttggca 5881 gatagaggca actcccaggc cagtgagctt tacttgccgt gtaaaaggag gaaaggggtg 5941 gaaaaaaacc gactttctgc atttaactac aaaaaaagtt tatgtttagt ttggtagagg 6001 tgttatgtat aatgctttgt taaagaaccc cctttccgtg ccactggtga atagggattg 6061 atgaatggga agagttgagt cagaccagta agcccgtcct gggttccttg aacatgttcc 6121 catgtaggag gtaaaaccaa 
ttctggaagt gtctatgaac ttccataaat aactttaatt 6181 ttagtataat gatggtcttg gattgtctga cctcagtagc tattaaataa catcaagtaa 6241 catctgtatc aggccctaca tagaacatac agttgagtgg gagtaaacaa aaagataaac 6301 atgcgtgtta atggctgttc gagagaaatc ggaataaaag cctaaacagg aacaacttca 6361 tcacagtgtt gatgttggac acatagatgg tgatggcaaa ggtttagaac acattatttt 6421 caaagactaa atctaaaacc cagagtaaac atcaatgctc agagttagca taatttggag 6481 ctattcagga attgcagaga aatgcatttt cacagaaatc aagatgttat ttttgtatac 6541 tatatcactt agacaactgt gtttcatttg ctgtaatcag tttttaaaag tcagatggaa 6601 agagcaactg aagtcctaga aaatagaaat gtaattttaa actattccaa taaagctgga 6661 ggaggaaggg gagtttgact aaagttcttt ttgtttgttt caaatttcat taatgtatat 6721 agtgcaaaat accatattaa agaggggaat gtggaggact gaaagctgac agtttggact 6781 tttctttttg tacttaagtc atgtcttcaa taatgaaaat tgctgttaaa aggatgtatg 6841 ggatttagat acttttgcaa agctatagaa aattcacttt gtaatctgtt ataataatgc 6901 ccttgagttc tgt // LOCUS HSU19816 3293 bp DNA PRI 14-APR-1995 DEFINITION Human thyroid transcription factor-1 (TTF-1) gene, complete cds. ACCESSION U19816 NID g767832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3293) AUTHORS Ikeda,K., Clark,J.C., Shaw-White,J.R., Stahlman,M.T., Boutell,C.J. and Whitsett,J.A. TITLE Gene structure and expression of human thyroid transcription factor-1 in respiratory epithelial cells JOURNAL J. Biol. Chem. 270 (14), 8108-8114 (1995) MEDLINE 95229626 REFERENCE 2 (bases 1 to 3293) AUTHORS Whitsett,J.A. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Jeffrey A. 
Whitsett, Pulmonary Biology, Childrens Hospital Medical Center, 3333 Burnet Avenue, Cincinnati, OH 45229, USA FEATURES Location/Qualifiers source 1..3293 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" mRNA join(143..711,1675..>2417) /note="TTF-1" /product="thyroid transcription factor-1" exon 143..711 /number=1 5'UTR 143..338 CDS join(339..711,1675..2417) /note="TTF-1" /codon_start=1 /product="thyroid transcription factor-1" /db_xref="PID:g767833" /translation="MSMSPKHTTPFSVSDILSPLEESYKKVGMEGGGLGAPLAAYRQG QAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNLGNMSELPPYQD TMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGDVSKNMAPLPSAP RRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKR QAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVLVKDGKPCQAGAPA PGAASLQGHAQQQAQHQAQAAQAAAAAISVGSGGAGLGAHPGHQPGSAGQSPDLAHHA ASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW" exon 1675..2417 /number=2 BASE COUNT 676 a 1003 c 1020 g 594 t ORIGIN 1 aacttaaagg tgtttacctt gtcatcagca tgtaagctaa ttatctcggg caagatgtag 61 gcttctattg tcttgttgct ttagcgctta cgccccgcct ctggtggctg cctaaaacct 121 ggcgccgggc taaaacaaac gcgaggcagc ccccgagcct ccactcaagc caattaagga 181 ggactcggtc cactccgtta cgtgtacatc caacaagatc ggcgttaagg taacaccaga 241 atatttggca aagggagaaa aaaaaagcag cgagcttcgc cttccccctc tccctttttt 301 ttcctcctct tccttcctcc tccagccgcc gccgaatcat gtcgatgagt ccaaagcaca 361 cgactccgtt ctcagtgtct gacatcttga gtcccctgga ggaaagctac aagaaagtgg 421 gcatggaggg cggcggcctc ggggctccgc tggcggcgta caggcagggc caggcggcac 481 cgccaacagc ggccatgcag cagcacgccg tggggcacca cggcgccgtc accgccgcct 541 accacatgac ggcggcgggg gtgccccagc tctcgcactc cgccgtgggg ggctactgca 601 acggcaacct gggcaacatg agcgagctgc cgccgtacca ggacaccatg aggaacagcg 661 cctctggccc cggatggtac ggcgccaacc cagacccgcg cttccccgcc agtaagtgag 721 gccgccccac tgcggggccg cgggctgagc tcaggaggtg cggcgagagg ctccagaagg 781 cgcggcgccg gcaggctgcg cgctgggcat cagggagggc ggcccggcag cggcgccagg 841 gacttgggtg cgggagctgg ggatgcttcc ccctgctcgg ctgggggtcc aagaacaggc 901 
acttggtagc gctggggtcc tgcggtcaga tgcgggtact cggcgtctcc taggcgcggt 961 ggactggcag ctctgctcgg cgcagaagac ctcggggagc caagggaagc gaccccgagc 1021 tcaaggagca ggggcgagca gagcgcggag aggctagacc gggccaggag ggaggctgcc 1081 ctgttgggag gcactcgagc gcccggcccg gccctctctc cagcggagtc tgggcaggtg 1141 ggaggactcg cagttccaga ggggactcta agggtccgag caggtgccct cactggggcc 1201 tgacaggaga gaagccaaga ggcaaagcgt ctgggggctc cagcttttgg aagtcaacac 1261 cccctctcct aacctctcca aactggggtc taccgtagga ccccagctcc cggcctgagc 1321 ccagttcgcc gcctgtggcc agctaatcct aatgctctga cccgggctgg gcacgaaagg 1381 agcagaagcg gcctttcccc cactgcgtct tttggttcga aagagggaac tgagactgag 1441 ggagggcagc cagggttggg gctgtgagcg ctccagtaca gccccctcga cggtacggcc 1501 tggggcaggc gctggcagtt ccccgcggat gggcctcttg ggccccagcg ctaggctgcc 1561 tgggtcagga gggcgccgtc ggttggggcg ggccgggcgg gccaatggcg cggaaaacag 1621 gggtggcctg gctcggcctg gccccggccg acgctgtgcg tttgtcgctt acagtctccc 1681 gcttcatggg cccggcgagc ggcatgaaca tgagcggcat gggcggcctg ggctcgctgg 1741 gggacgtgag caagaacatg gccccgctgc caagcgcgcc gcgcaggaag cgccgggtgc 1801 tcttctcgca ggcgcaggtg tacgagctgg agcgacgctt caagcaacag aagtacctgt 1861 cggcgccgga gcgcgagcac ctggccagca tgatccacct gacgcccacg caggtcaaga 1921 tctggttcca gaaccaccgc tacaaaatga agcgccaggc caaggacaag gcggcgcagc 1981 agcaactgca gcaggacagc ggcggcggcg ggggcggcgg gggcaccggg tgcccgcagc 2041 agcaacaggc tcagcagcag tcgccgcgac gcgtggcggt gccggtcctg gtgaaagacg 2101 gcaaaccgtg ccaggcgggt gcccccgcgc cgggcgccgc cagcctacaa ggccacgcgc 2161 agcagcaggc gcagcaccag gcgcaggccg cgcaggcggc ggcagcggcc atctccgtgg 2221 gcagcggtgg cgccggcctt ggcgcacacc cgggccacca gccaggcagc gcaggccagt 2281 ctccggacct ggcgcaccac gccgccagcc ccgcggcgct gcagggccag gtatccagcc 2341 tgtcccacct gaactcctcg ggctcggact acggcaccat gtcctgctcc accttgctat 2401 acggtcggac ctggtgagag gacgccgggc cggccctagc ccagcgctct gcctcacgct 2461 tccctcctgc ccgccacaca gaccaccatc caccgctgct ccacgcgctt cgacttttct 2521 taacaacctg gccgcgttta gaccaaggaa caaaaaaacc acaaaggcca aactgctgga 2581 cgtctttctt 
tccccccccc actctaaaat ttgtgggttt ttttttttaa aaaaaagaaa 2641 atgaaaaaca accaagcgca tccaatctca aggaatcttt aagcagagaa gggcataaaa 2701 cagctttggg ggtgtctttt tttggtgatt caaatgggtt ttccacgcta gggcggggca 2761 cagattggag agggctctgt gctgacatgg ctctggactc taaagaccaa acttcactgt 2821 gggcacactc tgccagcaaa gaggactcgc ttgtaaatac caggattttt tttttttttt 2881 tgaagggagg acgggagctg gggagaggaa agagtcttca acataaccca cttgtcactg 2941 acacaaagga agtgccccct ccccggcacc ctctggccgc ctaggctcag cggcgaccgc 3001 cctccgcgaa aatagtttgt ttaatgtgaa cttgtagctg taaaacgctg tcaaaagttg 3061 gactaaatgc ctagttttta gtaatctgta cattttgttg taaaaagaaa aaccactccc 3121 agtccccagc ccttcacatt ttttatgggc attgacaaat ctgtgtatat tatttggcag 3181 tttggtattt gcggcgtcag tctttttctg ttgtaactta tgtagatatt tggcttaaat 3241 atagttccta agaagcttct aataaattat acaaattaaa aacgattctt ttt // LOCUS HSU19906 6402 bp DNA PRI 21-NOV-1996 DEFINITION Human arginine vasopressin receptor 1 (AVPR1) gene, complete cds. ACCESSION U19906 NID g755463 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6402) AUTHORS Thibonnier,M., Graves,M.K., Wagner,M.S., Auzan,C., Clauser,E. and Willard,H.F. TITLE Structure, sequence, expression, and chromosomal localization of the human V1a vasopressin receptor gene JOURNAL Genomics 31 (3), 327-334 (1996) MEDLINE 96435428 REFERENCE 2 (bases 1 to 6402) AUTHORS Thibonnier,M. 
TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) Marc Thibonnier, Department of Medicine, Case Western Reserve University School of Medicine, 10900 Euclid Avenue, Cleveland, OH 44106-4982, USA FEATURES Location/Qualifiers source 1..6402 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="library LL12NL01, Charon 40, ATCC library # 57782" /chromosome="12" prim_transcript 28..>5479 /evidence=experimental gene join(2001..2970,5193..5479) /gene="AVPR1" CDS join(2001..2970,5193..5479) /gene="AVPR1" /codon_start=1 /product="arginine vasopressin receptor 1" /db_xref="PID:g755464" /translation="MRLSAGPDAGPSGNSSPWWPLATGAGNTSREAEALGEGNGPPRD VRNEELAKLEIAVLAVTFAVAVLGNSSVLLALHRTPRKTSRMHLFIRHLSLADLAVAF FQVLPQMCWDITYRFRGPDWLCRVVKHLQVFGMFASAYMLVVMTADRYIAVCHPLKTL QQPARRSRLMIAAAWVLSFVLSTPQYFVFSMIEVNNVTKARDCWATFIQPWGSRAYVT WMTGGIFVAPVVILGTCYGFICYNIWCNVRGKTASRQSKGAEQAGVAFQKGFLLAPCV SSVKSISRAKIRTVKMTFVIVTAYIVCWAPFFIIQMWSVWDPMSVWTESENPTITITA LLGSLNSCCNPWIYMFFSGHLLQDCVQSFPCCQNMKEKFNKEDTDSMSRRQTFYSNNR SPTNSTGMWKDSPKSSKSIKFIPVST" misc_feature 2163..2219 /gene="AVPR1" /note="encodes transmembrane region I" misc_feature 2277..2324 /gene="AVPR1" /note="encodes transmembrane region II" misc_feature 2385..2450 /gene="AVPR1" /note="encodes transmembrane region III" misc_feature 2526..2570 /gene="AVPR1" /note="encodes transmembrane region IV" misc_feature 2652..2720 /gene="AVPR1" /note="encodes transmembrane region V" misc_feature 2883..2954 /gene="AVPR1" /note="encodes transmembrane region VI" misc_feature 5228..5293 /gene="AVPR1" /note="encodes transmembrane region VII" BASE COUNT 1804 a 1411 c 1401 g 1786 t ORIGIN 1 gaattctctg agtcagtgac tattacctaa ttgcttgaag gattttttcc agacaggtgg 61 tctggaaacc ttttacctat taccttccat ccctgaacca tttcaatctt ctgcctcctg 121 gatatcttgg agaaaatgaa ccaacacaac acagctttca gtttttagag catttccccc 181 atacagaaca ttgtcttact tgatcttccc gatgacctca acaacaggaa aggcaggtcc 241 tttcatttcc atttataaga cgcacagacc caggattatc tagccacagg aagcaggact 301 ccagatttca agtccagcat 
ctcaacgtga caaccttggt aactctgcat gaacggactg 361 gatagtaaag tggaattatt actgagaact gcaatgaata aaatcttttg cattttttgc 421 ctacgtttca cagagggtga tatttttctg aggcaattaa atttatacca cggccacaat 481 actgaaacgt tctgaccaac aaagtcatgc tcctgcatct acacagcaga taactgcaga 541 aacggcttcc tttcttcctt gtaaaattgc ctgaaaacag ctcccccttg ctgtccgtcg 601 aggcatatct tcaccaacgt taaaacagag ctgagggaga tcgcatttct gcctccctcc 661 cgccctgcag aggggctcca gctgttcaga gtaacggatt actaggtagg tggttgtttc 721 ccctccttcc cagggcctct ttcctctctt tgagattgcc tctttcttac tcctgagcac 781 aggagccggg cgggttttct gtcccttgcc ctggacagca ctgcctggat ggccgctgtc 841 cggcagctgc tctttgtcca cccaaaaaga tgtccccacg actcagtagt aaccagacgg 901 tccccacgga ccactgcggc caaatttccg ccatccccgc tgtgggaatc aggcttttcc 961 cgcagaaaac cccaggaatc tagagaaaac tccttaagtc cctagtctcc atagagaaaa 1021 ccaggagaca ctccccccaa accccgctgt gaatacaggc acagcagcca ctggggcctg 1081 aaagtgatga gtgcgttctt cccgtcgcaa acatagggta ataaatagca tgcatcaaag 1141 acgttactag gaagagatag ctctttaagt cacgaggggg gagaaatgtt tgccccggga 1201 aaatttgcct ggggaataaa atttgccaga ctgctgcacg ggtgagctcg gtgagaagga 1261 agaaacccgg actggaggag gtgaggtcga gagccaggtt caggtgcagg agctagatgc 1321 gtggacgccg gtgcgtggac tggaggtttc caggtaccgc gcttagcgtg cctgttgaag 1381 tcaaatgcat ggttaaggag gctagcgagg aaggctagtg agggaagctt gtggaaacgg 1441 ctacgagccc agaaaaggca tgactcgtca gttgtccaag tttttggaag ggaaaagcgg 1501 gaaagcgcca cgatcccacc tactgtgagg aggaatctgc gagtctccca gctccacccc 1561 ctccacagtg atgcagagga caaacaccga cgtagggaga ggaaaaaata aaactccagg 1621 gagcggggag taggcaacca gcagtcttcc ggcaataggg cgggagggag cgcgtcccaa 1681 ggaaacaagc accgcataaa tacttgagtt gggaacccag tgcttccgga agctcggagc 1741 tcaccttccc gacctcgccg aagttgaaaa aaggcagagc agggagaggg gccagctcac 1801 cctgctgaga gctgctcagt gggcaggcgg gacgctgctc cgggagacgc ccactggagg 1861 gatcgcagag cccggcaagc tgcgagcgcg ccaaagaccc tgcgcttcgg acgaggagcc 1921 caagtcctcc gagacgggga gggagcgcgc cgcgagggct ggagctccga agagggccga 1981 gtaggagctg catggacagc atgcgtctct ccgccggtcc 
cgacgcgggg ccctcgggca 2041 actccagccc atggtggcct ctggccaccg gcgctggcaa cacaagccgg gaggccgaag 2101 ccctcgggga gggcaacggc ccaccgaggg acgtgcgcaa cgaggagctg gccaaactgg 2161 agatcgccgt gctggcggtg actttcgcgg tggccgtgct gggcaacagc agcgtactgc 2221 tggctctgca ccggacgccg cgcaagacgt cccgcatgca cctcttcatc cgacacctca 2281 gcctggccga cctggccgtg gcattcttcc aggtgctgcc gcaaatgtgc tgggacatca 2341 cctaccgctt ccgcggcccc gactggctgt gccgcgtggt gaagcacctg caggtgttcg 2401 gcatgtttgc gtcggcctac atgctggtag tcatgacagc cgaccgctac atcgcggtgt 2461 gccacccgct caagactctg caacagcccg cgcgccgctc gcgcctcatg atcgcggccg 2521 cctgggtgct gagcttcgtg ctgagcacgc cgcagtactt cgtcttctcc atgatcgagg 2581 tgaacaatgt caccaaggcc cgcgactgct gggccacctt catccagccc tggggttctc 2641 gtgcctacgt gacctggatg acgggcggca tctttgtggc gcccgtggtc atcttgggta 2701 cctgctacgg cttcatctgc tacaacatct ggtgcaacgt ccgcgggaag acggcgtcgc 2761 gccagagcaa gggtgcagag caagcgggtg tggccttcca aaaggggttc ctgctcgcac 2821 cctgtgtcag cagcgtgaag tccatttccc gggccaagat ccgcacggtg aagatgactt 2881 ttgtgatcgt gacggcttac atcgtctgct gggcgccttt cttcatcatc cagatgtggt 2941 ctgtctggga tcccatgtcc gtctggaccg gtacgtgccg ggaaaataga ggaaagtgca 3001 gggataggag tgtgtgtgtg tgtgtgtgtg tgtgtgtgag agagagagag agagagagag 3061 agaaaaaaaa atgagaatct agcaattttc ttcatagtat cttctagggc agtagttttt 3121 aaacttctat gtgtacataa gagtcgcacc ttggaggaac actgtttaaa aaaaaaaaaa 3181 gcatattcct ggattcccca tgcagacatt ctaattaaga atgtctggaa tggggctcag 3241 gactctgcat ttgtagtgtg caatgatcct aagaccacaa ttccagaaac tgtgtcctag 3301 agtcagccaa gcaactacaa agccgaagaa tgacttttgc taaatcacta gctactgtat 3361 taacaccatt tcaggtctat tggattcagg gaaatatttt tactgtcaca actgcttttt 3421 gttggattgt gaaaagtatt acaattaatt ttaaaaatgt gtagaaatgc ctcaggggca 3481 aggatagaaa cagatatata tttctaaaga aagctggaga aaaattcttt agaaggtgag 3541 taattccaat ttggcattta gctaataatg ttcctttctc gttataatct atttttttct 3601 aacaatgatt aatagttttt tcttggtatt tcaaagcaga attattgata aactaaattc 3661 attagactaa ttcattagtc tttcaccact catttgacca cacggctcta 
catctctcca 3721 cccaactctt acagtagttt actggaaatg ttctctgcca cctactattg tgtttgattt 3781 taattccttc ccaaactaga aaaactaact tcacagtgac ttgcaaaaaa aattatttaa 3841 ttttgcatct tgaaaatatt ttcttctagt aaagacaaaa actcaaacta aataaattcc 3901 agtgcttgtc ggaactcaaa ccaaataaat tcaatggtac agttgttacc agaatgtttc 3961 tggtaacagg aaataggtat ctgggagtag ctgttctcct tttgcttttt ggaattggag 4021 taggggaggt atgtaatatg ctttggaagt tatatttgca aataaaacat ttcagtatga 4081 atttaactta aatattctta ctgactataa tactagcgat aatgaaaaat acaatataaa 4141 cactttattt ttggtttgct atttcttatc ttgcttgatc ttagaagcct cttcatattg 4201 tccatcaaat aaagaaattc agtctaatta ttgctttagc agaatttaca ctcaagtaat 4261 aaaaacttca attgtgcata gatatgttgg taattttcat tctttgtgaa taccatctta 4321 cccatggctc ctgatcacct ttgatagcag catcttagca ctaagtatga ttaaataata 4381 acctgtaatt gttttctggc ataacaagag tgagaagatc caagtttata tttaataatc 4441 aaggaaaagt cagtgtttat tgattattct tatttttaga aaaggtatat tatcagcact 4501 gtagctccac tgtgaaaggt tataatattt atgcagttta ccagtgctaa ttatcataaa 4561 atattttaga atcctgttgg aatttcctaa ttctactgtt cttcttaata taatttgttt 4621 gaaccacaac cacagatggt tttccaaatt tctaaccaaa gaaaaacaac taaagcttat 4681 atcatccagg gacttcttct gtatggtttt catattataa gaatatttaa aactactaaa 4741 cttgatccct aatgcaatat tttttcctga gttattagga taaatacaat ttggtataca 4801 tggttattta aaattatctt aaaatttcat tacaattgta gccattctgt aactgctgtg 4861 tcattagcac atgctagttc gagtatagaa gatagagatt ttttaaatca attacttaat 4921 agtcttaacc tcgtaaaatt cccactcaat tctatttaaa tattttgata gtgttttaaa 4981 aatacttgaa ttaattttaa ggcatcttgc ttacaaaaat attttatagt caagcaattt 5041 tcaaacactc cccatttccc tgattgataa acaaaatagt tcattttcta tgataatcca 5101 gaagtttatg ccttcttaat tagttaatag aaaaatgagt ttatccatgg ttcacttaca 5161 ttacatgatt tccctttata tttttcatgc agaatcggaa aaccctacca tcaccatcac 5221 tgcattactg ggttccttga atagctgctg taatccctgg atatacatgt tttttagtgg 5281 ccatctcctt caagactgtg ttcaaagctt cccatgctgc caaaacatga aggaaaaatt 5341 caacaaagaa gatactgaca gtatgagcag aagacagact ttttattcta acaatcgaag 
5401 cccaacaaac agtacgggta tgtggaagga ctcgcctaaa tcttccaagt ccatcaaatt 5461 cattcctgtt tcaacttgag ccttgcattc atgcaacttg attcttgtga ttgacttttt 5521 ggctcattag ctgaattgag ctagaaatca caagaacaaa tacactttat taatataacc 5581 ataaatcaat tcattgtgta tgagactgtg tttctagttg cattttcata ttgctaccaa 5641 aaactagaca ttattttgta tggaatatta atggaaacat gctgtactaa aatatgcagg 5701 tctgattccc agaaatacaa cagaagttat atttttaaag gaaaaatcat aaccacccta 5761 gctttatatt ttgttgttag tttcttttat tttcatttct aacataagta agacttgatt 5821 ggtttaaaag tcacataaaa tgcggcacta tttctgaaca aagagagctc atcatcagtc 5881 ttaatattca gagaaaactt cagagaaatt atgttttcat ccattaaaat taatttgtgc 5941 atcagaaaat gcagccttaa acagtgtcca ggagatggga tggtacctcc taggagtaca 6001 agtgcctggg gtgtaatgag ctcctgctca ttgtggccag tttagagttc tattagaagc 6061 tatcaatcac cttgcatttc aaaatggtaa ctttacaact ggcagtggcc tccttttggt 6121 tcctcacata ttattggtca agaaaagcat gaaaactgag atgctgaagg tgagaggaaa 6181 tgttgactgg ccaaaaatat cttttttccc ccactgcaag gttgttttaa agtcagattt 6241 gtataaggaa agccaaattt tattaaaaga gtagaaaagg attgcttaag gtactctgga 6301 ctttctcttg gacattgtaa acgtattttg atcagtatta caagggtatc ctgtgctatg 6361 ctggacatta acaagatcat tatcttcatg tttggggaat tc // LOCUS HSU20325 2483 bp DNA PRI 13-SEP-1996 DEFINITION Human cocaine and amphetamine regulated transcript CART (hCART) gene, complete cds. ACCESSION U20325 NID g665578 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2483) AUTHORS Douglass,J. and Daoud,S. TITLE Characterization of the human cDNA and genomic DNA encoding CART: a cocaine- and amphetamine-regulated transcript JOURNAL Gene 169 (2), 241-245 (1996) MEDLINE 96194810 REFERENCE 2 (bases 1 to 2483) AUTHORS Douglass,J.O. TITLE Direct Submission JOURNAL Submitted (26-JAN-1995) James O. 
Douglass, Vollum Institute, Oregon Health Sciences, University, 3181 SW Sam Jackson Park Road, Portland, OR 97201, USA FEATURES Location/Qualifiers source 1..2483 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="ATCC 37385" /chromosome="5" TATA_signal 72..77 exon 103..280 /number=1 5'UTR 103..121 gene join(122..280,706..789,1329..1436) /gene="hCART" CDS join(122..280,706..789,1329..1436) /gene="hCART" /note="psychomotor stimulant regulated protein; CART" /codon_start=1 /product="cocaine and amphetamine regulated transcript" /db_xref="PID:g665579" /translation="MESSRVRLLPLLGAALLLMLPLLGTRAQEDAELQPRALDIYSAV DDASHEKELIEALQEVLKKLKSKRVPIYEKKYGQVPMCDAGEQCAVRKGARIGKLCDC PRGTSCNSFLLKCL" intron 281..705 /number=1 exon 706..789 /gene="hCART" /number=2 intron 790..1328 /number=2 exon 1329..1866 /number=3 3'UTR 1437..1866 polyA_signal 1846..1851 polyA_site 1867 BASE COUNT 631 a 574 c 608 g 670 t ORIGIN 1 cccgggccct cctccacccc cccttccttc ttcgcctcct ccctctttcc tgcacggggg 61 ctcgggctca ctataaaagg tgggagcgcg tggtgcccca gcaacgacga gtttcagaac 121 gatggagagc tcccgcgtga ggctgctgcc cctcctgggc gccgccctgc tgctgatgct 181 acctctgttg ggtacccgtg cccaggagga cgccgagctc cagccccgag ccctggacat 241 ctactctgcc gtggatgatg cctcccacga gaaggagctg gtcggtattc ccctcgctct 301 cgaccccctt gacgtgtcgc cttgtctctt ctcttgcacg cctccctcct tccccccacc 361 cccactccta ttcccagagt cagggcgcgg ggagctgagc gcaacgccca ggcacccact 421 gccatccgaa gagcgactcg agctcacggg ctcctggcag tctgttgagc gaatccctca 481 tcccggcccc tctgagcaac aggggcccca gcggctcaga gacccgcggt cagtacctgg 541 gacagcgtcg ctaagtttcc acccctcgac cattccctgt gtccgcggag tcccaccgca 601 gagtgcgtgt gggtccgggg ctccttataa ctagggctgg aagtgcgcac ctgggctggg 661 ctcgcagcaa ggcgcaactt caggctccga agcggtgtgt tgcagatcga agcgctgcaa 721 gaagtcttga agaagctcaa gagtaaacgt gttcccatct atgagaagaa gtatggccaa 781 gtccccatgg taaggtttgt ggtcactccc ttcccgtgtt tttccaagag aaagtacacc 841 gccttgaatc gtacacacag ctccgtagga tgtggctaaa taacttaggt aatgggcttg 901 caggattctg tgggctcctt cttccttccc 
gggtgaggaa atgggaaagc aggaacaggg 961 gttgtaagaa agtgtaagtc tattgtttgt tgctcaggaa aaaggtctga tttttttccc 1021 tctgagaggg caagaaaagg agccaggaaa tgtgatgctc cccttcccac gccccccaac 1081 cctcgccact taaaggtgga agaaactagg ataaaactaa taatgtaagt ttctttaaaa 1141 aatgtactct cactgaggtt ataagcacaa ggctccctgt ttcagatctg actgtacgtc 1201 gacctcttgt gatggtgatg gggtccaatt gcccctttca agagacagaa attgcgttga 1261 ctgtgagact tgcctgttgg gaacctgggt ttgttcatac tcgatgacca cacattttgt 1321 tgtttcagtg tgacgccggt gagcagtgtg cagtgaggaa aggggcaagg atcgggaagc 1381 tgtgtgactg tccccgagga acctcctgca attccttcct cctgaagtgc ttatgaaggg 1441 gcgtccattc tcctccatac atccccatcc ctctactttc cccagaggac cacaccttcc 1501 tccctggagt ttggcttaag caacagataa agtttttatt ttcctctgaa gggaaagggc 1561 tcttttcctg ctgtttcaaa aataaaagaa cacattagat gttactgtgt gaagaataat 1621 gccttgtatg gtgttgatac gtgtgtgaag tattcttatt ttatttgtct gacaaactct 1681 tgtgtacctt tgtgtaaaga agggaagctt tgtttgaaaa ttgtattttt gtatgtggca 1741 tggcagaatg aaaattagat ctagctaatc tcggtagatg tcattacaac ctggaaaata 1801 aatcacccta agtgacacaa attgaagcat gtacaaatta tacataataa agtgttttta 1861 ataattgccc atagtgcact gctgttttca tataagtaat ttaagtggaa atggtgagat 1921 taatcatgct gttgttttca aagaaaaata tttcaaaaat agcagcctat tggaaatgca 1981 ctacgtcaga gttgatcgta tagagttgca gcagttagta tacctatttc ttgatgcagc 2041 gagtgtgtgt gtatgtgtgt gtgttagtgt gtgtgtgtgt gtgtgtgtga gagagagaga 2101 gagagagaaa gagagagatg aatgagatgg agatggttgg agaagaggtt atataatttt 2161 gtttattaaa acctttagcc agacccttta ctttaaacag tgagaccaat aaactataaa 2221 cagtttcatg ttttagtcac attaaaagca atttgaaaaa ttagaaattt tgttttgaca 2281 actcccttat tagaaaatat acattgattt aaagatatgg gctgtttagg gttgttattt 2341 gtctaaagac tccaaggtta taagacccat ccatcccaca agtaaattca cactcttgga 2401 aaaattctct attccaggag aaagagtcat ttcagaaaat agttttgagg ggaacaaata 2461 aaaattggag gaggtgagaa ttc // LOCUS HSU20499 8447 bp DNA PRI 29-MAR-1995 DEFINITION Human thermolabile phenol sulfotransferase (stm) gene, complete cds. 
ACCESSION U20499 NID g736378 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8447) AUTHORS Aksoy,I.A. and Weinshilboum,R.M. TITLE Human thermolabile phenol sulfotransferase gene (stm): molecular cloning and complete structural characterization JOURNAL Biochem. Biophys. Res. Comm. 208, 786-795 (1995) REFERENCE 2 (bases 1 to 8447) AUTHORS Aksoy,I.A. TITLE Direct Submission JOURNAL Submitted (31-JAN-1995) Ibrahim A. Aksoy, Pharmacology, Mayo Clinic/Mayo Foundation, 200 First St. SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..8447 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ICRFP700O01102" /clone_lib="No.700 P1 human genomic DNA library of Fiona Francis" /chromosome="16" /map="16p11.2" /cell_line="4X" 5'UTR join(624..736,905..992,4357..4360) /note="alternative 5'-UTR exists" exon 624..736 /number=1 exon 905..992 /number=2 5'UTR join(2848..2960,4357..4360) /note="alternative 5'-UTR" exon 2848..2960 /number=3 exon 4357..4508 /number=4 gene join(4361..4508,4613..4738,4828..4925,6322..6448, 6544..6638,7137..7317,7440..7552) /gene="STM" CDS join(4361..4508,4613..4738,4828..4925,6322..6448, 6544..6638,7137..7317,7440..7552) /gene="STM" /codon_start=1 /product="thermolabile phenol sulfotransferase" /db_xref="PID:g736379" /translation="MELIQDTSRPPLEYVKGVPLIKYFAEALGPLQSFQARPDDLLIN TYPKSGTTWVSQILDMIYQGGDLEKCNRAPIYVRVPFLEVNDPGEPSGLETLKDTPPP RLIKSHLPLALLPQTLLDQKVKVVYVARNPKDVAVSYYHFHRMEKAHPEPGTWDSFLE KFMAGEVSYGSWYQHVQEWWELSRTHPVLYLFYEDMKENPKREIQKILEFVGRSLPEE TMDFMVQHTSFKEMKKNPMTNYTTVPQELMDHSISPFMRKGMAGDWKTTFTVAQNERF DADYAEKMAGCSLSFRSEL" exon 4613..4738 /gene="STM" /number=5 exon 4828..4925 /gene="STM" /number=6 exon 6322..6448 /gene="STM" /number=7 exon 6544..6638 /gene="STM" /number=8 exon 7137..7317 /gene="STM" /number=9 exon 7440..7938 /number=10 3'UTR 7553..7938 polyA_signal 7919..7923 polyA_site 7939 BASE 
COUNT 1825 a 2351 c 2322 g 1949 t ORIGIN 1 acctctgcct cctggttcca agcaatcctc cttcctcacc ctccagagta gctgggatta 61 cacgcgcctg ccaccgcgcc tggcctaatt tttgtatttt tagtagagat gggggtttcc 121 aaccatgttg gccaggctgg tctccaaact cctgacctca ggtgatcctg cccacctaag 181 cctcccaaaa tgctggtatt acaggcatga gccaccgtgc ccggcctaaa taattaataa 241 aataatggac gatgggtgcc ttctactgag ctcccggtaa ttgtgagtga gtagaggact 301 tgccctgggg acattcagtg acctgctggg tgttgctgag ctgtgaggaa gttcaggtct 361 ggctgcagtg gtgaggctgt gactcaatca atcactgctg atgctcccag gacctgcacc 421 agcttagtcc taggggcaag gattttaact gtccacctca gtttcttcat ttgtaagatg 481 caaataacag tcacccctgc ctcatgggat ggagctgtgt aatgcccgca acagtgcctg 541 ctgcatagag gggttgctgc cagctgcctc tccctccttg tctcttacct gcctgctgcc 601 tgggtcagga tgaagagggg cccttgtgtt gcccccaccc tggctgcctg ctaagggccc 661 atgtgatctg cctggcagag gagtttcttc aggaagaacc agggcagctt ctgcccctag 721 agggccaatg cccttggtga gtgcagtccc ctggccccag cctggtccac ctctgggaag 781 agggtgccca gttgtgcaat ccaggcccag gcagctgagc cctcatctca gcatgcaggg 841 cggatactgg agggggcttg tggcatctga ctctgtatct cctacctgcc cctctccttg 901 gtagctgtga gaagtcactg ctttggggag acctgatctg gctgtgccag atggacactg 961 agaaagaagt agaagactca gaattagaag aggtgagtgg gctttggtgg cgggctccct 1021 accccactcc ctgccctggg ctgcctgtga ccacactgct tgcctctgca ggcacactgg 1081 acagacctgc tggagacctg atcctcagtg tccttacccc ctcctacctc ttttctgtgc 1141 cacctgctgt gggtccagca ggtttttact tgagtacaat aaaaagtctg agtcaagggt 1201 gccttatggt ggatgctgag gggaggggcg gagctagtag cccaaggtcc tgccagtcac 1261 ggggcttcct caggggcaca gaggaggcag gaggggcccc tggccctagc acgtgaacag 1321 cttctactct gcctggaaac cccatgcctc agctttcccc tacttgcctc tgagctcatg 1381 caattcttgg aagcctggga gacttacctt gaaattgaat gcaaatagga caaagaccaa 1441 ggaggatggg gggatgccct ccttccacgg ggccctgtgg cttccaagtc ttaatctcct 1501 ctagtctctt gtctacggag cctccttcaa acccagggaa agaaaagcac ctgccagggt 1561 tgtttttctt ctaggatctt ctattgatgc tctgtgaggt cccccaggag ccatgaagct 1621 agggctggct cctagggcaa tgggactaca gtgtccttgt cctttcttat 
tctttctgtt 1681 ctttctttct ttcttttttt tttttttttt tttttttgag acagagtctc actctgttgc 1741 ccaggctgga gtgcagtggt gtgatcttgg ctcactgaaa cctccgcctc ctgggttcaa 1801 gtgattctct tgcctcagcc tcctgagtag ctaggattac aggtgcccgc catcatgccc 1861 agctaatttt tgtattttta gtagagacag ggtttcacca tgttggccag cttggtctcg 1921 aactcctgac ctcaggtgat cctgctgcat cgacctccca aagtactggg attacaggcg 1981 tgagccacca cgctcagcct ctttcttgtt ctatatgtcc atgctctgct ccacttctgc 2041 cccttcactc tgccccacac atcactccag actggccttg tggtcagagc ctggaatgcc 2101 tgggctgctg ggggcctgtg gactgcactg ggccagaacc cctgccgcct tcaagactgg 2161 cctgtagcca gcaggtaggt gacttttccc aggccggcct atcccacctt tcccctccac 2221 tcactcacct cccttgcctg ggtcaattag agaaagcttg tcggccaggc atggtggctc 2281 atgcctgtaa tctcagcact ttgggaggcc gaggcgggcg gatcatctga gctcaggagt 2341 ttgagaccag cctggccaac atggcaaaac cccgtctcta ctaaaaatac aaaaattaac 2401 cggatgtggt ggtgtgcacc tgtaatccca gctactcggg aggctgaggc agaagaatcg 2461 cttgaaccca ggagggggag gttacagtga gcggagatcg tgctactgca ttgcagcctg 2521 ggcgagagag cgagtctcca tctcacataa aaaaaagaaa aagaaagaaa gcaagcttgt 2581 ctgttggcct gccctgcagg gtggagttca gagggaaggt caggagccta gtgacagctc 2641 aaaaaaaaaa aaacccaaat accaatgttg gccccttttg cctttcattc atgtgttttc 2701 tatacactaa actcacatat tgggtttgca gatcactcca agcttggctg gagctgtggt 2761 ggtaaggagg gtaatagaga agcttcccca ccctcaaccc caccccttcc ttcctggagt 2821 tcccagccct gactttagat ccctcccaca ctggaccttc aaaaccctca gggcagagag 2881 cagccctaca ctccctacac cacacccata ctcagcccct gcaggcaagg agagaacagg 2941 tcaggttccc gagagctcag gtgagtgaca cgttggaatg gcccagggca ccttcaccct 3001 gctcagcttg tggctccaac attctagaag ccgaggcctc tgccatccct gccctttccc 3061 atggatattc catttcaatt agacaaccca gcctggccgg aatccccctg cgttccttct 3121 tttcctttgt gtatttttga gacagggtgt tgctccgtca cccaggctgg agtgtagtgg 3181 gatcctggcc cactgcagcc tcaaattcct aggctgaggc aatcctgccg cctcagcctc 3241 ctgagtagct ggggttacaa gagcaagcca ccacacccag ctaattttga aaaatatttt 3301 ttgtagagga gaggtcttgc tttgttgtcc aggttggtct caaactccag ggctcaaggg 
3361 atcctttccc gttggcctcc caaggctctg ggattacagg cgggagtcac cctgcctggg 3421 cccctccttt tgatgagtca tcagttttca ttcccgcacg aggctctagc ccctggtacc 3481 agcttagttg ctcaatgggc tgtgtttgtt ctggagccca gatggactgt ggccaggcaa 3541 gtggatcaca gacctggccg gcctgggagg tttccacatg tgaggggcat gaggggggct 3601 caaggagggg agcatcgggg agaggagcgc actgggtgga ggctgggggt cccagcagga 3661 aatggtgaga caaagggcgc tggctggcag ggagacagca caggcaggcc ctagagcttc 3721 ctcagcacag ctggactctc ctggagacct tcacacaccc tgatatctgg gccccgcgct 3781 acgagggtgc tttcactggt ctgcactatg ccccaggccc tgggattttg aacagctctg 3841 caggtgactg aaaggtgcgg ccaggctggg gaacgacctg gtttcagccc cagccccgcc 3901 actgactgac tttgtgagtg cgggcaagtc actcagcctc cctaggcctc agtgacttcc 3961 ctgaaagcaa aaactctgca aaggggcagc tgggtgctgg ctcacacctg taatcccagc 4021 actttgggag gctgaggtag acaaatcact tgaggccagg agttctagac cagcctggcc 4081 aacatggtga aaccccatct ctactaaaga aaaaaaaaaa ttagctgagc atggttgtac 4141 atgcttgtaa tcccagctac ttgggatgcc gaggcgggag gattgcttga acccaagagg 4201 tggagtttgc agtgagctga gattgtgcca cactgcactc cagcttgggt gagagtgaga 4261 ctccatctca aaaaaaaaaa aaaaaagaga gaatcccact ttcttgctgt tgtgatggtg 4321 gtaagggaac gggcctggct ctggcccctg atgcaggaac atggagctga tccaggacac 4381 ctcccgcccg ccactggagt acgtgaaggg ggtcccgctc atcaagtact ttgcagaggc 4441 actggggccc ctgcagagct tccaagcccg acctgatgac ctgctcatca acacctaccc 4501 caagtctggt aagtgaggag ggccacccac cctctcccag gcggcagtcc ccaccttggt 4561 cagcaaggtc gtgccctcag cctgctcacc tcctatctcc ctccctctcc aggcaccacc 4621 tgggtgagcc agatactgga catgatctac cagggcggcg acctagagaa gtgtaaccgg 4681 gctcccatct acgtacgggt gcccttcctt gaggtcaatg atccagggga accctcaggt 4741 gcatggctgg gtcctggggg taagggaagt ggaggaagac agggctgggg cttcagctca 4801 ccagaccttc cctgacccac tactcagggc tggagactct gaaagacaca ccgcccccac 4861 ggctcatcaa gtcacacctg cccctggctc tgctccctca gactctgttg gatcagaagg 4921 tcaaggtgag gccggcctca atggttcaca cctgtcatcc cagtttgaga ctgaggaggg 4981 aggatccctt gaaggcgaga gatggagacc agcctgggca acattgctgt agagatgaca 5041 
tcccatctct acaaaaataa aattaacaac ctggtatggt ggcatagact gttcccagtt 5101 acttaggagg ctcagcgggg aggactgttt atgcaaatag gaagctgcaa tgagccctga 5161 tgatcctgct gctgcactcc agcctgggca acacagcaaa accatctcta cgaaaaaaaa 5221 agttcccact gactggcaag gaaagccagg aaggggggct caggtgccct ctcagccatg 5281 tacctgttct tctggaaggg cctcctcgct tctgccaggc tcatcacatc tttttttttt 5341 ttgagacaga gtcttgctct gtcaccctgg ctggagtgca gtggcatgat ctcagctcac 5401 tgcaacctcc gcctccccag ttcaagtgat tctcctgcct cagcctcctg agtagctggg 5461 attacaggcg tgtgctacca cacccggcta atttttgtat tctttttagt agagacgggg 5521 tttcaccatg ttggtcaagt ggatctcaaa ctcttgacct tgtgatcctc ctgcctcgac 5581 ctcacaaagt gctggaatta caggcgtgag ccaccgcgcc tggccctttt tttttttgag 5641 acagtttcac tcttgttgcc gaggctagag cgcaatcgtg tgatctcggt tcactgcaac 5701 caccgcctcc tgggttcaag caattctcct gcttcagcct cccaaggagc tgggattaca 5761 ggtacctgcc accacgcccg gctaattttg tatttttagt agagatgggg tttcaccatg 5821 ttggtcaggc tggtcttgaa ctcctgacct caggtgatct ggcaccttgg cctcccaaag 5881 tgccgggatt agaggcatga gccaccacgc ccagccttca tcacatcttg agagaggaca 5941 ctgtctgcct cttgctctga tgagggtctg atgcaaagga tagtgagtct ctacagtgca 6001 cacttaagaa aggcagcatg tgggtgctca caggtcaggc ggaggagggg gagctggtgg 6061 ggaccaggca tgccttgctc cagatcagga tatgatggca ttggtgcaga ttatattagt 6121 atagaatatg gtctcaggaa ccaggcagga ctttggcttc cgagcagggt tcagatccca 6181 gcttggccct acctgtgcag tgagatctca agcaagtcag cctctaagcc tcaggttcct 6241 cctttgccag ttcaacagat gagctggcct ggggtgggct gtgtggtgat ggtgctgggg 6301 ctgggtcctc tgcccctgca ggtggtctat gttgcccgaa acccaaagga cgtggcggtc 6361 tcctactacc atttccaccg tatggaaaag gcgcaccctg agcctgggac ctgggacagc 6421 ttcctggaaa agttcatggc tggagaaggt gggcttgact ggaggaagga gggtgtgaag 6481 ccgaggggtg gtggctataa cgtacagcaa ccctgtgtcg gtgccccctg cccgcttctc 6541 tagtgtccta cgggtcctgg taccagcacg tgcaggagtg gtgggagctg agccgcaccc 6601 accctgttct ctacctcttc tatgaagaca tgaaggaggt gagaccgact gtgatgcttc 6661 cccccatgtg acacctgggg gcaggcacct cacagggacc caccaaggcc acccagcccc 6721 gtccctgggc 
ggctcccaca gcaagcccgg attccccatc ctacctccct ggcccaggcc 6781 cccccactgc agccccacct ggcagcaggc tcggcacagc tttcatcttc tgcacctgag 6841 tcagctgcat gggtggccac ggatcagata cttagtccta ttgcttatcc tcaccaaagg 6901 gtgtgccacc cagggccaca gtcatggaag aagaccatcc cggtcctcac ccataggcgc 6961 caagccctgt tcatgatggg atcacagggc agagatcaat tcattttact ccagagacta 7021 gggccccagg ggttgaggct ctttggggtt tctaggggaa gtggccagat cccctctgag 7081 gttagagagg gggacccgtt ttgttttgct ccactgagga gccctctgct gctcagaacc 7141 ccaaaaggga gattcaaaag atcctggagt ttgtggggcg ctccctgcca gaggagacca 7201 tggacttcat ggttcagcac acgtcgttca aggagatgaa gaagaaccct atgaccaact 7261 acaccaccgt cccccaggag ctcatggacc acagcatctc ccccttcatg aggaaaggtg 7321 ggtgctggcc agcacggggg tttggggcgg gtgggagcag cagctgcagc ctccccatag 7381 gcacttgggg cctcccctgg gatgagactc cagctttgct ccctgccttc ctcccccagg 7441 catggctggg gactggaaga ccaccttcac cgtggcgcag aatgagcgct tcgatgcgga 7501 ctatgcggag aagatggcag gctgcagcct cagcttccgc tctgagctgt gagaggggct 7561 cctggagtca ctgcagaggg agtgtgcgaa tctaccctga ccaatgggct caagaataaa 7621 gtatgatttt tgagtcaggc acagtggctc atgtctgcaa tcccagcgat ttgggaggtt 7681 gagctggtag gatcacaata ggccacgaat ttgagaccag cctggtaaaa tagtgagacc 7741 tcatctctac aaagatgtaa aaaaattagc cacatgtgct ggcacttacc tgtagtccca 7801 gctacttggg aagcagaggc tggaggatca tttcagccca ggaggttgtg gatacagtga 7861 gttatgacat gcccattcac tacagcctgg atgacaagca agaccctccc tccaaagaaa 7921 ataaagctca attaaaataa aatatgattt gtgttcatgt agagcctgta ttggaaagga 7981 agagaaactc tgagctgaaa gagtgaatgc ccggtggggc cacatatggt cacctctccc 8041 ccagccttca gctccccagg tcaccatatc tggggagggg agaagggttt ggagaagtaa 8101 aacccaggag atgtgtggag gggggatgtc tgtttaatcc cagcacatcc tctgctgtcc 8161 tgccccaaga tggtggagga cgtcgagtcc gccgggcagc gtcacttttt cttgggctcc 8221 ttagaagcta ccaggtacct ctgggccaca ctgagatgag gggagtagcc gcctgcatag 8281 gaggtgtctt caaacaggat agtatagtcc ctcctggggg ttgtgggggt aggtggccaa 8341 ggaagggtag aggagcaagc ccccggggct ggttgtcaac tcactttgtt ggctggaatt 8401 ggttgtaact tgaccacctc 
gggcaggatc ccactgctca tccccaa // LOCUS HSU20758 3143 bp DNA PRI 07-FEB-1996 DEFINITION Human osteopontin gene, complete cds. ACCESSION U20758 NID g1001962 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3143) AUTHORS Crosby,A.H., Edwards,S.J., Murray,J.C. and Dixon,M.J. TITLE Genomic organization of the human osteopontin gene: exclusion of the locus from a causative role in the pathogenesis of dentinogenesis imperfecta type II JOURNAL Genomics 27 (1), 155-160 (1995) MEDLINE 95394452 REFERENCE 2 (bases 1 to 3143) AUTHORS Dixon,M. J. TITLE Direct Submission JOURNAL Submitted (07-FEB-1995) Michael J. Dixon, School of Biological Sciences, University of Manchester, Stopford Building, Manchester UK M13 9PT FEATURES Location/Qualifiers source 1..3143 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q13" 5'UTR join(126..212,360..373) mRNA join(126..212,360..427,537..575,884..964,1232..1273, 1597..1920,2305..3143) /product="osteopontin" exon 126..212 /note="exon 1 is not translated" /number=1 intron 213..359 /number=1 exon 360..427 /number=2 CDS join(374..427,537..575,884..964,1232..1273,1597..1920, 2305..2709) /codon_start=1 /product="osteopontin" /db_xref="PID:g1001963" /translation="MRIAVICFCLLGITCAIPVKQADSGSSEEKQLYNKYPDAVATWL NPDPSQKQNLLAPQNAVSSEETNDFKQETLPSKSNESHDHMDDMDDEDDDDHVDSQDS IDSNDSDDVDDTDDSHQSDESHHSDESDELVTDFPTDLPATEVFTPVVPTVDTYDGRG DSVVYGLRSKSKKFRRPDIQYPDATDEDITSHMESEELNGAYKAIPVAQDLNAPSDWD SRGKDSYETSQLDDQSAETHSHKQSRLYKRKANDESNEHSDVIDSQELSKVSREFHSH EFHSHEDMLVVDPKSKEEDKHLKFRISHELDSASSEVN" intron 428..536 /number=2 exon 537..575 /number=3 intron 576..883 /number=3 exon 884..964 /number=4 intron 965..1231 /number=4 exon 1232..1273 /number=5 intron 1274..1596 /number=5 exon 1597..1920 /number=6 intron 1921..2304 /number=6 exon 2305..3143 /number=7 3'UTR 2710..3143 BASE COUNT 983 a 601 c 628 g 931 t ORIGIN 
1 ggggaagtgt gggagcaggt gggctgggca gtggcagaaa cctgatgaca caatctcgcc 61 gcctccctgt gttggtggag gatgtctgca gcagcattta aattctggga gggcttggtt 121 gtcagcagca gcaggaggag gcagagacag catcgtcggg accagactcg tctcaggcca 181 gttgcagcct tctcagccaa acgccgacca aggtacagct tcagtttgct actgggttgt 241 gcattcagct gaatttcatg gggaagtcca aattctaagg aaaaaaatgt ggtagtataa 301 aaaggtatca ctgttgtaac ctatgaagat gtcagctatt cctttgaaat attttgcagg 361 aaaactcact accatgagaa ttgcagtgat ttgcttttgc ctcctaggca tcacctgtgc 421 cataccagtg agtacagttg catcttaaag aaaattcctg aaaataactg aattgtgtgc 481 ttccatgtgc taggaggaca ttcttgtaat ctttcttcat cttttctgtt tctaaggtta 541 aacaggctga ttctggaagt tctgaggaaa agcaggtaag catcttttat gtttttatat 601 agttaaatca tttactcaat tatggcgaga ggtgcaagaa acgtatttgc tgcgatcaaa 661 tgagttcata tttgtaaagc aatttgaaag agtgcctagc ccacagtaag tgctacataa 721 gagtttgtta aatgaatctg caaaaaaaaa aaaaattaca aaaaggtacc taagggtccg 781 ggtgactata tgcttccatc aagactagtg aagaatggtt gttttttcca ttcatcccta 841 catttctttt tttaataatg ataaacatgc aacttttttg tagctttaca acaaataccc 901 agatgctgtg gccacatggc taaaccctga cccatctcag aagcagaatc tcctagcccc 961 acaggtattt ttaaacttct cataattaaa ctacagtgat gaaagatagc cacactcagg 1021 ccatttgggc tgctcagatg aatcctgccc tgcctgctgg caaacatgtg cttaggacat 1081 tgactgatct gccatgttgg cttctctctg tgttaagcca tccacagatg aggctgaaaa 1141 ataaaaactg ctttggatta aaaaggttaa cttttgaata aaaaagctag gcatgtgtga 1201 tgcgcactaa cacgtgccat tccttcttca gaatgctgtg tcctctgaag aaaccaatga 1261 ctttaaacaa gaggtaagtt ctcattttca atcagaggcc catcatgcct tgaagagatg 1321 aaagaaggca ttgcctggat tctcttctga tgaaatttca ttagcaagtt ttccagctaa 1381 ttggcagtct aaaacttgct cataaataaa acatgtattt actaaatatc agaaatacta 1441 ggtttcctcg gataacctaa aagccatggt atgtactgtg aatgcaaaga ttctgaaact 1501 aaataaaaag aaagatagta aaagactaat gtgctataaa ggctaaggga aaataaaaac 1561 ccatatatta attttcccgg ccatcttaat tttcagaccc ttccaagtaa gtccaacgaa 1621 agccatgacc acatggatga tatggatgat gaagatgatg atgaccatgt ggacagccag 1681 gactccattg actcgaacga 
ctctgatgat gtagatgaca ctgatgattc tcaccagtct 1741 gatgagtctc accattctga tgaatctgat gaactggtca ctgattttcc cacggacctg 1801 ccagcaaccg aagttttcac tccagttgtc cccacagtag acacatatga tggccgaggt 1861 gatagtgtgg tttatggact gaggtcaaaa tctaagaagt ttcgcagacc tgacatccag 1921 gtaaatcctt taacagacac acctgatggt tctgactagc gctcaagtct aggaaaccac 1981 agtttgcata ttcattcatt cattcatcca ttcattcatc cattcagcaa gaattcattc 2041 atattctact ttatgaccat tgaatacaaa tctttttctg cttggcggtt tttgtaagtc 2101 tacataattt ctctctagat ttgattctca aacacaattc tactttttga aatcctggat 2161 caaagtaaca tgctagtatt atttcagcca gatttagaca atttttagta taagatgacc 2221 taaaagctag agagtggaaa aggattacca tattcccatc cctagccgtt catataatta 2281 ttcttcattt gtgccgtgat tcagtaccct gatgctacag acgaggacat cacctcacac 2341 atggaaagcg aggagttgaa tggtgcatac aaggccatcc ccgttgccca ggacctgaac 2401 gcgccttctg attgggacag ccgtgggaag gacagttatg aaacgagtca gctggatgac 2461 cagagtgctg aaacccacag ccacaagcag tccagattat ataagcggaa agccaatgat 2521 gagagcaatg agcattccga tgtgattgat agtcaggaac tttccaaagt cagccgtgaa 2581 ttccacagcc atgaatttca cagccatgaa gatatgctgg ttgtagaccc caaaagtaag 2641 gaagaagata aacacctgaa atttcgtatt tctcatgaat tagatagtgc atcttctgag 2701 gtcaattaaa aggagaaaaa atacaatttc tcactttgca tttagtcaaa agaaaaaatg 2761 ctttatagca aaatgaaaga gaacatgaaa tgcttctttc tcagtttatt ggttgaatgt 2821 gtatctattt gagtctggaa ataactaatg tgtttgataa ttagtttagt ttgtggcttc 2881 atggaaactc cctgtaaaca aaagcttcag ggttatgtct atgttcattc tatagaagaa 2941 atgcaaacta tcactgtatt ttaatatttg ttattctctc atgaatagaa atttatgtag 3001 aagcaaacaa aatactttta cccacttaaa aagagaatat aacattttat gtcactataa 3061 tcttttgttt tttaagttag tgtatatttt gttgtgatta tcttttgtgg tgtgaataaa 3121 tcttttatct tgaatgtaat aag // LOCUS HSU20982 4619 bp DNA PRI 07-MAR-1995 DEFINITION Human insulin-like growth factor binding protein-4 (IGFBP4) gene, promoter and complete cds. ACCESSION U20982 NID g695253 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4619) AUTHORS Strong,D.D., Morales,S., Lee,K., Boonyaratanakornkit,V., Baylink,D.J. and Mohan,S. TITLE Cloning of the human insulin-like growth factor binding protein-4 gene and identification of the proximal promoter JOURNAL Unpublished REFERENCE 2 (bases 1 to 4619) AUTHORS Strong,D.D. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) Donna D. Strong, J.L. Pettis Veterans' Hospital, Research 151, 11201 Benton Street, Loma Linda, CA 92357, USA FEATURES Location/Qualifiers source 1..4619 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HIGFBP43" /clone_lib="pWE15 cosmid library of Clontech, Palo Alto, CA" /sex="female" /tissue_type="placenta" promoter 1..396 mRNA join(397..1030,1579..1736,2525..2659,2760..>3992) /gene="IGFBP4" 5'UTR 397..681 /gene="IGFBP4" gene join(397..1030,1579..1736,2525..2659,2760..>3992) /gene="IGFBP4" exon 397..1030 /gene="IGFBP4" CDS join(682..1030,1579..1736,2525..2659,2760..2894) /gene="IGFBP4" /note="IGF binding protein-4" /codon_start=1 /evidence=experimental /product="insulin-like growth factor binding protein-4" /db_xref="PID:g695254" /translation="MLPLCLVAALLLAAGPGPSLGDEAIHCPPCSEEKLARCRPPVGC EELVREAGCGCCATCALGLGMPCGVYTPRCGSGLRCYPPRGVEKPLHTLMHGQGVCME LAEIEAIQESLQPSDKDEGDHPNNSFSPCSAHDRRCLQKHFAKIRDRSTSGGKMKVNG APREDARPVPQGSCQSELHRALERLAASQSRTHEDLYFIPIPNCDRNGNFHPKQCHPA LDGQRGKCWCVDRKTGVKLPGGLEPKGELDCHQLADSFRE" intron 1031..1578 exon 1579..1736 /gene="IGFBP4" intron 1737..2524 exon 2525..2659 /gene="IGFBP4" intron 2660..2759 exon 2760..>3992 /gene="IGFBP4" 3'UTR 2895..>3992 /gene="IGFBP4" BASE COUNT 833 a 1445 c 1349 g 992 t ORIGIN 1 gcagagccgg gagtccgggc taggaagtcc ctttctcggt gggagactga ggccgccttg 61 gcggggcggg acgagactcc tccgaggtcg ggaaaggggg ccccgcagca gccccttggc 121 ttcccttctc ccttgcctcc cctccggggc tccggttcag aggcactctg ggcgcctgct 181 acagcttcca aactgcgccg 
cttccttctt cggcagaaaa ggactttcag atgcggcggc 241 ggcggcggcg gcgactcagg acagcgcccc ctccccctaa cggccgcctc tccctctccc 301 cctcgccgcc ccggctcccc cacctctggg aaggcgctgg gggtgtggcc agggaccggt 361 ataaagtccg ggggagccgg tcccgggcag ccgctcagcc ccctgcccct cgccgccccc 421 cgccgcctgc ctgggccggg ccgaggatgc ggcgcagcgc ctcggcggcc aggcttgctc 481 ccctccggca cgcctgctaa cttcccccgc tacgtccccg ttcgcccgcc gggccgcccc 541 gtctccccgc ggcctccggg tccgggtcct ccaggacggc caggccgtgc cgccgtgtgc 601 cctccgccgc tcgcccgcgc gccgcgcgct ccccgcctgc gcccagcgcc ccgcgcccgc 661 gccccagtcc tcgggcggtc catgctgccc ctctgcctcg tggccgccct gctgctggcc 721 gccgggcccg ggccgagcct gggcgacgaa gccatccact gcccgccctg ctccgaggag 781 aagctggcgc gctgccgccc ccccgtgggc tgcgaggagc tggtgcgaga ggcgggctgc 841 ggctgttgcg ccacttgcgc cctgggcttg gggatgccct gcggggtgta caccccccgt 901 tgcggctcgg gcctgcgctg ctacccgccc cgaggggtgg agaagcccct gcacacactg 961 atgcacgggc aaggcgtgtg catggagctg gcggagatcg aggccatcca ggaaagcctg 1021 cagccctctg gtaaggtacc cctgcctccc aattccctcc tgagtagcgc tcccttccca 1081 gcggcttctt ccccattcca gccgccctgg aaggccctta aaaatcccct atgagttgaa 1141 gagaaggcag gtacgtggca gggccgaaaa aatcagagag ccacagggag aacgatcaga 1201 tggagagggg aatggtggga ctgaatggaa ggggacagga tccagcaggg ggtccccctg 1261 cctgcctcag acctccctgc agggcccagg ggaccctcct gccatctggg cagctgcagc 1321 tggtgactca tctgagcgct ggcctgagtg ggtgagggac aagtcaagga cttcaagagg 1381 aatattctcc tgagagatcc aggggagagg ggaggaggta ggggctgctc ttgtttcttc 1441 ccgctcccca ggccccctgc tcttctccag cccaagccga agaatcccca ggaaggagaa 1501 cgtgggtggg aggctgggtt ggtgtgacgc tctgacctct tccggtgctg acctctcctt 1561 atcgctacct gaatacagac aaggacgagg gtgaccaccc caacaacagc ttcagcccct 1621 gtagcgccca tgaccgcagg tgcctgcaga agcacttcgc caaaattcga gaccggagca 1681 ccagtggggg caagatgaag gtcaatgggg cgccccggga ggatgcccgg cctgtggtaa 1741 ggacctccga tgcacaaatg tgcatgtgca tagacacaca cacacacaca tgccccctgc 1801 cccccacatg cacgcaccca cacacaccat caccaccaga tctggggcgt gttcattcag 1861 cacacattct agggtgacta ctgtgtgcaa ggtgcaacta 
gtgtgagaca tcagggccca 1921 gagaaagcac tcattccctt ccttgggatt cctttcctgc ccatcagtta tatacatcgg 1981 gggaggttaa gtgattatta aatactgaaa aacttctata ccaattggta aaaagcagtt 2041 acccaaagcg ccctggggtt tcttggcctt tggggggttc cccaccccag accttctcag 2101 taaccaaggg taatgctggc catagcaggg ggcctgggac cctgctccag ggtctggcta 2161 atttcccttc tgtaaggtct tacaagttgt cagttggaag ttttatctgt cccttctgca 2221 tagaaatcac cccctccctg gctttctgca taggttccca cagccccttt ccccacagtg 2281 tcccctcact tcccacccct cctttgtaga agctctttgg ctcgagaccc tttcctccac 2341 gtctctaccc tcccagtgtg tcctccagac ccaggcctct ctcttcacct cactgggtcc 2401 tgcccagccc ctgggggtca ggcctccttt cgggggcctt cagttctcac ttagctctga 2461 ccccaggcct gggcctcctg cctctcttcc ttctgctgag caattttgtc ttcccctcct 2521 ccagccccag ggctcctgcc agagcgagct gcaccgggcg ctggagcggc tggccgcttc 2581 acagagccgc acccacgagg acctctactt catccccatc cccaactgcg accgcaacgg 2641 caacttccac cccaagcagg tgggtctctg tctcccgctg gcttggccct ggactcagct 2701 ctggggcatg gtctcttttc ctgcgtcgga actgacccct catgtccttc tcttggcagt 2761 gtcacccagc tctggatggg cagcgtggca agtgctggtg tgtggaccgg aagacggggg 2821 tgaagcttcc ggggggcctg gagccaaagg gggagctgga ctgccaccag ctggctgaca 2881 gctttcgaga gtgaggcctg ccagcaggcc agggactcag cgtcccctgc tactcctgtg 2941 ctctggaggc tgcagagctg acccagagtg gagtctgagt ctgagtcctg tctctgcctg 3001 cggcccagaa gtttccctca aatgcgcgtg tgcacgtgtg cgtgtgcgtg cgtgtgtgtg 3061 tgtttgtgag catgggtgtg cccttggggt aagccagagc ctggggtgtt ctctttggtg 3121 ttacacagcc caagaggact gagactggca cttagcccaa gaggtctgag ccctggtgtg 3181 tttccagatc gatcctggat tcactcactc actcattcct tcactcatcc agccacctaa 3241 aaacatttac tgaccatgta ctacgtgcca gctctagttt tcagccttgg gaggttttat 3301 tctgacttcc tctgattttg gcatgtggag acactcctat aaggagagtt caagcctgtg 3361 ggagtagaaa aatctcattc ccagagtcag aggagaagag acatgtacct tgaccatcgt 3421 ccttcctctc aagctagccc agagggtggg agcctaagga agcgtggggt agcagatgga 3481 gtaatggtca cgaggtccag acccactccc aaagctcaga cttgccaggc tccctttctc 3541 ttcttcccca ggtccttcct ttaggtctgg ttgttgcacc atctgcttgg 
ttggctggca 3601 gctgagagcc ctgctgtggg agagcgaagg gggtcaaagg aagacttgaa gcacagaggg 3661 ctagggaggt ggggtacatt tctctgagca gtcagggtgg gaagaaagaa tgcaagagtg 3721 gactgaatgt gcctaatgga gaagacccac gtgctagggg atgaggggct tcctgggtcc 3781 tgttccccta ccccatttgt ggtcacagcc atgaagtcac cgggatgaac ctatccttcc 3841 agtggctcgc tccctgtagc tctgcctccc tctccatatc tccttcccct acacctccct 3901 ccccacacct ccctactccc ctgggcatct tctggcttga ctggatggaa ggagacttag 3961 gaacctacca gttggccatg atgtcttttc ttctttttct tttttttaac aaaacagaac 4021 aaaaccaaaa aatgtccaga tgattgtgtt tggttgattt attctcagtt agacacaggg 4081 atgcaccagg ggtggagaga cggggacaga ttttgggagg tgagtattgt gtggtcccca 4141 gacctgtctg tatggtaagg gactgcagaa ggacggccaa tccaccttcc tcttccctgc 4201 aacggaagtt tcctagggaa ctccttggct tcaaagtctg cgctgtcttt acttagactc 4261 ctggtgggca aacaatggct cctgaaaggg gggcatgacc aaggacagcc ctgtggggca 4321 gagctgtctc tgggatcagc tggcatgtgg ggctggggca ttctagggca tcgggcggac 4381 tgggcttgca tctggatttg atttattaat ttgttgggga ggggcaggga cactgccctg 4441 catttgagga aagggggtag atgcttcagc acattccaca gctctgactg ccgagatctc 4501 tgactcgggg cattgtgctg agattggatt ctgaggttgg gaggggttga ctttgctgta 4561 gactcagtgc cagccacagc ttcagagatt gtgctcacat ggtatgcctg gactcttgg // LOCUS HSU22027 7215 bp DNA PRI 01-JAN-1997 DEFINITION Human cytochrome P450 (CYP2A6V2) gene, complete cds. ACCESSION U22027 NID g1008461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7215) AUTHORS Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. et,al. TITLE A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles JOURNAL Am. J. Hum. Genet. 57 (3), 651-660 (1995) MEDLINE 95397851 REFERENCE 2 (bases 1 to 7215) AUTHORS Fernandez-Salguero,P. 
TITLE Direct Submission JOURNAL Submitted (01-MAR-1995) Pedro Fernandez-Salguero, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..7215 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 782..790 exon 791..970 /gene="CYP2A6V2" /number=1 gene 791..6489 /gene="CYP2A6V2" CDS join(791..970,1237..1399,2115..2264,2499..2659,3207..3383, 4257..4398,4873..5060,5577..5718,6308..6489) /gene="CYP2A6V2" /codon_start=1 /product="cytochrome P450" /db_xref="PID:g1008462" /translation="MLASGMLLVALLACLTVMVLMSVWQQRKSKGKLPPGPTPLPFIG NYLQLNTEQMYNSLMKISERYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRG EQATFDWVFKGYGVVFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAI RSTHGANIDPTFFLSRTVSNVISSIVFGDRFDYKDKEFLSLLRMMLGIFQFTSTSTGQ LYEMFSSVMKHLPGPQQQAFQLLQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQ EEEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLHYGFLLLMKHPEVEAKVHEEIDRV IGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGIEVF PMLGSVLRDLRFFSNPRDFNPQHFLGEKGQFKKRDAFVPFSIRKRNCFGEGLARMELF LFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR" exon 1237..1399 /gene="CYP2A6V2" /number=2 exon 2115..2264 /gene="CYP2A6V2" /number=3 exon 2499..2659 /gene="CYP2A6V2" /number=4 exon 3207..3383 /gene="CYP2A6V2" /number=5 exon 4257..4398 /gene="CYP2A6V2" /number=6 exon 4873..5060 /gene="CYP2A6V2" /number=7 exon 5577..5718 /gene="CYP2A6V2" /number=8 exon 6308..6489 /gene="CYP2A6V2" /number=9 3'UTR 6490..6744 BASE COUNT 1646 a 2196 c 1746 g 1627 t ORIGIN 1 aagttcccct gaaatatggc tctggtcttc ctccccttgc caatgaagaa gatggcagtg 61 gaggttctat ggcagccatc ctggcctcac tctgaggttc caatgaggat tctgggcatc 121 aagagacagc tctgggcaaa gctaaatcaa gtcagcccct ggacccagtg ctgggctgct 181 gggctttctg ggagaacgcc gctgggcttg ctacacactc ctcctcccag aaactccaca 241 cccacagccc tgggtcttcc tagccccgag actttcaagt ccatatgcct ggaatccccc 301 ttcctgagac ccttaaccct gcatcctcca caacagaaga cccctaaatg cacagccaca 361 ctttgtctta ccctaataaa acccagacct ttggattcct ctcccctgga acccccagat 421 ccgcacaact ttggggtgca ttctcactct cagaccccaa atccaaagcc 
caagtgctcc 481 cctatgcaaa tattccaaac tcctcagttc tacagcttat ctgttgcccc ctcctaaatc 541 cacagccctg cggcacccct cctgaagtac cacagattta gtctggaggc cccctctctg 601 ttcagctgcc ctggggtccc cttatcctcc cttgctggct gtgtcccaag ctaggcagga 661 ttcatggtgg ggcatgtagt tgggaggtga aatgaggtaa ttatgtaatc agccaaagtc 721 catccctctt tttcaggcag tataaaggca aaccacccca gccgtcacca tctatcatcc 781 ctctaccacc atgctggcct cagggatgct tctggtggcc ttgctggcct gcctgactgt 841 gatggtcttg atgtctgttt ggcagcagag gaagagcaag gggaagctgc ctccgggacc 901 caccccattg cccttcattg gaaactacct gcagctgaac acagagcaga tgtacaactc 961 cctcatgaag gtgtcccaag acagggagat gggtgtctcg gggtgggggc tgcctagttg 1021 gctggggctt tgtggcaggg ggttgaccag tgtggaccag agtcttagga aatggagttt 1081 tggagtttca gcatcagaaa gacaggatct tgggatgtcc agctccctga ctgtgagaac 1141 ctgggtgcga agcatcccag cacatgacat ctcggtgctg ggccccattc agagtggagg 1201 gttctccctc taaccactcc cacccacctc catcagatca gtgagcgcta tggccccgtg 1261 ttcaccattc acttggggcc ccggcgggtc gtggtgctgt gtggacatga tgccgtcagg 1321 gaggctctgg tggaccaggc tgaggagttc agcgggcgag gcgagcaagc caccttcgac 1381 tgggtcttca aaggctatgg tgcccaagag ggggaaggtg ggcaggtgga cacgaaggtc 1441 tcagtgttcc cagccttctc cctgactctc ctgacaactg gaggataagg gagagtcccc 1501 agtctggtct tccctcccca tctccctaca ttggggcctc tccatgtgta tccctcacct 1561 gtctccagcg gccctgtcct gattcctccc tgcctctctc tgccccacct ccttattctc 1621 tctcactgga gtctcctctt tcccctctct ctccatctct aaggacatcc tgggtttctg 1681 tttaccagcc ctgggtctct gtctacatga gtctttgagg ccctcttagc ttctgggctt 1741 ctctgggttt ctcatctctc cggatccctt tctcaattct tcctctgtct taggatgcca 1801 gggttattcc tacttccaca tcttcaggct ccatctcctg gtaacagtct ctcttccttc 1861 cagaccctct ctgtttctat ctcaatatta aactctctgc tccagctcag cttaagaatc 1921 tcacaccaag agaggatgtc ctccacccag atctccccat atctcactac cccaccctcc 1981 atcctctgcc tccatcactc tctttctctc cccactgccc tgcggacgcg atccaatgga 2041 gtgtggagct aatgccgtga agctatgtgc atctctctgt ctggccgtac ctgggtaata 2101 acctgatcga ctaggcgtgg tattcagcaa cggggagcgc gccaagcagc tcctgcgctt 2161 
tgccatcgcc accctgaggg acttcggggt gggcaagcga ggcatcgagg agcgcatcca 2221 ggaggagtcg ggcttcctca tcgaggccat ccggagcacg cacggtgagc aggggacccc 2281 gagtgcgggg gcaggagaag gaaaacaccc aggacgagga acccgcgcgc gttctgcctg 2341 gggatgggga ctaggtgggg aaaggcgccc gcacttccag ccctggagtc tggcgctggg 2401 aatttggctc aacaaggccc tgcctcctgg aattctgact ctcctcagac ctctgagttg 2461 actctctccc caaccccctt ctcccgacat acccggaggc gccaatatcg atcccacctt 2521 cttcctgagc cgcacagtct ccaatgtcat cagctccatt gtctttgggg accgctttga 2581 ctataaggac aaagagttcc tgtcactgtt gcgcatgatg ctaggaatct tccagttcac 2641 gtcaacctcc acggggcagg taatggttgc agcccggccc gtgaaggccc ttaccaaaac 2701 cggcaaattg ttcccctacc gggggaaggg ggccccaaat tcccaccgcc ccccggacag 2761 tgtcccctca aaatcagtcc ccgatttggg caaattggca gagtggaacc agacccgggt 2821 tggttgtcca atcccctgct ctccagggac accgggatag cacaacagat gctccccaaa 2881 acagagcctg ctggcaggat gcataccctc agctcagctc tctcaccctg ggcacgtgtt 2941 cccatcccca acttaccggt aatttctaac agatgctccc tacccaggtc ttcttgaata 3001 ttttaacacc cggaaaccct gggtacctaa ccttccctgt aaactttaga gattagttcc 3061 tatccggccc ctctgaaata cctaaccacc ggagaccaga tgcctttaac tcagttcctt 3121 ccttgctatg aaacaaatcc cattcccatc agctcctgcc ccgtgacagc tgtccttccc 3181 ttcccatcct ctctctgcaa ccccagctct atgagatgtt ctcttcggtg atgaaacacc 3241 tgccaggacc gcagcaacag gcctttcagt tgctgcaagg gctggaggac ttcatagcca 3301 agaaggtgga gcacaaccag cgcacgctgg atcccaattc cccacgggac ttcattgact 3361 cctttctcat ccgcatgcag gaggtacacc ccagcagcca ctgcggggag atgcaaagcc 3421 aggcagaggg aaatcagtct gggagtgggg caggcagatg acacaggccc attcaaatta 3481 accctcatca taataatcct cacaattggc tgggtgccgt ggctaacagc ctgtaatccc 3541 agcactttgg gaggccgagg caggtggatc acctgaggtc aggagttcga gaccagcctg 3601 gccaacatgg tcaaaccccg tctctactaa aaatccaaaa attagttggg catggtggcg 3661 cgaagggggg cagaggttgc aatgagccaa gatcacggca ttgcactcca gtctgggtga 3721 cagaatgagg ccctgtgtca aaaaaaatta atcacttgtt taaaaagtaa gtgagcctgc 3781 atggtcatgc gcatgtgcag ctccagctac tcaggaggct gaggctggag gattgcttga 3841 gctcaggagt 
tggcgtccgg cctgtgcaac ttagcaagac caagtcagta taagaaaaaa 3901 aaaaaacaaa aaaaaagctg acagctaagt tgataattga cggacagatg gtcagcaagg 3961 taacgaaggt gagaaggaag agcattgggg gcaacgccag gagtcagggc aagggctggt 4021 tcctagagcg agtctggtag gatctagggc ccctcttctc caccctgcgg tcttgcccca 4081 aagagaggtc gagggtgctg ggattgcgct agactcgagt ctgtgtagat cttggggtcc 4141 cctcttgacc cccattggtc tgaacctaag agtggaagat ccatggggtg aacccctaga 4201 tggtgccctg aggtcaagca ggagtgaggt tgtcctaaag ccccctctcc cttcaggagg 4261 agaagaaccc caacacggag ttctacttga agaacctgat gatgagcacg ttgaacctct 4321 tcattgcagg caccgagacg gtcagcacca ccctgcacta tggcttctta ctgctcatga 4381 agcacccaga ggtggagggt aaggctggag ggggacggaa gtggagggcc ccagaccctc 4441 aaaattcccc ttcgactggt gcaatgtccc cacctgtccc agatcccggg accctgagac 4501 gtgacttgct gtccagagac agggcaacat tcagctggta ggcatcagct gagtctcatt 4561 agatattaaa atattgaaaa tgtctgcact gattggtcag tcacttctgt cccaagccca 4621 ctgagtgccc actgcccgtt ccaccgggtc atcccctaag ttcctccctg tgcctcccct 4681 gtgattctgg cacaacctgg ttaacaggat cctactccaa caatgcgaat gggtgatgtc 4741 tgttctgtta tgaatgctct acttccgtct cataggcgga ggcatttcat ccaccccatt 4801 ttgcctatcc ggactatcat ttcctgctct gagaccccta gatacctaaa cacattcccc 4861 ctcctccccc agccaaggtc catgaggaga ttgacagagt gatcggcaag aaccggcagc 4921 ccaagtttga ggaccgggcc aagatgccct acatggaggc agtgatccac gagatccaaa 4981 gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac accaagtttc 5041 gggatttctt cctccctaag gtgctatccg cccccacccc ccagactacg gggactccag 5101 cccctctctg tgtccccagc atcccaccca cattagaagc tttctagacc ctgtcccact 5161 ccctcaatca gtcaaaaaag acttccccaa ccaccacatc cgttccacct ttccacttag 5221 acactcctga gtcctgcatc tctccagact ctttgtgtca ggagaatcaa acacatgttc 5281 ccaaacttcc tatcttaaga aacagaagcc ccctttccat tcggcctttt gtcataggga 5341 cagaaatctc aggtccccca aactcctgcc tagaaggaca tggaccccat gtctcccaaa 5401 cttcctgttt cagagatgtg aaccttctat cccccaaggt cctccctcag aggtccccaa 5461 ttcccatgcc tgccacttcc cctcaccggg gcaccctagt tccccctcca gcccctgtgt 5521 actctcaaca atcccccaac 
ccgcctcatc acatacacct tcctcctccc tcccagggca 5581 tagaagtgtt ccctatgttg ggctccgtgc tgagagacct caggttcttc tccaaccccc 5641 gggacttcaa tccccagcac ttcctgggtg agaaggggca gtttaagaag cgtgatgctt 5701 ttgtgccctt ctccatcagt aagagaccac tgtttggtgc caggcttact actcacacca 5761 gcaggggcct cccttaccca gttcccctct ctgccgtgta gcctagtatt tccccagctt 5821 ggcaagttcc tgttagcaat ctaccgtcga gccaccaggt gatactccct taactaccaa 5881 gcacccagta cctgtgccca ggcaaaagga aaggaaacat catacccctt tcagaggcgg 5941 gggaaaacca aaggccagag agaatcagag atttatttcc ctagggtcac acaggagatt 6001 cttcagcatc cctaaaaagg agatgacggc acagcaggtc atatttggga gttcttatct 6061 gggggaaggg ggatcttaaa cctcccattg tggacacctg gcatcgatca accccatctt 6121 ttggtcatct tttgggtcac tcaaggaaac tgaggtcaag gagggtcaag aggctccctc 6181 ttaaagtctc tcagggccat atattccacc cttcctccct gggagagccg cagctggagg 6241 tcggtactgg ggcgaggctg cactgagagt gggcttcacc tccacccctc ccgcctctcc 6301 tcctcaggaa agcggaactg tttcggagaa ggcctggcca gaatggagct ctttctcttc 6361 ttcaccaccg tcatgcagaa cttccgcctc aagtcctccc agtcacctaa ggacattgac 6421 gtgtccccca aacacgtggg ctttgccacg atcccacgaa actacaccat gagcttcctg 6481 ccccgctgag cgagggctgt gccggtgaag gtctggtggg cggggccagg gaaagggcag 6541 ggccaagacc gggcttggga gaggggcgca gctaagactg ggggcaggat ggcggaaagg 6601 aaggggcgtg gtggctagag ggaagagaag aaacagaagc ggctcagttc accttgataa 6661 ggtgcttccg agctgggatg agaggaagga aacccttaca ttatgctatg aagagtagta 6721 ataatagcag ctcttatttc ctgagcacgt acccccgtgt cacctttgtt caaaaaccat 6781 tgcacgctca cctaatttgc cacaaaaccc ccttcgaagg ggcgttcatg cccattttac 6841 acgtgacaaa actgaggctt agaaagttgt ctctgatgtc tcacaaaaca taagtgccca 6901 gaaaatctgc gaacacagat ctgtgcccat agccttctag acagattctt aaaaagcacc 6961 tattcctcac gcaaaacagt ttagtataga atcacatggc ctgaacatcc ctgtccgggg 7021 gagttcccca gagacctggg gggtggttgc cctgccttca ctgcacacat gcccacactc 7081 tcacctactc aacatgctgt gactacccgg gtgtaatctg tgcttgctac cagataaggc 7141 cactgtagcc cattcagagt cagcccaggg acacaacgag acatgactgg acatacaggg 7201 tcagtccatt aacaa // LOCUS 
HSU23143 3869 bp DNA PRI 30-MAR-1995 DEFINITION Human mitochondrial serine hydroxymethyltransferase gene, nuclear encoded mitochondrion protein, complete cds. ACCESSION U23143 NID g746435 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3869) AUTHORS Stover,P.J., Shane,B., Stover,D.M. and Chen,L.H. TITLE Mitochondrial Serine Hydroxymethyltransferase gene JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3869) AUTHORS Stover,P.J. TITLE Direct Submission JOURNAL Submitted (20-MAR-1995) Patrick J. Stover, Nutritional Sciences, Cornell University, Savage Hall, Ithaca, NY 14853 USA FEATURES Location/Qualifiers source 1..3869 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13" CDS join(202..369,845..924,1077..1277,1575..1656,1817..1939, 2068..2207,2541..2706,2924..3023,3110..3373,3497..3624) /codon_start=1 /product="mitochondrial serine hydroxymethyltransferase" /db_xref="PID:g746436" /translation="MAIRAQHSNAAQTQTGEANRGWTGQESLSDSDPEMWELLQREKD RQCRGLELIASENFCSRAALEALGSCLNNKYSEGYPGKRYYGGAEVVDEIELLCQRRA LEAFDLDPAQWGVNVQPYSGSPANLAVYTALLQPHDRIMGLDLPDGGHLTHGYMSDVK RISATSIFFESMPYKLNPKTGLIDYNQLALTARLFRPRLIIAGTSAYARLIDYARMRE VCDEVKAHLLADMAHISGLVAAKVIPSPFKHADIVTTTTHKTLRGARSGLIFYRKGVK AVDPKTGREIPYTFEDRINFAVFPSLQGGPHNHAIAAVAVALKQACTPMFREYSLQVL KNARAMADALLERGYSLVSGGTDNHLVLVDLRPKGLDGARAERVLELVSITANKNTCP GDRSAITPGGLRLGAPALTSRQFREDDFRRVVDFIDEGVNIGLEVKSKTAKLQDFKSF LLKDSETSQRLANLRQRVEQFARAFPMPGFDEH" BASE COUNT 759 a 1042 c 1168 g 900 t ORIGIN 1 aagcttggct ccagaactgg cttttgaagg tggaggtggg ggtagtgaga actgaaacag 61 gctggctgga caactggcag cttggtgtgg ccttagaagc atgagtgagg gtggccggga 121 gacgatttgt ctggactgtt gtgatttcac cctgactttt cctgccttca gcctctgcag 181 agatgtgggc agctggtcag gatggccatt cgggctcagc acagcaacgc agcccagact 241 cagactgggg aagcaaacag gggctggaca ggccaggaga gcctgtcgga cagtgatcct 301 gagatgtggg agttgctgca gagggagaag 
gacaggcagt gtcgtggcct ggagctcatt 361 gcctcagagg tgggactggg gagatgggca ggggttgggc caccatgggt acaggaagta 421 acaaagttat cttaactgat atttctccaa aacccccttt cacactcagg acctttcttt 481 gggctttatc ttcctttctt atctccctca agcaaaggca gtgcaagtcc agtttatggg 541 gttgggacat ttagggagcc tccagggtcc ctacagtttc atctgatccc ttccttcctc 601 ctatgaggaa ggaggagcct agaagcacaa gtttgagtgg gtaggtggca ttgaggggcc 661 actgctcatg gcagatgggt ttctgagaat gctgcctctg gctttgcccc aggcctggtg 721 ctgagtgaat ggagctttct gcagggagta ctcccgcttt cagctctggc tctggcaggg 781 agggactgtg ggagtccagg ggaaggggct caataccttc tgacattgcc ccccaccacc 841 ccagaacttc tgcagccgag ctgcgctgga ggccctgggg tcctgtctga acaacaagta 901 ctcggagggt tatcctggca agaggtgagg gctggagggc agtgtcaggg atggtgctcc 961 cagtggggga acccacctgt accttcccag tgttcattga ggagtgaact tcccagtcct 1021 ttgctgatgg ttgagagtcc tttctctgtg ccctcattac ccctctccca cggcagatac 1081 tatgggggag cagaggtggt ggatgaaatt gagctgctgt gccagcgccg ggccttggaa 1141 gcctttgacc tggatcctgc acagtgggga gtcaatgtcc agccctactc cgggtcccca 1201 gccaacctgg ccgtctacac agcccttctg caacctcacg accggatcat ggggctggac 1261 ctgcccgatg ggggccagtg agtatggatg ggctggctga tggtcttggc ggcaggattg 1321 gtgtgggaaa ggagttattt attgaatacc tactgtggac catacagatg gaacaggcct 1381 tgccctgtcc tgcatgtcac agtggatgag gaagataaga tcccagttat agtgcctacc 1441 acagagtgga cagagcagtg aggcggtgtg tcctaggact ggtgttctgg ggacagagaa 1501 ctgtggagtt gaagggagtg gttaagtccg ggggtccttc cacccaggcc ttcttacttc 1561 ctctcacttc gcagtctcac ccacggctac atgtctgacg tcaagcggat atcagccacg 1621 tccatcttct tcgagtctat gccctataag ctcaacgtga gtgctctagg gtgtggggag 1681 gggctcttgg ccctggtggt ggtcctcccc tggagaagct gagggcctgg agcgccgggc 1741 cgtccttagg gttaaggagg agagtgagct gccctgcttc cttctcaggg ctttagctgt 1801 ttgtgtgtct gtccagccca aaactggcct cattgactac aaccagctgg cactgactgc 1861 tcgacttttc cggccacggc tcatcatagc tggcaccagc gcctatgctc gcctcattga 1921 ctacgcccgc atgagagagg ttggtggggg gggctggaga ctgggcacct ccccaggggg 1981 tggtgaggag gtgtgggagg agggcagcct tgggcaggcc tctccgggcc 
ctccccaggc 2041 tgaggccttg cctctgtacc tgcccaggtg tgtgatgaag tcaaagcaca cctgctggca 2101 gacatggccc acatcagtgg cctggtggct gccaaggtga ttccctcgcc tttcaagcac 2161 gcggacatcg tcaccaccac tactcacaag actcttcgag gggccaggtc aggtcctgag 2221 gtcgggcttg cctttccctg ccttcaggcc tattcctggg gcactgttgg cctggacctg 2281 agaggaattc attcccacct gcagccctta agactcctgc ccagtctgtg agagttctcc 2341 ttctcttgcc catggtggcc atgccctggc aggggatttg tggatgggat tgaggggctg 2401 attccctcta ccactggaat ccagtgtacc aagcccacgt gagctgtgcc cttggggccc 2461 aggtccgcca gcttcctctg ccttctctgt cccttgtcct tcttttcagc ttagactctg 2521 accatccacc tctcacacag gtcagggctc atcttctacc ggaaaggggt gaaggctgtg 2581 gaccccaaga ctggccggga gatcccttac acatttgagg accgaatcaa ctttgccgtg 2641 ttcccatccc tgcagggggg cccccacaat catgccattg ctgcagtagc tgtggcccta 2701 aagcaggttg gggatcctgt ctttgtaggg tgtggggggg caatggcctg gaggcttaga 2761 ccctgcacct tgctaactga tgctggggct gatggaaggg aaatgccagg atggaaggag 2821 tcaaggctgg ggtcacagag ctatgctgag ggtgcagggc cagagggtag tgcagggctt 2881 gggtccaggc ctagggtgac agctgctact gtctcatctc caggcctgca cccccatgtt 2941 ccgggagtac tccctgcagg ttctgaagaa tgctcgggcc atggcagatg ccctgctaga 3001 gcgaggctac tcactggtat caggtaagcc agcaggtgat gggtgagggc ctctgtagct 3061 tcaggcagag gcccaggact caccactccc catttcttac ccaccttagg tggtactgac 3121 aaccacctgg tgctggtgga cctgcggccc aagggcctgg atggagctcg ggctgagcgg 3181 gtgctagagc ttgtatccat cactgccaac aagaacacct gtcctggaga ccgaagtgcc 3241 atcacaccgg gcggcctgcg gcttggggcc ccagccttaa cttctcgaca gttccgtgag 3301 gatgacttcc ggagagttgt ggactttata gatgaagggg tcaacattgg cttagaggtg 3361 aagagcaaga ctggtgagtg agcaagaagg agccccgggc cagccagttc ccactcactg 3421 tctgctccct cccccagctg atctcactgc cttccctaga gctctgacca cttgtttcct 3481 caccctctct ctctagccaa gctccaggat ttcaaatcct tcctgcttaa ggactcagaa 3541 acaagtcagc gtctggccaa cctcaggcaa cgggtggagc agtttgccag ggccttcccc 3601 atgcctggtt ttgatgagca ttgaaggcac ctgggaaatg aggcccacag actcaaagtt 3661 actctccttc cccctacctg ggccagtgaa atagaaagcc tttctatttt ttggtgcggg 
3721 agggaagacc tctcacttag ggcaagagcc aggtatagtc tcccttccca gaatttgtaa 3781 ctgagaagat cttttctttt tccttttttt ggtaacaaga cttagaagga gggcccaggc 3841 actttctgtt tgaacccctg tcatgatca // LOCUS HSU23853 2296 bp DNA PRI 30-JAN-1996 DEFINITION Human dual-specific phosphoprotein phosphatase (PAC1) gene, complete cds. ACCESSION U23853 NID g775211 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2296) AUTHORS Yi,H., Morton,C.C., Weremowicz,S., McBride,O.W. and Kelly,K. TITLE Genomic organization and chromosomal localization of the DUSP2 gene, encoding a MAP kinase phosphatase, to human 2p11.2-q11 JOURNAL Genomics 28 (1), 92-96 (1995) MEDLINE 96070437 REFERENCE 2 (bases 1 to 2296) AUTHORS Kelly,K. TITLE Direct Submission JOURNAL Submitted (31-MAR-1995) Kathleen Kelly, Laboratory of Pathology, National Cancer Institute, National Institutes of Health, Building 10, Room 2A33, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2296 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p11.2-q11" 5'UTR 1..85 /gene="PAC1" gene join(86..473,558..679,1067..1286,1403..1617) /gene="PAC1" CDS join(86..473,558..679,1067..1286,1403..1617) /gene="PAC1" /note="corresponding PAC1 mRNA sequence can be located in GenBank Accession Number L11329" /codon_start=1 /product="dual-specific phosphoprotein phosphatase" /db_xref="PID:g775212" /translation="MGLEAARELECAALGTLLRDPREAERTLLLDCRPFLAFCRRHVR AARPVPWNALLRRRARGPPAAVLACLLPDRALRTRLVRGELARAVVLDEGSASVAELR PDSPAHVLLAALLHETRAGPTAVYFLRGGFDGFQGCCPDLCSEAPAPALPPTGDKTSR SDSRAPVYDQGGPVEILPYLFLGSCSHSSDLQGLQACGITAVLNVSASCPNHFEGLFR YKSIPVEDNQMVEISAWFQEAIGFIDWVKNSGGRVLVHCQAGISRSATICLAYLMQSR RVRLDEAFDFVKQRRGVISPNFSFMGQLLQFETQVLCH" 3'UTR 1618..2263 /gene="PAC1" BASE COUNT 369 a 729 c 720 g 478 t ORIGIN 1 ggagtcgacc gctcgggcag cgcaccgcca cgagagcccg ggacgcggga aagaccgaaa 61 ggaagaggaa gaggcaccgg 
tggccatggg gctggaggcg gcgcgcgagc tggagtgcgc 121 ggcgctgggc acgctgctgc gggatccgcg ggaggcggaa cgcacgctgc tgctggactg 181 ccgccccttc ctggccttct gccggcgcca cgtgcgcgcc gcgcggccag tgccttggaa 241 cgcgctgctg cggcgccgcg cgcgcggccc tcctgccgcc gttctcgcct gcctgctgcc 301 cgaccgcgcg ctgcggacgc gcctggtccg cggggagctg gcgcgggccg tggtgctgga 361 cgagggcagt gcctcggtgg cggagctccg gcccgacagc ccggctcatg tgctgctggc 421 cgcgctgctg cacgagaccc gcgcggggcc cactgccgtg tacttcctgc gaggtgagca 481 agccggccgc ccttggcccg ccccacccga ctccagcccg gctttccctg ctgatcgtgc 541 tcttcccctc cttgcaggag gcttcgacgg cttccagggc tgctgtcccg atctgtgctc 601 tgaggccccc gcccctgcgc tgccgccaac aggggacaaa accagccgct ccgactccag 661 ggctcctgtc tacgaccagg tgagtattga cccccctccc ccgaacctcc ctctctcccc 721 gccgcggact gggcgcctct agggaaagag cctgccattt ggggatgcga gcagtttgcc 781 catatacaca cacttttata cgtgtgtgtg ttgggggagg gggtggggca tggctgctcg 841 cgcgtgcctg tatgggtctg tgtatgtttg catgtgtatg ttgggagcat gaagggaaat 901 gtatgtcccg gtgtgtcctc cgcacattcc tgagacctgt ctcaggtcag gaggactggc 961 tgaggagtcc tcttgtctcg gccagcccat ggggtctcca cccgctgctg ctccagctct 1021 ccggggctgg gctggaaagg cctcaccgcc cctctgttcc ctccagggtg gccctgtgga 1081 gatcttgccc tacctgttcc tgggcagctg cagtcactcg tcagacctgc aggggctgca 1141 ggcctgtggc atcacagccg tcctcaacgt gtccgccagc tgccccaacc actttgaggg 1201 ccttttccgc tacaagagta tccctgtgga ggacaaccag atggtggaga tcagtgcctg 1261 gttccaggag gccataggct tcattggtaa gggggcacct ctgcccagaa atcccgaggg 1321 gctccagagg agaggactgg gggcacctgt ctcctgggct gtgcacattc cgtctgacac 1381 cactccccca tctcccttgc agactgggtg aagaacagcg gaggccgggt gctggtgcac 1441 tgccaggcgg gtatctcgcg ctctgccacc atctgtctgg catacctcat gcagagtcgc 1501 cgtgtgcggc tggacgaggc ctttgacttc gttaagcagc gccggggggt catctccccc 1561 aacttcagtt tcatggggca gctgctgcag tttgagaccc aggtgctgtg tcactgaggt 1621 ggtgcccctc tgcctgcctg ccccactgtg ctggcaggag ctgactgtgg actggtgggc 1681 tcccctctgg gccagcacag tcccctcacc tccggcaggg ctgctacctc ctcagagttt 1741 cagaagcccc cacatggggg ctctaggaat gccggcatgc 
tggtctttcc gacctggtgc 1801 tcttctgctg ggggactgag gctggccctc attcggggtc gggaaccaag ggtgtgtctg 1861 ctctttccct ccccatcctc tggcagaaat cagctagacg ctataccgtg gactctccct 1921 ggtccaccac catgttgaag cccttggcag cctgagagct ccaaggaaca agctgtgaca 1981 accaggagcc ctgtctgtgg gttcgtctgc ccagggcctg gagcccaagc cctgtgttcc 2041 tggggaagct ggggacttgg gaagtgatgg gtgtgtcatg ttgcgtgtgt ctgtctgtga 2101 gcctttcaca cctgtgctgg cgctggaaaa ttatttgtgc tcagctgaca tttaacactt 2161 cctcccccgc ttcctcctag ccctgtgggc aggggttgga aacttagcac tttatattta 2221 tacagaacat tcaggatttg tcaataaaat attgttatat ttaaaaaaca acaaaaaaaa 2281 aaaaaaaaag gaattc // LOCUS HSU24685 505 bp DNA PRI 08-MAY-1995 DEFINITION Human anti-B cell autoantibody IgM heavy chain variable V-D-J region (VH4) gene, clone E11, VH4-63 non-productive rearrangement. ACCESSION U24685 NID g799242 KEYWORDS Ig heavy chain; immunoglobulin; VH4-63; VH4.21; autoantibody; rearranged; fetal spleen; variable region; V-region; diversity region; D-region; joining region; J-region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 505) AUTHORS Parr,T.B., Johnson,T.A., Silberstein,L.E. and Kipps,T.J. TITLE Anti-B cell autoantibodies encoded by VH 4-21 genes in human fetal spleen do not require in vivo somatic selection JOURNAL Eur. J. Immunol. 24 (12), 2941-2949 (1994) MEDLINE 95104275 REFERENCE 2 (bases 1 to 505) AUTHORS Parr,T.B., Johnson,T.A., Silberstein,L.E. and Kipps,T.J. TITLE Direct Submission JOURNAL Submitted (12-APR-1995) Todd A. 
Johnson, Medicine, UCSD, 9500 Gilman Drive, 0663, La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..505 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="E11" /chromosome="14" /map="14q32.33" /tissue_type="fetal spleen" /cell_type="B lymphocyte" gene join(1..46,130..443) /gene="VH4" sig_peptide join(1..46,130..140) /gene="VH4" /note="VH4 leader-signal peptide" CDS join(1..46,130..443) /gene="VH4" /note="VH4-63 non-productive rearrangement" /codon_start=1 /product="immunoglobulin heavy chain variable region" /db_xref="PID:g799243" /translation="MKHLWFFLLLVAAPRWVLSQVQLQQWGAGLLKPSETLSLTCAVY GGSFSGYYWSWIRQPPGKGLEWIGEINHSGSTNYNPSLKSRVTISVDTSKNQFSLKLS SVTAADTAVYYCARAGA" intron 47..129 mat_peptide 141..443 /gene="VH4" /note="VH4-63 non-productive rearrangement" /product="immunoglobulin heavy chain variable region" misc_feature 432..505 /note="D-J region frameshift" BASE COUNT 105 a 136 c 147 g 117 t ORIGIN 1 atgaaacacc tgtggttctt cctcctcctg gtggcagctc ccagatgtga gtgtctcagg 61 aatgcggata tgaagatatg agatgctgcc tctgatccca gggctcactg tgggtttctc 121 tgttcacagg ggtcctgtcc caggtgcagc tacagcagtg gggcgcagga ctgttgaagc 181 cttcggagac cctgtccctc acctgcgctg tctatggtgg gtccttcagt ggttactact 241 ggagctggat ccgccagccc ccagggaagg ggctggagtg gattggggaa atcaatcata 301 gtggaagcac caactacaac ccgtccctca agagtcgagt caccatatca gtagacacgt 361 ccaagaacca gttctccctg aagctgagct ctgtgaccgc cgcggacacg gctgtgtatt 421 actgtgcgag agccggggcc taatagtggg agctactact gcttttgata tctggggcca 481 agggacaatg gtcaccgtct cctca // LOCUS HSU25816 2605 bp DNA PRI 16-JUN-1997 DEFINITION Human TATA-binding protein associated factor II 30 (TAFII30) gene, complete cds. ACCESSION U25816 NID g837262 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2605) AUTHORS Scheer,E., Mattei,M.G., Jacq,X., Chambon,P. and Tora,L. 
TITLE Organization and chromosomal localization of the gene (TAF2H) encoding the human TBP-associated factor II 30 (TAFII30) JOURNAL Genomics 29 (1), 269-272 (1995) MEDLINE 96079120 REFERENCE 2 (bases 1 to 2605) AUTHORS Tora,L. TITLE Direct Submission JOURNAL Submitted (27-APR-1995) Laszlo Tora, CNRS Inserm ULP, Inst. de Genetique et de Biologie Moleculaire et Cellulaire, Illkirch CEDEX, 67404, France FEATURES Location/Qualifiers source 1..2605 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.2-p15.5" /haplotype="23" /tissue_type="placenta" 5'UTR 1060..1084 gene join(1085..1316,1456..1610,1822..1886,1971..2085, 2263..2352) /gene="TAFII30" CDS join(1085..1316,1456..1610,1822..1886,1971..2085, 2263..2352) /gene="TAFII30" /note="TBP-associated factor II 30" /codon_start=1 /product="TATA-binding protein associated factor 30 kDa subunit" /db_xref="PID:g2193966" /translation="MSCSGSGADPEAAPASAASAPGPAPPVSAPAALPSSTAAENKAS PAGTAGGPGAGAAAGGTGPLAARAGEPAERRGAAPVSAGGAAPPEGAISNGVYVLPSA ANGDVKPVVSSTPLVDFLMQLEDYTPTIPDAVTGYYLNRAGFEASDPRIIRLISLAAQ KFISDIANDALQHCKMKGTASGSSRSKSKDRKYTLTMEDLTPALSEYGINVKKPHYFT " 3'UTR 2353..2434 polyA_signal 2411..2416 BASE COUNT 526 a 739 c 755 g 585 t ORIGIN 1 gaattcttgc aagactcagt tcagaagtca ccttctttcg tgaatgtttt gattccctga 61 ggctacttta ttttggtatg gctgaaaaat cctagatttt ctaaacaaaa cctgtttgaa 121 tcttggttct gatatggact aggagagaga ctgggtcaag taagcttatc tccctgaggc 181 tgtttcctcg tctgttaagt gtgaatatca atacctgcct ttcataatca ccagggaata 241 aagtggaata atgttgataa cagtgcttgg cacctggaag taggtggcag atgttaacgc 301 ccttcctccc ttgcattgcg ccccctgtgc ctacctctag cattgtaacg accacatagt 361 attgaaatgg ccagtttact tgtctgcctt cctttccaag accgttggtg cctagaggac 421 tagaatcgtg tcctatttaa ctttgtgttc ccaggtccta gctcaggagt tggcaaataa 481 gaattaaatg tctgctacac cgaaaacctc acgcctctca ttcgttgtga tttttgcata 541 tgttgttgac tcaggctgaa actcctcctc ctcctacagc agcactgtgt gccgccccac 601 tgctgccccc acctctcggg gcagctggcg gagaagagag ggggctttcc aaatccacta 661 tgaatcctta 
aaaggctgaa taagggaact tgaaacgatg cagacaggca ggcaacccgt 721 agtcgaggga catagggcca taaacgtcga agccaggtct tgagcgtcag acagaatgag 781 gtgtcccaga gcggcggagg agagccagac gcacgctggt tccggtctgg acggaatcgc 841 cgcggagcac ggcagaggct agggcggaat ggctacggca gcgcagttcg ccaaggctcg 901 gtctccgccc tgcagcctat ttcctctaac ccgcagatcc actatgggag gaggcgggga 961 gcgggctgga gagcactaac agagacaggc ggggcgagtc gtgggcgcgt gacgtcaccc 1021 cggggtgtgc gcggcgcgag cggaagcgga agcggctctg ttcgccgcct ctcccaccgg 1081 cccgatgagc tgcagcggct ccggcgcgga ccccgaggcg gcgccggcct ccgccgcctc 1141 ggccccgggc cccgcgcccc cggtctcggc tcccgccgcg ctgccctcca gcaccgccgc 1201 ggagaacaag gccagccccg cggggacagc ggggggacct ggggctggag cagctgctgg 1261 gggcacggga cccttggcgg cgcgggccgg ggagccagct gagcggcgtg gggcgggtga 1321 gggccgggac gggtgagggc gagggaagcg tcttgggctg ggtccagcgg cgggtgaggg 1381 cagagggtat agacagacct agctggggcc ctgcgcctcc cacctctgtg tctcattttg 1441 ctggttctcc tccagctccg gtgtcggcgg gtggcgcggc gcccccggag ggggccatat 1501 ctaacggggt ttacgtactg ccgagcgcgg ccaacggaga cgtgaagccc gtggtgtcca 1561 gcacgccttt ggtggacttc ttgatgcagc tggaagatta cacgcctacg gtgggcttcc 1621 gcccgaacaa ggccacctag cctgctgaca aaactttcag ccacatcgtg cttttcagcg 1681 ttctcttcca tttgctcccc tagtcgctct tctgtgtttg ccctctgctc acccaaactg 1741 tgagcttcct gataatcagg cctatccatt tccctcaccc tcctcccgct ctgctgacag 1801 ttctcttaat tgatttctca gatcccagat gcagtgactg gttactacct gaaccgtgct 1861 ggctttgagg cctcagaccc acgcatgtga gtaaacccag ggcaggttag ttttgggtgc 1921 ttgtgcagta tgttgtccat ctccttctca tctaagtttt ttctctctag aattcggctc 1981 atctccttag ctgcccagaa attcatctca gatattgcca atgatgccct acagcactgc 2041 aaaatgaagg gcacggcctc cggcagctcc cggagcaaga gcaaggtgtg aggggaggct 2101 taatgaatca gtaattacct tccacaacag tggaggctta tcctgccacc cctttgggga 2161 aactgaatcg taggggaggt gtaagactta ctcagggtca cccatctggg attgaagtcc 2221 gggattcctg tgctcagttg gtgctcttcc ctcttccctc aggaccgcaa gtacactcta 2281 accatggagg acttgacccc tgccctcagc gagtatggca tcaatgtgaa gaagccgcac 2341 tacttcacct gagccaccca 
acctaaatgt acttatctgt ccccatgtcc ccacaccagc 2401 ctgttttcat aataaacttt attgtgacag gcggggctga tccctcccat gttgggagac 2461 accatgtggc aagtgacaaa gctctgagcc cgcccctctt ggggccacag tggtagggat 2521 gggggaaggg gatggccccc atggctgggg tagtaccatg actggaggcg ggggaggcaa 2581 ccagaggcct gctgctttgg ggagg // LOCUS HSU25826 5522 bp DNA PRI 28-MAY-1995 DEFINITION Human transcription factor (SC1) gene, complete cds. ACCESSION U25826 NID g833832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5522) AUTHORS Krishnan,R. TITLE Molecular mapping by transposon-based nested deletion sequencing: the SC1 gene maps near the HLA-C locus JOURNAL Unpublished REFERENCE 2 (sites) AUTHORS Ku,D.H., Chang,C.D., Koniecki,J., Cannizzaro,L.A., Boghosian-Sell,L., Alder,H. and Baserga,R. TITLE A new growth-regulated complementary DNA with the sequence of a putative trans-activating factor JOURNAL Cell Growth Differ. 2 (4), 179-186 (1991) MEDLINE 91329275 REFERENCE 3 (bases 1 to 5522) AUTHORS Krishnan,R. TITLE Direct Submission JOURNAL Submitted (27-APR-1995) Rajendra Krishnan, Dept. of Internal Medicine, Washington University School of Medicine, Div. Allergy & Immun., 660 South Euclid Avenue, St. 
Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..5522 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="YAC B209D7, Cosmid 52, pDEL5.5" /clone_lib="CGM1 library from the Washington University School of Medicine" /chromosome="6" /map="6p23.1" /sex="male" /haplotype="A3 B8 C- DR3 DQw2 DRw52 and A29 Bw65 C- DR7 DQw2 DP4" /cell_line="B lymphoblastoid" /tissue_type="blood" /dev_stage="adult" gene join(933..1170,2905..3463,3929..4211) /gene="SC1" exon <933..1170 /gene="SC1" CDS join(933..1170,2905..3463,3929..4211) /gene="SC1" /codon_start=1 /product="transcription factor SC1" /db_xref="PID:g833833" /translation="MLPCFQLLRIGGGRGGDLYTFHPPAGVGCTYRLGHRADLCDVAL RPQQEPGLISGIHAELHAEPRGDDWRVSLEDHSSQGTLVNNVRLPRGHRLELSDGDLL TFGPEGPPGTSPSEFYFMFQQVRVKPQDFAAITIPRSRGEARVGAGFRPMLPSQGAPQ RPLSTFSPAPKATLILNSIGSLSKLRPQPLTFSPSWGGPKSLPVPAPPGEVGTTPSAP PQRNRRKSVHRVLAELDDESEPPENPPPVLMEPRKKLRVDKAPLTPTGNARGRPRKYP VSAPMAPPAVGAGSPVQLLVAACPRKRQWPGFSVMAVTSGSMWPVLAAASRLPGRPTS DAQGAGLAFSLRSTAKAPSDTPAHE" conflict 1012 /gene="SC1" /citation=[2] /replace="c" intron 1171..2904 repeat_region 1937..2238 /rpt_family="Alu" exon 2905..3463 /gene="SC1" conflict 3033..3035 /gene="SC1" /citation=[2] /replace="aac" conflict 3243 /gene="SC1" /citation=[2] /replace="a" intron 3464..3928 exon 3929..>4211 /gene="SC1" conflict 3933..3934 /gene="SC1" /citation=[2] /replace="cg" conflict 4000..4002 /gene="SC1" /citation=[2] /replace="" conflict 4176..4177 /gene="SC1" /citation=[2] /replace="cg" BASE COUNT 1376 a 1368 c 1497 g 1281 t ORIGIN 1 gaattccgaa ggaggggtag gcgctgcccg cgcgcagagg ccgcgcccct cctggccccg 61 gcttcttggc tgtcaaacag tagcagcaac gtcggctcct gccgaggagc ccaaggggtc 121 ccgggatccg ccgcacaggc tggcactgct tgaagaggag gctactcgga gactgcgccg 181 cgcgggtaga tccgaaacgg ggctggggcg gagtgggaaa aggccgggta tgccttgcat 241 gatcgcgggg agctccttcc tgtttttatc ccacctagag aagccgggaa gtaggggttt 301 aggtccaatt tgttggagta cttaaggact cgtttgcact ttcttttggg ggatgacagt 361 ggattcattg ccctcggagg ttcaaccagt tatgagtgag 
ggattggcca gaagatcggg 421 gcgcaggcaa gcaggagtgc tctattagga taagcaagtt tgacaggaag aagctgctct 481 tctccgaatt acacagaggt gatgtgttcg tattgcacgt agacgtgtgt ataacaggac 541 ctccttcccc gcgccccgcc accccgacac acacaggagc tgcctaaagt atccttgcct 601 tgaagattgg aggctcccca aatatttggt gatctgagga tccagctcaa gtgaggtgcc 661 ataggacgtg ttcctgagtt tgcattgcac ggagaccttc ctggaatttt tcatttgcaa 721 gtcggcttaa ccaatttggc attgagtcct aggctgcttg cactctgaat ttgggctatt 781 caggtagtgt gctcaaagtt gagaccgcat acagaacaac tcaagtttgc atcagactgg 841 gaagcgaact taagccagcg gtgcgtggcc caggagtggg aaaggaaatg gatgcctgaa 901 gtggaagagg tggtgcagag ggggcaccgc ccatgctgcc ctgcttccaa ctgctgcgca 961 tagggggcgg caggggcggt gatctctaca ccttccaccc ccccgccggg gttggctgca 1021 cctatcgctt gggccacagg gccgacctgt gtgatgtggc cctgcggccc cagcaggagc 1081 ctggcctcat ctctgggatc cacgccgaac tgcatgccga gccccggggt gatgactgga 1141 gggtcagcct ggaagaccac agcagccaag gtgagcatta agcagggcag ctttgcccct 1201 gggtggttga agcgccaggc tggaatgagt aaggtctcca caagaccttg ctgtctgcct 1261 cccatactcc catcagattg gatggatagt cgtggtccag accttcatct tcccaccaga 1321 agtgtgcaca gtcagaagct ctctgccaga ctgacccttt ttggtcccgt ttagctcata 1381 caggacctgg gatatcatca gaaagatatc acagtgggga tgttctgagg ccactagagg 1441 ccaagtttag acttgattca gtttccagct ttgctgaggc actctgttcc tgggttaggg 1501 cagttctatg ttgaataatg tttttaataa tctgggcatg tctttctccg tgacttgagg 1561 cagttagcct cagaaagcct agattcacat ttgagtttgg ccactgcctc ttggtaaagt 1621 cagctgtagg agtgttatgg ttattagact atagtagcca acattcatct agtgcttact 1681 gttatgagcc aggccctatt ttaagtgtat tgaatgtagg tggtactaat attatcctca 1741 tttacagaaa aggaaaatga ggcacaaaga ggttaaggaa cttgtccagg gctgggcatg 1801 gtggtttaca cctataatcc agcactttgg gaggctaagg cagggtggat cacttgagct 1861 caggagttcg agaccagcct gggcaacatg gtgaaaacct gtctctacca aaaaattaat 1921 tagtttttta aaaaaagcct gggcgcggtg ggtcacgcct gtaatcccag cactttggga 1981 ggccgagatg ggcagatcac gaggtcagga gttcgagacc atcctgacca acatgttgaa 2041 accccatctg tgctgaaaaa aaaaaaccca aattaaccag gtgtggtggc gtgcacctgt 
2101 aaccccagct actcaggagg ctgaagcagc agaatcactt gaacccggga ggcggaggtt 2161 ggagtgagct gagatcgcac cactgcactc cagcttgggc gacagagcga gactccatct 2221 caaacaaaca aacaaaccaa aagcttgccc agggtcacat aactggtaag tggtagagct 2281 aggtactgaa cgagctggag ctgggggaga gtgagcatgt ttgaaaactg gaccttaggg 2341 cggggcacgg tccgtcacgc ctgtaatccc agcactttgg gaggctgagg cgggcagatc 2401 aggaggtcag gagtatgaga ccagcctggc caacatggta aaaccctgtc tctcgtaaaa 2461 ataaaaaaat tagccagacg tggtggcaca tgcctgtaat cccagctact caggaggctg 2521 aggcaggaga attgcttgaa cctgggaggc ggagtgcagt gagctgagat tgcactactg 2581 cactccagct tgggcaatag agcaaatctc catctcaaaa aaaaaaaaaa gaaagaaaaa 2641 aaaagaagaa agaaagaatt ttggccctta ggacagtgag ggcagggttc ctttgtggga 2701 aagcacaaga aacacagatt tgttcctagc tgacaaggag tgtactgcct ggtacctgtc 2761 acctgctgag gggcttagga tgtgagggag aatctgacca cagtttcata ttcttcccca 2821 gaaatcatac agatttctcc actcctgact ctggtcaatt ctgtttttgt cctccatatt 2881 tgcctggtgc cccaccatca acaggtactt tggtcaataa tgtccgactc ccaagaggtc 2941 acaggctgga attgagtgat ggagacctcc tgacctttgg ccctgaaggg cccccaggaa 3001 ccagcccctc ggagttctac ttcatgttcc aacaagtacg agtcaagcct caggactttg 3061 ctgccattac catcccacgg tctaggggag aagcccgggt tggggctggt ttccggccta 3121 tgctgccctc ccagggggct ccacagcggc ctctcagcac cttctcccct gcccccaagg 3181 ccacactgat cctaaactcc ataggcagcc tcagcaagct ccggccccag cccctcacct 3241 tctcccctag ttggggtgga ccaaagagcc tgcctgttcc cgccccacct ggggaagtgg 3301 ggaccacgcc ttctgctcca ccccaacgca atcggaggaa atctgttcac cgagtgttgg 3361 cggaactgga tgatgagagt gagcctcctg agaacccgcc accggtcctt atggagccca 3421 ggaagaaact ccgtgtagac aaagccccac tgactcccac tgggtaagtg gagtcctcac 3481 ttggccctct cagtgtttta ctgcttttcg attccttgta tccctaggct gtgaggaggt 3541 ccccctgcct ggggggatgg gcacgggagg tggaatagat ggaatggcaa gacctgggtt 3601 agctctgata gggaaagaaa aatatgtgca ggagaacatg agaggtgggg tggggcagtg 3661 ctataaaaca accggagtga gcatgtcctg ctttttacat tcatatggct ttaaccccac 3721 tttctagtgc ctaaggatgg ggaactttca ggctcacact agaggttttt aggcccaccc 3781 
catgtgtttt taaggacaga gtccaggctc accttagttc tcagaccact gtgcctctgt 3841 ggcctcaccc tatgaccagc catagggtgg caaggtctag gccttctcag atttccggtg 3901 acccttgtgt ctctctcact tccttcagaa atgcacgtgg ccgtcctcgg aagtacccag 3961 tgagcgctcc catggctccc cctgcagttg gggcggggag ccctgtgcag ctccttgttg 4021 ctgcctgccc caggaagaga cagtggcctg ggttcagtgt gatggctgtg acgtctggtt 4081 ccatgtggcc tgtgttggct gcagcatcca ggctgccagg gaggccgact tccgatgccc 4141 agggtgccgg gctggcattc agcctaaggt ccaccgccaa ggcaccatcg gacacacctg 4201 cccatgagta gacacagcag cgagcaaata ggtctgataa atacccccct tcccttccct 4261 ccccaagagg gaatgactac agggaagaag gatggattga tgtggactca ttcagggcct 4321 ggagcagacc ctggtggcca agacagaaga gatggtttcc ttccaaagat attgccacct 4381 ccaggaaatt gccagtgagc tggaagttcc cactattaca agccataagg ccatgtcgcc 4441 atggacacca gaatatctgt agtcagagca cctatcagtt gcaaaagcca tgcctgcaac 4501 cgatggaaaa tgtaagaggg agttcttaag gttcttggtg gaatcaccca aggtattctg 4561 ggaaaaccta gggcctggcc ccaaaacttc cctactctgt ggctagtcct gctgccaaca 4621 aaatcgtagc gacctggctt ttcacagctt tgcttttatt tccaagtcaa ggacaagccg 4681 cttcattcac tcctgggcat ttactcttct tgtgggtctg tgatattcct tgctttccag 4741 ggagaatgtg cttggcaagg tctggagaac taattcagaa tcttagggga aggggagaga 4801 tggaaataca aacctgctta ctggaaaggt gcaaaatatg ggttgagctg gaggtaggaa 4861 tacaggtaat taaggtttct agtttaaggg aaaacagatc tatttgccat ttaaatatgg 4921 taactgggat ttggttaagt tcacccagat agcagaagat ttatttacag gcttcacctg 4981 tactgtcagg gacaagagaa aagcctggta aaccaagcta cagcagttta ccagtgtgat 5041 ggctctcaca cagctccacc ccccgggtgg acacagcaga gggcacctgg gctggcctgg 5101 ttcagtgtga atcaaaccgc ttaacccaca catggtacat gtgattttct tttgtgagcc 5161 ttacaccaag ccaaactatt gtcaaagcat catttctata gaaataaagc cttatcttga 5221 cctgttctat taaaacctgc cacatccgcc ctttcctacc tagatttaat gagcccaagt 5281 ttttttacat ggaagaaatg actctggggc aaagacccct aatgaactag tggcagagcc 5341 aggaataaaa cttgagtaac taatgagtca cttatgggca gagtatgcaa aaaccttaag 5401 tggaaaccaa atagaccctg gtatcaagaa agcacaaagt attaatagaa gtttctggtt 5461 ggggtgatct 
aggttcaaca gaaataagat gatttctaag tataaagctc aaaattgaat 5521 tc // LOCUS HSU26425 19289 bp DNA PRI 03-OCT-1995 DEFINITION Human phospholipase C-beta-3 (PLCB3) gene, complete cds. ACCESSION U26425 NID g836664 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 19289) AUTHORS Mazuruk,K., Schoen,T.J., Chader,G.J. and Rodriguez,I.R. TITLE Structural organization and expression of the human phosphatidylinositol-specific phospholipase C beta-3 gene JOURNAL Biochem. Biophys. Res. Commun. 212 (1), 190-195 (1995) MEDLINE 95336441 REFERENCE 2 (bases 1 to 19289) AUTHORS Mazuruk,K., Schoen,T.J., Chader,G.J. and Rodriguez,I.R. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) Ignacio R. Rodriguez, LRCMB, NEI-NIH, 9000 Rockville Pike, Bldg. 6 Rm. 304, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..19289 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" exon 2641..2955 /number=1 5'UTR 2641..2856 gene 2857..18807 /gene="PLCB3" CDS join(2857..2955,5635..5712,5968..6036,6118..6258, 6454..6533,6616..6669,6761..6836,6943..7043,7599..7765, 7841..7987,9718..9958,10130..10214,10303..10489, 11277..11482,12646..12742,12827..12911,13198..13322, 13653..13807,13893..14054,14744..14843,14937..15041, 15267..15358,16201..16354,16442..16477,16560..16752, 17137..17290,17377..17453,17566..17655,17746..17803, 18436..18522,18604..18807) /gene="PLCB3" /note="1-phosphatidylinositol-4,5-bisphosphate phosphodiesterase beta 3" /codon_start=1 /product="phospholipase C-beta-3" /db_xref="PID:g836665" /translation="MAGAQPGVHALQLEPPTVVETLRRGSKFIKWDEETSSRNLVTLR VDPNGFFLYWTGPNMEVDTLDISSIRDTRTGRYARLPKDPKIREVLGFGGPDARLEEK LMTVVSGPDPVNTVFLNFMAVQDDTAKVWSEELFKLAMNILAQNASRNTFLRKAYTKL KLQVNQDGRIPVKNILKMFSADKKRVETALESCGLKFNRSESIRPDEFSLEIFERFLN KLCLRPDIDKILLEIGAKGKPYLTLEQLMDFINQKQRDPRLNEVLYPPLRPSQARLLI 
EKYEPNQQFLERDQMSMEGFSRYLGGEENGILPLEALDLSTDMTQPLSAYFINSSHNT YLTAGQLAGTSSVEMYRQALLWGCRCVELDVWKGRPPEEEPFITHGFTMTTEVPLRDV LEAIAETAFKTSPYPVILSFENHVDSAKQQAKMAEYCRSIFGDALLIEPLDKYPLAPG VPLPSPQDLMGRILVKNKKRHRPSAGGPDSAGRKRPLEQSNSALSESSAATEPSSPQL GSPSSDSCPGLSNGEEVGLEKPSLEPQKSLGDEGLNRGPYVLGPADREDEEEDEEEEE QTDPKKPTTDEGTASSEVNATEEMSTLVNYIEPVKFKSFEAARKRNKCFEMSSFVETK AMEQLTKSPMEFVEYNKQQLSRIYPKGTRVDSSNYMPQLFWNVGCQLVALNFQTLDVA MQLNAGVFEYNGRSGYLLKPEFMRRPDKSFDPFTEVIVDGIVANALRVKVISGQFLSD RKVGIYVEVDMFGLPVDTRRKYRTRTSQGNSFNPVWDEEPFDFPKVVLPTLASLRIAA FEEGGKFVGHRILPVSAIRSGYHYVCLRNEANQPLCLPALLIYTEASDYIPDDHQDYA EALINPIKHVSLMDQRARQLAALIGESEAQAGQETCQDTQSQQLGSQPSSNPTPSPLD ASPRRPPGPTTSPASTSLSSPGQRDDLIASILSEVAPTPLDELRGHKALVKLRSRQER DLRELRKKHQRKAVTLTRRLLDGLAQAQAEGRCRLRPGALGGAADVEDTKEGEDEAKR YQEFQNRQVQSLLELREAQVDAEAQRRLEHLRQALQRLREVVLDANTTQFKRLKEMNE REKKELQKILDRKRHNSISEAKMRDKHKKEAELTEINRRHITESVNSIRRLEEAQKQR HDRLVAGQQQVLQQLAEEEPKLLAQLAQECQEQRARLPQEIRRSLLGEMPEGLGDGPL VACASNGHAPGSSGHLSGADSESQEENTQL" intron 2956..5634 /gene="PLCB3" /number=1 exon 5635..5712 /gene="PLCB3" /number=2 intron 5713..5967 /gene="PLCB3" /number=2 exon 5968..6036 /gene="PLCB3" /number=3 intron 6037..6117 /gene="PLCB3" /number=3 exon 6118..6258 /gene="PLCB3" /number=4 intron 6259..6453 /gene="PLCB3" /number=4 exon 6454..6533 /gene="PLCB3" /number=5 intron 6534..6615 /gene="PLCB3" /number=5 exon 6616..6669 /gene="PLCB3" /number=6 intron 6670..6760 /gene="PLCB3" /number=6 exon 6761..6836 /gene="PLCB3" /number=7 intron 6837..6942 /gene="PLCB3" /number=7 exon 6943..7043 /gene="PLCB3" /number=8 intron 7044..7598 /gene="PLCB3" /number=8 exon 7599..7765 /gene="PLCB3" /number=9 intron 7766..7840 /gene="PLCB3" /number=9 exon 7841..7987 /gene="PLCB3" /number=10 intron 7988..9717 /gene="PLCB3" /number=10 exon 9718..9958 /gene="PLCB3" /number=11 intron 9959..10129 /gene="PLCB3" /number=11 exon 10130..10214 /gene="PLCB3" /number=12 intron 10215..10302 /gene="PLCB3" /number=12 exon 10303..10489 /gene="PLCB3" /number=13 intron 10490..11276 
/gene="PLCB3" /number=13 exon 11277..11482 /gene="PLCB3" /number=14 intron 11483..12645 /gene="PLCB3" /number=14 exon 12646..12742 /gene="PLCB3" /number=15 intron 12743..12826 /gene="PLCB3" /number=15 exon 12827..12911 /gene="PLCB3" /number=16 intron 12912..13197 /gene="PLCB3" /number=16 exon 13198..13322 /gene="PLCB3" /number=17 intron 13323..13652 /gene="PLCB3" /number=17 exon 13653..13807 /gene="PLCB3" /number=18 intron 13808..13892 /gene="PLCB3" /number=18 exon 13893..14054 /gene="PLCB3" /number=19 intron 14055..14743 /gene="PLCB3" /number=19 exon 14744..14843 /gene="PLCB3" /number=20 intron 14844..14936 /gene="PLCB3" /number=20 exon 14937..15041 /gene="PLCB3" /number=21 intron 15042..15266 /gene="PLCB3" /number=21 exon 15267..15358 /gene="PLCB3" /number=22 intron 15359..16200 /gene="PLCB3" /number=22 exon 16201..16354 /gene="PLCB3" /number=23 intron 16355..16441 /gene="PLCB3" /number=23 exon 16442..16477 /gene="PLCB3" /number=24 intron 16478..16559 /gene="PLCB3" /number=24 exon 16560..16752 /gene="PLCB3" /number=25 intron 16753..17136 /gene="PLCB3" /number=25 exon 17137..17290 /gene="PLCB3" /number=26 intron 17291..17376 /gene="PLCB3" /number=26 exon 17377..17453 /gene="PLCB3" /number=27 intron 17454..17565 /gene="PLCB3" /number=27 exon 17566..17655 /gene="PLCB3" /number=28 intron 17656..17745 /gene="PLCB3" /number=28 exon 17746..17803 /gene="PLCB3" /number=29 intron 17804..18435 /gene="PLCB3" /number=29 exon 18436..18522 /gene="PLCB3" /number=30 intron 18523..18603 /gene="PLCB3" /number=30 exon 18604..19192 /number=31 3'UTR 18808..19192 BASE COUNT 3737 a 5751 c 5870 g 3929 t 2 others ORIGIN 1 ggatccaccc tctccaccgg gagccctatc cgtggcctgg cctcctgccc taagcctggc 61 ctgataggag gatcacttta ccccagtcca aattccccac cacaccccac caggccaggc 121 cagctccctc tgaacccaag cattgtttgt accttgaggg tgcacacaga ttaaaggggg 181 aagaccctca aacagcccct aatgcccaac agcttctgtg gaagtggggg ctgcaggcca 241 tgaagacccc cagccctagc tctgccccca agatgctgtg accctgagcc ctgacctttc 301 tgtctgtcct ccctcagggt 
tggtgtgaag gtctcagcat gtccgtgtcc acaggtcagg 361 cattctttct gactctccct cttcccccca ccgcgccatg cccagggaca cccagattcc 421 tgtccagttc ccatggctat ccacccgaga ccacttggtg caggtgggca tgctggtctg 481 ccaccttctt ggaagctggg gcacagaacc ttagtcttgg agtagaaata ctgggttcaa 541 atcctggatc tggcccatgg gctgagaggc tgtatgcaag tcacttacct gctcagagcc 601 tcttcagggg gcttatggtc ctgacctcct gtggttgggg gcccatgccc agccctggtc 661 ctcctgcatg agccctactg ggactggagt gtaccactgc tcctcctcct tctagcctca 721 gtctgtggcc tccccagtcc taccccttcc agggtcccag gccccctcca ttagagtgca 781 tccttgaagc agaggctctg gggagggtcc tgtgggtcaa gaccgctcct tcaatcaagg 841 gtagagtccc ggaaaggctg cctgcctatc accattactg cctggtgaga taccctatta 901 ccattaccaa ggatggaaga aagtgtctgg gaaggaagga gccctgtaat ggggggtgcc 961 ctggcctcca gagacacagg gttgggggag gagagaggtt taagcccagg aaacaggtgc 1021 ccatcctgtt tttttccaca aggaggtgtt gcccaatgct tctctaaagt acgcttgcac 1081 aaggtctagg gcccacagct cagatcccct aagctcaggc ttggctccag gctttgctag 1141 ccctggcaca ggctggaggt tccactgccc cagagctgac ctatgccctt tccctccctc 1201 cctggtatac ccacagcttc cagctactca tgtcaatttg gcatctccca cccatttgct 1261 tatgtataca acaaccttta ggttctggac aaggtgactg aggcacctag agagactagg 1321 ttctgccact agcaagaggc tgctagcaag agccggattt gaactccagc ctgtgagggc 1381 agaggccaag ctccaccttg catggttctg ttttcccaat gaggtctaga ggctcctggg 1441 cctcaggaag tcggagggtt gaaatcctcc taagaagctg tgtgaccttg ggcaagtccc 1501 ttcccctctc tgggcctcaa gtggagatca taatcacacc ccccagggcc agggctgaga 1561 ccgaggagat cccccaggtt agagttgatg gagaacttgt cggggtgtgg aggaagacca 1621 gggccaggca gggctccttg tggaggtctc agaccctggg gcaaaggggc ctccagggga 1681 gctcctcctt tgctgacccc tcctccccca gagccagctg catctctttc tgggtttcta 1741 ggtgacttgg ctggcccctc tggatgagtt ctactttaaa cctgctgctg ttactgcctc 1801 tgtattgggg gggctcccac agggcccccc aggcaggaac cccccccacc agctaagccc 1861 tgcgggaggg gctgcagccc ccacccatcc cattgcccat gcgccttccc cgctcccatc 1921 tctggaacgc agatttcagg gcacctgctg tgggccaggc actgcagagc ccctcctcaa 1981 ccctcctgtc tggggacatc cacaaagctt ctgacccctc 
tgggcagaca acagcgttcc 2041 ccccgcggtt cccagccaag aggctcacag tcaggctcca gtcatgggag tctgcagcaa 2101 caagaagggg gctgttctag ggctcactct cgaagaccag gatgcttggg tccttaaagg 2161 aggtggcagg cgcccggcct ccctgggcta gggacggggg aggggtgcct gccggcggga 2221 ggtccattag ccggctaagt gctcccaggg aggggggacc gcgcggcata agagtgggtg 2281 aggcgaagcc agggcagagg ggggcggaag aagcttttcc tgcattcagc taagtgcggc 2341 agaagagggt gggcgacagg gctgtgcagc gaccccccgg cccaagtcta gccctccgct 2401 ggtactgcag ccctagtcgc tcggaaacct cagtgggctg atcccggacc gccttaagca 2461 gcgccctcta cacccagtcc ctccagctgc tccccaacag ccacttcggc ccccagcgcc 2521 ctccttccac cacaggcctg aagaatcagg ttctgctgag ccccgcagca aagcccccga 2581 cccgtccgcg tgcccctgcc ccaatccccg ctccggctgc agtcgccgcc ccggccgcca 2641 gggggagccg acgcggtggg agggggcggg gcccgggagg ccccgccccg tccccgcccc 2701 gcccgggccc cgccgaggct ctggtcggtc cgggacagac tggcgggcgg gcgggcactg 2761 acgccgcggg gccggagcgg gccgcgcggt gggagcagcg gcgccgtcgg tccccgtcag 2821 ggctccgtgg gtccccgacc cgcccctggc cgggccatgg cgggcgccca gcccggcgtc 2881 cacgcgctgc agttggagcc gcccaccgtg gtggagaccc tgcggcgcgg gagtaagttc 2941 atcaaatggg acgaggtaag cgcgcgggcc cctgctccgc ccaaatcccg ggactctttc 3001 agtcaagctc gaccagacgc tggggacccc caccccagcc tcgcctgccc gtggcgccac 3061 cctcatccca gtctggcggc gccccggaaa cttaagtccc cgatcgtctc cgtaatgaag 3121 acttttgtcc ccccaactcc cagtccctca gctctggaaa atttgcctcc actccttggc 3181 gcagagactt tgaaccccaa taaaccccgt ccccccgttt tccattcgtt cttcccaccg 3241 ctccctgcca gactaattct cgttccccga tcctggcctg gtgcccagac ccctgcccag 3301 cactagactc cccggctgca ggaccctgac ttccaggctg gctgggggca tcctcattct 3361 gggtgtgtgc cagggcgctc ctactctccc cggtgacctg cctgaacagt cctcgtccat 3421 tcatcatttt tccccctgtt ctggttggca ccccttcgcc ctgcttctgg atccccagcc 3481 cagctccctg ttcagatggg agcacccagt caccccctag tggagacttt ggcccccaag 3541 ttcctccacc cctgggaact ttgctcccca ctccttaggc ttgggacctt gaaccccaat 3601 acatcctggg ccccctttcc ctgtcctggc cctggtgctt cccaaatgtc tgtgtcccca 3661 ccttcgggtg gccatcccca tcctgggtga aatctcttct gccccctggg 
cccttccttc 3721 cctagggcct ccacccggcc ggcctgagct tcctcgcagc tccttcccag tcaggccacg 3781 ggcctggaag actctgggct cttttccggc tgggtggggc cccggggtag gcgtgggagc 3841 tggggggagg agatgggggc gtgtccgaag cggcaggtgt gcgaggtggg gtgcctgggg 3901 cccccagggg ccgggggaga cccctccccc caggctgtcc tgaagggaaa agtttcttct 3961 tttcattcca ctgtgataaa cccaaataag gaagtgggaa cagggagagg cccctggggg 4021 tggggtgggg ggtgctgacg gaggcttggg gcaggagagc aggggacctg ggctgccttt 4081 gaggtccaga cccctgggga gaggtgggag ggaaaacacc taggggcagt acccatagac 4141 tcccctctct gcacaccatt cggggccctc aggcccaggg tagggttggg aatccggaga 4201 agtgctgggg ctgggtgtca gcacactgaa gtggggaggc ccatgcatcc cccaaatttt 4261 gggggccttc tgttcccaga tgaacgccca tcttttgtaa gcaaagcctg ccatgggtgg 4321 ggctggtgca atccccattc ctccagaaga ggaaactgag gtctggaaag cagagagtta 4381 tcccagcccc cacaatgggg tgtgtctgag gccaggagcc gggagcctgg cagaagtctc 4441 cgctccctgc ttcccaggtc acaggacgct acaaccacac tgcctgcaag caggtcctgg 4501 ctccagtgat ctcgggcagg ctacgaagtg ttccctgtaa ctcagtgtcc tcacctgtaa 4561 aatgggagtg acaacagcac atacctctta ggctggttag gagacctttg tgtgctcgaa 4621 gatactaggt gcatagaact gtacctggtg cgcagtaggg gcggtgtgcg tatatgagat 4681 gtattgataa taatagtatt gtccaccaga cactgagata cattttgggg gtggcaccca 4741 acatgaggtg gggtcctttt ctgtctggaa gaacctggaa taggagggtt ctacctgtag 4801 ccacttgttc actctgcccc tggccgagtc taaagcccct ggaggacacc ccctctgcct 4861 gctggtcctg agtcacacag ggtgcccctg gtagtgtgcc agggaccggc cccttcatag 4921 aaaatggggc tggcatgcac ttccctcccc cctgccatct gcacgcctgt ttgagggagg 4981 aagagctcac tgcccagtcc tgagagtagg gcaagtcccc ttctaggggg actgttcttc 5041 ctacgagatg gggatgagct ttgagtctgt gaggaccccg gcagatgtgg taaggtggat 5101 aatgggcttt gaggcagacc acagggctat cgtgatgtcc caagggatgc agaagggcag 5161 tgggagtcag acttcttcac tttcatgcct taaacattct tcagtggggt ggattggggg 5221 cctcccactg tcacagagct ctcagcactt ctggggaaac tgagcccagg atggggcctg 5281 gggacctctt ccagggggtg ggtcagattt ccacgggtgg agaaggaagg gcagggtgtc 5341 ctgcccaaag gaaacatgag acaaacccct gggaaaggtc aggtagaatc ctagggccag 
5401 agggaggtga ggctgggagg aggctttgag gtaggtgtga cactaaatgt cacacctagg 5461 accttggctt tgttctgagg gccacaggca gccacacagg acctgtggac tggatgaggt 5521 ttcgtcttag ctcatgggca ggaactctgg ggtgcatgat gggagggccc caggggccag 5581 gcctgcacca cagccctcac ttcctgaccc ctgctgtccc tgtgctgtcc tcaggagacc 5641 tccagtcgga acctggtgac cctgcgtgtg gaccccaatg gcttcttctt gtactggacg 5701 ggccccaaca tggtgagggt gggcgctggt gcagctcgct caggccaagg acccctgtcc 5761 cagacccctg ccctgcccgt gggggtgggg cccggctgca ggctgacctc tcccactgct 5821 gcccagacag gaagtgctca gggcaggagc tgaggctggg gctatagcca ggggctggcc 5881 attccctggc ctgggtagag agcttggaag ctcctggcta ttgccctgca gcctctcata 5941 ctcagcctgg catctggttc cccccaggag gtggacacac tggacatcag ttccatcagg 6001 gacacacgga caggccggta cgcccgcctg cccaaggtga gtgatgagcc tgggagtgaa 6061 gaccacagcg agactgtgcg ggagccccct cttcaccctt cgctgtcacc tactcaggac 6121 cccaagatcc gggaagttct gggctttggg ggtcccgatg cccggctgga ggagaagctg 6181 atgacggtgg tgtctgggcc agacccggtg aacacagtgt tcttgaactt catggccgtg 6241 caggatgaca cagccaaggt gggctggcac caaggggacg aagggggagt cactgtctta 6301 ttctgtgagt cggccgtcac ttaccgaaca cctgctgtgt tctaggctgc tctgtgacct 6361 actgtgtacc agaggcagtt gtggacctgg gttgtggctg ggcagcccct gtgtccccca 6421 ctcaccgcct ccccgtgtat actggccccc caggtctggt ctgaggagct attcaagctg 6481 gctatgaaca tcctggctca gaacgcctcc cggaacacct tcctgcgcaa agcgtgagcc 6541 ccaggccacc cgagggggag ccggggggtt cacgtggccg ttttcagggt gtgacctctt 6601 catctgcctt cccagataca cgaagctgaa gctgcaggtg aaccaggatg gtcggatccc 6661 cgtcaagaag tgagcacccc ttcccccagc accttcctcc tgccctgacc ttggtgacct 6721 ttgtcctcca ctgaccctga acccctcctg cccgcatcag catcctgaag atgttctcag 6781 cagacaagaa gcgggtggag actgcgctgg aatcctgtgg cctcaaattc aaccgggtgt 6841 gtggggtggg gacaggggcg gggtgtggtg tgtcacggtg gacacccacc cttacggggg 6901 tgcccgcccc tggtttctca ccccacactc ctcgacctcc agagtgagtc catccggcct 6961 gatgagtttt ccttggaaat ctttgagcgg ttcctgaaca agctgtgtct gcggccggac 7021 attgacaaga tcctgctgga gatgtgagtg ggcccggcct gcctcacaac ccctgtgctc 7081 
tgtgtttagc ttctgcttag cgtgcctcta gctcaatggg gagaggggaa actggtgggc 7141 cgggtcctgg agcgccaggg ggcggagggg tgcccaggga gaacctgtcg gtgatgttcc 7201 tgtactcaga cccgccgttt ctgcccctca gttaacaggc agctctgggc taaccctgtg 7261 ccagggttgg gttaatcagc ccactcttca atggggtcca agcctcaggt gtgttgtgag 7321 gtctggaaac gaggggtggt gggctgacca cttggcagct cctaaaccct ggccattgct 7381 ggtgcccact agtctgctgc cttgacaatc ctgagtagtc aggagagtaa agggggaaac 7441 tgaggcccag aggttggcca tgactcaccc agagtcccac tcactgggtg ccagatccag 7501 ggccaggcct tctgactccc ttgcccaggg caagggcttg gcgacggggt gggagccctg 7561 ccaggctcca tcctgaccca gcttcctgtc ccctccagag gcgccaaggg caagccatac 7621 ctgacgctgg agcagctcat ggacttcatc aaccagaagc aacgcgaccc gagactcaac 7681 gaagtgctgt acccgcccct gcggccctcc caggcccggc tgctcatcga aaagtatgag 7741 cccaaccagc agtttctgga gcgaggtgag ctggctgggg tgcaggtggg tgggggcagg 7801 tgggacacca gctgaccctg ctgatcctct cccaccccag accagatgtc catggagggc 7861 tttagccgct acctgggagg cgaggagaat ggcatcctgc ccctggaagc cctggatctg 7921 agcacggaca tgacccagcc actgagtgcc tacttcatca actcctcgca taacacctat 7981 ctcactggtg agtggggagc aagggtggca cccgtgacct ctgccctgtg acccttggac 8041 agccccactc ctcctggggg acagtccttt tggcccattt gctcattcat tggccagtgc 8101 tggggatgca ggcaggggaa caggcagcca gccctgagca gtggggcctc atggtggctt 8161 tcccagggca agggctggaa cagcttgtgt tagggcctgc agcaggagag acttcagggt 8221 cctttttttt tttttttttt tttttttttt ttttttttct ttttgagaca gagtttcgct 8281 cttgttgccc aggctggagt gcaatcggca cgatcttagg ctcagcgcaa cctctgcctc 8341 ccaggttcaa gcgattctcc tgcctcagcc tcccaagtag ctgggattac aggcattcgc 8401 caccacgccc ggctaatttt gtatttttag tagagatggg gtttctccat gttggtcagg 8461 ctggtctcaa actctcaacc tcaggtgatc cgcctgcctc gccctcccaa agtgctggga 8521 ttataggcat gagccacctc gcccagcctg cagggtgctt ttaagaggct ggaagggggt 8581 tgggtgcggt gactcatgcc tgtaatccca gaactttgga aggctgaagc aggcagctta 8641 ctcgagccca ccttttgaga ccagtctggg caacatagca tgaccccatc tctacaaata 8701 atttataaaa aattagctgg gcatagtggc gcatgcctgt ggtcctagct ccttgggagg 8761 gtgaggtggg 
agtatcgctt ggttctggga aggttgaggc agaagtaagc tgtgattgca 8821 ccactgcact ccagcctggg tgacagagtg agagaccctt ttctaaaaaa aaaaattaaa 8881 taaataataa aaaagcagaa aggaagctgg tgcggcaggg tgcagagtgc tggagatagg 8941 tgcctcctaa gagagggaga gcaaggcagg gctaagccca ggggcctaga ggcacttgga 9001 ttcatcctgg gtgtgaggaa gaggcttttt ggagggtgaa gcagaagagg tgatgtgacc 9061 tgatatgata tgtttttaga aggatgatgc caggtgctct atagagattg gattgaggcc 9121 aggcacagtg gttcacacct gtaatcccag aactttgaaa ggctgaggcg tgtggatcac 9181 ctgaggtcag gagttcgaga ccagcctggc caacatggtg aaaccccatc tctactaaaa 9241 aaaccaaaat tagccaggca tggtggtggg caactgtaat cccagctact cgggaggctg 9301 aggcaggaga atcgcttgaa ccagggaggc ggaggtttca gtgaaccgag gttgttccac 9361 tgcactccaa cctgggtgac agagcgaggt tccgtctctc aaaaaaaaaa aaaaagagat 9421 tggattggag gcagggaggc ctggagtctt ccacacagga ggtgtgtgat ctgtcccagg 9481 actgaggtgg gtggagggag tgaggagagg gagggattgt tgaggcagtg gggagctata 9541 gtgagtctca gggatgactc cccccagctc acgctggtgg gatggacagg tggtgggtat 9601 ccatctgaga gatggagagc cctctccaaa tccagcatgt cctggttcca ggtgggaccg 9661 gggtggtgag gagctgggct ggtgggcggc ctcggtgaca gagcctcgcc gccccagcgg 9721 ggcagctggc tgggacctcg tcggtggaga tgtaccgcca ggcactacta tggggctgcc 9781 gctgcgtgga gctggacgtg tggaagggac ggccgcctga ggaggaaccc ttcattaccc 9841 acggcttcac catgaccaca gaggtgcctc tgcgcgacgt gctggaggcc attgccgaga 9901 ctgccttcaa gacctcgccc taccccgtca tcctctcctt cgagaaccat gtggactcgt 9961 gagtgagccc ctggcatgaa accccatgga ctgggggaca gtcttccagc ttcagtgctg 10021 ttggactgct cagggacctc caaccctgcg aacggccact gacatgtccc gtagacctca 10081 gcttcctctc tgggggaggg atctctgacc tctgacctcg gttccgcagg gcaaagcaac 10141 aggcaaagat ggctgagtac tgccgctcca tctttggaga cgcgctactc atcgagcctc 10201 tggacaagta cccggtacgg gagctcggag cgaggggggt ggcatggggg ccagggtgct 10261 gcggacccgg tccagggccc tgactcgtcc atgcctgccc agctggcccc aggcgttccc 10321 ctgcccagcc cccaggacct gatgggccgt atcctggtga agaacaagaa gcggcaccga 10381 cccagcgcag gtggcccaga cagcgccggg cgcaagcggc ccctggagca gagcaattct 10441 gccctgagcg 
agagctccgc ggccaccgag ccctcctccc cgcagctggg taggccccag 10501 cccggcccgc caccctgatc gaccctagcc tctggcctca tgttccaggc gccgtgacac 10561 ttcatcccag acccccagcc caccctcctg cctcgttggt ggcgcctgtt cccacaaccg 10621 gcctgtctcc tcaccacctg tcggttggtg tggggcctct gacccctgac ctcaggttga 10681 acgtctcacc cagctgcacc ccaactcctg actcccaacc tcagcctctc acctctcaga 10741 gccttcccac aacccggtga tggctctgaa ggctgacctc tgacctcagt aaaccacatc 10801 ccacactggg tgtgacctca gacccctaag cccgcattga gttcacctgc ccagttggtt 10861 atcatttcag ctgacctctg acctctcacc cctgactgtt cacctcacta tgccccctcc 10921 ttagacatgg ctctgactta cccctgacct gtaacctctc ctatgtcatc ccttaccccg 10981 ctattagggc tgtggtatgc tacttcctga cttctcaccc cactgaacca ccctatctgc 11041 ctccctgctg aatatcactg cagaccttta cctttgacct atgacccctc tcaacccata 11101 ctgggttaat accatctgac ctcgacctct tggcctctcc ccaagccaca ctgccctgag 11161 agtacccctt gaattcccca ctgagtcacc tcggcacaca ctgggaaaaa cccctccaga 11221 cttcctaagc agctgtgtgg tcctgacccc tcaccctgtg gccctgtcct ctgcagggtc 11281 tcccagctct gacagctgcc caggcctgag caatggggag gaggtagggc ttgagaagcc 11341 cagcctggag cctcagaagt ctctgggtga cgagggcctg aaccgaggcc cctatgttct 11401 tggacctgct gaccgtgagg atgaggagga agatgaggaa gaggaggaac agacagaccc 11461 caaaaagcca actacagatg aggtcaggcc cacagggtgg gcaggtcggg gaggtagcat 11521 ctatttacct cccagggggc tgggtctaac catcaacaag actgacatca cccctgcctc 11581 ctggggctca gtgtgggtgt gagtgaaggt gggaggaggg ctcttcagtc atcaggcatg 11641 gaaacaagca tacagttctg ctgatgatgt ggcctgggag gtgcgagtta gggtacagaa 11701 ctggcagagg atagggagtc taccctgggc agggaggtgg ggcagggaga cctcaggaag 11761 ggggacagca tgtacaaagg tcctggggca gggccagggg gtgcaggagc ggtgagcatt 11821 gtcatgggga gcattgagta gctgtgagga atggtacaga aaaaatcagt ctgagctggg 11881 cgtggtggct cacacctgtg atcccagcta ctcgggaggc tgaggcatga gaatcccttg 11941 aacctgggag gtggaggttg cagtgagctg agattgcacc attgcactcc aacgtgggct 12001 aaaaagggag acactatctc aaaaaaaaaa aaaaaaaaaa gacatcattg ctataaagac 12061 aattactgga gcaattagtg aaatttaaag atgggctaca tattagataa tatgttaatg 
12121 ttttgtttcc tgaatttaat tattgtattg gagttataga agagaatgtc catatatgct 12181 gaagtattta tgggtaaagg gtcatgatgt ctataattta ctcccaaatg attcagtaaa 12241 agtaatgata attgtaacac ctatatgggc caggcgaggt ggctcatacc tgtaatccca 12301 tcactttggg aggctgaggt gggcgagtca cctgaggttg ggaggttcca gaccagcctg 12361 gccaacatgg tgaaacccca tctctactaa aaatacataa attatccggg caaggtggcg 12421 ggcgcctgta atcccagcta cttgggaggc tgaggcagga aaattgcttg agctgggtgg 12481 tgtccagggc gcatcagtga gaccccgggg agaagactgt cggcaccacc agcagtgtgt 12541 atatagtgct gggaagaggg tcagcacctc cttcatgggg ggcaggtgag ggaatggcca 12601 gctgtggcgg gaatggcact gctgaccctg ggttggggcc cacagggcac agccagcagc 12661 gaggtgaatg ccactgagga gatgtccacg cttgtcaact acatcgaacc tgtcaagttc 12721 aagtcctttg aggctgctcg aagtgagtgg gggtgggtgg caggcatggg agcttgccca 12781 gctcagccct gccaggtctg acgccctttc ttggctcacc cctaagagag gaacaaatgc 12841 ttcgagatgt cgtcctttgt ggagaccaag gccatggagc aactgaccaa gagccccatg 12901 gagtttgtgg agtatccttt gaaggtgctg tgggcggaca ggccagggtg gggcgctcgg 12961 aggccggctc cggtgttcta tccccacata taccacctgc acagggtgct ggccctccgg 13021 cggccctcct gcaccctggc ctggggcttg ggcagatcca ggcagtctca gggggcatgg 13081 ctaggctaag aaacagaggc cgggtgtggg ggtcacagga gcacctggct gaggccggag 13141 gtggaggctg atggggtggg ccctggcacc tgtgtggccc ctgaccacca atctcagata 13201 caacaagcag cagctcagcc gcatctaccc caagggcacc cgcgtggact cctccaacta 13261 catgccccag ctcttctgga acgtagggtg ccagcttgtt gcgctcaact tccagaccct 13321 cggtgagccc tggccccctc catcttgacc ccgaccctca gtcttactga cttctgaccc 13381 acgatcctgc tggtcttgcc tgaatggacc tctgaccctg tctcatccca gatcccaggg 13441 gaacccagct cccgactcac cccgcctcct gcccttggct gataccagac ccctcccctg 13501 tctgatctgt cctggatccg gaccctgatg ccatttgagc ccgcccctga ccctggacct 13561 tggatgccat ctgacctgat gatctcccat tccccacatg gcaccgtcct gcctgatccc 13621 tgcccctgct cgacgtgccc gtggcacccc agatgtggcg atgcagctca acgcgggcgt 13681 ttttgagtac aacgggcgca gcgggtacct gctcaagccg gagttcatgc ggcggccgga 13741 caagtccttc gaccccttca ctgaggtcat cgtggatggc 
atcgtggcca atgccttgcg 13801 ggtcaaggtg gggcttgcgg gcggctcagg ccaggggtgt cctgggggca ggactctcag 13861 catccgcctc accctccttg gcccaccccc aggtgatctc agggcagttc ctgtccgaca 13921 ggaaggtggg catctacgtg gaggtggaca tgtttggcct ccctgttgat acgcggcgca 13981 agtaccgcac ccggacctct caggggaact cgttcaaccc cgtgtgggac gaagagccct 14041 tcgacttccc caaggtgagc ctggcccctg cacccgccca ggcacaggca gatccagccc 14101 agaccgcccc gactctcagg tgggagaaca ggaaccaggc acaccgttgg cgactgggga 14161 tcagaaagtg gggggtgcca tgggtccccc aggacctgcc tgtccgagcg tcagtctccc 14221 tgccctcctt agggtaagag accagggtga gaaggtgaga ttggggttgg ggggtgtcag 14281 ctagacgtct tgtgtcccct ccgaggagtg gacacagcct accctgtctc cgaggcagga 14341 gctgaggtca caaccccaag ctcatatccg ctcttcagca ccccgcacgg ccctgcatgg 14401 acgcggcctc tgttcccagc ctgggcatgt gtgttcttgg tttcctgctc atgactcact 14461 cccgtttcta aactggcaat gaccagaact gtctgtgggg gccctggagg ggcctggctg 14521 cctgtcatct tgcctcaggg tcctgatggc tgcagtcatc cattaggcca ggacatggaa 14581 gtggggtcag ccatcagctt agtggcaatg gtgactgatg gtcagtaggt cagctcgtct 14641 gaaggtctgc tttgttaagc tttgatggct gtggttagct gtggcctggt agccacagtg 14701 gcccagggtc agccctgcgg ctcagctcca gtctggccca caggtggtgc tgcccacgct 14761 ggcttcactt cgcattgcag cctttgagga ggggggtaaa ttcgtagggc accggatcct 14821 gcctgtctct gccatccgct ccggtgaggc cttggtgggc tctgggcagc acagcgggca 14881 gtgggtaggg cccccacctg gcccagcaac ccaacgttgc cttcctgacc acccaggata 14941 ccactacgtc tgcctgcgga acgaggccaa ccaaccgctg tgcctgccgg ccctgctcat 15001 ctacaccgaa gcctcggact acattcctga cgaccaccag ggtgagctgg gggtgggcgg 15061 ggcctgcctg gccagggagt gtgagggaca agggccaccc ccagggcctg ggagtggccg 15121 agctggatgg ccccgactga gtagggaact gagtagggaa cagatctgag gacaagctct 15181 ggaattcctc attgagacca gaggtggcgg gtgggggtgg cctgggggct ctgtctctga 15241 gaccttggcc ttctgcctcc ccccagacta tgcggaggcc ctgatcaacc ccattaagca 15301 cgtcagcctg atggaccaga gggcccggca gctggccgcc ctcattgggg agagtgaggt 15361 gagccggggc agggcagggc tcaggctcac tgtgacccag ggggccgctg gacatgactg 15421 ctgtggccac atgaggaggg 
ggcttctggt ctctctagcg cagtaggctt gggcagccct 15481 gggattttga ggagatgaaa accacccatt tgcattcagg aggccttcag gtcacacctg 15541 ggctggaatt cctccagctc catgtaggct tggagtgggg gtacaggggt ctcagccact 15601 tcatcagggc agtgcgaagc cacagaagct gttcagacgt ggtgtcctgg gagtcatgag 15661 gagggcagtg gcttccgtgg ggaggggtca ggcacggagg ttcacctgcg cttccagatc 15721 cagtcccccc agggcctgcg gcaggcgggc ggggagagat gactgcaatt agggctgcct 15781 ggagctgtta gcggagaggc tggtgcccac tctgcggttg agtcctctgt tgatttaatc 15841 aggagctcag cagcaaggaa aatgagcgac tgtaggcaag gctgggggac acaggcaggg 15901 gaggagagga ggcttggggc tgggccaggc ctttccagaa caatctggga gttactgagc 15961 acatgcagtg ggctggcatt atgttatcct tcccccacag cccatgtgga aaaggggttt 16021 agaaaggctt gatttatagc tcttctcatg cagacagcct cctgttacag agcagtccag 16081 cagtggcttg gactaaggga ggctgacagg ggtgctaggg gacccaggag gtggtagagg 16141 acagaagggt ctggagggaa cagcattctc tgaaaagagc attctccttt ctccctatag 16201 gctcaggctg gccaagagac gtgccaggac acccagtctc agcagctggg gtctcagccg 16261 tcctcaaacc ccacccccag cccactggat gcctcccccc gccggccccc tggccccacc 16321 acctcccctg ccagcacctc cctcagcagc ccaggtaagg agtggcctgg gtcgggggtg 16381 ggttgcaggg aggcaccccc caccctgtcc tccagctcct cacagggcct ctgctcccca 16441 gggcagcgtg atgatctcat cgccagcatc ctctcaggta gggggcgggg tacctggagg 16501 cagggggttg ccttggaggt tcccccaccc cccgcccacc tttggtctgc cgctcccaga 16561 ggtggccccc accccgctgg atgagctccg aggtcacaag gctctggtca agctccggag 16621 ccggcaagag cgagacctgc gggagctgcg caagaagcat cagcggaagg cagtcaccct 16681 cacccgccgc ctgctggatg gcctggctca ggcacaggct gagggcaggt gccggctgcg 16741 gccaggtgcc ctgtgagtgt ctgggccgcc tgtgtgctat gtgtgctggg tgtgctgatg 16801 tgaacatggg ggtcggtggg catggagaga tggcaagagc tgctccttga cccccaggga 16861 ggagggttca gtgcaaggat ggaggaggct ttccacacca ggcgtccctg tgtcccccag 16921 ggagtgacat ctcttttgaa gtccatgagg agcatgttgc ccaaacaggc tccccatcac 16981 ttgggtggag ctaacaggcc tcctggtttg cataacagac ctggccccat gggatggtgt 17041 ctgccatcgc ttgccaggat gtgcccccct acacaccctc cccatcccag tccatgtgac 17101 
gggcccaggc atcacccaga ggggttctcc tcgcagaggt ggggccgctg atgtggagga 17161 cacgaaggag ggggaggacg aggcaaagcg gtatcaggag ttccagaaca gacaggtgca 17221 gagcctgctg gagctgcggg aggcccaggt ggacgcagag gcccagcgga ggctggaaca 17281 cctgagacag gtagggggcc tgcagtggcc agggaaagcc tgctggatag acccgtcgtc 17341 agcccggcat cacctgtcag ctccctgtgt ccacaggctc tgcagcggct cagggaggtc 17401 gtccttgatg caaacacaac tcagttcaag aggctgaaag agatgaacga gaggtgaaag 17461 ccgaggattg tctatgggaa gggctgggga cttctagtac cagaaggagg gcagagtctg 17521 tgcttctgcc gctgacccct cctcttcccc tgccggcttc tccagggaga agaaggagct 17581 gcagaagatc ctggacagaa agcgccataa cagcatctcg gaggccaaga tgagggacaa 17641 gcataagaag gaggcgtaag ggcaccggga ccgggggcca tctgggtact ggggaggcag 17701 ggcaggtgtc tgggccccga gccatcctgc gttgctcccg tgcagggaac tgacggagat 17761 taaccgtcgg cacatcactg agtcagtcaa ctccatccgt cgggtgagtc aggctcccgg 17821 gccaccctac cccacctccc ttccttcact catcagacac ccatctccat gcctggcctg 17881 gtgccacgtt ggcctcaagg ttcttctagg gcctttctca agtgcccaac agccccaggg 17941 tacccacatc atacccaaag ccaactgtct gtaccagtgt ccctggcttt ggggtccttg 18001 ggcccagtcc tctcacaacc ctgatacatt ggcctgggag gcttcgccca cctgacttct 18061 ttttctttga gacagagtct cactctgtcg cccaggctgg agtgcagtgg cgcgatcttg 18121 gctcactgca acctcttggg ttcaagcaat tctcttgcct cagcctcccg agtagctggg 18181 attacaggca tccgccacca tgcctggcta atttttttat ttttagtaga gatggggttt 18241 caccatgttg gccaggctgt tctcgaactc ctgacctcag gtgatccgcc cgccttggcc 18301 tcccaaagtg ctgggattat agatgtgagc cactgcaccc ggcctcgccc acctgacttc 18361 tatacccagc cagagcctcg aggctctagc ccctccctca gggcccctca gctgagccct 18421 tgcccacgcc tgtagctgga ggaggcccag aagcagcggc atgaccgtct tgtggctggg 18481 cagcagcagg tcctgcaaca gctggcagaa gaggagccca aggtgaggcc atgggcgaac 18541 aggtgggcag acggtgtgca aggcagccag gcctcgcctg tgatgcccat ccttctccca 18601 cagctgctgg cccagctggc ccaggagtgt caggagcagc gggcgaggct cccccaggag 18661 atccgccgga gcctgctggg cgagatgccg gaggggctgg gggacgggcc tctggtggcc 18721 tgtgccagca acggtcacgc acccgggagc agcgggcacc tgtcgggcgc 
tgactcggag 18781 agccaggagg agaacacgca gctctgaact ggctgagcga ggtggccaca gggccagggc 18841 gggcgctggg tggagggcag gaggcaatga cactaatgct tttttttttt ttttttaact 18901 ttttatctag aaattttatt tttttaaacc cggggcaagt acctcagcta actcccttca 18961 tcctcctggg gcccctcctt cctgccctca gtcttaggtt agggccttgg tcagggcttt 19021 gctccctgtg acacccacac cctcgagcta gcagcgtctc ctcccttccc cgggagagct 19081 ggctggagac ttggagctcc gggaagtagg agtcacattt ttttctctat tctttgggga 19141 ttttttttac atgaataaaa gtggatttca gggcaccctg tgtgcagtgt gatttggggt 19201 gacagnactc tnggccctaa gtttcctcat ctataaaatg gactttgggc atttggtggg 19261 gttgaggtca cgcctccagc cacgggctg // LOCUS HSU27266 9645 bp DNA PRI 21-NOV-1997 DEFINITION Human myosin binding protein H (MyBP-H) gene, complete cds. ACCESSION U27266 NID g974540 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1772) AUTHORS Vaughan,K.T., Weber,F.E., Ried,T., Ward,D.C., Reinach,F.C. and Fischman,D.A. TITLE Human myosin-binding protein H (MyBP-H): complete primary sequence, genomic organization, and chromosomal localization JOURNAL Genomics 16 (1), 34-40 (1993) MEDLINE 93252409 REFERENCE 2 (bases 1 to 9645) AUTHORS Whittle,M.R., Fischman,D.A. and Reinach,F.C. TITLE Sequence of human myosin binding protein H gene JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 9645) AUTHORS Whittle,M.R. TITLE Direct Submission JOURNAL Submitted (18-MAY-1995) Martin R. 
Whittle, Depto de Bioquimica, Universidade De Sao Paulo - Instituto De Quimica, Sao Paulo, SP 05599-970, Brasil FEATURES Location/Qualifiers source 1..9645 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q32.1" /chromosome="1" /cell_type="lymphocyte" /dev_stage="adult" TATA_signal 738..742 5'UTR 808..835 /gene="MyBP-H" exon 808..1040 /gene="MyBP-H" /number=1 /evidence=experimental mRNA join(808..1040,1130..1264,1996..2163,4550..4638, 5012..5207,5389..5528,6140..6299,7201..7337,7498..7684, 7915..7965,8499..8779) /gene="MyBP-H" /note="corresponding cDNA sequence is reported in GenBank Accession Number L05606" /product="myosin binding protein H" gene 808..8779 /gene="MyBP-H" CDS join(836..1040,1130..1264,1996..2163,4550..4638, 5012..5207,5389..5528,6140..6299,7201..7337,7498..7684, 7915..7931) /gene="MyBP-H" /codon_start=1 /evidence=experimental /product="myosin binding protein H" /db_xref="PID:g2636654" /translation="MMEKNTSEGPACSPEETASESAKVPTAEPPGEVAVSESTREEQV PKPHGPAPQAPTASTATKPAPPSEDVPSAPLLLTLDDVSSSSVTVSWEPPERLGRLGL QGYVLELCREGGSEWVPVSARPMMVTQQTVRNLALGDKFLLRVSAVRPAGAGPPAMLD QPIHIRENIEAPKIRVPRHLRQTYIRQVGETVNLQIPFQGKPKPQATWTHNGHALDSQ RVSMRTGDQDSILFIRSAQRSDSGRYELTVRVEDLEAKAVIDILVIEKPGPPCCIRLL DVWGCNAALQWTPPQDTGNTELLGYMVQKADKKTGQWFTVLERYHPTTCTISDLIIGN SYSFRVFSENLCGLSTSATVTKELAHIQKADIAAKPKGFIERDFSEAPSFTQPLADHT STPGYSTQLFCSVRASPKPKIIWMKNKMEIQGNPKYRALSEQGVCTLEIRKPSPFDSG VYTCKAINVLGEASVDCRLEVKASAAH" exon 1130..1264 /gene="MyBP-H" /number=2 /evidence=experimental repeat_unit 1397..1413 /gene="MyBP-H" /rpt_type=inverted repeat_unit 1453..1467 /gene="MyBP-H" /rpt_type=inverted conflict 1996 /gene="MyBP-H" /citation=[1] /replace="c" exon 1996..2163 /gene="MyBP-H" /number=3 /evidence=experimental repeat_unit 3216..3234 /gene="MyBP-H" /rpt_type=direct exon 4550..4638 /gene="MyBP-H" /number=4 /evidence=experimental exon 5012..5207 /gene="MyBP-H" /number=5 /evidence=experimental exon 5389..5528 /gene="MyBP-H" /number=6 /evidence=experimental exon 6140..6299 /gene="MyBP-H" 
/number=7 /evidence=experimental repeat_unit 6666..6684 /gene="MyBP-H" /rpt_type=direct exon 7201..7337 /gene="MyBP-H" /number=8 /evidence=experimental conflict 7323 /gene="MyBP-H" /citation=[1] /replace="" conflict 7334 /gene="MyBP-H" /citation=[1] /replace="" exon 7498..7684 /gene="MyBP-H" /number=9 /evidence=experimental conflict 7500 /gene="MyBP-H" /citation=[1] /replace="" exon 7915..7965 /gene="MyBP-H" /number=10 /evidence=experimental 3'UTR join(7932..7965,8499..8779) /gene="MyBP-H" conflict 7959 /gene="MyBP-H" /citation=[1] /replace="" exon 8499..8758 /gene="MyBP-H" /number=11 /evidence=experimental BASE COUNT 2115 a 2895 c 2715 g 1920 t ORIGIN 1 aagcttcaga gtgctctcct ccactagggc atcttccaga tctgcctaca actgcacccc 61 atcccgaagg cgaggggttc tctgtaacag gtggaaaact gaggcccaca gcaggcatca 121 ggggagacga gctccttacc agccgcccaa ctgagacagg agccaggaga cttagtgccc 181 cttcctgggg ttccaatgct ggaggaccat cctctgagca gctccaccct cagcatctca 241 attccctccc tctctagggc cctcagcctc atacagcaca ggcgagagtc cctgggaact 301 gtgggagcca gggaatgtgg accccagagg caaagctggg agtgctcaca aggacgattg 361 tcccaaactg tcagggcccc agcacccagt ctgagaagct ctgggaacaa gtggccccgt 421 agacgccaga cacacagagc ctgagctggg ctcattctcc agatagaagt tcttcctcct 481 attgccctga cacctggcac gcggcctgct cgatgtccca tctcacccat ctgcgccctc 541 ccccagctgg gcaccagcac cctggctgtt gaaggccagt cggggagatg gatgaggaag 601 tgactcctca aggggccctt gtggttcctc ccccagagct attcctggcc tgggcgcctc 661 tccaccctcc agtcctcctg cttgacctga ccccacagct gggaatcgtt tgctgggggt 721 ggggcagccc ctgactatat aagcctggac ctaggtgtcc caggcaccct agagcccagg 781 ccccccaacg gctgacccct gcacactcca ccctggcccc ccagccagtc cagcaatgat 841 ggaaaaaaac acctccgagg gccctgcctg cagtccagag gagaccgcat ctgaatctgc 901 caaggtgccc acagcagagc ctcccggaga agtggcagta tcagagtcca ccagagaaga 961 gcaggtgccc aagccgcacg gccctgcccc acaggcccct acagcctcca cagccactaa 1021 gcctgcaccc ccaagtgaag gtgactgaag gtggggcagg gggaggaggg aggcaggcat 1081 gtcaggggtg gaggggagct cttggccatg aacctcttca tctccccaga tgtccccagt 1141 
gccccactgc tgctgaccct ggatgatgtg agcagcagct ctgtgactgt gagctgggag 1201 cccccagaga ggctggggag gctgggcctc cagggctatg tgctggagct ctgcagagag 1261 ggaggtgagt cgtctgggct aagtcttgca tacatccagg ggaggttgtc atggagggct 1321 gcagggacaa gtgcacaggc cctgtggttg gtcagtctgg gagagataca gaaggagggg 1381 atgcgagcag taggctcctc aacccaccca cctgcccttt cttgctccac cagcaggaga 1441 gcaagaaagg gtaggtgggt gggttgaggg gctcccttgt cctcagctcc aaggagcttc 1501 tgagtaaggg aagggtaagg gggcactggg ttctcctttg gtgagaaggg aaggaagtgc 1561 taagtctcag aggaccccga actgtaggaa gtggaaaggg aggagctgga gaggggaatg 1621 ttagcgtaag agacagtgag gagaaagcgg gaatggggtg gggggtgtgg ggccacatcc 1681 ctgctgccac tgtccccctc catggcccaa gtccagcctg gcctctgcct ctgcccctac 1741 cttgctgtgt gactttgcaa atgtgtccaa ccctctctgg gcttcaacaa cctcatctct 1801 agccacaatt tcagggctct aacatcaggt gtctttgatg ccatttgaca agaagtcagg 1861 gaaggcaaac aaggggaaaa tgaggcccag agagggaggc tgatttcccc agatcacaag 1921 caggaaaaga agccccatgg agaggggcgc ctgtggccac attgcaaaga cctcacactc 1981 tctccatccc ctcaggctcg gagtgggtgc ctgtgagtgc ccggcccatg atggtgaccc 2041 agcagactgt gcggaacctg gctctgggag acaagttcct cctgcgcgtg tctgcagttc 2101 gtcctgcagg ggctggcccg ccggccatgc tggaccagcc catccacatc cgagagaaca 2161 ttggtaaagc ccatcctggc cttcagctac ccagcccttc ccaaacacct tatctaatgg 2221 agaaactgag gccagcaggt agagggaacc agccagtcat ttcccccgtc ccacgctcac 2281 atgcacaagc cacacgctat caccatcttg ccttaagaaa gttccaggtt cctcagcagt 2341 aaggggcctc tggaggtcta agaagccatt tccctgagtg gtccagcccc cacgcttgag 2401 aaggggctca gaaacaaaag tccagttccc tgggtcacct gctcaatgct aagagtccat 2461 ctgttctatc tcccatctaa tctcaatctc tccagtggcc ataaatgctc attttctacc 2521 catttcctgt ggcctgggtt ccagattctc tcaaaggtct tgaaacagag agaaggcgct 2581 gcttgcccct tgttccacca tggccctagt ggaaagaaaa agaaggtctg caaaaggccc 2641 atgaagactt ctccaccgca gagggcatac ctcactgggg ccccgcaggc cagctgccat 2701 ccccattgct aatatttcta tttgcaaggt gcattagcct caatcctccc ctcgctcccc 2761 tcccggagct gatcgggttt cagtctgtta atgacagtga gtcatttggt ctgatctgag 2821 gtctctgggg 
acagcacagt cccccagggt gaggagagac acaggcaggc aggtggcctg 2881 aagccttctc caccttcctt cttgctcctc cactagggga tggatgattt ctggtagctc 2941 tagacaaggc ctccagaagg agaggggatc cacaggaggg gtcaagtctg gattcgaaag 3001 gaaatgctca gctctgaggc ctgagagaca tgggaaccat ggggttaggg agcttagatt 3061 catagattat caaggcccaa ggagatccca aggagcccag ggcatccctt gaggtcaccc 3121 agccttttat tggcagtccc agagtctgat gcctacacag gaccactcat gtcccactgc 3181 actgcagagc atcttctctg gtggcaccca aaacccaagc ctcccaaccc taattcccag 3241 cctttggctc taggtgagat gtggccccac caggcagctg cattatggct gtgtcgaccc 3301 tagtcctgga ggcgccagga aggcttcttg gttgggacgg tgctgggcct tccccaggga 3361 accagggagc tcagactctg agaggcccct gaggcccagc attgatggac ccagccagac 3421 tcagcaccca cttctccatt caaattcctc tggcaggaga gaaggggatt tcctggagag 3481 aagagggaga ggaggctgtg aacaagtggg gaacagccag gcttatcaca gggcagagtg 3541 ggggccctgt ccaggatcct tggctgctca gctgtctctc cttggccacc tggagctgcc 3601 aagcattctt gaccccctgc agctgtccct agagcattcc atgcctgagc tcccagagcc 3661 aagtgcccac tcagctaatc ctcagagaca ccgtgttcct tctcactgtc ttgtcctcct 3721 ccgccctgca tgccaccaac caactgctgc cacctgtttg ctgctgagga gcctcagtca 3781 tggacccagc acccacccca ggggacaggg atggtggcca ctgcagcttc ccttagaatg 3841 cccaggggct tcagtggttg cccctgacca gatgagggtc agaatgtcat tccttagcat 3901 agcaggcagc tgaaatagga cttcaagaag gacttcccgg ctctgagact aacagatttt 3961 gggagggtga cagaggaaga aggggggata cccgccctgg cttcacttgg gaagagaaag 4021 atcctaaacc tttcaatagg gcctggagac agaggaatgg acaaacaggg tttcacaaag 4081 gcttctctga aagtcctggg cttgtggaga gctggggata ttagagcagg aagacggcag 4141 tggcctcccc ttcctagaag tgcagcctga caggtgcagc acctgctccc cagcaggggg 4201 agacagagag tcagctgggg cccatctggt gcttgggaat gcctgggctg caggacaaat 4261 cccatcctgg cagagtcgag gcgccaaggg agttcccagg atgcttacca gttcacacca 4321 tggaggcgct gacctcctga aacctcctgc tcccttaagg gatttgggcc cctagacctc 4381 aggagtgaga gcttcaccct tcctgcaagg agggacccca tggcagccag gcacacccag 4441 gaccctcctc ccagccagca tccctcacca gctgtcccct gccccatctc agaaacccca 4501 atcagccacc ccactgcact 
gctaagccca gactcttgct ttgatccaga ggcccccaag 4561 atccgtgtcc cccgccacct ccgtcagacc tacatccgcc aggtgggaga gacggtcaac 4621 ctgcaaatcc ccttccaggt gggtattacc cccaccaagg gtaacagcct gggtgggaac 4681 cagggccccc atttaagggg agcctgggag atggcagggt ctccaagggt ccagctgaga 4741 cccagcgaga cggcttcatc cttccgcaat catgcacatg actcattggt gattctcagc 4801 ctgtggatgg gcaggtgggc ccccagagcg ggatggacac ctggaagatg tggctgtggg 4861 cagcacctgt ggctccagcc tctgctgccc ctcctgagaa gggccagtgg gacaggcttc 4921 cccgagaaca gctccagaga ctcctggagt taaggaggtg tggggactct ggcacctccc 4981 tacactcacc ctgggggctc ctccctgcca ggggaagcct aagcctcagg ccacatggac 5041 ccacaacggc catgccctgg acagccagcg ggtgagcatg cgcaccgggg accaggactc 5101 catcctcttc attcgctcgg cccagcgctc cgactctggc cgctacgagc tcactgtgcg 5161 cgtggaagac ctggaggcca aggcagtcat tgacatcctg gtgattggta tggagccagg 5221 ggaacagtgc tgtctcatgg gggacctgga gcccccaacc tcctatccca gagctgatgg 5281 ctgatgggag caggccaagg gagtgtggca agagccaaat tttgggtgtg ggggctctgt 5341 gccagcctgg ccctcacact cacttcctct cacactctgg gcctacagag aaacctggac 5401 ccccctgctg catcaggctc ctggacgtct ggggctgcaa tgctgctctt cagtggacgc 5461 caccccagga cacaggcaac acagagctcc tgggctacat ggtgcagaag gcagacaaaa 5521 agacaggggt gaggctggct gaggcaaagt ggggtcgggg tacgaagtgg tcaaggtggt 5581 ggggaaggcc catcccactg agtctaggga ccgagtctag ggatcgagtt ctggtctgcc 5641 cactgctaac ctgactccct aggaggccct tagcaagact cttcctctct ggacctcaga 5701 tgcctcctga agactgtgaa gagggctggc ccgggctgtc tgaagccttt caagttctgg 5761 agccatgctt ctcaaatctt aatgtccaca tgagtcacct ggggagcttg ctaagaggca 5821 gagtctgact cagctaggtc cagggtgggg gctgagggcc tgcatttcta agacgtttcc 5881 aggtgatgct gaagttgcac atccatgggc cacactttgc acacaaaggc tctaggattc 5941 tctgaggacc ctacagaggt cacctgagtg acccaggggc cccttcttcc aattcaatgt 6001 ccctgaggcc caaagctgga tggttcagca tcctaaggtt gggggcatgg aatgagttcc 6061 tgtgaggaaa ggtctccagg gtaagcaggg agggtgagga caggggtgag aaatgagggt 6121 gacacccggg gccctgcagc aatggttcac agtgctggag cgctaccacc caaccacctg 6181 caccatctct gacctcatca tcggcaactc 
gtactccttc cgggtcttct cagaaaacct 6241 gtgtggactc agcacctcgg ccaccgtcac caaggagctc gcccacatcc agaaggcagg 6301 tgggtttggg cccggagctg gctgtgctaa ctctctccct gtctctgctt cttcctccgc 6361 gtctctgcac ccctctctct gggtttctcc tgggcgtctg cccctggaac acttgccgtc 6421 tcctcactcc acggctatgc atacacacac atggagtggc tgcaaagtag tgaatgtggt 6481 tggctgattt taacaaacac atcagagcaa atccacacaa atgaaagatg cccctttgga 6541 ggctgtgtgc ttattccaaa gaggtgtgtg gctgccagca tcacagcagc cgccgagtct 6601 caggcagtga gtgtccgaag ggaacctaaa gggtgacctg cgcatggctg ggtagtgtgc 6661 taaaacaagc ctcccaaccc taatacatac cacatagatg gacaagagag ataggagtat 6721 ttactgtctc cacacaggca cctgaccctc ttgcgtcaaa tgttaaaccc cgggcccttt 6781 tccctgtccc ttgtgtgccc caaggccctc tccgacccac agtctgtgtg ggttccagaa 6841 ggcaaacttt ttttaacact agcctgaact gcacagaatt aggaagaaac ttttgtttaa 6901 aaaaagaaag ccagtcaagt ctccaacgac aaacccaaag gaaacgtggc acctctgagt 6961 tgctagcatc catcttctgc caagatgctc ttatctctgg tttttccacc ttctcatctc 7021 tactcatcac cctgacccac ccagctacag tctcagcttt ctctgcctct ttgagattgt 7081 ccaagttcct gcaaataagc cacgacttgc tatccataga tcttgaatca tagtcagtcc 7141 tgacaggtct ccctcactcc tgtaagctca gatgcacccg actttctccc ctgaacccag 7201 atattgctgc caaacctaaa gggtttattg agcgagactt ctcagaagcc ccctcattca 7261 cccagcccct ggctgaccac acctccaccc ctggctacag cacccagttg ttctgcagtg 7321 tccgagcttc acccaaggtg aggggccagg ccctgttgtg ccctgggctg gtgggggcag 7381 gtggggacag cagggctgag gaatcctccg ggcgtccaca cctaagcctg gggatagctg 7441 ttgagagccc cgccccagtc catacctccc cagctctctt ctgaccatgt ctcccagccc 7501 aagatcatct ggatgaaaaa caagatggag atccagggca accccaaata ccgcgccctc 7561 tctgagcaag gcgtctgcac cctagagatc cggaaaccca gcccctttga ttctggggtc 7621 tacacctgca aggccataaa tgtgctgggg gaggcatctg tggactgccg gctggaggtc 7681 aaaggtgaga gtttgaggac gggagaacag gagtaagcac ctctctggaa ctggactaga 7741 ggcagcagac agcttaaaat caaggccgaa gagaaggggt ggtaagactg gggtcaggct 7801 gtgctgatta ctttccaagc ggcagggtgg acaggtgtag aagagggtgt ataaccgtga 7861 actttccagt tccccgccac ttcccccaag ctctaactgc 
agcatctctt ccagcctcag 7921 ccgcacactg aagagtcacg ggaggaggct gtgatgagga gccaggtaaa tggcctgacc 7981 cacccactct gttacctctg tgggcagcac agaagcagag aggaggccag gagcaggggg 8041 ttggggtgag aagggaagtg tggtcagctc tgtggacaca aacttgtgga cacaggcaac 8101 tagcacgcat gtccagagac tcacagtgat gttgcagggc caggcgacgc cggtgtctcc 8161 tggaaggggg gtgattccgt ggaccgccac acaaccagag tctactccca gctctgagtg 8221 atcttcagca agggtgttct ctttggacac attctctcat ccgaaaactg gggataacaa 8281 cctttccctg ccttccttcc cctggtggtt ctgaggctct gcataagtac ggaagtatct 8341 agaactgcag tgggcagagc tcagacctgg gcattgcacc tgagggaggg aggcctggtg 8401 aggaagagga tgctgctccc cctccccact ctgcacctct ctggcctgca gccgagctct 8461 gctcacacca ggggctcagg aagcatgtcc tcctccaggt gctgcaggcc tgaggggcca 8521 ggtccaccag gcctacagtc aaactccaga gatgcccaag acccccgctt ccatagcgac 8581 ggctgatgga accgcccctg gtacctgctg gcctcccttc cccagaggct ggtgtccaga 8641 actcaggaat gggcctggta ggccccaggc cagactaact gggctcaagg ggtgctggaa 8701 ggcgcagaga ttggagtgcc ctgcggagtt gcactctggg tgggaagcac tcaaataaag 8761 atgcgtggtg ttaacagtgc ggggcctctg tgcagtgagc gacatcatct gggggaggcg 8821 ctgtgtcacg aggagggggg gtgtctgaat gtttcaccat gagggggtct ccagatatgt 8881 caccaagagg aagatgccaa tgggagaggt ctcaggatgt catggctggg gcagggaggt 8941 ggggggatgg ggggtgttct ggacctgtga gttttcagga tctcccttct gatcagggtt 9001 tgctaccgcc agtgcgtaac tatggaaaca gacacaccct ctctggaacc ctccttgctc 9061 caacgcctcc acgtgccccc tctcgcagct cctcccagcc cccgtcattg cttcctgtct 9121 gctgaagctg tcttattgcc ctcaacagtg gtggggatgg gcgcccagca atgtgctctc 9181 cacagggttc acagtttttt attagtcaca tgggcctccc ttgggagcca gggcacaccc 9241 aatttccaac agagggaagg gcttggaagc aagtcctgga gttgacactc atgttcagaa 9301 ggtcaactcg acggccacct cccctccctc gccccccgac acaggggctc caggatcccc 9361 cgcctcgccc tcccacacag gggctccagg ctcccccgcc tcgccctccc acacaggggc 9421 tccaggctcc cccaccttgc cccccaacac aggggctcca ggcttggcag ggcatgggtt 9481 ggtccttccc cagatcccta aagtcctaga cccccggaca cctgcaacag agcttccaaa 9541 gcccctgaaa acccagccag atactagggg tattgaggga ttcacatgac 
agcaaggcct 9601 tcccccacat gctctcaggg gggtactctc ctctgcctgc tgcag // LOCUS HSU29895 20890 bp DNA PRI 14-NOV-1996 DEFINITION Human 4-hydroxyphenylpyruvate-dioxygenase gene, complete cds. ACCESSION U29895 NID g1667329 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 20890) AUTHORS Stenman,G., Roijer,E., Ruetschi,U., Dellsen,A., Rymo,L. and Lindstedt,S. TITLE Regional assignment of the human 4-hydroxyphenylpyruvate dioxygenase gene (HPD) to 12q24-->qter by fluorescence in situ hybridization JOURNAL Cytogenet. Cell Genet. 71 (4), 374-376 (1995) MEDLINE 96103473 REFERENCE 2 (bases 1 to 20890) AUTHORS Ruetschi,U., Rymo,L. and Lindstedt,S. TITLE Direct Submission JOURNAL Submitted (22-JUN-1995) Ulla Ruetschi, Department of Clinical Chemistry, Institute of Laboratory Medicine, Goteborg University, Gothenburg, S-41345, Sweden FEATURES Location/Qualifiers source 1..20890 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leucocyte" /chromosome="12" /map="12q24-qter" exon <1968..1995 /number=1 CDS join(1993..1995,2103..2129,2997..3059,3384..3488, 4191..4233,4410..4492,6028..6117,10680..10783, 11393..11470,13255..13417,13536..13607,16637..16759, 20412..20528,20622..20732) /codon_start=1 /product="4-hydroxyphenylpyruvate-dioxygenase" /db_xref="PID:g1667330" /translation="MTTYSDKGAKPERGRFLHFHSVTFWVGNAKQAASFYCSKMGFEP LAYRGLETGSREVVSHVIKQGKIVFVLSSALNPWNKEMGDHLVKHGDGVKDIAFEVED CDYIVQKARERGAKIMREPWVEQDKFGKVKFAVLQTYGDTTHTLVEKMNYIGQFLPGY EPPAFMDPLLPKLPKCSLEMIDHIVGNQPDQEMVSASEWYLKNLQFHRFWSVDDTQVH TEYSSLRSIVVANYEESIKMPINEPAPGKKKSQIQEYVDYNGGAGVQHIALKTEDIIT AIRHLRERGLEFLSVPSTYYKQLREKLKTAKIKVKENIDALEELKILVDYDEKGYLLQ IFTKPVQDRPTLFLEVIQRHNHQGFGAGNFNSLFKAFEEEQNLRGNLTNMETNGVVPG M" intron 1996..2102 /number=1 exon 2103..2129 /number=2 intron 2130..2996 /number=2 exon 2997..3059 /number=3 intron 3060..3383 /number=3 exon 3384..3488 /number=4 intron 
3489..4190 /number=4 exon 4191..4233 /number=5 intron 4234..4409 /number=5 exon 4410..4492 /number=6 intron 4493..6027 /number=6 exon 6028..6117 /number=7 intron 6118..10679 /number=7 exon 10680..10783 /number=8 intron 10784..11392 /number=8 exon 11393..11470 /number=9 intron 11471..13254 /number=9 exon 13255..13417 /number=10 intron 13418..13535 /number=10 exon 13536..13607 /number=11 intron 13608..16636 /number=11 exon 16637..16759 /number=12 intron 16760..20411 /number=12 exon 20412..20528 /number=13 intron 20529..20621 /number=13 exon 20622..>20890 /number=14 BASE COUNT 5118 a 5298 c 5467 g 4921 t 86 others ORIGIN 1 tcccacctca gcctcctgag tggctgggga ctacaggtgc tcactaccag gcccagctag 61 gtgaatgctt ttctgcacac cagggtattt tcaaggatgc gtcctatgaa cccatctgat 121 cagtgagctt cttgagcaag gcctctggaa cccacctctg catggtcggg catgaagaac 181 agagacagtc acacccagcc actgctgatc tgttatccag tggagcccag cctgaacaga 241 gagcatcagg gaggggactc tggatactgg gccctgcagg cttgcctccc tgctctctgc 301 ccagacatgt gaatggcatc taggggtgca cccagcatgg gactgagacc tacagtgttg 361 gggacagagt tcctacctct agatgcctgc cagcctttgn aacccactgc tgttgacaca 421 cttccttgag ttagtaaaca ttggcctttg tgcttagcat ataagccagg gcaggctcca 481 ggagtggggg tttgagttgg ccacagcaat caaagtgaca agatgttcct gctggcctgg 541 aggggatagg caggctacac agagctcttc tcctccatgg gaaccttctg ggcctgtgag 601 atccaccctg ggccacctcc actcaagttt tcccagttgg atatttattt atttatttga 661 gatggagtct cactctgtca cccaggctgg agggcagtgg catgatcact gctcactgca 721 atctctgcct ccttggttca agtgatcctc ctgcctcagt cccctgagta gctgggacta 781 cagacgtgca ccaccgtgcc cagctaatgt ttgtattttt agtagagatg aggttttacc 841 atgttgccag gctggtctca aactcctggc ctcaagtgat cctccctcct tggcctccca 901 aagtgctggg attacaggcg tgagccacca cccccggccc ccagttggaa atttagactt 961 aaggagctga ggtggacact gggggcacac tgggcaggcc caggacaggc agaataaggg 1021 ggtgggacca ccttgcctgc ctcctgaatg gcccagaaaa tagagaaaag gagggagaag 1081 tggtctgcac tggtgcaatg agatccaact tcagaatcaa acaggccaga gtttggggct 1141 ctgctcttac cagttccatg acctgggaga 
catgtgttca cctctccagg cctcagtttc 1201 cccatctgat ggggatatca atatctccca tttgtataga gggcaatcta gcaatttaaa 1261 gtgtgggtcc tggagttaga atagcccggt tcatgttcca gttcttccct gccgcttgct 1321 gacctcatgg cctaggaatg ttctctgtcc caatgaatta taaccaacac tgagagtgtc 1381 ctgtgtgcca ggtcctgttc caaagagcgt gcttatctcc tttaatcccc accacactat 1441 tctcattcct gttttacaaa tgaggctcaa agggatatgg tgattttccc aagtcacact 1501 gtatgtaaca gagccaggat gcaaaccttc tcctgccgtg cctctcagag ctatggaata 1561 caacgctgtg acgcaaacag gactttgggg scagtctcca aatgccccct caaatttcct 1621 ctaggcaaga gttctggagc cgccactgca ctccagccgg ccttggcttc tgagctttgg 1681 attctgggcg gtaatggtca cattcccggc gtcctggaaa agcttgccct tggcttctgg 1741 ccctgactct gcctcccact gactagtgga cgtcactgcc cctctctggg cctggcaaca 1801 ccatcctgtt caatgaggct gcggctggat cagaggtccc caggcctgga accttccagg 1861 gctgggctca gctccccacc ccatccccca ggtaggccag gaggcctggg cgggacctgc 1921 ttggggttaa atactccggc ccagcactcc ccaggcctct agtcccagta ggaggtttga 1981 ctaagatcaa tcatggtaag tgccatggcc atctcccctt gggtgggaca tcttcagtaa 2041 cgcttccaga agcaccttgt gttggaacct ccaagtctca ctgtccttct ttttctcctt 2101 agacgactta cagtgacaaa ggggcaaagg taagcaggcg aaatgtggga aatgtagggg 2161 cctgagtcta gaagggggtc ggagtgcagg gagactgggg tttcagcaga gagcagaaga 2221 ggaaggactg gagtggatac atgacgtgca tgcctgcaca ctcagcttta cccccggggc 2281 tccccgagac tgagctggac tcaacctcac cctcacaaca aacttgtatc ctggccgggc 2341 atggtggctc atgcctgtaa tcccagtgct ttgggaggcc aaggcaggca aatacttgag 2401 gtcaggagtt cgagaccagc ctggtcaaca tgaggaaacc ccgtctctac taaaaatacg 2461 aaaattagcc gggcatggta gcgcacccct gtaatcccag ctactcagga ggctaaggtg 2521 ggggagatca cttgaaccct aggatcgctt gaacctggga ggcggaggtt gcagtgagcc 2581 aagattgcac cactgcactc cactccagcc tgggcaatag agcaaaactc catctcaaaa 2641 acaaaaacca aaacaaaacc tgtacccctt ctcaaatgcc tgccaatctt ttaatcccat 2701 ctggaagttc tttctaaggt ctaacttaaa tccatattgc tgcaacataa gtctatttcc 2761 ctgtgtgaaa ccttggcagg cctcctgagg gggtttcccc atccaagaca cccaggagct 2821 caacttccac atgagaaacc tcttttgagg cttggcaagg 
ctgcatgggg ttccgctgct 2881 cctaagttct gtggttgagg ctggaataag gagggtggct aggggacatc actccacccg 2941 cacttttcta gactcacttt cagggactca acctcacctc ttctctcctt ctgcagcctg 3001 agagaggccg attcctccac ttccactctg tgaccttctg ggttggcaac gccaagcagg 3061 tagaagccgg gccaaggtgc tggagtgaac agacggcagg acagaggcct gcctgacaag 3121 ggcagccctc cagcagggtc tgtgattgcc ccttgggagg gaggatcagg ccaggggtgg 3181 gccagatact ctgaatccta attacaccag cgggaaagtt atctgtcanc aaaatgtgca 3241 ggcagcagtc tatcttccca ggagggtctc tgcccctgag acaatagggc aggcaggctg 3301 aggggggcca ggggctctct tgcggaagtc ccccatgctg gccccttgac cctcatgaac 3361 cctgctgcaa ccctgtgttt caggccgcgt cattctactg cagcaagatg ggctttgaac 3421 ctctagccta caggggcctg gagaccggtt cccgggaggt ggtcagccat gtaatcaaac 3481 aagggaaggt gagtcaccac ccccaggaca gaccccaacc cctgcctggc caacatggtg 3541 aaaccccatc tctattaaaa atagaaaaat tagctgggca tgatggtggg cacctgtaat 3601 cccagctact caggagggtg aggsaggcac atcgcttgaa cccgggaggc agaggttgca 3661 gtgagccaac gtcgtgccac tgcactccag cctgggcgac agagcaagac tccatctcaa 3721 caaaaaaaaa caaaaaacaa caaaaaaccc caacaacaac aacaaaaaca tataattcag 3781 ctcccaccga cccctccgct ggcatctttg tacagccatg caacacggcc cttcctagtt 3841 tcgtttcctc ctcagcctgg cccaacactg accccacagg tgctcagaaa acctgggata 3901 tcagaggaga ggaattttaa gcagtgccct gaccgatggg gtccctggag ttctctcccc 3961 gagggagacc tgacctagca gttcctctga ggatccagag ggggtctggc tgaaagcccc 4021 atccctcagg accagtgtgg gaaactaagg cagagaagga gctagaaccc aggtccagga 4081 gttcttgtgc tccatcaggg gtgtttcggg tcggcaggga ggggccgatg ggtgcacctc 4141 cccatgcccc cctgagccac tagcctcacc tccttctcct cctatcttag attgtgtttg 4201 tcctctcctc agcgctcaac ccctggaaca aaggtgaggt gcccggggag ggtggacgca 4261 tggctctgtg ggtgggatct gggttctggg agggtggacg catggctccg tgggtgggct 4321 ctgggttcag ggcctggatg gagactaaaa gccaggtgag gggacctaga ctccagactc 4381 cagtgccatc ctctccctgc cggccacaga gatgggcgat cacctggtga aacacggtga 4441 cggagtgaag gacattgcgt tcgaggtgga agattgtgac tacatcgtgc aggtaagaaa 4501 rcacctgggt ggactgagag aggaaggcct ccgtctgyca tggggacccc 
atcagtcagg 4561 gctggaagga cactgctgca tcacagtgac aagcccagat gccacatcac agtgacaagc 4621 tcagatgctg catcacagtg acaagcccag atkctgcatc acagtgacaa gcccagatgc 4681 tggtgtgaca cacacctggg ctccgggggt kccctgtctt cccccgtgtc ctctctctca 4741 ctgagcctca gtttccacag ctgccagatg gggctaacca cagctgggtt gtcaaaggag 4801 gaaattagaa ttaccatttc cttttttttt tttttttttt tttttkgkga gacggagttt 4861 tgstcttgtt gcccaggttg gagtgcaatg gmacgatctc ggctcaccgc aacttctgcc 4921 tcctggattc aagcaattct cctgcctcag cctcctaagt agctaggatt acaggcgtgc 4981 gccaccacgc ccggctagtt tkgtattttt agtagagatg gggtttctcc ttgttggtca 5041 ggctggtcat ttcctttttt ttcttttttc attttctttt tcttttttgt gtgtgtgtgt 5101 gtgtgtcaga gtcttgctct gttgcccagg ctggagagca gtggcgcaat cttggctcac 5161 tgcaacctcc acctcctggg ttcaagcgat tctcctgcct cagcctcccg agtagctggg 5221 attacaggcg cctgccacca tgcccagcta atttttgtat tttagtagag atggagtttt 5281 gccatattgg ccaggctggt cttgaactcc tgacctcaag tgatccaccc acctcggcct 5341 cctaaagtgc tgggattaca ggcatgagcc attttgsctg gcctcagaat tactgtttct 5401 tgagcacaac tctaaaccaa acactgtacc caacaggcag cctacagatt cagagtgtgc 5461 gttccagagc caggctctcc aggtctgaat cttagccgtg tgacttcggg caagtgactt 5521 aatgtctctg tgcttcaacg tcctcaccta gagaatggag atgataatag tacttttgca 5581 taatgttgct gtgaggatga acgaggttag aatctagaac agagcctggt gcatgatata 5641 agcactgtca tgcagtccag ttttgctaaa taaatgcatg aatcttctta ttattctcat 5701 ccaaccattc tttttacaga tggtgaaatt gaggactaga gaaggggact gatcacaggg 5761 ggcgttagca gaggcaggac agacacccag tccctaccta gtttgtctcc catagcccca 5821 actctgggcc tgggcctcag tttccctatt tggcaaatga gcatgagtat tgggggtaat 5881 ggctgaagct ggctggggtt cctacctggg gctggagaaa ctggaggggg ctgcaggacc 5941 acgcagaggc caccaggcag gagcagcagg caagggttct ggaaggntcc agncctcacc 6001 aattccccga tcccatcttc tctgcagaaa gcacgggaac ggggcgccaa aatcatgcgg 6061 gagccctggg tagagcaaga caagtttggg aaggtgaagt ttgctgtgct gcagacggtg 6121 agttcccttt ggtgccccca cccctcctgc tgtcggcacc actgagatgt caggagcccc 6181 gcagatgact ccatcaaacc atcccaccca tggggcttga agcacagaca ttgtccctca 
6241 agtctgtccc ccgtattggc tgggcagtgt acggggactg caggagccac cagacactcc 6301 tcccaggctt cccttcctcc cagtgnggga ggatgacagc aagggggtga tggcaggggc 6361 tctgaagaag ggagaaggat ggggtctggg gacatcagtt cccacacgag ctgtgtgact 6421 ttgggcatat tactgtacct ctctatgact ctgtttaaat gagcaaagac tggaactgga 6481 acacctatgt ggtctgggac atgcgcttaa ctttcctgtg cctcagcagt atctttttgg 6541 tttcctcaaa aaaatttttt tttaaagatg gagtcttgct ctgtcgccca ggctggagtg 6601 tagtggtgtg atcttggctc tctgcaagct ccgcctccca ggttcaagtg actctcctgc 6661 ctcagactcc cgagtagttg ggaccacaag tgcgtgccac catgcccggc taatttttgc 6721 ttttttgnat ttntagtaga ganggngttt caccgtgtta gccaggatgg tctcaatctc 6781 ctgacctggt gatcctccca actcagactc ccaaagtgct gggattacag gcgtgggcca 6841 ctgtgcccgg cctttttctt tttttattat tcttttttct tttttcagac ggagtctctc 6901 tgtcgcccag gctggagtgc agtcgcgtga tctcagctca ctgcaacctc tgcctcctgg 6961 gctcaagtga ttcttctgtc tcagcctcct ggggattaca ggcatccacc cccacacacg 7021 gctaattttt gtgtttktag tagagacagg gttttggcat gttggccagg ctggtctcga 7081 actcctgacc tcaagtgatc catccgcctc agcctcccaa agtgctggga ttacaggcgt 7141 garccaccgt gccaggctgt tttcttcaac tttctttttt cttttttttt ttttttkkga 7201 gatggagtct cgctctgtcg cccaggctgg agtgcagtgg cacgatctcg gctcactgca 7261 agcgctgcct tccgggttca tgccattctc ctgcctcagc ctcccgagta gctgggacta 7321 caggsgcccg ccaccacgcc cggctaaatt tkgtattttt agtagaggtt tcaccgtgtt 7381 atccaggatg gtctcgatct cctgaccttg tgatccgccc gcctcagcct cccaaagtgc 7441 tgggattata agtgtgagcc accgtgcccc acctcttcaa cttttattat aggttctggg 7501 gtatatgtgc tggatgttca agtttgttac ataggtaaac gcatgcatgg tggtttgcta 7561 cacagatcat cccatcacct aggtattaag cccggcatct gttagctact ctccttggtg 7621 cactccctcc ccgacctccc ccacctgaca ggccctagtg tctgttgttc ccccgtgatg 7681 tgtccatctt ttctcatcat ttagctctca cttctaagtg agaacatgca gtgtttggtt 7741 ttctgttcct gcgttagttt gctgaggata acggcttcca gctccatcca tgtcccctgt 7801 aaaggacatg atcttgttcc tttttatggc tgcaagcctc agtatcttca actatgactc 7861 ctcatagggt catcgkaagg atccaatgag aactgggcgc agtggctcac gcccataatc 7921 
ccagcactgt aggaggctga ggcgggcgaa tcacttgagg tcagaagtta gcctggccaa 7981 catggtgaaa ccccacctct actaaaaana aatacaaaag ttagccaggc atggtggtgt 8041 gcacctgtag tcccagctac ttgggaggct gaggtgggag aatcgcttga acccaggagg 8101 cagaggttgc agtgagccaa gattgcacct ctgcactcca gcctgggcga cagaacaaga 8161 ctccctctca aaaaaaaaaa aaaaaaaaaa ggatccaatg aggtcatttg tgtatakcac 8221 ttagaacagt gcctggcaca ttagacactc agcaaagatg accaattctg tttaggacct 8281 gtgttactgt aatacagaga caatcatagc tcttacctcc ttgagattaa attcaacatw 8341 aagggctggg cacagtggct cacacctata atcccagcag tttgggaggc tgaggcagga 8401 ggatcacttg aggccaggag ttcgagacca gcatgggcaa gatagggaga ccccatctct 8461 acaaaaatta taaatacaaa taaaaaataa ataggccagg tgcagtggct cacacctgta 8521 atctcagcaa tttgggaggc caaggcaaga ggactgcttg agcccaggaa tttgagacca 8581 gcctgcgcaa caaagtgaga ccccatctct acgaaaaaat taaaaattgg ccaggcatga 8641 tggagcatgc ctgtggtccc acctacatgg gaggctgagg tgggaggatt gcttgagtcc 8701 aggaggtaga ggctgcagtg agctgttatc acaccactgc acttcagcct ggttgacaga 8761 gtgagactct gtctcaaaaa taaatgaata aaaatgtttt aaaaagttaa aaaattaaat 8821 taaacataaa aatggttaaa gtgtgcctgt aatcccagca ctttgtgagg ccgaggcagg 8881 tggatcacct gagttcagga gttcgagacc agcctggcca acgtggcaaa accctgtctt 8941 tactaaatat acaaaaatta gctgggcatg acaggcgcct gtaatcccag ctactcggga 9001 ggctgaggca gaagaatcac ttgaacccgg gagacagagg ttgcagtgag ccaaggttgc 9061 accactgtac tccagcctgg gcgacagagc gagactccat ctcaaataaa taaataaaaa 9121 ttttaaaaaa taaaagttaa aaaattaaat taaacataaa aatggttaaa gtgtgcctgt 9181 aatcccagca ctttgggagg ccaaggtggg tggatcactt gaggtcagga gttcaagacc 9241 agcctggcca aaatggtaaa accccatgtc actaagtaaa acaacacaaa aattagccag 9301 gtgtggggct gggcacggtg gctcatgctt gtaatcccag cactttggaa ggccaagggg 9361 gcggatcacc tgaggttggg agtttgagac cagcctgacc aacatggaga aaccccatct 9421 ctactaaaaa tacaaaattg gctggacgtg gtggtgcatg cctgtaatcc cagctactca 9481 ggaggctgag gcaggagaat cgcttgaacc cgggaggcag acgttgtggt gagccaagat 9541 cgcaccattg cactccagcc tgggcaacaa gagcggagct ctgtctcaaa aaaaaaaaaa 9601 aattagccag 
gtgtggtggc acgtgcttgt agtcccagct atttgggagg ctgaggcagg 9661 agaatcgctt gaacctggga ggcggaggtt gcagagagcc aagatcacac cactacactc 9721 tagcctggcc aacaagcaag actccgtctc aaacaaaaaa aaaaaaaaaa aaaaaggtta 9781 ggtacagtgg ttcatgcctg taatcccggc acttggggat gttgaggtgg gcccggcatg 9841 gtggtacacg cctataatcc cagctactca ggaggctgag gcaggagaat ggyttgaacc 9901 cgggagatgg aggttgcagt aagccgagat cgtgccaatg cactccagcc tgagtgacag 9961 agtgagactc cgccttaaaa aaaaaaaaaa aaaaaaagat ggttaaagtg gctgacagaa 10021 ctcttcagta taattttttt tttgagacgg agtctcattc tgtcgcccag gctggagtgt 10081 agtggcacga tctctgctca ctgcaagctc tgcctcctgg gttcatgcca ttctcctgtc 10141 tcagcctccc aagtagctgg gactacgggc acccgccacg ccaccacacc cggctaattt 10201 tttgaatttt ttagtagaga caggatttca ccgtgttcgc caggatggtc tcgatctcct 10261 gacctcatga tccacccgcc ttggcctccc aaagtgctgg gattacaggc ctaagccacc 10321 gcgccaggct gctcttcagt ataatttaat ccaatgccct cctgcgttct tcatctagtt 10381 gacctactga aataaatgaa gtaggtgctt cattgagtac ctcccctgtg ccaggcactg 10441 agcttggcgc tgggtacacc atcatgagca aaacagtgtc cccgccctca tggaaactct 10501 caaacaagtg acagattgca attcatttca gaaaggagaa gtgacttacc caagatcacg 10561 catctggtta gggtcagatg caggttggaa cccggggcta cgttagcgct atgaaatgtg 10621 ctctgtcctc ggcggggatg ggtgcccagg ggccgagcca ggctgttttc cacccgtagt 10681 atggggacac cacacacacc ctggtggaga agatgaacta catcggccaa ttcttgcctg 10741 gatatgagcc cccagcgttc atggaccccc tacttcctaa actgtgagtg tccctcaggg 10801 cagaggtggg tggaaacccc tgagggactg tgaaaagtaa tgccccttcc tccatgtcca 10861 ctgcgcatgt gtgttggggc agaatggagc cggctaggga attcctggaa gtcctcaggg 10921 gtgcttgaag aggtggcaca gaggctgaga ccatggcctc tgcaaccaaa cttccccggg 10981 ttcaaatccc agctctgtta cttcacagcc atttcaccct gggcaagtcg cttcatctct 11041 ctaggcctca gtttcttcat ctgtaaaatg gggactgtaa tagtatatga gttgacaaaa 11101 taaaatgtcc ggcccattag taaatgctac ataagtgtga gtccaggcgc agtggctcat 11161 acctataatc ccagcacttt gggagtctga agcaggaggt tcacttgagc cccagagttt 11221 tagaccaacc tggccaaaaa aagagtctct ctttttgtag agactccatc tctacaaaaa 11281 
aatagaactc caaaaacaga agtgtgagct attattatta ttgaggggtc atggagaagg 11341 gagcaatctg gggggtagcc accttctcac ccccatcttc ttcccttccc aggcccaaat 11401 gcagtctgga gatgatcgac cacattgtgg gaaaccagcc tgatcaggag atggtgtccg 11461 cctccgaatg gtgagacctt ggccctctcc ctctgcgtga agatgacggg tcctcacggt 11521 ggactggccc actccctggt ccttgcacta acatggaaaa tacgactggg tgcagtggct 11581 cacacccgta atcccagcat tctgggaagc cgaggtggga ggatcacttg aggtcaggag 11641 ttcaacacca gcctgaccaa tatggagaaa ccccgtctct actaaaaata taaaaactca 11701 gtgggcgagg tgttgggccc ctgtaattcc agttactcaa gagkctgaga caggagaatc 11761 gcttgaaccc gggaggtgga ggttggtcga nagagtgaga ctccancctg gnaaanagan 11821 cnagattttg tcnncaataa acaaatacaa taaaatagac cgggcgcact tgctcacgcc 11881 tgtaatccca gcactttggg aggcggaggc gggtggatca cctgaggtca ggagttcaag 11941 accagcctga ccaacatggc gaacccccgt ctctactaaa aacncaaaaa ttagctgggt 12001 gtggcagtgt gtgcctgtaa tcccagctac ttgggaagct gagacaggag aatcgcttga 12061 accaggaagg cggaggttgc agtgagctga gattacgcca ctgcactcca gcctgggtga 12121 gagagagaga ctttgtctca aaataataat aataataata aagtaaaata aaataaaagt 12181 ggaaaataca gcacaagatg ttgggagtta gcctgaggcc actgatgcat ttattcactg 12241 taaaactgtc aacatggaga ccagtggact tgaggcccaa agcgacccac agsccctcct 12301 tctgaggaac cctggggaca tcacagtcca gggaggcttg gaggaaggtt gccaccaccc 12361 agagagcccc ctgtccaaat tccaagcaag ggcctttttc tcaaagcact ttacttccat 12421 ctgtgggagc cttgccgtcc tgtatgtatt tcgttccaag acacttccac ctgaaggaaa 12481 aatgtcatca taaaggtaca aatctaggct gggcgtggtg gcccatattt ggaatcttag 12541 tactttggag gcccaggcag gtggatggct tgaacccagg agttcaagac agcctgggca 12601 acatagtgag accccatctc tacaaaaaat acggccgggc acagtggctc atgcctgtaa 12661 tcccagcact ttgggagscc gaggcgggtg gatcacctga ggttaggagt tcgagaccag 12721 cctgaccaac acagtgaaac actgtctcta ctaaatacaa aaaattagtt gggcatggtg 12781 gtgcatgcct gtaatcccag ctacttggga ggctgaggca ggagaatcgc ctggacccag 12841 gaagtggcag ctgcagcgag ccgagattgc gtcactgcat tccagcctgg gcgaaaagag 12901 cgaaactccg tctcaaaaaa caaaaaataa aaatacccca aaattatcca 
ggtgtggtag 12961 cacatgcctg tagtcccagc tacttgggaa gctgaggtgg gaggagcaat tgagcctgga 13021 gattgaggct gcagtgagcc aagactgcac cactgcactc cagcctgggc aagagtgaga 13081 ctctgtctcg agcaacaaca acaacaaaaa gatctaaatc tctgatgggt atgatggcag 13141 tggmatctcc tttctggagc ctgctcaggg acagattctg gaaggtggaa atgggtgatg 13201 gaggccgtcc ctggaggcgc tcagagcata tgtgttccac cgcccaccct acaggtacct 13261 gaaaaacctg cagttccacc gcttctggtc cgtggatgac acgcaggtgc acacggaata 13321 tagctctctg cgatccattg tggtggccaa ctatgaagag tccatcaaga tgcccatcaa 13381 tgagccagcg cctggcaaga agaagtccca gatccaggtg aggccgcccc tggctggggg 13441 agaggggagc agggaagagg tctgggggct cttgtagggt cagggatgtg ggtgatgggg 13501 caaatgacag tggtctcctt gtttccctcc cccaggaata tgtggactat aacgggggcg 13561 ctggggtcca gcacatcgct ctcaagaccg aagacatcat cacagcggtt agaaccccct 13621 tccctgtccg aattcctctc tcattcagag ccagggacgg tctctgcaaa attgagaggg 13681 cgggtggcgg cagtcctggg cccgtttctc tgggctggga gactgacagg actccctcac 13741 tgcttcttgc aactcatggg caggcacagc cacacccacc cccagagagt tcagtctcca 13801 cagaacccct gggctttgga ggtagcaggc gccagggtat caggaagcct gggtgatggt 13861 cccggttctg ccagtggctt gctctgtgtc tctgggagat tcagtgctgt ctctggactt 13921 tagttncctt accttaaaaa tcaggaaatt agggccgggc acagtggctc acacctgtaa 13981 tcccagcact ttgggtgncc aaggcagggg gatcacctga ggtcaggagt ttgagaccag 14041 cctggccaac atggtgaaac cccatctcta ctaaaaannc aaaaattatc tgggcgtggt 14101 tgcgggcacc tgtaatccca gcttctcagg aggctgaggc aagagaagtg cttgaactca 14161 ggaggtagag gttgcagtga gccaaggtcg caccactgta ccacagcctg ggcgacaaga 14221 gtgaaattct gctttagaaa aaaataatag taattaataa aggaaattag gactgggcac 14281 agtggcttat gcctgtagtc ccaactacac gggaggccaa agcagaagga ttgtttgagc 14341 ccataagtta gagatcagcc tggaaaacag tgggaccctg tctctncaaa aagtaaacaa 14401 aattaaccag gcttggtggt tcatgcctgt agtcctagct acttgggagg ctgaagcagg 14461 aggatccatt tagccaagga tttcaaggct gcagttagct atgattgagc cacttttctc 14521 cagcctaggt gacagagagt gaggccctat ctccccgcaa cctcaaaaaa gaawtttttt 14581 ttaaaatgag gaagttaagg ttgggcatgg 
tggctcatgc ctataacagt ttctgccctt 14641 gtggagcaat aaacaaaata agtaaaatat aaatgctaga tggtagagag cacccaggaa 14701 gaaagtaaag cagggaattg ggacagcaag ggtgaatggc attgcacatt taaattgggg 14761 ggtcagacac tgtgggaggc cgaggcagga ggactgcttg agctcagaag ttcaagacca 14821 acgtgggcaa catagtaaga cctcatcctt tttttttttt tttttnggga gacggagcct 14881 ggctctgtcg cccaggctgg agtgcagtgg cgcgatctcg gctcaccgca agctccgcct 14941 cctgggttca cgccattctc ctgcctcagc ctcccaagtw gctgggacta caggcacccg 15001 ccaccacgcc cagntaatgt tttttttggt ttttttggtg gagacggggt ttcactgtgt 15061 tagccaggat ggtctcgatc tcctgatctt gtgatctgcc caccttggac tcccaaagtg 15121 ctgggattac aggtgtgagc cactgcgccc ggccanaaga nctcatcttt acaaaaaaat 15181 caaaaaatta gccaggtgtg gtggtgcaca cctgtggttc tagttaccca gaagtctgaa 15241 gtaggaggat cagctgagcc tggggaggtt gaggctgcag tgagctaaga tcatgccact 15301 gcactgcagc ctgggtgaca gaacaaagac cctgtctctc aaatatatat atgtgtgtgt 15361 atatatatat atatccttct tttttgagat gcagtcttgc yctgtyytct aggctggrgt 15421 rcagtggtgc gattttggct cactgcaacc tccacctcct gggtwcaagc gattctcctg 15481 cctcagcyyc ccaagtagct gggatwacag gtgcatgaca ccacgcctgg ataacttttt 15541 ttgtatttta tttttatttt ttatttttta tttattttat tttattttgg agacagagtc 15601 tcgctctgtt gctcaggcta gattgcagtg acgcgatctc ggctcactgc aagctctgcc 15661 tcccgggttt acgccattct cctgcctcag cctcccaagt agytgggact acacgctccc 15721 accaccacgc ccagtaattt tttttttgta tttttagtag gggtttcact gtgttagcca 15781 ggatgkymtt gatctcctaa cctcgtgatc acccatcttg gcctcccaat gtgctgggat 15841 tgacaggcgt sagccacctc gcccggcctt ttttttgtat ttttaataga gacaggattt 15901 tgccatgttg gtcaggctgg tctccaactc ctgacttcaa gtgatctgcc cgcctcagcc 15961 tccaaaagtg ctgggcttac aggcgtgagc cnccacactc ggcctaatag atccttaatc 16021 gtttcatgcc tcagtttccc cagctatgaa atgggagaat agtccttcag ctgcctcaaa 16081 agattgctgc atggacataa tgagggaata cgtagaaagt gctcaggccg ggctcatctg 16141 tcacgcctgt aatcccagca ctttgggagg ccgaggtagg tggatcgcct gaggtcagga 16201 gttcgagacc aacctggcta acatggtgaa accccatctc tactaaaaat acatagctga 16261 gtgtgatggt 
agtcacctgt aatcccagct actcaggagg ctgaagcagg agaatcgctt 16321 gaaccgggag gcagaggttg cagtgagcta agatcgtgcc actgcactcc agcctgggca 16381 acagagactc catctcamaa aaaaagaaaa aaagaaagtg ctcaaggcag gcacatggtg 16441 agtgttctat aaatctagca gttgttgttc taacggggcc catgacagag aagaaagggg 16501 aagaggccac acagcatcct ggcagttgag gggagctagg atagggaccc tgtatcctga 16561 tgacccagca cctccacctg gctaggggag ttgaacttgg aggccgaagg ctgacctcag 16621 ctttgctctg aaacagattc gccacttgag agagagaggc ctggagttct tatctgttcc 16681 ctccacgtac tacaaacaac tgcgggagaa gctgaagacg gccaagatca aggtgaagga 16741 gaacattgat gccctggagg tgaggcccag gcaggatccg cagccttgtg gggggagtat 16801 gagctctgca ttgcatcttt tcagaaatat tgtctcagcc cggaggggag cccarrccaa 16861 gtccctgctg gcaaaggaca tttattctag ggtctatgtg ggaggaggag acagatgaca 16921 aacaaataaa caataaaagc acggtcagag atagtcagtg caggaagttc aaacaggcag 16981 cctggggctg ctttacagag actggtcagg gagggcctct ctgaggaggt gacatctgag 17041 tgaagtctag atcatgagcg ggagctggcc atgtgaacag cgggaagaaa ggcattcctg 17101 gcagagggat cagcaagggc cgaggtagga gcatgccctg agcgtctgag gaacagaaag 17161 aataggctag agtggatggg aggctgcttt tgancgatga cctaggtcag acaggtaagc 17221 aagagccagt tcctcttgtg agctattatt caggaatctg gattttgttc ttttgtgacc 17281 agaaaccact ggagggtttt ttggtttggt tttagcagtg ttattgacat atgtcacata 17341 acatgtcatt tttttaaggt gtacaattct atgatttcta gctatttgct atgttgtgca 17401 actatcatcc aaatccattt taggactggg tgtggtggtt catgcttgta atctcagtac 17461 tttgggaggc cgaggcgggc ggatcatgag gtcaggagtt cgagaccagc ctgaccaaca 17521 tggtgaaacc ctgcctctac taaaaanacc aaaattagcc aggcgtggtg acacatgcct 17581 gtaatcccag ctactcagga ggctgaggca ggagaatcac ttgaacccgg gaggcggagg 17641 ttgcagtgag ccgagattgc gctactacat tccagcctgg gcgacacagt gagactctgt 17701 ctccaaaaaa aaaaaaaaaa aaaaaaagat ccattttagg actaggtatg gtggttcatg 17761 cctgtaatcc cagcactttg ggagaccaag gcaggtggat cacttgaggt caggagtttg 17821 agaccagcct ggtcaatatg gcgaaacccc catctctaca anaagaaaaa aaaaaattag 17881 ctgtgcgtgg tgatctgtgt ctgtggncca gctactgggg aggctgaagt gggagaatca 
17941 cttcaaccca ggaggtangn gctacagtga ccatcatncn aggtcactgc accccagcnc 18001 tggtgatacc ctgtctaaaa gaaaaacaaa aacagttagg ccagacacag tgctcacacc 18061 tataatccca gcattttggg aggttgaagt gagaggattc cttgaggcca ggagtttaag 18121 accagcctga gaaacacagg gcgaccctgt atctacaaaa aaaacagttt taaaaaatta 18181 gctgggcatg gtggttcctc cctgtagtcc cagctactca ggaggctgag gtgagaggat 18241 cacttgaggc caggaggtca aagctgcagt gagccatgat caggctactg cattccagcc 18301 tgggcaaaaa atcaatttta gaacattttt gtcaccccaa aaagaaacct tgagcctctt 18361 agccagcacc tcccaacccc ctctgttccc ttcccccctc ccaccccagc cctaggcacc 18421 attaatctac cttcagtctc tatagatttg gctattctgg acatttcatc aaaaggaatc 18481 acaaagtcca ggcacggtgg ctcatgcctg taatcccagc actttgggag gctgaggtgg 18541 gcggatcatg aggtcaggag ttcaagacca gcctggccaa catggtgaaa ccctgtctct 18601 actaaaaata caaaaattag ctgggtgtga tagcagatgc ctgtaatccc agctactggg 18661 gagcctgagg ccggagaatt gcttgaaccc agggaagggg aggttgcagt gagccgagat 18721 cgcgccattg cactccagcc tgggcgacag agtgaaactc catctcggaa gaaaaaaaaa 18781 aggaatcaca aaataggcct attgtgagca gcttctttcc catagcatgt gctcaaggtt 18841 catctgcgtg gggtgtgtgt cagtgcttcg ttcctttttg tggctgaaaa atatcccatt 18901 gtatagatag acacacattg tttatctatt cttcgttgat agatatttgg gttgtttcca 18961 cttttggggt gttatacgta atgctactgt gaacatctat gtacaaggct ctgtgtggac 19021 gtgttttcaa tgcactttgg tatatatcta ggagtagcta ggtcacatgg taattccatg 19081 ttcccttttg tttgtttaga gatggagtct tgctttgttt cccaggctgg ggtgcagtgg 19141 tgcgatcttg gcccattgca acctccagcc tcctgggttc aagcaattct cctgcctcag 19201 cctccccagt agctgggatt ataggcgccc gccaccacgc ccagctaatt tttttttttt 19261 tttttttgag acagagtctt gctctgttgc ccaggctgga gagcagtggc cgatcttggc 19321 tcactgcaac ctccacttcc cgggttcaag cgattctcct gcctctgcct cctgagtagc 19381 tgggataaca ggcgtgtgcc accatgccca gccaatttat ttttattttt agtagagatg 19441 gggtttcacc atgttggcca ggctggtctc gaactcctga cctcgggtga ttcactcacc 19501 tcagcctccc agagtgctgg gattacaggc gtgagccacc ttgcccagcc taatttttgt 19561 attttntttt tgtagtgact gggtttcccc atgttggcca 
ggctggtctc gaactgagct 19621 caggtgatcc acctgcctcg gcctcccaaa gtgagccacc atgcctggcc accccctttt 19681 tttttttaat ccgtatttta aggaagcaac actatatgtt taactttttg agaagctgcc 19741 agactgtttt ccaaagtggt tgcaccattt tacagtccca ccagcagtac atgagggttc 19801 cgatttccac tggaggattt taaacagggg aatgacatga tccgactttt aaaagatccc 19861 gcagcaggtc atggtgtcac gcctgtaatc ccaacacttt gggaggtcga ggcgggtgga 19921 tcacctgagg tcaggagttc gagaccagcc tgaccaatat cgtgaaaccc cgtcttacta 19981 aaaatacaaa aattagccgg gcatggtagc ctgcgcctgt agtcccagct actcgggagg 20041 ctgagacagg agaatcgctt gaacccggaa ggcggaggtt tcagtgagcc aagattgcgc 20101 caccgcactc cagcctgggt aacagcgcaa ggctccgtct caaaaaagaa aaagaaagtg 20161 tcctgataat gtctgcgggg aggatggatt gtaaggatac aagattagaa gcgggcagca 20221 gtcttgaatg gtgccttagg aactgtagta ggggnntagt aataggacag tttggggata 20281 tatttggagg tggctgaact gccagaaaaa gaggtcattt ccagactttt ggggtgcagg 20341 agcccctgtg gactttccaa ggatctcacc gtgccagcct cccctagcag agccccctgt 20401 ctcccgccta ggagctgaaa atcctggtgg actacgacga gaaaggctac ctcctgcaga 20461 tcttcaccaa accggtgcag gaccggccca cgctcttcct ggaagtcatc cagcgccaca 20521 accaccaggt actgcttgtc cccggcaggc ccgaggggac aggcagctag cacactcggt 20581 tttctgggtc cttggctcat ctcctctctt tcccgccaca gggttttgga gccggcaact 20641 tcaactcact gttcaaggct ttcgaggagg agcagaacct gcggggtaac ctcaccaaca 20701 tggagaccaa tggggtggtg cccggcatgt aagccccgcc caccccacgg aggccacagc 20761 cacacagcca cgccccctga ttctggaact cgcccaactt ccctactggc tgctcccctt 20821 gggtcccgcc caccagcgga ctcggccccc aaggctccgc ccacactgac cacgcccctc 20881 ggcggggccg // LOCUS HSU29953 22484 bp DNA PRI 02-JAN-1996 DEFINITION Human pigment epithelium-derived factor gene, complete cds. ACCESSION U29953 NID g1144298 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 22484) AUTHORS Tombran-Tink,J, Mazuruk,K, Rodriguez,I., Kouri,R.E., Chung,D., Linker,T. and Chader,G.J. 
TITLE Cloning and molecular characterization of the human gene for the neurotrophic serpin PEDF: conservation, polymorphism and hereditary studies JOURNAL Unpublished REFERENCE 2 (bases 1 to 22484) AUTHORS Rodriguez,I., Mazuruk,K., Tombran-Tink,J. and Chader,G.J. TITLE Direct Submission JOURNAL Submitted (23-JUN-1995) Ignacio Rodriguez, LRCMB, NEI-NIH, 9000 Rockville Pike, Bldg. 6, Rm. 304, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..22484 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17p13.1" /chromosome="17" mRNA join(6652..6779,11573..11664,14528..14726,15707..15862, 16551..16754,19737..19879,21219..21429,21874..22251) /product="pigment epithelium-derived factor" 5'UTR join(6652..6779,11573..11580) exon 6652..6779 /number=1 exon 11573..11664 /number=2 CDS join(11581..11664,14528..14726,15707..15862,16551..16754, 19737..19879,21219..21429,21874..21965) /note="PEDF" /codon_start=1 /product="pigment epithelium-derived factor" /db_xref="PID:g1144299" /translation="MQALVLLLCIGALLGHSSCQNPASPPEEGSPDPDSTGALVEEED PFFKVPVNKLAAAVSNFGYDLYRVRSSMSPTTNVLLSPLSVATALSALSLGAEQRTES IIHRALYYDLISSPDIHGTYKELLDTVTAPQKNLKSASRIVFEKKLRIKSSFVAPLEK SYGTRPRVLTGNPRLDLQEINNWVQAQMKGKLARSTKEIPDEISILLLGVAHFKGQWV TKFDSRKTSLEDFYLDEERTVRVPMMSDPKAVLRYGLDSDLSCKIAQLPLTGSMSIIF FLPLKVTQNLTLIEESLTSEFIHDIDRELKTVQAVLTVPKLKLSYEGEVTKSLQEMKL QSLFDSPDFSKITGKPIKLTQGGTPGWL" exon 14528..14726 /number=3 exon 15707..15862 /number=4 exon 16551..16754 /number=5 exon 19737..19879 /number=6 exon 21219..21429 /number=7 exon 21874..22251 /number=8 3'UTR 21966..22251 BASE COUNT 5280 a 5709 c 6138 g 5347 t 10 others ORIGIN 1 gcggccgcag ggtggactgt gctgaggaac cctgggccca gcaggggtgg cagcccgcgc 61 agtgccacgt ttggcctctg gccgctcgcc aggcatcctc caccccgtgg tcccctctga 121 cctcgccagc cctcccccgg gacacctcca cgccagcctg gctctgctcc tggcttcttc 181 ttctctctat gcctcaggca gccggcaaca gggcggctca gaacagcgcc agcctcctgg 241 tttgggagaa gaactggcaa ttagggagtt tgtggagctt ctaattacac accagcccct 301 ctgccaggag ctggtgcccg 
ccagccgggg gcaggctgcc gggagtaccc agctccagct 361 ggagacagtc agtgcctgag gatttggggg aagcaggtgg ggaaaccttg gcacagggct 421 gacaccttcc tctgtgccag agcccaggag ctggggcagc gtgggtgacc atgtgggtgg 481 gcacgcttcc ctgctggggg tgcagggggt ccacgtggca gcggccacct ggagccctaa 541 tgtgcagcgg ttaagagcaa gcccctggaa gtcagagagg cctggcatgg agtcttgctt 601 cttgcaaacg agccgtgtgg agagagagat agtaaatcaa caaagggaaa tacatggtct 661 gtccgaggat gagctgccgg agagcaatgg tgaaagtgaa gtgggggagg gggcggggct 721 gggaggaaaa gccttgtgag aaggtgacac gagagcacgg ccttgaaggg gaagaaggag 781 ggcactatgg aggtcccggc gaagcgtggc ctggccgagg aacggcatgt gcagaggtcc 841 tgccgaggag ctcaagacaa gtaggggacg gtggggctgg agtggagaga gtgagtggga 901 ggaggagtag gagtcagaga ggagctcagg acagatcctt taggctctag ggacacgata 961 aacacagtgt tttttgtctt gtcaagtgtg tcctttttat ttttttgaaa gagtctcgct 1021 ctgtagccca ggctggagtg cagcggtgcg acctcggctc actgcaacct ctgcctcccg 1081 ggtccaagca attctcctgc ctcagcctcc cgagtagctg ggattacagg cacccgccac 1141 cacgcactgc taatttttgt attttagtag agaccgggtt ttgccatgtt ggtcaggctg 1201 gtctcgaact cctgacctca ggtgatccgc ccgcctcggc ctcccagagt ggtgtgagcc 1261 actatgccct gcagcacttg tcaagtcttt ctcagcgttc ccctcctctc cactgcagct 1321 cccagtgccc cagtctgggc ctcgtcttca cttcctggga tccctgacat tgcctgctag 1381 gctctccctg tctctggtct ggctgccttc actgtaacct ccacccagca ggtacctctt 1441 cagcacctcc catgaaccca gcagaatacc aagccctggg gatgcagcaa cgaacaggta 1501 gacgctgcac tccagcctgg gcgacagagc aagactccgc ctgaagaaaa aaaaaaggac 1561 caggccgggc gcggtggctc acgcctgtaa tcccagcact ttgggaggcc gaggtgggtg 1621 gatcatgagg tcaggagttc aagaccagcc tggccaaaat ggtgaaaccc cgtctctact 1681 gaaaaataca aaaattagct gggtgcagtg gcgggcgcct gtagtctcag ctactcagga 1741 ggctgaggca ggataattgc ttgaccccag gaggcagagg ttgcagtgaa ccgagatcac 1801 gccactgcac tccagcctgg gcgacagagc aagactctgc ctcaaaaaaa agaataaaaa 1861 taaaaaaaag gaccagatac agaaaacaga aggagacgta ctatgaagga aattggagag 1921 cttttgggat actgagtaac tcagggtggc ctttcccagg ggacatttag ctgagagata 1981 gacggtatga agacctgacc gttcagaaac aggggaagag 
gcagcagccc gggcaaaggc 2041 ctttggggca ggaaagggct tggatcactg gagaagcaga aagatggcca gtgtgaccag 2101 agtgtgacaa agtcagagaa aaccaggaag atggagctgg agacacaggc ggggccagat 2161 cacgagggtc ctcgcagacc agagcaaggg tttggatttt attccaagta tgaagggaag 2221 ctgctgaagt gtgttttcct ttacaatttg tagttgaaat ataatatgca aagtacacaa 2281 gtcttaacta tatgtaagct taatgaatgt ttccatgaac caaataccgc tgtgcaacca 2341 tcaccagctc aagagacgaa cccttctccc tcctcctgac tgccagtaac atagtggttc 2401 agctcaagaa acagaactct tctgacttcc cctaacatag cgggttttct tttttgtttt 2461 gttttttgtt gttttttaag agacaatgtc tttattattt ttattttttt ttatttttga 2521 gacggagtct tgctgtcgcc caggctggag tgcagtggtg cgatctcggc tcactgcagg 2581 ctctgccccc cggggttcat gccattctcc tgcctcagcc tccctagcag ctgggactac 2641 aggtgcccgc cacctcgccc ggctattttt ttgtattttt agtggagacg gggtttcacc 2701 gtgttagcca ggatggtctc gatctcctga cctcgtgatc cgcccacctc ggcctcccaa 2761 agtgctggga ttacaggcat gagccaccgc gcccagccaa gagacacggt cttgctctgt 2821 cgcccaggct ggatggagtg ccgtggtgcg atcacagctc gcggcagcct tgacatcctg 2881 ggctcaagca accttcctgc cttggcctcc caaatgttgg gattataggc atgagccact 2941 gtgcttggca tctattcatc tttaatgtca agcaggcaat tgaatatttg atcagggata 3001 gaattgtcta tttgggggta tgcagatgtg cttcatgtca tggaactggg ccgggcgcgg 3061 tggctcatgc ctataatccc agcactttgg gaggccgagg caggcggatc ataaggtcag 3121 gagatcgaga ccatccgggc caacacggtg aaaccccgtc tctactaaaa atacaaaaat 3181 taggcaggtg tggtggtgcg tgcctgtagt cccagctact cagggaggct gagacaggag 3241 aattgattga acctgggagg cagaggttgt agtgagccaa gatcgcgcca ctgcactcca 3301 gcctgggcga catgagcgag actccgtctc aaaaataaac aaaaaaaagt catggaattg 3361 atggaaattg cctaagggga gatgtagaag aaaaggggtc tcaggatcaa gccagcagag 3421 aaggcagaaa aggtaaggtg tgtgaggtgg cagaaaaagg gaagagtgtg gacagtgagg 3481 gtttcaagga ggaggaactg tctactgcct cctgccaagg acggaggtgt ccactgccag 3541 ttgacataag gtcacccatg aacttggtga caggaatttc agtggagaag tggccacaga 3601 cacaagtcta gaattgaaat gggagccgag gcagcgtaga caaaagagga aactgctcct 3661 tccagagcgg ctctgagcga gcaccgagaa atgggcagtg gctttagggg 
atgtagcgtc 3721 aaggaagtgt cttttaaaga agtcgggggc cgggcacggt ggctcacgcc tgtagtccca 3781 gcactttggg aggccgaggc aggcagatca cttgaggtca ggagttcgag accagcctgg 3841 ctaacacgat gaaaccccgt ctctactaaa aatacaaaaa attagctggg cacggtggct 3901 cgtgcctgta atcccagcac tttgggaggc agaggtgggc agatcacttg aggtcaggag 3961 tttgagacca gcctagccaa catggtgaaa ccccatctct actaaaacta caaaaattag 4021 ccgggagtgg tggcacgtgc ctgtaatccc agccagtcag gaggctgagg caggagaatc 4081 actggaatcc tggaggtgga ggtggcagtg agccgagatg gtacctctgt actccagcct 4141 gggggacaga gtgagactcc gtctcaaaaa aaaaagaagg tggggaagga tctttgaggg 4201 ccggacacgc tgaccctgca ggagaggaca cattcttcta acaggggtcg gacaaaagag 4261 aactcttctg tataatttat gattttaaga tttttattta ttattatttt ttatagaggc 4321 aagcattttt caccacgtca cccaggctgg tctccaactc ctgggctcaa gtgtgctggg 4381 attatagcca tgagtcacca cacctggccc agaaacttta ctaaggactt atttaaatga 4441 tttgcttatt tgtgaatagg tattttgttc acgtggttca caactcaaaa gcaacaaaaa 4501 gcacccagtg aaaagccttc ctctcattct gatttccagt cactggattc tactcttggg 4561 atgcagtgtt tttcatctct tttttgtatc cttttggaaa tagtattctg ctttaaaaag 4621 caaatacagg ccaggtatgg tggctcactc ctgtaatccc agcactttgg gaggccgagg 4681 caggtgatca cctaaggtca ggagttcaag accagcctgg ccaatatggt gaaaccctgt 4741 ctgtaccaaa acacaaaaac aaaaacaaaa acaaaaatta gccgggcgtg gtggcgtgct 4801 cctgtaatcc cagctactca ggaggctgag gcaggagaat cgcttgaacc tgggaggcag 4861 aggttgcagt gagccgagat tgtgccactg tactccagcc tgggccacag agcaaggttc 4921 catctcaaac aaaacaaaac aaaacaaaca aaaaaacaaa acaaaagcta atacaaacac 4981 atatacaata gacaaaactg taaatatttt attattttta ttttttttag tagagacagg 5041 gtttcaccat gttggccagg atggtctcaa actcctgacc tcaggtgatc cacccacctc 5101 agcctcccga tagttaggat tacaggcatg agccaccaca cccggcctaa aattgtaaac 5161 gttttagaag aaagtataga tgaatccctt cgtgatctcg gggaagaaga gattttttaa 5221 aaaagatacc aaaagaagca caaattataa aagaaaagat tgaaaatgtt ggtgttaaaa 5281 ttaaaaactt gttttaaaac aagcttgtgt aacccatgac ccacaggctg catgtggccc 5341 agaaaagctt tgactgcagc ccaacacaaa ttcgtaaact ttcctaaaac attatgagat 
5401 tttttttgag attttgtttt gttttgtttt ttgttttttt agctcattcg gtatcattaa 5461 tgttagcata ttttacgtgg ggcccaagac aattcttctt ccaatgtgtc tcaggggagc 5521 caaaagattg gacacccctg ccataaacat gaaaagacaa tggccgggca cggtggctca 5581 cgcctgtaat cccagcactt tgggaggctg aggggggcgg gatcacctga ggtcaggagt 5641 ttgagacaag cgtgaccaat gtggtgaaac cctgtctcta ctaaaaatac aaaaattagc 5701 cgggcatgct cgtgcacacc tatagtccca actactcagc agggtgaggc aggagaacct 5761 cttgaacccg ggaagcggag gttgcagtga gccgacattg cacccctgca ctccagcctg 5821 ggtgacagag tgagtctcca ctggaaaaaa aaaaaaaaga acagtgtgat acattgacct 5881 aaggtttaag aacatgcaaa ctgatactat atatcactta gggacaaaaa cttacatggt 5941 aaaagtaaaa agaaatgtac gaaaataata aaaatcaaat tcaagatggt ggttatggtg 6001 acgggaaaga actgaggcgg aaatataagg ttgtcactat attgagaaat ttttctatct 6061 ttttttcttt tttctttttt tgagacgggg tctcgctctg tcgcccagga tggagtgcag 6121 tggtgtgatc tcagctcact gcaacctccg cctcccaggt ttaagtgatt ctcctgcctc 6181 agactcccaa gtagctggga ctacaggtgc gcgccaacac acctgggtaa ttttgtttgt 6241 atttttagta gagatggggt ttcaccgtgt tgactaggct ggtctcgaac tcctgacctc 6301 aggtgatccc ccggcctcgg tctcccaaag tgctgggata acaagcgtga gccactgcgc 6361 ccagctttgt ttgcattttt aggtgagatg gggtttcacc acgttggcca ggctggtctt 6421 gaactcctga cctcaggtga tgcacctgcc tcagtctccc aaagtgctgg attacaggcg 6481 ttagcccctg cgcccggccc ctgaaggaaa atctaaagga agaggaaggt gtgcaaatgt 6541 gtgcgcctta ggcgtaatgg atggtggtgc agcagtgggt taaagttaac acgagacagt 6601 gatgcaatca cagaatccaa attgagtgca ggtcgcttta agaaaggagt agctgtaatc 6661 tgaagcctgc tggacgctgg attagaaggc agcaaaaaaa gctctgtgct ggctggagcc 6721 ccctcagtgt gcaggcttag agggactagg ctgggtgtgg agctgcagcg tatccacagg 6781 taaagcagct ccctggctgc tctgatgcca gggacggcgg gagaggctcc cctgggctgg 6841 ggggacaggg gagaggcagg ggcactccag ggagcagaaa agaggggtgc aagggagagg 6901 aaatgcggag acagcagccc ctgcaatttg ggcaaaaggg tgagtggatg agagagggca 6961 gagggagctg gggggacaag gccgaaggcc aggacccagt gatccccaaa tcccactgca 7021 ccgacggaag aggctggaaa ggcttttgaa tgaagtgagt gggaaacagc ggaggggcgg 7081 
tcatggggag gaaaggggag ctaagctgct gggtcgggtc tgagcagcac cccaagactg 7141 gagcccgagg caaggaggct cacgggagct gcttccacca agggcagtca ggaaggcggc 7201 cgccctgcag cccagccctg gcccctgctc cctcggctcc ctgctacttt ttcaaaatca 7261 gctggtgctg actgttaagg caatttccca gcaccaccaa accgctggcc tcggcgccct 7321 ggctgagggc tgggatggag gacagctggg tccttctagc cagcccccac ccactctctt 7381 tggctacatg agtcaaggct gggcgaccaa tgaggttgtg gcctccggca aacaatgacc 7441 actatttagg ccggcaggtg tatagggcgt gggggcccag ctgccagtgc tggagacaag 7501 ggctgtccga gatgaaccct ttctgctgcc tgccaagcca ctgggagggg taggtctcag 7561 caggattccc agaaaccccg cccctgtcca gcctaggccc cccacccggt gttagctaac 7621 ccaacgttag cccccaggtt ccgtggggtt ggggggcagg gagtcctatt cttggggctg 7681 ctgcttctgg ggtgtgggga agtgcaactc cacggcaccc tgggctgact cattcagctt 7741 ctaaagcttc aggaaacatt gtttggggct gggtcaccat gggtgggcca gagaggaccc 7801 ctcaatcccc tccggagagc caggggaggg ggaggtgccc ttccccatgc tatctccgag 7861 gcccactgcc atgtggctga aggctgtgcg gttctgggaa gagggggagg tggcggtgga 7921 ggctgtttgt ctcctaactg ggcttaatct gaaacacatg tattggcttg agttgatccg 7981 cctcacgtgg aggcaagatc acaaaagctt ctgtgtttct tgatgtgggc aattgtcaga 8041 aaataaggcc tgaccttggc ccagcaggga gggtatctac ctctccctga gccctccccc 8101 gcctgctagg acgagagcgg ggcttggata ctgccctttg gacaggatgg catcattgtc 8161 tgtggctgca gccagccagc ggtcgcctgc tcagcccatg agcaaccact gtggacaggg 8221 tattgcgtgt gtgctgaggg gcgtccatgc agacccccac gcttgccctc tcactgccct 8281 tgtagggttt tcaatcatct ctcctcttcc cttatccaga tggcttgaag tggaggattc 8341 agacttgccg ttaatactct gggtccctgt gtctagctcg gggccacctt tggacccatg 8401 tcccttccct gccaggctcc ctcacctcac ctcagcctac ccacattgtg acaatcatct 8461 accacctgat ctggggtttg ggcttagatt ctgtaggcac caagactaaa gtcgctcctt 8521 caagtccatt tgaattgtga ctttagtttc cttaaatact atgccaggat aatggccagg 8581 gatggtggct cacgcctgta ctcctggcac tttgggatgc tggtggatca cctgagatca 8641 ggattccagg ccagcctggc caacacggtg aaaccccatc tctactaaaa cataaaaatt 8701 aaccaggtgt ggtggcgggc acctgtaatc ccagctactc aggagactga ggcaggagaa 8761 ttgcttgaac 
ccgggaggtg gaagttgcac tgagctgaga tcgcgccact gcactttagc 8821 ctgggcgaca agagtgaaac tctgtctcaa aaacaaaaaa aactatgccg ggatgagcct 8881 gtctcctccc ttaatttctt acttgggcca gaggaactag aactaacaac ttctcttcta 8941 gccttgcctc ctgtgtacct cactgaattt ttggtctcta ataaaccagt ctgcagaggc 9001 tcaggggagg caggctcctg gcagctgggt ggggctggcc ccagccgggt ggagaccagc 9061 tgtaggcctg gatggtggtg aggcctctgt cttgcactgc agaaagcttt tcctgttgtc 9121 tacacgaaag ttttctccct gcatgtcagg gcagccacgt gcaagagcag ctggctggga 9181 acgcagaggt ctgcggctcg aggcggggtt tagaaagaaa accaggctgc ttcctgctgc 9241 ccgtcctgcc ttaagctgag taaactcaaa ggcaatcttc tttcatgcct cacgatattg 9301 tccagtggat tatctgattt aatttgaagg acgagagcca acaatcacac aacgtcctcc 9361 caaattttct gatccacttt gttctgggaa gtcaaaaagt gcgtgtgctg tgtgggtgga 9421 tgtttgtgta tataaatgga taatgaagga tgatgtgttg ggggccaggg caggggagac 9481 aacgctgttc agattctaca tttttttttc cttttttttt tttttttgag atggagtctt 9541 gctctgttgc ccagcctgga gtgcagtggc gcgatctcag ctcactgcaa cctccacttc 9601 ctggattcaa gtgattctcc tgccttagcc tcccaagtag ctgggattac aggcatgcgc 9661 caccacaccc ggctaatttt tgtattttta gtagagatgg ggtttctcca tgttggccag 9721 gatggtctca aactcctgac ctcaggtgat ctacccgcct cggcctctca aagtgctggg 9781 attacaggtt tgagccactg cgcctggcct tttttttttt ttttgagatg gagttttcac 9841 tcttgttgcc caggctggag tgcagtggtg cgatcttggc tcactgcaac ctccacctcc 9901 caagttcaag tgattctcca gccttagccc tccaagtagc tgggactaca ggtgtgtgcc 9961 accatgcctg gctattttat tttattttat tttatttatt tatttttgag actaagtctt 10021 gctctgttgc ccaggctgga gtgcagtggc ataatcggct cactgcaacc tctgcctccc 10081 aggttcaagt gattctcctg cctcagcctc ctgagtaact gggattacag gggcctgcca 10141 ccacgcctgg ctactttttg tatttttagt atagatgggg tttcaccatg ttggccaggc 10201 tggtctcgaa ctcctgacct caggctatcc gcctgcctca gcctcccaaa gtgctgggat 10261 tacaggcatg agccactgtg ctcggtagtt gttttatttt aatagtaggt tattttattt 10321 ccattttaca agagaaaaaa tggtgattta aagagctact aagacacagc actgagacca 10381 tgtgtgatgg catgcgcctg cagtcccagc tactcacgag gctgaggcag gaggatcaca 10441 tgaggtcagg 
agttccaggc tgtggagtgc tatggttgtg tagtgaatag ccactacact 10501 ccagcctggg cagcacagca agatcttgtc tcccaaaaaa aaaaaaaaaa aaaaatttca 10561 aatgtgaacc caggatctct gaccctaggc cctgcactcc taaccatggg aggaagagct 10621 cttgaaaggg aactgtggga gaagggaatg agctgccttg tgaggccaca gaagtccaaa 10681 gacagcttga gaatttggag ggacagcacg tgccggactg ggtgcctcta tgcttggtat 10741 ccggtgattc catggaggag acctgggttc tgccccattc tcctgggagg ggttgcccaa 10801 agtcttatca ccggagtggg tcagctgcct ccaggacaaa gctttagcat acacttgtgc 10861 tgggccatac tccacgtgga gaagccctgc tggggctggg gccccactgc tctggatctt 10921 taaaagctat tggttcaggg gccaggtgta atggctcaca cctataaccc tagcactttg 10981 ggaggctgaa gcaggtggat agcctgaggt caggagtttg agacaagcct gatgaacgtg 11041 gtgaaacccc atcgctatta aaatacaaaa aattagccgg gcatggtggc aggtgcctgt 11101 aattccagct acttgggagg ctgaggcggg agaatcgctt gaacccagga ggcggaggtt 11161 gcagtgagcc aagatcgctc cactgtactc cagcctgggc gacagagcca gactctgttt 11221 caaaaaataa aatataaata aataaataaa taaataaata aataaataaa agctttaggc 11281 ttaaaggagg gtcccctgac gcagacagtg gaacaaaagc acaagcttat ggtatgactg 11341 tgggccctga ggcaggggga ggggcgggag aaccttgctg ggagggatgg gccatcaagc 11401 tgagggtcca cttctggggg cctggagggg tgaggggtgg tcgctgcagg gggtggggga 11461 aagtgactag ccctgcccaa cccctgggtc ctggctgggg tggccaggaa ggggtagcgg 11521 ggcagtgcag tgtcggggga gagcggcttg ctgcctcgtt cttttcttgc aggccccagg 11581 atgcaggccc tggtgctact cctctgcatt ggagccctcc tcgggcacag cagctgccag 11641 aaccctgcca gccccccgga ggaggtcagt aggcaggcgg ggagggcgtg gtcagcattc 11701 cccgcccctc cttggcaggc agcacgggaa acaggacagg gaacccggac ccaggttcca 11761 ggccaggctt gggcctttat ttctctaggg ctggagtttc tccagcagca aaacagagag 11821 aaaatgtctt gccttgcctt tcaggggatg gagtagggac atgaataaga tcccaaaaga 11881 gtaaaaatct gaagcacttt taacaagtcc agggcaattc tcctgcctca gcttcccaag 11941 cagctgggat tacaggcatg caccaccaag cccggctcat tttgtatttt tagtagagac 12001 ggggtttctc catgttggtc aggctggtct cgaactcccg acctcaagtg attctcctgc 12061 ctcggcctcc caaagtgccg ggatgacagg tgtgagccac cgcacctggc caggatcttt 
12121 tctcattacc ttgtcttcct agtgggggct ccactgagca ggtcatgttc ccggacattt 12181 gttcggatac tgaccaggct gtggcaggga gtgagggtat ggagtgacct ctctcctgcc 12241 cagaaagggc gcagctgggt tcccaaggca gatacaggca catggaggga agcctgggcc 12301 atatgagtgt tatggggtga gtgttggcgg aggcccaccc ttgagggaca agagcagctg 12361 ggcatcttgg cgagagccct ggactttcgt gaggtcagag tatgaattct gcgtctccct 12421 cttcctagct ttgtgaccct agacaaccct tacctcagtc tttgcttcct tgcctatgaa 12481 atgggataaa aacacccatt ctacagggcc atgtggccac tcatttattt ctcatctacc 12541 aaacacctac tcgacagggg ctggcaatgg gcggaaataa aaactcagtt ctgccgggtg 12601 cggtggctca cacctgtaat cccagcagtg tgggaggcgg agcaggacga tcccttgaat 12661 ccaggagttt gagaccagca taggcaacat agtgagaccc ctgtctctac acaaaagcaa 12721 aaattaccag gcgtggtggc aagtgcttgt ggtactacct acttgggaag ctgaggtggg 12781 aggatcactt gagcccagga gattaagact gcagtgaggg gccgggcgcg gtggctcacg 12841 cctgtaatcc cagcactttg ggaggtggag gtgggtggat cacgaggtca ggagatcgag 12901 accatcctgg ctaacacggt gaaaccccgt ctctactaaa aatacaaaaa attagctggg 12961 tgtggtgggg ggcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggcgtg 13021 aacccgggag gtggaggttg cagtgagctg agctcgcacc actgcactcc agcctgggcg 13081 acagagtgag actccgtctc aaaaaaaaaa aaaaaaaaaa gaaagaaaga aaaactgagt 13141 tctttttttt aactttcttt ttttagagac agagtctcac tccatcaccc atgctggagt 13201 acagtggtgc gatcttggct cactgcaatc ttggcctcct gagttcaacc aattctcatg 13261 cctcagcctc ccaaatagct gggaccacag gcacgtgcca ccacgcccag ctaatttttt 13321 gggtattttt agtagagatg gggcctcacc atgttgctca ggttggtctg aaactcctga 13381 gctcaagtga tccatcttcc tcggcctgcc aaagtgctgg gattataggc ataagccact 13441 gcacctagct cccaattttt atatttatat ttatttttat ttacttattt attttttgag 13501 acagggtctc actctgtcac ccaggctgga gtacagtggc actatctcag ctcactgcaa 13561 cctctgcctc ctgggttcaa gcgaatctcg tgcctcagcc tcctgagtag ctgggattac 13621 aggcatgcac caccatgccc cgttaatttt tttgtatttt tagtagagac gggtttcacc 13681 gtgttgccca ggatggtctc gaactcctga cctcaagtga ttcacccacc tcagcctccc 13741 aaagtgctgg gattataggt gtgagccact cggctgatgg 
tttttaaaaa gtgggtcatg 13801 gggctgggcg cggtggctca tgcctgtaat cccagcactt tggtagaccg aggcgggtgg 13861 atcacaaggt caggagatcg agaccatcct gcctaacacg gtgaaacccc gtctctacta 13921 aaaatacaaa aaattaccca ggcatggtgg tgggcgcctg tagtcccagc tactcgggag 13981 gctgaggcag gagaatggcg tgaacctggg aggcggagct tgcagtgagc cgagatcacg 14041 ccaccgtact ccagcctgag cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaaag 14101 tgggtcatag gtttcggctt ataggtcaca agtgtttaaa cctggccatg aggccaggcg 14161 cagtggcgca tgcctgtaat cccagccatt tgggaggcta aggcaggaaa atcgcttgaa 14221 ccggggaggt ggaggttgca gtgagctgag atcgcgccac tgaactctag cctgggtgac 14281 acagtaagac tctgtctcaa ataaaaaaaa aaacagctga tctctcttct gcgctgtctc 14341 tccacagaga gctcatgcgt gatcagggag taaaactcat tcccgtttta ggccaaacac 14401 agaaaaatta ggaaggacag ccccaagggg ccagaaccac caccctacac aaagccgtga 14461 ggagacagtc cctgtgcatc tctgcgagtc cctgaactca aacccaagac ttcctgtctc 14521 ctgccagggc tccccagacc ccgacagcac aggggcgctg gtggaggagg aggatccttt 14581 cttcaaagtc cccgtgaaca agctggcagc ggctgtctcc aacttcggct atgacctgta 14641 ccgggtgcga tccagcatga gccccacgac caacgtgctc ctgtctcctc tcagtgtggc 14701 cacggccctc tcggccctct cgctgggtga gtgctcagat gcaggaagcc ccaggcagac 14761 ctggagaggc cccctgtggc ctctgcgtaa acgtggctga gtttattgac atttcagttc 14821 agcgaggggt gaagtagcac caggggcctg gcctgggggt cccagctgtg taagcaggag 14881 ctcaggggct gcacacacac gattccccag ctccccgaaa ggggctgggc accactgaca 14941 tggcgcttgg cctcagggtt cgcttattga cacagtgact tcaaggcaca ttcttgcatt 15001 ccttaaccaa gctggtgcta gcctaggttc ctgggatgta actgcaaaca agcaggtgtg 15061 ggcttgccct caccgaggac acagctgggt tcacagggga actaatacca gctcactaca 15121 gaatagtctt ttttttttnt ttttttnnnc tttctgagac ggagtctcgc tttgtcncca 15181 aggctggagt gcagtggtgt gatctcagct cactgcaacc tctgcctccc tggttcaagg 15241 aattctcctg cctcagcctc cagagtagct gggattacag gcacctgcca tcatgcccag 15301 ctaatttttg tatttttagt agagacgggg tttcaccatg ttgcctaggc tggtctcaaa 15361 ctcccgggct caagcgatcc acccgccttg gcctcccaaa gtgctgggat tacaggcgtg 15421 agccaccgcg cctggccaga 
ataatcttaa gggctatgat gggagaagta cagggactgg 15481 tacctctcac tccctcactc ccaccttcca ggcctgatgc ctttaaccta cttcaggaaa 15541 atctctaagg atgaaaattc cttggccacc tagattgtct tgaagatcag cctacttggg 15601 ctctcagcag acaaaaaaga tgagtatagt gtctgtgttc tgggaggggg cttgatttgg 15661 ggccctggtg tgcagttatc aacgtccaca tccttgtctc tggcaggagc ggagcagcga 15721 acagaatcca tcattcaccg ggctctctac tatgacttga tcagcagccc agacatccat 15781 ggtacctata aggagctcct tgacacggtc actgcccccc agaagaacct caagagtgcc 15841 tcccggatcg tctttgagaa gagtgagtcg cctttgcagc ccaagttgcc tgaggcatgt 15901 gggctccatg ctgcaggctg ggggggtctt tttttttttt ggggaaagac ggagtctcgc 15961 tctgttgccc aggttggagt gaagtggcgt gatctcggtt cactgaaacc cccacctccc 16021 gggttcacac catcctcctg cctcagcctc ccgagtagct gggactgcag gngcccagct 16081 aatctttntt gtatttttag cagagacggg gtttcaccgt gtttgccagg atagtctcga 16141 tctcctgacc tggtgttctg cccgcctcga cctcccaaag tgctgggatt acaggtgtga 16201 gccaccgcgc tcggcccgtt tctaaacaat agatcatgtg tgcccaggcc tggcctggca 16261 ctggtgtgga ggaagggccc gtgagcccaa agaggctcag aaagaggaag tgggctgcag 16321 gagacggtgg gaggggcagg gagggcagtg gcgcgatgtg gggaaatctg ctgcccccct 16381 ggccagtgcc tggggatgcc agcagaagtc ctggcaagtc acaggaagat gctggctggg 16441 aagtcagggc ctgctgagcg ctaaaccaga acccgagcct ggcaggctct caaagacggg 16501 atgcttgtcg tcgagtctca tacgctaacc tctgctccgc ctcttctcag agctgcgcat 16561 aaaatccagc tttgtggcac ctctggaaaa gtcatatggg accaggccca gagtcctgac 16621 gggcaaccct cgcttggacc tgcaagagat caacaactgg gtgcaggcgc agatgaaagg 16681 gaagctcgcc aggtccacaa aggaaattcc cgatgagatc agcattctcc ttctcggtgt 16741 ggcgcacttc aagggtgagc gcgtctccaa ttctttttca tttattttac tgtattttaa 16801 ctaattaatt aattcgatgg agtcttactc tgtagcccta actggagtgc agtggtgcga 16861 tctcagctca atgcaacctc cgcctcccag gttcaagcaa ttcttgtgcc tcagcctccc 16921 gagtagctgg gattacaggg atgtaccacc actcccggct aattttttgt atttaataga 16981 catggggttt caccatgttg gccaggctgg tctcgaactc ctgagctcag gtggtctgcc 17041 cgcctcagcc tcccaaagtg ctaggattac aagcttgagc caccacgccc agcccttttt 17101 
atttttaaat taagagacaa ggtgttgcca tgatgcccag gctggtctcg aactcctggg 17161 ctcaagtaat cctcccacct tggcctccca aagtgctggg attacaggca tgagccaccg 17221 cgcccggccc ttttacattt atttatttat tttttgagac agagtcttgc tctgtcaccc 17281 aggctggagt gcagtggcgc gatctcggct cactgcaagc tctgccttcc aggttcacac 17341 cattctcctg cctcgacctc ccgagtagct gggactacag gcgcccgcca ctgcgcccta 17401 ctaatttttt gtatttttag tagagacggg gtttcaccgt ggtctcgatc tcctgacctc 17461 gtgatccacc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc cactgcgccc 17521 ggccctttta catttatttt taaattaaga gacagggtgt cactatgatg ccgaggctgg 17581 tctcgaactc ctgagctgaa gtgatcctcc cacctcggcc tcccaaaatg ctgggattac 17641 catgtccaac tttccacttc ttgtttgacc aaggatggat ggcagacatc agaaggggct 17701 tggaaaggga ggtgtcaaag accttgccca gcatggagtc tgggtcacag ctgggggagg 17761 atctgggaac tgtgcttgcc tgaagcttac ctgcttgtca tcaaatccaa ggcaaggcgt 17821 gaatgtctat agagtgagag acttgtggag acagaagagc agagagggag gaagaatgaa 17881 cactgggtct gtttggggct ttcccagctt ttgagtcaga caagatttat ttatttattt 17941 aagatggagt ctcattctgt tgcccaggct ggagtgcagt ggtgccatct tggctcacta 18001 cagcctcccc acctcccagg ttcaagtgct tctcctgcct cagcctcccg agtagttggg 18061 attacaggcg cccgccacca cacccagcta atttttgtat tttcagtaga gatggggttt 18121 cgccatgctg gccaggctgt tctcgaaaac tcctgacctc agatgatcca cccgcctcgg 18181 cctcccacag tgctgggatt acaggcgtga gccactgcgc tggccaaatc agacaaggtt 18241 taaatcccag ctctgcctgt actagctgag gaactctgca cacatttcat aacctttctg 18301 ggcctacgtt ctcaccttta acgtgaggat aatatatcta cttcatagac acctttttat 18361 gttgtctcca agttttctaa cagctctagt tctgtaccca agacatggca ggtggccaac 18421 gacatccttc taggctgtgg tgatgtgttt ggagcttgtt ccacgggtct tgtgtggggc 18481 cagccctgtt cagataaggc cttgtggggt ggcctggggt agggggaggg gttgggcaaa 18541 ctctccctta aaacgctttg taaccatctg aggcaccagc aagagcggcc cccgagcctg 18601 gacaaaatcc aaacggcttc ctacttcaag cactgatgtc tagtgagtga aggaacagct 18661 ctgggtccag gatattatag gtcacattaa actaaagggg cttggccatc agctggcttc 18721 cagagcgtca gccagttact tcacctcttt ggctttggcc tgttttcagc 
tacaagagga 18781 cttaatccag aggacctcag aggtccttcc cagctcagac cttctttgac tgtctcccag 18841 agacactgct gtaggagtgc acaccagttt acttttcttt cttttgtttt tgagatggag 18901 tttcgctctt tttgcctagg ctggagtgct gtggtgtgat ctcagctcac tgcaacctct 18961 ggctcccagg ttcaagtgat tctcctgtct ctgcctcccg agtagctggg attacagaca 19021 cccaccactg cacccggcta gtttttgtat tttcagtaga gatggggttt cgccatgctg 19081 gccaggctgt tctcgaaaac tcctgacctc agatgatcca tccgccttgg cctcccaaag 19141 tgctgagatt acagatgtga ggcaccacac ccggccattt ttgtattttt agtagagacg 19201 gggttttgcc atgttggcca cgctggtctc aaactcctga cctcaagtga tctgcccacc 19261 ttggcctcct gaagggctgg gactacaggc gtgagtcacc gtgcccggcc atttttgtat 19321 ttttaggaca gcgttttttc atgttggcca ggctggtctc aaactcctga cctcaagtga 19381 tccacccacc ccggcctccc aatatgctgg gattccaggt gtgagttacc atgcccggct 19441 accactttac ttttcctgca ggctatcaca gaacgtgtac aatctagact ctaatcaacc 19501 aaatcaacgt cttgccatcg gagtttgctg gtgaagggca cttggggtcc tggaaataac 19561 tgtaggctcc aagccacaca cactgagata ggcctattcc ctgaggcctc agagcccctg 19621 acagctaagc tcccttgagt cgggcaattt tcaacaacgt gctctgggga cacagcatgg 19681 cgccactgtc tttctggtct cctggggctc agactatgtc atacacttct ttccagggca 19741 gtgggtaaca aagtttgact ccagaaagac ttccctcgag gatttctact tggatgaaga 19801 gaggaccgtg agggtcccca tgatgtcgga ccctaaggct gttttacgct atggcttgga 19861 ttcagatctc agctgcaagg tctgtgggga taggggcagg gtggggggtg gatggaggga 19921 gaggatagag aagcaaaaca gggtagtggg aataaaatga cctttgagat ccgacagctg 19981 tctacatgtc gcctgctgtg tgactttgag caggttaata acatgtctga gctttcctcc 20041 tcttaagatg gggcagggga tcgttaccaa cacttaccct cccagggttt gttgtaagga 20101 cgaataaggt aataggaaat gggccctcag cactgggcac ccacatgttt gttctcttga 20161 gactcctatt tctagaattt aaagccaaac tttgaaaaat aatgacaaac tccaaatcgt 20221 tggcatcttt tttttttttt gagacagtct cgctctgtcg gccaggctgg agtccagtgg 20281 cacgatctcg gctcaccaca acctccgccc ccgctgggtt aaagcgattc tcttgcctca 20341 gcctcctgag tagctgggat tacaggcgtg tgcctccatg cctggctaat tttatacaga 20401 cggggtttct ccatgttggt caggctggtc 
tcaaactccc aaactcaggt gatccgcctg 20461 cctcggtctc ccaaaacaca ggggattcca ggcatgagcc accacgcttg gccaatcgtt 20521 ggcattctaa ggctttcagt gtacctgact tcttttagtt ctaagtctgt aactgttaac 20581 ctttcttggg ccacggctat cacacggatc tctctgggaa tctgacgaca gtgcctcaaa 20641 cccgagggag caccgccagg tgtgcacaca cgtttctgtc aacgatttcg gaggactctt 20701 gggatccctg aacaccatct gttccatggg accttaggtt aagagcctct gttcaaagga 20761 ggcttttgct cttggtgggt ggatggggtg aagtctccaa gccctcttrc ggscccttcg 20821 gtattcctat nccccggttc tgccctgtct tagtccagtg ctctctattt aacaaatgag 20881 cagtaaatgt acaccgatgg actttgggag acaataaaga cctgatattc aattctagct 20941 ccttaaacca caggagaaca ttctttcagc agacaacttc agttggtatt aggccaaggt 21001 aagaaaggcc aacagcatcc ttttctgaag aaacctcagg agatggctct ctgccagaaa 21061 gctataacct ggaaggggaa ttgtaaaata gatgaggggc tggatgaagg acgagaccag 21121 ggccccgtca cgggagaggg aaggcagctc ctggctgtgt ctgtcccccg gcttttgggc 21181 tctgaaggac taaccacatg ctttctcact tgtctcagat tgcccagctg cccttgaccg 21241 gaagcatgag tatcatcttc ttcctgcccc tgaaagtgac ccagaatttg accttgatag 21301 aggagagcct cacctccgag ttcattcatg acatagaccg agaactgaag accgtgcagg 21361 cggtcctcac tgtccccaag ctgaagctga gttacgaagg cgaagtcacc aagtccctgc 21421 aggagatgag tatgtctgaa gaccctttcg ctcttggtgg gtggatgggg tggggcaggg 21481 tctttgggcc ttccactgtg ctaagcagaa cgcaagggct ccacaggctt gtaggggggc 21541 cgtggatgag tccttaatcc tcatcgtgcc agaagggaag gctgaactgc cttctctcat 21601 cagactcatt cctcagcctc acgagcagac ctccctgaca ggcgctcaca acactgcctc 21661 tcaagacgag tctgtctgac ctgttttctc atcttgacct aacttgctaa atgctcctgg 21721 gcaagtcact ccaccctcgg tcagctcaga cctcttcagg cctcagagaa agtcaacagt 21781 gctgcgccat cccagcttgc ttgcaaaggg atcccttggt tggggtgttg gggaaggcag 21841 ggttttaacg gaaatctctc tccatctcta cagagctgca atccttgttt gattcaccag 21901 actttagcaa gatcacaggc aaacccatca agctgactca aggtggaaca ccgggctggc 21961 tttgagtgga acgaggatgg ggcgggaacc acccccagcc cagggctgca gcctgcccac 22021 ctcaccttcc cgctggacta tcaccttaac cagcctttca tcttcgtact gagggacaca 22081 gacacagggg 
cccttctctt cattggcaag attctggacc ccaggggccc ctaatatccc 22141 agtttaatat tccaataccc tagaagaaaa cccgagggac agcagattcc acaggacacg 22201 aaggctgccc ctgtaaggtt tcaatgcata caataaaaga gctttatccc taacttctgt 22261 tacttcgttc ctcctcctat tttgagctat gcgaaatatc atatgaagag aaacagctct 22321 tgaggaattt ggtggtcctc tacttctagc ctggttttat ctaaacactg caggaagtca 22381 ccgttcataa gaactcttag ttacctgtgt tggataaggc acggacagct tctctgctct 22441 gggggtattt ctgtactagg atcagtgatc ctcccgggag ggcg // LOCUS HSU30787 4514 bp DNA PRI 17-MAY-1996 DEFINITION Human uroporphyrinogen decarboxylase (URO-D) gene, complete cds. ACCESSION U30787 X06048 NID g1322018 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 785 to 1143) AUTHORS Romana,M., Dubart,A., Beaupain,D., Chabret,C., Goossens,M. and Romeo,P.H. TITLE Structure of the gene for human uroporphyrinogen decarboxylase JOURNAL Nucleic Acids Res. 15 (18), 7343-7356 (1987) MEDLINE 88015599 REFERENCE 2 (bases 1 to 4514) AUTHORS Moran-Jimenez,M.J., Ged,C., Romana,M., Enriquez De Salamanca,R., Taieb,A., Topi,G., D'Alessandro,L. and de Verneuil,H. TITLE Uroporphyrinogen decarboxylase: complete human gene sequence and molecular study of three families with hepatoerythropoietic porphyria JOURNAL Am. J. Hum. Genet. 58 (4), 712-721 (1996) MEDLINE 96220222 REFERENCE 3 (bases 1 to 4514) AUTHORS Romana,M., Ged,C. and Verneuil,H. 
TITLE Direct Submission JOURNAL Submitted (30-JUN-1995) Cecile Ged, Biochimie Medicale et Moleculaire, University of Bordeaux II, 146 rue Leo Saignat, Bordeaux, France FEATURES Location/Qualifiers source 1..4514 /organism="Homo sapiens" /db_xref="taxon:9606" protein_bind 1028..1037 /bound_moiety="Sp1" TATA_signal 1066..1073 mRNA join(1089..1126,1748..1860,1976..2055,2132..2194, 2434..2631,2749..2910,3279..3416,3576..3676,3780..3846, 4179..4514) /gene="URO-D" /product="uroporphyrinogen decarboxylase" gene 1089..4514 /gene="URO-D" CDS join(1107..1126,1748..1860,1976..2055,2132..2194, 2434..2631,2749..2910,3279..3416,3576..3676,3780..3846, 4179..4340) /gene="URO-D" /codon_start=1 /product="uroporphyrinogen decarboxylase" /db_xref="PID:g1322019" /translation="MEANGLGPQGFPELKNDTFLRAAWGEETDYTPVWCMRQAGRYLP EFRETRAAQDFFSTCRSPEACCELTLQPLRRFLLDAAIIFSDILVVPQALGMEVTMVP GKGPSFPEPLREEQDLERLRDPEVVASELGYVFQAITLTRQRLAGRVPLIGFAGAPWT LMTYMVEGGGSSTMAQAKRWLYQRPQASHQLLRILTDALVPYLVGQVVAGAQALQLFE SHAGHLGPQLFNKFALPYIRDVAKQVKARLREAGLAPVPMIIFAKDGHFALEELAQAG YEVVGLDWTVAPKKARECVGKTVTLQGNLDPCALYASEEEIGQLVKQMLDDFGPHRYI ANLGHGLYPDMDPEHVGAFVDAVHKHSRLLRQN" BASE COUNT 984 a 1159 c 1197 g 1174 t ORIGIN 1 aagcttcgta agcacctctc gcggcacgaa agccagcgct gcctaggcgc cgcccggcgc 61 gaggctctca cctctgccaa gaagcgcacc ggcccagcag ctgccggggg gactccagca 121 ccgcgccggg ccatggaccc gccatgagtc agctggcgcg accgcggaca gagcttccca 181 ccacgccctt ccccgccttt ggccagcctt tgccgtatgt tctggactaa gcgcacccca 241 gctctcactg tattggactg tgtactccca cactcaacca tattacttat ctctgtgcca 301 ccctaaccca gccgaccaaa cccaagattg gtgattgcta cctgatcaat ctccctctct 361 ccatttcctt gtgactacca ttttatctct actgctacta ccctcattca agtcaccatt 421 ctagctagcc tgggtcattg ccaacagtca tttttctggt tcttcggcct gctgtttttc 481 ctcccactcc cagcgaatct gctggactcc ctatcctatg ggtggtgtga ttaaagtgtt 541 tgagacaatg gccccttccc ctgccactga caggagtctt gagtcattag ggttgagttc 601 tgtttgacac tcctaatccc aaggacactg gagatcatta ttcattttaa tgtgattgct 661 gatttctgtt tccccagtct tgtagctcct 
taaaggctgg ggtgtcttga gcagagctaa 721 cctctgcacc tactataggt ccaggctata gtatggacct ggctggataa gactgttggt 781 atcatagttg ggacttgcgc caagctccgg atacccagac tgtcagatga gaacaaattc 841 ctcatgtcac cgtaagatac atttacagcg gagttttctt ttgggccttt gttgtttcgt 901 cgctacagca aactttacgg tgaaaaaagg taggggtcta cggcagcagc agggcagccc 961 tggagctgtc gctggagtcc gatcatgtga tcttcaacat ggcgacgctc ttggttccct 1021 acagaaaggg gcggagcctg gactgggggg caggctcaga ttcaggttaa attgtggatt 1081 gagctcgcag ttacagacag ctgaccatgg aagcgaatgg gttggggtga gttctccaga 1141 gcacgcggtg tggctagccg ggcttctaat ttgagtcttc caactcagga ctctatccct 1201 ctactcccct ttccccaccc tggagaacct cccaacctga actccgttag ctggatcctg 1261 aatcctaaaa ccatggattt ttgagatgtt catcccaggg ccttaattca agggatgcct 1321 caggatttcc aaccaggatc ttcattctgg gaccatcaac tctgatccct ctttatcccc 1381 cagcctgggt atttctcagc ccctgaacca gcccagtgac atttcccggt ttctgaggct 1441 cactagttcg aagaccccca aactatcctt agtgggcctt cattccctcc ccccagtccc 1501 tctggttgct tcgagcttgg aagagtagag actaagtgga gggaagaggc cccagggcgg 1561 gcccttctgg agtttgtgca ctgataggca gagaggaggc ggaacgggcg gaaagccagg 1621 gtttgggagc tggcctggag gaggtaggat agcggtcctg gactgaatcg gccttatgaa 1681 cccgcgcttt ccccagccgt ccaacgtagc atactgacac ctacccccac ccccacctga 1741 tcgccagacc tcagggtttt ccggagctga agaatgacac attcctgcga gcagcctggg 1801 gagaggaaac agactacact cccgtttggt gcatgcgcca ggcaggccgt tacttaccag 1861 gtaagagtca gggtctggaa atctagataa aactccggag gagaaaagtt ttcgaggggc 1921 aggggagggc tctggagggc ctcaaggctg agccctgtct tccctctgta tgcagagttt 1981 agggaaaccc gggctgccca ggactttttc agcacgtgtc gctctcctga ggcctgctgt 2041 gaactgactc tgcaggtgag gggtccacaa aagagggaaa gatttatgcc ttcagtctgc 2101 cacctagcaa cctgtctcct gtttcctaca gccactgcgt cgcttccttc tggatgctgc 2161 catcattttc tccgacatcc ttgttgtacc ccaggtaccc actcaaacct gatcctagaa 2221 tataatccaa ggacgccttg aaaatccttc tatcagtcca gtcaaggttt acaataagca 2281 cttatcctaa ctggatcgag ggaaaaacta aggttgaaag aaatggagtt tggcagagtt 2341 ttattctcct tttccttcct cctggaatga gctgaacaga 
acctttcctc ctggattcca 2401 ttttgggaac ccagatgttt tctccccctc caggcactgg gcatggaggt gaccatggta 2461 cctggcaaag gacccagctt cccagagcca ttaagagaag agcaggacct agaacgccta 2521 cgggatccag aagtggtagc ctctgagcta ggctatgtgt tccaagccat cacccttacc 2581 cgacaacgac tggctggacg tgtgccgctg attggctttg ctggtgcccc agtaatgtgg 2641 gacagggcag ggactcgggg cgcggggaga tcactctgga aggtctgggg tagacaaaag 2701 gaagggtcag tctggcttct gtgacaccat ctttctatcc ttctctagtg gaccctgatg 2761 acatacatgg ttgagggtgg tggctcaagc accatggctc aggccaagcg ctggctctat 2821 cagagacctc aggctagtca ccagctgctt cgcatcctca ctgatgctct ggtcccatat 2881 ctggtaggac aagtggtggc tggtgcccag gtgagtcctg agagagagag aaataggctg 2941 ggatttggtc tgtaaggccg agaagcaaga gtgtcctaaa cctgagaggg caggggtctt 3001 aatgctaggg atgaaagaac cttggcctcc agtgatctag ctgagcagcc aagcccatcc 3061 tgacactgac agtggggctt aatgctctaa gtattcagac accaaagtta gtgctgggat 3121 ctgaggaaag taaatttttt tttttttaat tactgggttt ttagggtcag gcagtatcag 3181 ggattgaagt catttgggga aaattgaggt ggattttgta tgtgggggaa acttcctctt 3241 tgtgtgttac atatttttct tcaccatacc ctaactaggc attgcagctg tttgagtccc 3301 atgcagggca tcttggccca cagctcttca acaagtttgc actgccttac atccgtgatg 3361 tggccaagca agtgaaggcc aggttgcggg aggcaggcct ggcaccagtg cccatggtga 3421 ggattgggat gggttgagtg aaggtggtcc tgtggagctt tcaggctaag tcctgcatgg 3481 actggagtga ccactggagg gcagcagaag tacagtcaag aaagattagt ggttgtagca 3541 aggccctctg tagcctgaga tctgcttttt tctagatcat ctttgctaag gatgggcatt 3601 ttgccctgga ggagctggcc caagctggct atgaggtggt tgggcttgac tggacagtgg 3661 ccccaaagaa agcccggtaa gccatggaag ggtgaggcct tgaggttgag gtgggggtgt 3721 tggctggggg agctgccatg tatgcagtta ccagaacgtg gcgctggctt tgcttccagg 3781 gagtgtgtgg ggaagacggt gacattgcag ggcaacctgg acccctgtgc cttgtatgca 3841 tctgaggtaa cagccagggc ccctctgtgt gtcctgttac tgtgcactcc tgtggcctgt 3901 ggttgtatta ttctgtgtgc acttgttttt aatgtctgtc tgtccttttc ttctcatctg 3961 tacaccataa gccctagaaa gaccggactt tttgttgctg ttgttcattt gtgtttatgc 4021 ttcatgcctg ggtccatact agggatctat aaattttatt gaatgactga 
ataacactga 4081 gttagaagca tgcctaccat atgcgtttct actagtatat atagggagga caaaggcttg 4141 ctggtcctcc tgtagccagt gccctgttgg tcccccagga ggagatcggg cagttggtga 4201 agcagatgct ggatgacttt ggaccacatc gctacattgc caacttgggc catgggcttt 4261 atcctgacat ggacccagaa catgtgggcg cctttgtgga tgctgtgcat aaacactcac 4321 gtctgcttcg acagaactga ccgtatacct ttaccctcaa gtaccactaa cacagatgat 4381 tgatcgtttc caggacaata aaagtttcgg agttgaacct attgtgtagt tttgtttgtg 4441 aaagattgtc ccatatcctc agttcttctt agcctctgtc tccttccctg ggaccctctc 4501 atatcctctt atag // LOCUS HSU31929 8851 bp DNA PRI 07-JAN-1997 DEFINITION Human orphan nuclear receptor (DAX1) gene, complete cds. ACCESSION U31929 NID g1163076 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8851) AUTHORS Guo,W., Burris,T.P., Zhang,Y.H., Huang,B.L., Mason,J., Copeland,K.C., Kupfer,S.R., Pagon,R.A. and McCabe,E.R. TITLE Genomic sequence of the DAX1 gene: an orphan nuclear receptor responsible for X-linked adrenal hypoplasia congenita and hypogonadotropic hypogonadism JOURNAL J. Clin. Endocrinol. Metab. 81 (7), 2481-2486 (1996) MEDLINE 96272864 REFERENCE 2 (bases 1 to 8851) AUTHORS Guo,W. 
TITLE Direct Submission JOURNAL Submitted (20-JUL-1995) Weiwen Guo, Pediatrics, UCLA School of Medicine, 10833 Le Conte Avenue, Los Angeles, CA 90024-1752, USA FEATURES Location/Qualifiers source 1..8851 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp21" /chromosome="X" misc_feature 1..6 /note="EcoRI restriction site" repeat_region 63..86 /rpt_type=direct /rpt_unit=agg repeat_region 88..185 /rpt_type=direct /rpt_unit=ggaa misc_feature 1408..1416 /note="putative steroidogenic factor 1 (SF-1) response element" TATA_signal 1521..1525 gene 1580..6377 /gene="DAX1" CDS join(1580..2747,6133..6377) /gene="DAX1" /note="responsible for X-linked adrenal hypoplasia congenita and hypogonadotropic hypogonadism; allele: AHC" /codon_start=1 /product="orphan nuclear receptor" /db_xref="PID:g1163077" /translation="MAGQNHQWQGSILYNMLMSAKQTRAAPEAPETRLVDQCWGCSCG DEPGVGREGLLGGRNVALLYRCCFCGKDHPRQGSILYSMLTSAKQTYAAPKAPEATLG PCWGCSCGSDPGVGRAGLPGGRPVALLYRCCFCGEDHPRQGSILYSLLTSSKQTHVAP AAPEARPGGAWWDRSYFAQRPGGKEALPGGRATALLYRCCFCGEDHPQQGSTLYCVPT STNQAQAAPEERPRAPWWDTSSGALRPVALKSPQVVCEAASAGLLKTLRFVKYLPCFQ VLPLDQQLVLVRNCWASLLMLELAQDRLQFETVEVSEPSMLQKILTTRRRETGGNEPL PVPTLQHHLAPPAEARKVPSASQVQAIKCFLSKCWSLNISTKEYAYLKGTVLFNPDVP GLQCVKYIQGLQWGTQQILSEHTRMTHQGPHDRFIELNSTLFLLRFINANVIAELFFR PIIGTVSMDDMMLEMLCTKI" misc_feature 3423..3428 /gene="DAX1" /note="EcoRI restriction site" repeat_region 4334..4371 /rpt_type=direct /rpt_unit=ca misc_feature 4824..4829 /gene="DAX1" /note="EcoRI restriction site" polyA_signal one-of(6427..6432,6466..6471) misc_feature 8846..8851 /note="EcoRI restriction site" BASE COUNT 2452 a 1942 c 2094 g 2363 t ORIGIN 1 gaattccagg tcctggagaa gacgaaaaag agaaagaaag agagagagag aaggagtgag 61 agagggaggg agggagggag ggagggagga aggaaggaag gaaggaagga aggaaaggaa 121 ggaaggaagg aaggaaggaa ggaaaggaag gaaggaagga aggaaggaag gaaggaagga 181 aggaaaagaa acagcaaaaa aagaaagagg gaggatggga gggagggaaa aagtaaaaat 241 gattctgtat cagctggtat ataccaacac ccttccctgc cccatgtctt cacagctgtg 301 tggcaagtga 
agactaatgg atccaggctt cctgatgctt ctatttatca ttattcactt 361 aggaagggtg ggaaaagaaa tactaattac acacttacca atggaatact tttacaagga 421 tcaaaatttc tcactgcggc catgaaaaag aatgagagct ggcggccatc atgcttagca 481 aagtaatgca ggaacagaaa accaaatatc acatgttctc acttggaagt gggagctaaa 541 taaagagatc acctggacac taggagggga acaacagaca ctggaacctc cttgaaggtg 601 gagggtggga agagggagag aatgagaaaa aatacctatt ggatactatt acctgggtga 661 tgaaataatc tgtacaccaa acccccacga caagcaattc acttatataa caaacccgca 721 catgtactcc tgaacctaaa agttaaaaga aaaaaaaata tatactaaaa tgaaaacaat 781 tctcactgta acaatattat cccctcgtaa ttattatatt cctaagtttt aggcactttt 841 acatcctgct cgctgccccc agctctctta acacagcatc caggacatag tgggcgctta 901 taaatactga tggcattaaa ctgagcgctt atgatagcat atttagagca gtgcttttca 961 aacgtctagg tgcatgtgaa tccctgggga caccgttaaa atgcagattc agagttagaa 1021 agtctggttg gagcctgaga ttggacattt ccaccgagcc cccatatgat gcttgtctat 1081 gttctgtatt tcacaaggtc tcagaaaatg agacctccct atccatatac aaatataagt 1141 cacacaaact gtgataattt aatgaaagtt tacaaagagc ataggaagta gatgtttcct 1201 cttttcccct gccctcccaa taaagggaac aaattagatg cgagggttca atggaaagag 1261 ttgcaacagc atccaggcgc tcgctctcct ccggtcttcc tgagacaggg aaaggggtaa 1321 tgagaggaag gaggaaagtg tccaggagct cccacgctgc tgttcttcca tttccagctt 1381 ttaaagagca cccgcccctt cgaaccaccg aggtcatggg cgaacacacc ggagcgcaga 1441 ccgcgccccc ccgcacacac cgcccgcctc cgcgcccttg cccagaccga ggcggccgac 1501 gcgcctgcgt gcgcgctagg tataaatagg tcccaggagg cagccactgg gcagaactgg 1561 gctacgggcg ccgcgggcca tggcgggcca gaaccaccag tggcagggca gcatcctcta 1621 caacatgctt atgagcgcga agcaaacgcg cgcggctcct gaggctccag agacgcggct 1681 ggtggatcag tgttggggct gttcgtgcgg cgatgagccc ggggtgggca gagaggggct 1741 gctgggcggg cggaacgtgg cgctcctgta ccgctgctgc ttttgcggta aagaccaccc 1801 acggcagggc agcatcctct acagcatgct gacgagcgca aagcaaacgt acgcggcacc 1861 gaaggcgccc gaggcgacgc tgggtccgtg ctggggctgt tcgtgcggct ctgatcccgg 1921 ggtgggcaga gcggggcttc cgggtgggcg gcccgtggca ctcctgtacc gctgctgctt 1981 ttgtggtgaa gaccacccgc ggcagggcag 
catcctctac agcttgctca ctagctcaaa 2041 gcaaacgcac gtggctccgg cagcgcccga ggcacggcca gggggcgcgt ggtgggaccg 2101 ctcctacttc gcgcagaggc cagggggtaa agaggcgcta ccaggcgggc gggccacggc 2161 gcttctgtac cgctgctgct tttgcggtga agaccacccg cagcagggca gcaccctcta 2221 ctgcgtgccc acgagcacaa atcaagcgca ggcggctccg gaggagcggc cgagggcccc 2281 ctggtgggac acctcctctg gtgcgctgcg gccggtggcg ctcaagagtc cacaggtggt 2341 ctgcgaggca gcctcagcgg gcctgttgaa gacgctgcgc ttcgtcaagt acttgccctg 2401 cttccaggtg ctgcccctgg accagcagct ggtgctggtg cgcaactgct gggcgtccct 2461 gctcatgctt gagctggccc aggaccgctt gcagttcgag actgtggaag tctcggagcc 2521 cagcatgctg cagaagatcc tcaccaccag gcggcgggag accgggggca acgagccact 2581 gcccgtgccc acgctgcagc accatttggc accgccggcg gaggccagga aggtgccctc 2641 cgcctcccag gtccaagcca tcaagtgctt tctttccaaa tgctggagtc tgaacatcag 2701 taccaaggag tacgcctacc tcaaggggac cgtgctcttt aacccgggta agggtactgg 2761 ccttaggcgc cggcttttcc cagctcacaa aagcatcggg cagtgcctat ctaggggcgc 2821 gggcagtaac gagttttcag tgatcaggag agtgtcgggg caaaggtgaa gaaatcgtga 2881 ctaactcagc agagttgggg tggggagctc caggaaccac ttctgctggg tgggctgcta 2941 tcagaaactc agccaagagg aggggagttg tttgtttagg tttgtacttg gctctctaca 3001 catttcttac catagaaaag tttgtgcctt catgggaaat ggttattctt tcttctttag 3061 ctttctttta agtccagagc atatcttttt ctaaaaaaag tttttgctca ctgaagattc 3121 tgcaacatcg atttctgaaa ccccattctg aaatttacta gaaccttttt tggggggagg 3181 aaatctatta aggctagcaa aaaaggaaac atttgttaaa gggtcccatt gagagactat 3241 tccctaggga cacacaagac actagatgta gtgtttcatg ggcctgcctc tgaatggtta 3301 tttgtatggt cctaagcatc tggactatcc cttcctccaa atctctgtga ataatatcag 3361 gttggcaaag ggatagggaa aaaagacaca ttagggactg agagcaggag cactgtattt 3421 tagaattcaa agagacttta gagacctgct aatcaacccc ctttgctctg tagcctcccc 3481 ccaccccatt ttactgagta ggtagatgtg tcccaaggtc ccaagtaaat ggttggtgtg 3541 ttgtggcacc tgagggtcct gtcttctggg cccagtggga tattggctgc actaaacttt 3601 caggttgtct gttgggtgct ggacacactg tcccacagag tgcaaccaaa tcactggagg 3661 aacgtctccc tggacaggca ggcaccttgt cttgggttgt 
ttctcactat tagaatcatc 3721 ttcaaccagc gtcctttcaa aatgattgaa gtgggaacaa aattttccag gcaacttgag 3781 ttgggatagt ttttataatg ctccttttgt atcttttttc cctggtgctg gaatctctga 3841 ttaagtccct gccccctgcc tagcattttt ggaggggtgc atagatcagt atttgggtgg 3901 gataataatt tgttcaaacc tcacgtttca tgacgtagag acctcagctc accccagtgt 3961 ctattatatg ttaccttttt gcttatacca ttgttttcca aacttgcctg tgcataagaa 4021 gtacttgtgt gtgtgtggtt gggggtggga gggagtagat aaaagtcttt ctaggctcct 4081 gccttatata ttctgattca gtagctctgg cccagggtct gcaaatctgc attcataata 4141 agtgtcgcag atgattctta agattatttg ggaaaacagt cttcttattc cttccagcca 4201 gtgttactaa tcatccacta tgttcctcgt cctgttggga tataaaggtg agaggggctc 4261 agcttccaga aatttacatc ccattagttt cagtatcaac agtttccctt gactctgttt 4321 tatatacttt taacacacac acacacacac acacacacac acacacacac attctccctt 4381 attcttgatt gacttcccct aatcctctct caatatgagt ttacccagct aagctgagtg 4441 tgtgtgaaac acaaatgtcc ctatatttgg gagaaaggta atgcctatta agagtgtggc 4501 ctctctttgc tgtttatcct ttgtatgtcc catatataaa tttaaataca ttatatatat 4561 aggttcaaac ttaaggagat ataaaactgt ggagtccatt tgcattttct cttccctgac 4621 cttattttga aggccagttt gtgtgtctct gtgtgggggg gaaccacttg ggtggggaga 4681 aggtggaggg gtatattgtg aacacagaga aaagtaatgt ggcatacaca agatgaagtt 4741 ttcacaagcc atgaatctag acctagagaa gcttcagatt tctggtggag tttcatgcaa 4801 gcctcttcct actttcatcg atggaattca aacacaacaa aattgggtct taagcctgga 4861 tctcaagctg tagatagaag gggatgataa tgtgtggact agagtaatca gaactggaat 4921 atggattaaa atggttgctt tatttttgtt cttatttatg agatttgata ttcgcccata 4981 ataggagtgc ctctcgctta ttaccattat tattttttta aagaacactg gttctgctgc 5041 tctgtgatga ttggcatggt gcttttaaaa tacttttctg ttgaattttt taggaaaagg 5101 gaaggaaaat gattgcaatc atcattaaca tcatttaaac caagtgggtc aactgctaat 5161 tacttgaata caaagttaaa gaactttgga acgatattat catcatggaa ggatcaaaag 5221 ggaccttcaa aaagcaggcc tcctcttatt actttcctta taaaaactta catccactta 5281 aactttcctg catttatttc tttcactttt acttcggtat cacactatac tgcaaaaggt 5341 catgtgtttt ctattacaaa agttattctt gaatcatggt acaattcgtg 
atgttttcat 5401 gtttactttt taaactttta atgactataa gaattttcaa aaacttctta ctttaatatt 5461 ttagaaaaga gcctttttga acactcatag aatcattctt ttattttttt caaaatcttc 5521 cccaatacag aaatcatggg tgcaatgagc taagtggcca gtgttagctt tcaaattatt 5581 cgggcctcta gactaatatt ggcctgcagg caaataggta aaatgaaact gtgcgtcaaa 5641 gtttgtggct gtgaagttta ccaccattgg ctgaacgatc agtgtaatga tggaaagttc 5701 aaagaagaat ggtaaaaaca tttgtctcat ggcaacactt cttgtttatc atggcttgtt 5761 tctcataggt aaagtgccca cctcttgaaa tatcatgaac ttcttcatat ttgcatagaa 5821 agtgaactgg agtgatgatt cattaaaccg tattctactg tcatattaaa ttgtgtgcgt 5881 ttgttggctc atggccagtg tatttccaca cgtgtgcata gaaacatagt aaatattaag 5941 taagatcaca tgatgtttca ttcttggaca cgttgcttct gaaaacgtag actggtttgg 6001 ccttttaccc ttttaaccgg gaagctttgg gtcttgttta attgggatga aacagaaatg 6061 gctattttta aaaagctagc aaaggactct gtggtgagct gttttataac aaagagattt 6121 ttctccctcc agacgtgccg ggcctgcagt gcgtgaagta cattcaggga ctccagtggg 6181 gaactcagca aatactcagt gaacacacca ggatgacgca ccaagggccc catgacagat 6241 tcatcgaact taatagtacc cttttcctgc tgagattcat caatgccaat gtcattgctg 6301 aactgttctt caggcccatc atcggcacag tcagcatgga tgatatgatg ctggaaatgc 6361 tctgtacaaa gatataaagt catgtgggcc acacaagtgc agtagtgcag ttcaccatga 6421 gggaagaata aagagctgtg ggccaaagag tgtaaaatat tttaaaataa actttcttaa 6481 tatttttaca tgcagagtat ttgggtattc aattaaagaa ataattttat tccagacagc 6541 cacaaatttc tctgttccat agttaaagaa gacatttgcc aacaggtagc atagctctgt 6601 acatctttta aaaaaaaaat agcagggtac tagtataata agctattttc acaagtgtag 6661 caatttcatg gaacctgctc aaatcaaatt ggtccatttt gttataataa attttaaggc 6721 cttaactatt aacttgattg aaaaaagctt cagcagggtt taagtaaggc ctcccaaaac 6781 tattaaaacc tagccaacat aagcaaaaag tatctttgga agtaacaaca tgacattgaa 6841 tgtaattatg tgccagtatc agtgggaggt cactttgctc tatcttcatt tccacttccc 6901 tcagctgtat ggaagactgc gtttgcatcc aactggagtc aagaagcaat ctgtaattat 6961 aaagacatgc agaagtgaaa gaggagattt tctgttttat tgggggatct tttgggtaca 7021 cagttatgga tttaaagatc cacaaggaac atacattcta taatgagggg ttatagttcc 
7081 agaaagctca acatcattat aaaaccacta attacacttc atcttttaga tacattacgt 7141 ttgtaactgg tatattgaac aacatggcat gaaatcataa agcaacttta ttgttatctc 7201 ctgctggccc ctttaattat aatccataga actaattttt ctttgtgtcc agataaaatc 7261 aaggctacta tatgtctatt tgtttatggg caataaattg cctgttatta atgagtgtat 7321 ctagtctagg gtaataaccc attgcaggaa gtcttaaact ttctttcgtg cagatccctt 7381 tggcagtcta ttgaatccta tggacatttt ctcaaaatca ggctctcaga ggaatgcagt 7441 agggcgggaa gcaagcggcc atgattggtc acagcacaca cacacacaca cacacacaat 7501 ccccgagggc tcatgtggtc ccctagaaac aaagagttaa aaatgttagg gcaccccagc 7561 aggatatgca ttggttaggc agcccccagt gcggcctccc accgaggtgc ctccttcccc 7621 aaggtattca gtccagtgaa agtaattcag ttcagcccct ggttttagag gggctgcctg 7681 gggctgtcag tctcaaatag tctgtcagtg acactacttc atctgcctgg tgagtgtaga 7741 tggcatctgg acatgttctg cggggacatg cagtagaagc tggccctctg ctgccagctc 7801 atagtcacgc tgtcacgtgc catctaggct cagcccttgg tctgagtcca aagcgcctgc 7861 agcacttgca aaagcagcac aagagttttc ttcgaaaaga ggggaaagct tccagaagca 7921 gttccaaaaa gcaaaaatca ttaatgccct acttcctccc cgagccttcc caggtgtcag 7981 ggttgttttt cttgactgtt acaaaatcaa tgtgccccat tcaataatcc taacttcatg 8041 taatttttcc taatcacttc tcttttcacc tagaccttct ggaaggattg tgtgtaagag 8101 ggacagaggc attttcacag agtaacgttt taatacaact taaccagaaa gacacaggaa 8161 ctggtcaggg caaaatcctg attttcaata ttttttcaaa atgggtttac gtaccccccg 8221 ccccaagcct ttttccatga tcatttattt agtgaacacc tagcaaaaat cagtcctttt 8281 ctgaggacag ctgcattgga tgatccaaac tgccttaccc aagttgtgta caacaggaga 8341 aaggcaaggg aagtaaagtc aggctatcgg tccactgttg caaactgcaa ctaacctgga 8401 aagtggaaca caagtcaaca gccttgagac atccctgaga gtctgatcta tcaggggtgg 8461 gcagaggggt cacataccct gtggtgtgcc cttccctaaa ttgcagcctt tggctagaat 8521 caaaagttag gaatgcaatc aggctcttgt atcccgtatc aagaaatatc gagtccatca 8581 gcagctccat ttctaatggt cccatcagag aacaagcctt tttctctagg gcatctgcag 8641 gtgacccaga tggtcagcta gcatccatga tgttttatag ctgtgacccc acagggatgt 8701 cttaaatttg tggcagttcc agagactgag ttctgttcgt tgagcctgtg ccgacttgca 8761 
gttcagcaca gctcagtgct gaaattgaaa tagcctgtag ccggcagtat caggtttcat 8821 tcaggtccag ggaattagga tctgtgaatt c // LOCUS HSU32323 9286 bp DNA PRI 22-NOV-1996 DEFINITION Human interleukin-11 receptor alpha chain gene, complete cds. ACCESSION U32323 NID g975334 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9286) AUTHORS Van Leuven,F., Stas,L., Hilliker,C., Miyake,Y., Bilinski,P. and Gossler,A. TITLE Molecular cloning and characterization of the human interleukin-11 receptor alpha-chain gene, IL11RA, located on chromosome 9p13 JOURNAL Genomics 31 (1), 65-70 (1996) MEDLINE 96404003 REFERENCE 2 (bases 1 to 9286) AUTHORS Van Leuven,F. TITLE Direct Submission JOURNAL Submitted (26-JUL-1995) Human Genetics, K.U.Leuven, Campus Gasthuisberg ON06, Leuven, Belgium, B-3000 FEATURES Location/Qualifiers source 1..9286 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9p13" /chromosome="9" GC_signal 303..315 exon 802..862 /number=1 exon 2073..2172 /number=2 CDS join(2073..2172,2463..2523,3597..3766,3893..4007, 4161..4193,4279..4445,5376..5539,6595..6736,7110..7229, 7340..7436,7690..7772,8319..8335) /codon_start=1 /product="interleukin-11 receptor alpha chain" /db_xref="PID:g975335" /translation="MSSSCSGLSRVLVAVATALVSASSPCPQAWGPPGVQYGQPGRSV KLCCPGVTAGDPVSWFRDGEPKLLQGPDSGLGHELVLAQADSTDEGTYICQTLDGALG GTVTLQLGYPPARPVVSCQAADYENFSCTWSPSQISGLPTRYLTSYRKKTVLGADSQR RSPSTGPWPCPQDPLGAARCVVHGAEFWSQYRINVTEVNPLGASTRLLDVSLQSILRP DPPQGLRVESVPGYPRRLRASWTYPASWPCQPHFLLKFRLQYRPAQHPAWSTVEPAGL EEVITDAVAGLPHAVRVSARDFLDAGTWSTWSPEAWGTPSTGTIPKEIPAWGQLHTQP EVEPQVDSPAPPRPSLQPHPRLLDHRDSVEQVAVLASLGILSFLGLVAGALALGLWLR LRRGGKDGSPKPGFLASVIPVDRRPGAPNL" exon 2463..2523 /number=3 exon 3597..3766 /number=4 exon 3893..4007 /number=5 exon 4161..4193 /number=6 exon 4279..4445 /number=7 exon 5376..5539 /number=8 exon 6595..6736 /number=9 exon 7110..7229 /number=10 exon 7340..7436 
/number=11 exon 7690..7772 /number=12 exon 8319..8701 /number=13 polyA_signal 8696..8701 BASE COUNT 1948 a 2470 c 2536 g 2299 t 33 others ORIGIN 1 gagctccctg agagaggggg cttagaccat gatgaagagt tgcttttagg cggtgctgtg 61 agccctggta ggggctgtgt gcgtgtgtga ctgcgtgagt gtatgaatat gtgtaagtgt 121 gtctgagtct gaggcggtgt ggctgtgccg tgtgactgtg agtgaccgca tgtgagagtg 181 tatgagtctg agggtaactg tgggtcttgt gtatctggca ccaagggacc aaaggcattt 241 cctggtgctg gggcccaaag aatggagggg gaggtgagaa atggctggaa cttccagggg 301 agggggaact ggactctttg gcctctcctt atatccccaa cctgggcctc ctccctggaa 361 gcacagccca gcctgtaccc taagaacatt ggtgaggagt ggaaagaggg gagtttgtat 421 ctgattttca tttgattgtg tccacgtgtg tttcaaaggc atgccctttt gttaactgtg 481 cctcttgagt ctgtgtgtga atgtgactgt gtctggctgt gtgaaaacac agggcatctc 541 tcactgacag ggacataagt attagccacc ctcctttccc tcttgctggc cttggctgtg 601 tctgtggctt cctttccaga cttgggggag gcctcttccc ttcccagggc cttgggtggg 661 agggaaggag ggaggaggca ggagcaggga gggggcctgg ctaggggcgg cagagctcca 721 gggcttagag ggggcggggc agggaagggg aaggagggga tggctctggg gtcccagaac 781 tgagtggagt aggagacggg ggctgtagct ggtgagagga agtcctagag gctatggaca 841 ctctgctgct gggatcaccg aggtagggtg gggctaagca gggtaagtgt ggctgggtgg 901 ccaggggtga gccaactgga cactcagaca gacacacaga caggtggagg ccctgctctg 961 ctctactcct actcctgaag ggtccgtgct ctcctcttct ggtcttgttc ccagagccct 1021 ttccccgcct gccccttcgt tctccatcca cttcaccact gccttcctcc ttctgctccc 1081 ccttgtccag aatctctctc tcctagcatc ccccacctgc catatagaca cactgttagc 1141 tttttgagtt tccgctgacc ccaatccagt ctttccaaca cttcccacac tgccccatca 1201 cctcctagtc ctgatctctg agtatctctt ggtgctgtgg cccccatgcc tgtccctgca 1261 ggctcccagt tcctctcagc ctctcatccc cttccctccc ctcccctccc tgccaccctg 1321 tctccatttc catctgctgg ctcctggccc acccccagcc tctgcagcag cagggcatct 1381 ggatctgctt aactacacag ccccagcctg caccctagcc ccatccagct tcacaaactg 1441 gagaccaacc aagtgtcaag agccaggccc agctgagtgg cccaagtagc cagaccaagg 1501 agccaggttc aggcnagaag cctggcannn ngggcagggg tgggcctcag ggtgggagtg 1561 caggatgggc tcagatccat 
gatgacaccc ttcccccagg gtgataaggt ctgcctaggt 1621 taatcagagg cagtgataag ccctggacca ggtgggggta aataccagaa ttcccaacag 1681 ctggactgga ggggttaatg ggagtggctg agctggtgcc agtgcttggt cccaggggtg 1741 ggccccaagg gcagtggagg gggagttgct ggcacagtct gttgcctccg cttttgttct 1801 gggccctaag cccaggactg agatggaggg tntgaggggg tgtgtgtgtc cgtgtgtgtg 1861 tgtgtgtgtg tgtncncncg cacgcacatg caaagcactg ggtatacagt gggaaagggg 1921 acctcaggtc agttcccgca gtgatttcta acagccttac cccacttggt gcatcaattt 1981 tntcctagga agcctcagtt ttggagagga agagccaggc tttagctccc atctcagggg 2041 tgggggattt ttgactctac ctctccccac agatgagcag cagctgctca gggctgagca 2101 gggtcctggt ggccgtggct acagccctgg tgtctgcctc ctccccctgc ccccaggcct 2161 ggggcccccc aggtgagaag aaccctgctc tacaacctcc caactctgac ttgtgacttg 2221 ccaccctcac tgtggccccc ttcccatgcc taacttgttg acaccacccc tgctcctgtc 2281 atccaacctg ggtcctctct ctaccctgtt ctctgacctc tgtctccaga ttataacatg 2341 atgctttgat ccagtgaggc ccccaccacc caggttttaa cagtctggct tcttatctgt 2401 cattctcaca aagtggggag gggggtaaag gaagagcctt acctcagaag tgccctccac 2461 aggggtccag tatgggcagc cagggaggtc cgtgaagctg tgttgtcctg gagtgactgc 2521 cgggtaagtg ccccacctgc ctgttggtct gacactcatg accctttctt gactctgcct 2581 agtcctcatg tcccgcccct cccatccttg cccgctctgt ccgtaatcct cacctctcta 2641 tctgtcctaa ccgtctaact atccttattc aacagaaaat tcaactggta tgttttgagc 2701 atctgccaag tgccaggtct attctaggct ctgggataca gcagtgctaa tgccaggcaa 2761 ggtgcctgcc ttcaaagagc ttacatctta gtggggaaga caataagcaa aggagaatct 2821 tgccagatag taatacctgc agggcagaaa aaccaaagtg agggatgtcg aagaggacaa 2881 ctgagtggtt actttttttt tttttttttc cgagagtctc actctgttgc ccaggctgga 2941 gtgcagtggt gcaatctggg ctcactgcaa cctctgcctc ctgggttcaa gcaattctcc 3001 tgcctcagcc tcccaagtag ctgggattac aggtatcccc caccaggccc ggctaatttt 3061 tttttgtatt tttagtagag acggggtttt accatgttgg tcaggctttg aactcctgac 3121 ttcaagtaat ccacccacct cagcctccca aagtgctaga ttacaggcgt gagccaccgt 3181 gcctggcccg tggttacctt ttaattggta gttaggaaag gtctctctga ggagatggtg 3241 ttgaagctga tttctgaagg tcaggaagga 
ggcagccatg cacacattaa ggtataagcg 3301 ctcgtgacag agggagcagc tagtgcaaaa tgtcctgagg tgggaatgag ctcaataccc 3361 ccgaggaata gaaagaagac caatgtagct ggagttttca gtgggtgagg gggagagaga 3421 taggagctga agaaggaaag gaaggcaagg cctggctcac aaggggcttt gaaagccagt 3481 aaaaggaatt tgggttctat tgcaaatgca cttagaagcc aatggagagc tgtggcatgg 3541 tctgacttat gcttgacatc ttgtctttgt ccattgtcac ctcatctcac ttccagggac 3601 ccagtgtcct ggtttcggga tggggagcca aagctgctcc agggacctga ctctgggcta 3661 gggcatgaac tggtcctggc ccaggcagac agcactgatg agggcaccta catctgccag 3721 accctggatg gtgcacttgg gggcacagtg accctgcagc tgggctgtga gttggggagg 3781 gtggcactga tgacacatag ggatcctgag ggagtatggg acctaagtaa gccctctggg 3841 accaggagga cctgaggact gctcagtctc caggtgtacc ctctgcctct agaccctcca 3901 gcccgccctg ttgtctcctg ccaagcagcc gactatgaga acttctcttg cacttggagt 3961 cccagccaga tcagcggttt acccacccgc tacctcacct cctacaggtg tgtgtgtgat 4021 tgggtgtgga tgcctacaca tttgggtatg ctggtgcaat gtgggtgtgg gaggcaggga 4081 ggtaggttct gaggaaagcc tcagagaggg cagaaggccc tcttttcctt cctgacttca 4141 gatggccccc tccccaccag gaagaagaca gtcctaggag ctgatagcca gaggtaggac 4201 gtgagggagc atgtgggggc tccagctggg tcccacagta ggcattttca aagccaaaga 4261 ccacatcctc ccttctagga ggagtccatc cacagggccc tggccatgcc cacaggatcc 4321 cctaggggct gcccgctgtg ttgtccacgg ggctgagttc tggagccagt accggattaa 4381 tgtgactgag gtgaacccac tgggtgccag cacacgcctg ctggatgtga gcttgcagag 4441 catctgtgag tacccacccc agacctcccc cagaccccca tcacagagac cccaaatgga 4501 ctcagatctc tccactccca gagttcccag tttcctccca acctagactc catttcacac 4561 agagaaggcc ctgtggagcc tgagaaggca aagggatcag agcccaggag gcatggttat 4621 ttcagcacac actctagagg ctgctgctgc tacccacact gcctgattgg ctgcctagcc 4681 tcagagcagg ccctgtaccg aatgcagcca tatgtcatgt cctcccttgc tggcccctcc 4741 ttccttgacc tgtgcatggg agtgggacac taaaactcta gctggaattt gggccaattc 4801 atcttccttg gatgaaaata ggaattgctc ttttattcac tcaagaagca cttcaaaagt 4861 ccacccgcag atctcaggtg gtttcttttt ttctttctct tccttttttt gttgagatag 4921 ggtctcactt tgtgcccagg ctggatgaag ttcagtggtg 
tgatcttggc tcactgaagc 4981 ctgcaactcc tgggctcaag ctatcccctc acctcagcct cccaagtagc tgggattaca 5041 ggcatgtgcc accatgtctg gttaattttt aaaatttttt gtagagacag ggttttgcta 5101 cgttgcccac gctggtctca aactcgtggc atcaagtgat cctcccgcct cagtcttcca 5161 gagtgctgag attacaggca tngccaccac acnggcctca gcccatttca taatacagat 5221 gaccggtctg agtctaatgg atgatcaagt ttaagatttc ccctcccctc tcaggagtgt 5281 ctggctaagg ctcctttaaa cacacacttt gggaagtggt ggggaagtga tggagaccca 5341 tagcctaccc tgactttgtg tcttgatgcc cttagtgcgc cctgacccac cccagggcct 5401 gcgggtagag tcagtaccag gttacccccg acgcctgcga gccagctgga cataccctgc 5461 ctcctggccg tgccagcccc acttcctgct caagttccgt ttgcagtacc gtccggcgca 5521 gcatccagcc tggtccacgg tgaggcctgg agtgcgtccc aacccacggc tgtggtcctg 5581 tctctgattt cacgatcctg ggtgttctgt atagctttcc agtgctggca ggtactgaag 5641 accaacactt cctgtggcaa gggctttgta ctgggnntgg gtgagtggtg actgagagac 5701 tgaatgtctg tcctgtgggc ctatactatg aggtaaagca catctgtggg aagatagcaa 5761 tggctttaaa atactttttt ttgagaggag tctcgctctg tcaccttagg cctggaancc 5821 aatnncaatc tcgccttaac naacctcact cctggttcaa ggcaattctt ctggtctcag 5881 cctcccgagg taggctggga ctacaggtgn ngccgccaat gccttggcct taatttttgt 5941 atttttagta gagatggggt tcaccatatt gcccaggctg gtctcgaact catgacctca 6001 cgatctgccc acctcggcct cccaaagtgn tnnnnnnngt gtgagcnncg ggccggccta 6061 aaaataactc ttaggaaaag gagtgccttt cacctagagg cacattctcc tacaaacact 6121 catataccca gctgaggata gcagtgaatt tgtgtctcag gagatgttaa agattggctg 6181 gtatcaggac tagggagcag cagcatcaga taataactaa tatttactta gtgctgcctg 6241 cctgtggggt acctggctac actttttgca tacattgttt tgtttaggta gattcacctc 6301 tattttacaa atgaagaaac tgagtcacag agaggcccaa agggtcattc aatatgtggt 6361 tcaccaaaga tttgaaccca ggctgcctga tgccacagcc tctctaagta atcactactg 6421 ccgtcttaca aacactaggt gtagactcag ctccaccaga cacccaacat gagacctaga 6481 cagctccact aggctttagg atgagactgg gagcaagggg tgctctttgt agatggtggc 6541 tgggaaggcc ctgcacttac aagctgggta acagtgagtc atgtttaccc ccaggtggag 6601 ccagctggac tggaggaggt gatcacagat gctgtggctg ggctgcccca 
tgctgtacga 6661 gtcagtgccc gggactttct agatgctggc acctggagca cctggagccc ggaggcctgg 6721 ggaactccga gcactggtga gagacaaagc caaagaaaag ggcagaggcc ccatcccaaa 6781 gcatctatca gctgaaccag ttccaaaaga caggcaggtg gganatgccc tgcccacaca 6841 ggcacggcaa ttgctgtccc agacttcaga cttgtttgcc tacagccctg tcttagctta 6901 agtcctgccc cttggcctcc acccctgcag ttacaggctc cacttctaca agcttggagc 6961 ttgccatgct ccctaggagc tggccatgcc ctatcaggct cctcctccta ggtaccttga 7021 cttccaacct aggccttggc cttgagcaca ggacgtgacc cccgtcccca ccagcgtatg 7081 gacacttatt gggtcttgcc ttcctttagg gaccatacca aaggagatac cagcatgggg 7141 ccagctacac acgcagccag aggtggagcc tcaggtggac agccctgctc ctccaaggcc 7201 ctccctccaa ccacaccctc ggctacttgg tgagcttggg cagatgggca agacttgtaa 7261 aggactgaaa accccagaga aaaggggaca ccgagcttgg tcaggcttgc tctcagtgac 7321 atgtggccct ccccctcaga tcacagggac tctgtggagc aggtagctgt gctggcgtct 7381 ttgggaatcc tttctttcct gggactggtg gctggggccc tggcactggg gctctggtaa 7441 gtgactgcca ttggtccctc agcctctgat cctcacacat gctctgatgn ccatagacca 7501 cattcatctc cacccttcat gactgcccgn tgaacctgtc tgattctgga actacctccc 7561 catacctcca tcccccatgc cccacttgat tttaactgat tcctctcctg accctttact 7621 aataaaccct ttggcggaga ctgagataac ccacattgtt ggagagacag ctgcctttct 7681 atgccccagg ctgaggctga gacggggtgg gaaggatgga tccccaaagc ctgggttctt 7741 ggcctcagtg attccagtgg acaggcgtcc aggtgagtag gacatccaga agatttggac 7801 ttggagatgt ttgcccctat tttgagtgtc cagattaaga gctggctgcc ctagtcattt 7861 taaaacatgc tgggaatcca agttgggtct cctcatttta atgatgtcta ggctgagggc 7921 tgggcctttc attcttgagt ccctgggctc agagttgggt ctctttccct cctctcaagg 7981 gtactgagga aggaccccag gtggacttcc cttcagggta ttagtattag tactaatact 8041 cttccagggt attagactca gagctggata ttcatccttt tttttcaagg tgtcttggat 8101 cagtgctagg tcccctcctt cattcttaat gtgcctgggc ccagagctag gtccactccc 8161 gtatcttcag tgtgttgaag aggctccctc agagctgggc tcctctactc atcttcagag 8221 tgttaagctt agaactgagt cacctccctc attctcaggg tgtctgggcc aagatccggt 8281 cttactgtct ctcctgattt gcctcctgct tcttctagga gctccaaacc tgtagaggac 
8341 ccaggagggc ttcggcagat tccacctata attctgtctt gctggtgtgg atagaaacca 8401 ggcaggacag tagatcccta tggttggatc tcagctggaa gttctgtttg gagcccattt 8461 ctgtgagacc ctgtatttca aatttgcagc tgaaaggtgc ttgtacctct gatttcaccc 8521 cagagttgga gttctgctca aggaacgtgt gtaatgtgta catctgtgtc catgtgtgac 8581 catgtgtctg tgaaggccag ggaacatgta ttcctctgca tgcatgtatg taggtgcctg 8641 ggagtgtgtg tggtccttgc tctggccctt tcccttgcag ggttgtgcag gtgtgaataa 8701 agagaataag gaagttcttg aagttatact cagaaattat tatccaatgc cttgctttat 8761 tatttggcta ttggggcttc agcccatttt ccttagcatc ccaaaattca gcttgggcag 8821 agtcccatgg agctttctct cttggtgctc aaaccactgt gacaggctgg ggttctgggg 8881 gtggatgcag atgctgcgtt gagccaggtg aagcctgggt agagaaaggt ccaaagctat 8941 gaccatagct tggggatgag attagggtca tccccaatga ccctaattag gggttaggag 9001 caaggaccag atgaaaacct gagttggcaa cagaattagc agttagggct ggggttggga 9061 tcagcattag gggttgggac tgaggtcaga gtcaggggta tcaggggtgg gagctcacac 9121 gaaagcctgg aggtgacagt ccccgtcagc ctcctgcagt tccacctgga tgaccttcct 9181 cagtagcttg tctgagagtg gctttcggta gagctgagta cagcaggcag tgctgggtgg 9241 cagtaggaat gctaggggca acagggccat gtgttcagag ggatcc // LOCUS HSU32576 5950 bp DNA PRI 08-MAR-1996 DEFINITION Human apolipoprotein apoC-IV (APOC4) gene, complete cds. ACCESSION U32576 NID g975892 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5950) AUTHORS Allan,C.M., Walker,D., Segrest,J.P. and Taylor,J.M. TITLE Identification and characterization of a new human gene (APOC4) in the apolipoprotein E, C-I, and C-II gene locus JOURNAL Genomics 28 (2), 291-300 (1995) MEDLINE 96015060 REFERENCE 2 (bases 1 to 5950) AUTHORS Allan,C.M., Walker,D., Segrest,J.P. and Taylor,J.M. TITLE Direct Submission JOURNAL Submitted (27-JUL-1995) John M. Taylor, Cardiovascular Disease, Gladstone Institute of Cardiovascular Disease, P.O. 
Box 419100, San Francisco, CA 94141-9100, USA FEATURES Location/Qualifiers source 1..5950 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q" /clone_lib="P1 plasmid human genomic library from Genome Systems, Inc., St. Louis, MO" mRNA join(2340..2455,4851..4992,5242..5596) /gene="APOC4" exon 2340..2455 /gene="APOC4" gene 2340..5596 /gene="APOC4" CDS join(2380..2455,4851..4992,5242..5407) /gene="APOC4" /note="apolipoprotein" /codon_start=1 /product="apoC-IV" /db_xref="PID:g975893" /translation="MSLLRNRLQALPALCLCVLVLACIGACQPEAQEGTLSPPPKLKM SRWSLVRGRMKELLETVVNRTRDGWQWFWSPSTFRGFMQTYYDDHLRDLGPLTKAWFL ESKDSLLKKTHSLCPRLVCGDKDQG" intron 2456..4850 /gene="APOC4" exon 4851..4992 /gene="APOC4" intron 4993..5241 /gene="APOC4" exon 5242..5596 /gene="APOC4" BASE COUNT 1394 a 1451 c 1749 g 1356 t ORIGIN 1 gaattccagg ggacatctgt tgcgcaccta ctgtgctcac gctggggcct ctgggatgga 61 caggattctg ccaaggcaga catctgggtc aagacagtcc tgcacagttg ttcaggttgt 121 ggccaaggtt gcgtttgcag atttgccatg taaaaataca ggatgctcag ttacatttga 181 atttcagatt aatagcaaaa aaaacttttt ggtataattc tgaaatattt catgggacat 241 atttatacta aaacgtcatg cactgttgat ttgaaattca aatgtaattg ggcctcctct 301 atttcgtctg gcaagcgtag acaaaagaat ccagtccagg ccaggcgcag tggctcaagc 361 ctgtaatccc agcactttgg gaggccgagg cgggcggatc acgaggtcag gagatcgaga 421 ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttagctgggc 481 gtggtggcgg gtgcctgtag tcccagctac tcgggaggct gaggcaggag aatggcgtga 541 acctgggagg cggagcttgc agtgagccga gatcgcgcca cagcactcca gcctgggcga 601 cagagccaga ctctgtctca aaaaaaaaaa aaagaatcca gtccatagtc ccctgagcca 661 tgtgccctgg ggtgcagctg ggtccttcag gagaaaaatg ctctatttct ggcactggga 721 ccgagcctga tgtgggtttt ttgttggttt ttgttgttgt tattgttttt gagacaaggt 781 ctcgctccac cacacccggc taatttttgt atttttagta gagacggggt ttcactacgt 841 tggccaggct ggtcttgaac tcctgacctc aagtgagccg cctgcctcgg cctcccaaag 901 tgctgggatt acaggtggga gccaccgccc tggccctggg cctgatgttg atgacctcct 961 actatgtgca cctgcagctc tcctgcatag gcctcagccg tcctgcatga 
ggacactggg 1021 aggcaggtgc tccctatcaa ccccgtgtta cagttgaaca aactgagccc cagaaaagaa 1081 aacgtatttg cccaggtcac acggtgaaga agtgagggat tcgaagccca ggtccatctg 1141 aagccagagt cacccagagg agaaagagtt ggaattgaga actcaaggaa tgcttggaag 1201 tgatcgggct cgagcccacc taggaagaaa cagaggctgg agacatgaga ctgtgttgct 1261 atttcctctc atcaaccctt gggccctatt gaggccctac cacaagcctg gccctgcagc 1321 ccagtgacta ggagaaatta gacacaagat aataataaca gcaatgatct tttttttttt 1381 tctgagacgg agtcttgctc tttcgcccag gctggactgc agtggcgcga tctcggctca 1441 atgcaagctc cacctcccag gttcacgcca ttctcctgcc tcagcctccc gagtagctag 1501 gactacaggc gcctgccacc acgcctggct aatttttcat atttttagta gagatggggt 1561 ttcaccgtgt tagccaagat ggtctcaatc tcctgacctc gtgatccgcc tgcctcggcc 1621 tcccaaagtg ttggggttac aggcatgagc caccgcgcct ggccaacagc aatgatcttt 1681 gagcacctat attgccagtc tccacggtaa gagctttctt cattttttgt tttgttttgt 1741 ttcaagacag agtcttgctc tgtcacccag gctggagtgc agtggtgtga tcgcggctca 1801 ctgcagcctt cacttcccgg gttcaagcca ttctcctgcc tcagcctccc aagtagctgg 1861 gattacaggc acgcatcact acttctggct aatttttgta tttttagtag ggacagggtt 1921 tttcaccatg ttggccaggt tggtctcaaa ctcctggcct catatgatct gcccacctcg 1981 gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cctttctttg tatttgttca 2041 agtaatatac tgaaatatgt actgtgcctc ccactttatg gaggaggaaa ctgaggccag 2101 caaatgaggc tgtcatggga ggtggagaca ggatttgaac ctgcctcagt gcaggaggct 2161 caagagcctc tgtcttctct cagggcactg tgtgggaggg tgagaaggag ggaggcccac 2221 agaggcatga cctctgattg ccactgtcac ctgggccctg ctctctgaag tctctgccaa 2281 gcggggaggt ggccggggga gggccctgct ctgtgcagcc tcccctcccc cggcccgcag 2341 agttgagcac agagggacag aggcacggaa cccccagaaa tgtccctcct cagaaacagg 2401 ctccaggccc tgcctgccct gtgcctctgc gtgctggtcc tggcctgcat tgggggtgag 2461 aagaagtggg tggagggatg tggggcccac acctggtggg tgtgagtgtg gctgtgtgtc 2521 ctgtggctct gtagccacgt gagacatgag tacggagtgt gtgcgtttca tggcgtgcgt 2581 atgcatgtgc gtgtcgggga gtgtgtgtgt cggtggctga gagtgaagtg tgaatgtcac 2641 attggtacaa actgggatca tctgtgtgtg tgcacgtgcg tgcgtggaag tgggagtatg 
2701 cagtcgtggt aaaaaagtgc atgtctgtgt gcatatgtgt atttgtgtgc acctgtctct 2761 ctgtggggta tgtgtgtgca aaatatttga gtgtgtggac atgtgtgagg gggtgagtgt 2821 gtgctggtgt gtacgtctgt gttttgcata tgcatttttt tttttttttt ttagacggag 2881 tctcactctg tcacccaggc tggagtgcag tggtagcagt ggtgcgatct tggctcactg 2941 catcatccgc ctacccgttt caagggattc tcctgcctca gtcttcagag tatttgggac 3001 tacagacaca cgccaccatg cctggctaat tttttttttt tgagacggag tctcgctctg 3061 ttacccaggc tggagtgcag tggcgtgatc ttggctcact gcaagctccg cctcccgggt 3121 tcacgccatt ctcctgcctc agcctcccga gtagctggga ctacaggagc ccaccaccac 3181 gcctggctaa ttttttgtat ttttactaga gacggggttt cgccgtgtta gccaggatgg 3241 tctccatatc ctgacctcgt gatccgcctg cctcggcctt ccaaagtgct aggattatag 3301 gcgtgagcca ctgcgcctgg ccaatgcctg gctaattttt ttatattttt ggtagagaca 3361 gggttttgcc atgttgccca ggctggtctt gaaatcctga cctcaggtga tccgcccgcc 3421 ttggcctccc aaagtgctgg gattacaggc atgagccacc acgcccggcc atgtacttta 3481 tgttaaaatg ggatcatatt ctagatcagc attatccagt agaaatttaa atttttaata 3541 cagggccagg cacggtggct catgcctgta atcccagcac tttcggaggc cgaggcgggt 3601 ggatcgcaag gtcaggagat ttgagatcat cctggctaac agatgggtaa aaacccatct 3661 ctactaaaaa tacaaaaaat tagccatgca tggtggcatg cgcctgtagt cccagctact 3721 cgggaggctg aggccggaga atcacttgaa cccgggaggc agaggttgca gtgagccgag 3781 atcgcgccac tgcattccaa cctgggtgac agagcgagac tccgtctgaa aaaaaaaaaa 3841 aatttaacac gtatgtagac aatgtgcaag gcaccattcc atgtgcatcg tatgtagtaa 3901 ctcttaattc tcacgataac cctgaggtag atattattac cccgttctac aaaaggagaa 3961 acagtcctgg ggagacagga taagtcaccg gccaaggcac acagctagct acatgtggcc 4021 cccgcgtgac ggctggtctc tgtaggcgag gctttgtcca gatgcgtggg tagaaggtct 4081 ggcccggaaa gaggaactga cagcaaggct aagccaatgt ctgcccctgg gggcagaaag 4141 tcacctcctg ctctccctcc actgtccaca gaggtagctc agacagggtg ggggtcacag 4201 gagaacgaag ggagaagggg gtagttcctg ggcagcaaaa tcaggtggtg aagggaggca 4261 tcagaggatg gcaattagag aggccattag aggggaacca caggcagaca gggtgacagg 4321 agggactact gacacaaggt gaagagatgg cccagccgga cggggtggct cacatctgta 4381 
atcccagcat tttgggagcc cgaggtgggt ggatcacttg aggtcaggag ttcgaggccc 4441 caacatggca aaaccccatc tcttctaaaa atacaaaaat tagccgggca tgatggcaga 4501 tgcctgtaat ccctgctact cgggaggctg aggcaggaaa attgcctgaa tccaggaggt 4561 ggaggttgca atgagacgag atcatgacac tgcactccac cctgggcaac agagcaagag 4621 actgactctg tctcataaaa aaaaagaaaa aagaaaaaaa aaaagagatg gctgatggtt 4681 aaagaggggt tagcggtcag gggacacata agggtaaagg caggaggcaa gaggactggc 4741 agggggctgc ccctgggcca ccgggagcga cacaggatga gcatggaggg aaagggagaa 4801 ggggattcta gggtcccagc ctacccaagt tgccctctgg ttccacctag catgccagcc 4861 agaggcccag gaaggaaccc tgagcccccc accaaagcta aagatgagtc gctggagcct 4921 ggtgaggggc aggatgaagg agctgctgga gacagtggtg aacaggacca gagacgggtg 4981 gcaatggttc tggtgagggt gtgctggcct gggtggtggg aggggactcc tgggtctgag 5041 ggaggagggg ctggggcctg gacccctgag tctcagggag gaggaaaggg tgggagtggg 5101 gctgtgaccc ctaggtctgg gaggagtgga gggttagagc tgagagcagg aactcctagg 5161 tcacagagag gagcggataa atggggcaga gaacacctgg ggagagctgg ggcctccact 5221 gtgatgtcct ctctcctgta ggagcccgag caccttccgg ggcttcatgc agacctacta 5281 tgacgaccac ctgagggacc tgggtccgct caccaaggcc tggttcctcg aatccaaaga 5341 cagcctcttg aagaagaccc acagcctgtg ccccaggctt gtctgtgggg acaaggacca 5401 gggttaaaat gttcataaaa gccaggtgtg gttgtggcgg gtgcctgtag tcccagctac 5461 tcaggaggct gaggtaggat gatggcttga gcccaggagt tcgagaccag cctgggcaac 5521 acagcgagat ctcttggggg taaaacaaaa agaaaaaaaa aagttcatac ttctccaata 5581 aataaagtct cacctgtgtc cctgtctgga tccttcccca gtgtggccag aaaaaaaccc 5641 accccactgc ctcccaggaa tcaatgagta gaagaggtga cacctgatgg ggaaggaaga 5701 gtagggaggt cgggaagggt atcaaggaat aacaccctat tgtgggcttg cggagaatgg 5761 gggacttcaa ggcgtgtcag tttcaggagg gtgagggcag gagcgtgggt ggagtcagca 5821 ggtccccatg atggccctca ctgagagctt cgcccttgtc tcctacaagc tctgactcca 5881 ttcccagtgg gcacccagca cctccaaccc ctccacagcc cccaacccag cctctgtcgg 5941 aggcgaattc // LOCUS HSU37022 4233 bp DNA PRI 05-JUN-1996 DEFINITION Human cyclin-dependent kinase 4 (CDK4) gene, complete cds. 
ACCESSION U37022 NID g1353415 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4233) AUTHORS Zuo,L., Weger,J., Yang,Q., Goldstein,A.M., Tucker,M.A., Walker,G.J., Hayward,N. and Dracopoli,N.C. TITLE Germline mutations in the p16INK4a binding domain of CDK4 in familial melanoma JOURNAL Nature Genet. 12 (1), 97-99 (1996) MEDLINE 96122049 REFERENCE 2 (bases 1 to 4233) AUTHORS Dracopoli,N.C. TITLE Direct Submission JOURNAL Submitted (25-SEP-1995) Nicholas C. Dracopoli, Sequana Therapeutics Inc., 11099 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..4233 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="24C7" /map="12q13" /chromosome="12" mRNA join(85..291,729..965,1123..1258,1376..1543,1701..1810, 2949..2999,3135..3270,3836..4233) /gene="CDK4" /product="cyclin-dependent kinase 4" exon 85..291 /gene="CDK4" gene 85..4233 /gene="CDK4" exon 729..965 /gene="CDK4" CDS join(748..965,1123..1258,1376..1543,1701..1810,2949..2999, 3135..3270,3836..3928) /gene="CDK4" /codon_start=1 /product="cyclin-dependent kinase 4" /db_xref="PID:g1353416" /translation="MATSRYEPVAEIGVGAYGTVYKARDPHSGHFVALKSVRVPNGGG GGGGLPISTVREVALLRRLEAFEHPNVVRLMDVCATSRTDREIKVTLVFEHVDQDLRT YLDKAPPPGLPAETIKDLMRQFLRGLDFLHANCIVHRDLKPENILVTSGGTVKLADFG LARIYSYQMALTPVVVTLWYRAPEVLLQSTYATPVDMWSVGCIFAEMFRRKPLFCGNS EADQLGKIFDLIGLPPEDDWPRDVSLPRGAFPPRGPRPVQSVVPEMEESGAQLLLEML TFNPHKRISAFRALQHSYLHKDEGNPE" exon 1123..1258 /gene="CDK4" exon 1376..1543 /gene="CDK4" exon 1701..1810 /gene="CDK4" exon 2949..2999 /gene="CDK4" exon 3135..3270 /gene="CDK4" exon 3836..4233 /gene="CDK4" BASE COUNT 945 a 1050 c 1101 g 1124 t 13 others ORIGIN 1 ccctcctccc agtcgaagca cctcctgtcc gcccctcagc gcatgggtgg cggtcacgtg 61 cccagaacgt ccggcgttcg ccccgccctc ccagtttccg cgcgcctctt tggcagctgg 121 tcacatggtg agggtggggg tgagggggcc tctctagctt gcggcctgtg tctatggtcg 181 
ggccctctgc gtccagctgc tccggaccga gctcgggtgt atggggccgt aggaaccggc 241 tccggggccc cgataacggg ccgcccccac agcaccccgg gctggcgtga ggtaagtgca 301 gtcccttccc aggaatgaga accagtgccc gcccccctca cagctttcca cgcgttcgtt 361 tcgcgagctg gttatggaag ggtcgctcaa gggcgggaag tggggccttt gtggtcatgg 421 gaaagtataa ttttagggac tgaggtgtag gatcttcgat gcaaggcatg tgtcatgtgt 481 gatctttgtg cggggcgcga ttgtcccaaa ggaaaaagcg ttttctattg cagggcctca 541 cgtggctgga ggggttggta ttgagtcatt gtgttatctc tggggccggc cccaaggaag 601 actgggagcg ggggatggga tgctggtggt gttctttgcg cttttttttt gggagtccct 661 ttgttgctgc aggtcatacc atcctaactc tgtaagcgac ttttggtgat aggagtctgt 721 gattgtaggg tctcccttga tctgagaatg gctacctctc gatatgagcc agtggctgaa 781 attggtgtcg gtgcctatgg gacagtgtac aaggcccgtg atccccacag tggccacttt 841 gtggccctca agagtgtgag agtccccaat ggaggaggag gtggaggagg ccttcccatc 901 agcacagttc gtgaggtggc tttactgagg cgactggagg cttttgagca tcccaatgtt 961 gtccggtgag aaggtggtgg agggttgggc gtggggagta aagggaaaag acagcctata 1021 ggtggggtgt gatgatctgt agagaagtgg ggaccctgag gaaataatga gaggccatgt 1081 tgggttaaag gggattgaaa agtgagcatt tactctggtc aggctgatgg acgtctgtgc 1141 cacatcccga actgaccggg agatcaaggt aaccctggtg tttgagcatg tagaccagga 1201 cctaaggaca tatctggaca aggcaccccc accaggcttg ccagccgaaa cgatcaaggt 1261 gagtggggtt ggtaggcatt gagaggtgga ttgggacctt tgtagtagaa ccttctggga 1321 tttcaggtat ggtgcctagt ttccagtgca tctgtacctc cccctttgaa actaggatct 1381 gatgcgccag tttctaagag gcctagattt ccttcatgcc aattgcatcg ttcaccgaga 1441 tctgaagcca gagaacattc tggtgacaag tggtggaaca gtcaagctgg ctgactttgg 1501 cctggccaga atctacagct accagatggc acttacaccc gtggtcagta gaaagatggt 1561 accaaaatgg gttctggttg ggaataggag agtgattgcc cgtagcaatt gagaagtcat 1621 gtgcttcatg tgttcagtca agcaagttgt gtttcatggt aacccatggg gtccccatcc 1681 attcttccta ttccctttag gttgttacac tctggtaccg agctcccgaa gttcttctgc 1741 agtccacata tgcaacacct gtggacatgt ggagtgttgg ctgtatcttt gcagagatgt 1801 ttcgtcgaaa gtatgggacc cacataccct ggactacctt gaattcccca aatcgcttgt 1861 tcataaacca catccatacc 
ttgcccattc tttttttttg agaccagggc ttgctgtgtt 1921 gcccaggctg gattgcaatg gcatgatcac agctcactgc agcttcaacc tcctgggctc 1981 aagtgatcct cccatctcag cttcccaact agctgacact acaggcacgc acctccatgc 2041 ttggctagtt tgttaatatt tttatagaga tggggtctca gtatattgcc caggctggtc 2101 ttgaactctt gcactcaagc aatcctccca cccctacctc ccaaagtagc ataagctact 2161 gcatctggcc ccattctttt acttgcgtac tactaacttg cccatagcag aaagctctga 2221 aatgttctgg aattaggaac ttcatatccc tttattctct ttatttttta tttatttatt 2281 tatttattta tttatttatt gagataaggt ttcactctgn nacccaggct ggagtncagt 2341 ggcccaatta nagctcactg tancctctac ctcctgggct aaagmaatcc tcccatctca 2401 gccccttgag tanctgagac taaaggtgca cgccaccatg actggctttt ttttttttta 2461 gatggagtct tgctctgtcg ccaggctgga gtgcagtagt gcgatctctg ctcactgcaa 2521 cctccacctc ccagattcaa gcaattctct tgactcagcc tcccaagtag ctgggaccac 2581 aggtgcacgc caccatgctc agctaatttt tgtactttta gtaatgacag gtttcaccat 2641 gttggccagg atggtctcga tctcttgacc tcatgatcca cccacatcag actcccaaag 2701 tgctaggatt acaggcgtga gcnnnngcac ctggcatttc ttttttttta aaaaaagaga 2761 caaggtcttg cttgcccagg ctgatctaga actcctgggc tcaagcagtc ctctcacctc 2821 agcatcccaa agtgctggaa ttgttggcct ttattcccta tacttcctat tttgagccac 2881 taagcagtaa ccattcaact aagatatctt tgaaaatgac tgctacctta tatcccttct 2941 caccttaggc ctctcttctg tggaaactct gaagccgacc agttgggcaa aatctttgag 3001 taagtgacca acatgggaga aaaagatttt ctattctgag tcctctttct gctgaaccca 3061 ggatggcaac tggctctgcc atggggatgg gaactggagg accctcctga ccagagttct 3121 cctgtccccc acagcctgat tgggctgcct ccagaggatg actggcctcg agatgtatcc 3181 ctgccccgtg gagcctttcc ccccagaggg ccccgcccag tgcagtcggt ggtacctgag 3241 atggaggagt cgggagcaca gctgctgctg gtaactggag atggctgtgg gcacagggaa 3301 agaaatagag actggggaaa gaaatagagc agtatgcagg gccctggcca ctgtggttaa 3361 tgaaacttgg ttggtagatg gtctgtagtt tttattacag ctgcaaatag ccacccacag 3421 agaaggatat agaagagaac ccatcctggc tgggcacggt ggctcacgcc tgtaatccca 3481 gcactttggg aggccaaggt gggcgtatca cctgaggtca ggagttcgag accagcctgg 3541 ccaacatggt gaaacctcgt ctctactaaa 
agtacaaaaa taagccgggg gtggtggcac 3601 acgcctgtaa tctcagctac ttgggaggct gagataggag aatcacttca actcaggagg 3661 cggaggttgc agtgagctga gatcatacca ttggcactcc agcctgggtg atagagcgag 3721 actccgtctn caaaaaaaaa aaaaaagaaa aaagaagaaa gctcatccca ggtattgttg 3781 tgggtggcag aagctgtttt cttcatggtt ttctgacctt tgcctctccc ctcaggaaat 3841 gctgactttt aacccacaca agcgaatctc tgcctttcga gctctgcagc actcttatct 3901 acataaggat gaaggtaatc cggagtgagc aatggagtgg ctgccatgga aggaagaaaa 3961 gctgccattt cccttctgga cactgagagg gcaatctttg cctttatctc tgaggctatg 4021 gagggtcctc ctccatcttt ctacagagat tactttgctg ccttaatgac attcccctcc 4081 cacctctcct tttgaggctt ctccttctcc ttcccatttc tctacactaa ggggtatgtt 4141 ccctcttgtc cctttcccta cctttatatt tggggtcctt ttttatacag gaaaaacaaa 4201 accaaaagaa awaatggccc tttttttttt ttt // LOCUS HSU37106 3186 bp DNA PRI 06-JUL-1996 DEFINITION Human erythroid Kruppel-like factor EKLF gene, complete cds. ACCESSION U37106 NID g1389691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3186) AUTHORS Bieker,J.J. TITLE Isolation, genomic structure, and expression of human erythroid Kruppel-like factor (EKLF) JOURNAL DNA Cell Biol. 15 (5), 347-352 (1996) MEDLINE 96211158 REFERENCE 2 (bases 1 to 3186) AUTHORS Bieker,J. TITLE Direct Submission JOURNAL Submitted (26-SEP-1995) James Bieker, Brookdale Center for Molecular Biology, Mount Sinai School of Medicine, 1 Gustave L. 
Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..3186 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(757..906,1819..2644,2900..3075) /note="erythroid cell-specific transcription factor with three Cys-His zinc fingers of the Kruppel family; zinc finger transcription factor" /product="erythroid Kruppel-like factor EKLF" CDS join(820..906,1819..2644,2900..3075) /note="erythroid cell-specific transcription factor with three Cys-His zinc fingers of the Kruppel family; zinc finger transcription factor" /codon_start=1 /product="erythroid Kruppel-like factor EKLF" /db_xref="PID:g1389692" /translation="MATAETALPSISTLTALGPFPDTQDDFLKWWRSEEAQDMGPGPP DPTEPPLHVKSEDQPGEEEDDERGADATWDLDLLLTNFSGPEPGGAPQTCALAPSEAS GAQYPPPPETLGAYAGGPGLVAGLLGSEDHSGWVRPALRARAPDAFVGPALAPAPAPE PKALALQPVYPGPGAGSSGGYFPRTGLSVPAASGAPYGLLSGYPAMYPAPQYQGHFQL FRGLQGPAPGPATSPSFLSCLGPGTVGTGLGGTAEDPGVIAETAPSKRGRRSWARKRQ AAHTCAHPGCGKSYTKSSHLKAHLRTHTGEKPYACTWEGCGWRFARSDELTRHYRKHT GQRPFRCQLCPRAFSRSDHLALHMKRHL" BASE COUNT 634 a 977 c 986 g 586 t 3 others ORIGIN 1 ggatccaacg gtcctatccc acccaggagg agagacggtc acttttccct tggctgcgcc 61 catcgcacta aagcagctgg cactgaacca gcctccatgc agtcccatgc cgtgccaccc 121 aagggtcccc agtagacaat ggtgggccag ttgtcagggg nttcctcctc ctgcagggct 181 gagacnntgg gaggtcccaa cccaggaaat tgaacgccag gctaatttga agacccaact 241 cccagccctc cccttcaccg gaggacagag ctctagctgg cctgggcccc cacctgatag 301 cagcctccaa cgtctggggt gtctgataat gcttggcggg gagctcgtgc caagtcccgc 361 catcagcacg gttgttgctg tttactgggg agggggaggg ctgtggagcc tcaatcaggg 421 ggacaggggg tcccacagct tcttcccaga ataccctttc tgccttttcc aggaaagtta 481 actgagggaa gacccccaag tctctccttc tttggagacc caatgtctgt ttttacccag 541 cacctggacc ctcaaaccct gaaccccccc aacccttgat atttgacttg gctttggaca 601 cagggttagt ctttaacccc agccccagac aggccaacgt gaagtttgtg ccccagaaac 661 agtgcccccc cgccgccttg ccttgctttg ccttatcaga ggctgcagcc aatcagctaa 721 ggacagagag gagccctcga aggggctatc acagcctcag agttcacgag gcagccgagg 781 aagaggaggc ttgaggccca 
gggtgggcac cagccagcca tggccacagc cgagaccgcc 841 ttgccctcca tcagcacact gaccgccctg ggccccttcc cggacacaca ggatgacttc 901 ctcaaggtgg ggcctagaag gtggggtcta ggtgggctgg ctggaatcca gggccacagt 961 cacagatctt ggggtccaga cctgcatctt gacctgaaat caagagactt aaccaggact 1021 gaggtacgct cagtccagga gaggagatct cagcttagtc tggcaggggg tgaggagggt 1081 ggtctagggg tttgaggttc taagtgtgat ctatttcggt aatagaaaac gaaggtagcc 1141 tgggcaacat ggtgaaaccc tatctctaca aaaaatacca aaaacattag gccaggcatg 1201 ggggcgtgtg cctgtagtcc caggtactcc gtaggctgat gcaggaggat cattagagcc 1261 caggagatta aggatacagt gagctgcacc actgcactca gcctgggcaa aagagtaaga 1321 ccctatctca agaaaaaaaa aaaaaaaaag gaacgagatc taggcctcac agacaatctt 1381 ccagatatca gcgggaaatg atgagtgtgt ctgggggcac atccaaaatt tcggaattaa 1441 tcttgttttg ggagacaggg aaggagaggg atgttctggg ggaaaactaa gtcaaggctg 1501 gcatcctctc ccccgtcccc tgccagtttt ccatctccag cagctctgct accccttccc 1561 ccatccccga gtgtggttcc agatagtgga agtcttatct cctgtctcca gccagacctg 1621 atcggtttct gtccctggag ctgggggggg agcggggaga ggggcggtta gaggggcagt 1681 gttggggaag tgggacagac agacaggcaa acaagacccc tttccaaagc ctctgcgtca 1741 gtgtgtccag cccgcgatgt ccctggggag ggcaccccag tgtccaccga acctcgagct 1801 gcctgctccc tcccgcagtg gtggcgctcc gaagaggcgc aggacatggg cccgggtcct 1861 cctgacccca cggagccgcc cctccacgtg aagtctgagg accagcccgg ggaggaagag 1921 gacgatgaga ggggcgcgga cgccacctgg gacctggatc tcctcctcac caacttctcg 1981 ggcccggagc ccggtggcgc gccccagacc tgcgctctgg cgcccagcga ggcctccggg 2041 gcgcaatatc cgccgccgcc cgagactctg ggcgcatatg ctggcggccc ggggctggtg 2101 gctgggcttt tgggttcgga ggatcactcg ggttgggtgc gccctgccct gcgagcccgg 2161 gctcccgacg ccttcgtggg cccagccctg gctccagccc cggcccccga gcccaaggcg 2221 ctggcgctgc aaccggtgta cccggggccc ggcgccggct cctcgggtgg ctacttcccg 2281 cggaccgggc tttcagtgcc tgcggcgtcg ggcgccccct acgggctact gtccgggtac 2341 cccgcgatgt acccggcgcc tcagtaccaa gggcacttcc agctcttccg cgggctccag 2401 ggacccgcgc ccggtcccgc cacgtccccc tccttcctga gttgtttggg acccgggacg 2461 gtgggcactg gactcggggg gactgcagag 
gatccaggtg tgatagccga gaccgcgcca 2521 tccaagcgag gccgacgttc gtgggcgcgc aagaggcagg cagcgcacac gtgcgcgcac 2581 ccgggttgcg gcaagagcta caccaagagc tcccacctga aggcgcatct gcgcacgcac 2641 acaggtgagg gggcggggcc ccggacatga gaaagggcgc ggcgcccgct gtagttacag 2701 gggaagaagg gttgcagagg gcgggacttg gacttggctg gcctctgaga gtgagtgcct 2761 ccttaaattt tgtgccctag ggcctcactt tgttcatcct agtcccagcc caggctgagt 2821 aaaggggtgt gaccagatgc aggggacccg gggacatgac tggacagaca gtggcgctta 2881 tggcttcctt gtcccctagg ggagaagcca tacgcctgca cgtgggaagg ctgcggctgg 2941 agattcgcgc gctcggacga gctgacccgc cactaccgga aacacacggg gcagcgcccc 3001 ttccgctgcc agctctgccc acgtgctttt tcgcgctctg accacctggc cttgcacatg 3061 aagcgccacc tttgagccct gccctggcac ttggactctc ctagtgactg gggatgggac 3121 aagaagcctg tttggtggtc tcttcacacg gacgcgcgtg acacaatgct gggtgggttt 3181 tcccac // LOCUS HSU40391 3453 bp DNA PRI 22-JUN-1996 DEFINITION Human serotonin N-acetyltransferase gene, complete cds. ACCESSION U40391 NID g1389593 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3453) AUTHORS Coon,S.L., Mazuruk,K., Bernard,M., Roseboom,P.H., Klein,D.C. and Rodriguez,I.R. TITLE The human serotonin N-acetyltransferase (EC 2.3.1.87) gene (AANAT): structure, chromosomal localization, and tissue expression JOURNAL Genomics 34 (1), 76-84 (1996) MEDLINE 96299662 REFERENCE 2 (bases 1 to 3453) AUTHORS Rodriguez,I.R. 
TITLE Direct Submission JOURNAL Submitted (08-NOV-1995) Ignacio Rodriquez, LRCMB, NEI-NIH, 6 Center Drive MSC2740, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3453 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25" promoter <1..696 exon 697..855 /number=1 mRNA join(697..855,1801..2038,2302..2456,2794..3246) /product="serotonin N-acetyltransferase" intron 856..1800 /number=1 exon 1801..2038 /number=2 CDS join(1876..2038,2302..2456,2794..3099) /codon_start=1 /product="serotonin N-acetyltransferase" /db_xref="PID:g1389594" /translation="MSTQSTHPLKPEAPRLPPGIPESPSCQRRHTLPASEFRCLTPED AVSAFEIEREAFISVLGVCPLYLDEIRHFLTLCPELSLGWFEEGCLVAFIIGSLWDKE RLMQESLTLHRSGGHIAHLHVLAVHRAFRQQGRGPILLWRYLHHLGSQPAVRRAALMC EDALVPFYERFSFHAVGPCAITVGSLTFMELHCSLRGHPFLRRNSGC" intron 2039..2301 /number=2 exon 2302..2456 /number=3 intron 2457..2793 /number=3 exon 2794..3246 /number=4 misc_feature 3247..3453 /note="3'flanking region" BASE COUNT 686 a 1012 c 1082 g 673 t ORIGIN 1 aagcttcctc tgcagggggt caaaagagtg cacctgtgaa atccaacctc caggaccccc 61 aggagcccga gatggacaca ggatctgcac tgctgggtgg ggcggagagg caggtgctgg 121 cgagagaaga gctaaggggt gcaggatggg gtgttagctg gagggcaggg ggtagtcaga 181 gccaacaggg agaggggcct gcatcccgag ggaatgccag caccaagtgc caaggcctgg 241 gcagccctgg gaatgggtgg agccaaggtc aatagcagag agctttgtag ttgagatgtt 301 tggtagggca acctcctgaa tatttaaata tttaattaaa ttaaatattt aaataaactt 361 gaagtagaac gtacttaaag tgcacaaatt ggtagcttga attcccaccc cctcccccca 421 cataagaggt ggggttgtcc aagactccga gggacagagg gagaatcgct ggtgcccagc 481 cagtcggtga cagagcaggg attagagatg gacccctgag gcttagcccc acgctctctt 541 ccccaaaccc tggggcctca gtcagcctct ggtaatccct tctgtgccca gggcaccaga 601 gctgacgttt cccttcagca aggcaggggc cgaggctgcc agagccctgc tggcgtcatc 661 ctgcctggga tagctcgaga ggtggggtgg ggggattaca aacctctaac agcaccacag 721 gctgcccggg gaacatctgc tactacagcc ttgcagcccg gagtcccgga ttttactggt 781 tcccgtgcct gcggacaggc ccccagggct agcggctttg tggagggaac actgggtatc 841 ctctccacag 
tccaggtagg tgggaccccc actcctggct ctcagccttt aggaacagga 901 gcttctctac ttggagcttc tagacatgga gggaaacatg gaagccatgg agtttattaa 961 gtgtcttgga aggtggaggg agggatggcc tcctctaaga actgcaaatc tgcagggtgt 1021 cttgggagac aagagtgccc agaggggaat gtctgatgag ctggatgtgt ggggcagggg 1081 gtgtcctatg cactctcctt gcctgctcct ccccattccc caggccctct gcagcccccg 1141 gctccctcta acctccccac cccccaccac aaatcctgca tgtgtgtggc ccagtctgag 1201 cccatctgga tagaagtaga ggctccctaa gggcccactt tgattaaggg gctctctagg 1261 accaagggct gaggctacct aaaaagctct ctgccacagg atcttgcacc caaaggacca 1321 gaagcctccc tctggggagg taagataaac cacatgaatg gtggtgtggg ggaaccgggc 1381 gcccacccac tgcacagttg cattagacgt ggtcagtgga gggtcacgag tcctgcctcc 1441 cctcccaccg tgggcagcag ggttggggtg ccccaggctc cactgagccc tgctcatccc 1501 tgccaccatc tctggggctc agaagcaccc aagacggcac cctgagacca atgtctgcag 1561 gtctgcaagg tggcaaacct ccagggatcc ttctggctga gagggaaagg tggggagaca 1621 gtgccaatgt ttgcaatgcc tggtaacagc ctcggggacc aggtacctgg ggaatgtgcc 1681 cattgattta ggggctgtct ccaccccagt tcctgctaca aaagaggcca gatacatact 1741 ctggggcctg cctccctgga gatgaggatg agacccctgt cccttgctgt tctccaacag 1801 gtgctgggag gccctccttg gcttaggagg acacttccaa agctggggcg ccccaaggag 1861 gcaccagtgg ccagaatgtc cacgcagagc acccaccccc tgaaacctga ggccccacgt 1921 ctgccacctg ggatccccga gtccccgagc tgtcagcggc gccacacact ccctgccagt 1981 gagtttcgct gcctcacccc ggaggacgct gtcagcgcct ttgagatcga gcgtgaaggt 2041 gagtggcccc gcacagggtc agagggatgc tccactctgg tccagttatc ctgtggggag 2101 gagaccctta gtctcctgtc cttggaggct gggtcccaga gtatcagacc atgtgtgcgc 2161 tcaagaaagt gggggaaaca gcagccctaa cccccatttt cctgtgggga acggggcatc 2221 tgagtggaca ctcggggtgc agcagacagt ggacgcgagg cacagcgact accagtcacc 2281 cacctgagcc tcctgccaca gccttcatct ccgtcttggg cgtctgcccc ctgtacctgg 2341 atgagatccg gcacttcctg accctatgtc cagagctgtc cctgggctgg ttcgaggagg 2401 gctgccttgt ggccttcatc atcggctcgc tctgggacaa ggagagactc atgcaggtga 2461 ggacaggcct gcgacgccca gctccaggga ggcctctgaa gacagaggtc agccagatgg 2521 cggggagggg agcccagggg 
ctgggatttc ttcctccaga actggagaga tgagtacagg 2581 ccacaggccc ctcccagagc aagaccttct gggtcttcaa gttttctcca tggggttggg 2641 ggtatggctc ccaatttggg gccctccttt gctggggtgg gtgccctgac cacaggcacc 2701 caggggacac ctgctccctg cctgggttgg tggttggggg ggagcacgtg tcagcagaag 2761 tgacctggga tctcatccct tgctcgctcc caggagtcac tgacgctgca caggtctggg 2821 ggccacatag cccacctgca tgtgctggcc gtgcaccgcg ccttccggca gcagggcagg 2881 ggccccatcc tgctgtggcg ctacctgcac cacctgggca gccagccggc cgtgcgccgg 2941 gccgcgctca tgtgcgagga cgcgctggta cccttctatg agaggttcag cttccacgcc 3001 gtgggcccct gcgccatcac cgtgggctcc ctcaccttca tggagctcca ctgctccctg 3061 cggggccacc ccttcctgcg caggaacagc ggctgctgaa ctgggctgcc cacctggctg 3121 ccaacatgat ccccgtctct gccctgggct cctcttagct cagctgagca tagagacagc 3181 agtttccaga gagtggagag agcagggcta aataaagagg agataaggtg gcttctcacg 3241 gcctgagctg gagtggtgtg tcttgtctgt ccccacgagg cctctggacc tcctgtgttc 3301 tgaactctgt acctgagacc gggctgggtt ggtaagcggg gacaatggga ggtgctgtgg 3361 ggttcctggc tcctctcctc ctggcaggtg ggcaaaggca ccaaggcggg atcctcacag 3421 acagcccatt gctcggtggg gggggggggg ggg // LOCUS HSU43415 2747 bp DNA PRI 22-JAN-1996 DEFINITION Human obese (ob) gene, complete cds. ACCESSION U43415 NID g1163105 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2747) AUTHORS Chehab,F.F. and Lim,M.E. TITLE Direct Submission JOURNAL Submitted (15-DEC-1995) Farid F. Chehab, Laboratory Medicine, Univ. 
of California, 505 Parnassus Avenue, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..2747 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="'7q31-32" gene 1..2747 /gene="ob" CDS join(1..144,2391..2747) /gene="ob" /codon_start=1 /product="obese" /db_xref="PID:g1163106" /translation="MHWGTLCGFLWLWPYLFYVQAVPIQKVQDDTKTLIKTIVTRIND ISHTSVSSKQKVTGLDFIPGLHPILTLSKMDQTLAVYQQILTSMPSRNVIQISNDLEN LRDLLHVLAFSKSCHLPWASGLETLDSLGGVLEASGYSTEVVALSRLQGSLQDMLWQL DLSPGC" BASE COUNT 689 a 673 c 724 g 661 t ORIGIN 1 atgcattggg gaaccctgtg cggattcttg tggctttggc cctatctttt ctatgtccaa 61 gctgtgccca tccaaaaagt ccaagatgac accaaaaccc tcatcaagac aattgtcacc 121 aggatcaatg acatttcaca cacggtaagg agagtatgcg gcgacaaagt agaactgcag 181 ccagcccagc actggctcct agtggcactg gacccagata gtccaagaaa catttattga 241 acgcctcctg aatgccaggc acctactgga agctgagaag gatttgaaag cacagggctc 301 cactctttct ggttgtttct tttggcccct ctgcctgctg agattccagg ggttagtggt 361 tctaattcta aaccactcct agcaacattt gattttgcta catgtttcca tttaaaaatc 421 ataggatttg ggctgggtgt ggtggcttgt acctgtcatc ccagcacttt gggaggccaa 481 agcaggagga tcattcgagc ccaagagttc gagaccagcc tgggcagcat agggagaccc 541 catctctaca aaaataataa aaaatgttag ctgggcatgg tggtgtgtac ctgtggtccc 601 agctagggga ggctgagatg gaaggatcaa cctgagcctg ggaggttgag gctgcagtgg 661 gccctgatca tgccaccgtg ctccagcctg ggtgacagag tgagaccttg tctcaaaata 721 aataaataaa taaataaaag tcataggatt tgatcaggca tgatgggtca catctgtaag 781 cccattgctt taggaggcca aggtaggagg atcagttgag gccaggagtt caagaccagc 841 ctgggcaaca tggcaagacc tctctctcta atttttaaaa aaataaaaat taaagataag 901 aaaaaaatca caggattctc atgaggcctc acgtgcttat tttcaaccta ccaaggggga 961 aacccaggcc tcagcgatta gctgaggcac atgcagggac aggcactgtc tctttccttc 1021 ctgtcccctc tgtccccacc ttctgcgctc gccttcctcc ctgacttcac ttccttgaat 1081 cttagtgcct acgaccagag ggagctgtga agttccttgt gtcccattgg caggtacaag 1141 acccccagaa gcatctcctc agggcctcta tcccatctct agatgtgctt gtcattaggg 1201 ttcttgtagt tccagctgat ctctggccct gccgctcaaa gatacccaaa 
agagcgagtc 1261 tacccttttt cacattcaac cctctactga tttgcaaata gcagtcagtg cccaccctgg 1321 tcttttctct ggggtccagc aggcctagac cttcagccat tttcctgatg aggtctgtat 1381 ttgaaattag gaagattaag tttgaatctt cacacttctg atgtctgtga gatcttcagc 1441 aagttcctta ctgtctttaa gccttgtttt catcatctgg ataatgggga tatcacacac 1501 tattcacaag gttgttatga ggcctaaatt agctaaagca attgaatcct ccttaccccc 1561 tgcatggagc tctctggaga cttccacgtc tcctggtcat tgtgggtgtc ttatggtagt 1621 cttgggcagt tagggagaag ttaggtgtct ggaaccaaag atggctcaga actagataga 1681 gtcttgggca ttttatagat aaaaactctt gtctccttta aaaataataa aaaaaaatta 1741 gctgggcata ttagccactc agcaagactg cacgtgatag atcccgagtg ccccaccttg 1801 ggtggtgtaa tacacgatat cacgggaccc ccgggtagta accacggagg tgtcagcctc 1861 agtcctgtgg gcagatggat ggggagagcc tcccggaact ggagtcactg gagcagggtt 1921 gggggccctc actgagggta cggccttgat ctctaaggag gagggactgc ctggaaaagc 1981 tgactgggag ggaggactcg gctgggggta gaagggacta gggaagcctg ggggtggggg 2041 tgcttatgga ggacctcaga tgcctgggga acagactcca ctaaataaaa catatgaaac 2101 catggctgtt cttcagcaga ggctatgtag agaaaggaat gacctaggaa agttggcctg 2161 gaagtggagg gaaggatggt gtgggaaaag caggaatctc ggagaccagc ttagaggctt 2221 ggcagtcacc tgggtacagg atacaagggc ctgagccaaa gtggtgaggg agggtggaag 2281 gaggcagccc agagaatgac cctccatgcc cacggggaag gcagagggct ctgagagcga 2341 ttcctcccac atgctgagca cttgttctcc ctcttcctcc tgcatagcag tcagtctcct 2401 ccaaacagaa agtcaccggt ttggacttca ttcctgggct ccaccccatc ctgaccttat 2461 ccaagatgga ccagacactg gcagtctacc aacagatcct caccagtatg ccttccagaa 2521 acgtgatcca aatatccaac gacctggaga acctccggga tcttcttcac gtgctggcct 2581 tctctaagag ctgccacttg ccctgggcca gtggcctgga gaccttggac agcctggggg 2641 gtgtcctgga agcttcaggc tactccacag aggtggtggc cctgagcagg ctgcaggggt 2701 ctctgcagga catgctgtgg cagctggacc tcagccctgg gtgctga // LOCUS HSU43572 10127 bp DNA PRI 11-JUN-1996 DEFINITION Human alpha-N-acetylglucosaminidase (NAGLU) gene, complete cds. ACCESSION U43572 NID g1171228 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10127) AUTHORS Zhao,H.G., Li,H.H., Bach,G., Schmidtchen,A. and Neufeld,E.F. TITLE The molecular basis of Sanfilippo syndrome type B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (12), 6101-6105 (1996) MEDLINE 96234097 REFERENCE 2 (bases 1 to 10127) AUTHORS Zhao,H.G., Li,H.H. and Neufeld,E.F. TITLE Direct Submission JOURNAL Submitted (19-DEC-1995) Hong G. Zhao, Biological Chemistry, UCLA, 10833 Le Conte Ave., Los Angeles, CA 90095, USA FEATURES Location/Qualifiers source 1..10127 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q21" /cell_type="fibroblast cell" mRNA join(<1085..1799,2542..2689,3482..3628,3813..3898, 6091..6347,8167..9588) /gene="NAGLU" /note="5' end of mRNA is uncertain" gene 1085..9588 /gene="NAGLU" CDS join(1417..1799,2542..2689,3482..3628,3813..3898, 6091..6347,8167..9377) /gene="NAGLU" /EC_number="3.2.1.50" /note="lysosomal enzyme; deficient in Sanfilippo B syndrome" /codon_start=1 /function="one of four enzymes involved in the degradation of heparan sulfate; specifically removes the alpha-N-acetylglucosamine residues" /product="alpha-N-acetylglucosaminidase" /db_xref="PID:g1171229" /translation="MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLLGPGP AADFSVSVERALAAKPGLDTYSLGGGGAARVRVRGSTGVAAAAGLHRYLRDFCGCHVA WSGSQLRLPRPLPAVPGELTEATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALN GINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSW HIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCSYS CSFLLAPEDPIFPIIGSLFLRELIKEFGTDHIYGADTFNEMQPPSSEPSYLAAATTAV YEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYT RTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQ NEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEA CRGHNRSPLVRRPSLQMNTSIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQ 
ARAAAVSEAEADFYEQNSRYQLTLWGPEGNILDYANKQLAGLVANYYTPRWRLFLEAL VDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPGWVA GSW" intron 1800..2541 /gene="NAGLU" /number=1 intron 2690..3481 /gene="NAGLU" /note="the sequence for this intron was partly determined by the submitters and partly derived from GenBank Accession Number M84472" /number=2 intron 3629..3812 /gene="NAGLU" /number=3 intron 3899..6090 /gene="NAGLU" /note="the sequence for this intron was partly determined by the submitters and partly derived from GenBank Accession Number M84472" /number=4 intron 6348..8166 /gene="NAGLU" /note="the sequence for this intron was partly determined by the submitters and partly derived from GenBank Accession Number M84472" /number=5 BASE COUNT 2139 a 2846 c 2824 g 2318 t ORIGIN 1 tcgtatccgc ccgtcttggc ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc 61 cagcacctgg ctaacttttg tatttttagt acagacgggg tttcactgta tgttggccag 121 gctggtctca aacccctgac ttgaggtgat ctgcaagcct cagcctccca gagtgctggg 181 attacaggcg taagccaccg ctcctggcct aaggttggct atttttatgg ttatttcttg 241 attatatgat aaacaagggg tgggttagta atgaattttt cagaaaaggg gtggggatcc 301 cccccaactg aaggttcctc cactgtttag accatatagg gtaacttctg gacgttgcca 361 tggcatttgt aaactgcctg gcgctgctag gagtgtcttt agcatactaa tgcattataa 421 ttagcgtata atgagcagtg aggacgatca gaggtcacct tcctgtcttg gttttggcag 481 gttttgacca gtttctttgc tgcattctgt tttatcagcg gggtcttgtg accttttatc 541 ttgtgctgac ctcctgtctc atcctgtgac gaaggcctaa cctcctggga attcagccca 601 gcaggtctct gcctcatttt acccagcccc tgttcaagat ggagtcgctc tggttggaaa 661 cttctgacaa aatgacagct cctgttatgt tgctgctgct gccgccaatg gacagccttt 721 aacgtgcccg ccagccctgc tccaccgccg gcctgggctc acatggcccc atccctcctc 781 gaacctccta gcctgttagt tactcaaatc tgcaagctct ctgccttctc agggccttca 841 ataaatgcat ttcttctgtc tggaaggctc ttcctttccc tcttctagcc aattcctatt 901 catccctgag tttcagatta aaagtcactt cctttggaaa ccttacttcg ctacttcgct 961 acttactgca ctacttcgca gcatcacaac tatgatggaa atccttactt acgttaaata 1021 tctggtttct aggtcacctc 
cctgacgggg acggtaggga ccgtcttctc gttcatcagt 1081 agggaagtag ctatggcagt gcctgataca aaataaactc caaatgtgta tttattagat 1141 ggttggatgg aagttatttg cgtgtgaaag cgcgttttac ccgaaggcgc tctgtgaggg 1201 ccagcgggtc cccttcggcc ctggagccgg ggtcacacgc tccccaccgc gtgcggtcac 1261 gagacgcccc caagggagta tcctggtacc cggaagccgc gactcctggc cctgagcccg 1321 ggcttagcct tcgggtccac gtggccggag gcggcagctg attggacgcg ggccgcccca 1381 ccccctggcc gtcgcgggac ccgcaggact gagaccatgg aggcggtggc ggtggccgcg 1441 gcggtggggg tccttctcct ggccggggcc gggggcgcgg caggcgacga ggcccgggag 1501 gcggcggccg tgcgggcgct cgtggcccgg ctgctggggc caggccccgc ggccgacttc 1561 tccgtgtcgg tggagcgcgc tctggctgcc aagccgggct tggacaccta cagcctgggc 1621 ggcggcggcg cggcgcgcgt gcgggtgcgc ggctccacgg gcgtggcggc cgccgcgggg 1681 ctgcaccgct acctgcgcga cttctgtggc tgccacgtgg cctggtccgg ctctcagctg 1741 cgcctgccgc ggccactgcc agccgtgccg ggggagctga ccgaggccac gcccaacagg 1801 taccgccccg aagcttcccc gcgtccgccc gaggcgctta ccccctcccg gagccgctgc 1861 cacccaaatc gggaggctga gcggggagcg ctggccggaa ggcccagctg cgccgcctcc 1921 agcagctgtg tggccttgag ccagccactc tgcctttcag agcctcggct ggcccacctg 1981 aaaaacggaa agaagacgcc taccgtgcag tgttattgtg aggatttgca cgatgatggg 2041 catagaattt gtggtgcaca attggtgatg agtgaatttt cttgccttac ctcccccacc 2101 ttctctttga acctgcggac tgaggaagga cgcctccatc ccccacccta caggcctgtg 2161 ttccagcgcc tgccacacta tggagtgatg tgttcacaca gctgtcctcc cctgcccatc 2221 tgttagactg tgggggcagg gattccccgt tccaggaaaa caccgtgcag aggaggggct 2281 ctggcagtgt ggcatgaaag tggatatgcc acccaaatac ccgccaggct agagggccct 2341 gggagagtgc aggggacgag tgcctcagaa gcccagcccc ggtacctggt ctcagctcca 2401 cctggggtgg gtcccagtgt gcagcagaag ggccgagttt ggagcccctc ccctctcctc 2461 taggtggggg atgggggatt tgttccaggg ccgtggaccc tccagggtgg gatgcgcccc 2521 tgctcatgac actgcccgca ggtaccgcta ttaccagaat gtgtgcacgc aaagctactc 2581 cttcgtgtgg tgggactggg cccgctggga gcgagagata gactggatgg cgctgaatgg 2641 catcaacctg gcactggcct ggagcggcca ggaggccatc tggcagcggg tgcgtgccca 2701 ctgtcccttc cccaccctcc tctatggcgg 
gagccaccgt aggtgttttc acccgccccc 2761 cagcatgggc gcagtgtctc tctctagaag tgctttcaac gtgcacagtg gcttgggcct 2821 cctaaaaact gaggcttccg gccgggcgcg gtggctcacg cctgtcatcc cagcacttcg 2881 ggaggcctag cgggcggatc aggagttcag gagatcgaga ccatcctggc caacattgtg 2941 aaaccccgtc tctactaaaa tacaaagaaa tagcaacctg ggcaacagag cgagactctg 3001 tctaaaaaaa aaaaaaaaaa aaactgaggc ttccagtttg aggagtgggg ctccttcccc 3061 catctcccct atgcagccaa tcacctggtc ccttggatcc aactcatggg cagctctaga 3121 tctgcctccc tggaagcttc tgtgctgcaa tggctgctcc aggctctgct taagctcttc 3181 acacagttgc cctgcccttc catctggcac tcttgctcca tgaagccttc taaggccttc 3241 ctcttggggg aaagcccctt tgtgccccat ctcctcaccc atgcgacaaa ggcaacacag 3301 tgaactcacc tactcacagg tctctttcct ctgggctgtg ggctccttga tggcagcgtt 3361 cggattttgt ctcagtagcc ctagcgccca gcacaaagaa gcaatgagtg aatggttgtt 3421 gaatgaatga atgaatgaat gaagatgaat atatttctat gtgtgggccc ttcttcctca 3481 ggtgtacctg gccttgggcc tgacccaggc agagatcaat gagttcttta ctggtcctgc 3541 cttcctggcc tgggggcgaa tgggcaacct gcacacctgg gatggccccc tgcccccctc 3601 ctggcacatc aagcagcttt acctgcaggt aaaaggatgg aaaagggaag gggcagaatc 3661 ggtgatagat ggtcatgggc ccaggaaggg tggtattagg ccggccccag ggctcttaac 3721 tgaggcgggg ggctgcgtgt atcctgggag atgagggcct tctcatagga cagcagtggc 3781 catgctcacc acccttcctt ctgttcctcc agcaccgggt cctggaccag atgcgctcct 3841 tcggcatgac cccagtgctg cctgcattcg cggggcatgt tcccgaggct gtcaccaggt 3901 gaggttccgc tcaccccctc cacttagctc agagagggaa ttttattccc ttctagaaca 3961 tgacttaaaa acttaagctc tgggccgggc gcagtggctc acgcctgtaa tcccagcact 4021 ttgggaggcc gagttgggcg gatcacctga ggtcaggagt tcgagaccag cctggccaac 4081 atggtgaaac cctgtctcta ctaaaaatat aaaaattagc tgggcatggt ggcacgcgcc 4141 tgtaatccca tctacttagg aggctgagac aggagaattg cttaaacctg ggaggcagac 4201 gttgcagtga gtcaagatca cgccattgca ctccagcctg ggtgacgagc gaaactctgt 4261 ctcaaacaaa caaacaagct ctggacgtag gcctgggttt gatttctgac tctgctacta 4321 attagctgtg tgacttcggg cagatgacat gactgctctg tgcctcagtt tccttacttg 4381 taaaatggga tctctaccca ctcgctgtag ggtttgtaat 
tatctctcga tctatctgtg 4441 actttgcaca gagtgctagc aaatggcagc ccctgggagt ggccgcaggg gtgctccagt 4501 gtcccttgtc ccttcctgtt cctcgggctt tcccagccat cctgtaaatg tcttgggaaa 4561 agtcttcaag gctcacctga gacctcccct ccttcaggaa gccttgctag tgccccgcat 4621 gacctccttt gcacctgcta atgtctggct cccatactct cgtaggactt aatgcatgcc 4681 agtggcctcc ctgcccgcct ctttgccccc atcaccaggt ggcaggaaac tcactcattc 4741 attcaataaa cttggtccag ctgtctgagg ctgccagaac tggctgtgct gggtcctggg 4801 aggcggcaag aaaggtgccc aagggcttac ccctgatagg agagatatgt tggctgaagg 4861 atacaatgtg gggacaagga caggaatata tgtgggttcc gctctcctct gccgggagag 4921 aggggcagga agggctcagg gcagagccca gccttgaaaa atgagtgttg cttggacgga 4981 cgcttggcta atgcttgtaa tcctagcgtt ttgggaggct gaggcgtatg gatcacctgc 5041 ggtcaggagt taaagaccag cctggccaac atggcgaaac cccatctcta ctaaaagtac 5101 aaaaattagc caggcgtggt ggcgggctcc tgtaatccca gctactcggt aggctgaggc 5161 atgagaatct cttgaagcca ggggccagag actgcagtga gccgagatca caccacttca 5221 ctccagcctg ggtgacagag tgagactccg tctcaaaaaa aaaaaaaaaa aaaggaaaga 5281 aaattaaaca cctcatgttc tcactcatag tgggagttga acaatgagaa caacatggac 5341 acaggaaggg gaacatcaca caccggggcc tttcgcggtg tgggggtcaa ggggaggagt 5401 agcattggga cagatactta atgcatgcgg ggctgaaaac ctagatgatg ggttgatggg 5461 tgcagcaaac caccatggca catgtatacc tatgcaacaa acctgcatgt tctgcacaga 5521 actgaactga aagtataatt aaaaaaaaaa aaaaaagctg ggtgcggtgg cccacacctg 5581 taatcccagc actttgggag gccgagacgg gcggatcaca aggtcagcag atcgagacca 5641 tcctggctaa cacagtgaaa ctcagtctct actaaaaata caaaaaatta gccgggtgtg 5701 gtggcgggca cctgtagtcc cagctactag ggaggctgag gcaggagaat ggcatgaacc 5761 tgggaggcag agcttgcagt gagctgagaa tgcgccactg cactccagcc tgggggacag 5821 agtgagactc tgcctcaaaa aaaaaaaaaa aaagaaagaa aaaggagcgt tgcttgtttc 5881 aggccacagg aaggggagag atagtgaaag tttttcagag aaggtggcca gggaaggaga 5941 agaaaggact gtaggcagag agcatagcct gtacaaagcc atagaggcaa gagaaaccag 6001 gagctgtaga gaagttggca aggctgttga acactatggt gaacactatg gcggcttcca 6061 tgaaatatct gagcttttgc tccccactag ggtgttccct caggtcaatg 
tcacgaagat 6121 gggcagttgg ggccacttta actgttccta ctcctgctcc ttccttctgg ctccggaaga 6181 ccccatattc cccatcatcg ggagcctctt cctgcgagag ctgatcaaag agtttggcac 6241 agaccacatc tatggggccg acactttcaa tgagatgcag ccaccttcct cagagccctc 6301 ctaccttgcc gcagccacca ctgccgtcta tgaggccatg actgcaggta cagtgcctgg 6361 gtggggtggg agagcccccc agaccctcaa aaagaaggga gtagcagatg tcagtagggg 6421 taggcagagg gactggaata atgcctcgcc ataacacaca gtactttata gtttaccaag 6481 cacgtgtaca catgcgttgt ctcagtgaat cccactgtgg ttgagaggtg agctctggaa 6541 gccaacaacc tgggtcacac ctcgcgctcc tatttcctgg ccgtgtgact tatgactcat 6601 gacctccttc ccagtgtctc gtttgctttt cctgtaaact gggactacct cataggtaga 6661 ataacgcctg gcccagagca aaggccacta agagctagct atgaacaagg attttgtttc 6721 atctctgcgt ggttgctgaa gtaggcactg caggcaggag gtgagtggat gtgcctaaag 6781 gcactaagtg cgcatcctgc tacaaaactg tgaagccagg gctccttcct gccacttaaa 6841 ggaggagtgg agcagagggc gcccaagtca ggaatgactt agtggagagg cgtctgtgtt 6901 ggccaggaag ggaacagatc agctcagcct ttcttgagca gtactgctcc aagtgtgacc 6961 caaaaccagc agcagcagca gcagcagccc gagctgtgag atggcaaatt ctcaggccct 7021 acccaagacc tgaaggagaa gctacatttt tttttttttg aagacagatt tcactctgtt 7081 gctgaggctg gagcacagtg gcacaatctc atctcactgc aaccttcgtc tcctaggttc 7141 aagcgattct cctgcctcag cctcccgagt agctgggact ataggcaccc gccaccacgc 7201 ccggcaattt ttgtttgttt tgagatagag tctcgctctg tcacccaggc tggagtgcag 7261 tggcacgatc tcagttcact gcaacctctg cttcctgagt tcaagcgatt ctcctgcctc 7321 agcctcctga gtagctggga ttacaggcgc cccccaacca cactcggcta atttttgtat 7381 ttttagtaga gacggggttt cgctatgtag gtcaagctgg tttcaaactc ctgacctcaa 7441 atgattcgcc cacttcagcc tcccaaagtg ctgggattac aggtgtgagc caccttgcct 7501 ggccaatttt tgtattttta gtagaaacag gtttcaccat ggtggccaga ctggtctcaa 7561 actcctgacc tcaggtgaac tgcccacctc agcctcccaa agtactggta ttacaggcgt 7621 gatccactgc gactggcctt gattttgttt ttgagacaga atcttactct gtcgcccaga 7681 ctggagtgca gtggcacaat ctcagctcac tgcaacttct gcctcatggg ttcaagtgat 7741 tcttgtgcct ctacctcccg agtagccggg attacaggca cctgccatta cgctaggcta 
7801 atttttgtat ttttagtata gacagggttt ccccacattg gccaggctgg tctggaactc 7861 ctgggctcaa gtgatccacc tgcttcagcc cctcagagta ctgggattat aggtgtgggc 7921 caccacgccc attcagaaac ctccatgttt taaggagcct ctgggtaact ctcatgttca 7981 cccaagctgc tgaaccctgt cttggagttt tcagagggac gcgtatgtgc cacagacgtc 8041 ccgctggtgg gggtcatggg aagccatgac ctgggataga cagtcgtctg tagagtgggg 8101 tgaacattcc ctgggccctc tgtttcatca ctcctcttct ctgttccccc tacctcctgt 8161 ccacagtgga tactgaggct gtgtggctgc tccaaggctg gctcttccag caccagccgc 8221 agttctgggg gcccgcccag atcagggctg tgctgggagc tgtgccccgt ggccgcctcc 8281 tggttctgga cctgtttgct gagagccagc ctgtgtatac ccgcactgcc tccttccagg 8341 gccagccctt catctggtgc atgctgcaca actttggggg aaaccatggt ctttttggag 8401 ccctagaggc tgtgaacgga ggcccagaag ctgcccgcct cttccccaac tccaccatgg 8461 taggcacggg catggccccc gagggcatca gccagaacga agtggtctat tccctcatgg 8521 ctgagctggg ctggcgaaag gacccagtgc cagatttggc agcctgggtg accagctttg 8581 ccgcccggcg gtatggggtc tcccacccgg acgcaggggc agcgtggagg ctactgctcc 8641 ggagtgtgta caactgctcc ggggaggcct gcaggggcca caatcgtagc ccgctggtca 8701 ggcggccgtc cctacagatg aataccagca tctggtacaa ccgatctgat gtgtttgagg 8761 cctggcggct gctgctcaca tctgctccct ccctggccac cagccccgcc ttccgctacg 8821 acctgctgga cctcactcgg caggcagtgc aggagctggt cagcttgtac tatgaggagg 8881 caagaagcgc ctacctgagc aaggagctgg cctccctgtt gagggctgga ggcgtcctgg 8941 cctatgagct gctgccggca ctggacgagg tgctggctag tgacagccgc ttcttgctgg 9001 gcagctggct agagcaggcc cgagcagcgg cagtcagtga ggccgaggcc gatttctacg 9061 agcagaacag ccgctaccag ctgaccttgt gggggccaga aggcaacatc ctggactatg 9121 ccaacaagca gctggcgggg ttggtggcca actactacac ccctcgctgg cggcttttcc 9181 tggaggcgct ggttgacagt gtggcccagg gcatcccttt ccaacagcac cagtttgaca 9241 aaaatgtctt ccaactggag caggccttcg ttctcagcaa gcagaggtac cccagccagc 9301 cgcgaggaga cactgtggac ctggccaaga agatcttcct caaatattac cccggctggg 9361 tggccggctc ttggtgatag attcgccacc actgggcctt gttttccgct aattccaggg 9421 cagattccag ggcccagagc tggacagaca tcacaggata acccaggcct gggaggaggc 9481 
cccacggcct gctggtgggg tctgacctgg ggggattgga gggaaatgac ctgccctcca 9541 ccaccaccca aagtgtggga ttaaagtact gttttctttc cacttaaact gatgagtccc 9601 ctgggtctgt caaaatgaga aggtcactgc tgccacgctt gggaggactc agggctatag 9661 catggccctg gggtgggacc tgttctccca tcccttgcct cacgtccctg tttttgtttg 9721 tttgtttgtt tgtgacggag ccttggtctg ttgcccaggc ttgagtacaa tggcacagtc 9781 tcggctcact gcaacctccg cctcctgggt tcaagcaatt cttgtgcctc agcctccccg 9841 gtagctggga ctataggcat gcaccaccac accaggctaa tttttttttt ccaagatgga 9901 gtcttgctct gtcgcccagg ttggagttta gtggcaccat attggtttac tgcaacctct 9961 gcctcccggg ttcaagcaat tctcctgcct cagtctacca gggagttagg actacgggcc 10021 tgtgccatca cgcctggcta atttttgtat ttttcataga gataaggttt caccatgttg 10081 gccaggctgg tctttaactc ctgaactcaa gtgatccacc tgcctcg // LOCUS HSU43901 6353 bp DNA PRI 24-AUG-1996 DEFINITION Human 37 kD laminin receptor precursor/p40 ribosome associated protein gene, complete cds. ACCESSION U43901 NID g1302647 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1022) AUTHORS Nag,M.K., Thai,T.T., Ruff,E.A., Selvamurugan,N., Kunnimalaiyaan,M. and Eliceiri,G.L. TITLE Genes for E1, E2, and E3 small nucleolar RNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (19), 9001-9005 (1993) MEDLINE 94022303 REFERENCE 2 (bases 1 to 6353) AUTHORS Jackers,P., Minoletti,F., Belotti,D., Clausse,N., Sozzi,G., Sobel,M.E. and Castronovo,V. TITLE Isolation from a multigene family of the active human gene of the metastasis-associated multifunctional protein 37LRP/p40 at chromosome 3p21.3 JOURNAL Oncogene 13 (3), 495-503 (1996) MEDLINE 96330329 REMARK Erratum:[[published erratum appears in Oncogene 1997 Feb 6;14(5):627]] REFERENCE 3 (bases 1 to 6353) AUTHORS Jackers,P. 
TITLE Direct Submission JOURNAL Submitted (22-DEC-1995) Pascale Jackers, Department of Pathology, University of Liege, Tour de Pathologie, B23, 3eme Etage, Liege 4000, Belgium FEATURES Location/Qualifiers source 1..6353 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(525..578,1430..1595,2410..2528,4591..4836,5486..5614, 5733..5898,6112..6272) /product="37 kD laminin receptor precursor/p40 ribosome associated protein" prim_transcript 525..6272 /note="37 kD laminin receptor precursor/p40 ribosome associated protein" prim_transcript 527..6272 /note="37 kD laminin receptor precursor/p40 ribosome associated protein" mRNA join(527..578,1430..1595,2410..2528,4591..4836,5486..5614, 5733..5898,6112..6272) /product="37 kD laminin receptor precursor/p40 ribosome associated protein" intron 579..1429 /number=1 exon 1430..1595 /number=2 CDS join(1463..1595,2410..2528,4591..4836,5486..5614, 5733..5898,6112..6206) /note="37LRP/p40; metastasis-associated multifunctional protein" /codon_start=1 /product="37 kD laminin receptor precursor/p40 ribosome associated protein" /db_xref="PID:g1302648" /translation="MSGALDVLQMKEEDVLKFLAAGTHLGGTNLDFQMEQYIYKRKSD GIYIINLKRTWEKLLLAARAIVAIENPADVSVISSRNTGQRAVLKFAAATGATPIAGR FTPGTFTNQIQAAFREPRLLVVTDPRADHQPLTEASYVNLPTIALCNTDSPLRYVDIA IPCNNKGAHSVGLMWWMLAREVLRMRGTISREHPWEVMPDLYFYRDPEEIEKEEQAAA EKAVTKEEFQGEWTAPAPEFTATQPEVADWSEGVQVPSVPIQQFPTEDWSAQPATEDW SAAPTAQATEWVGATTDWS" intron 1596..2409 /number=2 exon 2410..2528 /number=3 intron 2529..4590 /number=3 repeat_region 2625..2923 /rpt_family="Alu" repeat_region 3008..3321 /rpt_family="Alu" exon 4591..4836 /number=4 intron 4837..5485 /number=4 snRNA 4891..5042 /gene="E2 snRNA" /note="gene for E2 small nucleolar RNA" gene 4891..5042 /gene="E2 snRNA" exon 5486..5614 /number=5 intron 5615..5732 /number=5 exon 5733..5898 /number=6 intron 5899..6111 /number=6 exon 6112..6272 /number=7 BASE COUNT 1635 a 1343 c 1557 g 1815 t 3 others ORIGIN 1 taacgcatac gatcagttca atcttgtctt ttttaacttt ttcagagaaa 
acttccctta 61 agtgaacatt taaatctgaa ttacgtcctg ttaaactgtt ctccaggaaa ataaaataaa 121 ataaatcttc aagtttttgt ttacctaaca atttgttgtg tcgaacaaac cttcctactt 181 ttcaggtaac aaaatggcag cttaggctag aaagcgctca tattcgcagg tacaagggct 241 gggtaagaac gccccgcctg gctgactaac ttgagttccg cgctctggac aggaattatg 301 cacagggcgt cgctgtggca ctagaaaccc caaagtcaca agcgccccag atccgaccag 361 gatgccgcta cccgtgacag cccagaagcc cgctcctgcg cgcaagcccg ccttcctgaa 421 gaaaaagccc agtcccggca gcgttcttct ccggctccgc ccttcttccg ctcgactttc 481 tttgccattg gctgacaacg gagtacataa ggacgtcatt tcctgctgcc tgtcttttcc 541 gtgctacctg cagaggggtc catacggcgt tgttctgggt gagttccgtg tagcgtccct 601 ggcgccttcc agggctagaa aaatgagctt ttcctgctca aatgaagggt gagaagacta 661 gtgatgaaag ccggtcagac tggatctgtc tcccgcccgg cgcgccccac cttaggcctg 721 ccgggcacgt ggcaggctcg ggcggcgggt tcccagagtg ccccgggagc gggtggaggc 781 cgccctccag cggaggctcc gagctggggt tcggaccagg ccgcgggtgg gcgggagtgc 841 agaaagcggg ctaacatcct gtgttgctat cttcggagtc ccacacggcg gtgagtcgag 901 gcccaggcgc tgatttacac cagctacttg ggctggtcgg gttttccctt cgcgccgtgc 961 gggtcaggag ttaaggttct cgggttttta gacaaacaag tggtgacagc acagcgaagt 1021 aattccaaag catccgccta ccaatctgct tgaaaatgtc tgaaaacaat tcatgccttt 1081 tttgccttta gtttgcatat tccaaacatg gctgctcttt tatatctagt tgttaacttg 1141 gcgcatccac aacttttcct taattcctat cttgagaagt gttgaatttc cattcgctaa 1201 tttcgtgtag ttttattact cggttactct gccgtccaca ctatttcctc agtaagatgt 1261 gcgctgttcc gtaatacacg acatgtatgg gttaactttc tgtttaccct tcactacact 1321 gtaagctcaa tgcctgacac tatagcagat ggagtttttg gttgctttta agggtgtgcc 1381 ctacttaact caatggaatg aaaagaaata ggttgctctc ttatttcaga ttcccgtcgt 1441 aacttaaagg gaaactttca caatgtccgg agcccttgat gtcctgcaaa tgaaggagga 1501 ggatgtcctt aagttccttg cagcaggaac ccacttaggt ggcaccaatc ttgacttcca 1561 gatggaacag tacatctata aaaggaaaag tgatggttag tcattgcttt aattttttgt 1621 tactccagct gtaagtacaa attttgagct tgctattctc gtggttagtt ctgggtaatt 1681 tctttctatc ttccttaaat gaagccagac ccctacgttg aaaacatact ttaaataaat 1741 gttcttgtta 
ttgagagaga agatttctgc tccaggggag aatatgtcag ttgtctgagt 1801 tacgggactt gggttgggtc atgaacatga gaaagttcat tggctgagtt tctaatagct 1861 cgcatagtgc cttcaacttc ataggtactt acacaggcta ttgaactgaa ttttgagtgg 1921 aaagtggggt aagaggtggg aatgaacaat atggtttgga agaaccagag gatggaaggc 1981 attaagttgg ctaaggcact tacttatttt agagataaag ctaccaagga cttgggagtg 2041 acatgctgta aaggagtttt aggatgaatc ttgcatcagt gcagatgata aaagggaaag 2101 agtggcagaa agccagaagt gatcttaaga accaatgatg gaataagtac agggaaagag 2161 aaaggaaaga tggattagcc aactgaatct aggctgcaca ctattaaagc tcagggtgga 2221 ggccagtctt ggctcatgaa cttctgagtg tcggaagtgt gctatatcaa tggcaggatt 2281 ttcgctaaca ccagtagagc ttgcctctat gactggagtt tggtagtact cgctgccaca 2341 tagagtaaac ttgagaagaa tgtttgcaca gccaggttaa gtgttacaaa tccttctgcc 2401 ctcacttagg catctatatc ataaatctca agaggacctg ggagaagctt ctgctggcag 2461 ctcgtgcaat tgttgccatt gaaaaccctg ctgatgtcag tgttatatcc tccaggaata 2521 ctggccaggt ttgtggaaca gtggttagtt tttatattat agaaataaag ccttacagac 2581 attgtgagac atatagaaag acataagaaa acagaaataa ctcaggcccg gggcgcgtgg 2641 ctnacgcctg taatccccag cactttggga ggtggagtcc agcgtattac gaggtcagga 2701 gatccacacc atgngtaagc cccatctcta ctaaaaatac aaaaaattag ttgggcgtgg 2761 gggcgggtgc ctgtagtccc agcctactcg gggaggccaa ggtggagaat ggcggtgaac 2821 ccggtgggcg ggagcttgca gtgagctgag atgccggcca ctggttactc caggcctggg 2881 gtggcaggag gccagagctc tgtctcaaaa aaaaaaaaaa aaagaaatag taaaactcaa 2941 actccactgt tagagatctt aagttggact gagccataga aatcgccctt atgacctatg 3001 accttccttt tttttttttt tttttccttc aaagacagtc tctcgctctg ttgcctgttg 3061 gcccatcctg gactgcagtg gtgtgatant cggctcattg cagcctttgc ctcctaagct 3121 caaggattct cctgcctcat cctcccgagt agctggaatt ccaaggatgt gccaccatgc 3181 ctgactgatt tttgtatttt tagtagagac atggttttac catttggcca ggctggtctt 3241 aaacgcctga tctcaagtga tcctcccacc tcggcctccc aaaagtgttg ggattacagg 3301 cgtaagccac tgcgcccagc ctgtaagacc ttttcaacat acatgctggg catctgggca 3361 atatgttgaa ctttataatt tcattttgtg tagacttggt ggccactaaa tatctagaaa 3421 ttctattaaa gtggcttatc 
acattttccc atggtcaaag tttttacctt aaaacaatga 3481 aatacctggg acagaaattt tttaagaatc ttaggccagt catgatggca tgcaccccat 3541 agtttccccg ttaattggga gtctgagcca ggagatccct caagtccagg agctgggggc 3601 tgcagtaagt tatgattttg ccactgcact ccaagcctgg ggcacagagt gagatcctgt 3661 ttcttaaaaa aaaaaagcca gtcatgggtg gctcatgctt ttgtaatatg gggtactttg 3721 tgaggctgag ggctggtggc ttgcttgagc ccagaagttc gagaccaggg tgggcaacgt 3781 ggcaaaactc cgtctctacc aaaattaaca aaaacgagct gggtgtggtg ttggcgtgcc 3841 tgtagtctca cgctgaggct gaggtgggga gattgctaga acacaggagg ttgaggcagc 3901 agggagccaa gactgtccac tggcatccca gcctgagtga cagagtgagg ccttgtctca 3961 aagagaacct ctgccaactt gaatgaaaaa taccttaatt tgtattttta agaaatgact 4021 agttagtaaa acttgaatct tttttgccac agctctattt taataatttt ttctgttgag 4081 aaaactagta gatttcttga attctaagag ttctaacagg gattttattc atcttaacta 4141 ttttgacaga tttaaaaatt atgtagtaag actttatttc aatgttgaca ggacggttcc 4201 catgtgtgtt tttgcagagg gttattcctg aaactgactt gttactagga ccacatagtt 4261 tggtacacag tagaaaaaag gcagttagtt attgctgtag aatgaactga gtgacagttc 4321 ttgcttgctt gagaaaataa tactgagcat ctagattggg ccaggtagtg ttgctaggtg 4381 ctcgggatat atagtagaaa aacaagcctg tcttttttaa tgtatgcagg aaatctctgg 4441 agaagtaaga ggggttaagg aaagctgggt atgtgcctgc tttacatgga gtagtagtga 4501 ttaagttggc cagtgcccag aagtgcttac tgggtgccag gattgttgct gtggaatatc 4561 gagtaccact aacttttaaa ttcttcaaag agggctgtgc tgaagtttgc tgctgccact 4621 ggagccactc caattgctgg ccgcttcact cctggaacct tcactaacca gatccaggca 4681 gccttccggg agccacggct tcttgtggtt actgacccca gggctgacca ccagcctctc 4741 acggaggcat cttatgttaa cctacctacc attgcgctgt gtaacacaga ttctcctctg 4801 cgctatgtgg acattgccat cccatgcaac aacaaggtaa tgattttagg atctagagtt 4861 tgtgaatgcg tgctctagaa aaaacattcc tgtgcacatt gttagagctt ggagttgagg 4921 ctactgactg gccgatgaac tcgcaagtgt aggtagtgtg ctacatgagg ggcaagtttt 4981 cgctaacacc acaagggtct ctggcccaat gagtggagtt tgatagtaat tcttgctaca 5041 agtataacat tactgcatga cagctttgtg gagaaatgaa aacatttgga aaatagtgtg 5101 ttcttctgcc tttgtccgtg tttcttgcct 
caggcctcag cgacttggcc tttgttttca 5161 caccaacttg atgggttcta ctataagatg ctaaaaaggt gggttgtgtg tggttcagat 5221 tggttgattt gtaaccctag ttgtgcataa gaattgccca gatatttttt ttaaatgcca 5281 gtgtccaggc attttttttc aatccctatg attccagtgt gcagacaagg ttgaaaatcc 5341 ctatttataa tgctccctgt tacaattgct tttcaactaa gagatgtctg tacttttggt 5401 acctagatga tgtaagagcc aggaaggtgc ttgctgtttg ggtttgacca agtgtcactt 5461 tttaataatc tgccactctt ggcagggagc tcactcagtg ggtttaatgt ggtggatgct 5521 ggctcgggaa gttctgcgca tgcgtggcac catttcccgt gaacacccat gggaggtcat 5581 gcctgatctg tacttctaca gagatcctga agaggtaagc ttctccaaag gcttgtggtt 5641 acataagcaa attggacgac ttggactgct gtctaggaag caaaacttgt cagtccctgt 5701 aagtctcttc ctcttttttt tttgtaaccc agattgaaaa agaagagcag gctgctgctg 5761 agaaggcagt gaccaaggag gaatttcagg gtgaatggac tgctcccgct cctgagttca 5821 ctgctactca gcctgaggtt gcagactggt ctgaaggtgt acaggtgccc tctgtgccta 5881 ttcagcaatt ccctactggt atgtatcagg atagaggtga atcaagctga tattttgcaa 5941 cttctcagtt ttattctaac tttaatgatc tctgtgactt ttatactagc tttaagaggt 6001 tttcattcca gtgtgctaca gcatctgata gactgctgtt gggagtgggg taaggaaaaa 6061 tactacattg aggacagagc tgatgggctt tttttggtat tctcttaaca gaagactgga 6121 gcgctcagcc tgccacggaa gactggtctg cagctcccac tgctcaggcc actgaatggg 6181 taggagcaac cactgactgg tcttaagctg ttcttgcata ggctcttaag cagcatggaa 6241 aaatggttga tggaaaataa acatcagttt ctaaaagttg tccttcattt agtttgcttt 6301 ttactccaga tctagaatac ctgggattgc atatcaaagc aagtaataat aaa // LOCUS HSU45984 3693 bp DNA PRI 09-JUL-1997 DEFINITION Homo sapiens CCR6 chemokine receptor (CMKBR6) gene, complete cds. ACCESSION U45984 NID g2246432 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3693) AUTHORS Baba,M., Imai,T., Nishimura,M., Kakizaki,M., Takagi,S., Hieshima,K., Nomiyama,H. and Yoshie,O. 
TITLE Identification of CCR6, the specific receptor for a novel lymphocyte-directed CC chemokine LARC JOURNAL J. Biol. Chem. 272 (23), 14893-14898 (1997) MEDLINE 97313465 REFERENCE 2 (bases 1 to 3693) AUTHORS Lautens,L.L., Modi,W. and Bonner,T.I. TITLE Cloning, Tissue Distribution and Chromosomal Localization of a potential G-Protein-Linked Receptor JOURNAL Unpublished REFERENCE 3 (bases 1 to 3693) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (16-JAN-1996) Tom I. Bonner, Lab of Cell Biology, NIMH, Bldg 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..3693 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27" /clone="GPR-CY4" mRNA join(35..140,237..3136) gene 132..1352 /gene="CMKBR6" CDS join(132..140,237..1352) /gene="CMKBR6" /note="G protein-coupled receptor" /codon_start=1 /product="CCR6 chemokine receptor" /db_xref="PID:g2251211" /translation="MSGESMNFSDVFDSSEDYFVSVNTSYYSVDSEMLLCSLQEVRQF SRLFVPIAYSLICVFGLLGNILVVITFAFYKKARSMTDVYLLNMAIADILFVLTLPFW AVSHATGAWVFSNATCKLLKGIYAINFNCGMLLLTCISMDRYIAIVQATKSFRLRSRT LPRSKIICLVVWGLSVIISSSTFVFNQKYNTQGSDVCEPKYQTVSEPIRWKLLMLGLE LLFGFFIPLMFMIFCYTFIVKTLVQAQNSKRHKAIRVIIAVVLVFLACQIPHNMVLLV TAANLGKMNRSCQSEKLIGYTKTVTEVLAFLHCCLNPVLYAFIGQKFRNYFLKILKDL WCVRRKYKSSGFSCAGRYSENISRQTSETADNDNASSFTM" polyA_signal 3113..3118 polyA_site 3137 BASE COUNT 1015 a 726 c 828 g 1124 t ORIGIN 1 aactgtagtg cattttgcct tctttccttc ttagagtcac ctctactttc ctgctaccgc 61 tgcctgtgag ctgaaggggc tgaaccatac actccttttt ctacaaccag cttgcatttt 121 ttctgcccac aatgagcggg gtaagatttt tatttttggc aaggggtata atttgggttc 181 actgtggcta cttgaacact acactgcagc taactctatc tttgtttcct ttccaggaat 241 caatgaattt cagcgatgtt ttcgactcca gtgaagatta ttttgtgtca gtcaatactt 301 catattactc agttgattct gagatgttac tgtgctcctt gcaggaggtc aggcagttct 361 ccaggctatt tgtaccgatt gcctactcct tgatctgtgt ctttggcctc ctggggaata 421 ttctggtggt gatcaccttt gctttttata agaaggccag gtctatgaca gacgtctatc 481 tcttgaacat ggccattgca gacatcctct 
ttgttcttac tctcccattc tgggcagtga 541 gtcatgccac tggtgcgtgg gttttcagca atgccacgtg caagttgcta aaaggcatct 601 atgccatcaa ctttaactgc gggatgctgc tcctgacttg cattagcatg gaccggtaca 661 tcgccattgt acaggcgact aagtcattcc ggctccgatc cagaacacta ccgcgcagca 721 aaatcatctg ccttgttgtg tgggggctgt cagtcatcat ctccagctca acttttgtct 781 tcaaccaaaa atacaacacc caaggcagcg atgtctgtga acccaagtac cagactgtct 841 cggagcccat caggtggaag ctgctgatgt tggggcttga gctactcttt ggtttcttta 901 tccctttgat gttcatgata ttttgttaca cgttcattgt caaaaccttg gtgcaagctc 961 agaattctaa aaggcacaaa gccatccgtg taatcatagc tgtggtgctt gtgtttctgg 1021 cttgtcagat tcctcataac atggtcctgc ttgtgacggc tgcaaatttg ggtaaaatga 1081 accgatcctg ccagagcgaa aagctaattg gctatacgaa aactgtcaca gaagtcctgg 1141 ctttcctgca ctgctgcctg aaccctgtgc tctacgcttt tattgggcag aagttcagaa 1201 actactttct gaagatcttg aaggacctgt ggtgtgtgag aaggaagtac aagtcctcag 1261 gcttctcctg tgccgggagg tactcagaaa acatttctcg gcagaccagt gagaccgcag 1321 ataacgacaa tgcgtcgtcc ttcactatgt gatagaaagc tgagtctccc taaggcatgt 1381 gtgaaacata ctcatagatg ttatgcaaaa aaaagtctat ggccaggtat gcatggaaaa 1441 tgtgggaatt aagcaaaatc aagcaagcct ctctcctgcg ggacttaacg tgctcatggg 1501 ctgtgtgatc tcttcagggt ggggtggtct ctgataggta gcattttcca gcactttgca 1561 aggaatgttt tgtagctcta gggtatatat ccgcctggca tttcacaaaa cagcctttgg 1621 gaaatgctga attaaagtga attgttgaca aatgtaaaca ttttcagaaa tattcatgaa 1681 gcggtcacag atcacagtgt cttttggtta cagcacaaaa tgatggcagt ggtttgaaaa 1741 actaaaacag aaaaaaaaat ggaagccaac acatcactca ttttaggcaa atgtttaaac 1801 atttttatct atcagaatgt ttattgttgc tggttataag cagcaggatt ggccggctag 1861 tgtttcctct catttccctt tgatacagtc aacaagcctg accctgtaaa atggaggtgg 1921 aaagacaagc tcaagtgttc acaacctgga agtgcttcgg aaagaagggg acaatggcag 1981 aacaggtgtt ggtgacaatt gtcaccaatt ggataaagca gctcaggttg tagtgggcca 2041 ttaggaaact gtcggtttgc tttgatttcc ctgggagctg ttctctgtcg tgagtgtctc 2101 ttgtctaaac gtccattaag ctgagagtgc tatgaagaca ggatctagaa taatcttgct 2161 cacagctgtg ctctgagtgc ctagcggagt tccagcaaac 
aaaatggact caagagagat 2221 ttgattaatg aatcgtaatg aagttggggt ttattgtaca gtttaaaatg ttagatgttt 2281 ttaatttttt aaataaatgg aatacttttt ttttttttta aagaaagcaa ctttactgag 2341 acaatgtaga aagaagtttt gttccgtttc tttaatgtgg ttgaagagca atgtgtggct 2401 gaagactttt gttatgagga gctgcagatt agctagggga cagctggaat tatgctggct 2461 tctgataatt attttaaagg ggtctgaaat ttgtgatgga atcagatttt aacagctctc 2521 ttcaatgaca tagaaagttc atggaactca tgtttttaaa gggctatgta aatatatgaa 2581 cattagaaaa atagcaactt gtgttacaaa aatacaaaca catgttagga aggtactgtc 2641 atgggctagg catggtggct cacacctgta atcccagcat tttgggaagc taagatgggt 2701 ggatcacttg aggtcaggag tttgagacca gcctggccaa catggcgaaa cccctctcta 2761 ctaaaaatac aaaaatttgc caggcgtggt ggcgggtgcc tgtaatccca gctacttggg 2821 aggctgaggc aagagaatcg cttgaaccca ggaggcagag gttgcagtga gccgagatcg 2881 tgccattgca ctccagcctg ggtgacaaag cgagactcca tctcaaaaaa aaaaaaaaaa 2941 aaaaaaggaa agaactgtca tgtaaacata ccgacatgtt taaacctgac aatggtgtta 3001 tttgaaactt tatattgttc ttgtaagctt taactatatc tctctttaaa atgcaaaata 3061 atgtcttaag attcaaagtc tgtattttta aagcatggct ttggctttgc aaaataaaaa 3121 atgtgttttg tacatgaagt aggaatcgta tttcagcttc aaggttcaga ttgaggggcc 3181 cactgtttgg agaggatggt attcaggctt tctcatgtcc ttcaaatctg ttagcgtttg 3241 actctagaaa tcaaagcaaa ggagtggtta cccagacact tcttttggtg tgatcaatgc 3301 gctgatgtga tctatgaaga tgattcatgc ttgaaaacta gcacagaaac atcttgctta 3361 tttgccaaag ctgggagatg agcttctctg cataatttaa atgttcagat aaatgaagct 3421 gacttattta agcaataacc ttttaaacat tttagctaag atgtataaaa atgtttccaa 3481 aatataccac atactttatt tcttcttaaa tgtagtacat taggttacat catttttctt 3541 gctgtcttgg gcatcaaaac aggtgccatg gtaacctgac actctcagga gacattaaga 3601 tagaaggggc tgttcttcag tggttcccat tgattctccc catatctttt tgctctcagg 3661 ctctggccgt ctcttcctga gccttaactg tgt // LOCUS HSU46165 5510 bp DNA PRI 18-OCT-1996 DEFINITION Human Rad GTPase gene, complete cds. ACCESSION U46165 NID g1620562 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Reynet,C. and Kahn,C.R. TITLE Rad: a member of the Ras family overexpressed in muscle of type II diabetic humans JOURNAL Science 262 (5138), 1441-1444 (1993) MEDLINE 94069319 REFERENCE 2 (bases 1 to 5510) AUTHORS Caldwell,J.S., Moyers,J.S., Doria,A., Reynet,C. and Kahn,R.C. TITLE Molecular cloning of the human rad gene: gene structure and complete nucleotide sequence JOURNAL Biochim. Biophys. Acta 1316 (3), 145-148 (1996) MEDLINE 96375161 REFERENCE 3 (bases 1 to 5510) AUTHORS Moyers,J.S. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) Julie S. Moyers, Research, Joslin Diabetes Center and Harvard Medical School, One Joslin Place, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..5510 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(1720..1827,2037..2421,3320..3393,3519..3723, 4842..5510) /product="Rad GTPase" CDS join(2052..2421,3320..3393,3519..3723,4842..5119) /codon_start=1 /product="Rad GTPase" /db_xref="PID:g1620563" /translation="MTLNGGGSGAGGSRGGGQERERRRGSTPWGPAPPLHRRSMPVDE RDLQAALTPGALTAAAAGTGTQGPRLDWPEDSEDSLSSGGSDSDESVYKVLLLGAPGV GKSALARIFGGVEDGPEAEAAGHTYDRSIVVDGEEASLMVYDIWEQDGGRWLPGHCMA MGDAYVIVYSVTDKGSLEKASELRVQLRRHVQTDDVPIILVGNKSDLVRSREVSVDEG RACAVVFDCKFIETSAALHHNVQALFEGVVRQIRLRRDSKEANARRQAGTRRRESLGK KAKRFLGRIVARNSRKMAFRAKSKSCHDLSVL" BASE COUNT 1134 a 1535 c 1704 g 1116 t 21 others ORIGIN 1 agctcagatg ccagagggaa ccagccctcc ttctttgcat gcccccctgc tgggaaggcc 61 nacagctacc cgccaccttc gccctctgca agacacccag aggaggctct ttccagtggc 121 acagtcatgg ggttggaggg ggcacagnca gcccagccag gattcctcat gggaagtgtg 181 tggtgcctct ttagacttgc ctgctaacgg gagnctctgc ccataacttc ctcttcccac 241 tggagttcgt tcaggaagcc agaaccagca tggggatgct cagccctcca agatccccta 301 ggaccatggg cagcccaccc ctcagcccag cttttgccta ctttgaagac ctcaaggaca 361 agagggccct aatcttaccg agccagnccc tcctcaatcc aaggccttct 
taggcgttgc 421 caaggaaact gatgcagcag ccaaagagcc tggcccaggt ctccatgaca acctaccaca 481 gcccaccagg gtccatccat ctgtgatgtc acctgagccc caaatggatg ccctcccacc 541 cccaccacac ttgtcacaat cagatgtcac agtctgatag caggtccctg tcacaaaata 601 cctcacagcc tgacctgcaa ccctcagacc cctgagagat gagcagggag ctaagacact 661 cctcatgcct tgaccacacc ctgtgtttat gccccttttg ggtgggggcc tgaacctcca 721 ggccctggat gtgccctggg tggctggctc agcctacagc tagagggaag aagggagacc 781 ttgacgactc gaagtgctgc tcctgctggc attcaacacc cattggcctg gatttcccca 841 actctgggca acttccagtc caggagcaga ataaccagag ccacgnccag ggccagggct 901 ggggcagatt atttattttt agcagagttg ctggcagctg cctggtgggt gctgggaggt 961 gggatgggga ctgagtttaa tcccttccct cttctgggct ggggtatcct ccccagggtt 1021 caagcaggca tctccacctg caactctgtc agagctgccc ccacccccaa caccctctca 1081 gggcctccag gggtcatggg tgacacaacc ctcacactca ccactgaagg ttaggctcat 1141 tagggagaat cttcaactcc aggatgggga gtggaaagga ttttgtgtga atttaggaca 1201 ccccctaaat tctagctcag cctgagccta tggggagtgg gtgtcatggg cagagaaggg 1261 ggatctctgt ccagcctgta actagcttac ttgctaaaag gacgagccgg gacttgggga 1321 gaggaaaggg agaattcagt ctcacggggc cagaccgaat tcttcctcca gagggcagtt 1381 gctgcttttg gctgattggg tttaccccgc tgcctgcctc cccatcacgt actcccaccg 1441 caacctcgct ctctctctcc ttctcacaca caccctcaag ctccggacct cgcctaaccg 1501 gtcagcccct ctattcccag acagctaccg ctactcccct ggcgggtagg gcggggtgtc 1561 ggggatgtaa ggtccgagga agagggtggg ggattccccc tggtgggggt ggacagacac 1621 gtcagccncc ncancagcgt ggaccggggg cgggagcggg gncgggtgga ggcttaaata 1681 aggcgactct ggaccagagc ccatgcggct gagaaggcgg tggctgcagc agcagcggcg 1741 gcggaaaccc taaagtccga gtccggacta cgagtgcgtg gcctcctaat ccggatccta 1801 gtcctgagcg tgtctgtgtg cgagtgggtg agtgtcctgg cggagcgggg tcggggcggc 1861 tgcagatgga tgcgccaggt ccggaggggt cccgggtgag gcgccgcgac ctgggggctg 1921 ggagggctgc ggcgggacct cggccggagc atcccgcctg actccgggtt ggcaggtgcg 1981 ccccgggtcc cgcctcctcg cggcttacac ggccactcgg gttcctgatc ccctagacgg 2041 tcccggacgc gatgaccctg aacggcggcg gcagcggagc gggcgggagc cgcggtgggg 2101 
gccaggagcg cgagcgccgt cggggcagca caccctgggg ccccgccccg ccgctgcacc 2161 gccgcagcat gccggtggac gagcgcgacc tgcaggcggc gctgaccccg ggtgccctga 2221 cggcggccgc ggccgggacg gggacccagg gtcccaggct ggactggccc gaggactccg 2281 aggactcgct cagctcaggg ggcagcgact cagacgagag cgtttacaag gtgctgctgc 2341 tgggggcgcc cggcgtgggc aagagcgccc tggcgcgcat cttcggcggt gtggaggacg 2401 ggcctgaagc agaggcagca ggtgagggga ccagccgttc aggggagggg catcccgggg 2461 ctaggactag gggctgagac cggtactcaa agccgcaaat acgttnaatt atattgttgt 2521 tgttgttgtt attattatta ttattattat tttattatta ttattttgag acagagtttc 2581 gctctttttn ccaggctgga gtgcagtggc ctcgatctag gctcactgca acctccgcct 2641 tcaggtttca agggattctc ctgcctcagc ctccagagta gctaggatta caggcgcctg 2701 ctaccacgcc cggctaatat tgtgttttta gtagagttgg ggtttcacca tgttggccag 2761 ntggctggaa tttctgacct cgtgatccgc ccaccacctt ggcttcccaa agtnctggga 2821 ttacagncgt gacagccgca cctggccaat actgtggaat tctattggcn ctttcagaat 2881 cattgcccta cctacccact tcataaatgg ggaaaggaaa ggagaaatcg ggacaccccc 2941 cagncctttg gtaggaagaa atgcccaggt agaaggccag ggcatagata tggctactct 3001 gtgggatggg actggacatc cctccctgcc tatcagggac aggtcagttt ctaacagagt 3061 ttctaaccag agggatttct aagagaggag ccctaggtgg ccaaggtcca ggggggatgt 3121 gggtgaaggt caggcacacc atgccctcca gcccccactt ccctgcctgg cactgctgtt 3181 gccacatacc agcatctgca agtggagtgg agggtaaagg agggctctgg ggaagagttg 3241 gagggtgtgg ccagaacagg aaacctggca gcagcggctt tgacaaccag ctggtaacct 3301 gtctactgcc ttctgctagg gcacacctat gatcgctcca ttgtagtgga cggagaagag 3361 gcatcactca tggtctacga catttgggag caggtagggt gcaagcagag acccagggca 3421 ggggagggag gagtgagttg ggttagaggt ctgggagtgc caggctcgtg tcgggggcat 3481 gaggactgtt gggcagatgt gtgttgtgtg ttctccagga cgggggccgc tggttgcccg 3541 gccactgcat ggccatgggg gatgcctatg tcattgtgta ctcagtgacg gacaagggca 3601 gcttggagaa ggcctcagaa ctgcgggtcc agctgcggcg gcacgtgcaa acagatgatg 3661 tgcccatcat cctcgtgggc aacaagagcg acctggtgcg ctctcgtgag gtctcggtgg 3721 atggtgagta ggcggcaggt tgcaggggga ggagagccca tcggcctctg atccaggcag 3781 tcttggctca 
gcttgagtcc ctggggatca tcagaggcta aaaatgttca gaaagaaaac 3841 tngggaaatg ggagagagct ggtgcctgct cacagggctc cggaaggccc cacagntggg 3901 gaaagaacca aatctgatag gggagctggg caccgcaggc ctcctcctgc cccctcttgg 3961 tcctggcctg gctggttggg ggggtaaatg ggtggaagag agtgtagggg aacaggccag 4021 gaggtcagag ctgtgtgcta ggtctgggcc attcttagag tagacttggc actggggaag 4081 acccttccca cttggnctca gcatcctttt ctgtggaccc agtactgagn ccctcagagg 4141 aaccagcagt gattggtgac ttgtctccgc aaccccagtc cactggatca ttcattcatc 4201 atgatatata acctcgaatt aaaaaatact aagttgtgcc gggcatgggc cagggggtcg 4261 taacgcctgt aatcccagca ctttgggagg ccgaggcggg tggatcacga ggtcaggaga 4321 tcgagaccat cctggctagc atggtgaaac ccccatctct actaaaaata caaaaattag 4381 ccgggcgtgg tgcgggcgcc tgtagtccca gcgactcggg aggctgaggc aggagaatgg 4441 cgtgaacccg ggaggcggag ctttcagtga gcagagattt ggccactgca ctccagcctg 4501 ggagacagcg agactctatc tcagaaaaat aaaataaaat aaaataaaat actaagttgt 4561 ccaacatagt tctcattttc ctgtgtttac tactgtcgta ctttgtttaa atatcaggct 4621 tcaaattatt tcaataaatg gatctccttt tgagggccac tgttttgggg gctccttaaa 4681 cgcaggactc tgcctcagta caggagctgg ggtctctgct gtttccgggg aggaccttgc 4741 aaattctcga gtggcagctt ccatgggtgg ccactgttgg aggggactcc tacttcaccc 4801 accctttgtt tactggtgcc tgcctttccc nacccacaca gagggccggg cctgcgcggt 4861 ggtctttgac tgcaagttca ttgagacatc agcggcattg caccacaatg tccaggcgct 4921 gtttgaaggt gtcgtgcgcc agatacgcct gcgcagggac agcaaagaag ccaacgcacg 4981 acggcaagca ggcacccgga ggcgagagag ccttggcaaa aaggcgaagc gcttcttggg 5041 ccgcatcgta gctcgtaaca gccgcaagat ggcctttcgc gccaaatcca agtcctgcca 5101 cgacctctcg gttctctagg tcccacccgc tcccactatg gtgggagacg aacggaaggg 5161 ttggtgggct ggcccagcca actgccccgg gtgcctcaga gcaggctcag actctgggtc 5221 cctcggagct gccagccggg cacccccaac ctcatggtca tggacagata gacagtgctg 5281 ccctgcgaag tggctctcag gggccagtga gggctgggcc cacagagatg catgcgcagg 5341 ctcatatgcg tcccaagcag ccgcagcgca gccgccgggc aggcctgcgt gccgggagag 5401 gactctgcct tttttcacag cccgggtgtg cctgccctgg agggaggctc ttcagtgcgg 5461 tagctacttg tttacatgca 
gatttttgta ataaaggcta tttcctgata // LOCUS HSU46692 2822 bp DNA PRI 08-MAY-1996 DEFINITION Human cystatin B gene, complete cds. ACCESSION U46692 NID g1255783 KEYWORDS progressive myoclonus epilepsy (EPM1) disease gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2822) AUTHORS Pennacchio,L.A., Lehesjoki,A.-E., Stone,N.E., Willour,V.L., Virteneva,K., Miao,J., D'Amato,E., Ramirez,L., Faham,J., Koskiniemi,M., Warringtion,J.A., Norio,R., de la Chapelle,A., Cox,D.R. and Myers,R.M. TITLE Mutations in the gene encoding cystatin B in progressive myoclonus epilepsy (EPM1) JOURNAL Science 271 (5256), 1731-1734 (1996) MEDLINE 96179676 REFERENCE 2 (bases 1 to 2822) AUTHORS Pennacchio,L.A. and Myers,R.M. TITLE Direct Submission JOURNAL Submitted (19-JAN-1996) Len A. Pennacchio, Genetics, Stanford University, Stanford University School of Medicine, Stanford, CA 94305-5120, USA FEATURES Location/Qualifiers source 1..2822 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.3" 5'UTR 321..416 mRNA join(321..482,1926..2027,2355..2733) gene 417..2483 /gene="cystatin B" CDS join(417..482,1926..2027,2355..2483) /gene="cystatin B" /note="EPM1 disease gene; cysteine protease inhibitor" /codon_start=1 /product="cystatin B" /db_xref="PID:g1235678" /translation="MMCGAPSATQPATAETQHIADQVRSQLEEKENKKFPVFKAVSFK SQVVAGTNYFIKVHVGDEDFVHLRVFQSLPHENKPLTLSNYQTNKAKHDELTYF" repeat_region 1479..1741 /rpt_family="Alu" 3'UTR 2484..2733 polyA_signal 2709..2715 polyA_site 2733 BASE COUNT 573 a 830 c 800 g 619 t ORIGIN 1 ctgcaggatt gcccctactc cgactgcccc ttccctatcg tcccaccctg cgcgcccaac 61 ccaccggcga cacccggccg cgcccccgcc ccggtccgtg tgactcggcg cccggaaaga 121 cgataccagc cccgggaggg gggcgctccc tcccgacacc agcgctgggc gcggagaccc 181 agcctgcggc gagtggtggc caggctcccc gccccgcgcc ccgccccgcg ccccgccccg 241 cgcgtccctt cttgcggggc caccgcgacc ccgcagggga ctccgaagcc 
aaagtgcctc 301 ctccccgccc cttggttccg cccgcgcgtc acgtgacccc agcgcctact tgggctgagg 361 agccgccgcg tcccctcgcc gagtcccctc gccagattcc ctccgtcgcc gccaagatga 421 tgtgcggggc gccctccgcc acgcagccgg ccaccgccga gacccagcac atcgccgacc 481 aggtgggtgg gccgcgggga cggggccggc ccggagtcct gccttagcct cagggcgcgg 541 ccgcggctcc tggagcgaaa gaagccgctt tggccccgct gcgcacccct gggctggccc 601 gggctgtggc cgtgagaggc ctccctccgc tcgggtcgcg cgcgagtgag cagcgggggg 661 ccgcgcctgg ggcgccgctg gggagacatt gggctccgct gaatacagca agggcgagtg 721 ggaattgata gcccggagca gggtgcggtc cctgcatgga cagtctctga gaggaaaccc 781 cagggatgag gcgcttctgg tttcaggcag gcagggtgat cgggcgtcgc cggcgatggc 841 gcaggtgagc agccggctcc gatctccacg gtgatccgat agcaagcggg tgggaagggt 901 ctggctaaac tgacttagcc aggcttcttg ctaaaagtgg attttacaag gaagtgcgca 961 ggtggcctag gcgcttcagg agcccgacta cagtttggcc aagcaagaat ctttgtcaat 1021 atcctcatct agttcgggaa aaaaatcatg agagagagtg caagaggtcc ccagtgataa 1081 ggcacatggg ttaaaaactt aagtgtatct gcataaaagg tccacaggtt tctttacatg 1141 cttccgattc tagcactgtt tcaaactgta agtctaaata aaaagttaaa acacagaaaa 1201 acaagataaa aaccgggctg ggttgcagat ggcaactttc cctgtgtctc ggtttcctcg 1261 tctgtaaaat ggacgtcctg ttgctctgcg cctgccagaa gattctggag gggctgaaat 1321 gagcaggtca tctgtgcaag aagccccctc cggtggagca caggccaggc ccgcctcgct 1381 gtcatggttg gtgaccgacg ggatgcccca agcaagaaca ggtccaggcg atgctgaggc 1441 ctgtgttttt tttttttgtt tttgagactc agtctcaact cttgcccagg gtggagtgca 1501 gtggcacaat ctcggcccac tgcaacctcc gcttcccagg ttcaagggat tctcctgcct 1561 tagcctcccg agtagctggg attgcaggtg ctcgccacca cgcccagcta atttttgtat 1621 ttttagtaga aacggggttt tgccatttgg ctaggctggt ctcaaactcc tgacctcaag 1681 tgatccgccc acctcagcct cccaaagttc tgggattaca tccttgagcc accgtaccca 1741 gctggaactg tttttttcta ctttattatt aggctgacag tttaaatgtc ccttcagttg 1801 taagagacaa ttgtgtgaag agccagtgtc agaatcgtgt gtgtgctcac atgcgtgcaa 1861 gttactctag caggagggaa tccaagaagc cactgagaca tcctcattct gtcccttctg 1921 tctaggtgag gtcccagctt gaagagaaag aaaacaagaa gttccctgtg tttaaggccg 1981 
tgtcattcaa gagccaggtg gtcgcgggga caaactactt catcaaggta gagtgtgggc 2041 ctcaggaggg cctgccccga acgggtgctg gtaggaaacc gcctgtgcag gcccgggctg 2101 tgtggtctta ggtgctgggg cgccctgtgg ctgccccctg agataagcat cctactgtgt 2161 gtgtccatcg gcctttcagg aggactaggg cttctgggga gctaagaacc ccaaggaaac 2221 aagtgtggga tgtgaggcat cccctgcaca tgcaggagaa gacaagattg tcttcagctg 2281 gctgctaatg acctggaggg gcgcagcaag gtgacttggg atcagaggct tcgctcactc 2341 cgctctcttc ccaggtgcac gtcggcgacg aggacttcgt acacctgcga gtgttccaat 2401 ctctccctca tgaaaacaag cccttgacct tatctaacta ccagaccaac aaagccaagc 2461 atgatgagct gacctatttc tgatcctgac tttggacaag gcccttcagc cagaagactg 2521 acaaagtcat cctccgtcta ccagagcgtg cacttgtgat cctaaaataa gcttcatctc 2581 cgctgtgccc ttggggtgga aggggcagga ttctgcagct gcttttgcat ttctcttcct 2641 aaatttcatt gtgttgattt ctttccttcc caataggtga tcttaattac tttcagaata 2701 ttttcaaaat agatatattt ttaaaatcct tacagattgc ctcctttgct tttagacttt 2761 tttcttgctg ctaaccaccc cgggcaggtc cttcccctcc aggcaggagg gcggagagag 2821 tc // LOCUS HSU46920 5935 bp DNA PRI 23-MAY-1996 DEFINITION Human metaxin (MTX) gene, complete cds. ACCESSION U46920 NID g1326107 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5935) AUTHORS Long,G.L., Winfield,S., Adolph,K.W., Ginns,E.I. and Bornstein,P. TITLE Structure and organization of the human metaxin gene (MTX) and pseudogene JOURNAL Genomics 33 (2), 177-184 (1996) MEDLINE 96301393 REFERENCE 2 (bases 1 to 5935) AUTHORS Ginns,E.I. TITLE Direct Submission JOURNAL Submitted (23-JAN-1996) Edward I. 
Ginns, Clinical Neuroscience Branch, IRP/NIMH/NIH, B1EE16 Bldg.49, Bethesda, MD 20892-4405, USA FEATURES Location/Qualifiers source 1..5935 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21, between psGBA and THBS3" mRNA join(<1379..1459,2473..2542,2675..2754,4254..4346, 4512..4694,4821..4897,5363..5517,5610..5935) /gene="MTX" /product="metaxin" gene 1379..5935 /gene="MTX" exon <1379..1459 /gene="MTX" /number=1 CDS join(1379..1459,2473..2542,2675..2754,4254..4346, 4512..4694,4821..4897,5363..5517,5610..5824) /gene="MTX" /codon_start=1 /product="metaxin" /db_xref="PID:g1326108" /translation="MAAPMELFCWSGGWGLPSVDLDSLAVLTYARFTGAPLKVHKISN PWQSPSGTLPALRTSHGEVISVPHKIITHLRKEKYNADYDLSARQGADTLAFMSLLEE KLLPVLVHTFWIDTKNYVEVTRKWYAEAMPFPLNFFLPGRMQRQYMERLQLLTGEHRP EDEEELEKELYREARECLTLLSQRLGSQKFFFGDAPASLDAFVFSYLALLLQAKLPSG KLQVHLRGLHNLCAYCTHILSLYFPWDGAEVPPQRQTPAGPETEEEPYRRRNQILSVL AGLAAMVGYALLSGIVSIQRATPARAPGTRTLGMAEEDEEE" exon 2473..2542 /gene="MTX" /number=2 exon 2675..2754 /gene="MTX" /number=3 exon 4254..4346 /gene="MTX" /number=4 exon 4512..4694 /gene="MTX" /number=5 exon 4821..4897 /gene="MTX" /number=6 exon 5363..5517 /gene="MTX" /number=7 exon 5610..5935 /gene="MTX" /number=8 BASE COUNT 1324 a 1553 c 1791 g 1267 t ORIGIN 1 catgcctctc agccggctca ctacccctgg caggcaggca gctgggaggg gaaaaggcta 61 ggcggagaga gcaggagccg gggggcggag ggacagaatt ggaggggggg ccagcaggcg 121 ggtgaggggg ggctgaaaca aagagctgcc aatcaccctg gaggcatcat cagctctggg 181 aaccaggggc actgggaaag aatcctggag gcctccctgc tttctacctc tccctaatga 241 tcctcccccc aggtgcctcc ctctccgcag tcatcattgt tgggggtctc ggcacacagc 301 gctaccgaat gccactgttt tctggaagcc acgattgtgg aaggcaagaa cccctggcag 361 gggcagtgct agacagtggg cattattcca cagctggcag gaaaggatgc caagggagtg 421 gccgcaacta gcttgctata gtgcccaggg atgcccacat ggcatgaatt ttgctgttct 481 ggcggagcaa gtgctcttta ttagggtggg ggtcccctga caagtgcctg tttctccatt 541 ttccctcatt cgtaggaaag cttgggagtc tggaagcgtc tgacgggaac tgaagggaga 601 atctaaagca gtggtcagtg aggtcagagt 
gaagaaacca aaaaagagga aagggacacc 661 agggacaggc gcagaagacc aggggtacag gcgaggagcc aaggaccaga ctgtggaaca 721 ctgtgggtac tagcggacgg gccgtgaccc ggatgaagca gaggggcgcc tgagttcccg 781 tcacctaagc gacagcctgc gagcctctct tcccctcccc cacccaagcc ccagcccggc 841 ctccgctccg gccgccgcca ccgcccctgt tttgtttcca tggcgacagg cggcgcaggg 901 cccgctccaa acataacgcg ctgtggaaaa catgctgctc gggggacccc cccgcagtcc 961 ccgctcgggg acgagcccca aggggccctg gagcagtaca ggccacgtgc agtttggcaa 1021 gagcccccag acctggccca ggcgcacaag accccgctct ccagagcctg ccgcgccttc 1081 aggggttcgg ggctccactt ggacgaggcg ccgtgacact ccgaggcgcg ccgggccgac 1141 agcgctgtcc cgctacgtgg gccacctctg gatgggccgg cggccgccct cccccgaggc 1201 ccgcggccca gtcccccgca gttcagctgc cagtcgggcc agaagaagcc tcgcctcccc 1261 ggggatctcc ccaggccccc tgaccgcaac gatcggaggg gcggtggcgg ggggcgggcc 1321 caggcagggg agggcagaag cacacaagga agtgtttccg ggacagaggg tgggcaagat 1381 ggcggcgccc atggagctgt tctgctggtc agggggctgg gggctgccgt cagtggacct 1441 ggacagcctg gccgtgctgg tgaggggtgg cgccggcgcc ctctgctgtg ccctgatgac 1501 tggggagggg gccgaactag ccaagcatcc aaaccttgcc agggctactc agcacgggcg 1561 cagtggggga aactgaggca gtcaaagagc gtacgtggcc gatatggcct cagtgcccta 1621 tgctgtagtt ttctaggctt aatgcagcaa tgaaaagacg cctggttggg agttcaggtg 1681 atgggttcta gtccttgctc atccattcac tggctgtgtg accttgcgga gatcctttct 1741 ctttgggcat tttttttctc cattcggagg tttactaggc tggactgtct cagggaccta 1801 ctctagcttg gaattcctgc cttctcctca gcccccttct cctcttcctc ctcacttaac 1861 agcatttact gaggctattg tttgttagac caaacgactc acagccaggc tttgtataaa 1921 gagcttagca atcagacttc ctttgcttca tttattcatt gactccttgc tttaaagagg 1981 atctattaga tttcttccaa atattaaaca ttgccaggca aagatgaata aaatttggtt 2041 cctcccttcg agtttacaga aaggcagata ggtaagcaaa taaatataat aaaatattca 2101 catataagtt aaagtaggaa cagaccacag atactgccaa agagaagggc attatcagtg 2161 cttccaagaa gagggatact ggttaggaaa agcttctgag gatagatgtt caccaggtag 2221 atggcaagag aggagataga ggaaaggatt ttcctggctg tcctgtaatc acagacatgt 2281 ccggaatctc tctgagccct gtgttgtcca tttgtaaaat 
agaaatggta gtgactcccc 2341 cttttttttc gagactaaga tactaagtat agggaatggc acataaggta ttcaataaat 2401 gttatttctc ttcttgatac cactgtgggg gacatggctc tgcctctctg acttctgctt 2461 ccttccctgc agacctatgc cagatttact ggtgctccac tgaaggtaca caagatcagc 2521 aacccctggc agagcccttc aggtacccag tatcccttgg gggtaaggaa gggtgtgtac 2581 aaggggagaa gcaagaaata ggcaggaatg tgttgcaact atgtatgact aggcagctaa 2641 gggtctggtt ggtatacttc cctctcaata tcaggaactc tgcctgccct tcggaccagt 2701 catggagagg tcatctcagt tccacacaag atcatcaccc accttcgaaa agaggtaggt 2761 gacttggata gagggggctg ccagtgagag aagttcagtc aattctatac aatacacatt 2821 tattgagcac cagatatatg ccatgctaga tgcaggtgac ccagagcatc aaggagcaac 2881 agtcttgtgg cagagacaca cacaatgtca ctgtgatgta ttaaagcagt cggcaagaga 2941 tgaaacttag ggcactgtga ggcagggagg gagaggagca caggctgaag gagagtggaa 3001 gacagcagtt ggcctctgat ggtgggactg gagagagatt tctaagggcc acttcctgtt 3061 ttcagggact aggttgggct agatatgggg ctcaggatgg acaaggctta gagccaggtt 3121 ggagaagatg aaagagcatt actagaggag tggggaggcc taggctatgc tctttactct 3181 gccattgact atgtgatctc gggcaggcca tgtaacctct cagggctgtg cactccctta 3241 tttgtaaaac tagagggctg ggccagcatg ttttccaagg gttcttctag cattgacggt 3301 caggttccaa gagggaacag tgtcagaagt aggacactct tcctcatttc taacttccca 3361 aaggttgagg acctggggaa ttaattgaga ggcctggaaa gagaggatcc agataaaagc 3421 caaagttcct ggcagagccg aggccactgc cctgattgct taggtgtgaa cagagctttt 3481 ccagtttcca aagtattcta aattatttcc taacacaggg gcattcagag aaagagattc 3541 ttacccccat tccacagggg agaaactgag gcttagggag atttaaggat catgctgaag 3601 aggttatctg gggacaaagc ttactcagga tgtggagggg gatgctgagg aacagggact 3661 ggagcctggg aaggtagggc aggctgagac ctgggggtat gggtggaatg tgtatgtggt 3721 aatggtgtct gagcgatgga atgaagtgaa agaaatagcc aagagctggg cactgaggga 3781 acaagctggg gggccctggg cgtgggttcc ctggatccag ggagtgtgag gatggctttc 3841 tctggttcct gaggcatgga ccgagtccta gcctgcatct gagctccgtt gtgctagctt 3901 tgtggtgtct gggtcgggga agttactgtt agaaagttgc cgtcagcggg cgttgttcca 3961 gctttccatg gaaattctgg gagctgctcc tagtttgcgg cggagccttc 
cttccttccc 4021 tgcattgggc gctggacctt cacggccagg aggctccaag gcccaggctt ttcgccaaca 4081 gcaacaggct actggctggg cccaggcaag ggggccttgg caggaaaagt tccttgctgt 4141 acctccactg cactcagagg ccagtgaggg gggtaccaga caggactcct tcctccctgg 4201 tgaagtgccc ttgcagctcc ctagtgtcca cgccatggat atttcctcca cagaagtaca 4261 atgctgatta tgatctgtca gctcggcaag gggcagacac cctggccttc atgtctctcc 4321 tggaggagaa gttgctcccg gtgctggtga gtgtgcccag acctcccagc atccatggcc 4381 agccggggag ggttcgggaa cacacagacc cacacacagg ctcaggaaag catgggggtc 4441 agaagcccac cttgaatcag acaggtgcac tggctcagac ctacctgttt cttcctgccc 4501 acccaatcca ggtacatact ttttggatag acaccaagaa ctacgtggaa gtgacccgga 4561 agtggtatgc agaggctatg ccctttcccc tcaacttctt cctgcctggc cgcatgcagc 4621 ggcagtacat ggaacggcta cagctgctga ctggggagca caggcctgag gacgaggaag 4681 agctggagaa ggaggtagct ctgagaccgg gggctattgt atgagatgag ccccaaggat 4741 gctggccagg aatgggagtg cttaggtggg gaggtggcac tgttcccaca gctgcaagcc 4801 tacctgtgtc gcccctacag ctgtaccgag aggctcggga gtgtctgacc ctgctctctc 4861 agcgcctggg ctctcaaaag ttcttctttg gagatgcgtg agtctgactc caagagggta 4921 atgggtggct tggaagaaga tacaggttca gatggagcag ctggagctgg ggctggggct 4981 ggggctggct caggctctgg ataggaggtc cctgggatag aaactggccc tagtgacagt 5041 gtgactgtgt gggggccaga gccttctcag aggtacaaaa gggtagggtg ggagggcagc 5101 caggcacagg aggggcctga agagctgtgg ggcactgaat gtgcccttta tgcagccctg 5161 ggatagagcc ctattcaggg ccagctggcg ccacctgggg atctctcccc ataccaggtc 5221 tagaactgtg tgtcctgtcc ttccctggtg gccgcctgct gcccagagcc cacctcccaa 5281 ggctgactct tcctccagct ccatctttac cccttctacc ccagtggttc tcctccatcc 5341 cacccttctc tctctgctcc agccctgcct ccttggacgc cttcgtcttc agctacttgg 5401 ccctgctgct gcaggcaaag ctgcccagtg ggaagctgca ggtccacctg cgtgggctgc 5461 acaacctctg tgcctattgt acccacattc tcagtctcta cttcccctgg gatggaggta 5521 aggggcagat gggaggggca gccctgggga gagtgggcag ggatccaaga actagttctc 5581 ctaacacacc ttccttcctt gaccctcagc tgaggtacca ccgcaacgcc agacaccagc 5641 aggcccagag actgaggagg agccataccg gcgccggaac cagatcctat ctgtgctggc 
5701 aggactggca gccatggtgg gctacgcctt gctcagcggc attgtctcca tccagcgggc 5761 aacgcctgct cgggccccag gcacccggac cctgggcatg gctgaggagg atgaagagga 5821 atgatttgtc ctcacgctcc caagactggt ttttctactc tcatgcattc cagaggcccc 5881 cgtgcctcct cgttgttggt acagccggac acggggtgct gccacccaga ataaa // LOCUS HSU48795 2879 bp DNA PRI 16-JAN-1998 DEFINITION Homo sapiens antimicrobial protein CAP18 precursor, gene, complete cds. ACCESSION U48795 NID g1322243 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2879) AUTHORS Larrick,J.W., Hirata,M., Balint,R.F., Lee,J., Zhong,J. and Wright,S.C. TITLE Human CAP18: a novel antimicrobial lipopolysaccharide-binding protein JOURNAL Infect. Immun. 63 (4), 1291-1297 (1995) MEDLINE 95197251 REFERENCE 2 (bases 1 to 2879) AUTHORS Balint,R.F., Lee,J.-H. and Larrick,J.W. TITLE Direct Submission JOURNAL Submitted (07-FEB-1996) Robert F. Balint, Molecular Biology, Palo Alto Institute of Molecular Medicine, 2462 Wyandotte Street, Mountain View, CA 94043, USA FEATURES Location/Qualifiers source 1..2879 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3" exon 699..915 /number=1 mRNA join(699..915,1554..1661,1803..1874,2492..2684) /product="antimicrobial protein CAP18 precursor" CDS join(715..915,1554..1661,1803..1874,2492..2623) /codon_start=1 /product="antimicrobial protein CAP18 precursor" /db_xref="PID:g1322244" /translation="MKTQRDGHSLGRWSLVLLLLGLVMPLAIIAQVLSYKEAVLRAID GINQRSSDANLYRLLDLDPRPTMDGDPDTPKPVSFTVKETVCPRTTQQSPEDCDFKKD GLVKRCMGTVTLNQARGSFDISCDKDNKRFALLGDFFRKSKEKIGKEFKRIVQRIKDF LRNLVPRTES" sig_peptide 715..804 /product="antimicrobial protein CAP18" mat_peptide join(805..915,1554..1661,1803..1874,2492..2620) /product="antimicrobial protein CAP18" intron 916..1553 /number=1 exon 1554..1661 /number=2 intron 1662..1802 /number=2 exon 1803..1874 /number=3 intron 1875..2491 /number=3 exon 2492..2684 /number=4 
misc_feature 2510..2620 /note="encodes antimicrobial domain" BASE COUNT 609 a 775 c 796 g 699 t ORIGIN 1 ttttttcata ctgagtctca ctctgttacc caggctggag tgcagtggca tgatctcagc 61 taactgcaac ttctgcttcc cgggttcaat gggttcaagt gattctcatg cctcagcttg 121 tagctgggaa ctactggtgt gagccatcat gcgtggctaa ttttcatatt tttagtagag 181 atggggtttc accatgttgg ccaagcttgt ctcgaactcc ttatctcagg tgatccgccc 241 accttggcct cccaaagtgc tgggattata ggcgtgcaga ccgtgccctg cctcattcat 301 caattcttaa tcgatgccta cagggtgcca ggcaatgcta gagctggaga tttagcagtc 361 catcatactg actcctgagg agtagaagga tgtagaatag gcacctggct ctcttcctct 421 ctggagggat ttaacgctct tgagcacccc tggctatgac aatctccggt caggtctggg 481 aggttgtcag agatgaagaa accacttcct catcttgcac acaaggaagg cctcactcac 541 tgcccagcaa gtcctgtgaa gcaatagcca gggctaaagc aaaccccagc ccacaccctg 601 gcaggcagcc agggatgggt ggatcaggaa ggctcctggt tgggcttttg catcaggctc 661 aggctgggca taaaggaggc tcctgtgggc tagagggagg cagacatggg gaccatgaag 721 acccaaaggg atggccactc cctggggcgg tggtcactgg tgctcctgct gctgggcctg 781 gtgatgcctc tggccatcat tgcccaggtc ctcagctaca aggaagctgt gcttcgtgct 841 atagatggca tcaaccagcg gtcctcggat gctaacctct accgcctcct ggacctggac 901 cccaggccca cgatggtgag ctttggggga cattctgctc tgctctggct gggcttggca 961 cgtgttgttc cttctgctcc tgctgcactg cctgccagga gggcatctcc ccctttaaat 1021 gtggtcccgt gttttccagg gaaccttcta gagctcgtgt ctcctcccag ctcgagagct 1081 tctgccttat aattcctgct gtggcagaga tccctcaccc cgaccccacg caggttttgg 1141 gacttctgcg agctccaggc actagaatgg ggtcattggc tctgggcagt gacctcctct 1201 gctttaagtc tcttctgtac cacgttaccc cacataggga agaactctat ccagacttta 1261 ggttccagtg ggcatgtctt gtcccccagg aagccccctg acttcccttg cccccacccc 1321 agagtgggag gggctccttg ttagagctca tctgaggtct gctcctactc actgttcacc 1381 taggagggta ggaatggctc agtcctcctc ccctcaatgc cccagtgcca agcccagcac 1441 ccagtgcccg tcgcacatca ggtactgtgg aaagcctgcc ctcttggtgg ggaggtcatg 1501 gacacaaatc agaaaataca agaatgggcc tccccatttc ctcctctgac taggatgggg 1561 acccagacac gccaaagcct gtgagcttca cagtgaagga gacagtgtgc cccaggacga 
1621 cacagcagtc accagaggat tgtgacttca agaaggacgg ggtgaggctg ggggctgggg 1681 gtgttggtgg gtgcctccca aggagctgaa cagggggcac ctggggaata tttcccactg 1741 ggatgtggct gggaggtcat ggcaaatggt ttcaagtttg accttgagct tctcctttcc 1801 agctggtgaa gcggtgtatg gggacagtga ccctcaacca ggccaggggc tcctttgaca 1861 tcagttgtga taaggtgagt gggctgttct gggatgcagg ggctgatggg ggcatagagt 1921 gtggaccatc caatgggtca attaactact cccccaaccc aggacagaga aagcccctcc 1981 tacccagggc tcttccccaa acctgagttc catctccagg gccggctctg gaatccctta 2041 gagcggtaga tctccaagtg tagcccttcc tggggactcg ttagatatgc aaattctcag 2101 gccctactca gacctactca gacagactct gggtagggcc cagaaattcg tattttgata 2161 agctttccag gagattccgg cttctgtaaa gtttgagagc cactgtctaa gagtactcag 2221 ctctcagccc tgtgttccca tctcagtgtt gctgggctgg gctgtgtgac cctgcagagc 2281 ccctcactat ctccgggact ctgttttctc atctttttat tgggtgtagg gattcaatca 2341 catgcttcaa aggtcacagc cagaggttga actggggccc caaagctctg cgggggccca 2401 cgaagagggg cgtctaggtg gggaggggtc ttggattgac cctgggtaca tccccgacaa 2461 ggaacctgtt tcttcctgta cacaacccca ggataacaag agatttgccc tgctgggtga 2521 tttcttccgg aaatctaaag agaagattgg caaagagttt aaaagaattg tccagagaat 2581 caaggatttt ttgcggaatc ttgtacccag gacagagtcc tagtgtgtgc cctaccctgg 2641 ctcaggcttc tgggctctga gaaataaact atgagagcaa tttcctcagg cttcagtctc 2701 acttgttttg cctcctctct ctcaccacaa ctgagccctt agctcaggga gtccacgtgt 2761 gagtgtgagt gtgtgtgagt gtgacacaga ggtggcgagg gcagtgttcc atccaggagg 2821 acacagggta aggcagtagg gccaagagat ccaagatggc attcccattc tcagtggaa // LOCUS HSU48865 3706 bp DNA PRI 26-SEP-1996 DEFINITION Human C/EBP epsilon (CEBPE) gene, complete cds. ACCESSION U48865 NID g1399174 KEYWORDS transcription factor; DNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3706) AUTHORS Antonson,P., Stellan,B., Yamanaka,R. and Xanthopoulos,K.G. 
TITLE A novel human CCAAT/enhancer binding protein gene, C/EBPepsilon, is expressed in cells of lymphoid and myeloid lineages and is localized on chromosome 14q11.2 close to the T-cell receptor alpha/delta locus JOURNAL Genomics 35 (1), 30-38 (1996) MEDLINE 96299737 REFERENCE 2 (bases 1 to 3706) AUTHORS Antonson,P. TITLE Direct Submission JOURNAL Submitted (09-FEB-1996) Per Antonson, Department of Biosciences at NOVUM, Karolinska Institute, Huddinge, S-141 57, Sweden FEATURES Location/Qualifiers source 1..3706 /organism="Homo sapiens" /note="GDB DSEG number: D14S990" /db_xref="taxon:9606" /chromosome="14" /map="14q11.2" gene 1815..3668 /gene="CEBPE" exon <1815..2324 /gene="CEBPE" /number=1 CDS join(1815..2324,3083..3418) /gene="CEBPE" /note="bZIP transcription factor; CCAAT/enhancer binding protein epsilon" /codon_start=1 /product="C/EBP epsilon" /db_xref="PID:g1399175" /translation="MSHGTYYECEPRGGQQPLEFSGGRAGPGELGDMCEHEASIDLSA YIESGEEQLLSDLFAVKPAPEARRLKGPGTPAFPHYLPPDPRPFAYPPHTFGPDRKAL GPGIYSSPGSYDPRAVAVKEEPRGPEGSRAASRGSYNPLQYQVAHCGQTAMHLPPTLA APGQPLRVLKAPLATAAPPCSPLLKAPSPAGPLHKGKKAVNKDSLEYRLRRERNNIAV RKSRDKAKRRILETQQKVLEYMAENERLRSRVEQLTQELDTLRNLFRQIPEAANLIKG VGGCS" intron 2325..3082 /gene="CEBPE" repeat_region 2712..2844 /rpt_family="Alu" exon 3083..3582 /gene="CEBPE" /number=2 polyA_signal 3577..3582 /gene="CEBPE" repeat_region 3616..3668 /note="dinucleotide repeat" BASE COUNT 868 a 1027 c 1113 g 698 t ORIGIN 1 gagctcagct tgaaatggag aaatctccct aattcctgac atctgtctct gggttacgcg 61 cagatctccc cacaccaggt cactgtgctt tcagataaag catatctggc catgtctcaa 121 gtcatgcaca gagagcatct ggggtaggtg ttggggtagg gaggatgtaa ggaatatgct 181 aaccggaata tgctaatcag aatatgcatt tttaaaattt taatgttcaa tgcgacacac 241 atcctctctg cacttgtcct actgtggccc actcacagtg aaccacccct tgtttggccc 301 aacttacaaa aacttctggt ccctctgggg cacccatagc acttgctggt cataaaagag 361 tcagtaacat ttctatgtgt ccttcccccc ccccccccca actggaccta caatcccttt 421 tggagatgaa attcaacagg taaaagcagt accagagcag gtgtctctga aaggaggtag 481 
tggggagttc caaaggaaag agaaagggaa gaggagagcc agcacagaac aaagaattgc 541 agagagggag ccatgggctt ctcctggaga ccctaaaaca gagccccagg aagaggtgat 601 gacataagag gcttctcaaa cctcacaaga ggaagtagac ccaagagaca cgcattggag 661 atcagagagc ggccatgcaa aaggaaagac agcaaacgag ctgcagcaag ctcctaagcc 721 ctgcagccca ggcacaggaa accagaggca gagcctcagg cccagggcga ggggcgaagc 781 accagcctga gcctcttgca gcacagagca gctggaagag agaagtgctg gagccaccgg 841 aggattatgg ctgtctactc cgagggcagc aacctgccta ggcctgcggt gcaaggccca 901 agtcctgacc aggtgctttg gccaagccca gggagttagg aagtgtacca ggctgctaga 961 ggtagctagg gctggaaacg cactaatatt tgggctattg cacagctcct gtgcaaaaac 1021 ctagcaaatg agcatgctct ggagcaccac gcaggcttgt gtagagcttg ttcctaggac 1081 ctatcactac tccctagacc cctcaaaaaa caaaccctgg cctacactca gagaatgctc 1141 tgggaggtgg ggctacaaaa gaaactttct cttgggggta ggtggagccc agcagcaggg 1201 gagagccaga ggaaggcgtg tcaagggagg aggtgagacc caactgaggg ttggcactgg 1261 aagagggctt tctctcttca ctctaacttg gaacctcatt ctgcatagaa ctttttcaaa 1321 gtcggaggga ggaggttgct cagagtgggg tgttgccctc tgcgaggatt tggagtcccc 1381 tgggctccca gcagggagtg gagaggcatt cccagaagca tcagcttcgg gcatctccag 1441 aaccctggct tggtccctga ccaaggagtg tctccaactg ctgaataggc caggagtcgc 1501 ctttctctaa ggcttacatc tctccctctg gggtgtgtcc tgcccctccc tgagtcaccc 1561 caaggggaga gaggggaaaa aagggaagag aagagaggca ttgactacag aggaaggaaa 1621 aggaagcaga gcagaggggg gagagaggcc acacaggagt gggtgacaga ggagactgca 1681 gagggcaggt ctagggcaga agatcgagag agggcaggcc caggtcagga ggaggtagag 1741 agagggcagc cggagcaccc caaggggtgc ctcaagagca ggtgggggcg gggagccgag 1801 ggggcgggcc ggccatgtcc cacgggacct actacgagtg tgagccccgg ggtggccagc 1861 agccactcga gttctcaggg ggccgagctg ggcccgggga gctaggggac atgtgtgagc 1921 atgaggcctc cattgacctc tccgcctaca tcgagtctgg ggaagagcag cttctctccg 1981 atctctttgc cgtgaagcca gcgcctgagg ccagaaggct caagggcccc ggaacccctg 2041 ccttccccca ctacttgccg cctgaccctc ggccctttgc ctaccctcca cataccttcg 2101 gcccagacag gaaggcgctg gggcctggca tctacagcag cccagggagc tacgacccca 2161 gggctgtggc 
ggtgaaggag gagccccggg ggccagaggg cagccgagct gccagccgag 2221 gcagctacaa tcccctgcag taccaagtgg cacactgtgg gcagacagcc atgcacctgc 2281 ccccaactct ggcagcaccc ggccagcctc tgcgcgttct caaggtaaga gcaggccagc 2341 cctcctccct cctgcccggg gctggagagg ctggtgtgga caggaccctg tgccccggct 2401 catgctgctg agcctgtgct tggtgtcgct ctgtgaaatg gacccaaccc acagtcctct 2461 gtgttcacac cccaccatga cctccctttc cttctctgtg aattcattcc caccctgaat 2521 ccccacccaa agcccctgtg gcaagctgac ctacttgatc tcaggccccc accgccagcc 2581 tctctgctgc tcaccatgcc ccctcctctt gtttctctgc cgcctgcttc tgtgagaaat 2641 gtctactggt catagaacgg gagactcaag gcagattgag ggaaagtgta ctcagaaagt 2701 gtgagagcat tggctgggtg tcgtggctca tgcctgtaat cccagcgctt tgggaggcca 2761 aagcaggagg ttcgcttaag gccaggggtt ggagaccagc ctggacaaca tagcaaggct 2821 ccatctttac aaaaaaagaa agaaagaaag tgtgagaaca tgaaagggag ttttggtctc 2881 aagcagagga ccctgtgcat ctgggtcacc atctcccaga agcagagcga tgatagtttt 2941 gacctgagag acaggagctg gctctgagtc ctgtattcct ttgacgcatc aagtgtgccc 3001 tgggtgccga gattcccgcc cgcctccgct ctgcctcctg gccagccctc acctctccgt 3061 ctctccgtgc cttcccctcc aggccccttt ggccactgcc gcacccccct gcagtcccct 3121 cctgaaggcg ccctccccgg ctggcccctt acacaagggc aagaaggcag tgaacaaaga 3181 tagccttgag taccggctga ggcgggagcg caacaacatc gccgtgcgca agagccgaga 3241 caaggccaag aggcgcattc tggagacgca gcagaaggtg ctggagtaca tggcagagaa 3301 cgagcgcctc cgcagccgcg tggagcagct cacccaggag ctagacaccc tccgcaacct 3361 cttccgccag attcctgagg cggccaacct catcaagggc gtggggggtt gcagctgagg 3421 ctggctggtg gattgtgggc accaggctcc ctggcacggt ctaactctgc ggacccccat 3481 cctgctgggg gcctagaacc ctgagacata gaccatggat aaatggcaac cggggtggca 3541 aagagggcag gacccagcat aatgattata tggctgaata aagttgcact gtgactgggt 3601 gttgggactg ttggctgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 3661 gtgtgtgtca tggaggaccc tggggacaga ggatgcatgt ggctag // LOCUS HSU48869 2211 bp DNA PRI 12-AUG-1996 DEFINITION Human cdk-inhibitor p57/KIP2 (CDKN1C) gene, complete cds. ACCESSION U48869 NID g1213447 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2211) AUTHORS Reid,L.H., Crider-Miller,S.J., West,A., Lee,M.H., Massague,J. and Weissman,B.E. TITLE Genomic organization of the human p57KIP2 gene and its analysis in the G401 Wilms' tumor assay JOURNAL Cancer Res. 56 (6), 1214-1218 (1996) MEDLINE 96189289 REFERENCE 2 (bases 1 to 2211) AUTHORS Reid,L.H. and Weissman,B.E. TITLE Direct Submission JOURNAL Submitted (09-FEB-1996) Laura H. Reid, Pathology, Univ. North Carolina, 345 Lineberger Cancer Center, CB#7295, Chapel Hill, NC 27599, USA FEATURES Location/Qualifiers source 1..2211 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" mRNA join(170..1230,1766..1901,1985..2211) /gene="CDKN1C" gene 170..2211 /gene="CDKN1C" CDS join(411..1230,1766..1896) /gene="CDKN1C" /codon_start=1 /product="cdk-inhibitor p57/KIP2" /db_xref="PID:g1213448" /translation="MSDASLRSTSTMERLVARGTFPVLVRTSACRSLFGPVDHEELSR ELQARLAELNAEDQNRWDYDFQQDMPLRGPGRLQWTEVDSDSVPAFYRETVQVGRCRL LLAPRPVAVAVAVSPPLEPAAESLDGLEEAPEQLPSVPVPAPASTPPPVPVLAPAPAP APAPVAAPVAAPVAVAVLAPAPAPAPAPAPAPAPVAAPAPAPAPAPAPAPAPAPAPDA APQESAEQGANQGQRGQEPLADQLHSGISGRPAAGTAAASANGAAIKKLSGPLISDFF AKRKRSAPEKSSGDVPAPCPSPSAAPGVGSVEQTPRKRLR" BASE COUNT 279 a 858 c 761 g 313 t ORIGIN 1 gcggccgcca atcgccgtgg tgttgttgaa actgaaaata ctacattatg ctaatcgcgg 61 ccgggcccgc gcgcacgggg gtggggcccg cgcgtataaa gggggcgcag gcgggctggg 121 cgttccacag gccaagtgcg ctgtgctcga ggggtgccgg ccaggcctga gcgagcgagc 181 tagccagcag gcatcgaggg ggcgcggctg ccgtccggac gagacaggcg aacccgacgc 241 agaagagtcc accaccggac agccaggtag ccgccgcgtc cctcgcacac gcagagtcgg 301 gcggcgcggg gtctcccttg cgcccggcct ccgccctctc ctcctctcct ttccccttct 361 tctcgctgtc ctctcctctc tcgctgcccg cgtttgcgca gccccgggcc atgtccgacg 421 cgtccctccg cagcacatcc acgatggagc gtcttgtcgc ccgtgggacc ttcccagtac 481 tagtgcgcac cagcgcctgc cgcagcctct tcgggccggt ggaccacgag gagctgagcc 541 
gcgagctgca ggcccgcctg gccgagctga acgccgagga ccagaaccgc tgggattacg 601 acttccagca ggacatgccg ctgcggggcc ctggacgcct gcagtggacc gaagtggaca 661 gcgactcggt gcccgcgttc taccgcgaga cggtgcaggt ggggcgctgc cgcctgctgc 721 tggcgccgcg gcccgtcgcg gtcgcggtgg ctgtcagccc gcccctcgag ccggccgctg 781 agtccctcga cggcctcgag gaggcgccgg agcagctgcc tagtgtcccg gtcccggccc 841 cggcgtccac cccgccccca gtcccggtcc tggctccagc cccggccccg gctccggctc 901 cggtcgcggc tccggtcgcg gctccggtcg cggtcgcggt cctggccccg gccccggccc 961 cggccccggc tccggctccg gccccggctc cagtcgcggc cccggcccca gccccggccc 1021 cggccccggc cccggccccc gccccggccc cggccccgga cgcggcgcct caagagagcg 1081 ccgagcaggg cgcgaaccag gggcagcgcg gccaggagcc tctcgctgac cagctgcact 1141 cggggatttc gggacgtccc gcggccggca ccgcggccgc cagcgccaac ggcgcggcga 1201 tcaagaagct gtccgggcct ctgatctccg gtgagccccg cacggccccg ccccggcccg 1261 gcccggcccc gctttgtccg gccggccggt ccccccagcc ctcgggggcc tcgctgggtt 1321 cccgcctcct cccgtggcat taaagggccc gcagcgctca gggcgcggct gcgcccttac 1381 ccccctcccc gccccgttgt tgtgccctcc agcggcttcg cgcgggcggg gtgggaggct 1441 gaatcccggc cgcgaccccc cgggagcgca gttttcgccc ccggccgcgg gagcccctcc 1501 ccgggcgcgg ccgggcgccg tgagcacggc gtggaggggg ttaagcgcgg cggcggcccc 1561 ggggggcttg gccgcgggac aaggggaaat gcttacacag cacattgcgc ggcgacgtaa 1621 acaaagctga cccgccgcgg acctcggcgc gggcggggac ggcgccccca ccccggccgg 1681 cccgcgcccc gcgccctctc ccggccccct ctcggttctc cgggccggcc ccgccctgac 1741 cggccgcgcg cgctgtcgcc cgcagatttc ttcgccaagc gcaagagatc agcgcctgag 1801 aagtcgtcgg gcgatgtccc cgcgccgtgt ccctctccaa gcgccgcccc tggcgtgggc 1861 tcggtggagc agaccccgcg caagaggctg cggtgagcca agtgagtaca gcgcacctgg 1921 gggggcgcgg agggccgacc cgccgggtcc ccgccggctt tgctgaccgc ccctctcctc 1981 gcagtttaga gcccaaagag ccccgaggga acctgccggg gcagcggacg ttggaagggc 2041 gctgggcctc ggctgggacc gttcatgtag cagcaaccgg cggcggctgc cgcagagcag 2101 cgttcggttt tgtttttaaa ttttgaaaac tgtgcaatgt attaataacg tctttttata 2161 tctaaatgta ttctgcacga gaaggtacac tggtcccaag gtgtaaagct t // LOCUS HSU49742 6953 bp DNA 
PRI 03-SEP-1996 DEFINITION Human rhodopsin gene, complete cds. ACCESSION U49742 K02281 NID g1236136 KEYWORDS opsin; rhodopsin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6953) AUTHORS Nathans,J. and Hogness,D.S. TITLE Isolation and nucleotide sequence of the gene encoding human rhodopsin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (15), 4851-4855 (1984) MEDLINE 84272729 REFERENCE 2 (bases 1 to 6953) AUTHORS Nathans,J. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) Jeremy Nathans, Molecular Biology and Genetics, Johns Hopkins Medical School, 725 N. Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..6953 /organism="Homo sapiens" /isolate="individual J.N." /db_xref="taxon:9606" /clone="roJHN" /map="1 bp upstream of BamHI site" CAAT_signal 122..127 /note="putative CAAT box" TATA_signal 171..177 /note="putative TATA box" prim_transcript 200..>5278 /note="rhodopsin mRNA (alt); the 5' untranslated region is similar to the bovine sequence" prim_transcript 202..>5278 /note="rhodopsin mRNA (alt); the 5' untranslated region is similar to the bovine sequence" exon <295..655 /number=1 CDS join(295..655,2439..2607,3813..3978,4095..4334,5168..5278) /note="the 5 exons of the human rhodopsin gene are similar to the 5 exons of the Bos taurus opsin gene" /codon_start=1 /product="rhodopsin" /db_xref="PID:g1236137" /translation="MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYM FLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGY FVFGPTGCNLEGFFATLGGEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFT WVMALACAAPPLAGWSRYIPEGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMII IFFCYGQLVFTVKEAAAQQQESATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIF THQGSNFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEAS ATVSKTETSQVAPA" intron 656..2438 /note="intron A; the four introns present in both the human and the bovine genes occur at precisely analogous positions and are 
of comparable lengths" exon 2439..2607 /number=2 intron 2608..3812 /note="intron B; the four introns present in both the human and the bovine genes occur at precisely analogous positions and are of comparable lengths" exon 3813..3978 /number=3 intron 3979..4094 /note="intron C; the four introns present in both the human and the bovine genes occur at precisely analogous positions and are of comparable lengths" exon 4095..4334 /number=4 intron 4335..5167 /note="intron D; the four introns present in both the human and the bovine genes occur at precisely analogous positions and are of comparable lengths" exon 5168..>5278 /number=5 polyA_signal 5642..5647 /note="potential polyA signal" polyA_signal 6698..6903 /note="potential polyA signal" BASE COUNT 1524 a 2022 c 1796 g 1611 t ORIGIN 1 ggatcctgag tacctctcct ccctgacctc aggcttcctc ctagtgtcac cttggcccct 61 cttagaagcc aattaggccc tcagtttctg cagcggggat taatatgatt atgaacaccc 121 ccaatctccc agatgctgat tcagccagga gcttaggagg gggaggtcac tttataaggg 181 tctggggggg tcagaaccca gagtcatcca gctggagccc tgagtggctg agctcaggcc 241 ttcgcagcat tcttgggtgg gagcagccac gggtcagcca caagggccac agccatgaat 301 ggcacagaag gccctaactt ctacgtgccc ttctccaatg cgacgggtgt ggtacgcagc 361 cccttcgagt acccacagta ctacctggct gagccatggc agttctccat gctggccgcc 421 tacatgtttc tgctgatcgt gctgggcttc cccatcaact tcctcacgct ctacgtcacc 481 gtccagcaca agaagctgcg cacgcctctc aactacatcc tgctcaacct agccgtggct 541 gacctcttca tggtcctagg tggcttcacc agcaccctct acacctctct gcatggatac 601 ttcgtcttcg ggcccacagg atgcaatttg gagggcttct ttgccaccct gggcggtatg 661 agccgggtgt gggtggggtg tgcaggagcc cgggagcatg gaggggtctg ggagagtccc 721 gggcttggcg gtggtggctg agaggccttc tcccttctcc tgtcctgtca atgttatcca 781 aagccctcat atattcagtc aacaaacacc attcatggtg atagccgggc tgctgtttgt 841 gcagggctgg cactgaacac tgccttgatc ttatttggag caatatgcgc ttgtctaatt 901 tcacagcaag aaaactgagc tgaggctcaa aggccaagtc aagcccctgc tggggcgtca 961 cacagggacg ggtgcagagt tgagttggaa gcccgcatct atctcgggcc atgtttgcag 1021 caccaagcct 
ctgtttccct tggagcagct gtgctgagtc agacccaggc tgggcactga 1081 gggagagctg ggcaagccag acccctcctc tctgggggcc caagctcagg gtgggaagtg 1141 gattttccat tctccagtca ttgggtcttc cctgtgctgg gcaatgggct cggtcccctc 1201 tggcatcctc tgcctcccct ctcagcccct gtcctcaggt gcccctccag cctccctgcc 1261 gcgttccaag tctcctggtg ttgagaaccg caagcagccg ctctgaagca gttccttttt 1321 gctttagaat aatgtcttgc atttaacagg aaaacagatg gggtgctgca gggataacag 1381 atcccactta acagagagga aaactgaggc agggagaggg gaagagactc atttagggat 1441 gtggccaggc agcaacaaga gcctaggtct cctggctgtg atccaggaat atctctgctg 1501 agatgcagga ggagacgcta gaagcagcca ttgcaaagct gggtgacggg gagagcttac 1561 cgccagccac aagcgtctct ctgccagcct tgccctgtct cccccatgtc caggctgctg 1621 cctcggtccc attctcaggg aatctctggc cattgttggg tgtttgttgc attcaataat 1681 cacagatcac tcagttctgg ccagaaggtg ggtgtgccac ttacgggtgg ttgttctctg 1741 cagggtcagt cccagtttac aaatattgtc cctttcactg ttaggaatgt cccagtttgg 1801 ttgattaact atatggccac tctccctatg aaacttcatg gggtggtgag caggacagat 1861 gttcgaattc catcatttcc ttcttcttcc tctgggcaaa acattgcaca ttgcttcatg 1921 gctcctagga gaggccccca catgtccggg ttatttcatt tcccgagaag ggagagggag 1981 gaaggactgc caattctggg tttccaccac ctctgcattc cttcccaaca aggaactctg 2041 ccccacatta ggatgcattc ttctgctaaa cacacacaca cacacacaca cacacaacac 2101 acacacacac acacacacac acacacacac aaaactccct accgggttcc cagttcaatc 2161 ctgaccccct gatctgattc gtgtccctta tgggcccaga gcgctaagca aataacttcc 2221 cccattccct ggaatttctt tgcccagctc tcctcagcgt gtggtccctc tgccccttcc 2281 ccctcctccc agcaccaagc tctctccttc cccaaggcct cctcaaatcc ctctcccact 2341 cctggttgcc ttcctagcta ccctctccct gtctaggggg gagtgcaccc tccttaggca 2401 gtggggtctg tgctgaccgc ctgctgactg ccttgcaggt gaaattgccc tgtggtcctt 2461 ggtggtcctg gccatcgagc ggtacgtggt ggtgtgtaag cccatgagca acttccgctt 2521 cggggagaac catgccatca tgggcgttgc cttcacctgg gtcatggcgc tggcctgcgc 2581 cgcaccccca ctcgccggct ggtccaggta atggcactga gcagaaggga agaagctccg 2641 ggggctcttt gtagggtcct ccagtcagga ctcaaaccca gtagtgtctg gttccaggca 2701 ctgaccttgt atgtctcctg 
gcccaaatgc ccactcaggg taggggtgta gggcagaaga 2761 agaaacagac tctaatgttg ctacaagggc tggtcccatc tcctgagccc catgtcaaac 2821 agaatccaag acatcccaac ccttcacctt ggctgtgccc ctaatcctca actaagctag 2881 gcgcaaattc caatcctctt tggtctagta ccccgggggc agccccctct aaccttgggc 2941 ctcagcagca ggggaggcca caccttccta gtgcaggtgg ccatattgtg gccccttgga 3001 actgggtccc actcagcctc taggcgattg tctcctaatg gggctgagat gagactcagt 3061 ggggacagtg gtttggacaa taggactggt gactctggtc cccagaggcc tcatgtccct 3121 ctgtctccag aaaattccca ctctcacttc cctttcctcc tcagtcttgc tagggtccat 3181 ttctacccct tgctgaattt gagcccaccc cctggacttt ttccccatct tctccaatct 3241 ggcctagttc tatcctctgg aagcagagcc gctggacgct ctgggtttcc tgaggcccgt 3301 ccactgtcac caatatcagg aaccattgcc acgtcctaat gacgtgcgct ggaagcctct 3361 agtttccaga agctgcacaa agatccctta gatactctgt gtgtccatct ttggcctgga 3421 aaatactctc accctggggc taggaagacc tcggtttgta caaacttcct caaatgcaga 3481 gcctgagggc tctccccacc tcctcaccaa ccctctgcgt ggcatagccc tagcctcagc 3541 gggcagtgga tgctggggct gggcatgcag ggagaggctg ggtggtgtca tctggtaacg 3601 cagccaccaa acaatgaagc gacactgatt ccacaaggtg catctgcatc cccatctgat 3661 ccattccatc ctgtcaccca gccatgcaga cgtttatgat ccccttttcc agggagggaa 3721 tgtgaagccc cagaaagggc cagcgctcgg cagccacctt ggctgttccc aagtccctca 3781 caggcagggt ctccctacct gcctgtcctc aggtacatcc ccgagggcct gcagtgctcg 3841 tgtggaatcg actactacac gctcaagccg gaggtcaaca acgagtcttt tgtcatctac 3901 atgttcgtgg tccacttcac catccccatg attatcatct ttttctgcta tgggcagctc 3961 gtcttcaccg tcaaggaggt acgggccggg gggtgggcgg cctcacggct ctgagggtcc 4021 agcccccagc atgcatctgc ggctcctgct ccctggagga gccatggtct ggacccgggt 4081 cccgtgtcct gcaggccgct gcccagcagc aggagtcagc caccacacag aaggcagaga 4141 aggaggtcac ccgcatggtc atcatcatgg tcatcgcttt cctgatctgc tgggtgccct 4201 acgccagcgt ggcattctac atcttcaccc accagggctc caacttcggt cccatcttca 4261 tgaccatccc agcgttcttt gccaagagcg ccgccatcta caaccctgtc atctatatca 4321 tgatgaacaa gcaggtgcct actgcgggtg ggagggcccc agtgccccag gccacaggcg 4381 ctgcctgcca aggacaagct actcccaggg 
caggggaggg gctccatcag ggttactggc 4441 agcagtcttg ggtcagcagt cccaatgggg agtgtgtgag aaatgcagat tcctggcccc 4501 actcagaact gctgaatctc agggtgggcc caggaacctg catttccagc aagccctcca 4561 caggtggctc agatgctcac tcaggtggga gaagctccag tcagctagtt ctggaagccc 4621 aatgtcaaag tcagaaggac ccaagtcggg aatgggatgg gccagtctcc ataaagctga 4681 ataaggagct aaaaagtctt attctgaggg gtaaaggggt aaagggttcc tcggagaggt 4741 acctccgagg ggtaaacagt tgggtaaaca gtctctgaag tcagctctgc cattttctag 4801 ctgtatggcc ctgggcaagt caatttcctt ctctgtgctt tggtttcctc atccatagaa 4861 aggtagaaag ggcaaaacac caaactcttg gattacaaga gataatttac agaacaccct 4921 tggcacacag agggcaccat gaaatgtcac gggtgacaca gcccccttgt gctcagtccc 4981 tggcatctct aggggtgagg agcgtctgcc tagcaggttc ccaccaggaa gctggatttg 5041 agtggatggg gcgctggaat cgtgaggggc agaagcaggc aaagggtcgg ggcgaacctc 5101 actaacgtgc cagttccaag cacactgtgg gcagccctgg ccctgactca agcctcttgc 5161 cttccagttc cggaactgca tgctcaccac catctgctgc ggcaagaacc cactgggtga 5221 cgatgaggcc tctgctaccg tgtccaagac ggagacgagc caggtggccc cggcctaaga 5281 cctgcctagg actctgtggc cgactatagg cgtctcccat cccctacacc ttcccccagc 5341 cacagccatc ccaccaggag cagcgcctgt gcagaatgaa cgaagtcaca taggctcctt 5401 aatttttttt ttttttttaa gaaataatta atgaggctcc tcactcacct gggacagcct 5461 gagaagggac atccaccaag acctactgat ctggagtccc acgttcccca aggccagcgg 5521 gatgtgtgcc cctcctcctc ccaactcatc tttcaggaac acgaggattc ttgctttctg 5581 gaaaagtgtc ccagcttagg gataagtgtc tagcacagaa tggggcacac agtaggtgct 5641 taataaatgc tggatggatg caggaaggaa tggaggaatg aatgggaagg gagaacatat 5701 ctatcctctc agaccctcgc agcagcagca actcatactt ggctaatgat atggagcagt 5761 tgtttttccc tccctgggcc tcactttctt ctcctataaa atggaaatcc cagatccctg 5821 gtcctgccga cacgcagcta ctgagaagac caaaagaggt gtgtgtgtgt ctatgtgtgt 5881 gtttcagcac tttgtaaata gcaagaagct gtacagattc tagttaatgt tgtgaataac 5941 atcaattaat gtaactagtt aattactatg attatcacct cctgatagtg aacattttga 6001 gattgggcat tcagatgatg gggtttcacc caaccttggg gcaggttttt aaaaattagc 6061 taggcatcaa ggccagacca gggctggggg ttgggctgta 
ggcagggaca gtcacaggaa 6121 tgcaggatgc agtcatcaga cctgaaaaaa caacactggg ggagggggac ggtgaaggcc 6181 aagttcccaa tgagggtgag attgggcctg gggtctcacc cctagtgtgg ggccccaggt 6241 cccgtgcctc cccttcccaa tgtggcctat ggagagacag gcctttctct cagcctctgg 6301 aagccacctg ctcttttgct ctagcacctg ggtcccagca tctagagcat ggagcctcta 6361 gaagccatgc tcacccgccc acatttaatt aacagctgag tccctgatgt catccttact 6421 cgaagagctt agaaacaaag agtgggaaat tccactgggc ctaccttcct tggggatgtt 6481 catgggcccc agtttccagt ttcccttgcc agacaagccc atcttcagca gttgctagtc 6541 cattctccat tctggagaat ctgctccaaa aagctggcca catctctgag gtgtcagaat 6601 taagctgcct cagtaactgc tcccccttct ccatataagc aaagccagaa gctctagctt 6661 tacccagctc tgcctggaga ctaaggcaaa ttgggccatt aaaagctcag ctcctatgtt 6721 ggtattaacg gtggtgggtt ttgttgcttt cacactctat ccacaggata gattgaaact 6781 gccagcttcc acctgatccc tgaccctggg atggctggat tgagcaatga gcagagccaa 6841 gcagcacaga gtcccctggg gctagaggtg gaggaggcag tcctgggaat gggaaaaacc 6901 ccaactttgg ggtcatagag gcacaggtaa cccataaaac tgcaaacaag ctt // LOCUS HSU50136 4465 bp DNA PRI 16-MAY-1996 DEFINITION Human leukotriene C4 synthase (LTC4S) gene, complete cds. ACCESSION U50136 NID g1314482 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4465) AUTHORS Penrose,J.F., Spector,J., Baldasaro,M., Xu,K., Boyce,J., Arm,J.P., Austen,K.F. and Lam,B.K. TITLE Molecular cloning of the gene for human leukotriene C4 synthase. Organization, nucleotide sequence, and chromosomal localization to 5q35 JOURNAL J. Biol. Chem. 271 (19), 11356-11361 (1996) MEDLINE 96212205 REFERENCE 2 (bases 1 to 4465) AUTHORS Penrose,J.F. TITLE Direct Submission JOURNAL Submitted (27-FEB-1996) John F. 
Penrose, Medicine, Harvard Medical School, 250 Longwood Av, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..4465 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q35" mRNA join(1351..1504,2950..3049,3152..3222,3307..3388, 3619..3876) /gene="LTC4S" /product="leukotriene C4 synthase" gene 1351..3876 /gene="LTC4S" CDS join(1447..1504,2950..3049,3152..3222,3307..3388, 3619..3760) /gene="LTC4S" /EC_number="2.5.1.37" /codon_start=1 /product="leukotriene C4 synthase" /db_xref="PID:g1314483" /translation="MKDEVALLAAVTLLGVLLQAYFSLQVISARRAFRVSPPLTTGPP EFERVYRAQVNCSEYFPLFLATLWVAGIFFHEGAAALCGLVYLFARLRYFQGYARSAQ LRLAPLYASARALWLLVALAALGLLAHFLPAALRAALLGRLRTLLPWA" BASE COUNT 784 a 1385 c 1545 g 751 t ORIGIN 1 gagctcacag agcccccagc tggggcatat ctggtttccg ggggcagggg cgatacccag 61 aggaggaaga agggattctg agagagccca acaggctccg agcctcaggc tggagctgag 121 cttggggcag caaggaagga ccaggtgcga gggcagaacc atgcggcccg acccctgcag 181 cacggcctgt ggcctccccc agctcctgcc cgtgcttctg ggtcagtctg gactttgcca 241 cttctgacca aaagccaccg caaacccact caagccaaaa gaggaagtga ccgttaggcc 301 caactgggaa ggctggcggc caggggcact ccaggcaggg cgaggggggc ggccgggggc 361 gctccaggcg gggcgaggga gacacccaga actccaggca ggagtcctcg ggtgccacct 421 ttcctctcca cctggccctg cgtgggctct gtcctcaggg tggcccgccg tagtccccct 481 ccccactctg agtttcctgt cccaaagtcc taaggaagtt tccagaacta catctcacca 541 tcttgagtca gccttggctc agtgtccatc tcacaggcct ggaaggggca ggagtcagca 601 ctgtccagac cacagggcct gagtgtgggg agggcagccg tctaggaagg tggtggaggg 661 ttgttacctt gaggcaagag ggctgcgggg cagaaagaca cagcaggtga ctgttgtggg 721 aggcccaaga gaggcctggg agagggatgg cccacaaggg ctgaccctcc cgccacccag 781 ggggccttgg acaggtttcc tcctggcagg gtggcccttg tgcatggaac ccctacaacg 841 actaaggctg gcaggcatga ggtttcctga aggagaaaga gcttgtgggg cccagtgtgg 901 ctgggggggc gctgggactc cattctgaag ccaaaggcac tgggaagggc ttccgcagag 961 gagggtttgg caggggttgc caggaacagc ctggatgggg acagggaaca gataaggtgg 1021 gtggaggagt tagccgggag cctggggctg gctccagcat gatgtggggg tctgcaaggc 1081 
cctggagaaa gtggggtggt gcagcagggg gcacacccac agctggagct gacccagatg 1141 gacagcttgg gctctgccac gcgggactag gcaaggaagg ggcacgaaca agcaggaagt 1201 ggtgaggcgg tctccagcta gctgctctcc cctgcccaga ctttggtttc ctccctgctg 1261 gcttggcctg gctccctggc tctgtgtggt atggtcacac ccccgtgcac cccctccact 1321 gagatggggc ggggagagca ccgaggctgc tcttcctctc ctgggccgtc ctctgagcag 1381 cagacggggc taagcgttcc ccagctcgcc ttcacacaca gcccgtgcca ccacaccgac 1441 ggtaccatga aggacgaggt agctctactg gctgctgtca ccctcctggg agtcctgctg 1501 caaggtgggc tggttcctat ctaggaagag ggtgggcctt agatccctac agcttgccct 1561 ctgcccccta ggcccaggtg gagggcagag gtggggactc cagcccaggc ccaagctgga 1621 agagggtggg gactttcagg gaactggggg gcacctggct gtgagagctg taggacttgg 1681 gggtggcaag ggtgccagga caaatggtag gatagccatg ggcttgggga agctgatctc 1741 tgctctttcc agctgtcccc tctctgggcg tcccagcaag cggcccccat tccctggctc 1801 tgcttcaaag gcacctccat actgggacca cgtggagcag ggtagaggtg ggactccttc 1861 ctccagcccc ctaaaaagag cctgcttaat gcctttctca gactggccct aaaggacaca 1921 ttccttggcc agatatcctt gccacctaag agacaccact actccacagt gtgtgggcta 1981 ggataaggca cagcctgggg agggggctct gaaggggctg aacagacagg ccagcctgac 2041 ctccagctgc tcctgcactg agctggatgg ccaccctgtg acacccatct gcagagggcc 2101 cagaaccaaa ggtgccaggg ctgcaggact cagggggaga tggtccgacg ggaggtctgg 2161 ggagggagcg cacagccagc actggtctgt gtgtggtctg gcctggcctc acctgaccaa 2221 gagaagggct cctgcccaca gagaaacttt agggccagcc caccctctgc aactacccca 2281 gccctggggt cctggggtta ggctaggaga gtcccagctg caacctcctg ggagcaggag 2341 agaaggtgtc tgtcagattt aggcctggga ccggaatgca ggaacagaga aactgaggtt 2401 tggaggcaca gggacgcagg ctttagtgat cccggcctga ggcagggtca gagggccctg 2461 ctggtgggcg ctggtaggtg ggtgaccagg gactgttagc tacagggagt gtgcttcctt 2521 gcacctggga ggatgcagcc agctctgccc tcagactccc gaggcacttc ctggccaggg 2581 acctgaaagc tgcatttgcc tgtgttttga gagtgaaatg attcagaaac aaggactcaa 2641 gtggtctctc tcgcggagca ggtgtccctg tgcctgaatc actcaccctc ccccatacac 2701 tcacaggttg ggacagggcc tctctgcgcc ccaggcttca gccctgccct cctcgctgaa 2761 tgtcagggac 
acagggcagg ccagggatgg gtgagacgag aggtctcctc gggcggggag 2821 ggggcggggt tccgccttag ggaggagagg acacggccaa gtgaagggcc agattgcagg 2881 atccctccca ctcccatctc tggggcttcg ggtgtccaga cctgactccc gctccccctc 2941 ctcccccagc ctacttctcc ctgcaggtga tctcggcgcg cagggccttc cgcgtgtcgc 3001 cgccgctcac caccggccca cccgagttcg agcgcgtcta ccgagcccag tgaggcgcgg 3061 cgggagggcg cggggcgggg agcgagcccc aggcgggtcc gggtcgcagg accatcccgg 3121 ccggcgcgct catcccaccc gcccaccgca gggtgaactg cagcgagtac ttcccgctgt 3181 tcctcgccac gctctgggtc gccggcatct tctttcatga aggtcggggt gtggggcagg 3241 ggcgcacgcg ctggaccccc gggacccgcg cagggcgctc accaggcccg tgcgtacctc 3301 tcgcaggggc ggcggccctg tgcggcctgg tctacctgtt cgcgcgcctc cgctacttcc 3361 agggctacgc gcgctccgcg cagctcaggt gagggccggg cggggagcgg ggcggggccg 3421 gggaaagatc gcgggcgggc ggggctcctg gggagcggga ccgaagctgg gggcgggcga 3481 cgggccggag cccagcgcct ttggggattc ggtgggcgag ccctggcggc ggccagagga 3541 agtccccgtg gggccagggt tgcggcgggg aagaagcggg cctcctcgcg ccacctcccc 3601 gctgaccgcc gcccgcaggc tggcaccgct gtacgcgagc gcgcgcgccc tctggctgct 3661 ggtggcgctg gctgcgctcg gcctgctcgc ccacttcctc ccggccgcgc tgcgcgccgc 3721 gctcctcgga cggctccgga cgctgctgcc gtgggcctga gaccaaggcc cccgggccga 3781 cggagccggg aaagaagagc cggagcctcc agctgccccg gggaggggcg ctcgcttccg 3841 catcctagtc tctatcatta aagttctagt gaccgagacc cgggctgcgt tctctgggtc 3901 cgcgggggtg gcgcaccgcg ggctacggag cctggagggg cccagcccga gtccgggcag 3961 cccggggcgg gcttcctagt ggcggcgtga gagtggctgc gaaggaacga gccctccccc 4021 tggggcggga ctggatccgg tcttcacctc ctaccccact ccctactcag cctcggggtc 4081 acaaggccgc ccagtcctgc cggggttcac cctcctagcg ctcagcggtc tcctcaccgg 4141 tccccctcct caggggcctt ccctcgactc tcagccgccg cagtccctcg tcccctggcc 4201 ttcacagctg acactagata gagcctgtgg ctctctcccc aggtgagggc aggggttttt 4261 cttttggtca gcactggatc cccctcgtta actgtaggtg ttcagggcag ccctccgagg 4321 tccgcagagc tgcgggcacc atgggaacga agtgagtcag tgacaggcgg tctcaaggaa 4381 atgtccagaa gccttgggga tccaggggag gcccacagaa acaaagaagt gacttttagc 4441 caagtatgca ggagaaacgg 
aggag // LOCUS HSU50871 86640 bp DNA PRI 22-MAR-1997 DEFINITION Human familial Alzheimer's disease (STM2) gene, complete cds. ACCESSION U50871 NID g1354264 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 86640) AUTHORS Levy-Lahad,E., Poorkaj,P., Wang,K., Fu,Y.H., Oshima,J., Mulligan,J. and Schellenberg,G.D. TITLE Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene JOURNAL Genomics 34 (2), 198-204 (1996) MEDLINE 96299710 REFERENCE 2 (bases 1 to 86640) AUTHORS Wang,K. TITLE Direct Submission JOURNAL Submitted (06-MAR-1996) Genomics, Darwin Molecular Corp., 1631 220th Street SE, Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..86640 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" mRNA join(26409..26551,30511..30696,37057..37217,38872..39086, 40705..40846,43261..43328,43999..44219,45205..45303, 46448..46531,46913..47014,49177..49295,50594..51197) /gene="STM2" /product="familial Alzheimer's disease" gene 26409..51197 /gene="STM2" CDS join(37077..37217,38872..39086,40705..40846,43261..43328, 43999..44219,45205..45303,46448..46531,46913..47014, 49177..49295,50594..50749) /gene="STM2" /codon_start=1 /product="familial Alzheimer's disease" /db_xref="PID:g1354265" /translation="MLTFMASDSEEEVCDERTSLMSAESPTPRSCQEGRQGPEDGENT AQWRSQENEEDGEEDPDRYVCSGVPGRPPGLEEELTLKYGAKHVIMLFVPVTLCMIVV VATIKSVRFYTEKNGQLIYTPFTEDTPSVGQRLLNSVLNTLIMISVIVVMTIFLVVLY KYRCYKFIHGWLIMSSLMLLFLFTYIYLGEVLKTYNVAMDYPTLLLTVWNFGAVGMVC IHWKGPLVLQQAYLIMISALMALVFIKYLPEWSAWVILGAISVYDLVAVLCPKGPLRM LVETAQERNEPIFPALIYSSAMVWTVGMAKLDPSSQGALQLPYDPEMEEDSYDSFGEP SYPEVFEPPLTGYPGEELEEEEERGVKLGLGDFIFYSVLVGKAAATGSGDWNTTLACF VAILIGLCLTLLLLAVFKKALPALPISITFGLIFYFSTDNLVRPFMDTLASHQLYI" BASE COUNT 22964 a 19374 c 20239 g 24019 t 44 others ORIGIN 1 tttttgtatt ttttagaaga gacggagttt catcatgtcg gccaggctgg taaaaccctg 61 tctctactaa 
aaatgcaaaa attagccagg tgtggtggcg tatgcctgta atcccagctg 121 ctcaggaggc tgaggcagga gaattgcctg aacccgggag gtggaggttg cagtgagcca 181 ggttcgtgcc actgcactcc agcatgggca acaagagcta aactccatca agaaagaaag 241 aaagacagga aggaaggaag gaaggaagga tggatggaag gaaggaagga aggaaggaag 301 gtaggtcagt caggagggaa gtcaggaagg aaggaaggaa ggaaggaaga aagaaaggaa 361 gaaaggaaga agcccctcta aaaaatggaa atggcaagga agtagatttt tttccctaga 421 gcctccaaca agagcaaggc cctggtttgg cccagtgaaa ctgattacag acttctgacc 481 tccagagctg taagggaata tatgtgtgtt gttttgagcc actcagttta tggtcatttg 541 ttatagcaat cataggaaac taacacatgt tttggcatct gaaagtgggg tgttgctgca 601 acaaatacct aaaaatctgg aagtggggtg ctgttgtaaa aatacctcaa aaatgtacct 661 gtagtgccaa gtatttggga ggctgaggtg ggaggctcac ttgagctcag gagtttgagt 721 ccagcctgga cagcaagacc cctatctctt taaaaaaaaa aatggaagta gttttggaat 781 agggtaatag ggctggaaga attttgagaa gcacaataga aaaagcctag attgctttaa 841 acagactgtt atcagaaata ttgatttgaa agattctgcc agtgagggcc aggaggaagt 901 gagaaacaca gtatagaaag cttgtatcac ctcagagaat atctatattg tcataaacag 961 actgttagta aacatgtgaa cattaaaggt gctgctggtg aaggttcaga aggaaattat 1021 tggaatttgg aaggaagagg gtccttgtta tgcagcagca gaaacttttg ctgaattgtg 1081 tcctgaagtt ttaggaaaat agaacttgta agcaatgaac ttggtattta gctaagattt 1141 tcaagcaaag ggttgaaagc ataggctggg ttcttcttgc accttataat aaactttaac 1201 aggtaagaga taaaatgagg gaagaggcgg ggcatggtgg ctcacacctg taataccagc 1261 attttgggag gtcgaggcgg gtggatcacg aggtcaggag atggagacca tcctggctaa 1321 cacagtgaaa ccccgtctgt actaaaaata caaaaaattg gctgggcatg gtggcaggtg 1381 gggcctgtag tcccatttac tcaggaggct gaggcaggag agccacttga acccaggaaa 1441 cggaggttgc agtgagccga gatcatacca ctgcactcca gcctgggcaa cagagacaga 1501 gcaagactct gtctcaaaac aaaaaaaaaa aaaaagaaaa gggggaagaa atgttaagca 1561 aaaaggaagc aaggcttgat gatctgggaa attctcagac tctccagatt gcaaaagatg 1621 ttaaacttcc agagatacac tgttaagaaa gcgggctcta gattgaaggc caaaggtatg 1681 gctagacaac cttttctagt gctgaagaga tcacatatgt gattcatgga gtccctcaac 1741 cgttttagca aaagccatga atagagatcg 
aatggtctat ggaggagcct tttgtttaat 1801 ggagtaaatc tctgtgacat acacaggaga ccactaaggt ttttgagagt attataacag 1861 cagcactgtc agcttggact gaaagggatg gacagagaat gaaatgaaag aagactgtca 1921 gagtcccaaa ttctacaggc aggaaacagg ctgataaaac tgctcagctg caaacatatg 1981 ctaccgttcc agaaaaagaa agatgacttt gaggtggggt gccttgggcc cagagggtgg 2041 agtctcaagc cacagagaat tgtttccagg tcttgagaca taatggagtt tgctcagttg 2101 gaattcaaaa ttgcttgagg ctagtgatga ctcttttcct tccattttct cccttttgga 2161 atgagaatat ctacaactgt tatcctgtgc ctgtcatgca attatgtatt ggaagcaaac 2221 catttgtttt ctagttttac aattccacag atggagagag attttgccca ggatggacca 2281 tgcccaaagt ctcctccaca cctgatttag attatatggg caacgagatt tagaactttt 2341 gagctgatga cacttaggta agattttggg cttaagttga tgctgttaat gggttgagac 2401 tttgggggat gttgggacag ggtgaatgtt ttttgcatgt gggatggatg tacaaaagga 2461 aagtatctta ggccccttca agctgggaac cactcaaggc aaatctgcct cccattctat 2521 tcaaagtcat ccctctgctc acagagatag atgcatagtc tgattacctc acatggaaaa 2581 acttatcaga aactcaaaag aatgaaccat ttgtatctta cctatctgtg acctggaagc 2641 tccttcccca ctttgagtcc tcctgccttt gtttcaagtt gtcccgcctt tccaggccca 2701 atcaatgtac ctcttacata tattgattga tgtctcatgt ttccctaaaa tgtataaaac 2761 caagctgtgc cccgaccatc ttaggcacat gttgtcagga cttcctgagg ctgtgtcaag 2821 ggcatgtcct caaccttggc aaaataaact ttctgaatta actgagacct gtctcagatt 2881 ttctgggttc acatttgctg aacccacatt tctgggttca ccatggggga ttctgagtgg 2941 agatacccct gacctttgac gaatctccta tcagtgcttg gtaccagcat gagctaactt 3001 tatggcccaa accaatagga caatttgctg aggtctgaga gcatctcctc cagagaatcc 3061 ctgagctccc aaaatttggt caagatctaa agtttatttc gtattttgtt gtacagctcc 3121 tctttttttt tttttttttt tttttggagt tttacttgct tccaaaacaa ggaaggcaag 3181 tttttcctgc ttccatgatg atggaaggca ggtaactcct ttatggagtt tgagcttgct 3241 tccaacagag aacataaggt ttttttgttt tgttttcctg cttctaggat ggtagagagc 3301 agtctacagc ctgagaccca tcactaggta agaaactggt ctgggattct gtcttgcaaa 3361 ttccttttaa aaaattaaag ttagcattaa caaccagctg gtgttaattt ctgcttacac 3421 ttagagcgct cagaaatcat agcatttgtg tgatcattgt 
tagatttact tcatggtttt 3481 gttgtttctg tcttggtcaa atccgaaggg gaactctaaa ttatgaggaa caagaccttt 3541 aaagtgggag aaaaaatggc cagcaaaaaa aaaaaaaaga ggaaagattt ttgattttga 3601 ctactaaatg ggctttattt acataacaag gccacctttt tgccagccag cccgcccaaa 3661 ctgaaagagc aatggttgta cttctgaaat agcagcattt tgtcctagct gaaacatggt 3721 aataagattt ttaaaaaatt ttgttaagga gctcaatggt taaaactcag cttaattagg 3781 ctggtcgcgg tggctcacgc ctataatcct agcactttgg aaggccgagg cgggcggatc 3841 acctgaggtt gggagttcaa gaccagcctt accaacatgg agcaatctca tctctactaa 3901 aaatacaaaa tcagccgggc atgatggcgc atgctgtaac cccagctact cgggaggctg 3961 aggcaggaga atcgcttgaa cccgggaggc ggaggttgcg gtgagccgag atcgcggcat 4021 tgcactccag cctgggcaac aatagcgaaa ctctgtctca aaaaaaaaaa aaaaaaaaaa 4081 aaaaaagtca gcttaattaa aaggctaaca tccaagatgt atgtgtgcat gtgtgcatgt 4141 ttgtatttga aaggccttca tgtttttggt ttcaacattt gtttttctat tctaagacct 4201 tgtctttttt tggagtgcaa gttttttttt tccttttctt ctcagttgac tgaattctgt 4261 tttcacctga ttttttgact aaaatagcta ttgcaacaga ggctagtctt gggtttttaa 4321 ggaagagtgt agtttaattt tatgtttaat ttggctcaga aaaattaaaa gcatctccct 4381 ctagcaccac cagacttttt ctctctgtac cttatgatgt acattccgct ctttgatttt 4441 cacctgagct gtttccttta atgtgcaaat ttaaggctat ttagctgaca actgcctagg 4501 gttgtaaaat aggttatcaa ggatctgaaa ctctaaaatg gtggggaaga aaggggggct 4561 gtttataaat cacaaaatgt acttccatca gcatgtctaa tgtgtttatg tgttgtattt 4621 atgtgttgtg tacacgtttc actactaaaa atacatgaga gctctaatta attggctaaa 4681 agaaaaataa aagcacttaa atcagatact aaaaaagaaa agactagtca aatgcttttt 4741 caagtttatg taacaagtaa aatctttaat aagctagctt taaaattatt ggtaaagtaa 4801 cattagaaat ttcttaagaa ttgccaccaa acatttttcg tttgcattta ttaatcaggc 4861 aatttcatac ttatttctgc caaatactat aaaggtgtta aaatttggca taggggttac 4921 aaaactataa acccagctca aaacaaatga tctttgtgta gtttttaata aataagacat 4981 tgatattggt ttaatgaaaa tagctgcatc ttaaatttag taagattacc ataccttcta 5041 atcttgtggc ttttaggtaa tctagtacac agggagtaag gaggtttgtt ttgggaaaag 5101 gctgttattg cctttgtttc aaaactaaac tataaaccaa attcctccca 
aagtccagga 5161 atgaacaggc acagcttgga gattagaagc aagatggagt tagttaggtc atatcttttt 5221 cactgcctca gttatatttt tgcagtggtg gtttcatgac tatcacagtt ttcataaata 5281 atctaagtaa acaattaaag taaaataagt aaatgtaatg ggaataaata cttgtaggca 5341 aacttgtcat agtttagaat ataaaattat atattataga tatttcatta tttgggtatt 5401 ttcttttctt ttcttttctt ttttttgaga cggaagtctc gctctgtcgc ccaggctgga 5461 tggagtgcag tggcgcgatc tcggctcact gcaagctccg cctcctgggt tcaggccatt 5521 cttctgcctc agactcccga gtagctggga ctacaggcgc ccgccaccac acctggctaa 5581 ttttttgtat ttttagtaga gacagggttt caccgtgtta gcctggatgg tctcaatctc 5641 ctgacctcat gatccgcccg cctcggcctc ccaaagtttt ggaattacag gggtgagcca 5701 ctgcgcccag cctatttggg tattttctaa taaatagata ttgtaggaaa accaagtgtg 5761 tcctttaaaa aaataggtga ataagttttg tctaattcag tttatttaaa ggttatgtat 5821 aaaacaaggt aaaaggaacc aggaaataaa aaaatatgta aagaaagtta taaaaataaa 5881 gaggtatttt ttgcggttag aaaccttaaa gagaaataat tttatatgag aaagaatctt 5941 gtatggtaaa tttagtccta aaataaaatg actggttgtt taagaaggag ggatattcag 6001 gataaatcag gaagtataag tgtcagaggc atgtgaacca gagcaactcc atcttgaata 6061 agagctggat ataatgaggt agaaacctac taggctgcat tcccagatgg ttaaggcatt 6121 ctaagtcaca ggatgagata ggaggtcagc acaagataca ggacataaag accttgctga 6181 taaaacaggt tgcagtaaag aagccggcca aaacccacca aaaacaagat ggccatgcta 6241 tcttgctaca ctctgaccag cgccatgaca gtttacaaat gccatggcaa catcaggaag 6301 ttaccctata tggtctaaaa atgggaggca tgaataatcc acccttgttt agcatatcat 6361 caagaaataa ccataagaat gggcaaccag caccccttgg ggctgctaag tctatggagt 6421 agccattctt ttgttccttt actttcttaa taaacttgct ttcactttac tctacggatt 6481 cgccctgaat tctttcttgc acgagatcca agtaccttct cttgaggtct ggattgggac 6541 tcctttcctg taacccaagt atgtcatgaa cagtcagtgt aaatcacaag atgatttatt 6601 ttttaaaaaa atgttaatat ggatcaaatt gtcatattat tatcaagttt cggtttgctt 6661 agaaaaaaac tgagaaaaaa atttttctaa attaaggtta ttatatccat gtacctcctt 6721 gtatgtgctt ttaaagtcct tgtgacattg agtaacaggg attcaactcc tgggtctaaa 6781 aaggacacca agtcctgcta aatcttaaac actgacagcc aattaaagcc ctatcttcag 
6841 gccctgtaga agatgccaat caaaataaac tgcattcttg agacacaggg caagaaatta 6901 aagctattca actcctcaag gcccagggac tatcgcagaa gaggtgggca cgtatgattg 6961 tgagggctga ttttgaaagg taaaataagt tcaggttttc tataaattaa tcattaatgt 7021 taaaggcaca cagatgcaaa accagcatat ggtcccctgt gtcagattaa caagggtttt 7081 tttttttttt tttttttttt tgagatggag tcttgctttg tctcccaggc tggatggagt 7141 gcaatggcct catctcagct cactgcaacc tctgcctcct gggttcaagc ggttcttctg 7201 cctcagcctc ctgagtagct gggattacag gcacgtgcca tcacactcgg ctaattttta 7261 tatttttagt agagatgggg tttcaccatg ttggccaggc ttgtctcgaa ctcctgacct 7321 caggtgatcc gcctgccttg gcctcccaaa gtgctgggat tacaggcatg agccaccgtg 7381 actgccctaa caaggttttc ttgaagtatt aaccaactcc ttaataaagg ttgtaaaggt 7441 tataaaaggc ttatggaagc tatatcatat ggtcaagatt aaaattttat agattattta 7501 taaaattttg aaaaacaaat ttaattggct tcatgctgtt tttattaggg cttatgattt 7561 ggaaaattaa gtctcctctc tcaaagataa aggtttttgc ttttttttga aatccttgag 7621 ttatcacttt ggttaaataa atgacttatt ttacaatgac ctgtgaacct attttgtgat 7681 atcaaatgtt taaaaccttt gatatttgac aaattctcca aaatcaaatt ataaattata 7741 tctttttctg acctaattaa tcttttaaga tattaggctc cctaaagtcc agaaaataga 7801 agtccaaaat ttggcttatt tggtacaaaa attatacagg aagcattatc aaatatgaaa 7861 ttgtgttggg ttttctttgg gttatatttg tataaatatg ttattggtat atgttctaaa 7921 attatgggaa actcctgtaa ttctgacata acagtatata ttatcagtaa taatcataat 7981 tgttatgtta aaattattgt gtgccacaga ggtaacaaat ttcctcatta attgtgtctt 8041 tgattgtggc tgccctaaaa ctttcatcat ccatggacaa ttgtcctgtt ttggtcctct 8101 ttagaaggta gttttataat cagctataca attctaacaa gtgctcttga atgcacgttt 8161 ctgataactt tggagattgt gacatcagaa tagaggaaaa actttcagga ctcatggaga 8221 gctaaaatgt ttacgagtat caagcagaac aggaattaac agcatggtct gaactaatct 8281 ttttgacttt ttgcttaaaa tgtttgctga tcctaaaatc acaagcattc ctatacacca 8341 ataacagata aacagagagc caaatcatta gtgaactccc attcacaatt gctacaaaga 8401 gaaaaaaata cctaggaatc caacttacaa gggatgcgaa ggacctcttc aaggagaatt 8461 acaaaccact gctcaacgaa aaaaaagagg acacaaacaa atggaaggac attccatgct 8521 
tatggatagg aagaatcaat atcctgaaaa tggccatact gcccaaggta atttatagat 8581 tcagtgccat ccccatcaag ctaccaatga ctttcttcac agaattggaa aaaactactt 8641 taaagttcat atggaaccaa aaaacaggct gcattgccaa gacaattcta agcaaaaaga 8701 acaaagctgg aggcatcatg ctacctaact tcaaactata ctacaaggct acagtaacca 8761 aaacagcatg gtactggtac caaaacagag atatagacca atggaacaga acagaggcct 8821 cagaaataac accacacatc cacaaccatc tgatctttga caaatctgac aaaaacaaga 8881 aatggggaaa gggttcccta tttaataaat ggtgctggga aaactggcta gccatatgtg 8941 gaaagctgaa actggatccc ttccttacac cttatacaaa aattaattca agatggatta 9001 aagacttaaa tgttaaacct aaaaccataa aaactctaaa agaaaaccta ggcaatacca 9061 ttcaggacat aggcatgggc aaggacttca tgattaaaac accaagagca atggcaacaa 9121 aagccaaaat agacaaatgg aatctaatta aacgaaagag cttctgcacg gcaaaagaaa 9181 ctaccatcag agtgaacagg caacctacag aatgagggaa aatttttgca atctacccat 9241 ctgacaaagg gctaatatcc agaatctaca aagaacttaa acaaatttat aagaaaaaaa 9301 caaacaaccc cagcaaaaag tgggcaaagg atatgaacag acacttctca aaagaagata 9361 tttatgcagc caacagacac atgaaaaaat gctcatcatc actggtcatc agagaaatgc 9421 aaatcaaaac cacaatgaaa aaccatctca tgccagttag aatggcgatc attaaaaagt 9481 caggaaatga cagatgctgg agaggatgtg gagaaatagg aacgctttta cactgttggt 9541 gagagtgtaa attagttcaa ccattgtgga ggacagtggg gtgattcctc aaggatctag 9601 aactagaaat accatttgac ccagcgatcc cattactggg tatataccca aaggattata 9661 aatcatgcta cagtaaaaac acatgcacac gtatgtttat tgtggcacta ttcacaatag 9721 ctaagacttg gaaccaaccc aaatgtccat caatgataga ctggattaag aaaatgtggc 9781 acatatacac cagggaatac tatgcagcca taaaaaagga tgagttcatg tcctctgaat 9841 ggacatggat ggagctggaa accatcattc tcagcaaact gtcacaagga cagaaaacca 9901 aacaccactt gttctcactc ataggtagga attgaacaat gagaacattt ggacacaggg 9961 tggggaacat cacacattgg ggcctgttgg ggtttggagg gctgggggag ggatagatag 10021 cattaggaga aacacctaat gtaaatgacg agttgatggg tgcagcaaac caacatggca 10081 catgtatacc tatgtaacaa acatgcacgt tgtgcacatg taccctagaa cttaaagtat 10141 taaaaaaaaa agttagctga tcctttgttt tttctgagtc taaaactttt cttttgagct 10201 
atcgacagct tttaacaatt tagtatattc tcatgaacaa aatttggggc atatttgttt 10261 ctctctactt gatatctaca gaatttggaa actatgagta ttcgtaactt atggcaatac 10321 agttatttgc ataagtgcaa taaaaatctg ttttcatttg taacaggaca caattggaga 10381 aactggttat tttaccaagg ctttgactgg aatagtgtac tttcctttaa ggaatcaact 10441 tggcttatgg aggaaataaa gcccgtggca aaactggcct catattttgt gtacgcagtg 10501 cctgtacaag gtttctgacc tgtggtaagt aaagaatgtc actttctaac aagtccagaa 10561 gccccaggtt tatcttggaa cctcaacagg agaggaaatt cacccaactc attgatattt 10621 gatgcgcaaa tccacgactg ggcttggctt taaaaaagtc ttatctgata tcccttctgt 10681 ggaacaaagt tccatcaaag ccaattaaaa aactattaaa aaataattat tcttgctgca 10741 ctgtatacaa ataattaggc caagtataat aaggcaaacc agtcctacca tgatttgtct 10801 tcagcaaggg aaactggaga gagaaaactt atgtttcaaa aagtatagta cacctgttgt 10861 tagattctaa tcctgcctaa tgtttttcaa tttttattat ttctacagtt tgggttgaat 10921 tctaaatttt ttcttggcta caagtcttca aaataatgtt ttcaattatt ttttcttttt 10981 ttccttcttt ttttcccatt tttcctaatt gggaatcact gaaagctaag ctgtgctttc 11041 ttaaagccct gcgaactgaa actagagaac ttaaacttca gaagaaaata acagcaacct 11101 atttacatac ataagacact ttcatacctg cctactgatg tatagacttc agaggaatgt 11161 ggcctgtatc aattttgcag gattgttctt ttgtttgttg ttgtttttct cccttcctcc 11221 tcctattttc tcttcagagg acactagact tcacaatctg ctaaaaatga gctttcagca 11281 cctgctcgtc taggaataaa ccatcccagc catgagagat cagatgaaac ctgagaccag 11341 agactcattt tcttgcaaaa tgctttctcg aaaagatttt agaaaagttg ggaaatgtga 11401 aaggaaagta tcttgggccc cttcaagctg ggaaccactc agggcaaatc tgcctcccat 11461 tctcttcaaa gacatccctc tgctcacaga gatagatgca tattctgatt acctgctttg 11521 gaaagactta tcaaaaactc aaaagaatgc aactgcttat gtttcaccct atgtgtgacc 11581 tggaagctcc ttccccactt tgagtcttcc tgcctttgct tcaagttgtg ctgcctttcc 11641 agaccaacca atgtacttct tacatatatt gattgatgtt tcatgtctcc ctaaaatata 11701 taaatactgg aacaatctct gctttgtgta aatctggttt aatttgattc tttgtgttag 11761 ctcagaaaat atttatagta aataccgaaa tggggcgaac atctttatca taaaaagtta 11821 acataaatta ctaagaaaag catcaagaaa ccaaaatata cacaggtgag 
ggccatgatg 11881 ggtctttcac aaaggagaac atgaaactgg taagcaaaca taggaaaaat tcatacttcc 11941 tagtattcaa tggaaattaa aacactgagg tttttatgaa tctttcaaat gagcagtttt 12001 ataaaaaact gaaacattta ataatggccg ggatagagac actctcatat cctcctgcag 12061 gggagtaaac cttaattcaa gtcttttgaa aagcagtttg agaatatact tcaaaagccc 12121 catgcgacat aagaaaaaat atattgatct ctgcccccaa ttcctgatac atatctaaaa 12181 cccctagaat ttcctgggtg atagaaggat cttttgttct aatgaggtga ctcttggtgg 12241 gctcctgggt gggcacctgg caccagaaag accaagacat ggttaggagc ttggaacttt 12301 cagccctatc cccccatcct caagggagga gagtgaggct ggagactgaa tgcacaatgg 12361 atcatgccta tgaactgaag cctacaccaa aatccctaat ctgcagagtt ccaagagctt 12421 ccaggatgct caacacatcc acgtgccagg agggcggtgt accccaactc catggagaca 12481 gaagctcctg tgctcaggac ccttcccacc tcaccctagg cacctctcca tctggctgta 12541 catctgaatc ctttctacaa aaccaggaga cgtaagtaat gtttcccctg acttctgtga 12601 gctgttacag caaactctca aacctgaaga ggaggctgtg ggaaccgcca atttgtagtc 12661 aagttggtca gaagtagcag aggcctgtgt cttatgcttg gctctgaagt atgggcagtc 12721 tgtggggcct gtgaggtgtg cactaactcc aggtaggtag tgtcagaatt gaactgaatg 12781 tcaggacacc caactggtgt acagagagtt gaagaaatga ttaagagtgg ggaaaatccc 12841 catacggttg gtcagaagtg ttttgagtag aaaaacagtg tttttataga ccccgataat 12901 tttactttgg cagatttaga aagaaatcta aatacggaaa aagaaatcta tggcctaaaa 12961 tttttttaac tggaaataat ctttaataag cacaacagtg agtatatggt tatagtgtgg 13021 tgtatctatt ctttggaata ctttttcact atttcttttt ttttgagacg gagtctcact 13081 gtgtcaccca ggctggagtg caatggtgtg atctcggctc actgcaacct ctgcctcctg 13141 ggttcaagca attctcctgc ctcagcctcc tgagtagctg ggactacagg cacgtgccac 13201 cacgcccagc taattttttg tatttttagt agagacgggg ttccactgtg ttagccagca 13261 tagtctcaat ctccttgata tcatggtctg cccgcctcag cctcccaaac tgctgggatt 13321 acaggcgtga gccactgtgc ccagctctct tacattattt caaataatgt tttaaagact 13381 acacaataat acatgcataa aatgcaatgt caagggggaa aggcaggcta caaaatgctc 13441 tacatactgt atttttgata aaggttgaaa ggaaatgcac caaaatatta ttattggtca 13501 tctttggaag atgaaactat tattttcttc 
tttatactgc ttaggatatc taagtttttg 13561 ttgttgttgt tgcttttttt ttgagacgga gtcttactct gtagcccagg ctggagtgca 13621 gtggcgcgat ctcggctcac tacgacctcc gcctcctggg ttcaagcagt tctcctgcct 13681 tancctcctg aatagctggg attactggca tgcaccaccn tgcccagcta atttttgtat 13741 ttttagtaga gacggtttca ccatgttggt caggctggta tctaatatcc tgacctcgtg 13801 atccgcccac ctggcctcca aatgcnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13861 nnnttgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggna acagcttttt 13921 tttttttttt tttttttctt tttgagacag cgtctcgctc tgtcgcccag gctggagtgc 13981 agtggtgtga tcccggctca ctgcaatctc tgcctcccag gctcaagtga ttctcctgcc 14041 tcagcctccc gagtagctgg gactacaggc atgtgccacc acaccaggct aattttctgt 14101 atttttagta gagacgaggt ttcaccatgt tggccaggct ggtctcaaac tcctgacctt 14161 gtgatccgct caccttggcc tcccaaagtg ctgggttata ggtgtgagcc actgcacctg 14221 gcctagaata tctaagtttt ctttaatgaa tgtgtatcgc tttaccatag aaacaaaatc 14281 tgtaaagcca gaaaaaatca tgaatgctaa ttagcgtaat acaatattat cttataattt 14341 agatgtggaa gcaaacaaaa tgtaatgaaa ctaaaagtgt tatccagatg tatgcattac 14401 ttcttttctc tccactgtca acaaagtaat ttctggctaa gtgggaataa ccatctgaga 14461 gctactactt cccttcataa atcaacatgt gtatagtagc acacagaacc caagaaaatg 14521 cagagaaagc caaggaattt ttaaaagaaa aacatttgcc aaattgtcta caaagtacgt 14581 gttgtgaatt tcataaatct cgttagaata tgacaataaa gtgtgtgtgc acatctttgc 14641 aatggtttac agaaacagac accaaaatgc taagcccatg cactacacat catagaaata 14701 tggaatgaag gtggagatgt ccttagataa aatggcataa ataagcagtt ctcaaagtgt 14761 ggtctgcaga cccccagggt ctgtgtgggc aaaatggttt tcataataat attaagactt 14821 tgttatcact cattttcact atgttgatat gtgcactgat ggaacaaaat cagtgttggg 14881 taaaactgct gacgccttag catgaacaaa ctctgtgacg ccaaacagag tagtcgatgc 14941 tgcattcttc actgctatga tattggtgat ttttttaaaa tggcagtctc agttgagaat 15001 gtacttaata aagcagtaaa aattattatt attttttttt attttatttt tttttttgag 15061 acggagtctc gctctgtcgc ccaggccgga ctgcggactg cagtggcgca atctcggctc 15121 actgcaagct ctgcttcccg ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 15181 ggactacagg 
cgcctgccac cgcgcccggc taattttttg tatttttagt agagacgggg 15241 tttcaccttg ttagccagga tggtctcgat ctcctgacct cgtgatccac ccacctcggc 15301 ctcccaaagt gctgggatta caggcgtgag cagtaaaaat tattaattcc atctaatctc 15361 tgcccttgat cctgtactgt ttttaatatt gtgtgtgatg ccacgggacg taggcataaa 15421 gcatttctgc tacaatttga agtatcttgg ttgtctccaa gaaaagcact tacgtcattt 15481 tgtgagtcgt aacctgtact gttttttgca tagagcacca tcttgaaaat atgacggaca 15541 acctgaagtg attcagactt gtgtatttgg cagacatttt ctgaaaaatg aacaaagtga 15601 gcctgtcact tcaagggaaa caactgatgc tatttgttgc caatgataaa atttgagctt 15661 tcaaatgaaa gtgagaattc tggaaaactt gtatctgcca ttatgagttt aatagcttct 15721 ctgtatcttt ggataagatt gacattaatg aatgtgatct tttttatctt gtataatgaa 15781 atgtgtcaac agttggaaga tctgcataac ttagtgaatc agtgttttcc aaatgaccag 15841 tacatcatgt tacaaaataa tgcataagta aaagatccat tcaaattgca agatgggcca 15901 atgggtatta atgtaacaga tatgaaaagt ttgttgatat gatttcatgt tctacattgc 15961 aactaaactt taagaaatta caactgttaa atttggtgtt atatcaaaga ataatccaca 16021 atgatctgaa aaggctatta aaatactcct atggttttca actataaatc tacataaggc 16081 caaattttct tcatatactt caaccaaagc atcatatcac aacagactga atgcagaagc 16141 aggtgcaaga atccaacggt ctttggttta gccagacact aaagagattt gtaaaattgt 16201 tggattaatt tttccttttt ctttgagaca gcgtctctgt ccgtcaccca ggctggagtg 16261 caatggcgtg atctcagctc actgtaacca ctgactcccg ggttagagtg attctcctgc 16321 ctcagcctcc tgagtagctg ggattgcagg catgcgccac cacacccagc taatttttgt 16381 atttttagta gagacaaggt ttcaccatgt tggccaggct ggtcttgaac ttctgaccac 16441 aagcaatcca cccaacctgg gcctcctcaa aatgctggga ttacaggtgt gagccattgc 16501 acctggcctt gtattagttt ttctattgct accattaaaa attaccacaa accacagtgg 16561 cttaacaaca caaatttatt ctcgtacagc tccgtaggtc cgctgtgtga catgggttct 16621 cactgagcta ccatcaaggt atcccagagc tgtgttccct tttggaggct ctaatgaaga 16681 atctgtttcc ttaccttttc tagcttctgg aggtcctgtt cattccttag ctcttgactc 16741 catcctcaat cttcaaggcc agcaatagtg catctccctc tgagccttca ctgtcacatc 16801 tccctctcac cagttggaaa aggttttcca tttttcagga gctgatgatt agattgggtc 
16861 cacctggata atgaaagata atctcccatc tcaaaatcgt taaccctaat catatctgta 16921 atgtcccttt tgccacgtaa aataacatgt tttataggtt cagagtatta aggtgtggac 16981 agaattgagg gccataactg ctcaccatga atgtgaaaca gtaccattct tcccactaaa 17041 tttgtttgca aaatagattt ttcattagaa tgtcatatct taacacataa tgggtttgtc 17101 attctaaagt gagtgaatat ttttaaaaaa tttctcagtt ttgctttcta acatggtatc 17161 ttggtccact taatgttgct gtaacagaat acctgaggct gggtaattta taaagaaaag 17221 aggttttttt ggcttatgat tctggagctg gaagttcaag aagcacggcg cccacatctg 17281 cctggcctct ggtgagggct tcccgctgca tcataacatg gtggagaagc agaaggagaa 17341 gcaggcatgt acaaaaaagg gaccaaaaac aaggagcagc tttgcgttat aacaacccac 17401 tatcatggta actagttcag ttctgcaaga actagccggg tctcacaaga agtccatctt 17461 aacgagctaa tcacctctga tacgccccac ctcccaacac cgctacactg gcaattcaag 17521 gtgagttttc acagggacaa accacatcca aaccacagca tatggttaag tattggtaaa 17581 cataaaccac ataaacaaaa gcttttggag ttctgaatta gttttaacag agtatagagg 17641 ttatgaagcc aagaaggttg aggattgctg gcttaaattt ttgaaatggt aagagcagac 17701 ccatcctaag agtgtctcaa aatcacataa acatcctcca tggaaagatg aaatcatccc 17761 ataatacaga gaaaatgctg gaaaaatgtg gtggtaagtg tcattaggtg agttgaccag 17821 aatctttcag atgattggtg ccaactccca gacacactgg caaaagctct agcaaagaaa 17881 gagctgcttt aaggtgtcct ttaagcatcc cttctgtccc cagtgtcaca catgactggc 17941 aaaccctagg gatgtgtccc tagtgcctat gctatggctg ggtttcatgc atgtcggact 18001 gaccttgccc tggaacccta ctggtgagac tattgttgac atggaatgtg gttctgcatg 18061 gctcaaaaga aactacttca ggctcactcc atggcaatct tatgtcagat attgctagca 18121 cttatctact cttctttttt tagagacaga ggtctcacta tgttgcccag gctggactca 18181 aactcctaag ctcaagtgat cctcccacct cagactccta agtagctgga acttacaggc 18241 atgagtcacc acacttggct tcatctactc ttctattttt tttttcatgt gaagcaaggc 18301 ttccaaattt gtgaacaata atatagtcta tcatttcaat gtcaaacaaa accaaacaaa 18361 gctcctacct caaatgagaa gagaagtagg agaaggaaga caactaaaat ttgtagagtg 18421 cctgccataa tcctggtcat tttggactgg tttctcttta attctctcta taaacctctg 18481 agagatatca tcctatcctc tcacaggtgg gaaaactgag 
gcccagcaag gttaagaaat 18541 caccctgggt caggcaacaa ataagtagaa gagacagatt tcagacctgg gtccatctgg 18601 ttctaacatt cacacgggtt ccgttccccc acgctgtctg ctaagcagtg ttgggaccct 18661 agagtgtgtt aaaagaaaac accatgctct gaacaccaaa acccgaagag aggccatccc 18721 tcctgaactg aggccaggcc caggaagaag tttactcatc cctcacaaat cagcatgtga 18781 aaagcagctg acgacctcaa aggaaaacca caaagggaat gaattcatca aagatccctc 18841 agccacagcc ccagtgcatg cactgccatg ggaccttcaa agtgaaggag agggccagcc 18901 actgcattgg gccagaggga cttgagttcc acatctggtt ctgctactca ttatccaagc 18961 acaattggga aacaattttg tctttgtaaa cctccatttt cccatgtgta aaaaagcaat 19021 aatgccctcc tctccctgag ttcctccaag aattaaaata aggaatccaa gacagagaca 19081 agacactcat gggtagtgag agagacaagt gaagaagacc ctcctgataa ctcccagtaa 19141 acgctgtgca aagaaatgga agcagagaga gctgcatggc attgagcata gctagtgggg 19201 catacagcgt aggaggaatg tggctggaaa ggccgtgtgt gcaagcaagc tttgatttaa 19261 ctgtatgtga gggaatcaca aaattacaat ggcttaaaca agataaaagt ggttttccct 19321 ccatgaaacc aagtaaggaa ttaggcaatg ccaagctagt ggctccacaa ttgaggacac 19381 agctccttcc attttgtggc ttcaccgtcc tcatggtctt catggcccag catggctacc 19441 tgagctctgg ctatcatctc catattctcg aaaggaagca gaatgacaaa aaagacatgc 19501 ctcttctcct tccatccgat tacctggaac ttagttacag ggcttcacct aaaatagtct 19561 gagggatttc tgagtgcagt ggcttatggg cagttgtcca tgtacctagg attcagggag 19621 aggactaaac attcctcatg tagcttgctg gtgccagggt gagcaatatg gactttgttc 19681 tcaagtaatt gtgattgtgc attttgacat gcctggcagt tccctggggg actggtgcag 19741 aggcttgtct tttcttcaac ccctcaggct ggtgcacctc cccaggtggc atcttgtttg 19801 tttttgagac agagattcac tctcgtcacc caggttggag tgcaatggca cgatctcagc 19861 tcactgcaac ggcacgatct cagctcactg caacctccgc ctcctgagtt caggcaattc 19921 tccagcctca gcctcccaag ttgctgggat tacaggcgcc caccaccatg cctgcctaat 19981 ttttgtattt ttaatacaga cggggtttta ccatgttggg caggtgggtc tcaaactcct 20041 gagctcaggt gatccaccca ccttggcctc ccaaagtgct gggattacag atgtgagcca 20101 ccgtgcccag cctgttgaaa acatttaaag acataaactc cccatagctg tctggagcaa 20161 gggacagcag atgggtcatg 
cagcacatag accaaaaatc ctgggaagga ggacgctgag 20221 gaggaagatg cttggggaaa gaaggtcgtt gaagggctac tgcatatacc agggaaccca 20281 ggctgcaatg cacatgtcca gggctggaca catgctcaga aaagatctaa gaggacccta 20341 ggcttgcacc tgtggctgat ctttgtattc catgcaagca ggaggtaaag tctagggaaa 20401 atggaaagag tggcttggct aagtgtctaa ggaatgcctc agctcagagc caatctgcca 20461 agatcagaag atttttcttt tgtgtttggt ccaggtgttt ttagtccagg tatttaagga 20521 aatctgtgtc aactgtggct gaccactgag gtaacagatc aaagacttca gtgtctacat 20581 ataaagaatg cagtctgcaa aaatcatatg gaaatgtcac taaacaaaga agactgcagc 20641 ccttaataag caataacagc aaatcttgca aagggcaaag aatctgattt ccagagttat 20701 cacattataa tattcaaaat gaccagtttt tttctttcaa tcagcttcct caggttgaaa 20761 tgtcgttttt aacagcagca aaaaaaatta gaaggcatac caagaaaaaa agaaaagtat 20821 aacccattca caggaaaaag aaactgacag aagccatccg tgaggaagcc tagacattga 20881 acttgccagg caaagagggc tggactgttc agccctaccc cttgacctcc tgggtcaatg 20941 attttatggc caatgatgtt gatcaatcaa tggtgcctac gcaataaaac ttccataaac 21001 acccccaaat ggctacgtct ggggagcttc tgctaacttc aggtacatat tatcagaatt 21061 gaattgaatt gtaggacaca cacttggtgt ctggagaatt gtttttgtgt ggaataaaaa 21121 aaccacacac ttggtgtcag aagtgttgtg agaaaaaata gttcagagaa gtattccctc 21181 ctcttctatt tcttagtaga gtttgagaat tggtgttaat ttttctttaa aggtttggca 21241 gaaaattcac caagctgaag ccatctggtc ctgctttagg ggaggttttt gattactgat 21301 tcaacctttt tttttttgtt tgtttttgag atggagtctc gctctgtcac ccagactgga 21361 atgcagtggc tcaatctcgg ctcactgcaa gctccgcctc ccgggttcat gccattctcc 21421 cgactcagcc tcctgagtag ctgggactac aggcgcctgc caccacaccc ggcttttttt 21481 tttttttttt tagtagagat ggggtttcac cgtgttagcc aggatggtct caatctcctg 21541 acctcgtgat ctgctggtct caatctcctg acctcgtgat ctgcccgtct cagcctccca 21601 aagtgctggg attacaggct tgagccatcg tgcctggcct tactgattca acctcttatt 21661 ggattcaatc tctttacttt tcctagcttg ttcagattta tttcttcctg agtcagtttt 21721 ggtcatttgt gtacagttgt tcacagtatt cacttaaaat gctttttttt ttttttcctg 21781 taaggtccat agtgatgccc ctactttcat gtctgatttt agttatttgc atctttactc 21841 
tttatttctc agtctagcta aagatttgtc agtgttgttc atcttttcaa aattttcatt 21901 tcagctattg tacttttcag ctccagaatt ttgatttggt ttatttttca taatttgtct 21961 ctttgctgat attctgttta tacatttatt gtttacctgg cttcctttag ttctttgtcc 22021 atggtttcct ttagttagtt aagcatattt aagaccattt aaataaacgt ttaagtaaac 22081 ttttaaacat ttaaataaaa gtctttgtct agcaagttca atgtctaggc ctccccaggg 22141 atggtttcta tcaatttctt tttcctgtga atgggttgta cttttctttt ttcttggtat 22201 gcttcctatt ttatttattt atttactttt tggttaaaaa ctgggcactt cgggcctggc 22261 atggtggctc acgcctgtaa tcccagcact ttgggatgcc gaggtgggtg ggtcacaagg 22321 tcaggcattc gagaccatcc tggccaacat aatgaaaccc tgtctctact aaaaatacaa 22381 aaaattagct gggtgtggtg gtgggtgcct gtaatcccag caactcggaa ggctgaggca 22441 ggagaatcgc ttgaacctgg gaggtggagg ttgcaggctg ctgagatcgc gccactgcac 22501 tccagccctg gtgacagtgg gagtctctgt ctcaaaaaaa aaaaaaaaaa caaaaactgg 22561 gcacttcaac ctgaaaaagc tgattgaaag aaaaaaaaac ctgggcattt tgaatattat 22621 aatgtggtaa ctctggaaat cagatcctgt taatgcttat taagggctgc agtcttctgt 22681 ttgtttagat catgccttag tctttctggc caccagcccc tatcctgaag ctagtcaggg 22741 gtccccagcc accagtcatt tcattagcat accaacaaca ttcatcacac ctgagattcc 22801 aaaggtttta gaagcttgtg ccaggaacgg ggataaagac caaatattta tttttattat 22861 atcacaactt cctaactcat ttattgagtc attatgcttc accttagggc atatgcagcc 22921 tcttcacatt aggggaaatt cagcctatca agtgcaacac tgaaagctaa ataacaacaa 22981 cacaacaaca accaatcagg gatgggcgcg gtggctcatg cctgtaatct aatcccagca 23041 cttcaggaga ccgaagcggg agggtcactt caggtcagga attcgagacc accctggtca 23101 acatggcaaa aaccccctct ctactaaaat tacaaaaatt agctgggcag ggtggcacat 23161 acctgtaatt ccagctaact tgggaggctg aggcaagaga atcacttgaa cttgggaggc 23221 agaggttgca gtgatctgag atcgtgccac tgcactccag cctgggcgaa aaagcaagac 23281 cctgtttcaa aacaacaaca acaacaaacc aggttgggga gttatttccc aggaattctg 23341 tagcaagggc tcatatgcat cagttagagt tggactttgt gaccttcagg tgtccttcct 23401 tttccaagaa tcgattattt ttataattct gttaaatgta attaatttct caaatttatc 23461 aaagatcatt atttggacca ttattttctt agccaaaatg ctggcctgcc 
ccagttccat 23521 ccagggttaa tctgcattta aacagctctt tgaaaatatt tggttttccc catccatcta 23581 tggccaattg atttttgaca agaagccaag acaattcaat aggaaaagga caatctcttc 23641 aacgaatggt gctgggacaa ctagaagtcc acatgcaaaa caatgaactt ggaccctaac 23701 ctcacactac atacaaaaat tatcttaaaa tggatcaaca acctaaatat aggagctaaa 23761 actataaaac tcttagaaga aaacacagga gtaaatcttc acaaccttgg atttgacaat 23821 gatatctcac acaggacacc aaatgcacaa acaacaaaag aaaaaacagt taagctgaag 23881 ttcatcaaaa ttaaagtttt atgtgcatga aaggacatca gaaagtgaaa aagacaatta 23941 cagaatggga gagaaatatt tgcaaagcat atatctgata agggtctagt gtccagaata 24001 tataaagaac tcttacaacc caacaacaaa aagacaaccc aattttaaaa tgggcaaagg 24061 acttgaatag gcatgtctcc aaagaagata tacagatggc caacaaacac atgaaaagga 24121 gaatgcttaa tagcattagt tattaggaaa atgtaaacca aaaccagaat gaggtaccgc 24181 tgcacaccta ctaggataac aataaaatgg aaaataacaa gtgttggtga ggatgtggag 24241 aaattgggat cctgtacatt gctggtagga atataaaatg gttcagccat cgtggaaacc 24301 acttggggtt ccccaaatag ttaaacactg aaggctgggc acggtggctc atgcctataa 24361 tccagcactg tgggaggcca aggcaggcag ataacttgag ttcaggatca agaccagcct 24421 ggccaacatg gggaaacccc atctctacta aaaatacaaa aattagccag atgtggtggc 24481 acacacctgt aactccagct actcaggagg ccgaggcaag ataattgctc gaacctggga 24541 ggcagaggtt gcagtgagcc aagattgcgg ccactgcact ctaggctggg caacagagca 24601 aggctctgtc tcaaaaaaaa aaaaaaattg aattaccatg cgatccaaca attcaacccc 24661 tatgtatgta ccctccccaa ataaaaatgg gctctcaaac aaatacatgt acatgcatgt 24721 tcatggtagc attattcaca aagccaaaag atgcaaacag ccccaatgtc catagatgaa 24781 taaactgtgg catacatgat acacacacac acacgcacac acatatacat atacacacac 24841 aaacactatt cagtcataaa aaggaataaa gtctgttaca tgctacctga ggatgaacct 24901 cgaaaacatg ctaagtgaaa gacacaaaag tccacacact gtgattccgt ttatatgaag 24961 tatctaaagt aagtaaatat agagacagaa gtagactggt aattgccagg ggctgggggg 25021 gaagagggca ttgggaagaa acttctcaat gggtatgggg tttttctttc ggaatgagga 25081 aaatgttttg gaactagaca gaggtggtgg ttgcacagca ttgtgaattc actaaatgcc 25141 actgaaatgc ccactttaaa atggtaaatt 
ttatgttatg tgaatttcac cttaagttaa 25201 aaaaaaaaaa gtaaaactac tcagacaacg ccaaattatt gacaatctca acctacccca 25261 cagaccccaa atcctggaac cacaaccccc taggccaatt ctcaggtcag gcagcaattt 25321 attcctgttc aattttatgc agagctcctg gtggctctga agcgccctag ggaacagacc 25381 aggaacattc tcatggtgtt agctcacatt gggccattag tgctccttta atgtgagaac 25441 aaccgggagg aggaggggat gtggacccaa aactacaaga aagagtgtcc tcgaagccta 25501 tgtcctaccg ccccacgctg ctgccaggcc cgcaggaaga tgacaggccc ggcctccact 25561 ccttctaagg tcgtcgctta gttccgacgt cgggatgacc ctgtcatcca cgcggcgtga 25621 aggccaccct ccccgcgcgc ccgggactcc aggtggggcc ccagtggacg agggaacgcg 25681 gcgtcgccca ccggggtgtg cctgggcggg cgtggggcgg ggcctgggcc ggcgccgggt 25741 ccggccgggc gctcagccag ctgcgtaaac tccgctggag cgcggcggca gagcaggtga 25801 gcgggcggtg ccggggggtg cccaggccag ggccctgtcg cctgcggcgc tgagggcccg 25861 gggtggggct gcgccctgag ggccctgccc tgccctccgc acgcctctgg ccacggtccc 25921 ttccccggct gtgggtctgc ggcccctgcg tgcgcagcgc tcctggcctc tgcggccagc 25981 gcgggggcgg agagaggaga gtgcccggca ggcggcggct gggccggccc ggaactgggt 26041 cgtggaagga tcgcggggag cggccctcag gccttcggcc tcactgcgtc cccacttccc 26101 tgcgcccgcc tgccgccgag ccccggctgg gggtgggcgc ggcgcgagcg gttaaagggc 26161 cggtgcattt aaaggagcgg tgcacgtggg tctctgaggc gtgtagcagg cgggggcgtt 26221 ttgttcttct tctctctcgc cggagacctc cgttgcgccg agtccattcg gcctctagca 26281 ccgggtcctg ggcatgcttt ccccgggaag gaggcgcgcg ggggctctgc ccgcacgtga 26341 ggggcagggc cgcaggctca agcctagagc cggtttctgt tagcagcggt gtttggctgt 26401 tttatcaggc atttccagca gtgaggagac agccagaagc aagcttttgg agctgaagga 26461 acctgagaca gaagctagtc ccccctctga attttactga tgaagaaact gaggccacag 26521 agctaaagtg acttttccca aggtcgccca ggtacgatat agcagagcca ggcttcgacc 26581 ccagtgtcct ggcttctaga tctgctgtcc atccctccga gcagacctca cccctgttta 26641 ttgccttaat aagtattccc tttgaaaggt atgaacggtg ttgagtgaag taactgcatc 26701 cctatttaca aatggagaac ctgagagcat tccatagaga cgattgtaga ctaacttaac 26761 tcagaagcga cagcctgggg ttgccaaggc tgtctacgaa gtaacttgat taggaccgac 26821 cccagcttcc 
agtaaggaag cctctgatgc ctctgtagcc aattctgcag acacctgagc 26881 ctccaaggcc ttcagccaag acctttggcg gtaattggag tctcgggata agctgcttca 26941 ggtgtgtgag cctcaggttc ttctctcctg aatgtggttg tgggcagccg gtgactggcg 27001 caggtgcaga aggggcctgg ttcttggccc cacctcagag ctgcgtcctc acgacgccca 27061 cgttgagcct tgggttccag ggcagagact ggagtgaggg cttgggggca tgttgctttg 27121 aagtgggatg gatgtatcag gtttttgggg aaaactctgt accctttggt gttgaagtgc 27181 ccatgtgcca agtcttgagt ccagcatgtt cacatgtggg gagtgagtgg cttgttcctg 27241 tctatttgaa agagcagcaa ggaggaggag gagcaagggc taggggctgc tgctggggtg 27301 cctggagctg tggtgcataa tgtcacacct gtctcccctc cgtagctgct caccgtcccc 27361 ccaagggggg tttgcctctt gcctactttg gcctttctct gttatcgatg ttaataatga 27421 catatatctc gcttatgagt tggtcataat aaaaagctat cttgtacaga atattagaat 27481 ttaagatctt aagaatttca atgacactga aatagtttat tattaccttt ttacagaaga 27541 ggaaacaagt tcagggagtt aagcagctag tccagttatg tggcttcagt gctttaagca 27601 ccaggatttg aacatagctg gctacactgt ctttatctct tgagtttttg cgcaggaggt 27661 tcctgtattc aactcctacc cgtgtctctc cactactgct gggaaagttt tgtggagtcc 27721 ccatgagcaa cttcctgaca aacaaacaaa atttttttaa agaaaccaaa gcagtgtgtg 27781 taggtcacat gcagtgtgtc taatgaaaac atctctggcg ggttttcagc tgttgctttg 27841 actttcggac actgtttagt tggggactga taagacagca aatatttctg caagtattcc 27901 cacctgttct attcccagct gccacagctg cggaaaggcg ggggtgaggc tgagaggccc 27961 cgagaggaac attttccact gggctccaat cctggagatg ggatgaccat catgttaatg 28021 tctggagaaa agaatgattt caggctgggt gctgtggctc atgtctgtaa tcccagcact 28081 ttgggaggcc gaggtgggtg gatcacctga ggtcggaagt ttgagaccag cctgaccaac 28141 atggagaaac cccatcttta ctaaaaatac aaaattagcc gggcgtggtg gcacatgcct 28201 gtaatcccag ctactcagga ggctgaggca ggagaatcgc ttgaacccag gaggcggagg 28261 ttgcagtgag ccgagatcgg gccatggcac tccagcctgg gcaacaagag caaaactcca 28321 tctcaaaaaa aaaaaaatgg tttcacatca gtcctcagga aagatcagat gtcagtgagg 28381 gagatcattt cttgagagcc tcttcactga gtgggagaat gggctgcttg ttcatctttg 28441 tgaaaattct agaacgggaa gaacaattca aagggtgtcc accattctgc tgtaccttaa 
28501 ccagaaactt actggactct ttttaaaata aaagtaattc atgtttattc tagaaaatta 28561 gggaaaaaaa attttttttg agatggagtt tcactcttgt tgcccaggct ggagtgcgat 28621 ggtatgatct cagctaactg caacctctgc ttgccgggtt ctagcgactc tcctgcctca 28681 gcctcctggg tagctgggat tacaggtgcc tgccaccact cccagctaat ttttgtattt 28741 ttattagagg cggggtttca ccatgttggc caggctggtc tcaaactcct gacctcaggt 28801 gatccgccaa ccttggcctc ccaaaatgct ggtattacaa gcataagcca ttgcgctggg 28861 ctgagataac cactcttaac atgcatttcc ttccacactg ttcacatatg catatttatc 28921 tattaaaaca atagagggag ggaaccgtag tgaagtttgt gtatcctgtt ttttctgtta 28981 tgccttcaga attttccttt tattgagtac tcatggaaaa gcagatttga tggctgtgta 29041 gagcatttga attatttatc ctaccagtcc cacagcagga cactttccca ccctccctta 29101 tgccccagag aagcagtgcc ctctgtcctc cccatgccca tatgtgggca ctccccacca 29161 tggagccaaa cctacctggg caagtagcag agggagagca gagtgagccc tgggggcagg 29221 agagagactt gagagttttg aggtgacaga tgagctggtg agtgagtgat tagggagcat 29281 ttcttgacac atacctgccc gtggtgaagg catgtgtctt gtgagtgtgc tcccagaaag 29341 cctgtgtagt gtgtggtggg cctgcctgtg tgaccaaacc ctggccactg ggtacgtgac 29401 cctcacaagt gctgactggg ctgagaagag ctccttgatg ggcagtttgg agacttgagt 29461 tgtaactgtg gcttttggcc atgggacatt aactgattac ttttgcccct ctaggcttca 29521 gttgtcctta attataatac agggagctga ctaggtggcg ttgatggcct ttacggtcct 29581 tgccagctct gacattgtcc tatggatatg tcctttcatt tgataatatg tttacgtggc 29641 catagtgcct ggggctgggc cgggaatgga aacttgatct ctggggcctg gcctttgaag 29701 ccagttcatg tgtctggtgg ttcagcagat ccgtaacttt ccaagaggca catccatagg 29761 ctaccgtgtc ctttctcact gtgtccctcc tccatttcat cttctttata actacgactt 29821 attgaacatc tactgtgtgc tggacacttt acaggttatc tctaggtttt acgataatct 29881 tgcaaggtat gcctgttctg ctttttacag cagaggaaat gagctgtgtc agattagact 29941 gtctgaggcc tcttggccag ggagtatgtg gttcaaatca cataggcagg cgatctgaac 30001 cctgtcagtc tccaaagcct ctgcttttga ccgctgactt gctgctgctt gtttaaaaat 30061 aaatgtgttt ctggagccta ctccagaggg gcgtgctagg ggctccctct cccacttccc 30121 cacaaaccac ccttttccct ggctgcttca ggaaatgaga 
gaactctgcc tgggccccag 30181 gcacttctga gtgggacagg gctgttagag gtaagtctag agcctggccc aaaattcagg 30241 aggccccatc agagggcccc tggggcctgt ggtccgggag ggtggtaggg cagtacctca 30301 cttccctttg agactcaggc cccagctctg gcttaggcca gggagaacca tccccaagtg 30361 gtatgtgtta ctatatgagc tgagatggat ggtcagctgg accaaataca tagtcgggta 30421 cccagggcca gggggaggaa ggtgagcagg gaagctgtgg gcaattgtct gggtatcacc 30481 tgaccttagc aaactcttcc ttgttttaag cgaggacgtg ggacttctca gacgtcagga 30541 gagtgatgtg agggagctgt gtgaccatag aaagtgacgt gttaaaaacc agcgctgccc 30601 tctttgaaag ccagggagca tcattcattt agcctgctga gaagaagaaa ccaagtgtcc 30661 gggattcaga cctctctgcg gccccaagtg ttcgtggtaa gtgcagtgac tcccaacctg 30721 cttttgaacc ctctttttcc attaggattt tctccgtgga ggcagatttc catgggagtt 30781 tgctgtggca ttttgaaatc tgtttcttac ctagttccat tggccttaaa tgttaaggcc 30841 aaagccttta catttctctg taatgaaaag aaggtcgagg aaattgggtc attgggtttc 30901 cataatgatt gcaggaactg ctgacacaag cacggctggg gagattctct aggtcagact 30961 cccttggttt ggctaattca gcagtttgat cccattcagc tgattaatgg gaatgtgcag 31021 tggcttcttt ggatgtttga ttttgcatcc taatccaaag cagctatcag cctcagcact 31081 tccttgttgg aaggctttcc agaacgtagt ctatgttgga cacttccttc tgcctctctg 31141 cattttcctg ccacttctct agagaatggg gtgcaggggg tgggagacgg ggaaagctgg 31201 tcgctgagtg gctgatggga cttgacatca cccagcccca cccccacctg cccgtgagtc 31261 agcctccggg gagagttcat cgcgtcaccg gcactctaat gtggacagac acctagcagt 31321 gttgtttatc tgcacacgtt tgggtggtga tttttccctc caaggatttc agagcaccag 31381 caggcttcag agcagactta ggtggcttgc aaagcaggcc ctcaggaatt cagagggtag 31441 cagaagtcca tcccagatgc tctgttttcc ttcaggagct aggtaaatca gaggggctga 31501 gggacaaatg aaaaaagtta cagcctttga gtcccatctg ctcctcctgg ccaatgagag 31561 gggatctggg aggggcagat gtagaggaaa atctgtctaa atgttgatgc tcgttatttt 31621 cctttaaaga attaatagcc taaaataaac cctacagata cagtctgtgt ttattatggc 31681 gacttagaga aatgcagaaa aatatcaaga aaataaaaac cactcttggt tctaccatgc 31741 aaagataatc atctttaatg ttttgtaata tttccgatct tttatataca tacattttat 31801 aaggacattc agatgatcag 
gttcgtaaag ttttatgttc ggttaaattt aacagcgtgt 31861 cattgttcag gttattaaat gtttgaaata agatttttgg tggtcctgtc acagtctcca 31921 tgaagtagca tttcaggatc gaaaggtatg ctgtgtttaa agtgttgatt cttactcctt 31981 tcagttaagg ccagtgcagt ttgtccaggt agtgactgag acccagtttt tccacactct 32041 cctccgcagt gggcattgtt ttgggccttt ttcagcccaa gagctctctt ctccccatgc 32101 cgctctgctg gtctgagatt tttccactcc tcctcctccc tagttgctct ctgaccagac 32161 tctaggtatt caggagaaag tgttcattgt ctcactctct catgtggcaa tcaagtagtg 32221 ccaagcagtg agagggtgaa ggtgggtggg tgagggacac tcaccttgct gagaaagggc 32281 cccagcctgt tcgggtgatt ataaagcaga gacagtgcca ggaaaagtct gacactggct 32341 gagaatcacc cggggaccaa ccatcccgaa tgcggatccc tgacactggg tgaggatgga 32401 gcttggagat ctgcattgtt aataagcagc ctagcagagt ggtgaagagt ccagacacac 32461 tacctaggtc caagggtaac cttgagctaa ttactttttg agcctctgtt tcctcatcag 32521 taccatgggg aagaatagta gcaccttgct ccaggatgtt tagtgccggc taagggctca 32581 gcaggtgctg gtccatctcc accagccccc agtggcctgg gccacctttg agaaacagtg 32641 atcctaaggg attcagcatt tcctaagttg gtgcctccca cctgtcaccc ccaccccacc 32701 aggctaggag ggttgtgatt agagggtgcc cttgctgtga cagctgagac tagctcttcc 32761 ctgattattc cttaatgaca gctctctcct tccctgcttt cttgaagtct tggtcctcgt 32821 tgttgtgggc acagcttcag gggaggcctt ggaggaattt ttgaaagtgg aatgagggaa 32881 gcagcctgct caagggaaca cttgttttct ggtgaggagg ccgcatgtat gaatgacgtt 32941 tgtgggttag aaagcatgtt ttgtagtttt tccttgtttc ttcctgaaga catgtcaggt 33001 cttgatgaga ccgggcctgg gcacagggca ggcagtcagc gagtgtggat gatgacgaca 33061 gtggtcacca ggtcactgtc tagaccaggt cactgtctag cgcagtgtca catggaaagg 33121 gtatggtcct ttaaccctac cctccccagc acaactatca cagatgtcag ggaacctctg 33181 ctcacagaac tgctttccag ggattgtctt ttttttcttt ttctttttct ttcttttttt 33241 tttgagacag agtctcactc tgtcgcccag gctgaagtgc agtggtgcga tctcagctca 33301 ctgcagcctc cgcctcctgg gttcaagtga ttatcttgct acagccttct gagtagctgg 33361 gattacaggt gcctgccacc atgtccagct aatttttgta tttttattag agacggggtt 33421 tccccatgtt ggctaggctg gtcttgaact cctgacctca ggtgatctgc ccacctcaga 33481 
caggcatgag caccgcaccc agccccaggg agcgtcttat tagtggttgg caactgaatg 33541 gagacgtggg aattgtaagg aactgattct acttgatcct gggtcccctg cttctccatc 33601 ttcacccacc catcagctcc ctttctcctt taaacaggca cctttgctct ctgcttatcc 33661 atttttgttg tgcattgcta tttgggagcc taagaaacac aacatcctct gaatgctcca 33721 gctgttgtgg gtctgaaggg tgagcctgcc ctctgtcatt ggaggctgca gcctgtggct 33781 ttttaggtac agggactccc agaactgctc ctccagtcat agcagagata aatcacagga 33841 gcttaagagg catgggaaga acagagggag gagatcgtag cttccctgtt cattcacacc 33901 caaaacaaaa ctgtcatact agaaaaggag gtattaaaag agccacctgt acagcctcgt 33961 atctcatcca gcacactgct gcagatggaa tattatgatt tagcttgaga aaatgcagca 34021 actctttgtt gtggtgcccc tctttgagta agagtgaatt ccccattgcc agagtggata 34081 gtgagggaaa ccctgggtcc aggcaggagt ctgtttagga tttatctagt gaggctgagc 34141 cagaggagga ccttacagtt ttttctcttc aatttctttt atttatttat ttatttttgt 34201 agagatgggg ttttgccatg ttacccaggc tggtcttgaa ctcctgggct caagcgatct 34261 gtctgcctca gcctcccaaa ctgttgggat tacaggcgtg agccgagcca ccatacccgg 34321 cccttctcct gcatttccac ctgataattt ctctcatttc catagatgat gaaggaacta 34381 aagccaagaa ctttccaagg tcctgcagct ctttggggga tgtgaagctg tgctctattt 34441 gtatggattt tgctggttcc cagaacttcc ctgtggccct ggggcctagt ctgagggtac 34501 tctgagtgaa gagggaggag ggcccacacc tcttctgcaa aggctgcttt tgtaaagttc 34561 acttcagttc acatcttcct cctggtcaga aagcttcggg ggctctcctc tgctgcatta 34621 agctcttact cctccatcag gcaccaaact cctccctggc atggcccatc ctaccaggtc 34681 cccacacttg agccacatcc aattgctcga tattatcagg ataggttatg ttatgttccc 34741 aactcatatg tttacttaag tggttacctc tttccagaat gagccccctc ctccaaactc 34801 tgcctggtga aatattccta acctttgcag cttcacatcc ctcttacttc ttgtgacctg 34861 aggcatctac tcctgacaac tgatagactg tgtcccctcc tgtcgggtgc attgtccttg 34921 tcactaccct cctggctttt agctggcttt gcttcccgct gttgttactc ctgtacttgt 34981 ctcatctatc ctaaacagaa ggtgctgcag gctggggagt ttgttcatgt tgaaatccct 35041 gtgatggagg tgagcagagg cagtctctgc ctgtgcctct tatttgggga tgaagttaaa 35101 gtccctgtag gaataatcca ggccatagcc ggggttgctg tcttcagaaa 
gaagggcagc 35161 cacaggtctt gttaagggga ttgaaattgg ctgacttggt ggaaggaacc tgcctgcttt 35221 gtttaaaaac cacatatagc tgagtgtagt ggttcacact ctgtaatacc agtgctttgg 35281 gaggctgagg caggaggatc acttgaggcc aggagtttga gactagcttg ggcaacaacg 35341 tgagaccctc atttctacaa aatattttaa aaattagcct agtatggtgg cgtgcatctg 35401 cagccctagc tactgaggag gctgaagtgg gagaattgct tgaacccagg agttcaaggc 35461 tgcagtgagc tatgattgca ccactgtact ccaacgtaag tgaccagtga gaccctgtct 35521 ctagaaataa aaataaaaaa aatcacatat attgtggggt gacttacttg gagacgaact 35581 ttcagcagag cgcacacctg ctatccctgc ccagggtgtg aagctcagcc ctgagggtnc 35641 tctggacagc gatcactcag cctctggaca gcgatcactc agcctctgga cagacagcga 35701 tcactcagcc tctggacagc gatcactcag cctctggaca gcgatcactc agcctctgga 35761 cagcgatcac tcagcctctg gacagcgata actcagcctc tgtccccgtc tgagatgttg 35821 gcagggactg tcagatttgc caggcattgt ttgaagttct tcccagccca gaaacctgca 35881 tgtgtagatt ttggtacact gggtccccca cttggtacta ctgtgtgaaa ccccacttgg 35941 cactgtttta gggggcaggc ttccctcctg tccccttggc cttggccttc ccctgggtcc 36001 cgccctcagt ggcacttccc cacctcacac gtctgctctc atggcttagg tctccacttc 36061 taacctcagg agacctggtc ctcagacacc tcccagacag cttccccatt ttatcccata 36121 gacactcaaa gggtgaaatt catggtcttt cccgagactc tcttctccgg tcttccctgt 36181 cttagtcccc acctggctgc cattctagac tgtgttttct ctcctgcgtc aggtcctgcc 36241 cttgacctct tgaccccttc tggactctgc cctcacttgc atccatcctg ctgctgtcct 36301 cagctctcct cacctgccac agttgtctct gggggtcact ttccctctct ggtcagtggc 36361 cagactgact tttataaacc tggttcagat cttgtctgtc aagattgtcc tcagggtgat 36421 gtgtgtctcc ttaacatggt gcctgagacc ctgatcatct ccactgcccg ccccacagtg 36481 tgaggccctc actggaacat tgtgccttct gcccttccct cctcctggga aaaccagtct 36541 ccatcagata ggctcttctc tagaaaacat tcgtgatctc tgagatttgg ttccactttt 36601 gtgcttctgc acctaccatc aaacacccgg attgtatcat ttgtcacatt agatgatttt 36661 tgttttgttt taagacaagg gtgtttctta ctcatctttt tatccccaga gcccagcatg 36721 atctttggtg catagtagat gcaccacaga tgtttgctga ttgaatgaat gagcacactg 36781 acagtttgga gctgccctga ctttcgtggc 
tatgcgtttt gccccctggg atgtgagtca 36841 cctcaggcca gccccaggca aggccgctgc tgcctccatg gtaactctca aggcctcttg 36901 ttttatggca gtcgtttgat tgacaggcat ctcttggaag cttttggggc aggacttgtg 36961 tccaagtctc caggtcgcct ccagccaccc cctgagtcct ccactgcctt tgtctcacag 37021 gaaagtggaa caaggtcctt gtgctccttt ttccaggtgc ttccagaggc agggctatgc 37081 tcacattcat ggcctctgac agcgaggaag aagtgtgtga tgagcggacg tccctaatgt 37141 cggccgagag ccccacgccg cgctcctgcc aggagggcag gcagggccca gaggatggag 37201 agaatactgc ccagtgggta ggtcccacca gcagctgggg gccttcaaac aggtcctgcg 37261 gctactgtac cttacagatg aaaaccagac attcattccc tgatgcggga gggagaaggg 37321 aagtaatgat gaggattggc cgaaaaggtg ggtggctggc catgatggac cttccatctg 37381 cagggtttca taggactgcg cattcacagc cagagatgga cttggcagtg ggctgaagac 37441 gctgtccact ctgccacctt gggtttacct ctctcatgca ggtcactgtt tccactgtaa 37501 taggagagtt tgtttggatg cctgggtgct aggacaggta acacagaagc ttaggatggt 37561 agcaggggaa gcattttttg gcagatggcc agacatggta agtgtgagag gagtctgcct 37621 gatacacgat tgacttttga gctggggata tttgggcttc actgtgatca ttcagccccc 37681 aggggaggag attgtaacgt tagaaagagt aggatatcgt tgggagagcc acttagttgt 37741 gtcctttctc tcccgatcag ggcagaacat ctgaatttgc ctgaaccctg ttctctgttt 37801 tgcccattat agaattaaaa aatgtctctg tgtggactgt ttttttgcag ccagtcttaa 37861 tcctgcttgc tgaaatttga gctcacttct ccatgttctc cttgagaacg gaaccatcgt 37921 ccctaagccc tgagtgaaat cacaccagct taaggccact gctctgccac tcctcagcct 37981 tttcttgttt gttatctccg ggaagttttg tacactttgg ttgtttcagt ttctgttcat 38041 gagtagtctt ctttcttggc tgaacgtcta gattgggact ctctctgcag agaaccggta 38101 ctgaagcaac tgtcattttc agtttttgtt tcatttggct ttttctttag ctgttcacct 38161 tattagcaag gcagcccatg accttgactt gccacagttc caaaacacaa attcttacag 38221 atcggtttgt gctagtgtct ggcaggtgtc ctgccctccc tcgttacctc ctcatttgtg 38281 cctgcccacc ttcccagagc ctgcgtcttc tcagatgctt aacacctgtt tagcctctct 38341 agttcagagc tacaaattta catgcttgat tctgtggggc agaaagttca aagtaatttc 38401 ttcctctgca aattcccagt atcttagtca cacgcaaaga gagtgtccct gtgcactgac 38461 tcctctagct 
agtgatttgt cagccaaaaa tgtttattta tctcctggcc tgtctcctcc 38521 catatcagta tggccacatg aacagaattg agtgacctcc tgagtccctg tattaggaag 38581 gggaaagatc ttttgattca ttaaccatta agttgattca ttaaccatta agtcttgggc 38641 ctgcagacca tagcaacctt ccttccttca tttatggtgc ttcatccagc tccaaatctt 38701 ctctactttg tcctcacaaa cttttcatat gccctagtag ctcatagact gctccttata 38761 tctggaaagc aacattcaaa cttctcattt ctggttccaa aaatccgtgc attacatgga 38821 taggctgcca tgggggacat tccgcggccc tcacgatgtg gtttcccaca gagaagccag 38881 gagaacgagg aggacggtga ggaggaccct gaccgctatg tctgtagtgg ggttcccggg 38941 cggccgccag gcctggagga agagctgacc ctcaaatacg gagcgaagca tgtgatcatg 39001 ctgtttgtgc ctgtcactct gtgcatgatc gtggtggtag ccaccatcaa gtctgtgcgc 39061 ttctacacag agaagaatgg acagctgtga gttggggggc tggggggagc agggtggggt 39121 gagggctgag ttgccagggg gtggggggcg cagcagcctg tgttggtcac tgtacctgca 39181 gctccacacc agcagcggta aagagcaggg atgaagaacc gcccaggttc atggcctggc 39241 tcactgcctc ctggattgtg acctacttgg gcatgctttt aacatcccta tgcctcagct 39301 tccttgttcg tataatgggt tgataacgca gttactggga gaattaagtg agttaatatg 39361 agtgaagggc ttagaagagt gtctactgca cgtgagtgct caggcaagct ggatcctgct 39421 gcagaaagca agctcttgat cctgggcatg gctgtgccac tgatccctgt gtgactgcaa 39481 acaaatcact tcctctctga gtctctgctt ccctgaatgt gaaacaaggt ggttggacca 39541 gatatttctc agctcacttc cagccttgtg aggaagactt ataaagcctt tcgtttattt 39601 tagtaaaata catgcagagg cagcagcgta gaaaaatgag aagcttcctc cacttcttcc 39661 ccctcccctt tctgtggtcc tcactgctaa gcaccttctg taaacttttt tttttttttt 39721 taaagttagg gacttttgtt tcatttcgtg tgtgttggtt ttttttgttg ttgttgtttc 39781 ttttaaagaa aggaataagg ccaggtgtgg tgtctcatgc ctgtaatccc agcactttgg 39841 gagactgagg tgagaggatt atttgagccc aggagtttga gaccagcctg ggaaatgtgg 39901 cgagaccctg tctgtacaaa aaatgcaaaa attagccagg tgtggtggta catgcctgta 39961 gtctcagcta cttgggagac tgaggtggaa gaacacctga gcccagaagt cgaggctgca 40021 gtgagccatg attgcgccac tgcactgcag cctcagcaac agagtgagac cctgtctcaa 40081 aattttttta aaaaattaaa aaagaagtag agtcccatcc tcagaaagct tatagtgtgt 
40141 gggggattca gcgcagaaca ggtgaaagca tggagagaat gcagccagcg gtttgtttgc 40201 agcagtccag gctgggaaga gtgaggtttg agtgaattgc ttcctgtgtc tgcttcctga 40261 gcttatgagc tgcaaggaca gcagttgctt cagcgggtgg gggtcgggta gtagcaggtg 40321 gaggagtgct gggctgggtg gagctggtgg agaggtgtgg gtgggtgggg gaatgagaac 40381 tggatgggtg agagaagtgc ctagggagcc tttaatccct gtgggggtgg ggaaagcagc 40441 agggaggtca tctagccctc gtcctcactg ctgcactggg cccagttggc aggctgagag 40501 ccacaggtct gtggtcaggg tgccaggaaa tgagctggag gacaggaact gctcatgggg 40561 atggtgcccg cactccatca gggcagcatg tgggcagcat gggcatccca ggcacctccc 40621 ctagcaggtc cagaatcact caaggtgggg agcctcgagg agcagtcagg gccgggagca 40681 tcagcccttt gccttctccc tcagcatcta cacgccattc actgaggaca caccctcggt 40741 gggccagcgc ctcctcaact ccgtgctgaa caccctcatc atgatcagcg tcatcgtggt 40801 tatgaccatc ttcttggtgg tgctctacaa gtaccgctgc tacaaggtga ggccctggcc 40861 ctgccctcca gccacgcttc tctccgtctg ccccacacca tggcggcagg gcccgtgaaa 40921 cagccgcctt tagaaaaaca caaattagag gaaaatagac ccagattttt tgtactcctc 40981 cccaccccat cctgtctccc accgtggatg acctaatact gttgtctttt atttttattt 41041 attttctttt tcttgaaaca tggtctcact ccattgccca ggctggagtg cagtggtgcg 41101 atcatgactc actgcagcct caacctcctg ggctcaagaa gttctcccac ccagcccctc 41161 aagtagctag gactacaggt gtgcaccacc atacctggct aattaaaaaa tttttttttg 41221 tgcaggctag atctcacagt gttgcccagg ctggtctcaa actcctggac tcaagtgatc 41281 tcccaccttg gcctcccaaa gttctgggat tacatgtgtg agccattgca tccagcctgt 41341 tgtcttttaa atttacacat tatcccactt gagttcctca ttgcagtgtt ccaagcatca 41401 tttctcatat ttcaaagtta attttgtttt gcttctcttt ctgaagttct attttaggct 41461 cccctcaccc cgatacttcc cctgaagatt tatttttagt tttccttttc cttttcgggc 41521 aaggatgtgc agaggccatg ctgaggtctt gcagccctgg gagacttttg ggttgtagct 41581 gcctatagct gccaagtagc cccagggagt agtggaaggg cagatcccat ctggccagaa 41641 tcatgggcac tgcctgtccc caaagatgcc ataagctttt agacagcggc ttcaggcttt 41701 tctcccaggt aaggggttga acccctaacg atggaaagga aattaagctg ggcattacct 41761 attttaaaac tgtttacaca caggtgcctc acagcatttt 
ttgttcaggc cgctgccatc 41821 catggagcag gtagatagaa gtgcagagtg cccaggctag agggatggga cagggacagt 41881 gcagggaggg agctgagccc ccttccagcg ggggcagcag aggggaaagc catgggaggg 41941 gctgcaggat gtgtcctgag ctgaagctta tcaacaagta atgagtacca gctgggcatt 42001 gtggtgcacg cctgtggtcc caactacttg ggagactgag gcaggaggat cgcctgaccc 42061 caggagttca agtctagcct gggcaatgta agaccctgtc tctaaaaaaa taataataaa 42121 ataagtaaca attacctgtg taactgtgac gaggcagggt ttgaacattg ccgctgggag 42181 gttggcagat ggtgggaagc agggtggagg gctgctggtt tggagcagag gatacagatt 42241 gcatggggtc aagctagaaa ttgcgtggca gatgtgaaga gctggcccca ctgcgggcag 42301 taggtgtctg gtggccagtc ccagaggctg tgaagagggg ctcagccatc tgtctagtag 42361 ggcttccttg gaggttccac gatacaggca gatggtggtg gcccgggcag ccaggtggtg 42421 gctgggatga agagggttgg caggtcccag aggcagcccc ttcccctttt ggctgtgtgt 42481 gcagcagggc cgtggaggct gcttttagtc caggtagacc agggccacgc tgaggtccca 42541 gtgggctgag ctggtgactg atgagttggt cctcaggggt gaggctggtg ggaagtgatg 42601 tcactgtccc gccgatggcc agctaaggga ctgggttagg atcagccccc tcttgtcctt 42661 cactctccca tccttggcca ggagaagagg aacaggtctt tctgaggacc tgcttgtaga 42721 cctttgggta ggaggggact tcccaggttc tctgttgagg ccactctatc taaaatagca 42781 ccccagtgag tctcctatca ctgtatccta acattatttt ctccatggcc ctcatcatta 42841 cctgctgata tactgtatgt ttgtctatat atcatctaac acccctcaca ctggaacaca 42901 atgcccgtgg gcagagactt tgctagcctt ggttccagag cctagaacag tgcctggcaa 42961 gtaggagaca cccagcatta cctttctaag tgaaccagta gagatggggg gagaccgcaa 43021 ggctatgccg gcagacctga gggagtcctg tctgcatgcg ctgcaggatg acctgagggg 43081 aactccttgg acttctgtgc cctctttatc tgtaaggtgg ccacctgatc ccttccagcg 43141 taggcatgaa gtagcctaat gaagagcatt caggcttggg tatcagtctc aggatcctgg 43201 gggccttaga atttgtggcg cttggggaca ccttgtgatc gtgcaatttc tgttgtctag 43261 ttcatccatg gctggttgat catgtcttca ctgatgctgc tgttcctctt cacctatatc 43321 taccttgggt aagtgacaga taagcagcag ggtccctggg agcccctctc catgtggcac 43381 aagtggacat gggcatgagg acctgggcgg ggaaagatga ccatcgagct ccagtcttcc 43441 ccagtgccag ccgttttggg 
aacccaggcc tccgtcgccc tctctcatgg ccttgacaca 43501 ggggagtgga agtggggctg catggtggac cacatgtttc tgtctcgttc ctgatttaaa 43561 atgaaccctt catggagaag gctctctgtg aaccccaggg ggatagaaac cccccaaaat 43621 ttacattctg atttttaggc taggcctggg tactttctgg tttgtgggaa aaattatctg 43681 ttctatcgcc ccttgatttg ggatatcagc ctgacccagg ggcccaaaga gactgggagg 43741 acaagagaaa acactttccc aaggaccttt ccatgtgcac agggtcttcc aggtcatgcc 43801 catgcacatt tctgtgatct gttccaagca tccccacctt gttttagaaa atgctgcaaa 43861 tggtaaattg taaggacagt gaaggtcggg gaaggaaatg ttagtaaaga gggccaggtt 43921 gggactgaat ggtggtaaac tgctaggctg taatgcctcc actgagtccc agtcacaggc 43981 tccaccttgg tcctgcaggg aagtgctcaa gacctacaat gtggccatgg actaccccac 44041 cctcttgctg actgtctgga acttcggggc agtgggcatg gtgtgcatcc actggaaggg 44101 ccctctggtg ctgcagcagg cctacctcat catgatcagt gcgctcatgg ccctagtgtt 44161 catcaagtac ctcccagagt ggtccgcgtg ggtcatcctg ggcgccatct ctgtgtatgg 44221 taggtgggca gcaaggctgg tgggggcagt gggggcgatg tccagggcca aatcgtcccc 44281 agtgctgcac aaggagggca ggtgctgaag ggcttgcatc cctttctgca gaggcctggg 44341 tgggatccct cctgagagag tcgcctttgt aaaacagagg ggggtccact atttctggaa 44401 cactcctggt ggtctagata aaacgcagta gtcactgagc tcctcattta cttttttttt 44461 ttttgagatg gagtcttgct ctgtcgccca ggctggagtg tagtggcgcc atcttggctg 44521 actgcaacct ccgcctcccg ggttcaagtg attctcctgc ctcagcctcc tgagtagata 44581 ggattatagg catgtgccac cacgctgggc taatttttgt atttttagta gagatggggt 44641 ttcaccatgt tggccaggct gatctcgaac tcctgacctt gtgatcggcc cgcctcagcc 44701 tcccaaagta ctgggattac aggcatgagc cactacaccc agcctcattt tccattatta 44761 ctgctatgct gattgagcaa gtgcactgtt aagcactgga cacgctgtaa gtgatttgtt 44821 catcaagaca gtcctttggg taccatgcat atacataacc ccaaatgtta gctgctattt 44881 gatattagca tgattatcat tgccagtatt gttacttcca ttttaaggtt aaagaattgg 44941 aggctcagag aagtgggact ccccagcctg gccaccgcgt ctcgggtgca cagctcctcc 45001 atgcttgcag ttgcctgcga ggccctactc tggctcacac cagggcctgc tctaagttgt 45061 gactggagaa tgagaatttg ggatgccagc ccagaggcaa ggcatgctct gagagctcca 45121 
cccggggctc ctgtgctaca gggcaggctc ttcttcaggg ggctgcccgg ggatagtttg 45181 acaaggatgt ctctgtcttc ctagatctcg tggctgtgct gtgtcccaaa gggcctctga 45241 gaatgctggt agaaactgcc caggagagaa atgagcccat attccctgcc ctgatatact 45301 catgtgagtg agccccccgt gcctctgcct gactcggggt cagcaggcag cctgtggggg 45361 gacaggggcc tgcttcctgg ccgtggcttt cagagttgac tgggcgatcc caggagggtc 45421 tccactttca gaagccaggg agggcagtat cttgttatta cacagtaaga agcttagaaa 45481 gttaggacag gaagcaggca tctgctggga tgtgctgcag tccctgactt catcccgtcc 45541 atcctccagc ggcatgctgc ggtgcaggtt gcattcctgt gatcccgcag ccacccctca 45601 gctctccagg ctcttgagaa gggactttgg agagggattc ttcagggcag ggggtcgggg 45661 agcaaggagc ttctgggctt ccttgacagc agcgtggctg attggcatta atcctaactg 45721 aagggaaggc acacgggatg gcccctggcc tcggggtcaa tgtgtagaga tttggactta 45781 cacatgcagt caacaaaggc acatcaagtc cccattttgt gacaggcact gtgctaggca 45841 ttgggggacc cagcaggaaa gaagaccaca gggtcccagg cctcatggag ctcacggccc 45901 tgtgattgtg atgccctcgg tctgttgatg gcggggctta aatagcctga atttctggag 45961 ctctggcgtc tgcaaggtgg cctgggaaag agtttatgga acagctacag agttctaggt 46021 accttcatgc agttgaggat tcgagcccgt agaggagaat tgcctgcagc gtggccccac 46081 gggaaagcac attccaggcg cattccgagg atgagcggag accatgtatg gaaaggtagt 46141 gccaggactg tcatgagtgt cccagggctc gggggattca cccgtgaaca gtgaggtctt 46201 ggctctgata gacctggttc ttatgcttta ggaggggaga caaacagtaa cagaatagac 46261 aaatgcaaga gagagtgact ctggacccct cccacaacgg cctcctaaca atggagcatg 46321 agcagatacc tgcaggatgg agggtcctgt gcaggctttc tgggacgcag actggccacc 46381 tcccccaggc cctgcaggca gccactgtta gcaccgcctg agacgtgaac cttttctcct 46441 cccccagctg ccatggtgtg gacggttggc atggcgaagc tggacccctc ctctcagggt 46501 gccctccagc tcccctacga cccggagatg ggtgagtatc ttggggagct aacagcctct 46561 catcactggg gggcagctcc ctacctgcac ccagctctgc tcggcctggc ttccctgaga 46621 ggcatgagtt caggaggggc agagggaaag gtccgttgaa aaccagccgg acacatgcgg 46681 cttgaagatt gagcaagtgt tggaccctcg gtcctctgcc agcctctgtt gcatcgttct 46741 gctgggcgtg ggtgggtgga gtgggggaag ccctggtgtc aggtgctggt 
gctcaggggg 46801 accccttctt ggagctttgt tccctggtaa cactctgacc agctgttgtt tctctctctt 46861 gttgtcccct cctcacggtg atgacggaca tcttctcttc ctggacaccc agaagaagac 46921 tcctatgaca gttttgggga gccttcatac cccgaagtct ttgagcctcc cttgactggc 46981 tacccagggg aggagctgga ggaagaggag gaaagtaagg tgcccatgtt cacacggcct 47041 gcttcagcct acggcgggag cggagacaga gggtggaggc tccctgcagc ctgggtggag 47101 gagggcatga ggggaggggc cccttttccc atcagaggca tctctgtgaa agtagaagat 47161 gcctgcagcg ctggggtctt ctcagcaggc cccatgtagt tgtccggcat gtattgagta 47221 tgggccacgt gcccgtgctg tgctgggtga ggcccagccc tggtgggacc cacaggctaa 47281 ggagacacgg gcagtaatca catagactga gaagccaagg actatgaagg gggccatggg 47341 gttggggagg ggcggcagga gagcatgcca cggggcttct tgacctggtt ggcaggggtg 47401 agagaaagtc agctgaggaa gtaactgctg agctgagctc tgaaggttga gtcacagcag 47461 tcactagagg agaggagcac agggtgggga gcatttcctg acagacagac tcaggaatca 47521 gaggaagccg gggcgggatg cagagagcag aagtgtggga gagccttgca aacaggcctg 47581 gagacatgcg aagataggag ttcatcctgg cgtcagtaca cggtgcctgc ctaataccca 47641 atgccagccc actgctgcgt gccaggcagc accctggagc agggagatgc tgcactgtcg 47701 taacagcccc tgccttgaga ggtgccttac gggagcagcc tggtgacagt ggcctggcat 47761 acaggactcc agtgacacgg gaggggcaag ctagggaaag atcactctgc ggtgggtctg 47821 gaaggaggag caggtgcgca ccctccaggc aggcttgggg gaggtattta ttccaaggcc 47881 aactggtgtg ctgcagacca ggagttagca cagatcccac ggggcccaca ggactggcct 47941 ccctccagac accagccaca agctctagag ggtctagatg ccacttgtgc ttctgaccgg 48001 ctgcaaattt agggctccca tgaccccctt aggttcaata acttgctaga atgactcaca 48061 gaactcagga aagcactaca cttaaaattg cagtttgttt tttgttgtcg ttttgttttg 48121 gagacagggt ctcgctctgt tgcccaggct ggagtgcagt agcacgattg tggctcactg 48181 caaccttgac ttcctgggct caagtgatcc tcccaccttg gcctcctgag tagctgggat 48241 tacaggcacg tgctaccaca cctggctcat ttatattttt agtagagaca aggttttgac 48301 ttgttgcccg ggctggtttc gaactcctgg gctcaagtga tccacctgcc ttggccttcc 48361 aaagtactgg gattataggt ctgagccaca gcacctggcc aaaattagtt ttattataag 48421 agatgcaact caggaccagc caaatgaaga 
gacagtgaag aagtaatgct gatggatcac 48481 acctggtggg ggaaggcgga cagctggggc caggagcagg agggacacct gcagggctgg 48541 aagggcaggg gaggtgggcc tccatggttt gtgtttattg cataaccatt tttattgtct 48601 acagtgagca aagttatcct ataaacaagt gtcagggacc attgcactaa agaaaacaaa 48661 tgagagcatt ttggaagctc taatttcctg atcagtaatg ggtagactaa ttcccagtta 48721 tatttacctg ttgtaaggtg aaaggttctt cagaggacct ctgtcttggt gttatatggg 48781 cttttgaatg tactgaaatt aaattcccta aaaatctgtg attcagactt catactaaat 48841 tgtacagcag tgcccagccc aaggccttgc atttctattt gttgttttct ttactctcta 48901 agtgcccaac actggtttta cctgagtttc agaactgccc gcttttctct gcccaggttg 48961 taagtcaccc agtccacagg tgtcccctgc tttcccactg gccactgatt tggggaggca 49021 gctgtccatg tccccagtcc acatcttagc ttctagaggc caggtggggt gggctgggct 49081 gggcaagagc agctgggcct tctgggccag agtttctctt ctttttccat tctgtgcacg 49141 cctcttcagt acgggttact gtctctcctc acacaggggg cgtgaagctt ggcctcgggg 49201 acttcatctt ctacagtgtg ctggtgggca aggcggctgc cacgggcagc ggggactgga 49261 ataccacgct ggcctgcttc gtggccatcc tcattgtgag tggctgggga tgcgtccaac 49321 tgcctcgtgg tgggggcccc cagggtcctc attgtggtgg gggcaggtct caggatccct 49381 agggattttt catttcttct cttccctctg agggacaaga gcagggagcg gggctggaag 49441 ggtcagcttg agaccaaggc tcacaggagg tgtgctcgcc cctaggtggg ctccagcctg 49501 tggaggacag tgcaggggag ggtgaggagt gtaccggccc cagcgtggct gagcacacag 49561 cctccaggcc gaggacccag ctgacagctt tgcgcagtga tgataccctc gaggtggttg 49621 tgatgacatc agatttgcag aaaagaaaat tgcttaaggg ccttgcccat gggcgcaaag 49681 ctagtgagga ccatgttttc cccctcctcc atgccattgg gacaccacag ggtctgaatc 49741 tggggcacta ggggtggccc cgttactgtg aaccacagca gtgaaatgtg gaggccctgt 49801 agtcagttaa cgtggccaga tacacataat ggggagacgt cctgccgtga cttcatctca 49861 gagatttcgc tgtcacgtta gaggaggagg agcgtctgag ccgtgcgctt ggcatctgcc 49921 ccttagtgaa aaccctgggc atggcatgat taaggttgat gctccagtgt ccagaaggtt 49981 ttctttttgc ccacaagtat atcagggatg ggatggtgga cccaggctcc tccaccacca 50041 gactgcctta cctgagccct gctggcccca aagatataga aggcaccctg gttccctgtg 50101 ctcacctgga 
ccactgcctg catcagctgg gtcaggggag gatgggcagc ccccacacct 50161 gcttcccagg ggcaggttgc ctggcggctc tgattccctt ggtgccagct gctgagaacc 50221 ttactgccat ttcagttgag cccacctagc tctcatataa atacatgttc cctgagggca 50281 tcttaccatc ccatgtgacc actccagcca gacaggggag gcagcacggc ctcggggcac 50341 agcactgctc caggagtcag gaggcctgcc ttctggttca ctcactaaca ggtgaggtga 50401 tctaatgggg gtgagaactt ctgcccttaa cacctcaaga gctgttgcag gaccagggaa 50461 gataatgggg tgtctagcgc cgttatccga ctggtcctcg aacaagctcc tgtgcccagg 50521 gactagacca tgactcacag ctcctgtcca caccagggat caccacgctc accctcccct 50581 ccatgtcctg cagggcttgt gtctgaccct cctgctgctt gctgtgttca agaaggcgct 50641 gcccgccctc cccatctcca tcacgttcgg gctcatcttt tacttctcca cggacaacct 50701 ggtgcggccg ttcatggaca ccctggcctc ccatcagctc tacatctgag ggacatggtg 50761 tgccacaggc tgcaagctgc agggaatttt cattggatgc agttgtatag ttttacactc 50821 tagtgccata tatttttaag acttttcttt ccttaaaaaa taaagtacgt gtttacttgg 50881 tgaggaggag gcagaaccag ctctttggtg ccagctgttt catcaccaga ctttggctcc 50941 cgctttgggg agcgcctcgc ttcacggaca ggaagcacag caggtttatc cagatgaact 51001 gagaaggtca gattagggtg gggagaagag catccggcat gagggctgag atgcgcaaag 51061 agtgtgctcg ggagtggccc ctggcacctg ggtgctctgg ctggagagga aaagccagtt 51121 ccctacgagg agtgttccca atgctttgtc catgatgtcc ttgttatttt attgccttta 51181 gaaactgagt cctgttcttg ttacggcagt cacactgctg ggaagtggct taatagtaat 51241 atcaataaat agatgagtcc tgttagaatc ttggagtttg gtccgttgta aatgttgacc 51301 cctctccctg catcttgggc acccctggga taacttgtgc tgtgagccca ggatggaggc 51361 agtttgccct gtttgaagga acttttaatg atctcgcctc tctgcacaca tttctttaac 51421 tagaaagttt cctaagcaaa ggagttagga gagcagggtg gcctgacatc tgccagccct 51481 gagctgtaag gctgtggatg ctgagcaggt ccctggactc agttgtgcac ggtggcacag 51541 acactgccag gtggttgcca aaacatccag tggttccttc agcaagtgtt caccctctgc 51601 agaagcctgt gagggcctga gctcagaaac cactctcctt tccttctctg gctttggccc 51661 tgggcactgt ggtgggagag tggacagttt ggctttgcct tctctgtaca tcaatcatgg 51721 gttgcaaaga gaatctcaga agtgcctctt cctgagcaca gtggctcaca cctgtaatcc 
51781 caatacttcg ggaggtcgag tcggaaggat cacttgagcc caggagtttg agaccagccg 51841 gggcaacata gtgagacttt gtacaaaaaa aaatttaaaa attagccaag catggtggca 51901 tgcatctgta gtctcagcta ctctggaggc tgaggtggga ggatcactta agcccaggag 51961 gctgaggctg caatgagccg agatcaagca ggtgttaggt atatcagaca gctgagaaga 52021 cgcaagtgtg ccctggggtt caaactggta cccctgtctc cctgttccag gaataacatg 52081 agtgccggga caatgcatct ttattatgag aggaatgaga attgtgtagc ttgacatttg 52141 acaggagctt gctttccccc aggctgtttg aggaagggca gaggaaaatg tggtgcccta 52201 agaaggaagg acagaggagg ccgaacactg gcgggtggaa tcccactgat tagtagtgca 52261 ggtcagagac ctgggatggg gggcattgcc gtcatggaag ccacagcggg gagcgggtaa 52321 agcagacagg gatggtccct gatggtgaca actcgcaaga ggttaagggg aaagaaaaac 52381 tgaaaagctt attcaatttg gcaattatgg cagtgtttat cttcagaaga gcagttttag 52441 ggtggggttt ccaaagatgg gattggacat atattttgaa tcattaagct tgaggtcttt 52501 caaaggcctg gccaagggtt gctgggtgga gaccacattc agaggtaaag gcagaaattg 52561 ggggccctta agtagacagc gagggaggaa gaaatgaagg ggcctggtga tggttagggt 52621 gaagtgttaa gactgagaaa acaaggacat gtgagaagac gagggaagag cattggagag 52681 aacaaagaca ctggaggaga tgctacttgg aggtccccag agagcaggga gacaaatgaa 52741 cccagaacac aaatggcaaa gaagaaaaat gagagaattt gtaaaagaca gcattcgaac 52801 atgccgaaca agagcagggt actggtgttc aaacacctgt atctcccccg tgtaacccgt 52861 caactaatat ctttccatat ttgctccaga tttgtcttta gaaataaaac ccacgttctg 52921 aagtcctgtt tgtatgtggc cccagtcctg ttgcctccgc ctcctgtcct gaagtcgatt 52981 tctgcccttc tcatctatgg ttagttttgt tttgtatgtt ggcatgtttt cttaacttta 53041 cagaaatggt atcatactgt acatatttga taatttttta aaatattgca ttctggaggc 53101 atgtataaat gtagctccag ttcatttatt ttatttattt tttgagatgg agttttgctc 53161 ttgtcaccca ggctagagtg caatggcgtg atgttggctc actgcaacct ctgcctcctg 53221 ggttcaagca attctcctgt ctcaatttcc tgagtagctg ggattacagt tgcccgccac 53281 catgcctggc tagttttgta ttttagtaga gacggggttt caccacgtta gccaggctgg 53341 tctcaaactc ctgactgcag gtgatccacg caccttggcc tccaaaagtg ctgggattac 53401 aggcgtgagc caccgtgccc agcccagtta ttttaactat 
tgtatagtgt tccattgtat 53461 gagttctact gtttatatgc tattgatcga cctgtagggg ttttgcagtg tttctgtatt 53521 acagctgtgc tgcagtgagc atcccatcac attgtgtgga tttgaggaag tattggaatt 53581 cccccaattg actggacatt cccaattacc ctccaagtat gtgtctgttt atccttccat 53641 ccgcaatctg agagttcccc aactctataa tacttggtgt catcagactt ttcatcttgt 53701 ctgattggat gggtgtcatt tcctttaggt tttataatta tcttttcata tgtgtattgg 53761 ctgtacaagg ttccttctct gttcattatt attaattttt ttagacagag tctcgcgctg 53821 tcgcccaggc tggagtgcag cagcgtgatc ttggctcact gcaagctccg cctcccgggt 53881 tcatgccatt ctcctgcctc agcctcctga gtagctggga ttacaggtgc ctgccatcac 53941 gcccggctag tttttttgta ttttgagtag agatggggtt tcaccgtgtt agccaggagg 54001 gtctcgatct cctgacctcg tgatccaccc gcctcggcct cccaaagtgc tgggattaca 54061 ggtgtgagtc actgcgccca gcccaagttt ccttctctgt tacttgttca tatcctctgc 54121 ccatttttca cttggatttt ttgtcttacg gatatttaag cctcttaaaa tatatattct 54181 ggagagatgc taatctttga ttaattatat gcattgcaaa tgtctggtac attgtggctt 54241 gcctctcttc cctgccttta ggagtgtttt gctggaccca agtaattttt aaatgttaat 54301 gttattaaat ctatcagttt tttgcttgta tggcttatgc cattgaatct tgttttaaga 54361 gatccttccc taccctcaag gttttctaaa tttttatttt cataacaaga tttttagttc 54421 atctgaaatg tatttttatg attgtattta gtagggacct aattttgttt ttctttgtaa 54481 ccaggtgtcc cagcactgtt tactgaacag tctctccttt ctcgctggtc tgtagaactc 54541 tcctgacata taccaagttt ccataagtgg gtggatgggt tcctgagctc tctactgtta 54601 atagaacttg ctctctcgca ggccaatgcc tcaccaggtg attgaagcag agaaacttag 54661 gtggtgaaag gagaagatgg ggcctgtcct gagagtttct gttcctgaga tgctagaggc 54721 agagggtatg taaatctgaa gttacactgg atctcctaaa acagtataaa gctacagaag 54781 tataatagtg tggaatggtg gtgggagtca gtaagggtta ggtcactgca gtggtttaaa 54841 caagatgggc tagaatcctt tcacaggcac aggcagcttg gagagggtgc aatagtgcat 54901 ggtatcaggg gtcagatgcc tctttttcct ttgagatcag taagtggctt tcacctcatg 54961 acctaggctg gctgctgtgt gctagccgtc aagtcacact ccatccagca tgaaaggagg 55021 ttagaaaagg gtgcatttcc tcttcttaaa aacatgtctc aaagttgcac acagcacttt 55081 tgcctatatt caattggcca 
ttagtcccac ggccatacct gtctgagact gagagactgg 55141 gaaatgtctt tatttcaagt ggccatatat ccacctaaac aagataaggg atacgtggtt 55201 atggcgtgtc ttttggttta ccaatgcaga taatgaagtt accaaaacaa tgagaaaatg 55261 gggtcgtgag ggatcatgtg aatcacaagc tgatgtcttc aaagacggtg gaaatgggcc 55321 ccgggaggca gcagatgaca gcagtgggga ttaaggtaga cctccatcct ggggttaaaa 55381 tgaggggaag gtgatggagc tggaccagca gtcagaatgg tcagtggtta ggagaccctc 55441 tgccccccac cgctgccacc attggctctc tacagaatgc ctgcgagtgg cttagagtga 55501 ccaaggatga ggtgcagatc catgtgcacc cccctgcccc ctctgtggac aattttcatg 55561 cctgacagca cagtctatgt ggattgcaag ccgatgaaac tatgcaaagt agaagcatgc 55621 ctgcagtttg tgattcggtg atgtgtttta tgcttatgtg agtcgaatgg ggcggcaggg 55681 tcctgtggtc acccgctgag aaggaagggt cctgtaacca ctgcctttct ttcagctact 55741 tgagaaaggt gttgtgaggg accgtggatt ttgggacagc tttgaatggt ggtagggagg 55801 aagggtccgg tctgagtgaa tggccagaaa gctgtgggga agcttttagg acattggcca 55861 agagctccct gaaggcagcc agggagatac ttgtcagtac atgtgactaa tggccaactg 55921 aatataagca gaagtgctgt gttgctgtgt gcaacactgg acaccttagg aaggacctcg 55981 agacagtggt tgtggactct gtagagagta acagtgacag tagcaaaccc ttacccagtg 56041 ccaaccttgt gctaggctcg cactaaatga gtttaccttc aattctcgta acaataggag 56101 gtaactacta ttctaatttc cattttatag atgaggaaac taaggcacag agatcactga 56161 cttgcccaaa atcaagcagg gagtagttag tatataagcc cacggtatgt ggtttgtaga 56221 ataggtgctc ttgactagca gaaataggtc ctccctgcag tgtgtaattg ataacaagca 56281 tgggctgcca tcttcctgtc gaggccactc aaaacaccca acaggctacg cacggtggct 56341 cacacctgta atcccagcac tgtgggaggc cgaggtgggc gggtcacctg aggtcaggag 56401 ttcgagacca gcctggccga catggtgaaa ctccgtctct actaacagta caaaaattag 56461 ctgggcgtgg tggcgggcac ctgtaatccc agctactcag gaggctgaga cagaagaatc 56521 acttgaacca gggaggcaga ggttgcagtg agacaagatc acgccattgc actccagcct 56581 gtgtgacaaa agcgaaactg tctcaaaaaa aaaaaaaaaa aaaagtatga ttttataatc 56641 ccagcacctt gggaggctga gtcgtgagaa tcacttgagc ccaggagttt aagaccaatc 56701 taggcaacat ggcaagaccc catctctgcc aaaaataaaa aatagtctaa ttttagctat 56761 
tcatgtgtgt gtgaagtggt gtctcttcgt ggctttgatc tgcatttccc taatgctgac 56821 taatgacgtt gggcacctgt tcatgtgctt actggtcaga tatctttctt ttgttacatt 56881 ttattaagtt ttaaaattta aagtcaaaga tttccctatg agaatgactt ttaaaatgac 56941 caaaaagggg aagataacat taattcttga agagaaggcc tctgagaaaa atacagttgt 57001 agcaagctgc tactttgcaa atgacccatg cattttaatt ttcccctaag gaaggccaag 57061 gaagagtctt atcacctcag ggcaggagat gtagggactt gggtcattta ataagagtgg 57121 taggtttgaa aactcaaacc cagaagactc cttagagttt ctcccaggag gtagggaagg 57181 ggccgcatcc atggagagag gaggatgtga cttagagcag tggtccccaa tctttaggga 57241 ccagggactg gtgtcatggt agacagtttt tccacagata ggggttgggg ggatgatttg 57301 gagctgaaac tgctccacct caggtcatca ggcattagat tctcatgtgg agtgtgccac 57361 ttagatcctt ggcgtgcaca gttcacaatg gggttcgagg tcctatgaga atccgatgcc 57421 actgatttga caggaggcgg agctcaggtg gtaatgctca tctccaccgc ttaccacctg 57481 ctgtgcagcc tggttcctaa taggctatag actggtactg gtccatggcc tgggggttgg 57541 ggacccctga tttagaggaa gtaagggcat ggcttaccgt gggccctggg gtgttctggg 57601 aatggggagg atggagagaa gagaggaggt agggaagacc tccccttgct ccccatttng 57661 ggatttgggg agaaagtcag gtctcaggct caacagtacc tgatcctgta ccatcttcca 57721 aagggaagtc agtggggttg gaaggtaggc aggggttatc ttctctgagc cacggcacaa 57781 gacagaagtt tcccaccatt cctgaggggg caggtggtag gtccccaagc agagagccag 57841 cagtccctct ctgaggcctg caatggaatg gggtggggtg tccactgagc caagggtctg 57901 tcagtgagag ctggggaggc tgggctggct tgcaagcacc tgttataacc aaaccaggaa 57961 atcaggttcc gagtcttgcc agcaagggcc tacagctgcc agcagagatg gacagccagg 58021 agaccccaat tggccaccca gagccaccct cctctgccta ccccaccctc cagtactcca 58081 gagcctactc ggaggggaac agaaacctga gaggctgaac acacacacat ggagaaacaa 58141 acgtagtaaa atatttgggg aatcaggaag aattatttgt actattcctg caacctttct 58201 ataggcttga aattatcaaa ataaattttt aaaaattgta ataacattct catactaaaa 58261 cactgagttt ttttctttca ttttttgatt ttttcttttt gactccagca tgacttactc 58321 taacaatggg tggtctcgat tttgaaatac tttcttctcc aagcctttca tgacaccctg 58381 tctctgttgg ttctgaaaat gttggatttt gtctcagccc ttgcttctgg 
aaacagccaa 58441 ggttaagaaa accccccatg ctttgtgttc tagcagacag cttcctgcaa agagccatct 58501 tcccagagca cttaggcctc ttagatgtct cccttgttta attatgacaa gagcacacac 58561 acagaccctc caaattccca ttcttagtct tctaaatgat tagctgagct gcttttcccc 58621 actgattaat cggaataaaa tgctcattaa ccaaacttcc ctcctttccc caggtcccta 58681 aactttcctg agtcggcaga catcccctct ggagaagagg ttggccccag agtcgaacat 58741 cctctgatct acctgatcct gctgcccttc cattccactt ccccacatct gttctttctg 58801 gtcgtgttta ctcccctatt aaaaaaacaa aaccagaaaa cgtgtttgcc tagatcttga 58861 gactctggaa gatcttaaca gtcagaggtt ccccctattt gcaatgatct cctttcctgc 58921 cccttcctat ccttgcaata atccttttga ataaagtctc tccttactaa atccagttcc 58981 taaaaattaa tttttttaga gacagtgtct cgctttgtca cccaggctgg aatgcagtgg 59041 catgatcata gctcactgca acctcgaatt cctgggctca agcaatcctc ctgcctcaac 59101 tgaggctaca tgcatgcacc atcatgcctt gctaattttt tttaattttt gtcgagacag 59161 ggtcttgcta tattgcccag gctggtctgg aactcctggc cttaagtgat tgtccagcct 59221 caacctccca aagcgctggg attcaagcat gaaccaccgc acccagctcc aattttattt 59281 tgtttggcag tttcctccta ctcttcttcc ctcctcctgc tcattccttg gccattttcc 59341 agctgctgct ctgcttttac cctagatgca cattttcagg gctcagttct cattcctctt 59401 ctctgcttgc tcctcctctc ttcccagtgt gatcctatcc attcacgtgg ctttcacaag 59461 cgagttgcct tccccagtat ccctcctagc agatcctgct gtgctcaggg tgtcctcctg 59521 catatttagc actgctcacc tcccagggtc ctgggcactt aggagtggcc atgtgactca 59581 gctctgccaa agtctagtgc ttgggtcttc cagaaaagct attgttctcc tggttttaaa 59641 ggcggggagg ggcgcctcag ccataatctg ccattttttc cttgcatggg acacagatgt 59701 gatgctttga gatgtcacag ctgcttagcg accaccagga tgaaaccgca ggcaaagcgt 59761 gtatgacaga gaaggaaacc acaagtagct gccccagcca cccctggact tcttttctgt 59821 ttgtttgttt gtttgttttg aaatggagtc tcactctgtc acccaggctg gagtgtagtg 59881 gtgccatctt ggctcactgc aacctctgcc tcctgggttc aagcgattat cctgcctcag 59941 cctcccgagt agctgggact acaggcacaa gccatcatgc cctgctaatt tttgtatttt 60001 tagagagatg gggtttcacc atgttggcca ggctggtctg gaactcctga cctcaagtga 60061 ttcacccacc taggtctcct aaaagtgctg 
ggattacagg cgtgagccac cgtgcctggc 60121 ctggtcttct tttatgttat taatagaaac ccctatctgg ttaaactatt atagttgggt 60181 ttctagtacg tgcagccaca tgcaagcctg acctaccttt cagtcagaca tctctgctca 60241 cttggtatct cttccactta ggtgtctcaa aacactgttc ttctctccca ccaacacccc 60301 aaatctgctc ctccagtgtt taacaccccc aaaaagcttg tctaagtagt ttctccagaa 60361 ttcttcttag aaagggagct actgctgggc gcagtggctc acgcctgtaa tcccagcact 60421 ttgggaggcc aaggtggatg gatcacctga ggtcaggagt tcgagaccag cctggtcaac 60481 atggtgaaac cctgtctcta ctaaaaatac aaaaattagc cgggcatggt ggcacatgcc 60541 tgtaatccca gctactaggg aggctgaggc aggagaatcg cttgaacccg ggaggtggag 60601 gttgcagtga gctgagatca caccattgca ctccagcctg ggtgacaata atgaaactcc 60661 atctcaaaaa aaaaaaaagc tgtatttatt taagaaaaca tcttgatgag aagcatcaag 60721 taaaaggtga agccctaagg gcatgtatca gaaaattaaa tgaatagata cactgctttg 60781 tattggaaca agattttgag gctaggggat gtaaggagaa gaaagtgttt cttgtagctg 60841 gatattgcaa taatttcaca tctgtttcca ggtaaataag ctgggagcag aattcatccc 60901 ttttctcttg tagcaatgca gccattcgtt cctcattgca ttcctcatta tcttgggatg 60961 catcttaagc aatggcaaca gagttctcca aagcttcatc cttggtggga atagcgtctc 61021 catccctgag tcacaggtgg tggttttctc agacactgca tcatcacaaa ccatgaactt 61081 tcacccctgc atgggacggt gatcactcac cccctcccac aaaaggacat ggtgccatag 61141 tcacttacat attccttctg agggtatgtt ctgtgcatcc ccactgctgg taccaaatct 61201 gcctgccttc attggattag aaggagcaga aagccactta gactgcctta gggaaacaca 61261 agattattac gcacctgtta aatatatttt tctatttata actctcgctt tttaaaaggg 61321 tcctgtttgg tggcttttct tctactaagc acagggtctg gttaagcaag gggagctgaa 61381 tgtctctgta tgataatact tggggaggga tttggcctcc acttgatata gaaggaaaag 61441 gagccaatag gaggcaggtg gccacatact atgatgtaac ctcgccccca agagccccca 61501 gaacctttga tcaggtcagt ccaactctga aatagtgact attcacagag aactcacaga 61561 gtcagggatg tggatttgat tcttactcat ccatcttgct caccattccc tgtgccttga 61621 aatctgaata tttgagtctt tattcagatc ttggaaattt tctcccctta tttattttct 61681 ctgttttctc tttttgagat tccctcaaga ctggtgtcag atctcagaac ttgtttttta 61741 gtgtctcttc 
agtgttgtct cctactctcc atccctttgt cttttccctg ctctgtatgc 61801 taggattttt cttctagcct ttttagattt tattttttga caatcttgtt tgtttttttt 61861 ttaatttcca aaaactcatg atctttactt gctcctctct catagctgtt atttttgatg 61921 gaggcaatat cttctcagct cttactgaga aacgagtaat tcttgaaagg ttttctattt 61981 gattaattgt ctcagttttc tttagccttt ttgtttatct gttcatctta ggtcctttta 62041 tttcttatat aactctttga aaatggctga tgatccttgc ttgactgttc acatttatga 62101 atgaagaact agactaggtg tctggagtta ctttccttgg cagatgtcag tctggtggtg 62161 ggttaaagta ttatccagta aatattcatt cccttgcctg ctggcctcat gagggaagcc 62221 tgatcccagg ccatgtcctc tcacccctct tcggtgttct ctgtggtgca tgggaactgc 62281 ctgcactcag ttcctgttca gctctgctgc tgggggacca aattgaaaca ggacatgccc 62341 cagtttagcc aatgtcaccc tctactaatt accatctcat tcagctaaga aaatgtatct 62401 tccggccagg tgcagtggct catgcctgta atcccagcac tttagaaggc cgaggcggac 62461 ggatcacctg agctcaggag tttgagacca gcctggccaa catggtgaaa ccccatctct 62521 actgaaaata caaaaattag ctgtggccag gcacggtggc tcatgccgat aatcccaaca 62581 ctttgggaag ccaaggtggt ggatcacaag gtcaggagat tgggaccatc ccggtcaaca 62641 tggtgaaacc ccatctgtac taaaaataca aaaattagct gggtgtggtg gtgtgcttct 62701 gtagtcccag ctactcagga ggctgaggca ggagaattgc tggaacctgg gaggcagagg 62761 ttgcagcaag ctgagattgc accactgcac tcctggcaac agagcgagac tctcaaaaaa 62821 aaaaaattag tcgggcatgg tgatggacac ctgtaatccc agctacttgg gaagctgagg 62881 caggaaaatc acttgaaccc agggagtgga ggttgcagtg agcagagatg gcgccactgc 62941 actcaggcca gggagacaac atgagactct gtctcaaaaa aaagaaaatg tatctttcta 63001 aagtatgtgt gtgtgtgcgt gcgtgcgtgt gtgtgatagt cccttatagt gacagtttcc 63061 tagcaaaaca aaagagaaac ataataaaat gtgtgggaat ttggtgtcta gaaagctaag 63121 ttatattctg ctatcaatag ccataaccat ctggtcagtt cacttctgtg ggcctgtttc 63181 tcatatttat aaagtgaagg gtttaaaaca cattcatgat tcccaaacta gggatttgga 63241 gagtagtaaa gattatccaa atagtcttat aatagttttt aagagctatg acaaggccac 63301 tcaagctatt agctgattac tgcagggaat ggaggagtgg agcagaatat ggtgccgggc 63361 aaagaacaca gaactcttgg catgtggaag gtgggccctg gagcacctgc tgagctcccc 
63421 gctcagcccc agctatagga ccttcagccg actgtcccag gctcagcacc atgtgccttg 63481 agggtagggc agaggggctt agggggctca gccagcagtc accacagatg taatcttcca 63541 gggatatgct ctgagcttgg gaggaaattc acttcttcat gccggggaac gtatgaccca 63601 ctttggtgta ttccaggcaa tgtggagata cagaaatgaa aagactcagt tctcaagggg 63661 cttgaagtct gattcgctga aataattttt agtttccaga accacaagac agacccaaga 63721 gggctgtgtt ggcaaaacaa atggcagagt ggagctggcc agaggcatct gtgcgtggcg 63781 actccaagag agcacccgac tccagatggc gacactgcag gatggagcgg ggcatgcctg 63841 cagacaggtg tcaggtgcga aaaacagaac aaccggacgc ttctggctgc aaggacctga 63901 agccttaggg ggtgatattg gttaaactga ggtaggcctt gtgttaatag ggtctccacc 63961 tgactttgtc agtttatttt ctcactgaca agaagtggag tagggtggct ctgggtttga 64021 tttatccagc agttcaattc catctttgat tttttttttt ccattctttc cattttcctc 64081 tctggcattc tttcctctcc cgcattctca cttacataaa gccaagtgca gggggagaga 64141 gaggagagag actcttcctg tgtcattttt ttaaggtcaa tgaaaaccat tcttaggagc 64201 ccctgggcag atttctccct gaatctcatt ggccagaact gtgtcacatc cccatgccta 64261 agccaatcac tggcaatggg tgtgggatag ttgtggccag cctggaccca tcagaatgta 64321 accctgagtc tcaaggggga gagtggatgc cagaatgaaa tcagagttcc atccactttc 64381 cattagaaag aaaagaatgg gagaggagca gagaaggctc cagacagaca actaacagtg 64441 tttgatacaa taactttctt atgtttctgt aacttaaggt ccattttgtc tattgctagc 64501 atatataagc ttaatgcctg ctagattctg ttgtttacac tgtcctcttc tccctggagt 64561 agtaccgagg gtgagggagc ctgtggcctc tgtaatatca ggacatagag gatagggttg 64621 ccggggggca catgtggtcc cttgattgcc acgtttctat tttggtttct tttctttaaa 64681 agaaaacttc ttttatttta tagataaggt ctcattctct cacccagggt ggagtgcagt 64741 ggcgtgatct cggcctactg cagccttggc ctcccaggct caagcaatcc ttccacctta 64801 gcctcttgag tagctggaac tacaggtgca caccaccgtg ccaggctaat tttttatttt 64861 tattttttgt agagacgggg tcttgctgta ttgcccaggc tagatttgaa ctcctggcct 64921 gaagtaatcc tcccaccttg gcctcccaaa gttctgggac tacaggtatg agccaccatg 64981 cccagcccta aaaaaattgt tgattgtggt aaacacatag cataaaattt accattgtaa 65041 ccatttttaa gtgtacagat aatagtgtta agtatattca 
tattgttgtg aaacagatct 65101 ccagaacttt ttcatcttgt aatatgaaat cctacaccca ttgaacaact tcccattctc 65161 ccctgcaacc cccacaagcc cctggcaacc acaattctat tttctctttc tattagtttg 65221 actactctag atacctcatg taagtggaat catacagtat ttgtcttttt gtgactggcc 65281 ctatttcact tagcataatg cataatgtcc tcaagcttca cgcatgttgt agcatgtgac 65341 aggacttcct tcgtttttaa ggcggaacaa tattccattg tatgtttata ccatattttg 65401 ttcatctaat cctccatcag tggacatttg ggttgcttgt acctcttggc tattgtggat 65461 aatgctgttc taaacatggg tgagctaata tctttgagat cctgctttca atttagatgt 65521 atgagatttc tgggtcataa gatttaattt tttgagaaac caccatgctg tttctttaca 65581 gtggctacgt catcttacct tccccccaac agtgtacaag gattccaatt tccccacatc 65641 cttgccaaca tttgttattt tctggttttt tttgatagtg gcaatcctaa taggtgtgag 65701 gtggtatctc attgtggttt catgaggttt aagttgagca tcttttcata tgcttttaga 65761 ccattcgtat atatcttctt tggagaaatg tgtggtgcaa tcttggttca ctgcaacttc 65821 cacctcctga gttccagcaa ttctccagtc tcagcctctc gagtagctgg gattacaggc 65881 atgtgccacc atgcctggcc atcttcgctc ttgagcacct gtgtcatggt aggtcaagga 65941 cccctgacag tcagggtccc aggcaggagg gggcacatcc cttctattat cttggattct 66001 accaccccct tagagggaga ctacagaccc ccggactctt cccagagacc cattgccagc 66061 acttcattct gttgatctcc tcttgcactt ctgcctcccc cagcctggaa agagttttaa 66121 acatcttgtc cctggcaagg aaaccctggc tcagccagta ttcttaccta atttttctgc 66181 ctcgtgtcca aagtgcagct gtattaaatg tattaaataa aaggaaaaac ttgggccaga 66241 agcggtggct catggccagc actttaggag gccaaggtag gtggatcatc tgaggtcagg 66301 agttcgagac cagcctggcc aacatggtga aatgccgtct gtactaaaaa tacaaaaatt 66361 agccaggtgt ggtggtgtgc acctgtaatc ccagctactc aggaggctga ggctggagaa 66421 ttgcttgaac ccgggaggca gaggttgcag tgagctgaga tcatgccact gcactccagc 66481 ctgggtaaca gatggagact ccgtctcaaa aaaagaaaaa aaaagaaaaa gttggagggc 66541 tttaccattt ctgctgggaa gtctggagca aaggccactg gcagtgccac tcagcttgaa 66601 catagtgagg gtgggtggga gggcaggaat acagtgttcc ttcaccaggt tgcaggtggg 66661 catcgacctc ttggtggtct ggcgcagcct ccctgcctgg ggccgtctgc cttgggctcg 66721 tcactgccca aggatcagat 
gagctgatgg aggaggaccc tgagccagag ttagttctta 66781 gcttaaactc tacagccgtg cttctcaaag tgtggtcctg ggggcccatg aagtcaaaac 66841 tattttgtaa taaaactaca ttaccttacc attctccttc tattacaagt gtacacagga 66901 gttgtccaga agctgcatga tacgatatgt gatatcacaa gggattgact ccaaaagcag 66961 gtcggagaat ccagctgtct tcttcttctt cttctttttt ttttttttgt ggagacagag 67021 ttttgctcta ttgcccaggc tggagtgcag tggtgcgatc tcagctcact gcaacttctg 67081 cctcctgggt tcaagcaatt ctcctgcctc agccaccaga gtagctggaa ttacaggcac 67141 gcgccaccac acctggctaa ttgtgtagtt ttagtagaga tggggtttca ccatgttggt 67201 caggctggtc ttgaactcct gacttcaggt gatctgccag ccttggcctc ccaaagtggt 67261 gggattaccg atgtgagcca ccgtgcccga cctcacctcc tgggttttca gcaattctct 67321 ctgcctcagc ctcccaagta gctgggatta caggtgcccg ccaccatgcc tggctaattt 67381 ttgtgttttt aatagagacg gggttttgcc atgttggcca ggctggtctt gaacttctga 67441 cctcaggtga tccacccacc tgggcctccc aaagtgctgg gattacaggc gtgagccact 67501 gtgcctggcc gaatccagct ctcttctatt aagatattct ttttattcta gtaaagagaa 67561 ttgcaaaaat gtaaacaaat gccagtcttc atgaaaggtt tttggtttgt aaaaatatag 67621 ctattttcaa aaaacttatt tatgttaaaa tgaaatgttt atttttgtta ttttttaaaa 67681 agttgataca taaataaaat tctcagttgt aatttctcaa caggatagag ctaacccaca 67741 taacaaaagg tctttgcagt cctcagtaag ttttacaagt ttaaaggggt cctgagacca 67801 aaacgtttga ggaccactgt ccctccggaa ggctgcggct ttcttctggt ggctttacac 67861 ttctgctgtc cttcagccac gtccctttcg cgtctccagg gactcctcca tttgccttga 67921 atttctcctg ttagctccac tccagggtgg ccacatgagg gcagccgatt ccgccctgag 67981 cccttgcggg ggcctcaagc acaggtgttc accaggtcag ccttcagcat ccagttccta 68041 ccgagtccct ggctggtggt actggagaag caaagacgat aaaccaggtc cttgttctgt 68101 gggaggctca ggcctcctgt tctctccctc ccaggggcca ggagagaggc cccagccaga 68161 gcctgagcac atcatagggt tgacatgttt ggtgaatgaa cattatctct gtgttgtgca 68221 gagctgagca tagaactcca tcttctagat cctgaacttt aacctggaag ggttaggcct 68281 tcttgccatt gcctgcaggg agaggtgagc accctagaac acttactgtt cgggagcacg 68341 tggttctcat gcagtaaagt gagaagggga gaggaggcac cttttctcta gtcttggatt 68401 
aaggtggaag taaagggcat atgtgaagtt tttcccaaaa ctaaggcctc atttattttt 68461 tattttttat cagaacaagt catggacatc tctagcaatg aaagcaaagc cctgcagtca 68521 tttaacatag acatactctc tacatcctgt tagccagcgg ctcctacgac ttctcccaac 68581 aatcttaaag caacctggca tgaaggagct ggctcaaatg ctggtcagtg gaaagggcat 68641 aagctctgaa gtcacaacac aagggttcaa atccgaccct cccggctcct ggtctgtgct 68701 ctctccccct tactgtaatg ccaacttcac tttggttttt acccctgcaa atattcatca 68761 ttgtgttgag tcaacacaat gatacataaa aaatcagcag tctctgataa agtgtggcct 68821 actggttaag aggagcaggg ctttgaaagg atagccaagg taacactcct ggttccacca 68881 tgtgttagct gtgtaacctc gggcaagctg cttaacctct ctgagtctca attttctcaa 68941 ctgtagaaag cagacataat attacagggt tgctgagaga ataatatgac attaggtttc 69001 taaagtgctt aattcagtac ctggctcata gtaagtgctc aataaacacc agttataaac 69061 atcactgcat ttgaaattca cagctttttc aaagttagag tcagaaagaa tttcagaagt 69121 cctgtaattc aatgttttcc aaagtgtgtt caaaacactc acttccctca agatgctcca 69181 tgaaaaaggg ttctgtggcc aaatatgatg ggttgtatat taaagcctca gagaagccac 69241 agagtaaaga aacctgttga actttgatta ctccaacctt tatcaaatgg atttgactgt 69301 agaaccctct tcacacatca ctttttaact tccaggttgc tggtagacag agctcagtca 69361 ttaaccatca ttaagaatca taacaaatac tggataacat ttatggaggg tttgctctat 69421 gccaggcact gtgatatgca caaggcatgt attatctcac tttattcata acaacactaa 69481 gaagtagtgc tatcatcttt tacggatgaa aaaataggct tagaagacta agtaacttgg 69541 ccaaggtcac ataccaagaa atcgtcagca tgcaaactca ggttggcctg gctctggaat 69601 tcagccttgc cacctccctg tatggtttcc gcagcccact gcattacccc tgactgttgg 69661 acagtccttc cttgtattga gctgaaatcc aattccagtt ccaatcacac tgtttctcca 69721 gggtgcacat aaatatatgg tgaaatgtat tatttagttt caattctaaa tatcctttat 69781 ttgtcattgt ttctcgaatt gtatttattc tgttcccttt tctcttgact tgctcaagta 69841 cttcagcatt tcatttggac taaaaaactc aacacaattc tgtctgcttt cttatagtat 69901 gatattttca tttttaggaa gtgacacatt atcaaaaccc ataagaacta aattttttta 69961 agagggggaa aaagacaagg tcaggaggct gaggtgggag gatcgcttga gcccaggagt 70021 tcgagaccag cctgggcaac agagtgagac cctgtctcta caaacaccac 
caccaccaac 70081 aaaaaaaaaa ttagttgggc atggtggcac atgcctgtgg cacatgccta ctcaggagcc 70141 tgaggtggga gagtcacttg agcccaagag gtcgaggcta ttagtgagcc atgatcacac 70201 cactgcactc cagcctgggc aacagagcaa gaccccatca aaaaagaaag ttttagatgt 70261 atacaataac taagttattt tccctgcagg aattttgata acaaagttgt cttttatagc 70321 tctagtagta ttgctgtaaa gctataaaag tacctctggc ttttgcccaa gtgtagtagt 70381 gtctgtctct ctctctgtct ctccatagca gcacattgtt aactgtttct taccagtagc 70441 actactttca ttacttttat gactcattgt ctcccttatt tagtaaacaa agtatacagc 70501 cactaaaagc aagtagctgg gctattcaca gcagcttctc ccccatacac tgtggtagtt 70561 cctaactaca atctgtttgt gtccaatagc agttcctagt gatgcaaata gtactttcca 70621 tttaaacaat ttcattatag ccaattctag ttctgaaaat accactggaa gaaaaacact 70681 tgagttctca gtatggtttg accaatatag cttggaacta tcactttcct tgttctgaac 70741 attatacttc actagttcag ctcgagatct cattattttg ggtaactgca ttaccctata 70801 cctgcttgtg gagtttgtct ccaaccaatt aattttcccc tacaattctg ctatacatca 70861 gcagttgtat tttggagccc acgtgaaagg aaactatact tacatctgtt ccagttcatc 70921 tgtttaggtc tggctcgttg ttcagcctgt gacaatcatc tgggagtggc gatcattctc 70981 cttcagccca tgtcctctgt gaaactggat aaacctgccc tagagcagca ctttccaaca 71041 gaatgttctg tgaacatggg aaatatacca gatctatgct gtccaatatg gtagccacta 71101 gccacatgca ggttttgagc acttgaaatt aggctcatgc aactgggaac ttaattttaa 71161 attgtcttta attttaatta atttaaataa tcccatatgg ctaatagata tcatactgga 71221 cagtgtcgta gctctagatg tttacctgaa atgcaggtaa aaatattaag tagaacaggg 71281 acctgcttct gtccctggag tccaaactct ggactgggga tggccattca aggattatca 71341 accacctgtg tttataataa ctcggtgcac aattttatcc accagcgtat atgagtatca 71401 gtgaaggact ttgtcaaatg ccttgtaaaa agcaagatgg cctagaaata tgtggcactg 71461 ccctgctcta ccagcctcat gatttttttt tcctagaaaa tagggtgtgt ttttttttct 71521 atttgtagaa gtcatacata cctactgtgg caaaatgcag gaaagtacct agaggaaatt 71581 aacaatgacc aaaattctgc catcttgaga taactactgt taacatttca gcatttttct 71641 ttcagctttt tttctgtgtt ggtatttttc acatcactga aatcatacct tctctgttac 71701 ctcctgtttt ttttaattta aaaaaattaa 
aaaaaattta atcgacaaaa atatatatgt 71761 ttatcatgta caacatgctg ttacactgtg gaatggctaa atcaagttaa taagcacaca 71821 cattacttca catatatatc atttttttgt agtaaggaca cttaaaacct actcagcaat 71881 tttcagtaca ataaattgtt attaactgta gttacataaa ccaaaaataa aattctaagg 71941 ccctgcaacc atctgaaagg atccctcctc ctggctaggg cattccaaag ttagcctaaa 72001 aactggttca ggccatgatg ggaaaggagg gtcggacatg cctcattata ctcttctgcc 72061 tttgggattt caggaaaagc tgacgagcat ttaacatcaa cacaggacct taagtccgct 72121 aagaaacatt tacaatctct tctctgtgaa gcttgcaacc tgaaggcttc acctgcatga 72181 tcaaactttg gtctccacaa ccccttatca taatccagac attcctttct attcataata 72241 actctttcaa ccaattgcca atcaaaagtt ttaaatctac atagaaccta gaattcccct 72301 cacttcaaat tgtcccacct ttctggactg aagcaatgta tatcttacat ttttatgtct 72361 catgtctcta aaatgtataa agaaggttgt gccccgacca ccctggggac atgttctgag 72421 ggtctcctga gggctgtgtc ttgggtcatt ggtcactcat atttggctca ggataaatct 72481 cttcaaatat tttacagagt ctgactcttt ttgtccacaa caccatgtgg tacaatagat 72541 ctcttgaact catcccgctt atctaactga aatcttgtat ccttttacta acacctttcc 72601 aacccttgcc tcccagtccc tggcacccac cactctgcac tccgcttcta cgaatttaat 72661 gtgttttctc gtttgtttgt tttttgtttt ttttctgaga tggcgtctca ctcttgttgc 72721 ccaggctgga gtgcaatggt gcgatttggc tcactgtggc ctctgcctcc cgggttcaag 72781 cgattctcct gcctcagcct cccataccag ttcaactttt tcagattcca cgtgtaagta 72841 agatcatgca gtgtctgtct ttctgtgcct ggcttatttc acttaatata atgcctccag 72901 gttcatccat gttgttggaa atgacaggat ttccttcttt tctaaggctg aatagtattc 72961 cattgtgtgt atataccaca gtttctttat ccattcatgc actgatgaac acctaagttg 73021 attccatttt tggccattgt gaataatgct gtaatgaatg tgggagtgcc gatatctctt 73081 ggacatactg atttcatttt ctttgtatgt gtatatatta tagtagcgga attgctggat 73141 catatggtag tcctgttttt aatttctgct ttttatattt tctttatatt ttaaaaattt 73201 taagaatttt tataagtatt tcttacatta ttaaatactc tttgaaaaca atcttgaagg 73261 ctgtatcaat ggttaataat ttgcatagca attcccctgt tgctgtactt tcagcttgta 73321 tctaagtttt tgctattaca aataaaactg tgctgaatat cttcttgtac aaaataactt 73381 cctgaaggtg 
tattattaaa actaagaata tgaacattgt tgaggctaac tcttctgaaa 73441 gaggaaataa ggttatcctg gaccacatgc tggctcttag tggccaattc tttcccttct 73501 gaatattcaa ggccacccat ttcatcctct cttttagcca ctcatcccat attaatgtca 73561 agctcacctt aatatacttt ttagattcta atttcttagc ctttttgaaa atcaggaaga 73621 cattgcctgc ctcgtcagtc ttccagcaat atttctgttc atgaaggttt cttaaaaatg 73681 aagtctcctg atctgtgagc tccctcagta gccttgaata tgacatttac aggtcagaag 73741 acctgagttg ttctgaggca gcagggtgct ttcttactgc ctcctcctgt gcttggagct 73801 tcaatttctc tttccaatag tgctcttcaa ctttccaccc tgaccgtaat tctctttaat 73861 aagggaaacc acaggagtgt gtggcagaga atatcagcgc tgggcaaaac tcatgctctc 73921 ctctacttcc caggcacaca gctgggggac atctcccaga ctcccctgca gccagctgtg 73981 gctatgtcat tgagggccag ccagtgaagc gtgggcagaa gtgatgtgtg cctctgccgg 74041 cctggccatt aagacccccg catggagtgg agacgacacc cagggcgacc ttggaagtgg 74101 catgatgaag agaccagagc ttttgtggag cagtagtgcg gaggaggagc acctgctgaa 74161 ccactcactc cccgacagtt gtgttggatt gtggacctgt tttctctttg cccactctgg 74221 gctcatttgc aattataacc tcgttgcttc tgcttgagct atctttcctt tgtcttggat 74281 catgggatca agcatgcgta atttttctcc aaataccttg ggattcttct tcctaactct 74341 tggtacagag tccagctgtc ccaggctcct ggatgacagg gagctttctc tcactctcat 74401 ctgaccaaca ggttccccta gcggtacata attagatcca aaatggtgct tctctttatc 74461 attttctctg ctttctgaga aatggaattg tcattttggc aagaatttgg cagaaattgt 74521 atttcagatt ttaatacaat tgagaaaaaa ccattcataa ccacagttgt ctggtacgta 74581 atgaaggtag atcttgccga atggaacatc cttctctgca acttggtggg tttgaatggc 74641 tgagtttggc agaaggaatg gtagagaaat agatgactct acagaataaa tggggaagga 74701 tttggagttc tggaagcaat ataaaaggga aggggtaaag aggagagatg ggaagatgag 74761 gagccagctg gatcactctg taaggggagc tagcctaaaa tcctaatggg accggatggc 74821 tcaagagctt ttgagaacac tggtgtccaa tatttcattt gaagttgttg agtagatctc 74881 ctccactccc ctgaggggtt ggtgccaggc tggacaggga cagcacacag atggtgagac 74941 agcccacaag ctgaattctc cacttgcaga ggctgagaac cagcgatgac tgtaaatgtg 75001 agtctctgcc cgccttctgc cctggcctcc ctcctgagaa ctggccagac ttttgaggag 
75061 gaatttagac ttccaaaggg tggagaatag aagtttcatc cagtcttatc tagggaagct 75121 cccgctgaca tccaaattac ataataaaca cggccagtaa aatactactg ctactactat 75181 cataaaataa aataaggaat taagtagata ggtcaggaat ttggcccctt aaaacactga 75241 ggccagagag attacacagt gtgcctcagg gatgttttgc tctgttctca ggttatctgg 75301 ctcctattcc ttgtgtcctc agtccttgcc tgggaagaga gtgcagtgag cagggtagct 75361 cctcaccccc tctcctcagg taccaggtgc tcctgggccc tcaagggctc agctcttccc 75421 tccgcagctc ctgcaggggc tgttgggttt atcagagact ggttctgtca aagagctgag 75481 ctcaggggaa cattttgggg gagtgaggtt cctaccacct tggagatagc tttttccaaa 75541 caaggggaag catttgagaa atatgttcat tagtaagttc gtcagtggtc ataaagatat 75601 tttgtctgat ggcagaaggc ctgaatcttt gttcccataa aagacttact cctctaacca 75661 gaagaataaa cgaaaggtgg tgagacactg gggacatctg ggcagccatg gagaaagagt 75721 ttgtggatct gacccactgt tctcctcttg tccagataag gacgtgtgtt tgtgtgttga 75781 ctggtgagga caggaggagg tttcctgagt gcagagtgcc aaagcacaaa gcctgagata 75841 aaaaggcata aatagaccta gagcctgacc cagagccagg cccagtgtcc ttgggctggt 75901 aggacttgca ggccaggtcc cagatgggga gggggtactg gggctgagca gctgctatgg 75961 aaatttccat gggctgtaca ccagctcttg gtaagctctt gcactcagca tggcatgggg 76021 aggacatggg ttatgaaaga ggcacaagac ttagtccaag cttataagga gctaggcctc 76081 tagagaaaaa actcaggctt gccaaagcag caagtaaaca aacaggttgg aaaattttga 76141 aacaaagttg taaataaggt caaggcatgg aaccagtcaa gaaaagtcaa tatcgtattc 76201 cagttattag aacctcattt caccttctaa ggacaatgtt gaaaatacaa acttcacatt 76261 aagataaaaa tgggttttat tactttaaat actaataaag aggtgacaat atattagttt 76321 tttaaagaaa aaacttttta taattaattg ccacttgcca gagtttgaga ttttcttaag 76381 aattcaggtt aaacttacaa aatcattttt aatacagaaa tcaatgaaaa atgatccaaa 76441 gacattagtt tcatgcattg gtaatcgtta ccaaaagcct tacagtgtgc aattttgtaa 76501 agtcaatgtt gcaactatcc caggaattgg aggtctgtgt ttttcccctc tctagtaaga 76561 ttcaggtcgt gtgtgtgtgt gtgtacaagt gtgttcacac gtgtgcatgt gcaatgtatg 76621 aacccacaat gctgaccata atgctggact cccaatgcac tggttaggat gcctttgctt 76681 gtatttaaag aagccatgaa gaggacatat tagttcacat 
ggctgaaaag ttaagtgtta 76741 gggcagcctt ctaataagct ttaatcaggg ctccagttcc atttctaatt ctgtctgcct 76801 tgccctgttg tggccagatt ccttttcggg ctggcttatt cacagttgca agatgtctgt 76861 gaacagcaac tcagccaaca acaactcgtc taaggagaga ggagctattg tttcctacag 76921 ccgttgcata gaagtcctga gcttccttct gattggacca ccccaggacc agtcattgag 76981 ggcgtgccat gtgctgactg gctgagattc aaaaacctat catcttgtca agtggattgg 77041 gattgactta tatcgggggc atgatatggt ttggttgtgt ccccacccaa atctcatctt 77101 gagttgtagc tcccatccca cgtgtcatgg gagggaactg gtgggaggta attgaatcat 77161 ggagagggtc ttttccatgc tgttcttgtg acagtgaata agtctcatga gagctgatgg 77221 ttttataaac gggagttcct ctgcacaaac tctcttgcct gccactgtgt aagacatgac 77281 tttgctcctc attctccttc tgccatgatt gtgaggcctc ccacgccgtg tggagctgtg 77341 aatcaattaa acctctttcc tttataaatt acccagtctc gggtatgtct ttagtagcag 77401 catgagaaca gactaatatg gtaaattggt accagtagtg gggtgctgct gtaaagatac 77461 ccaaaaatgt ggaagcgact ttggaactga gtaacaggca gaggttggaa cagtttggag 77521 ggcttggaac ttcctagaga cttgttgaat agctttgacc aaaatgctga tagtgatatg 77581 gacaagtaac atctaggctg aggtggtctc agatggaaat gaggaacgtg ttgggaactg 77641 gagtaaaggt cactgttgct atgcaaagtg gctggtggca ttttgtccct gccatagaga 77701 tatgtggaac tttgaacttg agagaaatga tctggggtat atctggcaga aaaaatttct 77761 aagtggcaaa gtgatcgaga ggaagcagag cataaaagtt cggaaaattt gcagcctggt 77821 gattccatag aaaagaaaaa cccattttct ggggagaaat tcaagccagc tgcagaaatt 77881 tgcacaagta acaaggagcc caatgttaat cactaagacc ccacccaaat ctcttcttga 77941 attgtagctc ccataattct cacatgtcat gggaggaacc tggtgggagg taactgaatc 78001 atgggggcgg gtctttcccg tactgttcac ctcttgagat ctgatagttt tataaagggg 78061 agttctgcac aagctctttt gcctgccctc atgtaagaca tgactgcttc tcattggcct 78121 tccaccataa ttgtgaggcc tttgcagcca tgtggaactg tgagtgcatt aaaactcttt 78181 cctttataaa ttacccagtc tcggttagca gcgtgagaac aaactaatag agggcaccac 78241 ccctagagct gggtgagacc aatgccactc aaattgatgg tgactacaca atggggagga 78301 gaatgggtga taaacattcc caatgtctac cacaccttgg tagccattta aacttttttt 78361 ttatttaaag cgtactgtct 
aggtttaagg tactgtaaga gatataaaga tacacagttg 78421 acaatctatt agatataaaa agacaagctg acaaataact acaatacact attattaggt 78481 gtaatgtaaa aatgacacat gtaaatgcta aaacgaagta cagtaaatgt aaaggagaga 78541 gacagattgt ttctgaggtg atcagagagg tgtcgcatag caaataacac tggaattagg 78601 tcttaaagag tgagtgggat tttaattatt atatgggagg caggggagct ttaagagtaa 78661 agccccagag gcagaaagcc aagggaagag ttggagaaat ggcccaggta agttggagca 78721 caagttctag aagttgctgc taaggcaaga aatgtttccc caatacaaag atatctcaga 78781 gttcagataa cacctgagga actaggataa taaagtcagg gtgcaaggtg aatatgtctt 78841 tattttctcc cataaagaaa cctggcttaa gcctccattt acctgattag tgggaaattc 78901 agtatctcct tcgtggggct cccccaactc cttgtgttca acccaacgct ttacacagtc 78961 caaagaggga gcagggccag tctgactttc acatgcaggc agaggccact ggtctcttgt 79021 cagtctggtg gacaacacgc atccccaggt ctctgtggct tctcagggag gacagtatca 79081 ctcatgtcac tttgaacttg gtcgtgagtg ttccttttca ggatccaggg ccattatctc 79141 agagctctgc ccttcagtta aattcagttt ccccatttgc caagtcactc gctgtcccaa 79201 ggactttgga taagaagtag cttgggatat ccatgtggat gcagaattct ttcatttatt 79261 tatttattta gagacagtgt ctcactctga tggagtgcag tggtatgatc ccagctcact 79321 gtaacctctg cctcctgggt tcaagtgatt ctcctgcctc agcctcctaa gtaactggga 79381 ctataggtgc ccgccagcat acccagctga tttttatatt tgtagtagag atgggctggc 79441 tcatctggcc aggctggtct caaactcctg acctcaggtg atcctcccac ctcagcctgc 79501 caaagtgctg agattacaag tgtgagccac tgcgcctggg cctggatgca gaattcctat 79561 cctctgttcc tccctgggat ccaccgcacc ccagaaagtg ggagattttc cttgctggct 79621 gtcccccaac ccccctatgg gcttccccct tgtgcaccct ttcctggcca cacatacccc 79681 cctggggttc gtggggtgga ggcacctctc tgaaatgctt ggcatctttc tctggtgcaa 79741 ggaaagaaaa taaaacccaa agttcagaca ccactgggac attctattct acttggattt 79801 ataggtccac gcctcttagt ctcttccagg attgcttatt tcacccaagg gagccaggcc 79861 ctcctgtgct cttttattaa aatttcatat taacatttaa ccttcttggt cattggcaaa 79921 atctgtgtcc ttagtctgtc ctcaccttga agttgaaatt cttttatttg cagccactta 79981 gaccctgcct tgatggcatt tcctgcccac agcaggctct gaactaagtc ttgctgtctg 80041 
ttttggacac atggtgactc tctttctgag atggagtctc gctttgtcgc ccaggctgga 80101 gtgcagtggc acgatcttgg ctcactgcaa cctctgcttc ccaggttcaa gtgattcttc 80161 tacctcagtc tcccgagtag ctggaagtgc aggcgcatgc caccacaccc agctaatttt 80221 tgtattttta gtagagatgg ggtttcacca tattggccag gatggtcttg aactcctgac 80281 cttgtgatcc gcctgcctcg gcctcccaaa gtgctgggat tacaggtgtg agccactacg 80341 cccggccagt gactctcttt tctaaactcc ccttcacaat tttgctctct tcaccctgct 80401 tctgtatact cacaaggaag tgggtaacag attgtcggaa ttaggctcgg ctcatggttt 80461 gcagcttctc gcctgtttgc tgcttctcta gtttaaaaac ttggggttcc aaatgtggaa 80521 attactccaa aacttaggtg aattagaaac aagtatccag tgactgtctg agggagaaca 80581 aagtaaaata catattcnag caactaggca agtaggatgg cttccccaat acagtagagc 80641 atttaagaac tggagaatgg ggtgctgaga agaatgtata ttctgttgat ttggggtgga 80701 gagttctgta gatgtctatt aagtccgctt ggtgcagagc tgaattcaat tcctgggtat 80761 ccttgttaac tttctgtttc gttgatctgt ctaatgttga cagtggggtg ttaaagtctc 80821 ccattattat tgtgtgggag tctaagtctc tttgtaggtc tgtaaaggac ttgctttatg 80881 aatctgggtg ctcctgtatt cggtgcatat atatttagga tagttagctc ttcttgttga 80941 attgatccct ttaccattat gtaatggcct tctttgtctc ttttgatctt tgttggttta 81001 aagtctgttt ttatcagaga ctaggattgc aacccctgcc tttttttgtt ttccatttgc 81061 ttggtagatc ttcctccatc cctttatttt gagcctatgt gcatctctgc atgtgagatg 81121 ggtctcctga atacagcaca ttgatggatc ttgactatcc aatttgccag tctgtgtctt 81181 ttaattggag catttagccc atttacattt aaggttaata ttgttatgtg tgaatttgat 81241 cctgtcatta tgatgttagc tggttatttt gctcgttagt tgatgcaatt tcttcccagc 81301 atcgatggtc tttacaattt ggcatgtttt tgcagtggct ggtaccagtt gttcctttgc 81361 atgtttagtg cttccttcag gagctcttgt aaggcaggcc tggtggtgac aaaatctctc 81421 agcatttgct tgtctgtaaa ggattttatt tctctttcac ttatgaagct tagtttggct 81481 ggatatgaaa ttctgggttg aaaattcttt tctttaagaa tgttgaatat tggcccccac 81541 tctcttctgg cttatagagt ttctgctgag agatcagctg ttagtctgat gggcttccct 81601 ttgtgggtaa cctgactttt ctctctggct gcccttaaca ttttttcctt catttcaact 81661 ttagtgaatc tgacaattat gtgtcttgga gttgctcttc tcgaggagta 
tctttgtggc 81721 gttctctgta tttcctgaat ttaaatgttg gcctgccttg ctaggttggg gaagttctcc 81781 tggatatatc ctgcagagtg ttttccaact tggttccatt ctccccgtca ctttcaggta 81841 caccaatcag acgtagattt ggtcttttca catagtccca tattttttgg aggctttgtt 81901 ggcttctttt tactcttttt tctctatact tctcttctgg cttcatttaa ttcatttgat 81961 cttcaatcac tgataccctt tcttccagtt gatcgaatcg ggctactgaa attgaccaca 82021 tagttggaag taaagtactc ctcagcaaat gtaaaagaac agaatttata acaaactgtc 82081 tttcagacca cagtgcaatc aaactagaac tcaggattaa gaaactcaaa accgctcaac 82141 tacatggaaa ctgaacaatc tgatcctgaa tgactactgg gtacataacg aaatgaaggc 82201 agaaataaag atgttctttg aaaccaacga gaacaaagac acaacatacc agaatctctg 82261 ggacacattc aaagcagtgc atagaaggaa atttatagca ctaaatgccc acaagagaaa 82321 gcaggaaaga tctaaaattg acaccctaac atcacgatta aaagaactag agaagcaaga 82381 acaaacacat tcaaaagcta gcagaaggca agaaataact aagatcagag cagaactgaa 82441 ggaaatagag acacaaaaaa cccttcaaaa aattaatgaa tccaggagct ggttttttga 82501 aaagatcaac aaaattgata gactgctagc aagactaata aagaagaaaa gagagaagaa 82561 tcaaatagac gcaataaaaa atggcaaagg ggatatcacc actgatccca cagaaataca 82621 aactaccatc agagaatact ataaacacct ctatgcaaat aaactagaaa atctagaaga 82681 aatggataaa ttccccgaca catacactct cccaagacta aaccaggaag aagttgaatc 82741 tctgaataga ccaataacag gctctgaaat tgaggcaata attaatagct taccaaccag 82801 aaaaagtcca ggaccagatg gattcacagc cgaattctac cagagataca aggaggagct 82861 ggtaccatta cttctgaaac tattccaatc aatagaaaaa gagggaatcc tccctaactc 82921 attttatgag gccagcatca tcctgatacc aaagcctggc agagacacaa caaaaaaaga 82981 gaattttaga ccaatatccc tgatgaacat caacggaaaa atcctcaata aaatactggc 83041 aaaccgaatc cagcagcaca tcaaaaagct tatccaccat gatcaaatgg gcttcatccc 83101 tgggatgcaa ggctggttca acatacgaaa atcaataaac gtaatccagc atataaacag 83161 aaccaaagac aaaaaccaca tgattatctc aatagatgca gaaaaggcct ttgacaaaat 83221 tcaacagctc ttcatgctaa aaactcgcaa taaattagct attgatggaa cgtatctcaa 83281 aataataaga gctatctatg acaaacccac agccaatatc atactgaatg ggcaaaaact 83341 ggaagcattc ccttcaaaaa ctggcacaag 
acagggatgc cctctctcac cactcctatt 83401 aaacatagtg ttggaagttc tggccagggc aatcaggcag gagaaagaaa taaaggttat 83461 tcagttagga aaagaggaag tcaaattgtc cctgacatga ttgtatatct agaaaacccc 83521 atcgtctcag cccaaaatct ccttaagctg ataagcaact tcagcaaagt ctcaggatac 83581 aaaatcaatg tgcaaaaatc acaaacattc ttatacacca ataacagaca aacagagagc 83641 caaatcatga gtgaactccc attcacaatg gcttcaaaga gaataaaata cctaggaatc 83701 caacctacaa gggatgtgaa gggcctcctc aaggagaact acaaaccact gctcaatgaa 83761 ataaaagagg acacaaacaa atggaagaat attccatgct catgtatagg aagaatcaat 83821 atcgtgaaaa tggccatact gcctgaggta atttatagat tcaatgccat tcccatcaag 83881 ctaccaatga ctttcttcac agaattggaa aaaactactt gaaagttcat atggaaccaa 83941 aaaagagcct gcattgccaa gtcaatctta agccaaaaga acaaagctgg aggcatcatg 84001 ctacctgact tcaaactata ctacaaggct acggtgacca aaacagcatg gtactggtac 84061 caaaacagag atatagacca atggaacaga acagagccct cagaaataat gccgcatatc 84121 tacaactatc tgatctttga caaacctgac aaaaacaaga aatggggaaa ggattcccta 84181 tttaataaat ggtgctggga aaactggcta gccatatgta gaaagctgaa actggatccc 84241 ttccttacac cttatacaaa aattaattca agatggatta aagacttaaa tgttagacct 84301 aaaaccgtaa aaaccgtaga agaaaaccta ggcaatacca ttcaggacat aggcatgggc 84361 aaggacttca tgtctaaaac accaaaagca atggcaacaa aagccaagat tgacaaacgg 84421 gatctaatta aactaaagag cttctgcaca gcaaaagaaa ctaccatcag agtgaacagg 84481 caacctacag aatgggagaa aatttttgca atctactcat ctgacaaagg gctaatatcc 84541 agaatctaca aagaactcaa acaaatttac aagaaaaaaa caaccccatc aaaaagtggg 84601 cgaaggatat gaacagacac ttctcaaaag aagacattta tgcagccaac aggcgcataa 84661 aaaaatgctc atcatcactg gccatcagag aaatgcaaat caaaaccaca atgagatacc 84721 atctcacact agttagaatg gcgatcatta aaaagtcagg aagcaacagg tgctggagag 84781 gatgtggaga aataggaaca cttttatact gttggtggga ctgtaaacca gttcaaccat 84841 tgtggaagtc ggtgtggcga ttcctcaagg atctagacct agaaatacca tttgacccag 84901 ccatcccatt actaggcata tacccaaagg attataattc atgctactat agagacacat 84961 gcacacaaat gtttattgtg gcactattca caatagcaaa gacttggaac caacccaaat 85021 gtccatcagt 
gatagactgg attaagaaaa tgtggcatat atacaccatg gaatactatg 85081 cagccacaaa aagtatgagt tcatgtcctt tgtaggacat ggatgaagct ggaaaccatc 85141 attctcagca aactattgca aggacaaaaa accaaacact gtatgttctc acttacaggt 85201 gggaattgaa caatgagaac acttggacac aggaagagga acatcacaca ccagggcctg 85261 tcatggggtg gggggagggg ggagggatag cattaggaga tatacctaat gtaaatgatg 85321 agttaatggg tgcagcacac caacatggca catgtataca tatgtaacaa acctgcatgt 85381 tgtgcacatg taccctagaa cttaaagtat aaaaaaaaaa aaggaacagg agaatgaact 85441 atttttttta gaaatggagc ctcactcact ctgtcactca ggctggagta ctgtgacatg 85501 atcttagctc actgcagcct ccacctcttg ggctcttggg ctcaagtgat tctcctgcct 85561 cagcctccca agtagctagg accatgcgcc accatgcctg gctgattttt tgtattttta 85621 gtagagacag ggtttcacca tgatggccag gctggtctag aatccctgac ctcaggtgat 85681 ctgcccgcct cggcctccca aaatgctggg attaccggcg tgagcacagc gtctggccct 85741 atatgctttt ttcaattcaa tggctacttg cctaattttg aatggccaaa ataaccttcc 85801 agttattttc ccttaatgga atcaacctag aatagttgcc tggggagagg gagctggggg 85861 gcagtgattg gaaagagaca tgagggaact ttttggaggg ataaaattgt tctatatttt 85921 gatttaggtg ttagttacag gatattttca tttgtaaaat ttatgaactg tagaccttag 85981 atctgtgcat tttaccatat ataaattata tctcaattta agcatagaaa aagtaaacca 86041 atgattgtga tatacttggc taatgtccat atgctacagt gtgttgaaaa gaaaaataat 86101 accctattga gaaggtagac atgtactgca agcataagct atgaaaacag agaaaagccg 86161 cagacattgt taacggactg agctgggatg tttccagttg tggggattct gagcctcttt 86221 tgtaatgtgc ttaatcactt ctaattgtac agtcagctaa ttacagtaat tagtcttgcc 86281 ttagttcaac agagttgaac taagtcattc tcttagcggg actgggttga attttcctgc 86341 atctagcgac ttcctcagat tgtggcctgc aatggtgcat tcccggggct caagaagtga 86401 tctgtttgtg ctgggttttc accactgggt gtttgagggg tgagggatgt gactaatcat 86461 ggagccagtg agtcacgcgt gtcaggcctc tgagcacagc aagagctctt tgggctctgc 86521 actggcacca ctctcggcgt actgacccag ccttgcttag gtccaagtca atcacattgt 86581 tttttttttt ttttgagaca gagtctcact ctgtcaccca ggctggagtg cagtggcatg // LOCUS HSU51243 24454 bp DNA PRI 05-OCT-1996 DEFINITION Human 
alpha-albumin gene, complete cds. ACCESSION U51243 NID g1418261 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 24454) AUTHORS Nishio,H., Heiskanen,M., Palotie,A., Belanger,L. and Dugaiczyk,A. TITLE Tandem arrangement of the human serum albumin multigene family in the sub-centromeric region of 4q: evolution and chromosomal direction of transcription JOURNAL J. Mol. Biol. 259 (1), 113-119 (1996) MEDLINE 96240683 REFERENCE 2 (bases 1 to 24454) AUTHORS Nishio,H. and Dugaiczyk,A. TITLE Complete structure of the human alpha-albumin gene, a new member of the serum albumin multigene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (15), 7557-7561 (1996) MEDLINE 96353855 REFERENCE 3 (bases 1 to 24454) AUTHORS Dugaiczyk,A. TITLE Direct Submission JOURNAL Submitted (13-MAR-1996) Achilles Dugaiczyk, University of California, Biochemistry, Riverside, CA 92521, USA FEATURES Location/Qualifiers source 1..24454 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="half lamda2; half lamda9; half lamda15" /chromosome="4" /map="4q" TATA_signal 858..864 mRNA join(888..1006,3084..3132,3401..3533,5003..5214, 6108..6240,6865..6962,7771..7900,11012..11226, 14442..14574,16794..16891,18256..18388,19146..19369, 20929..21061,22192..22252,23018..23143) prim_transcript 888..23143 exon 888..1006 5'UTR 888..918 CDS join(919..1006,3084..3132,3401..3533,5003..5214, 6108..6240,6865..6962,7771..7900,11012..11226, 14442..14574,16794..16891,18256..18388,19146..19369, 20929..21061,22192..22212) /codon_start=1 /product="alpha-albumin" /db_xref="PID:g1418262" /translation="MKLLKLTGFIFFLFFLTESLTLPTQPRDIENFNSTQKFIEDNIE YITIIAFAQYVQEATFEEMEKLVKDMVEYKDRCMADKTLPECSKLPNNVLQEKICAME GLPQKHNFSHCCSKVDAQRRLCFFYNKKSDVGFLPPFPTLDPEEKCQAYESNRESLLN HFLYEVARRNPFVFAPTLLTVAVHFEEVAKSCCEEQNKVNCLQTRAIPVTQYLKAFSS YQKHVCGALLKFGTKVVHFIYIAILSQKFPKIEFKELISLVEDVSSNYDGCCEGDVVQ 
CIRDTSKVMNHICSKQDSISSKIKECCEKKIPERGQCIINSNKDDRPKDLSLREGKFT DSENVCQERDADPDTFFAKFTFEYSRRHPDLSIPELLRIVQIYKDLLRNCCNTENPPG CYRYAEDKFNETTEKSLKMVQQECKHFQNLGKDGLKYHYLIRLTKIAPQLSTEELVSL GEKMVTAFTTCCTLSEEFACVDNLADLVFGELCGVNENRTINPAVDHCCKTNFAFRRP CFESLKADKTYVPPPFSQDLFTFHADMCQSQNEELQRKTDRFLVNLVKLKHELTDEEL QSLFTNFANVVDKCCKAESPEVCFNEESPKIGN" mat_peptide join(982..1006,3084..3085,1006,3084..3132,3401,3131..3132, 3401..3533,5003..5214,6108,5213..5214,6108..6240, 6865..6962,7771,6961..6962,7771..7900,11012..11226,14442, 11225..11226,14442..14574,16794..16891,18256,16890..16891, 18256..18388,19146..19369,20929,19368..19369,20929..21061, 22192..22209) /product="alpha-albumin" intron 1007..3083 exon 3084..3132 intron 3133..3400 exon 3401..3533 intron 3534..5002 repeat_region 3541..3557 /rpt_type=flanking /rpt_family="Alu" repeat_region complement(3558..3971) /rpt_type=dispersed /rpt_family="Alu" repeat_region 3972..3988 /rpt_type=flanking /rpt_family="Alu" exon 5003..5214 intron 5215..6107 exon 6108..6240 intron 6241..6864 exon 6865..6962 intron 6963..7770 exon 7771..7900 intron 7901..11011 repeat_region 8870..8881 /rpt_type=flanking /rpt_family="Kpn" repeat_region 8882..9139 /rpt_type=dispersed /rpt_family="Kpn" repeat_region 9140..9151 /rpt_type=flanking /rpt_family="Kpn" repeat_region 9169..9356 /rpt_type=dispersed /rpt_family="Alu" repeat_region 9546..9553 /rpt_type=flanking /rpt_family="Alu" repeat_region complement(9554..9773) /rpt_type=dispersed /rpt_family="Alu" repeat_region 9774..9781 /rpt_type=flanking /rpt_family="Alu" exon 11012..11226 intron 11227..14441 repeat_region 11798..11811 /rpt_type=flanking /rpt_family="Alu" repeat_region 11812..11950 /rpt_type=dispersed /rpt_family="Alu" repeat_region 11951..11964 /rpt_type=flanking /rpt_family="Alu" exon 14442..14574 intron 14575..16793 exon 16794..16891 intron 16892..18255 repeat_region complement(17447..17721) /rpt_type=dispersed /rpt_family="Alu" exon 18256..18388 intron 18389..19145 exon 19146..19369 intron 19370..20928 
exon 20929..21061 intron 21062..22191 exon 22192..22252 terminator 22210..22212 3'UTR join(22210..22252,23018..23143) intron 22253..23017 exon 23018..23143 polyA_signal 23118..23123 polyA_site 23143 repeat_region 23590..23596 /rpt_type=flanking /rpt_family="Kpn" repeat_region complement(23597..23844) /rpt_type=dispersed /rpt_family="Kpn" repeat_region 23845..23851 /rpt_type=flanking /rpt_family="Kpn" BASE COUNT 7364 a 4268 c 4529 g 8293 t ORIGIN 1 gaattcttaa gacctagtct cttagcatat tctaagcata caatacagta ttattagctg 61 tagtccatgt gatgcatttg aacttactca ttctacgtaa ctgaaacttt gcaccctttg 121 acaagcatct ccccatttcc tctacctcct gcccttgata accaccattt tatgctctgt 181 ttctagtttg attttttaga ttccacatat aagtaggatt atgcagtgct tttctttctg 241 tgtctgattt atttcactta acatgatgtt ctccaggttc atccacatat tctcatatgg 301 cagaatttcc ttctttttaa agattaaata gtattccatt gtgtgtgtgt gtgtgtgtgt 361 gtgcgtgtga gatttattta tttatttagt tttttatatg ttgactgaca ggttgttttc 421 atattttggc tattgtgaat aatgaggtaa taaacaccga agtgcaggta tatcttctta 481 attttttatt tttaattttt gtgagtacat agtaggcgta catatttata tatctttgag 541 atactgattt cacttccttt ggatatatat gcataagaga gattgcttga ttatatctgt 601 tctggccctt tgttccttta acatttgcag tattttaagt ttttttttgg ttgtctattt 661 gattaagtat tgtgcatact tagcctgtgg acttttgttc caactcagta gattttttcc 721 agtgaaacac aaaggtaatt tttttttctg gttaatattt agcaagaatt ctgcagagtg 781 atcaaaaaaa tcaaatactc agtatttcag aaatagatta aataggttac ttttttactg 841 ataatgtgaa agaatgatat aaaaacttga ttttcctcaa caacattact ttcttttgta 901 aatgtggttt ctacaaagat gaaactacta aaacttacag gttttatttt tttcttgttt 961 tttttgactg aatccctaac cctgcccaca caacctcggg atataggtaa gaaatttact 1021 tgtatatcag ctaaagcatg accagttagg ataatttcta tttttaaaat tttgttttct 1081 ttcaagttaa ttctttactt gcttttttca ccctctcact aatttacata ggctaaccag 1141 ttatccaaaa ctaacctttt cagaattttt attttgatgg taatttttat attaccaagt 1201 ctatatgaag aaagagttgt cacatctgaa gaataatcag cagtctatga caggacagaa 1261 agtgttgctg ggagatacat ttggatttta atgttggtat ttcttgcata tatgtatagt 1321 
ttatggtaac agtggaacaa tagatgttaa ctgggatttt tctggaacct ttggttgtcc 1381 aacatatttc ccaactatat ctttatggaa tgcagaacct agttcctcct tcaactgtac 1441 agtttgaaca tccctcctga gcctggccaa ggccatctca gtctgtagga agggatgact 1501 ttttttattt cttccatcca aaggggaaat attttgaacc tcatatgtct agaggaggtg 1561 tctttttcta atcctttttg gcctagaagg agctcctttc tttgacttga acaaatgtgc 1621 cctaagcatg gcagtggccc tgaaggcatg gatcgtgtct tacgtgtctt actcaatttg 1681 acatgcccct acaataccag ctagtgaaga ctggtcattt ctgagttttc ctgctccttt 1741 tccctcgtaa caacctgttt caaaatcttg attttttgcc tctttaatat cttttgtatc 1801 tatattttcc ttttcatttc cattactcct accaaaattt agatcttgct gatttctcag 1861 tgtgacagtt gcaaagtctc ctatggcctt cttactttgt cccatccacc cttcttttgc 1921 ccatttattt gacaatagtt ctttctacct ttttctattt gtcaggcact gtggaaagtc 1981 ccatgacatt acatatacca ctacctgact cgtatttctt agtgtataga aatgatcaga 2041 ttactatcct atataaacat tttccagggc tctccatgtc ttccaaatga agcccatact 2101 ccttagcttt atatacaaga ctcctataat ctgggcccag gttctttcta tcctaggtac 2161 cattattcct ctttgtgacc cactttctca tcaaaataga ttgactttcc ctaaatactt 2221 ctcaagattt cttaatgcat ttattcacat cattctctct ccacacagac agacagggtt 2281 tttcattttt ccgtcagctg aatctcttac atttactcat gctcaactca tatctcactc 2341 ctcctataat cctttttttt ctttctttct cacccaggat tcataaagca ttcccaactt 2401 aatcccattc taaattagat tattgtcaca tgcacgagta acttccatta gagagcagct 2461 taggaacctg atgtcaaggg gagatatttt agtcattgtt gattgcttca cagggtccag 2521 cacaacactt tgcaaaaaat aaatgtagct tatgtgaaca aacataaata tgctccaatt 2581 agtatatgct cattgggagg gaatccatga ttttaggtgg atctggagat gacctaaata 2641 gctgtgactg agtagcgcta aaaactatgc tgaccaccag catgctacag catctctcat 2701 caggctctgt tatggcattg aatcagccat tagttaatct tcatatcact tctctagtca 2761 taaatgatag acatttagca tggtaatgta tttgacattt ctatctgggg agccatcaat 2821 ggcaaattta aagtctagtc agattaagat ttatttaggt caattcaatt tggtaaatat 2881 ttcttgaaca cataatcaac aatgactaag atttcctatt ttgttcacat gacttatttg 2941 tttcttgtta aatatgcaaa attatctcct gtcatcagtg gtctattttg tattttatac 3001 acttaaatgc 
catgtatagt tatttatttg aaaaatgtta gaaaaccatg ctatgtaata 3061 acctaaatat tcttgctttt cagagaactt caatagtact caaaaattta tagaagataa 3121 tattgaatac atgtgagttg tgctaaatac tttttgatga tgatttttaa aatgatatat 3181 tctagaaatg ttaatgcttc tcaaaagtaa tttaactaaa tagaagccat atccaagaag 3241 acaactttaa ttataaaatt caggcaaaga agttgtaaaa tttatagcca ttgcaataga 3301 taatttcctg aggctagtca ggatgttcct gtgacttttt cttgaccata aatggaaact 3361 ctgcaactga tctttgtata ccccatgtga actgttgcag caccatcatt gcatttgctc 3421 agtatgttca ggaagcaacc tttgaagaaa tggaaaagct ggtgaaagac atggtagaat 3481 acaaagacag atgtatggct gacaagacgc tcccagagtg ttcaaaatta cctgtaagta 3541 aattgcttgt gtttcttttc cttttatctt atgtctttct ttctctttct ttctttcttt 3601 ctttctttct ttctttcttt ctttctttct ttctttcttt cattctttct ttcttctttc 3661 tttctttctt tccttctttt ttttttcttg aaacagagtc tcgctctgtc acccaggctg 3721 gagtgcagtg gcaatcttgg ctcactgcaa cctccacttc ctgggttcag gcaattctca 3781 tgcctcagtc tccctagtag ctgggatcac aggcatgcac caccatgccc agctaatttt 3841 tttgtatttt tagtagagat gaggttttgc catgttgacc agcctggtct caaactcctg 3901 accccaagtg atcctcccaa cttggcctcc caaagtgttg ggattgcagg cgtgagccac 3961 tgtgccctgc caattgcttg tgtttcttaa acacacaaga ttttttgccc aacactgaga 4021 aaatttctgg cttgaaaata tgaaatgaaa gatgttctgt gcgcagcatt ttcattttgc 4081 ttaattttgg tatttagaac aaggtacttt ccttggaaaa gatactaatt ctgtattttg 4141 atgcacagaa ggagattgct gtgtatttgc agtatgtata cagtatgccc tatatgttca 4201 ggcagaagag gaagccatag cttgaaactg cacatgaaat tacagaggta cctccttagc 4261 tgcctctctc aatatgtatt ctattgttac atcccttttc tgggcctcct ccaagtagaa 4321 tcactatctg catctccaaa aaactactac taaacactac ttctaaacac tctaccaata 4381 cattaacata ccagcatgtc atcaccttgg gaggtgagta gaaaagcctg gtggaaggga 4441 aatgtaggca gcaagggaag agtaagaggt ggaagtgtct tgtgacaaga cagaggaaca 4501 aagattctga aagtgcagga tgattgtcaa gggtttgggt aaaaatgctt tgatttggca 4561 cacctgttct cccagctatt taggacactt tggctggagg atcacttgag cccaggagtt 4621 cgaatctagc ctgcacaaca tatcaagatc ccattccaaa gaagaagcag aaaacaagga 4681 gaaggagaag gaggagaagg 
agaaggagaa agagaaggag aaggagaaga ggaagaggaa 4741 gaggaagagg aagaggaaga agaagaagaa aaagaagagg aagaagaaga agaaggggag 4801 gaggaggagg aggaggaggt ggaagagaag gaggggggga agggaaggag aaggggaagg 4861 ggaaggggaa ggagaggagt aggagaaaga gaaggagctc tgttggtaat ggtgggaaaa 4921 ttagtcattg tgttttctca gtgagtacag aagctttact ttaccatgac taaaaattgt 4981 ccttttcttc tctgttgtat agaataatgt tttacaggaa aaaatatgtg ctatggaggg 5041 gctgccacaa aagcataatt tctcacactg ctgcagtaag gttgatgctc aaagaagact 5101 ctgtttcttc tataacaaga aatctgatgt gggatttctg cctcctttcc ctaccctgga 5161 tcccgaagag aaatgccagg cttatgaaag taacagagaa tcccttttaa atcagtaagt 5221 ttaatcttag taaaaaatga tccagttgaa gaattagtct taggctgaat ctaatatcta 5281 tctcaaataa atatggctgg gttcattaat ttattgatta attgtttcat tcattcaaaa 5341 catactgtgt gtcagatact gtgctcagtg ttcaggatat aactatgatc aagatatgga 5401 ccctgcttac aagaccatta tattaacgaa ttttatcttc taattagtaa gtcctagagc 5461 agaggtgtgt acatggtgtt aaattggtag gatgcacagt agaggaaata atatactctt 5521 cccaagggga agttgctgtt ttcacctcag gcagacattc aattacagga tgtttagcaa 5581 aacatctgtt tttaatatag tgctgtcatt gcagaaacag actttttctg gaaggttggt 5641 atgctccttg aaatctaaat ctaaatcaca aatttatctg aaaaaaagta aacaggaccc 5701 ctgaatctca gatagagtgt gtatagatag ctgttctttg aacatggaag agagaactgc 5761 tgaccagctg gtagcatgtg gatggaagga caatagccac agttcagtgt ttgggattag 5821 aattgcaatc attattccta ctttggagtt agcaacttca atgataagga cttcaaagga 5881 ctcttatcgg gaactggtta atgctgcctt atattgaagt tttctataac tggaaacctt 5941 tcttcctttt accttgtttt acaacttgct cttcttgtct ctttctattt tttcttttcc 6001 ttcccatctt accctccctt cccttccctt cccttccctt gtctacattc attgcttcct 6061 gtttcttccc tcacccgtct tctctctttc atttttattt tttatagctt tttatatgaa 6121 gttgccagaa ggaacccatt tgtcttcgcc cctacacttc taactgttgc tgttcatttt 6181 gaggaggtgg ccaaatcatg ttgtgaagaa caaaacaaag tcaactgcct tcaaacaagg 6241 gtgggtatag catttgttcc atgaagagga taagaaatca ctcaatacca cagatccaga 6301 ccctgactta tattatagca aaaggcttgc ttaccatata tgatgttctc ttgtcacatt 6361 cagtgatttt caaaccattg gtcaccaggt 
gctgatctaa atattctctt tacttgagac 6421 tcttgaatga cagccagtca ttttcagggt ctctcttcca gctttgtttt ctttgtaaat 6481 tggaaggaag ccctggtgag tgaatgccca gggtgaatga tgacatttta tcttttatct 6541 ttgacaattt gtagtctttt gactacaaac aacatggtat aatctcagta gggatggcca 6601 agctcacagc atgctgcccc accaagggag cacttatgca cctttttacc aagtgcaatt 6661 caatcatcag caagactggt cctacctttg tgatgtgccg taagtgcata tttcttttaa 6721 tatgtgtgtt tactgaaata aaaatcctgt agattttctg tctctgaaca aattttgtaa 6781 tagtttgaat gaaatatgac taaaaaataa gagaatagtg agattttgct attgtttcaa 6841 aagaattttc tctttcttct tcaggcaata cctgtcacac aatatttaaa agcattttct 6901 tcttatcaaa aacatgtctg tggggcactt ttgaaatttg gaaccaaagt tgtacacttt 6961 atgtgagttt tatactatat gtcttgtctc tgcttgcatt tccttgtctg tgtcccattt 7021 ttgttaaatg gttgataact ttttatttat attcttccta gtacctaggg gctttggggt 7081 gagtctgagc ctacttagac agctcaggac caggacatta ggttgagatg agagaagttt 7141 tgggaggtga ggagacccaa aagagaatgg tggctgggtc tgagacatat ttaggttaat 7201 tgagaatctg aactgctctg gatgtttcaa accagggtta taaaacccac aagtgcttta 7261 gaaataattc atccattggt ctttgacttt aacaatgagg agataaattt tatctctagt 7321 gcttgcaaag ataaatttta tctctagtgc tcaccctaat caattagtaa tgggtgctaa 7381 ggcaatgtct tgagaaaaat tttgagattt caggtagatc agcaggaaag agttctatca 7441 tgaattatct atgcctgcca tgagcttaca gtaggaagtg gtggccttac acagcatgtc 7501 ttactataat aattatggaa gaaagcacac ctctcagttt tgtttttttt cacatccagg 7561 actttggggc tcttaaaata tcaccctaag attttcaaag tgatcatgta ggctgtgcac 7621 caactatttg ataaagactt ctataaaaac aacaacaaca gcaatggtgg caatactttt 7681 tgtagctatt catatcatat tagcaaatta ctcccacttg ttctacttgc tgttcaaatt 7741 atttttgtgg tttatcaatt ctctttccag atatattgcg atactcagtc aaaaattccc 7801 caagattgaa tttaaggagc ttatttctct tgtagaagat gtttcttcca actatgatgg 7861 atgctgtgaa ggggatgttg tgcagtgcat ccgtgacacg gtgaatattc tctaaaacca 7921 agttaaaata gtgatttttg gcaattatca tttttttatt ctcaagttgc cctgatatgt 7981 ggttactttg ccacaatgaa atcagaatgt gatttaactt ttgaatgaaa gcataaaatg 8041 aaatataaac atttttataa ataaaccttg ggatttttta 
tacttggaat ttttaaagat 8101 tactgaatag attgtatatt atatggctaa atgtcttgat gtaaaacttc agaaatttca 8161 aatttttcat agttcattta gtacagattg aagttttctc ttacattata aatcctcttt 8221 tagagaggtt ggaaacaaat agtaaatttc aaggaatgac ttgttaacat tatgttttta 8281 agagctgatt tccctgagct tccttacgct attcaggtat agtccttcca aataagtgga 8341 tacattaatt taacttttct tttccctttc tcttttggtt cctttttcct taccttctcc 8401 cttccctttt cctgttccct cccttttccc tttcatttct ttccacgttc cctttccttt 8461 ctggccttta actgacaatt aaaaattgta tattctcatg gtatacaaca tgatatatat 8521 gtatgtgtat atataacatg ttatatatgc atatatatta atgtatacat gcacacacat 8581 tgtggaatgt ctaaatcaaa ctatttaaga tatgcattac ctcacatact tatttttttg 8641 tggtgagaat actccaaatc ttagcaattt taaagtactc aatatagtgg tatgaattca 8701 tcattgaatc actatgatgt acaagagatg tcttgaatgt attccttctg tcctaatgaa 8761 attttgtgtc ctttaaccaa catctcccca attcccagcc tctagaaatc tttaaagtga 8821 ttttccaagg aaaagtattg gtacgaatat tgttggcata gtttgaattg ttcatgcatt 8881 ctgttctcac tcataagtgg gagttgaaca atgagaacac atggacacag ggaggggaac 8941 aacacaggcc agggcctgtt gggttgagat tgggtgctga aaggagagca tcaaagtagc 9001 tcatgcatgc ggggcttaaa acctaggtga taagttgaca ggtgcagcaa atctccatgg 9061 cacacgtata cctatgtaac aagcctgtac attctgcaca tctgcccatg tatcccggaa 9121 cttaaagtga aaaaaaattg ctcatgcatt caaaggtaat ataattttat taaaaaaata 9181 actgagtatg ataggttgca cctgtaattg cagctactca ggactcagga agctaaggag 9241 ggaggcttgc ttgagtccag gagttcgagg atacagtaca gtgagctata atcccaacac 9301 cctcagcaac agaatgaaac cttatttcaa aaaaaaaaaa aaaagaaaga aagaaagatt 9361 caacttttag gcttacagtg aagatagcat gtcaattaaa taacttgaat tttagggtat 9421 tgagaaatgt ttatttcaaa caatggacaa cagatgttcc caataattat ttttgctcct 9481 aactgtaccc aagttaaatc aatctccttt gcgatttatt tttgtattgc ttgattgttc 9541 ttggtgatgt aggattttgt tttttgtttg ttttttgaga cggagtctgg ctctgtcgcc 9601 caggctggag tgcagtgaca atgatctcgg ctcactgcaa gctccgcctc ctgggttcac 9661 gccattctcc tgcctcagcc tcctgagtag ctgggactac aggtgtctgc caccacaccc 9721 cgctaatgtt ttttgtattt ttagtagaga cggggtttca ctgtattagc 
caggatgtat 9781 gttttataat gtgaaatcta aggtactgaa ttaggatata attatagatt ttctgtgaaa 9841 tttgaggctt aggttaaggg attatgtctc aaatttcctt ctttttaatg tctttttgga 9901 gaaataagag gaattgaatt acagttaact gaaaatagaa gtaacttccc agaggataag 9961 agatgctctg cctagaacaa tgctggcatt acatgagatg taaatgtctt tttgaagatt 10021 ggcttcctgg ttctatgtgg tttgttggaa acatggtgcc gatccccatt ttatttttta 10081 cctcagtcct gagcttcatc ttattcctta tcctccatgt tcaccagaag attcgtcagt 10141 ggaaatagct tgagtaaagt cataaaggct tgaaatagtc tagtgtgttg ctgtaactca 10201 acataattca agtgaggctg aaacggagag gggaaataat gtgcagtctc attctagagt 10261 ctggaagggc ccagaacagt ttttaaaaat tattcaacta atgtgtaaat aagataaccc 10321 caccctctct ctaattccca actcttatgt agaggttaga actggggcaa ttggtaggga 10381 tggggacaga ataaagttgt ccatttgagc agtatttgct aacctgaaca gcttcaaatg 10441 aatgatatcc attagaaaat gatatccatt agaaaatgat accatgagtg gaatagcttc 10501 ttatatttga tgaatcgtca tctggtagtt gctgtagttc actcacttgt cattctttca 10561 ttcatttatc aaatatttat tttgattttg ccatgtgctc agccacaaga gaagaccagg 10621 attatagata ttatagtaac atgtctttgg tggcctccac tgaaaacaca agtggactct 10681 tatggaaaag aaacaacatt tagaatataa gatgtaaata tatcagtgta ggagggttat 10741 ttaaaagact aatgcttaat aatggatgtg aatctatagt acagcctatt ttttcataat 10801 tatatcatct taaatcctga gaagttttaa tgtaaaactt catgtgcatt aagtaaatcc 10861 aaaatttaat tcaaacaaat attatctagc aaattaattg atgaagtgat tccagatgct 10921 gcgatcatga ctggttttgc atgccatcat tccataatgg aagaaaaccc agcctgaaag 10981 taaaataatc tctcattctt atttggtaca gagcaaggtt atgaaccata tttgttcaaa 11041 acaagattct atctccagca aaatcaaaga gtgctgtgaa aagaaaatac cagagcgcgg 11101 ccagtgcata attaactcaa acaaagatga tagaccaaag gatttatctc taagagaagg 11161 aaaatttact gacagtgaaa atgtgtgtca agaacgagat gctgacccag acaccttctt 11221 tgcgaagtaa tataactctt tattgaattt ataagggaaa cagaaagaaa cttgtaagat 11281 ttgacttttg ttgctaattt aggtggaagg acatggtacc gtttatttca ctaggtatca 11341 tataattttt ttttaaatca tggctataga gtgtagtttt cacacaattt taagaaaaaa 11401 tcaactgtat tattaaaata atcgctagca 
tttattgagc acttaatata tgccagtcat 11461 gggactatat actttataca gataaactat tttaattctt acagcaatca tacaaaataa 11521 gtacaatttc tcccagtttt tcagatgagc aagctgagat acaatgaagt gaattgttca 11581 gattcacagc taaaaagtga cagagtaaga atttgactcc aggaagcctg gcctagcaag 11641 ttgcaggctt aactgctatg ctatgccatc tacaactgtt aaggggaaga ctagtgaact 11701 tggagccaca gggactatat ttgagttctg gttgagttgc tgggacaagt taacctttct 11761 gtgctttttt tttttgtcat atataaaatt catataagaa tactcattca tctataatcc 11821 cagctaatca ggaggctgag gcagagaatt gcttgaacct gggaggcaga ggttgccgtg 11881 agccgagatc acgcactgca ctccagcctg ggcgacagag tgagactgtc tcaaaaaaaa 11941 aaaaaaaaaa gaatatttat taatttaaag tttattagta atgtaatata atccagggac 12001 tttactgttt ttttttttgt tgttgtttgt tttttttatc tcctggaagc cttcatctat 12061 agagagtata aagtgagaag agacaaacat aaataaataa gtgcaaaaaa tgttctttta 12121 tttctaatag catcctgttc taggtaatgc aggggaacag aagaagaaat ggtcagttgt 12181 atatagggac cctggaaagg cttaataggg aagtggtact tgaagaaatt ttggtaatct 12241 agataggtag agcatgaatt gaagtttata atggtttgtt gtgtatgtgg ctgtgagtag 12301 ggcatctgca gcgagagacc agcatggtga gaggtaggtg gggccagatc atgcaggcat 12361 gctaagtaat gggatgccac tgaatggttt taatggaaga atgaggaagg acaaattcac 12421 tttcagtttt tagaaacctc atctgaagaa aatcccagtt tggggcttgg ataactgggt 12481 gggtagtttt cccattcaat gtgaaaaagt agcaggtttt gtgagaaaga taatatgttt 12541 aagtatgagc acattgactt tgaggtacca gtagaataac taattggaac tatcagaaga 12601 gaattttata tgtgaacctg tagctctgga agggaatctg gcttgaaatc taggacataa 12661 caacaactaa tgtttactga atacttacta aattttaagt gcctctcatc tactaaccaa 12721 ccaaatttta aaaacaaccc tctgaagtta gtatactttt ataatcttta tttttcagat 12781 aataaaacag gcacagagag gttaagtaat ttacccgaag tcacacatgt ggtgtggcca 12841 ggattcaaat gcaagcagtg tggctccaaa ctcctagtct tataactact gtcctctacg 12901 gcttctcatg gttgtggata ttaaatcaag ttcaattaaa caaaatgata gattgtacct 12961 gacatatgtg tttgatagat aaaagccatt tttataatcg tcactattac aacataaaaa 13021 gtgtttgaaa catggggaga aatgatagag taacagtgag cacaccaagt cagagtgctc 13081 agtgttttaa 
ggtagagtct ttctggatgt gaacatttta ggcaagtcct cagtggcttc 13141 tccagcctag agtaagcttc cagggatttt tcttcacagt attttgagac atgtttttag 13201 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tactccccca ctcccagaga agaagtccat 13261 ggcttacatc agatattaag aggagccctg attctctgaa cgttaagcac tactgccatg 13321 ggatatggga atatatctca caaatggaat tgaaaaagca aatgggaagc tggagagatg 13381 gtttcagaaa aaccaagatt aatcaagaag gtcaaatcct gaaaaaatag gggtggggag 13441 gggatgaaaa gagttacctg gtcatgaaaa acttgtgagg tattgcgctc acaaaacgtg 13501 gaagatctga acttattgtg aatgaataga tgagttttct tttttttatg aagactgtaa 13561 atagcccagc tgaggtactt caagatgaca tctctgtctt gagaaaagag ttcgcttggt 13621 aaagaatgga gtaaacatcc caggggactt agggtcaaat tttatatctt ccattgatag 13681 gaaggtgctt aaccagtctg tgatttagtt tcctcagttg taaaacagta catattttgg 13741 ggactaagtg aagtgaaatg agaaagtgtt tataaagtcc aggaagcaca tctatatctg 13801 ttatgcccaa agtttgatcc agtgcctagc acctactaag ggcttaataa ctgttagcct 13861 tcttcctctc tctttctttc caaatcacac ccacctatgc tcttgaagtc cttgacagag 13921 gatttggaag ccactgccag tgatttggag ctgaaggcgt aaacactccc atagttggca 13981 agtgacagga tgtagtaatg ggcaaagctt gctccctatc tctcacgaag accacattgt 14041 gggcgggaag gaagccatat tcccagaatg tcaagttgga aatctgaatg catcttcaaa 14101 cataattata agcaatatgt attttttaaa gtatattcac aatttattat ttccaatgtt 14161 ttatagatta aatcattcca ataattagtt ccaaataatt ctactaattc tctcttctca 14221 aataaaaaat attgttaatc taaaacgtac catggaagag aacttttgta aaatgatcta 14281 tagccaaaat gtcctcttaa attactctgt gaccaagaca gaggagaaaa ttagcttttg 14341 aaagcagagt cagtattaga tctgactgga ttacaaaata gtggaattgg gttttttgct 14401 ctttctctaa tttctaatat acaaaatttt acacattgca ggtttacttt tgaatactca 14461 aggagacatc cagacctgtc tataccagag cttttaagaa ttgttcaaat atacaaagat 14521 ctcctgagaa attgctgcaa cacagaaaac cctccaggtt gttaccgtta cgcggtaggt 14581 tccattgttg taggttcaga aaatcaaaaa agaacaacta gaaaacactt aaaaattgat 14641 atcagagttt tgctgcagtc tgggggtaga aaatgtgact ggctaatctg agtccactgt 14701 agattcagcc taagaagatt attgggaagg ccaatgtcat cagtttggct gatgcataat 
14761 gttttggact actaatccaa tgtgttaatt caaaaaagtc ttcccattaa acattttaaa 14821 tttacatttt tctttatgta ttctgtattt tattaagccg tattggatat tacatgattt 14881 caaaagcaat gattgctgta gatgactagg aaaatattct gtattttatt aagccatatt 14941 ggattcttta tggtttcaaa agtaatgatt gattgttgtt gattattagg aaagcaacaa 15001 aggttacaaa gaaccctttt ctgattctcc cactgggaaa tagctataat taacatttgt 15061 gttcaactct tcagtacata tacgtatata tacgtgtatt ttaaaaataa gattggcata 15121 attcctagaa aatgtttgga cctgtttttt tatttggtat catagaaaac ttttttgttg 15181 ttgttgttat aaattgtgct tccttggttt cctgctgttg ccacaaaggg agcggcttaa 15241 accaacaaaa atttattctc tcacaagagg ccagaagcct gaaatgagtc tcatggggct 15301 aacatcaggt gtcagaaggg ctgtgttccc tctagggaag ctagtccttg actcttcttg 15361 tgtctagtat agaaaacttt ttacgtcatt aaacagactt gaaaataaac ttctttttaa 15421 atgatggtat cccatcataa ttatttaatt tatttaactg atttcctact gtgtgtttga 15481 cattttggat gtttgcttag ggtgtttttc actttttaag tgaccctgag atgaacgtct 15541 ccttccattt atctttatta atatttatgg cttttttctt aggatggatt atcagaagga 15601 aaattattgg atcaaaatat atgaacttta aaaaacattt aacacaaatt gttgaattac 15661 tgtcaagaaa gttgtttctg gtgtctggcc ccacaacagt gcttggaaat gctcatttca 15721 atgtcacttt taccaacatt gcacatggaa attaactttg ctagatttat atgtaaataa 15781 cagtgtttgt gtttttaaaa cacctttatt taatttttaa attgttaaat tttttccata 15841 tgattttagc tatttacaca ttttctgtga aaaatctgtt catgtgtact acccattttc 15901 ctgttgaaat tagtgttttt catataaaat catatgtatt tctgaataca tattagagaa 15961 attaataatt tgacattttt tgatttattt ctccttcctt tgatatgtgc tacttttgct 16021 ttagcaaaac attttttttc acttgccaca aacaccagtt tatacccact caaatttgag 16081 caatggaata agacttggat ccctgctctg taactgtccc taatgaggta gtctgtttct 16141 gatccttaaa ttaatcatag cgctagaaaa ttctctcttt tcatggagat acaatttgct 16201 ccacaatagc tatccactgg tgtttagttc tctgtctggt ggccactcag aagatgtcca 16261 attttatgcc acatgagaga tagtccatca aatatttgaa gaaaactgtt ggaggtttta 16321 agttccaaat ctaaacaatg aaaaaatcaa aaagagcaac atgatttaaa atgttctgta 16381 acaacactta ctagctgggg tattctgggt aagtcactta 
atttcacaag gccagttgcc 16441 tttacttaaa aatggagata acaactgctc ttgccacttc aaagtgacaa ctgaatgtta 16501 ttattttcaa catacaattt aaattctatt ttgtaatttg gaaggctctg aaatatgtac 16561 aaggtgataa taaaaaaatt tggactttta aatttagcaa gaaatatatt tttatataca 16621 taattaaaaa attaagttct tgcacgtgct tgtaaaaagt gccctgaatt tagtgtaata 16681 atgagaattt gaaaatacca tagaaataag aaatgctatt ctacacattg atgtaaagct 16741 tatggttaaa ttttagataa aatgtttgta accatgatta ttcttatttt caggaagaca 16801 aattcaatga gacaactgag aaaagcctca agatggtaca acaagaatgt aaacatttcc 16861 agaatttggg gaaggatggt ttgaaatacc agtatgttgt ttgcacaagt gggctaacac 16921 ttgccactag aattgaatac ataatccctc gaagcgtaaa aaagttagct gtcaagtttt 16981 tcagttatgg ccgattcata aactcttgga agaatttttt tttaaagtca agaatgccta 17041 ttgaactgtc acattaatct tttatcagaa atgaagctaa aaataaaaga aatacaggat 17101 tctggagaca ttgattctct aatgctttgg tttctgtttg acattggctt tctcactgcc 17161 aatctagaat ccctgcatcc tcacccattt caaagtggga cactgcatat ttcaggtcat 17221 tacagtttct ttggggaaga gatagtttgc cttgccttgc cttacggcta tcttacctga 17281 gtttatagaa ataactcagg atgtgtttaa cactttaata tttagctttt tagtgctctt 17341 gtgagcattt gtgaataata tgcagtttgg gtttttttca attattatgt caatgtattt 17401 tattatcatt attattattg ttattattat cattactaca tatttttgag acagggtctc 17461 tctgtcaccc aggctggagt gcagtggtgt gatctcagct cactgcaatc tctgcctcac 17521 aggctccagt gatcctccca cgtcagcctc tcaagtagct agggctacag acgtgagcca 17581 ccatccctgg ctgatgtttg tattttttgg tagagatggc gttttgccat gttgcacagg 17641 ctggtcttga acttctgagc tcaagcaatc caacccttcg gactttgctg ggattacagg 17701 tgtgagccac tgcacctggc ctcatgctaa tgtattattt attgaaacta aacatcttta 17761 aatcaaaata aaacacagtt taaaaatatc tgactttcct atttctgcaa tttggctgtt 17821 agctgacaat tcaaacttag ttcattaaaa atcttagttt tcttcatctc agattatcct 17881 tttacaacat atattgatta aattctttct ttaaaatatt ttctaatgat gtaaattatt 17941 ttaagcaaga ttttagatac tagccaatgt atttgctgtc aaggagctcc tatacttaaa 18001 gataacacgg ggaggcaaat ctaagagctt ctgagacaac cgtgaaatta gaaaggaagc 18061 taaaatagat ataggaagca 
gggtgcagga aaacaggaag taaagacata tttctctact 18121 ttatgagaga gctaaggttc atgttgcacc tatattggtt gtcgctagcc ttgaagggtc 18181 tgggcttcag cagtaattca agggaaaatg ggtatctgct ttcattcaga accaaacaga 18241 caaatgttct ttcagttacc tcatcaggct cacgaagata gctccccaac tctccactga 18301 agaactggtg tctcttggcg agaaaatggt gacagctttc actacttgct gtacgctaag 18361 tgaagagttt gcctgtgttg ataatttggt gagcatggcc tgtgtaccag actactcttt 18421 tttttttttt ttaaaaaaaa gaatatttgc actcattgat ttaccgtgat tatccatatt 18481 cctttcttca ctgttttaat attgttaagc aaaaattgtc aagttttcct tttggctcac 18541 tttgtgaaaa ccctaagaag taaaacagaa tgtcttcaat atttttggag agtacttcaa 18601 attatctttc ttatttcttc attgtattag aagggatttg ttaggacaaa tgatttagga 18661 atcataatat tgtttcaaaa tgatacagga aactttataa ttttaagtaa ggttacagat 18721 tcttcctctt ggccccattg cttcatccag caatcaggca ttgaatacat ctctttaggg 18781 ctggggtgga gagtaccttg aggcagatgc caggagggga agaaacccct gtaccacagt 18841 gtttgatcgc ccctttcagt tggctttttg cagctgtctg tttgcaattg aaagtttgtt 18901 tatatgccac caagaaatta atttgtaggc atctaaattc ttatggcttg gattttggtc 18961 ataatgccat ttaggacccc agcttctagt tttacttgta tatagggata tgtcatgagt 19021 gcatgtgttt tatgagtgag gttaacagtg atcttagaaa gatctaagta ttcatctaac 19081 catattttta gaaattccat tcaagtttta agatcgtatc tcagttgcaa ctcttgttgg 19141 tacaggcaga tttagttttt ggagagttat gtggagtaaa tgaaaatcga actatcaacc 19201 ctgctgtgga ccactgctgt aaaacaaact ttgccttcag aaggccctgc tttgagagtt 19261 tgaaagctga taaaacatat gtgcctccac ctttctctca agatttattt acctttcacg 19321 cagacatgtg tcaatctcag aatgaggagc ttcagaggaa gacagacagg tacaaataat 19381 ctcttccatt cttctttttc tttggtttga agacacacca tgtattagtg agaaggttta 19441 tctagtggat atgtcttttt gatatcacaa tcctgttcct tccaccaaat tgtccagtct 19501 cttctatatg gaaaaataca agtgcatatt ttttttttgc cattgtttat atctagcttg 19561 tcttttcatt tgaaagagac ttggaatgtc caagtctaac attaccttta gaaagtttac 19621 agcagccagc aaaaaggctc ataggcaggc tgataatgaa tttcatttcc ttgagggaat 19681 gcccggagaa tatcatttag tgtttacaaa ggagaggcct aaagggccag gaagtttact 19741 
ttcattttct caataaatgc tttaaaatgc caggaaatta gacctagttt tttggattta 19801 gacagagtct gcctaatgtg tagtggagac aacattcagc taccaaatat gaggaaatgg 19861 aggttgtttt ggacatgaat tttctggaaa gatcagacgt agtcatgtat accataatac 19921 ttggcataat aaaggcctaa tatatgtttt catttttaaa tagaataata ctcttattat 19981 gtgcaatatt catagattca cagtgcaata cactcttgta ttttccagtc catcctattc 20041 tagtctggtc tattttattc tttttaacta tcctgttttc attttaaaaa tgaaagcgag 20101 aaaatgcaaa tttttgttaa ttcttagatc gaaggatggg catacgggac ttattacatt 20161 ttcctctttg gtgcattaaa attttattac ttaaaataat tactctaaaa atgtactcta 20221 aaatgtactt ttaagacatc aatgggttgc cccctgaagt ttggaattct ataagttact 20281 agaatactgg aattctagaa tttactagaa aataaggaaa tcattattgt ctatggctgt 20341 tgtcacactt aattgtgata gcttattttt ttaattgctc ctttgacatc atttttaatt 20401 gacatataat agttgtacat attttggggg tacatgtgat attttgataa ctgcataaat 20461 gtgtaatgat caagtcaggg taattggaat gtccatcatc taaaacattt atcttttctt 20521 tgtactggga acattacaat tctcctcttc tagctatttt gaaatataca ataaattact 20581 gttaacttta attttcctac tatactactg aatactagaa cttattactg ctatctaatt 20641 gtatttttgt actccttggc caacttctcg tcatccctct cttcctcctt gttaatactt 20701 tttgtataat tttgtgtgtg acaaaaccct tcaaagaatg ttactgtgga aaccattctc 20761 taaatttaat aggaaagttt cagaagaccc attcaaagtg gtatatttca actctttacc 20821 cacacccaag ctgagactca agacgtgatt ctgtaacaag cattttggag tagagacgct 20881 tcagaaagaa agcgttaatt aattttattt gacatctttt ggccacaggt ttcttgtcaa 20941 cttagtgaag ctgaagcatg aactcacaga tgaagagctg cagtctttgt ttacaaattt 21001 cgcaaatgta gtggataagt gctgcaaagc agagagtcct gaagtctgct ttaatgaaga 21061 ggtagtttta tttcttttct actgaaattc aactggtttt cctggtcaaa taaaataaaa 21121 cagaaccctg caggacactg cttcctctct gcagtgtcta ccaggactac atttgagagc 21181 acaattgcta cacaactaat atgtagacat ctctgaaata cactttgagc agtgtctcct 21241 tagtacagaa aaagctgtcg gaagacctgg gttgtaaccc cagcagtctt gtaagtcatg 21301 caacttaaac tgaatggcaa ttaactatct ttaaatgagg atggtaataa ttatccctga 21361 attacacctt tggttcttat tttttggttt tgaaccactc attatcttaa 
taagacagag 21421 tcaaagcgac ttgcaaagga aatcataaac agctccaaat ggagcagagc aggaagggta 21481 ttagaaagtg aagtctttca ggagtggctg tctcactttc ccaggccttg tgctaacatc 21541 ctgcagttga gaataatgag ggagtggagg gtaagggtag tcagggagaa aagggagaaa 21601 tgtcccttgc ttgccctcgc tccagtgaag gtagtctgta gccatttgct ctggtttgaa 21661 cctcagcaac ctgtgaatgt tagccccaat ggatggctcc atggaaatgc ctgttcactg 21721 ttttccaact ttatttctca ctgtttcttt tctaggaaca ctcactccac tccaaactca 21781 caattcatgt aaactcacag gcagacaaac tcacacacgt gattcattct ggttgtgagc 21841 catctacttt tctgagtgac acttagtttg tggaaatttt tctggggtac atattactag 21901 tttttaaaaa atatacctaa ttttaagtaa aggatattaa tcaatatcta gcatttatct 21961 gagtgagtgt gtacattagg gccatgtgaa ggaaaggatg aatttttggc agttgtaaag 22021 agcctttcgg tgtctatttt ttcactagaa gaaagaatgt ttactgtttc cctgtacaac 22081 gtgggaatga tttaattttc ttcaaaattc aggaaaacat gcttatctaa atttaaacta 22141 gtgtcttaaa tttttcttcc tatgtgtttt tatttccatc cctcacctca gagtccaaaa 22201 attggcaact gaagccagct gctggagata tgtaaagaaa aaagcaccaa aggtaatacc 22261 ctctgcctca ttcaaatgtc aaatgtattt gtggcttatc tgatctcctt gccttttctc 22321 cctcatgctt ctttgtctat ttctggttca tcctacatga acaagaaatg cacatgtagc 22381 taacagaccc cgatattgtt cctgtgtttt tctgggtatc agccactcct tatggggtgg 22441 caatgctgca tttgtgcttg cccttctctt gctgttcaat cagttttcta gataaccaag 22501 aggcgcttaa gtcctctgcc ttcctactca tgtattcagc tgtctggacc acaaaggagg 22561 gatgatggaa cctctgaatg ttgacagaaa gaagaggggt aactccctgg ggactgaggt 22621 gggcagacag acctcagtga ccctttccaa taatctttta agaatattgc tttactgaga 22681 tatgattcac atacaaacca agggtattgc tacagcattt acgtgaacaa aaagcgatat 22741 gtttgtgatt tcaaaaagat atataaagat gccctcccct ttgtggctta tgtaagacct 22801 gaaacacaaa ctttagcaac aaaatgttgc gtagatatag gagttatata cagtgtcagt 22861 gaagagagac ctggacctca acagaaggta ggttgggatt tacaagggac aaaacacggt 22921 taccactaat agttttttca aatactcgac tttctatcac ccaactttca acatttcatg 22981 tttttttgat gcatgaaata ctatcttctc tttccaggga aggcttccta tctgtgtggt 23041 gatgaatcgc atttcctgag aacaaaataa 
aaggattttt ctgtaactgt cacctgaaat 23101 aatacattgc agcaagcaat aaacacaaca ttttgtaaag ttatttttca gaccgatgtc 23161 ataaatcaac ctaatcacaa aatcaagcaa tcatgattca ttcattccat cagcatcttt 23221 tgctcacatg gctgcaccat tccctatcca tgaaagatgg aagtctgaat agcagctggg 23281 taggcacagt gctctaagat gtccttgaaa ctaccagaaa gaaaactgag aattttgggc 23341 atgaaacttt actgctacaa ctactgctac tactagtaga tgacattgaa tgttttcaat 23401 gtaccaaaca ttgttctagc tagttttcat gcattatttc atttaatcta ataactctat 23461 gggtaggtat tattttcatt tacattttac aaacaggaaa tctaaggctt ggggagataa 23521 actaatgtgt tcagtgtgat atcactagta attggcagaa ccagggtttg aatttaggtt 23581 ctctaattct ggaactatat ataaatgtat ataacatata tattatatat tatgtaataa 23641 taatgttata tattatatat aatactgtat aatatattac attactgtaa catattataa 23701 aatatataat atataatata atatatactt ccagaatttt atatatatta tatattttat 23761 aatttcaact tttattttag atataggggg ttcatgtaca ggtttgttac atgtgtatat 23821 tgcatggtgc tgtggtttgg tgtatggatc tcatcaccca agtagcgagc atagtaccca 23881 gtagttagtt tttcaaccca cacccctctc cctcccaccc ctctgtagta gtccccagta 23941 tctatcgttc ctgtctttat gtccatgtac tcaatgttta gctcccacgg tatttggttt 24001 gaagtttgaa gttgggtaat gtgatgcctc cagctttgtg gaacaatatt gttaaccatt 24061 gcatacacta cctcttaggt atggccatgt tgcatgcctt acaaaatctg ttattcagag 24121 aggtggaatg attgtacaag gcaacaatga cagtggtggt tctcgccctc cattccaagc 24181 cagaattgct gttgtgttag tgatataaat gcttgatatt gatatcaaag gagaagtaac 24241 tcaggaaatg aagactggtt atggactcat ggcctggctt tcctggtggt tctctaaact 24301 ttgaagattt ttgtatgcca atgtttacat ttgaagatgt tggccattta atagataata 24361 catgaggcta ttcatagagt tattttggta gagtgtttag agcatagtct ttgggattct 24421 cattgattta ttcaacaagt atttattaga attc // LOCUS HSU51899 9844 bp DNA PRI 23-OCT-1996 DEFINITION Human kappa-casein gene, complete cds. ACCESSION U51899 NID g1245481 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 9844) AUTHORS Edlund,A., Johansson,T., Leidvik,B. and Hansson,L. TITLE Structure of the human kappa-casein gene JOURNAL Gene 174 (1), 65-69 (1996) MEDLINE 97017129 REFERENCE 2 (bases 1 to 9844) AUTHORS Edlund,A. TITLE Direct Submission JOURNAL Submitted (20-MAR-1996) Anders Edlund, Astra Hassle, Molecular Biology, Tvistevagen 48, Umea S-907 36, Sweden FEATURES Location/Qualifiers source 1..9844 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 330..336 mRNA join(362..425,2572..2633,5576..5608,6755..7250,9018..9182) /product="kappa-casein" exon 362..425 /number=1 intron 426..2571 /number=1 exon 2572..2633 /number=2 CDS join(2580..2633,5576..5608,6755..7216) /codon_start=1 /product="kappa-casein" /db_xref="PID:g1245482" /translation="MKSFLLVVNALALTLPFLAVEVQNQKQPACHENDERPFYQKTAP YVPMYYVPNSYPYYGTNLYQRRPAIAINNPYVPRTYYANPAVVRPHAQIPQRQYLPNS HPPTVVRLPNLHPSFIAIPPKKIQDKIIIPTINTIATVEPTPAPATEPTVDSVVTPEA FSESIITSTPETTTVAVTPPTA" intron 2634..5575 /number=2 exon 5576..5608 /number=3 intron 5609..6754 /number=3 exon 6755..7250 /number=4 intron 7251..9017 /number=4 exon 9018..9182 /number=5 BASE COUNT 3308 a 1750 c 1429 g 3357 t ORIGIN 1 aagctttatg agtgatgact ctacatcctt ctctgcattt cttttttatt ttattttatt 61 ttattttatt ttattttatt tttattatac tttaagtttt agggtacatg tgcacattgt 121 gcaggttagt tacatatgta tacatgtgcc atgctggtgc actgcaccca ctaactcgtc 181 atctagcatt aggtatatct cccaatgcta tccctcccca ctccccccac cccaccacag 241 actgacccaa aggtgaccct gctatcatca tcttattggg taaaaagtag gagggaaata 301 gaatagcgtg actcaaggta ctaataccct ttaattggtc ttcagttatt tttcttgggt 361 cttctctaca gctgacaggc caactcaacc tactgccaac caagacctga ctggcacgag 421 gaaaggtaat gctgctgaaa cacttggaga aagtgatcct tttcacagta gttagttggg 481 acatcaccat agttatttca gaatcacatt ttcttccttt tagttattgt taagtttgaa 541 tatgacctag catcacttta aaattaattt ctaacctaaa cctaagttct ggatggtgtt 601 atgttcaaat ttatttttaa ctttactttg ggttccagtc aaattctgat accaactaaa 661 tcatagcagc cattgtgaat tccgaacaag aaggcgttta acgtattcct 
acagacaaat 721 gttgagagtt aactctacag gaagttgggc tcatgataat aatcgcaatt aaccccttaa 781 ttactttcaa attttatttc tataaaagtc ataattttat ttgttatgga ataattattt 841 ttttaaaaca tgtttttcaa atattcatga aagctggata attctaccat ttcacgaatt 901 atttcttctt accaagtgat gagggcaaat gcaaatgtag ctgatacgca aagtatggtc 961 ttatctctgt gatttttgtt tgtgccaaaa ggaaattact atcattttat agaatatttt 1021 cttttgttta catatcattt tactgtgtga actgataagg ggtctgtgat tcatcataaa 1081 aacattttca gtctataatc ctaaaagatc actagcaagt gaagcattaa taaggagctt 1141 catcccactt aaagaaatat gagtgtgctt agattgttca aattgattcc aaatgagttc 1201 catcaaatgg aaatttgatg acatcgtatg ggcagttttg aagtcacttt agtagacagc 1261 cttcctaatc attattgcct gagaatcaga cgctaatgga gtccagtggg ctttccatcc 1321 cctatctgag gctttaattc tgtgaatctt ttttgcgaag ttcaactttg tagtgattca 1381 agtttatatg agtgtgtcaa taacaatagc tgcaacaaga tgaagttagt aattacttcc 1441 taaacataat atatacttaa atataaaaag tttggggtat catatattat tagtctaaaa 1501 agatgattta ttaaaaatca aataagcaag tatgacactg tatatactct tttagaaaaa 1561 taaggctaat gtatttcttt aaatatttta gtcaaagcca ctgtaattgt catctctata 1621 cttcttatta aaacaatcaa gttaatcata tcagttatga ttactggaga agtatgttat 1681 aaagaaaaca gtctcataac cctccacttc ccattccatc ctacatactt ctgttggatt 1741 gatcatggaa ttatagaatt ttagaaatct ctaagttcaa accctggttt tacaggtaag 1801 aaaatgtaaa gcccaggaaa gggaaatcac ttgatcaaag ttacagaggt agagataagg 1861 cctggtctag aattcagtcc tccaaaaaat gtatagtcat acactttcta gtccaaaaac 1921 actcaattgt tttctgttgt ttataaagcc atgcctagct ttttaaccca tcattatctt 1981 aaaggtgtct actttttctg ctctaaactt tccttccaat cttatatttc ccttctcata 2041 caagattcca atcaaatgtt catggcgatt ctcaagactt tctcctctac ttgaaatatt 2101 tctctccctg ttaaatactg cataggcaag cctcatctat atgtcatttt ttccacacaa 2161 agactgctgc ttctcttaac tgaaactttt attaatagaa aaccactttt gtggcatttt 2221 ctactttttt tccctcttac agttgttttt cagtatatgt ttcaacactc ttcctaaaat 2281 tgatgctact cattggcaag gcctgtatca tatttacatt gataatctca acatttgaca 2341 tagtaatgat aatttgataa ttccgttaat attgtcaaaa attaaattca gaaattttta 2401 
ctccaagttt gagtttttac attcttttgt taaaataaat ttttcacatc ggctaaatct 2461 acctgtaaat ctggcttttt ttcttaatca aggaaacatt gtattcaagt cactcctaaa 2521 tttctagtaa caaattcttt taaattaatt tttttttaaa tttatcttta ggtgcaataa 2581 tgaagagttt tcttctagtt gtcaatgccc tggcattaac cctgcctttt ttggtaagtt 2641 aatttcatct aaccagattg tactgacttt aagaaaattt tgtttaaggg ctttaactat 2701 gcaaacttct aaatacaatt atgcttcata gaacaaaata tccacatagt tgttacattt 2761 ttaaggttaa tttttttaat atttatttct attatgaaat caaagtcatg ttcttttgga 2821 tatccattta gttctgtgaa ttaagaaatc acacttaatc atgtacagga aacttttaaa 2881 attctcatgc tatagagaaa atacattcag gggaaaaaca atactgaaaa cattttaaaa 2941 ttgtaaatgc attaaggata attcagaaag aataaataca gaaatcatag tctgggaatc 3001 agaaattatt ttaattaagt tcctgtctca cccagactcc aatctcttta tggaaacaaa 3061 atccttcaaa ggtcaacttt ggttaagctt tttcacaagt ttcaaagacc ttgatacttc 3121 tgcctttaat atttgaatca ttaacttttg ttatattctt ccccattatg tcaattgact 3181 gacccctttc cagggatcaa tggttataaa cagcttgaag gcttgcaggc attatcttga 3241 attatatgca atgttttaag gctactggac ttagaaaaag gttctgtttt tttcttttaa 3301 ttttttgact tgtttttatt gttgttttgg ctattgcctc tttttatttt cttgttcatc 3361 tgggtattct aggcactgac aaaggaggca agcataaata atggtatatg tcctcctcct 3421 agattgtaaa attttgaagg aacactagtc ctcgttctgt gaatctctgt aatatttatt 3481 tattattaag tttagagttg cttttaagtt aacatctatt aattaattct attacctgca 3541 tgttgtgctt gtgtataaca tgtccattaa ttctattagg tcaattcttg gagtcaaaat 3601 taccttttaa aactttgagt tctacaactc aatactttgt ggtcacctga acatttggaa 3661 cttaggttgt ctctttgagt aatagggctt aaagtttaaa tcatgaaatg atctacaaat 3721 ccctgctgca tagcaacctc ttcctgagct acctgagccg gattcgcttt caaacattcc 3781 ttcccatttt cacaaaggct tcaggcctca gggacatgaa caaatataca gaaaaaaaaa 3841 aatttattga ttttaccatt caatcttatg agtagtcaaa tttatacaaa ttaaaacatc 3901 aatgtgatac catgttatat gtactatttt atcccttccc ccaaaataaa aatagttaga 3961 taatatttta ctagcactta tgcactaaac tgcataaaca catagcctat tggtgacatt 4021 ataatttgac acaacttttc tataccacaa tttcataatt tctatcaaag aatcaaatct 4081 catttgtgaa 
aaatcatttt aaggtattaa cgaaaattat taaaaaggcc actggtatac 4141 agatgttaat catcactctg tttataatta aaattttatc ttaatagata acgataagaa 4201 ataaacattt catgacataa tcattgatca cttagttatc aactataaaa atgtaactat 4261 gaacctatga tataaataaa aaaaagagac accacatttc ctataaaaag caggatactt 4321 tttagacaag catataccag gaatacataa cataggttga tatcgcatta aggtatctag 4381 attgtcagaa ttttttaaag tatactttct acaatattca tattacttct tttttttttt 4441 tttttttgag acagagtctc gcttggtcgc ccaggttgga gtgcagtggc acaatcttgg 4501 cttactgcaa cctctgcttc ccaggtttaa gcgattttcc tgcctcagcc tcccaagtag 4561 ctgggactac aagtgcccgc caccatgccc agctaatttt tgtattttta gtagaggcag 4621 ggtttcacca tgttggccag gctggtcttg aacccctgac ctcagatgat ccacccacct 4681 ctgcctccca aagtgctggg attacagggt gagacatgtg cccggcctca tattacttct 4741 ttacatttta aactaatgat ttaatatatt taaagaaaag caagtctttg tctcaaagca 4801 aggaaacttt attaccttct caattttctg attttctata ctcacacctt ccaaaaggtt 4861 tctagagtca ttttataccc aaggaataca tgtgtaatgt cctaaaacag aactgtcaat 4921 ttaaaaatat ttaatattat acctcctttc aaaagtttct aaaagctttg gagtttttag 4981 ttttataaat atgtgtccat aaaaattgtt ttctcactcc tgattctcta aatatgtcac 5041 aatccaaaac taagaaaagc actataaaaa caaatgaaag atggtattaa aaacactatt 5101 atttatatgt taagtttatg caagaggcaa tattttaaat cttctaagta aaccttatga 5161 atcatcttag tatgagagca gagaaaggaa aaaaggctca atgctcctct aatttttaac 5221 aatctaacaa tgtatattta ataaaaacta aatagaaatt attcatattt ttcatttaag 5281 catatttgtt ggcatgggag tcagatgact agatttttcc cacagtgact ttttcattct 5341 aggtggtgat gtttaaagac aatataaaca tgcagtttat tattactaat cagttagact 5401 actgttgctt cttaagtagt ccataggcct catagatcac tcatgaaaaa tgttttaaaa 5461 aataaaaaga aaacatattt tgtctttaac acaaaagatt tttttaactg atttaagtac 5521 tttttttttc ttttttaact tatttctcat tatttcttct tctggaactc cccaggctgt 5581 ggaggttcaa aaccagaaac aaccagcagt aagtctattt taattacttc tgttacaggc 5641 atgaactaca aaatatattc tgcaaaggtc aattgtatta tagatgcctt ctgtctaaaa 5701 cctaaatatt tcatttccat aaaacaatct aataatattt gcaagaaaat aatttcagca 5761 acaggtgata tgaatcaaca 
ttgatgttca tctgaaagtt ttattttctt atataaccca 5821 aatcatgtat ctttggtttt catgtatatt tagcaatatg agaaaactct ttcaacattt 5881 catattggaa aaaaatgaaa acataagtaa agcttaatat ttcccctttc cactttttaa 5941 gacaagggaa gatccttttc tgacccatct ttgaatactt ctcatacatc tatatttact 6001 tcccctttcg tttcccaaat atgtttgaat acatttaaat gtgtctcgtt ttaggagaaa 6061 tgtttgcttt gtagtataat tttgtcaatt tatttctatt catatggaaa ggtctactga 6121 caaattttta aacagaggca catagatata gacaattctt tatctaatct tttgtgaaga 6181 aaattaaacc aaaaaaaatc caaaatctaa atccaggaat ctctccacat cacattatgt 6241 tactattctg ctatagtttg gaaaaaataa atgtgctctt gggctaattt ttaaatgaat 6301 aaaatttgag accaattaca aatatgtggt gagaatacat acgtgtgtgt gcgccattct 6361 gtgtgtgtct gcatgttgtg cttgtgtata acatgtcaaa gttctcaatg agatttttct 6421 ctgtgttgta cctacaatag ctaacatata agatcacttt ccatatatgc caatctctgg 6481 gcaatatggt ttacattatg aataaactca tctaatttct tgaaacaaat attatattaa 6541 tattatcaaa ataatattta taatattgaa aatcataaag tggaaaggca atgtacaaat 6601 caacatctta taaaataata ttgctgctgg gttccatact tctaatatct tactcaatgg 6661 taaatactat attatttcaa aaaatgcaga tttaaggtat ttccacattt gggtctataa 6721 taataatatt ctgtataatt tatttttttt gcagtgccat gagaatgatg aaagaccatt 6781 ctatcagaaa acagctccat atgtcccaat gtattatgtg ccaaatagct atccttatta 6841 tggaaccaat ttgtaccaac gtagaccagc tatagcaatt aataatccat atgtgcctcg 6901 cacatattat gcaaacccag ctgtagttag gccacatgcc caaattcctc agcggcaata 6961 cctgccaaat agccacccac ccactgtggt acgtctccca aacctgcatc catcatttat 7021 tgccatcccc ccaaagaaaa ttcaggataa aataatcatc cctaccatca ataccattgc 7081 tactgttgaa cctacaccag ctcctgccac tgaaccaacg gtggacagtg tagtcactcc 7141 agaagctttt tcagagtcca tcatcacgag cacccctgag acaaccacag ttgcagttac 7201 tccacctacg gcataaaaac accaaggaaa tatcaaagaa cacaacgcag gtaaattaac 7261 agtatataaa atgagtaatt ccgacaagaa gcatggattt atgaatacaa ccataaattc 7321 taaaatagta caaatagata aactaagtgt gttacagaag cagacaaaac agggtactta 7381 cagttttacc ttggtaaacc catagcattg atacaccaga ttctgttcca actagaaatt 7441 taaaataatt ttatttgaca aagtgaaaat 
aattggcaac ttcattatca aacttttttc 7501 tgacaattgg gacactctta gaatgtaata gttttatttc atccttattc acacacaaac 7561 tatgacagta gagtaaaaca gcaagtaaca ttttcatgat atttcaaaaa taatttttag 7621 aagaagttat atagaatatg agtttgagta aattttaatt ctgtaacaat tctcttgcac 7681 tctcattgta gtatgaacag agtaaaagga aatgatatgc ttccatgcat ttctttattt 7741 cagacatcat ttccaaaatg attcatgaag ttaagcctct caaatttctc ctattctaag 7801 ataaacccat ggaagaatta taatgcttaa cttgaaaaac acataaatat taaaaggata 7861 ctttcagaac acaatgtgat gcaaagtgtt actatttatt acttggacca atggataatg 7921 gcagaatata aaataatttg aaacttatga gaagaattta aacaaagaga atgattaata 7981 aagtggaaaa tatccaaaga gtgtaaaata atttggggag aaagttgcaa aatgtggagt 8041 ttctttacaa caaatatttt cagtccatca gatgtctaca tgttttatga tctaaaatac 8101 cagacaatgg tctgtgatat catgggacta ccattagccc agaaaaggtt gcttctttca 8161 tctgcttgcc taatgaccat aggctagtca ttactttctc tgaactttta ttttctcata 8221 attctggttg ctataagact caagagagaa tgcacatggg aaaggatttt aaaaactgat 8281 caatctgtaa aatgtcatgt attggaaaag ataaagtaaa attcatacca gtatcctaag 8341 tctccactaa atgataaaaa ccgtacataa ttatgtctgt tgattcccat agtaaccata 8401 tgaaacagat attattccta tctcaaattt agggataaaa accagtagga ctgaggacat 8461 taagtaaatt atcacagcta gaaattgcaa atgggaggca aaccaaacct gtttatttca 8521 gatagtgggg tttaattact atgtcattta tttttatatg ggctgtctaa cttgctgaaa 8581 aacagggaaa atacactgtg atttccttag actagcatat gggtaaattg tgttgtttag 8641 ttgctgttca acaggatatc cattaagaaa aagagaacaa gagctggtta aacagctggc 8701 ttattatatt ttggagaaca aaataagaaa attaataaag acaacatgaa agcaagttat 8761 ttaataagta gttacatatt ttactgatgc aaaacataaa gaagcatgta aaagagtttt 8821 ttctgtaaat attacataat gtctattatt ttcagtttac tatatgccta ttttaaatat 8881 ttaagtttaa atatgtaatc tgattgactt attggataca gacatgaaag aaaaattgtg 8941 gaagctaaat aaatatatat catgagaact gttaacatac tatgaataaa tttctaaact 9001 atcattattt tttataggac ttgctgaaac caaattacta cttcacactc tccttcagcc 9061 atttgtctgc cttcagtcaa cagaaaatgt gattttcaca gattcagctc ttctctcctt 9121 acattttaca ttcatgccac attcaatatt ttgattcttg 
cacaataaag ccaactgatt 9181 gcaactgatt ctttgagagg agtttgcaaa ggtacctcag ggatctctga aactttgaaa 9241 cgctatgaca aatttagtgt gagttagagt acacaaattt acctaaaggc agtgctcata 9301 gtttcatcag gttctcaaag tggttcatgg tctgacacta agaatttctc ctctaggatg 9361 acactgggac cagagaaaag ccaagccata gtgttccaat ccaggggacc acagagtgac 9421 tcttgagagc catgggaact ctagattatt cctgagagca aattctgggc acaggcaaac 9481 tacaaatgtc cagaactaca aacacagata cgttgcaaaa ttatagatgt tgacagagac 9541 tctaaacatg catgaccaga tttccccaca ctggactcca cagtctcaca tctgttcaaa 9601 cacacacaca cccccacacc acccacccac ccacacacac acacacacat cacacaccag 9661 aaagcttctg gaattcaggt attccagttt ggaaaaactc gtgaatattc aaggtccaag 9721 gataggacat agttgttttc ttaaacaaat attgcaagga aaatcaggtc acctttgtgc 9781 ttggtccata tttaacacat catgccagaa aaaataaaat tcactcagcc cttacaaaat 9841 gttt // LOCUS HSU52427 6051 bp DNA PRI 16-JAN-1998 DEFINITION Human RNA polymerase II seventh subunit (rpb-7) gene, complete cds. ACCESSION U52427 NID g1924973 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6051) AUTHORS Schoen,T.J., Chandrasekharappa,S.C., Guru,S.C., Mazuruk,K., Chader,G.J. and Rodriguez,I.R. TITLE Human gene for the RNA polymerase II seventh subunit (hsRPB7): structure, expression and chromosomal localization JOURNAL Biochim. Biophys. Acta 1353 (1), 39-49 (1997) MEDLINE 97398138 REFERENCE 2 (bases 1 to 6051) AUTHORS Schoen,T.J., Chandrasekharappa,S.C., Guru,S.C., Mazuruk,K. and Rodriguez,I.R. 
TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Retinal Cell and Molecular Biology, National Eye Institutes, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892 FEATURES Location/Qualifiers source 1..6051 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13.1" misc_feature 1..831 /note="5' flanking region" repeat_region 238..312 /rpt_unit=(GATA)n mRNA join(832..949,1083..1192,2158..2317,4475..4525,4633..4698, 4946..5017,5536..5569,5787..6003) /gene="rpb-7" /product="RNA polymerase II seventh subunit" gene 832..6003 /gene="rpb-7" exon 832..949 /gene="rpb-7" /number=1 CDS join(938..949,1083..1192,2158..2317,4475..4525,4633..4698, 4946..5017,5536..5569,5787..5800) /gene="rpb-7" /note="seventh largest subunit" /codon_start=1 /product="RNA polymerase II seventh subunit" /db_xref="PID:g1924974" /translation="MFYHISLEHEILLHPRYFGPNLLNTVKQKLFTEVEGTCTGKYGF VIAVTTIDNIGAGVIQPGRGFVLYPVKYKAIVFRPFKGEVVDAVVTQVNKVGLFTEIG PMSCFISRHSIPSEMEFDPNSNPPCYKTMDEDIVIQQDDEIRLKIVGTRVDKNDIFAI GSLMDDYLGLVS" intron 950..1082 /gene="rpb-7" /number=1 exon 1083..1192 /gene="rpb-7" /number=2 intron 1193..2157 /gene="rpb-7" /number=2 exon 2158..2317 /gene="rpb-7" /number=3 intron 2318..4474 /gene="rpb-7" /number=3 exon 4475..4525 /gene="rpb-7" /number=4 intron 4526..4632 /gene="rpb-7" /number=4 exon 4633..4698 /gene="rpb-7" /number=5 intron 4699..4945 /gene="rpb-7" /number=5 exon 4946..5017 /gene="rpb-7" /number=6 intron 5018..5535 /gene="rpb-7" /number=6 exon 5536..5569 /gene="rpb-7" /number=7 intron 5570..5786 /gene="rpb-7" /number=7 exon 5787..6003 /gene="rpb-7" /number=8 misc_feature 6004..6051 /note="3' flanking region" BASE COUNT 1360 a 1472 c 1565 g 1653 t 1 others ORIGIN 1 gcccctttgg caaaggtcgt aaacattttt tttttatttc cacaccaatt tctgtagatt 61 attatgagaa ttaagggaga gaattatatc aagcgcttag cacagtgtct ggcatctagg 121 aagaaagtag taagtgatag catttattta tttatttatt ttatttattg agatggagtt 181 tcgctcttgt tgcccaggca tctaggaaga aagcagtaag tggtagcatt tatcaatcta 
241 tctatctatc tatctatcta tctattatct atctatctat ctatctatct atctatctat 301 ctatctatct atctgttgag atggagtttc gctcttgttg cccaggctgg agttcagtgg 361 caagatctcg ggccgcctcc caggttcaag caattcagat agttgccgac cgggcgcggt 421 ggctcactcc tgtaatccca acactttggg aggccgaggt gggtggatca cctgaggttg 481 ggagtttgag accagcctga ccaacatgga gaaaccccat ctctactaaa aatacaaaat 541 tagtcaggtg tggtggcaca tgcctttaat cccagctact cgggaggatg aggcaggaga 601 atcgcttgaa cccgggaggc ggaggttgca gtgagccgag atcgcgccac tgcactccag 661 cctgggcaac agagggagac tccgcctcaa aaaaaaaagt tgtcgtttga tggctggggg 721 gcgggggtga ggggcggggc ttatggttct cgcccacgac ggcgaggccc aatcagaccc 781 gcggagctgg tggagttccg ccctcgcgga ggaggtggtg ttccgccctc ctagtcgcac 841 caagcgcgga actggggttg cggcgtctaa gtgtttccgg tggattccca gggactgtcg 901 gaggtgtgga ctctgcctgc ctacctggtc tgggaagatg ttctaccatg tgagcagggc 961 tcagggtggc ggcaagggct gggtggaaag gaagagcaag gccaggcgcc ggcgcgagac 1021 tccccttgtc gtccaataac ctgccagcgc ctggctggtc ggcatcccac ttttctccgc 1081 agatctccct agagcacgaa atcctgctgc acccgcgcta cttcggcccc aacttgctca 1141 acacggtgaa gcagaagctc ttcaccgagg tggaggggac ctgcacaggg aagtgagtgt 1201 cgagctcacc gcaccgccag atcgttcgat gcgcacacgg ggcgctatac ctgcaacccc 1261 tccccaactc ctactcgtcc tgccacccac tctggggacc cctctcctct ctcagggagc 1321 cagttgcttc ctgctttggg ttcccagagc gccctctgca atgctggaca ataagacttc 1381 cttctctcag ctgagatggg tcattcattg actccctcaa acattttaaa aacccttggt 1441 gtgtgtgcca ggacccgtgt tgggctcaga acaaggtaga atgggaacaa gatagaatga 1501 ggacagcccc tgtggagctc acagtgtaat cattgttcct gtgtctgctc tccctctgtc 1561 attctgctgg ttttctctta tcccctgtgc ccagggacca cacaccatgg gcctctatat 1621 atcttctgaa tgaatgacca gaaaaagcag atcgagagtt tgggggagga aggaagtagc 1681 ctgggatttg accatctctg ggctttgcac agagggggct ctctgatatt aatgagctgg 1741 gacccagctc agggcagtgg agtacttggc cctccagcat cagaaaaacc tgcaggaaca 1801 tgcagattcc tgagcctcct tattccctac agaatcaaga ctgtctgggg cagacctgag 1861 aatctgcatt ttaagaagat gattctcagg catcccgaag attgagatct gttgcctcaa 1921 ggactccagc 
cttcttggac atttattgtg aaattaccag aacagcattt ggtcttacac 1981 atactgatct ctgttctgtc ttcttttggt cattaagctc tctgggagca gaggtgctat 2041 ctacttgtcc ttttgctctc cccgacccta cccccaagag ctcagcgact tttattgcag 2101 atgtctttag aggctcttgg acacacagtg tcttcctgtt tctcatcctc cttgcaggta 2161 tggctttgta attgctgtca ccaccattga caatattggt gctggtgtga tccagccagg 2221 ccgaggcttt gtcctttatc cagttaagta caaggccatt gttttccggc catttaaagg 2281 ggaggtcgtg gatgctgttg tcactcaggt caacaaggtg agaccataca taggggaggc 2341 agtggggtag tctctcggaa gatctgggtt gggttctggc tccacttcat aactctgtgc 2401 ctttggctaa gtcacttcac tttttttttt tttttttttt ttttgagaaa gttttgctct 2461 gttgcccagg ctggagtgca gtggtgcgat cttggccact gccacctctg cctcctaggt 2521 tcaagtgatt ctcctgcctc ggcctcccaa gtagttggga ctaaaggcac gtgccaccat 2581 ggccggctat tttttttttt tttggagacg gagtctcgct ttgtcaccca ggctggtgtg 2641 cagtgggcga gatctccact cactgcaagc tccgcctccc gggttcacac cattctcctg 2701 cctcagcctc ccaagtagct gggtctacag gtgcccacca ccacgcccgg ctaatttttt 2761 gtatttttag tacagacggg gtttcactgt gttagccagg atggtctcga tctcctgacc 2821 tcgtgatcca cccgcctcgg cctcccaaag tgctgggatt gcaggcttga gccaccgcgc 2881 ccggcctgca ctttcaaatt atatggtaat cttgccctgc ctaactcaga gggcaagaga 2941 gtcagtatta atagtcataa caataaaagc ctgtgttgat tgagtagtta ctatgggcta 3001 ggtgatgtgc agagactcac aatgactcta aaaggtgggt tttttttttt ttttgagacg 3061 gagtctagct ctgttgccag gctggagtgc agtggcgcaa tctcggctcg ctgcaacctc 3121 cgcctctggg ttcaagcgat tcccctgcct cagcctccca actagctggg actataggca 3181 tgcgccacca tgcccagcta atttttgtat ttttagtaga gatggggttt ctccatgttg 3241 gtcaggctgg tctcgaactc ccgacctcag gtgatctgcc tggctcagcc acccaaagtg 3301 ctgggattat aggcgtgagc caccgtgccc ggccagtggg tgctttttac agaagctgaa 3361 gcttggaaca aattaagtgt actggccaag atcatgtggc caacaggtgt tcaaatcttg 3421 gactaacacg tcccaggcct aagctcttta atactttaaa atgaaagctt aggctgggcg 3481 cggtgactca catctgtaat cccagcactt tgggaggctg aggtgggcag atcacctgag 3541 ggtgggagtt caagaccagc ctggccaata tggtgaaacc ccatctctac taaaaataca 3601 aaaattagcc aggcattgtg 
gcatgtgcgt gtagtccctg ctactcggga ggctgaggca 3661 tgagaatcac ttgaacctgg gaggtggagg ttgcagtgag ctgagattgc atcactgcac 3721 tccagagtgg agaacagagt gagattctgt ctcaaagtaa atatgtaaat aaatacaaat 3781 gaaagcttaa aaaaaaagag aaaagggctg ggtgcgctgg ctcacaccta taatcccagt 3841 gctttggggg gcagaagtgg gaagattgct tgagcctagg agtttgagac agcctgggca 3901 acatatcaag actctgtctc tacaaaaagt ttttaaaaat tagcctagca tggtggtgtg 3961 cacctgtggt ttcagcaact tgggaggctg aggtgggagg atcgcttgag cccaggagtt 4021 caaatttaca gtgagctgtg atcgcaccac tactttccaa cctgggcaac agagtgagac 4081 cctgtctcaa aatttaaaaa gcaaaaaaca aaaacacaac acttgagggg acaaagaaaa 4141 ggcaagatca ctcttttgtt tttttttttt gagacagagt cttgctctgt tgccaggctg 4201 aagtgcaatg gtgcaatctc ggctcactgc aacctccacc tcctgggttc aaacgattct 4261 cctgcctcag tctcctgagt agctgggatt agaggtgcac gccaccatgc ccggctaatt 4321 tttgtatttt tagtagagac ggggtttcac catgttggtc aggctggtct gaaactcctg 4381 acctcgtgat ccaccggcct cggcctccca aagtgctggg attacaggcg tgagccaccg 4441 tgcacggccg taagatcact catttttttt ctaggttgga ctcttcacag aaattgggcc 4501 catgtcttgc ttcatctctc gacatgtaag tctgggcaca ctgggtgggg ctgaggaaga 4561 cagatactca ttcatcacca tctattgaag catccattta aagtgctaac ctagatctca 4621 ctttcctttc agtccatccc ttcagagatg gagtttgatc ctaactccaa cccaccatgt 4681 tacaagacaa tggatgaggt gagtggacag aaagaagtca gggagacaag gagagaatca 4741 gccatcctga gggagaatat tggcctgggg atggctgagt acagatggtc cccgaattac 4801 catggcttga ttcatgattt tatgacttta cgatggtgtg aaagtgatgt gcattcaata 4861 tgcccccggg cttacagtga agtgattgtg atgtaacttc tcataaactg aggagcatct 4921 gtacactgtt ccctcctctc ctcaggatat tgtgattcag caggacgatg agatccgctt 4981 aaagattgtg gggacccgtg tggacaagaa tgacattgtg atcttttccc cacctctcta 5041 cctataccag gttgagctga gaactgtcga tttctttttt cttttttttt ttttttggag 5101 acggagtctt gctctgtcac ccaggctgga gtgcagaggc gcgatctcgg ctcactgcaa 5161 gctctgcctc ccgggttcat gacattctcc tgcctcagcc tcccgagtag ctgggactac 5221 aggcggccgc caccacacct ggctaatttt ttatatttgt tagwacagac agggtttcac 5281 catgttagcc aggatggtct cgatctcctg 
acctaatgat ccacccgact cgccctccca 5341 aagtgctggg attacaggtg tgagccaccg cgcctggccg agagctgtcg atttcttacc 5401 acccacagtt tcagggggtt ctgtgctctt ctcccaggaa gaagggatgg atgggcagaa 5461 gccctgggcc tccagggtgg aggttggttc tgtgctcatc ttggcttcac tttcttttta 5521 ccatctttct tgcagtttgc tattggctcc ctgatggacg attacttggg tgagtgcctg 5581 atcataggtg ctggggttat tgcctggaga agggatgtgt gggggtgggg agtaatatag 5641 gattcaatgc ccaaatcaga gagacagaag aaactttcat gctgtctgct tgaaagatcc 5701 aggacatttg ccttgggatg aggagtacat ggttgtggct accctaaatt ccggttctaa 5761 ctgatatgct ttttctggtt tcgcagggct tgtaagctga gcctggtggc ctcctaccct 5821 tggtcctact ctaggaagtg tgattgtcac acttatcatg ttgtccagag gtccagtctg 5881 gctgctgttg tggaggcaag gaaggcaact catcccagaa ggcatctggt gcttcttgta 5941 gcttaactac tgcctcctca tttttcagta tgtgttctaa gtataaaaag tcctttggtt 6001 ctcatggaag tgttatcttc tttttttttt cagaggagga acaatatctc t // LOCUS HSU53874 3270 bp DNA PRI 03-JUN-1997 DEFINITION Human blue cone pigment gene, complete cds. ACCESSION U53874 NID g2138075 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3270) AUTHORS Shimmin,L.C., Mai,P. and Li,W.H. TITLE Sequences and evolution of human and squirrel monkey blue opsin genes JOURNAL J. Mol. Evol. 44 (4), 378-382 (1997) MEDLINE 97250263 REFERENCE 2 (bases 1 to 3270) AUTHORS Shimmin,L.C. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) L.C. 
Shimmin, Human Genetics Center, School of Public Health, University of Texas Houston Health Science Center, 6901 Bertner, Room S250, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..3270 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /chromosome="7" /map="7q31-35" mRNA join(<11..362,647..815,1137..1302,1910..2149,3139..>3258) CDS join(11..362,647..815,1137..1302,1910..2149,3139..3258) /note="G-coupled-protein receptor family; photovisual pigment" /codon_start=1 /product="blue cone pigment" /db_xref="PID:g2138076" /translation="MRKMSEEEFYLFKNISSVGPWDGPQYHIAPVWAFYLQAAFMGTV FLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFSVFPVFVASCNGYFVF GRHVCALEGFLGTVAGLVTGWSLAFLAFERYIVICKPFGNFRFSSKHALTVVLATWTI GIGVSIPPFFGWSRFIPEGLQCSCGPDWYTVGTKYRSESYTWFLFIFCFIVPLSLICF SYTQLLRALKAVAAQQQESATTQKAEREVSRMVVVMVGSFCVCYVPYAAFAMYMVNNR NHGLDLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIMKMVCGKAMTDESDTCSSQK TEVSTVSSTQVGPN" exon <11..362 /number=1 exon 647..815 /number=2 exon 1137..1302 /number=3 repeat_region 1541..1812 /rpt_family="Alu" /rpt_type=dispersed exon 1910..2149 /number=4 repeat_region 2322..2497 /rpt_family="MER13" /rpt_type=dispersed exon 3139..>3258 /number=5 BASE COUNT 864 a 791 c 807 g 808 t ORIGIN 1 tggggcatcc atgagaaaaa tgtcggagga agagttttat ctgttcaaaa atatctcttc 61 agtggggccg tgggatgggc ctcagtacca cattgcccct gtctgggcct tctacctcca 121 ggcagctttc atgggcactg tcttccttat agggttccca ctcaatgcca tggtgctggt 181 ggccacactg cgctacaaaa agttgcggca gcccctcaac tacattctgg tcaacgtgtc 241 cttcggaggc ttcctcctct gcatcttctc tgtcttccct gtcttcgtcg ccagctgtaa 301 cggatacttc gtcttcggtc gccatgtttg tgctttggag ggcttcctgg gcactgtagc 361 aggtactgca ggggaaaaag gggttagggg aaggcaaagg ttgctactcc gctggagggg 421 gttcctaaag aggagtctgg gggaaatgag tctggtgctt tttaaaatac tggggtacaa 481 agcaacccag actagaagtt tggctaaaat aggatgtttg agtcttcact ccaaatgtca 541 gtccagccct gtctctctgt gcttcctccc acccgatctg tttgccactc tgccagccag 601 gctggtgggg ctctgtctag cccattattc tcacatttca ccacaggtct ggttacagga 661 tggtcactgg 
ccttcctggc ctttgagcgc tacattgtca tctgtaagcc cttcggcaac 721 ttccgcttca gctccaagca tgcactgacg gtggtcctgg ctacctggac cattggtatt 781 ggcgtctcca tcccaccctt ctttggctgg agccggtgag agtgcagggc agtggtgctg 841 agttaactag gagctcaggt tgatgtgggt ggaaagagag cttgggtata actatttagt 901 ctttgacctc tacttttaaa gagttgcaat atgaggggaa gggcagtggg agacaagtgc 961 taacgtttac tctgcagttg gaattgctgt agcttctccc agtcaggaca gaaaaccccc 1021 ctgcttgaag ccttagggca ttccgtgggt tctaagtgga gaacacaatc caggcatctc 1081 agctcccacc gcactcttgt ggagagtcca gtgagcaagt gtttggtcct ttgcaggttc 1141 atccctgagg gcctgcagtg ttcctgtggc cctgactggt acaccgtggg caccaaatac 1201 cgcagcgagt cctatacgtg gttcctcttc atcttctgct tcattgtgcc tctctccctc 1261 atctgcttct cctacactca gctgctgagg gccctgaaag ctgtgagtgg catttgatag 1321 tcagggaaga aggggttcgg ggctccacat gagaaggaag agtgctctga aacataagat 1381 gcctggaaat gtccatagcc agagagggta tctaaaagca gcaaaggaag taggaggagg 1441 gagaatgatg gagatccaaa ggaacaggcc aggagatggg acagaaaaga ggcaatcaga 1501 gtggatgccc cctcccccat cccacagaaa agcatccaga gaccgggcgc agtggctcac 1561 gcctataatc ccagcacttt gggaggccga agcagacgga tcacctgagg tcaatagttc 1621 cagaccagcc tggcctacat ggcaaaatgc taaaatgcga aaattagctg ggcatggtgg 1681 tgtgtgcttg taaccccagc tactcaggag gctgagacaa aagaatcact tgaacccggg 1741 aggtggaggt tcagtgagcc gagactgcac cactgcaact ccagcctggg caacagagcg 1801 agactcggtc tcaaaaaaaa aattaaaaat taaaaattaa aaaaaaaaaa aaaagcatcc 1861 agagggacag gaaaaagaga gatgtgatgc tttccgtgct ccaccccagg ttgcagctca 1921 gcagcaggag tcagctacga cccagaaggc tgaacgggag gtgagccgca tggtggttgt 1981 gatggtagga tccttctgtg tctgctacgt gccctacgcg gccttcgcca tgtacatggt 2041 caacaaccgt aaccatgggc tggacttacg gcttgtcacc attccttcat tcttctccaa 2101 gagtgcttgc atctacaatc ccatcatcta ctgcttcatg aataagcagg taaagctctt 2161 tattcacatt cctatggtcc agaagaccct ggttcttttc tcaccattga cttttaactc 2221 agagcaccct ggactctacc caggtttcta gtagacgagg gaagccacaa aacccccgag 2281 taggttggga agcctttggt aagcacaggg aggaaggcac ggttatcaag acgagaaaat 2341 agaaccccgg aggaaagaac 
ttgagtcagg aaaatgaagt tgctccaaag aacaggatga 2401 atgaaagcat tttattgaaa aactcgtgca gcaaaccacc atggcacacg tttacctatg 2461 taacaaacct gcacatcctg cacatgtatc ctggaactta aataaaatta aaaaaataaa 2521 aataaaaact cagattcctc tcaattttca gtccttgcat ttaataattt cttaatcatt 2581 tcccttccaa cttttagcct gcacgagcat gtgtgaagca cagaaatcat accacatgca 2641 aaaatctcta aaatatctta tcatctgaag gtactggggg atttcctatc ccatctgaaa 2701 tccgagctaa taaacaccaa accctaagtg gcaaaaaccc tactttcaga tggtattgtt 2761 tcctcaatcc cagaggtaga ctcaaaacta atttgaaacc tccctggata gaagagaatt 2821 ggcagtcctt tccagctggg agcacctgct agtaatggag gggcctctgc tgacagtgct 2881 tttatgaagc aggatggttt gtgaatttac caacagtgag gtctcagact tgaccagttt 2941 aggattaccg tagacccagg agtagttcta gactggaatc tagatagttt tcaggatggg 3001 gaagatagat tcaaaaccac ctaagggcat tctgggtaca aagcattgtg caaggctttg 3061 gtgatacaga gaataaggtc ttttttccca tacttcctca tctgccaagg ttatctccaa 3121 ttgtaccttt ctctccagtt ccaagcttgc atcatgaaga tggtgtgtgg gaaggccatg 3181 acagatgaat ccgacacatg cagctcccag aaaacagaag tttctactgt ctcgtctacc 3241 caagttggcc ccaactgagg acccaatatt // LOCUS HSU56438 12177 bp DNA PRI 08-NOV-1996 DEFINITION Human dioxin-inducible cytochrome P450 (CYP1B1) gene, complete cds. ACCESSION U56438 NID g1663555 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12177) AUTHORS Sutter,T.R., Tang,Y.M., Hayes,C.L., Wo,Y.Y., Jabs,E.W., Li,X., Yin,H., Cody,C.W. and Greenlee,W.F. TITLE Complete cDNA sequence of a human dioxin-inducible mRNA identifies a new gene subfamily of cytochrome P450 that maps to chromosome 2 JOURNAL J. Biol. Chem. 269 (18), 13092-13099 (1994) MEDLINE 94230403 REFERENCE 2 (bases 1 to 12177) AUTHORS Tang,Y.M., Wo,Y.Y.P., Stewart,J., Hawkins,A.L., Griffin,C.A., Sutter,T.R. and Greenlee,W.F. TITLE Isolation and characterization of the human cytochrome P450 CYP1B1 gene JOURNAL J. Biol. Chem. 
271 (45), 28324-28330 (1996) MEDLINE 97067052 REFERENCE 3 (bases 1 to 12177) AUTHORS Greenlee,W.F. TITLE Direct Submission JOURNAL Submitted (24-APR-1996) Pharmacology & Mol. Toxicology, U. Mass. Med. Center, 55 Lake Ave North, Worcester, MA 01655, USA FEATURES Location/Qualifiers source 1..12177 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p21-22" mRNA join(3070..3414,3805..4848,7881..11587) /gene="CYP1B1" /product="dioxin-inducible cytochrome P450" gene join(3806..4848,7881..8469) /gene="CYP1B1" CDS join(3806..4848,7881..8469) /gene="CYP1B1" /codon_start=1 /product="dioxin-inducible cytochrome P450" /db_xref="PID:g1663556" /translation="MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRR RQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERA IHQALVQQGSAFADRPAFASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQP RSRQVLEGHVLSEARELVALLVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDD PEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKF LRHCESLRPGAAPRDMMDAFILSAEKKAAGDSHGGGARLDLENVPATITDIFGASQDT LSTALQWLLLLFTRYPDVQTRVQAELDQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFS SFVPVTIPHATTANTSVLGYHIPKDTVVFVNQWSVNHDPLKWPNPENFDPARFLDKDG LINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRANPNEPAKMNFSYG LTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ" BASE COUNT 3238 a 2803 c 2832 g 3304 t ORIGIN 1 gagctcaatt aaccctcact aaagggagtc gactcgatct aaaaactaga tggcattatt 61 gaggagttgg acaggcggtc ttaggaacaa cctcgtttta tactttttaa aaataggaat 121 ttacttaaag gcaacttcca tctgctctca ttctggcaga ggtattgtga taaaggcttt 181 caggaatgtc aacagctcag gattggcatt agctcataga taattcccgt ctactaaatt 241 aaagtctatg agagattgat gcatgcacca gagtcttgaa ttattaggca aggatcttgc 301 taaagctgga catgggaagg ttgctatgaa actcatttca aaggctgtgt aggccaactc 361 acctctggga aaccaacgct tattattaaa ccaatgcaac tgctcaaaaa ctaaaactcc 421 atcactatct aagttcccca tcatggtgtg tgaaatcatt tccagaaaac tcttcaaaag 481 ataccgaata aaaggcatca caggccgggc acggtggctc atgcctgtaa tcccagcact 541 ttgggaggcc caggctggtg gatcacctga ggtcaggagt ttgagaccag cctggccagc 601 atggtgaaac cccatctcta 
ctagaaaaat acaaaaatta gccgggtgtg gtggtgcgca 661 cctgtaatcc cagctacttg ggaggctgag ccaggagaac ctggcaggcg gaggttgcag 721 tgagccgaga tcacgccatt gcactgcagc ccaggtgaca gagtgagact ccatctcaaa 781 aaaaataaat aaataaaaat aaaaataaaa aaagccatca cgaaaagaca tttctttatc 841 aggtcagcaa attgattgct ggtttcaatc tctcaagaag tcagattctc ttcagacact 901 cctttgtgtt tctttacaat gtaatcagtc attccagggt cagctatgag gcacactccc 961 ctggcctccc tcataagcaa tttaattgca aagggttagc atactatttc ttctctgcta 1021 ttctcagaat tctcaattat gaaaacaaaa acaataatta aaaagttctt ccagctgatt 1081 tttttttttt tttttgagac agagtctcac ttaaactctg ctgcccaggc tgtagtgcag 1141 tggtgcaatc tctgctcact gcaacctctg cctcccacgt tcaaccaatt ctgctgcctt 1201 cccccctgat ccccgagtag ctgggattac aggtgtgcac caccacgtcc ggctaacttt 1261 tgtattttta gtagatgggg tttcgccatg ttggccaggc tggtctcgaa ctcctgacct 1321 caagtgatcc gcccgcttaa gcctcccaaa gtgctgggat tacaggcgtg agccactgcg 1381 cctagcccca cgtgcatttt ttttttttta agggaggcag gaaagaaggc atttgggcct 1441 cttatctgca attttcttcc gtgtaaactt atgtgaagga tctggagtgg gacttggtgg 1501 cttcaagccc tggctccact tcttgatggc tgcgcgacct gggttaagtc acgcaacctc 1561 tctgaaccct agtttattca tctgtaaaca ggtcaataat agcacgagat tgcagcgcga 1621 gtggagctca aagtgcaggg ttgtccctgg tgaacctatc acttttatat ttatcctttg 1681 atgaagccag tacaattcct acctggttaa ccagatacat cccacctctt ccctcgagtt 1741 cgcccttccc cccgcctcgt gaagtccttg ttctcttagc tgtcttgaaa atcctatgca 1801 tcagcatgta ggaaagggcg cgccaggcgg gggaagccac ccccgcccga gcgcctccgg 1861 ttcccttata aagggagggc ccccttcgcg accgcaagcg cgcccaggaa gaccacagag 1921 ccgccggtgc gcacggaggt ggcgatacgc gcccgggctc ggcctgcggg tggtggccca 1981 agcgtccgcc tcgctggcct ggcaggcgcg actgtgcgtg cgcagccgag ggtggtggcg 2041 gccggcaccc cacgccaagg gtggtggtgg ccggcacccc accctcggcc gccgcctccg 2101 cgtgtcaggt gccgtgagaa gcgcgggagg agcggccgca ggcagcgccc agggatatga 2161 ctggagccga ctttccagaa gcggcgcacg caaagcccag ctccgcacgc aaaggggagg 2221 cgacacgaga aacttcaacc cgataaagtt cgccggagcg cggagattcg cctcctcctg 2281 ccactctccg ccccgctcgg gtcccgcccc 
gctagctccc ccaggccccc ccagtcgccc 2341 cagcttggct ccccgccctg cgccaacggc ttccatcgca gcctgggcgg ccccgcgccc 2401 accagcgggc ggcgccacct ggagtggcct ctacgcggga aatctcaggg ccagctgcgc 2461 cccaggagcc tttgtgtgcc caagcactgt cggggccccg gggcggggga gcggctactt 2521 ttagggattc ctgatctcgc cgcaagaact ggaaaaaatt tagcatgcca aagagcctcc 2581 actgaggtgg caatttgttt gcgagaacct aagataaaat ttaaacaacc aaccaggggc 2641 gctgtgaggc aaaccgctgc cactacactg gctttccggg aagcaagctc aagccgcgga 2701 gagggaaggg aggtcgtgcg ctcggggcgg ggcgcgctcc caagtcgagc gcagcggccg 2761 gggcaggttg taccgagcgt ggttctgggg acaccgtgcg gccttgattg gaggtggctg 2821 tgatgaagcg cggttaccgc acaatggaaa cgtgggcacc tccgctccca tgaaagcctg 2881 ctggtagagc tccgaggccg gccggtgcgc ctggacggga gtccgggtca aagcggcctg 2941 gtgtgcggcg cgccccgccc cccgcaggcc ccgccctgcc aggtcgcgct gccctccttc 3001 tacccagtcc ttaaaacccg gaggagcggg atggcgcgct ttgactctgg agtgggagtg 3061 ggagcgagcg cttctgcgac tccagttgtg agagccgcaa gggcatggga attgacgcca 3121 ctcaccgacc cccagtctca atctcaacgc tgtgaggaaa cctcgacttt gccaggtccc 3181 caagggcagc ggggctcggc gagcgaggca cccttctccg tccccatccc aatccaagcg 3241 ctcctggcac tgacgacgcc aagagactcg agtgggagtt aaagcttcca gtgagggcag 3301 caggtgtcca ggccgggcct gcgggttcct gttgacgtct tgccctaggc aaaggtccca 3361 gttccttctc ggagccggct gtcccgcgcc actggaaacc gcacctcccc gcaggtcagt 3421 ctgtctgccg aggcgctgcc cggcgacctc ttcagatgga ttattacagg tagcgggtgg 3481 cgtggtaggt actttaaagg aaatcaagcg ccaccgcctc gatgcccgca gcgttgtccc 3541 caaattgcag gaaccgttac gcgccttgcg gggaggggaa gggtttggcg ctgggttaca 3601 gcgaggtgga aacacgcccc ttctcttctc caagggagag tgggttgggg atgggaaggg 3661 gcgtcttcgg ccatttctcc agagagtcag ctccgacctc tccacccaac ggcactcagt 3721 ccccagaggc tggggtaggg gcgtggggcg cccgctcctg tctctgcacc cctgagtgtc 3781 acgccttctc ctttctgtcc ccagcatggg caccagcctc agcccgaacg acccttggcc 3841 gctaaacccg ctgtccatcc agcagaccac gctcctgcta ctcctgtcgg tgctggccac 3901 tgtgcatgtg ggccagcggc tgctgaggca acggaggcgg cagctccggt ccgcgccccc 3961 gggcccgttt gcgtggccac tgatcggaaa cgcggcggcg 
gtgggccagg cggctcacct 4021 ctcgttcgct cgcctggcgc ggcgctacgg cgacgttttc cagatccgcc tgggcagctg 4081 ccccatagtg gtgctgaatg gcgagcgcgc catccaccag gccctggtgc agcagggctc 4141 ggccttcgcc gaccggccgg ccttcgcctc cttccgtgtg gtgtccggcg gccgcagcat 4201 ggctttcggc cactactcgg agcactggaa ggtgcagcgg cgcgcagccc acagcatgat 4261 gcgcaacttc ttcacgcgcc agccgcgcag ccgccaagtc ctcgagggcc acgtgctgag 4321 cgaggcgcgc gagctggtgg cgctgctggt gcgcggcagc gcggacggcg ccttcctcga 4381 cccgaggccg ctgaccgtcg tggccgtggc caacgtcatg agtgccgtgt gtttcggctg 4441 ccgctacagc cacgacgacc ccgagttccg tgagctgctc agccacaacg aagagttcgg 4501 gcgcacggtg ggcgcgggca gcctggtgga cgtgatgccc tggctgcagt acttccccaa 4561 cccggtgcgc accgttttcc gcgaattcga gcagctcaac cgcaacttca gcaacttcat 4621 cctggacaag ttcttgaggc actgcgaaag ccttcggccc ggggccgccc cccgcgacat 4681 gatggacgcc tttatcctct ctgcggaaaa gaaggcggcc ggggactcgc acggtggtgg 4741 cgcgcggctg gatttggaga acgtaccggc cactatcact gacatcttcg gcgccagcca 4801 ggacaccctg tccaccgcgc tgcagtggct gctcctcctc ttcaccaggt aaagcctctg 4861 ggaggcgtgg gccaggtctt ttctcctctg aaaaaggcgg agtagagaca gaatatgctg 4921 agtttgcaag cagggccccg ggtttggggt ttcgctccag gtccccaccc ctcaaaacca 4981 agatcgcgtc ggtaaaggga ctcacagtga gggctgcgac acgcacgcgc cccacccagc 5041 ggtgccccga cccctccggt ctcctatctt gtctctatcg tcccctcccc tgcttgcgag 5101 tgagaacaca tttgcaaaga cccctccacc ccccgggaaa aacaagagtt tttaaatgct 5161 tggagatgag ccctgatatc tctctccctg gcgcattaca atcagaactg gaatagttcc 5221 gaaagaaaag gtaatgtcat aaatatgtta aacacagcag cctctcctag gctagtcctc 5281 ggcgtgcatc cgaggccgcc cagccctggc gctaaaagcg ggccgcccgt cagggctttg 5341 ttccaggcca aggaagccca tggaggccgg gccagccgac aggtaacccg cacagaaact 5401 ttcagaaggc ggccacaact agcgggcagc gctaggttta taaaacctcc gcgctaggag 5461 tttgagaaat gccggggtag aagacaagaa gcagtcactt ttacgaaagc agagtagcat 5521 tcagaaaggc agatgggata tccaggaggc gcctgcagac gtttctggcc cctgcgcttg 5581 gctcagttag cggaccccct gatgcccacg ttggtctcta ctaagcacgg attcaacagg 5641 tccctggtgt cgggttgcca gatttggcaa aagaaaagta agttttacac 
gggaatactc 5701 acactaaaag attagccctt gttgatctga aatccatatt taactgggcg gcctgtagta 5761 tttactgtgg aaacactatc cctaggggca aatgtttcgc aaggcaaatt ttgattgccg 5821 aagagaccag aaatcctggt tttgtgtcat ttcttggagc acaagtgagc agttggagat 5881 gctgaatctg caggcgccac agaaaggtgt ttggaaggca gagaatgact ctttccttat 5941 taaaatccac tgcaatctat atttccttag atactgtaca gctaccttca caaattaaaa 6001 gtttctgtat acttaaaatg gctttttagt attaaaatca tagaaacacc catggtgggt 6061 gagggagagg cagaaatcga ataaagaaaa gtcaaccaga gatcagggaa aaggaaatcc 6121 cggaataggt tccataggtt tttgtgcatc cgcagatagg cattttaact tttgaaacgg 6181 cctttgtttt tcattagaac cacaatagtt ctccgagtac tccaattagg cggcaaaaag 6241 aaataaaaca atttggaggt cactttcaat tcaatatgct gatacttttt tttccttttt 6301 agtgaaaaga cgtatgacag ggcttggcaa attacacgat ctgtttttgt atatacgttt 6361 tattgacgca gtcatgccca ttcatttata cattgtcgat ggctgcgttc actctacaac 6421 acaaggcaga gcagagtagt gcaacaaaga gggtttggtc cacaaagtct aaaatagata 6481 ctctctggct ctccccacca tacatgtttc tatgaaaaaa gttggcagac tcctgcaata 6541 taatgttgaa gaagcgattc taaaaataac tccacttcat cacatacgtg tatacatttt 6601 ttttcggagt tgcaaacaat cagttacttg ttttttgact tctaacactt tgactcaagg 6661 tagtggacat ttctaatctt ttaatattta ttttggtaat ttttgatggc tatatagtat 6721 tttgctatga taatgtatag ttatatataa tcatttattg aactttaatg ttggtcattg 6781 gccttgattg catttttaaa atttttattt taattttatg tatttactta ccttagagac 6841 agggtctcac taccttaccc aggctggtcc caaactcctg ggctcaagca gttctcccgc 6901 cgctgtctgg caggtagctg gggctacagg cgtgtaccac catgcctggc taatttttag 6961 aattttaggt cctgttgtgg ttacatattt ttgtatctct taaattattt tctgaaaata 7021 catttctagg catggattac tgggtttaaa atgaggcggg aagaggttac cttcatgtct 7081 cttgcctcgt attaagattt tgttttttaa aaagattgta tcagtttgta tcatcaacag 7141 tgaataagta ctacagtttg taccataatt ttatgaacat tgggtatttg ctttcaaatt 7201 taaaaaatac agtatgtagc tttatccatc aggacaccaa ttatctttgt ataaaatgag 7261 aacagcatgt ctgttggaat tgtccaggga aatgagggag aaaaaaaatt tactttcaca 7321 ttgtaacttt cgtgggccct gggtgctttt gcctttgtag attccttata ctataaaaaa 
7381 attaaaaatt aaatttcatg actaccctga tataaagatg aatgcattaa aatgatgtat 7441 gaaaatattt tcttctacct aaaagtgagt ttttttaggt ctgaagattg taaaagacgt 7501 aaaaatattt tcatgggccc ctaaaaagtg ttgtgagccc taggcactgt cctgcggtac 7561 ccaatggaaa agtcagcctt atctactatt gtgctttttg aggctgggaa aacttaagag 7621 ttttttgact tataatggga aagacagcat tagtcatgca aggcctatta caggaaatat 7681 aattcttaaa gtccatcttg taatttagtg agaaattagg aagctgtttt agattttttt 7741 cccagaaata ttaatttagt cactgagcta gatagcctat ttaagaaaaa gtggaattaa 7801 aataaattat aatgtgcttt ctagatgaaa taagaatttt gctcacttgc ttttctctct 7861 ccacattaaa caccaaacag gtatcctgat gtgcagactc gagtgcaggc agaattggat 7921 caggtcgtgg ggagggaccg tctgccttgt atgggtgacc agcccaacct gccctatgtc 7981 ctggccttcc tttatgaagc catgcgcttc tccagctttg tgcctgtcac tattcctcat 8041 gccaccactg ccaacacctc tgtcttgggc taccacattc ccaaggacac tgtggttttt 8101 gtcaaccagt ggtctgtgaa tcatgaccca ctgaagtggc ctaacccgga gaactttgat 8161 ccagctcgat tcttggacaa ggacggcctc atcaacaagg acctgaccag cagagtgatg 8221 attttttcag tgggcaaaag gcggtgcatt ggcgaagaac tttctaagat gcagcttttt 8281 ctcttcatct ccatcctggc tcaccagtgc gatttcaggg ccaacccaaa tgagcctgcg 8341 aaaatgaatt tcagttatgg tctaaccatt aaacccaagt catttaaagt caatgtcact 8401 ctcagagagt ccatggagct ccttgatagt gctgtccaaa atttacaagc caaggaaact 8461 tgccaataag aagcaagagg caagctgaaa ttttagaaat attcacatct tcggagatga 8521 ggagtaaaat tcagtttttt tccagttcct cttttgtgct gcttctcaat tagcgtttaa 8581 ggtgagcata aatcaactgt ccatcaggtg aggtgtgctc catacccagc ggttcttcat 8641 gagtagtggg ctatgcagga gcttctggga gatttttttg agtcaaagac ttaaagggcc 8701 caatgaatta ttatatacat actgcatctt ggttatttct gaaggtagca ttctttggag 8761 ttaaaatgca catatagaca catacaccca aacacttaca ccaaactact gaatgaagaa 8821 gtattttggt aaccaggcca tttttggtgg gaatccaaga ttggtctccc atatgcagaa 8881 atagacaaaa agtatattaa acaaagtttc agagtatatt gttgaagaga cagagacaag 8941 taatttcagt gtaaagtgtg tgattgaagg tgataaggga aaagataaag accagaaatt 9001 cccttttcac cttttcagga aaataactta gactctagta tttatgggtg gatttatcct 9061 
tttgccttct ggtatacttc cttactttta aggataaatc ataaagtcag ttgctcaaaa 9121 agaaatcaat agttgaatta gtgagtatag tggggttcca tgatttatca tgaattttaa 9181 agtatgcatt attaaattgt aaaactccaa ggtgatgttg tacctctttt gcttgccaaa 9241 gtacagaatt tgaattatca gcaaagaaaa aaaaaaaagc cagccaagct ttaaattatg 9301 tgaccataat gtactgattt cagtaagtct cataggttaa aaaaaaaagt caccaaatag 9361 tgtgaaatat attacttaac tgtccgtaag cagtatatta gtattatctt gttcaggaaa 9421 aggttgaata atatatgcct tgtgtaatat tgaaaattga aaagtacaac taacgcaacc 9481 aagtgtgcta aaaatgagct tgattaaatc aaccacctat ttttgacatg gaaatgaagc 9541 agggtttctt ttcttcactc aaattttggc gaatctcaaa attagatcct aagatgtgtt 9601 cttattttta taacatcttt attgaaattc tatttataat acagaatctt gttttgaaaa 9661 taacctaatt aatatattaa aattccaaat tcatggcatg cttaaatttt aactaaattt 9721 taaagccatt ctgattattg agttccagtt gaagttagtg gaaatctgaa cattctcctg 9781 tggaaggcag agaaatctaa gctgtgtctg cccaatgaat aatggaaaat gccatgaatt 9841 acctggatgt tctttttacg aggtgacaag agttggggac agaactccca ttacaactga 9901 ccaagtttct cttctagatg attttttgaa agttaacatt aatgcctgct ttttggaaag 9961 tcagaatcag aagatagtct tggaagctgt ttggaaaaga cagtggagat gaggtcagtt 10021 gtgtttttta agatggcaat tactttggta gctgggaaag cataaagctc aaatgaaatg 10081 tatgcattca catttagaaa agtgaattga agtttcaagt tttaaagttc attgcaatta 10141 aacttccaaa gaaagttcta cagtgtccta agtgctaagt gcttattaca ttttattaag 10201 ctttttggaa tctttgtacc aaaattttaa aaaagggagt ttttgatagt tgtgtgtatg 10261 tgtgtgtggg gtggggggat ggtaagagaa aagagagaaa cactgaaaag aaggaaagat 10321 ggttaaacat tttcccactc attctgaatt aattaatttg gagcacaaaa ttcaaagcat 10381 ggacatttag aagaaagatg tttggcgtag cagagttaaa tctcaaatag gctattaaaa 10441 aagtctacaa catagcagat ctgttttgtg gtttggaata ttaaaaaact tcatgtaatt 10501 ttattttaaa atttcatagc tgtacttctt gaatataaaa aatcatgcca gtatttttaa 10561 aggcattaga gtcaactaca caaagcaggc ttgcccagta catttaaatt ttttggcact 10621 tgccattcca aaatattatg ccccaccaag gctgagacag tgaatttggg ctgctgtagc 10681 ctattttttt agattgagaa atgtgtagct gcaaaaataa tcatgaacca atctggatgc 
10741 ctcattatgt caaccaggtc cagatgtgct ataatctgtt tttacgtatg taggcccagt 10801 cgtcatcaga tgcttgcggc aaaagaaagc tgtgtttata tggaagaaag taaggtgctt 10861 ggagtttacc tggcttattt aatatgctta taacctagtt aaagaaagga aaagaaaaca 10921 aaaaacgaat gaaaataact gaatttggag gctggagtaa tcagattact gctttaatca 10981 gaaaccctca ttgtgtttct accggagaga gaatgtattt gctgacaacc attaaagtca 11041 gaagttttac tccaggttat tgcaataaag tataatgttt attaaatgct tcatttgtat 11101 gtcaaagctt tgactctata agcaaattgc ttttttccaa aacaaaaaga tgtctcaggt 11161 ttgttttgtg aattttctaa aagctttcat gtcccagaac ttagccttta cctgtgaagt 11221 gttactacag ccttaatatt ttcctagtag atctatatta gatcaaatag ttgcatagca 11281 gtatatgtta atttgtgtgt ttttagctgt gacacaactg tgtgattaaa aggtatactt 11341 tagtagacat ttataactca aggatacctt cttatttaat cttttcttat ttttgtactt 11401 tatcatgaat gcttttagtg tgtgcataat agctacagtg catagttgta gacaaagtac 11461 attctgggga aacaacattt atatgtagcc tttactgttt gatataccaa attaaaaaaa 11521 aattgtatct cattacttat actgggacac cattaccaaa ataataaaaa tcactttcat 11581 aatcttgtta tgaagtttta tgttgtagca aatatagtct aatgtatcac tttatttcaa 11641 ctattactat ggattattca attaaaatat tgcactgtaa acattttatt tgtccattct 11701 acatttcagg ccgaggccac atgctttttt ttcccgtagg ttttcatcaa ccaaatttct 11761 aatctctata tttttgaatt tctaagaacc aacacggagg ttacatttca atcaaacaat 11821 caagccaaat gagaattaat ttgcaatgtt gaaagattga atagattgcg actttaagtt 11881 gctgatcgat ttctgcttca tctaaagaat gcagtacaca tttttcttcc tcagcctcat 11941 accagtcctt tctctgagta ttttcttctt tcattttttt aaaggtacat gtgcctcaat 12001 tagtgcaatt aatgtccaat aaaataagat gggtacaagt attctgtcat gtactatgtg 12061 gaaatgtttt attgccaatc tttttacttg gtaaaaatac ttgacttcac agggcctagg 12121 tcatgactct ccaggccttc actgtgccac actcacagat gacagggctt gcggtac // LOCUS HSU57623 9170 bp DNA PRI 16-JUN-1996 DEFINITION Human fatty acid binding protein FABP gene, complete cds. ACCESSION U57623 NID g1377853 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9170) AUTHORS Peeters,R.A., Veerkamp,J.H., Geurts van Kessel,A., Kanda,T. and Ono,T. TITLE Cloning of the cDNA encoding human skeletal-muscle fatty-acid-binding protein, its peptide sequence and chromosomal localization JOURNAL Biochem. J. 276 (Pt 1), 203-207 (1991) MEDLINE 91248148 REFERENCE 2 (bases 1 to 9170) AUTHORS Phelan,C., Morgan,K., Baird,S., Korneluk,R., Narod,S. and Pollak,M. TITLE MDG1 paper JOURNAL Genomics (1996) In press REFERENCE 3 (bases 1 to 9170) AUTHORS Baird,S. TITLE Direct Submission JOURNAL Submitted (03-MAY-1996) Stephen D. Baird, C.H.E.O, Genetics, 401 Smyth Rd., Ottawa, Ont., Canada, K1H 8L1 FEATURES Location/Qualifiers source 1..9170 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1248..1351 /number=1 CDS join(1279..1351,4738..4910,6800..6901,8339..8392) /codon_start=1 /product="fatty acid binding protein FABP" /db_xref="PID:g1377854" /translation="MVDAFLGTWKLVDSKNFDDYMKSLGVGFATRQVASMTKPTTIIE KNGDILTLKTHSTFKNTEISFKLGVEFDETTADDRKVKSIVTLDGGKLVHLQKWDGQE TTLVRELIDGKLILTLTHGTAVCTRTYEKEA" intron 1352..4737 /number=1 exon 4738..4910 /number=2 intron 4911..6799 /number=2 exon 6800..6901 /number=3 intron 6902..8338 /number=3 exon 8339..8622 /number=4 BASE COUNT 2313 a 2265 c 2351 g 2241 t ORIGIN 1 atcccctgag ccccggagtt tgaggctgca gtgagctatg atggcgtcac agtactccag 61 cctgggagac acagcgagag actttgtctc taaaaaataa taataaaata aaaagttcaa 121 tgaaacaata cacccaaagc cctcagcatg caataaatag caagacaagg caggtcttat 181 ttttactgaa agtgcttagt aaactataca gtgacaaacc accgcacaac aggctctcga 241 aaggaggcag caaattaccc aaaagtgcag gcggcttgct agtgtgcaca ggccaaagaa 301 agggcggcag gtggggaagg cagccatggg ccttgaagag ctgaccgaat tggcagaatt 361 tctgcaggag gggagctggg aacgacctga gctaaagctc ggagctgtgc gaagaaaccg 421 gaaaagccca gagcacttgc aggggcgggt ggggagctag atggtggggt ggggtgggga 481 cggaggaggg ccagcaggag acattccgca 
gggaggggca agcacgtgtg aggcgggcgg 541 ggcgcgaagg gtcaggcttt tgctcaaaac aggcagagga caaggtcagc tcagccgcag 601 accgagccgc tggtgactgt ctccgccacc aggcagtgag agtgaaggga gagcgcgacc 661 tctgaagccc gctagactaa gcttgcaatc tgagctccat tcaccccctc ctatttcttg 721 agaccttgtc agttcccctg tgagcctcgg actcacttgt aaaacgagga cagatgcccg 781 tgccagaagt caaccagagc tttccccggc gtgggcacca gcccaagggc gttttgtttt 841 ctagtctcat ctctgctctg acgctaagct caaagaggga ctgggggacg ggaagatatc 901 caccatggat gcgccctaga tctcgggctg gtgtcggctg ttccttctca gattccagag 961 tgtctagagg ccaggaaagg gagaaggtcc taccagcctg gggtagggac tcgggggcca 1021 ggcactggcg ctgacgcagg ctagcagggc gccactggct ggtccccacc cacctcggtg 1081 ggttggggga tgggcgcacc agcccctcct gggtgagccc tagcctgggg cttcctattt 1141 cgggagccgg gggcgtgggc cacgtctcct catgtgatgc gagggctatt taaagcggca 1201 gcccgggcag ggagccgccg tcggagccct tgcacgcctg ctctcttgta gcttctctca 1261 gcctagccca gcatcactat ggtggacgct ttcctgggca cctggaagct agtggacagc 1321 aagaatttcg atgactacat gaagtcactc ggtgagcaag ccgcggggct caggatgttg 1381 gcttggggac tggctggtgg cgtgcctagc cccacgcagc actcctgccg catccctcct 1441 ggttaagact ggggaatagg ggagcgcgga gatggcagcc tggcctagag caggtggggc 1501 ctgttcagag ggggctttgg tggtccaaat ctggttagag accacggtag ggaggtggtg 1561 gaaggaggca gctgtgtggg aggctctttc caggaagagg gatatgtgat ttggaggtag 1621 gaggagggtt tggataaaga acactgatca caggaaaggg agtgtagcca ggggagaaaa 1681 agaacagggg catgggtagt ttagaaattg gaggagactg aacccagaaa gggaatgggg 1741 cagccaggga gtgtacaatg atgtaaacaa gtaggaaata cctaggagga aaaagattag 1801 tggggaaaaa actgtggatc agtgaatcag atatgagaag gacgtaagac aggaacctgc 1861 agtaagcagc aatccccatc tctgcttggt tagggaagag aattcttgct ggagaatgcc 1921 ctttctcacc agccagtctg accttgtcct gcagtctatg tatccaggcc ttcatcactg 1981 tctgtgagcc tcgtggtagg gtggggcaag aggcccatga tcagctgggc ctttcctgca 2041 acccaaggct cacctatctg tgcgaggggt aggcagagaa agccattgga cttctgatgt 2101 gcagtagagg gtcccaaggc aaggtcaaga cctgggaggg aggatcactg gtttaggagg 2161 atgtggagaa ctcctgtggt gttgggatgg agaagaatca 
ggattcaaag aatctcacag 2221 gtgaggaact tggagattcc cataccatct agttcaacag ggaaactgaa accaggagag 2281 tagaaatgta ttataacaat tccacagcag agccaatatg aaaatctaag gtttctagat 2341 ctgtaaccca gagctcttcc cactacccta caggccctgc gagtgggaag aaaagtagaa 2401 actgcttagc taatgattga cctcagccct tcttctactg ctttgggctt agatggagag 2461 gtcaaagctc tcaacggcct ctaccctatc ttgggcgcta tgcccagtaa ttctaggcag 2521 gcagtcattc ttagaggagc agcccccagc ccccacgaac acagcccagc agctattggg 2581 aagttggaat gcccagattt agttcctcct tccaaagctg ggccagagct gagtcttgaa 2641 ttgagctgca acaactttac cattcttgtt cccttattct gccccgagtt gggtcagcgg 2701 gctggtctcc ctgaagtcct gttatctttc agcagcttat gttaaggcag ccagcattct 2761 catcgtagga atggaaagcc tgggaaaata ccctcctcag ctctcagtaa gtagtgctgg 2821 cttcatttct aagtagaacc cagatctccc tgagtctcct aaattctgtc agctcaatat 2881 tcttagtttc tcttggttca gaccctcact catcccgcag tggtttcctt ttcaaacact 2941 ccatacctct gggtagatcc taagtgaaca gagctcccag tgccgtgaca aggtcctgct 3001 ctgtgcaagg gagtgtgatt ggcctgactc atcctgatac caaggggcaa tgccaagttc 3061 ctcactggcc aagcaagggt gggctgacag cataacagca gaggcagccc ctgcccctcc 3121 tgctgtagac ctagggctct caaggggcaa agaggtcccg tctagtacca gtgaccacag 3181 gcacaactgc tggcctggat tgagtatgtg ctggacagaa tcgcccagtg aaaatagtca 3241 acagttttgg agccgaggtt caaatctatg tcagtagttt attctctttg aatttcgaca 3301 agacacttcg cactcttcat tgtaaactgg ggataatcta cgcttcgagg ctgttacaag 3361 cattaagtaa aacaacccat gtagggcatg tgcagagtac ctagcttcca gcaagcacta 3421 tgtagccagg tacatttgga gactttacac acaccacctc actacactgg gctgcctcct 3481 gcctcacctt tgccttggaa gacagttcaa tgttaagctg ctggggggag agggggcagt 3541 catgattagt tctttgttct ttacttggtt gcaggacact taggactttg cccagtaccc 3601 aaggaagcca tgcttgggtc aggaagagag tctctgtaaa gccttagact gggagtcagg 3661 agacgggttt gagtctaact cattgttgct accgctttag gctcctcctg aatctgcaca 3721 ataggacaaa tacttccttt gtacctaact cctagatcat agataacagg ctttgaaaat 3781 gatgggttgc catgtataag ggacaagagc actaacactt cttagtttca gggtaaaaac 3841 ttccaaagtt ggaaaactcc tatgcctaag gctttggaag ggaaagtcta 
tgtttctctt 3901 ctttcctcag ccttattcct aaggctttga gagcttttca ggtgccctgg aaggcagcct 3961 tatgctccag ccttgggagg tagtatagct gagcacttaa gcaagctctg gactcagaca 4021 attctgggct tcaatctcag atttgtgacc ctgggcttta cctctgtttt tgtatctgta 4081 acgtggaaac agtcttcaga agaacaggaa gaactaaatg agataacatg tacagttctt 4141 actacacaaa aagctcatag tacttaatag tagctttttt tttttttttt tgagatggaa 4201 tctcactctg ttggctaggc tggagtgcag tggcacaatc tcgactcact gcaacctcca 4261 cctcctaggt tcaagcaatt ctcagcctca gcctcctgag tagctgggat tacaggcaca 4321 taccaccaca tctggctaat tttttgtatt tttagtagag acgggtttca ccatattggc 4381 caggctggtc ttaaactcct ggcctcatgt gatccgcctg ccttggcctc ccaaagtgtg 4441 attacaggcg tgagccacca cacttggccc aatagtagct tattctaatc ccagctctgc 4501 cactgacttg ctatggcact gctgttcctt aagtatctct catctaatgg gatcagttat 4561 ctgtgttcac caaacagaac taagcgcaag actgaatttt aaaattccca tgcaaaggct 4621 ttgaaagata cagtcctcca cttccccata cccaggcctg agagttattc attgagtttc 4681 ttgtacactg cttctctacc ccagctcata tactcataac cttcccccta ccctcaggtg 4741 tgggttttgc taccaggcag gtggccagca tgaccaagcc taccacaatc atcgaaaaga 4801 atggggacat tctcacccta aaaacacaca gcaccttcaa gaacacagag atcagcttta 4861 agttgggggt ggagttcgat gagacaacag cagatgacag gaaggtcaag gtaagtcagg 4921 gaaacagggg tggggaatgg agagtgctga gactctaaaa gagaataggc tggtagtctt 4981 ggctccctgg tattgcaccc tgaggggcag actatcatgg ggaatttaca tgaaacaaga 5041 ttcataaagc ctgtgtagtg ctggaatgcc actgatgcta aatacatgtc agttctgtcc 5101 tcttgttttc ttccctccct tcttgggatt catctattgt ctgcctcgga atgggcagca 5161 cagagccagg atgttcttct gacctcagta tctactccag ctccagctgg gtgaccctgt 5221 gcaaggtatg cagtagctct aggtttcttt ccccttccat agatggagag ttatgtggcc 5281 atggctgtga cctgaagtgc tttaggaatg atgcccagaa gtcagggccc tccactgagt 5341 gaggtcattg tgacctccag cagcaaaaaa ggcagccagg aactagaagc acctactcag 5401 atgccgcttc aacttctaac tcccagacat ggccaatgac cctgacaaac tatttccagt 5461 gttgccagct gacaggcagg aaagagctat gttccgtgat agggcattca ccttgtcatg 5521 aatgtgtttg cagtgtctcc caccaagcct tagcccctcc tcccagggtt ctatcaccct 
5581 gcagtggctg tcttggcagc ttgcctcagc cttccaggcc aggcatggga gcgagagaac 5641 ttaagggctt tgacctctat agggtgtccc tatagcagtg ttctatcatg acactatcat 5701 tcagccccat cagctgtttc ctcttcctca tagctgtccc cagaaagaac aggatcacac 5761 aggtggctgg cagcagagct ggggatggtg cccaaagatg gcagtctacc ttggataaag 5821 gtggctgccc caccacctgc tcatacctcc ttggacttgc ctactttctc aaggggcaag 5881 aaccccaatt aaacacaata gccctgtgga atgcctaggg caaaaatatc tactctgagt 5941 aggcaaaaaa aactagggga atgagaacaa ggagtaaggt aaggataaaa aagagcacac 6001 taagagacag gcctcatacc ccttatcacc taaacaatac acagaacctt ctcagattct 6061 cctactgaac caccttgctc atcaggatcc cttagcctgg ccttgtggcc cccaaactcc 6121 taggaaagag agctggaaga gctgccaaat gagaaccagc tgatgtatgt atgctggcag 6181 cacccagagc tgaggaacca cttcaagggc atccagtcac aggactttgt ggttgctgcc 6241 ctcttgttgg ctaaagaggt cacatgatgt ggaccaagaa aaggtgtagg aatacagggc 6301 aggaagtcta attatccaat acttcctatc actaagggtc ttttagacat tatgtggact 6361 aaccacaagg ctggataaag attctcagga ctactcctcc tcctcagtca gtctttccca 6421 gggatagact agtaaatccc acctgtatct gaggggacca ggctacggga atcacctaga 6481 gtacagataa gtgtctgtct tgaaggcttg tggtacttct cagagccagg ctctctggct 6541 ccaccatact gcctgcctct ccctccttgc ctaatatctg aaggcctctt ccccagaaag 6601 gcagtagtgg agcagaggct ggaggtgaac tagatgtctt gcagggatag ctgggaggcg 6661 gattgcctga gctcttgtcc tcacaccatc actagtttgg gtcaaaggct gtgtcctctg 6721 tggcccagtg tccagacccc accctgcccc tcaattcctg actaagatca cagctcaggc 6781 ctctaccctc tttccacagt ccattgtgac actggatgga gggaaacttg ttcacctgca 6841 gaaatgggac gggcaagaga ccacacttgt gcgggagcta attgatggaa aactcatcct 6901 ggtaagatgg gcaactttgg agctatatct gattggttat tactactgct ctttcagcca 6961 agcctgttct aaaaagccaa gtcctcccct gagagctgta gaagctggga caagagagtg 7021 gttgtgggtc agggtggtat caggtgggaa tttttctgtg tagtggcttt ggactcacac 7081 aggccggaac tcaaatctta ccttataggc tacatgactg tgggcaaatc accttttcca 7141 agtgcaactg taaaacgggt attaataata ccaaccttgt agggctgctg ggaagcctgt 7201 aagagacagt gtatgcacag cacaaagcat cactgattga ggaacacagc aggtgctcca 7261 
tgtcctttgt ttgctcttcc tgtgtttcta ccttgcctca cctcaggaag aagtagaaaa 7321 cagggccaaa tctgatccca ggccctctag gaggggctcc cattgcctat ctcagcattc 7381 cctttcctct cctccctagg actgcattgt cacttgcagg gacaggctcg tgactggtgg 7441 ggacactgaa tgacagtaca gtcctttctt ccccattcta gtcctacccc attttcatgc 7501 tttctatgtc tggcctactg aaactacttg actactgctt gggtaggaag taccacagcc 7561 aggctggcag atctgttcaa gcttggggac ttcacttgga gaatctagcc ttgactgaat 7621 tccccccaga cccagggaga gcagccaact gtggattctg cctaaccaca gggcctcagg 7681 ttttcaccta ggcatcttca ctgcacacct tcttgggtca gcataacctg ttaactgcat 7741 tcttgtactc atgtgggaca ggggtcccct tgaagtttgg aatgaggtgc ctagctttgg 7801 tggggatgtg atatgcagga ccaaattctc agtggcagct gaactatggt gaggccatgg 7861 gtctggctct atgatgccag accggatagt gggaggtaca gggctctggc cctggcacta 7921 ctctaagtta gggaaggatt ggagttagta cccaaacaca gtcctttcct gagtctctgg 7981 atatttttcc tatttgtcaa ctatatgcca ggcaccatct tagacactaa ggatgaagaa 8041 gccaaatggt ataagggaag gaaaaacact caggtcttga ccaaattact tcctctctaa 8101 aggctcgttt tttccaaatc tctaaaataa gaattacaat gcctgtctta aggatttgct 8161 gtgcatatca gaaaaaaaaa attatgtatg tatacacaca cacacacaca cacacacaca 8221 tacatacttg ccggcactgg taggtctcag tgacaattat caggaggaag ggagggtaga 8281 atgctcgcaa tggtgttcct ggctcccacc ccccatctca ctctgtcttt ccttccagac 8341 actcacccac ggcactgcag tttgcactcg cacttacgag aaagaggcat gacctgactg 8401 cactgttgct gactactact ctgccaatcg gctacccctc gactcagcac cacattgcct 8461 catttcttcc tctgcatttt gtacaaatcc acgaattctt ctggggtcag gtgccactga 8521 ccgggatcca gttccagttc ccatggtgta tgtggttttt tttttttttt tttaactgca 8581 ctcatagggt gctctgaggt caataaagca gagccaaggc cacccagttg ccttttggcc 8641 tttggtaaca taactctggg agtcttggtt tatcctgtgt gtcagagagt gggcagaaat 8701 aacggcctga aggttactga ggaagaagca ctggatggga gactgaaatg gacagtctcg 8761 gagcctgtta atcagctgat caccttacac atttaataat aaaagagctg tacctacacg 8821 ttgcctttac actgcccccc ctccatggtc aaatgaccta gttcagtcag tgatggggct 8881 tccccaggtt tggctattga actgtcactt caggcccatc ctacactgaa agctcttggg 8941 tctggctgtt 
ctctgtgaaa tgctgtagtc tctccctttc cagaattcag gttcagggca 9001 cagaacccag gcttgtacca tggtggtggg agaaaatgac cactggccaa gaggactgct 9061 gacctgtgca ccaggctagt acttatgact acaaattctt actgcttctc taatcaactc 9121 tgagggaaga gggcatctga tcattacaaa agggagggct tataagtgat // LOCUS HSU60477 5500 bp DNA PRI 02-OCT-1996 DEFINITION Human apolipoprotein AI regulatory protein-1/chicken ovalbumin upstream promoter transcription factor II (TFCOUP2) gene, complete cds. ACCESSION U60477 NID g1575342 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5500) AUTHORS Speckmayer,R.W.M., Paulweber,B. and Sandhofer,F. TITLE Human ARP-1/COUP-TF II JOURNAL Unpublished REFERENCE 2 (bases 1 to 5500) AUTHORS Speckmayer,R.W.M., Paulweber,B. and Sandhofer,F. TITLE Direct Submission JOURNAL Submitted (11-JUN-1996) 1.Dep.of Int.Med., General Hospital, Muellner Hauptstrasse 48, Salzburg A-5020, Austria FEATURES Location/Qualifiers source 1..5500 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" mRNA join(<1..442,1974..2501,5226..>5500) /gene="TFCOUP2" gene 1..5500 /gene="TFCOUP2" CDS join(1..442,1974..2501,5226..5500) /gene="TFCOUP2" /note="nuclear factor; also called chicken ovalbumin upstream promoter transcription factor II, COUP-TFII; gene also called ARP1" /codon_start=1 /product="apoliprotein AI regulatory protein-1" /db_xref="PID:g1575343" /translation="MAMVVSTWRDPQDEVPGSQGSQASQAPPVPGPPPGAPHTPQTPG QGGPASTPAQTAAGGQGGPGGPGSDKQQQQQHIECVVCGDKSSGKHYGQFTCEGCKSF FKRSVRRNLSYTCRANRNCPIDQHHRNQCQYCRLKKCLKVGMRREAVQRGRMPPTQPT HGQFALTNGDPLNCHSYLSGYISLLLRAEPYPTSRFGSQCMQPNNIMGIENICELAAR MLFSAVEWARNIPFFPDLQITDQVALLRLTWSELFVLNAAQCSMPLHVAPLLAAAGLH ASPMSADRVVAFMDHIRIFQEQVEKLKALHVDSAEYSCLKAIVLFTSDACGLSDVAHV ESLQEKSQCALEEYVRSQYPNQPTRFGKLLLRLPSLRTVSSSVIEQLFFVRLVGKTPI ETLIRDMLLSGSSFNWPYMAIQ" BASE COUNT 1214 a 1528 c 1417 g 1341 t ORIGIN 1 atggcaatgg tagtcagcac gtggcgcgac 
ccccaggacg aggtgcccgg ctcacagggc 61 agccaggcct cgcaggcgcc gcccgtgccc ggcccgccgc ccggcgcccc gcacacgcca 121 cagacgcccg gccaaggggg cccagccagc acgccagccc agacggcggc cggtggccag 181 ggcggccctg gcggcccggg tagcgacaag cagcagcagc agcaacacat cgagtgcgtg 241 gtgtgcggag acaagtcgag cggcaagcac tacggccagt tcacgtgcga gggctgcaag 301 agcttcttca agcgcagcgt gcggaggaac ctgagctaca cgtgccgcgc caaccggaac 361 tgtcccatcg accagcacca tcgcaaccag tgccagtact gccgcctcaa aaagtgcctc 421 aaagtgggca tgagacggga aggtatcggc ctctcatttc tccttccctc gtcctgggtc 481 cggggtcctg ggtacgtttg gctagcctgc tctgggtaag gacaagaagc cccaagctct 541 tctcttcgta ttgcagcgga aaagggttta tactagaagc gagttctgca ttggaaccca 601 gaccccaaat ccgcatgctt tggccgactg atttccttct ttactctctc tttgggctgt 661 ttccatttcc tttgcattga ttgattgtga gttcactgga gtctgccttt ctgcaaggga 721 tggggtgttt gttgttgttg ttttaaagcc tagtttactt ctctctctct gcccttgttt 781 ttcctgcatg ttcaacatgt ccctccccct cacccctttc cccagccccc accctctcaa 841 aaaaaaaaaa aaaaaaaacc tgagattgta cttttgtaca ggaggttcaa attacaaatg 901 gcaattttat gcacttcgcc gtattaacgc tgccgcccgg gcagcctcat gtgaccctcc 961 gtagattaac atcctgctaa aaaaaaaaaa tacctctgct ttcttttttc ctttcacttt 1021 ttgaaacgaa gagagcgcga taggaagtag gaaagggtgg gcgaggggcc ctgggcggct 1081 gctttcgctc tgcgcgagtt gggtctttgt gatataaaat tcgccgagcc ccgcgagccg 1141 tgctttgcca atggcgcgct cgtgcggggg cgctcctccg ggttggggcg agccaacgcc 1201 ggggtttctt tgtgtttctg cgagagcaac tctcccggtc cggagtcaga taacagcctg 1261 ggcccgagcc tcgccggctt tccccggccc ttacaggccc tgcccaggct ccgctagtgc 1321 cggccggcct gctccctgcc tctcccggct tcctctctct ttaagccggc ctctctctct 1381 ccgccctctc cctccgtctc tttctccgag cacactgatt agacagacgc tagacctccg 1441 ctctctgctt gtctctcact gggggggttc cccgccgggc tggggctggg gcttcggggt 1501 ttgtgggaga atcgttccgg agtggccaca ggccgtctgg ggtggaccct cgtgcctttt 1561 gcaaaagcgc ctcaccctcc ccccagactc gcccctcccg ctccctctcc tccaatcaat 1621 aagaaatatc agctgtttag cagtaaagaa gaaagatgcc ctcagaatgc tacatcccgc 1681 ccacagcgcc ggggaccccg aggcaaggtg gccaattctg ggtcctcggc 
ggaccagccc 1741 cgagcgggcc tcggaggcaa gtgtgcccct ctggccctca gagctcgcct gggtggtggt 1801 ttgaaaggaa tggtgccaag agcctggcga ctccagctct gtggcaggcc tgcctggttc 1861 ttgcgctgct cagggcctgg gtcaggtggg ggtcgggctg gggcagaccc gccggggaac 1921 ctggggactg aggctggtca ttaactgtgg agtgtctctt ttccctcccg cagcggtgca 1981 gaggggcagg atgccgccga cccagccgac ccacgggcag ttcgcgctga ccaacgggga 2041 tcccctcaac tgccactcgt acctgtccgg atatatttcc ctgctgttgc gcgcggagcc 2101 ctatcccacg tcgcgcttcg gcagccaatg catgcagccc aacaacatca tgggtatcga 2161 gaacatttgc gaactggccg cgaggatgct cttcagcgcc gtcgagtggg cccggaacat 2221 ccccttcttc cccgacctgc agatcacgga ccaggtggcc ctgcttcgcc tcacctggag 2281 cgagctgttt gtgttgaatg cggcgcagtg ctccatgccc ctccacgtcg ccccgctcct 2341 ggccgccgcc ggcctgcatg cttcgcccat gtccgccgac cgggtggtcg cctttatgga 2401 ccacatacgg atcttccaag agcaagtgga gaagctcaag gcgctgcacg ttgactcagc 2461 cgagtacagc tgcctcaagg ccatagtcct gttcacctca ggtaggaagg agccctgtct 2521 tctcgtgccc acgggctcct agccagagct gggccccaga gaacttggga gtcccagggc 2581 aaacccagtc tgctccttga aagccaaact gttgagtgca acttaaagtg ggaacttttt 2641 atagctgctg ggcactacca ggcctgtcct ggggagagga gaggaaggct ggattgcagg 2701 aaggatgctt cagatagatt ccccacatgg gccatagggc agcctctgct gctgtttccc 2761 ccacctgcag tatccagatt gggatgccct ctcctgctcc ctttcctgag cccaggcccc 2821 ccggctgctg ggccagggaa tctcctcaag ccctcggttt tctctagcgc ctggctgggc 2881 ctccaggaga tgccctgggc agtggagcag ccctgacatt gtctctgcag ggaaaagttt 2941 gcctgctaac tcaataggag ttggagctcc aggcaaatcc aacctgaggg cagcaatttt 3001 caaaatcata ttcttactgg aataatgcag actgccagga gaggcgaaaa gatgtgtcag 3061 gatggggcaa caggaggaaa gggaccgtca taactcttca gaatgtctgc aggaggttga 3121 agactctggc ctcttgcttc ccctgcccaa gcaaacactt aaggcaattt tcaaacaacc 3181 caaagtgtat gccaacttgg gctggatatg gatgttgcta gagatataaa accatcattg 3241 actgcattat gcaatctaat tagtttgcat gtagtttaac ctgtgtgagt aatagagctg 3301 atactaatta acttgagaca aaatgatttt tgaaggtgtc tggctttcca ttttgggcca 3361 cacctgagct ttttggctgc ctccaggctg agttcagcat agcccaggtg gatggaaatc 
3421 accaaccccc tttcacattc ttgtactaag ttgtcagctg atccttttgt catggctgag 3481 aggggaaagg ggttcccgat taagaataac aaacatctgg acgttagcag ggcttggaac 3541 ctagtcaccc actacaattt tttaagcttg ttaacaggaa acgaggagag atgtgtagtt 3601 tatgtggcaa acataaagga ggtccacaat ctcttcctta ttgttcgaaa gaagtctggg 3661 gaactaagtc attagctgaa cagcttgcag cttcttagtc ctcaaactgc tgtcttgtac 3721 attgctgctt tccacataaa acctcccagc acagccaccc caaattctgc ctttggaagt 3781 ccttaggtaa tcaattaaag gccagtgtct gggcagggaa gctgtttgca gaggtttgga 3841 tagggattat ttttctcgag tgcccctcat acctagaacg tttgcctgct gcagatatac 3901 agacaagata tctccctacc tcatagccac cttcaccctg tcccacaacc caacacccag 3961 cccaccttct tgacgatgga ggtagtcgtt aatttcttgg tatcattgct agctcttagc 4021 cttacccttg gtctttcaag gaaaaaaaaa tgcctgatac tgtcgttaac agtttaattt 4081 atgactattt gcaaaatgtg gaggcctgtg tggagaatgc aaatcgcctc acaaggagat 4141 aaaggttcct gttatctgat tcagagggag acacacagtt ccgaggatgc cgagggaagg 4201 agtcctgtgc aaacagatct tttagttcct gtccacagac acctctacat tctgtagaca 4261 tgcagagtcc tagatataca tatgaggagg ttgcatgtgt gttctggttc ttggggtgga 4321 ggtggtggtg gagattttcg aatatctgtt ttaatgactc acaatatgtt aaccccagtc 4381 ttgtatgggg agccctgaaa ggtcaggtca caacctgaac cgcctttacc aagaagcttc 4441 tcatttcaaa ggcacttact gtattttcac gtatcaaaaa agaaaagtgt tttcctgaag 4501 ataacaggga ggggggatgc ttctcggagg gacctgggtt ttaaaaaata aaggggggta 4561 ggagaaaaag ggagagagaa gaaaaagggg gaaaagaaac agctcaaatg aacagagaga 4621 tcacggtttg catggctgcc cttcaatagg ctaaactgac aagatcagtt ttaaagggcc 4681 acttaaatca gtttcaaaga ctggagaaaa atgagtcgcc ctctttgggg ggaatctgag 4741 gctggatcgt ggacgccata ctcggactgg ccaggttgtc cacccccaac ccccaacccc 4801 tatataatta taagccaaac tattatttac agaggaggag gggagagagt gctggtagtt 4861 ttggtttctt ctgtttactg acttgggcca gacagaccca atgcactttt gcttgggcct 4921 ctcaatgacc caggctaagg actcgggcag tgaaagcctg tagcacactg agctatgttt 4981 tattttaaag caaacacttc ctcaggacca tctcttgctc tttcttcccc caacccgcct 5041 ccccccctta agttttcagt acatagacat ttgctccaac tccggttggg gcagtttgac 5101 
agagctctgt gcacacacct catgtgaccc aattcaaacc tcttaatctg atgactgaat 5161 tcttcttctt cttcttcttc ttctttttct tcttcttctt cttctgtttt taaactttct 5221 tccagatgcc tgtggtctct ctgatgtagc ccatgtggaa agcttgcagg aaaagtctca 5281 gtgtgctttg gaagaatacg ttaggagcca gtaccccaac cagccgacga gattcggaaa 5341 gcttttgctt cgcctccctt ccctccgcac cgtctcctcc tcagtcatag agcaattgtt 5401 tttcgtccgt ttggtaggta aaacccccat cgaaaccctc atccgggata tgttactgtc 5461 cggcagcagt tttaactggc cgtatatggc aattcaataa // LOCUS HSU60669 2424 bp DNA PRI 14-JUL-1996 DEFINITION Human 1 alpha,25-dihydroxyvitamin D3 24-hydroxylase (CYP24) gene, promoter region and partial CDS. ACCESSION U60669 S78775 NID g1418240 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2424) AUTHORS Chen,K.S. and DeLuca,H.F. TITLE Cloning of the human 1 alpha,25-dihydroxyvitamin D-3 24-hydroxylase gene promoter and identification of two vitamin D-responsive elements JOURNAL Biochim. Biophys. Acta 1263 (1), 1-9 (1995) MEDLINE 95359195 REFERENCE 2 (bases 1 to 2424) AUTHORS Chen,K.-S. and DeLuca,H.F. 
TITLE Direct Submission JOURNAL Submitted (13-JUN-1996) Human Oncology, University of Wisconsin, K4/350, CSC, 600 Highland Ave., Madison, WI 53792, USA FEATURES Location/Qualifiers source 1..2424 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q13.2-13.3" enhancer complement(973..987) /gene="CYP24" /note="enhancer 2; VDREd" gene complement(973..2424) /gene="CYP24" enhancer complement(1094..1117) /gene="CYP24" /note="VDREp" TATA_signal 1233..1238 /gene="CYP24" exon 1263..1922 /gene="CYP24" /number=1 prim_transcript <1263..2424 /gene="CYP24" mRNA join(<1263..1922,2129..2316) /gene="CYP24" CDS join(1664..1922,2129..2184) /gene="CYP24" /codon_start=1 /product="1 alpha,25-dihydroxyvitamin D3 24-hydroxylase" /db_xref="PID:g1418241" /translation="MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVC PLTAGGETQNAAALPGPTSWPLLASLLPDSLERGSQETARHPGGVPQEVWQDFPHEVG FL" intron 1923..2128 /gene="CYP24" /number=1 exon 2129..2316 /gene="CYP24" /number=2 intron 2317..2424 /gene="CYP24" /number=2 BASE COUNT 514 a 781 c 656 g 473 t ORIGIN 1 aagcttaagt tggactgcaa aagattttta agaaacaaac tagaggctta caaaagaatt 61 taccagggaa ggatttgcca acacttgggg caaaaatgac tgagacacgt aattgaaaag 121 cagggcgagg aaacagaatg tccctcccct cctccctggc tggtggtagt tttgtattgg 181 atcaggttga aaggattcga gccactttaa aacaatcaac aaaaatctcc aacttccgtt 241 cctgaattag caaaagcgtt ctacccttgc cagccgcgat aactctagaa attgtgtatg 301 tttgtcactg gacctgaata ggacgcacct tgggggagag ggggagtact gggggcggtg 361 gtttcaagct ctgtgtgccc ctcctttgca caaggtagtc ggtgtagaag aacagaggcg 421 ggcagtggac ataggggagg tggcccagcc tgaattcaag catcgttggt gcaagccacg 481 ctctcaaggc gcatttccta acttcctaaa gaagtaacaa cagcaactaa cttttattga 541 ctgttgacta tagacaaggc ttcgttgttg ccgctttacg tgcattaatt catttaatct 601 gcaaaacaac caaacatgaa acagcaggaa ggaaattcta caaactctcc ttcttgctca 661 agttaagaaa gtctcctctt ctggtgcatt tcagtaagac tcaaatcctc cccaccctgg 721 gaggcgcaga aagccaaact tcctccaaaa aaaaaaaggc aaaaaaaaaa aaaatcactt 781 cagtccaggc tgggggtatc tggctccccg 
ggaggcgcgc gggctccccg ggccctggca 841 gacgcggcag cttttctggg cccgcactcg gggacctcgc ccgcccggca tcgcgattgt 901 gcaagcgccg gcggcaacca cggccggctg cggctcctgc gccgggggag ggcggggagg 961 cgcgttcgaa gcacacccgg tgaactccgg cgttcgcatg ccttcctggg ggttatctcc 1021 ggggtggagt gctgccgccc ccaccccacc tcccgcgccc agcgaacata gccccggtca 1081 ccccaggccc ggacgccctc gctcacctcg ctgactccat cctccttcca ccccccctcc 1141 cctgggtccg cgtccctcgg agtctggcca gccgggggcc actccgccct cctctgcgtg 1201 ctcattggcc acccagggca tgctctgtct ccataaatgc atggtccctg ggcataggaa 1261 catggagagg gacaggagga aacgcagcgc cagcagcatc tcatctaccc tccttgacac 1321 ctccccgtgc tccagccaga ccctagaggt cagcttgcgg accaacagga ggactcccag 1381 ctttcccttt tcaagaggtc cccagacacc ggccaccctc ttccagccct gcggccagtg 1441 caaggaggca ccaatgctct gaggctgtcg cgtggtgcag cgtcgagcat cctcgccgag 1501 tccttctgct gcctgtcccg cctcaccccg ctccatcaca ccagctggcc ctctttgctt 1561 ccttttccca gaatcgttaa gccccgactc ccactagcac ctcgtaccaa cctcgcccca 1621 ccccatcctc ctgccttccc gcgctccggt gtcccccgct gccatgagct cccccatcag 1681 caagagccgc tcgcttgccg ccttcctgca gcagctgcgc agtccgaggc agcccccgag 1741 actggtgaca tctacggcgt acacgtcccc tcagccgcga gaggtgccag tctgcccgct 1801 gacagctggt ggcgagactc agaacgcggc cgccctgccg ggccccacca gctggccact 1861 gctggcgagc ctgctgccag attctctgga aagggggtct caagaaacag cacgacaccc 1921 tggtaaaccc ctttttcgcc cctgactctc tcctccctct tcccctccca gtggcgctcc 1981 aaaccctccc cgacatcctt cgccatggtg cttcggcgta tgggcagcga ggtcccggtc 2041 ccctaggagt aaggaggcgg gaggagggaa agcgcttcgg gcacggcgga ggatgcgctg 2101 acaccgcgct gtgcccggcc ggctgcaggt ggagtaccac aagaagtatg gcaagatttt 2161 ccgcatgaag ttgggttcct ttgagtcggt gcacctgggc tcgccatgcc tgctggaagc 2221 gctgtaccgc accgagagcg taccccagcg gctggagatc aaaccgtgga aggcctatcg 2281 cgactaccgc aagaaggcta cgggctgctg atcctggtag tccaatccga gggacctggg 2341 catgcggcca gacctgatga gcctgacggc cgcctggaaa ttgtgaagag gcggagagtt 2401 agagttccct ggggttggaa gctt // LOCUS HSU61537 6821 bp DNA PRI 21-JUN-1997 DEFINITION Human potassium channel beta 
subunit gene, complete cds. ACCESSION U61537 NID g2209018 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6821) AUTHORS Folander,K., Biazzo,D. and Swanson,R. TITLE Sequence of the gene encoding the beta subunit of a human, large-conductance, calcium-activate, K+ channel JOURNAL Unpublished REFERENCE 2 (bases 1 to 6821) AUTHORS Folander,K., Biazzo,D. and Swanson,R. TITLE Direct Submission JOURNAL Submitted (20-JUN-1996) Pharmacology, Merck Research Labs, Sumneytown Pike, West Point, PA 19486, USA FEATURES Location/Qualifiers source 1..6821 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q34-35" /tissue_type="smooth muscle" mRNA join(1..179,1643..1814,6519..6821) /product="potassium channel beta subunit" CDS join(46..179,1643..1814,6519..6788) /note="calcium-activated large conductance potassium channel" /codon_start=1 /product="potassium channel beta subunit" /db_xref="PID:g2209019" /translation="MVKKLVMAQKRGETRALCLGVTMVVCAVITYYILVTTVLPLYQK SVWTQESKCHLIETNIRDQEELKGKKVPQYPCLWVNVSAAGRWAVLYHTEDTRDQNQQ CSYIPGSVDNYQTARADVEKVRAKFQEQQVFYCFSAPRGNETSVLFQRLYGPQALLFS LFWPTFLLTGGLLIIAMVKSNQYLSILAAQK" BASE COUNT 1632 a 1787 c 1691 g 1711 t ORIGIN 1 ctctagaact agtggaagac taaatgatca ctgcccccag tgaatatggt gaagaagctg 61 gtgatggccc agaagcgggg agagacacga gccctttgcc tgggtgtaac catggtggtg 121 tgtgccgtca tcacctacta catcctggtc acgactgtgc tgcccctcta ccagaaaagg 181 tactgagctc tcccggcctg cccacccccc aaccctctcc taagggtctg gcatctgtaa 241 ggaaccctgg ccttctactc acccctctcc ttgaccctca ttcaaggcag cagggtcaag 301 acgatgctca ttcttaggtt ctagggttca agtgcaggga aggtaggatg aactgagccc 361 cactggtctg agattctagg acctggtggt aacagacaca tgatctagga cgtggtggta 421 acagacacat gatgcccttc tcagaagaaa tagcaaagaa tcagagcaaa ggttccagtg 481 caagtttgat cccaggagga atcagagaat tgcagatgtt tacagctcag ctgcataatt 541 ctctcccctc tgaagcctcc ctgtcaggtg ggcatccagc ctcctgatga 
gtacctcaag 601 agacggggag ctccttgcct cactgaacat acatttcatg tttgaggttt tctgatgtgt 661 ctgcattgag ccagcatctg cttccctgaa gttctgctca ggaatctcag agaatggtcc 721 atgcccttgc tccatgacat atcaaccact tgagatttta aggaaaagct gtgaattcct 781 aattcgtgtt ctatttggaa caaagacact cagttccccc agcctgtcct taggataagt 841 ttcagacctg tgctctgggc acgttccagc agctgacagc ccccttctga ctggcaatgg 901 gtctcccaca ggatacttgg gagttgaagc acaactgatt tactaatatg ttgatgaacc 961 tgttagcctc tcccgggcct gctgcagcct cactcttcac aacgctgaag cgattccagg 1021 ctggactctt aatctggtga agcaccatct agagactgtc cccaccatct ttcccatcct 1081 ggctcaactc aaaccccttc tacatctgct ccttttaggg gaaagaacac agctctccaa 1141 cccactcaga cctagcttca ggtcctgcaa ggtctcagca aagtcacttg acctttctga 1201 gtttctcttc tgcctctgca aaatgaagac gatatggata agaaagtctc aggtttggct 1261 gctcgtcaga atcacatcag ggagctttga aaaaacaccc agggcctggg ccgaaccacc 1321 caagaagttc tgattcactt gctctggggt gtagcttgga cacgggaatc tttttagaag 1381 ctccccaggt tgttgcaaga gtagccaagg ttgagaacca gtcatataga taatgtctaa 1441 gaccccatcc agcttctaca attcctccct gtgtccgggg actgtggggt catgtgcctt 1501 tattgctgga tgctgaagac aagatgagga tcccagtcct ggatccctct agaggctccc 1561 aggggataat gggttcaaat gggtgttctg tgtgtcaccc agttctgaga tgtgtatgtg 1621 tgcatgtgtg tggtctcctc agcgtgtgga cccaggaatc caagtgccac ctgattgaga 1681 ccaacatcag ggaccaggag gagctgaagg gcaagaaggt gccccagtac ccatgcctgt 1741 gggtcaacgt gtcagctgcc ggcaggtggg ctgtgctgta ccacacggag gacactcggg 1801 accagaacca gcaggtactg aactggaggg atggggacac atccctttat ccctgtcagg 1861 tgtgtgagcc tgttcctaac tgtccctgtc cccgaggctg agaaagatca acccacttga 1921 ccaggtctca cagctggaat gagactgagc cagccctagc tcttctgtgt tggagtcaca 1981 cagacctggg tttgaatctg ccgctaactg gctgtgtaaa cataggcagg ccacttagcc 2041 tctcacctcc tcagtttcca cccttgtaaa gtggaatgct aatgctgcac cttgaagagc 2101 tgttatcaag tcagagacgc tgtatataaa gtgccacgct tgatgctgag tggatagtgg 2161 gcactcaaac aaaagcagtg attagtgtta gtactttcat tctacaacct cagtcaaatg 2221 agggaccagg cccagggcca gatcctcaaa gtggaatccc ggtgactccc caggaaacca 2281 
gggactcagg gctggagtaa cctccataaa caaacattta gatggttttt ccatccttgc 2341 atcccaacat cccaccctcc ccctccaaaa cacaaaactc accaccatat cctttcctaa 2401 agcgatcagt tggcccatat tctgtctctc ttccttcctt ctttctttcc ttcttttttc 2461 cctttcttcc ctccttcttt cttttctttt ttttcccctc cctccttccc tccctctctc 2521 cctcccttcc ttccttcctt ccttccttcc ttgtttcctt ctttcctcct ctcatttatt 2581 tattcaacag ccattgagca cctcctgtgt caagcctgta ggactcaaga ggagagagct 2641 catgcatgaa cagtgcctag cagggaagaa ttggcaccgg ggagaagaac agaggaagtt 2701 acaggggtta gaggaggggg atgttttcca gctgacaatg atgggggtat cagaaaggct 2761 cttggtgaga ttgcaccatt gcactccagc ctggcaacaa gagtgaaact ccatctcaaa 2821 aaaaaaaaaa aaaaaaagaa aggctcttgg attaggcttt taacatcagg caggcgcagg 2881 atgcaaggca tagagaatga ggtgatatag gaagcgggta tgcctatagg acgaggcatg 2941 gcggagggaa gggctaggtg gagacaggga tcagagggct gactacagaa aatgacccat 3001 gatggggtgt atgcagctgg aaaaggctgt cagggcagac taaggtctgt tcccagaggc 3061 ctttgatcac cacaatatgg aattaaaatg tgagtgtgta tgtctcaaga tatgtcaact 3121 ttggctatgc attagaaccc tgtaagagtc ttaaaaaaca aaactagggt ccaacactag 3181 acatatcaag gtggaatctc tggtcagaga tccattcatg gagctcccaa ggtgattcta 3241 aggttggaaa ttattgctgt tggcaatggg aatctttagg gcatgatcag agtttcagag 3301 ttgggctggg aagatggagc ctacaggtgt gtgggtccag ttactagaca gaggagccaa 3361 ggccgcagca agcactcacg accaagatgt ggcctgcagc aggggagatg ctgacagtcc 3421 ggacttgatg gctgatggga cgggaagggg cagggtgcct gggatggggc gtggaggctg 3481 ggggatcttg agagtgctga aggtcaatga gaagtgggtc tgggcagtgc aggatggtca 3541 ggtgttctgt catcgctagt atgactcggc cccacccggt ggagagaggt ctcagggccc 3601 actggaggca gctcctaccc atgacccctg cctctgcgtt ggcattcagt agcccttgat 3661 ttatcccaat gcaagaaaca aattttcaat aactggtaac attctccggg agtgatatgg 3721 gccagagcag gctggttttc aggctgcctc cctggtcctg agattgccag ctcagaggag 3781 catcccctga atgaactgtc agcaggatgt gaaggaagga catgctggag ccactgctgt 3841 cctctcccct gccgcccagg cttgaccagg tgtggtgctg atagcaggct gcaggagctg 3901 ggaggaagca gtggaaatgc ccttcgtgtt tttgccaagc agctgggttg tgggctagct 3961 gctccttccg 
cctatgtcca tcataggggt tcaagtggct ggggctccag gctagctggg 4021 tgtcctctgg aatcactgtg aagggccaag aagactctca tcccaccctc caactctgaa 4081 ggtaggggcc atagtctgcc ctcagctttt cctgtgagtc ttagcctgga gcaggtgagt 4141 tctaaattcc ctcccattgt ctgtcctggg gaccttacac tgcagggctt gggaaaaagc 4201 tggttggttt cagatcattc agataactca ggcttagctg gacccaggac aggagagaaa 4261 ggatgcattc aggccccggc ttctgggttt ggccatgaag agtgaattcc tgcccagtgt 4321 cttgccaaac aggcccagca taaaagatcc agaggaaccc tgagcaggca gaggatacac 4381 taggcccagg atggacagaa gcctctactt cagagctcat ccatagacct ctttttaaga 4441 aggtaaagaa ggagctgttg ttgcaatttc atagataagt caaaacatat tattattaat 4501 aatagcatct tccacttttt tgagttctta tcgtgtgctg ggcttctgtg ctcagtgctt 4561 tacatgcatc atctcattta atccccatga cagttctcct aggcaggtag cctcacaaac 4621 ccattttaca aggcccagtg aggttgtggc agttgccaag gtcacacagc tcagaactca 4681 ggcctgtctg actccatagc cacactgcta tttaaggagc taggttagcc aaggtcacag 4741 tcagaggcat tggagagact caggccaggc cccctcaccg tcagctcagg atgctggatt 4801 atatgctact gaggacctgg cagtgtccaa cagctctgaa tgccagattc ctgaggagcc 4861 agcgtctccc cagcctcagc tactccctgg ctggctgggg ccctagatct gagtgcagca 4921 tctgcatttc ccaggctgtg agaactacag tgcaaccctc ctctgtgttg ctggagtgaa 4981 tgccctcagg gcaggcagct tccagggtgg cacctcatat aaatccaggg agtggccgag 5041 gcctgcaagg aatctggctc aacacctgcc atttgggtct cagttttaca cattttgatc 5101 ttgcacatga tttcgctagg gcaaaattaa gtgcatgaag gaatcacaca ttcctgggtc 5161 tcgccctcag tgatgcagag tgggtaggga cctgggctat gggggccctg taggtattct 5221 gctgcagatg ggggcaccac acttgcagct cagaacagag ataaaaccag ataaaactcc 5281 gctttatgac tgctttgcat cactttccag ctggcctcat ctgaacaggg cagaaagttc 5341 ctaataacaa taatcaacgt cagaattaaa tgtgtgtcat gctctgctgg cacaaaactc 5401 ttttcaatgc attctttctt tcctttggat cccagtgtca aatacccgct tgccaacata 5461 gtttgtttgc tgtacacagg ttgaaactgc aaaggttttc aattttaaat ttagctaata 5521 ttttactttt tttttttttt gaaatgggat ctcgctttgt tgccttacct ggagtgtagt 5581 tgcgagatca tagctcactg cagcctccaa ctcctgggct caggtgatcc tcccacctca 5641 gtctcctgtg tagctgggac 
tacaggtaca ccaccatgcc cacctggata atttttaact 5701 ttttttgtag agataaagtc ttgctatgtt gcctaggcta gtctcgaact cctgggctca 5761 agggatcctt ccccttcggc ttcccaaagt gttaggatta caggtgtgat ctggtttcta 5821 acttctctta aagtgttgga ggtctggcta ttatagatct tctggatggc ctgacatcca 5881 ttgcctggag ggagtggtga ctgcctgtct ggaggcagct ggctctctcc ctattgacag 5941 gccctgccac accctggcaa ttcttcctgg caccaaggca agagtcagtg ggcacctgtc 6001 atctcattgc tttgcctgct tcaccctctg acaatacctt catgagccct gtaggcaatt 6061 gagcacatga tctcgatttc aaatacacag cccacttggc ctgtgtccct ccctggtttc 6121 cttagatagc tctgtgtgaa gtccttcttt cataatatcc tttgaaaccg atattcaccc 6181 atttaagtga tgaagcttca gggctcagaa agagtgagaa atttgccaaa aattctacag 6241 ccaatccaag cagagccagg actccaaggt aggacctgcg cccaggtgtg tagcctgtac 6301 cccgagcagc tccagcaccg ccatccacac acaagacaca aatgagagcg tgctctccag 6361 acctgtacag cgcatggcct gccaagcttt agaccaaagc cccgactcca atccaggggc 6421 ccgactacaa agctggtatt tttctttact ccatatcaag cccttggcta agaaatgggg 6481 atttccaagc ccacagctgc tcttgtttct ccccacagtg ctcctacatc ccaggcagcg 6541 tggacaatta ccagacggcc cgggccgacg tggagaaggt cagagccaaa ttccaagagc 6601 agcaggtctt ctactgcttc tccgcacctc gggggaacga aaccagcgtc ctattccagc 6661 gcctctacgg gccccaggcc ctcctcttct ccctcttctg gcccaccttc ctgctgaccg 6721 gtggcctcct cattatcgcc atggtgaaga gcaaccagta cctgtccatc ctggcggccc 6781 agaagtagag ccatccatcc atgccatacc acttgtcagg g // LOCUS HSU65896 13697 bp DNA PRI 03-JAN-1997 DEFINITION Human gamma-glutamyl carboxylase gene, complete cds. ACCESSION U65896 NID g1763689 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13697) AUTHORS Wu,S.-M., Chu,K., High,K.A., Stafford,D.W. and Solera,J. TITLE The Genomic Sequence of Human Gamma-Glutamyl Carboxylase JOURNAL Unpublished REFERENCE 2 (bases 1 to 13697) AUTHORS Wu,S.-M., Chu,K., High,K.A., Stafford,D.W. and Solera,J. 
TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) Biology, UNC-CH, Chapel Hill, NC 27599-3280, USA REFERENCE 3 (bases 1 to 13697) AUTHORS Wu,S.-M., Chu,K., High,K.A., Stafford,D.W. and Solera,J. TITLE Direct Submission JOURNAL Submitted (02-JAN-1997) Biology, UNC-CH, Chapel Hill, NC 27599-3280, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..13697 /organism="Homo sapiens" /note="hepatocyte" /db_xref="taxon:9606" /chromosome="2" /map="2p12" mRNA join(649..774,1188..1358,3099..3257,3569..3734,5915..5993, 6585..6691,7869..8032,8678..8943,9105..9236,9608..9759, 10194..10363,10565..10695,11103..11250,11425..11620, 12049..12331) /product="gamma-glutymyl carboxylase" CDS join(732..774,1188..1358,3099..3257,3569..3734,5915..5993, 6585..6691,7869..8032,8678..8943,9105..9236,9608..9759, 10194..10363,10565..10695,11103..11250,11425..11620, 12049..12241) /codon_start=1 /product="gamma-glutymyl carboxylase" /db_xref="PID:g1513200" /translation="MAVSAGSARTSPSSDKVQKDKAELISGPRQDSRIGKLLGFEWTD LSSWRRLVTLLNRPTDPASLAVFRFLFGFLMVLDIPQERGLSSLDRKYLDGLDVCRFP LLDALRPLPLDWMYLVYTIMFLGALGMMLGLCYRISCVLFLLPYWYVFLLDKTSWNNH SYLYGLLAFQLTFMDANHYWSVDGLLNAHRRNAHVPLWNYAVLRGQIFIVYFIAGVKK LDADWVEGYSMEYLSRHWLFSPFKLLLSEELTSLLVVHWGGLLLDLSAGFLLFFDVSR SIGLFFVSYFHCMNSQLFSIGMFSYVMLASSPLFCSPEWPRKLVSYCPRRLQQLLPLK AAPQPSVSCVYKRSRGKSGQKPGLRHQLGAAFTLLYLLEQLFLPYSHFLTQGYNNWTN GLYGYSWDMMVHSRSHQHVKITYRDGRTGELGYLNPGVFTQSRRWKDHADMLKQYATC LSRLLPKYNVTEPQIYFDIWVSINDRFQQRIFDPRVDIVQAAWSPFQRTSWVQPLLMD LSPWRAKLQEIKSSLDNHTEVVFIADFPGLHLENFVSEDLGNTSIQLLQGEVTVELVA EQKNQTLREGEKMQLPAGEYHKVYTTSPSPSCYMYVYVNTTELALEQDLAYLQELKEK VENGSETGPLPPELQPLLEGEVKGGPEPTPLVQTFLRRQQRLQEIERRRNTPFHERFF RFLLRKLYVFRRSFLMTCISLRNLILGRPSLEQLAQEVTYANLRPFEAVGELNPSNTD SSHSNPPESNPDPVHSEF" variation 8762 /note="polymorphism resulting in a Gln to Arg substitution at amino acid 325; 121 chromosomes tested, heterozygosity .32" /replace="a" BASE COUNT 3353 a 3033 c 3450 g 3861 t ORIGIN 1 gggctcaagc gaacctccca cctcagcctc ccgagtagct 
gaaaccacag gagcgcgcca 61 cgacacacag ctaacttttt ttttagtgtt ttttcttttt ttgttgttgc cttttttttt 121 ttttctttat cgatcctccc gcattggcct tccaaactgt tgggattaca agcgagagca 181 gtgcccggcc cctactgctt tgaaacctcg gttctttgca ggtcctggaa aagaaagtga 241 aggggtcgtg caatggctca aaacccaggc aacatttcca ggcgcgtcct caactcggcg 301 tcactctgac aaaaagtttg ccctttgcgc ccgatccctc acaccagaga gcaccgcagg 361 gacgcaggga cggagaggac taatggggtc ggggaggacc caaggaagtt ggggaactca 421 gactcttttc agtgtttaaa aagcagaggc cccctcggaa ggcataacag gcgagtctaa 481 caatgcccct ggagatcaac ggcactaagg ggtacctttg ccccgctcca aggctccttg 541 ttccgccccc tgccgccaag caaggaatcc ggcgccgccg gtcgtgggta gtatttacga 601 cagtgctggc gctttgcggc cggccgcaca gccgctgacg cgtcgggagg cggagcctag 661 ggaagcaaat tctcctggcg gcctccgttc agacgcggca gctgtgaccc acctgcctcc 721 tccgcagagc aatggcggtg tctgccgggt ccgcgcggac ctcgcccagc tcaggtaggc 781 ttaggacccc gccctccggc ggagagttcc taggagcgct ggtctcagag gggcgggggc 841 gggaggacgc ccagtgtctc ccggtgaggc gggggggggt cctctgtggg gaagggggcg 901 tgctgtgaga catttgcagc ctgcggcttc tgaatgatga tgacattcgg gaacactgcg 961 gttaggggac tgtagtctga ggggttctgg gacattaggg gatcgtggag agggggctgc 1021 actgcaggga tgtttcctat agtcagatca ttgaggcaac ctcattgagg caacctcatt 1081 gagccgaccc gggtgggcgg aagctctgag ctgttggtgc agtgatttct ttgatttgag 1141 gtggagccca cccccgcaca catattttct ccatttattg attgcagata aagtacagaa 1201 agacaaggct gaactgatct cagggcccag gcaggacagc cgaataggga aactcttggg 1261 ttttgagtgg acagatttgt ccagttggcg gaggctggtg accctgctga atcgaccaac 1321 ggaccctgca agcttagctg tctttcgttt tctttttggt gagtccagtt tatgggtggg 1381 agcaatttgg ttgagagtgg agaatgacaa tctctatttt ttggttgcaa gtagacttag 1441 ctcccttttc ctcagagctg aagacattgg tataaaagaa gaggtctttg cgaagtttag 1501 gattaaacct gctttgtctc ctacagttag gatttacttg gtggccagaa atgctttttt 1561 ttttttcctt agacggactt tcgctcttgt tgcccaagct ggagtgcaat ggcgctatct 1621 cggctcactg taacctccgc cacccgggtt aaagcgattc tcctccctca gtctcccgag 1681 tagctgggat tacaggcgcc caccaccaca cccggctaat tttttatttt ttttgtatgt 1741 
gtgtttttag tagagacggg gtttcaccat gttggccagg ctggtctcaa actcctgacc 1801 tcaggtgatc cgcctgcctc ggcctcccaa agtgctggga ttacaggtgt gagccaccgt 1861 acccagctag aaatgctata tgatatgtag aggcgaggca gcagtgaccc attttgcttg 1921 tggcatgaag ttgcccagaa gactcagaga aacagagcag ggggccacag ggaccatacc 1981 ttttggctgt gactccctaa tcaagagctt cttaactgag gccgcagcat ggtgtgcttg 2041 gaatggttta cagatgtgca gagatattag tccccttagc ccagtagttc tcaactggag 2101 ggagttttgc tccctgggag acatttggca atgtttagag acatttttag ttgtcacaac 2161 ttgggggagg aagaataggt ggctgctggc atctctatag aggtcaggga gtgctcctaa 2221 atgtcctgta gtgcattgga taatcccctc aataaataat taaccagccc aaaatgtgac 2281 tagtcctgaa gttgagaaac cctgctttat ctcaagctgg cttccctgat ctgagtagcc 2341 tcttctgttt attcccaggg tgttgacgaa atgcaactgt ttttggagtg tgccaggaag 2401 cagaaaaggt tgagaagcac tcccagagca ctttctttct ttctttcttt ctttttttga 2461 gacggagttt cgatcttgtt gcccaggctg gagtgcaatg gtgtgatctc ggctcactgc 2521 aacctccgcc tcccaggttc aagtgattct cctgcttcag cttcccgagt agctgggatt 2581 acaggcatgc gccaccacac ctggctaatt tttttgtatt ttaagtagag acggggtttt 2641 tccatgttgg tcaggctggt ttcgaactcc caacctcagg tgatctgccc acctcagcct 2701 cctaaagtgc tgggattaca ggcgtgagcc accgtgcctg gcctcccaga gcactttctg 2761 aagctcaata ggatggatct ttatagctgt caagacccta acagttttta tagcattttg 2821 tgcatttctg taatttaatc ctctctgtag tcctgtgatg tattcccctt ctcacaaatg 2881 gggaattggt taaccgagct gcaggggctg ttctagatgg cacaacagat gtagtactgt 2941 gcagagtagg cactcaataa ttaatactga ataaatggca gagccaagat tagaaaagaa 3001 gtctgaccct ggagctgcac actgtcaact gtgttccact gtattctact gcttccttag 3061 ctgaggccca atgaccaact cccctattgc cttcttaggg ttcttgatgg tgctagacat 3121 tccccaggag cgggggctca gctctctgga ccggaaatac cttgatgggc tggatgtgtg 3181 ccgcttcccc ttgctggatg ccctacgccc actgccactt gactggatgt atcttgtcta 3241 caccatcatg tttctgggtg aggggaactg tgggaaatat tgactggcct gccagaaaca 3301 cagaaataga agcatgctgg tgaatggtca aagtctttgg caccttattt cacttctgct 3361 gtgatcaatt tcatgctgtt gacctgtgtg tcacttaagc tgtgatccca taattctttt 3421 taaaatatta 
aaataacatt taattttcat ttagttttat gtgattatac aatgcttgat 3481 ttttcttaat tccaaccata tcctggtcca gggaaactgg gcctgtggtg atctcttgac 3541 ctcttaatcc tctccctaca aatcccaggg gcactgggca tgatgctggg cctgtgctac 3601 cggataagct gtgtgttatt cctgctgcca tactggtatg tgtttctcct ggacaagaca 3661 tcatggaaca accactccta tctgtatggg ttgttggcct ttcagctaac attcatggat 3721 gcaaaccact actggtatga gtcagggggg acaaagggga ggggttggct gggtcaggat 3781 ggttatcaga gataggggcc cagtgaggtt ctggaatttt tcagtcatgt ttgcaaagta 3841 tggttacaag gtcactttga tctaggtgga tttgggaggg gaaagggcca actagttgga 3901 gcgcttggga tgacacaaag caaaatggct ccatcccacc ccttcagtct ttccctatct 3961 ccccaggtgc cttgatttat tcacctcttg taaccctctt agagagaaaa acagaatact 4021 aaagccagac atggctgttt tcagatactt taagggtggt tatgtagaag agggagcaga 4081 ctttctctgt gtgagtccag aagggcagag ttagagccaa aagtagaagt tgctgtgagg 4141 cagagatcat gtcaggagaa gagggcccac cagggctgct cagcattgct gttctcggag 4201 gaggtaaccc ttccccattt ctggcttagg tgactgtctc taaggggagg tggagaaggg 4261 atttggagat aaggtggtgg ggcagttgac cttaaggatt ctttatatgt tgagtgaaaa 4321 gtaataaaat aggtgtcaat aacatgagta tggggaaaaa aactcaaaca ttgaattcat 4381 taacagtcat taagttttat ttatttttct ttttatttat ttatttttaa atagagatgg 4441 ggtctcgctt tgttgcccag gctggtcttg aacttctggg ctcaagcagt cctcccacct 4501 cggcctccca aagtgctggg attacaggca tgagccactg tgtccggcct atttgttaat 4561 acacacatac aaaacccctt tggtaaaact tgaaactcat tcagaatggc cagctctgat 4621 catagagaat taagaggcag ttttgataga atatctctga aaagttccgt tgtggggata 4681 tttgttgaga tggcatctca ctgtgtcgcc caggctagag tgcagtggcg tgatcatagc 4741 tcactgcagc ctcaaccttc ttgtgtttag gtgatccttc catctcagcc tcctgagtag 4801 ctggaacaac aggtgtgtgc caccacactc agctaatttt taaaattttt tgtagagatg 4861 ggatctcact atgttgctca ggctagtcag aagttccatt tttatgactt tatgcatttt 4921 gaaaaaatca cttatcattt attcagggtt ttccacatcg caggcactat actaaatatt 4981 tagagttttt cctcattaaa accctgtaag gtcgatggtg ctgttcacat tttatgtttt 5041 ttggagtcag ggtcttgctc tgtcactgag gccagaatgc agtggtgcag tcatagctca 5101 ctgcaacctc aaactcctgg 
gctccagaga ttctctcgct ttagcttccc aaaatgttgg 5161 tattataggt atgaactact gagcctggcc tgttcacatt ttaaagataa agaagctaag 5221 tattattgac gagtgaagaa tttgcctaag gtcagctagc agttaagaaa cagaagtgag 5281 tccaggtgtt catgactgca gagtccagtg ctcttgacca cagtgtaagc tgcccctttc 5341 ccagcttcta cttcttggaa acttgctttg cttctgcttc cccagagtac atctggtgtt 5401 agtagcctgg gtctaatcag ttctcttcca gaatacaagc tgttagcagg aaccctctta 5461 tactggatcc tggcacccag gtctatgaaa tgttagttta tgttgtgtgt tactctgctg 5521 ttgagggttt tcattttttt ttaaagataa tacaccgcat atttatttac acatacatag 5581 cttttcaaac actccatcaa attcctcccc tttttcagta attttctagt attgtagtcc 5641 ttttttcctg atttgatcca ttagaacaaa ctcaactggg agctaacaag aaatactgaa 5701 ctttttgctg ttataactta gtagcaatat ttcttctatg acttgtataa tcatgttaga 5761 atttgctcat ataattctaa aaggtctcag aataaccagg gaggatatgg gagatgtgcc 5821 caggatagat aggattatcg tgtataaaag gaaggatgga gatgtagctg cattaagcag 5881 ctcccccctc caatgtttac ctccctcggt gtaggtctgt ggacggtctg ctgaatgccc 5941 ataggaggaa tgcccacgtg cccctttgga actatgcagt gctccgtggc caggtacagc 6001 attttaggac agagggagga gctccttgac agccatgtgg tcagcatggc catcccactg 6061 ccagactaca ctctccgctg cctccctgga tcttctgttt ttagttttta ccctttgctg 6121 gaaagctagg ctggcttctc attttttcca atcccttaat ctctgaatta tcttatttgt 6181 caaggaaaga ctagagcttt gtccctgcta gattggaatc ctcatgacaa agagaaagga 6241 ttcagtctgt gctttatgac ttcttccttg gtgcctgata ctgtcgctga cattggttgg 6301 atgtcaataa atgctgtttg gatgaggcga atttctgaga tgaatgaata gaagatgggg 6361 tctattcttc cttgggccct ggactcagat cagtatttat tttattcatt taaaaaatat 6421 atataagtag attatttggg tcaaggactg taatatttgg gtcaaggact gtaactcagg 6481 agcatggatt caagttgtcc taactatatg ctccttattt tattcttgcc tatctgtaga 6541 aaagattctt tttgcttgaa ctttgtgtca cattttattt tcagatcttc attgtgtact 6601 tcattgcggg tgtgaaaaag ctggatgcag actgggttga aggctattcc atggaatatt 6661 tgtcccggca ctggctcttc agtcccttca agtgagtggc agtcatggtg gaggcctctt 6721 tgaggcatgg gtacagagca ggtgactcat ctctctcagt aatggcacct gttcctactt 6781 tgcctgaatt tgccacatgc cttcctttgt 
ccactcattt ccctattcag cagaagcaga 6841 gtcagtggta tcaaaagatt gagccctagc tgctcttggg agatatatgg tatagatagg 6901 caagaaagga aaagaaattt aaaaaggaaa ggaaagagcc ggaactggct gacttggtga 6961 gtgagccagt gccctcagca gctgccttgg atggtgcctg tgtccttgcc ttgccactct 7021 agcctcccat cagagccctg cagctgatgt actgagaaga tgggcactca ctgttcaggc 7081 tctgacaaaa ccattattat cacttccaca aaacaaaggg tcacctttct gatttgtctt 7141 cttcccattt cttctttcct cttgttttac tttattctct ttcttcttcc ctgttatttt 7201 ccagttcctg aagtttgact tcattatgat gttagccaag cattcacctt ttaaagtgtc 7261 ctctcactca tgacctccgt ggggaacgtc ccttccacaa cacacacaca tctcagtcaa 7321 attttgtcct aggagatctg cccttgctta ttacatagga tatgttagaa aacattgaac 7381 acctaaagtg tgtgaaaatg tagagggggc tgggcatggt ggctcatgcc tgtaatcccg 7441 gcactttggg atgctggggt gggaggattg cttgagccca ggagttcaag accagcctga 7501 gcaatataag acactgtctc tacaaaaaaa ttttttaaaa aaacagctag gcgtggtggc 7561 acgtgcctgt ggttccagct acttcggagg ctgagacagg aggattgctt gagcccagga 7621 gtttgaggct acagtgcacc atgatcatgc cattgccttc cagcctaggc gacagagtga 7681 aacttcgttt caacaacaac aacaacaaca acaacaacaa acaacctaga ggagtgtctt 7741 catgggtgga ttatgtacct taggcatcta cccagccact ggagtagcca aggcacattc 7801 gtgctgtgaa tgtgctttga tgtgttctcc agtgggctgg tgcttgaact ttctccatgt 7861 gtttgcagac tgctgttgtc tgaggagctg actagcctgc tggtcgtgca ctggggtggg 7921 ctgctgcttg acctctcagc tggtttcctg ctcttttttg atgtctcaag atccattggc 7981 ctgttctttg tgtcctactt ccactgcatg aattcccagc ttttcagcat tggtcagtac 8041 ctgtagcctg ggaggaaaca ggaaccacag ttttgcagga agggactagc cagaggcatt 8101 gtttgggagt gggagagggt ggtgtgagca ggagctaaat tgaacccagg gcagtcacta 8161 tgaagtaggc ataaatgggg ataagaggac tggggcttct gggctcagtt agtatgttgt 8221 acagtaggga atctccagga agaacaaata tcttacagtt aagaactact ttgtggccgg 8281 gcttggtggc tcaggtttgt aatcctagca ctttgggagg ccgaggtggg tggatcacct 8341 gaggtcagga gttcgagacc ggcctgacca acatggtgaa accccgtctc tactaaaaat 8401 acaaaaactt agctgagcat ggtggcgcat gcctgtaatt ccagctactt gggaggctga 8461 ggcaagagaa tcgcttgaac ccaggaggca gaggttgcag 
tgagctgaga tcacgccact 8521 gcactccagc ctgggcgaca gagaaagact gtctcaaaaa aaaaagaaga actactttcc 8581 ttttcagtgg aacacctggg agaagtctcc taagggaacg gggttgaaga ggcccagcca 8641 aactcctgaa atatgtattt tctccctgct ttcctaggta tgttctccta cgtcatgctg 8701 gccagcagcc ctctcttctg ctcccctgag tggcctcgga agctggtgtc ctactgcccc 8761 cgaaggttgc aacaactgtt gcccctcaag gcagcccctc agcccagtgt ttcctgtgtg 8821 tataagagga gccggggcaa aagtggccag aagccagggc tgcgccatca gctgggagct 8881 gccttcaccc tgctctacct cctggagcag ctattcctgc cctattctca ttttctcacc 8941 caggtttgta ggggctgtgg agtgtacagc aaaggctgga cttagaaagg aacggatgac 9001 aatagttggg aaagaatggg gctggttttg cagccccttc ttggtagggg atggggtcag 9061 tgtgagaagc attgagctga ttcccctctg tgctgccttc ccagggctat aacaactgga 9121 caaatgggct gtatggctat tcctgggaca tgatggtgca ctcccgttcc caccagcacg 9181 tgaagatcac ctaccgtgat ggccgcactg gcgaactggg ctaccttaac cctggggtga 9241 gatgtctgat gctgacaatt tactgttctt cagagccccc ttgtgctttg gggtctggtt 9301 ttgtcttaca cacaccataa agcaaaacaa atttgagtct gctttgcctt ttctcaagag 9361 ccagattgaa ctatctttta tgtgggttga gatagggcaa tgcattcatt tccttgaact 9421 gagattagat tcaggaaaag taagcaccac tagcccaaca atgaaaaatt cattcccaaa 9481 ggcaagttcc atttttttct agccagaatg ggcagcaggc agggatctcg ttgtgccatt 9541 ggcagtgcat gacatgtctt ggaagttgat aatgatgagg tgggactaat atgggggttt 9601 ggggcaggta tttacacaga gtcggcgatg gaaggatcat gcagacatgc tgaagcaata 9661 tgccacttgc ctgagccgcc tgcttcccaa gtataatgtc actgagcccc agatctactt 9721 tgatatttgg gtctccatca atgaccgctt ccagcagagg tgggcaaggg gagcagagag 9781 tgggaaaaaa tgaagcacag ttctttttac caagatgaac agcccttgct tcctggtatt 9841 gtcttgctta aaaagctgtc cttcctctag tgcatgtccc ccttgatctg gttgtgggca 9901 tagctggttg cttccagcag ggggcaccat catccaggag aaaagtgagt tgacttgttc 9961 ttttagacac tgttgacata ataccttctc aggtagctac taacccccat tatacccagt 10021 gctccagatc tcttctgcct tctttctctg ccctggactg gcaataacat gctgacacag 10081 gccatttctt cttctattcc ctgtccttag gaatgatgat gaaagggaga gaactggaag 10141 ttctggtggg tgggtggctg tgatgtcctt agaacatggt 
tttctttcct aaggattttt 10201 gaccctcgtg tggacatcgt gcaggccgct tggtcaccct ttcagcgcac atcctgggtg 10261 caaccactct tgatggacct gtctccctgg agggccaagt tacaggaaat caagagcagc 10321 ctagacaacc acactgaggt ggtcttcatt gcagatttcc ctggtatgga ggaaagtgag 10381 gagaggcttt tctgtctttc cttctggctt ccttggccag ggacattcca tgtggtggga 10441 ggggagaggg aatgtttttg aaacctaagg ggatgatggt ggtaaaggtg tgatgtctct 10501 gttctggctg ggagtgagtt cactctgcca tggggtggga tgatgaactc tgttggtctg 10561 gcaggactgc acttggagaa ttttgtgagt gaagacctgg gcaacactag catccagctg 10621 ctgcaggggg aagtgactgt ggagcttgtg gcagaacaga agaaccagac tcttcgagag 10681 ggagaaaaaa tgcaggtatt ttgctgggat taacaacagc agaaagaact gagtgcagta 10741 gctcacacca gtaatcccag cttcttagga ggctgaggtg ggaagccagg aattcacagt 10801 gagcctgagc aacagagtga gaccccatct ctaacaattt aaaaaacaat ggcagaaagc 10861 ttagatgagc aagattcact tattatagac tgaaaatatt ttaccatagc agtaatgcag 10921 tggagctcaa aaaacaaggg cccctgggac tgtgaagagc ctcatccagg attctagagg 10981 tccgttactt tcgattccaa aacacaattg agtagaaaga agccaagagt catagagtag 11041 ggaggccata agctggctag agaaaagctg atgaaattta tccatttctt ctcccttgtc 11101 agttgcctgc tggtgagtac cataaggtgt atacgacatc acctagccct tcttgctaca 11161 tgtacgtcta tgtcaacact acagagcttg cactggagca agacctggca tatctgcaag 11221 aattaaagga aaaggtggag aatggaagtg gtgagtgtat gaacatttgg gctgagatag 11281 cctctggttt caagaagtca gggttatgaa tgatgttcta gccattcaag aatgaacttg 11341 aagtctctag ctggcagaag aggagttcta aggggagaga gtgatccatc tcagtgattc 11401 ctcttttttc tgtcttccgg gcagaaacag ggcctctacc cccagagctg cagcctctgt 11461 tggaagggga agtaaaaggg ggccctgagc caacacctct ggttcagacc tttcttagac 11521 gccaacaaag gctccaggag attgaacgcc ggcgaaatac tcctttccat gagcgattct 11581 tccgcttctt gttgcgaaag ctctatgtct ttcgccgcag gtaagttcac aacaatattt 11641 gtcattgcca tcatatgttg gcaagcttgg taactttccc ctggggagag taactctaga 11701 gactgtcata caggaaagga cagtttagct ctaggtatct tttcctgcca ttcttcttga 11761 tttactggaa gaataagtgg cctatatctg atggcctgct ttcttgttct gcctcagctt 11821 atctccagcc tctctgtggg 
ctgttcctac cctatcctgg aatcctctac cagactgtat 11881 aaagagtggt gtacccttcc agctgggaga tggagggtca cttgggggga gattcccata 11941 tatgcctgtg tgggaggagt gaggaccagg tgccttctaa gggctttagc tgaaattctt 12001 ctggtgaaac aggagtttga acttggtcgg ctttttcctg tttttcagct tcctgatgac 12061 ttgtatctca cttcgaaatc tgatattagg ccgtccttcc ctggagcagc tggcccagga 12121 ggtgacttat gcaaacttga gaccctttga ggcagttgga gaactgaatc cctcaaacac 12181 ggattcttca cattctaatc ctcctgagtc aaatcctgat cctgtccact cagagttctg 12241 aagggggcca gatgttgggt gcagatgtag aagcagccag tcacagaccc attctatgca 12301 atggacattt atttgaaaaa aattctcaaa agtttttttt ttttttttgt gggggggcgg 12361 ggttctaaag ctgtttttaa ctccgagatt acaacttaga ggaaccaagg aaataaagca 12421 aataagattt aacaacccaa gattaagagg ccaggaagag gttagacgca atgtgaaact 12481 gtcctcctag gataaggttt aaagtggctt tttgggggct gggtgccgtg gctcacgcct 12541 gtaatcccag cattttggga ggctgaggtg ggcagatcac ttgaggccag gagttcgaga 12601 ccagcctggc caacatggca aaaccccttc tctactaaaa atacaaaaat tagccagacg 12661 tggtggtggg tgcctgtaat ccaactaccc aggaggctga ggcatgagaa tcgcttgggc 12721 ccaggaggtg gaggttgcag tgagccgaga tcgagccact gcactcctgg gcaacagagc 12781 aagacttcgt ctcaaaataa ataaataaag tggctcttgg ggaaaagcaa tttaatgtac 12841 cacgatgaat agctaactgt tcccaagtgt ttgctatgtg caacacaccg cgtgagcagt 12901 gttacctgca ttattacatt aggctgagag gtaaaataat ttgcccgaag acatacagct 12961 agtgacgaat ggactgatgg tttgaactta acgtctattt gacttaaggt cctgcaccct 13021 gccacttgta attttcagaa tcactgataa tctgaaataa tgcagcttaa aacatgtttt 13081 cttaattaaa agtataattg gatggtggtc aggttgtaaa tgcacacaaa gtaatttatc 13141 ttcattgcaa ttttggttcc ttctttgttc ctattatcta tatacaccca gtgtttcact 13201 agcagttatg agagcagtca gaatagcagc atcaaagtta aataaggaaa tctagatgat 13261 tgctagattt tctctaaaaa taagcttagt atttggtatt attctagcct ttggtgagca 13321 gaaggggagg taaagtaaga attaactaaa tttagtaagg gattagggaa tgcttagaca 13381 ttttgtaggt tggggtatac aagggtaact caggaagctc ctcagttcag tgaaaattca 13441 aagggttggc actgagctag gaagcgagaa tgcagggtta tgaaaaagga tccctgcttt 13501 
gggagctcat ggacttgtag cagagaccgg tagtaaattg gcagtctcag taacttctgt 13561 aaatgattgc atctctttgc aggcttctct tcccatactc tcccccactc ctccttggac 13621 agtgttccgg gggttctttc ctaagcaact gctttttcag tatacatact tcagatgatt 13681 gtgtctactc caggatt // LOCUS HSU66711 5543 bp DNA PRI 05-SEP-1996 DEFINITION Human Ly-6-related protein (9804) gene, complete cds. ACCESSION U66711 NID g1519439 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shan,X., Bourdeau,A., Wells,D.E., Cohen,E.H., Landgraf,B.E. and Palfree,R.G.E. TITLE The 9804 gene has typical Ly-6 structure and is located near the E48 gene on human chromosome 8q very close to the telomere [Abstract 234] JOURNAL FASEB J. 10, A1040 (1996) REFERENCE 2 (bases 1 to 5543) AUTHORS Cohen,E.H., Palfree,R.G.E., Shan,X. and Landgraf,B.E. TITLE Identification and characterization of a novel human Ly-6-related gene with reduced expression in a costimulation-deficient U-937 clone JOURNAL Unpublished REFERENCE 3 (bases 1 to 5543) AUTHORS Shan,X., Bourdeau,A., Rhoton,A., Wells,D.E., Cohen,E.H., Landgraf,B.E. and Palfree,R.G.E. TITLE Characterization of the human gene 9804 and its mapping to chromosome 8q24.3: Evidence for a human multigene family homologous with Ly-6 in the mouse JOURNAL Unpublished REFERENCE 4 (bases 1 to 5543) AUTHORS Palfree,R.G. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Roger G. Palfree, McGill University / Royal Victoria Hospital, Medicine / Endocrine Laboratory, 687 Pine Avenue West, Montreal, Quebec, H3A 1A1, Canada FEATURES Location/Qualifiers source 1..5543 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="P26,P36,P40,P19, and overlapping B18,B7,PCR5." 
/map="8q24.3" /chromosome="8" mRNA join(1168..1245,3543..3651,3972..4091,4223..5060) /gene="9804" gene 1168..5060 /gene="9804" exon 1168..1245 /gene="9804" intron 1246..3542 /gene="9804" repeat_region complement(2870..3188) /rpt_family="Alu" exon 3543..3651 /gene="9804" CDS join(3600..3651,3972..4091,4223..4446) /gene="9804" /note="similar to mouse Ly-6 antigen TSA-1/SCA-2" /codon_start=1 /product="9804" /db_xref="PID:g1519440" /translation="MKIFLPVLLAALLGVERASSLMCFSCLNQKSNLYCLKPTICSDQ DNYCVTVSASAGIGNLVTFGHSLSKTCSPACPIPEGVNVGVASMGISCCQSFLCNFSA ADGGLRASVTLLGAGLLLSLLPALLRFGP" intron 3652..3971 /gene="9804" exon 3972..4091 /gene="9804" intron 4092..4222 /gene="9804" exon 4223..5060 /gene="9804" BASE COUNT 980 a 1886 c 1620 g 1037 t 20 others ORIGIN 1 ctgcagagac agaaccccct gccctcaccg ccaacattcc agaggggaag atgccctgcc 61 tgtgccccac gtaagaccca gaggcaggga ctcccccggg ccacagccta ctggaggctc 121 ctgtccccag agacagctgg gccatcgccc caaagctgtg gaagccccac ccacgatgcc 181 atcataccac ggccaggaca tctcgtgaca cacacagccc cagcctcagc ctccatacac 241 acacacagbc accacacaca ctcactcatg caaaacacac acacagccac attgactgca 301 cccctagatt tgagacaggg gtgttacagg tgtcttgtca gccccagccc cctnctgaac 361 cacatggcca cctgccccac cctcacattt ccccacaccc tccccctact cccgtgtggg 421 gttcagaaga cccctccctg gtgctcgccc tgtcacccag aaaggaaacg cactgtcaac 481 tggagaacag gcggtgcaga gccccagcgc ccacagccct cggagagggc tggcgggagg 541 agaggcaccc accgatgcag agcctcagtg ccctcgcaga agccagcacc cacaacctcc 601 cacctcccaa ccccttgccg gtctccccac ccccgcctga gggatggcca aggaacagcc 661 cagctggagc gggaagcagg cacaagatgc acctcaggaa tcccaggctg agggggtgcc 721 tgcgtcctcc gnncccgagc tccactccgc aggcgagacc cccaacactg ctccccgaca 781 ctctccgagg cacccctcct ccgtgctcac acccaaacac gtctgctcac ccgtgggctt 841 cccaccccga acacaacgag gactgtccag gtgtcagtga cggtcacccc gacctggtnc 901 cctgcncggc ctngccaggt cagcacccgt ccgtcnncac agcagggtcc gggtggaagg 961 aggtccgggc ctggtccgtg gctcagctcc acctcactgg gaaactggga ggtggtctca 1021 gaacttctgg gcctcagctt ccccatctat cccaaggagc 
cggggtagcc tgggcggcgc 1081 ggggaccccc accgggctcg ccgactccgc ccgcggaggc ccctcccgtc cgcccctgcc 1141 cccccggggc gcgcggctgc ctgggaggct ccggccagcc gcggtccaga gcgcgcgagg 1201 ttcggggagc tccgccaggc tgctggtacc tgcgtccgcc cggcggtgag tccgcgcggc 1261 cccggcggga cgcccccgcc acctgcgcgc accgtcagac ccggcggcct cggctgcggt 1321 gcacgcggcc cggctcagcc acngcggcgg aaggccctgc gggggcgggg gagagggtgg 1381 gagagagcaa gaggggngnn nggggagggg caggagaggg gctggggctg ccggtgggtt 1441 ggtccccaga gagctgagct tctcctccca tcccccgcag gcctccccga atgtttccaa 1501 agatctgggg cgggacgggg cggagactgc cgcaggagcc tcccggcnnc cccaggctgc 1561 gcagccactg gagcccatca cccaactcca gacgctcctg gctcctctac gtgggccggg 1621 agggacagcc ttgaggacta ggggaggggg cacgggacct tgcagagctc ctggccggaa 1681 agggaggatt gaccgccccc ggcatatcac cccggagcac tggaacccgc ccccgcttct 1741 tgttttggca ctggtggtgc ttgcggtgag gtccaaggag cccagcctcc ctgagtggac 1801 cgccgggccc ctccccgttc cgggacacag gagagctccc gccccttgct ggctgggcag 1861 cccctagata cctggctccc aggggccagc ttccctgagc ctggggatga gccatgagcc 1921 tgcagcctgg ccccagggcg gccccccacg gcctgccggc tggctccctc cggctccatg 1981 gcccacccgg ccttcctaat tgccttcgcg tcccaccggt gccttagctc agcctggtgg 2041 cccagcggtt gggtgccacc cagtgagcag gtggcggcac cagctggacc tgtttgtggc 2101 cctgtatgct aggacttcct cagagacagc tcagggaccc cccccaccta ccagaacctc 2161 acttaggggg tgggggaaaa ggaattggag atccttcctc ctcggctact ccctggaggc 2221 ggggatagcc ccagccggac ttggcactgt ttttggccac ctggggctcc cactcccaac 2281 cccaggatgt cagcccaggt ctcactgtcc tggcctgctg ctctcccctt aggcctctgc 2341 gggcccctct ccagatctgt tctctgaggg catcctcctt acccccagtg cccagcacta 2401 gctccccagg cccgggatgt ccctcccact cctctgccca cgcgtgtccc ctgaagaagg 2461 cagcagctcc ccctccaaca ccatgcactc acaaaacaga gaatcacgac cccagctggt 2521 gtgttccaga ttctttctcc agcagtgcag agggtcctgt gcagaggccg aggagcagta 2581 cagcgaccca tctggccctt tcctgctggt gggaaccagt ggcaacgcag ccttgtctct 2641 cggagcccat ttcccagcca cagaatgggg agtagcagat actgagtggg gtgctttcct 2701 cggaactgag taacatcagg aatggggagt gctttccccc tcagccatgc 
cccacctggc 2761 cccaggtctc catgtgaggg gagctgcctg ggctcaaaac cactgagcca tcccaaggaa 2821 aattctagat acagattagc taatatacac tgacagatac acatatagat gtgtatatag 2881 atttctcttt tttttgagac agggtctcac tctgttgccc aggctgcagg aatgcatggc 2941 atgatcatag tatacaactt cagtacttga acatccctag agcatcaaga gatccatccc 3001 acctcagacc tcccaagata gcatgaccac aggcacgtac cacgcccggc taatttttgt 3061 attttttgta gaaaacagga ggtctccact gtgttgccca gggtggtctc aaacccctgg 3121 cctcatatga tcctcctgcc ttggcctcac atagcatgga gattacaggc tgagccctgt 3181 acccagcccc agactagcta atattatgac tgatcaccat tccccattcc ccattccccc 3241 caaccccagg accactggca gaggccactc actctgcctt cttttctgtg gttctgagaa 3301 ggctgctgag tttcctcctc ttgcctgtgc agccccttcc atgcaggttt caggagggag 3361 atagcctaga gcatgtctgc actgagtgag gagtgggtca ggagagaagt cagggaagcc 3421 ttgtgtgctg gcaggtccct tcccctctgc tgtctgtgtc ctcatctgca aagtgggggt 3481 gcatggtctt ggggaggagt gaggcaccac cccgggcccc cgtaaccagt gtgtctctcc 3541 agagcaggac aggctgcttt ggtttgtgac ctccaggcag gacggccatc ctctccagaa 3601 tgaagatctt cttgccagtg ctgctggctg cccttctggg tgtggagcga ggtgaggtgc 3661 ccttggggac cccagacctt tgtccagctg tgccctgctc cactccctct ccacccctct 3721 cccctgagca gacgctccag gggtccttcc aggccgctcc cagcagaggg ctcacccggc 3781 ctggccacac tgtctcactg tgtgtttgag tgtcgcttga cctgctcgac ggccagggtg 3841 gggtgtcact gtctttgctc tgccttcagc ccagggcctg gtatacagta atttctcagt 3901 aaatgtccac tggggtcagg cctgggaggg acactggagg cttccctgaa caggtgtgtc 3961 ctccttcgca gccagctcgc tgatgtgctt ctcctgcttg aaccagaaga gcaatctgta 4021 ctgcctgaag ccgaccatct gctccgacca ggacaactac tgcgtgactg tgtctgctag 4081 tgccggcatt ggtgagtgcc aggctcagac cgtgccttcc tcccctggcc atctccctag 4141 cccgggccgg ggctcagcag aggccattgc tgtctgtctg cagccgtctg tctctcccct 4201 gacagcctca tttcccatgc agggaatctc gtgacatttg gccacagcct gagcaagacc 4261 tgttccccgg cctgccccat cccagaaggc gtcaatgttg gtgtggcttc catgggcatc 4321 agctgctgcc agagctttct gtgcaatttc agtgcggccg atggcgggct gcgggcaagc 4381 gtcaccctgc tgggtgccgg gctgctgctg agcctgctgc cggccctgct gcggtttggc 
4441 ccctgaccgc ccagaccctg tcccccgatc ccccagctca ggaaggaaag cccagccctt 4501 tctggatccc acagtgtatg ggagcccctg actcctcacg tgcctgatct gtgcccttgg 4561 tcccaggtca ggcccacccc ctgcacctcc acctgcccca gcccctgcct ctgcccaagt 4621 gggccagctg ccctcacttc tggggtggat gatgtgacct tccttggggg actgcggaag 4681 ggacgagggt tccctggagt cttacggtcc aacatcagac caagtcccat ggacatgctg 4741 acagggtccc cagggagacc gtgtcagtag ggatgtgtgc ctggctgtgt acgtgggtgt 4801 gcagtgcacg tgagagcacg tggcggcttc tgggggccat gtttggggag ggaggtgtgc 4861 cagcagcctg gagagcctca gtccctgtag ccccctgccc tggcacagct gcatgcactt 4921 caagggcagc ctttgggggt tggggtttct gccacttccg ggtctaggcc ctgcccaaat 4981 ccagccagtc ctgccccagc ccacccccac attggagccc tcctgctgct ttggtgcctc 5041 aaataaatac agatgtcccc cagcttcctg ctctgagtgt ggctgcttgc gggagaggca 5101 gagcacccca ggtttggagg gtcctggggt cttctgtggt atggncaggg gggtggtggg 5161 ggaggaggag tcatccccct gagccacagc cagctgcttg cctgacctca gagggagccc 5221 ctccccagtg ctctgccctt tcttctgccc caggttcagc aggtcaagtg agttcctcct 5281 ccacacagnn naaggaggat gttggggagc atggaggaaa gaaagcgggt gcaagaagga 5341 gcacgctcag gctgggcagc acccagggcg tagacttggg aatggagggt gtctgtctgg 5401 acccgctggt gcagagggca caaaagctgg gctggggcag gacagacggg ccggagtgta 5461 gtgagtggcc ctgtgggagg ggcagggggc tcccacgcag aagctctgcg agcagccctg 5521 acccatccgt gcctggactg cag // LOCUS HSU66875 1569 bp DNA PRI 31-MAY-1997 DEFINITION Homo sapiens cytochrome oxidase subunit VIa heart isoform precursor (COX6AH) gene, complete cds. ACCESSION U66875 NID g2138177 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1569) AUTHORS Bachman,N.J., Riggs,P.K., Siddiqui,N., Makris,G.J., Womack,J.E. and Lomax,M.I. 
TITLE Structure of the human gene (COX6A2) for the heart/muscle isoform of cytochrome c oxidase subunit VIa and its chromosomal location in humans, mice, and cattle JOURNAL Genomics 42 (1), 146-151 (1997) MEDLINE 97321054 REFERENCE 2 (bases 1 to 1569) AUTHORS Lomax,M.I. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) Lomax M.I., The University of Michigan, Anatomy and Cell Biology, 1335 East Catherine, Ann Arbor, MI 48109-0616, USA FEATURES Location/Qualifiers source 1..1569 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p" repeat_unit 296..414 /rpt_type=dispersed /rpt_family="Alu" 5'UTR 727..801 /gene="COX6AH" mRNA join(727..874,976..1112,1281..1392) /gene="COX6AH" exon 727..874 /gene="COX6AH" /number=1 gene 727..1392 /gene="COX6AH" transit_peptide 802..837 /gene="COX6AH" CDS join(802..874,976..1112,1281..1364) /gene="COX6AH" /EC_number="1.9.3.1" /codon_start=1 /product="cytochrome oxidase subunit VIa heart isoform precursor" /db_xref="PID:g2138178" /translation="MALPLRPLTRGLASAAKGGHGGAGARTWRLLTFVLALPSVALCT FNSYLHSGHRPRPEFRPYQHLRIRTKPYPWGDGNHTLFHNSHVNPLPTGYEHP" mat_peptide join(838..874,976..1112,1281..1361) /gene="COX6AH" /EC_number="1.9.3.1" /product="cytochrome oxidase subunit VIa heart isoform" intron 875..975 /gene="COX6AH" /number=1 exon 976..1112 /gene="COX6AH" /number=2 intron 1113..1280 /gene="COX6AH" /number=2 exon 1281..1392 /gene="COX6AH" /number=3 3'UTR 1365..1392 /gene="COX6AH" BASE COUNT 345 a 472 c 450 g 302 t ORIGIN 1 aagcttttgg catccagggt ccagcagtgt gagctgtggg ggtgtcactg cagctggcca 61 acggaatgac ttgtttatga ctgtgcagac tgtaggcatc tctggctata tcagacatag 121 taagctacct aagattctca attacttatc ttctaaaact agtaagagtg agttctgttc 181 tttgcaaaga actgtgacta atacatatac tgaactactg ttacctgcca aaatccaggt 241 tgacacattc ctaccttaca taaacttcta taaaaatgat gtaagggccg tgtagtggct 301 catgcctgta atcccagcac tgtgggaggc tgaggcggga ggatcacgtg agctcaagag 361 ttccacacca gcctggacaa tatagtgaga tccccgtctc cagtatttaa aaaatctttt 421 aaggtacgaa 
ttattataca ttttacagct gaaaacactg agactcaaaa agacacttaa 481 ttcattctac aaatatttat taaagcccaa ctgtgtgcca gccaccaggc atagaatagg 541 gaacaaaata gtttccctgc aactatataa aaagaatggc atattgtact taaatgcctc 601 cttgccaaaa taagaggcac aggacaacag ctgtctggag atgacctcaa gccagtcagg 661 ccccatttta aatatagaaa cccctaagaa tagccgccag tgctccagac tcaacaggtg 721 attggcccag agaggggagg tgaccccagg ccccaggaaa gggagcgagg acagcgctgg 781 ttcccggctc cccgcaccat catggctttg cctctgaggc ccctgacccg gggcttggcc 841 agcgctgcca aaggaggcca cggaggagca ggaggtgagt ggggaacggg cggatccggg 901 ggctccccta ccctgcccac ctgttcacag gcccgccgcc ccaggaccgc cgcgctcacc 961 ccgctccgtc cgcagctcgt acctggcgtc tgctgacctt cgtgctggcg ctgcccagcg 1021 tggccctctg caccttcaac tcctatctcc actcgggcca ccgcccgcgc cccgagttcc 1081 gtccctacca acacctccgc atccgcacca aggtacgcgg gacgggcgcg cgggcggcac 1141 gggggtgctg cgggggcggg ggggggtgct gcgggggcgg gggggggtgc tgcggggggg 1201 gggtgcgcgg ggcgcgcggg gcggggcgcg gactccggac tcacggctca cgcgagctcc 1261 ctcccccgct ctctccacag ccctacccct ggggggacgg caaccacact ctgttccaca 1321 atagccacgt gaaccctctg cccacgggct acgaacaccc ctgaggcccc ggacgccccc 1381 ggacacaata aaggtgtgaa gcttcgagtc tgcggctctg tggggagggc ggggccacgg 1441 ggagcgcgcc cagaggcgcc cgctccgcgc atgcgcccag cttggtaagc gctcctgctc 1501 cgcgcatgcg cctagcgcgg tgggcgctcc acgtgctttt cccacgccgc tgatacggag 1561 gtgctcgag // LOCUS HSU70732 3136 bp DNA PRI 19-MAR-1997 DEFINITION Human glutamate pyruvate transaminase (GPT) gene, complete cds. ACCESSION U70732 NID g1763095 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3136) AUTHORS Sohocki,M.M., Sullivan,L.S., Harrison,W.R., Sodergren,E.J., Elder,F.F., Weinstock,G., Tanase,S. and Daiger,S.P. 
TITLE Human glutamate pyruvate transaminase (GPT): localization to 8q24.3, cDNA and genomic sequences, and polymorphic sites JOURNAL Genomics 40 (2), 247-252 (1997) MEDLINE 97237043 REFERENCE 2 (bases 1 to 3136) AUTHORS Sohocki,M.M., Sullivan,L.S., Sodergren,E.J., Tanase,S., Daiger,S.P. and Weinstock,G. TITLE Direct Submission JOURNAL Submitted (13-SEP-1996) Human Genetics Center, Univ. of Texas - Houston, P.O. Box 20334, Houston, TX 77225, USA FEATURES Location/Qualifiers source 1..3136 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q24.3" mRNA join(<1..429,562..651,733..841,973..1106,1222..1465, 1820..1899,1967..2103,2203..2377,2471..2626,2701..2813, 2880..>3136) /gene="GPT" 5'UTR <1..267 /gene="GPT" gene 1..3136 /gene="GPT" CDS join(268..429,562..651,733..841,973..1106,1222..1465, 1820..1899,1967..2103,2203..2377,2471..2626,2701..2813, 2880..2970) /gene="GPT" /EC_number="2.6.1.2" /codon_start=1 /product="glutamate pyruvate transaminase" /db_xref="PID:g1763096" /translation="MASSTGDRSQAVRHGLRAKVLTLDGMNPRVRRVEYAVRGPIVQR ALELEQELRQGVKKPFTEVIRANIGDAQAMGQRPITFLRQVLALCVNPDLLSSPNFPD DAKKRAERILQACGGHSLGAYSVSSGIQLIREDVARYIERRDGGIPADPNNVFLSTGA SDAIVTVLKLLVAGEGHTRTGVLIPIPQYPLYSATLAELGAVQVDYYLDEERAWALDV AELHRALGQARDHCRPRALCVINPGNPTGQVQTRECIEAVIRFAFEERLFLLADEVYQ DNVYAAGSQFHSFKKVLMEMGPPYAGQQELASFHSTSKGYMGECGFRGGYVEVVNMDA AVQQQMLKLMSVRLCPPVPGQALLDLVVSPPAPTDPSFAQFQAEKQAVLAELAAKAKL TEQVFNEAPGISCNPVQGAMYSFPRVQLPPRAVERAQELGLAPDMFFCLRLLEETGIC VVPGSGFGQREGTYHFRMTILPPLEKLRLLLEKLSRFHAKFTLEYS" allele 307 /gene="GPT" /note="creates an amino acid change from His (type 1 GPT) to Asn (type 2 GPT)" /replace="a" 3'UTR 2971..>3136 /gene="GPT" BASE COUNT 456 a 1135 c 1009 g 534 t 2 others ORIGIN 1 ccccgccttc acccactgcc tctgcctccc tggggcagag ctgtttccca gacgggtggg 61 gcggggccca actgtcccag ctccttcagc cctttctgtc cctcccagtg aggccagctg 121 cggtgaagag ggtgctctct tgcctggcgt tccctctgca cggctgcccc ctcccaccct 181 gcccactaag ccagacccac tgtcgccatt cccacttctg gtcctgccac ctcctgagct 241 
gccttcccgc ctggtctggg tagagtcatg gcctcgagca caggtgaccg gagccaggcg 301 gtgaggcatg gactgagggc gaaggtgctg acgctggacg gcatgaaccc gcgtgtgcgg 361 agagtggagt acgcagtgcg tggccccata gtgcagcgag ccttggagct ggagcaggag 421 ctgcgccagg tatggcccag ggcccctcgc tgccctccag gtcacaatgg ggtggccgag 481 ttggccccag ccccactggc acatgggaca agggctgagg ggttaggtag ggcacagtct 541 ccctgcctgc acccctccca gggtgtgaag aagcctttca ccgaggtcat ccgtgccaac 601 atcggggacg cacaggctat ggggcagagg cccatcacct tcctgcgcca ggtgaggctt 661 ctctgcactc cctcccggag caccccccca cccccagccc atgtaccctg gcctcagcac 721 tccgtctctc aggtcttggc cctctgtgtt aaccctgatc ttctgagcag ccccaacttc 781 cctgacgatg ccaagaaaag ggcggagcgc atcttgcagg cgtgtggggg ccacagtctg 841 ggtgagagag agccagggcc agggccaggc ggaagcagag ggccgccctt gccactggag 901 gagggaagtc ccttgggagg gctgaggaga acttcacctg taccttccca atcctttcct 961 gcccgactcc aggggcctac agcgtcagct ccggcatcca gctgatccgg gaggacgtgg 1021 cgcggtacat tgagaggcgt gacggaggca tccctgcgga ccccaacaac gtcttcctgt 1081 ccacaggggc cagcgatgcc atcgtggtag gctgggcatg ggcaccaaga cattcctgac 1141 actgcagagg gggcgcccag ggtgggggac aggtgcggcc ccaggccctc gccaaccctg 1201 ccttcccctt cctttccgca gacggtgctg aagctgctgg tggccggcga gggccacaca 1261 cgcacgggtg tgctcatccc catcccccag tacccactct actcggccac gctggcagag 1321 ctgggcgcag tgcaggtgga ttactacctg gacgaggagc gtgcctgggc gctggacgtg 1381 gccgagcttc accgtgcact gggccaggcg cgtgaccact gccgccctcg tgcgctctgt 1441 gtcatcaacc ctggcaaccc caccggtgcg ttccccgccg ccccgcccaa ttccccccgc 1501 gccaacgttg cgttccccgc cgccccgccc aattcccccc gcgcccacgg tgcgttcccc 1561 gccgccccgc ccaactcccc ccgcgcccac ggtgcgttcc ccgccgcccc gccccactcc 1621 ccccgcgccc acggtgcgtt ccccgccgcc ccgccccact ccctccgcgc ccacggtgcg 1681 ttccccgccg ccccgcccca ctcctcccca cccaacgtgc gccctggccc ggcnngccgg 1741 tccgctggac cccgctgccc agcggagggc agtgcgcccc cttggctcac ccagcactgc 1801 tgcctccccg gcaccccagg gcaggtgcag acccgcgagt gcatcgaggc cgtgatccgc 1861 ttcgccttcg aagagcggct ctttctgctg gcggacgagg tcgcggcggg ggagcgggga 1921 gccgggcaac agtccgcccc 
cgtgacgcct tgcgcccctt accgaggtgt accaggacaa 1981 cgtgtacgcc gcgggttcgc agttccactc attcaagaag gtgctcatgg agatggggcc 2041 gccctacgcc gggcagcagg agcttgcctc cttccactcc acctccaagg gctacatggg 2101 cgagtgcgtg caacgaggcg ggtgggggct cgcgggccat ggccaggccc tcctcgcccg 2161 atgggccacc ccctcctccg cacctgacct ggccgtgcgc aggtgcgggt tccgcggcgg 2221 ctatgtggag gtggtgaaca tggacgctgc agtgcagcag cagatgctga agctgatgag 2281 tgtgcggctg tgcccgccgg tgccaggaca ggccctgctg gacctggtgg tcagcccgcc 2341 cgcgcccacc gacccctcct ttgcgcagtt ccaggctgtg agttgggggc aggagggggt 2401 ccaggtgacc taatcagggg tgggggatcc gagtgccgtg ccctgatggg ccctccctcc 2461 gcggccacag gagaagcagg cagtgctggc agagctggcg gccaaggcca agctcaccga 2521 gcaggtcttc aatgaggctc ctggcatcag ctgcaaccca gtgcagggcg ccatgtactc 2581 cttcccgcgc gtgcagctgc ccccgcgggc ggtggagcgc gctcaggtca ggcgggggcg 2641 gggcctgcgg ggtgggtagg ggggtttggg tatccctctc tgacggctct ccgtccacag 2701 gagctgggcc tggcccccga tatgttcttc tgcctgcgcc tcctggagga gaccggcatc 2761 tgcgtggtgc cagggagcgg ctttgggcag cgggaaggca cctaccactt ccggtgaggc 2821 ctggccctca ctccctgtcc cgccaccctg gcccttcact cactgtcaac tcctttcagg 2881 atgaccattc tgcccccctt ggagaaactg cggctgctgc tggagaagct gagcaggttc 2941 catgccaagt tcaccctcga gtactcctga gcaccccagc tggggccagg ctgggtcgcc 3001 ctggactgtg tgctcaggag ccctgggagg ctctggagcc cactgtactt gctcttgatg 3061 cctggcgggg tggggtgggg ggggtgctgg gcccctgcct ctctgcaggt ccctaataaa 3121 gctgtgtggc agtctg // LOCUS HSU73002 3240 bp DNA PRI 14-MAY-1997 DEFINITION Human ADP-ribosylation factor 5 (ARF5) gene, complete cds. ACCESSION U73002 NID g2088528 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3240) AUTHORS McGuire,R.E., Daiger,S.P. and Green,E.D. 
TITLE Localization and characterization of the human ADP-ribosylation factor 5 (ARF5) gene JOURNAL Genomics 41 (3), 481-484 (1997) MEDLINE 97312710 REFERENCE 2 (bases 1 to 3240) AUTHORS McGuire,R.E. and Green,E.D. TITLE Direct Submission JOURNAL Submitted (01-OCT-1996) Human Genetics Center, University of Texas, 6901 Bertner, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..3240 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q31.3" gene 37..2841 /gene="ARF5" CDS join(37..103,620..700,1027..1136,1607..1678,2505..2630, 2755..2841) /gene="ARF5" /codon_start=1 /product="ADP-ribosylation factor 5" /db_xref="PID:g2088529" /translation="MGLTVSALFSRIFGKKQMRILMVGLDAAGKTTILYKLKLGEIVT TIPTIGFNVETVEYKNICFTVWDVGGQDKIRPLWRHYFQNTQGLIFVVDSNDRERVQE SADELQKMLQEDELRDAVLLVFANKQDMPNAMPVSELTDKLGLQHLRSRTWYVQATCA TQGTGLYDGLDWLSHELSKR" BASE COUNT 601 a 931 c 869 g 835 t 4 others ORIGIN 1 ccgcgtcggt gcccgcgccc ctccccgggc cccgccatgg gcctcaccgt gtccgcgctc 61 ttttcgcgga tcttcgggaa gaagcagatg cggattctca tgggtgaggc agatcgagcg 121 cgcggcccgg accggggcgc cggccccggc gcagcccttc cgcccccgcg tccctccagc 181 cccgctcacc tgggtctctg gccccgagtc acccacctca taacgccccg gggcctctgc 241 tctcgggcgg gtcccggtct gcatcgccga ccccggggcc tgacacccgg agctgcgggc 301 ctgggtgggg tgaagctccc tccgccccag cccggctggt aagaagggag gattccgccc 361 ttggagacga cttttaaaac gagcgcggcc tacctcccgt gccccttgcc tccagtcctc 421 tcccgctccg cgccctcttt ggagttggct cacgccaccc gactgccctc caggcctcaa 481 aggtagataa cgacgcgcac ctggaggcgc tcccgctctc cgccccagtc accatcagct 541 gttgtggcct ctccctttcc ctggtctcta gggggctccc tcgctcccat ctccatccct 601 gtgccccttt ccgttgcagt tggcttggat gcggctggca agaccacaat cctgtacaaa 661 ctgaagttgg gggagattgt caccaccatc ccaaccatag gtgagcccgg ggacgaagca 721 gggagcggga gcgccgcggn cgggccctcg cggggtctgc tcccagttct gccacttttc 781 tggagctgac cctgacacca gntacagcta tttgagaagt gggtcagggt atctctacgt 841 ggataccgga ggcagggtgg atgtgaccta gccccgccct gttgaccgcc agttctgggg 901 ttctgttccc ctgggccttg atcattgcct 
tctggttttg tgtccctgct gaaatctaag 961 tagttttagt gagttccttt ctttcaggag ttttctcatc cttttttttt tccaatttat 1021 ctgcaggctt caatgtagaa acagtggaat ataagaacat ctgtttcaca gtctgggacg 1081 tgggaggcca ggacaagatt cggcctctgt ggcggcacta cttccagaac actcaggtgg 1141 agtgttggga ggggactttc taaccccacg ggaaaaggtg ttagggctgg gagaacagaa 1201 ttgttggcct aggctgggcc cccaggcaag gaaaaaaccc ttccccactt ctaccccttc 1261 tgtttggcta tcccctactc actcttcaga actcacctta gtcttcccca gcttggactt 1321 ctctttagaa gcaggattct gggaccaggc tgcctgattt agccctcata acatgtcttt 1381 atttccttgc acatctttgg ataaactact tcctctttgc agtggggtct ctagtttgtg 1441 ccatgactcc tgtagagctt ctatagagtt ctgtaaactt ctgtagagtt tactagatga 1501 gaaggattga cacaattaca aatgggaaga aaacagaaag ggccacacta cggtatgtcc 1561 caggcagaag tgattgcttc tcctttcttc cttccttcct gcccagggcc tcatctttgt 1621 ggtggacagt aatgaccggg agcgggtcca agaatctgct gatgaactcc agaagatggt 1681 gagtacccag agccctggga actgagccct cagcttgggg acagagtgat ctctgcagtg 1741 gtatagaagt cagggagccc ccaacaggca ttgaagacct ggaatataag tttttctttg 1801 tggacagaca cattgtgtat ctcaccctgt tttggatggg agtgactttt tcactttttc 1861 tgtgcattgt cttgtctttt tttttttttg aaatgggagt tttgctgttg ttgcccaggc 1921 tggagttgca atggtgcaat ctcggctcac tgcaacctcc gtctcccggg ttcaagcaat 1981 tctccggcct cagcctccca agtagctggg attacaggcg cccgctgcca tgcccagtta 2041 atttttgtat ttttagtana gacggggttt caccatgttg gccaggctgg tcttgaaccc 2101 ccaacctcag gtgatccacc cgtcttggtc tcccaaagtg ctgggattac aggagtgagc 2161 cactgcgccc ggccgcattc tcntgtcttt acatctgagt ctgagaagtt gtgggagaga 2221 tgggacttat atgtggcaaa gagaaacatt ttacattcct gccttttcca tttggaaatg 2281 ttgtcatttt gcctaggcaa gtactcaaat taggaaaagg tctttctttt ctggacttat 2341 gggctcaggt cacctcaagc tcatttacaa ctatgtcttg tcttcaacgg cagactgcac 2401 aatacttttt ttttccaatc aagatgctag gaggtagaaa gggagttccc caacacagta 2461 gaggagacgc agcctgggtc ccaccttctc ttctcctctc tcagctgcag gaggacgagc 2521 tgcgggatgc agtgctgctg gtatttgcca acaagcagga catgcccaac gccatgcccg 2581 tgagcgagct gactgacaag ctggggctac agcacttacg 
cagccgcacg gtaggggtcc 2641 tgcccacctg gtgctgaatc ctgcctcttg agggaagctg caggctggga cagagatata 2701 aaggcattcc ttttgttcct ggttgctgac ctctttcttt ccttttcccc acagtggtat 2761 gtccaggcca cctgtgccac ccaaggcaca ggtctgtacg atggtctgga ctggctgtcc 2821 cacgagctgt caaagcgcta accagccagg ggcaggcccc tgatgcccgg aagctcctgc 2881 gtgcatcccc gggatgacca gactcccgga ctcctcaggc agtgcccttt cctcccactt 2941 ttcctccccc atagccacag gcctctgctc ctgctcctgc ctgcatgttc tctctgttgt 3001 tggagcctgg agccttgctc tctgggcaca gaggggtcca ctctcctgcc tgctgggacc 3061 tatggaaggg gcttcctggc caaggccccc tcttccagag gaggagcagg gatctgggtt 3121 tccttttttt tttctgtttt gggtgtactc taggggccag gttgggaggg ggaaggtgag 3181 ggcttcgggt ggtgctataa tgtggcactg gatcttgagt aataaatttg ctgtggtttg // LOCUS HSU75285 14796 bp DNA PRI 09-AUG-1997 DEFINITION Homo sapiens apoptosis inhibitor survivin gene, complete cds. ACCESSION U75285 NID g2315862 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14796) AUTHORS Altieri,D.C. TITLE Molecular cloning of effector cell protease receptor-1, a novel cell surface receptor for the protease factor Xa JOURNAL J. Biol. Chem. 269 (5), 3139-3142 (1994) MEDLINE 94148797 REFERENCE 2 (bases 1 to 14796) AUTHORS Altieri,D.C. TITLE Splicing of effector cell protease receptor-1 mRNA is modulated by an unusual retained intron JOURNAL Biochemistry 33 (46), 13848-13855 (1994) MEDLINE 95034823 REFERENCE 3 (bases 1 to 14796) AUTHORS Ambrosini,G., Adida,C. and Altieri,D.C. TITLE A novel anti-apoptosis gene, survivin, expressed in cancer and lymphoma JOURNAL Nat. Med. 3 (8), 917-921 (1997) MEDLINE 97398388 REFERENCE 4 (bases 1 to 14796) AUTHORS Altieri,D.C., Adida,C. and Ambrosini,G. 
TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) Pathology, Boyer Center for Molecular Medicine Rm.436B, Yale University School of Medicine, 295 Congress Ave., New Haven, CT 06536, USA FEATURES Location/Qualifiers source 1..14796 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25" mRNA join(2762..2921,3174..3283,5158..5275,11955..13185) /product="apoptosis inhibitor survivin" CDS join(2811..2921,3174..3283,5158..5275,11955..12044) /function="IAP-related inhibitor of apoptosis over-expressed in common human cancers, in vivo" /note="Recombinant expression of survivin counteracts apoptosis induced by growth factor (IL-3) withdrawal in pre-B cell transfectants. Survivin expression is developmentally regulated: present during fetal development but undetectable in terminally differentiated normal adult tissues, survivin is abundantly expressed in transformed cell lines and in all the most common solid cancers and in 50% of high-grade non Hodgkin's lymphoma. The coding strand of the survivin gene product is highly homologous and complementary to the sequence of Effector cell Protease Receptor-1 (EPR-1) encoded by GenBank Accession Numbers L26245 and L32866. 
Single-strand specific probes identify two separate mRNA transcripts for EPR-1 and survivin with distinct molecular sizes and non-overlapping distribution in fetal and adult tissues" /codon_start=1 /product="apoptosis inhibitor survivin" /db_xref="PID:g2315863" /translation="MGAPTLPPAWQPFLKDHRISTFKNWPFLEGCACTPERMAEAGFI HCPTENEPDLAQCFFCFKELEGWEPDDDPIEEHKKHSSGCAFLSVKKQFEELTLGEFL KLDRERAKNKIAKETNNKKKEFEETAKKVRRAIEQLAAMD" polyA_site 13185..13186 BASE COUNT 3482 a 3488 c 3888 g 3938 t ORIGIN 1 tctagacatg cggatatatt caagctgggc acagcacagc agccccaccc caggcagctt 61 gaaatcagag ctggggtcca aagggaccac accccgaggg actgtgtggg ggtcggggca 121 cacaggccac tgcttccccc cgtctttctc agccattcct gaagtcagcc tcactctgct 181 tctcagggat ttcaaatgtg cagagactct ggcacttttg tagaagcccc ttctggtcct 241 aacttacacc tggatgctgt ggggctgcag ctgctgctcg ggctcgggag gatgctgggg 301 gcccggtgcc catgagcttt tgaagctcct ggaactcggt tttgagggtg ttcaggtcca 361 ggtggacacc tgggctgtcc ttgtccatgc atttgatgac attgtgtgca gaagtgaaaa 421 ggagttaggc cgggcatgct ggcttatgcc tgtaatccca gcactttggg aggctgaggc 481 gggtggatca cgaggtcagg agttcaatac cagcctggcc aagatggtga aaccccgtct 541 ctactaaaaa tacaaaaaaa ttagccgggc atggtggcgg gcgcatgtaa tcccagctac 601 tgggggggct gaggcagaga attgctggaa cccaggagat ggaggttgca gtgagccaag 661 attgtgccac tgcactgcac tccagcctgg cgacagagca agactctgtc tcaaaaaaaa 721 aaaaaaaaag tgaaaaggag ttgttccttt cctccctcct gagggcaggc aactgctgcg 781 gttgccagtg gaggtggtgc gtccttggtc tgtgcctggg ggccacccca gcagaggcca 841 tggtggtgcc agggcccggt tagcgagcca atcagcagga cccaggggcg acctgccaaa 901 gtcaactgga tttgataact gcagcgaagt taagtttcct gattttgatg attgtgttgt 961 ggttgtgtaa gagaatgaag tatttcgggg tagtatggta atgccttcaa cttacaaacg 1021 gttcaggtaa accacccata tacatacata tacatgcatg tgatatatac acatacaggg 1081 atgtgtgtgt gttcacatat atgaggggag agagactagg ggagagaaag taggttgggg 1141 agagggagag agaaaggaaa acaggagaca gagagagagc ggggagtaga gagagggaag 1201 gggtaagaga gggagaggag gagagaaagg gaggaagaag cagagagtga atgttaaagg 1261 aaacaggcaa aacataaaca 
gaaaatctgg gtgaagggta tatgagtatt ctttgtacta 1321 ttcttgcaat tatcttttat ttaaattgac atcgggccgg gcgcagtggc tcacatctgt 1381 aatcccagca ctttgggagg ccgaggcagg cagatcactt gaggtcagga gtttgagacc 1441 agcctggcaa acatggtgaa accccatctc tactaaaaat acaaaaatta gcctggtgtg 1501 gtggtgcatg cctttaatct cagctactcg ggaggctgag gcaggagaat cgcttgaacc 1561 cgtggcgggg aggaggttgc agtgagctga gatcatgcca ctgcactcca gcctgggcga 1621 tagagcgaga ctcagtttca aataaataaa taaacatcaa aataaaaagt tactgtatta 1681 aagaatgggg gcggggtggg aggggtgggg agaggttgca aaaataaata aataaataaa 1741 taaaccccaa aatgaaaaag acagtggagg caccaggcct gcgtggggct ggagggctaa 1801 taaggccagg cctcttatct ctggccatag aaccagagaa gtgagtggat gtgatgccca 1861 gctccagaag tgactccaga acaccctgtt ccaaagcaga ggacacactg attttttttt 1921 taataggctg caggacttac tgttggtggg acgccctgct ttgcgaaggg aaaggaggag 1981 tttgccctga gcacaggccc ccaccctcca ctgggctttc cccagctccc ttgtcttctt 2041 atcacggtag tggcccagtc cctggcccct gactccagaa ggtggccctc ctggaaaccc 2101 aggtcgtgca gtcaacgatg tactcgccgg gacagcgatg tctgctgcac tccatccctc 2161 ccctgttcat ttgtccttca tgcccgtctg gagtagatgc tttttgcaga ggtggcaccc 2221 tgtaaagctc tcctgtctga cttttttttt ttttttagac tgagttttgc tcttgttgcc 2281 taggctggag tgcaatggca caatctcagc tcactgcacc ctctgcctcc cgggttcaag 2341 cgattctcct gcctcagcct cccgagtagt tgggattaca ggcatgcacc accacgccca 2401 gctaattttt gtatttttag tagagacaag gtttcaccgt gatggccagg ctggtcttga 2461 actccaggac tcaagtgatg ctcctgccta ggcctctcaa agtgttggga ttacaggcgt 2521 gagccactgc acccggcctg cacgcgttct ttgaaagcag tcgagggggc gctaggtgtg 2581 ggcagggacg agctggcgcg gcgtcgctgg gtgcaccgcg accacgggca gagccacgcg 2641 gcgggaggac tacaactccc ggcacacccc gcgccgcccc gcctctactc ccagaaggcc 2701 gcggggggtg gaccgcctaa gagggcgtgc gctcccgaca tgccccgcgg cgcgccatta 2761 accgccagat ttgaatcgcg ggacccgttg gcagaggtgg cggcggcggc atgggtgccc 2821 cgacgttgcc ccctgcctgg cagccctttc tcaaggacca ccgcatctct acattcaaga 2881 actggccctt cttggagggc tgcgcctgca ccccggagcg ggtgagactg cccggcctcc 2941 tggggtcccc cacgcccgcc ttgccctgtc 
cctagcgagg ccactgtgac tgggcctcgg 3001 gggtacaagc cgccctcccc tccccgtcct gtccccagcg aggccactgt ggctgggccc 3061 cttgggtcca ggccggcctc ccctccctgc tttgtcccca tcgaggcctt tgtggctggg 3121 cctcggggtt ccgggctgcc acgtccactc acgagctgtg ctgtcccttg cagatggccg 3181 aggctggctt catccactgc cccactgaga acgagccaga cttggcccag tgtttcttct 3241 gcttcaagga gctggaaggc tgggagccag atgacgaccc catgtaagtc ttctctggcc 3301 agcctcgatg ggctttgttt tgaactgagt tgtcaaaaga tttgagttgc aaagacactt 3361 agtatgggag ggttgctttc caccctcatt gcttcttaaa cagctgttgt gaacggatac 3421 ctctctatat gctggtgcct tggtgatgct tacaacctaa ttaaatctca tttgaccaaa 3481 atgccttggg gtggacgtaa gatgcctgat gcctttcatg ttcaacagaa tacatcagca 3541 gaccctgttg ttgtgaactc ccaggaatgt ccaagtgctt tttttgagat tttttaaaaa 3601 acagtttaat tgaaatataa cctacacagc acaaaaatta ccctttgaaa gtgtgcactt 3661 cacactttcg gaggctgagg cgggcggatc acctgaggtc aggagttcaa gacctgcctg 3721 gccaacttgg cgaaaccccg tctctactaa aaatacaaaa attagccggg catggtagcg 3781 cacgcccgta atcccagcta ctcgggaggc taaggcagga gaatcgcttg aacctgggag 3841 gcggaggttg cagtgagccg agattgtgcc aatgcactcc agcctcggcg acagagcgag 3901 actccgtcat aaaaataaaa aattgaaaaa aaaaaaagaa agaaagcata tacttcagtg 3961 ttgttctgga tttttttctt caagatgcct agttaatgac aatgaaattc tgtactcgga 4021 tggtatctgt ctttccacac tgtaatgcca tattcttttc tcaccttttt ttctgtcgga 4081 ttcagttgct tccacagctt taattttttt cccctggaga atcaccccag ttgtttttct 4141 ttttggccag aagagagtag ctgttttttt tcttagtatg tttgctatgg tggttatact 4201 gcatccccgt aatcactggg aaaagatcag tggtattctt cttgaaaatg aataagtgtt 4261 atgatatttt cagattagag ttacaactgg ctgtcttttt ggactttgtg tggccatgtt 4321 ttcattgtaa tgcagttctg gtaacggtga tagtcagtta tacagggaga ctcccctagc 4381 agaaaatgag agtgtgagct agggggtccc ttggggaacc cggggcaata atgcccttct 4441 ctgcccttaa tccttacagt gggccgggca cggtggctta cgcctgtaat accagcactt 4501 tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatctt ggctaatacg 4561 gtgaaacccc gtctccacta aaaatacaaa aaattagccg ggcgtggtgg tgggcgcctg 4621 tagtcccagc tactcgggag gctgaggcag gagaatggcg 
tgaacccagg aggcggagct 4681 tgcagtgagc cgagattgca ccactgcact ccagcctggg cgacagaatg agactccgtc 4741 tcaaaaaaaa aaaaaaaaga aaaaaatctt tacagtggat tacataacaa ttccagtgaa 4801 atgaaattac ttcaaacagt tccttgagaa tgttggaggg atttgacatg taattccttt 4861 ggacatatac catgtaacac ttttccaact aattgctaag gaagtccaga taaaatagat 4921 acattagcca cacagatgtg gggggagatg tccacaggga gagagaaggt gctaagaggt 4981 gccatatggg aatgtggctt gggcaaagca ctgatgccat caacttcaga cttgacgtct 5041 tactcctgag gcagagcagg gtgtgcctgt ggagggcgtg gggaggtggc ccgtggggag 5101 tggactgccg ctttaatccc ttcagctgcc tttccgctgt tgttttgatt tttctagaga 5161 ggaacataaa aagcattcgt ccggttgcgc tttcctttct gtcaagaagc agtttgaaga 5221 attaaccctt ggtgaatttt tgaaactgga cagagaaaga gccaagaaca aaattgtatg 5281 tattgggaat aagaactgct caaaccctgt tcaatgtctt tagcactaaa ctacctagtc 5341 cctcaaaggg actctgtgtt ttcctcagga agcatttttt ttttttttct gagatagagt 5401 ttcactcttg ttgcccaggc tggagtgcaa tggtgcaatc ttggctcact gcaacctctg 5461 cctctcgggt tcaagtgatt ctcctgcctc agcctcccaa gtaactggga ttacagggaa 5521 gtgccaccac acccagctaa tttttgtatt tttagtagag atggggtttc accacattgc 5581 ccaggctggt cttgaactcc tgacctcgtg attcgcccac cttggcctcc caaagtgctg 5641 ggattacagg cgtgaaccac cacgcctggc tttttttttt ttgttctgag acacagtttc 5701 actctgttac ccaggctgga gtagggtggc ctgatctcgg atcactgcaa cctccgcctc 5761 ctgggctcaa gtgatttgcc tgcttcagcc tcccaagtag ccgagattac aggcatgtgc 5821 caccacaccc aggtaatttt tgtatttttg gtagagacga ggtttcacca tgttggccag 5881 gctggttttg aactcctgac ctcaggtgat ccacccgcct cagcctccca aagtgctgag 5941 attataggtg tgagccacca cacctggcct caggaagtat ttttattttt aaatttattt 6001 atttatttga gatggagtct tgctctgtcg cccaggctag agtgcagcga cgggatctcg 6061 gctcactgca agctccgccc cccaggttca agccattctc ctgcctcagc ctcccgagta 6121 gctgggacta caggcgcccg ccaccacacc cggctaattt ttttgtattt ttagtagaga 6181 cgggttttca ccgtgttagc caggagggtc ttgatctcct gacctcgtga tctgcctgcc 6241 tcggcctccc aaagtgctgg gattacaggt gtgagccacc acacccggct atttttattt 6301 ttttgagaca gggactcact ctgtcacctg ggctgcagtg cagtggtaca 
ccatagctca 6361 ctgcagcctc gaactcctga gctcaagtga tcctcccacc tcatcctcac aagtaattgg 6421 gactacaggt gcaccccacc atgcccacct aatttattta tttatttatt tatttatttt 6481 catagagatg agggttccct gtgttgtcca ggctggtctt gaactcctga gctcacggga 6541 tccttttgcc tgggcctccc aaagtgctga gattacaggc atgagccacc gtgcccagct 6601 aggaatcatt tttaaagccc ctaggatgtc tgtgtgattt taaagctcct ggagtgtggc 6661 cggtataagt atataccggt ataagtaaat cccacatttt gtgtcagtat ttactagaaa 6721 cttagtcatt tatctgaagt tgaaatgtaa ctgggcttta tttatttatt tatttattta 6781 tttattttta attttttttt ttgagacgag tctcactttg tcacccaggc tggagtgcag 6841 tggcacgatc tcggctcact gcaacctctg cctcccgggg tcaagcgatt ctcctgcctt 6901 agcctcccga gtagctggga ctacaggcac gcaccaccat gcctggctaa tttttgtatt 6961 tttagtagac ggggtttcac catgctggcc aagctggtct caaactcctg accttgtgat 7021 ctgcccgctt tagcctccca gagtgctggg attacaggca tgagccacca tgcgtggtct 7081 ttttaaaatt ttttgatttt tttttttttt gagacagagc cttgctctgt cgcccaggct 7141 ggagtgcagt ggcacgatct cagctcacta caagctccgc ctcccgggtt cacgccattc 7201 ttctgcctca gcctcctgag tagctgggac tacaggtgcc caccaccacg cctggctaat 7261 tttttttggt atttttatta gagacaaggt ttcatcatgt tggccaggct ggtctcaaac 7321 tcctgacctc aagtgatctg cctgcctcgg cctcccaaag cgctgagatt acaggtgtga 7381 tctactgcgc caggcctggg cgtcatatat tcttatttgc taagtctggc agccccacac 7441 agaataagta ctgggggatt ccatatcctt gtagcaaagc cctgggtgga gagtcaggag 7501 atgttgtagt tctgtctctg ccacttgcag actttgagtt taagccagtc gtgctcatgc 7561 tttccttgct aaatagaggt tagaccccct atcccatggt ttctcaggtt gcttttcagc 7621 ttgaaaattg tattcctttg tagagatcag cgtaaaataa ttctgtcctt atatgtggct 7681 ttattttaat ttgagacaga gtgtcactca gtcgcccagg ctggagtgtg gtggtgcgat 7741 cttggctcac tgcgacctcc acctcccagg ttcaagcgat tctcgtgcct caggctccca 7801 agtagctgag attataggtg tgtgccacca ggcccagcta acttttgtat ttttagtaga 7861 gacagggttt tgccatgttg gctaagctgg tctcgaactc ctggcctcaa gtgatctgcc 7921 cgccttggca tcccaaagtg ctgggattac aggtgtgaac caccacacct ggcctcaata 7981 tagtggcttt taagtgctaa ggactgagat tgtgttttgt caggaagagg ccagttgtgg 
8041 gtgaagcatg ctgtgagaga gcttgtcacc tggttgaggt tgtgggagct gcagcgtggg 8101 aactggaaag tgggctgggg atcatctttt tccaggtcag gggtcagcca gcttttctgc 8161 agcgtgccat agaccatctc ttagccctcg tgggtcagag tctctgttgc atattgtctt 8221 ttgttgtttt tcacaacctt ttagaaacat aaaaagcatt cttagcccgt gggctggaca 8281 aaaaaaggcc atgacgggct gtatggattt ggcccagcag gcccttgctt gccaagccct 8341 gttttagaca aggagcagct tgtgtgcctg gaaccatcat gggcacaggg gaggagcaga 8401 gtggatgtgg aggtgtgagc tggaaaccag gtcccagagc gctgagaaag acagagggtt 8461 tttgcccttg caagtagagc aactgaaatc tgacaccatc cagttccaga aagccctgaa 8521 gtgctggtgg acgctgcggg gtgctccgct ctagggttac agggatgaag atgcagtctg 8581 gtagggggag tccactcacc tgttggaaga tgtgattaag aaaagtagac tttcagggcc 8641 gggcatggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg ggtggatcac 8701 gaggtcagga gatcgagacc atcctggcta acatggtgaa accccgtctt tactaaaaat 8761 acaaaaaatt agctgggcgt ggtggcgggc gcctgtagtc ccagctactc gggaggctga 8821 ggcaggagaa tggcgtgaac ctgggaggtg gagcttgctg tgagccgaga tcgcgccact 8881 gcactccagc ctgggcgaca gagcgagact ccgtctcaaa aaaaaaaaaa aaagtaggct 8941 ttcatgatgt gtgagctgaa ggcgcagtag gcagaagtag aggcctcagt ccctgcagga 9001 gacccctcgg tctctatctc ctgatagtca gacccagcca cactggaaag aggggagaca 9061 ttacagcctg cgagaaaagt agggagattt aaaaactgct tggcttttat tttgaactgt 9121 tttttttgtt tgtttgtttt ccccaattca gaatacagaa tacttttatg gatttgtttt 9181 tattacttta attttgaaac aatataatct tttttttgtt gtttttttga gacagggtct 9241 tactctgtca cccaggctga gtgcagtggt gtgatcttgg ctcacctcag cctcgacccc 9301 ctgggctcaa atgattctcc cacctcagct tcccaagtag ctgggaccac aggtgcgtgt 9361 gttgcgctat acaaatcctg aagacaagga tgctgttgct ggtgatgctg gggattccca 9421 agatcccaga tttgatggca ggatgcccct gtctgctgcc ttgccagggt gccaggaggg 9481 cgctgctgtg gaagctgagg cccggccatc cagggcgatg cattgggcgc tgattcttgt 9541 tcctgctgct gcctcggtgc ttagcttttg aaacaatgaa ataaattaga accagtgtga 9601 aaatcgatca gggaataaat ttaatgtgga aataaactga acaacttagt tcttcataag 9661 agtttacttg gtaaatactt gtgatgagga caaaacgaag cactagaagg agaggcgagt 9721 
tgtagacctg ggtggcagga gtgttttgtt tgttttcttt ggcagggtct tgctctgttg 9781 ctcaggctgg agtacagtgg cacaatcaca gctcactata gcctcgacct cctggactca 9841 agcaatcctc ctgcctcagc ctcccagtag ctgggactac aggcgcatgc caccatgcct 9901 ggctaatttt aaattttttt ttttctcttt tttgagatgg aatctcactc tgtcgcccag 9961 gctggagtgc agtggcgtga tctcggctga cggcaagctc cgcctcccag gttcactcca 10021 ttcgcctgcc tcagcctccc aagtagctgg gactacaggc gctgggatta caaacccaaa 10081 cccaaagtgc tgggattaca ggcgtgagcc actgcacccg gcctgttttg tctttcaata 10141 gcaagagttg tgtttgcttc gcccctacct ttagtggaaa aatgtataaa atggagatat 10201 tgacctccac attggggtgg ttaaattata gcatgtatgc aaaggagctt cgctaattta 10261 aggctttttt gaaagagaag aaactgaata atccatgtgt gtatatatat tttaaaagcc 10321 atggtcatct ttccatatca gtaaagctga ggctccctgg gactgcagag ttgtccatca 10381 cagtccatta taagtgcgct gctgggccag gtgcagtggc ttgtgcctga atcccagcac 10441 tttgggaggc caaggcagga ggattcattg agcccaggag ttttgaggcg agcctgggca 10501 atgtggccag acctcatctc ttcaaaaaat acacaaaaaa ttagccaggc atggtggcac 10561 gtgcctgtag tctcagctac tcaggaggct gaggtgggag gatcactttg agccttgcag 10621 gtcaaagctg cagtaagcca tgatcttgcc actgcattcc agcctggatg acagagcgag 10681 accctgtctc taaaaaaaaa aaaaaccaaa cggtgcactg ttttcttttt tcttatcaat 10741 ttattatttt taaattaaat tttcttttaa taatttataa attataaatt tatattaaaa 10801 aatgacaaat ttttattact tatacatgag gtaaaactta ggatatataa agtacatatt 10861 gaaaagtaat tttttggctg gcacagtggc tcacacctgt aatcccagca ctttgggagg 10921 ccgtggcggg cagatcacat gagatcatga gttcgagacc aacctgacca acatggagag 10981 accccatctc tactaaaaat acaaaattag ccggggtggt ggcgcatgcc tgtaatccca 11041 gctactcggg aggctgaggc aggagaatct cttgaacccg ggaggcagag gttgcggtga 11101 gccaagatcg tgcctttgca caccagccta ggcaacaaga gcgaaagtcc gtctcaaaaa 11161 aaaagtaatt ttttttaagt taacctctgt cagcaaacaa atttaaccca ataaaggtct 11221 ttgtttttta atgtagtaga ggagttaggg tttataaaaa atatggtagg gaagggggtc 11281 cctggatttg ctaatgtgat tgtcatttgc cccttaggag agagctctgt tagcagaatg 11341 aaaaaattgg aagccagatt cagggaggga ctggaagcaa aagaatttct 
gttcgaggaa 11401 gagcctgatg tttgccaggg tctgtttaac tggacatgaa gaggaaggct ctggactttc 11461 ctccaggagt ttcaggagaa aggtagggca gtggttaaga gcagagctct gcctagacta 11521 gctggggtgc ctagactagc tggggtgccc agactagctg gggtgcctag actagctggg 11581 tactttgagt ggctccttca gcctggacct cggtttcctc acctgtatag tagagatatg 11641 ggagcaccca gcgcaggatc actgtgaaca taaatcagtt aatggaggaa gcaggtagag 11701 tggtgctggg tgcataccaa gcactccgtc agtgtttcct gttattcgat gattaggagg 11761 cagcttaaac tagagggagt tgagctgaat caggatgttt gtcccaggta gctgggaatc 11821 tgcctagccc agtgcccagt ttatttaggt gctctctcag tgttccctga ttgttttttc 11881 ctttgtcatc ttatctacag gatgtgactg ggaagctctg gtttcagtgt catgtgtcta 11941 ttctttattt ccaggcaaag gaaaccaaca ataagaagaa agaatttgag gaaactgcga 12001 agaaagtgcg ccgtgccatc gagcagctgg ctgccatgga ttgaggcctc tggccggagc 12061 tgcctggtcc cagagtggct gcaccacttc cagggtttat tccctggtgc caccagcctt 12121 cctgtgggcc ccttagcaat gtcttaggaa aggagatcaa cattttcaaa ttagatgttt 12181 caactgtgct cctgttttgt cttgaaagtg gcaccagagg tgcttctgcc tgtgcagcgg 12241 gtgctgctgg taacagtggc tgcttctctc tctctctctc ttttttgggg gctcattttt 12301 gctgttttga ttcccgggct taccaggtga gaagtgaggg aggaagaagg cagtgtccct 12361 tttgctagag ctgacagctt tgttcgcgtg ggcagagcct tccacagtga atgtgtctgg 12421 acctcatgtt gttgaggctg tcacagtcct gagtgtggac ttggcaggtg cctgttgaat 12481 ctgagctgca ggttccttat ctgtcacacc tgtgcctcct cagaggacag tttttttgtt 12541 gttgtgtttt tttgtttttt ttttttggta gatgcatgac ttgtgtgtga tgagagaatg 12601 gagacagagt ccctggctcc tctactgttt aacaacatgg ctttcttatt ttgtttgaat 12661 tgttaattca cagaatagca caaactacaa ttaaaactaa gcacaaagcc attctaagtc 12721 attggggaaa cggggtgaac ttcaggtgga tgaggagaca gaatagagtg ataggaagcg 12781 tctggcagat actccttttg ccactgctgt gtgattagac aggcccagtg agccgcgggg 12841 cacatgctgg ccgctcctcc ctcagaaaaa ggcagtggcc taaatccttt ttaaatgact 12901 tggctcgatg ctgtggggga ctggctgggc tgctgcaggc cgtgtgtctg tcagcccaac 12961 cttcacatct gtcacgttct ccacacgggg gagagacgca gtccgcccag gtccccgctt 13021 tctttggagg cagcagctcc cgcagggctg 
aagtctggcg taagatgatg gatttgattc 13081 gccctcctcc ctgtcataga gctgcagggt ggattgttac agcttcgctg gaaacctctg 13141 gaggtcatct cggctgttcc tgagaaataa aaagcctgtc atttcaaaca ctgctgtgga 13201 ccctactggg tttttaaaat attgtcagtt tttcatcgtc gtccctagcc tgccaacagc 13261 catctgccca gacagccgca gtgaggatga gcgtcctggc agagacgcag ttgtctctgg 13321 gcgcttgcca gagccacgaa ccccagacct gtttgtatca tccgggctcc ttccgggcag 13381 aaacaactga aaatgcactt cagacccact tatttatgcc acatctgagt cggcctgaga 13441 tagacttttc cctctaaact gggagaatat cacagtggtt tttgttagca gaaaatgcac 13501 tccagcctct gtactcatct aagctgctta tttttgatat ttgtgtcagt ctgtaaatgg 13561 atacttcact ttaataactg ttgcttagta attggctttg tagagaagct ggaaaaaaat 13621 ggttttgtct tcaactcctt tgcatgccag gcggtgatgt ggatctcggc ttctgtgagc 13681 ctgtgctgtg ggcagggctg agctggagcc gcccctctca gcccgcctgc cacggccttt 13741 ccttaaaggc catccttaaa accagaccct catggctgcc agcacctgaa agcttcctcg 13801 acatctgtta ataaagccgt aggcccttgt ctaagcgcaa ccgcctagac tttctttcag 13861 atacatgtcc acatgtccat ttttcaggtt ctctaagttg gagtggagtc tgggaagggt 13921 tgtgaatgag gcttctgggc tatgggtgag gttccaatgg caggttagag cccctcgggc 13981 caactgccat cctggaaagt agagacagca gtgcccgctg cccagaagag accagcaagc 14041 caaactggag cccccattgc aggctgtcgc catgtggaaa gagtaactca caattgccaa 14101 taaagtctca tgtggtttta tctacttttt ttttcttttt cttttttttt gagacaaggc 14161 cttgccctcc caggctggag tgcagtggaa tgaccacagc tcaccgcaac ctcaaattct 14221 tgcgttcaag tgaacctccc actttagcct cccaagtagc tgggactaca ggcgcacgcc 14281 atcacacccg gctaattgaa aaattttttt ttttgtttag atggaatctc actttgttgc 14341 ccaggctggt ctcaaactcc tgggctcaag tgatcatcct gcttcagcgt ccgacttgtt 14401 ggtattatag gcgtgagcca ctgggcctga cctagctacc attttttaat gcagaaatga 14461 agacttgtag aaatgaaata acttgtccag gatagtcgaa taagtaactt ttagagctgg 14521 gatttgaacc caggcaatct ggctccagag ctgggccctc actgctgaag gacactgtca 14581 gcttgggagg gtggctatgg tcggctgtct gattctaggg agtgagggct gtctttaaag 14641 caccccattc cattttcaga cagctttgtc agaaaggctg tcatatggag ctgacacctg 14701 cctccccaag 
gcttccatag atcctctctg tacattgtaa ccttttattt tgaaatgaaa 14761 attcacagga agttgtaagg ctagtacagg ggatcc // LOCUS HSU75898 1468 bp DNA PRI 06-NOV-1997 DEFINITION Homo sapiens HSPB2 gene, complete cds. ACCESSION U75898 NID g2586092 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1468) AUTHORS Iwaki,A., Nagano,T., Nakagawa,M., Iwaki,T. and Fukumaki,Y. TITLE Identification and characterization of the gene encoding a new member of the alpha-crystallin/small hsp family, closely linked to the alphaB-crystallin gene in a head-to-head manner JOURNAL Genomics 45 (2), 386-394 (1997) MEDLINE 98008929 REFERENCE 2 (bases 1 to 1468) AUTHORS Iwaki,A., Nagano,T. and Nakagawa,M. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) Institute of Genetic Information, Kyushu University, 3-1-1 Maidashi, Higashi-ku, Fukuoka 812-82, Japan FEATURES Location/Qualifiers source 1..1468 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /cell_type="lymphocyte" mRNA join(17..207,725..1377) /gene="HSPB2" gene 17..1377 /gene="HSPB2" CDS join(114..207,725..1179) /gene="HSPB2" /note="similar to small heat shock protein" /codon_start=1 /product="HSPB2" /db_xref="PID:g2586093" /translation="MSGRSVPHAHPATAEYEFANPSRLGEQRFGEGLLPEEILTPTLY HGYYVRPRAAPAGEGSRAGASELRLSEGKFQAFLDVSHFTPDEVTVRTVDNLLEVSAR HPQRLDRHGFVSREFCRTYVLPADVDPWRVRAALSHDGILNLEAPRGGRHLDTEVNEV YISLLPAPPDPEEEEEAAIVEP" polyA_signal 1357..1362 /gene="HSPB2" BASE COUNT 241 a 537 c 359 g 331 t ORIGIN 1 gtcgaccccg cccccacctc ctatcgagcc ctggctctcc gggcagctgg aggggtcgcg 61 ctgcgcctgt tggggctgca cctcggacca gggcttctgc tgcatctgca gccatgtcgg 121 gccgctcagt gccacatgcc cacccggcca ccgccgagta cgaatttgcc aacccgagcc 181 gcctgggtga gcagcgcttc ggagaaggta tggcacagac accaccccct tgccccccac 241 ccccaccccc tgaaattctg ggctgacctc ccaacgttgc tggcctccac cccactctgg 301 gcttaactga tctcctggga ccagccaccc cccaccccag actacccctc atcccctcca 361 
agcagagatc tttcccattc ctgctgacct caagacacac ttggtgcttg cagtgctgat 421 acctgctctt tgccggcctg ggtgtgcctc cttgtccccc ttctatccac ccaggccgtt 481 tggtgcctcc tcctcctccc cctcctcctc cttctcctcc tcctcctccc tttcctttcc 541 gttctctttc ccctcttccg agctgccttc tcctcctagc tacagtgtgg cctccctccc 601 gttccctact cctcctgact cccctcttgc tcttcccaat ccctcctctc cagatttccc 661 atttcgcccc tgtgccagcc tcatcctccc tcatcctgcc tcttgccttc tctctgccct 721 ttaggcctcc tgccagaaga gatcctgacc cccacactct accatggcta ctatgtccgg 781 cctcgggccg ccccagctgg ggagggcagc agggcagggg cctccgagct taggctcagt 841 gagggcaagt tccaggcatt tctggatgtg agccacttta ccccagacga ggtgactgtg 901 aggactgtgg ataacctgct ggaggtgtct gcccggcacc cccagcgcct ggaccgccac 961 ggcttcgtgt cccgagagtt ctgccgcacc tatgtcctgc ctgctgatgt cgacccctgg 1021 cgagtccgag ctgctctctc ccatgatggc atcttaaacc tggaagcacc tcggggtggc 1081 cgacatttgg acacagaggt caatgaggtc tacatctccc tgctccctgc gcctcctgat 1141 ccagaggaag aggaggaggc agccatagtt gagccctgat tgccacagac ccagcaccca 1201 gcaaatccct ctctacctcc caaggtgata tgggcagctg cccaccactc cagaggtagc 1261 agcatccttg ggggaaggga aaggtgcatg gtccacaatg tatggtttgg tcccatggga 1321 catgtcatag ccttggttta gttttgggtg gagctgaata aacccaaatc tcagggcctt 1381 gtttgtactg ctccctattc tgtggccggg aagctgggat ggggagggag ggaggaggtc 1441 acagccagct agacaaaaag ctcctcag // LOCUS HSU78190 4000 bp DNA PRI 03-DEC-1996 DEFINITION Human GTP cyclohydrolase I feedback regulatory protein gene, complete cds. ACCESSION U78190 NID g1698996 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4000) AUTHORS Milstien,S., Jaffe,H., Kowlessur,D. and Bonner,T.I. TITLE Purification and cloning of the GTP cyclohydrolase I feedback regulatory protein, GFRP JOURNAL J. Biol. Chem. 271 (33), 19743-19751 (1996) MEDLINE 96355270 REFERENCE 2 (bases 1 to 4000) AUTHORS Bonner,T.I., Modi,W.S. and Milstein,S. 
TITLE Structure of the human GTP cyclohydrolase I feedback regulatory protein gene JOURNAL Unpublished REFERENCE 3 (bases 1 to 4000) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) Lab of Cell Biology, NIMH, Bldg 36, Rm 3A17, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..4000 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q15" mRNA join(<356..485,2096..2190,3495..3981) /note="5' end based on comparison with EST, GenBank Accession Number AA100037" CDS join(450..485,2096..2190,3495..3618) /function="mediates tetrahydrobiopterin inhibition of GTP cyclohydrolase I" /codon_start=1 /product="GTP cyclohydrolase I feedback regulatory protein" /db_xref="PID:g1698997" /translation="MPYLLISTQIRMEVGPTMVGDEQSDPELMQHLGASKRRALGNNF YEYYVDDPPRIVLDKLERRGFRVLSMTGVGQTLVWCLHKE" polyA_signal 3951..3957 polyA_site 3982 /note="based on comparison with EST, Genbank Accession Number N89728" BASE COUNT 829 a 1207 c 1128 g 836 t ORIGIN 1 tgttcaagaa gcgggcgcgg gggcaacacg cgcgcgggga cgcagtaaga tcacaggctg 61 caaagaaaac agcactaagc gtcctggagg acgctggagc ttcaggatag tgtccttcca 121 ggtcgtggga ccgccccctc ctccccgcac tccaccttcc cgcaactgga tcactcggcg 181 gaagcctcag gtgggccccg gggtccaagt ccgccgccgg gagggcgggg gcgggccccg 241 aaccgccgac tgcagggcgc gtcacggcag ccgctcctga ttggcggcca cggcggtcac 301 gggcaaggtg ccccgggatt gacgccgcct gtggccagag ccggagtaac gggactccca 361 gctgcgcgtc gcagtcccga cgcgagaagg gctggagtcg gcgtccagcc tagagccccc 421 ggtgggagcc aggccgggac gcgtgcacca tgccctacct gctcatcagc acccagatcc 481 gcatggtgag taccggccgc ctggcgactt ggagccggga ccaggcccta gggggctgcg 541 actgcacctt catgcctccc tcctgggctg cacccaccgg ggactggacc gggaacccgc 601 ccagacacgc ccccttggcc ggggagaggg agtggagaag cgggctcagc tcggcttccg 661 ggactcagcg agctctggcc ggcctgaagc ctttattgcc gcgaaggggc ccgggtacgt 721 gcgtcccggc aacagggcga cctccgggcg cttccggctc ctgtcctacc tgtcggcggg 781 agagacccag ggagccctgg cctcacgcgc agcacctagt gccctgcaca gagccgctcc 841 gcaggggaaa aaagggcgaa 
taacctgaga gaaagaggtt tggaaaagat aaggacctgc 901 caaggcgcta aaaccccggt gccaagtcct tgacccataa acagcaacta gctgcaccct 961 taggcacttc caggaagggc tatccctgaa gctttaggga aggaagaaaa gaaagaacac 1021 cctgcttgga aaaaaggaag gcgagaaggg gctgtgcagt tagaggtggg aacaccatga 1081 gttagaagtc acgttccttt ctctgggcgc agtttcctta tctgtaaaaa ggttctgatt 1141 gcccctcagg agtccccttt tagcttcaga aagacatgac gggcagcaag gtgctctctg 1201 cccttgctca ctgcccagtg ctggggtgaa ccagcaaagg gtacacaacc cccttggtgg 1261 gttcaccacg caacagagag ctcttgcaga gccctgggca gcggtttgtg aagcagccat 1321 ctgctcagct gccaaaatgt ggattcctgg gtcccttccc agacctccta aaactctgtc 1381 tggggcttgg gaacccatac ttggaaatgg ccccccaaag tgattcttat ttcttaaaag 1441 atatggcttt attgatatgt aatgtgcata ccataaattc tttgatttaa agtgtacaat 1501 tcagtgttgt tgtttcttta gtgtactcag agttgtgcaa ccatcaccat aatcagtttt 1561 agaacatttc taatacccca aaaggaaacc ctgtacccac tggcagacac tcttgatttc 1621 ccccaagtcc ctcagcccta gataaccact aattaccttt ctgtggtaat gtgatctgag 1681 tctggcttct ttcatttaca tgtttatttt gttttgtttt tttgaaatgg agtctcactc 1741 tgtcacccag gctggagtgc agtggtgcaa tcatggctca ctgcagcctc aaactcctcc 1801 cacctcaagc gatcctccca cctcagcctc agtagctgga actataggct cgtgatactt 1861 tgcctggcta aagctgagct ttaaggcact agggagggcg cctacattgc tcctcctgat 1921 cattgatggg ctccacccct tcacagcttg tgctcagccc ctaagaatcc ttgctactct 1981 ctgtagcctt tacctgaacc ttactcaggg ctagcaggca ggggaggaag gacaggcaag 2041 accttccacc tcctcctggc agccagccaa gcagccactg tggcttacct tgcaggaggt 2101 gggccccact atggtgggcg atgaacagtc ggatccagag ctgatgcagc atctgggggc 2161 ttcaaagaga agagccttgg gaaacaactt gtaagtagca gcctccctca gtatcctctc 2221 cctgccagcc cctagacctg cctctgctcc ctttatacaa cctccaagcc cagggacagg 2281 gcccagcatc agagccccct gcctgcatcc tggggctctg ttctccggag gagttggtcc 2341 catctgtagc ctctccaaca tcccccctct cccccacgag gcttgctctg aaagctttcc 2401 acatccttac tggaaccgca gaaacagagt ccgttccctg ggtctaggac tccactagca 2461 gactgttcct tgccctgccc tctcctaagc tggaccccat caaggacaga caggcacaga 2521 gagaaaggtt tttgtcgtga ggatctgcat 
gagtcgcaaa gctaagccct agaagcaggg 2581 agcagggagc ctgcgtccta cctccagggt ggctgctgcc tggggaccta gccatttccc 2641 ttggcattct tcctaatgcc tccatgtcta aactgtgact actgcttcca ttcctcctct 2701 cacccctgga ggcagggaga acccaagcta cacaggctct ctgaaaagag ttgtcagagt 2761 gagtgccagt tcaaatcggg gaaagaggag ggctgatccc agcccctggg tcaaggctcg 2821 gggacagaag ccagccccct ccctttgcca tgaggcactt gcctctccat gcccagcttg 2881 gggcttgagc cctttgtctg cttcatggtc tttcagcatc aattaaacct gacagaagga 2941 ctaccagcac tctccatgct agggagtggc gcacagagcc aggagctact ggggctcagc 3001 cttgttgcta cagaactgca ggacagccag ggcagtgagg ggcctggagg cttatttttt 3061 tttgaaatgg agtctctgtc gcccaggctg gagtgcggtg gtgcaatcat ggctcactgc 3121 agcctcagcc tcctcctacc tggaagcctg ggtggggagc aagcccaggg agggcggcag 3181 cagggctgtg gtcagactaa catggtctac catctgggac gctggggaac ctgattaacc 3241 cctctgtttt ctcatctatc aatggccaca acagtggcac cttcactagc ttgttgggaa 3301 gaataaatga gatagctcgt gaagtaccta gaaggggagg aggcaggggg cgtgcactta 3361 ccccagcgcc cagcaagcag ccagcaagtg tgagtcacta caagagtggc caggctgcct 3421 gctgcagaca ctagctttgt accaggagct tgctgtgtgc ccctcctcac cccccccatc 3481 ctgtctcttt gcagttatga atactacgtc gatgaccctc cccgcatagt cctggacaag 3541 ctggaacgca ggggcttccg tgtgctgagc atgacggggg tgggccagac gctggtgtgg 3601 tgtctgcaca aggagtgacc ttctcatgct gatttgcaga cggggcaccc ctgtggaggg 3661 gctgctgtgg gccctgacct ccaagctcct gcctcaccgt ctgccttgct cctctcttcc 3721 caaatcatca ccgccatggg cccagcccca aagggcagtg aatggccttc tctgaaaccc 3781 tgcgtcaagc agtgggagag ggcagtgccc ggtgccctgg tgctcccagc tgccctcctg 3841 cttcgggcct gggccgaggg ccttgtgtag gccatgttcc tcgggcagct gccccgggcc 3901 ggagctgggc actccagcgg ccctggcgcg tggctcctgc atagctagcc caagccaata 3961 aagggctgtg atgagtggct gcgcctgtgc tctgcttgtg // LOCUS HSU79415 4715 bp DNA PRI 04-JUN-1997 DEFINITION Homo sapiens prepro dipeptidyl peptidase I (DPP-I) gene, complete cds. ACCESSION U79415 NID g1947070 KEYWORDS . SOURCE human. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4715) AUTHORS Rao,N.V., Rao,G.V. and Hoidal,J.R. TITLE Human dipeptidyl-peptidase I. Gene characterization, localization, and expression JOURNAL J. Biol. Chem. 272 (15), 10260-10265 (1997) MEDLINE 97248590 REFERENCE 2 (bases 1 to 4715) AUTHORS Rao,N.V., Rao,G.V. and Hoidal,J.R. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Internal Medicine, Division of Respiratory, Critical Care and Occupational Medicine, University of Utah Health Sciences Center, Rm. 743A, 50 N. Medical Dr., Salt Lake City, Utah 84132, USA FEATURES Location/Qualifiers source 1..4715 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q14.1-q14.3" /chromosome="11" gene 1..4715 /gene="DPP-I" CDS join(1271..2159,3805..4307) /gene="DPP-I" /EC_number="3.4.14.1" /note="cathepsin C" /codon_start=1 /product="prepro dipeptidyl peptidase I" /db_xref="PID:g1947071" /translation="MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVG SSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWF AFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQ EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPA PLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTN NSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMK EDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG LRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAI ESIAVAATPIPKL" intron 2160..3804 /gene="DPP-I" BASE COUNT 1390 a 993 c 1070 g 1262 t ORIGIN 1 ttttaaggga gatataagtg aataatttgg acctgctctc tttgaatgtt tataatctgg 61 tggtggaaaa aaaatggaca tatgaatatt gatttgtgac cagtgcaaag ggggcaaaaa 121 ttcatatccc aaagaaaacg gggacacatc cggtctgtct tgttcatcac tgtgtccaca 181 gggcctgaca cctagtaggc tcagtgggag aaaggagccc caattaccaa caaaagccag 241 gaaagaacgg gaggctctta cggaaaaggg tgatacttaa actgagcaag gaggcacctg 301 gaaatagtgc cacctaataa tttttggcga tcagactggc acactagaac 
ggttcataag 361 accagccttc tcccattggc tagcttcctt cctcaccctt ctcaccctgg gcaagccgct 421 tcctctctct gggcctcttg cttttcctct gtaacataaa aggggttgag caatatcatc 481 tctgagagcg ccatgtgtgt gcgtgccaga gggaaaaccc ccacaacgct aatacatcaa 541 aactgcaggt ttgcacaaaa actgaattct gctgaatgcg aacaggcaaa cagcatttac 601 caggaaacaa aaacaaaatc aagcacataa aaaagtagga agagttggaa aacggaagga 661 agataagttt tcaaacagct ggaatagttg atgttagcta gcgaagtttt tcagaggaaa 721 aaacaagaat ttggttatga ggcaagtgga cctgagaaaa aagactaaag gggaagaata 781 gcaagtaaaa cagaactcca cttgctagat ctctccgtct gtcgcgttct ttcacctgac 841 ccactccctt attcccccca caccctttcc ttctctccct acgttaccgc acaggaacga 901 agtctgggtc atgtgcggac cgcttgtggc tcttaaatcc tctttttgtc accctggccg 961 tgcaaaattt tgaaaacgtc cctcggcaaa aaaaataaaa ataaaaaaaa aaatctgtcc 1021 ctggcctctt ccctagttct gggtccagtt gcagccaagt gaggggcagc gcgcgctccc 1081 aagtccccgt ttcagagacg cgcacgcgcc tggcgcccaa cccccaatcc cctgctgctc 1141 agtgaccccg cccacgggtt tccgggccgg cgtagctatt tcaaggcgcg cgcctcgtgg 1201 tggactcacc gctagcccgc agcgtcggct tcctggtaat tcttcacctc ttttctcagc 1261 tccctgcagc atgggtgctg ggccctcctt gctgctcgcc gccctcctgc tgcttctctc 1321 cggcgacggc gccgtgcgct gcgacacacc tgccaactgc acctatcttg acctgctggg 1381 cacctgggtc ttccaggtgg gctccagcgg ttcccagcgc gatgtcaact gctcggttat 1441 gggaccacaa gaaaaaaaag tagtggtgta ccttcagaag ctcgatacag catatgatga 1501 ccttggcaat tctggccatt tcaccatcat ttacaaccaa ggctttgaga ttgtgttgaa 1561 tgactacaag tggtttgcct tttttaagta taaagaagag ggcagcaagg tgaccactta 1621 ctgcaacgag acaatgactg ggtgggtgca tgatgtgttg ggccggaact gggcttgttt 1681 caccggaaag aaggtgggaa ctgcctctga gaatgtgtat gtcaacacag cacaccttaa 1741 gaattctcag gaaaagtatt ctaataggct ctacaagtat gatcacaact ttgtgaaagc 1801 tatcaatgcc attcagaagt cttggactgc aactacatac atggaatatg agactcttac 1861 cctgggagat atgattagga gaagtggtgg ccacagtcga aaaatcccaa ggcccaaacc 1921 tgcaccactg actgctgaaa tacagcaaaa gattttgcat ttgccaacat cttgggactg 1981 gagaaatgtt catggtatca attttgtcag tcctgttcga aaccaagcat cctgtggcag 2041 
ctgctactca tttgcttcta tgggtatgct agaagcgaga atccgtatac taaccaacaa 2101 ttctcagacc ccaatcctaa gccctcagga ggttgtgtct tgtagccagt atgctcaagg 2161 taagtgttgc atttcagaca ccatttatga gctatttacc tgtgtgcagc tggctgttgt 2221 tggcaaaggc aaaaggatga tgcagtagag agagcgcagt gtctatagtc agaaaatctg 2281 agtgcaagtc tggccctatc acttattaat ggatgattgc tcatggaatt tactgtacca 2341 tccagcaaaa tgtcaatagt tactatatat tgagtgagct ctgcttgtta catatatggc 2401 ctaaacaatg gctcaaaaaa atttggagga agtagtaagt ataattccta tttatggggg 2461 cgggggaaca gggaactaag gaaatttttt ctaataattt ggaaggtctc aacaagtttt 2521 agcattggga gtttcaattc taattcattc gttctccaaa aacccacttt tattaaaact 2581 atactaaaca ctgtttctct ctggggagaa ttttaaaatt ctgtaactta gggctgggca 2641 cagtggctta tgcctataac cctatcactt tgggaggctg agatgggtga atctcttgag 2701 tcctagagtt tgagaccagc ctgggcaaca cggcgaaacc ccttctctat taaaaataca 2761 aaaaattagc tgggcgtggt ggtgtgtgct tgtagtccca gctattcagg aggcggaggt 2821 gtaagaatca cctgagccca ggaggtcaag gctgcagtga gccgatatca tgccactgca 2881 ctccagcctg ggcaaacgga gtgaggccct gttatgaaaa aaaaaaaatc tgtacttagg 2941 ctttcagatc aggctgtatg tgatgtatgt cgaaaacaca gctataattg attgagggag 3001 aaacgttacc attttaaagt ttatgctttc aagcccagat ttggccacta ggaatttccc 3061 agctcactag tgaaactgct gatgagtgat tatttgccag tgagcctttc attctttcta 3121 aaatatgtac tactagttgt gacttgtagg ctataggggc tataatatat caagacaatc 3181 tttatcctca tgaagcttac agttaagtaa gagatagaga ttaaataatt ataacaacag 3241 agtgaagaac agtgaagaaa aagtacagag ttataatata tataataggg ccaggactgc 3301 atgaggaagg taggaaagac atttcggcaa gaggttgtca gggaaaagac ttgcttgaga 3361 aagagccaag ttgtggggtc tggctgctta gcaatgacca taatacctaa cttttgctat 3421 ttttacatga agtaactaat ttaaccttat gaggaaagta ctactaccat ctagatttta 3481 caggtaagta agcagagata cagagaagtt aaactcttca cacggctttg gctttaaacc 3541 tatataggct tcggagcctc cccacttaac cactttgcca tagctacatc catattaggt 3601 gctaagtaga tatctgttaa gtagaaggag gatgaaagga tagttagcta gttgaaaaat 3661 ggatggatga atgaagtgat gcttaagcta agaacaactt tcaggggtaa catgcaaaga 3721 ataatggagc 
aaagaagaaa aaatagaaaa tgggataatc ctttttctac taaggggtaa 3781 ccatgtgtgt tattcaatct tcaggctgtg aaggcggctt cccatacctt attgcaggaa 3841 agtacgccca agattttggg ctggtggaag aagcttgctt cccctacaca ggcactgatt 3901 ctccatgcaa aatgaaggaa gactgctttc gttattactc ctctgagtac cactatgtag 3961 gaggtttcta tggaggctgc aatgaagccc tgatgaagct tgagttggtc catcatgggc 4021 ccatggcagt tgcttttgaa gtatatgatg acttcctcca ctacaaaaag gggatctacc 4081 accacactgg tctaagagac cctttcaacc cctttgagct gactaatcat gctgttctgc 4141 ttgtgggcta tggcactgac tcagcctctg ggatggatta ctggattgtt aaaaacagct 4201 ggggcaccgg ctggggtgag aatggctact tccggatccg cagaggaact gatgagtgtg 4261 caattgagag catagcagtg gcagccacac cgattcctaa attgtagggt atcccttcca 4321 gtatttcata atgatctgca tcagttgtaa agggaaattg gtatattcac agactgtaga 4381 ctttcagcag caatctcaga agcttacaaa tagatttcca tgaagatatt tgtcttcaga 4441 attaaaactg cccttaattt taatatacct ttcaatcggc cactggccat ttttttctaa 4501 gtattcaatt aagtgggaat tttctggaag atggtcagct atgaagtaat agagtttgct 4561 taatcatttg taattcaaac atgctatatt ttttaaaatc aatgtgaaaa catagactta 4621 tttttaaatt gtaccaatca caagaaaata atggcaataa ttatcaaaac ttttaaaata 4681 gatgctcata tttttaaaat aaagttttaa aagta // LOCUS HSU80184 14131 bp DNA PRI 31-MAY-1997 DEFINITION Homo sapiens FLII gene, complete cds. ACCESSION U80184 NID g2138289 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14131) AUTHORS Campbell,H.D., Fountain,S., Young,I.G., Claudianos,C., Hoheisel,J.D., Chen,K.S. and Lupski,J.R. TITLE Genomic structure, evolution, and expression of human FLII, a gelsolin and leucine-rich-repeat family member: overlap with LLGL JOURNAL Genomics 42 (1), 46-54 (1997) MEDLINE 97321044 REFERENCE 2 (bases 1 to 14131) AUTHORS Campbell,H.D. TITLE Direct Submission JOURNAL Submitted (28-NOV-1996) MES and Centre for Molecular Structure and Function, RSBS, Australian National University, P.O. 
Box 475, Canberra, ACT 2601, Australia FEATURES Location/Qualifiers source 1..14131 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p11.2" mRNA join(208..305,1914..2024,2336..2407,3670..3750,4079..4164, 4251..4412,4751..4854,5214..5389,5475..5632,6377..6461, 6787..6934,7117..7253,7456..7668,7916..8095,9525..9607, 9732..9806,10050..10133,10212..10383,10900..11004, 11085..11276,11576..11764,11881..12020,12105..12339, 12471..12625,12711..12771,12871..12999,13083..13189, 13273..13378,13514..13579,13661..14092) /gene="FLII" gene 208..14092 /gene="FLII" CDS join(243..305,1914..2024,2336..2407,3670..3750,4079..4164, 4251..4412,4751..4854,5214..5389,5475..5632,6377..6461, 6787..6934,7117..7253,7456..7668,7916..8095,9525..9607, 9732..9806,10050..10133,10212..10383,10900..11004, 11085..11276,11576..11764,11881..12020,12105..12339, 12471..12625,12711..12771,12871..12999,13083..13189, 13273..13378,13514..13579,13661..13795) /gene="FLII" /note="see GenBank Accession Number U01184 for cDNA; similar to Drosophila melanogaster fliI in GenBank Accession Number U01182 and Caenorhabditis elegans fliI homolog in GenBank Accession Number U01183" /codon_start=1 /db_xref="PID:g2138290" /translation="MEATGVLPFVRGVDLSGNDFKGGYFPENVKAMTSLRWLKLNRTG LCYLPEELAALQKLEHLSVSHNNLTTLHGELSSLPSLRAIVARANSLKNSGVPDDIFK LDDLSVLDLSHNQLTECPRELENAKNMLVLNLSHNSIDTIPNQLFINLTDLLYLDLSE NRLESLPPQMRRLVHLQTLVLNGNPLLHAQLRQLPAMTALQTLHLRSTQRTQSNLPTS LEGLSNLADVDLSCNDLTRVPECLYTLPSLRRLNLSSNQITELSLCIDQWVHVETLNL SRNQLTSLPSAICKLSKLKKLYLNSNKLDFDGLPSGIGKLTNLEEFMAANNNLELVPE SLCRCPKLRKLVLNKNHLVTLPEAIHFLTEIEVLDVRENPNLVMPPKPADRAAEWYNI DFSLQNQLRLAGASPATVAAAAAAGSGPKDPMARKMRLRRRKDSAQDDQAKQVLKGMS DVAQEKNKKQEESADARAPSGKVRRWDQGLEKPRLDYSEFFTEDVGQLPGLTIWQIEN FVPVLVEEAFHGKFYEADCYIVLKTFLDDSGSLNWEIYYWIGGEATLDKKACSAIHAV NLRNYLGAECRTVREEMGDESEEFLQVFDNDISYIEGGTASGFYTVEDTHYVTRMYRV YGKKNIKLEPVPLKGTSLDPRFVFLLDRGLDIYVWRGAQATLSSTTKARLFAEKINKN ERKGKAEITLLVQGQELPEFWEALGGEPSEIKKHVPEDFWPPQPKLYKVGLGLGYLEL 
PQINYKLSVEHKQRPKVELMPRMRLLQSLLDTRCVYILDCWSDVFIWLGRKSPRLVRA AALKLGQELCGMLHRPRHATVSRSLEGTEAQVFKAKFKNWDDVLTVDYTRNAEAVLQS PGLSGKVKRDAEKKDQMKADLTALFLPRQPPMSLAEAEQLMEEWNEDLDGMEGFVLEG KKFARLPEEEFGHFYTQDCYVFLCRYWVPVEYEEEEKKEDKEEKAEGKEGEEATAEAE EKQPEEDFQCIVYFWQGREASNMGWLTFTFSLQKKFESLFPGKLEVVRMTQQQENPKF LSHFKRKFIIHRGKRKAVQGAQQPSLYQIRTNGSALCTRCIQINTDSSLLNSEFCFIL KVPFESEDNQGIVYAWVGRASDPDEAKLAEDILNTMFDTSYSKQVINEGEEPENFFWV GIGAQKPYDDDAEYMKHTRLFRCSNEKGYFAVTEKCSDFCQDDLADDDIMLLDNGQEV YMWVGTQTSQVEIKLSLKACQVYIQHMRSKEHERPRRLRLVRKGNEQHAFTRCFHAWS AFCKALA" polyA_signal 14068..14073 /gene="FLII" polyA_site 14092 /gene="FLII" /evidence=experimental BASE COUNT 2879 a 4090 c 4328 g 2834 t ORIGIN 1 gatcgtggaa gcctccaact ccagcacagg gggggcccgc actccttggt gcctccgcag 61 tcttcgaaca gacggggaaa ctgaggccct aagaggccta aagcctacac ttccccgcgg 121 aatgcggcgg gcgcgggcgg gttaaagggg cggggccggc gctggcccag cccgcggctc 181 cccccagcgc cctcgccccg gcgctcccta gcccggcgcg gcccggcagc gagagcggcg 241 ccatggaggc caccggggtg ctgccgttcg tgcgtggcgt ggacctcagc ggcaacgact 301 tcaaggtgag ccgggccggg cgcgtgccgg gcctccctgc aggccttctc ttccgcccgg 361 cccgcttggc ctccgcttcc tgtcccgctc ggctccaggc cctgcgtgtc ccagggactg 421 cggccggggc ggggtggggc tctcccctgg ccaggaatgg acctccgagg cctccgccca 481 gtgcctgtac ggggagaccg aggctgggca ggccggggac ggacgcttgg gctcggagtc 541 ccgccggggg cccggccggg agcccccacc ccgcgtccgt gttctcggtc taggctggat 601 ggtggcctgg ggatcgggcg agcctccccg cccctctgag gtcacacgcc caagccaggc 661 cctgactcag ggatggctgc ccggctgccc aggggcgggt gtggagccga ctcatcactg 721 ccctctcccc gccaccgcca cccgccatac tgtgtgatcc tgggcctctc agatcacttc 781 ccctctcaga gcctaggctt tctcagccgg tgaaggggct taaaagtctc ctccttggac 841 gcaaggagta tagtgagatt tgttggtgtg gagagtgtgt caggccggtg tgccaggtgc 901 tgggtggcag ttgtctgagc agctgctgga ctgagagaaa agggaccgga gcccagccct 961 ccgcagccag tgggtggggc ggcggggtgg ttccaggact caagaatgag acaggcattg 1021 cctgaccaaa ggggaagcag agaaagagag ggagggagag ctggctggag ggtcagcctc 1081 accttcaggg tgggtgactg gcagggtcgg cctgggttgt tggtctgggg 
ctggggccga 1141 ggaggcaggt ttttcctgcg tttacctgcc tgtgtacaca cctttgaccc caagtacaag 1201 tggctacacc acttttccct gaccccatcc ccaccaccct aacctggggt ccccgttcct 1261 ctttgctgaa ctgggagagt ccgagtcacc agctgggaat gtggcgctag ggggaaagga 1321 atatcagccg caggcactga aggcacctac tgccccagac ttggtgccac aagggctgga 1381 ggtgccgagg tgggcgttca cacaccaggg gagtgcagtg atgctggcct gccctgcagg 1441 gtggggagcc acggtctccg caagcaaggg tttgaccctg gccgtgagtt gccagtgtgc 1501 tgatggcagg atgagggcac aggtggcgcc tttgcaaagg cttggaaggg ggatccccca 1561 ccccaggcag ccagtcagag tgtggaatgc taggtgaagg aatttgagcc gagctaacca 1621 cagagccgga tgctgggctt tcacctgaaa caagccagtt ttgtcctggc cctcctgggg 1681 cccagaatcc agtaggagag acaaatctta ggccaacact tttgtcacgg gggggcttaa 1741 aattcccatt ctatgggtga agaaattgag gcacaatgat gaaatgactt gtctgtgttc 1801 tcacagatgg tgagagttgg ttctgtggct ccccaaaggc tcccatgggc tggaggagcc 1861 atagaaggtg gtgagcaggg gctctgtgct tcaccttgac atccttcccc cagggcggct 1921 acttccctga gaatgtcaag gccatgacca gcctgcggtg gctgaagctg aaccgcactg 1981 gcctctgcta cctgcccgag gagctggccg ccctgcagaa gctggtaagg gggtttgggc 2041 gggcagggga gggtccctgt ggatgagcca ggtgggcaga gcctgtggag aaggcagaca 2101 acttctccag agagaggaac accgacagct atgagcagag ctggaagaaa gggagtgggc 2161 accctgtcca gggaggtgtg caagaagagg ccgggtgagt gagggaggga agctgcaggg 2221 gtagttgggc cagtgtggct gaggccttcc tggatggtgc agagttgtgg aggcaggcag 2281 ggactctgga caccccaccc accctgctga gcctgtgctc cccattccgg cccaggaaca 2341 cttgtctgtg agccacaaca acctgaccac gcttcatggg gagctgtcca gcctgccatc 2401 gctgcgcgtg agtgctggcc gggaggccac cgagcttggg gttggggcca aggtccggtc 2461 agggacgtga agcctgggct agacaccaag ctgggccagc atttctgccc acctccgtgt 2521 caggctcaga ggctcaggtg tcacagaggg gccctctcat tagaggttac agcagtgccc 2581 tgtgtaaagt aacccatgtg gatcagagaa ttccatcagg aaaaaggtaa agaaatccaa 2641 aatcttacca gccgttatgg acaataacta cagtaataat agccaagact ggctgagcaa 2701 tgttcccact attctgttag gattttttca tatgctctag ccctccacta atgttgcaga 2761 agagaaactt ggccaccaag gcagattact tgccccaggt cacagagcca ggcagtggta 
2821 gagctggtgt tgagtggggt ttggccggga gcaccgtgta ccttctaccc aagacttgaa 2881 cgtcccagaa catcaggccc ttggccagac cctgtaggcc tcaacgaggg ccatcccatc 2941 tgaagtcaga gccctggaga tgtttagggc acagcttctg acttggactc tctgagttca 3001 aagctgggct ttgccacttc ctggtgatgt gactttgtac aaatcgcttc taccttctgg 3061 gcctcctctg caagtgcagt ggttgggagg ggcagcagtg accccgtgaa gcagcagcag 3121 ggtgccgcac aggcattcag actggagaag tggcggggca gctccggtga ctcccgttgt 3181 cacttggagt cacgacacca ttctctggcc ctgttccctc atgctcaagc tttctgcatg 3241 tctgtaaggt atgaggaggt gtcagtctta agttagggag agaaagcaga cttcctctcc 3301 atccagattc tgatttctct gtggtccact tctctccctc tgttctgaaa accctgactc 3361 agaccttacc cctatctcac cttctggaat taagaagaag agaaatggtt tgaacagaaa 3421 aggttccctg agggccctgt gctggggctc cggggcacat ttgagctggg ctttgaggca 3481 tgaataggag ctcactggga ggctaggaag agcgctccag gagagggagc tatttgatca 3541 aaggccagag gtgagagagt cctcaggcct ggcctctggt gagctgcctg aggaggtgtc 3601 cctctctaga ctctgttcct tcctgagagt ggaagccagg aattatagac tcttatggtt 3661 tattcccagg ccatcgtggc ccgagccaac agtctgaaga attccggagt ccccgatgac 3721 atcttcaagc tagatgatct ctcagtcctg gtcagtggct acccactcac ctggcccctg 3781 tgctagccgg accattcaat ccccactcac tataaaagtc cctggggcca ttgttatccc 3841 attttgcaga tgggaaaact gagtcttgga gtggataata ccttgcctga ggtcactttt 3901 ggagtcagat ccctaacaca ggcactcttc ctcccaagtc cctggctgtc tccctttggc 3961 cctgaagccc cctcatctac ccccacaccc aactcagtgc cctgcctgat ccccaagctt 4021 ggcagacgcc tggtggggag gagactcccc tgacctgggt cttgcccctt ctacccagga 4081 cttgagccac aaccagctga cagagtgccc gcgggagctg gagaacgcca agaacatgct 4141 ggtgctgaac ctcagccaca acaggtgcca ggccagtggg gggcaggggg caggtgggcc 4201 ctgtggggtg gctggtgctg actcaggtgg ctcctgctcc ccacccgtag catcgacacc 4261 atccccaacc agctcttcat caacctcact gacctactat acctggacct cagcgagaac 4321 cgcctggaga gcctgccccc gcagatgcgc cgcctggtgc acctgcagac gctcgtgctc 4381 aatggaaacc ccctgctgca tgcacagctc cggtgggcgc cccctcggac cgcactcctg 4441 cagaagccac cctgaggagt ctggccgggc cccttcactt cccagggccc taacccttct 4501 
cccttaggcc cagccatccc tccgtgcagg cagcacccct cagtgtacaa cctcctaagt 4561 atcatgtcca cctctcagtc ctgtcttccc gcttacacac cccgcaacca cccttgggct 4621 tgtcccttca accagatgtg acctcacacc taccctctgg gccctgtgct tttggtcctg 4681 ccctgccaag ccccaccctt aacactccct aggtgggccc tggccctgac cactcacgtc 4741 accctggcag gcagctccca gcgatgacgg ccctgcagac cctgcacctg cggagcaccc 4801 agcgcaccca gagcaacctg cccaccagcc tggagggtct gagcaacctc gcaggtcagg 4861 cagccccagg agcccctctg agctcgggcc ccctctgggc cctcttgctt ccttgggtgg 4921 aaggggttga accctgctct tcggaccaat acccagaggg aaaaggctgg agttatgtta 4981 cagtgacttt tgcgattatg atgatgcctt aacactgact ttatgcctag aaaactcccc 5041 cgcaaccccc atgtgaggga gtttgtccaa atggagaaac ttgggcgcac agcagttgtc 5101 acttgcctaa gccacacagc tcagaggtgg cagagctggg gttcacacta gggctggctg 5161 ccctggcacc tggtgtatgg gccccagctg atgcacccca ccctgcaccc cagacgtgga 5221 tctgtcctgc aatgacctga cacgggtgcc cgagtgtctg tacaccctcc ccagcctgcg 5281 ccgcctcaac ctcagcagca accagatcac ggagctgtcc ctgtgcatag accagtgggt 5341 gcacgtggaa actctgaacc tgtcccgaaa tcagctcacc tcactgcccg tgcgtctgag 5401 gccctcagta gggggatggg aggaaggccg tgagcctggc cctgacctcc cacccctgcc 5461 cgttccccct ccagtcagcc atttgcaagc tgagcaagct gaagaagctg tacctgaatt 5521 ccaacaagct ggactttgac gggctgccct caggcattgg caagctcacc aacctggaag 5581 agttcatggc tgccaacaac aacctggagc tggtccctga aagtctctgc aggtgctggg 5641 cagggctggg gctagcagag gcacttgcag cacagagaac cccgggatcg tagtcagagc 5701 ccaccttgtg gtctgtgcaa gtcagacaaa atgaagatgg gttaattcac cttttctttt 5761 cttttctttt cttttttgtt tttgagacag agtctcgctc tgtcgcctgg gctggaatgt 5821 agtggtgcaa tctcggctga ctgcagcctc tgtctcccag gttcaagcga atctcctgcc 5881 tcagcctcct gagtagctgg gattataggc gtgcgctacc acgcccggct ctttttgtat 5941 ttttagtaga gatggggtct caccatgttg gccaggctgg tcttgaatgc ctgtcctcaa 6001 atgatcagcc tcccaaagtg ctgggattac aggtgtgagc caccacgccc ggcccagtta 6061 actcaccttt gtggatgaag accacagtcc catgccagcc attcccaact tgagcccctg 6121 agagcaccct tcggggggca ctgttaggga caaggccaca tgacgtggga acgcagtgtt 6181 gcttctctcc 
ccttctggag atcacagtgc ccgtaacatg gtctcttggc tctgagaagt 6241 ccttcaggca gagaatggag acccccgcca gttcctctga ccctacctgc ctcccctccc 6301 caggactagg ggttttcttg cccacttccc cttgtctgtc acctgcctga ggccctggct 6361 ggccctgtct tcacaggtgc ccaaagctga ggaaacttgt cctgaacaag aaccacctgg 6421 tgaccctccc agaagccatc catttcctga cggagatcga ggtcaggcat gctgggtttg 6481 agagggctca agagagacaa gcaaggggtg tgccctggag caagggggac ccagcagagg 6541 cctcctgagg gaggtggcca gtgccagggt gcccgtgggg agaaggtggg ccccacctcc 6601 ctgcatccag agtacattcc acagtctcat ttaagccccc agccagcccc agaggtgggt 6661 gggaggagaa cccagttccc tgacctctca gccctggcgg ctgcagacaa gagcaagaaa 6721 ctggattggt gtgtctggga ggtaggaccc aggctcagtg ccggctctgc tcaccctatg 6781 gggcaggtcc tggatgtgcg ggagaacccc aacctggtca tgccgcccaa gcccgcagac 6841 cgtgccgctg agtggtacaa catcgacttc tcgctgcaga accagctgcg gctagcgggt 6901 gcctctcctg ctaccgtggc tgcagctgca gctggtgagt ggggaaaggt ctggcctggg 6961 aacgggggct cctctccagg cctcaaatga cacatctgta aattgggtgg catacagctt 7021 tctggaccct ggtgggtgga gttggcccct ggttggtcgc atgcccaagg gggcagcagg 7081 cccagtgagg gctcagctgt ttgcccctcc cctaagcagg gagtgggccc aaggacccta 7141 tggctcgcaa gatgcgactg cggaggcgca aggattcagc ccaggatgac caggccaagc 7201 aggtgctgaa gggcatgtca gatgttgccc aggagaagaa caaaaagcag gaggtgagcc 7261 aggccagggt tgggacaagg cagggagcct tagcttctgc tgaaggccga ccctggcctt 7321 gaccctggcc aagagggacg gggggtgctg ggcagaaaga acagctgtgc aaaggccctg 7381 aggtgtgggt agcaagtaaa gactggcctg agtggcttca cagagtcaag ccgtgctctg 7441 tgactctgcc cccaggagag cgcagatgcc cgggccccca gcgggaaggt gcggcgttgg 7501 gaccagggcc tggagaagcc ccgccttgac tactccgagt tcttcacgga ggacgtgggc 7561 cagctgcccg gactgaccat ctggcagata gagaacttcg tgcctgtgct ggtggaggaa 7621 gccttccacg gcaagttcta cgaggctgac tgctacattg tgctcaaggt gagggctggg 7681 catgctgtta ggggccatgt ggcccagtac cttcctctaa gatggctcag tccccaactc 7741 cagagttttg ttcagggagg ccaggtggga ggtgggcagg ctgcaggagg tggtccaggc 7801 acaggggcag tacaaagccc ctggggctgg gaagcctgtg ctgttcagag tggcgtctcc 7861 ggggtggccg gaagaccagg 
atgaaagctg atgtggccgg tgccttggct tccagacctt 7921 tctggatgac agcggctccc tcaactggga gatctactac tggattggcg gggaggccac 7981 actcgacaag aaagcttgct ctgccatcca cgctgtcaac ttgcgcaact acctgggtgc 8041 tgagtgccgc actgtccggg aggagatggg cgatgagagc gaggagttcc tgcaggtgcc 8101 agcccgttaa aattgggggt gggcagagtg gggttcccag ggaactctgg tgcaggggca 8161 gtgggcccaa ggcaggtagg cagaacaggt gggcaggcag ggagctgggg ttaaaggggg 8221 tggatcaacg atactgtgcg ggacctaagg cagaccgcag ggtgaggtct aaggtggcca 8281 aagcctgggt cagcaaagga gagttctgga acgcggggca gaattcagcc gggcaaggat 8341 aagggtacag gctgggcatg ggagctagac cagtgggctg acacagggga gctgagagta 8401 gggtggaagg ggtgttgcca agagcagtga gaaggcaggg ccaaaacctg ttattaggag 8461 gcaaggacac acctggatgc caaggaggtg gggccaaggg cgggaccaga gtccagtctt 8521 gaataaatta ctgagcagtg ttagccgcca gctggttaca cctagacata tgttcgcatg 8581 tacacacacg tggagtttat gcacacaccc cacccgcccc gagttttttt tacccacatt 8641 cacccaggtt tccactcttc catgaaccca cttgcttgtg taccctgata tgcgtgcatc 8701 taaacatgct gctggcatct ttacaaacgt gggtgaacgt gtccatattt ttcatacatc 8761 ccaacacccc gacgtgtgta tgtttataga cacagatgtg tatgttttat tttagacaga 8821 ctctcgcttt gttgcccagg ctggagtgca gtggcacaat cttggctcac tgcaacctcc 8881 gcccccaggg ttcaagcgat tctcctgcct cagcctccca agtagctggg attacaggca 8941 tgtgctacca cgcctagcta atttttgtat ttttttagta gtgatggggt tttgccatgt 9001 tggccaggct ggtctcgaac tcctgacctt aggtgatcca cccgccttgg cctcccaaag 9061 tactaagatt acaggcatga gccaccgtgc ccagccagat ctgttttttt tgttgttgtt 9121 tgtttgtttg tttgtttgtt ttaagacaga gttttgctgt tgttgcccag gctggagtac 9181 aatggcacaa tctcagctca ctgcaacctc cgcctcccag gttcaaacaa ttctcctgcc 9241 tcagcctccc gagtagctgg gattacaggc atgcaccacc acgcctggct aatttatgta 9301 tttttagtag agacagggtt tctccatgtt ggtcaggctg gtctccaact cctgacctca 9361 ggtgatccac ccacctcaga ctcctgaagt gctgggatta caggcatcat agatttgtag 9421 cactcaattc acctgtatgc aactctgtac ccccacctga ggggcagtgg ttggtggggg 9481 agggggcagc tcagtgatga agaaccacac cctcacacac acaggtgttt gacaacgaca 9541 tctcctacat tgagggtgga acagccagtg 
gcttctacac tgtggaagac acacactatg 9601 tcaccaggtg agggtgtgtg ggcagtgggg ggtatggacc tggagtctgc tggcaagggg 9661 tggaggggag ggtgttgggg ctaagttagg cccttggtgc ctacactgag ccactcttgc 9721 cctgcgccca ggatgtatcg tgtgtatggg aaaaagaaca tcaagttgga gcctgtgccc 9781 ctcaagggga cctctctgga cccaaggtga gccccaacaa tgtgaggtgg gggtgcttca 9841 tcctcagcct cctccccacc agtgggggca ggtgtgggcc tcttagaaga ggaggccctc 9901 tagcaattcc tgcccacgag cttggcgggg ggtccctggg gcttacccag aaaccccata 9961 gtgagggatt ttttttggtg gggtcaggag catggggtga ccattgcccc atgcaccatc 10021 acagctcctt cctgcccctt cttgtccagg tttgttttcc tgctggaccg agggctagac 10081 atctacgtat ggcggggggc ccaggccaca ctgagcagca ccaccaaggc caggtacaag 10141 gacaccgtgg ggtcttttgt gtgaaggaaa ggggtgtgag cctcgctgat gactttgcat 10201 cctgctctca ggctctttgc agagaaaatt aacaagaatg agcggaaagg gaaggctgag 10261 atcacactgc tggtgcaggg ccaggagctc ccagagttct gggaggcact gggtggggag 10321 ccctctgaga tcaagaagca cgtgcctgaa gacttctggc cgccgcagcc caagctgtac 10381 aaggtgagcc ctgctgctga ggcccacctt ggcctcccaa gttcacccac cgactggggg 10441 aatggactac acaccgatgt agttggcttt gggctccatc tcgactttgg tggggagagg 10501 gtgccacctt ctggacaagg aacagtatga gcaaaggtct ggaggctgct tttgagaagt 10561 agcctcacgg agtctttcca ggatggtggg gccgtagggg agacagaagc acctagtgcc 10621 agggctggga accttggctc aattatctag attcttgtga acaggaaagg gacacgttaa 10681 ttagaaatgg catgaagggc ctggagaggg caggcaggga aaactccatt taaaagccct 10741 tctgaaagga tatccctaaa aagaccttga aggggagggg gaaacagaag cagtcagtga 10801 gccacgctgc agggcagggc tggttccaga gcagagggct gtgtgggtga ggttcctgtc 10861 tccagatccc ttccctgcca tgctcatcct tcctgccagg tgggcctggg cttgggctac 10921 ctggagctgc cacagatcaa ctacaagctc tccgtggaac ataagcagcg tcccaaggtg 10981 gagctgatgc caagaatgcg gctggtaaga gatgcaggag gagggcgtgg ggccagatcc 11041 ctccccgttg gctgggcttt gacctccaga ctcccggtcc gcagctgcag agtctgctgg 11101 acacgcgctg cgtgtacatt ctggactgtt ggtccgacgt gttcatctgg ctcggccgca 11161 agtccccgcg cctggtgcgc gctgccgccc tcaagctggg tcaggagctg tgcgggatgc 11221 tgcaccggcc acgccatgcc 
acggtcagcc gcagcctcga gggcaccgag gcgcaggtgc 11281 gcttgcagct tcggaggccc ctccccagga tccttcgcac cctgacatgc cctgacccct 11341 gtggcttcac tacctcccta cagagctggg cccaaaccca cctcctcttt gctgactctg 11401 cccaggcccc gccccctctg caagctgccc ctcctattcc tacctccgtg tccgaggccc 11461 cgcctcccaa gccccgccct agatctccct tgggtcccgc ctcaagcctc gggctacttc 11521 ctccaactct aatcatgccg cacccctcat ggttagttga tggcctccct ggcaggtgtt 11581 caaggccaag ttcaagaatt gggacgatgt gttgacggtg gactacacac gcaatgcgga 11641 ggccgtgctg cagagcccgg gtctctccgg gaaggtgaaa cgcgacgccg agaagaaaga 11701 ccagatgaag gctgacctca ctgcgctttt cctgccgcgg cagccgccca tgtcgctggc 11761 cgaggtgggg gcagggccgg gaccggggcg cggggggggg ggtggggggc aggccgaggg 11821 cggggcctat gcgcagagcg cgctcagacc tgccgcgggg cgggtccctc tcacctgcag 11881 gcggagcagc tgatggagga gtggaacgaa gacctagacg gcatggaggg tttcgtgctg 11941 gagggcaaga agtttgcgcg gctgccggaa gaggagtttg gccacttcta cacgcaggac 12001 tgctacgtct tcctctgcag gtaccacctc agcactccct tccccaagtc cctcccctgg 12061 tgctgggggc tcgagcccct gctcacaacc ttccaacccg gcaggtactg ggtgcctgtg 12121 gagtacgagg aggaggaaaa gaaggaagac aaggaggaga aggccgaggg caaagaaggc 12181 gaggaagcaa ccgctgaggc agaggagaag cagccagagg aggacttcca gtgcatcgtg 12241 tacttctggc agggccgtga agcctccaat atgggctggc tcaccttcac cttcagcctg 12301 caaaagaagt tcgagagcct cttccctggg aagctggagg tgtgcctggc agccccgcga 12361 gcccgaggcc aggcggaggg gtgtggtggg gctcagaccc agcgaaggtt aggctcccga 12421 ggggcaggtg ggggtccagg gagctgcccc taacacttcc acatccccag gtggtacgca 12481 tgacgcagca gcaggagaac cccaagttcc tgtcccattt caagaggaag ttcatcatcc 12541 accggggcaa gaggaaggcg gtccagggcg cccaacagcc cagcctctac cagatccgca 12601 ccaacggcag cgccctctgc acccggtgcc tggctggggg aggacgcagt ggcgagcaag 12661 cgtcagggag ccaggcctga atccttgctg atgccccatc ccttccacag gtgcatccag 12721 atcaacaccg actccagcct cctcaactcc gagttctgct tcatcctcaa ggtggggttg 12781 tgggtgtgag ccaggactcc gtggaccaag gggtggggtg tttgggggct gagtcagctg 12841 cgaacacccc cacccctgca cactccccag gttccctttg agagtgagga caaccagggc 12901 
atcgtgtatg cctgggtggg ccgggcatca gaccctgacg aagccaagtt ggcagaagac 12961 atcctgaaca ccatgtttga cacctcctac agcaagcagg tcaggaggcg ggcgggcagg 13021 cgggcacaca cagaggagcc agggcaggca ggggccctgg gctggactgg cctttcccgc 13081 aggttatcaa cgaaggtgag gagcctgaga acttcttctg ggtgggcatt ggggcacaga 13141 agccctatga tgacgatgcc gagtacatga aacacacacg tctcttccgg tgaggccagg 13201 gccctgaccc tcccctcagt cctggcactg ctgcttatga cctctggcct caccttgacc 13261 cttgattccc aggtgctcca acgagaaggg ctactttgca gtgactgaga aatgctccga 13321 cttttgccaa gatgacctgg cagatgatga catcatgttg ctagacaatg gccaagaggt 13381 gtgatgttgc cactagccct gcccatccaa ggaggcgccc atagacttac ttacaaatgg 13441 gaattctcgg gccaggagca aggggctgtt cccaggcacc catgcaaccg tatctcccct 13501 gctgccccca caggtctaca tgtgggtggg gacccagact agccaggtgg agatcaagct 13561 gagcctgaag gcctgccagg taatctgggt ggagggaggg ccgaggtggg cctgtgggca 13621 gggcagggtc atcacccaag gctcgctgtc cccttgccag gtatatatcc agcacatgcg 13681 gtccaaggaa catgagcggc cgcgccggct gcgcctggtc cgcaagggca atgagcagca 13741 cgcctttacc cgctgcttcc acgcctggag cgccttctgc aaggccctgg cctaagacag 13801 gctggcacag ccccaggctt ggtgaggaag aggaaggggc ctcatccact gtctgctagc 13861 aaagaatgta ctcaggtgac accacctgct ccagccacgt ccagtgccac agtccccagt 13921 agcctcaagc agcaccaatg gggatgaccc tgacaggtgc cctcaggggt ctgggaaatc 13981 caactctctc cacagtgtga gtgcacgtgt gaagccccct cactcttccg ctagggataa 14041 agcagatgtg gatgcccttt aagagatatt aaatgctttt attttcaata ttaaaaatca 14101 gtatttttaa tattaaaatg gcgctaattt t // LOCUS HSU82083 21716 bp DNA PRI 30-OCT-1997 DEFINITION Human metabotropic glutamate receptor 6 (mGluR6) gene, complete cds. ACCESSION U82083 NID g2231437 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 21716) AUTHORS Hashimoto,T., Inazawa,J., Okamoto,N., Tagawa,Y., Bessho,Y., Honda,Y. and Nakanishi,S. 
TITLE The whole nucleotide sequence and chromosomal localization of the gene for human metabotropic glutamate receptor subtype 6 JOURNAL Eur. J. Neurosci. 9 (6), 1226-1235 (1997) MEDLINE 97358610 REFERENCE 2 (bases 1 to 21716) AUTHORS Hashimoto,T. TITLE Direct Submission JOURNAL Submitted (12-DEC-1996) Biological Sciences, Faculty of Medicine, Kyoto University, Yoshida, Sakyo-ku, Kyoto 606, Japan FEATURES Location/Qualifiers source 1..21716 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q35" gene 2587..19328 /gene="mGluR6" CDS join(2766..3269,5635..5851,6158..6293,6965..7119, 8303..8443,8564..8764,10716..10861,10945..11568, 14465..14776,15822..16019) /gene="mGluR6" /codon_start=1 /product="metabotropic glutamate receptor 6 protein" /db_xref="PID:g2231438" /translation="MARPRRAREPLLVALLPLAWLAQAGLARAAGSVRLAGGLTLGGL FPVHARGAAGRACGPLKKEQGVHRLEAMLYALDRVNADPELLPGVRLGARLLDTCSRD TYALEQALSFVQALIRGRGDGDEVGVRCPGGVPPLRPAPPERVVAVVGASASSVSIMV ANVLRLFAIPQISYASTAPELSDSTRYDFFSRVVPPDSYQAQAMVDIVRALGWNYVST LASEGNYGESGVEAFVQISREAGGVCIAQSIKIPREPKPGEFSKVIRRLMETPNARGI IIFANEDDIRRVLEAARQANLTGHFLWVGSDSWGAKTSPILSLEDVAVGAITILPKRA SIDGFDQYFMTRSLENNRRNIWFAEFWEENFNCKLTSSGTQSDDSTRKCTGEERIGRD STYEQEGKVQFVIDAVYAIAHALHSMHQALCPGHTGLCPAMEPTDGRMLLQYIRAVRF NGSAGTPVMFNENGDAPGRYDIFQYQATNGSASSGGYQAVGQWAETLRLDVEALQWSG DPHEVPSSLCSLPCGPGERKKMVKGVPCCWHCEACDGYRFQVDEFTCEACPGDMRPTP NHTGCRPTPVVRLSWSSPWAAPPLLLAVLGIVATTTVVATFVRYNNTPIVRASGRELS YVLLTGIFLIYAITFLMVAEPGAAVCAARRLFLGLGTTLSYSALLTKTNRIYRIFEQG KRSVTPPPFISPTSQLVITFSLTSLQVVGMIAWLGARPPHSVIDYEEQRTVDPEQARG VLKCDMSDLSLIGCLGYSLLLMVTCTVYAIKARGVPETFNEAKPIGFTMYTTCIIWLA FVPIFFGTAQSAEKIYIQTTTLTVSLSLSASVSLGMLYVPKTYVILFHPEQNVQKRKR SLKATSTVAAPPKGEDAEAHK" BASE COUNT 4472 a 6210 c 5737 g 5297 t ORIGIN 1 ggatccattt tggttcatct ttcttgagtg gaatattggt tttatataaa gaaagttaaa 61 tgatttgtta atctgctgat tggttgcgta tgaaatcaca ttgtctgtta ttaaagcttt 121 tcaccataaa aaaaaaaaaa aaaaatctct gttctttgac tgtctgtatg gaactctgcc 181 atagcagccc agctgaggaa aacagtccca gggtcaggga 
acagcaggac ttcttcttgg 241 ggcaaccttt taaatgacgt cctgtgcatt ttgtacaaat tctgtgctac ctcatcgagc 301 ctgagcagct gtatcaggat gctgtaagag ggccctatgg cctgtgcgag gtgcactttg 361 gccttctgcc cctgcacggg gctgaataaa ttatgctgaa tgttccctcc ttcctcaatc 421 cttgtttctg ggcccctctg gtgcctacgc tcctgcttct cctctaactc ctcggccacc 481 ccctcccagt ctccttactt ctgctcagac ctgcatgttg gggttcctgg ggctccagcc 541 tcactgtctg tccacactct cagccccagg gaactcctgc gtcccacggc tttaaacgcc 601 tccgtggcga tcccacctca gctttctctc ccgccctggc ctcttccctc agttccaggc 661 tcatgaacca cgaagcctcc actttcctgc ccaaacccga actcttggct tctgccctcc 721 ttgacctgct cagctgtgtc tctggtcctc atattctcag ggaatgcccc agcagctatg 781 ccatcctcca cccctccaag agccaaatct ccatcaggtc taaaatcctg ccacccccct 841 ccccttctcc tcagccccct gctcacaggt gtgcatgcct ccctcctaca cggcgaggac 901 cccctgccag tcccccagct gctgcccccg cttcctgatg gtctgtcctc cagacagggg 961 cagtggctgc cttggtcatg ccaagcagga ggctgctgtg tgctgggagc tgtcaggctc 1021 gtcctgaaca gggaagggcc catccacctc ccaaacccag tttatgcagt ccttcgcaat 1081 gtcaggctca gggcctggca ccagccaagc tccccaccct tcccactgtt aaaatggata 1141 ggagcagggc taggcccagc ctgttgactc tgggcttcca ccaggagaag tggttctggc 1201 agtagaaact atcggggcct gggagaggcg ggggaagaga gaaaggtggc atgtttcttg 1261 cttgctccct ctaccagcct tgtccaaatc cccgcagcca ccctaatcca gcctgtctaa 1321 tggagcccaa gccggctcag gccctcggac gaggagcctg ctaatccctg tggctaggag 1381 ctcaccacct gtctccagga cgccctttgc tctcttggca tcagagagcc aaatcctggg 1441 cctcggatgg ggggatgata aaagcatctt ttggccaagc cccctcacct tggcctccac 1501 gatgagatgg ggagttaggt gcagagagcg ttggcacagt gagcaccgca gctcgagtgg 1561 ctgcctcaga cccagagccc gaggagactt tatacggagc cagaacgacc ccgcggggtt 1621 ccatcctccc aagcaatagg cgggagtggg agctgcgagg aaagccggcc cctcccctcc 1681 ctccatccaa ggcagtgtgg gctgtttgtt tcatgccatt ctgggtgtga atcctgatgc 1741 ccacacatgc cagctgcatg cacttgggca actcaactca ctcctcgagg gctgtttctc 1801 gactgcaggg tgttgtaagt tcgctaatac taaaggcttc tccctcctgg ccccttcctg 1861 cccctcgctc ttcctcctct tgctcaggcc ctcccagctc aggcagcccc tgccccctgc 
1921 agggttctgc aaggagaaag ctggggaata ccttaggcaa ctgcagtcag gagcactggt 1981 ggccaggaca gagacagaga gacagaaaag gggtcaggga cagagagaga taaccgcagg 2041 gagagacagg aagggacaga gacagaaaag atttccaaga agaggacaga ggcagaaagc 2101 cagggacaga gactgagaaa cagagaccta gaggcagaag aagactgaga tagagatgga 2161 cagagattgt gtcagacaca gccccagaga cagccagaca gtctgagtca gacgcaaacc 2221 aaagacaaga aaacaggaaa acagacccag agattaggag agggagggga aggagatgcg 2281 gggagagcca gcaccgccac cccccacact caggaggggt ctccaccctc ggagcggtct 2341 ctcatccctc cctagaatcc ttaaatcctc tctcgctcag ggcctcggcc gcatctgtca 2401 cagacttgtc ctgaaccgac agcggctggc gcaggtgact ggcttggggc gggagcctgg 2461 gtgtgcgctg gggatggacc ccgaggaaga ggggccaagc tgtcgggaag cggcagggct 2521 ggaggggtgg aggcagtggt cgggcgggac cccgggcgac agggttcggc gcttgtaaga 2581 gcgagacgga ggcccgggca ggccggctga gctaactccc cagagccaaa gtggaaggcg 2641 cgccccgagc gccttctccc caggaccccg gtgtccctcc ccgcgccccg agcccgcgct 2701 ctccttcccc cgccctcaga gcgctccccg cccctctgtc tccccgcagc ccgctagacg 2761 agccgatggc gcggccccgg agagcccggg agccgctgct cgtggcgctg ctgccgctgg 2821 cgtggctggc gcaggcgggc ctggcgcgcg cggcgggctc tgtgcgcctg gcgggcggcc 2881 tgacgctggg cggcctgttc ccggtgcacg cgcggggcgc ggcgggccgg gcgtgcgggc 2941 cgctgaagaa ggagcagggc gtgcaccggc tggaggccat gctgtacgcg ctggaccgcg 3001 tcaacgccga ccccgagctg ctgcccggcg tgcgcctggg cgcgcggctg ctggacacct 3061 gctcgcggga cacctacgcg ctggagcagg cgctgagctt cgtgcaggcg ctgatccgcg 3121 gccgcggcga cggcgacgag gtgggcgtgc gctgcccggg aggcgtccct ccgctgcgcc 3181 ccgcgccccc cgagcgcgtc gtggccgtcg tgggcgcctc ggccagctcc gtctccatca 3241 tggtcgccaa cgtgctgcgc ctgtttgcgg tgagggccgc ggggccgggc tcggtgtcca 3301 gcgtccctct cccgtcctcg ccctggggtg gcccaagtct ccttccttta cttcttctgg 3361 aagagtctcc gttttaatca ctcgaattca cacagcgtca aagaagccct gtcagttgag 3421 cccgacgcag gcctgaatcg gatccaccca aacactccag ccgcccatgt ctaccccgcc 3481 ccgcgaggcg ccccttcccg ggcgggtgcc acctacgcct ctggccggtt ggctggggag 3541 ggcggcctga ctacagcccc catttgccag cctgctcggg tgcccttgtt ctgggcacac 3601 
ccctgcaacc atacaaggtg ctcctgcccc ctgggcctgc ggagcgctcg agtgatattg 3661 caactttgag ttttgtttgg aacccgggcc cttcttccag agcagcaact ggggcgtgtc 3721 cctgggctgc tatgggggat ggataggcct gaaaggtctg gtctttctct tttttgatca 3781 tttaaattta ttttcaaaat gccttgaact ctccctcaag gggcatctga gagcagggcg 3841 gggcgttccc ttctgagctc tggtctcctg caccctgccc tgcgcaggtg cccgggctgg 3901 ccctgctctg tcctcaaatg tagggaactg tcccgtgagg ggcagcgtgt tccttgcagt 3961 cctaggggcc agcagaatcg ggattaggag ggctttggca tacaagtttg ggggttttcg 4021 atttggtcat tcactggggg ttgtgggaca gtttttctgg ggttctgtgg cgggaggggc 4081 ctgtgtgcat ccggctctct gtgaaggagg aaggcctggc ttccatctac ctaggagagg 4141 aacttctgtt tcctgagagc ttctccgact cccgggcgca gcgtctgtgt ttgttccagg 4201 cgtgcggggc cgctccctat cgtctgcgca catgaaagcc cccatccctg tttcctcgga 4261 ttggcagtgt tgatgtgaca tctccacgct ggcaccagac aggctgtgcg gccgcccagg 4321 ggtttttcac ccacctgtat gtgtccccgg ccctgcctag agctctagtt tctgtctctg 4381 tcctctaggt cgaatgcaga gattccagac cccaaggctt caagggccca gttggggact 4441 caggtgggca ggactctgtg agcccagggc ccactgctgc cagctcagcc cagattgagg 4501 ttgtgctgcc cagctagaaa tgcaaacatt aataaaaact gtgcaaaaca cagccacagg 4561 caagatgctg caattcctgg tctaaaagaa ctacattcat ttcccagcca catgaccgtg 4621 ctggcaggca ttttaaatta acttccctca ggaggtccag ggggcatggg agggttgttc 4681 ccaaactcaa gtgtccagga gagctgggag gaccacagca ttattcactg agctgtggcc 4741 actgggcttt ctcgtctctc ggtctcctca gttgtttctc tctctctctc tccctctctc 4801 cttcttgttt ctctccagcc tcaccttccc tctttctttc tctctccctg ccactcctgg 4861 cttccttttc atgctgtctc tatactttct atcctgcttc aaacaaccct ggaccagccc 4921 ccctccctcc ttccagaaac tccccctctc tggccttcac accagctcct gctatagccc 4981 ctcttgcgct ggagccccat agccccctct tggcggcctt ctcagccccg tgtctctgcc 5041 tccacactcc ccggtcagtt cagcctctgg tctctgctgt taatcactct aatgcatgca 5101 gagaaaccca gtttccaaca atccccatcc taggccagac acagctttaa atggaacatc 5161 ccctgctgct cccctgtctg ccctaatccc tcctagggct gcctgtccct gcggaccacc 5221 ccttggtgtg aatcaagaaa acgtgctttg ccctgggcct gactaggaca ggcagatgca 5281 gctggatggg 
ctgcaggccc tctgctgggg ggtccttgtt caggacacag cttgtaccag 5341 tatatgtggc ggtcctgccc tctccccagc cctgcatacc tcccacctgc acagcattca 5401 cggaattctc aggcccactc acaaaacaca gaccttgccc acagcttcca gcctttccag 5461 gtggctgctc cagctggccc caggtcccca ccccaaccct tccatctccc atctgtgtcc 5521 caggtccatg ccacacacct tctgtgcccc gtccgtgtcc tgggcccccc ttaccctccc 5581 tctcttgagt tactgacctc catccacagc ccagctgtcc ttcctgcccc acagataccc 5641 cagatcagct atgcctccac agccccggag ctcagcgact ccacacgcta tgacttcttc 5701 tcccgggtgg tgccacccga ctcctaccag gcgcaggcca tggtggacat cgtgagggca 5761 ctgggatgga actatgtgtc cacgctggcc tccgagggca actatggcga aagtggggtt 5821 gaggccttcg ttcagatctc ccgagaggct ggtgagctgg gggctccgag gcaccaagga 5881 gtctacatag tgtgggcccg gggccctcag ttggggcaga agcagaaaga tggggggccc 5941 aggtccatgc tgggcgaggt tgcctggctg gtgctccatg tgggcgggac ctctttgttt 6001 tggacgactc agccctgcct caggcttggc ggagggtgca ggccagactt cacccctgca 6061 gacccttcag ccagcctaca cagtcctctg cagcggtggc caggtgggtg tgtgggtggg 6121 tgccacggaa gctgactctg gcacccccaa cgcccagggg gggtctgtat tgcccagtct 6181 atcaagattc ccagggaacc aaagccagga gagttcagca aggtgatcag gagactcatg 6241 gagacgccca acgcccgggg catcatcatc tttgccaatg aggatgacat caggtgggac 6301 agtggcacaa tcaccgggcc cctgggtctc cctgagggtc ccacccctcc tcttgcacat 6361 cgccacctta cacactctcc ctgcctccct ttcccctgct ccccgcaccc cccaccttcc 6421 ctgctgctgc ctctgagtct ggtgcccagg ctcagaagca gcctggtctg gggagatgcc 6481 acaagcccag actgaatagc gcacagaaca tgtgtctgtg tttacagctc ctgggggtgc 6541 ggggagggac ctgtgggcag aacctagtga gaaaatgagc tgtgagggaa ggagcccagg 6601 gtccatacag aggctgcgga cccagccgag acccaccggc ccgagttcat ctccagccta 6661 ggaggctggt gtgacgctgg tggaggcatg tggggagggg ctggcttcga actggatttt 6721 cccccaagga gtccctctga accccctgaa cagtgggtta cagtgggcag agcaagtggg 6781 caggcctagg gtcagaggaa agcccaggga aggtgcccta aatgcccccg gccccgtttt 6841 tcctgagaaa caaggatctg gtgagtgatt atagagcagg ggagatggat agagtgggaa 6901 gggatggggg ccgagctgga ggaggctccc aggccctccc tcaccccagc cctgctccta 6961 ccaggcgggt cctggaggca 
gctcgccagg ccaacctgac cggccacttc ctgtgggtcg 7021 gctcagacag ctggggagcc aagacctcac ccatcttgag cctggaggac gtggccgttg 7081 gggccatcac catcctgccc aaaagggcct ccatcgacgg tgagtccagg ggcactccat 7141 cccccacccc acacgtctcc ccagcacccc ctcttgggga tgctcatttt ctgcatctgt 7201 gtaattggtg ccataatcca gatcccttct cagatcccca gatcctatga gtccgatcat 7261 ctgtggacgc gtccctccaa gcagcgcttg actgggatgg cgtgcgagga aagcacgctc 7321 cactcactgg acggggagga attgttgggt ttttggtttt gtttcttaat ttagtgcctg 7381 tatttctagt gctgccaaag acattctaat ctgagggtga gggcgggagg gagagaaacc 7441 aaggatacgg aatataccat cctggtgatc agagtgatga ggcccctgca cccccagtcc 7501 tacctgtgga tctggatgaa tccagcagga acagatggct cactttacaa taggattgct 7561 cattttccca aggctggacc gaggacacct tgatgacatc cttcagggct attttatgaa 7621 acacagaccc tggaaaggct gagccctggc tgctagggcc tcttagcctt agggggatat 7681 ataccatcac gccttttatt tcagctacgt tgcatctggg tgcaggtggc agcctggact 7741 gagagtagtt ttagcgataa aggcatctca tcacaagaca ggaggtctga gagttggtga 7801 ctttaggcat ggggcagtga gtgaatgcac tctcaaagcc ctgggctctg tccattcctt 7861 tgttccacca ttccctgtgg ttgtcttggg tcttcctgct tatggtctca tagtcacaag 7921 atggctgctg aggctccacc cattacatct tcacagcagc atcctaacag gaaagaagga 7981 gcagggataa aagtctttca aatggtgagg ctctgagttt gcatcagcat gtaagtgcta 8041 ccctggaacc tccagttggc tctgagtcca ccaatcagaa cttggtcacc tggccacgcc 8101 tagctcacct ggccactcct agctcacctg ccacgcctaa ctcacctggc cactcctagc 8161 tcacctgcca tgcctagttc acctggccac tcctagctca cctggccact cctagctgaa 8221 agtgagcaag tggcagagag acacaggaga aacacagctg gctcaggtca gcgccaccct 8281 cccccttctt ccctgtctgt aggatttgac cagtacttca tgactcgatc cctggagaac 8341 aaccgcagga acatctggtt cgccgagttc tgggaagaga attttaactg caaactgacc 8401 agctcaggta cccagtcaga tgattccacc cgcaaatgca caggtgagag ctgcccaggg 8461 tggtgagggt ggggagggtg gggagggtgg tgagggggaa ggcgggaggc cgggttcggt 8521 gcatgggggc cgggcacact ttctctgggc atccctagga caggcgagga acgcatcggc 8581 cgggactcca cctacgagca ggagggcaag gtgcagtttg tgattgatgc ggtgtatgcc 8641 attgcccacg ccctccacag catgcaccag 
gcgctctgcc ctgggcacac aggcctgtgc 8701 ccggcgatgg aacccaccga tgggcggatg cttctgcagt acattcgagc tgtccgcttc 8761 aacggtgagt gggtgcctgc ccccctgcag cagtgaagga cagctggggg gctggatttt 8821 gctgtgaggc ctgggacaac ctcagggtgg aagggctggg tgaggctgag tggtctaatg 8881 ggggtgcctg gggctgagtg ccaaagacat gagacaagtt ccggtcccca atttttaaca 8941 ctgctccctc cccaggcttc agtgttccct gcgctggttt gcacagttca gatgctgggg 9001 agcttctctg atcccttctg ccaggaaccc ccagtgggtc acaccacact cccctccact 9061 gcgcagagct tgagggcatc tggaccctgg gggtgcacgt ggggcacagt taagactgcg 9121 tggggcgcct acccagggac atgccgccca ctctgagtca tgcgccaggc acaggcgtgg 9181 gtctgacctg tcccttggca ccaggcttgg tgtgtggtca gggctcaaaa ctgtgtcttc 9241 tgaggagacc ccagtggact caagctctgt ccacggtttg tggagagatg actttctgtt 9301 caatcactat cctttacaca attcaaaact ccgctcttcc actgaacact cactcaccac 9361 cttctctccg ccatgctctc tgctaatcaa ttccatttaa tatgactcca gagcatctct 9421 gcaagctggt ggaacaagct tgtaagaaaa atagagctca ttgttcaagt cattccaatt 9481 tcatactgtc actcagacct ctctccgtgt ctgaactcat cctctcgcct ggacagtgaa 9541 acttttttca tttttaattg tgataaaata catataatat aaaatgtatc accttaacca 9601 attttttaag tgtagagttc agtgatatta aatacattca gtctgggcgc aatggctcac 9661 tcctgtaatc tgagcacttt gggaggccga gacaggtgga tcacttgagg tcaggagttc 9721 gagaccagcc tggccaacat ggtgaaaccc catctctact aaaaatataa aaattagcca 9781 ggtgtggtgg caggcgcctg taatcccagc tactcgggag tctgagacag gagaattgct 9841 tgaacccggg aggcagaggt tgcagtgagc tgggattgtg ccactgcact ccagcctggg 9901 cgacagagtg agactctgtc tctaaataaa taaataaata ataaattcac aatgttatgc 9961 aaccatcacc accatctatc cccagaactc ttcatctatg aaaccataac tctgtatgca 10021 ttaaacactt ccccatcccc tcctcccgcc agcacctgca accgccattc cacggcccct 10081 ctctatgaat tgtctaccct cggtacctcc tgtggtggca tcagacggta tttgcacatg 10141 tgtgactggc ttgtttcacc gagcatcatg tcctccaggt ccatccattt tatagcatgg 10201 gtcagaactg ccttcctttt gaaggctgaa tcgcatccca ttgtgtgaat ggccccgtcg 10261 tgttcgtcca ttcttctgtc agtggacgct cgggttgctt tcatcttgtg gctatgtgaa 10321 taacgctgct gtgaacgtgg ctgtacaagc 
atttcttgcc accctgcttt ctagtctctc 10381 gagtatctac ccgggagtgg aactgctggg tcatgtgttt aatcttttga ggaggctcca 10441 cactgttttc tgcagcggct gcagcctttt cccacccacc gtgcacaagg gttccgactt 10501 ctccacacgc tcacaacact cgttattttc tggttcttgg ctagaagcca tcccaatggg 10561 tgaaggggga tctccctgcg gcttggattt gcacgtccct tatgagcagt gatgcttgtt 10621 aagctcctcc tttctcctgc tggtccctgg ggcctcccca ggaggcagct ctgctgggct 10681 ggaggaccac cgaggatgac ggactctctc tccaggcagc gcaggaaccc ctgtgatgtt 10741 caacgagaac ggggatgcgc ccgggcggta cgacatcttc cagtaccagg cgaccaatgg 10801 cagtgccagc agtggcgggt accaggcagt gggccagtgg gcagagaccc tcagactgga 10861 tgtgagtgtg cagaccgagc cccgggtggg catgggccca ggatcccgga gcagacgccc 10921 cagcctgtgt tttgtgcgtc ccaggtggag gccctgcagt ggtctggcga cccccacgag 10981 gtgccctcgt ctctgtgcag cctgccctgc gggccggggg agcggaagaa gatggtgaag 11041 ggcgtcccct gctgttggca ctgcgaggcc tgtgacgggt accgcttcca ggtggacgag 11101 ttcacatgcg aggcctgtcc tggggacatg aggcccacgc ccaaccacac gggctgccgc 11161 cccacacctg tggtgcgcct gagctggtcc tccccctggg cagccccgcc gctcctcctg 11221 gccgtgctgg gcatcgtggc cactaccacg gtggtggcca ccttcgtgcg gtacaacaac 11281 acgcccatcg tccgggcctc gggccgagag ctcagctacg tcctcctcac cggcatcttc 11341 ctcatctacg ccatcacctt cctcatggtg gctgagcctg gggccgcggt ctgtgccgcc 11401 cgcaggctct tcctgggcct gggcacgacc ctcagctact ctgccctgct caccaagacc 11461 aaccgtatct accgcatctt tgagcagggc aagcgctcgg tcacaccccc tcccttcatc 11521 agccccacct cacagctggt catcaccttc agcctcacct ccctgcaggt gggtccgtag 11581 gctcccaagg gccagggcag ggagggggac tgtccgcaac gttacaaagc caaaagggcc 11641 aggcacagtg gcacacgcct gtagtcccag cacctttgga ggctgaggtg ggtggatcac 11701 ctgagtccag gagttcgaga ccagcctggg caacatggca tgaccctgtc tctgcaaaaa 11761 atatgaaaat tagccaggca tggtggtgcg tgcctgtagt cctagctact tgggaggctg 11821 aaataggagg atcgcttgag cccaagagtt gaaggttgct gtgagccaag atcgcaccac 11881 tacaatccag cttgggtgtc agagtaagac cctgtctcaa aacaaaaaga taaatgccgt 11941 ttttagagtc actcttttct attttggttc ctgtgttgtt ttattttttt ttttttagac 12001 agagtctcgc 
tgtgtcgccc aggctggagt gcagtggcac tatctcagct ccctgcaagc 12061 tctgccttcc gggttcacgc cattctcctg cctcagcctc ccgagtggct gggactacag 12121 gcgcccgcca ccacgcccgg ctaatttttt gtatttttag tagaaacagg gcttcaccgt 12181 gttagccagg acggtcttga tctcctgacc tcgtgatccg cccacctcgg cctcccaaag 12241 tgctgggata acaggcggga gccctgcgcc cagtgagttt tttttttttt ttaaggccac 12301 aggaaatgga gcccctcccc tcctgtgaca gcagctcagt tagggccgcc atctcccctg 12361 ctgggtcctt gaaggaaaac ggggcagtca ggtggggagt gtgggcctcc agagatcttt 12421 ctgtgcaaca gtgaagttat tttctaaacc aaaatgctct tctacaattt ggctcacata 12481 tggaaacagg tcaaagagct atctgaccgg ttttcctgtt tccacaaggg agtctctgct 12541 gccctgctcc ctggggcagt gtggatgaga ggcacagggt ccaacatttt actgtggaag 12601 agcatcgaat tccatccagt gccagcgttt cgtgctggcc gttaaataat ggagctcctt 12661 tgggggattt tcctaagaca gtaaagttat tacagccttg gagacaactc ttggagtctg 12721 gggcgccctg gatgttttct gtgtgaagag tacgagagac agatggcttg gggagggtga 12781 cctggtgtga cactgcaggg ccagctgggt tctcagggcc aggtggaggg tggcctcagt 12841 gcgaacacct gctcacccat ttggagcact cccggtgctg tccactcccg gcgagcccct 12901 tctctattcg ttccgcacac gccacgtgcc tcgggtcatg cagttctccg tgactaggag 12961 gtgctcccgt cttcctccac gtaacagggc ctgagtgtcc cctggtcagg cccagggtct 13021 ccgtgtgccg cacatcctga gaaggtgccc tctacgtccc ggcctgaggt gctcctgctc 13081 ccccaccctc aggtggagat gcttagaccc cgttggggac ccccagctct cctccccagc 13141 ctcagccact gcccgcccca cactgtaccc tcttttccag atacaaggga tctttaaaaa 13201 cacacaagcc tgcaactggc ctaggtgttt tgccatgact accacgtcat gacgggctgc 13261 aaaatggcat ctcactgtgg cttcaattcg cctgcactta gcaccagtgc gccggccatg 13321 gggctttcct ctcctgggct cagcctgttc ctgtcctttg ctcatttgct gttattttgc 13381 aagagtccgt tccgagttcg ctcgctcccc actccacggg ctgcggcatc tgtgggccgt 13441 tttggtggtt accgctgtgc agccctgggc ttgcattttt gtttctcatg agcaactttt 13501 tttttttcca cctggagcct tgttctagta ggaacttcca aacttatttt ggaagttggt 13561 ctcgatctcc tgacctcgtg atccacttat ttggaaggtc ctttgttcag gggtctgctg 13621 cctttgggag aagcacaaaa agctgctgct tctgggtggg cagatgactg caaagcatcc 
13681 cttccagcag gatcacatgg ccctgctcca ggtgctcggc atggcactgg catgcaggct 13741 ggattccagc tccacccctg cccccaatca ggcgctctcg tacagtgaac aaccagcacg 13801 aacgtgtggg gcagccttcc agggtcccgc ctttatgcag gtggctttgt tcctgttccc 13861 cagctttcct tggcttgagg ttgtgcttcc tgtcatgtga gggagtggta actccatctc 13921 ctccctgtga gggcctgcac agactccaat actcctcggc atcagctccc attcctgctc 13981 ctccccactg agtgctcctt tgtttctctt gcctgtggcg tgccctgatt ttgaagctca 14041 gtacacgtga ggaggctaga gaggcgatgc gagtagggcc tcagttccag agcttccggc 14101 cacatccatc taccttgaag gggcgcagta aatggcattg gtaatggcta gaatgcagga 14161 agaggacacc aagctgcccc ttatgcacac tgcactgatg ttgccaacat atacgggcgg 14221 cacagccctt ctccgcagct gccaggtcac gggcctttgg acggacccct cttgggctgc 14281 ccctgcctgg gctgccatcc cctcttctcc actgagtgct cctgagagcc accacgaaga 14341 ggtggagcat cccctccata ggccagagcc tcaaggggat cctgacccgc aggggcctct 14401 gtcagggctg gaatggattt gtccctggaa agtgctgtgt ctgatgcagt gttggctcct 14461 gcaggtggtg gggatgatag catggctggg ggcccggccc ccacacagcg tgattgacta 14521 tgaggaacag cggacagtgg accccgagca ggccagaggg gtgctcaagt gcgacatgtc 14581 ggatctgtct ctcatcggct gcctgggcta cagcctcctg ctcatggtca cgtgcacagt 14641 gtacgccatc aaggcccgtg gcgtgcccga gaccttcaac gaggccaagc ccatcggctt 14701 caccatgtac accacctgca tcatctggct ggcattcgtg cccatcttct ttggcactgc 14761 ccagtcagct gaaaaggtaa tgaggtcccc aggaggtctc gctgcctgtt tcctgcctgt 14821 caggctggtc gattcattca tgagattaaa tgcttgttcg tgcttttgtt tatcccgtat 14881 ctggctgctt gttcattcac tcattccctt cttactttgt tcattcattc acttttccaa 14941 gcttctaacc cctcttctta acttccactc ttcttttttt ctttttatca ttttcatttc 15001 tttgagagat attgcctcac tctgtcaccc agttacagtt agcttgtttt ttttaattca 15061 agttcttcat atgctttttt tttttttttt ttttttttga gacagagtct cactctgttg 15121 cccaggctgg agtgcagtgg cgtgatcttg gctcactgca agctccgcct cctgggtcaa 15181 gtgattctcc agcctcagcc tcccccgtag ctgggactac aggtgcctgc caccacgcct 15241 gggcaatttt tgtaatttta gtagagatgg ggtttcacca tattagtcag actggtctca 15301 aattcttgac ctcaagtgac ccatctgcct cggcctccca 
aagtgctggg atgacaggcg 15361 tgagccactg cgcccaggct tcatgtgctt cataactgcc aacccattaa tgcgtcagcc 15421 cattgctgca gcatttattc acgtatgcgg cagaatattt tatgcacctg tcatccattt 15481 atttgcccat ctgctcctga agtttgtctc tgcccattta ttcatttgta tgtatttgaa 15541 acgtgctcat tcactgacat gcacccggct cccgcttccc caggtgtctc tctgttaata 15601 acccctcctc acccctccct cctgcccaca ccagagctgc atggaggctg ccagcccctc 15661 aggcgcgggg cccctgctca tactgttgtc ctggtctaag tgctcattcc cagttcccca 15721 actgaaacag actcagctcc agagtatgtg gtgaggactg tgtggagcgg ggcagggaga 15781 gcagggctga gtcccatctg gtcccctctt ccatggccta gatctacatc cagacaacca 15841 cgctaaccgt gtccttgagc ctgagtgcct cggtgtccct cggcatgctc tacgtaccca 15901 aaacctacgt catcctcttc catccagagc agaatgtgca gaagcgaaag cggagcctca 15961 aggccacctc cacggtggca gccccaccca agggcgagga tgcagaggcc cacaagtagc 16021 agggcaggtg ggaacgggac tgcttgctgc ctctcctttc ttcctcttgc ctcgaggtgg 16081 aagctgtata gagcccgggt ccacggtgaa cagtcagtgg cagggagttt gccaagacca 16141 tgctccgcgt cggtggggct ggccttgaga aggaactgga cccagctcta ccccgattcc 16201 agcatgtgag cttcatgctt cctcaccaca gaccagactc gcttcccatg gtgggaaaca 16261 gccaccgaga aggttctagc tctagaaagg gactaaactt attctctcat ccgaagtcca 16321 aagaggatga tgaagccctg ggctttgcct ggtttgcggg agatttcctc ccctcagtca 16381 acccccataa cctggggatt gggcagtgtg gaagaacgtg tagaccccag aatgaaacat 16441 ggggttggag tggaggagga gctgtctcag caagaggaga cctggggctg tgcatctgga 16501 tggaggcact caggcctggg taggattcct ctggcacgga gggagagacc ctgggtgaga 16561 cccctgtgag catgggaagg gcctgcagtg ggcgcgggag tgagctgagg aactggggtg 16621 cgcccccatg agattcccaa tgccatgggc tttcccccat ccccccggga ttgggcaagg 16681 tcagacttag agtacagctg ttttcctccc ctctgtgtac tcccttaaat caccccaacc 16741 ttggccaggc atggtggctc acacctgtaa tcccagcact ttgggaggcc gaggcaggtg 16801 gatcacctga ggtccggagt tcgagaccag cctggccaat gtggtgaaac cctgtctcta 16861 ctaaaaatac aaaaattagc caggtgtgat ggtgggtgcc tgtaatccca gttacttggg 16921 aggctgaggc aggagaatcg cttgaacctg ggaggtggag gttgcagtga gctgtgattg 16981 tgccactgta ctccagcctg 
ggtgacagag cgagactctg tctcaaaaaa acaaaacaaa 17041 aaaacaccaa aaaaaccccc aaacctgaag aaattcagat acacgtgtgt aatgttagtg 17101 atgtgagaac aaggagcagg ggtgcatttg tgttgtgttc gggttgggga tgggtttagg 17161 agctccaggt tgggagcagt gacagagagt catggccgtg gtgagggtga atcccaagtg 17221 gatggctcag gacgggtatg gaaacccttc attcctcata ggtactggga agtccatttg 17281 caagctgagc gccaggcctg gggaggaaga ggcttgggct gcagatgcac gcacatttgt 17341 ttttcactga tagtttttac aaaaagcttg gtttaagtta tggaatttta tgtccctggg 17401 agtagaattt acatttgtta aattgaccac tgtttaagat cagtatacat tctctagtct 17461 gtgatgtctg gagctagttt tgagggtgaa ccacacttta tccaacatac aaactttccc 17521 atgcagcttc tctggtgcgc agttggtttt gaccgtggga ctaggtgctt ctgcaggttt 17581 taagtaatta acttaaaagc ttctcctctg agaaacattt ctgttgcgct actgactctc 17641 cttctccaca tttgttgtgt tcctagggct tctctatagt gcacattagg acgtttcatt 17701 tgttgctgaa tgctttccag aattatttat tccatagggt ttctctcctg tgcagctctc 17761 tcatgggtaa tggggcgtgt tttcttgcca aaggcggttc caccctcgtg attgtatagg 17821 gctcttctcc tgtatgaact ctgagatcag tgagctctga tctccaaggg aaagttttcc 17881 tgcatttgct gttttctcat gtctctccca gtgtgaattc tctggcttct agctgaaaac 17941 ttttccacag ttttacattc atgtggtttt ctccactgtg aactctgtga ttcagaatca 18001 gaagcagttc ttagtagagg catttctaca ctgattgcac tgaggatatc tccccagtgt 18061 gaagtttctg gcatagagtc ctggcttccc gcagacgact ttcacactct gccatgttca 18121 tgcctgtggg cctctctggc aggaactctg atgcaccgcg aggcccatgt actcctgtgg 18181 ctttctcaca ttcggtctac ttgcagggta tctccacagc atgcaccatt ctgggtacag 18241 ggggacatcc tctgttactg aagatgttgt catatttagt accttcacaa ggtttctctc 18301 cttccagaat tttctgatgt acacaaataa ctgacttcca caagagggct tttccacact 18361 cggtgtgtgc atacagtttc tgcctgtgat catttcttta tgttattatt ttattttttc 18421 gagatagggt cttgctcaat ttcttaggct ggagtgcagt ggcacgatca tagctcactg 18481 aagtttcgac ctgggctcaa gcaatcctcc cgcttcagcc tcctgagtag ctggtgcgca 18541 cgaccatacc cagctaatgt tttatttttt gtagagacga ggtctcacta tgttgcccag 18601 gctggtctcg aacttctgag ctcgagcgat cctcctgcct ccacctccca aagtgttcgg 18661 
attacaaacg tgagccatcg cacctagcct ctttgatcat ttctgtggtg ttcagtgggg 18721 gttgacagct ccctaaagat tttcctgttt ttttgcatgc atgggtttga attctttgag 18781 gtccaattta tttggacccc tgaataaagt tttgtgggtt ttcttctatg tgtggaatta 18841 tataggcatt cttccagtgt ggtttctctt atgtcgagtg agagctgacc tgcaccgaag 18901 tttgtcccat ttgttgccct tgaattatct gtatgaatta tatgttccag tgaaaatgga 18961 gttctgggtt ggaggcttat tccatgttta cacaattaaa attgcagtgt tcctctctgg 19021 gatgagagct ctaaagcaga gtaagattac gttctgatgt aagctttaac cacctattta 19081 taaggtctca cctgtggtcc actgtgttga gacttctaca gaagagcttc tgtatagtaa 19141 ccattttctt aggctgtctc acttgtgtga atcttctgac acatttatta tagctttgtc 19201 ccatttctta tcctttttgc tctttagaaa tttcccttta atttattaca ttcattgctt 19261 actgtaaaga gtccaggtaa ctgactttaa ttcaagttac ttcctgttca ataaatttaa 19321 cttttcccaa aagtttgact taaggtcttc ggttgccttc ctttatacag ataattgttt 19381 cataaaacct tccttcaaaa atggaataaa atgcacaatt ccaagtgaat cttctcatca 19441 ttttatagaa tgaaacttcc ttcccaaata ctctttgtta taatgatacc tttgtttcta 19501 aacgtgaaaa attccaaact aaatattttc tatctcttgg gctttgacag gggaaaaact 19561 gctgtagtat ggacaaccag agtggaatac tgatgcttat tacaagtagc ttggggggtt 19621 tcaaaaatga cattccaccc tgtggagaaa aggctgaaga aatgtaaggg ataggaacca 19681 agaaggaata aggccctgat aagaagaaac tgaccagttg gataggaatt ctaaaaggca 19741 cgcaagggtg tagagaagta aattagtaaa aggtctggat ctggctgagt gcagtggctc 19801 atgctgtttt cttggcactt tgggaggctg aggtgagtgg ttcacttgag gccaggtgtt 19861 caagaccagc ctggccaaca tggcgaaaca ccatctcggc aaaacaccat ctcttctaat 19921 ttcacaaaat tagctgggca tggtggcaca tgtctgctat cccagttact cggggggctg 19981 aggctggaga atcgcttgat cccgggaggc ggaggtttct gtgagttgag atcgtgccat 20041 tgcactccag cctgggcaac aaaaaaaaac tctgtctcca aaaaaaaaaa aaagaagact 20101 gaatttgaaa attttaagcc tcatcacttc atttgtggat tactgtttta ctctattcag 20161 gctgctataa aagagtacca tagactcagt ggcttctaaa aaacagacat ttctcatagt 20221 tctggaggct ggaaagtcca agagcaagac accagtggat ttggtgaggg ctgcctgctt 20281 gttcttaaac agcactccct tgctgtgtcc tcatatggtg ggaagtgtga 
gggcaccctg 20341 gggtctcttt tataagggct ctaatcccat tcccgagggc tctactttca taacctccca 20401 aaggccccag ctccaaatac catacaatgg gggttaggat ctcaccatat gcattttgga 20461 gcaacatgaa tattcaacct atagcaatta ttctgtcaga atactaacta gtctttctgc 20521 ttcttgtatc ttttccctct tattcatcct acatagtatt atcaaaatca ccgttccatg 20581 gcaccgtggt gctgtgtccc attgctaggg tcccagccat ctcctgtcac ctccatctgg 20641 ctgtggttta ttctcaaatt tacctgcaga ctctcctaac agaaacagca gagagggcac 20701 attagtctag gggaacacgg ccccaggcac tgtgcactga tacaatctgc cagattgggt 20761 ctggtcagag ctctcaaata ctgaatatac ctgtagtatc aacatctgag ttattttttc 20821 cattctggtc tgcagacagg taagacaggt aaaacttcca gggcacacag accttgtctt 20881 gcccatgccc tgtgtacttc tttaagtcag ctttcacagc ttgaccttgc actgtgtcat 20941 ggttaagtct tcttgaggtc aactccagct ctgctagatg agtgaccttg ggcaagatac 21001 tcattcctcc cacagtcact aaaagccaga gtctccttaa tgcaccctgc agaggtgccc 21061 aaattgctgg ccccattagc ctaccattga tcctcccagt cctaccaatc actgcccaca 21121 cagaccccac tcattaaccc agccccacag ctgagcagtc ccaggacccc atctcactcc 21181 cagctagccc ctcctcactg ccccccccag actgcttatt cattttggtc ccacaggtac 21241 ccttgggtgc ttactctgtg ctaagcacct gggatacatc agcagacaaa atagacagga 21301 actgcttccc tcataggact tgcaccctag tggtggagac acacaataaa caacatacaa 21361 aactaaccaa taaatgacac tgtatgttaa agggtgatgt gctgtggaca aaagaaaaca 21421 gcagagcagg gtacagagaa tggggcagct ggcgggggat ggctgcaatt ttgaataggg 21481 caatccagta caggcttcat ctgaaggagg tggggagtga gccatgggga tatctggggg 21541 atggggtcct gcaggcggag tggcaagtgc aaggccaagg gcaggggtcg gcctggtgtt 21601 tggggaccaa gcaaagagag tggcatggag gagtagggtg agcaggggag aaaatgtgag 21661 gaaggcagca gagtggagca gtcagggctc tccagagaac agagcccaaa ggatcc // LOCUS HSU82827 2014 bp DNA PRI 08-FEB-1997 DEFINITION Human transcription factor HOXA13 (HOXA13) gene, complete cds. ACCESSION U82827 NID g1832352 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 2014) AUTHORS Mortlock,D.P. and Innis,J.W. TITLE Mutation of HOXA13 in hand-foot-genital syndrome JOURNAL Nature Genet. 15 (2), 179-180 (1997) MEDLINE 97172976 REFERENCE 2 (bases 1 to 2014) AUTHORS Mortlock,D.P. TITLE Direct Submission JOURNAL Submitted (20-DEC-1996) Human Genetics, University of Michigan, 3703 MSII, Ann Arbor, MI 48109-0618, USA FEATURES Location/Qualifiers source 1..2014 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" gene 109..1987 /gene="HOXA13" CDS join(109..1030,1743..1987) /gene="HOXA13" /codon_start=1 /product="transcription factor HOXA13" /db_xref="PID:g1832353" /translation="MTASVLLHPRWIEPTVMFLYDNGGGLVADELNKNMEGAAAAAAA AAAAAAAGAGGGGFPHPAAAAAGGNFSVAAAAAAAAAAAANQCRNLMAHPAPLAPGAA SAYSSAPGEAPPSAAAAAAAAAAAAAAAAAASSSGGPGPAGPAAAEAAKQCSPCSAAA QSSSGPAALPYGYFGSGYYPCARMGPPPNAIKSCPQPPSAAAAAAFADKYMDTAGPAA EEFSSRAKEFAFYHQGYAAGPYHHHQPMPGYLDMPVVPGLGGPGESRHEPLGLPMESY QPWALPNGWNGQMYCPKEQAQPPHLWKSTLPDVVSHPSDASSYRRGRKKRVPYTKVQL KELEREYATNKFITKDKRRRISATTNLSERQVTIWFQNRRVKEKKVINKLKTTS" misc_feature 220..507 /gene="HOXA13" /note="encodes poly-alanine repeat region" BASE COUNT 356 a 693 c 614 g 351 t ORIGIN 1 cgcgatccgc ccccctcgct cgggccccgg ccccgcgccg cgcgctcttc acttcttggg 61 ggctttttaa aacagcgcca ctggggtctt ctccatgcgg ctcgggctat gacagcctcc 121 gtgctcctcc acccccgctg gatcgagccc accgtcatgt ttctctacga caacggcggc 181 ggcctggtgg ccgacgagct caacaagaac atggaagggg cggcggcggc tgcagcagcg 241 gctgcagcgg cggcggctgc cggggccggg ggcgggggct tcccccaccc ggcggctgcg 301 gcggcagggg gcaacttctc ggtggcggcc gcggccgcgg ctgcggcggc cgccgcggcc 361 aaccagtgcc gcaacctgat ggcgcacccg gcgcccttgg cgccaggagc cgcgtccgcc 421 tacagcagcg cccccgggga ggcgcccccg tcggctgccg ccgctgctgc cgcggctgcc 481 gctgcagccg ccgccgccgc cgccgcgtcg tcctcgggag gtcccggccc ggcgggcccg 541 gcggcggcag aggcggccaa gcaatgcagc ccctgctcgg cagcggcgca gagctcgtcg 601 gggcccgcgg cgctgcccta tggctacttc ggcagcggct actacccgtg cgcccgcatg 661 ggcccgcccc ccaacgccat caagtcgtgc ccccagcccc cctcggccgc 
cgccgccgcc 721 gccttcgcgg acaagtacat ggataccgcc ggcccagctg ccgaggagtt cagctcccgc 781 gctaaggagt tcgcgttcta ccaccagggc tacgcagccg ggccttacca ccaccatcag 841 cccatgcctg gctacctgga tatgccagtg gtgccgggcc tcgggggccc cggcgagtcg 901 cgccacgaac ccttgggtct tcccatggaa agctaccagc cctgggcgct gcccaacggc 961 tggaacggcc aaatgtactg ccccaaagag caggcgcagc ctccccacct ctggaagtcc 1021 actctgcccg gtaaatgacg actattccca gccctggtct tccggctctg ctccagcttc 1081 ttctccgctc gcacccgggc gatcccgggt gcgtttctgt tctcttcctg gtctgcccta 1141 gcggctctgc acccctggga gcccgagcat ggctggctgg gtctgcctgc actgcctcga 1201 gttgagctgg tccctggctc tccctgggtg aggggtggct tgtggagacc tcggatagct 1261 tccctctccc tctgcgcccc gccctcccca gtcccagaaa ccaatttaag ggtgagaaat 1321 cgaccagaaa acagctcccc aaatcgcccc tccctattca ttctctcaaa aatggcttca 1381 gtgtagaagc ttcgagtatt gggacgggca cccagaaagg aggcaggcac agaagtgttg 1441 taccttgagc ctggcgctaa ggtgtgggcc gttggaccag gctatcactc gaggctgcgt 1501 acgcgctgct cctgcaggat ggccgggttg gggaagtcac tggagccctg ggtgatttca 1561 tttcagttca gaactaacta ccttccccac tgaccctcta ggctttagca gaagacagga 1621 ttgtacagcg ggtggcaaag agcagccggg cgctgcaagg cgggtggctc agatcgagct 1681 gtcgcctatg ccctggctgg ggtccgatcc ctgtgtaact tgccttctcc cttgtcttct 1741 agacgtggtc tcccatccct cggatgccag ctcctatagg agggggagaa agaagcgcgt 1801 gccttatacc aaggtgcaat taaaagaact tgaacgggaa tacgccacga ataaattcat 1861 tactaaggac aaacggaggc ggatatcagc cacgacgaat ctctctgagc ggcaggtcac 1921 aatctggttc cagaacagga gggttaaaga gaaaaaagtc atcaacaaac tgaaaaccac 1981 tagttaatgg attaaaaata gagcaagaag gcaa // LOCUS HSU86758 6803 bp DNA PRI 29-APR-1997 DEFINITION Human netrin-2 like protein (NTN2L) gene, complete cds. ACCESSION U86758 NID g2052392 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6803) AUTHORS Burn,T.C., Connors,T.D., Van Raay,T.J., Dackowski,W.R., Millholland,J.M., Klinger,K.W. and Landes,G.M. 
TITLE Generation of a transcriptional map for a 700-kb region surrounding the polycystic kidney disease type 1 (PKD1) and tuberous sclerosis type 2 (TSC2) disease genes on human chromosome 16p3.3 JOURNAL Genome Res. 6 (6), 525-537 (1996) MEDLINE 96425699 REFERENCE 2 (bases 1 to 6803) AUTHORS Van Raay,T.J., Foskett,S.M., Connors,T.D., Klinger,K.W., Landes,G.M. and Burn,T.C. TITLE The NTN2L gene encoding a novel human netrin maps to the autosomal dominant polycystic kidney disease region on chromosome 16p13.3 JOURNAL Genomics 41 (2), 279-282 (1997) MEDLINE 97288529 REFERENCE 3 (bases 1 to 6803) AUTHORS Van Raay,T.J., Foskett,S.M., Connors,T.D., Klinger,K.W., Landes, G.M. and Burn,T.C. TITLE Direct Submission JOURNAL Submitted (24-JAN-1997) Human Genetics, Genzyme Genetics, One Mountain Road, Framingham, MA 01701, USA FEATURES Location/Qualifiers source 1..6803 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" mRNA join(2930..4060,4132..4320,4416..4565,4699..4749, 4860..4934,5187..5576) /gene="NTN2L" /product="netrin-2 like protein" gene 2930..5576 /gene="NTN2L" exon 2930..4060 /gene="NTN2L" /number=1 CDS join(3133..4060,4132..4320,4416..4565,4699..4749, 4860..4934,5187..5536) /gene="NTN2L" /note="neural chemokine" /codon_start=1 /product="netrin-2 like protein" /db_xref="PID:g2052393" /translation="MPGWPWGLLLTAGTLFAALSPGPPAPADPCHDEGGAPRGCVPGL VNAALGREVLASSTCGRPATRACDASDPRRAHSPALLTSPGGTASPLCWRSESLPRAP LNVTLTVPLGKAFELVFVSLRFCSAPPASVALLKSQDHGRSWAPLGFFSSHCDLDYGR LPAPANGPAGPGPEALCFPAPLAQPDGSGLLAFSMQDSSPPGLDLDSSPVLQDWVTAT DVRVVLTRPSTAGDPRDMEAVVPYSYAATDLQVGGRCKCNGHASRCLLDTQGHLICDC RHGTEGPDCGRCKPFYCDRPWQRATARESHACLACSCNGHARRCRFNMELYRLSGRRS GGVCLNCRHNTAGRHCHYCREGFYRDPGRALSDRRACRACDCHPVGAAGKTCNQTTGQ CPCKDGVTGLTCNRCAPGFQQSRSPVAPCVKTPIPGPTEDSSPVQPQDCDSHCKPARG SYRISLKKFCKKDYAVQVAVGARGEARGAWTRFPVAVLAVFRSGEERARRGSSALWVP AGDAACGCPRLLPGRRYLLLGGGPGAAAGGAGGRGPGLIAARGSLVLPWRDAWTRRLR RLQRRERRGRCSAA" exon 4132..4320 /gene="NTN2L" /number=2 exon 4416..4565 
/gene="NTN2L" /number=3 exon 4699..4749 /gene="NTN2L" /number=4 exon 4860..4934 /gene="NTN2L" /number=5 exon 5187..5576 /gene="NTN2L" /number=6 BASE COUNT 945 a 2508 c 2184 g 1166 t ORIGIN 1 ggagctcggt tggaaacccc ccgaggcata ataggcgctc gataaatgtg caataggtga 61 acatgtggtg gcttgcaggc gtctgggggg agacagcagg ttctgggctg ggcagggaat 121 tattggatca acgggcatct tacaggaaag actctcagct ccctgccgcc taggactgtc 181 cagcccatct atgccctctc cccagcctgt gccccaaagc tggagctgcc actctagggg 241 tgaggggtgg ggtggggagg gggaggcgaa gcactgcggc ctgagttgca ggtgggggga 301 ggggaggcgg agcttctttg ttgcagaagg tgccaggagg gggcagggcc agtggagagg 361 tgggaggtgg gagaggcccc agccaggggc tgggacaggt ggctgggtcc ctggggagca 421 ataagtcccg cttgggcgct gtggggaggc ccttcctaac tcccaaacac catctgtgag 481 ggctgggggt gggggcagag tagcgtgtgc agaggactgt tcctggggag aggccctgtg 541 accagcggcc tcctccctgg ggagctggcg gtacaatggc cctctgggcc cacggcctcc 601 cgccgctgct gctgacccag atgaacaatt ggggcagggc tgagccccag gcacctactt 661 tcccccaccc cagaagccac cagacgttct gcagacccca gtcctggctc acagggaagc 721 tgagctggag acaaagccag cccctctgat gagggtggaa gaggctgctg gccactgtcc 781 ctcttgcagc ctggctggca gccagtctgg cagtggccct gacgtccaga gacagcttgg 841 gtttccccag aggcttgtct ctggccagtg ggacccctct gtcaggcctg ggcttttctc 901 tccactgtcc cagaatgatg atctcagccc ccatagtccc cccagggttc ctcccaccct 961 tagggtgggg tgtcgggggg tgggggttgg gagccagaag gaccttgaag agggtggttg 1021 ggacgtttca ggttctaagc ttgacccaca gagcggagcg tgagccccgt caggttgagg 1081 tccctcaact tgtaaaggac acaattccat tctctttatc aggaagctga ggggcagggg 1141 ccctgtggca gagagagagc cccttagccc tctctgttca gtcctccggt gcccccatcc 1201 ctgtgcatct gtggctgtca catgcagatg tgtggcaagg agaaggtgcc caccagccag 1261 tgtcagttgc tccaggagcc aagccaggtg ccctatcacc ctgtcttccc gttcctcccc 1321 tccatggtca ggccctcctg ctccctcctc tggtccttca gtttccccta ggaggcttcc 1381 gtgtcctcct gcccctcctc tccccaacag cgggatgcgt ctacctctcc attctcttcc 1441 tcctggtcct tgctcatctc tggtcgtgtc cagggtagca cccacgtggc ctcctccacc 1501 agctgcaggc ctggcctccc atctgaaacg gggcattcag 
gcctcgatgc tggccctgca 1561 cggaacttgt tccctgcccc tccctgggat gcttggcctc ctctgtcaag gacctgaaag 1621 tcggagggga ggaggtttct ctgaccagag ctgttcctgg accctctttg gtggtgtcgc 1681 tcccaggcac agctacccca tccccagcta gtccccaggc cacccagctg ggcttctgcc 1741 tcagtttccc tgcccaaacg tgctgtgacg tagggcagtg ggctccgggt tgcgaccagc 1801 cccttcccat gattaaaccc tactccctgc ccctgcagag gggtcctcaa cagctaacca 1861 agcccccgaa ccccaagaag ccaccccatc ccaccctcca gcttccatgt cctccctgcc 1921 agctgggccc gtggcagagg tgcccctaga aacttgcaga cccagggagc tttgggatca 1981 gaatctggcc tggtgcaggg gatgctggcc tcatgtctta gcccagctca ggcccatggg 2041 ggtgcccccc ttcctcaaca tgggcaggag acactccaat ttgtgcagct ctcgacttgg 2101 gcctgatgcc acttgagact catcaaatcc aacagcttca gagcgcgtgc tgagtaacag 2161 gcatctggca ggtgaggaaa caggagccca agacatgcag ccagaaatgg ggcagttgga 2221 ttcaaaatta gacctgaccg aatcctgggt tccttctact cgagtagatg ctgctttggg 2281 gatgaccctt caactggtgg ttacttggct tccctacctg gggaacatcc agggcctctg 2341 ctgtcagacc cggggccttg cctgcctgat ggtcttcagg gaggaggcga cccagacccc 2401 cgtccagcac gtggcacagc cccaggagca gtaaagacct ggctgtgggc ccaggaccct 2461 gctgggtggt cccccacggg ctgcgaaggc tgagctgccc ccctccagac ccctcccgcc 2521 agcgcattcc tggctccccg gcccctcccc tggctcccgg gcctcccagc ccccttcccc 2581 gctggcccag cccgcgtctg aatctgcttc tgattccagc tctgcgatga ggccccctcc 2641 cctcccctgc ctccttcccg acccgagcag ccccgccccc ggctgggccc gggcttgcgc 2701 ctgctgcgcc ccccaccccc tcctggcaca gctcgtccgc cctcgctgca gccgggagga 2761 ggcggcggcc cgtgcaccgc aggccccgcc cgcccacggc ccttcccggg aggccgggag 2821 acctgctccg cccggccctc ggtgggtgag tgcgagcggc gggtggggcc tccgcgggcg 2881 gaggcaccgg gagcgggggc gacgcctgtc atcgctctag gcccagcggg aggacgcgcc 2941 aacatccccg ctgctgtgct gggcccgggg cgtgcccgcc gctgctccca cctctgggcc 3001 gggctggggc cgcccggggg ccctgttcct cggcattgcg ggcctggtgg gcagagccgc 3061 ggagagggct tcttttcccc aagggcagcg tcttggggcc cggccactgg ctgacccgca 3121 gcggctccgg ccatgcctgg ctggccctgg gggctgctgc tgacggcagg cacgctcttc 3181 gccgccctga gtcctgggcc gccggcgccc gccgacccct gccacgatga 
ggggggtgcg 3241 ccccgcggct gcgtgccagg actggtgaac gccgccctgg gccgcgaggt gctggcttcc 3301 agcacgtgcg ggcggccggc cactcgggcc tgcgacgcct ccgacccgcg acgggcacac 3361 tcccccgccc tccttacttc cccagggggc acggccagcc ctctgtgctg gcgctcggag 3421 tccctgcctc gggcgcccct caacgtgact ctcacggtgc ccctgggcaa ggcttttgag 3481 ctggtcttcg tgagcctgcg cttctgctca gctcccccag cctccgtggc cctgctcaag 3541 tctcaggacc atggccgcag ctgggccccg ctgggcttct tctcctccca ctgtgacctg 3601 gactatggcc gtctgcctgc ccctgccaat ggcccagctg gcccagggcc tgaggccctg 3661 tgcttccccg cacccctggc ccagcctgat ggcagcggcc ttctggcctt cagcatgcag 3721 gacagcagcc ccccaggcct ggacctggac agcagcccag tgctccaaga ctgggtgacc 3781 gccaccgacg tccgtgtagt gctcacaagg cctagcacgg caggtgaccc cagggacatg 3841 gaggccgtcg tcccttactc ctacgcagcc accgacctcc aggtgggcgg gcgctgcaag 3901 tgcaatggac atgcctcacg gtgcctgctg gacacacagg gccacctgat ctgcgactgt 3961 cggcatggca ccgagggccc tgactgcggc cgctgcaagc ccttctactg cgacaggcca 4021 tggcagcggg ccactgcccg ggaatcccac gcctgcctcg gtgaggcctt ggagggtggc 4081 ctggggacct tggacacaac cagcctgccc ctgacccatc cctccctgca gcttgctcct 4141 gcaacggcca tgcccgccgc tgccgcttca acatggagct gtaccgactg tccggccgcc 4201 gcagcggggg tgtctgtctc aactgccggc acaacaccgc cggccgccac tgccactact 4261 gccgggaggg cttctatcga gaccctggcc gtgccctgag tgaccgtcgg gcttgcaggg 4321 gtgagccacc accggccacc tgcaggccct caccctctga cttcccagat ccccagacag 4381 gcttctgacc aggcccttcc cacctctgtc ctcagcctgc gactgtcacc cggttggtgc 4441 tgctggcaag acctgcaacc agaccacagg ccagtgtccc tgcaaggatg gcgtcactgg 4501 cctcacctgc aaccgctgcg cgcctggctt ccagcaaagc cgctccccag tggcgccctg 4561 tgttagtgag tgaccctgcc ccgcctcagc caccaagcca aggccacccc agctccctgc 4621 tgttgtcccg tctattcccc gagccctgca gatctctctg cccctccatc gcaggccatt 4681 ctccctccct ctctgcagag acccctatcc ctggacccac tgaggacagc agccctgtgc 4741 agccccaggg tgagtggaca caggacaggg ccccagactg gcatgacttt gggggagggg 4801 gctctgggag gagagggtgg ggaaagggag tctgtgccag cctcccacct tctacccaga 4861 ctgtgactcg cactgcaaac ctgcccgtgg cagctaccgc atcagcctaa agaagttctg 
4921 caagaaggac tatggtaggt gccctcaggc ctcccgcgga ccttcccacc ttcctcctct 4981 ccctaccttc cctcctccgc cagcttcccc ttggaacgcc ttgacccttg ctgggcccca 5041 aggcccatcc tcatccctca ggtcctccac gggcagcgac cccgcccctt cagcccccac 5101 tgccctcctg gtgtcctccc cgtgcctccc cctaccgcgg gcaggccgcc ccttcctgac 5161 cccgccccct ctcgctctcc ccgcagcggt gcaggtggcg gtgggtgcgc gcggcgaggc 5221 gcgcggcgcg tggacacgct tcccggtggc ggtgctcgcc gtgttccgga gcggagagga 5281 gcgcgcgcgg cgcgggagta gcgcgctgtg ggtgcccgcc ggggatgcgg cctgcggctg 5341 cccgcgcctg ctccccggcc gccgctacct cctgctgggg ggcgggcctg gagccgcggc 5401 tgggggcgcg gggggccggg ggcccgggct catcgccgcc cgcggaagcc tcgtgctacc 5461 ctggagggac gcgtggacgc ggcgcctgcg gaggctgcag cgacgcgaac ggcgggggcg 5521 ctgcagcgcc gcctgagccc gccggctggg cagggcggcc gctgctccca catctaggcg 5581 cacgttcacc ctgtgccttc gcctgccaag gagtccttgc tcgcgtcgcg cgtgtcgcca 5641 cctgggccgc cgccccgtcc ccgccggcag ctccctcggt acctcccgtc tggccctggg 5701 gggatgtgac cggcgcacgg acagcccgcc ccgcacagag gcagatgata tggcacaccc 5761 ggaggacccc atggtctccc gccctctggc tgtcggccct gtcccagggg cactgggata 5821 cccggaaggc tgtgaatcct tcgtgatgcc gggccctctc ggggatctca gatcatcccc 5881 ggggccgctg tgatgcaccc ccacctgtgc ggcgacccgc caggagcgca ctgacctccc 5941 caaagactgt ggccaccgca ggcgccttgg acccccatgg gggacagggc gtcccctgcc 6001 tcctgcagcc ccacgagggc ggcggccttg gccctgcggc tgggcgtccg cgtccgggcg 6061 ccccgcggcg tctgctgccg ggtcccgtaa ctttcttggc cgcctgtgtc cccgtctgcc 6121 ggctccgtcc ggccgtccct ctctctgccg cgtctctgac cctcggcgcc acagctcctc 6181 agctcagggc ccgtcccaga acctccttcc agcccttctc ccccgactcg ggaagggacg 6241 tcgtgcccac gcggttccgg atccacgcgt gacccggccg gaccgcgact ccgacaggcg 6301 gctgtccggg cccccgatgc cctcggcagg gccgtgccac cccccgcccc ttgttgtccc 6361 cccgggaccg gcactgccgt ttgcctcctc tccgcacggg accggttccc ggccggcccc 6421 agcttccgcc gctgcggccg ccgaccgtca gcgcgcatgc ccagagccgg gcaggccgga 6481 gccccgccgg ctctccgggg tgggcacagg gcgacagctc ggcgggggcg gggccgagca 6541 cgcgcgtgcg cagaaaggcc ggcgcggcag gctgaggaga aagcggcgcg cggaggtggg 6601 
tgcgctcggg gcgtgcgggg ggcgcgcggc ggggtggcgg gtggcggggc cgggtccccg 6661 ctgtcaccgc ggtcggcgcg tgctgggggc gggagcgtgg gggccgggct gcgtgcccca 6721 ttcgaggcgg ggatccccgg ccacgcgcgg gttgggggct ccagagcccg gcaccgcccg 6781 gcgctgcagc tgcggcttgg cct // LOCUS HSU89387 13359 bp DNA PRI 15-JUL-1997 DEFINITION Human RNA polymerase II subunit hsRPB4 gene, complete cds. ACCESSION U89387 NID g2253634 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13359) AUTHORS Khazak,V., Estojak,J., Majors,J.A., Sonoda,G., Testa,J.R. and Golemis,E.A. TITLE HsRPB4, a partner protein for hsRPB7: evolutionary conservation of RNA polymerase II subunits implicated in stress survival JOURNAL Unpublished REFERENCE 2 (bases 1 to 13359) AUTHORS Khazak,V. TITLE Direct Submission JOURNAL Submitted (12-FEB-1997) Basic Science, Fox Chase Cancer Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..13359 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R7it1" /map="2q21" /sex="female" /cell_line="HeLa" /cell_type="epithelial" /tissue_type="cervical carcinoma" /chromosome="2" mRNA join(1330..1425,6371..6551,8791..8886,11294..12822) /product="RNA polymerase II subunit hsRPB4" exon 1330..1425 /number=1 CDS join(1353..1425,6371..6551,8791..8886,11294..11372) /note="partner protein for hsRPB7" /codon_start=1 /product="RNA polymerase II subunit hsRPB4" /db_xref="PID:g2253635" /translation="MAAGGSDPRAGDVEEDASQLIFPKEFETAETLLNSEVHMLLEHR KQQNESAEDEQELSEVFMKTLNYTARFSRFKNRETIASVRSLLLQKKLHKFELACLAN LCPETAEESKALIPSLEGRFEDEELQQILDDIQTKRSFQY" intron 1426..6370 /number=1 repeat_region 2410..2690 /rpt_family="Alu" repeat_region 2893..2937 /rpt_family="Alu" repeat_region 4068..4361 /rpt_family="Alu" repeat_region 5699..5914 /rpt_family="Alu" exon 6371..6551 /number=2 intron 6552..8790 /number=2 exon 8791..8886 /number=3 intron 8887..11293 
/number=3 exon 11294..12822 /number=4 BASE COUNT 3443 a 2826 c 3230 g 3858 t 2 others ORIGIN 1 ggatcctaag gatgtgacac tggttttcaa caacatgctt agagaactca tgaagtggat 61 tgggtgtcaa cccagtgaac atgtttttat ttaatttatt ttttgaagtt tatgtggtga 121 tggtgtggct ttccgaaatg ggcaaatatt cagaaaatct tttgcatttt cttctgtcag 181 gaatggggaa ggggagtggg ggcacaatct gagaaaggac acctgtgctg ttctaggcat 241 cgctggcaag tttgtgggaa gggatgggca agggtgagtg ggtttgctcc acaccgtcct 301 gtgctgctcg agaggacctg ggacgtgcga gggaaacgtg ggtgacggtg cctaggctgc 361 ggcccttcac tgctgtgctg ggttcctgca gcctgctacg tttcccttgg caatgtaaat 421 gaagatggag gggtcgtttc gtgatttcct gctgctgaga ataaatgtct tgttaaaaac 481 gtggcaacgg ttactcttag gtgccatgga tcgatgtcag ggtggtcagc tctggactaa 541 gccacccacc tccaatttgt acaacagtat tgatacatag ggctacactc attactgttc 601 aagtgttcta tgttaagagt tgtgtttaat ttctaaagat taaaaaaagc aaaaaaattg 661 gtgctaaacc ttcacccctg agcacgctca gtgagactgg tcatgcaagc atttacagtg 721 ccatgctcct caagccgatt ttttcttgta gaaatgttgc cctatttgtc ttctccaatg 781 tatggtatgt tattttattt tattatttta ttttatttta ttttatttta tcgagggggg 841 gtacgatacc tgccatttaa gaaaatgaat agaaaatttt aaaacccgag aaatggggga 901 aaaaaaatca gtgcacaaga attgggctgg ttaggcccag caccacactg aagtgggctc 961 agtggttttt ggagtgaaga agccttactc cctgcacatt ccctcatgct cccacacaag 1021 tccagcaatg gaaatgcttg ggttcctctt gctttgtcag gggactcagg agtcgaccaa 1081 gggaaaccat ttggccccgt gaggaatggg cattgtcagt atccgtcctg aacggggcct 1141 agtcaggaag cggtctagaa gtgtacggtc acggtcgcct catgaaagtg tgtagcaggt 1201 ggctctcagg aaaaatacca agtctggatc atccatgtgg cagctttgca tagggagagg 1261 atagctccga actggaactg aactgccttc tctgcacgct tgaccaaagc agtgatgaag 1321 gcgctggtgg tggcgcgcgg cgcggcgcgg cgatggcggc gggtggcagc gatccgcggg 1381 ctggcgacgt agaggaggac gcctcacagc tcatctttcc taaaggttgg gctcggggct 1441 gccaaactcc cccgcgccac ttcgcgtggt cgccgcaggc ctggcgttat gcgcgcttcg 1501 cccaaggccc tgcctaagcg gcggcttggg caagccccac cgcggcgtgg ggctggggag 1561 ggaacatggc cttgggaggg accatggcct tgggagggac cgcctgggca actgggttat 1621 
tttatgtgat aaactgcgaa gttccgggga cctgtgtcaa agataaacaa agccggacac 1681 tcaggtggta aggacagatt tttaaacagt aatatactgt tgcactaggg aaaagagccc 1741 agcgtgaacc gaactcaact tcgattagta cagaacctct gggcgtttta acggagaatt 1801 agggaataag gatgtggtga gcgggagctc gggggagtcg gggaagtgag aaatgtaaaa 1861 aagcgggaag aggaggattg gtccgtgtga acgcagcttg gtttgttgac tggcgcacct 1921 gtggaagtta ggcccctacc ctcccacaga ggcagagaga cagagtccta tcttcagctg 1981 ttggctggaa caaattattt tggcagcctt gagttttctc acgcacgcac tttaagtggt 2041 tggggagggt attcctaggg atgaggtctt cagctcaaaa acttgaaaat gtatacagct 2101 gtcttacact ttattgattt attgatttat tgattgagat agggtctcgc cctcttgttg 2161 aggctggagt atagtggcat gatcagagcc cactgcaacc tcaacctctc aaccgctcaa 2221 gctatcctcc cacctcagcc tcccaagtag ctgagaccac agatgcacgc ccccatacct 2281 ggcccattaa aaaaattttt tcgtaaagac agggtctcac tatgtcgccc aggctggtct 2341 caaactcctg ggcttaagtt aatcacggca cctgacctat cttattcttt tattcattca 2401 ttcatttact tatttagaga cagagtctca ctctgttgcc caggctgggg tgcagtggta 2461 cgaactcggc tcactgcaac ctccgtctcc tgggctcaag tgattatcct gcctcagcct 2521 cccgagtagc tgggattaca ggtgcccacc accacacccg gctaattttt gtattttcag 2581 tagagctggg gtttcaccat gttggccagc ctgttctcga actcgtgacc tcaggtgatc 2641 cacccagctc ggcctcccag agtgctggga ttactgatgt gagccattgc ctggcaaaat 2701 aataaattta aaattaaaac ggaaatacta ccctctaaga aataaaaaat ataaaaatga 2761 aaaaatgttt attatgtgtt ttttgtattt tatggtttaa tacgttagaa cttactattt 2821 tagttgttaa tttatttttt ttaatttttt ttttaatttt attttgagat agggtttcac 2881 tctgtcaccc aggctggagt gcagtgatgt gatttcggct cactgcaacc tacccctcct 2941 agtttcaagc catcctgcct cagcctccca agtagctggg aatacaggcg cctgccacca 3001 tgcccagcta atttttgtgt ttttagtagg gacggtgttt caccatgttg gccaggctgg 3061 tctcgaactc ctgacctcaa gtgatccacc caccttggcc ttcccagtgc tgagattaca 3121 ggtgtgagcc acctcaccct gccttttttt tttttttttt tttttttttt tttttttttt 3181 tttgagacgg gatctcattc tgctacctag gttggagtgc agtggtgtga tcacagctca 3241 ctgcagcctc aaccttccct aggctcaggt tatcctctca cctcacttca gcctctggaa 3301 tacttgggac 
tacaagtgca ggccaccact cctggctaat ttttgtattt ttttgtggag 3361 acaaggtttc accatgtggc ccaggctggt cttgaactcc tgggctcagg taatctgccc 3421 gcctcgacca cccaaagtgt tagaaagtat aggtgtgagc cactgcacct ggcctattaa 3481 tggtggtaat gacgtatcct cggtaaaatt tccagatgac atacagccaa ggagttgttt 3541 ttcctttttt agcaacagag attaattatg gccattgttc ttaaaatatt tgcacaagag 3601 aaaataacag gcagatcccc tgatctattt ccttttgttt ctaaataaat tgtgtgtgtg 3661 tgtgtgtgtg tatgtgcgcg cgtgtgcgcg ccttcacttg aaaatgttcc ttgggattag 3721 ccatggggag aagtcttgga tccctcctct ccatagttac acaaaagtgt ctgaactgcc 3781 tcccccatcc ccattttgtt gatgctgaat cctgggaatg cctcccaaaa gctctgtggt 3841 aggtctcaga caccactttc ctaggcactc tgagttacag tttggctgcc tcgaccttcc 3901 ttggttgaag ggagtgaggg taatgtatta gtagtacttg ggtattgttc ttaatgagaa 3961 atagggacag ttgaccagtt tcctggtgtc ctaaaagttc cattcctttc catttaacaa 4021 gtaatttggt ttagtgcaga aagggaccat ctctcttttt tttttttttt tttttttttg 4081 agacggagtt tcactgttgt tgccttaggc tggagtgcag tggcacaatc tcggctcact 4141 gcaacctcca cctcccgggt tcaagcaatt ctcctgcctc ggcctcctga gtaggtggga 4201 gtacagtcat gtgccaccac tcccgactaa tttttgtatt tttagtagag atggggtttc 4261 accatgttag gctggtctcg aactcctgac ctcaggtgat tcacctgcct tggcctccca 4321 aagtgttggg attacaggcg tgagccactg cgcttagcct gggaggcatc tcttaacatt 4381 gatttttcca ggacctgtaa aagcatcaaa gttccaacaa acagatttgt aactgattag 4441 ctgctgcttc cctttttttt tttttttttt tggcctgatg tcatttgtta ctgtcacttc 4501 agagtttgga ggttctgcag tcctgataca taatgccttt tcctctactc attgctgtga 4561 ggcagtagtt tcttctgtac ctacactgcc tcagtgttaa ggattaaaag aggtaacttt 4621 ccctggtata caaataggct ctcactgtag taaatccccc tgttataggc tagaagactg 4681 aaaaagaagg tgttctgagg ttttcgttta aactctcctg ccctcaggta gaaaacagtt 4741 tttggttacc tattttttaa tttatatttt aattttattt caatagtgct ccaactgtat 4801 tggcagccta ttctatttag tagcaatgag tacctttcaa ataaaaatac agtttcctcc 4861 tgacccacca cttaaaacta tctgtgttgt aaaaggaaaa tgaagctctt gtcagttacc 4921 tggcttgaga aatgggaagg cattactctg agggaggtgt tagtgatttc cctgatagta 4981 aacagaccca tagcacatct 
aaatgtgaaa ttgcaagtcg ttttggcttt tcctcactgt 5041 tgcttcctct tcagtgtggg gttataaaca tgaattcatg tttatgaatg gttcctactc 5101 taagggaact cactgttaag agaaaggcag ataaaaacta tctctaatac tttgagataa 5161 acattaggaa cataagatcc tgcaggaacg taagggagag aatgattttc ccaagggtaa 5221 cagcattttt taacagaact attgtagaaa tgtagaaggt cgccgtatat tatgaaaagg 5281 ggccatgggt ttctttttct ctcaaaccta attcttaaaa attgcttata acatttgtgt 5341 gtgcacaaaa atagattttg gggtacatta tatttatttc cagacacttg gccctattta 5401 acatgtaaca attcttaaaa ttgagagtat aatactagca ttatggaagt aggaagatga 5461 tgagctgatg ccggctagag ggaaggaatt gtcagtgtac tcttgaaatc agtaactagt 5521 ctatgtgtca ttacttagta gcatgtctag gtcaggtttt tggtgtcaga atctagctca 5581 gacaagctaa atgttagtgt ctcgtgaact ccactgtgcc ttgtggtatg taactgtgcc 5641 ttgtggtatg taactgtggg tgaacttaaa gatgtggagt agctgcaggc ataaaaagga 5701 aggagatccc agcctgggca gtgtggtgaa acccccatct ctaccaacaa cacacagggt 5761 gtggtggcgt gtgcctgcag tccagctacc tgggaggccg agatggggat ggggcatcac 5821 ctgaactcag ggaggtcaag gctgcagtga attgtattca tgccactaca ctccagcctg 5881 ggtgacagat tgagactctg tctcaaaaaa aaaaataaaa gaacgaaatc atatcttttg 5941 cagcaacatg gatggagctg gaggccattc acttaagcaa ttaacacagg aacaaaagac 6001 caagcacctc acattcttat aagtgggagt taaaccttga gtacatatgg acagaaggga 6061 acaacagaca ctgaggccta gttgaggggg tgggtggtgc agattgaaaa actatgtatc 6121 agtactatgc ttatcacttg gatgacaaaa tagtctgttc gccaaacctt cacaacaccc 6181 attttcccct gtagtacaga cctgcacatg cactctgaac ctaaaataaa agttcaaaaa 6241 aaagcagatg tggagtagga aacccctcga tttctgcagg ggcagagaaa tgttcaaatg 6301 cacagtgaac atttagaagc aaacttttga aggagtattt tcagctgcca tttatgcttc 6361 ttacaaatag agtttgaaac agctgagaca cttctaaatt cagaagttca tatgcttctg 6421 gaacatcgaa agcagcagaa tgagagtgca gaggacgaac aggagctctc agaagtcttc 6481 atgaaaacat taaactacac agcccgtttc agtcgtttca aaaacagaga gaccattgcc 6541 agtgttcgta ggtgagtgct aaaagaaagt ttttattaat ccaaaccatt ggacaactgc 6601 atgtagaatg ctgtcccctc cccctttccc aggtgtcact tgtgaattta agtaaaataa 6661 tgttaggctg ggcatggtgg ctcacaccca 
gcactttgga aggttgaggc agcagatcac 6721 ttgaggccag gtgttcaaga ccagcctggt cgatatagca aaaacccgtc tctgctaaaa 6781 atgcaaaaaa ttagctggtg gtacacacca ttaatcctag ctgctcagga ggccgaggcg 6841 ggagaatcgc ttgaatctgt aggccaaggt tgcagtgagc cgagattgcg caactgcact 6901 ccaacctggg caacttagaa gaccagaatt tcagccaggt gtggtggctt acacctgtaa 6961 tccacacttt gtggagctga ggtggaccag tctcttgagg ccaggagttc aagaccagca 7021 tgtgcaacat ggtgaaactc gtgtccctac aaaaaaaata cagaaattag ccaggtgtgg 7081 tggcaggtac ctgtggtcct agttacttgg gaggctgaag tgggaggatc atgtgagcct 7141 gggaggttga ggctttagtg agctgtgatt gtgccactgt actccagtct gggaacagat 7201 tcaaatatga atctgtcata ttttgtgtaa gtccagtatg tgtgatctaa atggtactgt 7261 tagtggggaa gtaacttttt ttgttatttt ttttagagat ggggagactt actatgttgt 7321 ccaggctggt ctcaaactcc tggcctcaat tgatcctcca accttagccc cccaaagtgc 7381 tgggattggc cgggtgcagt ggctcatgcc tgtaattcca gcactttggg agaccaagat 7441 gggcacatct cttgcggtca ggagttcaag accagcctga ccaaaatggt gaaatctcgt 7501 ctactaaaaa tacaaaaatt agctgggcgt ggtagcgcat gcctgtagtc ccagctactt 7561 gggagcctga gaaaggagaa ttgcttgaac ccgggagagg gaggtagcag tcagccaaga 7621 tcgtgccact gcactgcagc ccgggtgaca tagagtgaga ctccatctca aaaaaaacaa 7681 agtcctcgaa ttagaggtaa gccaccatgt ccagcctata gatagataga tagatagata 7741 atagtatttg tacctatgta tgaggtacat atgatatttt gttacttaga atgtgtaatg 7801 agtatgttag ggtattgagg gtattgatca tttctatgta ttaggaacat gtcaagtctc 7861 ccttagctat ttttatttta tgtatttatt cattgatttt agagacaagg tctcactgtg 7921 ttgcccaggc tgaagacgtc ttgaactcct ggactcaagt gatcctcctg ccttggcctc 7981 ccaaagtgct aggattacag gcatgagctg ctaagcctgg cctcttctac cttttttttg 8041 ttttgtttcg ttttgttttg agacagtctc acttcatcac ccagattgga gtgtagtggt 8101 gcggtcttgg ctcactgcaa cctccacctc ccaggttaaa gcagttctca tgcctcagcc 8161 ttccaaatag ctgggactac agacacacac caccacaccc ggctaatttt tttagttttt 8221 tgtttttttg gttttgtttg tttgtttgtt tgtctgtttt gagacggagt ctagttctgt 8281 cacccaggct ggagtgcagt ggcgtgatct tggctcactg cagcctccgc ctcccgggtt 8341 caagcaattc tcctgcctca gcctcccgag tagntgggag 
tagcccgcca gtgcacccag 8401 ctaatttttg tgtttttagt agagacgggg tttcaccacg ttggccaggc tggtcttgaa 8461 ctcctgacct ccgttgatcc acccgcctcg gcctcccaaa gtgctgggat tacaggcgtg 8521 agccactgcg ccttggcaat ttttgtattt ttagtagaga cggggtttcg ccatggtgct 8581 caggctggtc tcgaactcct cacgtcaagt gattacccac cttggcctct cagagtgctg 8641 gaattacagg tgtgagccac tacacctggc cgtctagata tttttaaata ctagctacgt 8701 ttgatccctt ttttcctgtt gattactctt tgatttttgt tgtttatttg tttttgatta 8761 ttttgatttt ttttttttcc ttttgattag cttgctactc cagaaaaagc ttcataagtt 8821 tgagttggcc tgtttggcca acctttgccc agagactgct gaggagtcca aggctctaat 8881 cccaaggtta gtaccagttg tgataagttc tctgtatact ggataaattc ctacaaatag 8941 aattggtgtg tctaaaatgt gtatgcgtta tatatttggg tacataattg ccaatattga 9001 atgaaagtgc cttttttttt tttttttttt tttttgaaac ggantttcgc tcttgttgcc 9061 caggctggag tgcagtggtg tgatctcggc tcaccacaac ctctgtctcc tgggttcaag 9121 cgattctcct gcctcagcct cccgagtagc tgggattaca ggcacgcacc accacacttg 9181 gctaattttg tgtttttagt agagatgggg tatctccatg ttggtcaggc tagtcgcgaa 9241 ctcccagcct cagttgatcc acccgcctcg gcctcccaga gtgctgggat tacaggcgtg 9301 agccaccgtg cccagccaaa agtgcccttt taacagtgtg tgagaatgat ggttttatac 9361 cacaccaagt gatacgcaaa aatatgaaac agctggaact tctgtccgct gatggtagga 9421 ttgtgagtgt taattagaac aatcattttg gagagtgatt tggctatgtc tggtaaagat 9481 gaatatactc caagttccaa aaatccattc ctggtacata tcctaaaggt aactcacaca 9541 aatttagaag gagacgtata ttttagtgtt cattgctgca ttgttttgtt tttagtagac 9601 atggggtttc accgtgttgg ccaggctggt ctcaaacttc tggcctcaag tgatctgcct 9661 gcctcagcct cacaagtgct gggattacag gcatgagccg cggtgcccag cctcatgctc 9721 cactgctaat gatagcagaa atttgataat ctcttggcca gtaagaaaat ggataaatga 9781 atcattgtat aatcatacaa tgtttattat acagcagtaa aaaaaaaatc aatgaactag 9841 aaccacatgg aatatcaaca tatgccagaa tgaattttga ggaaaaaaat tgcagaagga 9901 taaatacagt atgatgccat ttcttataaa gtttgaaact atgctgcata ttatttacgt 9961 atccataaat gtgtagtgag tataaaaata tgtatggcaa aacaaatttt ttttaaatgt 10021 atggcaatga taaatactaa attgaggatg gtggttattt ctggggaagg 
agggaaggta 10081 ctggtctagg agagtataca cagccatcca cttttcctgc ttattaaaga actctgggct 10141 gggcgcagtg gtttaagcct gtaatcccag cactttggga ggccgaggca ggtggatcac 10201 aaggtcagga ggttgagacc atcctggcca acatggtgga actccatctc tactaaaata 10261 caaaattagc tgggtgtggt ggtgtgcgac tgtagtcccg gctactcggg aggctgaagt 10321 aggggaatca ctggaacccg agaggtgatg gttgcagtga gccgagattg cgccactgca 10381 ctccagtttg gcaatagagc gacactctgt cccaaaaaaa aactctggcc aggtgtggtg 10441 gctcacacct gtcatcccag cactttggga ggttgatacc attagaaaac atgaagacag 10501 taaatgaaaa aatgcagggc cgggcgtggt ggctcatgcc tgtaatccca gcactttggg 10561 aggttgagac aggaggatca ccctgaggtc aggagttcga gaccagcctc gccagtggtg 10621 aaaccccgtc tctactaaaa atataaaaat tagctgggtg agggctgggt gtggtggctt 10681 acgcctgtaa tcccagcact ttgggaggct gaggcgggcg gatcacgagg tcaggagatc 10741 gagaccgtcc tggataacac agtgaaaccc tgtctctact aaaaatacaa aaaattagct 10801 gggcgtggtg gcgggcacct gcagtcccag ctacttggga ggctgaggca ggagaatggt 10861 gggaatccgg aaggcggagc ttgcagtgag ccgagattgc gccactgcac tcccagcctg 10921 ggcgacagag caagactccg tttggtaggc tgagacagga gaatcacttg aaccctggag 10981 gtggaggttt tggtgagccg agatcatgcc acttcactct agcctaagag gcaagagcga 11041 aactccatct caaaaaaaag aaaaaaaaaa acctctaagt caagtggggc taactgtaaa 11101 ggtatatttt ttataccttt tatcttttat atgtttgaaa tattttgtaa tgttttatca 11161 ggaaaagtgg aaaagaaatc cagatgaaag gtaaaggtgt tagagatgtg ggcagtagat 11221 tagcacgcct caaagaagag tgcggggaaa ttgccagtcg ccaaatcact catttctttt 11281 cattttcttg tagcttggag ggacggtttg aagatgagga gctgcagcag attcttgatg 11341 atatccagac aaagcgcagc tttcagtatt aatctccaaa catcactgct gctcggagaa 11401 accacatccc caggcataac accaccttcc cactgtctgg ggctgacttg cacagaaatt 11461 ctgttgaaga cagttgagaa ttcctttgga gaaaacagcc cagcttggcg tggggttagg 11521 ttgctgtttc aaataactca caggcccagg tgacatggaa tcttggagca gccttgtgca 11581 gtggcagcca gtggcttcct gaacgtgcct ctgcgaagtg tgagatgagg ggtcacataa 11641 ccacactgtt gactacctca ttcctggttt ttggcctcca catcatcttt tttcttaata 11701 tttcatgttt taatttcagg gtgtttatac 
tttttgaaac tagaccagaa gatagtagac 11761 tttatagaga aagaccagtt ttacctagat actaaaggaa gaattaaacc gctgttagtt 11821 tgaaatgctt tttttttttt tttttaaatg gagatagggt cttaactctt gtccaggctg 11881 gaggagtgca gtcgtacagt catggctcac tgaagtcttg accccgctgc ctcagcctcc 11941 caaataactg gggccacagg tgtgcaccac aactctcagc taatttttaa aattttttat 12001 agaggtgggg ttttactatg ctgtccagac tggtcttaaa ctcctgggct caagtgatcc 12061 ccctgccttg gcctcccaaa ctggtgagat tacaggcatg agccaccaca actggcctga 12121 aattcttaaa ggatgggagt gtcgatgaca gcaccttggc atcgttgtgc ctaacctggg 12181 agacggaaga agcacgccat gggaagtgtt tacacttggg ggacaagtgc taagtattgt 12241 ggagcccata gccccttgag atagatggct actttgcctt tcttcttgaa ctgtcttgca 12301 gaatgtggat ttggggtaag tggtcttgaa ggattcattt agtcaccctc aaattaagat 12361 ttttacttca tctttcttgg gcctgcacct ccaagataac aaagaagaag caatggtcgt 12421 gccaaagagg tccacaacca ggtgtgcact gttcactgca gcccatttgc tgtatgaact 12481 gtggttgttg tgtgcccaat gacaaggcta ctaagaaatt catcatttga aacgtagagg 12541 ccgcagcagt cagcgatgtt tctgaaatga gcatccttga cgcctgtgta cttcccaggc 12601 tggatgtgaa gctacattac catgtgagtt gtgccattca cagcacagtg gtgaggaatt 12661 gagctcatga agcaggcaag gaccgaacac ctccacccca acgtagacct gcaggtgctg 12721 ccccatgacc tccaccaaag cccatataag gagcggagtt gttaaggact gaagaaaaac 12781 ttctctggag aaaaataaaa ttgcaattct acttaaaaaa aatttttttt ttttttttac 12841 ttcataggcc aggcttgaag ttctgaacac tttgaagtct ccaattatga gagatccagt 12901 ctaagcctct ggcctgctaa ttagcaataa gtgctttatt tggaaggagg gagtcatcca 12961 ctcttgagcc actgcagtga agtcacttga tctcagtctg ggggaaaaca cttcaatagc 13021 taaacattct agctttgatt tttctgaagg gaatacactt gttttcaatt ttggggtttt 13081 tctttggggc acttgcttga ctctgtatga acttgtgatc caaggaaaaa ggagaaagaa 13141 cagtgttggc ttttaaaatc aggatggttt tatgtttgct acgaaataag gcaagaataa 13201 aaaattctta tttttattta tttatttatt ttttgagata gagtctggct gtgttgccca 13261 ggatgcaatg gcgcaatctt ggctcactgc aacctctgcc ttctgggttc aagtgattct 13321 cctgcctcag cctcccaagt agctgggatt acaggtacc // LOCUS HSU91522 2292 bp DNA PRI 16-APR-1997 
DEFINITION Human peroxin 12 (HsPEX12) gene, complete cds. ACCESSION U91522 NID g1938368 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2292) AUTHORS Chang,C.-C., Lee,W.-H., Moser,H., Valle,D. and Gould,S.J. TITLE Isolation of human PEX12, mutated in the group 3 peroxisome biogenesis disorder patients JOURNAL Nature Genet. 15 (1997) In press REFERENCE 2 (bases 1 to 2292) AUTHORS Chang,C.-C., Lee,W.-H., Moser,H., Valle,D. and Gould,S.J. TITLE Direct Submission JOURNAL Submitted (27-FEB-1997) Biological Chemistry, Johns Hopkins University School of Medicine, 725 N. Wolfe St., Baltimore, MD 21205-2185, USA FEATURES Location/Qualifiers source 1..2292 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(<1..154,459..1012,1869..>2292) /product="peroxin 12" gene 29..2268 /gene="HsPEX12" CDS join(29..154,459..1012,1869..2268) /gene="HsPEX12" /function="import of peroxisomal matrix proteins into peroxisomes" /note="HsPex12p" /codon_start=1 /product="peroxin 12" /db_xref="PID:g1938369" /translation="MAEHGAHFTAASVADDQPSIFEVVAQDSLMTAVRPALQHVVKVL AESNPTHYGFLWRWFDEIFTLLDLLLQQHYLSRTSASFSENFYGLKRIVMGDTHKSQR LASAGLPKQQLWKSIMFLVLLPYLKVKLEKLVSSLREEDEYSIHPPSSRWKRFYRAFL AAYPFVNMAWEGWFLVQQLRYILGKAQHHSPLLRLAGVQLGRLTVQDIQALEHKPAKA SMMQQPARSVSEKINSALKKAVGGVALSLSTGLSVGVFFLQFLDWWYSSENQETIKSL TALPTPPPPVHLDYNSDSPLLPKMKTVCPLCRKTRVNDTVLATSGYVFCYRCVFHYVR SHQACPITGYPTEVQHLIKLYSPEN" BASE COUNT 646 a 505 c 449 g 692 t ORIGIN 1 aagtgaaagc cagtacacgc aggaaactat ggctgagcac ggggctcact tcacagctgc 61 ttctgtggcc gatgaccagc catccatctt tgaggtggta gcacaggaca gtttaatgac 121 agcagtgaga cccgctcttc agcatgtggt caaggtaagg tattaattct tggaaaggtt 181 ttagggacaa ctgggtaaaa aacaaacgaa taatttgtgc ttatttggta gcctagcgtt 241 tgaaatagcc aatttgcaaa taacttgtat cccatgactt aagagttact gagtctctgt 301 tatgtgtaga gacctgtact gaatacccag catttcaggg tccaacagtg acttatttaa 361 aacaaattgt 
gtagacttta gaggaaaagg gggaatgtaa ataggtgtag aacttgtgtc 421 atggaatgaa tttcacttat tgctttattt tgctttaggt tcttgcagaa tcaaatccca 481 cccactatgg cttcttgtgg aggtggtttg atgaaatctt tactctgcta gatcttctgc 541 tccagcaaca ttatctgtct agaaccagtg cctcattttc tgaaaacttt tacggcttaa 601 agagaattgt aatgggggac actcacaagt ctcagagatt ggctagtgct ggtctcccaa 661 agcagcagct ttggaaatct attatgttcc tggttcttct tccctatctg aaagtgaagc 721 tggagaagct ggtttctagc ctgagagaag aggatgaata ttctattcat cccccttctt 781 cccgctggaa acgattttac agagctttcc tggcagccta cccatttgtg aacatggcct 841 gggaaggatg gtttcttgta caacaacttc gatacatcct aggaaaagct cagcatcact 901 caccactgct gaggctggct ggagttcagc taggtcgact gacagttcag gatatacaag 961 ctctggagca caaaccagct aaggccagca tgatgcagca accagccagg aggtaaggcc 1021 ttaacatttc caaaagatct ttagttagta aattcgaaaa tcgtatccca gttttgtaaa 1081 attcacctcc ctgttaactt tgtggcattt catagagtca tagaaaagct attccttgtg 1141 tccataaaac aaggactgtc atcctggtaa ccttcatcat tccaggtata ctttagtgca 1201 tacttgctat gtcccaggcc ctgtgctagg tgcttgcttt acgtacatta tctcaatcct 1261 caacacaacc caatgaagta aatgtattag ctcacagttg aggaaactaa acccaagcag 1321 ttaagtacct tgccaaagtc acgtaggtac atagaaggtg gcagagctgg gattcaaacc 1381 aaaccaatac aattctaaag actgtgctct gtgagccact gcacccagcc taagactgtg 1441 ctcttaatca tgatgctgtg ctccttctag ttcatatttg tgttctccta caagttcaaa 1501 tatatacatt cctcaatact gctaagtttt ttattaagtg aattgggttt tttttttttt 1561 ttctggatca ctatcttcca ctaaaaaaag ggaggtctat ataatcaaat tctctgattt 1621 atctttcaaa acacaaaacc aaaccctaat catgttacta aaaacaaagc ttagctccta 1681 tgctcctaaa atattgtaag ttagaatcac ttagaataga tccttaagga gatagtacca 1741 gtctaccaac cattcagtcc cccagtgatt ctgatgaaaa atgtaagaat tgcatttata 1801 ttagtcactt tgtaatgata cctaaataaa gtttctctct ttgcctcctt gttcttttct 1861 cccaacagtg ttagtgagaa gataaactca gctctgaaga aagctgttgg gggtgttgcc 1921 ttatccctgt ctactggcct ttctgtgggt gtattcttct tgcagttcct tgactggtgg 1981 tactcatctg aaaatcaaga aaccatcaag tcattgactg ccctgcctac tccaccacca 2041 cctgtacacc tagactataa ctctgattct 
cccctcttac ccaaaatgaa gactgtgtgc 2101 ccactgtgtc gtaaaacccg ggtgaatgat actgttcttg ccacctctgg ctatgtgttt 2161 tgttaccgct gtgtgtttca ttatgtgagg agtcaccaag cttgtcccat cacaggttat 2221 ccaacagaag tacaacatct gattaaactc tactcccctg agaactgaaa gggaatcatg 2281 tcttatcctc ac // LOCUS HSU95012 4940 bp DNA PRI 06-NOV-1997 DEFINITION Homo sapiens neural retinal-specific leucine zipper protein (NRL) gene, complete cds. ACCESSION U95012 NID g2232010 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4940) AUTHORS Swaroop,A., Xu,J.Z., Pawar,H., Jackson,A., Skolnick,C. and Agarwal,N. TITLE A conserved retina-specific gene encodes a basic motif/leucine zipper domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 266-270 (1992) MEDLINE 92108034 REFERENCE 2 (bases 1 to 4940) AUTHORS Farjo,Q., Jackson,A., Pieke-Dahl,S., Scott,K., Kimberling,W.J., Sieving,P.A., Richards,J.E. and Swaroop,A. TITLE Human bZIP transcription factor gene NRL: structure, genomic sequence, and fine linkage mapping at 14q11.2 and negative mutation analysis in patients with retinal degeneration JOURNAL Genomics 45 (2), 395-401 (1997) MEDLINE 98008930 REFERENCE 3 (bases 1 to 4940) AUTHORS Swaroop,A. TITLE Direct Submission JOURNAL Submitted (21-MAR-1997) Ophthalmology, University of Michigan, 1000 Wall Street, Ann Arbor, MI 48105, USA FEATURES Location/Qualifiers source 1..4940 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q11.1-q11.2" mRNA join(<2165..2545,3445..>3777) /gene="NRL" /product="neural retinal-specific leucine zipper protein" gene <2165..>3777 /gene="NRL" CDS join(2165..2545,3445..3777) /gene="NRL" /note="involved in rhodopsin regulation; leucine zipper family transcription factor; bZIP transcription factor; Maf oncogene family; mRNA sequence was deposited with GenBank Accession Number M81840; similar to M. 
musculus NRL protein encoded by GenBank Accession Number L14935" /codon_start=1 /product="neural retinal-specific leucine zipper protein" /db_xref="PID:g2232011" /translation="MALPPSPLAMEYVNDFDLMKFEVKREPSEGRPGPPTASLGSTPY SSVPPSPTFSEPGMVGATEGTRPGLEELYWLATLQQQLGAGEALGLSPEEAMELLQGQ GPVPVDGPHGYYPGSPEETGAQHVQLAERFSDAALVSMSVRELNRQLRGCGRDEALRL KQRRRTLKNRGYAQACRSKRLQQRRGLEAERARLAAQLDALRAEVARLARERDLYKAR CDRLTSSGPGSGDPSHLFL" misc_feature 3445..3777 /gene="NRL" /note="encodes DNA binding domain" BASE COUNT 1048 a 1300 c 1438 g 1151 t 3 others ORIGIN 1 tgcagggcct tggaaggtgg gggtgctggg agtttgtctg gtcctctttt tccaatccca 61 ggctatgcat ggctagggct gagggttgtg ttcacataga ggggtggtgg ttcagaccca 121 tgtttctgtc cgaactggga aactgagcac ttgggagtcg gatcatcagt gaggagcttt 181 gttggggaag atgctccctt tccttctttg agaataagca cacttgtact ctcctctctc 241 tccatttccc ccaaccccag gccaataaga tatgcaaatg atctccttac cttattggtt 301 ggaggcctca ggaccacaga tgacctcaga gagctggccc tttaagaatg ccctttgggc 361 tctgtgccca cagccccctg gagctgagca gaggcaccag gccctgctcc atggagcctt 421 cagtctcctg ggaagctgtg cctgtctggc tctggcactg accacatcct ctcggccatt 481 tctgaagtga gtatatccag ggcctacctg ggtgtcaggc cgctcaccta gaaccccctc 541 tttaacacct gctctctgtt gtctgttttc aagaatccta acaccttggt accctactcc 601 actcccaccc tgactggcct gtgtccctca ctgctatcac gtcagaaaga taaacaacct 661 tccccactac tccttctggt acatccagtt ctttctggat acctctctca gtgtctcacc 721 atctcctttc tgccaagacc cagttccttt aatctgtgtc tgtttccatc ccctctgtcc 781 tctcaagggt gacaggttcc atagagaggg cagcaaccct gccctggaac aggaagagag 841 aggcctgggt cttcctcttt gcctctctgg gtagagcctg ggggtctgac atgtggttgg 901 aaagaaggct acctaggctt ggggcatcta gaaggagccg aagtctaagc cctggggatg 961 aacactccat gccgcgcacc ttctgggggt ggggtgggag ttactggaat tgggagctgg 1021 actagagctt ggacttcaag gctgggaagg gtatctgagg tcagggagcc agaagttcct 1081 caggtgaaca tgtgccccat gattgataac agacaggaga gttccttggt gacataagtt 1141 taggacactt tttgttgaac ttgaacagac ggtttactaa aggattgctc aggttaatat 1201 gctcttgtgc tatgtgcatc tgtaagaggt agggataggt 
agcaacattt ctcaaagtta 1261 tctgagcaca gacctcttca taggtattgt tcagaatcag tgttctagag aacactcttt 1321 ggtatatgtt acaaagtcta actgccctca ttgaacagaa gggaaaacgg gtcagaaggg 1381 tgaaggtgat ctgccaagtt cacacaacgt aacaagttct gaatagaaga aaaatcaatc 1441 ttggtctggt tccccccttt gggagagact ggcccttgag gaaagatggt ggccagttga 1501 ttctgatctt tctagaacta attctaggac ctttcttttt tccatataaa gcctcttgcc 1561 cctacaaaag ggattcattg attgattaat ccatagaaca aatgtacact gagggtcttc 1621 tataggcaag gcacttccct gagtgctttg agagacagag agctataaga acacattatc 1681 cttggctctt aaaattcaca gcatacatgc cttcccactg tcctaaaata tacactcact 1741 ttgtcacagt taattaagca atttaaaaaa atggcttgga aatggaatga tgcctcttga 1801 gatgacagac ctctcggcat gtctcagcac tagggttggg agcactgcca tactgctctt 1861 gcttttcata gatcctgagg catcagtggg gcgagaggct gtgctgtcct cttcctcctt 1921 caggaattca gctgcttgtc ccctgtcagg agcccctgcc ctctgaaagg ttactcttca 1981 gcctggtggg gactctgcag tgaacagagc tgcaccatcc ctctggcttt cccaaactct 2041 tgctccaggg cacttgggct ttgagggaag agggacttgg tgaagagggg atggcaggtg 2101 gcctccatgt gctccagacc tctcctcctc tttgcaggtg cactcctccc agcccagctc 2161 cagaatggcc ctgcccccca gccccctggc catggaatat gtcaatgact ttgacttgat 2221 gaagtttgag gtaaagcggg aaccctctga gggccgacct ggccccccta cagcctcact 2281 gggctccaca ccttacagct cagtgcctcc ttcacccacc ttcagtgaac caggcatggt 2341 gggggcaacc gagggcaccc ggccaggcct ggaggagctg tactggctgg ctaccctgca 2401 gcagcagctg ggggctgggg aggcattggg gctgagtcct gaagaggcca tggagctgct 2461 gcagggtcag ggcccagtcc ctgttgatgg gccccatggc tactacccag ggagcccaga 2521 ggagacagga gcccagcacg tccaggtgag tggtcagcaa gctggcctga ggggaggcag 2581 ggcaaggaag gaggactgcc caagagagga aggggagctc ccagaggggg ttatggactg 2641 ggacagggga cagagggcag gagaagggga gaaggtccct tgaaagcaat cagatcgaga 2701 aaactacatt ctgcttctcc cccttttctt aaaatggaga gaaatgagta ggctgaacca 2761 ggaggagcag ggagaagata agattagcag agaatccaaa gggaagaatt gagctgggga 2821 gtgggcgact ccggggggta aacagataca tggtgtgtgg aaaccaggga aagggttatg 2881 tgtggagtgg agctgggtta agactggttt agattggcct tttcaagact 
tcgtgcttcc 2941 cagcccccaa ccttctcagg aaggattggg actcatccta ttaattacag acacagatgg 3001 gggtgttgcg ggcattagga tgtaatccaa atcctgtgag ggtcaccggg ttgcaggctt 3061 caggacaggg aaacgcagcg cgttggaggg ttgggggctg tctgggtgcg ggtccactgg 3121 acacacccgg gcctggagct ggatgccggg gatcccagag acgagcccgg ggtttaggtg 3181 cgcgacgggc tcgcctgacc gtggccggcc ctgcaccgtg gggcgcccgc ctgactggag 3241 caacggtcag ctggggggcc cggggagcgt cggggcctgg ggcgggctct ggaccgaaac 3301 agactgcgtg gaagggcgag ccttccggtg aaggtgggag ccggggcggg gctgtcccgg 3361 ggcggagcca ggtagcgtcg ggccctcagg gcagagccgg gtgcgacctg gcgctgaccc 3421 ggtttctgca ttctccctcc gcagctggca gagcggtttt ccgacgcggc gctggtctcg 3481 atgtctgtgc gggagctaaa ccggcagctg cggggctgcg ggcgcgacga ggcgctgcgg 3541 ctgaagcaga ggcgccgcac gctgaagaac cgcggctacg cgcaggcctg tcgctccaag 3601 cggctgcagc agcggcgcgg gctggaggcc gagcgcgccc gcctggccgc ccagctggac 3661 gcgctgcggg ccgaggtggc ccgcctggcc cgggagcgcg atctctacaa ggctcgctgt 3721 gaccggctaa cctcgagcgg ccccgggtcc ggggacccct cccacctctt cctctgagcc 3781 gttcagagca ccttgtggtg tagtgggggc tgggtggggt ggctccgccc aggaggcggc 3841 tgcacggttc tctgcatcgt taccagagcg ccttctggtc ctagccacgc cctgtatgac 3901 cgcgcaaata tccccaaagc ttttgggtcc tcaagtcatg cccgaattta gatgctggtc 3961 attttctgga gaggggtccc ctccccttac gaacacaaaa acccagccca catgactagc 4021 acgctgagct ctgcagggac cagtgccagg cactgggggg tggaagtgtg gtgacacagt 4081 gaatgggagg tggaggaggg ttgcagctcc cacctcagtt tagtttttaa ttcagggttt 4141 tcaacctgta acacattaaa gctgtaatta gcaatgaggc tgtattttca ttctgaagct 4201 tgtaacctcc ccattttagc actacagaat tttcaagatt tcaatatcca acaactagat 4261 agattaggac ctctatccga gatgcttttt ccctgcccaa ccctgtggcc ttcagggctc 4321 agagcagcaa aggcctgaag agtgagctct gggggttgtt ggtgtgggtt gggagagagc 4381 tgtgtgcaga agtctggaaa cctgggtcct agtcccagct cttccatggg atccccctgt 4441 caccctgagc aaatcagttg cttcctggac ttgtgttact tcatctaatt ctcatgtgga 4501 ttggacgact tctgctccct ttccagttct ggcatctccc cagtatggaa gtcccggtgg 4561 tctccccaag aagtccccaa gacaatctcg ccaaaggcac ctcctatcct cctgcagttt 
4621 cccagctgca gcctaggcag gggatgcaca gcccaggcga ggaagcctgg cttctctgtg 4681 agcacatacg tgggtcctcg gcagctccct ccaggctgtc tgggcctcca gacctgcaca 4741 gggtgctcct gccacctccc acctctctga gggctgaggt gagacttctc ctgggatgac 4801 aatttgctga gagagtgcag cttttgtgaa ttaaacttga agtccaggca gaattctaat 4861 gcaataagct aaatgttctt gnaatttaag aagtgttcac tcttttatcc ctgnttcagg 4921 gtactgtttg ggtntctttt // LOCUS HSU96402 2200 bp DNA PRI 22-JAN-1998 DEFINITION Human homeobox protein GSC-2 (GSC-2) gene, complete cds. ACCESSION U96402 NID g2352055 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2200) AUTHORS Funke,B., Saint-Jore,B., Puech,A., Sirotkin,H., Edelmann,L., Carlson,C., Raft,S., Pandita,R.K., Kucherlapati,R., Skoultchi,A. and Morrow,B.E. TITLE Characterization and mutation analysis of goosecoid-like (GSCL), a homeodomain-containing gene that maps to the critical region for VCFS/DGS on 22q11 JOURNAL Genomics 46 (3), 364-372 (1997) MEDLINE 98110571 REFERENCE 2 (bases 1 to 2200) AUTHORS Funke,B. 
TITLE Direct Submission JOURNAL Submitted (02-APR-1997) Molecular Genetics, Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..2200 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" mRNA join(<293..551,660..913,1481..>1585) /gene="GSC-2" /product="GSC-2" gene 293..1585 /gene="GSC-2" exon <293..551 /gene="GSC-2" /number=1 CDS join(293..551,660..913,1481..1585) /gene="GSC-2" /note="homeobox protein" /codon_start=1 /product="GSC-2" /db_xref="PID:g2352056" /translation="MAAAAGGAASRRGAGRPCPFSIEHILSSLPERSLPARAACPPQP AGRQSPAKPEEPGAPEAAPCACCCCCGPRAAPCGPPEAAAGLGARLAWPLRLGPAVPL SLGAPAGGSGALPGAVGPGSQRRTRRHRTIFSEEQLQALEALFVQNQYPDVSTRERLA GRIRLREERVEVWFKNRRAKWRHQKRASASARLLPGVKKSPKGSC" exon 660..913 /gene="GSC-2" /note="659" /number=2 exon 1481..>1585 /gene="GSC-2" /number=3 BASE COUNT 308 a 763 c 794 g 335 t ORIGIN 1 ggcggcgagg cggccctccg ccccttcccc gcaggtcgcc ctgccagctt gagcggcggg 61 cgcgggtggg gcaggaggac gcggggcggg gtggggggca cagggggccc tctctgcagc 121 gcccccggcc cccgcgcgcg gaggcagggc cgccgcagtc gaggattagc gcgttcgcgg 181 ccggcgctgc gggattaacc cgcgtggact ggacgcccgg cccggggatt actgcgcgct 241 ccctccccga cgtatatatt cccgcggcgg cggcgccccg gccgggccgg gcatggcggc 301 agcggctggg ggcgcggcga gccgccgggg tgccgggcgg ccctgcccct tctccatcga 361 gcacatcctc tccagcctgc ccgagcggag cctcccggcc cgggccgcct gcccaccgca 421 gcccgccggt cgccagagcc ccgcgaagcc agaggagccc ggggcgcccg aggctgcgcc 481 ctgcgcctgc tgctgctgct gcggcccccg cgcggcgccc tgcgggcccc cagaggcggc 541 cgccgggctg ggtgagtggg cgcggagcgg ggcgcggggc ccggcggagc ccggggcgcg 601 gcgcagtggg tgccgagctt ggccccagcc ccgcgcctca ccgcgccctc gctccgcagg 661 cgctcgtctg gcgtggccgc tgaggctggg accggcggtg cccttgtctc tgggtgcgcc 721 agccggaggt tccggggcgc tcccgggcgc ggtcggcccg ggttcgcagc ggcgcacgag 781 gcgccaccgc accatcttca gcgaagagca gctgcaggcg ctcgaggcgc ttttcgtgca 841 gaaccagtat cctgacgtga gtacgcgcga gcgcctggcc ggccgcatcc gccttcgcga 901 ggagcgcgtg gaggtgagtg ccccgcccag 
cctttccccg gagcgcgcgg gccgcggcta 961 cactggactg gggtcctggc gggcgggcgc cctttgcaaa gacggcctcg gcccaagccc 1021 cgccctggcg cgccggaggg aggaggtccc tggacggcgc tgggcgtccg ggggtatgag 1081 gagcgggtga gagcagggag gtgccgcggg aaaggaaccg gagggctact tttcttttct 1141 tttgttttac actttcctct ggtgacgaaa gaggcccgcg ttcacgtcca gaatttggga 1201 aattcagaag agcccgcaac ccaagaaggg gcgtcctggt cgccgccagc tggaggctgg 1261 ggcgggtact aagggggttc ccatctcgcg tccagaccca ccgagtctgt ccgcagcgaa 1321 taagggcagg tggcgcgcag ccgcggcccg ggtgtcggct ctacagcgcc gtccgcccac 1381 atccctgttg cgaagctccc ctctcggtcc ctgtgggacc ctcgggagcc ggtgggacgc 1441 aggacctggg gctagggctg agcattcccc ccatccccag gtctggttca agaaccgccg 1501 ggccaaatgg cgacaccaga agcgcgcgtc ggcttccgcg aggctcctgc ccggcgtcaa 1561 gaagtccccg aaggggagct gctgatgact ctaggagctg cccctgggct cggccaccct 1621 ttttgggatc tttggagttg gcgctgagag aagacaggtc tacccgaaaa ggagctggga 1681 gagtacaccg gccgcctcca cccgtctcca cagcccttgc ctcctgcagc tcgtgctgcc 1741 gtggcgctgg ggacgggccc ccggtgcttg gtgttccacg gcagtgggag tggcgagtcc 1801 cttgggggtg ggctggggca tagagcagtt tcctcagctc cctacccccc gagagacact 1861 aactccaccg caggagggga accaccccgt atcaacacgg gacccagaat cctacgcagt 1921 ggagcgtctc tccgcaccct gggacatgct ggccaccctc ttctcaatgt ggacattgac 1981 ctaacttgac ctggctcgtc ctcccccagc gggagagggg atggggttcg tgtctgtgca 2041 gtcctggcgt tgcaggcttc ccaggccctg ggctgggtct tggtatctgg acctgtagaa 2101 taagaaggtc gcaggagcga ttccaggagc ccctccacag tcccttcacc ttcgagcctc 2161 gcgctgatac tgagaaacta gcaacttcaa ataccacaga // LOCUS HSU96846 6965 bp DNA PRI 11-DEC-1997 DEFINITION Human natural killer protein group 2-F (NKG2-F) gene, complete cds. ACCESSION U96846 NID g2673990 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6965) AUTHORS Plougastel,B. and Trowsdale,J. TITLE Cloning of NKG2-F, a new member of the NKG2 family of human natural killer cell receptor genes JOURNAL Eur. J. Immunol. 
27 (11), 2835-2839 (1997) MEDLINE 98056789 REFERENCE 2 (bases 1 to 6965) AUTHORS Plougastel,B. and Trowsdale,J. TITLE Direct Submission JOURNAL Submitted (09-APR-1997) Human Immunogenetics Laboratory, Imperial Cancer Research Fund, 44 Lincoln's Inn Field, London WC2A 3PX, England FEATURES Location/Qualifiers source 1..6965 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p12.3-p13.1" mRNA join(<2186..2404,2785..2883,3415..3468,4009..4416) /gene="NKG2-F" gene <2186..4416 /gene="NKG2-F" CDS join(2218..2404,2785..2883,3415..3468,4009..4145) /gene="NKG2-F" /note="NKG2 family member; contains one potential ITIM motif, does not contain any lectin motif" /codon_start=1 /product="natural killer protein group 2-F" /db_xref="PID:g2673991" /translation="MNKQRGTYSEVSLAQDPKRQQRKLKGNKSSISGTKQEIFQVELN LQNASSDHQGNDKTYHCKGLLPPPEKLTAEVLGIICIVLMATVLKTIVLIPCIGVLEQ NSFSLNRRMQKARHCGHCPEEWITYSNSCYYIGKERRTWEERVCWPVLRRTLICFL" BASE COUNT 2187 a 1117 c 1212 g 2433 t 16 others ORIGIN 1 aatgacaccc tagcagctca gtctcttata ttctgaggtg tatgatgaga taaactgatt 61 caaagtggcc ctagtgtgaa aacttcctgt tcaaatccga agtgatacat tgatgaggaa 121 tctgcttttt gcactcttgc ctcactctga gctttcacag ggcagtctgt gaagatcaga 181 gatatctgtt ttttgttttt tgtttttttc tgagacagag tctcgctctg tcgcccaggc 241 tggagtgcag tggctcccga tctcagctca ctgcaagcta cgcctcccgg gttcaaccgt 301 tctcctgcct cagcctccca agcagctaca ggcgcccgcc accacacccg gctaagtttt 361 tgtattttta gtagagacgg ggtttcaccg gatcacagat gtcttttgtg tcgttaagag 421 gtaacctgac ctctccacta aggggcgtgt gactttctga tgacagaagg tatgtgttgc 481 tagtctcctg tttctctact agctcatctc ctttagtaga aaattgtaaa tattaaatgt 541 gatgcaacaa ggtaaagtta atacagaata aaatcaaagt agaactgtct gtacatgaat 601 aattcagggt gaaaagcaac ccaaaatata atatatgttt gaggatttct taaagtattt 661 acattattgc ctgtggagaa ttagtcaagc acatgcatac atcataatag gtttgttaat 721 caatataatt tagagtgaaa ccgataaata tcacgacaat gaagaattaa tttgaaattt 781 tataatcatt atagatgtwa taatacttta tatttagccc agactttaaa aatgcatttt 841 taatggatac atcaaattgt 
gaatgtcaaa cataatttct catacacata cttatnttta 901 attgaagcag atttattttt ttaagaatgt aataatgaag tagtttccat aaaatgcaaa 961 tctaaactta attataaata ataggacgac ctaaaaaaaa ttgagccaaa caaacattgc 1021 aaaaaagatg tactacactg acatgagact cctacctcat tagttatttt acacactcag 1081 tatgaattag aaataataaa atggcaattg tgttaaaaat aattaaaata tctaatatgt 1141 attgaggact aatgtgtgcc aggttgtatg ttaagggttt tatatacatt atacttcata 1201 aattattttc cccaatctag attagtaaac tggtgttgta gacaccaagt aactttgtaa 1261 aggtcatgca ttatgagttt caaagctgta attaatttta ggtctcactg gctacagagc 1321 tcattgacta gattattaaa ttatattttc tgatttatta caattactta cacaaccaaa 1381 gtgaccctcg ctgcctttta aaacaaccac agggtctttc tgaaagtttg aacttaaagt 1441 gttctttcat agatggtcca aacttatatt tatcattttc agttttcnta ttttagtcat 1501 taaatcaatt accaatcaat tctgtaaaat aaagagttaa atctttttca gtagtattta 1561 gtttatgcat aagttaatgt aatgttttac aaggaatnaa atatttcttt aaaattttaa 1621 caatttttnt gttttcctca atcattcaga ataaaaaaca tttgccaaga cctcatgccc 1681 ttttagtaaa cgaatgtaag tcactaacaa tgtattttag tattaaaagt tgttctagat 1741 gccaacaaat tattccttcc atgtgtagcg ctcaatcctg acattaatga attttaaaat 1801 tgtcatacaa ttagtgcatt ttattttcaa atttcaattt tagtttttta cacaggtatt 1861 ctattggtga taaaatattt ctgtagaatc tgattttaaa ttaatgaaga tatttatatt 1921 cagttacttc ttagttttca caaatgtatt gttttttatt gttcccatta ataatattat 1981 gtcaaatata gaaggccttt taaaaagtga cctattgttt aaagagattt cagtttaaaa 2041 ctttaggaaa ttagtaccag acttttatat tggtcaacag caaaatgaac attactactc 2101 agcctccaac acatgcagtt tgcctatacc agggatcctg tcaaaatata caccacttat 2161 agcttcttaa gtgcagttat catagagcac agtccctgac atcacacagc tgcagagatg 2221 aataaacaaa gaggaaccta ctcagaagtg agtctggccc aggacccaaa gaggcagcaa 2281 aggaaactta agggcaataa aagctccatt tcaggaacca aacaggaaat attccaagta 2341 gaattaaacc ttcaaaatgc ttcttcagat catcaaggga atgacaagac atatcactgc 2401 aaaggtaaaa cattaaatag atcttcaata ttattgttct aggatgtgca gttgaatgca 2461 gaaaggtggg gaaagatagg gaatattttg cacttgtgag aatcagaggt caaagtcagg 2521 atctaatatt ctaatatgaa atctgaagcc 
tgattttcag gcaggcattg ttcaattgyt 2581 aatttgtgat taacaactca tggagcatta tatttactga taatgaaatg gtatattctg 2641 agagaaagat tactagagta gatgtagatt tagaggacag agtttatcat tatgttttcc 2701 tgtgccacgt gagttctctt gtacgtaaac cttctcatca actctctatc tcccctctct 2761 cagtgcctct ttctctccct gcaggtttac tgccacctcc agagaagctc actgctgagg 2821 tcctaggaat catttgcatt gtcctgatgg ccactgtgtt aaaaacaata gttcttattc 2881 cttgtaagca tattcttgaa agattataag ggaacttttc actataatga ttggaagcgc 2941 cttgaaacat ttcataataa tgaggattag aattctctgt ttaatgtata tctctgaacc 3001 ccaagataat atgctgcttc tgaacttttc aaatttataa ataacagaat aattgtagaa 3061 aacatttatt tttttgtgtg tactaaatat atatgtatat atgatacaca cacacagaga 3121 tgtattctga tttcatgact caaagacatg ttttaagaga aaaaatattt agaaaaacaa 3181 attaattttt gaaagtggtt agatcaaata ctataagaga tggtgaagtt ttatgctaat 3241 ggctttaaaa atatttgttt taaaaagatc tcattatttt tatagaaaag tcatttttat 3301 ttcagattgt tccaatttaa aataatttta aattttgtat ttcaaagtaa gcaactgaat 3361 ttattataat ttgtctagat attaattttg taagaatcac tttaattttt ctaggtattg 3421 gagtactgga gcagaacagt ttttccctga atagaagaat gcagaaaggt accattatga 3481 ttgtcaatgt tctgatgtta gtacagttta tattttgtct ctaaagggat gcaaaatgat 3541 aataaaatgt tttgggaaaa taaactataa cattgagcca taaatgttta taaaataaag 3601 attatatgag ggcatgtcct tttctccaat aataagtaga aatgctcagt taaaatcatt 3661 ataccctctt gttgcattta attaactgaa atttcctact actataagat gataagagat 3721 aaataatttn actatactta aaaagcagtt ttgttcnntg atgtttaata tgtgtagggt 3781 ggatttttgt tggcgggctt gttttatatg ggaacacaat taagggatga gaggtggacc 3841 ttttattgtg catgtgcgta tgagtgactc gttattttaa aatatatatt taacaactta 3901 tgaggatgca gatattgtgt acctgtatgt ttatagcttt gcaaatatat aaaataattt 3961 tcatttgtaa acatattgtt ttgcatagta attcatattt ttatttagca cgtcattgtg 4021 gccattgtcc tgaggagtgg attacatatt ccaacagttg ttattacatt ggtaaggaaa 4081 gaagaacttg ggaagaaaga gtttgctggc ctgtgcttcg aagaactctg atctgctttc 4141 tatagataat gaggaagaaa tggtaagacg taaatgtttc aacactttac taaaagctta 4201 tttctgtcta tatcatattt gtagaaatca tccatatgtt 
tatacatata tttacttcat 4261 atatttttaa gtctgtgtag tattcaactg acttcataat atttttatat tcatatactg 4321 ttaatgcaca tttggttatt tccagttttg cttttcatgg aaacccatgc ttctataaat 4381 gtttttatca caaaataaat ataaagaaaa ctaagcatgt catggtcatg catattgatc 4441 acaaagtgaa tggattcatg atacaggggc agatcatgaa atagaagttt ccagccccag 4501 cagtcctgct cctgtgtatg ccttcctcac ttttcctgcc tcctctccaa acctttgtat 4561 tgatgtttaa tgtgataaat aagctttgcc tgttccagaa tttatgtcaa gggaatcata 4621 caatgtgtat tttcatgtgt gtcttctttt actcctatat gatatttgtg caataatgca 4681 gttagctgtt gtttgttgct tttcattgct gcatgatgtg catagaacat ttacaatctg 4741 ttcagtcctg tgtctctgga tggggaagtg agtctcgtgc cctcaggaac aaagaggaac 4801 ctggatgggc acgtgtagtc atgcggttcc tttgctgatc cctctcaccg cacctaccct 4861 gatgtctttt ggtttgggag gagaagcact tcctctagct cctaaatggt agcaggsctc 4921 ctggtaggtc atccttgcca gtggtaccag gcttgcctaa tgttgctggc aggactccct 4981 ttcaatacag agtgggaatg ggcttagctg acccacctcc tgttaccaga tttggttctg 5041 agaaacagag gactagatca ctttcttctg ctgagtgggg gactgaagat gcccggacac 5101 tctgttgatc cctgattctg gggtcccaaa ctattttgcc ttcctctttc cacctttcag 5161 agtttttctg tgtgtctttc attacttgca gagtttagag ttgtatttat tagggagtag 5221 cagagaaaaa ggartccatg tacctggtca agaccattgt tattccacca aaaccaaatc 5281 agataaaagt gagggcttat ctagttagag agtggtgtgg tgcccagaaa agccaatttt 5341 tggctgctgt gtcatttatt tcctcaatgg caacctttca atttccttct aaacttcaag 5401 gaggaaaagt tccactatga gacacatagt ggaaaaatac atttatttaa tcagaatatc 5461 tggagaaata catgcacagt atcaggaaat ttatttcagt tcttactgtg tctgtcttac 5521 caacacaagc aactgttaaa ttacatgaat cacaattttg cttgttaaaa taagaaagaa 5581 tgaatgtact aagttataaa aattgaagaa ttcattttaa gtcaaaatta caaaaatttg 5641 taatagcttt ttacactata aaagttgttt tcaagtttac tattttgttt aaggtattta 5701 ttttgttttg aaaattaagc atcctttgga aaggatgtat wacctgcact tttaaaaatt 5761 aataaaagtt tagaattcaa agaaagcacc taaaaattag taaaatcttt tatagttatt 5821 atttaaatga aaatttttta tttttcgtta gaaatttctg ggctccctgt cagttctctc 5881 atgggttggc atctctcgta gtagcagtga tcatccatgg gtgtcaataa 
atggctcaac 5941 gttcaaattg aagtaagttt tttgaatgat gctatatagt agaagaatat aaaaggacag 6001 gttcagaata acattatgaa taaatttaag tgcaattata gccataaata aagatgttga 6061 aagttagtga aatgttgata taaatgttaa ggaatgatcc acaattacat tttctgtggt 6121 tttagttacc caggctcaac cacggccaaa aatattaaaa ggaaaactct akgawkakaa 6181 aatccatgag ttttaaattg ttcactcttc tgagtagtgt gatcaaatcc tttatatagt 6241 gtatggctcc atctcacctg ggacatgaat cattgcttcg tccaggatac ctgcacagcc 6301 taagttccct gtccattagt cactcagctg tcgcagttat gagattgact aaggtggtat 6361 ggtaccgctt gtgttcaagt cactcttagt ttacttaata agggccccaa aatgcaagag 6421 tatttatgct ggcatattgt tataattgtt ctttttcatt attattagtt tgttttaatc 6481 tcttaccgtg cctaatttat aaattaaact tcatcaaagg tatgtatgta tagataaaac 6541 acagtatata tacagttcag tactatctat gatttcagac atccactggg ggtcttagaa 6601 catatccact gtgcagatga aaggactact tcattcagtt ttcatattat gatatgcagt 6661 tttcccagca ttatttttga agagactatc ttttttttcc aatgtatgtt cctggcacct 6721 ttgtcaaaag tgagttgtct gtaaacaaat ggatttgttt ctggattcat tggtctgtat 6781 gtctgtttta atatgagcac catgctgttt tggtttctgt agctttgtag tatattcagg 6841 taatgtgatg cctccagctt tgatattttg ctcagaattg ctttggctat actggatgtt 6901 ttgtggttcc atataaattt gaggattgct tttgctattt ctgtgaagaa tgtcactggc 6961 atttt // LOCUS HSU96876 12003 bp DNA PRI 06-SEP-1997 DEFINITION Homo sapiens insulin induced protein 1 (INSIG1) gene, complete cds. ACCESSION U96876 NID g2358268 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12003) AUTHORS Peng,Y., Schwarz,E.J., Lazar,M.A., Genin,A., Spinner,N.B. and Taub,R. TITLE Cloning, human chromosomal assignment, and adipose and hepatic expression of the CL-6/INSIG1 gene JOURNAL Genomics 43 (3), 278-284 (1997) MEDLINE 97422604 REFERENCE 2 (bases 1 to 12003) AUTHORS Peng,Y. and Taub,R. TITLE Direct Submission JOURNAL Submitted (08-APR-1997) Genetics, U. Penn. 
School Medicine, 705a Stellar-Chance, 422 Curie Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..12003 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q36" mRNA join(1273..2098,4978..5102,5665..5831,6136..6235, 10635..>12003) /gene="INSIG1" /product="insulin induced protein 1" exon 1273..2098 /gene="INSIG1" /number=1 gene 1273..12003 /gene="INSIG1" CDS join(1687..2098,4978..5102,5665..5831,6136..6235, 10635..10664) /gene="INSIG1" /codon_start=1 /product="insulin induced protein 1" /db_xref="PID:g2358269" /translation="MPRLHDHFWSCSCAHSARRRGPPRASTAGLPPKVGEMINVSVSG PSLLAAHGAPDADPAPRGRSAAMSGPEPGSPYPNTWHHRLLQRSLVLFSVGVVLTLVL NLLQIQRNVTLFPEEVIATIFSSAWWVPPCCGTAAAVVGLLYPCIDSHLGEPHKFKRE WASVMRCIAGFGGINHASAKLDFANNVQLSLTLAALSLGLWWTFDRSRSGLGLGITIA FLATLITQFLVYNGVYQYTSPDFLYIRSWLPCIFFSGGVTVGNIGRQLAMGVPEKPHS D" exon 4978..5102 /gene="INSIG1" /number=2 exon 5665..5831 /gene="INSIG1" /number=3 exon 6136..6235 /gene="INSIG1" /number=4 exon 10635..>12003 /gene="INSIG1" /number=5 BASE COUNT 2935 a 2843 c 2868 g 3356 t 1 others ORIGIN 1 agctcacagt ggggtcgagg agacacagca caggcttctg aaaggtgccc tgtccattag 61 ggagcacccc atttacagat gggtcatcct tcagctcact agtttcccag cgcatgacag 121 tgggtagctt ttttttttcc tattataaac aaatcttcga tgtgactgcg atgatctacg 181 gatgatctac gtatctggtt gtgtgatcct caagcactta tgcaaggtgg agtgtaaaca 241 atggccaaga cccaaactgc aatgttccct gaagtccaac ctgcttggag tccttgggaa 301 cgtgtacaat ggttaacagg cttgagagga gtcagctggg agagtgagag cgagactgcc 361 ctgcagaggg gccgctgagg atggacccct gtaggccgtc cacacccacc acgcgtgtca 421 tcctcaggaa atgcaaccgg atgctcggaa tttcgcccct aaagcgactc agggcagtct 481 cagctcagga agacttcttt ttttgacctt ctggcttaaa gccacaaccc caggctgagg 541 atgccacatc aacggcgttt cctcgccacg ctgcaggcgt gactcacgtt tctaagcctg 601 cagggtcccc gggccaagcg cggctactga gagcacagcg tcctccctca gcccctgctc 661 atctgcacgg agcaaagccg cagctttctc tgctgtcagc gctttatacg actcggatgt 721 aggctctgag cgggcacggg cccgccctca gacggcgacc cgaggggcaa acaggatgga 781 ggcgcgctcc 
cttggaggct gtggtctcga ttttatctgc tcctgggtgg tgcgggtggt 841 ttgctctcta ggcgtgagca aaggccacaa ggggcccgaa aaaatcaccc aaggatgcaa 901 gcccgccatt gcgctgagca ggggcctccc cgccgccaga gcccgcgccc ctccagggag 961 gcccgcgcga tcgagacccg ccgcctcccg accaggcgcg ggacaccagg cgaggggaga 1021 caggagccgg gaggggaggg gccgggcccg caaccccacc cagtcctgcc tccgcctcct 1081 cccggtagga ccgtcccctc ccaggcccca gctccgcggc cgcctcacca gaccgcgtgc 1141 ggacgggctc gcgggcgggg cggggcaagc tcaggccacg cccctgggcg gtgcgcgcgg 1201 gcggtgggta ggccgggcag gctcacgtga tgcgggcccc gcggcccgcg gatataaacc 1261 cgcgcgcccg ccaggcgctg cggccgtccc gggccgtgac tcctcctttc ccccgccccg 1321 cctccgttcg gagagccggc gggcgggcgc ctctcggcca ggtacgcggc cggctgggat 1381 aggggtcgcg ggcgggcttt ggtcgcgcag cagccggtcc tccccggagg taggcgggcg 1441 cgggccctgt tgggtctttg ggacgcgggt cccgctgggg ccggggatgc tcctctctgt 1501 gagcgcggtc ccgtcccccc ctgtccccgg gcgggcgcac gggggtctaa ccttgggggg 1561 cggcgctccc gcctgtcccg ggtgccgggg ttcgcgtccc gcggggcctt cctcgctctt 1621 tgtctcttct gagtgaactt gatgaccccc ttcttccagg aagcgcctct tggacgcgtg 1681 tgaccgatgc ccagattgca cgaccacttc tggagctgct cctgtgcgca cagcgcgagg 1741 cgccgaggcc ccccgcgagc cagcaccgcg gggctgccgc ccaaggttgg ggagatgatc 1801 aacgtttccg tgtccgggcc ctccctgctg gcggcccacg gtgccccgga cgctgacccc 1861 gcgcccaggg gccgcagtgc tgcgatgagc ggccccgagc ccggcagccc ctaccccaac 1921 acctggcatc atcgcctgtt gcagaggagc ctcgtgctct tctcggttgg ggtggtccta 1981 accctggtgc tcaacctgct gcagatccag aggaatgtca ctctcttccc cgaggaggtg 2041 atcgccacca tcttttcctc cgcctggtgg gtccctccct gctgcgggac agcagctggt 2101 gagtacccct cctggttctt ctggaagtaa aaagggtgtc ttttcggttg aaagtacttt 2161 tcagcctata agaaacatgc atttgaaact ggaactatcc acagattgaa agtgtgcatc 2221 tccagtcttg ggatttagag aaatgaagga ggggtagaga ggacctggca cccccgaggg 2281 cgggttcggg acacagctct ctgaagtttg aggaagcaca gtgatggaga tgaggcaggg 2341 agataattgg ctgttcaagc aggtgctgtg ccaagttgac tgagggccct tccctcctgg 2401 cgtccgggta cagccccagc tgtcatcccg cgccagccgc cccggattgg ccgcctgggg 2461 aactggtgag cagcgggact 
gaaaggccat tggtcgcggc agcaaacatg ctaccctgct 2521 ggataactga aaacaaaccc gtcacatggg ttcaggaggc ggttgccagg ggagcaactt 2581 ctcagcactt ccttttgccg ggtcgctgag taaactgtgg tctgaagcca aaccgcctac 2641 accctgaact gttagagctc tgtagattac gttcttaaga aaagataccc cttacggcgt 2701 ttagattttc gtacacccaa gcagtagcat agatgctgta caacgtaaca ctgagtcagt 2761 ctgaactgtg gacacgggga ctgctgggtg tgtgtactag actttcttgg ctatcattga 2821 cccgctttta tagaaatgtt gagttgatct tggctcagca gatcacttac tgctcctgta 2881 gtcacacagc atttgtttat ttaacgagct tgtcgggtgc tttctgcaac ctcaactcta 2941 taagctgttg aaatgttaat ccaaaagtga gcaccacaag cccctatcct taagcagctt 3001 agagtctggt ggggaaaaca aaccagtatg tcagtgactc cagtgcaggg cagggagcat 3061 cctgtttgct ttgtggctga cgcctgccag ctgactgctg accataggct tcagatgctt 3121 tgctgtgttc attccttaca gcaatgccat ttgtagcttc atttatagat gtggcaactg 3181 aatcacagtt gaagtcaagc ccaaggttgg cgcaggtaga aagggcccaa gccatcggtg 3241 gtagactgct ccttcttgcc atggtagggt gtgtctcctg cactgtggaa acacattgcc 3301 tcagttgact cactgaaaac agcaagcagc agagaggttt gtagcagatt aaacaggggg 3361 ctccgtgttg aagaactagg acaggtgtct gtaactaaaa agtagtgggt agaatgtttt 3421 catatgtgtt ctcggaggga cgggttcctc tgtcgtttgt cgaaacacat tatgtggctg 3481 ttctaagtga acactttttg agaacgatat ctagagttga aattccatgt ttcatataca 3541 gttggcagaa tagtttcttt cctttaaccc taaaacttga tcttagaaat gagataggta 3601 gccttcagta gttgtaattg gatcctcaag ggtttctagg aatatctgga caatttctca 3661 tactctcata ttctactttt ttttaaacca tcagccaaga gcataagtaa aatttttcat 3721 agtatgtttt ttacctattt atttattaag ctatagtgag gtaaagttaa tttggggggt 3781 ccatttttca gaactccaaa attagaacct gatgaagtat aataaattga aaacaacatt 3841 ttttatctga atgtttgatg caaactgtgt ttcttgtttc tcagttgaac ctaaaaggta 3901 tatttctttc atactagact gaggtgaaac aattgcaggg ttctcaagag ccctgggaca 3961 cagtgaccct tgtgtcccag catccaaccc cacggggcgg caccccactc ccccctacag 4021 acattcattc ctgccttcat ctttggcatt ttacttctct tgggggaatg ccattccttc 4081 ccacctgtcc aaattccaag ttcgtgtcta taaagaataa ctattccaga ccacagtgat 4141 ctctttctct gtgtcccagt tacataatcc 
ttataatttt acacatgaga aaacaaacat 4201 gcagagccac gtcccatcag agggcaagct gggatttgag gctgagtttt ctcttcttgg 4261 tctggtgttc tttccgcagc ttgaggcttg accatgcggg aaccagatgg agaataagcg 4321 aagcctgtgt ggggctgtct cgccccgtac ttgcccatgt gcatgtgtgc acagtaggtg 4381 ctcagtgcct cgtacagaac atgctagaac ccccgaaaat caactccagg gtcactctag 4441 gagcatgtga cccagactgt caaagtcaca tcagcaaata tgtatctcat cacgtgcccc 4501 aggcccagta ctgggtggtg gggaggagtg tcaagagctg tgatgtgaca gggggtggac 4561 agatgctaac ctttttattg taagatagaa atcacaccgg atgaaaagac aaggtggaac 4621 tacatagtca tttaagttct gtgtgcttta ggcaaatgtg gatattaaat ataaggttac 4681 aatattgctc atattaatat gaataatccc tgcttgattc cttcctgtgt ttactgagac 4741 atgttcaact acactaaata atagtgattt ttaaaaaact tttcccaaac agtacaattt 4801 tgtgggagtg gtactcaaag agaacaagaa tataaaatca ctctgaagtt atagctgata 4861 attttttcta ggaaaaggac tggtttgagt aatgtactgt gttactaatg accacgttgg 4921 aaattctgga acttggaatc tgctgtattc taacattggc aacttttttt aaaacagctg 4981 ttgttggcct actgtacccc tgtatcgaca gtcacctcgg agaaccccac aaatttaaga 5041 gagaatgggc cagtgtcatg cgctgcatag caggttttgg tggcattaac cacgccagtg 5101 ctgtatcctt aattttctgt gctacgtcca gagtatcttc ttaggttata tattgaagca 5161 ttttgttttg ttatatactc aaatgactcc atgtcataat acagacatga tggtggtacc 5221 tacccatttc agattactaa aaagagaaag aagaatgggg ctatcgatga cttcatagca 5281 aaatatattg ttttatggtc aaaataaata taaaattatt taaattttta tatatattaa 5341 aaaaattttt aaaaatatat ataaaatata taataaatat atatttatat aaatatatat 5401 aagcactcca tatatagctt tcagcaagtt cattctcatg gctaagtttc tgactcagcg 5461 atggcaggga gatttcaggt gatcttaatg attactgtaa cctgcatcca acaaatcctt 5521 tctttagctc attacacagg ctttatggac agtgaattat ctgtgggaca ataataaaat 5581 acccatgttt ttaacccttg tggttttacg tttttttatc ttaataagag tcaccgaact 5641 tttccttaac ttttatatca ccagaaattg gattttgcca ataatgtcca gctgtccttg 5701 actttagcag ccctatcttt gggcctttgg tggacatttg atcgttccag aagtggcctt 5761 gggctgggga tcaccatagc ttttctagct acgctgatca cgcagtttct cgtgtataat 5821 ggtgtctatc agtatacagt aagtgtgtgt tttcaaatat 
tggctttgga aagcttagat 5881 attttcatat tttatttgtt ttttatatag aatgatacag atttaggcat gaaattctca 5941 aaaatgctgc tatccataac ttaatgtaac gcaaaggaaa accaagtagt agcagttaca 6001 accaaacaaa tatgaacatt cgtaacattg gatggttgat atgcgttctg catatcccca 6061 ctacagagaa tctgtgatgt agaattgagc caggtgaaat tgactgctaa ggtacagttt 6121 cttttctcgc tccaggtata cgtccccaga tttcctctat attcgttctt ggctcccttg 6181 tatatttttc tcaggaggcg tcacggtggg gaacatagga cgacagttag ctatggtaag 6241 tgaaatgatc atattatctt ctaaaacttg cttctcttta ccttgataga atgactttac 6301 atgatacatt caaatttgcg ttcatatatt ctggttcaga cttgagtgac aactactgag 6361 aaaagcttgg agtccggatc aagcaacctc ccgcttcagc gtcccgagta gctgggatta 6421 caggcacatg ccaccatacc cggctaactt ttttgctttt tgtggagatg ttatctccct 6481 gccttgtcca ggccgggctc gaactcgtgg gctcaagcca tcctcccacc ttggcctccc 6541 aaagcactgg gattataggc gtgagcctgg ccttgctttt tttaatatta aaaatctggt 6601 cggatggtcc cacatggaaa tagtatgttg ttttcatgca ttcgccatca tccccgttaa 6661 tagtcttccc aaaggagagc tggcaacctc catcattgct ttcttgcttt gagatttttg 6721 tttcgggaac tctctttggg agagaattcc ctaagaaggt ttttaaagaa taaaattgtc 6781 ctcaaccaga ttgactaatt acacattata actacaactt gaataacatt taaagagttg 6841 ggttatttct tgtttgatag gttggatgct tgttatatgt caagcctttt taaatcggtt 6901 ttaagaaata agcattattt tggtggaggt agaaatttca caacattcat aagttctaaa 6961 catctagaaa tggttggagg aaacattttc gttttgaaga tagtttcatt aagtaatccc 7021 tgctgtgctc ctttgcttac agaaaaaaat gttaaagata actccccata ggatagttgg 7081 tgtaattata attctcctta cttcagcttt ctcttcatcc tttttattct ttaagcttgg 7141 gggaaaatgt ttggatgttc tcgtatctta aaactggagg agagaggctg ggcgcggtgg 7201 ctcacgcctt taatcccagc actttgggag ctgaggcggn cggatcacga ggtcaggaga 7261 tcgagaccat cctggctaac acagtgaaac cccgtctgtc ctgaaaatcc ccgaaaggaa 7321 aaaaaaaata cccaggcgtg gtggtgggca cctgtagtcc cagctgcttg ggatgctgag 7381 gcaggagaat ggcgtgaacc cgggaggcag agcttgcaat gagccgagat tgtcccactg 7441 cattccagcc tgggtgacag agcaagactc tgtcgccgaa aaaaaaaaaa aaactggaga 7501 agagagatca ccaccttaat ccccttccct actgggctat cacatgaaac 
tgagctctga 7561 ggttctaagc ctaacccgcc taccactgtt tttattcata aatacttaaa tccttacctc 7621 tggatcccaa ctgttactct ttttttcact tcaaaatttg caaactcatt tctatccatc 7681 aaaccccgga aatggcccag gggcttcaca gtgcagtccc ctccctcctg ctgcatccgt 7741 tcctcaggct gtgccccagg ttgcattcgt ggtgcgtgtt ttctcccagg gcagccttcc 7801 ccacattcca ctcttgtcat actgttgact cacaaaatgg aagtggctga atgttccagg 7861 gatagtgtct aagagctatt ggtttaaggt tgaaacctac accaagtaag gctttgtgaa 7921 gggagatgac gagagtgaag ttgttgacag tctgaaggtg aagagattcc gagggtcagt 7981 tcagcaccgg gaccagaccc gcccagatcc agggtaattt gaggtctgtg tttgggtgct 8041 gatggttttg gccagtggac tccacaacct ctccttctta gctgtctcag gtctcttcag 8101 atgccttggg tcttaactta tgcttccctc ccttaagaag aaacatagtt aattggcaag 8161 aataaccatg ctcaaaaagt tacaagttgg gtttgggttc aagttatatg ggcttcctag 8221 cttttctatg tggtatttcc ccccacacat gccgttgtcc tttgtgtccg aaaatacagc 8281 agaaacattt gttatcagcc tgttacggtt tttttgtttg cttgtggttt gtttttattt 8341 gtactttttt ggagacggag tcttgctttg tcacccagac taaagtgcag tgggacaatc 8401 ttggctcact gcatttgaac cgtctcccag gttcaaatga ttatcctgcc tcagcctccc 8461 aagtagctag ggatcacagg gcgcccacca ccgcacctgg ctaatttttc tgtttttagt 8521 agagacgggg tttcgccacg ttggccaggc tggtctcgaa ctcctgacct caggtgatcc 8581 acctgcctct gcctcccaaa gtgctgggat tacaggaatg agacaccgcg cccggcccag 8641 tctgtatgtt taaaaaggaa aatgagtctg taaccatcat ttattagaac ttgtagtatg 8701 ttgctgtgtg tagcttcaag tccttcctca agagtaagct gcaaactgcc aacagaaatc 8761 aagcctgtcc tgttgacact gcccggcccc actgggacac ctggagggag agtccccccc 8821 acctcagtgt cagattggaa tttgatttgg cggtcctgtt tgtgattgac ccaatacagc 8881 tgctaatgat aataatacta ttattgttat tgagggttta catttgttgg gctcttactg 8941 ctgcacagga ggcattgtgc tgagcgtttc cattgttagg cagtttagtc ctcatacaca 9001 agtctttgga gtagaaacta ttattagccc atttctaaaa ggggaattag cagcttagga 9061 agattaaaca gcttgaccag gtgggtggca gaggtcgacg ttgcaagccg acttctcatt 9121 agcataccac actgccaata ccaacacaat gctcgcactg tctcacctgg aagcaggtta 9181 cagcagcctg cgttggtgtc tgggttgcca tttacaatgt gccaccatta ggcaacactg 
9241 ctggaagtcc ctgtgccacg tgtcaggtcc gccccggcct ggctgtcgcc cgcctctggc 9301 gtctctgcca ggcctcactg gggtctctgt cgaccaggga ttctaaaagt ccacttccca 9361 tctatttatt cagtacatta gggaacaggg agtcaaattt ttgttggccc aagtttcctt 9421 tcccgggttt gcttgcccaa gaagtttgcc cgttttttgt tttccccccc cccacggttt 9481 gaataagaac cccctcccca gttgtgcaaa aacccaagaa acttgggcag ttccttttca 9541 cctttgttcc aattcccttg aaagcattgt attgtttccc taaggaagtc cttaattgtg 9601 ttttgtcccc caggggtagt accctggcaa taccctggtt gactccaacc cagactcatg 9661 actttcttac ctgccctggt tttgttcaag atgagaggct cttttggcct gttttctgga 9721 tgagtaatct ccgtcattgc tgaagtgagg agctcagctc cttcctctta aagtgagtgg 9781 agggcaatta agtaacgtgg gaagtcacga ttctgaggga aaaaaattct gtatggagtt 9841 tctcttttgc cctttagcag aagctccgtg acctcaggaa ggtcatttaa tctctctgtc 9901 tttgcttctt catcagtgaa atactgttct ccattaagaa tttaccctga ggtgcctggt 9961 catagtagta actgtagtgt ctacctctca gaagctctgt gagggtcaga tgaggaagaa 10021 catgtagaaa ggatttccca actgtgaagc tctgtgtgaa ttgagggcac acagagcttc 10081 agtattacaa agcatgacct atttccaatg tctgtttcaa caaactgctg gtaaggtggg 10141 ttataacaca ggcatcagat actaataaag gtttgtggca gaaacctccc ttcagataaa 10201 ggttcaatcc cttaaattgc agttagaact ttcaggaacc tatcataccc taagcatcac 10261 catgctttga taaaattggg tgtttccttt atatagatat taatttagtt aaactttcac 10321 caagtcttac acttacatgg tggctttaat aagggcctca gcagcgtaat ttcacctgag 10381 gcagagaatt atcctaattt tgcaggtgag agctgtggct cacggggcca gagccatgaa 10441 gacggcgcag gggagctggg agagctggcc tcggtctcca gttctctgca catgtcccac 10501 ctctcaatct tgttgcttca gtcaaagaaa gtgtcaatat ttcccatgtt aggacctact 10561 taattggtaa ttaaaatatt tatacattta gtgctaattt tctctttcct tttttttttc 10621 ctttctacac atagggtgtt cctgaaaagc cccatagtga ttgagtcttc aaaaccaccg 10681 attctgagag caaggaagat tttggaagaa aatctgactg tggattatga caaagattat 10741 cttttttctt aagtaatcta tttagatcgg gctgactgta caaatgactc ctggaaaaaa 10801 ctcgtcacct atctagaaaa gtcaagaata gggaggtgga gaatgatgac ttaccctgaa 10861 gtcttccctt gctacccaca ctggcgcctg tctgtgccct ggagcattct 
gcccagccta 10921 cgtgggttca gtcaggtgcc accttcccaa gtattcgatt tcattcatgt gattaaaaca 10981 agttgccata tttcaaagcc ttgagctaag actcaattac caacccgcag ttttgtgtca 11041 gtgcccaaag gagggaggtt gatggtgctt aacaaacatg aagtatggtg taataggaat 11101 aatatttatc caaaagattt ttaaaaatag ggctgtgttt aaagaaggaa tcaaaacaag 11161 aaaagcagca gtgattatag agaggtcaca ctctaagtgg ggtcgcggcg tggccacgct 11221 tcacggtcac gctcgtccgt cctgcagtgg cgtgtttaca tggtcacacg tgtgtgtatc 11281 accagtgggt caactgcttg tcattcctcc cgcggcagtg ttgtgtagac aatcttactg 11341 agcaaaaggc aatgaaaagt cttgggctcc cacactgcga tatattggaa ttcccacctc 11401 agtttatgaa gtttatttcg aaatccatag tcatctaaga atgaatacct gtctgccatg 11461 tatttcaatc ttagtgagcc aaaattgttt gtttgttact acagaataga gatgactgtt 11521 ttttggcaca gccctatgga atttgcaatc tgtgattgcc ttgtaaaaag gagagtgcat 11581 atggcactgc attaaacgtg tggtgtttct agtcaatgat attggtgagc acaatgtatt 11641 catttaatgg catagaccat accagaccta atttgcaagt attgggtctt caaacttcaa 11701 gtgcaatgta ttatgaaaac caatctgagc cttgtatctc ttaaatattt attccttcta 11761 acgtgtgaga tgtcccgaga gaagggttct ccattcattt cagtgctgcc tggaggaaac 11821 tcggcaatga tttcttcagt tgtgaagttc ctttcgggtt acaacctcca ctggaaccct 11881 caaccttcga aatactccag ttttgggggt tggggccatt tacttataaa tttaccgccg 11941 ggtttttgga atctacatgt cttgggggcg ggctcaaatt cttcgaaagt ggttggatta 12001 aaa // LOCUS HSUBA52G 4555 bp DNA PRI 26-JUN-1997 DEFINITION Human UbA52 gene coding for ubiquitin-52 amino acid fusion protein. ACCESSION X56997 NID g37566 KEYWORDS UbA52 gene; ubiquitin-fusion protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4555) AUTHORS Baker,R.T. TITLE Direct Submission JOURNAL Submitted (14-DEC-1990) R.T. Baker, MASSACHUSETTS INST OF TECHNOLOGY, DEPT OF BIOLOGY - 16-520, CAMBRIDGE MA 02139, U S A REFERENCE 2 (bases 1 to 4555) AUTHORS Baker,R.T. and Board,P.G. 
TITLE The human ubiquitin-52 amino acid fusion protein gene shares several structural features with mammalian ribosomal protein genes JOURNAL Nucleic Acids Res. 19 (5), 1035-1040 (1991) MEDLINE 91212181 COMMENT See , for related sequences. FEATURES Location/Qualifiers source 1..4555 /organism="Homo sapiens" /note="chromosome 12 or 19" /db_xref="taxon:9606" /germline /dev_stage="adult" /cell_type="lymphocyte" /clone_lib="EMBL3A" /clone="lambda-UA1" repeat_unit 114..406 /rpt_family="Alu" repeat_unit 743..758 /rpt_type=DIRECT misc_binding 747..752 /bound_moiety="Sp1" repeat_unit 827..843 /rpt_type=DIRECT misc_binding 832..837 /bound_moiety="Sp1" repeat_unit 932..944 /note="pyrimidine tract" /rpt_type=INVERTED mRNA join(934..962,2363..2473,2733..2819,3942..4044,4129..4306) /gene="UbA52" /note="in placental gland cDNA clone" prim_transcript 934..4312 /gene="UbA52" mRNA join(934..962,2363..2473,2733..2819,3942..4044,4129..4312) /gene="UbA52" /note="in adrenal gland cDNA clone" gene 934..4312 /gene="UbA52" exon 934..962 /gene="UbA52" /number=1 intron 963..2362 /gene="UbA52" /number=1 misc_binding 1037..1042 /gene="UbA52" /bound_moiety="Sp1" misc_binding 1208..1213 /gene="UbA52" /bound_moiety="Sp1" repeat_unit complement(1415..1703) /gene="UbA52" /rpt_family="Alu" exon 2363..2473 /gene="UbA52" /number=2 CDS join(2371..2473,2733..2819,3942..4044,4129..4222) /gene="UbA52" /codon_start=1 /product="ubiquitin-52 amino acid fusion protein" /db_xref="PID:g37567" /db_xref="SWISS-PROT:P02248" /db_xref="SWISS-PROT:P14793" /translation="MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLI FAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGIIEPSLRQLAQKYNCDKMICRKCYAR LHPRAVNCRKKKCGHTNNLRPKKKVK" intron 2474..2732 /gene="UbA52" /number=2 exon 2733..2819 /gene="UbA52" /number=3 intron 2820..3941 /gene="UbA52" /number=3 repeat_unit 2939..3296 /gene="UbA52" /rpt_family="Alu" repeat_unit complement(3373..3623) /gene="UbA52" /rpt_family="Alu" exon 3942..4044 /gene="UbA52" /number=4 intron 4045..4128 /gene="UbA52" 
/number=4 exon 4129..4312 /gene="UbA52" /number=5 variation 4226 /gene="UbA52" /note="in adrenal/placental cDNAs" /replace="t" polyA_signal 4280..4285 /gene="UbA52" BASE COUNT 970 a 1178 c 1341 g 1066 t ORIGIN 1 ggatccgcac atctcggcct cccaaagtgc aggcgtgagt caccggaccc aggtcccgcc 61 ctggcacttt ttaaccaccc acaaatctgg atcctacact gaaaagagac actgcagtgg 121 ctcacgtctg taatcccagc actttgggag gccaaggcgg gcggatcacc tgaggtcgcg 181 agtttgagac cagcctgacc aacatggaga aaccccgtct ctactaaaaa tacaaaagtg 241 gccaggcatg gtgtcgcaca cttgtaatcg cagctactcg ggaggctgag gcaggagaat 301 tgcttgaacc caggaggcgg aggttgcggt gagccgagat cgcgccattg cactacagcc 361 tgggcaacga gagcgaaact ccgtctcaaa aaaaaaaaaa aaaaaatcct gagtcccgct 421 tgacaccttt tgtcaggcac caccaccttt ctgggcgaat gcggtagtac cgtctgctct 481 ccctgctgct gtcctgaaat ccattcaggc acagcggccg agagctttat aataaccgat 541 tccaggtgtt aggtgctttc ccagccccga ctcctgcgtc ctggacccgc agtcctctgc 601 ttaatacctt tgctttatta gaaaacattc tcctctactc cgttcagcta ttcgctgagg 661 gcccgccaac cgccagcggt tgtcaatggc ctagaggcag cggacgcaaa cacggggaga 721 ggtgcaatcg tctcaagtga ctcggcgggc ggggcccaca accggaagcg ggtgggcgac 781 cttcacccac gtgcgctgcg gcttcgttcg ccagcatcca agatggcggc agggcggggc 841 ccaaggcgcg gcgcgaattg tgacgcaggc gtccggcgtg ctccgtcgca agcgctttcg 901 gcggcgatta ggtggtttcc ggttccgcta tcttcttttt cttcagcgag gcggccgagc 961 tggttggtgg cggcggtcgt gcgggttcgc gccgggccga gagcgggttg ggggctgcgg 1021 gaggctgcag gggcctgggc ggcagaagag gcggccctga gctggctcat gcgggccagt 1081 ctcggcaggg tggctgggca gggctcgcga ggccacggct cggagcccag accggggccc 1141 aggagcgaac gccgttttgg agaggagcct gcctgctctg cctgccagcg tgaccccacg 1201 aggcctcggg cgggaagagg tcctcggggc agatccgagt taatgagaga ggggtattga 1261 gcgtgtagcg ttaactctgc cagtcactgc gtcagtcgct ttggaaatac taaatttctc 1321 gagctgagtc ttcatacctg gctcctaatc tacgtctgta aggaggagct ggtggtagtg 1381 tctgcttttt agacttttct ttagactatt tgtatttttt tcagatggag tcttgctctg 1441 tcgcctaagc tggagttcag tggtgcggtc tcggctcact gcaatctcca cctcccgggc 1501 tcgagcgatt cttctgcctg 
agcctcccga gtagctggga ttataggcgc ctggcaccac 1561 gcccagttga tttttgtagt tttagtagag acggagtttc accatgttag ccaggctcat 1621 cttgaactct tgacctcaaa tgatccgtct gcctcggcct tccaaagtgc tgggattaca 1681 ggcatgagcc cctgcgcccg gtcgattctt tgtcttttta agtcaacttt tatatgtgaa 1741 caatgcttgg caggtggttg gtagatacta agtgatgttc gtggtttggg gtcaaggcaa 1801 gaagtggggt ctggagagtt ttggtgtaat tgagaaggaa gctaagagtg ttgggtgctc 1861 cagcttggag ttagagagga gagaggctgc cacaggaaga catgtgtgtt gtaggggatg 1921 gcttcccatc caggctggca gcaggagcag cctgtgcaga tcaggacctt gctccctgga 1981 agagggtgga ccgccttcag ggaagatgga tctagcaaga tgatgccaaa gggtacttat 2041 tccatcagga gatactgacg agtccttccg ccgctaaacc taaggtgaat aaccacagtc 2101 tgtgttcctg aagagcaccc gtgcggtcag gagggtggag gacatgtgat cttagttcca 2161 ggacatgttt agactacagg ccagggtgtg tgagaagcct agcagggcca ggcttggagg 2221 agtgaaagga agacaggtac tggggcagga ccagttggac ttggtgcagg caaagggata 2281 gcaactgtgg tgtaggcacc tgagcttgtg ctactcaggc atgcattgct caccagtcta 2341 tcctgccgcc catcctcctc agacgcaaac atgcagatct ttgtgaagac cctcactggc 2401 aaaaccatca cccttgaggt cgagcccagt gacaccattg agaatgtcaa agccaaaatt 2461 caagacaagg agggtgagta gggctgggtg tgggggctct ggctgtgaac tgggagtccc 2521 tctctcgccc aggggagtct cagtcctgtg tgggttgtgc tgactttaga tctgttttgc 2581 ccttgcttct ccatgtgatc tgaagaacgt ttgttatctt ctacctcagt tggccttttg 2641 agaaactggg ggtagtgctg gagctcccct gcagaggaca ctgccagtaa tatggtccgc 2701 agagcctcta actgagcctc cctccccctc aggtatccca cctgaccagc agcgtctgat 2761 atttgccggc aaacagctgg aggatggccg cactctctca gactacaaca tccagaaagg 2821 taccggggtt ggggttgctg ggcagggacc caagatcccc aggtcctagg aaaggagcat 2881 tgatggcctc aggggttggg gagcagttca aatgacttgt gttttgttta aataatggga 2941 ctgggcacag tggctcatgc ctgtaatccc ggcactttgg gaggcttagg cgggtggatc 3001 acctgaggtc aggagttcaa gaccagcctg gacaacgtgg tgaaatcccg tttctattaa 3061 aaatacaaaa atcagctggg tgcagtggct caggcctgta atcccagcac ttcgggaggc 3121 tgaggcgggc agatcacaag gtcaagagat tgagatcatc atgaccaaca tggtgaaatc 3181 ccatctctac taaaaataca aaaattagct 
aggcatggtg gtgcgtgcct gtagtcccag 3241 ctactcagga ggctgaggaa ggagaattgc ttgaactcgg gagacaaaaa aaaaaagtca 3301 taatgtgaat ttttttatca ctgcaataag gaaattagtg tcacttgtgg gagcgacaag 3361 aattcagtgt cctttttttg tgagacagag tcttactctg tcacccaggc tggagtgcag 3421 tgacgcgatc tcactgtgac ctccgtctcc cgggttcaag cgattcccct gcctcagcct 3481 cccgagtagc tgggattaca ggcacccgcc accacgccca gctaattttt tttgtatttt 3541 tagtagagac agggtttcac tacgttggcc aggctggtct cttaaagtgc taggattaca 3601 ggcgtgagcc atggtgcccc gcctagactt cagtgtctga ccttgcctga accacttaga 3661 ggtcggcttc catgttagaa acccagatgg atgcctcagt tggcatgtgt cagtctcaga 3721 ctccccccag ggctcgtggt cagtgctgag atggagattt cctggggcag gctggctggg 3781 acagtgtatc atccacacgt agaacgacgg cgggggatcc cgacttggtg tccccatcac 3841 acttgagaaa gcagcagact ataggccctg gagggtcctg cccctgtgac tgaggagcca 3901 gggctgggct cagtcgccgt ccttctggct gtctcctgca gagtccaccc tgcacctggt 3961 gttgcgcctg cgaggtggca ttattgagcc ttctctccgc cagcttgccc agaaatacaa 4021 ctgcgacaag atgatctgcc gcaagtatgt gtgctccgat gcttgggggg ctgtgggggc 4081 tgccggagtc ggggtatgcc ctcacccacc cctcctgtct ctgtgcaggt gctatgctcg 4141 ccttcaccct cgtgctgtca actgccgcaa gaagaagtgt ggtcacacca acaacctgcg 4201 tcccaagaag aaggtcaaat aaggtggttc tttccttgaa gggcagcctc ctgcccaggc 4261 cccgtggccc tggagcctca ataaagtgtc cctttcattg actggagcag caattggtgt 4321 cctcatggct gatctgtcca gggaggtggc tgaagagtgg gcatctccct tagggactct 4381 actcagcact ccattctgtg ccacctgtgg ggtcttctgt cctagattct gtcacatcgg 4441 cattggtccc tgccctatgc ccctgactct ggatttgtca tctgtaaaac tggagtaaaa 4501 acctcagtcg tgtaattggt gggactgagg atcagttttg tcattgctgg gatcc // LOCUS HSUBR 3321 bp DNA PRI 07-APR-1994 DEFINITION H.sapiens gene for uterine bombesin receptor. ACCESSION X76498 NID g468753 KEYWORDS bombesin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3321) AUTHORS Gorbulev,V.G. 
TITLE Direct Submission JOURNAL Submitted (29-NOV-1993) V.G. Gorbulev, M.P.I. fuer Biophisik, Kennedy-Allee 70, 60596 Frankfurt-am Main, FRG REFERENCE 2 (bases 1 to 3321) AUTHORS Gorbulev,V., Akhundova,A., Grzeschik,K.H. and Fahrenholz,F. TITLE Organization and chromosomal localization of the gene for the human bombesin receptor subtype expressed in pregnant uterus JOURNAL FEBS Lett. 340 (3), 260-264 (1994) MEDLINE 94178377 FEATURES Location/Qualifiers source 1..3321 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" exon <1155..1588 /number=1 CDS join(1155..1588,1972..2323,2459..2872) /codon_start=1 /product="uterine bombesin receptor" /db_xref="PID:g468754" /db_xref="SWISS-PROT:P32247" /translation="MAQRQPHSPNQTLISITNDTESSSSVVSNDNTNKGWSGDNSPGI EALCAIYITYAVIISVGILGNAILIKVFFKTKSMQTVPNIFITSLAFGDLLLLLTCVP VDATHYLAEGWLFGRIGCKVLSFIRLTSVGVSVFTLTILSADRYKAVVKPLERQPSNA ILKTCVKAGCVWIVSMIFALPEAIFSNVYTFRDPNKNMTFESCTSYPVSKKLLQEIHS LLCFLVFYIIPLSIISVYYSLIARTLYKSTLNIPTEEQSHARKQIESRKRIARTVLVL VALFALCWLPNHLLYLYHSFTSQTYVDPSAMHFIFTIFSRVLAFSNSCVNPFALYWLS KSFQKHFKAQLFCCKAERPEPPVADTSLTTLAVMGTVPGTGSIQMSEISVTSFTGCSV KQAEDRF" intron 1589..1971 /number=1 exon 1972..2323 /number=2 intron 2324..2458 /number=2 exon 2459..>2872 /number=3 BASE COUNT 944 a 666 c 700 g 1001 t 10 others ORIGIN 1 tctagagaga aaaatcctaa acagcccctt aaaagaaaac atggcattaa aacagacagt 61 caaacagtaa aaagtggttt tattaggaca tttagagaca aagcaactgg tgctgtgcct 121 ggatttttca tctcttaaaa ggttgttttg gctgggtgtg gtggctcatg cctgtaatcc 181 cagcactttg ggaggctgag gcaggcggat tgcctgaggt caggagttca agaccagcct 241 ggccaacatg gtgaaaccct gtctctacta aaaatacaaa aattagccag gcatggtggc 301 gggcgcctgt aatcccagct actcgggaga ctgaggcagg agaattgctt gaacccagga 361 ggtggaggtt gcagtgagcc aagatcgagc cactgcactc cagcctgggt gacagagtga 421 gactccatct caaaaaaaaa aaaaaaatta aaaaggttgt tttgatagtg tacagcacag 481 cctggataac tgcaaagttg gggtttacca cgtgacagag ctaggtatta agcccaagtg 541 acctgaatga tcactccttg gaggtgacat aaaccctgtc gaagtgtaaa tgatctgttg 601 
cttcaccatc tctctgatat taaattttga ggagtgggtt caaggttgtg cttcagggga 661 cacctacact tatttatcat gtaaaacaaa atgcactaga ctaccatgtg attgcagggg 721 gataaatggt ttttttctta aaaaaaattg acaaaaccta tatagaccat cctgggtaga 781 tatgaaaaca gggtgattct aataataaat aagacgattt aatagggtct agttggtccc 841 aagattcccc aacagggaag aataagcaac tgcagagcat ttccccagcc ccctccctct 901 ccttatgaca ggacacgcca ataaaactcc acctcctact cttcagtgcc agatgggtaa 961 gcatttagcc aagcatgctg ggagacacac agaactgaag caaaggagta tctggatgtc 1021 ttggattttc ttcccattct gttctgttct gttctcctaa taccatctcg ttactagacg 1081 taggcattgg acgtgacaat caactgcatt tgaactgaga agaagaaata ttaaagacac 1141 agtcttcaga agaaatggct caaaggcagc ctcactcacc taatcagact ttaatttcaa 1201 tcacaaatga cacagaatca tcaagctctg tggtttctaa cgataacaca aataaaggat 1261 ggagcgggga caactctcca ggaatagaag cattgtgtgc catctatatt acttatgctg 1321 tgatcatttc agtgggtatc cttggaaatg ctattctcat caaagtcttt ttcaagacca 1381 aatccatgca aacagttcca aatattttca tcaccagcct ggcttttgga gatcttttac 1441 ttctgctaac ttgtgtgcca gtggatgcaa ctcactacct tgcagaagga tggctgttcg 1501 gaagaattgg ttgtaaggtg ctctctttca tccggctcac ttctgttggt gtgtcagtgt 1561 tcacattaac aattctcagc gctgacaggt gagtttcttt tctccattat atttgccagg 1621 atgtgaaatt gggcaaaaag aaaggaaagc ttnnnnntga attactctca gtcacacata 1681 aagtcatgct gataaatgtt tatttgtcct aaatagagag tctatacata aaattgcaat 1741 ctggaatatc tcataggagt gaataatgga aaaggagtta gcataaataa aaatattggg 1801 atgtattatg tgcatttaaa accatagcca attctagttg gttaatgttg acttggttat 1861 acacatatgc acggagggta gaactggatg accactaagg tcacttctca ccctaaaatt 1921 tggtaacttt gacatttatt tggacctttg cctctgatta tgtgtttcta gatacaaggc 1981 agttgtgaag ccacttgagc gacagccctc caatgccatc ctgaagactt gtgtaaaagc 2041 tggctgcgtc tggatcgtgt ctatgatatt tgctctacct gaggctatat tttcaaatgt 2101 atacactttt cgagatccca ataaaaatat gacatttgaa tcatgtacct cttatcctgt 2161 ctctaagaag ctcttgcaag aaatacattc tctgctgtgc ttcttagtgt tctacattat 2221 tccactctct attatctctg tctactattc cttgattgct aggacccttt ataaaagcac 2281 cctgaacata 
cctactgagg aacaaagcca tgcccgtaag caggtatgta ttaatcagta 2341 ctcatgcaaa tctagtttga acnnnnngct aaatgttttg ttttttttgt tgttgttgtt 2401 ttttgtgttt tttttttttt tttttttttt ttttttgcta tgtttcttcc ccctatagat 2461 tgaatcccga aagagaattg ccagaacggt attggtgttg gtggctctgt ttgccctctg 2521 ctggttgcca aatcacctcc tgtacctcta ccattcattc acttctcaaa cctatgtaga 2581 cccctctgcc atgcatttca ttttcaccat tttctctcgg gttttggctt tcagcaattc 2641 ttgcgtaaac ccctttgctc tctactggct gagcaaaagc ttccagaagc attttaaagc 2701 tcagttgttc tgttgcaagg cggagcggcc tgagcctcct gttgctgaca cctctcttac 2761 caccctggct gtgatgggaa cggtcccggg cactgggagc atacagatgt ctgaaattag 2821 tgtgacctcg ttcactgggt gtagtgtgaa gcaggcagag gacagattct agcttttcaa 2881 ggaaaaatgc tgcttctcct cccagcgtgt gtatccgact ctaagctgtg tgcaggtgta 2941 tggtgtccag atttttgttg tttgaaaagt gtgttgaaat cttaggagtg aaggatccct 3001 ataagtaagt aaaatacaaa ccattacttt cttcaaagta caaatagtaa tgtcatcggg 3061 ctttctaaat aaaatgaagc cccactaagt gcagaaagac aagtttatat atgccagtga 3121 atcgtaaggg aagtcaaatg ggaaggaagg tgtaataaac aagatgaaac ttaaaaatct 3181 catttgttgt ttaatcacat ctgtgatgct tctaactcct catatactgt gatttgcata 3241 aaattgtgtt tgtgtttact gtgtggtgta aatctagaga tactttgtat gtgaaaaggg 3301 ggatcgagtc gagtcgacgc c // LOCUS HSUPA 7258 bp DNA PRI 07-FEB-1997 DEFINITION H.sapiens uPA gene. ACCESSION X02419 NID g37601 KEYWORDS plasminogen activator; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7258) AUTHORS Riccio,A., Grimaldi,G., Verde,P., Sebastio,G., Boast,S. and Blasi,F. TITLE The human urokinase-plasminogen activator gene and its promoter JOURNAL Nucleic Acids Res. 13 (8), 2759-2771 (1985) MEDLINE 85215647 COMMENT Direct repeat 1 is the hexanucleotide sequence GGCGG, previously found at similar regions in several viral and eukaryotic promoters and known to be essential for promoter activity. 
(McKnight et al (1984) Cell, 37, 253-262). FEATURES Location/Qualifiers source 1..7258 /organism="Homo sapiens" /db_xref="taxon:9606" gene 720..7188 /gene="uPA" CAAT_signal 720..722 /gene="uPA" repeat_region 739..744 /note="repeat 1" /rpt_type=DIRECT repeat_region 754..759 /note="repeat 1" /rpt_type=DIRECT repeat_region 765..770 /note="repeat a" /rpt_type=DIRECT TATA_signal 775..781 /gene="uPA" mRNA join(802..889,1196..1283,1701..1728,1875..1982,2586..2760, 2954..3045,3203..3422,3644..3792,4458..4598,4945..5093, 6083..7188) /gene="uPA" exon 802..889 /gene="uPA" /number=1 intron 890..1195 /gene="uPA" /number=1 exon 1196..1283 /gene="uPA" /number=2 CDS join(1227..1283,1701..1728,1875..1982,2586..2760, 2954..3045,3203..3422,3644..3792,4458..4598,4945..5093, 6083..6259) /gene="uPA" /EC_number="3.4.21.73" /codon_start=1 /product="urokinase-plasminogen activator" /db_xref="PID:e300604" /db_xref="PID:g1834524" /translation="MRALLARLLLCVLVVSDSKGSNELHQVPSNCDCLNGGTCVSNKY FSNIHWCNCPKKFGGQHCEIDKSKTCYEGNGHFYRGKASTDTMGRPCLPWNSATVLQQ TYHAHRSDALQLGLGKHNYCRNPDNRRRPWCYVQVGLKPLVQECMVHDCADGKKPSSP PEELKFQCGQKTLRPRFKIIGGEFTTIENQPWFAAIYRRHRGGSVTYVCGGSLMSPCW VISATHCFIDYPKKEDYIVYLGRSRLNSNTQGEMKFEVENLILHKDYSADTLAHHNDI ALLKIRSKEGRCAQPSRTIQTICLPSMYNDPQFGTSCEITGFGKENSTDYLYPEQLKM TVVKLISHRECQQPHYYGSEVTTKMLCAADPQWKTDSCQGDSGGPLVCSLQGRMTLTG IVSWGRGCALKDKPGVYTRVSHFLPWIRSHTKEENGLAL" intron 1284..1700 /gene="uPA" /number=2 exon 1701..1728 /gene="uPA" /number=3 intron 1729..1874 /gene="uPA" /number=3 exon 1875..1982 /gene="uPA" /number=4 intron 1983..2585 /gene="uPA" /number=4 exon 2586..2760 /gene="uPA" /number=5 intron 2761..2953 /gene="uPA" /number=5 exon 2954..3045 /gene="uPA" /number=6 intron 3046..3202 /gene="uPA" /number=6 exon 3203..3422 /gene="uPA" /number=7 intron 3423..3643 /gene="uPA" /number=7 exon 3644..3792 /gene="uPA" /number=8 intron 3793..4457 /gene="uPA" /number=8 exon 4458..4598 /gene="uPA" /number=9 intron 4599..4944 /gene="uPA" /number=9 exon 4945..5093 /gene="uPA" /number=10 
intron 5094..6082 /gene="uPA" /number=10 exon 6083..7188 /gene="uPA" /number=11 polyA_signal 7155..7160 /gene="uPA" polyA_signal 7168..7173 /gene="uPA" polyA_site 7188 /gene="uPA" BASE COUNT 1680 a 1858 c 2089 g 1631 t ORIGIN 1 ttcaatagga agcaccaaca gtttatgccc taggactttg ttcccacaat cctgtaacat 61 catatcacga cacctaaccc aatccttatc aagccctgtc aaaaacggac tttaaaccaa 121 gctgcaaatt ttcagtaatc tggccttgcc tttccccctc tgatagcacc atcaaacaaa 181 cccccttact gccgaaagca ataagcccgg ctttgttcca tccactggtt gtgttggtga 241 tatctgggga ctgccactga acagacgcac agagggagcc cctacaggca ggggtttttc 301 tgtctgtgct tcttgggaga gtatgtctcg tacatttgtc gcgtgatgaa gacttcacag 361 ctccatccag cgaccagact cacagctcca tccagctgcg gcaagggggt ctgaggcagt 421 cttaggcaag ttggggccca gcgggagaag ttgcagaaga actgattaga ggacccagga 481 ggcttcagag ctgggctgag gtagagagtc tcctgtgcgc cttctctcct ctctgcaatt 541 cggggactcc ttgcactggg gcaggccccg gcaggtgcat gggaggaagc acggagaatt 601 tacaagcctc tcgattcctc agtccagacg ctgttgggtc ccctccgctg gagatcgcgc 661 ttcccccaaa tctttgtgag cgttgcggaa gcacgcgggg tccgggtcgc tgagcgctgc 721 aagacagggg agggagccgg gcgggagagg gaggggcggc gccggggcgg gccctgatat 781 agagcaggcg ccgcgggtcg cagcacagtc ggagaccgca gcccggagcc cgggccaggg 841 tccacctgtc cccgcagcgc cggctcgcgc cctcctgccg cagccaccgg tgagtgccgc 901 ggtcctgaga tccccgggcc ggatgcgcgg cggccccagc tcccgagcgt ctgcctgccc 961 cgccctgggc tgcccgggct ccctgggctc cccggcggct gcacggagtc aaggcgcccc 1021 gtcccgggcg tcccccgcgg gtgccgatcc aggctgcccg gagtccggag cccatagagg 1081 agagagacag ctggggagcc tggtcaccgc gggcatctcc cctgcgctgc agtcgcccgc 1141 ctggcctgcc ttcccgttcc tccgcctctt gccctgactt ctccttcctt tgcagagccg 1201 ccgtctagcg ccccgacctc gccaccatga gagccctgct ggcgcgcctg cttctctgcg 1261 tcctggtcgt gagcgactcc aaagtgagtg cgctcttgct ttgactgatg ctgcccaagg 1321 acctctgatc agcaccaggg gagaggaggg gctgctcagg gagctggggt ctccggattc 1381 catccacagc agggccagac tctccccagg aaatgggaca gggtggcagc ggaggcttga 1441 gaaccacggg ggttggcact ggctggcaag ggaggaagag ggccaccggg actgccccag 1501 cctgcgggca 
tctggtagat gaagcttaat ccatttctcc tggctggaaa ccatggtctt 1561 ccatttgaga actagatacg aacagggtga ggcgagaggg agagggaaga gtgggttttg 1621 ggattggggc cagtttaccc tcaccctgga tccctggagc atgggacctt tgatgaagcc 1681 tcctcccgaa tctcttccag ggcagcaatg aacttcatca agttccatgt gagtatccac 1741 ccctacaaca gttggctgca cagacaagtt gggaaggctt caggggacac tcccctccct 1801 gccctctgct gcagcgtgcg ccacccctta ccacttccac tccccctcgc ttaccccacc 1861 tttgttctct ccagcgaact gtgactgtct aaatggagga acatgtgtgt ccaacaagta 1921 cttctccaac attcactggt gcaactgccc aaagaaattc ggagggcagc actgtgaaat 1981 aggtatgggg atctccactg caactgggag agaaatttgg ggacagggag ggatgggtgg 2041 gaggcaagag caggcaggag ttaggagctg gaggtagggt gggtgacatc ttcatcccta 2101 tgtgacaagc ataaacacac acacacgctc acgaaacagt ggccacacaa atgtgaggtg 2161 gggttggaag gagaccctgt ccagtcttct ggcaggtctg aaacgacatc tttaaaatgt 2221 ccgttggcag ccgggcatgg tggctcacgc ttgtaatccc agcattttga gaggtcaagt 2281 ttgagtggat catttaggtc aggagttcaa gaccagcctg gacaacatgg tgtaaccctg 2341 cctctactaa aaatgcaaaa atcagcctgg catggtggtg gatgcctgta gtcccagcta 2401 cttgggaggc tgaggcagga gaattgcttg aacatgggag gccagatctc agtgagctga 2461 gatcacacca ctgcactcca actgggcgac agagcaagac tccatctcaa aaaaaaaaaa 2521 aaataaaagt tagttggaat gttcttctct ttctcatatt ctctcatcct cctgtcccct 2581 tgtagataag tcaaaaacct gctatgaggg gaatggtcac ttttaccgag gaaaggccag 2641 cactgacacc atgggccggc cctgcctgcc ctggaactct gccactgtcc ttcagcaaac 2701 gtaccatgcc cacagatctg atgctcttca gctgggcctg gggaaacata attactgcag 2761 gtgaggtggg ggcaacaagg accaaaagcc ctccctacag cttcccagaa accttgttac 2821 catccccttc tcccagaggg ctggccatag cacaagagaa gtgcggcctc tggttgagtc 2881 ttccctgagg ggaggaggca gggaaggccc tctgggttgg aatgacatcc cctatctttc 2941 tgtgttgtgc caggaaccca gacaaccgga ggcgaccctg gtgctatgtg caggtgggcc 3001 taaagccgct tgtccaagag tgcatggtgc atgactgcgc agatggtgag catcactgac 3061 ctgctgatga caggtgggtg gaaggggaca aacttacatg tccccttatt ccatcacagg 3121 aggactgagg aggtgggggg tgcccgagag ggatgctttc tcctacctgc ctccctaaga 3181 catccctctg tttgtcctcc 
aggaaaaaag ccctcctctc ctccagaaga attaaaattt 3241 cagtgtggcc aaaagactct gaggccccgc tttaagatta ttgggggaga attcaccacc 3301 atcgagaacc agccctggtt tgcggccatc tacaggaggc accggggggg ctctgtcacc 3361 tacgtgtgtg gaggcagcct catgagccct tgctgggtga tcagcgccac acactgcttc 3421 atgtacggcc ctgggtttct cctcttcgac tcttctgccc caccccaagc acatcccttt 3481 ctccttccca gcaaagtgtt ccgcctcatt tctccctcat ctgcccctgt ccatgcgccc 3541 atggccttgg ggacaagtcg tgctttgagg cctctaggga gggaaggaag aagtggcatg 3601 atttcatggg actaagctgt ttgatgggta tcttcttcca cagtgattac ccaaagaagg 3661 aggactacat cgtctacctg ggtcgctcaa ggcttaactc caacacgcaa ggggagatga 3721 agtttgaggt ggaaaacctc atcctacaca aggactacag cgctgacacg cttgctcacc 3781 acaacgacat tggtgagggg gaacgcccgc gactactgtg gccataatgg cttggggaga 3841 gtgggaccca gggagagact ggagctgagt tgaagctgcc ggtggggcag gggtggggcg 3901 agggaccttg aagcctcgat atacatgaca aaggatggca gggaagagtt ccatgaagtc 3961 tgaggggcct ggtgctcctc tggagagacc ctgaatttcc ccaacaagta gccctcttgc 4021 gagtggaaac agccctgtgg gtatatggct tgggctggga aggccctgtt tatatgaatt 4081 agaaaaagac acaccttcct ttgtgggatg cagcctctgt ctgtgctagg atatagaact 4141 tggagaatgg agccttggga tggattccag cctaactacc tcagggggat cctctagagt 4201 gcagctggga gtttttgcag aaacgacctg tacagctgta tgcagtggct ctggccatcc 4261 aagccttttt caacacctgg aacaaagccc ttggggcatg gggcagggga ggtttccagg 4321 tgataagcga ccagcagacc tccctggatg actgacctag ggataggcat agctacttcc 4381 tcggcacttg gaggggacag atggggaccg cctaaccagt agtgatcttt ctcctctgac 4441 cctctgtcct cccccagcct tgctgaagat ccgttccaag gagggcaggt gtgcgcagcc 4501 atcccggact atacagacca tctgcctgcc ctcgatgtat aacgatcccc agtttggcac 4561 aagctgtgag atcactggct ttggaaaaga gaattctagt aagtgacaat tgcgactgac 4621 ttagaaggtc ctgaggagtg ttttgacctg aaaatgagcc cagtgtgatc aagggaagac 4681 tgcagagtta gaggtgggag cactgaggcg gtggcagatg ggtccaggga tggatgaaga 4741 gtgttgttta gggagcgatg ggctgcaaag gtaaatagat ggtaggggct ataggtggag 4801 gtaaatggct cagatttgca tggagagaga ataatgggcc tctccctggg tgatgatact 4861 ttatggtgtc ccctctctgg cgagacgtcc 
cacgtggagg cagataaatc ttgatgcaaa 4921 cgcctccctg ttttctccac ctagccgact atctctatcc ggagcagctg aaaatgactg 4981 ttgtgaagct gatttcccac cgggagtgtc agcagcccca ctactacggc tctgaagtca 5041 ccaccaaaat gctgtgtgct gctgacccac agtggaaaac agattcctgc caggtgagtg 5101 ttccaagcat ctctctccac ctcttccata tctccccaga gctcctgggc ttgttccagc 5161 cagcttaagg gtgtctctct ctagccaaag ccctaagtag ccagaatcag gagctcaggt 5221 ctttgagggt ttaaaccagt ccttatgtgt ttgccagaca ttaccaaaaa aatcccagct 5281 ctgcgctagt cacttcagac tgggggcacg agatcctaga aagaggaaac agtaaaagac 5341 aatgtaactc agtgcccagg gtgtgttgtg aactataaat gatcaggtgt tcaggagagg 5401 gaggtgagtg ccaacctgag ggtcagggag gggaggcttt aaaggaaatg tgacttgata 5461 ggcatttgaa gaggcagagg gaagaaagga aggtgtttca gttgaaagat acaaaactga 5521 gaaggaggct ggcatattcc gggtggggag gagaactagg gtctgggagt gtggatggaa 5581 tagtggcaga tgacagggct tttaaagcca agcaggggat tttccaactt cgatgtggta 5641 gaaatggggc tgcgtcaggc acagtggctc atgcctgtaa tcccagcatt gggctaggcc 5701 gtagtcgatg gatcattgag gccagagttg agaccggcct ggaccaacat ggtgaaaccc 5761 tgtgtctact aaaaaatgca aaaaaaaaaa ttagccaggt gtggtggtgc ctgcctgtaa 5821 tcccagctaa tcaggaggct gagacatgga atcgcttgag cacaggaggc aagtttgacg 5881 tgagctgaga tcacgtcatt gcacgccagc ctgggcgaca gagcgagatt ctgtcctccc 5941 gccgaaaaaa gaaagaaaat gggaagtcgc taaggacttt gactgggaaa ctcttccctc 6001 tctctggtat ggttgggtga tgggatcaga aatcccctcc tcacttctct agggctcatc 6061 ttttgtatct ttggcgtcac agggagactc agggggaccc ctcgtctgtt ccctccaagg 6121 ccgcatgact ttgactggaa ttgtgagctg gggccgtgga tgtgccctga aggacaagcc 6181 aggcgtctac acgagagtct cacacttctt accctggatc cgcagtcaca ccaaggaaga 6241 gaatggcctg gccctctgag ggtccccagg gaggaaacgg gcaccacccg ctttcttgct 6301 ggttgtcatt tttgcagtag agtcatctcc atcagctgta agaagagact gggaagatag 6361 gctctgcaca gatggatttg cctgtgccac ccaccagggt gaacgacaat agctttaccc 6421 tcaggcatag gcctgggtgc tggctgccca gacccctctg gccaggatgg aggggtggtc 6481 ctgactcaac atgttactga ccagcaactt gtctttttct ggactgaagc ctgcaggagt 6541 taaaaagggc agggcatctc ctgtgcatgg gtgaagggag 
agccagctcc cccgacggtg 6601 ggcatttgtg aggcccatgg ttgagaaatg aataatttcc caattaggaa gtgtaacagc 6661 tgaggtctct tgagggagct tagccaatgt gggagcagcg gtttggggag cagagacact 6721 aacgacttca gggcagggct ctgatattcc atgaatgtat caggaaatat atatgtgtgt 6781 gtatgtttgc acacttgtgt gtgggctgtg agtgtaagtg tgagtaagag ctggtgtctg 6841 attgttaagt ctaaatattt ccttaaactg tgtggactgt gatgccacac agagtggtct 6901 ttctggagag gttataggtc actcctgggg cctcttgggt cccccacgtg acagtgcctg 6961 ggaatgtact tattctgcag catgacctgt gaccagcact gtctcagttt cactttcaca 7021 tagatgtccc tttcttggcc agttatccct tccttttagc ctagttcatc caatcctcac 7081 tgggtggggt gaggaccact ccttacactg aatatttata tttcactatt tttatttata 7141 tttttgtaat tttaaataaa agtgatcaat aaaatgtgat ttttctgatg acaaatctcc 7201 ctggtgcttg tatgggaagg agttggagta cataaaaagg agaaaataac aaaggtgg // LOCUS HSUSF2 14440 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens USF2 gene. ACCESSION Y07661 NID g1806093 KEYWORDS USF2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14440) AUTHORS Groenen,P.M., Garcia,E., Debeer,P., Devriendt,K., Fryns,J.P. and Van de Ven,W.J. TITLE Structure, sequence, and chromosome 19 localization of human USF2 and its rearrangement in a patient with multicystic renal dysplasia JOURNAL Genomics 38 (2), 141-148 (1996) MEDLINE 97127588 REFERENCE 2 (bases 1 to 14440) AUTHORS Groenen,P. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) P. Groenen, Center for Human Genetics, Laboratory for Molecular Oncology, Herestraat 49, 3000 Leuven, BELGIUM COMMENT Related sequence: X90823-X90826. 
FEATURES Location/Qualifiers source 1..14440 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="q13.1" /clone_lib="LLNL 7058" /dev_stage="adult" /haplotype="2n" mRNA join(3688..3749,4031..4077,4166..4284,4388..4588, 5032..5182,5303..5390,5668..5726,13284..13378, 13532..13660,13753..14380) /gene="USF2" gene 3688..14380 /gene="USF2" exon 3688..3749 /gene="USF2" /number=1 CDS join(3688..3749,4031..4077,4166..4284,4388..4588, 5032..5182,5303..5390,5668..5726,13284..13378, 13532..13660,13753..13842) /gene="USF2" /codon_start=1 /db_xref="PID:e291919" /db_xref="PID:g1806094" /translation="MDMLDPGLDPAASATAAAAASHDKGPEAEEGVELQEGGDGPGAE EQTAVAITSVQQAAFGDHNIQYQFRTETNGGQVTYRVVQVTDGQLDGQGDTAGAVSVV STAAFAGGQQAVTQVGVDGAAQRPGPAAASVPPGPAAPFPLAVIQNPFSNGGSPAAEA VSGEARFAYFPASSVGDTTAVSVQTTDQSLQAGGQFYVMMTPQDVLQTGTQRTIAPRT HPYSPKIDGTRTPRDERRRAQHNEVERRRRDKINNWIVQLSKIIPDCNADNSKTGASK GGILSKACDYIRELRQTNQRMQETFKEAERLQMDNELLRQQIEELKNENALLRAQLQQ HNLEMVGEGTRQ" intron 3750..4030 /gene="USF2" /number=1 exon 4031..4077 /gene="USF2" /number=2 intron 4078..4165 /gene="USF2" /number=2 exon 4166..4284 /gene="USF2" /number=3 intron 4285..4387 /gene="USF2" /number=3 exon 4388..4588 /gene="USF2" /number=4 intron 4589..5031 /gene="USF2" /number=4 exon 5032..5182 /gene="USF2" /number=5 intron 5183..5302 /gene="USF2" /number=5 exon 5303..5390 /gene="USF2" /number=6 intron 5391..5667 /gene="USF2" /number=6 exon 5668..5726 /gene="USF2" /number=7 intron 5727..13283 /gene="USF2" /number=7 exon 13284..13378 /gene="USF2" /number=8 intron 13379..13531 /gene="USF2" /number=8 exon 13532..13660 /gene="USF2" /number=9 intron 13661..13752 /gene="USF2" /number=9 exon 13753..14380 /gene="USF2" /number=10 BASE COUNT 2766 a 4210 c 4353 g 3110 t 1 others ORIGIN 1 tcaggaggca gcaggtccca ctgctgatga taaagttgca ggctgcctga gctaatgaag 61 gggcttcctc taggctgtgc acttagtctt ctgcttccaa accaaatcag aggtgaggca 121 ccctctctgg gcccatctct ctcctccatt ttcctgttgg ggtcccaggg aggaagccac 181 ttgcctaggg cccaggaatt ttgcaagcct 
cttgccctag ggaggaagga agggaggagg 241 atcttacctt gaactgtcaa gcctagagcc tggtggggca ggcagaaatg ggtgcagtcc 301 atgagttaga aacactagag gagacacttt gctgcttggc cggggcaggc aagttaattc 361 ccgaggctcc tgccactgca tctcaatctg gaaggtgacc aggtgggcag gacccacgtc 421 tcccagatga ctcatttttt ctagaacagg ggcttggctg ccaaagagga tacttgattt 481 cggcttgtgg ggacagtggt ggacccagca tctgggcttt atataaaggg cagctttgtt 541 gccctgtaaa cacacagacc atgggtggcc acttcttcca gtaagttagc tggggagttg 601 gaagtttagg taaaaccttt tgattgacaa atgttggcga attaccatgc tgttaaatga 661 aacattgttc tgccaccctg gggctgtggg tgcctgcgtg caccctctga aaaatcacac 721 aggaagtggg gtggggtctc tgtgaagctg gtgtccccca gcctcaggga tgctgcagaa 781 atggaatgag gaccaacagg gactcagatg tccaaggaag ctctacagcg gagaggacgg 841 cttgggaagg aggtccaggc ccaggtccct ccggaaccca atgggtatgg ggcagcctgg 901 ctcctgcctc atcccccttc tcctgttgat tgtgtcctca cagtgtatgc cgccggcaaa 961 gcagccacct caggtgttcc cagcatttat gcccccagca cctatgccca cctgtctccc 1021 gccaagaccc cacccccacc agctatgatt cccatgggcc ctgcctacaa cgggtaccct 1081 ggaggatacc ctggagacgt tgacaggagt agctcaggtg aggccggggg aagcaggaac 1141 agctggtggg agtgtgctgg gcatctggac actgaggggc aggggctgga aggaagagtg 1201 tcttgggagc cgaggagggg ctctgctcct ggtgcgcggc cactgacagc cactctcccc 1261 cagctggtgg ccaaggctcc tatgtacccc tgcttcggga cacggacagc agtgtggcct 1321 ctggtgagaa tccatcgtcc cgaagttgga tgtgcctgta agggagaggg gtgggccagg 1381 atccatcctc ccaaaccgac caccaccccc ctgtccctag aagtccgcag tggctacagg 1441 attcaggcca gccagcagga cgactccatg cgggtcctgt actacatgga gaaggagctg 1501 gccaacttcg acccttctcg acctggcccc cccagtggcc gtgtggagcg gggtaagcag 1561 gagccttggg gtctgagggc ttttaaggtg ggggggtgaa acatgtctcc ctgatacctg 1621 ccgcagggac tcttggtgca aaccctggac cccgggctcc tccagcagtc agtgacaccc 1681 cccttccctg cagccatgag tgaagtcacc tccctccacg aggacgactg gcgatctcgg 1741 ccttcccggg gccctgccct caccccgatc cgggatgagg agtggggtgg ccactccccc 1801 cggagtccca ggggatggga ccaggagccc gccagggagc aggcaggcgg gggctggcgg 1861 gccaggcggc cccgggcccg ctccgtggac gccctggacg acctcacccc 
gccgagcacc 1921 gccgagtcag ggagcaggtc tcccacgagt aatggtggga gaagccgggc ctacatgccc 1981 ccgcggagcc gcagccggga cgacctctat gaccaagacg actcgaggga cttcccacgc 2041 tcccgggacc cccactacga cgacttcagg tctcgggagc gccctcctgc cgaccccagg 2101 tcccaccacc accgtacccg ggaccctcgg gacaacggct ccaggtccgg ggacctcccc 2161 tatgatgggc ggctactgga ggaggctgtg aggaagaagg ggtcggagga gaggaggaga 2221 ccccacaagg aggaggagga agaggcctac tacccgcccg cgccgccccc gtactcggag 2281 accgactcgc aggcgtcccg agagcgcagg ctcaagaagg tgagggccgc cctccctggc 2341 gtccagaccg tccctgggcc cccagccggt ccccgcggct catacccttc tttctttctc 2401 ccttgcagaa cttggccctg agtcgggaaa gtttagtcgt ctgatctgac gttttctacg 2461 tagcttttgt attttttttt ttaatttgaa ggaacactga tgaagccctg ccatacccct 2521 cccgagtcta ataaaacgta taatcacaag ctctggagag aaccatttgt tcggccgcgc 2581 ggggcggggg accggggctg ctcccgtatg cgtctgtaaa gcgccgcgtc ccgggggcac 2641 cggagtccgg ggccgggagg aagagaccca gcctggcccg gcccgcgccc gcgccgccgg 2701 ccggagaacg tgccccgcgc agccgccgcc cgcctgcgtg cgcgccccgg ccccgcccag 2761 gcgtgcgcat gcgccccggc cctccgcctt cgcgcaccgc aggctggccg tccgggacgc 2821 gcgcgcgctc ctctcccctt ccagcccatc ccccccagcc ccccaccgac ctactttact 2881 gtctccaaac tcgggcagcc cacctggccc ccgacgaccc cagcccctgc taccgggtac 2941 cccgacgttc catccagacc cgcgtttcac cagggcggcg cgcggcgacc tcgcgccccg 3001 cggagccccg ggctcgcgcg cgcccgcccg cccccggaga cagaccagcg cgcgcgctcc 3061 cgggccgcct ccccccagcg cgcgtccgcc ccsggctcgc gccgccgccg ccgccgccgc 3121 cgcgcgcgcg cagctcaagt aaaggaggaa aaaaaaaagg gggaaaaata gaaagcggcg 3181 gcggctgcag cagcgatccg ccgccggact gggccaagcc gggcggcggc cgcgcgagcc 3241 ggcgatccag ggcactggcg gcggccagcc agggcgggcc gtgttcaaaa aaaaaagtcg 3301 cggcggcggc ggctgctcag ggaaggaggc ctgagggccg cgtgcagcgg gcgggcagct 3361 gggtgggctg ggggcggccg cgcggcgtcc cggagcctcg ggccgcccgg agccggcggg 3421 cgggcggagg cggaggcggc ggcggctgca gcggctgcag gagcggcggc ggctgcggcg 3481 gcggcggcgg catctcctcc tcacatgacc ccactgtttg tccccgtgat cagcgcgagc 3541 ggctcccgta tctcctccgt cccctcctgc cgcgcggcgt gagcgccggg ctcggggccc 
3601 ccccggccgc ccgccccctc ccctccctcc ctcccctccc ctcccctccc ccccgggccc 3661 cgcgcccccc ccgcccccgc cccccccatg gacatgctgg acccgggtct ggatcccgct 3721 gcctcggcca ccgctgctgc cgccgccagg taagatcccc ggcccggccg tgcccccgcg 3781 ccccggcccc ggccccggcc ccgcggcctg caggccgggg ccgcccatga tcccgagcgg 3841 ccgcgggccg gctcaaaatg gaggccgccg gcgcgggggg gacctggcgc ctcccgcccc 3901 cggcccccgg cctcggcggc gccccggcct caggcgcggc cgggtgggac tggggccctg 3961 cagctgggcg cgggggcggg ggcgcgggcg cgggccgcgc tgaccctgct ccctcctgtg 4021 cccctggcag ccacgacaag ggacccgagg cggaggaggg cgtcgagctg caggaaggtg 4081 agtgcttgcc gggccggccg cgcccgggga gggctggggg cgctcggcgc ggccctgacc 4141 gtgccccgac cctcctcggc cccaggcggg gacggcccag gagcggagga gcagacagcg 4201 gtggccatca ccagcgtcca gcaggcggcg ttcggcgacc acaacatcca gtaccagttc 4261 cgcacagaga caaatggagg acaggtgagc ggcgggccgc gagggcgaac gggcgggcgg 4321 gcgggcgcgc cgggaaggct cggacctggc cccagcgccg gcctcgccgc tctgccgccc 4381 cctgcaggtg acataccgcg tagtccaggt gactgatggt cagctggacg gccagggcga 4441 cacagctggc gccgtcagcg tcgtgtccac cgctgccttc gcgggggggc agcaggctgt 4501 gacccaggtg ggtgtggacg gggcagccca gcgcccgggc cccgccgctg cctctgtgcc 4561 cccaggtcct gcagcgccct tcccgctggt aggtgccctg ccacccctgg gtgggggggg 4621 gagggagtgg agaggggaca ctggctctgc tcttggggag ccccgggggt ggggcaggtg 4681 tcgcccagcg gatgctgcct tcaggcctca ggctgcatgg ggccagatcc ctgttgtgca 4741 ccgtgaaacc tggggacagt gctcttggaa tttttttttc ccgcaaaatg ggaatgatgt 4801 gtcttaagag gaaagtttct tagagtgcga aacggatttg ctcagcaagt gatcgctgac 4861 ctcccgccat gttccagggc tgtttgaaac acaggacaga atgctacacg cagaaggagt 4921 ccccactcct gttaattgca gagaatgcag taaaccccaa gcacagcaat gaggggacgg 4981 gatggaggaa cagagaaagc taagggtagc ctctgccctc ctactcccca ggctgtgatc 5041 caaaatccct tcagcaatgg tggcagtccg gcggccgagg ctgtcagcgg ggaggcacga 5101 tttgcctatt tcccagcgtc cagtgtggga gatactacgg ctgtgtccgt acagaccaca 5161 gaccagagct tgcaggctgg aggtgaggag tagaagtcag attggcaggt gggggaggca 5221 acgggcccag cagggaggga agccccccca gccagttctg acttcaccct gccttgccac 5281 
taacccccca ctctccctgc aggccagttc tacgtcatga tgacgcccca ggatgtgctt 5341 cagacaggaa cacagaggac gatcgccccc cggacacacc cttactctcc gtatgtgcag 5401 gggacacctg gagggcctgg tgttgaaatg gaaggaagag gggtttctgg agtagaagct 5461 gggcagttag catgaagtgg gcacatggtg taatgttttt tttctttgcc tgtttctgct 5521 gctctagtgc acataaagta tcatgtcttt tgtttttgca gatggattta ccttgcaaag 5581 ataattttca aatctaatac ttagcagatg cttgggcaaa cagctctgcg ataatacatg 5641 cccccttttt ttttcctcct cttgtagaaa aattgatgga accagaacac cccgagatga 5701 gaggagaaga gcccagcaca acgaaggtga ggacaaggtg tggctccggg tccccctgac 5761 caccaccctc acaggctcag ccagccctgg agtgtggagt gacagagaga gaagccactc 5821 ctgggcaggc cacaagtgct ccagagggct ttgctggacg ctgtaaaggt agaaagtgag 5881 caacgagggg agagtacagt gtcagaagta gagggacagg gagtgtgaat tggaactggc 5941 catttgttga cccggctggt gatcttgaat tcatggccca catctaagat ggtggacttc 6001 acactcccat acggattgcc tgccccactt gaaaaatggg gctgaccggg ctcacgtggg 6061 ctgacctttc cacgtggcag ctgtcatgct cacatcccgc actcccatgc ctgcctggcc 6121 tccagaagta ggtctgttcc cccagggagc acatctgagg gagaaactgc atgaagttct 6181 gcctctggct acctaaggcc ggagcatcct gccctttcca gaaacaccag gtctcacctc 6241 aggtcatttg gccttcagca ctgtcccgcc ccgcctgtca gtagccactg cagtgggcac 6301 agcactgtgc aagtgccagg agagtgctta cacccaaaca accatcattc aacaaatatt 6361 tactgcccgt gggttctggg ctgagcggca caggccctgt gccaggtctg gagtctggcc 6421 tggagcgtca gggaggctca aaggcctgag ggcttatgtc tcatgggtct catgggacat 6481 tgatgggggg tgggctgggg tgacgcttcc ctagggagag gggagtgtgg gtcaggttac 6541 tttgtggggt gctggtttgg ttgtgggaga agttccattt agaatgcctc aggagggaag 6601 atcgggaagg cctcgaatgc ccgagtgagg ggctgaatgc tgagtcctaa aggtggtgat 6661 gggactgggg tctgtggccc aggtagttgt gttctgagca gctcctccct gagttcaggc 6721 gttcattcag cgcgtgcata ctacgaccag cccccctttt gcgtcctcag atatggcaat 6781 gcagcagatg ataggacaga caaggtctca gctctctgtg cgcgttctgg tgggagagat 6841 gggagtcaca cagccgcatc aggcagtgca ggcagtcaca ggcaggagag gaaagctggg 6901 gaagaaggtg gcgggtgatg gcggcatggt agtgtgctct gaagggctct tcgagaggtg 6961 acattgaagc 
agatccctga agcgacctga gggaaggggc ctgttgagaa cctcaggaga 7021 gtcccgggca gagggaagag ctagaacaca ggccaagccc tgagccaggg ccgccatggt 7081 gtgtctgcag aagagcagaa ttcaggaaag gaaagaggaa gttggcgcag tggagggggc 7141 tgcccaggcc agatggtgga ggaccttgga agaagtcccc agtgtggctg tctgggctcc 7201 tgtggtttta ttccagggtg atgagaagcc agcagagggg tttgggttgt ggaggctgtt 7261 ggcaagagtg gtgggaaggg gctgggaggt aggagaccat acgactgtca ggcctgagac 7321 agggcagggg ttgggggaaa tgtggggagg actccaaggc tggagaataa gggtttcgtg 7381 ctgaactccc ctccatgctt ggaagggaag gggtgcatct gagcttggtt ttgagatgca 7441 gtgagtttga ggggccccca gagcagcccc agccccacga cttacctgtg gtggggcgtc 7501 agggtggttc ttaatctccg tacctgtctt ccctctgtga aatggtggaa gcacagccgc 7561 cccgggcacc tggtggggat gcagtgagtc ttggtggcca cacagtaagg cgaagagagg 7621 actcagaacc ctcccacccc actctcaccc ccgcctctcc cttagttaat ctgactccag 7681 gcagccttga aactttctca ccctgtgaag cagtcttcat cacctgccca cttctccagc 7741 tccgtcctgt ctccgctttc ctgacttgca gagcatctgc cccaatctgc tcatcctccc 7801 agtaccctca gccatctggc tcatggctct accaaaacag ctccacttcc ggtcacccgt 7861 ggtgacccca ttgccacatc cactggcctc ctttgctgtc tgggctccaa gcatctgatg 7921 ctgcctgttt ccctctgttc ctttctggaa tcttcctttc tcctccatcc tctctgattc 7981 tcctgtgatc tttgcctttc ctaccccact gttaacagtg aaatttccag ggctgtccca 8041 gttccaagtg agtagatggg ggtgccgggg aatctgtgtt ttccatcagc tcttcaggtg 8101 atttgtcata gtggacaagc ttgagaaact gctttggctg cgagtgtttc tttttttttg 8161 agaagggggt ctcactctgg tcacccaggc tggagtacag tggcacaatc acagctcact 8221 gcagccttga cctcccaggc tcaagcgatc ctcacacctc aggcttctga gtggctggga 8281 ccacaggcag gcgtcatcac atggggtaat ttttttttat ctgtagagtg gaggtcttgc 8341 tatgctgccc aggctcgtct tgaacttctg gcctcaggca gtcctccggc ctcggctaca 8401 ggcctgagct gccacaccca gtcttgctgg tgtttcataa ggtccggccc ctcccgtcgg 8461 cctcaggtgt atgacttgtg aggcccagcg atgggctcgg aagctcttgg tcagtatccc 8521 ccaaactctg ttcccccttg taggtgttcc tcttttttct ttgctgcaga agagtgtgtg 8581 actcacaaaa agttgaggac tgttccttga tgtcctccca ggcttcgccc accacctggc 8641 tcctggtaat gcaaggaact 
gtgtttccat ccattccctt cctttcactg caactccagc 8701 cccacgtttt gaactaccta ttcgagattt ctgtctctgg acatccctgt aatgcctcaa 8761 actcaactcg cccaaaagag aatgtgccat ctcccgtctc caaaggcttt catgcaacct 8821 tcttctttgc agagcagcaa aagctacccc tgtcacagct gcccaagcca gaaactaaga 8881 ccagagccgc tttcaactct gccctcttct tcccttatgg ccacaaaccc caagtttctc 8941 ttctgtggtg atttcagatc cgcttctctc acttcttttt ccctccttcc caccacttct 9001 tggtactgca ggagcttatc ccagagtcac agactcaaat gcctgcaggg gcctgaggga 9061 gggaggtgca cttggcatct caggaagggg tgtgtggctg ggtgtggtgg ctcccgcccg 9121 gaatcccagc acttttggga ttacattttg gccaagacgg acggatcact tgagcccagg 9181 agtttgagac catcctaggc aatgtgacaa aaccctgtct ctaccaaaaa aattaaaaat 9241 tagccaggcc tggtggtaca tgcctgtaac agtcccagct acttgggaac ctgtggtggg 9301 aagatggctt gagtccggga gacagaggtt gcagtgagct gagattaccc tccagcctga 9361 gcaacagacc cagaccttgt gtcagaaaaa gataggaaag gttgtggggg gcccccagtg 9421 atctggggca tgtgtgcgct gcctgaaggg tggcccttgc acagtgcctt ccccttgctg 9481 ccctgtcaga atctagggcc actgttgcca gaaactctga gaagctagaa atcgagactt 9541 atgtgaaatc tgatttttag gtgccagatt agattgcttt ttccgtttca ttgatttcta 9601 ctcttaatgt tactaattca tgcttctact cacttggttt tttgtttgtt tgttgttttg 9661 ttttgagatg gaatctcgct ctgtcacagt ggcgcaatct cagcttacta caacctctgc 9721 ctccccggtt caagtgattc tcctgcctca gtctcccaag tagctgggac tacaggtgcc 9781 cgctgccaca cctagctagt ttttgtattt ttggtagaga cggggtttca ccatgttggc 9841 caggctggtc tcgaactcct ggcctcaggt tatccacctg cctcagcctc ccaaagtgct 9901 gggattacag gagtgagcca ctgcacccgg ccccttctac tcactttgga ttcaatttgc 9961 taggtttcat gctttagctt ccctggcaca ccccaacccc agctttttaa ggtaattctt 10021 tttttttttt tttttccaat tttctactaa aatctttatt ttacaggatt aatgtttcac 10081 agcagggatt gagatgattc ttaacctttc tctttgctta acaacattta gagttgccag 10141 ttcactccag catcacttta gctacatccc actgggatat gttgtttttg ttatcattca 10201 gtttaaagtg ttttccaatt ttccttgtga tttcttcttt gacctttctg ttatttggag 10261 gcaggttgct taatttccaa atatttgtgg gtttttcaga tatctttttg ttattgattt 10321 ttaatttagt tctgttatgg 
tcagagaaca tactctgtat gatttcagtt ccttgaaatt 10381 cgatgagact tacccgacgg gcccgaatgt gtcctagctt ggcggaagct tctggggcat 10441 tgacgaggac aggtgtctgc tgctgtcagg cagaatgtcc ccccaggtca gtgaggtcag 10501 gctggttgag tgttcaggct ttctattctg atttttgttg ttgttgttta cttgttccac 10561 caatcaatga gaaagatgtc catgtctctc actatatttg tggatttgtc catttcttcc 10621 ttcagtcctg gcaggtttcg cttctgaatt ttctcttttt tttttttttt tttttgagac 10681 agagtctcgc tctgtcaccc aagctggtgt gcaatgatgt ggtctcggct cactacaacc 10741 tccacctctt gggttcaagc aattctcgtg cttcagcctc ccaagtcgct gggattatag 10801 gcatgcactg ctatgcctgg ctaatttttg catttttagt agagccgggg tttcgccatg 10861 ttggccaggc tggtctcaaa ctcctggcct catgtgatcc accctcccaa agtgctggga 10921 ttacaggcgt gagccacccc acccagcttc tgaattttaa gtttcctatg gagtaactga 10981 agtgcttctg tggaatgaac gtggccttgg tcctaatggg tcttcctgta tttcaaatgt 11041 gccatgatgg tcttccgaag ccacaggctg gacagcatta ccgcctgcta atgtcagggc 11101 ctggattcaa gggcctggga ctctggagga ggaggagatg cggtcctgac ctgcgagcag 11161 agcacacaag gccttgacaa ccgatcttta caaccttttc ccacccctgc ataccctgaa 11221 atggttttca ggccaccatt tctgggatct tcccagctgc ctgccttaga gtctttctct 11281 cagaaaccag cccgaatgtt tcctggatct ctgaggattc atcccacccc cattcccatg 11341 cagaggccat ggctcggcgc ctggcgttcc cgaggcactt tgtaccatct tcagtgatga 11401 cacgaatgca atggggttag tgccacttgc ccacagtggt gctgcgagtg cccatggggc 11461 gcaccggggc tgcttccacc ctggacccca gcctggcgga gggcctggca catagtaggg 11521 cggggagtag gtgtgtaaga agtaagcttg ctctggagag gatgtacctg cagccggcgc 11581 ccagctctcg agaggactgt gggagggcag aggctgaaac gaggccgcct gccgagttcc 11641 tccacagacc cctgggcgag gagcgctctg ctggggactg cgccagcggt gcctccaggc 11701 ctggactctc agctccgctg ctgcttttgg ccagagttcc caaaggcggg tgtggttgtc 11761 tttggtggga aggttcagcg caccgcgata gtaatagggg agcaagcact cgctgagccc 11821 agtgcagggc ctggaccgtg ctccatagca agcactgatc ttgggctatt ggttttggag 11881 gaaagaggga gcacaaggat ggaaggggcc agggaagaga ggtcacggtg ggagaaggag 11941 agtccgcatc catcggaggg tttgtcactc aggcctgagt gggaggcact ggaggcaggg 12001 
agaggccagt cagcagcatg cggccacagc ccagagaagc cagaggggtt tgctggtggg 12061 tgggccccag ccttttcaag ggaacggcag gtctggggag ctgccgatct cggggtgtga 12121 gggtaacggg caggctgtgg cccacccctc cttctcaggc tcactggacc cctgaccggt 12181 ttcatcacca ttatccaagg ctttgaaaag accccctggc cttcctgcca catgtgccct 12241 agctataggg cttcgttccc ctccacaaat ggtgcagaca gcatgagcca ccctggcagg 12301 gggctggggg tccgtggggt aggagttggg ggtagctgag cgttctttcc tgtccccacg 12361 gtgcctggtg ctggggcttg gcagccaggg ttgggacagc ctggctttag caggtcctga 12421 gtcaggggtc tcaggctccg cagcacacag tccccaggca ggtgtcagga tgggatgtgg 12481 ccagagaaag gcatgtgctc tgtctgggga tagctgccac ccagctgaca cagttggcat 12541 gagattgtgg agttttatgg agtaaaatca caaatctggg ttttttaagt aaaaccttct 12601 atgagttgcc atctctggcc ttcaacatag gatttgggtc tttcctgggt gagctggtgc 12661 tttgtgggcc aggccctggg agctgcaagt accagttggg atctgccatt gccctgtgga 12721 aggattttaa gcaggggtga aacatggtca gacttggatt tagagatgag gaaaaaaaac 12781 aaaatcacgg tgccagagtc tctggggaga ctggagaatg cagtgcaggg gccgcaggag 12841 ccaaaaaggg gcactggtcc tgcgaagagt ggctctgtcc atccctgcct tccagtatgt 12901 gcccactgtg accagagctc catttctcaa aagaagccag tgatgccaat taccatgaga 12961 aatttcctta tttttaactg tcgattaaaa tttgtacaag tatggtgggg caggcctgca 13021 aaatgatctc caagtgcctg gcctttgatg cccgtcctaa aagcatgtgt ggcactggag 13081 ggcccacgtt tctttgttcc tcttgagaag ctggggtggg ggccggctgg gctcagggtg 13141 cggcccctca cttcccaggg ctgtggtctc ggtggggcct gcactgttcc tgggtgtcct 13201 tgggctgagc ctctgccctg tgaggacggc cgctcaggca tgatccgtca gcgaagccgt 13261 ggctgtgact ctgttttccg cagtggagcg gaggcggagg gacaagatca acaactggat 13321 cgtccagctt tcgaaaatca ttccagactg taacgcagac aacagcaaga cgggagcggt 13381 gagcaccccg gaccctcagt gtctgcggtg gtcccggccc ccgacccttg catgcagaaa 13441 gtccaacagc catggggctc gggagtcatc cctggggtgg aggccggtgg gcggtgcctg 13501 ccctaaggct cctggtcccc tcgcccccca gagtaaagga gggatcctgt ccaaggcctg 13561 cgattacatc cgggagttgc gccagaccaa ccagcgcatg caggagacct tcaaagaggc 13621 cgagcggctg cagatggaca acgagctcct gaggcagcag gtgggtgcgg 
ggcctggagc 13681 gggtcagggc ccaggagccc cagatgcaag gcgctggccc tcagctccct tgacctccgt 13741 cgtgtccgcc agatcgagga gctgaagaat gagaacgccc tgcttcgagc ccagctgcag 13801 cagcacaacc tggagatggt gggcgagggc acccggcagt gacgcccgcc accaccacgc 13861 agccgccgcc gcccacgccg gcctctgctg cccccttccc cagcccttag cacagagagg 13921 gacacatgcc cctcccccag ctgcgttttt ttatagtaga tttttaacaa aaaacgggga 13981 gaaataatgc atttctgtgg atacagtgcc caccgccctc ctccacttgg aaacggtatc 14041 ctccctgccc atccgtctgt ctgtcgccct tctcccggcc ctcgctaagc cccggcactt 14101 ctagtggtct cacctggagg caagagggag ggtacagagc ctctgccaac gtcccgctgg 14161 tgcctcctgc tctctggagg tactgagaca gggtgctgat gggaaggagg ggagcctttg 14221 ggggggccac ccggggcctg gacctatgca gggaggccac gtcccacccc acctcttgtt 14281 tctgggtccc tgctcccctt tgggggtgtg tgtgtgtgtt ttaattttct ttatggaaaa 14341 attgacaaaa aaaaaataga gagagaggta tttaactgca ataaactggc cccatgtggc 14401 ccccgccttg tctgcttgtg tgtttgtcca tctcaggagt // LOCUS HSV698D2 42013 bp DNA PRI 05-JUN-1996 DEFINITION Human DNA sequence from cosmid V698D2, between markers DXS366 and DXS87 on chromosome X contains proteolipid protein (PLP) gene and promoter region, ESTs. ACCESSION Z73964 NID g1370120 KEYWORDS PLP; X. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 42013) AUTHORS Bridgeman,A. TITLE Direct Submission JOURNAL Submitted (04-JUN-1996) Sanger Centre, Hinxton, Cambridgeshire, CB10 1RQ, UK. E-mail enquiries: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone V698D2. The true left end of clone V698D2 is at 1 in this sequence. The true right end of clone V698D2 is at 42013. V698D2 is from the human chromosome X-specific cosmid library.
FEATURES             Location/Qualifiers
     source          1..42013
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="X"
                     /clone="V698D2"
     repeat_region   32..326
                     /note="Alu repeat: matches 1..306 of consensus"
     repeat_region   772..1064
                     /partial
                     /note="Alu repeat: matches 308..1 of consensus"
     repeat_region   2236..2418
                     /partial
                     /note="Alu repeat: matches 308..90 of consensus"
     unsure          3390..3420
     repeat_region   4200..4241
                     /note="7 copies of 6 mer 88 % conserved"
     misc_feature    4823..6556
                     /note="match: X66420 PLP gene promoter region"
     unsure          5382..5434
                     /note="single clone"
     CDS             join(6557..6560,15144..15330,16027..16288,17360..17528,
                     17999..18072,18895..18960,20088..20159)
                     /codon_start=1
                     /product="PLP gene"
                     /db_xref="PID:e246993"
                     /db_xref="PID:g1370121"
                     /translation="MGLLECCARCLVGAPFASLVATGLCFFGVALFCGCGHEALTGTE
                     KLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGFYTTGAVRQIFG
                     DYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLERVCHCLGKWLGHPDKFVGITYALT
                     VVWLLVFACSAVPVYIYFNTWTTCQSIAFPSKTSASIGSLCADARMYGVLPWNAFPGK
                     VCGSNLLSICKTAEFQMTFHLFIAAFVGAAATLVSLLTFMIAATYNFAVLKLMGRGTK
                     F"
     repeat_region   7129..7164
                     /note="18 copies of 2 mer 81 % conserved"
     repeat_region   7630..7713
                     /note="MIR element fragment"
     repeat_region   9989..10038
                     /note="25 copies of 2 mer 84 % conserved"
     repeat_region   9989..10030
                     /note="7 copies of 6 mer 88 % conserved"
     unsure          12619..12807
                     /note="single clone"
     unsure          13102..13139
                     /note="poor double stranding"
     repeat_region   13496..13529
                     /note="17 copies of 2 mer 88 % conserved"
     repeat_region   13496..13531
                     /note="6 copies of 6 mer 86 % conserved"
     repeat_region   14645..14938
                     /partial
                     /note="Alu repeat: matches 308..1 of consensus"
     misc_feature    20086..22190
                     /note="match: multiple ESTs"
     misc_feature    20160..20560
                     /note="match: STS G13642"
     misc_feature    21264..21703
                     /note="match: STS R66465"
     repeat_region   25274..25371
                     /note="2 copies of 49 mer 96 % conserved"
     unsure          25691..25858
                     /note="poor double stranding"
     repeat_region   25763..26048
                     /note="Alu repeat: matches 1..308 of consensus"
     repeat_region   26395..26606
                     /partial
                     /note="Alu repeat: matches 97..308 of consensus"
     repeat_region   26666..26889
                     /partial
                     /note="Alu repeat: matches 308..72 of consensus"
     repeat_region   26890..26946
                     /partial
                     /note="Alu repeat: matches 57..1 of consensus"
     repeat_region   27099..27192
                     /note="2 copies of 47 mer 89 % conserved"
     unsure          27402..27409
     repeat_region   27542..27611
                     /note="MER5A element fragment"
     repeat_region   28221..28509
                     /partial
                     /note="Alu repeat: matches 308..1 of consensus"
     repeat_region   31746..31871
                     /note="63 copies of 2 mer 83 % conserved"
     repeat_region   31749..31868
                     /note="20 copies of 6 mer 83 % conserved"
     repeat_region   32327..33070
                     /note="L1 element fragment"
     repeat_region   33158..33446
                     /partial
                     /note="Alu repeat: matches 308..1 of consensus"
     repeat_region   33456..34028
                     /note="L1 element fragment"
     repeat_region   34125..34205
                     /note="L1 element fragment"
     repeat_region   34323..34747
                     /note="L1 element fragment"
     repeat_region   34866..35101
                     /note="L1 element fragment"
     repeat_region   35226..35300
                     /note="L1 element fragment"
     repeat_region   35416..35481
                     /note="L1 element fragment"
     repeat_region   35524..35693
                     /partial
                     /note="Alu repeat: matches 304..124 of consensus"
     repeat_region   36058..36210
                     /note="L1 element fragment"
     repeat_region   36325..36420
                     /note="L1 element fragment"
     repeat_region   37761..37883
                     /note="MIR element fragment"
     repeat_region   38572..38703
                     /note="MLT1A element fragment"
     repeat_region   38722..38753
                     /note="16 copies of 2 mer 91 % conserved"
     repeat_region   38724..38753
                     /note="5 copies of 6 mer 93 % conserved"
     repeat_region   38833..38880
                     /note="MLT1A element fragment"
     repeat_region   38883..39169
                     /partial
                     /note="Alu repeat: matches 306..1 of consensus"
     repeat_region   39368..39499
                     /partial
                     /note="Alu repeat: matches 148..1 of consensus"
     repeat_region   40759..40800
                     /note="2 copies of 21 mer 100 % conserved"
     repeat_region   41384..41566
                     /note="MLT1A element fragment"
     repeat_region   41588..41725
                     /partial
                     /note="Alu repeat: matches 293..155 of consensus"
     repeat_region   41733..41857
                     /note="MLT1A element fragment"
BASE COUNT 11651 a 8770 c 8999 g 12585 t 8 others ORIGIN 1 gatcaggagg ttgtttagag aatgtaatct gggcctgcat ggtgactcac gcctgtaatc 61 ccagcatttt ggaaggccga ggcgggcaga ttgcttgagc tcaggagttt gagaccagcc 121 tgggcaacat gacgaaaact catctctaca aaaatacaaa aaattagcca agcgtggtgg 181 cttgtgcctg tagtctcagc cacttgggag gctggggtgg gaggatcaca tgggatccgt 241 aggcagaggt tgcagtgagc tgagatcacg ccactgcact ccagcctagg tgacagagtg 301 agaccctgtc ttggggaaaa aaaaaaaaga ggttgtaatt tgtagtagga ctagaataag 361 tctgttctgc tcattatgtt gcaggatgag ccttttcaga ttctgagccc tttcagaatt 421 ctcagaaatc ttgaggaaga cactggactt ccctggtaag gggtaacatg gaacttgaac 481 caattatatt tgaactggaa tggacaggat gacactctgc tgacatttgg cttacttgtg 541 cacttagggc agggcaagca cagcataggt gctcaatgtt agttttgtct gaattttcat 601 tttgttaacg cctgtaatat tccttccaga ggaggctgta agctctctca gaacaagggt 661 tgcatccgga aaccatctca tttctctgtt ttcccctcga tgtctggcaa agtgctagat 721 acaggataag aagagaaaag aaaagacaaa ggtctgcaac taagtaattt tttttttttt 781 tgagagaggg tctcactttt ttgcccaggc tggagtgcag tggcacaatg agggctcact 841 gcaaccttga cctctcgggc tcaagtgatc ctcccacatc agcttctgag tagctgggac 901 tacaggcatg caccatgacg cccagctaat ctttaaattt tttgtagaga cgggggtctc 961 actatgttgc ccaggttggt cttgaactcc tggcctcaag tgatcctcct cccttaggct 1021 tcccaaagtg ttgggattac aggtgtgagc cactgtgcct ggtggcctct aagtattaac 1081 agggaattgc catttactgt gctatgtccc aggctgaggg gggatatgaa cccaatttgg 1141 ctacctacaa agcctaaact cctaaccatt atggattatg tatcctgctt caggacccat 1201 ctcatcttgc cttgccccag atgatcctcc ccagaacaaa gcatgactgg ggtgcccagg 1261 gcaggctcac aagatgaaga gattgagggc tgagatgcag agaagaaggt gtgtccatag 1321 ggaacatgat ttcctgtgtg agatttccct agccattctc gtggctcaca ccccaaaagc 1381 cagcttgtct tgccacagca gactgccctt gcctgtaggg ctgccggatt cactaataaa 1441 aatacaaggc attcagttaa attagacttt cagacaaaca atgagcaatt tctcagtata 1501 agtatgaccc atccagtatt tcggatatac ttaaatgatt attcattgtt tatctgaaag 1561 tcaaatttaa ttgagttgag catcatgtat tttatctgac aaccctactt gcctgggcgt 1621 gatatttcaa agagcagagg aatggggggc 
tttggggact tcccatttag tgatttcagc 1681 aaggggaacc taacacctgc ttttgttgtt ggcagccctt gaggcaactg cttcaagcac 1741 aatgcttaag gcagtgttcc agtgctggtg taggctgacc aacttgtttt ggttttccca 1801 ggccgtaccc atttttagca ctgaaagtcc cacatcttgg gatctcttcg ttcctggcaa 1861 accgggatga ttggtgattg gtcaccctag tgatagacat acccagaggt gctggtggag 1921 atgttagaag acaacaaata cacaaagaaa ctgctaggtg tgtctcaaag agtctcaaca 1981 tttcttcaaa gcacatatcc ttggctccat tttaaagata ggcaggttca ggggaagtat 2041 aggaatcctg gatctgaaag gtgttggttc tagagatcac taaagttcag cctctatttt 2101 ttaaatgctt atcaaacatt tgcagagata ctctgtcacc ccatctggtg attattatta 2161 ttattgttat tatacttgag cataggcact atggtaagaa ttttatgttc agtatcttgt 2221 gtaatctttt tttttttttt tttttgagat ctgtcggagc gcagtggcac catctcgcct 2281 cactgtaacc tctgccttcc aggttcaagc acttcttgtg cctcaacctc ccgagcagct 2341 ggaattacag gcatgcacca ccatgcctgg ctaattttta tagttttagt agagacaggg 2401 tttcaccatg ttgactagtt actgactaat ttggaacccc actgatccta caaactgccg 2461 gaagtttgcc tctgccctgg gtttcaaata gcggcctcct acttaacctg agaagtgtag 2521 aaggttgaaa acttgggcta cagcagcccg aagcctgtcc atcctccccc agcccccggc 2581 tgttgttgct cagggcaaag gctctctatt cacacatttt ttcacagcag aacaacctcc 2641 caagcaactt ccaagcaata agagtctgga gacaggcatt ctctagagat ggctctggtc 2701 tttgcttgct aggtgccatt gggaaaattg tctcagtatc ccatcagtca aattgaacaa 2761 ataacattgg ttccccagaa ataatactag ttttatttct actttgctca gaaatcacat 2821 caaacaagca tcctccctgc aggggcaacc ctgcaaaagc tggccaagtt tttatgttgg 2881 tggaggtggg ttctgtggac agcttgtttc ttcacattgc tagtagctat gtggtgctcc 2941 tcaggcattc tttagctgag atttttcatg tgattatttt ggcttccagg ctactcttta 3001 ctccatccca agccacgggc atctccccct cacaaataca ccccatggcc tgcccctatc 3061 catcctccca ttgttccttg aaaccttcag gaaaggcaga agatcaccct ccactctcta 3121 cccctgtctg ccagctcaaa gtatgtgaca tctctggccc tcagccctcg gttctcttgg 3181 ccagaaaggc atagtctagc caatcggcat gaagttgggt gtggaaagtt ctagccacag 3241 tacataggac ataatatact gcctgcctgc cagccagcca gtcttctatg ccagggctag 3301 gaaagctgcc ttccaaaatt tggtctggga agacattatc 
ctgggaatga gaagcaaagc 3361 ctcagctcta gaagcccaaa taatagcaga tgacccaggc tttcactggt acgcgcttct 3421 ctcttcccct ttatgttctt ctctgtgctc agggagaagt tggccactag attgggcact 3481 tgatccctta tctagttttt cctgggagaa attcactgga cttctctgag ttttagagtt 3541 cagtctacac aatgaagcaa ggagaaccca cttgatctgt ggccttcaat aatctgatac 3601 tatgggatca tagagcagag agtggaggta atattcaagt gttttcgagt tatttgcaag 3661 aaagctgcac agtcaaacca agccatgatc aagggggccc agattctctg aggtaccatc 3721 agaattgcat gaagcaaact gtttacgagt tttatacaat cctgcattgt tcccacaggc 3781 acccagtaca ggcagcaggc tgcaccaaga aggctggcag tgcctcttga ggaatccctt 3841 ggcagtgcag gagcctagct ggactcaaga atattttgcc aagggcaccc agcagaacca 3901 atggaagcag aatcacagaa ataggtgggg gtatgtagat aaaaatgggc tctggaggac 3961 ctggaagata ccctatgagg tctgtaggga gccctcagga aaggtctgct gacgaggtac 4021 tgaggcagag tgtcgggatg tacagagcaa tgagaacaca ctagattcta gggagttggg 4081 aaccttaagg cactgggatg agtgagacac ggtcacctgg ctgggtccct tttctggccc 4141 tgggcataag aacaagcata agaacaaggc gaagacaggg aatgtggaga gagaaaaagg 4201 aaacagagac agacacatag agagaaacag agacagagac aggtaacaga gacaaagaga 4261 ttgagacttg gggagaggat agaatgaata ttaaggcaat ggagatggaa gaaaggagaa 4321 gagaaatgtg tgttaaggga atgtggttga taaattggta tctgtgggtt tattgggaac 4381 acatattttc tttccagtag agttgctgtc cccagagctg ctgctgagaa ctcaggggaa 4441 gctttgtgag agggctcagt ctgagtgatg tcccctgtgt ggagagctgt ctcatctatt 4501 cgagctgtga aggatcattt gcctcatcct ctaaatctga actcccaaag tttgggctca 4561 ctcctgccat ctgatgagct gaggagacac atttggcgat ttcttgctct tggtcctccc 4621 ccttctccag ggccattccg acagtgacca tccagaagtt tattgctggc tcaatcctac 4681 agttgagctt ccaaagcagg catagtctgt ggctggcgag cagggctggg gagggcgagg 4741 ggctctagga cccttttcca tcagtcacat ggccttagtc tcgtctgctc tggaaagcta 4801 ctattatacc gttttgcaag gggcagcatt tccagagatc cttttttctt ggggctgata 4861 caagagcaaa ggatctagag ttctagtact ctaagcaagc ctcaaacggt gcaggatttg 4921 gccccagtgg gcccacaggg gcatctgcca aagactcgtc catttcctaa cagcagagcc 4981 caagccagta acatgtccaa agtcacagcc caaagagaaa actgtaagac 
acaatcttgc 5041 ctttctccca ccccaggaag gtactagtct ctgctcagta tctccctcct cttctctcct 5101 ctcccaaggc aaaatcacag gaaaagttcc aggagaccat accttccttc tccggagtct 5161 tccaacaggt gcccctctta ttatatgcca gccagttgtt ttaaaggcca tccatgggag 5221 gacaccggcc tctaggtcaa gggagctgtg aagggagaca ggactgacaa ggagcccaga 5281 cagacactgt ttaagattca cctgttctag aaaaccttcc caggctgatc ccatctagca 5341 gtgggcattt aaataccttc acagtcttta gaaatagctt attgctaagg caggttttat 5401 atagtatttc cctgtttttg ctgcttagcc atttttgtct taccgcttct ttctctgtgg 5461 caaggactat atttcgtttc ttctctccac cctcactgcc acctgctgct tcactctaac 5521 ccactccctg cctcttgtct gcctggctta agctctctga tgaagatatt acactctcag 5581 gatagaatac tgactacctg attcccagat cgtgttcttg acttgtctcg aagtcgatta 5641 tccttcttgg aaatcaccca catcttcaac gctggccttt tctttaacct agtctgtgaa 5701 cttgcatagg tcttgcccac ttctgggtct ttattattgt agctataaaa ttattacaat 5761 tctagcctcc cccagcctaa tcacaatctg ttcatcgaat tgacagcctg cattttgtct 5821 tcatgcagtt gaaactgaga aggatttgga ggaaattatc ttaaaagaca attttttctt 5881 ctctattgtg ttcgtagagc tgggggtggg ggtgggggag gaacgcttat tttccaagga 5941 atcgggaggg gaaaggtgga gagggccaag ggcagctagg agtgatgtgg ggagctggag 6001 cagattcgga aggactttgg gagctaatat ctaggttttt gactctgagc ccctgttggg 6061 gctctcactt catggcttct cacgcttgtg ctgcatatcc cacaccaatt agacccaagg 6121 atcagttgga agtttccagg acatcttcat tttatttcca ccctcaatcc acatttccag 6181 atgtctctgc agcaaagcga aattccaggc aagccttagg gaaaaaagga aaaacaaaga 6241 aaatgaaaca attggcagtg aaaggcagaa agagaagatg gagcccttag agaagggagt 6301 atccctgagt aggtggggaa aaggggagga gaagggagga ggagaggagg aggaaagcag 6361 gcctgtccct ttaagggggt tggctgtcaa tcagaaagcc cttttcattg caggagaaga 6421 ggacaaagat actcagagag aaaaagtaaa agaccgaaga aggaggctgg agagaccagg 6481 atccttccag ctgaacaaag tcagccacaa agcagactag ccagccggct acaattggag 6541 tcagagtccc aaagacatgg gtaagtttca aaaactttag cattgaagat tcaagaggac 6601 acaggaattc acaagagaat ttccaacttt ggggttcggg ggttccatgg tttaaatgag 6661 tcgtgttttg gcacttgttt tctttttaaa ttccttaaca ttcagttaga ttaactcctt 
6721 ctccgctgta ctaagcatga tgctttatta ccaaagaatt ccagcagcac gtggagagtc 6781 ccacagatct ctggagagag tgcccagctg gatggatgtg gtcatgtgtc ctcctacccc 6841 ctcacttccc tctatcttct cttttccctt ggttccaagg ccctggcaac ggttgggtca 6901 gcttgccctc ctggttctgc tggaggccac tgagcactgt ctagccctca gatgtacaaa 6961 ataactgctt gcttgatgca aactttgaac catgtttggc attaatagga agctggggat 7021 cgggcaagct ggactcaggc ttggctatgg ctgaggaggc cattctcagg ttcaggaaat 7081 gaaagtaaag tcctcaatgg cttgaccatg ggaaacagat atgggtagga gtgtgtgtgt 7141 gtatgtgcat atgtgcatgt gtgtgcacat atagggttga tcatttctag caaatatcac 7201 agacacagct ttcctgaagg ctctttgctg tgacaaggag gatgctggga cattagggct 7261 cttgctaagt tggctgagta ttgagggctt ttctaggttc cgtgaacaaa gacagaagct 7321 atgaagttct ggtggtagtt tctactcaga gtaatgtgtg cagagcacaa gggtctttgt 7381 ccctattgga ctccacagag tcctgtagac attgctaaca tttggcaaaa caaagatagc 7441 agcttaaaca catcctaaat gaaatgagac tcagttgccc catgtattcc ctatcaaatg 7501 tagcatccag tagacaacat ccacagagat caatttgcaa ttatactgag ctggttcagc 7561 ctttaggttg tttcatcatg aaacacagag agaacacttt gaagcatgtg ttcaggtaaa 7621 acaggaggga gagcactgga cagggagtcg gaaaaatcag attcaaatcc cagatctgct 7681 acttatgaac tatgtgacct tggacaggtc actgaatcac cacttcctca tctggaaaat 7741 ggggacagca ataactactt tgtctacttt cagggttgtg agaatcaagt aacaataatg 7801 tcaaagtcct ttgaaaatgg gcaagatcta taacaactgt aaatcatgat aatacctact 7861 attagacaat aatgtgtata caggcgaact cttcctcttt ccagggtggt tgggttttct 7921 ttgcgtgctt gggtcagaat catttgacca attatgtgtg accagatgag aggtgaatgt 7981 tggaggaact caattcctgc cctgggtaac atcaggaatc ctatctaatg tcagccgagc 8041 tgactggata ggttgtgcat taaaaacaga gagggagtaa agaaggatgt tgaatattac 8101 ttcctcaggg gatgaaatcc ctgaggaaag gcaggaacaa ataaagtgac cacagggaca 8161 attgcctgtt atggaggctg ggtggtttcc tccagaggaa gagtgccaca gataagggca 8221 cgattgagga tgcacattgg gtcccaggtc ctaagggtgg acgatacttg ggccaggtac 8281 atatgatgga gcagccactg agttttcatt tattcaataa cagagtatct aaagtacttt 8341 gaattataga tcatctctaa caaggtactg gactgccact acttctgatg tggagtgcac 8401 
tgctgttcaa caattgcagt aaacacattg tatcatggaa tccgggattg gaaggaactt 8461 tagaagttgt ctgagccaac cttcaatgcg ggattcattt ttacaacaac caattaaatg 8521 ttcgtacagc ctctgcttaa tcacctccag gaacagggaa cttactactt caaccatata 8581 gaatccctta ggattggaag caaggaggaa cactgattca gtagtgtctg cacaaaagat 8641 ttatggggaa aatgatagct cattaatttt tgccttgatt tggaagggga atgttccaag 8701 tgagtctttg ggataatgat ggaaccagaa aggagcaaaa gagcagtagc tgattattct 8761 ctgctctgaa acacaatgcc attattgaac agcaaaacat tgtctgcaag cctgttgtcg 8821 tctgagaaca gtggctgtta ccctggtaac cagactgtga tgcctccatc caaaagtggc 8881 tgtttgccaa gcagtgtagt tttggctggc ttgcagtaag tgggtgtagc ccaggcaaaa 8941 agtgtggcgg cagagcgggg gtccagtttt ataaaatgcc cgaagactaa ctgagtccca 9001 tcaaaatggc aagaatgccc caagaatacc aggagccaaa aataaaccag atcccagcca 9061 gcatgttcac ctagtcttcc tggagacact gactaaaata atcaagtgca gccttctcta 9121 cctcacttct ccctttggcc gtgcagagcc agaggagttc ttggaaagcc aggcagtttg 9181 aaaggtctac cctgctttag ggggccttgg gcttggctct gagctaggaa agccagaagt 9241 ttacttaaca ggctgacaac tggaactgct ctctcagagc gatctgagta ataacaacct 9301 caaccctggg aggtgggtgg gaggggaaat atagtctcca aacccattaa ggtagaactc 9361 cagccagtaa catcactttc ccccattgct tttgtctatc gtcctgaagc cttgaaagcc 9421 atcccctggg tacaggtctt tgtcttgaaa gaaagcaaca gtagaaatgc cttccctgcc 9481 cacagcactt caggtctaaa gaagaccaag tgaggacagt gacatgatca ggaagggtca 9541 cagcagccat ggtaggtgaa tggagcagct tttccttcct ttcctgcttt cagagcctac 9601 tcagtgccaa acttacatca aaagctcagc aagtactgat tgattaatcc ctcttgccta 9661 cggcaaaatg gccctgaggg ctatctctca gctctgcact ggcctctgag ctgtctgcca 9721 ggcagggctc tagaaaagag tcccaaggaa gagaataagg ttccccaaat cagcttcaca 9781 ctgcaacaag gagagaggag gaaatggtgg gagactgacc tgaggaacaa ggtatggcta 9841 ctattgtctc tagtaccagt tacttccctt tcatcttccc attcgtgggc aaggtggaga 9901 caatgtgcca aggataagtt ttaacaatca aagtcctacc tcagcttccc aatgcttgca 9961 cataaattgg aatgtgtaca tcattcattc tgtctctctc tgtgtgtgtg tgtgtgtgtg 10021 tgtgtgtgtg cgcgtctgaa gaggagtggg gagtatagga aacaagatag tccagatgct 10081 
gttgccgtgg taatactaaa tcgtgtccaa agaggaggct gagattcctt tttagttaac 10141 cttgtccaca tgatttgact atccagcagg cttggattcc caggccccag agacttcggg 10201 actgttttcc agcacttgaa tgtggtatat aagtgctgat atcaaaaggg atatgtagga 10261 aagacaagat tataggcagg gcattgtctg accaggacga ggcccgaagc aaaagtcttt 10321 gtttgtctct ggcacctctt gataattgtt gttttactcc tagaacccac ccactgatcc 10381 ccatccatcc ctagtaccag tcacggctgc tctagcattt cggccttgtc aactactgca 10441 ggccctgatc aaggaggcct ctgttcatgg ggtaaggtta aggggtgggt tctgggagga 10501 gggtaagact gtagtgcaga ttcgagacct agggaggtgg ggccagtttc tcactcaatg 10561 agcctgcctg ttgtttagct ctattgtata gggttcttct atctctccct ttgtcagctg 10621 gattcaaggg acaggctggg aattgtgcag ggtacaattg gtctctttct accattcctt 10681 tcgatgaaag cccttcctct ccttcctttg atgagttccc agattcacac tgaatttcca 10741 tcatggagct ggtccttgag gcgggcccag ggcatgggaa aggagggagg gagggttggg 10801 agctcttggt gcctgcttaa agagacagca tcaggatcaa cacagtccct tgatgaacct 10861 ggatctgtgt aagtcacatt ggttctgttg atctctgtca gaccgctgtg aagagcgctg 10921 gtgagaaggg cttaaagttg gagtatcaac tatcacagat tgattagact tgttttggga 10981 gtttgtatac aaatgtagcc aaccatgaca tgatcttttc cctttgctta cattttgccc 11041 acacaacttc gttatatcca ttttaacact tgcttatagg tagctttaga ttttgattca 11101 cagacatatg tttgttcacc ccaacaggat ttcaagtttt ctaaagaaag ggactggcct 11161 tttaattccc ttctgtttct cctaatgtct atgtcactgc tagtgtgctt aattcttgta 11221 ctgaattcac ataaacagaa atgaaagaaa aacactggag ggggtcatta tttttctcta 11281 tgacctttca tcaaagtcag gtggcttaat ctgactgagt tgcattgaga aatgttatta 11341 ctatatggtt cttttatgcc aactctctca tctcgcagtc tgtacttaga ctggaaatgt 11401 gagaatgtct cttgagagag ccaaggaact tctcccctag aatctcttcc agccttcttg 11461 aatcaaactc acaccctttc aaccaggagg ggttctgaaa tggagagctt ttggctttag 11521 gctattgttc acacataccc aagggacatt tcttggactg tgaaaaatca aagggagccc 11581 aatggatctt ttctcagaat gctttgattg tccaactact ttgatgaaaa aataatttct 11641 tgcaggctct gagtttttct cagttagaat ctaaaatctg attcttgctt ggacagactg 11701 actgagattc ttcacttctg ctgaacttaa atgcccccaa agcaaacatt 
tcatctaggt 11761 tgtgaagaga tgaacatctg atgcttatgc tgatattact gaattgttta tagcttaata 11821 gtgatagatt tatctaaagt catttatttc agactttaca ttagaaactc attaaagcct 11881 acttctttct tgtcccttgc tttgagctgt agtggagcag aaaaggggat gctaagttat 11941 tcaagttcta agaaaataga ggaaaaatgg atgtaaaaag aaaggctatg atctggtgac 12001 tcagaattgc tcctaaggct tagccattta tcacccagta agaacacaat ccatgacact 12061 ctcaggttat ttggatgatg attgagggat cagatacact caagaggatg tttccctctc 12121 atctttgcaa ccctagaggt ggcatcacct aacgcttaaa gtggagttct cttttggcta 12181 gcaagagttc aagttcctcc atccctgacc atacctgcac aagcacttag cccttcccag 12241 atgaaaatgc tgggatgagt gagtaggccc aagatggtca cttgctaggg tggatggctc 12301 ttcttaccct cggctaaaca aacagtcctg ccattgttcc ctgagtctga ccccctcccc 12361 accccccggc tgcaggcccc tgtggaatac caatcaggct cctgagatct caggaaagaa 12421 caaggcttct ttgtccgggg gagctatgga gggctctgtc caagccctga ccctgtgctg 12481 gggagaaagg gaacagtggg gtcactgacc ttcttctccc cccatcccca ggcttcagag 12541 actttcttta ctagaaaagt ctaagagttt gggggtgggg aggagttgga taggcagaga 12601 aggaaaatgg cagtactgtt tacttctaag ctccttttct tgacttatag gctgaatcta 12661 gtatcttaac ctgttgagaa agaggaagat caatttctgg aatgttttct gagaacaaca 12721 atcgatttca gttgttgtgt ttttctctgt ggtatacatg tgtgtatgtg cacatttgta 12781 cacatgcatg catgtgcgtg tgcacacaca cagacacacc catgggccag ctctgaaaag 12841 acactcttcc caagcacgtt tgcaaagttc atgctgagta aactcagcct ccactcacag 12901 cagcacagct acagattgct tcttatagag gccaagtttc ctggcagcaa tgctttagtt 12961 cttggaaaag gaagttctgc tccaaatgtt agctgcttac tttgtttggc ttgggcttct 13021 gaggctcatc tttggtgtga cactctttga aggccaagaa agggctgtca tttcagggga 13081 tggtgggtaa taggactcac tccttttgtg tctctgagaa cagaggcaaa tgcctctacc 13141 tcgattttcc catgacagag ataatttgat ctttccaaat ctgttcacac tgatgcaatt 13201 cagccatgta aatcactagg tattcatgct ctctgggcta aaaacaaact gtaactaatg 13261 ttaacagaat caggaaaagg agtccccttt ccctggtacc ctatagaaaa gtgagtgtgg 13321 cttggagaaa tgcttgaata caaaaatcca agcctaagta attaatacaa tttattttta 13381 ttgtacaagc tgtcacaacc tgcttattca 
aaaatagcag cttcccagta catttgctga 13441 gtatttgtgt gtgtgcatgt gtatactgtg tttatagata tataaaatca gtgggtatat 13501 atacatatat ttatatatat acatacatat gaattaagac attttttctg aagagcttaa 13561 cattctcatt tcttctacat tttagggatc tggccgagag gccagaggaa gccattcttt 13621 tcctcatcta gtcagtcatc tacttggcat ttgctaaggt tcactccata tgacatctta 13681 gagcatggga ctagtacttg gaacacagct gctttcagag cctgtgactt cttgtgtgcc 13741 tctcctgttt ctcagcaaca ctggcatagg gcctgggata ccaggtctgg ggatctcagg 13801 gactcttagc actttaagac acatgtgttc ccaggccctg gtgtgttcct ctagtgccag 13861 aaagatgttt catgctttgc tgactttgta taaagtctgt ttgtagctgt tttgacagaa 13921 tctcagcgta taactgaggg tggggacatt agccaagctg cattatagga ggacaaaact 13981 gccatacaaa gtgtccaaaa tcattaagcc tgcattttta ttattgggag taatatcaaa 14041 cctcctattt tccaattttc atttcttgtc ctgtgctagc tccatcctgt ttggactgct 14101 cctcccatat gtaaactaag aagaatcaag cattctttgc aacaaataca cacgatgctc 14161 aaaaatgtcc aggagcatcc aatttccaaa gtttcctcca cctggaatgc tcttcatgct 14221 aaaatcctgt ctgacaatac cagcatctct ggcctgcact catcccttcc tggaactcca 14281 agtgcattta ccctctgtta ccacttactt ggctgcctga attgttagtt gaaaatatta 14341 ggtctactta gctaattctt cctcaggaaa ttaaagactc ccatatggca gagtctgtgt 14401 cttttctctc ttcatatccc gtataacacc cagcataatg ctgggcatat agtgagtatt 14461 ccataaatag ttgatgaatg actaaaataa gcaagcaaac aaacagacta gaacaataag 14521 aaagaaggga ctgatttcat aatctctctg gcttgctatt tgaattgctg aattattatt 14581 atttattaaa tattttttaa attctggcaa taaaaggtaa ggatttattt tctttctttc 14641 tttttttttt tcttgagaca gagtctcgct cttactgccc aggctggagt acaatggcgc 14701 aatcttggct cacggcaacc tccgcctcct cctgggttta acagattctc ctgtctcagc 14761 ctcctgagta gctgggatta caggcatacg cccatgcccg gctaattttt gtatttttag 14821 tagagacggg gttttgccat gttggccagg ctggtcttga actcctgacc tcatgtgatc 14881 cacctgcctc agcctcccaa agtgctggga ttacaggcat gagccaccgt gcccggccaa 14941 agatttattt tcaagaatga aacaaagtaa ggattctggg tcaatctcac atgctgaaag 15001 ccaaaacctc tagccgctcc tgctttttga cttcggagtg cccactatct ccgagcctgt 15061 gagcacaggg 
cctggcagag gggtttgagt ggcatgagct acctactgga tgtgcctgac 15121 tgtttcccct tcttcttccc caggcttgtt agagtgctgt gcaagatgtc tggtaggggc 15181 cccctttgct tccctggtgg ccactggatt gtgtttcttt ggggtggcac tgttctgtgg 15241 ctgtggacat gaagccctca ctggcacaga aaagctaatt gagacctatt tctccaaaaa 15301 ctaccaagac tatgagtatc tcatcaatgt gtaagtacct gccctcccac acagacccat 15361 cttttttttc cctctctcca tcctggagat agagaactct tcagtacctt agtaactagc 15421 aggggactgg ggtggagcca gaccggattc ccgagtcttc cctctgtgca gatctctcct 15481 gctccacatt cgaagcccat tcagaaagga ggggtctgcc tgcctgcctg tcagcccctc 15541 ccacccccgc ccccttgttt tcttacacgt gttctgactt ctgctaggtg tggttcatat 15601 tgcccaagtt ggagcctcca gcgtagtagg tatggagaag ccaagggagg cactaagcct 15661 ctcctgttcc tagaacaagt taggctcctg ttccttcacc cacctttctt cttgtctagc 15721 tccctactcc tgtgagttag catgtctgaa gggtggggca gggagagtag gtccctggtt 15781 ctccaggtcc cagggtaagc aaggtgtggc gggaggggca tatgtttctg gtcacataca 15841 ccctgtcttc acttatttca acaaagataa ggtgaaagca tgcaggagga acctggtgat 15901 tcctctagaa aatccctagc cttgttaagg tgctcgctct ggtgtatacc tcacttatgt 15961 cgggaaagaa gccaggtctt caattaataa gattccctgg tctcgtttgt ctacctgtta 16021 atgcaggatc catgccttcc agtatgtcat ctatggaact gcctctttct tcttccttta 16081 tggggccctc ctgctggctg agggcttcta caccaccggc gcagtcaggc agatctttgg 16141 cgactacaag accaccatct gcggcaaggg cctgagcgca acggtaacag ggggccagaa 16201 ggggaggggt tccagaggcc aacatcaagc tcattctttg gagcgggtgt gtcattgttt 16261 gggaaaatgg ctaggacatc ccgacaaggt gatcatcctc aggattttgt ggcaataaca 16321 aggggtgggg gaaaattggg cgcgagtctg tggcctcgtc cccacccaag gctgggtcct 16381 ctctaggggc ctggcatttg agtgaggaag cgatggctgc agccgaacga gaaggtcagg 16441 aagaacgtgg tgcccagctg gcttagcctc acctttcaaa ggttccctaa gcaaatttct 16501 tctcaaaaca gaaagcatga gttttgtggg atgctttgta caatcagacc atttctaagc 16561 catctgttgg tatccctttg ttcccttcct agtaggtacc acaagagtgg atctaactgg 16621 acaagagtct aaaatgctgc tcatgtgatt gagacttggg cacctgatct gagagggagg 16681 atggataata aaaattaaat aattaactcc aaggtgaaat ttacaatgtt ctgggatcct 
16741 gcacagttcg aggtcccaga gggaattgga gtagatctgc atgttgaatg ttttattgcc 16801 tgagggttat ccatgctttg agtgaggtgc acatttgttt tcagtctctg gagttaaaga 16861 tcctttggcg gctgaacatg tgtgggttcc agcaatgtct ttttgtggcc agcagttagc 16921 ttagaaaggt tttgtatctg aaattttaat ctcctattgg ctttgttcaa tggctaggga 16981 acaaaaatgt tcctatggca aggaacatgt tctaagctca gcctaaggca caaaatggca 17041 cgactttctt tcaagatctg ttttgatttc tttacacctt atctgcccaa aaacatcctc 17101 tgaagcctct ctaacccagg gatcctcctc actcttcccc tacccattcc ccccaccctc 17161 cgttatactg gggccagtta tctagtagat actgccaatt acccttggca gaggtgccct 17221 gctcactaat ttcatttgaa ggagagccct ggaacctggt tttaatgtct ggcacacgcc 17281 actccaggat ctcccagttt gtgtttctac atctgcaggc tgatgctgat ttctaaccac 17341 cccatgtcaa tcattttagt ttgtgggcat cacctatgcc ctgaccgttg tgtggctcct 17401 ggtgtttgcc tgctctgctg tgcctgtgta catttacttc aacacctgga ccacctgcca 17461 gtctattgcc ttccccagca agacctctgc cagtataggc agtctctgtg ctgatgccag 17521 aatgtatggt gagttagggt acgggtgctt tggctctcct acccactatg gaagcactat 17581 atatttggtt attttcttag tgtaaggagg gtggtgatta tgagaaaaat ataagatgat 17641 gaatgattgg gtcttagttt attaatcctt ccctactgaa accagagagg tttcttcccc 17701 gggaagggaa cttggaagtg gtgggagttt tcttggccat tcacattggc ctactctagt 17761 tgactgctgt tcacaacccc aaagcagcac atttcaataa caaacacaag gtttcaccac 17821 tgttcaatac caccttctct tttttgtaaa cctgtagaaa agaggatcct gattgttggt 17881 agaatccaac tttacagcca ggataattag agatggaaga agggctctgg gggaaagtct 17941 ccatgtggcc ccgtaactcc ataaagctta ccctgcttgc tttttgtgtc ttacttaggt 18001 gttctcccat ggaatgcttt ccctggcaag gtttgtggct ccaaccttct gtccatctgc 18061 aaaacagctg aggtgagtgg gttatttggg ttattttaca agggagtagc taataccata 18121 caaattacac ccatggcctt caattttaag gactgaaagt ttccctttgc tggattttga 18181 attagccgat tgccttctac aacatgttgg ctaagtgtgc ctgagccaat gagcatagaa 18241 ggtaaaacac ctgttttctc tagagttgca tagaaagact tcctttccaa cccttcccct 18301 cttaaaagag aagtttgtag ctttaggtaa agatagagcc taatggcaaa atcccccaca 18361 tgaaaatttt cattccttta gttaaaaacc atatcaagaa 
aatgggctgt gcacagaggc 18421 attcataggt tttatgaaaa cattttcaat ttggaacttc tttatctcaa atgagaggta 18481 aaataagtac atttgaaagg gaaagcactg tattgagaac agatattata tacaaatgag 18541 gcaggggaat aaaagtaaat ccgggggaga caggcagtaa attatttctt tgacagcaaa 18601 atgttgactt taaattttgg gcccctgggc acaactgtag ggaaccagtt aattattttg 18661 gagtactgta ctatttcatg actattctgt gatctgggtg ttaatgggcc aggtgctatg 18721 gtgtacactg aagactggga ggcccacatt tagggaagtg aaaacttagt tagagatgtg 18781 gaattagcac acaagaaaga tatcaacaca ttcagaaagc tgtattcatg atttacagtg 18841 gagcatatta ctgctgttgc aagaaacagt tcttcctctt tcattttcct gcagttccaa 18901 atgaccttcc acctgtttat tgctgcattt gtgggggctg cagctacact ggtttccctg 18961 gtgagttgac tttgaatgat cttggcaagt aaataggcct gagatagtgt gggtacagct 19021 attctgaaag gcaagaaggt agactgcttc catccttgaa atgctggagg gaagcttctg 19081 ggaagatagc aaagggtgga ggctccgtac ttctacttgc attggcaagg cagaaagaca 19141 tcacaaatta gatgggaaca agaaaagtgg tatgaggaaa gaggtaagag attcatagac 19201 acgaatgata atatttagta tttatagaac atgtaatgtt gccaagtact ttgcagtcat 19261 aaattacggt taattctcaa tacattttga tcagatcagt gacaaaacac tgaaacatca 19321 cgaaacatga cttgctatag cctggaattc tatatactat caatatgaat ccactttgga 19381 tactctccag tggatttagt tactcatatg gaaatactgg gaggacctcc taacattatt 19441 agaattgtta tgattataat acaatgctat gtcccaggtc ttgctgatag tgctacagtg 19501 ccctgtgaat gtagtgtgct cattgtgcag attaaaaacc taaggcactg aagggtgaag 19561 tgatttatct gaagttattt tataagcagt gatcagacaa gatgagctca cagaactcct 19621 ggcccctact gctgaggttt ccatacagag tcaagtaatt tctcaccttg taaaacgaat 19681 tgattcatta accaggggag agctctactg catgatgtgg ctgtgtgtct acagcaagca 19741 ccctatgact ctaagtcact cggacatatt gatgtggcaa agccaaatat tgttcacttc 19801 cctgaggaaa actcagtgct agatcaaaca gaggtgtgga ataaatcttt atgatttgat 19861 tctctgggcc tgggccatga gacccatgat gcctcagaga catcggactt ccagtcaagt 19921 gtatatggag aaagccaagc ctgggatgta ctgctttttg cagagcatgg gtttttccct 19981 tatttagtta tgattttatt tctacccttc ctcattccca aagggatttg aggagggagt 20041 gctttctttt ctactctcat 
tcacattctc tcttctgttc cctacagctc accttcatga 20101 ttgctgccac ttacaacttt gccgtcctta aactcatggg ccgaggcacc aagttctgat 20161 cccccgtaga aatccccctt tctctaatag cgaggctcta accacacagc ctacaatgct 20221 gcgtctccca tcttaactct ttgcctttgc caccaactgg ccctcttctt acttgatgag 20281 tgtaacaaga aaggagagtc ttgcagtgat taaggtctct ctttggactc tcccctctta 20341 tgtacctctt ttagtcattt tgcttcatag ctggttcctg ctagaaatgg gaaatgccta 20401 agaagatgac ttcccaactg caagtcacaa aggaatggag gctctaattg aattttcaag 20461 catctcctga ggatcagaaa gtaatttctt ctcaaagggt acttccactg atggaaacaa 20521 agtggaagga aagatgctca ggtacagaga aggaatgtct ttggtcctct tgccatctat 20581 aggggccaaa tatattctct ttggtgtaca aaatggaatt cattctggtc tctctattac 20641 cactgaagat agaagaaaaa agaatgtcag aaaaacaata agagcgtttg cccaaatctg 20701 cctattgcag ctgggagaag ggggtcaaag caaggatctt tcacccacag aaagagagca 20761 ctgaccccga tggcgatgga ctactgaagc cctaactcag ccaaccttac ttacagcata 20821 agggagcgtn agaatctgtg ntagacgaag ggggcatctg gccttacacc tcgttnaggg 20881 aangagaaac agggtgttnn gtcagcatct tctcactccc nttctncctt gataacagct 20941 accatgacaa ccctgtggtt tccaaggagc tgagaataga aggaaactag cttacatgag 21001 aacagactgg cctgaggagc agcagttgct ggtggctaat ggtgtaacct gagatggccc 21061 tctggtagac acaggataga taactctttg gatagcatgt ctttttttct gttaattagt 21121 tgtgtactct ggcctctgtc atatcttcac aatggtgctc atttcatggg ggtattatcc 21181 attcagtcat cgtaggtgat ttgaaggtct tgatttgttt tagaatgatg cacatttcat 21241 gtattccagt ttgtttatta cttatttggg gttgcatcag aaatgtctgg agaataattc 21301 tttgattatg actgtttttt aaactaggaa aattggacat taagcatcac aaatgatatt 21361 aaaaattggc tagttgaatc tattgggatt ttctacaagt attctgcctt tgcagaaaca 21421 gatttggtga atttgaatct caatttgagt aatctgatcg ttctttctag ctaatggaaa 21481 atgattttac ttagcaatgt tatcttggtg tgttaagagt taggtttaac ataaaggtta 21541 ttttctcctg atatagatca cataacagaa tgcaccagtc atcagctatt cagttggtaa 21601 gcttccagga aaaaggacag gcagaaagag tttgagacct gaatagctcc cagatttcag 21661 tcttttcctg tttttgttaa ctttgggtta aaaaaaaaaa aagtctgatt ggttttaatt 21721 
gaaggaaaga tttgtactac agttcttttg ttgtaaagag ttgtgttgtt cttttccccc 21781 aaagtggttt cagcaatatt taaggagatg taagagcttt acaaaaagac acttgatact 21841 tgttttcaaa ccagtataca agataagctt ccaggctgca tagaaggagg agagggaaaa 21901 tgttttgtaa gaaaccaatc aagataaagg acagtgaagt aatccgtacc ttgtgttttg 21961 ttttgattta ataacataac aaataaccaa cccttccctg aaaacctcac atgcatacat 22021 acacatatat acacacacaa agagagttaa tcaactgaaa gtgtttcctt catttctgat 22081 atagaattgc aattttaaca cacataaagg ataaactttt agaaacttat cttacaaagt 22141 gtattttata aaattaaaga aaataaaatt aagaatgttc tcaatcaaac atcgtgtcct 22201 ttgagtgaat tgttctattt gacttcacaa tagaaactta ataatcgtac cttgttcaag 22261 gagatcattc atttttcagc tcatccaagt cattctcata acatttctct gaaataaata 22321 gtatatgaat agattacttc tacttttata gttgaagaca ctaagaaata gaagcaaagt 22381 aatttgccca aagaaatcca gtaagtacat gtctgagctt gtgttaaaac gcagatattg 22441 aaaccaattt attgcctact taaaggtttc ttttctcttc gaagttgggt ttcagaatgt 22501 tcagagtcaa ctatggttac tttttcaata ccttagtggt gccccagtcc ccggtgcatt 22561 tagatttaag ttattgttac cttctcttta aattgtttgg atattccagt aatgatccct 22621 tagtaattca tacgtgacta atatttagtt ttatttggat agtactggat ggaaggttta 22681 gtacttaaag gacagagcag gtatggaaag ggcaacgtaa atgtaaacag gctgtggcag 22741 ggcagtacta aaataaatta gtgacaccta ccaccctgga tggccagcct actagcccac 22801 cggggacata gcattaaagc cctttcacaa ccctaggtta gaatttggca cccttggaca 22861 gcactctgat gaccagctta aagaaagctg tcttaaaatc atttcattgc cccatagttg 22921 cagctggcaa atgactggag agaaaggaat ctttagctgg agggatgaca agtcagtcat 22981 cagttaagga gctccattca aaagcagttt caatttaatt tcctgatttc tgcttaacca 23041 caattaatat tctgcaagcg agtctggttg aacgacttta agacataaag aataaaaata 23101 tgacagggac ttattttaag acactgcaaa caaggacaca gcaccatatt ttggagaatt 23161 gattcagggt tctacagagt gattgtattt ttgcctcaga ggaaccgaaa gccagttccc 23221 aagaaagcta tgttttccat ctgcccttat ttggctctgc ctctgggatg aatctataga 23281 tggagtttct aggctctcag aagctgagag catctccagt ctatcaattg aacccattgt 23341 tcttagctct cccacacacc ataaacctcc ttttttctaa ctgaaagagc 
tctctttgtg 23401 ctgagatcag ccgaagatca agatgcagca gtataaacaa gaaacatttt cttacagcac 23461 cagttgtatt gcttttccta tcttcagggt cagtactgag tgcagttatg caggacgtgg 23521 aagctgcagc tttgtccaga gcaacatttg ttcattcctt aattcgttct gtgaacattc 23581 attgagaaac tactaagtat tgtgttaaac cccaggaagt tcaagtttgg gggcagtgtg 23641 ggtagaaagg tgatcaagac cagatcttac actcaaggat ctctctcaga ttcagccata 23701 gcagacaggt cttccacggt aaactggagt ggaggcaagg agtgtgcaaa aggcagtctg 23761 atcggctgac tccagtgggc atctgaattc tcccctcagt atgtgggagg ggtgggtgtt 23821 aagggccttg acttcacgtg gctctgggtt tttagggcat ggatacaaac aaaacttcaa 23881 tttagagtcc tgacaacttg tgacttgctc tttctgtgtc accctgcttc ttgccttaca 23941 tgtgacagca tttatgcaca cactggctca atgcatgtgc gtgcttccac ccactcaatc 24001 atttggggag tcagagggca catgaatcaa gattgatata aacacaagct attcagtctg 24061 gccttttgtc tctatccagg ctaagtctga gaagccaagg aagggtaatg taagatataa 24121 catatcaaaa ctgcacttgt atcccctaaa tttatacaac aacaacaaaa aagataccaa 24181 gtccttctct gtttggattt ccttaactgt gaacacaggt agtgtggcta ttttgacatt 24241 aacagttaac ttcatgaata ttaagtaaca aataaaagaa atactcatta ttatatttac 24301 gaaatcaata cttggccatt cctacatcaa taagcaggta aggacttcaa gtcaatattt 24361 atatattggg aaagattttc ctcgttcact cacttgtttg aaagacgcaa acatactctt 24421 cattatattt gttgtcccac aaagtttggt gtctatgtgc tcctgttctg agactccttg 24481 aattgattga ctttattcta aaggaaaata tcagataagg taatgagtgt tttactgatt 24541 atggatcagt ccaaagaaat ccagtcttaa cattttactc aaactctcaa aaaatattta 24601 tttgataaat tatctgtgcc aagtattatg caaagtgttt gtggtatagc agttaataaa 24661 agagatacag tctcagccct catggagctc aactaaagac agaaaggggt aggcagggag 24721 acagggagca tggtgaagaa tgaggctaac atccttccac agaaacagtg gaaaagatag 24781 aatcagggag aaacttgcac agtcccatga gaaacaaaga gtcatgttaa aaaaaaaaaa 24841 gatatagaca tgtaagtctg tatttagata aatgcaatgg tatttaagat tgaaaaacat 24901 taaataggct tggactgttt tgccccccaa tgtggaaagt cacccagctc taggttccaa 24961 aggaactagc ttcatggatc ttctatatat tatgtgaaac cctatctttc aagcatcaaa 25021 ggggcaaaat acaaagctga ttctgagggt 
tacaatatta aaaccaaaca agccaactga 25081 tatctgaaag tgtatagatg gttttaattc tagttgatga tgaattcaca tgtagaccaa 25141 ctttaaaaaa caacagggaa atgtggctct tgcaatcaaa gcttccttcc atgtccccaa 25201 catgtaccag atagaaattt ggtgaaatct catttgactt aaaatattaa atcagcccac 25261 attatccaaa aaaaagtgaa gagaaacata aaaattatcc tatatgcaga tgatattatc 25321 acaagtgaac agaaacacac aaattatcct atacgcagat gatattatca caagctcaga 25381 gtggtcttag caaacaaatg gctcagctaa ttactgctag gaagaaagta tacatatcag 25441 ctttcctaaa aaaaaaattt gtcattttca gtagatgttc ctaccttcct gttggtgaat 25501 agtgtgaata attccataaa gcaggtacct caggatatat tctgcagctg gcttatctga 25561 aaggacaaaa gtaaattaca tttctcaaag ccaggtaata tatgggtgct tatggaattt 25621 ctgcacatgt gttgttgttt gtttgtcttt aatagcttag cgggttacaa ctactttaaa 25681 atatttccag gttaaaaaat tttaagtcat gctttatgcc ctatttgtct tggtgttaaa 25741 gaagattcaa aataattgtc tgggcagggc atggtggctc atgcctgtaa tcccagcact 25801 ttgggaggcc aaagagggca tatcacttga gcttgaggtc aggagttcaa gaccagtctg 25861 gccaacattg caaaactaaa aatacaaaaa ttagccaggc gtggtggttc aagcctataa 25921 ttccagctac tcaggaggct gaggaacaag aatgcttgag cctgggaggc agaggttgca 25981 gtgtgccgag attgtgccac tgcactccag tctgggcaac agagggagac cctgtctcag 26041 aaataataat aataataatt gcctgaagat aaacttggcc cggactgtaa gcagccttgc 26101 ctccaacaat aatagctgca ctttatcaag cactgtatct gcaaagtact ttacacatct 26161 cattttatcc tcacatcagt cttatgaggc agaagcaact attatccttc agttccatgc 26221 tggagcatgt ggccttgttg gtcacatcag tttacacatg agcatgttaa tcctatccta 26281 gctcagctga ctggcaaagc tgcagagatt agctgaatcc cggtatttgt cagggagcag 26341 aaccactatc attgtagctc agtatagtga acaaaagttc tctgaaaagg ggccacaaag 26401 cgagacaccg cctctacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagta ctccagcatg 26461 gtacctagtg cctgtagtcc caactactct ggaggctgag gtgggagtat agctgtagcc 26521 ccagaggctg cgggtgcagt gagctgtgat tgcaccatta cactccgtcc tgggcaacag 26581 agcgagaccc tgtcttgaat aaaaaataaa acaaacaaaa ttattaaaaa taaaaaggaa 26641 tttagaggaa agagaccttt tttttttttt tttttgaaac agggtctcac tctgttgccc 26701 aggctatagt 
gcagtggcac cgtctcggtt cactacagcc ttgacctcct gggctcaagt 26761 gatcctctca cctcagcctc cagagtagct gggactacag gcgactgcac catgctcagt 26821 gctaatttaa aaaaaatttt ttgtagagac gtggttttgc catgttgccc aggctgtctc 26881 gaattcctgg cccacctcag cctcccaaag agctgggatt acaggcatga gccactgctc 26941 ccagccccaa ggaaagagac tttattccag agaacagttt ggacaccaga gacacacagc 27001 cttaggtgta aaatgaaggt gtgctccaga gagcaaagct agggttatac ttttacagca 27061 aaagtttatg caaatgaagt attcaaactc acctagttct gactggttga tagaactgag 27121 ccctgattgg tcaaggcaac tgagcctgat tggttgaggc agctaagcca tgattggtca 27181 aggcagatga tctcagattg gttggttcag gtgagctctg gaagtcccaa agttgaacag 27241 aggtgtgggt tttggaggaa cttgagacct ctgacctata gttggcagat ggccacttgg 27301 ctctatttaa aatttaggcc cagttagcca ctcaggatcc attttggagc actgcctatt 27361 tcaggttcac atttgtcata gtacctgcca aaaactccat tgagagaagc agtggttaag 27421 gggggataac ctgaaggaag tgtactccat tcattcccca gtcatctcca aatatttttt 27481 attgtgcaat actatgagta aaacatgttg gagcccacat ccccccgcca tgaccttgct 27541 gctctaagtg tggtcttaga acaaggagca tcaggatcac ccgggagcta gttagaaatg 27601 cagaatctca gcctgatcta cacccactga atcagaaact gcactttaac aagatcccag 27661 gacgatttgt atacatatta aaatttgagc agatttgctg tgtgtcatta tataaaatgt 27721 actgaaatat atattaaaca tatgctttaa atattgacat tatattcatg tactgctgtt 27781 ctaatataca cattacaaag catggaaaac aacatttaaa agtataagat aaatatgaag 27841 taatatttta aaatatgttt tatttgataa tatatactta cacaatttat atcttatttg 27901 aaacatgata ttattaatat tttagtgaga acaatgtgac tgaatatgcc tcattaattt 27961 tgaaatcatg gtttaacatt ttagaatact gagttcagtt taaactgccc ttggctcgaa 28021 tggatgttat agctgaaaat aataacttac aaagattcat agatgtaaat ggatgcagta 28081 tgacattggc agtgttcaat aaattacgtt aaattttcag cctcactttg cagttatgca 28141 aagatttttg ttgaaatttg actatcaaat tttcactctc cctgccatca tttagtggcc 28201 cttgcaaact acaatttttt tttttttttt gagatggagt ctcactctgt cacccaggct 28261 ggagtgtagg ggtgcgatct cggctcactg caacctccgc ctcccaggtt caaacaattc 28321 tcctgcctca gtctcctgag tagctaggat tacaggcatg catcaccaca cccggctaat 
28381 tttttatttt tagtagagac ggggtttcac catgttggcc aggctggtct caaactccca 28441 atctcgtgat ttgcccgcct cagcctccca aagtgctggg attacaagcg tgagccaccg 28501 agcccgaccc aaactaaaat atttttacaa atgagttcaa gctattagag taggttgttt 28561 taggcaagtt aaacatgccc tgttacttta aatatatata tatataattt tatactttta 28621 agtgttctgt ttttcatgcc ttatatgtgt atattagaac agcagtacat gaatataatg 28681 tcaatattta aagcatatgt ttaatatata tattttaata cattttatat aattacacac 28741 agcaaatctg ctcaaattta tatatatata tagggaaaag aagtgacaca catagttttc 28801 agcagcaaag tcacaagctg atgaaggtat ctccgcatat ctgttttcaa attctctctg 28861 tatggcatat ttcctttgaa aggcacttac tttctcactg attgttaaaa aggcatattt 28921 ccttgacaaa tcagcagagt gagaacacca gagtcaatct ttttaatatt tattgatatt 28981 agatattaat aaaactttat tcttatttta agattaaaaa ttagtacaaa tactagttca 29041 aatgaatcat gttgcaatca ccacagagtg cacacacatc tcacattaga ggtcatactg 29101 tgtagaagtt tctccaaaac attttgaggg tggaaatctc tttctaaggt gataagtttt 29161 tcaggcctct accataacag gtggagggaa ttacccattg gttgagaaaa catttaaaat 29221 atacaaccaa ttcaataacg ctgttttaaa attttttctt acagaaaatt tcaaacatac 29281 acaaaagtaa agagagtccc taaatgcaaa gagcctcatg caaccattac tcagcttcag 29341 taatgatcaa tattgagtaa cattttaaaa tgagtttgta ttctattgat gtccacacag 29401 atgaagacat atcattcatt cattttactg acaatccttg gtgttcagat ggatgcgggg 29461 cgggggactc cttgcagcaa cacctgcatt ttctactcta cccccaagag gacagaggac 29521 aaagcttggg gattgctcta ctttagagtc taaaacatct tcctgattta tgccagacca 29581 ataggtagca acataatact tgaattgctt cacagtggcc atcttggaat tatttttact 29641 aaattgctct gttttgtgat gtcagtgacc tgtattaagg accatttttt aaaagccgtc 29701 ctatgcaaca gtggctatgt ttactatggt catgtttttg tgggtgtcat tcattatact 29761 atatttatcc agaaggctac aggggacata tttctctaca gtctagtatc aaaactagtt 29821 gtgctttatt tgagtgttga agtaagagca gactgttatt atgattcctg atgtgttttg 29881 gatttctttt ggtgggggtg gggggcagaa cacattcatt gctttagtaa tttaatcttg 29941 ctgagttgtt gaaaccaagt gaagtttcca aaaccacctc ttactttgtt gggttctttt 30001 tctttctctt tcccctctct tttctgtgga agtgtgaaag 
gtggtcagaa ttgaaagaag 30061 aaagaaaaca gttctaggtc tcttcttccc tcaccctcat ttcaaactct agaattcaga 30121 aaaccgtaaa acactggact tagatgtggt tttgtcagta agttaataga tggattcaag 30181 tatggtgaat taacatcttc tctcaatcag atctgtatac tccagcccta ataagaactc 30241 taagctcttt actcatttta atgtatctgc gtgagagacc ttgtctggca aaaacttgga 30301 gaaataggta taaatttgac ttgaaatggc tccatgtcaa aagcttttac cttcacaggt 30361 aagaaaggct tttgatatcg ttcactatct tttttcccca cagtaaaagc ctagggtagc 30421 tcatttccag ggctgttttc aacatgctgg gacaactcca attgtttcaa agagtattag 30481 gaaaatgata caaataagac tttaatgcag ttcaatttta aatgcatata ccatacacct 30541 ttaggaaaaa agaatgagat aaactaaaat aaaagaagtg atagttttca attctaaagt 30601 atttggggat tattgcttta aaacagggtt accaacaagc atagatccat tgtaagctct 30661 gcctcattct ctccccttct gtcttcctgg gcttacagtg gagtatgatg atgtctccag 30721 gactaggttt cctgttaccg tggacactca tgacccctcc tccacagtgt tccagcaccc 30781 cattctccta gtctgagtac ccaccttgtt taccatttat agtagggttg tccccacttc 30841 agatcacttt acacttattg ctccaatccc ctcctaggag tgtggctcct aggctcctat 30901 catttggtcc caccactttc ccaggtatac tgttcctgcc aagtggtgac aggcctccta 30961 aagaccacgg cctgatgttc tctaggtttg tttgtcattg ggcctagcat ctgcactctg 31021 tccaccttac tgagtgtgaa taaattattg tttcaccaga agatcctgag atcctggaac 31081 tgacaacctg ggaatctaac aatggcaaac aatatgactt tgattatgta ccagcattgt 31141 tgtttgtact ttgcagatat taactcataa aagcctcata acagtccaga gatgagcact 31201 attattatca tcattctaat tttaaagatg aagcaactga ggcacagata ggttaactgc 31261 gttacccaaa gttacaggct ggtaagtggt agaactatgt tttaaactta ggtagtcaca 31321 ttccagagtc tctttactta accactgctc tatggttctc ccagcacagg gatacatcct 31381 atcacagggg aggaagtctt taaaatgaaa gaaaacattt tatataacaa actttcctgt 31441 ttgttgtcct gggagagggg catgtcaagg acaaaatgag attaatgaga acagaacact 31501 gagggccaga acaggctaat cagaacttgg aacacctagt ctctaccatg aggggagtca 31561 atgccactgt gcaacttatg taaccactaa atacagaaca aattttatca agaaattaac 31621 agttaaaaag tacataatac ataaacacca gtggataact tacgaagata ccagcatgtt 31681 taaggacaat attaaatgga 
tcgaagtggg tgcctgaggt gagagaggaa aataggggta 31741 gggatgatag aagaagaaag agagagagag agagaaagag agagagagac agagagagag 31801 agagaaaaaa agggagagag acagagagag agagagagag agaggcgggg gtgaagatag 31861 agagagaggg accttgcctg aaatgataat gaaaatatac taagaactca gaaatacgat 31921 caactcaatt ctcagccacc tgaggttcaa attaaacaaa caaatgattg acctggaatg 31981 gcatcctttg actcttgagg gcactgataa tttttctagt tggatacata gagtttctga 32041 tacatattaa gggtaagcag gactacagtt taggaagtgg ctacatagga atgaaccttg 32101 atgtagcctt tgctttactg ggaaatgact aggttattct ttgcctctgc ttttctttgc 32161 agttacatgt gaaaataaac tcatatcacc agctatcagt gtatgatagg ttactttttt 32221 acaactttat ggttatataa ttcatataca attgacccat ttaaagtgtg caatccaatg 32281 agttttatta atagtatgct cacagagttg tgcaactatg accacaacca atcttagaac 32341 attttcatca cccccaaatg aagatccata cccactagca atgtttccct attctttcac 32401 cctcacaccc ccaacactag gcaatcacta atccaccttc tgtctctgtg gatttgtctg 32461 ttctgggcag ttcatataaa tagcatcata tactatgtgg tcttttgtga ctggcttact 32521 tcctttaaca taatattttt gaggttcatt catgttgtag catgtatcag tactttgttc 32581 cattttattg tcaaataata tatatttcaa ttgctgaata atatatattc gattgcatgg 32641 gtatgacaca ttttgtttat ctattcttca gttaatgagc atttggggtt tatctgtttc 32701 tttgctgtta taatagtgct gcaacaaaca ttcctgtaca agtttttgtg tgaacatatg 32761 ttttcaattc taatggattt atgcctagga gtagaattgc tgagacatat gataatttta 32821 tgtttaactt tttgaggaac tgtcagactg ttttccatag tggctgcaac attctacatt 32881 tccaataaca atgtatgaag gttgtaattt ctccacatcc ttgtcaacac ttgttattat 32941 ctggcttttt aattatagcc atcctactgg gtatgaagtg aaatctcatt atggttttgt 33001 tttgcatttc cctgatgact aatgatgtta aacatctttt catgtactta tggtccattt 33061 gtatatcttt agagaaatgt ctatgagata ttttgccaat tttaaaatta tttgtctatt 33121 attgtgttat gagtacttta catattttta tttacttatt tatttttgag actgagtctc 33181 gctctgtcgc ccaggctgga gtgcagtggc atgatctcag ctcactgcag cctccgactc 33241 cctggttcag gtaattctcc tgcctcagct tcctgagtag atgggattac atgcacatgc 33301 catcaggcca gctaattttt gtatttttag tagagacgga gttttaccat gttggccagg 33361 
atggcctcca tctcctcatc tcgtgatcca cccgcctcag actcccaaag tgctgggatt 33421 acaggcgtga gccaccgcgt ttggcctact ttacatattt tagctaccag ttccttgtca 33481 gatacatgat tttcaaatat tttctcccat tctttgtgtt gtcttaactt tcttgatggt 33541 ggctttgaag caaaaaagta tttaacttag atgaagtcaa atttatctat tttttctttt 33601 gtcgctcttg tttttgctat catatctaag aaatcattgc ctaatctaag gtcacacaga 33661 tttttaccta tgttttcttc taagagtttt atagttttaa cacttacctt tacatctata 33721 attcattttg agttaatttt tgtttatact gtgaggcagg agtccaattt cattgttttc 33781 catgtggata ttcaggcatc ccagcacatt tgttgaaaag attattcttt cccccattaa 33841 attatcctgg cagtcttttt gaaaatagat tcatcataaa tgtaagggtt tatttcttgg 33901 ttctcaattc tatcccattg acctatgtgt ctattattat gtcattatta cactgtcttg 33961 attactgtag ttttgtagta agttttgaaa tcaacaaatg tgagtcctcc aactttattc 34021 ttttccgagt ttgtattagc tattctgagc tctttgcact tccatatcaa tttcaacatc 34081 aatttcagca tcagcttgtc agtttttgca aacaaaacaa gacaaaacaa aacagcttgg 34141 attttgccag ggattatatt gattctatag atcaatttgg ggaatattgc caccttaaca 34201 atattgagcc tttcaattca tgaaaacaga acacctttcc gtttatttag gtcttgttta 34261 acttatttca atgaagtttt gtcattttct gtgtacaact cttgcacttt tttgtcaatt 34321 tttttctaag tatttttgat gctattataa atgaaattat atttttaatt ttctttttgg 34381 attgcttatt gctagtgtat agaaatatga ctgatttttg tgtgttgatc ttgtgtcccg 34441 ctgaactcat ttattagctc taatagattt ttggggcttc tttacaattt tttaatatat 34501 aaaatcacat catctgtgaa tagagattat cttacatctt cctttccact ttggaggcct 34561 cttgtatatt gcttctgcct aattgacctg gttagaactt ccattacaat gttaaataaa 34621 agtggaaaga gtgaacatct tgtctttttc ttgatcatag aaggaaagct tgcagtcttt 34681 ccccattaag tagaatgtta actgcagatt tttcataagt gccctttatg aggttaagga 34741 aattccctgt attcctagtc tgttgagtgt tttcttttat caagaaaggg tgttcaattt 34801 tgtcatctat taagatgatg atacattttt tacccttatt ctaataatat ggtgtattct 34861 attgactggc tttcatatgt gaaccaggtt tgcatttctg ggataagtcc tagttggtca 34921 tggtgtatga tcctttttat atgttgcctg actcagtttt ctactacttt attgaggatg 34981 tttgcatcta tatttgtgaa ggttgttggt ctgtattttt cttttctcgt 
tttctctttt 35041 acctggcttt ggtatcagga gagtaatggt ctcatagaat gagttgggaa atgttccctc 35101 tatttttgga agaatttgtg aagaactggt gtaaaagcct ggtggaattc accagttaag 35161 ctgtctggtc ctggactttg tgggaggttt ttctcattat taatttaatc tctaaaattt 35221 tctgtttctt cttcagtcag gcttggcagt gtgtgccttt ctagaaatgt gtttatttca 35281 tttaaattat ttaatttgtt tactaggttt ttttaaaaat aattttcttt ataattaaaa 35341 aaattcttta aggtgatact ttatttctga tttttgtaag tcaactcttg tctcttttct 35401 tggtaggtct agctaaattt tgttgatctt ttcaaagaac caacttttga ttttgttgat 35461 tttctctatt gcttttctat tctctatttc atttgtttac ctctaatctt tttgtttgat 35521 tgcttttgag aaacagggtc tcactctgtc acccaggcta gagtacaatg gtgcaatcag 35581 agctcactga aaccttggaa cttctggact caagtgattt tttcacctta gcctcccaag 35641 tagctgggat tacaagtgtg actcactgca cccagctcta atctttgtta tttacttctt 35701 tctgcttgct ttggatttag tttactcttc ttttcctagt tttaagatga aaagttggtt 35761 attgatttga gatctttctt ccttttaaat ataggtactt acagatacag tataaatttc 35821 ctctaagcac tgcttttcct gtatccggta agtttagtca ggttgtaatt tcatttttat 35881 tcatctcaag gtattttcta atttttcatg tgatttatcc ttttatctat tggttattta 35941 ggtgggtgtt aatttccaca tatttgtgaa tttcccaaat ttccttcagt tgttgatttc 36001 taatttcgtt gtaccatgat cagagaatat tctttagata atgtaaatct tttaaagttt 36061 attgagactt ttgttatggc ttaacatata atctatcctg gagaatgttc attctgcact 36121 ggagaagaat atgtatactg ctgtttgggg gtagagtttt ctgtagatgc ctgttaggtc 36181 tggttggtat atagtgttat tcaagtcctt tcttagttgt ttagttgcct agtttttcta 36241 ctcattattt caagtaagat attgaagtct ttaactatta ttattgttga attgtccatt 36301 tctcccttcc attctatcag ctttttgcat tatgtatttt gaggctctat tgttaggtgc 36361 atatatggga tagatttttg aaggttgttc aacttgttta cccaaggtga taaaagaatg 36421 tactcagacc gcagaagaga tttaaccaga tctcttcatg ttagtctcaa ggctaagatg 36481 gaaaaacatt gactatttga taatatagtt atgtgggcct tagcaaataa gttcctgagg 36541 aaggaaaagt taaaagatgg ggcagggagg gggcggtcct cgtggcccaa agcctgcctt 36601 atcttccaca tgccaggaca taggctgtca ctagattcca tgctggtggg aagagcagct 36661 catggttgca ttttactctt tgatgttcca 
aacactccta aaacgagata ttttaagaga 36721 gaaatgaaca aatgagagtt tgctttgaag tgctcagaaa ccaaagaatc attctagact 36781 cttccttcat gagtcatacc tagttctgtc agtcctacct ttttcctctt attataactg 36841 aaacctgaat gattgaaaca gcctcatctc tgctcctcct acccctcttc tagccccaag 36901 tctgatctat ctcctcactg ccgtcagact actctttaaa aaggcaattc tgatctccaa 36961 tcacttttct gtttaaaact ttttactgaa agcaagcaca gtcagtgatt tctgtggggc 37021 tttcctcctc ttcacctctc tctactgctg tcaccccagc ccccactcct cccataataa 37081 gaagcagacc ctccctgcct ttcccactgt caagggactc ttctctgaaa gaataacatt 37141 ccctgggctt cttcactgaa ttattcatct ttgctgcctt ggtatggcaa aacttctcat 37201 taaaagccct ggttgggagt gacctactca gcccagccct cacaactaga cagagagaag 37261 ccaaggggag gaagagatgc agtggcaggg gtgggaggct cccccaaagg gctgtgacag 37321 ggagtcccca aaacccaggg acttgggaag ctgaatggca cctctagaga ctcatggccc 37381 tggggtgacc atggtgctca tctgggctgt catctttctc cacagctgtg gtcaaaccca 37441 gaggcctgtt tccaccattg attcccagtc ctggaatctt ctgagcactg agcagtcata 37501 gaaaatccag tcatgggatg gaagagacct cagctgtgta aaatgggacg agttatccct 37561 attgtacctt gagtgactga ggccatgtca ggaggagagc cagagtgggg gagagaagac 37621 tgcatttcta gttctagctc tggtattaac tggctgtgtg accccaggga tattgcctgc 37681 cttctctttg agtctttgtt tgctagtaca cagaaggcaa cacacatgct gatgccatga 37741 agaccatggg ctctgctgcc aggctgcagg cctctgtttc ttccaagata tgtaaccttg 37801 aaaaaccatt tagcttgaca aagccttagt ctcctcatct gtaaaatggg ggcaacaaca 37861 gtacctacct cagacgattg ttgaaaatgc acataaagca ctcagcacat ggccttaata 37921 aacacccaat aaatgtcagc tactggtctc tcatcagaga gcctcgttct ccagaacttg 37981 gtttgttgag ccgtaaactg tcaggtctgt ctttctggaa ggaatgtgac attctggcca 38041 tatcgtctcc cagaagaggc tgtttgcagg tagcctgagc catcccttca gaatatgttt 38101 agatcgatac acatcacacg tgatgccatg caaaggcaca tgacccaccc ctcgttttct 38161 ctgcaccctt gaaaacactg ccttggagcc cagtgctcct gagctttaga agccaccttc 38221 tcatgattct gtacacagca gctgtaagag aaactctgag ggcccactct cagggagaca 38281 ggaaggctgg ggagccagag gccaggtgag ctggggccag ggcaagaggg tcagtaagaa 38341 acagggcccc 
cttccccctg ggcttcgggg atccagctaa ctggctggaa atgcaccttt 38401 gcctgtggct tcttgcccaa gacctggcta gcttggcttt cctgtttgct gtacatgttg 38461 ggattgactt catattctgc ctagagacta ggacccatga agaagggccc aggccctttc 38521 tttgtcacct ttcaagccta ttaatggttg ttatggaatg aatgctagtg aaattcatat 38581 gttgaagctc taacccctta taggatggta tttggagaag gggatctttg ggaggtaatt 38641 aggtttagat gaggtcatga gggtggggtc ctcacaatga cattggtgtc cttataagaa 38701 aagggagaga ccagagcatg atttctctct ctctgtctct ctcgctctct ctctgttatg 38761 tgaggaaaca ataagatggt ggccatctgc aagctagtaa aagggccctc tggagagcct 38821 gaccatccta gtctgggatt tccagcctcc agaactgtaa gagataaatt tctttctttc 38881 tttctttttt tatggagttt tgctcttgtt gcccaggctg gagtgcaatg gctcaatctt 38941 ggctcaccac aacctccgcc tcctgggttc aagcaattct cctgcctcag cctcccgagt 39001 agctgggatt acaggcatgg gccaccacac ccggctaatt ttgtattttt aatagagaca 39061 gcatttctcc atgttggtca ggctggtctt gaactcctga cctcaggtga tctgcctgcc 39121 ttggcctccc aaagtgctgg gattacaggc gtgacccacc atgctcggct taatttctgt 39181 tgtttttaac cacctagcct gtgatatttt ttattgctgc ctgagctgac taagacaatg 39241 atgctgcaag aaaccttgcc accagagcag ctaagatgaa cccccatgga gccactgcca 39301 tgcagcatgg tagttcaagc tggggcaggc accagtgcct cccatccatt ctttcaggta 39361 ttcattcttt tttttttttt ctcagaggca gggtctctct ttgttgccca ggctgaagtc 39421 atactcctgg gatcaaatga tcctcccacc tcgacctcca aagtagctgg gactgcaagt 39481 atgtgccact gtacctggct taattaaaaa atctttagct ggcacctacc ctatgcctgg 39541 ccctgcacta ggtgctgtag ggtgcacagt aggcccaggc ctcaactttg tggcgcttag 39601 tggctagtag gaaggagcct tgctgaagaa tcatacacat ggaaagtgag acacaataga 39661 attttttaaa ccccaggata ttgtgaacac ctataagact gagtaagggg ccaggaaaga 39721 actccaaggg ggtgatgttt aagtctggac accaagaaaa agaaggcacc agctaggcaa 39781 atagtgaagg ggagattggt gcaggtggag ggatgaagtg aattcctgct tcccaggcag 39841 ggcatttcca aactcttttg tcttagcaga gtgtggcttt ttgaggtctg gatgcctgtt 39901 ggcttctact gccctaacaa actctctgac tactgccatc tgggctttgc atcagagtca 39961 ccagaaaaac ttgataagat gcagatgcct gggctgcacc accctttgag actttaagca 
40021 ggtcatggtg gggtccagga atctgcattt taacaaagcc cccaggtgga tctgctacaa 40081 gtagtctgct gttctgtgaa tgcttctttg aaggcactgg caggataaag tctaaatgcc 40141 atggcataac ctgcgtgact cttcagcttt tcctgtctat tgttatagct gtctcgctag 40201 tcaccttcct ctaggcttct catgcccact cccaccacac ctatcctcca caacatgagt 40261 aatcattctc taacccaaac ctaagcttcc tacccctttg cttagaacac cttcttttat 40321 tcctttttcc tttcacaata gatccccatt tcttaacatg gtatcaatca tgcttctgcc 40381 acttcccatt tactctcttc accttcatac ctcaaacagt acctgacgct gggaagacat 40441 tgaacagata tttatttaat tgctacagta atgggactta aatatgtcat atttcactga 40501 aaaacctggg gatatttcaa gttataaaaa tcatgtccag gagggaaaga actgtcttta 40561 gttatcggaa gtcctgtgtc atgtggaatc cttgttttga gtggccctag gaccaaggag 40621 tggaagtcat aagaagtcag aatttactca atataaggga tcatgtgcca tttttgcaag 40681 atataatgat ctgcttcaaa aaatggtaag ttttctgtct cctgaggtat tctagcattg 40741 actctgtggc aggcatttgg tgaaaatact caagcattgg gtgaaaatac tcaagcattg 40801 ggtggaattt ggacaaaaca acctactagt gaaatcacag gattctatga ggaagtaaaa 40861 gtgaagaggg gaagaaaata atctctccat tccagggaaa tagaatattg tggttaatta 40921 tcatgtcact atgcaacatg catgaaatgc ttattctcca taccataaaa tagacctaat 40981 aataataacc cttagttttt ctttgatgat ccagaaatta atttgggctc ttgcaatgtc 41041 taggggttcc cgttataaac attaattttt attataatag tgacatgtgg aaatgccagt 41101 taaaatctct gccagaatgt caaataaaca tgctttatta ctgtaactaa agaaatattc 41161 tagaaaaaaa atttaaaaat gtatcatgca gagacacaat agctttccaa ctcaatttag 41221 tatatacaaa aaaattacct taggacaaag agccatgcca tcacaatcta aagaataaca 41281 ctacagatta gaagactcat ttggctgaaa tacagctaat aacaaaagct aatgacttag 41341 tcttcttagg tggctataaa aataccatag acgggatggc ttaacaacag aaatttattt 41401 ctcatagttc tgaaggctgg gaaagagcct tcaagatcaa ggtcttggca gattcagtgt 41461 ctggtggggc ccccttcctg gtttacagat agctaccctc ttcttgtatc ctcaaagcag 41521 aaaaagagag agaaagagag gtctcctctc tttctctctc tctctctttt atggaaagcg 41581 ggggcaagga gtttctctct gtcacccagg ctggagtgca gtggcatgaa catgactcat 41641 tgcagcctgg acctcctagg ctcaagcaat cctcctgctt 
cagcctttga agtagctggg 41701 aaaacaggta tgtgccacca taccctcttt ctcttcttat aagggcacga atcccattat 41761 gaaggctcca ctctcacgaa ttaatagcta actgtaatta cctcctaaag actccaccac 41821 caaataccat cacatggaag attaagtttt taacatatta attagggaga tatattcagc 41881 caatagcaac caaaaactgc tgcatagttg tcaattaaac accaatttcc acagttcatt 41941 gccaatagct atttttctta ttttatctct acatgaaaga aacaaggaga gtgtgggact 42001 gagtagtagg atc // LOCUS HSY09781 36159 bp DNA PRI 09-JAN-1998 DEFINITION H.sapiens beta-sarcoglycan gene. ACCESSION Y09781 NID g2769563 KEYWORDS beta-sarcoglycan. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36159) AUTHORS Duclos,F. and Broux,O. TITLE Structure of the beta-sarcoglycan gene, genomic sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 36159) AUTHORS Bourg,N. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) N. Bourg, Genethon, Dystrophie Des Ceintures - J.S. Beckmann, 1, Rue De L'Internationale, Evry, 91000, FRANCE COMMENT Other reference: AC=U31116; AC=U29586. cosmid (clone: cos4) produced by K.P. Campbell, Howard Hughes Medical Institute, University of Iowa, Iowa City, USA. 
FEATURES Location/Qualifiers source 1..36159 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q12" /clone="cos4" repeat_region complement(644..940) /rpt_family="Alu" repeat_region complement(3903..4199) /rpt_family="MLT1d" repeat_region complement(3910..4148) /rpt_family="MLT1c" repeat_region complement(3919..3971) /rpt_family="MLT1b" repeat_region complement(4198..4465) /rpt_family="MLT1a" repeat_region complement(4198..4329) /rpt_family="MSTc" repeat_region complement(4213..4373) /rpt_family="MLT1b" repeat_region 4653..4770 /rpt_family="MLT1a" repeat_region 4669..4770 /rpt_family="MSTc" repeat_region complement(4780..4860) /rpt_family="MLT1a" repeat_region complement(4871..4930) /rpt_family="MLT1c" gene 6952..34875 /gene="beta-sarcoglycan" GC_signal 6952..6957 /gene="beta-sarcoglycan" exon 7000..7032 /gene="beta-sarcoglycan" CDS join(7000..7032,11602..11811,15358..15543,16300..16491, 17121..17252,21053..21256) /gene="beta-sarcoglycan" /note="43kDa dystrophin-associated glycoprotein" /codon_start=1 /db_xref="PID:e284885" /db_xref="PID:g2769564" /translation="MAAAAAAAAEQQSSNGPVKKSMREKAVERRSVNKEHNSNFKAGY IPIDEDRLHKTGLRGRKGNLAICVIILLFILAVINLIITLVIWAVIRIGPNGCDSMEF HESGLLRFKQVSDMGVIHPLYKSTVGGRRNENLVITGNNQPIVFQQGTTKLSVENNKT SITSDIGMQFFDPRTQNILFSTDYETHEFHLPSGVKSLNVQKASTERITSNATSDLNI KVDGRAIVRGNEGVFIMGKTIEFHMGGNMELKAENSIILNGSVMVSTTRLPSSSSGDQ LGSGDWVRYKLCMCADGTLFKVQVTSQNMGCQISDNPCGNTH" repeat_region 8443..8527 /rpt_family="MER45" repeat_region complement(10636..10912) /rpt_family="MER2" gene 10829..28095 /gene="unknown" exon 10829..11035 /gene="unknown" /note="potential internal exon" repeat_region complement(10903..11204) /rpt_family="Alu" repeat_region 10912..11166 /rpt_family="SVA" exon 11602..11811 /gene="beta-sarcoglycan" /number=2 exon 15358..15543 /gene="beta-sarcoglycan" /number=3 exon 16300..16491 /gene="beta-sarcoglycan" /number=4 repeat_region complement(16578..16856) /rpt_family="Alu" repeat_region 16587..16838 /rpt_family="SVA" exon 
17121..17252 /gene="beta-sarcoglycan" /number=5 repeat_region 18663..18950 /rpt_family="Alu" repeat_region complement(18717..18941) /rpt_family="SVA" repeat_region 20068..20144 /rpt_family="MIR2" repeat_region complement(20379..20660) /rpt_family="Alu" repeat_region 20439..20640 /rpt_family="SVA" exon 21053..21256 /gene="beta-sarcoglycan" /number=6 polyA_signal 21493..21498 /gene="beta-sarcoglycan" repeat_region complement(21906..22089) /rpt_family="MIR" polyA_signal 22284..22289 /gene="beta-sarcoglycan" repeat_region complement(23544..23848) /rpt_family="Alu" repeat_region 23604..23810 /rpt_family="SVA" polyA_signal 24211..24216 /gene="beta-sarcoglycan" polyA_signal 24488..24493 /gene="beta-sarcoglycan" polyA_signal 25599..25604 /gene="beta-sarcoglycan" repeat_region 26039..26308 /rpt_family="Alu" repeat_region complement(26065..26296) /rpt_family="SVA" exon 27595..28095 /gene="unknown" /note="potential internal exon" repeat_region complement(30955..31241) /rpt_family="Alu" repeat_region 30971..31215 /rpt_family="SVA" polyA_signal 32383..32388 /gene="beta-sarcoglycan" polyA_signal 32419..32424 /gene="beta-sarcoglycan" polyA_signal 32468..32473 /gene="beta-sarcoglycan" polyA_signal 32766..32771 /gene="beta-sarcoglycan" repeat_region complement(33011..33290) /rpt_family="Alu" repeat_region 33055..33284 /rpt_family="SVA" repeat_region complement(34154..34348) /rpt_family="L1MD1" repeat_region complement(34196..34348) /rpt_family="L1MB7" repeat_region complement(34206..34348) /rpt_family="L1ME2" repeat_region complement(34233..34348) /rpt_family="L1ME3a" repeat_region complement(34236..34348) /rpt_family="L1MD2" repeat_region complement(34246..34348) /rpt_family="L1MC2" repeat_region 34332..34605 /rpt_family="L1MA10" repeat_region 34382..34858 /rpt_family="L1PA7" repeat_region 34382..34879 /rpt_family="L1PA15" repeat_region 34382..34881 /rpt_family="L1PA11" repeat_region 34382..34605 /rpt_family="L1" repeat_region 34382..34745 /rpt_family="L1PA2" repeat_region 
34403..34605 /rpt_family="L1MA2" repeat_region 34407..34605 /rpt_family="L1MD2" polyA_signal 34411..34446 /gene="beta-sarcoglycan" repeat_region 34427..34800 /rpt_family="L1PB1" repeat_region 34427..34875 /rpt_family="L1PB3" repeat_region 34438..34605 /rpt_family="L1MA9" polyA_signal 34870..34875 /gene="beta-sarcoglycan" repeat_region complement(34937..35022) /rpt_family="L1MA9" repeat_region complement(34937..35038) /rpt_family="L1ME2" repeat_region complement(34941..35038) /rpt_family="L1MA10" repeat_region complement(35102..35842) /rpt_family="L1" BASE COUNT 11013 a 6912 c 6930 g 11304 t ORIGIN 1 ggatctttgt ttcaattttt ggttcatagt tcaccttagg tcaagcattg attttgggcg 61 cattagccaa tcaccaaaag ttgggctccc ttttttagta aaatttattc tgccagcttg 121 ctagaataac tactatttct aacctccagt caaggaagtt ctctatagtt gcacttttaa 181 caaatttaaa cccagcttaa actgataaac ttttcctaat tacctcaacc cctcagagct 241 catcactttc ttctttatgc tttctctctt taacttttgg acatgttaat attcttgcaa 301 ttatcatgct atattatatt tatttgcttc ctatctaagc ttttgagctt catgaggaaa 361 aggagagagg cttatttagc ttcatatcct cagcacagca actggcacat aagaaagaag 421 aaacaagttg ggggccagga ggaggaagag aggaaggaaa aggaaacaga ttggaaagga 481 aaggatgtga ttggctgctt tgtatctgat gtcataatcc actgagatag aaggcaattt 541 taacagccaa ttcgtggttc ttacggtgtc ataattctac tacctagcca ggagtgattt 601 gttctagaaa ggataatgtg tcaagtacgt agatggaata gggtttttgt ttgtttttga 661 gacggggttt cactctcgtc acccaggctg gggtgcaatg gcatcatctt ggctcacctc 721 aacctccatc tctcgagttc aagcaattct tctgcctcag cctcccgagt agctgagatt 781 acaggcatgc accactacgc ccggctaatt ttatattttt agtagaaacg ggctttctcc 841 atgttggtca ggctgctttc aaactcatga cctcaggtga tacgcccacc tcgtcctccc 901 aaagtgctgg gattacaggc gtgagccacc atgcccggcc ttggaataga ttttaatttt 961 gtctaccctg gatttttacc tacatcagtc agcttgaaag atttttaatt gggtatttta 1021 agaagccaac aaaagatgag ctgaatccac tgcctgaagt gagagctgaa gtgagagcta 1081 tgtatattta gttcacacta caccaaacat aaatatatgc aataaattac aggggattat 1141 attacgcagt aggattttca tcttaatttc aagtcttctt cactaaggac 
agttgctaac 1201 ccttagccag gggctgataa gaacaacctg tgtttggctt ggatgctctt ttaagcactc 1261 ctcatccttt cctggctgag ccacggtatt cttctccttc tgctgtctgt gtttggatct 1321 tgtgtctttg gatcagggcc tgtattacta tttgaagaaa atcttttagt ctttgctgac 1381 ccacttgcat aaagttgcct ttaaatgcaa tgtttatcca aggtattgct taggcccttt 1441 gtgctcagag gcactttgtt cttgtaggta gcaaaacact tctttgtcca cttaagtgtt 1501 ttctaacctg agtgttctgt attctccttt caggagttct gttttgtcct tttctctgcc 1561 acattaaggg aaaggaaaat attttctaaa aaacacccat tggagaacaa atgaatttat 1621 tgggaactga gtgggaggca ggttccccca ctgacagaag gaagaatgct gtttgctgtg 1681 tcataccaag gagcactacc cacacagagc agcagttcag aatttttttg tacaattatt 1741 tcatgtgcac tattgattaa actataaaaa caagaggaaa ttttaaatat atcacaagtg 1801 tatcacctta gcccctaaat aaactataga acatttgcaa caaatacaga ttagaaggag 1861 aaaaaaccca aggtagatgt agtgtgattt tggagagctg caaaatattt ttcctatcgc 1921 aacccaggtt tccatcatag gtttttcaga aaagaaggcg tcctcaccct ccatcttttc 1981 agctctgcta gctaaatgca atggcaaggt catggttctg tgaaaatccc ccgaggagga 2041 gtaagctcag gacatatcaa gctattgcag agacacagtg gatatggtct accttgcaag 2101 atgtccttaa gcacatcagg agagggatat gggatagcac gagatagtcc ctaggatgag 2161 tgatgttgaa atagatcctt ttcagcactt ttaataaaca tttactggat ttctagtgtg 2221 acacaaccat ctgccttaag catggacaga attaagctca gtctctgttc cctagaatct 2281 cataatctat tcaagaaaga gaaaagcaac catacttcaa tgttgtcagt tagagtagat 2341 aaacgttctg cggaaacaat gactcctcct ttgcctcctt tcccatcttc tgaaaaagga 2401 tcagggaaag tcatagaaag gtaatgttta gagcctgttt tggaaggatg gagagacgtt 2461 caccaggaag agaagagcaa cttgaaacag gaagaatcat agcaaagagg catgaaattg 2521 aatggagggg cagaaaatga caaatagtag tccaatgtga ctacagagta gtcagggaat 2581 cagagataag actgcaaaga tagtcctggt ttcaactatg atgggtctta tgtatcaggc 2641 taaaggaatt gatctttatt tcataggcag tggggaggca gcaaagggct gtaaatagaa 2701 aaactgtatg attgggtctc ttttatatca cattggaggt cacagaggtt aagtatggaa 2761 ggggcagaac tggacccttg aaagagcact tagaagacac agagatgagg gttagtctag 2821 tggccttctc tggggtatgg tatggtgagc attagatacc gatgtcagag atacttagat 
2881 gctagtaata ttttttaccc aatagaaaaa attaatttaa tattcatcag agctattaag 2941 tatgtattat atacttggtt cttaatgtgt tcaggactta gagcaacatg gaaaataata 3001 acacattacc tctgctttgt aaggaactca gtctaatcag aagcaaactg tatatgcatg 3061 aaaagtaaag taacaagata ctgcactaag tatcaaatgg gtgttacagc aaattagtgc 3121 aataatagtt aaaaggaaag aggaataagg actggctgtg attgtcagga ggaggtagga 3181 ctgaggttgg ttctgaaaaa tgggaagaat ctggagaggc atgaattcac agataactaa 3241 aacagatgga caaataaata aatttcagta tagtgtgatg tgacaatagc agcatgcttg 3301 gggatcacag agaggtccac acaaatcaga aatgtatgat tgggagaagg tttaagaaag 3361 gcttttcaga aaagatatcc cttgatctga gttttagtgg acaagtagag acagctagat 3421 taaaaggaga aagagccaca tatgcaaagg aacagagcca ttaagaactt gagttcgggc 3481 aagcagatgg gggctgcaaa tattgcacag ggataaaggg atagattact ttgtctctga 3541 ttccctgact cctctgtagt cacgttggat gtgtgtgtgg gacaggcagg aaacaaggct 3601 agaaaggggc agaaggacca aaagtttcta accttttggt gaaggaccaa aatttggtga 3661 aggaccaaat ataaggagct taaattttat catgaaaggt attggagtat ggtaagcttg 3721 agggtgagga gtgacatgat cagatttgta ttttagaaag atcattccag aaatagtata 3781 gaactacttt gaggtgtctg actagatggt gatatcattc actaagggaa cacaggagga 3841 ccagattggg aagaaaggtt ttgagttcaa atttttcatg tgttgaactt gaagtgtgta 3901 tttgtaacac acaaacttgg tggctttaaa caacagaaat ttgttgtctt acagttccag 3961 aggttagaag taagaaatca gagccttctg gaggctctaa gcgtatatct gctccatccc 4021 tctttcttag cttctggtag cggttggcaa ttattggtat tccttgccta gtagagtgat 4081 cactccaatc tctgcttcta tgttcatatc atcttagtgt atgttgccat tctgtcctct 4141 tataaggata agggctcatc ctaaattcag agtgatctca agatccttaa aaacatctgt 4201 cttagttcat ttgggctgct ataacagaat atggactagg tggcttataa acaacagagc 4261 tttatttctc acagttttgg aggttgggac atccaagatc aacatgctta gagatttagt 4321 gtctggtgaa ggcctgtttc ctgattcata gatgtcactg tgtcctcaca tggtggaaga 4381 aacaaaccaa ttctcttgga tctattttat aaaggcacta atcctattca tgagaactaa 4441 tcatgaatag tgatagctaa tcacccatca tttccttcta tgatgtgaat gtgtccccaa 4501 aaattcatgt tcggaaattt aatctccaat gcaacagcat tgggtggtgt tgccttttga 4561 
gagctgttca ggtcataggg gcttggcatt cctcccctca ggatgacaag gtgttcaagg 4621 taacatcctg gaaacagaga ccagaccctc atcagacaac tgaacctgct tggacttctc 4681 agcctccaga actgtgggaa aataattttt tgtgctttat aaattcttca ctctcagatg 4741 gtctgttata gaagcacaaa gtggactaag tcagcttcta aaggccctac ctcctaatat 4801 catcacactg ggggctaaga tttccacatg tgaaatattc aggggagaca caaacattca 4861 ttctatagca atctgcaaag actctgtttc aaagtaacac cacgttcaca gatactggag 4921 gttaggactt ggatatgcct ttttgcaggg cactattcaa accactgcaa agtgaacaaa 4981 ggtgtaagtg tcttcataat ctgactcata ctccagccct ggggacctgt gttttgtgct 5041 tccaaatgct aatatcatgc aacagtcaaa agaaagagca aaattaaagg ctgtgatgta 5101 tggtagaaag agcaaagact agtcagatgg acctagattc aatactggct ctttaggagt 5161 tgaaaagatg gtcagccatt taacctctac aagccttgat ttcttcattt gtaaaagagg 5221 cattgccagg cacagagttg actttaaatg tcattatttc aggggaaaaa aggctgacaa 5281 tagtacattt gaaaataacc cacattaaaa tgtggtcatg tgagtgaatt ggatctcttt 5341 aaaagagaat agaaattggc attaagaata gtgggatgtc ctgtcagaga tggacatcag 5401 cgatgaggaa agaatccaag agaaggtgat attcaaatag ccaaaggagc atttggtttc 5461 aggaaaattg gctaccaggt atctcaccac ctaagtggga gggaggttgg caaaatcaag 5521 tcagggctta gaaattggga tactgggcag atttctaaga tatctattaa gtttaattac 5581 tgtgctagac attggagatt aaacgatgag taagacacct accttccaga tagaacacac 5641 catcttttga atgaaatcag caggtagaca agataaaaat acaatgcaat ctgaagtgct 5701 agaaaaggct aggcacacag ttaaggaaca gatctgtggg agcctctgaa gcctttagag 5761 aaaatatgac gtttgaatag aatagttctt gaaggatgaa taagagtgcg cgtgccaggg 5821 aagaagaact ttcccagcag agagaaaaac gcaaagctta ggaattaagg actagaagat 5881 aggaactcag aatcagttaa aagcccagtt gtgcaaattg atcaatgaga ccacaggcag 5941 aatattaaca cagagcctta ctgctgactc ctgcatcatt gggctttaac ttagaatttt 6001 cctgcctaaa atgcatgttt gtcccacaga ctcagagagg agtgagtcta gagatatgag 6061 tcaagtggtt ccagcattta ggaaaaaaga aatgcagaga gactgatgac cagctaggta 6121 ctgactccca ctgaattaga cagtactagg gcccaaaggt gcttattaga agtgattcag 6181 ctacatgagg aacagttgct attgacttca ctgcattttt tttcagctaa aataattatt 6241 ggattatatc 
tacaacaggg aacaatgaat gtctactcaa gtaactcgta catttgagtg 6301 gcttcccggt gttcagtaaa tgtccgttca attaggttcc cgaggatata catggtaaca 6361 ctatctccaa cactaaatga aaaatccatc cccagtatcg caggataaaa acatcagact 6421 tcataggcct ccgaagtcaa gcagcccgaa ggattcaacg tcctccttga ggactctcag 6481 tcttcaagac tccagccacc taagtgacca ctcgttcact tctgcccacg cagtcaccac 6541 gccggtccgc gtccacttgt ggtctcgata tcggcccctc ccaccctagc caaggagacc 6601 ttggtccgag aaacaaagag agcctcacag ctccggatgg caccgatctt tcagcgatgg 6661 ggaacaaagg ggaaggggag aagcgtcctc gatttttgag ggtgcccttt ctgaggaatc 6721 tgcaggcggc ccggggtgga gcggaatccc cataggcccc ttctggcccc tgtgcaatac 6781 aacgcgcccc aggccaaggt cctaggtgca ggggccgccg cagagcctga gcccgcaggg 6841 ctgggatgcg ggtcgtcagg gatgcgggct ggggagggga gggtgtgagc agcgggtgcc 6901 cccagtcccc gccagggcgc aagcgcaggc gcgcagggtg cggtcgcgga ggggcgggca 6961 cagtcgggcg gggagctcgg cggcggcggg cgcgggaaga tggcggcagc ggcggcggcg 7021 gctgcagaac aggtctgtga gtaccgccgc ggccgcgggc tggaggagcg ggggaggccg 7081 cgtcctgccg gctgggcccg cgggagccct ggccgggggt gcggggggcg cgggccgcag 7141 ggtctgggtt cggcggggcc gatcggggcg gggcggggcg ggaggagctg tgccccgccg 7201 aggatgctgg cacaagctag ggaccaggag ccaggggcgg cctcgggctc ggcccggcct 7261 ggctgatcct cggagggggt gaggaggggt ctccttccgg agagcctggg aggcggtcga 7321 gccagcctgg aagcctgcgc tgggacgtgc cagggacgta ccccgttcag ccgccagcag 7381 agatttggat aaaaataacc gcagccccag cgagaaggct cctcttacaa gaaacgggtc 7441 agctgttgcc ccggtgtcag cttgccatct ttgttttttg tttctgccat ctccaatatg 7501 cagacatccc agatttcccc tcacgctttg gtttttgaag atttagtcac cagtctaaaa 7561 ctatctcctg ttcactttta aaaatgattt atccaagatt tcctgacctg cattgtaaac 7621 ttgtaactta gaatggttta aaaggctcct catcccccca aaatctagtt atacgtatgc 7681 atacatgtac atgcatatgt gtgtgtgtta atatatacat acacgtaatc gctagggtga 7741 tttattgtaa tacagataat tttcagtaac tttccccctt acctaactta cttttccttt 7801 gaacaattat agtagtttct ctttaaaaag tctcattcta aaacattgtt gaaaaaaatg 7861 agtcaagttt tgacctttgg aacgattttc ttagataatt aactttttag agacctatta 7921 gatatatgtc actggagggc 
tcattctgaa gataaaagac gtaaaatctc tcttcaagga 7981 gcttaaagtc atacagccaa atacatgcag aaaaaatcag ctaaagacaa taggcaggct 8041 gcagatttgt aacagtaaaa taaaacatag aggaatgagt gaaactgaaa ggagacaaaa 8101 cttcttggag ttggtattta aattgtgcct tggctttcct catttcccgc ctaagttatc 8161 agcttctgtt tcagatgatt gtttttccat ctggttctat ttttgcttta cacataagtg 8221 gggcagtctg atccgttact cagtctccac ttcaaattta cgcattacgt ctccagccct 8281 gaacttaact gaattttcca taggcacctc aaactcaaaa atgttcaaaa ctggactctg 8341 tcatacttac tattaaaaag gctacaacca tccattcact tgtttaacct agccagaaac 8401 ttaggaccaa ggccatttta agggagctga gaaggacctg cacttggttt aatgttctgc 8461 tgtcctgggc ttgagattta aacagtgagc cccacatttt cttttccccc tgggccccac 8521 aaattatata gccagtccta tttgggagtc atcttcgacc cctttgtgcc acactcatat 8581 ccaaacaatc acgagatcct gacagttcta cctcttaaca cctttcccca gacttcagct 8641 tcaaagtcaa acctcagtca gtacctcatt gggactctta gagaaacctg ctgttgatat 8701 cttcacttct aggctcttca gttccagttc atctcttttt ggagtgattt tactacaatg 8761 caaatctgat tgtataggaa acctatttta actcttcagt agttcccctt tgccttcagg 8821 ataaagtcca ttgttcttgt catggtattt atgttgctta ctggccccta agcacatctc 8881 tcattgtatt tcttgcttca tccctacacc tatccttctc accatccaca taaagtgacc 8941 aaaattagtt ctcccagaga tggttgcctc tcaaagtctc catgcttctt cctctgccta 9001 gagtgacctt cttagctctt ttgcctggtt aactcctgtc catccttcaa gagttaactc 9061 aaccttcagc ttttcttaga aactttccct gatgcccttg cctgctaggt ggagatgcat 9121 tcctcttctg taccctttct gtattgtatg cattgctcca ctacagcact tgtcatgctt 9181 tgtgaactgt tagtcacctt gtctgtctac ctgactggcc tgtaattaag ctccttgagg 9241 gtggacacta tgttcatcct ggtagcttgg agaacctggc ttatagtaac tattcaataa 9301 atgttaaatg catcaagatg aagacagtaa tgtgcatggt ttggcagaga agagtaaaag 9361 ctacgtctca ggagggatga attttgtcct ttctctcttt tatgtggacc tatattcaaa 9421 ctttggctaa cacttttaag ttgcccatct gttagggctg tcagaagcct gaggttcttt 9481 tttggaacct gactacaaat tctgttaagt cttctcttaa agtatctact taatctctcc 9541 tttttcatcc cactttagtt taggtcctct taacctcaat cctccattac tacaacatcc 9601 ccagcagctt acttactggt actcctcttt 
cttctctctc gtacaaatgc ccacaccttt 9661 aacactctga atttgtgctc aaaaacctcc agtgactccc aattaaatag aacataaaga 9721 ctagaatttg taccctgctg tccaagaacc ttcacaacct ggccctactt ttccaacttc 9781 ctcttccctg ccttgccata tctgacccat ctacttctcc ccatttccct aacaactcca 9841 ggtcatcctg tctctcctcc ctttgaactc tgcagtgtgt attgcctaga ctattgattt 9901 tggcacctga acatacagag tgctttattg cattgttaaa tattttatgt taacatgtac 9961 acaaatcata tctgttaata catagaggat aaggaccctt tcctatagtt cttttgtatt 10021 cctatctatg gaattagtgt ttaagcgtaa agtgcttaat acatgcgtat tcaacaactg 10081 aattttctca gagctttaag aaataagtag gacagtatcc atttctgccc atttcttttg 10141 tgctcttgat attcaagatt attacccaaa atgtcattca accatagtct aggtcagagg 10201 ttctcaaatc attttggtct caggttctct ttacactctt aaaaaatgga ggattccaaa 10261 gagctttggt ttatgtgggt tatattcatt gatatttact ataggagaag ttaaaatgga 10321 gaaattttta aagtacaggg ctatacaagc atacattctg ttgagtgatg atgccatcac 10381 attggtcatg tagcctttac attcatgggc ctaggagagt aaaaacagca aacagtatct 10441 cagtgttatt tcaaaaatac agccccccaa agggtttgtg gaaccctccc aggactcccc 10501 accctgcatt tgaaaactac tgatctaggg taatgtgagc tctagtcagt gaaattataa 10561 gtaacaaact atgtcagcct ttccggtctt attttcatat aaatgaaaat atataccgtt 10621 gtccctcaat atctatgggg gattggtctc aggaccccca aaggatagcc aaatccattg 10681 atgctcaagt cccttatata aagtggtgca gtttttgcat gtaacctctg cacatcctcc 10741 tgtatacttt aaatccatct gcagattact tacaacactg aacacaatgt aaacactatg 10801 taaacagttg ttatactgta ttgtttaggg aataatgaca aggaaaaagt cgtacatgtt 10861 ctgtacaaat gcaaccattc actttttttt tttttttttt tttttttttt gagacggagt 10921 ctcgctctgt cacccaggct ggagtgcagt ggcgcgatct tggctcactg caacctccgc 10981 ccctccaggt ttaagcaatt ctctgcctca gcctccggag tagctgggat tacaggtgcg 11041 tgccaccatg cccagctaat tttcttgtgt ttttagtaga gacagggttt caccatcttg 11101 gccaggctgg tcttgaactc ctgacctcgt gatccacccg cctcagcctc ccaaagtgct 11161 gggattacag gcttgagcca ccgcgcctgg cctcaaatat ttttgatcca cagtggttaa 11221 atccatggat atggatccca tggatatgga gggccagttg tatacgtaaa aaaattaata 11281 aattattaat aaaagcagtt 
actcactata ggatacgtat gacaactctg ctcagaaaca 11341 agctgtatac tttgacagtg ggactgttga aagcagagag gaaaaatatc agatagtgac 11401 cttgaagttt cttctgtact ttccttctaa gcacagttat gcccttattt ggaactgcaa 11461 ctcaatgtga tttgcagttt attcatggaa aatgtgtcag taaccaattc agcttgctat 11521 agctatgaat ttgcacctaa tagataaatg caccaaaacg agagggtata gctgttccat 11581 tatgttgtat ttcattttta gcaaagttcc aatggtcctg taaagaagtc catgcgtgag 11641 aaggctgttg agagaaggag tgtcaataaa gagcacaaca gtaactttaa agctggatac 11701 attccgattg atgaagatcg tctccacaaa acagggttga gaggaagaaa gggcaattta 11761 gccatctgtg tgattatcct cttgtttatc ctggctgtca tcaatttaat agtgagtatt 11821 tgtaagaata ctcattttaa ttgccatggg gaatttttta tctgctttat agtgaaatca 11881 tagactttat tttctccttt tctatacatg tcttgtctct tactgaattt taatgaatac 11941 tttattaaaa aaactagtgt gctaatattt tcatatatta atttcattta caaaaaaatg 12001 tgcatgggtt tatcccttta atggacttaa tttttagcaa ggatagaatt atgaggctcc 12061 cacctagctc aatcggtaga gcatgagact tttaataagg ataggatcaa ctcaatagaa 12121 gagaactttg ccaagatgat tttattgctt tcacatctcc ttaaaaatat aatcctttct 12181 gttttcattt aacttttcct acatttaatg ttgcctcctc agctggattt taaaatttta 12241 aaccctttga tggcagtaaa aggttatatt ttatagtcct aagtccccgt tgtcaaagga 12301 tgcaagaatc tggtaggctg attcacacta aagtactgag gacctgatag gatgcctttt 12361 ccaaagatat ttgctttttt aatgaagaaa tccttttaat ctgaagttgt gactgtctgg 12421 ggttaaatct gaaactctgg catgtgcatg tacagatatt ttcagaagga atgattattg 12481 atatacttaa gactgatcat gaacatagtc tgcagcagtt ttgaagtgat tctttttgtg 12541 ttattttcta cttctaatgc atgcaaggca gtctttttgt atttatgtat gtccaaagca 12601 cttctgtcta attcttcact caaatcagaa agacctcttc ctatatattt catattagat 12661 atataaattt ttactaggag cacaaatctg aatgcatttt attttataaa tgtattttat 12721 cggagacttc tggtcaagat ggcagcataa agacattttt atcttcttct ctaaaaagct 12781 cgttccaaag taacaatgtc acgaaacaga aatatctttg aagaaagtcg gaagcaactg 12841 aaatcctaaa tcataataaa tgaagacaca aagtaggagg aacaatggca gataacttat 12901 cagagaggag aatctgcttg ttggggaggg gatgttggtg gaagtacctt gtagaaccct 12961 
aaaaagcctt gggaattaga agttccagta ccttgaaaag cagaagtgag gcaagaactg 13021 aaaacagaga agtcatttga aaatctgtat aagaagcagt gagatcccta ggcctcctct 13081 cccttccagc cagcagttag acagctaccc ctcctaatgt catacaaaac tagtgtgtct 13141 ctggacacac tggaccagag aggattctat ctgcctgcgg atactggagg gtaggggttg 13201 ggtttccata ctgaaacagg aaagtctggg tcctgagact ccctttccca cgctctcttt 13261 tcctcattct ccacctccag ctccctggca gccaggcaag agacagaaga gccttctctg 13321 ggattcctga tgcatataga aaaaagatct gaagatactg tcatttggca agtcccggtg 13381 aaaaagccat gtaatttccc tcttgtgaca cctgtcagtc acaagcccta cccctcaaat 13441 aattttctat gaattcatca ctaagtatga atactgagcc agggatcaat agacttttga 13501 ggaaagtgcc aagaattgaa aaatgataga gatgaaaact cctaaaccat ggaaaaaata 13561 gaaactcatg gaatagagaa aatgccacaa aaactaccaa aaaaacaaaa gccctaaatt 13621 cagtattttc agagaaataa gaatatactg catcaatgaa acaagaataa catgccattt 13681 aaaaggcaca agccaggcac agtggctcat gcctgataaa gtgagaccct gtctctaccc 13741 aaaaaaaaaa aaaattaaaa ggaacaatta agaactcata gatatgcgcg tgcacacaca 13801 cacacacaca cacacacata cacacacaga gatgaatggg tgggtgggta gatggatata 13861 ggtaggtagg taggtaggta ggtagataga cagaagacat tagaagataa aaaaactctt 13921 ccagaaattt caacaaaatt aatcaactca ggagctctaa catgctgtta ataggagttt 13981 cagaaagagc acagaaaaca ctggggggga aatgatcaaa tacataagtc aagaaaattt 14041 cctagaacag attgaaaact accatactgg gtccagcgta atgcataaaa agacacaaca 14101 aggccattat tatgaaattt aagaaaacca gtaataaaag atcctaatat cttccagagg 14161 gggcaaaaaa gcaggttatg tataaagaat acaaaatcag aatggaatag gacctcccaa 14221 aagcagcatt gggacctgga aggcaataaa gcaatggctt tcaaaattca gaaagaaaat 14281 aattttccat ctagaattgt atacttagcc agtccagcaa tcaaatatga agcattataa 14341 agacgtattt ttgccatgca cactttccaa aaaattttac ttctcataca ttgcttagaa 14401 gccactggag gatgtgctct accaaaagaa agatgaggat acaggaaact gggtgtctat 14461 cagaagagaa agacaaaagg aagttcagca tgaacgtaag ggaaagttta tgattacagc 14521 tgtgcagggg gcccatacag accagtccag cccacgcaga ccaaaccaga aagatacagg 14581 ttccagaaag atttcccgag agagaaaaat agaactaata tattatctga 
gaggctcaga 14641 cctacagaaa attgtattga gagattttac agaaatagtg gaggatttga gattaattag 14701 atttttaaag aaactaagga aaatgaagta ttaaactctg agagacaagg tacaggaaag 14761 aaactgccat tataggacat atttggtttg gtaatgaaca atatttatat ggttgtaata 14821 ctgaaaataa tgaccttgga tttaattaaa acttgagatt ggagctggaa aagaaaagag 14881 atataaatga gctaaattta tcatctgcta aaatagaaaa ttaatactaa aacctaaatt 14941 atgataaatg aagaaataac atatttagaa gaaatatttg ttagatattt tagaaatata 15001 tgtttagaag aaatacatat atttagaaat aaggagataa attctagatg aaatagctaa 15061 atttgttgaa agtgattgtt cctgggaatg agatgtggag agggtagaaa aaataagaaa 15121 ctattcattt taaattatag acctttagca cattttgact tttagctatt tgcatgtaat 15181 accttaataa aaataaaatt cgttttgttt ataagtggaa aattaattga tatatttacc 15241 tcactgataa aacattttag gaggtcttta acattttggt gataatattt tctatttgtt 15301 ttccaattac attaaatgta tgcggcctac caacagcgga cttgtgttct ttttcagata 15361 acacttgtta tttgggccgt gattcgcatt ggaccaaatg gctgtgatag tatggagttt 15421 catgaaagtg gcctgcttcg atttaagcaa gtatctgaca tgggagtgat ccaccctctt 15481 tataaaagca cagtaggagg aaggcgaaat gaaaatttgg tcatcactgg caacaaccag 15541 cctgtaagtt gccacattaa ctgaaagaaa tgcaaacagg agaggggcat tagagaaaaa 15601 tactttgcaa aaacctagtt gattattgga aagagatatt ttcccttaga aaaattagta 15661 tatttataat atatctataa atgtatgaat atagtgaata cagtttaaaa catccataaa 15721 tattaatatt taataatact attgctttta tcatctgtta aaattagatg atttcttaat 15781 atttgagatt ttgttcaatg gtagcatttt ctatgacaga tgtaatttgt atgagaatac 15841 ataggttact cttttctcct ccatatgagg aagccattaa gaaagcactt tcccaggtga 15901 gaatactctg tagttttggg ggctcttgaa gagaatatgc caatctagtc agtggcatcc 15961 catgccaaga gttgaaaaca tagttttcaa ctgaataaaa cctcaaggag gtattcttac 16021 caccacccct gcctatctta ggccttgggc catgttcctc catgactcat gcatttttgc 16081 agagctaatg agcaaataaa tgtataaacc ttcattctaa agaactcttc ttccaagtgg 16141 atggagggag aacagacaga gttaaacgta tttgaagtta gtctgttatt ggggttttta 16201 aatttctgta atttttgtaa tttttaaaat atattgttca ggaattttgt ttgcagtctt 16261 ctttgaaaat atattcacaa taaactgttt 
tttttgaaga ttgtttttca gcaagggaca 16321 acaaagctca gtgtagaaaa caacaaaact tctattacaa gtgacatcgg catgcagttt 16381 tttgacccga ggactcaaaa tatcttattc agcacagact atgaaactca tgagtttcat 16441 ttgccaagtg gagtgaaaag tttgaatgtt caaaaggcat ctactgaaag ggtatgattt 16501 attggctttg ttttactaat gggagagaat tattctctag agttataaag tgggaagaga 16561 gttatactgt tgttgttttt tttttgagac ggagtttcgc tctgtcgccc aggctggagt 16621 gcagtggcgc gatctcgact cactgcaact ccgcctccca gattcacgcc attctcctgc 16681 ctcagcctcc cgtgtagctg ggactacagg cgcgcgccac catgcccggc taagttttgt 16741 atttttagta gagacggggt ttcaccgtgt tagtcaggat ggtctcgatc tcctgacctc 16801 gtgatccgcc cgtctcggcc tcccaaagtg ctgggattac aggcgtgagc caccgctccc 16861 ggctgagagt tatactgttt taaagtagta taaaagaata tattccagaa caccggaaag 16921 gattaattct agatttgtaa ttaactagct tgtgtgatca ttatcaagtc atctttatct 16981 caatttccac atctctaaaa cattaataga aacatttgac tttcatctct catgactgaa 17041 gcttctattt ctctatctct gataacaatt ctcacattaa tttagaaact ctttttagta 17101 cttgtttttt aaattttcag attaccagca atgctaccag tgatttaaat ataaaagttg 17161 atgggcgtgc tattgtgcgt ggaaatgaag gtgtattcat tatgggcaaa accattgaat 17221 ttcacatggg tggtaatatg gagttaaagg cggtgagtat tcttttaaga gcttaagaga 17281 attattaggt tcttgggtac ataaatccat atgtggaata gtatacaatt attgaaaatg 17341 atataattgt ctattttttg acatggaaag atgttcaaag tgggttgtga agtaaaagaa 17401 agttacaaaa caatttgtga aattcattat ttttattttt cagaacatat gtatgtttat 17461 aaaaacctag cagggtgtgc cagaatgtca acaatgttta gctgtggata gcaagataac 17521 agattatttg ttttattttt atttctacta gcctgtattt tctgtttctt tctctctttt 17581 ttttttggat aatgatcact tgtattgttt tagaaggcaa aaaaaaaaac cttctaaaag 17641 agaattctgg aagatcaaag taaatattca tgccttggag ttgttgtttt taattaaaga 17701 tttttcacat ccctccctga gtttcatatc caatgatgtg aaaaaaaaca attagcaaat 17761 ttataaaatt attttattta gatattttta ttaaaaatgt atttttaatc tattattaat 17821 tgtaaaggaa agaggttttt gtctttatcg tggctgttga aagtgaattt tctttgtgct 17881 taacagatta aatctgaact ttaacaggta tggtatttca tgtgccataa gtccaaaaac 17941 tttttaatgt 
ctcacaattc tggtcaaagt atacttattc aaaatgcctt tgggtacctg 18001 caatatgcta tttaattaaa gcgaattaaa tttgtcttac caagtattac atgacaaaag 18061 tgaacttaaa cgtagggact tatcattctt tcttattaaa aaaaatttca atttcatagt 18121 tcctgggaaa aatacttttt aaaagtagtt gttttacttt tataagcagt ttttccccca 18181 gaacacataa tcccatctct aaataataaa gatggacata aaatgtaatt cttgccagaa 18241 actcagttgc ctacagagtt aatttgtatg tatattgtgg tgtaagaagc cattcttcta 18301 agaacactgt tgaagaatag ccacagtaca actgctgttc tcagcagtag aaagtcacag 18361 aagcaagaga tttgtttaga gaccattagc aacttaccaa accttttatt tttttctacc 18421 tgccacttct tgtcccattc taccttgtag ttaatatgac aaatatttaa atactttgaa 18481 atatacatat acacatatca gtttctctac tgttaaagga aaataaccta gtattaaaaa 18541 gataattagc tttataactt cgttaatatc agtggttagt tctaaagtgt ataagccaat 18601 cattagaaat aattttatca attaaactgc ttttttgaaa aagctgttta agaagcaatt 18661 taggccgggc gcgatgactt acgcctgtaa tctcagcact ttaggaagct aaggcgggtg 18721 gatcacaagg tcaggaattc gagaccagcc tggccaacat ggcgaaaccc cgtctctact 18781 aaaaatacaa aaattacctg ggcgtggtgg cgggcgcctg taatcccagc tacccgggag 18841 gctgaggcag gagaatcgct tgaaaccagg aggcagagat tgcagtgaac ccagatcgcg 18901 ccactgcact ccagcctggg agacagagca aaactccgtc tcaaaaaaaa aaaaaaaaaa 18961 aaaaaaaacc aacaacaaaa aacaatttaa agaaagtaaa attaaatgac cacaagatgg 19021 cagcattgga acaaattgga aaactagtca cttaagaatt actgtctatt ccatatacta 19081 ccactgcata atcagttaac catctccatt atttataacc aattttctct ctttgcttta 19141 aataccctct gtaagccgat gcctcccaaa tttggaattg tgctttaacc tctcctctga 19201 actaagagtc acatatctag ctgcctgctt aacattatcc attttgatat ctgataggca 19261 tctcacattt aacaattgaa aagttagatt ctccatcccc acaagcatgc atttctttca 19321 gtattcttgt accattaaat gacacccatc agtccatagc aaagccttct ggctctacct 19381 tcagaagata cagagtcttg ctgtttctca ccacctcggc catcaccacc ctgatgtaaa 19441 ctgccatcat tccctgcctg gatggttgga acagcctcct ggctgatctt cttacttcca 19501 agtattcact cccatagtct gttttctacc caacagccag agtggccatt ttaaaaagat 19561 caagtcattt ctgtattcag aatcctccaa atgcttcccg tataatttgg aataaaatct 
19621 aaagtctatt ccctgcctac atgtctgatc ctcagttatt tctttgactc atctcctact 19681 ggttgttccc ttcaccctcc gtgcttcttc cacactggcc tcatagctgt ccctggaatc 19741 caggcacatc ttcacctcgg ggcctttgca gacactgtta cctctgcttt gagtaccctt 19801 ccccagttac ctagacagct tcctccctca cttcctctag tttctgttgg aatgtgacct 19861 tatcaaacca caggtggtcc ttcctgagca ccctgtgtaa aactcagtcc ccatctgctc 19921 actctctact tatcctgctt tatttctcct catagcattt atcccacctg atgtcttatg 19981 tgttttactg ttcatttttt ttattgtctc cccctacccc attaaaacat caactcaata 20041 agaacttggg ctttgcctgt tttacttact gctgtattcc ccagcaccta gaatagtgtc 20101 tggcatacag caggcactca gtaattattt gttgagtgaa tgaaaagaag aaaggaggaa 20161 agggagaaag caagagagtc ctaagagtca ctaaaaagaa ggattttaag gaactacaat 20221 aagagaatgt ttctggaaca gtgggcgtga agaacctcca gactagagtg cctggtcttc 20281 aaattatatg gtttgaatct tcatttgcag tgttgcaaaa acattggaaa ttaaaattaa 20341 ttccctctag gaggaaaagt cttttttttt tttttttttt ttttttgaga cagtgtctcg 20401 ctcagtcacc caggctggag tgcagcggtg tgatctctgc tcactgcaag tccgcctccc 20461 gggttcacgc cattctcctt cctcagcctc ccaagtagct gggactacag gtgcccacca 20521 ccacgtcccg ctaagttttt gtatttttag tagagacgag gtttcaccgt gttagccagg 20581 atggtctcga tctcctgaac ttgtgatccg cccacctcgg cctcccaaag tgctgggatt 20641 acaggcatga gccaccgcgc ttggccagga aaagtctttt tgtactcatg atcacaaact 20701 taccaatcag tacctttgtt atagttctct taggacttac aaatgaaaat gatctgaaac 20761 tttatcaaca tcaattttat tatgaactta ggaattctga tgtaatttta actaacaata 20821 gaggaattat gttgcagatt tatttacata tgttatggat atatgaagtg aagaagtcac 20881 tcattgaatc ctgatatttt gttttttatt gttgctgttg ttgttttgtt tgtgacttta 20941 ctctttctta ctgcttgctt ttctcattga tatttcagga agttttgttt actgactttg 21001 ttctgtaccc cctcatggag gtaaataaat catattacta ttctcctaat aggaaaacag 21061 tatcatccta aatggatctg tgatggtcag caccacccgc ctacccagtt cctccagtgg 21121 agaccagttg ggtagtggtg actgggtacg ctacaagctc tgcatgtgtg ctgatgggac 21181 gctcttcaag gtgcaagtaa ccagccagaa catgggctgc caaatctcag acaacccctg 21241 tggaaacact cattaaaaga accccagagg tcaccaacat 
gtttatatct tgacttgact 21301 tttttatgca tgcaaatcat tgtttttaca gagtttgtga taactcataa ttattttaat 21361 ggcagagcac tgctgtatct gttttatggt ctacatagtt aaaatcttct cagagagcct 21421 aaattctaat acattttatt aatttatact aatcttcata tttactgttc tctaaaataa 21481 ttatgagaag caaataaaat caaaagtcat gtttaaagac gtgtttttaa aattccacta 21541 tcccttttct aaaggttaaa ggtctgaagc agctgtttag attcactgta agtaaacttt 21601 ggtaactcta atggggatag acccacttaa gatatttaaa aaggtatggc atcagcgttt 21661 catgctctgc cttttagctt ctaaaaggaa agatgcagat ttctagtgca ttaagcctga 21721 gccatattct cacatgcaag tgaagtcatt aaagaacttt acatatgtga gatagataca 21781 atggttcctt agttttgcac tgggaagaaa atattttgta aaagaatgtt tatttgaaat 21841 aatgataact atcaattgtt cacaatgtgg tggaaattaa aacaccatct cagctttaac 21901 ttttaaataa taatgataac tatctttatt gagcatcttc tacatcctag gcattgtcct 21961 aggcattgca tgtttatatc cccaattctc accacaaccc tgcaagtagg tggtattatc 22021 caagttttac ccattaagaa actgaagatc agagaagtta agaaacttgc tcaacatcat 22081 atagtaagta gcagagttgg gattggaatt caggcatgac tcaaacctgg atgtacttga 22141 ttccaaatgc catgttgttt tcactctctg cactgacttt ttaattattt aaaactctag 22201 aaagatgaac aaaggttaat ttaaacttac ctaagaagat gagaatcaaa caaaacagat 22261 atgcttactc tagttaaaaa gaaaataaat ctcatgtcag acccagaaag gaccaatcac 22321 tgtccgattg taagctatgt tgggccaatt ccaaaatatt atacgatgga gaggtcaaat 22381 ttacctactt ctgagttacc tcagtttccc aacaatggac cttggcacac tggagtaaca 22441 atacataaca gagttgccaa gatatttata ccctcagcac tcggggcaac acagtggaaa 22501 gtggggaggc catagaccca aacaagttct ttgggccagg catggcctag taagtacacc 22561 atgcctcgaa aataagtcca gaagcactgg actaaagagt gctaatgcag gaaataatac 22621 acataatttt taggtaagga taataattta tctctgctcc taatattact atcccattgt 22681 aattatttat aaccctcaag ccagttgatt tttaatatat ttgattggaa aagaactctc 22741 tggtattatt aagactcaca cagaatcagg gacagggccc ccaaaggagt ttgctgtaaa 22801 ataggcagta gagttgtggc acgggcccaa ccctgcattc aagtgtaaca gcattctgtc 22861 agggtcactt tgaattgtgt acataagaaa accaatacaa aaaacaattt gtattcaata 22921 ttgtcacatt tctctctggt 
agaaaaatca aataccttag agattatgaa gtcattaatt 22981 tatactgaaa ttggattgac ttactaccta acactgagcg ctgtttttaa atgaaaagaa 23041 tgagatttat aaccacttga gtgttattgc agtgatattt gaactcattt gaatatattc 23101 agtatcattt aatgtctgaa ttcagaaaaa aatgccgaaa tttttattca gatggtccat 23161 aaattaaatt gcatattcat tacttatctg ctctatttag atttatttta aaagtttatt 23221 taagtaaata tttttataaa aaccagaaaa cactgtatta caaaatatta tttattaaat 23281 gtagttcagg aaataatcta tttttactct tttttgggaa atacttgtgt ttttgataca 23341 tctccatgaa gtgcttttga gaggagaggc tattttgatg tttttataca actgaaggtt 23401 aacagccata gcattttatg atactttaca ggtagtcctg gctttttccc tgaaacataa 23461 gcttggaaaa tctattagaa agcagaacag ggcaaactct ccattttact tatggctttt 23521 ctaattttta attaattaat ttattttttt gagacagagt cttgttctgt cgccaggctg 23581 gagtgcagtg gcacaatctc ggctctcggc tcactgcaac ctctgcctcc caggttcaag 23641 cgattctcct gcctcagcct cccgagtacc tgggactaca ggcatgtgcc acagttcccg 23701 gctaattttt gtatttttag tagagacggg gtttcaccat gttggccagg atggtctcaa 23761 tctcttgacc tcgtgatctg cctgccttgg cctcccaaag tgctgggatt acaggtgtga 23821 gccaccacgc ctggccggct tatttttatc cacagtaaat cttcagcaac tcattgtctc 23881 caccagatag tatttttctg taaatgaaat gctgacttcg cctcttcctg ctgtatgctc 23941 atccctgcac tgagcacaga tatgacaagc agtagccatg ggggaggtgg gtgacaaaga 24001 taggaccccg ggagggggcg caggtacatg ctagtttcaa ttaccacagt attctagaga 24061 cgggttgcaa tgacaagggg ggcaaatgaa atcaatgcaa gatttcttaa taatgggcag 24121 acagaaaaat gtaaaaccac acaaaacgga ctgctgataa tattttaaaa tatacttatt 24181 tgtcttcttt ttgcattgtg aaaaaaacaa aataaatttt gtgtgataat tttgatgatg 24241 aaaggtggaa gttctaccta gatttgaatg agtgtttttt taagggaatg agaatgtcat 24301 ggtgctaaac ctgacaaata agagatcatt gaaatgctga aaattttaac agtcttctta 24361 aaagtattga gggggcaaaa attaccaatt atggtataca aaaataagcc tataaatgtg 24421 tttcacattg ctaacttgag tttcagttga ttcagtttgt aataactagt aatgagcttc 24481 tgtttacaat aaaaattctg taaattgttt gctgttaatt ttttaagggt tatatattaa 24541 ctgttttcta cttagtcatg cgaacgtgat gaatttaaat atggcttgag aatgggacgc 24601 
tgtcagtatt tatgtaatct ctctggactc ctagaactca aagttagaaa gttgtacatt 24661 ccaagttttg aatgccctcc tctctgtgca ttgcagaagg cctctctcca gtggtgcttc 24721 cctggtgtcc taggggaaag tgcacactgt caccatggca ggaggccaaa gccggagatg 24781 aaagggtgag ggctggccac tgggcagcct tgatggcagg ttgggtggag cctcaaaggt 24841 tcaggtctta aatccctacc tggcttgatc cagatatcca cccatttgat aaccacttag 24901 gatacgaagg tatgtcttag aggcattatt actttccagg tcttgtcctt acccgaaata 24961 gcctctgttg cttttttcct tcagttttgc caccctttct tttatgtata aatgaactct 25021 tcatttcctt tctataatac tttgtttttt tgtgctgcct tcttcctatc tttcacttgc 25081 tgactcacaa atgtacacac ccagtatttc taactgcttc ctcctacttc agcttgacta 25141 tttccatttt gaaatttaca ttttgcctaa ttgtttccag ttttctaagc aactttttct 25201 ggagttctca gtggttcctt acctaataca tataaattta taaatagtat gattttttta 25261 acaaattaat acctttcact ttaagatata agattccttg tctagctgtt ttgaactcaa 25321 acctaacatg tataaatctc ttcagcttaa aaagaaccag ggcaatttta attcatgtcc 25381 ccttaagaaa aagtattttt caaccatgtc tttctcttca gcactacaaa gtaatgtacc 25441 tgtgaatggt gactatattt ttttgttgct gattttgtta cattagcttg gtgcaccatt 25501 gtatatataa tgatctcagc tttgaacgta gaaaaaatat ctttgttcct gggcaaatca 25561 attcaatttc tagtattgca taagaagcac actgtaaaaa taaatcactc ctccatgctg 25621 taggctacct ttgctgttca tggccatgtg tacctgaatt taaatgccct tgaaaatatg 25681 ggaggattga ccttgtatga aacaaaagtt ttaaagttac attgtaaatt tcattcactt 25741 cactccatgc attcatttac acacaaacaa cataacccag tgacagactg cagactatta 25801 aaaatgtaaa acattggata gatagataga tagataatgt tccttttttt tttgggagtt 25861 cagaacagat acaaagcaaa acaatagcaa cgctgaatgt tactaatgtg tatcaattcc 25921 ttcagtgttt actgaaagat ttctgtaaga ataaagacaa atttggaggc taaagcagga 25981 aaaatgacca ggccataaga tgtgagatga atctggaagg atacacaaga ctgtggctgg 26041 ccgggtgtgg ttgctcacgc ctgtaatccc agcactttgg gaggccaagg tgggcgggtc 26101 acctgaggtc aggagttcga gaccagcctg accaacatgg agaaacctca tctctactaa 26161 aaatacaaat tagtgagttg tagtggcaca tgcctgtaat cccagctact cgggaggctg 26221 aggcaggaga attgcttgaa cccgggaggc ggaggttgtg gtgagctgac 
atggcgctct 26281 tgcactccag cctgggtaac aagagcgaaa ctccatctta agactgtggc taagaggaaa 26341 aggcattgaa cttggaactg ggaatgaact gaaagtgaat ttcagtatgc gatatgcata 26401 gatcattctt tgcagaagcc acatcttgta gtggctccac atgaacttat caagctaggt 26461 cattaagttt acctagataa cacaggcttt gaataccaac ctgagggatt tggactatag 26521 gcaataagga gccctttgag gatttcaaat gaaggaataa tataggaaat acgttattgt 26581 aggaaaataa tcctttggag tcacctagga tggatcacaa agaaggaaag tcaagaggca 26641 gaggcaagaa agtcagttaa gccattttag tatttcagac tgcagtgatg aaaactaaac 26701 tgagtgccat tagaacagaa aggaagggac aatttcttgg aaactcatat actcaataat 26761 gtgattattt gtacacacag tggcttttaa gtggaattta gtttaaaaca aatcacaaaa 26821 attacccatt tttaaactta tttttgtgat acattcagcc attgtaattt gtacacactt 26881 ctttttaaaa actttgaatt ttttctcaaa aatgtttggt acaaaagata ctcaatgttt 26941 ctgtatcgta ttacatcttc atggcttgga gccagaacat gtatttgata acacattatg 27001 ttgtatgtat ctaaatgatt tcataatgat ttgaaaagtc acacaagcca cacagcaaaa 27061 cttgtatttc ttctagtcat gaaaagtaag agcctctttt gtgagttaaa ctttttactc 27121 atagagttat ggagggtcat ggtttagtgt aactggttgt catttatgta ataaactaca 27181 tggttgatta ctatgttgta gataagcata atagactttg ctctgtatta atggcctgca 27241 aagaaacagg gaagtcttta atcttggtaa ggtttcattc atcccaaagc acactgtgga 27301 gtgaaaacta ctggaaaagc agatgcagag gtgaggccta actgaatatg tattttctga 27361 aaaatcaggg gagtcctttc tttaaaatgt gacataaatg taattttata ctatggctat 27421 ttcattttgg tagctacatt ttcatgcccc agtaagaaat caaacattcg cttaaattgg 27481 caaccaagac aagataactt tccgaaatct taaattgaaa attaaatttg ctgcttattg 27541 ctgttagttt ttatatatta ttgtgaataa gtcaattcat tttccttttt ccaggcatta 27601 tgaaaaacct ctatttcaga gtcattacca tagttatagg tctttatttt actggaataa 27661 tgacaaatgc atcaagaaaa agcaatattt tattcaattc tgaatgccaa tggaatgaat 27721 atattctgac aaattgttct tttaccggaa agtgtgatat acctgtggac atatcacaga 27781 cagcagccac tgtggatgta agtttcaatt tctttagagt tctcttacag tctcacacga 27841 aaaaagaaga gtggaaaata aaacatctgg acctcagtaa caatctcata tcaaaaataa 27901 ccttaagccc ttttgcatat ttacatgctt 
tggaagtgtt aaacctcagc aacaatgcca 27961 tccactccct ctcattggat ctactcagtc ctaagtcctc atgggtgaaa cgccacagaa 28021 gcagcttcag aaacaggttt ccattgctga aggtgctcat tcttcaaaga aataaactca 28081 gtgacactcc caagggtgag tacaatttta actaggcaga gaaacaatgc ttacgtggag 28141 ttgtttcatg ttcataagaa ttccacattt gttttcaggt tcaagggaca gctgaaagta 28201 aagcaatttg atcagagcac atataataat gtttcttagt ggcattggaa gttatagtgt 28261 gtggaattta cagctaatga aatttagtct cccattagta tatctgacat taatgttatg 28321 tgcggttatc gagacactag gaagtattta agtgctaaat aataatgagt gattgtaagt 28381 tacctacttt aattatcatg tattagttaa tatagaaaag cactgagcaa taactagttc 28441 aaatacattt aatatgtaat acaattcaaa ttaaatgttg ccatattata atatttttct 28501 tctttgagta cattttgcct cagtctatct agttgtcctc caaacttaca gcagaagtat 28561 cctaatatta tatattggta tgtaaacctt ctgattatca agtcaaaatc agaattcctt 28621 atcctagttt ctaacagggt actccagata gcctccaatt atcataaatt ctcatctgac 28681 attcaatagt ctatatgttg gagttcctta acggatacac caatgttttc tttggaaaat 28741 atatcaagtt cataaagcaa aacttccttt gcccttttta tattccagag tgctagacaa 28801 ttaatttcct ttccttgttt gttttcaggc ttgttcactc tttgctaagc acctccttta 28861 tataagacat tattcaaaaa aatgtataag gttattgttc tctttctcca ttccaaattg 28921 tgcatattta gtttttctta ttatccagtt ctgacatttc atgtttttat cttcatgctc 28981 tttctccagc tttgctttaa gcatcatttt ggcaataata gcctagcagg tataggaatt 29041 attacagtca ccatgtcagc agaattgttt gagaagggtg ggcgtcttat ctccctgacc 29101 ttctccataa aagtagtatg tcatattttc agtgtaaatg aattgcattt tattttattt 29161 ttattgtatt atttttatta caagaagaat acctgttact tatagaacat ttaacaactg 29221 aagaaaaatt taaagggaaa aattaaagtc attcatgtta tatatgttct tcaagtttat 29281 tatttatgcc tatttgtacc tttttctttc aaaattggat catactgtcc atatttctaa 29341 tttgtatttt catatcatag aatattttaa taactttata taattaagca atctcacaaa 29401 tatttcaaat agttggtttt tatatatgcg tatatcataa ttggtttaat caagcactag 29461 gttatttcca ggttttgttt tttgatggac gtcattatag ctacatatat ccatataata 29521 taacaaaaat ttttaactgg tatggtagat gtgatctcaa tccacaaact caagtgtact 29581 agagatataa 
ccttctgttg tctgtagtaa cataatttcc gattctgagc ccccattagg 29641 gggagttgta gggatactta acatcagtct ttcatcagct ttcctctgcc tttgcctgga 29701 tttgtaccca gcttgagaat cgtatactct aacaaggcat caaatcgtat actctaacaa 29761 ggcatcgaat cgtatactct aacaaggcat cgaggggtga gcattcctta cccctgctac 29821 ccagataata tttcttttca gcctcttcag tttggagaaa atttcttctc cctttttttt 29881 ttcttcaaaa ccctctgtct ggaaatcact ttcttctaaa gggttagggt tttggaaaac 29941 aaaagccaga aagacttgta gactcagcta ctttctgcct tgccatatgc cttccctggg 30001 acaatgaacg tggagcctac tttctgtgtg aatggagaga aagagcaaga ggaaaatata 30061 gggaagtaag agattaacat gtcaaaatat taccccacca caagagtaaa taggaagtgc 30121 tgagtttctc tccttaaggc agatgaattc ccctgagcta actgggggtt aggactaata 30181 cagttccact taaagaccaa atatccctat aaatgcagat acagatccat cttctttttt 30241 tgagttaagc ttttgcctcc caaaagatca agaaaaatgt tcaacttgaa gaggaggggg 30301 aaaaaagaag ggaagtgagc aaattctgta tctgaaattc tattaaaatt ctagcaaatg 30361 aagacataga gagatgaatg aaaagagata gacggtggaa gttcagtgta aggcatgaat 30421 ctgattggat ataattcaga gtccttttat ccatagtatt ttgtgattta aattattttc 30481 ccataatcat catattagtt ttcactaggt tgagtctcct ctaacaattt ttcctatttt 30541 atgattctgt gactatggct gagttgctct gactttgaaa tcccattttg aggggaactg 30601 ctggtgagag agatgctgcc tagggcatgc cctatttttg gcctatttct ttgttcttca 30661 attcttcact ctaagtctat ttataggtga catctggata atataagatt tactcattgg 30721 atcattgtaa ggataaaatg aaaaaatgta tctcaaagac ctaacttaac atggtatttg 30781 gcataataaa ctgtcacaat ataaatatat gaaaactact tcagaatctt gtctgaagat 30841 ttaccttgca aatactagtt tttattattc tcttggcata aattgagaat aacagtatat 30901 ttagaaacaa ataattaatt tttgtctaaa tctcaaggtt ttttttgttt tttgtttttt 30961 gagacagagt ttcactcttg ttgcccatgc tggagaaaaa tggcacgatc tcggctcccc 31021 gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcccag gtagctgggt 31081 ttacaggcat gtgccaccag cccggctaat tttgtatttt tagtagagac ggggctcctc 31141 catgttggtc aggctggtct cgaactcctg atctcaggtg atccgcaggc ctcaccctcc 31201 caaagtgctg ggattagagg cgtgagccac cgtgcccggc ctcaagttat tttttaaaat 
31261 gttcaacaac tattgaagta gtagcccact ataattcatg agcagataat tatttgtgtc 31321 tgactatata ggcacttgca ttcatctaaa ataataataa attagagaga tctcagaaag 31381 caacaatcaa gcaacctttg taactatgta ccacatattt taaaatgggg actatttcta 31441 gaaccaatga aaagttgctt tcactcagga aagcccattt gtgtaagggt gagatatcac 31501 acgaactaac cctgtcttca gagtcagttg tggtgtcacc aataagcttc atgttgggaa 31561 aatgcccaaa agaggtgctg gccaatgcaa tttataaaac agaagccagc catgaattta 31621 attttaatga tgcttctacc ttcctcccat atttttggaa tacagcattc ataattgaaa 31681 aactagttta aaaacatgaa aacaatggga tcagtctctg gttaatctgt cctaccttca 31741 tcaagcagta tttttggagt gctcacagag taactggaaa tgcagcagga acaacatctt 31801 gttgggattc cagttaggtt gatagaccaa atgtagcagc ctcacattaa ccaactaagt 31861 tctagaacct aagaggtaaa tgtgttagaa ataagataaa ataataccta acatttgaaa 31921 ctgggggaat taaaaaaaat acacagtgta ttataaaaaa ctaaagtata ttgtgttcct 31981 ttattcatag aaaattaaag atttcatcct gattgctact ggattatgga attatcaata 32041 agcaatgtac tcaaaactaa ttcaggttct aaagaaacaa aaagtatata agacactaaa 32101 ctaagaaggc aaggcctttg gctaacctac ataatttgac cacaataaga tctgacctaa 32161 tgaatcgaat atcttaccca ggaccattct agactctatt ccaattctta attgctcttt 32221 ggactaacag ccataatccc tgaatattgt tttatgttat caggttatca cacttgggag 32281 agcacttaag ttcatagtac agaaggagta tttactaagt gtaatgtctg agtatccttt 32341 ctgtggcctt tttttctgag agacaggtca tttcttggaa ataataaaac tgcctctggc 32401 cacttaaatg caaaggctaa taaaattcct gcctagtgtg tcctttgtaa tctttagaaa 32461 gcagttcaat aaataattcc ctttccacag agtacctcta tattttcctc ccagcattca 32521 actctcctct cttgaagtat aaaatgtacc cctgatgtcc aagccccagc cgtgaatgca 32581 ccatacagcc tcttttcact catttgacta ccctctgaac attggcttac caagcaaaaa 32641 tattaaaaga atacattttc ccattgagaa gtgcagtcaa gtaccatgca agagttttct 32701 taaatttccc aactccatcc tgagagacgc tatacaaacc aggtctgtac cagttccctt 32761 aaaaaaataa aatcttaatt atctaaaatt gaaagtaaag gaaaagtttt cattcaggag 32821 gtgtaaactg aacttcctgg tctttattct catctccaga agaagtttac cttcctcttg 32881 gaaaatctca gcttgatatt atgaggggag ttaaataaga 
ggattctcta ataactgcat 32941 tcaaaatcaa aatcaccctc tccccacatg cttgggtatg ttcgggtccc actcttttta 33001 tttactcctt ttttttttct gtttttagat ggagcctcac tcttgtcacc cagactggag 33061 tgcagtggca tgatctcagc tcactacaac ctccacctcc cgggttcaag agattctcct 33121 gccttagcct tccatgtagc tgggattaca ggtgcctgcc accacaccca gctaattttt 33181 ttgtattttt agtaaagatg gggtttcacc atattggcca ggctggtttt gaactcctga 33241 cctcaagtga tccgcctgcc tcagcctccc aaagtgctgg gattacaggc atcagtcacc 33301 gttctcagcc tgtttacttg taattctttt acacatctta tattacacta tttacttgtt 33361 tgttgacctt aaacaagtca cttaacctaa aaaagcttcc tggtccttat ctttaactga 33421 gaataagtat atgtataggg taggtataag gtttagagac agtgtgcatc aagactataa 33481 cacatagagt gatgatacat gtaaaacatc tctcatagtg cctggcacac agcaggtgtt 33541 caaaagcatt agttcccctt tagtgccact tgatcgtcct gagtctttaa ttctatattt 33601 cctgccctgg tctttgctgt aacggtaact tccttgattt caggcacaat agataattaa 33661 aggggattct cattctctgt cttcagatgt ttagagtatt caccaacagt ctgggtgcct 33721 ctgttcatat ttatataggt ctgcataatg ttggaagcct ctgctcttcc tttttcagag 33781 cactttttga tctgtcctct gactctctga accacactga ctcatgaaac catcaccact 33841 atcaagataa ttaaaacaaa aagtatataa gatactcaac taagaaggca gagcctttgg 33901 ctcacctaca taatttgacc acaataagat ctgacctaat gaatcaaata tcttacccag 33961 gaccattcta gactctattc caattcttat tgctctttgg actaaaagcc ataatccctg 34021 aatattgttt tatgttatca ggttatcata cttgggagag cacttaagaa gttcatagta 34081 cagagggagt atttactaag tgtaatgtct gtcaccgcaa aagcttcttc atcccccttt 34141 gtaatctgtc ttttccaccc atcccttctc cccaacccag gaaaccgctg atctcctttc 34201 tgtttcatta tagattcact tgcatttcta taattttata taaaaggaat catacaatat 34261 gtacttttat ttttggtcta acgtcttttt tcagcataat tattttgaaa cttatccatg 34321 tcgttgtata tattaaaagt tcattcctac taaaaacaga attacaattc gacctagcaa 34381 gcccattagt gggtgtatat gcaaaggaaa aataaatcat tctacaaaaa agacacatgc 34441 actcatatgt tcatcacagc accaatcaca atagcaacta catggaatca acctaggtgc 34501 ccatcaacag tggattggat agagaaaatg tggtacatat acatcatggt ctcctacaca 34561 gctctaaaca agaatgaaat 
catgtcgttt gcagcaacat ggatgtagct gcaggccatt 34621 atcctaagca aagtaatata ttctcgctta taagtgggaa ctaaatcttg ggttcaaaca 34681 gacgtaaaga tgggaacaat agacactggg gaccactaga aggaggagga gtgtgggagg 34741 agggagacta aagctgaaaa actacctatt gagtactatg ctcactacct gggtgatggg 34801 atcatttgta ctgcaaaccc tcagcatcag gcaatatacc tgtgtaacaa acctgcacat 34861 gtactcgcaa ataaaagttg acattatttt aaaaaataag acaaaaagaa aaaattttgt 34921 gaactgtggg gaaaggttca ttccttttca ttgctgtgta gaattccatc atgtggattc 34981 atttatctgt tgatggacac ttagattgtc tttttggcta ttaaaaataa agctgctaga 35041 tctttcataa atgctctatg ccaggttgat gaaattctct ctattcccag tttgcccaag 35101 accttttctg gatctactga gatgatcata tgggtttctt ttttaatctg ttaatttgga 35161 gaattgtgta gattttctaa tattaaatca actttgcatt cgtgggatga attccatttt 35221 gtcatgattt attgtctttt ttatatattt tggattcaat ttgctaaaat tttaagaaat 35281 tttacaccta tgttcatgag gaatattggg ttattatttt cttttgctat attgttgatg 35341 tctagttttg gtattagagt aatgtgggcc tcataaaatg aattgggaga tatttattcc 35401 tcttccgtct tctggaagtg tttgtttaga attggctttg tttcttccat aaatgtttca 35461 tagaattccc cagtgaagct atctgaacag gagttttttc atgggaaggt atttaattac 35521 aagtttactt tcattaatag atgcagggtt attcacatta tctattgctt tttgagtgag 35581 tattggtagt ttattagcat aaagctgttc acaatgtttc attatccttt tagtatctat 35641 agactctgta gtaatgccac ctctcttatt attgacactg ataatttata tcttctcttt 35701 tctgatcagc ctggctagaa atctttttaa atcaatttta ttgttgttaa agaaccagct 35761 tttggtttta ttgatttttc tctgttattt ttacatttcc tactttattt cttctctgat 35821 ctttatgatt tcctttcttc tgcttactta caatttcatt cattcttgca agtttcttaa 35881 aggggaagct gaagtcaatg acttgaggca tttcttcttt aatagacatt tagtgcaact 35941 tgataataaa ttagtaattt gcctctaaaa atggctttag ctacgtccca caaattttga 36001 tatgtgatgt tttcattttt agttgagtca aaattatttc tattttgatt tttcttggtc 36061 ccatgggtta tttagaagtg cgttatgaat atcgaataat ctgtggatat cctagaattc 36121 tagctttggg atttccagta tatatcttaa gagccctat // LOCUS HSZNGP1 9823 bp DNA PRI 07-JUN-1994 DEFINITION H.sapiens gene for ZN-alpha-2-glycoprotein. 
ACCESSION X69953 NID g467670 KEYWORDS class I major histocompatibility complex; HLA antigen; plasma protein; Zn-alpha-2-glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9823) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (04-JAN-1993) C. Lopez-Otin, Uni. de Oviedo, Dep. de Biologia Funcional, Area de Bioquimica, Facultad de Medicina, C/Julian Claveria S/N, 33006-Oviedo, SPAIN REMARK revised by [3] MAT REFERENCE 2 (bases 5221 to 5308) AUTHORS Freije,J.P., Fueyo,A., Uria,J.A., Velasco,G., Sanchez,L.M., Lopez-Boado,Y.S. and Lopez-Otin,C. TITLE Human Zn-alpha 2-glycoprotein: complete genomic sequence, identification of a related pseudogene and relationship to class I major histocompatibility complex genes JOURNAL Genomics 18 (3), 575-587 (1993) MEDLINE 94140356 REFERENCE 3 (bases 1 to 9823) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) C. Lopez-Otin, Uni. de Oviedo, Dep. 
de Biologia Funcional, Area de Bioquimica, Facultad de Medicina, C/Julian Claveria S/N, 33006-Oviedo, SPAIN FEATURES Location/Qualifiers source 1..9823 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /cell_line="blood leukocyte" /clone_lib="EMBL 3" /clone="ZN9, ZN2" protein_bind 192..199 /bound_moiety="AP2 transcription factor" protein_bind 246..253 /bound_moiety="Oct transcription factor" gene 321..9464 /gene="ZNGP1" TATA_signal 321..325 /gene="ZNGP1" exon <442..517 /gene="ZNGP1" /number=1 mRNA join(<442..517,4457..4717,8037..8312,9181..>9464) /gene="ZNGP1" CDS join(442..517,4457..4717,8037..8312,9181..9464) /gene="ZNGP1" /codon_start=1 /product="ZN-alpha-2-glycoprotein" /db_xref="PID:g467671" /translation="MVRMVPVLLSLLLLLGPAVPQENQDGRYSLTYIYTGLSKHVEDV PAFQALGSLNDLQFFRYNSKDRKSQPMGLWRQVEGMEDWKQDSQLQKAREDIFMETLK DIVEYYNDSNGSHVLQGRFGCEIENNRSSGAFWKYYYDGKDYIEFNKEIPAWVPFDPA AQITKQKWEAEPVYVQRAKAYLEEECPATLRKYLKYSKNILDRQDPPSVVVTSHQAPG EKKKLKCLAYDFYPGKIDVHWTRAGEVQEPELRGDVLHNGNGTYQSWVVVAVPPQDTA PYSCHVQHSSLAQPLVVPWEAS" intron 518..4456 /gene="ZNGP1" /number=1 misc_feature 1677..1825 /gene="ZNGP1" /note="MER-14 sequence" repeat_unit complement(1855..2156) /gene="ZNGP1" /rpt_family="Alu sequence" repeat_unit complement(2201..2471) /gene="ZNGP1" /rpt_family="Alu sequence" repeat_unit complement(2537..2828) /gene="ZNGP1" /rpt_family="Alu sequence" misc_feature 2839..2998 /gene="ZNGP1" /note="MER-12 sequence" repeat_unit complement(3333..3627) /gene="ZNGP1" /rpt_family="Alu sequence" repeat_unit complement(3638..3932) /gene="ZNGP1" /rpt_family="Alu sequence" exon 4457..4717 /gene="ZNGP1" /number=2 intron 4718..8036 /gene="ZNGP1" /number=2 repeat_unit 5921..6228 /gene="ZNGP1" /rpt_family="Alu sequence" misc_feature 6332..6425 /gene="ZNGP1" /note="MIR sequence" repeat_unit complement(6871..7038) /gene="ZNGP1" /rpt_family="Alu sequence" repeat_unit complement(7039..7338) /gene="ZNGP1" /rpt_family="Alu sequence" repeat_unit complement(7339..7473) /gene="ZNGP1" /rpt_family="Alu 
sequence" repeat_unit complement(7475..7866) /gene="ZNGP1" /rpt_family="Alu sequence" exon 8037..8312 /gene="ZNGP1" /number=3 intron 8313..9180 /gene="ZNGP1" /number=3 exon 9181..>9464 /gene="ZNGP1" /number=4 polyA_signal 9709..9714 BASE COUNT 2538 a 2341 c 2421 g 2523 t ORIGIN 1 cagctggggt ctacccaggt ccatgtcttg gacatgttga gagtttttct ggaaggcagg 61 gatacagtgt ggtccaaaaa cacacaaatg cccctactgg cccaggggtt gtcacaatag 121 actggaaggg tgacacatcc caggcgcttg ccacccatca cacgcacctc ctacccactg 181 gcatccttcc accccaggca cacacaaagc ctcagtccag agatcaactc tggactcagc 241 tctgaatttg catatcctgt gtgtagattc attcttcata acctctgccc agcctagctt 301 gtgtatcatt tttttttctc tattagggga ggagcccgtc ctggcactcc cattggcctg 361 tagattcacc tcccctgggc agggccccag gacccaggat aatatctgtg cctcctgccc 421 agaaccctcc aagcagacac aatggtaaga atggtgcctg tcctgctgtc tctgctgctg 481 cttctgggtc ctgctgtccc ccaggagaac caagatggtg agtggggaaa gcaagggatg 541 ggtgctggag aggactggaa ggaggtgagg aacaggacat gtggctggga gacaggctgg 601 atgcagctgg gataccctgg catacggcag gaatgggtgc ccaaggctgt caactccctc 661 agctcacaca cttccaggag cattcaggga gcctctgcgc tggcccgaaa taagaccttc 721 aggaatctga atctaaaacc cctagtttac agtgaaaaca aagactccaa agaccaagcg 781 acctgcttgg ggtagacagt caggacggag taggaaccat atgcctggag ctgcttctgc 841 tcctgttcct tccctccttc cgatggctgg gtacacctgc ctgacgctga ggaaaagaga 901 gagcagcccc aaggggaaag tgggaaggca ggttggctgg agggatggtg ctagaaggaa 961 acccgtgccc aaatcccaca ctcagacacc actgcagtgg gtctggaagg cgagtggctg 1021 gaagagaaga gagtgggagc tccgggagat caagagtcac tcctaggata agggaaggag 1081 gctgtttgtg gcatgagaat gtgcaggata aagacatgga agcgaatggc ttctcagttg 1141 tgtgagttta aaattcatga catttacaaa ttgtcagaaa aggtgttata tgtttgttat 1201 ataacaatca ctttggaatg ttaatctgat tctgtgccaa aatctgaatt actcagggtt 1261 ctccagagaa acagaactaa taggtggtac acatatacat atatatgtac gtacacatac 1321 atacatacac tgtatacaca tggatacaca cacacatagg aagagattta catatatgta 1381 tacaaaagag agagagagta gagatttatt ttaagaaatt gactcacact attgggagga 1441 gtaacaagtc ctaaatcttc 
agagccggcc agcaggctgg agacccaggg aagagttgat 1501 gtcttagtct tgattccaag ggcagactgt aggcagaatt ctttcctctt taggggacat 1561 ctgaggcttt ttctcttaag gccttcaact gattggatga agcccaccac tatggagagt 1621 aatccacttt actcaaggtc tactgatttt tttgtaaatt aaaaaaaaaa ctgtgggtgc 1681 atagtatgtg tatatattta tggggtacat gagaggtttt gattcaggca tgcaatgtga 1741 aataatcaca tcatcaaaaa tgaggtatcc atcccttcaa gcttttatcg tttgtgttac 1801 agacaatcca attatacttt tttggttatt ttagttttta aaagtatttg attatttatt 1861 tatttattta tttttgagac agagtctcac tctgtcaccc aggcaggagt gcagtggcat 1921 gatctcggct cactgcaacc tccgcctccc aggttcaagc aattttcctg cctcagtctc 1981 ctgagtagct aggactacag gcacctgcca ccacacctgg ctaatttttt tgtattttta 2041 gtagagacgg gtttcatcat gttggccagg ctagtcttga tatcctgacc tcgtgatctg 2101 cccgccttgg tctcccaaag tgccgggatt acaggtgtca gcaactgcgc ctggcctctc 2161 ttttggttat ttaaaagtgt acaattaaat tatgattatt attattattt ttgagatgga 2221 ttcttgttct gtcacccagg ctggagtgca gtggcgtgat cttggcttac tgcaaacctc 2281 cgcctgttgg gttcaagcaa ttatcttgcc tcgggtgtac actgccacac acggctaact 2341 tatgtatttt taatagagat agggtttcac catgttggct agactggtct tgacctcttg 2401 acctcaagtg atccactcac ttcagcctcc cagagtgctg gaattacagg cacgagccac 2461 cacacctggc cccagttaaa ttattattga ctatagtcac cctgttgtgc tatcaaatag 2521 taggtcttat tcattcttct tttttttttt ttttttgtga cagagttgcc caggctggaa 2581 tgcagtggtg caatcttggc tcactgcaac ctctgcctcc cgggcttaag cgattctcct 2641 gcctcagcct tctgagtcgc tgggactaca ggtgtgtgcc accacgcccg gctaatttat 2701 gtatttttag tagagatggg gtttcaccat gttggccagg ctggtttcga actcctgacc 2761 tcaagtgacc cacctgcctc agcttcccaa agtgttggaa ttacaggcat gagccaccac 2821 acctggcccc agttaaatta ttattcactg gagtcacttt gttgtgctat caaatagttt 2881 tctaactatt ttttttgtac ccattaacca ccctcccaat ttccccccaa ccctgccact 2941 acccttccca gcctttggta accatccttc tactctctat gtccatgaat tcaattgtag 3001 ggtctactga tttaaaggct aatcacattt agacactcag gagcaagaat aattttagta 3061 attgaactag gattctgcca tatgacctcc aacatcatta gcacctgtgt aaattgtatc 3121 ataaaataat tatggaacta ttatggaaat 
gtccctctct cccagatccc accttgtacc 3181 aaaatgcaag gtacaacccc gggaattctg agctccatcc tagtcttacc ctgtgctaat 3241 tcagtctggg tcatttcttg aattttctgg taaattctcc tttctaccct ttctaactat 3301 atgtatttgt caggttaagc tagaagtgtt aatttttttt ttttttgaga tggagccttg 3361 ctttgtcacc taggctgaag tgcagtggca tgatctcagc tcactgcaag ctccgcctcc 3421 cgggttcatg ccattctcct gcctcagcct cctgagtagc tgggactaca ggcacccgcc 3481 accatgcttg gctaattttt tgaattctta gtagagacgg ggtttcacca tgttagccag 3541 gatggtctcg atctcctgac ctcgtgatcc acccgcctcg gccccctaaa gtgctgggat 3601 tacaggcgtg agccactgag cccggacgaa atgttaattt gttttttttg agacggagtc 3661 tcactctgtc atccaagctg gagtgcagtg gcatgatctt ggcttgttgc aacctctgcc 3721 tctctggttc aagtgatttt cctgcctcag cctccagcat gactgggatt acaggcccgc 3781 accaccatgc ccagctaatt tttgtatttt ttaatagaga tggggtttca ccatgttggc 3841 caggctggtc ttcaactcct gatctcaagt aatctgcctg ccttggcctc ccaaagtcct 3901 gggattacag gcatgagcca cggagcccag cctagaaatg ttaatttcta acgcatgtca 3961 gattccatgc acactgggca aggttccatt cctccatggg gtgactcagg gatccaggcc 4021 aattgcatat tgagactctt tcatattatc ctgtggcctt caaagtcgtc acctctaggg 4081 atgagaaaca aaagggacaa gccagctggt agggtcttgg acaagaagaa agacatcact 4141 tctgctcaca ttctcttttg acaaaactca gtcacatggt cccaatatat cttcgaggtg 4201 gctgagtaat gttatcttcc tatgtgtcaa gcagaggaaa taatgtagtg aagacacagg 4261 atggtctctg aaatatcatc tcaggcatga aagtagagca tattcacttg agtgagcctc 4321 cagtggtgtg aagttgatgg caggagaaag agctggggaa gaaaaggcca gtggcaggtc 4381 tcccctccta gccctatgca gccccacagt gggacccttg catggacctc aaccatcaga 4441 atcttttctt ttgcaggtcg ttactctctg acctatatct acactgggct gtccaagcat 4501 gttgaagacg tccccgcgtt tcaggccctt ggctcactca atgacctcca gttctttaga 4561 tacaacagta aagacaggaa gtctcagccc atgggactct ggagacaggt ggaaggaatg 4621 gaggattgga agcaggacag ccaacttcag aaggccaggg aggacatctt tatggagacc 4681 ctgaaagaca ttgtggagta ttacaacgac agtaacggtc agtgaataac agaccacagg 4741 ggtggaaggt ctaacccaag aggcagcccc cccagtgtga gtggcaaggg atcagcagga 4801 tggaaatagt cccaatccca ggggaagaac aggagacaca 
gcagaaacac agacatgtcc 4861 gcatcccacc caccccacag cacaggtgct ccccgcttcc ccatcaattg ccccatcctc 4921 atcccaggcc tcaggtcaca caggaagtga tggcagagtc acttcctatc caggcaccta 4981 tgacctctca cctccacacc ccacccatcg gaggctgata cccccgtgag aaggcatcag 5041 actcacccct gtccagggag gttgcctgga gagtgagcca ctctcaaagt cactcagacc 5101 tgggctcacc tggtggttct gccagtccta gctgttgaca gtgaaacgtt cccaaaatat 5161 ctggttgaaa tctgcaaaca ttggagcact gagacctacc tccaaacaag tctgtaatat 5221 ttaactatgt ctgttctatg aaggatgtca cagtctgtcc tgatctccct tgcagctcca 5281 tcacctagca cagggtacag ccaatattgg ctcaattgaa atttgtggaa tccacagaga 5341 aaagcacccg gcacacaccg tagcccatgc tgggggctca ggaagtgctg gattcaaaac 5401 tgtgggctgt tagagttcct tggagcccta aagttcctcc ttaccatacg atgcagaccc 5461 aggaagggcc acctgcgcta tggtcagagg agctggtggc agagcccgtg cagagatggt 5521 ccctgtgccc ccggcccagt gctctttctc ctaaaccaca ctgccagccc caaggcagcc 5581 aacctcaggt ctggtgaact gctggtgtta aattatcata gagtgggtgt caaaagatgg 5641 gctactaagt acaaaaatgc ccaaggtgct acatgggatc tgaagatttt caaaaggagg 5701 caagaaagag ataggcagat gtttcaagga tgtggggtgg gggaggtctt ggtaaggaaa 5761 atggcccagg ctgtgtgtca gcaataggag aggagggggc acaggtgatc agaaaagaca 5821 ctgggggaag cattgatgga caggaataga aatggcaaag tggataatta agaggaagga 5881 ggatgaggag atgaacacag ggtattagaa aataatagaa ggcagggctt ggtggctcac 5941 tcttgtaatc ccagcacttt gggaggctga ggcaggcaga tcacctaagg tcaggagttc 6001 gagaccagcc cggccaacat ggtgaaaccc tgtctctact aataatacaa aaatagcctg 6061 gcatggtggc acacgtctgt ggtcccagct actcaggagg ctgaggcagg agaattgctt 6121 gaacccagga ggcagaggtt acagtgagcc aaaatcctac cattgcacta cagcctgggt 6181 gacaagagtg aaacgttgtc taaaaacaaa aaacaaaaaa caaaaaaagg aaataatagt 6241 agctgacatt tactgagcac ttactttgtg ccaggcccat ctatgagcat atataatgct 6301 cagaatagcc ccctaaaaca gtgctcttgg cattgccatt tcagaggtga ggaaatagag 6361 gcacagggag ttgagtggct ccagttcagg caacacacca ggtgggggtg gggggctggg 6421 gagagacctg ggacgtgagc ccagacagct tgagagcttt cagagtctat gccaacagca 6481 ccaaccagtg ctgggtaaac acctgctttt atcatcagaa caaagaggct 
gtgtcccctg 6541 ccctatgagg tccatttctg agagttgtgg ctaatgggca agaaggttgg ggctttagag 6601 atttgggata aagatatcaa acaccagaaa ggtagaaaga agtgatcaga ttagggttac 6661 ttaggtgatg atatgaactc ttcctagaac tgagagaaaa agagagcctt cctttactca 6721 tatgaaatca caaataattt ctatccaatt tggaagtaca ctttggtgta gttgtgacag 6781 cttcctcagg actcagcata aattcaaaca aataattgtc cttagaagag atgctataga 6841 agagatagaa atatattcat attctgtagc tttttttttt ttgagatgga gttttgctct 6901 tgtcacccaa gctggagtgc agtgatgcaa tctcagctca ctgcaaactt tgcctcctgg 6961 gttcaaggga ttctcctgcc tcagcctccc gataactggg actacaggct acaggcatgt 7021 gtcactactc ctggttaatt tttttttttt ttttttaaga ctgagtcttg ctctgtcttt 7081 caggctgatg tacaatggct ccatctcggc tcactacaac ttctgtcccc caggttcaag 7141 cgattctcct gcctcagcct catgagtagc tgggattaca ggcatgtgcc agcacaccca 7201 gcaaattttt gtatttttag tagagatgag gtcttaccat gttggccagg ctggtctcaa 7261 actcctgacc tcaggtgatc ctttggcctc agcctcccta actgctggga ttacaggcat 7321 gagccactgc gtccagccta attttatatt tttggtagag atggggtttc accatattgg 7381 ccaggctggt ctcgaactca tgacctaagg tgatccatcc tcctcagcct ctcaaagtgc 7441 tgggattaca agtgtgagcc actgggcctg gtgctttttt tttttttttt tttttttttt 7501 ttttgagata gggtctcact ctgtcaccca ggctgaaatg cagtagtgtg attttggctc 7561 attgcagcct tgacttccca ggctgaagtg atcctcccac ctcagcctcc tgagtagctg 7621 gggctacagg catgcaccac catgctgcgc taatttttat attttttgta gtggtgggat 7681 ttcgccatat caccctggct ggtctggaac ccctgggctc aagcgatcca ctcgcttcag 7741 cttctcaaag tgctgggatt acaggcatga gccacagcgc ccaggctgta gctctcttaa 7801 ggaggaacat atctcatctg agacaaacct gaaatgccaa accaaactga gttagcccct 7861 ctctgtctgt tgtatatatt ggagtaataa cctatttgtc ttgataaagg gattgcatgc 7921 ttgaattgca aaaaccttta tttcttttgg gttgcccaat gtgcaagact aagagttatt 7981 ttgataaatt tctcaccagg ctgactgtct ctctgtgggg tcgggggagt tttcagggtc 8041 tcacgtattg cagggaaggt ttggttgtga gatcgagaat aacagaagca gcggagcatt 8101 ctggaaatat tactatgatg gaaaggacta cattgaattc aacaaagaaa tcccagcctg 8161 ggtccccttc gacccagcag cccagataac caagcagaag tgggaggcag aaccagtcta 
8221 cgtgcagcgg gccaaggctt acctggagga ggagtgccct gcgactctgc ggaaatacct 8281 gaaatacagc aaaaatatcc tggaccggca aggtactcac tgcttcctgc tccccagtac 8341 tgagcccaga ataaaagacg atctcaggct aggagctcag gcaacatctt agtccggtct 8401 catctgttcc tggatgtccc tcagaccccc agctttcatc ttttaggatt tattccttcc 8461 ctgggataat ataatttgtg gtccaaaaag aacatcatca aaatttcagg cagaatgggc 8521 caggaaggcc attctttctt gatgagtgtc cccaaatcat ctccaattaa cagacaagga 8581 gcttgaggtt agggaggtga gggtaacact gtctgtaaga ggcagagctg ggactcaaat 8641 tccagatttc agattccaaa tcccatcgtt ttttatctct acaatgatgc ctcccatctg 8701 ggtggtggag agaagggagg cgtgtaaaat gtcagcccca gaaggacaag agcaagccag 8761 tgtgagcgga attgatggct gcaagctgag acttggattg gagacgtagt gagactcagg 8821 attgtgcagt gctgcaggga agtggttgct ggatagaggc atgggctgaa ccaagcagct 8881 ggactgagac tgggggacag aactccaaag cccactgaga tgtgggaaaa catggagaag 8941 cacacggagc attcacaact tattgccgtc agagtcaata catgggtgag gtggggattg 9001 ggcaagaggg aaagcgtcag ccttccctga tattctggaa agtctcccgg ggctgggggt 9061 gggcaggtac agagcttcga gctctgctga tcgctgacat ccaggggtgg gggtaggaag 9121 agacctgggc cgggagaagt ccacctcaag cctgcagtgt cacactctat ccctccacag 9181 atcctccctc tgtggtggtc accagccacc aggccccagg agaaaagaag aaactgaagt 9241 gcctggccta cgacttctac ccagggaaaa ttgatgtgca ctggactcgg gccggcgagg 9301 tgcaggagcc tgagttacgg ggagatgttc ttcacaatgg aaatggcact taccagtcct 9361 gggtggtggt ggcagtgccc ccgcaggaca cagcccccta ctcctgccac gtgcagcaca 9421 gcagcctggc ccagcccctc gtggtgccct gggaggccag ctaggaagca agggttggag 9481 gcaatgtggg atctcagacc cagtagctgc ccttcctgcc tgatgtggga gctgaaccac 9541 agaaatcaca gtcaatggat ccacaaggcc tgaggagcag tgtgggggga cagacaggag 9601 gtggatttgg agaccgaaga ctgggatgcc tgtcttgagt agacttggac ccaaaaaatc 9661 atctcacctt gagcccaccc ccaccccatt gtctaatctg tagaagctaa taaataatca 9721 tccctccttg cctagcataa cagagaatcc tttttttaac ggtgatgcgc tgtagaaatg 9781 tgactagatt ttctcattgg ttctgccctc aagcactgaa ttc // LOCUS HUM17BHSDI 4194 bp DNA PRI 15-OCT-1996 DEFINITION Human 17 beta-hydroxysteroid 
dehydrogenase (17BHSDI) gene, exons 1-5, complete cds. ACCESSION M29037 NID g1129086 KEYWORDS 17-beta-hydroxysteroid dehydrogenase; estradiol 17-beta-dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4194) AUTHORS Luu-The,V., Labrie,C., Simard,J., Lachance,Y., Zhao,H.F., Couet,J., Leblanc,G. and Labrie,F. TITLE Structure of two in tandem human 17 beta-hydroxysteroid dehydrogenase genes JOURNAL Mol. Endocrinol. 4 (2), 268-275 (1990) MEDLINE 90231340 COMMENT Draft entry and computer-readable sequence for [Unpublished (1989) 2705 Laurier Boulevard, Ste-Foy, Quebec (Que)G1V 4G2] kindly submitted by F.Labrie 14-OCT-1989. FEATURES Location/Qualifiers source 1..4194 /organism="Homo sapiens" /note="vector lambda EMBL3" /db_xref="taxon:9606" gene 1770..3323 /gene="17BHSDI" exon 1770..1875 /gene="17BHSDI" /number=1 CDS join(1779..1875,1969..2136,2283..2453,2621..2714, 3209..3323) /gene="17BHSDI" /codon_start=1 /product="17 beta-hydroxysteroid dehydrogenase" /db_xref="PID:g1129087" /translation="MACTVVLITGCSSGIGLHLAIHLALDPSQSFKVYATLRDLKTQG RLWEAARALACPQGSLERLQLDVRNSSSLAAARERVTEGRVDVLVCNTGLGLLGPLGE DAVASVLDVNVVGTVRVLQDFLPDMKRPGSGRVLVTGSMGGLMGLPFNDVYCASKFAL EGLCESLAVLLLPFGVHVSLIECGPVDTAFMQKVLGGPDQVMDRTNTRTFRLLH" exon 1969..2136 /gene="17BHSDI" /number=2 exon 2283..2453 /gene="17BHSDI" /number=3 exon 2621..2714 /gene="17BHSDI" /number=4 exon 3209..>3323 /gene="17BHSDI" /number=5 BASE COUNT 890 a 1216 c 1215 g 873 t ORIGIN 1 ggatccacct gcctcggcct tccaaagtgc tgggattaca ggagtgagcc accgtgcccg 61 gccatgtctc tctttttaac actaatgtta ccctgacctt tgaacgtaga atgcccttct 121 gttgcaggaa aacctctttt caaaccatgt ttgtcctttg ctggcatgcc acagcaacag 181 tcaccaacac agaagacttc tgtgaccaaa tatttggagg attttcccca cacacaccaa 241 gcagcagaca tcagctgggt gtcctccaat tcagttccaa tgtaatcaac cagagacagc 301 atcagatccc acagggttag ggtgcagatc catgagacca ccccctcctt cccaacggtt 361 
acaagtcctg atccctggaa cttctgacta actggcttca agttggagtt cccatgaccc 421 ccttcccctc tttggagtca actcatttgc gacagtgacc cacgaaacac agggaaaccc 481 ttattatgtt tattgcttta ttacagagga aaaaaatttt tttctttctt ttttgagaca 541 gggtctcact ctgtcatcca gaatgactgc agtggcagga tctggctccg tcacccaggc 601 tggagtgcag tggcatgatc tcggctcact acagcctcca tcccccccaa accccacgcc 661 tcagcgcccc accccgcaag tggctgggac tctaagcata caccaccaca cccagctaat 721 ttttttgtag tttttgcaaa gacggggtct cattctgttg ccctggctgg tcttgaactc 781 ctgagctcca gcctggcctc ccaaagtgct gggattacag gcatcagcca ccgtgcccaa 841 cctcaaagga tattttaaag gatagaaata aacagccata tgaagagata cagacagggc 901 ggtctggaag ggcccagagc aggagcttct atctccatag agttggggtt acatcaccct 961 ccaggcacat ggatgagttc ttcaccttct gtcagcctcc acacgttcag ctctcagaag 1021 cttcccgaac cctgtccttt gggcctttta tggagaactc cattggctgt ccatgactga 1081 agcatggaca actgtgataa tgtgattggg caaaaagggt ctgatctaag cccagcaagg 1141 ccagtccaga ttctttgggc ctttgtgcag cattcctttc tccagggtat ggggcaagga 1201 cccactctgg aatgaggatc ctacaaccca caatcagatt agagtcctgc cttgggcagc 1261 tgaaaagagg acaggagaag gtcaaagaga ggaaaggctg ttttttgagg cctgaggcgc 1321 cccaacatga caacgaaaga ctgtaaccat ggtcatgtga gttatgagct aggaaccctg 1381 gacgaaacca acacatatac aatcatctcc cacctcccaa cacctttact ttcacagcct 1441 ctgcagcaaa ctgcggtcac tataatcgct cctgtggcac agaggcatac ccaggggaat 1501 ctgcccaggg gccactctgt gcccacatgg gaacccacat ctgcttgtaa agcctcccct 1561 ccctctgacc agaaacgagg acagtttgtt gttccaagca gtgggctcat gtctgttttg 1621 gctcagaaca gggtggggag agcgggccag ggacccgcag gagggcttat ccttgagatt 1681 gcgtgggaga cacaacaagg ggtgggggcc cgcaggcggg gcggggcgaa gcaggtgata 1741 tccagcccag agccccagcc tctccccaca gtctcaccat ggcctgcacc gtggtgctca 1801 tcaccggctg ttcctcaggt attggcctgc acttggcaat acatctggct ttggacccat 1861 cccagagctt caaaggtata gataggtagg gacagggagg gagagaaggg aaaagccctt 1921 ggaggccaga agagaagtca gatcttcctc ctctcccaaa acctccagtg tatgccacgt 1981 tgagggacct gaaaacacag ggccggctgt gggaggcggc ccgggccctg gcatgccctc 2041 agggatccct ggagaggttg 
cagctggatg taaggaactc aagctccctg gccgctgccc 2101 gggaacgcgt gaccgagggc cgtgtggatg tgctgggtga gcctcctgga agcatatggg 2161 ctcctaggag ccttctccgc cctgcgttga aaccaacatg tccccaggcc cctggagcat 2221 gaggggacag gccgtgctga gggtgatgct gaggcgggct ggttgggcct ctgtccccgc 2281 agtgtgtaac acaggcctgg gcctgctggg gccgctgggg gaggacgccg tggcctctgt 2341 actggatgtg aatgtagtag gtactgtgcg ggtgctgcag gacttcctgc cagacatgaa 2401 gcggccgggt tcgggacgcg tgttggtgac tgggagcatg ggaggattga tgggtgagtg 2461 gcagggaccg ggccccgagc tccagattct ttgtgtgcag agctgagcct tgaaggcagg 2521 ctccttaggg ggtggggtgc aatcagcttg gaggggcact gcctgccggg ggatgacccc 2581 ctggccgctg cgcctcagga acctcatctt cccacccaag ggctgccttt caatgacgtt 2641 tattgcgcca gcaagttcgc gctcgaaggc ttatgcgaga gtctggcggt tctgctgctg 2701 ccctttgggg tccagtgagt caacaccccc gtctcctcaa ccctcttaac tctgacctag 2761 agatgccgag caccctgtcc tgcggaagcc gctctggtct ctgcccggct tacatttgct 2821 gcgtgccagg cacttaggct ggagcattgg cacgcattgt gccacttgct gacctggctg 2881 ctgaagtgtt ggtattgtta tggggaagct ccagccaaga gaggttaggt gactggccca 2941 aggtcatgca gcggccgggg atcccgccag gttcgaattc tgacaccagg gctacctggc 3001 agcctcagat gggtttggga gggctgtcga gcaataaccc gctattcaaa tgttctggtt 3061 atccccagcg ctctttccac cttcgggacg cagcggtgct gttctgggtc gtggccaggg 3121 ccggggtcgg ggccggtgct ggggcaggag atgggacttg ggcctgggtc gctccgtccc 3181 tgcccacttg cggctctcgg gccagcagcg tgagcctgat cgagtgcggc cctgtggaca 3241 ccgccttcat gcagaaggtg ttgggcggtc ccgaccaggt gatggaccgc acgaacaccc 3301 ggaccttccg cctcttgcac taatacctcc accacagcaa ggagatctac cgcgaggagg 3361 cgcagcaccc tgaggaggtg gtgaaggtga gcggggggcg ggactccggg agcgggggcg 3421 gtgcgtcgtc ctgcgcgcag ccggggccag agctcctctc ccgccgccgc aggtcttcct 3481 caccgctatg cgcgccccga agccgaccct gcgctacttc acaaccaggc gcttcctgca 3541 ccagctgctg atgcgcctgg acgacccctt tggcttcgac tacgccgccg ccatgcaccg 3601 ggacgtgttc gccgacgatc ccgcagaggc cgaggctggg gccggggctg gggcgagcgg 3661 gggcggggcc ggtgggatgg gagaccctga gctcagcgat cctctggccg ccccgcaaga 3721 aaggctccgt cagccactgt ctcccgcgcc 
ctcctttgtc tcctgggcct gtgcggtccc 3781 tggggatggg acggcggtga cggctgtgga tggctaatta agacagatca cgttagcccg 3841 ttatatctgc gcggctaggc gcgatggctg tcgcctataa tcccagagct ttggaaggcc 3901 gaggcaggag gatcgctcca ggccaggagt tccagaccag cctgagcaac atagtgagac 3961 accccatctc taaaataaaa aaattagcac agtggcacca ttccttgagc tcaggagttg 4021 gaggctgcag tgggcatgat cgagctactg ctctccagct tgggcgatag agtgagacac 4081 tgtcaattaa ttaatctaat caaccaacca acccaacaac ccagaaacca aggtccagaa 4141 agaagccagc ccaggatcat gcctcaagtc cacagtaaag cccagacaca gtca // LOCUS HUM6PTS 718 bp DNA PRI 22-JUL-1996 DEFINITION Human gene for 6-pyruvoyl-tetrahydropterin synthase. ACCESSION D25234 NID g624868 KEYWORDS 6-pyruvoyl-tetrahydropterin synthase. SOURCE Homo sapiens blood leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ashida,A., Hatakeyama,K. and Kagamiyama,H. TITLE cDNA cloning, expression in Escherichia coli and purification of human 6-pyruvoyl-tetrahydropterin synthase JOURNAL Biochemical and Biophysical Research Communication 195, 1386-1393 (1993) REFERENCE 2 (bases 1 to 718) AUTHORS Ashida,A., Owada,M. and Hatakeyama,K. TITLE A missense mutation (A to G) of 6-pyruvoyltetrahydropterin synthase in tetrahydrobiopterin-deficient form of hyperphenylalaninemia JOURNAL Genomics 24 (2), 408-410 (1994) MEDLINE 95213043 REFERENCE 3 (bases 1 to 718) AUTHORS Hatakeyama,K. TITLE Direct Submission JOURNAL Submitted (12-NOV-1993) to the DDBJ/EMBL/GenBank databases. 
Kazuyuki Hatakeyama, Osaka Medical College, Department of Medical Chemistry; 2-7 Daigakumachi, Takatsuki, Osaka 569, Japan (E-mail:med004@art.osaka-med.ac.jp, Tel:0726-83-1221(ex.2453), Fax:0726-82-6851) FEATURES Location/Qualifiers source 1..718 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /tissue_type="blood" exon <1..320 /number=1 5'UTR <1..6 CDS join(7..320,519..642) /codon_start=1 /product="6-pyruvoyl-tetrahydropterin synthase" /db_xref="PID:d1005497" /db_xref="PID:g624869" /translation="MSTEGGGRRCQAQVSRRISFSASHRLYSKFLSDEENLKLFGKCN NPNGHGHNYKVVVTVHGEIDPATGMVMNLADLKKYMEEAIMQPLDHKNLDMDVPYFAD VVSTTENVAVYIWDNLQKVLPVGVLYKVKVYETDNNIVVYKGE" intron 321..518 exon 519..718 /number=2 3'UTR 643..>718 BASE COUNT 209 a 107 c 163 g 239 t ORIGIN 1 gggaagatga gcacggaagg tggtggccgt cgctgccagg cacaagtgtc ccgccgcatc 61 tccttcagcg cgagccaccg attgtacagt aaatttctaa gtgatgaaga aaacttgaaa 121 ctgtttggga aatgcaacaa tccaaatggc catgggcaca attataaagt tgtggtgaca 181 gtacatggag agattgaccc tgctacggga atggttatga atctggctga tctcaaaaaa 241 tatatggagg aggcgattat gcagcccctt gatcataaga atctggatat ggatgtgcca 301 tactttgcag atgtggtgag gtgggtggca ctgtatcttg ccttatgtgg attgtaaaac 361 aagaattgat ttgaatactt tgattgttgt gtgatttctg aagttttaat ttaatgaaat 421 ctttcgaaac tagaatttct attttctgta aatattaaac atgaaatttt attgtttgca 481 ttttgaattt tttttgtttt tgtttttttt tcttatagca cgactgaaaa tgtagctgtt 541 tatatctggg acaacctcca gaaagttctt cctgtaggag ttctttataa agtaaaagta 601 tacgaaactg acaataatat tgtggtttat aaaggagaat agctattggg gttagcattg 661 cacaaagccc agtttctttc tgtgtttgaa aaagattttg atccccttgg aatattaa // LOCUS HUMA1ATP 12222 bp DNA PRI 08-AUG-1995 DEFINITION Human alpha-1-antitrypsin gene (S variant), complete cds. ACCESSION K02212 NID g177830 KEYWORDS alpha-1 antitrypsin; antitrypsin. SOURCE Human: liver, cDNA to mRNA, clone pAT83; genomic DNA (S variant), clones pAT4.6 and pAT9.6. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12222) AUTHORS Long,G.L., Chandra,T., Woo,S.L., Davie,E.W. and Kurachi,K. TITLE Complete sequence of the cDNA for human alpha 1-antitrypsin and the gene for the S variant JOURNAL Biochemistry 23 (21), 4828-4837 (1984) MEDLINE 85047190 FEATURES Location/Qualifiers source 1..12222 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1952..2001 /gene="A1A" /number=1 gene 1952..12101 /gene="A1A" intron 2002..7311 /gene="A1A" /note="A1A mRNA intron A" exon 7312..7961 /gene="A1A" /number=2 CDS join(7316..7961,9412..9682,10939..11086,11910..12101) /gene="A1A" /codon_start=1 /product="alpha-1-antitrypsin" /db_xref="PID:g177831" /translation="MPSSVSWGILLLAGLCCLVPVSLAEDPQGDAAQKTDTSHHDQDH PTFNKITPNLAEFAFSLYRQLAHQSNSTNIFFSPVSIATAFAMLSLGTKADTHDEILE GLNFNLTEIPEAQIHEGFQELLRTLNQPDSQLQLTTGNGLFLSEGLKLVDKFLEDVKK LYHSEAFTVNFGDTEEAKKQINDYVEKGTQGKIVDLVKELDRDTVFALVNYIFFKGKW ERPFEVKDTEEEDFHVDQVTTVKVPMMKRLGMFNIQHCKKLSSWVLLMKYLGNATAIF FLPDEGKLQHLVNELTHDIITKFLENEDRRSASLHLPKLSITGTYDLKSVLGQLGITK VFSNGADLSGVTEEAPLKLSKAVHKAVLTIDEKGTEAAGAMFLEAIPMSIPPEVKFNK PFVFLMIEQNTKSPLFMGKVVNPTQK" sig_peptide 7316..7387 /gene="A1A" mat_peptide join(7388..7961,9412..9682,10939..11086,11910..12098) /gene="A1A" /product="alpha-1-antitrypsin" intron 7962..9411 /gene="A1A" /number=2 variation 7971 /gene="A1A" /note="c in one cDNA clone; t in another cDNA clone" /replace="t" exon 9412..9682 /gene="A1A" /number=3 allele 9628 /gene="A1A" /note="a in cDNA clone; t in S variant genomic clone" /replace="t" intron 9683..10938 /gene="A1A" /number=3 exon 10939..11086 /gene="A1A" /number=4 intron 11087..11909 /gene="A1A" /number=4 exon 11910..>12101 /gene="A1A" /number=5 variation 11999 /gene="A1A" /note="c in one cDNA clone; g in another cDNA clone" /replace="g" BASE COUNT 2984 a 3125 c 3155 g 2958 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccagg ttggaggggc ggcaacctcc tgccagcctt caggccactc tcctgtgcct 61 gccagaagag acagagcttg aggagagctt gaggagagca ggaaaggtgg aacattgctg 121 ctgctgctca ctcagttcca caggtgggag gaacagcagg gcttagagtg ggggtcattg 181 tgcagatggg aaaacaaagg cccagagagg ggaagaaatg cctaggagct accgagggca 241 ggcgacctca accacagccc agtgctggag ctgtgagtgg atgtagagca gcggaatatc 301 cattcagcca gctcagggga aggacagggg ccctgaagcc aggggatgga gctgcaggga 361 agggagctca gagagaaggg gaggggagtc tgagctcagt ttcccgctgc ctgaaaggag 421 ggtggtacct actcccttca cagggtaact gaatgagaga ctgcctggag gaaagctctt 481 caagtgtggc ccaccccacc ccagtgacac cagcccctga cacgggggag ggagggcagc 541 atcaggaggg gctttctggg cacacccagt acccgtctct gagctttcct tgaactgttg 601 cattttaatc ctcacagcag ctcaacaagg tacataccgt caccatcccc attttacaga 661 tagggaaatt gaggctcgga gcggttaaac aactcacctg aggcctcaca gccagtaagt 721 gggttccctg gtctgaatgt gtgtgctgga ggatcctgtg ggtcactcgc ctggtagagc 781 cccaaggtgg aggcataaat gggactggtg aatgacagaa ggggcaaaaa tgcactcatc 841 cattcactct gcaagtatct acggcacgta cgccagctcc caagcaggtt tgcgggttgc 901 acagcggagc gatgcaatct gatttaggct tttaaaggat tgcaatcaag tgggacccac 961 tagcctcaac cctgtacctc ccctcccctc cacccccagc agtctccaaa ggcctccaac 1021 aaccccagag tgggggccat gtatccaaag aaactccaag ctgtatacgg atcacactgg 1081 ttttccagga gcaaaaacag aaacagcctg aggctggtca aaattgaacc tcctcctgct 1141 ctgagcagcc tagggggcag actaagcaga gggctgtgca gacccacata aagagcctac 1201 tgtgtgccag gcacttcacc cgaggcactt cacaagcatg cttgggaatg aaacttccaa 1261 ctctttggga tgcaggtgaa acagttcctg gttcagagag gtgaagcggc ctgcctgagg 1321 cagcacagct cttctttaca gatgtgcttc cccacctcta ccctgtctca cggcccccca 1381 tgccagcctg acggttgtgt ctgcctcagt catgctccat ttttccatcg ggaccatcaa 1441 gagggtgttt gtgtctaagg ctgactgggt aactttggat gagcggtctc tccgctccga 1501 gcctgtttcc tcatctgtca aacgggctct aacccactct gatctcccag ggcggcagta 1561 agtcttcagc atcaggcatt ttggggtgac tcagtaaatg gtagatcttg ctaccagtgg 1621 aacagccact aaggattctg cagtgagagc agagggccag ctaagtggta ctctcccaga 1681 gactgtctga ctcacgccac 
cccctccacc ttggacacag gacgctgtgg tttctgagcc 1741 aggtacaatg actcctttcg gtaagtgcag tggaagctgt acactgccca ggcaaagcgt 1801 ccgggcagcg taggcgggcg actcagatcc cagccagtgg acttagcccc tgtttgctcc 1861 tccgataact ggggtgacct tggttaatat tcaccagcag cctcccccgt tgcccctctg 1921 gatccactgc ttaaatacgg acgaggacag ggccctgtct cctcagcttc aggcaccacc 1981 actgacctgg gacagtgaat cgtaagtatg cctttcactg cgaggggttc tggagaggct 2041 tccgagctcc ccatggccca ggcaggcagc aggtctgggg caggaggggg gttgtggagt 2101 gggtatccgc ctgctgaggt gcagggcaga tggagaggct gcagctgagc tcctattttc 2161 ataataacag cagccatgag ggttgtgtcc tgtttcccag tcctgcccgg tcccccctcg 2221 gtacctcctg gtggatacac tggttcctgt aagcagaagt ggatgagggt gtctaggtct 2281 gcagtcctgg caccccagga tgggggacac cagccaagat acagcaacag caacaaagcg 2341 cagccatttc tttctgtttg cacagctcct ctgtctgtcg ggggctcctg tctgttgtct 2401 cctataagcc tcaccacctc tcctactgct tgggcatgca tctttctccc cttctataga 2461 tgaggaggtt aaggttcaga gaggggtggg gaggaacgcc ggctcacatt ctccatcccc 2521 tccagatatg accaggaaca gacctgtgcc agcctcagcc ttacatcaaa atgggcctcc 2581 ccatgcaccg tggacctctg ggccctcctg tcccagtgga ggacaggaag ctgtgagggg 2641 cactgtcacc cagggctcaa gctggcattc ctgaataatc gctctgcacc aggccacggc 2701 taagctcagt gcgtgattaa gcctcataac cctccaaggc agttactagt gtgattccca 2761 ttttacagat gaggaagatg gggacagaga ggtgaataac tggccccaaa tcacacacca 2821 tccataattc gggctcaggc acctggctcc agtccccaaa ctcttgaacc tggccctagt 2881 gtcactgttt ctcttgggtc tcaggcgctg gatggggaac aggaaacctg ggctgaactt 2941 gaggcctctc tgatgctcgg tgacttcaga cagttgctca acctctctgt tctcttgggc 3001 aaaacatgat aacctttgac ttctgtcccc tcccctcacc ccacccgacc ttgatctctg 3061 aagtgttgga aggatttaat ttttcctgca ctgagttttg gagacaggtc aaaaagatga 3121 ccaaggccaa ggtggccagt ttcctataga acgcctctaa aagacctgca gcaatagcag 3181 caagaactgg tattctcgag aacttgctgc gcagcaggca cttcttggca ttttatgtgt 3241 atttaatttc acaatagctc tatgacaaag tccacctttc tcatctccag gaaactgagg 3301 ttcagagagg ttaagtaact tgtccaaggt cacacagcta atagcaagtt gacgtggagc 3361 aatctggcct cagagccttt aattttagcc 
acagactgat gctcccctct tcatttagcc 3421 aggctgcctc tgaagttttc tgattcaaga cttctggctt cagctttgta cacagagatg 3481 attcaatgtc aggttttgga gcgaaatctg tttaatccca gacaaaacat ttaggattac 3541 atctcagttt tgtaagcaag tagctctgtg atttttagtg agttatttaa tgctctttgg 3601 ggctcaattt ttctatctat aaaatagggc taataatttg caccttatag ggtaagcttt 3661 gaggacagat tagatgatac ggtgcctgta aaacaccagg tgttagtaag tgtggcaatg 3721 atggtgacgc tgaggctgtg tttgcttagc atagggttag gcagctggca ggcagtaaac 3781 agttggataa tttaatggaa aatttgccaa actcagatgc tgttcactgc tgagcaggag 3841 ccccttcctg ctgaaatggt cctggggagt gcagcaggct ctccgggaag aaatctacca 3901 tctctcgggc aggagctcaa cctgtgtgca ggtacaggga gggcttcctc acctggtgcc 3961 cactcatgca ttacgtcagt tattcctcat ccctgtccaa aggattcttt tctccattgt 4021 acagctatga agctagtgct caaagaagtg aagtcattta ccccaggccc cctgccagta 4081 agtgacaggg cctggtcaca cttgggttta tttattgccc agttcaacag gttgtttgac 4141 cataggcgag attctcttcc ctgcaccctg ccgggttgct cttggtccct tattttatgc 4201 tcctgggtag aaatggtgcg agattaggca gggagtggac gcttccctgt ccctggcccc 4261 gcaaagagtg ctcccacctg ccccgatccc agaaatgtca ccatgaagcc ttcattcttt 4321 tggtttaaag cttggcctca gtgtccgtac accatggggt ccttggccag atggcgactt 4381 tctcctctcc agtcgccctc ccaggcacta gcttttagga gtgcagggtg ctgcctctga 4441 tagaagggcc aggagagagc aggttttgga gacctgatgt tataaggaac agcttgggag 4501 gcataatgaa cccaacatga tgcttgagac caatgtcaca gcccaattct gacattcatc 4561 atctgagatc tgaggacaca gctgtctcag ttcatgatct gagtgctggg aaagccaaga 4621 cttgttccag ctttgtcact gacttgctgt atagcctcaa caaggccctg accctctctg 4681 ggcttcaaac tcttcactgt gaaaggagga aaccagagta ggtgatgtga caccaggaaa 4741 gatggatggg tgtgggggaa tgtgctcctc ccagctgtca ccccctcgcc accctccctg 4801 caccagcctc tccacctcct ttgagcccag aattcccctg tctaggaggg cacctgtctc 4861 gtgcctagcc atgggaattc tccatctgtt ttgctacatt gaacccagat gccattctaa 4921 ccaagaatcc tggctgggtg caggggctct cgcctgtaac cccagcactt tgggaggcca 4981 aggcaggcgg atcaagaggt caggagttca agacctgcct ggccaacacg gtgaaacctc 5041 agctctacta aaaatacaaa aattagccag gcgtggtggc 
acacgcctgt aatcccagct 5101 atttgggaag ctgagacaga agaatttctt gaacccggga ggtggaggtt tcagtgagcc 5161 gagatcacgc cactgcactc caccctggcg gataaagcga gactctgtct caaaaaaaac 5221 ccaaaaacct atgttagtgt acagagggcc ccagtgaagt cttctcccag ccccactttg 5281 cacaactggg gagagtgagg ccccaggacc agaggattct tgctaaaggc caagtggata 5341 gtgatggccc tgccaggcta gaagccacaa cctctggccc tgaggccact cagcatattt 5401 agtgtcccca ccctgcagag gcccaactcc ctcctgacca ctgagccctg taatgatggg 5461 ggaatttcca taagccatga aggactgcac aaagttcagt tgggagtgaa agagaaatta 5521 aagggagatg gaaatataca gcactaattt tagcaccgtc ttcagttcta acaacactag 5581 ctagctgaag aaaatacaaa catgtattat gtaatgtgtg gtctgttcca tttggattac 5641 ttagaggcac gagggccaag gagaaaggtg gtggagagaa accagctttg cacttcattt 5701 gttgctttat tggaaggaaa cttttaaaag tccaaggggg ttgaagaatc tcaatatttg 5761 ttatttccag ctttttttct ccagtttttc atttcccaaa ttcaaggaca cctttttctt 5821 tgtattttgt taagatgatg gttttggttt tgtgactagt agttaacaat gtggctgccg 5881 ggcatattct cctcagctag gacctcagtt ttcccatctg tgaagacggc aggttctacc 5941 tagggggctg caggcaggtg gtccgaagcc tgggcatatc tggagtagaa ggatcactgt 6001 ggggcagggc aggttctgtg ttgctgtgga tgacgttgac tttgaccatt gctcggcaga 6061 gcctgctctc gctggttcag ccacaggccc caccactccc tattgtctca gccccgggta 6121 tgaaacatgt attcctcact ggcctatcac ctgaagcctt tgaatttgca acacctgcca 6181 acccctccct caaaagagtt gccctctcta gatccttttg atgtaaggtt tggtgttgag 6241 acttatttca ctaaattctc atacataaac atcactttat gtatgaggca aaatgaggac 6301 cagggagatg aatgacttgt cctggctcat acacctggaa agtgacagag tcagattaga 6361 tcctaggtct atctgaagtt aaaagaggtg tcttttcact tcccacctcc tccatctact 6421 ttaaagcagc acaaacccct gctttcaagg agagatgagc gtctctaaag cccctgacag 6481 caagagccca gaactgggac accattagtg acccagacgg caggtaagct gactgcagga 6541 gcatcagcct attcttgtgt ctgggaccac agagcattgt ggggacagcc ccgtctcttg 6601 ggaaaaaaac cctaagggct gaggatcctt gtgagtgttg ggtgggaaca gctcccagga 6661 ggtttaatca cagcccctcc atgctctcta gctgttgcca ttgtgcaaga tgcatttccc 6721 ttctgtgcag cagtttccct ggccactaaa tagtgggatt agatagaagc 
cctccaaggg 6781 ctccagcttg acatgattct tgattctgat ctgacccgat tctgataatc gtgggcaggc 6841 ccattcctct tcttgtgcct cattttcttc ttttgtaaaa caatggctgt accatttgca 6901 tcttagggtc attgcagatg aaagtgttgc tgtccagagc ctgggtgcag gacctagatg 6961 taggattctg gttctgctac ttcctcagtg acattgaata gctgacctaa tctctctggc 7021 tttggtttct tcatctgtaa aagaaggata ttagcattag cacctcacgg gattgttaca 7081 agaaagcaat gaattaacac atgtgagcac ggagaacagt gcttggcata tggtaagcac 7141 tacgtacatt ttgctattct tctgattctt tcagtgttac tgatgtcggc aagtacttgg 7201 cacaggctgg tttaataatc cctaggcact ttcacgtggt gtcaatccct gatcactggg 7261 agtcatcatg tgccttgact cgggcctggc ccccccatct ctgtcttgca ggacaatgcc 7321 gtcttctgtc tcgtggggca tcctcctgct ggcaggcctg tgctgcctgg tccctgtctc 7381 cctggctgag gatccccagg gagatgctgc ccagaagaca gatacatccc accatgatca 7441 ggatcaccca accttcaaca agatcacccc caacctggct gagttcgcct tcagcctata 7501 ccgccagctg gcacaccagt ccaacagcac caatatcttc ttctccccag tgagcatcgc 7561 tacagccttt gcaatgctct ccctggggac caaggctgac actcacgatg aaatcctgga 7621 gggcctgaat ttcaacctca cggagattcc ggaggctcag atccatgaag gcttccagga 7681 actcctccgt accctcaacc agccagacag ccagctccag ctgaccaccg gcaatggcct 7741 gttcctcagc gagggcctga agctagtgga taagtttttg gaggatgtta aaaagttgta 7801 ccactcagaa gccttcactg tcaacttcgg ggacaccgaa gaggccaaga aacagatcaa 7861 cgattacgtg gagaagggta ctcaagggaa aattgtggat ttggtcaagg agcttgacag 7921 agacacagtt tttgctctgg tgaattacat cttctttaaa ggtaaggttg ctcaaccagc 7981 ctgagctgtt tcccatagaa acaagcaaaa atatttctca aaccatcagt tcttgaactc 8041 tccttggcaa tgcattatgg gccatagcaa tgcttttcag cgtggattct tcagttttct 8101 acacacaaac actaaaatgt tttccatcat tgagtaattt gaggaaataa tagattaaac 8161 tgtcaaaact actgacgctc tgcagaactt ttcagagcct ttaatgtcct tgtgtatact 8221 gtatatgtag aatatataat gcttagaact atagaacaaa ttgtaataca ctgcataaag 8281 ggatagtttc atggaacata ctttacacga ctctagtgtc ccagaatcag tatcagtttt 8341 gcaatctgaa agacctgggt tcaaatcctg cctctaacac aattagcttt tgacaaaaac 8401 aatgcattct acctctttga ggtgctaatt tctcatctta gcatggacaa aataccattc 
8461 ttgctgtcag gtttttttag gattaaacaa atgacaaaga ctgtggggat ggtgtgtggc 8521 atacagcagg tgatggactc ttctgtatct caggctgcct tcctgcccct gaggggttaa 8581 aatgccaggg tcctgggggc cccagggcat tctaagccag ctcccactgt cccaggaaaa 8641 cagcataggg gaggggaggt gggaggcaag gccaggggct gcttcctcca ctctgaggct 8701 cccttgctct tgaggcaaag gagggcagtg gaggcaagcc aggctgcagt cagcacagct 8761 aaagtcctgg ctctgctgtg gccttagtgg gggcccaggt ccctctccag ccccagtctc 8821 ctccttctgt ccaatgagaa agctgggatc aggggtccct gaggcccctg tccactctgc 8881 atgcctcgat ggtgaagctc tgttggtatg gcagagggga ggctgctcag gcatctgcat 8941 ttcccctgcc aatctagagg atgaggaaag ctctcaggaa tagtaagcag aatgtttgcc 9001 ctggatgaat aactgagctg ccaattaaca aggggcaggg agccttagac agaaggtacc 9061 aaatatgcct gatgctccaa cattttattt gtaatatcca agacaccctc aaataaacat 9121 atgattccaa taaaaatgca cagccacgat ggcatctctt agcctgacat cgccacgatg 9181 tagaaattct gcatcttcct ctagttttga attatcccca cacaatcttt ttcggcagct 9241 tggatggtca gtttcagcac cttttacaga tgatgaagct gagcctcgag ggatgtgtgt 9301 cgtcaagggg gctcagggct tctcagggag gggactcatg gtttcttatt ctgctacact 9361 cttccaaacc ttcactcacc cctggtgatg cccaccttcc cctctctcca ggcaaatggg 9421 agagaccctt tgaagtcaag gacaccgagg aagaggactt ccacgtggac caggtgacca 9481 ccgtgaaggt gcctatgatg aagcgtttag gcatgtttaa catccagcac tgtaagaagc 9541 tgtccagctg ggtgctgctg atgaaatacc tgggcaatgc caccgccatc ttcttcctgc 9601 ctgatgaggg gaaactacag cacctggtaa atgaactcac ccacgatatc atcaccaagt 9661 tcctggaaaa tgaagacaga aggtgattcc ccaacctgag ggtgaccaag aagctgccca 9721 cacctcttag ccatgttggg actgaggccc atcaggactg gccagagggc tgaggagggt 9781 gaaccccaca tccctgggtc actgctactc tgtataaact tggcttccag aatgaggcca 9841 ccactgagtt caggcagcgc cgtccatgct ccatgaggag aacagtaccc agggtgagga 9901 ggtaaaggtc tcgtccctgg gaacttccca ctccagtgtg gacactgtcc cttcccaata 9961 tccagtgccc aaggcaggga cagcagcacc accacacgtt ctggcagaac caaaaaggaa 10021 cagatgggct tcctggcaaa ggcagcagtg gagtgtggag ttcaagggta gaatgtccct 10081 ggggggacgg gggaagagcc tgtgtggcaa ggcccagaaa agcaaggttc ggaattggaa 10141 
cagccaggcc atgttcgcag aaggcttgcg tttctctgtc actttatcgg tgctgttaga 10201 ttgggtgtcc tgtagtaagt gatacttaaa catgagccac acattagtgt atgtgtgtgc 10261 attcgtgatt atgcccatgc cctgctgatc tagttcgttt tgtacactgt aaaaccaaga 10321 tgaaaataca aaaggtgtcg ggttcataat aggaatcgag gctggaattt ctctgttcca 10381 tgccagcacc tcctgaggtc tctgctccag gggttgagaa agaacaaaga ggctgagagg 10441 gtaacggatc agagagccca gagccagctg ccgctcacac cagaccctgc tcagggtggc 10501 attgtctccc catggaaaac cagagaggag cactcagcct ggtgtggtca ctcttctctt 10561 atccactaaa cggttgtcac tgggcactgc caccagcccc gtgtttctct gggtgtaggg 10621 ccctggggat gttacaggct gggggccagg tgacccaaca ctacagggca agatgagaca 10681 ggcttccagg acacctagaa tatcagagga ggtggcattt caagcttttg tgattcattc 10741 gatgttaaca ttctttgact caatgtagaa gagctaaaag tagaacaaac caaagccgag 10801 ttcccatctt agtgtgggtg gaggacacag gagtaagtgg cagaaataat cagaaaagaa 10861 aacacttgca ctgtggtggg tcccagaaga acaagaggaa tgctgtgcca tgccttgaat 10921 ttcttttctg cacgacaggt ctgccagctt acatttaccc aaactgtcca ttactggaac 10981 ctatgatctg aagagcgtcc tgggtcaact gggcatcact aaggtcttca gcaatggggc 11041 tgacctctcc ggggtcacag aggaggcacc cctgaagctc tccaaggtga gatcaccctg 11101 acgaccttgt tgcaccatgg tatctgtagg gaagaatgtg tgggggctgc agcactgtcc 11161 tgaggctgag gaaggggccg agggaaacaa atgaagaccc aggctgagct cctgaagatg 11221 cccgtgattc actgacacgg gacggtgggc aaacagcaaa gccaggcagg ggctgctgtg 11281 cagctggcac tttcggggcc tcccttgagg ttgtgtcact gaccctgaat ttcaactttg 11341 cccaagacct tctagacatt gggccttgat ttatccatac tgacacagaa aggtttgggc 11401 taagttgttt caaaggaatt tctgactcct tcgatctgtg agatttggtg tctgaattaa 11461 tgaatgattt cagctaaagt gacacttatt ttggaaaact aaaggcgacc aatgaacaac 11521 ctgcagttcc atgaatggct gcattatctt ggggtctggg cactgtgaag gtcactgcca 11581 gggtccgtgt cctcaaggag cttcaagccg tgtactagaa aggagagagc cctggaggca 11641 gacgtggagt gacgatgctc ttccctgttc tgagttgtgg gtgcacctga gcagggggag 11701 aggcgcttgt caggaagatg gacagagggg agccagcccc atcagccaaa gccttgagga 11761 ggagcaaggc ctatgtgaca gggagggaga ggatgtgcag ggccagggcc 
gtccaggggg 11821 agtgagcgct tcctgggagg tgtccacgtg agccttgctc gaggcctggg atcagcctta 11881 caacgtgtct ctgcttctct cccctccagg ccgtgcataa ggctgtgctg accatcgacg 11941 agaaagggac tgaagctgct ggggccatgt ttttagaggc catacccatg tctatccccc 12001 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaacaaaat accaagtctc 12061 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac 12121 ccctcccctc catccctggc cccctccctg gatgacatta aagaagggtt gagctggtcc 12181 ctgcctgcat gtgatctgta aatccctggg atgttttctc tg // LOCUS HUMA1GLY2 4944 bp DNA PRI 30-OCT-1994 DEFINITION Human alpha-1-acid glycoprotein 2 (AGP2) gene, complete cds. ACCESSION M21540 NID g177839 KEYWORDS alpha-1 acid glycoprotein; orosomucoid. SOURCE Human DNA, clones lambda-AGP-[1A,2A,6B]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4944) AUTHORS Merritt,C.M. and Board,P.G. TITLE Structure and characterisation of a duplicated human alpha 1 acid glycoprotein gene JOURNAL Gene 66 (1), 97-106 (1988) MEDLINE 88329732 FEATURES Location/Qualifiers source 1..4944 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q32" prim_transcript 1594..>4940 /note="AGP2 mRNA and introns" exon <1609..1722 /gene="ORM2" /note="alpha-1-acid glycoprotein 2; G00-120-251" gene 1609..1722 /gene="ORM2" CDS join(1609..1722,2138..2280,2499..2569,3274..3381, 3532..3635,4776..4841) /note="alpha-1-acid glycoprotein 2" /codon_start=1 /db_xref="PID:g177840" /translation="MALSWVLTVLSLLPLLEAQIPLCANLVPVPITNATLDRITGKWF YIASAFRNEEYNKSVQEIQATFFYFTPNKTEDTIFLREYQTRQNQCFYNSSYLNVQRE NGTVSRYEGGREHVAHLLFLRDTKTLMFGSYLDDEKNWGLSFYADKPETTKEQLGEFY EALDCLCIPRSDVMYTDWKKDKCEPLEKQHEKERKQEEGES" intron 1723..2137 /note="AGP2 intron A" exon 2138..2280 /number=2 intron 2281..2498 /note="AGP2 intron B" exon 2499..2569 /number=3 intron 2570..3273 /note="AGP2 intron C" exon 3274..3381 /number=4 intron 3382..3531 /note="AGP2 intron D" exon 3532..3635 /number=5 intron 
3636..4775 /note="AGP2 intron E" exon 4776..>4841 /note="alpha-1-acid glycoprotein 2" /number=6 BASE COUNT 1206 a 1369 c 1292 g 1077 t ORIGIN 1 bp upstream of BamHI site; chromosome 9q31-qter. 1 ggatccgctg aaaaatgaaa cagaaatgag tcgtgatggg cagggaggga gaagcaaggg 61 agacgagaag tggggaacat ggaaggaaaa gccacgtgag gaagaaacca gaggtcaaga 121 gaaaaagaat catggaggta gaggaagcaa aaaacacaca taacaaagaa tgtggacttt 181 ggagtcaaac taatgtgagt ccaaacccag gctctctccc aaaccagttt gcggcagatg 241 gccagtggaa cctcactctc ctcatcagta aaaagggggc agagtgaggg tcctgagagc 301 tagtacaggg actgtgtgaa gtagacaatg cccagtgttt agcgtaagaa tcagggtcca 361 gctggtgctc cctaaacagc agctgctgtt cactgttgaa aggcgctctg gaaggccagg 421 cgcggtggct catgcttgta atcccagcac tgtgggaggc cgaggtgggc ggatcacctg 481 aggtagggag ttcgagacca gcctgaccaa cgtggagaaa ccccatctct cctaaaaata 541 caaaattagc caggcgtggt agcacatacc tgtaatccca gcgactcggg aggctgaggc 601 aagagaattg cttgaaacca gcaggggagg ttgtggtgag ccaagatcga gccattgcac 661 tccagccagg gcaacaagag gcaaaatggc gaaactccat ctccgagaaa aaaaaaaaaa 721 aagaatactt tctgaaagta tttattcata caaataaaga cttgacccat aaggtaggaa 781 cgcaaatggg ccacggaatc actcattcca cagtatacac cgagtgccct tgaagtgctg 841 ggcactgctc caggattggg ggcatattgg tgaaaagaga agcaagcctg cctgctcaga 901 tggcagggaa tggggaaaaa cagggagaca gtttcctgtt tgagatgttg ggagtctgct 961 tcgagtagta tatttactgg aaatagacca ctaacttgga tgtccctttt tggaaatgtg 1021 cctgcgtcca gggctgggtt ggggccccaa tgaactttgg ctctgacata gctgttgcca 1081 cactcagtgg aactgaatcc atgtttgcct tcacccggca tccttcaccc caactctccc 1141 cgccacaaca tacatcccat gccagcctgg ggaccctcaa aggtgcttca tcattaggtt 1201 tgtggctggg tcctactgaa gtaagtcttg gcactcagag ggataggaat tgaatgaaga 1261 catgagattc ctctgcggga ggcctctcta ggaaatctgt ggactcacac gtttactaat 1321 gttgctgcag ccccgcaccc accttggcct tgggcagcca tactctaggg cttttgtaac 1381 ctctccatgt gaggaactca aattagacct gggtttggag gcggtgctcc gagctggcct 1441 ttgggggagg ttttgtgcga ggcatttccc aagtgctggc aggattgtgt cacagacaca 1501 gagtaaactt ttgctgggct ccaagtgacc gcccatagtt 
tattataaag gtgactgcac 1561 cctgcagcca ccagcactgc ctggctccac gtgcctcctg gtctcagtat ggcgctgtcc 1621 tgggttctta cagtcctgag cctcctacct ctgctggaag cccagatccc attgtgtgcc 1681 aacctagtac cggtgcccat caccaacgcc accctggacc gggtgagtgc ctgggctagc 1741 cctgtcctga gcacatgggc agctgcctcc cttctctggg cttcccttta cctgctggct 1801 gtggtcgcac ccccactccc agctctgcct ttttctcttc tgggtcccca gggtgaaatt 1861 ctcaccagcc caggggactc tggaggcacc ccctgcctcc aaacacagaa gcctcactgc 1921 agagtccttc acggaggacg gttctgtgct gggcctggag gggctgcctg gggggcaatg 1981 actgatcctc agggtgagct cctgcatgcg cactgcccac caggggcctc atctccccat 2041 ctgcaaaatc agggagagat ctgcctgagt ctcctcccag ctgacagtca aagattcagc 2101 atcaagcccc catcaccagc tccccccttc tccccagatc actggcaagt ggttttatat 2161 cgcatcggcc tttcgaaacg aggagtacaa taagtcggtt caggagatcc aagcaacctt 2221 cttttacttt acccccaaca agacagagga cacgatcttt ctcagagagt accagacccg 2281 gtgagagccc ccattccaat gcacccccga tctcagctgt ctggccagaa gacctgagca 2341 agtccctcct tcttcctggc cttggccttc ccatgggtgg aaccgggagg gttggcttta 2401 atctccacca gaactcttgc cccgggactg tgatgggcga ttggccactt ctcctcgata 2461 acattactgt ttttcttccg ccttctggtt gactttagcc agaaccagtg cttctataac 2521 tccagttacc tgaatgtcca gcgggagaat gggaccgtct ccagatacgg tgagggccag 2581 ccctcaggca ggagggttca ccgtgggaac agggcaggcc agcataaggt gggggctgga 2641 tgtagagccc tggaggcttt gggcacagag aaataaccac taacattttt gagctcttac 2701 cacgtgctca gaaaaaatcc ctaagaagac actgagagaa ttagatgagg aaacataaga 2761 acagagacct caaatagttt ccccaaggtc acacagctta taattagaac tagaattgga 2821 actccaggct ggcttcagat ctgcctctct ctcacgccct ctttaagatc ctttgcaaac 2881 caatggtaga agcctgtatg ttggagaggt ggtaccttca actatgtccc ccatcaccgc 2941 agaggtggca catggcaggg atctgatgga gctgaactga catcatttag catcccgagc 3001 ctcctctctg ggcctcattt tcctcctctg taaaacgggg agaaaggccc tgacagccac 3061 agtctgtgtg aggctcctga gatctcatgt acagaaagtg cttggcgtgg agctgggcac 3121 gcagcagggg ctgggcacac ggtggcccaa aggagacccg ggccttcact gatgggcttt 3181 gtggccccgg acacacctag gactcctcac ctgtaagaca ggcaccattg 
tgccatccca 3241 tgttctcacc cagaggctct ttttctcttc cagagggagg ccgagaacat gttgctcacc 3301 tgctgttcct tagggacacc aagaccttga tgtttggttc ctacctggac gatgagaaga 3361 actgggggct gtctttctat ggtaggcatg cttagcagcc ccaaactcat gcccctctca 3421 ggcctcaccc cccattcacc cacccctggg ctggccccta gaaccccagc cctccctggc 3481 ctccgccggg ccccaccatg tccccagtca gtctccttgc tccccctgca gctgacaagc 3541 cagagacgac caaggagcaa ctgggagagt tctacgaagc tctcgactgc ttgtgcattc 3601 ccaggtcaga tgtcatgtac accgactgga aaaaggtaaa cgcaagggat tggacattgc 3661 ccaccttgtc catggcccaa cttgggcagc cccagaggcc cagagcagga aagctgccag 3721 gcaaggctgc acagctaggc agatcttctg cttttaggca cctgcctcac tgtagggaca 3781 gctgagctct acagaggccc aggggtggtg gatgagagcc caggagggag aagtccctgt 3841 gaaaccaggg aggacctgaa agctaacagg agggaacagc gtgagccacg gggttggggg 3901 attggcaatt ggaggggacg taatgcgggg agttaccacc tacagacgcg tcccaaaccc 3961 caggctttca ccccaacctc cactccccgc tcatttttaa tacccgtgca gtggggaatt 4021 gatactgtgg ttttcaatgt cacccacact gcagcacggc cacagtcacc atcccgattt 4081 ttgctacaaa tgaaaattac tgtataatga gctccttaac acttttcttt aaacctgtgt 4141 ttggaagact tgtgttggtg tggccctgtg ccctaatacc tgtgaaatca cagcaccgat 4201 gagctggttc caatttttaa aatatataca tgcagtactt ccatgactat tcaaagaaaa 4261 acaattcctt ccatttgcca cctgagatga ccaccaggga tgtgaactac ctcctgcccc 4321 atccccagcc ccaggatcct gggacagggc ttatgaacgc aaccactgta gtcagctcac 4381 ttgatccaca gcctggcacc tccactgtct ggctagggag cctcgaatgg gtcccaaggc 4441 caccctgctc ctcagttaca tcatctgcat agtagtggtg gttgtgagga attcaggagc 4501 tgcagcataa gggccctgca ggtactatgt gctcagtaaa tgccagtggt tcttaagggt 4561 ctgagctccc attgtagagg caagtaagct gaggttcaga gacagaaaat gacttgccca 4621 agatcaccca gctgggaagt gacagtgcca gggttggagc cctggttgag ctggttccac 4681 aggccagagc tcattctgcc ctctccccgg aagacctccc accctgtccc catgcctctg 4741 cttctccctc accccaattc cccgctgcct tctaggataa gtgtgagcca ctggagaagc 4801 agcacgagaa ggagaggaaa caggaggagg gggaatccta gcaggacaca gccttggatc 4861 aggacagaga cttgggggcc atcctgcccc tccaacccga catgtgtacc tcagcttttt 
     4921 ccctcacttg catcaataaa gctt
//
LOCUS       HUMACCYBB    3646 bp    DNA             PRI       30-OCT-1994
DEFINITION  Human cytoplasmic beta-actin gene, complete cds.
ACCESSION   M10277
NID         g177967
KEYWORDS    actin; beta-cytoplasmic actin; cytoplasmic actin.
SOURCE      Human DNA library from HUT-14 cell line, clone lambda-Ha160.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3646)
  AUTHORS   Nakajima-Iijima,S., Hamada,H., Reddy,P. and Kakunaga,T.
  TITLE     Molecular structure of the human cytoplasmic beta-actin gene:
            interspecies homology of sequences in the introns
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82 (18), 6133-6137 (1985)
  MEDLINE   85298307
REFERENCE   2  (sites)
  AUTHORS   Harris,D.E., Warshaw,D.M. and Periasamy,M.
  TITLE     Nucleotide sequences of the rabbit alpha-smooth-muscle and
            beta-non-muscle actin mRNAs
  JOURNAL   Gene 112 (2), 265-266 (1992)
  MEDLINE   92210011
COMMENT     A potential cap site was found at position 239. Through
            interspecies sequence comparison, a second potential cap site
            was found at positions 242-244.
FEATURES             Location/Qualifiers
     source          1..3646
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="7pter-q22"
     prim_transcript <319..3589
                     /note="actin mRNA"
     intron          320..1086
                     /note="actin intron A"
     exon            <1093..1215
                     /gene="ACTB"
                     /note="cytoplasmic beta actin, (first expressed exon);
                     G00-118-964"
                     /number=2
     gene            1093..1215
                     /gene="ACTB"
     CDS             join(1093..1215,1348..1587,2029..2467,2563..2744,
                     2857..3000)
                     /note="cytoplasmic beta actin"
                     /codon_start=1
                     /db_xref="PID:g177968"
                     /translation="MDDDIAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVM
                     VGMGQKDSYVGDEAQSKRGILTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHP
                     VLLTEAPLNPKANREKMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVMDSGDGVT
                     HTVPIYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRDIKEKLCY
                     VALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMESCGIHE
                     TTFNSIMKCDVDIRKDLYANTVLSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPE
                     RKYSVWIGGSILASLSTFQQMWISKQEYDESGPSIVHRKCF"
     intron          1216..1347
                     /note="actin intron B"
     exon            1348..1587
                     /number=3
     intron          1588..2028
                     /note="actin intron C"
     exon            2029..2467
                     /number=4
     intron          2468..2562
                     /note="actin intron D"
     exon            2563..2744
                     /number=5
     intron          2745..2856
                     /note="actin intron E"
     exon            2857..>3000
                     /note="cytoplasmic beta actin"
                     /number=6
BASE COUNT      613 a   1117 c   1102 g    814 t
ORIGIN      145 bp upstream of AvaI site.
1 gcccagcacc ccaaggcggc caacgccaaa actctccctc ctcctcttcc tcaatctcgc 61 tctcgctctt tttttttttc gcaaaaggag gggagagggg gtaaaaaaat gctgcactgt 121 gcggcgaagc cggtgagtga gcggcgcggg gccaatcagc gtgcgccgtt ccgaaagttg 181 ccttttatgg ctcgagcggc cgcggcggcg ccctataaaa cccagcggcg cgacgcgcca 241 ccaccgccga gaccgcgtcc gcccgcgagc acagagcctc gcctttgccg atccgccgcc 301 cgtccacacc cgccgccagg taagcccggc cagccgaccg gggcatgcgg ccgcggccct 361 tcgcccgtgc agagccgccg tctgggccgc agcggggggc gcatggggcg gaaccggacc 421 gccgtggggg gcgcgggaga agcccctggg cctccggaga tgggggacac cccacgccag 481 ttcgcaggcg cgaggccgcg ctcgggcggg cgcgctccgg gggtgccgct ctcggggcgg 541 gggcaaccgg cggggtcttt gtctgagccg ggctcttgcc aatggggatc gcacggtggg 601 cgcggcgtag cccccgtcag gcccggtggg ggctggggcg ccatgcgcgt gcgcgctggt 661 cctttgggcg ctaactgcgt gcgcgctggg aattggcgct aattgcgcgt gcgcgctggg 721 actcaatggc gctaatcgcg cgtgcgttct ggggcccggg cgcttgcgcc acttcctgcc 781 cgagccgctg gcgcccgagg gtgtggccgc tgcgtgcgcg cgcgcgaccc ggtcgctgtt 841 tgaaccgggc ggaggcgggg ctggcgcccg gttgggaggg ggttggggcc tggcttcctg 901 ccgcgcgccg cggggacgcc tccgaccagt gtttgccttt tatggtaata acgcggccgg 961 cccggcttcc tttgtcccca atctgggcgc gcgccggcgc cccctggcgg cctaaggact 1021 cggcgcgccg gaagtggcca gggcgggggc gacttcggct cacagcgcgc ccggctattc 1081 tcgcagctca ccatggatga tgatatcgcc gcgctcgtcg tcgacaacgg ctccggcatg 1141 tgcaaggccg gcttcgcggg cgacgatgcc ccccgggccg tcttcccctc catcgtgggg 1201 cgccccaggc accaggtagg ggagctggct gggtggggca gccccgggag cgggcgggag 1261 gcaagggcgc tttctctgca caggagcctc ccggtttccg gggtgggctg cgcccgtgct 1321 cagggcttct tgtcctttcc ttcccagggc gtgatggtgg gcatgggtca gaaggattcc 1381 tatgtgggcg acgaggccca gagcaagaga ggcatcctca ccctgaagta ccccatcgag 1441 cacggcatcg tcaccaactg ggacgacatg gagaaaatct ggcaccacac cttctacaat 1501 gagctgcgtg tggctcccga ggagcacccc gtgctgctga ccgaggcccc cctgaacccc 1561 aaggccaacc gcgagaagat gacccaggtg agtggcccgc tacctcttct ggtggccgcc 1621 tccctccttc ctggcctccc ggagctgcgc cctttctcac tggttctctc ttctgccgtt 1681 ttccgtagga ctctcttctc 
tgacctgagt ctcctttgga actctgcagg ttctatttgc 1741 tttttcccag atgagctctt tttctggtgt ttgtctctct gactaggtgt ctgagacagt 1801 gttgtgggtg taggtactaa cactggctcg tgtgacaagg ccatgaggct ggtgtaaagc 1861 ggccttggag tgtgtattaa gtaggcgcac agtaggtctg aacagactcc ccatcccaag 1921 accccagcac acttagccgt gttctttgca ctttctgcat gtcccccgtc tggcctggct 1981 gtccccagtg gcttccccag tgtgacatgg tgcatctctg ccttacagat catgtttgag 2041 accttcaaca ccccagccat gtacgttgct atccaggctg tgctatccct gtacgcctct 2101 ggccgtacca ctggcatcgt gatggactcc ggtgacgggg tcacccacac tgtgcccatc 2161 tacgaggggt atgccctccc ccatgccatc ctgcgtctgg acctggctgg ccgggacctg 2221 actgactacc tcatgaagat cctcaccgag cgcggctaca gcttcaccac cacggccgag 2281 cgggaaatcg tgcgtgacat taaggagaag ctgtgctacg tcgccctgga cttcgagcaa 2341 gagatggcca cggctgcttc cagctcctcc ctggagaaga gctacgagct gcctgacggc 2401 caggtcatca ccattggcaa tgagcggttc cgctgccctg aggcactctt ccagccttcc 2461 ttcctgggtg agtggagact gtctcccggc tctgcctgac atgagggtta cccctcgggg 2521 ctgtgctgtg gaagctaagt cctgccctca tttccctctc aggcatggag tcctgtggca 2581 tccacgaaac taccttcaac tccatcatga agtgtgacgt ggacatccgc aaagacctgt 2641 acgccaacac agtgctgtct ggcggcacca ccatgtaccc tggcattgcc gacaggatgc 2701 agaaggagat cactgccctg gcacccagca caatgaagat caaggtgggt gtctttcctg 2761 cctgagctga cctgggcagg tcagctgtgg ggtcctgtgg tgtgtgggga gctgtcacat 2821 ccagggtcct cactgcctgt ccccttccct cctcagatca ttgctcctcc tgagcgcaag 2881 tactccgtgt ggatcggcgg ctccatcctg gcctcgctgt ccaccttcca gcagatgtgg 2941 atcagcaagc aggagtatga cgagtccggc ccctccatcg tccaccgcaa atgcttctag 3001 gcggactatg acttagttgc gttacaccct ttcttgacaa aacctaactt gcgcagaaaa 3061 caagatgaga ttggcatggc tttatttgtt ttttttgttt tgttttggtt tttttttttt 3121 ttttggcttg actcaggatt taaaaactgg aacggtgaag gtgacagcag tcggttggag 3181 cgagcatccc ccaaagttca caatgtggcc gaggactttg attgcattgt tgttttttta 3241 atagtcattc caaatatgag atgcattgtt acaggaagtc ccttgccatc ctaaaagcca 3301 ccccacttct ctctaaggag aatggcccag tcctctccca agtccacaca ggggaggtga 3361 tagcattgct ttcgtgtaaa ttatgtaatg 
caaaattttt ttaatcttcg ccttaatact 3421 tttttatttt gttttatttt gaatgatgag ccttcgtgcc cccccttccc cctttttgtc 3481 ccccaacttg agatgtatga aggcttttgg tctccctggg agtgggtgga ggcagccagg 3541 gcttacctgt acactgactt gagaccagtt gaataaaagt gcacacctta aaaatgaggc 3601 caagtgtgac tttgtggtgt ggctgggttg ggggcagcag agggtg // LOCUS HUMADAG 36741 bp DNA PRI 04-OCT-1995 DEFINITION Human adenosine deaminase (ADA) gene, complete cds. ACCESSION M13792 NID g178076 KEYWORDS Alu repeat; adenosine deaminase; long terminal repeat (LTR); repetitive sequence. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36741) AUTHORS Wiginton,D.A., Kaplan,D.J., States,J.C., Akeson,A.L., Perme,C.M., Bilyk,I.J., Vaughn,A.J., Lattier,D.L. and Hutton,J.J. TITLE Complete sequence and structure of the gene for human adenosine deaminase JOURNAL Biochemistry 25 (25), 8234-8244 (1986) MEDLINE 87128922 REFERENCE 2 (sites) AUTHORS Berkvens,T.M., van Ormondt,H., Gerritsen,E.J., Khan,P.M. and van der Eb,A.J. TITLE Identical 3250-bp deletion between two AluI repeats in the ADA genes of unrelated ADA-SCID patients JOURNAL Genomics 7 (4), 486-490 (1990) MEDLINE 90353944 REFERENCE 3 (sites) AUTHORS Gossage,D.L., Norby-Slycord,C.J., Hershfield,M.S. and Markert,M.L. TITLE A homozygous 5 base-pair deletion in exon 10 of the adenosine deaminase (ADA) gene in a child with severe combined immunodeficiency and very low levels of ADA mRNA and protein JOURNAL Hum. Mol. Genet. 2 (9), 1493-1494 (1993) MEDLINE 94061056 COMMENT [2] sites; 3250 bp deletion. [2] describes a patient with severe combined immune deficiency caused by a 3250 base pair deletion in the ADA gene. 
FEATURES             Location/Qualifiers
     source          1..36741
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="20q13.2-qter; 475 bp upstream of HindIII site"
                     /chromosome="20"
     LTR             1025..1357
                     /note="THE O family LTR"
     repeat_region   1362..1672
                     /note="Alu repeat"
     LTR             1680..1717
                     /note="THE O family LTR"
     repeat_region   2357..2903
                     /note="Alu repeat"
     mutation        2455..5706
                     /note="g-3250 bp-c in [1]; gc in [2] -> immune deficiency
                     dysfunction"
     prim_transcript 3936..35975
                     /note="ADA mRNA"
     gene            4031..35664
                     /gene="ADA"
     CDS             join(4031..4063,19230..19291,26344..26466,28908..29051,
                     29823..29938,31176..31303,32425..32496,32573..32674,
                     32851..32915,34354..34483,35100..35202,35651..35664)
                     /gene="ADA"
                     /codon_start=1
                     /product="adenosine deaminase"
                     /db_xref="PID:g178077"
                     /translation="MAQTPAFDKPKVELHVHLDGSIKPETILYYGRRRGIALPANTAE
                     GLLNVIGMDKPLTLPDFLAKFDYYMPAIAGCREAIKRIAYEFVEMKAKEGVVYVEVRY
                     SPHLLANSKVEPIPWNQAEGDLTPDEVVALVGQGLQEGERDFGVKARSILCCMRHQPN
                     WSPKVVELCKKYQQQTVVAIDLAGDETIPGSSLLPGHVQAYQEAVKSGIHRTVHAGEV
                     GSAEVVKEAVDILKTERLGHGYHTLEDQALYNRLRQENMHFEICPWSSYLTGAWKPDT
                     EHAVIRLKNDQANYSLNTDDPLIFKSTLDTDYQMTKRDMGFTEEEFKRLNINAAKSSF
                     LPEDEKRELLDLLYKAYGMPPSASAGQNL"
     exon            <4031..4063
                     /gene="ADA"
                     /note="adenosine deaminase"
                     /number=1
     mutation        4052
                     /gene="ADA"
                     /note="The functional significance of this substitution at
                     cDNA position 117 (AA 8 Asp->Asn) is unknown as it occurs
                     in the same allele as a 5 bp deletion in exon 10 which is
                     associated with very low ADA mRN"
                     /citation=[3]
     intron          4064..19229
                     /gene="ADA"
                     /note="ADA intron 1"
     repeat_region   4907..5227
                     /note="Alu repeat"
     repeat_region   5606..5908
                     /note="Alu repeat"
     repeat_region   7582..8001
                     /note="Alu repeat"
     repeat_region   8179..8484
                     /note="Alu repeat"
     repeat_region   10005..10204
                     /note="Alu repeat"
     repeat_region   10257..10534
                     /note="Alu repeat"
     repeat_region   13452..13777
                     /note="Alu repeat"
     repeat_region   14837..15386
                     /note="Alu repeat"
     repeat_region   15806..16106
                     /note="Alu repeat"
     repeat_region   16913..17224
                     /note="Alu repeat"
     repeat_region   18414..18717
                     /note="Alu repeat"
     exon            19230..19291
                     /gene="ADA"
                     /number=2
     intron          19292..26343
                     /gene="ADA"
                     /note="ADA intron 2"
     repeat_region   19605..19902
                     /note="Alu repeat"
     repeat_region   22523..22829
                     /note="Alu repeat"
     repeat_region   24481..24773
                     /note="Alu repeat"
     repeat_region   25143..25453
                     /note="Alu repeat"
     exon            26344..26466
                     /gene="ADA"
                     /number=3
     intron          26467..28907
                     /gene="ADA"
                     /note="ADA intron 3"
     repeat_region   26949..27269
                     /note="Alu repeat"
     repeat_region   28032..28333
                     /note="Alu repeat"
     exon            28908..29051
                     /gene="ADA"
                     /number=4
     intron          29052..29822
                     /gene="ADA"
                     /note="ADA intron 4"
     exon            29823..29938
                     /gene="ADA"
                     /number=5
     intron          29939..31175
                     /gene="ADA"
                     /note="ADA intron 5"
     exon            31176..31303
                     /gene="ADA"
                     /number=6
     intron          31304..32424
                     /gene="ADA"
                     /note="ADA intron 6"
     repeat_region   31460..31867
                     /note="Alu repeat"
     exon            32425..32496
                     /gene="ADA"
                     /number=7
     intron          32497..32572
                     /gene="ADA"
                     /note="ADA intron 7"
     exon            32573..32674
                     /gene="ADA"
                     /number=8
     intron          32675..32850
                     /gene="ADA"
                     /note="ADA intron 8"
     exon            32851..32915
                     /gene="ADA"
                     /number=9
     intron          32916..34353
                     /gene="ADA"
                     /note="ADA intron 9"
     exon            34354..34483
                     /gene="ADA"
                     /number=10
     mutation        34462..34468
                     /gene="ADA"
                     /note="This mutation is found in both alleles of a patient
                     with severe combined immunodeficiency secondary to ADA
                     deficiency.
The deletion creates a premature stop codon resulting in very low levels of ADA m" /citation=[3] intron 34484..35099 /gene="ADA" /note="ADA intron 10" exon 35100..35202 /gene="ADA" /number=11 intron 35203..35650 /gene="ADA" /note="ADA intron 11" exon 35651..>35664 /gene="ADA" /note="adenosine deaminase" /number=12 BASE COUNT 8165 a 9716 c 9721 g 9139 t ORIGIN 1 gatctgggta aagggttttc caggtgtcag gatggaagtg actaaggtgc agaggctgga 61 gggctggggc aggtagaagc aagcattcct gttacctact gctgtgtgac aatctccccc 121 taaaacacaa tggcttaaaa taacatccat ttcattacat atctcaatac tataggtcag 181 gaatttgggc tgggcttact tgggtaattc ttctgtccca catggcattg accaaagcct 241 ggttttcagt gggcagctgg gctggatggc ccaacacagc ttcgctaaca tgattgctgt 301 cttcgtaggg atggtggaag cctgggctca gtgggactgt caactggaat ggccatatgt 361 ggactctctt agcatgatgg tctcttctag aagcttgggt tcccagagag aatgttcaag 421 aggccccaaa ggacaccaca aagcttcttt atgaccaagg ctcggaaatc caggaagctt 481 gctcccatca cgctctatta ctccaacaag tcactcaggc cagcccaggt ccaagaggag 541 gaaacctaga ctccatcttg caatgtgaag aattgcaaat aatttgtgtc acccttaagc 601 aaccagcaac tcatctaggt tgattggcat ttcagcaatg tggtgggaag tggtgggact 661 gatgttgaag agggacttga atgtcatgag aggctgggga ggcaataagg tggggagtga 721 agtttctcga gtcagattca aatttaaacc ccagttttgc cacttacaac ccatgagcca 781 agcaggctgt ctctctatct gaacctcagt gtcctcatct gtaaaatgag gagaacacct 841 cctacatctg aggatgactg taaagatgaa atgggatggg tgcttataaa gtgcttccca 901 gtgtacctgg ctccaaacct gtctcagtaa atggcagccc ctattattga acccgagtaa 961 cacagagagc caagaaagga tcttacaaaa aactcccctg gctttgacaa tgtatgagac 1021 ccactgatag ggtttggctt tgtgtcctca cccaaatctc atctagtagc tcccataatt 1081 cctacatgtt gtgggagaga ctcggcggga gataattgaa tcatggggga tggtctttcc 1141 catgctgttc ttgtgatagt aaataagtct cacaagatct gatggtttta aaaatgggag 1201 tttccctgca ggcgctctct ctttgtctac tgccatccat gtaagacgtg acttgctcct 1261 cctttgcctt ctgccatgat tgcaaggcct ccccaccatt gtggaactgt aagtctatta 1321 aagcctcttt cttttgtaaa ttacccagtc tcaggtatgt cttttttttt tttttcatga 1381 gatggagttt 
cgctcttgtt gcccaggctg gaatgcaatg gtgtaatctt ggctcaccac 1441 aacctccacc tcccaggttc aagcgattct cctgcctcag cctcccgagt agctgggatt 1501 acagtcatac accaccacgc ctggctaatt ttgtattttt tttttttttt ttagtagaga 1561 cggggtttca ccatgttggt caggctggtc tcaaactccc gacctcaggt gatcctcctg 1621 ccttggcctc ccaaagtcct gggattacag gcatgaacca ctgcgcccag gctcgggtat 1681 gtcttcatca gtagcatgaa aataatggac taatacagcc accctctccc tcactcccac 1741 atacaaccaa accccaaatc cagctgattt tacaccctaa atgcagcttg aatatgagtt 1801 tctccacttc ccccactgac atcactatgc cctacccaga ccatggcagt tgcctccttc 1861 ctggtatcct gtcctccctc acccccgctg gccccctgta atgccctccc ctcacagcag 1921 ggagcccagg cttctcaaag tgccctgtgg gtgcgaacca cctgggggtc ctgtttgtat 1981 aaaatacaga ttctacttca gtaggtctgg gatggggtct gaaagtctgc atttgtagtc 2041 agctcccagg tgatgtgggt gctgatgatc cctggatcac actttcagta gctggagaat 2101 attttttcca aataaaaggg tgattttgtc tcgcctccac ttaaaacact ccactgactt 2161 cctaggaatc ccacaccatc gctgggtccc acatccctgg caggattcag ctcccatcag 2221 accttctagc cccttgctct ccactctccc actctctctt tcccccttgt ttatgggttt 2281 gttaatttat ttatgatgaa atgaaatgaa gctaccatcc accccagtac tggaacatta 2341 tcaataacct gtgtgtggcc aggcgtggtg gctcatgcct gtaatcacgc cttgggaagc 2401 cgaggtgggt ggatcatgtg aggtcaggtg ttcgagacca gcctggccaa catggtgaaa 2461 ccccgtctct actacaaatc caaaacttag cagggcacgg tgccacgcgc ctgtaatccc 2521 agctactcgg gacgctgagg ccgagaactg cttaaaatcc aggaggtgga ggttgcagtg 2581 agccgagatt tcgccactgc actccagcct gggcgacaga gcaagagtcc atctcaaaaa 2641 aacaaaaaca aaaacaaaaa aacaaaaaac aaaaattagc caggcgtggt tgtgggcgcc 2701 tataatccca gctactcggg aggctgagac aggaaaatcg cttgaaacgc tgggggtgcg 2761 ggggggcggt ggggaggagg cgggccagag gggcagaggt tgcagtgagc ccagatcgcg 2821 ccacttcact gcagcctccg cgaaagagcg aaactccgtc tcagtaaata aataaataaa 2881 taaataaata aataaataaa taacctgtac ccgcgtgtta tttccctccg tccttacctc 2941 ctcccggctc cttccctttc acctgagata accactcttc tcgtatctat gctcatcttt 3001 cccttgcttt acattttttc caccgatgca tgtgtctaaa catacatact tttggttttg 3061 cttttacaca ttctaaaagt 
tgcaccattg tatgcagttt tccgcaactt agtttttttc 3121 actcaacatt gtttctgaga cattgtttct gttgttgtct ggctgaagtt cattccgttt 3181 cactgctgtc taacgtttca tggtgtgaat attccggttt atttgcccac tcgcccgtgg 3241 aggggcattt gagggtgttt ccaatgttcc tgttattcgg aatagcgctg gtgtgaacat 3301 tctgcacagg tctctggctg cgcctgggcg ggtttcttaa aggtgaatgc ccaggagggg 3361 actgtctgtg ttctccctcc ctccgagctc cagccttcct cgcctccttt cactcccagc 3421 tccctggagt ctctcacgta gaatgtcctc tccaccccca cccacccctg atgaactcct 3481 gcaggttctg caggccacgg ctggcccccc tcgaaagttc cttaactata caattatggt 3541 gtgtgtttct gcgacgagcg tccgtctatc cggtggaagg cacgccgctc gaggcttgcg 3601 atgctcccgg ggtccccgct tctagcttgg gcctggcgca cagcagcgcc cagactgcag 3661 ggggacgctt gaaagttgct ggaggagccg gggggaaggc agcgcccagc gaggcggctg 3721 gagcgcgcgc ccacaggtgg gtccggtcgg gcgccgcggg gccgtagttt tcgggtcggc 3781 gggcgaggac gccgggtcca gaattccagg aaatgcgcga tccaggccgg cgggcggggc 3841 gggggctccg gcgagagggc gggccccggg aacggcggcg ggcggggcgg gaggcggggc 3901 ccggcccgtt aagaagagcg tggccggccg cggccaccgc tggccccagg gaaagccgag 3961 cggccaccga gccggcagag acccaccgag cggcggcgga gggagcagcg ccggggcgca 4021 cgagggcacc atggcccaga cgcccgcctt cgacaagccc aaagtgagcg cgcgcggggg 4081 ctccggggac gggggtccgg cgcctgggcg gcccgagggg cttagcgggg cccagcccgg 4141 ggcgtccaaa ccctgggaac gaacgggggc tcctgcaggc gagttcttcc ttcggcttag 4201 gccgtggctt gcttgcgggc taatcaggga caatggggca gagaaggtcc agaacccgga 4261 ggcctccaga gtctgcttct gcccctgact tgacccctct gggtctcagt ttcgctgtct 4321 gtcaagtggg catcctagca ccgctgagcg ctgtgtgggc ctgggcaggg acttgaggtc 4381 tctgaagctc agctgtatga tcaggcccga tgtctacgcc ggatagcgac ctagtgctgt 4441 gccccgcgcc tactgagtgc tcagtgaatg gaagcagctt tgtacgccag cgttatggtg 4501 gtgagcgcca aggagctcag gtttgtggat gcgccccggg gaagaaccgt gagccctgcc 4561 agaaagggga gggaggggag cagagcaccc cccttccccc gcgcgggaag aacaggagct 4621 aggtaggccc tgggtttggg gccctagcag ggttcactcg aggccaagcc atggcccact 4681 ggccccaggg gagaatcccc ttgtttctcc gcccaccagc tgtggcgtct tgggactgtt 4741 ggggtcaggg agggtctgga cccccttggc 
ctgtctcaga gtccgagagg aggggcccag 4801 gagtctgcca agcagggtga gtcagccagt agggtgtgag agtggttggg gaaggagtca 4861 gctgcagtca gcctcaactt acccttctaa gaaataggtg tgagtggccc aggaggttgg 4921 ctcacgcctg taatcccagc actttgtgag gctgaggcgg gaggatcatt tgagtccagg 4981 agtttgagac tagcctggac aacaaaacta gaccccgtct ctccaaaaaa taaaaaaagt 5041 taggggaagt gtgtgtggtg gtgcactccc gtagtcccag ctactcagga ggctgaggcg 5101 ggaggatcgc ttgagcccag gaggttgagg ctgcagtgag gtgtgatggt gccactgacc 5161 ttcagcctgg gagacagagc gagaccctgt ctcaaaaaaa aagagaagaa aaagaaaaga 5221 aaagaaatag gtgtgaatga tgatgacagc tatcacaaaa gtgccggtga gaatccagtg 5281 agtgtgcatg tgtcagtgag ggagacaggc tgtggagagc ccacctacct tctgaggagg 5341 gtgaggcctg gcccccacta ctgatgcccc cagcccaggg aaaatgctca gctactcccc 5401 gtcagaagct ggaacgactg aggtgctgta caagccctcc tacccccacc cctgcctcct 5461 tcacgtctta ctggagctgg ggcccatgat tggcgcctcc cctttgcagt ctttttatta 5521 aatgctctgg gctccctctg cccttgggct ggggacccac tgtaccctga tgtgaatcct 5581 atggcagtag caaagctctt tgattggcgg ggtgcagtgg ctcacgcctg taatcccagc 5641 actttgggag gcaaaggtgg gtggatcatg aggccaggag ttcgagacca gcctggccaa 5701 catggcaaaa ccccatttct actaaaaata caaaaaatta gctgggcatg gtgcgggcgc 5761 ctgtagtccc acgtacgcag aaggctgagg caggagaatg gcataaaccc gggaggtgga 5821 gcttgcagtg agccgagatc tcgccattgc actccagcct gggtgacaga gtgagactct 5881 gtctcaaaaa aaaaaaaaaa aaaaaaaagg ctccttgatt gcgaacatgt tgggagttat 5941 ggagagaaca gcagggccca cttctagagc acttgttgca gacacccatt ggatccttgc 6001 agttcttctg taacagccca tcaagggagg ggctcatatt attatcccca ttttttggcc 6061 ttgctcagtc ctcccatctg attcaagctg gcagatcatt ttccctattg ggacctcagt 6121 gtccacacct ggaggatgga acatcagctg cttatgtggg tgtcccgtgt cctgagtccc 6181 aaggccacaa ggtgatgctt gagagtgaag gtagaatgtt acctgccatg tgtttgaggc 6241 gtgacaaatc ttgtatgatt gtgaggagga acttgtgtga gctggcagga gaagtgggaa 6301 ggagtgtgaa tctcagagcc actgtgacca gagccagctc cctgccctct tgtgggaggg 6361 acagatgaca gttataatta ttagcattac tagctgcagc taatggagtg ttgatgtttc 6421 tgccaggcac cgttctaaac acattatctg cattttttat 
ttaatccagg cacagagagg 6481 ttaactaggc ccaagatcac acagctagga aatgtccaac tctggggttt gagtccaagg 6541 gaggctggct tcgaaatccc atgcctctaa ccatctttcc taaactacct ctgcagaagc 6601 ctttggggat agaggtgcca gtgccccagg tgcaaacctc ctgagacagg agcctttgct 6661 gtgtccttca gcttctcata cctgccacca gctgaggcct gggacctggt cagctagaag 6721 aaagcagagc agggcagcgc ttttcaaact gcactcaagt ggcctgactt ttaatgttca 6781 cactgtgatt ctgtgtgggt cgggttgggg cctgcgatgc tgcactgctg accagctccc 6841 aggaaatgct aatgtcaacg atccaggaac acactttgct tagcaaggcc ctaggcagct 6901 gccttctgtt gtgcgggacc cctattgact ccaatggata tagcaccagg ttcaagaggc 6961 taccttcttt ggaagaggta gcaaacaaga tacggggttt tactgggggc ttagacacag 7021 ggaagagagt ccagtggcgg cagactgagc agaagaaccg caaccacttg caaatcatgc 7081 agtttatgta gcattttcat ttaacacctt ctcccaacca tctccaccta gtaaccttca 7141 tttaacccaa aacaaagggc ctcggtccct atacccctgt atggtcagtg tcccgtggga 7201 atggggtggg gctcagatgt tcctcataga taacgactgg atctccaggt tggccactct 7261 tggattcctt cgctcagaac tctgaacacc cattcaagtg tgcctgccat gcagggtcat 7321 cgtcagggga tgcccaagtc aagtttgcct gtcgggtgtg cctcccatac ccccacctgg 7381 tttgacttag cacctgctgg gcactggaag aagtgcaaag gggggttgca ggggtggccc 7441 ttatcagcct atgttcacag gtggcaccag gcactcaggc attctgcatc ctggaggcca 7501 gtgctgatca catgcctgtt acaataatca taacaatagc tgtccttgaa gtagtcctgg 7561 gtaccaggtg ccttcagtga ctttttcttc tttgccagaa tctcactctg tcgcccaagc 7621 tggagtgcag tggcaagatt ttgggtccct gcaacctctg cctcctgggt tcatgcgatc 7681 ctcctgcctc agcctcccaa gtagctggga ctacaggcgt gtgccgcagt ctcactctgt 7741 tgcccaggct ggagtgcagt ggtgtgatcc tggctcacta caacctccac ctcccgagtt 7801 caagccattc ttctgcctca gcctccggag tagctgggat tacaggcgtc caccaccacg 7861 cccggctaat ttttgtattt ttagtagaga cagggtttca ccacgttagc cagctggtct 7921 cgaactcctg atctcaggtg atcctcccac cttggcttcc caaagcgctg ggattacagg 7981 tgtgagccac tgtgcccggc tagtaacttt tatctcacgg aatcctctgg acgacttgac 8041 aaggcatggg tcttcatccc catttacaga tgaagaaact gaagcttagg gagtggaggg 8101 acttgccagg gctacacaaa atctgagagc cttgaagctg tagactggca 
agtgaacagg 8161 tacaggctgg gacagcagtt tctttctttt tttctttttt tagacagagt ttcgctcttg 8221 ttgcccaggc tggagtgcaa tggcacgacc tcggctcact gcaaccttcg cctcccaggt 8281 tcaagtgatt cttctgcctc agcctcccaa gtagctggaa ttacaggcat gcaccaccat 8341 gcccggctaa ttttttgtat ttttagtaga gacggggttt ctccttgttg gccaggctgg 8401 tctcgaactc ccgacttcag gtgatccgcc cacctcagcc tcccaaagtg ccgggattac 8461 aggcatgagc caccgcaccc ggccaaggga cagcagtttc taaactgtcc ctctctgatg 8521 cagaggggaa ttggggctaa atcagcaatg tgccttttct gtctcatatt tgaatgtcta 8581 ctctgcacga ggcgctgtcc tgctttgcat acagtgactc atttaatgtt tatgtcagcc 8641 ctctgaggaa ggtcctgtcc tattattaac ttcacttatt atgaggaaac tgagactcag 8701 agaggggagg gaacttgcca aagtcacaca gctggcaagc agcagagcta gacttgaacc 8761 cagatctgcc tgcactcaag tagaagctgt tcattgcttt gctcatttgc caattccact 8821 ttatgcaaaa aagagggggc agtgtggggg gaagagttag aatcagggtg gcagggtggg 8881 ccagtgcatt agccctgggc ttcagatgta ctggggttga attcctgcct gccgcttagc 8941 agctagggta cctcaggtag acaactcctg aaactcagct tccccctctg taaaatgggg 9001 tgacaaaacc aagatcttgg ggttcttggg gaaactgaca tgctgattgg tttttgtaca 9061 gtgcctggct ggtaacagca ggccctcagg ggtgcgtttc cttcctgggg actggagtgg 9121 gggttgcagt agactctggg aggcctctcc agctgcagaa tctccctcct ccctcctcct 9181 ttttgtcttc ctgacacaaa acccaccagc tgcacttctt tgggcttgca gtggctttca 9241 gttaccagag ccacctgtta aaacaaaaat gtgcctagga agagcctgcc ttacccattt 9301 tgactcacat ggcagttggt ggtggagggg aacaaaggag actgagtttc atcgaagcct 9361 tttgcttcgg aggaggaagg gaggatcaga gagaggaagt ggtctgtgtt cacacaggga 9421 ggcaggggag gccaggcagc ttcccaatcc tgcattcaac ctcagggtgg gcttgacctg 9481 ggtggctggg ggccctgtga tccaggagag acttgtccac ctgctcaggt gtcttgaagg 9541 ggtccctgtg gtaccccctg ggcggggcaa ggtagtagga ccatggtctg gctggggagg 9601 tggagaggag caggctgtgg gcgcagagtg aggttggaat ctgtatttac ccaaggtgtt 9661 gggggtaggc ttgccctcag cccttaatgt tctcaggccc ctgagcagtt gtgggggata 9721 acctctgcac tcctagtgac cagggagcta gaacagcaag gaatttgaac ttggacacca 9781 gctggggtca ggctctctgg gtctgagtcc tgatttccca ctttccagct agaggagctt 
9841 gaatgagtca tttaacttca cggtgcctca gtttcccctc tctaaaatga gaattatacc 9901 catacccacc tctcaaacac caagtgcagg cctggctcag agcaggtgct gcagcaatag 9961 ctgccattgg tcagcatcat catcatggtt ggtaatggtc ctactttgac ttttgagaca 10021 gagtctcact ctgtcgccca ggctggagtg cagtggtgca atctcggctc actacaacct 10081 ctgctcccgg gttcaagtga ttcttctgcc tcagtctccc aagtagttgg gattacaggt 10141 gtgcgccacc atgcctggct aatttttgtg tttttagtag agacagggtt tcaccatgtt 10201 ggccataaca atggctgtcc ttgaagtagt cctgggtacc aggtgccttc agtgactttt 10261 tttttttttt tttttttgag ctggagtctt cctctgtcac ccaagctgga gtgcagtggc 10321 acgattttgg ctcactgcaa cctctgcctc ctgggttcat gcgatcctcc tgcctcagcc 10381 tcccaagtag ctgggacttg ggatacactt gcccccgctg gtcctccctt ccacctctgt 10441 gaagaggagg tctcaaactc ctggcctcaa gtgatccacc cacctcagcc tcccaaagtg 10501 ctgggatttc aagagtgagc caccgcacct ggcccctgtt tagatgttag catcagtgac 10561 ccagcacctt gctatgtggc atgcagggag cgtgctgcta gacctccggg tttagagtca 10621 aatagcttcc tggctgtggt gtgcattaga ctttctaact caaggtcctc ccactctctg 10681 agcctcagtc ttgttgcctt taaaacgagt ttaagtgtgc tgagtcccta tgctgtggct 10741 ccacaggaat ttccccaggt ggaagacaca tcttgccttc tgtgaaacct ctcagcagca 10801 gagctgtcag gccccgtcag caggagacac tgtggggact gctcagtccc ttccactgtg 10861 tacctcggag ctggcggagc ctagatgagg ctgagcatag agggcttcct ggaggaagtg 10921 gagctgaaac agtttctcag cccagggctg ctctgtctcc tggcctcaca ctaaaagtca 10981 gttgagaggc catagtggca taagtcactg accctggcac tgcccagctc atcaccaaaa 11041 gcagggctag ggagggaggg gacattcgat tggcagtggg cacctgtggc tcatctgggt 11101 tctggccacg gtgctcaggt tctgtgagct gaccaggcag ccctggctcc tctgcccccg 11161 tgtgggttct gccaggtccc atggggcagg tcagcccctt ccttgttgca gggagagcac 11221 ccagcattgc tgacatggga cagggaaacg aggaaataac ggtgtggtca ttgaacacag 11281 agagcactag gtgctgtgcg aggtgctgag gacacgacat gatgacacag acaaggtccc 11341 ccctctcagc aaacggctca tgagggagac agacatgtta catacatgaa cccaaaaagt 11401 cagacgaaaa caaaacagag cgatgtgttt gggaggcaaa cccaactgcc ggagggcgag 11461 cagttgggaa cgtggaaaca tgagtcagat ctgggagtat 
ctgtcccagg agtccaagac 11521 ctgggtcctc atggtagctc tgccaccgac acactgagtg accttgggta agtgaaccca 11581 ccgccctgga cctctctggc acgcatctct tgagagcagg gacttagtgc atttcccgag 11641 ggcctccacg gtgcctggca catagtgggg cttagtaaat atttgttggt aactgaggat 11701 gcttcctgtt cacatcagcg ctgggaggat ttcctgctgt tcagacaaat gctgggctgg 11761 ctgtgagtca gccttgcaga gagcaaaggc agtgggaagg ggcgtgagat tcccctctgg 11821 agaggtcagg aggccaggca ctgtctcgac atgagtgcca gggagggggt gtggcctgtg 11881 ggcagggctt gggctgaggc agagggactt gagttccacc ctagctctac caccatcaat 11941 tttgtgtaac tctggacagg ccactgaact tctccgggct tagcctggca agtccatttc 12001 cccatctgta acatgggccg atatgtacat tgcctaggga ttaaatgaga taaagggtct 12061 gaaaacagta ggtagctgct ttatcattat tattatttct gtattattga tgtctgaggc 12121 taggcccaca gaggcagtac agtagagtgg ttaggagctc aagaatcaga ctagggttca 12181 aattctgact ccatcactga ctgttttggg gtacttcttt gaacctcagt ttcttcatca 12241 gtaaaatggg agtgaagtct ctaccttgct ggttgtaagg atgaaataag ataatgcata 12301 tagatggtct agcacatagt agatactcaa aagtttgagg ccactgctga cccttttccc 12361 tgaaaggaga caggagagcg gggtcgccac cccattgtca ttgtcatctg gaataggctg 12421 acagacttcc catggtgtgt tgcagttttc tagaaaattc agtaggaggc ctgcctgagc 12481 ttgagccacc tgtggaggtg cttcctgcct ctgctccaca cctgaaacgc gtctgggcct 12541 cttctcaggc agccgtgaga agggatgagt gctactggtc atggtgggca gctggctctg 12601 ctttccccct tcccagaggc gctcctgcct cctgcccagc tccctgaacc cctagcttct 12661 gcaccccggc actgtctggc ttctgccccg ctgagcaccc actgtctctg acgctgcctt 12721 gagtacttcc cgcatgttat tcaaatccca atcagatctt ccctccccca gtagctggtc 12781 ttctgttctg gcttcctgcc atcctgtcct ccacacagca gccgggaaag gtttttttaa 12841 aggggactct ccgatttaac acacttgggt ggaaaaccct ttgcttcggc ctctgcaatc 12901 tccctgcccc ctctccactt tgccctggcc tcatttctca ccactaacct cactctgcac 12961 tctggccaac tccccgcctg cttcctgatt cagacactaa gcacacgcag ctcccctgcc 13021 tggagccatt ctccctctcc ttctttcttc tccctggaga actccccctt taagtgatct 13081 tttcccaaca cactttctaa attgccccca ccccagtgtg atttttcttt atctcatagc 13141 acttggtctg cttcttatca 
cagtttgcaa ggctgagttc agaaaggtgt gtttgctcat 13201 tctgaggcag gagaggctac cttgtgctgc tgtggtaaca aacagccccc aggtctgagg 13261 ggtctgcaga gacccaggtt gacctcatac tgcttgtccc tccagggcct ccagtgaggt 13321 ttcggctcct tggatcactc agggccccag gcagatggga agattccact ctgaacattg 13381 ccaattgttg tgccagagta aagcagagct gggaggtggg ctcttgaatt ggcatttaaa 13441 tacttttgcc aggcagggta aggcagctca cgcctgtaat cataacactt tgggaggcct 13501 aggtgggtgg atcacctgag gtcaggagtt caaaaccagc ctggccaaca tggtgaaacc 13561 ctgtctctac taaaagtaca aaaattagcc gggcatggtg gtgggcgcct gtaatcccag 13621 ctacttggga ggctgaggca cgagaatccc ttgaacctgg gaggcagagg ctgcaatgag 13681 ctgagatctt gccactgcac tccagcctgg gcaacagagc cagactccat ctcaaaaaaa 13741 aaaaaacaac aacaacaaat aaataaatga ataaatactt tagccagaag tagccatgca 13801 gacctccccc caccagtccc acccacaagc ggacgtgact accgccccca ttcactgcct 13861 gatcctcctg ttctcagggg ctccaaggcc aggcctggtt tgaccttctg actttctgac 13921 ttcctcctac cttcccagta acctcatgca actcctttca ctcagcctca atcatcccca 13981 tgggtgttta aacttgccca agacatgccc ctttgaaaaa gcctgccatt ctcttgaccc 14041 acatgcacgt cctgccccct ccaaggctgc tagttccttt aggggcaaaa ttgtgaaaga 14101 gtagtctaaa ccttcttcct cttcttacct ccacttcttt cttaccttat tcccatgtgg 14161 attctaccct cactcaggcc tctagaacgg ttcctctacg gcagtggttc ccaatcttga 14221 ctacgtgttt ttttaaaaaa agtcctccac ctgggcctgc caccaaggat ttttctttaa 14281 ttgacctcag atggggttga ggccttggga actggccaga acttcccgtg ctcctaactt 14341 gcagccgggg ttaagaacta ctcctctgaa gcccccagtg cctgcgcttt tagcccgacg 14401 gacaagtttc tgcccttcca tcctgtgacc tccagcaggg cctgaccatg tgagttttct 14461 gtggctgccg tgacaagttg ccacaccctg catggcttca accaacagaa acgtgtgccc 14521 tggcagttct gggggccaga agtccaacat caagatatca tcagagccac atgcccactg 14581 aaggctctcg ggggaatcca ttccttgcct cttctggttg ctggtggctc taggcattcc 14641 ttggcttgtg gctgcatcat tccagtctct gcctctgagg tcacgttgct gcttcctctt 14701 gtgtgtgttt ctcttaaaac tctctgcttc tgtcttataa ggatacatgt gattgcatct 14761 agggcccaac cagataatcc aggataaact cttcctgtca agacatttaa taatcacact 14821 
ttgccatata aggtaatttt tttttttttt tgaggtggag ttttgcactt tcacccaggc 14881 tggagtaaag tgatttaatc tcggctcact ggaatctctg cccccaggtt caagcaattc 14941 tcctgcctca gcctcctgag tagctgggat tataggtacc tgccaccatg cccagctaac 15001 ttttgtattt ttagtagaca tggggtttca ccatgttggc caggctggtc tcgaactcct 15061 gacctcaggt gatccacccg ccataagtta atattttttt tttgagaggg agtattgctc 15121 tgttgcccag gctggagtgc tagtggctca atctcggctc actgcaacct ccgcctccca 15181 ggttcaaatg attctcctac ctcagtctcc tgagtagctg ggactacaga tgcatgccac 15241 catgcctggc tgatttttgt atttttaata gagaggggat ttcaccatgt tggccaggct 15301 ggtgttgaac tcctaacctc aagtgatcca cccacctcag cctcccaaag tgttgggatt 15361 acaggcatga accaccacgc ccgacccata taaggtaata tttacaggtt ctggggatta 15421 ggattagcat gtagacagct ttgtgggggc caccattcag cccactatgc taaccctgtg 15481 aaccgttgct cgcttctcct tgacatctga cggcctggcc ttctgcatac cacacaccct 15541 cccacctctc tggccacagt tctgtaggct cagcctcctc cgtaaggcca ttaagtgctt 15601 gtgctggtca aagtttcatc ctaggccttt tccttacctc ccttgatatt ttctccctag 15661 gtgagctcct tcaagcccac agcttctgtg cttacccaca ctcctaccta cattcccagc 15721 ttgggcttct caggccagct ctagactctt gtatcccact gggttcttcc acttaccttt 15781 ggatatctca aaggcatctc cagttggctg ggcacgatgg ttcacacctg taaccccagc 15841 actttgggag gccgaggtgg gcagatcact tgaggtcagg agttcaagac cagcctggcc 15901 aatatggtga aaccccatct ctactaaaaa tacaaaaatt agctgggcat ggtggtgggt 15961 gcctgtagtc ccaactactc gggaggctga ggcaggagaa tcgcttgaac ccgggaggtg 16021 gaggtttccg tgagctgagc tggagccact gcactccagc ctgggcaaca gagtgaaact 16081 ccgtcttaaa aaaacaaaaa acaaaaggtg tctctagtgt aacataacta aaaccaaacc 16141 aatcatgcct ccctcccccg catcctccct cctggaggga gctccaggac ttggtcttct 16201 cttccagagt tctctgtctc aaactgcggg aattgctccc cacccaggcc taacctgaag 16261 tgtgagcctt ggcatctctt tctatccacc tgtttttcct ctatgcacct cacaaccctg 16321 gtccaagcca ccgtcatctt tcaaatggct gcagtagcct ctaactggcc ttggaggagc 16381 catcctcttt ctctaaccag ctgccaaccc tgcaatggcc tctgtgtgct ttccagataa 16441 agcctgactc ctcgtggccc gcacagccct gcctgggtgg tcctatcctg 
cagcctctcc 16501 agtaccatga accctccctt ctctgaacct ctatttaatc catttcatat accccgtttt 16561 ctcctgccat agggccttgc acatgctgtt ccttctgcct ggaattttct tcctgcctcc 16621 ctccgcaccc ctgccttgtg ttgtgggttc ctcgctatcc tctagctttt cgctcaggct 16681 cattgttggc cctctagatg tattcacttc tcttgtttgt taccctctgt cataggactg 16741 tgttcgtact tcccaaggag tcgtcttggt ttgtgactgt acattttccc atgtgacatt 16801 tgcttaatgc ctctcccact ctggggcctg tacaagcccc aggaacagga cttggaccct 16861 cctgtttaac tctacaatct agcatccagc aggcgcgcag gccttcgttg acttttattt 16921 tattcttatt ttttattttt gagatgcagt ttcgctcttg tcgcccaggc tggagtgcag 16981 tggcgtaatc tcggctcact gcagcctctg cctcccaggt tcaggtgatt ctcctgtctc 17041 agcctcccaa gtagctggga ttacaggtgt gcgccaccac gcctggctaa ttttttgcat 17101 ttttagtaga gatggggttt caccatgttg gccaggctgg tctcaaactc ctggcctcag 17161 gtgatccacc cacctcggcc tcccaaagtg gctggattac aggggtgagc ccccatgccc 17221 agccttcatt gacttttagt tgacaactat ttagcatttg ctatgtgcca agaactccct 17281 gcctactaat gcagttaacc ctcatgaagc ctagaaggaa ggactgccat tctccccact 17341 taacagatga ggatgccgag gcacaggaag tgaagtgact ttctcagggt caagcaggga 17401 gtgagtggag gagccgagat tccagctcta accgcatgat gctctataca gtgtgactcc 17461 ggctctctgg ctgggccctc tccatagccc tgtgagggtt aaggatagaa aacagaggct 17521 cagagagttg aggtcccttg cctgaggtca cacagctggt tggccgttcc ctgggctata 17581 agcttcagta ttcccaatgc tgagcatatt ttgagaaccc gagaaacaga cgtttggctg 17641 ggtgggaact gaactcattt tgtcagggaa ttcaacaact aagttggccc tgagactggg 17701 tgtgaagacc gctctgtccc ctgccagctg gatgacctca ggagagatct gatgactctg 17761 aggtcctgct gataggacct ctggtgtctc tgttccctgc tggcctcccc tgggcctggg 17821 ttgggtttcc tctgcaggag gcagctcatg tatgtgctcc tagacgccct tgggccagca 17881 gctccttggc tgttcctccc tgagccaggg cagccaactt tcttatccag ctctccatgc 17941 tccccacccc agcatgagat gtcagctgag agttttctgg atctccccta gctaggggga 18001 aagcttccat catttggaac aggaacagca ggaacagcaa agtccctttc cccaccatct 18061 cccactgcct gctgtgcttc tcctaacagc tcatggtaaa caccctgact gagcggcagg 18121 ggctgtttcc tttgggctat ccatgtccac 
ctacactgcc ctttttaatc cttacaattt 18181 ttcttggaca cgggggcata atattccatt gtttttcagt tgaggaaact gaggctcaga 18241 gaggtcaagt gtcttgtctg aggtcacaca gcagaactgg gagtcaagcc agatgggctg 18301 cctccaagga tcctactctt aaactctaga gtactagaaa gatcttccgt tgcctaatat 18361 tgattcctga taggctatgc ttgagtagca tctgcttttg aaaatggagc ctgggtcggt 18421 tgcggtggca catacctgta atcccagcac tttgggaggc tgaggtgggt ggacacctga 18481 ggtcaggagt tcgagactag cctgagcaac atggtgaaac cctgtctcta ctaaaaatac 18541 aaaaattaac tgggtgtggt ggcacctgcc tatagtccca gctactccgg aggctgaggc 18601 acaagaattg cttgaaccca ggaggtggag gttgcagtga gaggagatca cgtcactgca 18661 ctccagcctg ggagacagag cgagactcca tccgtctcaa aaaaaagaaa acgaaaatgg 18721 atcctgaatt ttgaaatatg ctgtgactct tccctagttt gggacatctg ggtcaatccc 18781 ttttgttaaa gtagtttatt tagttggctg agagcgggag ctgcctacgt gacctggagc 18841 acaagctttg gaattgggct tgggttagaa ttccgcctct gccactcacc agctgcgatt 18901 aagaacaaag atactgggtt gggctcctgc ctctattact tgcaatctgt gtggccttgg 18961 atgagatatt taacacctcc gaacctcagt gtcctcaatt gtgaaagaga tcgagataac 19021 agctgaaccc acatcccagg agcggattaa atgagatagt gcagtacaga gtttaccgaa 19081 gtatatgggg tcagcagcca gccagtaaaa tggtggctaa tggttatcat gattaatgtt 19141 aacattaagc tctgaaaggt ccttcgtgaa ctcataggta tttgttctct ctctcccttt 19201 ctctctctct tccccctgcc cccttgcagg tagaactgca tgtccaccta gacggatcca 19261 tcaagcctga aaccatctta tactatggca ggtaagtcca tacagaagag ccctctctcc 19321 ctgggatttg agtggggtcc ccagctccac ccagaggccc ctggggaatt ccagggtcac 19381 tgttccttcc tgtctccctg tgggaatcaa gccagctcca ggccagaagt gggactgtga 19441 ggacatggag gcctcggcac tgagctgcag acccgcagac caactcctga gctttctggg 19501 cctctgagtc ttgtcctcct ggtgtcaggt gagccaggcc tgagcctgct ctccccaccc 19561 acccacatac gtgcatgaag gtagttccca gggctgaatc cgtctttttt tttttctttt 19621 gagatagagt cttgctctgt cgcccaggct ggagtgcagt ggcatgatct cggctcactg 19681 caacctccac ctcctgggtt caagtgattc tcctgcctca gcctcctgag tagctgggat 19741 tacaagcaca tgccaccaca tccagctaat ttttgtattt ttagcggaga tggggtttca 19801 catgttggcc 
aggctggtct cgaactcctg acctcaagtg atccacccag cttggcctcc 19861 cacagtgctg ggattacagg catgagccac tgtgcctggc tcctgtcttt tgacttaact 19921 gagagcctat atatagcagg tgatgtgctc acatgagatg ccagtacaat ttcttgagca 19981 tctcctagag ctgggctggg ctttatcagc tcattgaatt cctccacgct tggaagagga 20041 ggatacgctc tctgcatttt actgaggagg gaatgggctc agccaagaca gttgtccacg 20101 gtcacacaaa ttaatagcag atcaagagtt gaacccaagg ctgtctgacc cctaaggctt 20161 tactacatca tcagggtcat aacctgctag gagtcacgga aaagtggctc cccaactctg 20221 ggcctaaatc tctgcatctt ccaagtgaga acacacttcc tgcctcagct ctcagagatg 20281 ctagggggcc agagggtccc cctgttcccc agcgaggaag gttcttccct tcctacccag 20341 acctcaaggg ctcacagcag ctcctctctt aggaccagct tttaagggca gggactttaa 20401 aggccagtgg atctggattc aaatttggac atattatctc ctgtctgcga acttggtctc 20461 tatcaactga ggctaagaac aggccctccc tagagagatg acctaggagc taggggctcc 20521 ttgtccaccc agccctgccc ccgcagacct gtgttcctcg gatgtttgca caacactcat 20581 tttgtttgga gctgaaagaa ctcagcctct ctgtcacagt cttgaaattc agctcgggac 20641 ccaaatttga acatttctgc tccataagcc agaatcctgt tattcagagg cctgccctca 20701 tggagagaat gagggatccc gggggttgcc cccaactctc gggagcatct ccaccaactc 20761 cctgagagat ttctggtaag tccactattc tccatctttt cacacttcca gggaccttct 20821 tctgccccag gaagctgcca ttgatttaat tcctatttaa ctgcaaggca taagcacagt 20881 agcacctcct gtgtgccaaa cactccttta agtgcgttac ccgggttaag ttattgaagc 20941 ctcacaacaa tttgtaagat aggaactcta ttgccgtcat ttacagatga ggagactgag 21001 ccgtggtagg tggagtaagg tgcccagtaa gcacagggcg gaggtttgaa cccagatagt 21061 ctgcccccga gtccatggcc ctggccatta ccccctgtca gttagaggtt ttggtaagtg 21121 atgcccgtaa aatgcttagt tcagggccta gcacacatta atgtgctcca taaatgtcac 21181 ttaatgataa tattcttatt aattggagct tatatctcta agtggggtga aacctcttgg 21241 cttatctctg cctggccttt gcccatgtca agccgccaac ttgccacaag gcccctaatg 21301 aggtcgttca gtggggcacc aagatgagat cgaacccagg cactcattaa ggggtcacgg 21361 agggctcatc agctgcagcc aggggctggg agcgccgggt ggggctaaga gaaaggggaa 21421 aggagccgcc gggaggggca ctggtctgat cgtccattcc tcacaccacc tctgggcctt 
21481 ggagatggcg tgcggcaggt gccagctgga gcttggcctg aagtcagcag gcaggggact 21541 ggggagtttg tcacactcag atatgggtgt ctgtaaatgc acacaaatat gggctaagaa 21601 tggaaggagg aggggagccc ctggcctgag ccctgctagg cccaattcag tggccctttt 21661 tccagctctg ggactcaggc ctgcctcatt aactgtcctc acccatttct ccttcctcca 21721 gttcccagga ttctggcctt ttcaggggcc tctccaacct ctttctcagt cttgtttata 21781 accctgtcaa ctatttctac agagattctg aaactggctg ctctttcctc cgatcactgc 21841 cctggtctgg gccaccactg cccctccctg gtgctgtggc ctcctgattg gtctcagcca 21901 tctactctgg ccttcctctc tacgggccct gcagtgctgt agttggagca agagccttaa 21961 cccatggtct tcccagctca ttccccagct tccccatctc actcagagtc aaagccaaag 22021 tccacacatg ggccttaaag ttctgcaaag cctgcattgc ctctctgacc tctctaaggc 22081 tccttgctta gtccacactg gatgtttttc aaacatgcca gacctaggaa acagagagtc 22141 tgggttactt gcccaaggtc acacagcctt taagtcacag agctgggatt caaacccaga 22201 ccactgggct tcagagtctg ctctttctca tgacacacaa agtttcattt cttcctctgt 22261 gcacccctac atggaaaata ttatgtttta ctgacaaggg caccaagggc cttagagggg 22321 agcgctcctg cctgggatga tgtggtaaat aggggtggga gatggacttg acctgcaacc 22381 cctgcgctca tcctccctcc ctccctgggc tcctgatggt gggcttcttg tgactgtgtt 22441 gcccaccaag gccggaagag gaccagacag tgccccagca cagcagctgt ggctgaccag 22501 ggagtaggga tcatctaaga acagagcgtg catggtgctc acgcctgtaa tcccagcact 22561 ttgggaggcc aaggcgggtg gatcacctga ggtcaggagt tcaagaccag cgtggccaac 22621 atgggaaacc ccgtgtctac taaacataca aaaaattagc caggcatggt ggtgggcatc 22681 tataatccca gctacttgag aggctgaggc aggagaatca cttgaaccag ggaggtgaag 22741 gttgcagtga gtcgaggtcg tgccattgca ctccagcctg ggcaacaaga gcaagactcc 22801 gtctcaaaaa aacaaaacaa aagaaaaaac agagggtggc cctatgagga gccttcgctt 22861 gtgtgggtgg ccagggacag caagaggtgc cagggcccta ggaacagctc tttcctgctt 22921 caactttggg ctccagatgg gcgctttcca gctcagtctg agcagcttcg ggaagctgtg 22981 tcccatggga gacactggga gtcccctgtg ctctttgtct cctgtcgggc ccccacatta 23041 gctctctggc ctcagctctg gcttccctcc aatttgtttc ccacgcagca gccagaggag 23101 ctttcaaaaa ggtaaattat ttcatgctag tcccctgctt 
gaaatcctac agtgccttcc 23161 cagtgctttc agccaaagcc ccagtccctt cctaagccca gcctggccct gcctccctgg 23221 tgcatcatct gcacaaatgc ctgctctctg acctccagcc accctgcact tccaatgccc 23281 gcggcttcct gcctgcagct ttagtacaga cccctcccct gcccagaact gcccccaccc 23341 caaggcttct gctgaaatgt cacctcctca gagaggcctt ccctggctgc tctgtctaaa 23401 ctctgtgttg agaagttcct tcttgatggt tgttgaggag ggaggctgga gaagaagaat 23461 caaagaggag aaatagaaag caaaataatt tgttcttggg gacgggctgg tgctgggcac 23521 ggggaggcgc ccgtctctgg tgtgggcagc tgggtagatg gaggagccgt atttggaaat 23581 gtggaaccca ggaagggagt gatctagagg gaggggaaag gtggcgcgag atgcctgcct 23641 ctcaacaggt agccagacac atgggtctgt cttggtcact gctatctgcc cagtgcccag 23701 cacatcacag gccctcagtg gtggtgtgtg ggcatagaga attagaagct gtggacctct 23761 ggatccggag ctgaaaacca ccaaaggaga tgagttggcc tggccaggtg tgtaaaaggc 23821 agagtctgag agagaacgac cagagggcag agccccgcag gtggagtcct gggggctgga 23881 gggagaccat taggagaatc gcacatggct ggcgcagcag gtcccaggca aatgtggcca 23941 ctgggtttgg caatatggga gccagagccc tagtgtcatc tccctgcctt ctacccagca 24001 gttcccagag tgatatcccc aacagtgttt gacaactggt acaggctctt cagcggccac 24061 agttactggg caaggccttg tgagggtgac tttggggcag ctggccagca gtgggagggg 24121 aagcagtctc aggggtacct gaggcactga gctccgacct ccaggtgcca atgccgcacc 24181 agggcaccgt tcccctgcag gctcttacag ggattagggg ctggtaagga gcagtgatta 24241 ggggctgact agcaggctgg tgggcaccag catgacccct tggtggtacc ctctgggcac 24301 tcatggggac ttgggctaac agatggggaa gggagcacat tcagggggct taggaaacat 24361 atttatgtag ggaagcattt taatatttta gtaacagaag ctattaaagg acttacaaac 24421 ttacttacat acactaaaac actatttggt caaacttctg tttctttggc actttcctcc 24481 tttattcttt tttatttttt tgagacaggg tcttgctctg tcacccaagc tggagtgcag 24541 tggtgcaatc ttggcccgca gtagccttga cttccaggct caggtggtcc tcccacctta 24601 gcctcccaag tagctgggac tacaggtgca cgccaccacg cctggtgaat ttttgttttg 24661 aaggggtttc actgtgttgc ccaggctggt ttcaaactcc tgggcttaag tgatccgcca 24721 gccttggctt cccaaagtac tgtgattaca ggtatgagcc actgcacccg gcctcctatt 24781 tttctgcttc tgctttgtgg 
ataattggat gcttggacct cctgatttaa tcttctaatt 24841 tccttaactg tttactccta tttttcatca tcttgtcttt ttgttctact ttgtggagga 24901 tttcttcact tttagcttcc agttcttttc ttacatcgtg acagttgctg ccgcattctc 24961 ttgtaaattt ccgagggctc gttcttgggt tctgaatgtt ccctcctttc aaggatcttc 25021 tcatctcttt gaggatattc atgtcttttt tgttttggtt cttaggtttt catctgttct 25081 ctgtgctgtt tcctcggagt gcttttgtct attctgttgt tttgtccctc atgttagaag 25141 catttctttt ttttttcttt ttttttttgt gatacagagt cttgctctgt caccaggctg 25201 gagtgcagta gcatgatctc ggctcaccac agcctctgac tccctggttc aagtgattct 25261 cctgcctcag cctcctgagt agctgggatt acaggcacac accaccacac ccaactaatt 25321 tttgtatttt tggtagagac ggggtttcac catgttggcc aggatagtct caatctcctg 25381 acctcatgat cctccgacct tgcctgggag gccaaagtgc tgggattaca ggcgtgagcc 25441 accatgccca gcctagaagc atttcttaat gtctggtgtt ctctggctgt tgtatcttaa 25501 aaaaaaaagg ggggggaaac tgaggctcga ggtgaccttg tgagctggag cagagccggg 25561 atgggatgag gaggcaggag cgtgtgcaga agagagggag cccccctgag ctcgcaccct 25621 gcttcccgtg gctgggaggg gaggccgaga tgcttgggga gaaatggagg ctccaagcca 25681 gaggggctgt ttccagcacg ctcttactga gcgctgctgt agtccagctt ggtgtggcgg 25741 ctgtgggcag ggaggggaga gaggtctgag ctggctggcg gcccactggg cccctcccct 25801 gagcctccac cggccctctc ccagtgcgct gggctgggca agcctctgat gtgccagcca 25861 gatggagggt gaagtcctga tgcctgcccc taccctggga attgtgatgc tgcagttact 25921 gcccctgata acccctgact gggcatagga ccagctggct gagccagctc ctggggctga 25981 ggaggaagcc atgaacttga cctggcactt tccttgtctc caagcatcag tcaaccaagg 26041 atatggaggg ggtgtgtgca tgtgtgcaca catacacaca cacacacaca cacacttcaa 26101 cctgtttatc ccccttgaga tttgctgact tgtgcattgg gggtagaagg tgctggaaaa 26161 attccggtcc tggttctcag tttccccatc tgtccagtgg gagcagctgg actgagagac 26221 gcccatgtct cctgctgtgg tcctgcaagg aggctggcgc tcctgagtct gctccatcct 26281 ggcctgtcag gcctgcctgg atcctgcccc gggttggtcc accactcact gttttgtttc 26341 caggaggaga gggatcgccc tcccagctaa cacagcagag gggctgctga acgtcattgg 26401 catggacaag ccgctcaccc ttccagactt cctggccaag tttgactact acatgcctgc 26461 
tatcgcgtga gttgccccca acccacaggt cctagggcag cattgatccc tatgactagg 26521 accaggcctg tccctcagcc tgtgggggcc agagaagttg ctctgaaacc acagctgtct 26581 ttctcaccat tgtgtacact tagtgagtct ctccagtgcc tttaggcctc agttttccct 26641 tctgagatgt gggtgtgatg gactgaaatt gcttcaagtt ctacagagaa atggcagaat 26701 atgggagcta agaacacagg gtcagaggca gtgcagggct tgaacccggg ccatctatct 26761 cctagttcag ggcttcgtgt tgtgagggga ggagaggcct gaatataggg tgggggcggg 26821 gagatgtggg gaagattctc caaaaggctt tttctttttc ttgtcttgag tcgccaggga 26881 acagcactag gtaccgaaaa ggccagaagg ggtatgggcg agtactagag agaaatttcc 26941 atgactgctt tatttattta tttatttatt tatttattta tttattgaga cagagtctca 27001 ctctgttgcc caggctgaag tgcagtggtg cgatctcagc tcactgcaac ctccacctcc 27061 cagtttaagg gattctcctg ctttagcctc ccaagtagct gggatcacag gcacccacca 27121 tcacacccaa ctaatggttt tgtattttta gtagagatgg ggttttacta tgtttgccag 27181 gctggtctcg aattcctgac ctcaggtgat ctgcccgcct cggcctccca aaatgctggg 27241 attacaggcg tgagccactg cgcctggcct ccatcctcat cctgaagatg caagaacttc 27301 tggtgacccc ttctcctgag agtggcctga tctcccctgg gcagggcact ttcttcccac 27361 gctgggctct cccacgactt gtgtgccttc cctcacacat tctagtaacc acttcatttt 27421 cactcttcat ggtgggaact tccagctaag cacagtccac cgttacgtga tcaacacagt 27481 ggccctggca ggccaatttg tgccttgctt ctggaacaaa catgcagtaa taacaacgaa 27541 aatgttttga gcatttgtcc gctctgctcc aagcactgac ccgggtgggg tttatgaagt 27601 ttgactcatt tgtccccgca ataactcctt gacctaggtg tcagagggtg actaaccagg 27661 ggtcacacag cagataagtg tgggcacaag gatccaagtc catgactgta tcccacgtgt 27721 ctcccacatc caggcatccc tctggacttg tccagctgtg tccttttctc tcatttctct 27781 tccctgccag ccttaactcc atcaccaaca aatattgggc tactctgtcc taggcatggt 27841 cctcagctga gaggtcgcag ccatcccaag acagaggggt ccttgccaca tggagactgc 27901 attctagtag ggaatacagc aaactggctg ataagccata tgacacacaa tgttgagtag 27961 tgataaggac ctgggagaaa aagaaagccc aggagaatgg tggaggggcc gttttaagat 28021 aaggcggtct gggccaggta cagtggctca cgcctgtatc cccagcactt tgggaggctg 28081 aggtgggcgg atcatgaggt caggagatcg agaccatcct ggctaacaca 
gcgaaacgct 28141 gtctctacta aaaatacaaa aaattagccg ggcgtggtgg catgcgcctg taatcccagc 28201 tacttgggag gctgaggcag acgaatcact tgaacccagg aggcagaggc tgcagtgagc 28261 tgagatggcg ccactgcact ccagcctggg cgacagagca agattctgtc tcaaaaaaaa 28321 aaaaaaaaga taaggtggtc agggaaggcc tctctgagga ggtgaagctt cagctggctc 28381 taaaccaggg gagcgggaga gacgcagtgt aggacagtat cggggaagag caggcctgtg 28441 tcttctccgg tggcctcagg gaatgaggga gaaggaaggt gctggggagg ctggcaaggc 28501 tggaggatgc aggcttgtgg gcaggacctg ggagttgcga tgtcactctc cgtggcagga 28561 agctactggg gcttcgaggg gagaagtgat atgctttgat ttaccttctt aaaagattgc 28621 cccaactgct gggtggagaa caggatgaca ggggcaagca tggagacagg gaggccagtt 28681 agagatggcg tgattcaggc caggatgagg ggtgagaact ggtatgcagt tccaaagtag 28741 agctgatagg acttgcccag tgtctggatc ttatccagtg gatgcccaga gcttgggtct 28801 ggggatgaag tgggtttaat ctgccaaggg ttggggatgt catttgctcc tggagctccc 28861 aagggacttg gggaaggttg ttcccaaccc ctttcttccc ttcccagggg ctgccgggag 28921 gctatcaaaa ggatcgccta tgagtttgta gagatgaagg ccaaagaggg cgtggtgtat 28981 gtggaggtgc ggtacagtcc gcacctgctg gccaactcca aagtggagcc aatcccctgg 29041 aaccaggctg agtgagtgat gggcctggaa ggggccatgc tgagggtgtg gctgggaggc 29101 tcagctctga gactggaagg gcgaactgct gggaatccct gacccaagca agaccttgtt 29161 cttgccccca gtctggtcca tggcctcaga aagatgggtt taactctgtc acaagagacg 29221 tggttcccat cctccctttg ccgttatgtt cttaccttgg gcacaagtgt ttggctgtgt 29281 cttgctctgg ccacaggcct gctgtccagg aatgttaacc tgcttagcca cccaggattt 29341 ctgaggggtc tcccttgtca ctgatgctga tcagatctct aaaggcccta aaggtcctgc 29401 tctaacttca taactgaagt gagtctggcc catttctagc cccctgcctg ggcccccatg 29461 gatctctaag tggtatcaca aaaccaccct gccccatttt ctgagccatg attctgatac 29521 atatagaatg tgaacatcat ggcaggccca agcttagcaa tgctgtccat ctgggggtgg 29581 ggagggccat gttgacaccc cacacctccc actaagatct aggagcaccc agctgcttta 29641 agagctagag ggacatgcta gggcctgggg gcatctctgc cagtctttcc tctgaggcag 29701 tgggtcagtg ggggaggagg gtcctcccca aagcctcctc ttcctcctct gtcccagtcc 29761 cagagctgcc ctttaggcct tccttttgcc 
tcaggcccat ccctactcct ctcctcacac 29821 agaggggacc tcaccccaga cgaggtggtg gccctagtgg gccagggcct gcaggagggg 29881 gagcgagact tcggggtcaa ggcccggtcc atcctgtgct gcatgcgcca ccagcccagt 29941 gagtaggatc accgccctgc ccagggccgc ccgtctcacc ctggccctga cctcctggcc 30001 tagcagtggg gctgtacctg atctcccctg tgccccacag ccccatggtg tccccttgag 30061 cccactggca tgaacttggg gcttcatgaa acaactggag acctcctagg caggctcaga 30121 acttctggag atgttctccc cagggacacc atgcctttat agccaccctg caggaagctc 30181 aacaccaaat aggaacgtaa ctattgaaaa aaaaatctag gctagattct gatcagccca 30241 tagtcctccc tcgagaccca gtggaccagg ccccatcctg tctgggcctg aataggtctg 30301 atttccaaga tttctgaggg gtctcccttg tcactgacgc agatcagatc tctagagttt 30361 gtgcctcatg gtgcacagcc tcactgtgtg atattgggca ggtcacactg ctgctctggt 30421 tatgcaccaa gacacctcag ttgtgcactg tcacaaggag atgatcacac ttacttcatt 30481 cctctaccct caggattagt aagaaccaaa gagctacctg cacgcatttc ctctaatcct 30541 cgcagcagcc tgcaaagcag aactaccatt gcttagtccc atttgacaga tgaggaaact 30601 gaggtggagt gaggtgcagc ctcttgcaag gcacaaaccc tggatttgta tccggggaca 30661 tctagttcca aagcctgtgt tcattcattc tttcttaaac acttcagaat aactttattg 30721 gttaagagta cctaatacat tagcgagata cttcccaata ctagtgtgag ttctatttta 30781 gatgacgtgt taaacggtcc tccgtttcct catctgcgca tgggaataag cctaccatga 30841 gtgttgttgg aaacaccagg tgagagaagg gtccgtgtca tttactgagc tcaggccccg 30901 tccttggtgc tttacacaca tggcctcggc aaagcctggc cgtgaccctg tgcaatagct 30961 ggcagggttc tttctgaaaa gggcggaaac tgaggccata agcagagcag ttttccgcag 31021 ccatgtggtt aggacatagc agttaggatt tgaagacact gagccctgtt ttgtgctggc 31081 ctcccatggg gggtttgggt gggacagcag gcaggtaggc tgggaggtct ctccatggtg 31141 ctggtgacag agcctgggtg ggcatctgcc cacagactgg tcccccaagg tggtggagct 31201 gtgtaagaag taccagcagc agaccgtggt agccattgac ctggctggag atgagaccat 31261 cccaggaagc agcctcttgc ctggacatgt ccaggcctac caggtgggtc ctgtgagaag 31321 gaatggagag gctggccctg ggtgagcttg tctcccaccc atagttggga gaaatcacaa 31381 gaaccaggga ccatggtgtc tcctgagttc tgaagtgtgt ctttgttggg tcttaaggct 31441 tggaactgga 
atccccctgg gccaggcgtg gtggttcatg cctgtgatcc cagcactttg 31501 ggaggcgagg caggaggatt gcttgagcct aggagtttga gaccagccag ggcaacatag 31561 tgagatccat ctctgcaaat acaaaaaaaa gtagtcaggc atggtggtgc atgcctgtag 31621 tcccagctac ttgggaggct gaggtgggag aattgcttga gtccaggaag tcaaagctgc 31681 agtgagctgt gataatgcga ctgcactcca gcctgggtga cagagggaga ccctgtctca 31741 aaaaaaaaaa aaaggaagaa agaagaaaga gaaaagaaag agaaagaaag agaggaagga 31801 aggaaaaaga ggaagggagg gagggaggaa ggaaggaaag aaggaaggaa gggagagaga 31861 aagaaaagcc tccacttggt gttgggagtc ctgtgctgag cctgcttctg gctgtgattt 31921 gctgtgtgaa cctgggcaac actgtgtctt ctctgggcct ctgtttcttc tattgggatg 31981 actgagttgg agccgacatc tcaaaagtcg cttccagcgt gatgatgaat gggcctcctg 32041 tggagggtgc agcatggtgg agaagtcagg gctctggagt cccactgccc gggctcagag 32101 cttggttcca cacttcctgt ctgaccttgg tcacattact tgaatctcct gagcttcagt 32161 ccttcatcat aaaatgggtg ggataatagt tgtgaatatt agataatgta tacaagtcac 32221 ttcatatact acctgacaca tggtaactgg ctaatgagtg acagctacca cttagataag 32281 gacttggagg gtaaaagacc aggtttcccc atgctgttga agcaggcagc atgactagga 32341 tggttcaatc tccacagcat ggtcaaggca ggctgccggg gccctcccgc tagggcaccc 32401 atgacctggc tctccccctt ccaggaggct gtgaagagcg gcattcaccg tactgtccac 32461 gccggggagg tgggctcggc cgaagtagta aaagaggtga gggcctgggc tggccatggg 32521 gtccctcctc actgcctcct cccatacttg gctctattct gcttctctac aggctgtgga 32581 catactcaag acagagcggc tgggacacgg ctaccacacc ctggaagacc aggcccttta 32641 taacaggctg cggcaggaaa acatgcactt cgaggtaagc gggccaggga gtggggagga 32701 accatccccg gctgtcccaa cttcctgtat agagaggcag aaagcagggc gggtcccagg 32761 aactcgaggg gtggccccag gcccagacat ggggggagga atcagcatgg cctggggcca 32821 tccctgccag ccacacacct gctcttccag atctgcccct ggtccagcta cctcactggt 32881 gcctggaagc cggacacgga gcatgcagtc attcggtgag ctctgttccc ctgggcctgt 32941 tcaattttgt tccaggaagg ccaaagaggg aagaaacttt agggattggg catcagccca 33001 tgccgcgtct tttagatatg aaatctcttc gacaccctgg gaagcaggca ttgccgtcct 33061 catcttacaa atgaggaatc cgaggcccag atgtgctgtg gcttgactgg gattacccag 
33121 ctgctaacca gcagagctgg ggccctacag ctcatcagct ggagcagaac gctccattac 33181 tctgagggaa gcttccacac ttccaattct cccaactctg ccccctgggc atcgcatagg 33241 aagcaggagt ccctctggcc agcatgttct ctcttcctga cacctggccc ttgggacccc 33301 tgggcattcc cctgagcgcc atcttgaagc tttccaccgg aggtctgttc caccctgcct 33361 ggctcccatc ctggagtcta accagggtca aggccctcct tccgtcctgt cgccaagcca 33421 caggagcagt atcaggcctt aggaaaaagc cgccttcccc aagacaagga cagcaagaac 33481 tcagggtgac catggtcagg ccagcactta tccatctgcc aggcatatga gaaggggagg 33541 ggcttcggct ctgatgttct gatgacaagg gggtcttggg gcttgcttag ggacacgtgg 33601 cacctgtgga ggttcttgga ggcatgtggg tataccatgg gctggaaaaa gatccaggag 33661 tcatctgcac agatatggtg gctgaaggag aagcagtggc cccaggaggt ggtggagcaa 33721 gaagggccta ggatagaacc cagaaggaca atggtattta agggaccagc aaaagagaca 33781 agtaggagga aagtcaaaag tgtggtgtca cagaaatcca gggaaaaggt ttcaagaaac 33841 agtcaacagt gtgaaattct gctatgcaag tcgattatgg tcagagctag gaaagatcca 33901 ttagatacaa caagatggtg gtcagggatc gtgccaagaa cagcttccat ggtatgttgg 33961 agtagccagc tcccagtggg actgaggaac aagcagggta gggtgcagag gggaaggctg 34021 gagagggtgg cagccggagg gggatgttgc tttcttggct cccaccccca cgcccccacc 34081 ggctgccatt ctgcctggtt cccatgtctg gcccctctgc tgcctttgcc cagctctggt 34141 cttcaggatg ggctggattc tggactttct ggttacatag acttgaacaa gtcacctaag 34201 ttctgaattt atttccccct ctgcacaagg atcagatctt tcagatctgt ttgaggctgc 34261 tgtgaggatc aaaggcgggt gaacgtcaat gtgttctgac tatttatgta agagtaaaag 34321 gaggctgatt ctctcctcct ccctcttctg caggctcaaa aatgaccagg ctaactactc 34381 gctcaacaca gatgacccgc tcatcttcaa gtccaccctg gacactgatt accagatgac 34441 caaacgggac atgggcttta ctgaagagga gtttaaaagg ctggtgagtg ggtgtgagcc 34501 atactggcct tgactcgggt ttgggagtat ggtatctaca ggtccagtcc ggggcctgga 34561 atctttggag agagggagtg agtctgcctc aacagtccaa gacaagccca acctagacac 34621 tttccacaga gaagacatct ttgtgttgac gtcctgacct aggaccaggt ttttgatcct 34681 ttgcttgggt tgagtgcctt taaagaatcc agtgaaagct gtcaaccctc tccccagaaa 34741 ggtgtgtgca gcagctatga agtcttgcac actctcttca 
ggttgttctt aaatcccagg 34801 ctgaataagt ccattcctgc acgtgtctgc gaggtgtctc tggcccccta catgccaccc 34861 tgtctctcaa aggtttctcc aacttccttc tcacagccct ttttcatgta atgacaaatt 34921 aagaacacga cctcatggtc tctactctgg cacttgctgc cgtgtgacag tggacaaatc 34981 cttccccctc taagcgtatc tgcccatgtt gagtgaagag gatggactat cactacattg 35041 ctaagagctg ccttctttgt tctctggttc catgttgtct gccattctgg cctttccaga 35101 acatcaatgc ggccaaatct agtttcctcc cagaagatga aaagagggag cttctcgacc 35161 tgctctataa agcctatggg atgccacctt cagcctctgc aggtaggttc ctgtctgggc 35221 ttctgggcag ttgcctgtcc tggccccagt gtggctttct gtgggacttc tagcaagatg 35281 cccttccatt cttgggcagc gcatgaatgt gtgatgactc cctggtttct gggccctggc 35341 tgggagcagc gtctcattag atcggtttgt tttctataaa agttcttgag aggctgttct 35401 aaggggagac tttctgaagc ccagtcccaa aggtctgggc agttggggac acctccatgg 35461 ctgcccaaag ccaagggcag ggagaggggc ccaggctgtt ctgctccttt cttcctatgt 35521 ggtcttggca aggcatcttc ttgccatcat aggaaggagt tcctttctgg ttctggtgtt 35581 ctatgatttt tacaacatcc tgggtactac aagttgcctg atctttttgc ttctctgaac 35641 caacgagcag ggcagaacct ctgaagacgc cactcctcca agccttcacc ctgtggagtc 35701 accccaactc tgtggggctg agcaacattt ttacatttat tccttccaag aagaccatga 35761 tctcaatagt cagttactga tgctcctgaa ccctatgtgt ccatttctgc acacacgtat 35821 acctcggcat ggccgcgtca cttctctgat tatgtgccct ggccagggac cagcgccctt 35881 gcacatgggc atggttgaat ctgaaaccct ccttctgtgg caacttgtac tgaaaatctg 35941 gtgctcaata aagaagccca tggctggtgg catgcagcag gtggcatgta atttggtggt 36001 cttgggcggg ccgatgtggg caggatgagc atggagggag ctgggtcagc ctgctcagca 36061 gcagggcctg agcctaaggg tggctgtgaa tgccaggcca gagatcccaa tgctgtgggc 36121 caagaggggt ccagaggctg tcctccttcc agaagaaata aggcttctct ggttgttgct 36181 caaacattcc ctgaactctc agcccctcct aactctaggt tttaaggagt aaagcttcct 36241 tttgggttcc tgaagctggc agttggggtg agagcagatg agatggaaga gggctcatca 36301 gacactggcc ttggagggtg ctggcctctg cagaacgcca gcatcttctc agaatcgtat 36361 gttctagaag cctgggcgaa gtccggctaa ttgtggactt ggggaaaata aggcccaacc 36421 cctgtttttg caaggttaag 
gagaaataat cttaaaccag tcacacaaat catcggcatt 36481 tatttcctgg gtcctaggtg tcacttatcc tggtggacag ggcagaggtg gtcagatcgt 36541 tttgagccaa aatcccttcc ctaaaaatgg atctgtggag ctccatgagg gaacctcaga 36601 gatgcacaat gacagtttag ctaaaatggc ttaaaaaatg tgaattgatt gtcagctctc 36661 tccatatctg ctgaaaaaag gtttaaaatt tttaaaaagt ttaaaagtgt tttctaaaaa 36721 agggacaagc aggtctggac c // LOCUS HUMADPRF02 5167 bp DNA PRI 31-DEC-1994 DEFINITION Human ADP-ribosylation factor 3 gene, exons 2-5. ACCESSION M74493 NID g178158 KEYWORDS ADP-ribosylation factor 3. SEGMENT 2 of 2 SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5167) AUTHORS Tsai,S.C., Haun,R.S., Tsuchiya,M., Moss,J. and Vaughan,M. TITLE Isolation and characterization of the human gene for ADP-ribosylation factor 3, a 20-kDa guanine nucleotide-binding protein activator of cholera toxin JOURNAL J. Biol. Chem. 266 (34), 23053-23059 (1991) MEDLINE 92078170 FEATURES Location/Qualifiers source 1..5167 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(M74492:1319..1536,118..358,1197..1307,1525..1649, 2193..5092) /gene="ADP-ribosylation factor 3" /product="ADP-ribosylation factor 3" gene join(M74492:1319..1624,1..5092) /gene="ADP-ribosylation factor 3" intron order(M74492:1537..1624,1..117) /gene="ADP-ribosylation factor 3" /number=1 exon 118..358 /gene="ADP-ribosylation factor 3" /number=2 CDS join(211..358,1197..1307,1525..1649,2193..2354) /gene="ADP-ribosylation factor 3" /codon_start=1 /product="ADP-ribosylation factor 3" /db_xref="PID:g178160" /translation="MGNIFGNLLKSLIGKKEMRILMVGLDAAGKTTILYKLKLGEIVT TIPTIGFNVETVEYKNISFTVWDVGGQDKIRPLWRHYFQNTQGLIFVVDSNDRERVNE AREELMRMLAEDELRDAVLLVFANKQDLPNAMNAAEITDKLGLHSLRHRNWYIQATCA TSGDGLYEGLDWLANQLKNKK" intron 359..1196 /gene="ADP-ribosylation factor 3" /number=2 exon 1197..1307 /gene="ADP-ribosylation factor 3" /number=3 intron 1308..1524 
/gene="ADP-ribosylation factor 3" /number=3 exon 1525..1649 /gene="ADP-ribosylation factor 3" /number=4 intron 1650..2192 /gene="ADP-ribosylation factor 3" /number=4 exon 2193..5092 /gene="ADP-ribosylation factor 3" /number=5 polyA_signal 2588..2593 /gene="ADP-ribosylation factor 3" polyA_signal 5073..5078 /gene="ADP-ribosylation factor 3" polyA_site 5092 /gene="ADP-ribosylation factor 3" BASE COUNT 1019 a 1308 c 1310 g 1530 t ORIGIN 1 ccctcaggcc tctccgtttg tcatgaatga actctagcca tgaccttggg aaattctatt 61 tacttggaag tcctgacaag caaatgatgg tatgattttt acatttcctt tgtctaggat 121 caagtcttcc ataccaggga ccaggccaaa accagactgc cactgcccct acttgggccc 181 cactgtcccc agaaagcagc tgctgtgatc atgggcaata tctttggaaa ccttctcaag 241 agcctgattg ggaagaagga gatgcgcatc ctgatggtgg gcctggatgc cgcaggaaag 301 accaccatcc tatacaagct gaaactgggg gagatcgtca ccaccatccc taccattggt 361 aagagcacag cttggatgtg ggctttcacg ccggcagctg aggcagggca gggacctatt 421 cctggttccc caacctggcc tcttctcccc aggcaagtcc atacaatttc ccgggttcag 481 aaaagagggc agtgttgttt agcttactac caggtgctga ggctgctgtt tgccacctaa 541 cctggctctc cctgatgcgg gaacttccat ggcctggcac tcaacagctg ccctcaggca 601 cggagttggt ttcttctctg tgttgtttgg aaagcaggta aaaggctttg atgttcctgc 661 caaagcttgg aattacttgg gtttagatct ttggtggcca atcttggtta cagtttgctg 721 aagggagttc tccctacatt tgctatgtag atggggtctc cagtcagcat tgtcagctgg 781 ctctgctttg gggctccagg atgccagctt acttcccttt cttcacctct gcctctcagg 841 ccccacatct tagctgaggc cgactctggt agccaaagta ccagagcagg gctgagtcat 901 tttgtaggag ggagcaccaa agactgagtg gaaaagtcag gaaagttggc ccacgttacc 961 caggcaactg tgaacctctc aaccaacttc ctcatcctct ttgacttctc ttaggttttc 1021 attttggcct tcaggctgct aggttaacga ttatcaggtc tgaggaagcc ttagcttttg 1081 ttccattgtt ggtgttatgg ccaccactgg gggaaagcag aaccaaactc cattgtggtg 1141 gtgtttgcgg ggctagtgtt tgagccaagt ggccattgcc tgtttttttt ccaaagggtt 1201 caatgtggag acagtggagt ataagaacat cagctttaca gtgtgggatg tgggtggcca 1261 ggacaagatt cgacccctct ggagacacta cttccagaac acccaaggta tgctcaggcc 1321 tagcgtgggt 
tggttgggta tgagcagaga gtgtggctgt ctttgttaat gggtggccaa 1381 ctgttaccct gctgcagctc ctgggggcag aagaaaggga ggaaagggag caaatatttg 1441 gagggtgtca ggcaggtgtg gtcttttggt tggggggagg gggttacctg tcatcttagt 1501 gcagcccatg ctctgccttc ctagggttga tatttgtggt cgacagcaat gatcgggagc 1561 gagtaaatga ggcccgggaa gagctgatga gaatgctggc ggaggacgag ctccgggatg 1621 ctgtactcct tgtctttgca aacaaacagg tgagacttct tcccacccct gaatttggga 1681 gaccagcaaa ctggctgtgg aaattgttgg ctcttgggcc tattatatgg ttaacccttt 1741 tagtatgctc tcaatgttag aagcgtggaa aactgagcac ttggcttctt tatcccctct 1801 tcttgtcccc accccaagct ttatttagca acatttcagt cctcaggaat tgtctcaaat 1861 ctttctttca tcttgcctgt tatctacttt aggagaattg gcctcactcc tgctgtcaga 1921 cagacctgat tctttggtca gtctgaatca cgtctcttgc ttgcccacca tcctcgtatt 1981 ggttccaatt cttcagtaca gtgacctcta gtgcttggat cttgagttct aggtgactaa 2041 atccaaagta gctcctatag gatcttgttc tgggtttcca aaaataaagg aggaggggaa 2101 ggtgctggta cacatttctt tggtagatgg gtgttctaga agcctttaaa aaatcctgag 2161 gcttcttttt atgtgtctcc cacgtgctcc aggatctgcc taatgctatg aacgctgctg 2221 agatcacaga caagctgggc ctgcattccc ttcgtcaccg taactggtac attcaggcca 2281 cctgtgccac cagcggggac gggctgtacg aaggcctgga ctggctggcc aatcagctca 2341 aaaacaagaa gtgaaagcca gacagcccta acaaagcacc ccacccaccc ctgacatacc 2401 tactgtcacc ctgccccagt cctacccctt cctctccatg caagtgtggc cagggccctg 2461 ggtatcatgt ccacatgccc agcaagagcc ttgcctcccc tgcctcccct cctttttcct 2521 gtccacctat atgaccaatc cctaattgct gtcctgatga tgagtcattc caatttactg 2581 gatttaaaac aaaaagttgt tacggtttca tggggtggcc cctctctctc tctcacctct 2641 gggtttcggg aggtcgagtg gggtattctc tttgtttggc agtggttgct gtcctcttgg 2701 catctgagtc ctttccctgt ccccacaagc ccttctactc cctgtttgcc tttcctttgc 2761 caccctctcc ctgttttaca tggaaattgc acgggcctct ctgtgtttgc gtgcatgtgt 2821 gcgcctgtat atacatgtat agagagatat atttgtgggg gccgggggga gcggggaggg 2881 ggtggagtga agctcagggc ttgcagccag gacactgccc actggagcct gtcatctgcc 2941 ccttccgtgt tggtgagatt aagggaagaa ggtgggaggg cattagtcaa gccggactgc 3001 cccagtcttt gactggttac 
cctcctctct gcaggtaggt ttttctggat gataggacag 3061 atgatggatt cttcaccacc attttccttg tactcttctc tccacccttc ttagggctgg 3121 aggatggcat ttatctctag atgactgttc ccaaggagat gccactctgc tctgccttac 3181 tgttgggaag gagagaggaa cccactcact cttatcagct tagagtgttc aactgatcct 3241 tccccaccat cttcagatct gttcctctgg agcttgactg gggggtgggg atggcaagtc 3301 catggtgtgt gttagggacc cagggcattg ggggaaggga ggcgaaggcc tgaggttttc 3361 catcttcatt ctcttttgtt attaggagaa ggtgaggcat gggccaatcc cattcctgcc 3421 ttccagtgcc ccaccaccct tctgatctta ggggttggat gggtcatctg tccttggtca 3481 gaattttctc accctagtac tcctactggc ctgaccaggg ctgactgggt attctaatga 3541 ggagctggga agtggggtga catagctgac cttagtatca acccccctct tctccaactc 3601 ttgttttcta gtccaggata tgctaaagga cgaagatttt ctattgtttc caggccctca 3661 gaccatcctt gctgtccctt caccctccat gcccctgcca tcctcactca cttctcataa 3721 atggatctgg gggttagagt ggaggaaata aaatcctgaa gtggttgtca gtgcttgtgg 3781 gatgcagtgg tcttcaggat tctccccgca ttattatgta gttgttcaga agctgttgct 3841 actgcgtctc tctgtctgca gggggcactt gttcctgagg tcctcgcctg ggtcctgact 3901 tctgtagttt ctgtgaagac gtggctgtcc tgtgggctcc tgtgtgtgct ggctgcgcct 3961 gagctccgag acagctgccc ataacctgct aaggtcaggg cccagggccc cccacagcat 4021 ggctgaggga ccccttccca gtagggctgt cccaagcccc actccatggg gccaggtttc 4081 ctggccactt ttgttgatag cagatggatc aatattagta accccatcca agattgaact 4141 gtgaaagaga ggttccaggg gtatctccct cagtgccagt tacttagtga tgtgttaccc 4201 aggatgcagc aggtgtgtgc aggtatcttg ctctctaggc ctgtgtgtgt gtgtgagtgt 4261 gtgtgtatat gggtgtgggt gtgtttgtgt gtggtggttg tggggggggt gttggaggtt 4321 ctatttcatt taaaaatgta ttcatttccc tggccccaca gatgcttccc tccatacttt 4381 ttatttgctc tgtttcctca gatttatttt gtcttgcccc attgcctctc cttccttgaa 4441 ctctttcctt ttcaatctca actcattcac tgctgttgct catttgccct tttccatgta 4501 ttttttccta cttaggattt tccactctat ctttgtggtt catgaggact ttgcctcttc 4561 tcctgcctca ttccctgact ttctctagtt agatgtcagt cctaaatagg ttttcctcac 4621 ttcaggcctc acccctcacc ttgttttttt gggggctcca gggaccaatc tggggctgga 4681 aatgttagga ggttgccttg gtgctgcccc 
agatctgtcc agcagggggc agtaagtgat 4741 cttggtagta tccctggctg ctaagccctt ggcaggggtg gtcctttcta cattcccatt 4801 cacttaacag ctcttttggg attgggtgtt tcattccatt tctgcccact ccctctcctc 4861 tcctctctgg taggtttaat tttatgcttt ccctgattcc agctttctgc ttcctgagga 4921 ctcccgctcc ccccacccca aagtttgtct gtggtgttat agtggtaact gcagttcctc 4981 ctctggaatg tagactgtat atgtttaata actcaccttc tctatccttg ctcaaaatgt 5041 gggataacgc gatgactgtg accctggttg gaaattaaac ttgttttatg cagctttgga 5101 gccgaccttc atctttgctg tgcagcagcc gtgggacagt agtttcatgg tgaggggagt 5161 cctgttt // LOCUS HUMAFP 27553 bp DNA PRI 03-JUN-1997 DEFINITION Homo sapiens alpha-fetoprotein gene, complete cds. ACCESSION M16110 NID g773678 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 27553) AUTHORS Gibbs,P.E., Zielinski,R., Boyd,C. and Dugaiczyk,A. TITLE Structure, polymorphism, and novel repeated DNA elements revealed by a complete sequence of the human alpha-fetoprotein gene JOURNAL Biochemistry 26 (5), 1332-1343 (1987) MEDLINE 87185438 REFERENCE 2 (bases 1 to 27553) AUTHORS Dugaiczyk,A. TITLE Direct Submission JOURNAL Submitted (01-APR-1987) Dugaiczyk A., University of California at Riverside, Biochemistry, Riverside, CA 92521 USA FEATURES Location/Qualifiers source 1..27553 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="charon 4a library of T. 
Maniatis" /map="4q11-22" /tissue_type="liver" /dev_stage="fetus" repeat_unit 2609..2701 /citation=[1] /rpt_type=direct /evidence=experimental /rpt_family="x" repeat_unit 2835..2930 /citation=[1] /rpt_type=direct /evidence=experimental /rpt_family="x" CAAT_signal 3072..3076 TATA_signal 3114..3119 /note="putative" /citation=[1] /evidence=experimental prim_transcript 3141..22629 mRNA join(3141..3269,4082..4133,5096..5228,7516..7727, 9214..9346,10265..10362,11911..12040,14316..14530, 16188..16320,16889..16986,17469..17607,19255..19478, 20619..20751,22090..22144,22485..22629) 5'UTR 3141..3184 exon 3141..3269 CDS join(3185..3269,4082..4133,5096..5228,7516..7727, 9214..9346,10265..10362,11911..12040,14316..14530, 16188..16320,16889..16986,17469..17607,19255..19478, 20619..20751,22090..22134) /citation=[1] /codon_start=1 /evidence=experimental /product="alpha-fetoprotein" /db_xref="PID:g178236" /translation="MKWVESIFLIFLLNFTESRTLHRNEYGIASILDSYQCTAEISLA DLATIFFAQFVQEATYKEVSKMVKDALTAIEKPTGDEQSSGCLENQLPAFLEELCHEK EILEKYGHSDCCSQSEEGRHNCFLAHKKPTPASIPLFQVPEPVTSCEAYEEDRETFMN KFIYEIARRHPFLYAPTILLWAARYDKIIPSCCKAENAVECFQTKAATVTKELRESSL LNQHACAVMKNFGTRTFQAITVTKLSQKFTKVNFTEIQKLVLDVAHVHEHCCRGDVLD CLQDGEKIMSYICSQQDTLSNKITECCKLTTLERGQCIIHAENDEKPEGLSPNLNRFL GDRDFNQFSSGEKNIFLASFVHEYSRRHPQLAVSVILRVAKGYQELLEKCFQTENPLE CQDKGEEELQKYIQESQALAKRSCGLFQKLGEYYLQNAFLVAYTKKAPQLTSSELMAI TRKMAATAATCCQLSEDKLLACGEGAADIIIGHLCIRHEMTPVNPGVGQCCTSSYANR RPCFSSLVVDETYVPPAFSDDKFIFHKDLCQAQGVALQTMKQEFLINLVKQKPQITEE QLEAVIADFSGLLEKCCQGQEQEVCFAEEGQKLISKTRAALGV" sig_peptide 3185..3241 mat_peptide join(3242..3269,4082..4133,5096..5228,7516..7727, 9214..9346,10265..10362,11911..12040,14316..14530, 16188..16320,16889..16986,17469..17607,19255..19478, 20619..20751,22090..22131) /note="alpha-fetoprotein" /evidence=experimental intron 3270..4081 exon 4082..4133 intron 4134..5095 exon 5096..5228 intron 5229..7515 repeat_unit complement(6956..7181) /citation=[1] /rpt_type=inverted /evidence=experimental /rpt_family="Kpn" exon 
7516..7727 intron 7728..9213 repeat_region 8188..8198 /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region 8199..8502 /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" allele 8402..8403 /citation=[1] /replace="aaa" repeat_region 8503..8513 /citation=[1] /rpt_family="Alu" /rpt_type=dispersed /evidence=experimental allele 8691..8693 /citation=[1] /replace="tc" allele 8722..8724 /citation=[1] /replace="at" allele 8922 /citation=[1] /replace="g" exon 9214..9346 intron 9347..10264 allele 9691..9695 /citation=[1] /replace="ac" exon 10265..10362 intron 10363..11910 allele 10673 /citation=[1] /replace="g" exon 11911..12040 intron 12041..14315 allele 12634 /citation=[1] /replace="g" repeat_region 13818..14120 /citation=[1] /rpt_type=direct /evidence=experimental /rpt_family="xba" exon 14316..14530 intron 14531..16187 repeat_region 15092..15394 /citation=[1] /rpt_type=direct /evidence=experimental /rpt_family="xba" exon 16188..16320 intron 16321..16888 exon 16889..16986 intron 16987..17468 exon 17469..17607 intron 17608..19254 exon 19255..19478 intron 19479..20618 exon 20619..20751 intron 20752..22089 exon 22090..22144 terminator 22132 3'UTR join(22132..22144,22485..22629) intron 22145..22484 exon 22485..22629 polyA_signal 22629 repeat_region complement(23761..23771) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region complement(23772..23901) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region complement(23902..23912) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region complement(24201..24219) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region complement(24220..24527) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region complement(24528..24545) /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Alu" repeat_region 
26397..26409 /citation=[1] /rpt_type=dispersed /evidence=experimental /rpt_family="Kpn" repeat_region complement(26409..26481) /citation=[1] /rpt_type=inverted /evidence=experimental /rpt_family="Kpn" repeat_region 26482..26735 /citation=[1] /rpt_type=direct /evidence=experimental /rpt_family="Kpn" repeat_region 26736..26748 /citation=[1] /rpt_family="Kpn" /rpt_type=direct BASE COUNT 8819 a 4656 c 4994 g 9084 t ORIGIN 1 gaattcccaa tatctagtat tttctactat taaactttgt gcctcttcaa aactgcattt 61 tctctcattc cctaagtgtg cattgttttc ccttaccggt tggtttttcc accacctttt 121 acattttcct ggaacactat accctccctc ttcatttggc ccacctctaa ttttctttca 181 gatctccatg aagatgttac ttcctccagg aagccttatc tgacccctcc aaagatgtca 241 tgagttcctc ttttcattct actaatcaca gcatccatca caccatgttg tgattactga 301 tactattgtc tgtttctctg attaggcagt aagctcaaca agagctacat ggtgcctgtc 361 tcttgttgct gattattccc atccaaaaac agtgcctgga atgcagactt aacattttat 421 tgaatgaata aataaaaccc catctatcga gtgctacttt gtgcaagacc cggttctgag 481 gcatttatat ttattgattt atttaattct catttaacca tgaaggaggt actatcacta 541 tccttatttt atagttgata aagataaagc ccagagaaat gaattaactc acccaaagtc 601 atgtagctaa gtgacagggc aaaaattcaa accagttccc caactttacg tgattaatac 661 tgtgctatac tgcctctctg atcatatggc atggaatgca gacatctgct ccgtaaggca 721 gaatatggaa ggagattgga ggatgacaca aaaccagcat aatatcagag gaaaagtcca 781 aacaggacct gaactgatag aaaagttgtt actcctggtg tagtcgcatc gacatcttga 841 tgaactggtg gctgacacaa catacattgg cttgatgtgt acatattatt tgtagttgtg 901 tgtgtatttt tatatatata tttgtaatat tgaaatagtc ataatttact aaaggcctac 961 catttgccag gcatttttac atttgtcccc tctaatcttt tgatgagatg atcagattgg 1021 attacttggc cttgaagatg atatatctac atctatatct atatctatat ctatatctat 1081 atctatatct atatctatat ctatatatgt atatcagaaa agctgaaata tgttttgtaa 1141 agttataaag atttcagact ttatagaatc tgggatttgc caaatgtaac ccctttctct 1201 acattaaacc catgttggaa caaatacatt tattattcat tcatcaaatg ttgctgagtc 1261 ctggctatga accagacact gtgaaagcct ttgggatatt ttgcccatgc ttgggcaagc 1321 ttatatagtt tgcttcataa 
aactctattt cagttcttca taactaatac ttcatgacta 1381 ttgcttttca ggtattcctt cataacaaat actttggctt tcatatattt gagtaaagtc 1441 ccccttgagg aagagtagaa gaactgcact ttgtaaatac tatcctggaa tccaaacgga 1501 tagacaagga tggtgctacc tctttctgga gagtacgtga gcaaggcctg ttttgttaac 1561 atgttcctta ggagacaaaa cttaggagag acacgcatag cagaaaatgg acaaaaacta 1621 acaaatgaat gggaattgta cttgattagc attgaagacc ttgtttatac tatgataaat 1681 gtttgtattt gctggaagtg ctactgacgg taaacccttt ttgtttaaat gtgtgcccta 1741 gtagcttgca gtatgatcta ttttttaagt actgtactta gcttatttaa aaattttatg 1801 tttaaaattg catagtgctc tttcattgaa gaagttttga gagagagata gaattaaatt 1861 cacttatctt accatctaga gaaacccaat gttaaaactt tgttgtccat tatttctgtc 1921 ttttattcaa catttttttt agagggtggg aggaatacag aggaggtaca atgatacaca 1981 aatgagagca ctctccatgt attgttttgt cctgtttttc agttaacaat atattatgag 2041 catatttcca tttcattaaa tattcttcca caaagttatt ttgatggctg tatatcaccc 2101 tactttatga atgtaccata ttaatttatt tcctggtgtg ggttatttga ttttataatc 2161 ttacctttag aataatgaaa cacctgtgaa gctttagaaa atactggtgc ctgggtctca 2221 actccacaga ttctgattta actggtctgg gttacagact aggcattggg aattcaaaaa 2281 gttcccccag tgattctaat gtgtagccaa gatcgggaac ccttgtagac agggatgata 2341 ggaggtgagc cactcttagc atccatcatt tagtattaac atcatcatct tgagttgcta 2401 agtgaatgat gcacctgacc cactttataa agacacatgt gcaaataaaa ttattatagg 2461 acttggttta ttagggcttg tgctctaagt tttctatgtt aagccataca tcgcatacta 2521 aatactttaa aatgtacctt attgacatac atattaagtg aaaagtgttt ctgagctaaa 2581 caatgacagc ataattatca agcaatgata atttgaaatg aatttattat tctgcaactt 2641 agggacaagt catctctctg aattttttgt actttgagag tatttgttat atttgcaaga 2701 tgaagagtct gaattggtca gacaatgtct tgtgtgcctg gcatatgata ggcatttaat 2761 agttttaaag aattaatgta tttagatgaa ttgcatacca aatctgctgt cttttcttta 2821 tggcttcatt aacttaattt gagagaaatt aattattctg caacttaggg acaagtcatg 2881 tctttgaata ttctgtagtt tgaggagaat atttgttata tttgcaaaat aaaataagtt 2941 tgcaagtttt ttttttctgc cccaaagagc tctgtgtcct tgaacataaa atacaaataa 3001 ccgctatgct gttaattatt ggcaaatgtc 
ccattttcaa cctaaggaaa taccataaag 3061 taacagatat accaacaaaa ggttactagt taacaggcat tgcctgaaaa gagtataaaa 3121 gaatttcagc atgattttcc atattgtgct tccaccactg ccaataacaa aataactagc 3181 aaccatgaag tgggtggaat caattttttt aattttccta ctaaatttta ctgaatccag 3241 aacactgcat agaaatgaat atggaatagg tgagatattt tgtgtttttc ttgtcttttc 3301 tctatatcaa aattttttaa attataaaat ttgcattaat ttgtcttgat ttattattca 3361 tatttattat tccacatgga gaaaaaatat ttaactgatg gatatattta aatgaaagaa 3421 aaacttgtaa ctttacaaga ggtttacaaa gttatagcag tgtttaatgg atgaatggtt 3481 tgtatgtttc atgttgaatt aatttttaca cttcaatggt atgcatatta actttgaaaa 3541 attatatata tacacatata tgtacatata tatgaatata aataaaattt tatatgtgaa 3601 gaagccagaa ttatgctcct tcacataact ccctcagact agtaaaatag ataaaatctt 3661 tgtttttaat acagaaaaat gggtcattat ttgatggtct gaagaagaaa tattgtgact 3721 gggatatgaa tggcaaaccg tagtacaact atgttcaaaa gaatgcctga aatatatttt 3781 taaccatttg actttcagga cagttacagc actacagtac agggaaaaac caaacaactg 3841 gaagacaaaa tctggatttt agtgataggt ctactataaa ttatgcttgt taacttcatt 3901 ccttagtttc ctagttttct tttcctcaag tataaaatta agatgcttag gttatcccta 3961 atgttctttt aattctgaaa ctgtacagtt ctaactgaaa cacaaacatt catatgtaac 4021 aatgattact ttcttggttg cagttgaaaa cacgtttcat gaagtttatt ttgccttcca 4081 gcttccatat tggattctta ccaatgtact gcagagataa gtttagctga cctgtaagtt 4141 ttgcttatat aaatgtactt taaatgtgta aagcaaggat aagtaaatac ttaaataaaa 4201 ttgggtaccc ctgtgagctc ttaaaagcac aaaagcaatt tggacaattt caagaaaagt 4261 tactcatact gaatatcaac ttgatgttga agaggttaaa ctgttgacta atgtcttcga 4321 cattgacctt ttgattcctt gaaatctcat gagtcaaacc aaatcagatt ttagaaactg 4381 aagattagtg tctgatcagt gacaaccata tactaattca ggaatttttc tcatcagtac 4441 caacagggtg atattataat gttttctttt ctgtatacta tttaaatctt agcagcaaac 4501 cataggtgat aaaatattct atttgctgtt atttgtggag agtatgttag tctcttggat 4561 gtctttccat tccacatttt aaaaatttct aacaaagaat ttaaagtagt gtgttgctgt 4621 tactccttgc acatccaaac ctgcataagg attgctttga gtcaatccat gagcactgta 4681 gtcttgggtt ttagaccttg atcatactgg gaatagacac 
tgttagaggt ctgtctaatt 4741 accaattttt ttttgcttaa atttaaaagt aaccataaag aatatagata ccctcaatta 4801 tgggtacatt acagtagatg gatggtcaca gaaggagaaa ccactcttat gggaaatcca 4861 cttattttag cctttaacat ctatatgtat atttatggca aaagaaaaca agaaaaagac 4921 taaagtttct tctcagatga cctggaagct aattttacat aattttacaa atcaaatgtc 4981 taaacagatt acaacataaa tagaaaacaa aacaaacaaa tgaaaaacta tacttgagaa 5041 aaataagctt gctgcaggtc tgttccttaa ggattcacac gtatttttgt ttcagggcta 5101 ccatattttt tgcccagttt gttcaagaag ccacttacaa ggaagtaagc aaaatggtga 5161 aagatgcatt gactgcaatt gagaaaccca ctggagatga acagtcttca gggtgtttag 5221 aaaaccaggt gagtgaataa ttttaaaaaa gcattgtgat atttgacaaa aatttagcat 5281 gctgaagaga agatacaaaa atagcagtga aaaatgcatt taaatatttg aagagctatt 5341 gtatgaaaga gggattagat tcattctgaa ttgctaaaga gggcagaaga gaacaatagg 5401 tagttattat aaagagacca tataaatatg atgaactaag gttctgaaat aagattatct 5461 tgatgactat gggcatatta acttttttga gcttcagttt tcttatctgt aaaataaggg 5521 atgataatag ctcccatttc atagttagca tggaaattga tataacagca atagtagcta 5581 acttttatta tacacacaat gtgactggca ttattctagg gagcataatg tgtatattga 5641 taataaaaat attttatgac ataggggata gatagcactg atgaatcaga atggttgtcc 5701 agtgagtcaa gagatgctgg ctcgggcttc tgggcaggat atcagctttg cttacctata 5761 tttatttatt aaacatttaa aataatcctt gaagatagat gctaatcttc caactgagga 5821 agctgaggct cagagaattt aagtaacttt cttatgggaa ccaccaaatg gcagagccag 5881 gatttgaact agaccatctg gcttaaaatt gacagtctta gtagcttcat tacactataa 5941 ctatagtgaa tgtaagatgc atagcacatc gttagggttg ccaggtttag caaacaacaa 6001 caaaacataa tacccagttc aatctgaatt ttagataaac actaaatact tttcttagta 6061 taaggatatt tcattgtaga agctcaacaa ataatattta ttatttattt tatctcaaca 6121 tagaaacaaa cttgataatg attagaactc tccaattata aaacaacatg cccagagaat 6181 actctgttat ggtggggtta attaggtggc tgaaagacaa tgtacctgga atatcataga 6241 agagatgctc ctttaaggat atagtttaag ttctttccaa ctttgaaatt tatgaattga 6301 caaaaatttc tgttttgcat ctctattttt gtcttgttct gataatcttt tcaaaatgtg 6361 tataaaaaaa caagaataca ttatctattg caactttaca accaattaga 
ggttcaaggt 6421 aatgttacag atcgctgatt tattcttgta aattcaaagg tatgtctttt aaatgaggat 6481 tgggaattag aaatcttacg taagccttcc aggattctct aaatattact gtagcagcta 6541 taaaagctac ataaaagttc cctcagatac atgaaacaca tgtattcctc agatgctttc 6601 tgtggaatat tgatgctgtc atctgagttt ggtaagggta agtcacagag gaggaaacac 6661 atacatttta aaacatttta gctaaatatg taattgtggc caagaaaagt gtttttttaa 6721 aaaataatta tttcatttca aaatcatttt tatttataat tgaaaataat atgcagtttt 6781 ttattgtctt gtaaggatgg catgtaaaat gagcatttat gtctgaaatg tggtatgtct 6841 gtgtgtgtgt gtgtatatgt atatatgtat gtatatacca taatatatac atatgtattt 6901 gcaattccaa aagttacatc tttaatgaga atcatgaaaa tattattttg ctcagtttct 6961 ttttttattt aaatttaaat ttaaagttct ggtgtacacg ggcaggatga gcaggtctgt 7021 tacataggta aacacgtggc atggtggttt gctgcaccta ccaacctgtt gcctgggtat 7081 taagcccagc gtgcattacc tatttttcct aatgctctcc ctccctgcac tccactccct 7141 gacaggcccc agtgtgtgct gtttccctcc ctgtgtccat gaaacatggt ttatatattt 7201 ataaaagttt atatatccta gtccaaattt ttgataacac aaaaaggaaa aataaaataa 7261 ttcaaaattg ggaaagagaa acaaaagatt gcatggcttt tttcctttta ttgtttggac 7321 attaaagtct cattttccat aaggcagcaa agaaatctat ttcatcaggc tgaaacaaaa 7381 tacattagaa tttgtatgga aaatttttca gaatctatag ttctgatttt agacattaga 7441 aaatgtttca tgtgtcttat agatttttaa agcaaagtta gttgtctttc tccaaacata 7501 aaatatttct tatagctacc tgcctttctg gaagaacttt gccatgagaa agaaattttg 7561 gagaagtacg gacattcaga ctgctgcagc caaagtgaag agggaagaca taactgtttt 7621 cttgcacaca aaaagcccac tccagcatcg atcccacttt tccaagttcc agaacctgtc 7681 acaagctgtg aagcatatga agaagacagg gagacattca tgaacaagta aggatccagt 7741 ttaaaggtag atgcaaacct cagaaacaca gcaatggcaa gcctaattta gtatttttgc 7801 aatgtactca tgtactccca gtaagaggta taatgtttct ttggtgttgt gtctgctgag 7861 gtcccaggca aggtaattag gagagcaaca ttcaatgtaa cttggtttcc atagcacgct 7921 agatgtagta caaaccacag gacaaaccta ccagaggttc cattagtccc ttgagagata 7981 tacacctttt ttttttcttt ttacccattc tactctcaat tttaccttgt tcaagaatat 8041 attagtattg caccaaattg atgctttcca gagccatata ttggtgtttt gtgttcacct 
8101 aattgcctag aaataaaagt tagagatcca agcatggtca agcatggtca agatacacag 8161 acatggcaaa ggtttactga acacttagga tgttgtgggg ccgggcgcgg tggctcacgc 8221 ttgtaatccc agcactttgg gaggccgagg cgggcggatc acgaggtcag gagatcgaga 8281 ccatcctggc taacatggtg aaaccccgtc tctactaaaa atacaaaaaa aattagccgg 8341 gcgtgatggt gggcgcctgt agtcccagct actcgggagg ctgaggcagg agaatggcgt 8401 gaaccctgga ggcggagctt gcagtgagcc gagattgcgc cactgcactc ccgcctgggc 8461 cacagagcga gactccgtct caaaaaaaaa aaaaaaaaaa aaggatgttg tggaaacatg 8521 tctgcttgca cagaggatca gattaacact cacaaacaaa ctttgaagcc tccttctcct 8581 cttcctcttc attcttcttc tttcccccat tttgttgata ggaaattgaa ggttggaagg 8641 ctaaaacaac tggctaaggg cacacagcta gtacatttga ccccaggttc ttctgttgga 8701 gaagcctgta cgtaactctt caatcacttc tgccttccat gttacttcct aaaccagata 8761 aatagagaga ttgcccttag aacatctctg ctatggcgac tatccaggtg cataacccca 8821 ttctgcactg acaagggata aaatgtccat ctttctgttg cacttagcag aagtctggct 8881 ttgctgatcc ctgaaacata tctgagtagg cttattgaaa aagacctttt aataagaatc 8941 atggttagca tgtctgccta ttttcttctc ataatagaat ctggtaccat ctgtcagata 9001 tttttccccc caggatcttt agtgaggaaa attttgacaa catgtggaaa aaatatgatg 9061 attttcccct taggaacaca gtaaagacaa agtcaaatgc attttgttgt tgttgttttt 9121 gaatctttaa ataaatccca gtgtccagtt ccaagcagta gtagtcctat ttttaggtgt 9181 ttataaatct tctagctcta ttttatttca cagattcatt tatgagatag caagaaggca 9241 tcccttcctg tatgcaccta caattcttct ttgggctgct cgctatgaca aaataattcc 9301 atcttgctgc aaagctgaaa atgcagttga atgcttccaa acaaaggtat catatttgcg 9361 tggatatctg aaccagtact gtagtctatg actcattaaa acaaaacaaa gttaaaaatg 9421 aaaacgtgct taattgtgga gagtatcgtt tttggaatag agaaatagtt cagcagtctg 9481 atattcttcg agtgaacaaa actggatttg ctggttttta ttatctattc attaagtcaa 9541 caataatttt attattaagg aagtgagttt gatgggataa agaaggaaag acagacagac 9601 agaaagagag aaagagaggg tgctataaat gggaaggagt aagtgaaaga aaacggagaa 9661 aaggaggagc agaggagaga gaaaaaatga agaactatga taaatgcttt atacttactg 9721 tttttttaaa ccttagaaca aatttgaggt gtaagtattt ttgtccttta acaaactagg 9781 
aaagaggttt agagaagaaa atttttttta ggtttagaga ggtagaataa tatccaccat 9841 taactacttg ctaccagcag tcctataaca aaaatttcct gagaactgac tactctcagt 9901 gtcaagccgt gatacaatcg tattttttgt aactgagaag acacttaaga aattagcagg 9961 cttcaatttg tctttaagct gtttaaaggt acagttgttc atttatgatc cccagtataa 10021 aagttatgtt tttgtttcaa ggtactttga gtaaatttgt ctggcacaga tgcatataaa 10081 ctaacccaaa agaataaatg agtcaataat attctgcgat aatgtatgac atttataatt 10141 tttagtaaaa aacatatttt atggaatttc attttaccta tacttgttgt ttttctaaat 10201 attagagctt gtaaagaaaa tgttagtata tgctttcatg acattttgtt tcctctacat 10261 ctaggcagca acagttacaa aagaattaag agaaagcagc ttgttaaatc aacatgcatg 10321 tgcagtaatg aaaaattttg ggacccgaac tttccaagcc atgtaagttc aagttctatc 10381 tagggaagag ggtgagagct acagaactac cattttgcaa tttgggttcg tttttttaat 10441 tgttgctgtt ttagagaatg aagacccctt tgtgacctct ttgatgaggg ctaatgggat 10501 tagaaccatg aactcttagg atcagaagga agctaacgga taagtcagtt taacacttac 10561 taaagcctag ctgagataat acatagaaaa gacttttaaa gtttatgtta tttacctgtt 10621 ctttaagaca cttaagttct ggcctgccat caaattatac ctcatcacta gaccatattt 10681 ttctagctct tctacaaaat aagtcagcct tcactgagtg tcatttaaaa cttttgcctt 10741 aacaagaaat tctttatatt aattgtgttt cttaatcttc tataaggctc tttatagcat 10801 ttattgcttc ccataaaaat attcctttga ggcaataata ttagaatcta gtgtcaggag 10861 aaaagaacat tttaaattat ataacttctt taaagtactc atcaactctt ttatgataaa 10921 acatttctag tatatgaaaa tatctgagct gctaatcgaa tagtagtaag tatatatatt 10981 cagagtttat ttgattgctg tttggttaaa taacacgtta aagcatattg tagacaatgg 11041 aaatctagaa tgaagttttt agtaatagaa ttagttctaa agactgaaat ttcttcttgt 11101 aagaagtcag atttatgcct taactaccct ccaactcaat tagaaattga aagattaatt 11161 tatgacctac tttaaaaaaa tttatcttag taataaaata gaaattgaga tagtattata 11221 atacccttat catttgctat ccataagtga aagctaagtg gtctaaattt atagagaaga 11281 tccatcttta tcagaagcag gagtaacact attttctcta ggagcaatgg ttttacaaag 11341 acaactctta aaatttaatg aaatatgcag aggaaaccag aatttttatt tcttacttct 11401 tttttggcca cattactctc ttggcacact ggtatgcctg aaatgtttaa 
tctgctccac 11461 tttcttgtcc catttcattg atttgcacat acgcctcctt tcctggattc tcagaagtat 11521 tttttacccc agtaaaaaat ggttctcatt ttcatggaat tttccatttc ttaattgttt 11581 atccttttaa agttttatgt ccactcaata caaatgctca gcataagggt tgaatggcaa 11641 aaataattta ccttaattct taagacatgt tttaggaaaa gataaattat ttctaaacca 11701 tttgtggggc cagctcacag tttacagagt taccaccttg aaagatatgg ctggaatcaa 11761 gccttaaaac atgtttcctt ttctttttaa acaacaaggg aatttcagtc atttctcttt 11821 gaaacttcta ctgtagtaat actctataaa ttctatttta ttttgacaga taaccaagaa 11881 attaatttct aatttctttt ttttccctag aactgttact aaactgagtc agaagtttac 11941 caaagttaat tttactgaaa tccagaaact agtcctggat gtggcccatg tacatgagca 12001 ctgttgcaga ggagatgtgc tggattgtct gcaggatggg gtgaggagtc ttgcttctta 12061 aaatagaaga ttttcactcc cttttctttc tttttgtctc attctaaaag ggagaaggtt 12121 gtttgacttg aattggttac agagtatgta aactaggtga ctccttaaat ttgcagaatt 12181 ctcggtagta aaacttaaac catcttttgt tgatcctggc tttcacttta gctatacccc 12241 tttttgtgaa accaaggctc atctatttct tacttctaaa aaaaccgtgg gaacttctca 12301 gaaggcttct ccatagttac ttggaggacg ggaggaaact aaggtttaat gtatttattt 12361 tttcattcat ttattctttc atttgacaaa taaatatata ttaaatactt tctatctgct 12421 agccactatg acagacactt gttttaaaag cacaggctga cctcaaggaa ttcacagtct 12481 gataggagag ataagacagt gacctccttg gagttaggga ctgccttggt ttactgttat 12541 ctccatagca caatgcctgg cacatggaag gcattctata atagtttgtt aaatgaacga 12601 atgcaataaa aattgcacaa gtaactgtcc taccaggtaa aaagctagcc ttgccaaaga 12661 caagtgtgaa taaagtggtc tcggagaatt agaaaacaaa atttaaaaaa cccagcaaca 12721 gtttcttgag tgtgctctag tcagtggtgg tttaagcagt gtggcattgg cttattttgg 12781 ctaaccctag acttcctaga ttttacaaga gaggactgtt gaccctcaga ttactctgtc 12841 tctgtgggct ccatgacaca ccaaagagat taaaatccaa ggagcttaaa aattactgct 12901 cttaggtact caaatggctt gataaccagc actggactat ggtttccaga aggctgaaag 12961 tgaagataaa atgctcattt cagcccatcc gatggcaatt cagtgagatg cttgaagtaa 13021 gcaagaagcc gaggctgcag aggaggcctg agagtgacag tccctgagga gctggggaag 13081 aagtgggagg aggcagcctg gcaggcgact 
gtactacttt acttgttttt cttctagaaa 13141 tatggcatac taggaaaacc aacaagaagt attttgtttt tctttgagct cagttttccc 13201 attttgaacg gacaatttta ctgtttctcg ttgtcatttt taaaaagtta attttttcaa 13261 tttttggagc tcataaccac ccttttcctt taaagtggaa acattaattt cagcatgata 13321 tgtaagttgg attttgatag ctgaataatg ggttctaatt atctttgctg agaatgtaca 13381 gaattttcag tcccatgaca ggtatatatg taagctctgc ctctctctgg ccacttaggt 13441 gcattgccat tttattatct atagactgcc ctctgaaggt catagtcagt cactgcagta 13501 tgttctgata aagatgatta tcattcttac ggaatttctc gtggagcaga aagtttgctc 13561 tccatgttat gataccagtt gcaagtgttg tttaggggca aatttgaatg ctaatagaaa 13621 tacatatagc aacatgcatc ctattttatt tgagcacatt tccctcttat ttgtaaaggt 13681 tttcaattga aataacatag gatttagctt acaactatgg aaagaagaat tgaacaaaca 13741 ggtaagtgga aaggaatgag aaaaggcaaa agtggggaga aagcactaaa acgggagaca 13801 agttaaaatt tctttttaat tgataggtca cgttctcact ctatttgcct ttaagggaag 13861 aaagcaatca agttaatatg ttttccttca ttgtatagta tgtaactacg gacactatta 13921 gaggagggat ttgtgtagca cttaggacat tatacttgat aatttccaag ggtctttcta 13981 gatttaaaag tctgattcta acgtagtaat aaaaataaag gcccaatttt ctctttaata 14041 ttgcctgaag atattactct attattgcat taaaattaaa cattcacaca ttgtttgcac 14101 tgctaaataa aattatgtaa tttcttcttc tttccttcct ccttccccca tccctctcta 14161 tttccctttc cccttccttc tttcctggcc ttttttcctt ctttctttgt tcccttctcc 14221 ctccctcccc tttcttcctt tttctaaagc tggctttgag atcctttatt aaagaataaa 14281 tctttaaaac ttatacttta ttttccctgt tgcaggaaaa aatcatgtcc tacatatgtt 14341 ctcaacaaga cactctgtca aacaaaataa cagaatgctg caaactgacc acgctggaac 14401 gtggtcaatg tataattcat gcagaaaatg atgaaaaacc tgaaggtcta tctccaaatc 14461 taaacaggtt tttaggagat agagatttta accaattttc ttcaggggaa aaaaatatct 14521 tcttggcaag gtaacacact ctgtaaatgc atgttcatgc aagtaaaaat gattatgtgg 14581 ctgacagatt tgcgttgttg aaatggagag tgatgattat ggtttttgag ttcaatatgt 14641 gaggatattt ggctagaatg ttctgagcca aaatagattt cagtagataa ccagggaata 14701 agtaatggga tttggtgttt aacggtgaag cgttcaccac tgtgactcat taactgcttt 14761 gctatgaagc 
tgaattttat ttcacatcaa tttctctgga atcagaagca ttgtcatcct 14821 gtaaagatta ctcatatcaa ggccaccatt gaactctcaa ataggatatg gatatttttg 14881 taataagaag agttcatgat taagaatgaa ctcttgctac gcatgttaaa aaaaaaactt 14941 ttctccaaaa gataacacaa gagataatgc taggtagaag aacttttata ggaacagctt 15001 attggctatg tattaaatac atgttttgta ttttttaaga aaatcaaaac atgtttagag 15061 acatttgcag tacagtagtt tgttttaata caactgatag gtcacgttct cactctattt 15121 gcctttaagg gaagaaagca atcaagttaa tacgttttcc ttcattgtat agtatgtaac 15181 tatggacact attagaggag ggatttgtgt agcacttagg acattatact tgataatttc 15241 caagggtctt tctagattta aaagtctgat tctaacgtag taataaaaat aaaggcccaa 15301 ttttctcttt aatattgcct gaagatatta ctctattatt gcattaaaat taaacattca 15361 cacattgttt gcactgctaa ataaaattat gtaagctaga ataaagttca gatttaggag 15421 acacatagtg acaactgatt ggtgacagaa ctaatcctat aatctgggaa tacggttagt 15481 aaagtcaaga attaccttta agtttacaca tccatgcaca tctaaatcta attgtttaat 15541 agaagcagtt cttcagttgc aaaggttctt tgcagtagaa ttttctcagc caggaatgat 15601 tttcccccag atatttgcat ggcttctttc acttagctga tctctgttct gatatcagct 15661 gcctagagag aattttcttg accacattca aagttagtgg cctctccacc ttgggtatca 15721 tccttttttc tctttttcat ctttatttat tttcattgat ttatcgctaa ctgaaatgag 15781 atggcctatt tcttgtttat ttgttctgcc tccctataat gtgtgctttt cagagggcag 15841 gtatttatct tagacatcat tgagtccgtt ctgcttaaag caatgctagc aaagagtgga 15901 cactggaaaa atatttgttg aataaatgaa tataaagtcc gtaattgaaa agtcaaattg 15961 agagatgcag gagaaaacaa aaagccattt tacaggacaa tttgaaggat cacagtctgt 16021 attaacagtt ttgccattca tataattcaa atcatatttg attttcaggt ttatttattt 16081 gaatttaact tccacatgcc atattatata ggaataactg gagaagtgat ggctcctttt 16141 gtctcttagt tccaataact tgaaatattt ttctccacat atttcagttt tgttcatgaa 16201 tattcaagaa gacatcctca gcttgctgtc tcagtaattc taagagttgc taaaggatac 16261 caggagttat tggagaagtg tttccagact gaaaaccctc ttgaatgcca agataaagga 16321 gtaagttgct ctagaatttt aggggagtat gaaaaactgg attgatatca tctgttaaaa 16381 atgctgtttg tttgaaagcc tctagttttc aactagttgt tagccagtta tatctatttg 
16441 tctagatatt aagctgttat taactagcag tcagcagcta gtggcttgct ttagaaacaa 16501 aaatgttaat tgcttctcag ccttttggct aagatcaagt gtagaaataa aaatgttaac 16561 caaaagtcct ttgatccaca aataaaggta gtattcatta ttcatttttg gataacttca 16621 gaaaggcaag aatttggtac agaaagaact gtaaccattt atccaaagat tgagttttgc 16681 cattaaatga ttttgtgatt tataaaatgt taaacttaat ctccccaaaa tccattttct 16741 gtaattatca aaatttacac tttaccatat ttaatattta aacatctctg attggtttta 16801 taatagtata taatattgat caattttata tacaaagtta tgcatccaag aaaagaaaaa 16861 tgtatatgta ataattcttc attttcagga agaagaatta cagaaataca tccaggagag 16921 ccaagcattg gcaaagcgaa gctgcggcct cttccagaaa ctaggagaat attacttaca 16981 aaatgcgtat gtttttgtaa acagtatttt tagtgaatta aaattattaa agagaatgta 17041 gccttcccca attctcctcc tttggaagca acaagaatga cctgtgaggt ctgatctgtg 17101 gtattgactt taagttcccc atactgtgca aatttttgca gtaaagatat ccattctgtc 17161 atagtctgtc cgagttaaag caccaaaaga tcacagttaa aatcaatgaa gctccgaggt 17221 tgagaataca agtcaggctt cctcctggag tttgcttttt cattgtgata tgcttctata 17281 ataggagtac agggagtggt tgttagagta aatgcatctc aaaagttggt tcaaatacca 17341 tgtagatgaa acaaacgagc tgacctcatg tcttctggca tgagagtaga gagtctgtga 17401 gaagaagcaa ggcggctaaa aactcatgaa tgactcagca ggacttagtt aaaaaatgct 17461 tctttcaggt ttctcgttgc ttacacaaag aaagcccccc agctgacctc gtcggagctg 17521 atggccatca ccagaaaaat ggcagccaca gcagccactt gttgccaact cagtgaggac 17581 aaactattgg cctgtggcga gggagcggtg agtgtctgct tggtttggtc ccatctcatt 17641 tctgccctgt ttgacttgaa atagcctcat aattcccctc tagggaagac ggtaaaaacc 17701 aatgtagaga tggccttagg aggcttgttt gattagtcac ggttggaggg gtgtgagaac 17761 ccagctctgg atggctggca tgtggccatg cttcctattc ctccagggtg gctggtggaa 17821 gttcagccag tttagtcaac aatatctgag ccaactttat atatcagaaa gacagaacga 17881 ccaacatgta actcataatt cataacaatt cataattcat aacttcaaat cataatttct 17941 gctttttgtt catatactta ctttgatgtt ttaaaaaagc tttatctttg attgattaaa 18001 attagtcatg ctattttagc catatttata ttttcacctt ttgtaacatg atattattat 18061 tatgacatca ggaataattg gttccctttc gcagggtata 
gggtacagca caggataagt 18121 attatgttct gaatagtaaa atgactttcg agtcagtaat gccaatattc tttacttcct 18181 aatgtcacta gtatcataca taagattaca ggatgaatta aaaatatttt tcctataaag 18241 tcataattgc aaacaaaatt gtctatttta tcctttttcc tctttttcat aatggggagt 18301 tatttgctgt tagtcttcat gtcatacatt ttttccccaa aggttaagag taaaaggaga 18361 gttcttgtga ttaaatgtca cctcaattgt ttgttgaatt tcccatgctg ggaggctgca 18421 gggatgcagg atggtgtaat ggttcaggag tgtatgttcc ggaggccacc agccaacaac 18481 cgcattctac tttcactatt ctttagttgt atcacagtgg gaaagcaact tatgtcatta 18541 agcttgagtt ttttcatctg acatttgaga atacaaatta taccaccctc atacgacagc 18601 tgttttaaac aagataatct gcataactca cacagcacta gtctgacaga taaagtgcac 18661 acaaaacata ttatttctta ttacaagtta ttactaggtg attaagaaat atctcctaag 18721 taggcaaggt agcaagattc tacattagga aagtcttaaa aacccacaaa attgctctta 18781 cttcttttca attaggatga tatattagct gcaagtgtat acatgtgtat atgtatgtga 18841 ataaaagggg taagtttgtg ctattcttac cttcagatag tgattatcaa aagaaaaatg 18901 gaaagttcaa ctaaatacac atgggaaaca taaaggcaga gacatttttg tcctttagaa 18961 gtgtgtatgt aactggaagc atgtttcaaa tagctgacac aaatagctaa atgactatcc 19021 tcaacatcac atatggacca tctgctacta cttgctaagg cttagcccaa acaaatgggt 19081 aaatcctgga atttacaata taatgtcaca tgatcctaca tagcaaattt tcctgtaata 19141 ttaattataa attgctgggc attagaaatt attgcagcag ttttctgaaa aactgaacca 19201 actttgtgac taatgcccaa tctccttact tttttttctc attctcctaa ccaggctgac 19261 attattatcg gacacttatg tatcagacat gaaatgactc cagtaaaccc tggtgttggc 19321 cagtgctgca cttcttcata tgccaacagg aggccatgct tcagcagctt ggtggtggat 19381 gaaacatatg tccctcctgc attctctgat gacaagttca ttttccataa ggatctgtgc 19441 caagctcagg gtgtagcgct gcaaacgatg aagcaagagt aagaaactgt tacttgctag 19501 catggaaaag aatgacaacc ccaaagagta actgagactt ctacctcgct cacctaacac 19561 tattgggctc actaacagag cgttactccc aaaacactta aaatgccttt gaaaatagtt 19621 ttgtctcagt gtcttcacag tctcattggg gaagcaggtc tagaaaaatc gacgagggtg 19681 gacaatttcc tgtttgtaaa aataatctct gttgtaactg ttattgtgat atgtatttgg 19741 gggttgagga aaagtgggca 
atctattctg aggaattaga gtgtatcttt gcagcaaatt 19801 tgggtacttc cattccaagc acaggaaaca catcattgaa tcttttttta cactatttac 19861 actttgaaga gaataaccat cttatttaat tcaaccatgc agtttgggtg ttaagaaatg 19921 acatgtacat ttcagttcat tgtgggagct cttttgtaat ggtgatggtc atgcaagtca 19981 atggagctta tgttcttcaa actcccatgc attttaatcc tcacttgttt tgtaaatagt 20041 cttccttcat tggaaaaccc attcttctct tttttcctct atcacagtct gaggtatgtt 20101 tcacagtatg ataagaatgt tgcctgttct ggcaagcttt ttctattgct ctggtctact 20161 ttctattgct ctggtctaag tccaacatga aaggcttgct aagtgagcag tgcaggcaat 20221 tagtgctgcc agtgcccaga taaggggtgt gataactgga tgggcaggat tcggagatct 20281 gggtctttga gtgtagataa gacacagtta agaagagcgg acaggaaagg atattcctgg 20341 gggatgaggg gagattgcct tccactacac ataagtatgg tcaagtatga aatagtgttt 20401 tatccacaac ctgcacaact ccaggctggt ggaacacttg gcatgttttc agcctcaatc 20461 tttctactga aagtactaga caaggtgtgt gtggtcagtc tggtgatagg ttgatggagt 20521 aagggtttag gctctgaaaa ttctctacta ggaaggctgt agaaaaatag cattgcataa 20581 cagacttctc ttgtattttg ttttgtttta aatcacaggt ttctcattaa ccttgtgaag 20641 caaaagccac aaataacaga ggaacaactt gaggctgtca ttgcagattt ctcaggcctg 20701 ttggagaaat gctgccaagg ccaggaacag gaagtctgct ttgctgaaga ggtacatgca 20761 gctcatttca tactcaaaat acttgctatg gaattttctg tagtggataa tgaaaggaag 20821 accctacaaa tttataactt taaaatattt tcagagagat ttaaatttca ttgagaagca 20881 gattgaggga ttctataaga tttaaaaaat aatcacattt tcttgcttaa tattaggaaa 20941 atttataata ttaaaatata ttaatagaat tagtaatttt aatttatttc ctagtagaga 21001 aacccataaa gtgaatgtgt aaataattga tggtaattta gatagtttct ggcctaaaat 21061 tgatcaattc agctaaatgg attaaaggat ttaatagcaa attaattgtg caaacagagt 21121 attaggagtc tatttgtaga aaatgttttt gaactcattt agaagcttgc ttttgtacat 21181 caacagagta gtatttagga gttattttaa ttacatagta attttagctg gataattagc 21241 cagattttct ttaaccaggg gattctacct aacatttaaa aaaattacct tttttcagct 21301 ttattgaggc atgattgaca aatacaaatt atatatgttt agggtgaaca tgtgatgttt 21361 caatatattt atacattgtg aaatgattat cacaatcaag ataattggct aaattttaca 21421 
aatctttagt ttgtattgct acatatattt gaatatagca acactatact ttaaaaagat 21481 attctataac ttagcgtttt tgtcaatttt acctttctca ccatgtaaaa tccaaagaca 21541 gatatattta gaaatgtaga gtttttctat aaataatata attagatgca tttgagtgtg 21601 tgcacttacc agtatatgtg tgtgtttttg gtgggatcag gtagggtggg acatagataa 21661 ccaaattaga taaaactggt gaaacagatt tgatgtgaag catttctgaa aaacatgaca 21721 caagaagatt aatgttctct aatctgagaa gacatttatt tagatataga gaacatgaac 21781 aaatagtagc agtgctttat ctgcaaacct tttaatttct aataacttgt aatttgtaga 21841 ggaaggggaa agattgagaa tacgcattga tttggagatt gttatagaag aaaactgttg 21901 atgtgaaaga atattgtttt ctccctggct tttactatcc caggttgttg gcatcagaga 21961 tgtgtttctt catttttaac ttagttaatc tacaaaccta tgaattcacc ccggattgta 22021 gagtgttaac tgtatgattg gtataataat ccatttcttt atctgattat gtttattctt 22081 aattttcagg gacaaaaact gatttcaaaa actcgtgctg ctttgggagt ttaaattact 22141 tcaggtaaca aaacattcag acaagcctga atacaatgtt gtttctccag aaatatcaat 22201 ccataatgag atagatcatg aggagtgcca ttaattctct taaaaataca tggaattcaa 22261 aaaaaagttt attttaaaaa cacttgaaca aaattacgca cacaatgtta aattagtggc 22321 tcaactatgc aaaatccttt ttggttattt aaaagacttc aacaaatgct atcagaagac 22381 tttcctacgt atccaatatt tctctgatat aaaataatag aaccagttac ttactgcacc 22441 tattagttta attagtattt aatatatttt tgctcatatt gcaggggaag agaagacaaa 22501 acgagtcttt cattcggtgt gaacttttct ctttaatttt aactgattta acactttttg 22561 tgaattaatg aaatgataaa gacttttatg tgagatttcc ttatcacaga aataaaatat 22621 ctccaaatgt ttccttttcc aagtttgctt atttatgaaa agttatcgat aatttcttta 22681 gttttgtata ccattgtctg aagcagattc tgttaaaata gcattaagtg ttggttgtta 22741 taggagatta aagctatcca aggatggatt tacagcacta gatcacttgg tgaactgaaa 22801 aatgttccca agttaaacac tatttgatgc taccagggca ttttgtttat taaatgacca 22861 tcactgaagt attctaacag ataatctgga gatgagaaaa gaaattatta ttcttctatg 22921 ggatctaaga aaatttaaca tctacttttt ttctcaatct tgttcagttc tctattcaca 22981 gacattttaa aacataagaa tattacagtt ttgattgaat ataattaatg ttttcatcaa 23041 ttaaattttt attccacaaa tgtttattaa gccttcactg agttccaggc 
cttgggctgg 23101 gtacagtcac tgttctccag tcgcttcagt agagaaggaa gcaacacaat aatgatgcct 23161 cagattatga ccacaaggag ttgacaacaa attgttgagg gaacgaaagg gtgggtgaca 23221 tcagttctga ctggaaaagg tgccataaat gtttaagaag agtcttagaa aatgaatgag 23281 cattcatcag ttcaagtaaa agagaaggcc attttatgaa agaggaaaca tggtcagagg 23341 tactgggtat gacaaagtat gatgaatagt tttagaggag tacagggtag gtaggaaggg 23401 tagagatgta gaatgttagg caggatgtgg aagctatgaa gggctaatcc agactctcag 23461 ggctctattc taaagaggtt ggacaatatc ttatgagtaa cagaggacca cagtgatagc 23521 taagcagagg tgtgctatta ataattagct ttgaggaaga taatctatga gatacggagc 23581 atgggcaatg gggcagggaa aagggtagag ggggcaaagt cagcaagagg tgacaaaaag 23641 accattcccc cagtagggaa gctgtttatc ttattagctt tgaatagaaa tctgaaaatt 23701 acatgcatac tttggcctgg gctctgggag aagacaatgg ctttagagca ataaaatgtg 23761 ctttcggttc ttcttctctt tttttttgaa ttggaatctg ctatgttgcc cagctggtct 23821 caaaatcctg ggctcaaaag atcctcctac cttggcctcc tgagtagctg ggactagagc 23881 catgaaccac tacacctagc ccttttagtt ctaactagat tgctttctac ttgacatttt 23941 tctttccatt gtactgcaac atctatgaac tttgtctgat tttggtgtaa ttctaaaagg 24001 tattaattta aagatcacaa taaaataaaa tcacgttcaa tataaaagca ttaagctgga 24061 aaatgcttta tttttttaag ggaaaagcac aatgaaggta aaactgttag ggaaatttgt 24121 gagtcaaata atattttgtt gaaatatata aagaaaaaat gattggaggc agaaggataa 24181 attgacgtaa gtacaattgt aaattgggga tttttaaaaa tttattttat ttatttattt 24241 atttttttga gacagagtct cgctgtgtcg tcaggctgga gtgcagtggt acaatctcgg 24301 ctcactgcaa gctctgactc cctggttcaa gcgattctcc tacctcagcc tcctgagtag 24361 ctgggattac aggcacgcgc catgacaccc agctaatttt tgtattttta gtagagacgg 24421 ggtttcacca tgttggccag gctgacctcg aactcctggc ctcatgatct gcctgtctcg 24481 gcctcccaaa gtgttgggat tataggcatg agccactgca cccagccaaa ttggagattt 24541 taaaacatct ctacctcatt aacagaacaa gttaggcaaa taacaaataa agactcaggg 24601 tattaaactg ctgtagtcag gattaccaac aactttcaca ctgcaaaatc caatagtgga 24661 ttcttagtct ttagcttatc tgagttctct cagcagcatt tgacacaatt gatcactctc 24721 ttttccttga aacgttttcc tcacttggtg 
ttcggacttt cttccaacct cattggctgt 24781 ttcacctcaa tcttctttgc tgtgtcctcc taactctttg acctctaaat gttggagtgt 24841 cccagaacag ttcttccttg accgaaattc actcatccag actcagtgca ttaaatgcca 24901 tctagatgtt gacaacacat gtttatagct ccagcctgga tttcttcctt gacttacagg 24961 ctcttacgtc agcttcctcc ttaacatctc catgctgata atgtatagac aacacaaact 25021 tagcacatta caaattgaat gtgaatctac tccttgtccc aaatttgtac ctccatcttg 25081 tctatctgtc tgtttttatg agtcagacat tttggattta ttgatgaccc ctctcttgca 25141 tatcacccac acattcaatc tatgagcaaa gcttgtcggc tctaccttta aaaagtatcc 25201 agaatttgac tatttctcaa tactttggta ccagtcacaa tcatatgcct cactggggtt 25261 attgcaatag attcctaatt ggtcttttta caagtgacct tgctccaccc tcaaccctat 25321 tctcaaaagt gaacattatg tcacttttcc actcagaatc ctctagtggc ttcccacttc 25381 tctcagagta aaatgcaaag tctgtccaat gaccctacat gataatatct ttctgacctt 25441 ttttgttact cctctgatca ggtacagagg tctccctgct gctcttcaag ttaccaatca 25501 atgtcctgct cagggttttg ccctcatgac ccttctggct tagaagaccc tcagctatgt 25561 gtgaggctca cttcatcacc aacttcaagc ttctactcgt agttatctgc taaatgatga 25621 tgttttcctc aactacctta ttttaattca gaaatgccat tcccagtccc cacaacactt 25681 atcaccatct gacaaacact agattttact tgtttatttt ctgtctacct cactagagtg 25741 taagctccat gagaacagta acttttgtct cctttgttta cttcggctat cttcagcata 25801 tagaacagtg cctgaaacat aataggcatt ctagaaatat ttatttgagt atgtggagga 25861 atactacacg ccatgggcta tatatatttt cttataaggt ggtgaaacta gaggggttgt 25921 tagcttcctc taataaaatt tttttctaga gaaaaaaaaa aagcacaggc ctaagaaatt 25981 aacataaaaa ccataagata ccactgagga tgttaaagac gtgtgtgtgt gtgtgtgtgt 26041 gtgtgtgtgt atgcatgtgt gtgtgtgtgt gtatgcatgt gtatgtgtgt gtaggcatgt 26101 gtgtgtgtgt atcgtgtaga atgagaatta ggaagtggtg gcattctaga atagctggct 26161 tgaaagaggg tgggtaggtt tgggggatag atttttaact tgtgcttttg tctatacact 26221 aaattacaac gaagtctgcc acttgcttca ctgccactgt gaagcataaa tcagaaggag 26281 tagctcaaat tgcttggtga ccacaggata atattgggat aaggctaaac tccagctggg 26341 gcaaaagact ggtgaaaaca gctgggtgca gaacagctac tgttaaaaat taagtcttcc 26401 aatctcttta 
ttttatttta ttttatttat ttatttattt atttatttat ttttttgtat 26461 tatactttaa gttttagggt acctgtgccc agcaatccca ttactgggta tatacccaaa 26521 ggattataaa tcatgctgct ataaagacac atgcacatgt atgtttattg cggcactatt 26581 cacaatagca aagacttgga accaacccaa atgtccaaca atgatagact ggattaagaa 26641 aatgtggcac atatacacca tggaatacta ggcagccata aaaaaggatg agttcatgtc 26701 ctttgtaggt acatggatga agttggaaac catcattgca atctctttca tgcctacttt 26761 cccaggactt gtggattacc gtataaggaa tttgtctcct ccaatgatgc tgctttcaaa 26821 gagttaagga agggtccaga gaattgctcc atagcatgga attaaattaa taatcaataa 26881 cagtaagatc tctagaaagt ccccaaacat ccattaacta agtaacatgc ttgtaaacat 26941 tcatgaatca aaaaagaaat cagaagcata tttaaaaaat atttttgcct gaatgaaaat 27001 gaaaaatgtg tcagaattta tagaatgtca ctaaaacagt gacatttaaa ttaaaggaaa 27061 tttataacgc taaaggccta tgtgttatag aaaagaagga agatctcaaa tcaatgacat 27121 cagcttcttc cttaataatc tagagatgaa agagcaaatt aaaaaactag tcatgagaaa 27181 gaagataata aagatcaaag gagaatcaat aaaatagaaa acagaagaaa tcaatagata 27241 taatcactgc aaccaaatgc ggttctttaa gatcaatact attaataaac ctctaaccag 27301 atttctcact accaatgttt ggaatgagag aggtgatagt attacaagta ttaaaataat 27361 gagaagaaaa tgttgtgaaa aaaatatgtc agtaaattca atgacatatg aaaaggacag 27421 attccttgaa agacacactt acaaaagctc atgcaagaac aagtagttaa cgtggttatt 27481 tatgtattaa ggaaattgaa gttgtaatta agatcattcc cacaaagaaa actccaggct 27541 tcacgatgaa ttc
//
LOCUS       HUMAGAL     13663 bp    DNA             PRI       28-AUG-1996
DEFINITION  Human alpha-N-acetylgalactosaminidase (NAGA) gene, complete cds.
ACCESSION   M59199
NID         g1513066
KEYWORDS    alpha-N-acetylgalactosaminidase.
SOURCE      Homo sapiens DNA.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 13663)
  AUTHORS   Wang,A.M. and Desnick,R.J.
  TITLE     Structural organization and complete sequence of the human
            alpha-N-acetylgalactosaminidase gene: homology with the
            alpha-galactosidase A gene provides evidence for evolution from
            a common ancestral gene
  JOURNAL   Genomics 10 (1), 133-142 (1991)
  MEDLINE   91257820
REFERENCE   2  (sites)
  AUTHORS   Wang,A.M.
  JOURNAL   Unpublished (1991)
FEATURES             Location/Qualifiers
     source          1..13663
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /cell_type="lymphoblast"
                     /dev_stage="adult"
                     /map="22q13-qter"
                     /chromosome="22"
     mRNA            join(1321..1805,3530..3665,4167..4338,4815..4992,
                     5297..5391,6192..6353,9038..9235,10990..11133,
                     11635..13663)
                     /gene="NAGA"
                     /note="G00-119-445"
                     /product="alpha-N-acetylgalactosaminidase"
     exon            1321..1805
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=1
                     /product="alpha-N-acetylgalactosaminidase"
     gene            1321..13663
                     /gene="NAGA"
     CDS             join(1790..1805,3530..3665,4167..4338,4815..4992,
                     5297..5391,6192..6353,9038..9235,10990..11133,
                     11636..11770)
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /codon_start=1
                     /product="alpha-N-acetylgalactosaminidase"
                     /db_xref="PID:g1513067"
                     /db_xref="GDB:G00-119-445"
                     /translation="MLLKTVLLLGHVAQVLMLDNGLLQTPPMGWLAWERFRCNINCDE
                     DPKNCISEQLFMEMADRMAQDGWRDMGYTYLNIDDCWIGGRDASGRLMPDPKRFPHGI
                     PFLADYVHSLGLKLGIYADMGNFTCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFS
                     TPEERAQGYPKMAAALNATGRPIAFSCSWPAYEGGLPPRVNYSLLADICNLWRNYDDI
                     QDSWWSVLSILNWFVEHQDILQPVAGPGHWNDPDMLLIGNFGLSLEQSRAQMALWTVL
                     AAPLLMSTDLRTISAQNMDILQNPLMIKINQDPLGIQGRRIHKEKSLIEVYMRPLSNK
                     ASALVFFSCRTDMPYRYHSSLGQLNFTGSVIYEAQDVYSGDIISGLRDETNFTVIINP
                     SGVVMWYLYPIKNLEMSQQ"
     exon            3530..3665
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=2
                     /product="alpha-N-acetylgalactosaminidase"
     exon            4167..4338
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=3
                     /product="alpha-N-acetylgalactosaminidase"
     exon            4815..4992
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=4
                     /product="alpha-N-acetylgalactosaminidase"
     exon            5297..5391
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=5
                     /product="alpha-N-acetylgalactosaminidase"
     exon            6192..6353
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=6
                     /product="alpha-N-acetylgalactosaminidase"
     exon            9038..9235
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=7
                     /product="alpha-N-acetylgalactosaminidase"
     exon            10990..11133
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=8
                     /product="alpha-N-acetylgalactosaminidase"
     exon            11636..13663
                     /gene="NAGA"
                     /EC_number="3.2.1.49"
                     /note="G00-119-445"
                     /number=9
                     /product="alpha-N-acetylgalactosaminidase"
BASE COUNT     2924 a   3618 c   3630 g   3491 t
ORIGIN      Map position: 22q13-qter.
1 gagctcgccc ggggatctac cttgcctgat actaggtcat aagactctca ttccagaagg 61 ggtcctgccc tcctcccagc aggagggaat gccacacaga gagaacaaga agaatccgaa 121 tctgaacaga cattgctggg tttccacact cagctattaa cattaaatca ttcccttttg 181 tctaatcgcg tttgagcgca gttgtccatt ccttattcct tttttttttt tcctattgcc 241 cgggctggag tacagtgaca caatcacagc tcactgcaac ctcgacctct ggactcaagc 301 aatcctccca cctcagcctc ccgagtagct gggaccacag gcacctgcct ggctttttaa 361 ttttttgtag agacagggtc tcactatgtt gcccagactg tcgttaaatg tctggcctca 421 agcgatcccc ctgcctcagc cttccaaagt gctgggatta caggcatgag ccaccagacc 481 cggccagggc tgtccattct tcatcaaacc gagacataaa aataaagatt tccctgagta 541 tttgtgtctt cattcctgaa ggctctcaag tcacataaaa cattgagtaa atcaatctat 601 tatgcttttc tcttgctaac ctatatttta tttgtaggag tgtgcgctgt gacctttggg 661 atgagtaagg gaaaatatca cacctttctg cccctacaat gcccagaaca aagcatttgg 721 cacttagcag tttctcaata aatacttgat gaatgaatga agaaggaagt agcccagtct 781 gtgcgagtag tccctgagcg tcaggcgggg aaggagagcc ggctgccttt gcgggagctg 841 tgaccgcctg ctttggtgcc tcccacgcac cctccttgta gagacgatta atcccagact 901 tccattttcc tagctagctc actaggcggg ccttgcttca gagcagtctc tcatttcctc 961 tgcacaaagc cctcaaccgt cctctgcacg gccgcttact gagcgcccac ggtgagcctg 1021 gcacgatcct tgccctcttg gtccccacag aggtgaagac aggctcgcgg ggtccgggcc 1081 acgaaggaca gccccgagtc cgttctgcca atgccagcct cgctagcacg caggcgggag 1141 cccaatgggg aagggcttag ggaggctggg aggccggacc
catatgggcg gggcacagcg 1201 ttgctagggg ggattctgat aggttgcgcg aagggttgcc ttcgtgaggc cccccctcac 1261 tggaacgctt cggacttatc aggttaccgg attcgagtca gaagcggcgg caggtctgaa 1321 cgcttcggta gcataagtgc ggtgggaggg ggctctgcgg ttttggggac ccagggcggg 1381 aacccggacc cggctggaag tccggagcct gccgcagccc cgcccctccg ctctttcttg 1441 gtgaccttaa gccagtggct gcctttttct gagcccgggc ggggccgaag gcgcccgtag 1501 gccctcggga ctcccagcac tgcagagggt gtgaggtctg acatccaaga cacgttgttt 1561 cgtatttctg aaggaagaac tcaagctccg ggaagtgatg gctggggatg gggcgggcaa 1621 cttggggacc gagtgtacga tccacgccta aggttgaggg cggccgagct agccaggcag 1681 ccgtgacccc agtgcttttc agacgtttct tagcttccag agcccaacac atacagctga 1741 tacacgcaga ccagatctgg tcaggtcctc ggaagctgag tccagagcga tgctgctgaa 1801 gacaggtacc tacaccggct ggggtggctt ggaagggtag gggtgagtgc tacaaggagt 1861 gctttggccc ttcttcccta atgtgggtat gtggagagac aggttcctct ctcttttact 1921 tagagctggg caggaaggga ggagatgctt ccccatacgg gaatcatttc tccttcaaag 1981 gggcccttcc tttctcactt cctctcccct caagcacaga ggccctaact gatggatatc 2041 agcatgtcgg agaaccgcct gtacagtccc tacagtaact ccatattaca gataaggaaa 2101 ctgaggccga gggggaaagg tctcacttag gtgagctgat tgggtgccta tcctggctgg 2161 cttacttctt aaccgtctct gaggcctctt gagggtctgt ggtagagagc aaagagctct 2221 ggactggggc tcaggagacc tggagtctgg ccaatggtcc cgacttaacg atttttcaac 2281 tttactgtgg tgtgaaagcg acgcacattc agtagaaacc acagtacgta tacaacctgt 2341 gtttcagtat tcagtaaatt gcatgaaata ttcaacaatt tattataaaa taggctttgt 2401 gttagatgat tttggccagc tgtaggctaa tgtgcgttgt cgcacttttt tttttttttt 2461 ttttgagacg gagtctcact ctgtcaccca ggctggagtg gagtagagtg gcgtgatctc 2521 ggctcactgc aacctccatc tcccaggttc aagcaagtct cctgctcagc ctcctaagta 2581 gctgggatta caggcactca ccacctcacc ccggtatttt ttttgtagtt ttagtagaga 2641 cagggtttca ccatatcgcc aagctggtct tgaacccctg acttacggtt gatccgccca 2701 cctcggcctc ccaaagtgct ggaattacag gagtgagcca ccgtgcctgg cctgtccaca 2761 tttttaaggt aggctaggct cagctatgat gtgtgccagg ttaggtgtat taagtacatt 2821 tttgacatga tattttcttt tttattattt tgggatgggg tctcgcactg 
tcacagccca 2881 ggctagagtt cagtggcacg atctcagctc actgcagcct ccgctctggg gttcaagtga 2941 tcctcttgcc tcagcctccc gagtacagcg catgccacca cgcctggcta agtttttgaa 3001 tttttagtag agacaaggtt tcaccatgtt ggcctgacct caagtgatcc actcgcctcg 3061 gccttccaga gcgttgggat tacaggcatg agccactgca tctggctgat gtgatatttt 3121 caacttacac tgggcttatg agaacgtagt tccgtggtag gcctaggagc atctgtactg 3181 ttaagttcta tatatgggat gggaggagct gatgggggtt ataggaaggt taggacataa 3241 ggtgttagtt ttgctaattc caggtgttag ttttgctaat tccagcaatg ctgtggtgac 3301 ggccagacct caagccactg cctggcccag gaccatcctc ccacttactg cttgcaagag 3361 ccagccattg caggaagagc cattctaccc taaacccagg agtccagttt cccttatata 3421 taggcaggca gcctctgtag ggctgagcaa catacagatg gggtgggatg agggctggag 3481 ctgtgggggc aactgatagt caccatcccc tccgtccgtt gcccggtagt gctcttgctg 3541 ggacatgtgg cccaggtgct gatgctggac aatgggctcc tgcagacacc acccatgggc 3601 tggctggcct gggaacgctt ccgctgcaac attaactgtg atgaggaccc aaagaactgc 3661 ataaggtgag taaggtccac actcacctga tgagccttgc ccagggccct gcagattcgg 3721 gggcaggaag tggagagtcc actagaaaag agggcaaggt cagagttctc catcccagag 3781 actgcagcca gaagtcactc ggactgtctg acacctagcc ccatgacctg tctggagcct 3841 ctgcagggac tctcaggccc caacctggtc acgaggcagg gaccttacat ggacccagcc 3901 cctgagataa ctcttagaag gtcagccagg gcccctgtac ttcccctgac ccagtcctgc 3961 cttttagcct ggggccagcc actcctttca agccagcttg cttatggctc atcgcacaat 4021 ctctggtggt catttacact gatacctggc ttagggctca gcaaggtcat ggtttttata 4081 gcccagattc aatcagagct gaggcagggt gtgggtgagg gccgggctgg gtccctgagc 4141 taccccatcc ctctgcccct cactagtgaa cagctcttca tggagatggc tgaccggatg 4201 gcacaggatg gatggcggga catgggctac acatacctaa acattgatga ctgctggatc 4261 ggcggtcgcg atgccagtgg ccgcctgatg ccagatccca agcgcttccc tcatggcatt 4321 cctttcctgg ctgactacgt gagccgcacc cagccctgcc cttgcgctca gggctagacc 4381 actcctttca ccctacctcg cttagggctc acttacgtga ttgagatctc acagtctagt 4441 ctagggggaa acagacaaat gaacaccgac ttttggccta ttctcgttga gtgagagcca 4501 gaaagggcat ctcacacagc ctggtattct aggaaggttc cctggaggag gcatttcttg 
4561 agctgagccc agagagatgg cattggaggc agagaggatg gcaggagcaa aggcctaggg 4621 gcaagaaaca gccgctggga ggtgtgtgtt gggggagagg tgggccatca tgggcagagg 4681 ccgggcagca tgctgggacc atctcctttg gtccgtggga gccactggag gtttaatggg 4741 acggggtcag atatgccttg aggaggctcc ttgagggtcc tgcttgagcc cactgtgtcc 4801 tcaaaccccc acaggttcac tccctgggcc tgaagttggg tatctacgcg gacatgggca 4861 acttcacctg catgggttac ccaggcacca cactggacaa ggtggtccag gatgctcaga 4921 ccttcgccga gtggaaggta gacatgctca agctggatgg ctgcttctcc acccccgagg 4981 agcgggccca gggtgagtta cgcagctggc tggggcacgt aggcaaacgg ggagggtggc 5041 catggagacc cacccagcgc ccacctggct gagcttccca attcccccta cccagggtta 5101 gggggattgt ggctccaggc tcctaaaata tgagtggcag catagcccac cctggaagca 5161 tttgaggctc tgggtatgtg gggaggcacc tactgtttgg agcttgtttc ttcacatctg 5221 ttgggaaaag gtctgctaag gccccagact gcctcctgca cttcctcagg actgaaacag 5281 gttgggcttt ctctagggta ccccaagatg gctgctgccc tgaatgccac aggccgcccc 5341 atcgccttct cctgcagctg gccagcctat gaaggcggcc tccccccaag ggtaagccat 5401 tctcccccct gcctgatggc gcttcccaca cccaggcctc cttacctcct gagtgccagg 5461 cttcagggac ggggccttcc agagccaggg ctgctctcac cacagggccc atatggtgat 5521 ttgtaggtgt ctagagacct acattaagta gccgcaaaga gcaggtgggc tcagcagccc 5581 agagaggtgt gtgtgttgga gggagtggcg ggccttagag aagaggccct tgtgaagtgg 5641 gctgaaggag ttggcatcag aggcaggaga gccatctgtg ctggcacaca gtgccaggat 5701 tccaggaggt gcccattagg cgaggagacc cagaaggctg gggaggccag gcaggtgggg 5761 ccaagctcag aggagctgtg ggtactgagg ctccttagca tgtttgacac caaaaagggt 5821 gggctcaggc ctctggagct gagaagaaag cagacagcag tttctcactg agagaacatg 5881 ggggtgggtg ggtgtcaggg agggttgttt gtgggtgcac cagcctgggc ccacaccctg 5941 ggtttgaccc aagctggcat gagaaggcag ggtaggctcc tggcaccagc atgcacagaa 6001 gggagacccg gtagccccag agcaggccac cctctgaggc cctcctcatc ctgagcaagc 6061 cggtggggcc tgttctcttc cctcttcccc actctctgtc tctcctgctg agcctcctca 6121 actcctgtgt gggtccctgg gctgcagcgt gtgattctgg tgctggactc tgcctttccc 6181 tcgacatcca ggtgaactac agtctgctgg cggacatctg caacctctgg cgtaactatg 6241 
atgacatcca ggactcctgg tggagcgtgc tctccatcct gaattggttc gtggagcacc 6301 aggacatact gcagccagtg gccggccctg ggcactggaa tgaccctgac atggtaccag 6361 gatggagggg gatgtcctag gcccaccatc tgtgacgacc ctgctcccca ctgccctgtg 6421 cttcttcctt cttggaggct cactgccggt tctaggtcag cgcttctgaa atgtgtcctg 6481 aactggttgc tttagacttg acttagaagc tggtttcagg gcccatcccc acgtttgcat 6541 cacaatgttc aggggaagaa cactgttaac catctcccca ggcaattctg atgtttctga 6601 aagctcaaga acttgggtct gtagtgaccc tggaataaga tgttattgct ctgcaggttg 6661 aagatcacga ccttgctatt cctagtgcct ggactgctct cccccgtgtg cctctgatcc 6721 ttttgtcaat ttcctgcctc atcctccaca gggacaactg gccctcacag tcctgcttct 6781 agcccttcca ttctatctcg gtcattgccc tgtgacattt tgttgttgtt gcaatttata 6841 gctttatagt tatatgttag tcaacaagca tttactacct gattgatcct atgatgatta 6901 tagaattaca agtgctaagc caggtggagg gagcaacctg gaggagcagg aactggcctg 6961 ctgtggagtc ccagctctag ctctgccact ttacctcggg caagtaactt aatttgttct 7021 gtgcctcagt tttgtcaatt ggaaaataag aatagtagct accttatagg gttgagggtt 7081 aaaataactc tgaaatgctt ggcacagtga tgggcacaca gtgcttaaat tgttgtcact 7141 aagcctgtaa ttgtgtcctg actatggatt gcttgccctg gggtcaggag tcccccttgg 7201 tccattcagc tgtggctagg gcaagtgggt gtggccagtg gtaggtacag ggactttgtc 7261 cgcttctgtt tcttaccact gtcaccttgg gacatttatt gacttaacct agtttataag 7321 ctttttaaca acaagagccc atttgatcat ctgaacaacc tgtgatgtag ccagggcata 7381 ttaccacccc catcaccccc atgggctttt gttttttatt gatttattta ttttttgaga 7441 cagagtcttg ctctgtcgcc caggctggag tgcaatggcg cgatctcggc ttactgcaac 7501 ctccacctcc caggttcaag cgattctcct gcctcaaccg cccgagtagc tgggattaca 7561 ggcgcctgct atgacaccca gctaattttt tttttttttt tttttagtag agatggggtt 7621 tcaccatgtt ggccagactg gtctcaaact cctgacctca tgatctgccc gcctcagcct 7681 cccaaagtgc tgggattaca ggtgtgagcc accacgccca gcctatttat ttatttattt 7741 agagactgag ttttgctctt gttgcccagg ctggagtgca atggcatgac tggctacgca 7801 ctcgctccag gtcaagcgat ctctgtctca gcctccaagt agctggatta caggcatgcg 7861 caccacgcct ggctaatttt gtatttttgg tagagatggg gtttctccgt gttggtcagg 7921 ctggtttcga 
actcctgacc tcaggtgatc tgccctcctc agcctcccaa agtgctggga 7981 ttacaggcgt gagccaccgc gcctggcctt ttgttttttg agtcagagtc tcactctgtt 8041 gcccaggctg gagtgcagtg gtgcaatctc ggctcactgc aacctccaca ttctggttca 8101 agttattctc gtgcttgaca gtagctggtt acaggtgttt accaccacgc ctgggtattt 8161 ttatattttc agtagaaatg aggttttgct atgttggcca ggatggtctc taactcctgg 8221 gctcaagtga ttcacctgcc tctgtagcag gaccagccgc agataaaact cctcagacac 8281 cagattaaag aaggaagagg tttttattcg gctgggagtg tcggcagact cgtgtcttaa 8341 gagccgagct ccctgaaaaa gaaattcttg gcctttttaa aggcttacaa ctttaagggg 8401 tccatgtgaa agggtcgtga tacatcacgc aagcgtggga aacatgactg gaggctacat 8461 gcatcagcta acagaacaaa aagttttaca atgctttttt catacagtat ctggaatttg 8521 cagataacac aagtagttta ggtcaggggt tgatgttatt attattactt ttttttttaa 8581 ctcctagggc caggtggtgg tgccatggtt gtctggctat ttatcttact tttgtttatt 8641 tccaactttt tgctttttct ctcctcctgt cttgtgaact aggcaaggtt gggagaggag 8701 ggcagcagga gtagtagtgg tctccttcct tacctcagcc tcccaaagtg ctggcattta 8761 caagcatgag ccgccgcacc tggccatctg cccacccccc taccgccgac tcctgttctt 8821 gatggaaaat aaaaagctga agctcagatg gaaggtgact taactccccg aaggcacagg 8881 gctaggggat ggtgtttgct gaaccaggcc tcctccacga tacagccatc tttccctaat 8941 ccctaattgc tcttcctcgg ctacaagttt ggggatggtc cctgtgtcta cctccttgag 9001 gagatgagag ctgactactc ctcttctctg cccccagctg ctcattggga actttggtct 9061 cagcttagag caatcccggg cccagatggc cctgtggacg gtgctggcag cccccctctt 9121 gatgtccaca gacctgcgta ccatctccgc ccagaacatg gacattctgc agaatccact 9181 catgatcaaa atcaaccagg atcccttagg catccaggga cgcaggattc acaaggtact 9241 agggtgtgga gggaaggaag gggagggctg aggaactggg ttctcctgag agaaaggctg 9301 ccagctccct gggggcaaca cctggcgagg tacaggagtc gcccagtccc caaccagggc 9361 taccccttct ggttgcttat ggttgaggac tctgatggga gctgctccaa ctgtcctcct 9421 cttgctgggt gagagcaggg ctgagcagga cagctcaagg gagtcgggga tgagaggtgt 9481 cagccacata agtgcacata gcaagggtga ggcacagagc ttctatacac ccgtgatggc 9541 ctgcagagag cttggacttc cctccagagc aggaggagct ggtttgtttg tttttgagac 9601 agggtctcac tctgtcaccc 
aggctggagt gcagtggcac aatttcgact cactgcaatc 9661 tctacctgcc aggttcaagc aattctcgtg cctcagcctc ctgagtagct ggcactacag 9721 gcgcctgcac cacacccagc taatttttgt atttttagta gagacaccat gttggccagg 9781 cttgtctcga actcctggcc tcaggtgatc cacccgtatc agcctcccaa agtgctggga 9841 ttacaggcat gagcaccgca ctcggccagg agaagctgtt atagccaagg aatactacga 9901 ctactggtgg ctgctattta ttgagtacct accatgtgct gggagtttta gataattttt 9961 ctcagcaagg tagttatctt gccattttac aaatgagaaa aatgaaactt cgagagtctg 10021 agtaacttta tcccaaggct acacagttgg tacaaacaag actggacttc agtgtcacct 10081 caaagccttt tttttttttt ttttttttga gatggagtct cacgctgtag cccaggctgg 10141 agtgcagtgg caccatctca gctcactgca acctctggct cccaggttca agcgattttc 10201 ctgcctcagc ctcccaggta gctgggatta caggtgtgcg ccaccacacc cggctaattt 10261 tttttgtatt tttttcagta gagacagggt ttcaccatgt tggccaggct actctcaaaa 10321 ctcctgacgt cagctgatcc actgcctcgg cctcacaaag taatgggatt acagcatgag 10381 ccactgtgcc tgtctgcctt tgctctttac caaatcctgg attctggtaa aaagaaacct 10441 acagaactat ggaaggcacc tatagaactg gtgatgccca gaggaagtaa caattccctg 10501 ccagaggggc tgatggtgga gctgggcctg gaaaaccttc tggaggatgg gagttcacat 10561 ccagctccac tctccaccct cctggaacag agttcactgt tcccactgga cagcaccctc 10621 caggccagca ctggcagctg tttggggcca gcactcatac gctgtactgt tgttgcgctt 10681 ccctgtttct gcgtttatcc ctcccgttgt cctatgagct tctggggcag ggctcatgca 10741 gcacttgtct cagtgtgcta gcataggggc cgggctcaga gtaggtgttg atgagtatct 10801 gctgagtcag ggaaggtggg cagatagggt tagataagct ggggtgctgg aggcccgtgc 10861 gatcctccct aaacctgtgt gacatggagc tgtgaactgg gggacccaga actcagggag 10921 ggccagggag gcaatggtag gtcctgtctg agcaagggac cccagccagt agccaccttc 10981 tgtgcccagg aaaaatctct catcgaagtg tacatgcggc ctctgtccaa caaggctagc 11041 gccttagtct tcttcagctg caggaccgat atgccttatc gctaccactc ctcccttggc 11101 cagctgaact tcaccgggtc tgtgatatat gaggtgagca cccggctgcc tgccagcccg 11161 ctgggcttca gatttctgtg ggagggtatt atggggggtg gttgttcaca tatggcctcc 11221 cctgagccgt gggcttaggg ctctgcaggt gccatgggga gctgcaggca ctgagagctg 11281 cagccagtgc 
ctgctctgca ccagtatttt caacctatgc aaacagaact actgcatata 11341 aatatgaatc ccattagcct ctacggctgt actcctccca agggtcagcc cttgctggca 11401 cgttcctcac agttttccat gggctggggc tttctccact gcccgtgctt gggcacctgc 11461 tggatttgct ggtggagaag tgatggggat tcagggctcc cacttctgta gggcatactg 11521 gaagcccagc acagacaggc caaagtccag gcagaaggct tctacatcct ctagcggggg 11581 gggtctctgt gccccactct catggtggca ttggtgatgt ccctccccac tgcaggccca 11641 ggacgtctac tcaggtgaca tcatcagtgg cctccgagat gaaaccaact tcacagtgat 11701 catcaaccct tcaggggtag tgatgtggta cctgtatccc atcaagaacc tggagatgtc 11761 ccagcagtga ggagctggga catgtgacag gctgtggtgg caccactgag cctagaccat 11821 ggagccttgg catgcccagg gcaagtgggg aggttctctg ctccccaggc ctgctcggtg 11881 actgacccca tcatacccaa agtgcaatct cacggccagg ttctatgccc tgtccaagcg 11941 taaaccctct tggaaacttc ttttggggca attttcctgt ggccttcctg gcctctactt 12001 ccatgtgcgc agccccacag acgttgctga gcaactcgcc agcctcctga gctccatgcc 12061 catcaggact ctagcctctg accttgctgt tgactctgaa atcaggattt ggaagttttc 12121 gaattaggag tagagagatc tgacctcttg ccaggaatgc ccatggatca tgtgattggc 12181 ttttctaccc atagagggcc ttgcagcctg ataccactgg gagtgagggt cacaaaggag 12241 accttggctc cctcaggtca ccaataaacc tgttctttaa tcaagtagtt gtaatatgga 12301 cctggggtac ctttagttag ggactctggt ggcactacac ggggtgctgg attcaaatct 12361 cacctctatg gatgatgtct ggcatcgtct ggcccctcct tttccaaagt gaccagaaga 12421 ttgtacttcc tagagatgcc agacttgcct cacacccagg agctgcccct gctgtgcctg 12481 gtatgctggc agtggtgatg gtgacaacac aatgaggcca agtagcaatg ctcccaggtc 12541 ctgcagcttg tgctctgccc agcagtcccc tggccagcac ctcagcctgc tcttctctct 12601 ggtggctcca ccccaacccc tacttctttt cctttatttc tctcccaacc tggctgcaaa 12661 ggaggttcat atccccagat gtgtccacaa acctgtccag gagttgtgtc ttaaaattga 12721 tttcctgtcc ctcttcctaa ggacaggggt aacagtcaaa gtgaaggaat gatgggttct 12781 tgcctgtctt gctttggtcc gtatttccca agccctcctc ttttagccaa ggctacaggg 12841 cacttgtctt ccctggtggc tctccccttt ggagagagct agtggcagag aacctagcta 12901 ggtctccttg ggctttgaat gcaggagcac ccagtgcccc aggccttgca ctgtgctgcc 
12961 caaggagtat tttctgggtg gatctcccat actcctcagg gttctgaggt ccttgtattt 13021 agggtatggt gtaaaaactt acatatgcac attttgttac atcatttaag atgattgaag 13081 ctttactggt taagtacagg gtcaccagag aaagttcctt tccccagttg atgtcttgag 13141 gctgttgaga aacttaggac agaagcctac ctcatcccag gcctgcccct ccgtgcatag 13201 ctctgccttt gggtcttgtt catagtgctg ttcaggcaaa atggaatagt ctcaggaggg 13261 aacaccttct cttgctagct ccaggaaggc tttgtgggaa atgagttgag accatggact 13321 tgggagttgg gggccagagt tgagcctgga cttgaccaat taattccctt acctttcagc 13381 ccaagatagc tacatgtctt tacttgctgt ataagttttc cttttgtcct gggggtgcct 13441 gattgatcct atctcttcac ccttcattct ttcaacaaac atgccatcta tctcccagga 13501 catttattcc cttctattca aggccagtta ggaatgcaat tatttttttt ttcagttaaa 13561 tacagacttg tttggacgca aggtaccctt tctcttttta aaaacttttt atcatgaaaa 13621 ctttcaaata caaaaataaa atggtagaga gtgaacctcc atg // LOCUS HUMAK1 12229 bp DNA PRI 30-OCT-1994 DEFINITION Human cytosolic adenylate kinase (AK1) gene, complete cds. ACCESSION J04809 NID g178321 KEYWORDS adenylate kinase. SOURCE Human DNA, (library of T.Maniatis). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12229) AUTHORS Matsuura,S., Igarashi,M., Tanizawa,Y., Yamada,M., Kishi,F., Kajii,T., Fujii,H., Miwa,S., Sakurai,M. and Nakazawa,A. TITLE Human adenylate kinase deficiency associated with hemolytic anemia. A single base substitution affecting solubility and catalytic activity of the cytosolic adenylate kinase JOURNAL J. Biol. Chem. 264 (17), 10148-10155 (1989) MEDLINE 89255503 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Nakazawa, 10-JUL-1989. 
FEATURES Location/Qualifiers source 1..12229 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="of T.Maniatis" /map="9q34.1" TATA_signal 902..908 exon 944..983 /partial /gene="AK1" /note="G00-119-664" /number=1 mRNA join(944..983,3948..3988,5534..5569,5742..5905,6656..6772, 10075..10266,10508..12188) /gene="AK1" /note="G00-119-664" /product="adenylate kinase" gene join(944..983,3948..3988,5534..5569,5742..5905,6656..6772, 10075..10266,10508..12188) /gene="AK1" intron 984..3947 /gene="AK1" /note="G00-119-664; does not fit consensus" /number=1 exon 3948..3988 /gene="AK1" /note="G00-119-664" /number=2 CDS join(3982..3988,5534..5569,5742..5905,6656..6772, 10075..10266,10508..10576) /gene="AK1" /codon_start=1 /db_xref="GDB:G00-119-664" /product="adenylate kinase" /db_xref="PID:g178322" /translation="MEEKLKKTKIIFVVGGPGSGKGTQCEKIVQKYGYTHLSTGDLLR SEVSSGSARGKKLSEIMEKGQLVPLETVLDMLRDAMVAKVNTSKGFLIDGYPREVQQG EEFERRIGQPTLLLYVDAGPETMTQRLLKRGETSGRVDDNEETIKKRLETYYKATEPV IAFYEKRGIVRKVNAEGSVDSVFSQVCTHLDALK" intron 3989..5533 /gene="AK1" /note="G00-119-664" /number=2 exon 5534..5569 /gene="AK1" /note="G00-119-664" /number=3 intron 5570..5741 /gene="AK1" /note="G00-119-664" /number=3 exon 5742..5905 /gene="AK1" /note="G00-119-664" /number=4 intron 5906..6655 /gene="AK1" /note="G00-119-664" /number=4 exon 6656..6772 /gene="AK1" /note="G00-119-664" /number=5 intron 6773..10074 /gene="AK1" /note="G00-119-664" /number=5 exon 10075..10266 /gene="AK1" /note="G00-119-664" /number=6 intron 10267..10507 /gene="AK1" /note="G00-119-664" /number=6 exon 10508..12188 /partial /gene="AK1" /note="G00-119-664" /number=7 BASE COUNT 2417 a 3457 c 3877 g 2478 t ORIGIN Chromosome 9q34.1-q34.2. 
1 tagcctataa tacaaattcc aacccacctc atctggggct gctgcctggg ctcccatccc 61 tgcccggcta catcactgag cacctactac tatgtgccag tctccctgca aaacgctgga 121 taaacacgtg gctttctacc agggaacctc ccgcaaggta tttgacatgc tcgcctcccg 181 ttctctgctg tgtctaagga ttcacagatg cggctggagt ctgctgctta gcacagtgag 241 tcgtcattat gggagctatt cttcttatca agaccgcaaa tcccctctct ggctatccac 301 agccttctga atgtcccggg caactccagg ggagcagggc tggttttcta taatcttcta 361 ccttactggg tggtctttgg ggtctggcgg gttccagtcc cagaggagct cgaagggtcc 421 ctccaacagg ggaagaggag tcccaggtgg gtcctggcca gggcctctgg gcaggctctg 481 agggcgggct tggggactgt ggccagcaaa gcccctgagt cgatgcctca gccctctcgc 541 tccctggtct ggcctctctc tggcaccaat gcgctgtggg attttgcgga aagagccgct 601 cttctctgag cctcagtttc tccagctatc atagggaaag cctggccttg taccttgagc 661 acagtcgggg tatcgcaatg gaaaactctt ggcaaactgt aaagtgtagt tcgcgtgtgt 721 gtgggcacag ccacctgggg gtccacggtg cggggcacac ggtgcgggtg cggtgtcgcc 781 gcgcacccgg ctcgggctcg gtcccgcccc gcttcccggt ccctggtccg ctctccctcc 841 ctccttcccg ccctccctgc cttcgggaac gccggctccc gatgccgcgc gctgacagcc 901 ttataaatag tcgcctttgc cggccgccgc gaggacgggc agggcacgca ctggccccgg 961 cgcccacccg cacccctccc caggtcagtg cgtgcccgcg cgtgtctggg ggggcgcctc 1021 tgcggggagg ggcagcggca ccgggggagg gcgggggcgt cctggtgcgg ggctccctgg 1081 gggctgtgcc ggctgtgtat ccgtggttgt gatccgtgtg tgtttgcgtg gctgtgttcc 1141 ccgggcgctg gggatctgtg cctcgctgcc tgtgcctttg tggatccgtg cgtgtgtgtg 1201 cgtgtgtgtt tgtgtgtgtt gcagcccctt ccagcctccg agatagtcac ccctttgttc 1261 tggctggcac tcccggggct ccaggtcact gcccttggca tgtcccagct tgggcccaga 1321 gagggagagc tgtggcctgg ctccctgccc cagaggacct ggacaggggc agcctctgtg 1381 ctccgtaggc tatcgtgtca ctccacttct agtgctgggt gcattggaga acaccccctg 1441 gatgctcaga gcttgcccta gcctcctgac cagggccagc gcttggggcc ttcccacagc 1501 actgctcagt ggccgcctgg tactcctcag cagcagcagg acaggcccag agaggggaca 1561 ccaactgccc caaggtcaca cagctgagtg atggagctgg gattcgaacc ctagtcagtt 1621 gccacccact gggaaccgac tacgttgggt cagaagctgg gagacctggg cttgccttgg 1681 ccttgtcttg tgattggggt 
taagtcaccg tccctctggg cttcggcccc acaggcccct 1741 tagcagacca tccagtcata ttaggagtcg ctggggcctt gacccagggg ccttataagg 1801 cctgggtgct tagcccgtaa cccaactcca tcctagaccc ctgtcctctg tgaggcctcc 1861 tcacgaccca agttattttg gatcaaggaa gtggcaggtg ccatcagggc ggagggggct 1921 tagattacca ggaggggggc ggggcagggc ggggtcgagg ccacttctca gggcttcact 1981 gtggttgtcc ctggcccagg acgctccccc tcccatgggc agctgctgag gggcacttgg 2041 cagctcctac ccccctgaag cacactcagc aggaaaagtg gctgccactc ctgcctgggg 2101 cagcaggggc atcagcaaag ccacttcagc tgctactggg tgccaggccc agcgtagcag 2161 aaactcaggg ccgagttgga ccctagctgg ccctcgaacc agtctgtcct cctccccatg 2221 cctcagtttc ctcccaagat ggcagtgagg atggaacgag gctccccagc cctgtgctca 2281 gcccatgcca ggcccagtca atgggatccc aggcccgcag cacagaagag gggcccagta 2341 cagcgtggtt gtgcagtgca gggcaggttt ccaggaggca cctgccagtg tgggagtgga 2401 gggcagagtc caggaagact tcctgaagga ggtggcacct gagctgagtc ccacacaata 2461 gcaaggtggg gaaaggcatc caggcattgg gtacaacatc tgccagtggc agctgatagg 2521 gttgcagagg gcagcagagg tcagctcacc gggacctatc cactatgtca aggggcccag 2581 gctttgtctg ggagccgaga ggccacacat gcaaggttgt aagcgggtgt gtgatgtggt 2641 ctgatacttg ccagagaaaa tcctgcaaat tgcgtggcag ggtgtggagg cttcaaggcc 2701 catgtggagg cgggggcagt ggcccaggcc ccttgcttag cacaggacat agtggtgagg 2761 agacagacgg agagagcctg gggaggggag ggttgggcag tgggtgtgca gggctgtgga 2821 gaggagagag gagccttcac actgactcag cctgtggctt gccatagggg ctggagctgg 2881 gacttgggga gggctgttca gagcctgaga agcctgtggg gcttctgggg cctgtctcgg 2941 gggcatggat gtctggggct ggagagaaag gcctgggctg cctgcacagg gatgggggtc 3001 cttggagcct gaagccaagt agagagggtg gtgtgggtgc cttgggtgga ggtcccggtg 3061 cggtggggtc gctggctgga ggtcaaggct ggaggggagt gtggctgtgg tggttgcgag 3121 tggccttggg gagggctggt ggctgggagt gaagaagtgg aggccaaagg tgtccagagg 3181 gcagttctga gatagggggg cgaggtaggg gaggtcctgg gggctgtggc agcacagggt 3241 ggggtcgcag gtgtgattgg gctcatgggc aggctgaggg caaggtgggg gcagtaaagc 3301 caggtcctca aagggtgggg gtgcagggga catgcgtggg gctggttgtt gggcctgtag 3361 aggtgaagag cctgggaccg cctcactttg 
acaactgtaa agggctgtgt tcatgcgagc 3421 ggctcctgta gattccttca ccccaggcct ctgtccctta gaggatcccc atccaaacct 3481 catcccagac acacatagtt cctggccacc tcccacctcc tggcagcttc cctctggccc 3541 tggggagcgt tgtcaaggtg tctgggcaga cgctggcccc gcgccctgtt ttctgggagt 3601 ttcgagggga gcagctctgc agttgtgtgg ccgctgttta tgtccctcgt gccatcctca 3661 ctgtcccagc ccctccagcg ggaacccagg tggatgggag ggggctactg cttctgtgac 3721 aggtgaggcc atggaggcca gagaggaaag gtgacttttc caaggtcact ttcccaaggt 3781 ggctttctca gggaaggggt cctgggcctt ccctgccccc agcagccccc tctgccctca 3841 gactgcatgt cgggcctggc acacaccagt tgccttgggt gtggcttagt ggaaggtgag 3901 tcagggtgag gaggaaaatc cctccccaac ctgtgccctt gtcttacaga gcactgacac 3961 ggctcccggg acctcggcag gatggaaggt atgagcctcc tctccttttg cccattgcct 4021 gggcctcagt gtgcctgact attaaggggg tgcgctctca gggagacatt ctgggagggg 4081 ttggaggagg ctctggggct gccggagggg gcctggctgg gagcctccag tatcctcctc 4141 tggcctctca cgggatgtgc cagctccaag attaatttta gacgtgacga ggcctcccca 4201 aggctcctga gggccccagc gcacaccccc tatgctgggt ttatagagct gagaccctgc 4261 cccatggaga ggccacagat acacaaaaca ccaggtctaa tgaggacaag gcaattcccc 4321 gtacctctcc gggactcagt ttccctatca tccagtgagg gtgggccttg aggaagtctt 4381 ggaccctcct agttcctatg ggctgggact tagagggaga cacgggcagg cagagagtcc 4441 agaacgggcc cgccctcagc tgctggcagc accatccgtc cagttcctgg cactgtgcct 4501 tggcctgggc agctcttaaa gacacccagc gggcctgctg ctgctgctgc tgctgctctg 4561 cctgagctcc aaatatcaac agccataggg ctgcggccac tgcacccccg ggataggggc 4621 ccacaggagg agcccttctt gacagctcct gtagggctcc agcttatggt gagggcccct 4681 gggagccgcc cagccacctc ctccctgggg aaggggcaga gaaccagaga ggggcaggcc 4741 tttcctaagg tcacacagca aggtgggata cagcctggtc ctctatttcc ctgggaagcg 4801 ggggagcctg aaagcccagt ggggagcttg ggttctggtt ttggttctga catcccttga 4861 gcaggccttc cctctctcga gctcagtttc cctctgccca gtggggctgt ggccttgctg 4921 ggatctaaca gcaggcagga cgagacagag gttggagcca agctcaccgc cctctctggg 4981 caagagttcc acatgcccac cacagccctc caccccggca actgtctcca tggcaacata 5041 tgttgtggtt gctaagcccc cactgccggg ggaggcgggt 
gatgggccgg cccagggcct 5101 gagaggggaa gggcagggtg cacaggagac acggcaggac gggaccatgg gctgctgctc 5161 ctcgagtgac ccccgcagag aagacgatct gagagccaga ggtgaccggc cccggggtcc 5221 aggcacccac agggaggcgc catgagggct ggcggcccag cccgctggct gtgtggagag 5281 tacttagcct ttaggctttt gcctctgccc gtctggctgc ctggagcacc ctcccttccc 5341 tgacctctcc caggctccga gctcactcac atccctgctg actctgggct gtgacttgtc 5401 tgcctccctg acaggagcgc cagcggtgtc tgtccctccg cctggggccc agcacaggcc 5461 tgggactcca gaggtgcctc tggctgtggg tccagcgagt gactgagccc tgctcccctt 5521 ttctctcccg cagagaagct gaagaaaacc aagatcatct ttgtggtggg tgagttgcgg 5581 gcaggcgggt ggtgcagcaa gggtcttcac tggcactgga gggagcccgg gcctggggct 5641 gcaccctggg ctctgcccct ccctctctgg gagaccctgg cagcccctgt tcctcctggg 5701 accctgtgtc tcggggttta tgaatgggtg ggcgtttgca ggtgggcctg gctcagggaa 5761 gggcacccag tgtgagaaga tcgtgcagaa gtatggctac acccacctct ccaccgggga 5821 cctcctgcgg tccgaggtca gctcaggctc ggccaggggc aagaagctgt cggaaatcat 5881 ggagaagggg cagctggttc cactggtgag tgggccctgg tggggtgaga ggcaggggat 5941 gacagtagcc ctgagtgggt gtcccaccag cagcaaagcc cacggcacac aggcaaagct 6001 gaccgtaacc gcccccggct ccaaaccctt ctgtggttcc ccattgccct ggacagaaag 6061 ttccttccag aagctcccaa gaccttgcag gactctgccc tgcccccaac tctctccctt 6121 ccctccatcc ttcagccacc ctggcctctg tcccctcaga ctagccatac tccttccagc 6181 ctggaatatt ctttcctctt cagctttcag acttccctaa ctccccagat ttggcctaag 6241 acctcccccc ccaattttct actccagaag ccccacttct ctgcccgtac aacctgtgcc 6301 atgcttagaa atggagaaat catctgtgga atgtgtctgt cccctcacca gcccctgagc 6361 tccccagggc agggaaggcc taattggtcc atggccgtgt cctcgggacc caggacagca 6421 cctggctgga aggagggtct cagtgaatac tgttgagtga atgcatcatt cattcattca 6481 ttcactcagt ccaggtctcc acaattccga ggaagcttcc tggaggggga ggcttgtttg 6541 ggacatttgt gttggggaga gggcatttca ggtggcagga gcagcgtgac gagaggctgg 6601 gagaagggat ggggaggaga ggctcatggc cccttcctcc tttgctgtgc cccaggagac 6661 agtgttggac atgctccggg atgccatggt ggccaaagtc aatacttcca aaggcttcct 6721 gattgatggc tacccgcggg aggtgcagca aggagaagag tttgagcgac 
gggtaaggca 6781 ctgacccaag tggggatcct ggtgggctgg ggcaggataa gactcccatc cccatgggcg 6841 ggcctgcggg ggctgtgatg caggtcagag ttctgagcag aggctgattc atggagtgtg 6901 ggtgaggagg aaaaccagga agactttctg gaagtgtaga agaaaatgtg tagaagggga 6961 agcgccagga ggggcagttg ccaaacctgg gctcagattc cagctcaacg gcttgagatc 7021 ttgggccagt cattctgcct cctgaacctc agttttccca tctgcaaaat gcagggagtg 7081 ggaggaccag gtcacagctg ccttaaggga cagtgtgggg actggagagg atgtgccctt 7141 ccagtgtggg ccttggctcc ctataaacac acagaggggc agggaacagc actggcaggg 7201 ctggaaaggc aaggccagtg tggggagaac agaatatgtg ggcaggacaa caagggaggt 7261 tccagcctgg ctggaggctg caaatgccat aaagggacat cagtctcctt ccaggggtca 7321 tgaggagcct ggctggaaag agcaggcctg gaaaatgatg gccctgagct tcctagcacg 7381 agcatcctgc agatacacct tccccaggat ggcaccccac acacgcacac acgtggacgc 7441 ttgccacata cacatgcaca agccccacaa gttcatacac acaaggccag gggcatgtgc 7501 acacatcgac acacacggca gtggatactc actcctgcac atgcatgtca cggacgcagc 7561 acacatatgc aggaatgcac acgtgctcgc aacatggccc tgtgcacaca catacacttt 7621 tacctcggtg tcacaaggct gtatgggaga aataaacggg actgcaaggg cctaggaggg 7681 agaggtcggc aagggctagg gggagcctaa ccagcatcct ggtgggctgg ggcaggataa 7741 gactcccatc cccgtgcccc atccccatgc cccattccca tgccccgtct cgtagggcct 7801 tctgggcttc cgtagcttcc agcctcatct ggggatagga ggccatattt tctgacctcc 7861 gagtggctgc tgagtggctg caagacctca ggaaagtgtc taccccactc tgcccctcac 7921 cttccctatc tgaaatgggg gttagctggg attccttcag acctcagacc cctgaccctg 7981 tgttaggtgc cacagttctg aaagatgtcc tggcctggct cacacatacc agctgtgtgg 8041 taccacctgt gaccctcagg gccacatcta gaaaatagag ttaatgaaag tcggccgggt 8101 gcagtggctc acgcctgaga tcctagcact ttgggaggcc aaagagggca gatcacttga 8161 ggtcagggtc aaaaccagcc tggccaacat ggtgaaaact catgtactaa aaatacaaaa 8221 attagccagg catggtggcg ggcgcctgta atcccagcta ctcaggaggc tgagacacga 8281 gaatcatttg aacctggctg gacggaggtt gcagtgagcc aagatcacgc cattgcactc 8341 aagcctgaac aacagaatga gactccgtct caaaaaaaaa aaaaaaaaaa agtccccacc 8401 cttccagggc tgttgtaggt gccgctgagg gccgcagctc agccctggtc ccatccctgc 
8461 tgtggcctcc aggtgtccgg gcagctggtc ttcttggctc tggacaccag ggggcgccag 8521 ggcttcaccg aagagggagg gtcagcctgg aagtgtggcc ggctcacacc caaagctttt 8581 ctggagactg gcagctgccg ttgtggactc cccgggctag atgtccattg tcctgacagg 8641 gagactgagg ccccacagcc agaatctcaa gcccagaggc tgagctggct ctcccagatt 8701 taccacctct gctgggtgtg tgtaaagagc ctactgtgct tggtcctgga gacccagatg 8761 aaccagagac tgtgagtctg tgccctggag gagctcagca gctgatggat gatgatgggg 8821 ggtgggtgac cttgaagttg aggtcagact gctgcatcca ggcccagcct ttgcccccat 8881 ggtaatctgt aaggtagaag cagccgctgg gtggtacgga gtcatctccc tgaaggcagc 8941 tgacgctgtc accctgaatc cttaactacg gaggagctag ggatgagcgc taggatgggt 9001 atcgcaggcg aggacaaagg gcttgcccca ggtcacacag ctgaacagag gcaggggtga 9061 gcctgggtcc aggtcagtct gacgccggag gtcaggctgc ttctcgcctc tgtccagctg 9121 tgtggccttg gacaagcctc ttggttacac agctggacag aggcacgaaa cagcctgacc 9181 tgagaggcag tgccccaagg ccttgacagg ggtccagggc caaacgtgcg cagaaaccct 9241 tgggggctgg cccaggcagt tggaagcagg agacagggtt tgaaggtccc aatcccattc 9301 ctaaagctgt ggcagtcatg ggtgactctg gggtgctctg ccctcccaac caggcagctg 9361 tggtagctgg attgagcgag gcaggggctg cagaacctta ttcctcttgc ctttccaagg 9421 cgtcccatgg atgacatgga tgacatgctg ttccaatgac acggacattg ctcagaaccc 9481 cccaggactc agcagcccct caccccttcc ctcatccctt gttcctttgt caaaggaccc 9541 cagaattgtg agacaagtgc ctccctgccc tcagggccct tgggagacct gcttaaggaa 9601 agataaaggc gctggaagag ctgttccagg aggcggggag tactgggggg cggggggtgg 9661 catcagagag actgccctga ggaagtgagc ttcgcactga aacctgatgc ttggtattga 9721 ggtgaaggcg agttaggatc aggctcgggg gagagggctc ctagcagagg gaacagcatg 9781 agcacaggtc tggcatggaa ggaggttgat gagagcctag agcagagaga gtcaggctct 9841 ctcagtctgg caagtagtgt ggactttctc ctgagggcac tggggagcca tggtgtaggt 9901 ttgagcaggg gagtggcagg aaagattggt tttaggaagg tctttcaggc tgccttgggg 9961 aagggattga aggtctgagg gggtatctac tggggcttca tagatgggaa gatggggagg 10021 acatggccct agggggcagg aatggggtcc ctgtgacatg gtgcccgcct gcagattgga 10081 cagcccacac tgctgctgta tgtggacgca ggccctgaga ccatgaccca gcggctcttg 10141 
aaacgtggag agaccagcgg gcgtgtggac gacaatgagg agaccatcaa aaagcggctg 10201 gagacctatt acaaggccac agaacccgtc atcgccttct atgagaaacg tggcattgtg 10261 cgcaaggtgg gccccgcggg acggggcagc tccgggaaga acgggttcct atgtggctcc 10321 gcctcagctc actgtgtggc ctcaggccag cccctctctc tgggccttgg tttccccact 10381 ggttcaatga ggggctgctg tcaattaggt tgggctcggg gaggcaggcc tggtctctgg 10441 ggcctgccat ggtgaggccc accccactcc acctggtgac tcagcctcac ggcccatctc 10501 cccacaggtc aacgctgagg gctccgtgga cagtgtcttc tcccaggtct gcacccacct 10561 ggacgcccta aagtagcaac gctggagccg cttccccagc tcagagcccc gccccacccc 10621 gtcctgatta gaggtcctcc tggcctgagc gcagcgcctc caccctgccc tgctgagcac 10681 agacggagga agccgcttat cctgttttca tggacagctg agcactaaag gaatttctaa 10741 ggacatttgg ttttactgct ttttctctgc ttccagttgg agttgattca tgtgcttgtg 10801 cctacctggc cgcaagtccc cagcccctca accctccgtt cctcctcagc ctccctttgc 10861 cagccacccc tcctctagct ctggtgggag gcccggggcc cttcctcgca cagggcatgc 10921 ctggcctgag gacccggcgc tgagtggcgg ggcccctgct ccgaggggct catgttcagg 10981 cagaaccggt cccagcctgg gctcctctgc atcttgctct gtgccttggc cctgaccccc 11041 atcgctctga gcatatgttc catgcctgcc ttgccggggc ctggactgca caggcagcaa 11101 ggtcatggtc tgagtggggc ttcctgggca gttggggcgg cccacgccag ctggcccagt 11161 gggtagtgaa ttggcttcct tgacgcgaga ggctctgagg gtctgaaaag ggcatctcaa 11221 tggcatgggt gggtggggag tcagtcatgt cactgaaatt gaatggggga ggcccaatga 11281 ggtggctcat gcctgtaatc ccagcacttt gggaagctga ggcaggagga tcacctgagg 11341 tcaggagttc gagagcagcc tggccaacat ggcaaaaccc cttatttact aaaaatacaa 11401 aaacttagcc gggcatggtg gcatgtgcct gtattcccag ctactcatga ggctgaggca 11461 ggagaatggc gtgaacccgg gagtggagct tgcaatgagc caagattgcg ccactgcact 11521 ccagcctggg tgacagagca agactccgtc tcaaaaaaaa aaaaaaaaaa aaagaaatta 11581 attgggggag tcatggtcag gggtgagacc tgaaggacct ccccctgtgt ggccctggac 11641 acagcccacc ctctgtgagc cgtttccaat tctaaaacag actcaatgtc cccctcaccc 11701 ccacctcaag gtcaggatgc gaacacactg agtgaggagt ggacgctgtc cattgccacg 11761 gccatgaggg ctggagacca gaacagcatg gcccgaagcg tgcggggccc 
cggatgactt 11821 ggggacaccc cagaatcccc tggggagaac ccttcctgcg cgctttcatt ttttgacctc 11881 atcactgaga aaggctcaat ttggtgctca cgtgtcctta acacctgatc tggcccaagc 11941 tgcgtgccct ttaagccaag agagcctctt gtggaccccg cctgcccgaa tgaaatccga 12001 acagttgggg ctgttatggc aagtggggct ggtttttcat ttccattggt tatttaaagt 12061 ttcctttaaa ataaacgatt ttaagttata aaaggtgaat ctattgaaag aagaacatca 12121 aagaaataaa caggagttca gcggagtagc agaagacaag gcatgtaggg ggagccattc 12181 tgtcccaggg aagttgtgga gggtaggggc tgtgtggagg cctctgcag // LOCUS HUMALIFA 7614 bp DNA PRI 31-OCT-1994 DEFINITION Human leukemia inhibitory factor (LIF) gene, complete cds. ACCESSION M63420 J05436 NID g178414 KEYWORDS glycoprotein; leukemia inhibitory factor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7614) AUTHORS Stahl,J., Gearing,D.P., Willson,T.A., Brown,M.A., King,J.A. and Gough,N.M. TITLE Structural organization of the genes for murine and human leukemia inhibitory factor. Evolutionary conservation of coding and non-coding regions JOURNAL J. Biol. Chem. 
265 (15), 8833-8841 (1990) MEDLINE 90256813 FEATURES Location/Qualifiers source 1..7614 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11.2-q13.1" mRNA join(657..739,2471..2649,3343..6826) /gene="LIF" /note="G00-120-152" exon 657..739 /gene="LIF" /note="G00-120-152" /number=1 gene join(657..739,2471..2649,3343..6826) /gene="LIF" CDS join(721..739,2471..2649,3343..3753) /gene="LIF" /codon_start=1 /db_xref="GDB:G00-120-152" /product="leukemia inhibitory factor" /db_xref="PID:g178415" /translation="MKVLAAGVVPLLLVLHWKHGAGSPLPITPVNATCAIRHPCHNNL MNQIRSQLAQLNGSANALFILYYTAQGEPFPNNLDKLCGPNVTDFPPFHANGTEKAKL VELYRIVVYLGTSLGNITRDQKILNPSALSLHSKLNATADILRGLLSNVLCRLCSKYH VGHVDVTYGPDTSGKDVFQKKKLGCQLLGKYKQIIAVLAQAF" sig_peptide join(721..739,2471..2490) /gene="LIF" /note="G00-120-152" exon 2471..2649 /gene="LIF" /note="G00-120-152" /number=2 mat_peptide join(2491..2649,3343..3750) /gene="LIF" /note="G00-120-152" /product="leukemia inhibitory factor" exon 3343..6826 /gene="LIF" /note="G00-120-152" /number=3 BASE COUNT 1512 a 2159 c 2297 g 1646 t ORIGIN 1 aggcctgacc ctctggggcc ctgggcaacg tgttcccctc ggagcccctt ggggcccagt 61 ggcaatgtgc agattggagg aggctacctc tggggtggct tccagcgccc atgttcagtt 121 ccactttatg accgtctaaa gtccacgccg gccccggccc ctcctcctgg agcctccttc 181 tcagccaagc cctagccctc ctcctcccac tgccccccta aagcccccca accagggggc 241 acaggggcca gtctccatct gctagtccag actcgctacc tccccaaacc caggtgagtc 301 agggccgtct ggggcgccca ctgctgggac ccctgctgac tcggccaggg gccccctcct 361 ggcgatgcca tcttcagaca actcccggga caagccaggc aggaaaacca cagggcgttt 421 tgtcgaaggc ttcattataa ttttatcaat caaattctta gaagagggaa aaagtctgtt 481 ctccccaccc tcccccctca ctcgtccccc cccttcactc tcactttctt ccattcataa 541 tttcctatga tgcacctcaa acaacttcct ggactgggga tcccggctaa atatagctgt 601 ttctgtctta caacacaggc tccagtatat aaatcaggca aattccccat ttgagcatga 661 acctctgaaa actgccggca tctgaggttt cctccaaggc cctctgaagt gcagcccata 721 atgaaggtct tggcggcagg taaatacacc cgccccgcgc cggcttcgcg tccccgctgc 781 ggggcgcggc ggcaacttgg 
ggcgcttggc agcgcgagcc ggacgcccac ccgccgcaga 841 cacacgaaca cttggggcgc ccgcgcagcc accggaggcg ctgggtggcg gcccggagcg 901 agcgcgggca catggtccca agcaccgcac gccccgcggc aggttggcgg cgagggaggg 961 ggcgccgagc cttcctcctc ctcctcttct tccccctctc cttctcctcc tcctcctcct 1021 cgctccggac actcgctggc tgctcttccc tctgcgcttt ctcccagatt cactctccct 1081 ctctcttttt cttttctttc tttctttccg ctttctcctt tccaaccgcg gtcccgggct 1141 gctcccaggg aggggcgcgg gcggcgagca gcttgcaaac tccggcctgg gacgggagca 1201 ggtgccgcct ccatctgctg gtgtctggaa gcgtgtggtc tgcgctaggt gagatatagg 1261 ggtgtggccc ccctcccgca gccaccccgg ggcctcagca ctgccttggg atgctgggac 1321 gaacagggga cccaggagaa gtgaacttga ggaggcccct gtccccgctg ttccagcatg 1381 ggggatccgg gggaggtctc cagcctcact cgcccgctga ccccggcccc caccatcttc 1441 aggtgccctt ctgcagagcc ctggaaaagc gtctaagggg gcctgggggg agtcgggaga 1501 acaggccccc ctgggggagg ggcaaaccag gacatgtcgg gacagctccc agctctcctg 1561 tcacctttca ctttccttcc tccccgccca cccacctgcc tatgaccttt tgccttttct 1621 ctctccattt cctctccctc cctgagccgg tgtgtgtgtg ttaggaggga ggggaacccc 1681 tgggacaagg gacagcggat gagtcgggag aaggcatgga gtcaggggct gtccggagct 1741 gggggaacag agagttttga atgatgattt ggggatggag agtggggaca gccaggcaga 1801 aatggggtga gcttgagtga gataggggac actagggaag gaagagatag aggatgatgg 1861 gggcgggggc agaattgggg gcaattggcc cagggagcca gaatcaagtg gcaggtttgg 1921 gagggagatg agggtcaacc aggactcctc cccactcccc catgccccgt ggcccccatg 1981 ggtgcctggc ttcggagatt tgggctgcaa tgggccagtg agtgggaagc gctgctgagg 2041 aacctgggcc accacgggag gtgggaagag aggggtctcc tttctcctgg tgcctgctgg 2101 cctggggcct ggtgccttcc agccagaggg ccagggggcc ttaggacctt tgccttcctg 2161 agggaaaggg tgggatggct gggagtctct cctggaccct gcaccctttg ggtggaaatg 2221 gcttgtgtct tgccctatct ttacgtcatc ccagaagaga gcagggagaa ctaaggtaga 2281 gaaagggaga cagagaaaca cacgcagtga cagagtgaag actaggcccc aggaagacag 2341 ctgcaggtgg tgagggggaa ccaggagtcc tctcccatgc ggccattgtt ggacccccat 2401 ccggtgtgcc atgaccccag gccacccttt cctgcctttc tactcatggc ttcttcctga 2461 ctgtccccag gagttgtgcc cctgctgttg 
gttctgcact ggaaacatgg ggcggggagc 2521 cccctcccca tcacccctgt caacgccacc tgtgccatac gccacccatg tcacaacaac 2581 ctcatgaacc agatcaggag ccaactggca cagctcaatg gcagtgccaa tgccctcttt 2641 attctctatg taagttaccc ctgggatact gacaggagat ggcagggagg gggcttgtaa 2701 atatcattag gggctgtcct gatctgggtt gaggggacct tttggggctg gaaggagaga 2761 atggggagag ggcttgatta aaccaccccc agactcctgc cacttcctgc ccaagcttcc 2821 ccagggaagc ttccccaggg tgcccagtta gcaaggggag aactgagtgc aaaggtgggg 2881 acctggcact tcttatcttg tgattgtcct gctgcaggga gcgagggatg gaggggaaat 2941 gggcgtgagg caccagggag atgcggttga gaggcagtgg gctgtgggtg ctgggcatgg 3001 aggggcgtcc cggaacattg tgagtgcagg gatggaagta cttgtgtgtg gtgccccagc 3061 tagggctaga caccgagttt tcccttctgt ccccttaggg tggtgatgat gatgatgatg 3121 ataatgatga ctgcgtgcat ggctcagtct ttgatcttta gcaagggcac tcacattaca 3181 attagttttg gctctcatga caattccaga tgcttacagg gcaaggagtt gggtcctcat 3241 gcgctagatg gggaaacaga cgcaagagct tgcccaaagg gttggcggca gggctgggac 3301 actgacccct gactcccacg tcacctccct tctgcccctc agtacacagc ccagggggag 3361 ccgttcccca acaacctgga caagctatgt ggccccaacg tgacggactt cccgcccttc 3421 cacgccaacg gcacggagaa ggccaagctg gtggagctgt accgcatagt cgtgtacctt 3481 ggcacctccc tgggcaacat cacccgggac cagaagatcc tcaaccccag tgccctcagc 3541 ctccacagca agctcaacgc caccgccgac atcctgcgag gcctccttag caacgtgctg 3601 tgccgcctgt gcagcaagta ccacgtgggc catgtggacg tgacctacgg ccctgacacc 3661 tcgggtaagg atgtcttcca gaagaagaag ctgggctgtc aactcctggg gaagtataag 3721 cagatcatcg ccgtgttggc ccaggccttc tagcaggagg tcttgaagtg tgctgtgaac 3781 cgagggatct caggagttgg gtccagatgt gggggcctgt ccaagggtgg ctgggcccag 3841 ggcatcgcta aacccaaatg ggggctgctg gctgaccccg agggtgcctg gccagtccac 3901 tccactctgg gctgggctgt gatgaagctg agcagagtgg aaacttccat agggagggag 3961 ctagaagaag gtgccccttc ctctgggaga ttgtggactg gggagcgtgg gctggacttc 4021 tgcctctact tgtccctttg gccccttgct cactttgtgc agtgaacaaa ctacacaagt 4081 catctacaag agccctgacc acagggtgag acagcagggc ccaggggagt ggaccagccc 4141 ccagcaaatt atcaccatct gtgcctttgc tgccccttag 
gttgggactt aggtgggcca 4201 gaggggctag gatcccaaag gactccttgt cccctagaag tttgatgagt ggaagataga 4261 gaggggcctc tgggatggaa ggctgtcttc ttttgaggat gatcagagaa cttgggcata 4321 ggaacaatct ggcagaagtt tccagaagga ggtcacttgg cattcaggct cttggggagg 4381 cagagaagcc accttcaggc ctgggaagga agacactggg aggaggagag gcctggaaag 4441 ctttggtagg ttcttcgttc tcttccccgt gatcttccct gcagcctggg atggccaggg 4501 tctgatggct ggacctgcag caggggtttg tggaggtggg tagggcaggg gcaggttgct 4561 aagtcaggtg cagaggttct gagggaccca ggctcttcct ctgggtaaag gtctgtaaga 4621 aggggctggg gtagctcaga gtagcagctc acatctgagg ccctgggagg tcttgtgagg 4681 tcacacagag gtacttgagg gggactggag gccgtctctg gtccccaggg caagggaaca 4741 gcagaactta gggtcagggt ctcagggaac cctgagctcc aagcgtgctg tgcgtctgac 4801 ctggcatgat ttctatttat tatgatatcc tatttatatt aacttattgg tgctttcagt 4861 ggccaagtta attccccttt ccctggtccc tactcaacaa aatatgatga tggctcccga 4921 cacaagcgcc agggccaggg cttagcaggg cctggtctgg aagtcgacaa tgttacaagt 4981 ggaataagcc ttacgggtga agctcagaga agggtcggat ctgagagaat ggggaggcct 5041 gagtgggagt ggggggcctt gctccacccc catcccctac tgtgacttgc tttagcgtgt 5101 cagggtccag gctgcagggg ctgggccaat ttgtggagag gccgggtgcc tttctgtctt 5161 gcttccaggg ggctggttca cactgttctt gggcgcccca gcattgtgtt gtgaggcgca 5221 ctgttcctgg cagatattgt gccccctgga gcagtgggca agacagtcct tgtggcccac 5281 cctgtccttg tttctgtgtc cccatgctgc ctctgaaata gcgccctgga acaaccctgc 5341 ccctgcaccc agcatgctcc gacacagcag ggaagctcct cctgtggccc ggacacccat 5401 agacggtgcg gggggcctgg ctgggccaga ccccaggaag gtggggtaga ctggggggat 5461 cagctgccca ttgctcccaa gaggaggaga gggaggctgc agacgcctgg gactcagacc 5521 aggaagctgt gggccctcct gctccacccc catcccactc ccacccatgt ctgggctccc 5581 aggcagggaa cccgatctct tcctttgtgc tggggccagg cgagtggaga aacgccctcc 5641 agtctgagag caggggaggg aaggaggcag cagagttggg gcagctgctc agagcagtgt 5701 tctggcttct tctcaaaccc tgagcgggct gccggcctcc aagttcctcc gacaagatga 5761 tggtactaat tatggtactt ttcactcact ttgcaccttt ccctgtcgct ctctaagcac 5821 tttacctgga tggcgcgtgg gcagtgtgca ggcaggtcct gaggcctggg 
gttggggtgg 5881 agggtgcggc ccggagttgt ccatctgtcc atcccaacag caagacgagg atgtggctgt 5941 tgagatgtgg gccacactca cccttgtcca ggatgcaggg actgccttct ccttcctgct 6001 tcatccggct tagcttgggg ctggctgcat tcccccagga tggcttcgag aaagacaaac 6061 ttgtctggaa accagagttg ctgattccac ccggggggcc cggctgactc gcccatcacc 6121 tcatctccct gtggacttgg gagctctgtg ccaggcccac cttgcggccc tggctctgag 6181 tcgctctccc acccagcctg gacttggccc catgggaccc atcctcagtg ctccctccag 6241 atcccgtccg gcagcttggc gtccaccctg cacagcatca ctgaatcaca gagcctttgc 6301 gtgaaacagc tctgccaggc cgggagctgg gtttctcttc cctttttatc tgctggtgtg 6361 gaccacacct gggcctggcc ggaggaagag agagtttacc aagagagatg tctccgggcc 6421 cttatttatt atttaaacat ttttttaaaa agcactgcta gtttacttgt ctctcctccc 6481 catcgtcccc atcgtcctcc ttgtccctga cttggggcac ttccaccctg acccagccag 6541 tccagctctg ccttgccggc tctccagagt agacatagtg tgtggggttg gagctctggc 6601 acccggggag gtagcatttc cctgcagatg gtacagatgt tcctgcctta gagtcatctc 6661 tagttcccca cctcaatccc ggcatccagc cttcagtccc gcccacgtgc tagctccgtg 6721 ggcccaccgt gcggccttag aggtttccct ccttcctttc cactgaaaag cacatggcct 6781 tgggtgacaa attcctcttt gatgaatgta ccctgtgggg atgtttcata ctgacagatt 6841 atttttattt attcaatgtc atatttaaaa tatttatttt ttataccaaa tgaatacttt 6901 tttttttaag aaaaaaaaga gaaatgaata aagaatctac tcttggctgg ctctccggag 6961 tgtactgatg tggggagatg ggctggaagg gctgggactg tccctgtcct gggcaccagc 7021 caagtgggac tcagcgaagg gtggaggagg gtgggggagg ggcacctggc ataggtgggg 7081 gcagttaggt ggtattttgg ccaaggcaga acaaggtggg tggtgtctag atcatggggt 7141 gcccccaagg agagagatgg attgctcaga ggtaaagggg gtgctgggca cggtgggtca 7201 ctcctgtaat cctagcactt tgggaggctg aggcaggtgg atcatttgag cccaggaatt 7261 cgagaccagc ctggccaaca tggtgaaacc ctgtctacaa aatatacaaa cagccagatg 7321 ctgtggcgtc cgcctgtggt cccagctact cggggtgctg aggtgggagg atcccttgat 7381 cccaggaggt ggaggctgcg gtgagccatg attgcgccac tgcactgcag cctgggtgac 7441 agaggaagac cctgtctcaa aaaaaaaaaa aaaaaaaaaa aagaagtaaa cggggtgccg 7501 tgtgatatct cagctcattc cctcccaact cctcacttaa cctcatggga tctccaggag 
7561 ttgccatccc cacataccag aggaagaaat cgaggctcag agccatgaaa ccac // LOCUS HUMANFA 2710 bp DNA PRI 01-NOV-1994 DEFINITION Human atrial natriuretic factor (PND) gene, complete cds. ACCESSION K02043 NID g178629 KEYWORDS atrial natriuretic factor; hormone; natriuretic factor; preprocardiodilatin; pronatriodilatin. SOURCE Human DNA (genomic library of Lawn et al.), clones pHGRB1 and lambda-hPND13; and cDNA to atrial mRNA, clones phANP1 and phANP82 (see comment). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 480 to 692; 815 to 1141; 2235 to 2537) AUTHORS Oikawa,S., Imai,M., Ueno,A., Tanaka,S., Noguchi,T., Nakazato,H., Kangawa,K., Fukuda,A. and Matsuo,H. TITLE Cloning and sequence analysis of cDNA encoding a precursor for human atrial natriuretic polypeptide JOURNAL Nature 309 (5970), 724-726 (1984) MEDLINE 84219799 REFERENCE 2 (bases 1 to 1839; 2124 to 2548) AUTHORS Nemer,M., Chamberland,M., Sirois,D., Argentin,S., Drouin,J., Dixon,R.A., Zivin,R.A. and Condra,J.H. TITLE Gene structure of human cardiac hormone precursor, pronatriodilatin JOURNAL Nature 312 (5995), 654-656 (1984) MEDLINE 85061626 REFERENCE 3 (bases 1 to 2583) AUTHORS Greenberg,B.D., Bencen,G.H., Seilhamer,J.J., Lewicki,J.A. and Fiddes,J.C. TITLE Nucleotide sequence of the gene encoding human atrial natriuretic factor precursor JOURNAL Nature 312 (5995), 656-658 (1984) MEDLINE 85061627 REFERENCE 4 (bases 2105 to 2710; 7 to 1792) AUTHORS Seidman,C.E., Bloch,K.D., Klein,K.A., Smith,J.A. and Seidman,J.G. TITLE Nucleotide sequences of the human and mouse atrial natriuretic factor genes JOURNAL Science 226 (4679), 1206-1209 (1984) MEDLINE 85065766 REFERENCE 5 (bases 1 to 1839; 2124 to 2548) AUTHORS Drouin,J. JOURNAL Unpublished (1985) COMMENT [2] revised by [5]. [5] revises [2]. A potential enhancer sequence is located at positions 203-213. A TATA box is present at positions 442-447. 
A potential polyadenylation signal is present at positions 2509-2514. A potential glucocorticoid receptor binding site is present at positions 1283-1297. Two Alu repeats are present within intron B [4]. Another human PND allele has been sequenced (see separate entry). The sequence in [4] is somewhat mislabelled in figure 2: within intron B some of the human sequence is labelled as mouse. A revision of the sequence in [2] was kindly sent by J. Drouin. The individual revisions are not annotated in the FEATURES table. Complete source information: Human DNA [4] (genomic library of Lawn et al. [2]), clones pHGRB1 [3] and lambda-hPND13 [2],[5]; and cDNA to atrial mRNA, clones phANP1 and phANP82 [1]. FEATURES Location/Qualifiers source 1..2710 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p36" prim_transcript 473..2537 /note="PND mRNA [1],[2]" gene join(570..692,815..1141,2235..2240) /gene="PND" CDS join(570..692,815..1141,2235..2240) /gene="PND" /codon_start=1 /db_xref="GDB:G00-118-727" /product="natriodilatin" /db_xref="PID:g178630" /translation="MSSFSTTTVSFLLLLAFQLLGQTRANPMYNAVSNADLMDFKNLL DHLEEKMPLEDEVVPPQVLSEPNEEAGAALSPLPEVPPWTGEVSPAQRDGGALGRGPW DSSDRSALLKSKLRALLTAPRSLRRSSCFGGRMDRIGAQSGLGCNSFRY" exon <570..692 /gene="PND" /note="prepronatriodilatin; G00-118-727" /number=1 sig_peptide 570..644 /gene="PND" /note="prepronatriodilatin signal peptide" intron 693..814 /gene="PND" /note="G00-118-727" /number=1 exon 815..1141 /gene="PND" /note="G00-118-727" /number=2 conflict 886 /gene="PND" /citation=[4] /replace="" mat_peptide join(1061..1141,2235..2237) /gene="PND" /note="G00-118-727" /product="atrial natriuretic peptide" intron 1142..2234 /gene="PND" /note="G00-118-727" /number=2 conflict 1331..1334 /citation=[4] /replace="" conflict 1349..1356 /citation=[4] /replace="" conflict 2106 /citation=[4] /replace="" conflict 2111 /citation=[4] /replace="" conflict 2119 /citation=[4] /replace="" conflict 2131 /citation=[4] /replace="" BASE COUNT 667 a 665 c 768 g 610 t 
ORIGIN 1 bp upstream of BamHI site. 1 ggatccattt gtctcgggct gctggctgcc tgccatttcc tcctctccac ccttatttgg 61 aggccctgac agctgagcca caaacaaacc aggggagctg ggcaccagca agcgtcaccc 121 tctgtttccc cgcacggtac cagcgtcgag gagaaagaat cctgaggcac ggcggtgaga 181 taaccaagga ctctttttta ctcttctcac acctttgaag tgggagcctc ttgagtcaaa 241 tcagtaagaa tgcggctctt gcagctgagg gtctgggggg ctgttggggc tgcccaaggc 301 agagaggggc tgtgacaagc cctgcggatg ataactttaa aagggcatct cctgctggct 361 tctcacttgg cagctttatc actgcaagtg acagaatggg gagggttctg tctctcctgc 421 gtgcttggag agctgggggg ctataaaaag aggcggcact gggcagctgg gagacaggga 481 cagacgtagg ccaagagagg ggaaccagag aggaaccaga ggggagagac agagcagcaa 541 gcagtggatt gctccttgac gacgccagca tgagctcctt ctccaccacc accgtgagct 601 tcctcctttt actggcattc cagctcctag gtcagaccag agctaatccc atgtacaatg 661 ccgtgtccaa cgcagacctg atggatttca aggtagggcc aggaaagcgg gtgcagtctg 721 gggccagggg gctttctgat gctgtgctca ctcctcttga tttcctccaa gtcagtgagg 781 tttatccctt tccctgtatt ttccttttct aaagaatttg ctggaccatt tggaagaaaa 841 gatgccttta gaagatgagg tcgtgccccc acaagtgctc agtgagccga atgaagaagc 901 gggggctgct ctcagccccc tccctgaggt gcctccctgg accggggaag tcagcccagc 961 ccagagagat ggaggtgccc tcgggcgggg cccctgggac tcctctgatc gatctgccct 1021 cctaaaaagc aagctgaggg cgctgctcac tgcccctcgg agcctgcgga gatccagctg 1081 cttcgggggc aggatggaca ggattggagc ccagagcgga ctgggctgta acagcttccg 1141 ggtaagagga actggggatg gaaatgggat gggatggaca ctactgggag acaccttcag 1201 caggaaaggg accaatgcag aagctcattc cctctcaagt ttctgcccca acacccagag 1261 tgccccatgg gtgtcaggac atgccatcta ttgtccttag ctagtctgct gagaaaatgc 1321 ttaaaaaaaa aagggggggg gctgggcacg gtcgtcacgc ctgtaatccc agcactttgg 1381 gaggccaggc agcggatcat gaggtcaaga gatcaagact atcctggcca acatggtgaa 1441 accccagctc tactaaaaat acaaaaatta gctgggtgtg tggcgggcac ctgtactctc 1501 agctacttgg gaggctgagg caggagaatc acttgaaccc aggaggcaga ggttgcagtg 1561 agcagagatc acgccactgc agtccagcct aggtgataga gcgagactgt ctcaaaaaaa 1621 aaaaaaaaag gccaggcgcg gtggctcacg cctgtaatcc cagcgctttg 
ggaggccaag 1681 gcgggtggat cacgaggtca ggagatggag accatcctgg ctaacacggt gaaaccccgt 1741 ctctactaaa aatacaaaaa attagccagg cgtggtggca ggcgcctgta agtcctagct 1801 actccggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt gcagtgagca 1861 gagatggcac cactgcactc cagcctgggc gacagagcaa gactccgtct caaaaaaaaa 1921 aaaaaaaaaa gcaactgcca ctagcactgg gaaattaaaa tattcataga gccaagttat 1981 ctttgcatgg ctgattagca gttcatattc ctccccagaa ttgcaagatc ctgaagggct 2041 taagtgaaat ttactctgat gagtaacttg cttatcaatt catgaagctc agagggtcat 2101 caggctgggg tgggggccgg tgggaagcag gtggtcagta atcaagttca gaggatgggc 2161 acactcatac atgaagctga cttttccagg acagccaggt caccaagcca gatatgtctg 2221 tgttctcttt gcagtactga agataacagc cagggaggac aagcagggct gggcctaggg 2281 acagactgca agaggctcct gtcccctggg gtctctgctg catttgtgtc atcttgttgc 2341 catggagttg tgatcatccc atctaagctg cagcttcctg tcaacacttc tcacatctta 2401 tgctaactgt agataaagtg gtttgatggt gacttcctcg cctctcccac cccatgcatt 2461 aaattttaag gtagaacctc acctgttact gaaagtggtt tgaaagtgaa taaacttcag 2521 caccatggac agaagacaaa tgcctgcgtt ggtgtgcttt ctttcttctt gggaagagaa 2581 ttcaggccga tattccttgt cgttttactc tttgtcagag gaaagaatgc tgagtttttc 2641 ttcttccttt catttcaccc tccttttttg gtaggtggtt gggaggccta attcatctag 2701 tgggtttttt // LOCUS HUMANT1 5768 bp DNA PRI 31-OCT-1994 DEFINITION Human heart/skeletal muscle ATP/ADP translocator (ANT1) gene, complete cds. ACCESSION J04982 NID g178658 KEYWORDS ATP/ADP translocator; mitochondrial inner membrane protein. SOURCE Human fetal liver DNA (library of T. Maniatis). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5768) AUTHORS Li,K., Warner,C.K., Hodge,J.A., Minoshima,S., Kudoh,J., Fukuyama,R., Maekawa,M., Shimizu,Y., Shimizu,N. and Wallace,D.C. TITLE A human muscle adenine nucleotide translocator gene has four exons, is located on chromosome 4, and is differentially expressed JOURNAL J. Biol. Chem. 
264 (24), 13998-14004 (1989) MEDLINE 89340499 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Li, 17-JUN-1989. FEATURES Location/Qualifiers source 1..5768 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4" CAAT_signal 1413..1417 TATA_signal 1473..1477 prim_transcript 1498..5492 /note="ANT1 mRNA and introns" gene join(1607..1717,2985..3471,3980..4120,5035..5192) /gene="ANT1" CDS join(1607..1717,2985..3471,3980..4120,5035..5192) /gene="ANT1" /note="ATP/ADP translocator" /codon_start=1 /db_xref="GDB:G00-119-680" /db_xref="PID:g178659" /translation="MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQVQHASKQ ISAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGG VDRHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGLGDC IIKIFKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQ SVTAVAGLVSYPFDTVRRRMMMQSGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWS NVLRGMGGAFVLVLYDEIKKYV" exon <1607..1717 /gene="ANT1" /note="ATP/ADP translocator" /number=1 intron 1718..2984 /note="ANT1 intron A" exon 2985..3471 /gene="ANT1" /number=2 intron 3472..3979 /note="ANT1 intron B" exon 3980..4120 /gene="ANT1" /number=3 intron 4121..5034 /note="ANT1 intron C" exon 5035..>5192 /gene="ANT1" /note="ATP/ADP translocator" /number=4 BASE COUNT 1481 a 1377 c 1404 g 1506 t ORIGIN Chromosome 4; 298 bp upstream of PstI site. 
1 tctacttata ttcaatccac agggctacac ctagttcttg gtacacagta catgctcagc 61 aagagtctgt tgaatgaaca catacatggt ttatctgttt gtctcttccg agttcttgac 121 ttctgtctgc tctgacctct ggcagctttc cactagtttc tagctttcat tctgcttacc 181 tggatttcgg aactctagcc tgccccactc ttagataaac gcatgccctc tgtggccctg 241 gaaccttagt gacttctgct ataccaaagt ctccacgccc agggtgacac gcagctgcag 301 ctccgtaaac ctctaacatg atgtcagcaa atattaaaaa aaaaaagttt ataaaaacaa 361 tgaataaact ttgttaaagg tacaaatgaa aattagcaaa catgggaaga taattgagta 421 aagagtttaa agttaaaaac gaattgcagt cattctaggg gaaggaacag ttgtatttga 481 aaacctgtat ggttacatga actgcctaaa aaacaagcta aggaaaatta aagctcagat 541 ttatatattt taagaaatta attgcaatta atttcctggg attaaatagc atttcctcaa 601 ccccagctgt cattaaaaag aggcaaatac agccaaggac tggatcttct ccggaaggct 661 gacagcactg accctcaaga aggcaccggc tgacagacag aacattctgc cctaatatgt 721 gctgaaattc cgctgagagc agagtggtac attgaaccct ttaggggctt acaaaagaag 781 tgtcctgtgt tttagagtca cagagttttg cagaaacaag tatgaattca cctagtggcc 841 ccctgcacca ggtctttcct gtgggcactg agtgcagaca catcaatatg taatagcaga 901 atgaatgact gaacgaacga ttgaatgaaa agaaatgaga ggcagcaggt tgtcagattc 961 tatgaggcaa tcacagcatc aggtgacctt agtatctatt tgagaggact gccatttatt 1021 ctcgggagcg cacggctcta aagaggccca tatccaggca gtgagctctg gtggggggcg 1081 cctttagatg caagaaggag gaaacagctc gaaatccctg ggcctgagcg cggcccgtgc 1141 aggccggagg gtcaagaact ctccaccggc ggcagcggcc cggtgtctgc cccggcttcg 1201 ccccggccta aggctgcctg tgctataaat acgcggccca catgccgcgg tgacacggtg 1261 ttccctgggc tcggcgggac agataacatg aatgtgccct ttaaacgtcc caagttgcag 1321 ggacagcccc cggcccagcc tcgctcccgg aagcgccttc gcccccgatg ccctctgcag 1381 ctgggaggag ggggcgcccc gcacctgccc agccaatgcg cggcgcgagc gccggccgcg 1441 acccgcctcc tctcgcgaga gcccggcggg gatataaggg ggagctgcgg gccaggcggc 1501 ggccccctag cgtcgcgcag ggtcggggac tgcgcgcggt gccaggccgg gcgtgggcga 1561 gagcacgaac gggctgcctg cgggctgaga gcgtcgagct gtcaccatgg gtgatcacgc 1621 ttggagcttc ctaaaggact tcctggccgg gggcgtcgcc gctgccgtct ccaagaccgc 1681 ggtcgccccc atcgagaggg 
tcaaactgct gctgcaggtg aggaccgcgc ggtgcaagag 1741 gcgggcgcgg gcgcggcggg ccgggcgggg cgcgcgatgc ggcgcgagct gcagggcgcg 1801 gggcgccgcg gaaaatctgc gccaggccac aggcccgggc gcccgcccgc ccgcggggga 1861 agaaggtgcc ctctgcgtag agacaggtcc agcgtcagtc gcagattcct ggtgtcgggt 1921 ggcgcccggc gttcgggtgt ctatatatgg aaacccaccc ggagccggtt tacgtgtgcc 1981 agatcctgcg cccgtgacag cacgggcgtg cactcaggcc cggaggcacc tagtgattgc 2041 cagtattttt ggcaccgtct tatgcgcacg cacctttaca ataaaaacat caaaataatc 2101 atcacccaag aattccctta tcgtatctca tgcacaatgc tgtatgtagg ctgacgcctt 2161 catctttatg taacctctgt gagagagtta ttcttctcca ttttacagat gaagctgagg 2221 ttttgaaata ttaagaaaca attttcggaa taaactcaga tcatcctgtc tccaaatctt 2281 ttcctcccct acctggtcgc tgaatggttt atcatcctct cgtgttttcc tccacctgcc 2341 caaaaggtca gggcccctca atgaggaaga gcccaatttg ggagtcagaa ttactaacaa 2401 caaaaccccc acaaattgct cacaacggca gcaaaccctt aataattgat tacttggatt 2461 atctgcttga aaactttgga ggcctaatgt ttagtggatt tattctcctt cctctattag 2521 agcatctagt agagatcctc atctccaggg tgatcagagt gacactgaga aattgtcatt 2581 ttttggccat catgtctatt aaatccaaag ccctttgaag cagggagtgt tactcatttc 2641 tgtcccccag taagcccctc atacagttct caaacctagg gaaagtgaaa taaataaatg 2701 gctatagctt tatataattc aatcaccttt tcagtttatt tggggcaata cctttccctc 2761 aaatacccta ataattgaag caacattgga ttattttggc ttgttatcca gtaactaaca 2821 tggataacag tatccattta cacgtcctcg tatccatttg atttcctcat cctttttttc 2881 ttcaaaaaaa aaatctagga agtgcaaacc tttttttttt ctcctgtcct cttcccttct 2941 ctctaccctg cctgtcctct gtcacccacc ctcccctcca ccaggtccag catgccagca 3001 aacagatcag tgctgagaag cagtacaaag ggatcattga ttgtgtggtg agaatcccta 3061 aggagcaggg cttcctctcc ttctggaggg gtaacctggc caacgtgatc cgttacttcc 3121 ccacccaagc tctcaacttc gccttcaagg acaagtacaa gcagctcttc ttagggggtg 3181 tggatcggca taagcagttc tggcgctact ttgctggtaa cctggcgtcc ggtggggccg 3241 ctggggccac ctccctttgc tttgtctacc cgctggactt tgctaggacc aggttggctg 3301 ctgatgtggg caagggcgcc gcccagcgtg agttccatgg tctgggcgac tgtatcatca 3361 agatcttcaa gtctgatggc ctgagggggc 
tctaccaggg tttcaacgtc tctgtccaag 3421 gcatcattat ctatagagct gcctacttcg gagtctatga tactgccaag ggtgagagag 3481 gggcatcggg gagaaggagg gtggtgtgga aagaggatcc tatgggatct ataactcaca 3541 aaggacctga tatatattga tcttgttttt tctagtctct gggataattg aggcttctga 3601 atgaggaggt gatgtgcata agttaatagc tgaagcgttc cttgtgtcct ctactgaaat 3661 aaactctggc ctttagttat tcagagagga ggagggggga gcctgtctcc ctctagacac 3721 agccatagca gttactgagt ttaacttgaa gccacttcca atgccctgta tacaagctga 3781 gcactgcccc tccggggtcc ggagagggca gcagccacct ttgctgtctg cctggtcata 3841 tgtgaagcac ctgcacaggg gcaggttccc cgcaaggtca gagcatggag ctggaggtgc 3901 agtggcctct ctccctccac ctgctttctg ctgagaacag gcacttcata gccgttcggc 3961 ttctgggctc tgtccacagg gatgctgcct gaccccaaga acgtgcacat ttttgtgagc 4021 tggatgattg cccagagtgt gacggcagtc gcagggctgg tgtcctaccc ctttgacact 4081 gttcgtcgta gaatgatgat gcagtccggc cggaaagggg gtaagcttgt gctctactca 4141 tctaaacttg tttggttttg cccgaggaga acattttaca gggctccttt cagtcttcct 4201 tactggaaat taattttcaa aattatttga taaggactta gggaagaaag atggtattaa 4261 ttccccctaa cgttctcaac tatcctatta gggaaaagta ttttccattt tattagagat 4321 gataagaaca tgaatagtaa gacatttaga tgtgaattta actaggtatc cagcattata 4381 gagaccctag gccctcttcc cttagagcct gggtgcaaaa gctagggaaa agaagtagtt 4441 agctacttct tacaaagaac tcttgcttcc ctcctagtta caggtgttag tgggatgggg 4501 tgtttagctg ggtagagatg gcctgaagca atctgttgtg ccagagaaag ttttggcttc 4561 tataggttga accatatgaa attgccactt taaaagtcaa aaacagtcca atgttagcag 4621 tttcgtatgt ttcaacgaat agttacagcc ttttatttag actgcataac ctcgtgcagg 4681 atcatctgag gctcagcctc agttcggtcc tccataaaaa aaggtaaccg cgtagcataa 4741 tactcctgct ccactgcgcc cttcttgttt cgcagttggg cagtccatga attacttggt 4801 taattgcccc agttcttcac tgaccttgaa ctaatggagt aggaatgaca ggagacccag 4861 cctgccagtg aagcaaggaa ggagatgtcc agtgggatgt tgcatggagc tgggactcca 4921 tgcccagatg accctgattt tataaaactg gtaacagtgt gtacagatat gtttcagggg 4981 aaaagtctct ttcctccagc gttacggagc cctcaccagc atttgtttcc acagccgata 5041 ttatgtacac ggggacagtt gactgctgga ggaagattgc 
aaaagacgaa ggagccaagg 5101 ccttcttcaa aggtgcctgg tccaatgtgc tgagaggcat gggcggtgct tttgtattgg 5161 tgttgtatga tgagatcaaa aaatatgtct aatgtaatta aaacacaagt tcacagattt 5221 acatgaactt gatctacaag ttcacagatc cattgtgtgg tttaatagac tattcctagg 5281 ggaagtaaaa agatctggga taaaaccaga ctgaaggaat acctcagaag agatgcttca 5341 ttgagtgttc attaaaccac acatgtattt tgtatttatt ttacatttaa attcccacag 5401 caaatagaaa ataatttatc atacttgtac aattaactga agaattgata ataactgaat 5461 gtgaaacatc aataaagacc acttaatgca cgctttctat tttattgaac tcttattaac 5521 tgtaaaatgc atttttaaaa gatcaaaaat gcatattttc tagcatgatt catgtatcag 5581 tcagcagcca agcttctaaa tgccagatat tatattgaga atgtattata tgagaacgta 5641 caatgcttaa agttccggtt ttcaaactta ggcaggtcat attctatcta tcttatccag 5701 cgttactgta ggctagaaag tgataatggc tttcataatc ctgccttgtc ttaggcactt 5761 tcctgcag // LOCUS HUMANT2X 4982 bp DNA PRI 31-OCT-1994 DEFINITION Human adenine nucleotide translocator-2 (ANT-2) gene, complete cds. ACCESSION M57424 J05624 NID g178660 KEYWORDS adenine nucleotide translocator-2. SOURCE Human placenta DNA, clone 21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4982) AUTHORS Ku,D.H., Kagan,J., Chen,S.T., Chang,C.D., Baserga,R. and Wurzel,J. TITLE The human fibroblast adenine nucleotide translocator gene. Molecular cloning and sequence JOURNAL J. Biol. Chem. 
265 (27), 16060-16063 (1990) MEDLINE 90375457 FEATURES Location/Qualifiers source 1..4982 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="Xq13-q26" CAAT_signal 1128..1132 /gene="ANT2" /note="G00-125-190" CAAT_signal 1455..1459 /gene="ANT2" /note="G00-125-190" CAAT_signal 1817..1821 /gene="ANT2" /note="G00-125-190" TATA_signal 1907..1910 /gene="ANT2" /note="G00-125-190" mRNA join(1936..2114,3124..3610,3835..3975,4350..4767) /gene="ANT2" /note="G00-125-190" /product="adenine nucleotide translocator-2" gene join(1936..2114,3124..3610,3835..3975,4350..4767) /gene="ANT2" exon 1936..2114 /gene="ANT2" /note="G00-125-190" /number=1 CDS join(2004..2114,3124..3610,3835..3975,4350..4507) /gene="ANT2" /codon_start=1 /db_xref="GDB:G00-125-190" /product="adenine nucleotide translocator-2" /db_xref="PID:g178661" /translation="MTDAAVSFAKDFLAGGVAAAISKTAVAPIERVKLLLQVQHASKQ ITADKQYKGIIDCVVRIPKEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGG VDKRTQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKAGAEREFRGLGDC LVKIYKSDGIKGLYQGFNVSVQGIIIYRAAYFGIYDTAKGMLPDPKNTHIVISWMIAQ TVTAVAGLTSYPFDTVRRRMMMQSGRKGTDIMYTGTLDCWRKIARDEGGKAFFKGAWS NVLRGMGGAFVLVLYDEIKKYT" intron 2115..3123 /gene="ANT2" /note="G00-125-190" /number=1 exon 3124..3610 /gene="ANT2" /note="G00-125-190" /number=2 intron 3611..3834 /gene="ANT2" /note="G00-125-190" /number=2 exon 3835..3975 /gene="ANT2" /note="G00-125-190" /number=3 intron 3976..4349 /gene="ANT2" /note="G00-125-190" /number=3 exon 4350..4767 /gene="ANT2" /note="G00-125-190" /number=4 polyA_signal 4750..4755 /gene="ANT2" /note="G00-125-190" BASE COUNT 1211 a 1226 c 1265 g 1280 t ORIGIN 1 gagctctgga atagaataca gtagaggcat catgctcaaa gagagtagca gatgtggcca 61 gggaaaggtc acatgtagaa aaagggatac acaagtgatg gcgaggtgat cgaggataat 121 taacagttga ataaaaatgc attaatcaga atgtcaagag ttcaaactag aaagattttg 181 aattttaaat gcatataaat aaaatccacc tctctccaaa taaagccctc tgtagaattt 241 tgctttagct cttgtttctc tcaccagaat agagaagcga gaatcacctt tggaggatat 301 actctaccat tctctgaaac 
tgtttttttg cattcagcta cctcgctatt ataatcagct 361 gagaaacacc atagccaaaa ctaacacagg tcaaagattt ctcccaggct caacaaagct 421 agattcaaaa ttgctccttt gaacacatgc aatttgtgtg catcagtttc gatcgctggg 481 gttcttgagg agactaacaa attgcaaagt gtactttcag aggaagggcc taacatcctg 541 gggggtccta gaattgggtc ctttcacagg cagatctgga gaagatagga gagggcagga 601 aaggaaagtg aaggatatga acactatgtg ttcagtgtgc aaaggcctgc tggcattttg 661 cctatgggaa ttctgaagtc tgggcacata aggttcagct gcagttgcag ttttcaacag 721 tcgtccttat ttttataaac attgttatga ttattgacac ggaacttgtg attacactat 781 ttgctcctgt tctgaacttg agggagtcca cttaagctaa caacggaggt ttttgaggac 841 atgtgtggca aatgcttccc agtgaaggga ttaaggggct cagttaagca cttgtccatt 901 ggcacaatgc tgccaaactt aattcatttg attcctctaa atgtttctgg agttcagaaa 961 cttatcaatc caagggcact ttacctgggg tcctgggagc caaatgtcaa cgtacttctt 1021 aaccttccta agcctcttaa ccttctgggt acaacctcta tatttgtctt ctggctaaaa 1081 tcatgcattc cctgcccctg gacccctgac ctttggcact tcccaggcca attcctggca 1141 agtctttgct gaagtgattt gcaaaccagg tctggggtac cggtggcagt tccttcgtct 1201 gctgttccct taatttagtc tggtttcact ggctccagga cagacgaacg gttctagagt 1261 ggagaagagt gaggacattc accttggtga aaaacaactc gtaaatttaa gacaaatgat 1321 tttttcagta ctattttaaa aaaatcagaa gcaatgcaaa aatccatcat caacaaatga 1381 gcaacgacag gatcccaacc tgcacttgca tgaccttgtc tcgcttgcct caccctaaac 1441 ccagccttga cactccaatt aaactttatt tacaaaacaa gggggccggc cagtaggatg 1501 tagtttgccc atacgacttt tttaaagtat cgcattgact actgtttatc tcgatgactg 1561 aagggttctt ttggcatccc tgtagcaaat gcgtctcacc ctagtcctgg tcctgctcca 1621 agggtttttg tccaggcaca tcgtgacctc acccttcctc ccctctccga ggcctctctc 1681 agggtccagc gttcaagtcc cgggtgttct ctggacccgc cccttgctct cgccgggtca 1741 ggtgccgagg ggagcacggg cggcgcggag agcagtcccg gcccgccctc cacgactcct 1801 cctcctgcga gctgggccac tcaggtctgc cgcccagcgc gccggggccc agaccccgcc 1861 ccggccccgc ctccgacgcc tgccgctcca gctccggctc cccctatata aatcggccat 1921 ttgcttccgt ccgccccgca gcgccggagt caaacggttc ccggcccagt cccgtcctgc 1981 agcagtctgc ctcctctttc aacatgacag atgccgctgt 
gtccttcgcc aaggacttcc 2041 tggcaggtgg agtggccgca gccatctcca agacggcggt agcgcccatc gagcgggtca 2101 agctgctgct gcaggtacgt ctgggatcca gagcccaacc aggaagtggg ggaagggtcg 2161 cacagaaggc gggcgccgag gggtggcggg gagcgaactc taaagacatg gccagggaag 2221 cggcttagga gaggccagag cgggcgcaga ggcagaacag aagtcaaact ggtgggaggc 2281 gccctttagt gacctgaagt agtgagtcta ggaaggggcc ggggcagagg gcaggaccag 2341 gctctcggca tctccgaggc ggctgactcg atggagcagt ttctgagtga cttcctcccc 2401 tttcctgggc gtcaagggcg aagcgtccgc gttagaaaga ggaactccag ttttaccgaa 2461 gacctcaagg ttctgcaagg agataactgc ccgggggagg ccatgcgccc gggtccagcg 2521 gcctcccagc ccgcggacgc ctcaaacctc gccgggccgg aagcccggcg ccgggaagcc 2581 ggtgtgcctt ttacgtccgc ccccgcgcag ccgcgaccgc tgccggcgtc tccgcctgcc 2641 tccctgcgcc gcggctccag tgccggctct agagggcgct cctgggctag cgtgtagggc 2701 tggcggcggc ggcgtcgggt cacctctggg agcggagtgc gggcggagcg agacggaagc 2761 agctcaggag acttgaggcg taggctccgc ctcccaaggt gaccgcgccc tatgtgggac 2821 tcgccctaat gcctctgaac ctgggtttga ggtaatgacc tttctcctag gtctgaaggt 2881 cacgggtccg ctggaggatg ccccctctcc actcagaggg gtggaggctt aatgctactg 2941 gtgcagatca cctcttcccc tgtgacagcc tcagagggtt gggagggtcc agccagtatg 3001 atatacgaag actagatttg agagagggga gcctacctta agggcattga tcgagatggc 3061 ataagctctt ctctttccct tccccatggt tataactgtc cctgttggct tccttcctgt 3121 caggtgcagc atgccagcaa gcagatcact gcagataagc aatacaaagg cattatagac 3181 tgcgtggtcc gtattcccaa ggagcaggga gttctgtcct tctggcgcgg taacctggcc 3241 aatgtcatca gatacttccc cacccaggct cttaacttcg ccttcaaaga taaatacaag 3301 cagatcttcc tgggtggtgt ggacaagaga acccagtttt ggcgctactt tgcagggaat 3361 ctggcatcgg gtggtgccgc aggggccaca tccctgtgtt ttgtgtaccc tcttgatttt 3421 gcccgtaccc gtctagcagc tgatgtgggt aaagctggag ctgaaaggga attccgaggc 3481 ctcggtgact gcctggttaa gatctacaaa tctgatggga ttaagggcct gtaccaaggc 3541 tttaacgtgt ctgtgcaggg tattatcatc taccgagccg cctacttcgg tatctatgac 3601 actgcaaagg gtaagtttgc tgtgggcttt aacgttgtgt tcttaggaga cagtttaaaa 3661 gagcattgta ccaacctaac agtccaagag ctaaagagtt gtttttttaa 
ttgctaaagg 3721 aagccaagat catccaatgc aacccttgtg tacagatgac gtgtttaggg gatgtgggga 3781 aaggaagtca gtaaaacttc tgctttttgg taaagatctc tttcctattc ctaggaatgc 3841 ttccggatcc caagaacact cacatcgtca tcagctggat gatcgcacag actgtcactg 3901 ctgttgccgg gttgacttcc tatccatttg acaccgttcg ccgccgcatg atgatgcagt 3961 cagggcgcaa aggaagtaag ttccacttga gacagaagac aaagttgtag tcgtggggca 4021 atctgctgcc acaaactggt gatacatacc tttaaaaatg gctgtctgtc caagtcaagg 4081 gatggggttg atagcatctg tgtctgttcc acaactgcct ttgagcctgc cctcagatgc 4141 catgaggtgc ttaaatggtg taagaccaat gggtagcctg tatcctgtgg ttcatagtat 4201 taatatttca gtgttgccca tgctaatgtg tgaatgttgg atttaaagct gacgttctca 4261 gaggtggggc tctgctttat ttagcctagt gaatcttagg atttttcatc ggccttcagt 4321 cactaactcc acgtctttat tctttgcagc tgacatcatg tacacaggca cgcttgactg 4381 ctggcggaag attgctcgtg atgaaggagg caaagctttt ttcaagggtg catggtccaa 4441 tgttctcaga ggcatgggtg gtgcttttgt gcttgtcttg tatgatgaaa tcaagaagta 4501 cacataagtt atttcctagg atttttcccc ctgtgaacag gcatgttgta ttctataaca 4561 caatcttgag cattcttgac agactcctgg ctgtcagttt ctcagtggca actactttac 4621 tggttgaaaa tgggaagcaa taatattcat ctgaccagtt ttcctctaaa gccatttcca 4681 tgatgatgat gatgggactc aattgtattt tttatttcag tcactcctga taaataacaa 4741 atttggagaa ataaaaatat ctaaaataaa ttttgtctgc agtatatttt catataaaaa 4801 tgcatatttg agtgctacat tcgaataaat actacctttt tagtgaatgc tagattttta 4861 ataaatgcta cagtatctcc ggagatgaag aactgtcttt ttaaaaccaa ttgtcagcag 4921 tccgcttaac agaataactt ggccgtgcca cccacaaaca tttccaacac attagcaaag 4981 ga // LOCUS HUMAPOA4A 3613 bp DNA PRI 11-APR-1996 DEFINITION Human apolipoprotein A-IV gene, complete cds. ACCESSION J02758 NID g178756 KEYWORDS apolipoprotein. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3613) AUTHORS Elshourbagy,N.A., Walker,D.W., Paik,Y.K., Boguski,M.S., Freeman,M., Gordon,J.I. and Taylor,J.M. 
TITLE Structure and expression of the human apolipoprotein A-IV gene JOURNAL J. Biol. Chem. 262 (17), 7973-7981 (1987) MEDLINE 87250378 REFERENCE 2 (sites) AUTHORS de Temmerman,P., Visvikis,S., Boerwinkle,E. and Siest,G. TITLE Study of the sequence tagged site (STS) in the beginning of human apo A4 gene region JOURNAL Nucleic Acids Res. 18 (18), 5576 (1990) MEDLINE 91016872 COMMENT Draft entry and computer-readable sequence [1] kindly provided by N.A.Elshourbagy, 15-APR-1987. FEATURES Location/Qualifiers source 1..3613 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHAIVG[5.1,5.2]" /clone_lib="C.Lau" prim_transcript 893..3495 /note="APO mRNA and introns" primer_bind complement(986..1005) CDS join(1006..1054,1412..1538,2316..3330) /gene="APOA4" /note="apolipoprotein A-IV precursor" /codon_start=1 /db_xref="PID:g178757" /db_xref="GDB:G00-119-000" /translation="MFLKAVVLTLALVAVAGARAEVSADQVATVMWDYFSQLSNNAKE AVEHLQKSELTQQLNALFQDKLGEVNTYAGDLQKKLVPFATELHERLAKDSEKLKEEI GKELEELRARLLPHANEVSQKIGDNLRELQQRLEPYADQLRTQVNTQAEQLRRQLTPY AQRMERVLRENADSLQASLRPHADELKAKIDQNVEELKGRLTPYADEFKVKIDQTVEE LRRSLAPYAQDTQEKLNHQLEGLTFQMKKNAEELKARISASAEELRQRLAPLAEDVRG NLRGNTEGLQKSLAELGGHLDQQVEEFRRRVEPYGENFNKALVQQMEQLRTKLGPHAG DVEGHLSFLEKDLRDKVNSFFSTFKEKESQDKTLSLPELEQQQEQHQEQQQEQVQMLA PLES" sig_peptide join(1006..1054,1412..1422) /gene="APOA4" /note="apolipoprotein A-IV signal peptide" gene 1006..3330 /gene="APOA4" /map="11q23-qter" sig_peptide 1006..1054 /gene="APOA4" /note="apolipoprotein A-IV signal peptide" exon <1006..1054 /gene="APOA4" /note="apolipoprotein A-IV precursor; G00-119-000" /number=1 intron 1055..1411 /gene="APOA4" /note="APO intron A" STS order(complement(986..1005),1412..1431) /citation=[2] exon 1412..1538 /gene="APOA4" /number=2 sig_peptide 1412..1422 /gene="APOA4" /note="apolipoprotein A-IV signal peptide" primer_bind 1412..1431 /gene="APOA4" mat_peptide 1423..1538 /gene="APOA4" /note="apolipoprotein A-IV" mat_peptide join(1423..1538,2316..3327) /gene="APOA4" 
/note="apolipoprotein A-IV" intron 1539..2315 /gene="APOA4" /note="APO intron B" mat_peptide 2316..3327 /gene="APOA4" /note="apolipoprotein A-IV" exon 2316..>3330 /gene="APOA4" /note="apolipoprotein A-IV precursor" /number=3 BASE COUNT 867 a 965 c 1082 g 699 t ORIGIN 1 bp upstream of XbaI site; chromosome 11q23-qter. 1 tctagagatt taaagtatgt gggaggctgt gcttaggtta tatgtaaata ctatgccatt 61 ttatatcaag gacttgaaca tccatggatt ttggtatctg cagagggtcc tggaatcatt 121 tcccatggag actgagagat gaccgtacta cccacttcgc aagcaatgtc ttctttaatg 181 tactgaacca tcccattgtt cagaggagaa actgaagctc agggctttga ataactagac 241 caaggaggca cagcatggga gtgggacatg aagcactcta caattaaccc tttcaggaca 301 aggccctgtc tcccacaccc atctgcccaa aggctctcca gggccccctc ctcttgggtg 361 taccttgaca agagacctag attttagctc actatgctgt ctgcagtcct ggatggtccc 421 actccagtgt ctggtgctct gagatggagt cagcattagt ggcggatgtg gagactgggg 481 ggacctgtct tcactggggt agacagagga gatgtggact ttgcccccca tgagcccggc 541 acaaacccga gaggccgcca gcagggcctc gaggcatcag tcccgggctc atgggctccc 601 tgaggtgttt ctcctactgt tttccgttcc cctcctccct tccatgctga ggttggtggg 661 gtgggggtgg gggtgcccac gcacggaaca gccaccactt ctaacctatc gcctgagccc 721 tgatctgctg tcagcttcca cgtagtctca gggtcacaaa agtccaagag gcctcttggg 781 aatgtgtcac cttccagcgt ggagtcacac tgaggaagga ggaggggagg gcagccaggg 841 gggtggcgat agggagagag tttaaatgtc tggctggctc tgagcttcag tcagttccca 901 ctgcagcgca ggtgagctct cctgaggacc tctctgtcag ctcccctgat tgtagggagg 961 atccagtgtg gcaagaaact cctccagccc agcaagcagc tcaggatgtt cctgaaggcc 1021 gtggtcctga ccctggccct ggtggctgtc gccggtgagt agaagctgtc tttggatggc 1081 actcctgggc tgctgctctg agtagtgcag gatggaggct gagccaaagc aaaaggacac 1141 ttctgagtgc ccatcagccc ccagctggac atgaggtctg cctggctgcc aagtggctca 1201 caggagagct ggcccagtcc cagtggtggg cccattggca ttggtgctat accagtttca 1261 catatccctg tggcttccaa aaagctaagc tcagacaggg aaaatggcag gttgaggcac 1321 ccccaccatc atccagtctg cagctcagag ctggagcaga ggggccacac aggagacggg 1381 gcctcatgaa ttgctctctg ttaccaccca ggagccaggg ctgaggtcag 
tgctgaccag 1441 gtggccacag tgatgtggga ctacttcagc cagctgagca acaatgccaa ggaggccgtg 1501 gaacatctcc agaaatctga actcacccag caactcaagt aagagggact acagtgtgtg 1561 gtggtgacgg ggaattctta aaggccatgc aatgtactgg caagggttga gcttagagac 1621 aggagccctg agcttaggat acccactgcc ctgccactaa ctggccgggc ctctgaacct 1681 aggatccaca tatgtaaacc ggaagtttgg accgaataat ccctgccatg tccttttgct 1741 ttgacgttct agagtttgac aaatggccac atcctatcat tcaggctcat ggaagagagg 1801 gagggaggaa aatgtcatgt gagctgattt ctaatacgtt tcagaaagac aggccccagt 1861 ggaatcaagg ggagggaggt gggaatattt gggaggcccc tgggcacagg caaggaaagc 1921 agcaccttgt gccactggaa gaccccagca gaggtcaaga agacaacatt gtgttacaca 1981 atgtgatcct atggcccaga acattccctc tgggaaggac ctcaaagtcc caccctctgc 2041 agacaaggag gggaaagcaa actgctggag gtgacatggt gggtagattc tgagacaaac 2101 tatgtgggag atcctgagat agaaattcag catcgtaact tagtctgtga cacccatcct 2161 ctccaatctg caccaccata gggagggtga actcggtacc tctgagcact cacctgtctt 2221 agcacgtgtg cataaggcga gtggtataca agcagacaaa gtcttgccgt gtaaatgcca 2281 aatgtaacgt ggcctccttg tgcccttccc cacagtgccc tcttccagga caaacttgga 2341 gaagtgaaca cttacgcagg tgacctgcag aagaagctgg tgccctttgc caccgagctg 2401 catgaacgcc tggccaagga ctcggagaaa ctgaaggagg agattgggaa ggagctggag 2461 gagctgaggg cccggctgct gccccatgcc aatgaggtga gccagaagat cggggacaac 2521 ctgcgagagc ttcagcagcg cctggagccc tacgcggacc agctgcgcac ccaggtcaac 2581 acgcaggccg agcagctgcg gcgccagctg accccctacg cacagcgcat ggagagagtg 2641 ctgcgggaga acgccgacag cctgcaggcc tcgctgaggc cccacgccga cgagctcaag 2701 gccaagatcg accagaacgt ggaggagctc aagggacgcc ttacgcccta cgctgacgaa 2761 ttcaaagtca agattgacca gaccgtggag gagctgcgcc gcagcctggc tccctatgct 2821 caggacacgc aggagaagct caaccaccag cttgagggcc tgaccttcca gatgaagaag 2881 aacgccgagg agctcaaggc caggatctcg gccagtgccg aggagctgcg gcagaggctg 2941 gcgcccttgg ccgaggacgt gcgtggcaac ctgaggggca acaccgaggg gctgcagaag 3001 tcactggcag agctgggtgg gcacctggac cagcaggtgg aggagttccg acgccgggtg 3061 gagccctacg gggaaaactt caacaaagcc ctggtgcagc agatggaaca gctcaggacg 
3121 aaactgggcc cccatgcggg ggacgtggaa ggccacttga gcttcctgga gaaggacctg 3181 agggacaagg tcaactcctt cttcagcacc ttcaaggaga aagagagcca ggacaagact 3241 ctctccctcc ctgagctgga gcaacagcag gaacagcatc aggagcagca gcaggagcag 3301 gtgcagatgc tggccccttt ggagagctga gctgcccctg gtgcactggc cccaccctcg 3361 tggacacctg ccctgccctg ccacctgtct gtctgtctgt cccaaagaag ttctggtatg 3421 aacttgagga cacatgtcca gtgggaggtg agaccacctc tcaatattca ataaagctgc 3481 tgagaatcta gcctcaactg gttgccggat gaatcctcct tgcagctggg gaggtgggga 3541 ggtaaccatg actgggcaga gcttagcagc ggcctggcag gagacaccca ggattgggga 3601 gatgagactg cag // LOCUS HUMAPOC2 4057 bp DNA PRI 02-MAY-1996 DEFINITION Human apolipoprotein CII (APOC2) gene, complete cds. ACCESSION J02698 NID g178824 KEYWORDS Alu repeat; apolipoprotein CII; apolipoprotein C-II; lipoprotein; repeat region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4057) AUTHORS Das,H.K., Jackson,C.L., Miller,D.A., Leff,T. and Breslow,J.L. TITLE The human apolipoprotein C-II gene sequence contains a novel chromosome 19-specific minisatellite in its third intron JOURNAL J. Biol. Chem. 262 (10), 4787-4793 (1987) MEDLINE 87165892 COMMENT Draft entry and clean copy sequence for [1] kindly provided by H.K.Das, 26-JAN-1987. The third intron is composed almost entirely of a novel 37 bp minisatellite, which is repeated five and a partial times (positions 3396-3604). This minisatellite is found in about 60 locations, all on the long arm of chromosome 19, and seems to be highly conserved. Three polyadenylation signals are located at positions 3847-3852, 3994-3999 and 4015-4020, the latter two are located after the major mRNA end. FEATURES Location/Qualifiers source 1..4057 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CII." 
/cell_line="LE392" /tissue_lib="T.Maniatis" /map="19q13.2" /chromosome="19" exon 511..551 /gene="APOC2" /number=1 gene 511..3862 /gene="APOC2" exon 523..551 /gene="APOC2" /number=1 intron 552..2937 /gene="APOC2" /number=1 repeat_region 749..822 /note="Alu repeat copy A" repeat_region 904..982 /note="Alu repeat copy B" repeat_region 988..1657 /note="Alu repeat copy C" repeat_region 2117..2749 /note="Alu repeat copy D" exon 2938..3005 /gene="APOC2" /number=2 CDS join(2951..3005,3173..3332,3629..3668) /gene="APOC2" /codon_start=1 /product="apolipoprotein CII" /db_xref="PID:g178825" /translation="MGTRLLPALFLVLLVLGFEVQGTQQPQQDEMPSPTFLTQVKESL SSYWESAKTAAQNLYEKTYLPAVDEKLRDLYSKSTAAMST" sig_peptide join(2951..3005,3173..3183) /gene="APOC2" intron 3006..3172 /gene="APOC2" /number=2 exon 3173..3332 /gene="APOC2" /number=3 mat_peptide join(3184..3332,3629..3665) /gene="APOC2" /product="apolipoprotein CII" intron 3333..3628 /gene="APOC2" /number=3 exon 3629..3862 /gene="APOC2" /number=4 BASE COUNT 1036 a 1091 c 1085 g 845 t ORIGIN 208 bp upstream of HaeIII site; chromosome 19q12-q13.2. 
1 ccagaaaaaa acccacccca ctgcctccca ggaatcaatg agtagaagag gtgacacctg 61 atggggaagg aagagtaggg aggtcgggaa gggtatcaag gaataacacc ctattgtggg 121 cttgcggaga atgggggact tcaaggcgtg tcagtttcag gagggtgagg gcaggagcgt 181 gggtggagtc agcaggtccc catgatggcc ctcactgaga gcttcgccct tgtctcctac 241 aagctctgac tccattccca gtgggcaccc agcacctcca acccctccac agcccccaac 301 ccagcctctg tcggaggcga attctcagag tgagggttcc ctgtcacttg agagaaggtt 361 ccctgtgacg tgaccttggg ggacgtcatt gccctttctg tccccaccca ccccctccgc 421 agttctgttg gccaggactt tggcctagac aaaggatggg ggttgtggct gtggagcgga 481 agtgggtctc aaccactata aatcctctct gtgcccgtcc ggagctggtg aggacagcct 541 gccagagtct ggtaagaaag ggactcaggg tgcggggaca gggggcgtca ggggagaggg 601 caaagatcga taaagcagga attttaagag gcacaatatt agaagcccgt gttggaacca 661 tgactgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg agagagagag 721 agagggagat ggagtctcgc tatgtagccc aggctagact caaactcctg ggctcaagca 781 atcctcctgc ctcagcctcc ccagtagctg ggactacagg tgcaccacca cactccacaa 841 atcacagaat ttagaactgt agactatttg agcttctgct tagagttagg gtggctgagg 901 tggggaggat cccttgagcc caggagtttg aggatgcagt gagctgtgat cttgccaccg 961 tgttccagcc tgggtgacag agaaacccca tttctaaaaa agagaaagaa aaagggatag 1021 gtacaatggc tcatgcctgt aatcccagca ctctgggagg ccgaggcggg tggatcactt 1081 gaggtcagga gttcgacacc agccttacca gcatggtgaa accgcatcta tactaaaaat 1141 acaaaaattg gccgggtgtg gtagcatatg cctgtaatcc cagctattcc agaggctgag 1201 acaggagaat tgcttgaacc caggaagcgg aggttgcagt gagcccagat cgtgccactg 1261 tactctagcc tggtgacaga gcaagactca gtcttggcgg aaaaaaagaa tgaaaaaatt 1321 taaaaaacta aaaaagaact gtaggctggg gctggtggct tacacttgta atccaaacgc 1381 tttgggagcc aaggcaaacg gatcacttga tgtcaggagt tggagaccag cctggccaac 1441 atggtgaaac cccgtctcta ctaaaaatac aaaaattaga caggcatggt ggtgcatgct 1501 tgtatttcca gttactcagg aggctgaggc aggagaatcg ctcgaaaccc ggaagacaga 1561 ggttgcggtg agccaaaatt gcgccatcgc actccagcct gggcgagaga acaagacctt 1621 gtcttggaaa aaaaaaaaga attgtagacc atttgcttgt gttctttctc cggggatcag 1681 atctcaccct ctttctgccg 
tacttcctca tctcctacgt gtggatgatg atattgtgcc 1741 ctgtgcatgt tcttcgtcac caaaagtgcc tctctcatag agcaggtgag aactcagtga 1801 ggagatgcag ggacatgagg tctgacttag ggcagagccc taaggtaaca catttgatct 1861 actgtaggtc cttaatggtg tctgcagagc acctccctgc actgactcag ccttagcaaa 1921 gggcagaggc tttgctgtgt tccctgctgg gcccagaact gtttaggtgc tcaagaaagc 1981 cttctaggct gggctcagtg gctcacacct gtactcccag caccctgggg aggccgagat 2041 gggaggatcg cttgagccca ggagttccag accagcctgg gcaacaaaac aagtctccca 2101 tctctacaaa agaataaaaa ttagcagctg ggcatggtgg ctcatgcctg taattccagc 2161 actttgggag gccaaggcag gcaaatcact tgaggttagg agttcaagac cagcctggcc 2221 aacatggtga aaccccatct ctactaaaaa tacaaaaatc aggtggggca cagtgctcaa 2281 gcctgtaatc ctagcacttt gggaggccaa ggtgggcgga tcacgaggtc agaagttcga 2341 gaccagcctg gccagcatgg tgaaacccca tctctactaa aaatacaaaa tattagccgg 2401 gcatggtggc aggtgcttgt gattccagct ccttgggagg ctgaggcaga agaattgcta 2461 gaaccctgga ggcagaggtt gcagtgagcc gagaacacgc cactgcactc cagcctgggt 2521 gacagagccg agactccatc tcaaaaaata cgaaaacaaa aatcagccgg gtggtggcgg 2581 gtgcctgtaa tcccagctac tggggaggct gaggcaggag aattgcttga acctgggagg 2641 tgggggttgc agtgagccaa gattgcacca ctgcactcca gcctgggcaa cagagtgaga 2701 ttccatctca aaaagaaaaa aataataatt aaaatgttaa aatcaggagt agaatcacag 2761 aatgttggaa agtgaggccc aagaaggggg ctgtgtccaa gtccatgcat gggaaacttg 2821 actgggacac cgagctcaca cagagcagga tctcagtccc ccccaccaga gtgggcgtga 2881 ccacaggaac agccgcctcc agtcagcctg ccacatgaca ccccctcaat gttccaggtc 2941 tctggacact atgggcacac gactcctccc agctctgttt cttgtcctcc tggtattggg 3001 atttggtgag tgtgggcttc cggggaggga agccttgggg aggggaatga gctccaagca 3061 tcttcccagc ccaggccctt cttacctctg cctctgccct ctcctcttct tccttcctcc 3121 tttccccctg ctgcagcccc acgggctctc ctgacacact ctccccctgc agaggtccag 3181 gggacccaac agccccagca agatgagatg cctagcccga ccttcctcac ccaggtgaag 3241 gaatctctct ccagttactg ggagtcagca aagacagccg cccagaacct gtacgagaag 3301 acatacctgc ccgctgtaga tgagaaactc aggtagcacc tgcccctgga gaaatggggt 3361 ctggcccata ccaccgactg catccaggac 
ccagaagttc aggccccagc cctcctccct 3421 cagacccagg agtccaggcc tcagccctcc tccctcagac ccaggagtcc aggcccccag 3481 cccgtcctcc ctcagaccca ggagtccagg cccccagccc ctcctccctc agacccagga 3541 gtccaggtcc ccagaccctc ctccctcaga cccaggagtc caggccccca gcccctcctc 3601 cctctaacca tctgtgcttt ctccccaggg acttgtacag caaaagcaca gcagccatga 3661 gcacttagac aggcattttt actgaccaag ttctttctgt gctgaaggga gaggagtaac 3721 agccagaccc cccatcagtg gacaagggga gagtccccta ctcccctgat cccccaggtt 3781 cagactgagc tcccccttcc cagtagctct tgcatcctcc tcccaactct agcctgaatt 3841 cttttcaata aaaaatacaa ttcaagttgc ttctcatgga tggcactgct tttctgagga 3901 ctcaagggcc aagatggagg ggctgactca gtccagccaa catttaatga gcacctactt 3961 tatgtatgga gctctaaccc atgggtccat gggaataaag cagtgaatag taacaataaa 4021 taatcgtaac agcaattaga gactaatctt tattgaa // LOCUS HUMAPOCIA 5375 bp DNA PRI 13-FEB-1996 DEFINITION Human apolipoprotein C-I (VLDL) gene, complete cds. ACCESSION M20902 J03217 NID g178830 KEYWORDS apolipoprotein C-I; very low density lipoprotein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5375) AUTHORS Lauer,S.J., Walker,D., Elshourbagy,N.A., Reardon,C.A., Levy-Wilson,B. and Taylor,J.M. TITLE Two copies of the human apolipoprotein C-I gene are linked closely to the apolipoprotein E gene JOURNAL J. Biol. Chem. 
263 (15), 7277-7286 (1988) MEDLINE 88213410 FEATURES Location/Qualifiers source 1..5375 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.2" TATA_signal 466..472 prim_transcript 493..5145 /note="apoC-I mRNA and introns" intron 526..663 /note="apoC-I intron A" gene 684..741 /gene="APOC1" CDS join(684..741,1986..2121,4976..5033) /note="apolipoprotein C-I" /codon_start=1 /db_xref="PID:g178831" /translation="MRLFLSLPVLVVVLSIVLEGPAPAQGTPDVSSALDKLKEFGNTL EDKARELISRIKQSELSAKMREWFSETFQKVKEKLKIDS" exon <684..741 /gene="APOC1" /note="apolipoprotein C-I , first expressed exon; G00-119-687" /number=2 intron 742..1985 /note="apoC-I intron B" repeat_region 1095..1403 /note="Alu repeat copy A" repeat_region 1095..1109 /note="Alu copy A 5' direct repeat" repeat_region 1391..1403 /note="Alu copy A 3' direct repeat" repeat_region 1467..1477 /note="Alu copy B 5' direct repeat" repeat_region 1467..1960 /note="Alu repeat copy B" repeat_region 1951..1960 /note="Alu copy B 3' direct repeat" exon 1986..2121 /number=3 intron 2122..4975 /note="apoC-I intron C" repeat_region 2320..2647 /note="Alu repeat copy C" repeat_region 2320..2331 /note="Alu copy C 5' direct repeat" repeat_region 2636..2794 /note="Alu repeat copy D (partial)" repeat_region 2636..2647 /note="Alu copy D 5' direct repeat" repeat_region 2636..2647 /note="Alu copy C 3' direct repeat" repeat_region 2783..2794 /note="Alu copy D 3' direct repeat" repeat_region 3100..3402 /note="Alu repeat copy E" repeat_region 3100..3108 /note="Alu copy E 5' direct repeat" repeat_region 3394..3402 /note="Alu copy E 3' direct repeat" repeat_region 3494..3793 /note="Alu repeat copy F" repeat_region 3798..4081 /note="Alu repeat copy G" repeat_region 4126..4776 /note="Alu repeat copy H" repeat_region 4126..4140 /note="Alu copy H 5' direct repeat" repeat_region 4762..4776 /note="Alu copy H 3' direct repeat" exon 4976..>5033 /note="apolipoprotein C-I" /number=4 polyA_signal 5130..5135 BASE COUNT 1363 a 1342 c 1509 g 1161 t ORIGIN 5 bp 
upstream of SphI site; chromosome 19q12-q13.2. 1 gcatgcagcc cccagtcacg catcccctgc ttgttcaatc gatcacgacc ctctcacgtg 61 cacccactta gagttgtgag cccttaaaag gaacagggat tgctcactcg gggagctcgg 121 ctcttgagac aggaatcttg cccattcccc gaacgaataa accccttcct tcgttaactc 181 agcgtctgag gaattttgtc tgcggctcct cctgctacat tctgagtggg ggaaagggac 241 taaggtggtc tgaggacccc acagagtcag gaagattgag aggtgagagt gctgaacggg 301 gaggggcttt ggggctaagg gaagtgcccg ggaccccacc tgaccccaac gctcacggga 361 caggggcaga ggagaaaaac gtgggtggac agagggaggc aggcggtcag gggaaggctc 421 aggaggaggg agatcaacat caacctgccc cgcccctccc cagcctgata aaggtcctgc 481 gggcaggaca ggacctccca accaagccct ccagcaagga ttcaggttgg tgctgagtgc 541 ctgggaggga cacccgccta cactctgcaa gaaactcaaa aagggagatg aggggatcgt 601 gggagggagg tagggaggga ggagggtgcc actgatcccc tgaacccctg cctctgcctc 661 cagagtgccc ctccggcctc gccatgaggc tcttcctgtc gctcccggtc ctggtggtgg 721 ttctgtcgat cgtcttggaa ggtaaaagtg ggatgggaga attggggagt ttggagattt 781 ggaagagtga aggtggctac aggcctgggg tcccggctta gaggacctct gagagctccg 841 gggccccttc tgggtcgtgg ttgcctcatc gtggtcgggt gggtctccag gttctcccag 901 gctcagtccc gcaggcgcca aatctgcgca ggagagcact agcaaccgat gacgtattga 961 ggcccacacc tctgggattg gctgtcctgc ttcgacagcc ttgaaagtgg gtaagctggg 1021 tggggggctc tgggagaggt cagtgctgag taaggcaatt cccagcagct tgagccccac 1081 caggtcactc cagtattcct ccccattctt tttttttttt tttttttttc tcttgagacg 1141 gagtctcgct ctgtcgccga ggctggagtg cagtggcgcg atctcggctc actgcaagct 1201 ccgcctccct ggttcacgcc attctcctgc ctcagcagga ctacaggcgc ccgcctccgc 1261 gcccggctaa ttttttgtat tttcagtaga gacagggttt caccgtggtc tcgatctcct 1321 gactttgtga tccgcctgcc tcgacctccc aaagtgctgg gattacaggc gtgagccacc 1381 gcgtccggcc attcctcccc attctaacca catgatcccc aaggatctct atccatcccg 1441 gtatcccaac ctaagggggt tccaataaca aatttttggc cgggcagggt ggctcatgcc 1501 tgtaatccca gcactttggg aggccgaggc gggcagatca cttgaggtca ggagttcaaa 1561 ccagccaggc caacatggtg gaaacttcgt ctctagctaa aaatacaaaa aaattaggcc 1621 aggtgtggag gcacgcgcct gtaggcccag ctactcggga 
ggctgaggca ggagaatcac 1681 ttgaacccgg gaggcggagg ttgcagtgag ccgagatcat accactgcac tccagcctgg 1741 ctgacacagc aagactccgt ctcaaaacaa aacaaaacaa aaataggctg ggtgtggtgg 1801 tgcacacctg taatcccagc tacttgggag gctgaggcag gagaactgct tgaacccggg 1861 aggtggtggt tgcagtaggc cgagatcatg ccactgcact ccagcttggg ctacagagca 1921 agactccatc tccaaaaaaa aaaaaaaaaa aacaaatttt gaacccctgc ccatcttcct 1981 ggcaggccca gccccagccc aggggacccc agacgtctcc agtgccttgg ataagctgaa 2041 ggagtttgga aacacactgg aggacaaggc tcgggaactc atcagccgca tcaaacagag 2101 tgaactttct gccaagatgc ggttagaacc cttcccaggg cacgggagag ctggggtgtg 2161 tttttgggtg gagccctggc agatggtcca agatgaacag attgaaaaaa aaacaagtcc 2221 tggagaggct gacaacatcc ctctggtcac acagctagat ctcaaggtgc tcagacttca 2281 aggacagttt ccctgactcc catccaggcc atattttaaa agatggtctt gggctgggca 2341 cggtggctca tgcttgcaat cccagcactt agggaggccg aggtgggctg attgcctgag 2401 gtcaggagtt cgagaccagt ctgaccaaca tcggtgaaat cttagtctct actaaaaata 2461 caaaaaaatt acggcaggca tggtggcgtg cactgtaatc ccagctagtc gggaggctga 2521 ggcaggggaa ttgcttgaac caggaaggtg ggagttacag tgagccaaca ttgtgccagc 2581 ctgggtgaca gaaggagact ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaacaagat 2641 ggtcttgccc aggaatggtg gctaacacct gtaattccag cattatggga ggctgagatg 2701 ggaggattgc ttgagcccag gagttcgaga ccagcctgac caacatggcg agatcctgtc 2761 tccatttaaa aaaaaaaaaa aaaagatggt tttgtgaggt aatgaaaatg aaggccccaa 2821 gcttggccag acctgggtcc ccaggctgga gtagcacccc ttcctgtgtg atcttgacag 2881 aggggcatta ctgtgagcct cagtttcctc tcctataaac tggtggttct acagggaagt 2941 aaaggagcag gcctacaggg tgtctggtac atgtagatgc tcagtatatc attaaaccac 3001 cttccccttt gcaagttaga gagtcatttg ttctttaaaa aatattttac tgagcatctg 3061 ctaagtgctg gaaaactctt tcaatgtggg gaataaaaca gtgaagaact gccgagcacg 3121 gtggctcaca cctgtaaccc caccactttg gaaggccgag gtgggtggat cacttgaggt 3181 caggagtgcg agaaccccgt ccctaataga aatgcaaaaa aaattagctg ggcatggtgg 3241 cccatgcctg tagtcccagc tccttgggag ggctgaggcg gagaggattg cttgagccca 3301 ggagatctag gctgcagtgc gccatgtttg tgccactgca ttccagcctg 
ggtaacagaa 3361 tgagaccctg tctcaacaaa aaaagaaaag aaaagagaag aaaagagaaa agaaagacag 3421 ggagggaggg aggaaggaag ggagggaggg agggaaaata gagccaggca taaacttaga 3481 aagatcgttt ggaggccagg cacaatggct cacacctgta atcccagcac tttgggaggc 3541 caaggcaagc agatcaactg aggtcaggag ttcgagacca gcctaacatg gagaaaaccc 3601 ctgtctctac taaaaaaaaa tacaaaaaaa ttagccgggg cgtggtgcat tcctgtagtc 3661 ctagctactc gggagcctga ggcaggagaa tcacttgaac ccgggaggcg gaggttgcag 3721 tgagccgaga tcatgccact gcactccagc ctgggcgaca aggcgagact ccatgccaaa 3781 aaagaaaaaa aactcctggc gcggtggctc acgccagtaa tcccagcact gtgggaggct 3841 gagcaggcgg atcacgaggt caggagttcg agactagcct gctcaacata atgaaaccct 3901 ctctgtacta aaaatacaaa aattagctgg gtgtggtggc aggcacctgt agtcccagct 3961 actcgggagg ctgaggcagg agaatggctt gaacctggga ggcagaggtt gcagtgagcc 4021 gagacagtgc cattgcactc cagtccaggt gacagagcga aactccatct caaaaaaaaa 4081 aggaaggcat tggtagcaag agatggcagg ccttgaaagc caggccaggg tgaagtgttt 4141 cttttttttt tttttttttt ttctttttaa attttttttt ttgagacgga gtctcgctct 4201 gtcacccagg ctggattgca gtggcctgat ctcggctcac tgcaagttcc gcctcccggg 4261 ttcatgccat tctcctgcct caccctcccg agtagctggg actacaggca cctgccacca 4321 ggccagctaa ttttttgtat tcttagtaga atgtagaatt tacttagtag aattttttgt 4381 attcttagcc agcatggtct cgatctcctg acctgggtga tccacccgcc tcggcctccc 4441 aaagtgctgg gattacaggc gtgagccacg gcgcccggcc ttattttttc tttttgagat 4501 gtacccagac tggagtacag tggtgcgatc tcggcttact ggaacctcca cctcccgggt 4561 tcaggcaatt ctcctgcctc agcctcatga gtacttggaa ctacaggtgt gtgacaccac 4621 acatggtatt ttttgtattt ttagtgaaga tgacatttca ccatgttgcc caggttggtc 4681 tcgaactcct gacctcaagt gatcagccta cctcggcctc ccaaagtgtt gggattacag 4741 gcgtgagcca aatgcccagc caagggtaaa gtgtttagac ttaaagtgct ttggtccatc 4801 tgggaaactg aggcagagaa gttggcccac ccagcccagc ggtcctccta atcccacaga 4861 cagtggggat ggagattctg caaggggaag aggtgggagt caggtagcag gcagaatttg 4921 gacagcctgg gaggtagctg cacacagtga cccccttcct tattcctccc cacagggagt 4981 ggttttcaga gacatttcag aaagtgaagg agaaactcaa gattgactca tgaggacctg 
5041 aagggtgaca tcccaggagg ggcctctgaa atttcccaca ccccagcgcc tgtgctgagg 5101 actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaat tctctcctga 5161 gtgcttcttt actctgggga aggggctgcg ggagagggta ggggcttcca gagagggcag 5221 ggtctgcagc cactgtggaa aaacagtatg gggtttcctc aaaacattaa agatagaact 5281 ctcaaatgat ccttcaatcc cacttctggg tatttattca aaagaattga aatcaggacc 5341 ttgaagagat acctgccctc ccatgttcac tgcag // LOCUS HUMAPOE4 5515 bp DNA PRI 09-NOV-1994 DEFINITION Human apolipoprotein E (epsilon-4 allele) gene, complete cds. ACCESSION M10065 J03053 J03054 NID g178852 KEYWORDS Alu repeat; allelic variation; apolipoprotein; apolipoprotein E; lipoprotein; repeat region; very low density lipoprotein. SOURCE Human DNA [2], [1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5515) AUTHORS Das,H.K., McPherson,J., Bruns,G.A., Karathanasis,S.K. and Breslow,J.L. TITLE Isolation, characterization, and mapping to chromosome 19 of the human apolipoprotein E gene JOURNAL J. Biol. Chem. 260 (10), 6240-6247 (1985) MEDLINE 85207610 REFERENCE 2 (bases 196 to 5269) AUTHORS Paik,Y.K., Chang,D.J., Reardon,C.A., Davies,G.E., Mahley,R.W. and Taylor,J.M. TITLE Nucleotide sequence and structure of the human apolipoprotein E gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (10), 3445-3449 (1985) MEDLINE 85216517 REFERENCE 3 (bases 1 to 5515) AUTHORS Emi,M., Wu,L.L., Robertson,M.A., Myers,R.L., Hegele,R.A., Williams,R.R., White,R. and Lalouel,J.M. TITLE Genotyping and sequence analysis of apolipoprotein E isoforms JOURNAL Genomics 3 (4), 373-379 (1988) MEDLINE 89212602 COMMENT [3] two allelic variations. Draft entry and computer-readable sequence for [3] kindly provided by M.Emi, 19-AUG-1988. Apolipoprotein E is a constituent of the human very low density lipoprotein in the plasma. 
There are at least six distinct phenotypes derived from the single E gene on chromosome 19; next to the epsilon-3 allele (see separate entry), the epsilon-4 allele, represented by the sequence below, is most common, the product difference being arginine in place of cysteine at residue 112 [2]. The gene structure of apo E is similar to that of other apo genes: presence of the 66-bp repeats in the fourth exon (starting at base 3782 below) makes the E gene highly similar to the A-I gene (see separate entry) as argued by [1]. A potential TATA box is found at positions 1014-1018, and a potential polyadenylation signal at 4616-4621. [2] and [1] had slight differences in the boundary positions for the Alu repeats and their flanks; the boundary positions indicated in [1] have been used in the FEATURES table below. Draft entries and clean copies were kindly supplied by J.M. Taylor, Gladstone Laboratories, San Francisco, and by J.P. Levine, Rockefeller University, New York. FEATURES Location/Qualifiers source 1..5515 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.2" conflict 202..203 /citation=[2] /replace="" repeat_region complement(300..345) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(346..635) /note="Alu repeat [1]" repeat_region complement(636..680) /note="direct repeat flanking Alu repeat 5' copy [1]" conflict 963..967 /citation=[2] /replace="" exon 1047..1090 /note="apo E mRNA [2],[1]" /number=1 intron 1091..1847 /note="apo E mRNA intron 1 [2],[1]" conflict 1362..1363 /citation=[2] /replace="" conflict 1789..1793 /citation=[2] /replace="" sig_peptide join(1871..1913,3007..3017) exon 1871..1913 /partial /gene="APOE" /note="preapolipoprotein E; G00-119-691" /number=2 gene 1871..1913 /gene="APOE" CDS join(1871..1913,3007..3199,3781..4498) /note="precursor" /codon_start=1 /product="apolipoprotein E" /db_xref="PID:g178853" /translation="MKVLWAALLVTFLAGCQAKVEQAVETEPEPELRQQTEWQSGQRW 
ELALGRFWDYLRWVQTLSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQLTP VAEETRARLSKELQAAQARLGADMEDVRGRLVQYRGEVQAMLGQSTEELRVRLASHLR KLRKRLLRDADDLQKRLAVYQAGAREGAERGLSAIRERLGPLVEQGRVRAATVGSLAG QPLQERAQAWGERLRARMEEMGSRTRDRLDEVKEQVAEVRAKLEEQAQQIRLQAEAFQ ARLKSWFEPLVEDMQRQWAGLVEKVQAAVGTSAAPVPSDNH" intron 1914..3006 /note="apo E cds intron 2 [2],[1]" repeat_region complement(2092..2104) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(2105..2429) /note="Alu repeat [1]" repeat_region complement(2430..2442) /note="direct repeat flanking Alu repeat 5' copy [1]" repeat_region 2520..2526 /note="direct repeat flanking Alu repeat 5' copy [1]" repeat_region 2527..2886 /note="Alu repeat [1]" conflict 2568..2569 /citation=[2] /replace="" repeat_region 2887..2893 /note="direct repeat flanking Alu repeat 3' copy [1]" conflict 2947..2948 /citation=[2] /replace="" conflict 2983..2984 /citation=[2] /replace="" exon 3007..3199 /number=2 mat_peptide join(3018..3199,3781..4495) /product="apolipoprotein E" allele 3182 /note="g in epsilon-4; a in epsilon-3 [2]" allele 3191 /note="g in epsilon-4; a in epsilon-3 [2]" intron 3200..3780 /note="apo E cds intron 3 [2],[1]" conflict 3568..3569 /citation=[2] /replace="" exon 3781..4640 /partial /note="preapolipoprotein E" /number=3 allele 3817..3819 /note="cta in [2] and 1 allele [3]; agc in other allele [3]" allele 3844..3846 /note="gac in [2] and 1 allele [3]; tgt in other allele [3]" allele 3932 /note="c in epsilon-4; t in epsilon-3 [2] (arg; cys)" allele 4342 /note="g in epsilon-4; a in epsilon-3 [2]" conflict 4708..4709 /citation=[2] /replace="" repeat_region complement(4761..4768) /note="direct repeat flanking Alu repeat 3' copy [1]" repeat_region complement(4769..5048) /note="Alu repeat [1]" repeat_region complement(5049..5056) /note="direct repeat flanking Alu repeat 5' copy [1]" BASE COUNT 1042 a 1667 c 1600 g 1206 t ORIGIN 201 bp upstream of BanII site on chromosome 19q12-q13.2. 
1 ggaacttgat gctcagagag gacaagtcat ttgcccaagg tcacacagct ggcaactggc 61 agacgagatt cacgccctgg caatttgact ccagaatcct aaccttaacc cagaagcacg 121 gcttcaagcc ctggaaacca caatacctgt ggcagccagg gggaggtgct ggaatctcat 181 ttcacatgtg gggagggggc tcctgtgctc aaggtcacaa ccaaagagga agctgtgatt 241 aaaacccagg tcccatttgc aaagcctcga cttttagcag gtgcatcata ctgttcccac 301 ccctcccatc ccacttctgt ccagccgcct agccccactt tctttttttt ctttttttga 361 gacagtctcc ctcttgctga ggctggagtg cagtggcgag atctcggctc actgtaacct 421 ccgcctcccg ggttcaagcg attctcctgc ctcagcctcc caagtagcta ggattacagg 481 cgcccgccac cacgcctggc taacttttgt atttttagta gagatggggt ttcaccatgt 541 tggccaggct ggtctcaaac tcctgacctt aagtgattcg cccactgtgg cctcccaaag 601 tgctgggatt acaggcgtga gctaccgccc ccagcccctc ccatcccact tctgtccagc 661 cccctagccc tactttcttt ctgggatcca ggagtccaga tccccagccc cctctccaga 721 ttacattcat ccaggcacag gaaaggacag ggtcaggaaa ggaggactct gggcggcagc 781 ctccacattc cccttccacg cttggccccc agaatggagg agggtgtctg tattactggg 841 cgaggtgtcc tcccttcctg gggactgtgg ggggtggtca aaagacctct atgccccacc 901 tccttcctcc ctctgccctg ctgtgcctgg ggcaggggga gaacagccca cctcgtgact 961 gggctgccca gcccgcccta tccctggggg agggggcggg acagggggag ccctataatt 1021 ggacaagtct gggatccttg agtcctactc agccccagcg gaggtgaagg acgtccttcc 1081 ccaggagccg gtgagaagcg cagtcggggg cacggggatg agctcagggg cctctagaaa 1141 gagctgggac cctgggaagc cctggcctcc aggtagtctc aggagagcta ctcggggtcg 1201 ggcttgggga gaggaggagc gggggtgagg caagcagcag gggactggac ctgggaaggg 1261 ctgggcagca gagacgaccc gacccgctag aaggtggggt ggggagagca gctggactgg 1321 gatgtaagcc atagcaggac tccacgagtt gtcactatca ttatcgagca cctactgggt 1381 gtccccagtg tcctcagatc tccataactg gggagccagg ggcagcgaca cggtagctag 1441 ccgtcgattg gagaacttta aaatgaggac tgaattagct cataaatgga acacggcgct 1501 taactgtgag gttggagctt agaatgtgaa gggagaatga ggaatgcgag actgggactg 1561 agatggaacc ggcggtgggg agggggtggg gggatggaat ttgaaccccg ggagaggaag 1621 atggaatttt ctatggaggc cgacctgggg atggggagat aagagaagac caggagggag 1681 ttaaataggg aatgggttgg 
gggcggcttg gtaaatgtgc tgggattagg ctgttgcaga 1741 taatgcaaca aggcttggaa ggctaacctg gggtgaggcc gggttggggg cgctgggggt 1801 gggaggagtc ctcactggcg gttgattgac agtttctcct tccccagact ggccaatcac 1861 aggcaggaag atgaaggttc tgtgggctgc gttgctggtc acattcctgg caggtatggg 1921 ggcggggctt gctcggttcc ccccgctcct ccccctctca tcctcacctc aacctcctgg 1981 ccccattcag acagaccctg ggccccctct tctgaggctt ctgtgctgct tcctggctct 2041 gaacagcgat ttgacgctct ctgggcctcg gtttccccca tccttgagat aggagttaga 2101 agttgttttg ttgttgttgt ttgttgttgt tgttttgttt ttttgagatg aagtctcgct 2161 ctgtcgccca ggctggagtg cagtggcggg atctcggctc actgcaagct ccgcctccca 2221 ggtccacgcc attctcctgc ctcagcctcc caagtagctg ggactacagg cacatgccac 2281 cacacccgac taactttttt gtattttcag tagagacggg gtttcaccat gttggccagg 2341 ctggtctgga actcctgacc tcaggtgatc tgcccgtttc gatctcccaa agtgctggga 2401 ttacaggcgt gagccaccgc acctggctgg gagttagagg tttctaatgc attgcaggca 2461 gatagtgaat accagacacg gggcagctgt gatctttatt ctccatcacc cccacacagc 2521 cctgcctggg gcacacaagg acactcaata catgcttttc cgctgggccg gtggctcacc 2581 cctgtaatcc cagcactttg ggaggccaag gtgggaggat cacttgagcc caggagttca 2641 acaccagcct gggcaacata gtgagaccct gtctctacta aaaatacaaa aattagccag 2701 gcatggtgcc acacacctgt gctctcagct actcaggagg ctgaggcagg aggatcgctt 2761 gagcccagaa ggtcaaggtt gcagtgaacc atgttcaggc cgctgcactc cagcctgggt 2821 gacagagcaa gaccctgttt ataaatacat aatgctttcc aagtgattaa accgactccc 2881 ccctcaccct gcccaccatg gctccaaaga agcatttgtg gagcaccttc tgtgtgcccc 2941 taggtagcta gatgcctgga cggggtcaga aggaccctga cccgaccttg aacttgttcc 3001 acacaggatg ccaggccaag gtggagcaag cggtggagac agagccggag cccgagctgc 3061 gccagcagac cgagtggcag agcggccagc gctgggaact ggcactgggt cgcttttggg 3121 attacctgcg ctgggtgcag acactgtctg agcaggtgca ggaggagctg ctcagctccc 3181 aggtcaccca ggaactgagg tgagtgtccc catcctggcc cttgaccctc ctggtgggcg 3241 gctatacctc cccaggtcca ggtttcattc tgcccctgtc gctaagtctt ggggggcctg 3301 ggtctctgct ggttctagct tcctcttccc atttctgact cctggcttta gctctctgga 3361 attctctctc tcagctttgt ctctctctct 
tcccttctga ctcagtctct cacactcgtc 3421 ctggctctgt ctctgtcctt ccctagctct tttatataga gacagagaga tggggtctca 3481 ctgtgttgcc caggctggtc ttgaacttct gggctcaagc gatcctcccg cctcggcctc 3541 ccaaagtgct gggattagag gcatgagcac cttgcccggc ctcctagctc cttcttcgtc 3601 tctgcctctg ccctctgcat ctgctctctg catctgtctc tgtctccttc tctcggcctc 3661 tgccccgttc cttctctccc tcttgggtct ctctggctca tccccatctc gcccgcccca 3721 tcccagccct tctcccccgc ctccccactg tgcgacaccc tcccgccctc tcggccgcag 3781 ggcgctgatg gacgagacca tgaaggagtt gaaggcctac aaatcggaac tggaggaaca 3841 actgaccccg gtggcggagg agacgcgggc acggctgtcc aaggagctgc aggcggcgca 3901 ggcccggctg ggcgcggaca tggaggacgt gcgcggccgc ctggtgcagt accgcggcga 3961 ggtgcaggcc atgctcggcc agagcaccga ggagctgcgg gtgcgcctcg cctcccacct 4021 gcgcaagctg cgtaagcggc tcctccgcga tgccgatgac ctgcagaagc gcctggcagt 4081 gtaccaggcc ggggcccgcg agggcgccga gcgcggcctc agcgccatcc gcgagcgcct 4141 ggggcccctg gtggaacagg gccgcgtgcg ggccgccact gtgggctccc tggccggcca 4201 gccgctacag gagcgggccc aggcctgggg cgagcggctg cgcgcgcgga tggaggagat 4261 gggcagccgg acccgcgacc gcctggacga ggtgaaggag caggtggcgg aggtgcgcgc 4321 caagctggag gagcaggccc agcagatacg cctgcaggcc gaggccttcc aggcccgcct 4381 caagagctgg ttcgagcccc tggtggaaga catgcagcgc cagtgggccg ggctggtgga 4441 gaaggtgcag gctgccgtgg gcaccagcgc cgcccctgtg cccagcgaca atcactgaac 4501 gccgaagcct gcagccatgc gaccccacgc caccccgtgc ctcctgcctc cgcgcagcct 4561 gcagcgggag accctgtccc cgccccagcc gtcctcctgg ggtggaccct agtttaataa 4621 agattcacca agtttcacgc atctgctggc ctccccctgt gatttcctct aagccccagc 4681 ctcagtttct ctttctgccc acatactgcc acacaattct cagccccctc ctctccatct 4741 gtgtctgtgt gtatctttct ctctgccctt tttttttttt tagacggagt ctggctctgt 4801 cacccaggct agagtgcagt ggcacgatct tggctcactg caacctctgc ctcttgggtt 4861 caagcgattc tgctgcctca gtagctggga ttacaggctc acaccaccac acccggctaa 4921 tttttgtatt tttagtagag acgagctttc accatgttgg ccaggcaggt ctcaaactcc 4981 tgaccaagtg atccacccgc cggcctccca aagtgctgag attacaggcc tgagccacca 5041 tgcccggcct ctgcccctct ttctttttta gggggcaggg 
aaaggtctca ccctgtcacc 5101 cgccatcaca gctcactgca gcctccacct cctggactca agtgataagt gatcctcccg 5161 cctcagcctt tccagtagct gagactacag gcgcatacca ctaggattaa tttggggggg 5221 ggtggtgtgt gtggagatgg ggtctggctt tgttggccag gctgatgtgg aattcctggg 5281 ctcaagcgat actcccacct tggcctcctg agtagctgag actactggct agcaccacca 5341 cacccagctt tttattatta tttgtagaga caaggtctca atatgttgcc caggctagtc 5401 tcaaacccct ggctcaagag atcctccgcc atcggcctcc caaagtgctg ggattccagg 5461 catgggctcc gagcggcctg cccaacttaa taatattgtt cctagagttg cactc // LOCUS HUMATP1A2 26668 bp DNA PRI 31-OCT-1994 DEFINITION Human Na,K-ATPase subunit alpha 2 (ATP1A2) gene, complete cds. ACCESSION J05096 NID g179164 KEYWORDS ATPase; Alu repeat; Na,K-ATPase. SOURCE Human leucocyte DNA, clones CL[6-2,23-1,30-2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 26668) AUTHORS Shull,M.M., Pugh,D.G. and Lingrel,J.B. TITLE Characterization of the human Na,K-ATPase alpha 2 gene and identification of intragenic restriction fragment length polymorphisms JOURNAL J. Biol. Chem. 264 (29), 17532-17543 (1989) MEDLINE 90008924 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by J.B.Lingrel, 18-AUG-1989. 
FEATURES Location/Qualifiers source 1..26668 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leucocyte" /map="1q21-q23" mRNA join(1577..1692,6713..6817,6999..7058,9020..9223, 9750..9863,10103..10237,10943..11060,12480..12748, 13580..13778,13908..14017,14194..14328,15030..15219, 15350..15525,19411..19547,20072..20222,20361..20529, 20766..20920,21174..21297,21497..21642,21828..21958, 22647..22748,22900..22991,24296..26668) /gene="ATP1A2" /note="G00-119-712" /product="Na,K-ATPase subunit alpha 2" exon 1577..1692 /partial /gene="ATP1A2" /note="G00-119-712" /number=1 gene join(1577..1692,6713..6817,6999..7058,9020..9223, 9750..9863,10103..10237,10943..11060,12480..12748, 13580..13778,13908..14017,14194..14328,15030..15219, 15350..15525,19411..19547,20072..20222,20361..20529, 20766..20920,21174..21297,21497..21642,21828..21958, 22647..22748,22900..22991,24296..26668) /gene="ATP1A2" CDS join(1681..1692,6713..6817,6999..7058,9020..9223, 9750..9863,10103..10237,10943..11060,12480..12748, 13580..13778,13908..14017,14194..14328,15030..15219, 15350..15525,19411..19547,20072..20222,20361..20529, 20766..20920,21174..21297,21497..21642,21828..21958, 22647..22748,22900..22991,24296..24324) /gene="ATP1A2" /codon_start=1 /db_xref="GDB:G00-119-712" /product="Na,K-ATPase subunit alpha 2" /db_xref="PID:g179165" /translation="MGRGAGREYSPAATTAENGGGKKKQKEKELDELKKEVAMDDHKL SLDELGRKYQVDLSKGLTNQRAQDVLARDGPNALTPPPTTPEWVKFCRQLFGGFSILL WIGAILCFLAYGIQAAMEDEPSNDNLYLGVVLAAVVIVTGCFSYYQEAKSSKIMDSFK NMVPQQALVIREGEKMQINAEEVVVGDLVEVKGGDRVPADLRIISSHGCKVDNSSLTG ESEPQTRSPEFTHENPLETRNICFFSTNCVEGTARGIVIATGDRTVMGRIATLASGLE VGRTPIAMEIEHFIQLITGVAVFLGVSFFVLSLILGYSWLEAVIFLIGIIVANVPEGL LATVTVCLTLTAKRMARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFD NQIHEADTTEDQSGATFDKRSPTWTALSRIAGLCNRAVFKAGQENISVSKRDTAGDAS ESALLKCIELSCGSVRKMRDRNPKVAEIPFNSTNKYQLSIHEREDSPQSHVLVMKGAP ERILDRCSTILVQGKEIPLDKEMQDAFQNAYMELGGLGERVLGFCQLNLPSGKFPRGF KFDTDELNFPTEKLCFVGLMSMIDPPRAAVPDAVGKCRSAGIKVIMVTGDHPITAKAI 
AKGVGIISEGNETVEDIAARLNIPMSQVNPREAKACVVHGSDLKDMTSEQLDEILKNH TEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKADIGIAMGISGSDVSK QAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEITPFLLFIIANIPLP LGTVTILCIDLGTDMVPAISLAYEAAESDIMKRQPRNSQTDKLVNERLISMAYGQIGM IQALGGFFTYFVILAENGFLPSRLLGIRLDWDDRTMNDLEDSYGQEWTYEQRKVVEFT CHTAFFASIVVVQWADLIICKTRRNSVFQQGMKNKILIFGLLEETALAAFLSYCPGMG VALRMYPLKVTWWFCAFPYSLLIFIYDEVRKLILRRYPGGWVEKETYY" intron 1693..6712 /gene="ATP1A2" /note="G00-119-712" /number=1 repeat_region 5542..5845 /rpt_family="Alu repeat" exon 6713..6817 /gene="ATP1A2" /note="G00-119-712" /number=2 intron 6818..6998 /gene="ATP1A2" /note="G00-119-712" /number=2 exon 6999..7058 /gene="ATP1A2" /note="G00-119-712" /number=3 intron 7059..9019 /gene="ATP1A2" /note="G00-119-712" /number=3 repeat_region 7657..7939 /rpt_family="Alu repeat" exon 9020..9223 /gene="ATP1A2" /note="G00-119-712" /number=4 intron 9224..9749 /gene="ATP1A2" /note="G00-119-712" /number=4 exon 9750..9863 /gene="ATP1A2" /note="G00-119-712" /number=5 intron 9864..10102 /gene="ATP1A2" /note="G00-119-712" /number=5 exon 10103..10237 /gene="ATP1A2" /note="G00-119-712" /number=6 intron 10238..10942 /gene="ATP1A2" /note="G00-119-712" /number=6 exon 10943..11060 /gene="ATP1A2" /note="G00-119-712" /number=7 intron 11061..12479 /gene="ATP1A2" /note="G00-119-712" /number=7 exon 12480..12748 /gene="ATP1A2" /note="G00-119-712" /number=8 intron 12749..13579 /gene="ATP1A2" /note="G00-119-712" /number=8 exon 13580..13778 /gene="ATP1A2" /note="G00-119-712" /number=9 intron 13779..13907 /gene="ATP1A2" /note="G00-119-712" /number=9 exon 13908..14017 /gene="ATP1A2" /note="G00-119-712" /number=10 intron 14018..14193 /gene="ATP1A2" /note="G00-119-712" /number=10 exon 14194..14328 /gene="ATP1A2" /note="G00-119-712" /number=11 intron 14329..15029 /gene="ATP1A2" /note="G00-119-712" /number=11 exon 15030..15219 /gene="ATP1A2" /note="G00-119-712" /number=12 intron 15220..15349 /gene="ATP1A2" /note="G00-119-712" /number=12 exon 15350..15525 /gene="ATP1A2" 
/note="G00-119-712" /number=13 intron 15526..19410 /gene="ATP1A2" /note="G00-119-712" /number=13 repeat_region 16470..16773 /rpt_family="Alu repeat" exon 19411..19547 /gene="ATP1A2" /note="G00-119-712" /number=14 intron 19548..20071 /gene="ATP1A2" /note="G00-119-712" /number=14 exon 20072..20222 /gene="ATP1A2" /note="G00-119-712" /number=15 intron 20223..20360 /gene="ATP1A2" /note="G00-119-712" /number=15 exon 20361..20529 /gene="ATP1A2" /note="G00-119-712" /number=16 intron 20530..20765 /gene="ATP1A2" /note="G00-119-712" /number=16 exon 20766..20920 /gene="ATP1A2" /note="G00-119-712" /number=17 intron 20921..21173 /gene="ATP1A2" /note="G00-119-712" /number=17 exon 21174..21297 /gene="ATP1A2" /note="G00-119-712" /number=18 intron 21298..21496 /gene="ATP1A2" /note="G00-119-712" /number=18 exon 21497..21642 /gene="ATP1A2" /note="G00-119-712" /number=19 intron 21643..21827 /gene="ATP1A2" /note="G00-119-712" /number=19 exon 21828..21958 /gene="ATP1A2" /note="G00-119-712" /number=20 intron 21959..22646 /gene="ATP1A2" /note="G00-119-712" /number=20 exon 22647..22748 /gene="ATP1A2" /note="G00-119-712" /number=21 intron 22749..22899 /gene="ATP1A2" /note="G00-119-712" /number=21 exon 22900..22991 /gene="ATP1A2" /note="G00-119-712" /number=22 intron 22992..24295 /gene="ATP1A2" /note="G00-119-712" /number=22 repeat_region 23845..24165 /rpt_family="Alu repeat" exon 24296..26668 /partial /gene="ATP1A2" /note="G00-119-712" /number=23 repeat_region 25527..25820 /note="ALU repeat" BASE COUNT 6489 a 7135 c 6574 g 6468 t 2 others ORIGIN 1 gaattcagat gttagccttg tgccctgaga gtggatttgc tagaggagag gctgataatc 61 cctctgcagg gagttccctc tggggccaga gatggaccca gcccagccag caggcacctt 121 aggcctctgg ccaaagcccc ccttccctcc ctcccttcct ctctgctttc tttgcactca 181 ggacacttag acctgcccca ctccgctggc ttcccctcac tccatctctg tgcccagaga 241 aacaggggta ctccagggga atcctgtcct ctgactgttc ccagtagaag gcgggagcag 301 aattttttct tttctctctg ccccttctgg gttccccttc ctctctgtgc ccctcatcct 361 ttggattact catcaaatga gcagcaacaa 
cacccctaga ccagctggtc ctgtcctggg 421 tcatctcgtt gtccccagtt tctagctggt ataaggttaa agataccctt tccagcccca 481 ggcaaagcct taggcatttc tgggatggct cctgactctg atgggcatcc tcactcctca 541 gacatttcca atctccccca cttcactgca gagaggcctg ttctccagcc tcagtcctgc 601 atgctgcagc tttgcttccc cttccccact ggaagaagga aacagtcgct ctcctttccc 661 gttcacatgt ctgatatttg ctatgatatt gaccttcaca cacatcttag aggttctgta 721 ttttcctctc ttttgacttc tccctatcac tactctactc aggcatagat tcagtctaga 781 gctcttcaac agcaatgaaa cggtaaacga ggggtaaaca aaacccagag atgtcatcag 841 gtgcctgcta ggagtcccac ccgttcccct ctagccctgg cctgcccctc cctaccagag 901 tcttgattct ccttcaaaac tttggagagg ttacatctgg cagggtttgg ggacaccctc 961 ttgcctccat ctccaaactt cccgctgcct cccgagccac ctttgactcc tcctgggccc 1021 cccttccagg tgcctcaagt gcagaggtca gccttcggtt caaattctat ttctgggctt 1081 cagtgtccct gcaacctcta taatagaagg aacagagtgg tctatgtttc tttctttctc 1141 tgggagattg gatgtagact cgggagacct gagttctggc tctgtcttat caattcaaat 1201 tgtttaatct ctctgagcct cagtttaccc acttggaaaa gaagagttag gccacatggt 1261 ctctgtcatc ctgggattct gagtcaactg ctccccttca gccagatcct agggggcata 1321 tccccccaac ctgggaccca ggcccccttc ccttgcagcc tctctcagtt gtgagtgcca 1381 caggccagag ggggttgaga ggagccaagg gggggcatgc cgtgggacaa acgggctgtc 1441 tagggccagt ggctaaaggg gcgggagggg aggagtcctc agggatcctg tttcaacaaa 1501 cgtttctttc ggaggagggg aaggcggggg agagggggag aaggacctat ttaaagctac 1561 cctgttgctt tggctttctc tgtctgccag ggtctccgac tgtcccagac gggctggtgt 1621 gggcttggga tcctcctggt gacctctccc gctaaggtcc ctcagccact ctgccccaag 1681 atgggccgtg gggtgagtat ccctaaagag caggggtcgc agtctagagg gtgagggagg 1741 ctgctgagga aggatgaagt gggaatggga tgtggaagga gcgggagaga ggagctgagt 1801 gggatacttg gaacttgaaa ctaacagtct ccaaactacc aatgtgtgta cagatcccac 1861 ggcggtcaca taccctagtg ctctcctctc ctctgcccgc cccagccccc tgcccacagt 1921 tagacccaca gcccaccaca catgccccct ccatctctcc ctccacccca ctcccacatc 1981 tctagtcctt ccctgcagca gtccctcctc tcactgcatc tcccggcctc ccagctgaaa 2041 ggatgtttgg cactgtgagt cctcctgtct ttgccacctc ctgaaaatat 
ccattcctct 2101 gggagtccta gcctgagagg ctgggggtcc attttgaggt tagagagggg cagtagagca 2161 tccggctccc agagtcacca agtagttccg gacaccagct ccaggggccc tgagtgccaa 2221 ggacagctgg gcggggtgtg gggaggaaga atgatgcccc agcccccacc ccagcacccc 2281 tccggatgtc ccgagcagca ttgtccgtgg gttccaagcc atgggctttg aggactttcc 2341 cagaggctgg tgacaggtga ccccccccca gtccacccgt ccatggcatt ctgcagcaat 2401 ggctacacca agaggaagat taacaacaat ggggcgggca gggagagggc tttgaattct 2461 tggaaatgaa attcccatat ggagaagagt ggcccaatcc ttttacatgc tgtgcctggc 2521 tgctttcctt ttgtggaata ctggcaaaag aggacacaga gggaaagggc atgctgcgga 2581 acaggagagt atgaggaaga ctgtgcctag ggggagccgt gggtgtttct gggcccagga 2641 ggagtcaggc aaggacatgt tattgcacct agttgagcag ctgggaaatc ctgagcctgg 2701 acaccagggt tcctgagcta gggttggagc cataaataga attggagtgc atgcatgtat 2761 ctgtgtgtct gtgtctgtgt gtgtgtgagt gtgtgtcctt gaccacgtgt tctccatagg 2821 cggggagtcc tgcagttact catggatgag tacctgtgtt tgtgtctcag caggcaaatt 2881 tgtaaatgta caagccttca agcctgcaca agcataaatg tcactgtgtg ggggtgcttg 2941 agagtctgtg agcctgccct taggagctgt gaggtatggg cttcacggag gaagaggtca 3001 tagctgcagg aagatatact gtcttgggac cttcagctgc atcttgacat ctctggctcc 3061 ccatggaaag ggcatcccta gctgagatgc aggacctctg agaagaggca ggggttaaag 3121 gatcagataa gccctccatt tccccatcca agtgaagaga gaaggagggg aggtagcccc 3181 ctaccctgct ccatcttaga gcaaggtaga cccagctcag gaggtcttgc ttgggaagtg 3241 atgagacctt gacttttcca gtctgtcttt ttcccctagc acccccaaac tccccttaat 3301 caccatccta agttgctgtg ggtgatgcaa tagcaagatg aggagcagat ctgggctgtt 3361 taaactaaga ggctgggtga ggtgggggta tttagggcct ggagcttaga gttcaaccta 3421 ccaacgaccc ctgaagaggg aaggcatctg acacccacaa cctgttctag ggatgatttt 3481 tcctccaatc ccttctcccc tgcatctcca ctgcagaggc aggcttcact gtccccccat 3541 tacccagtgg ctgtgaaggg cagcgtggga gttgggggaa gggacgacac tggtgggagg 3601 gagcccagcc tgctccagct accacggaga ggctgagatg gggggagcgt tggcggattc 3661 ccagctgccc ccactctgtc ccagcctctg gctttctcaa aaaggactct ctgttctcct 3721 tcagcattca agacccagag agggggactt gtggttgggg gagggaggag tggagggagg 
3781 ttgggggggt ccttgcttcc tctctttctt tcttgcctgg gcagccgctg gccccaaatc 3841 tctgcaggct cctggctgca gagcctgaga tctttgccag gacaggagga gggggaaggg 3901 gcagtgtgtc tcaagctcta agcctgctgg agagcagggc gggagcttgg gaaaaggagg 3961 cactgcgtgg agctgcttag ctcagccaca atccagcatg ccaaagtgca tggaccagca 4021 agtttttaaa aagcatgcat tttattccaa tttttatgaa atttatttca catctgaata 4081 tacagaaatt cctctgcacc ctgctcttct gcccacagac ccagccctgc tcctctgtgt 4141 atgcctgggc ccccttctgt ccctgaagtt cctgagggtc acactgaagc tcagccagcc 4201 tgatctcttg cctctgttcc cttgattgca tttcttcctt ttctcagctc tgtcccacca 4261 cagacagtta gtgaatgtaa agcatttcaa tccactcaat taaatgaatc ctcttaggtc 4321 acctgagaaa ggcagaggca gaaccactca caagcctttc cccacccctt ccattgggnn 4381 gctcggtctc cccaagaaga tctgccttta ggaggccaaa attataccag attcatagat 4441 gcttcaggtg gagacaaacc cacgatttta ctggtcacct tttctgagaa gagcacttct 4501 gagcaacaca gcctccaaaa gctcaatatc tttttaaatc attggtatct tgggaacaac 4561 agtgttgact tctccttgca tcccccaccc ctacccttgc ttctgattgt actggtcaga 4621 cttctacctc tcaagatgct gaattggcga cctcttatga acaaggcagc tgccatactt 4681 ctgggattgc tcctgactct gataagtaaa aaaggcatta ccctggagca caaaagtctt 4741 agggcagtga agctattcca tatgatacca caatggcgga tacacgtcag tatacatttg 4801 cccaaaccca tagaatgtac accaagagtg aactcaaact taaactatgg gctttggacg 4861 ataaagatgt atcaatgcag gttcatgatt gcagcaaatg taacactctg atgcagggtg 4921 ttgatagtgg gtgaggctgt gcatgtgtgg gggcaggaga tatatggaga actttctcta 4981 cttcccactc aattttgctg tgaaccagaa aattgctcca aaaaaagtat attttttaaa 5041 atgcaaacaa aacccccaaa actgcattat cctgcaaaaa aaaaaaaaaa aaaaagcaaa 5101 caccttggct gtggacccta cccaggtatc cactctttat gggaaacaaa agttaacatc 5161 atagtgtggg aaaactaatt cctgaaagcc tagtgcagac tttgcagaat ccagtggaga 5221 aataggcacc aggtggaaac tgagcagccc ctctatgaca aggactctat ttcctgaggg 5281 aagtgggagg agagaagact ggggcagaag aggaagtaag aaggtacctt tttgggaact 5341 tttcatggct acccaggcct tcctcagcct ctatcctatt ctatcacccc aagagggcct 5401 cctcttagaa ggaaagattt cccaccctgt ccccaagaag ttccctgtga gaccctggaa 5461 
ggctggtagt ggtgagccca caggccctgg agtggaaggg aaatcaggag cctcataaga 5521 ctacaattaa aatgtccatc tgctcaggcg cagtggctca cacctataat cccagcattc 5581 tgggaggcca aggtgggagg atcacttgag cccaggcaat atagtgagac cttatatcta 5641 aaaaaaaaaa aaaaaagaaa gaaagaaaga aaattgtttt aaattagctg agcgtggtgg 5701 tgcatgcttg tggccccagc tacttgagag gctgaggtgg gagtattgct tgagccctag 5761 aagcggaggg tgcagtgagc tgaaattgca cactgcattc catcctgggt gacagagtga 5821 gaccctgtct caagaaaaaa aaaaaggcca tctgattcac cagatgaccc aggaatgtct 5881 agacttaaaa actatataaa atgtagtttt agatttactc tcagatacaa tttcttatgt 5941 tttgggacaa atggggctca gttcccatgt tacccaattg tgctttgggg gtccagtcct 6001 gtgagttgcc tcccgtctag taacgggcag gcctgcccag caggatggag agggctccca 6061 gcacacccca tgtctccact tcctggacat gccctgactc tccagctact gcctcctctc 6121 agtccatcag gcctgctctt tccatcccta ccattccacc accagacctg gctctcctct 6181 ggacacgttc ctgcctaatc ccctggctcc ccaaagactc ttactccccc aagttcctga 6241 gtatttctac ccctctctaa gtaaaattgc catggtgttg ggctaagaca ttcctacaga 6301 gtgattctac tccctcaaca accccccaca aaaccctgtc cactcccttc cagaattttc 6361 ccaaatagtg taggatacag gttttcaaaa tttttgtttt gttttgtatt tagcagcagg 6421 gtcctttcaa caaacaaaat cttagcagaa gcccaagcat taaaccactc aagtggtgct 6481 tttcagtcct gagcccctca cccttctctg aaactcacat gggagcccct ggttctgcca 6541 atagtctaca aactgagggc ctagggtccc tcaccctcca tcccccctat ctcccccaag 6601 gcagccgctg cttttgaaca cacaccccca ctgcctgggc tccctggctg agtggtggga 6661 atggaggccc cagcccctct cttccctgac tctccggctc tccctccctc aggctggccg 6721 tgagtactca cctgccgcca ccacggcaga gaatgggggc ggcaagaaga aacagaagga 6781 gaaggaactg gatgagctga agaaggaggt ggcaatggtg agggaactgc tgggccatgg 6841 aggaggggcc ccatgctggg agagctgtcc ctgcagccca ttgcactcag agaaactccg 6901 tgtcccccat gctgctcaac tcacccctgt gccctgcagt accctcatat gcatcttaga 6961 tacctcctcc ccaaagtaac tcaccctccc ttccccagga tgaccacaag ctgtccttgg 7021 atgagctggg ccgcaaatac caagtggacc tgtccaaggt gagtggaggg gcttctaggg 7081 aaggaacaaa agaggcaaga aaaccatgca gcatcaaggt ggcaggagcc ttaaaactgt 7141 aatccagcct 
tctttacaga tgaggaaact gaggcccaga aacaaggact ggcccaagga 7201 catgcagcta gttggtggca cagccagaac tagaatttgg atcccctgca tcctagccca 7261 aagctctctc cctgtatacc ctagaagcca ggactcccta tgacccaggc cccagagggc 7321 ctccaggcag ggcccttccc tataccccaa gcaacttcag ttgcacacag ccttccacag 7381 agctgacagc tgatgcacat gggctgacag cccattcctg tggttacagt gatttgctgg 7441 gcccctgcta catatgtgac attgtgatag atgcgttaca tacatttctc taatctttac 7501 aacaaccctt tgagataggt attgcagacc tgtttattaa acaaaggaga ctcagagagg 7561 gagagtaact tcccaaggtc acatgaacag taactggtac agtcaattgc taagcccttc 7621 aattaaaaat ttgtttaaaa atcattaatc aaaaaatgct gggtgtggtg gctcatgcct 7681 ataattttac cactttggga ggccaaggca ggcagatcac ttaaggtcag cggtttgaga 7741 gcagcctggg caacatggca aaaccccatc tccactaaaa atacaaaaat tagcttggca 7801 cagtgatggg cacctgtaat cccacctact cgagaggctg aagcaggaga atggcttgaa 7861 cccaggaggc ggaggtttca gtgagccgag atcataccac tgtactccag actgggcgat 7921 agagtgagac tccatcttga aacaagcaaa caaacagaca aatatttagt gggtgtatat 7981 ctcctttatg ggccttccct gactgctctt tctaaaattc tatgccccta ttcctttacc 8041 ttactttatt ttccttagta acacttttct atctggcact aatatgtatt tgtttattgt 8101 gtgtcacctc tccagatcat aagctcctca agggcagaga ctttgttggt cttgtccatc 8161 tctgcatccg cagagtgaca attgtgcctg gcactggata tacatttatt aaatgaatga 8221 atgaatgaaa aaaaaaaaaa aagaaacggt ccctgcttta aaggagctca aagtctagta 8281 gcagaaaaaa tgcaatccta ggattccaat gcaatatggt gagtgcagct ctagaatcaa 8341 gtatagaaca ccatggggaa ccagaaagaa gcatctgaac cagcctgggg ctgagggaat 8401 agggccaggc aaggctttct gtggaaagtg acaggtgatt gccaaagttg tgtgacactg 8461 atggtcaata atgtaggact tgagcagggt gtggaggtgc ccacctgtag tctcagctac 8521 ttgggaggct gaggtggaag gattgcttta gcccagggtt caaggatgca gtgagctata 8581 attgcaccac tgcgctccag cctgggcaac atagcaagaa tccatctctt aaaaaaaaat 8641 ctaggactct cctagaatcc ttgataaaat aatctgaagg aagagagatg gggaaaaaac 8701 agttgaggtg tctttccagg gagcttatcc agactgcctc caggggagat gttatggccc 8761 taaccatagg tgttgtctca atccaatata attaggggtc aagcccagtc aaacccttct 8821 cagccctcaa gccctccaag 
ttgcatcaca aatgccacct cctccatgtg gtctctcctg 8881 atggtccccc accccttcca accaggtggg actttccttc tgggcattct tcccatgccc 8941 gggagctgaa gggatgggca tggtgactgg ctgggttggc tccggatgcg tgcccctacg 9001 cctctccttg ctccctcagg gcctcaccaa ccagcgggct caggacgttc tggctcgaga 9061 tgggcccaac gccctcacac cacctcccac aacccctgag tgggtcaagt tctgccgtca 9121 gcttttcggg gggttctcca tcctgctgtg gattggggct atcctctgct tcctggccta 9181 cggcatccag gctgccatgg aggatgaacc atccaacgac aatgtgagcc cacacgcctg 9241 acccgggaac agcccgtgac tgtcctccaa ccctgaaccc ccaacacagt ggggggtggg 9301 cagggaacaa ggccctcaca taacagtcct acagatgccc ctgcatctta ggctggaaag 9361 gggagaggct tctatatata tctgtaaagt actcctccgc caacagtgca tacacattca 9421 catactcatt aattaatgaa acaatctctc cctgttcctc cccgcttaag tgagcctgtc 9481 tgtgtgcctg ggtcttcatc ccctagcaaa gtcctaaccc tgggaacttc ctaagacttt 9541 ccctcccatt tctagtctta acaggcttga ggttggcaga tcaaggggga ggttagtgag 9601 aagggctttc ccctaccatc atcactctca gtcacagaca aaggtctggg ctgtcatctt 9661 ggatggcact gcctgctcat cccaagtggc agctgcccct ttagggttgg ggggaaggtc 9721 aggtccctga aactctttct ccttaccagc tatatctggg tgtggtgctg gcagctgtgg 9781 tcattgtcac tggctgcttc tcctactacc aggaggccaa gagctccaag atcatggatt 9841 ccttcaagaa catggtacct caggtaagat ggcagggctg ggctctgggc taggctgtaa 9901 ggttttggca agagtccagc tcatcttctg tcagctccca ggcactaaga tagagatgga 9961 cagaaaagat cctccagctt tccatgccag cacctaattg tttatggggc ttctccttct 10021 gcttgacggt gtgggagacc agcaggagaa gaaggcaggg gcagagacaa gcatttcatg 10081 agctgcctgt ggctccccac agcaagccct tgtgatccgg gagggagaga agatgcagat 10141 caacgcagag gaagtggtgg tgggagacct ggtggaggtg aagggtggag accgcgtccc 10201 tgctgacctc cggatcatct cttctcatgg ctgtaaggtg aggaggtcat accagagcaa 10261 gcagttgagt ctaaggagaa ggctgtgtgc agagctgaga ggggcccagt gaggtttaaa 10321 ggtggagaga cccaggtcca aatgtcaaca ccatgcctac gcccctgcta aaccttctct 10381 cagtgtgggg tgcttgtgca gtgcctcctt gcatctgtgt gttatgaatg ctctatgccc 10441 caggtatttc tttgttgtcc actttttagc aatcgtgtat tgaataccaa ttatgtgtca 10501 gctgctttat atatgttcga 
tgtgctatct gatgtaattc tgaaagagtg gtgtaatcat 10561 gcagggctgc tgcatgtggc cacacacgtt gcatcctgaa tgaccccagg aggtgcattt 10621 tcataactac agtgttagtg tctccccctg tggttgtgca aggcagagat tctgtttaca 10681 ttcaccattt tacagttgag tacatggagg ctctcttggt taagcaagta acccgaggtc 10741 acataagaag tcaggaggag tcagaattcc aaaccagggc aggggcaata gatgagtaat 10801 gccataaata tgtgatgagg gcaagtgtga ctttgttcct cgtgtcatga agattgagtg 10861 tggttggagg tggttcagga gacagctgtg tgcatacaag tggctctgcc agtctgatga 10921 ctatgcactc cttcctcctc aggtggataa ctcatcctta acaggagagt cggagcccca 10981 gacccgctcc cccgagttca cccatgagaa ccccctggag acccgcaata tctgtttctt 11041 ctccaccaac tgtgttgaag gtgagaagcc aggctgcccc ctgtaggaaa gagtctgaat 11101 cctgaatcca tagtcaggat gaagggctct ggtagtactt acctggcaaa agctctgcca 11161 ttggtggggc aattgagggg tcagggggcc ctgaatatta tggaaacacc ctccacagag 11221 cagggtattg actgagggca gcctgactcc atgggaaatg tatctccctt ccccagggag 11281 atctctgttc tgtccacccc aagacttgag tggatttcaa agctttcagg tattgggtta 11341 agaacccctt acgcatgcgc acacacagat gcacatacgc tcagaaccaa cctggattct 11401 taggagtctt tggatttgat tgatgcttct attccttgtg gttctgggcc tcgctgcccc 11461 cttcgataac ctttgctttg gccagggccc cagggtctgg gagggccaag aaggcattct 11521 ggccaggtga acacttgggg cctacttcag gctgcttcta acctttgtta catcaaaaga 11581 catgtttcac catgcacaga ctacccagtt tgccccatcc ctgttctggg ttctagagat 11641 aaacaaaggt caataagaac tcccagagct gccattactt aatggcaggg gtccaagcac 11701 aactgaaacc tgtgccctct gtggtaaagc cagtgagcag agtcagggct gcatcagaga 11761 aggagtagga gggaagagac agaaagaggc ctcactagac cagtggcctc tgaaaatagg 11821 gcaaaggagg cctcatatcc caatccacac ttcccctgtc cattactcat ttctcctcag 11881 tgtattatgt ttctctgtgg tttttttccc ttcttcgaca agttggaatt caaataagga 11941 ttcaaaccca aatttggctg gctccaatgt gcatgctttt ttcaatacac cacccttcct 12001 tcccagaaga cccctgtaat gtcacacaag ggttagcaat tgcctggctt cctctataga 12061 gattcatgcc taatgttgag aaagaaaatc ttaagactgg atggaaaatc cataatgagt 12121 gctttacttt gtgttcataa tatacttttg gatggtatca agtatgggaa tgattttacc 12181 
aaaatgccac aggcaatttt tatttattga atcttgtcta tttgaacttt gagattattt 12241 gcaagcttct ccccaggctc gagtacaaga gaaaataatt tgctgttgta ttgattcaag 12301 gattggcgat ctatccaggg aaaggaaggc caagtgggga caacctactg aatgtggcct 12361 ccagatctgc ggctttggct cacctatcct gtgatgattt tccttgctca ggaaatagga 12421 tgggactgca gtccctggga gccacaaggc acccaacctg atgccccacc atgttgcagg 12481 cactgccagg ggcattgtga ttgccacagg agaccggacg gtgatgggcc gcatagctac 12541 tctcgcctca ggcctggagg ttgggcggac acccatagca atggagattg aacacttcat 12601 ccagctgatc acaggggtcg ctgtattcct gggggtctcc ttcttcgtgc tctccctcat 12661 cctgggctac agctggctgg aggcagtcat cttcctcatc ggcatcatag tggccaacgt 12721 gcctgagggg cttctggcca ctgtcactgt gagtgggtca ggctgaggtg ccaccagggg 12781 agggtctcac tactctttcc tcagagtgat agaggcacag ttgcttgtag cttctcacta 12841 ctttttcccc ctagagtcac ttattggatg cgatactcag agatcactta atacatggct 12901 tagggtatga gttgagaaat ctcaggtggg gctattctta ggcactcagt ttctcttttg 12961 gagcttgtca cggctgcctt ttctcacgat ctgccagcag tctgggtctc attccgtcca 13021 agtctcctgt agaggctgtc tcaaaggtca ccagtcacta actagcaaat ccagggacct 13081 tttctcagct ctcctcccta actctgctgc tgaaacagat actgctgtcc accctcctac 13141 cattctctct tttgtcgatt gctgtgatgt gtcactatcc tggttcgcct ctcctctgtc 13201 ccttcctgct ccccctacac tgattctcag gcctcagctg tctgttcttc ctacaccctg 13261 agcaagagcc aatctcctcc cagtttgcag tgtcccttct gtgttgatga ctcagacatc 13321 cctatgctca gctctgctct ggccctctct ggagctctag tcaaacgtct taaaccccct 13381 aatgggcatt tcatcttaac agatactcat gatctcaaca catctaaaat tgaacattca 13441 ttatttacag ctaagtccca cccattcaag ttagtggagt agaatcaggg gaggaggaat 13501 ggagccacgg tctagggtaa ggttatggcc atctccggct tcagccttaa ccttttttat 13561 tctcctcttt ctctaccagg tgtgcctgac cctgacagcc aagcgcatgg cacggaagaa 13621 ctgcctggtg aagaacctgg aggcggtgga gacgctgggc tccacgtcca ccatctgctc 13681 ggacaagacg ggcaccctca cccagaaccg catgaccgtc gcccacatgt ggttcgacaa 13741 ccaaatccat gaggctgaca ccaccgaaga tcagtctggt gattgggtgc tccagagggg 13801 gtggatagga ttagaggagg ctgagggcag tggcgtggtg gggtgagtgg 
ttgagataaa 13861 ggctctaaag ggagccacgc tcctggttcc ccctcatttc ctcccagggg ccacttttga 13921 caaacgatcc cctacgtgga cggccctgtc tcgaattgct ggtctctgca accgcgccgt 13981 cttcaaggca ggacaggaga acatctccgt gtctaaggta gggggtcagg acacacacca 14041 ggtatgtttt gggggtgtct ccaaagcctc ttgctggccc cagctttcct tctcacatga 14101 tgtggctgcc ttgggggttt cagtgccgcc ttcacctgat cctccactcc cttccctccc 14161 atgctgacac tgaattcttg tctcttctgg cagcgggaca cagctggtga tgcctctgag 14221 tcagctctgc tcaagtgcat tgagctctcc tgtggctcag tgaggaaaat gagagacaga 14281 aaccccaagg tggcagagat tcctttcaac tctaccaaca agtaccaggt ctgcttgggt 14341 tgccaggaca gaggaagaga gagggatata aatgggtgag ggtggacaaa gccaagggga 14401 atcatcacta gcaggagtgg gggtgtctga gggtcatgtt ccctccccct gctaagtccc 14461 ccaggacagc tcatatgact gcatgtctga ttgcaactag ccccagctcc caaaccttac 14521 tccatccctt actacaatct gtcactctct caactcttct tcccatagct ccgcatcctc 14581 tggcctagcc ttccaactct tctaccccag tcagtgaaac cataattctg taattccttt 14641 atggtgcccc ttagccttta ggcacctgtt aaatgccaag aatgcaatct cagcattcct 14701 tattatccat agaggtaaag ccccctgagg ccaggtggtg gtggggatgc tgaggaggga 14761 gccgttcagc ccctagcaca gtgcctgtct taacctgacc tcaataaata tttgttgaat 14821 gaaagactgt cctacggagg tggctctcag gttacaagtg ttggaactgt gaggtctaaa 14881 cacccccctg cacaaggaga ttctctttgt tgacaatctt tgatgggttg gggctacttt 14941 tctaaggtgg tttccttacc agctgctgct ctatgccgcg ctaccaagac aagtatggcc 15001 ctctctgtaa ctacctgttg tctctccagc tgtctatcca cgagcgagaa gacagccccc 15061 agagccacgt gctggtgatg aagggggccc cagagcgcat tctggaccgg tgctccacca 15121 tcctggtgca gggcaaggag atcccgctcg acaaggagat gcaagatgcc tttcaaaatg 15181 cctacatgga gctgggggga cttggggagc gtgtgctggg tgagaggcca gaaacaggag 15241 gctcagaagg ggattcccaa gcctctgcgg catccctggg gtgggggact gtgggggcgt 15301 ccaggaagcc actctgcgga tctcactgat cccttctgcc cccctttagg attctgtcaa 15361 ctgaatctgc catctggaaa gtttcctcgg ggcttcaaat tcgacacgga tgagctgaac 15421 tttcccacgg agaagctttg ctttgtgggg ctcatgtcta tgattgaccc tccccgggct 15481 gctgtgccag atgctgtggg caagtgccga 
agcgcaggca tcaaggtact ggcctcccat 15541 cctcccctcc attctagcct cccccatgcc agagttcaag gagctgcagt ggctgctgcc 15601 ctggaaaggc ccaggccacg gtggcctcct tcccactgac tcagagaaga agctgtccat 15661 ctgcaaggaa aggcccaccc ctgccttggg gcactcaccc ttatcccttt tgctcagaga 15721 ggccagtgtc ccatgcccct cctctccctc cctggcacag ctctttgtcc catctgcatg 15781 tgtacccttc cattctgatt tgaagcataa tctggcactc ctatcttgaa gacctagttg 15841 tcctaccctt cccaactact gcccaccctc tctgtcaact taggatggga ttagcaggct 15901 cttgtatgct cccagtggct cagcccataa ccccacagaa tgcctcccac tcaaggttct 15961 cttgttcaac tacaattgct tctcactgac cactccctcc tctgtgctat gtttgttctt 16021 cattcaacaa atatttactg tttactatgt ttcagacact gtgcaaggtg caaactctag 16081 caccttgcac agtgcctgat acatagtaaa caatttgtgt cacatgacac atctagatgg 16141 gacaatgagc tgtgccaggg aggacacata taaatcaggc actattctaa gcactaaacc 16201 tctatgattt catttaatct acgcaacaac cctgtgaata agcaccagta tttttccttt 16261 tttacaaaga gggaaactga ggcacaaagt aacatgccaa ggccacacag ctagtgtaag 16321 tcatagagca ggtacttgaa ccaaggcagt ctgctatttc ttaactatca taccaggaga 16381 cttgtttctg tgggtgtatg tggtttctga accccataac cactctgtct ccgaggcagg 16441 catcattatc cctgttaaaa gtaaaacaag gctgggcaca gtggctcatg ctgtaatccc 16501 agcactttgg gggggtccaa ggctggtgga tcacctgaga taaggagtcc aagaccagcc 16561 tggccaacat ggtgaaaccc cgtctctact gaaaaaaaaa tacaaaaatt agccacgtgt 16621 agtggcaggt gcctgtaatc ccagctcctt gggaggctga ggcaggagaa tcacttgaac 16681 ccgggaggca gaggttgcag tgagccgaga ttaggccact gcactccagc ctaggcgaca 16741 agagcaagac tccatctcaa aaaaaaaaaa aaagtaaaac aaatggaaac tgcaagtggg 16801 taagtctttt ccccaaaata aaaccaccag taagtgatgg agtgggacta tgaagcatga 16861 tctgcttgac cggaaagcct gagctcctgc tgcttggcca ggtgcctctc tctgtgatca 16921 gaaccatcac ggaggcctgg gcagaggacc tctatgggaa gtcagctcag ggaagcatta 16981 attctgcctc agagaactgg acgatgtttg acaaaaggaa tgatatttga gctgggcttt 17041 ggaggtcaaa gagtattctg caaagcagtg agggagagga aggcatgtca agatcaggga 17101 acagtgtgta ccctggcgtg gctagactgt acggagcgtg tgtaggggtt ggcaggggag 17161 aggtggcaag 
gagtactggg ggttgagact ggaacactgg gtcagagcca gttggaagga 17221 cttggcttaa atgccagact tgggctttat cctgcaggcc acagagagcc atcagtgtca 17281 cgggggaggt tgagctttta ggagactacc tggtttgagg agggactgga tttggggaag 17341 aactggagtc agggagagac aggttaagag gccattgcca tagttcaggc aaagaatgat 17401 gacctgaact agggcagaaa caaagggaat ggagaggaat agccatgtct gagaaatgtt 17461 ttgagaactt ggtgaccaaa ctggatatgg aaggtgagga agatacagga gagaagggga 17521 caggatgaat agagaggaca acatctaagg tagggagcag tagcaggctt ggtggggtgg 17581 caggagtaga gaaaaggagt tcaatgttga ccttaagctt aaagagtcta taagataccc 17641 caggaaagac agatgttcca gtgaacattg gagtatggtt tagagctcat gaaagatggt 17701 caggactgga gatttgaatc tcattagcac aagggtgcaa actaaaggct tatagtatgt 17761 tgtattatat tgcccagaaa gagcagacag gtcaagtgaa gggaggagaa tgaaatcctg 17821 tgccaaacca acatttaggc acaagaggaa ccagtgagga ggggagaggg tggtaggaac 17881 caatgccatt taatcttcat gaccacctag aaaggcagga atcatctctg cattgtgctg 17941 agacctcagt gcttcacttg cctaaggtgc tgtagctgga gaaggtcggt gtttaaactt 18001 gagcccctca ttcaactcca aagccactgc ttcttccatc ctactaggct gccaccctga 18061 gaggctggga aagagcatgc agaggagtgg agggaaaaca ggaatggagg agtatgtgga 18121 agccagggga gggcagggtt tcaaggagaa gaggtcaacg gtgccaaatg cctcagaggg 18181 gtcaagtagg atgaggacgg caaagacact tttcattttt caattgagag gcccctgtgg 18241 cctttgggtg gcagtttcag tgaagtggtg ggagtggaag ataaggcagt ggggaccatg 18301 gaggcttcct tttcagtatc cataggactt gccctgcaag cccacacatg caaatgctat 18361 cctaacccaa ctctgctcac ctcatagcct cttagcccat cagatgccag ccaccctccc 18421 catcccaact ccattcactc tgcaacaata gcaatgaaca ttatggtgaa ggtagggacc 18481 accgtgctca ccagggagga tgtcagaggc agatgctcaa ggtggtggag gcaggggtgt 18541 tgtctccatc ttattcctgt tctgtccatt acattgctgc agtgatcaat atttgacact 18601 aagaacctca tgtcctatat ccaggtccct gtgcatttga gcaaattgta gcccctctct 18661 aagccttggt ttcctcatgt gtaaaataaa ggagttgtta aagggtcctt ccagctctga 18721 ctgccttgaa tcccatgaaa tccaaaatcc taaaatatct cctaaataat cctaaagatt 18781 tgaggcttct cttccagggg atcctcatga acttgttact ctgttatgga tactttcctg 
18841 cacttaagca aactatctct ctctaaccct ctgaacataa ctcaacttgc cattatcttt 18901 cctcctcacc cacccactca aattcacaca gacacacaca aatccccacc tcacatacac 18961 acacatccca cacacacaca tagacacaca caatccccac cccccacaca cacatctcac 19021 acacacgcac acatacacaa atccccaccc cacatacaca cacacacaca taccccacac 19081 acacacacac atcccacaca cacacattcc ctctcacaca cacacacaat ccccaccccc 19141 tcacacacac acaatcccca cccacacaca cacacacaca tcctacacac acatacacac 19201 accccacaca cacacacata cacacacaca catacacccc accccagcca ctcccctgcc 19261 cacagacgca cacacacatg cccttatttc ggttgggatc tgccatcccc agacatctcg 19321 ctatctagct ttctcttact ttgggagagg aacaggaggg ggataaaccc ttaatactac 19381 tttcctgttg tctcctctcc ttcccactag gtgatcatgg taaccgggga tcaccctatc 19441 acagccaagg ccattgccaa aggcgtgggc atcatatcag agggtaacga gactgtggag 19501 gacattgcag cccggctcaa cattcccatg agtcaagtca accccaggtg aggcctctgc 19561 aggaagcccc tgtgccctaa tcaacacgtc ctcttgcaca gaaggcttgg gtgtcccctg 19621 ggagtatttg gatagcattt ctgggacctt tataggcctt acctctgaca ctattgtgac 19681 tggttcctca gccctgagct tgtatccagg ctgtgccact gaatacccat gtaaccttgg 19741 agatgtcaca ttcccttcct gagcctccat tttctcacca caaaattagg atgccatctc 19801 cactgctgta ggagtgggat gtgagtaaag aaactagccc agggcctagc aaactgcact 19861 tacaactgcg atgatcccat ggtgaggatt tagactcaca ttctcaaaga gaaatggggg 19921 aagtgagtac tgaggagagg agaactgaag caacagggga agcaggctca gcagggagcc 19981 tgcaggctgc ggtggtgaag agaggcaggg ggcaggaggg gctggtacag gtgccagggg 20041 tcagctgtct ctgtccccac ccaccctcca gagaagccaa ggcatgcgtg gtgcacggct 20101 ctgacctgaa ggacatgaca tcggagcagc tcgatgagat cctcaagaac cacacagaga 20161 tcgtctttgc tcgaacgtct ccccagcaga agctcatcat tgtggaggga tgtcagaggc 20221 aggtgagcac agccacggga ggcagatgac aggcagggac cggggaggca gggacagggc 20281 caagacaagc atggagtgag aggcgaggag ccaggcttgg gaaggggttt cgtcctcaag 20341 tgtggccgtc ttccctccag ggagccattg tggccgtgac gggtgacggg gtgaacgact 20401 cccctgcatt gaagaaggct gacattggca ttgccatggg catctctggc tctgacgtct 20461 ctaagcaggc agccgacatg atcctgctgg atgacaactt 
tgcctccatc gtcacggggg 20521 tggaggaggg tgaggaggct gcatgggttg ggatggtttg caggatactg aagccggcac 20581 ctctgttccc tgtcccttta ccccagttga agaatcattc cacaactcta ggcagcccag 20641 cccactttag cttcccttga acacaaaatc ttcctctctt gggaagacag gcagccatgc 20701 ttcaggggct gggggtgggg aagagtccct ctgacctccc tgatgccctc agaatctccc 20761 cacaggccgc ctgatctttg acaacttgaa gaaatccatc gcctacaccc tgaccagcaa 20821 catccccgag atcaccccct tcctgctgtt catcattgcc aacatccccc tacctctggg 20881 cactgtgacc atcctttgca ttgacctggg cacagatatg gtgagcgcag gaggtggagg 20941 aggggacagg caaggcaatc gtgatggcac agtggcaggg aggagaggtg cactggggca 21001 gtggccccag ctgtgggggt tacaggagac aggcacaggc ctggaagaca atggggtctg 21061 aatacacgct tttttaactg tgtcaacgat cgtcactgtc gaagatcaat tgcttctgtc 21121 atctcctacg tcccttcaaa tgccctccct gccccatttc ctaccccaca caggtccctg 21181 ccatctcctt ggcctatgag gcagctgaga gtgatatcat gaagcggcag ccacgaaact 21241 cccagacgga caagctggtg aatgagaggc tcatcagcat ggcctacgga cagatcggtg 21301 cgccaagccc cgggcctcgg gagggaaccc caacagggtt cttttcccag ctttcagagg 21361 aatgagcccc aagcaaaatt ccaggacaga ggccagctac ccaagggtca gggacctcca 21421 tctctggccc tgaggggctg tgccccttct gcttcctgct ctgaccctgc ccctgccttt 21481 ggccctctac ccacagggat gatccaggca ctgggtggct tcttcaccta ctttgtgatc 21541 ctggcagaga acggtttcct gccatcacgg ctactgggaa tccgcctcga ctgggatgac 21601 cggaccatga atgatctgga ggacagctat ggacaggagt gggtgagtgg tgctgtgtaa 21661 acacagcgca catgtgtgaa ggtacgggaa gctgaatgcc tggcaggagt ggccagtggt 21721 ggccaactgg gactggggct aggggtggat caggagaaac catgctacct tctggtccta 21781 ccctttcctc cgacactctc atctgtctct gcccaccctc cctccagacc tatgagcagc 21841 ggaaggtggt ggagttcacg tgccacacgg cattctttgc cagcatcgtg gtggtgcagt 21901 gggctgacct catcatctgc aagacccgcc gcaactcagt cttccagcag ggcatgaagt 21961 gagtgcccac ccccatggca cctacccacc caggctctgg gcacactcac caacccacac 22021 aggccaagct tccaactcag ccctggacca gagctgtaag gagggcatca gattttaatc 22081 ttccatctgt atatccatag ataggtttca tatagaagag agaagccatg cttcaggctc 22141 ctaattttgc attttgttat 
tgtctctctc cacctttctt ctctccttct ctccctccct 22201 tctttccccc atctcctact tgagtgccac ctgctcctat cctttgccca gccctaaatc 22261 cagtgtcttc tgtgcccacc cctcccctgg ccttccgcag gcagtctagt ttatagcact 22321 tgcactgtgg ttctggaggc cacgtgctgc cgagggtcgg gttcagcgct tcttgcagtg 22381 tgagttccag cgtgccaggc actgtgctaa gtgcctaaca cattgtttag ccccacgata 22441 gctctatgag gtgggtatta ctgtttacat ttcatggata agaggcccca aacaggttaa 22501 atgatgtgcc caaggtcacc aagatggtaa gggttcacca gatccgggga attctctctc 22561 ctctttccct tgcaccccag gggtatccca ggatctctgc ctccatgatc ccccttcacc 22621 tgccacctcc tttctttgcc tttcaggaac aagatcctga tttttgggct cctggaggag 22681 acggcgttgg ctgcctttct ctcttactgc ccaggcatgg gtgtagccct ccgcatgtac 22741 ccgctcaagt gagtgtctct ttcgggcggc ctgagtagtc atacgggggg ccttcagccc 22801 cctccaggat ccaagttctg atcgctttga atgctccttt atgtgacagc caccaagcca 22861 acctctgatg ctgctgacac tctcctccat tgctttcaga gtcacctggt ggttctgcgc 22921 cttcccctac agcctcctca tcttcatcta tgatgaggtc cgaaagctca tcctgcggcg 22981 gtatcctggt ggtaagcccc tccacattcc ccccagcaaa gtgcaagccc caccaccagc 23041 tcctccctcc aggaccacac gcagactcca ctcccactac tggttcctgc ttctgttcct 23101 caacaccctc agatcctggg gtcccaggtc ccaggagccc tgactctttc gaacctgttg 23161 tctctgccct cttcccatca gattctaaac cccgcctaac tcacagttgg tctccgaggc 23221 cctccacctt ccacctgaaa ccacagaaca tcagaattag aagggccttt ccaggtcacc 23281 tagtcccacg tccatatttt acagatagag gaggctcaga ggtgaatctg ttcagggtcg 23341 taaagcaaag aatagcagga ctggtactag aacccaggcc tcctgatccc cactcatcaa 23401 ccgacaagtt attgtttttc ctacaagcgt tttttaatca cttagtagag gttaaaatat 23461 gatggcttac aaaccaaata ctgtccagga atactgtttg gtttgacctg tgccatattt 23521 tgtgtgtgtg gggggaggag gggggattaa ttggtccgat atagaaaatt tagggtcttt 23581 cacataaaat ctgaattctc ctgaagatca gaagagctgt taatcccatg tccacatctc 23641 tgcacagcaa gaatggacag ctgctccctt agactctcgt tcgtcacctg aagaggccct 23701 gtgggcgttt gcattttcac cctcgaccta ttctgcccca gcctccaatg tgctaaatac 23761 aagtctcccc taaaggcagg ttcaccctcc cttcctttaa cctatggccc cagctcactg 23821 
gacctccttc tcctcccttt caccttcttt tttttttttt tttttttttt tttttttgag 23881 acggagtctc gctctgtcgc ccaaccagga gtgcagtgca gtggcacgat ctcggctcac 23941 tgcaagctcc gcctcctggg ttcaagcaat tctcttgcct cagcctccag agtagctggg 24001 attgcaggcg catgccacta tgcccgacta attttttgta tttttagtac agacagggtt 24061 tcactgggtt agccaggagt cttgatctcc gcctgcctca gcctcccagt atgctgggat 24121 tacaggcatg agccactgca cccagcctcc ttccaccttc ctaagccact tcccccagtg 24181 cccttcacca ggcttctcct tccacctggc ttcagcgacc caagccccta ggaaattgat 24241 ctcttgcctc cttttaagct catgctgcaa tctccactcc caatctctct aacaggctgg 24301 gtggagaagg agacatacta ctgaccccat tggaagaaga accaggcatg gaaagatggg 24361 gagctctgga ggtgttgtgg ggatggtgat ggagagggat ggaaataacg ggtggcattg 24421 ggtggcaaca tttggggaga gataatgagg caactcagca ggctaagttg cggggtatat 24481 aaattggggt gatgacccca tagacctaac tgtgaacaat cagattagac actatgtgtt 24541 agagtccccc cgaccagatc cttttccatc ccactccact atgttgtcta ttttttctga 24601 ggaattaagg gttaccccac cctgcccact cccatccctt caaccccact tcctactgta 24661 atagatcagc atccaaaagc aggaacccat ctaaaccaga aggaagccct ctcagatcac 24721 cccagcctca ctccatttcc cacttccacc cccgttagct tcctgcagga ctctatccct 24781 ggcttcccct tcagaccttg caatcacaaa aggttcttct ggtgagtgca agagcctgag 24841 actggaaaag gtggacttgt ctcccagtcg aggctggtaa gggaccttca gggagagctg 24901 ggcagacagg tgggagatgg aggtagggct ggctggagga aggaaacaac aaaggaagtg 24961 aggtagtgcc aatgacagga catttgacat gagtctccag atagatgtcg tggactccag 25021 ctctacgtcc cacattttag aataccccac cagcagaaca aactcagatc tcatcagggt 25081 agcagcagag gcaggaccag aaggcaatca agagcttcca gaaatgccac acttgtgtgc 25141 cacagagttc cccgctgacc cttggttagg ggtcctctta gtccacaagg tccggatgtc 25201 actcatgtac ttaataacac ttcaccttct gtaatactaa gtcctcagag ctccatgctg 25261 ttctgaaagg gatggccaca agttctttcc cagcctcttc cattcccttt cttttcatgc 25321 ccatcccgat gaacctgcat cattccccga cactgccaag ccaaccctgg aaaaggagtt 25381 cgctggccat tggctagaat cagggtggag aagttccctg aaccttcctg tctcccaggg 25441 acatgtatgc ttccagggac aagcttaggt catgaacatg gtcagaacct 
ttggacaaga 25501 ggaaaaatac taagagattt gctttttctg ggtgcggtgg ctcatgcctg taatcccagc 25561 actttgggag gccgaggcag gtggatcatg aggtcaggag ttcgaggcga gcctggccaa 25621 catggtgaaa ccctgtctct actaaaagta caaaaaatta gccagtcatg gtggcacacg 25681 cctgtaatct cagctactca ggaggctgag gcaggagaat tgcttgaacc tgtgaggaag 25741 aggttgcagt gagctgagat cgtgccatta cactccagcc tgggcgaaag ggtgagactc 25801 catctcaaaa aaaaaaaaaa tgatttgctt ttgacgtctt aggtggcagg gctgttccct 25861 ccaggcaaat gcccttcaaa ccgacgatca ttgtgcccac ttaccctggg ctggagagtt 25921 ggtttcaggt tcctacagga gatagctttc tttcccttac tccctatcta acacttttgc 25981 tctgcaggca gccttgccca ttctctaagc ctggcttaga aggcactggg aatgtcctgt 26041 agagagagac ctagataggt catgcaagtg agaaagacat ctgaggaaaa tggaagacct 26101 aaggcagaca ggaaggaagc acaaaagaca agcattgggt cagacccata aaccacctcc 26161 caaaggctgt catttcattg cactggaatt ttgctttatc agaagcaagg aagtaaggga 26221 gtcattgcct tgggcctggg aatctaagtg ggagacaata ttaatttgga tccgattaat 26281 tggagattac taactgtgga caaaagttta tctttgcaca atcaataaaa atggcatttt 26341 tttagtaaat taagagcata aacaatattg ctagaggtgg catgtttagt ctaccaaaaa 26401 caatactttt caggcacttt agaaatatcc ttttagaagc agcgagtgca tgggctaatt 26461 atcatcaatc tttatgtatt tgttaaagaa acatctacag gatctttatt ggtgaccttt 26521 tgtaagacat tagtttgagg tactacctat ctacttgaaa ataataaagt ggcatttctt 26581 tatgaaaaaa aaagaaatct cttccataat tcagatttct acactttata cttgcctccc 26641 tcctaaatcg tgatattgaa atatggtg // LOCUS HUMATPGG 15067 bp DNA PRI 28-OCT-1994 DEFINITION Human gastric (H+ + K+)-ATPase gene, complete cds. ACCESSION J05451 J05452 NID g561633 KEYWORDS (H+,K+)-ATPase; Alu repeat. SOURCE Homo sapiens (tissue library: of K.Motojima) liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15067) AUTHORS Maeda,M., Oshiman,K., Tamura,S. and Futai,M. TITLE Human gastric (H+ + K+)-ATPase gene. 
Similarity to (Na+ + K+)-ATPase genes in exon/intron organization but difference in control region JOURNAL J. Biol. Chem. 265 (16), 9027-9032 (1990) MEDLINE 90264383 FEATURES Location/Qualifiers source 1..15067 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="of K.Motojima" protein_bind 58..65 /bound_moiety="AP2" protein_bind 64..73 /bound_moiety="STM1" repeat_region 136..152 /note="D1 copy 1" /rpt_family="direct" repeat_region 218..234 /note="D1 copy 2" /rpt_type=direct protein_bind 350..354 /bound_moiety="NF-Y" repeat_region 375..388 /note="D2 copy 1" /rpt_type=direct repeat_region 388..407 /note="D2 copy 2" /rpt_type=direct repeat_region 394..416 /note="palindrome p1" protein_bind 440..445 /bound_moiety="STM4" repeat_region 607..888 /rpt_family="Alu" repeat_region 690..709 /note="palindrome p2" repeat_region 866..880 /note="palindrome p3" protein_bind 1042..1048 /bound_moiety="AP1" repeat_region 1060..1076 /note="D2 copy 3" /rpt_type=direct protein_bind 1105..1109 /bound_moiety="NF-Y" TATA_signal 1115..1121 misc_binding 1138..1142 /bound_moiety="cAMP" CAAT_signal 1144..1149 repeat_region 1213..1226 /note="palindrome p4" protein_bind 1328..1336 /bound_moiety="AP4" protein_bind 1339..1344 /bound_moiety="SP1" protein_bind 1360..1365 /bound_moiety="SP1" TATA_signal 1374..1380 exon <1433..1444 /note="(H+ + K+)-ATPase" /number=1 CDS join(1433..1444,1535..1678,1794..1853,2425..2628, 4131..4244,4448..4700,4990..5258,5871..6069,6376..6485, 6566..6700,7215..7407,7974..8149,9251..9387,9473..9623, 9729..9897,9987..10141,11233..11356,11880..12025, 13481..13614,13950..14051,14136..14227,14418..14446) /codon_start=1 /product="H+,K+-ATPase" /db_xref="PID:g561634" /translation="MGKAENYELYSVELGPGPGGDMAAKMSKKKKAGGGGGKRKEKLE NMKKEMEINDHQLSVAELEQKYQTSATKGLSASLAAELLLRDGPNALRPPRGTPEYVK FARQLAGGLQCLMWVAAAICLIAFAIQASEGDLTTDDNLYLAIALIAVVVVTGCFGYY QEFKSTNIIASFKNLVPQQATVIRDGDKFQINADQLVVGDLVEMKGGDRVPADIRILA 
AQGCKVDNSSLTGESEPQTRSPECTHESPLETRNIAFFSTMCLEGTAQGLVVNTGDRT IIGRIASLASGVENEKTPIAIEIEHFVDIIAGLAILFGATFFIVAMCIGYTFLRAMVF FMAIVVAYVPEGLLATVTVCLSLTAKRLASKNCVVKNLEAVETLGSTSVICSDKTGTL TQNRMTVSHLWFDNHIHTADTTEDQSGQTFDQSSETWRALCRVLTLCNRAAFKSGQDA VPVPKRIVIGDASETALLKFSELTLGNAMGYRDRFPKVCEIPFNSTNKFQLSIHTLED PRDPRHLLVMKGAPERVLERCSSILIKGQELPLDEQWREAFQTAYLSLGGLGERVLGF CQLYLNEKDYPPGYAFDVEAMNFPSSGLCFAGLVSMIDPPRATVPDAVLKCRTAGIRV IMVTGDHPITAKAIAASVGIISEGSETVEDIAARLRVPVDQVNRKDARACVINGMQLK DMDPSELVEALRTHPEMVFARTSPQQKLVIVESCQRLGAIVAVTGDGVNDSPALKKAD IGVAMGIAGSDAAKNAADMILLDDNFASIVTGVEQGRLIFDNLKKSIAYTLTKNIPEL TPYLIYITVSVPLPLGCITILFIELCTDIFPSVSLAYEKAESDIMHLRPRNPKRDRLV NEPLAAYSYFQIGAIQSFAGFTDYFTAMAQEGWFPLLCVGLRAQWEDHHLQDLQDSYG QEWTFGQRLYQQYTCYTVFFISIEVCQIADVLIRKTRRLSAFQQGFFRNKILVIAIVF QVCIGCFLCYCPGMPNIFNFMPIRFQWWLVPLPYGILIFVYDEIRKLGVRCCPGSWWD QELYY" intron 1445..1534 /note="(H+ + K+)-ATPase" /number=1 exon 1535..1678 /note="(H+ + K+)-ATPase" /number=2 intron 1679..1793 /note="(H+ + K+)-ATPase" /number=2 exon 1794..1853 /note="(H+ + K+)-ATPase" /number=3 intron 1854..2424 /note="(H+ + K+)-ATPase" /number=3 exon 2425..2628 /note="(H+ + K+)-ATPase" /number=4 intron 2629..4130 /note="(H+ + K+)-ATPase" /number=4 repeat_region 2908..3189 /rpt_family="Alu" exon 4131..4244 /note="(H+ + K+)-ATPase" /number=5 intron 4245..4447 /note="(H+ + K+)-ATPase" /number=5 exon 4448..4700 /note="(H+ + K+)-ATPase" /number=6 intron 4701..4989 /note="(H+ + K+)-ATPase" /number=6 exon 4990..5258 /note="(H+ + K+)-ATPase" /number=7 intron 5259..5870 /note="(H+ + K+)-ATPase" /number=7 exon 5871..6069 /note="(H+ + K+)-ATPase" /number=8 intron 6070..6375 /note="(H+ + K+)-ATPase" /number=8 exon 6376..6485 /note="(H+ + K+)-ATPase" /number=9 intron 6486..6565 /note="(H+ + K+)-ATPase" /number=9 exon 6566..6700 /note="(H+ + K+)-ATPase" /number=10 intron 6701..7214 /note="(H+ + K+)-ATPase" /number=10 exon 7215..7407 /note="(H+ + K+)-ATPase" /number=11 intron 7408..7973 /note="(H+ + K+)-ATPase" /number=11 exon 7974..8149 /note="(H+ 
+ K+)-ATPase" /number=12 intron 8150..9250 /note="(H+ + K+)-ATPase" /number=12 exon 9251..9387 /note="(H+ + K+)-ATPase" /number=13 intron 9388..9472 /note="(H+ + K+)-ATPase" /number=13 exon 9473..9623 /note="(H+ + K+)-ATPase" /number=14 intron 9624..9728 /note="(H+ + K+)-ATPase" /number=14 exon 9729..9897 /note="(H+ + K+)-ATPase" /number=15 intron 9898..9986 /note="(H+ + K+)-ATPase" /number=15 exon 9987..10141 /note="(H+ + K+)-ATPase" /number=16 intron 10142..11232 /note="(H+ + K+)-ATPase" /number=16 exon 11233..11356 /note="(H+ + K+)-ATPase" /number=17 intron 11357..11879 /note="(H+ + K+)-ATPase" /number=17 exon 11880..12025 /note="(H+ + K+)-ATPase" /number=18 intron 12026..13480 /note="(H+ + K+)-ATPase" /number=18 repeat_region 12797..12996 /rpt_family="Alu" exon 13481..13614 /note="(H+ + K+)-ATPase" /number=19 intron 13615..13949 /note="(H+ + K+)-ATPase" /number=19 exon 13950..14051 /note="(H+ + K+)-ATPase" /number=20 intron 14052..14135 /note="(H+ + K+)-ATPase" /number=20 exon 14136..14227 /note="(H+ + K+)-ATPase" /number=21 intron 14228..14417 /note="(H+ + K+)-ATPase" /number=21 exon 14418..>14446 /note="(H+ + K+)-ATPase" /number=22 protein_bind 14640..14649 /bound_moiety="STM1" repeat_region 14771..14783 /note="palindrome p5" polyA_signal 14835..14840 polyA_signal 14998..15003 BASE COUNT 3289 a 4381 c 4310 g 3087 t ORIGIN 1 ccagcccctc gctgctgctg cccatggagg tccctacctg agggatgagg agggagtccc 61 caggcctcct ggcatctgtc acccaacacc aagatcaagc caagaccctt ggacacatcc 121 tttccccatg ttgcaccttg gtttccccat ctagggtagg gacggggaat ggttgagagc 181 tgtaaagtct gtagctggat atgtgagtcc ctgtgagcct cagtttcccc ctctgcacaa 241 caggaatgat agttctgcct gcacagggtt ttttgcaaag acccgtgagg tcacacgtgc 301 agagcctggc atgtgtgtgg tgctgctgca gtgaagtttg ggtttttcca ttggtggtgc 361 catagaagag ctaggctctg gggtcaagcc ctggggtcaa gcccgggctg gaaccctgtc 421 cctaattctt agacgttgaa atcttctgca aagtccctga tgggggctaa gaagagaata 481 ttgctttaca gggctgctgt tagatgctaa ggaaaccatt tgagtgacac acagagcaca 541 cccagcccat agtgaatagt cagagttcag 
taaacattca gccaccatag agagtgttgt 601 aagcatggca gggcatggtg gctcacgcct gtaatcccag cactttggga ggccaaggcg 661 ggtggatcac ctgaggtcag gagttcgaga ccagcctggc caacctggtg aaaccccgtc 721 tctactaaaa gtacaaaaat tagctgggtg tggtggcatg tgcctgtagt cccagctact 781 caagaggctg aggcaggaga atcacttgaa cccgggaggc ggaggttgca gtgagctgag 841 attgcaccac tgcactccaa cctgggtgtc agagtgacac tccatctcaa aaaaaaaaag 901 aaaaaaaaaa agagtgtggg agcaccctcc tcttccagtt gatcaggcac gtgctattta 961 gactgaaaga ggacactctg cagccagctg atctggttct cactccatta tgtcctagcc 1021 tgtgatggag ccataggaga atgactcaac ccctccatgc ctcagtttcc ccatctgtaa 1081 agggtcttgt cccctcaaag ggttattgga aggttaaata agttagcaca tggaagccgt 1141 cagcacaatg cccagcgcat aatgagcacc ttatgtttgc tcttcttgtt gctgcagcgg 1201 ttcccagggg ctccctgatc tcagggcagg ttgacatggg gggatctggg caggggaggg 1261 gggggccagc ccaggaggga atgcctgagg tgcccaggcc tagaggtcac cccaaggagc 1321 ctgtaatcag ctgtggtcgg gcggttctgc tccaccccac cgccctccct gggtatatca 1381 ggactggtcc gggcccaggc tctgttgggt gggagcacag gcaccgggca ccatggggaa 1441 ggccgtgagt ggggcccagg ggtccggggt gcagtgggga caggggttgc agtggaatct 1501 gagttctccc tccctccatc ctgtccattc ccaggagaac tatgagctct actcggtgga 1561 gctgggtcct ggccctggcg gggacatggc tgccaagatg agcaagaaga agaaggcggg 1621 tggcgggggt ggcaagagga aggagaagct ggagaacatg aagaaggaga tggagattgt 1681 gagctctgac tggggagggg cgggagctgg agggtaggct gggaaccagg cagtgggcgg 1741 ggagggggcc aggagagccc agagcaagca ccctgcccca ccccaccaca cagaacgacc 1801 accagctgtc agtggcggag ctggaacaga aataccagac cagtgccacc aaggtgagct 1861 gggagactgg gagcgtacag aaaggggagg aggagagagg gagggagagg gagatggaag 1921 gagggagatg gagagaggga gagagaggga gagatgaaga gaggaagata tggagaaata 1981 gagagaggga gggaggcaga aaaggaaaga gggagggagg gagagaggaa aggatggagg 2041 gaggggagga gagagggagg gagatggaga gagggagaga gagggagagg gagagaaggg 2101 agaaagcatt agagagaaag ggtgacacac agagacggag agaggagtag ctgaaaagag 2161 acagcaagac agagaaggca gagacaaata aggagagagc agcaggagag agacacatag 2221 aaaggggagc gaggagagag agacaaaggg agagatgaag 
gagggaggga gagatggggt 2281 gggggcagga accaagagac aaagagatga gaaggagcag gacagaccaa gctggagaag 2341 ggaaactgaa gcagggggct ttccacgggg tgtgcatgtg gaggaaagcc cccaggaccg 2401 cctcgccctg cccctctgtc ccagggcctc tctgcgagcc tggctgctga gctgctgctg 2461 cgggatgggc ccaacgcact gcggccacca cggggcaccc cagagtacgt caagttcgcg 2521 aggcagctgg ccgggggcct gcagtgcctc atgtgggttg ccgccgccat ctgcctcatc 2581 gcctttgcca tccaggctag tgagggggac ctcaccaccg acgacaatgt gagccatgtt 2641 gtggggacag gctcaggaca tggggacctg gggatggaag acactgagcc tgggacttgg 2701 gacaagaggg gcagagaagg atggggatag acgaacacga aaggcagtct caagacaggg 2761 gtcatggagg gtgaaggaca ttgggatatg gatggggcag acccaaggac acaggacaca 2821 gggacactgg ggtgaatctt gggtatgggg acataggtgg gggcataaag agtatggcaa 2881 tgtgatacgt aagagatggg atttggaggc cgggcgtgta atcccatgcc tgtaatccca 2941 gcactttggg aggccgaggc gggcagatca cctgaggtca ggagttcgag accagcctgg 3001 ccaacatggc aaaaccccat ctctactaaa attacaaaaa ttagccagac atggtggcgg 3061 gcacctgtaa tcccagttac ttgggaggct gaggcaggag aatcatttga acccaggagg 3121 tggatgttgc agtgagccaa gaccatgcca ctgcactcca gcctgggcga cagagcgaga 3181 ctgcatctta aaaaaaaaaa aaaaaaaaaa aaaaagagat gggacttgga agcagaggac 3241 atgcagtgca gacacaggac agactgaggc acggggaata ggagaccctg ggactcagac 3301 ataggacacc agggcaggga cccaagcctc aggacacagc acatagagag acacaaaaca 3361 tgagacatag aagcacagag acagagagga aatggaggtg cagggcaagg gacattggac 3421 atatggcaga gatgagggca agggacatgg ggactcaggg acatgtgaca tgggacagag 3481 gctcgaaaag tcatggccat gggacacagg atgggaactc ggagatacac agacatggga 3541 gacacgggac acagtgactg gggagctaga acatgggggc cgtggccaca gaggacacag 3601 accatgacgt acaaggtctg cagcacacgc agggacgggg atgtgaggat gcagggacaa 3661 gggggcccag gggacacaca gagggtggca gcagattatg gaaaacccag gatgtgaaac 3721 tggagacacg gggaatacag gccatgggga caggaggaga catgggacag tggacatggg 3781 gaatgcaagg cttggggaca caggagaagg gaacacgtca gaccaaggca cacagtgatt 3841 tggggcacaa gtcacagata tggggacatg gaagattgag gattcagggc cagaggatat 3901 gaaatataca cagaaacaca agacactgag aagacagagc aggcgagtgt 
gggacatgag 3961 ggacatggga gggggctaaa ggctacaggg aatggtgcat ggagcttcct agtaggtggg 4021 gaaaagcatg gggaaaacca agaaaaccag ggcaccaggg agcaggtcgg tggcctcagg 4081 caggcagctt ccggcccctc tgaataaccc caccccttcc ctgtccccag ctgtacctgg 4141 caatcgctct cattgctgtg gttgtcgtca ccggctgctt tggctactac caggaattca 4201 agagcaccaa catcatcgcc agctttaaga accttgtgcc acaggtgggt tccccagcac 4261 ccagccactc cagatcccag agactccaaa ctgctcctgt agcctccttc catcaggccc 4321 ctgtccctca tctgtggcct ccatctctcc ccatgggtcc cagctctcct catcctttac 4381 ctccccacat gcagcagctc ccacccccac ccccgtccac ctcaactttg gtgcccctgc 4441 cccgcagcaa gccactgtca tccgcgatgg agacaaattc cagatcaacg ctgaccaact 4501 ggtggtgggc gacctggtgg agatgaaagg tggggacaga gtgcccgccg acatccgcat 4561 cctggcggcc cagggctgca aggtggacaa ctcctcgctg acaggggagt ctgagccaca 4621 gacccgctca cccgagtgca cgcacgagag ccctctggag acccgcaaca tcgccttctt 4681 ctccaccatg tgccttgagg gtctgtgaag cccccgcctg cctgcccagt aactgctgcc 4741 tccactggcc tctctcctgc tatcttctag gacagggggc ttcagcctcc ctgtctctct 4801 cctgcatgac tgccacctcc ccgtgactgc cacctcccca gggcacactg cttctctgtc 4861 tcccactgtc actaccctct cccaggtcac atcctccact ggcagtgact ggctgtcctt 4921 cagctcacac ggcagtgcac accccactgc cgccccctat ccctgagtcg cgccttggcc 4981 cccctgcagg caccgcgcag ggcctggtgg tgaacacggg cgaccgcacc atcattgggc 5041 gcatcgcatc gctggcgtcg ggggtggaaa acgagaagac acccatcgct atcgagatcg 5101 agcattttgt ggacatcatc gcgggcctgg ccattctctt cggtgccaca ttttttattg 5161 tggccatgtg cattggctac accttcctgc gggccatggt cttcttcatg gccatcgtgg 5221 tggcctatgt gcctgagggg ctgctggcca ctgtcacagt gagtgggggc aggcggggat 5281 gcagggagct gaggggcctg ctcaaccatc tgttctttcc tccacatttc ccccatcacc 5341 tccctgcctg tctatctgtc cacttagctc ccatccattc tcctgccttc ccatcatcca 5401 ctcattcagc aagtccttcc cgaggcaacc ctggggcaag tcctgtgcca cacattgagg 5461 gcgcagagat gacgaggccg ggtccttaac tggaagaggc tgatgtcctg tgcaggcagc 5521 cagcatgtca ccagaccttc cacagtcagc acccagctcc agaccagtcc ccagggaggt 5581 gcggggaagg ttcctcttct cattttgatt gtgtgaagag aattttcaga acctcggctg 
5641 ggacagctca gagaggggcc tcgctactgt atctgggcag actggggaag gagtcccaaa 5701 caaagcactg gctgaacaaa agctggggcc aagaaaacac ctggtgtttg cagaagcacg 5761 tgcacacgca gccccgcggc tggggtaaag ccttccccgt tctttgacgc ggggaagagg 5821 actggaagcc agggtgggtc cccagcctgc ggtgctcacc ccttccccag gtctgcctgt 5881 ccctgacagc caagcgcctg gccagtaaga actgcgtggt caagaacctg gaggcggtgg 5941 agacattggg ctccacttcg gtgatctgct cggacaagac agggactctc actcagaacc 6001 gcatgactgt gtcccatctt tggtttgaca accacatcca cacagctgac accacggaag 6061 accagtcagg tgtgggggag ggagggcaca ggcgccaggg caggatctgg tggatggtgg 6121 agacctgaag agggataggg aggtagggag ggctttgtac gaaggcgggg cggctcggct 6181 ccaggtctca aaggagactc gctaccgggt ctagagggaa gggtcatccg ggggctgggt 6241 ccctcagggt tgggtggaac ccaattccac ccaggtctgg ggagggggcc cgcgatatga 6301 gcctaggagg agggggctcg gttctggtcc gggagagccg ccgtgcggga cccagcctgg 6361 acgcctggtc tgcagggcag acgtttgacc agtcctcgga gacgtggcgg gcgctgtgcc 6421 gggtgctcac cctgtgcaac cgcgccgcct tcaagtccgg ccaggatgca gtgcctgtgc 6481 ccaaggtgag agccaggggg cctccaggga atcccgggag gctggtgtag ccggcttcgc 6541 ccctgacacc ggtccccgct cccagcgcat cgtgattgga gacgcatcgg agacggcgct 6601 gctcaagttc tcggagctga cgctgggcaa cgccatgggc taccgggacc gcttcccaaa 6661 agtctgcgag atacccttca actccaccaa caagttccag gtgcgcagcc ggccccgccc 6721 acggcctggg cccgcccacc tcgggccccg ccccaggcca cacccacatc agcgggccac 6781 gaggggcttc gctccttgcc ccttctctcc gcagccaaac cccaccccat ctcccgctcc 6841 gcctatggcc ccgtccacca gaggcaaggc cccgcccacc acccagcccc gcccacagct 6901 tctcctcggc gggactgtcc atggggagca cctcgctctc caaatccgcc cccagcccaa 6961 ctccgcaccc taattctgtt cattccctgc gtccaggact ccaaatctgc acctagcccc 7021 aacttaacgc ttaccttcta ccaaggaccc ctccacacaa ggcctccatc ttccaccttc 7081 acccttcccc gcagctctca gagcccgcac cgagctctcg gccccgcccc tctcctcagc 7141 cccgccccgt tccctcgtcc acagccccgc cctgttcccc cgtccacagc ctcgcctctc 7201 ccccccgccc acagctgtcc atacatacgc tggaggaccc gcgggacccg cgacacttgc 7261 tggtgatgaa gggcgccccc gagcgcgtgc tggagcgctg cagctccatc cttatcaagg 7321 
gccaggagct gccgctggac gagcagtggc gcgaggcctt ccagaccgcc tacctcagcc 7381 tgggaggcct gggcgaacgc gtgctcggtg agacccctgg gcaggggagg agactccagg 7441 tgcgaggagg agctagcaca gaccctgccc accactcccg gcccagccct gaccgtcctc 7501 atccctggct gctggctgat ttgtcgcagt ccacagcctc acccttgact ttccctgtac 7561 cagtcctgac ccctgtctgt cctcactttc taccttgacc ccagtcctca cctggacttt 7621 tatccgtatc ctgacctcta acctctccca atttttgtcc tgacgcctta aacttcgctg 7681 cctctaccct agaaagcaga tctgtggtcc aattctgggc cttttacttg ctgtgtgacc 7741 tctggcaaga aatgtcccct ctctgagcct cagtttcctc atctgtaaga tgagaatggg 7801 cctcacaggg tggttgtgag ggtagattca gataatgtga gtgaaggggt cagcacaagg 7861 tcaggcatcc ataggtgttt ggtgcaaggg gcctggttat tttacccatt tcctggccct 7921 gctaagaaag ggtgtgccca gggctcttcc ctgacttccc cgtttccggt caggcttctg 7981 ccagctctac ctgaatgaga aggactaccc gcctggctat gccttcgacg tagaggccat 8041 gaactttcca tctagcggcc tctgctttgc gggacttgta tccatgattg acccaccccg 8101 ggccaccgtc cctgatgctg tgctcaagtg tcgcaccgca ggcatccggg tatgccactg 8161 tggggaggaa caagtggagt gaggacagag gatggcagca ggatttgggt tgaagaaccc 8221 catgacaaac ccacctgaaa ctggcttgaa ccaaaatgca ttggtttatg atactgaaaa 8281 attctgggga attaagcttc aggtctagcc agatccagga gctcagtgtc attaggaatt 8341 ttcctttctc catgtctcag ctctgctgtt ctctttgctg gctttatttg caggcactcc 8401 cttcacttgg ggacacacac acccctgctc cagcagctca aacttaactc ccacctgctt 8461 agccacccca caaaaagaga gtgcctcttt tgcagtagca ctggattggg tcactcattt 8521 atccctgagc cagtcactgg ggctggggga tggaagacct tgattggcca ggtctgggct 8581 tacttaccca atcttagcca gggacctcac tgcaactgat ggtgtagggg agaggagtgg 8641 ttccccaaac acaaaacaag ggtgggaaag ccaaggccag gcaggcaatg cccaagccct 8701 tcccgtgtct ctacggccct gttacatcgg gccttgttac ctttctaagc ttgttcgtta 8761 gccctctttc cttgttcact cctctgcagt tatggtgtcc tccctgctgt tcctccgaca 8821 gttcgggcct gtcctcacct cagggccttt gcacctgctg ttctccctgc ccaggagcac 8881 tcttctccca gctttccacg tggctgactc tctcacctct tacaggtctc tgctcagata 8941 tcaccttctc aataagccct tccctgggta cacttaaaat cgagtcaccg tttccttcac 9001 caagcttaat 
atgttttttt ccactttcta acacactata taattcgttt ctttattttc 9061 tgtcaccctc actagagctc ctcaaggaca gggatttttg tctcttttgt ctctgctata 9121 tcccaggcac gtaggaggtg gtcacgcttg gtcaagtaaa tgaacacgtg tatcagcaga 9181 tgtctcccag ggaggacagg gagattgggc tgatctgaag gggtcagtga tccactggtt 9241 ccccctacag gtgatcatgg taacgggtga ccaccccatc accgccaagg ccattgcagc 9301 cagtgtgggc atcatctcgg aaggcagcga gacagtggag gacatcgctg cccgcctccg 9361 tgtgcccgta gaccaggtta atcgcaagta agccccccca gacctccccg gggttctccc 9421 cacatctggc acagaccgag gtcccagcga ggtcctccca tctcctcccc agggatgccc 9481 gtgcctgtgt gatcaatggc atgcagctga aggacatgga cccatcggaa ctggtcgagg 9541 ccctgcgcac ccaccccgag atggtgtttg cgcgcaccag cccccagcag aagctggtga 9601 tcgtggagag ctgccagcgg ctggtgagcc ctgctgatga gggtgggcag gcgggcagac 9661 aggcggggct ggtcttggac cggcctctca ctgaccaccc acccaccacc tgcacccact 9721 gcctgcaggg tgcgattgtg gccgtcacgg gggatggtgt gaatgactcc ccagctctga 9781 agaaggcaga catcggagta gccatgggca tcgctggctc agatgctgcc aaaaatgcag 9841 ctgacatgat cctgctggat gacaactttg cctccattgt gacaggcgtg gagcagggtc 9901 cgagccacgc cagggggagg gcaggcagtg tgggcacagg agggagaggg ctgtgaggag 9961 gctgaggtgc ccaccctacc ccacaggtcg actgatcttc gacaacctga agaagtctat 10021 tgcctacaca ttgaccaaga acatcccaga gctgacaccc tacctcatct acatcaccgt 10081 cagcgtgccc ctgcccctcg ggtgcatcac catcctcttc atcgaactct gcactgacat 10141 tgtaagtcca cagcccaggg tacccatcca cagggtgccc agagacactt gcttgcagat 10201 ggacaggagg gacagagacc accacagaca ctcttgggca cagagtgaca gacaagacct 10261 gtgtgtgcaa tgacagccag tgcaggacac atgggtggca ggcaggtgaa cagacctaaa 10321 tggacataca gccatgcaca cacaggcgga cactcaggta ggcccgggcc gacccatgca 10381 cacacatgac tcagatacag atactcagac agagatggaa acagaacagg tatctggcca 10441 aacttgaatc acagacacgt gcccctaaac agacacggac aaataaaccc cctcagatag 10501 accccatggc agacagacag aaaagaacca cagatacagt cacgaatgca gccagaggcg 10561 cagctgtgtt gacaggtatt tattcggtaa catttacaca gtgctttctc tgggccagac 10621 acaccatcat ttccttaatt ttgtactcat aaccacacta tgagatgagt tttgttacta 10681 tcacccgtac 
tttactgatg agctcatcaa ggcatagaca ggtagagtgg attgtcccag 10741 gtcactggca aataagtggc ccggctgtga cttgagcttg agcagtcaca ggtcccacag 10801 cctgtggtcg tggtcagtat gctctgctgc cactctaggc acaagcagat gaaggggcac 10861 ggcgacactg agagactgaa gatacacaga gatagggagt gacacagaaa tacacaagag 10921 cagatacagt cacagagaat tacactggag acagggacag agaaacagag tgaggaacag 10981 agaaagagag ggagggagac agagggagaa gataaaagca gatgttctta cgaagatggg 11041 caccaggatg ccaacaccca ggggccagtg gggcccaggc tgtccaggct ggtgtggctg 11101 tgccgtccct gcaggtcctg cctgctaacc tcagtcctgt tcctgctcac aagtggggga 11161 agggggaccc gaagtggagg ggacaagggc ccagccaaac cctcagtccc agttccttcc 11221 actcctggcc agttcccatc tgtgtccctg gcatatgaaa aggccgagag tgacatcatg 11281 cacctgcgtc cacgcaaccc aaagcgtgac agattggtca acgagcccct ggctgcctac 11341 tcctacttcc agattggtgg gtgcccgtgg gccggggagc ctggaggtgg ggggctctgc 11401 gctgggctgc tcgcctgccc caggccctgc cttgggctcc ggctggttct ggacctcagt 11461 gtctgcattt ggtcctgggt ctccgttgtc ctccgtgtcc ctctctgttt gtgacagtct 11521 ctctggttct gctgtgtccc tgctctctct ctgaccttgc tctccatgac cctgtctctg 11581 agttgttgcc tatttctgtc tttttgggtc ctggatgtgt ctttttctct ctgtccctgc 11641 ctctgtctct ctgtccctgc ttctctctct gcctctgtgt ctctgtcact gtttctgtgt 11701 ccctggctct gtgtgtcctt gaccttgtgt atctgtgtcc ctgtctctct tcccgtgtcc 11761 ctgtctgtgt gtccttgacc tttgtgtatc tgtgtccctg tctctccatg tccctctctt 11821 cctgtgtccc tgtctctgtg tccatctctg tctccctgtg tccccttgtc tctctccagg 11881 tgccattcag tcctttgctg gcttcactga ctacttcacg gcaatggccc aggagggctg 11941 gttcccactg ctgtgcgtgg ggctgcgggc gcagtgggag gaccaccacc tacaagatct 12001 gcaggacagc tacggccagg agtgggtgag ccctgccccg ccccatccca tccctgggct 12061 ccctccttcc cagggcaact tgccttgtgt gcggcccact tgttcctcca cgtgtgactg 12121 agttcagcca gagcatcccg gacccccaga ctccctgcca gttcgggggg gcaagggcct 12181 ttcccaccat ctccagctcc aaactcctgc ttctgatgat cgcaactgca gatgggtgtc 12241 tctgccctga cactaatgca tcatctcaga tgtggaacca ggcccctatt ctgctatttt 12301 ccctacagca tcactcccac agggtatttg cacatgtggt ctattttaat gaaggtttgg 
12361 agcacagtga ggctagatct ggccagcaga tgtgtttagt tcacctgtag agttacatac 12421 agacgaggat acttccctta aacatttact atcatggtta aggtagacac tgaagccaga 12481 ctgcctggac ttatggtctg gtactgcttc tcacaagctg tgtaaccctg ggcaggttgc 12541 tttgcctctc tgtgcctgtt ttctcacctg caaagtggag agaatatcaa agtctacctc 12601 agtggttttc agtttgctcc cagaccagca gcatcaccat ctcctgggaa catgtttgca 12661 gtgcaaattc tcaggctcta tcccagccct ctgggagccc ttgctgggat tctcatgcat 12721 gttcaagttc cagcaccact gatttacctc ctagggttac ccagcggatt aaatgagcta 12781 atgcacgtgc acactcggca cctggagcag tcagtgatcc ccacagttag ctggtattag 12841 gagttgtgtt gccaggcatg gtggcacatg cttgtaattc cagctacttg ggaggctaag 12901 gcaggagaat ctcctgaacc tgggattcgg aagttgcagt gagctgagat tgcaccactg 12961 cactccagcc tgggtgacag agcgagattc catctcaaaa aaaaaagtag ctgtgttgaa 13021 gaaatgggct gcatagtggc tgcccccttt aaatggggca tctgccctct tttcccaaag 13081 gcccgcggtg actgcctcac ctatttttgc catctgcctg gcccctggag tcactggact 13141 ttgcaaatcc catctgagga tttcaattcc cctttcccag gccctttctt ttgggtgcct 13201 cccccctttc actatgaaag cagtcggccc agcctacagt tttcccggcc actgtcatcc 13261 aagctctgct aggctgaggt ttctgggccc cacgttctgg ggtcctgcac acctggcaga 13321 ccctgcctct ctccttcccc ccacccccac actctgatgc aaggcagaga gagcatttag 13381 gggttcacgg ccctgggacc ctatgcccgg tgggctgcag aggctctccg ccgggactca 13441 ggctggactc aggctgtttc cgtttgcccc tggcctgcag acattcgggc agcgcctgta 13501 ccagcagtac acctgctaca ccgtgttctt catcagcatt gaggtgtgcc agatcgccga 13561 tgtcctcatc cgcaagacgc gccgtctctc tgccttccag caaggcttct tcaggtgtcc 13621 ctggctggcc ctcttcctgc cctccaaccc ctggtcccaa ctcccatcag atcctgactg 13681 tggtttagaa cagccggcaa atgccagcgg gtgccagtgc cagatggtga catgccaaaa 13741 aaacctttct aactggccaa taactccccc cacggccccc tccacatttc actaaagggg 13801 tggtgctgag aggcccgggc cactccagtg ttcagtgtgt agaacacctc cctgctgctg 13861 ctgacatctg acctactttg acccacagta tcctgatttc catcacgtcc ccaattccta 13921 atggcctgtc ccattctgcc cacccccagg aataagatcc tggtgatcgc catcgtgttc 13981 caggtctgca tcggctgctt cctgtgctac tgccccggca 
tgcccaacat cttcaacttc 14041 atgcccattc ggtgagaggc tgtgaggggc aaggacaggg agggggctgt agggcacagg 14101 cctgggatgg ccttcacctt gccatgtgcc ttcaggttcc agtggtggct ggtccccctg 14161 ccctacggca tcctcatctt cgtctatgat gagatccgga agcttggagt tcgctgttgc 14221 ccagggagtg agtgtggaga ctgggagcct gggggcaggg gactgaaacg tttgagcagc 14281 tgcacccacc tttgatcctc aacacttggc ctctaagccc ttgaccctcc atagcaacca 14341 gcactgacct ctgagcccct ggccctgcct aagtggcacc cccagcactg agtctccttt 14401 ctccctctct ctcccaggct ggtgggacca ggaactctac tattagaggg acgactgcct 14461 tcaagcatcc ctgcaactgc cacagcaggt gggggcaggg ctcgtgggac cctctggaca 14521 gccaccaaga tatctgagca accaagagtc ccagccccac cagtatctgc ttctgtagcc 14581 cacggcaccc caaacttgga gggacctgcc cactcccctc ccccattccc aaggttcgca 14641 cctcctggag cagcagcgcc tgggcagtcc tctgggctgg cctcgggaaa ccgcactgtg 14701 gtggcggtgg ggctctgaca gggagtacag ctgaccgctt ctggagggtg tttctgttct 14761 taggactcca gtccaggctg gacggctgcc tgagggccct tcgttaaaga cacgcttgtg 14821 tcctgggcga tggtaataaa accagctcat gctgactgtg ctgtatctgg tgccaggcac 14881 tgtcctcagc gtctctcatt agtgcttgtg atctcccaca gtcctgtaca tgggggctgt 14941 ttatccaggg ctgcacctcc agtgcctagt acacagcctg gcacatagta ggtcctcaat 15001 aaacctgcct tggggaaatg attcctattc ctttgcccag ggtcatggca tctaaacgca 15061 agctggg // LOCUS HUMATPSAS 18004 bp DNA PRI 01-MAY-1997 DEFINITION Human gene for ATP synthase alpha subunit, complete cds (exon 1 to 12). ACCESSION D28126 D14706 NID g559316 KEYWORDS ATP synthase alpha subunit. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 18004) AUTHORS Akiyama,S. TITLE Direct Submission JOURNAL Submitted (29-JAN-1994) to the DDBJ/EMBL/GenBank databases. 
Shuichi Akiyama, Showa University, School of Dentistry, Department of Biochemistry; 1-5-8 Hatanodai, Shinagawa, Tokyo 142, Japan (E-mail:sakiyama@dent.showa-u.ac.jp, Tel:03-3784-8163, Fax:03-3784-5555) REFERENCE 2 (bases 1 to 18004) AUTHORS Akiyama,S., Endo,H., Inohara,N., Ohta,S. and Kagawa,Y. TITLE Gene structure and cell type-specific expression of the human ATP synthase alpha subunit JOURNAL Biochim. Biophys. Acta 1219 (1), 129-140 (1994) MEDLINE 94368840 FEATURES Location/Qualifiers source 1..18004 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_unit complement(551..856) /rpt_family="Alu" repeat_unit complement(1341..1678) /rpt_family="Alu" repeat_unit 1921..2206 /rpt_family="Alu" misc_signal 2257..2268 /note="putative factor/enhancer sequence: Z-DNA" CAAT_signal 2399..2403 misc_signal 2423..2430 /note="CS1" misc_signal 2425..2430 /note="putative factor/enhancer sequence: PU BOX" CAAT_signal complement(2489..2494) misc_signal 2607..2614 /note="putative factor/enhancer sequence: SV40 enhancer core" misc_signal 2680..2687 /note="putative factor/enhancer sequence: AP-2" misc_signal 2790..2798 /note="putative factor/enhancer sequence: GFII/MLTF" misc_signal 2807..2815 /note="putative factor/enhancer sequence: Mt4-like sequence" misc_signal 2823..2830 /note="CS2: nuclear factor-binding site" misc_signal 2852..2862 /note="putative factor/enhancer sequence: OX BOX-like sequence" misc_signal 2863..2871 /note="putative factor/enhancer sequence: Sp1" misc_signal 3055..3062 /note="putative factor/enhancer sequence: AP-2" misc_signal 3090..3097 /note="putative factor/enhancer sequence: GFII-like sequence" misc_signal 3101..3107 /note="putative factor/enhancer sequence: GCF" misc_signal 3109..3119 /note="putative factor/enhancer sequence: NRF-2-like sequence" misc_signal 3152..3159 /note="putative factor/enhancer sequence: Mt3-like sequence" misc_signal 3165..3172 /note="putative factor/enhancer sequence: Mt1-like sequence" misc_feature 3252..3269 /note="G+C rich 
region" misc_signal 3283..3290 /note="putative factor/enhancer sequence: AP-2" misc_signal 3295..3302 /note="putative factor/enhancer sequence: EST-1" misc_signal 3303..3311 /note="putative factor/enhancer sequence: GFII" misc_signal 3310..3316 /note="putative factor/enhancer sequence: GCF" misc_signal 3342..3349 /note="initiator" 5'UTR 3355..3408 exon 3355..3468 /number=1 misc_signal 3367..3374 /note="CS3: nuclear factor-binding site" misc_signal 3388..3395 /note="putative factor/enhancer sequence: AP-2" CDS join(3409..3468,6478..6556,9759..9928,11614..11787, 11878..12044,13352..13500,14117..14268,14377..14601, 15115..15222,15352..15496,16954..17104,17245..17326) /standard_name="F1-ATPase alpha subunit" /codon_start=1 /evidence=experimental /product="ATP synthase alpha subunit" /db_xref="PID:d1006217" /db_xref="PID:g559317" /translation="MLSVRVAAAVVRALPRRAGLVSRNALGSSFIAARNFHASNTHLQ KTGTAEMSSILEERILGADTSVDLEETGRVLSIGDGIARVHGLRNVQAEEMVEFSSGL KGMSLNLEPDNVGVVVFGNDKLIKEGDIVKRTGAIVDVPVGEELLGRVVDALGNAIDG KGPIGSKTRRRVGLKAPGIIPRISVREPMQTGIKAVDSLVPIGRGQRELIIGDRQTGK TSIAIDTIINQKRFNDGSDEKKKLYCIYVAIGQKRSTVAQLVKRLTDADAMKYTIVVS ATASDAAPLQYLAPYSGCSMGEYFRDNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGR EAYPGDVFYLHSRLLERAAKMNDAFGGGSLTALPVIETQAGDVSAYIPTNVISITDGQ IFLETELFYKGIRPAINVGLSVSRVGSAAQTRAMKQVAGTMKLELAQYREVAAFAQFG SDLDAATQQLLSRGVRLTELLKQGQYSPMAIEEQVAVIYAGVRGYLDKLEPSKITKFE NAFLSHVVSQHQALLGTIRADGKISEQSDAKLKEIVTNFLAGFEA" transit_peptide join(3409..3468,6478..6546) misc_signal 3503..3510 /note="putative factor/enhancer sequence: AP-2" misc_signal 3526..3533 /note="putative factor/enhancer sequence: GCF" misc_signal 3555..3562 /note="putative factor/enhancer sequence: AP-2" misc_signal 3637..3642 /note="putative factor/enhancer sequence: CRE" misc_signal 3730..3735 /note="putative factor/enhancer sequence: GCF" misc_signal 3765..3772 /note="putative factor/enhancer sequence: AP-2" misc_signal 3809..3814 /note="putative factor/enhancer sequence: PU BOX" misc_signal 3952..3957 /note="putative 
factor/enhancer sequence: PU BOX" misc_signal 4168..4173 /note="putative factor/enhancer sequence: AP-3" misc_signal 4180..4188 /note="putative factor/enhancer sequence: C/EBP(AP-1)" repeat_unit complement(4632..4926) /rpt_family="Alu" repeat_unit complement(5118..5406) /rpt_family="Alu" repeat_unit 5488..5816 exon 6478..6556 /number=2 repeat_unit complement(6907..7211) /rpt_family="Alu" repeat_unit complement(7491..7807) /rpt_family="Alu" repeat_unit complement(8448..8746) /rpt_family="Alu" repeat_unit complement(9383..9708) /rpt_family="Alu" exon 9759..9928 /number=3 exon 11614..11787 /number=4 exon 11878..12044 /number=5 exon 13352..13500 /number=6 repeat_unit complement(13715..14010) /rpt_family="Alu" exon 14117..14268 /number=7 exon 14377..14601 /number=8 exon 15115..15222 /number=9 exon 15352..15496 /number=10 repeat_unit complement(15642..15975) /rpt_family="Alu" exon 16954..17104 /number=11 exon 17245..17458 /number=12 polyA_signal 17440..17445 polyA_site 17458 repeat_unit complement(17602..17901) /rpt_family="Alu" BASE COUNT 4525 a 3795 c 4072 g 5612 t ORIGIN 1 attacaggtg tgagccacca tgccggctaa ttttgttgtt attgttgttg ttgttttgag 61 atagtctcac tcctgtcgcg catactgaag tgcagaggcg caatcttggc tcactgcaac 121 ctccacctgg ctcactgcaa ccttcacctc ctgggttcaa gcgattctcc ttcctcagcc 181 tcctgagtag ctggggttac aggtgtgcac caccaccccc agctaatttt tgatttttta 241 gtagagacgg ggttttgcca tgttggccag gctggtcttg aactcctgac ctcaggcgat 301 ctgcctgcct tggcctccca atgtgctagg attacatgcg tgagccacca tgcccggcct 361 tgttgttttt agaaaccata ttttttcctg ttagcttgtg aaagttgatt gggtcattct 421 tgtcatcccc aactaaaaca gtgtcaagag gtaaagggaa aaaaaagcac tcagggcaca 481 taacattgct ccaaaaatgt aattatctgt aagcctggct gctgaaactg cccgctataa 541 cctaaaccag tttttttgtt tgtttgtttt gagatggagt ctcactctgt cgcccaggct 601 ggaatgcaat ggcatgattt tggctcactg caacctctgc ctccaggttc atgtgattct 661 cctgcctcag cctcccaagt agctgggatt acaggcaccc gccaacatac ccagccaatt 721 tttgtatttt tagcagagac ggggtttcat catgttggtc aggctggtct tgaactcctg 781 
acctcaggtg atccacccgc ctcggcctcc caaagcgctg ggattacagg cgtgagccac 841 cgtgcctggc tgacctaaac cagttttatc taattagctg ctaaaaacaa cctgcagtga 901 ctctaagaat tgttttacct agttgccatt acttgccaat caaaacttgc cagctcccca 961 aaaccttact agtgacaatg aactttctta aagaacaata tacaacagtt ctcttttttt 1021 gttgtttgtt ttgatatagg gtcttgcttt gttgcccagg ctggagtgtt ccggtgtgat 1081 cacagctcac tgcactccca aactcctggg ctcaagtgaa cctctcatct tggcttccca 1141 acgtgttggg attacaggca tgagccactg ttcctggcta tatttctctt ttacaaaaat 1201 aaaacctcta accatctgtt tgttcttctg acgtaccaaa gaccaccctg tttgtgtgta 1261 tgccctgaat ggcaattatt gcttcccaaa tgttttaaat ttagagattt gtctctgtat 1321 tttattttac ttggacagcc tttttctttt tcttttcttt ttttctttct tttttttttt 1381 ttttttttag atggagtttc acttttattg cccaggctgg aggacagtgg tgcaatctgg 1441 ggctcactgc aacttctgcc tcccgggtta aagcaattct cctgcctcag cctccccagt 1501 agctgggatt acaggtgccc gcccccatgc ctggctaatt tttgtatttt tagtagagat 1561 ggggttttgc catgttggcc aagctggtct cgaactcctg acctcaggtg atccacccgc 1621 ctcagcctcc caaagtgctg ggattacagg cgtgagccac tgcgcctggc ttggacagcc 1681 tttttgtacc cctgattccc aggagaactg ctgctctatt ttcccatggc ttaccttgaa 1741 aagagtagga tactagagaa aaggatggct aatcacgtgt ttacaagcca tgtttcagga 1801 gatgaccctt gtagcccgga tatttatttg cctcaccaaa attcaggttc taggtgtaag 1861 ccaggatttg ggggatggaa ctgaaaatgc ttctccaaac ggtgaaaagc gggcccagcg 1921 cagaggctga ggaggaagga tcacttgaag ccaggagttc gagaccagcc tgggcaacat 1981 agagagacct ccccccacgc cctgcccacc accgccgtct ttacaaaaaa attagccagg 2041 catggtggtg ggaccacagg ggcacaggct gaggtcagag gattgctcca gcccaggagt 2101 tccaggctgc aatgaatggt gataaaccca ctgcactctt tcctgggcaa cagagtgaga 2161 ccctgtctta aataaaataa aataaataaa taaaaaaaaa agaaaagtga aggcaacgtt 2221 aataagtgtc cactatgaca taataataac cagcatcaca cacacacacg aatcttgcat 2281 atatcacaaa aatctcagta tggtaattct gtggagaaaa aaaaaaagcc catccaggta 2341 agcatagaaa gtgttttttc atcatctccc tttgactcca aatgaagaag cagagttccc 2401 aatagcatga actagatgtt ggcagaggaa aaagcgagtc ctcgtcctat cctggtgtaa 2461 acttctgatt 
ctggacaatt ctcagtagga ttggcaatga ctcttctagg cctccttgtg 2521 attgttgata tatttttttt aatttaaagt tccggaatac atgtccagga cgtgcaggtt 2581 tgttacacag gtaaatgtgt gccatggtgg tttgctgtaa ctatcatccc atcacctagg 2641 tattaagctc cgcatgtatt agctatttat cttgatgctc ccctcccccg gttatcagga 2701 tttttaaaca gtttgctcta attcactgac aggattcttt tagaatgaat ctaaaaaagc 2761 tgggaatctt taacctgagc gcagatcgga tcacgtggaa cggtgctggt gtttgttaaa 2821 aattcagatt gccccgaatt ttcgactcag agggtctgaa gtggggctaa gatctgcaga 2881 ttaacgggct ccgtgggagg ctccctgctg gtggccagtc ccggggttta gctgttcact 2941 gttgctactc gtgctaaacc acggtctaac actaccgaca ctattaatcg ccctggtcct 3001 caagacagcc gcgaccccag acgggtgagg ccctcatcct ctagcaccag atggggcctg 3061 ggaactgctg cttagcagat tcagctaccc acgtggtcac ccggggcaac cacaaggtcg 3121 actcaaaaga ccaaaataaa tgtataacga atgccaggac caaagccgga atatatctaa 3181 tcattcattt tgaggtttaa acttaaaaat gttacataaa aaaataagta acctgccaaa 3241 aaatgcagag tggcggcggg ggcgggggga gaggtggtga acgcgagggg cagtacttcc 3301 gggtcaggtg ggccggctgt cttgaccttc tttgcggctc ggccattttg tcccagtcag 3361 tccggaggct gcggctgcag aagtaccgcc tgcggagtaa ctgcaaagat gctgtccgtg 3421 cgcgttgctg cggccgtggt ccgcgccctt cctcggcggg ccggactggt gagcaccgaa 3481 ggccggcatg atgcaggcgg ccgggtgggg ctgcagggtg gtggtgcgcc ggctcgggcg 3541 ctctctgcag gagggcgagg ggctgtggga tgccgccatc ttgcacccgt ggcttctccg 3601 gctggacaga gcaggcgaca caggtgccct tttgctcgtc acctgcgcag aggcagaatg 3661 gtacagggca gacagttaac tcgatggtgt ccagagacag ggcctcaaga ttcctgcctt 3721 cggctgacag cggccctaga agggggatct tgggtgaagg tcagggcttg ggcgctagct 3781 ctccgaggcc tgttctgaat cggtgaggtt cctctctttc gattcttagg gtcgcgtccc 3841 gctgaaggtc agaaaactgc tgtcagctct tgttgaagct agtcagtgga gtctgacatt 3901 gtgtgaaggt cagcaggccc agcatttcac ctctgcctgg agctcagttt agaggaaagg 3961 aataaatgaa ggtggttttt ggaatactta cgtaggcttt ctcttagtgc tgaatacgaa 4021 acgaccgtga aaacgtcctg caggtgcaaa gaaccatttc atgtggccag cactgtgaca 4081 acagtgaagc tggtttctaa attgtctctc cggcagagtg tgtgctttgt ggcacttgga 4141 cgtgtttaaa gaatttgtcc 
ttgagttttc cacctggttt tgagtaattt ttagtgatgg 4201 agaattcaag taaaagagaa caaggttgga aaatggctgt aaggattaat acccttcgtt 4261 atttgtttcc cttgaaatta acatgatttc agtgtaaacg cattgacact gacacttttt 4321 ttttgttcta aaagtaatct caatttgaaa ggtgaaataa acatgttggc tgttttgctg 4381 ccttaacaac cgctggtagc atcttgggtt tctgttaaca aaaaagcatt tttaacggtt 4441 ttcatctagc tcttatggaa ggtagaggct tcagaattga cagtgaccag tcgcagtatt 4501 acaacttgaa ggagaaactg aaggaagttc caggtggctg caggatttgt tgagagagta 4561 aaataagtcc acatgccaca actcttggtt tctaatctaa ctctagcatt tattcagttt 4621 ttgtttgctt gttttttttt tttttttttt tttgagatgg tgtctcgctc agtcgcccag 4681 gctggactgc aatggcgtga tctccgccca ctgcaaccac cgcctcccag gatcaagtga 4741 ttctcccacc tcagcctccc gagtagctgg gattacaggc acccgccgtc atgccctgct 4801 actttttgta gtttttcgta gagacggggt ttcaccatgt tggctaggct ggtcttgaac 4861 tcctgacgtc aggtgatccg cccgcctcgg ccttctgaag tgctgggatt acaagcatga 4921 gccgtcgtgc ccggccgctt gttgtcgttg ttgttgttgt tgtttttaag aaaattggaa 4981 ctactttgag ttttgcccct tgtaggaagc tgtttatctt agaattcctg caactccatt 5041 aaaatacatc taaagtcaaa ataataaaaa ttgggtacct ttaaagggaa agacacttac 5101 attttgttaa cagttgcttt tttttttttt ttttgagacg gagttttgct tttgtcgccc 5161 aggctggagt gcagtggcgc gatcttggct caccacaacc tctgcctccc aggttcaagc 5221 gattctcctg cctcagcttc ccgagtagct gggattacag gcatgtgcca ccacgcccag 5281 ctaattttgt atttctagta gagacggggt ttctccatgt tggtcaggct ggtctccaac 5341 tcccgacctt aggtaatccg cccgccttgg cctcccaaag tgctgggatt acaggcatga 5401 gccaccgtgc cctgccaata gttgcttttt aaaatactga ctgccaggcc gggtgcagtg 5461 gctgacgtct gtaattccag cactttggga ggccaaggtg ggtgcatcac ctgaagtcag 5521 gagttcgaga tcaacctggc caacatggcg aaaccctgtc tctactaaaa ataaaaaaat 5581 tagccgggca tggtggtggt cgcctgtagt cccagctact caggaggccg aggcatgaga 5641 atcgcttgaa cctgggaggt gaaggttgca atgagccaag atcgtgcccc tgcactccag 5701 cttgggtgac agaatgagac tccgtctcag aaggaaaaaa aaaatgtgtg tgtatatata 5761 tatttactgc ccagtctggg caatatgatg aaatcctgtc tctataaaaa aaataagtac 5821 aaaaattagc ctggccaggt ggcgcactcc 
tgtgatccca cctactgggg aggctgaggt 5881 gggaggatca cctgagccca ggaagtcgag gctgtagtga gccgtgattg agccactgca 5941 ctccgacctg ggtgacaaga gtgagaccct gtcccaaaaa agaaaaaaaa ataagccggg 6001 tgcggtgctc acgcctatag ttctagcact ttgggaggcc aaggcgggtg gatcacttga 6061 ggtcaggagt tcgagaccag cctggccagc atggtgaaac cccgtctcta ctaaaaatat 6121 aaaaaattaa cctggtggag gtggcgcacc cctgtagtcc cagctactcg ggaggctgag 6181 aatcgcttga gactaggagg tggaggatgc agtgagccgt ctcaaaaaaa caataataat 6241 aaagtacaga ctgcatttta aaaatatgcg agtgtttact gtgtactagg agctgtataa 6301 aaggttttca catatatcaa aagctacttt cccatttgac attttaaaca aaatgcttta 6361 tatgtaatta gcattttcac cagtactttg atcacactaa gctggcattt gtttaagctt 6421 gtaagtttta taaaggcttt gttaaaaatc aaatttttaa gaatactttt gcactaggtc 6481 tccagaaatg ctttgggttc atctttcatt gctgcaagga acttccatgc ctctaacact 6541 catcttcaaa agactggtaa gttattattt ctcagtctac gccgcactta ctagatgaag 6601 atataaatta catacatcgt ataactgtgg tgagttgttt ttcatttgtt agttttttct 6661 ctagtttgta tagactctca ggttcttaaa ccatattgtt accatttagc cctgtcatgg 6721 ttctcttgtt atagtcaaat gaactactgt tacatatttt tcttttgata ttagtatggc 6781 ttctttgttc tggtaatggc ccttgggtag aaacagaaag aacatacact aggtaatgga 6841 atctttgtga cctctggaag acttcctttt tccttttcta aaaggcatta gaaaaaatgt 6901 aagttcttat tgataggttt tcttattctt tttctttttt tttgagacag aatcttgctc 6961 tgtcgccagg ctggagtgca gtggcaagat ctcggctcac tgcaacctct gtctcctgag 7021 ttcaagcaat tctactgcct cagtctcctg actagctggg attacaggca tgcaccacca 7081 tgcccggcta attttttgta ttttagtaga gacaaggttt caccatgttg gccaggatgg 7141 tctcgatctc ctgaccttgt gatatgcctg cctcggcctc ccaaagtgct gggattaaag 7201 gcgtgagcca ctgcaccggt ccactgatag gttttatttt ttcaaaggca gtagctacta 7261 gatatttggc aatctgtgaa cttgcacatt acagatgcag tagagaagtg gggaggcaag 7321 cataggaaga cactataagt tggggaattt ctttggagat gagtatttgg acttttaaaa 7381 taaaggttaa atatatatgt gtgtgagagt gtgtgtgtat atatgtgtgt gtgtgtgtgt 7441 gtgtgtgtgt gtgtgtatgt gtacacacac ccccacgttc agcatatatg tttttctttt 7501 ttcttttttt tttgagatgg agttttgctc ttgttgccca 
ggctggagtg cagtggcatg 7561 atctgggctc actgcagcct ctgcctcacg ggttcaagtg attctcctgc ctcagcctcc 7621 cgagtagctg ggattacagg tgcccaccac cacacccagc taatttttgt attttttgta 7681 gagacaaggt ttcaccatgt tagccaggct ggtctcgaac tcctgacctc aggtgatcca 7741 cctgccttgg cctcccaaag tggtgggatt acagacatga ggcactgtgc gtggcctcct 7801 tttttttgag actctgttgc ccaagctgga gtgcagttgc gccatctcag ctcactgcaa 7861 cctccacctc ccaggttcaa gtgattctcc tgagtcagct tccctagtgg ctaggattac 7921 aggcacgcac caccacacct ggctaatttt tgtattttta gtagagatag agtttcgcca 7981 tgttggcctg gctgatctca aagccctgac cccaggggat ccgtctacct tggcctccca 8041 aagtgctggg atgttttttt attttgtttt gttttgtttt ttgagacaga gtttcgctct 8101 gttgcccaag ctggggtgca gtagcgtagt cttggctcac tggaccctcc acctcctggg 8161 tttgagtgat tctctcctgc ctcagcctcc cgagtagctg gtattacaag catgcaccac 8221 cacagctggc tatttttttt tttttttttt ccttagtaga gatgggattt tgctgtgttg 8281 gccaggctgg tccaaactct tggcctcaag tggtccgccc gcctcccaaa gtgctggtat 8341 tactggcgtg agccactgca cccaacccct gtatgctaag atgaagaata ccacctatac 8401 taccctgtgt gccttgaaag agaaagacac actttgttaa cagttgattt ttatttttta 8461 ttttttattt ttttgagata gagtcttgct ctgtctccag gctggagtgc agtggggcta 8521 tctcagctca ctgcaacctc cttctcccag gttcaagtga ttctcgtgcc tcagccaccc 8581 aagtagctgg gattacaggc atgcaccacc atgcccggca aacttttgtg tttttagtag 8641 agatgggggt ttcaccatgt tgcccaggct ggtctcgaac tcctggcctc aagtgatcca 8701 ccccctttgg catcccaaag tgctggtttt acaggcatga tctaccgtgc ctggccaaca 8761 gttggttttt aaaatatgtt tttattgaat ttagccaaaa attcaggttt taacatagtt 8821 acagatttct tagaaattaa aggagaacat ttattttaag ggttttactt aatttgatta 8881 gcttcttgtc tggtttacat ttgttttgtg tggacttttt tttccttaat cacttggttt 8941 tattttcata gcttttaaaa atagaatctc gtgttacagt gtagaatata tatatataat 9001 ttttgtgtgt gtgatagtct cactttgtca cctatgctgg agtgcagtga cacaatcttg 9061 gcttactgtg acttccacct cccaggttca agcagttctt ctgtctcagc ctcctgagta 9121 gctgggacta caggcatgcc cgtgcccagc taattttttt tttattttta gtggagacgg 9181 ggtttcacaa tgttggccag gctggtctcg aactccttac ctcaagtgat 
ctgcctgcct 9241 cagcctccta aagtgctagc attaccactg tggctagcgt gagccactgt gctgggtctg 9301 tagaatatat ttggattatt atgatcacaa aactagtttt ttgttgttga gaaatgggct 9361 ttgaaaagat ccgaggacca gatttttatt attattatta ttattattat tattattatt 9421 attattattt tgagatggag tttttgccct tgttacccag gcttgagtgc aatggtgtga 9481 tttcagctca ccgcaacctc cgcctcctgt gttcaagcga ttcttctgcc tcagcctccg 9541 cctcccgagt agctgggatt acaggcatgt gccaccatgc ccagctaatt ttgtattttt 9601 agtagagaca gggttctcca tgttggtcag gctggtctca aactcctgcc cttaggtgat 9661 ctgcctgcct cagcctccaa aagtgctggg attacagctg tgagccactg tgcctggccc 9721 agattatttt tttaattgaa ttgaattgtc ttccataggg actgctgaga tgtcctctat 9781 tcttgaagag cgtattcttg gagctgatac ctctgttgat cttgaagaaa ctgggcgtgt 9841 cttaagtatt ggtgatggta ttgcccgcgt acatgggctg aggaatgttc aagcagaaga 9901 aatggtagag ttttcttcag gcttaaaggt aaattataaa ataagatttt ggtgttttag 9961 gatctaagtc ttctcagtta ttcatttagc gattcaaaga ttgtatcaga tttttaaaaa 10021 actttgtctt aatttgcctc agattttttt cttgttatta tcatagattg taaagagaat 10081 aaacttaagg gtttgaatta gccaagtttt tttactctag ggactgattt atacataact 10141 tgtagccact ggctccaaaa tataaaattg aataaacttt caccccaaat ccaagaagtt 10201 taagtctgtt ctgtcttgct gttaagatta ttttatacaa aagcaaagca tttgataact 10261 tcaacaaaat ttttctggtg gaaattggaa gtccgtgact ttgtatttct agtgagtaga 10321 catattgttc aaaggcgagt tttggtgtag ctatcagaag ctctctaatg taagtctact 10381 ttgtattggg ttgagcctca tctgtcaggg tatgagaaat aagttgtagg agttagggaa 10441 agcagaggat gacatctgga taaactagag agagtggaac tgtctagtag gcatgcagat 10501 atgggaaaag atagtcttga tactcttggt aatgtctact acggaaaaag agccacttga 10561 tgtggaagaa taattacaga gcccaaagtt atcattagct gtaagacatt ggccaattta 10621 gcatcttcag tgtacagtat ttatctttga aaggtgaggt cgaaacagat taattctgag 10681 ttcccctttt agcttttgaa atccaagtgt ctagaacagt actgtccagt agtattggaa 10741 atatgtagta ttctgcagtg atgagaatac atctgtgctg tttaatatgg tacctactat 10801 actatcacta ttaatcatgt aaaatgtggc tagtgtgaca ggaactgaat ttttaatctt 10861 ttaaatttga caaatttaaa tagccacatg tggttagtgg 
atgctgtatt gaacatcaga 10921 gatgtaggtt aaagtttttg gatgtgtaat gatattccat gatactccaa gtacttctcc 10981 agagttttca ttattgacca gactattcct ggaatagcct ttttgcttca gacatgattt 11041 aaagatctaa aatatagttt taatgatgag agatgatttg gtgctgttat gttatgaaaa 11101 tttgtatttg attggttaca gttaggtagc ttttaatttc actttctact aactcgaaga 11161 gtacaggttt tgattaattt ttgaaactgt ttatggagct tctcacacat aaataagatg 11221 ataatcaaat ttgtcttaag atttccattt ggccttcatc actttatttc atactgatca 11281 cccagcccaa gaatcattgc caaagaaaat acagtggaaa aaagcttatt aacctgtcca 11341 atttctttgc ttctgtaata gttatttgag gatatgatag tgagcatatt aggcttatca 11401 atgtgctgtc ctaagattat tcaaaatagc aattttaaaa gtcttaatca gagagggttg 11461 tttgttagtt ggaaaagtac tcaagaataa accaggtgaa aaggacaata ctgcttcccc 11521 ccccaccccc cgactatttt cttatgtagt attggtgtat ggggaggagt gcccattcag 11581 cttcattgat gttcaattgt gtcttttgtg cagggtatgt ccttgaactt ggaacctgac 11641 aatgttggtg ttgtcgtgtt tggaaatgat aaactaatta aggaaggaga tatagtgaag 11701 aggacaggag ccattgtgga cgttccagtt ggtgaggagc tgttgggtcg tgtagttgat 11761 gcccttggta atgctattga tggaaaggta ggtttaatgt tcgttagata tagttgcagc 11821 ttctaacaaa catcaaaact gattatgctt agggtttttc tttttatttt ttaacagggt 11881 ccaattggtt ccaagacgcg taggcgagtt ggtctgaaag cccccggtat cattcctcga 11941 atttcagtgc gggaaccaat gcagactggc attaaggctg tggatagctt ggtgccaatt 12001 ggtcgtggtc agcgtgaact gattattggt gaccgacaga ctgggtaaag actcaaaagc 12061 atattaaaag ttctaactaa aatttgctca atggaaggat tatattttag tcttggaatg 12121 gtaaatggta atggggacat tagaatctaa tacgaggatt aatagtaaaa tttcagctgg 12181 gaagtgaagg gggccagccc ctccacacct gtgggtattt ctcatcaggt gggacgagct 12241 agctcagtcg gtagagcatg ggactcttaa tcccagggtc gtgggtttga gccccatgtt 12301 gggcaccaga tgaagggggc cagcccctcc acacctgtga gtatttctcg tcaggtgggt 12361 tgagagactg agaaaataaa taagacacag agacaaagta tagagaaaga acagtgggcc 12421 caggggaacg gcgctcagca tatggaggac ccatgccgcc actggtctct gagttccctc 12481 agtatttatt gatcactatc tctactatct cggtgtgggg gatgtggcag gactgtaggg 12541 ttaatggtgg ggagagggtc 
agcaggaaaa cgtgagcaaa ggtctctgtg tcataaataa 12601 gtttaaggaa aggtgctgtg ccttgatgtg catgtaggcc agatttatgt ttgactttac 12661 acaaacatct cagtgcagta aagagcagta ttgctgccag catatctcac ctccagccat 12721 agggtggttt tctcctatct cagtaaatag aatgtacgat tgggttttac accaagacat 12781 tccattcccc gggacaagca ggagacagat gccttcatct tatctcaact gcaaagaggc 12841 tttcctcttt cactaatcct cctcagcaca gaccctttac gagtatcggg ctgggggatg 12901 gtcaggtctt tcccttccca cgaggccata tctcaggctg tctcagtggg gagaaacctt 12961 gcacaatacc caggctttat tgggcagagg gccctgcagc cttccacagt gcattgtgtc 13021 cctgggtact cgagactgga gaatggcgat aacttttacg aagtatactg cctgcaaaca 13081 catttttacc aaagcacatc ctgcacagcc ctaaatccgt taaaccttaa gtcaacacag 13141 cacatgtttc tgcgagcaca gggttggggc tatggttaca gactaacagc atctcaaggc 13201 agaagaattt ttcttagtac agaacaaaat ggagtttctt atgtcttctt ctttctgcat 13261 agacacagta acagtctgat ctctctttct ttccccacag ggaagtcctt ttaagtttag 13321 gaagatttat atttactttc tctttttaaa ggaaaacctc aattgctatt gacacaatca 13381 ttaaccagaa acgtttcaat gatggatctg atgaaaagaa gaagctgtac tgtatttatg 13441 ttgctattgg tcaaaagaga tccactgttg cccagttggt gaagagactt acagatgcag 13501 gtattaaaag aaatttagtc ccattgctat tatcttaagt cccattgctg ctgattgtag 13561 aatagcttct acttaatgaa ggcatatttt aattacttcc taatgggctg aactttttaa 13621 tacaaatcag taggtaagac aatgtaaaac agtagtacat ttaaaggttt tgttggtttc 13681 tttaaagtgg aagctgtttg attctgatta actctttttt tgttgtttga gacggagttt 13741 cactcttgtt gcccaggctg gagtgcgatg gcacgatctc agctcactgc aacctctgcc 13801 tcctgggttc aagcgattct cctgtctcag cctcccgagt agctgggatt acaggtgcat 13861 gccaccacgt ccggctaatt tttgtatttt tagtagagac ggagtttcct catattggtc 13921 aggctggtct cgaactcctg acctcaggtg atcacccgcc tagggcctcc caaagtgctg 13981 ggattacagg catgagccac tgcgcctggc ctgattaact cttaatggta aatattttta 14041 gcacttaggt aatttttata gtagcctcca atttaaaacc acaatattga taaaattgta 14101 aaatatattt tctcagatgc catgaagtac accattgtgg tgtcggctac ggcctcggat 14161 gctgccccac ttcagtacct ggctccttac tctggctgtt ccatgggaga gtattttaga 14221 
gacaatggca aacatgcttt gatcatctat gacgacttat ccaaacaggt caaaggaaat 14281 aaatttttag aatccattta tttgtactga agtaaaagtt cacatatgca acttctattt 14341 aataggttaa cttcacaaac ctattctgta ccataggctg ttgcttaccg tcagatgtct 14401 ctgttgctcc gccgaccccc tggtcgtgag gcctatcctg gtgatgtgtt ctacctacac 14461 tcccggttgc tggagagagc agccaaaatg aacgatgctt ttggtggtgg ctccttgact 14521 gctttgccag tcatagaaac acaggctggt gatgtgtctg cttacattcc aacaaatgtc 14581 atttccatca ctgacggaca ggtattattt taatgattaa aagaaatgtt aaaaaagatt 14641 tggataattc taaggcattg gttgagacta atggtatatt gactgatttg ctaaatgcat 14701 tttacttttt attatggaaa atttcaaata catagaaaag ttaggagaat ggcatagtga 14761 aaccccatgt atctgtcacc cagttattgc atagagtagc atgtttcgtt ctagtttcat 14821 ttttaaccac tactgttctc ccttctagat aattttgaat caaatagaaa ttatatcact 14881 tcatttgtat gtatttcagt ataaatatgt tgcctattat gtttttttct catctcatgc 14941 acatctccta attgcagcaa taaaatatac aggtcaggtc gtgaaagatt tggggtactg 15001 atgctttgca ttgttaggtt ttgagtttca gctttttggc ataatgactt aaatttgaag 15061 atagttgatt tcttacttta agcttgctag cgtgcgtatt tccttacatt atagatcttc 15121 ttggaaacag aattgttcta caaaggtatc cgccctgcaa ttaacgttgg tctgtctgta 15181 tctcgtgtcg gatccgctgc ccaaaccagg gctatgaagc aggtaatttt gcatcacttt 15241 ggcagtaagg gggctatcat tagattagca tcttaaggta ttaatatatt aactatattt 15301 gtatagtgct ctgtgtcact gagttacagt acagtaattt ctgtattgca ggtagcaggt 15361 accatgaagc tggaattggc tcagtatcgt gaggttgctg cttttgccca gttcggttct 15421 gacctcgatg ctgccactca acaacttttg agtcgtggcg tgcgtctaac tgagttgctg 15481 aagcaaggac agtattgtaa gttgttcttt tcttcatttg gtcaggttcc tattgtgtat 15541 cagggtcttc ccaggcctac agagctgcat cacatatcac ggcctgtact acgtgttgtt 15601 atctgttatc tttcatcaaa ggcactttta cttaaactac atttgttttt tgatttttga 15661 tttttttttt tttttgagac gaagtttcat tcttacggcc taggctgaag tgcaatggcg 15721 caaccttggc tcattgcaac ctccacctcc tggattcacg cgattctcct gcctcagcct 15781 ccttagtagc tcggattaca ggtgtgagcc accacgcccg actaattttt gtatttttag 15841 tagagacagg gtttcaccat gttggccagg ctggtctcga actcctgacc 
tcaggtaatc 15901 caccctgcct cagcctccca aagtgctggg attacaggtg tgagccccca tgcccggccc 15961 tgtattctgt tttttaaaga gagacagtgt cttgcagtgt cacccaggct ggagtgcagt 16021 ggtgccatca tagctcactg caggcttgaa ctcctgggct caagcagtcc tgcagcctta 16081 gcttcctgag tagctaagac tacgggcatg cactaccatg cctggctaat ttttttgtat 16141 tttgtagaga cagggtctca ctatgttgcc caggctggtc ttgaactcct ggccttatgc 16201 catcttccca cctctcaaag tgctggcatt ataggcgtga gccaccatgc ccggcctcaa 16261 actatctttc tttttctttc tttctttttt tttttttttt ttaatacaga gtctccttct 16321 gtcacccagg ctggagtata tttttagtag aaatggtaaa acctctacca ttttggccag 16381 gctgatcttg aacccctgac cttaagtggt ccacccgcct cagagtccta aagtgctggg 16441 attacaggtg tgagttacca tgcccggccc aaactacatt tcttattatc atttgagtgt 16501 aaatgatagg ggcttcacaa ccaggaaaac ttaagtatat ttaatgtcct tctttcagga 16561 attaaggcct gttagagtcc cctcccccac ttggctatat gaatatgtaa tttggcagat 16621 ttaaaagaaa gttaatacgt ttttattttg tcaagtcatt tcagcaaaat atttatactc 16681 aaaacagtat taacactgtc agattataaa cagtgcttat ggctatactg aaagattatg 16741 cctcaagcct tgaaaatata agaaataact gttttctgtc aggggtggga ggaaggatta 16801 ttaagtggag ttctaggtca ttgatattgt gaagaataga taatttacag ggacatgtta 16861 ggaattggag aaaaagtaag ttttcctgac cttggtaaca tggcattcat ttctttgagc 16921 aactcacttg gcattcttct tgtatctttt cagctcccat ggctattgaa gaacaagtgg 16981 ctgttatcta tgcgggtgta aggggatatc ttgataaact ggagcccagc aagattacaa 17041 agtttgagaa tgctttcttg tctcatgtcg tcagccagca ccaagccttg ttgggcacta 17101 tcaggtatga atgcaattgt tggcatcttt ttttaaagtt atgtttaaga tatgaagtta 17161 aaattatttt caaatctgta gttaggctag tcattaaaac tttttccagg tcagaactta 17221 cgacctgctt ttatttccaa atagggctga tggaaagatc tcagaacaat cagatgcaaa 17281 gctgaaagag attgtaacaa atttcttggc tggatttgaa gcttaaactc ctgtggattc 17341 acatcaaata ccagttcagt tttgtcattg ttctagtaaa ttagttccat ttgtaaaagg 17401 gttactctca tactccttat gtacagaaat cacatgaaaa ataaaggttc cataatgcat 17461 agttgttttc tgtcatttgt gttattcttt aaaaccaaga tcaaattgag aaattggtaa 17521 gcaaatgctt cttgatctat tttacttgaa 
tattggtaca gtcaactggg ctagataatt 17581 caaggctgag ctctttgtaa gttttttttt tttttttttt ttttgagact gattctcact 17641 ctgtcactga ggctggagtg cagtggcacc ctgtcggccc actgcaacct ccgtctccgg 17701 gttcaagcaa ttctcttgcc tcagcctccc acgtagctgg gattacaggt gcccaccacc 17761 acgcctggct agttttttag tatttttagt agagacgggg tttcaccatg ttggccaggc 17821 cggtcacgaa ctcctgacct caggtgatcc acctgccttg gcctacctac cacagtgctg 17881 ggattacagg tgtgagccac cacacccggc tgctctttgt aagtttctag agtactttgt 17941 gtttaagaga aattcctaaa ctggaatatg tggcaggctg acaatactga agaggatagc 18001 tggc // LOCUS HUMATPSYB 10186 bp DNA PRI 31-OCT-1994 DEFINITION Human ATP synthase beta subunit (ATPSB) gene, complete cds. ACCESSION M27132 NID g179280 KEYWORDS ATP synthase beta subunit. SOURCE Human fetal liver DNA, clone g-beta-lambda 1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10186) AUTHORS Neckelmann,N., Warner,C.K., Chung,A., Kudoh,J., Minoshima,S., Fukuyama,R., Maekawa,M., Shimizu,Y., Shimizu,N., Liu,J.D. and Wallace,D.C. TITLE The human ATP synthase beta subunit gene: sequence analysis, chromosome assignment, and differential expression JOURNAL Genomics 5 (4), 829-843 (1989) MEDLINE 90077425 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.B.Chung, 11-AUG-1989. 
FEATURES Location/Qualifiers source 1..10186 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p13-qter" prim_transcript 2099..9908 /note="ATPSB mRNA and introns" gene 2120..2246 /gene="ATPSB" CDS join(2120..2246,2728..2910,3126..3300,4123..4244, 4494..4678,5250..5408,5501..5623,7888..8100,8775..8976, 9663..9763) /note="ATP synthase beta subunit precursor" /codon_start=1 /db_xref="PID:g179281" /translation="MLGFVGRVAAAPASGALRRLTPSASLPPAQLLLRAAPTAVHPVR DYAAQTSPSPKAGAATGRIVAVIGAVVDVQFDEGLPPILNALEVQGRETRLVLEVAQH LGESTVRTIAMDGTEGLVRGQKVLDSGAPIKIPVGPETLGRIMNVIGEPIDERGPIKT KQFAPIHAEAPEFMEMSVEQEILVTGIKVVDLLAPYAKGGKIGLFGGAGVGKTVLIME LINNVAKAHGGYSVFAGVGERTREGNDLYHEMIESGVINLKDATSKVALVYGQMNEPP GARARVALTGLTVAEYFRDQEGQDVLLFIDNIFRFTQAGSEVSALLGRIPSAVGYQPT LATDMGTMQERITTTKKGSITSVQAIYVPADDLTDPAPATTFAHLDATTVLSRAIAEL GIYPAVDPLDSTSRIMDPNIVGSEHYDVARGVQKILQDYKSLQDIIAILGMDELSEED KLTVSRARKIQRFLSQPFQVAEVFTGHMGKLVPLKETIKGFQQILAGEYDHLPEQAFY MVGPIEEAVAKADKLAEEHSS" exon <2120..2246 /gene="ATPSB" /note="ATP synthase beta subunit precursor, (EC 3.6.1.34); G00-119-718" /number=1 sig_peptide join(2120..2246,2728..2747) /note="ATP synthase beta subunit signal peptide" intron 2247..2727 /note="ATPSB intron A" exon 2728..2910 /number=2 mat_peptide join(2748..2910,3126..3300,4123..4244,4494..4678, 5250..5408,5501..5623,9662..9760) /note="ATP synthase beta subunit" intron 2911..3125 /note="ATPSB intron B" exon 3126..3300 /number=3 intron 3301..4122 /note="ATPSB intron C" exon 4123..4244 /number=4 intron 4245..4493 /note="ATPSB intron D" exon 4494..4678 /number=5 intron 4679..5249 /note="ATPSB intron E" exon 5250..5408 /number=6 intron 5409..5500 /note="ATPSB intron F" exon 5501..5623 /number=7 intron 5624..7887 /note="ATPSB intron G" exon 7888..8100 /number=8 intron 8101..8774 /note="ATPSB intron H" exon 8775..8976 /number=9 intron 8977..9662 /note="ATPSB intron I" exon 9663..>9763 /note="ATP synthase beta subunit precursor" /number=10 BASE COUNT 2468 a 2289 c 2375 g 3054 t ORIGIN 1 cctcccaaag 
tgcagggatt acaggtgtaa gccaccacac ccagccaaaa tcctatgttt 61 tgatgcagtc cactgaaatt agttttgatc ccactccacc atggaatgat ttcctagttg 121 ccaaacccaa acccagagga tttttttcag ccctaatttt tttttcttct tttttatttt 181 tctttttttg ttgagacagg gtctcacaac gtcagccagc atggagtaca atggtgcaat 241 cttagctctc tgcaacctct acctcctggg ctcaagccat attcccatct cagcttcccc 301 agtagctggg actactggca catgccacac ctggctaatt gtgtgtctgt gtgtgtgtgt 361 gtgtgtgtgt ttattttttt acatattttt tagagacaag gtttcaccac gttgcccagg 421 ctggtattga actcctgggc tcaagttatc cacccacctc ggcttcccaa agtgttggga 481 ttactgggct ggccgtgccc agcccttttt acttatttta ttatttattt atttatttat 541 tttttattat tatttttgag acggagtctc gctctgtcac caggctggag tgcagtggca 601 cgatctcagc tcattgaaac ctccgccttc caggttcacg ccattctcct gcctcagcct 661 cctgagtagc tgggactaca ggcgcccgcc accacgccca gctaattttt gtatttttag 721 tagagacggg gtttcaccat gttggccagg atggtctaga tcgcttgacc tcatgatctg 781 cccacctcgg cctcccagag tgctgggatt acaggcgtga gccaccgtgc ccagcctatt 841 ttattttata tatttatttt atttttttag agagtctcac tctgtcgccc aagctggagt 901 gcagtggcgc atctcggctc actgcaagct ccgccatctc ggatttacgc cattctcctg 961 cctcagcctc ccaagtagct gggactacag gcgcccgcca ccacgcccgg ctaatttttt 1021 gtattgttag tagagaccgg gtttcactgt gttagtcagg atggtctcga tctcctgacc 1081 tcgtgatccg cccgccttgg cctcccatag tgctgggatt acaggcgtga gccaccgcgc 1141 ccagctaatt tttgtatttt tagtagagac ggggtttcac catgttagct aggctggtct 1201 cgaactcctg acctcaggtg atccacctgc ctcggcctcc gaaagtactg ggattacagg 1261 cgtgagccac cgctcccagc ctaccctaag aatttctatt cacatttcat agctgaggtc 1321 tctcagcttc cttcaaggtg gacaattctg tatctctagg tgacccaatt ttatcccctc 1381 ggtctttcgt gcacccagtt tatgctaacc aggaaccatc acattattgc attgtttgtt 1441 cacctggacc ttaaaaggaa ggttcttagt cttatttatc ttattattga gtctcctcaa 1501 tgccctgcta aagtactggt tacattagtt tatttcattt taataggttt aaatgataaa 1561 gatcctgtct aggttgctaa tctgtgcaat cgcctgggag cttgcaatct cactacttcg 1621 agagaaaaac cttattttgg ctctaaagag ggctcctgac ttcgccacgc tcaccttcag 1681 tggtgcttca ccgcactacc acttccctaa 
ccaccgcccc tatggctgtc acctagatca 1741 aggacctatc taaggagaaa gcccaaggac aggcaaagac aggccacgca ctttcaacag 1801 gaactcggcc cctttcctaa acgtagttcc tctgattacc ctgtccaacg gacgtcgcta 1861 tgatccaatt aagttttgag gctcagagtg acgccctcta acctcacact gtccaataaa 1921 aaaaggagtt tcagattagc accagttttg accaatggat gagaagatca gcgtgtatta 1981 acctgtagaa gcagcacaga gaatgatgag cagccagttc acccaatgga cctgcctact 2041 gcagcgtagg cctcgctcaa cggcaggaga gcaggcggct gcggttgctg cagccttcag 2101 tctccacccg gactacgcca tgttggggtt tgtgggtcgg gtggccgctg ctccggcctc 2161 cggggccttg cggagactca ccccttcagc gtcgctgccc ccagctcagc tcttactgcg 2221 ggccgctccg acggcggtcc atcctggtaa gtgcttttct ctaggagcta acgttccatt 2281 ttgccgcccc atgaccttga gccggggaac gatggtagct cgggcctaag ggatccgtgg 2341 tgttaaaagg atgcctggag ccgcgtcttg ctctctaatg gctggagaac aagaatggga 2401 cacccatagg aggtttcttc gacccagctc tgtcccattt tgtataaagt cctttgtgtg 2461 acggaaaagg tgcgggcagg tgcgttgacc ttccggttga ataggggtac tgaactctcg 2521 aaaaatgggg tcttgtgcat gcggatggga aggctgtgca ggcgctttat cgatcgctgc 2581 ggccttccct cttggacacg cgttccgtac aagcggaaag aggcctgact tcggctgagt 2641 tttcctctcc ttgtgtgtta agtcctggct tccgatgacc tttaactggc ctgaccccag 2701 ctccttccaa tatcttttgt ctgttagtca gggactatgc ggcgcaaaca tctccttcgc 2761 caaaagcagg cgccgccacc gggcgcatcg tggcggtcat tggcgcagtg gtggacgtcc 2821 agtttgatga gggactacca ccaattctaa atgccctgga agtgcaaggc agggagacca 2881 gactggtttt ggaggtggcc cagcatttgg gtaagtagag tttgttagga atagtaaaga 2941 ccttgtgtag cccaaatatc cccaaaacca agctgtctgc cttctatgat gattttatca 3001 aaatgacttt cgttcttctg agtttgctga agccacattt aggtactgag aaggagtctt 3061 ggtcgattta ggtcttgata ccaatttatc cttatgtata accctgcgat gcctgtatct 3121 aacaggtgag agcacagtaa ggactattgc tatggatggt acagaaggct tggttagagg 3181 ccagaaagta ctggattctg gtgcaccaat caaaattcct gttggtcctg agactttggg 3241 cagaatcatg aatgtcattg gagaacctat tgatgaaaga ggtcccatca aaaccaaaca 3301 gtaagttgct cttattgcat ctatgcacaa aggaacttct gttatttgca gttttacaca 3361 ttaaggtgac atgttctcct aaagcagaat ttttcttctg 
atacgttcga ttattaaaat 3421 cacacagtgc agtttcacct gaagaaatta ggatgcagaa tttctgattg acaaaagcct 3481 actacttgag gttttttttt ttgagaggga gtttcgctct cgactcactg caacctccac 3541 ctcctgggtt caagtgattc tcctgtctca gcctcctgag tagctgggat tacaggtgca 3601 tgccatcatg cccggctaat ttttgtattt ttagtacaga tggggtttca ccatgttggc 3661 caggctggtc tcgaactcct gattttgggt gattcccccc ccacctcggc cccccaaagt 3721 gctgggataa caggcatgag ccactgcgcc tggctagttg agtattttta agactcggtc 3781 ttatggctta ggtttaggag tctccaaaac ttgtaggttt ttgtatttgc tattgctatg 3841 tgagggaatg ctgttaggtt aaggctcagt aaaatgaaaa atttatgaca gttaaagtca 3901 gggaatggga ttattgtttc ctaagataat aattttcttt attgcagaaa gttgggttag 3961 atcctagatg atagtttatc tgatttatgt aagaaagggg aggtaagact gtggttcctc 4021 aggtaaaggg atggacttga gagttgatag ggagggatat ataagattat catttgcttt 4081 caacaaggtg aatagatact atcctctctt ctacctttac agatttgctc ccattcatgc 4141 tgaggctcca gagttcatgg aaatgagtgt tgagcaggaa attctggtga ctggtatcaa 4201 ggttgtcgat ctgctagctc cctatgccaa gggtggcaaa attggtatgt tattaagggg 4261 tacatatttt ccaaacctaa tatgggagga aaaactcata caatgttatg agtggatatt 4321 gacatctatt cctcactgat gagtacgttc tgactttcgt tcttctgagt ttgctgaagc 4381 cagatgcaat ttctgagaag gaataggatg gaaggaagca attacttttt ttgaagtttg 4441 cttatttagg aaagcagaac tcttaaatat tctttctttt caacctctta tagggctttt 4501 tggtggtgct ggagttggca agactgtact gatcatggag ttaatcaaca atgtcgccaa 4561 agcccatggt ggttactctg tgtttgctgg tgttggtgag aggacccgtg aaggcaatga 4621 tttataccat gaaatgattg aatctggtgt tatcaactta aaagatgcca cctctaaggt 4681 aagcattgag ctaattgatg tccacgtggt agttacgcca ggaaactatt gccaactgaa 4741 gccatgttct atattttcta tttaattttt ttttcttttt aagaaaatgt taatattttc 4801 attttcgtgt cccagggcct acaaaggaaa cctgttagtg ttttcccccc caacttccaa 4861 tacacaaatg gacattttct ttcctctttg atactgacat taaataacat ttccaggttt 4921 tctttttggt agatagtccc atcatttgaa agctaatttt gggagggctg gattggacgt 4981 ggtacatatg tttcagttaa gcctattttg tacttttcac ttaacttgaa taggttttta 5041 tttcctttct actatagttt tgccttacta atataagaag ctgtcttttt 
tttttgaact 5101 gcatctctaa tagggaaaag aaacttgcac acaaaaacgc tgacaccttc tgagaaagga 5161 tctgtggtcg tttctccgat ttgggaacct tcagtatgtg gcttctctac tcctttgtca 5221 ggttttacga ttatgtatga ttactgcagg tagcgctggt atatggtcaa atgaatgaac 5281 cacctggtgc tcgtgcccgg gtagctctga ctgggctgac tgtggctgaa tacttcagag 5341 accaagaagg tcaagatgta ctgctattta ttgataacat ctttcgcttc acccaggctg 5401 gttcagaggt aagagggaag gcttgagagg acctggttca tctggccttt cttcggatga 5461 ggctaaggat gtagacacct aagacctttt tttcttttag gtgtctgcat tattgggccg 5521 aatcccttct gctgtgggct atcagcctac cctggccact gacatgggta ctatgcagga 5581 aagaattacc actaccaaga agggatctat cacctctgta caggtaagaa aaattacata 5641 gatgaagatc tgatttgtat aaaggcaggg tgcagtggtg catctcagct actgaggagg 5701 ctgaggcagg aagattgctt gagcccaaga gatcagcctg gggaacaaag ctgtatgtat 5761 gtagagagat gtataaaact gactgggcca ggcgtggtgg ttcatgcctg taataccaac 5821 actttgggag gctgaggtac gcgatcatga ggtcaggagt taagacacca ccctggccaa 5881 cgtggtgaaa tcccgtctct actaaagata caaaaaatta gctgggcgtt gtggcaggcg 5941 cctgtaatcc caactactca gaggctgagg caggagaatc gcttgaatcc gggaggcaga 6001 ggttgcaatg agccaagatt ggggcactgc actccagcct gggtgacaga gtcagactct 6061 gcctcaaaaa attaaaaaaa aataaatgaa ggtatacatt taaatgtaga tcacaaagca 6121 atggtttcta gtctgagggg agtccggcgc tatatggaaa ctgagcatta aatatttact 6181 gtatcagggc tgagtgcagt ggctcacact tgtaattcca gcactttggg gccgggcacg 6241 tggctcacgc ctgtaatccc agcactttgg gaggccaagg caggcggatc atgaggtcag 6301 gagatcaaga ccatcctgac ttaacacagt gaaaccccat ctctactaaa aatacaaaaa 6361 attagctggg cgtggtggca cctgtctgta gttccagcta ctcgggaggc tgaggcagga 6421 gaattgcttg aacctgggag gcggaggttg caggaggttg tggtgagctg agattgggcc 6481 actcactcca gcctgggcgg cagagtgata atccgtctac aaaaaaaaaa aatttactgc 6541 attaatgtac tatacatttt caaatatata tagatgggat atatagtaaa attacataaa 6601 ttttacatat agtaaaaatg tggctaaact gtaggaactt ttcatgattc agtgtaatga 6661 actattgtgc aagaagtttt ttgttcttgt ttttaattct tttttttttt tttttgagac 6721 ggagtttcac tcttattgcc caggctggag tgcaatgggt gcgatctggg ctcactgcaa 
6781 cctcctcctg ggttcaggtg attcctttgt ctcagcctcc tgagtagctg ggattatagg 6841 tgcccgacac caggcccggt taatttttgt atttttagta gagacggggt ttcaccatgt 6901 tagtcaggct ggtcttgaac tcctgacctc aggtgatctg cctgactcag cctcccaaag 6961 tgttgggatt ataggcttga gccactgtgc ccagcctctt ttttattttt ttattttttt 7021 taatttttat ttttttgaga cggagtctgt tgcctaggct acagtgtggt agcacagtct 7081 tggctcacta taacctccac ctcctgggtt caagcgattc acctgcttca gcctcctgca 7141 tagctgggat tacaggcatg catcaccatg cctggctaat ttttgtttcc gttttttttt 7201 ttttttttag tagagacccc cgtctctact ggtgtcgaac tcctgacctc gtgatctgcc 7261 ctccttggcc tcccaaagtg ctgggattat aggcatgagc caccgtgcct ggcccccagc 7321 ctcttgtttt taattctaaa agtgggcatt aaaatagtta aaataaaaga tgaagaacaa 7381 aaagaatgat actaaaaatt ccttccatca tggctgagag accagataat ctgtcagctt 7441 attcagcagg tattaggggt ggtaagaagg gtcttttgag tgctaagaat atggagtgag 7501 atgataaaga taggtcagaa ggggatggct taagtaaaat tactttggtt cgctggcgtg 7561 gtcgtcacgc ctataatccc cacacttttg ggaggccagg cgggtggatc acgaggtcag 7621 agttcaagac cagcctggcc aaagatagtg aaacccccgt ctctacaaaa aatacaaaaa 7681 caacattagc cgtgtgtggt gatgggcacc tgtagtccta gccactcggg aagctgagac 7741 agagaattgc ttgaatccag gaggcagagg ttgcagtgag ccgagattgc accactgcac 7801 tccagcctgg gcgacagagc gagaccctct caaaaaaaaa aaaaaaaatt tactttggtt 7861 cctgttcttc atgggatctc actttaggct atctatgtgc ctgctgatga cttgactgac 7921 cctgcccctg ctactacgtt tgcccatttg gatgctacca ctgtactgtc gcgtgccatt 7981 gctgagctgg gcatctatcc agctgtggat cctctagact ccacctctcg tatcatggat 8041 cccaacattg ttggcagtga gcattacgat gttgcccgtg gggtgcaaaa gatcctgcag 8101 gtgagtatat tactatgtgg gatcagtgtc cggaaagtca aaaagagggc ctgtggggac 8161 tatatgagga ctggatcttt cttagtgatt tgttttggag caaggaaagt tgaggctggc 8221 aattgctagt gagagctaat gaggtctctt gagtttccag ggtacttaac gctttggaat 8281 gcaatatttt tctttctttt tttttttttt tgagatgagt ctcactgtgt ctcccaagct 8341 ggagtgcagt ggtgccatct cggctcactg caacctccgc ctcccaggtt caagcgattc 8401 tcctgcctca gcctcctgag tacctgggat tacaggtgcg cgccaccatg cctggctact 8461 
ttttgtattt ttagtagaga ccaggtttca tcagttggtc aggctggtct cgatctcctg 8521 acctcatgat ccacccaccg tggcctccca aagtggtggg attacagacg tgagccactg 8581 cgcccggcgg aatgcaatat ttttctaaca gcaccaaact aggactatgg agaacaagac 8641 actgatcttt cttggctgag ggccctcata acaccaccac cttctctgcc ccctagcatg 8701 taactttccc tttgttagct tgtcccaaat taaaggaatg agaatactta actcagtctt 8761 cttttttctc ataggactac aaatccctcc aggatatcat tgccatcctg ggtatggatg 8821 aactttctga ggaagacaag ttgaccgtgt cccgtgcacg gaaaatacag cgtttcttgt 8881 ctcagccatt ccaggttgct gaggtcttca caggtcatat ggggaagctg gtacccctga 8941 aggagaccat caaaggattc cagcagattt tggcaggtga gattttgagt acaaatcttg 9001 aatgtttact gtgctgtggt cccattccaa caaccgtcaa gcaatatatg taatatacta 9061 tgcttaatta tattttttaa tttaaaaaac aaacttaccc attgattttg tttgaaaata 9121 ctctacattt taattttgag tgtgattttg ttacttgatt ctctagctcc cttttatttt 9181 atatgtattt tttgagactg agtctctgtc gcccaggctg gagtacactg gtgcaatctt 9241 ggctcactgc aacctccacc tgccgggttc aagtgattct ccagcctcaa ccatccaaag 9301 tagctgggat tacaggcaca cgccaccacg cctggctaat ttttgtattt ttagtagcca 9361 tggggtttca ccatgttggc tgggcttgtc tcgaactcct gaccttaggt gatccgcctg 9421 ccttggcctc ccaaagtgct gggattacag gtgtaagcca ccgtgcctgg cccatgtgtt 9481 cttaattcat actgtatcat atcttgtaaa tttgatttgt gagggaaatt taagctttct 9541 aagatgacat gaattcatca cattctaact gatggcctga agtggtgagg aatgttacat 9601 gatgcagaaa gttgatatcc ctccgcttct tactcttttt ttttttctcc cccatcatac 9661 aggtgaatat gaccatctcc cagaacaggc cttctatatg gtgggaccca ttgaagaagc 9721 tgtggcaaaa gctgataagc tggctgaaga gcattcatcg tgaggggtct ttgtcctctg 9781 tactgtctct ctccttgccc ctaacccaaa aagcttcatt tttctgtgta ggctgcacaa 9841 gagccttgat tgaagatata ttctttctga acagtattta aggtttccaa taaaatgtac 9901 acccctcaga atttgtctga tctcttggtt ctgacaacat agtcaacact gaagggttat 9961 gtatttaatt ttagttttag agacacggtg tctggctgtg ttgccaagac tggtctctaa 10021 ctcctgggct cgagatctcc cacctcagtc tcctgagtag ctggggctac aggtgtatgt 10081 agtctcacat caccagcact gttttcaaca attagatttt tagagtggct ataagaagca 10141 
gtttcagcat gaagtgggcc atgtatgttg aaattggtcc ttaaaa // LOCUS HUMAZCDI 5002 bp DNA PRI 17-DEC-1992 DEFINITION Human azurocidin gene, complete cds. ACCESSION M96326 NID g179301 KEYWORDS azurocidin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5002) AUTHORS Morgan,J.G., Sukiennicki,T., Pereira,H.A., Spitznagel,J.K., Guerra,M.E. and Larrick,J.L. TITLE Cloning of the cDNA for the serine protease homolog CAP37/azurocidin, a microbicidal and chemotactic protein from human granulocytes JOURNAL J. Immunol. 147, 3210-3214 (1991) MEDLINE 92013155 REFERENCE 2 (bases 1 to 5002) AUTHORS Zimmer,M., Medcalf,R.L., Fink,T.M., Mattmann,C., Lichter,P. and Jenne,D.E. TITLE Three human elastase-like genes co-ordinately expressed in the myelo-monocyte lineage are organized as a single genetic locus on 19pter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 8215-8219 (1992) MEDLINE 92390417 FEATURES Location/Qualifiers source 1..5002 /organism="Homo sapiens" /db_xref="taxon:9606" /germline GC_signal complement(28..33) CAAT_signal 137..142 GC_signal 179..184 TATA_signal 212..216 mRNA join((237.241)..310,636..792,2002..2146,3148..3381, 4156..>4458) /standard_name="azurocidin mRNA" exon (237.241)..310 /number=1 /label=exon1 CAAT_signal 237..242 conflict 250 /note="'C' in conflict" /citation=[1] /replace="" CDS join(253..310,636..792,2002..2146,3148..3381,4156..4317) /codon_start=1 /product="azurocidin" /db_xref="PID:g179302" /translation="MTRLTVLALLAGLLASSRAGSSPLLDIVGGRKARPRQFPFLASI QNQGRHFCGGALIHARFVMTAASCFQSQNPGVSTVVLGAYDLRRRERQSRQTFSISSM SENGYDPQQNLNDLMLLQLDREANLTSSVTILPLPLQNATVEAGTRCQVAGWGSQRSG GRLSRFPRFVNVTVTPEDQCRPNNVCTGVLTRRGGICNGDGGTPLVCEGLAHGVASFS LGPCGRGPDFFTRVALFRDWIDGVLNNPGPGPA" sig_peptide join(253..310,636..655) conflict 297 /note="'G' in conflict" /citation=[1] /replace="" intron 311..635 /number=1 exon 636..792 /number=2 /label=exon2 
mat_peptide join(656..792,2002..2146,3148..3381,4156..4314) /product="azurocidin" intron 793..2001 /number=2 exon 2002..2146 /number=3 /label=exon3 intron 2147..3147 /number=3 repeat_region complement(2246..2369) /rpt_family="'Alu'" repeat_region 2417..2703 /rpt_family="'Alu'" repeat_region complement(2895..3014) /rpt_family="'Alu'" exon 3148..3381 /number=4 /label=exon4 intron 3382..4155 /number=4 repeat_region complement(3525..3810) /rpt_family="'Alu'" exon 4156..>4458 /number=5 /label=exon5 conflict 4380 /note="'' in conflict" /citation=[1] /replace="" repeat_region 4661..4951 /rpt_family="'Alu'" BASE COUNT 1089 a 1268 c 1771 g 874 t ORIGIN 1 ggatccactg gttcctgaca ccctcacctg cccctggggg tgtggccatc ttctagagag 61 ggaaactgag gatcagtgca gaatgtaggg ggagcccagg ctggcccagg gagcagttgg 121 cggtggaggc cttgggcaat ttcccgtgtt cccactgagt ggggctgtcc ctgggcctgg 181 gcggggacgc caccaactgc caaggcctgt gtataagggc agccgccgcc ttagccacag 241 acctgccccg ccatgacccg gctgacagtc ctggccctgc tggctggtct gctggcatcc 301 tcgagggccg gtgagtgcct ctctgtgccg gtggtccccc atctgtgcta gggcccggct 361 gccagggcag aactcagact taaagcacag agaaggcaag cggcttggcc tgggtcacac 421 agccagcccg gcctggacga tcccgcgaaa ggcgtgaggg cggacggtgt gcgggactca 481 ggggccccct gtcctcttag ggagtgggac gatgggggag ggtgggtccc cccgcagccc 541 cactgggtgg atagagctga ggctgcagct tcacacgccc tcccggccac tgtgtggatt 601 cttggggatc tcagagctgt ctccccccga cccaggctcc agcccccttt tggacatcgt 661 tggcggccgg aaggcgaggc cccgccagtt cccgttcctg gcctccattc agaatcaagg 721 caggcacttc tgcgggggtg ccctgatcca tgcccgcttc gtgatgaccg cggccagctg 781 cttccaaagc cagtgagggg tcctggggag ggggcctagg gggcattggg gctcagagaa 841 ggggcttggg gggcttaggc attcagtggg ggtgcttggt aggtgaggag gggaggggat 901 tgcaaaagga ggggctcagg gaaaggaggg ggcttggaga gggaaatggg gactgagttg 961 aggagggacc caaggatatt ggggggctca gatggaggag gcccagagaa gggaaggggg 1021 tcagatggag gaggcccaga gaaaggaaga ggctcagatg gaggaggtgc agtgaaggaa 1081 agggggtcag atgggggagg cccagagaag ggaaggggct cagatggagg agggggccca 1141 gagaaaggaa 
ggggctcaga tggaggaggt gcagagaagg gaagggggtc agatgggggg 1201 aggcccagag aagggaaggg gctcagatgg aggaggtgca gagaagggaa ggggctcaga 1261 tggaggaggt gcagagaaga gaagggcctc agatggagga ggtgcagaga agagaagggc 1321 ctcagatgga ggaggtgcag agaagggaag ggcctcagat ggaggaggtg cggagaaggg 1381 aagggggtca gatggaggag gtgcagagaa gggaaggggg tcagatgggg gaggcccagg 1441 gaagggaagg ggctcagatg gaggaggggc agagaaggga agggggtcag atgggggagg 1501 cccagggaag ggaaggggct cagatggggg aggcgcagag aagggaaggg ggtcagatgg 1561 aggaggtgca gagaagggaa gggggtcaga tgggggaggc ccagataagg gaatggggtc 1621 agatggggga ggtgcagaga agggaagggg gtcagatggg ggaggcccag ataagggaag 1681 gggctcagat ggaggaggtg cagagaaggg gagggggtca gatggaggag gctcagagaa 1741 gggaagggac tcagatggag gagggggcgc agagaagaga aggggctcag atggaggagg 1801 aggcgcagag aagggaaggg gctcaagatg ggaagggggc cctggaaagt ctcggctctg 1861 cttctgtaaa agcgggggag ttttcagggt gaaggattgc agtctgcagg ctgggatccc 1921 ccctaatttg caagccggct tgctctgtgc ccaggcccca gcctggtgtc ctccctctgc 1981 cctttcctcc gctactctca ggaaccccgg ggttagcacc gtggtgctgg gtgcctatga 2041 cctgaggcgg cgggagaggc agtcccgcca gacgttttcc atcagcagca tgagcgagaa 2101 tggctacgac ccccagcaga acctgaacga cctgatgctg cttcaggtga gaggatggtg 2161 ccacctgtga tcccagcacc tcgggaggcc gacgttagcc agggaaacaa gtccaaactt 2221 ggtctctaca aaaaaataca aaaattagcc gggagtggtg gcgcgcacct gtggcccctg 2281 tgcttcagga ggccgaggcg gaaggacggc ttgaggtcag gagttcgaga ccagcctggg 2341 caacatggcc aaactcagtc tctacaaaaa tatatatgtg tgtgtgtgtg tgtgtgtgtg 2401 tgtgtgtgtg tgtgtatctt gccgggtgag gtggctcatg cctgtaatcc cagcattttg 2461 ggaggccgag gtgggcggat cacgaggtca ggagattgag accagcctgg ccaacatggt 2521 gaaaccccat ctctactaaa aatacaaaaa ttagccaggc atggcagcgg gcgcctctag 2581 tcccagctac tcaggaggct gaggcaggag aatcgcttga acccgggagg cggagcttgc 2641 agtgagccga gatcgcgccc ctgcactcca gcctgggtaa cagagccaga ccctatctca 2701 aaaaaaactt ccaaaaacaa tacagcaaca catacagatg taccacggtt cgcgtatgga 2761 gcctcctgtt ggtggagact gacgtcgttt tcaaatgctt ttgctatgac agaatcatgt 2821 gaatgttttt catgtttggt 
ttttttcttt gagaaaatga taaaattatc tcaaaaatat 2881 cattaaaaaa tttaaaaaag tagagacggg ggtttcacct tgttggccag tttggtctcg 2941 aactcctggc ctcaagtgat ccacccacct tggcctcgca acgtgctggg aatacaggcg 3001 tgagccaccg cacccggccc ctgccgggaa ttaaacgcaa accacttaca gactacagtt 3061 aatgtcgctg acacttctgc tcccaggggt ccccatgagg ctccagtccc cagggccacc 3121 ctcccctgac tccatttcct tccccagctg gaccgtgagg ccaacctcac cagcagcgtg 3181 acgatactgc cactgcctct gcagaacgcc acggtggaag ccggcaccag atgccaggtg 3241 gccggctggg ggagccagcg cagtgggggg cgtctctccc gttttcccag gtttgtcaac 3301 gtgactgtga cccccgagga ccagtgtcgc cccaacaacg tgtgcaccgg tgtgctcacc 3361 cgccgcggtg gcatctgcaa tgtgagtgct ccctgtggcg ggaggagggg tcctgagagg 3421 tactgagctc tccgtggcag gagaaagcaa gtgcaggctg agggcggcac agcagggggg 3481 ccccaggatt gagcattttc acggtaggag aaacagtatc tttttttttt tttttgagac 3541 agagtctcgc tctgtcgccc aggctggagt gtagtggcgt gatctcggcg gctcactgca 3601 acctccgcct cctgggttca agcgattctc ctgcctcagc ctcctaagta gctgggatta 3661 caggcatgcg ccaccacgcc cggctaattt tgtattttta gtagagacag ggtttctcca 3721 tgtgggtcag gctggtctcg aactcctgac ctcatgatcg acccaccttg gcctcccaaa 3781 gtgttaggat aacaggcatg agccaccgtg cctggctgag aaacagtagc tatcaaacgc 3841 cggctgtgag ccacgtctgt gctgggggtt ggggacccag caggcatggt agagccggtc 3901 actgagggac tcaggcgtgt gattgccagg ggaggggcac ctggcccagc ctggaggtgc 3961 caggaagctc cagaaagcaa ctgatcccaa agtccactag cagttaacca gggcagagaa 4021 agagaagagc catgcaaagg ccctggggct ggatcaggac ttgtaggttc caggggcagc 4081 aagaggcctc tgcagttctg gggtggcgtg ggagccaggc cctgggacgc cctgacacag 4141 ctgctgcctg cccaggggga cgggggcacc cccctcgtct gcgagggcct ggcccacggc 4201 gtggcctcct tttccctggg gccctgtggc cgaggccctg acttcttcac ccgagtggcg 4261 ctcttccgag actggatcga tggtgttctc aacaacccgg gaccggggcc agcctagggg 4321 ggcctgtgac ctcccatgga gcccagcccc cgccctccac acctccggcg ctccgcaccc 4381 acctcccacg gccccgcccc tgcccccgct ccggccagag gggccctggc tgtaataaag 4441 aagccgatct ctcctctgct cctggtttct gttcattggt gggggagggg gctgtgggga 4501 cgcgtgagtg gcaccttcac cggccttagg 
ggcacccacc gcaggtgcac tgcctgtgca 4561 gatgtcagat gttcagagat tccctcaaag cccggggaag caggggctgg tgttatctgc 4621 acccgacagc ggggtgttgg ggggaggccc aggttcagag aggttgggtg gctgcccaga 4681 ggtcacacag tgaatgccgc ccagcacttt gggaggccga ggtgggcgga tcacctgagg 4741 tcaggagttc aagaccagcc cggccaacct ggtgaaaccc catctctata aaaatacaaa 4801 aattagccgg gcatgatggc gggcgcctgt aatcccagtt acttgggagg ctgaggcagg 4861 agaatcacct gaacccggga ggcggaggtt gcagcgaacc gagatggcgc cactgcactc 4921 cagcctgggc gacagcgaga ctccagctca aaaaaaaaca aaaaccacgg gagaaaacgg 4981 ggaacattct cctcttggat cc // LOCUS HUMBETGLOA 3002 bp DNA PRI 26-AUG-1994 DEFINITION Human haplotype C4 beta-globin gene, complete cds. ACCESSION L26462 NID g432453 KEYWORDS beta-globin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3002) AUTHORS Fullerton,S.M., Harding,R.M., Boyce,A.J. and Clegg,J.B. TITLE Molecular and population genetic analysis of allelic sequence diversity at the human beta-globin locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
91 (5), 1805-1809 (1994) MEDLINE 94173918 FEATURES Location/Qualifiers source 1..3002 /organism="Homo sapiens" /note="sequence found in a Melanesian population" /db_xref="taxon:9606" /haplotype="C4" allele replace(111,"t") allele replace(263,"t") /note="Rsa I polymorphism" allele replace(273,"c") allele replace(286..287,"") /note="2 bp insertion of AT" allele replace(288,"t") allele replace(295..296,"") /note="1 bp deletion of C or 2 bp deletion of CT" allele replace(347,"c") allele replace(476,"t") allele replace(500,"c") CDS join(866..957,1088..1310,2161..2289) /codon_start=1 /product="beta-globin" /db_xref="PID:g532506" /translation="MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFE SFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPE NFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH" exon <866..957 /number=1 allele replace(874,"c") intron 958..1087 /number=1 exon 1088..1310 /number=2 intron 1311..2160 /number=2 allele replace(1326,"g") /note="Ava II polymorphism" allele replace(1384,"g") allele replace(1391,"t") allele replace(1976,"t") exon 2161..>2289 /number=3 allele replace(2522,"c") allele replace(2602,"a") allele replace(2604,"c") allele replace(2760,"t") /note="Hinf I polymorphism" allele replace(2913,"g") BASE COUNT 810 a 601 c 599 g 992 t ORIGIN 1 acctcctatt tgacaccact gattacccca ttgatagtca cactttgggt tgtaagtgac 61 tttttattta tttgtatttt tgactgcatt aagaggtctc tagtttttta cctcttgttt 121 cccaaaacct aataagtaac taatgcacag agcacattga tttgtattta ttctattttt 181 agacataatt tattagcatg catgagcaaa ttaagaaaaa caacaacaaa tgaatgcata 241 tatatgtata tgtatgtgtg tacatataca catatatata tatatatatt ttttcttttc 301 ttaccagaag gttttaatcc aaataaggag aagatatgct tagaactgag gtagagtttt 361 catccattct gtcctgtaag tattttgcat attctggaga cgcaggaaga gatccatcta 421 catatcccaa agctgaatta tggtagacaa aactcttcca cttttagtgc atcaacttct 481 tatttgtgta ataagaaaat tgggaaaacg atcttcaata tgcttaccaa gctgtgattc 541 caaatattac gtaaatacac ttgcaaagga ggatgttttt agtagcaatt tgtactgatg 601 gtatggggcc aagagatata 
tcttagaggg agggctgagg gtttgaagtc caactcctaa 661 gccagtgcca gaagagccaa ggacaggtac ggctgtcatc acttagacct caccctgtgg 721 agccacaccc tagggttggc caatctactc ccaggagcag ggagggcagg agccagggct 781 gggcataaaa gtcagggcag agccatctat tgcttacatt tgcttctgac acaactgtgt 841 tcactagcaa cctcaaacag acaccatggt gcatctgact cctgaggaga agtctgccgt 901 tactgccctg tggggcaagg tgaacgtgga tgaagttggt ggtgaggccc tgggcaggtt 961 ggtatcaagg ttacaagaca ggtttaagga gaccaataga aactgggcat gtggagacag 1021 agaagactct tgggtttctg ataggcactg actctctctg cctattggtc tattttccca 1081 cccttaggct gctggtggtc tacccttgga cccagaggtt ctttgagtcc tttggggatc 1141 tgtccactcc tgatgctgtt atgggcaacc ctaaggtgaa ggctcatggc aagaaagtgc 1201 tcggtgcctt tagtgatggc ctggctcacc tggacaacct caagggcacc tttgccacac 1261 tgagtgagct gcactgtgac aagctgcacg tggatcctga gaacttcagg gtgagtctat 1321 gggacccttg atgttttctt tccccttctt ttctatggtt aagttcatgt cataggaagg 1381 ggataagtaa cagggtacag tttagaatgg gaaacagacg aatgattgca tcagtgtgga 1441 agtctcagga tcgttttagt ttcttttatt tgctgttcat aacaattgtt ttcttttgtt 1501 taattcttgc tttctttttt tttcttctcc gcaattttta ctattatact taatgcctta 1561 acattgtgta taacaaaagg aaatatctct gagatacatt aagtaactta aaaaaaaact 1621 ttacacagtc tgcctagtac attactattt ggaatatatg tgtgcttatt tgcatattca 1681 taatctccct actttatttt cttttatttt taattgatac ataatcatta tacatattta 1741 tgggttaaag tgtaatgttt taatatgtgt acacatattg accaaatcag ggtaattttg 1801 catttgtaat tttaaaaaat gctttcttct tttaatatac ttttttgttt atcttatttc 1861 taatactttc cctaatctct ttctttcagg gcaataatga tacaatgtat catgcctctt 1921 tgcaccattc taaagaataa cagtgataat ttctgggtta aggcaatagc aatatctctg 1981 catataaata tttctgcata taaattgtaa ctgatgtaag aggtttcata ttgctaatag 2041 cagctacaat ccagctacca ttctgctttt attttatggt tgggataagg ctggattatt 2101 ctgagtccaa gctaggccct tttgctaatc atgttcatac ctcttatctt cctcccacag 2161 ctcctgggca acgtgctggt ctgtgtgctg gcccatcact ttggcaaaga attcacccca 2221 ccagtgcagg ctgcctatca gaaagtggtg gctggtgtgg ctaatgccct ggcccacaag 2281 tatcactaag ctcgctttct tgctgtccaa 
tttctattaa aggttccttt gttccctaag 2341 tccaactact aaactggggg atattatgaa gggccttgag catctggatt ctgcctaata 2401 aaaaacattt attttcattg caatgatgta tttaaattat ttctgaatat tttactaaaa 2461 agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa tgaagagcta gttcaaacct 2521 tgggaaaata cactatatct taaactccat gaaagaaggt gaggctgcaa acagctaatg 2581 cacattggca acagccctga tgcatatgcc ttattcatcc ctcagaaaag gattcaagta 2641 gaggcttgat ttggaggtta aagttttgct atgctgtatt ttacattact tattgtttta 2701 gctgtcctca tgaatgtctt ttcactaccc atttgcttat cctgcatctc tcagccttga 2761 ctccactcag ttctcttgct tagagatacc acctttcccc tgaagtgttc cttccatgtt 2821 ttacggcgag atggtttctc ctcgcctggc cactcagcct tagttgtctc tgttgtctta 2881 tagaggtcta cttgaagaag gaaaaacagg ggtcatggtt tgactgtcct gtgagccctt 2941 cttccctgcc tcccccactc acagtgaccc ggaatctgca gtgctagtct cccggaacta 3001 tc // LOCUS HUMBFXIII 33206 bp DNA PRI 31-OCT-1994 DEFINITION Human factor XIII b subunit gene, complete cds. ACCESSION M64554 J05294 NID g179416 KEYWORDS blood coagulation factor; factor XIII; factor XIIIb; zymogen. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 33206) AUTHORS Bottenus,R.E., Ichinose,A. and Davie,E.W. TITLE Nucleotide sequence of the gene for the b subunit of human factor XIII JOURNAL Biochemistry 29, 11196-11209 (1990) REFERENCE 2 (sites) AUTHORS Nishimura,D.Y., Leysens,N.J. and Murray,J.C. TITLE A dinucleotide repeat for the D1S53 locus JOURNAL Nucleic Acids Res. 
20 (5), 1167 (1992) MEDLINE 92195854 FEATURES Location/Qualifiers source 1..33206 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast, leukocyte" /dev_stage="fetus" /tissue_type="liver" /map="6p25-p24" mRNA join(<2931..2994,7093..7293,8181..8366,9073..9249, 9606..9782,12684..12863,12951..13136,14251..14433, 17308..17508,19264..19446,29399..29612,30723..30943) /gene="F13A1" /note="G00-120-614" /product="coagulation factor XIIIb" gene join(<2931..2994,7093..7293,8181..8366,9073..9249, 9606..9782,12684..12863,12951..13136,14251..14433, 17308..17508,19264..19446,29399..29612,30723..30943) /gene="F13A1" CDS join(2931..2994,7093..7293,8181..8366,9073..9249, 9606..9782,12684..12863,12951..13136,14251..14433, 17308..17508,19264..19446,29399..29612,30723..30756) /gene="F13A1" /codon_start=1 /db_xref="GDB:G00-120-614" /product="coagulation factor XIIIb" /db_xref="PID:g179417" /translation="MRLKNLTFIIILIISGELYAEEKPCGFPHVENGRIAQYYYTFKS FYFPMSIDKKLSFFCLAGYTTESGRQEEQTTCTTEGWSPEPRCFKKCTKPDLSNGYIS DVKLLYKIQENMHYGCASGYKTTGGKDEEVVQCLSDGWSSQPTCRKEHETCLAPELYN GNYSTTQKTFKVKDKVQYECATGYYTAGGKKTEEVECLTYGWSLTPKCTKLKCSSLRL IENGYFHPVKQTYEEGDVVQFFCHENYYLSGSDLIQCYNFGWYPESPVCEGRRNRCPP PPLPINSKIQTHSTTYRHGEIVHIECELNFEIHGSAEIRCEDGKSTEPPKCIEGQEKV ACEEPPFIENGAANLHSKIYYNGDKVTYACKSGYLLHGSNEITCNRGKWTLPPECVEN NENCKHPPVVMNGAVADGILASYATGSSVEYRCNEYYLLRGSKISRCEQGKWSSPPVC LEPCTVNVDYMNRNNIEMKWKYEGKVLHGDLIDFVCKQGYDLSPLTPLSELSVQCNRG EVKYPLCTRKESKGMCTSPPLIKHGVIISSTVDTYENGSSVEYRCFDHHFLEGSREAY CLDGMWTTPPLCLEPCTLSFTEMEKNNLLLKWDFDNRPHILHGEYIEFICRGDTYPAE LYITGSILRMQCDRGQLKYPRCIPRQSTLSYQEPLRT" exon <2931..2994 /gene="F13A1" /note="G00-120-614" /number=1 /product="coagulation factor XIIIb" intron 2995..7092 /gene="F13A1" /note="G00-120-614" /number=1 exon 7093..7293 /gene="F13A1" /note="G00-120-614" /number=2 /product="coagulation factor XIIIb" intron 7294..8180 /gene="F13A1" /note="G00-120-614" /number=2 exon 8181..8366 /gene="F13A1" /note="G00-120-614" /number=3 /product="coagulation factor XIIIb" 
intron 8367..9072 /gene="F13A1" /note="G00-120-614" /number=3 exon 9073..9249 /gene="F13A1" /note="G00-120-614" /number=4 /product="coagulation factor XIIIb" intron 9250..9605 /gene="F13A1" /note="G00-120-614" /number=4 exon 9606..9782 /gene="F13A1" /note="G00-120-614" /number=5 /product="coagulation factor XIIIb" intron 9783..12683 /gene="F13A1" /note="G00-120-614" /number=5 exon 12684..12863 /gene="F13A1" /note="G00-120-614" /number=6 /product="coagulation factor XIIIb" intron 12864..12950 /gene="F13A1" /note="G00-120-614" /number=6 exon 12951..13136 /gene="F13A1" /note="G00-120-614" /number=7 /product="coagulation factor XIIIb" intron 13137..14250 /gene="F13A1" /note="G00-120-614" /number=7 exon 14251..14433 /gene="F13A1" /note="G00-120-614" /number=8 /product="coagulation factor XIIIb" intron 14434..17307 /gene="F13A1" /note="G00-120-614" /number=8 exon 17308..17508 /gene="F13A1" /note="G00-120-614" /number=9 /product="coagulation factor XIIIb" intron 17509..19263 /gene="F13A1" /note="G00-120-614" /number=9 exon 19264..19446 /gene="F13A1" /note="G00-120-614" /number=10 /product="coagulation factor XIIIb" intron 19447..29398 /gene="F13A1" /note="G00-120-614" /number=10 exon 29399..29612 /gene="F13A1" /note="G00-120-614" /number=11 /product="coagulation factor XIIIb" intron 29613..30722 /gene="F13A1" /note="G00-120-614" /number=11 exon 30723..30943 /gene="F13A1" /note="G00-120-614" /number=12 /product="coagulation factor XIIIb" polyA_signal 30919..30924 /gene="F13A1" /note="G00-120-614" BASE COUNT 9901 a 5689 c 5955 g 11661 t ORIGIN 1 gaattctttt tctggcagtt caggaatttc ttcttggttt ggatccattg ctgatgagct 61 agtgtgattt ttgggtggtg ttaaagaacc ttgttttgtc ataataccaa aattgttttt 121 ctagttcctt ctcttttggg taggctatgt cagaggggag atctggggct caaggctgct 181 gttcagattc ttttgtccca cggggtgctc ccttgatgta gtattctccc ccttttccta 241 gggatgtgac ttcctgagag ccaaactgta gtgattatta ttacttttct ggatctagcc 301 accagaaggg ctaccaggct ctgggctggt actgggggtt gtctgcatag agtcctgtga 361 tgtaaactgt 
cttcagttct ctcagtcatg gataccagca cctgttccag tagaggtggc 421 aggggagtga aaggaactct gtgagggtcc ttagttgtag ttgtttgatg cactagtttg 481 gtgctggttg gcctcctgct gagaggtgaa gccagctgga cttcctgggt cgaatgggga 541 cttggagaac ttttctgtct agctaaagga ttgtaaacac accaatcagc gctctgtgtc 601 tagctaaagg tttgtaaatg caccaatcag cactttgtaa aaacagacca accagcactc 661 tgtaaaatgg gccaatcagc gggatgtggg cggagaccaa ataagggaat aaaagctggc 721 cacctgagcc agccgcagca acctggtctg gtccccttcc aggctgtgga aactttgttc 781 tttcgctctt tgcaataaat cttgctgctg ctctcttttt gggtctgcac tacctttatg 841 agctgtaaca ctcaccgcga aggtctgtgg cttcactcct gaagtcagca agacctcgaa 901 cccaccagga ggaagaaaca actccggacg caccatcttt tagagctgta acactcactg 961 tgaaggtctg cggcttcact cctgaagtca gcaagaccag gaacccacca gaaggaagaa 1021 actctggaca catctgaaca tcagaggaac aaactccgga cacaccatct ttaaaaactg 1081 taatgctcac cgcaagcgtc cgcagcttca ttcttgaagt cagcaagacc aagaacccat 1141 tggaaggaac caattccaga cacactgcca tgaggtagtg ctttcatgag agcatcagct 1201 gtggtatata ggggaggatc aggtggtggg tgggacacta gaacttccaa gagaatatga 1261 cctttgtctt cagctaccag ggtgggtagg gaaggaccat caggtggggg cagggttagg 1321 catgtctgag ctcagagtct tcttgggtgg ggcttgttgt ggctctgtgg ggaatggggg 1381 tgtgattccc aggttaatgg agttaccttc ccaggaggat tatggctgcc tcttctgtgt 1441 catgcaggtt gtcagggaag tgggggaaag ctggcagtta cagacctcac ccagctctca 1501 cacaacccaa caagcaaatc tcacttctac tgtgttcccc caaaaagcac cgaatttgtt 1561 tccaggcagt aggggagcag gattgagaac ttgccccagg ctactagcct cccagccgag 1621 aatgcaagcc agacttttgt gcctccctac ctgttaagtc tgcacaccag attcacgccc 1681 tttcctgagt tctggacagg aaagtttgtg ttcggttgga attgttacta agttcagctg 1741 gaggtttcct tctccctgtg gtcttttccc agttcctccg gcagcccctg tgagacaagt 1801 cagaaatggc ttctctgggg acccacagag acgacggggc tttacccttt gcttcctcta 1861 cccctgtatt tcgctcagct ctctaaattg tctgagctcc aggtaaggtc aaatccttct 1921 cctgtgatct agaccttcag gttccccaag ggtgtgtgtt caggggtgga caatacccct 1981 ttcacacttt tcacagtttg ggcattcaca gtatgtgggc tattttctgg gtcatgcagg 2041 agcaatgtac tttcttcaga gggtctgtgg 
attatcttgg ctttcctgtt atatccttgc 2101 agcagttctt ggagcaaaag ttcacaatgc gagtctccac atcctgctct gtccatccaa 2161 gtgggagctg caagttagtc ctgcctccta tccaccattt tttcctattt tgtattttaa 2221 attaaaactc attagaatgt aaactccatg agggcaataa ttttaatcca gttttttcac 2281 tgctaaatac ctcaaaactg ttacatgtgc tgccattatc atggacgttc taaacatttc 2341 tcatgaatga atgttgatat agcacagtgc tgatctcacc acctgccaca aagtaaaaac 2401 tctctaaatg tcaaatacta ctattataag taatactatt ctatcattga gacaactatg 2461 gttctgtatc tctttaagga gtgcagtcta ggcaacaaca tatttcaaca tcaggtatct 2521 gttatctcat gttcctaagt tcatcttttt acctgatcaa tattagtaaa gatctatggc 2581 ctcaagtgaa ctttactcta tgaatttaca attgggaact taggttgaaa atttacctag 2641 tattatttat tctgctgcta taatttatta tttattctgc tgctattgtt gccctaatca 2701 gttatcatgc tcttactgga ctctgacatg gtaagtaatg tcaattaata tagttgtgtt 2761 attttaacat tatttcaaaa tattttaagt gaaatataag actcgttaca aaaggactta 2821 gacagaggtt ataaattatt aactttttcc tcaagagaag tattgtgtac ttaatgacaa 2881 gttcctagtg cttagaattg ttaaaatctt tgtgaagcac accactgaag atgaggttga 2941 aaaacctgac ttttatcatc atattgataa tctcaggaga actctatgca gaaggtaaaa 3001 ttaattctta atattcttta ttctctaaaa catacaaatg gagattctaa tatttataca 3061 actcaagttt taatatactc ttctattttc attatcaatt ttataacata ttttataaat 3121 ataacttttt ataatattta gaccttttcc tttgagcaaa attagtaaag tgttcttaca 3181 gtagcataag tcagaaaagc aaaacaaaac aaaaaccctt gtatctctag ggggcatagg 3241 agaccaggat ctaaatactt cgaaaaggtt ttcaggttaa caatatgtcc atttcgaagt 3301 cacatgttaa ctttaattct aaatttatgc tagaatccat tcctaatact gacaaaactt 3361 gtacaatttt aattctgttc ataaggtgag agtttaatgt gggaaatttg aaaaggaaat 3421 agccataaat attttttact tttgagactt tgatttttat cacaatgaat tattcatttg 3481 ttggagacat taagtagtga ttagtaggtg aaccatataa ttgcttgtac aaacagggac 3541 acttagagag taaaagaggg gctctattaa taactgtttg gacaacacat gtaaatcagg 3601 actttccaaa caaaccaata cctatggact ccctattaga aagctttata gtcagattgc 3661 ccaatgtgta agattagtgt gcgagtggaa gctaggcagt gtacatgggt atgttgttca 3721 ggaacattat caatattatc aggccgtaga tgaagaatcc 
agggaggaca cttagcatta 3781 gcaagagtga acatacaaag aaggaagcta cagcaataca gcttttctaa aataaacatt 3841 tttgggaatt tgttgaaatt aaatacataa gttttaactg tgttttagtt gaatagttaa 3901 ctataaattt tctaccaaaa catctataat taaaattata aaagtgaaag acacagaatt 3961 aatcttattt tatactttta attctataat aggatgattt ttgcatgaat ttcttttgat 4021 tttatttttc tgtctaaata gggaaatacc agcagatata attgtctgtt atctatacca 4081 taaagacatt gcaatattct agttaatttt ctaattatta taaggttata atgctatttt 4141 acacaacaaa aaaaaaaagt aagctgcttg aagaccaaga caatgatttt atttttgcat 4201 gcttctctca actcccattg tgcacttcag ggacaagaat tgaattaatt gtgtttctca 4261 tttcatggta tgcaagtatt attggcctat gaaaggtctc actcttgctt tacttacaca 4321 ttttatcccc ttattcctaa tcttgaagtc catttcccct gggacctact acaatgtttt 4381 aacatgttac tctagctctc tcaaccacaa atagctctgg aatgaatatg taatataacc 4441 aacttatctt tctgtaagaa tgtcatcagt ctgtattcac tcacagatag cattattgag 4501 tctattcttg tacttgattt ctgtaatgag agattacttt tgcagaatgg cagttttgaa 4561 gtatatcatc tctgagagag gatgtaaatt gttgctcctc atcattctca ccaacacctc 4621 atattgtcca tttttgtact ggttgctata tggttattca taaagtgata tcccattttt 4681 atttatttac cattctctga tcagtagtaa gtgtgtccat cgcttcatgt acttttttta 4741 atatgttcag acatctgtga tttcctggtt tatatgtgat tagtatattc ctcacatttc 4801 tcattaattt tttgtcagga attcatattc tccaaggcat tccagctacc ttaacaaata 4861 atgtgcctat cctggtggtg gagggagatg tcccaggaca gagagtcttt aaagccacaa 4921 cctaggattg ttgtgcagag aatactttgc tgcccctccc tgacaagcaa ctgtgcccct 4981 caggaaatgg aatggggact atccagccca tgtttttctg tctgctccct ttttattgta 5041 aattcctttg gtgcttgggt gtgaaaaaat ggtgagaata attctcaagg aatggggact 5101 atccagccca tgtttttctg tctgctccat cagatagtgg aagttttgtt gctcatcatt 5161 ccacatccat ttccttcaaa cctaggctag cactactagt attttatgtc tcatagtagc 5221 attaaggtat acagtgatca accaaggcat tgaagccgta gtaatcacac tggatccagg 5281 ccttcttcca tccatgactg cccatgtttc ttcccaccac agctcctagg attgttcgag 5341 catttctttc ttttaagtgt ttttatgact ttaagatctt caacataaag atatattttt 5401 aaaatatatg ttttatttgg agtataacat atccagaaaa atgcaaaaat 
cacaggatac 5461 acttatatag tttttacaaa attttacacc atccagatca agatatagaa cagtgatagt 5521 accccagaaa tatgatcctt ctaaggcact aaaccccaaa ggtaaccatt ctgacttcta 5581 ttatcatgga ttagtttgtt ttcttttaaa aagttatgca aatgtataca ttaattcata 5641 tgtgattggc ttctttcact caatattatg tttatgagat tcaccatttg ttcatttctg 5701 ttgttggata atattccatt gtatgcatag tctaaaatgt gtttatcttt tctattattt 5761 atgggcattt agattgttcc cagtttggga ctatatgaat actgccacta taagcatttt 5821 tttccatgtc ttttggtata caaatatgtg ctttttttat tgggtaatat ggtttgtaca 5881 tgttttaccc caatatatat tgcaaaggtg ttctctcata gagtttggac tgatttatac 5941 tttcagttgt agaaatatgc gagttccagt ttcttcactt cttcacaatc attgagatag 6001 cttgtctttt tcatttcagc agttctattg gatatgcagt agtatctcat ggtggtttaa 6061 attttcattt ccctgatgac tagtaatgct aagtaccagg tctctaacca tttaaatatc 6121 ctattttgcc aagttcttgc caagtattgc tttgttttac ttggttgtct gttttttctt 6181 ttaatgctat ttatttcaaa attttttatg agttagcata tgttaaggag aaactttatc 6241 tattcctgaa ctgtactgtc caatactagc catcagccac atatagctat tggagacagt 6301 acttaaaatg tcactagttc aagatcagat attctgtaag tataaatatg tagtggattt 6361 taaagagtta gaaaaaaaaa gaatgtaaaa tattttcaat atattgggtt aaataaaaac 6421 ctattattaa aactaacttc aattgttcct tagtattttt atgaatgtga ctattagaaa 6481 tttttaaatt acataagtga cttacaatac atatctatta ggaagtgcag gcagggtgga 6541 gtggttcaag tcttgtaatc ctagtgcttt gggaggccaa agcagaagga tcctttgagt 6601 ccaggagttt gagaccagcg tgagcaacag agtgagaccc ctcatctcta caaaaaataa 6661 aacaaaaatt agccagatgt ggtggcatgt gcctgcagtc ctagctacag gagaggaggc 6721 ttgcttaagc ccaggagctc ctggtggcag tgagctatat ttcagcactg cactgaagct 6781 tggataactg aggaagactc tgtgtctagg aagaaaaaaa aaagtactgt tctacgaagt 6841 tatgcaaagc ttccctgaaa aagtggtctg ctaaatttta tcatgactgt tggatgttta 6901 aagtcatttg cagatcaatt atgataaaag gactccttga gttgtcacaa aagtacctta 6961 aaaatttaag taagaaaaat aagtactagt tgaagtgtct ctaaaaactt ttagtttgta 7021 ttgtcacctg attacaaatt tatgttttta gatttgtttt tccattttca gcttgacatt 7081 tactcatttc agagaaaccc tgtggttttc ctcatgtgga aaatggaaga attgcccaat 
7141 attactatac ttttaaaagc ttttactttc caatgagcat agacaaaaaa ttgtcatttt 7201 tctgcttggc tggttatacc actgaaagtg gaagacaaga agagcaaacc acgtgtacaa 7261 cagaaggctg gtctccagag ccaaggtgct tcagtaagtc agctggatat gtcactcaat 7321 gtttcaatac tcaaagaaat ttgtatataa aaataggggt ccaataaaaa tggaaagcgg 7381 ttaaaccttt tagtagataa gaacaaaatc actagcaatc tgtctttcta caaaatgaag 7441 gcataatttg aaatccagta aataatgctt ttacattaaa atgttatttt ctgtatctgt 7501 ttttaataat actctgttga agggaatagg gtaataactg aaattatcat attgcaaggc 7561 aaagagaggt aatttacttt aagtaacaag aaacttattt tgataattaa gcataggatg 7621 ccagaaaaat gcttgaaact ctggcctctg gaagggcaag attgctactc taatgcttta 7681 tcatttgtaa gactcatctc tcttcatgtg ccaacttctc tcatgatacc cattgaaaaa 7741 tgtgtttttc aattttgaat tgaaagtttg agttcaaagt tgtgagaagg taatctaatt 7801 gactcatttc attattttat atcaggctac actatcatca aactatcaat gatcttccac 7861 tagggcaggt gacccagagt aaggggcttt ggttgagcca gtttcgttta aaagaggatg 7921 ttgaatatgg ctattgtggt tgatatatct agtaaaccta tctagaagta gatgtggtag 7981 gggtagaaat tactataaga actaaaatta gtaaatacaa agagagtcag tgttaaattg 8041 aatccatttt caaaaaaatt atgcagatat ttccaaaatg aaatcgccaa taataacatt 8101 atacttttgg cttaatttta caatttagta aaagacaagc ttagtttcat cattaagtta 8161 aataattttt tttcccatag aaaaatgcac taagcctgac ctgagtaatg gttacatctc 8221 tgatgtaaag ttattgtata aaattcaaga gaacatgcat tatggttgcg cttcagggta 8281 caaaaccact ggagggaagg atgaagaagt ggttcaatgt ctctctgatg gatggtcttc 8341 tcaaccaacc tgtaggaaag aacatggtat aagaatcatt tcctaaactg aagataagtg 8401 cttatcaagc ttatattgaa tatattgaag cttatattga tatctattat ttaatagaat 8461 ttatttttca ttatgtctac aatgcaggaa gttgatatta cattgctttt ctcaagtact 8521 aggtgattcc ttaatccaaa tgttctcaat ccctacaacc cacctacgcc aatagttcct 8581 actttgcatc aataaaagga attggaatgg aataaaatga aagaagcttt ttctttatat 8641 gtcactattt cctttattac agaagaattt aagatcatgt acattatttt atattgtgaa 8701 ccaatgtcag atatacttaa aaatagttca ataatttaaa tatatgtata ctttgttagt 8761 aagttgttta aattgtaaaa atgcatagac agcacatact ttcacaattg ctactgcaca 8821 
agcctttagt tgaattttgg cctttggtca agcacttgta gactagaatt ttaaaaacag 8881 aaaactggta gagttaggtc acaaggtgta tgaagaagag aatagaggca aagaaaaatt 8941 ggggctattt ttcaattttg ccaagagttg attttaccat gtattcttaa gatttgtaaa 9001 accatatttt gagatttttt ataatctagg ttattaaaaa gtttataaaa taaataattt 9061 ttataatttt agaaacgtgt ttggctcctg aattatataa tggaaattat tccacaacac 9121 agaaaacatt caaagtgaag gacaaagtac aatacgaatg tgctactggc tactacacag 9181 ctggaggaaa gaagacagag gaggtagaat gtctcacata cggatggtct ctcacaccaa 9241 aatgtaccag taggttctca tttggaataa aatctactct cactctgaaa gctttttctc 9301 atagagaagt gtatataagt tgtcatgctc attatgtttc accttttcat gcatttttgc 9361 ctttattttt aattatgttg atagcagtct ttgataatta aagctctcaa gtagaataaa 9421 ttcattttag ctcacagaat taaattagaa taagcctaat gatatttaac aaatatataa 9481 aatgaattca taagcttttt ggcaaattta atttctattc tagttatgtc atgtaaatag 9541 tgcattttgt tgtagcagaa aaaataaaca tttgtttgta attttattca ttcttttatt 9601 tgcagaatta aagtgctctt ctttaagatt aattgaaaat ggttattttc atcctgtaaa 9661 gcaaacctat gaagaaggag atgtcgttca gtttttctgt catgaaaatt attatctaag 9721 tggatctgat ttaattcaat gctataactt tggttggtac ccagaatctc ctgtatgcga 9781 aggtaaattt ttaaatttaa tactcgcaaa atgccatgtc tttatcagct ttaatctcta 9841 ttagctctga caaaattcct gtggtcatat ctattttttt tcctgagtcc tatttttaaa 9901 ataaatatta tcttttaaat cagtgctatt tttagcaatc tttaatatat cccacctgaa 9961 taaatcttgt tttgtcttag tttgtaaatg actatatctt ttaaatttat tattttaaaa 10021 tatctattat acattttaaa tcccttatta taacgaataa cttttcttat tgggaattat 10081 taggactaat taccccaatc tggcagcttt gcaaggatat tttctcttaa tttgccacat 10141 atttccagat gtaaaaatta atctccaaaa tagtgaggta gaaagcagag aaagcaggaa 10201 aagcagaaga atataatgcc cagaagtgaa aatgggaaaa gaggtcatct tgtaaaagct 10261 gcccaaaaat gtaatatcta tggtcacatt tctattactg gaagtgtcca aacaggtaat 10321 gcaattacct gtcagaagac atccatagtt atttaaacaa acctacaatt taatatgagg 10381 agggtcctat actagcagaa gatatgaaaa cagtaattgc agatatgaaa aaaataattg 10441 ctataatgtg acaaatggta caatcaactc aggtacaaag tgcaggaata aacattagag 10501 
attctgattg gtatgctgtg gaatagcatt cacagaagtg atggacaata tttaagttgg 10561 gtcttaagag ataagtagca gtttgagaat tgggaggata gggtgaggca aagggtttga 10621 tagtcatttt ttattttctg taatatggga tataaggtct attgacaggc aggtgaaaga 10681 catgaaggta gaacttggag gaaagtggtg aagatttgaa acaggtgcta ggggaaacaa 10741 atgagggctc tgaccatggg tatgcagtgg gattgccaga actatcataa ttccctaaat 10801 gcactgtaca gtaactttaa agattttatg atttttcatt tgtgctcgac atccttggta 10861 tagtagtaga gaagtagatt ttttaagcaa atttttttaa agtctattat tccttattta 10921 ttttttattt ttttgagatg gagtcctgct gtgtcgccta agctagagtg cagtggcacg 10981 atctcagctc actgcaacct ccacctccca ggttcaagtg attctcctgc ctcagcctcc 11041 tgagtagctg ggactacagg tgtcggccac catgcccagc taattttttt gtgtttttag 11101 tagagacagg gttttgccac gttggtcagg ctggtcttga actcctgacc tcaggtgatc 11161 agcctgcctc agcctctcaa agtgctactt caaaaataaa tggcagacct acttaaggcc 11221 ttttataccc ttctcctctt ccaaattagc cagcatgaca acatgcctgt tttttccggt 11281 aaacagtcat taggcataca atggctaata taattcagcc cacaacaata taattcagtt 11341 gtgggctgaa ttatatctct cttactcctc ccaaaatata gccacattct aatactgaga 11401 accattaaac gttaccttac acggtacaag acactttgag gttaacaatc ttaaggtagg 11461 aatattatct ggaactatct atccagactc taaatataat cacaaatgct tttataaaag 11521 ggaggcagag ggagattgac tgcagaagag gaagcaggag atgtgacaat ggaagccaca 11581 ggttgaaatg ttttgaggaa ggggttgtga gccaaggaat acaggtgatt tcaagaagct 11641 agaaaaggca aagtagctga ttctcctcca aagcctccag caggaatgtg gctctcctga 11701 tatctcgatt ttagtccagt gaaactgact ctggacttct ggcctctagg actgtaggag 11761 gataaatctg tgctgtttta agccactaaa ttgtaattca tcaaagcagc aataggaaag 11821 tagtgtggtc agtattttag agcaggagac tttaatttag acagacgtgg gttcaagaac 11881 agatctgcta catgctatta atttatttac ttcataagtt ttagtttgtc tataacatgg 11941 gagaaataat agttcctact taataaagtt attttaggac taaaagaggt aatgaatgta 12001 aaatgcctgc tgtgttagta ttctattggt gcgtaacaat gtcaccacaa actcagcagc 12061 ttaaaacaac acaaagaaat tatttcccat tggtgtaggt tgggggtctg ggtgcggctt 12121 aaccgactcc ttgcatcgat ggctatcaaa gctgcaatca agatgttagc 
tggactgcag 12181 tctcatctga ggcttgactg ggcaagactt cttccaaact cactgtagtt attggcagca 12241 ttcagtgctt tgcagactgc tgaactgaga gcctctgttt cttgtggctg ttaactgaag 12301 gcagccctct gccccttgcc ttgcagccgt tctgctttgg aagctcatga tgtagcctct 12361 tgtttcttca aaaccagcaa ggagagagag tctcttggca agaatggcgt taaaaactga 12421 tgttacataa ttatgtgtgc ataatcacgt actaatccat cacctttgct gtattttgtg 12481 ggctagaagc aagtcacagg tcctactcat ggaaaggggt atcacacaag gaagttaaca 12541 tcaggaggtg aggatcaaag gaatgctcct agaatcagtc tgccattctc tctatgatgc 12601 ttgacacgat gaatatttta atttaatctt tattattttt tgtaattata tgactaaagg 12661 tgctgattaa ttttttccat aaggaagaag aaacagatgt cctcctccac ctctgcccat 12721 aaactccaaa attcaaacac attcaacaac ttatcgtcat ggagaaatag ttcatataga 12781 atgtgaactt aattttgaga tccatgggtc agcagaaata cgttgtgaag atggaaaatc 12841 gacagaacct ccaaaatgca ttggttagta acaccttgaa gaagctttgc taaaatgaaa 12901 tctgcatgtg tagctaaatg gtcagttctc tttcaactgt accttttcag aaggacagga 12961 gaaggtagcc tgtgaggaac cacccttcat tgaaaatggt gcagcaaatt tacactctaa 13021 gatttattac aatggggata aagtgacata tgcatgtaaa agcggctacc ttctccatgg 13081 atcgaatgag ataacttgta atcgtggaaa atggacactt cctcctgagt gtgttggtat 13141 gtatgctaca tttaccatca gtagctaaac ttatggtgta aaattttcca ttctgttact 13201 ttaatctgaa tattcctagg aaaagaccac tgcttaggca aattagaaac acattgctac 13261 agcacctact cttctgcttt gcatgtgtgc atgtgtgtgt gcctgtatgt gtagctatct 13321 gtagaattta agtcaatgta aagaattctt aaacaaaagt tttttttgaa cgctagtgct 13381 aagcacggtt atctgatctg tgtcatgcca ttacctgtaa gttctaaatg acaacacata 13441 aatgacgatg tgcttcatta gataagtaaa taagggcatg gtggattgag tgacgggcca 13501 tgagcattat ctacttcttc tccccatatg tcctgcaaag ttagggtagt gcattctgtt 13561 cttgaccttg ggtaagtcct gctgatattt tgcatctggc cattctgaac cttatgccct 13621 tctgagacat gtcctggtct ttcattttac tgaaaggcac caccaggcta cactatcatt 13681 cctggagact actcttcctg tcatctatct catttcttaa tcagatcaga gcaacatctc 13741 cacacacatg tttatctagg atatttatta aataagacaa aagcaatgaa ggatgctaaa 13801 gaaatagata aagtgagaga aagataccaa 
aaagaggaaa aaaattgata tttccttcat 13861 gggtctattg agacctgaga tgaagaggaa tgaaaaagaa gaaaataaaa gtggggtaat 13921 gccaaagagc aattttagaa ttgaccataa tctttaggtt gaagaacagt ttctcattct 13981 ataggtttaa gaaatgtgta tatcataatt ccttcacaat tgagaaaaga gaaaaatacg 14041 tatagttgtt tttaggtaac atatttgctg tcaactcttg cttagcaaat tttaaaaata 14101 tttgagctga aaaaattttt catatttgta atgaatttat aaaacaattg tcctaaatta 14161 tataatacaa atatcgagac actatagtat gagttactat agtatattgt tatgagctca 14221 tattaatttt aaaacaacct cttttcttag aaaataatga gaattgtaag catcctcctg 14281 ttgtaatgaa tggggctgtt gcagacggga tattggcaag ctatgcaaca ggatcctcag 14341 tggaatatag atgcaatgaa tattacttac tgaggggatc aaaaatatct cgttgcgaac 14401 aaggaaaatg gtcatcccca cctgtttgct tgggtaagaa agagaacaca tggaatgtct 14461 acgtttgtac ttttatgtga ttttcttaca gtgttttata tcatgaaaat gatgattttg 14521 tacaaatctt ttttcacaag tgttttctct ctcatttctt taagtagaaa gtccctaaaa 14581 tattagtgta ttttttctat tgtggtaaaa tatacataac ataaaattta ccactgtagc 14641 tatttttaag tgtacaattc agtggcatta gtatattaac atttttgtga ccaattagct 14701 tttgggttgt ctttctccta gctgaaattc cctccctggt atacagccct caatatctct 14761 gcttgtattg ccctcagaat gtcatttcaa gaagaataat tagtcctatt ctttccttgg 14821 tagagtatat tgctaatata ttatagagta tattatatag agtatattgc taaattttac 14881 caccgcatcc tcatctgaat ggacacgttt gttacatctt gttgattata taaaagtgtt 14941 tgctaatgac ttagataata aatggtcttc taaatgtttt acacacacaa gcacacactc 15001 cacaaaatgc tatgatatga tgcagaggta tatatcttag atttttttgc tcaaaatata 15061 ttgtaaacat ggtttgtcat acttctacaa tatatacttt aatggctgca tagtattcca 15121 taatgcagat atacatatgt aatttaacca gtttattgat ggatgtttag tgttcaacat 15181 ttttatattc caaacagtac actttcaagt aaatctttgt acatgttaaa catatttatt 15241 tttgttaaaa ataattatca tgagatcagg aattttaatt ctttttctat taattggcaa 15301 attgtctatc agaaagtttg ttctgctttg ttaaccctac cataagtgaa ttaaagtagt 15361 aatcatccta atctttgtta actttgaata ttttaatctt gacatttttt gttaagcagg 15421 caagcacaat tattttaatt tgatttattt aagaattaat taatttggaa ttttttaact 15481 ttctagtagc 
cacaggtttt ttacatgaat taattattca tagtttagcc attttttcat 15541 acttataaat ctatatataa attattttat ccaattgaga atcatgttta ttcgaattat 15601 ttttaattta tttttattta tagtgttttt aagtatatat aagtgtatgt tttaagtctt 15661 gagctctttc aaccttttgt tagtagtttc cacctttcaa gtaattctta ggagtcgatt 15721 cccattcaaa ttcagataca gatacatatt cacatagaat tttgctattg gtctcatatt 15781 tttataaatg tttttgatat taagtttttg atgctttggt tgatattgca ttgagtggag 15841 atatagtaat ctctaaggac aaagcctaga ttaaaatatt tatggcaaag ctggtaatga 15901 aacaagctca taaggaaata acaacattta aggaaataca tcattctata tctggaacaa 15961 ctgcctcaag agagttaggt ttggtcctaa tgaatcaaag caaataccaa atgaagatta 16021 atcattgtta tcagggcaag agaaaggggc tttgtatatc ttggtaacta ttgaaactgg 16081 aaaggctatc agtacaaatt cacaacagta tgataaagcg tagaattttc tctgtaggaa 16141 ttattattcc agacattaag agtctctcta acctctccat actctcttcc ctgctctaat 16201 cactccagtc tttctggtct ccttactttt ccacaaacaa gccaagaatg ctctcacttc 16261 agggcctttg cactggttgt tccctttgct tgggacactc tttcataaga aaatgacatg 16321 gttcactcag tctccttcaa gcttttgtgc aaagtcccct tcaagctttt ttacttctta 16381 tttaggccta ccctaatcac cttactgaaa agtgcaactt ctcttcacct agctcttttg 16441 ttaccccctc ctgctccagt actctttttt tcatctaatt tatcttatgc tgtagtttaa 16501 agatggctac tgtgttagtc tgttctcatg ctgctaataa agacatacct gagactgggt 16561 aatttataaa ggaaagagat ttaatgcact cacagttcca catggctggg gaggcctcac 16621 aatcatggta gaagacaaag gagaagcatg gaacgtctta caaggtggca agcaagagag 16681 catgtgcagg gaaactcccc tttataaaac catcagatct catgagactt attcactgta 16741 atgagaacag catgggaaag acccacccca ggattcaatt acctccaacc aggttcctcc 16801 catgacatat gggaattatg ggagctacaa ttcaagatga gatttgggta ggcacacagc 16861 caaaccatat cagccacaaa ttatacaatt catgtcttta ttatatttat tatgtattgt 16921 aggtgtttct cccagttaga aggtaagctc catgggagca ggagtcttgg ttttattcat 16981 tagcaggttt tctattattt tctacatggt agttaaaggt ttttatttta tagaaagaaa 17041 attattatta atctaactga ttgatgccct taaatagtat ttttgatgtt ttaaattaag 17101 tggttttagt ttatgacaat aatgtataga ttcatttgat agtaataaag tatatatttt 
17161 cttatatata tatgtgtgag aagccttaat tatgatttca attataaaat attgcttttg 17221 aaacttgtca taatttttgt tttaaaaaat atcactttta tatttgtcag acaagtaaag 17281 aatttaaaag agcattttgt tttacagaac catgtactgt taatgtggat tacatgaaca 17341 gaaataacat agaaatgaag tggaaatatg aagggaaagt cttacatgga gatttaatag 17401 attttgtatg taaacaggga tatgacttat ctccattaac cccattgtct gaattatctg 17461 tgcagtgcaa cagaggagaa gtgaaatatc ctttatgtac tagaaaaggt aaaataataa 17521 tgttctctgc aagtatcttt acttgaccaa tatctgctac ctcggttgaa aatttattta 17581 tagtttgttg ctatttttct tattattatt ctttttaatt ttgttttatt ttattttatt 17641 ttactttgag ttctgagaca caagtgcaga atgtgcaggt ttgttacatg ggtatacgtg 17701 tgccatggtg gtttgcaatt ccagtaatgg aattgctgcg tcgaatttca tttctggttc 17761 taggtccttg aggaatcacc acactgtctt ccacaatggt tgaactaatt tacattccca 17821 tcagcagtgt aaaagcattc ctatttctcc ccagccttac cagcatctat tgtttcttgg 17881 ctttttagta attgccattc tgactggtgt gaaatggtgt ctcattgtga ttacacctta 17941 tacaaaaatt aactcaagat ggattaaaga tttaaatgta aaaaccccaa accataaaaa 18001 ctctagaaga aaacctaggc aataccattc aggacatagg catagggcaa atacttaatg 18061 acgaaaatgc caaaagcaat tgagcttgag agccaaaact gacaaatgag atctaattaa 18121 cctaaagagc ttctgcacag caaaagaaac tatcatcaga gtgaacagga aacttacaga 18181 atgggagaaa atttttgcaa tctgcccatg tgacaaaggt ctaatatcca gaatctacat 18241 ggaacttaac acaaatttac aagaaaaaaa cgaacaacac catcaaaaag taggcaaagg 18301 atataaacag tcacttctca aaagaagaca tttatgtggc caacaaacat atgaggaaaa 18361 agctcaacat cgctgatctt tagagaaaat ttatacctat gaccccaatt caccaccatc 18421 atattctaat ctataaagtt gagaggagtg atccaacttc caaacttcaa tcagcatata 18481 tacctatata ggatggccat gctgacatct tttaattcca ataagtcaat tttatgaagg 18541 gaaaaaatca agaggtttta taaatgttgc cagactatca gagaattgat caccttcttg 18601 tgagcactca acaaccaaca ctgtaaagca ttgctctact tagggctttc tggggaccca 18661 ttgtctcagt actcactggc tatagggact attactttta ataaggaaga aaaggaaaat 18721 aatatataca gaaatcatgg cttctagaaa aatcccatat acacttaggg agaaataaaa 18781 tcatgattgt taatactctc atagttttaa atatgcatgt 
aaacactata acgtgggatt 18841 ctaattataa aaatacattg tatggtataa tattattgca aaaacatctt caatatttta 18901 aaaatatttg agaactgcct aaaaaatggc tttatggcta ggtgcagtgg ctcacgcctg 18961 taatcccagc actttgggag gccaaggtgg gcagatttct tgagcccagg aattcaagac 19021 caacctgagc aacatagtga gaccctgtct ctacaaaaga aaaatgcacc tgtagtccca 19081 actagttggg aggctaagac aggaggatca ttgagcctgg gaggcagagg ttgcagtgag 19141 tcatgatcat gccaccatac tccagcctgg gtgacagagg gagactctgt ctctgtagca 19201 cttataaagt actggatgtt cattatagca attcattgta tactttaaaa cttatttttg 19261 cagaatctaa aggaatgtgc acatctcctc ctcttattaa acatggagtc attattagtt 19321 caacagtaga cacctatgaa aatggctctt cagtagaata cagatgtttt gatcaccatt 19381 tcctagaagg atctagggag gcctattgtt tagatggaat gtggactaca ccaccattgt 19441 gtttaggtat gtactactaa atatgcctct aacaaagtaa aactatatat ttttaatttg 19501 ctgtcatttt tgagttaaca taacactaat ataatgctat atttgctaca tttgtgttat 19561 tttattcatg tgcttatttt atcactgctt tctattcttt cctctagatc ataaatacct 19621 atagttcagt ttgttgtaat acatattttt ttaaagacaa accacacatc aggttggtat 19681 gggctataag ttgaataagc catagtggtg tgagtaaact ttaacaatta tcattcattg 19741 tgattggtac tataatagaa aacattaggg aaacatgtat aatagaaaac aatgcggaaa 19801 taaagaaatg gacaactcaa tatggaaaga tgaagcaagt taatgtttta ttagcaaggc 19861 atcctagtaa gagtaacaac atttactaag gccctcagag gcatgaaata gcacatctgt 19921 aaataataag aaaatatttt ttatttccaa ttctgacttt tatttacttc tcttatcttc 19981 tttcactggc taggacttca gtacaatgtt gaatggaaga ggtagtggat atctttattt 20041 gcttcccatt ctcataagta aaacctttga taattcacct ctatgtgttg ttataaatat 20101 ttttttaaaa atcaagttaa aggtataaaa acctttcttt ctattttgcc gaaagtattt 20161 ttaaagaatc ttcaatctgt gtttaatttt tagcaatatc tgttgaaata attacatgat 20221 tattactttt gttttttccc ctgcaaattt aatgaattat attcgttctt ttgaatgtta 20281 aagcatgatt ctatcccata agaaattcca caaggttgtg ataaattatg catattatat 20341 gttatatttt tatttatcaa tattttactc gatctatttt catgagaaga ctggttggtt 20401 attttccttt ctggtttgat atcaagtctg tgctgacctc atcccatgaa tgggaagtgt 20461 tccgtctttt ctggttctct 
ggattgatgc tatttcttcc tttataattt taagagttaa 20521 ccagagaagc catctgggac ttgagttttt ccgtgaaaag acaaattgac agcttcgatt 20581 tgtttaacaa ataatggtaa ttatttcttc ttctgtcagt tttggtaggg agtggtttct 20641 tgatggttca ttacaactac attgtcaaat ctactaacat aaagtagtta tgacatcttc 20701 ctatcctctt tttgatgttt ttattgatgt atcttttgtt attacttcta ttaatatgta 20761 tctgtcttga catgccttgc taagggtcta tacattttat tagtgttttc aaaaaaccaa 20821 catttgcatt tgttgattca tcaaattttg ctttctattt tattaattat gctgttatct 20881 ttattatttc ctttcttcta tgttcttgga tttcttttgt tactcttttt ctggcttgtt 20941 tagatgaata catagattac taattttcag catttatatt ttttctaata catgaattta 21001 aaaatagaaa tttctctctg agcacaactt taacttcatc caaaccattt ttatatcttg 21061 gatttttact gtcattcagc tcaaaaattt tatagtattc attttgatat attctctgtg 21121 ctctgaatca tttagaagtt tctttttaaa gttctgaata tttaggggat tttctagttt 21181 tcttttggtt actgatttct agtttaattc aattaagaca tactttgaat tatatctgca 21241 ctttaaaatt tgttgatatt tgctttataa caagcatgta tttgctttag gtaaatgttc 21301 ttttcattga aaagttaact ttgtcattgt tgggtcactg ttttatatat gtgaattgga 21361 ttgagttttg tttttgatgt ttaagttaat ccaaagctgt actgatttgt gtgtgtttgt 21421 tttctatcaa tcactgagag aagcatgtta aagtctcact tggcatttta aaacattact 21481 aattttagtt ctgtccattt ttgcttttgt attttgatac tatttcatta ggtacacaca 21541 caaattgaga cactaaatct tcgtatcttg gtgtattggc attgtctaat tttaaaagac 21601 ctccttcata tatagtaatg ctttgccttt ttttttgcct tgaggtctcc tttagctgca 21661 ccaaggggct tttgacaagc attcatgtag tacacatttt tccatccctt aactttcaac 21721 cttctgggtt tttactttta gtgtgcctct cttataagca atatatacct tattttaaaa 21781 tatacttatg caatttataa ttttttccct tgtaatggaa tagtctattt acatattcat 21841 gtaattacta atatatttct gtttatacct accaacccta ccaactttct atactaggtg 21901 tatgttacta tactccatgt atatcccatt ggtcatctcc ctctttctct ctctctctcc 21961 atataggtgt gtgtgtatat atatatacac atatatatat ataatatttt aagttctagg 22021 gtacatatgc acaatgtgca ggtttgttac ataggtatac atgtgccatg ttggtttgct 22081 gcacccatca actcttcatt tacattaggt atttctccta aagctatccc tcccccagac 22141 
ccccacagac aggccctggt gtttgatatt ccctgccctg tgtccatgtg ttctcattct 22201 tcaacactca cctatgagtg agaacatgca gtgtttggtt ttctgtcctt gtaatagttt 22261 gcttacaatg atggcttcca gcttcatcca tgtccctgca aaggtatgaa ctcatccttt 22321 ttaatggctg catagtattc catggtgtgt atgtgccaca ttttcttaat ccagtctatc 22381 attgatagac atttgggttg gttccaagtt ttgctattgt gaatagttct gcaaaaaaca 22441 tatctgtgca tttgtcttta tagcagaatg acttataatc ctttgggtat atacccagta 22501 atgagatcac taggtcaaat ggtatttcta gttctagatc cttgaggagt cgccacactg 22561 tcttccacaa tgattgaact agttcatact cccaccaaca gtgtaaaagc gctcctattt 22621 ctccacatcc tttccagcat ctgctgtttc ctgacatttt aatgatcgcc attgtaactg 22681 gagtgagatg gtctctcatt gtggttctga tgaccagtga tgatgagcat tttttcatat 22741 gtctgttggc tgcataaatg tcttcttttg agaagtgtct gttcgtatcc tttgcccact 22801 ttttgatggg gttgtttttt tctcgtaaac tttttaagct ctttgtagat tctggatggt 22861 agctctttgt cagatggata gattgcaaaa attttctccc attctgtagg ttgcctgttc 22921 actctgatga tcgtttcttt tgctgtgcag aagctcttta gttcaattaa attccatttg 22981 tctattttgg cttttgttgc cattactttt ggtgttttag tcatgaagta cttgccaatg 23041 cctatgtcct gaatgatatt gcctaggttt tcttctaggg tttgtatggt gttaggtctt 23101 acatttaaat ttttaatgca tcttgagtta atttttgtat aaggtataag gaagtgatcc 23161 agtttcagct ttctacatat ggctagccag ttttcccaac accatttatg aaataggaaa 23221 tcatttccca tttcttgttt ttgtcaggtt tgtcaaacat cagatggttg tatgtgtgtg 23281 cattatttct gaggcctctg ttctgttcca ttggtctata tatctgtttt ggtaccagta 23341 tcatgctgtt ttggttactg tagccttgta ttatagtttg aagtcaggta gcctgatgcc 23401 tccagctttg ttctttttgc ttaggattgt cttggcaatg cgggctcttt tttggttcca 23461 tatgaacttt aaagtagttt tgtccaattc tgtgaagaaa gtcaatggta gtttgatggg 23521 gatggcattg aatctataaa ttaccttagg cagcatggcc attttcacga tattgattct 23581 tcctatccat gagcatggaa tgttcttcca tttgtttgtg tcatctttta tttcgttggg 23641 cagtggttag tagttcttga agaggtcctt cacatccctt gtaagttgga ttcctaggta 23701 ttttattctc tttgaagtaa ttgtgaatgg gggttcactc ataatttggc tctctgtttg 23761 tctattattg atgtatagga atgcttgtga tttttgccat tgattttgta 
tcctgagact 23821 ttcctgaaat tgtttatcag cttaaggaga ttttgggctg agacgatggg gttttctaaa 23881 tatataatca tgttatctgc aaacagggac aatttgaatt cctcttttcc taatttaata 23941 ccatttattt ctttctcttg acagattgcc ctggccagaa cttccaacac tatgttgaat 24001 aggagtggtg agagagggca tccctgtctt gtgccagttt tcaaagcaaa tgcttccact 24061 ttttgcccat tcagtaagat attggtggtg ggtttgtcat gaacagctca tattattttg 24121 aggtacattc cctcaatacc ttgtttattg agagttttta gcatgaaggg ctgttgaatt 24181 ttgtcgaagg ctttttctga atctattgag ataatcatgt ggtttttgtc attggtcctg 24241 tttatgtcat ggattatgtt tattgatttg tgtatgttga accagcattg catcccaggg 24301 atgaagccat cttgatcgtg gtggataagc tttttgatgt gctgctggat ttggtttgcc 24361 agtattttat tgaggatttt cacatcgatg ttcatcagga atattggtct aaattctgtt 24421 ttttgttgtt gttgtgtctc tgccaggctt tgatatcagg atgatgctga cttcattaaa 24481 tgaattaggg aggattccct ctttttctat tgattggaat agtttcagaa ggaatggtac 24541 cagctcctct ttgtaccttt ggtagaattc ggctgtgaat ccatctggtc ctgggctttt 24601 tttgatttgt aggctattaa ttattgcctc aatttcagac cctgttattg gtctattcag 24661 acattcaact tctttctggt ttagtcttgg aagggtgtat gtgtccagga atttgtccat 24721 ttcttctaga ttttctagtt tatttgcata gagtgtttat agtattctct gatggtagtt 24781 tgtatttctg tgggattggt ggtgatattc cctttattat attttattgc atctatttga 24841 ttcttctctc ttttcttctt tattagtctt gctctcagtc tatgtagttt gtttatcttt 24901 tttaaaaaaa cagttcctgg attcattgat ttttttgaag ggattttttg tgtctctatt 24961 tccttcagtt ctgctctgat cttagttatt tcttgccttc tgctagcgtt tgaatgtgtt 25021 tgctcttgct tctctagttc tttcattgtg atgttagggt gtcgatttta gatctttcct 25081 gcattctctt gtgggcattt agtgctataa atttcccact acacactgct ttaaatgtgt 25141 cccagagatt ctggtacgta gtgtctttgt ccttactggg ttcaaagaac atctttattt 25201 ctgccttcat ttcgttattt atctagtagt cattcaggag taggctgttc agtttccatg 25261 tagttgtgca gttttgagtg agtttcttaa tcctgagttc taatttgatt gcactgtggt 25321 ctgagagaca gtttgttgtg atttctgttc tttttcattt gccaaagagt gttttacttg 25381 cagttatgtg gtcaattttg gaataagtgc agtgtggtgc tgagaagaat gtatattctt 25441 ttgatttgga gtggagagtt atgcagatgt 
ccattaggtc tgcttggtcc agagctgagt 25501 tcaagtcctg gatattctta tttaccttct gtctcgttga tctgtctagt attgacagtg 25561 gggtgttaaa gtctcccatt attattgtgt gggagtctaa atctctttgc aggtctctaa 25621 ggacttgctt tattaacctg ggtgctcctg aattgggtgc atatatattt agaatagtta 25681 gttcttcttg ttgaattgat ccctttacca ttatgtaatg gccttctttg tctcttttga 25741 tctttgttgg tttaaagtct gttttatcag agactaggat tgtaacccct gctcagcaat 25801 ggtagacgac cctcccccca taacgctgtc gtgtcacagg ttgatctcag actgctgagc 25861 tagcagtgag caaggctcca tgggcgtgga aactgctgag ccaggcaagg gagggtatct 25921 cctggtctgt cggtggctaa gaccttggga aaagcacagt atttggtcag gagtgtacca 25981 tttctcactg tgctgcccag gctggagttc cttagaaaca aatcagaaac aaatctctcc 26041 ctctctctct ttctctctct ctctctatat atatataaat atatatatat atgagttagt 26101 tccaagatgg ccgaatagga acagctccgg tctgcagctc ccagggagat cagtgcaaca 26161 caatgagact ctctttgtac attttgttct tcaagttagt agcattcctt gaaccttgat 26221 gtacttgata tccttcatgg gttttagaaa tgtttggcta atatctttta aaatgtacat 26281 ttctcacctc ttctgtgatg ccatattaca tatattacat ctttttgcta tgccattgtc 26341 tctttaaatc tatattttct ctcattttct tcttgtttca ctgtagtaat tttttggtaa 26401 ttttctcatt gtatcatttc tttgtttcta tttggtaatt ttcttctagt ttaacaatcc 26461 tctctttaga tgtatccaat ctgctttaaa ctcatctttt gagttctcaa tttcagttat 26521 catctttgca tttctatttt atattttctt tttgtagatt tgagttatct gatgaaagat 26581 cctagcttaa catcaacttc ttgaacatat aattcatact ttttattaag gcttgtctgc 26641 aaactccaat atctggatct attttttagc cccaaattct tgatattttt gttaacagtt 26701 gattgaattc caaatattat acgtggaaaa tttcaggatg tttggatgat gtcagcttac 26761 tccaaagaag actcatctca tcctctggct gatgattagt gtagaggcag ttggttttaa 26821 tacaattaga aataaagctt tctctatgtt tgttttcaga cctctgtttg ccttttctca 26881 cagactgtaa ctctccaggg ctctttacag agggtttgtt gtgtgtatta aagctttttc 26941 tccttgactg tctctgatat tactgggaga ctgccaaatg cccactgtaa ctttttattt 27001 actttctgct tggcttctca tcctatgcct ctatacaagt caaggataag cagtacctta 27061 gggaaaagct ggtgcagagc ctcaaactca tatctctaca cttcctatcc ttctaaattc 27121 tttattcttc 
aagtcctggc tgctttggtt taccgaaatt ctcatttttc tatcttcaac 27181 tccatgagat tttcagaagt tctgctggct gtatgcctaa gcttaccatt ctctatcatc 27241 taacctctca ggtcagactt agagtcaaca aacacctcaa gagaaaaaaa tggcacatag 27301 aatttttttt tttttttaga cagtctcctt ttgttgccca ggctggagtg cagtggtgcg 27361 atccctgctc actgcaacct ccaccttcta ggttcaagag attcttctgc ctcagctttc 27421 ccgagtagct gggattacag gtgcatgcca ccatgcccgg ctattttttt tttttttttg 27481 tagagatggg gtttcaccat gttggccagg ctggtctcaa actcctgacc tcagatgatc 27541 cgcccgcctt ggcctcccaa agtgctggga ttacaggtgt gagtcaccac gcccagcctc 27601 acatagatgt tgaactcacc tgaatacatt ttttccagaa ccttacttcc ttgactaagt 27661 tccttggagg ttttctgata cctcgaaatg cttcctgttt ttcagatttt ctggttctta 27721 atagaagtgc tggtttgata taatctcctc tgtcatagct gaaagttaaa ggtccatgaa 27781 tcaactttta aatttatttt aatttaaatc tcatgttctg tagaacagtg acatcgtaaa 27841 ttatagtgct attcgacttc attttcacaa ctattttgta atgagtgtaa taattcttgt 27901 tcattttcat cacctgtagc acttccagca taatgtttga cacataggca tttaataagt 27961 agttgttgaa gaaagtaact gaacacagtg ctctcagctg cggtttcact ttctgtggtt 28021 tcagttacct gctataaact gcagtctgaa aacgttaaat tgaaaatttc agaaataaac 28081 aacttataaa ttttaaatta tgcacttagt agtataataa aatttcatgc cattccatcc 28141 ttcctgcccc agatatgcta aaaagaagct gtaaagtgct tcctttaagt gaaagggcgg 28201 aaattcttaa cttaaaaaga aaggaaaaat aaatgtatgc tgaggttgct aagatccatg 28261 gtaacaatga gtcttctgtc tatgaaagtg tgaagaagga aaaagaaatc catgctagtt 28321 tcgttgttgc acctcaaact gcaaaagtta caagccacag tgtataataa atgtttagtt 28381 gaggtggaag aggcattaaa tttgtgggca gaagacatga acagaaacat gttccaaaag 28441 aatgacaaca gggtttagca ttatctgcga tttcaggcat gcactggggc tcttgaaatg 28501 tatcccctgt aggcaatcag ggagggcata ctgtaagtaa gtttgttaat gtactacagg 28561 tatatgattg atatgccata ttagcaactc tggacctttt tacctgtctc agaagttatt 28621 ccttaatgct aggtcatcca ggagacaaat aaggacataa gaaatcccat tttctgaaat 28681 aatttgatat ttgatatttt agaaatatta atgtttttat tctgtaaaaa aaatcgaaga 28741 ggtttttttg tgtattttaa tacatacttt accattctct ttggtttacg atacatggga 
28801 aaggacacaa ctatactttg tatgcttaga atttaaaacg tcttaatcca gtcttgatat 28861 ttcttaacta gaaatttgaa agagaagtaa ctgtcaacca ttttggatgg catgaggtta 28921 taaagcttgg gtcagtgttg agaatctaca tttccataaa atgttattta tacagcaatt 28981 aataatttgt ctcatgagaa ttttatttga actttctgaa gaaggaaaag ggaaaagagc 29041 atgatatcaa tgaccattag taactgtcaa tgcttccccc caaatacctt tttgtttgtt 29101 tgctatcttt gaaaaagcta atgacataaa actaaaatac cttttagttt ttaaaactgg 29161 aagaaaatac cttttctcca aatccaggac cacagagaga aatggctcac tgttttctgt 29221 gagatttgaa catattttct gactcaaata ttaattctga ttaactgact tgctgctctt 29281 aggcctgaga attttaaggt cctcatgagt aggcaacaca acgattcctg ttcttaccaa 29341 gatgtaacaa gatagttttc ttctttttct ttcttttttt ttaaattgtt gttcccagag 29401 ccatgcacat tatcttttac tgaaatggaa aagaataatt tacttctgaa atgggatttt 29461 gacaacagac cacacatttt gcatggtgaa tatattgagt ttatttgtag aggagatact 29521 tatccagctg aattatatat tactggatct atacttagaa tgcaatgtga cagagggcag 29581 ttaaaatatc caagatgtat tccaagacaa aggtaagaag tttttttttt ttggtcagat 29641 tatttttttt ggtcagattg ttattcaaca tttcagaaat gttatgttct ataccactca 29701 tactttatgg caattgcaaa gaaaaaataa taatattttc ttcatttttt ttacaccaaa 29761 caaacaactg gaagtggtaa tcaattcaac aggccttctg aattgtctga agtgaaagca 29821 acgtttgcca tttgatattt gtgacacttt ggcaaatgga aaaataaacc aatttatacg 29881 tgcaatattt caatgtgcaa aaattaacct atatgttctg tgataaggac aaaaatcaca 29941 ctttctccta cttctcaata taagttcaag cactactaat aaaggttgga aaatgtggta 30001 tttattttca ggaccacatt tttgtgtaac ttttggaaat tacatactaa agtgtttcct 30061 catatgacag tttaaaaaat atatcctctg ctacttacag gaaataattt aacaattttc 30121 ttagagaaat gttaaaaaag tataataaaa tgatgtaaac cattttgtag gatcacaatt 30181 ttaacccttt ttgaagacac tgtatataaa aaatggaaat tatatttttt gttcaactaa 30241 gaatattaaa aatataagta gtggcaaagt gcatgtggtc ctgttttaaa aaatgcaaac 30301 aatgaactgt atgaactaca taaagtcagt tttttagatg ctttgaccac tctagtaaaa 30361 aaatatttaa aagacacaca atagtgggaa aaatttttat ttttatttct aattgtggta 30421 gcatcagggt ggaaaaaatt ttatatgcta tttaaatgtt 
ttggtacagt atgaatttct 30481 cacttttaag agaaaaaggt aaacacgtag aaaatattta aaaaatcatg agtagtgtag 30541 taaataatat tgcattttta aaagggaact ggtggtatta tttactaagt aagatcattt 30601 attcttatta gtacttaata aaaacaatga ctcattatat gaaaaattga aaggctgaga 30661 ttgtaattaa cacctgactg caattgatgc ttatttcaaa aatctctctt tttcccctca 30721 agcactctgt cttatcaaga acccttaaga acatagaaat gaatggcaga aagaggagtc 30781 atatttcaat acatcatgaa attccttata aaatataatt ttgaggaaat aagttaaaaa 30841 cttcggagtt tttccttgct taaatatttg aatctaaatt atttgtgctg aacatttcat 30901 tatttataaa tgaaaaccaa taaatgtcta atttttagtt tgtatgattt gatgtagaac 30961 ttaatctctt ttgtatgtct tccatctgtt acaaattaat ctacacatca tattttcatt 31021 tcctaatgcc cattgactag actatatagt tgttttcaat atactgtatg tgcaaaaatt 31081 tcacattcac attcgtcatc atctttatta atattaaggc ttttgttacc cttaattcaa 31141 agtttatgtc actattaaag ggccaaacac actagagata agaaactata tgctgtatta 31201 gaggatctgc cttattaaag gcagaagaga ctgcccttca gactttctaa atgtaaaaaa 31261 cttttaaaat taactgtaat atttgctaca acgttaataa ccaaattgtt tatgaggtgg 31321 tgtactacca tatttgaaca tgtgctcaaa tattgttaaa gagacacaat taaagaaaga 31381 atgacccttg gaattttatt taattttatt tatttattta tttatttatt tatttattta 31441 tttagagaca gagtcttgct ctgtcgccca gcctagagtg caatggcatg atcttggctc 31501 actgcaattt ttgcctcccg ggttcaagca attctccttc ctcagccttc caagtagctg 31561 ggattacagg cgtgtaccac cacgcccgct acggtttttt tttttttttg tatttttagt 31621 agagacaagg tttcaccatg ttgaccaggc tggcctcgaa ctcctgacct catgtgatcc 31681 acccgccttg gcctcccagg actcttggca tttttacatt ttacactttc aagtcttttt 31741 ttatatcaaa taatttttgg aattatgcag actacaaaat ctgaaataag agagtcctac 31801 aagcatcaaa aaatgagtgt cagtattatc aaggtctgtt agagacatga aacagcaaca 31861 ctgtcacttt ttcatttgaa gtgagagtta tacttctaat tatttcagct cattcataag 31921 catatgtaat agcactagta cccagatact aggaccccga ttaggattgg ttgttggtat 31981 tagaacgagt tttcctttct agaaactact cagtttttct aactcaagat cttagtttct 32041 cattacgatg atattctcaa atgcccaatg tacaattctc tactgctggg tatgaaatag 32101 cttcttttgg aaatgatgcc 
acagctgacc taactctgtg gattaaataa gtattaattc 32161 attcaatgtt aaacatgtat atttcagaac tgtccttgag ttcatattca ttattttaat 32221 taatctcctg tagtaggttc ccacagtgga tctcataaaa atggaaattt aattaaaatt 32281 aaatatcaac agaaaagcat taatcttttt aaataaattt cccaatctat caattaaata 32341 acattattga gcacctgcta tatcccaggg aacatgctgg gtagtacagg attcaaaatg 32401 attaaaattc acctcgttat ttcaaagaac tcacaaagta aaaaactact tacttagtaa 32461 aagagataaa cttgtactag tgattgcaga ctcaaaatgg gtgatgattg tggcttatct 32521 agtgatattt ctagtgaact ggagatggca tgcactgttt aacctggaca actacttcgc 32581 tcaaggtgaa taattgtcat gggggaatat tgtatccagt attgccatat gttctgattt 32641 gttaattgtg aaatgcactg atttttatat gttgcgagga gataaaaaat tatttaggca 32701 gtgcagctca aagtaaactc cacatgctaa agaaataaaa attttgaaaa gtgcaacagt 32761 ggaagcatgt acgaagtaaa gtggtggcac aaagtaggta gtatttaacc tttttttttt 32821 tttgtatgaa tgtggtatag gaagcttctt ggaaaaaagt taaagtaaat ctgaggttta 32881 caaaactaga aaacacactt caagtagaag gaatggcatt tgcaaagtca aagctcatac 32941 tctatgctag gcattacacc gagcactgaa aataaaacaa tgaataagac acagactgtt 33001 tctgctcttg tggagcttgt ctctcggagc ggaagaaaga tataagcaaa tccttacaaa 33061 ataaggcatg gcagcaaaga caccatcttt gtcagtaaac accagatgat actgcctagt 33121 actgggtgtt taaaaagtct tcagtgagga agtgatatat aagtgacata taggctgaga 33181 ttggatggga gaaatgataa gaattc // LOCUS HUMBHSD 9404 bp DNA PRI 31-OCT-1994 DEFINITION Human 3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4-isomerase (3-beta-HSD) gene, complete cds. ACCESSION M38180 NID g179467 KEYWORDS 3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4 isomerase. SOURCE Human leucocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9404) AUTHORS Lachance,Y., Luu-The,V., Labrie,C., Simard,J., Dumont,M., de Launoit,Y., Guerin,S., Leblanc,G. and Labrie,F. 
TITLE Characterization of human 3 beta-hydroxysteroid dehydrogenase/delta 5-delta 4-isomerase gene and its expression in mammalian cells [published erratum appears in J Biol Chem 1992 Feb 15;267(5):3551] JOURNAL J. Biol. Chem. 265 (33), 20469-20475 (1990) MEDLINE 91056097 REFERENCE 2 (sites) AUTHORS Lachance,Y., Luu-The,V., Labrie,C., Simard,J., Dumont,M., de Launoit,Y., Guerin,S., Leblanc,G. and Labrie,F. TITLE Characterization of human 3 beta-hydroxysteroid dehydrogenase/delta 5-delta 4-isomerase gene and its expression in mammalian cells JOURNAL J. Biol. Chem. 267 (5), 3551 (1992) MEDLINE 92147720 COMMENT Draft entry and computer-readable sequence for [J. Biol. Chem. (1990) In press] kindly submitted by F.Labrie, 27-AUG-1990. The two accession numbers, M38178 and M38179, are rodent sequences which are published and discussed in 'J. Biol. Chem.' Vol. 266, pp. 583-593, 1991. FEATURES Location/Qualifiers source 1..9404 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leucocyte" /tissue_type="blood" /map="1p13.1" exon 1387..1439 /gene="HSD3B" /note="G00-120-056" mRNA join(1387..1439,1569..1798,5682..5846,8014..9231) /gene="HSD3B" /note="G00-120-056" gene join(1387..1439,1569..1798,5682..5846,8014..9231) /gene="HSD3B" exon 1569..1798 /gene="HSD3B" /note="G00-120-056" CDS join(1654..1798,5682..5846,8014..8825) /gene="HSD3B" /codon_start=1 /db_xref="GDB:G00-120-056" /product="3-beta-hydroxysteroid dehydrogenase/delta-5-delta-4-isomerase" /db_xref="PID:g179468" /translation="MTGWSCLVTGAGGFLGQRIIRLLVKEKELKEIRVLDKAFGPELR EEFSKLQNKTKLTVLEGDILDEPFLKRACQDVSVIIHTACIIDVFGVTHRESIMNVNV KGTQLLLEACVQASVPVFIYTSSIEVAGPNSYKEIIQNGHEEEPLENTWPAPYPHSKK LAEKAVLAANGWNLKNGGTLYTCALRPMYIYGEGSRFLSASINEALNNNGILSSVGKF STVNPVYVGNVAWAHILALRALQDPKKAPSIRGQFYYISDDTPHQSYDNLNYTLSKEF GLRLDSRWSFPLSLMYWIGFLLEIVSFLLRPIYTYRPPFNRHIVTLSNSVFTFSYKKA QRDLAYKPLYSWEEAKQKTVEWVGSLVDRHKENLKSKTQ" exon 5682..5846 /gene="HSD3B" /note="G00-120-056" exon 8014..9231 /gene="HSD3B" /note="G00-120-056" BASE COUNT 
2604 a 2198 c 1938 g 2664 t ORIGIN 1 tctgtgaact taaactagac caggagaaat ctgcaagtgc aatttgaatt gtgcccttgt 61 tcagtaaatc catctggagg gaaatccatg taagcccatt atctcctctc tcagactatc 121 atttgaaatg ttgctgctgt cttacaaaat ttagaaattt gcaatggtga cagctccaat 181 cacttagttt tatgtgaggt gactttcttt cctacccaat atatctgaaa cctcaatggt 241 catttgcccc tggagagccc tgggttccta gatgaacaac atcattgaag gaaagctatc 301 tggtgataac cacttaatga tatggcctgt atcacaagag tggggatttg tggttgtatg 361 ccccaatgac tcttatcata ggtgttcgaa cttatccata tttttggctg caaaacgatt 421 ctcagcaaat tcttgaactg gacatactgt gaatgcttct gaagcctatg ccccagaact 481 gcccaaatat caccgccata agcttcccaa gacactttac tcaaaaaaat tgaaatataa 541 tttacatgca ttaaaatgcc caggtcttca gtgttcagtt cagtggaact tgacagttat 601 atatgcccat gtaagtagca ccccaaaaaa gatatagacc atttccgtca ctccagaaag 661 ttgttttctg cctgttccct gttaattccc tgaccttccc ccaatctacc tcataggcaa 721 ctacttcctg atttttattc ccacagttct cattttattt ctatatttct ctaatttttt 781 cttctaagat ctctgtggtt ctggatatat atttaaatat aaaattattt tgaaatgaag 841 ttttgtgcag ggagtacaga aaaattgacc gttgattgtc tctgttgttt atttttctat 901 ttcatgattt tcattatttt tattattggc ttacatttat taattttgtt tttttttttc 961 actcgttttt cacatttttt tttttttttt ttgagatgga gtcttgctct gtcacccagg 1021 ctgaatggca ggggcaccat ctcagctcac tgcaacctct gctcccgggt tcaagcaatt 1081 ctcccacctc agcctccgaa gtagctggga ttacaggtgc ccgccactgc accgggctaa 1141 tttttgtatt ttcagtagag gtggggtttc atcatgttga ccagctggta tcaaactcct 1201 gactttgtga tctgcacgcc tcagcctccc aaagtgctgg gattacaggc gtgagccact 1261 gcacccagcc ccgtttttca ggatcccata ggaggagaga gcaatgagta catggccaga 1321 gatcaaagtg ataagggttg ggccagaagc cacagtgcat aaagcttcag acttgccaca 1381 ggaaatgagg tgagaagtac gtccactctt ctgtccagct tttaacaatc taactaatgg 1441 ttagagattt ttcattttct ttcagctact cctgcagtgg tggggacaca gaatgtttgc 1501 aaaaaaaatg gggtggagga aaatgaggca tctgtgtgag tatataacca tttgacatct 1561 ctttttagcc ctctccaggg tcaccctaga atcagatctg ctccccagca tcttctgttt 1621 cctggtgagt gattcctgct actttggatg gccatgacgg gctggagctg 
ccttgtgaca 1681 ggagcaggag ggtttctggg acagaggatc atccgcctct tggtgaagga gaaggagctg 1741 aaggagatca gggtcttgga caaggccttc ggaccagaat tgagagagga attttctagt 1801 aagtaaactt gggtcatggg tgtgtggttc catcttaaac actgcatgta tgtgggggga 1861 gatggacctt gtctagcaag ttattgaaat ttgtagccaa atctaagcca atctcacatc 1921 caaagtcatc aagaaataaa tattaaatag tataaaatgg catagtatga aagatactgg 1981 gtgggatttc cagagactag attctggccc tgacccagaa cttgagaagc agccacctca 2041 gcctccaggc ctcttttctt ctcatctaga aaatcccaca tcagttcttt gtcttgtctg 2101 caactctttt atgttctgaa gcttttgtct tggcaattgc tgtacaacat tcataaagga 2161 cactattacc ctggagacct caccaatggg tcattgtctc tctagaacgt gccaggctgt 2221 gttttgtttt gcagcttcct tagtatcctt tgtgagggca aaataaaagg aagttaaggt 2281 aaagatgagt gtataagtgg gtgtgtgtga cagagagaga ggaaaacaat aagggggaga 2341 gagagtcggg gagagagagt cggggagaga gactcagaga gagagacaat gagaaagaca 2401 gagaaataat gaaaaatata tagtgagaga gagagcatca gtgagagaat gtgtgtgcac 2461 agtgcacagc acagagcaga gacagagtaa gaggcagtat aaggccagac atgcctcatt 2521 tagattttgc atatatggct tttttttaaa aaaaaaaaaa caaaacattt acctctgttg 2581 ctcatcatca aaaagaaatt gactgcagaa ttttctgtgg caactccttg tacaaatggc 2641 cccaagaggc aggtagaaaa ggcatcacat ccaacaggac aatgaacgca tcaaggtccg 2701 gaatcacaaa gcaacatgct aaggccatct gattgatgcg tagcagagct cagaccagag 2761 ctttatccat aggctatttc acctggcccc agagctgatg cttggcaaag tggatcttca 2821 gtctctccct ttgctccttg gtgtgcccca aaacaattcc ctgtcctcct ccccaaccca 2881 ggggatttgg gggactgtgt gtctacatgt gggctctggc tgcccagagg tcctgccctg 2941 cccacagaga atcccccact tgacagcact cagggaccac acccaacact ctacttttgc 3001 tcttctctga cctcttgaaa tgttcgattt cttccaccct ctgttttaat gttgtctttg 3061 ataaacaaat atgaaatgct tataatcctt gcctgtgtct cagcaatatt tgcaatttaa 3121 catctgaagc cagaaaaaag caatgctcag cctagaggta ttgttcccct atggtaaaaa 3181 tgcagatcat ataatagaca ggcagggatc tgtatgaaga taaaaacaga gaaagcattc 3241 agtgttcttt ttgtgccagg cacttggcca aaggtttgcc aactcatttg ctcctcataa 3301 caaccctatg agatcggtaa tattatcctc cttttttaaa gatgaggaga tagaaacaga 
3361 gaggttagtt aactttttga aggtcccaca gttagcatat ggcaaagcta gatttgaacc 3421 cagacagact agtgcagcat ctgtgctctt aagcagtagt ttgtcttaca gctcctttag 3481 gccttcctct agcccaactt acaccataaa agcaactcca tttagtctac ataaaacact 3541 tgccaagtct acagtgcctt ggagaagaag ggagccatga cttcatcaac aaagtacctt 3601 ttcctagtcc ctctagatcc ctgggaactc actggggaag agtccttttc tctcagcttt 3661 ggacttcttc cactttgaca ctcctttgag ccagtctgtc tgaagtggca gcaaagaaga 3721 agcagaggaa ggcagcagga agggcagagg tggaactaga accagagaac atcccactct 3781 caggctgtta cccacatcac caccagcagc ctcttccctg actcccccac tccaggacct 3841 tcttggagtt gcctgacaac gggaagaggt tgggagtaga caaaacatgg cagaggcatc 3901 aggagagcaa gtggtcctgc ttgtacccca ggaaggaatt tctccagctt cgaatttttc 3961 actctgaatt ttgctctcac tggtctttgg tctttgttgc caatttcagt taaactggct 4021 tattctttgg aaaagaatga tattaaacaa cagaaaagtt gatattaaac agccagtcct 4081 gcacctggtg tacagggtac aagcttggag gcagggtggg cgtgtttaag ccttcactag 4141 aaacgtgacc ttagatctgt cccttaacct ctctgagact cagtttcctc actgataaag 4201 tgggactcag aatacctacc tcatgagata gctgtaaaaa gaaataaata tcctgtaagt 4261 tgcatacaga tgataaagca caaacaagtg tagagtattt atccaagctc catgtagcat 4321 ggtgttcatt ttgtgggcta attccaggtg acaaccttac tggcagcgtc acttcctgtg 4381 gatttcagag gaacctgaag acattgatgt tgtaaccact gcctccagct agacagctgc 4441 gatgttagtt cattggtggt taactcattt tcagcattgc tacctgcttt tgatgatatc 4501 tgataatagt tttccaggga aattcatcat tgaagcccac ccacaattcc tgagtaatga 4561 ctccactctc ctcccagttg cccaagcagg aaactgtagg ggtgatttta tcttctcctt 4621 ttcctatcag ctcagatcca ataggtacca ggaccttcaa ggtaaggtct gaaattcaaa 4681 cctggtatat ctctcccctc ctctgcttcc actgctggac cctgcttcca gtcctcatcg 4741 aaactttggc acagacattc tgattaatca cttgatcatc agtctttctc aaatccccta 4801 taccagtcca ttttatatcc tcaagccaag attaacttcc tacaccacaa ccctgctttt 4861 gttccttttt ttgttgttgc ttgtgtggtt ttcttttttt tttttttttt tttttttttt 4921 tttttttttt tttgagacaa ggtggtcttg ctctaccacc caggctagag tgcagaggtg 4981 tgatcttggc tcactgccag gaaccagaga ccccagcctc catttatcct cccacctcag 5041 
tctgcagagt agctgggact gcaggcaggc acatgccacc atgcccagct atttcatgta 5101 ttttaataga aatgggattt ccccatgttg tccaagctgg tctcaacctc ctgcgctcca 5161 gtgatccatc catcacagcc tcccaaagtg ctggaataac aggcatgagc cacggggctt 5221 ggcctttttg tcattttatt ccaaatataa ataaggggtt ttctttaaca tgatgctgga 5281 gtcccaaaat tatgattcta tacatgattt ctcattactt acagaattaa cttcaaactt 5341 ttccctcata ctctctacaa tcacactact ctccttagaa aatttgatgg gcttcccatg 5401 actctagcaa tgcttatatt ccaggcatac ctgacagctc actgctcaga ttacttttct 5461 aaaatgagat gggagagaaa cagatgtttg ctctttccaa gaaattgata cacatgctga 5521 tttctgtgcc ttttttactt gttctttccg tagaatgtac accctccact ctaacaccct 5581 actctaacca tccttgaaca cctatgtaac atcaccttta tcagaaaact tctcagccag 5641 atacagaaat cattccaatg acctgacctg tgttcacaca gaactccaga acaagaccaa 5701 gctgacagtg ctggaaggag acattctgga tgagccattc ctgaagagag cctgccagga 5761 cgtctcggtc atcatccaca ccgcctgtat cattgatgtc ttcggtgtca ctcacagaga 5821 gtctatcatg aatgtcaatg tgaaaggtat ggtaggctgg ggaggagatg cagcaaggtg 5881 gggaattaag gatcacaaag aagggccagg aagggaagag aagtccctcc actgaacacc 5941 tgctgtgctc tgggccaagt gcctttgctg atcactacga ataggagagt tcaagactgc 6001 taactttagt tttttagatg ataaaactag gactgagaga gggcaagtaa cttgtccaag 6061 gtcccccagg taagtaagca ggtaggagag ttagacttta aactcatccc tgtgtgactc 6121 caaaggctct ttctactgtg actccaaaag ctctttctac tgtggttcca atcaaaagtc 6181 aactaatttc tgacttcaga ctcttgatac ccagacaccc cttgcctccc aggccagaag 6241 cagaccttgc aggtgccact gtagtcacca ttttgaacct tgtgtgtagg ctgatgaaaa 6301 cattcagagc cctcctgccc acctcaaaca aaaagtcctc tcgagagaac tagcaaagct 6361 ggttcacaag gtctgtcagg acagaattat ccagcacatg ccttcccaca atattttttt 6421 aacagtgaga tttttccagt atcaagaata cagtctcttc agctcaccac agggcctact 6481 attccagagc cgttttcagc caggtgccca aagtgcaccc tcatttaatc ttcatccctt 6541 aaatgcttgg cgtttccttt caaagtatgg ctttccaact accttaaatt cagacacatt 6601 tccttaagga atcttctcag aaatctcacc ctctgatctc cagagtccag tctcttcagc 6661 accatagcac cccaaacccc ggcctggccc tccgcattta gtctactctg actctttttc 6721 caaagcaacc 
aattgctttt gagtctccag caccataaaa gaaaaaaaaa aacaacttcc 6781 agtttctact tacccatact tctaagtacc cctgctgtgc aaggagggtg ctaagagtaa 6841 ccagttttaa ttactacttc aacagctccc aaaaggaggg gatatgaaat gacaggggaa 6901 gtgttagagg catctgtttg ctaaaaccat tataactccc aagcttagaa aatacttcac 6961 aatgttggtc atttccccct tgaaggtaaa ctaactccaa attttctctg ccacaagatt 7021 aatttttgaa cacatgaggc tgtctttcca gaaactcaaa ttgcacagag acattaaaag 7081 ggctaagaat acaaaaattt gtaaataaaa atgatccttt ccctaggaaa ttccttcctt 7141 cttcaaggga aggcactatg ggtacctcca tgagctttgc ttctacacag ctcaggggag 7201 gctgcaaggt cctcccactg caggtgttac cagcagagga cacacttctc tccccagccc 7261 tccaccaggc tccacaggaa atgccagggc agggtttaaa agaaggttta tcacctccac 7321 tttacataac caggtaaaca gagtcacagc cagaattgga agcagctttc ccagggaatg 7381 cacaatcagg aagagtgtgg gtttccagca tctccccaca acccactgct tgctcagtga 7441 ctccaggatg gtccttgtga caaagtttta ttagagccca gcaaaatgag aaaagtacca 7501 acaatttcat atatattgaa gagtttatat tgcctatata caatggtgta tgtgtgtgtg 7561 tgatgtgaat tcttcaactg cccattttgc ctaaggactg gaactgtcca tattcacaga 7621 gtatagcctc ctgatttttt agattttaaa tgatctgact ttaatggtac tgattacaaa 7681 tcataatttg tcaataccct tacttggcaa aataaaaagt gataacccta ggtccctttc 7741 caaaattaaa gtacacacac acacacatcc atgaaaccca aaagtctaga aattcttact 7801 ctccagaacc cacatgtcag tttctcttca ttttggtatt tttcttatgg ctgtagtacg 7861 accaaatctc agacagaacc acagaagaat gtaccctgag tctgttacaa ccaccatatt 7921 tgggagtggg gggtggggca catagatctg tgttcgtggt tggcacctct tagggatata 7981 tcctgacagt gacaatatgc tcttcatgga caggtaccca gctcctgtta gaggcctgtg 8041 tccaagctag tgtgccagtc ttcatctaca ccagtagcat agaggtagcc gggcccaact 8101 cctacaagga aatcatccag aatggccatg aagaagagcc tctggaaaac acatggcccg 8161 ctccataccc acacagcaaa aagcttgctg agaaggctgt actggcggct aacgggtgga 8221 atctgaaaaa cggcggcacc ctgtacactt gtgccttacg acccatgtat atctatgggg 8281 aaggaagccg attcctttct gctagtataa acgaggccct gaacaacaat gggatcctgt 8341 caagtgttgg aaagttctcc actgttaacc cagtctatgt tggcaatgtg gcctgggccc 8401 acattctggc cttgagggcc 
ctgcaggacc ccaagaaggc cccaagcatc cgaggacagt 8461 tctactatat ctcagatgac acgcctcacc aaagctatga taaccttaat tacaccctga 8521 gcaaagagtt cggcctccgc cttgattcca gatggagctt tcctttatcc ctgatgtatt 8581 ggattggctt cctgctggaa atagtgagct tcctactcag gccaatttac acctatcgac 8641 cgcccttcaa ccgccacata gtcacattgt caaatagcgt attcaccttc tcttataaga 8701 aggctcagcg agatttggcg tataagccac tctacagctg ggaggaagcc aagcagaaaa 8761 cggtggagtg ggttggttcc cttgtggacc ggcacaagga gaacctgaag tccaagactc 8821 agtgatttaa ggatgacaga gatgtgcatg tgggtattgt taggagatgt catcaagctc 8881 caccctcctg gcctcataca gaaagtgaca agggcacaag ctcaggtcct gctgcctccc 8941 tttcatacaa tggccaactt atgttattcc tcatgtcatc aaaacctgcg cagtcattgg 9001 cccaacaaga aggtttctgt cctaatcata taccagagga aagaccatgt ggtttgctgt 9061 taccaaatct cagtagctga ttctgaacaa tttagggact cttttaactt gagggtcgtt 9121 ttgactacta gagctccatt tctactctta aatgagaaag gatttccttt ctttttaatc 9181 ttccattcct tcacatagtt tgataaaaag atcaataaat gtttgaatgt ttaatgtgga 9241 gggaggtgtg agttatgtta atacatactt tcctcctttc tccttctttt ccaggtattg 9301 ccatctccaa tttagaagag tagcataaaa cctgggttgg ggagagggca gagtagtggg 9361 agggccagga agggcgatga agactgaata agctctacta cctt // LOCUS HUMBLYM1 1004 bp DNA PRI 31-OCT-1994 DEFINITION Human Blym-1 transforming gene, complete coding region. ACCESSION K01884 NID g179497 KEYWORDS Alu repeat; Alu-like repeat; Blym-1 oncogene; c-myc proto-oncogene; repeat region; transforming gene. SOURCE Human (Burkitt's lymphoma) DNA, clone pHuBlym-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1004) AUTHORS Diamond,A., Devine,J.M. and Cooper,G.M. TITLE Nucleotide sequence of a human Blym transforming gene activated in a Burkitt's lymphoma JOURNAL Science 225 (4661), 516-519 (1984) MEDLINE 84250206 COMMENT The human Blym-1 gene activated in Burkitt's lymphoma cells, was cloned into plasmid pBR322 to create pHuBlym-1. 
The pHuBlym-1 plasmid induced transformation of NIH 3T3 cells with an efficiency of approximately 4500 foci per microgram of DNA. The cloned chicken Blym-1 gene hybridizes to a small family of sequences in DNAs of both chicken and human cells, suggesting that it is a member of a small gene family conserved throughout vertebrate evolution. FEATURES Location/Qualifiers source 1..1004 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p32" gene 107..156 /gene="BLYM" exon <107..156 /gene="BLYM" /note="Blym-1 protein; G00-119-038" /number=1 CDS join(107..156,535..661) /note="Blym-1 protein" /codon_start=1 /db_xref="PID:g179498" /translation="MTLRGLRLQWRKQLKMRARPCLLEKRKKIVSYISFLLSDLKGTL AIDSLYSLQFAGGN" intron 157..534 /note="Blym-1 cds intron a" repeat_region 266..313 /note="Alu-like repeat" exon 535..>661 /note="Blym-1 protein" /number=2 BASE COUNT 314 a 178 c 209 g 303 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcacca ttccagatac cattaagaac attcatgatt tatggaagga ggtcaaaaca 61 tgaatattta taggagtttt gaaaaagttg attacaaccc tcatagatga ctttgagggg 121 tttaagactt cagtggagga aacaactgaa gatgaggtgg aaataacaga tcatagaatg 181 agaagccctc ctagggataa ttattattca catgattgct tttcagataa accaacacta 241 atgtttttta aaagtgagtt aggttgctgg gtggtggctc acgctgtaat cctagcactg 301 tgggaggctg aggcaggagg attgcttgaa ccaaggtatt caagaagagc atgggcaaca 361 tgatgagacc ctcgtctcta ccaaaaattt aaaaattagc tgggcagggc atgatgatgc 421 acatccctaa tcccagctac ttgagggctt gaggtggcag gatcacttga acctaggaac 481 attgaggctg cagtgagcta tgatcttgcc actgcactcc atactgcatg acagagcaag 541 accctgtctc ttagaaaaaa gaaaaaaaat agtgagctac atttctttcc ttctcagtga 601 cctgaagggt actctagcta ttgacagtct ttactctctt cagtttgctg gaggtaactg 661 atttcattca gttcctaata ccatataaat tatagtttca ataaaaatta attcccatcc 721 ctgaggagtg cttacctact cgggtactat tttttttttt atacaagtgt gtgagttctt 781 cccctttaag atttttcata ggggcagcat tccctaaaag taaggaattt tcaggcaagt 841 gggtatgagt tgaactctca ttttactatt tcaaactgaa tgagagtaag ccagcaattc 901 aggttttttt catcaagaca 
atagaaagaa ttttaccttt cttacagaaa ctggttatgt 961 ttaattacat atcatgtgta aagaaattgt caagtagcga attc // LOCUS HUMBNPA 1922 bp DNA PRI 15-JUN-1990 DEFINITION Human brain natriuretic protein (BNP) gene, complete cds. ACCESSION M31776 NID g179514 KEYWORDS brain natriuretic protein; protein hormone. SOURCE Human DNA, clone H1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1922) AUTHORS Seilhamer,J.J., Arfsten,A.E., Miller,J.A., Lundquist,P., Scarborough,R.M., Lewicki,J.A. and Porter,J.G. TITLE Human and canine gene homologs of porcine brain natriuretic peptide JOURNAL Biochem. Biophys. Res. Commun. 165, 650-658 (1989) MEDLINE 90088474 FEATURES Location/Qualifiers source 1..1922 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 400..1864 /note="brain natriuretic protein mRNA and introns" CDS join(498..629,861..1116,1659..1675) /note="brain natriuretic protein" /codon_start=1 /db_xref="PID:g179515" /translation="MDPQTAPSRALLLLLFLHLAFLGGRSHPLGSPGSASDLETSGLQ EQRNHLQGKLSELQVEQTSLEPLQESPRPTGVWKSREVATEGIRGHRKMVLYTLRAPR SPKMVQGSGCFGRKMDRISSSSGLGCKVLRRH" exon <498..629 /note="brain natriuretic protein" /number=1 intron 630..860 /note="brain natriuretic protein, intron A" exon 861..1116 /number=2 intron 1117..1658 /note="brain natriuretic protein, intron B" exon 1659..>1675 /note="brain natriuretic protein" /number=3 BASE COUNT 430 a 552 c 519 g 421 t ORIGIN 1 ctgtgagatc accccgtgct cccagcgctc acgtcggtcc tcggaaagcc ggggtcctcc 61 ctgccttttc cagcaacggt ggggtgggga ggcaggaaga aagcgccaac ctaggacccc 121 ggagatttgc agcaaaggaa gaagcgggag acgggcactt gtctgtgtct ccagcgcgtt 181 cctgcccccc gccgacccgg cccatttcta tacaaggtcg ctctgcccgg tctccacctc 241 ccacgtgcag gccgcggagg ggctcattcc cgggccctga tctcagaggc ccggaatgtg 301 gctgataaat cagagactag acctgcatgg caggcaggcc cgacactcag ctccaggata 361 aaaggccacg gtgtcccgag gagccaggag gagcaccccg caggctgagg gcaggtggga 421 agcaaacccg 
gacgcatcgc agcagcagca gcagcagcag aagcagcagc agcagcctcc 481 gcagtccctc cagagacatg gatccccaga cagcaccttc ccgggcgctc ctgctcctgc 541 tcttcttgca tctggctttc ctgggaggtc gttcccaccc gctgggcagc cccggttcag 601 cctcggactt ggaaacgtcc gggttacagg tgagagcgga gggcagctca gggggattgg 661 acagcagcaa tgaaagggtc ctcacctgct gtcccaagag gccctcatct ttcctttgga 721 attagtgata aaggaatcag aaaatggaga gactgggtgc cctgaccctg tacccaaggc 781 agtcggttca cttgggtgcc atgaagggct ggtgagccag gggtgggtcc ctgaggcttg 841 gacgccccca ttcattgcag gagcagcgca accatttgca gggcaaactg tcggagctgc 901 aggtggagca gacatccctg gagcccctcc aggagagccc ccgtcccaca ggtgtctgga 961 agtcccggga ggtagccacc gagggcatcc gtgggcaccg caaaatggtc ctctacaccc 1021 tgcgggcacc acgaagcccc aagatggtgc aagggtctgg ctgctttggg aggaagatgg 1081 accggatcag ctcctccagt ggcctgggct gcaaaggtaa gcaccccctg ccaccccggc 1141 cgccttcccc cattccagtg tgtgacactg ttagagtcac tttggggttt gttgtctctg 1201 ggaaccacac tctttgagaa aaggtcacct ggacatcgct tcctcttgtt aacagccttc 1261 agggccaagg ggtgcctttg tggaattagt aaatgtgggc ttatttcatt accatgccca 1321 caataccttc tccccacctc ctacttctta tcaaaggggc agaatctcct ttgggggtct 1381 gtttatcatt tggcagcccc ccagtggtgc agaaagagaa ccaaacattt cctcctggtt 1441 tcctctaaac tgtctatagt ctcaaaggca gagagcagga tcaccagagc aatgataatc 1501 cccaatttac agatgaggaa actgaggctc agagagttgc attaagcctc aaacgtctga 1561 tgactaacag ggtggtgggt ggcacacgat gaggtaagct cagcccctgc ctccatctcc 1621 caccctaacc atcatcaccc tctctctttc cctgacagtg ctgaggcggc attaagagga 1681 agtcctggct gcagacacct gcttctgatt ccacaagggg ctttttcctc aaccctgtgg 1741 ccgcctttga agtgactcat tttttttaat gtatttatgt atttatttga ttgttttata 1801 taagatggtt tcttaccttt gagcacaaaa tttccacggt gaaataaagt caacattata 1861 agctttatct tttgaaactg atttgtcttg gcgcattaaa aataatccct catttcaaag 1921 aa // LOCUS HUMCACY 3671 bp DNA PRI 31-OCT-1994 DEFINITION Human calcyclin gene, complete cds. ACCESSION J02763 NID g179765 KEYWORDS calcyclin. SOURCE Human placenta DNA, (library of P.Leder), clones pG2A9B[1.7,3.0,6.0]. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3671) AUTHORS Ferrari,S., Calabretta,B., deRiel,J.K., Battini,R., Ghezzo,F., Lauret,E., Griffin,C., Emanuel,B.S., Gurrieri,F. and Baserga,R. TITLE Structural and functional analysis of a growth-regulated gene, the human calcyclin JOURNAL J. Biol. Chem. 262 (17), 8325-8332 (1987) MEDLINE 87250432 REFERENCE 2 (bases 1321 to 1380) AUTHORS Ghezzo,F., Valpreda,S., De Riel,J.K. and Baserga,R. TITLE Identification of serum-responsive elements in the promoter of human calcyclin, a growth-regulated gene JOURNAL DNA 8 (3), 171-177 (1989) MEDLINE 89251078 FEATURES Location/Qualifiers source 1..3671 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q25" prim_transcript 1372..>2746 /note="CCY mRNA and introns" intron 1416..2002 /note="CCY intron A" exon <2024..2161 /gene="CACY" /note="calcyclin, (first expressed exon); G00-119-048" /number=2 gene 2024..2161 /gene="CACY" CDS join(2024..2161,2534..2668) /note="calcyclin" /codon_start=1 /db_xref="PID:g179766" /translation="MACPLDQAIGLLVAIFHKYSGREGDKHTLSKKELKELIQKELTI GSKLQDAEIARLMEDLDRNKDQEVNFQEYVTFLGALALIYNEALKG" intron 2162..2533 /note="CCY intron B" exon 2534..>2668 /note="calcyclin" /number=3 BASE COUNT 732 a 987 c 1110 g 842 t ORIGIN 215 bp upstream of HinfI site; chromosome 1q21-q25. 
1 agtactcggt gttcctgagg atgctgtgca tggcctacaa cgacttcttt ctagaggaca 61 acaagtgacc agggctgccc tccaccctca ccctccaccc tttgctgctg acctcggctg 121 ctcctctcac agaccctctt tggcccctgc cctcctctcc ctcccagatg gacccttcca 181 tgggaggaaa taaagtttcc atcgcaggtg ctgggagtct ggttttgaag ctgtcttgtc 241 taccttggcc tggggagagg ggagcacagg aagggtctct ccttgagtgg gttgagacag 301 cttctgcctc tgggggttag ggtcctgggc tcccactgca ttcctctcct tctttggtgt 361 ggacgtcatt ggttttgtca tggcttagtt ttgcctgcct ggaaaatggg gaagttaggc 421 caggcgggaa ctctgcaagg atgcagagga agttaagagg gaaagttgct ttgagaggag 481 gacactggga ggggttggga gtggctcctg agggcggtga taggcaggca ggcctgactt 541 gtccacagct cacccggagg ccaccttggc agcacctgta ggaagggcat gtctggcctc 601 cacaccagcc ccctccctct tcaccatttc cccttcaata gcaccactct catcatctat 661 gggggacagt gctttcttct ctccctgcct cctccatcaa aatcttttct caggggaggg 721 tctgaaaagg ccttcactcc cccgtaaata acgaatggtg cttacagggc tgggctccca 781 cgtgcatgca cattaacacc aaaggtgctg tagtgaatgg aatttggggc actgagggga 841 aggcgtggag gtgttggtag gaacttgttg ctggtggggg atgggcgccg tagatatcct 901 ttacaccact ggctactccc cctatctcct ctggggtgac cctgagtatc ctctgtggga 961 caccggcatc ctgtgaggcg ccctccttgc ccacattgac gctgcgctgg ctcgagggtc 1021 acattcacgg tctggcagag gaagcagggg tgaccgccgc agtcctcctc ctgctcccct 1081 tgccgagtca cgtgtcacga agagcaaact gagcaaactg agctgcgcag atgaggggag 1141 actcgtcacc aggcgtgcag tgggcactgc tgggctcccc catcccgtcc taacccggaa 1201 cagccccggg caggaggcgt ggaaagtcga gggggtaaac cgcgaatgtg cgttgtgtaa 1261 gccacggcgc agggtggggc gcgggcggga cttgggcggg cggggtgggc ttggccgagc 1321 tggcctccgg ggcaccgacc gctataaggc cagtcggact gcgacacagc ccatcccctc 1381 gaccgctcgc gtcgcatttg gccgcctccc taccggtgag ttctctccag gagccctggg 1441 tactttccag ggccagctgc cctcacgctg ggggtccagc catcccctgc ccagttcagc 1501 cgctggatcc agactggggc catctgtggc gctcccccgc tggagggata gtcaggagca 1561 gcagtgctgt gccaggcagg ccttgggcta agggatcgca atggggtgtg ctcttttggg 1621 gtgcggaagg gagtgccctg ggtgtgtcat tgccaccatg tgtggccctg tgaagctgtg 1681 tttaagctgc ctttgcagcc 
tccattcccc tcccctgccc agccatactc ctcaacttct 1741 ggatcccctg aaggacagtt ctcagctgtg cccaaagcta ctgttcctat atgcttctta 1801 gaatccttaa gccacctctc ttgccttggc cctagtgtgc tctctccttc cccttcagcc 1861 ctgggctgtc tcctgatgcc attgtgtgtg gcctgagact gggtggttcc aaaggaggcg 1921 gggctagtgc aggcagcatt attggggtgt gtgggtgaga agtccttgct cccatggcac 1981 tgactaggcc ctctgctgcc agctccaagc ccagccctca gccatggcat gccccctgga 2041 tcaggccatt ggcctcctcg tggccatctt ccacaagtac tccggcaggg agggtgacaa 2101 gcacaccctg agcaagaagg agctgaagga gctgatccag aaggagctca ccattggctc 2161 ggtgagtggc ctcctcccca ggaccccttt tcccaccctt gtcctttgga agcaaggatt 2221 aggggagaga gaggtgccag gtgcatctga ctcacattta cccacattct gaggccctgg 2281 tccacatgta gaccctgagc tgtagaccca ctctcccagc gggtagggga tgcttccagc 2341 cggatatcca tctctccaaa tgaggaccag taactgagaa gtatctgagg agaagcaatg 2401 ccaaagtgac atgggtcctt ggtgatgagg gagcacagag ccacttgcag agaggattgc 2461 ctaggagggg gaaggggaag aatccagggt tgtcatcacc actgagtatg gatttcacat 2521 tctaacacat tagaagctgc aggatgctga aattgcaagg ctgatggaag acttggaccg 2581 gaacaaggac caggaggtga acttccagga gtatgtcacc ttcctggggg ccttggcttt 2641 gatctacaat gaagccctca agggctgaaa ataaataggg aagatggaga caccctctgg 2701 gggtcctctc tgagtcaaat ccagtggtgg gtaattgtac aataaatttt ttttggtcaa 2761 atttaccctt gcgtcttggc ttccgaatga tttctgttcc tccttggctt agtgggacac 2821 cagccattgg aagatttgct cacggtcaac ctctgaaaat gactcattga ctcgccaggc 2881 cagaggaccc accctgacaa ggctgcctct agcgcgtaag gtgcctttat gtgaatgagg 2941 agagatgccc ctcttggcaa cgccatccta aggaaaggct caagtggttt ccagtagaga 3001 gagtcctggg atgagcttgg agatggaaat ggtcctttgg gccgggatgt gatggggttt 3061 gggggcctgg aagtgaggca gagatagttc cagaggctcc cagatgtgtt ttgctctggg 3121 tgtggcaaga ggggccttgg ggtggggcaa gtccctttct catcacagcg caggggttag 3181 atagggcaca tctgagatgc ctgaggcttg gctcagggag tttcctacac cagtgaggac 3241 gctgtgtgac tgagtctact gcggctgccc aggtcccagg tggagtgggg gaggcacact 3301 cttggagtgt gtcccgtcat tcagggtgag ggctttttgt tggaacggtg gtctgaggag 3361 ctggcagctg caccaacacg tgaaccacgg 
ggtgttcagt aatggggcgg ggtatccctg 3421 cagcctcagc gtaatgactc acccggcact tccacgggat ccagcctgga tctcagcccc 3481 catcagagaa gatgactaat tgaatcattg tccatcatct ggattagtgt tttaaggcag 3541 aagggaagag gataaggagg gtaaacgctg tttccgggtg atgccacatc attaagcctc 3601 tctaggccta gtccgagctg ggcaagttta cctctagctt ctggggaaga gatcttgact 3661 ttagatggag a // LOCUS HUMCAPG 3734 bp DNA PRI 31-OCT-1994 DEFINITION Human cathepsin G gene, complete cds. ACCESSION J04990 NID g179914 KEYWORDS cathepsin G; serine protease. SOURCE Human lung fibroblast DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3734) AUTHORS Hohn,P.A., Popescu,N.C., Hanson,R.D., Salvesen,G. and Ley,T.J. TITLE Genomic organization and chromosomal localization of the human cathepsin G gene JOURNAL J. Biol. Chem. 264 (23), 13412-13419 (1989) MEDLINE 89340411 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by T.J.Ley, 13-JUN-1989. 
FEATURES Location/Qualifiers source 1..3734 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11.2" CAAT_signal 253..256 TATA_signal 293..297 prim_transcript 322..3020 /note="cathepsin G mRNA and introns" gene join(350..404,1161..1308,1763..1898,2074..2328,2763..2936) /gene="CTSG" CDS join(350..404,1161..1308,1763..1898,2074..2328,2763..2936) /gene="CTSG" /note="cathepsin G" /codon_start=1 /db_xref="GDB:G00-119-822" /db_xref="PID:g179915" /translation="MQPLLLLLAFLLPTGAEAGEIIGGRESRPHSRPYMAYLQIQSPA GQSRCGGFLVREDFVLTAAHCWGSNINVTLGAHNIQRRENTQQHITARRAIRHPQYNQ RTIQNDIMLLQLSRRVRRNRNVNPVALPRAQEGLRPGTLCTVAGWGRVSMRRGTDTLR EVQLRVQRDRQCLRIFGSYDPRRQICVGDRRERKAAFKGDSGGPLLCNNVAHGIVSYG KSSGVPPEVFTRVSSFLPWIRTTMRSFKLLDQMETPL" exon <350..404 /gene="CTSG" /note="cathepsin G" /number=1 intron 405..1160 /note="cathepsin G Intron A" exon 1161..1308 /gene="CTSG" /number=2 intron 1309..1762 /note="cathepsin G Intron B" exon 1763..1898 /gene="CTSG" /number=3 intron 1899..2073 /note="cathepsin G Intron C" exon 2074..2328 /gene="CTSG" /number=4 intron 2329..2762 /note="cathepsin G Intron D" exon 2763..>2936 /gene="CTSG" /note="cathepsin G" /number=5 polyA_signal 2990..2995 BASE COUNT 990 a 960 c 959 g 825 t ORIGIN Chromosome 14q11.2. 
1 cttgctttgc tggagtattc tggtaatttg atgggttgag ggttctggac acaatgcccc 61 aagccccttc cttgttgtgc tgggttccta tttctgctct cggcactgac ttagcagctg 121 ctcaagagct cactatgttg gcttggatta cacggtctca cccacatctc cggcagtttg 181 tgggcaaacc tcctgagcag ccttgggtga tgaaaccttt catggtagca ggagaatggg 241 actgtgaatt ctcaatcccc tgtccccacc ccttccttcc tctctcaggg ccttaaagtc 301 taggaggagg aagcacagca gcaactgact gggcagcctt tcaggaaaga tgcagccact 361 cctgcttctg ctggcctttc tcctacccac tggggctgag gcaggtgagt gaccatcccc 421 accctcagag gcctgacctc atcccataga ttcttgagcc aaattgcctt ggtatatcct 481 aattctgtac tgttgagcaa gttatttgaa tttgtgtttc ctcatctata aaatgagaat 541 aatattaata ccgatcttgc agagttgcca tgagagttaa ataagttaga gtatttaaat 601 gtcttggaat tgcccgcaca ctataagtgc tataaaaaca tgctttgtgt aaataatttg 661 gcagcatgtg tcagacccta cctaggaggt aagaatacag caataacagt accatcagct 721 catgtctaga tttttaaaca ccagtcccac gtggtcttga attggactca gagggctctg 781 ggaagctcca tgaggataaa agtataaggg aacttcagga acaatcctgt acttacagca 841 aagcattctc ctcaatacct gaggctgaag ctggccttgc ctggaacaag ggttgttctc 901 cctcttttgg agaggaggag ggaggtgagg cctaggatgg ggaaaagggc tcctttcaag 961 acagcagtgt ttcctgtaga accctggagc cccctcccaa tctgctgccc catagactcc 1021 aagcctcagc accatctcct ccctctcctg caccctctct cctgccgtcc ccatcttcca 1081 gcctttctgg agccaccaat ctggtaccca cattgcaggt tcagcaagca tagagctaag 1141 tgccaaatgc ttccttccag gggagatcat cggaggccgg gagagcaggc cccactcccg 1201 cccctacatg gcgtatcttc agatccagag tccagcaggt cagagcagat gtggagggtt 1261 cctggtgcga gaagactttg tgctgacagc agctcattgc tggggaaggt gaggagctaa 1321 ggaacttcct ggccagccag gaacacagcc ctgcggagct cttcggtgga agagccatct 1381 gaaagaagag ttgtagcaat gaaagggtga aagaaagacc aagtgagtct ttgcgggagg 1441 gaacaggcca gtgtaaatga ggaggaaagg aggataagat caaaaagagc aagaggaaga 1501 gatggaagac acatattggg gctcaaaata taaactcagg ctatttatca acttaatctg 1561 gggaagtaaa cctgaaggca agtaccaccc tgtcatccct agctcagagc tgctgagaaa 1621 gaggatacag ctgagcccca gggccctccc atcccctcga ttctggttag ctgcagtctt 1681 gccctccccg tgctgtctgc 
ctaccctgca gagctggtgg accatagctc ctgcagccca 1741 gacctacctc ttgcttttgc agcaatataa atgtcaccct gggcgcccac aatatccaga 1801 gacgggaaaa cacccagcaa cacatcactg cgcgcagagc catccgccac cctcaatata 1861 atcagcggac catccagaat gacatcatgt tattgcaggt accacctacc tggccctctg 1921 gctccttcct agtgtgtccg gggacaatgg aggaggaagt gagggcaagg ctccggggtg 1981 gcggggaggg catgggatgt gtactgcacc agcgaccccc gagccttggc tggaggcccc 2041 agctgagcgg gaacgcctac attcttcctc cagctgagca gaagagtcag acggaatcga 2101 aacgtgaacc cagtggctct gcctagagcc caggagggac tgagacccgg gacgctgtgc 2161 actgtggccg gctggggcag ggtcagcatg aggaggggaa cagatacact ccgagaggtg 2221 cagctgagag tgcagaggga taggcagtgc ctccgcatct tcggttccta cgacccccga 2281 aggcagattt gtgtggggga ccggcgggaa cggaaggctg ccttcaaggt aaggcatggg 2341 cattggccaa cacaccccgg gagagagggg cccgtgcaga gccaggcagt gcgaacagat 2401 tccatcccca cagcctcagc ctggcagcca gaccagggtg ggctggggat tgttttcccc 2461 atcaacctgg tctctggggg aataggagga agacccacaa cacatacata ggcaacattc 2521 tcctggagaa gggagaggta ccttgactca gattgggctg gagacagtaa ttaaggcaga 2581 gctgaagtcc agcgaccgaa aagatccaga ggcttggctc ctgtacccca ccgatcttcc 2641 atctcacaca cacccagcaa ttgaaggggc ccacccaccc ctgccttccc tgagagcccg 2701 gagctcaggg aagcaggagc agggaggcct gtctcagtct cccttctcct ctctacctac 2761 agggggattc cggaggcccc ctgctgtgta acaatgtggc ccacggcatc gtctcctatg 2821 gaaagtcgtc aggggttcct ccagaagtct tcaccagggt ctcaagtttc ctgccctgga 2881 taaggacaac aatgagaagc ttcaaactgc tggatcagat ggagaccccc ctgtgactga 2941 ctcttcttct cggggacaca ggccagctcc acagtgttgc cagagcctta ataaacgtcc 3001 acagagtata aataaccaat tcctcatttg ttcattaaac gtcattcagt acttagtttg 3061 tttggattgc tacaacaaaa tagcacaaat tgggtggctt ataaataaca aatttatttc 3121 tcacaggtct agaggctaag aagtctaaga tcaagtcact agcagattca gtgtctaatt 3181 agggcccatt ttctggttca cagacaacca tcctctccct gtgtccacat atggcaaaag 3241 gggcaaggga attctctgat gtctctttta caagggacct agtctcattc aaagagctca 3301 gcttttacga cctaatcaca tcccaaaggc cccacctaat gccatcacga cattggggat 3361 taggtctggg aaacataggg aaagagtgtc 
tctacacaaa aattttaaaa ttagccaggc 3421 atggtggcat gtgtctatag tcccagctac ttgggaggct aaagtggaag gattagttga 3481 acccacgagg ttgaggcttc agtgaaccat gcactccagc ctgagcgaca gagcaagaca 3541 ccattccaag aaagaaaaaa aaaaagactg gcaggccaaa aagacagaac tgaaattcca 3601 aaaaaaaaga cctactttag tgtatgaaaa aggtggcatc tcaaatcact gggaaacaat 3661 ggaatttttg aataaatagc attagaacca acctagatag atatttggag gggatggaag 3721 gtataattgg atcc // LOCUS HUMCBRG 3326 bp DNA PRI 03-MAY-1994 DEFINITION Homo sapiens carbonyl reductase gene, complete cds. ACCESSION M62420 NID g179977 KEYWORDS carbonyl reductase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3326) AUTHORS Forrest,G.L., Akman,S., Doroshow,J., Rivera,H. and Kaplan,W.D. TITLE Genomic sequence and expression of a cloned human carbonyl reductase gene with daunorubicin reductase activity JOURNAL Mol. Pharmacol. 40 (4), 502-507 (1991) MEDLINE 92017676 FEATURES Location/Qualifiers source 1..3326 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(185..566,1112..1219,2608..3044) /gene="CBR" gene 185..3044 /gene="CBR" exon 185..566 /gene="CBR" /number=1 CDS join(278..566,1112..1219,2608..3044) /gene="CBR" /EC_number="1.1.1.184" /codon_start=1 /product="carbonyl reductase" /db_xref="PID:g179978" /translation="MSSGIHVALVTGGNKGIGLAIVRDLCRLFSGDVVLTARDVTRGQ AAVQQLQAEGLSPRFHQLDIDDLQSIRALRDFLRKEYGGLDVLVNNAGIAFKVADPTP FHIQAEVTMKTNFFGTRDVCTELLPLIKPQGRVVNVSSIMSVRALKSCSPELQQKFRS ETITEEELVGLMNKFVEDTKKGVHQKEGWPSSAYGVTKIGVTVLSRIHARKLSEQRKG DKILLNACCPGWVRTDMAGPKATKSPEEGAETPVYLALLPPDAEGPHGQFVSEKRVEQ W" exon 1112..1219 /gene="CBR" /number=2 exon 2608..3044 /gene="CBR" /number=3 BASE COUNT 755 a 902 c 802 g 867 t ORIGIN 1 gcctcggtct tcggcctgcg ggttctgcaa agtcaggcta gctggctctc cgcctgctcc 61 gcaccccggc gaggttccgg tggggagggg tagggatggt tcagccccgc cccgctaggg 121 cggggcctgc gcctgcgcgc tcagcggccg ggcgtgtaac ccacgggtgc gcgcccacga 181 
ccgccagact cgagcagtct ctggaacacg ctgcggggct cccgggcctg agccaggtct 241 gttctccacg caggtgttcc gcgcgccccg ttcagccatg tcgtccggca tccatgtagc 301 gctggtgact ggaggcaaca agggcatcgg cttggccatc gtgcgcgacc tgtgccggct 361 gttctcgggg gacgtggtgc tcacggcgcg ggacgtgacg cggggccagg cggccgtaca 421 gcagctgcag gcggagggcc tgagcccgcg cttccaccag ctggacatcg acgatctgca 481 gagcatccgc gccctgcgcg acttcctgcg caaggagtac gggggcctgg acgtgctggt 541 caacaacgcg ggcatcgcct tcaagggtat ggggagggga cgtggcctcc ccgaagaaga 601 accgatgcac tggggctcct ggcgtctgcg gggtccataa cgcctcccta gggaggagag 661 ggaggagcta ggagatgcag ggagtgcaga aaccttggag aaagtgagag ttttccagcc 721 aaagggaact ttgtgtttcc ctggctggga ctcttgggga tctttttcag gttttctgca 781 gtttttctga attgagcttt aaggcaactg gatggatttt gaactgattt taaaataaac 841 gacaaattgc aaaaagaaaa aatggttatg ccaagatcca gtgtgtgcac ccttcatcct 901 ccttccccca atggtgccat cttacaaaaa tactagaaca tgatcaaacc aggagattga 961 cattggcaca atactgtaac cagatacaga cttactttag gcagagggca ctaagttttt 1021 gttttttttt tttttttttt ttagtatcat tgtatagaat tttaccccat gggtacaatg 1081 tattaactat gttctctttc tctcctaaaa gttgctgatc ccacaccctt tcatattcaa 1141 gctgaagtga cgatgaaaac aaatttcttt ggtacccgag atgtgtgcac agaattactc 1201 cctctaataa aaccccaagg tgagtctgat gggaaacagc gcaccttctc tttggggctt 1261 gatttggccc ctgcttgcct gggatttctc ctgcaggctc ctgcttcctc tccatgctgc 1321 acgtgcactg acctctgtgc tttttctcct gccagctgat atgtgccatt ttgcctcagg 1381 actttgtcct cacttctccc actcccatgg ccattcgtcc tccctgtcgc actggtcttt 1441 gcacacctgg ctgactctca tccttcaggt cttagctcaa acgtctcctg agagatgctc 1501 tttatttgct ccattcccct ccgcgtggga gtccatcctg taccctttct ctgctctttc 1561 agagagatct tattcaggct ggtggaggct ttactaccac cagttcatta ggcccttcta 1621 atacctgtca cctgtctgaa acactcctta agctcccaac cacatggttg tcttgctgcc 1681 tttccatctc ttcctgatgc ctttccccac aatctaagat atttccagag gatccctatc 1741 cttttcccta agtcgtttgg gacacagacc ccaccaccca cccaccagga tcctgcaggt 1801 caccatgccc cacccctcat aatcatttac taggtcctag gagtagtgcc cactgagttc 1861 ctcctgaatc cgtctcattg 
cagctctaca gcaggccctc acctgtccct ctggacaatt 1921 gcaaaccctc acaaggcacc tctggtcttg agtcttttcc tcttttcctt tcccagcatc 1981 ctgcgtactg tctgcatggt catgcctctc ccaaaatcct tcaggaggaa agtccaagcc 2041 cttagcatgg ttcacagaga tgtccataat ctgccgctgc ttaactctgg gcccatttta 2101 actccccttc aacgtgctga aggtgctgga catgactatc acatgtcttt ggttgtaaac 2161 tgctgtgata gttaccctaa gtaatgggac aggagatgaa cccacccatt aaataacaca 2221 gcaattaagc agccactttt agaaaaattt aaatgtgtgg cttcgagttg ggtacttgca 2281 tgtacagctt actctcagct tttatcctgt ttccctgtcc tgtcgtcctg ttaagttgtg 2341 ctacttctaa ggctctggtt caccaggacc tcaaagggca gctcttccaa atctcacctg 2401 actctactca gccaaaattc gaaggcagga atgaacttct tcatgcaact accaccacac 2461 tttctatgcg tattcctctt agaactcata ccagtaactg tctctcatat gaaaattttc 2521 tgctccaaaa tccctgcaaa tttggcattc acctctctac gggattgttg cacacctttc 2581 tacataatgc tttgtggtgt atcttaggga gagtggtgaa cgtatctagc atcatgagcg 2641 tcagagccct taaaagctgc agcccagagc tgcagcagaa gttccgcagt gagaccatca 2701 ctgaggagga gctggtgggg ctcatgaaca agtttgtgga ggatacaaag aagggagtgc 2761 accagaagga gggctggccc agcagcgcat acggggtgac gaagattggc gtcaccgttc 2821 tgtccaggat ccacgccagg aaactgagtg agcagaggaa aggggacaag atcctcctga 2881 atgcctgctg cccagggtgg gtgagaactg acatggcggg acccaaggcc accaagagcc 2941 cagaagaagg tgcagagacc cctgtgtact tggccctttt gcccccagat gctgagggtc 3001 cccatggaca atttgtttca gagaagagag ttgaacagtg gtgagctggg ctcacagctc 3061 catccatggg ccccattttg taccttgtcc tgagttggtc caaagggcat ttacaatgtc 3121 ataaatatcc ttatataaga aaaaaaatga tctcttatca attagcactc actaatgtac 3181 tactaattga gcaacctacg cactcagttg actacgtaaa tctgtcaggt cttttgtgat 3241 ttcctctgat gcaggagagg aaaaattgta attgatgaaa ataatgaatg aaaatcaaca 3301 gatgaataaa tggttcttta taagtg // LOCUS HUMCD19A 8743 bp DNA PRI 17-JUL-1995 DEFINITION Human CD19 gene, complete cds. ACCESSION M84371 NID g901822 KEYWORDS B-cell specific protein; surface protein. SOURCE Homo sapiens (tissue library: human genomic cosmid library) DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kozmik,Z., Wang,S., Dorfler,P., Adams,B. and Busslinger,M. TITLE The promoter of the CD19 gene is a target for the B-cell-specific transcription factor BSAP JOURNAL Mol. Cell. Biol. 12 (6), 2662-2672 (1992) MEDLINE 92269839 FEATURES Location/Qualifiers source 1..8743 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="human genomic cosmid library" /map="Unassigned" protein_bind 1260..1287 /gene="CD19" /note="medium affinity; G00-127-605" /bound_moiety="BSAP" protein_bind 1321..1347 /gene="CD19" /note="high affinity; G00-127-605" /bound_moiety="BSAP" exon 1359..1494 /gene="CD19" /note="G00-127-605" /number=1 mRNA join(1359..1494,1752..2018,2317..2520,2640..2915, 4849..4959,5558..5606,5917..6005,6428..6541,6676..6780, 6860..6928,7030..7086,7175..7231,8083..8175,8279..8385, 8548..8743) /gene="CD19" /note="G00-127-605" gene join(1359..1494,1752..2018,2317..2520,2640..2915, 4849..4959,5558..5606,5917..6005,6428..6541,6676..6780, 6860..6928,7030..7086,7175..7231,8083..8175,8279..8385, 8548..8743) /gene="CD19" CDS join(1407..1494,1752..2018,2317..2520,2640..2915, 4849..4959,5558..5606,5917..6005,6428..6541,6676..6780, 6860..6928,7030..7086,7175..7231,8083..8175,8279..8370) /gene="CD19" /codon_start=1 /db_xref="GDB:G00-127-605" /db_xref="PID:g901823" /translation="MPPPRLLFFLLFLTPMEVRPEEPLVVKVEEGDNAVLQCLKGTSD GPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLASWLFIFNVSQQMGGFYLCQPGPP SEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWA KDRPEIWEGEPPCVPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVH PKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITA RPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFF KVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALG SRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPE DEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILY 
AAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR" intron 1495..1751 /gene="CD19" /note="G00-127-605" /number=1 exon 1752..2018 /gene="CD19" /note="G00-127-605" /number=2 intron 2019..2316 /gene="CD19" /note="G00-127-605" /number=2 exon 2317..2520 /gene="CD19" /note="G00-127-605" /number=3 intron 2521..2639 /gene="CD19" /note="G00-127-605" /number=3 exon 2640..2915 /gene="CD19" /note="G00-127-605" /number=4 intron 2916..4848 /gene="CD19" /note="G00-127-605" /number=4 exon 4849..4959 /gene="CD19" /note="G00-127-605" /number=5 intron 4960..5557 /gene="CD19" /note="G00-127-605" /number=5 exon 5558..5606 /gene="CD19" /note="G00-127-605" /number=6 intron 5607..5916 /gene="CD19" /note="G00-127-605" /number=6 exon 5917..6005 /gene="CD19" /note="G00-127-605" /number=7 intron 6006..6427 /gene="CD19" /note="G00-127-605" /number=7 exon 6428..6541 /gene="CD19" /note="G00-127-605" /number=8 intron 6542..6675 /gene="CD19" /note="G00-127-605" /number=8 exon 6676..6780 /gene="CD19" /note="G00-127-605" /number=9 intron 6781..6859 /gene="CD19" /note="G00-127-605" /number=9 exon 6860..6928 /gene="CD19" /note="G00-127-605" /number=10 intron 6929..7029 /gene="CD19" /note="G00-127-605" /number=10 exon 7030..7086 /gene="CD19" /note="G00-127-605" /number=11 intron 7087..7174 /gene="CD19" /note="G00-127-605" /number=11 exon 7175..7231 /gene="CD19" /note="G00-127-605" /number=12 intron 7232..8082 /gene="CD19" /note="G00-127-605" /number=12 exon 8083..8175 /gene="CD19" /note="G00-127-605" /number=13 intron 8176..8278 /gene="CD19" /note="G00-127-605" /number=13 exon 8279..8385 /gene="CD19" /note="G00-127-605" /number=14 intron 8386..8547 /gene="CD19" /note="G00-127-605" /number=14 exon 8548..8743 /gene="CD19" /note="G00-127-605" /number=15 BASE COUNT 1757 a 2679 c 2182 g 2125 t ORIGIN 1 ggatcctctc gcctcggcct cctaaagtat tgggattaca ggcatgagcc tctgtgcctg 61 gctgtaactg acatgtttta agcaggggaa tgacatgctc tagtgaaagc cagtctgggc 121 agctgggtag ctaatgaggg gattagagag attttgttga atgaaaggca gattgagtcc 181 
tgctactcgc ccccttcatt ccccttcatt catgcctcat tcttccgcct cccagccgcc 241 tcaactggcc aaagggaagt ggaggccctg ccacctgtag ggagggtccc ctggggcttg 301 cccacagcaa acaggaagtc acagcctggt gagatgggcc tgggaatcag ccactgagaa 361 agtgggtctc ttgggtccct gaattctttt tctgagtccc tgcagcagtg aaaaagacac 421 agaggcacat agagagtgac agagaaagag agagacagag aggagaggca tggggcagaa 481 taagaacaga tttaggagtt agaactcctg ggttctttta aaacaatttt tcttttagag 541 acagggtctt gttgtgttgc ccggactgga gcacagtggc tattcccagg cataatcatg 601 gtgcactgca gccttgaact cctgggctca agcgatcctt ctacctcagc ctcccaagga 661 cctgggacca taggcgtgta ccactgtgcc tggcttttgc ctggttttaa actgaggcag 721 tatgacttga gctcttaggc attaattgaa gctgtatctc attaactgag ggcttatgat 781 gtgctggaca ctgggctaat agtgctgaac atattgtcat ttttaatctt cacaaacaat 841 atttgtatag gactgttttc ttttcttttt tttttttgaa acagagtctc actctggtgc 901 ccaggctgga gtgcagtggt gtgatctcgg ctcactgcaa cctccgcctc ctggtttcca 961 gtgattctcc tgcctcagcc tcctaagtag ctgggattac aggtgtgcgc caccatgccc 1021 ggctaatttt tttttttttt tttgagaagg agtctatgtg cccagcattg ttctagagca 1081 cttgcaatta gtggtgaaca acacggtctc tactccaagg ggctcacatt cttgtgcaga 1141 aaacagaaat gaacaaataa acacacaaga tcatttcccg tggtagtgag agctgggatg 1201 aaaataaaac agcgtggcag ggaggaggca agtgttgtga gtctggaggg ttcctggaga 1261 atggggcctg aggcgtgacc accgccttcc tctctggggg gactgcctgc cgcccccgca 1321 gacacccatg gttgagtgcc ctccaggccc ctgcctgccc cagcatcccc tgcgcgaagc 1381 tgggtgcccc ggagagtctg accaccatgc cacctcctcg cctcctcttc ttcctcctct 1441 tcctcacccc catggaagtc aggcccgagg aacctctagt ggtgaaggtg gaaggtatgt 1501 ccaaagggca gaaagggaag ggattgaggc tggaaacttg agttgtggct gggtgtcctt 1561 ggctgagtaa cttaccctct ctgagcctcc attttcttat ttgtaaaatt caggaaaggg 1621 ttggaaggac tctgccggct cctccactcc cagcttttgg agtcctctgc tctataacct 1681 ggtgtgagga gtcggggggc ttggaggtcc cccccaccca tgcccacacc tctctccctc 1741 tctctccaca gagggagata acgctgtgct gcagtgcctc aaggggacct cagatggccc 1801 cactcagcag ctgacctggt ctcgggagtc cccgcttaaa cccttcttaa aactcagcct 1861 ggggctgcca ggcctgggaa 
tccacatgag gcccctggca tcctggcttt tcatcttcaa 1921 cgtctctcaa cagatggggg gcttctacct gtgccagccg gggcccccct ctgagaaggc 1981 ctggcagcct ggctggacag tcaatgtgga gggcagcggt gagggccggg ctggggcagg 2041 ggcaggagga gagaagggag gccaccatgg acagaagagg tccgcggcca caatggagct 2101 ggagagaggg gctggaggga ttgagggcga aactcggagc taggtgggca gactcctggg 2161 gcttcgtggc ttcagtatga gctgcttcct gtccctctac ctctcactgt cttctctctc 2221 tctgcgggtc tttgtctcta tttatctctg tctttgagtc tctatctctc tccctctcct 2281 gggtgtctct gcatttggtt ctgggtctct tcccagggga gctgttccgg tggaatgttt 2341 cggacctagg tggcctgggc tgtggcctga agaacaggtc ctcagagggc cccagctccc 2401 cttccgggaa gctcatgagc cccaagctgt atgtgtgggc caaagaccgc cctgagatct 2461 gggagggaga gcctccgtgt gtcccaccga gggacagcct gaaccagagc ctcagccagg 2521 gtatggtgat gactggggag atgccgggaa gcgggggtcc agagacagag gggaggggaa 2581 actgaagagg tgaaaccctg aggatcaggc tttccttgtc ttatctctcc ctgtcccaga 2641 cctcaccatg gcccctggct ccacactctg gctgtcctgt ggggtacccc ctgactctgt 2701 gtccaggggc cccctctcct ggacccatgt gcaccccaag gggcctaagt cattgctgag 2761 cctagagctg aaggacgatc gcccggccag agatatgtgg gtaatggaga cgggtctgtt 2821 gttgccccgg gccacagctc aagacgctgg aaagtattat tgtcaccgtg gcaacctgac 2881 catgtcattc cacctggaga tcactgctcg gccaggtaga gtttctctca actgggaggc 2941 atctgtgtgg gggtactggg aagaagtgga agccagtcaa tcttagattc ccccaacccg 3001 agggctactc ccagcctcac cccaaacccc aacttccaca cagaacactg actccaagtc 3061 tttctttttt ttgacagagt ctcgctctgt tgcctaggct ggagtgcagt ggtgccatct 3121 tgtcttggct cactgcaacc tccgcctccc aggttcaagt gattcccctg cctcagcctc 3181 ctgagtagct gggattacag gtgcccacca ccacgcctgg ctaatttttt tttttttttt 3241 gagacggagt cttgcactgt cacccaggct ggagtgcagt ggcacgatct cagctcactg 3301 caacctccac cttccaggtt caagtgattc tcctgcctca gcctcccgag tagctgggat 3361 taaagcctgg ctaatttttt ttgtattttt agtagagatg gggtttcatt atgttggcca 3421 ggctggtctc aaactcctga cctcgtgatc cacccgcctc ggcctcccaa agtgctggga 3481 ttacagacat gagccacagg gccgggccaa gcctaatttt gtatttttag tagagatggg 3541 gtttctccct gttggaccag gctggtcttg 
aactcctgac ttcaggtgat ctgcctgcct 3601 tggcctccca aagtactggg attacaggca taagccaccg cacctggcct agacttcaag 3661 tctttcttcc ctcgcttcca agacactact tttctgggtc ttcacctacc attgcttgcg 3721 cctgcccacc agcttgggtg gagtcttcct tcctccccaa ctcctcactc ttggagccct 3781 gggccctctt cttatccctg tctgcacact ttcctatttg aacttgactc tcaatggctt 3841 cttgggtcac catgccttgg tgactctatt ccaggctcca tactcagcca tctcctgtgc 3901 catttgatat cccatggaca cctcaggctc aacagataca aaatcaaact caatgtcttc 3961 cccaagtata gtcttcttgg tggcccagtg taagcagagg gcaccaccac ctgctccctc 4021 gcccaggcta agaacctggg catccttctt tttcctcacc ccgtccaaca aactggtcac 4081 agtgttctgc caattctctc tccatgcaat cctatcatgc tatcctaact gcaattcaca 4141 aacccaaccc caactttcac tccaaacttg atccaagcaa tgtgctggat cccaactgta 4201 accttgcaaa ctcaactctg cccttcactt tgaccgtgac tatccttaat tgcagcagga 4261 aactgatcat tatgctcccc tcaatccaca cattgcctct gagtacagcc atggtttgtc 4321 cacgatttgc tcaaagacac tgcccatgtc ctgtgccagg gtctgtgaca atccctgacc 4381 tcctgggaca tggctcctta gagagaggag agcctttctc acagcttggg actttgagtc 4441 tgtgtctttt tttttttctt gagacggagt tttgctgtgg ttgcccaggc tggagtgcag 4501 tgatctcggc tcactgaaac ctccgcctcc cgggttcaaa cgattctcct gcctcagcct 4561 cccaagtagc tgggattaca ggcacccacc accatgccca gctaattttt ttgtattttt 4621 agtagagatg gggtttcacc atgttggcca ggctggtctc gaactcctga cctcaggtga 4681 tccacccgcc tttgcctccc aaagtgctgg gattacaggc gtcaaccacc gcgcccggcc 4741 gagtctgtgt cttgcctctg tgcctcagac ttgcggttcc ttgagatctc aggattggga 4801 cgtaagatgc cagcctgggg tcctcgtctc atagcccctt ccccctagta ctatggcact 4861 ggctgctgag gactggtggc tggaaggtct cagctgtgac tttggcttat ctgatcttct 4921 gcctgtgttc ccttgtgggc attcttcatc ttcaaagagg tgagtcatgt ccccagtggg 4981 tctgtccaaa ccctactcca tcttccccag gataagccgg ctctggccag tctgacaacc 5041 atctttcttt cctcccatcc ctcccttcaa gaccccagaa tcctgttctc cccagtcttc 5101 ctctagcctc cctcaaactt cccaagcctc ttgcaatttt tttttttttt ttgagacagg 5161 gtctcattct gtcaccccag ctggagtgca gtggcacaat ctgagctcac tgtaacctct 5221 gcctcccagg cttaagtgat tcttgtgctt cagcctcccg 
agtacctggg actacaagtg 5281 tatgccacca cacccggcca attttttata tttttagtag agacgaggtt tcaccatgtt 5341 ggccagactg gtctcgaact cttgacctca aatgatccgc ccacctcggc ctcccaaagt 5401 gctgggatta caggcacgag ccaccgcgcc cgtccgcctc gcaatttgaa ctcctgtctc 5461 ctttgttgaa ccaagtgacc tccccagcac ctggccccac aaatcctcac cctgccaagc 5521 agcccctcct ctgatcacgc cctttaactc ccaccagccc tggtcctgag gaggaaaaga 5581 aagcgaatga ctgaccccac caggaggtaa tgcaaccagt gcaccccgcg gtaacaccct 5641 ccaccttcac tttatgcctt gcacttactg tttcctctgc ccaggggttc tttgctccgt 5701 ctctactgtt tcaaatactg cccaacctca aagcccagct ccaaagctac ctcctctgtg 5761 aagaactcct tggaaatgat catctcagac tcctctattg gctgtcccag cacaagtgat 5821 cacgtttaac ttctgaaggc ctggacagaa tcttgagtgg gtccgccatt ccattccaag 5881 tcggccctca ccgtgcactt cctcttctcc cgccagattc ttcaaagtga cgcctccccc 5941 aggaagcggg ccccagaacc agtacgggaa cgtgctgtct ctccccacac ccacctcagg 6001 cctcggtaag aggcaccgcc cctccagcct atagctccgc cccagatccg gggctccacc 6061 cccactctcc tcatccctcc aatccgctgt gcgccaagcc ttctggagct cggaactccg 6121 cccccggggc ggggagtccc gcccagctat gagccccgcc tctagaacca gaccccgcct 6181 ccagggctca gagccacgcc cccaggaccc agagcctgaa gtcgtaatca agagcagaac 6241 ttcgccccag aactgaaggc ctcggcccta gatttagatt ccgccccagg gttcaaggcc 6301 gggttcctag acccagagtc cattcgcaga gcccaaaaca tcctcttccc gtgccccgcc 6361 gcgcggaccc ttagccttga ccgcccccat ctcttctgac cccgtcttac aatgcccctc 6421 tcaccaggac gcgcccagcg ttgggccgca ggcctggggg gcactgcccc gtcttatgga 6481 aacccgagca gcgacgtcca ggcggatgga gccttggggt cccggagccc gccgggagtg 6541 ggtgaatgac tgggagaggg aagggtcgtt ccccacatgg agggggttgg agcggtctgt 6601 ggcccgaata gtggactggg ccctggagga gagggggcat gactcggttc cccatcccca 6661 tccccaaacc cccaggccca gaagaagagg aaggggaggg ctatgaggaa cctgacagtg 6721 aggaggactc cgagttctat gagaacgact ccaaccttgg gcaggaccag ctctcccagg 6781 gtaaggctgc cctcccccgt ggccccccac ctctgcggtg gcctgtggac tcccatggac 6841 acccctcctt ctacaccaga tggcagcggc tacgagaacc ctgaggatga gcccctgggt 6901 cctgaggatg aagactcctt ctccaacggt aacttggggc ctttgtggga 
cctcagagac 6961 ttaggtgtaa ttgcagcgct gtgacactcc tagaagggga tccctggagt tctctctctt 7021 ctgccacagc tgagtcttat gagaacgagg atgaagagct gacccagccg gtcgccagga 7081 caatgggtgt gtgtgaggat ggcaacagtc caggggggag gcggaggaca cctggaggcc 7141 aggaggaata gtaacctccc tcttcccttt ccagacttcc tgagccctca tgggtcagcc 7201 tgggacccca gccgggaagc aacctccctg ggtgagagat gctttcaatc agactgcctt 7261 gcccagcttg ggtgacctgg cctcagctct gacaccagat ccaactttga cctgaccctg 7321 accccaaacc cgaacccaat cctgtgactc ctctcacctc aacactgagc cccatccccc 7381 atcctgagcc ccatccccca tcctgacccc caatatttac cccctcccta actgtgaata 7441 tcaacaccga tcccaatgca gtatcagcct ggacttgatc tccacctcac ctcagcccca 7501 gtgcagacct caacttggac cccagcttac tctgcagctt cttcatgact ctgactccga 7561 ctccctccag tttcttcttt ttctttttct tttttttgag acggagtctc cctctgttgc 7621 ccaggctgga gtgcagttgc cacctctgcc tcctaggttc aagcgattct catgcctcag 7681 cctcctgagt agctgggatt atagacgttt gccaccacac ctggctaatt tttgtatttt 7741 cagtagagac agggtttcgc catgttggcc agactggtct ccaactcctg gcctctagtg 7801 atctgcccgc ctttggcttc ccaaagtgct gggattacag gcatgagcca ccacgcccag 7861 cccagttctg ttcttgaccc cttccttagc cataatctaa cccatatcta accctgaccc 7921 tacagctaac tggggcccca aactcaatgc taaccaaatc accccttccc agcacagcat 7981 gggtaatgct cctcaccttc ctctgcccct cagtcttcct ccttaccgta ggctgtactt 8041 cccatgccct agcctccaat tctccatccc ccgcccaagc agggtcccag tcctatgagg 8101 atatgagagg aatcctgtat gcagcccccc agctccgctc cattcggggc cagcctggac 8161 ccaatcatga ggaaggtggg tgcttctgcc gctgtcccct gctgtcccct gggctgactt 8221 tgccttccag cctacttcca gtgccaccca tgttctcctc ctccctggtc ctatccagat 8281 gcagactctt atgagaacat ggataatccc gatgggccag acccagcctg gggaggaggg 8341 ggccgcatgg gcacctggag caccaggtga tcctcaggtg gccaggtgag ctgggactgc 8401 ccctagggaa agcggggagg gagggagata ggcacggatg gcagtggctg ctggctttca 8461 gggagggaga gggaacaggg ttcctagggc ctggtgggca gggggaggac tgctggaccc 8521 ctccccatca ccgtttcttc tgcatagcct ggatctcctc aagtccccaa gattcacacc 8581 tgactctgaa atctgaagac ctcgagcaga tgatgccaac ctctggagca atgttgctta 
8641 ggatgtgtgc atgtgtgtaa gtgtgtgtgt gtgtgtgtgt gtgtatacat gccagtgaca 8701 cttccagtcc cctttgtatt ccttaaataa actcaatgag ctc // LOCUS HUMCD79B 3938 bp DNA PRI 09-NOV-1994 DEFINITION Human CD79b/Ig beta/B29 gene, complete coding sequence. ACCESSION L27587 NID g567109 KEYWORDS . SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3938) AUTHORS Hashimoto,S., Chiorazzi,N. and Gregersen,P.K. TITLE The complete sequence of the human CD79b (Ig beta/B29) gene: identification of a conserved exon/intron organization, immunoglobulin-like regulatory regions, and allelic polymorphism JOURNAL Immunogenetics 40 (2), 145-149 (1994) MEDLINE 94299285 FEATURES Location/Qualifiers source 1..3938 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="B cell" exon <300..400 /number=1 CDS join(334..400,1208..1258,2206..2517,2706..2824,3122..3163, 3273..3371) /codon_start=1 /db_xref="PID:g567110" /translation="MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKGSACS RIWQSPRFIARKRGFTVKMHCYMNSASGNVSWLWKQEMDENPQQLKLEKGRMEESQNE SLATLTIQGIRFEDNGIYFCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQRNTLKDG IIMIQTLLIILFIIVPIFLLLDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEV KWSVGEHPGQE" intron 401..1207 /number=1 exon 1208..1258 /number=2 intron 1259..2205 /number=2 exon 2206..2517 /number=3 intron 2518..2705 /number=3 exon 2706..2824 /number=4 intron 2825..3121 /number=4 exon 3122..3163 /number=5 intron 3164..3272 /number=5 exon 3273..3832 /number=6 polyA_signal 3833..3838 BASE COUNT 790 a 1164 c 1225 g 758 t 1 others ORIGIN 1 gatgaccacc ggtggggtaa gcacagacag aggggagcac aggcttcccc cagaagactg 61 agaggccccc cagaggcatc cacagaggac cccagctgtc ctgcccaagc tgggcgaccg 121 ccaaacctta gcggcccagc tgacaaaagc ctgccctccc ccagggtccc cggagagctg 181 gtgcctcccc tgggtcccaa tttgcatggc aggaaggggc ctggtgagga agaggcgggg 241 aggggacagg ctgcagccgg tgcagttaca cgttttcctc caaggagcct cggacgttgt 301 cacgggtttg gggtcgggga 
cagagcggtg accatggcca ggctggcgtt gtctcctgtg 361 cccagccact ggatggtggc gttgctgctg ctgctctcag gtacagaacc cacgacgagg 421 cctgtggggt ttgctctcat ctccagctgt ctgggccctg cggctctgcc tcctgcttgc 481 tccaccctcc tcctctgtct ctcactcttt accgcctgtc tccctctcat ggccctgggg 541 cctgggtctg tgggtgtcag ctgcacgtgg tgatgggctc aggagcctgt ggccccagtt 601 cctctcctgc ctggagagca tcaggctgag ctgggcatag gagcaaacca gagttagcaa 661 gcaggaccgt ggaagaacta ggctcaggag gaagagaggc tgagggcggt gcctgggagt 721 cgggagggga gaggctgtcc caaccggacc ctggaagatc gaagatccaa atgtggtccc 781 ctcctctggg aggtgaagat cctccgggct gtgactttct tcctgtctct ctttaaaaaa 841 aaaaaaaaaa agagagacag ggtgtcccta tgttgcccag gctggtctca aactcctgga 901 cccaagcgat cctcctgtct tggcctctca aaatactgag atttccctgt ccctcctgtc 961 tcattcagat gcctgctcaa agccaccctg agggtcaaag gcagaaaggg ggctgggggc 1021 agggcttccg gtacccccac cccatatctg tctcccaagt actggggaca gggaggaccg 1081 aggggtcttg aagctgctgt catggaagga ggaagtagct ccgggaacat agagggcatg 1141 caggggtggg gcagcccagg ccagaggagg ccctgactgg ccccgcccac ttcctcccac 1201 ccagcagctg agccagtacc agcagccaga tcggaggacc ggtaccggaa tcccaaaggt 1261 cagtagcctc gcctgccgct tctgagggca agttgtgtcc gactatgtcc tgtgtccgac 1321 tatggtccct gtcctggggg cggtccgcct tcccaccctt gcctgatctc cccatccagg 1381 attcccatcc ccaggcctcc ttaggatctc ctctgagccc cacccccagc ccactgtcat 1441 cctcatcaga cacatgccct ccagccttgg ccactgaagg gctgggccta aactctgggc 1501 tcatagcaag ggagcctggg ggttcctcta ggactcaggc ccgggcctgt ctagctggga 1561 atctggctga gtctcggtga gccctccagt gcagggtcac tatgtgaggg gctcagggat 1621 tcccgggcag tgctcgctaa gaacaaggca gtcgaagctg gaaagagggt tctggaacaa 1681 gaagtggacc cagtccctgc aggagggaag agggggatga tggagggaag tcatgcaggg 1741 agggggtcga ctgaggaagg tctcagttga cagactagga atttggcctt tatccctgga 1801 ggaacggggg agactgagga tttaagcctg tgcagggcac ccagtgaaag gaggggccag 1861 gcctgtgggc ctccaggtcc tgggggtgtg gcctgggtca ggctgttgta atgggcatag 1921 gaaggaatgg caaggtccca gggctcttcc aaaggaaaat cgaaggggag gtggggcagt 1981 cgggaagagg tggagtctga gagcttggcc gtggcttctc 
atgtggctca gcctctacgt 2041 ggctctgctt cccaactcca acctcagcct cctgcctgtc ccacaccctg agaagcaagg 2101 actaagccca gggagtctgt gccagccagg gcaggggcct gctgctgggg ctggcactca 2161 ggaagatacc aagcggaatg ctgagcctga ccttggcacc cacaggtagt gcttgttcgc 2221 ggatctggca gagcccacgt ttcatagcca ggaaacgggg cttcacggtg aaaatgcact 2281 gctacatgaa cagcgcctcc ggcaatgtga gctggctctg gaagcaggag atggacgaga 2341 atccccagca gctgaagctg gaaaagggcc gcatggaaga gtcccagaac gaatctctcg 2401 ccaccctcac catccaaggc atccggtttg aggacaatgg catctacttc tgtcagcaga 2461 agtgcaacaa cacctcggag gtctaccagg gctgcggcac agagctgcga gtcatgggtg 2521 tgtgccaggt ggctgggacg ctcctgtagc tgtccgcccc accctagccc ggcctgcctc 2581 ttgtggtgat ggtcatggcc ctagagcact caggagagcc actgggacct gagggagctg 2641 cccttgtagg cctaanagtt ggacttgtgg cccaggctca caaaccccac ccctgttccc 2701 cgcaggattc agcaccttgg cacagctgaa gcagaggaac acgctgaagg atggtatcat 2761 catgatccag acgctgctga tcatcctctt catcatcgtg cctatcttcc tgctgctgga 2821 caaggtgatc aggggacggg ggaagccttg ggggaccgca gaggaggcca ccgcataggc 2881 caggcatatg cagggtctcc atctccccag ctctcgttgt ggccttcttg gtccagcctg 2941 tccccactgg ctcaccaatt caatcattcc ttccttcctt ccttcattca gctgttcttg 3001 cagaatgcac ctcactcctg acccctcacc cctctccctg gccctcccca gcctggccca 3061 gcaggggatg gggctggggg gacactaaca ctctgatctt ccatccctct tccgccccca 3121 ggatgacagc aaggctggca tggaggaaga tcacacctac gaggtaagga gaggggcagg 3181 cccagcagct ctgagtcctc ggggtcagtg gccactatct gctggtgtgg ttggggtgtg 3241 gtcccggcct gagttccact taatgtctcc agggcctgga cattgaccag acagccacct 3301 atgaggacat agtgacgctg cggacagggg aagtgaagtg gtctgtaggt gagcacccag 3361 gccaggagtg agagccaggt cgccccatga cctgggtgca ggctccctgg cctcagtgac 3421 tgcttcggag ctgcctggct catggcccaa cccctttccc ggacccccca gctggcctct 3481 gaagctggcc caccagagct gccatttgtc tccagcccct ggtgtccagc tcttgccaaa 3541 gggcctggag tagaaggaca acagggcagc aacttggagg gagttctctg gggatggacg 3601 ggacccagcc ttctgggggt gctatgaggt gatccgtccc cacacatggg atgggggagg 3661 cagagactgg tccagagccc gcaaatggac tcggagccga gggcctccca 
gcagagcttg 3721 ggaagggcca tggacccaac tgggccccag aagagccaca ggaacatgat tcctctcccg 3781 caaccactcc cacccccagg gaggccctgg cctccagtgc cttcccccgt ggaataaacg 3841 gtgtgtcctg agaaaccaca cacaggctcg tggcatctta tgggtttggg gtggatagga 3901 acatctatgt gggcacttct gttgtttgaa tgtggcat // LOCUS HUMCD7AA 3280 bp DNA PRI 01-NOV-1994 DEFINITION Human CD7 antigen gene, exons 1-4. ACCESSION M37271 NID g180163 KEYWORDS CD7 antigen protein. SOURCE Human lung fibroblast DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3280) AUTHORS Schanberg,L.E., Fleenor,D.E., Kurtzberg,J., Haynes,B.F. and Kaufman,R.E. TITLE Isolation and characterization of the genomic human CD7 gene: structural similarity with the murine Thy-1 gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (2), 603-607 (1991) MEDLINE 91110576 COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by L.E.Schanberg, 31-JUL-1990. 
FEATURES Location/Qualifiers source 1..3280 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q25.2-q25.3" gene join(564..645,1077..1391,1649..1863,2624..2734) /gene="CD7" CDS join(564..645,1077..1391,1649..1863,2624..2734) /gene="CD7" /note="CD7 antigen protein" /codon_start=1 /db_xref="GDB:G00-119-770" /db_xref="PID:g180164" /translation="MAGPPRLLLLPLLLALARGLPGALAAQEVQQSPHCTTVPVGASV NITCSTSGGLRGIYLRQLGPQPQDIIYYEDGVVPTTDRRFRGRIDFSGSQDNLTITMH RLQLSDTGTYTCQAITEVNVYGSGTLVLVTEEQSQGWHRCSDAPPRASALPAPPTGSA LPDPQTASALPDPPAASALPAALAVISFLLGLGLGVACVLARTQIKKLCSWRDKNSAA CVVYEDMSHSRCNTLSSPNQYQ" exon <564..645 /gene="CD7" /note="CD7 antigen protein" /number=1 intron 646..1076 /note="CD7 antigen protein intron A" exon 1077..1391 /gene="CD7" /number=2 intron 1392..1648 /note="CD7 antigen protein intron B" exon 1649..1863 /gene="CD7" /number=3 intron 1864..2623 /note="CD7 antigen protein intron C" exon 2624..>2734 /gene="CD7" /note="CD7 antigen protein" /number=4 BASE COUNT 534 a 1139 c 1019 g 588 t ORIGIN 1 cagctggtcg gctgcagcgt tgggtgcggg tgcaggccca ggctccttcc ccacggccaa 61 accagggggt gccaattctg gcctgccctg tgctcctctg cactgtgtgg gtgactcatg 121 ggtcatgggt ggctccagat ttctgggcat aaattgggtc ccagcagaaa gtaccccaga 181 ggaccaggca ggctgggggc cagctctgcc gctcacttcc agaccctgcc acccgtcccc 241 agccagggag tcccttctct gtcctctggt ctgccccgcc ccaggccttg acctcctccc 301 tgtggagatg caggggcagt gggggataca gggacgccct gctctcaggg cagctctcag 361 ggaggcagtg ctggggaggg gtaggtgaga ccgcccctcc caccgggccc acagcaccac 421 cctggtctga ggcaccgcct ccaggaagcc ctctctgagc tctgagcgcc tgcggtctcc 481 tgtgtgctgc tctctgtggg gtcctgtaga cccagagagg ctcagctgca ctcgcccggc 541 tgggagagct gggtgtgggg aacatggccg ggcctccgag gctcctgctg ctgcccctgc 601 ttctggcgct ggctcgcggc ctgcctgggg ccctggctgc ccaaggtaag agcttcccag 661 gctctccatg gccacagctc cggagctctc cctgccccat gagctcagag cccccagtct 721 gagccacagc acagccccca ggaagcgggt ggggtgctga gcggcctcca gtgtctgagg 781 actcatttaa gagaaggaaa aagggtggac ccggtgggga gtggccgggg ctgtccaggc 841 
agggccgctg ctttgggagg aagaagccca cagtctcgga acacgaggac agcacctccc 901 ccaacaccac agccggtgcc cagatctgct ccatgccccg taaggcaccg tgtctttggc 961 gacatgtcag ccctgggctg tctcagggcc ccaccatccc caccactgtc ccctgcaggg 1021 aggacattct ctgtccttct ggccagactg atggtgacag cccaggtcct ccccagaggt 1081 gcagcagtct ccccactgca cgactgtccc cgtgggagcc tccgtcaaca tcacctgctc 1141 caccagcggg ggcctgcgtg ggatctacct gaggcagctc gggccacagc cccaagacat 1201 catttactac gaggacgggg tggtgcccac tacggacaga cggttccggg gccgcatcga 1261 cttctcaggg tcccaggaca acctgactat caccatgcac cgcctgcagc tgtcggacac 1321 tggcacctac acctgccagg ccatcacgga ggtcaatgtc tacggctccg gcaccctggt 1381 cctggtgaca ggtagggaat gtgcccatcc cagacccccc tcccaacccc agctgctggc 1441 caggctctgc tcccccagcc cttgtcgtgg gaccctccct cctacatgtg cctgaactgt 1501 tccagctccc agcccactgc ccccagcagc ctcctagata gctgcccctc ctcccctcca 1561 cagcctttcc ctgccccgaa tcccaaaccc cgggggctct aacaggttct ccaccgggag 1621 aatcccttcc ttcttttttc cttctcagag gaacagtccc aaggatggca cagatgctcg 1681 gacgccccac caagggcctc tgccctccct gccccaccga caggctccgc cctccctgac 1741 ccgcagacag cctctgccct ccctgacccg ccagcagcct ctgccctccc tgcggccctg 1801 gcggtgatct ccttcctcct cgggctgggc ctgggggtgg cgtgtgtgct ggcgaggaca 1861 caggtcagtg tgagccccag ctgccacctg caccccaaag attgttccct ctcctgagag 1921 cagcgtgggg ggcgccaaag ccccagagga aaaccctggt acccgccgcc tcctctgcag 1981 gctctggaac tctcagggtg gaagtggcct cggcggggag ggagggggac cagcaccaaa 2041 gcgacctgag ggggagggag cagcccaggc gctgggcccg cgtgtgcgcg tgtgccggtg 2101 tgcaggtgtg agtgtctgag gggggatctc tgaggtctgt gcgtgcaggt tggagtgtgc 2161 gggcagagga gggagcgccc aggcgtgggc ccgcgtgtgc acgtgtgcag gtgtgagcct 2221 gtgtgtgggg atctctgagg actgtgcgtg catgttggag tgtgccggcg tgctgtgtgg 2281 ccgtgggaag gggagggggc ctcagacaga cacagagtct aggcgtggcc actcccctcg 2341 gtccctgctg tttgtctggg gctggagtgg gggtcccgct tggcttgggc ttggtggggg 2401 ggtccaagcc acttccactc tccttcttca gaggcacccc agggtccgac ccagccctgc 2461 ctggggaagg ctctgctggg aaccctgggg ctcaggagac ctgggggcca gcccagagga 2521 gcccgtgggg 
gtctcgtgtt aaaattagaa agctgaccgg agtggggtgg ggtccaggac 2581 accacgctgg ccactggttg gagacggcgg ttctgtcttt cagataaaga aactgtgctc 2641 gtggcgggat aagaattcgg cggcatgtgt ggtgtacgag gacatgtcgc acagccgctg 2701 caacacgctg tcctccccca accagtacca gtgacccagt gggcccctgc acgtcccgcc 2761 tgtggtcccc ccagcacctt ccctgcccca ccatgccccc caccctgcca cacccctcac 2821 cctgctgtcc tcccacggct gcagcagagt ttgaagggcc cagccgtgcc cagctccaag 2881 cagacacaca ggcagtggcc aggccccacg gtgcttctca gtggacaatg atgcctcctc 2941 cgggaagcct tccctgccca gcccacgccg ccaccgggag gaagcctgac tgtcctttgg 3001 ctgcatctcc cgaccatggc caaggagggc ttttctgtgg gatgggcctg gcacgcggcc 3061 ctctcctgtc agtgccggcc cacccaccag caggccccca acccccaggc agcccggcag 3121 aggacgggag gagaccagtc ccccacccag ccgtaccaga aataaaggct tctgtgcttc 3181 cttttctgac ttctgcattt attcctcagc cagcaggagc cagggaggga gatggctgac 3241 acttcggacc ctccccaagc ctccgcctgg ggctctgcag // LOCUS HUMCEL 11502 bp DNA PRI 01-NOV-1994 DEFINITION Human carboxyl ester lipase (CEL) gene, complete cds. ACCESSION M94579 NID g180243 KEYWORDS carboxyl ester lipase. SOURCE Homo sapiens (individual_isolate 1) (tissue library: lambda-DASH) male DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11502) AUTHORS Lidberg,U., Nilsson,J., Stromberg,K., Stenman,G., Sahlin,P., Enerback,S. and Bjursell,G. 
TITLE Genomic organization, sequence analysis, and chromosomal localization of the human carboxyl ester lipase (CEL) gene and a CEL-like (CELL) gene JOURNAL Genomics 13 (3), 630-640 (1992) MEDLINE 92347858 FEATURES Location/Qualifiers source 1..11502 /organism="Homo sapiens" /isolate="1" /db_xref="taxon:9606" /cell_type="lymphocyte" /sex="male" /tissue_lib="lambda-DASH" /map="9q34.3" mRNA join(1630..1717,4043..4193,4279..4401,4679..4876, 6160..6290,6468..6575,6718..6835,8301..8487,8687..8890, 10096..10293,10620..11460) /gene="CEL" /note="G00-127-527" exon 1630..1717 /gene="CEL" /note="G00-127-527" /number=1 gene join(1630..1717,4043..4193,4279..4401,4679..4876, 6160..6290,6468..6575,6718..6835,8301..8487,8687..8890, 10096..10293,10620..11460) /gene="CEL" CDS join(1643..1717,4043..4193,4279..4401,4679..4876, 6160..6290,6468..6575,6718..6835,8301..8487,8687..8890, 10096..10293,10620..11364) /gene="CEL" /EC_number="3.1.1.3" /codon_start=1 /db_xref="GDB:G00-127-527" /product="carboxyl ester lipase" /db_xref="PID:g180244" /translation="MLTMGRLQLVVLGLTCCWAVASAAKLGAVYTEGGFVEGVNKKLG LLGDSVDIFKGIPFAAPTKALENPQPHPGWQGTLKAKNFKKRCLQATITQDSTYGDED CLYLNIWVPQGRKQVSRDLPVMIWIYGGAFLMGSGHGANFLNNYLYDGEEIATRGNVI VVTFNYRVGPLGFLSTGDANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESA GGASVSLQTLSPYNKGLIRRAISQSGVALSPWVIQKNPLFWAKKVAEKVGCPVGDAAR MAQCLKVTDPRALTLAYKVPLAGLEYPMLHYVGFVPVIDGDFIPADPINLYANAADID YIAGTNNMDGHIFASIDMPAINKGNKKVTEEDFYKLVSEFTITKGLRGAKTTFDVYTE SWAQDPSQENKKKTVVDFETDVLFLVPTEIALAQHRANAKSAKTYAYLFSHPSRMPVY PKWVGADHADDIQYVFGKPFATPTGYRPQDRTVSKAMIAYWTNFAKTGDPNMGDSAVP THWEPYTTENSGYLEITKKMGSSSMKRSLRTNFLRYWTLTYLALPTVTDQEATPVPPT GDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGAP PVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMP AVIRF" intron 1718..4042 /gene="CEL" /note="G00-127-527" /number=1 exon 4043..4193 /gene="CEL" /note="G00-127-527" /number=2 intron 4194..4278 /gene="CEL" /note="G00-127-527" /number=2 exon 
4279..4401 /gene="CEL" /note="G00-127-527" /number=3 intron 4402..4678 /gene="CEL" /note="G00-127-527" /number=3 exon 4679..4876 /gene="CEL" /note="G00-127-527" /number=4 intron 4877..6159 /gene="CEL" /note="G00-127-527" /number=4 exon 6160..6290 /gene="CEL" /note="G00-127-527" /number=5 intron 6291..6467 /gene="CEL" /note="G00-127-527" /number=5 exon 6468..6575 /gene="CEL" /note="G00-127-527" /number=6 intron 6576..6717 /gene="CEL" /note="G00-127-527" /number=6 exon 6718..6835 /gene="CEL" /note="G00-127-527" /number=7 intron 6836..8300 /gene="CEL" /note="G00-127-527" /number=7 exon 8301..8487 /gene="CEL" /note="G00-127-527" /number=8 intron 8488..8686 /gene="CEL" /note="G00-127-527" /number=8 exon 8687..8890 /gene="CEL" /note="G00-127-527" /number=9 intron 8891..10095 /gene="CEL" /note="G00-127-527" /number=9 exon 10096..10293 /gene="CEL" /note="G00-127-527" /number=10 intron 10294..10619 /gene="CEL" /note="G00-127-527" /number=10 exon 10620..11460 /gene="CEL" /note="G00-127-527" /number=11 BASE COUNT 2461 a 3514 c 3264 g 2263 t ORIGIN 1 ggatccctcg aacccaggag ttcaagactg cagtgagcta tgattgtgcc actgcactct 61 agcctgggtg acagagaccc tgtctcaaaa aaacaaacaa acaaaaaacc tctgtggact 121 ccgggtgata atgacatgtc aatgtggatt catcaggtgt taacagctgt acccctggtg 181 ggatgttgat aacggggaga ctggagtggg cgaggacata cggaaatctc tgtaatcttc 241 ctctaatttg ctgtgaacct aaagctgctc taaaaatgta catagatata aactggggcc 301 ttcctttccc tctgccctgc cccagccctc ccccacctcc ttcctctccc tgctgcctcc 361 cctctgccct cccctttcct ccttagccac tgtaaatgac actgcagcaa agtctgagca 421 aatgcctttc cctgggcgcc ccagccacct gcaggcccct tatttcctgt ggccgagctc 481 ctcctcccac cctccagtcc tttccccagc ctccctcgcc cactaggcct cctgaattgc 541 tggcaccggc tgtggtcgac agaagaggga cagacgtggc ctctgcaggt ccactcggtc 601 cctggcaccc ggccgcacgg ggtggcagaa cgggagtgtg gttggtgtgg gaagcacagg 661 ccccagtgtc tcctggggga ctgttgggtg ggaaggctct ggctgccctc accctgttcc 721 catcactgca gagggctgtg cggtggcctg gagctgccac tgagtgtctc ggtgagggtg 781 acctcacact ggctgagctt aaaggcccca 
tctgaagact ttgttcgtgg tgttctttca 841 cttctcagag cctttcctgg ctccaggatt aatacctgtt cacagaaaat acgagtcgcc 901 tcctcctcca caacctcaca cgaccttctc ccttccctcc cgctggcctc tttccctccc 961 cttctgtcac tctgcctggg catgccccag ggcctcggct gggccctttg tttccacagg 1021 gaaacctaca tggttgggct agatgcctcc gcaccccccc acccacaccc cctgagcctc 1081 tagtcctccc tcccaggaca gatcaggctg gatggtgaca gttccacagc cttgagtggg 1141 actgccttgt gctgctctgg gattcgcacc cagcttggac tacccgctcc acgggcccca 1201 ggaaaagctc gtacagataa ggtcagccac atgagtggag ggcctgcagc atgctgccct 1261 ttctgtccca gaagtcacgt gctcggtccc ctctgaagcc cctttggacc taggggacaa 1321 gcagggcatg gagacatgga gacaaagtat gcccttttct ctgacagtga caccaagccc 1381 tgtgaacaaa ccagaaggca gggcactgtg caccctgccc ggccccacca tcccccttac 1441 cacccgccac cttgccacct gcctctgctc ccaggtaagt ggtaacctgc acaggtgcac 1501 tgtgggtttg gggaaaactg gatctccctg cacctgaggg ggtagagggg agggagtgcc 1561 tgagagctca tgaacaagca tgtgaccttg gatccagctc cataaatacc cgaggcccag 1621 ggggagggcc acccagaggc tgatgctcac catggggcgc ctgcaactgg ttgtgttggg 1681 cctcacctgc tgctgggcag tggcgagtgc cgcgaaggta agagcccagc agaggggcag 1741 gtcctgctgc tctctcgctc aatcagatct ggaacttcgg gccaggctga gaaagagccc 1801 agcacagccc cgcagcagat cccgggcact cacgctcatt tctatgggga caggtgccag 1861 gtagaacaca ggatgcccaa ttccatttga atttcagata aactgccaag aactgctgct 1921 gctaagtatg tcccatgcaa tatttgaaac aaatttctat gggccgggcg cagtggctca 1981 cacctgcaat cccaccagtt tgggaggccc gaggtgggtg gatcacttga ggtcaggagt 2041 tggagaccag cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaatattaat 2101 cgggcgtggt ggtgggtgcc tgtaatccca gctactcggg aggctgaggc aggagaaccg 2161 cttgaagctg ggaggtggag attgcggtga gctgagatca cgctactgca ctccagcctg 2221 ggtgacaggg cgagactctg tctcaaaaaa tagaaaaaga aaaaaatgaa acatactaaa 2281 aaacaattca ctgtttacct gaaattcaaa tgtaactggg cctcttgaat ttacatttgc 2341 taatcctggt gattccacct accaacctct ctgttgttcc cattttacag aaggggaaac 2401 gggcccaggg gcagggagtg tggagagcag gcagacgggt ggagagaagc aggcagcagt 2461 ttgcccagca tggcacagct gctgcctcct attcctgtgc 
aggaagctga aagccgggct 2521 actccacacc cgggtccggg tccctccaga aagagagccg gcaggcagga gctctctcga 2581 ggcatccata aattctaccc tctctgcctg tgaaggagaa gccacagaaa ccccaagccc 2641 cacaggaagc cggtgtcggt gcccggccca gtccctgccc ccagcaggag tcacacaggg 2701 gaccccagat cccaaccacg ctgttctgct gcctgcggtg tctcaggccc tggggactcc 2761 tgtctccacc tctgctgcct gctctccaca ctccctggcc ctgggaccgg gaggtttggg 2821 cagtggtctt gggctctgac tcaaaggaga ggtcaccttc ttcttgggcg agctcttctt 2881 ggggtgctga gaggccttcg gaggtcatca cgacccctcc ccatttcccc accctgaggc 2941 cctctggcag tctcaattgc acagggatca cgccactggc acaaggagac acagatgctc 3001 gcaggggatg ccacgatggc tgcatgtgtt gcttctggtt gctttgctca gttcaaccgc 3061 cgcactctcc cacaccagtg tgacaggggg cccatcaccc tagacttcag agggctgctg 3121 ggaccctggc tgggctgggg gtgtagggca cctgcccttc ccacctggaa cctggcacag 3181 tgacagccag caagcaatga cctggtccta ccatgcacca cgggaagagg gagctgctgc 3241 ccaagatgga caggaggtgg cactggggca gacagctgct tctcaacagg gtgacttcaa 3301 gcccaaaagc tgcccagcct cagttccgtc agggacagag ggtggatgag caccaacctc 3361 caggccctcg tgggggtgga cagcttggtg cacagaggcc attttcatgg cacagggaag 3421 cgtgtcgggg gtggtaggtg tggtccctag gggttcttta ccagcagggg ggctcagaac 3481 tgtggggact tggcgatggg gccatcgact ttgtgcccag ccagctaggc cctgtgcagg 3541 gagatgggag gagggaaaag caggccccac ccctcagaaa ggaggaaggt tggtgtgaaa 3601 cactcccggg tacactgagc attgggtaca ctcctcccgg gagctggaca ggcctcccat 3661 gtgatggcaa acaggccgac agagacacgg ctgttgctcg tcttccacat ggggaaactg 3721 aggatcggag tcaaagctgg gcgccatagc agaacccaaa cctccatccc acctcttggc 3781 cggcttccct agtgggaaca ctggttgaac cagtttcctc taagattctg ggagcaggac 3841 acccccaggg ataaggagag gaacaggaat cctaaagccc tgagcattgc agggcagggg 3901 gtgctgcctg ggtctcctgt gcagagctgt cctgctttga agctgtcttt gcctctgggc 3961 acgcggagtc ggcttgcctt gcccctccgg attcaggccg atgggttgag cccccctgac 4021 cctgcccgtg tctccctcgc agctgggcgc cgtgtacaca gaaggtgggt tcgtggaagg 4081 cgtcaataag aagctcggcc tcctgggtga ctctgtggac atcttcaagg gcatcccctt 4141 cgcagctccc accaaggccc tggaaaatcc tcagccacat cctggctggc 
aaggtgggag 4201 tgggtggtgc cggactggcc ctgcggcggg gcgggtgagg gcggctgcct tcctcatgcc 4261 aactcctgcc acctgcaggg accctgaagg ccaagaactt caagaagaga tgcctgcagg 4321 ccaccatcac ccaggacagc acctacgggg atgaagactg cctgtacctc aacatttggg 4381 tgccccaggg caggaagcaa ggtctgcctc ccctctactc cccaagggac cctcccatgc 4441 agccactgcc ccgggtctac tcctggcttg agtctggggg ctgcaaacgt tgaacttcca 4501 tgaaatccca cagaggcggg gaggggagcg cccactgccg ttgcccagcc tggggcaggg 4561 cagccgcttg gagcacctcc ctgtcttggc cccaggcacc tgctgcacag ggacagggga 4621 ccggctggag acagggccag gcgggcgtct ggggtcacca gccgctcccc catctcagtc 4681 tcccgggacc tgcccgttat gatctggatc tatggaggcg ccttcctcat ggggtccggc 4741 catggggcca acttcctcaa caactacctg tatgacggcg aggagatcgc cacacgcgga 4801 aacgtcatcg tggtcacctt caactaccgt gtcggccccc ttgggttcct cagcactggg 4861 gacgccaatc tgccaggtgc gtggtgtcgg ccctgaggtg ggcggaccag catgctgagc 4921 ccagcaggga gattttcctc agcacccctc accccaaaca accagtggcg gttcacagaa 4981 agacccggaa gctggagtag aatcatgaga tgcaggaggc ccttggtagc tgtagtaaaa 5041 taaaagatgc tgcagaggcc gggagagatg gctcacgctg taatcccagc actttaggag 5101 gcccacacag gtgggtcact tgagcgcaga agttcaagac cagcctgaaa atcactggga 5161 gacccccatc tctacacaaa aattaaaaat tagctgggga ctgggcgcgg cggctcacct 5221 ctgtaatccc agcacgttgg gagccccaag gtgggtagat cacctgaggt caggagtttg 5281 agaccagcct gactaaaatg gagaaacctc ttctctacta aaaatacaaa attagccagg 5341 cgtggtggcg cttgcctgta atcccagcta ctcgggaggc tgaggcagga gaatcgcttg 5401 actcaggagg cggaggttgc ggtgagccga gatcatgcca ctgcactcca gcctggagaa 5461 caagagtaaa actctgtctc aaaaaaaaaa aaaaaaaaaa aatagccagg cgtggtatct 5521 catgcctctg tcctcagcta cctgggaggc agaggtggaa ggatcgcttg agcccagggg 5581 ttcaaagctg cagtgagccg tggtcgtgcc actgactcca gcctgggcga cagagtgagg 5641 ccccatctca aaaataagag gctgtgggac agacagacag gcagacaggc tgaggctcag 5701 agagaaacca ggagagcaga gctgagtgag agacagagaa caataccttg aggcagagac 5761 agctgtggac acagaagtgg caggacacag acaggaggga ctggggcagg ggcaggagag 5821 gtgcatgggc ctgaccatcc tgcccccgac aaacaccacc ccctccagca ccacatcaac 
5881 ccaacctcct ggggacccac cccatacagc cacgcacccg actcagcctc ctggggaccc 5941 acccactcca gcaaccaacg tgacctagtc tcctggaccc accccctcca gcaccctacc 6001 cgacccagct tcttagggac ccaccatttg ccaactgggc tgctccatgg ccccaactct 6061 gttgagggca tttccacccc acctatgctg atctcccctc ctggaggcca gcctgggcca 6121 ctggtctcta gcaccccctc cccctgcccc tgcccccagg taactatggc cttcgggatc 6181 agcacatggc cattgcttgg gtgaagagga atatcgcggc cttcgggggg gaccccaaca 6241 acatcacgct cttcggggag tctgctggag gtgccagcgt ctctctgcag gtctcgggat 6301 ccctgtgggg agggcctgcc ccacaggttg agaggaagct caaacgggaa ggggagggtg 6361 ggaggaggag cgtggagctg gggctgtggt gctggggtgt ccttgtccca gcgtggggtg 6421 ggcagagtgg ggagcggcct tggtgacggg atttctgggt cccgtagacc ctctccccct 6481 acaacaaggg cctcatccgg cgagccatca gccagagcgg cgtggccctg agtccctggg 6541 tcatccagaa aaacccactc ttctgggcca aaaaggtaaa cggaggaggg cagggctggg 6601 cggggtgggg gctgtccaca tttccgttct ttatcctgga ccccatcctt gccttcaaat 6661 ggttctgagc cctgagctcc ggcctcacct acctgctggc cttggttctg cccccaggtg 6721 gctgagaagg tgggttgccc tgtgggtgat gccgccagga tggcccagtg tctgaaggtt 6781 actgatcccc gagccctgac gctggcctat aaggtgccgc tggcaggcct ggagtgtgag 6841 tagcctgctg ggttggccca tggggtctcg aggtgggggt tgaggggggt actgccaggg 6901 agtactccgg aggaagagga aggtgccaga gctgcggtct tgtcctgtca ccaactagct 6961 ggtgtctccc ctcgaaggcc ccagctgtaa gggagagggg gtgccgtttc ttcttttttt 7021 ttgagatgga gtctcactgt tgcccaggct ggagtgcagt gtcacgatct cagctcactg 7081 caacctccac ctcctgggtt caagtgattc tctgactcaa cctcccatgt agctgggact 7141 acaggcacat gccaccatgc ccagataatt tttctgtgtg tttagtaggg atggagtttc 7201 atcgtgttag ctaggatgat ctcggtcttg ggacctcatg atctgcccac ctcggcctcc 7261 caaagtgctg gaattacagc gtgagccact gtgcccggcc ccttctttat tcttatctcc 7321 catgagttac agactcccct ttgagaagct gatgaacatt tggggccccc tccccccacc 7381 tcatgcattc atatgcagtc atttgcatat aattttaggg agactcatag acctcagacc 7441 aagagccttt gtgctagatg accgttcatt cattcgttca ttcattcagc aaacatttac 7501 tgaaccgtag cactgggccc agcctccagc tccactattc tgtaccccgg gaaggcctgg 7561 
ggacccattc cacaaacacc tctgcatgtc agccttacca gcttcgtacg ctaaggctgt 7621 ccctcactca ttcttctatg caacatgcca tgaagccaag tcatctgcac gtttacctga 7681 gcatgagctc aactgcacgg gctggacaag ccaaacaaag caacccccac ggccccgcta 7741 gaagcaaaac ctgctgtgct gggcccagtg acagccaggc cccgctgcct cagcagccac 7801 tgggtcctct aggggcccgt ccaggggtct ggagtacaat gcagacctcc caccattttt 7861 ggctgatgga ctggaaccca gccctgagag agggagctcc ttctccatca gttccctcag 7921 tggcttctaa gtttcctcct tcctgcttca ggcccagcaa agagagagag gagagggagg 7981 ggctgccgct gaagaggaca gatctggccc tagacagctg actctcagcc tggggacgtg 8041 tggcagggcc tggagacatc tgtgattgtc acagctgggg agggggtgct cctggcacct 8101 cgtgggtcga ggccggggat gctctaaaca tcctacaggg cacaggatgc ccctgatggt 8161 gcagaatcaa ccctgcccca agtgtccata gatcagagaa gggaggacat agccaattcc 8221 agccctgaga ggcaaggggc ggctcagggg aaactgggag gtacaagaac ctgctaacct 8281 gcctggcctc tcccacccag accccatgct gcactatgtg ggcttcgtcc ctgtcattga 8341 tggagacttc atccccgctg acccgatcaa cctgtacgcc aacgccgccg acatcgacta 8401 tatagcaggc accaacaaca tggacggcca catcttcgcc agcatcgaca tgcctgccat 8461 caacaagggc aacaagaaag tcacggagta agcagggggc acaggactca ggggcgaccc 8521 gtgcgggagg gccgccggga aaggcactgg cgaggggggc cagcctggag gaggaaggca 8581 ttgagtggag gactgggagt gaggaagtta gcaccggtcg gggtgagtat gcacacacct 8641 tcctgttggc acaggctgag tgtcagtgcc tacttgattc ccccagggag gacttctaca 8701 agctggtcag tgagttcaca atcaccaagg ggctcagagg cgccaagacg acctttgatg 8761 tctacaccga gtcctgggcc caggacccat cccaggagaa taagaagaag actgtggtgg 8821 actttgagac cgatgtcctc ttcctggtgc ccaccgagat tgccctagcc cagcacagag 8881 ccaatgccaa gtgaggatct gggcagcggg tggctcctgg gggccttcct ggggtgctgc 8941 accttccagc cgaggcctcg ctgtgggtgg ctctcaggtg tctgggttgt ctgggaaagt 9001 ggtgcttgag tccccacctg tgcctgcctg atccactttg ctgaggcctg gcaagacttg 9061 agggcctctt tttacctccc agcctacagg gctttacaaa ccctatgatc ctctgccctg 9121 ctcagccctg caccccatgg tccttcccac tggagagttc ttgagctacc ttccatcccc 9181 catgctgtgt gcactgagag aacactggac aatagtttct atccactgac tcttatgggc 9241 ctcaactttg 
ccataatttc agcccaccac caacgttaaa aagtcttcat gtaataatag 9301 ccaattataa taaaaaataa ggccagacac agtagctcag tgcctgtaat cccagcacat 9361 tggagggtca aggtgggagg atcacttgag gtcaggagtc tgagactagt ctggccaaca 9421 tggcaaaacc ccatctctac taaaaataca aaaattatcc aggcatggtg gtgcatgcct 9481 ataatcctag ctactcagga ggctgaggta gcagaattga ttgacccagg gaggtggagg 9541 ttgcagtgag ccgagattac gccactgcac ctccagcagg ggcaacagag tgagactgtg 9601 tctcgaataa ataagtaaat aaataataaa aataaaaaat aagttaggaa tacgaaaaag 9661 ataggaagat aaaagtatac ctagaagtct aggatgaaag ctttgcagca actaagcagt 9721 acatttagct gtgagcctcc tttcagtcag gcaaaaaggg aaacagttga gggcctatac 9781 cttgtccaat ctaattgaag aatgcacatt caccttggag agcaaaatat ttcttgacta 9841 ctgaattcta gaaggaaggt gcctcacaat gttttgtgga ggtgaagtat aaattcagct 9901 gaaattgtgg aacccatgaa tccatgaatt tggttctcag ctttcccttc cctgggtgta 9961 agaagcccca tctcttcatg tgaattcccc agacacttcc ctgcccactg cccgggacct 10021 ccctccaagt ccggtctctg ggctgatcgg tccccagtga gcaccctgcc tacttgggtg 10081 gtctctcccc tccaggagtg ccaagaccta cgcctacctg ttttcccatc cctctcggat 10141 gcccgtctac cccaaatggg tgggggccga ccatgcagat gacattcagt acgttttcgg 10201 gaagcccttc gccaccccca cgggctaccg gccccaagac aggacagtct ctaaggccat 10261 gatcgcctac tggaccaact ttgccaaaac agggtaagac gtgggttgag tgcagggcgg 10321 agggccacag ccgagaaggg cctcccacca cgaggccttg ttccctcatt tgccatggag 10381 ggactttggg caagtcactt aacctccccc tgcatcggaa tccatgtgtg tttgaggatg 10441 agagttactg gcagagcccc aagcccatgc acgtgcacag ccagtgccca gtatgcagtg 10501 aggggcatgg tgcccagggc cagctcagag ggcggggatg gctcaggcgt gcaggtggag 10561 agcaggcttc agccccctgg gagtccccag cccctgcaca gcctcttctc actctgcagg 10621 gaccccaaca tgggcgactc ggctgtgccc acacactggg aaccctacac tacggaaaac 10681 agcggctacc tggagatcac caagaagatg ggcagcagct ccatgaagcg gagcctgaga 10741 accaacttcc tgcgctactg gaccctcacc tatctggcgc tgcccacagt gaccgaccag 10801 gaggccaccc ctgtgccccc cacaggggac tccgaggcca ctcccgtgcc ccccacgggt 10861 gactccgaga ccgcccccgt gccgcccacg ggtgactccg gggccccccc cgtgccgccc 10921 
acgggtgact ccggggcccc ccccgtgccg cccacgggtg actccggggc cccccccgtg 10981 ccgcccacgg gtgactccgg ggcccccccc gtgccgccca cgggtgactc cggggccccc 11041 cccgtgccgc ccacgggtga ctccggggcc ccccccgtgc cgcccacggg tgactccggc 11101 gccccccccg tgccgcccac gggtgacgcc gggccccccc ccgtgccgcc cacgggtgac 11161 tccggcgccc cccccgtgcc gcccacgggt gactccgggg ccccccccgt gacccccacg 11221 ggtgactccg agaccgcccc cgtgccgccc acgggtgact ccggggcccc ccctgtgccc 11281 cccacgggtg actctgaggc tgcccctgtg ccccccacag atgactccaa ggaagctcag 11341 atgcctgcag tcattaggtt ttagcgtccc atgagccttg gtatcaagag gccacaagag 11401 tgggacccca ggggctcccc tcccatcttg agctcttcct gaataaagcc tcatacccct 11461 gtcggtgtct ttctttgctc ccaaggctaa gctgcaggat cc // LOCUS HUMCFVII 12850 bp DNA PRI 01-NOV-1994 DEFINITION Human blood coagulation factor VII gene, complete cds. ACCESSION J02933 NID g180333 KEYWORDS coagulation factor; coagulation factor VII. SOURCE Human DNA, clones 7M1 and 7DC1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12850) AUTHORS O'Hara,P.J., Grant,F.J., Haldeman,B.A., Gray,C.L., Insley,M.Y., Hagen,F.S. and Murray,M.J. TITLE Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (15), 5158-5162 (1987) MEDLINE 87260948 REFERENCE 2 (bases 856 to 12446) AUTHORS O'Hara,P.J. and Grant,F.J. TITLE The human factor VII gene is polymorphic due to variation in repeat copy number in a minisatellite JOURNAL Gene 66 (1), 147-158 (1988) MEDLINE 88329723 COMMENT [2] minisatellite imperfect repeats only. Draft entry and computer-readable copy of sequence in [1] kindly provided by P.J.O'Hara, 26-JUN-1987. 
FEATURES Location/Qualifiers source 1..12850 /organism="Homo sapiens" /db_xref="taxon:9606" /map="13q34" prim_transcript 487..12686 /note="factor VII pre-mRNA (alt.)" prim_transcript 487..12660 /note="factor VII pre-mRNA (alt.)" prim_transcript 487..12664 /note="factor VII pre-mRNA (alt.)" exon <522..585 /gene="F7" /note="factor VII; G00-119-897" gene 522..585 /gene="F7" CDS join(522..585,1654..1719,4294..4454,6383..6407,6478..6591, 8307..8447,9419..9528,10124..10247,11064..11659) /note="factor VII" /codon_start=1 /db_xref="PID:g180334" /translation="MVSQALRLLCLLLGLQGCLAAGGVAKASGGETRDMPWKPGPHRV FVTQEEAHGVLHRRRRANAFLEELRPGSLERECKEEQCSFEEAREIFKDAERTKLFWI SYSDGDQCASSPCQNGGSCKDQLQSYICFCLPAFEGRNCETHKDDQLICVNENGGCEQ YCSDHTGTKRSCRCHEGYSLLADGVSCTPTVEYPCGKIPILEKRNASKPQGRIVGGKV CPKGECPWQVLLLVNGAQLCGGTLINTIWVVSAAHCFDKIKNWRNLIAVLGEHDLSEH DGDEQSRRVAQVIIPSTYVPGTTNHDIALLRLHQPVVLTDHVVPLCLPERTFSERTLA FVRFSLVSGWGQLLDRGATALELMVLNVPRLMTQDCLQQSRKVGDSPNITEYMFCAGY SDGSKDSCKGDSGGPHATHYRGTWYLTGIVSWGQGCATVGHFGVYTRVSQYIEWLQKL MRSEPRPGVLLRAPFP" intron 586..1653 /note="intron A1" exon 1654..1719 /note="optional" intron 1720..4293 /note="intron A" exon 4294..4454 /number=2 intron 4455..6382 /note="intron B" exon 6383..6407 /number=3 intron 6408..6477 /note="intron C" exon 6478..6591 /number=4 intron 6592..8306 /note="Intron D" exon 8307..8447 /number=5 intron 8448..9418 /note="Intron E" exon 9419..9528 /number=6 intron 9529..10123 /note="Intron F" exon 10124..10247 /number=7 intron 10248..11063 /note="Intron G" exon 11064..>11659 /note="factor VII" BASE COUNT 2532 a 3888 c 3902 g 2528 t ORIGIN 212 bp upstream of XbaI site. 
1 cccggcactt ctcagtgagg ctctgtggct cacctaagaa accagcctcc cttgcaggca 61 acgcctagct ggcctggtct ggaggctctc ttcaaatatt tacatccaca cccaagatac 121 ggtcttgaga tttgactcgc atgattgcta tgggacaagt tttcatctgc agtttaaatc 181 tgtttcccaa cttacattag gggtttggaa ttctagatcg tatttgaagt gttggtgcca 241 cacacacctt aacacctgca cgctggcaac aaaaccgtcc gctctgcagc acagctgggg 301 tcacctgacc tttctcctgt cccccccact tgagctcagt ggctgggcag caggggatgc 361 atggccactg gcggccaggt gcagctctca gctggggtgt tcagaggacg cctgtgtcct 421 cccctccccc atccctctgt cacccttgga ggcagagaac tttgcccgtc agtcccatgg 481 ggaatgtcaa caggcagggg cagcactgca gagatttcat catggtctcc caggccctca 541 ggctcctctg ccttctgctt gggcttcagg gctgcctggc tgcaggtgcg tccggggagg 601 ttttctccat aaacttggtg gaagggcagt gggcaaatcc aggagccagc ccgggcttcc 661 caaaccccgc ccttgctccg gacaccccca tccaccagga gggttttctg gcggctcctg 721 ttcaatttct ttccttctag aaaccagcat ccaggcacag gaggggaggc ccttcttggt 781 agcccaggct ttggcgggat tatttttcaa agaactttag gagtgggtgg tgctttcctg 841 gcccccatgg ccctgcctgt gaggtcggac aagcgcaggg agtctggggc ctctcagagt 901 gcaggaagtg cgcacagggt gctcccaggc tggggagcac aggtagggga cggtgcgtgg 961 gggatggcgc ctggggcatg ggggatgggg tgtgggaaac ggcatgtggg gcgtagggga 1021 tggggtgtgg aggatcgggg gtggggatgg cgtgtggggt gtgggggatg ggccgtgggg 1081 gggtggggcc tgggaaacag catgtggggc atggggtgtg ggggtgaggt gtgggaaagt 1141 gtgtggggtg tgggggatgg ggcatggaaa gggcgtgtgg ggtgcagggg atggggcatg 1201 gaggtgtggg ggatggggtg tgtggggtgt cggggatggg gcatgtgggg tgtgggggat 1261 ggggcatgga aaggcgtgtg gggtgcagag gatggggcat ggaggtctgg ggcatggggt 1321 gtgtggggtg tcggggatgg ggcatggaaa gggtgtgtgg ggtgtgggga tagggtcagg 1381 ggatggcgtg gggggtgtgg catggggatg gcacgtgtgg catggggatg gggatggggg 1441 gtggggcatg gccgagtggg gctggggctg ggaatggtga gtggggcatg gggatggcga 1501 gtagggggtg tggcgtgagg atggctagtg gggcgtgggg atggcgtgtg gggatggcga 1561 gtggggggtg ggctgtgagg gacagtgcct gggatgtggc tgcagcccta gctcacagca 1621 tggccttatg accccggcca ccttcctgcc caggcggggt cgctaaggcc tcaggaggag 1681 aaacacggga catgccgtgg 
aagccggggc ctcacagagg tgagcaggga ctgccactgg 1741 ttttgtcctg gggcccagtg ggggcaacat cacctccttc ccctcccatg gcaaagagcc 1801 agcccgcggg gtggctactg cagtgccccc caaggagggt gttccctgct cgagaggaag 1861 tgaccgctcc agcttggcct tccctgggac tggggtgcag gcgattttat cttctttgct 1921 ccattctgtt ccttccagat aatcgtgtgt tcttcatcag gttttcctca gttcttgaga 1981 gcttttctga tgcaaatctg ctttcacccc agggcggtca ccggctctgc tcacaccagc 2041 ctccaagggt gtgggtgtcc cgggagtgtg ggtgtcccgg gggcgtgggt gtcccgggag 2101 tgtgggtgtc ccgggggcgt gggtgtcccg ggagtgtggg tgtcccgggg gcgtgggtgt 2161 cccgggagtg tgggtgtccc gggggagtgg gtgtcccggg agtgtgggtg tcccaggggc 2221 gtgggtgtcc cgggagtgtg ggtgtcccgg gggcgtgggt gtcccgggag tgtgggtgtc 2281 ccggaggcga gggtgtcccg ggagtgtggg tgtcccgggg gcgtgggtgt cccgggagtg 2341 tgggtgtccc gggggagtgg gtgtcccggg agtgtgggtg tcccaggggc gtgggtgtcc 2401 cgggagtgtg ggtgtcccgg gggcgtgggt gtcccgggag tgtgggtgtc ccggagcgag 2461 ggtgtcccgg gagtgtgggt gtcccggggg cgtgggtgtc ccggaggcga gggtgtccca 2521 ggagtgtggg tgtcccgggg gcgtgggtgt cccgggagtg tgggtgtccc ggaggcgagg 2581 gtgtcccggg agtgtgggtg tcccgggggc gtgggtgtcc cggaggcgag ggtgtcccag 2641 gagtgtgggt gtcccggggg cgtgggtgtc ccgggagtgt gggtgttcca gaggcgaggg 2701 tatcccagaa gtgtgagtgt cccgggggtg tgggtgtccc gggggcgtgg gtgtcccggg 2761 agtgtgggtg tcccgggggc gtgggtatcc cagaagtgtg agtgtcccag gggcgtgggt 2821 gtccgggggc gtgggtgtcc cgggggtgtg ggtgtcccgg gggtcgtggg tgtcccggga 2881 gcgtgggtgt cggggactgc agggacatgg gcctcccctc ccactcctgc cgcccagggc 2941 acctcctgtg aggactcgga gtccgtgagt tcccacctcc ttgagcccga ttctttggtg 3001 tccccgcctg catcctcagc ctccttccaa accagaccag ttctctaggg gcgtcgacgt 3061 gtgaaactga ttttaaagaa aacaggcggt ggcctttctc tcggccccac gtggcccagt 3121 agcgctcacc ttccgtccct tcttccgcgc tcagtaacca atttaggccg ctcctgcaga 3181 actcgggctc ctgcccaccg gcccacagcg tccacctgag gcctcttcct cccagcaaag 3241 gtcgtccctc cggaacgcgc ctcctgcggc ctctccagag cccctcccgc gcgtcctctc 3301 agccccgctc gcctcctccc ggggcctccc tctcccgcct gcccccaggc ccgtctccct 3361 cgcgggctga ggcaggttcg gcagcacggc 
gcccggggcg ggggtcactc tccaccaccg 3421 cgtggtgccc acagctcacg gcgctcccgg gtgacggtcc cctcggctgt agggcgtcct 3481 gaagagcggc ctgctcggag ctgagcgcac ggggttgcct gcccctgggc gtctctggcc 3541 ctcaccagcc ccgtcttccc atgggcaaaa cggcggtcct gtttgtccac aagtaaccgt 3601 cggggttacg gaggggccag gagctgcggc ggggggctgt gctctcagga ccggccccag 3661 gaggatccgc gcgaggtctg gagctctcag gggtcgcggg ggacagaggg gccccaagcg 3721 gaggcgggaa ggcggcagaa gcccaggacc gccaagagct ggcgaggaag cccggggctc 3781 gctgtcgggg gagccgggca ggggccgcgc ctcggcacca ggacgcgagg cctgggaagg 3841 cggatctggc cgcgagcacg cggtgcgggt ggagacgcag ggatttggat ttccgcgggc 3901 gctgcacgga tttccacgcg cggttcacgt gggccccagg gggtgcccgg cacccggggc 3961 cgcgccgcct tctcctgccc ggcatcgacc cgcagcctca cgtttaccgc ggcgcccgca 4021 gcccccttcg cccgcttccg cgcgtgcccc cgagcgcgcc ctcgggatca gcccccggaa 4081 gcagagaggc caggccggga aggatgggcg aacggggtgg ctgacccggg agcacggcag 4141 ggaggacacc cagccaggcc cgcgagcagc gccgctcccc tcctccagga cgggcgggaa 4201 cctgcgatgc ccccgccgcg tgggccgtgg ggcggtctcc gaggcactgg gcggggcacg 4261 cggtgggcgc ttcacggaac tcgcatttcc cagtcttcgt aacccaggag gaagcccacg 4321 gcgtcctgca ccggcgccgg cgcgccaacg cgttcctgga ggagctgcgg ccgggctccc 4381 tggagaggga gtgcaaggag gagcagtgct ccttcgagga ggcccgggag atcttcaagg 4441 acgcggagag gacggtgagc ccagcctcgg ggcgccccgc gcggacactg cacggcggcg 4501 gtgaaccagg ccgcgtgggg ccgcctgcgt ctctttggct gcggcctgtg ggcggcgaac 4561 acgcagcggc gcccgcgcgc gcgctctctc tgcgggggtc gctttccgcc cggggtgact 4621 ccgctttcct gggcgatgcc cccaccccca ggcacgcgct ctccccgtgc ggccgcaccg 4681 cgcatgccgg ttttcacatc agaaaatacg atttgcacaa gcacacttag ggtgtccccc 4741 ttaacttccc aagggagtcc ccccagtccc cgaagtccag ggcagcctgc gcatcgcaga 4801 cgcgcgcggc tcgcagaagg gacgtggtga gaagctggcc cacagcatgc caccagcggc 4861 acctcctcag ggcacgtgtc ggggagaaac aacacttagg gaccctggga ctttctccag 4921 ctcacgctca cgggtccacc tcacactacc aagatcacct caatagacgg acactcacac 4981 agggcacact tcacactcac aggtcacctc acactcacag gacacctcac actcacaggg 5041 cacacttcac actcacgggt cacctcacac tccaagatca 
cctaaagagg acacctcaca 5101 cagggcacac ttcacactca caggtcacac ctcacacaga tcatctcatt ctcacaggac 5161 acctccctct cacaggtcac ctcacactca caggacacct cacagaggtc acctcacacc 5221 cacaggacac ctcacagagg tcacctcaca cggggcacac ttcacactca ggtcacctca 5281 cacccacagg acacctcaca gaggtcacct cacacccaca ggacaactca cagaggtcac 5341 ctcacacagg acacctcaca aaggtcacct cacacccaca ggacacctca cactcatagg 5401 cacctcagtc ttacaggaca actcacactc acaggtcacc tatctcacag gacacctcac 5461 actcacaggt caccttactc tcacaggaca cctcacacag ggcacacttc actccacagg 5521 tcaccatacc tcacacagat cacctcatac tcacagatca cttcattcat tctcacagga 5581 tacctcacac tcagggcaca cttcacactc acaggtcaca cctcacacag atcatctcat 5641 tctcacagga cacctccctc tcacaggtca ccttacactc atctcacact cacaggtcgc 5701 cacacctcac actcacagga tgcctcacac tcacagaacc acatctcata tgcacaagac 5761 acctcacact caggacacct catgctcaaa gaagcctcac actcacagga ggtccagctg 5821 tctgaggcaa aggctaacat gaccctttcc agacaaattg aggatggtca tgcctagcat 5881 ttttatacac ctagttttga aagcatttct catctgttgt attctcacag caccccgtga 5941 gtttaagttc aggtggccaa cagtttcttc agcaatcact tttttctgtg gagtgctttt 6001 gctgtttgtg gaatattttg catctgctac tgcaccctct ccccgtatgt gtggccaccc 6061 tgtcagaggt ggagctgtgg ctcagagcct gtgtacctcg tcccaggtcc acagctcagc 6121 gacagaagag tcagggttga acctcgggtg ttctgacttg ggagcaggaa atgtgtggtc 6181 acccatagtt ccagatgtcc tggggagggg ccaagattag aagaaaccta cctcagctcc 6241 agaggaaagt ctggcttcct gagcccaccc cgccagaccc aggtccaagt cccccaaccc 6301 cagttcatgg tgtgtccagt gcttaccgtt gggtgctctg gtgaaggtgc atctcacgag 6361 gcttgctctc ttgttccttc agaagctgtt ctggatttct tacagtggtg agtggatgat 6421 caccaccagt cctgcctgca acccttctca gcttactgac accagcccac tccacagatg 6481 gggaccagtg tgcctcaagt ccatgccaga atgggggctc ctgcaaggac cagctccagt 6541 cctatatctg cttctgcctc cctgccttcg agggccggaa ctgtgagacg cgtaaggccc 6601 cactttgggt cccatatttg cagagggccc tggggagctg gtggaggtgg cctggccaac 6661 cgggctgcag ggtgcaacaa cctggtgggg tgtgtaggcc gggcattcag ggctcagccc 6721 agttggaaat tggtctaggt gacctttaaa tcccttccag tctgaggtct 
ttgacaggga 6781 cccaaggttc tgattatcag actcagtggc ccccttcgcg gtcccggccc tgggcaactt 6841 ctcagccctg gagactggcc cagttgagag tccctgtgtc ccgtgtgccc attccagatc 6901 ccacctagct aggtacccgt ttggtaaact tccccttctc ctactttcca ttacaaaggt 6961 ttgaggggtt tgtttttttt tttaaccatc tgaatattaa attaatcaca aagtttaggg 7021 cccccaacct cccttgggtt cagtaattca ctagaaggac acatagaaat ccaaatatcc 7081 actgagtgga tacactcaca ggtaccgttt attacagcaa aggatgcagg cttaagtctg 7141 cagagggacc agggacaagc ttccccttgt cctctcctgt ggggtcatgt ggacatcctt 7201 aattctccca gaatgacgtg tgacgagaac gtgggaagta ctgccaaact tggggaacgc 7261 tacgagcccc gtgtccagag gtttgatcag ggctcaatga catagaccca gctgaccagg 7321 cacgcatggc tgacctcagt ctcagcccct ccagagctac gccgataatg cggccaaggc 7381 cccaccatac atcacattgt cagctagacc atccagcatg gctcaaggcc caggtaaaca 7441 ccaacattcc ctcaggcaag accttccaag ggcttagcgg tcatttccca ggagccaagg 7501 caaaggctac cctttctctg gcacagcagt tcatccttga ccacccaaga ccacattctt 7561 acactgaatg agctctcctg tgcagcagcc attttcttct ctaagcagaa gagagcccag 7621 caagctggag gaggctgaag agagaggctt cctgctggtc atctgggtcc agaatgcctg 7681 gagatctctg ctcagccctg gtgcccagca gccctggtgt gcatcctgca gggcagcctt 7741 cccgccggag tcctggactt gctcagggcc actccccttg cccatgtcaa ccaaagtcag 7801 gctgccggtt ctgcttcttc tgtctgagcc catgaccagt gctgggacta actgtccccc 7861 aggcgggctc acggtggtac gaggccagct tggagaactg tctcagctct ctggtcctct 7921 cgtcagttgg gtctctgatt ggaaagtccc ttggacactt taccatcccc attggacttt 7981 cactttcccc caggctccca tcagctgctc ggaagagtgg tcaccctgga ggccactgcc 8041 caccagccag gcacccccca aatgcaaccg cagccagcac tgccagccac tggcaaggct 8101 gttcagacat gtggctcctc tgatccacgc cttgtccttt ggatcagtcc acggagcagt 8161 gtgccaagct caggctctgt cacccacagc tcatgccacc ttccaggcag aacaccactg 8221 ctgacccagg ggcatggcca ccccgggggc tggcgtctcg ctgaccccca gaagcccctc 8281 tcagggtgtc cccttcctgt ccccagacaa ggatgaccag ctgatctgtg tgaacgagaa 8341 cggcggctgt gagcagtact gcagtgacca cacgggcacc aagcgctcct gtcggtgcca 8401 cgaggggtac tctctgctgg cagacggggt gtcctgcaca cccacaggtg accaggcttc 
8461 atgtcccagt cccagatgac accagtccct gtcccactag gattatctta ctggacaaaa 8521 gacgggtggg actggccttc acatctactg agcactaact atgcactgac caattgtgag 8581 gtgggatctg ggcaccaagg gtggcacagg ccagcagcga ccagtgacta ggatgggcac 8641 cctgggggca atccctgaat ggcctcaggc cccctgccaa cttctaggca gaccagggga 8701 gccaagcaag gcactatctc acgtccaact gcccactcgc aggaatcctc cgccagggtt 8761 catgaatcta cttcggcaca gccaatgtct gtactgactg ctgcccactc tgcattccaa 8821 aactcgtaaa ggctcctggg aaaatgggat gtttctccaa accagcctgg aacgaatggg 8881 ctgcacttcc aaaagcaggg acaccccaca cccactgtct ctaaagaggc ggaacgtgcc 8941 caccctggcc acacagcctg ggactcagcc tgccacctcc tcgggcttcc tttctggccc 9001 aagaccttga ttgaagcaga tcaaaactaa gcatgggatc aaaacaacac agtttgattc 9061 atctttaggt agaatttcat tcaccttcta ctaaagtcaa acaacacatc ttctccctga 9121 aaagtgagca gagggcggtt ttaagacgta agccctctgt ttcctccaaa accagccctg 9181 accattgtct cctcagccag ccacttcttc aagggcctct catggccggg ccccaccagt 9241 caggcccagc cgaggccctg ccttccacca cccctgggcc ctgggagctc ctgctcctgg 9301 gggcctccca tagcctcggc ctcaaggcct ctcagaggat gggtgtttct gaatctttcc 9361 tagtggcacg ttcatccctc acaaatctct gcatctttct gacttttgtt ttacacagtt 9421 gaatatccat gtggaaaaat acctattcta gaaaaaagaa atgccagcaa accccaaggc 9481 cgaattgtgg ggggcaaggt gtgccccaaa ggggagtgtc catggcaggt aaggcttccc 9541 ctggcttcag gattccaagc cctgagggtc ttgaagcctt ttgaatgtga acaacagctc 9601 tggaagggaa aatgggcagg tcagcccaag cccacaggct ccaagtcagc acacctagca 9661 cctccagctc gcggcacccc catgctttta gtggggcaag gaaggagaaa agaaaacgac 9721 actcactgag ggtctaccct gtgcagagaa ccctgcgaga tgccccatcc gagttgtcac 9781 gtcgtcctca cggttactct ttgaggtggg atctttgcct gatctttgca aaatcaggag 9841 cattggatca aagctatgtg aagatcctgt gaggtgaaca gtgaaatctc acagcgacat 9901 ttgtattctt gggccgtgcc caagagcacg tctcggctag agaggggcac agcctcccag 9961 agccaggtct gagcagcttt gcctgggagg gatctgcaaa gaccccagga tttcagaaag 10021 aaattgtgca atgccagagg ttccttggca tgcccgggag ggcgagtcat cagagaaaca 10081 atgacagcaa tgtgacttcc acacctcctg tccccccgcc caggtcctgt tgttggtgaa 10141 
tggagctcag ttgtgtgggg ggaccctgat caacaccatc tgggtggtct ccgcggccca 10201 ctgtttcgac aaaatcaaga actggaggaa cctgatcgcg gtgctgggtg ggtaccactc 10261 tcccctgtcc gaccgcggtg ctgggtgggt gccactcttc cctgtccgac cgcggtgctg 10321 ggtgggtgcc actctcccct gtccgaccgc ggtgctgggt gggtgccact ctcccctgtc 10381 cgaccgcggt gctgggtggg tgccactctc cgctgtccga ccgcggtgct gggtgggtac 10441 cactctcccc tgtctgaccg cagctctcaa gtgtctcagg ggctgtggct ctgggcttcg 10501 tgctgtcact tccacagaca gacagacatc cccaaaaggg gagcaaccat gctgggcacg 10561 actgcctgtg gcaccgtgct ctcagccact ttcccatgcc caaataaaac gataaaagac 10621 tgggggcttc tgcccatcct gcctcacttg accaagagcc cagaagagga tgcgacaccc 10681 agggcctcat gggaccaccg gctggcaggg gttctgctca ctgggtttat gggtgagacg 10741 agcactccca ggagggccac tgggccggga agaactgtgg agaatcgggg cacgccctgt 10801 cctcccagct gccagggcac agcatccctt ccccacctgc aacacccaga ccccagattc 10861 accccagttc acttgtcccc acacgagcca caggctgcca cctggggcag gctggcccac 10921 cttggggtta gatgcaggtc cccttgcccc agaaggagac tgcagcccct gcagacctag 10981 aaatggccac agcccatccc catgcaccag ggggtgaggt ggcaggtggt ggaaagggcc 11041 tgaggggggc ttcttccttc caggcgagca cgacctcagc gagcacgacg gggatgagca 11101 gagccggcgg gtggcgcagg tcatcatccc cagcacgtac gtcccgggca ccaccaacca 11161 cgacatcgcg ctgctccgcc tgcaccagcc cgtggtcctc actgaccatg tggtgcccct 11221 ctgcctgccc gaacggacgt tctctgagag gacgctggcc ttcgtgcgct tctcattggt 11281 cagcggctgg ggccagctgc tggaccgtgg cgccacggcc ctggagctca tggtcctcaa 11341 cgtgccccgg ctgatgaccc aggactgcct gcagcagtca cggaaggtgg gagactcccc 11401 aaatatcacg gagtacatgt tctgtgccgg ctactcggat ggcagcaagg actcctgcaa 11461 gggggacagt ggaggcccac atgccaccca ctaccggggc acgtggtacc tgacgggcat 11521 cgtcagctgg ggccagggct gcgcaaccgt gggccacttt ggggtgtaca ccagggtctc 11581 ccagtacatc gagtggctgc aaaagctcat gcgctcagag ccacgcccag gagtcctcct 11641 gcgagcccca tttccctagc ccagcagccc tggcctgtgg agagaaagcc aaggctgcgt 11701 cgaactgtcc tggcaccaaa tcccatatat tcttctgcag ttaatggggt agaggagggc 11761 atgggaggga gggagaggtg gggagggaga cagagacaga aacagagaga 
gacagagaca 11821 gagagagact gagggagaga ctctgaggac atggagagag actcaaagag actccaagat 11881 tcaaagagac taatagagac acagagatgg aatagaaaag atgagaggca gaggcagaca 11941 ggcgctggac agaggggcag gggagtgcca aggttgtcct ggaggcagac agcccagctg 12001 agcctcctta cctcccttca gccaagcccc acctgcacgt gatctgctgg ccctcaggct 12061 gctgctctgc cttcattgct ggagacagta gaggcatgaa cacacatgga tgcacacaca 12121 cacacgccaa tgcacacaca cagagatatg cacacacacg gatgcacaca cagatggtca 12181 cacagagata cgcaaacaca ccgatgcaca cgcacataga gatatgcaca cacagatgca 12241 cacacagata tacacatgga tgcacgcaca tgccaatgca cgcacacatc agtgcacacg 12301 gatgcacaga gatatgcaca caccgatgtg cgcacacaca gatatgcaca cacatggatg 12361 agcacacaca caccaagtgc gcacacacac cgatgtacac acacagatgc acacacagat 12421 gcacacacac cgatgctgac tccatgtgtg ctgtcctctg aaggcggttg tttagctctc 12481 acttttctgg ttcttatcca ttatcatctt cacttcagac aattcagaag catcaccatg 12541 catggtggcg aatgccccca aactctcccc caaatgtatt tctcccttcg ctgggtgccg 12601 ggctgcacag actattcccc acctgcttcc cagcttcaca ataaacggct gcgtctcctc 12661 cgcacacctg tggtgcctgc cacccactgg gttgcccatg attcattttt ggagcccccg 12721 gtgctcatcc tctgagatgc tcttttcttt cacaattttc aacatcactg aaatgaaccc 12781 tcacatggaa gctatttttt aaaaacaaaa gctgtttgat agatgtttga ggctgtagct 12841 cccaggatcc // LOCUS HUMCHYMASE 8124 bp DNA PRI 01-NOV-1994 DEFINITION Human mast cell chymase gene, complete cds. ACCESSION M64269 NID g180541 KEYWORDS mast cell chymase; serine protease. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8124) AUTHORS Caughey,G.H., Zerweck,E.H. and Vanderslice,P. TITLE Structure, chromosomal assignment, and deduced amino acid sequence of a human gene for mast cell chymase JOURNAL J. Biol. Chem. 
266 (20), 12956-12963 (1991) MEDLINE 91302311 FEATURES Location/Qualifiers source 1..8124 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="mast cell" /tissue_type="placenta" /map="Unassigned" CAAT_signal 5018..5023 /gene="CMA1" /note="G00-127-603" TATA_signal 5099..5104 /gene="CMA1" /note="G00-127-603" 5'UTR 5128..5157 /gene="CMA1" /note="G00-127-603" gene join(5158..5215,5886..5890) /gene="CMA1" CDS join(5158..5215,5886..6036,6786..6921,7108..7362, 7733..7876) /gene="CMA1" /codon_start=1 /db_xref="GDB:G00-127-603" /product="mast cell chymase" /db_xref="PID:g180542" /translation="MLLLPLPLLLFLLCSRAEAGEIIGGTECKPHSRPYMAYLEIVTS NGPSKFCGGFLIRRNFVLTAAHCAGRSITVTLGAHNITEEEDTWQKLEVIKQFRHPKY NTSTLHHDIMLLKLKEKASLTLAVGTLPFPSQFNFVPPGRMCRVAGWGRTGVLKPGSD TLQEVKLRLMDPQACSHFRDFDHNLQLCVGNPRKTKSAFKGDSGGPLLCAGVAQGIVS YGRSDAKPPAVFTRISHYRPWINQILQAN" sig_peptide join(5158..5215,5886..5890) /gene="CMA1" /note="G00-127-603" intron 5216..5885 /gene="CMA1" /note="G00-127-603" /number=1 exon 5886..6036 /gene="CMA1" /note="G00-127-603" /number=2 mat_peptide join(5891..6036,6786..6921,7108..7362,7733..7873) /gene="CMA1" /note="G00-127-603" /product="mast cell chymase" intron 6037..6785 /gene="CMA1" /note="G00-127-603" /number=2 exon 6786..6921 /gene="CMA1" /note="G00-127-603" /number=3 intron 6922..7107 /gene="CMA1" /note="G00-127-603" /number=3 exon 7108..7362 /gene="CMA1" /note="G00-127-603" /number=4 intron 7363..7732 /gene="CMA1" /note="G00-127-603" /number=4 polyA_signal 8020..8025 /gene="CMA1" /note="G00-127-603" BASE COUNT 2399 a 1928 c 1683 g 2113 t 1 others ORIGIN Chromosome 14. 
1 tcccagttaa tacataatca atatgcaatt tattaataca tctctccatg tccactcccc 61 ctgtatcttg ccattcttga cctgcatttc catcctcctt accttcccta gaggccaact 121 cattttcttt gaaaaacctg gcatttccca gaaaaaaaag tgaagggctg ggagctgtcc 181 gttgtcctga tttgctccct ctgcccttgc ttccaaatgt ggttggaaag aagcactatt 241 gaaaaatccc taaacgcacc cctgcagggt tggctctacc ctgtagccat ggacacatgc 301 tgttgatacc acctgcctca tgagtctcac ataatttgcc ctttcacact atctacccca 361 tcagccttac caaaaccata cctgcatcct gggcagcatc tgcccttcaa gagactaagg 421 aatctccttg caaccaagaa tgactagacc aatgagacac cctttaaggc cccagcacaa 481 tatagaaatc ccacaatatg gtaatcccag taaggagcta tcaagccatt gcaggaccat 541 ctagaataca actagagtat agttcctttc aatccaggaa ctatactcta acagcttggc 601 tcacaggaac cagaagtgaa gatgatgagg atcagggctg agcctgtgag caccagctcc 661 accactgaca ccaaccacag attaaacaag catcttgtgg acccctggga tggaaagaat 721 agttgttgcc ttatcaacct cccccacagc ccacacagaa aagataaaat catcatggct 781 acagtgttac agaagatgat gacccaagga gtaggcctgc ctgagtgaat gctgagagtg 841 ataatgggag cagtagcatc tcagagacta cagcagaaac catccacata aagagctttg 901 cccaaactta tgataaaggg caccctcaga gactctccct actttaatat tagcccattg 961 cagaaatggt gagtggaaag agaaatctta ggaagaaccc cttaaaaaag caaaatgctt 1021 tttaggtttg tgctgaagag cctggaaaag aaataaggac acacacgctg agaaatcttc 1081 ctcctgcccc aacactggga taatctccaa ggatctctcc atatctcatt ctcctggata 1141 cactgtccac tcagaaatat tgtgcagagt gcagtaattc aaaagtgagc tattgtgtta 1201 ggagtgaagg caagagtatc gtaaaataaa tcaaatttga aatgaattct cttaaattgc 1261 tttatagatg tttaatgtaa gccagcagct attaaacgat aaaccttaaa ttcgagaaaa 1321 acttggtcat tcagaaacta tagaaacagg caggacttat tgcgagggca aacacagagt 1381 gagctccagc ctgcttcagg aaaatctgcc agtgccatga aggatgtact ctgtctgctc 1441 cactgcacta ctgctcagta tgagcccatg ccatcagctg tccctgaccc acaggagttc 1501 tttagaagag actggtcaac aaaagtttct agggtgtttt atacctgcca actcgagggt 1561 taaaacaagt tgcatagaaa tgctcaatca agaaagacac agtcattact cagagaataa 1621 taaacagcct ggcagcacat gaatgaatag aaaaaagatg ttacatgcaa agcatgaaat 1681 aaccaaattc cataacagat 
gttaatctgt aatgtgttta ggagaattta gaggaagtat 1741 aagatttatt ctttcatcaa aaaaattata gccaatgagg atatatctat caattatcca 1801 tcaagtggtg atatggcagc acaaggtaaa acacaaagga ataaaaccaa cgtttattaa 1861 gaaccaatca tgtggcattt cacattgagc atcatattta attctgaaaa aaatccttgt 1921 actgtatcat tcttcatatt ttatggatgc agtaactaag gctgagaact ttaaaatttt 1981 tcctaagttc agacacatag ctaagtggca gaaccaagat tcaaactcac cccatctaac 2041 tgcagagcaa actgcatgcc ttaaatgtca aagtgaatac tagcacagtt aatacaatgt 2101 ttggaaactc agagaaggaa tgatccctct gcattatagt tactaaggaa tcattgccat 2161 tatttaaatg ccagtgcttc tacatcaggc ccaaattttc tgtcctacta actgtgaatc 2221 aagacttgat tcaacctcta cttgagtatc tgccgcaatg agaaatcact tacctccact 2281 aaccacacat ttattttata acaacagatt gttagtaagt cctttcttat acatactcaa 2341 cagctgcttc ccaagatgct gtaggattat gtctagagtc aaactagcca gaagcaatgt 2401 ccaaaataca ccataacact gtgcagcaaa ggtcctacta ccacttgttt ggcccaaaca 2461 ttctaggcag cactggatat ctgaatcatc aattatttcc acaaacactg acccctctac 2521 cagtcaccct cactagaaga attaattcca catgataata gctccctcat gttactccct 2581 tctaagtcaa attgtacacc cctttatctg attaacagag tctaagtcac atgacctaaa 2641 tgcaagagaa ctgggaatgg acgtttgtgg attctacctt agtaaggcaa agttatcatt 2701 gggaattcct ctaatacagg aagggtgttc cagagacatt aaggagccat ataaatggaa 2761 aatgtccact acaatccatc acttggttgc cccacatcaa cattcattct tttgccacac 2821 ttaaagtttc caagaacaaa aattatccca ctgaacataa tctttactat cttttatata 2881 aaggaaaatt agacttgact cagcagaact gaaataaccc agctctaaca gttactgctt 2941 ttaacttcaa gtactgtgtc tctaggtgat acctgctcca acaatagttt ggtcacattt 3001 tcaatttgat attctctagt ctcccaactt gataactgta ccctaaacca taaagttcac 3061 taccaacatg ctatatataa aataaccaaa gggggaagaa gaaagagaaa aaggaaatct 3121 cttaaaatac acaggtatac atatgacaaa gcaaagaagg aaatgtgagc agatagtgca 3181 gtcctcgttt ctgaaattgg tcccctgact ggggctatac ctattccatt tcctcaccct 3241 cagccaggca ggtggagcaa aaacttaagt cttggtggat ctgaatcttg atgctgtgga 3301 gctgtcttac tagccccaga ctacctgcct ctcaatttct aattatatca gtgaaagcaa 3361 acagctttga tttgtttaag cctctgattt 
tttggtctaa ctgatgtaag accacaagga 3421 caagagttct ccagctccgg attctcttct gttctgttaa tggtgaaatg cccgagagaa 3481 gagttgccaa ctttggcaaa taaaaaatac aggattccag ttaaattcaa atttagataa 3541 acaacaattt tttagtatta gtgtgtccca ttcaatattt ggacatactt aactaaaaaa 3601 tgatttgttg ttcatctgaa atacaaattt aactgggcat tctgaatatt ctctggcaac 3661 ccccgagaga gtgaagaaag tggtacaagg acacttaaga agaccagatt tgaaaagaca 3721 ttacggatgt gtttaaatgt cttattctag agagagttag agctgtaggt agaacttggg 3781 aaattaagtt aaaagcagac acagagacct ggccaatata tactaaggag tggatcactc 3841 tggtcacaag cccaacctga gaccaagggc atagtgagat gatttgggaa aggcacttat 3901 acactactca tccccgtctt tgaactaaat gccttataaa tctccaagag aaatgacagt 3961 ccaccatgtg gactgctttc tgtaagtcca gggaaaataa aagctatgtg cttgaaaccc 4021 acttctgata ttataaggtg tgtgatcttt gtcatgttaa tgggtctgag tatcaattct 4081 acaattgtaa agtgacagta atggtgtgtc cccaggttgt tgtggaaagc ttgattctta 4141 atgcaacagt aggaaacccc agcctctctg gagcaaacac ccttctacat ctttacttcc 4201 cctgcacatt ggcaggactc tattcctcta tttctctcta gtgctagagc agaaagggac 4261 cttgatttga tatcaggaaa atctatttct gaaccataag ctatgatagc tgatttaaaa 4321 aattgactat catgacatga taatgatcat aatggtaata catattgata gggttgccgt 4381 gaaagtaata atatatctaa gagttgtgac aatatatgat acgcctagac tctcagaaaa 4441 tgctaattcc aatcccaatt gctctttgca taaagttctg tcctagggtc tgttcttttc 4501 ccacatctac cctccttgga tctctcttct gtctttttca tgtggttcag aggaggagag 4561 agatccaggt caatgttttt caaattacaa ggaattatca tttaaatggg gaagaagctc 4621 aagttttgac gtgtagtgga attggagtgg agtggagtgg aatggaaact aacaggaaga 4681 cactgcacat ggttaagata aagattgttt cctgaaacct ttaatttgtg cttacatact 4741 cacacataca tatgtgcatg cactgggact ctgcaatatg catttctgac tatggaacat 4801 agccataaaa gtctttgcac tgaacgttca gtgggccttt cacaagctgc cctaattggg 4861 aaagaaaaac atggtccctc catttcctgc ccccaactcc agaaaagtca ccatagttga 4921 gggtacatct gagaagccag cacttgggag ttcagggctc aagttccttt ctagaaaaac 4981 actgggtgat tctaggggaa cttccgatca gaaacagcca attcagagtg agagaagaaa 5041 acgtgaccat gcagttcctg tggttaccag ccttgcccct 
ctcttgcctt ctgggagtta 5101 taaaacccaa gactggaaag gaaaaccagc atttgctcag gcagcctctc tgggaagatg 5161 ctgcttcttc ctctccccct gctgctcttt ctcttgtgct ccagagctga agctggtgag 5221 tatcagggtt cttccctctg aaatctgcag tatcagctcc tgaaacaaag atgtttagtc 5281 tgaaatagct gactcctaaa cagggttcca agatctctct tcaagagtcc cacagaggaa 5341 atttccactt gggatgtgtg ccaccccacc cccaccccca cccactgcca ttctctacag 5401 cctaggacac ccccaggaac aaggaatttc acctcaattg tagaaaagcc cagagcaagt 5461 ggaaggaaaa ggggtatccc caggaaaaca gacatgtcct cttaatcttc tgagcatcag 5521 ggctacccat tactttgtga ctttctcact ctgtgaccat gctcaagagc tatggagaaa 5581 tctaaaacag gaacctggac agtgggtcct acacagagac agaggagagt gggccagggc 5641 aaggtgggag tgggagaagt ctgagatgaa aacatcagaa tggagcagag gcaagaatga 5701 gatttcacct gggaggttat gggtggggaa agatacgaaa tacaggagac aggagaggga 5761 agatgggcgg aacacagggt gagaatgaga ttccagggaa gcctagctca gctttaaccc 5821 aatttgtcca ttcattggag agagtatcta tggccgtgtt caaaccctgg ggtgctctgt 5881 tccaggggag atcatcgggg gcacagaatg caagccacat tcccgcccct acatggccta 5941 cctggaaatt gtaacttcca acggtccctc aaaattttgt ggtggtttcc ttataagacg 6001 gaactttgtg ctgacggctg ctcattgtgc aggaaggtga gacaacaggg tctatttatc 6061 tccaaatggg agatgaacaa ccagagtagc atccaggaat acacctgcac tggggactga 6121 agagggggtc ctgggtcttg tcaactttca ggagagggaa gactttgggc tgaaagactt 6181 tagtctgtgt ttgaatagtt ccttgagcct cagtcactga gctaagctcc cttcggagga 6241 aaaggaggtc ctgtccgaag gtccctcttg ttgcagtagc acccctcacc cctacccaac 6301 tcaagacaca cggctcactt ttcagggccc cacccagtct cagggccact tcctctatgg 6361 ccttttcaag aacactggct ctagttctca gggtcctgaa cccatcattt tatgggagca 6421 gagaacaggt ctacataaga cccccacttt cccgttttaa ctgatatctc ctgcttcagg 6481 ggctggccct catgcagggt tccctgaatt aggaagtgtg aaccctgtcc cctgagtcct 6541 ccctggcctg ttcagtcccc agcaattcca ggggtcgtag aaattgtgtc tgtttcctga 6601 gaaagctctt tcatgagtta agcctgagcc ctcaaatgcc acaagtggcc catgaaaagg 6661 gagatgggta gagtccggcn acccagtgac agagtttagt cctcttttct cagaatgagc 6721 tcacctcaga agaaacccca agccatcact gtcgcctcct tttccttcct 
tcttcctcac 6781 agcaggtcta taacagtcac ccttggagcc cataacataa cagaggaaga agacacatgg 6841 cagaagcttg aggttataaa gcaattccgt catccaaaat ataacacttc tactcttcac 6901 cacgatatca tgttactaaa ggtgacaaca cctctcttct ccctttccac ttcccattct 6961 cctaagcttc tccttcaggt cctcattgcc ctgaattttt cttaggactt ggctataaca 7021 tgaagctact caccctgtcc ctccctgatc acctccaact gtccagagcc catttcgagg 7081 actgacagtc cttcattccc ttcacagttg aaggagaaag ccagcctgac cctggctgtg 7141 gggacactcc ccttcccatc acaattcaac tttgtcccac ctgggagaat gtgccgggtg 7201 gctggctggg gaagaacagg tgtgttgaag ccgggctcag acactctgca agaggtgaag 7261 ctgagactca tggatcccca ggcctgcagc cacttcagag actttgacca caatcttcag 7321 ctgtgtgtgg gcaatcccag gaagacaaaa tctgcattta aggtgatcct ccaactaggt 7381 ttcctctcca aaactcactg ttcagggacc tgaatgctct tagaaggaga tggggtcagc 7441 aggttgtcag tcaggtgaca gggtgagcat cacaggaatt gctgtcctcc cgtggtccaa 7501 gacagcctct gaccatccat tccagtctac tgcactgggg gcatggggtg actgtggaga 7561 atgtggatga cggtcccaag aaaggaagaa ggggcatcag aactagatgt ataagtgagg 7621 agctccacct cctgggtctg actttaggtc tcactgtgac tccaagctgg ctggcagaca 7681 ggagtggagg acttcccggg ctcaccttct tctctctctc ctccccctac agggagactc 7741 tgggggccct cttctgtgtg ctggggtggc ccagggcatc gtatcctatg gacggtcgga 7801 tgcaaagccc cctgctgtct tcacccgaat ctcccattac cggccctgga tcaaccagat 7861 cctgcaggca aattaatcct ggatcctgag ccagcctgaa gggaagctgg aactggacct 7921 tagcagcaaa gtgtgtgcaa ctcattctgg ttctaccctt ggttccctca gccacaaccc 7981 taagcctcca agaggtctcc tacaggtaac agaactttca ataaacttca gtgaagacac 8041 agcttctagt cgtgagtgtg tgtccctctc tgctgctctc ttctcctgca catgtgacct 8101 gattcccagc ccaagcacca agga // LOCUS HUMCKMT 6896 bp DNA PRI 02-MAY-1996 DEFINITION Human mitochondrial creatine kinase (CKMT) gene, complete cds. ACCESSION J04469 NID g180589 KEYWORDS creatine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 6896) AUTHORS Haas,R.C., Korenfeld,C., Zhang,Z.F., Perryman,B., Roman,D. and Strauss,A.W. TITLE Isolation and characterization of the gene and cDNA encoding human mitochondrial creatine kinase JOURNAL J. Biol. Chem. 264 (5), 2890-2897 (1989) MEDLINE 89123390 REMARK Erratum:[J Biol Chem 1989 Sep 25;264(27):16332] COMMENT Computer-readable copy of sequence for [1] kindly provided by A.W.Strauss, 23-NOV-1988. FEATURES Location/Qualifiers source 1..6896 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-hCK39." /dev_stage="adult" /tissue_type="placenta" /map="15" exon 241..552 /gene="CKMT" /number=1 gene 241..5570 /gene="CKMT" sig_peptide 404..517 /gene="CKMT" CDS join(404..552,1157..1355,1559..1654,2041..2262,2392..2477, 2593..2716,4378..4512,5014..5139,5343..5459) /gene="CKMT" /EC_number="2.7.3.2" /note="precursor" /codon_start=1 /product="creatine kinase" /db_xref="PID:g180590" /translation="MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRAASERR RLYPPSAEYPDLRKHNNCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFI KTVGMVAGDEETYEVFADLFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVL SSRVRTGRSIRGLSLPPACTRAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQ QLIDDHFLFDKPVSPLLTAAGMARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKG GNMKRVFERFCRGLKEVERLIQERGWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPL LSKDSRFPKILENLRLQKRGTGGVDTAATGGVFDISNLDRLGKSEVELVQLVIDGVNY LIDCERRLERGQDIRIPTPVIHTKH" mat_peptide join(518..552,1157..1355,1559..1654,2041..2262,2392..2477, 2593..2716,4378..4512,5014..5139,5343..5456) /gene="CKMT" /EC_number="2.7.3.2" /product="creatine kinase" intron 553..1156 /gene="CKMT" /number=1 exon 1157..1355 /gene="CKMT" /number=2 intron 1356..1558 /gene="CKMT" /number=2 exon 1559..1654 /gene="CKMT" /number=3 intron 1655..2040 /gene="CKMT" /number=3 exon 2041..2262 /gene="CKMT" /number=4 intron 2263..2391 /gene="CKMT" /number=4 exon 2392..2477 /gene="CKMT" /number=5 intron 2478..2592 /gene="CKMT" /number=5 exon 2593..2716 /gene="CKMT" /number=6 intron 2717..4377 /gene="CKMT" /number=6 exon 4378..4512 
/gene="CKMT" /number=7 intron 4513..5013 /gene="CKMT" /number=7 exon 5014..5139 /gene="CKMT" /number=8 intron 5140..5342 /gene="CKMT" /number=8 exon 5343..5570 /gene="CKMT" /number=9 BASE COUNT 1746 a 1718 c 1720 g 1712 t ORIGIN 227 bp upstream of PstI site. 1 catgccacat ccccggggcg ggagggggct acatccccgg ctttagacgc gcgagtctca 61 ggtcccgcta attacctggc gggtgctgcc cacccctgcc ctcgcgcacc tagcgcgtgg 121 cagcgggaag gcggggcctg ggggagcccc acccctggag actgcggctg gggcctccct 181 ctcctccgcc cgcccgcctg ccactagctc attgcgcctc tcctgcagtc tgattgggca 241 ccggctccca ttccggctcc agcctccaat ccgaccccca tttcggctgc agcctcggac 301 ctagctccgg ccctcggtct atccggttgc atcctccctc cctgttccgg atcttatctt 361 gcgccagcgc ctactccagg atcccgtagc cagacctcaa gccatggctg gtcccttctc 421 ccgtctgctg tccgcccgcc cgggactcag gctcctggct ttggccggag cggggtctct 481 agccgctggg tttctgctcc gaccggaacc tgtacgagct gccagtgaac gacggaggct 541 gtatcccccg aggtaacagt gcctgaggcg cgggaggagg cgggggcagg aggtgatggg 601 aacgaaggtg cgggtagaag tgagaatccg ggcaacagag aagggctata atcacgaagg 661 ccctggagct ggagggctgt gcagtctgca gacctcagtg gggtgggggt gggggccaaa 721 accataaagc aagaacattc ctggggacct gccaagacca gctctggccc tacgagttct 781 agctgcactg gctgcccaaa tccctaattg taaagccagg aactatcctt ttcgctcccc 841 tccatctcct tccctcattt cctcaattcc tctccttagg cttttcccct cctccatccg 901 tagtgttgtg tcatgggagg aaagaactga gcagatctga agaaactgag ctggccagcc 961 agaggcaact agaactatta ggaaagcata gactctgaaa gtccctaaag agattaccaa 1021 ggtttaccct ctttctaatt cccctcctcc cgcggagcaa agccagacat ggccaactgg 1081 acagctccca ggtaactgca ctaggtctag gcgtctgtga ccctccctcc atggttactg 1141 ggtaccccct ccccagcgct gagtacccag acctccgaaa gcacaacaac tgcatggcca 1201 gtcacctgac cccagcagtc tatgcacggc tctgcgacaa gaccacaccc actggttgga 1261 cgctagatca gtgtatccag actggcgtgg acaaccctgg ccaccccttc atcaagactg 1321 tgggcatggt ggctggagat gaggagacct atgaggtagg gggtccccag agtctccctg 1381 atgatccaat tcatcttccc agtaatccca gctcctttcc cttaaagacc tctcactttc 1441 ccccaagact ctgagccccc catacttaag ttttctgaac 
cagtgaaatc aatgcacaat 1501 tgaagtctgg ggagggattc cctctcctta accatctctc cctcttaact ccccttaggt 1561 atttgctgac ctgtttgacc ctgtgatcca agagcgacac aatggatatg acccccggac 1621 aatgaagcac accacggatc tagatgccag taaagtgagt tcaaatatcc cacttctgat 1681 ttgcattgcc tgtgtacaac actctgtatc tccaacccct tcaccttatt tcctgactca 1741 tggtcattat actgctgagc ttttaatctt aatgtaagga aagaatcata tcttaagggg 1801 cagcatatat ggagatggaa ggatagataa gaatgaccat gacccaaggt gggtggtttg 1861 gggacgggtc tgcaatgccc ccttcaattc cagtgctttc ccaaagggcc tcttcttcca 1921 atgcatgcag gaagaatgca cacagagtcc tctaatgcct aaggaaggtc tctcctttcc 1981 caggggccct cagttcccac cgtgtttctg tgacttacat tcatttccct tatctcccag 2041 atccgttctg gctactttga tgagaggtat gtattgtcct ctagagtcag aactggccga 2101 agcatccgag gactcagtct gcctccagct tgcactcgag cagagcgacg agaggtggaa 2161 cgtgttgtgg tggatgcact gagtggcctg aagggtgacc tggctggacg ttactatagg 2221 ctcagtgaga tgacagaggc tgaacagcag cagcttattg atgtgagggc cttaagaggg 2281 tgctggttgg tgggagcaga tggggaaggc tgggccagat gagacatggg ctctgaaagg 2341 cccaggggcc accatgaaga ttcttaaccc aagtcccgtt actcttccca ggaccacttt 2401 ctgtttgata agcctgtgtc cccgttgctg actgcagcag gaatggctcg agactggcca 2461 gatgctcgtg gaatttggta tgaagctgct cattacctct tttgtcttca tgccctcata 2521 aatgcttttt ttccctctat ctctcccaat tcttgccttg cctcttgatc actgtccctc 2581 tccggccctc aggcacaaca atgagaagag cttcctgatc tgggtgaatg aggaggatca 2641 tacacgggtg atctccatgg agaagggtgg taacatgaag agagtgtttg aaagattctg 2701 ccgaggcctc aaagaggtta gagaagacta tgtaggggag ctaggtggga ggacataagg 2761 aaaaccaaag agtagcataa atagattatg taatttacca accaacccag gacatgtctt 2821 atagtaaaaa ggactatcta ggactcactc caggactaaa ggtgtaaacc agctgggacc 2881 atactgggaa aaccaggaca tgtggtcaca ctaagattag gaaaagaaag agtgtcagga 2941 atcttaggaa gtgaacaagg cttttgacag agagtgcaaa gaaggaataa atgagatggc 3001 acgtcagtgc ctgggatgtg tgcagtggga tggtgaggtg tgcagataag gaaaacattc 3061 gagcttagat tgatgttggc ggggagaggt tgctgtgttc atgactctaa tataaccacc 3121 cagttctgag acaaggtagg ccttgactct ggattctatc attcttgtta 
aagtttcggg 3181 tctaggcttt aagttgagag ttcggagaga gactggggaa ggtggaggat agaatggttc 3241 gagttctaga atatgtggct ctagatgaga ggttgaactg aatcatcaat cctacatgga 3301 ttgggtctcc gtattcaagt ctacattaga aatccccata aactcaattc aattcttact 3361 gtatgttctc aaacatacag ttctatttta ggtttgcaaa gaaaaagagc tcctctttta 3421 gattctgaga agtttctact atttttggca agtaatagat aacatattct gactatgagt 3481 gggtagggaa gtacctttaa attatatgcc tcagtttcct catctgtaaa attgggataa 3541 tgagattttc tacattttag gttgttgtgg ggattaagtg aaatacaggt aaagtacttg 3601 gtccacagta agtgcttaat aagtgttaaa gtgttagctg caatattatt ctggatggaa 3661 gagtttcccc ccatgttcag catgtaagat atcccctatg gcatggttcc ttctgaacta 3721 taaagaggat ccctttactc atgttgggtt gtggtctttg tgaccatcat tctgctagat 3781 cccttgtctc ttgaactcta atagtcatct tcatgactac atggttaagt gaagccaaac 3841 gccttccccc cgccccctat tcctatgaat ctggcttttc tgctctgttt tcatctttct 3901 ctgcattcac acaggtgctc cgttcacagc taacagaatg ttatcttacc tcttcctggc 3961 aaagcttaca ccttcatctt ctgtctgaag ggacccttct aagctctagg ctcattagca 4021 aagcaaagat aatcgatgca tgcagacctc attgaataat cagtcatctc tcagttcagt 4081 ttaccacctc tgttcatttc cctagatcat ccttaataca ccactccttc gagttttctt 4141 cttccacata agatattttt tcacaatctc attattatgc acatcataat tttgcatcat 4201 gcatgcatga aaacaataac aaaccttttt catttaaaaa aagaccaatg tcattcattc 4261 acagccaagt ttctgttcta gacatatttc tagtgttctt gtgggtctag ctaagggagg 4321 gtccagggtt aatgaaatat ccctgatttt tcgttaacaa aacctttgtg gactcaggtg 4381 gagagactta tccaagaacg tggctgggag ttcatgtgga atgagcgttt gggatacatc 4441 ttgacctgtc catctaacct gggcactgga cttcgggcag gagtgcacat caaactgccc 4501 ctgctaagca aagtaaagga gttgtggggt tacagagggg tgtgagtaag gaagggtggg 4561 ttgtggatgg ggagggagtg gaccctttgg aaaggagcca aacatgttgt ggctaaaggg 4621 tcagaggaca ggccaggcac agtggctcat gcctctaatc ccaacacttg ggaggccaag 4681 gcaggcagat tacttgagcc caggagttca agaccagcct gggcaacctg gtgaaacccc 4741 atctctacct acaaatacaa aagttagctg ggtgtagtgg aggctgaggt gagaggatca 4801 cttaagcctg ggaagtcgag gcttcagtga gctgtgatca ctccagcctg ggtgacagag 
4861 agagaccctg tctaaaaaaa attaaaaaag aaaaaagaaa aaaggaaaaa aaaagttcag 4921 gagacagagc tctgagcagg ttcagggctc tttcaggtag gacctagtct ctgcctctat 4981 tgaccctgct cccaatccct atctcctctc taggatagcc gcttcccaaa gatcctggag 5041 aacctaagac tccaaaaacg tggtactgga ggagtggaca ctgctgctac aggcggtgtc 5101 tttgatattt ctaatttgga ccgactaggc aaatcagagg tgagatccta agggattagg 5161 acaaggagag gtataggtct gcgagggccg aaatatggca gtgagtgagc ctccgggatg 5221 taacataatc tgaaatgaaa ttcaggttga gtgggaggca attggaaatg agcaggcaag 5281 tcagtcagtg ataaagaaaa actcagactg taggaagcag atcaaagatt agtgtccctt 5341 aggtggagct ggtgcaactg gtcatcgatg gagtaaacta tttgattgat tgtgaacggc 5401 gtctggagag aggccaggat atccgcatcc ccacacctgt catccacacc aagcattaac 5461 tccccatcgc cagctgatga ctcaagattc ccaggagttt tgctcattct aatgatggcc 5521 cattctactt gctctggacc tgcccccgca tcccctgcct ccatcctagt aaagactcct 5581 tgctatgctg cagctgtctg tgttacttct aatggtgggg tgaggaggga gcagccttca 5641 ggaaatgaaa agaggcagtg ggattattta tgatggaaag agactccaga tatggcaacc 5701 caggaacact gattctcagg tgggtggaaa gcattaacat tttacccata ttcctcatca 5761 gcttctgaaa ataatcagga tgcacttctg tttgcacttt attcattatg acttaagatt 5821 tctctcccca caatctcctt ctactgtaga gacaggctca tagcaggtgg ccaaggaagc 5881 tgatagtcaa taccagggac caggaaggtc gtgaccagtc ctggaggccc caggctgtac 5941 ttcgacctat aatagacagg gaatgggagt aatatcacaa ctcagctctc caggagcatt 6001 gatacttgga aattagcgct ctgcctgtag actccttcac tccagggatc tccctgggtg 6061 cactctaaga gccagacagc accaaattag gggtttgatt ctgggtcagg agatggagga 6121 tcaagctgtg cagctgggaa ctcaccttgc tgttctgggc tctcctttcc ctcatgttgg 6181 gcccatgcaa ctgctcgtcg ctgctcagga ctcagaaagg ccatttgctc aggagtgaca 6241 gccacagcct gagcactggt gagactagat agttggatgg gactaaacac cacctgaggg 6301 caggggtagg aatcagtgca tgcatgtagt ccccattggg ccctggctct cctgtggtca 6361 ccccagtcca ttaatactta cagcaaattt aggaggaggg atgacagaaa tggcaagagg 6421 agtaacgccc tggatctgtc cccgcagcag tgctgaaaga gccaggtctg ggatcccagc 6481 tgttgaagca agtggcatcc aaacattgtc ttagactgac cttccctctc ttcaaaccta 6541 
tagaccttct ctaactactc ccaaagtgcc ctatcataga ccttccccaa tatgtctcta 6601 gccccttatt taaacaccct caggccccca ccttaagaat tgcagggcag tcttccatcc 6661 agtccaccca tggtatagaa accaaaccaa cttgcaccag cagtggccca gctccccacc 6721 tgctatggtg ccaatttcag tgaagatctc aggcccccag ttactgattg ggccaaaccc 6781 accaggcagt acaagtaggt gggccagaac ctccagttgt tcctcagagc actgcagatg 6841 cagggtgccg aggaagagag ctgcttggct gtagaacagt gggaaggaag gaagaa // LOCUS HUMCNP 1699 bp DNA PRI 30-APR-1991 DEFINITION Human C-type natriuretic peptide gene, complete cds. ACCESSION M64710 NID g180676 KEYWORDS C-type natriuretic peptide. SOURCE Human liver DNA, clones lambda-hCNP[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1699) AUTHORS Tawaragi,Y., Fuchimura,K., Tanaka,S., Minamino,N., Kangawa,K. and Matsuo,H. TITLE Gene and precursor structures of human C-type natriuretic peptide JOURNAL Biochem. Biophys. Res. Commun. 
175, 645-651 (1991) MEDLINE 91207363 FEATURES Location/Qualifiers source 1..1699 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" misc_feature 81..85 /gene="CNP2" /note="Y box" misc_feature 88..93 /gene="CNP2" /note="GC box" misc_feature 100..105 /gene="CNP2" /note="GC box" TATA_signal 133..138 /gene="CNP2" gene join(309..398,843..1133) /gene="CNP2" CDS join(309..398,843..1133) /gene="CNP2" /codon_start=1 /product="C-type natriuretic peptide" /db_xref="PID:g180677" /translation="MHLSQLLACALLLTLLSLRPSEAKPGAPPKVPRTPPAEELAEPQ AAGGGQKKGDKAPGGGGANLKGDRSRLLRDLRVDTKSRAAWARLLQEHPNARKYKGAN KKGLSKGCFGLKLDRIGSMSGLGC" intron 399..842 /gene="CNP2" BASE COUNT 279 a 568 c 599 g 253 t ORIGIN 1 gatccctccg gggtgggata agggagggga gcccccgcgg ccccctcccg gccctcggcg 61 cggccgcgtg cgtggtgtca ttggcccggg cggcccggtg ggcgggagga tgacatcagc 121 ggcaggttgg attataaagg cgcgagcaga gtcacgggct cagagcgcac ccagccggcg 181 ccgcgcagca ctgggaccct gctcgccctg cagcccagcc agcctgctcc gcatccccct 241 gctggtctgc ccgccgacct gcgcgccctc gctgccgccc gtgtgcgccc ctcgacccca 301 gcggcaccat gcatctctcc cagctgctgg cctgcgccct gctgctcacg ctgctctccc 361 tccggccctc cgaagccaag cccggggcgc cgccgaaggt gggtgctgtc gtggggacgc 421 cgagcctggg agaggcgtgg gaggctgggg gcttggagaa tgcggcgcgc aggacccagg 481 agagagggaa ggcaggcggc tgtctcctcc gagatgcgcg tgggcgagag ccggggagcc 541 ctcgaagcgc ggattcgggg gtccacttct ccagcctccg gagaacatcg gcccatgcgc 601 agccccctac cccagtgtgg cctgcccggc gagcagcaaa gggagggcag ggggcttccg 661 gagggagcgg cgaaggcggc cgcgtggcag gtggatgcgg ggccaagctg gccggcatcg 721 gtgggggcgg ctctgggctt gggagggaca ccccgcgccg gcgggcgcgt ggggctggag 781 catcagagtc ccccgtgctg cagccgcgtg tcccttcacc tgcccgctct ttcctcggac 841 aggtcccgcg aaccccgccg gcagaggagc tggccgagcc gcaggctgcg ggcggcggtc 901 agaagaaggg cgacaaggct cccgggggcg ggggcgccaa tctcaagggc gaccggtcgc 961 gactgctccg ggacctgcgc gtggacacca agtcgcgggc agcgtgggct cgccttctgc 1021 aagagcaccc caacgcgcgc aaatacaaag gagccaacaa gaagggcttg tccaagggct 1081 gcttcggcct caagctggac cgaatcggct 
ccatgagcgg cctgggatgt tagtgcggcg 1141 ccccctggcg gcggtgagta cggcccaccc gacgcccagc cccagcccgg cccgggaccg 1201 cccgccgccc agccagcctt cggaggcgcg cgagccgcct ttgctcaagt tgtgctaggc 1261 gtttgccagc cgcccccttt attatcccac tttacagaca aagaaagcga aggataacgt 1321 gatcggggaa ctttggcaag gtcagaaacg gctcagcctg gttgaaccca cctggcttct 1381 tctggagaag cagaaacagg cttggtggtg tctcacccac ccctgaaccg tagctgaact 1441 agcagcactg gcccctattg gccagctggt ggggggattg agaggagatc atgggtttgt 1501 gggagcagag aaggaaggtt acacccacaa gtccagggga catcgatcat ctgctggcca 1561 ccatgccccc tgtagtgaga gtagccctct gctggcactg tcagagcgcc cttctgcctg 1621 ggacactccg attcctgtcc cttctctaaa cccaggcagt gggcaaactg gtctgtccag 1681 ggtcctgagg cagctgcag // LOCUS HUMCOL2A1Z 31001 bp DNA PRI 03-AUG-1995 DEFINITION Human pro-alpha1 type II collagen (COL2A1) gene exons 1-54, complete cds. ACCESSION L10347 NID g450393 KEYWORDS alpha-1 type II collagen. SOURCE Homo sapiens male adult blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Cheah,K.S., Stoker,N.G., Griffin,J.R., Grosveld,F.G. and Solomon,E. TITLE Identification and characterization of the human type II collagen gene (COL2A1) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (9), 2555-2559 (1985) MEDLINE 85190534 REFERENCE 2 (sites) AUTHORS Baldwin,C.T., Reginato,A.M., Smith,C., Jimenez,S.A. and Prockop,D.J. TITLE Structure of cDNA clones coding for human type II procollagen. The alpha 1(II) chain is more similar to the alpha 1(I) chain than two other alpha chains of fibrillar collagens JOURNAL Biochem. J. 262 (2), 521-528 (1989) MEDLINE 90026318 REFERENCE 3 (sites) AUTHORS Vikkula,M. and Peltonen,L. TITLE Structural analyses of the polymorphic area in type II collagen gene JOURNAL FEBS Lett. 250 (2), 171-174 (1989) MEDLINE 89325561 REFERENCE 4 (sites) AUTHORS Ryan,M.C., Sieraski,M. and Sandell,L.J. 
TITLE The human type II procollagen gene: identification of an additional protein-coding domain and location of potential regulatory sequences in the promoter and first intron JOURNAL Genomics 8 (1), 41-48 (1990) MEDLINE 91184811 REFERENCE 5 (sites) AUTHORS Huang,M.C., Seyer,J.M., Thompson,J.P., Spinella,D.G., Cheah,K.S. and Kang,A.H. TITLE Genomic organization of the human procollagen alpha 1(II) collagen gene JOURNAL Eur. J. Biochem. 195 (3), 593-600 (1991) MEDLINE 91153296 REFERENCE 6 (sites) AUTHORS Vikkula,M., Metsaranta,M., Syvanen,A.C., Ala-Kokko,L., Vuorio,E. and Peltonen,L. TITLE Structural analysis of the regulatory elements of the type-II procollagen gene. Conservation of promoter and first intron sequences between human and mouse JOURNAL Biochem. J. 285 (Pt 1), 287-294 (1992) MEDLINE 92344585 REFERENCE 7 (sites) AUTHORS Ala-Kokko,L., Kvist,A.P., Metsaranta,M., Kivirikko,K.I., de Crombrugghe,B., Prockop,D.J. and Vuorio,E. TITLE Conservation of the sizes of 53 introns and over 100 intronic sequences for the binding of common transcription factors in the human and mouse genes for type II procollagen (COL2A1) JOURNAL Biochem. J. 308 (Pt 3), 923-929 (1995) MEDLINE 97104294 COMMENT Bases Reported in References REFERENCE 1 (bases 26401-26754,26809-26980,27089-27253, 27308-27488, 27597-27840,27895-28337,28446-31001) AUTHORS Cheah, Kathryn S E, Stoker, Neil G, Griffin, Jane R, Grosveld, Frank G, and Solomon Ellen TITLE Identification and Characterization of the Human Type II Collagen Gene (COL2A1) JOURNAL Proc. Natl. Acad. 
Sci, USA 82, 2555-2559 (1985) REFERENCE 2 (bases 1-85,5892-5908,6122-6154,6259-6291,6397-6450, 6614-6715,7694-7771,8399-8443,8555-8608,9009-9062, 9838-9891, 10266-10319,10451-10504,10811-10855, 11391-11444,14507-14551, 15031-15084,16598-16696, 16993-17037,17130-17228,17418-17471, 17996-18103, 18469-18522,18607-18705,18846-18899,19340-19438, 19832-19885,20292-20345,20696-20749,20993-21046, 21289-21333, 21480-21578,21813-21920,22263-22316, 22595-22648,23025-23078, 23451-23504,23759-23866, 24358-24411,24856-24909,25656-25817, 26015-26122, 26293-26400,26755-26808,26981-27088,27254-27307, 27489-27596,27841-27894,28338-28445) AUTHORS Baldwin, Clinton T, Reginato, Anthony M, Smith, Carol, Jimenez, Sergio A, and Prockop, Darwin J TITLE Structure of cDNA clones coding for human type II procollagen. The alpha1(II) chain is more similar to the alpha1(I) chain than two other alpha chains of fibrillar collagens JOURNAL Biochemical Journal 262, 521-528 (1989) REFERENCE 3 (bases 86-4190) AUTHORS Vikkula, Miikka, Metsaranta, Marjo, Syvanen, Ann-Cristine, Ala-Kokko, Leena, Vuorio, Eero, and Peltonen, Leena TITLE Structural analysis of the regulatory elements of the type-II procollagen gene JOURNAL Biochemical Journal 285, 287-294 (1992) REFERENCE 4 (bases 4191-5891) AUTHORS Ryan, Maureen C, Sieraski, Madelyn, and Sandell, Linda J TITLE The Human Type II Procollagen Gene: Identification of an Additional Protein-Coding Domain and Location of Potential Regulatory Sequences in the Promoter and First Intron JOURNAL Genomics 8, 41-48 (1990) REFERENCE 5 (bases 20346-20695,20750-20992,21047-21288, 21334-21479, 21579-21812,21921-22262) AUTHORS Vikkula, Miikka and Peltonen, Leena TITLE Structural Analyses of the Polymorphic Area in Type II Collagen Gene JOURNAL FEBS LETTERS 250, 2:171-174 (1989) REFERENCE 6 (bases 5909-6121, 6155-6258, 6292-6396,6451-6613, 6716-7693,7772-8398,8444-85541-30997) AUTHORS Huang, Min-Chi, Seyer, Jerome M, Thompson, James P, Spinella, Dominic G, Cheah, Kathy S E, Kang,
Andrew H TITLE Genomic Organization of the Human Procollagen a1(II) Collagen Gene JOURNAL FEBS LETTERS 195, 593-600 (1991) REFERENCE 7 (bases 8609-9008,9063-9837,9892-10265,10320-10450, 10505-10810,10856-11390,11445-14506,14552-15030, 15085-16597,16697-16992,17038-17129,17229-17417, 17472-17995,18104-18468,18523-18606,18706-18845, 18900-19339, 19439-19831,19886-20291,22317-22594, 22649-23024,23079-23450, 23505-23758,23867-24357, 24412-24855,24910-25655,25818-26014, 26123-26292) AUTHORS Leena Ala-Kokko, Ari-Pekka Kvist, Marjo Metsaranta, Kari Kivirikko,Benoit de Crombrugghe, Darwin J. Prockop, and Eero Vuorio. TITLE Comparison of the Human and Mouse Genes for Type II Procollagen (COL2A1). conservation of the relative Sizes of 54 Introns, about 70% of 25,000 Base Sequences of the Introns and Over One Hundred Sites Throughout the Gene for Binding of Common Transcription Factors JOURNAL Manuscript, in preparation. FEATURES Location/Qualifiers source 1..31001 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /dev_stage="adult" /sex="male" /tissue_type="blood" /map="12q13" gene 1..31001 /gene="COL2A1" exon 1..85 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=1 CDS join(1..85,4191..4397,5892..5908,6122..6154,6259..6291, 6397..6450,6614..6715,7694..7771,8399..8443,8555..8608, 9009..9062,9838..9891,10266..10319,10451..10504, 10811..10855,11391..11444,14507..14551,15031..15084, 16598..16696,16993..17037,17130..17228,17418..17471, 17996..18103,18469..18522,18607..18705,18846..18899, 19340..19438,19832..19885,20292..20345,20696..20749, 20993..21046,21289..21333,21480..21578,21813..21920, 22263..22316,22595..22648,23025..23078,23451..23504, 23759..23866,24358..24411,24856..24909,25656..25817, 26015..26122,26293..26400,26755..26808,26981..27088, 27254..27307,27489..27596,27841..27894,28338..28445, 28803..29091,29546..29733,30077..30319,30855..31001) /gene="COL2A1" /codon_start=1 /db_xref="GDB:G00-119-063" /product="alpha-1 type II collagen" 
/db_xref="PID:g450394" /translation="MIRLGAPQSLVLLTLLVAAVLRCQGQDVQEAGSCVQDGQRYNDK DVWKPEPCRICVCDTGTVLCDDIICEDVKDCLSPEIPFGECCPICPTDLATASGQPGP KGQKGEPGDIKDIVGPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGN PGPPGPPGPPGPPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAP GPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARG FPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGR TGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPR GEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPGPQGATGPLG PKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGE RGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRP GDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPG APGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGD QGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPP GAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAG ANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQ KGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNP GPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPG PQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGP AGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDR GEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPG PSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAGPPGN PGPPGPPGPPGPGIDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLKSL NNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG ETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFL RLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKD GCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL" intron 86..4190 /gene="COL2A1" /note="G00-119-063" /citation=[6] /number=1 exon 4191..4397 /gene="COL2A1" /note="G00-119-063" /citation=[4] /number=2 intron 4398..5891 /gene="COL2A1" /note="G00-119-063" /citation=[4] /number=2 exon 5892..5908 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=3 intron 5909..6121 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=3 exon 6122..6154 /gene="COL2A1" /note="G00-119-063" /citation=[2] 
/number=4 intron 6155..6258 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=4 exon 6259..6291 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=5 intron 6292..6396 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=5 exon 6397..6450 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=6 intron 6451..6613 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=6 exon 6614..6715 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=7 intron 6716..7693 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=7 exon 7694..7771 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=8 intron 7772..8398 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=8 exon 8399..8443 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=9 intron 8444..8554 /gene="COL2A1" /note="G00-119-063" /citation=[5] /number=9 exon 8555..8608 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=10 intron 8609..9008 /gene="COL2A1" /note="G00-119-063" /number=10 exon 9009..9062 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=11 intron 9063..9837 /gene="COL2A1" /note="G00-119-063" /number=11 exon 9838..9891 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=12 intron 9892..10265 /gene="COL2A1" /note="G00-119-063" /number=12 exon 10266..10319 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=13 intron 10320..10450 /gene="COL2A1" /note="G00-119-063" /number=13 exon 10451..10504 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=14 intron 10505..10810 /gene="COL2A1" /note="G00-119-063" /number=14 exon 10811..10855 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=15 intron 10856..11390 /gene="COL2A1" /note="G00-119-063" /number=15 exon 11391..11444 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=16 intron 11445..14506 /gene="COL2A1" /note="G00-119-063" /number=16 exon 14507..14551 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=17 intron 14552..15030 /gene="COL2A1" /note="G00-119-063" /number=17 exon 
15031..15084 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=18 intron 15085..16597 /gene="COL2A1" /note="G00-119-063" /number=18 exon 16598..16696 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=19 intron 16697..16992 /gene="COL2A1" /note="G00-119-063" /number=19 exon 16993..17037 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=20 intron 17038..17129 /gene="COL2A1" /note="G00-119-063" /number=20 exon 17130..17228 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=21 intron 17229..17417 /gene="COL2A1" /note="G00-119-063" /number=21 exon 17418..17471 /gene="COL2A1" /note="150960; G00-119-063" /citation=[2] /number=22 intron 17472..17995 /gene="COL2A1" /note="G00-119-063" /number=22 exon 17996..18103 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=23 intron 18104..18468 /gene="COL2A1" /note="G00-119-063" /number=23 exon 18469..18522 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=24 intron 18523..18606 /gene="COL2A1" /note="G00-119-063" /number=24 exon 18607..18705 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=25 intron 18706..18845 /gene="COL2A1" /note="G00-119-063" /number=25 exon 18846..18899 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=26 intron 18900..19339 /gene="COL2A1" /note="G00-119-063" /number=26 exon 19340..19438 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=27 intron 19439..19831 /gene="COL2A1" /note="G00-119-063" /number=27 exon 19832..19885 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=28 intron 19886..20291 /gene="COL2A1" /note="G00-119-063" /number=28 exon 20292..20345 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=29 intron 20346..20695 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=29 exon 20696..20749 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=30 intron 20750..20992 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=30 exon 20993..21046 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=31 intron 
21047..21288 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=31 exon 21289..21333 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=32 intron 21334..21479 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=32 exon 21480..21578 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=33 intron 21579..21812 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=33 exon 21813..21920 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=34 intron 21921..22262 /gene="COL2A1" /note="G00-119-063" /citation=[3] /number=34 exon 22263..22316 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=35 intron 22317..22594 /gene="COL2A1" /note="G00-119-063" /number=35 exon 22595..22648 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=36 intron 22649..23024 /gene="COL2A1" /note="G00-119-063" /number=36 exon 23025..23078 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=37 intron 23079..23450 /gene="COL2A1" /note="G00-119-063" /number=37 exon 23451..23504 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=38 intron 23505..23758 /gene="COL2A1" /note="G00-119-063" /number=38 exon 23759..23866 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=39 intron 23867..24357 /gene="COL2A1" /note="G00-119-063" /number=39 exon 24358..24411 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=40 intron 24412..24855 /gene="COL2A1" /note="G00-119-063" /number=40 exon 24856..24909 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=41 intron 24910..25655 /gene="COL2A1" /note="G00-119-063" /number=41 exon 25656..25817 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=42 intron 25818..26014 /gene="COL2A1" /note="G00-119-063" /number=42 exon 26015..26122 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=43 intron 26123..26292 /gene="COL2A1" /note="G00-119-063" /number=43 exon 26293..26400 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=44 intron 26401..26754 /gene="COL2A1" /note="G00-119-063" /citation=[1] 
/number=44 exon 26755..26808 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=45 intron 26809..26980 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=45 exon 26981..27088 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=46 intron 27089..27253 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=46 exon 27254..27307 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=47 intron 27308..27488 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=47 exon 27489..27596 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=48 intron 27597..27840 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=48 exon 27841..27894 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=49 intron 27895..28337 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=49 exon 28338..28445 /gene="COL2A1" /note="G00-119-063" /citation=[2] /number=50 intron 28446..28802 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=50 exon 28803..29091 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=51 intron 29092..29545 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=51 exon 29546..29733 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=52 intron 29734..30076 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=52 exon 30077..30319 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=53 intron 30320..30854 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=53 exon 30855..31001 /gene="COL2A1" /note="G00-119-063" /citation=[1] /number=54 BASE COUNT 6451 a 8344 c 8801 g 7405 t ORIGIN 1 atgattcgcc tcggggctcc ccagtcgctg gtgctgctga cgctgctcgt cgccgctgtc 61 cttcggtgtc agggccagga tgtccgtaag tcttcccccg ccgctgcctg cctgcctgct 121 ttccatgcgt ccctcagcat ccttctcccc ggcccgctcc agctctggag cccgcggctc 181 cgggctaaaa cggctcccgg ggtcgtagcg cgccgactta ggcacaggac acgcagaagt 241 tcaccaagaa gagttctgcc aatcaagact ctgtcccagg gtcctcggtg cccatcgcag 301 ttgcaagtat ttgcaggtcc ctacgttgcg ctagaatact gaacttgcaa agtgttggct 361 cggagaagtt tgcgcacaga 
tataaatggg ctcttttcca ccagctttga taattaggcg 421 cacatgcaca cagctcgcct cttcgaagca cttcgagttc agcaaaaaca gatctcaact 481 catcgaactt aggtgaagta ggaaagagag agcgcgacgg ggagcaagca aacgccaaag 541 ggttgacttc acagcctgtc caaggcttgg tggctggtgg gctcaaagca gagttagaca 601 aaggggacta acactctgac actggtgggc tgaaatccca ggccacaaag aacggcttcc 661 gataggccct ctgagacctc agcgcctctt tagggtaccc tccccctccc agctggccct 721 ggagcaaggt gcagccctag cgctcatctc gacttccctc cgtccgcctg cgcctctctt 781 ctgataaagg gtacagaaac ttccagtagg agaggccatc tgaaagacga taacattcca 841 accagaccgt gcttttcaaa tgcccccgaa aatagcgccc ccttccccgc ggtcttaccc 901 cattccccgc cgccccgagg tactacaatg agttactttt ctaaattctg gaactcaccg 961 agccaggctg cgtggtgtgt gtgtgtgtgc gtgtgtgtgt gtgtatgtgt gtgtcaggga 1021 aggggagcag gctggtcgat tgctacggtt gctacaacta tttcaaccgg tatagttaga 1081 gatggctctt gtagtcgggt ccaaatgctg ttcggactgc acctttctac ccctcctttg 1141 gtaaggtcca ctgtctggga ttatattcag gacaaacgaa gcctggaaag tgtattaggt 1201 agagaggatt tttttttcca cgtgtttggg cacgtttccg acggctggga ttccagccct 1261 gtctttgtat gttacagatt gtaaatcaat cgcagaggga aactcttcgg cgggggaaat 1321 aaaagttctc tgccttcgag gctctgtggg ccctctcctg ccaccaggct gtttccaggg 1381 atagcgtgga aggcggcggg ctcaggcggg ctttccggtc attagcgcag cgggggcagg 1441 gctggagcct gcggcgcagc tgcgaggagc cgggagcagg agactctggc cgggtcaccc 1501 ggtagtgcgc taagctggag gcgcgctcct gggcatttga ggaatacagc gtgactatac 1561 gtggcctgga ctcagactga ctatattttt gtactaaatt tacaagcaca cgcccacaaa 1621 gctgtcttct tgactgaccc ctgccttagt gagcaatgga attagctggg tggctttaaa 1681 ataattctca aattctccat ccggtattag ggtcgcttgc ttaattaggc ggtagaggtc 1741 tctcatcgcc gcatctttcc tgggagggag tgattccaca gcttctccgg cccaaacctt 1801 ccagtcgctc ctcctcccag agggagtgtg attctgcatc cgagaggctg attttgcgcc 1861 ctggagcatc ccacctttta taacttcccc cgcctggggt cagcggaccc aaaggtgtga 1921 cgtggggaaa tgcgcagtct gcgtggacgt caggaatgtc agacacctag agctcggcca 1981 cacccctcct ctccatcttt ccacgagttt gagaaactta ctggcggcgg cgtctttgac 2041 cctcatctgc atttcagagc cctcgcctcc gaaagtgccc 
ctggctcagg ggagagatct 2101 caatcctcct ttgtgaggct tgtttgcatt gggagattgg cagcgatggc ttccagatgg 2161 ggctgaaacg ctgcccgtat ttatttaaac tggttcctcg cagagacctg tgaatcgggc 2221 tctgtgtgcg ctcgagaaaa gccccattca tgagagacga ggtccagtgg gttctctcgt 2281 actcccagac cccctctccc acaatgcccc cctgtgcccg cccgccgcca cctctcggct 2341 ccagccctgc gcagagcggc ggtgaagcaa aacagttccc cgaaagaggt agctttttaa 2401 ttggcttgcc acaaagaatc acttatacgg ccctgcggta atgaggggaa ccggatcagg 2461 cgcgccggga tgctatcggc agccgttttg gagcagcaat tatggtggtg ctgggctcct 2521 ccgtccacac ctaggggatc cggttacggc gctggctcct ttctggggca gtcatttaat 2581 cccacttttc actctcccgg tgtctgtgag cgagccgtgt ccagagccgc agccacagag 2641 tcactcagcg gctcttacac ccagcgcagc ctggccccgc ccctgcgccg gcgcttcccg 2701 ggccgccctt ccccgggaaa tctgatccgc acggggagtg gcccctctcc tagcatttcc 2761 ccctctcctc cctgggtcct catgggcgag ggtgggctct cctgtagtct gggctggagc 2821 gcattaaccg atgccccctc tcccacacct tcctcaccgc ctgcattcca ctgctccagc 2881 tattttaacg gcgggtgtgt ccccgcaact tctgtatttt ccctggaatc cctcaccctc 2941 ctgtgattat cttgcccaaa ggctaggcgg atttcttcta gtgggaaagt aaaaaggaac 3001 gtttatcttt ggattttcac tctctttaaa gagcagtggg caggctcgtt tctttctccg 3061 cctctgggtt tgtggctctt tcctattatt catcccctgc tgctgctatt gccttgggga 3121 ttttgatgag aaaaacacgc tgggcgctcc ctacgacgtg gtgcggctct acagcccttg 3181 gctgctaagg agcgctcttg tcagcacagg tttcatttgc agcatgaatt ccagacggca 3241 gggcgctggt ggaggagact agtccctgct attcttcctc tgcagtcttg gaggaggcca 3301 ggcctggact ggcaatctta gccctagcca ggtattcaac gacccctgct ccccaaactg 3361 gggtgctgtt ttcagatgga ggcagggcct ctccaggcag ggctacaggt ggaggtcagc 3421 actgggggcg ctttggctcc actggcctcc taagcagttt attagcctgc ccaagcccca 3481 agtgtattgt ttgaatgggt ctatccccct ccccaaattg gtcctaattc taatatggtt 3541 caaagaatga gacaagatcc taattctaat agctcgtctt ttcacccccc tttcttatat 3601 acctattttt ggagcctcac tgcttataga ttccaatttt tgtaggtaga attttctaca 3661 ttccctctga atgttagttg tcagttgtat ttagctaatc ccataattcc cagaggaagg 3721 cagaagaaag aagacttctc tgctcctggg ctggtggaag ggaggtctcg 
ccatttttct 3781 gtctcctttc tttttatagt cccagaattc ctattcagaa tatcttgtct cctcccttcc 3841 gctcaccctc caactccctc cacccactcc atcacctggt ctcccccgta ttaggtgggt 3901 aaagagaata tagtatagta accccccacc ttcattgctg ggtcaagatt ttcactggtg 3961 aatagacaac atggtgcaag gtgcataata aatatttgtt gaatacatgg aaaaatcaat 4021 gatgttttag gaaaataatt tttaagttct atatgtccag gtggccccag cctacattct 4081 tcagcatttg aattctgtca agttgactgc aacctctctc tttttctctc tggctcccca 4141 ccccctcctt cccttggctc tctgcttctc cctccccacc cttggtgcag aggaggctgg 4201 cagctgtgtg caggatgggc agaggtataa tgataaggat gtgtggaagc cggagccctg 4261 ccggatctgt gtctgtgaca ctgggactgt cctctgcgac gacataatct gtgaagacgt 4321 gaaagactgc ctcagccctg agatcccctt cggagagtgc tgccccatct gcccaactga 4381 cctcgccact gccagtggtt gtaatttatt tatttcctgt tcaacataaa taaattactt 4441 gcaagcactg caaacacgct cccatagatg ctggtcgtct ctgcaaagca gaggggctag 4501 ttatccatgg gacctggtag ctggggtaga aaaggaaaag gccacttctc acttgcaggt 4561 tgaaactgag tgaatgagcc tgagacacta gaggggtcct tctttgccca acatctccaa 4621 aaacatttgc ttccaagaca catgaaggac agatgtaatt ctacaaaaaa aaaaaaaaaa 4681 aatcctctct gaaatcatct ctgcaaatta ctagagccac tatggagatc aaatgctctg 4741 tcttggccaa tccacgaatt aattcctcct ctgccaccga taccttgtct tctccttaga 4801 agacttctat gtatgtggtc ttcagtgtgg agaaagctct gccagctagt ggggagactg 4861 caggggcaga ggctccctct ttgagttatg gaacattggt ggtagtttcc tctctgctat 4921 tacctctctt ggagttgacc attaattcag aagcaaaata ataagagagg gaagggctag 4981 gctttgggag ttctagtggg gacgggtgga gacagagccc catgtatctg cactgtagtg 5041 ggtggttata aactcccagt tagatccagt gctggtggat gatatatgtg caggtgaccc 5101 cttccccagc attcaataca agatgtccta tctcccctgc agagtgagtg gggacgcttg 5161 tgtaggtttt ttgggtagct cttgctgtcc ccttcctgct gaagtagaga aggccgtggc 5221 aagggaagtg agaagctgcc tttccttaac acttcaccaa cactggctcc ctaatgtgca 5281 cattcccaga tcctttctga ggggcccgtg tgagtgaagt gttgattgcc tttactattt 5341 tgctgctact gtgaaggaga ggttattgac tggggtggca caggctatga tgctccgatg 5401 ctcttcataa ctcatatgcc ttgctgtttt tgtgttttta tttgtgcttg cttcaaggag 
5461 acccagctct aatgtaagac ctttctaagt acctaactct tcctctggga gggcttgggg 5521 ttcgggaacg gctccctacc tgtgggggga agagagactg aatctgtgct ttccttcttg 5581 tggctgatta gatcttgagc tcttcattgc ctttttgtgc tgcccttgct cctttctttt 5641 gcatgctgcc tgctttttga ataacaaagc ctgggtcacc tccatatcct catgggacct 5701 cagcaacccc aggccacagt ggccctaaca ccccaacaga ggggttcagt ggagtcacag 5761 gaacgtgccg ccttccttga ttgtgtcctt ttacttgttt gatctaatga gtgagtgttt 5821 gagtgacaag aataggtatt tttccatctc aagattctta ccttcttctt ctctatattt 5881 tttccttgca gggcaaccag gaccaaaggt aagggctttc ttctttttct tttttcatat 5941 ttttttggct ttatattttc tgcttcaaaa gcaatgctat gttaatccag tctgtgattt 6001 tttagacatc agaagatatc tgtttcagag ggtacctcaa cacaggggct gctggcaggg 6061 ttttagacta ggggcttagt gggcttactc ggcttaatcc tgtgaatgtt tcatgtttca 6121 gggacagaaa ggagaacctg gagacatcaa ggatgtaagt gcaaattatt ctcacccggt 6181 attcgacgtc gtcgtctaaa tgggtcattt ccttgtgctc tcctctaact taccatcctg 6241 tggggctctc tctcacagat tgtaggaccc aaaggacctc ctgggcctca ggtaagagag 6301 ggagaaaatc tctttctccg tcccttcctc gctgcgcaag ttactgatct gtaactcctg 6361 gccttgctgt catcttacca tgttcttcac cttcagggac ctgcagggga acaaggaccc 6421 agaggggatc gtggtgacaa aggtgaaaaa gtgagtaaaa agcaatgctg cttgaccctg 6481 gtggacttcc caggtccccc aaggccccac catgtgttta agggcctggt cacctcttaa 6541 agagcagcca agggacagat ggctcttgga gaaacactgc ttcccattga tgcctttttc 6601 tctttatgcc aagggtgccc ctggacctcg tggcagagat ggagaacctg ggacccctgg 6661 aaatcctggc ccccctggtc ctcccggccc ccctggtccc cctggtcttg gtggagtaag 6721 tatccttact tcccattcct tcaggctgtc cctccagaaa tgtggctttt aaattgctgc 6781 ttgcacttac ctggctggct cccagggctg ccagcagtgt gtacatagcc tgtccatggg 6841 ctttgctcag gcctgtaatt tagaagagtc acatattagg catgagactg tggtgctaag 6901 ggctggcttt tttcactaac tgggattcta taaagaaagt cctcagttac ctggcttcct 6961 ggcatctgta ccacgtagtt gatgctgggg ggtgggtgta agggatagga ggaaggatga 7021 ctgggcactt gtatttccct ggaagacgag tgaccactgt ccttggaaga catttatcct 7081 tggttcttgc caagtacatt ccaagcaact attcactctc atgaaagagc tccactgagt 7141 
gaaggtgtgt ggctaaagtc aattctggaa tcaaaccaat caacaaatta tatgattgcc 7201 tagtttttgc aggtttgcta ttttgatgtt tgctgtattt taaattctta aactcaaatg 7261 ggatcacaga tgcctactac atctcttgct gaaatattcc aagactgttg attttagtct 7321 tttgctgggc actaaagtct aaagaataaa gaacaccctt agaaggtttg gttatgtttc 7381 tccatatacg ttaaataaca tctgtcatat tttagagcat aaaaataatt ttataaaatg 7441 aaatgcaagg aactgatact tcctcaaata acactttccc ttccagtgaa atgattttgc 7501 cactgtcatc taataatcca ttcccaaaat tactttccag gcatcagtgg taattctgat 7561 caatgatagg ttagtctcca acaatcagag tttatctcag tagagttctt tgtatccata 7621 tagagactta cctagccaag tagggaagac ctagtgcctt tcaacctcct aacgttgttg 7681 ggtttctttg cagaactttg ctgcccagat ggctggagga tttgatgaaa aggctggtgg 7741 cgcccagttg ggagtaatgc aaggaccaat ggtaagaaaa gacactagtt ctttgcagcc 7801 aaaatggcag gaggtggccc ttagcagagc cagagagtct gacaacctct gctttacaga 7861 taattgctta gagtggctct cctccgtagt tatgtaacct cccattcagc tagcccaaag 7921 catttggttt ttaatggcaa tggatgccac ttttaatgat gcgctggagt gactaagaag 7981 aatgaagatg ggagatgcat ataggctgat ctgttagaag gccagttgct attgctcttg 8041 gaatgagaac tgaagaatgc agacagcagc tactgttctc cagcatccac agacttccag 8101 caggccctct cagcccgcag ctctgacttg gcacatgcta aatgaaactc agcctttagt 8161 aaacatggct gctgtccagg agaaagcaag gccagctttt ctgtccaaat ggtgcctata 8221 aataaaaata gagtgttgcg tggggagtgg gaaatgagag ggagcagcca ctctaggccc 8281 cttgcccaca gagtaacttc ttgtcctttg cccgggctgt tggctgggag aagatggcac 8341 actggaggcc actgaggaag catgtgtagt aaacccctca ttttctgttc cgatgcaggg 8401 ccccatggga cctcgaggac ctccaggccc tgcaggtgct cctgtaagta tctgcaagtc 8461 tttttgcctc catcgtgtcg cagatgattc ccaagcacta tgatgtttta gcagtttata 8521 gggattgacc tggtatcctc attttacttt ttaggggcct caaggatttc aaggcaatcc 8581 tggtgaacct ggtgaacctg gtgtctctgt gagtaccagc acggccctgt cccttctctg 8641 ggggagcctc taatgataga ccactaggac gcagctgctg tccctcccag ctctgcccag 8701 ctctttccca cagtcggtgg ccccaaggaa attcggatgt cacttcctag ctgtggagga 8761 actctcacag acagcccaat gtggcaagga ccaccaggga ctctgtccta acagcccctt 8821 tggggtcacc 
ccagcctgtg ctatctgctg caatcccact atgatctctg cacctttgct 8881 ctgaccttcc catctttctt cttcatagaa gaactggcat tccaaaacta caatgtcaaa 8941 gttttgtcca ttgcttaggt gtcttcccac tataaccatc tcttaaacta tcttcctttg 9001 tttgtaaggg tcccatgggt ccccgtggtc ctcctggtcc ccctggaaag cctggtgatg 9061 atgtgagtat acacgagtag acaaatgagg agctgcctcc tttgaaaggg cctggagagg 9121 gtgtgtgctt ggggagtgac agggaggcac ccagggtgga ggtatcttga ggagcaagac 9181 tgggcagtcc caaaccctga cgccatctcc tatctatatg gccactgtga ctgtgctggc 9241 aagttccctg gggaccgctt tggatccaag gggaagacaa ataattaaaa catcattagc 9301 cccaggaagg gaaattgaga aatgagagaa gggagagaaa aaatacaagg cagaaagatg 9361 tagagaagga aaaacaaaga aagaagcgtt caacaaccca gcattatctt aattgtaaat 9421 gagttagaaa aagcacagcc tgagtcagga tgtctacaaa ggatgcaaac tgaaatgaag 9481 agacaagaat tggcactctt gtcgtatttt tatgaattcg attagacagt aaaagtctct 9541 tgaggttaga gagagcacat acagtcagca gaacctagga gaggagagaa aagcctctca 9601 ggggaagttg gaggctggtg aggacagagg agcttgccca tggcgtatgc atgtgtccaa 9661 aagaataaat ggtgacccat gaaaggcatc caggcacgtg gagtctgaag gaggtgaggg 9721 agatgagtga gccggtacag aaggcatgga gggctggaag gagggaagcc ctctgggtgc 9781 ccccactatg ctactgcgtc tctgaggaag ctgggatatc tctctctctc ccttcagggt 9841 gaagctggaa aacctggaaa agctggtgaa aggggtccgc ctggtcctca ggtaaacgcc 9901 accgttccca gcctcaggca tctttcctag cgtctccctc cctgtggcct taaacacagt 9961 gcatccagtt caatgaggtc acttctgaga tgaaacgcca gtagccccta tatttatcac 10021 gaccatgttt gtaatttcca ctcaggctct catgagggag gctgggcagt tgttatttat 10081 accactttgc ataaaatggg gggtacgggg aggggtggtc gtggttttac agaaagagct 10141 gtccaagtgt ggggattcga gacaacgccc tggtggcgaa gggaactgga ggccctcctg 10201 cagccagggc agctttccac tgttatttta ctctgtgctc tgaacacctc cactttggat 10261 tgcagggtgc tcgtggtttc ccaggaaccc caggccttcc tggtgtcaaa ggtcacagag 10321 taagtatcac gggtgagaag gttggaagga agagatgcct ggtgggagag aaaagcactt 10381 tggggtgcgt gcatttcttc caacttgggt ttcccagaag tctgattgaa cattttctct 10441 tgttccctag ggttatccag gcctggacgg tgctaaggga gaggcgggtg ctcctggtgt 10501 gaaggtgaga 
ggccagaaag tacaatggga tggggaggag ggagacaatg aggagcccct 10561 cttcctagcc agggagacac tgtggagctc agtggaacta gctcctcaga acagccttgg 10621 ctgggaacac cagccctaca tcctgatggg ccaacagcag gcctggagag ctcagggcat 10681 tgtccctcac aggactgaag tttgtgtcag tgcgagctga gatgaccagg gcttttggcg 10741 tcttccctag gagtttgctg gcggccaaga atggggtccc agacactgac cttgtgcatc 10801 atttttccag ggtgagagtg gttccccggg tgagaacgga tctccgggcc caatggtaag 10861 tatggacacc ctccaggaag gtttatccaa agactcttca gactatcaga tggctgcaaa 10921 gagctccctt tgtgcaaagt tcatattctg tgttgtagat ttcatctgat tgtgagcaaa 10981 aagcaaaatg tattagacag atgatttgtt caagatttca ccaacatttc cttaagatag 11041 ccatgttatc acactaaaga tgctcccatt ttaaaaaatt ctgttgagtc tcaacatttt 11101 gtcaagctca tctactgcaa ggagcaaggt gtgcttgtaa caaaggttcc caataggtag 11161 caacaggaac attcgtgtgt tccgcctgtg gagaaactgt tgggtgtgat ctgaagcatc 11221 ctggctagtc aaggagccag caccatcagg aggtccttgt tttcctgggt gtgggcatcc 11281 tccctctcct ctggtatccg caaagggcct gcaggtagaa atggtcaccc tgagcaccgt 11341 aaagccaact catgcttagg ctgtcctggt gtgtgttcca gggtcctcag ggtcctcgtg 11401 gcctgcctgg tgaaagagga cggactggcc ctgctggcgc tgcggtgagt aattgacaaa 11461 gccaaacacc accatttgcc gagcacttta gagtttacag gtttgtttct cttgaccctc 11521 gaaacaaacc tgtgaggcat agggagtatt gctatccctt aagaattcac cccagggttc 11581 catcaaagct tccaggctga gtctcacagt gaaggaggaa ggataggaat gggagggtcg 11641 atgggtgaaa gcatgattct cttaaccagt ccagattatc aggtaatccc ttcaacaacc 11701 accacccact ccctgggcaa tccagctgga gtttacagac agacttagct ggctatagca 11761 ccaccgtgct actctctgtt cttcctggtt gctcaaatgc cctagaaaag tggaacaggt 11821 gagcatcaac tcacagggct ctatgctggc tgctgctgcg agggatgtta tgctatagta 11881 ccaggggcca ccattccata ggcacttcct gtgtttaata ccctatagct ttacttcatc 11941 tcatcttcct ccatatcctg agaggtggtt ctattcttct accccatttt acggatgaaa 12001 aaaccgagac acagaaaggt gaaactagct taagataaat ggtgccttgc agccttagac 12061 tctggtggcc tctagttaat gtgggaaatt aagggtgagg ggattggcag ctgatggagg 12121 gtgcagggtg ccagacagag gcgtttagct ctgatccctt agcaatagag agtccttgta 
12181 ggcacttggt caggcgagtg atgcgatgaa agctgtgttt aagaaagatt atgctttctg 12241 ctgatttcat acccccaaca cccaagctct gaggcccctc ctcacaggtc cttgcagggc 12301 tggccaaaat aaagcagctt cactccgttg tgctgctttc cagctaatgt gtctgtttgg 12361 cagaagtttc cctcaaaggc agatcagtga aataagcaga agcctcgacc cccctttgtc 12421 agccagagct gctgaagtgc cttgccccag ggtcactttg tgtgagggga ttagagagca 12481 ctggggctgc caagaaacac tgccgtttct acagattagc aggacgctgg cttgtggctt 12541 ttagcgaggc tcagagctgc ggtggcccta gtctgcatgg gctaaagaca agctccatct 12601 cctgtccttt ttccctcctt cctgggcaca gccgccctgc ttcttggttc tctctgttgg 12661 ttcctgtccg cacggtagtt aggctggcag cgtgtgtagg atttggctta gaagattgac 12721 aacattgcct ttgagccctt ctttgctact cctccctctc ccctcccatc agactcctct 12781 ctggagtctg ctctgcgagg cctctgctct gtggtatccc agcagccttc tcagccttga 12841 cttccagaag ggggctgtgc agtgtccggg gtgtgcaggc cccagacacg gggtaggctc 12901 atggagatcc aagtgctgat ctagtgtcaa ggctggcctg gagactgggc tgggttggtg 12961 tcagcctgct gtggtcatgt gccctcccaa gggcctgtat cctctctcca gacttgctgc 13021 agggagaggt ggcagatgtc agcctagttc tggcctctca gagcagcatg gcagctccct 13081 ttcactcagg cccaggctgg gcctcctgct ggctgaccct ggggagaggg tgctccagag 13141 caccccaagg aacagcttcc cgaagcagcc aggccagccc agaggggctg tggccaatcc 13201 tgaagcttta tgttcctgct gacatttttt ctaagttttc tcttgctttc ctcttaaatg 13261 ccaatctgga gagtctccgt taggagaaat ggaccccagc caggaagaag agttgagttg 13321 tatttaaaac acgagctccc cctaaagcat ccttctttag cttctaagga gaggcagaga 13381 ctgacaggca ggactcagca ggtaaaagta cccccctgac ctgctcagtc agcctaggcc 13441 cagctccacc cagcctgtgg ggcccagagt ttcggtaaag agttccctgg gccttaagga 13501 accttgagag agcatttgag gggtgccacc acaaacttgg cagaaaaaac cctccccctc 13561 caagtccagt cctagagaag gagctggcaa ccttgccttg ctttgtaagc aaaagcctct 13621 tagggcttga gtctagatgt agtgtttgag ctgtggctgg tgccctcccc catcagggag 13681 ccaatggtag acatcctatg ggcatctttg ttttccgtaa gagcaggctg ctcggggatg 13741 ggccagagga agaggcaacc tggagtcaac caagaggagg ccttaaccaa gccttaacca 13801 cagaggttaa ccaagccttg aaagcgcttc cccctgagca 
ggccaggaag cactgagtcc 13861 acatggttgc ctcgctgttt catttcctta cactcaattc tctcagtctt taaatgatca 13921 cttggccttg aagttacgga tatttggggt ctgaactgaa gttgaagaaa agaggaaatg 13981 atttaagctt tgtttaagat taggggccag gtgcggtcgt cacgcctgta atcccagcac 14041 cttgggagcc tgaggcgggt ggatcacctg aggtcaggag ttccagacca gcctggccaa 14101 catagcaaaa cccagtctct actaaaaata acaataaaaa aattagccag gtgtggtgac 14161 acatgcctgt aatcccagtt actcaggagg ctgaggcaga attgcttgaa cttgagaggt 14221 ggaggttgta gtgagccaag accgcaccac tgcactccag cctggcgaca gagccaagct 14281 ccgtctcaaa aacaacaaca aaaaagatta gaagaagccc attactgcct tctggccacc 14341 cactcgcaca gacaccaaaa ctgcagccca cacctcgcca tcctcgtgct ctgccctggg 14401 acaccccagg cacagtgtgt ccttcgtttt ctgtaagggt gggctgggag cagggacgga 14461 cagggcctgt gggcacctct catggtcact tccttcttgc tcacagggtg cccgaggcaa 14521 cgatggtcag ccaggccccg caggtcctcc ggtaagttca tttcatcctc agcaggtcat 14581 tgttgctgtg ctttaagtcc cgttaagcag cccaaggcag tctcgagggt gtattgggtg 14641 caaccacagc agcactctga tgtctactgg aaagggggag gaaagagaag aagtttgtaa 14701 atatcaattg agcatatcga taacaagctt tgaagcatgg gctcattttc ctcagccatc 14761 ctttcagcag tctttttaga ggagggaggt caaaggagtt tctgcttctc accacagatg 14821 tagtcagaaa cttgctttgc cttctgaagc caggcaaagc ttcctgggga cgctggcaat 14881 ggggacaatt ttcatccaag gccttttagc cacaatggat atggagtgaa atcagtacag 14941 aggagggaag gagtgtgagg tgtcggggtc gctcgctttg gaggccagaa ctggcattca 15001 cctctcttct catccgccta ctctctccag ggtcctgtcg gtcctgctgg tggtcctggc 15061 ttccctggtg ctcctggagc caaggtacgt gccctgttgt ccagtcagga acttctgggt 15121 gccgagaagc tgtcctttcc ccgtaaccct tgctcattgc tccctcaaca accacctgct 15181 cccttctgag aagtagctcc ccaccacccc acccactggc ccctccatcc aggcagggca 15241 aaaagccaga cactcgcagt ctcacctgga gggaagtaag acagaagata aaatgtggga 15301 gatccagtta caactttgga gtggggaaag gtggacagag aagaagacgg ggatacacca 15361 taggcctggc aggggcagaa ggccaggagt ggcagcacag ggaagcaaac ctaggggaga 15421 cccaacagct gagcaagctc ggccggtgac gggcatcgga gggaactggg cagggaaaag 15481 ggcacaggca ggagcccctg 
ctccctctgg gtttctgctt tatttggggt gcctggctct 15541 tccaaaccat gttaacggag ttctctggag gattactaga ggccagtggg aggccagcca 15601 gttcagggac aggcctcgca gcccaggaag gattccagtg tgaacgtccc tgggaatgaa 15661 taaggagcct ccatgtgtca ctggcatcag gttgcttttc cctcctgggg ctttccatgg 15721 caaccagaca gtgtctgagg tccggagccg ggtgaaggag acccattgtg aagagggaca 15781 gcggaaggtg aggggggctg acctttggaa aataataatt accacagtga agcaggaatg 15841 ttctgagaag aaacctgagg agctctgccc tctctccagg tcagcagccc tccccaggga 15901 ctctgccatc tagagtgggt tgtaattttc aggaaaaaat gaaagtaaaa gcacaagcca 15961 ttttgtgggg agggggcttg ccagaggcgc ccgctaaggg gaattgggct gtattgagag 16021 cagggagggg cagagtcccc atgtgctttt gccttggctt tctggcttac tgagaacaga 16081 ctggggccgg agccagggtg tcactgttca cccatcagcc agatgggagt gaggtggtgc 16141 tctgagctgg gatgttcaga gacttagaag ggacctcagc tcctcaataa aatagaaaaa 16201 caggaggtgg gggagagagc ggtgtccgtc catcatccca cggtgccagg atggcagggt 16261 ccccagccca cgcttttctg atggtgtcga tggaacagca ggttgcccat tgctgtagta 16321 tgtagctgtg ccgtggcatg tggaggctca ctgtgtagag atgaggtaag cagtagagga 16381 ggcaggcgtg ggaagtcatc aagtcatcag ctcggtcagg cagggagaaa aacggcagcg 16441 tgaactgtgt gtgaaccgac atgttcatgt gcagggttgg gtgcatgtgc ataatttagt 16501 gctgtcgttg cagctggacc ctgagctatt gcccacccac tagaggtctg tgtcccctct 16561 cttcttcttc atttcatacc tccctgtctc ttcccagggt gaagccggcc ccactggtgc 16621 ccgtggtcct gaaggtgctc aaggtcctcg cggtgaacct ggtactcctg ggtcccctgg 16681 gcctgctggt gcctccgtaa gtgcagcttc tctttggcct gggggggtct ggggtctgtg 16741 gctttggaac tcttgactct gtactttgct ctgacagttg tgggctccaa ccaccaaacc 16801 ttcattctgg cccaatgcct gtcccacctc tagatgtatt cccttctatc ccatcttccc 16861 cttgaaacac atagtgggaa tgtccctgaa atggacagca cctatgccag gtccctggat 16921 ctggatcctg gagggctgga ggtggttggg gttcattctt tgctgcttat ttgacaatgt 16981 ctcccttttc agggtaaccc tggaacagat ggaattcctg gagccaaagg atctgctgtg 17041 agtgttgccc gtggactttg ctaccccagg agagcccagt cctgcctctc ccctctcctg 17101 acacccctcc cttcttctca tgcccacagg gtgctcctgg cattgctggt gctcctggct 17161 
tccctgggcc acggggtcct cctggccctc aaggtgcaac tggtcctctg ggcccgaaag 17221 gtcagacggt aagagcccaa agtgaccccc aagttccact gacatctctg gagtcaaacc 17281 ccatcacccc tctttcccat gctctcctgc cctggcctca cagcggcctc catccgaggg 17341 catcttgaac aggggttctg gggaggggca ggctccctgg agagaatctg gtgtgaggac 17401 ctgcctctct tttcaagggt gaacctggta ttgctggctt caaaggtgaa caaggcccca 17461 agggagaacc tgtgagtatc tgcccccaag cccttgtctt ctctgctgct gttctatgag 17521 gcacagcctc agccccactg acccaccacc tccctcctcc agggctctat cccccaatct 17581 gggtcctttc agattatgcc tggaggagac ttaactgggc tgagaaggcc cagatacagc 17641 ttcagctccc atccttggtt tggctagtgt gaacagttgg atctttagcc cctctcactt 17701 ccctctgccc tgccatggct cgtcctttat gcctggagga gacttaacag ggactgagaa 17761 ggcccagata cagcttcagc tcccatcctt gggttggcta gtgtgaacag ttggatcttt 17821 agcccctctc acttccctct gccctgccat ggctcgtcct ttatggcctc tcgtcctcaa 17881 gccccccccc agccctgaaa cagttgccaa ggctacttcc ttcatactct agatcgaggc 17941 ttgctccaag gccaggtgaa ggctcactct gtttctcttt tttgctggtc ctcagggccc 18001 tgctggcccc cagggagccc ctggacccgc tggtgaagaa ggcaagagag gtgcccgtgg 18061 agagcctggt ggcgttgggc ccatcggtcc ccctggagaa agagttaagt gaatgtggag 18121 gctccatccc atggggcctg tgacctcgag agggaagtgg agtccttgtg gtccgtgttc 18181 tggtcaagtc ccgtgacttt tccgcatgtc atcctcctct ttctccatcc tctccgcggg 18241 agagggagtc tgatcccgag ttgtgccgcc aaccaccaga ctgacatgaa atagtctgag 18301 ctccttccca ggaagcgggg caggctccag aagttaacct ctgagaatcc tgcaggccac 18361 agctgctccc cagaaattgg ggttggtggg ttagtgggat ggacccactg gagcctggct 18421 gggttgggct gttctcactc actgcctctc ctccctgtgg ctccttaggg tgctcccgga 18481 aaccgcggtt tcccaggtca agatggtctg gcaggtccca aggtgagtgg gagaagaggg 18541 gctggggtcc tccctgcatc gctgaggtca catggtatcc cactgactcc ctgtgtaccc 18601 ttgtagggag cccctggaga gcgagggccc agtggtcttg ctggccccaa gggagccaac 18661 ggtgaccctg gccgtcctgg agaacctggc cttcctggag cccgggtaag tagcagagct 18721 gctgttgccc ttggcttcag accctcaggc ccttcctggc tggctccttc cagccctgca 18781 ctgccaggat tgggaggtcc tggggccggc tcctgacccc accctcttct 
ctctcctgaa 18841 caaagggtct cactggccgc cctggtgatg ctggtcctca aggcaaagtt ggcccttctg 18901 taagtctatc ctctgagggc tgctaggagg gtggggggat ctccctgggg aagcaaggga 18961 aaagagagat ggagtttggg ttagggaggc ctgaagtact gtgaattttg agaattgtga 19021 cgagggggta gatggtaggc actggggcca gatgtaacct gtgcagtagc tgtgagcact 19081 gaaaatgcca ccccagtatg cattcggggc ttatccttgg gggaatgatg acatcgtgtg 19141 tgcactttct ggggcagctt tctaagctca gcggtgtctt gttgtagatg ggcccatggg 19201 tgtgatgtgg tcaatcctag atgctgagca tgtgtggctg gtgccatgtc ctggcctgcc 19261 atgtaggccc ttagtggatg ttgggtggat ggatgtggtc agagtgtcta tgttctgaga 19321 atggtgttct gtctttcagg gagcccctgg tgaagatggt cgtcctggac ctccaggtcc 19381 tcagggggct cgtgggcagc ctggtgtcat gggtttccct ggccccaaag gtgccaacgt 19441 aagtaataat ttgctcttct atttccttcc atgtggtgct acctacctcc ctgccctctt 19501 ggggaaaggg ctgggtcctg agtagagttt acccagggac agtgatgagt ggggctcctg 19561 tgccatgggt ggcagtgggg gtctgtatgt gatttgggga aaatccatgg ccccacagag 19621 cctcggggca ttgcggccat aattgttcca tgtggcagtg ccagcaggct ggttgccatt 19681 atggcccctg aacagaagag aagggctgat actttgcttt atcttggctg tccatcagga 19741 tgtggcccca ggctcagtcc ctgcagcccg ctctgccccc aactccctcc caaaccatcc 19801 gcctcatggc cctgccctct ctttccttca gggtgagcct ggcaaagctg gtgagaaggg 19861 actgcctggt gctcctggtc tgagggtaag tatccttccc cgctgcccat gacttggtgg 19921 tggccgggca tctgcaggga ggacagggga acggcctccc catggcatgg tcccgggacc 19981 cctcagtatt gagtgttgat ctctgtggct agaccccatg ctggctgggc ctttgggtgt 20041 ctacacaggg agacttctgt ttgccattgg tcagcaggcc ggggagctgg ggaaggcttc 20101 catgctgaga acagctaaga aaagacgggg ccctgggaag gaagggaggg gaaggtgtgg 20161 aaatggagct cagctggggt accgtggagg tctggaaact ctgggccaga agtacctttg 20221 cccaatccta gggggactgc aagcgggaag aaaagcgtgt cattggactt ttctttttct 20281 cttctgtcta gggtcttcct ggcaaagatg gtgagacagg tgctgcagga ccccctggcc 20341 ctgctgtaag tacctgccca gcctccccag gtggccctgg gggcaggggc tgggaggggt 20401 gggggtggga gagcccatcc attaatggag ctgacagatg tgaatgtggg ctgagctgat 20461 acaccagact cactctgagc tgaggcaggg 
tgtcccagga ggctgtgtgg acccacattg 20521 gtggagagga gtgtgggtgg ctgatgggag tgcagggagg catgcatgca ctgtctgagt 20581 ggtgcaggaa gacgcctgtg ctgcccaccc tgctgacctc ccctgggccc tactagctgt 20641 ggctctcagg gtctctggaa cactggctca gctcaccttt tcttttccac tgcagggacc 20701 tgctggtgaa cgaggcgagc agggtgctcc tgggccatct gggttccagg taggtggctg 20761 gaccaggctc tctgtgtcag tcctttgcca tacccagggc tccctggaca gcagcaggca 20821 ctatcggtgg agggcccaca cctcttgcag tgtccaggca tcgagccttc cctgcacccc 20881 tggctgtcac tgctgctgct tcctttcttt gggtctgccc tatactgtgc ctccctgggg 20941 gccagggcag caaactcact cctttgctaa cgcttgtcac ttcggcttct agggacttcc 21001 tggccctcct ggtcccccag gtgaaggtgg aaaaccaggt gaccaggtga gtatggggct 21061 ctttggacct gcaacctgtt tagatgggaa ggtcttttct gatgcctagg aggcaagggc 21121 aagagggcat gaggagcctg tgaggcctgg gaatgtctgg acccatgtcc cagcctccac 21181 agatgacaca atcccatgga ggagtgatat tcagccctgc tgtggagaat tgttcagggg 21241 tctgtgatat gaagccttca ctctcacaca tctttctttg ttctccaggg tgttcccggt 21301 gaagctggag cccctggcct cgtgggtccc agggtgagta tcctgttggc caatgcgggc 21361 tgcctccttg ggcctgccct gggtcctatg ctcctgctcc tttccccacc tccctgcttc 21421 tccctggacc ttctccccca ctgctgttgg ttgatcactt cttggtgtct ctgccgcagg 21481 gtgaacgagg tttcccaggt gaacgtggct ctcccggtgc ccagggcctc cagggtcccc 21541 gtggcctccc cggcactcct ggcactgatg gtcccaaagt aagtgaggct gcatccagta 21601 ggggtcttcg tggtagcctg gagtcccact gagcaggaga gaggagcggg ctcaggagga 21661 atgaagaaca gaagtggggg gagctggaaa ggaggtctac atgggaggaa gggaaggaag 21721 aggggtttgg ggcctggtta cccaggctcc atgaacatgg gttcagggag aggtgctgtc 21781 cactacagac tccctcttac ctccctcccc agggtgcatc tggcccagca ggcccccctg 21841 gcgcacaggg ccctccaggt cttcagggaa tgcctggcga gaggggagca gctggtatcg 21901 ctgggcccaa aggcgacagg gtaagtactg aggttacagc ctcctcacca aagctgtggc 21961 tttgccaatg tcctgcccct tgtgatcgct tccgttccct tatggcacct ggtgatgaag 22021 gtttctgtta gccctttttg aggagcttaa agactccttt ccaaagctcc ctgcctttta 22081 gtgacatcct ttcccctgtt ccttcatctc acccctgctg ctcctcaccc accctgagac 22141 cacagcaaat 
tcctcttggg cagggactgg gctttcccta gcaccccagc ctgggtggga 22201 ctgagcaaac catgggggtc ctggggtgcc tggctgaggc ggctggtttt ctcttccctc 22261 agggtgacgt tggtgagaaa ggccctgagg gagcccctgg aaaggatggt ggacgagtaa 22321 gtgaatgcgg gctgctggac tgctgggcat taggatccta gccctgcacc caggagagca 22381 ggagagaggg tctgggcagt ctgccactgg ggtccctggt cctgtctctg tcggggctgg 22441 gcaactgcag ggacttctct gttaaaatgg ggccagaggg taagtgggag ctctggaggc 22501 ggtgggagca cgcaccaagg ttggcttggt gccgggccgc acgtgctcgg ctggctcagc 22561 ctgcctccct cacctctacc tgctctcccc gcagggcctg acaggtccca ttggcccccc 22621 tggcccagct ggtgctaacg gcgagaaggt gagtcccggc tcctttcctc tccacacctt 22681 gcctccctgt cacacctcct tcttatctcc tgccaaaggg gttctgtctt ctcctccctc 22741 accactgtca ccctcggcca agggctagga gtgaagaggg ggccctctca gaagtgaagc 22801 cgctggcagt gttccctgtt gggtggggca actgggctgg ggtaaacaca cattcagcag 22861 aggccctcga gagggtgcgg gtatgggctg cacagtaaca caggctgtgc agggggacct 22921 ggagccccct tcccacgagc aaggcccccc aaatgcactt tgccctctcc cactctgcct 22981 ccccaccttc ttaccccagc tcttcctccc ttccccaccc tcagggagaa gttggacctc 23041 ctggtcctgc aggaagtgct ggtgctcgtg gcgctccggt gagtgtctgc ccctctgagc 23101 ctggctctgc cgaggcccct gggaaccaga gagccaggga gtcagtgcag gccctcatgc 23161 tgcctggtgg ccctgtgtgc tgccaggcac tcggtccctc cctacccgct gggtctaggg 23221 tgggaggaga gatgggaagg gaagggggaa ggcacgtcac tcccatcatg tgttcagggt 23281 gagggctttt gggttaacag agcctctgcc tgcgttcagg actaagggct gctttcagat 23341 ccccgtctct ggggaacagg aggctgggca gggccacggg gctcttggag gggagcagaa 23401 gcaggtcagg cagcggggcc tgactctcgc catgccccct tctctcacag ggtgaacgtg 23461 gagagactgg cccccccgga ccagcgggat ttgctgggcc tcctgtgagt atctctgtcc 23521 atcctcctgg gtacctccac tcaggccagt tccacatcca gcaccctcgg gcatcggagc 23581 ttgtcaggga gggaaatgga tgctcctctc ctctcctttc ccgcctccat actaatagaa 23641 ccatcatgtc cagaccagga cacacacgca gatactcaca gagtctcccg ctcttctgga 23701 agagctccag ggttttagcc tgcccctcat tcacctgctt cctccttccc cattataggg 23761 tgctgatggc cagcctgggg ccaagggtga gcaaggagag gccggccaga aaggcgatgc 
23821 tggtgcccct ggtcctcagg gcccctctgg agcacctggg cctcaggtgg gtaacgctgc 23881 actccaagaa ttgttccctc aaggaagggc tcctggcgtg cagatgggaa ggccccagca 23941 ggctgcgcag aggatggttc gcaggcctgg gaacaccccc atgttggtag aaggagcttc 24001 catgtggcat gtgggctgtg tgggtggggt gtagggactg acagagtaca ggctggccac 24061 agccagccag aaccaagctg ctgatctcct ggggaggagg gggcggtggc aggaagagct 24121 tcccgggagg ccagacccca gaccggttct gtggttgcct gacaggcttc tcctagaaca 24181 caagtctcct gtggcagagg ggacagagct gcctgtggac gcctcttcag gctgggtttt 24241 tagtgccaag aaagctgcat cttcgaaaac ctcaggggtc cattgttggg gctcagacag 24301 aagcccaccg tcttccttgg gccactgggc ctcactgtct ccctctttcc tttccagggt 24361 cctactggag tgactggtcc taaaggagcc cgaggtgccc aaggcccccc ggtgagtgag 24421 gcctctgaca ccccaccctg cacctcacaa agaggcctgg ccccagaggc tcccatggcg 24481 gggggtgttc tgggatgccc ccgactgttt tcccagccct gcgttggtcc ccagcctaag 24541 cccacccaca gccaggtggg agagagggcg cctgtgggct gggtgcactg tggtcccgga 24601 ttccccagcc cgaggcttgt ccctcgttca gctacctgaa gtgctgactg tggaaaccgg 24661 agcagggaaa cagcctgtgc ctgcttctat gaccagaccc gggggccctt tctccctccc 24721 tggatccccc ttgcgggctt gggatccctc gccctttcca taccaggctc tgagaccacc 24781 cccgcccccc gcccatttta atctcaaacc tcctgagggc ttgaggttct caggggctct 24841 cctctcccca cacagggagc cactggattc cctggagctg ctggccgcgt tggaccccca 24901 ggctccaatg taatggatgc tcctggcatg agaggcacag gcaggcatgg aatagcctgg 24961 cccagcaccc aacaggtctg gagagctcca ggctggcctt cactcctctg agtccgttcc 25021 tgccccctgg aggagtagct aactgatcat ggggacctat cccctgaggg tggctggggg 25081 cagggttgtg gccctgctgt cagcaatggg tggctgccct gctggcttgg tgggcaggac 25141 agggctgcca tgattacctt catcctggac agatgtttct tgagcatgtt ctgtgcactt 25201 gtctgtgtgc tttaaagcaa gtccgcatgt cctgggtcgc tctcagggag tttgcatgga 25261 gggagcagtg gtgatgaact cactgggttc accagcgcca taggcagagg gagcgagtgc 25321 tgggggcaag ctcaggggtc agaggggctg ggctggagcg gctcctgagg aagggggatt 25381 gaggacaaag ggaggaggga ggagagattc caggtagatg ggagaaagca agcaggagct 25441 ggagaagttg gggctgtctg cccagagctg tctcacatgg 
tgagaaggtt gcagcggtag 25501 attggggttg ggggggttaa gctgcctgcc cttagtgtgg ggaggtagag aagcccccag 25561 agaggaaact gctgtcactg aggccacagt gactttggca gcctggatga ggaagggtga 25621 gatgagtcct cacttcccgc attttctcct tctagggcaa ccctggaccc cctggtcccc 25681 ctggtccttc tggaaaagat ggtcccaaag gtgctcgagg agacagcggc ccccctggcc 25741 gagctggtga acccggcctc caaggtcctg ctggaccccc tggcgagaag ggagagcctg 25801 gagatgacgg tccctctgta agtccctcac caggcccatg ccaaggtccc tgggagcagg 25861 gtcgtgagtg gcttctgagc tcacagagca tggggtagga gggagcaggg gccgtgggga 25921 tgccagggag cggggctgca cagacagagc tgtgctgaga ggacgaagag gctgggccac 25981 tgtcagttct catctcctgc ctctcctctc tcagggtgcc gaaggtccac caggtcccca 26041 gggtctggct ggtcagagag gcatcgtcgg tctgcctggg caacgtggtg agagaggatt 26101 ccctggcttg cctggcccat cggtgagtgt ggggtatccc tccctccatc caagctggcc 26161 ctgcctgcca aggcttctac ctccctcagc accctcagga ctgtcccttg tctgccctct 26221 cctgaagggt cagtgggccc tgggcagggg tgcttaccac ttgcactcat catccttgtc 26281 tctgtcctcc agggtgagcc cggcaagcag ggtgctcctg gagcatctgg agacagaggt 26341 cctcctggcc ccgtgggtcc tcctggcctg acgggtcctg caggtgaacc cggacgagag 26401 gtgagcagtg agaccccctg gggtggccct gattggggag aggggccctg tgagtctctg 26461 tgctgggtca gcaaggacaa gccccagtca gggcctcgga gaagggggcg gcagcgctgg 26521 ccgacaggcg aaagcctagg tacaatggga aggttgtcgg ggagagagac gggcatagag 26581 accaagggct gcttctggaa ggaggaggga aacttggtga ggaaactttg gcttcaaagt 26641 gtgagtgagt tgggcagaag aggagaggcc tgggcttctg agaggggctg ggggagcaga 26701 gggggaggtg ggcaggaagc agctctaagt gcattcttgt ttcactttgt ccagggaagc 26761 cccggtgctg atggcccccc tggcagagat ggcgctgctg gagtcaaggt gagtgtctgg 26821 tgtctgtgtg tgcagtgggt tggggaggac attgcctcgg gcctgacagg tcagctgggg 26881 gtggcaggtt ggaacaagtc tcatctcagc ctagaaggac cttctgttcc tgtctcttct 26941 ggaacattct tctctgagcc tgagacctct ctcctgacag ggtgatcgtg gtgagactgg 27001 tgctgtggga gctcctggag cccctgggcc ccctggctcc cctggccccg ctggtccaac 27061 tggcaagcaa ggagacagag gagaagctgt aagtatcctg gaattcagta aaagccgcct 27121 tcccctgcgc ggtggggctg 
aggcagtccc tgggtttccg cagtctctgg actaaggagc 27181 agtggcctca gatgcagagg aggcccccac ctgtcctggc ttttctctga cgctgcgctc 27241 actctctcct cagggtgcac aaggccccat gggaccctca ggaccagctg gagcccgggg 27301 aatccaggtg agtatccaag tgtcctgcac tgagtcccca ccagggatag gctgggaggg 27361 cagccagcct ccaggtggtt cctggcctcc agccctgtgt ttccggggat tcctcagctt 27421 gggtgggaca ggagggggct cctgtcctgg ccctgacctg actcaatcgg tgtctgtctt 27481 gttcccaggg tcctcaaggc cccagaggtg acaaaggaga ggctggagag cctggcgaga 27541 gaggcctgaa gggacaccgt ggcttcactg gtctgcaggg tctgcccggc cctcctgtga 27601 gtgtcactgc ctgcgtggga cttcccgagg cctcctgcca cacagagccc acttgagctc 27661 cctgtgctgc caggacagct tgggatcacc ctaagcagtt tctaggattt cctcagggct 27721 ggagggagga ggaagtggaa agggaatggg gctgggacat aaagctgttc ccccagctcc 27781 cagaatatag atagatatgt ctgtgctgac cgtggccttt tgcctcttcc ttctacacag 27841 ggtccttctg gagaccaagg tgcttctggt cctgctggtc cttctggccc tagagtaagt 27901 gacatggagt tggaagatgg agggggccct tcagagagtg tgggcctgtg ttcccatggg 27961 gagggaaatg ctgctgcttc tggggaagct gtgggctcag gggtcctcac tcagtaatgg 28021 gggcaggact ggctcatgtg cctatggcca gaaaagcgcc tgaggccaca atggctgtaa 28081 gacaaacatg aatcagcctc tcgctgtcag acagaacagc attttacaaa gaggagctta 28141 ggagggtagg caagccatgg agctatcctg ctggttcttg gccaaataga gaccaactta 28201 gggttccatg actgagcatg tgaagaactg ggggcggagt ggctggtgct atcaggacag 28261 ccacctaccc agccccagcg actccccagc cttccctgtg gtgaccactc tttcctcacg 28321 acctctctct cttgcagggt cctcctggcc ccgtcggtcc ctctggcaaa gatggtgcta 28381 atggaatccc tggccccatt gggcctcctg gtccccgtgg acgatcaggc gaaaccggtc 28441 ctgctgtaag tgtcctgact ccttccctgc tgtcgaggtg tccctaccat ccgggaggct 28501 tgagctcttt tttgctcagg gcctctttta gggcatcagc ctgcagctaa cagtgatggc 28561 atcctttatc ctgaggtctc ctcagaggtc acagggccca tgatcagtgc tgggaaactg 28621 aagagaaggg ctaaggaaga aatagacatg gtgctgtggt ttccttggtc ctcgcctgct 28681 acacctccgc cccacccatg gggctgggaa gagggacact ctagtacatt ctagcaaatg 28741 gggatggaca tggaggggca ctttcacaca atcctggctg atctctctgt ttcctgctgc 28801 
agggtcctcc tggaaatcct gggccccctg gtcctccagg tccccctggc cctggcatcg 28861 acatgtccgc ctttgctggc ttaggcccga gagagaaggg ccccgacccc ctgcagtaca 28921 tgcgggccga ccaggcagcc ggtggcctga gacagcatga cgccgaggtg gatgccacac 28981 tcaagtccct caacaaccag attgagagca tccgcagccc cgagggctcc cgcaagaacc 29041 ctgctcgcac ctgcagagac ctgaaactct gccaccctga gtggaagagt ggtaagcttg 29101 gagaacagga tcccctgccc cgggaagcag ggagtcatcc cttaggccta gcagcaaggg 29161 aggagatgcc ccctagtaca gggcagagct gggcctggaa gtttccgcca gagggttcct 29221 ctcttatttc acagcagaga agctgcagcc ctggcccctg tcctgccatg gctacctggc 29281 cgaggtgacc tcagggtgga ctccatccac cagctgggca ctgcttctgc tctctttgca 29341 tgtgttcttc cttagggctg gacttagctc atgcagatct ccctgcccct gcatcctccc 29401 aggtccccct cctttcaggc cacatgtgaa cctcatccct tgtccctgta ggcctctctg 29461 tctctttcag tcaggcctgg gtctctcaag cttttgtgtc tgtgcctgtc tgagccccca 29521 tgggtgctgc ctcttccccc tgcaggagac tactggattg accccaacca aggctgcacc 29581 ttggacgcca tgaaggtttt ctgcaacatg gagactggcg agacttgcgt ctaccccaat 29641 ccagcaaacg ttcccaagaa gaactggtgg agcagcaaga gcaaggagaa gaaacacatc 29701 tggtttggag aaaccatcaa tggtggcttc catgtgagta cctgggtgcc ctagatgatg 29761 agcagagatg gctcctcaaa ctctttcttt tctttctccc tggaagcttt tagcaccttc 29821 cccatatttt cctccagttt tctgttgggc ttgagaggag ggaaagagga ggaaaagtat 29881 tttttcccca cgtggaggtg ggaaaagagg tcctctgagc ttgctccact cctggaagca 29941 aaaatgtcca actagctccc tgctgcccca gtacccttga ggtccttgaa ccatgaactc 30001 ttggcagccc ctacagcccc tggtcccatt gaatgccagc tcccaggcct cacactgccg 30061 ctctctgccc caacagttca gctatggaga tgacaatctg gctcccaaca ctgccaacgt 30121 ccagatgacc ttcctacgcc tgctgtccac ggaaggctcc cagaacatca cctaccactg 30181 caagaacagc attgcctatc tggacgaagc agctggcaac ctcaagaagg ccctgctcat 30241 ccagggctcc aatgacgtgg agatccgggc agagggcaat agcaggttca cgtacactgc 30301 cctgaaggat ggctgcacgg tgagtggggc tgccagagag aagagctgcc tgtgcccaaa 30361 ctgcctggag cagggctgag ggttggcccg cggcagctgt caggtcctaa agtgacagga 30421 tcatcagagg catgagtttg agggtcatgt agagaagata ggctgagtga 
caggtgagag 30481 agaggcacat atcattccat cttctccatt cccctggctc aggggaacaa aaccctacct 30541 ggaacccagt gactactgta gaagtgttct cgcaatgtgt acagggtgaa gaagcggtca 30601 caggttggga gctcactgtg gggagtgggg aaggagggga agggcagggt ggagaagggc 30661 cctgccgcta aggataggag ttgaagtgga gaggcctttg gcaagccaag aagaggtctc 30721 aggagccccc tcagtgtggt tcaaccttgt gggctctgat gctcgccagt ttgttcagtt 30781 ttgggcttct gggcagctgg aactgggtag caaggcatct actgaacaga gcctcctcct 30841 tttttctccc ctagaaacat accggtaagt ggggcaagac tgttatcgag taccggtcac 30901 agaagacctc acgcctcccc atcattgaca ttgcacccat ggacatagga gggcccgagc 30961 aggaattcgg tgtggacata gggccggtct gcttcttgta a // LOCUS HUMCOX5B 2593 bp DNA PRI 01-NOV-1994 DEFINITION Homo sapiens cytochrome c oxidase subunit Vb (COX5B) gene, complete cds. ACCESSION M59250 NID g180936 KEYWORDS cytochrome c oxidase subunit Vb. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2593) AUTHORS Lomax,M.I., Hsieh,C.L., Darras,B.T. and Francke,U. 
TITLE Structure of the human cytochrome c oxidase subunit Vb gene and chromosomal mapping of the coding gene and of seven pseudogenes JOURNAL Genomics 10 (1), 1-9 (1991) MEDLINE 91257815 FEATURES Location/Qualifiers source 1..2593 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2cen-q13" mRNA join(44..146,1023..1096,1297..1396,1949..2144) /gene="COX5B" /note="G00-127-530" gene join(44..146,1023..1096,1297..1396,1949..2061) /gene="COX5B" CDS join(44..146,1023..1096,1297..1396,1949..2061) /partial /gene="COX5B" /codon_start=1 /db_xref="GDB:G00-127-530" /db_xref="PID:g180937" /translation="MASRLLRGAGTLAAQALRARGPSGAAAMRSMASGGGVPTDEEQA TGLEREIMLAAKKGLDPYNVLAPKGASGTREDPNLVPSISNKRIVGCICEEDNTSVVW FWLHKGQAQRCPRCGAHYKLVPQQLAH" exon 44..146 /partial /gene="COX5B" /note="G00-127-530" /number=2 exon 1023..1096 /gene="COX5B" /note="G00-127-530" /number=3 exon 1297..1396 /gene="COX5B" /note="G00-127-530" /number=4 exon 1949..2144 /gene="COX5B" /note="G00-127-530" /number=5 BASE COUNT 615 a 607 c 685 g 686 t ORIGIN Chromosome 2. 
1 ctgcagcttg ttcccggaag ttttgctgct agtcgcggac gcaatggctt caaggttact 61 tcgcggagct ggaacgctgg ccgcgcaggc cctgagggct cgcggcccca gtggcgcggc 121 cgcgatgcgc tccatggcat ctggaggtac tcgggtctcc gggcgtgcca gggaccagag 181 tgttgccctc ccagggtggt cccagggcgg caaagcggcg cggctcgtgc agcttctcga 241 ggtcccagtg gccgctttac ggtccccagt gcctcaggct ctgcaggcat ctccctgtaa 301 ttctggaccg ctgctcctgc cgctccccga actcactccg ctgcgaaagt atcctaaacg 361 gaggtgccgg gtgaccttgg gagggaccgg ggctgccacc gggatgggga ggggtccggc 421 ctcccttcaa acctgcgccc acctcaagca gagtgggttc tacatgcttt tagacaaatg 481 tcgacaaatt tgcctcggtg gttggagaaa gaaaagctca taggccgggc gcggtggctc 541 acaactgtaa tcccagcact ttgggaggcc gaggcggaca gatccctgag gtcaggagct 601 caagaccagc ctggccaaca tggtaaaacc ccgtttctac taaaaataca aaaattagcc 661 gggcgtggtg gcgcgcgcct gtagtcccag ctactcggga ggctgaggca ggagaatcgc 721 ttaaacccgg gaggcggagg ttcccgtgag ccaacatcgc gccattgcac tccagcctgg 781 gcaacaagag caaaactccg gctcaaaaaa gaaaaaaaaa gctcccccga gtgctgccgc 841 ttgtgtggat gggtacttgg tggttcttag gggaccatgg atatgagtag cctttaggag 901 cttgtgagcc cgctaaaact tatacagaag tttcggggca ccattttcct tgatcatttc 961 tgtttgtagt ttttctatca gtcatttcag tcagcgtcat aattcacgtt atcttccttt 1021 aggtggtgtt cccactgatg aagagcaggc gactgggttg gagagggaga tcatgctggc 1081 tgcaaagaag ggactggtaa gagaaactcc cttctgtctt ctgtgtaact tatggccttg 1141 gatgtgttca tagtggtctc ctctctggga gtatttgata caggaaactt ggcttgtagg 1201 ttcagtcccc tgagcttctg gaaggtaggg cttatttggc ctaatggttc aattcttgtt 1261 ttttttgttt tgttttgttt tttgtttttt cttcaggacc catacaatgt actggcccca 1321 aagggagctt caggcaccag ggaagaccct aatttagtcc cctccatctc caacaagaga 1381 atagtaggct gcatctgtaa gtacctcacc tctatttttt atccacttgc ttaatatatc 1441 ctacaatagt gtgtaagctg cctcaaatct tcagtgtgtg agtgcatgtt ggtaagtttg 1501 tctaagggtc ttgacactct caagcctcat tatgcctgat agttcatcct tactggaaag 1561 aagcgcagca cagcggtaag actggctcac tgggagtgcg gcatgaagga gtacccaccc 1621 agcaaatatt tattgttata ctgcttctat gccaggcatc attttagaca ctagggatac 1681 ataccagaac ggaccctgct 
ttgcgttcta cataggcaag ggaattgtta gaatttacag 1741 tgaccttgat acaaggtcag tttactcata ggtgaactca gagcctcaat ttcttcatct 1801 gaagaatggg aggagggctt gaactgatct cttaagatat caaccatagt cttacttgtg 1861 tatcagagat gtctgaggaa aagaaattca tgttgaaagt ctccctttct aggttggatg 1921 accataaacc accttttttc ccttttaggt gaagaggaca ataccagcgt cgtctggttt 1981 tggctgcaca aagggcaggc ccagcgatgc ccccgctgtg gagcccatta caagctggtg 2041 ccccagcagc tggcacactg agcacctgca ctaaattact caaaatgtgc tgtaaagttt 2101 cttctttcca gtaaagacta gccattgcat tggctccttc tcccatagat ggctggtctt 2161 atttcttacc cgtattcttt ggtaggcatg gaatatgctt attttgggaa aagctgtctg 2221 ttaatgctag cttgccatcc acttactgaa agtgtataac cagtgtatag tgcttagatt 2281 aataataaga atagatcgac aacccgtaat gcaatgaatg ggaccacctg gtatgagaga 2341 aaggggtggg ctgaggagtc aggctgacag gacttaaaat attggctcca tcatttggct 2401 ctatctctgt ggacatgttc tctgggggtc aagtacaact acaaaaaggt aagtacctta 2461 caaggtgctt aatatgtaaa gagctgagtg gatagtaggt cctcacagta tcctcataga 2521 gacagggtgg ccttgtgaag agagttttgg ggattttgaa gtcttgaggg acccaggtgc 2581 aaagtaagaa ttc // LOCUS HUMCP21OH 4042 bp DNA PRI 01-NOV-1994 DEFINITION Human 21-hydroxylase B gene, complete cds. ACCESSION M26856 X05448 NID g180963 KEYWORDS 21-hydroxylase. SOURCE Human whole blood, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4042) AUTHORS Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C., Porter,R.R. and Campbell,R.D. TITLE Molecular characterization of the HLA-linked steroid 21-hydroxylase B gene from an individual with congenital adrenal hyperplasia JOURNAL EMBO J. 
6 (6), 1653-1661 (1987) MEDLINE 87275858 FEATURES Location/Qualifiers source 1..4042 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p21.3" gene 745..946 /gene="CYP21" CDS join(745..946,1044..1133,1416..1570,1678..1779,1868..1969, 2071..2157,2327..2527,2728..2906,2990..3093,3191..3456) /note="21-hydroxylase B" /codon_start=1 /db_xref="PID:g180964" /translation="MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPD LPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVS RNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVA IEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRF FPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQ LLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSR VPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGA HLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAF TLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ" exon <745..946 /gene="CYP21" /note="21-hydroxylase B; G00-120-605" /number=1 intron 947..1043 /note="CYP21P intron A" exon 1044..1133 /number=2 intron 1134..1415 /note="CYP21P intron B" exon 1416..1570 /number=3 intron 1571..1677 /note="CYP21P intron C" exon 1678..1779 /number=4 intron 1780..1867 /note="CYP21P intron D" exon 1868..1969 /number=5 intron 1970..2070 /note="CYP21P intron E" exon 2071..2157 /number=6 intron 2158..2326 /note="CYP21P intron F" exon 2327..2527 /number=7 intron 2528..2727 /note="CYP21P intron G" exon 2728..2906 /number=8 intron 2907..2989 /note="CYP21P intron H" exon 2990..3093 /number=9 intron 3094..3190 /note="CYP21P intron K" exon 3191..>3456 /note="21-hydroxylase B" /number=10 polyA_signal 3942..3947 BASE COUNT 779 a 1261 c 1154 g 848 t ORIGIN Chromosome 6p21.3. 
1 tcgacagcta gatttccagg ctggaatcct gccctccaca acatgcgaac aatacccgtg 61 ttgcatatag agcatggctg tgaagagttg agtgagtgcc cacaaagcac ttagagcagt 121 gtctggtaca tgctattact ccgcagcggg aaaccacttc ctcctttgtc ttctgggcac 181 ttttgtgagt gaaaggaggc actaataaca atcacactgg gatacctgta tatactggaa 241 tgccccaggc aaaccaggct taaactgtat tactctatct gtagcttaaa ctaacaaaca 301 acccacacaa atcacatttt gttcttcagg cgattcagga aggcctatta ggcagggact 361 gccattttct ctctgagaca aacatcatgc cagtaaactg gcccacggtg gggtggcaga 421 gggagagggc ccaggtgggg gcggacacta ttgcctgcac aggtgatgtg gaaccagaaa 481 gctgactctg gatgcaggaa aaaggtcagg gttgcatttc ccttccttgc ttcttgatgg 541 gtgatcaatt tttttgaaat acggacgtcc caaggccaat gagactggtg tcattccaga 601 aaagggccac tctgtgggcg ggtcggtggg agggtacctg aaggtggggt caagggaggc 661 cccaaaacag tctacacagc aggagggatg gctggggctc ttgagctata agtggcacct 721 cagggccctg acgggcgtct cgccatgctg ctcctgggcc tgctgctgct gctgcccctg 781 ctggctggcg cccgcctgct gtggaactgg tggaagctcc ggagcctcca cctcccgcct 841 cttgccccgg gcttcttgca cttgctgcag cccgacctcc caatctatct gcttggcctg 901 actcagaaat tcgggcccat ctacaggctc caccttgggc tgcaaggtga gaggctgatc 961 tcgctctggc cctcaccata ggagggggcg gaggtgacgg agagggtcct ctctccgctg 1021 acgctgcttt ggctgtctcc cagatgtggt ggtgctgaac tccaagagga ccattgagga 1081 agccatggtc aaaaagtggg cagactttgc tggcagacct gagccactta cctgtaaggg 1141 ctgggggcat tttttctttc ttaaaaaaat ttttttttaa gagatgggtt cttgctatgc 1201 tgcccaggct ggtcttaaat tcctagtctc aaatgatcct cccacctcag cctcaagtgt 1261 gagccacctt tggggcatcc ccaatccagg tccctggaag ctcttggggg ggcatatctg 1321 gtggggagaa agcaggggtt ggggaggccg aagaaggtca ggccctcagc tgccttcatc 1381 agttcccacc ctccagcccc cacctcctcc tgcagacaag ctggtgtcta ggaactaccc 1441 ggacctgtcc ttgggagact actccctgct ctggaaagcc cacaagaagc tcacccgctc 1501 agccctgctg ctgggcatcc gtgactccat ggagccagtg gtggagcagc tgacccagga 1561 gttctgtgag gtaaggctgg gctcctgagg ccacctcggg tcagccttgc ctctcacagt 1621 agcccccgcc ctgcccgctg cacagcggcc tgctgaactc acactgtttc tccacagcgc 1681 atgagagccc agcccggcac 
ccctgtggcc attgaggagg aattctctct cctcacctgc 1741 agcatcatct gttacctcac cttcggagac aagatcaagg tgcctcacag cccctcaggc 1801 ccacccccag cccctccctg agcctctcct tgtcctgaac tgaaagtact ccctcctttt 1861 ctggcaggac gacaacttaa tgcctgccta ttacaaatgt atccaggagg tgttaaaaac 1921 ctggagccac tggtccatcc aaattgtgga cgtgattccc tttctcaggg tgaggacctg 1981 gagcctagac acccctgggt tgtaggggag aggctggggt ggagggagag gctccttccc 2041 acagctgcat tctcatgctt cctgccgcag ttcttcccca atccaggtct ccggaggctg 2101 aagcaggcca tagagaagag ggatcacatc gtggagatgc agctgaggca gcacaaggtg 2161 gggactgtac gtggacggcc tcccctcggc ccacagccag tgatgctacc ggcctcagca 2221 ttgctatgag gcgggttctt ttgcataccc cagttatggg cctgttgcca ctctgtactc 2281 ctctccccag gccagccgct cagcccgctc ctttcaccct ctgcaggaga gcctcgtggc 2341 aggccagtgg agggacatga tggactacat gctccaaggg gtggcgcagc cgagcatgga 2401 agagggctct ggacagctcc tggaagggca cgtgcacatg gctgcagtgg acctcctgat 2461 cggtggcact gagaccacag caaacaccct ctcctgggcc gtggtttttt tgcttcacca 2521 ccctgaggtg cgtcctgggg acaagcaaaa ggctccttcc cagcaacctg gccagggcgg 2581 tgggcaccct cactcagctc tgagcactgt gcggctgggg ctgtgcttgc ctcaccggca 2641 ctcaggctca ctgggttgct gagggagcgg ctggaggctg ggcagctgtg ggctgctggg 2701 gcaggactcc acccgatcat tccccagatt cagcagcgac tgcaggagga gctagaccac 2761 gaactgggcc ctggtgcctc cagctcccgg gtcccctaca aggaccgtgc acggctgccc 2821 ttgctcaatg ccaccatcgc cgaggtgctg cgcctgcggc ccgttgtgcc cttagccttg 2881 ccccaccgca ccacacggcc cagcaggtga ctcccgaggg ttggggatga gtgaggaaag 2941 cccgagccca gggaggtcct ggccagcctc taactccagc ccccttcagc atctccggct 3001 acgacatccc tgagggcaca gtcatcattc cgaacctcca aggcgcccac ctggatgaga 3061 cggtctggga gaggccacat gagttctggc ctggtatgtg gggggccggg ggcctgccgt 3121 gaaaatgtgg tggaggctgg tccccgctgc cgctgaacgc ctccccaccc acctgtccac 3181 ccgcccgcag atcgcttcct ggagccaggc aagaactcca gagctctggc cttcggctgc 3241 ggtgcccgcg tgtgcctggg cgagccgctg gcgcgcctgg agctcttcgt ggtgctgacc 3301 cgactgctgc aggccttcac gctgctgccc tccggggacg ccctgccctc cctgcagccc 3361 ctgccccact gcagtgtcat cctcaagatg 
cagcctttcc aagtgcggct gcagccccgg 3421 gggatggggg cccacagccc gggccagaac cagtgatggg gcaggaccga tgccagccgg 3481 gtacctcagt ttctccttta ttgctcctgt acgaacccct cccctccccc ctgtaaacac 3541 agtgctgcga gatcgctggc agagaaggct tcctccagcg gctgggtggt gaaggaccct 3601 ggctcttctc tcggggcgac ccctcagtgc tcggcagtca tactggggtg cgagagaggt 3661 gggcagcagc tcagcctccc cccgctgggg agcgaaagtt tcttggtctc agcttcattt 3721 ccgtgaaggg caccgagaac tcgaagccct tccagtggta ccagctcact ccctgggaaa 3781 ggggttgtca agagagagtc aaagccggat gtcccatctg ctcttcccgt tccccttaag 3841 gaggtagctc ccagcactca accaacctcc ccgcagagct cccttcctga ccctccgctg 3901 cagaggattg aggcttaatt ctgagctggc cctttccagc caataaatca actccagctc 3961 cctctgcgag gctggcatga ttgttccatt tcacccagcc actcagtccc ttgcctgtta 4021 cactgtgggg ctgaaaccta gg // LOCUS HUMCRPGA 2480 bp DNA PRI 01-NOV-1994 DEFINITION Human C-reactive protein gene, complete cds. ACCESSION M11725 NID g181067 KEYWORDS C-reactive protein. SOURCE Human fetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2480) AUTHORS Lei,K.J., Liu,T., Zon,G., Soravia,E., Liu,T.Y. and Goldman,N.D. TITLE Genomic DNA sequence for human C-reactive protein JOURNAL J. Biol. Chem. 
260 (24), 13377-13383 (1985) MEDLINE 86033784 FEATURES Location/Qualifiers source 1..2480 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q23" gene 230..290 /gene="CRP" CDS join(230..290,569..1182) /note="C-reactive protein" /codon_start=1 /db_xref="PID:g181068" /translation="MEKLLCFLVLTSLSHAFGQTDMSRKAFVFPKESDTSYVSLKAPL TKPLKAFTVCLHFYTELSSTRGYSIFSYATKRQDNEILIFWSKDIGYSFTVGGSEILF EVPEVTVAPVHICTSWESASGIVEFWVDGKPRVRKSLKKGYTVGAEASIILGQEQDSF GGNFEGSQSLVGDIGNVNMWDFVLSPDEINTIYLGGPFSPNVLNWRALKYEVQGEVFT KPQLWP" exon <230..290 /gene="CRP" /note="C-reactive protein; G00-119-071" /number=1 intron 291..568 /note="intron A" exon 569..>1182 /note="C-reactive protein" /number=2 BASE COUNT 621 a 574 c 620 g 665 t ORIGIN Chromosome 1q12-q23. 1 tttgcttccc ctcttcccga agctctgaca cctgccccaa caagcaatgt tggaaaatta 61 tttacatagt ggcgcaaact cccttactgc tttggatata aatccaggca ggaggaggta 121 gctctaaggc aagagatctg ggacttctag cccctgaact ttcagccgaa tacatctttt 181 ccaaaggagt gaattcaggc ccttgtatca ctggcagcag gacgtgacca tggagaagct 241 gttgtgtttc ttggtcttga ccagcctctc tcatgctttt ggccagacag gtaagggcca 301 ccccaggcta tgggagagtt ttgatctgag gtatgggggt ggggtctaag actgcatgaa 361 cagtctcaaa aaaaaaaaaa aaagactgta tgaacagaac agtggagcat ccttcatggt 421 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgg tgtgtaactg gagaaggggt cagtctgttt 481 ctcaatctta aattctatac gtaagtgagg ggatagatct gtgtgatctg agaaacctct 541 cacatttgct tgtttttctg gctcacagac atgtcgagga aggcttttgt gtttcccaaa 601 gagtcggata cttcctatgt atccctcaaa gcaccgttaa cgaagcctct caaagccttc 661 actgtgtgcc tccacttcta cacggaactg tcctcgaccc gtgggtacag tattttctcg 721 tatgccacca agagacaaga caatgagatt ctcatatttt ggtctaagga tataggatac 781 agttttacag tgggtgggtc tgaaatatta ttcgaggttc ctgaagtcac agtagctcca 841 gtacacattt gtacaagctg ggagtccgcc tcagggatcg tggagttctg ggtagatggg 901 aagcccaggg tgaggaagag tctgaagaag ggatacactg tgggggcaga agcaagcatc 961 atcttggggc aggagcagga ttccttcggt gggaactttg aaggaagcca gtccctggtg 1021 ggagacattg gaaatgtgaa catgtgggac tttgtgctgt caccagatga 
gattaacacc 1081 atctatcttg gcgggccctt cagtcctaat gtcctgaact ggcgggcact gaagtatgaa 1141 gtgcaaggcg aagtgttcac caaaccccag ctgtggccct gaggccagct gtgggtcctg 1201 aaggtacctc ccggtttttt acaccgcatg ggccccacgt ctctgtctct ggtacctccc 1261 gcttttttac actgcatggt tcccacgtct ctgtctctgg gcctttgttc ccctatatgc 1321 attgaggcct gctccaccct cctcagcgcc tgagaatgga ggtaaagtgt ctggtctggg 1381 agctcgttaa ctatgctggg aaatggtcca aaagaatcag aatttgaggt gttttgtttt 1441 catttttatt tcaagttgga cagatcttgg agataatttc ttacctcaca tagatgagaa 1501 aactaacacc cagaaaggag aaatgatgtt ataaaaaact cataaggcaa gagctgagaa 1561 ggaagcgctg atcttctatt taattcccca cccatgaccc ccagaaagca ggagcattgc 1621 ccacattcac agggctcttc agtctcagaa tcaggacact ggccaggtgt ctggtttggg 1681 tccagagtgc tcatcatcat gtcatagaac tgctgggccc aggtctcctg aaatgggaag 1741 cccagcaata ccacgcagtc cctccacttt ctcaaagcac actggaaagg ccattagaat 1801 tgccccagca gagcagatct gctttttttc cagagcaaaa tgaagcacta ggtataaata 1861 tgttgttact gccaagaact taaatgactg gtttttgttt gcttgcagtg ctttcttaat 1921 tttatggctc ttctgggaaa ctcctcccct tttccacacg aaccttgtgg ggctgtgaat 1981 tctttcttca tccccgcatt cccaatatac ccaggccaca agagtggacg tgaaccacag 2041 ggtgtcctgt cagaggagcc catctcccat ctccccagct ccctatctgg aggatagttg 2101 gatagttacg tgttcctagc aggaccaact acagtcttcc caaggattga gttatggact 2161 ttgggagtga gacatcttct tgctgctgga tttccaagct gagaggacgt gaacctggga 2221 ccaccagtag ccatcttgtt tgccacatgg agagagactg tgaggacaga agccaaactg 2281 gaagtggagg agccaaggga ttgacaaaca acagagcctt gaccacgtgg agtctctgaa 2341 tcagccttgt ctggaaccag atctacacct ggactgccca ggtctataag ccaataaagc 2401 ccctgtttac ttgagtgagt ccaagctgtt ttctgatagt tgctttagaa gttgtgacta 2461 acttctctat gacctttgaa // LOCUS HUMCRYABA 4206 bp DNA PRI 01-NOV-1994 DEFINITION Human alpha-B-crystallin gene, 5' end. ACCESSION M28638 NID g181075 KEYWORDS alpha-crystallin; crystallin. SOURCE Human DNA, clones 730 and cp8. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4206) AUTHORS Dubin,R.A., Ally,A.H., Chung,S. and Piatigorsky,J. TITLE Human alpha B-crystallin gene and preferential promoter function in lens JOURNAL Genomics 7 (4), 594-601 (1990) MEDLINE 90353958 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Dubin, 04-OCT-1989. FEATURES Location/Qualifiers source 1..4206 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q22.3-q23.1" TATA_signal 924..930 prim_transcript 949..>4040 /note="A-B2-cry mRNA and introns" exon <999..1199 /gene="CRYA2" /note="alpha-B2-crystallin; G00-119-805" /number=1 gene 999..1199 /gene="CRYA2" CDS join(999..1199,2274..2396,3754..3957) /note="alpha-B2-crystallin" /codon_start=1 /db_xref="PID:g181076" /translation="MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFPTSTSL SPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHG KHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIP ITREEKPAVTAAPKK" intron 1200..2273 /note="A-B2-cry, intron A" exon 2274..2396 /number=2 intron 2397..3753 /note="A-B2-cry, intron B" exon 3754..>3957 /note="alpha-B2-crystallin" /number=3 polyA_signal 4075..4080 BASE COUNT 1028 a 1041 c 985 g 1152 t ORIGIN 1 gtcgacacca cccaaaatag tgccgagcct cttggggggg gaggggctgg gagtgggggc 61 cctgagtgag agcaacgagg gtgtgaccag cgccgcccgg acccctagtc ccctcccccg 121 cacactcttc agctgtcgca gggggcctga gaggacagct gagggtcctg gctgggaacg 181 agctggggag ggggagctgg tggtgcctgg ggcatgaaga ggcctcgctg agaccctcac 241 aaacggtttg cacgtttcca cacctcattt tctcctcttc ggtggcaggc actgtgcacc 301 caattcctaa agcactcctg gatttaatgt tctgagagcc acatagaacg aaagatgcaa 361 gaaatctgtt tgctcttttt tcagggggtg gggtctttct gcccagatgt gggatcctct 421 cctaaaccca ggtcaaccca gggcacgagg cagatggctg gtgctgacat gttgaccatc 481 actgctctct tccaaggact cacaaagagt taatgtccct ggggctcagc ctaggaagat 541 tccagtccct gcccaggccc aagatagttg ctggcctgat tcccctggca 
ttcaggactg 601 gaaaggagga ggaggggcac actacgccgg ctcccatcct ccccccaccc cgcgtgcctg 661 cttgggattc ctgactctgt accagcttca gagaacaggg gtgggggtgg gtgccattgg 721 gtgtggacag aaagctagtg aaacaagacc atgacaagtc actggccggc tcagacgtgt 781 ttgtgtctct cttttcttag ctcagtgagt actgggtatg tgtcacattg ccaaatcccg 841 gatcacaagt ctccatgaac tgctggtgag ctaggataat aaaacccctg acatcaccat 901 tccagaagct tcacaagact gcatatataa ggggctggct gtagctgcag ctgaaggagc 961 tgaccagcca gctgacccct cacactcacc tagccaccat ggacatcgcc atccaccacc 1021 cctggatccg ccgccccttc tttcctttcc actcccccag ccgcctcttt gaccagttct 1081 tcggagagca cctgttggag tctgatcttt tcccgacgtc tacttccctg agtcccttct 1141 accttcggcc accctccttc ctgcgggcac ccagctggtt tgacactgga ctctcagagg 1201 tgagtctccc cacagctagg acgggagagt ccttactgga acctcctgga aacttctcca 1261 tccattttcc tttcctaccc tgcctaaacc attttaggca catgtgtgtc caaatgtgaa 1321 gaaaaatgag gaggttgcta gtgccttcct cccccatcac ctgtttctat ttgatagtcc 1381 tctgtatccc atttattaca ttttttcatg cactgtcaag tttatcctcc gtcccctaac 1441 ttctctacag gatacccctt tctggtttgg ttcatgacaa tctgcaggga aagagctgcc 1501 ttcaaactcc tttgcttatc tcttccaaca ccttggactc ttgaccgatt ttaccatctc 1561 aggtttcaga gccaggagag agccctgcct catcctgagc tgttcatccc catgggtatt 1621 ttctgccttt ctattccctc ttctatgatt ttctgggttt ctcagggcta cgacagggcg 1681 ctggcctggg tccaatcaag ccctacgagg aaacaatata gggacgccca tttgtcctaa 1741 gagggtggaa gaacagggtg aacaaataag gttgacagag ctgtcacaga taacactctg 1801 gtttaaaaat attcaagtgt gagtaaacag gagctgagtg ggcaagggct ttggaaggac 1861 aagcaggacc agcagaacat tccagattgg gtgggtggaa aactggcaaa gagacctgag 1921 ccagaagaag aggcctttgt ctcacagaca aaccacaaag ccaggcattg gagtcagaga 1981 ggcagcagat gccaggcttg cacccatcct tgcgactggt cccctgggtg atctgtcttc 2041 ttctctgtcc ctgtaaataa agtttgggtc tgatcaccat gagccttagg tatcactgtg 2101 gtggctccct gaagcagaca gctatgttta tttaaaaagg agatttttta agcagagaag 2161 agaaggatga attacccgga cagaaagcag ctctgcagaa taagacagca cctgtgtaat 2221 cagtattttt gccctctttc tcccatccca ttcccttacc ttgctatttc tagatgcgcc 2281 
tggagaagga caggttctct gtcaacctgg atgtgaagca cttctcccca gaggaactca 2341 aagttaaggt gttgggagat gtgattgagg tgcatggaaa acatgaagag cgccaggtat 2401 gtagcttgtt tttttgtttt ctgctcattc attcagtgat actgtaatag tccaggtagt 2461 gctatcagct ttggaggctg gctacattcc agtcccaagc cataacagtc gggatcaggg 2521 gttacaaatc aatgtctaga agactaagtt aggatagaca tattgctgtt gttactatta 2581 tggccagaga tgtggccttt gatttgatcg ccttagatgg gatgatggga tgctgatgcc 2641 ccatttaagc cagtggttct gaatctgggc cacattagaa tcaccagggg aactttcaaa 2701 aacctaatgc tcgggcatcc tccagaccaa ttagcatatg tgctgccgaa gcgagcacta 2761 ctccagacca attaaatcag catttttaag ggtgggaccc aggcatcagc aatttttaag 2821 gtaattctaa tctacagtca aggttgagaa ccactgatta ggtatagggc tgtcagacac 2881 ctagttgctt tgcataatta cattaactac aggtacccta aaagcacttg agttgtgact 2941 tctcttttag ctgtgcaaga atccgtgtct cttctttagc ccatcttaat gctgaactac 3001 ttggtttgtc taaatttcag agctgtgctc agtctttaat cccctacagc ccatgtggta 3061 atcagttaac gagagcctgt ttggctacat gcttgagagt cagcaggcat acgggttaag 3121 gtcatctact ctttggggga gttctgacaa atggaacagc ttgttatgac tttataagag 3181 ggctttaaaa ttgcttctca ccatttaacg atagctcaga acctgtgcgt caaccagtac 3241 agtttgtcct cagtaatgtc ctcaggctgt ttcaattttg cttatatgat ttaggtttgg 3301 gtcatagtct ccttggatgg agtcattttt tttttttttt aatttcagca gcagtcctat 3361 tgttctggaa ccttctggga cattcctgaa gagtcaggac aatttcaggg cttcctcagg 3421 gactcagatt ctaaatgaga ttccaaattc tgtaggccca gccaacattg atctaaacct 3481 ttgggaaata cccctaaaca tatctatgcc tcagggtttg aaaaacaatg aagtgttgga 3541 ctgtttcaga cttctcagat tctcactggt aggagtgact acctaggcaa tttcatctta 3601 gctgcaaccc tgaaacgaag ctctatttat ttttcctatg ttgtcatggc atttggtctc 3661 acctaagggg aaatcaggat gcctgagttc tgggcaggtg ataatagttc ctgttcttat 3721 ctctctgcct ctttcctcat tcttttgggt taggatgaac atggtttcat ctccagggag 3781 ttccacagga aataccggat cccagctgat gtagaccctc tcaccattac ttcatccctg 3841 tcatctgatg gggtcctcac tgtgaatgga ccaaggaaac aggtctctgg ccctgagcgc 3901 accattccca tcacccgtga agagaagcct gctgtcaccg cagcccccaa gaaatagatg 3961 ccctttcttg 
aattgcattt tttaaaacaa gaaagtttcc ccaccagtga atgaaagtct 4021 tgtgactagt gctgaagctt attaatgcta agggcaggcc caaattatca agctaataaa 4081 atatcattca gcaacagata actgtcttgt gtttgaatat tccacacact tttaaataaa 4141 tatacagata ccacagatct atttatgatt gcattatgat ttagagggct ccaaggattt 4201 tagagt // LOCUS HUMCSPA 4770 bp DNA PRI 25-AUG-1995 DEFINITION Human cytotoxic serine proteinase gene, complete cds. ACCESSION M72150 NID g957196 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4770) AUTHORS Caputo,T. and Rowe,P.B. JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..4770 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="D-SECT2" /tissue_type="blood" misc_feature 1..875 /note="5' flanking region" TATA_signal 846..849 /note="TAAA Box" exon 876..960 /number=1 mRNA join(876..960,2107..2254,2772..2907,3113..3370,3773..4029) /product="cytotoxic serine proteinase" 5'UTR 876..905 CDS join(906..960,2107..2254,2772..2907,3113..3370,3773..3916) /codon_start=1 /product="cytotoxic serine proteinase" /db_xref="PID:g181157" /translation="MQPFLLLLAFLLTPGAGTEEIIGGHEAKPHSRPYMAFVQFLQEK SRKRCGGILVRKDFVLTAAHCQGSSINVTLGAHNIKEQERTQQFIPVKRPIPHPAYNP KNFSNDIMLLQLERKAKWTTAVRPLRLPSSKAQVKPGQLCSVAGWGYVSMSTLATTLQ EVLLTVQKDCQCERLFHGNYSRATEICVGDPKKTQTGFKGDSGGPLVCKDVAQGILSY GNKKGTPPGVYIKVSHFLPWIKRTMKRL" intron 961..2106 /number=1 exon 2107..2254 /number=2 intron 2255..2771 /number=2 exon 2772..2907 /number=3 intron 2908..3112 /number=3 exon 3113..3370 /number=4 intron 3371..3772 /number=4 exon 3773..4029 /number=5 3'UTR 3917..4029 polyA_signal 4012..4017 misc_feature 4030..4770 /note="3' flanking region" BASE COUNT 1208 a 1328 c 1076 g 1158 t ORIGIN 1 aaccaggatg gtctcgatct cctgacctcg tgatccaccc gtcttggcct cccaaagtgc 61 taggattaca ggcatgagcc accgcgcccg gccttagtga acaattatta tcaggatgat 121 cagcatccat ttttccatca cctggcaaca acaactagca tttctcaagg gaaatcccct 
181 cctcattcat aagcaccata gatggagtgg aacagtcccc acctcagctt aaggagggct 241 aaataaccaa agcataaagc aatcagtcca ctacacttca gtggacccaa tgatgactca 301 ttttggttca ataaatatca agtccagaac ttttgtgcag aactgtggaa taaaaaaagt 361 tatctttgtc acagaatatt tatatttaaa gaggggattc taaatatccc aaaaatatga 421 atctgagata gtggtaagat aggaatgggt tgccatttgc agtatttgtt gggtgtgtgc 481 aactctgcat actgctaagg caagtccagc ctccgtctca caccaacagg cagatttccc 541 caccacggcc cttaccatct tgtgctctaa ccccagattt tcagtctact cttgctaacc 601 atcaattctt caagggcaag agccatgtct ctcatatccc tggatctggc ttgctaaata 661 attattcatg attttatctg agtagaaaaa gagccaagaa ctgagaaaag gagacattct 721 tggttctcag tcactgagga gcttctgcag taccagcctt tctgctagct gcagccacag 781 cttcaacagc tccacctcct tgttctttat tcttgctttc accttgttac tcagcagcag 841 gggtgtaaat gtgacagtgc catgtcaaca ccaacagctc tgacctgggc agccttcctg 901 agaaaatgca gccattcctc ctcctgttgg cctttcttct gacccctggg gctgggacag 961 gtaagtgact atccctattc cagaggcctg aacccatctt ataagatacc ctgtacccat 1021 gagcactggt caggaatttt cctcaatctg agcccacctc ccattccaca cccagactta 1081 taaatctgag gctagataga cactcagcaa agatcgaaaa atgaagggtg ttccctaaaa 1141 ggtttaatgg gtgttagcct ctccctagac ctctccttta tgacctggag tgtggattgt 1201 tcttagaaag gcatttggta gggaatgtga agctaaaaaa gataagtaat tattactcta 1261 cactccaacc caggaaagag gagctcagac accaagttgc agtcatagga gttgttactg 1321 gactcagctt aagaaccact tattcagtgc ccccacccag tcacccctgc caggagggaa 1381 gcccatggtg caactgatct cagagatggc aagatgactg tgtccatagc tctcccatta 1441 ccttggctcc acctgggctt tgcgattcat ttttagttga tttctccact tccttctgcc 1501 tttgcaggcc ctcagctact ccactgacct ggtgataacc ccctctaaca tccctgaggt 1561 cctgaatccc accagcacta cccccactaa acctcagcca aaggctaatt ggaggctatt 1621 catttatgca ccaaacaaca cttactgaga acctagaatg tgttcagccc tggcacatga 1681 gaattttaga aaatccaact ccagaagctc atgggtagat cattagaaaa tggcaccaat 1741 caggaggaca gcagaggctc agaggaatca gagagactca ctaaaaagga ggagcctggg 1801 aagtcctgtc caggatcctc tggggctcta ccactgtgaa aatagatctt tcttcctcca 1861 tgtgtgtttc 
tgcagtagga tgcagaagtc aagcactgtc cattctttct gagagaaggt 1921 tctgagactt aaagttcaga aaagcttgct ctgcctgcaa ggagtactgc ctgcagagct 1981 caggaggacc cctccctcac tgtctgcagc acacaaaaca cttcaccctc aatcactgtc 2041 gtcctaagcc ttctctcctg actgcagcca tccccagcta ctttatcttc cagtctttcc 2101 tttcagagga gatcatcggg ggccatgagg ccaagcccca ctcccgcccc tacatggcct 2161 ttgttcagtt tctgcaagag aagagtcgga agaggtgtgg cggcatccta gtgagaaagg 2221 actttgtgct gacagctgct cactgccagg gaaggtaagg agcagcacca gctcacctcc 2281 tgagttcccc ctacagggac ccttgttttc tcctggggac tgcagcccag gggagcttcc 2341 agagttcctg gtaacacaag cccatgaaag ctcatcggag cagccacctg ggaataggac 2401 taagatgatg aaaggctgag aaataggaca gggtcagagg gtcagagggt cagaggtcag 2461 agcaagtggc atggtttcac acactggccc aataaggcag tagtaaaggt tgaatccatg 2521 cattaaggat caaaatgcaa actcgagcaa tttcatgata tttcctgaag gcaagagaga 2581 gcccccagaa gccctactct tcatgtcttt gtgaagtaga gggcacagtc actcagccct 2641 ggagccctcc tgtcccttca acttcccctc agctgcagcc ctaccctgcc acagctgtca 2701 tctgctcttc actgctcctg ggctctatcc cctgtgactc cacccccatc ctcactctgc 2761 tctctgtgca gctccataaa tgtcaccttg ggggcccaca atatcaagga acaggagcgg 2821 acccagcagt ttatccctgt gaaaagaccc atcccccatc cagcctataa tcctaagaac 2881 ttctccaacg acatcatgct actgcaggtg aggcacactc ctgccactct tgctcttctt 2941 ggtccagttg gttccactcc cactgggatg ccggcccttt cctcctttcc atcctgacct 3001 cttggtcagt tcctgtgcct tagaggagag ggaagattgt gcagccccat cactgtgtcg 3061 gggcccagaa ggccattgcc tgacctggac tttcttgctt cttccccacc agctggagag 3121 aaaggccaag tggaccacag ctgtgcggcc tctcaggcta cctagcagca aggcccaggt 3181 gaagccaggg cagctgtgca gtgtggctgg ctggggttat gtctcaatga gcactttagc 3241 aaccacactg caggaagtgt tgctgacagt gcagaaggac tgccagtgtg aacgtctctt 3301 ccatggcaat tacagcagag ccactgagat ttgtgtgggg gatccaaaga agacacagac 3361 cggtttcaag gttgggtttc cagcctctac ccagaaagac cacagggaga gaggaacttg 3421 agggagttct ggggtatgac agtggccaga tctttatgct ctcagcccag agcttgggca 3481 gcctgtgtcc ccctggaatc tagtctttag tccactcctt gggaggactg gggtggctgg 3541 aggggaagga tctgtcatcc 
gacagtcact tatcaccaaa gtgtccgtga gaattgcaga 3601 gaaaggcttg gtctggggag agccagaaag cacagccagg gccatgctgg agacccagga 3661 ctgagggagg tgagtgacaa ggcctgcctg gcactgatgt cacacaaaat ccataacaga 3721 gaccacccat cccagggaac actagctcag tctttcctct ctctgttcac agggggactc 3781 cggggggccc ctcgtgtgta aggacgtagc ccaaggtatt ctctcctatg gaaacaaaaa 3841 agggacacct ccaggagtct acatcaaggt ctcacacttc ctgccctgga taaagagaac 3901 aatgaagcgc ctctaacagc aggcatgaga ctaaccttcc tctgggcctg accatctctg 3961 ggacagaggc aagaatcccc aaggggtggg cagtcggggt tgcaggactg taataaatgg 4021 atctctggtg taaatatgac tgaagtctta tttattcact gaaccatctt caatatgcaa 4081 caacactgct ccaggcagcc gtctgctgtc agttctggct tttgcttctt cctttctccc 4141 ccagccctcc ctcctcccta aaccccatgc tgtacagttt tttcactaat ttctcaatca 4201 catgtctttg atactgaacc ctctgttaga gctggtttga actccattat aaacaacaat 4261 cttcaacccc tctacccttc acaaaaaaaa ataaacctct ctggacgtta ggaatttctt 4321 atcgcctcta aaccaactac ctactcccct aacaacgtag cagagaatgg agcagggcag 4381 tagcagatgg agtggaaagg aaataactcc agggactcag aaggccaggg gcatgccatg 4441 agtactcagt ttcctatttt cagggcctgt ccctccaatg ggggcaggct tctcccacag 4501 ccaggtctgc ctgcagctta ggaggttgag atgctcctct gcacctaggt ccctcttccc 4561 tgccctgggc ccagccctga gctcagtcaa ccttcttcct tcctggccgg ccttcccagc 4621 tcccaagagt ttttcaccca agtgcctggc taccagtttc tttcccttct agatcaccct 4681 gttctgaagc cagcctctct ctcatgtccc taacagagtt cccagctctg tgtgaattca 4741 cacacccttt gaaagaagca ggacactgct // LOCUS HUMCYC1A 4622 bp DNA PRI 02-NOV-1994 DEFINITION Human cytochrome c-1 gene, complete cds. ACCESSION J04444 NID g181239 KEYWORDS cytochrome; cytochrome c-1. SOURCE Human leukocyte DNA, clones EMBLIII-[401-402]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4622) AUTHORS Suzuki,H., Hosokawa,Y., Nishikimi,M. and Ozawa,T. TITLE Structural organization of the human mitochondrial cytochrome c1 gene JOURNAL J. Biol. Chem. 
264 (3), 1368-1374 (1989) MEDLINE 89109139 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by T.Ozawa, 28-OCT-1988. FEATURES Location/Qualifiers source 1..4622 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q24" prim_transcript <1..3801 /note="cytochrome c-1 mRNA and introns" gene 1407..1535 /gene="CYC1" exon <1407..1535 /gene="CYC1" /note="cytochrome c-1; G00-119-827" /number=1 CDS join(1407..1535,2138..2334,2429..2555,2641..2798, 2888..3048,3311..3411,3508..3612) /note="cytochrome c-1" /codon_start=1 /db_xref="PID:g181240" /translation="MAAAAASLRGVVLGPRGAGLPGARARGLLCSARPGQLPLRTPQA VALSSKSGLSRGRKVMLSALGMLAAGGAGLAVALHSAVSASDLELHPPSYPWSHRGLL SSLDHTSIRRGFQVYKQVCASCHSMDFVAYRHLVGVCYTEDEAKELAAEVEVQDGPNE DGEMFMRPGKLFDYFPKPYPNSEAARAANNGALPPDLSYIVRARHGGEDYVFSLLTGY CEPPTGVSLREGLYFNPYFPGQAIAMAPPIYTDVLEFDDGTPATMSQIAKDVCTFLRW ASEPEHDHRKRMGLKMLMMMALLVPLVYTIKRHKWSVLKSRKLAYRPPK" intron 1536..2137 /note="cytochrome c-1, intron A" exon 2138..2334 /number=2 intron 2335..2428 /note="cytochrome c-1, intron B" exon 2429..2555 /number=3 intron 2556..2640 /note="cytochrome c-1, intron C" exon 2641..2798 /number=4 intron 2799..2887 /note="cytochrome c-1, intron D" exon 2888..3048 /number=5 intron 3049..3310 /note="cytochrome c-1, intron E" exon 3311..3411 /number=6 intron 3412..3507 /note="cytochrome c-1, intron F" exon 3508..>3612 /note="cytochrome c-1" /number=7 BASE COUNT 862 a 1309 c 1483 g 968 t ORIGIN Chromosome 8. 
1 gtcgactttt agtttagatt gtgccatctg gctctagcca atggagacag gacacagtag 61 cagggacaag ctgtgtaagg gataaaaata gcttctctcc tttattcagg tgtgctctca 121 ccatttttcc atctgtgagg agcaccctct ctgcagaaag taaaattgac ttgctgagag 181 aactttttgt cggaatgctg atctttcctt atggtaccag ggaacaagcg ttctgtttct 241 aaataaacat tttacatata acacagggtt tctccatgtt ggtcaggctg gtctcaaact 301 cacgacctca agtgatctgg ggcctcccaa agtgctggga ttacaggcgt gagcaccgca 361 cccagcctag tttttaaatt ttatttgctt tttttttttt cttgagaggg agtcttccgt 421 ctgtcatcca ggctggagtg cagtggcacg atctctactc actgcaacct ccgcctcccg 481 ggttcaagcg attctcctgc ctcagcctcc ccagagtagc tgggactaca ggcgcgcgcc 541 accacgcctg gctaattttt gtattttcag tagagacaag gtttcaccat gctgaccagg 601 atggtctcca tcttttgacc tcgtgatctg cccgccccac cctcccaaag tgctgggatt 661 acagcgtgag ccaccgcgcc cggcctttgc ttacttttta aatattttaa aataaaaata 721 ttctcacata acaagatcat atttacttac ttttttgaga cgaggtctcg ctcgatatcg 781 cccaggctgg agtgcagtgg cgatctcggc tcaccgcagt ctccacctcc ggggctcaag 841 cgatcctgcc tcagcttccc aaagcgctag gacccaaggc gcgcatcacg cgtcgggcca 901 tgagtcaagg gaatgaagga gaacagggtg ctcagcctgg gggcccagcc tcccgctggt 961 gcacgcgctg cctgcggccg agacccctgc ccgcaccctc tgccgggctg ccctccaagc 1021 cgccctttct ctggaggtcc tcagcctgca ggggcaccct ccacccggcc atcgcgcagc 1081 ctgggaaggt ggagaaaagg acgtcggggt gtcggaggcg gcgtgggaaa cgccgggcgg 1141 agcgtggcgc tgtcacggca acagagagac gcgacggggc cccgccccac cgccagtttc 1201 cacgacaacc cgaagagcgt ggggagtcag gcggtgcccc ggcccctgac tgacgcgacc 1261 ggggccagcg cgcttcgtcc ccgcccaccc gacaggcccc gcccccgagc ccggccccgc 1321 cccgcgctcc ccggctttcg cgaggttttg actctcgtgg cgccccaggg gccgacggga 1381 gtggcggccg cgcggaggag gccaagatgg cggcagctgc ggcttcgctt cgcggggtag 1441 tgttgggccc gcggggcgcg gggctcccgg gcgcgcgtgc ccggggtctg ctgtgcagcg 1501 cgcggcccgg gcagctcccg ctacggacac ctcaggtgag cgctgggccg ggccccggcc 1561 tccgcgcggc cccgcatctc cgtgaaggtc acggcgggga ggctgcgggc gcgggcctgg 1621 gcagcgcgga agcggtaccg gccacccagc gtccccggtc ccagctgcct gccgaccttg 1681 agctggtggg atcagggctg 
ggcgcccacc tctccgaacg gcagagagcc cgtccccagc 1741 gtgggggttg gcgggacggg ctagctgccg tggcggggct ggggctttcc cgaatggcgc 1801 gcccaggacg gctcttgcgg ctggctgtcc aaactgggcc cgcgtcctga agtgacccca 1861 gcctgatctc gccagtgctt gtgaccttgg cctgtcccag cacccttggt cacttcggtc 1921 tgatccccgg ctcaggatcc aggaacaccc tctcctgaga gcgaatcacg gtctaggggt 1981 cagcggcgga gagggatggg gtgggtagtg ggacagggtg tgcgtctggt gccctgcagg 2041 aggaggccat tgctgggtgg agaggtgggc gctgagagtc agatccccag gtagggctgg 2101 gtggtgtcct ggcaagcgct gagcggagat cttgcaggca gtggccttgt cgtcgaagtc 2161 tggcctttcc cgaggccgga aagtgatgct gtcagcgctg ggcatgctgg cggcaggggg 2221 tgcggggctg gccgtggctc tgcattcggc tgtgagtgcc agtgacctgg agctgcaccc 2281 ccccagctat ccgtggtctc accgtggcct cctctcttcc ttggaccaca ccaggtgtgc 2341 agctggctgg ctggctggca gcgggaggtt ctgggtggag ctggtaaggt ggaatcttca 2401 gctttcctaa ccctttccct ccctccagca tccggagggg tttccaggta tataagcagg 2461 tgtgcgcctc ctgccacagc atggacttcg tggcctaccg ccacctggtg ggcgtgtgct 2521 acacggagga tgaagctaag gagctggctg cggaggtgtg gggtctggga tgcctgggac 2581 ccagggctca gggctcccac tgttgagatg gcagggttgt gatgaggctc tcggtggcag 2641 gtggaggttc aagacggccc caatgaagat ggggagatgt tcatgcggcc agggaagctg 2701 ttcgactatt tcccaaaacc ataccccaac agtgaggctg ctcgagctgc caacaacgga 2761 gcattgcccc ctgacctcag ctacatcgtg cgagctaggt acacgggctg cccatggggt 2821 ggtgctggag agatggggga agggctgact tgtgcttgga actaaggagc catggatctg 2881 gtcctaggca tggtggtgag gactacgtct tctccctgct cacgggctac tgcgagccac 2941 ccaccggggt gtcactgcgg gaaggtctct acttcaaccc ctactttcct ggccaggcca 3001 ttgccatggc ccctcccatc tacacagatg tcttagagtt tgacgatggt aagaggcctc 3061 cagtctccta cccccaggga tgctttccct gtgttcctgg gtccaggagg tcctgcccac 3121 ttcttgtctg gggcgtcttc tgtgcatgtc tcaggaggct tttgggctcc ttcagttttg 3181 cgtatgtccg gtggagggta ctgccccttt gatgccttgg gtaggggcag tgtctgcttc 3241 acagaggggg ggcatgatcc caggttggac atgagcctga gaatagccct cactgcttgt 3301 cgttggccag gcaccccagc taccatgtcc cagatagcca aggatgtgtg caccttcctg 3361 cgctgggcat ctgagccaga gcacgaccat 
cgaaaacgca tggggctcaa ggtaaaaggg 3421 ttgggaggcc atggtgggta tcagagaagg gctgggagct gggcagggct cctcccactc 3481 ccttctctga gccttccttg tctgcagatg ttgatgatga tggctctgct ggtgcccctg 3541 gtctacacca taaagcggca caagtggtca gtcctgaaga gtcggaagct ggcatatcgg 3601 ccgcccaagt gaccctgtcc agtgtctgct tgccatcctg ccagaacagg ccctcaagcc 3661 caagagccat cccaggcctg ttcaggcctc agctaagcct ctcttcatct ggaagaagag 3721 gcaagggggc aggagaccag gctctagctc tgggccctcc ttcagccccc atcatgggaa 3781 taaattaatt ttctcaatgt acatatttga gttattatat ggagtgcagg tttgcaagga 3841 actctgatac caagaccctt gccttcacaa ggctcccagg atgtcccatg tcccatgcag 3901 tgtggccagc aatgccctgc actgactggt agctgggaag tccagacagt gggtgtggga 3961 cagaaagcgg agaactgcat gttagaggaa gtcagcattt ggtggggtgg ggtgtgtctg 4021 agcacagagc tgaaaagggc ctagccaggc aggatgtggc ccagggacct gttggaatga 4081 acagccaagt gttaaggtgt tgaaccactt ggcccccatt ctgatggcag ggaaccttgc 4141 tcattttaac agggaaagcc tgtctagggc aggggctatt tattctgggt tcgggctgag 4201 gaaggggatg gctcccagcc aggtgaagga gtcaagtaga cagacaggat gtgtgtcatg 4261 gagtgaggag tggcccctcc tagcagtcca cagggtagct aggtggtggg tcctggccag 4321 aagctggagt agctgggttc ccacaggagc atgctgctga gggagggaga cagtggatgc 4381 tatctaggag ggtcacagag acatgaccca gcaccggccc agtgacctgg tgcaggactc 4441 actcctgagc ttccctcgtc cctaggaagg gaaggaaaac acccattttc attgctggtg 4501 tcctagctgc aggagctgag gggagtggtg gccatagtcc aaggaaggca agagctgaat 4561 gggagaaagg tggtaccaat aggcctgcag gcaaagagag gtgcaggcct gggggaccag 4621 gg // LOCUS HUMCYCAA 3088 bp DNA PRI 15-JUN-1989 DEFINITION Human somatic cytochrome c (HCS) gene, complete cds. ACCESSION M22877 NID g181241 KEYWORDS cytochrome c. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 144; 1218 to 1394; 1496 to 3088) AUTHORS Evans,M.J. and Scarpulla,R.C. 
TITLE The human somatic cytochrome c gene: Two classes of processed pseudogenes demarcate a period of rapid molecular evolution JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9625-9629 (1988) MEDLINE 89071748 REFERENCE 2 (bases 1 to 3088) AUTHORS Evans,M.J. and Scarpulla,R.C. JOURNAL Unpublished (1989) COMMENT Draft entry and computer-readable sequence for [1],[2] kindly provided by M.J.Evans 03-MAR-1989. FEATURES Location/Qualifiers source 1..3088 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 81..2512 /note="HCS mRNA and introns" intron 145..1217 /note="HCS intron A" CDS join(1226..1394,1496..1644) /note="cytochrome c" /codon_start=1 /db_xref="PID:g181242" /translation="MGDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQA PGYSYTAANKNKGIIWGEDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKA TNE" exon <1226..1394 /note="cytochrome c, (first expressed exon)" /number=2 intron 1395..1495 /note="HCS intron B" exon 1496..>1644 /note="cytochrome c" /number=3 BASE COUNT 894 a 534 c 745 g 915 t ORIGIN 1 gcacgtcagg gcgcgggagc gcggagcgag tttggttgca cttacaccgg tacttaagcg 61 cggaccggcg tgtccttgga cttagagagt ggggacgtcc ggcttcggag cgggagtgtt 121 cgttgtgcca gcgactaaaa agaggtgaga gcgggtcgcg gaggccgcac ctggttagag 181 gcagagctgt gggaggcgcg cacttgcgag cgagccgaaa cccaagcggg gagcattcga 241 ggtggagccc gcgctgggtg ggagggcggg gagtgaagac cctggactgt ggtcagaccg 301 agctgggcga gtaacggctt gaggtgcggc ggagccctaa ctagggacag gtatggtctc 361 ggtcagggac tggaggcggc ttggatacag atccgaggag gaggcggcct cttccgtagt 421 ggttgctgaa gggctatgga aatgataggc aagacttccc tcctggaaag ccgaagctta 481 gagcttcacg ttcttcttca gagggcaaaa gctgttgctc ttctaataag gggccagttc 541 ttttcgtggg cacatgtttc ttccgtcagt cgttctgaca tcctagaagg agtttcatca 601 atcaccttga aaccgacctg gacgggtgac ctcgtggtcg ccccaggaga tcacaggtag 661 gggagttggg atcgcccggg ggaccgtgca gcctgcccct gagctcccat tcacaagttc 721 gagtgtcaag ctactcctgt gacctgggca gatagaaaca gccaggaccg ctttttaaac 781 atttgtgtgc tttgcgttat cctcagggag aggtggcttt acattgtagt aagattaaat 841 ggttaggtct 
ttttaaaagt tgcggttgtg gtgattttgg cttaatgtgt tcgcccttga 901 gcttcagatc tgtgacttcg tgaccatgat tgtctcttct gaaactggag tttgaattag 961 gttccctctt tgcttgggct ttaacgttcc ttcacgtata cacacaaaaa tacgtttttg 1021 aggaggtact cctaaaaatg tttttggtat taaagaatat ttggtataaa gagtattaaa 1081 gcaaaacaag attcattctg gtatttaatg acataaatta gcaatggatt ggtaattaag 1141 tggctagagt ggtcattcat ttacactgta tttgttacct gaggaaaaat ttactaagtt 1201 gaagctttcg tttttagaat taaatatggg tgatgttgag aaaggcaaga agatttttat 1261 tatgaagtgt tcccagtgcc acaccgttga aaagggaggc aagcacaaga ctgggccaaa 1321 tctccatggt ctctttgggc ggaagacagg tcaggcccct ggatactctt acacagccgc 1381 caataagaac aaaggtaaga gtcacttgtt aaataaaaca acacaaaatg caggaatata 1441 acatgtggca aactatcagg agtgtgaaat aaccgatgca ttctttcttg tttaggcatc 1501 atctggggag aggatacact gatggagtat ttggagaatc ccaagaagta catccctgga 1561 acaaaaatga tctttgtcgg cattaagaag aaggaagaaa gggcagactt aatagcttat 1621 ctcaaaaaag ctactaatga gtaataattg ggccactgcc ttatttatta caaaacagaa 1681 atgtctcatg acttttttat gtgtaccatc ctttaataga tctcatacac cagaattcag 1741 atcatgaatg actgacagaa tattttgttg ggcagtcctg atttaaaact aagactggct 1801 tgtggttaaa tgaatatgtt cagtttttga attttaatag taactccaat tcagtaaatg 1861 gtatcactgt ttaccccttt taaagatatg attagacttc gttagtaatg ttcaactttt 1921 cacaaagatg gtgagtgcca tcttaaaact tactggagat tggttttata tttagattta 1981 tataactggt tatgtgaata tatttaaata ctggggaaat tgcttcactg tcttagaacc 2041 aagcaagatt cacctgtgtt ttgtgttcat gttcatttgc ctcttaaagg caagggttga 2101 agataaataa ggtagcaatg tctatagttt tggccttaac tatgccaatc taattataat 2161 tccctgtatt taaaatggtt tcttttactt attgaaaggc attttagtgt ggtttatgtg 2221 taatattaaa gattattcaa cacctctcac atcttacaga tctataaggt cacatgcttt 2281 taaaatagta gcaagttaaa cttcactctt gaattcttta caatctaagt caaactaagt 2341 tataatttag gattgtcttt aaacagccat tcagaaacaa aactgtagaa ctgtgtattt 2401 gattgggaat ggtgcttttg ccaacttaaa aggattaaag taacggagat atacacaaat 2461 tttaaaatta tgtgtgatca caagactaaa gataattaaa aagaaaacca cagatcatga 2521 ctttttgact gtgcttgatt 
tcatgactga tgcacaaatt ttaatgatta aaaagtgcag 2581 gagccctaaa tgtcagtgca gcagccctaa atgtcagtgc agcagtgtta accagtcatg 2641 gtgctagatt gtttacttgg ttttctagga ctgcctcaac tagaataaca cttcactaat 2701 tgactcttag tttctttgct cagattgaga actgcagcat ttatcgcaga catggacaga 2761 ggaatgcctg tggtcatagt tttgtgatgt gtaacagtgt ataattacat actgaattat 2821 ttcatgcata gtctgtgcca tacacattta gagtagtcct tggagatttt atggagatgg 2881 tgagcacaag gtaagtcata aagaataatg agaaaataaa tctatgctgg tgcagctgag 2941 aactgtatct ttgtgggaca gtgagaagac tgagaagatg tgaatccatg gtctcaaagg 3001 tgatagggac gattagatag gtgttttaag gcctgaaagc aatttataac atatgagtct 3061 tatttttatt tatagaaatg tgaagctt // LOCUS HUMCYP2DG 5503 bp DNA PRI 15-SEP-1990 DEFINITION Human debrisoquine 4-hydroxylase mutant allele (CYP2D6-MA1) gene, complete cds. ACCESSION M33189 NID g181305 KEYWORDS debrisoquine 4-hydroxylase. SOURCE Human individual MAGA DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5503) AUTHORS Gonzalez,F.J. JOURNAL Unpublished (1990) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Gonzalez, 23-MAR-1990, for release after publication. Author address: F.Gonzalez National Cancer Institute Bldg. 37 Rm. 3E-24 National Institute of Health Bethesda, Md 20892. 
FEATURES             Location/Qualifiers
     source          1..5503
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     TATA_signal     689..702
     prim_transcript 726..5103
                     /note="debrisoquine 4-hydroxylase mRNA and introns"
     exon            <814..993
                     /note="debrisoquine 4-hydroxylase"
                     /number=1
     CDS             join(814..993,1696..1877,2419..2571,2661..2820,3254..3430,
                     3621..3762,3970..4157,4612..4753,4852..5030)
                     /note="debrisoquine 4-hydroxylase"
                     /codon_start=1
                     /db_xref="PID:g181306"
                     /translation="MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYSPGPLPLPGLG
                     NLLHVDFQNTPYCFDQLRRRFGDVFSLQLAWTPVVVLNGLAAVREAMVTRGEDTADRP
                     PVPITQILGFGPRSQGKQRGVPGALWARVARAEALLRLHLAQLGPGQEVAGAVGDRGG
                     RLPLCRLRQPLRRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKE
                     ESGFLREVLNAVPVLLHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTE
                     AFLAEMEKAKGNPESSFNDENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQRRV
                     QQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIEVQGFRI
                     PKGTTLITNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSAGRRACLGEP
                     LARMELFLFFTSLLQHFSFSVPTGQPRPSHHGVFAFLVTPSPYELCAVPR"
     intron          994..1695
                     /note="debrisoquine 4-hydroxylase intron A"
     exon            1696..1877
                     /number=2
     intron          1878..2418
                     /note="debrisoquine 4-hydroxylase intron B"
     exon            2419..2571
                     /number=3
     intron          2572..2660
                     /note="debrisoquine 4-hydroxylase intron C"
     exon            2661..2820
                     /number=4
     intron          2821..3253
                     /note="debrisoquine 4-hydroxylase intron D"
     exon            3254..3430
                     /number=5
     intron          3431..3620
                     /note="debrisoquine 4-hydroxylase intron E"
     exon            3621..3762
                     /number=6
     intron          3763..3969
                     /note="debrisoquine 4-hydroxylase intron F"
     exon            3970..4157
                     /number=7
     intron          4158..4611
                     /note="debrisoquine 4-hydroxylase intron G"
     exon            4612..4753
                     /number=8
     intron          4754..4851
                     /note="debrisoquine 4-hydroxylase intron H"
     exon            4852..>5030
                     /note="debrisoquine 4-hydroxylase"
                     /number=9
BASE COUNT     1066 a   1537 c   1851 g   1049 t
ORIGIN      Chromosome 22.
1 ggctgggaag tggggtactt ggtgccgggt ctgtatgtgt gtgtgactgg tgtgtgtgag 61 agagaatgtg tgccctaagt gtcagtgtga gtctgtgtat gtgtgaatat tgtctttgtg 121 tgggtgattt tctgcgtgtg taatcgtgtc cctgcaagtg tgaacaagtg gacaagtgtc 181 tgggagtgga caagagatct gtgcaccatc aggtgtgtgc atagcgtctg tgcatgtcaa 241 gagtgcaagg tgaagtgaag ggaccaggcc catgatgcca ctcatcatca ggagctctaa 301 ggccccaggt aagtgccagt gacagataag ggtgctgaag gtcactctgg agtgggcagg 361 tgggggtagg gaaagggcaa ggccatgttc tggaggaggg gttgtgacta cattagggtg 421 tatgagccta gctgggaggt ggatggccgg gtccactgaa accctggtta tcccagaagg 481 ctttgcaggc ttcaggagct tggagtgggg agagggggtg acttctccga ccaggcccct 541 ccaccggcct accctgggta agggcctgga gcaggaagca ggggcaagaa cctctggagc 601 agcccatacc cgccctggcc tgactctgcc actggcagca cagtcaacac agcaggttca 661 ctcacagcag agggcaaagg ccatcatcag ctccctttat aagggaaggg tcacgcgctc 721 ggtgtgctga gagtgtcctg cctggtcctc tgtgcctggt ggggtggggg tgccaggtgt 781 gtccagagga gcccatttgg tagtgaggca ggtatggggc tagaagcact ggtgcccctg 841 gccgtgatag tggccatctt cctgctcctg gtggacctga tgcaccggcg ccaacgctgg 901 gctgcacgct actcaccagg ccccctgcca ctgcccgggc tgggcaacct gctgcatgtg 961 gacttccaga acacaccata ctgcttcgac caggtgaggg aggaggtcct ggagggcggc 1021 agaggtgctg aggctcccct accagaagca aacatggatg gtgggtgaaa ccacaggctg 1081 gaccagaagc caggctgaga aggggaagca ggtttggggg acttcctgga gaagggcatt 1141 tatacatggc atgaaggact ggattttcca aaggccaagg aagagtaggg caagggcctg 1201 gaggtggagc tggacttggc agtgggcatg caagcccatt gggcaacata tgttatggag 1261 tacaaagtcc cttctgctga caccagaagg aaaggccttg ggaatggaag atgagttagt 1321 cctgagtgcc gtttaaatca cgaaatcgag gatgaagggg gtgcagtgac ccggttcaaa 1381 ccttttgcac tgtgggtcct cgggcctcac tgctcaccgg catggaccat catctgggaa 1441 tgggatgcta actggggcct ctcggcaatt ttggtgactc ttgcaaggtc atacctgggt 1501 gacgcatcca aactgagttc ctccatcaca gaaggtgtga cccccacccc cgccccagga 1561 tcaggaggct gggtctcctc cttccacctg ctcactcctg gtagccccgg gggtcgtcca 1621 aggttcaaat aggactagga cctgtagtct ggggggatcc tggcttgaca agaggccctg 1681 accctccctc tgcagttgcg 
gcgccgcttc ggggacgtgt tcagcctgca gctggcctgg 1741 acgccggtgg tcgtgctcaa tgggctggcg gccgtgcgcg aggcgatggt gacccgcggc 1801 gaggacacgg ccgaccgccc gcctgtgccc atcacccaga tcctgggttt cgggccgcgt 1861 tcccaaggca agcagcggtg gggacagaga cagatttccg tgggacccgg gtgggtgatg 1921 accgtagtcc gagctgggca gagagggcgc ggggtcgtgg acatgaaaca ggccagcgag 1981 tggggacagc gggccaagaa accacctgca ctagggaggt gtgagcatgg ggacgagggc 2041 ggggcttgtg acgagtgggc ggggccactg ccgagacctg gcaggagccc aatgggtgag 2101 cgtggcgcat ttcccagctg gaatccggtg tcgaagtggg gggcggggac cgcacctgtg 2161 ctgtaagctc agtgtgggtg gcgcggggcc cgcggggtct tccctgagtg caaaggcggt 2221 cagggtgggc agagacgagg tgggcaaagc cctgccccag ccaagggagc aaggtggatg 2281 cacaaagagt gggccctgtg accagctgga cagagccagg gactgcggga gaccaggggg 2341 agcatagggt tggagtgggt ggtggatggt ggggctaatg ccttcatggc cacgcgcacg 2401 tgcccgtccc acccccaggg gtgttcctgg cgcgctatgg gcccgcgtgg cgcgagcaga 2461 ggcgcttctc cgtctccacc ttgcgcaact tgggcctggg caagaagtcg ctggagcagt 2521 gggtgaccga ggaggccgcc tgcctttgtg ccgccttcgc caaccactcc ggtgggtgat 2581 gggcagaagg gcacaaagcg ggaactggga aggcggggga cggggaaggc gaccccttac 2641 ccgcatctcc cacccccaag acgccccttt cgccccaacg gtctcttgga caaagccgtg 2701 agcaacgtga tcgcctccct cacctgcggg cgccgcttcg agtacgacga ccctcgcttc 2761 ctcaggctgc tggacctagc tcaggaggga ctgaaggagg agtcgggctt tctgcgcgag 2821 gtgcggagcg agagaccgag gagtctctgc agggcgagct cccgagaggt gccggggctg 2881 gactggggcc tcggaagagc aggatttgcg tagatgggtt tgggaaagga cattccagga 2941 gaccccactg taagaagggc ctggaggagg aggggacatc tcagacatgg tcgtgggaga 3001 ggtgtgcccg ggtcaggggg caccaggaga ggccaaggac tctgtacctc ctatccacgt 3061 cagagatttc gattttaggt ttctcctctg ggcaaggaga gagggtggag gctggcactt 3121 ggggagggac ttggtgaggt cagtggtaag gacaggcagg ccctgggtct acctggagat 3181 ggctggggcc tgagacttgt ccaggtgaac gcagagcaca ggagggattg agaccccgtt 3241 ctgtctggtg taggtgctga atgctgtccc cgtcctcctg catatcccag cgctggctgg 3301 caaggtccta cgcttccaaa aggctttcct gacccagctg gatgagctgc taactgagca 3361 caggatgacc tgggacccag cccagccccc 
ccgagacctg actgaggcct tcctggcaga 3421 gatggagaag gtgagagtgg ctgccacggt ggggggcaag ggtggtgggt tgagcgtccc 3481 aggaggaatg aggggaggct gggcaaaagg ttggaccagt gcatcacccg gcgagccgca 3541 tctgggctga caggtgcaga attggaggtc atttgggggc taccccgttc tgtcccgagt 3601 atgctctcgg ccctgctcag gccaagggga accctgagag cagcttcaat gatgagaacc 3661 tgcgcatagt ggtggctgac ctgttctctg ccgggatggt gaccacctcg accacgctgg 3721 cctggggcct cctgctcatg atcctacatc cggatgtgca gcgtgagccc atctgggaaa 3781 cagtgcaggg gccgagggag gaagggtaca ggcgggggcc catgaacttt gctgggacac 3841 ccggggctcc aagcacaggc ttgaccagga tcctgtaagc ctgacctcct ccaacatagg 3901 aggcaagaag gagtgtcagg gccggacccc ctgggtgctg acccattgtg gggacgcatg 3961 tctgtccagg ccgtgtccaa caggagatcg acgacgtgat agggcaggtg cggcgaccag 4021 agatgggtga ccaggctcac atgccctaca ccactgccgt gattcatgag gtgcagcgct 4081 ttggggacat cgtccccctg ggtgtgaccc atatgacatc ccgtgacatc gaagtacagg 4141 gcttccgcat ccctaaggta ggcctggcgc cctcctcacc ccagctcagc accagcccct 4201 ggtgatagcc ccagcatggc tactgccagg tgggcccact ctaggaaccc tggccaccta 4261 gtcctcaatg ccaccacact gactgtcccc acttgggtgg ggggtccaga gtataggcag 4321 ggctggcctg tccatccaga gcccccgtct agtggggaga caaaccagga cctgccagaa 4381 tgttggagga cccagcgcct gcagggagag ggggcagtgt gggtgcctct gagaggtgtg 4441 actgcgccct gctgtggggt cggagagggt actgtggagc ttctcgggcg caggactagt 4501 tgacagagtc cagctgtgtg ccaggcagtg tgtgtccccc gtgtgtttgg tggcaggggt 4561 cccagcatcc tagagtccag tccccactct caccctgcat ctcctgccca gggaacgaca 4621 ctcatcacca acctgtcatc ggtgctgaag gatgaggccg tctgggagaa gcccttccgc 4681 ttccaccccg aacacttcct ggatgcccag ggccactttg tgaagccgga ggccttcctg 4741 cctttctcag caggtgcctg tggggagccc ggctccctgt ccccttccgt ggagtcttgc 4801 aggggtatca cccaggagcc aggctcactg acgcccctcc cctccccaca ggccgccgtg 4861 catgcctcgg ggagcccctg gcccgcatgg agctcttcct cttcttcacc tccctgctgc 4921 agcacttcag cttctcggtg cccactggac agccccggcc cagccaccat ggtgtctttg 4981 ctttcctggt gaccccatcc ccctatgagc tttgtgctgt gccccgctag aatggggtac 5041 ctagtcccca gcctgctccc tagccagagg ctctaatgta 
caataaagca atgtggtagt 5101 tccaactcgg gtcccctgct cacgccctcg ttgggatcat cctcctcagg gcaaccccac 5161 ccctgcctca ttcctgctta ccccaccgcc tggccgcatt tgagacaggg gtatgttgag 5221 gctgagcaga tgtcagttac ccttgcccat aatcccatgt cccccactga cccaactctg 5281 actgcccaga ttggtgacaa ggactacatt gtcctggcat gtggggaagg ggccagaatg 5341 ggctgactag aggtgtcagt cagccctgga tgtggtggag agggcaggac tcagcctgga 5401 ggcccatatt tcaggcctaa ctcagcccac cccacatcag ggacagcagt cctgccagca 5461 ccatcacaac agtcacctcc cttcatatat gacaccccaa aac // LOCUS HUMCYPBB 6910 bp DNA PRI 30-MAY-1996 DEFINITION Human CYP11B2 gene for steroid 18-hydroxylase, complete cds. ACCESSION D13752 NID g505080 KEYWORDS aldosterone synthase; steroid 18-hydroxylase. SOURCE Homo sapiens blood lymphocyte (library: Charon 4A) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kawamoto,T., Mitsuuchi,Y., Ohnishi,T., Ichikawa,Y., Yokoyama,Y., Sumimoto,H., Toda,K., Miyahara,K., Kuribayashi,I., Nakao,K., Hosoda,K., Yamamoto,Y., Imura,H. and Shizuta,Y. TITLE Cloning and expression of a cDNA for human cytochrome P-450aldo as related to primary aldosteronism JOURNAL Biochemical and Biophysical Research Communication 173, 309-316 (1990) REFERENCE 2 (sites) AUTHORS Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K., Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S. and Shizuta,Y. TITLE Role of steroid 11 beta-hydroxylase and steroid 18-hydroxylase in the biosynthesis of glucocorticoids and mineralocorticoids in humans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (4), 1458-1462 (1992) MEDLINE 92159068 REFERENCE 3 (bases 1 to 6910) AUTHORS Kawamoto,T., Miyahara,K., Mitsuuchi,Y., Ulick,S. and Shizuta,Y. 
TITLE CMO I and CMO II deficiencies: typical inborn errors of aldosterone biosynthesis in humans as derived from defects of the steroid 18-hydroxylase gene (CYP11B2) JOURNAL Unpublished (1992) REFERENCE 4 (bases 1 to 6910) AUTHORS Miyahara,K. TITLE Direct Submission JOURNAL Submitted (21-NOV-1992) to the DDBJ/EMBL/GenBank databases. Kaoru Miyahara, Kochi Medical School, Department Medical Chemistry; Kohasu Oko-cho, Nankoku, Kochi 783, Japan (Tel:0888-66-7368, Fax:0888-66-7368) COMMENT Submitted (21-NOV-1992) to DDBJ by: Kaoru Miyahara Department of Medical Chemistry Kochi Medical School Kohasu, Oko-cho Nankoku, Kochi 783 Japan Phone: 0888-66-7368 Fax: 0888-66-7368. FEATURES Location/Qualifiers source 1..6910 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /clone_lib="Charon 4A" /tissue_type="blood" TATA_signal 569..575 /evidence=experimental 5'UTR 604..610 exon 604..849 /number=1 gene 611..6466 /gene="CYP11B2" CDS join(611..849,1237..1392,3201..3400,3538..3741,4028..4182, 4995..5161,5561..5639,5719..5916,6353..6466) /gene="CYP11B2" /codon_start=1 /product="steroid 18-hydroxylase" /db_xref="PID:d1003404" /db_xref="PID:g975640" /translation="MALRAKAEVCVAAPWLSLQRARALGTRAARAPRTVLPFEAMPQH PGNRWLRLLQIWREQGYEHLHLEMHQTFQELGPIFRYNLGGPRMVCVMLPEDVEKLQQ VDSLHPCRMILEPWVAYRQHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPKAVQRFLPM VDAVARDFSQALKKKVLQNARGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSS ASLNFLHALEVMFKSTVQLMFMPRSLSRWISPKVWKEHFEAWDCIFQYGDNCIQKIYQ ELAFNRPQHYTGIVAELLLKAELSLEAIKANSMELTAGSVDTTAFPLLMTLFELARNP DVQQILRQESLAAAASISEHPQKATTELPLLRAALKETLRLYPVGLFLERVVSSDLVL QNYHIPAGTLVQVFLYSLGRNAALFPRPERYNPQRWLDIRGSGRNFHHVPFGFGMRQC LGRRLAEAEMLLLLHHVLKHFLVETLTQEDIKMVYSFILRPGTSPLLTFRAIN" intron 850..1236 /gene="CYP11B2" /number=1 exon 1237..1392 /gene="CYP11B2" /number=2 intron 1393..3200 /gene="CYP11B2" /number=2 exon 3201..3400 /gene="CYP11B2" /number=3 intron 3401..3537 /gene="CYP11B2" /number=3 exon 3538..3741 /gene="CYP11B2" /number=4 intron 3742..4027 /gene="CYP11B2" /number=4 exon 
4028..4182 /gene="CYP11B2" /number=5 intron 4183..4994 /gene="CYP11B2" /number=5 exon 4995..5161 /gene="CYP11B2" /number=6 intron 5162..5560 /gene="CYP11B2" /number=6 exon 5561..5639 /gene="CYP11B2" /number=7 intron 5640..5718 /gene="CYP11B2" /number=7 exon 5719..5916 /gene="CYP11B2" /number=8 intron 5917..6352 /gene="CYP11B2" /number=8 exon 6353..>6910 /number=9 3'UTR 6467..>6910 BASE COUNT 1372 a 1907 c 2241 g 1390 t ORIGIN 1 ggatcctgca aggagggata caaattacat acatttgtca aaacccacag catgttgacc 61 accaggagga gaccccatgt gactccagga ccctggttga taacaacgta tcgagattcc 121 tcacatggaa ccagtgcgct cctgtggtgg agggtgtacc tgtgtcaggg cagggggtac 181 gtggacattt tctgcagttt ttgatcaatt ttgcaatgaa ctaaatctgt ggtataaaaa 241 taaagtctat taaaagaatc caaggctccc tctcatctca cgataagata aagtccccat 301 ccattttact cctctcagcc ctggagaaag gagaggccag gtcccaccac cttccaccag 361 catggacccc cagtccagac cccacgcctt ttctcagcat cctcagacca gcaggacttg 421 cagcaatggg gaattaggca cctgacttct ccttcatcta cctttggctg ggggcctcca 481 gccttgacct tcgctctgag agtctcaggc aggtccagag ccagttctcc catgacgtga 541 tatgtttcca gagcaggttc ctgggtgaga taaaaggatt tgggctgaac agggtggagg 601 gagcattgga atggcactca gggcaaaggc agaggtgtgc gtggcagcgc cctggctgtc 661 cctgcaaagg gcacgggcac tgggcactag agccgctcgg gcccctagga cggtgctgcc 721 gtttgaagcc atgccccagc atccaggcaa caggtggctg aggctgctgc agatctggag 781 ggagcagggt tatgagcacc tgcacctgga gatgcaccag accttccagg agctggggcc 841 cattttcagg taaagccctc cctggccctc gctgggaaca cccagatccc tgcccctgct 901 gcccaggacc ctgccaggca ctcagcactg ccattcccag caggtcccgg cactctgcat 961 cctttggagg atggggaagg agtgcagcac atgctggtct gtggtgctgc cagggcaggg 1021 gatagtgcag agaaaacccc agctcactgc agagagggca ggactcagaa gcactaaagt 1081 tgaaaggttc cagggagcca gcaggagggc tttagctgtg aagccgctaa tccaggagca 1141 gggagggtgg acaggagaca ctttggattg ggactgcagg gtggggccac gagggacatg 1201 accccgtcca gcagggcctc ctgcttggcc ccacaggtac aacttgggag gaccacgcat 1261 ggtgtgtgtg atgctgccgg aggatgtgga gaagctgcaa caggtggaca gcctgcatcc 1321 ctgcaggatg atcctggagc 
cctgggtggc ctacagacaa catcgtgggc acaaatgtgg 1381 cgtgttcttg ttgtaagcgg cgagttggga gctgagagct gggagcaggg tgggcagcct 1441 gggtgtaggg gggaggcgag agaggtagga cccaaaagca catctgccct gggcccctgt 1501 ggtgggcagt gagggtgagc acccggccca gaggacggcc atcctgtggg gtcgcgtctg 1561 cactgtgggt tggggaagca gggcggtggt ggagaaatgg gcacgggcac ctctgcagag 1621 aagacgcaga gcaatgagcc cttctgtgta gtgagaaccc gctctgcacc aacctcggcg 1681 gctgctttct cttgcggtct ggggactgtc cttcccatag gtcagaaaac tgaggccctg 1741 agaaggggac ttccactggc ccaggtcaca ggctgagtgc tgagcctggt gttcgccggg 1801 gccgcagcct ccctcagggc gctcagggtc cctgcagtcc tggcaaacct tcctgatggg 1861 gacagtccgg ggcaggaggc aggtggggac gcaggtggct ggtggttccg ttgttctcag 1921 aagcaaggca caaggtgggg cggttgatgg cactggggag gatgtttcct ggcccgtgga 1981 gagggtggcg cctggtcagg tgggcaggga gaggctgatg cttggagtcg gtcacctgca 2041 gggatgttgt cattaggacg ggggaaggac tggatgagga tgtcacagtg gtgacagccc 2101 ccactccatg gtaggaaggg aacgctattg ggaatagtgg ggtttaggta aaagggcacc 2161 cgtgggtcgg ggccttcact gaggctggcc tatagatgac atctgggaga gagtcaggac 2221 ccaggaaggc aggtccagga ggctgggtgc gcataatgga aggaagggga gcgctcctgt 2281 ctgtgtgtgt gtcttgcatc tgtgcacatg ctgtgtgttt ctctgtacct gcattgcaca 2341 tgtgtagtgt gtgcacgtgt cgtgtgtgaa tgtatgtgtg gtgtgtgtgc acaagtgtct 2401 gtgtgtgtgc atgtgcaggt gccggcatgg gtgtagtgtt tgtgcacaca tgcacatgcg 2461 tctcttcaca catggtgttg aggtcttgca tgggcgcacg tgtgcatgtg catcttctgc 2521 ctgtcatcac tgtcaacagc tcacagcagc cagctggaca taaataaagg agttttgcag 2581 gaatgtggct gacaggggaa attcctcccc accattccct gggggcatcc atggagcccc 2641 cacgcactct ggctgtgggt aggatggcat gaagcacaaa gcttggtttc tgtcctgcag 2701 aagatataga tgcttcacag agacagcaga gcagatgccc cagaggcact gtgcccaggg 2761 cggggaaggg tggggaggag agggcagcca ggggctctcc cctcaggaca ctgtgtgggt 2821 gaggtgggca aagcttgaca acaggggtca cctcctttct tggagaaaag ccctaccctg 2881 ttactacagg gagggcccgc atgggtgagg tggtgccaga cttgggtcgc caggtcccgg 2941 gaatgacctc agttaccctg tcagcacctg tgggcagaag ctaccatctc atccctgctt 3001 agacctgagt ggcctttgcc cagcacctgg 
aggccgctct gagaaaaggc tgcagctcga 3061 acacaaacag gcagcttcta ccagggcccc cagtcagctc cctgcaggcc gattcccctt 3121 ggggacaagg aggatgggat acgggtcagg gcctgtgtct tgctggggcg gcctcacaag 3181 ctctgccctg gcctctgtag gaatgggcct gaatggcgct tcaaccgatt gcggctgaac 3241 ccagatgtgc tgtcgcccaa ggccgtgcag aggttcctcc cgatggtgga tgcagtggcc 3301 agggacttct cccaggccct gaagaagaag gtgctgcaga acgcccgggg gagcctgacc 3361 ctggacgtcc agcccagcat cttccactac accatagaag gtgtgggcca tgcgggaagg 3421 tccagcccca gagaccctgg agtggccagg gatggggatg gaggactgaa gggagtgtgg 3481 ggaggcagcc aggaggcctg gggctgcctt gtgctcagca gtgcatcctc cccgcagcca 3541 gcaacttagc tctttttgga gagcggctgg gcctggttgg ccacagcccc agttctgcca 3601 gcctgaactt cctccatgcc ctggaggtca tgttcaaatc caccgtccag ctcatgttca 3661 tgcccaggag cctgtctcgc tggatcagcc ccaaggtgtg gaaggagcac tttgaggcct 3721 gggactgcat cttccagtac ggtgaggcca gggacccggg cagtgctatg gggaagggac 3781 accatggggg cccaatttct ccttctccac cacccagtgg ggaatggagg ccacagggag 3841 gggtcgggga ttcctcacct tcctgccggg gagattggtg cgaggctggg gctgggctgg 3901 gctgatccgg agaatttggg atgagagcag ggagatttgg gtgtcggggc agtctgggca 3961 ggaggaggac actgaaggat gcttcccagc accaagatct gagggctgtc ccctgctccc 4021 tggacaggtg acaactgtat ccagaaaatc taccaggaac tggccttcaa ccgccctcaa 4081 cactacacag gcatcgtggc ggagctcctg ttgaaggcgg aactgtcact agaagccatc 4141 aaggccaact ctatggaact cactgcaggg agcgtggaca cggtcaggcc agcaaccagc 4201 cccacccaga gagggtgatg ccaagcctgc ctcccaggca ctgcctgcca atgccacacg 4261 gcacccacgt tccccatccc caggctacag gccccacatt tctgttgccc tcagccttcc 4321 ccctcctttg ttaagggatg agatttgcag gggaggggaa atgtgagctc cccctcacat 4381 gagactgagt ttgcagttac ctgtgtgggg atccatgctc caggctggaa gaaagttgga 4441 tgaggccctg gacacacagc agctctgtcc ccactggaaa gctctgggtg tacaaggaga 4501 aggagggttg agaggcagct ggaggactcc actgggcacc cttcccagtg tgcccggtca 4561 ccttgggcca gaaatgtaga tgcatgggag ggcagggttg tggggaagac agcagcacag 4621 gctccagcca gtgcagaggg gcctgtgggt gcacagtggg gagaactcaa tggaagcaga 4681 gggagctggg gctccagaac tccctggatg atgctgaggt 
gtggccccct gccctaatgg 4741 tggctgtgag aacccgccct gaagaggctg caggggacct gggccttggt ggagatgggg 4801 gtcacctttc cctgaagaag tcagggaatc tggcccaagt ggtcatcaag gtttcagatc 4861 cggggtccca gggctctgtt tttgctcagg gcatggatgt ctccacccct cagagggagg 4921 ttgtcctggg aggggtgtcc cgggggctga gtcctcctgt gcaaggtctg accctgcaga 4981 catggcttct gtagacagcg tttcccttgc tgatgacgct ctttgagctg gctcggaacc 5041 ccgacgtgca gcagatcctg cgccaggaga gcctggccgc cgcagccagc atcagtgaac 5101 atccccagaa ggcaaccacc gagctgccct tgctgcgggc ggccctcaag gagaccttga 5161 ggtgggtgct ggctgaggcc tccctgtggc cctggccccc tgctggagag cagcccccac 5221 tgggtggtgg cagacagaat ctggggctga taaacagcgt cacccagcag cccattcccc 5281 tgcacctgct cttcctcccc ctcaaggaca gggagctctt cttcctctgg aatccctctt 5341 caacgccctg gggattaacg tggggcatgt ccttctgcgc tcggggctgc ttaagttagg 5401 ggaggtttgg ctgggctcag caggtgcaag gaagcacttc ctacgacctg ggcttcccat 5461 ggatctggga cctctgcggg gtcttcggta ggaagggtgc tgagagcaca gggagcccca 5521 tccagctgag gaccctttct gtggatgccc ccacctccag gctctaccct gtgggtctgt 5581 ttttggagcg agtggtgagc tcagacttgg tgcttcagaa ctaccacatc ccagctgggg 5641 tgagtgagcc ccacacccct cgagctgaga acctccctcc ccagtcattc cctgatccct 5701 gctctgcacc gtccgcagac attggtacag gttttcctct actcgctggg tcgcaatgcc 5761 gccttgttcc cgaggcctga gcggtataat ccccagcgct ggctagacat caggggctcc 5821 ggcaggaact tccaccacgt gccctttggc tttggcatgc gccagtgcct cgggcggcgc 5881 ctggcagagg cagagatgct gctgctgctg caccacgtaa gcaggcctgg gggcgggggc 5941 gggacctggg cagcagaggc gggacctgca cactgggggc ggggcttgca tggtgtgatt 6001 gacacctggg aacagtggat ggggccttgg ttggttgagg tcggcgtgac cagggaggat 6061 ctgtgctgag caagacaggg taggatctgg gtgaggttgc ttctaaacat tgaaatgggg 6121 actaggggag tggggtggag cctgtacaga ataatggggc ttgggcaaga cctgggcagg 6181 attcagtctg ggcctggtcc gcaaggtggg gctggtcaga aatgggatag gttggggccc 6241 aggctgctgc tcccccttca gcataattgt tgcacctggg acgatgggag gaagctgccc 6301 caggtccatg ggctactgac caggccagat ggaaacccag cctctgtcct aggtgctgaa 6361 gcacttcctg gtggagacac taactcaaga ggacataaag atggtctaca 
gcttcatatt 6421 gaggcctggc acgtcccccc tcctcacttt cagagcgatt aactagtctt gcatctgcac 6481 ccagggtccc agcctggcca ccagcttccc tctgcctgac cccaggccac ctgtcttctc 6541 tcccacgtgc acagcttcct gagtcacccc tctgtccagc cagctcctgc acaaatggaa 6601 ctccccaggg cctccaggac tggggcttgc caggcttgtc aaatagcaag gccagggcac 6661 agctggagac gatcttgctg gcagggcctg gccttgtccc cagccccacc tggccccttc 6721 tccagcaagc agtgccctct ggacagcttg actctactcc tcccagcgct ggctccaggc 6781 tcctcatgag gccatgcaag ggtgctgtga ttttgtccct tgccttcctg cctagtctca 6841 catgtccctg tccctctcgc cctggccagg gcctctgtgc agacagtgtc agagtcatta 6901 agcgggatcc // LOCUS HUMCYPIIE 14776 bp DNA PRI 02-NOV-1994 DEFINITION Human cytochrome P450IIE1 (ethanol-inducible) gene, complete cds. ACCESSION J02843 NID g181355 KEYWORDS cytochrome P450. SOURCE Human (adult) liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14776) AUTHORS Umeno,M., McBride,O.W., Yang,C.S., Gelboin,H.V. and Gonzalez,F.J. TITLE Human ethanol-inducible P450IIE1: complete gene sequence, promoter characterization, chromosome mapping, and cDNA-directed expression JOURNAL Biochemistry 27 (25), 9006-9013 (1988) MEDLINE 89166510 REFERENCE 2 (sites) AUTHORS Uematsu,F., Kikuchi,H., Abe,T., Motomiya,M., Ohmachi,T., Sagami,I. and Watanabe,M. TITLE MspI polymorphism of the human CYP2E gene JOURNAL Nucleic Acids Res. 19 (20), 5797 (1991) MEDLINE 92051346 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.J.Gonzales, 18-AUG-1988. 
FEATURES             Location/Qualifiers
     source          1..14776
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="10"
     prim_transcript 2789..14198
                     /note="CYP2E mRNA and introns"
     gene            2825..3001
                     /gene="CYP2E"
     exon            <2825..3001
                     /gene="CYP2E"
                     /note="cytochrome P450IIE1; G00-119-833"
                     /number=1
     CDS             join(2825..3001,3907..4066,7006..7155,7545..7705,
                     8113..8289,9172..9313,12151..12338,12838..12979,
                     13864..14048)
                     /note="cytochrome P450IIE1"
                     /codon_start=1
                     /db_xref="PID:g181356"
                     /translation="MSALGVTVALLVWAAFLLLVSMWRQVHSSWNLPPGPFPLPIIGN
                     LFQLELKNIPKSFTRLAQRFGPVFTLYVGSQRMVVMHGYKAVKEALLDYKDEFSGRGD
                     LPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK
                     TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFHLLSTPWLQLY
                     NNFPSFLHYLPGSHRKVIKNVAEVKEYVSERVKEHHQSLDPNCPRDLTDCLLVEMEKE
                     KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
                     PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYLIPKGTVVVPT
                     LDSVLYDNQEFPDPEKFKPEHFLNENGKFKYSDYFKPFSTGKRVCAGEGLARMELFLL
                     LCAILQHFNLKPLVDPKDIDLSPIHIGFGCIPPRYKLCVIPRS"
     intron          3002..3906
                     /note="CYP2E intron A"
     exon            3907..4066
                     /number=2
     intron          4067..7005
                     /note="CYP2E intron B"
     exon            7006..7155
                     /number=3
     intron          7156..7544
                     /note="CYP2E intron C"
     exon            7545..7705
                     /number=4
     intron          7706..8112
                     /note="CYP2E intron D"
     exon            8113..8289
                     /number=5
     intron          8290..9171
                     /note="CYP2E intron E"
     exon            9172..9313
                     /number=6
     intron          9314..12150
                     /note="CYP2E intron F"
     exon            12151..12338
                     /number=7
     intron          12339..12837
                     /note="CYP2E intron G"
     exon            12838..12979
                     /number=8
     intron          12980..13863
                     /note="CYP2E intron H"
     exon            13864..>14048
                     /note="cytochrome P450IIE1"
                     /number=9
BASE COUNT     3635 a   3645 c   3772 g   3724 t
ORIGIN      2246 bp upstream of EcoRI site; chromosome 10.
1 cccccattga aaaattgtct ttctgatctt tataaacaat tatttaatat ccagtaaaat 61 cttctctata ttgctttact agtgagttct attaaaattt tgaagcacag aaaattcccc 121 tacagtataa agtatcccca gtcacagaga agacaggggt tttgcaatga tttctagaat 181 agtgcaattt ttatgcaaga acctaatata acacaaaaat tatagcccga ttttatttgt 241 gggtatagat gcaaaattac taaaaatact attaacaagt tgaatcctta gggtgttaaa 301 agagtatcac tccatgaacg agttggttgt gatgtggaac tatgaggtac ttttatgata 361 caatataaaa atttatggta attttatggt acattgtgag acagtgtttt cttctagcat 421 catactagca ggtctatgga gaaaaatcac aggattgtct caatcaaaaa aagatttcat 481 taacccaact ctcatccctg ataaacactg ttagttatct agagaaagaa gaaaattgtc 541 ccaatacagt cacctctttg ccacacccag ccaacagcag acgtgatgga agcctgaaga 601 acaccctgcc acgggcacag gcagaggcac aggcaccctg tcgtcctgat tatttcacct 661 tgtcacgggc agaggcacag gcaccctgtc gtcctgatta tttcaccttg tcacaggcac 721 aggcaccctg tcgtcctgat tatttcacct tgtcacaggc acaggcactc tgtcgtcctg 781 attatttcac cttgtcacgg gcagaggcac aggcactctg tcatcctgat tatttcacct 841 tgtcctagag tgtcctgcca atgggacaga tgcaaaacaa ataaaagccc cggcttctga 901 aaagaagcac acagaaatgt cattattttc aaacgaggtg ttcccgtata taaaatttga 961 tgttggttgg gcatctaaca gtattatggc cagaggactc agaccacagc tgcatccctg 1021 tgaggcacag actctccagg gcacgcgggt cccgctggga tgtgcacact caggtgagct 1081 gcacagacaa ggtgtcctca gcccagggga gccagaggcc tgctctgcct ctccaccctg 1141 atgcttcctg ttctcacccc accaaagcca aggcttcaat ttcagtctgt ggggagctga 1201 ctctgctgct ctcaagcact agaagaagga accagtaatc gaggaaactt gtggacccca 1261 atggtgtctg tcccggccag gcctggctgg gcccacacag gacaacaggg ttcaggggtc 1321 tggacagctg tttctgccca gggaattgtc cctgccacct cacactggcc actggaaagg 1381 aaagagagga ggaggcggca ggctaaccca cccgtgagcc agtcgagtct acattgtcag 1441 ttctcacctc gaggggtgcc aaaaaccaga gggaagcaaa ggcccctgaa gcctctgcca 1501 gaggccaacg ccccttcttg gttcaggaga ggtgcagtgt taggtgcagc acaaccaatg 1561 acttgcttat gtggctaata aattgtcaag agaaaaactg ggttagaatg caatatatag 1621 tatgtagtct catttttgta taaatacaag tatagaatgg cataactcaa aatccacaag 1681 tgatttggct ggattgtaaa 
tgacttttat tttcttcatt tctcatcata ttttctatta 1741 tacataaaga ttcattgtta atataaaagt acaaaattgc aacctatgaa ttaagaactt 1801 ctatatattg ccagttagaa gacagaatga aaaacattct cttcattcta accacacaca 1861 caaaaaactc cacaaaatac ctatggacta ccttcataga aggtggaaga gggtctgtat 1921 gaagaaaatg cttaatacat gaaagaagaa gctagtcaat gtggaggtct attgtgcgcc 1981 gggatcaaca aagacaagat atgtttaaaa tggtgttcta aatttaccct aatgtaaaac 2041 aaatccaata aaactctaat gtgatttttt aagaatttaa atttggaata attccaaaga 2101 acaatttttc ttaatttcta cagccagaat atataccttt aaaaaaaatg aaaacagaga 2161 ttaactttct cagaattggt tgactcactc tttcctttta tttttcttcc atggaatttt 2221 ccagttaact tgagaaagtg gaatcgaatt ccgatgttga attttccttc tggccccatt 2281 catgtggcag gtggtgattc aggtactact gggggctgct cagacaaacc tcctcatcag 2341 acatcaagag gctgttgcac caggagggcc ggtaccgtgt ctagaggtgg tcggcatggg 2401 gttggagttg tattacataa accctactcc aaacaaatgc atggggatgt ggctggagtt 2461 ccccgttgtc taaccagtgc caaagggcag gtcggtacct caccccacgt tcttaactat 2521 gggttggcaa catgttcctg gatgtgtttg ctggcacagt gacaggtgct agcaaccagg 2581 gtgttgacac agtccaactc catcctcacc aggtcactgg ctggaacccc tgggggccac 2641 cattgcggga atcagccttt gaaacgatgg ccaacagcag ctaataataa accagtaatt 2701 tgggatagac gagtagcaag agggcattgg ttggtgggtc accctccttc tcagaacaca 2761 ttataaaaac cttcctttcc acaggattgt cctcccgggc tggcagcagg gccccagcgg 2821 caccatgtct gccctcggag tcaccgtggc cctgctggtg tgggcggcct tcctcctgct 2881 ggtgtccatg tggaggcagg tgcacagcag ctggaatctg cccccaggcc ctttcccgct 2941 tcccatcatc gggaacctct tccagttgga attgaagaat attcccaagt ccttcacccg 3001 ggtaagagaa atagtgttga ttttagggag aataactcag caattggatc tggtatgtgt 3061 gtattcaact catttgcaga caaattgtgg ttgttcaata ccagcctgtt gtgaattacc 3121 tgaattgata gcatcctgga gcgacactca aaatgtgtcg cctgtggtgc agctggagcc 3181 cggagcctgc gtgccaggcc ccggaggccc ccgccgtgcc ttgtcctggg gctgatgatg 3241 gggaggccgg cgaggccggg ctgctgcgac gccaggataa ccgggctggc ggccagatgc 3301 gcactcgctg ggcgtccgcc tgtgtttgcc aaagcacgag ttgaaacgtg aagtgttggg 3361 ccagcccgtg tggcaccaat acctgccgcc 
tacgactgtt gtgaacactg aatgggccaa 3421 caaacctaaa cgttaaatga actgataacg ccgtcagcac ggagcaggcg ctgggtgttt 3481 gcgctcttgc gcgtgcgctg ctgtggggcg caggctgacg gcgggcgggg gtcgcctgct 3541 ccagctcggg ctcccgcgcc agaaccgggt ccagaacctt gattccggaa gcgggcaacg 3601 gggtggttgg tgggcgcgcc tgagggaagg gacgtgagga gccggagtcc gcggagttgc 3661 cgcggagttg tccgcggagt ccaggcgggt ggggagcaga gcagctggaa ccccccgagc 3721 gccctgcaga cgcagcagcc tcttgagggg agggtctccc ccacctcggg ctggacaaag 3781 acagcttttc cccacgtccc tctgggttct ctagagcaac agcaataccc gcccggcagg 3841 tgtggcttag agccccgcac ctcctcgccg cgcgcgggcc tgacttctag ccacgggtct 3901 ccgcagttgg cccagcgctt cgggccggtg ttcacgctgt acgtgggctc gcagcgcatg 3961 gtggtgatgc acggctacaa ggcggtgaag gaagcgctgc tggactacaa ggacgagttc 4021 tcgggcagag gcgacctccc cgcgttccat gcgcacaggg acaggggtga gtccgcgtcc 4081 ctggcacgga gcggggggtg cataacacgc cccgggacag ttacgggcgc tagccacgtc 4141 ggcgatggcc aaataataaa ctaacagtaa tattatagta atagcatccg aaggatgaga 4201 tcaggattag gcgatggccc ccgcgcgttg cctgccgagc gaggcgcact gagtcgccca 4261 ggaatccggc ctctcggcga ctgtgcggga gagttttatg gggatgggcg gggctgcttc 4321 tgagcaggag tcgccgcccc cacccccacc gttccgcctc tgggccgcag gctcctcccg 4381 ggagcgcttt cccctcctgt tcaaccgccg gggtacaggt ggcttcgtcc accgaggtcc 4441 cctcacccac gctgaggcgt cggaagctgc ggacactgct cgcttcaggg ctttgctcag 4501 ctgcagctgg tgacctccag agagggagtc tctgatgtcc cgctggggtg gatgtcctga 4561 gaccgggaag ggggaagaga cccactgaaa tcctatctcc cagcctcacc tctgctgtct 4621 cctccacgct tcctgtctcc agagccccga gttcagcata agcagaaagc ggcctgttcc 4681 ctctctaggg agaggagggt tgcggtctgg aggtctggct cgtctttatc tgcgcattct 4741 cccagcctcc tggcttcaga cctcagcgag gcggcggctg cggccggctc tcctcttcct 4801 gcctgcagac ctggcctgct gcttctttct ccttcctccc tccctgcctg ccctgcggtt 4861 tcaaagtaga ttagaaataa cagtgtccca catggaagcc tctacttctt cctgggtcaa 4921 ctttgatgac gaggctccag aaaacctttg caatgctgtg tggaattttt aaatcggtga 4981 gctcgtgctc ttgccctatt tatttgtcca gcgtacattt ctgaacattg tgaacgtcga 5041 atgggccaac aaatctaaaa attaaatgag ctgataaaga 
acgccgtcag cacagagcag 5101 acgctgggtg ttcgcgctct tgagcgtgcg ctctgcgggg cgcgggctgg tggcgggcgg 5161 gggtcgccgg ctccagctca ggttcccgcg ccaggaccgc gtccagaacc ttgtctccgg 5221 aagcgggcaa cggggtggtt gtatcacaat tagtggcatt tggttttcct tcttctgcat 5281 tgtgggtttt acttctctgg ggttgccaaa aacaaaatta accatctcag tccttgtcgt 5341 taacgcagga gaagcattac tggaggaggc tctggggttc tgtggttgag gagctcagtt 5401 ctggttccgg ggagccctta tctgccaccc acgggtccaa ggcacagtcg gaggcagcag 5461 ggaggggagc ggaattcaca tcaacacaga tggggctcaa ggggactttg ctgcctctgc 5521 ctggagggtc taaagtttca ttttcatatg acccgcaggg cgcagactgg cggaaaatta 5581 gcagagccct gggcatgggc tgcacctggc cttaagggac aatgatggaa atattcctta 5641 ttagcacaat actgagcaca ggctgtgtga taatgtgtca agggaactgc agacatcctt 5701 tcagaaaaag ttcataaaac ggagaaagtt tggttcccaa cctagatttt taacctgttg 5761 aactctgtct aaatgggtca tctcgggatg tcctccactc aacatgacca cagtctgccc 5821 ctctgtccca cctgtctcct cagtccttcc tccccacctt tcaggatgaa atgaaaccct 5881 cagtccagct gcacccctgc cccacccacc tcatctcatg tgccctcccg cccctctcag 5941 gccggacagc cttgcttctg gaacacacga gcacagcttc accaggcact ttctgagcac 6001 cctgcaggcg cctcccagga gtggtcagtg gtcaatcagc taatgaagct gcataggaca 6061 tgacccttgt ttaccgcaga atgcccagag ctggcaggat gtcttatatg caggaagtac 6121 ccaaaatgta tttattgagg aagtgatgat ggataagagg aagacggaga gcgagggaga 6181 gaggggctag gggccctgcg gtgtaaaggg ggtgtggctg ggagtgtgca ggggaacagg 6241 gatcatttca aggttcctat ctgggagaaa ataaaaaggt ttacagttag ttgagataag 6301 cgtgggaata tgcgaacatt tttaaagaat aaaaagttta gctttaaatt tgttgattcc 6361 aaatgtgttc atactctcgg gaggatccat caagcaactc ttgggaggag agacagggca 6421 gggcaggcct tgacagctca gaagggcgca gtagggacag ttcttggttt tcccagctct 6481 gatgctttgc acagtcgctt gtgtgacctg caagatttta gtgaagaaac ttgctgtgga 6541 gtcggaaagc tgcaagttga ggtgtgtgtg gtgtgagggt taaaaatctg tgagaacaga 6601 atgaatggct tttcaagaat gttgtcgata gataggaaag aggtgggagg tgttcttgga 6661 gtggccatat gtggttttat gtagcatggg gaagactcag cagaaaggaa aaagaaagaa 6721 ggtaaattga cagcatgaag tagagcaccc aggagaggct acatgtgatg 
aagaaaccac 6781 agtgcagact gtgaggaccc cagaaaggct cctccccaaa acctgaccag tggccggtgc 6841 tggcagctcc caggctggga caccctctgt ctctctgtcc ctctgccccc tctgtcactt 6901 ctttatacac ctgtaaatcc tgccctgctc tccaaggccc tctgtagccc atttctcccc 6961 aaaatgggta tttagaataa ccttctgctg gcccctctgc cttaggaatc atttttaata 7021 atggacctac ctggaaggac atccggcggt tttccctgac caccctccgg aactatggga 7081 tggggaaaca gggcaatgag agccggatcc agagggaggc ccacttcctg ctggaagcac 7141 tcaggaagac ccaaggtgcg tatctgctgc ctagcagggc ccagtcctct tgcagaccag 7201 cggtgtgggg agccctggct gggactccta gactgcatct gaaccacagg gacctacgga 7261 caaggagagg gtctcgtgag tccccagata ctgcatttta caactctagg ttccagctac 7321 acagttcagg gagcaagggt ggccattaaa cacgtgactt gtatcctaaa tactgttgaa 7381 aagcaaagga aactcaaaca ggttcagaca ttcactatct ttcgtaaact ggcagttttc 7441 agggcacctt ctcacaggcc ttggtgaacc tcagtgggtg actgagcagg tggaggagtc 7501 tcctcacccc catcttctgg ttgccctgac tgcctgtttt gtaggccagc ctttcgaccc 7561 caccttcctc atcggctgcg cgccctgcaa cgtcatagcc gacatcctct tccgcaagca 7621 ttttgactac aatgatgaga agtttctaag gctgatgtat ttgtttaatg agaacttcca 7681 cctactcagc actccctggc tccaggtgaa gccactttcc tctttcatca gtcatcaact 7741 gtagagttta cgttagaaaa agaaggaaaa tttgggttat atgtgataga caggactgca 7801 aaagccaaac aacatagctt cgaggggtgt ttgattagac agcccaaata ttcctcccag 7861 agacatctct ggggccccac gcaccccctt tcctaacgtc aggatgtgta tcgacctgtg 7921 tgtgcacatt tgccatgcag agtttgcact gctgaggaga atggtgccca agaaggacac 7981 tgttgaccca aaatattcca aataaacaat gattacagcc acaaattcag gtttggagaa 8041 agttgttggt ccaacacaca caattatgtt gcatccagaa aaaagtagta aaatattttt 8101 ttccctctct agctttacaa taattttccc agctttctac actacttgcc tggaagccac 8161 agaaaagtca taaaaaatgt ggctgaagta aaagagtatg tgtctgaaag ggtgaaggag 8221 caccatcaat ctctggaccc caactgtccc cgggacctca ccgactgcct gctcgtggaa 8281 atggagaagg taggctcggc ctcccatgat gtgggctctc cggggtgggc agagaatgca 8341 caatttcaga tttacagagt gagctgcact tgctggtgtc cagacctccc accgcagcat 8401 gctctgagtt tcatacacac actcttggct tcagcatgac cactggacgc aagtcagcct 
8461 gcctggctgc caagctggcc tggggtttgg ggcacatggg cgggacgctt agctctctcc 8521 aggccctgct gctcaaccct ttctagtctg cagactttga gaattgcatt ttgtctgagg 8581 agaagccctc agccttcctt gtgggcatgc actccccaac tgtgcgcacg tgcaggactt 8641 ccaggcctcc ccagcttcat ccacctgcag gtgctcagga tcctgatccc ctgccccctt 8701 cccaccttgg tgaaacttct tgtatccttg tcttgtcctt tcctatggct tgtggctcaa 8761 gaacaaatgt ggagcccaca ctgatttccc aggactgtct gagcatcttc tccaccagtt 8821 tggcccctcg tggcagcaga cactagccct gtagcaggag gggttagcag gagccgttta 8881 gctcctgcct gagctatgac caaggtcagg gggatctcac ctctcccagg atggccctca 8941 tgctgtggag ggagacagag ccctggcctg ccctcagcag atttctggga gcctcagttt 9001 ccctggctgt gagtggagat gactctgtct gtcacagctc caagtcacag ttccactggg 9061 agagcctctt ggacactgtc tcctgtgtcc ctgtggagct gggaggtggc tggttctgtg 9121 ctgaaaggag acaagcagcc ccttctctcc ggtctgtctc cggtatcaca ggaaaagcac 9181 agtgcagagc gcttgtacac aatggacggt atcaccgtga ctgtggccga cctgttcttt 9241 gcggggacag agaccaccag cacaactctg agatatgggc tcctgattct catgaaatac 9301 cctgagatcg aaggtaggca agtgactgaa gggacaccgt gcgtgcggct gcatctccct 9361 ggatggccag ccttgcacat tttaggctgc agctttctgt ctgaagctgc ttgttaaccc 9421 tcatggtgat gtggtgagat ggctggatgc actgctgtga ggggaggtgt tatggtctgt 9481 gctgaacact ggtactcttg cacactggtt ggtccatacc ccactaagac acccctggtt 9541 gcagaaaaga acatcccaac accagagtgg agagaggtgg cagggtctgc attctgctcc 9601 ataaataacc tctttatgac agagaagata atgtcccagt tccccccaag taagacctgg 9661 tcttctaggc agagcaggtg gggaggttgg agctggaggg gagggtcctt gctggggcgt 9721 cttcctcaaa tgcggacgtg aggagggaag tccaggaaga agcagctaca gctccccctg 9781 gacccttgtc gttccttcca cagggctcct cccagcggca cctggggcag ctgggactct 9841 gtgcctggag gaggtgtgaa aggtctgggt ctaggtgggc agagggtcat gccctgagaa 9901 acacccatct gggccaagta gaggtgatgt gagggcaccg catgcaaaca ggccagtcag 9961 ggttgggtcc aagtaaaggg gaggaaaggg agctgcagcc tggctggaga gtgccggggg 10021 gcccagagcc cctgcctctc gctgggctgg aaacagggct gggcagcctc tgcccgaggc 10081 agttcacagc ctgagtggtg tgtgccgccc tcctcctgaa gctgctgcta atggtcactt 10141 
gtggtcttaa ggctcgtcag ttcctgaaag caggtattat aggctatgaa gttatttccc 10201 ccaagaaagt cgacatgtga tggatccagg gtcagaccct ggcttttctt gttctttcct 10261 tcttcttctt ctttttattt atttattttt tttttgaggg gacagggtct cactctgttg 10321 cccaggctgg agtgcggtga tgcaatcatg gctcattgta gcttctacct attgggctca 10381 agcgatcctc ccacctcagc ctcccaagta actgggccac aggtgcacac caccacaccc 10441 agctgattaa aaatttaaaa aaattatttt ggctgggcac agtggctcat acctgtaatc 10501 ctggcacttt gggaggctga ggcaggcgat cacgaggtca ggagttcgag accttcctgg 10561 ccaacatgat gaaaccctgt ctctcctaaa atacaaaaaa gtagccgggt gtggtggcac 10621 gcgcctatag tcacagctac tcaggaggct gaggcaggag aatcgcttca acctcagagg 10681 cacagggtgc agtgatccga gattgcaccc cactgcactc tagcctgaca acagagcaag 10741 aatcagtcta aaaaaaaaat tgtagagaca agttgttact atgttttgta ggctggtctt 10801 gaactcctgg gctcaagtca tcctcctgcc ttggcctccc aaagtgctgg ggttacaggt 10861 gtggccaccg tgccccatcc ctggcctttg ctttttcaat cacatggaaa tgtgaagggt 10921 gaaggagcca aaagtttagg gaaggaatca ttgtatggat ctgcagtgat tataagagaa 10981 ctttcgacta ctctgcacta ggggaaccat ggaatcaaaa aatgttttaa attattattt 11041 atgaggaggt tccaatatag acaaaaggaa aataaatatg attgacatgt atatatccat 11101 tgccaaattg aacgtttatt aacattttgc gatacttcca tcagagctct taaaaagaaa 11161 atgtgttaca gagccagcca aagtctacct cctcacatct ccccacctct ctcaccagaa 11221 atggcttcag aattgctgtg tggctttgca cttttaacag ttgttaatta tcagcacagt 11281 attcatatta ttgctgtatg tgtttaatat tttacctggg tactgtacat aacattttgc 11341 agcttggttt tttcactcaa catatgatga tgttccatgg gaactccaaa cacggggagg 11401 ctaggcgact tgctcaaggc agctgttacc tctgtcagaa agacagaggc tttcagattc 11461 aagaagtaga ccctgcatgt ctgattctgt tctgtaaacc cccttcatac tcagaagcat 11521 gcaataaaca agcctggggt aattatcaat gcaaaggtta ccctcccaga agaaatttcc 11581 aaaacacttt cattattctc tgctcttgac atgaagagaa ctgaataagc catcatcaac 11641 tgagataatg gatgccaaaa catccagtaa ataacctcat agagcttagc tctcactaag 11701 tttttggagc attttccagt aattcaaagg acctggggaa ccttaagcac tgcttaggat 11761 gctccataaa catcttctgc gtgggtaggg gagtggatgg atggctggat 
gggtgggtgg 11821 atggacggac ggatggatgg atggatggat ggatggatgg ttggatggat gggtgggtgg 11881 atggatggat gggtcaatgg atgtgtggat ggatggaagg gtgggtggat gggtggatgg 11941 ctggctggtt gggtgggtgg gtggatggat gcatgggtgg atggatggag gatggatgga 12001 tggatggagg ggtgtataga tggaggggtg gatggatgtg taggtgggca gatggataaa 12061 agcgtgattg aatagatggg tggatgatgg gtggatgccc aactggccag gaaccaatcc 12121 ctgaaatttg tcccattcat atcttggcag agaagctcca tgaagaaatt gacagggtga 12181 ttgggccaag ccgaatccct gccatcaagg ataggcaaga gatgccctac atggatgctg 12241 tggtgcatga gattcagcgg ttcatcaccc tcgtgccctc caacctgccc catgaagcaa 12301 cccgagacac cattttcaga ggatacctca tccccaaggt taagcaatga gcctgcagca 12361 cacagcatga acaccatcct atcactaatc gccttcctgc cagggagcag gatgggggcc 12421 ccaagaccct tccctttggc aggggtcact gaggggaagg gctggcccca ctcccaccct 12481 gtgggatact gcatctccag gagtgctcac attggcctgg tgaccagaga ggtggaggaa 12541 atctggaaaa gagcctcagc agatagtgcc tgggactgta gtgaattcta atgccaggaa 12601 caaactatca caaccagccc tggggttaat cctgtgagaa gattagggct ttcatcttca 12661 tttagacctg acccctgact gctttctatc taatccttca ctaagcaact ccttcaactc 12721 gaaatatact atcctatata gcataatatt caaaacaaca ttcttcactg ggggtttcca 12781 gatgaaagcc cacattttgt taacatgact cactgagaca gtctttgttt ctcctagggc 12841 acagtcgtag tgccaactct ggactctgtt ttgtatgaca accaagaatt tcctgatcca 12901 gaaaagttta agccagaaca cttcctgaat gaaaatggaa agttcaagta cagtgactat 12961 ttcaagccat tttccacagg tgagaaagat cagaggcagt accttccctt gaggagcagc 13021 ccacactcct catctcccct ccacatgtgc tctgccctcg tcccaggcac ccactgacac 13081 cccaaacctc actgtgtgcc ctgtttctat tgacaacatg acccaaatgt gctcttccct 13141 gttcagagaa gttacataac atcttttagc agcaatcctg ggaatgaagt gttgtaggtg 13201 gatttttttt ttcccaaaga ctagacattt tacatcattc attgctaaat tttgtttcta 13261 ttttaacaag acttagtgaa aagctctcaa agccatatta cccaattctc cctaatttta 13321 aaccagagct actaaacaaa acctaacctt tggttaccta gaatcatcac aggaagcatc 13381 aaagccttcc tgggatgtga ctcagtgatt ttctttgagg cacttgtcct ccttcccagg 13441 gcctcatctt agggattgtt gtgggaagat 
catacaacca actccatact tttcacaccc 13501 agtgctggag ccccagcttc taacagggca ctatttccct cctgtaggca tcactgatga 13561 gcactggggg tgccttcttt actgggcaga catggtcttc ccaacttaac accggttttt 13621 gcagttgagc tctggataat tgagattgta tgaaggctgg tccccgaatt agtcagtgtc 13681 gctggtatcc ttccactcaa gtacattttg tgcttctttt aataggcaga gaggggtgag 13741 tcctgccctg tgatggccgt ttgcccacag cctcctcctc cccgcttccc ctagtctcac 13801 tgttaacagt gtcgtgtctc tgaaactccc tcagtgtctc atcaatacca ttgttacttc 13861 taggaaaacg agtgtgtgct ggagaaggcc tggctcgcat ggagttgttt cttttgttgt 13921 gtgccatttt gcagcatttt aatttgaagc ctctcgttga cccaaaggat atcgacctca 13981 gccctataca tattgggttt ggctgtatcc caccacgtta caaactctgt gtcattcccc 14041 gctcatgagt gtgtggagga caccctgaac cccccgcttt caaacaagat ttcgaattgt 14101 ttgaggtcag gatttctcaa actgattcct ttctttgcat atgagtattt gaaaataaat 14161 attttcccag aatataaata aatcatcaca tgattatttt aactatatgt taagtcatgg 14221 aatatcttaa ttgtttaagt gattctcaca gagaggtttt tttttttttt tttttttttt 14281 tgagagtttt gctcttgttg accaggatgg agtgcagtgg catgatcttg gctcactgca 14341 acctctgtgt cctgggttca agtgattctc ctccctcagc ctcccgaata gctgggatta 14401 caggcaccca ccaccatgcc agctaattct ttgtattttt agcagagaca gggtttcacc 14461 atgttggtca ggctggtctt gaacccctga cctcaggtga tccacctacc tcggcctccc 14521 aaagtgctgg gattacagca tgagccaccg cgcccagcca gagagaggtt ttaaatatat 14581 atgtttactt taatattaag ttataacata attttcatgt tattgaaaag ctcttccatc 14641 taggatcaca ccacttcagt gtcagaatca tattgaggtg gggaatttgt attagtcagg 14701 tttctctaaa gggacagaaa caataggata gatgtatata cgaaagggag tttattagga 14761 gaattgactc acatga // LOCUS HUMDEF5A 2880 bp DNA PRI 25-JAN-1993 DEFINITION H.sapiens defensin 5 gene, complete cds. ACCESSION M97925 NID g181532 KEYWORDS defensin 5. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2880) AUTHORS Jones,D.E. and Bevins,C.L. 
TITLE Paneth cells of the human small intestine express an antimicrobial peptide gene JOURNAL J. Biol. Chem. 267, 23216-23225 (1992) MEDLINE 93054653 FEATURES Location/Qualifiers source 1..2880 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 1267..1271 TATA_signal 1328..1334 mRNA join(1359..1570,2550..2786) /product="defensin 5" exon 1359..1570 CDS join(1399..1570,2550..2662) /codon_start=1 /product="defensin 5" /db_xref="PID:g181533" /translation="MRTIAILAAILLVALQAQAESLQERADEATTQKQSGEDNQDLAI SFAGNGLSALRTSGSQARATCYCRTGRCATRESLSGVCEISGRLYRLCCR" exon 2550..2786 polyA_signal 2770..2775 BASE COUNT 879 a 595 c 547 g 859 t ORIGIN 1 caaatataga gactctccaa gggcccactg agccccaaag gatttggatc aaatatggtg 61 atattatgga aatatgtagt aatatcttaa aaatgtgtaa gatatagtct cttttttttt 121 ttttttaaga gaaggggtct cactatgttt ttaggctggt atcgaactcc tggtctccag 181 tgatcctccc acctcagcct gtcaaatagc tagaaatata ggcatgtacc accatgctgg 241 cttaagatgc attctttgac acagcaattc tatttctata agtttatcca tataggtaag 301 agaacatata tacaagataa tcactgtaac tttacttatt actgcaaaag tttaaaaata 361 accaaattgt aataatttta taatatttta tcagtacaaa aaataagtga tggcatatac 421 aaaccctggg atagtataag gctattaaaa ttataatagc attccatgta ttttgatata 481 caaagtgcca atgttacagg tgaaaaaagc gaagtgcaga atactatgtg taactgttaa 541 tagtgatggt ttgctgggtc agaactgaag gcctgggggt agaaatgaga gctcatgact 601 tctacctttt gaatgttgtt ccttgtgcat gatttacaat tttctaaaac taaaaaaaaa 661 atctcagaaa ggggctgtac gcacctaaat tactttgata ttccccaaag tggagagaag 721 tacccgctac acattttatg tgatgcattc agatcacacc aactccttga actaaatccg 781 aatttttatt ttaatctgat aaacttggcc tactatttta ctgaactcat ttcccctata 841 gcctgataag gtcattgacc tctccatact ggcaccagcg ggagactact cacctcgaga 901 tctcaaaagc ctcctacatg aggttagtaa tatccctgaa tcctgcaatg aattaactct 961 ctactccact gggtcccagg tctgccccca gagagtcatc cagagagtac cagggaccat 1021 cttcagaaaa caagaggcat ttgatcccca aacttcttga atgaaagcgc tgttgttttt 1081 cttttttgaa tatataaaag taaatactca agcagatggg aaacagaaca ggatagtaat 1141 acccttatca tcattaacac 
cttggatcaa gaagaggcat taagcataca gactcacgct 1201 ttgatgaaag ctgggagaaa gaggagcatc aaagggatct tgagaacaaa ggcagtcctt 1261 cccctcccaa tcacatgccc acctcctctc actgcagctt ctgtctcagg tcttctccca 1321 gcagagctat aaatccaggc tgactcctca ctccccacat atccactcct gctctccctc 1381 ctgcaggtga ccccagccat gaggaccatc gccatccttg ctgccattct cctggtggcc 1441 ctgcaggccc aggctgagtc actccaggaa agagctgatg aggctacaac ccagaagcag 1501 tctggggaag acaaccagga ccttgctatc tcctttgcag gaaatggact ctctgctctt 1561 agaacctcag gtaggagaca tcaatcttgc acatctgcaa aatctagaaa aaaaggattg 1621 gagaaaggat ctggagtcaa gtgtggaaag gtctacctca cttgagtgac tttacttaat 1681 cttcctggac cttgattttc tcatctataa attaatcagt gagaaccaaa taaatctaaa 1741 agattttctt ttttctaaga ctttcagctc caagatattt ctgtgaaatt tgctactttt 1801 aagatagaaa gagctacact gactagttct ttgtagatct aaatgggcag acttagttat 1861 atagagagtg ttttactttg tccattggaa aagcttttag aacctagaga ggaacctata 1921 ggtgtgtttt gatgtaggct aataggcttg attaaatctt tctacaatac atccttagat 1981 caaaacatca tattgtgtct catacatata cacaattatt gtttgtcaat taaaacaagt 2041 aaatatgtaa aatgttaaaa aaaaaaaaaa aaaaaaaagg agagacagag aatgaagaat 2101 ttgaatttgg aaagtcttca aagactcctt gagcaccaaa gtatttggtc catgacatta 2161 gcatgcacaa tgcggcattt cagaaactga ttcaggtgct ttagggagcc ttgttaggac 2221 ctggaaatca cacatggagg tcaagattag gcgtgtggat gaagcagaat gaagagtagg 2281 taaccctgag gttgagaggt atattgttgg accagggagc aggtaataaa tacatcctgg 2341 atagactcac atggggaaaa aaactatgat cttgcatgac taacacatag ctagtaagat 2401 ttcttgtcac ttacgacaaa gacatgaatt ttctccatcc taacatgact gatacagtgt 2461 ctcttattta gactatctca gttagtctgg ctgtgcttgt cctttttccc acctccctcg 2521 ctgtgcctga ccctctcttc tttccacagg ttctcaggca agagccacct gctattgccg 2581 aaccggccgt tgtgctaccc gtgagtccct ctccggggtg tgtgaaatca gtggccgcct 2641 ctacagactc tgctgtcgct gagcttccta gatagaaacc aaagcagtgc aagattcagt 2701 tcaaggtcct gaaaaaagaa aaacatttta ctctgtgtac cttgtgtctt tctaaatttc 2761 tctctccaaa ataaagttca agcattaaac ttagtgtgtt tgaccttttt aattttcttt 2821 tctttttcct tttttttctt ttgctttgtt 
atatggtggt ttgtatggtt cctttgtatt // LOCUS HUMDKERB 8815 bp DNA PRI 15-SEP-1990 DEFINITION Human cytokeratin 8 (CK8) gene, complete cds. ACCESSION M34482 NID g181572 KEYWORDS cytokeratin 8. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8815) AUTHORS Krauss,S. and Franke,W.W. TITLE Organization and sequence of the human gene encoding cytokeratin 8 JOURNAL Gene 86, 241-249 (1990) MEDLINE 90215304 FEATURES Location/Qualifiers source 1..8815 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 1007..1011 exon <1113..1436 /note="cytokeratin 8" /number=1 CDS join(1113..1436,3972..4180,4809..4869,5344..5439, 5958..6248,7113..7333,7492..7550,8380..8567) /note="cytokeratin 8" /codon_start=1 /db_xref="PID:g181573" /translation="MSIRVTQKSYKVSTSGPRAFSSRSYTSGPGSRISSSSFSRVGSS NFRGGLGGGYGGASGMGGITAVTVNQSLLSPLVLEVDPNIQAVRTQEKEQIKTLNNKF ASFIDKVRFLEQQNKMLETKWSLLQQQKTARSNMDNMFESYINNLRRQLETLGQEKLK LEAELGNMQGLVEDFKNKYEDEINKRTEMENEFVLIKKDVDEAYMNKVELESRLEGLT DEINFLRQLYEEEIRELQSQISDTSVVLSMDNSRSLDMDSIIAEVKAQYEDIANRSRA EAESMYQIKYEELQSLAGKHGDDLRRTKTEISEMNRNISRLQAEIEGLKGQRASLEAA IADAEQRGELAIKDANAKLSELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYR KLLEGEESRLESGMQNMSIHTKTTGGYAGGLSSAYGGSQAGLSYSLGSSFGSGAGSSS FSRTSSSRAVVVKKIETRDGKLVSESSDVLPK" intron 1437..3971 /note="CK8 intron A" exon 3972..4180 /number=2 intron 4181..4808 /note="CK8 intron B" exon 4809..4869 /number=3 intron 4870..5343 /note="CK8 intron C" exon 5344..5439 /number=4 intron 5440..5957 /note="CK8 intron D" exon 5958..6248 /number=5 intron 6249..7112 /note="CK8 intron E" exon 7113..7333 /number=6 intron 7334..7491 /note="CK8 intron F" exon 7492..7550 /number=7 intron 7551..8379 /note="CK8 intron G" exon 8380..>8567 /note="cytokeratin 8" /number=8 polyA_signal 8779..8784 BASE COUNT 1868 a 2324 c 2481 g 2142 t ORIGIN 1 tcaacggatc tcgctctttt ttttctttgg agatggaatc tcgctctgtc gcccaggctg 61 gagtgcagtg
gcaagtctca gctcactgca actctgcctc ccgggttcaa gtgattctcc 121 tgcctcagcc tcctgagtag ctgggattac accatggcca gctaattttt gtatttttag 181 tagagatggg gtttcaccat gttggtcagg cttgtcttga actcctgacc tcgtgatccg 241 cctacctcag cctcccaaag tgctgggatt acaggcgtgc acagcgtgcc ctggccttgg 301 atctcttttt atcttgcacc ttcagatgta gagggacgac agccactgtg tgtgtatgtg 361 tatgtgtgtg tgtgtgtgtg tgtgtgcgcg tgtgatgttt attcactcat ttatttattc 421 attcattcat tccacaaata tctacccaga ccctcttggc actgcaccag gtcgtagggg 481 tagaacagta acctggaaag atgaggcaaa tggttgattt cagattcaag gctttggact 541 ccagctgttc tgtcatccag ctcaggcagg ccctcataat cgcttcaatc agggagaaca 601 caggagagtt tctctggggt gtcggcagct cagaggagac ccaaatacta ggagacccct 661 tttcccatgc ttcccagtcc tccagtttat ttcccccagg aaggagggag acaagaccca 721 gagtcagggt tgtagtggct gggcggccca ggcaagtctg cttgttacac gacttgtgcc 781 aggacaggat ttcttccagt ttcatattca ctgaactgcc ttttcctggg tttctggggg 841 tggtgctgga gtgggctcca gggttggaac gggcccttgc gacgcgtctc tgctgccccc 901 acctgagtct gccccgaggt ggcaggtgac gggttcacgc gacgcctctg gcctagccac 961 tcaggtacga ggcctttccc ccactccccg gggctgggat ctcttttata aaaggccatt 1021 cctgagagct ctcctcacca agaagcagct tctccgctcc ttctaggatc tccgcctggt 1081 tcggcccgcc tgcctccact cctgcctcta ccatgtccat cagggtgacc cagaagtcct 1141 acaaggtgtc cacctctggc ccccgggcct tcagcagccg ctcctacacg agtgggcccg 1201 gttcccgcat cagctcctcg agcttctccc gagtgggcag cagcaacttt cgcggtggcc 1261 tgggcggcgg ctatggtggg gccagcggca tgggaggcat caccgcagtt acggtcaacc 1321 agagcctgct gagccccctt gtcctggagg tggaccccaa catccaggcc gtgcgcaccc 1381 aggagaagga gcagatcaag accctcaaca acaagtttgc ctccttcata gacaaggtga 1441 gggtcccctg cgtggctgac tgtgccccgc agcccctttc tcctggtagt cccggtccct 1501 atgcacatct ccagccccca gctggcgtcc tgctgggcct cacccgccct gggcacactc 1561 tcccttccat cctccgacct cacccctccc gtgcaccttg gtttgggctg ggtgagggtg 1621 gggagagggt ctggacagcc gggatgaatc ctggggcttc cttcttccct tttaaactgg 1681 agggtcttgg aagagagaga caacttaagg gtacagccta gttcccacca cccctctcta 1741 caaatcccgt tcttcctcag gtcattctgt 
cccaaattat aaaaaataat agcggttatt 1801 gttctcaccc caacccagtt ctgaccgtct tttaacgtat gcctgcggca gtcccagctg 1861 ttcgggacta ccctcctcca ggttcgcctc ttcgccagca ctacccaagg ctccccagtg 1921 gtgcctttgt gatttttttt ctttcttttt tttacatagg ggtttggtgt gattctagca 1981 ttctaggaga aggaagtggg tgtctcggtt caaacgggca aatattgatt gaggcctttg 2041 gccgccggag gcctgagtgc gggggtcaca gaatgagtca tacggcccct ggcccggcag 2101 cgtgggcggg gccgagggcg gggtgagggc tgcgggcagc agtctgcggg acgctctcct 2161 ccactggcgg agctcggcgt cgggggcggt gtgggtgggg tggggtgggg tggggtgggc 2221 tggggtgggg tggaggaggc gagggcctgg cctcggaaag cccatgcagg attcaaagtc 2281 tcctgggacg ccgcccgggg tttacgtcct gttaagttta tggcttcaga taacgcggtc 2341 gcccaccaac gcccctcgcc cattcagccc gtgtcccttt ctcggcgtcc tgtccctgct 2401 gcccccagcc tcggctccac tttccacaca gcaggagcca gggccgggtt ttgcagcctg 2461 ggactccgct gcctgagccc cggcccccgg cggccccgag gattgggccc ttcacgctga 2521 ctggctcctg ggaggcattg tgggaacggg aggagggaaa tcctggggca gagtaagccg 2581 ggaggaaccg gagccccagg aacccagtgg tcgggggccc tcgctgtcca agcgcctgga 2641 cttgacttgt tgactgcgtt ttgctagccc tggggtcctt atagagagca gctaagcata 2701 ggctttggaa tctgaattct tggtctgcac tcgtctgccg gttcctggtt atggactccc 2761 ttgccaagtc ttatttcctc atctataaaa tgaatatgag agcccctaaa tccatatagc 2821 aaaagttttt gccttattca aacttacata tgtaaagagt tcagcagtgc ttggcccaca 2881 ttccattagg ataagatgtt ataatcactt ttttttaaaa aataattttg gggcagaatg 2941 actggggaag aaagcgattt gcagagagtg gtggagggaa ctaggctgta cccttaaaag 3001 atttctgtcc cctccagttt agaaggagtt acaagttttt ttgtttgttt gagacagagt 3061 tactctgtgc ccaggctgga gtgcagtggt gtgatctcag ctcactgcaa cgctccgctt 3121 cctgggttca agcgattctc ctgcctcagc caccgagtag ctgggactac aagtgcgtgc 3181 acagcccggt taattttgta attattgtag gcaaggttca atatgttggc aggctggtct 3241 cgaactctga cttcagaaat ccgcctgcct tgaccaccca aagtgctgga attacagcgt 3301 gagcctccac gcccggcctc tttttcaatc ttaacatctt tagaaaggtt ggctattttt 3361 ggccgggcgc gggcttacgc ctataatccc agcactttgg gaggccaagg cgggccaatc 3421 acaaggtcag gagttcgaga ccatcctgcc taagacggtg 
aaaccctgtc tctactaaaa 3481 atacaaaaaa attagtgggg cgtggtggca cgcacggctg cctgtagccc cagccactcg 3541 ggaggctgag gcaggggcag gagaatggca tgaacttggg aggcggagct tgcagtgagc 3601 tgagatcttg cactgcactc tagcctgggc cggagactcc caaagaaagc ttggctattt 3661 ttattgatgt gtaatataca acctatgtaa atgaagttag gcctattggt ttgcaaatgc 3721 agctttaaca taattacctt acctgtctcc ttcccctacc caatgctgag ggacattgct 3781 ccccacctca ccatcatgcc atgctttctc cccctggtca taggtgatct ttccagaaca 3841 gctaaccagg tgcctggggt ctggagactt actgcttgag gagtgaatta agagaaaaga 3901 ctgcttgctt tcctccagac tttgagccct ggcctgatgt agaccttttt gctctctcct 3961 ccttcgtata ggtacggttc ctggagcagc agaacaagat gctggagacc aagtggagcc 4021 tcctgcagca gcagaagacg gctcgaagca acatggacaa catgttcgag agctacatca 4081 acaaccttag gcggcagctg gagactctgg gccaggagaa gctgaagctg gaggcggagc 4141 ttggcaacat gcaggggctg gtggaggact tcaagaacaa gtgagcaact ccaccctcca 4201 cccaactgaa gtcacctgct ctcctccacc ccttgacctt gggactaagt ccatggccct 4261 ctgttgtggg aagtgcagtc ctatctaatt agggtgacca cctgatgagg tttctcggac 4321 agtctgtgtt tatgccaggt tctagcacat tgttgatagt acccacccct ttcaatctaa 4381 ctgtctggat ttgaagaaca aattatgtgt caatgttgac atggtaaacc tgagacggga 4441 gagataggca gcctgtgggc ctcacttttg tacttaacat tctggcccct ctttagtctt 4501 gacccttgac ctctagcaaa ctctagaaag ttctgtctga ggtctcatgt caggccctgc 4561 tgttaacact ctcaaggtgt ccaatccgat gtgtattcat ggatttggag agagatttcc 4621 tgcttcccac gggctaaggg aggggtgagg gtggagaggg cagctgggga aggcagaagg 4681 accagccttc tcatatcctc atctctgtga actgaatttc ctgatttcac aacgcccctg 4741 tctcccaaaa gaccaagggc aacctccctt ttgccttcat cctctaattg taagtctttt 4801 cctcacaggt atgaggatga gatcaataag cgtacagaga tggagaacga atttgtcctc 4861 atcaagaagg tgagggagtc tcccttctcc tatctggaca ctggaggctg gggctcagag 4921 actcagacca agaagctttc tgggttttgt ccctaaatat tcctaagtag tgggacaaac 4981 tcatttatgt aaacatttgg gtgcacagaa aggtagacaa ggatggagtg gtaggtgcat 5041 ttggacagaa ctcttgacat cggtgttggg acatggttca gaaaacagag cagtagaact 5101 ggagatctgg ctctagaagg ctccctagag aaggaggtgg aagagggtgt 
gttgcaggaa 5161 gcagaggtga aggtgtgtgg gctgagaatg cacatgtgat gggcagaggc tgggctggaa 5221 gatcaatcca caaagtggca actagaaagt cctgtgacca ggccattggg tggaccttgg 5281 gagccccttg gttggggttg ggtgtggaaa cccagctcag gctcccctct cctcatcccc 5341 caggatgtgg atgaagctta catgaacaag gtagagctgg agtctcgcct ggaagggctg 5401 accgacgaga tcaacttcct caggcagcta tatgaagagg tatgttcctg gtcgcaggag 5461 agtgagggtc cccagccttg tcagcgcctc caccctgaga ctcaaccaga ggctcctccc 5521 agcccccagc acactaataa gacaaaggac cccactgctg actaattaca gccaccaata 5581 tttgctcggc tagtatttat tgggtctata tgttctgtcc ctcgcatgag gtgagtcatt 5641 accccatttc acagacgaga aagtgggctc agagaagtga aataacgtat ccaaggtcat 5701 catagggtgt ggtgattcag cagcaactct gtccccaaag cccttgttcc taatctttga 5761 gctgcattgg atccctctgt gcacctagta ttggtgaccc agttcctttt tcaggaactt 5821 tgcccctctc cctgaccctg actcccacct gctcctctcc tctgctgccc ctgtcttata 5881 cctaagaaag gctgttgtgg aaaagggggc tcctgtgtgc agagacaggg cctcaccact 5941 tgccctcttc cccacaggag atccgggagc tgcagtccca gatctcggac acatctgtgg 6001 tgctgtccat ggacaacagc cgctccctgg acatggacag catcattgct gaggtcaagg 6061 cacagtacga ggatattgcc aaccgcagcc gggctgaggc tgagagcatg taccagatca 6121 agtatgagga gctgcagagc ctggctggga agcacgggga tgacctgcgg cgcacaaaga 6181 ctgagatctc tgagatgaac cggaacatca gccggctcca ggctgagatt gagggcctca 6241 aaggccaggt atgggccggg ttgggggtgg gagggttcct tggacacaat cctggtgaga 6301 ggagataatg taggaagagt gaagtttctg ggagtcgggg aaggaatcct agaccagggt 6361 tcaggagttg gaggggcagc cacagttcag cttctcagtc tgcttctgag aagcaaaggg 6421 atgcagggaa ggtcccttgg gccaggacag aggtgaaagg ggactggggc aggtatgttg 6481 gggactcgtg atacatgctc caagcctgct ttaatcagtc atatgcatca ggggtaaggt 6541 tgagctctgc tgctttaagg aaagtctaga acccagggat ctagtccagt tagggtaggg 6601 ggaccttaca gtgtcgcagg tcgagaaggg tgtggagggg aagcacctgg aaactgctca 6661 tgtctccctg atctgcttcc ttagtctcgt ttatttattt atttattttt gagacagagt 6721 cttgctctgt cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc 6781 ctcctgggtt cacactattc tcctgactca gcctcctgag tagctgggac tacaggcgcc 
6841 cgcaccaggc tggctaattt tttttgtatt tttgctagag acggggtttc actgtgttag 6901 ccaggactcg tcgatctcct gaccttgtga tctgcccgcc tcgcctccca aagtgctggg 6961 attacaggca tgagcactgt gcccggccct tagtctcatt aattgagctg gggagtcagc 7021 ctagtgtgtg gaggacctga gggagggtgg acgcacggag gaagagaagg catacccaac 7081 ctgacctact tacctgtccc ctacccacag agagggcttc cctggaggcc gccattgcag 7141 atgccgagca gcgtggagag ctggccatta aggatgccaa cgccaagttg tccgagctgg 7201 aggccgccct gcagcgggcc aagcaggaca tggcgcggca gctgcgtgag taccaggagc 7261 tgatgaacgt caagctggcc ctggacatcg agatcgccac ctacaggaag ctgctggagg 7321 gcgaggagag ccggtgggtg tgggtacctc tgaccggacc tgcttcccta tccctgggac 7381 ctggggtggg gacggtggga gccccctgaa gccccttgga cttggggtcc tgttgttctg 7441 ggccaagaag ggctaggagt tggtcctgac accccatttg acagggtaca ggctggagtc 7501 tgggatgcag aacatgagta ttcatacgaa gaccaccggc ggctatgcag gtggtgtccc 7561 agggccctgg atgagggcgg gaggcagggc cagggaggct cagctccagg gagggggctg 7621 tgctcagtcg ctcacagtga cctcagcctg agcactcatg ttcttgggag aatcctaggg 7681 tggggaggca catattcagg gaactccagt aataacttta ttacttagta acttcatatt 7741 agaagataca ccaataacca tagctgtgtg ccaggcactt gcgtaagtat cctacaggtt 7801 ttatgtgatt tattttattt attaatttaa tttaattttt ttgagacgaa gtctcgctgt 7861 caccaagctg agtgcagtgc tgatctcagc tcactgtaac ctcacctcct gggttcaaga 7921 gattctcctc cgtcaggcct cccaagtagc tgggactaca ggcgcatacc accatgccca 7981 tgctaatttt tgtattttta gtagagacgg ggtttcactg tgttgggcag gctggtctcg 8041 aactcctgac cttgtgatca gtgctgggat tacaggcatg agacactggg cctggctgta 8101 atttattttt tatatgacac ctgtaaacgt cttcagttga ggaaggctga ggtgcagcta 8161 aatgtccaag ctgacacagg ctatatatat ggcagctgtt ttccaccctg ctcctggttt 8221 tccctgacag ttctggagta gtgaaccatg caatcactga tcaggagagc tgggttaacc 8281 tccatccctg gggctatgtt gggaatgagc agggagaagg gcatggagcc tgccatggtg 8341 ggcttctgta ctcatgtggc tacctctgtc cctcaccagg tggtctgagc tcggcctatg 8401 ggggctcaca agccggcctc agctacagcc tgggctccag ctttggctct ggcgcgggct 8461 ccagctcctt cagccgcacc agctcctcca gggccgtggt tgtgaagaag atcgagacac 8521 
gtgatgggaa gctggtgtct gagtcctctg acgtcctgcc caagtgaaca gctgcggcag 8581 cccctcccag cctacccctc ctgcgctgcc ccagagcctg ggaaggaggc cgctatgcag 8641 ggtagcactg ggaacaggag acccacctga ggctcagccc tagccctcag cccacctggg 8701 gagtttacta cctggggacc ccccttgccc atgcctccag ctacaaaaca attcaattgc 8761 tttttttttt tggtccaaaa taaaacctca gctagctcgc cgaatgtcct tgctt // LOCUS HUMDNL1L 11967 bp DNA PRI 21-MAY-1996 DEFINITION Homo sapiens muscle-specific DNase I-like (DNL1L) gene, exons 1-9, complete cds. ACCESSION L40817 NID g886275 KEYWORDS DNase I; homologue. SOURCE Homo sapiens (clone: 14C6) (clone library: Q1Z cosmid library) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11967) AUTHORS Parrish,J.E., Ciccodicola,A., Wehhert,M., Cox,G.F., Chen,E. and Nelson,D.L. TITLE A muscle-specific DNase I-like gene in human Xq28 JOURNAL Hum. Mol. Genet. 4 (9), 1557-1564 (1995) MEDLINE 96081217 FEATURES Location/Qualifiers source 1..11967 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="14C6" /clone_lib="Q1Z cosmid library" /map="Xq28" exon 925..1085 /gene="DNL1L" /number=1 intron 1086..3777 /gene="DNL1L" exon 3778..3862 /gene="DNL1L" /number=2 intron 3863..7316 /gene="DNL1L" exon 7317..7538 /gene="DNL1L" /number=3 gene join(7404..7538,7889..7977,8058..8144,9349..9449, 9590..9702,9781..10029,10130..10264) /gene="DNL1L" CDS join(7404..7538,7889..7977,8058..8144,9349..9449, 9590..9702,9781..10029,10130..10264) /gene="DNL1L" /note="lysosomal-like" /codon_start=1 /product="DNase I" /db_xref="PID:g886276" /translation="MHYPTALLFLILANGAQAFRICAFNAQRLTLAKVAREQVMDTLV RILARCDIMVLQEVVDSSGSAIPLLLRELNRFDGSGPYSTLSSPQLGRSTYMETYVYF YRSHKTQVLSSYVYNDEDDVFAREPFVAQFSLPSNVLPSLVLVPLHTTPKAVEKELNA LYDVFLEVSQHWQSKDVILLGDFNADCASLTKKRLDKLELRTEPGFHWVIADGEDTTV RASTHCTYARVVLHGERCRSLLHTAAAFDFPTSFQLTEEEALNISDHYPVEVELKLSQ AHSVQPLSLTVLLLLSLLSPQLCPAA" intron 7539..7888 /gene="DNL1L" exon 7889..7977 
/gene="DNL1L" /number=4 intron 7978..8057 /gene="DNL1L" exon 8058..8144 /gene="DNL1L" /number=5 intron 8145..9348 /gene="DNL1L" exon 9349..9449 /gene="DNL1L" /number=6 intron 9450..9589 /gene="DNL1L" exon 9590..9702 /gene="DNL1L" /number=7 intron 9703..9780 /gene="DNL1L" exon 9781..10029 /gene="DNL1L" /number=8 intron 10030..10129 /gene="DNL1L" exon 10130..11212 /gene="DNL1L" /number=9 misc_feature <10336..>10532 /note="insertion/deletion polymorphism" /evidence=experimental BASE COUNT 2897 a 3194 c 3345 g 2531 t ORIGIN 1 cttggacaat gcccattctt ctatacttct ccctgtactg tcactgcaac cagcccagtc 61 taggctagtg gatggctcca agtcctcctt cctggcctcc caggatccac ctgtgacatc 121 tataggcctt tctccaccca gcagccaggg tgagctctga agggctccag tgtggctagg 181 tggctcccca tgaactctag gacagttaag ctctctgttc aggagtggaa gccccaggca 241 cccctccacc cggccatctg gttaccagag ccaagcaact tcctgtgaag tcagcagagc 301 aggaacctct aggtgtggca accaggctcc caaatagggc tgaacggtac acgcccagcc 361 aggggcaaac ctgaagccag gacactgggc ttccagtccc tggcccagaa cttcctcaca 421 ggatatccta tgtccaaccc ccaaattttt cccaagactg tggggccctg gatggtttag 481 agaacagagg gttctggagg cctgacgtta aaggagctgt aagatatgcc ctgtggccgt 541 tccttggtcc tgctccatgc ccaaggccct ctccaaatca tgctccctca gggatgctgc 601 aggagccaca aatccaagtc tcagccctgg ggcagaacgc ccggaaaccg gagtgccagg 661 gctccatgaa aaggcagcga gaaggggttt gttctgacga ggctgggccc gtcccgaatc 721 ctcgcctttc ctccccctgc ccagcacact ggcccgggta ccccagagat gagggtcgtc 781 catgcaggac tggtgattgg acacggtgat gaggggcgtg gccgggcctc gcttctcgat 841 gagctcgtac agcacctccc tgttgtgcac ggtcaggtgg ttcatgtact ctggcgggga 901 aggagaggtc aggcgctccg ggctcgcccg ctaggtcggg gccgcggcgt cccccaccct 961 aagtcccacc tccggccggg catgggtacc cgggcgggcc tgcctcggcc tgggcccact 1021 cactggtcca gaagcagctg taggtgccca ccaagcccat gacgacgctg ctggccaggg 1081 tccaggtgag cggcggcacc gcggggaacg gccactttca cgtgcagagg ccaccccacc 1141 cgaccgccgg ccgggctccc agcgccccgg gccgacctgt ggaacgcctc taggctgcga 1201 cctctaccgc ctcgacacac tggcccctga ctggcccgcg cacggacgga cgggggcgcg 
1261 gcgcaacagg gtactggacc gaggtccctt cgcaggtcag gccggagcag cgcccggccc 1321 cgctctctcg tcactgggga gcggtctccg aagtcacagg ccgtgggcgc gctgcggaac 1381 gggaggaaac gggaaagcgg gcgctggcac cccggcccgg tgcaaccgcc ggaaagcgaa 1441 cctgcgccgg tgcccagccc gaccgactgt cgaaagcgcg cacgggccgc ttgacggcag 1501 cgaggccagc cccggggcgg cgggaagtgg ccgtcacccc gtggggctcc cgcttcctgg 1561 cagcctcctg cgctcgcctt ttgtctctct agtgtttctg ttttttgttt gacacagggt 1621 ccagccagac ccaggctgga gtgcagtgat gggatcatgg ctcactgcag cctcgatctc 1681 ccagactcaa gcgattctcc tgcctcagcc tcccgagtag ctgggaccac aggcgcccgc 1741 catcacactc tgatccaccc gcctcggcct cccaaagtgt tgggatgaca gccgtgagcc 1801 accccgccca gccacatata caatttgaca tagttacagt actacatgat actatgtgta 1861 tgtagcaatg tataatctac tgtatggcaa aatatacaat atatacagca atatatacaa 1921 acatattgtg tatcatatgt aatacaagaa acagtatttt ataccatctg atacagtcac 1981 aatgtcagct actacctcta gttctgaaac atattcatga cccggaaaga aagcccccta 2041 tgcattaagc agccagtttt ttatttttat tttatattta tttatttatt attatttttt 2101 taggagtctc gctctgtcgc cagctggagt gcagtggcac aatctcggct cactgcaacc 2161 tccgcctccc gggttcaagc tattctcctg cctcagcctc ctgagtagct gggactacag 2221 gcgcaggtcg ccacgcccgg ctaatttttt tttttttttg tattttagta gagatggggt 2281 ttcaccatgt tgcccaggtt ggtctcgaac tcctgagctc aggcagtcca cccacctcag 2341 cctcccaaag tgctaggatt acaggcatgg gccaccgcac ccggcagttt tttattttta 2401 tttttattta tttttatttt tgagacagag tcttgctctg ttgcccaggc tggagtgcag 2461 tgacgagatc tcagctcatt gcaacctctg cctcccaggt tcaagcgatt cttctgtctc 2521 agcctcctga gtagctggga ttacaggcgc aagccaccac acctggctaa tttttgtatt 2581 tttagtagag gctgggtttc ttcatgttgg tcaagctggt ctcgaactcc cggcctcagg 2641 tgatctgtct gcctctcctc ccaaagtgtg agtcatccta cctggcctgt tttttatttt 2701 taaaccataa tttattgatt tttttttttg agatggagtc tcgcactgtc gcccgggctg 2761 gagtgcgagt gcagtggtgt gatctcggct cactgcaagc tccgccttcc aggctcaagc 2821 aattctcctg cctcagcctc ctgagtagct gggattacag gcgcccacca ccacgccggg 2881 ctaatttttt gtatttttag tagaaacggg tttcattatg ttggccaggc tggtcttgaa 2941 
cgcctgacct cgtatccacc cgcctcagcc tcccaaagtg ctgggattac aggcatgagc 3001 caccacgccc agcccataat ttattgattt tttaaaattt gtccagcctt ctattaccac 3061 gtcgaatcca ttagctacag ccatcccatg agaagctgag tggattcagc cccacctcct 3121 gctcacagac cctgtccgag cacctcattt gtcccaacag cattactgca ggacccccag 3181 gacgttggac tgccagctcc ctgggtctcc tcctctctgg ggcagatcct cagtcctccc 3241 ttgacttcac gactgtggcc agatcatgtg tggactgtcc ctctctttgg gtctccagag 3301 cgcttgcatc aaacacccct aactcagaag tgtgcagcca cactgggact cagaacccaa 3361 caacagggac agaagactca cgcccttggg gtgcccggtc tcgtggcatc aggcatgact 3421 tccagctcct gcgccttccc cagcaactgc tgactgggga cccagaccgg gagctgagcg 3481 acgggcctgg cgagcgaagc tcggggtctc actcaggcac cagcccctcc ttgccccagg 3541 cttgagtgac tcacaggtct gtcatgtccc gctggcccgg gaaggcgggg tctgggtggg 3601 aaggtgggcg ggcaggtcag cagttcccag gcctggtgct ggtcctcctt gttggcggcg 3661 aggcggagac gtggcgggcc tgcctgttgg caagggtggg gcgtccggcc cagccaggcc 3721 ctttctggcc gtgtggctca ggaacacagc ctcttcctct tttttttccc ctcctagccc 3781 tattcaggca ggagctgctc ttctggggta tcgcgatcca cttaaggatg aggcagactt 3841 ggtgacaagc tggtctgagc aggtatggga gccccctggg gagacggaag aagggaggaa 3901 gttgccttct gcctggggag ggtttgagag ggagagggaa gcctagggct cccaccaagg 3961 ctgatattga cagccagggt ttggggctga agccaggaac cgtcgctctc tctggttctt 4021 actggtagcc ctcatggggg tccctgacgc cagagcctcc aaggctgcat gtgccagccc 4081 agggctgccc acatacccat gtatatccca gaataggcac cagggtaggg aacccaaact 4141 agcatgagtg acagagcagg tggtcaggga gaaacagaca tcaaacccag ccaggagaga 4201 gacaccgcaa cagagagaca gagaagggaa accagagacg agagggaaag tgagacagag 4261 acggacagag cctcagagag gtaggaagtg agcccgaagg gaaggtgaca ggttgataga 4321 gaaaagcagg cagacagagc agaggaactc atgcctcagg atgcaaaagc cacatggaca 4381 gggcgtggag caaggagccg agccctgcaa ggtcaggtga gtgtgagaca aagataaact 4441 gagagagggg ggctccagct ggtgagatga acaagaaggg ctccccagtg gtggcagggg 4501 catgggaaag agtccccatc acaggttgca atggggatga tttggccata tctgagacct 4561 tttaaaatgc acacaccctt cactgccggg agtttatctt aaagagaaca tctcacaagt 4621 ggggagacag 
agatgaacga agatgttcct gtagttgcta atggaggcga aaagcactgg 4681 ggtcttggaa caacatggtg aaaccccatc tctccaaaca aattttaaaa attagcttgg 4741 tagccgggcg cggtggctca cacctgtaat tccagcactt tgggaggctg aggtgggagg 4801 attacttgag gtcaggagtt tgagaccagc ctggccaaca tggtgaaacc ctgtctctac 4861 taaaagtaca aaaattagcc gggtgtggtg gcacgtgcct gtaatcctag ctacttggga 4921 ggctgaggca ggataatctc ttgagcctgg gagacggagg ttgcagtgaa ctgagatcaa 4981 gccactgcac tccagcctgg gagacagagt gagactccat ctcaaaaaaa gaaaaaaaaa 5041 attagcttgg tgtggcagca tgcaccttgt agtcccatct actcggaagg ctgaggtgag 5101 agcatcactt gagcctggaa ggtggaggtt gcagtgagct gagatcacat tactgccctc 5161 cagcctgagc aacagaatga gactatcaaa aaaaaaaaaa aaaaaaaggg ggggggccgg 5221 gcgcggtggc tcacgcctgt agccccaact actcaggagg ctgaggcagg agaatcactt 5281 gaacccggaa ggcagaggtt gcagtgagcc aagatcacgc cactgcactc cagcctggtg 5341 acagagcaag actccatctc aaaaaaaaaa aaaaaaaaaa aaaaaaaggt agactaggcc 5401 aggcgcagtg ggtcaggcct ataatatccc agcactggct gggcgcagtg gctcacacct 5461 gtaatcccag cactttggga ggccaaggcc ggctgatcac ctgaagtcag gagttcgaga 5521 ccatcctggc caagatggtg aaaccctgtc tctactaaaa aatataaaaa ttagctgggc 5581 gtggtggtgg gtgcctgtaa tcccagctac tcgggaggct gaggcaggga gttgcttgaa 5641 cccaggaggc tgaggttgca gtaagccgag atcacgccac tgctctccag tctgggcgac 5701 agagtgagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaatccca 5761 gcactttggg aggctgaggt aggaagatca cttgagtcca gcagtttgag accagcctgg 5821 gaaacataga attcaggagg ctgcagtgag ctgtgatggc accattgcac tccagtctgg 5881 gcgacagagc aagactttgt cccaaataaa taaacaataa aagaataagc ttgaccgctt 5941 gtgcactgtt ggcgagaacg taaaatggtg cagtcaccat ggagaccaat atgacacttc 6001 ctcagaaaat taaaactaga actaccatat gatccagcaa ttccacttct gggtatatac 6061 ccaaaagaac tgaaagcagg ggctgggcgc tctggctcac gcctgtaatc ccagcacttt 6121 gagaggccga ggcgggcaga tcacgaggtc aagagatgca tgaggctggg gcaccagggc 6181 tcattcctgc aatcccagca ctttagcagg ctgaggcagg agaattgctt gagcccagga 6241 gttcaagacc agcctgggca acatagtgag accttgtctc tatttaaaaa aaaaaaaaaa 6301 aaaaaaaaag gacacggtgg 
ctcacacctg taatcccagc actttgggag gccaaggtgg 6361 gcagttcacc tgaggtcagg agttcgagac cagcctgacc aacatgaaga aaccccatct 6421 ctactaaaaa tacaaaatta gctgggcgtg gtggagcgtg cctgtagtcc cagctatttg 6481 ggaggctgag gcaggagaat cgcttgaacc tgggaggcag aggttgcagt aagccgagat 6541 cgcaccattg cactccagcc cgggcagcaa gagcgaaact ccgtctcaaa aaaaaaaaaa 6601 aaaaaccaga tggatggatg aatgaatgaa tgaacaaaat gtggtctacc cagatgccca 6661 gacaatggaa tatgattcag ccatgaaaag gaaagaaatt ctacagccgg gtgcggtggc 6721 tcacgcctgt aatcccagca ttttgggagg ccgaggcggg cagatcacga ggtcaggaga 6781 tcaagaccat tctggctaac atggtgaaac cccatctcta ctaaaaatac aaaaaattag 6841 ccgggcgtgt ggcaggcacc tgtagtccca gctactcggg aggctgaggc aggagaatgg 6901 cgtgaaccca ggaggcggag ttgcagtgag ccgcgtttgc gccactgcac tcagcctggg 6961 cgacagagtg agactccgtc tcaaaaaaaa aaaaaaaatt ctgacatatg ctccaacatg 7021 aatgaccctt gaggacattt tgctcagtga aatgagccag tcacaaaaag acaaatactg 7081 taggatttca cttttttttg agacggagtc ttattgtcgc ccaggctgga gtgcagtggt 7141 gcgatcttgg cttgatcttc acagagcaag actctgtctc aaaaacaatt tttttttttg 7201 gagacagagt ctcactctgt cacccaggct ggagtgcagt ggtatgatct cgggaggtca 7261 ctgggaggct ggtgctgggc agcagctagt gccacttcct ctgtcccata tttcagcgcc 7321 tccagagcca gaactgagcc cagtgagagc gcaccctggg gcagcctgga ttcctggggt 7381 gtccccggca gccacacaca gccatgcact acccaactgc actcctcttc ctcatcctgg 7441 ccaatggggc ccaggccttt cgcatctgcg ccttcaatgc ccagcggctg acactggcca 7501 aggtggccag ggagcaggtg atggacacct tagttcgggt aagttcatca cttcagggaa 7561 gccgtgcccc taaccaccat gagccaaagc ccccatccct gcctgaccag gggtatgttc 7621 tagagaagag gggagaagaa gaggtgtggc acggagatgc agcgcaggcg cttgcctgct 7681 gctggcctca gtctcctcct ctgtgaaaca ggtgacatgt gggtgggatg agtcttttcc 7741 tggggcattt ccagtcccac tgctttctgt gggctgcagt cagtgagcct taacactctg 7801 ggccacaaca gtgccagtcc cctgggccca ctgcctcctc tcagggcagt tggcactgag 7861 tcaggctggc cttcctgatg tcccacagat actggctcgc tgtgacatca tggtgctgca 7921 ggaggtggtg gactcttccg gcagcgccat cccgctcctg cttcgagaac tcaatcggtg 7981 aggaagattc tgtgggtcta atcaggggca 
cacccagagg cttgaggccc tgaccccgcc 8041 tccccttttc ttggcagatt tgatggctct gggccctaca gcaccctgag cagcccccag 8101 ctggggcgca gcacctacat ggagacgtat gtgtacttct atcggtgagc acggagctgg 8161 cagcgcagca gggtgggggg cggtccttgg ccctaatgtg aggtaggctc attagtcccc 8221 tcggccacag gccccccatc ctgggacagc ctcacctgct ttgcatgctt tctcttctca 8281 acacctttta ttctgggctc ttcccatcct cgacccccgc cagagtgtcc atagagcagg 8341 aggccttcct tctataagca caaatgtggg gtccctcctc ctctctcctg tgggagctgc 8401 tggggctcag ggtgtggctg gccccctcta tgaaagtggc cagcaagggc tgcagctcct 8461 gggaagaggc tccgtgggga ccacttatgg gccttgggta gacccgatct tctaagtccc 8521 tgggaggaca ggctggcagg gagccctttg gagtcaggat ccagagggca agggaggctg 8581 cagggaagga gaactgattc tctatataaa taaaatgata tgacatctgg tatgcgctcc 8641 aaaattactg ggtaggcagg gtacagtggc tcatgcctgt aatcccaacg ctttgggagc 8701 tgaagcagca ggatcacttg aacccaagag ttcaaggctg cagtgagcta tgattgtgcc 8761 agtgcactgc agcccgagca acagagtgag accatgtctc aaaaaaaaaa aaaaaaaaaa 8821 aaagaatgac atcaaagtgt ccaaatgtca aaatgtacta tacccagtcc gtggaatatt 8881 attcagccag aaagaggaat gaagcacaga cgtgccacag catatggacg aacgcagaaa 8941 acattgatga gctgatttca ttcatatgaa atatccagaa taggcaaatg tacaagatag 9001 agaacagatt aatggttgca ggggctgggg ggaggggaga atggagagta actggtaatg 9061 gatgttatgt ttttattagg agtaatggat accaatgttt taaaattgat tgtagaaata 9121 gttgcacagc cttatgaatg tacaaaagaa ccgtgttgta tacttgaagc aggtgaattt 9181 tatggcatat gaattatgcc tgatttttga tagttctaga gacaggttgg gagggctgtg 9241 cccacattgc cgctgttagt ggagggcccc tccctgagcg ccgtttggca cttgcaagcc 9301 aaaatcaaag acgtggggcc actgctgccc atccctttgt cctcccaggt cacacaaaac 9361 acaggtcctg agttcctacg tgtacaacga tgaggatgac gtctttgccc gggagccatt 9421 tgtggcccag ttctctttgc ccagcaatgg taggaaggga agggtgggct tggtatgtgg 9481 gagcagaagg ggagttctgg ctggctccac tgggggcttc ctcacccaga cgggctaggg 9541 ggagaaggtc caggagtgga tggcagccct cggtccctgg gccttgcagt ccttcccagc 9601 ctggtgttgg tcccgctgca caccactcct aaggccgtag agaaggagct gaacgccctc 9661 tacgatgtgt ttctggaggt ctcccagcac tggcagagca 
aggtaggcct gggcagtttg 9721 tacaggaggg acaaggatag ggtggcccag cggccagcct gatggctgct gtgtctctag 9781 gacgtgatcc tgcttgggga cttcaatgct gactgcgctt cactgaccaa aaagcgcctg 9841 gacaagctgg agctgcggac tgagccaggc ttccactggg tgattgccga tggggaggac 9901 accacagtgc gggccagcac ccactgcacc tatgcccgcg tcgtgctgca cggggagcgc 9961 tgccggagtc tgctgcacac tgcggctgcc tttgacttcc ccacgagctt ccagctcacc 10021 gaggaggagg tgaagggatg gggctggcat gaggagggag ggcagcagca gggtcacagg 10081 gaagcccctc ggccccgggc actgacactg ccccaccacc cattcccagg ccctcaacat 10141 cagtgaccac taccccgtgg aggtggagct gaagctgagc caggcgcaca gcgtccagcc 10201 tctcagcctc actgttctgt tgctgctatc actcctgtcc cctcagctgt gccctgctgc 10261 ctgagcgtcc ccctaccccc ccagggcctg ctgccttttg ggacttaaac cccagcctcc 10321 cccgtccatc cagccctggg gctggggggc ttcaactata gttgccctgt gactgtagtc 10381 cacccctgcc tgccttgttt gatttggctc ttgttctttg gttgggcttg tgcctagatt 10441 aggagaggaa gccaggggcc ctgcactcat gccacctgcc aggtagtgta gtatcaggag 10501 tggagacaaa gtgggctctg ggttggggta ggggaaggga gggttcagaa agaggaatga 10561 agatgttgta tgacaagaag gaaagttact gagaacaaaa acccagattg gtgagatagg 10621 acacttgtgc agcagatatg ccaatgggcc atgtttattg tggattggta agaatcacca 10681 ggaaaccatt aagccccaat agctacaagg agggtggtta atctgctata tcaaactcct 10741 tccctgaaac cagcaaacac cgggaaacat tttggctcat tataatccgg tgaacaatgc 10801 agtcaggcct gttataaccg ctgagcagcc acactcgcac ctcctgggtg ctgtagtctg 10861 tgttggtaca ggcttctgca tgcctggtaa aggtccagcc aaggctggtc aaggcaacat 10921 ctccacacag aaaatctgca ccagttatgt aagctaaaaa gctgtgtgaa cccaggtgtc 10981 ccggaaaggg gctgcaggac acagcaaaat gccagcagca tgccggaccc ctcccttcca 11041 tcctcctctc caaagaagag aggtcaggaa aaacactggc tgggacgcta gaagggtcat 11101 gtgttaacta taatcacatt tatggtttgg aaccatcacc ccaaggtaaa aaaaaaataa 11161 aaggtattcc caggtatgtt tggcaaaata aaataaaggt aattaaaaac ctaagaaatg 11221 gctgtaaact gagattacaa aaaaccagag gccttttggg agccacaggc tgggggctgg 11281 ggccaacata ccccagctct gggaccaaaa gcctttggag tcctgtccct ggatacagat 11341 gctcaggcca cccacctagc actcctctca 
tttttgctgc tcccccaaaa gccagcgctg 11401 ttctacaata gttcaggtca gacatctaag atccccaggc caaaccatgc cacacacgtg 11461 tctgttccga ggtcttgtta caaatgacca aaggttagcc tagacctttg gtcaggtggt 11521 gaaggccctg ggaatgcagg ttctagaagg ggtgggggca tggggcccag gaggttgcta 11581 gagataggaa atggggcaaa tcgcaggtga cctcagcagt acctggatga ttcccaccaa 11641 ggaggaatgg ccaaaccaga tgggtgagaa caagcccaat gtgatgcaca cctacccttg 11701 ctaaataaac gcggtccgca tgtgttgtaa taagcaacac aactgtaggc tgaccccagc 11761 ctgggattgc aaggctttgc tgcagctgtg cttcatgaaa agcacttcag tgtttttcaa 11821 cctactgcct aggacctggg gaggtccaag tcaagctcca gagtagacaa ctgacataag 11881 gccaccagtc ccagcccttt gcagggtttg ctgcctcatg agcagggctg agttgtcaac 11941 ctgtttctga agtgaaaggg caatgac // LOCUS HUMDODDA 26764 bp DNA PRI 11-AUG-1995 DEFINITION Homo sapiens deoxycytidylate deaminase gene, complete cds. ACCESSION L39874 NID g886279 KEYWORDS deoxycytidylate deaminase. SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 26764) AUTHORS Weiner,K.X., Ciesla,J., Jaffe,A.B., Ketring,R., Maley,F. and Maley,G.F. TITLE Chromosomal location and structural organization of the human deoxycytidylate deaminase gene JOURNAL J. Biol. Chem. 
270 (32), 18727-18729 (1995) MEDLINE 95370171 FEATURES Location/Qualifiers source 1..26764 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI38" /cell_type="fibroblast" /tissue_type="lung" 5'UTR 1..1317 exon 1..1425 /number=1 /product="deoxycytidylate deaminase" CDS join(1318..1425,1828..1963,22267..22383,23741..23837, 25392..25470) /codon_start=1 /product="deoxycytidylate deaminase" /db_xref="PID:g886280" /translation="MSEVSCKKRDDYLEWPEYFMAVAFLSAQRSKDPNSQVGACIVNS ENKIVGIGYNGMPNGCSDDVLPWRRTAENKLDTKYPYVCHAELNAIMNKNSTDVKGCS MYVALFPCNECAKLIIQAGIKEVIFMSDKYHDSDEATAARLLFNMAGVTFRKFIPKCS KIVIDFDSINSRPSQKLQ" intron 1426..1827 /number=1 exon 1828..1963 /number=2 /product="deoxycytidylate deaminase" intron 1964..22266 /number=2 exon 22267..22383 /number=3 /product="deoxycytidylate deaminase" intron 22384..23740 /number=3 exon 23741..23837 /number=4 /product="deoxycytidylate deaminase" intron 23838..25391 /number=4 exon 25392..26764 /number=5 /product="deoxycytidylate deaminase" 3'UTR 25468..26764 BASE COUNT 7079 a 5521 c 6539 g 7625 t ORIGIN 1 gagctccagt tctctctgtt tctacagctg ggcctcagtc ctcacagggc tctagtgggc 61 gttcagcaca ccaggagaag gcgcccaggg gattaacagg acccttccct ctcctgctct 121 tccctcgccc cccatggctc cccctccctc ctatttagta ggtacttctc cagcacccca 181 cagcgatgat ggatgctggc cctaaagcca ggtgggaaac caggctgatt cagcacctgt 241 cttttcctgt taggggggtt gccaggcctc tcagatgtgg ggcagcgtaa tcagtgggtg 301 accgtggttc ccagcaccca ctctccactg catccttgtc ttaggcactg ccgccctgca 361 ctgtagcaga tgcctaaata aagtgtggag cctgaagcag aagttttggg agatgcttgc 421 cttcctttag accacattcc tcgcctttct caagatttct ccgtggtacc caccccgttt 481 ttgtgttgtc ttccttcctt gcccttgttt cattcctttt gaaagcgacc actcccctga 541 agtcctagga cccccttcgt gggctctccc tgccgtagac agtctctgct tgcccctggt 601 aaactacgtt ccaaaaacct tgctttagga gtagcacttg acgtttttca tagtttcttc 661 tcatgtaatt cattactcgg agttgataac ttctgtccca tctggattat tgaggagaaa 721 gggaagggaa aaattatctg gttaatatca gagttgggaa ggcttcagat gttttctttg 781 aatccgtctg gcatggatga 
taatcctgtg tggtttaggt taagttacct aagttctctg 841 agtctgtgta agatgggatg gcactgtcta ccttagctag gagtaaggct tgagtgacat 901 gatgtgtata aactttaaca atgtctggaa tctatagatg atcaataaat atttcttttt 961 tccaccctcc tcctgtgctg actccgcttt gtaagggaga caggtaaagg acacaacaat 1021 ttgtaggctg gcataaaaaa tcctatgaga gaaagcagta aatagatgaa atatttggga 1081 acgtgaagct ataggtttca aattcaaact tctgaaatag gaaatttctt aagtttttac 1141 ttaagaccag agactgttta gggcccatgt gctgtgattt ctgattgata aatgtagctg 1201 aagttgccca tgtttagaga tttaaaggtt cagtgtggtt gggttgaact ggaaaacaga 1261 cttactaaaa tgcttctctc aacttctgtt ttgtcatgtt tttcttctag acccaacatg 1321 agtgaagttt cctgcaagaa acgggacgac tatttggaat ggccagagta ttttatggct 1381 gtggccttct tatcagcaca gagaagcaaa gatccaaatt cccaggtaaa tgaatttcac 1441 gggagaatgc ttagatactc acaggcaaag agattactgc acatgagcag aaaggggaga 1501 ctcagtggac gcctgtcaac ttaaaagtgc agggtcggtc tgccttttgt gctgcgtttc 1561 tacaagctta gttcttcctg tgagacagaa aattgtcaca tgccattgtt aactgcctct 1621 tgtgcccctt ctgggactgg gggtgcactt tccttggctt gtgggtgctg ccacaaaaaa 1681 ccaggcaagg agaccgacag ctctcagcac ctgcagcagc tatgcgtctg ttttgagttc 1741 tgcagtcccc ctggttttat tccctgcttc cacagcggca gaccagccag ttgcaggcac 1801 caagcctttg ggcatgtttc cttctaggtc ggcgcctgca tcgtgaattc agaaaacaag 1861 attgtcggga ttgggtacaa tgggatgcca aatgggtgca gtgatgacgt gttgccttgg 1921 agaaggacag cagagaataa gctggacacc aaatacccgt acggtagggc aagttttatg 1981 tcccattctg cttagtgaca tagcctggtg agtgcctaat taagaaagtt ggtcagaagg 2041 catcatctcc cttttgtcat caacaactgg aaagaaaagt atcctactga actaccccaa 2101 tttcctggcc ttgtctctta agttccaaag agagccacgg agagaagcat ggtagtggcc 2161 acacaggctg gccctgagat ccagggctct gctgcctgtg ctgagcctgg caggcttcac 2221 tggcaacctg ctggcctgca ccacaccagt gcactgatag tcggcgtctc tgcagagggt 2281 cacagcccct tgcagagata gctgagggat gtgcatggtg tctctgtggg ttcctttttt 2341 tactacttgg aagcagggtc ggctaaccag gggctggctc gggctttggt ctgatgagaa 2401 gcaggcttca ttcagcccat gtgcctgatg agaacatctg gctcaagctc attgctggca 2461 agcaggaatc aaactgtgaa ctagacatca 
ggagatggag gatccagttg cttcctccag 2521 ttacagggga tgctgcaatt tcacagtagg ctgtggtcag accaaaagcc agaagtgagg 2581 ctgggcacga tggctcacgc ctgtaatccc aacactttgg gaggtggaga cgggcagatc 2641 acctgaggtc aggagttcga gaccagccta gccaacatgg tgaaacccca tctctactaa 2701 aaatacaaaa attagctgtg cgtggtggca ggtgcctgta atctcagcta ctcgggaggc 2761 tgaggaagga gaatcgcttg aactcaggag gtggaggttg tagtgagcca agactgcgcc 2821 actgcactcc agcctgggtg acagagcaag actctgtctc aaaaaacaaa acaaaaacaa 2881 acaaacaaaa aacaaaacaa aaacaaagaa aagaaaaagc tagaagtgta gaactgtaaa 2941 cctttggctt gactgtctct ggattttccc cttgaaatta gggaaggctg cttcttagag 3001 tttgttgcct gtctaagccc cagggaaaat tccttcagag attgcctggg agggtccagc 3061 catagcttta cttttccttg cattccttgg agaatatgca agcaatgctg gaaaagaggc 3121 aaatcatgct gggtatgcag gttgctgatt tttgtaaggt taggccctgt attttatgtc 3181 tgagaatgaa cagactggcg ctgttggaag tgaagggaga gtaaggccat ccaaatggga 3241 aagattgtgt cttcaggcca aggcgtgatg tacgaccttt cttctttgtt gactctcctg 3301 gtaggttttc ttctgggttt ttctcatgaa gaatccaagt caacatttag tctaatttat 3361 atatttttac taatgttcct ggacctcaat catgttaaat aagccccata aatttctttc 3421 taaattgcct aggggcaact tctccatgtt tgaaagaaaa agttaagagg tacctggaga 3481 gagaggagga actgaggact gcaaagcagt ctgggtttcc aggatgtgaa aagtagttta 3541 gggatgaagg ggaccgaatc ctcagatcct tgatttaacc tcaccctccc tcgtcggagt 3601 gggaagagag gatggagttg ggggagggta agactgtctt ctcagcacag cttcagagcc 3661 ggtggtagtg gatgactaat tagaaaaatg cgcggactca gaaactgtca ccctgatttt 3721 gattctataa gcttttgatg gtaagaaaga agggaagtga ttagttgttt ttaagtccat 3781 gcaaagataa tttttcacag tgcggttcaa agtcccctat aaaaatggtg aaagtggagg 3841 gtcggatatg ttaaaggtct gcattgaact gtgtgacggc cacagtcagt atgtcaccga 3901 gtctgggcag tgaaatattg gccctggggc tggactttga tatggaggaa ttctgggggg 3961 atggcggtcc gtcatactca agctggtgca aaagataccc gtggttggac agtgcggagg 4021 ggtcctgggt gagcactgag tgggcctcca atggggcctg tgccttcttc agtcacccag 4081 agcccccagc agatctttaa gcagtgacct ggaagccaag ggccatgcag tacatttgca 4141 gcacctgccc cacaccccag acttcccagc tctcctgact 
tcccgtgtag ctgattttgt 4201 tggttatgag acactgcctt ttgtccaaat tgaggagagt acctttcaga gcccctttct 4261 agaaggaatt tttttaaaaa cctgcatagg tatttgtcca acttcaccga ttgtttttta 4321 ttgaatagct cctgagctcc tgggcactgg ttgtctgtct ctgcattatt ttttattatc 4381 catatgatac ctctaggttt tcaaagctct ttgctcttac ttatcaatta ttcatttata 4441 tgtagctgtg ttagaatgtc gacagttttc tttcacgaga gtaagcttgt aatttgatgt 4501 accataaaag ttgtagaagg tattaaatgg tttctaaaag gcatttagta ttactgccgc 4561 cataatacag taaagatggg cttaaagtct aaatgacgat gactgtgtat aaactgtaca 4621 cttgttgtct tctcaaggtt tttcaggcca cagagaagta aatgtttgga ttgcctggga 4681 tgtcacagaa atgtttgctg ataattgact caaatgcaaa gtcttttttt ttttttttgc 4741 tttcgtttta agattttcaa aattaattta aaaaataagg ggatactgcc tggagttgga 4801 tagctcaacc cattctttcc tcatctacat taatggtgac tgagctgctg tagcaggaag 4861 ggtaaattca taggttaatc cctgtaaagg agctttagag attatctcga gttctggaca 4921 atgattttta actcttttga atgtacagga ccctctttta aatgctgaaa aacttccagg 4981 acccctgcat aatgctaaat tcttagtatc ttttgagtaa gaaactccac aatgacaaaa 5041 actacttcgt atcagtaaca ttttttgtta atctgaactt tattaattgg atagcattta 5101 tgtatgtttg ggaaaagttt atgtatgtcc tggacatggg atttatccag ttgtggatat 5161 gcttggaccc cttgcaatac cttagtagtg tctcagggtt ctatgcccac aggatctaag 5221 cctcctcctt tcttgtaagt aacttctata atgtcacaca tgcagggtga catctcctaa 5281 ttccaaggtc agtgctcttc ctgctacctc cctgtgtctc tcctgtgggg cacatgaatt 5341 cagctgtact gaaatgccaa tcattgtctc attttttttt gcagttgatc agactgaggc 5401 tccgataaaa taattactcc tcagagatca ggagctcggg gaggggtggc ctgtgtccgg 5461 catcatgcag aacagcatgc gtggctcact gtaacatttc ttcaaagggg agtgcagtgg 5521 actgaatgtg tctcccccca aattcacatg ttgaaaccct aattccaatg tgctgatatt 5581 tggaggtggg gcctggggga cgtaattagg ccatgagggt ggagctctcc tgaatgggat 5641 tagtgccctt ctaaaaggga ccccagggag ctcagttact ctttttccat gtgcaaggat 5701 aggcggagaa gatggctatc tgcaaccctg aaaggagacc ttgccagaat ctaaccatgc 5761 tggtgtcctg acttccagcc tccagaacca tgagaaataa atgtccattg tttctcagcc 5821 acccagtcta tgggactttg ttatagcagc cccaagggac tgtgacaggg 
agcttgagaa 5881 ctgtgtaaat ttgagtttac caaataacat acagcttaat actgctttat taattccgta 5941 agtcggaagt cattagtcct gattgatgcc ttgtggggag gagggaaaca cacactactt 6001 gtaacataat aggaagatgg agcttttcct gggctgtgtc tctagagagt tatttttaaa 6061 agggtgtttt tgttcttcca gtcttgggat caaaacttca taaaactatt ttattataat 6121 gtaggtatta atgtggaaaa tgaaaattcc ctctgctaaa gaagcttttt attccatcct 6181 tttcctgaaa aatccaaaag ataatatgat gtcagtaaca ttaagtggca ggaagaaggc 6241 aatgttatta tactaagaat cttttcagat gaatgacaat taaaaaagag gaagcagcag 6301 atttatctaa tttacaagat taaaaaactt gggaagattt ttattttcat gagggggtgg 6361 gttgctaaac ccgtgatgaa gtattcagtt gacatttttc attccttttt ctttgtggaa 6421 ttcattatag tgaagcacat ttaatcaaaa taccaatgtt atcaaatgag gtaaaatact 6481 cttggtgatt cagatatcac tctttccttg ggttttcgta accgattttg atgatatagt 6541 ttacaaatta gagagttgga gaattaactt gttcccgttg ttaaacttaa gggaattttt 6601 agtaatgctt gccacatgta tccagctgcc cagcacttgt ggccttgctg atttctgctg 6661 gcaaagggct ctgggttttg gtgagtgcct ctgtgcttgg cctcctatcc cacccaatgg 6721 accagggagc cagggttgct tggcctgctg catctcactg caccagggag ggaaccgtag 6781 catctcagtg tccccaagag tcgggggagg cctcactctg agtgcgttgg gtagaagtgg 6841 ggctcttacc cgagtcagtt cctatgtcta cagggatact ctaaaaagga tgtaatgctg 6901 ggaaggaaaa tgctgaaaaa cctggaatta attaaggcga aagagttcag cagagcatct 6961 taaacacctt tgccagtgtg gggcagtgcc tctttaaatt ctctatttaa atttcatctt 7021 tatttttcag acttagcata ttgttaagca tgcatagatg tcacttttgt gattacttag 7081 ccatttactt tacccccttg aacactgaca ttttcctgac tgccatgcac gtgatatatt 7141 tctttggcat ctgaaaggtt agtgttgaga aactaattta tcaagaagtt aggaggctga 7201 tccttcttaa gatatgactt cctcacaact tgtaagagct tcttagtaat taagatcaaa 7261 tttgttcttt ccgtcctgaa attgaatgtc ctctggtttc gttactcagt ccatgaattt 7321 gataacttgc tgtttctttt cctgcagaat ttaatttcac ttaaagcttc ctgccgctcc 7381 tcccctgcct ggccttctac tagttgtggc ccgacagcag gtctaattag ggacttttgt 7441 acaaaactac tgttataatc aatggaaaac ctttgaacaa ctgcatagaa aagaaattaa 7501 gtagattatt tttacgaatc aaggggcaaa ctcagctgtg atcgtgtggg gtttcagata 
7561 aacagaggga cacaccagtg tatatatgag tcttctccca cccactaaaa tcaaccagct 7621 ccatgtaagc atgttttact ttaagatata attgtttata aacagttttc agaaaggaat 7681 aaataatgat attttaatat aatttatttt gatatttatg tatttttata gttacagagt 7741 taattatttt taaagtgcta gagtttacta tttcattatg aatatctatg aaagacatat 7801 agtaacaaaa taactccttg ttttgaagag cacacatgtt aggaaaagct gaaatgtttg 7861 gggattggac ctacccatga aaatacacat atctgtctcc tttctataaa aagtgacttt 7921 tgagtcagtt cacctcgtag aagtttgaaa taaaacttaa tctaaagtag tggttccagg 7981 gttcccagaa gtgggctgaa gtttatcttt gtactgtaac tctcaagata aaaggtttgt 8041 tgaaaaatta gtagccttga taaaatgtca ggggcacatg caaactaatt tagaaaacag 8101 ctatgttaag tatttgagca cattaatgat ctgaatgcaa atggatattc cttctttatt 8161 tcttcttctt cttcttcttc tttttttttt tttttttttt ttgagacaga gtcttgctct 8221 gtcacccagg ctggagtgca gtggcacaat cttggctcac tgcagcctcc acctccaggt 8281 ttcaagcgat tctcctgctt cagcctcgga gtagctgtgt gattacaggc acgtgccacc 8341 acgcctggtt aatttttgta tttttagtag agatggggtt tcaccatgtt ggccaggctg 8401 gtctcaaact cctgacctca ggtgatccac ccgcctcagc ctcccaaagt cctgggatta 8461 caggcttgag tcactgtgcc cggccacttc tttatttctt taactttgct aagtctttgc 8521 ttagaaacac agggattgct gtacagtgtg gttacctgag aatgattact ctcatagtct 8581 tgattctggt gatctgttct ataatcagtt gtgctgattt tgactgatct tctctgaaat 8641 tggctgaaat gactaaagct ttattgtatt ttgagttttt tttaatccgg aatctccctt 8701 caggagtttt gttaatacca gctatattaa agtagaatat gaatatattg tttgctccca 8761 gaaatacaca ataaaaccta gttcttagag atttttgcaa atgtttgtat tgctttttac 8821 ttttacttcc tgaagacttt ttttttttaa atctggatag ggatatgcat gtctgtaata 8881 cacacattag ctaagtaaag ccccatagtg cttaaaacaa caattgccat attcacacga 8941 aagatcctgg agccgccgta aggagccaag ggttccaggt ttggtatgtt catgcccagt 9001 gactgtgggg ctgttagagg catcacctga catttatttc tcaatatctc tatgaagtaa 9061 attatcactg aagattcttc cagttctctt attccaagga aaaatataag tattatagct 9121 ttgatggcat tagattagta tctttttctc agcttacgga gttctgaaag cgttctcggt 9181 caaatcagag ctgtaaaaag ctgttggagg caatgccaca gggagaatgt gcatgtgtac 9241 
atatggataa attaatagaa acaggcgcct ttccctccct cctctaacga ttaaagactg 9301 gcgcagttct aagatgaata tagaccatat ggaggtacca tgcatgaata ggaggggaca 9361 gtcagactgt aggttacctc acagacatgc agattaaaag acattggatg agaaaaggtt 9421 ggataacagc aataaagaaa aagatcctag ctgagtatgg agtgcacgcc tgtaatccca 9481 gcacgttggg aagccgaggc aggtggatcg ttcgagttca agagttcaag accagcctgg 9541 gcaacatggc gaaatcccgt ctctacagaa aatacaaaaa ttagccgggt gttgtggcgt 9601 gcagctgtag ccccagctac tccagggagg ctgaggtggg aggattgctt gagctcggga 9661 ggcccaggca gcagtgagcc aagattgcac cactgaactc catcccagac gacagaatga 9721 gagaccctgt ctcaaaaaaa ttaagaaaaa gatcctatgt ctaagtgaaa gaaaatgtat 9781 gattacatgg gacacgctga aaactctttg aaagagacaa gaggttaaga ccaaatagaa 9841 agaaaggact ttagaaattc tttagagaac aagtcagcat gacaatattt attcaaacag 9901 agatacttgc ttgacattgg gaaatccctg tgataaatgc aggatactgg tgaaggcaag 9961 agagttgcaa tgcaagggaa tgggcgctcc ccttgctgat gctgtctgaa gggtcggtgc 10021 caaccaatta gtggaggtgt aaattttctc tgtgcttctt tcaaatgata gatgccatag 10081 taaatttaaa atattgatat gataactgag catgaatttt ttaattctgc aattaaaaga 10141 ctttgttcaa aatgccatta attttgagtt taggtgaata aagggaagca gaattcttct 10201 gtcatttttc cttctccatg tgagttatca ctttttagga attaaaataa gtactaacga 10261 gtcacctgtt ctcttaccag tttcagttgt ctgctggtga tggggatgat ctctggagtc 10321 atctttcttt ctttttaaaa tcctaactct attttataat gcgttggagt ttttcagaca 10381 ccatcacatg tggtcatttg ttcattcagc agacacttct agggcattta ttatgtgttt 10441 tacatgccgg gcactctgca gaggtcaaca gaatgccatg ggagggtctc tgtttttgaa 10501 gagtttgcag actaattgtg gaggaggtag acattaatga aacaaacaga tatgattgaa 10561 tattcctttc tgaggcagtc actttttttt tcttttttaa tgagataggg accagacccc 10621 gaatctgcca gccccttgat ctcggacgtc ccagcctcca gagctgtgag aaatacattt 10681 ctgatgttta gaagctacaa atttatgata tttgttagag cagcccaaac aaactaggac 10741 aggttgtgtg tatgaactgg attgttctgg ggcaagagtg gaaacagcga ctcgccagcc 10801 gggaggtgac gtggtgtacg atgcgagggc cttggagggg gtgggtgctg gggctggaga 10861 ggcctggctg gagatggggt gtattttagg ggcagcatca ggaagatggc gtgataaagg 
10921 gtggaagagg tagattgact ccccccaaat ggttctggtt gcagtagcta gacaggcatg 10981 ctgaggagga gaagcaagat tttggaatga tcaaggaaga tttagacacc tgaaggctga 11041 ggtgcccatg agcgctccct gcaaagggtt gggtggtcag gtgggtgtat gagttgggga 11101 ttagtcccga ctggctgtag cagtacaggt atgtcagtgc cagcgacctg ggtggctgat 11161 gaagggagca tgtgcagagg cagtaggagt agctgtggct gagccctgag ggctccagcg 11221 cttaaggtgg ttggagaaag aagggcccgt gagaaggagc acacagagca gtgttttaag 11281 aaagaaggcc tcattttggg aggccgaggc ggggggatca tgaggtcaag agatcgagac 11341 catcctggcc aacatggtga aaccccatct ctactaaaaa tacaaaaatt agctgggcgt 11401 gttggtgcat gcctgtagtc ccagctactt ggaggctgag gcaggagaat tgcttgaaac 11461 tgggaggcgg aggttgcagt gagccgagat cgcgccattg cactccagcc tggcaacaga 11521 gtgagctttg tctaaaaaaa aaaaaaaaaa aaaagaaggc gggcttgtca gctgtatcgg 11581 aggctgccaa gaggcctaag attaaaagtg ttcatggggt tcagcaacgt ggaggtcaga 11641 ggtgatattg ccaagtacag tttcagccaa gtacagtttc agcggagcaa tagggttgga 11701 agccacattg gagcctcttg aaggagaaca gagggtgaga caatggggac agctttattt 11761 agatcgggct ttgggatgtg tggctgagaa gagagcagag aagtggggtg gtggctggaa 11821 tggagataag gccacgggaa aggattgcat tgtttgtagg ataggagaat ctggttttgg 11881 ctttggtttt tgcttttctg tagcagacat ttgtttgttg atgggaatag tccaagacag 11941 agagagtgag atagagattt ctggaacaga ggcaaggtgc tcaaggctgt gaggatcggt 12001 ctctcagaca ccctggttga taataagagg aaggactgag aggggaggtg ggaggacagc 12061 agacacaggg tcctggagcg aaaggcagag tgtgatcagt tgcaccattc cggtggagga 12121 agcagggtca tcggttgaga gagctgggag ttggagcttg gatgccggga gaagagaaga 12181 gctgattgtg tggggtggga gaagaatgtc ctaggaatgt agggcagtgt caggagctca 12241 catcttatct gagaatctcg agttgggcgg ttgcttccca ttttctacct gcagacccag 12301 gatcagtgcg ctgcctgagg caggccaggg agagctggaa ttaattagaa cacacatccc 12361 agctcctctt ccagtgccca gtgctcaggc ccggttcccc tctaaacgct attaacacac 12421 aaaggaggct tttaagctcc atcacagtac tatcagcgct gattagcgct ctgtttctct 12481 tggtgctgtt gtcacgcatt ttaggcatca cgtgtgcatt tcacatgaac agtgggaagg 12541 gagttggcat ggcctaccgt ggatgcagta cagctatttc 
cacatcctcc tcctcctgac 12601 ctgattgtgt gcccttagcc cataggaggc agtgtaatta gagttctggg ggctttatta 12661 ccaataatgg ccttaccaca gtcagaaatc tctttagccc tgagcactac cattggggct 12721 aagtggctcc cctgggtgcc gtgagttgtc atccgcagct ttacgaagag gcattgagga 12781 agggggcagg gatggggagc agagtgggag agagggaaaa agaaatgggt attctcacag 12841 gcaggctgga ttgggcaggc caggacagcc tggtctcagg tgggcacacg tgggagagga 12901 tgggaagcac tcacatgtag caagtagctt gtctgtgctg tgctcttgac agatgttagt 12961 ctttctaacc gttcttggag agggactttt ctctctattt cttaggtccg gaggctgagg 13021 ctgaagaaga tgcttgactg gtaggttgtg gacagtgtgt gaggacatgg tagggaggaa 13081 gcagagaaga caggcagaac caggggtctc caagacccct gagtgcattg ttgcaggagc 13141 ggagacggaa gttcttgcct cagctttcag ggccctgggc tgtgggcagg cggatgggca 13201 ccctgggtgt ggaggcttcc ctcagagcct gcaggatggg ctgcctttta cgtgagtgac 13261 ccaccacttt ctccccagac cttggtagaa ttcctttgcg ctgtcatttt cttctagcag 13321 agcctcctct ctgggcacag ggtggaaaca gaacagagcc cacacatttt ccacgcatgc 13381 ggcgggtttc tccacatcac ccccgtttct cgtctttatt ttcacacaca cacttacact 13441 gtcatgccta tcaggtggaa tcatgtgtcc ctccatggcg ggagggctca gttactttat 13501 cttctgagta gaaaggccac actctgcctg gggctgggct gggggcagga gtgtctggat 13561 gccaaggaca tggtaggctg tcgggtgcag cgtgatcggg gaggcagaag cgctgtcact 13621 tgttgtctct gatcaggaag acctggttcc tgccagccag tgtctttctt aggaaacagg 13681 tcctgttctg agtactggtg acaaagtccc catttgcaga gttctttgga gttcacacag 13741 cattttccta ctgtgtccct cacttgctgt tcgtgtgacc ctgtgagatg ggaaggggcg 13801 ggtgtcgtta ttcctattcg tagctgagag aggctggaac ttgaacaggg tattaccttt 13861 gtgtaggcac atcctaggat ttaaatccct ttgtcctgaa aaaacatttg taatccacat 13921 acatttttct acagaaacaa tgttacttga agattcagtt ccccttgcac ctgcgagcag 13981 tgcctagaac cctcgttttc tggattttct cagcactcct tcagcagtcc tactgtgttg 14041 ctttagcttg cactcgggcc gctgctgatt tccatgctgg ggcatcgcag cccagctggt 14101 gtgagctctg cagggaccag acccacctcc atcccgcagc ccacacaccc agccctgtta 14161 gctctccacc cagtgcaggc cagtttgcaa ccagaactta ccagaacata cctgggaaac 14221 tgcgaaagtt ttaaactgtt 
ttattttctt gctccagtcc tgttactaag ggctgctccc 14281 ctgggtgggg gacagggctc tgtctgagca ggcttagagg gacataggct ttggagcagt 14341 ccagcaggct ggcaggagga agacagggcc ttagccagga gaggggcggg gagcaagagg 14401 ggcttgtctg gtggaactat cttagagaag tgatacctgt tgatttctat atggtgctcc 14461 cttccccagt tccaccttta tttcaaatca tttgaaaacc ttgataatga gccagattgc 14521 tgctttacta ataggaactt catgcaggag agataagata gagaggtgac attatttggg 14581 accaaattct agaaatcttt gctttagatt ttggggaagc cataattggg aacacttcat 14641 aagcttttgg gagagtgtac ttggtggggg ttgtgggggg cttaactgtc agcggctatt 14701 ccgttgttaa attattgacg tgcaggatag ggactgcgga ggctggcagt gacatatgag 14761 gccctgtcag gatatctttc aataattaag aaacaagtag ttgaaattcc attttagttc 14821 cagaagctgt gaagcagttc cataaatgga tcagttagag gagtaagtgt gtaccccatc 14881 cctcagcgga agtttccatc aggcatctgt gggagtgggg gccggggcag gggtgcaggt 14941 gctcccagag acaacatgtc caggtttagc aactctgata actataggac cccagaagta 15001 aaagcactca ggccctaact tcaggaagaa atgacaaaaa gctttgtcat cagaatggaa 15061 gtagaaaatc ctgtaaatta ctgagttttt tttttttttt ttaatgcaca ggtcaacctt 15121 gaacttgggt atgccatttc atgtcacttg ttaattgcag ggtgggaacc cagtagaaat 15181 aatagagggt gatgcttccc accttttcat tccagaaggc catgagcaga cgcttcttag 15241 ctcgagtcct gacccttgtc ttgtcagtga acagaactgt ccgtgctctg aacggctcag 15301 cctggggtgg cggcagaagg cgttgcaggt ctgttctcag tggcagcagg cactccatgc 15361 ctggccttct gaccccggtc ctagcgggac gcaccacacc aggctgccac ctgttcaggt 15421 cagctttggc agaacaaccc tcaggtctaa ggacatcgtt ttctttaagg aaaatcctgt 15481 gtcactaaaa atgagggatt tttaagcaac ctggagagtt cagaactaga actagagttc 15541 agaactagat cagaaccttc attttcctga ctccaaattt tctcaggccc cagtagttcc 15601 tgtcacatct ccatagcaga tgaagatcag ttgagggtag aataacacta tcactgctaa 15661 attatttagg cagcaattag aatttgttca gcattattta gggtttttaa acaaacctat 15721 ctgagtaacc ttataatact accgtctaaa atagatattg tatcagtctt atatcagatg 15781 aggtgaaagt cacatctata aaacgctgaa ctcttaaatc taaccaggga tggaatatat 15841 cctgttaatt tgtcaaactt ctagagaacc gcctacatga gacagtaagg attaatcaag 15901 
ccctactaag agagaatttg aaaaatacag ttttctaatt atacagtctc ttctccatgg 15961 tccaaggtta atgcagttct gaattctgac agttagtaat agaaaagaaa aaaaaaaaaa 16021 ccaaagcata tgaattgtct tataattacc tgaaacagaa agtctcttct ggcaatcaag 16081 gccttctaag cattaatctg atattttcag catttgattc ttttcctttt aaacaaggtt 16141 gggaataact ggagggttgt gtgtcactct ttgtgcttat caaatatatg tactgtttgc 16201 agatttattt tgcttattgc ttagctgtag atgtgtaatt ctaggttgaa cattttatct 16261 ttttttcttt ttttttttac aatttagact aacttatgaa tacttcctag acactgtatc 16321 tttgaaatac ttgatcctta ttcctactac gaatataaaa actaacaatt acttcttggt 16381 tagttttggt tagttgtagc tcttgctgca gttcatccag gcaacacaac aaaaatgtgc 16441 actctaggga aaggtaaatt tattccttca tttattgctg tcaaatatct gtggaatgtt 16501 tattgtacag gttgagcatg ccaaatccaa aaatcaaaat ctgaaacttt tttttttttc 16561 tgagatgggg tcttgctctg ttgcccagac tggagtacag tggtgcaaac acagctcact 16621 gcagccttga cctcctgggc tcaagcagtc ctcccacctc agcctcccaa gtagctagga 16681 ctacaggtgc acgccaccat acctggctaa gtttttttat tttcagtaga gatgagctgt 16741 cactatgttg cctaagctgt tcttgaactc ttgagctcaa acgatctccc ccacttggcc 16801 tcccaaagtg ctgggattac aggcatgaac tactgcacct ggcccaaaat ctgaaacctt 16861 ttgcacacca gcatgacact tgaaggaaat gctcatggag cactctggat ttcaaatgtt 16921 tggatttggg atgctcacct ggtaagtata atgcaaacat tttaaaatct gaaatctgaa 16981 acaattctgc tcctaagcat tttggataag agatatttaa cccatagagg aaaggtacat 17041 ggaagcccca gaccacccaa aagaaaaggc ctggggagaa gatgggcagt tatggctaga 17101 caccagattt caggttttac agatggatct ggcatctgag atctatgaac agatggattc 17161 ttaaggtcag tgactggccg ggcgcggtgg ctcatgcctg taatcccagc acttttggag 17221 gccgaggcag gtggatcacc tgaggtcagg agttccagac cagcctggcc aacatagcga 17281 aaccctgttt ctactaaaaa tacaagaatt agctgggcat ggtgacgggt gcctgtaatc 17341 ccagctactc gggaggctga ggcaggagaa tcatcgcttg aaactggtag gcagaggttg 17401 cagtgagccg agatcacgtc attgcactcc agcctggaca acaagagcaa aactccatct 17461 aaaaaaaaaa gaaaaagaaa aaaaagggca gtgactatgg gagaaaaatc aaagaccaat 17521 attttacctt gggaagtgcc cacaaaggca ggaaaaataa aaggacctac 
agaaaagtaa 17581 agcagaaaga gaacacaagt gtatggtggt cctgagaggt tttagaatag agaggagtcc 17641 aaagagagca ggtgaatcaa aaggttccag gccagaagag accttgggtt ttcatgatga 17701 cgagaccctc ctgtggcctg tggagctggc tctcatcagg tcatcagagc caactgtgcg 17761 cattgcttcc caagtacacg tcatgttggg agtgggagat tggctgtgtg aaaaaatgtc 17821 tacaccacgg aaattggcag atatatggtg acccagcaaa cacagttgac acctgaaaca 17881 taatttttag gaatagtatt taaccttgta tgcgggatcc attctgacct ctatccttcc 17941 cttcttacct atttcattaa aacagaagaa aaagctggtg taactcattg aactgatttc 18001 atgacctgca gtttgaaaaa cattatttta gagaaaactg gggattatgt atgccttaca 18061 aagtcagaag acaattattt taaaaataaa ttatagtaac aattgtagat aggatgattt 18121 catcttgaaa ggaggaaaaa aagaatccta aaaggctgac aagatgacct ctctagacta 18181 aggactaagt atgcatgctt acaggagaag aaaaacgagc agagagaagc gggagccctc 18241 agagatggag ggaggaaggg gctgggagca cggccttgtg tttgcctggg gagaggaggc 18301 cgtgttttga tagtgaaagc ttgttttaga gaaagtcacc tgttctggag gattgaggaa 18361 tgaaactgga aaccttagga aaaaagcaaa gacatcggat gtacttggta ttttttaaat 18421 tgcttttatt tagttagctg gaaatgcact ttctcagatc cactgtcact cctgcaaatt 18481 acaagagcgc tcagccaccc tgtcgaaggt catccggctt ccaagcagca aattggacct 18541 tgagccctcg tgctgccctt gttgcaaagc tctttgtcca cactcgctgg cagaggtgga 18601 gagtacagag gggaggatgc ttggcctgtg gtggtctggc cagccagggc aggcatgagc 18661 tcgtcacagg ccaggcagtc atcgagtgct ctttgtggcc tgaagtggat ttggcaggtg 18721 cctgttattt gatcagtgtc gatggccagt cgacacaagt aaatgaatcc ccaaacttta 18781 ctctgaccca cctgctttaa acaaattgga ggaagtcatc aagttagttt ccctagtttc 18841 ctctctgagg acagtctggg ggctgaggct ggctgagtcc tgtctcctca ggacaggccc 18901 aaaaggaggt acagggcggc ctgggggaca gcaaggttca gagtttcagg gctatggaga 18961 tcctggagct gatagacgag tgtcagccac aaaggtgaag ctggatttca ggaccacaac 19021 aggcggcacg tgggagttac gaagggggca gggctgggtt gcggcctcag gtgctcctct 19081 gggttggggc ttctctcctc agccccgcag gtgcgtcggg caagcctgtg atcataagac 19141 agggctgagg gggtctgggg aagggtacgg gaaccagcaa gagagaccag ggtggggctg 19201 atcttacacc ctccaccgaa cacttgagtt 
tcctgcttgt cagtgtcact tttgcgtatg 19261 tgggattttg gattgttcct tttagagtct catgtgtacc cgacacacta ctgttcatat 19321 tgcccagtag ctttatatat atgtaaatat atatatatat atatgggtgc taaatattta 19381 tcaaatgaat aaaggaggaa atgcagattc cagttattca tggtgatgtt gggaagtact 19441 gaaagtcctc ttaaattcaa atatcagaaa tatgctctcc catctaataa atactttttg 19501 atggtgctgg tattaatatt cagatatgat atatatttaa tatttatttt taatattaat 19561 attcagatat atatattttt taacaagaag ctttacattc ttcatcttgt gattttttgg 19621 ttaaaagaaa ggaagtctgt ttactaaatc gctgttgcat gtagtttcta taaatcgccc 19681 tctgtacaca gggatgtcag atgctgccca agagaatctg tttcatatgg taattttcac 19741 taatgaatcc agacctatac tgttaaattt tgccccaaac tgcctgggat gaagtagagg 19801 aagctcatgg cagtgagcgt gacagttctt agaggacgct atcatagtgt ttcataacag 19861 atgcagaggc tctgcagttc tgaatattaa taccagcacc atccaaaagt atttattaga 19921 tgggagagca tatttctgat atttgaattt aagaggactt ccagtacttc cgaacgtcac 19981 catcaataat tggaatctgc atttcctcct ttattcattc gataaatatt taacacccat 20041 tatgttcatg gcatcacatt agactctggg ggagttacaa agaacctagg ctgctctgcc 20101 ttcaaggggg aaataaaaga aacaattaca tgtaattgtt tttaacacca agcagcgtga 20161 aacccagctc tggagcagcc tggtcacaag ggggagaaga ggtcacggga gcccctccaa 20221 ggccagcgca catgtggcca taccttgtgc cttcccaagc ccgggccctc gcacccttct 20281 cttgtctttc cttgtcctcc catgccgtca gggtttatga gacctgcaag tctcggggca 20341 tggctctggt cactctgggc agcgaagatg attctctgat gggctgtggt cagtgaagca 20401 cagcttgttt gcctcttttg attttcctgc aatgccagga gagattgctc agttttaggc 20461 aggactgaga tctcctaaaa catggaattt gtagacaaaa atgaatactt cctaacattt 20521 cctgagagag cttttggtca gccttatgcc tgcaacattg cattgctgct gggggaccag 20581 gcacacagag gtagctgcag ctgcacccga acacgagagg gtggctgagg agagtggcca 20641 ggccttcatc tctctgaccc agagtacgtg ctcagtaaat gatcatcgat gaagcaagtg 20701 agtcgggact gaaatgtatg agcgccataa gtttaggcca ctgctcattt aaggcctcag 20761 aactcctgga agtgttacgt gtagaaggtt ggcaggctgg agagggtagg ggagaccttg 20821 aagattccca tatttatatt ctaggtggct tgtgtgatgt ggccatggcc ggatgtcatt 20881 cattaactcg 
tttgacaaat atttatttta gcacttgtgt gcatagcatg tgctgggaat 20941 accaggggag ccagaaggca gtgcttccgg aacttgtctg cacgttggaa tcacccgggg 21001 aattgcttac aaattcaagc ccaagccaca gcccaggcca tcacactctg caggcacagg 21061 gcacagatct ctgtattttg gaagcgccta agggtgttct gatatcagag aggtttgcaa 21121 gtcacggctg ttaggattta taatagagtt ttagataaca cacccaggaa aattcagtaa 21181 cgatacaaga cggccatcca gaaacctcct gggcaggatg agggagtcaa acccaggtgt 21241 gtgctgtggg caccactcat tccccctcaa ggaggatgtc gcctgtggat tagggagcgg 21301 gagcatctag gcaaactgga cctctggggt gtgtgttgct ttgccaggtt cgagagaggg 21361 aggaagggga tccccttgac tctgaactca ggcaggtccg gatgtgaccc gggatcatgg 21421 gtggaatgca ctggtctatc aagagctcaa ctagtcattg ccatttaagt ggatgaatgt 21481 ttttcttata aaagacagac tgggctgggc atggtggccc acacctgtaa tcccacagct 21541 ttgggaggcc aagatgggaa gatcacttac ggccaggatt tcaagaccag cctgggcaac 21601 atagtgagac tccatctcta caaaagatac aaaaaattag ctgggcttgg tggcacgtct 21661 gtactcccag ctacttggaa ggctgagttg ggaggatcac ttgagcccag gagctcgagg 21721 ctgcagtgag atatgatcat gccgctgcac tctgggcaac agaacgagac ccagtctggc 21781 tgctggcgtt tttccttgtg cgtatgtggg ctgagcacga tgtggtaaag cagatccctc 21841 aggatggaca cgctcagctt cgtcacttgt tgattctctt gctcttaaaa tcccctagcc 21901 ctcttaacag ccacggtgaa cttccccatg tgtgcgtgtc agcacaccaa tgagccagct 21961 caggtggcac tttcatgccc cagggtcttg tggagccgtg aagcggacgt gcgcttaggg 22021 tggggaaggc acttgacctc ggccgtcctg gccgctctag gtgccttcag gcctcactgc 22081 tcatcagcaa ccagccagac cacgtgggcc tcatttggta ttcttaggac ttgccatttc 22141 taatcaatgt ttttacttga gtctgttata tgttatttca gaggggggaa tggaatgtgt 22201 taacacttag taaattcttc atgagctgca accaaagttt ttctttcttt ccctttcaac 22261 ccacagtgtg ccatgcggag ctgaatgcca tcatgaacaa aaattcgacc gatgtgaaag 22321 gctgtagtat gtatgttgcc ttgttccctt gtaatgaatg cgctaagctc atcatccagg 22381 caggtaggaa ccgggtcttt ccatttgtca catttgcatt gctgcctgct cgtcagtagc 22441 tcattgctga tgtgtttgat cttaaattgc tattgtttag atctaaatta attgcactat 22501 cagctttgca agatatttcc cgtgttttgt ttttgaatgg tgccaggtat tctaagacta 
22561 attgcatcct ctgtgattct tagcatgttt ttaatttagt ttgcaacttc ttaaactgca 22621 agctgccaaa aaacaacaca ggcattgcat ataattactg tcaatttact gttctctgac 22681 atatgtgcat agatttgggg gagggaagaa atgaacttct gtcagaaaca aggtcagctg 22741 ggagtctgta tatgaaattt taatttgtat ctaatacatg gtagatattt gaggatctca 22801 acataccttg gcatataaat gtcataagac ccaaatggca ctaccttcag gttttcttat 22861 tttacggttc ccataaatct acaccattgt attctgaata tttcgccttt ttcaaactgt 22921 cacatctaag attcagctac actttgtctc cataaaactc tcggctggag atttaaagac 22981 ttaccatgtt tagccaaatg tgtggtttga aggggccgtt tatcaccagt cctgtggatc 23041 tcagagcaga agcggcctgc tggtactgag gagtcgcgtg ggctccctcg tgtcgccgtc 23101 accgttcctg ggaagctctc ctcgcggtgc ccaggtgtgg aggcggtggc agcctgtggg 23161 ccgagcagtc cgggccccac actgcagcct cctctgcctt cagaggctga tggagcagcg 23221 gccttcagtg cggggccctt gggattcctg gctgccgagt tagcgctgcc tgctgagatc 23281 aggacaagaa aacagatctg ccctctgtag agctggtttg aagtggtctg caaagccatg 23341 ctgggaagca cagtggtgcc cacctctgaa tttgccagca gcccccactc cctccattag 23401 tcctcccact agtaagagaa gctaccttgt gatactgcct tgattacctg tagtaaggga 23461 aattggttaa tcggatcaag gcttgactac ttgacatgtt ttccctttgc atttgtttaa 23521 aaaaaaaatc cataaagact ggagggaaga ttagatagaa tccaccctct gctgtattag 23581 aaagcagcag atatgtattt cagtgactgg acactgtctg caagcacact cagaaaaggt 23641 cattaatcat agacgctgaa agtgctccag ctcgttccgc ctgtcttttg cagcatctgt 23701 aggtgcgtac actaagggag ctcccattgt taccttacag gtataaaaga agtgattttc 23761 atgtctgata aataccatga tagtgacgag gcaactgctg cgaggctcct gtttaatatg 23821 gccggggtga cattccggta agtggggaga atgcgggcgg ggggacgggg tagggtgaag 23881 gctgatgaag agatgtattt gaccttgtct cctccttgtg atgcctgact tccacacttg 23941 gaggtgtctt tagaaatgca agggcataag aatttaaaga ctattatagg ggtagatatt 24001 taagagcagt gacaggaaaa tctgaagacc atttgtttaa tgattgtact ttaatttcag 24061 taactttccg gggtaattag actaattatt caagtactga gaagtttaaa cattcacaag 24121 ttttataata ggaacgccaa accttggtga attttgcaaa tgtaacaact ttggcataga 24181 aaatttagga tgaaagtagc ctgccagatt gaaacattgg 
ggattctctc tttttttttt 24241 tgagacggag tctcgctctg tcaccaggct agagtgcagt ggtgcgatcc tggctcactg 24301 caacctccgc ttctcaggtt caagcggttc tcctgcctca gcctcccctg ccacacccag 24361 ctaatttttt tgtactttca gtagagacag ggtttcacca tgttggccag gatgacctca 24421 tgagctgcct gcctcagcct cccaaagtgc tgggattaca ggtgtgagcc acctggggat 24481 tctttttaca gcaacatttt gcattgagat ttgaagtctt acagattttt atgattatct 24541 ttgctaagaa atggttgtga gctatatttt acttgcaagt aagttcaact acctttgatt 24601 catggcagaa aattcctttg ggactgtaaa gttttccagc gcttcatggt gttggcgttc 24661 agcagcagtg ggaccatggt cataggtgac gtgccctggc cccttgtgct ggtcattagg 24721 gatacagctc agtgtatctc atgaactcaa ccgtggtgcc ttctgagact ttaattcaaa 24781 ttatttgtat actctcttgt tcagtgattc ttaaaagaat ttgttttgtt cccaaatgtt 24841 ttctttttaa ttgttgtaga agaaatgagg aacgagtgaa aagttaatga gcaaaaaaaa 24901 aaaaaaaaaa agttggttta tgtttgcatt tttagaacgt aatttgaaaa attactcctc 24961 actaaaggta gtcgtttttt gctatatata agcggcaaaa gaaaatttac tatttttttg 25021 cattagagag actccaagtt ggctgcagaa atttttctag aaaataagat aaatctgatg 25081 ggaaaactag taatcggata tcatgttgtg cagctgactt gttagaaatc ctttgactcc 25141 cccaataatt cgtgaactca gagagctgaa aacatacaaa ccacaccacc ccatgcatct 25201 ttttttgtct tagtcttttc tgctctgtgt ttctcttggt atggtgcact ataattggct 25261 cctatagaga gaaaatagca agtgaatccc aggacatatt tttctctgca cagaaatatg 25321 agttaagcca aagtaaacaa attgctttca ctactcacca gataataatt gtattttgtt 25381 tttattttta ggaaattcat accgaagtgc agcaagattg tcattgactt tgattcaatt 25441 aacagcagac cgagtcaaaa gcttcagtga gttacatctc attcaatctc cagaagattg 25501 ggattatcgt cttctaagag gttgctaatg cctttcatct tgaagttaca cataacttct 25561 tactagccag tatggcaaaa gtaggcatct aaagaatata aagcctcaaa tcttccttac 25621 tgtctctctt gtcacatgga atctacatgt gtttgaacta ttgctttagg atttaaaata 25681 ggggagcctg tggtggcctg gtgcacaggg ctagaacgag agtgcctccc cttcttgtgt 25741 cctggctggc tgggatgctg gtggctcttc agaggagcat cagctgtctg tcatctgctg 25801 cgatccggca gcctctcttc actgctacat gtgctggaag gacaaataaa taattgtggt 25861 tgtgttctta atggggacga 
gcagacacac tgatctgaac atctggccca agtgaagcat 25921 ggcatatagt gcccttggaa gaaaattagg cctcaaatga cagtagcatt gaagtgtttg 25981 ctgcagagtt gagggaaacc cccagccacc ctcccggaat ccgagatagg gtggcacatc 26041 tgtcctgaca gacgaggagt gtaactgaac caggaatatt tcctccattc ctgctctccc 26101 actgcacaca gggtggtggc acattatccc tctggggggt ggggacgcct gttgttttgg 26161 ctcaatttgg gtttgttggt cacatggagc tcttccattt cgtttagctg aataatgagt 26221 tgttcctaga ggagacagcc tgtctctcct tgttgccccc aaagcccatg ccctgccgtg 26281 gtggcagctg gggctgtgga tgggaggggt ccccaacatg gatgtgttgc ccctcctccg 26341 catgccaacg cagttcatgt acaaggcccc tctgcaactg gagagaaaat taattcctat 26401 cccgtgagtg gattgtgaga aattccaccc acgtggagac agcttactgc agcactgttg 26461 gtgttcggag ctcttctgtg ccctggctcc atgctttcac ctacacaagc atcaccttcc 26521 taatcaccgc ggggcgggga gcgtgtggct ctgccccttc tctttaatct catttaattt 26581 ttattaaaca tgctcagtac ctgtgttgag aaaaggcttt ctttatccta aagattatta 26641 cctttttaaa gtgctcttat attttcatga gtttttattt tgtctctgag attttgtatt 26701 ccacattcta gggtattctg taatttggct ccttaccaat attattaaaa tcttattaaa 26761 atct // LOCUS HUMDS 22573 bp DNA PRI 07-JUN-1996 DEFINITION Human gene for dihydrolipoamide succinyltransferase, complete cds (exon 1-15). ACCESSION D26535 NID g537349 KEYWORDS dihydrolipoamide succinyltransferase. SOURCE Homo sapiens (library: lambda EMBL3) peripheral blood cells DNA, clones KGE2-[5,9,13,36]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 22573) AUTHORS Nakano,K., Takase,C., Sakamoto,T., Nakagawa,S., Inazawa,J., Ohta,S. and Matuda,S. TITLE Isolation, characterization and structural organization of the gene and pseudogene for the dihydrolipoamide succinyltransferase component of the human 2-oxoglutarate dehydrogenase complex JOURNAL Eur. J. Biochem. 224 (1), 179-189 (1994) MEDLINE 94357218 REFERENCE 2 (bases 1 to 22573) AUTHORS Matuda,S. 
TITLE Direct Submission JOURNAL Submitted (15-JAN-1994) to the DDBJ/EMBL/GenBank databases. Sadayuki Matuda, Kanoya National Institute of Fitness and Sports, Department of Biology and Health Science; 1 Shiromizu-cho, Kanoya, Kagoshima 891-23, Japan (Tel:0994-46-4111, Fax:0994-46-2831) COMMENT Submitted (15-Jan-1994) to DDBJ by: Sadayuki Matuda Department of Biology Kanoya National Institute of Fitness and Sports 1 Shiromizu-cho, Kanoya Kagoshima 891-23 Japan Phone: 0994-46-4111 Fax: 0994-46-2831. FEATURES Location/Qualifiers source 1..22573 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda EMBL3" /tissue_type="peripheral blood cells" misc_feature 222..227 /note="GRE: glucocorticoid-responsive element" misc_feature 321..328 /note="AP-2: cAMP-receptor-binding site" CAAT_signal 374..378 GC_signal complement(723..728) /note="Sp1-binding site" exon 748..853 /number=1 CDS join(791..853,1422..1455,4410..4458,7915..7967,8095..8169, 8718..8773,9885..9996,11658..11810,12173..12249, 13136..13233,17195..17325,18751..18824,19117..19200, 19892..20059,21022..21156) /codon_start=1 /product="dihydrolipoamide succinyltransferase" /db_xref="PID:d1006080" /db_xref="PID:g643589" /translation="MLSRSRCVSRAFSAPLSAFQKGNCPLGRRSLPGVSLCQGPGYPN SRKVVINNSVFSVRFFRTTAVCKDDLVTVKTPAFAESVTEGDVRWEKAVGDTVAEDEV VCEIETDKTSVQVPSPANGVIEALLVPDGGKVEGGTPLFTLRKTGAAPAKAKPAEAPA AAAPKAEPTAAAVPPPAAPIPTQMPPVPSPSQPPSGKPVSAVKPTVAPPLADAGAGKG LRSEHREKMNRMRQRIAQRLKEAQNTCAMLTTFNEIDMSNIQEMRARHKEAFLKKHNL KLGFMSAFVKASAFALQEQPVVNAVIDDTTKEVVYTDYIDISVAVATPRGLVVPVIRN VEAMNFADIERTITELGEKARKNELAIEDMDGGTFTISNGGVFGSLFGTPIINPPQSA ILGMHGIFDRPVAIGGKVEVRPMMYVALTYDHRLIDGREAVTFLRKIKAAVEDPRVLL LDL" repeat_unit 870..877 /note="DR-1" /rpt_type=direct repeat_unit 928..936 /note="DR-2" /rpt_type=direct repeat_unit 975..983 /note="DR-3" /rpt_type=direct repeat_unit 992..1000 /note="DR-1" /rpt_type=direct repeat_unit 1093..1101 /note="DR-3" /rpt_type=direct repeat_unit 1151..1159 /note="DR-2" /rpt_type=direct exon 1422..1455 
/number=2 exon 4410..4458 /number=3 exon 7915..7967 /number=4 exon 8095..8169 /number=5 exon 8718..8773 /number=6 exon 9885..9996 /number=7 exon 11658..11810 /number=8 exon 12173..12249 /number=9 exon 13136..13233 /number=10 exon 17195..17325 /number=11 exon 18751..18824 /number=12 exon 19117..19200 /number=13 exon 19892..20059 /number=14 exon 21022..22573 /number=15 polyA_signal 22550..22556 /note="aattaaa: possible polyadenylation signal" BASE COUNT 5710 a 4774 c 5431 g 6658 t ORIGIN Chromosome 14q24.2-q24.3. 1 accttgtttc ttaaaaaaac agctgggagt ttatattcta gggatagggt gacaaacgaa 61 caacttccaa aatgcacaag atgtatgggg aacaattagt ttggctggaa tttgcgttgg 121 ggggagatca gaagctgttg agaaataagg ctatagttgg tgaagtatac cgactcgtaa 181 aaaggcaatg ggatttagaa attcctggga ttctgtttct ttgactcggg gttaggaaga 241 cagggaagta acaagtgcct tttttttttt ttttaatagt taagggacct gagactgggg 301 gtggggggca atgtcatctg ccccaggcca cacaagtgta agatcaggca tgcatgtctg 361 ccaaggtttc agttcaatgt ccgtgcccct cttttgctct cttcccaagg tacagaggcg 421 gaacggcgcg gtgccctcgg gtcccaagag agaagggcgc caagctcaga gccagttctg 481 tggaagggta ggggaaggcc tgaccccggg ggcaatgtgg tggcgggaca gtcgtataca 541 ccttcaggag aaccacctca ggctgacagg cagtgcggat ggccccaaga gcgccttagg 601 gttgcctgct ccagagagcg ggactcctgc gctgcagccg ggatactgga cgccgccgtg 661 cgcccacccc ctgccagtgg ttcgctccga acacccctcg tcggcatggc gggtactgcg 721 gcccgcccca cttccggttg ttgtccggcc ctatatccgg tgtccgcccg ccctcggctc 781 ctccgccgtg atgctgtccc gatcccgctg tgtgtctcgg gcgttcagcg ctccgctctc 841 cgccttccag aaggtacggt ctggccgagc cggggccccg acgggtgagg agtctgttgg 901 cgggcagaga ggcggcctgg aagcaggggc gcggcgcggc cgggccgggc accaagggca 961 ctgggacgcg gaggccgcgc gggctgggcg gcccggggcc gtggttgccc tcgggaccgt 1021 tttgcggagc ccagcggccc cctgccttct ccaggcagcc tgcggggaac gtttcctgct 1081 gtaccccacc ctccgcgcgg gagctgggga gccgcgcccg tccagatact tgactcagag 1141 ttagggagga ggcgcggcgg agtcctccgg cgggattcgg cgccgtcatc gtcttgctct 1201 gttaaaatca gtgggtatta atgagtccgt tgggaagttg agatagtttg gcaggcccag 1261 
cgtgttgatt gggtctgcca gcacggtgtg ttgcatttac ttcggagtta cgagattcac 1321 ctgtcaccac cactgggaca gctttgagcg tgtacttaaa agaaaaagct gggttttttt 1381 ttcatcttgg gtgacgtcga taatgtcttc tttctcaaca ggggaactgc cctctaggga 1441 gacgttccct gcctggtaag ttctgccctt accgtcacct aaaatgctct cctcctgggc 1501 agagcagtat gtgaggaact cgggggtgcc tgtgtttctc ataatattta aagacagttt 1561 ggttcagaga gtagtgagac gcaatgctac ataatcacga gctatcgctt attcatttag 1621 ctcgtagttg agtttctact gtgttaggca ctgtgctaga cgctagggat acaggagtgg 1681 atacagaagt gtacagttct tgccctcgtg gagctcctag gctagttgga tagtgtcagt 1741 tgctaaacaa gccaacaaag gcaaattttt gtaactgctg agaaggaaac gagggtggac 1801 agaagagccc tttttgagga ggtgacttaa aagttatggc catggccggg cgcggtggct 1861 gacgctgtaa tcccagcact ctgggaggct gaggcgggcg gatcacgagg tcaggagatc 1921 gagaccatcc tggttaacac ggtgaaaccc catctctact aaaaaataca aaaaattagc 1981 cgggcgtggt ggcaggcgcc tgtagtccca gctcctcggg aggctgaggc acgagaatgg 2041 cgtgaagcgg gagcggagtt gcagtgagcc gagttcgcgc cactgcactc cagcctgggc 2101 gacagagcga gactccatct caaaaaaaaa aaaaaaaaaa agttatggct agcagaatgt 2161 gacggagcta acccagtgaa gcccaggaga aagagaatgc attcacaaag accctgagga 2221 agaaaagagg ttggcagaac ttaagagctg gaaaaaaggt gatttggccg gagtattttg 2281 gttaagggag agagtggcaa gagatagaag atggtataga gactgaagga gtttagattt 2341 tatcctagca gttaaaaaga aaaaagtgca tccatagctt tttctgttat atgcctagcc 2401 taagtatttg ttgaatgagt tgaccttgtt gtgggcattt ctttgtgcca agcatttata 2461 tgagtgcttt ctgtgaattt tttagttaat catctcaaaa cagtaaaaag tattttctct 2521 attttacaaa tgcataaccc aagacaagag agcttaaaag ccttggccat ggtcacccac 2581 ccagctgttg ataagccagg tctacctgac ccatcctgaa ctcttaaccc accatgttct 2641 catttactca tggactcaac agatctattt attgagtgct tactatatgc cagccactgt 2701 tctaggtaac aggggtatag atagactcta gcctcgagat tttatgggct agctggagag 2761 agagagcaaa tatacattta tgctccgaca agggcataaa ttttggttac tacgagagag 2821 gaattaaagg gtgtgatgag agggtgggat gatgtagacc agcttgggaa ggttgggaat 2881 gtcagtaaag gcttccctga agaagtgaca tttaactgag acctgaatat aaagagccaa 2941 gaaatgtatt 
ccaggcaaag ggaattgctt gtgcaaaggt tttcagcagg aaggcttgac 3001 cagtgataga catgggaaga agcctgtctg gcaggaaaga tagagtgggc tggacggtac 3061 aggatcgcgt aaagtgtgga ctttatcctg agagttatga gaacataaag cccctagcac 3121 aatgtcggct tgtagtggac tctcaataac aaacatgttt ttcaagtggg aaccctgaac 3181 aatgtcagta aacattttct ccacttcata ggtgggcagg acaatggaaa gagccttgca 3241 cttggggatt tgcatttggg ctcagggatt caccttgggc tagtcgctta acctccaaaa 3301 gtcctactta tatgaaatgg gaataatggc agatggcgtt ccacttcaga gagttgtgag 3361 actgtaaaat caaaatcgac ggcattatgg gaaagctttt tataaaccct aaagcactgt 3421 acaaatgcaa atttgctgta ggatgaactc tgaattggac tccaggttcc caaaatcagt 3481 gtgagccgcc atctattcta gttcaaaaac atactatgct tagaatttgc actgcatagt 3541 cattggctgc tggcacataa aggtgactgg cttacctatc ttgggatgct tgtgaggagg 3601 tgggcataag agagggctag ctctcatttg gtgtgtcctg agcccatggt tcagaaagag 3661 ttgttaatct gaaagaaaaa agacataaga aagaattcag aagaggcagc tctagttact 3721 agtcatcagt tggggaaatg cacttaacta tgataaggac agctgactgg gcagaaacgg 3781 gtctctcacc ttgttgagag tgaccagggt ggaagacttc agaaaggcat gggaagtcaa 3841 taaactgaaa gatggggaca tgattttctc agctctagga caaactggag accccgagcc 3901 taaagtagcc aagtctttgc tactaggatt caggtgataa gatttccaac ccaatctttt 3961 caaacccaga ccaaacatcc tccttgcttc tcaaattggg ttatgtatag tcacaggata 4021 agccatccca tgtctttctc aagcttcagt tgtatttgaa gacttggcga gtagagacat 4081 tttattatga aaacatacag gaataatgaa aagttctggc tttttaaggt agatgagatt 4141 atataatcac tttgggctgt gccatcaaat tttttctctt tcctccccag ccaaacacat 4201 acccactcga tctcacatac ccatttgctc tctgccacta aggctacttt ggaaaagttt 4261 gagaagcagg actctatgat aaagattata tccgttgccg ttgatcctgc tctgtctcag 4321 ggttgggggt gagcagacct tttccattcc cagcatgtat cctttgtgag ataagagttg 4381 ttggttaaga gttattttgt ttcttgcagg ggtctcctta tgccagggac caggttaccc 4441 taacagcagg aaggttgtgt aagtatcact ggggagtaca gatatgtggc cacgggttat 4501 tatttaaaga cataaagttc taaatttgac catctgattt cttcactggc ctccagacct 4561 agaaattgag gtgattatgt tgagtcccta gactaacctc aatgttcact tttcccatgt 4621 gatgttcctt gagcatcagc 
tccgtgtcag gttttggact gggagtgtgc ccactctcca 4681 caggtgtttg gattataatg aaagttaagt tggaatcctc tttttatggg atatagaagg 4741 gtgctcctag atattctgtt tggatgtaag tgcagctgaa aggctcgagg ttacatttat 4801 tgtggagatt gagagcatcc tctcagtagc atagttactc atttttaaaa acagtaaaag 4861 cataaaactg gttgagtcct gggactgact gactagtttg gagggaatgc tgttattgca 4921 taatgcagag tttggagaaa ggctacccat gtgtgtttca gattaacttc tgcgtatgaa 4981 tgttgcttaa atgctgcttt gcattccaag tgtttttgtg gtggtgctaa gagaatgctc 5041 tctagtaacc tgttcagcag ctgtcaggct gctttactct acagggagaa tcctcaaagt 5101 gaagaggtcc agcggatgag cttcttactt actctacatc acaatgaggg tttttgttgt 5161 tgttgttttg agacaggatc ttgctctgcc actttggctg gagtgcggta gcctgatctt 5221 ggctcactgc atcctcagcc tcctgggctc aagagatcct ttcacctcag cctcctgagt 5281 agctgagact acaggcacat gcaccatgcc cggttgtttt ttgtattttt tgtagagatg 5341 gggttgtttt gccatgatac ctggggctgg tgtcaaactc ctgggctcaa gcaatccagt 5401 ccacttgcct tagcctccca aagtggagga tacaggcgtg agctatggtg cctggctgat 5461 ttttaaaatt tgagtaaatg ttgcactcag tactagaaat atgaataccg gtgttatttt 5521 ggagggcagt ttgatagtca taagtctttg aagtgtgtgt accttttctt tttttctttt 5581 tttcaagatg gagtcttgct ctgtcaccca ggctggagtg cagtggcgcg atctcggctc 5641 actgcagcct ctgcctctcg ggttccagca attctcctgc ctcagcctcc tgggtagctg 5701 ggattacagg cgcacgctac cacacctggc tgatttttgt attagtagag acggggtttc 5761 atcttgttgg ccaggctggt ctctgaaact cctgacatca ggtgatccgc cgactcagcc 5821 tcccaaagtg ctgggattac aagcatgagc cactgcccgg cctgtgtgtt ccttttcatc 5881 tagcaattcc atctctagga atctatactg aagaactcct ggtatattta tctcagattt 5941 aatatctgat aatataaata ttatcagatt tatcttaaaa tggcaaagag caagggattg 6001 gctaaactat agcatgttca catgttcaca ctatgaaaag tgggaaaaag caggttacaa 6061 aataatttca ttttctactg tgtggtatga tcccattgta agtattcttt ttgctatttc 6121 taatattctg tagtgaacat ctagttattt agagattcaa aatgtattac tcatctgcaa 6181 tcaaatgaaa atgcgctaat catgactctc tacaataaat acactaaata tttgaaagta 6241 aataatgaag aaaaagtaaa ctttgtgtgg gatcacccca atcatgtcac cctaatccag 6301 tatctttgtg tgtgttgtgc cttttttgtg 
aataaggtac ccactttatt taataaatct 6361 tttattgtaa aacaattttt ggttttatgt ttttactaat agtattatat ttacatgatt 6421 aaaaattcaa aaagtaaaaa gggtgtccat gagaagactt ttttatcctt gtcctctagc 6481 cccccgaatg cctcttgggt agtgtggcca ccactccggg ttcctggatg acttgcagag 6541 cagcctctgc acttgcatgt tacctgttag ttcttgccta taggcatgta ctttacaagg 6601 ttacaaacta tatgtgaaac aatttgtatt cttgtagttt ttaaatatct tcattttttc 6661 ttggttatac atagacctta tagttatttt gacattataa aagtaaaaat acctaaagac 6721 aacttaaatc tcttttactt ttctattata aaaatgcttg ttgtggaata ttcagacaat 6781 cctgaagtgt aaaaagtgaa gtccccatcc aaatcattct cccaaattgt gtaagtgtgc 6841 aacacataac cagagatacc atatatgtct acatctctca cccctcagag gtaaccactg 6901 ttaatggctt tgtgcataca gggtgagctg attctccagt ttatattatt tactgtaaag 6961 gccatatcag taaacgcctt ggtctttgag gttcgttttg aatgaatcca gaatgtgttg 7021 atgtgctgtt ttctagcaag agaaccgagt aggagatacc atatggggtt tgtgtgtggt 7081 ttttttgttt ttgggttttt tttttttttt tttggcataa caccttgttt tacttccgag 7141 ttaaccttgt taatcttgct gccttatctg atctaaatct gttatcttga ggcagaaatt 7201 ccattatacc ttgcacaatg gcttgtaccc tgggcaggcc gtaaatctat ctcggattta 7261 gttaaggaag agacctggct ggacatggtg actcactcct gtaatcccag cactttgggt 7321 aggatctttg agacccgtct gggcaatgaa atccatagtc tacaaaaagt aatttaaaaa 7381 aattatctgg gtggcatgtg cctgtagtcc cagctacccg ggaggcagag gtgggaggat 7441 cccttaaatc caggagtttg aggttacagt gaactattat catgctactg cactctagcc 7501 tgggtgacag agcaagactc ttatctctta aaaaacaaaa aaccaaatca agaagctaag 7561 tcatttcggt ctgcaaagga ccaaggtgct tattttttta ttgaaagagc caaatgtaag 7621 tctgttttgt gagagttggc attgcattgc agcacagggg cctgagcata cttggggagc 7681 tcaccattta aggtacctgg gtcgttaggc tctactccag acctgctgag tcagaatccc 7741 tgggggagag gcctagacaa gtgaactttg aaaaagttcc tccccaagtg acactgatgg 7801 acacccctgg tcaagagtca ctgtttaagg ggaaggtgac ccatggggcc ttaactagac 7861 taaaatgtga gtggttcgcc tgttaaaagg agttaacgtg tgtttctttt gtagcattaa 7921 caacagtgtc ttcagtgttc gctttttcag aactacagct gtatgcagta agtacctgct 7981 ttcttgggaa tggaatttta tgggaagagc aacaattgat 
tgagtttaat taaagaaaaa 8041 tattaaaaag gaaaaatgaa ctacttatga ttttcttttt tgcatttttc ctagaggatg 8101 acttggttac agtcaaaacc ccagcgtttg cagaatctgt cacagaggga gatgtcaggt 8161 gggagaaagg taagatttag tttcctattt tttttttttt tttttttgag acagagtctt 8221 gctctgttcc cagcctggag tgcagtggta tgatctcggc tcactgcagc ctctgcctcc 8281 caggttcaag tgattctcct gcctcagcct cccgaatagc tgggattaca ggtgcccacc 8341 accatgcctg gctaagtttt ttgtattttt agtagagaca gggtttcacc atagttggtc 8401 aggctggtct caaactcctg acctcaggtg atctgcccac ctcagcctgc cacaaagtgt 8461 tgggattaca ggcctgagcc actgcgtgag ccactgcgcc tggctcctgt cacttcttaa 8521 gtttaagtgc tatctaaaat tctcccctcc cctcaggata gttgtagaat ttatagctct 8581 gccgttttct ttacatatgt actttctgct ggccctgtag gaaagggttc ttataacata 8641 cctgccaaaa tgggttttgt ggctactgga ggctccccgc taacaggtag ccttgtagcc 8701 tttgattgtc ttttcagctg ttggagacac agttgcagaa gatgaagtgg tttgtgagat 8761 tgaaactgac aaggtaggct tatcttatat tcgtaccagc ttttcatggg cttcccttag 8821 ttaacctaat gaaagaaaca gggacttaac cattgtttag agataatttt taactagctc 8881 taacttcagt tgtaaaatta ttggattgtt caatttaaaa aaaaaacaac tttttacatt 8941 taactttttt tttttttttt tttttttgag acagacttgc tctgttgccc aggctggagt 9001 gcaatggcct gatctcagct cactgcaacc tccacctcct cctgggttca agcgattctc 9061 cttgcctcag cctccctagt atctgggctt acaggcgcat gccacaacgc cgggctaatt 9121 ttttgtattt ttttgtagag acaggtgttg ccaggctggt ctcaaactct tgacctcagg 9181 tgatctgccc gccttggcct cccaaagtgt tgggattaga gctagagcca cgcacctgac 9241 ctgcatttaa ctttaaattg tttgctgtgt gtgtactatg tacttcattt agtagcttta 9301 gtcttgtagt gtgtggtgag ggggtgggga gggtaaataa gctgtttcca ctttttcctg 9361 tatcacttgg aaacatatta actgaatatt gcctgaatgt ttctgagaaa ggacagaaag 9421 tgaatcttca cagctagcga tgacttgtta tcacccttgc tctgaaagtt atggccttga 9481 gatgtttgtg tgcaaggcct tatatattgt gttgcaactc attgttactc actggagata 9541 cgcttgttgt tgttgttgtt ttgagacgtt tcgctctatc acccaggctg gagtgcagtg 9601 tcacgatatg gctccacctg ccgggctcaa gtgattctcg tgcctcagcc tcccaagtag 9661 ctgggattac atttttgtag tttttagtag agatggggta tcaccatgtt 
ggccaggctg 9721 gtcttgaact cctgacctca agtgatccgc ctgccttggc atcccaatat gctgggacta 9781 cagtcatgaa gccactgtgc ctggcttcat tggagattct atacagctaa ttttctaatg 9841 ctggataaac atttggactt cctccatctg tcttcctctt ccagacatct gtgcaggttc 9901 catcaccagc aaatggcgtg attgaagctc ttttggtacc tgatggggga aaagtcgaag 9961 gaggcactcc acttttcaca ctcaggaaaa ctggtggtaa agaagttctc ctggtggtca 10021 aggtctccag tgttccctct tgggattggg actgagcata atgtgctaat tccccgatat 10081 gtcaaagact cttgtcattc cagctgccct tagttgaggg aataagggaa ttaaatttat 10141 atatagtgtg aacttcaaat gtcaaattct aaaagaaaag tgtatttttt aaaaaataaa 10201 ccttatactc tgttctttcc atgggtgttt cttagcagga gctgagaaac tggacctttt 10261 cttatagata tacagattga gcatacctaa tctgaaaatt caaaatctga aatgctccaa 10321 aactgaaaat ccaaaatctg aaatgcttca aaactttttg agcactgaca gaatgctcaa 10381 aggaaatgct cattgtcttt tatggggaat tcagtgttgt ttcttttcac atctgttctc 10441 tcatatctct aggaagggtt ggtatttctt ttgggaagaa atgagaaacg aatttgtact 10501 ggacctgata ggattagaga ttaataactt tatctgactc ttagttaaat tttataacca 10561 ctaaaaagtc tatttctttt tcctaagcga gaatcgtacc cgtagaccaa cgagtcacag 10621 aagtctattt ctttttcact gttacactaa taaggcagat agaccgcatg attgaggtag 10681 tcaggcaaga ccagaagctc agagtagaaa acatgtatgg tagaaggctg ggaaataagg 10741 tagtcctcct gttgcagaag tttgtcttag cactaaccct ggatataatg tccagatggg 10801 tgaaaaacag aaacaaacat ttcatttaaa gttgtgagat tattacttgg ccacttcagt 10861 gatttatgag tcatcctgga tgaagtgctt tcttaagcct aagtcaaccc tcgatgacta 10921 ttttattttt tgagatggag tttcactctg tcgccaggct ggagtgcagt ggcctgatct 10981 tgattcacta caacctccgc ctccggagtt caagtgattc tcctgcctca gcctgccgag 11041 tagctgggac tatgggcaca tgccaccacg gccagctaat tttgtatttt ggtagggacg 11101 ggctttcacc atgttggcca ggatggtttt gatctgttga ccttgtgatc cgcctgcctt 11161 ggcctcccaa ggggagtaat cccttggcct gtaatcctgg gattacaggc gtgagcaact 11221 gcacccagcc tattttttaa tagtattctt gcatatataa tgttaaaata gaggatgaag 11281 agtatcagta attagaaata tgttatgtat gcatttcaat attttaaatt gcctaggtgt 11341 tttttttcct tggaagttta cattttcttg ggcttgatag 
gtgctttgaa agtacttgct 11401 gaatttattc agtggaaacc tgtgaaattg ggaagggagc taagttgagc cagttatctg 11461 atggacattt agggaccctg aggactagag aatagtgcct tatgtcattg tgtgtgtgtc 11521 tcatagtgct ttgcatattg ctgataccca gtagacattc gttacttgat tggagctaga 11581 gtttttgcca ctaaattttg cttggttgct tctcatttca gacagtgcca gtggcatata 11641 cttttcttgt ttttcagctg ctcctgctaa ggccaagccg gctgaagctc ctgctgctgc 11701 agccccaaaa gcagaaccta cagcagcggc agttcctccc cctgcagcac ccatacccac 11761 tcagatgcca ccggtgccct cgccctcaca gcctccttct ggcaaacctg gtaggcttcc 11821 aactcccaca tgtcatgtgg gagaacatct cgtctaatat gttccttcaa agggtaaact 11881 ctagactgat gatggttctg aacaggccag gttggctgct ttgtagaaag gaattctaga 11941 ctttgtatcc aaaggcctgc tttgcatctc agctttctgt tactagctgt gtgtctttag 12001 gctagtcact taatttatct gaacagattg ttcatctgtg aagttggagt gatactaata 12061 tacctcagaa tatcttgaag attaagtaac agtacatatg aaagcactgg cacaaactca 12121 gcagatgttt ctttccttgt ctgatgcagc tttatcctct tttcattttc agtgtctgca 12181 gtaaaaccca ctgttgcccc accactagct gacgcaggag ctggcaaagg tctgcgttca 12241 gaacatcggg taagcctctg aggaccactt tcaggaaaga ggcagggggc agtgttgaga 12301 ttagtggagt atgggtgtgg tgatgcctac ctttgaatgc ctggcagctc attccctcaa 12361 ccttgataaa gggagtttag gatataactt cttcatagtc acataaccat caaatgatag 12421 gactagaacc tgaggtcttc tgaacacttg ttcagtggtt ttcctactat attatgctac 12481 cagtgaaagc cttgggtaag ttaaatattt gtaggggtgg tagcttagta actcaccccg 12541 tcctcccacg atgtcacacg cttcagagag gtggtttagt gagctctatt cctatacttc 12601 tgggatactt agctgtagct aagggataat actcagtagc aatagctgtc aggtgcagac 12661 ataaagcctc aaggaagtat ggtcaggatt gtggttcctc tgcctgagaa tacttcctaa 12721 aaatatgata gcctcgaaag tttgttgctc atgaggttct gttccacaaa tagctattct 12781 tatgcaaatc tttatcaaca gaaagtgccc tttttcttat gattcttatg ttggctagac 12841 tacactacac tacattcttt tataaatggg ttttggtttt taaatcactg tttctgagcc 12901 ttcattcttg gtcaagcttt aggcacatct gagtgagtag ttttggcctg tgtttgcatg 12961 tttttgcttg gaggagaaag ggccaccaca ggaaagggaa gcctggagct cgtgtactta 13021 gatactgtag tccttggcca 
tctactcgtg gcaagccgtg ttctttctga ctacacgggg 13081 aatgcttgac ccagagagat cagattgtca atgcttgttc cactcttact ttcaggagaa 13141 aatgaacagg atgcggcaac gcattgctca gcgtctgaag gaggcccaga atacatgtgc 13201 aatgctgaca acttttaatg agattgacat gaggtagtgt ctctagtccc tcttatcccc 13261 taggcccctt tttcttagag aacacgaact tgccccactc cttatctagt gctgattcta 13321 aaactaactt gtcagggaag aggtatttgt ttattgtttt ttgtttgttt gtttttgttt 13381 ttgttttttg agatggagtc tcgctgtctc ccaggctgga gtgcagtggc atgatctcgg 13441 ctcactgcaa gctccgcctc ccgggttcac gccattctcc tgcctcagcc tcccagagta 13501 gctgggaata caggcgcccc caccacgcct ggctaatttt ttgtattttt ttagtggaga 13561 cggggtttca ccgtgttagc caggatggtc tcgatctcct gacctcgtga tctgccttct 13621 cggcctccca aagtgctggg attacaggcg tgagccaccg cgcctggcct gtttattgtt 13681 aactttcaca gagtgagccg ctggcttctt ctctaattgg ctttcctaat actatgcctg 13741 cctcctccat gaccttaacc ttagccttgg ctaggtgtag tggctcacac ctgtaatcct 13801 ggcactttgg gaggccgagg caggcggatt gcttgagtcc aggagttcaa gaccagctgg 13861 gtaacaaggc gaaaccccac ctctacaaaa aatacaaaaa ttagccaggt gtggtggtgc 13921 acatctgtag ccccagctac ttggggggct gagatgggag gattgtttga ccccgtcact 13981 tacatatttt cctaagtgat tcagaatcga gtggatgaaa aggcaaagat aggctgggct 14041 tgtctttcat tttacccagc tctgacagtg aaatggaaaa caagttgtgt ttttaaataa 14101 ccttagaatg ttaaccaaag gtactaattc actaggtatg cccatttctg tagtgagcta 14161 attgttgact tacctgtttt cttcaagatt tcaatgtgtt caagtatcag cactgtcagg 14221 aagagaaatt tgacaaacgt gttttgagtg cgtactgtat acagagtact gtactgtgga 14281 attcagtggg gatagacgct gggcaccatg gctcgtgcct ataatcccag agctttggga 14341 gggtgaggaa ggaggatcgc ttgagcttag gagttggaaa ccagcctggg caacatagca 14401 agactgtctc tacataaaat ttaaaaatta gccaagtgag ggctgggtgc ggtggcttgc 14461 gcttgtaatc gcaacacttt gggaggctga ggtgggtgga tcacatgagg ccaggagttc 14521 gagaccagcc tggccaacgt ggtgaaactc tgcctctact aaaaatacaa aaattaccca 14581 ggcatggtgg cgtatgctgt aatcccagct actcggaagg ctgaggcaca agaattgctt 14641 gaacccagga ggaggttgca gtggctgata tcgcgccact gcactccagc ctgggcaaca 14701 
gagcgagatt gtgtctcaaa aaaaaaatta tgtatggtgg catggaccta tagtctcagc 14761 tacttgggag gctcaagtga tcttcctgcc tcagcctccc aaatggctgg gaccactaca 14821 ggcatgtgcc accaggcttg gccaattttt ttagtttttg tagagacaag gtctcgctgt 14881 gttgcctagg ctggtctcct gacgtctagg tcaagccatc ctcctgcttt ggtctcccac 14941 tgggattaca ggcatgagcc accattcctg gccaaaaaag aatttttaaa caggtcttca 15001 agagtgacta gcattttgtc agccttgtgg agaaaggtca ttccagatga tgttaacagt 15061 gtagaccatg agatgggaaa gtagaagaca catttgagaa attgtcaaaa atcatttctg 15121 ttgtgaggat tttgtatagc aaagaatggt aaagaaggtt ggatccttat ttcagaatgt 15181 tttgaatgcc agggtaggag tttatattta attcaggaag caagtcaagt ctcaggtcat 15241 ctctttcttt ctttttgtgc ctgagggttt agtctgcgaa tctacttttc attaagagaa 15301 atttgctttt ctttgggaaa gggtttgaaa aatagggaga agggagtgtt ttcttgatag 15361 gtgcgctgtg gcattgttca gagtagggct ctgtgacctg tttactagat gcatccattc 15421 tgttgggaaa ctttccgaag agactggatg acattcaagc caaagatttc tctcttgttc 15481 ataggatgta ggggaaggac cttgactaat gaatcacaaa ggctgcagct tatgtttcag 15541 tgcattaggg ccaaattgta cacaaattga attggacttt gaaattctac ccagtttctt 15601 aagacaaaac agtaaatatt tgtggccctg aaagatgcag ttggcccaac agctttgatt 15661 ggaaaggaag cgtaataagc tggtctcctt tcagcagcta atcaaggtgt cacttaacaa 15721 tggcctgcgg tccttgctgc agcctaattt ggtttagatc gagtgccctt tgggtcttgt 15781 gctgatcagc taacccataa gggacctttc tgagaccttc tcccttcctc tattgtccca 15841 gcctttgcct ccaagggcag agctctgaaa tagaagctac ttaatgacca gacaataaca 15901 tttgggcagg tttgagagca gagatgttta tcacatctct ttccctcttc tgcagctctt 15961 ctgtcttggg aaccagtaat gtctttaggg tggtaatcct agcactaaat aaggttagaa 16021 tcaggaaaac aagcatttta aacccaatca gtcccccttt gggaaaaact tcctttttca 16081 aaaacaaaag atctgaaggg ggctgttaaa attgtaccac atcttacctt tcatgctaat 16141 gtcgaattag gccattagag ggagaaaggg agaactgagc atttgggcct gagataaagt 16201 taaattcaat tttcatcgta gaactgcatt actgcattcc cgataaatac tgaggtttat 16261 gcagcaaagt gatgggaaag ccatttctca ctaactctgc aatgtgtcat tgaaaaaaac 16321 cagtcccaaa agaataaagg ctgctcatgt tgtaaaggct tttcagaagg 
cttgctgctt 16381 aatctgaagg gaaccagcct aaaccacaag tggtccactt ttctaaatac caaggaaatc 16441 acagccatta aactaatgtt agggcaagaa ctatgtctta attagctttg tatcttggta 16501 cctggccctg accctcacat gtactaaggt gttagtacct gtttggaaga ataagtggat 16561 ggaactgcca accctggaaa gatgaagaac ggatatgact tttccgtggg cctgaaagaa 16621 cactgggtta tacccttccc tgcagagtca tgactgcttt ttcccgcatg gagcctcatg 16681 tacccatgag ctgtttattt tagtctccac taatcaagag ataggaggaa tatcacacgg 16741 gagttatact cagatttggt ggggtttttt tttttttttt gtattctaac aagaaataaa 16801 aagaatattg cttattttgt agattttgtt tatactttaa aaaatttttt tcaggatgga 16861 tagctcagtt attcagggct tggatgttgg tgtgctgtgt gatctcttgt tcatatgatg 16921 tgtcactgaa atctgtgact tctagaccat gaaggagccc tgtaaagaga tcaacccacc 16981 ttaaagacag attgggcctg attcctgtag atttctttgt aaaagaacag agttactgtg 17041 ttaacctcag gtagactatt tcaaaacttg aactaaggta atggtccaga gtgagggaaa 17101 agctctcacc taacttgtgt atatggattg agaaacctgc tgtgaaagtc aaaatgcagg 17161 ctttgtggtc agtttgtttt gtaatgtttc acagtaacat ccaggagatg agggctcggc 17221 acaaagaggc ttttttgaag aaacataacc tcaaactagg cttcatgtcg gcatttgtga 17281 aggcctcagc ctttgccttg caggaacagc ctgttgtaaa tgcaggtgag ttgcttgtgg 17341 ctggaattgg agaggtcctg ggaggtaggt gtatgagcac agacctgctc ccacatgcag 17401 aggatccaac agaccaaggc tttttatttg ctacctatag aaatatcagt atcttcccca 17461 gggatttctt ggttcacctt cctgaagcag catcatgatt actttctacc cgccagctgg 17521 gcagaacagt ttctcttccc agtgtacagc tcggcagctg ctcaggcagc aggcagttct 17581 gggatagccc agctctccag gctgtctcct gtcaaggtgg aagcacgtgt tctggcttcc 17641 tcccctcccc cagctccaca ggccctgtta ccttgggcct gacttccaca gttggtgtag 17701 gctcctggct gacctttgat cttaggtcga tgggcctcac tgggagcttc ggcttggcct 17761 ctcatggacc tggtgcttat tttctttccc tttttacctt acctacgtca ttgtagtgtt 17821 gacctaacaa gcactttgtt ttcctgttga cacacttctc tgttccttgg gagattggag 17881 aggggtcagt gctacttttg ccatcctcag cagtgtcagg gccctaaagg tctagaagta 17941 gaagccctag tccctcctgt tgagtggtgc tctagggtgg tagaaggcca tctttgtttt 18001 gcccaaatgt atacctcttc tttttcccat 
tcttctcatg cctccctgac tggctctcat 18061 ctttttcagt ggcccccttc tgcttgccca cacatgtagc ggctctcaaa agttctcgtg 18121 gcctcttttt cacattacat gcttcctctg gatgtcttat ctaacaatac aacatggact 18181 ttctgcggaa atgattccca agcctttatt ttcccattag ttgctaggtg tacattttca 18241 gatgctggtt tgccatctct tgaggtagca tatttcaaag ccagatcacc cccgcttact 18301 cttggatctc ccctcttcct cttgttgaga gttggaagca gagaatgggt agtatctcta 18361 tatccctttt cctttgctaa tttgcccttt tttggaacat ttgttaaccc tagtcattct 18421 ctgctaaagg agggcttgac tagagctact taacgagagt ttagtgcata cctatcctat 18481 ggaagccagt gtgttgagcc cttcttaggg ctgccaaaga aatggatgct cctatgccca 18541 ggctcagtca gggaccttgg gagatactat atttgggatg ggagtaagtg gaatgccagc 18601 tgccatgcat aatgcagtgg tgttcatggt ggttggtcag cccacactgt atcaagctct 18661 gcagattttt actctgttaa tgcacatatt acctcattag tcttggcctt cctggggagt 18721 tgtatgtaac atgtattttc tctctcatag tgattgacga cacaaccaaa gaggtggtgt 18781 atacggatta tattgacatc agtgttgcag tggccacccc acgggtatgt tggggcagga 18841 ggtggggaat gttggtctta gaccctcacc ttatctgtgt gaaggagatc acacaagaga 18901 aatcatttct ttaattctgt atttttagaa gggagttagt aaaagtaacc tttttttctt 18961 ttaaaccatg ccgcattctt ttaacacttt ctgttaatca cactgagtaa tgaaggtatt 19021 ctagggaggg cataccatgg gttgaattca agggagttgt taactataaa aggtactatt 19081 aatttgcaac tgaaaacagc ttttcacccc cttcagggtc tggtggttcc agtcatcagg 19141 aatgtggaag ctatgaattt tgcagatatt gaacggacca tcactgaact gggagagaag 19201 gtaaagtaga aagatgtata caagctgcta agcaggcgag ggaagagagc cttcagaagg 19261 ctgggctcac tagcaagcag tgctcatgga aagtcaagtg catagcccct gagaaaagca 19321 gttgcccatg agagctaagt attttatata tctggtggag tttaccatgc tctggtaact 19381 gaaatctcat cagatgagac ctgctagaaa agatcatctt tgaagtcttc aactgtatga 19441 aattgtgcct tttctctggt ggagcactgg tcacagacac cttagccgaa caactgtcac 19501 acggaagcta gcgcttgtgc catcgatctc gatacaagct ttctggactt cctttgcttt 19561 tttgttttta gaggcagggt ctcgctgtgt tgcccaggct ggagtagagt ggtagagtca 19621 tagcttactg cagccttgca ctcctgggct caagcgattc tcctgcctca gcctcccaag 19681 tagctgggac 
tacaggtgtg caccaccatg cctggctaat tcttaagttt tttatacaga 19741 tgggggtctc actatgttgc caggttggcc ttgaactcct ggcctcaagc agtcctcctg 19801 cctcggcctc ccaaagcgct gggattatgg actgacttta ggtgaaggaa actggggtgg 19861 cttaatattt caattatgaa atctgtttta ggcccgaaag aatgaacttg ccattgaaga 19921 tatggatggt ggtaccttca ccattagcaa tggaggcgtt tttggctcgc tctttggaac 19981 acccattatc aacccccctc agtctgccat cctggggatg catggcatct ttgacaggcc 20041 agtggctata ggaggcaagg taggaaccgt cacttctaag gtcctagtgg ctaggtctcg 20101 atgaaaggga aatccaacta aatgctaatt gtaaaacatt aactataatt tcaggcctaa 20161 gttccatttc ttagtttcta atagctaagg cactgattat caaattgtgg tctggatcag 20221 catcatctgg gaccttatta gaaatgcata ttcttagacc ccatcccaga cttaaagaag 20281 aagtgcagat gatcctgatg catattcaag tttgagaacc actgagctga ggaggcttgt 20341 ttgcttctat ggagtggggg atatagttgg aaacgtggtc ccttgcccct gggaaacaga 20401 tgacaattat gcatgtcagt ctgtgagctg ccaagttgta tgataaagct tgtaaatttc 20461 ccgggagttc agagaattaa gtattaaaag gggcttgact ggcccttaga agatacgttg 20521 gtatcaaaaa agctttctgg gaactcctgt gtttagaatc ctttttgggt tactgactcc 20581 tgtgtgaaac cacagtctgt agatcttcct ggaaaatgcc catacaccca actacacata 20641 caattccaga atttgtagac cctggcttaa tttttttttt ttttttaaat tctaacccca 20701 gagagaagca agtagaggca gccagaactg aagggaaaac tttccttgta ggcagcagat 20761 gcgttagagg gcagagtatg tttttaaaaa ataaaaggca gttgtgagaa gacagttttc 20821 ttggcaaact ttgtttctga gtggggaacg tttgcactga gggtaagtcc tggtctttga 20881 aatactgtaa atatgcagag cgtaacatca ataggaaagg ctctggaatt aggaactttc 20941 ttatgggctg tgctaaatct ccttcagcga ggctggctgt ggcttgctgg agacaaacct 21001 atttaccttt cctctctgta ggtagaggtg cggcccatga tgtacgtggc actgacctat 21061 gatcaccggc tgattgatgg cagagaggct gtgactttcc tccgcaaaat caaggcagcg 21121 gtagaggatc ccagagtcct cctcctggat ctttaggagg aacccacaca ccctacaagt 21181 tgatcatgca ggaactgaaa accagtcttc tccctgtccc ctcatgggtc ccgggttagc 21241 ctggtgacag gcagacacat gctgttggcc tcaagcaagg aagcagagca ctgtgtaacc 21301 agcagtcaca ggtcttttct tggcgttcct gccaggctct ccctctctgc acctgtctca 
21361 tagcctcgaa tatcttaatt ccttaggctt aagagagaga gccttaatgg atgctcattc 21421 atattcctgc ctttcttcca tcagctctct gcaaagatga ttttgctttt ccctagtgct 21481 ggtatactat agagaaaccc ctggggatca tgtgattaag ttcctatctt ttgaaagttt 21541 gttctgcaga gacttctagg aggatgctgt gcctcccaag ctcagagcag cctctgtcct 21601 ggctgtgcac attctccctt gattccactt gtgtggaggg attgaacaca ggcaaagagg 21661 tgctgctttg cttcttcaat ggcaccttca ttctccgttg tcattgactt caagatgcct 21721 cttctacctc ttccaggaag cacaggccag gggatctggg tgtgtgagtg ggaggagagg 21781 gcagaggtcc cctgaggtca tgcattgtaa tcatcataga aggagagccc aggcctgccc 21841 tcacgctctc catcataggc tgacaccaag aagactcgtc ttggcacaat ctcacacagc 21901 tggggctgta gcaacccttt ccaacccctt tgctggttgc tgggcctcat tctagcacct 21961 tgttcttaga gcagattcta gcacatcatg gcagtgggac caagcgtggt cccgaggaag 22021 ggccagagcc tggtagagac tagggaaggg aggtctcctc tagactgact cacattgcct 22081 tgagcttttc agttaagttg ctgtaagcac ctgggctgag gaggcagttt ttgttccttc 22141 ctgcgttata gcggggcctt gtctcttcct ctgcaggaca cagatctgga ggacgtggac 22201 tgcggtagga aaccaccctg agggtgttag tacctagtgg tgaaacggat gaggtcattt 22261 ctaaggtgtg ttgcccgtgg aatctgggca caactcattg gaattccttg gagccactgg 22321 gattcatggc tttgtatcca actgcatcca ggcctgaggc tgctgacgtt tgacaccagg 22381 gccagtagag agtgcccttt tgtatcttaa gccaagtaag tgaggcctgg gggtggggga 22441 ggggggaagg ggtgggagcc aatactgagt gcctgcagca tctactactc tgtcttcact 22501 attcagaacc ttgtaactaa agtatttaaa gaaactgatt ttaaatgcaa attaaagggc 22561 agatattctc aaa // LOCUS HUMEDN1B 12461 bp DNA PRI 07-NOV-1994 DEFINITION Homo sapiens endothelin-1 (EDN1) gene, complete cds. ACCESSION J05008 NID g340555 KEYWORDS endothelin-1; vasoconstrictor. SOURCE Homo sapiens (clone library: lambda-EMBL3) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12461) AUTHORS Inoue,A., Yanagisawa,M., Takuwa,Y., Mitsui,Y., Kobayashi,M. and Masaki,T. TITLE The human preproendothelin-1 gene. 
Complete nucleotide sequence and regulation of expression JOURNAL J. Biol. Chem. 264 (25), 14954-14959 (1989) MEDLINE 89359303 FEATURES Location/Qualifiers source 1..12461 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda-EMBL3" /map="6p24-p23" repeat_region 98..383 /rpt_family="Alu" protein_bind 739..745 /bound_moiety="acute phase reactant regulatory element" misc_feature 979..1039 /note="Z-DNA region; putative" protein_bind 2183..2188 /bound_moiety="acute phase reactant regulatory element" protein_bind 2951..2958 /bound_moiety="TPA/JUN" protein_bind 3241..3248 /bound_moiety="TPA/JUN" protein_bind 3316..3328 /bound_moiety="NF-1" protein_bind 3499..3505 /bound_moiety="TPA/JUN" CAAT_signal 3510..3515 /gene="EDN1" /note="G00-119-861" TATA_signal 3577..3582 /gene="EDN1" /note="G00-119-861" exon 3608..3939 /gene="EDN1" /note="G00-119-861" /number=1 gene join(3876..3939,5585..5753,7183..7338,7503..7646, 9212..9317) /gene="EDN1" CDS join(3876..3939,5585..5753,7183..7338,7503..7646, 9212..9317) /gene="EDN1" /codon_start=1 /db_xref="GDB:G00-119-861" /product="endothelin-1" /db_xref="PID:g556203" /translation="MDYLLMIFSLLFVACQGAPETAVLGAELSAVGENGGEKPTPSPP WRLRRSKRCSCSSLMDKECVYFCHLDIIWVNTPEHVVPYGLGSPRSKRALENLLPTKA TDRENRCQCASQKDKKCWNFCQAGKELRAEDIMEKDWNNHKKGKDCSKLGKKCIYQQL VRGRKIRRSSEEHLRQTRSETMRNSVKSSFHDPKLKGKPSRERYVTHNRAHW" sig_peptide 3876..3938 /gene="EDN1" /note="G00-119-861" intron 3940..5584 /gene="EDN1" /note="G00-119-861" /number=1 protein_bind 4612..4618 /bound_moiety="acute phase reactant regulatory element" exon 5585..5753 /gene="EDN1" /note="G00-119-861" /number=2 mat_peptide join(5677..5753,7183..7219) /gene="EDN1" /note="big form of endothelin-1; G00-119-861" /product="endothelin-1" mat_peptide 5677..5739 /gene="EDN1" /note="small form of endothelin-1; G00-119-861" /product="endothelin-1" intron 5754..7182 /gene="EDN1" /note="G00-119-861" /number=2 exon 7183..7338 /gene="EDN1" /note="G00-119-861" /number=3 intron 7339..7502 /gene="EDN1" 
/note="G00-119-861" /number=3 exon 7503..7646 /gene="EDN1" /note="G00-119-861" /number=4 intron 7647..9211 /gene="EDN1" /note="G00-119-861" /number=4 protein_bind 8205..8218 /bound_moiety="NF-1" protein_bind 8260..8266 /bound_moiety="acute phase reactant regulatory element" exon 9212..10443 /gene="EDN1" /note="G00-119-861" /number=5 protein_bind 9392..9398 /bound_moiety="acute phase reactant regulatory element" protein_bind 10751..10764 /bound_moiety="NF-1" BASE COUNT 3372 a 2728 c 2816 g 3544 t 1 others ORIGIN 1 gatatcctat taatacagag atacagaaag aaatacataa aaaatagttt tatcaaatac 61 tttccagcat tcaagtgtag cctcaaaagc aagaataggc caggagtggt ggctcacgct 121 gtaatccaca gcactgtggg aggccaaggt aagaggattg cttgaggcca ggatttcaag 181 accagcctag gcaacatagt gagatcccta tctctacgaa aaaattttaa aacttagctg 241 ggcatggtgc ttgagcctgt tgtcccagct actcaggagg tgaagtagga gtgtcacttg 301 agcccaggag gttgaggctg cagtgagcta taactgcacc actgcactcc agccttggag 361 acagagtgag accctgtccc caaaaaaatt aaaattgaga aaaaaaaaaa ggcaagaaca 421 gccacagcaa actttctatt ggggaaaaaa aaaaatcctc ctctttacat ctctcccttc 481 cttcccttcc ctttctgaga gtgactgtgg ccaaaaggag cattttcccc ctgcagtcct 541 ctgaggggtg gggtggggct atgaagctat ccttcatatt cactcctttg tccagctctt 601 ttcacctcta gttcttctcc ccgcatctct gtctagcagt gccttaagtg gaggaggggt 661 gggggcatca agcttgtaaa actggtttgt tggggttctc cttctcccct catttcttga 721 ttcttgggaa aatgtcttgc tgggaggctg cctggcgagt gccctagctg ccttctgtgg 781 gcttgaatgg ggcttccctc tgcccctaca ggaggaaaag ggagctgctg ccagagggag 841 aaatggagag atggacagag aaggcaggtg ccacccctcg cccctgacac acaaagaaaa 901 agacacggaa attctctctc tctcttctct tctcctatct ctctctctct ccctctctct 961 ctctctctct ctctctctca cacacacaca cacacacaca cacacacaca caggcgcgcg 1021 ccgcgcgcgc aggcacacgt cttgcaaatt caggattcaa agagacaggg gcaccattat 1081 atttggcacg gtggggcctt ccaggtctga aatcctgcat tcttccttac tatttacttt 1141 ccccgagctc gagaagggcc aggtgtgggc ggatggctgg ccacgttttg tgtttccaat 1201 tcatattcac gggatgacac agacggggcg tggtgagtgc tgttggaggc gcttgggcag 1261 
tttcattttg ccccacttct ccacctgaag gctgggcgtt gctggaacct gcaggggcag 1321 cctcagcaag gtggggtggc gtggagtggg gtgggagaag ggactccagc tgaagtagaa 1381 cccaggctgg acctgagaat attggggagg gcatgggcgg tggtttccgg gtaggggcct 1441 tgaggacatg ttggtcctga ctgttgtcag tgtttggtca aagttgccaa aaggttaaaa 1501 aaaaaaaagt agggggagtc cctgccaaga catatttccc aggccacctt tcttccgcgg 1561 gagtgttggg ggggaggcgc tgcttggaac ctgtgaatgt gacatcagct ctcctctcct 1621 ctcccaaggt cggctttgga gagggaggtc agggcaccct tgcctggcac aggcacgctg 1681 gcttccggct cagtgccgcc tgctctccgg gagctgtgcg ctccctgggc cccggggcta 1741 ggctgaggta agcgcacagc ggaggccagg cgcgccggca gaggcctggg ggatagggtg 1801 gaggcatctc tgggtgtggg tgtgggtgtg ggtgtgggag ggagagttct tgcctctctc 1861 tctcccatct ccaactcttg cttcagtggc tcttttagag gatgcatgtc attatggacc 1921 tgtcgctgcc actgtccctg ttcccccagc tgtgacttcg agggaggtct ggggatctga 1981 gtctgtccaa acccacggct ttgctgttgg gataaaaact gtccttttga ttttagaagg 2041 aggagggaaa aaaggtttcc cagcatgtgt gttgtgccag tcttggaaat tcatccgtgc 2101 ttgaattcca ccctccatcc ccagaaaaac tggagtaaaa caaaaagagg agatggacaa 2161 agtgtgtatt tgatggcatc ccctgggaag agactctaaa tttatcccat aggtcttact 2221 gggccactgt gagcgctttg gtggagaaca aacaaaaatt ctgggtgctc agttgtctaa 2281 cctgaaaaat gggactagcg gaaaaagcca atgtgttcca tgcacctttt gctttcttta 2341 ttaaggcatg atgtcacctg tacagtaact gccctgtgtg tacttcaggg ggggatttca 2401 aggttagata gacaggaaat tgttttgaaa atgtaaacac attattaaat gtgaagtatt 2461 atctgattcc ttgttcgaat ggcatttcct tctcagcacc accttccttg catattcact 2521 taaccttgta caagaacacc tttttgccct aaatgaagac acccccccaa aaaaaagagt 2581 cccagaaaat atgtccctgc ttgtgcggga ataaatagaa tattctgagg tgcattcctc 2641 cttcctatgt taggcaacat tccttgaccc tcctcggccc ccaagccagg ttgcgttttt 2701 ttctgccatt tagaagggtt ttcctttttg tcctagtaaa acatcagccc ctgtagctct 2761 tcatctcccc ctggtgttct tctcccgcca tgtcttaaga ttggtggcac cgaccaatct 2821 taagatttaa gttctgtgtg aaaaacacct ttgcttttca atcagtttat cagcctcctc 2881 cgcaggggaa tgtggacaca caaaagaact tatcggggct tctcatcagt gatagggaaa 2941 agactggcat 
gtgcctaaac gagctctgat gttattttta agctcccttt cttgccaatc 3001 cctcacggat ctttctccga tagatgcaaa gaacttcagc aaaaaagacc cgcaggaagg 3061 ggcttgaaga gaaaagtacg ttgatctgcc aaaatagtct gacccccagt agtgggcagt 3121 gacgagggag agcattccct tgtttgactg agactagaat cggagagaca taaaaggaaa 3181 atgaagcgag caacaattaa aaaaaattcc ccgcacacaa caatacaatc tatttaaact 3241 gtggctcata cttttcatac caatggtatg actttttttc tggagtcccc tcttctgatt 3301 cttgaactcc ggggctggca gcttgcaaag gggaagcgga ctccagcact gcacgggcag 3361 gtttagcaaa ggtctctaat gggtattttc tttttcttag ccctgccccc gaattgtcag 3421 acggcgggcg tctgcttctg aagttagcag tgatttcctt tcgggcctgg cttatctccg 3481 gctgcacgtt gcctgttggt gactaataac acaataacat tgtctggggc tggaataaag 3541 tcggagctgt ttacccccac tctaataggg gttcaatata aaaagccggc agagagctgt 3601 ccaagtcaga cgcgcctctg catctgcgcc aggcgaacgg gtcctgcgcc tcctgcagtc 3661 ccagctctcc accgccgcgt gcgcctgcag acgctccgct cgctgccttc tctcctggca 3721 ggcgctgcct tttctccccg ttaaagggca cttgggctga aggatcgctt tgagatctga 3781 ggaacccgca gcgctttgag ggacctgaag ctgtttttct tcgttttcct ttgggttcag 3841 tttgaacggg aggtttttga tccctttttt tcagaatgga ttatttgctc atgattttct 3901 ctctgctgtt tgtggcttgc caaggagctc cagaaacagg taggcacgct cgttgacttg 3961 taagtctcgg aattacaagt tagtgtgttc ttatccacct tcatgctttt cttgcttcta 4021 tttttccccg ttctttttat gactgcagct tagagagcaa gtgtctgaga attattgctg 4081 aaacgtactt taagtcttct agtgtaaaat gtaaaattcc tctactgaat acaattaggt 4141 gcaattgact ataacatgac attaaaataa cttatcgttt tattattatt attccattat 4201 gtgtttcctt ggcttttaaa aaatgagaag agtatggaca tatacaattt agtcaaatgt 4261 atgtttgtaa tatatgtgtt tatacaggta cacaggccat ataggaactt aaatcttatt 4321 taaacactat tttaatagtg tgttaacgtg taaaatattt aagcattcca gcttgaagcc 4381 aaggaattgt atccagtcgt tcaagcaatg tatgttcagt aaaatcacct gcagagcaaa 4441 agtctgttga ctaactaccg cctcccccgc ccccccacca ccccccgcag gcggtttctg 4501 ggtgaagcag atgttttctt taaaatttgt catcattgac tttaggtttc ttttggcagg 4561 tttttggcac ccaaaacagt gtgagctctc ttttcagctt tattcacctg tgctgggagg 4621 ggagctagga taattcttgg 
ctgccgaagg atttaggcag tgcgtgtgca tctgcccggg 4681 tcccccccgt ttttagggtc agtgcacttt ttttgtcttt tcgtgaccct gactaaagag 4741 aaaggatgtc aagggaatga aaatcctgga atgtgtctga tcatttgaaa tgtacaaaat 4801 tgggcagata agctgcatgg ctaaattgtt aggaggaaga ggcaaggcag tagtggagaa 4861 gggggaggca gtggatccca cacaagcctg atgcccaggg attcggaatt caaaatcccc 4921 ccagcctacc ttcagtcccc tgacctgctt ctcagcccca ccttaggtca ctggtttcta 4981 tggagttacc ctactgaatt gaatattgaa tagttaattt ctctctccaa tcattttccc 5041 cacctaattt tgaaagatat acatcatctg gggtaccctg tgccctacac agcatgtgaa 5101 gtggatgggt accccctaaa gagagggtca tcctgaatgg ggaagtggcc ccaaagctag 5161 gaataactgt gatttcttgt ctttagtcat gtgccaatgt taagtaagct tcagtggata 5221 gtgctgtcct accaagttcc ttgtagaagc cagccggatt ttcaacaggc agcattccac 5281 agcatttccc tgagcctgct tcaagagggg tgggggaagt cccttttcag gtgtttatct 5341 cctctgcatt tgtgtaatct ccctgaaggt ggataagcca agggcatgag ggggaggcaa 5401 aaggtgaact catgttaagg agggaaaaaa ataaagagcc cttttttctg tgtttcttgc 5461 tgatggcagg ctgtgtgctt catctgcttt tatctgctct gctagctctg actctactgt 5521 gatccagcat gtctctcggc gtttgaggag acatccccca ctgacctgct ctttctctcc 5581 ccagcagtct taggcgctga gctcagcgcg gtgggtgaga acggcgggga gaaacccact 5641 cccagtccac cctggcggct ccgccggtcc aagcgctgct cctgctcgtc cctgatggat 5701 aaagagtgtg tctacttctg ccacctggac atcatttggg tcaacactcc cgagtaagtc 5761 tctagagggc attgtaaccc tattcattca ttagcgctgg ctccactgga gcccagtttt 5821 agagtttctt ttctagggac tctgaaggta gtccttctaa caccatccaa gtgcctcagt 5881 ggggacagtt tccctctatt cctgaaaata acgacagctt cgttcttagc aaccaagggg 5941 agggtcttct gaggccccgt agctcaggct actcatgatg ggacaagcag gaggccactg 6001 cacgtttcaa atgaggaact ttcagtgaga gggcctcagg gggacactct cacagtggca 6061 tctgatgggg tttcgggaat aattgccgag gtcagatgtg ggttagtgca acctgtgctt 6121 ctcatgggag ggtggagact gagaggcaga agtgatgata tagagggtta gaatcactta 6181 attttagtta cagaaaaacc taggctcaaa gtgttgaagc catttgtgca ggagtgagtt 6241 tgtagcagag ctagaactgg agcccggatt tcctttgctg ctatattttc cctttagaaa 6301 tgcccatttc agaactgaaa tagaaatact 
gtccataggc ttctctttca cctacagaga 6361 agaaaagcag atttcctcct tctgccctgg acactagttc atcatctgtc ggaagcagtc 6421 ataaacaagc acacatttac tatgcataca atgtaccgtt atgacaaagg aggaccaaaa 6481 tccaaacaat atcaaaccac accaaaaacc acaaggagcc taataattac taaggtgata 6541 cttccaaagg gaggacttta tttcttagat gagaatgaaa atggacacat tggaaattat 6601 tggagagccc tctggctatg agtccttcca caaccatatg gtaccaccga ctggcaggag 6661 aaatgtgtga acatgtgcct cctctcccca accactgggg ccggtggggt gacggtggca 6721 cttttagcag tatcctccgt ggtttgagtt gaaaataagt tttaaaaatc ctgtgagtca 6781 tggttttgca ttgaaacctc ttcccactgt gtacccacaa atagttaact aaatagacca 6841 ttagaaaagg aagaaaatat aaagcagatg ccaagcagag atgtcctaat ttttgacaaa 6901 aaagcaatgt tgcttgtgtc aagaagaaac tgaactttgt gaagagttga aatggaattc 6961 cactgaatta gaaaaacttg ttttctcctg cctggataca tacagtcagg gccattgatg 7021 cacaggtgtt cctggctgtt gttacacttt accctctgaa atgatgctcc caagtgctat 7081 gtgatgagct ccttgtgtgc ccagtggaat aggtgtgtcc atgtgtcatt ttaaagacta 7141 ttaattacac taatatagtt tctttctctc tttggataat aggcacgttg ttccgtatgg 7201 acttggaagc cctaggtcca agagagcctt ggagaattta cttcccacaa aggcaacaga 7261 ccgtgagaat agatgccaat gtgctagcca aaaagacaag aagtgctgga atttttgcca 7321 agcaggaaaa gaactcaggt gagcagaaac acctttgctt ttcaatcagt ttaacagcct 7381 cctgaactcc ttcctatcat ggtactgcct tcctgtttta gagagactaa cagagacatt 7441 gaaagtcagg gtaaagctga atataacatt gctgaaatgt ttttccttgt gtattttaac 7501 agggctgaag acattatgga gaaagactgg aataatcata agaaaggaaa agactgttcc 7561 aagcttggga aaaagtgtat ttatcagcag ttagtgagag gaagaaaaat cagaagaagt 7621 tcagaggaac acctaagaca aaccaggtaa gagggaagga agaaaaatta ggtaagaggt 7681 tcacaagaac aactagcccc agtcagtgat gccagcagcc tgttcctcca gcccttctta 7741 cccgggcagg tgaaagactt agaaaacagt agcagaggag atctatgcat cctatagatt 7801 aaaaggagca aaagaatccc tcttaaatat ttccatgaag ctctggaatg caaaccgatg 7861 tcctctgtac ctttagcaca taccatttca tctacaggta gatttcccaa ccaaaatata 7921 tccagagatg cctttgtcat tgggttatat acagcctttg cctctctgag tcaatgtatt 7981 taccactttc cctgagaaat cgaaaatcat tttggggagc 
ggacatttag aaaaagaatc 8041 aaagtgtcat ggataatcaa attcttcaat aagttgcagt tattcagatg gccaaaggaa 8101 aaataaagtc attagatagg gttggtagaa tttagaacat gctgtttttc aggtttatgg 8161 tctttttttt tttttttttt tttttttaaa tagggaaatg tgtttggtgc agagccaatg 8221 tcattccaaa aagctctctc ttttcctggt cagtcatgtg ctgggacaga gaagggatct 8281 ggattaggca acatcataga gttgctctga gctgctcttt ggtgataacc cttccaaatc 8341 ctaaactttt tggaattcac aagctcaaag gaggaaacct actctctgat ctaccacatg 8401 ttctgcattt ttctatcatg gtctatggaa acttctctta gaaatccagt ggcaagaagt 8461 tctatgatta aagtgttctg agctcaggcc aggcagtcat gaactacttc tgagttgttt 8521 actactgatt tgtggggcag cctcagctat cggtttcttc acacctgctt atgagagtat 8581 ccatatttat ggtcgcaggc agtaatgctc cccacgagat cagtttctga actaacctgg 8641 aattttttat gggtttttat tatgccaact attaaatcaa cattacagtt cttccctctg 8701 tatttctcct gtaaaacatt aggcctgcaa aaaaaaaaaa tctttttaaa aataattgcc 8761 ataaagtatt tgctctgggc ctactgtatg cttcttttyt ttttctctct tttcaactaa 8821 gtcaccgtca atttattaag atggccataa ctattcaaaa cctatgctga gttcctcaag 8881 gcagggtcgc atagtgatga aggttgggat ggggctacgg aagaaaccag aacaactcta 8941 gtttatttaa aacctgtatt tactgcccac ttccccttag acttgaccat atgacccctt 9001 gctccccatt ctaagcatag gggcaggctt tatttttaca atggtaatag atgatatcac 9061 ttgaggtttt atcaaagagt tgcggcgggt ggtgaaagtt cacaaccaga ttcaggtttt 9121 gtttgtgcca gattctaatt ttacatgttt cttttgccaa agggtgattt ttttaaaata 9181 acatttgttt tctcttatct tgctttatta ggtcggagac catgagaaac agcgtcaaat 9241 catcttttca tgatcccaag ctgaaaggca agccctccag agagcgttat gtgacccaca 9301 accgagcaca ttggtgacag accttcgggg cctgtctgaa gccatagcct ccacggagag 9361 ccctgtggcc gactctgcac tctccaccct ggctgggatc agagcaggag catcctctgc 9421 tggttcctga ctggcaaagg accagcgtcc tcgttcaaaa cattccaaga aaggttaagg 9481 agttccccca accatcttca ctggcttcca tcagtggtaa ctgctttggt ctcttctttc 9541 atctggggat gacaatggac ctctcagcag aaacacacag tcacattcga attcgggtgg 9601 catcctccgg agagagagag aggaaggaga ttccacacag gggtggagtt tctgacgaag 9661 gtcctaaggg agtgtttgtg tctgactcag gcgcctggca catttcaggg 
agaaactcca 9721 aagtccacac aaagattttc taaggaatgc acaaattgaa aacacactca aaagacaaac 9781 atgcaagtaa agaaaaaaaa aagaaagact tttgtttaaa tttgtaaaat gcaaaactga 9841 atgaaactgt tactaccata aatcaggata tgtttcatga atatgagtct acctcaccta 9901 tattgcactc tggcagaagt atttcccaca tttaattatt gcctccccaa actcttccca 9961 cccctgctgc cccttcctcc atcccccata ctaaatccta gcctcgtaga agtctggtct 10021 aatgtgtcag cagtagatat aatattttca tggtaatcta ctagctctga tccataagaa 10081 aaaaaagatc attaaatcag gagattccct gtccttgatt tttggagaca caatggtata 10141 gggttgttta tgaaatatat tgaaaagtaa gtgtttgtta cgctttaaag cagtaaaatt 10201 attttccttt atataaccgg ctaatgaaag aggttggatt gaattttgat gtacttattt 10261 ttttatagat atttatattc aaacaattta ttccttatat ttaccatgtt aaatatctgt 10321 ttgggcaggc catattggtc tatgtatttt taaaatatgt atttctaaat gaaattgaga 10381 acatgctttg ttttgcctgt caaggtaatg actttagaaa ataaatattt ttttccttac 10441 tgtactgatt tggaatcatt actgaaattt gtaaggagtg ggccaacgtg attaagtacc 10501 ataaaggcaa ataaatggtt aaagacggtt tcatagaaaa gtgacaatta gaaggatatt 10561 acggtctaag ctaattatat aaagaatttt atctgtatct taaatgttga ttttatactg 10621 cattgaggta aaaacacaaa acaaaaaagc agctttaaca cctctgtctt ctcttgggta 10681 gcagcctcct gcttctcctt cacctgaaaa attctccagg gacttcatcc attaacttgg 10741 ctcaggctat tggcaggatt cacagtttaa gctgatggtg tggtgagaga tgctttatcc 10801 atattaatgg actgaaggaa gtaatggcaa gacaaccccc caaaacatac ctaattatac 10861 aaagttatat accaaagttg cttttagaaa atggcctgct cagagcaagt agaggtttcc 10921 aatggctttt tattttctca cattaaggat gttgtttctt aaggaacatt gagtaccatt 10981 gcttcttcgt gatagcctag gactgccgtg tgcccatgga ggtagagaca ccaggtactg 11041 attctaggtc ctctgccaca aagcaccact tcctctccac tttgccttgg ctggccttgt 11101 cagctcactg gagagcacag tattgcaatt gcagtattgc aaatggtcac tactaactga 11161 attctctaag agcttgatta gccctcgaga atcttccttg cccttctcta atagtgtctg 11221 aaggaattcc tggcatttaa caaatattag catgtagtga tcactgtcgt cctaacagtg 11281 acacatcaga aggatttcaa ataacagtct tcaggcatgc gtaatcaatg tcctgtgcag 11341 agtctccgtc ctcattgatc ctcatttttc tctttaaggc 
acagtccaat gtctttgggg 11401 aattgtttat aaagcttact ttatccataa actgtttctc agtgcgtgac tctgaagaaa 11461 attttgaagt tttgcccatg ttgacaaggt gcttggtctg aacttggcca gtatttaatc 11521 ttgagcaaac gattcaattt ccttctatcg tgagttttct catctatgaa acaagggagt 11581 tgaggggagt ttctttcata cctctgagaa agagtttgag attacataaa gaagttgaag 11641 tggcatgaaa aaaaataaag atctgagctt agaagacatg gatctaatac atttaagagg 11701 aagtcagaat cagagaagcc actgaacaaa acagtccaaa cggagcatag taagtcagat 11761 tgatgagttt tggttgggtt tttcatcagt caaacccttg agcccccctt tcccatgctt 11821 cctgcttcag tatccagtag gaaaaatgaa agggatgatg tagacactct agggcatgag 11881 gatttgcagt aaataagttg ggagactcac agaaaattaa tatttttcaa acatgaagac 11941 gaaacattca attatattac agtccacatc agcttgaagg gtaaactgat gggatgatct 12001 gtcacatttc ttgctctgtt tccagtaaaa gcatggtttc tggaaaccca cttaggacag 12061 ctttctctct ttacactgat agcccaggca agctttgatc tcagaactcc agaaaccaga 12121 gaactctagg tggaatgtgg taacttttgc cagggcagag ggaacaccta ctaataggta 12181 cttcatttgc accaccagag attggcatct tttttgatgg atccactggc tttgatactg 12241 cctgtactcc cccaaaacac agcttgggta ttggactaat ctagagctcc ctcaggagaa 12301 ctcttgctga cattaagaaa gagcaacatt ttgtctttcc aggtgaaaat ccaaggccaa 12361 aaagggagtg actcacctaa gatcacagaa ggagctgtag catctctgga gcctgaacac 12421 ttaagttaag cacgactatt tcacgcagag ggcatgaatt c // LOCUS HUMEF1A 4695 bp DNA PRI 07-NOV-1994 DEFINITION Human elongation factor EF-1-alpha gene, complete cds. ACCESSION J04617 J04616 NID g181962 KEYWORDS elongation factor. SOURCE Human placenta DNA, clone pEFG1, and fibroblast cell line GM 637, cDNA to mRNA, (library of H.Okayama), clone pAN7. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4695) AUTHORS Uetsuki,T., Naito,A., Nagata,S. and Kaziro,Y. TITLE Isolation and characterization of the human chromosomal gene for polypeptide chain elongation factor-1 alpha JOURNAL J. Biol. Chem. 
264 (10), 5791-5798 (1989) MEDLINE 89174636 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Nagata, 20-JAN-1989. FEATURES Location/Qualifiers source 1..4695 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" misc_binding 205..214 /bound_moiety="Sp1" misc_binding 320..328 /bound_moiety="Sp1" misc_binding 332..340 /bound_moiety="Sp1" TATA_signal 546..552 prim_transcript 576..4087 /note="EF-1-alpha mRNA and introns" intron 609..1551 /note="EF-1-alpha intron A" misc_binding 983..992 /bound_moiety="Sp1" misc_binding 1026..1034 /bound_moiety="Sp1" misc_binding 1122..1131 /bound_moiety="Sp1" misc_binding 1132..1141 /bound_moiety="Sp1" misc_binding 1240..1249 /bound_moiety="Sp1" misc_binding 1302..1308 /bound_moiety="Ap1" exon <1582..1725 /gene="EEF1A" /note="elongation factor EF-1-alpha" /number=2 gene join(1582..1725,2092..2271,2377..2673,2757..2907, 2995..3251,3341..3575,3671..3795) /gene="EEF1A" CDS join(1582..1725,2092..2271,2377..2673,2757..2907, 2995..3251,3341..3575,3671..3795) /gene="EEF1A" /note="elongation factor EF-1-alpha" /codon_start=1 /db_xref="GDB:G00-118-791" /db_xref="PID:g181963" /translation="MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEK EAAEMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKNM ITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEP PYSQKRYEEIVKEVSTYIKKIGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTR KDGNASGTTLLEALDCILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGM VVTFAPVNVTTEVKSVEMHHEALSEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPP MEAAGFTAQVIILNHPGQISAGYAPVLDCHTAHIACKFAELKEKIDRRSGKKLEDGPK FLKSGDAAIVDMVPGKPMCVESFSDYPPLGRFAVRDMRQTVAVGVIKAVDKKAAGAGK VTKSAQKAQKAK" intron 1726..2091 /note="EF-1-alpha intron B" exon 2092..2271 /gene="EEF1A" /number=3 intron 2272..2376 /note="EF-1-alpha intron C" exon 2377..2673 /gene="EEF1A" /number=4 intron 2674..2756 /note="EF-1-alpha intron D" exon 2757..2907 /gene="EEF1A" /number=5 intron 2908..2994 /note="EF-1-alpha intron E" exon 2995..3251 /gene="EEF1A" /number=6 intron 
3252..3340 /note="EF-1-alpha intron F" exon 3341..3575 /gene="EEF1A" /number=7 intron 3576..3670 /note="EF-1-alpha intron G" exon 3671..>3795 /gene="EEF1A" /note="elongation factor EF-1-alpha" /number=8 BASE COUNT 1200 a 989 c 1235 g 1271 t ORIGIN 3 bp upstream of SmaI site. 1 cccgggctgg gctgagaccc gcagaggaag acgctctagg gatttgtccc ggactagcga 61 gatggcaagg ctgaggacgg gaggctgatt gagaggcgaa ggtacaccct aatctcaata 121 caacctttgg agctaagcca gcaatggtag agggaagatt ctgcacgtcc cttccaggcg 181 gcctccccgt caccaccccc cccaacccgc cccgaccgga gctgagagta attcatacaa 241 aaggactcgc ccctgccttg gggaatccca gggaccgtcg ttaaactccc actaacgtag 301 aacccagaga tcgctgcgtt cccgccccct cacccgcccg ctctcgtcat cactgaggtg 361 gagaagagca tgcgtgaggc tccggtgccc gtcagtgggc agagcgcaca tcgcccacag 421 tccccgagaa gttgggggga ggggtcggca attgaaccgg tgcctagaga aggtggcgcg 481 gggtaaactg ggaaagtgat gtcgtgtact ggctccgcct ttttcccgag ggtgggggag 541 aaccgtatat aagtgcagta gtcgccgtga acgttctttt tcgcaacggg tttgccgcca 601 gaacacaggt aagtgccgtg tgtggttccc gcgggcctgg cctctttacg ggttatggcc 661 cttgcgtgcc ttgaattact tccacgcccc tggctgcagt acgtgattct tgatcccgag 721 cttcgggttg gaagtgggtg ggagagttcg aggccttgcg cttaaggagc cccttcgcct 781 cgtgcttgag ttgaggcctg gcctgggcgc tggggccgcc gcgtgcgaat ctggtggcac 841 cttcgcgcct gtctcgctgc tttcgataag tctctagcca tttaaaattt ttgatgacct 901 gctgcgacgc tttttttctg gcaagatagt cttgtaaatg cgggccaaga tctgcacact 961 ggtatttcgg tttttggggc cgcgggcggc gacggggccc gtgcgtccca gcgcacatgt 1021 tcggcgaggc ggggcctgcg agcgcggcca ccgagaatcg gacgggggta gtctcaagct 1081 ggccggcctg ctctggtgcc tggcctcgcg ccgccgtgta tcgccccgcc ctgggcggca 1141 aggctggccc ggtcggcacc agttgcgtga gcggaaagat ggccgcttcc cggccctgct 1201 gcagggagct caaaatggag gacgcggcgc tcgggagagc gggcgggtga gtcacccaca 1261 caaaggaaaa gggcctttcc gtcctcagcc gtcgcttcat gtgactccac ggagtaccgg 1321 gcgccgtcca ggcacctcga ttagttctcg agcttttgga gtacgtcgtc tttaggttgg 1381 ggggaggggt tttatgcgat ggagtttccc cacactgagt gggtggagac tgaagttagg 1441 ccagcttggc acttgatgta attctccttg 
gaatttgccc tttttgagtt tggatcttgg 1501 ttcattctca agcctcagac agtggttcaa agtttttttc ttccatttca ggtgtcgtga 1561 aaactacccc taaaagccaa aatgggaaag gaaaagactc atatcaacat tgtcgtcatt 1621 ggacacgtag attcgggcaa gtccaccact actggccatc tgatctataa atgcggtggc 1681 atcgacaaaa gaaccattga aaaatttgag aaggaggctg ctgaggtatg tttaatacca 1741 gaaagggaaa gatcaactaa aatgagtttt accagcagaa tcattaggtg atttccccag 1801 aactagtgag tggtttagat ctgaatgcta atagttaaga ccttacttat gaaataattt 1861 tgcttttggt gacttctgta atcgtattgc tagtgagtag atttggatgt taatagttaa 1921 gatcctactt ataaaagttt gatttttggt tgcttctgta acccaaagtg accaaaatca 1981 ctttggactt ggagttgtaa agtggaaact gccaattaag ggctggggac aaggaaattg 2041 aagctggagt ttgtgtttta gtaaccaagt aacgactctt aatccttaca gatgggaaag 2101 ggctccttca agtatgcctg ggtcttggat aaactgaaag ctgagcgtga acgtggtatc 2161 accattgata tctccttgtg gaaatttgag accagcaagt actatgtgac tatcattgat 2221 gccccaggac acagagactt tatcaaaaac atgattacag ggacatctca ggttgggatt 2281 aataattcta ggtttcttta tcccaaaagg cttgctttgt acactggttt tgtcatttgg 2341 agagttgaca gggatatgtc tttgctttct ttaaaggctg actgtgctgt cctgattgtt 2401 gctgctggtg ttggtgaatt tgaagctggt atctccaaga atgggcagac ccgagagcat 2461 gcccttctgg cttacacact gggtgtgaaa caactaattg tcggtgttaa caaaatggat 2521 tccactgagc caccctacag ccagaagaga tatgaggaaa ttgttaagga agtcagcact 2581 tacattaaga aaattggcta caaccccgac acagtagcat ttgtgccaat ttctggttgg 2641 aatggtgaca acatgctgga gccaagtgct aacgtaagtg gctttcaaga ccattgttaa 2701 aaagctctgg gaatggcgat ttcatgctta cacaaattgg catgcttgtg tttcagatgc 2761 cttggttcaa gggatggaaa gtcacccgta aggatggcaa tgccagtgga accacgctgc 2821 ttgaggctct ggactgcatc ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc 2881 ctctccagga tgtctacaaa attggtggta agttggctgt aaacaaagtt gaatttgagt 2941 tgatagagta ctgtctgcct tcataggtat ttagtatgct gtaaatattt ttaggtattg 3001 gtactgttcc tgttggccga gtggagactg gtgttctcaa acccggtatg gtggtcacct 3061 ttgctccagt caacgttaca acggaagtaa aatctgtcga aatgcaccat gaagctttga 3121 gtgaagctct tcctggggac aatgtgggct tcaatgtcaa 
gaatgtgtct gtcaaggatg 3181 ttcgtcgtgg caacgttgct ggtgacagca aaaatgaccc accaatggaa gcagctggct 3241 tcactgctca ggtaacaatt taaagtaaca ttaacttatt gcagaggcta aagtcatttg 3301 agactttgga tttgcactga atgcaaatct tttttccaag gtgattatcc tgaaccatcc 3361 aggccaaata agcgccggct atgcccctgt attggattgc cacacggctc acattgcatg 3421 caagtttgct gagctgaagg aaaagattga tcgccgttct ggtaaaaagc tggaagatgg 3481 ccctaaattc ttgaagtctg gtgatgctgc cattgttgat atggttcctg gcaagcccat 3541 gtgtgttgag agcttctcag actatccacc tttgggtaag gatgactact taaatgtaaa 3601 aaagttgtgt taaagatgaa aaatacaact gaacagtact ttgggtaata attaactttt 3661 tttttaatag gtcgctttgc tgttcgtgat atgagacaga cagttgcggt gggtgtcatc 3721 aaagcagtgg acaagaaggc tgctggagct ggcaaggtca ccaagtctgc ccagaaagct 3781 cagaaggcta aatgaatatt atccctaata cctgccaccc cactcttaat cagtggtgga 3841 agaacggtct cagaactgtt tgtttcaatt ggccatttaa gtttagtagt aaaagactgg 3901 ttaatgataa caatgcatcg taaaaccttc agaaggaaag gagaatgttt tgtggaccac 3961 tttggttttc ttttttgcgt gtggcagttt taagttatta gtttttaaaa tcagtacttt 4021 ttaatggaaa caacttgacc aaaaatttgt cacagaattt tgagacccat taaaaaagtt 4081 aaatgagaaa cctgtgtgtt cctttggtca acaccgagac atttaggtga aagacatcta 4141 attctggttt tacgaatctg gaaacttctt gaaaatgtaa ttcttgagtt aacacttctg 4201 ggtggagaat agggttgttt tccccccaca taattggaag gggaaggaat atcatttaaa 4261 gctatgggag ggtttctttg attacaacac tggagagaaa tgcagcatgt tgctgattgc 4321 ctgtcactaa aacaggccaa aaactgagtc cttgggttgc atagaaagct tcatgttgct 4381 aaaccaatgt taagtgaatc tttggaaaca aaatgtttcc aaattactgg gatgtgcatg 4441 ttgaaacgtg ggttaaaatg actgggcagt gaaagttgac tatttgccat gacataagaa 4501 ataagtgtag tggctagtgt acaccctatg agtggaaggg tccattttga agtcagtgga 4561 gtaagcttta tgccattttg atggtttcac aagttctatt gagtgctatt cagaatagga 4621 acaaggttct aatagaaaaa gatggcaatt tgaagtagct ataaaattag actaattaca 4681 ttgcttttct ccgac // LOCUS HUMEMBPA 3608 bp DNA PRI 15-SEP-1990 DEFINITION Human eosinophil major basic protein gene, complete cds. 
ACCESSION M34462 NID g182079 KEYWORDS eosinophil major basic protein. SOURCE Human fetal liver (library ATCC 37333), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3608) AUTHORS Barker,R.L., Loegering,D.A., Arakawa,K.C., Pease,L.R. and Gleich,G.J. TITLE Cloning and sequence analysis of the human gene encoding eosinophil major basic protein JOURNAL Gene 86, 285-289 (1990) MEDLINE 90215311 FEATURES Location/Qualifiers source 1..3608 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 224..233 TATA_signal 265..271 prim_transcript 294..3574 /note="MBP mRNA and introns" intron 348..986 /note="MBP intron A" misc_signal 994..1001 /note="eukaryotic initiation signal" sig_peptide 999..1043 /note="eosinophil major basic protein signal peptide" exon <999..1056 /note="eosinophil major basic protein precursor, (MBP; first expressed exon)" /number=2 CDS join(999..1056,1626..1933,2235..2366,3077..3188, 3409..3467) /note="eosinophil major basic protein precursor" /codon_start=1 /db_xref="PID:g182080" /translation="MKLPLLLALLFGAVSALHLRSETSTFETPLGAKTLPEDEETPEQ EMEETPCRELEEEEEWGSGSEDASKKDGAVESISVPDMVDKNLTCPEEEDTVKVVGIP GCQTCRYLLVRSLQTFSQAWFTCRRCYRGNLVSIHNFNINYRIQCSVSALNQGQVWIG GRITGSGRCRRFQWVDGSRWNFAYWAAHQPWSRGGHCVALCTRGGYWRRAHCLRRLPF ICSY" intron 1057..1625 /note="MBP intron B" exon 1626..1933 /number=3 mat_peptide 1883..1933 /note="eosinophil granule major basic protein" intron 1934..2234 /note="MBP intron C" mat_peptide 2235..2366 /note="eosinophil granule major basic protein" exon 2235..2366 /number=4 intron 2367..3076 /note="MBP intron D" mat_peptide 3077..3188 /note="eosinophil granule major basic protein" exon 3077..3188 /number=5 intron 3189..3408 /note="MBP intron E" mat_peptide 3409..3464 /note="eosinophil granule major basic protein" exon 3409..>3467 /note="eosinophil major basic protein precursor" /number=6 polyA_signal 3556..3561 BASE 
COUNT 796 a 850 c 1005 g 957 t ORIGIN 1 ttcaaaacac cagtaaaaca ggggagatat gtattttgga aaagcaccca aggcgattct 61 gaagtgtagc ccaggataag aaccattgcc cagagctgtt ccagatggcc cctgggttcc 121 tgaagtgggt atcgggagag aaatcttcac tgaatgaatg agtgggctcc ccagggaagt 181 gatgaaatgg tccttatcag ccttgctatc tccctctgac agaggcaaac tctctctccc 241 tgggggaagt tcctccaagg cctctatata agaagtcttt gtgagaggaa gcaaagaagg 301 acctgggctt tgggaagatc taaagaccca ggaaggtctc tgggtgggtg agtgctttct 361 ctgctgtggt ggagctggtg acagtttatt ctcccaggag gtccctggct gtggctgaca 421 gtttctggag ggctggcagg cgtctacctg tggctttcag gttatgagga tgtcagcagg 481 ggcagccttc atcctctgcc ttgcacattc cttctgcggg atgtgaaagt gctccttggc 541 tggggaaagg agatggtgga gacatggagg agggtgtggg tggcttcttg aactctgagg 601 aggggacata ccttctaagt cctatgtgtt cctaggaaag ccaataatca ttgcttctcc 661 cgcctttttt atgtcataga ctctgaggga cccattaagt acaaacaaat aagcgtaata 721 gtcccttctt tacttccggg cctgaaggaa agccagcctc agccacccct cagggtttgc 781 tgcgttctgt ttagaaagag gtccttgcgt cctggatcct ggagcatcag gagctgggct 841 tggcatgagc ttttctggcc catcctgatt tctattcagg ccttcttttt ctccacctca 901 ctcccacggt cccctaatgg tgtgattgtg atgtgtgtgc atgtgtgtct gtgtgtgtca 961 atgacaaact gtgttctccg ttgcaggata aagccaagat gaaactcccc ctacttctgg 1021 ctcttctatt tggggcagtt tctgctcttc atctaagtaa gtgttttttg ccttcagtct 1081 ttctttctct gttttttccc tttctatggt agatggggtc agagttacac acccaccccc 1141 ttctttgatc gtcttctatt tctgaatttc tgtgtgctta aagggatggg gactctatgg 1201 ccaggagttg aaaggatttc tcaaggcgtc tgttatgtct gtggtcttgg ttctactgtg 1261 acattcccaa ttttgtcctt tctccattat gcttactttg agcttactga gtgccttctc 1321 tcctttaact ctcttagcat cgccatgaag taggtggtat tgtataccca tttcacagaa 1381 atacagctgg tggatgatgg aaccagtacc caagcccatg actgcccgac tctaagtcca 1441 tgctcttaac caccttgacc ttgtcaggca gcttgggttc ccctcataga gactgggttc 1501 caggttcccc ttcccaggca gagttgagca ctctgatgcc cagggcaagg tgtgagctgt 1561 ctgtggttct ggggaggaac aaggggagat gtgaaggaag gacacttagc tatcctccct 1621 gccagggtct gagacttcca cctttgagac ccctttgggt gctaagacgc 
tgcctgagga 1681 tgaggagaca ccagagcagg agatggagga gaccccttgc agggagctgg aggaagagga 1741 ggagtggggc tctggaagtg aagatgcctc caagaaagat ggggctgttg agtctatctc 1801 agtgccagat atggtggaca aaaaccttac gtgtcctgag gaagaggaca cagtaaaagt 1861 ggtgggcatc cctgggtgcc agacctgccg ctacctcctg gtgagaagtc ttcagacgtt 1921 tagtcaagct tgggtgagtg gcctatggct gaggctgagg tgggagcatg gaacgggtgt 1981 gggatatgcc cccagcattg ctatcactgg ctctttttcc cattgagggc cctgggggtg 2041 tcagtagaac ctgagcctca gagaggtgtt ggggtaagag gggagggcca cctacaaaca 2101 gaagttgcat tttggtctcc aaccttcaaa tggttgtggc aggggaggga gggaatgaat 2161 tgtggggact caagacccat gtgaattcat gtaggaagga tgctccattc tttgtctttt 2221 atcctgccct gtagtttact tgccggaggt gctacagggg caacctggtt tccatccaca 2281 acttcaatat taattatcga atccagtgtt ctgtcagcgc gctcaaccag ggtcaagtct 2341 ggattggagg caggatcaca ggctcggtaa gagaagtgtg aacactaaat ggggtgcacc 2401 tgctgatctc agccagcact cagcttgcat cagatttgtc tgtttttctc ctgtataatc 2461 tccagaagaa ccagggatag atggacaccc acagacaaca ctgagggggc tgcctgggca 2521 ttcagggaag agctaaggat ttagaatcag gaggtttggg tccaagttcc tttccatctc 2581 tcactatcta tgtaacttaa gttagctggg catggtggtg catgtctgta atcctagcta 2641 cttgggaggc tgaggcagga gagtcactgg aacctgggag acagaggttg cggtgagccg 2701 agatggagcc attgcactcc agcctgggca acaagagcga aactccgcct caaaaataaa 2761 taaataaata aataaaataa aaaaaaatta aaacaagacc atgagtttgt ttcctcatct 2821 ctaggatgag ttggcaaccc ttgttctacc ttttgttagg gctggaagga caagcctgtc 2881 actgggatgc atagaatctg atggtgataa ttgccgtgga tcagcatttc agatgactag 2941 gacagttccc atcatggtcc agcagggaag ggcccattgc ccggtgggca gcagaaagag 3001 ctggcagata cggggccagg tctgcttctc tgccttccct ctgccccatc ccttcttccc 3061 ctcttgcttt ctccagggtc gctgcagacg ctttcagtgg gttgacggca gccgctggaa 3121 ctttgcgtac tgggctgctc accagccctg gtcccgcggt ggtcactgcg tggccctgtg 3181 tacccgaggt gaggtggggc tggggatgaa cgatggaaag gtctgggaga tgggaagtgc 3241 cccaaggagg agatgctaca aagagcctga ccctttgtgg gagaggcttc ctgggtcttt 3301 tatatactct gactccacag cagtgtgtgg gtgggaaaag aggccctcct gtgggttgag 
3361 ttgggatgga caagaggctg aaagtccctt tctgttctgc cttcacagga ggctactggc 3421 gtcgagccca ctgcctcaga agacttcctt tcatctgttc ctactgagct ggtcccagcc 3481 agcagttcag agctgccctc tcctgggcag ctgcctcccc tcctctgctt gccatccctc 3541 cctccacctc cctgcaataa aatgggtttt actgaaatgg atttattttc tcctctgatc 3601 gcggatcc // LOCUS HUMEPOHYDD 24790 bp DNA PRI 07-NOV-1994 DEFINITION Homo sapiens epoxide hydrolase (EPHX) gene, complete cds. ACCESSION L29766 L25880 NID g537525 KEYWORDS Alu repeat; epoxide hydrolase. SOURCE Homo sapiens (tissue library: T. Maniatis) fetus DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 24790) AUTHORS Hassett,C., Robinson,K.B., Beck,N.B. and Omiecinski,C.J. TITLE The human microsomal epoxide hydrolase gene (EPHX1): complete nucleotide sequence and structural characterization JOURNAL Genomics 23 (2), 433-442 (1994) MEDLINE 95137590 REFERENCE 2 (bases 1 to 24790) AUTHORS Hassett,C., Aicher,L., Sidhu,J.S. and Omiecinski,C.J. TITLE Human microsomal epoxide hydrolase: genetic polymorphism and functional expression in vitro of amino acid variants JOURNAL Hum. Mol. Genet. 3 (3), 421-428 (1994) MEDLINE 94282033 FEATURES Location/Qualifiers source 1..24790 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_lib="T. 
Maniatis" /map="1p11-qter" exon 2019..2288 /note="non protein coding exon" /number=1 intron 2289..5467 /number=1 repeat_unit 2923..3192 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=direct repeat_unit 4810..5086 /note="Alu Sb subfamily" /rpt_family="Alu Sb subfamily" /rpt_type=direct exon 5468..5655 /gene="EPHX" /note="G00-119-876" /number=2 gene join(5473..5655,8515..8695,15392..15619,15955..16084, 16566..16774,19095..19203,21228..21353,21875..22076) /gene="EPHX" CDS join(5473..5655,8515..8695,15392..15619,15955..16084, 16566..16774,19095..19203,21228..21353,21875..22076) /gene="EPHX" /codon_start=1 /db_xref="GDB:G00-119-876" /product="epoxide hydrolase" /db_xref="PID:g537526" /translation="MWLEILLTSVLGFAIYWFISRDKEETLPLEDGWWGPGTRSAARE DDSIRPFKVETSDEEIHDLHQRIDKFRFTPPLEDSCFHYGFNSNYLKKVISYWRNEFD WKKQVEILNRHPHFKTKIEGLDIHFIHVKPPQLPAGRTPKPLLMVHGWPGSFYEFYKI IPLLTDPKNHGLSDEHVFEVICPSIPGYGFSEASSKKGFNSVATARIFYKLMLRLGFQ EFYIQGGDWGSLICTNMAQLVPSHVKGLHLNMALVLSNFSTLTLLLGQRFGRFLGLTE RDVELLYPVKEKVFYSLMRESGYMHIQCTKPDTVGSALNDSPVGLAAYILEKFSTWTN TEFRYLEDGGLERKFSLDDLLTNVMLYWTTGTIISSQRFYKENLGQGWMTQKHERMKV YVPTGFSAFPFELLHTPEKWVRFKYPKLISYSYMVRGGHFAAFEEPELLAQDIRKFLS VLERQ" intron 5656..8514 /note="intron 2" /number=2 repeat_unit 6533..6823 /note="Alu Sq subfamily" /rpt_family="Alu Sq subfamily" /rpt_type=direct repeat_unit 7214..7502 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=inverted repeat_unit 8010..8295 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=direct exon 8515..8695 /gene="EPHX" /note="G00-119-876" /number=3 intron 8696..15391 /number=3 repeat_unit 8996..9264 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=direct repeat_unit 9731..9933 /note="Alu J family" /rpt_family="Alu J family" /rpt_type=inverted repeat_unit 10078..10243 /note="Alu J family" /rpt_family="Alu J family" /rpt_type=direct repeat_unit 10255..10539 /note="Alu Sb subfamily" /rpt_family="Alu Sb subfamily" /rpt_type=direct repeat_unit 
10558..10842 /note="Alu Sb subfamily" /rpt_family="Alu Sb subfamily" /rpt_type=direct repeat_unit 10872..11151 /note="Alu Sc subfamily" /rpt_family="Alu Sc subfamily" /rpt_type=direct repeat_unit 11974..12129 /note="Alu J family" /rpt_family="Alu J family" /rpt_type=inverted repeat_unit 12144..12430 /note="Alu Sb subfamily" /rpt_family="Alu Sb subfamily" /rpt_type=inverted repeat_unit 12457..12591 /note="Alu J family" /rpt_family="Alu J family" /rpt_type=inverted repeat_unit 13054..13343 /note="Alu J family" /rpt_family="Alu J family" /rpt_type=direct repeat_unit 14269..14557 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=inverted exon 15392..15619 /gene="EPHX" /note="G00-119-876" /number=4 intron 15620..15954 /number=4 exon 15955..16084 /gene="EPHX" /note="G00-119-876" /number=5 intron 16085..16565 /number=5 repeat_region 16105..16144 /rpt_family="dinucleotide" /rpt_unit=gt exon 16566..16774 /gene="EPHX" /note="G00-119-876" /number=6 intron 16775..19094 /number=6 repeat_unit 18029..18317 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=inverted repeat_unit 18347..18636 /note="Alu Sx subfamily" /rpt_family="Alu Sx subfamily" /rpt_type=inverted exon 19095..19203 /gene="EPHX" /note="G00-119-876" /number=7 intron 19204..21227 /number=7 exon 21228..21353 /gene="EPHX" /note="G00-119-876" /number=8 intron 21354..21874 /number=8 exon 21875..22289 /gene="EPHX" /note="G00-119-876" /number=9 polyA_signal 22272..22276 polyA_site 22289 BASE COUNT 6046 a 6179 c 6621 g 5944 t ORIGIN 1 ctggccacaa gtgattcacc accttggccc cacgaagtac tgggattaca gtcatgagct 61 accacactgg gccaaagccc tcgtattgta acctccaaga tgttttcatt tctaaagtca 121 aaggttgtca gagttgctgt tttgcaggga gccatgggat ggggagtctc cgggaaaccc 181 aggctggctc cttaaatggc atcacccaag ttggccatca agcttcccca gactcagtca 241 ctcggccccc tctttccctg gctgtagagc aggggtttgg actcgaggct gctgagttcc 301 tctccagctt ccatgcccag tggcagtgac ttatcaactg gcccacccgg tggatgtagg 361 ggtgatgcag ccctgctctg tcctcttgag tgctgggtgg 
ccatgagaga gggaggccat 421 gagagggagg ggagggttct catctgctta gtgccacccc ctccccagcc caccccaggt 481 cctctgccgc cttgggttcc tgcgtgagca gtccttctca ctctgggcga tgttcgcttt 541 caccagccca accccaagaa gtccaccctg gggaggacag agggggtctc cacggctcca 601 cacccccatg ctttgtcgga gatctggcca ggcagagtgg tgccctcttg actgagcctc 661 gagcctttgc tttcccctta ctccaaataa agccgcagcc ctggcgaggg agcccagagg 721 cctgaggaca gcctgctggg tgcctggcac tctctagatc ctccctgcct gggcggtggg 781 catctgcaat tctccctgtc tcgaactgca gcagaatgtg tgggcagggt taggcagata 841 gagagtgggt gcctggactc attggtcaga ctctaagacc gcccaccgcc cgccgccccc 901 gcagtccttc ccaggactgg ctgacccacc agatagggga gggtgttgtt tcttattatt 961 acattttttg agaaaagcta tgcaggcctg ggagacgtct gtgccctagg agtcccttct 1021 ggcctgaaga ctcaaggctg caaacaggtg ctgagaggga acaagaaggg gagttggggt 1081 gcaacacacc cagtgacaga ccaggagtga gtgctgtggc cacagattct aatcctcaca 1141 actgcagtgt gatttaggct aagccccttg cctactccga ttcttgcctt aaataagtga 1201 aaaggaaagt caagtttgat gagttgtggc tctttagtgg gtggatttgc atgtaccatg 1261 gattacattt accagggtct aggaattgtc aagcgcccag caggctgcca tcaggcctct 1321 gagctccttc gccaggcggc tttatttgga gtaagaggaa agcaaaggtt tgaggctcag 1381 tcaagagggc gccactctgc ctggctagct ctccgacaaa gcagacgggg gggtgtggag 1441 ccgtggtgac cccctccgtc ctccccaggg tgggctcctt gggcttgggc tggtggaagt 1501 gaatattgcc tgcagcaaga agaaggactg ttcatgtggg agcgcaaggc ggcagcgggg 1561 ttggaaaccc acccccacac agtcaacacc tgctgtcaca ccaggctaat caagaacacc 1621 tttctttggt gcattaacca cctgctatta aataaaaggc tgttactagg tggcaaagta 1681 acacactgca gtagagccta tttgtaatct gagatcagtg gctgtaaggc accatccttg 1741 agccaagcaa gtactaatat tcttgttatc ttaattagaa tgcaagagat cgtgttttta 1801 aaattctctt tgaatctgtc actttgccct cctcttcctg ggtgatacac ctggtagtgc 1861 tggtgggggc catcataatg ccccttgtcc cagattccct tcttttagat gggactcgag 1921 cactgatcat ttcagccctg tatctctcag gtcagcgtgg ttcagtttgc tgtgcagagt 1981 ccaggggaga taaccacgct gtgcacacat gagattggct gacttggcag gactgtgcaa 2041 ttgtcagaag gccgtgggga gtgggggcca gtgcctgcag cctgccctgc ctctctcaca 
2101 ggcccttaga gcatcgccag gtgcagagct ccacagctct ctttcccaag gagtaatcag 2161 agggtgagaa cgtggagcct ggtggacagg tgaaagcact gggatctttc tgcccagaaa 2221 ggggaaagtt gcacatttat atcctagagg gaagcgacag cagtgcttct ccctgtgctg 2281 aggtacaggt aaggagggtg gtttgtaaag ttcactgggg agggtgatcc ctctttattg 2341 ttctaatatt atgctcgcag cctccttgac aacatcatgc aaaatgtgtc ttactagttt 2401 ctagtgcata aaatattggt ggagctcttc gctgtgctgg gccagtcacc agtgctgggc 2461 actgtgggtc aagcatggcc caggacggac agccctgcct tcccaagcta tggtttattg 2521 tggtaggatc tgcctgccca gtgaagtgtg ttcagctgct cctgttgctg ggggaagctc 2581 tgctctctgc cttcggctgc ctcccacacg ctcccatgtg tctcccagtc cctaacccct 2641 agtcctgctt tcaataccgc aggggctttt agctgacttt tctccatctt cttaaagggc 2701 aggatttcct ggcagagaca taacgtcttt ctggaaatga aagaagctga tgcatgtttt 2761 tcttttctga attacttaaa taagaggaac tcatttccag gtattccctg ccttgttttt 2821 attgtgggag tgtggtccca gctgcccatc tgatcagctg acatagggtc cttggggccc 2881 aggcactgtt ctgcttggga ttgaaagatg aaagccacag ggccaggcac agcagctcgt 2941 gcctctaatc tcatcacttt ggaaagccga ggtgggcgga tcgctggagg tcaggggttt 3001 gagaccagcc tggccaacat ggtgaaacgc tgtctttact aaaaatacaa aaattgcagg 3061 aggtggtggt atgcacctgt aatcacagct actcaggagt ctgaggcacg agaatctctt 3121 gaacccaggg acagaagttg cagtgagccg agatcgcacc actgcactct ttgagactct 3181 gtctcaaaaa aaaaaaaaaa aaaaaaaagg gaaagcaatg gcctggcttg gctggagatg 3241 gaacatgcat caacgtgtcc cacgtgtctc cagcagggat agcgaaggcc tgccccctgg 3301 cacatagact gagtaaatgt tgatcagtga gaccttgagg cactttagag tttcacaatc 3361 cagagaggga gatagattgg aagttcaagg gtggacacct gggcatgatt cccaggacga 3421 aggctgcagt cggttctaag aagtatgctg gtgactatga gagaagaggt tgtgggtttc 3481 ctctcttgaa agctgtctcc ataatcacat gtgggggtga taagggctct ggtccagggg 3541 agggggcgag gcacacaggg ggctgaatct gatggcatga ggacttggtg atgggctgtg 3601 gggaaagaat tggagatgac tctagggcct gagggtggga agctgggagt tcactgggat 3661 cactgataga aacgcggagg tggggggtgc caagctgatt ttcctgatta gatttcctac 3721 agaactgcag gaataaggcg agagccatcg ccctgggtgg atttctgttt ggagctctat 3781 
cgctggacca ctgtggacaa gtatttcata gaacgacccc cagtagccca attctaggtt 3841 tagaggggca gctcttgaca ggagcaaact ccagggcctc tgcaaccccc cagcagagtc 3901 ccaaacatct gacatgtagc caatggagga catgcaagtg gtggggttcc cagcagggac 3961 cacctcctgc cttgggtttc caagacagag cgaggggtgc tgctggggcg tggtttgcag 4021 gggccttgtc agaactcgat gctttctcct ccgtctgggt cctaactgca gtgcatctag 4081 agaccctcct ctgactcagg cataaggacg gcccgatatt catgcaggtt actctgaaca 4141 agaacagtct gtgggttact tccctaggaa tgaggaagat gaagagctag ataacaatgc 4201 tgggctggta tctgcctgat agtgctttgc caagactatg gtttcttcct cttctgatca 4261 aaaaagacac aggagtcaga caaatgggga ttaaaagaca actgagttga tgtgaaatgt 4321 aacaggcaac actatggaca cagaaagtag attagaggct gcttaggacc taacgtggag 4381 ggcagtaggg gagtgacagc tgaagggtat ggagtttctc cgtgccgatg aaaaatgttc 4441 tacagttgac tgtggctatg gctgcacatg tctgtgatat cctagaaacc actggagggt 4501 tcactttaaa tgggcgaatt gtatggtata tgaattatgt ttcaataaag ctcttaccag 4561 ctaagctgtg tggccatggc tgggcttcct agcagtgtcc cccaacccca aattgaactt 4621 actctcccag cacttgggcc tccccattac cccctgtctg ccctgaggat aagcctctct 4681 ccaagaagga cagagcaggg gctgtgggcc accattcagg aaaggtgctc ctctcccctg 4741 ggggagggaa cccagaggtc atggacagaa gccttctcag acatgcaaaa aacatgaggc 4801 tgtcgggcgc cgtggctcac acctgtaatc ccagcacttt gggaggccga ggcgggcgga 4861 tcacgaggtc aagagattga gaccatcctg gctaacatgg tgaaacccca tctctactaa 4921 aaatacaaaa attagctggg tgtggtggca cgtgcctgta gtcccagcta ctcgggaggc 4981 tgaggcagga gaatggtggg aacctgggag gcaaaggttg caatgagctg agattgtgcc 5041 actgcactcc agcctggtga cacagtgaaa ctccatctca aaaaaaaaaa caacatgagg 5101 cctcagaaat gtagctccca aaacccccta cccagggctg tgcagaagcc aaaccaggtt 5161 cctattttcc caggatgatg aacagttttc aaagcttagt aggattgtgg gtgattttgt 5221 ctcattgtct gttgtttttc catttttcta gatcatatat atataatatt tatactatat 5281 ataatatgta tatagtgtgt gtatatatat atataatatg tatatattag gtcaagtgta 5341 aataaaatca gggacagggt tggagtcgtc agcaggaaag agcctgctgg ggatcagagt 5401 ctctgggctg tcagggctcg ggctgggctg ggctctcgtt gttaatggtc ttcccctcat 5461 cttgcaggag 
ccatgtggct agaaatcctc ctcacttcag tgctgggctt tgccatctac 5521 tggttcatct cccgggacaa agaggaaact ttgccacttg aagatgggtg gtgggggcca 5581 ggcacgaggt ccgcagccag ggaggacgac agcatccgcc ctttcaaggt ggaaacgtca 5641 gatgaggaga tccacgtaag gcaccttggg ccgggccggg ctgggcagtg gagagggtgg 5701 tgtgtcgaag acaggggttg ggtcttaggc cagatgcggg aggggacggg ggcttgggaa 5761 tggtccatcc tacttgggag tgtgtttcag tctgcttttc ctctgctggg caggtgcagg 5821 ttgctttgga gagggatgcc cagccctccc actttctctc ttctcctttc cctacccaag 5881 gcccagcagt gctcaggcag gagatggggt aacctgtatc aaaggtcaca gaaagcagct 5941 gtcatcacct cctctatgcc ctgcctgaag ccttgggagt gaggttgtgg gagaaagtga 6001 gcaggttccc gtcagagacc caggaagcac tgagaaggca gggatgctac tggttaggca 6061 ccggctgtgt gtcagccatt cctggtcaca ttcaatccta cggtgatccg cggggtgagg 6121 attgttgttc ccaaactgca gatgcagaag ctgaagctca ggggcataaa gcaactcgtt 6181 ccaaggcggc ctagctaaca tgcggaagga tttgaagtcc aatctaatcc tgaagctgct 6241 gcccctcctc tgcacaggga tagagtcagg aaggaggtga ctgggcaggt ggggtgaact 6301 gagaatgcat tctcggggag gagcaaggct tgagtggggg acaggacggt agcagcctgc 6361 atgccctgcc aatcctttct ttcattccgt gggttttgag cacctactgt gtgcaggccc 6421 agtgagtgct tgggagaaga gacagcccct tccgtccaac agcatacagt ccagtgaatg 6481 agaaaaggaa tgtgcacaaa gaaaagtaga ctaaaagata aataaagata ggctaggtgc 6541 ggtgcgtcac gcttttaatc ccagcacttt aggaggctga ggcggtggat cacctgaggt 6601 caggagttca agaccagcct ggccaacatc gtgaaacccc gtctctacta aaaatacaaa 6661 aaattagccg ggcatagtgg cgggcaccta taataatccc agctactcgg gaggctgagg 6721 caggagaatc ccttgaaccc aggaggcaga ggctgcagtg agccaagatc acaccactgc 6781 actccagcct gggcaccaag agtgaaatgc catctcaaaa aaagaaacag atgcataagg 6841 ataaatgtca caaaaagcat ggataaaagt catgagagtt ccaaggagga agagatttaa 6901 aagcagaagc tgggctaaac ccaagagcaa ccaggctgag gctccctggg taggacttgc 6961 ttgctgcagc cttgctattt agggccaagc tgagccctaa ggaaagctcc tttagggaat 7021 tagactccta agggagtctg agatttccaa aatctgtttg ggggtaactg aaacacttgg 7081 gaagaagcct gggaatggtg atgcttgaga ggctctctaa gctgacatga acttcctcct 7141 ctggggctgg ccttttagcc 
tgatacatgt caatgagact tgcaccctct ctcctggtct 7201 gtaaattttt tagtttttgg agacagggtc ttgctctgtc accaggctgg gggtgcagtg 7261 gtgcgatctt ggctcactgc aacctccgcc tcctgggttc aagcgattct cctgcctcag 7321 cctcccaagt agctggcatt acaggcacct gtcaccatgc ctggctaatt tttgtatttt 7381 ttagtagaga tggggtttcg ccatgttggc caggctggtc tcgaactcct gacttcaagt 7441 gatccacctg ccttggcctc ccaaagcact gggattacag gcgtgaacca ccgtgctggg 7501 cctggcctgt aatttttatc ccaactcagt tataatgtat cattcaatgt tgtatatcag 7561 gggtcaggaa accaggccca tagaccaaat gtagcctgct acctgtttta gcacagcgcc 7621 catgccaaga atagtgttta cctctttaaa tagttagggg gaaaaaaatc aaaagaagaa 7681 taatagtttg tgacgtgaaa attatatagc agtcagattt cagtgtccat aaatttttat 7741 tggaacacag ccacccccat tcatctgtgc attggctatg gctgtccata aataaagttt 7801 tattggaaca cagccacccc cattcatctg tgcattggct gtggctgctt tcacactatg 7861 aagacagagt ggagtcactg cagcctgcaa agcctagaat atttactatc tggcccttta 7921 cagaaaaagt ttgccgactc ccgctgtgag aacgcctggg taaattaaag atgagtatcg 7981 tgtatacact taaagagtaa aagatgagct gggcgcagtg gcttacacct gtaatcccac 8041 cactttggga ggccaaggca gacagatcac ttgaagccag gagttcaaga ccagcctggc 8101 caacatggca aaaccccgtc tctactaaaa atgcaaaaat tagccggaca tggtggctca 8161 cacctgtaat cccagctact ctgggagctg aggcatgact ggcttgaacc tgggaggtgg 8221 gggttgcagt gggccgagat cacgccactg ccctccagcc tggggccaga gtccggagac 8281 cctatctcaa aaaaaaaaaa aagaaaaaaa gaaatgcgaa gtctacagtg aagaaggaag 8341 aagtgaagaa cacatccgct ttggagtagc cagtgatgtg ggaaactgcc ttgccacttc 8401 cagagggcag tatcttgtgg ctgcagggtt tgctgtgttt tctggaaaca gactttgctc 8461 ttgtgctctg tccttcccat ccctctcaac ttggggtcct gaattttgct ccaggactta 8521 caccagagga tcgataagtt ccgtttcacc ccacctttgg aggacagctg cttccactat 8581 ggcttcaact ccaactacct gaagaaagtc atctcctact ggcggaatga atttgactgg 8641 aagaagcagg tggagattct caacagacac cctcacttca agactaagat tgaaggtatg 8701 tttgcaaaac gccagccaga gagggatgta tgtcatgaga acagccttct tccacaatgt 8761 gacttacagt cagataaagt tggagagatt cagaacccaa ttataggtca ctgagatgta 8821 cttatacttg taaccttact aaatgcaaaa 
atattgcagt cacaacaaat ctgtccatct 8881 ttagagctag gcaggaacac atacatgcaa tttaaaaacc aaagaaatgc accatccagt 8941 ccaatgatgt ggcaatcttt gagaggaagg atggagatga gatttagacc caggccaagt 9001 gggggtggct cagaactgta atcccagcac tttgggaggc tgaggtgggt agatcacgag 9061 gtcaagagtt caagcagcct ggccaagatg gtgaaatccc atctctacta gaaatacaaa 9121 aaatagccaa ggtgtgttgg caggcagctg taatcccagc tactctggag gctgaagcag 9181 agaattgctt gaagccagga ggcgaggttg cagtgagcca agatcgcgcc actgcactcc 9241 agcctgggct ccatctcaaa gaaagaaaag actctgctct cagttctttt ggatctatac 9301 tcagaagtgg acttgttgga tcatacggta attctgcgtt taattttctg aggaaccgcc 9361 atactgcgtt ccgcaggcta gctgcaccat tttactttcc cacccaacaa tgcctcagaa 9421 tcccgattct ccaaatcctt gccaatattt ttttcctttc tttttttcgg taatggtcat 9481 cctaatgggt atgaaatggt atcacattgt ggttttgagt tgcattttcc taatgattac 9541 tgatgttgag cattttttat gcggtactta ttggattctt tgtaatttct ccgaagaaat 9601 gtctgttcat gtcttttgtc cattttttgg gtgttttatt agggtttagg agttctctat 9661 atattctgaa taaaatctct tatatttgca aatgttttct cccactctgt gggttgccat 9721 acaattaatc tttttcagag acaaggtctt gctgtgttgc tcaggctgga gtgcagcggc 9781 tattcacagg cgtgatcatg gctcaccaca gcctcgaact cctgggctca agcaatcctc 9841 tcgcctcaga cctcctgaat agctggtact ataggcgagc accactgtgc ccagcttgtt 9901 aattttttat attgaagaaa ttgtacatgt taggaatctt atttttaatg ttttaagata 9961 atttggcctt ttcaaggaac agtgaagatt ttaatttgtt aaaattctag gaattactta 10021 aagtatatga aagacatgtt ggtcagtcag ataaactaaa gttcctaaaa aaaggaaatt 10081 gaaattatag tgttaactgg gtgtggtggc atgcacctgt agtcccagct actcaggaga 10141 ctgaggcagg aggattgctt gagcccagga attcaaggct gcagtgagct acaaggacac 10201 cactgcactg ggtgacagat caagacccca tctcttaaaa aaagagagag aggctgggca 10261 cggtggcgca cgcctgtaat cccagcactt tgggaggctg aggcaggtgg atcacgaggt 10321 caggaggttg agaccatcgt agctaacacg gtgaaatcct gtctctaata aaaatacaaa 10381 aaattagcca ggtgtggtgg cgggtgcctg tagtcccagc tactcgggag gatgaggcag 10441 gagaatggcg tgaacccagg agatggagct tgtggtgagc tgagatcaca ccactgcact 10501 ccagcctggg cgacagaatg agactctgtc 
tttaaaaaaa aaaaaaaaaa acaggctggg 10561 cgccgtggct cacgcctgta atcccagcac tttgggaggt caaggcgggc agatcacgag 10621 gtcaggagat cgagaccatc ctggtctaac ttggtgaaac cccgtctcta ctaaaaatac 10681 aaaaaattag ccgggcgtgg tggcaggcgc ctgtggtccc agctacttgg gaggctgagg 10741 cagaagaatg ccgtgaaccc ggaaggcgga gcttgcagtg agcccagatt gcgccactgc 10801 actccagcct gggtgacagg gcaagactct gtctcaaaaa aaaaaaaaaa gaaaaaaaaa 10861 aaaaaagccg ggtgcggtgg ctcacacctg taatcctagc actgtgggag gccgaggcgg 10921 gtggatccca aggtcaagag atcgagacca tcctggccaa catggtgaaa ccccgtctct 10981 actaaaaata caaaaattag ctgggcgtgg tggcacgtgc ctgtagttcc agctactcgg 11041 gaggctgagg caaaagaatt gcttgaacca gggaggcgga ggttgcagtg agccgagatc 11101 gcgccactgc actccagcct ggcaacagag caagactcca tctcaaaaaa aaagggagaa 11161 aggaaggatg gacggaaacc tcgacttgga ggtgccaggg cagcaaacat ccttgttctg 11221 atgcaagcag gccaggcctg tgggagaatg agccagagaa aagctaagtc aaatcccagt 11281 ggtgtgtgtc ctggaactag agcccatcct ggagctcagc caagaacact aaaaaaaaaa 11341 aaaaaaaaaa aaatctggaa tgattttttg gatcaaattt ggacttctct tgagtgttgc 11401 tgtattcttt acagtaattt gatatcttct attatgagcc taacaagagg atccaagcca 11461 cttaatccca gccaaactgc tgtatgcaat taccataggt tcctgtttgg attcctggca 11521 cgaaattgcc tggcttcctt gagtttgact tggatccctg tgttgtccca gctcttcagt 11581 gtccatccag cagaaactaa cagggcgcct gctgtaccca ggcattctgc tggggctggg 11641 gacgcataga gaggacaggg tccctccttc acagctcata ggctcttcag gggcaattcc 11701 agcacatagt ggcggtgttc tgaaggagga ggctctggga gcttcatgag ccccacaatc 11761 ttcctttgaa tgtgagggaa ggcttcctgg aggaggtggc atctaagctg aaatgagtgg 11821 gactttgcca ggtgtgtggg gaaggtggtc caagaaaggg gaacagcatg ttcaaaggcc 11881 cagaggcagg aggcagtaaa gggccctttc catcctgcag ctgccagcct taccccactg 11941 gaccctttca gggttcttca tgagactggg ttttttttag agacagggtc tcactctgtc 12001 agccaggctg gagtgcagtg gtgtgatcac agttcattgc agcctcaatt tcctgagctc 12061 aagcgatcct cccacctcag cctctcaagt agctgggacc acaggtatgc accaccacac 12121 taggcctttt tttttttttt tttttttctt gagacggagc tcgctctgtc acccaggctg 12181 gagtgcagtg 
acgcaatctc gactcactgc aagctccgcc tcccaggttc acgccattct 12241 cctgcctcag cctcccgagt agctgggact acaggcgccc gccaccatgc ccagctaatt 12301 ttttgtattt ttagtagaga cggggtttca ccgtgttagc caggatgatc ttgatcgcct 12361 gacctcgtga tccacctgcc tcagcctccc aaagtgctgg gattacaggt atgagccacg 12421 gcacccggcc tttgtttttt ctttcctttt tttttttttt tacattttta gtagagacaa 12481 ggtctcacca tgtcacccag gctgatcgca aactcctggg ctcaagcaat tctcccacct 12541 tggcttccca aagtatcagg attataggtg tgagccacca cacccagctg acattgggca 12601 tttatggtac aaaacctccc tcaggtacaa cgctccaaaa gagccagttt ccagcattga 12661 attcatcgtg gacttaccca tgtgtggttc tggcaccctg cccggggctg gaggctgttc 12721 ttggccttgg tcctttagaa taggagtaaa tttggcaaca taggaggaca cccaggatac 12781 attgttaagt taaaaaaaaa gagattctaa aatagaatgt atgtatggtt ctatttttgt 12841 taaaagttat gttctctaag tctttttaaa aagtttgaac tgtcacagcc aagaagggtc 12901 taaagagaca tgacagctca gtggtgtgtg agtcctggat gggatcctgg aacagaaaat 12961 ggatatcagg ggaaaaactg aggaaatctg gctcatattt tgttgtgtgt agataccagc 13021 attatccagg ttttaaaaag gagaaaatcc tggccgggtg tggtggctca cgccagtaat 13081 cccaacactt tgagaggcca aggtgggagg atggcttgag ccagtagttt gagaccagcc 13141 tgggcaacat agcgagaccc cttctctaca aaaaaaaata acaaaaatag gctagacatg 13201 gtgttgccca cctgtagtcc cagctacttg agaggctgag gtgggaggat cacttgagcc 13261 tgggaggttg aggctgcagt gagccgtgat tgtgccactg cacttcagcc tgggcaacac 13321 agtgagacct catctctaaa agaagaagaa aaaaaaagga aatcctgcaa tgtgtgacaa 13381 catgaataaa ctctgaggaa ataatgctaa gtgaagtaag ccagggacag gacaaatacc 13441 gcatgattcc acctatagaa ggaatctgaa atagtcaaac tcagcagaag cagagaggag 13501 aagggtggtt gtcaggggct ggggaatagg ggaaacggca ttgctgatcg ttgctgtgta 13561 aagtttgagt cctgccagcc gaatacatcc tagagatggg ctgtacagca gagcgccgtc 13621 agtgttggac acttacaatc tgttaaaagg gtaggaccca tattaagtgt tctcgccata 13681 taataaaaag tgtaaacact aaggaaatca atggacttta tagttaataa tatctcagtc 13741 tggttcgtta attgtaataa atgtaccatg ctaacataaa atattaataa tggaggaaac 13801 ccggtgggtg tgaggtatat gggagttttc tggactcttt gcagtttttc agcaatctaa 
13861 aactgttcca aaataaagtt tagacagaca acaccatgtt ttctttagag agtgggatta 13921 catgtgagtc ttcccccttt ggcttatctg tgttttctaa gttttctaca gtaaacatgt 13981 attatttgta taatatatta acgacagaac agaacattag ctagaccatg ctgactgcac 14041 cagccctggt ggcagagagt tccaagcatg gcagccgccc tcactgtgaa tgccgtaagg 14101 ccccaggtgc tgtccagggc actggcctgg gaaggggctg ccttcttttg atatgcagag 14161 gccccaagtc agtccagagg agacaagaat ccattcacct gcccttggct atgatgatgt 14221 tttcagcatt gcacttaaaa aaaaatcttc cactttattt tattttaatt ttttgagatg 14281 gagtcttgct ctgttgccca ggctggagtg tggtggtgtg atctcggctc agtgcaacct 14341 ctgcttccca ggttcaagcg attctcttgc ctcagcctcc cgagtagctg gaattgcagg 14401 cgcctgtcag catgcccggg taattttttt gtatttttag tagagacggg gtttcaccat 14461 gttggccagg ctggtcttga gctcctgacc ttaggtgatc cacccgcctc agccttccaa 14521 agtgctggga ttacaggcgt gacgaccaca ccaggcccaa aaatcctcca ctttaaagaa 14581 aatatcattt ggtaatcaga aaaaggtgac attttgtcaa tgtgtatgaa aaaaattatg 14641 catttaatac ctgttcattg tagaaaaatt ggaaagtaca taaaactcaa agagaaatta 14701 aaagcactgg aatctactgc tcaggaaaac tgctttttac attcttggag tttatcctcc 14761 cagactttta aaattcacca atacatggta taagtacagt ataagcatag atgacttttt 14821 ataaaaactg ggtcacacaa tggatatgac ttgatcattt gcttttacct cccagcaata 14881 tcttctgatc agttttctat gttaaataaa tatacatcta ccttgtcagt ttagatgact 14941 gtactggact ccagtatact gtcaaactat acttgattaa tcctgtattg ctggatacgt 15001 ggggctttct ccctaccctc cagattttaa attattgaac aagtatttat ggaggcctgc 15061 tgtgagccag gagctgtcct gagccctgga aacccagcag tggctgtaca gacctggccc 15121 agctgtcagg gggcacctct aaggaaaccg ggaggcaata atcgtagctc ccttgcaggg 15181 aggttgtgaa ggctgagtga ggacatctgt gcacctggag cacagtgtga gtgtgaaacc 15241 agtgtcagcc cttattactg tcaataccat gaaggggcgg cgggggcact aagggtggca 15301 ggactcaata tctaggctct ggggggtgcc agagcctgac cgtgcagggt cttctctctc 15361 cctccaccct gactgtgctc tgtcccccca gggctggaca tccacttcat ccacgtgaag 15421 cccccccagc tgcccgcagg ccgtaccccg aagcccttgc tgatggtgca cggctggccc 15481 ggctctttct acgagtttta taagatcatc ccactcctga 
ctgaccccaa gaaccatggc 15541 ctgagcgatg agcacgtttt tgaagtcatc tgcccttcca tccctggcta tggcttctca 15601 gaggcatcct ccaagaaggg tacggggctg ctagaggttc cataactgcc ccgtcctcgc 15661 caagggtggg cccggtgttc ccaccaggct ctccttccgg cggggtgagc agggagttgg 15721 cccgaggaag ctgggaaagg aggggcctga gaggccggcc ccagacacac cgccctccgg 15781 gctggagatg ccacccctat atttgggctc caggattcct tcttgcctct gtgagctttt 15841 ctgacctcca cctgggggta ggcgggtcct gagaaatttc atagaacacc agagggccca 15901 aggagcaatc tgcctgtgac tccgtgactc catgcctttc cccatcactg ccagggttca 15961 actcggtggc caccgccagg atcttttaca agctgatgct gcggctgggc ttccaggaat 16021 tctacattca aggaggggac tgggggtccc tgatctgcac taatatggcc cagctggtgc 16081 ccaggtgagg tcactgttgg ggtggtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 16141 gtgtcctcta agaagtggac ctgtgtgcag ggtgggccaa ggacccccag gggagaggag 16201 ggagtgtgac ctgtcactca gcagtgcctg aggcatgttg acttggatcc tcctgtctgt 16261 aacccagggt tgcggctctg ggtcaggttc ccaggcagag aaggcctgtg ataggagggg 16321 aaggtcaggc attagacctc tctgctgacc aagctctggg atagccctga gcagaactcc 16381 ccagaaactg tatttttccc ccaacctccc ttctccaact cccatgcctc cagtggggcc 16441 agtgctgaaa gaggctggct ctgtgctcgc tcctcagccc tgaggcctgt tcctccgcca 16501 tctctcctct ctccctgtcc ttgacaccag cccagcctca ccccggcccc tctctctgcc 16561 ttcagccacg tgaaaggcct gcacttgaac atggctttgg ttttaagcaa cttctctacc 16621 ctgaccctcc tcctgggaca gcgtttcggg aggtttcttg gcctcactga gagggatgtg 16681 gagctgctgt accccgtcaa ggagaaggta ttctacagcc tgatgaggga gagcggctac 16741 atgcacatcc agtgcaccaa gcctgacacc gtaggtgagt gtgctcaggg gtcctcgccc 16801 actgccggct ccactggggc agggagacac cgcggggtaa cctcaccacc ccaccccaat 16861 gctgcccaca gaaggaagcc tcttgtaagg acctcccagc tctttagatc tttaaactgc 16921 attttcccat agaatcaaga ggatggaccc tcagcagatt cccaggccaa tttagcaact 16981 ctgtttagcc gactcagtat tcatagtggc tttcacccct gatctcccac ccctgagtct 17041 atatcctaca ctgagaagca gagggatgga gggggtgagg taggaggaat cctgggagtg 17101 ggcagctctg gggctgcagg ggagagggga gagggtgccc tggggccttg gggaagaggt 17161 tcccaggcag tgagggaggg 
cccagcaacc atgtctgcca ccagggtcct gttaaaggac 17221 aatacttttt caaaaaggta aatttataaa ggaaccagta aaattttaaa aagaaacaat 17281 tcaaagacaa ataagaacac tgaaattaat ttgctaacag acacaaaact tgtcaacatc 17341 cacagagtaa tatcaattaa atctccatgg gacaaaattc atttgcctct atgaacaaga 17401 gtcattcgca ttaccctcag ttctttacaa ttcacagaac tgtaaaatag ctcaatgaca 17461 aggaaagatg ggacccctgt agcatcacgt cagtccggaa ggtctgctag gtaactccct 17521 agatcttgtc cagaagacac ccaggaattg tcttgcccat ctcaaagttc ttcatagttt 17581 tcatggcaaa cccatgccca cacacacgag gcagctcata atcataataa tggcacttta 17641 taaaagtcag catttttctc tgattataaa atacatgttg tcaaaaagct taaaaatgca 17701 aaaagtacaa aacattaaag acaatctcat ataacctctt ctccctcccc acatcccctc 17761 tccttcaaaa gaacttgtga acattttggt gagaattttc acaagtgaag tacgtccatc 17821 tcaccaacat ggccccaaca ctccagatgc cctcggggtt ctctcccagg taccacgctc 17881 tgacttctga ctgagatggt tttgattgtg cctgtttaag tcctatgaat agaatcacag 17941 agtatatacc cttgagtctg gcttcttttg ctcaatatta tgtttgtgag attcatttat 18001 gccattgtcc cagctttatt tatttatttt ttctcaagac ccagtctggc tctgttgccc 18061 agggtagagt gcagtggtgt gatctcagct cactgcaacc tccgcctccc aggttcaagc 18121 aattctcctg cctcagcctc ccaagtagct gggattacag cacctgccac catgcctggc 18181 taatgttttg tatttttagt agagacgggg ttttaccatg ttggccaggc tggtctcgaa 18241 ctcctgacct caggtgatcc acccgcctcg gcctcccaaa gtgctgggat tacaggtgtg 18301 agccaccgca cccagcctgt cccactgtcc cagccttttt tttttttttt tttgagacag 18361 agtcttgctc tgttgcccag gctagagtgc agtggtacga tctcggctca ctgcaacctc 18421 cactcccggg ttcaagcaat ttgcctgcct cagcctccca agtagctggg attacaggca 18481 tgcaccacca tgccctgcta atttttttgt atttttaata gagatggggt ttcaccatgt 18541 cagccaggat agtctcgaac tccttaccgc aggtgatcca cctttctcgc cctcccaaag 18601 tgctgggatt acaggcatga gccaatgccc ccagcctgtt ccagccattt ttaatgactg 18661 gacagtattt tggcttagga gtatatcaat atttgtaact gatcccctat tttagacttt 18721 tttaggtggc ttctcatttt tcatgaatca gagcacatag actgaaaaac cccgcctttt 18781 taaaaacagc acttctcatt gggaccccac caaatgcaga gcgtaggccc ttcagtttga 18841 
aggaagctga gcagcagtgt gccaggccct gggagacagt gggaaggtat cgtccctgtt 18901 tatcagatga ggatgccgag gcagagttaa ggaagcgagg catgtgagca cccaggagtt 18961 aaggcaggcc tgtgctcgaa cgtggcttcc tgcacacagc ccgccctctg cggccagtgc 19021 cacacatcac acctgaagct ccagctctct ggtccccagg cctgagtctc tcccttgctc 19081 cccgctcccg ccaggctctg ctctgaatga ctctcctgtg ggtctggctg cctatattct 19141 agagaagttt tccacctgga ccaatacgga attccgatac ctggaggatg gaggcctgga 19201 aaggtgaggc cctggtttgc cccctgcagt catgaccctg gtcccagcag ccaacctcct 19261 caccctcttc atcccccttg tctggttgct ctgcatgggg gcactcagca aattctattg 19321 ttggctttct tacaaatcct caccctgtga ccaacgtagc tgcatttttt gggtgaggtg 19381 gcctggttgg gttgttccca gcctcacagt gcatggtcaa gactaggatg caggctcacc 19441 tgcttgtgct ccttttgctg aacaggcctc tggggattag ggagcccaca gcgagcaggc 19501 accacattat tgtcaagagc acctgggctt ttcctcctga ttccctggaa gagcctggca 19561 tcagggtcac gtgactgcgt gttccaggag ggtggtcgca gaaagacagg aggctgttaa 19621 tctctcagag ccttagaggc tgtatcctct acatgcctga ctccccagct ctgtggcctg 19681 actcaggctt caccaaggtc agctctgacc cacctttcca gggcccatcc ctttattcct 19741 tcagtgacac acaggaccag acagtgctgg ggatgggaaa gacagtaaaa tgagactgaa 19801 tcctgccttg ggggtaggga ctctgtcttg gccatgctgt agccccatca cctagaacaa 19861 tgccgggtga acagtgggag agcccagtac atagatgttg aatgaatgcg caaaggactg 19921 tgagagtcca gtggaaggag gattctttgt ctaagcggca gagaggggaa atgaccagga 19981 gaatgtatct gatttgggcc ttggaggata agtaggagtt tgctggatgg aggagagaaa 20041 cttaggcaaa gagaaaagca ataaaaatgc aaggaaaggg tgcttggaaa gggtggcttg 20101 tccagaggct gaagcaggta cactggggcc agatcaaaag caaactcatt ctgcaggcag 20161 gtcctcacct cctccaagag cctcagcagg gccttcgtct gggatctgtt tcccctgccc 20221 tacattcagc aggacacagg ggttgatgga gcttctgcaa gaccaccttg gcagggggtc 20281 tggcctgctc cttgcacacg gatacctgga gcaagatgtg aggccccacc agactatccc 20341 actaggccac tcctgcagct ctggccccgg gttacaggga gcaccaaatc cctgagcagg 20401 aggtagacag gatcagatct gtgttctaga aagaacagga atacactagg gtggaaatgg 20461 tggtcgtggt cgttttcaag ctgcggcaga gccgaaaggg agatggaagc 
acttggccaa 20521 cttgcagaaa tccagttcac agaggtgatg tgtatactgg aactagtttt cattgaccag 20581 ttttcatttt gccaccaaaa tcacattgaa aaggctgttc cctttatctc tgtccctgct 20641 ccctgctgct gttgctagag attcagggtt tgagtgagga aggtagggtt gggagggcgt 20701 gctctgggga aagtggaagg tggggtgggg gctgtggagt atgcatcctg gacccaagct 20761 ggctaggttt ggattctcca cttaataagt aaaaatggag ataatagtac ctatgtccca 20821 taggctgttg agaggaaaac atgaattaaa aagaaggtaa ggagcttaga acatgctgcc 20881 tggtatgtgg tcaccacagt agctggtgcc ctgagcagtt tcacggtgca ggggttgagg 20941 ggagggagga gaggtcagtg gcaggagaga ggggacaggt gtgtagaggc aaggtgggtg 21001 gggggaggac cttggggtga taagagacac ttcaactaaa tccatgcgct ccagtctagt 21061 gccctggcgc agcctgcctg tgacacgagg ataccacaca cgttgcatga ggtctgagga 21121 gggaagccgc atgggcaggg cctggcattt gtaggacttt ctggctgccc tttgtcacac 21181 aactgcatgt ggcactgaga gtggggcttt gtgttctgcg ttcccaggaa gttctccctg 21241 gacgacctgc tgaccaatgt catgctctac tggacaacag gcaccatcat ctcctcccag 21301 cgcttctaca aggagaacct gggacagggc tggatgaccc agaagcatga gcggtgagcc 21361 tggctgagcc gagaacaggg gcctctgagg ctggaggcag ggggacggcc agtcttgggc 21421 tccacgaagg gcacttggtg gtgggataag tatagccttg cctgcaggtg ccccagggcc 21481 cttggatggg aacactaaag gtcgaggtgt ttggagaaag ccctcatgga gggaccaggt 21541 gcctggctcc cgggcggccc tcagtaccgc tccccagtct aggatgctgg tctggagtgg 21601 cactgatggt tctgggattt ccaagtctca catggaaacc ctcaaaggtg gggatttaga 21661 ggctgtccca tgccccctgc ctgggtggcc ctcccagaaa agagaaggcc ctcagtgagg 21721 ggagagcggt tgagaccctg gatttgtcct tttctttatg gctccttgtg ggtgggccag 21781 ctgatctcac ccccaggcgc tttaaccccc tgctgtgcag acggaggggc gcagggccac 21841 agcctcaccc ctccgtcggc tctttcactt ccaggatgaa ggtctatgtg cccactggct 21901 tctctgcctt cccttttgag ctattgcaca cgcctgaaaa gtgggtgagg ttcaagtacc 21961 caaagctcat ctcctattcc tacatggttc gtgggggcca ctttgcggcc tttgaggagc 22021 cggagctgct cgcccaggac atccgcaagt tcctgtcggt gctggagcgg caatgaccca 22081 cccctctccc cccgcctgcc acctcccccc acaagtgccc tccaggcttt tcttggggaa 22141 gatacccctt ttctgaggaa tgagtttgcc 
tccgtcccct gcccatgctg ggagcccacg 22201 ctcaccccct cacccctcca agctcactcc ccaaccccca actccgtgtg gtaagcaaca 22261 tggctttgat gataaacgac tttactctaa aagcggctgg aactcagtga catgagcgtg 22321 cgctgacccc acatgggccc cctgtgcaag cagagctggc cggccccctc cttgctggca 22381 gaggcacggg aggcctgctg gggatgaggc cactggccag ggctatgctg caccagacca 22441 atggcaccgc cccacccctc ccagcgcagg ggcagcttgg agcagaggca gcactggcca 22501 ccactgcggg ggcaagtcag cgtcaagaga gtccctgagt gagaaggccc agataagccc 22561 aggcccccca ggcagcggac aggcacaggc agggctacag aggtgccaag gccccaggca 22621 gttgtgctag gagcctggac ctgctcttcc acatcctccc attacccacc cctactgcat 22681 caggcttata ccttggtgcc ccctggaggc agcagggagg aggcttctca ggcagaagtc 22741 ttaagttgca tcccattccc cagaatcccc aggagggaga agagggatgg gctgcccctc 22801 cttcctgcca gagccacagc tcaagggcag tgggatggcc ctgcacccag cccaggtacc 22861 ccttcctctg tgccgaccat gctgtccttt ggctccgagg aggccctgag cactgcccac 22921 ccccacacct tggagggagc agacggaggg ggtgagtact gcacagagcc tgcctggagg 22981 tgcagactca tgcactcggc cctgaagagg tgagagaggc tggatcttaa ctgtggtcca 23041 atgggcatat tatgagtggt ccttgtccct gcctgctatg gggtaaccct gacccagcta 23101 gcgtccctga caacatgata caaaaacaca gaactctagc ggatacagag atgtgtgcac 23161 agctggcagg agcactggcc acctcccaac ctgttcccca tagcttactc ctggatgtga 23221 gcctggaggg actgctgggc tggggaggga ctggccatgg agcagggagc tggatgccag 23281 cggacctcag acctggactg cagcatctct ttggttatcg gttgtgtgtg tgctcagaag 23341 ggaggaggcc ttccccttgt tacatctctc tctccatata ttctcccgga ctcatcagtt 23401 tggggggcga aaagtccacc acatggtctt ggcgacaaac cactggggat gtcaccccag 23461 ctgcagagct ggaagcttgt gccggaggga aactgggtga gcaagggagg ggcgaggctg 23521 cctgtgctct cctcaccact gctgggacca gaaaagatgg gcacctgata gctcagctgg 23581 gcagtcccaa cgcattccca ctcagccaag ggcctgagcc caggccattc tggttgtgtc 23641 ttttcagagc agtgagacct aaaacagatg ggaagaagtc atgaacttgg gagtcaggcc 23701 gcttcacctg tcttctcacc ccacaggctc ccctgagctg gcccagccag cagctcaccc 23761 agcctcaggc ctcctggggg gcagcagcca cactgcccgt ctgcgccaag cactgtccag 23821 ggatagtccc 
gctgatgttg tggatggcac cataggtctg ctgctgctgc tgctgcggag 23881 acagtgctgt cctctccgag gccaagccgt tcagaatccg aggcacgtag ggctgtcagc 23941 aaatggattt gtgcttggcc tggacagcga tgagcttcca cactgcatcc ctcccctcct 24001 gcccttctcc caacaggaca cagacagtgg ccataaagga catatgaaga aacaccgaca 24061 tccatttagg acattaagac cgaaaatatg acacctgcag aatttcctag cagccagggc 24121 aaaagagaaa gcacagttat acagttctca tgatcacaga aggaagcaca agagagcagt 24181 tctcagagga aactaaactt cctagccctg gatcctagga ggatttctca ctttgtagaa 24241 gaaacaagtc tacctatata ggggggctta gaacgctgcc ctgggacccc tgggatgctc 24301 aacatatctg tgtgagtctc attttgagtc tcatttcctc ccaatcccag gctccagcac 24361 tgaccccagg cacagagccc tgccatactc ctcctgagac cctggctgtg agggcttgac 24421 atcatccacc tgtcctgcat gttccctgct tggccagcat agccacaggt gtgtcgctgg 24481 gtaatgagtc cttctgaggc cctcccttgt ttggaacttg tccaaggaca gacaccgata 24541 agagagcctg acacctggga ggcagccacc agaaagaacc ctgcagatga ggtcggctct 24601 gtcctggcca cggctgtcag gaaggctggt gactacagct ctggaaccct ggggctctgc 24661 tttcccttcc cagccccacc ctaggagcag gaccctggag gctgtgtgca gagagaacta 24721 ggaaggcaag gagccctgtg tctctctacc atggggtctt cctctgaagc aggtgcaatg 24781 ttcatttctt // LOCUS HUMETMAGA 3343 bp DNA PRI 13-FEB-1996 DEFINITION Human secreted epithelial tumor mucin antigen (MUC1) gene, complete cds. ACCESSION M35093 NID g182252 KEYWORDS MUC1 gene; cell surface antigen; tumor mucin antigen. SOURCE Homo sapiens breast tumor DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wreschener,D.H., Hareuveni,M., Tsarfaty,I., Smorodinsky,N., Horov,J., Zaretsky,J., Kotkes,P., Weiss,M., Lathe,R., Dion,A.S. and Keydar,I. TITLE Human epithelial tumor antigen cDNA sequences. Differential splicing may generate multiple protein forms JOURNAL Eur. J. Biochem. 
189 (3), 463-473 (1990) MEDLINE 90276413 REFERENCE 2 (bases 1 to 3343) AUTHORS Tsarfaty,I., Hareuveni,M., Horev,J., Zaretsky,J., Weiss,M., Jeltsch,J.M., Garnier,J.M., Lathe,R., Keydar,I. and Wreschner,D.H. TITLE Isolation and characterization of an expressed hypervariable gene coding for a breast-cancer-associated antigen JOURNAL Gene 93 (2), 313-318 (1990) MEDLINE 91033045 COMMENT Draft entry and computer-readable sequence for [Gene (1990) In press] kindly submitted by I.Tsarfaty, 12-JUN-1990. FEATURES Location/Qualifiers source 1..3343 /organism="Homo sapiens" /note="(vector lambda gtWES)" /db_xref="taxon:9606" /cell_line="MCF7" /tissue_type="breast tumor" /map="1q21-q23" misc_signal 384..397 /note="MUC1 ERE" misc_signal 633..644 /note="MUC1 CACCT motifs" TATA_signal 689..692 /note="MUC1 TATA box" prim_transcript 777..>2207 /gene="MUC1" /note="MUC1 mRNA and introns; G00-120-705" gene 777..2207 /gene="MUC1" CDS join(785..842,1342..2207) /gene="MUC1" /codon_start=1 /db_xref="GDB:G00-120-705" /product="tumor mucin antigen" /db_xref="PID:g182253" /translation="MTPGTQSPFFLLLLLTVLTVVTGSGHASSTPGGEKETSATQRSS VPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEPASGSAATWGQDVTSVP VTRPALGSTTPPAHDVTSAPDNKPAPGSTAPPAQGVTSAPETRPPPGSTAPPAHGVTS APDNRPALASTAPPVHNVTSASGSASGSASTLVHNGTSARATTTPASKSTPFSIPSHH SDTPTTLASHSTKTDASSTHHSTVPPLTSSNHSTSPQLSTGVSFFFLSFHISNLQFNS SLEDPSTDYYQELQRDISEMVSIGLSFPMLP" exon 785..842 /gene="MUC1" /note="secreted epithelial tumor mucin antigen precursor, (MUC1); G00-120-705" /number=1 intron 843..1341 /gene="MUC1" /note="G00-120-705" /number=1 enhancer 1063..1090 /gene="MUC1" /note="putative" misc_difference 1329..1331 /gene="MUC1" /citation=[1] /replace="aaa" exon 1342..2207 /gene="MUC1" /note="secreted epithelial tumor mucin antigen precursor; G00-120-705" /number=2 repeat_unit 1670..1729 /gene="MUC1" BASE COUNT 679 a 986 c 981 g 697 t ORIGIN 1 gagctcctgg ccagtggtgg agagtggcaa ggaaggaccc tagggttcat cggagcccag 61 gtttactccc ttaagtggaa atttcttccc ccactcccct ccttggcttt 
ctccaaggag 121 ggaaccccag gctgctggaa agtccggctg gggcggggac tgtgggtttc agggtagaac 181 tgcgtgtgga acgggacagg gagcggttag aagggtgggg ctattccggg aagtggtggt 241 ggggggaggg agcccaaaac tagcacctag tccactcatt atccagccct cttatttctc 301 ggccgcctct gcttcagtgg acccggggag ggcggggaag tggagtggga gacctagggg 361 tgggcttccc gaccttgctg tacaggacct cgacctagct ggctttgttc cccatcccca 421 gttagttgtt gccctgaggc taaaactaga gcccaggggc cccaagttcc agactgcccc 481 tcccccctcc cccggagcca gggagtggtt ggtgaaaggg ggaggccagc tggagaagaa 541 acgggtagtc aggggttgca gcattagagc ccttgtagcc ctagcccagg aatggttgga 601 gagagaagag tagagtaggg aggggggttt gtcacctgtc acctgctcgg ctgtgcctag 661 ggcgggcggg ggggagtggg gggaccggta taaagcggta ggcgcctgtg cccgctccac 721 ctctcaagca gccagcgcct gcctgaatct gttctgcccc ctccccaccc atttcaccac 781 caccatgaca ccgggcaccc agtctccttt cttcctgctg ctgctcctca cagtgcttac 841 aggtgagggg cacgaggtgg ggagtgggct gccctgctta ggtggtcttc gtggtctttc 901 tgtgggtttt gctccctggc agatggcacc agaagttaag gtaagaattg cagacagagg 961 ctgccctgtc tgtgccagaa ggagggagag gctaaggaca ggctgagaag agttgccccc 1021 aaccctgaga gtgggtacca ggggcaagca aatgtcctgt agagaagtct agggggaaga 1081 gagtagggag agggaaggct taagagggga agaaatgcag gggccatgag ccaaggccta 1141 tgggcagaga gaaggaggct gctgcaggaa ggaggcggcc aacccagggg ttactgaggc 1201 tgcccactcc ccagtcctcc tggtattatt tctctggtgg ccaggcttat attttcttct 1261 tgctcttatt tttccttcat aaagacccaa ccctatgact ttaacttctt acagctacca 1321 cagcccctgg gcccgcaaca gttgttacag gttctggtca tgcaagctct accccaggtg 1381 gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg 1441 ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca 1501 ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca 1561 cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc 1621 cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc 1681 ccccagccca gggtgtcacc tcggccccgg agaccaggcc gcccccgggc tccaccgccc 1741 ccccagccca tggtgtcacc tcggcgccgg acaacaggcc cgccttggcg tccaccgccc 1801 ctccagtcca 
caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg 1861 tgcacaacgg cacctctgcc agggctacca caaccccagc cagcaagagc actccattct 1921 caattcccag ccaccactct gatactccta ccacccttgc cagccatagc accaagactg 1981 atgccagtag cactcaccat agcacggtac ctcctctcac ctcctccaat cacagcactt 2041 ctccccagtt gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc 2101 agtttaattc ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca 2161 tttctgaaat ggtgagtatc ggcctttcct tccccatgct cccctgaagc agccatcaga 2221 actgtccaca ccctttgcat caagcctgag tcctttccct ctcaccccag tttttgcaga 2281 tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggtac agttctgggt 2341 gtggacccag tgtggtggtt ggaggggtgg gtggtggtca tgagccgtag ggagggactg 2401 gtgcacttaa ggttggggga agagtgctga gccagagctg ggacccgtgg ctgaagtgcc 2461 catttccctg tgaccaggcc aggatctgtg gtggtacaat tgactctggc cttccgagaa 2521 ggtaccatca atgtccacga cgtggagaca cagttcaatc agtataaaac ggaagcagcc 2581 tctcgatata acctgacgat ctcaagacgt cagcggtgag gctacttccc tgctgcagcc 2641 agcaccatgc cggggcccct ctccttccag tgtctgggtc cccgctcttt ccttagtgct 2701 ggcagcggga ggggcgcctc ctctgggaga ctgccctgac cactgctttt ccttttagtg 2761 agtgatgtgc catttccttt ctctgaccag tctggggctg gggtgccagg ctggggcatc 2821 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg 2881 gtgagtgcag tccctggccc tgatcagagc cccccggtag aaggcactcc atggcctgcc 2941 ataacctcct atctccccag gctgtctgtc agtgccgccg aaagaactac gggcagctgg 3001 acatctttcc agcccgggat acctaccatc ctatgagcga gtaccccacc taccacaccc 3061 atgggcgcta tgtgccccta gcagtaccga tcgtagcccc tatgagaagg tgagattggg 3121 ccccacaggc aggggaagca gagggtttgg ctgggcaagg attctgaagg gggtacttgg 3181 aaaacccaaa gagcttggaa gaggtgagaa gtggcgtgaa gtgagcaggg gagggctggc 3241 aaggatgagg ggcagaggtc agaggagttt tgggggacag gcctgggagg agactatgga 3301 agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc // LOCUS HUMFABP 5204 bp DNA PRI 08-NOV-1994 DEFINITION Human, intestinal fatty acid binding protein gene, complete cds, and an Alu repetitive element. 
ACCESSION M18079 J03465 NID g182351 KEYWORDS Alu repeat; fatty acid binding protein. SOURCE Human DNA (library of T.Maniatis), clone lambda-HIFABP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5204) AUTHORS Sweetser,D.A., Birkenmeier,E.H., Klisak,I.J., Zollman,S., Sparkes,R.S., Mohandas,T., Lusis,A.J. and Gordon,J.I. TITLE The human and rodent intestinal fatty acid binding protein genes. A comparative analysis of their structure, expression, and linkage relationships JOURNAL J. Biol. Chem. 262 (33), 16060-16071 (1987) MEDLINE 88058967 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Sweetser, 19-JAN-1988. FEATURES Location/Qualifiers source 1..5204 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q28-q31" prim_transcript 1028..>4393 /note="FABPI mRNA (alt.) and introns" prim_transcript 1053..>4393 /note="FABPI mRNA (alt.) and introns" exon <1089..1155 /gene="FABP2" /note="fatty acid binding protein; G00-119-127" /number=1 gene 1089..1155 /gene="FABP2" CDS join(1089..1155,2350..2522,3546..3653,4098..4148) /note="fatty acid binding protein" /codon_start=1 /db_xref="PID:g182352" /translation="MAFDSTWKVDRSENYDKFMEKMGVNIVKRKLAAHDNLKLTITQE GNKFTVKESSAFRNIEVVFELGVTFNYNLADGTELRGTWSLEGNKLIGKFKRTDNGNE LNTVREIIGDELVQTYVYEGVEAKRIFKKD" intron 1156..2349 /note="FABPI intron A" exon 2350..2522 /number=2 intron 2523..3545 /note="FABPI intron B" exon 3546..3653 /number=3 intron 3654..4097 /note="FABPI intron C" exon 4098..>4148 /note="fatty acid binding protein" /number=4 repeat_region 4466..4742 /note="Alu repeat" repeat_region 4466..4472 /note="5' direct repeat" repeat_region 4736..4742 /note="3' direct repeat" BASE COUNT 1770 a 867 c 836 g 1731 t ORIGIN 156 bp upstream of HindIII site; chromosome 4q28-q31. 
1 gtaatatctt gggcaagccc tagagcttct ttcctgaccc ttagttaata agatgttatc 61 tggtcacatt cagtcacaat aatagactca ttttagtaat aaacatctta agactagtaa 121 ttaaaactct ttacttcaca ccaagtttcc tccccaagct tggcctgttc ctggctggca 181 gcctgaagta gggaaaggag agatatggtg accttttctt tgtacctttc tagctaccct 241 ctataccctg accccacata cataattgag ctgtggcttc tgactctact gggtttgggg 301 atgagaggca gtgagagtaa aatgaaggag tggttttaat taatggcaca gctaaaactg 361 gattttgttc tctctgcaca tggcagatgt ttaaagctca ttctttcttt tatgcaagtt 421 tttacaccat ccagcctcat ttgtacctct tgaatttttg ctcagtggcc tatcaccatt 481 caggatcaag acaaaaatca atgagcactt attgtgtgtc atgcacccta caaagtgcca 541 ggatatttat ccaaactcct ggcaatgcta aacacaatgc aaaaagacat attagaaaac 601 gaatcttatt aactttagct tttcaactgt atttcatcat aaagtcttac tttacaagat 661 aattgctgtt gtgaaaaagg gaaaggtcat ggtctcattt cccagatgtt atttgatata 721 tgctataaat tatattacct ccaacatagt ctgcactttg aacttagaaa aacaatcttc 781 agacggcatg cattctaatt cttgaaataa gtatgcccac aaactgtagt ttaagacaga 841 ataggtatgc ttctcatgtt ttaattcagt tgaatttcag aagatctcag gaatgtacag 901 aacgagaatt aagaattaat aagaataaga attaattaat tgcttgacat agagtagtta 961 ggtgatttcc tgaactttaa gcttccacat cacagtatga agttggttca agataagaaa 1021 tataataaat tctcgcccaa ggacagacct gaatctctag ctgcctagag gctgactcaa 1081 ctgaaatcat ggcgtttgac agcacttgga aggtagaccg gagtgaaaac tatgacaagt 1141 tcatggaaaa aatgggtaaa gactttattt ctttgtggct cattctttgc tttcttacaa 1201 acatttttct ttctaactcc taaatctcta ggagattaca gatagcttac agatagctcc 1261 tgatgtggta gagagggatc cagaagatgt tcagaggagg gaaaccatat tttcccttct 1321 tacattagga agaatccact atctcactaa tggaagaaaa gattctttga gtgctgttct 1381 ctgaaacaca ccaaaaagat ccagaaatgt ttccttcact ctttaactga aaaatgactt 1441 tttttgttgt ttacagtaag aaaatggcag cgtgtaatga taacttccag atctgaaaat 1501 gttaaattct aggagatgga aaaacaaaga ccatataaga aagtaatgga aaaagttctc 1561 ttaaaattta tagctctgaa taagttagat ttaattctga tttcttctaa cttaaaaaag 1621 ttttggaata atcttgagaa gctgtgtagt tttctccagg gcgtttaatt taactgattt 1681 ataatttgat accaatactc 
tggcagccca tatactatac aagataggca aacaaatttg 1741 tgtcattccc ctaaaagaaa aatctgcatc aattatagct tacagtttag gaactctaag 1801 tttaaattta taaaagttgt agattcttat agtgattttg gcttaatatt tgctaatttt 1861 ctcatttttg tgtcagaaag aaatgccaca agaagcaaat agaactataa agttcaaaat 1921 gttaaagcca ctaagaaaaa caaaggggca tttaagaaaa aagaatactg tatatgtgga 1981 attaaagatg tgcttcctta taaatatatg aatatacatt ttaatccttc atttaatatt 2041 tctagaattt gatttactta acactgaaat gaacagtttg ttaatcttat taaggttgct 2101 cagctctaag attctataat tctgtactct acttaatttt tctcaagtta tggaaaaaca 2161 actttaatca gttctcttga tcggattgaa cctgaacttc tatagaagca atctgaatgt 2221 tcttgtgcaa aggcaatgct accgagtttt cttcccaccc tcaaaataaa caaacaaaac 2281 ataacttgga aaaataaaca cttcctatgg gatttgactt tattttctcc attgtcttac 2341 cttttacagg tgttaatata gtgaaaagga agcttgcagc tcatgacaat ttgaagctga 2401 caattacaca agaaggaaat aaattcacag tcaaagaatc aagcgctttt cgaaacattg 2461 aagttgtttt tgaacttggt gtcaccttta attacaacct agcagacgga actgaactca 2521 gggtaagaat tttttttttt atgagcaatg cattcttgat ttttctaccc aatattaaaa 2581 tgatttctgc tctatttcat tggatggttt aattaatgca ggtctccttc actaactgaa 2641 gaagccaatg aagtttgtct acattatata ttacacaaat tggcagggta tttaaatatg 2701 cttttatttt tatacgcatc tgtgaagaat ctgaattgaa cagtaagaat tagaaaacta 2761 tcttttgaat gactgaatat agacctattc ataaagaaat ttaaaactgt gtttttaaac 2821 agtacagcaa aagaagcctt tagagttaat atgtaactta actgtaacat gttgaaataa 2881 taaaagaaat gaatagatga acaaatgagt gagttaccaa atggaaagat ttgatgtatt 2941 gtaggtcatt gggagtgtac cttttcatgt ttaagataac acattttagg aagtcatcat 3001 tttcaacaaa ttttttaaaa acttttttta gcctcaacat ttttctattt aaattacatg 3061 tttgtaatga caatttaact actgaatgtt ttatcgtaag ttatgtcttt ccttaattag 3121 taccacaatc acacaaatta aaacaagcac aggttattaa catctccgtg aaactaattt 3181 taaccatgac tatatttctg gacacgtaac atgaaagatt cagaaagaag tgctgctcat 3241 ctgccttaaa attcagcgta tggaaattat tgaagagaac aagcataatg gttatcaaca 3301 catactctgt agcccaatgg cctaggttca atcctcactc tgtgacttta ggtgaatcac 3361 tgtgccattt tacagtctcc tcttctgcaa 
agtagagata gtagtatcag tttcataggg 3421 tcaccatgaa gattaaatga aaaagtgtgt ctacagaact cagaacagtg cctgacatgt 3481 gtaagaccct aataaatgcc attattatta ttattattat tattattatt attattatta 3541 tgtaggggac ctggagcctt gagggaaata aacttattgg aaaattcaaa cggacagaca 3601 atggaaacga actgaatact gtccgagaaa ttataggtga tgaactagtc caggtgagtt 3661 gtcaaattta tagctatttt caaaaggcaa aaattactac aaaacaataa tttttgtcac 3721 tgctgagcca gatcttcagt aaactgacta cttcttttct cataaatctt actgatttta 3781 aaaatattgt atagctattt tctgatgcct atttactaaa gacaacttat atatgtcaaa 3841 taatcaatgc ctattttaac tgaaaatata aatgactaca aaccaacatg tgttttaaaa 3901 tggctgtatc ccatatctgt ataaatcttg ctatcaagta caagaaaaaa ttgtataaac 3961 tcatactcat ataatatata tgaatatata atataaaaat agtataaact catatagtat 4021 aaaactataa tactactttt tcttaactta gatgtaaacc ttaaagataa attcttctgt 4081 ttgttaacac ctttcagact tatgtgtatg aaggagtaga agccaaaagg atctttaaaa 4141 aggattgagc attattcttg gcgcacagtc caaaatacaa attggacaga agatctatat 4201 tgtaccagaa ctgtttattt caccccatca agtataaggt tactgattga ttggtccttt 4261 tataaacatt ggtatatttc cattcatgcc aaagcaaaag aagtaaaagc taattaggat 4321 ttaatttgtt ttatattctc taagatatat atttactaaa agaatttgtg acattttaaa 4381 aaacaaaaat aaatattgca tccatgttgc tttatatgta gccttgcctt ttaaaagaaa 4441 aagtatgtga atatgaattg acagattgtt ttcgtagaga gagggtctta ctctttcact 4501 caggctggaa tgcagtggag agatcatagc tcactgtaac ctcaaactcc tggactcatg 4561 caatcttcct gcctcaggct tctgagtagc taggactatg ggtacattcc acagtgccca 4621 gctaattttt gttttgtttt ctttttattt tttttagaga tggggtcttg ctatattgcc 4681 caggctggtc ttgaacccct ggcctcaagc aatcctcctg cctcagcctc tcaagttgtt 4741 tttttcttta catttgataa actaaaagca taggctgcat atgagtcttt aacatcttga 4801 actggttgtg aataattttc tggcactggt tgtaagtaat atctattatt ataaaaataa 4861 tatatgctca accagaaaac ttagaaataa gaaacacaaa tgtaaaataa gtatttccat 4921 aactcataat ccagagataa ttgccattct gattttgata gatatcctct cagctctctt 4981 ccctgggggc agatatttcc caatacatac cactttgaat aggatgatag gaaataaatg 5041 atgtactaca ttaaattaaa ttattgtatt acatttttgt 
acacatcagt cattcccagg 5101 cttggctgaa aatcaggatc atctgagaaa cttaaacaat ttctgcattc ttaatctcca 5161 ctgttattct attatatcag aatcgctaat agaaccaaga attc // LOCUS HUMFIXG 38059 bp DNA PRI 30-APR-1996 DEFINITION Human coagulation factor IX gene, complete cds. ACCESSION K02402 NID g182612 KEYWORDS Alu repeat; Christmas factor; KpnI repetitive sequence; antihemophilic factor B; factor IX; repeat region; simple repetitive sequence. SOURCE Homo sapiens (clone: FIX-lambda-[6,36,53,61].) (tissue library: T.Maniatis et al.) DNA; and Homo sapiens (clone: FIX-lambda-4243) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 38059) AUTHORS Yoshitake,S., Schach,B.G., Foster,D.C., Davie,E.W. and Kurachi,K. TITLE Nucleotide sequence of the gene for human factor IX (antihemophilic factor B) JOURNAL Biochemistry 24 (14), 3736-3750 (1985) MEDLINE 86000558 REFERENCE 2 (bases 23487 to 23556) AUTHORS Rees,D.J., Rizza,C.R. and Brownlee,G.G. TITLE Haemophilia B caused by a point mutation in a donor splice junction of the human factor IX gene JOURNAL Nature 316 (6029), 643-645 (1985) MEDLINE 85296286 REFERENCE 3 (bases 23378 to 23387) AUTHORS Graham,J.B., Lubahn,D.B., Lord,S.T., Kirshtein,J., Nilsson,I.M., Wallmark,A., Ljung,R., Frazier,L.D., Ware,J.L., Lin,S.W., Stafford,D.W. and Bosco,J. TITLE The Malmo polymorphism of coagulation factor IX, an immunologic polymorphism due to dimorphism of residue 148 that is in linkage disequilibrium with two other F.IX polymorphisms JOURNAL Am. J. Hum. Genet. 42 (4), 573-580 (1988) MEDLINE 88161064 REFERENCE 4 (sites) AUTHORS Hirosawa,S., Fahner,J.B., Salier,J.-P., Wu,C.-T., Lovrien,E. and Kurachi,K. TITLE Structural and functional basis of the developmental regulation of human factor IX gene: factor IX Leyden JOURNAL Unpublished (1990) COMMENT Sequence for [1] kindly submitted on floppy by K.Kurachi, 05-AUG-1985. 
[1] notes a potential TATA box (2939-2942) and polyadenylation signal (35701-35706); and notes two start codons (downstream of the start codon annotated below) that may be alternative and/or preferred starts for the factor IX prepropeptide. Several tracts of simple repetitive sequence are present [1], including regions with the potential for hairpin and/or Z-DNA formation. [1] describes six long open reading frames in the intron and on the complementary strand. FEATURES Location/Qualifiers source 1..38059 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FIX-lambda-[6,36,53,61]." /tissue_lib="T.Maniatis et al." /clone="FIX-lambda-4243" /cell_line="49,XXXXX" /cell_type="fibroblast" /map="Xq26.3-q27.1" repeat_region 1..2596 /rpt_family="KpnI" exon 2966..3082 /gene="F9" /note="G00-119-900" /number=1 gene 2966..35722 /gene="F9" prim_transcript 2966..35722 /gene="F9" /note="FIX mRNA" sig_peptide join(2995..3082,9291..9340) /gene="F9" /note="G00-119-900" CDS join(2995..3082,9291..9454,9643..9667,13357..13470, 20634..20762,23328..23530,33004..33118,33787..34334) /gene="F9" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-900" /product="factor IX" /db_xref="PID:g182613" /translation="MQRVNMIMAESPGLITICLLGYLLSAECTVFLDHENANKILNRP KRYNSGKLEEFVQGNLERECMEEKCSFEEAREVFENTERTTEFWKQYVDGDQCESNPC LNGGSCKDDINSYECWCPFGFEGKNCELDVTCNIKNGRCEQFCKNSADNKVVCSCTEG YRLAENQKSCEPAVPFPCGRVSVSQTSKLTRAEAVFPDVDYVNSTEAETILDNITQST QSFNDFTRVVGGEDAKPGQFPWQVVLNGKVDAFCGGSIVNEKWIVTAAHCVETGVKIT VVAGEHNIEETEHTEQKRNVIRIIPHHNYNAAINKYNHDIALLELDEPLVLNSYVTPI CIADKEYTNIFLKFGSGYVSGWGRVFHKGRSALVLQYLRVPLVDRATCLRSTKFTIYN NMFCAGFHEGGRDSCQGDSGGPHVTEVEGTSFLTGIISWGEECAMKGKYGIYTKVSRY VNWIKEKTKLT" intron 3083..9290 /gene="F9" /note="G00-119-900" /number=1 repeat_region 7298..7593 /rpt_family="Alu" allele 8469..8520 /gene="F9" /note="t...50 bp...a in FIX-lambda-4243; ta in FIX-lambda-[36,61]; G00-119-900" /citation=[1] /replace="ta" exon 9291..9454 /gene="F9" /note="G00-119-900" /number=2 mat_peptide 
join(9341..9454,9643..9667,13357..13470,20634..20762, 23328..23380) /gene="F9" /note="G00-119-900" /product="factor IX light chain" intron 9455..9642 /gene="F9" /note="G00-119-900" /number=2 exon 9643..9667 /gene="F9" /note="G00-119-900" /number=3 intron 9668..13356 /gene="F9" /note="G00-119-900" /number=3 allele 10041 /gene="F9" /note="g in one allele; c in another allele (loss of XmnI recognition pattern); G00-119-900" /replace="c" exon 13357..13470 /gene="F9" /note="G00-119-900" /number=4 intron 13471..20633 /gene="F9" /note="G00-119-900" /number=4 allele 14076..14079 /gene="F9" /note="tcga in one allele; nnnn in another allele (loss of TaqI site); G00-119-900" /replace="nnnn" repeat_region 18165..20265 /rpt_family="KpnI" variation 20416 /gene="F9" /citation=[4] /replace="a" exon 20634..20762 /gene="F9" /note="G00-119-900" /number=5 intron 20763..23327 /gene="F9" /note="G00-119-900" /number=5 exon 23328..23530 /gene="F9" /note="G00-119-900" /number=6 mutation 23387 /gene="F9" /note="g in [1]; a in [3] Ala->Thr; G00-119-900" /replace="a" allele 23387 /gene="F9" /note="g in one allele; a in another allele" /replace="a" mat_peptide join(23486..23530,33004..33118,33787..34331) /gene="F9" /note="G00-119-900" /product="factor IX heavy chain" intron 23531..33003 /gene="F9" /note="G00-119-900" /number=6 mutation 23531 /gene="F9" /note="g in [1]; t in [2] (haemophilia patient); G00-119-900" /replace="t" repeat_region 24172..24475 /rpt_family="Alu" repeat_region 25863..26091 /rpt_family="Alu" repeat_region 31537..31809 /rpt_family="Alu" exon 33004..33118 /gene="F9" /note="G00-119-900" /number=7 intron 33119..33786 /gene="F9" /note="G00-119-900" /number=7 exon 33787..35722 /gene="F9" /note="G00-119-900" /number=8 repeat_region 35948..36262 /rpt_family="Alu" BASE COUNT 12326 a 7397 c 7441 g 10895 t ORIGIN 333 bp upstream of BalI site; chromosome Xq26.3-27.2. 
1 gtatatctag aaaaccccat tgtctcattc caaaatcacc ttaagatgga taggcaactt 61 cagcaaagtc tcaggataac aaaatcaatg tgcaaaaatc acaggcattc ttatacacca 121 atagcagaca aacagacagc caaatcatga gtgaactccc attcacaatt gcttcaaaga 181 gaataaaata cctaggaatc ctacttacaa gggatgtgaa ggacctcttc aaggagaact 241 acaaaccact gctcaatgaa ataaaagagg atacaaacaa atggaagaac attacatgct 301 catgggtagg aagaatcaat atcatgaaaa tggccataat gcccaaggta atttatagat 361 tcaatgccat ccccatcaag ctaccaatga ctttcttcac agaattggaa aaaactactt 421 taaagttcat atggaaccaa aaaagagccc gcatcgccaa gtcaatccta agccaaaaga 481 acaaagctgg aggcatcatg ctacctgact tcaaactata ctacaaggct acagtaacca 541 aaacagcatg gtactggtac caaaacagag atacagacca atggaacaga acagagccct 601 cagaaataat gccacatatc tacaactatc tgatctttga aaaacctgac aaaaacaaga 661 aatggggaaa ggaatcccta attaataaat ggtgctggga aaactggcta gccatatgta 721 gaaagctgaa actggatccc ttccttatac cttatacaaa aattaattca agatggatta 781 aagacttcat tgttagacct aaaaccataa aaaccctaga agaaaaccta ggcaatacca 841 ttcaggacat aggcatgggc ttggacttca tgtctaaaac accaaaagca atggcaacaa 901 aagccaaaat tgacaaatgg gatcaaatga aactaaagag cttctgcaca gcaaaagaaa 961 ctaccatcag agtgaacagg caacctaaag aatgggagaa aatttttgca atctactcat 1021 ctgtccaagg gctaatatct agaatctaaa atgaactcaa acaaatttac agaaaaaaac 1081 aaacaacccc atcaacaagt gggtgaagga tatgaacaga cacttctcaa aagaagacat 1141 ttatgcagcc aacagacaca tgaaaaaatg ctcagcatca ccggccatca gagaaatgca 1201 aatcaaaacc acaatgagat accatctcac acaagttaga atggcgatca tcaaaaactc 1261 aggaagcaac aggtgctgga gaggatgtgg agaaatagga acactttgac actgttggtg 1321 ggactgtaaa ctagttcaac cattgtggaa gtcagtgtgg cgattcctca gggatctaga 1381 cctagaaata ccatctgacc cagccatccc attattgggt atataccaaa gtattataaa 1441 tcatgctgct ataaagacac atgcacacgt atgtttattg cggcactttt cacaatagca 1501 atgacttgga accaacccaa atgtccaaca atgatagact ggattaagaa aatgtggcac 1561 atatacacct aggaatacta ggcagccata aaaagaaaat gagttcatgt cctttgtagg 1621 gcatggatga agctagaaac catcattctc agcaaactat cgcaaggaca aaaaaccaaa 1681 caccgcatgt tctcactcat 
aggtgggaac tgaacaatga gaacacttgg acacaggaag 1741 gggaacatca cacaccgggg cctgttgtgg ggtggggggc gaggggaggg atagcattag 1801 gggatatacc taatgctaaa tgacgagtta atgggtacag cacaccaaca tggcacatgt 1861 atacatatgt aacaaacctg ctcgttgtgc acatgtaccc taaaacttaa agtataataa 1921 taaaaaaaag atcattctaa aatttataca agcccttaga acagttaaaa atatcttacc 1981 aaaagaagaa taaagttgga ggaatcactc tacctaatat aaagtcttac tacatagcta 2041 cagtaattat gacagtgtta tattggcaga gggataaata catcaatggc acaaagaata 2101 gatagagaaa ctggaagtag acccaaaaca atatggttaa ctgacttacg aaaaaatttc 2161 agaagccatt cagtcgagga aggatagggt ggtattgttg ttttttgttt taacaaattg 2221 tgctggataa attggacata cctatggaaa aaaaaatgaa gtttgaccta aacatcatac 2281 tttacacaaa tattaactca aaatggagca tgggcataaa tctaaaactt caaactgtaa 2341 aacatttaga aaaaaatagg aaaaaaacta tcaggatcta gtgttagtgg aagagttcta 2401 aatgtgatcc ataaaacaaa aacaaataaa ctggactaca tcaaaactaa aaaattctac 2461 tctgtgaaag acctaattaa gaggacaaaa gacaagctac aggctggaga caatatattt 2521 aatccacgta tctatgaaag gattcatatc tagaatatat aaacaacctt aagaatctga 2581 cagtaaaaaa aaaaaatcag actaactgga ccactcatac attgctgatg gaaatgtaaa 2641 gtggtacagc cattttggta aacatcattg ctctctgaca aagatacggt gggtcccact 2701 gatgaactgt gctgccacag taaatgtagc cactatgcct atctccattc tgaagatgtg 2761 tcacttcctg tttcagactc aaatcagcca cagtggcaga agcccacgaa atcagaggtg 2821 aaatttaata atgaccactg cccattctct tcacttgtcc caagaggcca ttggaaatag 2881 tccaaagacc cattgaggga gatggacatt atttcccaga agtaaataca gctcagcttg 2941 tactttggta caactaatcg accttaccac tttcacaatc tgctagcaaa ggttatgcag 3001 cgcgtgaaca tgatcatggc agaatcacca ggcctcatca ccatctgcct tttaggatat 3061 ctactcagtg ctgaatgtac aggtttgttt ccttttttaa aatacattga gtatgcttgc 3121 cttttagata tagaaatatc tgatgctgtc ttcttcacta aattttgatt acatgatttg 3181 acagcaatat tgaagagtct aacagccagc acgcaggttg gtaagtactg gttctttgtt 3241 agctaggttt tcttcttctt catttttaaa actaaataga tcgacaatgc ttatgatgca 3301 tttatgttta ataaacactg ttcagttcat gatttggtca tgtaattcct gttagaaaac 3361 attcatctcc ttggtttaaa aaaattaaaa 
gtgggaaaac aaagaaatag cagaatatag 3421 tgaaaaaaaa taaccacatt atttttgttt ggacttacca ctttgaaatc aaaatgggaa 3481 acaaaagcac aaacaatggc cttatttaca caaaaagtct gattttaaga tatatgacat 3541 ttcaaggttt cagaagtatg taatgaggtg tgtctctaat tttttaaatt atatatcttc 3601 aatttaaagt tttagttaaa acataaagat taacctttca ttagcaagct gttagttatc 3661 accaaagctt ttcatggatt aggaaaaaat cattttgtct ctatgtcaaa catcttggag 3721 tttgattatt tggggaaaca caatactcag ttgagttccc taggggagaa aagcaagctt 3781 aagaattgac ataaagagta ggaagttagc taatgcaaca tatatcactt tgttttttca 3841 caactacagt gactttatgt atttcccaga ggaaggcata cagggaagaa attatcccat 3901 ttggacaaac agcatgttct cacaggaagc atttatcaca cttacttgtc aactttctag 3961 aatcaaatct agtagctgac agtaccagga tcaggggtgc caaccctaag cacccccaga 4021 aagctgactg gccctgtggt tcccactcca gacatgatgt cagctgtgaa atcagactga 4081 aatgctgaaa taacgataaa aaaaaataca gaggttaaac tagcaaagtg agtaaagtca 4141 agggataaag aaaatttgtt ggaaaactca caaagcagga cataaagcaa ggccattaga 4201 tatatctcat tagtgtgaca tctgggagga caaagcatcc aaaccctttc ttctatataa 4261 gtggtgagat gatgaaggtt gtaagaggct tctgccccct tgaagacttc agatgctggg 4321 gaaaggatag ataagaataa ggatgaacct ggcttttgga gcctgggaaa taatgactag 4381 cgataaacct gaagggaagt taagtatacg atccccagat aatactaagg agaaaggcaa 4441 tgtgattctg cagccattgt agccagagat aataagccct tgaggaaggg gccaggggaa 4501 tttttctaag gatagacagt attaatgcag cactctcttc tgctattaaa ctctcattgg 4561 cttctaaaag gagtttcggt gagtgatttg ctgagatgtt tgcattttca tgctgctgcc 4621 tttaggttat tattgcaaca gtttggaatt ttgaaattaa aacagttctg taaaaccagt 4681 ttagttttgt aaagtgtatg catcaaagat gtccttcatt cagacattac tgagttacaa 4741 ctacggtgcc aggtactgtg tcagggtact aggggtatgg ggataaacca gactccctct 4801 ttgatctaaa gcagcatgag gccaggtgag aggtttcaat atatgtgata aaatgtgcac 4861 taggtactaa gggatcatag agaaaggaac acattaaatg gggaaacaat tgatagagag 4921 agaatatttt catctgggtc ttaaaagatg agtaggcgtt ctctctcttt aaatgtctga 4981 tataagggca ttttatgcaa agaaggatca ctcgtgcaaa gactcagctt tgcaagaacg 5041 tgaggtattt caggagtttt gtatggttcc atatggacta 
tgacaagtga gacaggtaaa 5101 ctaggcagag ctggtcatca gataatgaag tcattaacct aaggagattg gacaataaaa 5161 tgcaatatgg aggtatcgaa gtataaacat aaggagtacc actgatggct gatttaggat 5221 gcccagtctg gcaacacgct aatgaaatga tagtggggga gggggccgta ccaagactag 5281 gagagagcag tcctgagact attgcaatta tctgcgggag acataaaggc tagaacctga 5341 acagtagcag tacaaaaaaa gaggggagtt caaatgatat taaggaagta gaagtggtat 5401 gacttaacca tctgggtatg gaaggggaaa tggctagagt cttggggact ttgtgtttga 5461 tgtgattatg gaccacagaa taatgtctaa gagaactggc tctttagtct gactgccaga 5521 gtctgaatcc tgaatgtttt agtatgttac cttgcaaagc ccttagcctc tatgaatcta 5581 tcttcctcat ttataaaaat aagatgacag tgcctatctc gtgggacttt tgtgaggatg 5641 aagtgagata atggatgcaa agttactgag cacagtgtcc aacacagcag aagcattaca 5701 tatacattag ctattactgg ctacattatg atatacagtt agggagttgg aaagataatc 5761 tgaaattcag gagacgtatc tgactatagg tgagtatttg gaactcattg ttctgtaaac 5821 agtagttaca gcacgtgtgt gggcatctgg agagtgagca tggatattgt gatacctagt 5881 acagtgcctg gcagtagtgg ttgtatgctc agtaaatttt gttgacaggg tcagggccgg 5941 actagactgt ggtaagcaag gcctgtaggg cataaatata cttgtatgcc ccgagaagtg 6001 aggacctctt aaatattgtg ccctacatgc cttgtttggt tcactcttgt cccagcccta 6061 gcaagtatat ataaggtgaa aaaggaaaaa gctgaggctg gagcctggga gaaccctgga 6121 catttaaggg ccatggagag gaacaggagt taatcaattc aagtgctgga tggataacag 6181 gagttagagc aaagcgggga accagaatag agtgattatt ataaaaagag tttcctaaaa 6241 agggagagat caacaattag aaattattta gagcagccag taaatacata aactcaaaat 6301 tattctttag gtcattcctg attgtgacaa tagtcatttc attatataaa tgtgattaag 6361 gaaggaaaag agctacacag aagttattaa agagctaaag agaattgaga aatttaaaac 6421 agaagaaagt agggccaaca tgaaaggagt agggagaaaa agagataacc agcatattgt 6481 ctcactgatc ctgccaacac ctgtgagata gatattctta ttatactact gttaaaccta 6541 atttacattt gacaaagtta aggttcagag cttgtgtgac ttgtccaagg tcacaggtct 6601 agaggaggca gatacttgat tcaaacctat ttctgtctga tctgattcta aagtctgttt 6661 tttcactcaa ccacactgta cagtcagctc tccttgtgag ttccacagcc acagattcaa 6721 ttaactgcag atcaaaaata ttcaagaaaa aaatggatgg ttcgatctct 
actgaacatg 6781 tacagactct tttatctttc attattccct aaacaataca gcataacaac tatttacata 6841 gcatttacat tgtattagct attaagagaa acctagagat gatttaaagt acaaaggagg 6901 atgtgtttag gttatatgca aatagtaagc catttttata tcggagactt gagcatccac 6961 agatcttgat atttgcaggg ggtcttgcca ccaattttcc atggatactg aggaacgact 7021 gtaaatggat gcaggcatgg atgctattta ggagtgtcca gggccaagta aatgagttgc 7081 tgagcagaga ggtgggtgga ggctgtgagg catcaatatg tggtggcatc catctgcatt 7141 ttggtgattt ttttccttca cagtcctcgg ctgtctggga agagaaggat gaaggcagat 7201 ggctgctcca atttaggggc taggattgca gggtgggcac agcattgcaa acgagtgaag 7261 gaaattgaga aatatggcca atgaagagtt gaagagaggc ctggcatggt ggctcacacc 7321 tataatccca gcactttcag aggcccaggc aggcagatca cttgaggtca ggagttcgac 7381 accagcctgg ccaacaaggt gaaatggtga aaccccggct ttactaaaaa tacaaaaatt 7441 agctgggcat ggtggcgggt gcctgtaatc ccagctactt gggaggctga ggcaggagaa 7501 tagcttgaac ctgggagatg gaggttgcag tgagctgaga tcgcaccact gcactccagc 7561 ctgggcgaca gagcaagact ctgtcaaaaa aaaaagagtt gaagagaaaa agtctaggct 7621 aaattcaaag aaaaaaagtg agcccaaaag gaacttgcag agcaagggaa aagcagggat 7681 gtcaagggac tagaacactc cataaagtga acagctgcaa tgaaaataag ggaagaaagt 7741 ttagttcatc tccgtttctt tcctttcctt tttactttcc tttctcttcc tttttggagt 7801 tagtcaggaa gtagtcccaa ataccccaga aagttcatct tataagccct tggtcctctt 7861 gagatggtat cagatatatt gctagaccct tgaagaaagg aacaactcca ggcaacttct 7921 tgagtccctg ttattaattt tatacataca cacacatata tgtatataca tgaaaacaca 7981 caaacacatg tgtgtgtata cagccatgca ttccttaaca atggggatat attctgagaa 8041 atgtgtcatt aagcaatttc atcattgtgc gaacataata gagtgtactt acctaaacct 8101 aaatggtata gcttactaca tacctaggtt gtattgatgt ggcctattgc tcctaggctc 8161 ctgggctgca aacctgtaca gcatgtgact gtactgaaca ctgtaggcaa tggtaacagt 8221 ggtatttgtg tatctaaaca tagaaaaggt acagtgaaaa tacagtatta taaccttatg 8281 ggaccactgt cgtataatgt ggtccatcat tgaccaaaat gtcattgtgc agcaaatgat 8341 tatctcatat atatatatat atgatatgat atatatgata tatatgtgtg tgtatatatg 8401 tatacatata tatgtgtaca tatatgtata catatataca cacacatata tatgtacaca 
8461 catatatgta tatatatgta cacacatata tgtatatata tgtacacaca tatatgtata 8521 tatatatgta cacacatata tgtatatata tatgtacaca cacacacata gagagagaga 8581 gagaggagag gagaggaagg agggagggaa ggagaaatat gattcagata gagacatcta 8641 tcctccagag ttcaggagtg tctcttcaga ctaggtagat gtagcttaaa aaaaacatat 8701 cctggaattc tagagagatg cttaaatcac tgcaattcct ataacacttg ccaaccaaag 8761 gtgctgttga tctgaaattg cttttttaaa ttaatgcagt gatttttctt taacatctag 8821 tgacagacac tggggtcaca tttgcagctg gaccataatt aggcttctgt tcttcaggag 8881 acatttgttc aaagtcattt gggcaaccat attctgaaaa cagcccagcc agggtgatgg 8941 atcactttgc aaagatcctc aatgagctat tttcaagtga tgacaaagtg tgaagttaac 9001 cgctcaattt gagttctttc tttttcatcc aaagtaaatt caaatatgat tagaaatctg 9061 accttttatt actggaattc tcttgactaa aagtaaaatt gaattttaat tcctaaatct 9121 ccatgtgtat acagtactgt gggaacatca cagattttgg ctccatgccc taaagagaaa 9181 ttggctttca gattatttgg attaaaaaca aagactttct taagagatgt aaaattttca 9241 tgatgttttc ttttttgcta aaactaaaga attattcttt tacatttcag tttttcttga 9301 tcatgaaaac gccaacaaaa ttctgaatcg gccaaagagg tataattcag gtaaattgga 9361 agagtttgtt caagggaacc ttgagagaga atgtatggaa gaaaagtgta gttttgaaga 9421 agcacgagaa gtttttgaaa acactgaaag aacagtgagt atttccacat aatacccttc 9481 agatgcagag catagaatag aaaatcttta aaaagacact tctctttaaa attttaaagc 9541 atccatatat atttatgtat gttaaatgtt ataaaagata ggaaatcaat accaaaacac 9601 tttagatatt accgttaatt tgtcttcttt tattctttat agactgaatt ttggaagcag 9661 tatgttggta agcaattcat tttatcctct agctaatata tgaaacatat gagaattatg 9721 tgggtttttt ctctgcataa atagataata tattaaactt tgtcaaaagg actcagaaag 9781 atcagtccaa ccctctaacc catattggat ggtgatatac tacagggtta tgccagtgtg 9841 ggaactatcg ctggtaaata agtttaatcc tccctagggc ttcacaaaga acattgttcc 9901 accccaggag ggtggaagga agaaactgaa atgattgtgt cttagaacct aatgaaagtt 9961 tgcattcctc agtaaaatca gagactgctg attgacttaa atgtttatag cttcaaagtc 10021 ctcctcatta tcatggccca gaagcccttc catgattgtc cttccccacc ctccccatta 10081 cccttcttgc ctcctctgct acttctctcc tcgcacactg ggctccagcc accctggcct 10141 
tcctgtcact tcttgcacac tctaggaatg ctcccacttt ggaggcttta tctggctgtt 10201 tctcttattt ggctgttccc aacttcctgt gggctgactc cctcacctcc ttcgggtctt 10261 tgcccaaatg ttaccatctt aatgaggcct accttcacca tctattaata cttcaacctg 10321 ccccagtagc cttaccactc tagacacctg tacagaactc cactctactt tttaacagag 10381 cttttcacca tctaatgtat catataattt cttactcata ctatttatca tttattttct 10441 cctactccac taaaatgcaa gtttcatgtt ggcagggata ttcaattgtt ttgtttattg 10501 atatattcct agcacctaga acagtatctg gaaaagaggt actcagtaaa tatttatcaa 10561 atgaattgac caaaagaagg aaaactcaaa actttaatga caactaactt taaagctaca 10621 ataacttaaa attcagagta ggattttgag ggagggtaag tttcaaagat tgacttacct 10681 aagactatct gcataaataa aaagaaatta atccagacaa caaattcacc aacttccatc 10741 aattggaaat ccaattctat tttctacagt ttatgttctg gagacactac tggacactct 10801 tttactctca taactcataa ctcctccact tttgttttta aatcatgaga gaaaaagagt 10861 tgactctgtt atattgtttt atctaccttt ccttgatctt agaaacgaat actaccatac 10921 cagcttctac tgaggtgccc cctaaagtta gtccaaatag gtctttgcaa tctccattcc 10981 cgcagaattt agaactttga atcacatgat ttatttctaa aagtaaatcc atgccgattt 11041 tccccaccaa aaaattcctg actattaaac tcctacaatc ccttcattgc tcactcccca 11101 cccccaggat catattttaa agttgggccc ttgccttttg ggtcacatag gtacactgtt 11161 tgctatacca caggtatagc tatctggaaa acatggaggg tattattctg ttactactgc 11221 ttcgtcaacc aaaaaataaa acaaaacaag aacaaaaaag aaacaaactc cctgcctctt 11281 ttcacttgca gtcaaggttc ctaaccacta caaaattagc ctatgtttct tcttgcacat 11341 agtagaaacc caagcttctc actgctgtgc tattctgtac catcaactca tcacataaag 11401 agcctggttg aagaatgatt gtccaaccac attactagca tctgtcaaga ctttccagtt 11461 tacaaaaggc ctatcacatt taaccctcac accatccttg tgaccaaagc attattaact 11521 ccattttaca ggagagtaaa ctgaagctta gggaagttaa aagaactgcc aaaggtctcc 11581 cagttgggga gtcatgaagc ccagaagaga agccaaattc tctgctgctc aaccccttgc 11641 tttcactatt acacctcagg gccttcaaat ctaaatgcag ttattcatta aacaggaacc 11701 tggtagtctt aaacaggaat ctctcacttg gtaagatctt gtctcttgtt gtatttgacc 11761 ccaactgtct atggctttgc ctgaacccaa agtacacaca gcctagaaac 
caaaggagaa 11821 ccaaatgtgg gataaaatga cactcatttt aacgacatgt ctcagcaaat gagttcctgt 11881 gtagctggct gaaagcccag accctttcag taaaacatcc tgaataattc acatttgttg 11941 gtctataata taaagggcaa atgtagctca tttttagacc agttctgaac atcaatagta 12001 acaaaccaga gataaccgat tttgttttca tagaattgga acaaattaga gtatctgtgc 12061 aaaagcatat cagatctagg agcagagggg acaaggtcta atttttaaat aagcaaattt 12121 tccagggagg gactacttat gataaaggga tattagtctc ttagtcaacg gaacctggat 12181 acacgcttct gacagagaag agggagaata ggcaggaatc tacacaccag atgtcaagga 12241 gatttgcttt aaaatacgac tgataattag aaatttctca gtttccccct tttccctcat 12301 tctttgattc ttattgttat ctttatctct tactcctttg tttctcatat attgagtctt 12361 acagatcaag ctcccatttt tttcttcagg ggtatttttc tagttcaaag tgcctaccat 12421 ctcccttctg gttctattca tccttctctc ccaaagctcc tttagaagtg tggattaagg 12481 cagagcacta agaaaccaga cttaaagatt cccttctcat tctgactttt ctcctttcac 12541 ctattccttc ctcctgtttt cttaccatca gtgtcttcaa aggctttcaa gtacacggta 12601 aatgcagaaa cttcaagaaa ggcagaatgg aaacataacc aatgcataca taaataaagc 12661 acactgtaga atctttttaa attctgtatg atatatcgaa tgctgtctct cacattacct 12721 agaccatttg aaaccgaatt tgtaaaacat agactatctt taagtagtaa cagatgcttc 12781 tgacatgttt tctattgtct tgaaccatta ctgcatatga tacatcaaag ttaagtgaca 12841 atacaagaaa gcagattcat ttgctccctg cctaggccgt cagttcctaa agtggaaacg 12901 ccatatatta tctagctcag tttgctctac aagacctgca atagagcctt gtgtgacata 12961 gagataatat ttgttgaagc aattaaattt gacttggaat taactctgcc atcattctat 13021 aaggaaggat tgaaaatcct tctcaccctg tgctgatata gtacctttct atacaaaaac 13081 gtccttctcc ctcttccctt ggattgcata aactatgtac atgccttcct caggggcact 13141 tttctaggac agtgtcagcc taaggatctt tgtttgggtg gcttttagaa actcaggaag 13201 acaggagcat catatgccta taggcagctg gcttccaggt cagtagtttt gctctgaccc 13261 taaaatcaga ctcccatccc aatgagtatc tacaggggag gaccgggcat tctaagcagt 13321 ttacgtgcca attcaatttc ttaacctatc tcaaagatgg agatcagtgt gagtccaatc 13381 catgtttaaa tggcggcagt tgcaaggatg acattaattc ctatgaatgt tggtgtccct 13441 ttggatttga aggaaagaac tgtgaattag 
gtaagtaact attttttgaa tactcatggt 13501 tcaaagtttc cctctgaaac aagttgaaac tggaaaatgc aatattggtg tatcataatt 13561 tttcttaaaa acataccttt gatgcttata aacatttcat ttgtagtgat agttttcagg 13621 atatgagttc aagaagctac attaaaatca ataacaatat ttggtaacta atattaagta 13681 ataatgatgt tccgactcac cttattaatc tttaatacaa ccgtatgtgg ttagtactat 13741 cattatgcgc attctatgca gatgagaaaa ccgcaactcc aacggccaaa aattacagag 13801 gcataaatgg tttagacagg acttaaactt cagtgtgacc aaaacccatg cttctaacta 13861 ctatattcaa aactcagaga aaactgaacc cagaaaattg aaatcatgac taaattgcta 13921 tcaacatagg tgaaagtcaa ttaagtacag aactggagta tgactggcca attatcccat 13981 ataatgggaa ttctccacat gtacaaacca cttcatatgc taaacttgtt gacaacattc 14041 aaagctcatc cctgaatttg actatattga ttacatcgaa aatgttacat agcaacctta 14101 gaatccttgt gtaccttttc ttctcaaagc ctagattatt tctttttccg acgttttcag 14161 taattggagc agtaaacccc agtgtccctt acctacttgt ttattacctc cagatgcaat 14221 attactggta ctgtgattga gaaacgcaca cagtgctaat gaggaattca ctttctactc 14281 tgacactctg gaagaataga gatgcaatcc taaggaagaa tttaacacca caggctacat 14341 gactaaggat aaagagtaga aaattagcag gactctatta accgattaca gcaatccacc 14401 tgacagatga aaaaggcatg aaatgaaatg aaatgtagca gctacactcg tcctattgag 14461 aaaggaaaaa agtcacctgt aatgttgttc agaaatcctt tcagtactaa aaaattcatt 14521 gaccatcttc ctttagtctc gaaaatttct tagaaggtaa aaaaaggaaa aggtgacagg 14581 gcaaagacat ttgaaaagaa agaaaagagt gaatgaactt gcacacctgg cttggactcc 14641 ccattcccct taggtttcca ttgtggggga caaactaatg cctgggttac ctttcttgag 14701 agtgtgttaa ttgattcaat atctctgaag tgctactttc atctgaaagg ttataatttg 14761 aaattcagat ttacctggat aaatttgatc ttgctattat ggaaacctct agaaatcctt 14821 ggagtagtta ctcattatca gcttaaataa tatagccggt ggagctgagg gaatgagtaa 14881 ctcaattagt ctcagttaca actgaagggc acattgttgt aaactataat tgaaaacata 14941 aatatcttta cctagtttaa aaaataaaga tgctttaaaa ggaggaaggg aatagccctg 15001 aggaatgtaa atataagcac aaaacttcta caacagagtt tgctacgtgt gtggctgtgt 15061 tccacccagc aaaaatgcta agtctacaac tgacacaact tggatactct catgttccca 15121 cattttggtt 
tggtcaaggc tgtgcagttg tactgcaggc caccaccact cctggcctct 15181 acagtatatt gatctgaccc accaatctga tcaaggttta gaaaaatatt ttcagcccag 15241 ttagctcaca aacaaaatga gaattcccac aaattgctct ttatctcaga caacagagga 15301 aagctacagc aaaagcataa acaaattacc atttaagttt gttgcttcaa attaaagact 15361 aattgcaaca gctactagat agcacagttt atggggcatc tcggccccaa gtcttttgtc 15421 ttataaggtc ttgaaaaaaa gaaaggagat tttcatcaat aagagttttt tgttatcttt 15481 ttcccttgtt catcaggccc ttcactgcga gagagaggtg taaacgttca gggcatgcat 15541 tctagttaaa gaatattaat tggctattgg gtccctttgg ttagaataaa gacctctgta 15601 tgatgtccct agctgtacat caaacccaaa tatctctcag ataaatgaag gtctgtaaga 15661 atttggtcat tcctgtctct tctaaagagt aacagaggca ttttcccgca gtaaagtaga 15721 atggaaagaa aacaaaaatc acaagcctat aaacaccttc ttcaattttc ccagcatgtc 15781 acagacacta ctgtcttatt tactacgtat ttctgaggag taaaaaaagg aaatatgttg 15841 agtttagctg aagcacagca tattttgtgg taaacttgtt aaataaaaca tcttttgtcc 15901 aagctttggt tgtcacacaa gtggatatat caggaaatat aaaggcagaa taaactaaag 15961 cagaacatac taacatttgt agtaggcatg aagggaatta gaaagtgttt gtgttaacat 16021 ggagggaggg agaacagatg ctttgagatg ttcttcaaca gatattctag gcactgagac 16081 ccccttcggg accagagaga gcccatatcc accacagtac ctgacacata aatgctcagt 16141 aattgataaa tgagtcccat tctaactgtt ccttagccct gctctatgga actctcccct 16201 gaattccttg tgccattatt ttatttctgg aatcttcagc cttttagctg agggcaaaag 16261 attgctgatt aggaagcaat atttcccacc tcctgcgcaa aacaagccaa agatcaacag 16321 cagcagcaac atactgagcc ctaaagggca atgacaaatg tggagaatga tacagaggtc 16381 tggttacttc ttagccaatg acacagaatc acaattgaga aaacacagag tttattcatt 16441 cccattgtgc atgccctgga caaaccaagc tgcacctttc gtaacttatc acaatctcat 16501 attgacggaa cactttctac aggtaatgtt tgatttggct gaacacttta gcattgcttc 16561 gtagcaacaa aatgatagct agtaacagaa aaagatccag ggatattacc actgttagtg 16621 aggagaaagg ccttttaatt aattaattaa ttaattaata ggaccaagtg ccatcttttt 16681 ggatcatgcc cttagtggat tattggtagc aaaggttaaa gctcaagctg gttcctttgt 16741 ccccctggca acagttgatt tgcctccctt atctcctgaa gtaccgtaag gactaagagc 
16801 caattattac atttggctat gctagcatat gtaaaataga gtttaaaagt ttagattcat 16861 cactcaaaaa ttcatattct ccaaaaccat acagtcactc tgttagcctg tgttccccca 16921 gaaaaaaagt cacaagctta ttattaacat gtgcaatcca ggggcaagag aaaggaactg 16981 aagatgaggc agaaaggaaa agaaagccaa taagaggatg agttatcaaa ctactcgttt 17041 cttaacagca actgattgct taacttcctg ggactgtctc caataagtca aattggcctc 17101 aggttagtcc acctgagtgg gaagaagcgg tgaaagaatt tgtctgtcag tatctgtctc 17161 tcattggtta gaagttcgac ttatggggaa ttaactccct cacatttcct agttggatag 17221 cttgggtacc agaggcatat ggcatccatg ctcagcatga acagggaagc ttcaaggcaa 17281 aagacacata gtgcagctat gagccaaggc aattcaagga tacacccata ggaggctggt 17341 tgacatccac ccagagctaa tcaccaccat gctggaaaaa gacacaggtg aagctgagaa 17401 gaatgaaggt ggtgcatagg aggtatctaa tacagtcact cattttcaaa ctttccatgt 17461 tatgattgca ctgaccactg aggatttcta ttgaaagttt tactgttgtc aaacacgtac 17521 acaaggggaa aggtgtctta cattgtttat gttcctgtgc tgctctagaa acagaaatag 17581 gctcaagagc agagcctgtt tttcttaatt cagcaggtct aagctaacaa gtcctgaaac 17641 atggtacttc ctgttattgg tattgcatag gagaaacaaa gggaaagcac agtaattaga 17701 aaatacaaac aagatggcag gaataagcca aaaatatcag gaaacacaat tattgtgaat 17761 tgggattaaa ctaatctatt aataatgaca actttcagct tggagttaaa aatttaattg 17821 tatactgtta acgaaagtga tacctaaaat aaaattacac tgggaggcca aaatgaaggg 17881 atgtgaaaag aactatcagg taaaaactaa caaaaagaaa ctagcaaagc aatcttaata 17941 tcagacaaaa tagaatccaa gaggaaaatc atttcaaaag acaagagatt ttttttatta 18001 ataaggggaa ttgcatagga gagtaaagaa aatgtgggcc actggaatgc ttagcactaa 18061 tgacatattg gtctttggtc ttcagttacc ttacaggacc ctatttcatt ctcttatgtt 18121 tgatatgtaa ccacctcagc cagcttcaag ttgctttttg gccctaatgg acttcctagc 18181 actataattt cttttttttt aaatgtttta ttttaggttt aggggtacat gtgaaggttt 18241 gttacataga taaacatgtg tcacaggggt ttgttgtaca tattattaca tgacgcagat 18301 attcagctca gtaccaaata gtgatctttt ctgctcctct gcctcatccc accctcctcc 18361 ctcaagtaga ctccagtatc tgttgtttcc ttctttgtgt ttataagttc ttaacactta 18421 gctcccgctt acaagtgaga acctgcagta tttgattttt 
gttcctacgc tagtttccta 18481 aggatgatag cctccagctc cattcatatt cccacaaaag acataatctc cttcttttct 18541 atggctgcat aatattccat ggtatatatg aaccacattt tctttatcca gtctgtcatt 18601 gatgggcatt taggttgatt ccatgtctgc tattctaaca ctgtaatttc taaagacttc 18661 cagattctac ttttataggt aacctgttaa acagtctagc tctggaagcc aagcaatttc 18721 tagaataact aagcaataga aattacactt caatgcagaa aggcagtatc tacatgagat 18781 tatgaaattg cggttgcttt ttgtgttcac tgaaaaaaat aagtaaaact gtaactttca 18841 gaaaaaatga ttgtacatat agaaaaccca aagcatctaa acaattaaaa taaataagta 18901 tagaaagatt actggataca gagtcaacat acaaatatca attgtatgtc tatataccag 18961 caacgattca aaaatgattt ttataatagc attaaaaatt agacgcttag taataaatgt 19021 gagaaagatg tgcaagaact ctacataaaa aattatgaga cgttattgag aaaaattaag 19081 gaaaacctaa ataaatgaat gaataggcaa tgtttatcat taaaggatac aatatagtaa 19141 atatatcaaa tgtttactaa tggattcaat gcaataccaa agtgccagca ggcttttttg 19201 gtggtgggag gtcgggcagg attcataagc taattataaa atgcatatgg aaatgcaaag 19261 agccaaggat agccaagaca gttttgagga agaataaact tgtactactt acactaccag 19321 atgtcaagac ttattatcga gttacattta ttaagacagt gtggtactga cacaaggata 19381 gacaaataga tcagtgaaac acactagagt gctcagaagc acacctgtac atatataaag 19441 gcttgattta tgatagaggt gccagtgcag tagagaagga aattattggt gttttcaata 19501 aaaagtgata ggtcaattag atattcatat ggcatgaagt atgaaacaat aacaatttat 19561 attcataact tgcagaaagc aaaaatttct taaaatacaa aaagtgatca ccataaagga 19621 aaagattgat aaactggact atattaaaac taaggactcc tgttcagcaa aagacactac 19681 ttcgactgaa aagacaagtc acagagtgag acaagatatc tgcaatacag atacctaata 19741 actgaacccc atacagtgat ggtgggaatt taagttcgta caatcatttt agaaaattgc 19801 ttggcagtat ctactagatc tgaacatgtg atccagtaat tacactcata attataagcc 19861 agtaaaaagg catgtttatg tcaccaaaag atatatacaa gaatgttcat tacactatta 19921 tacataagag ccaaaaactg gaaacaaacc aaatatccat taacagtaga atgaataaat 19981 aaaagctgta atagtaatac agtggaatac tacacagcaa tgtaaatgaa ctactgctgt 20041 acaaaacaac atggtttaat ctcacagaca aaatgttaaa tgaaagacac agacgagtac 20101 atattgcgaa cttctgttta 
taattcaaga actggcaaga actgtttact gtgttagaag 20161 tccaggtaat ggtaacctat aaaaaggaaa aagggtggaa tgattgggag ggggcatctt 20221 ctggggtctt gataatgtgc tatgtattgg tcagtttagt gtttaaacag gctcatttac 20281 tttgtgaaaa cttacactaa aattgtgtgt attttttgaa tatatgttat acattaataa 20341 atagggtttt taaacctgta gttcataatt tagtgaaagt agaatatcca aacatttagt 20401 tttaaaccaa tcaattatag tgctaccatc atttttatgc attattgaga agtttatttt 20461 acctttcttt ccactcttat ttcaaggctc caaaatttct ctccccaacg tatattgggg 20521 gcaacatgaa tgcccccaat gtatatttga cccatacatg agtcagtagt tccatgtact 20581 ttttagaaat gcatgttaaa tgatgctgtt actgtctatt ttgcttcttt tagatgtaac 20641 atgtaacatt aagaatggca gatgcgagca gttttgtaaa aatagtgctg ataacaaggt 20701 ggtttgctcc tgtactgagg gatatcgact tgcagaaaac cagaagtcct gtgaaccagc 20761 aggtcataat ctgaataaga ttttttaaag aaaatctgta tctgaaactt cagcatttta 20821 acaaacctac ataattttaa ttcctacttg aatctgcttc cttttgaaat catagaaaat 20881 atcagtagct tgaattagac caattaattt tctagattgc atcatatttt aaatataact 20941 atgtaatcat ctacaacctg aattctttct gtgtccaatt tgtccaattt ttttctctaa 21001 catttatatc acaaagcaat taatttgtgt gatttctgca tatgtatttg taattcatca 21061 agtcaaatca atgtagtaat actatatcat aaaatataca caaataattg agtgataggc 21121 ttctagtata aggacggtaa gtttgaagca tgattctatc tgggctggct agtttactct 21181 gagaaagtta ttttttattg ttgggtctta agctgagttt acacacttgg tgtcagaatg 21241 attccggcaa tgaactgttt tatgttctgc taggctgatc agcacaatct atatggctgt 21301 gaacaaaaca atgtttccca gtcataccaa ccatgccacc attttaacag ctgattagtg 21361 tattcagaac atctccactc catgttcgta tggctgttat ctaaagatga aagcagtaga 21421 cacttttatt ttttgaaaaa tttaggctct gcagggtcaa ttatatttga taaatgaggg 21481 gcttttttga agcaaactag atataatttc ttttgcattt ctaaagcctg atatcttatt 21541 aattggtaca ttaaattgtg caccatttct ctgtaactgt ttcagtacct gtctcagcac 21601 tataccaggc agaagaaata aagaaagaac cagtgccaga tcagcttggt caggagaccc 21661 taatcctgcg gcactagagg aattaaagac acacacacag aaatatagag tatggagtgg 21721 gaaatcaggg gtctcacagc cttcagagct gagagccccg aacagagatt tacccacata 21781 
tttattgaca gcaagccagt cataagattt actgaaagta ttccttatgg gaaataaagg 21841 gatgagtctg gctagttatc tgcagcagga acatgtcctt aaggcacaaa tcacttatgc 21901 aattgtctgt ggtttaagaa cacctttaag cagttttccg ccctgggtgg gccaggtgtt 21961 ccttgccctc attctggtaa acccacaacc ttccagtgtg gatatcaagg ccatcacgag 22021 catatcacag tgctgcagag attttgttta tggccagttt tggggccagt ttatggccag 22081 atttggaggc ctgttcccaa caaaccagaa gctaggaata tatatcctgc aaataaaatg 22141 aagaatctct aaggcttcag ggcctgccca cttgttcttc tgcctggttc ttcacataca 22201 ctgtctcaaa gctagtctac cttgagagga gcatgaatat gtgtgtgggt gtgtgtctgt 22261 gtattttaac cttaaaaacc taacttccag tatagacaga tggcatacta gctaaaccct 22321 tacaagttct tctatgctat aaaagagaaa cagaattgag aaccacctcc aactattaag 22381 tgttatattt gaatatagcc ttagctttag cagaataagt aggccaaact taaaataagc 22441 ttttctgcct tttcaatgat aaaggtccct tttctgtagc cattgttgat tgtgtacact 22501 tatacataag tattttgaac taatttcctg ttttctcaac cacttgctgt cttcatgata 22561 ctttgtcgca gctggttgct atagaaatgt ctgttacaag gaatgtggct tgaaggaaag 22621 tgataaatga aaatgaaatg tgaagtgact ttgtttgact acaaattccc attctggtag 22681 tccccagtgt atcaatacat tatttttctt tagaaaataa accaacccaa ggaaaaatgg 22741 tgggcaggtc ctggtgaata tggctgtgat aattatatta gcaatctctt tggctaatat 22801 ttgaagccca aataattgaa tcacaatgat ctctccccag aaaatatata aaatgcacct 22861 tggaatctag aaggcctttt agtctgcaaa agaaaccttc ttaatcataa gcagcagaag 22921 tcccatttac caaattggaa agttaaagtt acaaagcatc aatcatcaga cttccattca 22981 gggatggcaa ttgggagtaa gactttttag taaagaaact aaacacaaag tcattagact 23041 ctgtaaaagt cttaccaaat ttgattctgg aacacctatt ctatttccgt aaagatgatg 23101 aattcggagc caaatgttct tttcatgaag gatttgaaaa ctgtccatga aaataacgca 23161 atcaaccttt tagcttgaga ctctattcac tgattagatt tttttaaata ctgatgggcc 23221 tgcttctcag aagtgacaag gatgggcctc aatctcaatt tttgtaatac atgttccatt 23281 tgccaatgag aaatatcagg ttactaattt ttcttctatt tttctagtgc catttccatg 23341 tggaagagtt tctgtttcac aaacttctaa gctcacccgt gctgaggctg tttttcctga 23401 tgtggactat gtaaattcta ctgaagctga aaccattttg gataacatca 
ctcaaagcac 23461 ccaatcattt aatgacttca ctcgggttgt tggtggagaa gatgccaaac caggtcaatt 23521 cccttggcag gtactttata ctgatggtgt gtcaaaactg gagctcagct ggcaagacac 23581 aggccaggtg ggagactgag gctattttac tagacagacc tattgggatg tgagaagtat 23641 ttaggcaagt ttcagcacta accaatgtga gaaggcctcc agagatgagc agttggtgaa 23701 agagaggctc aaaaccagct accatacagg tcaagaagaa tttggcatta aggaaacagc 23761 atagcaggat tccagacagg caactggtca acaacatgaa ggtctggaag aaaggtcgca 23821 gtactcaggt tcagggcact acttcagctt cagcccttgc aaaaactggt gagagttgga 23881 aagtctttag gctaagaaaa attggattat ttaaaagggg taaagaaagg gactcaagga 23941 ggaaggatta aggcaagaac taggttccaa gaaacagggc atgagagaga gtcttgatct 24001 accactatag ttctcgtggt agcatcagaa tcacctggga acgtagaaat gcaaattctc 24061 ctgctctaca ctagacctac caaatcagaa tatctagggg gtggggccca gcagtctgtg 24121 cgcaaacaag cactgcaggt gattttgatg cacattatag tttgaaaact aggccaggtg 24181 cagtggctca tgccaataat cccagcactt tgggagactg agacgggagg attgcttaaa 24241 cccaggagtt tgagaccagc ctgggcaaca cggcgaaacc ccacctctaa ttaaaaaaaa 24301 tacaaaaatt agctaggtgt gatggctccc acctgtgctc ccagctattc aggaggctga 24361 ggtgggagaa tcacctgagc ctggaaagtc gaggctgcag tgaattgtga tcacaccact 24421 gcacttcagc ctgagtgaca gagtaagacc ctatctcaaa aaacagaaaa agaaaaacac 24481 tggcccaaag gaaatgaact tgttacagaa gccggggttc aaaacaccaa ataatgcact 24541 tgtacctagt ccttcccggg tgctctgcag acatttctcc aagcgtagtc tgcaaacaac 24601 ctacatatgt agaattacct atgcacattt ttcatttaac aaccaagagc tacatttgta 24661 gcaaaatctg ggttgtaact tagcctacag ctgaagccta agagattccg tctgtgagaa 24721 gaaataaccc acctctttgg cccccctccc caggcaggaa gccaggatgg tccttatata 24781 aagttgtgct gtccaatagg taaccactag ccacatatgg ctatttaaat ttaaattaac 24841 tacaattaag agaaattaaa aattcaattc ctcaattgca cctgccaaat tttaagcaca 24901 taacaaccac atgtggctag taactactgt attggagagt gcaagcggag atagaacact 24961 ctattactgc agaaatttct attggatagc acttataata gtttagtgta acttaaaact 25021 ccctagttgc cacagtcatg atttagtagt aatttcatgg atttctctac tgaggttaga 25081 atctctgcca ttagagactg ataaatttaa 
agtttgcaat tatcaaactg gtgacaattt 25141 aagccagaat caggtaatgt cctcagtttt aacagcattg gaattttctg ggactagctg 25201 tgtatctatc caggattctt gagaatgcct gccatttttc aacataatgg atgtaaggta 25261 ttacacatat acctggggat ggggaggtag gtataattgc acaagcattg tggagaatgg 25321 tatcaaagag tggcagaaca tcacaatcaa ggttttccct ttcttttacc tttgcttttt 25381 aaaaagacaa tatttgctgg acctgatctt ataactcata aatgggacac tgtatgttcc 25441 tttttacctc ctctgtttct acttaattgc accctatgag gactgcttcc cttacctacc 25501 ataacccctt ccttcactca tccatatctt tactcttctt cacaactctg taatattgac 25561 cttctttatg aacctttcct ggaacaatcc ctcttaatgc aagcactgtt attatgcctt 25621 caatgtattt aatatccatg tatctattct ctctaatttt gtcattttgt gttctcatgt 25681 attttcattc attatgtgtc caacttccat ggataacatg gttacaacaa aagatcctac 25741 tttatgacaa ttatcttcct tgggtttgtg ggacatagaa cagtgcacag agtaggggat 25801 ccaagaaccc aggagaatat attagctaag aagataactt ccgtttttaa aagtccaaga 25861 ttcaggagat caaaaccatc ctggctaaca tagtgaaacc ccgtctcttc caaaaataca 25921 aaaaattagc ccggcgtggt ggcaggcgcc tatagtccca gctacacggg aggctgaggc 25981 aggagaatgg cgtgaaccgg ggaggcggag ctggcagtga gccgagatcc cgccactgca 26041 ctccagcctg ggcgacagag cgagactccg tctcaaaaaa aaaaaaaaaa aaaaagtcca 26101 agattttaaa aaaaaaaaaa aaaaggatgt ctgctttgtg agtttagcat tgtctccttg 26161 tcattccaga aatgaaatgg caaatacatt taaatcagaa ctaaaaaggg gaacagggta 26221 taaaggctca atttagtcac atcatttccg tttctcaccc acccccttta aaccagatgt 26281 ttgccaatgc attaacaatg cagatgtttc ctgaaagaaa gtttagtaac tcaagcagac 26341 accttatttt cttttcaagc agaaaagact atgagatggt ggttgtggtt gttccgggag 26401 ggagaagata taaatgatac acattatttc aaatcatttc atgacctcac tgcacactta 26461 tagttattgt acctgttgtc tttttgctgt caagcctagc taagatcatt tggaatgttc 26521 aagatcactc atacatgcat gtgcacacat acacatgcac atatgttcac tccctatttc 26581 atccacatga actaagatta ctgatgtgta cagattcaaa gcacttttat tcttttccaa 26641 aggcaagaag ctgagctact ttccagaata gttgtgaaag accctgtcat acttctgcat 26701 tgtttcctcc acaccacctc catccagttc cttatgaatg gttactggtt ttcaaaaata 26761 tgagataaat 
tgagtgtata aaagtcattt ttagacaaaa tgaaacagga aatgaaagaa 26821 accagaatct ctcctcattt gtggatgggc cagctccacc atgtcatggt taatctgcag 26881 ggaggaaata ctagatttga ttgcagatca gactgcagca aacctgctgt gactaaggca 26941 tcaagagaaa gcaagcaaca gactggggct tcagtggtga aaacattata tatctagctt 27001 tgaaatatga aatactgttt agcagtgtca cctagaaaag agtgtttcaa aatgctgatg 27061 cttcataaga acctttctct tcagagttgg tttcttttat ctttcaaatt agccagggtg 27121 ggaaataaag tgatcacttg gtgaagaaat ctcacaaaga agaacataga gagttcactt 27181 tcatctggag taatgaacag attgaacaaa ctagaaatgg ttagtctgtt aaagaaaagg 27241 tgtaggtgag ctgtttgcaa gagccacaag ggaaagggga agacaacttc tttgtggact 27301 taagggtgaa agttgcaagc aggcaagacg attctgacct ccattaagaa agcccaaacc 27361 aaccaacaac cactgggttg gttacgcagg ttgggcagca ttgggagcaa atgttgattg 27421 aacaaatgtt tgtcggaatt gttgacttaa agagctgttc tgtcactggg gacagcagcg 27481 gctagatagc cccattcagg gagagggcat ttgttcacct ggccagagat cagagcaggc 27541 taagggactg ctgggatcct gtccagcttt gagaccctac agagccatgt tcacctagca 27601 cgtatcccgt ctgcggtcac ggtcatttct taccttattc cagggctttc acctcagctt 27661 gccaggctgg agccaagggc caacgcagcc gcgccttgtt cgcgatggta gcttcccagg 27721 agccccctat ggttccggaa cgcgctgccg gccccatcct gtttgctacc tcctaaagcc 27781 aaaggcactg gcgggccggg ccagcttcta aagtcgcgca aggttagaag gttccggaca 27841 ggaacggcgt gaggccaatg gaaggaggta cttcagtttc cctccagatg cccagcgatg 27901 ggctcagagc tccttgagaa ctcgggaaag gaagcagggt ctctgaagaa atacttcagg 27961 agtagaaaga ggaagctaga gggttaaatg cactacacag gaacagaaat gagtttttct 28021 tagagttagt atatgtctag aggtgtagta aactaaaaca agtcttgaat tgcataccgc 28081 cacgtaggga agaaatgaaa acctttgaat attagtgaaa aaagggaaac tgcaacgcct 28141 gtattactag atagctttca tcaacagctc aaaaccgaca gatttaaaga agcaacaccg 28201 cattttggct ttctaaagct ttaatttggt ttggatccca tgcccatgac cctgccagct 28261 gacaattcta agcatgcgca aactggcccc aaaaattcct cccacatttc cgaagaacta 28321 tttggccctt tatgtgaagt acctggtttt tccattttct gttttaccat aggcctcagt 28381 tcggtgtgtg gcgtatttat tctacattta acaatttgaa gatcattcta ttagattaaa 
28441 aaaaaagaat acaatggaag ccaagtgatt aagctttcct tatgcttata ttaagttgta 28501 gcatatgcat ttaccgatag ttaaccgtat taacctacag aaaatgtcca gggaaatggt 28561 ctatttctta ttctattttt gacctaaaga aaatctttaa aatgtcttag cattttcccc 28621 agtctccatc cacttccctc agctttggcc tgaagctatc tttaaaggta ccctgtacaa 28681 gctcttgccc tgtacagcta gctacagaga ttcaatcctt tctgttcgat taggacacat 28741 ctcagtggca gataacatgc aaagttatta tatgtatgaa ccagaacttg tttttcctta 28801 ggggccagga tgttacacta aggtcttaag actatagtaa tatcttcact tgaaaaagcc 28861 ctctattatt cctatctcag atgataaaaa ttcaattaag agaaataaga acgtgacatg 28921 tgtaatcgca cctggctcta caaagctagt ctggacagac atttaaacaa ttatcctcta 28981 agattatttg atgaaatgca tttcaatgac tagttaacca ttaaaaacca aagtgagcat 29041 cccatctgtt cccagtcaaa tgacctagag caaaggacta ggcaaaccac atctgtgggc 29101 atagcaagct gtacatcaca aacaaatgaa tttgctttgt atatgagtga gagcaaacac 29161 tctttattgt acaacttggg tgggtaagta gggagaataa tggttttact gaaatcgcag 29221 gtaacggtta cgttggagtt aaaggttagg aagaaaacca aagggtaaga gctgttgttc 29281 tgggctggca ttgtcaatga agagcataaa ttcagatgtg aatgtatatt ttgtagaagc 29341 atgtgtgttg ttggtttttg tgtatgtgtg agtctgaaag agggaaaaca ggctcccatt 29401 agactatgac taacaaaaat gtttgacaga ttataactca gatgtcttac tcagagcata 29461 tgccttccca ttttccccat tattccccaa catgatgtct ttaagaactt gtccttgacc 29521 gagcagacat ctcatacccc aaatagctaa tattttgata gctatgatcc tgaacggcca 29581 aacattccaa aaccaagtag tttgtaatat ctttaatgca aatatatttt aggccttttc 29641 cttggcaagg atgtttggtc aggggttggc aaaaataatg ctcttcagac ttaaaagaac 29701 acaaccatat ttcttagcca tccaccagaa agtagtagaa cgctccagga agcaagtctt 29761 tgtcaggagt cagactagct acatcataat ctctctgccc aggggctgtg gatgtcatcc 29821 atcctggcct aactagccta ctgagctgag agatgtccaa tttcccccca atacactaac 29881 cagaggagaa ggaccgtgat atcattgcat gtgaattctt aattccaatt gcttaaacaa 29941 atatgttcag ttgtaactat caataccagt atataacagt gttggccaag ttttattgat 30001 gctgacaatc aattggagtt acagccagac acatggtctt atgaccggcg tacttacgca 30061 gggctttgca ctgagacagg tcgtgcatct gaggtttact 
gctttgcatt tttgttttgt 30121 aactgaagtc tgatgagaca gccagagcat gtgctaccta gggacttgaa tccctgcagc 30181 cccatttcac ttctcaccac cttccggggt ggtttctcga cctcccactc ccctaccacc 30241 tggtgcctta gccagccctg gctctccctc caaacacctg cccaatgagc actgccaccc 30301 catggtgccc agacatgctc tccctcctca tccctaccta gctaccattg ccactcccct 30361 cccccagcgg ggacatgggc ataggagcag ggagagttaa ggttggtcag gtgcacgtgc 30421 cctatgctat cttggaaggg ggcttggcca tgtggcatct ctggaccaag aatgcgccac 30481 agcacatttg gagggtgaat ggtgggggca caccccttgt ccacctctat ttcaggcatg 30541 gaacacatcc tggcatgaaa gttgcagtcc cttgggaatc acctctccac cttgattgcc 30601 acagtaggcc agtgacaagg gaagattgac acatcatccc tgctggggcc cagtgtcctg 30661 tggctggcag gcaggggatc ctaaggacat gtgggtctta aattgtaggg tgcacttcct 30721 gggcaccttt gagggtctgc actgccccag caaatatccc catgctagaa ggagcaaaat 30781 attaaatggc aaattttaaa aatgtaacaa gatgggttgc aaaagagact acagaggaaa 30841 gcaaaagttt tgtattttag tatcttccat ggcacttttc ttcctagctt ttgaacaagg 30901 ggccccacat ttttatttct cactgagccc cacaaagtat gtagccattc ctgcccggag 30961 tgaggacttt taaaacataa agattatcaa gtcttggaaa ttctgattca gtagatatat 31021 aacaggtctc aaacttaatt atgtaaagaa tattctggag agcttccttt tacccagtcc 31081 cacccaccaa atattctgat aaattaagct tcgattagcc cccagatctg cattttataa 31141 ggatccccag atgattctac tgcaattggt ccacagacca tgcctggacc gaatttgggt 31201 gcttaggagc acaaattctg gagccgggca gacttgagtt tgcttcctag ctttaccaac 31261 tgatctcagg ggagttaatg tttacctcta aactttagct catgcatcta taaataaata 31321 tattaatatc atgtcataag gatattatgt tgtattaaat gtctttaaaa caccacaatg 31381 attagcccaa agtaaacact caataaatgt tcaaaaattt aggaaaattg ttaagactgg 31441 gttgtatgca cactggtgtt tattatatta tgtagttttt tctgtatttt tacaacattt 31501 cagaattaaa agcaacagct agaaaaagag ggaaatggcc gggtgcagtc gtcacgcctg 31561 taatcccagc actttgggag gccaaggcgg gcggatcacg aggtcgagag atcgagacca 31621 tcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattaa ctgggcatgg 31681 tggcatgcgc ctgtagtccc aggagaattg cttgaacctg ggaggcggag gttgcagtga 31741 gccaagatct caccactgct 
ctccagcctg gtgacagggc aagactccgt caaaaaaaaa 31801 aaagagaggg agagccagag tatgaaaaag gaagtcagag ccctttaatg agtcagcttt 31861 gtaggtctcc aggtaggagg ctagtgcttc agtgtctagg acatagtagg tgttcagtaa 31921 attaaattca ggacaaaaag aacatgcccc aaggaccatc tgatatccac ttaaagtgat 31981 ggactacctc gtttcccttg tttatgaatg ggttcatgcc taagactgtg tgcactttaa 32041 tacaagggca gtcgttcaga actagtcagg tcctgaaaag gatttaccaa atgttgagtg 32101 tgccctctag tgttcacact tcccagcttt cttcctataa aggtggatca aggcacttgc 32161 ttacaactgg aactgaaatc ctccaagtcg atctagacat tgagatggag aaaatattca 32221 ttgtcgactg taattatgca acgaatatcc agttgagata atggacttgc ctcttatcta 32281 ataataccca ggctcaatgc gtcactgctt tgtccacttt gcccaaaatt caagcacagc 32341 taagttgata ttttaggaca aaggcagctt actatccagc cagaggggag tagaatatgg 32401 ttaagagaga gtggaaagaa tgaatgagcc ctgctattcc tcactgcctg gatggctata 32461 agcacagccc ttatggaggc cttaggtctt gcttcacaat attccagttt gaaaagggtt 32521 tgaaaagacc tcctagaaaa atcagtagtt tttctctttt gagtaacatg tagcaaaaaa 32581 aatttcatca tgtaggtaca gggaacaccc tagtaactat taatctcaag gagtcaagcc 32641 agtgtgtttc ctaatgtatc tgctgtatcc ccatgaagca aattttgcca tcagagaaac 32701 tgactcatgg ggaaaaaatc caaggacctc aaatcaccaa aagaagccat tcctcagatt 32761 tgcctaagct taagcttccc tgtctctcat tgtgtgttgc tttcaatgca gttacataaa 32821 tggctttttt gtttatgcac caaaaacact aattcatctg caaagctcac atttccagaa 32881 acattccatt tctgccagca cctagaagcc aatattttgc ctattcctgt aaccagcaca 32941 catatttatt tttttctaga tcaaatgtat tatgcagtaa gagtcttaat tttgttttca 33001 caggttgttt tgaatggtaa agttgatgca ttctgtggag gctctatcgt taatgaaaaa 33061 tggattgtaa ctgctgccca ctgtgttgaa actggtgtta aaattacagt tgtcgcaggt 33121 aaatacacag aaagaataat aatctgcagc accactagct ctttaatatg attggtacac 33181 catattttac taaggtctaa taaaattgtt gttgaataaa ttgggctaaa ggcagaaggg 33241 tcataatttc agaacccacg tcgcaccgtc ctccaagcat ccatagttct tttgatatac 33301 ccctattatc actcatttca gtgaggtaca attagttctt gatgtagcca tttccatacc 33361 agaaggcctt cccaaaaatc agtgtcatgt caccgatcct tttatctctg gtgcttggca 33421 
caacctgtag caggtcctca gaaaacaaac atttgaatta atggccaaat gagtttgtgc 33481 tcaaaaaagg ggtgaggata cttgaaattt ggaaaatcta ggataattca tgactagtgg 33541 attcattatc accaatgaaa ggcttataac agcatgagtg aacagaacca tctctatgat 33601 agtcctgaat ggctttttgg tctgaaaaat atgcattggc tctcattaca tttaaccaaa 33661 attatcacaa tataagaatg agatctttaa cattgccaat taggtcagtg gtcccaagta 33721 gtcacttaga aaatctgtgt atgtgaaata ctgtttgtga cttaaaatga aatttatttt 33781 taataggtga acataatatt gaggagacag aacatacaga gcaaaagcga aatgtgattc 33841 gaattattcc tcaccacaac tacaatgcag ctattaataa gtacaaccat gacattgccc 33901 ttctggaact ggacgaaccc ttagtgctaa acagctacgt tacacctatt tgcattgctg 33961 acaaggaata cacgaacatc ttcctcaaat ttggatctgg ctatgtaagt ggctggggaa 34021 gagtcttcca caaagggaga tcagctttag ttcttcagta ccttagagtt ccacttgttg 34081 accgagccac atgtcttcga tctacaaagt tcaccatcta taacaacatg ttctgtgctg 34141 gcttccatga aggaggtaga gattcatgtc aaggagatag tgggggaccc catgttactg 34201 aagtggaagg gaccagtttc ttaactggaa ttattagctg gggtgaagag tgtgcaatga 34261 aaggcaaata tggaatatat accaaggtat cccggtatgt caactggatt aaggaaaaaa 34321 caaagctcac ttaatgaaag atggatttcc aaggttaatt cattggaatt gaaaattaac 34381 agggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 34441 atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 34501 tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 34561 cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 34621 tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 34681 atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 34741 tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 34801 gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 34861 ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 34921 ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 34981 tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 35041 agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc 
tttcagagag 35101 ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 35161 agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 35221 taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 35281 caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 35341 ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 35401 agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 35461 gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 35521 taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 35581 ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 35641 tctatcttga atcttctaga gagttgctga ccaactgacg tatgtttccc tttgtgaatt 35701 aataaactgg tgttctggtt cataccttgg ctttttgtgg attccattga tgtgaatcag 35761 tcaccctgta tttgatgatg catgggacta ctgacaaaat cactctgacc ctgccaagct 35821 gctgccttct cctgccccaa cctcaccccc agccaggcct cactcttgct agttccttta 35881 gttcttttag tcaatatatt tttgtcttcg catataagta taaataaaca tatttttaaa 35941 tttcttggct gggcccagtg gctcacgcct ataatcccag cacttctgga ggccaaggtg 36001 ggcggatcac ctgaggttag gagtttcagg ccagcctggc caacatggtg aaaccctgtc 36061 tctactaaaa atagaacaat tagctgggct tggtaatgtg cacctataat cccagctact 36121 ggggaggctg aggcaggaga atcacttgag cctggggagc agggggtgcg ggaggttgca 36181 gtgagacaag atcgcaccag tgcactcccc atcctgggtg acagagtgag actctgtctc 36241 aaagaaaata aataaataaa tacatttctt gaggcgtttc ttgttaaatc attcatggag 36301 aggcatccca aacaccacat tcaacaaaac actctgaaaa atgttttcaa atgcaatata 36361 acacagcaga gatttgatgc tctgttatcc agttttcata taaggctgtg tgagctgtgt 36421 cccagagagg acagtggtct gaatccacct gagacagaat tgggtctaac taactgtgag 36481 tatggccttc aataagtcac tctccatttg ggaatttgat ttctccactt gtataatgag 36541 agtatttgac aggatgctct cccaaatccc ttgcaatttt gttagtctgt gatttcatgt 36601 ttttattttt attccttcat ccaacaaata gtcaaggagt aattgctgtg tgccaaatac 36661 caacagtatt cattaaattg taattcagat tttatatata tataaataat gtataatgtg 36721 tataaattgc tttgtgagtg cctactacac 
tgctagacag tagttgctca ataacttgtt 36781 agctgaatca gaatccatgt ttatcccaga gtagcaatta gtcttgcatc gagtatcgtg 36841 aaagaaggcc acacttaaat aagaataatg cctggggttt aggttttatg aaaaaatgaa 36901 aggaaattag ttctgctttt gttgactaaa ggaagggaag agagaagaga cactataatt 36961 gtctgcctca gatttaagga ggaggctaat tcatgcatta aacacgttac ttcaaatttg 37021 aatgaccaaa ggtctgtagc ctcagcactt caaaattggt aaaagtaaga cactctggcc 37081 ttgtttccat agagaccacc ccttacaaag gcaccaatgg gaaactggcc tcaggactcc 37141 tgttattggt cttctctgtg gcagagaaag gagctcttgg acccataaat ctctgagcca 37201 cagttctttt tgccatgggc tcaaaaatga ttgaattcat catgagccac ctgtggcata 37261 ttgccacact aaacatgtgg ggcctttaag ctcactaaga gccaatgtct tcagagccag 37321 ccctggcttg attctaccta gggcatttgc agttgccata taagaatcat tagtgctttc 37381 aaaattactg tagatacttt gcctaaatag actaaaacat gctgccgtca tattggaagt 37441 gacagattaa aatagaactc ttgccaagtg aaggaaagtg tgctaatata atgcagtcat 37501 tttaacttgc tgtttaagtg tgattgtttt tagttctttt gaatattatt tgttttatac 37561 tgacaggaac gaagtactgt ccaattttct ctgccaagga aaaaagaaaa ggtgttcttc 37621 cttacttacc tgaaccaaaa cagaccagtt tacaaaattg cctaattata attgctaaac 37681 aagttccgaa tgcttacagt ctaatccaag aatgtcagag ctgcaagggc ccttaaacac 37741 catccaatcc actccactca tttagcagat gaagagattg agggcaacat aaggccaggc 37801 ccaagataac acaatgacag ccaggactag agctcaagtc tcccaccctg cactttgaaa 37861 gaataatgct ttcaactgga gtacattaac tctactgtct atatttttag ggcagctggg 37921 gcattctgca ttggtggcaa tcctctcaac aaccctggga ctgaaaactg cctggaattc 37981 ttactaacaa ttctctaatt gaccaaaagg tgacgaaatc aaggagacca ataaggtagc 38041 cttggaaagc aagagtggc // LOCUS HUMG0S19A 4102 bp DNA PRI 07-JAN-1991 DEFINITION Human homologue-1 of gene encoding alpha subunit of murine cytokine (MIP1/SCI), complete cds. ACCESSION M23178 M32337 NID g182846 KEYWORDS cytokine; macrophage inflammatory protein. SOURCE Human lymphocyte DNA, clone LG0S1907. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 4102) AUTHORS Blum,S., Forsdyke,R.E. and Forsdyke,D.R. TITLE Three human homologs of a murine gene encoding an inhibitor of stem cell proliferation JOURNAL DNA Cell Biol. 9, 589-602 (1990) MEDLINE 91103879 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Forsdyke, 30-JUN-1989 and 23-FEB-1990. The G0S19 genes are members of the 'small inducible' family of genes. The G0S19-1 product is homologous to the alpha subunit of the murine cytokine MIP1. FEATURES Location/Qualifiers source 1..4102 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1298..1304 /note="CK-2 element" misc_feature 1326..1335 /note="serum-response element (put.); putative" misc_feature 1481..1488 /note="AP1-binding element (put.); putative" misc_feature 1710..1717 /note="AP1-binding element (put.); putative" TATA_signal 1967..1972 prim_transcript 1998..3880 /note="G0S19-1 mRNA and introns" CDS join(2081..2153,2842..2956,3377..3467) /note="G0S19-1 peptide precursor" /codon_start=1 /db_xref="PID:g182847" /translation="MQVSTAALAVLLCTMALCNQFSASLAADTPTACCFSYTSRQIPQ NFIADYFETSSQCSKPGVIFLTKRSRQVCADPSEEWVQKYVSDLELSA" sig_peptide 2081..2140 /note="G0S19-1 peptide signal peptide (put.); putative" exon <2081..2153 /note="G0S19-1 peptide precursor" /number=1 mat_peptide 2141..2153 /note="G0S19-1 peptide" intron 2154..2841 /note="G0S19-1, intron A" mat_peptide 2842..2956 /note="G0S19-1 peptide" exon 2842..2956 /number=2 intron 2957..3376 /note="G0S19-1, intron B" exon 3377..>3467 /note="G0S19-1 peptide precursor" /number=3 mat_peptide 3377..3464 /note="G0S19-1 peptide" BASE COUNT 1143 a 945 c 903 g 1111 t ORIGIN EcoRI site. 
1 gaattccaaa ggcatggtcg cacttggctt ctgtcctctg ttattctcca gcatcaaatg 61 tatcaactct aacccctttg gggggaatac aaggcctgtc ctggtttggt cccaatttag 121 ctttatcatc catattcacc cccactgctc tgcagctcca ctgaagcacc ccctctttcc 181 tctgaaccca caatgtcaca ctcaggactc tgcctcagct gggcactcat ctatagatgc 241 ctaaatcccg ggcagttatc cagacacaac taaagttcca tcccttccat gaagccttcc 301 ccaaccctct ggtggaaggt cacttcttcc cctcgtggga ttctgagctt tcatttcttt 361 ttctactagg agtcctagca ctttcggcta aatgctacaa ttacctgttc atacactcta 421 cctgccccca cgagatcagg ggcatctcag aaacaaagat cattaaaacc aactaaatct 481 atttctcatt ataaaatgag gtatgctgat tgattgtgaa agaataaaat aacaaagtat 541 ggaaaagaaa aaaaagcata taatctggct gagaaggtag agacccttcc acaccactga 601 aattatgtat tgaaaagaat aagtaaaaaa ctgcttcaat ttggcatgat ttatgtaagt 661 atagtatagg atccttaaaa tggttcaaag aaatgggaaa tcaagacttc attttggcca 721 aaaccattga acagaaactt cagcatattt atcaataatt tctttcagat taaacaactg 781 acaacaacct atttttcaac cagtgatgtt ggaaatgttt ttttaaaaat tagtttataa 841 atttgtgggc tgaccaagaa ggtaataaag tctaactaag taaaatgaga aaaattcaga 901 aaaagaaaaa aataagaaaa taaatcaccc agggacctat cacacaaata taagaactat 961 tcattcttta aggcatgtat ttccaagcct ttgtattttt ttccatgctt agggttggca 1021 aggaatatat atatatttgt acaaatatat atgtgtatat gtacaaatac atgtatatat 1081 agtacaaata tatatatata tttgtacaat tcttcagact ttgtagaatt tgtataatgt 1141 cgtatcttgc tttttttaac cactgatgtt ataagcatat ttatgccact tcattcattt 1201 tagagactta ataataaatg atctagtgga taatttatca ttccctgatg gagaaaaatt 1261 tagctttgtt tattttagag ttataaacga tgctgggtca ggtatcttta tgtttgaaga 1321 tggctccata tttgggttgt ttccacagaa ctctttccta gaaatgcttt ttctaggtta 1381 atggctacag atatttctag gcacctgaca tattgacacc cacctctaaa gtatttttat 1441 gatccacaac tagcgtttaa cacagcgccc tagtcactac atgactaata aatagacaaa 1501 tgactgaaac atgacctcat gctttctatt cctccagctt tcattcagtt ctttgcctct 1561 gggaggagga agggttgtgc agccctccac agcatcagcc catcaaccct atccctgtgg 1621 ttatagcagc tgaggaagca gaattgcagc tctgtgggaa ggaatggggc tggagagttc 1681 atgcacagac cagttcttat 
gagaagggac tgactaagaa tagccttggg ttgacatata 1741 cccctcttca cactcacagg agaaaccatt tccctatgaa actataacaa gtcatgagtt 1801 gagagctgag agttagagaa tagctcaaag atgctattct tggatatcct gagcccctgt 1861 ggtcaccagg gaccctgagt tgtgcaactt agcatgacag catcactacg cttaaaaatt 1921 tccctcctca cccccagatt ccatttcccc atccgccagg gctgcctata aagaggagag 1981 ctggtttcag acttcagaag gacacgggca gcagacagtg gtcagtcctt tcttggctct 2041 gctgacactc gagcccacat tccgtcacct gctcagaatc atgcaggtct ccactgctgc 2101 ccttgctgtc ctcctctgca ccatggctct ctgcaaccag ttctctgcat cacgtgagtc 2161 tgagtttcgt tgtgggtatc accactctct ggccatggtt agaccacatc aatcttttct 2221 tgtggcctaa aagcccccaa gagaaaagag aacttcttaa agggctgcca aacatcttgg 2281 tctttctctt taagactttt atttttatct ctagaagggg tcttagcccc ctagtctcca 2341 ggtatgagaa tctaggcagg ggcaggggag ttacagtccc ttttacagat agaaaaacag 2401 ggttcgaaac gaatcagtta gcaagaggca gaatccaggg ctgcttactt cccagtgggg 2461 tatgttgttc actctccagc tcactctagg tctcccagga gctctgtccc ttggatgtct 2521 tatgagagat gtccaaggct tctcttgggt tggggtatga cttcttgaac cagacaaaat 2581 tccctgaaga gaactgagat aagagaacag tccgttcagg tatctggatc acacagagaa 2641 acagagaacc cactatgaag agtcaaggag aaagaaggat acagacagaa acaaagagac 2701 atttctcagc aaaaatgccc aaatgccttc cagtcacttg gtctgagcaa gcctgccttc 2761 ctcaactgct cggggatcag aagctgcctg gccttttctt ctgagctgtg actcgggctc 2821 attctcttcc tttctccaca gttgctgctg acacgccgac cgcctgctgc ttcagctaca 2881 cctcccggca gattccacag aatttcatag ctgactactt tgagacgagc agccagtgct 2941 ccaagcccgg tgtcatgtaa gtgccagtct tcctgctcac ctctatggag gtagggaggg 3001 tcagggttgg ggcagagaca ggccagaagg ctatcctgga aaggcccagc cttcaggagc 3061 ctatcgggga tacaggacgc agggctccga ggtgtgacct gacttggagc tggagtgagg 3121 catgtgttac agagtcagga agggctgccc cagcccagag gaaagggaca ggaagaagga 3181 ggcagcggga cactctgagg gccaccccta ctgagtcact gagagaagct ctctagacag 3241 agataggcag ggggcccctg aaagaggagc aagccctgag ctgcccagga cagagagcag 3301 aatggtgggg ccatggtggg cccaggattc ccctgctgga ttccccagtg cttaactctt 3361 cctcccttct ccacagcttc ctaaccaagc 
gaagccggca ggtctgtgct gaccccagtg 3421 aggagtgggt ccagaaatat gtcagcgacc tggagctgag tgcctgaggg gtccagaagc 3481 ttcgaggccc agcgacctcg gtgggcccag tggggaggag caggagcctg agccttggga 3541 acatgcgtgt gacctccaca gctacctctt ctatggactg gttgttgcca aacagccaca 3601 ctgtgggact cttcttaact taaattttaa tttatttata ctatttagtt tttgtaattt 3661 attttcgatt tcacagtgtg tttgtgattg tttgctctga gagttcccct gtcccctccc 3721 ccttccctca caccgcgtct ggtgacaacc gagtggctgt catcagcctg tgtaggcagt 3781 catggcacca aagccaccag actgacaaat gtgtatcgga tgcttttgtt cagggctgtg 3841 atcggcctgg ggaaataata aagatgctct tttaaaaggt aaaccagtat tgagtttggt 3901 tttgtttttc tggcaaatca aaatcactgg ttaagaggaa tcataggcaa agattaggaa 3961 gaggtgaaat ggagggaaat tgggagagat ggggagggct accacagagt tatccacttt 4021 acaacggaga cacagttctg gaacattgaa actacgaata tgttataact caaatcataa 4081 catgcatgct ctaggagaat tc // LOCUS HUMG0S24B 3889 bp DNA PRI 09-MAY-1997 DEFINITION Homo sapiens zinc finger transcriptional regulator (GOS24) gene, complete cds. ACCESSION M92844 NID g2072389 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3889) AUTHORS Blum,S., Forsdyke,R.E. and Forsdyke,D.R. TITLE Three human homologs of a murine gene encoding an inhibitor of stem cell proliferation JOURNAL DNA Cell Biol. 9 (8), 589-602 (1990) MEDLINE 91103879 REFERENCE 2 (sites) AUTHORS Taylor,G.A., Lai,W.S., Oakey,R.J., Seldin,M.F., Shows,T.B., Eddy,R.L. Jr. and Blackshear,P.J. TITLE The human TTP protein: sequence, alignment with related proteins, and chromosomal localization of the mouse and human genes JOURNAL Nucleic Acids Res. 19 (12), 3454 (1991) MEDLINE 91288233 REFERENCE 3 (bases 1 to 3889) AUTHORS Heximer,S.P. and Forsdyke,D.R. TITLE A human putative lymphocyte G0/G1 switch gene homologous to a rodent gene encoding a zinc-binding potential transcription factor JOURNAL DNA Cell Biol. 
12 (1), 73-88 (1993) MEDLINE 93135830 REFERENCE 4 (bases 1 to 3889) AUTHORS Heximer,S.P., Cristillo,A.D., Russel,L. and Forsdyke,D.R. TITLE RT-PCR analysis of RNA of the CCCH zinc finger protein-encoding gene G0S24 (TIS11/TTP/NUP475) in cultured human blood mononuclear cells JOURNAL Unpublished FEATURES Location/Qualifiers source 1..3889 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="19q13.1" exon 549..630 /gene="GOS24" /number=1 /evidence=experimental gene join(607..630,1441..2397) /gene="GOS24" CDS join(607..630,1441..2397) /gene="GOS24" /codon_start=1 /product="zinc finger transcriptional regulator" /db_xref="PID:g183445" /translation="MDLTAIYESLLSLSPDVPVPSDHGGTESSPGWGSSGPWSLSPSD SSPSGVTSRLPGRSTSLVEGRSCGWVPPPPGFAPLAPRLGPELSPSPTSPTATSTTPS RYKTELCRTFSESGRCRYGAKCQFAHGLGELRQANRHPKYKTELCHKFYLQGRCPYGS RCHFIHNPSEDLAAPGHPPVLRQSISFSGLPSGRRTSPPPPGLAGPSLSSSSFSPSSS PPPPGDLPLSPSAFSAAPGTPLARRDPTPVCCPSCRRATPISVWGPLGGLVRTPSVQS LGSDPDEYASSGSSLGGSDSPVFEAGVFAPPQPVAAPRRLPIFNRISVSE" exon 1441..3103 /gene="GOS24" /number=2 /evidence=experimental misc_feature 2978..2986 /note="TA-rich conserved element (TARCE)" misc_feature 3163..3168 /note="U-rich RNA polymerase II termination element" misc_feature 3198..3889 /note="CpG island" BASE COUNT 648 a 1304 c 1078 g 850 t 9 others ORIGIN chromosome 19q13.1. 
1 tcccaaccct cttctccctc tgaatctgtc tctgggactg tctctgtctc cccgtcttcc 61 ctcccttcct caccctgtct atctctctct gtatgtctct tggtgtgtgt gtctctctcg 121 atgtttttct ctctgcctgt ctgcctgtct gtacccctct gcgtctctcc ccgcccccat 181 ccgtctgtgt cgcacgcgca cccccatcgg gcttctgctc ttgtcaattg cccctggggc 241 cctgccccca cctccgcccc agtttccttc tacaagcctc agtctccagc tttgaaaact 301 gggcaggcgt ccccccatcc gcacccccac cccttcccca cgcattcccc gctcggtcac 361 ggctgtccac cggccaagct caggcgcgtc ctcccagggc cgggcggaag ggaaccagtc 421 cagggccagc caggctgccg ggggcgcgcg tccgggaagc gcccctcctg ccccgccccc 481 ggccccggcc ccggccccgc cccgtgcttg cagtttccta taagtagccg gctctcggtg 541 ccagcctgag cctgacttca gcgctcccac tctcggccga cacccctcat ggccaaccgt 601 tacaccatgg atctgactgc catctacgag gtgagtcccc gccgcacggc atccccggta 661 cctgcatgcc tgagtccgag tccccacctc tctagcgccg caaactccag cccgggacgc 721 ttgcctccct tctccaactg gggctcccta gcgccgcgcc ctccagcctg gggcccctgc 781 ctcccgctca gaccagcttg gtgatttgga ggtgaaaatg gaaccgcgac acccggctct 841 tcgctcaaac atgggtgggg cggcccatgc aagtggaaag tcggagaact tttctcagac 901 cgaggctgcc tggaggcgga agtggccccc atacctggct cacccctagt cgttgctgag 961 ggcgtggttt tgcgcggagg cgtctctggg gctgaagtct cagggtgggg ggatccgact 1021 tctgtctctc cagtccctga ccgtagagac agagaaccct aaaaccgaag caatccggac 1081 ttccaggtca actttgcccg gtttctccag ttgtgaaact gaatcccgac gcgtgggtca 1141 tatccgggga ggacaagaga acccaaaatt gggaaacagt ggtgcgccct gacttcgggg 1201 tccccctctt ggtccagccg gggaagccgg gattcctggg tccctcggga taaggcctcg 1261 gtggtgggta aactcagaac ctccaactct gggttcctgg catccggaac ccaggggttt 1321 ctgcgggcgg gtggggctca ggcggggagc ccacaaaccg gcctggcaag ctctagttcc 1381 ctgcagctgg ggtggggcgt gcctgcattt tcaggtgcct taaccgaccc atttccgcag 1441 agcctcctgt cgctgagccc tgacgtgccc gtgccatccg accatggagg gactgagtcc 1501 agcccaggct ggggctcctc gggaccctgg agcctgagcc cctccgactc cagcccgtct 1561 ggggtcacct cccgcctgcc tggccgctcc accagcctag tggagggccg cagctgtggc 1621 tgggtgcccc caccccctgg cttcgcaccg ctggctcccc gcctgggccc tgagctgtca 1681 ccctcaccca cttcgcccac 
tgcaacctcc accaccccct cgcgctacaa gactgagcta 1741 tgtcggacct tctcagagag tgggcgctgc cgctacgggg ccaagtgcca gtttgcccat 1801 ggcctgggcg agctgcgcca ggccaatcgc caccccaaat acaagacgga actctgtcac 1861 aagttctacc tccagggccg ctgcccctac ggctctcgct gccacttcat ccacaaccct 1921 agcgaagacc tggcggcccc gggccaccct cctgtgcttc gccagagcat cagcttctcc 1981 ggcctgccct ctggccgccg gacctcacca ccaccaccag gcctggccgg cccttccctg 2041 tcctccagct ccttctcgcc ctccagctcc ccaccaccac ctggggacct tccactgtca 2101 ccctctgcct tctctgctgc ccctggcacc cccctggctc gaagagaccc caccccagtc 2161 tgttgcccct cctgccgaag ggccactcct atcagcgtct gggggccctt gggtggcctg 2221 gttcggaccc cctctgtaca gtccctggga tccgaccctg atgaatatgc cagcagcggc 2281 agcagcctgg ggggctctga ctctcccgtc ttcgaggcgg gagtttttgc accaccccag 2341 cccgtggcag ccccccggcg actccccatc ttcaatcgca tctctgtttc tgagtgacaa 2401 agtgactgcc cggtcagatc agctggatct cagcggggag ccacgtctct tgcactgtgg 2461 tctctgcatg gaccccaggg ctgtggggac ttgggggaca gtaatcaagt aatccccttt 2521 tccagaatgc attaacccac tcccctgacc tcacgctggg gcaggtcccc aagtgtgcaa 2581 gctcagtatt catgatggtg ggggatggag tgtcttccga ggttcttggg ggaaaaaaaa 2641 ttgtagcata tttaagggag gcaatgaacc ctctccccca cctcttccct gcccaaatct 2701 gtctcctaga atcttatgtg ctgtgaataa taggccttca ctgcccctcc agtttttata 2761 gacctgaggt tccagtgtct cctggtaact ggaacctctc ctgaggggga atcctggtgc 2821 tcaaattacc ctccaaaagc aagtagccaa agccgttgcc aaaccccacc cataaatcaa 2881 tgggcccttt atttatgacg actttattta ttctaatatg attttatagt atttatatat 2941 attgggtcgt ctgcttccct tgtatttttc ttcctttttt tgtaatattg aaaacgacga 3001 tataattatt ataagtagac tataatatat ttagtaatat atattattac cttaaaagtc 3061 tatttttgtg ttttgggcat ttttaaataa acaatctgag tgtaagctgg gatcctggct 3121 tcttcgcggt ctagagacag gaatggagag ggagggggtg actttttgga agctgggtgc 3181 agttaactct tcctctccga gccccgcggc gcttacctgg caggccgtga cgtcaccggg 3241 cctggccgtt cacctgaaag ggtgggacca ggtgaggtca ccagatggga accggggagc 3301 cacttcccgg gcgtcggggg cgccgcgctc ccgtcctgct gggctccttg acccagcttc 3361 gggcgggtgc ggtcggggac gggatgtttc 
cgtccccacg gggccgcggg aggcgggagg 3421 ggccggctgg tgaggacgga atgtgctggt gcgcgcgccc agagcgaacg gtggcggtcg 3481 ctgtgcggtg cagccttggg tagcggaacc cctttcggga ctagaggttc ccggggggct 3541 tcgaaccttc tggatgttgg ggaagcgggt ttagggtcta agaggctagg atctcaacat 3601 ttgggagtca caggttgcat cccctagcgc tttagactct agacccctgg tggctcggag 3661 ttgcagattt ctggccaccc gggaccctgg agcccgggaa ttaccgggtc ttggnatttc 3721 cgaaccttgg aagtccgagg cttcgtacac cgaccagggt cgtcggnccc gcgaggccag 3781 ggcgtgtggg taggggncgc gcgtctagga ggggcccggn ggagncgcgt cttcagacca 3841 tncaaggtca gagtcgtcgg gaacaacccc ggngncccgc ntcccggaa // LOCUS HUMG0S8PP 7345 bp DNA PRI 15-AUG-1994 DEFINITION Human helix-loop-helix basic phosphoprotein (G0S8) gene, complete cds. ACCESSION L13391 NID g292036 KEYWORDS basic protein; basic-helix-loop-helix protein; helix-loop-helix protein; phosphoprotein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7345) AUTHORS Siderovski,D.P., Heximer,S.P. and Forsdyke,D.R. TITLE A human gene encoding a putative basic helix-loop-helix phosphoprotein whose mRNA increases rapidly in cycloheximide-treated blood mononuclear cells JOURNAL DNA Cell Biol. 
13 (2), 125-147 (1994) MEDLINE 94235158 FEATURES Location/Qualifiers source 1..7345 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" repeat_region <179..348 /note="J-type" /rpt_family="Alu" repeat_region 1825..2994 /standard_name="CpG island" repeat_region 2055..2083 /rpt_family="Alu" protein_bind 2167..2175 /bound_moiety="Zif268" CAAT_signal 2450..2454 protein_bind 2490..2499 /bound_moiety="Sp1" TATA_signal 2512..2516 exon 2540..2681 /number=1 gene join(2572..2681,3666..3767,3870..3931,4479..4645, 4898..5092) /gene="G0S8" CDS join(2572..2681,3666..3767,3870..3931,4479..4645, 4898..5092) /gene="G0S8" /note="G0/G1 switch regulatory gene # 8" /codon_start=1 /product="helix-loop-helix phosphoprotein" /db_xref="PID:g292037" /translation="MQSAMFLAVQHDCRPMDKSAGSGHKSEEKREKMKRTLLKDWKTR LSYFLQNSSTPGKPKTGKKSKQQAFIKPSPEEAQLWSEAFDELLASKYGLAAFRAFLK SEFCEENIEFWLACEDFKKTKSPQKLSSKARKIYTDFIEKEAPKEINIDFQTKTLIAQ NIQEATSGCFTTAQKRVYSLMENNSYPRFLESEFYQDLCKKPQITTEPHAT" intron 2682..3665 /number=1 exon 3666..3767 /gene="G0S8" /number=2 intron 3768..3869 /number=2 exon 3870..3931 /gene="G0S8" /number=3 intron 3932..4478 /number=3 exon 4479..4645 /gene="G0S8" /number=4 repeat_region 4558..4666 /rpt_family="LINE 1" intron 4646..4897 /number=4 exon 4898..5769 /number=5 repeat_region 6000..6040 /rpt_family="MER6" protein_bind 6047..6056 /standard_name="serum response element" /bound_moiety="serum" BASE COUNT 2171 a 1597 c 1479 g 2098 t ORIGIN 1 gatctggaag ggtttatgaa aaaagcaaag aattatatta aacacaaaaa catcaagaat 61 aatctatatt taaaccagtt gcttatgcag gtacatgtat attcactttc attaatatat 121 tgagatgatt aaacttcctt ggcttggaga tatctgaaaa gtgtaattaa ttattattat 181 tattttaaga gacagggtct cactctatca ctcaggttgg agtgcaatgg cacaatcata 241 gctcactgta acttgaatgc ttgaacctcg aacttctgag ctcaagcctt ccacctgcct 301 cagcctccca aagtgctggg atttccagca tgaaccacaa tacccagcct gaaaagttta 361 aatgttaatc agtttcctta tttttcctac gtattatcca gtgagcattt attaaatatc 421 tataatgagc ttaatcttct acaaatttgt gtatcttgtg 
tatttgtgta cctacaaatt 481 gaaacatctt gatatttaat agtacctaca tatttataac ctaattacgt ctgagttaat 541 cctcgtacga ttttttccat cttgaacagc tatgatttga acatttcctt ttgccacaca 601 tatgcaaatc agtaataaac ttccgtacag tactttagag cttagaaaag acctttcatg 661 taagagtcac ataatatcct atgaggctgg caagaaaata catgttactc ttattacaga 721 tgagaaactg ggactcaaag aggtgtgaag ctagtgtccc aacagttact tagcatttac 781 tgggtaagcc caggctctaa ctgggtctcc tgacagaagt ccagctctct ctctctgctg 841 cctctttgtc agaactcctg gcctgccagc gaatttaact gtctatgagt aaattctaac 901 atctaaggtt ctaagtataa aatatattga ggtttacagt ggtgagaaaa agcaagtgtc 961 tgcgtattac agacagctga taaatacgta ttgagtcatc agaaagtgaa aatagatgta 1021 ggactattat gcaaagcata ggagaggaag agcatatgct taagaaaaaa tggcattcct 1081 agagttaaca agcatgtttc aaagtatcta ttactgaaat ctacacgatt ttcctttcca 1141 caataaagtc cacaactgtt gtcttaaata ataactgcag gttttgccct ccaagtcaaa 1201 cttggagctc caactactgg cagctcaccg taagaatata ttaattattt ccaatttcct 1261 tttaagaagt cgaatacaaa gtagagagaa tatctgcagc ttggttgcaa agccgcttaa 1321 acatttcact ggctagttct ggtaatatta tccgaaatca actcattttg tggttgaggt 1381 aatctatttg tctttaccaa tcaagtcgtt ttaaaacctt caggagtaga gatgatactt 1441 gaagaattaa caaataaaac ttaatcaagg aaactgttaa ctcattctcc cccctacttt 1501 ttgccagtta ttaaatgttc tggtaaatcc taactcaaca agaatgactc atgaggaaat 1561 cctgactcct tgaactcact gattctggtg cattacagct atgaaatctc tatgtctgca 1621 gctttctctc ccaccataag gaaatctgtg atttctattt tggcaggaga tgtgaccaag 1681 gactcctgag gaatgaccct atagacagac ccagccacgt gcctagtagt gatcaaaaat 1741 gcaaacccaa ttaaagcact gtcttctgat aaagtcactt tctcttgttc aatccctaat 1801 gttcattctc aaagtgttcc aagacgcccc agcaagctgc agagcagatc ccaccctgcc 1861 cctgggcaat caacgtttca ggttactcct tccttaagtt aagtggcaga gagaaacggg 1921 cgagggtctc cccgttgcct cagttcacag accagggggt gtcgaatgag tcctacagca 1981 ggacagcaaa caagaaatga ggcgcggggt cagggaacgc gctcaaaaaa ggaagaaaaa 2041 tcccactctt cattcgaaat caggccactg cactccggcc tcgtggcggg cgacctcctc 2101 ctgccaggga atcgccgctc tggcctcggg ctggggagcc cgtcagggtg 
ggtgagggag 2161 cgccggcgcc cccgcccggc cgccgccaac ttcggctccc tccctccgtc gcaaagccct 2221 cgagcccgcc cggccgccac gcttcagcaa aaggtcgtgc gcagcggccc acactgaaga 2281 ctctccatct gctcccacac ctgccccagc tggccgctgc tatgtggccc gagtgcgcaa 2341 gaagccgggg ccgcagacgt cagcagcgcc ccggcttcga gacccttcgg cagcagccgt 2401 gactgccgcc ggcgggcgct gacccatccc cgtgccagtc tgcagccgac caatccgcgt 2461 cctcttgagg cggggccgga gccgcgaggc cccgccccca agccgaggcc tcataaatgc 2521 tgcgacgcac gcccagccgc aaacagccgg ggctccagcg ggagaacgat aatgcaaagt 2581 gctatgttct tggctgttca acacgactgc agacccatgg acaagagcgc aggcagtggc 2641 cacaagagcg aggagaagcg agaaaagatg aaacggaccc tgtgagtatg gctttcttcc 2701 ctctcccgcc accccctgcc ccacactgca agctgcaaac gcggtacttt cgggctcgcc 2761 tttgacgtta ggaaactagc ctgagcctat gcagggaaaa aaaatcgaaa aggtcaattt 2821 gttaagtaag gttaaatctg ggtgatgctc gggtacagtt taagaaccga gggagacagt 2881 tgatatgagg gcggtggttg atgcgctaag aaattgcggg ttggcttttt gtcctcctgc 2941 attcaaaatg acatcagaat cctgcggctg aagcgcgtcc ccagcattca tacgttgcat 3001 gatgagttct catcagctta cacagctact ggaaggtgat gctcttgctg gttctgaata 3061 tactcgttta aaatccattt ttgtttttta attatagagc agatctcacc cagtccgaat 3121 gtggaacaaa taattgttat gcagcgtctg cttaaaagaa gtgtcgtagg tggggaagga 3181 agagcgcagg ggaatcagtc acccacctct ttgtacagtc tctggcgtgg tccagaacct 3241 cctgctctaa agagagaagc gtgggccggc tccagacagt tccatgtctg tccttttcat 3301 taaagtgcaa aacgtctcgg aattgtaatt aaccttgcaa acaaactgat gccctttgtg 3361 agccagaaat agtgtctgcc ttttgaacta aattcattaa caattcttta aaatacccta 3421 gtgattatag gtagccctgc ccttagttgt aaaactagta gatacggtca gattaatgga 3481 cgaaactgct cagtacatga ggtttaaatg ttaggtggat aagacttatt tgaagagttc 3541 ttgctttgcc ttatgcggtt tgtctctagt tactgggtga ctttatttgg taaaaaagcg 3601 ttcagctgca gtagcatatt caagtgttgc tagttagtaa ttatcttttt aattttttgt 3661 tttagtttaa aagattggaa gacccgtttg agctacttct tacaaaattc ctctactcct 3721 gggaagccca aaaccggcaa aaaaagcaaa cagcaagctt tcatcaagta agttgagaat 3781 cctgtgcttg caaatatcaa tagttagctg ctgaactgaa aaggggaact ctgatgtgcg 
3841 taagctaaca tacagaacct ctcttgcagg ccttctcctg aggaagcaca gctgtggtca 3901 gaagcatttg acgagctgct agccagcaaa tgtaagttaa ctcttgagct tgagccattg 3961 ctaacatcgc aaaagcctgg aaaggctgcg tccacctaac aaaagagcag cttgcctgag 4021 ggggattaga ctgcagtcac tataggataa agcctgtttt tctttccttt atttcccagg 4081 gtttaacaaa taaatgcata tattctctcc aggtggggaa gaaaatcagc ctaaacaaat 4141 taaagtggca gttgcttata gttaaggttg agtcagtttt tccattgcat acaatgtttt 4201 caagaggctt agttcccaga gaatttatgg ctcccctata aatatttact ttgcattgac 4261 agcaaagtac ttatattttt gcagcagagt gccaacataa gccttttgcc tactggtatc 4321 ttagtcttta aaaagctaaa ttttgagaat tacaggtttg agcgaaatct agaaaattca 4381 tttagaaata attaaaatgt gagggatagg agaaacataa ggaatcattt ggtctctcag 4441 ttcttcacat tctgcatgct cttttctccc cccttcagat ggtcttgctg cattcagggc 4501 ttttttaaag tcggaattct gtgaagaaaa tattgaattc tggctggcct gtgaagactt 4561 caaaaaaacc aaatcacccc aaaagctgtc ctcaaaagca aggaaaatat atactgactt 4621 catagaaaag gaagctccaa aagaggtaag gaaacaagtt cctaatttca gcacaatctg 4681 gacatcttta gcacaaaagt gaaacagtaa tcaaggacaa agcgggctag gaggggtaaa 4741 aagtccctcc acgttgtagc tttcagttat gttaaagttc tcctgtgact tagctagtaa 4801 agctaatcac acataatttt tattttttgt tttcaaatac taaattttaa tctttaactc 4861 tgaataccaa ataaacaact tttttgtttt atttcagata aacatagatt ttcaaaccaa 4921 aactctgatt gcccagaata tacaagaagc tacaagtggc tgctttacaa ctgcccagaa 4981 aagggtatac agcttgatgg agaacaactc ttatcctcgt ttcttggagt cagaattcta 5041 ccaggacttg tgtaaaaagc cacaaatcac cacagagcct catgctacat gaaatgtaaa 5101 agggagccca gaaatggagg acatttcatt ctttttcctg aggggaagga ctgtgacctg 5161 ccataaagac tgaccttgaa ttcagcctgg gtgttcagga aacatcactc agaactattg 5221 attcaaagtt gggtagtgaa tcaggaagcc agtaactgac taggagaagc tggtatcaga 5281 acagcttccc tcactgtgta cagaacgcaa gaagggaata ggtggtctga acgtggtgtc 5341 tcactctgaa aagcaggaat gtaagatgat gaaagagaca atgtaatact gttggtccaa 5401 aagcatttaa aatcaataga tctgggatta tgtggcctta ggtagctggt tgtacatctt 5461 tccctaaatc gatccatgtt accacatagt agttttagtt taggattcag taacagtgaa 5521 
gtgtttacta tgtgcaaggg tattgaagtt cttatgacca cagatcatca gtactgttgt 5581 ctcatgtaat gctaaaactg aaatggtccg tgtttgcatt gttaaaaatg atgtgtgaaa 5641 tagaatgagt gctatggtgt tgaaaactgc agtgtccgtt atgagtgcca aaaatctgtc 5701 ttgaaggcag ctacactttg aagtggtctt tgaatacttt taataaattt attttgataa 5761 ataatattga acacttggag tctagatgtg attttatttt tgataactgg aaatgtggca 5821 gataaaggga aaaacatctt tatagtaaaa gacatttcag tttttgtgcc tgcccagtga 5881 gaattgacag tcttcagaac tgagaaatga tagttgcaga cagataattc tttttttaaa 5941 acctataagc attatgcaga cctaaggaag tcacatcctt agaaccactg catttatgct 6001 ttctagtctt aaaaaaaaaa tcaagcttct caaatgaaga aagaagccaa ataaggtcaa 6061 tgtatgcctg tctttattta ctttccattg gttccaagct cccccacaag taccagcatg 6121 ggcttccaag atacatacct gaaattgttc ccttccataa tgtatcaata gaaaatctca 6181 gaattttcct gattttcatg ccttcatttg gccaacaaag aatctgttac tttaggtaat 6241 gataaagtca aaagaagtgc gctttcgttt cttagactta aatccagaaa tgtgcttttt 6301 tccacttgag agaggaagcc cttgaatctg ttttcaaact ggctaattaa aaagcacccc 6361 ttcctcagca ggctgttttt ttcaatgtcc ttgattatgg tcttaataaa aaaggaggag 6421 gaaccctgat tttagagttg gaagaactga cgcaaagtaa tcatttctca tagctttgtg 6481 accttggtgt ccccattgtc ttctctgact cttcatttta gaccttcctg cctttctggt 6541 tttataaagt agtaagggac tttagaaatc ataaagcact aagaacattt tttttctgat 6601 ttgccagaag aaaaattgac ctccagtagt ttactgtttc cattgtgatt gcactaattt 6661 tcaccttagc cagagtttct ggcaccaaca gacacaagct ctattagaga gactgccttg 6721 tctactgtat cacagcagcc agaataatac ataataggtg ctccataaat attgattgaa 6781 taaatgaatc caaagagtca ttgaaagcaa atatttcttt ggagaataaa tgtgtcaggt 6841 ctagcataat ccttaatttg gaaagatttt gaccaacaga ggaactcagg agtcatgcca 6901 tatatcatcc tatttgtcac caagtcattt attttacttc aaacctctct acagcctact 6961 ctctgttacc actctgatca aagacaccac gatctgatgc ctagattact gcaatagtct 7021 catctcctct cacccactct tccccctctc ccacaccaat gtgttcctgc cttttaactc 7081 tagggatccc ttcaagacaa atctgatctt gtctccagtt gcttataaac acttgacagg 7141 ttttgcaaac aattaggata aagacagtat ctctaatatg gcctaacacc tgcatggctt 7201 gacccctcct 
tttgttccaa ccttgactct accacatttc tcccagctct ttctgctaag 7261 ccacactgtc ctactttcct agcaggtgcc atcgccactg gcctcacagg accttagcac 7321 ctaatagttc ctctgcatgg aattc // LOCUS HUMGAD45A 5378 bp DNA PRI 25-JAN-1994 DEFINITION Human gadd45 gene, complete cds. ACCESSION L24498 NID g403127 KEYWORDS . SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5378) AUTHORS Hollander,M.C., Alamo,I., Jackman,J., Wang,M.G., McBride,O.W. and Fornace,A.J. TITLE Analysis of the mammalian gadd45 gene and its response to DNA damage JOURNAL J. Biol. Chem. 268 (32), 24385-24393 (1993) MEDLINE 94043278 FEATURES Location/Qualifiers source 1..5378 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI38" /cell_type="fibroblast" /tissue_type="lung" /tissue_lib="Stratagene cat 944201" /map="1p31.1-31.2" exon 225..2595 misc_signal 2158..2165 /note="octamer transcription binding site" misc_signal 2185..2192 /note="octamer transcription binding site" 5'UTR 2257..2551 CDS join(2552..2595,3082..3183,3407..3644,4718..4831) /codon_start=1 /db_xref="PID:g403128" /translation="MTLEEFSAGEQKTERMDKVGDALEEVLSKALSQRTITVGVYEAA KLLNVDPDNVVLCLLAADEDDDRDVALQIHFTLIQAFCCENDINILRVSNPGRLAELL LLETDAGPAASEGAEQPPDLHCVLVTNPHSSQWKDPALSQLICFCRESRYMDQWVPVI NLPER" intron 2596..3081 exon 3082..3183 intron 3184..3406 exon 3407..3644 intron 3645..4717 misc_signal 3832..3851 /note="p53 binding site" misc_signal 3881..3887 /note="AP1 binding site" exon 4718..>5378 3'UTR 4829..>5378 BASE COUNT 1381 a 1247 c 1375 g 1375 t ORIGIN 1 gtcgacttgg gtggggcact ttaggactgt ggttcatttg aattggtgta aacaatacac 61 cggttctact gtcctacagc ctccattcag atgactgaag tcatgggact ttcagcatag 121 ctagctgatg acagtgcata ctattttgtc ccaaaatcca gttcaagcat ggacatacca 181 ataagagcct aagctcttta aaggcaaagg accaggaatt gtacagttct tggtatagaa 241 gaagacaggc aaaagtgttt ttgaactaac gttaaatgtg caatatgtta gaattcatgc 301 aatgcacagg actgcaggat 
tctgatatct tatttaactc tcaaattcta ttcaactcaa 361 taaaccttga ctgtgcttct actaaatgca ggtattgtac taggagctga ggacaccaaa 421 ctgatgaagt ccttgctgtc aagaaactca catgattccc taattctttg tcagcttgct 481 gtgatcacat tttcttccca agaacctcta agaaatgcct agtggataga accttggagt 541 tccacggaac atattaacaa tcgccaaatg atgactcagg ctagattgtg taattcaggt 601 tttgtctgca aaactgaaaa tgcttcggta acctacctaa atttcaatgt tgaggaattc 661 tttaagaaag acatcaaatg ttaagattta aggcatagat atgagataca tagtcatgct 721 taggtgaatt atgcactgac catgaccatt tctttactca aatgttgtcc atggctgaca 781 acacagtgaa aaaatgagtg caaaatgaca actcaaataa atgaaccaga aaacctatca 841 cttttctttt ccaccaaatt aagatcaaga gagctggaga atattttgtc tagagtgata 901 aaaacataag ggtgcaaaac ttccaggtac ctttgcagaa attacttctg tgacctttgg 961 ctgtacagca accttaataa tgcaagcact gttttgaatg caagcatgtg ggagccattt 1021 tcaccacttt tgatgacttc agtaggttta agaaatgttt ttgcttttat tgcataaacc 1081 ataaaacaaa ggaagggact tttgaactac tcagtgagag tctatatatt aaagtttgtt 1141 tttcaaaaat gtgtaactac catttgcagt tttaaaggtc tgctttccac ctacaagttg 1201 ccattatctc aaaggtgaaa ttttagcata tgactaaaaa cttcctatag ttacagcttc 1261 atgattcagc atctaacatc aataattcac agtgagatca taggaggctc tctgtggaag 1321 gtaacgacat acatacgtta ggaaaggaag cttagggcat atcgagagca ttttgaattt 1381 agacttgtgg gctgtgtggg tgtcagatgg ttgtctctca gctggtgggc gtccagaagg 1441 atccttgttt gggcaaggct ctttgagaaa ggagaatctg ggttgccagg gattcccaca 1501 tgtggtcacc agctccccac gcagaccagc tcacgatttc ccagttacac cgggcaggtg 1561 ggaaaccgtt ctgctttctg tggaaaagat tctaacttgg ttccctgcca tccctgaata 1621 caaacgggtt ggtttttctt ttttcagctt ccaacccttg cagctttcca aaaataaatc 1681 aaaccagcca tcagggcacc gaaataatac tactgctaat aagcagcttc gcctagactt 1741 agataaacaa cacttctgag gtaaactttg ccccggaggt ctggagacac ttttttaatg 1801 taacctgctt actaataatt actagacttc agtgcattaa ccctggaaat agattttaat 1861 agccacccct taaaacaaaa gacatgaaaa gataataaga aaaaagtgcc gcaactatta 1921 tagaaaaaca cttggcagcc tgcttcagcc caagctgagg ccacctctag cctctgctaa 1981 agccccccac tcccaatggt ccccgccaac cggataagag 
tgcgcgcggg acccgccttc 2041 ccctctcggc accgcccccg cccccgcccc ctcggctcgc ctcccgcgtg gctcctccct 2101 tttccgctcc tctcaacctg actccaggag ctggggtcaa attgctggag caggctgatt 2161 tgcatagccc aatggccaag ctgcatgcaa atgaggcgga aggtggttgg ctgagggttg 2221 gcaggataac cccggagagc ggggcccttt gtcctccagt ggctggtagg cagtggctgg 2281 gaggcagcgg cccaattagt gtcgtgcggc ccgtggcgag gcgaggtccg gggagcgagc 2341 gagcaagcaa ggcgggaggg gtggccggag ctgcggcggc tggcacagga ggaggagccc 2401 gggcgggcga ggggcggccg gagagcgcca gggcctgagc tgccggagcg gcgcctgtga 2461 gtgagtgcag aaagcaggcg cccgcgcgct agccgtggca ggagcagccc gcacgccgcg 2521 ctctctccct gggcgacctg cagtttgcaa tatgactttg gaggaattct cggctggaga 2581 gcagaagacc gaaaggtgag tcggcctgcg gactcttccg gcccgaactt ctcttaccta 2641 ccccgcgctc cccggtgcag ccgggctgtg gaaggcttgc aggggaggaa gctaaaaagt 2701 ttgcacaggg caactcccgc ccttgctccc tcgggactct ccgtggagct cccacggact 2761 gaaagagcgt gccccccaac ccgaacgagc cccgccgggg cctttgcaaa gggcagcagt 2821 ggccgtcgct gcccgtgcgg ctcccgtggc tggcagcctg tggcaggggc actctcggga 2881 cttctcacgg gacgcccggt ccttgggcgt gcaggggtca tggggggtga cggggccgcg 2941 ggagcgccgg gttttcgtag agcccaggtg cgcggtggtg cttgcattcg agagggaggg 3001 gcgtggtacc ggacgagggg ggcggcgatg gccccgaggg caccggggct gacgggaccc 3061 ctcgcccttg cccgcgtgta ggatggataa ggtgggggat gccctggagg aagtgctcag 3121 caaagccctg agtcagcgca cgatcactgt cggggtgtac gaagcggcca agctgctcaa 3181 cgtgtaagtg gggcccttgc gcgtccccca tggcacccct tcccgcccca gcccgggagg 3241 tcgccttggc tgggcgcccc tcgcccggcc gcgccacttc ctgtcgcttt tctgcctgtc 3301 tcggaaggga gggggcgagc gggccgggcg gcgaccccca gggacccggg cagtggttga 3361 gggcgcccgc gcttctgcgc tcactggccc cgcccgctgc ccccagcgac cccgataacg 3421 tggtgttgtg cctgctggcg gcggacgagg acgacgacag agatgtggct ctgcagatcc 3481 acttcaccct gatccaggcg ttttgctgcg agaacgacat caacatcctg cgcgtcagca 3541 acccgggccg gctggcggag ctcctgctct tggagaccga cgctggcccc gcggcgagcg 3601 agggcgccga gcagcccccg gacctgcact gcgtgctggt gacggtaagg gactggggga 3661 ctgcagcctg cagggtagag ccccggaagg acgggagtca gggctgggtt 
gcctgattgt 3721 ggatctgtgg taggtggggg tcaggagggt ggctgccttt gtccgactag agtgtggctg 3781 gactttcagc cgagatgtgc tagtttcatc accaggattt tctgtggtac agaacatgtc 3841 taagcatgct ggggactgcc agcagcggaa gagatccctg tgagtcagca gtcagcccag 3901 ctactcccta cctacatctg cactgcctcc cgtgactaat tcctttagca gggcagatta 3961 gataaagcca aatgaattcc tggctcaccc ctcattaagg agtcagcttc attctctgcc 4021 agtcagagct aaaaatagaa attgtgtagg agacaaacct tgttaattcc ctagaaatac 4081 attaagagga tagagtggaa ttttttttct ctgcaatctt gcattttttt aatggctctt 4141 tttttttttc ctgataaaaa cctttggtag gtagggaagt tatgttttca ggggtaaatg 4201 tgctactttt gtcttctaaa ttttgctctt ttttgactgg tctagtcaag tgacagcccg 4261 attattttgc tactccttaa aagtactatt ctgtctcttg gagtatggtt gatggcaatt 4321 ccagttaact gctgtgcagc tctcatctca ttgtgcacac agcatggaaa tctttctcaa 4381 aactgtttca ctcaggtcag ggtaacaagt ttggtagagc aaaccggtga atgatactct 4441 catgcaaaac tgaacagata tgcaaacata tgtatgtggt tcagcttggg ttgcatgggt 4501 tcagactttg caatgtgtag tttaataggt aattaccctt aacgcttttg cagggaaccc 4561 aactaccttg aagaaacttt aatttttttg tgcttctaat ttgtctccat gtcacatagc 4621 caaaatatag aatgttcaag tgttttctcc tcaaaagtat aattactaga atatactggt 4681 ttttaaaata agtttatttt tataaatttg tttccagaat ccacattcat ctcaatggaa 4741 ggatcctgcc ttaagtcaac ttatttgttt ttgccgggaa agtcgctaca tggatcaatg 4801 ggttccagtg attaatctcc ctgaacggtg atggcatctg aatgaaaata actgaaccaa 4861 attgcactga agtttttgaa atacctttgt agttactcaa gcagttactc cctacactga 4921 tgcaaggatt acagaaactg atgccaaggg gctgagtgag ttcaactaca tgttctgggg 4981 gcccggagat agatgacttt gcagatggaa agaggtgaaa atgaagaagg aagctgtgtt 5041 gaaacagaaa aataagtcaa aaggaacaaa aattacaaag aaccatgcag gaaggaaaac 5101 tatgtattaa tttagaatgg ttgagttaca ttaaaataaa ccaaatatgt taaagtttaa 5161 gtgtgcagcc atagtttggg tatttttggt ttatatgccc tcaagtaaaa gaaaagccga 5221 aagggttaat catatttgaa aaccatattt tattgtattt tgatgagata ttaaattctc 5281 aaagttttat tataaattct actaagttat tttatgacat gaaaagttat ttatgctata 5341 aattttttga aacacaatac ctacaataaa ctggtatg // LOCUS HUMGALK1A 8095 
bp DNA PRI 10-APR-1997 DEFINITION Human galactokinase (GALK1) gene, complete cds. ACCESSION L76927 NID g1929894 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8095) AUTHORS Bergsma,D.J., Ai,Y., Skach,W.R., Nesburn,K., Anoia,E., Van Horn,S. and Stambolian,D. TITLE Fine structure of the human galactokinase GALK1 gene JOURNAL Genome Res. 6 (10), 980-985 (1996) MEDLINE 97064967 REFERENCE 2 (bases 1 to 8095) AUTHORS Ai,Y. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) Instititute of Human Gene Therapy, Stellar-Chance Laboratory, University of Pennsylvania, 422 Curie Blvd, Philadelphia, PA 19104, USA COMMENT GSDB:S:74871. FEATURES Location/Qualifiers source 1..8095 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(419..646,1533..1722,2182..2301,2472..2607,2735..2916, 7024..7174,7251..7413,7496..7686) /gene="GALK1" /product="galactokinase" gene 419..7686 /gene="GALK1" exon 419..646 /gene="GALK1" /number=1 CDS join(482..646,1533..1722,2182..2301,2472..2607,2735..2916, 7024..7174,7251..7413,7496..7567) /gene="GALK1" /codon_start=1 /product="galactokinase" /db_xref="PID:g1929895" /translation="MAALRQPQVAELLAEARRAFREEFGAEPELAVSAPGRVNLIGEH TDYNQGLVLPMALELMTVLVGSPRKDGLVSLLTTSEGADEPQRLQFPLPTAQRSLEPG TPRWANYVKGVIQYYPAAPLPGFSAVVVSSVPLGGGLSSSASLEVATYTFLQQLCPDS GTIAARAQVCQQAEHSFAGMPCGIMDQFISLMGQKGHALLIDCRSLETSLVPLSDPKL AVLITNSNVRHSLASSEYPVRRRQCEEVARALGKESLREVQLEELEAARDLVSKEGFR RARHVVGEIRRTAQAAAALRRGDYRAFGRLMVESHRSLRDDYEVSCPELDQLVEAALA VPGVYGSRMTGGGFGGCTVTLLEASAAPHAMRHIQEHYGGTATFYLSQAADGAKVLCL " intron 647..1532 /gene="GALK1" /number=1 exon 1533..1722 /gene="GALK1" /number=2 intron 1723..2181 /gene="GALK1" /number=2 exon 2182..2301 /gene="GALK1" /number=3 intron 2302..2471 /gene="GALK1" /number=3 exon 2472..2607 /gene="GALK1" /number=4 intron 2608..2734 /gene="GALK1" /number=4 exon 2735..2916 /gene="GALK1" /number=5 
intron 2917..7023 /gene="GALK1" /number=5 exon 7024..7174 /gene="GALK1" /number=6 intron 7175..7250 /gene="GALK1" /number=6 exon 7251..7413 /gene="GALK1" /number=6 intron 7414..7495 /gene="GALK1" /number=7 exon 7496..7686 /gene="GALK1" /number=8 BASE COUNT 1515 a 2450 c 2246 g 1884 t ORIGIN 1 ctgatgacct ctcacagctg ctggccctgg ggacctaggg gatccctttg ggaatcccag 61 gtcaaccggg tgccagctcc attgctctgg ctgtgtgggt ggtggtgggt ggtgaggcac 121 atagaaagtc accagcgcca gggaagggca gctcagaaat cctggtcatg ccaagtgcca 181 ctcagggcta ttctcactct agtgagtgct cattggttct tcccgaagtc cagagggaag 241 gacaccatcg ggggcgctgt atcccacccg gcaccattag cccctgccac ccaccaccca 301 gcagctggat tcccacggga gttgcgggtg ggggcggaac cggctgaggt ctgggggcgg 361 ggcgtccggg cgcggggcgg ggctggcggg aatgtgcgca cccccgcgcg gggctcctcc 421 cgagcatccc gcgccgacgg ggctgtgccg gagcagctgt gcagagctgc aggcgcgcgt 481 catggctgct ttgagacagc cccaggtcgc ggagctgctg gccgaggccc ggcgagcctt 541 ccgggaggag ttcggggccg agcccgagct ggccgtgtca gcgccgggcc gcgtcaacct 601 catcggggaa cacacggact acaaccaggg cctggtgctg cctatggtga ggggctgcac 661 ggggagcccc tagcccgccg ccgcctgtcc cggtcgccga ggagggcggg cctcggggac 721 gctgggggcg agttcttccc gcgggagatg tggggcgggc agctgcgcct ggagcaccgg 781 tgcacggaag agtccccggg acaggctgtt ccccacgttg gaagggagga agcgaagaag 841 tggtccccag agggtgcgcg gccgcctctt ggctcaagcc cgccctctgg gggctggggc 901 tcctcgcctt caacctggga gcatgttccc cttaaactgt gaggccctgt gtgccacgca 961 gaaggggaca ctccgcgcct ccggccaccg tggggcccca accgcagacc tgggcgaacg 1021 tagccttctg gcccagcccg ttcaatttac agaggaggaa actgaggcct agagaggccc 1081 agtgaactgc tggaggtcac acagcaggtt cttggcgggg ctgcgacttg ggagtgagga 1141 ctcccagctt tcagcggggg gcgctttccg ccccatctgc agcttgggga gtgcacaggt 1201 acaggatgtc cagagccacc caaaatgtaa aggctttgga gctccagtga tctgttttcc 1261 ctttgggcta agctctcccc ccttgcccca cagctcaggg cagagtccag gtctgtgctc 1321 cagctgcagc cgccccgccc ctgaagacct aagggggcag ggctcaagcc cccaaggtca 1381 gctggccctc aggatcttcc ctgcgacgct gaacctggag gttcagaacc tgatgactgt 1441 ggaggcatca 
gaacctcggc tggaggcagt gtcattggag aggcttactc cagctggcgg 1501 aagcctcacg tactgcttgt ctctcctgcc aggctctgga gctcatgacg gtgctggtgg 1561 gcagcccccg caaggatggg ctggtgtctc tcctcaccac ctctgagggt gccgatgagc 1621 cccagcggct gcagtttcca ctgcccacag cccagcgctc gctggagcct gggactcctc 1681 ggtgggccaa ctatgtcaag ggagtgattc agtactaccc aggtatgggg cccaggcctg 1741 agccaagtcc tcactgatac taggagtgcc acctcacagc cacagagccc attcatttgt 1801 ctgatacact gtggggaagg cttgtagagt ggagcatccc attgtacaga tgaggaaact 1861 gatgccccca gaaggtcggg aacttgccct gggtttcccg tgacctgatt ggaggagcca 1921 ggatttgaac cccagccttt tttccctcca gagccctaaa ccaggaggac aattagaagt 1981 gtcccagcaa cctcagaggg tgggaaaatg gaggggagtg ggtcccttgg gccagcaggt 2041 tggtggggtt cttgacaatt gagacacaca cctagaaaca gttgctaggc cgttgctgcc 2101 cttcccgcca ggacacctgc ccttcctgtc caatcctccc aggcagcctc tcttaccatc 2161 acctgttctt tccccctgca gctgcccccc tccctggctt cagtgcagtg gtggtcagct 2221 cagtgcccct ggggggtggc ctgtccagct cagcatcctt ggaagtggcc acgtacacct 2281 tcctccagca gctctgtcca ggtaccagct aggccccagc cctgacccag ccctccttcc 2341 ctgaggtctc caggtggtcc cagcttctac tatgccttat ggagggggtg gcagggaatc 2401 tccctggagt gtcattgaag ccactgctgc ttccaccagc cctagcctcc ccacctcacc 2461 ctgtactgca gactcgggca caatagctgc ccgcgcccag gtgtgtcagc aggccgagca 2521 cagcttcgca gggatgccct gtggcatcat ggaccagttc atctcactta tgggacagaa 2581 aggccacgcg ctgctcattg actgcaggtt gggctcgctc ccctcgtccc ctcccgccct 2641 gcactcagca gctcctgggt ggagtgtgcc cactgcctgg cgcagcaagc acacgcttgg 2701 cctcgtcatc tcccccattg taactccacc ccaggtcctt ggagaccagc ctggtgccac 2761 tctcggaccc caagctggcc gtgctcatca ccaactctaa tgtccgccac tccctggcct 2821 ccagcgagta ccctgtgcgg cggcgccaat gtgaagaagt ggcccgggcg ctgggcaagg 2881 aaagcctccg ggaggtacaa ctggaagagc tagagggtga gaactgccag ggtgctctat 2941 cctggaggcg gctgtgctcc ctgctggcgc ctcagtgtgg ccttgaccct gcctgggacc 3001 ccgatctcca ggggcttctg ccatgctctc cccagtccct tcaaacactg cgcacccagg 3061 gttccaatct cagcaggggt gcttgaaatc ctaaaatggt cttatctaat cagaaaaatc 3121 atgtttccat tgtggaaaat 
gtagaaaagt acaaagtaga aaataataag ctataagggc 3181 actacccaga gataggcact gctgacattt tcacgtttcc tttcagtatt tttccacatc 3241 tgtcttcaaa gctgagtata tgtaatatat catcactttc cccccccacc cccttttttt 3301 taagaggcag ggtctcattc tgttgcccaa gctggagtgt agtggtgtga tcatagctta 3361 ctgcaaactt gaactcttga gctcaaggga tcctcccagc tcagccttcc aagtagctga 3421 gattacaggt gtgccaccat gcccggctaa tttttatctt cgtaaagacg gccttgtagt 3481 gttgcccagg atgatcctga actctggcct caagaggtcc tcctgccttg ggctcccaaa 3541 gtgttgggat tataggcatg agccactgcg gccagcccat ttgccgtgtt ttttttttgg 3601 acacagagtt tcggtcttgt cacccatgct ggagtgcaat ggtgcgatct cagctcactg 3661 taacctctgc ctcccgggtt caagtgattc tcctgcctca gcctcccgag tagctgggac 3721 tacaggcgcc cgccactacg cctggcacat tttttatagt tctagtagag actggggttt 3781 caccatgttg gccaggctgg tctcaaacgc ctgacctcag gtgatcctcc cgcctcagcc 3841 ttccaaagtg ctgggattac aggcgtgagc catagtgccg gtctcttttt tttttttttt 3901 taaactaaac ataatctcag aacccagaac cctatcttat cttatgccat gaaaggcata 3961 tctcggcgtg gctctttttt tttttttttc tttttttttg ggcgaggtgg aggcttgccc 4021 tgttgcccag gctggagtgc agcggcgcaa tctcggttca ctgcatcctc cacctcctgg 4081 gtccaaatga tcctcctgcc ttagcttcct gagtaggtgg gattactgga acccaccacc 4141 acgcccagcc aatttttata tttttagtag agacggggtt tcatgttggc caggctggcc 4201 tcgaactcct gacctcgtga tctgcccgcc tcagcctccc aatgtgctag gattacatgt 4261 gtgagccact gcacctggcc tccgtgtggc tctttaaagc tccacaatat tttagcattc 4321 aggtgctctg tcatttactt aactattttc tgatacacct cacactgcga ttaactttcc 4381 ttatttatct tttttattat ttatttattt atttatttga gacagagtct tgctctgtca 4441 cccaggctgg agtgcagtgg cacgatctcg gctcactgca acctctgcct cccaggttca 4501 agtgattctc ctgcctcagc ctcctgagta gctaggatta gaggcatgtg ccaccacacc 4561 tggctaatct tcgtattttt agcagagatg aggttttacc atgttggtcg ggctggtcgt 4621 gaactcctga cctggtgatc tgcccacctc agcctcccaa agtactggga tgacaggcat 4681 gaaccactgt gcctggccat cttttttatt ttttaaagag atgggttctg ctaagttgcc 4741 caggctggac ctgaactctt gggctcaagt aatcttctca cctagtctcc tgggtagctg 4801 caaccaaagg cacccggttt atctgcattc 
tctttttttt ctttgagact gagtcttgct 4861 ctgtagccca ggctggagcg cagtggcgtg atctcggctc actgcaacct ccgtcttcag 4921 ggttcaagca attctcctgc ctcagcctct ggagtggctg ggactacagg cgtgtgccac 4981 cagagcgagt taattttttt ttttttttgt atttttagtg gacactgggt ttcactatat 5041 tggccaggct ggtcttggac tcctgacctc aagtgatccg cctgccttgg cctcccaaag 5101 tgctgggatt acaggcacag gcgtgagcca ctacacctgg cctatctgca ttctcttaat 5161 agtttcttag aaatggattc ttaggagtag gattacagag tcaagagaca caagttttgt 5221 aggctgggtg cggtggctca cgtctgtgcc tgtaatccca gtactttagg aggccaaggt 5281 gggcagattc attgagctca ggaattcgag accagcctgg gcaacatggc aaaaccccat 5341 ctctaaagaa atacaaaaat tagccaggtg tggtggtgtg tgcctgtagt cctagctact 5401 taggaggctg gggtgggagg atcaattgag cccaggaggt tgagactgca gtgagctgtg 5461 attgcaccat ggcactccag cctgggcctc aaagtgagat cctgtctcca aaacaaaaaa 5521 gatacaagta tccttaaggc tcctgctaca catggccagg aaggtagtct attggacagt 5581 tttaaggtca ttatcaatat tagctcattt aattccctcc aaaactctgt aaagcacatt 5641 ctgctaccat agttgtcata tttttgatgg gggaatctac agtgagaggc agtgctggga 5701 tctgaacccc atctggacag attagctcca gggcccatgc tcttgactgg ctggccgcgc 5761 tgcccacact gagttgttcc ttcctggcag ggtaggtgtg cctatctcag ggacactaga 5821 cagctccgag ggacctccct gtccttttcc tttgtgaact gtgtcacgtt ctccagagca 5881 gggctcagac ctgccctgcc tgctctgtgc agatgccctt ggccaaggtt ttcacactgg 5941 aacaagttgg tccctcctcc ccaccccagc ctgtccttgg ccctcctcca ggtctccttc 6001 tgcataggag cagctcaccc tgcctcctcc agagtcctgc cctagaagcg caatccctct 6061 ccttccatcc cctgcctggc tgcctggctc cttccctcag cctccaagac atgctcagtt 6121 ttcttccctc ctaaaacacc acccactgtc tcatttccat tcatttcttt ctttctttct 6181 ttcttttttt tttttgagag ggagcctcac tctgtcaccc aggctgaagt gcagtggcat 6241 gatctccact cactgcaacc tccgcctccc aggttcaagc aattctcctg cctcagcctc 6301 ctgagtagct gggattacag gcgcctgcca cgatgcccgg ctaacttttg tatttttagt 6361 agagacgggg tttcgccatg ttggccaggc tggtctcgag ctcctgacct caggcaatct 6421 gcctgcctca gcttcccaaa gtgctgggat tacaggtgtg agccaccgcg cccacccatt 6481 catttctcag tcctttgaat ctacttgccc ctccatcccg 
ccatgccacc taccctaaca 6541 accttccccc ttaaacctgc gggtttggcc gggcgcagta cactgagtca gtactggtac 6601 tgacccaggt acccctccag cctcagctcc agtcagatgg gacagcctgc tggtccctgg 6661 ctgcttctgc cccctcttct ggagccccag ccctggaggc tccatgtggc tcagcagaac 6721 ttcttctcct cctgctctgt ggtggcctct tgagggcagc actcaccttg gaaagcatgg 6781 agtgtttcaa ccctcactgc tccctgaagg accaaggtgt cccattttac agtcggggga 6841 ggaggcactg tgataaaggg gctcttcaga cccacgtctg agagagccag gctgcgccgc 6901 ccccgcggcc ttccaccctt caccgtccag ccagggccac tgccatcacc gcctgctggt 6961 cctcacaggc gtcggggccc caggcagtga gaaggcggct gctgactcct ctttcctccc 7021 cagctgccag ggacctggtg agcaaagagg gcttccggcg ggcccggcac gtggtggggg 7081 agattcggcg cacggcccag gcagcggccg ccctgagacg tggcgactac agagcctttg 7141 gccgcctcat ggtggagagc caccgctcac tcaggtgagg ccctctgggc gccccgctcc 7201 tgccgggcac aggccggccc aggcccaccc cttcaatatc ctctctgcag agacgactat 7261 gaggtgagct gcccagagct ggaccagctg gtggaggctg cgcttgctgt gcctggggtt 7321 tatggcagcc gcatgacggg cggtggcttc ggtggctgca cggtgacact gctggaggcc 7381 tccgctgctc cccacgccat gcggcacatc caggtgggcg ggcaccaggg cctgggcggg 7441 caggagcggc agcttcccgg ggccctgcca ctcaccccca gcccgcctct tacaggagca 7501 ctacggcggg actgccacct tctacctctc tcaagcagcc gatggagcca aggtgctgtg 7561 cttgtgaggc acccccagga cagcacacgg tgagggtgcg gggcctgcag gccagtccca 7621 cggctctgtg cccggtgcca tcttccatat ccgggtgctc aataaacttg tgcctccaat 7681 gtggtacctg cctcctctag aggtgggtgt atgcttgggt gtcagagaat gggggatgtc 7741 agaaccgctc ccctacccta ggggagcacc tctcaggccc cagaagaatg ggcaaggcag 7801 ggcctagcag tagcaaaacc atttattaag tgcagaacaa aggctgggtc cttgtgctgc 7861 tcccagctct ttggttacaa ataggtttgg gcccacagag gacggacctt gcccccttca 7921 tgcctcccag gagacaccta gcccctgctc tgtgcatgcg ggtgggctgg gcccccaggg 7981 gtgcaaggat ggagtagctg aggaggctcc gggagaggag tcgggaggac gcctagtggg 8041 acattgcggg ggtggcgcag ggtgcggtca agtttggaag aaactgttgg gtcca // LOCUS HUMGALT54X 621 bp DNA PRI 13-NOV-1995 DEFINITION Homo sapiens galactose-1-phosphate uridyl transferase (GALT) mutant Q54X gene, 
exons 1 and 2 (M96246 bases 303-924). ACCESSION L48714 NID g1066752 KEYWORDS galactose-1-phosphate uridyl transferase; mutation. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 621) AUTHORS Elsas,L.J. II. TITLE Human GALT gene mutations JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..621 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9p13" exon <1..103 /gene="GALT" /note="G00-119-971" /number=1 primer_bind complement(1..21) /gene="GALT" gene 1..621 /gene="GALT" CDS join(22..103,412..491) /gene="GALT" /note="mutation results in premature stop." /codon_start=1 /db_xref="GDB:G00-119-971" /product="galactose-1-phosphate uridyl transferase" /db_xref="PID:g1066753" /translation="MSRSGTDPQQRQQASEADAAAATFRANDHQHIRYNPLQDEWVLV SAHRMKRPW" intron 104..411 /gene="GALT" /note="G00-119-971" /number=1 exon 412..581 /gene="GALT" /note="G00-119-971" /number=2 mutation 489 /gene="GALT" /note="c (Gln) in wt/t (stop) in mutant; G00-119-971" intron 582..>621 /gene="GALT" /note="G00-119-971" /number=2 primer_bind 601..621 /gene="GALT" BASE COUNT 121 a 199 c 203 g 98 t ORIGIN 1 agcggatccc ccggtggcct catgtcgcgc agtggaaccg atcctcagca acgccagcag 61 gcgtcagagg cggacgccgc agcagcaacc ttccgggcaa acggtaactg caccgcggca 121 gggactcgct ggggcgcgga gccgagccct ccccttcctt aggaagcttt cgtccccctc 181 cgaaggttgg aacgctcatc ccgagccaga ccgacaaggc gtacagtctg caggcctcta 241 cgagcagcag gccaattggc gctgggaaag tccaatcctg ggcctctagc tcctgagcgg 301 gacagggccg agagggcgct cccgagcttg ggcctgctgg tgggtgagac ccaggagaga 361 gggagctaga ggggggagct ctgaggactg atcttgactg tctgccccca gaccatcagc 421 atatccgcta caacccgctg caggatgagt gggtgctggt gtcagctcac cgcatgaagc 481 ggccctggta gggtcaagtg gagccccagc ttctgaagac agtgccccgc catgaccctc 541 tcaaccctct gtgtcctggg gccatccgag ccaacggaga ggtaagcctg tagagccctg 601 catctgcagg ctgggccacg g // LOCUS HUMGAPDHG 5378 bp DNA PRI 
15-NOV-1994 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, complete cds. ACCESSION J04038 NID g182980 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5378; 1 to 5378) AUTHORS Ercolani,L., Florence,B., Denaro,M. and Alexander,M. TITLE Isolation and complete sequence of a functional human glyceraldehyde-3-phosphate dehydrogenase gene JOURNAL J. Biol. Chem. 263 (30), 15335-15341 (1988) MEDLINE 89008430 REFERENCE 2 (bases 650 to 1157) AUTHORS Nasrin,N., Ercolani,L., Denaro,M., Kong,X.F., Kang,I. and Alexander,M. TITLE An insulin response element in the glyceraldehyde-3-phosphate dehydrogenase gene binds a nuclear protein induced by insulin in cultured cells and by nutritional manipulations in vivo JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (14), 5273-5277 (1990) MEDLINE 90319094 FEATURES Location/Qualifiers source 1..5378 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p13" misc_signal 650..729 /note="insulin response element A [2]" misc_signal 729..867 /note="insulin response element B [2]" old_sequence 877..879 /citation=[1] CAAT_signal 884..892 misc_signal 1049..1071 /note="insulin response element A [2]" TATA_signal 1108..1113 prim_transcript 1138..4993 /note="GAPDH mRNA and introns" intron 1190..1429 /note="GAPDH intron A" misc_feature 1447..1456 /note="Kozak consensus sequence; putative" gene join(1453..1481,3116..3215,3306..3412,3542..3632, 3723..3838,3931..4012,4206..4618,4723..4792) /gene="GAPD" CDS join(1453..1481,3116..3215,3306..3412,3542..3632, 3723..3838,3931..4012,4206..4618,4723..4792) /gene="GAPD" /note="glyceraldehyde-3-phosphate dehydrogenase" /codon_start=1 /db_xref="GDB:G00-119-249" /db_xref="PID:g182981" /translation="MGKVKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMV YMFQYDSTHGKFHGTVKAENGKLVINGNPITIFQERDPSKIKWGDAGAEYVVESTGVF 
TTMEKAGAHLQGGAKRVIISAPSADAPMFVMGVNHEKYDNSLKIISNASCTTNCLAPL AKVIHDNFGIVEGLMTTVHAITATQKTVDGPSGKLWRDGRGALQNIIPASTGAAKAVG KVIPELNGKLTGMAFRVPTANVSVVDLTCRLEKPAKYDDIKKVVKQASEGPLKGILGY TEHQVVSSDFNSDTHSSTFDAGAGIALNDHFVKLISWYDNEFGYSNRVVDLMAHMASK E" exon <1453..1481 /gene="GAPD" /note="glyceraldehyde-3-phosphate dehydrogenase, (first expressed exon) (EC 1.2.2.12)" /number=2 intron 1482..3115 /note="GAPDH intron B" exon 3116..3215 /gene="GAPD" intron 3216..3305 /note="GAPDH intron C" exon 3306..3412 /gene="GAPD" /number=4 intron 3413..3541 /note="GAPDH intron D" exon 3542..3632 /gene="GAPD" /number=5 intron 3633..3722 /note="GAPDH intron E" exon 3723..3838 /gene="GAPD" /number=6 intron 3839..3930 /note="GAPDH intron F" exon 3931..4012 /gene="GAPD" /number=7 intron 4013..4205 /note="GAPDH intron G" exon 4206..4618 /gene="GAPD" /number=8 intron 4619..4722 /note="GAPDH intron H" exon 4723..>4792 /gene="GAPD" /note="glyceraldehyde-3-phosphate dehydrogenase" /number=9 BASE COUNT 994 a 1642 c 1627 g 1115 t ORIGIN 1 ggatcccctg ctgggagggg gcaggggacc tgttcccacc gtgtgcccaa gacctctttt 61 cccacttttt ccctcttctt gactcaccct gccctcaata tcccccggcg cagcagtgaa 121 agggagtccc tggctcctgg ctcgcctgca cgtcccaggg cggggaggga cttccgccct 181 cacgtcccgc tcttcgcccc aggctggatg gaatgaaagg cacactgtct ctctccctag 241 gcagcacagc ccacaggttt caggagtgcc tttgtgggag gcctctgggc ccccaccagc 301 catcctgtcc tccgcctggg gccccagccc ggagagagcc gctggtgcac acagggccgg 361 gattgtctgc cctaattatc aggtccaggc tacagggctg caggacatcg tgaccttccg 421 tgcagaaacc tccccctccc cctcaagccg cctcccgagc ctccttcctc tccaggcccc 481 cagtgcccag tgcccagtgc ccagcccagg cctcggtccc agagatgcca ggagccagga 541 gatggggagg gggaagtggg ggctgggaag gaaccacggg cccccgcccg agcccatggg 601 cccctcctag gcctttgcct gagcagaccg gtgtcactac cgcagagcct cgaggagaag 661 ttccccaact ttcccgcctc tcagcctttg aaagaaagaa aggggagggg gcaggccgcg 721 tgcagccgcg agcggtgctg ggctccggct ccaattcccc atctcagtcg ttcccaaagt 781 cctcctgttt catccaagcg tgtaagggtc cccgtccttg actccctagt gtcctgctgc 841 
ccacagtcca gtcctgggaa ccagcaccga tcacctccca tcgggccaat ctcagtccct 901 tccccctacg tcggggccca cacgctcggt gcgtgcccag ttgaaccagg cggctgcgga 961 aaaaaaaaag cggggagaaa gtagggcccg gctactagcg gttttacggg cgcacgtagc 1021 tcaggcctca agaccttggg ctgggactgg ctgagcctgg cgggaggcgg ggtccgagtc 1081 accgcctgcc gccgcgcccc cggtttctat aaattgagcc cgcagcctcc cgcttcgctc 1141 tctgctcctc ctgttcgaca gtcagccgca tcttcttttg cgtcgccagg tgaagacggg 1201 cggagagaaa cccgggaggc tagggacggc ctgaaggcgg caggggcggg cgcaggccgg 1261 atgtgttcgc gccgctgcgg ggtgggcccg ggcggcctcc gcattgcagg ggcgggcgga 1321 ggacgtgatg cggcgcgggc tgggcatgga ggcctggtgg gggaggggag gggaggcgtg 1381 tgtgtcggcc ggggccacta ggcgctcact gttctctccc tccgcgcagc cgagccacat 1441 cgctcagaca ccatggggaa ggtgaaggtc ggagtcaacg ggtgagttcg cgggtggctg 1501 gggggccctg ggctgcgacc gcccccgaac cgcgtctacg agccttgcgg gctccgggtc 1561 tttgcagtcg tatgggggca gggtagctgt tccccgcaag gagagctcaa ggtcagcgct 1621 cggacctggc ggagccccgc acccaggctg tggcgccctg tgcagctccg cccttgcggc 1681 gccatctgcc cggagcctcc ttcccctagt ccccagaaac aggaggtccc tactcccgcc 1741 cgagatcccg acccggaccc ctaggtgggg gacgctttct ttcctttcgc gctctgcggg 1801 gtcacgtgtc gcagaggagc ccctccccca cggcctccgg caccgcaggc cccgggatgc 1861 tagtgcgcag cgggtgcatc cctgtccgga tgctgcgcct gcggtagagc ggccgccatg 1921 ttgcaaccgg gaaggaaatg aatgggcagc cgttaggaaa gcctgccggt gactaaccct 1981 gcgctcctgc ctcgatgggt ggagtcgcgt gtggcgggga agtcaggtgg agcgaggcta 2041 gctggcccga tttctcctcc gggtgatgct tttcctagat tattctctgg taaatcaaag 2101 aagtgggttt atggaggtcc tcttgtgtcc cctccccgca gaggtgtggt ggctgtggca 2161 tggtgccaag ccgggagaag ctgagtcatg ggtagttgga aaaggacatt tccaccgcaa 2221 aatggcccct ctggtggtgg ccccttcctg cagcggctca cctcacggcc ccgcccttcc 2281 cctgccagcc tagcgttgac ccgaccccaa aggccaggct gtaaatgtca ccgggaggat 2341 tgggtgtctg ggcgcctcgg ggaacctgcc cttctcccca ttccgtcttc cggaaaccag 2401 atctccaccg caccctggtc tgaggtctga ggttaaatat agctgctgac ctttctgtag 2461 ctgggggcct gggctggggc tctctcccat cccttctccc cacacacatg cacttacctg 2521 tgctcccact 
cctgatttct ggaaaagagc taggaaggac aggcaacttg gcaaatcaaa 2581 gccctgggac tagggggtta aaatacagct tcccctcttc ccacccgccc cagtctctgt 2641 cccttttgta ggagggactt agagaagggg tgggcttgcc ctgtccagtt aatttctgac 2701 ctttactcct gccctttgag tttgatgatg ctgagtgtac aagcgttttc tccctaaagg 2761 gtgcagctga gctaggcagc agcaagcatt cctggggtgg catagtgggg tggtgaatac 2821 catgtacaaa gcttgtgccc agactgtggg tggcagtgcc cacatggccg cttctcctgg 2881 aagggcttcg tatgactggg ggtgttgggc agccctggag ccttcagttg cagccatgcc 2941 ttaagccagg ccagcctggc agggaagctc aagggagata aaattcaacc tcttgggccc 3001 tcctgggggt aaggagatgc tgcattcgcc ctcttaatgg ggaggtggcc tagggctgct 3061 cacatattct ggaggagcct cccctcctca tgccttcttg cctcttgtct cttagatttg 3121 gtcgtattgg gcgcctggtc accagggctg cttttaactc tggtaaagtg gatattgttg 3181 ccatcaatga ccccttcatt gacctcaact acatggtgag tgctacatgg tgagccccaa 3241 agctggtgtg ggaggagcca cctggctgat gggcagcccc ttcataccct cacgtattcc 3301 cccaggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcaagg 3361 ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagtgagtgg 3421 aagacagaat ggaagaaatg tgctttgggg aggcaactag gatggtgtgg ctcccttggg 3481 tatatggtaa ccttgtgtcc ctcaatatgg tcctgtcccc atctcccccc caccccggta 3541 ggcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc gtggagtcca 3601 ctggcgtctt caccaccatg gagaaggctg gggtgagtgc aggagggccc gcgggagggg 3661 aagctgactc agccctgcaa aggcaggacc cgggttcata actgtctgct tctctgctgt 3721 aggctcattt gcagggggga gccaaaaggg tcatcatctc tgccccctct gctgatgccc 3781 ccatgttcgt catgggtgtg aaccatgaga agtatgacaa cagcctcaag atcatcaggt 3841 gaggaaggca gggcccgtgg agaagcggcc agcctggcac cctatggaca cgctcccctg 3901 acttgcgccc cgctccctct ttctttgcag caatgcctcc tgcaccacca actgcttagc 3961 acccctggcc aaggtcatcc atgacaactt tggtatcgtg gaaggactca tggtatgaga 4021 gctggggaat gggactgagg ctcccacctt tctcatccaa gactggctcc tccctgctgg 4081 ggctgcgtgc aaccctgggg ttgggggttc tggggactgg ctttcccata atttcctttc 4141 aaggtgggga gggaggtaga ggggtgatgt ggggagtacg ctgcagggcc tcactccttt 4201 tgcagaccac agtccatgcc 
atcactgcca cccagaagac tgtggatggc ccctccggga 4261 aactgtggcg tgatggccgc ggggctctcc agaacatcat ccctgcctct actggcgctg 4321 ccaaggctgt gggcaaggtc atccctgagc tgaacgggaa gctcactggc atggccttcc 4381 gtgtccccac tgccaacgtg tcagtggtgg acctgacctg ccgtctagaa aaacctgcca 4441 aatatgatga catcaagaag gtggtgaagc aggcgtcgga gggccccctc aagggcatcc 4501 tgggctacac tgagcaccag gtggtctcct ctgacttcaa cagcgacacc cactcctcca 4561 cctttgacgc tggggctggc attgccctca acgaccactt tgtcaagctc atttcctggt 4621 atgtggctgg ggccagagac tggctcttaa aaagtgcagg gtctggcgcc ctctggtggc 4681 tggctcagaa aaagggccct gacaactctt ttcatcttct aggtatgaca acgaatttgg 4741 ctacagcaac agggtggtgg acctcatggc ccacatggcc tccaaggagt aagacccctg 4801 gaccaccagc cccagcaaga gcacaagagg aagagagaga ccctcactgc tggggagtcc 4861 ctgccacact cagtccccca ccacactgaa tctcccctcc tcacagttgc catgtagacc 4921 ccttgaagag gggaggggcc tagggagccg caccttgtca tgtaccatca ataaagtacc 4981 ctgtgctcaa ccagttactt gtcctgtctt attctagggt ctggggcaga ggggagggaa 5041 gctgggcttg tgtcaaggtg agacattctt gctggggagg gacctggtat gttctcctca 5101 gactgagggt agggcctcca aacagccttg cttgcttcga gaaccatttg cttcccgctc 5161 agacgtcttg agtgctacag gaagctggca ccactacttc agagaacaag gccttttcct 5221 ctcctcgctc cagtcctagg ctatctgctg ttggccaaac atggaagaag ctattctgtg 5281 ggcagcccca gggaggctga caggtggagg aagtcagggc tcgcactggg ctctgacgct 5341 gactggttag tggagctcag cctggagctg agctgcag // LOCUS HUMGARE 4754 bp DNA PRI 27-OCT-1993 DEFINITION Human gastrin receptor gene, complete cds. ACCESSION L10822 NID g406075 KEYWORDS gastrin receptor. SOURCE Homo sapiens (library: EMBL-3 SP6/T7) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4754) AUTHORS Song,I., Brown,D.R., Wiltshire,R.N., Gantz,I., Trent,J.M. and Yamada,T. TITLE The human gastrin/cholecystokinin type B receptor gene: alternative splice donor site in exon 4 generates two variant mRNAs JOURNAL Proc. Natl. Acad. Sci. 
U.S.A. 90 (19), 9085-9089 (1993) MEDLINE 94022320 FEATURES Location/Qualifiers source 1..4754 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="EMBL-3 SP6/T7" /map="11p15.4" repeat_region 225..272 CDS join(659..809,1987..2238,2403..2652,2960..3117,3325..3857) /codon_start=1 /product="gastrin receptor" /db_xref="PID:g406076" /translation="MELLKLNRSVQGTGPGPGASLCRPGAPLLNSSSVGNLSCEPPRI RGAGTRELELAIRITLYAVIFLMSVGGNMLIIVVLGLSRRLRTVTNAFLLSLAVSDLL LAVACMPFTLLPNLMGTFIFGTVICKAVSYLMGVSVSVSTLSLVAIALERYSAICRPL QARVWQTRSHAARVIVATWLLSGLLMVPYPVYTVVQPVGPRVLQCVHRWPSARVRQTW SVLLLLLLFFIPGVVMAVAYGLISRELYLGLRFDGDSDSDSQSRVRNQGGLPGAVHQN GRCRPETGAVGEDSDGCYVQLPRSRPALELTALTAPGPGSGSRPTQAKLLAKKRVVRM LLVIVVLFFLCWLPVYSANTWRAFDGPGAHRALSGAPISFIHLLSYASACVNPLVYCF MHRRFRQACLETCARCCPRPPRARPRALPDEDPPTPSIASLSRLSYTTISTLGPG" exon 659..809 /number=1 intron 810..1986 /number=1 exon 1987..2238 /number=2 intron 2239..2402 /number=2 exon 2403..2652 /number=3 intron 2653..2959 /number=3 exon 2960..3117 /number=4 intron 3118..3857 /number=4 exon 3325..3857 /number=5 polyA_signal 4416..4421 BASE COUNT 1065 a 1263 c 1332 g 1094 t ORIGIN 1 ctgcggcgac tgcaacacag tacggtaaat gcggctctag gctcagacta aagagtccgc 61 gcctcagcag gtgacgcacg cagacctggt ctccaacgcc acctccaagt tccctgatac 121 accagagaaa gatgctggat ccgtggcagc cgtctctacc caacccctcc taccctccct 181 tccccgaaac ctccccctac gctgccaata ctctcgtctt caatctctct ctctctctct 241 ctctctctct ctctctctct ctctctctct ctctttctct cattcctggg aatcgggggt 301 gggggcgggt gataaagaga ggtgggcagg agagaaatct ctaagaggag acgggacaca 361 tctagggagg gggcgagcct ggaggacagt cgcacaggga ggggcagaac tcccgagcca 421 gggagggggt gcacagtcac tggcgggaca gaggctggcg gagggacggg aacccaggcg 481 gggcgagccg cgggagagtg gagggcaggc gcctgggctg ggggcgggga ccaggcgggg 541 cagggggcag ggagaggagg gcggcgggag cctgagccgg aatcgcagcg tgagcaggtg 601 gagccgcggt gggagccgcc gggtcgagct gagtaaggcg gcgggctcgg cgggggccat 661 ggagctgcta aagctgaacc ggagcgtgca gggaaccgga cccgggccgg gggcttccct 721 gtgccgcccg 
ggggcgcctc tcctcaacag cagcagtgtg ggcaacctca gctgcgagcc 781 ccctcgcatt cgcggagccg ggacacgagg tgggtgcctc cctcagcccc ccccacaagc 841 tatttctcac tgtaccccag aacactaata gagtcccccc aaccgagctc caacagcacc 901 ttatacatcc ttcagcctca ctcctcatca catctgttta tgatctgagt ttacttcccc 961 actgtgagct ctcaaatggc agggatcttg tcatgttcat ctctttgtcc ccagtgccta 1021 gcaccaaaag ggagtcctct gtaaatgttt atagagtgag aatgaatgtt tggagagaat 1081 taaaatacaa ggcagaacta aaataggctg taatagttgt gtaagaaaca ggcatgggtc 1141 agagaaagag gagagtagaa gggccttcag aagaagtagc agtcaatttg agtcttgacc 1201 aacaagggat gaataggaat tcaccagata cagaaaataa gcttgtgatg aggggtagaa 1261 gttgtgagga gggttggtaa aagagtatcc ggtctgagaa accagcttga gcagggagag 1321 agtcacaagt gcaaatgatg cattcaagaa attgagtcat ccagtggctt gcttaaatgc 1381 ctgctgtaca gggagggggt gggagagaaa gaagtccctt ctgggaggat cttgatgata 1441 ttctgagggg ttgcagacag tagagagtaa gtattttttt tttttttttg cttttgtttt 1501 tagagaataa agacatgatc tgatgtgatt caaaaggtag tgggttgtca acagtgtaga 1561 atacctagcg gggtcaaata gagtaggcta agaaaggacc tttgaattta gcagttaaag 1621 attagtgtac ttgggaagaa gtatgcactg atagggcaca gaatattgta ttgtaatgga 1681 atgaagagta aatgagaaga cacaaatatc ctcttaagaa gtttgtctgc agaaagaagg 1741 aaagatttgg aacgaacctt gagaagagac aggctttctt aaggaagggc agagttgggc 1801 caatttaacc aaacctcaag gggttagaag gctaaaggat tcaagttcac catttactga 1861 gcagttcatc ggtggggaat gtaggcatct ggctgggtag tgagggttgg ggataagacg 1921 gaaggagggg gatttgactg aatgaaggct gaccccctac tgccacctct cccttttctt 1981 acccagaatt ggagctggcc attagaatca ctctttacgc agtgatcttc ctgatgagcg 2041 ttggaggaaa tatgctcatc atcgtggtcc tgggactgag ccgccgcctg aggactgtca 2101 ccaatgcctt cctcctctca ctggcagtca gcgacctcct gctggctgtg gcttgcatgc 2161 ccttcaccct cctgcccaat ctcatgggca cattcatctt tggcaccgtc atctgcaagg 2221 cggtttccta cctcatgggt gggtgagaca accacccacc cccacttgcc actctccccg 2281 cctaaagctc ttgtggatgt aggggtcttc cccgttggca gggaggggtg tgaggaagtc 2341 ccactgatgc ttgtgtagtg caagtttcct aggtgactcc ttccttctct tcccttgttt 2401 aggggtgtct gtgagtgtgt 
ccacgctaag cctcgtggcc atcgcactgg agcggtacag 2461 cgccatctgc cgaccactgc aggcacgagt gtggcagacg cgctcccacg cggctcgcgt 2521 gattgtagcc acgtggctgc tgtccggact actcatggtg ccctaccccg tgtacactgt 2581 cgtgcaacca gtggggcctc gtgtgctgca gtgcgtgcat cgctggccca gtgcgcgggt 2641 ccgccagacc tggtgagctt gcccataaac tatcctagga attcctttct cacccctatt 2701 agatgcttac gaccattgcc cagaatcttc ctccagcttc ccggagaatt accacgccaa 2761 ctcctattct gcatccacca ccctggagtt ccagtttggg gcccctcccc agttctctct 2821 cccttcccag cggcaccccc aaatcctact cctacttcag gtacctgcca cagccctaga 2881 aacactagtc cttggctttt tctccatctg tgattacagc tggacagaaa cccgagtgat 2941 ctgtctgtgt tgccttcagg tccgtactgc tgcttctgct cttgttcttc atcccgggtg 3001 tggttatggc cgtggcctac gggcttatct ctcgcgagct ctacttaggg cttcgctttg 3061 acggcgacag tgacagcgac agccaaagca gggtccgaaa ccaaggcggg ctgccaggtg 3121 gggctggacc acgtgagcaa aatctgggcg aggcggacgt ttggagggcg acggggcctg 3181 ctggagtggg tgggactgaa atgaaggtga gggtgagaag gaagctggaa atggagttga 3241 gctgggagcg gaggtcaggt ggggactggg ctggagactg gggggactcg cctttttctc 3301 tgaccgccca ccctttgtgc tcaggggctg ttcaccagaa cgggcgttgc cggcctgaga 3361 ctggcgcggt tggcgaagac agcgatggct gctacgtgca acttccacgt tcccggcctg 3421 ccctggagct gacggcgctg acggctcctg ggccgggatc cggctcccgg cccacccagg 3481 ccaagctgct ggctaagaag cgcgtggtgc gaatgttgct ggtgatcgtt gtgctttttt 3541 ttctgtgttg gttgccagtt tatagtgcca acacgtggcg cgcctttgat ggcccgggtg 3601 cacaccgagc actctcgggt gctcctatct ccttcattca cttgctgagc tacgcctcgg 3661 cctgtgtcaa ccccctggtc tactgcttca tgcaccgtcg ctttcgccag gcctgcctgg 3721 aaacttgcgc tcgctgctgc ccccggcctc cacgagctcg ccccagggct cttcccgatg 3781 aggaccctcc cactccctcc attgcttcgc tgtccaggct tagctacacc accatcagca 3841 cactgggccc tggctgagga gtagaggggc cgtgggggtt gaggcagggc aaatgacatg 3901 cactgaccct tccagacata gaaaacacaa accacaactg acacaggaaa ccaacaccca 3961 aagcatggac taaccccaac gcacaggaaa aggtagctta cctgacacaa gaggaataag 4021 aatggagcag tacatgggaa aggaggcatg cctctgatat gggactgagc ctggcccata 4081 gaaacatgac actgaccttg gagagacaca 
gcgtccctag cagtgaacta tttctacaca 4141 gtgggaactc tgacaagggc tgacctgcct ctcacacaca tagattaatg gcactgattg 4201 ttttagagac tatggagcct ggcacaggac tgactctggg atgctcctag tttgacctca 4261 cagtgaccct tcccaatcag cactgaaaat accatcaggc ctaatctcat acctctgacc 4321 aacaggctgt tctgcactga aaaggttctt catccctttc cagttaagga ccgtggccct 4381 gccctctcct tccttaccca aactgttcaa gaaataataa attgtttggc ttcctcctga 4441 acttcctttc tggattcttc tggggcaggt gaggaggcat aggattttgc agtatccttc 4501 tgtcaggtct ctcccactgt tgcacactgc tgtgcacaac cccttgccag aaagcccagc 4561 ctctgaggtt cagcgtggct gtgcatattc tttctgagtt ttcatcctct acctacattc 4621 caatgttgca ataactgtga atggggatgg aaaatggcat atccatgaca tatcatgatt 4681 cctgatcaca ttatcccaaa gtggtgccac tgttaaggta ctatattaat ggatgcttat 4741 taaattttga tgaa // LOCUS HUMGASTA 7739 bp DNA PRI 08-NOV-1994 DEFINITION Human gastrin gene, complete cds. ACCESSION M15958 NID g182990 KEYWORDS Alu repeat; gastrin; repeat region. SOURCE Human gastric antrum and gastrinoma DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7739) AUTHORS Kariya,Y., Kato,K., Hayashizaki,Y., Himeno,S., Tarui,S. and Matsubara,K. TITLE Expression of human gastrin gene in normal and gastrinoma tissues JOURNAL Gene 50 (1-3), 345-352 (1986) MEDLINE 87219893 COMMENT Draft entry and printed copy of sequence in [1] kindly provided by Y.Kariya, 08-APR-1987. The IVS B seems to end with bases ac and not with ag, as would be expected. Letter sent to the author to clarify, 14-MAR-1988. letter received from the author, 06-APR-1988. Base at position 6629 changed from c to g. 
FEATURES Location/Qualifiers source 1..7739 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q" repeat_region 377..707 /note="Alu-1 repeat" repeat_region 991..1326 /note="Alu-2 repeat" repeat_region 2074..2390 /note="Alu-3 repeat" prim_transcript 3183..6821 /note="gast mRNA (alt.)" prim_transcript 3186..6821 /note="gast mRNA (alt.)" prim_transcript 3188..6821 /note="gast mRNA (alt.)" intron 3243..6283 /note="gast mRNA intron A" repeat_region 3775..4071 /note="Alu-4 repeat" repeat_region 4251..4585 /note="Alu-5 repeat" repeat_region 5014..5328 /note="Alu-6 repeat" repeat_region 5343..5643 /note="Alu-7 repeat" repeat_region 5861..6184 /note="Alu-8 repeat" exon <6289..6499 /gene="GAS" /note="gastrin, (first expressed exon); G00-119-261" /number=2 gene 6289..6499 /gene="GAS" CDS join(6289..6499,6630..6724) /note="gastrin" /codon_start=1 /db_xref="PID:g182991" /translation="MQRLCVYVLIFALALAAFSEASWKPRSQQPDAPLGTGANRDLEL PWLEQQGPASHHRRQLGPQGPPHLVADPSKKQGPWLEEEEEAYGWMDFGRRSAEDEN" intron 6500..6629 /note="gast cds intron B" exon 6630..>6724 /note="gastrin" /number=3 repeat_region 7005..7349 /note="Alu-9 repeat" BASE COUNT 1964 a 2036 c 1844 g 1895 t ORIGIN 1 bp upstream of EcoRI site; chromosome 17q. 
1 gaattcatgc tagtcggtgt tagagccatg attcaaaccc tggcctcacc agttccaaaa 61 cccccctgta caccaggagg tccagcttcc atagactcta tccctgctgc ctctttcagt 121 ggcagttggg ctgaggtccc tctgagtcac agctggcagg agggaaaccc ttggattcac 181 agagtgatat gcttagggaa ttaccctcct ttaagccttg gggaaggaga actcccccaa 241 gtgtataaag cgtgtgcaca gaccttgtca cctcttccct tcactgcagc cctatgaggt 301 acaaatgaat gattcctatt tccaaatgtg gttaattgag acccagagag gatgggggag 361 aagacgaaaa tggattaaat acctgcgata ttctcttttg tttgtctgtt tgtttgagac 421 agtctcactg tgtcacccag gctggagtgc aatggcacaa tcttggctca ctccaacctc 481 cacttcctgg gttcaagcaa ttctcgtgcc tcagcctccc aagtagctgg gattacaggc 541 atgtgccacc acgcccagct aatttttgta tttttagtag aaacggggtt tcaccatgtt 601 gttcaggctg gtctcgaact cctgacctca ggtgatctgc ttgccttggc ctcccaaagt 661 gctgggatca cagcatgagc caccgcaccc ggccaaatat ctgcaatgtt ttgagcaccg 721 tgcagggcgc tttatgtaca tcaccttcac gagaccctcc tccccaagtt actttcctca 781 tttcctagag gagaaaacag ctcagagaag gagagaaata tgctcaaagc tcaacagctg 841 gaaaaagaca ggaccaggaa acccttcctt ttccatcaga cgaggtcccc tcatgaggag 901 aacgtgcagt ccgtgactgg ggagctcctc agggtcaagg atgctcatct ctgaatgcaa 961 cccttcccct gcagacctgg cagggtctgc aaatagtgac attctttttt tttttttttt 1021 tttttttttg agacagagtc tcgctctgtc accaggctgg agtgcaatgg catgatctca 1081 gctcactgca acctctctgt gcctcccagg ttcaagcgat tctcctgcct cagtctcccg 1141 agtagctggg attacagcgc ccgccaccat gcccagctaa tttttgtatt tttagtagag 1201 tcaaggtttc accatgttgg ccaggatggt ctcgatctct tgacttcgtg atccacctgc 1261 ctcagcctcc caaagtgctg ggattacagg agtgagccac caagcctggc caaatagtga 1321 cattctaacc attgcctatg gaataggaga catgcttcta tttgttgaaa ttgtcatccc 1381 tcatgtaacc agcctgggga agaacattca agggggcata accccctcca tgggcacaaa 1441 ctgagtcttg ggctctggta caaccagggt tctactgaaa acttatcatt tcccattagt 1501 gttaccatag caacacgaat gtcttcattc attcccctag tgtatgaaca gagctacact 1561 caccccacct gttctcccaa gccccctcaa agtcctttca tctctgagat tctgtgattc 1621 aattaattat caccaatgcc tgcactgctt tttgtcgttt tcatttattc catctgctga 1681 gtcaccgtgt cctgctctag 
atgagagtgg ttattcacaa cggttattca gccgtcagtt 1741 gccatggaca ccgcacagcc tgcttccttc atcacccatt aattctcagc aaggctggct 1801 gtaagcatgg gtaccactgc agttacccaa tacccaaggg gatcaaactc tcccagaaaa 1861 ggtgggggcc caatgacgca tgaacaggag caaaatctct gctctcaaca gtggaaaagt 1921 aaaaaacacc ctacggctca ttacatctgg tcaaaccatg tggaggcagc agtttatcaa 1981 ataagtcaac aaggtaccat tttgttttct ggtccatctt tggcaaatgc tttagtctgt 2041 ttgctaacca tttgaaggta tattttgata ctgaaaagtt gttggctggg catggtggct 2101 cacgtctgta atcccagaag tttggcagac taagctgggc cgatcatgag gtcaggagtt 2161 cgagaccagc ccgcccaaca cgatgaaacc ccatctctac taaaaataca aaaaattagc 2221 caggcgtggg ggcgtgcgct gtaatcccag ctactaggga ggttgaggca gaagaatcgc 2281 ttgaacccag gaggcggagg ttccagtgag ccgagatcac accactgcac tcctgcactc 2341 cagcctggtg acagaactag actttgtctc aaaaaaaaaa aaaagttgtt atgcagatta 2401 ctagctcccc ttgaaagcaa aatgaaggag gcttcaaagg caggaccctc caggaattgg 2461 ttctgcatag ggggtttgca agtggtggtg catgggcagc tgacctagaa gacacggggc 2521 agctacaagc caaacccagc ccagcacaga gggcctaagg agaagaaggc acaagggtgt 2581 tcatggtagg gggagggtga aaggacttca gaggtgacca ggatgtgagc cactagaagg 2641 agagaacagg gccatcagcc cacaacccat tgctcctggt aacacgttat gtttgcagag 2701 tccagtggag gggacgggga atcaatacat caccccacat tcacacgctt gccaaacatc 2761 agtggatggc gaaatccact ttccctgaat ctaagcccat cggttcactc cctcacctac 2821 caccacccac aaaggtccct atgtagttcg agagataagt gccgaggcct ctgaccccca 2881 ggatatggtg ggaagcattg ctcctgaccc agggagcttg gttcctgccc ctcaggtaca 2941 gctggagagc tgccgccacc ccgctccagc ccctcaccat gaaggtcaac tcccctatcc 3001 ttcccccaca tcctggatga ctgactgaca ctaaatgaaa gggcggggca gggtgatggg 3061 ctgtacctgt gccccacccc attcctctcg cctggactca tatggcaggg taggggcggg 3121 gtggggggac agttgggagg gaccttgagg gctttataag gcaggcctgg agcatcaagc 3181 agagcagaga cctgagaggc accaggccca gccgtggcac cacacacctc ccagctctgc 3241 aggtgagaaa acccaggagg agaggggaga ggctaggaag tgggttgaca ggtcctctcc 3301 cccatcaagg taccaggcca ctggccagag tctggggctc accccttggg gtctccagag 3361 ctgggaccct ttctttgatc ccaggatgga 
actaggtctt ggctccagta cctaccctgg 3421 tattcccaac cttgccttcc acctgccctt ctgccgcagc cggggcctgc ttagcctatc 3481 ccttcacact cttcccagcc ctgggaggtg aggaggactg gggttcttct cctttctctc 3541 agacaagaaa actgaggccc agtaggcaag atgtgacttg cccaaagtca cagggattat 3601 taggcaaggc tgggatagga atccacagct cctggatccc acttcaagcc ctttcctaga 3661 ccagtgccat ccaaaagaac gtcctatgat gcaggaaatg gtcttcatcc tgcactgttc 3721 aatatggtag ccactagcca cgtgtgcgct tgaaatgggg ttagtgtaag gaagtttttg 3781 ttgttgttgt tgtttgtttt tttatataga gtctcgccgt gtcacccagg ctggagtgca 3841 gtgacatgca acctctgcct cccaggttca agcaattctc ctgactcagc ctcccgagta 3901 gccgggatta cagcgcctgc cacaacaccc agctaatttt tcatattttt ggtagagaca 3961 gagtttcacc atgttggcca ggctggtcac aaactcctga cctcaagtga tccaaccccc 4021 tcagcctccc aaagtgctgg gattacaggc gtgagccact gcacctgacc ataatttttt 4081 aatttaatta atttaaatgt aaagagccac atgtggctgg tggctatagc actgaacaac 4141 acatgtggtt agcatctgac aatacagctc cagatcactg atttttcaaa catttcttca 4201 gcagcagaat tcttttcttc caaagaaaac ctaacagaaa acctgattat gaaaaactta 4261 atggccaggc accgtggctc atgcctgtga tcacaacact ttgggaggct gaggcaagca 4321 gattgcttga gctcaggagt tccagaccag cctgggcaac atagtgagac ctcatctcta 4381 ctaaaaataa aaacaaaaaa attagctggg catagtagtg cacacttgta gtcccaacta 4441 ctcaggaggc tgaggtggga ggatctcttg agcccaagag gtcgaggctg cagcgagtca 4501 ggatcacgcc agccactgca ttccagcctg ggagacagag caagattctg tctcacaaaa 4561 agaaaagaaa aaaagaaaaa cttaaaagca gtacagtatt ttattaatac aaatgtattc 4621 tgacaagaga aaatttttat cattttctta tgataaagcc aatcagaata gtgagaccaa 4681 aagcatacaa aaatagtgag tccagggaat agatgagcat ttgaatttta tgtttgtttg 4741 agattccagt ttagttctgt attatgtttt ccccaaactt gattcaccct gatacataat 4801 tgctaatggt gacagcttaa ttttttctta actgcttcat tataatcaca ggatattcct 4861 cagttaagca aaaatgacag gagaaccttt cctctcaggg ctgcacaaaa atcagtgtga 4921 gctggccaag cattgaatgt gctgcagtct ccaaagtgat aacatgcggt ctgtaacaca 4981 gctggatgtc catgttcctt tggtattact atttaaaagt ttcacttttt gtttttgttt 5041 ttttgagaca gggtctcact ctgtcgccca ggctggagtg 
cagtggtgca atctcggctc 5101 actgcaacct ccacctcccc ggttcaagcg atcctcccac ctcggcctcc caagtagctg 5161 ggaatacagg tgcatgccac catgcctggc taatttttgt attttttgta gagatggggt 5221 ttcagtatgt tgcccaggtt ggtctcaaac ttgtgggctc aagggatcca tcctcctcag 5281 cctcccaaag tgctgggatt accatgcctg gcctaaaaag ttcactttaa ataacaagtc 5341 ttgggctgag cacagcggct cacatctgta atcgcagcac tttgggaggc tgaggcggga 5401 ggattgcttg agcccaagag ttcaagacta gcttgggcaa catggtgaaa ccctgtttct 5461 acaaaaaata caaaaaaaat tagctggacg tggtggtgca agcctgtagt cccagctctt 5521 ctggaggcta ggtgggagga tcacctgagc caggaggtag aggctgcagt gagcaatgat 5581 tgcactactg cactgcgacc tgggcgacag agtgagaccc tgtctcaata aataaataaa 5641 ttacattatt taaacagcta cagaaccact aacatggcct caaattctgc agaacagaaa 5701 ctagggaaac cattgcccta ggatttgact tcctccaata gaagcagtgg ctgtaatcgg 5761 ggtcaggtaa agagaaaaaa ccacaagact tcggggctgg tgtggaactg gcagaaacct 5821 ggagaactga tctgaagaaa tagtccacaa ctccaccaca gaattgcaga ctcacccggg 5881 tgcggtggct catgcctgta atcccagcat ttgggaggcc gaggcggctg gatcacctga 5941 ggtcaggagt tcgagaccag cctggcaaca tggtgaaacc atctctacta aaaattacaa 6001 aaaattaccg acagcgatgt ggcactgcct atagtcccag atattctgga ggctgaggca 6061 ggagaatcac ttgaacccgg gaggcggagg ttatagtgag ccgagatccc accactgcac 6121 tccagcctag gcaacaagag tgaaactctg tctaaaaaaa aaaaaagaaa gaattgcaca 6181 ctcatcagca ggtagaggcc tagagccaca tggttcagtc cccgcctctg ggcctctgtg 6241 gggacagcct cacccttaag ctagtccctt ctcccctttg cagacgagat gcagcgacta 6301 tgtgtgtatg tgctgatctt tgcactggct ctggccgcct tctctgaagc ttcttggaag 6361 ccccgctccc agcagccaga tgcaccctta ggtacagggg ccaacaggga cctggagcta 6421 ccctggctgg agcagcaggg cccagcctct catcatcgaa ggcagctggg accccagggt 6481 cccccacacc tcgtggcagg taggagctgc tgactgccct gcttgcctca cttggccatg 6541 tttggccaag gtctccccag actggctctg acttcagttc ctagaaggta ggcatccttc 6601 ccccattctc gcctctctcc cctcctcaga cccgtccaag aagcagggac catggctgga 6661 ggaagaagaa gaagcctatg gatggatgga cttcggccgc cgcagtgctg aggatgagaa 6721 ctaacaatcc tagaaccaag cttcagagcc tagccacctc ccaccccacc 
tccagccctg 6781 tcccctgaaa aactgatcaa aaataaacta gtttccagtg gatcaatgga ctgtgtcagt 6841 gttgtagggc agaggagggg actcatctgg gggtgaagtt gtggcaggga gaagagctga 6901 gtgcctttag gggcagggac ctggctgatt cttcttggtc ccagagccca attgaacgag 6961 aatccacagg tatgggcagg ataatatatg gtagggttca tagccagagt aacctttttt 7021 tttaattttt attttatttt attttgagat ggagtttcgc tcttgtctcc caggctggag 7081 tgcaataatg agacctcagc tcactgcaac ctctgcctcc taggttcaag cgattttcct 7141 gcctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacgct cgctaatttt 7201 tttgtatttt tagtggggac ggggtttcac catgttggcc aaggctggtc ttgaactcct 7261 gacctcaggt gatcccaccc gcctcggcct cccaaagtgc tgggattaca ggcatgagcc 7321 accgtgccca gcctcagagt aagcttttta aagataaacc gtgtccacac tcggttttaa 7381 cgttgcaatg tttcccagtg cacttagaat acaacctgaa ctccctgcca cagccctgaa 7441 agctagccct gcggttctcc tcccactgtg cgccaagtgg cttccttctc tttgtcctct 7501 cggtcctttc ccttacacag acacgccatg ttgctctctt gctcagagcc tccgaccttg 7561 ccattcctgt gctgcaacgc tcttccccag gctcaaatat ggtgggctcc cctttcctcc 7621 ttagctcttg ggtgaaatgc ctcttcctga gagacctacc ctttctgctc ctcacccttc 7681 acacatagac ccagttattc tctaccacag ggtcaaataa aagaaaaaaa attaagctt // LOCUS HUMGCAPB 4053 bp DNA PRI 13-JAN-1995 DEFINITION Homo sapiens guanylate cyclase activating protein (GCAP) gene exons 1-4, complete cds. ACCESSION L36861 NID g623404 KEYWORDS calcium-binding protein; guanylate cyclase activating protein; membrane-associated protein. SOURCE Homo sapiens adult retina DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4053) AUTHORS Subbaraya,I., Ruiz,C.C., Helekar,B.S., Zhao,X., Gorczyca,W.A., Pettenati,M.J., Rao,P.N., Palczewski,K. and Baehr,W.B. TITLE Molecular characterization of human and mouse photoreceptor guanylate cyclase activating protein (GCAP) and chromosomal localization of the human gene JOURNAL J. Biol. Chem. 
269, 31080-31089 (1994) MEDLINE 95074147 FEATURES Location/Qualifiers source 1..4053 /organism="Homo sapiens" /note="vector: lambda FIXII" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /map="6p21.1" exon 1..612 /gene="GCAP" /number=1 gene join(412..612,2293..2442,2813..2906,3254..3414) /gene="GCAP" CDS join(412..612,2293..2442,2813..2906,3254..3414) /gene="GCAP" /codon_start=1 /function="calcium-binding protein" /product="guanylate cyclase activating protein" /db_xref="PID:g623405" /translation="MGNVMEGKSVEELSSTECHQWYKKFMTECPSGQLTLYEFRQFFG LKNLSPSASQYVEQMFETFDFNKDGYIDFMEYVAGLSLVLKGKVEQKLRWYFKLYDVD GNGCIDRDELLTIIQAIRAINPCSDTTMTAEEFTDTVFSKIDVNGDGELSLEEFIEGV QKDQMLLDTLTRSLDLTRIVRRLQNGEQDEEGADEAAEAAG" intron 613..2292 /gene="GCAP" /number=1 exon 2293..2442 /gene="GCAP" /number=2 intron 2443..2812 /gene="GCAP" /number=2 exon 2813..2906 /gene="GCAP" /number=3 intron 2907..3253 /gene="GCAP" /number=3 exon 3254..4053 /gene="GCAP" /number=4 polyA_signal 4036..4041 /gene="GCAP" BASE COUNT 845 a 1085 c 1065 g 951 t 107 others ORIGIN 1 ccgtgtggta agaaagggat tagaagattg caaactgggt taaaaaatcc tctcacctac 61 agctcaaggc tgtaatccta aggatctctg cttctctaag ccttgttcta ttttcaactc 121 ttttctccag ggtacagtct cccctggggc tgcaaggatt tagtggagac tcttaacacc 181 agttctctgg catctgtgag tttgagtgtg ggccatcatc ttcttccttc tctctctccc 241 tctccacatt tcccggtacc atctgatcca tcaggccctt ctttgcctag tcctgaaggt 301 actcaggcct gtgagagagg acggccccgt tgtcggccaa gacagctttg ggcgaggagc 361 agcgaaaagg gcctgtccat ctcagacgtc agccccctga aggcctgagc aatgggcaac 421 gtgatggagg gaaagtcagt ggaggagctg agcagcaccg agtgccacca gtggtacaag 481 aagttcatga ctgagtgccc ctctggccaa ctcaccctct atgagttccg ccagttcttc 541 ggcctcaaga acctgagccc gtcggccagc cagtacgtgg aacagatgtt tgagactttt 601 gacttcaaca aggtgagcag gggcccagtg gcagggaggg gaagtgctgg agggacccct 661 ctggaagcct gaccagctgg gggtgaggaa gacgagagag gacgatagaa tgtgccgctg 721 gggagcaact tcatttatac agtcatttgt ttatttgata ggtgggtttt tgacactagg 781 gtttcaacgg atctaatgtt tgtgctttaa 
tgagcaccaa ctgtgaatca ggcactgggg 841 ggttagaaat gtatcagaca ttgtccctgc cctcactttc tagggcaaga cagagaggcc 901 cacacattag tgtaatccga ggctgaatcc aatggcttcc tccacctctg ccacaggact 961 atcttggttt actccaaaac gagtgtaaac acttccaaga tggtcattga acacctgcga 1021 tgtgctggtc ctatacgtgg tgcttttgca aacacccttt gaagctccca aggatgtggg 1081 cagaggggca gagagcaggg agcaagatga tgattctcta gcctcactgt ctggcatggg 1141 caggccgctt ataatattct gcggacatga ggactggaga cagacagggt tgcctggtcc 1201 ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 taattttgta tttttagtag agatggggtt tcactatgct gccaggccgg tctcaaactc 1321 ttgacctcag gtgatccgcc cacctcggcc tcccatagtg ctggattaca ggcgtgagcc 1381 accacgctga gccaaccatg tcttaatgtg tgtcccatgt gagcaccacc tgtactgtac 1441 caaacctgac ccttgtcttt ccactgaaag taaatcaacc ccttgtttct aggcagtctg 1501 gcctcctccc ttattccaca cacacccccg ccttaggggt tttgtgctgc ctgggaccct 1561 ctctgccccc agaattctgc tcggcttaca ccttcctctg ctgcaagtct ctgcttagaa 1621 gtctcctcac taacccctgc cgtgtatctt ctggtttaaa ctctgtttaa aactgggact 1681 ggttcccacc tgtgcatttc ctagtcccac ctttccccgc aagcacctgt cgtcttccaa 1741 taccagattt aaataacatt tcaccattac gtttatcgcc tgtctccccc tgctggaatg 1801 tgagctccat gtgggcaggg atttttgtct tgttgcactg ttgaatccct ggttcctaga 1861 acagtgcctt ggattgtggc aggtgcctta ataatattca ttgtatgagt gaatgaatat 1921 atgagaaatc aggaatttan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1981 nnnnnnnnaa aaccagatgc ctgtgaacag acagcgccag ccattcacac cccagtggat 2041 gctggactta ttagatttga ttggcagcct ctggagtagg cagggtgggc tatacagggc 2101 gtctaggaag acagataggt cacggcggag acaggactgg ccccctcgct gcatctccga 2161 gggtccgggt ctcccctcag cgtctcttgg ggcttgccga gatgggcgga gccttgggtt 2221 atgatgggcg gggcctgagg ctggagtgag cggggcccgg atgggctcac ggcggccgcg 2281 cccctcgccc aggacggcta cattgatttc atggagtacg tggcaggcct cagcttggtc 2341 ctcaagggga aggtggaaca gaagctccgc tggtacttca agctctatga tgtagatggc 2401 aacggctgca ttgaccgcga tgagctgctc accatcatcc aggtgcagag ggcccggcac 2461 tggggggcag cggtctgggg tgggacccgg aactgagagc 
ccagggttag aacaacaatc 2521 tcaggattga gacaggatgt gggactgagg atcctggtgg cagctgtagg cctcaccagc 2581 tgcgtgaccc aggccacgtt ccttcccttc cttggcctca gtttcctcat caataccaaa 2641 agacgataga agtttgactc ttgagtccca gctctgccta ggcccccggg taccctgcac 2701 tctactcctc actgctcttg ggagccggac aagctctgac ccgtccccaa tcctacccct 2761 gagataggat aaggatgggc ccctctcact tctgcccctt cttccctccc aggccattcg 2821 cgccattaac ccctgcagcg ataccaccat gactgcagag gagttcaccg atacagtgtt 2881 ctccaagatt gacgtcaacg gggatggtga gggggccgag gaggggctcc ccagcggagg 2941 ggtcaccatg gatgtggggt caccaggggt ggaaggtcac taaaggagag ggtgaggaag 3001 ggaggagagg cccaaaggcc cccgtgctgg tcacttcctc cacctgcctc tgccccagcc 3061 acaaagttgg cttttagggg cccctggacc agaatctggg ctctgggttc ctctgcttgc 3121 tgcacccgca gcaggggctc tgacttctcc tcacgtgggc tctgtccctg cccctggcaa 3181 gaacccggtt ctgtgctctg gactgcagaa atgaacaccc tcctccccct gattcccttt 3241 ctctctaccc caggggaact ctccctggaa gagtttatag agggcgtcca gaaggaccag 3301 atgctcctgg acacactgac acgaagcctg gaccttaccc gcatcgtgcg caggctccag 3361 aatggcgagc aagacgagga gggggctgac gaggccgctg aggcagccgg ctgagtgcac 3421 cgcccggctg cttctgcact agcgggtggg gtggtatggt ggtgcctgtt ggtggtgttc 3481 ttgtcttaac cctagataga atctaatgaa ctcagaggct tagctcgcct ctttagggtc 3541 catggtggca gcagagaggc agaagtggga gtcagagcca ggaacagtga aggatggttc 3601 ctggcccctc tgagtgacag ctggtggcag cactccttgc tggggggcac tgttcaacat 3661 tcctctgccg tcggtgaccc ctagcccttc tgactccttc cagctttttc ccagctttcc 3721 cactgagctt ctccagtcat gctcttctga cgtgactctc tgagcagaac tgagctttcc 3781 aggcctctat ggaatcctgc agatccagtg gctgcagctt caatcccagt gctgcaatca 3841 cacatccatt ctgcctgggg accctggagc ctacttgtgc gctttgcatt tcattgattg 3901 acgcctccct tcaacaagca tttactgagg cgcctactat gtactaatgc tagatgttag 3961 atgtacaaag aagacagttt tcatcctcta ggaactcata ggctaatggt gagacacaca 4021 gacaaacatc attataataa aatatgctaa gag // LOCUS HUMGCB1 8850 bp DNA PRI 19-AUG-1995 DEFINITION Human glucocerebrosidase (GCB) gene, complete cds. 
ACCESSION J03059 NID g183009 KEYWORDS Alu repeat; glucocerebrosidase. SEGMENT 1 of 2 SOURCE Homo sapiens fetus liver DNA; and Homo sapiens skin DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8850) AUTHORS Horowitz,M., Wilder,S., Horowitz,Z., Reiner,O., Gelbart,T. and Beutler,E. TITLE The human glucocerebrosidase gene and pseudogene: structure and evolution JOURNAL Genomics 4 (1), 87-96 (1989) MEDLINE 89122038 REFERENCE 2 (sites) AUTHORS Beutler,E., West,C. and Gelbart,T. TITLE Polymorphisms in the human glucocerebrosidase gene JOURNAL Genomics 12 (4), 795-800 (1992) MEDLINE 92241881 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Gelbart, 24-OCT-1988. FEATURES Location/Qualifiers source 1..8850 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="skin" /map="1" source 1..7604 /organism="Homo sapiens" /dev_stage="fetus" /tissue_type="liver" variation 74 /note="9 in one copy; a in another" /replace="a" variation 151 /note="t in one copy; c in another" /replace="c" variation 262 /note="t in one copy; c in another" /replace="c" exon 1230..1485 /gene="GCB" /number=1 gene 1230..8850 /gene="GCB" CDS join(1459..1485,1854..1941,2494..2685,2809..2955, 3921..4054,4265..4437,4993..5230,6102..6326,6727..6890, 7260..7376,7471..7576) /gene="GCB" /note="precursor" /codon_start=1 /product="glucocerebrosidase" /db_xref="PID:g183012" /translation="MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCI PKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGT GLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRV PMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPW TSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPS AGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVV LTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVR LGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTF 
YKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVP LTIKDPAVGFLETISPGYSIHTYLWRRQ" sig_peptide join(1459..1485,1854..1941,2494..2495) /gene="GCB" intron 1486..1853 /gene="GCB" /number=1 exon 1854..1941 /gene="GCB" /number=2 intron 1942..2493 /gene="GCB" /number=2 repeat_region 2108..2389 /note="copy A" /rpt_family="Alu" exon 2494..2685 /gene="GCB" /number=3 mat_peptide join(2496..2685,2809..2955,3921..4054,4265..4437, 4993..5230,6102..6326,6727..6890,7260..7376,7471..7573) /gene="GCB" /note="glucocerebrosidase" intron 2686..2808 /gene="GCB" /number=3 exon 2809..2955 /gene="GCB" /number=4 intron 2956..3920 /gene="GCB" /number=4 variation 3002 /gene="GCB" /note="g in one copy; a in another" /replace="a" repeat_region complement(3022..3041) /note="copy B" /rpt_family="Alu" repeat_region 3354..3623 /note="copy C" /rpt_family="Alu" variation 3715 /gene="GCB" /note="g in one copy; c or a in another" /replace="c" exon 3921..4054 /gene="GCB" /number=5 intron 4055..4264 /gene="GCB" /number=5 variation 4179 /gene="GCB" /note="a in one copy; g in another" /replace="g" exon 4265..4437 /gene="GCB" /number=6 intron 4438..4992 /gene="GCB" /number=6 variation 4629 /gene="GCB" /note="g in one copy; a in another" /replace="a" repeat_region 4669..4857 /note="copy D" /rpt_family="Alu" variation 4736 /gene="GCB" /note="c in one copy; t in another" /replace="t" variation 4813 /gene="GCB" /note="a in one copy; g in another" /replace="g" repeat_region 4836..4941 /note="copy E" /rpt_family="Alu" exon 4993..5230 /gene="GCB" /number=7 intron 5231..6101 /gene="GCB" /number=7 repeat_region 5405..5687 /note="copy F" /rpt_family="Alu" variation 5527..5529 /gene="GCB" /note="caa in one copy; ca in another" /replace="ca" repeat_region 5806..5966 /note="copy G" /rpt_family="Alu" variation 6021 /gene="GCB" /note="a in one copy; c in another" /replace="c" exon 6102..6326 /gene="GCB" /number=8 intron 6327..6726 /gene="GCB" /number=8 exon 6727..6890 /gene="GCB" /number=9 polyA_signal 6731..6736 
/gene="GCB" /note="minor poly-adenylation signal" intron 6891..7259 /gene="GCB" /number=9 variation 7031 /gene="GCB" /note="a in one copy; g in another" /replace="g" polyA_signal 7207..7212 /gene="GCB" /note="major poly-adenylation signal" exon 7260..7376 /gene="GCB" /number=10 intron 7377..7470 /gene="GCB" /number=10 exon 7471..>8850 /gene="GCB" /number=12 BASE COUNT 1988 a 2426 c 2243 g 2193 t ORIGIN Chromosome 1. 1 tctagaaaga cttcactgag atcatttaaa gaacaaaaag gatggctggg gtccagcgca 61 gtggctcatg cctgtaatcc cagcactttc ggataccaag gcagcagatc acctgaggtc 121 cagagtttca gaccagcctg gccaacatag tgaaacccca tctctactaa aaataaaaaa 181 attagctgag catgttggag ggcacctgta atcccagcta cttgggaggc tgaggcagga 241 gaatcactcg aacccaggag gtggaggttg cagtgagcca agatcacgcc actgcactcc 301 agcctgggca acagagtgag actctgtctc aaaaaacaac aacaacaaaa aatacaaaca 361 agagacaagt agttcccagg tgcctaccaa gtggtcaggc actgcactta cctcactgac 421 tgcagtaacc accctttgag gttgtggcat tgcctccatt ttccaggcaa ggaaatgggc 481 tgagagctgg gattagtcag gtcatgactg tgtgtgccac tcccgctaaa tctcatttga 541 tgtggttcat gaggccacac catggacagc ttcctccttg tgtccactga ggatatggct 601 ttgtacaaca ctttggtttt ttgaacgact ttacaaacct ccctgtcttg tgaggaagga 661 agaacagtta ttaccatctg catctgatga tgaaacaagg gacgctgcag aggagccgca 721 ctgaccactc cctccctcca gtcctgtcat cccactgcca gtgtcccacc ctcttgtgcc 781 ctgcacttca ctggctaata acccccctca ctttttcctc tgtgaagcca tcctggataa 841 ttccccaccc acgaatggtc cctcctcatc tcagagagct ctccatgcac acctgttacc 901 gtttctgtct ttatctgtaa atatctgtgt gtctgacttc catgcctcac acacctctat 961 agggcaaaga ctgtcttaaa catcttggta gtgtcagtat tttgcacagt gaagtttttt 1021 tttttaaatt atatcagctt tatttgtacc tttttgacat ttctatcaaa aaagaagtgt 1081 gcctgctgtg gttcccatcc tctgggattt aggagcctct accccattct ccatgcaaat 1141 ctgtgttcta ggctcttcct aaagttgtca cccatacatg ccctccagag ttttataggg 1201 catataatcg taacagatga gaggaagcca attgcccttt agaaatatgg ctgtgattgc 1261 ctcacttcct gtgtcatgtg acgctcctag tcatcacatg acccatccac atcgggaagc 1321 cggaattact tgcagggcta acctagtgcc 
tatagctaag gcaggtacct gcatccttgt 1381 ttttgtttag tggatcctct atccttcaga gactctggaa cccctgtggt cttctcttca 1441 tctaatgacc ctgaggggat ggagttttca agtccttcca gagaggtaag agagagagct 1501 cccaatcagc attgtcacag tgcttctgga atcctggcac tggaatttaa tgaatgacag 1561 actctctttg aatccagggc catcatggct ctttgagcaa ggcacagatg gagggagggg 1621 tcgaagttga aatgggtggg aagagtggtg gggagcatcc tgatttgggg tgggcagaga 1681 gttgtcatca gaagggttgc agggagagct gcacccaggt ttctgtgggc cttgtcctaa 1741 tgaatgtggg agaccgggcc atgggcaccc aaaggcagct aagccctgcc caggagagta 1801 gttgaggggt ggagaggggc ttgcttttca gtcattcctc attctgtcct caggaatgtc 1861 ccaagccttt gagtagggta agcatcatgg ctggcagcct cacaggattg cttctacttc 1921 aggcagtgtc gtgggcatca ggtgagtgag tcaaggcagt ggggaggtag cacagagcct 1981 cccttctgcc tcatagtcct ttggtagcct tccagtaagc tggtggtaga cttttagtag 2041 gtgctcaata aatccttttg agtgactgag accaactttg gggtgaggat tttgtttttt 2101 ttcttttgaa acagagtctt actctgttgc ctgggctgga gtgcagtggt gcaattttgg 2161 ctcattccaa cctctgcctc ccagattcaa gcgattctct tgcttcagct tcccaggtag 2221 ctgggattac aggcggccac cactacgccc agctaatttt tgtattttta gtagagacgg 2281 ggtttcacca tgctggcaag gcaggtctca aactcctcac ctcaggtgat ccgcccacct 2341 cggcctccta aagtgctagg attacaggtg tgagcccctg cgcccggcca aggggtgagg 2401 aattttgaaa ccgtgttcag tctctcctag cagatgtgtc cattctccat gtcttcatca 2461 gacctcactc tgcttgtact ccctccctcc caggtgcccg cccctgcatc cctaaaagct 2521 tcggctacag ctcggtggtg tgtgtctgca atgccacata ctgtgactcc tttgaccccc 2581 cgacctttcc tgcccttggt accttcagcc gctatgagag tacacgcagt gggcgacgga 2641 tggagctgag tatggggccc atccaggcta atcacacggg cacaggtaac cattacaccc 2701 ctcaccccct gggccaggct gggtcctcct agaggtaaat ggtgtcagtg atcaccatgg 2761 agtttcccgc tgggtactga tacccttatt ccctgtggat gtcctcaggc ctgctactga 2821 ccctgcagcc agaacagaag ttccagaaag tgaagggatt tggaggggcc atgacagatg 2881 ctgctgctct caacatcctt gccctgtcac cccctgccca aaatttgcta cttaaatcgt 2941 acttctctga agaaggtgag gaggaagggg acaagatgac atagagccat tgaaactttt 3001 cgtttttctt ttcttttttt aaaatttttt tgaggcagaa 
tctcactctg cccattctgt 3061 cggcgagaca ggagtgcagt ggtgtgatct cccctcacag caacctctgc ctcccaggct 3121 atagtgattc tcctgcctca gcctcctgag tagctggaat tataggcgtg cgccactacc 3181 acctggctaa tttttgtatt tttagtagag acagggtttc atcatgttga ccaggctagt 3241 cttaaactcc tgacctcaaa tgatatacct gccttggcct cccgaagtgc tggaattaca 3301 agtgtgagcc accgagccca gcagacactt ttcttttttc tttttttttt tttgagacag 3361 agtctcgcac tgtcacccag gctggagtgc agtggcacaa tctcagctca ctgcaacctc 3421 cacctcccgg gttcaggtga ttctcctgtc tcagcctctc gagtacctgg gattacaggt 3481 gcctgccacc acgcccggct aattttttgt atttttagta gagacagggt ttcactatgt 3541 tggccaggat gattgcgaac tcctgacctc gtgatctgcc cacatcggcc tcccaaagtg 3601 ctgggattac atgcgtgagc cactgacact tttctttgcc ctttctttgg accctgactt 3661 ctgcccatcc ctgacatttg gttcctgttt taatgccctg tgaaataaga tttcgccgcc 3721 tatcatctgc taactgctac ggactcaggc tcagaaaggc ctgcgcttca cccaggtgcc 3781 agcctccaca ggttccaacc caggagccca agttcccttt ggccctgact cagacactat 3841 taggactggc aagtgataag cagagtccca tactctccta ttgactcgga ctaccatatc 3901 ttgatcatcc ttttctgtag gaatcggata taacatcatc cgggtaccca tggccagctg 3961 tgacttctcc atccgcacct acacctatgc agacacccct gatgatttcc agttgcacaa 4021 cttcagcctc ccagaggaag ataccaagct caaggtaggc attctagctt tttcaggccc 4081 tgagggccct gatgtctggg ggttgagaaa ctgtagggta ggtctgcttg tacagacatt 4141 ttgtcccctg ctgttttgtc ctgggggtgg gagggtggag gctaatggct gaaccggatg 4201 cactggttgg gctagtatgt gttccaactc tgggtgcttc tctcttcact acctttgtct 4261 ctagataccc ctgattcacc gagccctgca gttggcccag cgtcccgttt cactccttgc 4321 cagcccctgg acatcaccca cttggctcaa gaccaatgga gcggtgaatg ggaaggggtc 4381 actcaaggga cagcccggag acatctacca ccagacctgg gccagatact ttgtgaagta 4441 agggatcagc aaggatgtgg gatcaggact ggcctcccat ttagccatgc tgatctgtgt 4501 cccaaccctc aacctagttc cacttccaga tctgcctgtc ctcagctcac ctttctacct 4561 tctgggcctt tcagccttgg gcctgtcaat cttgcccact ccatcaggct tcctgttctc 4621 tcggtctggc ccactttctt tttatttttc ttcttttttt tttttttgag aaggagtctc 4681 tctctctgtc acccaggctg gagtgctgtg gcgccatctt cactcactgt 
aacctctgcc 4741 tcctgagttc aagcaattct cctgcctcag ccttccaagt agctgggatt ataggcgcct 4801 gccaccaggc ccagctgatt tttctatttt tagtagagac ggggtttcgc caggctgttc 4861 tcgaactcct gaactcaagt gatccacctg cctcggcttc ccaaagtgct gggattacag 4921 gtgtgagcca ccacacccag ctggtctggt ccactttctt ggccggatca ttcatgacct 4981 ttctcttgcc aggttcctgg atgcctatgc tgagcacaag ttacagttct gggcagtgac 5041 agctgaaaat gagccttctg ctgggctgtt gagtggatac cccttccagt gcctgggctt 5101 cacccctgaa catcagcgag acttcattgc ccgtgaccta ggtcctaccc tcgccaacag 5161 tactcaccac aatgtccgcc tactcatgct ggatgaccaa cgcttgctgc tgccccactg 5221 ggcaaaggtg gtaaggcctg gacctccatg gtgctccagt gaccttcaaa tccagcatcc 5281 aaatgactgg ctcccaaact tagagcgatt tctctaccca actatggatt cctagagcac 5341 cattcccctg gacctccagg gtgccatgga tcccacagtt gtcgcttgaa acctttctag 5401 gggctgggcg aggtggctca ctcatgcaaa cccagcactt tgggaagccg aggcgggtga 5461 tcacctgagg tcaggagttt aagaccaccc tggccaacgt gttgaaaccc tgtgtctact 5521 aaaatacaaa aaaaaaaaat tatctgggca tgatggtggg tgtctgtaat cccagctact 5581 caggaggctg agaagggaga atcagttgaa cccgggagat ggtggttgcg gtgagccgag 5641 atcgcgccac tgcactccag cctgggaggc tgagcgagac tccatctcga aacaaaacaa 5701 aacaaaacta tctaggctgg gggtggtggt tcatgtatgt atgtgtatat acatatatat 5761 gtgtttatat gtatatatat atacacacac acacatacat acacacacat acacacacaa 5821 attagctggg tgtggcaccc gtgtagtccc agctactcag gaggctaatg tgggaggatc 5881 agttgaccct aggaagtcaa ggctgcagtg agtcgtgatt gcgccactgt actccagccc 5941 gagtgacaga gtgacatcct gtctcaaaaa caaaaaaaaa tctccccaaa cctctctagt 6001 tgcattcttc ccgtcaccca actccaggat tcctacaaca ggaactagaa gttccagaag 6061 cctgtgtgca aggtccagga tcagttgctc ttcctttgca ggtactgaca gacccagaag 6121 cagctaaata tgttcatggc attgctgtac attggtacct ggactttctg gctccagcca 6181 aagccaccct aggggagaca caccgcctgt tccccaacac catgctcttt gcctcagagg 6241 cctgtgtggg ctccaagttc tgggagcaga gtgtgcggct aggctcctgg gatcgaggga 6301 tgcagtacag ccacagcatc atcacggtaa gccaccccag tctcccttcc tgcaaagcag 6361 acctcagacc tcttactagt ttcaccaaag actgacagaa gcccttcctg tccagctttc 
6421 cccagctagc ctgccctttt gagcaactct ggggaaccat gattccctat cttccctttc 6481 cttcacaggt ctgcacacct cattgcccct tttgcaacta ctgaggcact tgcagctgcc 6541 tcagacttct cagctcccct tgagatgcct ggatcttcac acccccaact ccttagctac 6601 taaggaatgt gcccctcaca gggctgacct acccacagct gcctctccca catgtgaccc 6661 ttacctacac tctctgggga cccccagtgt tgagcctttg tctctttgcc tttgtcctta 6721 ccctagaacc tcctgtacca tgtggtcggc tggaccgact ggaaccttgc cctgaacccc 6781 gaaggaggac ccaattgggt gcgtaacttt gtcgacagtc ccatcattgt agacatcacc 6841 aaggacacgt tttacaaaca gcccatgttc taccaccttg gccacttcag gtgagtggag 6901 ggcgggcacc cccattccat accaggccta tcatctccta catcggatgg cttacatcac 6961 tctacaccac gagggagcag gaaggtgttc agggtggaac ctcggaagag gcacacccat 7021 ccccttttgc accatggagg caggaagtga ctaggtagca acagaaaacc ccaatgcctg 7081 aggctggact gcgatgcaga aaagcagggt cagtgcccag cagcatggct ccaggcctag 7141 agagccaggg cagagcctct gcaggagtta tggggtgggt ccgtgggtgg gtgacttctt 7201 agatgagggt ttcatgggag gtaccccgag ggactctgac catctgttcc cacattcagc 7261 aagttcattc ctgagggctc ccagagagtg gggctggttg ccagtcagaa gaacgacctg 7321 gacgcagtgg cactgatgca tcccgatggc tctgctgttg tggtcgtgct aaaccggtga 7381 gggcaatggt gaggtctggg aagtgggctg aagacagcgt tgggggcctt ggcaggatca 7441 cactctcagc ttctcctccc tgctccctag ctcctctaag gatgtgcctc ttaccatcaa 7501 ggatcctgct gtgggcttcc tggagacaat ctcacctggc tactccattc acacctacct 7561 gtggcgtcgc cagtgatgga gcagatactc aaggaggcac tgggctcagc ctgggcatta 7621 aagggacaga gtcagctcac acgctgtctg tgactaaaga gggcacagca gggccagtgt 7681 gagcttacag cgacgtaagc ccaggggcaa tggtttgggt gactcacttt cccctctagg 7741 tggtgccagg ggctggaggc ccctagaaaa agatcagtaa gccccagtgt ccccccagcc 7801 cccatgctta tgtgaacatg cgctgtgtgc tgcttgcttt ggaaactggg cctgggtcca 7861 ggcctagggt gagctcactg tccgtacaaa cacaagatca gggctgaggg taaggaaaag 7921 aagagactag gaaagctggg cccaaaactg gagactgttt gtctttcctg gagatgcaga 7981 actgggcccg tggagcagca gtgtcagcat cagggcggaa gccttaaagc agcagcgggt 8041 gtgcccaggc acccagatga ttcctatggc accagccagg aaaaatggca gctcttaaag 8101 
gagaaaatgt ttgagcccag tcagtgtgag tggctttatt ctgggtggca gcaccccgtg 8161 tccggctgta ccaacaacga ggaggcacgg gggcctctgg aatgcatgag agtagaaaaa 8221 ccagtcttgg gagcgtgagg acaaatcatt cctcttcatc ctcctcagcc atgcccaggg 8281 tccgggtgcc tggggcccga gcaggcgttg cccgctggat ggagacaatg ccgctgagca 8341 aggcgtagcc caccatggct gccagtcctg ccagcacaga taggatctgg ttccggcgcc 8401 ggtatggctc ctcctcagtc tctgggcctg ctggtgtctg gcgttgcggt ggtacctcag 8461 ctgagggtca aggaaggaag gtgtgttagg agaactagtt cttggatccc tgcccactct 8521 ccccagggct gcccctccca tctgcccctt acctccatcc caggggaagt agagactgag 8581 aatgtgggta caataggcac agaggttgtg cagcccacgc aggtggacct gcagcttccc 8641 actgggcagc tttgcctgca gcagcagggc caagtagctg aagacgaagg cgtccaagga 8701 ggcagggctg gagcagagag agaagggtgg gatggaggag aaccactggg gtagaagggg 8761 taaagatgga gctggaggaa gagtcagcct tgggaggtgg gctctgggca gcaggcggcc 8821 accaggaagg acaggacaca cagttctaga // LOCUS HUMGFP40H 4379 bp DNA PRI 12-APR-1994 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 NID g183125 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Homo sapiens (library: lambda-H40.3a1.) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) MEDLINE 90257340 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES Location/Qualifiers source 1..4379 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CESS" /tissue_lib="lambda-H40.3a1." 
protein_bind 370..376 /bound_moiety="AP-2" protein_bind 624..629 /bound_moiety="IRF-1" protein_bind 627..633 /bound_moiety="AP-1" TATA_signal 742..747 exon <796..909 /note="P40 T-cell and mast cell growth factor precursor" /number=1 sig_peptide 796..849 /note="P40 T-cell and mast cell growth factor signal peptide" CDS join(796..909,1022..1057,1140..1172,2461..2592,4096..4215) /note="P40 T-cell and mast cell growth factor precursor" /codon_start=1 /db_xref="PID:g183126" /translation="MLLAMVLTSALLLCSVAGQGCPTLAGILDINFLINKMQEDPASK CHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNN KCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI" mat_peptide join(850..909,1022..1057,1140..1172,2461..2592,4096..4212) /note="P40 T-cell and mast cell growth factor" /product="growth factor" intron 910..1021 /note="hP40 intron A" exon 1022..1057 /number=2 intron 1058..1139 /note="hP40 intron B" exon 1140..1172 /number=3 intron 1173..2460 /note="hP40 intron C" exon 2461..2592 /number=4 intron 2593..4095 /note="hP40 intron D" repeat_region 2965..3265 /note="Alu repeat" exon 4096..>4215 /note="P40 T-cell and mast cell growth factor precursor" /number=5 polyA_signal 4354..4359 BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca 
acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg 
gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct 
acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS HUMGLUT4B 8402 bp DNA PRI 08-NOV-1994 DEFINITION Human glucose transporter (GLUT4) gene, complete cds. ACCESSION M91463 NID g183295 KEYWORDS Alu repeat; glucose transporter. SOURCE Homo sapiens Fetal Liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8402) AUTHORS Buse,J.B., Yasuda,K., Lay,T.P., Seo,T.S., Liu,M.L., Olson,A.L., Pessin,J.E., Moye-Rowley,S.W., Karam,J.H., Seino,S. and Bell,G.I. TITLE Expression and regulation of the human GLUT4/muscle-fat facilitative glucose transporter gene in transgenic mice JOURNAL J. Biol. Chem. 
267 (17), 11673-11676 (1992) MEDLINE 92291025 FEATURES Location/Qualifiers source 1..8402 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Fetal" /germline /tissue_type="Liver" /map="17p13" repeat_region 6..308 /rpt_family="Alu repeat 1" exon 2033..2257 /gene="GLUT4" /note="G00-119-997" /number=1 /label=exon_1 mRNA join(2033..2257,3535..3651,3763..3935,4028..4152, 4276..4379,4507..4669,4775..4962,5125..5229,5378..5479, 5995..6198,6716..8402) /gene="GLUT4" /note="G00-119-997" /product="glucose transporter" gene join(2033..2257,3535..3651,3763..3935,4028..4152, 4276..4379,4507..4669,4775..4962,5125..5229,5378..5479, 5995..6198,6716..8402) /gene="GLUT4" CDS join(2225..2257,3535..3651,3763..3935,4028..4152, 4276..4379,4507..4669,4775..4962,5125..5229,5378..5479, 5995..6198,6716..6919) /gene="GLUT4" /codon_start=1 /db_xref="GDB:G00-119-997" /product="glucose transporter" /db_xref="PID:g183296" /translation="MPSGFQQIGSEDGEPPQQRVTGTLVLAVFSAVLGSLQFGYNIGV INAPQKVIEQSYNETWLGRQGPEGPSSIPPGTLTTLWALSVAIFSVGGMISSFLIGII SQWLGRKRAMLVNNVLAVLGGSLMGLANAAASYEMLILGRFLIGAYSGLVPMYVGEIA PTHLRGALGTLNQLAIVIGILIAQVLGLESLLGTASLWPLLLGLTVLPALLQLVLLPF CPESPRYLYIIQNLEGPARKSLKRLTGWADVSGVLAELKDEKRKLERERPLSLLQLLG SRTHRQPLIIAVVLQLSQQLSGINAVFYYSTSIFETAGVGQPAYATIGAGVVNTVFTL VSVLLVERAGRRTLHLLGLAGMCGCAILMTVALLLLERVPAMSYVSIVAIFGFVAFFE IGPGPIPWFIVAELFSQGPRPAAMAVAGFSNWTSNFIIGMGFQYVAEAMGPYVFLLFA VLLLGFFIFTFLRVPETRGRTFDQISAAFHRTPSLLEQEVKPSTELEYLGPDEND" intron 2258..3534 /gene="GLUT4" /note="G00-119-997" /number=1 exon 3535..3651 /gene="GLUT4" /note="G00-119-997" /number=2 intron 3652..3762 /gene="GLUT4" /note="G00-119-997" /number=2 exon 3763..3935 /gene="GLUT4" /note="G00-119-997" /number=3 intron 3936..4027 /gene="GLUT4" /note="G00-119-997" /number=3 /label=exon_1 exon 4028..4152 /gene="GLUT4" /note="G00-119-997" intron 4153..4275 /gene="GLUT4" /note="intron 4a; G00-119-997" exon 4276..4379 /gene="GLUT4" /note="G00-119-997" intron 4380..4506 /gene="GLUT4" /note="intron 4b; G00-119-997" exon 
4507..4669 /gene="GLUT4" /note="G00-119-997" /number=5 intron 4670..4774 /gene="GLUT4" /note="G00-119-997" /number=5 exon 4775..4962 /gene="GLUT4" /note="G00-119-997" /number=6 intron 4963..5124 /gene="GLUT4" /note="G00-119-997" /number=6 exon 5125..5229 /gene="GLUT4" /note="G00-119-997" /number=7 intron 5230..5377 /gene="GLUT4" /note="G00-119-997" /number=7 exon 5378..5479 /gene="GLUT4" /note="G00-119-997" /number=8 intron 5480..5994 /gene="GLUT4" /note="G00-119-997" /number=8 exon 5995..6198 /gene="GLUT4" /note="G00-119-997" /number=9 intron 6199..6715 /gene="GLUT4" /note="G00-119-997" /number=9 exon 6716..8402 /gene="GLUT4" /note="G00-119-997" /number=10 repeat_region 7407..7725 /rpt_family="Alu repeat 2" BASE COUNT 1655 a 2461 c 2411 g 1875 t ORIGIN 1 gaattccgga tttttttttt tcttcttgag acggagtcac tctgtcgcca ggctggagtg 61 caagggcacg atcttggctc actacaacct ccacctcctg ggttcaagcc attttcctgc 121 ctcagcctcc cgagtagctg ggattacagg tgtgcataac cacgcccggc taatttttgt 181 atctttagca gacatggggt ttctctatgt tggccaggct ggtttcaaac tcctgacctc 241 agtcgatcca cctgccttgg cctcctaaag tgctgggatt acaggcatga gccaccaggc 301 cgggccggca ttccagattt ttcaggggat tcgtgcagca aaggaatcaa gaagggatgt 361 aaaggcacag tgtgttctgg gtacaataag gacttaggca ttgcccagaa caggaggcga 421 aggagataga aggagaggca ggagagatag gtaaggccag agatcggata agagaggcag 481 gaggttttgt tcactctgaa aagggatttg aacttggcaa ttggggcaac agagacagtg 541 acttcttgct tgagagatga gattggacct tcgaaaattg ttctctgccc tcgtcataaa 601 ggaaataaga ggagcacgaa gaccagtgag ggtgatggtg atctggactg aagtggcagc 661 cgccacggag aatatcggat gaatgtgaga gagttttgga ggtcaaagca ccaatgttgg 721 aaactaactg gataaacgag gagagcggcg caggacagga ggaatcgagc ctgacttcta 781 ccataggggt gactgggcgg gtaattcatt gaaataagga agttaggagg aggagcaggt 841 ttggacatgc tgatcactag agctgccaca tccgggcggt aacgaacacc tggatctgca 901 gctccagaga agggcctggg tcagatgtca ctgaagccct atggtggcgg aaaggcgaga 961 aatagtgggt tgagattcca agtgcaatcc actgcggctc ctcgctcgcc ctccaggtgg 1021 cagcacaacc ctgcgcttcc gaagcccgtt ttctgagcca 
gacactctcc acgctctggg 1081 tatttcggct tctctctccc cacacgccga ccctaggtcg cgcactttct gcctggcaga 1141 atttggccga gggatccaaa cccggagcag cctccagaga gcgtgtcgtt cacgcggcca 1201 gcatatgctc agagacctca gaggctcaga gacctcaggg ctggtggtgt ggtcggttgt 1261 gaccacttgt ccctcggacc ggctccagga accaacctgg ggaatgtgtg taggggaagg 1321 gcgggataga cagtgcccgg agcagggagg cgctgaaaga caggaccaag cagcccggcc 1381 accagacccg ttgtgggaac ggaatttcct ggcccccagg gccacactcg cgtgggaagc 1441 atgtcgcgga ccctttaagg cgtcatctcc ctgtctctcc gcccccgcct gggacaggcc 1501 gggacgcccg ggacctgaca tttggaggct cccaacgtgg gagctaaaaa tagcagcccc 1561 gggttacttt ggggcattgc tcctctccca acccgcgcgc cggctcgcga gccgtctcag 1621 gccgctggag tttccccggg gcaagtacac ctggcccgtc ctctcctctc agaccccact 1681 gtccagaccc gcagagttta agatgcttct gcagcccggg atcctagctg gtgggcggag 1741 tcctaacacg tgggtgggcg gggccttttg ttccagggac tcttttctca aaacttccca 1801 gtcggaggct ggcgggaacc cgagaggcgt gtctcgccag ccacgcggag gggcgtggcc 1861 tcattggccc gccccaccaa ctccagccaa actctaaacc ccaggcggag ggggcgtggc 1921 cttctggggt gtgcgggctc ctggccaatg ggtgctgtga agggcgtggc ccgcgggggc 1981 aggagcgagg tggcgggggc ttctcgcgtc ttttccccca gccccgctcc acaagatccg 2041 cgggagcccc actgctctcc ggatccttgg cttgtggctg tgggtcccat cgggcccgcc 2101 ctcgcacgtc actccgggac ccccgcggcc tccgcaggtt ctgcgctcca ggccggagtc 2161 agagactcca ggatcggttc tttcatcttc gccgcccctg cgcgtccagc tcttctaaga 2221 cgagatgccg tcgggcttcc aacagatagg ctccgaagta ggattcatca tgagggggcg 2281 gggcgggggg gcacgggtcc cgcttttctt gggctggggt cgcggttggg gtcagctggg 2341 ggtggttcct gcgcaggcgc agggggtgaa ggtagggggc tggctattta tacccggcct 2401 ggacaacccg tgactgtgag attccaatcc taccaaaagg cagagtgggt ctggagggcc 2461 tttcggggca caggcagcaa gtggattctg cgagccaggg ttcaccccct tgcctagtag 2521 gcggcgcggc gctccggaat cggggacacc ctgccctcga tccgactcgg gaaagcagat 2581 ccaggcgggt cttgccctcc gggagctgtc cgtccgtctt cgctcacggg cagtgtttcg 2641 aggaccggag gctctccgtg ggcccccacc cccactcctg gccgccctcc aggacgctga 2701 actttccctt ggccccatgg ttggtggagg ggcagagggg actgtcagcc 
ccccctcccc 2761 cagctcaggt ttccgcttgg agacagtctg tgccgccagc gagcggccac cactgccacc 2821 gcccctcaca ccaccttcct gccctcctcc cctgggcatg gctctcccag gcagaacccc 2881 tggacggccc tccctgcacg gaggttagag ggggagggca ggccacgacg tagatagaga 2941 aggccacccc tagatgaccg ggatgtcctt tctggaacag cacttcttgg tcctgttggg 3001 ggcctcctgg agctggctga cagaaccccc agaggggagg gaagaggaca gtggctgatg 3061 ataataatgc acgtgttaat ttatgaaacc agcactgtca aggatattgt taacatgtga 3121 tgttgatttt cacaacactc tcacagatag gtaggcaagg caggcaacat cacccccatc 3181 tcacagagga cactcaggtt cggggcaggg aagtgacttg gccaaggtca cacaaatctg 3241 agctcttaag gccaagcctg tctcaaggtc acagaagaat tttgagacaa gttctcaaac 3301 atttctctgc cttatggacc caaacatcca gtttctcctt tatgcccagg ttgcagttca 3361 gctcctgttt acattgagat ctttgtgcaa ttcctaatat ggcccagttt ccctcaccca 3421 acatgttggg tggagcccag tatcttcagg ctccagctgg gcccgggccc ctagcggaag 3481 gaaaaaaatc atggttccat gtgacatgct gtgtctttgt gtctgcctgt tcaggatggg 3541 gaaccccctc agcagcgagt gactgggacc ctggtccttg ctgtgttctc tgcggtgctt 3601 ggctccctgc agtttgggta caacattggg gtcatcaatg cccctcagaa ggtgagggcc 3661 tgcagctggc agggtggggg tacccaaacg aggaggacag gtgtctcggg ggtggtggaa 3721 aggggacggt ctgcaggaaa tctgtcctct gctgtccccc aggtgattga acagagctac 3781 aatgagacgt ggctggggag gcaggggcct gagggaccca gctccatccc tccaggcacc 3841 ctcaccaccc tctgggccct ctccgtggcc atcttttccg tgggcggcat gatttcctcc 3901 ttcctcattg gtatcatctc tcagtggctt ggaaggttcg cagctggagg gcaggggtgg 3961 gggaaacagg aagggagcca ctgctgggtg ccctcaccct cacagcctca ctctgtctgc 4021 ctgccaggaa aagggccatg ctggtcaaca atgtcctggc ggtgctgggg ggcagcctca 4081 tgggcctggc caacgctgct gcctcctatg aaatgctcat ccttggacga ttcctcattg 4141 gcgcctactc aggtactcac gggcaccaca gccctgccta gcgccctgtt ctctttcacc 4201 atgcctgggc tttcagatgg gaatggacac ctgccctcag ccctctcttc ttccctcgcc 4261 cagggctgac atcagggctg gtgcccatgt acgtggggga gattgctccc actcacctgc 4321 ggggcgccct ggggacgctc aaccaactgg ccattgttat cggcattctg atcgcccagg 4381 tgaccggagc aagcctcatg ggtgcctggg cagtggttag agtggggctc tggagaatat 
4441 ggtgggcttc caaggtaagg cagaagggct gagtgacctg ccttctttcc caaccttctc 4501 ccacaggtgc tgggcttgga gtccctcctg ggcactgcca gcctgtggcc actgctcctg 4561 ggcctcacag tgctacctgc cctcctgcag ctggtcctgc tgcccttctg tcccgagagc 4621 ccccgctacc tctacatcat ccagaatctc gaggggcctg ccagaaagag taagctctcc 4681 cgctgcagcc tggcccaggc ccatgcctcc gcctcatctt gctagcacct ggcttcctct 4741 caggtcccct caggcctgac cttcccttct ccaggtctga agcgcctgac aggctgggcc 4801 gatgtttctg gagtgctggc tgagctgaag gatgagaagc ggaagctgga gcgtgagcgg 4861 ccactgtccc tgctccagct cctgggcagc cgtacccacc ggcagcccct gatcattgcg 4921 gtcgtgctgc agctgagcca gcagctctct ggcatcaatg ctgtatgtgt ggagcagcct 4981 ccaggcaggg cacagccccg ggagggtaga cgagagtggg gagcaaaccc cctccaccaa 5041 cacccagggt agggccagcc tgttgtggct ggagtagagg aaggggcatt cctgccatca 5101 cttcttcttc tcccccacct ctaggttttc tattattcga ccagcatctt cgagacagca 5161 ggggtaggcc agcctgccta tgccaccata ggagctggtg tggtcaacac agtcttcacc 5221 ttggtctcgg taactgctca cctctggaat ggcccgagcc actggcttca cctccctggg 5281 tgtcccggag gtcctgctct tggttgccct cacccacgcg gcccctccta cttcccgtgc 5341 ccaaaaggct ggggtcaagc tccgactctc cccgcaggtg ttgttggtgg agcgggcggg 5401 gcgccggacg ctccatctcc tgggcctggc gggcatgtgt ggctgtgcca tcctgatgac 5461 tgtggctctg ctcctgctgg taaggcctgg aggctaggag gggctagcag cccaccccat 5521 gggaatggtc ctgtgagtct ctgtgaccag ccagggtccc ttcttaacac acatgctttc 5581 aatcctggcg ccagctccgg accaggactg gggctgactg gctccagaat ctgctgggat 5641 tgtggtctgc tcctgaggga tttgggctgc actgggaggc agctgtggca caactctggc 5701 agccaggagg gagagcccct gtcaagcctc aggaacaatc attcctaagg acccagcttt 5761 agagtccagg gagagctgac cgtcataaga actgagaggc cataacattt cctctgcctt 5821 gaacccactt gggatagcca gcagaatgcc agtcaagggc ctgctctaac ccgggacagc 5881 aggcccccta caagctgctg cggagggggt taggtttcac ttctctgagt tgagggcaag 5941 ggaagatcag aaaggcctca actggattct ccaccctccc tgtctggccc ctaggagcga 6001 gttccagcca tgagctacgt ctccattgtg gccatctttg gcttcgtggc attttttgag 6061 attggccctg gccccattcc ttggttcatc gtggccgagc tcttcagcca gggaccccgc 6121 
ccggcagcca tggctgtggc tggtttctcc aactggacga gcaacttcat cattggcatg 6181 ggtttccagt atgttgcggt aggtcccccc gccccagcct cccacaccgt aggccagagg 6241 tgggcatcac acagctagcc cacctgcttc cccgtcaggg actcctccag ccacagacca 6301 tgggtctttg ggtcagtttg gtggaccacc tgctccacag aatcaaagca aggaagggag 6361 ctgacctaga ttggatagta actgaggtgt ctgaaacgca ccagtggcat aacttacctt 6421 actccaagaa taaaatgata cactttgcat taatactaca aacagctggg actctcctct 6481 gagtgcagta actgaggatg gtgaagaggg cgaaaactaa gagtgtttgg ggttcagaga 6541 atcctctttt cagtgtaaat tctcattcct gctcatttcc cttgtccctg gaggaggcag 6601 ctgctgtctg ccgtcccccc agctccctat gaaggccttt agctcctggt tgcctgaaac 6661 taccccttcc ctccccacct cactccgtca acacctcttt ctccacctgt cccaggaggc 6721 tatggggccc tacgtcttcc ttctatttgc ggtcctcctg ctgggcttct tcatcttcac 6781 cttcttaaga gtacctgaaa ctcgaggccg gacgtttgac cagatctcag ctgccttcca 6841 ccggacaccc tctcttttag agcaggaggt gaaacccagc acagaacttg agtatttagg 6901 gccagatgag aacgactgag gggccaggca ggggtgggag agccagctct ctctacccgg 6961 cccagagacc ccttcctttc ctctgcagca ctttaaccct ctcttcccta ttatttccgg 7021 gtggaaaaga atccctgcag cctggtagaa ttgggaagct gggggaaggg tggtctgagc 7081 accccctcat tcccctcgtg tgactctctt ggattattta tgtgttgtgg tttggccgtg 7141 gccatcaggg tgggccactc tcccctccct cttccttccc ccatcccctt tcctccccac 7201 cttccccaga ctcagctcca gaataccttc ttcgctgcta gagaaggggg attggaggga 7261 agacaggtct agactttctc agtgggacaa accagagcag agagcaggac aggagacaag 7321 aaatccagtt tcccaccacc ttggactcct cccacaatct gggactttca ctgaattctt 7381 gccacgcaga ctctgggcaa agggggttct cttttttttt tttttttttt ttttgagaca 7441 gtctcgctct gtcgcccagg ctcgagtgca gtggcgtgat cttgcttcac tgcaagctgt 7501 ctcccaggtt cacgccattc tcctgcctca gcctccggag tagctgggac tacaggcgca 7561 tgccaccaca cctggctaat ttattttgta tttttagtat atacgcggtt tcaccatgtt 7621 agccagaatg gtctcgatct cctgacctcg tgatctgcct gcctcagcct cccaaagtgc 7681 tgggattaca ggcgtgagcc accgcgcctg gcgaagggag ttctctcttc gacccctgca 7741 ggctcagcct tccagggcaa gagggaacag gaaagtatgt gcccatgtgt ggcaagatgg 7801 aaggacggca 
ggctcccgcc tctaggcttg gggctctacc ccgatggttt cccaaggctg 7861 ccaagaagga gccctaactt tcttcctctc ccttcctgga agggtgctgc atccacaggc 7921 ttttgaccaa ctaaggcaaa gaggggattt gaaaggctgc ctggaaacac tgggctggga 7981 ggagcctttg gatattttta tatacgtttg aaaaggggat tgagagaaga aaccaaaggt 8041 cggttgtact aaatgtatat atatagatac ttctataaag tcactgctga agacaagcat 8101 cctattgtgg aggtacttga ggatgggctg agacagggac cataactctt cacccctctt 8161 cctccctctg tcctgcctca gctcaaggcc tcagaatctt ctggatgcca ttgctcatgc 8221 ccctactcac atttctactc gttgctttat taatagtaaa tgctcaataa attgtagctg 8281 ccagtgccgg gcattgctct tggcatttgc agaatactca ctctgtgagg gaggtgtcag 8341 cccatgtcac agatgggcag tgaaacccat gataggggca ctcttcacca gggacacagc 8401 tg // LOCUS HUMGRP78 5470 bp DNA PRI 08-NOV-1994 DEFINITION Human 78 kdalton glucose-regulated protein (GRP78) gene, complete cds. ACCESSION M19645 NID g183644 KEYWORDS glucose-regulated protein. SOURCE Human fetal liver DNA (library of Lawn et al.), clone hu28-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5470) AUTHORS Ting,J. and Lee,A.S. TITLE Human gene encoding the 78,000-dalton glucose-regulated protein and its pseudogene: structure, conservation, and regulation JOURNAL DNA 7 (4), 275-286 (1988) MEDLINE 88283347 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by A.Lee, 06-JUL-1988. 
FEATURES Location/Qualifiers source 1..5470 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q" prim_transcript 372..>5470 /note="GRP78 mRNA and introns" sig_peptide 593..646 /gene="GRP78" /note="GRP78 signal peptide" exon <593..714 /gene="GRP78" /note="GRP78 precursor; G00-119-285" /number=1 gene 593..714 /gene="GRP78" CDS join(593..714,805..1036,1402..1539,2175..2287,2377..2764, 2878..3115,3401..3568,4536..5098) /partial /note="GRP78 precursor" /codon_start=1 /db_xref="PID:g386758" /translation="MKLSLVAAMLLLLSAARAEEEDKKEDVGTVVGIDLGTTYSCVGV FKNGRVEIIANDQGNRITPSYVAFTPEGERLIGDAAKNQLTSNPENTVFDAKRLIGRT WNDPSVQQDIKFLPFKVVEKKTKPYIQVDIGGGQTKTFAPEEISAMVLTKMKETAEAY LGKKVTHAVVTVPAYFNDAQRQATKDAGTIAGLNVMRIINEPTAAAIAYGLDKREGEK NILVFDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGK DVRKDNRAVQKLRREVEKAKALSSQHQARIEIESFYEGEDFSETLTRAKFEELNMDLF RSTMKPVQKVLEDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEA VAYGAAVQAGVLSGDQDTGDLVLLHVCPLTLGIETVGGVMTKLIPSNTVVPTKNSQIF STASDNQPTVTIKVYEGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEVTFEIDVNGIL RVTAEDKGTGNKNKITITNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELES YAYSLKNQIGDKEKLGGKLSSEDKETMEKAVEEKIEWLESHQDADIEDFKAKKKELEE IVQPIISKLYGSAGPPPTGEEDTAEKDEL" mat_peptide 647..714 /gene="GRP78" /note="78 kdalton glucose-regulated protein" intron 715..804 /note="GRP78 intron A" exon 805..1036 /number=2 mat_peptide 805..1036 /note="78 kdalton glucose-regulated protein" intron 1037..1401 /note="GRP78 intron B" mat_peptide 1402..1539 /note="78 kdalton glucose-regulated protein" exon 1402..1539 /number=3 intron 1540..2174 /note="GRP78 intron C" mat_peptide 2175..2287 /note="78 kdalton glucose-regulated protein" exon 2175..2287 /number=4 intron 2288..2376 /note="GRP78 intron D" mat_peptide 2377..2764 /note="78 kdalton glucose-regulated protein" exon 2377..2764 /number=5 intron 2765..2877 /note="GRP78 intron E" mat_peptide 2878..3115 /note="78 kdalton glucose-regulated protein" exon 2878..3115 /number=6 intron 3116..3400 /note="GRP78 intron F" mat_peptide 3401..3568 /note="78 
kdalton glucose-regulated protein" exon 3401..3568 /number=7 intron 3569..4535 /note="GRP78 intron G" exon 4536..>5098 /note="GRP78 precursor" /number=8 mat_peptide 4536..5098 /note="78 kdalton glucose-regulated protein" BASE COUNT 1496 a 1140 c 1343 g 1491 t ORIGIN Chromosome 9q; 3 bp upstream of SphI site. 1 cccggggtca ctcctgctgg acctactccg accccctagg ccgggagtga aggcgggact 61 tgtgcggtta ccagcggaaa tgcctcgggg tcagaagtcg caggagagat agacagctgc 121 tgaaccaatg ggaccagcgg atggggcgga tgttatctac cattggtgaa cgttagaaac 181 gaatagcagc caatgaatca gctggggggg cggagcagtg acgtttattg cggagggggc 241 cgcttcgaat cggcggcggc cagcttggtg gcctgggcca atgaacggcc tccaacgagc 301 agggccttca ccaatcggcg gcctccacga cggggctggg ggagggtata taagccgagt 361 aggcgacggt gaggtcgacg ccggccaaga cagcacagac agattgacct attggggtgt 421 ttcgcgagtg tgagagggaa gcgccgcggc ctgtatttct agacctgccc ttcgcctggt 481 tcgtggcgcc ttgtgacccc gggcccctgc cgcctgcaag tcgaaattgc gctgtgctcc 541 tgtgctacgg cctgtggctg gactgcctgc tgctgcccaa ctggctggca agatgaagct 601 ctccctggtg gccgcgatgc tgctgctgct cagcgcggcg cgggccgagg aggaggacaa 661 gaaggaggac gtgggcacgg tggtcggcat cgacttgggg accacctact cctggtaagt 721 ggggttgcgg atgaggggga cggggcgtgg cgctggctgg cgtgagaagt gcggtgctga 781 tgtccctctg tcgggttttt gcagcgtcgg cgtgttcaag aacggccgcg tggagatcat 841 cgccaacgat cagggcaacc gcatcacgcc gtcctatgtc gccttcactc ctgaagggga 901 acgtctgatt ggcgatgccg ccaagaacca gctcacctcc aaccccgaga acacggtctt 961 tgacgccaag cggctcatcg gccgcacgtg gaatgacccg tctgtgcagc aggacatcaa 1021 gttcttgccg ttcaaggttc gaccggtttt cctcatccag ttagagaacg ggtgggtggt 1081 gggagtattt agagttataa gtctctggaa aagtgttgag acaacagttg aaggttatag 1141 acatgatgta tgtaataact ttaatactat tagtatgtta caaaacttaa gacagttgct 1201 gtcgtactgt ctacgatagt ttaggaataa aagaccgatt aaaactgaac tttgtaagac 1261 acctatactc cctgaagtat ttctagtcaa tttgcagccc caagggacca aaataaacca 1321 aattgtgggg atggtagtgg gtcttttaaa ctttgagatg tcattgtatc tgtgtctgaa 1381 aacaataatt ctttaaaata ggtggttgaa aagaaaacta aaccatacat tcaagttgat 1441 
attggaggtg ggcaaacaaa gacatttgct cctgaagaaa tttctgccat ggttctcact 1501 aaaatgaaag aaaccgctga ggcttatttg ggaaagaagg taaatatttc tagaacaatg 1561 ttaagtattt tttgatcatt agtattctcg gttggctgtt atgtatagaa gccttcgtga 1621 agggtttcaa aaattttaat cagaatggta ttcatgcttg tcacggttta attattgagt 1681 ccctttacta taagccaaac aaaaatagac ttttcatgta ttatttaatg cttacaattc 1741 caggaacaat aaaattttat atgttgtatt catcaataat tggcttaaaa actaaagtga 1801 tggtttgact gtaatttttt ttttttgaga tggagtcttg ctctgttgcc caggctggac 1861 tgcagtggca cgatctcagc tcactgcaac ctctgcctcc cgggttaagc agctctcctg 1921 cctcagcctc caagtaatgg aacgacaggc acaccaccac agctggctaa tttttttttt 1981 tttttttaat tttcagtaga gacagggttt ctccacattg ccaggctggt cttgaaatcc 2041 tgccctcagg ttgatcctcc tgcctagcct cccaaagtgc tggattatag gcagaagcca 2101 ccgcctggcc agactgtaat ttaaataagg gttaaactat gtgacaatac acttaattat 2161 ctttatcctt ttaggttacc catgcagttg ttactgtacc agcctatttt aatgatgccc 2221 aacgccaagc aaccaaagac gctggaacta ttgctggcct aaatgttatg aggatcatca 2281 acgagccgta agtatgaaat tcagggatac ggcatatttg ccaaatagtg gaaatgtgaa 2341 gtactgacaa aacttttccc tttttcaatc taatagtacg gcagctgcta ttgcttatgg 2401 cctggataag agggaggggg agaagaacat cctggtgttt gacctgggtg gcggaacctt 2461 cgatgtgtct cttctcacca ttgacaatgg tgtcttcgaa gttgtggcca ctaatggaga 2521 tactcatctg ggtggagaag actttgacca gcgtgtcatg gaacacttca tcaaactgta 2581 caaaaagaag acgggcaaag atgtcaggaa ggacaataga gctgtgcaga aactccggcg 2641 cgaggtagaa aaggccaagg ccctgtcttc tcagcatcaa gcaagaattg aaattgagtc 2701 cttctatgaa ggagaagact tttctgagac cctgactcgg gccaaatttg aagagctcaa 2761 catggtatgt tccttgtttt ctgctttgct aatgagatct ccttagactc tgaattcagg 2821 acattgcatc tagatactta gataacagac atcacagtaa ccatgtcttt tttctaggat 2881 ctgttccggt ctactatgaa gcccgtccag aaagtgttgg aagattctga tttgaagaag 2941 tctgatattg atgaaattgt tcttgttggt ggctcgactc gaattccaaa gattcagcaa 3001 ctggttaaag agttcttcaa tggcaaggaa ccatcccgtg gcataaaccc agatgaagct 3061 gtagcgtatg gtgctgctgt ccaggctggt gtgctctctg gtgatcaaga tacaggtagg 3121 tcatcatcgc 
agcatctttc ttagtgattc agtagcttga tggaagagct cggtacccct 3181 attgctttag aaaataccag aatatgagca acaaggtcac acagctagta aagggtataa 3241 gtgaagacaa gactggggta gtctccaaga tcattagcaa ctgtttaatt cactgccttt 3301 aaaatgtgtg tgttagaacc taaccaaatg ttagagagat aaactttaca tagctcatag 3361 ggagaacttg aattaaaagt taaataactt atccttacag gtgacctggt actgcttcat 3421 gtatgtcccc ttacacttgg tattgaaact gtaggaggtg tcatgaccaa actgattcca 3481 agtaatacag tggtgcctac caagaactct cagatctttt ctacagcttc tgataatcaa 3541 ccaactgtta caatcaaggt ctatgaaggt aattacctta agtttggtta atatcatggc 3601 tttttttttg agatgaagtc ttgctctgtt gcccaggctg gactgcagtg gcacgatctc 3661 ggctcactgc aaattctgtc tcccgggttc aagtgattct cctgcctcag cctccagagt 3721 agctggatta cagcctgacc accacacctg gctaatttct gtatttttag tagaggatgg 3781 gctttcacca tgtttcccag gctggtctcc aactcctgac ctcaggtcat ctgcctgcct 3841 ccaccgtccc gaaagtactg ggattatagc gtgagccacc acgccagatc tatctatcat 3901 ggcatatttt aaaagaacat gacttaatat gtcctattga aatggctagg gaactaagta 3961 actgctgttt tcagatggag gtcttaattt gaataatgtt gatattagat atttagcatt 4021 cttttttttt tttttttaat ggagtcttgc tctgtcgcct aggctggggt gcagtggcat 4081 gacttgcaac ctctgcctcc cgaatagctg ggattacagg tgcccaccat cacgcccggc 4141 taagttttgt atttttagta gaggcgagtt tcgccatgtt ggccaggctg gtcttgaacc 4201 cctaacctca gtgatcccac ggtcaccgac ctggcctccc aaaagtactg tacccagcca 4261 atgattagca ttctcactaa taatagcatc tgagctggct cctagagtac aagaaaaagg 4321 agttcacagt actttaaaat agataaaatt cagttgagtt agtaacctaa ctcattgtta 4381 gtactagttg ctgctccttg tagaccaata tgaaattact tttagctcga taaaaccaaa 4441 agtgtcactt tatgcttcag actgaaatgc ggggatctag atgtgctaat gcttgtcagt 4501 aacaactaac aagtttttct gtatgtaact tctaggtgaa agacccctga caaaagacaa 4561 tcatcttctg ggtacatttg atctgactgg aattcctcct gctcctcgtg gggtcccaca 4621 gattgaagtc acctttgaga tagatgtgaa tggtattctt cgagtgacag ctgaagacaa 4681 gggtacaggg aacaaaaata agatcacaat caccaatgac cagaatcgcc tgacacctga 4741 agaaatcgaa aggatggtta atgatgctga gaagtttgct gaggaagaca aaaagctgaa 4801 ggagcgcatt gatactagaa 
atgagttgga aagctatgcc tattctctaa agaatcagat 4861 tggagataaa gaaaagctgg gaggtaaact ttcctctgaa gataaggaga ccatggaaaa 4921 agctgtagaa gaaaagattg aatggctgga aagccaccaa gatgctgaca ttgaagactt 4981 caaagctaag aagaaggaac tggaagaaat tgttcaacca attatcagca aactctatgg 5041 aagtgcaggc cctcccccaa ctggtgaaga ggatacagca gaaaaagatg agttgtagac 5101 actgatctgc tagtgctgta atattgtaaa tactggactc aggaactttt gttaggaaaa 5161 aattgaaaga acttaagtct cgaatgtaat tggaatcttc acctcagagt ggagttgaaa 5221 ctgctatagc ctaagcggct gtttactgct tttcattagc agttgctcac atgtctttgg 5281 gtggggggga gaagaagaat tggccatctt aaaaagcggg taaaaaacct gggttagggt 5341 gtgtgttcac cttcaaaatg ttctatttaa caactgggtc atgtgcatct ggtgtaggag 5401 gttttttcta ccataagtga caccaataaa tgtttgttat ttacactggt ctaatgtttg 5461 tgagaagctt // LOCUS HUMGSTM4A 6082 bp DNA PRI 20-DEC-1994 DEFINITION Human glutathione transferase class mu number 4 (GSTM4) gene, complete cds. ACCESSION M96233 NID g306816 KEYWORDS glutathione S-transferase; glutathione transferase; glutathione transferase M4; glutathione transferase mu. SOURCE Homo sapiens (tissue library: lambda FIX) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Seidegard,J., Vorachek,W.R., Pero,R.W. and Pearson,W.R. TITLE Hereditary differences in the expression of the human glutathione transferase active on trans-stilbene oxide are due to a gene deletion JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (19), 7293-7297 (1988) MEDLINE 89017184 REFERENCE 2 (bases 1 to 6082) AUTHORS Comstock,K.E., Johnson,K.J., Rifenbery,D. and Henner,W.D. TITLE Isolation and analysis of the gene and cDNA for a human Mu class glutathione S-transferase, GSTM4 JOURNAL J. Biol. Chem. 
268 (23), 16958-16965 (1993) MEDLINE 93352467 FEATURES Location/Qualifiers source 1..6082 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda FIX" /map="1p13.3" exon 478..776 /gene="GSTM4" /note="G00-221-589" /number=1 gene join(741..776,1063..1138,1566..1630,1942..2023,2124..2224, 3166..3261,3352..3462,5515..5604) /gene="GSTM4" CDS join(741..776,1063..1138,1566..1630,1942..2023,2124..2224, 3166..3261,3352..3462,5515..5604) /gene="GSTM4" /EC_number="2.5.1.18" /codon_start=1 /db_xref="GDB:G00-221-589" /product="glutathione transferase M4" /db_xref="PID:g306817" /translation="MSMTLGYWDIRGLAHAMRLLLEYTDSSYEEKKYTMGDAPDYDRS QWLNEKFKLGLDFPNLPYLIDGAHKITQSNAILCYIARKHNLCGETEEEKIRVDILEN QAMDVSNQLARVCYSPDFEKLKPEYLEELPTMMQHFSQFLGKRPWFVGDKITFVDFLA YDVLDLHRIFEPNCLDAFPNLKDFISRFEGLEKISAYMKSSRFLPKPLYTRVAVWGNK " intron 777..1062 /gene="GSTM4" /note="G00-221-589" /number=1 exon 1063..1138 /gene="GSTM4" /note="G00-221-589" /number=2 intron 1139..1565 /gene="GSTM4" /note="G00-221-589" /number=2 exon 1566..1630 /gene="GSTM4" /note="G00-221-589" /number=3 intron 1631..1941 /gene="GSTM4" /note="G00-221-589" /number=3 exon 1942..2023 /gene="GSTM4" /note="G00-221-589" /number=4 intron 2024..2123 /gene="GSTM4" /note="G00-221-589" /number=4 exon 2124..2224 /gene="GSTM4" /note="G00-221-589" /number=5 intron 2225..3165 /gene="GSTM4" /note="G00-221-589" /number=5 exon 3166..3261 /gene="GSTM4" /note="G00-221-589" /number=6 intron 3262..3351 /gene="GSTM4" /note="G00-221-589" /number=6 exon 3352..3462 /gene="GSTM4" /note="G00-221-589" /number=7 intron 3463..5514 /gene="GSTM4" /note="G00-221-589" /number=7 exon 5515..6049 /gene="GSTM4" /note="G00-221-589" /number=8 polyA_signal 6031..6038 /gene="GSTM4" /note="G00-221-589" polyA_site 6049 /gene="GSTM4" /note="G00-221-589" /citation=[1] BASE COUNT 1247 a 1607 c 1690 g 1538 t ORIGIN 1 caccttgaca cataggccac ttcctgagcc tggacccagt ctcagagctg gggaactggc 61 ccaatgcaaa agggtcggga gcatctgcaa cagagactga gctctatcag cttcggtgac 121 atagcctcca 
ttcacgctcc ccaactcagc agagagagca caccatcaga cttctaagac 181 ttagtagcca agaagtgttg aattaaactc tctgagacct ctctttagtc tgaccctggc 241 agcctcagtc tcccagagcc tgtgggaact cggcagccga gaggcagaag gctgggcgac 301 gtccggagaa gaagaaacgg gggaagaact tttctcttac gatctggctt tactctcacg 361 cgcacagccg agtccctggg gacccagcag aggtccgaag cggagcgggg cggggcgggg 421 ctacggaagc tggcgaggcc gagcccctcc tagtggttcc ggaccttgct ccctgaacac 481 tcggaggtgg cggtggatct tactccttcc agccagtgag gatccagcaa cctgctccgt 541 gcctcccgcg cctgttggtt ggaagtgacg accttgaaga tcggccggtt ggaagtgacg 601 accttgaaga tcggcgggcg cagcggggcc gagggggcgg gtctggcgct aggtccagcc 661 cctgcgtgcc gggaacccca gaggaggtcg cagttcagcc cagctgaggc ctgtctgcag 721 aatcgacacc aaccagcatc atgtccatga cactggggta ctgggacatc cgcggggtga 781 gtgagggtcc gctgcactgt gggaccgggc gcgtgggcgg gaagtgccga gcggctgggg 841 accggctcta gggacggttc cctccttagg gctatctctc acaggagggc ctgtgcatgc 901 ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcgtgcgccg gggtgggggg 961 ggggtgcagt gcagtgtaga ctaggggctc acctggtgca gagaaagtca ccaagtcagg 1021 gaccctccat ctctgacacg acctgcgggc catctcttcc agctggccca cgccatgcgc 1081 ctgctcctgg aatacacaga ctcaagctac gaggaaaaga agtatacgat gggggacggt 1141 aatgacaccc ttgtgtccgg gctctgccca ctcacgctga gttggcacca agcaacccat 1201 ggtggccacc tgtggctacc tctgcaggcc tcccctgctg gagctgcagg ctgtcccttc 1261 cctgagcccc ggtgagggag tcctgtggcc ttgcaaggca gaatgctggg gcgggatgct 1321 gggccccctg tttaattggg ttgggtgtcc ctcagagctt ccctaaaccc tggaagcctt 1381 agccgtgtgg ggtccagagc cctcagcggg attctttgtc cctgaaccct gggatgtggg 1441 actgagtggt cagattctag atccacctgt ctcagggatc ttgccactgg ttccttggga 1501 gggtccccgg gaaggagggc tgggctctgg ggaggtttgt tttcacttct tcttccccac 1561 ggcagctcct gactatgaca gaagccagtg gctgaatgaa aaattcaagc tgggcctgga 1621 ctttcccaat gtaggtgcag gggaaggggc ggttttgggg gaaagtgcga cgtgtctctg 1681 actgcatctc ctctccccag attagaggtg ttcggatcag gagtcttctg cccaattcct 1741 ctcactcctg gttgtctaca cagcccctgc atgatgttct gtgtcccagc tcatttgttc 1801 atgtgacagt atttctatgt caggcctgcc 
atgagcgggc acagtgagtg cctggtctcc 1861 cctctgccct tgcatatggg aaggggatgc tggggagcct gctggcccca actgagcttc 1921 cccggtttcc catctatcca gctgccctac ttgattgatg gggctcacaa gatcacccag 1981 agcaacgcca tcctgtgcta cattgcccgc aagcacaacc tgtgtgagtg tggttggctg 2041 cagtgtgtgg ggggaaggtg gcatcctcct tggctggatt ggggtgctat gctcagagtg 2101 agtctgtgtt ttgtgggtgg caggtgggga gacagaagag gagaagattc gtgtggacat 2161 tttggagaac caggctatgg acgtctccaa tcagctggcc agagtctgct acagccctga 2221 ctttgtgagt ccctccctgg tctggaccag aagccaggct tgtattccca tctactctgg 2281 tcctattcac aatttgaact cctcactgct attgatcctc tgagggttcc ctgtactgta 2341 gcagagtgac tgtcatatca ttaaacttat aaaataaaaa cttgaaatca gtccccacca 2401 atcataggaa gtcctatgaa agctagcaat tcagttccta gacaataaag tcatgcgttc 2461 agaattccct tcacctctga cccctgacct ctgccacggg ccatctaacc cagctggttc 2521 atgccatctg cctgttcggg gaatacgcag actcacacta tggggtaaag aagtatgtag 2581 tgggatgtaa tagcaccctc ctctgtggaa ttccgtgcta ttcttttcct agtgcttctc 2641 cttgacacgc acagggatga atgcattttc ctaacaggca gctcaggggc ggccagaggg 2701 cctttgttcc tctgtctcct tccatgcagc ctctcaaaat ctcatctctc cagaaactac 2761 tcaatgtcca ttgtattatt ctgccactgc actaggagga actttcaact tctttctctg 2821 agctccttta gttctttgta tccttgattt tgctcgtgtc tgggtccaga gcctgccagg 2881 tactcaggtg ctcctgggct gacccagagg agggtggttg ggaggtcagt ggggacagat 2941 tcagggatag tgttgcattc ctctctgcct tcccatcacc acaaaagcct ccagctaccc 3001 atttggagtg taataaatgc tggtatgtcc agctgaagcc agttccagct gtggggaaga 3061 tggctgcttg ctcgtggcca gctggggcca tacacagccc tggggaggcc acatctgtgc 3121 agggagcttg tgtctgaggg tggtgacagc tgttttctgc ctcaggagaa actgaagcca 3181 gaatacttgg aggaacttcc tacaatgatg cagcacttct cacagttcct ggggaagagg 3241 ccatggtttg ttggagacaa ggtaatgggg gcatgtgatg aggacactag agatttgcca 3301 tacatcctat gttacagaga ttccagccca cacattcttg gccttctgca gatcaccttt 3361 gtagatttcc tcgcctatga tgtccttgac ctccaccgta tatttgagcc caactgcttg 3421 gacgccttcc caaatctgaa ggacttcatc tcccgctttg aggtgatgcc cccatcctcc 3481 tttctctttg atgccccttg ttccgttacc tcctttcaga 
tgctttccca gtcctggatc 3541 tgcataaaga ataacttgca tttattgagt gctggcttca tgccaggaac cttgcccagc 3601 acattatacc tatcgtgtgg aatttgaaat ttccaacatt cctacagggt gacagaatta 3661 tcttgcccat ttagagataa gaaaactttg aatgagaggg tcagtccttt gttctgggtc 3721 ccagagccag tggaggctgt gctgggctcc ctgtgagcat ctggatctat gggtggcagt 3781 cagggctctc ccttttgtga caaaagaaag aagcctcagg ctcatccagc ctggatttca 3841 cagcccaggg cactttggaa gaggcagaga actttaggag catggatgca gctggcaata 3901 gtaggactga cacacggtgg cattgacgtc gagtacgaaa cccacaggca gtattcatag 3961 ctactcccag aagctttgca cgatcagacc cccacgtggg gaatcctgag agccagagct 4021 gtggccagag ctggattagg gtacatatgt gggtgcccct gttgaaggag tgtatgttga 4081 agtgctctgt gctggggcac tctccttctt tatctttttt cctctctttt ttccctccag 4141 tgttccaagt gttccccctg tgagatgagt agcacactga ttttactgct attcaccgac 4201 cttctcctct gcatgaggca gggtgtgagg cacagtggga gttgcataga tgactgcccc 4261 atcctggaaa tgagtgcagt gagaggcctg caggcagagc agcctgtgag gtgtgtggca 4321 ccacctgggt accaggcctg gggcctgccc ctcactcatg gggaaccatc cctcacccgt 4381 gctgaatttg tttgagagca gcaaatccta cttttagtac agatgtgaga atttgaggca 4441 ttagtccaac aagtttttca gcctagaatt tgttttcctt tcccactacc catcaaggga 4501 tctggttact cagctagttc ccatcagctc tggctgtggt ctcggctgag tggccttggg 4561 gttatgtaag aggtagtggg aggggaagag agctgagggc tgcagcatat ggtccacctg 4621 ggcttggcct ttgggaatag gcagccctgg ctctctctga atccttagaa attacacggc 4681 tatttgatcc tggaaagatc gtgcagagca cacctgagtg tcatacagcc tggtctgagg 4741 tagtggggtt ggagatgagg tgggttgggg cacagtggtg tttagctcag gtaccaggtg 4801 gggaggttta gactttctgc tttaaaagga atgattagag cctggtctgg cgtttctttt 4861 gctggtccaa cacaccttga ccactttcat ccaggttttg ccaggtcctt gggtgagatc 4921 tgggctctct tccaggctgc acagacattt tcagaggtcc cctctgtgtg tgcaaaccta 4981 ggcaagccag gtgcctccct gtgaaacagg agaatgttgt gtagtcagag agtgacagga 5041 cctcctgagg gatttggggg aggatgggga tttgacagaa agaggccaga actggagaga 5101 gacagaacca gtctacgttg cagctctgtc ccccttagta gctatttgag tgtgaggaag 5161 ttactgaact tctgtttccc acatgagaaa tggtgataat agattcagcc 
ttgcagagta 5221 gtcgagtggg ttttctaagc ttatgttgta atttctcttg ggtacagagc acccagcacc 5281 gtgtagaatc ttcataagtg ttagctgtta ctgtggtaca acattactta aaggaagttg 5341 gaagagttaa ctccgcaaat ctggggaccc taagaggctg tgtgatgcct cagcacttga 5401 gcccacgtgg aaaggctgtg gccagggccc tgacctgctg tgtctgcagt ggggttgtcc 5461 cagccctcat gggcagctga ccttgagttc tggccttatt ttcccccctc tcagggcttg 5521 gagaagatct ctgcctacat gaagtccagc cgcttcctcc caaaacctct gtacacaagg 5581 gtggctgtct ggggcaacaa gtaatgcctt gaaggccagg aggtgggagt gaggagccca 5641 tactcagcct gctgcccagg ctgtgcagcg cagctggact ctgcatccca gcacctgcct 5701 cctcgttcct ttctcctgtt tattcccatc tttaccccca agactttatt gggcctcttc 5761 acttccccta aacccctgtc ccatgcaggc cctttgaagc ctcagctacc cactttcctt 5821 catgaacatc cccctcccaa cactaccctt ccctgcacta aagccagcct gaccttcctt 5881 cctgttagtg gttgtatctg ctttgaaggg cctacctggc ccctcgcctg tggagctcag 5941 ccctgagctg tccccgtgtt gcatgacagc attgactggt ttacaggccc tgctcctgca 6001 gcatggcccc tgccttaggc ctacctgatc aaaataaagc ctcagccaca tttgctatag 6061 tcttgtctta tttgctcctg gc // LOCUS HUMHA2WC 2226 bp DNA PRI 04-MAR-1997 DEFINITION Human gene for aquaporin-2 water channel, exon1-4, complete cds. ACCESSION D31846 NID g567249 KEYWORDS aquaporin-2 water channel; aquaporin-CD. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2226) AUTHORS Uchida,S., Sasaki,S., Fushimi,K. and Marumo,F. TITLE Isolation of human aquaporin-CD gene JOURNAL J. Biol. Chem. 269 (38), 23451-23455 (1994) MEDLINE 94375443 REFERENCE 2 (bases 1 to 2226) AUTHORS Uchida,S. TITLE Direct Submission JOURNAL Submitted (17-JUN-1994) to the DDBJ/EMBL/GenBank databases. 
Shinichi Uchida, Tokyo Medicaland Dental University, 2nd Department of Internal Medicine; 1-5-45 Yushima, Bunkyo-ku, Tokyo 113, Japan (Tel:03-3813-6111(ex.3659), Fax:03-3818-7177) FEATURES Location/Qualifiers source 1..2226 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" TATA_signal 545..551 exon 574..1027 /number=1 CDS join(668..1027,1095..1259,1327..1407,1465..1674) /codon_start=1 /product="human aquaporin-2 water channel" /db_xref="PID:d1007204" /db_xref="PID:g567250" /translation="MWELRSIAFSRAVFAEFLATLLFVFFGLGSALNWPQALPSVLQI AMAFGLGIGTLVQALGHISGAHINPAVTVACLVGCHVSVLRAAFYVAAQLLGAVAGAA LLHEITPADIRGDLAVNALSNSTTAGQAVTVELFLTLQLVLCIFASTDERRGENPGTP ALSIGFSVALGHLLGIHYTGCSMNPARSLAPAVVTGKFDDHWVFWIGPLVGAILGSLL YNYVLFPPAKSLSERLAVLKGLEPDTDWEEREVRRRQSVELHSPQSLPRGTKA" intron 1028..1094 /number=1 exon 1095..1259 /number=2 intron 1260..1326 /number=2 exon 1327..1407 /number=3 intron 1408..1464 /number=3 exon 1465..>2226 /number=4 polyA_signal 2221..2226 BASE COUNT 412 a 686 c 666 g 462 t ORIGIN 1 aagcttaatg atttatgggt gattagctgc aagaatgcaa gcacagaaga cacaaacctt 61 tatgccttgg aaatttgtca caagccaacc agttgttttc catccctgtg aagcaggaat 121 aattggaggt ctgacagaag aacttcccag ttaccagaag gagcagcact catgttctct 181 cttagttttg tgtgaggtgt tgcccctgcc ttggtcaaca gtttgagtca gagagatggg 241 ggccgggcac aaatccccac cagacgttcc cattcacaga ccctgtgggg gctgggagac 301 gccctggggc aagcagccct ggggtaacca agggaacaaa cacggaaaac caggacgtca 361 gtccttatct ggagctcatt aaatggggaa cattagtcag ctgtgaaggc caagataggg 421 tgataggcct gtgggtgggc tgggatgggg catgggggca gaggccgcca tggaggagaa 481 gaggtattgg cctcaacgaa ctccacctcc ccgccacgtg cccagatccg ggatggaagg 541 accctataaa tgcccacaac ccagcctccc cagaggcctt gagaaagaga gcgatagagt 601 gcgagagcga gtgcccggag catcctggcc ctgagacagc tgggccagcc ccgcagggct 661 ctgcagcatg tgggagctcc gctccatagc cttctccagg gctgtgttcg cagagttcct 721 ggccacactc ctcttcgtct tctttggcct cggctctgcc ctcaactggc cacaggccct 781 gccctctgtg ctacagattg ccatggcgtt tggcttgggt attggcaccc tggtacaggc 841 
tctgggccac ataagcgggg cccacatcaa ccctgccgtg actgtggcct gcctggtggg 901 ctgccacgtc tccgttctcc gagccgcctt ctacgtggct gcccagctgc tgggggctgt 961 ggccggagcc gctctgctcc atgagatcac gccagcagac atccgcgggg acctggctgt 1021 caatgctgtg agtagccaca actttgccat ccacaagggg tgcccctacc ccggccctct 1081 tctctgtccc ccagctcagc aacagcacga cggctggcca ggcggtgact gtggagctct 1141 tcctgacact gcagctggtg ctctgcatct tcgcctccac cgatgagcgc cgcggagaga 1201 acccgggcac ccctgctctc tccataggct tctccgtggc cctgggccac ctccttgggg 1261 taggtcatgg ccatgggttc cagcctccct tcccttctct ctttgattgc cctcctccca 1321 ctgcagatcc attacaccgg ctgctctatg aatcctgccc gctccctggc tccagctgtc 1381 gtcactggca aatttgatga ccactgggta atggctgaaa ccccctgccc tccccgccaa 1441 gccgccctct ccgctcgccc ccaggtcttc tggatcggac ccctggtggg cgccatcctg 1501 ggctccctcc tctacaacta cgtgctgttt ccgccagcca agagcctgtc ggagcgcctg 1561 gcagtgctga agggcctgga gccggacacc gattgggagg agcgcgaggt gcgacggcgg 1621 cagtcggtgg agctgcactc gccgcagagc ctgccacggg gtaccaaggc ctgagggccg 1681 ccagcccgtg ctaaggcccc gacggacgct tgtgaggccc gaggcagaag ggccaccccg 1741 tccctcctct cccgcagtct gaagttggcc ccagcgcaga gtagctgctt cctggacgtg 1801 cgcgcccagg ccagtgctgt gagcaggcgg ggaggaggct gccggaggag cctgagcctg 1861 gcaggttccc ctgccctgag gctgtgagca gctagtggtg gcttctcctg cctttttcag 1921 ggaactggga aacttagggg actgagctgg ggagggaggc aggtgggtgg taagagggaa 1981 actctggaga gcctgcaccc aggtactgag tggggagtgt acagaccctg ccttgggggt 2041 tctgggaatg atgcaactgg ttttactagt gtgcaagtgt gttcatcccc aagttctctt 2101 ttgtcctcac atgcagagtt gtgcatgccc ctgagtgtga acaggtttgc ctacgttggt 2161 gcaagtgtgc atggctgggg acttctcact tccccttgca ccccttcctc cccaacctgc 2221 aataaa // LOCUS HUMHBA1 2685 bp DNA PRI 17-NOV-1994 DEFINITION Human alpha globin gene cluster on chromosome 16: zeta gene. ACCESSION J00182 J00181 NID g183790 KEYWORDS alpha-globin; gene duplication; globin; pseudogene; repeat region; zeta-globin. SEGMENT 1 of 4 SOURCE Human DNA from several lambda clones. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1885 to 1956) AUTHORS Lauer,J., Shen,C.K. and Maniatis,T. TITLE The chromosomal arrangement of human alpha-like globin genes: sequence homology and alpha-globin gene deletions JOURNAL Cell 20 (1), 119-130 (1980) MEDLINE 80222848 REFERENCE 2 (bases 1 to 2685) AUTHORS Proudfoot,N.J., Gil,A. and Maniatis,T. TITLE The structure of the human zeta-globin gene and a closely linked, nearly identical pseudogene JOURNAL Cell 31 (3 Pt 2), 553-563 (1982) MEDLINE 83129370 COMMENT The human alpha globin gene cluster, located on the short arm of chromosome 16, spans about 30 kb and includes the following five loci: 5'- zeta - pseudozeta - pseudoalpha-1 - alpha-2 - alpha-1 -3' [1] The zeta globin transcripts are said to be alpha-like, although their introns are considerably larger than those of the alpha globin genes. The first intron contains a variation of the repeat sequence 'acagtggggagggg' and the second intron contains the repeat sequence 'cgggg'. The former is similar to a 14 bp repeat found on the 5' side of the human insulin gene [2]. Zeta-alpha gene divergence is discussed in [2]. Highly AT-rich sequences are found in the 5' and 3' flanks starting at bases 111 and 2609 respectively. Typical promoter elements 'ccaat' and 'tata' are found at bases 705 and 740. The zeta polypeptide, encoded by the sequence given below is synthesized in the yolk sac of the early embryo, in contrast to the alpha globins. For the evidence pertaining to the functional significance of the zeta globin, and the pseudogene character of the 3' zeta gene, see [2]. 
FEATURES Location/Qualifiers source 1..2685 /organism="Homo sapiens" /db_xref="taxon:9606" /map="16p13.3" prim_transcript 770..2439 /note="hbz mRNA [2]" exon <825..919 /gene="HBZ" /note="zeta globin; G00-119-302" /number=1 CDS join(825..919,1761..1965,2205..2333) /partial /gene="HBZ" /note="zeta globin" /codon_start=1 /db_xref="PID:g386763" /translation="MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYF PHFDLHPGSAQLRAHGSKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLL SHCLLVTLAARFPADFTAEAHAAWDKFLSVVSSVLTEKYR" intron 920..1760 /gene="HBZ" /note="hbz intron 1" repeat_region 979..992 /note="repeat sequence, copy 1" exon 1761..1965 /gene="HBZ" /number=2 intron 1966..2204 /gene="HBZ" /note="hbz intron 2" repeat_region 1971..1975 /note="repeat sequence copy 1" exon 2205..>2333 /gene="HBZ" /note="zeta globin" /number=3 polyA_signal 2419..2424 /gene="HBZ" /note="hbz mRNA polyadenylation signal [2]" BASE COUNT 495 a 774 c 966 g 450 t ORIGIN HinfI site on the short arm of chromosome 16p13.3 [Cell 31, 553-56. 
1 ctcatgaggc tgaggcagaa gaatcacttg aaccagggag tcagaggttg cagtgagctg 61 ggatcgcacc actgcactcc accctgggcg acaaatcgag attccatctc aaaaaaagaa 121 aaaaaaatta aaaggaatat ttgcctcatt atgttacaat aactaatatg gaaagcaata 181 ttgcaatgcc tattagcaca tgacattagg tgaattctcc tttgtccccg gacctgctgc 241 ctcctcctgc ttgtcagggg acagatccag tacatctccc ctcagcgctg ggtggaccta 301 acccttgctt tcttggagga aacccaggaa tccagagaca aagtggaagg gtactggcat 361 gtggttgggc agggctgcct gaggtcggtg tcagccgacc gtggggcttg gtcccaggag 421 gctgcttact gggccctgct cctctggttt cccccaagtc gtgattctga aatgaataag 481 gacggtgcag aactggacta caaatgcagg agtgacttcc tgggagggtg gggcccctat 541 ctctcctaga ctctgtggtc agactctggc caacaccccc tgtaaggcca caggagagga 601 acaggagtga cagcccccaa accccagtcc cacagccctg agggcccctt tgtcactgga 661 tctgataaga aacaccaccc ctgcagcccc ctcccctcac ctgaccaatg gccacagcct 721 ggctgggccc agctccctgt atataagggg accctggggg ctgagcacta ccaaggccag 781 tcctgagcag gcccaactcc agtgcagctg cccaccctgc cgccatgtct ctgaccaaga 841 ctgagaggac catcattgtg tccatgtggg ccaagatctc cacgcaggcc gacaccatcg 901 gcaccgagac tctggagagg tgagtgtcag acgggactgc cagagggact gggtgggagg 961 ccaggtatgt gagtggggac agtggggagg gggcggtggg gaggggacag tggggagggg 1021 accatggaga ggagacagtg gggaggggac agtggggaga ggacagtgag gaggggacct 1081 tggggagggg acagtgagga gggaaccgtg gagaggggac agtgaggaag ggacagtgag 1141 gacagatagg agttgcacaa ccatctgggc tcgctgagac ctgggcaggc acaggcccag 1201 gttctgacaa gcagagggtg aaaggtttcg ttctaggcct gaagggcctt acagggcagc 1261 cagggcacta cagcctctaa agtcccagca tctgggatca gggcactgtc ccagcttcaa 1321 attcccagca tctgatcccc tgggaggggc cagggagctt ttccttccct ggaacgctgc 1381 tgggaggtca tgagcctgca gaaggggtgg cgggcaaccc agtctggggc tgggagggag 1441 gtcctgtggc cagaggagac ggtggagggg ctgggggcac caggcgtgct ggaggcggag 1501 ggcgggagat ttggggacca ggctgcacag aacccgtcgg aagcagggcg atcagccggg 1561 gagctgcggg cctggggggc ctctagccca gggcagcctg ggaggggcag ctgcctgggc 1621 acccgggccc cgcgaggagg ggctggggcc tgctgcgggg tcgcagatgt gtcccggtgc 1681 tcggagaggg ccgcagggcg 
cgtgggccgt ggcgggaggc cgcgctgctg ggagctcacg 1741 gcccccgccc cccgtcccag gctcttcctc agccacccgc agaccaagac ctacttcccg 1801 cacttcgacc tgcacccggg gtccgcgcag ttgcgcgcgc acggctccaa ggtggtggcc 1861 gccgtgggcg acgcggtgaa gagcatcgac gacatcggcg gcgccctgtc caagctgagc 1921 gagctgcacg cctacatcct gcgcgtggac ccggtcaact tcaaggtgcg cggggcgcgg 1981 tgcgggcggg gcggggcgcg gtgcgggcgg ggcggtgcgg ggcggggcgc ggtgcgggcg 2041 gggcggtgcg gggcggggcg gggcggggtc gcggggaggg gcggggcggg gtcgcggggc 2101 ggggcggggg tcgcggggcg gggcggggtc gcggggcggg gcggggtcgc ggggcggggc 2161 ccgggctagg ccccgccccc gcactgagcc gcccccgccc ccagctcctg tcccactgcc 2221 tgctggtcac cctggccgcg cgcttccccg ccgacttcac ggccgaggcc cacgccgcct 2281 gggacaagtt cctatcggtc gtatcctctg tcctgaccga gaagtaccgc tgagcgccgc 2341 ctccgggacc cccaggacag gctgcggccc ctcccccgtc ctggaggttc cccagcccca 2401 cttaccgcgt aatgcgccaa taaaccaatg aacgaagcag cgtccacctg gtctctgttg 2461 tccgtgggcg gcgggcgctt ggggagcgga gcgggaggag ggcgccccgg ctgtctcggg 2521 gccactgctg gcgcgcaggg atcctacagc cttggcctcc caagtggtag gattagaggc 2581 ttgagccact ggccactggc aggcccctct ttttatttta atcttattaa ttttgtaatt 2641 tttttttttt ttttgagaag gagtctcact ctatcacctg ggctg // LOCUS HUMHCF2 15849 bp DNA PRI 08-NOV-1994 DEFINITION Human heparin cofactor II (HCF2) gene, exons 1 through 5. ACCESSION M58600 J05309 NID g183907 KEYWORDS heparin cofactor II; serpin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15849) AUTHORS Herzog,R., Lutz,S., Blin,N., Marasa,J.C., Blinder,M.A. and Tollefsen,D.M. 
TITLE Complete nucleotide sequence of the gene for human heparin cofactor II and mapping to chromosomal band 22q11 JOURNAL Biochemistry 30 (5), 1350-1357 (1991) MEDLINE 91120782 FEATURES Location/Qualifiers source 1..15849 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11.2" exon 1750..1796 /gene="HCF2" /note="G00-120-038" /number=1 /product="heparin cofactor II" gene join(1750..1796,6948..7852,11623..11896,13654..13798, 14527..15372) /gene="HCF2" mRNA join(1750..1796,6948..7852,11623..11896,13654..13798, 14527..15372) /gene="HCF2" /note="G00-120-038" /product="heparin cofactor II" exon 6948..7852 /gene="HCF2" /note="G00-120-038" /number=2 /product="heparin cofactor II" CDS join(6964..7852,11623..11896,13654..13798,14527..14718) /gene="HCF2" /codon_start=1 /db_xref="GDB:G00-120-038" /product="heparin cofactor II" /db_xref="PID:g183908" /translation="MKHSLNALLIFLIITSAWGGSKGPLDQLEKGGETAQSADPQWEQ LNNKNLSMPLLPADFHKENTVTNDWIPEGEEDDDYLDLEKIFSEDDDYIDIVDSLSVS PTDSDVSAGNILQLFHGKSRIQRLNILNAKFAFNLYRVLKDQVNTFDNIFIAPVGIST AMGMISLGLKGETHEQVHSILHFKDFVNASSKYEITTIHNLFRKLTHRLFRRNFGYTL RSVNDLYIQKQFPILLDFKTKVREYYFAEAQIADFSDPAFISKTNNHIMKLTKGLIKD ALENIDPATQMMILNCIYFKGSWVNKFPVEMTHNHNFRLNEREVVKVSMMQTKGNFLA ANDQELDCDILQLEYVGGISMLIVVPHKMSGMKTLEAQLTPRVVERWQKSMTNRTREV LLPKFKLEKNYNLVESLKLMGIRMLFDKNGNMAGISDQRIAIDLFKHQGTITVNEEGT QATTVTTVGFMPLSTQVRFTVDRPFLFLIYEHRTSCLLFMGRVANPSRS" exon 11623..11896 /gene="HCF2" /note="G00-120-038" /number=3 /product="heparin cofactor II" exon 13654..13798 /gene="HCF2" /note="G00-120-038" /number=4 /product="heparin cofactor II" exon 14527..15372 /gene="HCF2" /note="G00-120-038" /number=5 /product="heparin cofactor II" BASE COUNT 4477 a 3814 c 3642 g 3916 t ORIGIN 1 gggctttgca tgtgtgagaa caagacagag aatgagggag gtgggcccca cgaggagtgt 61 gggcacagac agcagcctct gcctgtggtg ccacgctgaa gactcagtat tgtatgtgac 121 agatgaaggc tctaagaaga cagctctgac aaaagctaga gtgcaaaatc agactcagac 181 acaaccaccg gtctgtgtcc tgaacacaat ggacctttac actctggaat ttctcaaacg 241 gagcaatgca 
cagacacccc catgggcccc ttgcacaccc gcagattctc ctaggagtca 301 cattctctct tcagatagac tctgggtgcc gacactccca aacatgctct tgaggagcag 361 tctctgtgat aagctgatct tccagacaat ccagaatatt cttaaaactt tttagatcat 421 aaaatttaaa acacaaatta aaaaacaaat tatcataagg ccgggcacag tgactcatgc 481 ctgtaatccc agcactttgc aaggctgaag caggaggatc acttgagccc aagagttcaa 541 gaccagccta ggcaacatag tgagaccctg tctctacaaa aaagtcaaaa gttagctaga 601 catggtggtg tgcacctgta ttcccagcta cttgcagggc tgaggtgagg aggattgctt 661 cagctcggga ggttgaggct gcagtgagcc aagatcacgc cactgcactc cagcctgggt 721 aacagagtga gaccctgtct caaaaaacac atagggccag gcgtggtggc tcacgcatgt 781 aatcccagca ctttgggagg ccgagacggg aggatcactt cactccagga gttcaacacc 841 agcctggcca acatagtgaa accccgtctc tactaaaaat acaaaaaatt agttggacat 901 ggtggtgtgc gcctgtaatc tcagccactc aggaggctga ggcaggagaa cgcttgaact 961 tgggagacag aggttgcagt gagctgagat cgcaccactg cactccagca tgggcagcag 1021 cgcgaaactc tgtctcaaaa caaacaaaca aacaaacaaa cacccataaa cacaaaatgt 1081 atcacagcct cagagatccc cacgaatgcc taagtggccc tgaatttggg aggcactgct 1141 cagtaatagt cctatctgtc ccacaacaga caggagtgct gggctgcacc tactggcaac 1201 aaacacagca acccttgact gaagaaaggt ccatgccaca atccccttat tctgtaagcc 1261 actaattttg tcctctctcc tccacctttc actgaggaac gagctcttgg aaggacaggg 1321 acacccgcct agtagctgag ccagccacat cagtcctgga gagcaggtgg agggcagatg 1381 ctgtgatcat cccagaagag aggacacagt tggaggcaga tgcatggtct ctactttcag 1441 ctaccctcaa tgcagcctgg tccccagagg cctgaagagc gccttgttta tgtggtgacc 1501 tcaagagggg ctgctcctgc accaaggcta tgtgtgcatg ctaacacagt aaccgtcata 1561 tactcaaagt gtcagctcta agaactggag atgaggagct gcaagccact ctacagttat 1621 caaaggcaca gctgaggggg tttgtgctga ccaagctggt tgcctggtgt ttggattggg 1681 acttatttac tttggaaaat atgcagcaac agcccagcac caaagttcac atcaaaatcc 1741 cactgatgac cttggctgct ttcatctctg aagcgccact tctcagaaac acagaggtaa 1801 gttgggtttc taatgtttct gctgattata aattattttt ggtgtttacg gataggcaac 1861 tggttcattt ttctagcaaa ctaagaattc agaagctttc tacactgttt tagaagtggg 1921 aaatggtttc atttttcagt gtgcctatta 
taaaattgtg tcagttccat tgttgggaga 1981 gttgacaaac ttagaatagg agctgtggaa tagatgaaaa tattgtactt atattaaatt 2041 aatcgaattg gataactgtc ctgtgattat gtatgagaat atccttgctc ttgggtattt 2101 tccctgaagt attagtatta aaggttagag gggccgggtg cagtggctca cgcctgtaat 2161 cccaacactt tgggaggccg aggcgggtgg atcacgaggt caggagttca agaccagcct 2221 gaccaacatg gtgaagccaa gtctctacta aaaatacaaa aattagctgg gcgtggtggc 2281 acgcgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt aaacccggga 2341 ggcagaggtt gcagtgagcg gagatcgtgc cactgcactc cagcctggac aacagagtta 2401 gactccgtca aaaaaaaaaa aaaaaagaag aaaaaagaaa aaatgttaga ggaacaagat 2461 ataggagacc tactctcaaa tggtctagaa gaaaaaatgt gtatgtgcat gcctgtgaga 2521 acacacacgt acgtacacac acacacagat aatgacaggg caaaggttcc aaaattttaa 2581 acctggtaaa tctcggtacg ggtatacagg agttgttcta ctacactatt ctttcaacat 2641 ttttggaagt ttgaacttac ttcaaaataa aaagttttcc aaactttagg cagttacttc 2701 tctcccattc tgcctgctct gttgggcctg gagaccatac accaggaggg atgacggttt 2761 atcaagtgtt atgctctgat gcgtgactga aaaggccaac ccagctctgg caattagcaa 2821 gaaagcacaa tatgaagttc ccaggaaaaa aaaaaagcaa aacaaacttt tgaatgattt 2881 atctttaaaa tatattgttt ctcttcaaac agtaatctgg atttaatcac aacctagtga 2941 tagtttttaa acgtcttcta caatgtttgt tatactaaat agcaaaacat caggaagatt 3001 taccttcaga tctttaattt caatccataa aagatatcag agatattttc tccttcctct 3061 ggtaagggaa tgacgaaaac tatttttggc tttttatcag ataatgtggg aacagggtat 3121 aagaagtttc caaatataac ttctgaatac cgggataaaa catgcatgtc tttactctgc 3181 cactctatct ggcctcagat acgttttcct gaatgcttat ttattcaagt tggtttttgt 3241 tttgttcttt aaccttattt ttatctgaga agaaaacatt ttcccccttt gttccttctt 3301 cttttggctt tcttttttaa aatagagatg aggtcttgct atgttgctcc agctggtctt 3361 gaactcctgg gctcaagcga tcctcctgcc ttggcctccc aagatgctaa gattacaggt 3421 gtgagcccct atgcctggtc ttcttcttct tgatcttagc caaaaggcca agaagtgata 3481 agaggaggac acttgaagtg tagttgggca aggagccttc taccagctgc ttactttctt 3541 tgttcctgac ttttaaaagt gtgttgctat tgatacacag tctcctgata tgtaaaatgc 3601 tgggaggatg aagctaagtt actcaaagtg ccattcagaa 
actgggccca gttctatttg 3661 cagctacata cattagaaat catttctaga ggctgagcat ggtaactcat acctgtaatt 3721 ccagcacttt gggaggccaa ggcaggagaa ttgcctgagc tcaggagttt gagacctgtc 3781 tgggcaacat ggtaaaaccc catctttacc aaaaacacaa aaaattaact gggtttggtg 3841 gcacacacct gtggtcccag ctacttcaaa aggctgaggt gggagggtct cttgagcctg 3901 agaggaacag gtggcagtga accaatattg tgccactgca ctccagcctg ggtgacagag 3961 tgagaccccg ccgtctcaaa ataaaaataa aaagaaatcg tttctagaaa ctgttttccc 4021 gtgtgtaaac tagtggcact gcagcctgag gcaggtgctg agatggggac ctggaaaagg 4081 caacaggcat tttgagtcag aaacaatgtg actttcctgc tccaaaatgt gcaattcaaa 4141 agtctttctt agttgtgact aaaacaaact ttgaacttac tatttcaaca gtattataag 4201 gggaagaccc aaggaatggg actggcactg ggaaaacagc taggaagctg ctctgcacgg 4261 ccagggagtc tggaagcatc ctggtactcc agagcgaaca aggctgagcg cttgatgtgg 4321 ggcttagagg cttaaccaac ttggttcgaa tctagccact gccacttatt agtgacagtg 4381 acgaaaggct cagtctcctg atatataaaa tgttgggagg atgaaactaa gttacacgaa 4441 gtgccttata cagcgtgtca ggcatccaac agaggccatt atcaacatta accacactga 4501 cagcatttca agcagagtat ccgaacagtt accccatctt caggcctact gagttcaaat 4561 atttgcttaa caagagcagc cagtaactct tacctggcct caactggcag cagatattct 4621 gggcctcaaa tatctatcta ataggaaatg gtcacagaca caaaataagc ttaacaaaag 4681 gcagtttttt tttgtttttt ttttgttttc tgttttttga gataaggact cactctatcc 4741 cccaggttgg agtgcagtag tggcgtgatc acggctcact gcagactcaa gtgatcctcc 4801 tacttcagcc tctcaagtag atgggaccac aggcgtgtgc catcacacca ggctaattat 4861 ttttcttttc tttttttttt ttttgagacg gagtttcgct ctttttgccc aggctggagt 4921 gcaatggtgc gatcttggct caccacaacc tctgcctcct gaattcaaac gaatctcctg 4981 cctcagcctc ctaagtatct gggattacag gcatgcgcca ccacgccggc taattttttt 5041 gtattttttg tagagacagg gtttctccat gttggccagg ctggtctcga actcccgacc 5101 tcagatgatc cgcccacctc ggcctcccaa agtgctggga ttactgacct gagccaccgc 5161 acccagccta tttatttaat ttttcacaga gatgaggtct tgctatgttg cccacactgg 5221 tcttgagctc ctgggctcaa gtgatcttcc tgccttggtc tcccagtgtt gggattatag 5281 gcgtaagcca cagcgcctgg ccggcagttc tttctggggt gattagaagt 
tgggaccatg 5341 tattacctgt ctgagtcagc attataaaca cctatggtca ctgtcctggc aaaacatgga 5401 atcatcaaag ctcatctaac cagagtgcag ttaataacca ggaagtaagc aagagaaaga 5461 caaaggattt ggcagtcaaa acagatttga caggccaagt cagatcctcc tctgaacgag 5521 tcagaggaac aaataaagac aggattgcca taatgcctct gtgctaaaag cttatcttgt 5581 ttacttaaat aaagggagtg cccctcaggt cttgagtaag agcttgctga catcaccctc 5641 acacagactt tatctcttgt ttctaaccct gtgttagaag cagtaacaca gaagatttag 5701 ttgctcctga cagcagtggg agctattgtc taagagatac aaaggagaaa aaagtatacc 5761 tgcagcaagt gatatcacct ctggggctgc caccacatca cctcactacg ccctgagggg 5821 gtctcagcac tagacaagtt ccaaatcttt tgcaaattaa acaaccccag gtcaggcgtg 5881 gtggcttatg cctgtaatcc cagcactttg gggggctgag gtgggtggat cacctgaggt 5941 caggagtttg agaccagcct ggccaacaga gcaaaacccc atctctacta aacaaaatac 6001 aaaaattaac caggcgtagt ggtgtgcacc tgtagtccca gctacttggg aggctgaggc 6061 aagagaattg cttgagtcca ggaggccgaa gttgcagtaa gccgagatcg cgccactgca 6121 ctccagcctg ggtgacagag tgagactcca tttcaaaaaa taaaaacaac aaaagccaat 6181 tacaacaaca acaacaaaaa aacaacgaat taaacaaccc caaagattgc acaaatttca 6241 agtatcttta gaatatgttt tcagaaagcc tggcccatgg acatttttca acagcatctc 6301 cattgcaaag gtggaatggt gtgagtcaca caggcatggc tgagtcccac taatgcacat 6361 cccttctagg tactctccaa tcaccagccc caggtgccca ctcaagccca gctcttagtg 6421 aggtttccct gactctctgg gcacttccac tcctaccaca cagggtagag ccacacccct 6481 ttccgtaccc ccatgtgctc tggcagcatt attttgagag ccttcgcttt actgcacgtc 6541 tgtcccatct gtcccctgac tggtccatga gcccctggtg ggaactttgt ctctggtaac 6601 taaacactgt ctggaggtgg tggacaaggt gtctggagaa aaacaaactc ctccctggga 6661 tgcctgagct cccaggattc tagaaggtta gttttgcaaa cctttaaaga agggattttc 6721 atcaaggggc ccacagatcc ttcattgagg tttatgagtc ccacatcaaa ggttgggtgt 6781 ctatctacat cagattctct taaagtccat gatcctaaaa cagttaagaa ctaatgctgt 6841 gagggcctct tcctgggtca aagccacagg gaacctgcca tgtggatgct gcagcggggt 6901 gtggatcagc caggccgcct ttcactgtgt tctgttttcc ctcccagctt tagctccgcc 6961 aaaatgaaac actcattaaa cgcacttctc attttcctca tcataacatc tgcgtggggt 
7021 gggagcaaag gcccgctgga tcagctagag aaaggagggg aaactgctca gtctgcagat 7081 ccccagtggg agcagttaaa taacaaaaac ctgagcatgc ctcttctccc tgccgacttc 7141 cacaaggaaa acaccgtcac caacgactgg attccagagg gggaggagga cgacgactat 7201 ctggacctgg agaagatatt cagtgaagac gacgactaca tcgacatcgt cgacagtctg 7261 tcagtttccc cgacagactc tgatgtgagt gctgggaaca tcctccagct ttttcatggc 7321 aagagccgga tccagcgtct taacatcctc aacgccaagt tcgctttcaa cctctaccga 7381 gtgctgaaag accaggtcaa cactttcgat aacatcttca tagcacccgt tggcatttct 7441 actgcgatgg gtatgatttc cttaggtctg aagggagaga cccatgaaca agtgcactcg 7501 attttgcatt ttaaagactt tgttaatgcc agcagcaagt atgaaatcac gaccattcat 7561 aatctcttcc gtaagctgac tcatcgcctc ttcaggagga attttgggta cacactgcgg 7621 tcagtcaatg acctttatat ccagaagcag tttccaatcc tgcttgactt caaaactaaa 7681 gtaagagagt attactttgc tgaggcccag atagctgact tctcagaccc tgccttcata 7741 tcaaaaacca acaaccacat catgaagctc accaagggcc tcataaaaga tgctctggag 7801 aatatagacc ctgctaccca gatgatgatt ctcaactgca tctacttcaa aggtaagagg 7861 cacctttaca gttctcacag caaacccaca acatactatt tttgtatgtg ggtagattga 7921 atgccaagaa ctgtactgta gctataattt atccaggaaa actagacaca agattgactc 7981 tggaacgggg acagggaagg ccaagctgaa gtgacagtag catctgacac ttactgagcc 8041 ctaactctgt gctttaacac agccttgtga ggtcatcact gttattagca tccccatttt 8101 acagaggaag ccaccaacac atgaagtaaa aggatgggct gggcgcggtg gctcacgcct 8161 gtaatcccag cactttggga ggccgaggca ggcagatcac ttgaggtcag gagttcgaga 8221 tcagcctgac caacagacca acatggtgaa aacctggctc tactaaaaat acaaaaatta 8281 gctgggcctg gcggtgggtg cctgtactcc cagctacttg ggaggctgag gcaggagaat 8341 cacttgaacc tggaaggcag agattgcagt gagccgagac tgtgccactg cactctagcc 8401 tggacgacag agtgagactc catctcaaaa aaaaaaaaaa aagaagtaaa acgatgctcc 8461 aagggcaccc agttattaag gggcagagcc aaagctgaac ccagggaggc caaccctagc 8521 aatctgttaa attggaagaa ataatacaaa aactgtttta gcatttggcc agcctggatt 8581 tgagttttct cttttccttt cccaattatc aataagcagg aatatagaca aaaggctaaa 8641 gaaatgcacc tgtgaactat tcagcttgag cagctgacat tgacacctac aagtgctttt 8701 
caggatactt ttgaactact gggcaggtgg gatggagaaa taaattacta tttccccagc 8761 aactgttctg ggctgagcac aagggcactt tttaaggagg tcaccccaca cccatcacac 8821 acacatagga cccctggaat cctaggaata aataagcatg gatttgtaaa atccaaacct 8881 ctcttttcaa atatcctcac ctggaccaga ccagaagaaa cctctacttt actctctaag 8941 ctgagagtgt ggaaggggaa acacgaggaa tggttcggct tcaggactaa ttgcggtgac 9001 acacaaccac ttctctttgc caccaaggac taccaggtac ctgcaaaggg cagtacttgg 9061 aggccagtgc tttctgctag ttagctcccg tggttttata gcagcccagg cgaaggaagg 9121 agaccccccc cagctcctgg cttctgttca gggaaagggg gccagagccc ctcctgatct 9181 gtccacacac ctgctctgtg ccttggctga ggcccctgca gctctacaag gcaggcattc 9241 tgctggatag gccaagcagg gtcactctga cacccaggtt tccaccccaa ggcatggcac 9301 aatgctggcc tcctgtgggt ggaatcaaag gctgagttct aacaggcttg cggcagacac 9361 acacacagag accacatgta catgatgaac acacatatcc ttttcattac aggttattag 9421 tacaagtttt ggaattgagc aaacaagagt ctaagcgctg gtttcaccac ttctcgtttg 9481 tgtgacctca gacaagtcat tcaacatctc tatgactcag tttccttatc tttatcacag 9541 agatgacacc cactctgaca gggccgaggg aagaaccata agcgatggca atgcaacaga 9601 gtggcacatg acaagagctc agcgaatttg agggaatgaa actgtagatt acaatactag 9661 tacaatatga taaacatatg atattgttag tgacatttat tttacttcta ctagcaaata 9721 acctatgttt aggactgact ttagaacagg ctggcagaag catttttggc agcatcaaag 9781 tcctccaacc tactggtctg ttggagcccc ccaagtacac caaagagcct ctgcattagc 9841 cctggctgag ggttcaggga caggcagaga agtacagcag tgagccatcc ctgcctgcat 9901 ggaggtggag aaatgatcag gcatggtcag ttgacaatct cctaaacaca gtaacccgtg 9961 tcataccaca gtgtaaacac acgtgcaaat gcttctgctt cctttcccca tcatgagaat 10021 agtcactcaa tgccgggcat cacaagggat caaatgctag gagtacccaa tcattcatgg 10081 atgcttctca aaggggacga gtgtctagaa gtgtaatttt aatttcactt aatttcatat 10141 ggaatcatct ccattactaa ttttgttcta attttaatgt gataatcact ttgtaaagca 10201 caataaacag aggcaggctc tcatgaggaa gtcagaagga aagaatccca agagacatgg 10261 gacagctcca tccaaactga aagggccgtg attcccaaaa gagcaatttt gtccccaagg 10321 tctgaagaca cttttggttg tcacaacctg gggggttgga gtaagcatta ctggtatcta 10381 
gaagggggag gctggggatg ttgctaaaca ccctaccatg cacagggcag cccacattgc 10441 cacaaactat tatgtggccc aaatgtcaaa aatgctgagg ttgagaaacc ctgggtgagg 10501 cagactcagg gagaagggaa tcgagcttca ctcacaggca ggcaggagct gtctggtact 10561 tcaacctcca agacacctcc tgctcatctc atcctggctg ctctacccac cagctagaaa 10621 ccttgaacaa gttacttcac ttctttgtgc ctctgtttcc tcatatgtaa aagagggata 10681 acaaaacgca cacaacttgc atgttgctag gagcagaaat gagataatac aggaaaggtg 10741 ctgagaagaa tgcccggcac atggccagtt ctcaactact agtcacccat tactattagt 10801 tactcacatc ttagagctaa catagacatg ggcttattcc tggatacaca gcactgtccc 10861 catatctaca gtggtgatcc taagggcaac atggcatcac ccaaatgtct tgttagtcac 10921 tacagaatca cagtgtgagg gatgaaggcc atcaagacag agctgaggct ggcagggtgg 10981 ctcatgccta taatcccagt gctttggaag gctgaggcag gaggattgct tgaggccaag 11041 ggtttgagac cagcctaggt aacatagcaa gaccccatct acaattaaaa aaaaaaaaaa 11101 aaagacagaa agaaaaaata gccaggcgtg gcatgtgctt gtagtccaag ctactgggga 11161 gggaggctga ggcaggagga ttccttgagc ctgggagtgt gaggctgcag tgagctatga 11221 tggcatcgcc gcactccagc ctgcatgaca cagtgagacc tggtctcaaa aaccaaataa 11281 taataacagt aataaaagct ggaaagagct caaagttact catttgacag atgtgacaga 11341 tgaagaaata gaagcgagtt aggtgcctta ccatggtcaa acaactagtt cgtatcagac 11401 cctactccag aaactattcc agtccgggta acctctcgtt aacctctctt gttagaaatg 11461 caaatttctg cccaaatcag gcctcaggaa tcaagagact gtggggtcgg ctctgcaggc 11521 tatctgaatg aggcctccag ggaaatcaga ttcactctca agggtgagac gatttcccta 11581 aaggaacctt ctcataacag cctcttcctg tggcctttac aggatcctgg gtgaataaat 11641 tcccagtgga aatgacacac aaccacaact tccggctgaa tgagagagag gtagttaagg 11701 tttccatgat gcagaccaag gggaacttcc tcgcagcaaa tgaccaggag ctggactgcg 11761 acatcctcca gctggaatac gtggggggca tcagcatgct aattgtggtc ccacacaaga 11821 tgtctgggat gaagaccctc gaagcgcaac tgacaccccg ggtggtggag agatggcaaa 11881 aaagcatgac aaacaggtat ttcacactgt gtgtttgttc ttttgagctc ccagatgctg 11941 ggggtgtctg ggaatactgg aaaatggatc atttttttaa aaagggagaa ttatgtacaa 12001 gtacccaaga acttccatac agggccactc tgttaattca gccccaattt 
gttgcttgag 12061 ataagagatg attagagagc attcataagg gacacatctg ccctctaggg gccagtttca 12121 gaagttagag gcagatgact tagagacagc ttggtgcttg ctttgtggct tcgagtccca 12181 gcttcatcat ccctaaaatg ggtataattc cattacttcc ccgggtcact tgagaaaata 12241 acagaatcag cgatgctgag cgcccctccc agtacttgga acctaggagg cactcaaaaa 12301 aagattggct caactcttcc ctgcccagga aattccaagg tcctcttagc ctaccgagga 12361 cacatcattc atgatttcct ctattattat tcgttacttt gtagttaaaa ctgcaggtgt 12421 taagtactta ttgagattat tattgggtca tggcagaaag aatggagagg tcttatttct 12481 gtcttactgg atactggcta ggcccatatg aagaagtgat tctggtttga acctccttat 12541 aggacaagaa tacaaacata tgcaaccaaa ctgagaaaag taggctctca gaggaaggta 12601 tttgcccggg tagccagtca tcatgctctg tgaatttttc cttaacaacg tcccttctgt 12661 acctgcctcc ttccattcct ccctgcagcc cggcagctct tgagaaaggg actgcatctt 12721 tttttttttt ttttttttga gacagggtct tgttctgtca cccaggctgg agtgcagtgg 12781 catcatcatg gctcactgca gcctcaacct cctgaactta agtgatcctc tcacctcagc 12841 ctcctgaata gttgagacta caggcgtgca ccttcatgcc cagctaatta aacttttttt 12901 ggtagagatg aggtctcgct gtgttgccca ggctggtctt gaactcctgg cctcaagcag 12961 tcctcctgcc ttggccttcc aaagtgctgg gattaacagg cgtgagccgc tgtgcctggc 13021 ccatttgact tttaattgag atcttacttg gtgcaaggta tgagctaggt aaaagagtga 13081 agaagatcaa gccttcctgc ccatccagct gggattgcac cttaaatctc tttatcccct 13141 gcaaagtgcc agactaactc cacaggcact actgttgcta tccgccccct tagggattga 13201 gtaagttgag gcaaagattg agatattcag cattgtctag tatatacagg aaaggttctt 13261 tttaaaagta cactaccaga tattcgactc cttaattaca aaaaaaaaac caaatgccta 13321 aaattgggaa accaaaccag agaattattt tagatgcctt tttaaaccat aaaccaggaa 13381 aagttctgct gctaaccttg aagataggaa acgaaccata cagtctcaag gaaataatca 13441 tgcaacagaa aacacacctc agttttcagt agcggaatta caaaggagtg tgcttcctaa 13501 aatcctcaac tgacagtccc ggaatataaa ttttaataag tgctatatca attctgtgat 13561 aaatataacc cgtggccctt taaagggaaa atcatgattc ttttgtaact tgtggttcaa 13621 taaaactggg cccccctttc cttttctgtc tagaactcga gaagtgcttc tgccgaaatt 13681 caagctggag aagaactaca atctagtgga 
gtccctgaag ttgatgggga tcaggatgct 13741 gtttgacaaa aatggcaaca tggcaggcat ctcagaccaa aggatcgcca tcgacctggt 13801 aaccactccc ttgtccaccc ccgacccgtc cccagggtct gcctcagcac agccccacct 13861 ccacttgccc ttcctaccca ccccccaatc tcatgtccca gcttggggtg ctgagtctgc 13921 tcttcggcct gggtgggata cacagaatgc ctagtttcat ggatgccagc tggagagcac 13981 ggcacctggc agacacttac tgggcagggg ggatcccaag agcagccatg gggtgagccc 14041 cactcccgct gacaccagag acaggggaga catgtgctgc ggtctgggaa atagctaccc 14101 ccagccaaat catgaaagag ccattaaaca ccgcactata caacatactt aacttaaacc 14161 aatcgggtcg ctcagcaaaa gagagagaac accagtccaa acagtgcagc agacccagtt 14221 ccccatcccg gagaagtgcg cagcagtgtg gggagctgga gctggggtgg ctgtcctgca 14281 ccagccccca cgaccctcag accacaggca ctgccaagag ggaacatgaa cctagccggc 14341 ctctaagtgc aacggctgcc cctgacaggt ggtgacagat attttcaaga gtgactctga 14401 ccagctgtga tttccacctt acatgttgtc tttggatcct ttccctgaat gatatgagat 14461 tgtgctggga actctagccc tctgtgtgct gacctccaga atctgacaac tttcctttcc 14521 aaacagttca agcaccaagg cacgatcaca gtgaacgagg aaggcaccca agccaccact 14581 gtgaccacgg tggggttcat gccgctgtcc acccaagtcc gcttcactgt cgaccgcccc 14641 tttcttttcc tcatctacga gcatcgcacc agctgcctgc tcttcatggg aagagtggcc 14701 aaccccagca ggtcctagag gtggaggtct aggtgtctga agtgccttgg gggcaccctc 14761 attttgtttc cattccaaca acgagaacag agatgttctg gcatcattta cgtagtttac 14821 gctaccaatc tgaattcgag gcccatatga gaggagctta gaaacgacca agaagagagg 14881 cttgttggaa tcaattctgc acaatagccc atgctgtaag ctcatagaag tcactgtaac 14941 tgtagtgtgt ctgctgttac ctagagggtc tcacctcccc actcttcaca gcaaacctga 15001 gcagcgcgtc ctaagcacct cccgctccgg tgaccccatc cttgcacacc tgactctgtc 15061 actcaagcct ttctccacca ggcccctcat ctgaatacca agcacagaaa tgagtggtgt 15121 gactaattcc ttacctctcc caaggagggt acacaactag caccattctt gatgtccagg 15181 gaagaagcca cctcaagaca tatgaggggt gccctgggct aatgttaggg cttaattttc 15241 tcaaagcctg acctttcaaa tccatgatga atgccatcag tccctcctgc tgttgcctcc 15301 ctgtgacctg gaggacagtg tgtgccatgt ctcccatact agagataaat aaatgtagcc 15361 acatttactg 
tgtatctgtt ataattctct attttttgaa gctcaaatat caaaagccaa 15421 atccaaattc ctggataact ccaggtatga taaaggctga gaggaagtca cttgagcacc 15481 acaatgtgcc acagcagggc atgttctcag gacaggacag gtgtgtgctg aatcctgggg 15541 agggtctgtg cagtacccca gaactgtggg gtgctaagtg gcacacaagc cccagggctc 15601 ccacagtcta tgccaggctg ctgcagcttt catccctcat acctggtcct gcagtgggtc 15661 tggtttgaca gagcagatga cacctgagga atatgtttct ggatccttca atccctgggt 15721 aagacaagtg aaatccacag aggctgttca gcacgcaaga gtgccagtgc tctttcagtg 15781 aggggatgac tgacggtcac aggtgctgtg tgtgcaggtg tctaactgta accccacagc 15841 ctggcagat // LOCUS HUMHIAPPA 7160 bp DNA PRI 15-JUN-1990 DEFINITION Human islet amyloid polypeptide (hIAPP) gene, complete cds. ACCESSION M26650 NID g184047 KEYWORDS Alu repeat; islet amyloid polypeptide. SOURCE Human fetal liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7160) AUTHORS Nishi,M., Sanke,T., Seino,S., Eddy,R.L.Jr.., Fan,Y.-S., Byers,M.G., Shows,T.B.Jr.., Bell,G.I. and Steiner,D.F. TITLE Human islet amyloid polypeptide gene: Complete nucleotide sequence, chromosomal localization and evolutionary history JOURNAL Mol. Endocrinol. 3, 1775-1781 (1989) MEDLINE 90114181 COMMENT Computer-readable copy of sequence [1] kindly submitted by G.I. Bell, 02-AUG-1989. 
FEATURES Location/Qualifiers source 1..7160 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 555..>7160 /note="islet amyloid polypeptide (hIAPP) mRNA and introns" intron 659..988 /note="islet amyloid polypeptide intron A" gene 1004..1083 /gene="Iapp" CDS join(1004..1083,5892..6081) /partial /note="islet amyloid polypeptide (hIAPP)" /codon_start=1 /db_xref="PID:g386771" /translation="MGILKLQVFLIVLSVALNHLKATPIESHQVEKRKCNTATCATQR LANFLVHSSNNFGAILSSTNVGSNTYGKRNAVEVLKREPLNYLPL" exon <1004..1083 /gene="Iapp" /note="islet amyloid polypeptide (hIAPP), (first translated exon)" /number=2 intron 1084..5891 /note="islet amyloid polypeptide intron B" repeat_region 2457..2773 /note="Alu repeat element" exon 5892..>6081 /note="islet amyloid polypeptide (hIAPP)" /number=3 repeat_region 6450..6740 /note="Alu repeat element" BASE COUNT 2300 a 1199 c 1183 g 2478 t ORIGIN Chromosome 12. 1 ggtacctaga ataatcccta ccacagagta ggtcttccat tactcttatg cttttaaatc 61 tcccctcacc tcattgtaaa tgacttttga tttctctttt atgccctttt tatacacctt 121 tcccttatat ctccatttat tcctgaagct tcatgggatt cagccattga ggtcacttgg 181 gtttagatat accaaaagtc tgtgatttct ctgtttgcat atatgcacat ttgttgttat 241 ccttaccctt ttctatcagt tccttaccat aacatacact taattcttgg aaatttactc 301 atgtcttaca aagatggcaa attcaaactt ctgctgtgta tgacacacca ttaactgcac 361 aaggacactg tgtatttgct acgttaatat ttactgatga gttaatgtaa taatgaccca 421 tccgcttctg ctgcctgtga ggtactttct atctataggg atggaaatta atgacagagg 481 ctctctgagc tgcctgatgt cagagctgag aaaggtgtga ggggtatata agagctggat 541 tactagttag caaatgaggg ggtaaatatt ccagtggata caagcttgga ctcttttctt 601 gaagctttct ttctatcaga agcatttgct gatattgctg acattgaaac attaaaaggt 661 aaagaatttc ctatttctgg gaaagtttta tttatttaga gaaatgcaca cttggtgtta 721 aattcatggt ttatttcaaa gaaaggctaa agggagaatg tattacaata taaatgttca 781 gattgcttag agaaggaaat tgggaaagta aaaatctcga aattacttga aaagtggaca 841 atattaaggg actgtatcaa taaaattttg atccttgtaa attacgtttt aaaagatgtt 901 tcttttaaaa actaagctct aatttaaaat tacatcaatt agaactgtaa 
gaaatctctt 961 gatttcagtg ctggattatt ctttgcagaa aatttgagaa gcaatgggca tcctgaagct 1021 gcaagtattt ctcattgtgc tctctgttgc attgaaccat ctgaaagcta cacccattga 1081 aaggttggta acgtttaaaa tcctgtttct ttgtaacttt tgtaaagtgt gagaaaatta 1141 gcattaaata ctgtcaaata actacagcct tagatttctg actatatcat acttaagaca 1201 cagtaccttc agctattcca ttgttccatg aattctgtgt tctttaaaga ataacaccag 1261 tggcaaataa atatctttga tggaacttct gacagacagg aatggataat tccagttttg 1321 tcaagaaata tactcttgga acttagaggg gcaaagccag aacatgaagc gggaaaaaaa 1381 tcaaaaggta gtaatttctt ctatattaac ctgatactga aacaaaccag agaaacttaa 1441 ctaaagcata tatttttata ccaagtggat tctttttgta tatattactt agatttttgt 1501 tttcctcaga tgtctctgga aatgttaaaa acttttacat cttgtggaaa tggaaatgta 1561 tagaaataat caggagcaaa ttaatgtttt taagaaatga aatctaaaag aagtagttaa 1621 aaagccattt gctgttgggg gatttattct atattgctag ctagctattc agtgagtgaa 1681 acagatttat aaaaagttat tcttcttatt acttctaagc tgctataaac ttaatacttt 1741 ttaaaattac tttcagtagg tagcatgtat gtcaggattt cctgggaagt cttattacga 1801 aaggtttcat gtcatttaaa tggtaattaa ggacatctaa caactatgtc acgtaaaact 1861 cttagagtag ttaaaatttt caaactgaga ttttaaaact gtaatttatt taaagggtta 1921 ttaagttcaa atatgtgcat aagtcataaa taacatagtg aggatttgtt tgtgcctaaa 1981 ttagttttgc tccatatagt cttatgggac tgaacttaca cactctttaa caccaaggag 2041 aattaagttt acctttgtaa agagtgtgca tgtcatatta taattcttct cattagaatg 2101 atcgtcatct tgtctttgtt ttccttcgag gtagtttttc ttggaagccc atagcaatat 2161 gcaaagattt ctacagcacc tacgtataat aaatagcaag aatcattatc agaggctttt 2221 tgtcatttca aggcttattt agtttacagg gtgttcttct cagaactgac tgtaattttc 2281 tatttgcttt ttcataaaaa taacttttaa atttacatga agtttctgat aagcagaata 2341 tctgaatgat gacaggaaaa ttagtagtat ttcctagtat atcgtttata tcttgatact 2401 ttctttcaat agatatagaa tatttactaa gcacctttta ccctctcttt tttttatttt 2461 attttgagac agggtctctc tctctctctc tctctgtctc tgtcacccag gctgctggag 2521 cacagtggta caatcatggc tcactgtagc cttgacattc taggctcagg tgatcctccc 2581 acctcagcct cccaagtagc tgggaccaca ggcacctgcc accataccca gctaattgtt 
2641 ttatctttat tttataggga aagggtctcc ctatgatgcc caggctggtg tcaaactcct 2701 gggctcaagc cttcctcctg cctcagtctt ccaaaattct gggattataa gagtgagcca 2761 ctgaacccag accattatgt ttttatagat gtttgtttat tatgagagaa acttcactta 2821 gaaatagagc aatatgtaat ataatattac ttgttataaa attattttga tgttagtctc 2881 acaatcttta actttgaatt attagaaatc ttgtaaaaca ttcttcaaat tgctttttaa 2941 tatgttgcct gaaatgagta tgtttgaaca tttgttaaag ggagtatgat ttgtcatgct 3001 gagatgttaa atcatgtact attctacata tctcacagaa agctagcaaa atctatgggg 3061 aaaatgtgtc aaattttaaa ctctttttaa aaaataaaac taacattatt caatgtcatt 3121 ttcctcacaa aatttaatca tctcatttga gattttttca atttgtaaat gtatgaaata 3181 ggataaaagg atcacatact ttcccaccaa cttttttaca ctcccttgta aatatctgcc 3241 tggcaggtaa tcaaaggata gttaaaaata taattacata gatgccaaga tgcaatcact 3301 aggatctccc tgcaggagct cacatacttc cacagatgaa tgttaaggcc gagagcaggg 3361 actcacattt aaagtcattt tgaaaactct ggagagacaa tttaaaagag aggcaattta 3421 agagttatac tttggcttat tgtcatctct gtttaaactc tcttaaagtc aagaatttcc 3481 atgtgtgtat gtgcctgtaa gtggtctaca gctttaatgt ttgttactag ctcgtatgtt 3541 acctgtccag gtagtcaatg agaaaaaaat gcctgaaacc agggaggtaa tgccttttat 3601 taaccatttc agacaacttt ttccatccta aagattgctt tagatagaat cttatatata 3661 ctgaatagta tatttagatg aaaagtcttt tttagaaaag caatttcaca aatatgataa 3721 aaacataaat gcttttacta tttcttctaa gtggaatgat ggcccatcta gctaactcaa 3781 ataaggtaac attttattta gaacaatttt aaattatatt attgaccttc caaccaatta 3841 tcaaaatacc actcagcatt tagcatataa agtatttcac agtgtgcttc agtcatatgc 3901 taaacatatc ttggaacaga tattaccttt gaatcttctc aatttgaccc ataattttcc 3961 tttattactt tttttgagat gtttggacca aattccaatt tttactgttt ttcaagaaaa 4021 gtaagtattt tagaattcaa tgcaaatgta tgaaattact agttcaatcc ttaaagcata 4081 aatcactctt ttgaaatgta cattggtcat atttatggta ccttcaaaaa taaataattg 4141 aacagatagt gtgaatgaga ttgatatagg ttaaataatt agatcccaaa gtggttttct 4201 ttgcccagat aatttgttca aacatttgtc agcatacact tacattcaac aagtatccag 4261 ttcacctaat gctgtaagaa gttttctgta cttaggaaga aatatgggag taaaatttaa 4321 
aaaaaaaaca gtttcacatg agattattaa atatttactc ttaggctatc tctacttaga 4381 gatagagata atgaaatact ccccacacaa ggtaaacaca atgagataaa tccattgcat 4441 ttgagtccca gattatgcat atccactggc tcctggacat tgagttttta gccctataac 4501 tatttcattt tccatttacc ctaagtttca ccaatatttt gatttctatg gagctgaaaa 4561 ctaaaacatt tctctaactt tcctaataat cagcaaagag gaagcaatgt tattattctg 4621 catccatttc cgatatcgtt ttaaaagcac attgaaacaa aaggctgtca aaaaaataga 4681 gttggtatac aaataaatgt cttaaataaa aacataagtt aaaattaaat gaattattta 4741 atgtgtggtt atgatttctg agtttataag tattattatg cacttcttcc aggtggctag 4801 aaaaatgtga tgaatattaa taccattgac ataaaaagtc ttttggtttt aacatttaac 4861 ctagtctatc attaaaattc ttgaaagcat aagatccaag caggaaaatg tatttatgct 4921 aaaagtaata aaactctcac actgcaatag agtacctgaa caggtgatag atttgattct 4981 tttggagact ttatgatatt ctcttttttt gacatacttt ttatgacatt attttttact 5041 ttattatatt tcattttatt gttttaagaa caaagcatga tatctaccct tttaacaaat 5101 ttttaagcat gcaatacatt attctggatt atgtgcaaaa tgttgggcag cagatctcta 5161 gagcttagtc atcttgcttg actgaagctg tacacccaat ggttagtaac tccctatttc 5221 cccctctccc ttgcccctga taaccaccat tccactcttt aactcatgaa tttgactatt 5281 ttaaatactt catatacatg gaaccaagtg gtatttatct ttctatgact agcttctttc 5341 actcaaccta atgtcctcaa ggttcatccg tgtgttgcat attgcagaat tcccttatga 5401 catttcttgc ataacactcc tgattcaatt atctcaagga acttaaagac taagtaatgc 5461 tgctttattc ttattggaaa gatgtagaaa taattatttt taaatttctt catatttcag 5521 attacatata aattttacct tctaaattct ttttatatat taaaaataaa ttcttcaaga 5581 tttttaaaaa tgtaagacaa agacactgtt attttgatta tatgtaatat attctgaatt 5641 tccaaaggaa gacttttaac tgagaaatgc aacattgact gtaatgaaag atgttgtatg 5701 attttcaatt gttatttcaa ggtgtcaaaa aaaaatctca gccatctagg tgtttgcaaa 5761 ccaaaacact gagttactta tgtgaaaatt gtttctttgg ttttcatcaa tacaagatat 5821 ttgatgtcac atggctggat ccagctaaaa ttctaaggct ctaacttttc acatttgttc 5881 catgttacca gtcatcaggt ggaaaagcgg aaatgcaaca ctgccacatg tgcaacgcag 5941 cgcctggcaa attttttagt tcattccagc aacaactttg gtgccattct ctcatctacc 6001 aacgtgggat 
ccaatacata tggcaagagg aatgcagtag aggttttaaa gagagagcca 6061 ctgaattact tgccccttta gaggacaatg taactctata gttattgttt tatgttctag 6121 tgatttcctg tataatttaa cagtgccctt ttcatctcca gtgtgaatat atggtctgtg 6181 tgtctgatgt ttgttgctag gacatatacc ttctcaaaag attgttttat atgtagtact 6241 aactaaggtc ccataataaa aagatagtat cttttaaaat gaaatgtttt tgctatagat 6301 ttgtatttta aaacataaga acgtcatttt gggacctata tctcagtggc acaggtttaa 6361 gaacgaagga gaaaaaggta gtttgaacct tggtaaattg taaacagcta ataatgaagt 6421 tattcttgac atgagaaaat cagtaattgg accaggcgcg gtggctcttg cctgtaatcc 6481 cagcactttg ggaggccgag gcaggcagat cacaaggtca ggagttcgag accagcctga 6541 ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa ttagccgggg gtggtgacat 6601 gtgcctgtaa tcccagctac tcaggaggct aaggcaggag aatcgcttaa acccaggagg 6661 cggaggttgc agtgagccga gattgcacca ctgcactcca gcctgggtgg cagagtgaga 6721 ctcgtctcaa aaaaaagaaa gaaaattagt aattgtaagt acccctgata agcaaattag 6781 taattgtcaa tacccctgtt aagcaattcc tttttgcagt atatttctga aatgacagaa 6841 tgctgtttta aaaacaaaga aataaaatcc tgctcctgac tcggtcaaaa tattttttaa 6901 agtctattgt ttgttgtgct tgctggtact aagaggctat ttaaaagtat aaaactgctt 6961 tgtatccatg agggtttcat tgtgtgttag cagcagtgag cttctattaa atgtatatgt 7021 catttatttt gtttaagtgg ctttcagcaa acctcagtca tattcttatg cagggtattg 7081 cgaaacaact tgtgttctat taatcgtgtc ttcaattaaa agaccacaga cttctggaaa 7141 ctctttgctg tataagaatt // LOCUS HUMHIS102 7550 bp DNA PRI 09-NOV-1992 DEFINITION Human histatin 1 (HIS1) gene exons 1-5, complete cds. ACCESSION L04132 NID g184051 KEYWORDS histatin 1; salivary protein. SEGMENT 2 of 3 SOURCE Homo sapiens (individual_isolate individual J.F.) (library: lambda Charon 40) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7550) AUTHORS Sabatini,L.M., Ota,T. and Azen,E.A. 
TITLE Nucleotide sequence analysis of the human salivary protein genes HIS1 and HIS2, and evolution of the STATH/HIS gene family JOURNAL Mol. Biol. Evol. 10 (3), 497-511 (1993) MEDLINE 93330039 FEATURES Location/Qualifiers source 1..7550 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1375..1432 /gene="HIS1" /number=1 intron 1433..3999 /gene="HIS1" /number=1 exon 4000..4063 /gene="HIS1" /number=2 CDS join(4013..4063,5258..5278,5357..5386,6434..6505) /gene="HIS1" /codon_start=1 /product="histatin 1" /db_xref="PID:g184054" /translation="MKFFVFALVLALMISMISADSHEKRHHGYRRKFHEKHHSHREFP FYGDYGSNYLYDN" sig_peptide join(4013..4063,5258..5263) /gene="HIS1" intron 4064..5257 /gene="HIS1" /number=2 exon 5258..5278 /gene="HIS1" /number=3 mat_peptide join(5264..5278,5357..5386,6434..6502) /gene="HIS1" /product="histatin 1" intron 5279..5356 /gene="HIS1" /number=3 exon 5357..5386 /gene="HIS1" /number=4 intron 5387..6433 /gene="HIS1" /number=4 exon 6434..6538 /gene="HIS1" /number=5 BASE COUNT 2427 a 1285 c 1184 g 2651 t 3 others ORIGIN 1 ttttatttta ctttcctatt ttactttaag ttctgggata cgtgtgcaga acatgcagat 61 ttgttgcatt ggtaaacgtg tgccatggtg gtttgctgca cctatcaacc catcatctag 121 gttttaancc ccacatgcat tagctatttg ccctgatgct ctccctccct nncccaccac 181 caggccctgg tggcgttgtt tccttccctg aatcagacac aaatttaaaa cataatttct 241 aattagtggg atcaagctaa agaccatgcg tttataaatc tcttaagtag tttttctatc 301 tttaaagtac acaaagagtc tcctggacat gttcttaaaa ttccggttcc aggtgtacac 361 cacaagattc tgactcagta agtttggaaa agtagaataa gaatctgcat ttcaacagct 421 actagctgat gctgatcatg atggtcgcat gtggcaccct ctgaggaaca atggactggg 481 ctgtataact tatctgaact ttagaaaaga gtttttaata ctcagttctt agcacatcaa 541 attctccaac cggtagagag taggtgaatg cttatacatg tttgcctatg aattaaatta 601 aaattctgtt tttctctttc tgtctaatat agttatttca acaaatggcc tctataaata 661 tatatgaacc tcagaaaact ccagggaagt aaattcaaat ttctaaacat aagaaatcag 721 ttttatgtga gagatatgtg accaagcacc tagaatcttt tgacattaaa tatgagaatg 781 tttttatcat atgatatcca aaagatgatt gaattttggc 
aaatgtttac attcacttta 841 ttacttaaaa cactcaaagt caatgtaaag gtaaaaatac aagataaata atgatttgta 901 ggaaaacttt tgaagtatag aatgactcca attttattat gccaattatc actatttact 961 agagaatgtg taaaattttg ataacatggt aagtatattt ttgacatagc gttggttcca 1021 tcttggacat atatttgttg aaaagtagta agatggtatt ctacagaaac atctttatca 1081 ctgcaatcat ttctatgcct ggtttcctcc agcgtgcttt tccgttgtct gactcactca 1141 gggctagact aacactggga ttagcatgtg atgggtccat tcgctttccg gttgctttgt 1201 cttcacaatg atctattgta aaatcacctg gttaagttta tttttagaat ttgtagagac 1261 agattttggg cattactttc cgtctcaatc atatgatctc ctaattgatg ctatttagaa 1321 aaacaaaagg gagatttcaa cgtgtttaaa tacatcagcc atatagaaaa ggacatctct 1381 tgagacttca cttcagcttc actgacttct tgactctcct cttgagtaaa aggtaatatg 1441 ttcaagtaca atctaatatt atatattagc ataagctttt gtttgctaaa atcaatagta 1501 accccactta tgaaaatgta tattatctat ttaaatctag tttacttgaa ttgcgtattt 1561 gcctaaataa ataatttaca tttttgtata ttacctatag aataccataa tttctaatag 1621 gactgttcaa attagcatag atgatttatc taataaatat ttcctttaaa aactcacaca 1681 caaattcatt ttattttatt ttattttact ttcttatttt actttaagtt ctgggatacg 1741 tgtgcagaac atgcagattt gttgcattgg taaacgtgtg ccatggtggt ttgctgcacc 1801 tatcaaccca tcatctaggt tttaagcccc gcatgcatta gttatttgcc ctgatgctgt 1861 aatgtgtaca attttgtact cgggatctga agatgacttc cagatctcaa ttcctatgaa 1921 ttaacaccag caatcatcac gcaaagtaaa ttatgtggaa tatggcaatg tattacatcc 1981 tctctgatgg tgggacctca attaactctt tttatttcct tgtctagata cctaatatct 2041 actttcttaa ttattaattc tacatttaac ttaaagcagg agttcaaatt ttacaagaag 2101 atactgataa attctctctg gtaaatatat tattataaag gtttattttt aaaaatttac 2161 tttcttctta ataggaaact attttattac ttttttattg acctttccat catttaaaac 2221 agtgctgtta taatgcaaag catatactgt ataagagttt agtcatctca atcttactaa 2281 tttgtttaac tcattttcat tcatttgtgt tatggcagaa aagaaagagt ttactgttcc 2341 aaatttttat tgatgtgtta cttgggaaat atgtttttca taaccattaa ttgtatattg 2401 attatctatg ttcatttatc tatcagttta tgtgttagta aaagaagacc aatttcttag 2461 aacccttttt tattttgtac ttgcaacaga attgacaatc attaacaaca 
gtaaaattct 2521 catgcctttt gtatcataaa aagagttgta gtaggattta tatattaggt aatagaaaaa 2581 cttcaaaata cataattttt atagttagtc ctatagtaga agtttccatt taaatgagtg 2641 aagcaaaatg aacgaataaa taatatggta aatgccataa atcttgggtt ctgcttttca 2701 tgatagagat gtagggctgt ttaagagcag acattttatt ataattagat gaaaaatgtt 2761 actgtaatgt tatatcacaa atgacaggaa agagagatta gaaaaaaatt tttaaattac 2821 ctattaatga attactttat aactgcaatt aacagccaaa aatgccatgt aactgtaagt 2881 cttgtggtaa ttgtattttc caatggaaat gcttttattt acgtttgtaa tttagaaata 2941 agtaacctgc tatacttttt tatatctgca aaggaactgg atacacttgc atatacataa 3001 ttataaaagc atatcagagt cccagatgag gctaactttg ttgaataata ttatttgaaa 3061 tattgaataa tcttcaccag ttcaacttct caaagtagat acaatcattt tccctgtgaa 3121 ttctcttcat tttatttttc gctacccaat atcactaatg ttctaatctc agggttacac 3181 cttctgtgga ttctgtaata gaactctggc tcatagtcac tgactttatt ataaagccag 3241 tctccattgc ttttcattct actttcctaa tatctcttta cactgtacct taattagcat 3301 tcttaatgtc attatataag tcccctgtag tgtcacaaca acctaagtgg tcatcttttc 3361 tccagttttg tcctccctat gtgttgaact gtttgcattt ttaacatgct ttcctctctg 3421 ttttatttct ttataggctc ccaacaccac cctttactca ttccatataa tccaactcaa 3481 gcttgccttt cttcattacc ttcttactcc catcctcttg gtctgtttaa gatgtccatt 3541 tgtgtgtttc attagcatat taatatttcc aaatcagagc actcaccttt acttatcaat 3601 tcatgaggcc atgctggggc tcacacctgt aatccaagca ttttgagaag acaaggcagg 3661 aagattccta gagcccagga gtttcagaac tgcctgggaa atatagtgag actaaaaaca 3721 ttctctacta aaaaaaaaaa aaagaaagaa aaaattcgct gggcatggtg gcacacactt 3781 gtagttccag ctacaagtgg gactagggag gctaaggtgg gaggatggct tgagaccaga 3841 tggtccaggc tgcagtgatc tgtgctcatg cccctgcact ccagcctgga aaacagagca 3901 tgtctcccaa tgctttgaag catagtgtca tcaatggaag tttgatgaat gaatgcatga 3961 aagaatgtga ttactgattt ttcatgtttg attttatagg actcagccaa ctatgaagtt 4021 ttttgtcttt gctttagtct tggctctcat gatttccatg attgtaagta tatctggaaa 4081 ttttaaatac tacattctca gtacttatcc caagtgtctc tcttattcct tgtgttctgg 4141 tatatactca tgttgacctc aatacagctt tataagcttt aatatttccc tacaaggagt 
4201 ttctttgaat taacttatgt aagttcttta aggacttgga atagatattc atcacatacc 4261 tacaaacact caagtacaat tacttgaatg taaaagtaaa agaaaattta gcgagctttt 4321 taaatatgtg ggactaagat ctgtgaaagt atctttaaat gtgaaaagct ctagaccatc 4381 tgaagttaga aatctgagga aagaaaaagg tgcacttcaa tggaacccaa gtattagagg 4441 gtgtcaaaat aggaattttc aagtcatttc ttgcacaacc tactcttaaa gtcagcatac 4501 aattatttaa ttcttcctaa aataaagctg cttattttaa accataaaac aaatctccat 4561 atttattaag tgtcatgagt tcccagctct attgtcattt actactgtgt attagtagga 4621 aaagcacttg acctctttgg agctctttct ccttctctac ttccaccgcc tcttcctcct 4681 cctcctcctc ctcctctttc tatgaactta atagaaaatt ggtcacaaaa gaataataat 4741 actccaaact atccactaaa ttctaatgaa tgtttcgaaa ctgaattggt tatgatatca 4801 cagtttatca cgtgatttta ttataaaagg atagaagtag aaaaataaac aaataagaac 4861 agagtaaagc ataatgcctg tctctatcag caaataaaat atttgtatgt tggtagcttt 4921 tgagctacat ggaacatatt catataaaaa acaatccaga gtttcatttt tcagtttcca 4981 ctctcttatt caccactgac agacttttcc tgccttacat aaaaaagata tataaaccaa 5041 aatgagttga tatttgaata tagcacatta attatagcat ttacccctca ctcattttga 5101 agatgtcata aagctaaaat aactaattta gtctgtcatc aaccaaagaa tgcctccatt 5161 ctgtttaatc atatatgtgg ctaagtcaat atttatactt actcttgaat tataaaatta 5221 aaatattaat tattttctca ttttcttttt ttccaagagc gctgattcac atgaaaaggt 5281 aagacatttt catttacggg aaaacttgat aaataaacat atattgaatt tttaatcttt 5341 tctttttatt tcatagagac atcatgggta tagaagaaaa ttccatgtaa gtgttcttct 5401 gataatgtgc actctaaata aattttcctc tctgactatt tattctccta gaagaatcaa 5461 tagttgtctc tcaaatatat ctatatcatt aattgctaaa gtgcacattg atttcattta 5521 tttatgatgc aaaggcaaaa agtcacaaat atcttgtgtc ctaaaagcat ttcaacagaa 5581 actcaattcc acctgagata agtcaattgt caattatgct ctgagaactt atctttgtcc 5641 tgtaaataat ttcattgacc taaagaatag ttgaaatcct aattctatta agtataacat 5701 atgcttaata aagcatagag gctactcatg taatgtggta agtttctacc ttctgaattt 5761 ttgaggatga tacagtcttt aaattaaact gactgtgcta aattagacag aaaattgtat 5821 ttctctaaca tttttctcac tgtatgtata taaagtctca tgttattttg acctgtaata 5881 
tcaatatgaa tctgggttct aaagttctcc ttactaacat agcaagtata catgttaaaa 5941 aagcaccaag taaaaagatg cataaaaatg gatgaagact gtttgtgtta acagcagtgg 6001 gtatgaaaga atgtaggagt tgtaaaggaa agtaagatca ctaaaggaat gacattaaga 6061 agcaccacct aagttggaaa cagtaactta atcctttcca taatactcaa tagtgagcaa 6121 aatagtagct tggaggtaat attttaaaga agatttgaac cctggttaga gaggttacct 6181 ataatcagtg tttccacctg tgctattgtc acaaaaatat cagcaaacag taaatacttt 6241 atcattttta cctactttta gacatttttc tgataaaaaa gaaaaatgta aaagtttgaa 6301 aacatttatg tagataaaat acaaggtctt aacaacttta gcaatctaag cttttaatga 6361 cagtttttga gaaacacttg tactgtcatt ctctattctc tgcaatttgc tctctccttt 6421 tgtgtgtatg caggaaaagc atcattcaca tcgagaattt ccattttatg gggactatgg 6481 atcaaattat ctatatgaca attgatatcc ttagtaatca tggggcatga ttatagaggt 6541 aagctgactc tagttacttg tctttctaga agtgtcaaca ctgacagttt aaaaaaaaag 6601 ccataagcta acaaccattc cagtttaaga aatgtagcat gggttagctc cttgaagtgt 6661 attcatttgt ataaatgctt ctgaagctgt agatatgtgg tgttatttct gaggtctctg 6721 ttctgtgcca ttggtctgta tgttgttttg gtactggtac catgctgttt tggtaactgt 6781 agctttgtag tatagtttga agtcaggtag tgtgatgcct tcagctttgt tcttttgctt 6841 acaattgtct tgggtatatg agcttttctt tggatccata tgaaatttaa agtagttatt 6901 tctaattctg tgaagaatgt caatggtcat ttgatgggaa tagcattgaa tctataaatt 6961 actttgatca gtatggccat tttcatgata atgatttttc ttatccatga gaatggaatg 7021 tttttccatt tgtttgtgtc ctttcttatt tccttgagca gtggtttgta gttctccttg 7081 aagaggtcct tcacatcctt tgttagctgt atttctaggt attttattct ctttgtagca 7141 attgtgaatg ggagttcatt catgatttgg gtctctgctc gccatctgat attcaacaaa 7201 cctgacaaaa caagtaatca agaaaggatc tcctattgaa taaatgatgc tgagaaaact 7261 gggtagccat atgcagaaaa ccgaaactgg accccttcct tacaccttat acaaaaatta 7321 actcaagaca gattaaaaac ttaaatgtaa taccccaaac catgaaaacc tgagaagaaa 7381 acttaggtga taccattcag gacagccgtt ggcaaagact tcatgtaaaa aaatgccaaa 7441 agcaattgca acaaaagcca aaattgacaa atgggatcta attaaactaa agagcttctg 7501 cacaacaaaa gaaactatca tcagagtgaa taggtaacct agagaatgcc // LOCUS HUMHLL4G 4428 bp 
DNA PRI 23-MAY-1996 DEFINITION Human 14 kDa beta-galactoside-binding lectin (ll4) gene, complete cds. ACCESSION M57678 J05303 NID g184227 KEYWORDS 14 kDa beta-galactoside-binding lectin. SOURCE Homo sapiens adult DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4428) AUTHORS Gitt,M.A. and Barondes,S.H. TITLE Genomic sequence and organization of two members of a human lectin gene family JOURNAL Biochemistry 30 (1), 82-89 (1991) MEDLINE 91105104 FEATURES Location/Qualifiers source 1..4428 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /dev_stage="adult" mRNA join(192..267,1540..1619,3037..3208,4157..>4331) /gene="hll4" /product="14 kDa beta-galactoside-binding lectin" gene 192..4331 /gene="hll4" exon 192..267 /partial /gene="hll4" /number=1 /product="14 kDa beta-galactoside-binding lectin" CDS join(259..267,1540..1619,3037..3208,4157..4303) /gene="hll4" /codon_start=1 /product="14 kDa beta-galactoside-binding lectin" /db_xref="PID:g184228" /translation="MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCL HFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLP DGYEFKFPNRLNLEAINYMAADGDFKIKCVAFD" exon 1540..1619 /gene="hll4" /number=2 /product="14 kDa beta-galactoside-binding lectin" repeat_unit 2338..2638 /gene="hll4" /rpt_family="Alu" exon 3037..3208 /gene="hll4" /number=3 /product="14 kDa beta-galactoside-binding lectin" repeat_unit 3402..3705 /gene="hll4" /rpt_family="Alu" repeat_unit 3706..3988 /gene="hll4" /rpt_family="Alu" exon 4157..>4331 /gene="hll4" /number=4 /product="14 kDa beta-galactoside-binding lectin" polyA_signal 4326..4331 /gene="hll4" BASE COUNT 951 a 1266 c 1251 g 960 t ORIGIN 1 gggcggggtg agggggggca gcagctcgcc actctgattg gtcacctctg ctccaaaact 61 ggcttcaaaa ttccacggac tccgccccca gtggccccca gccctatcct gacttgcaat 121 tggctgaact ttcagggggc ggggctcacc cggtccggtc cagttaaaag ggtgggagcg 181 tccgggggcc catctctctc 
gggtggagtc ttctgacagc tggtgcgcct gcccgggaac 241 atcctcctgg actcaatcat ggcttgtgtg agtgtgggga ccccccccca aggtccaggg 301 gatagggcag gaactgatgg ccagaggaga gctgggcaga tcgggagcag attctagccc 361 cagctgtgtg gcctggaacc agtgccttct cttttctgga cctcagtggc cacatctgta 421 aaatgggggt gggcgccatg gtccctcaag gccttctctg cattgataat tgtctggatt 481 cctccagggt ctgaaagcac agttatttct gcccagggtt gacattctgc agctctctga 541 gaagtgagcg tgggaagggt gtggccaact gggggacacc caggccacta tccctttccc 601 cctcctccac cccaaagaac ctcctgtcac ctccaccctg cagctgtccc ggtcaccagg 661 ccagggcaga gttaccctct actccagaga acgctgaaaa gttgccagga cccgagacaa 721 gctgcccaag atgggccctt ctaggtcggg ggtggagggt ggttgtgtcc aggctggtgg 781 cggggggggc cggggaaatt cccttccacc acccccaagc tgggaggttg gggtggcagg 841 gaggtgagaa tcttcctcgg gctccaggga aagggttcaa gtttctggca gaagaaacca 901 ctcaaaccag ttaactcttg cccacccact cggggtgcca gggaacagca gtgcccagca 961 gtttgctcaa tcttgttaac cctgagccag cccagccaca gcccgactgc agggctactc 1021 tccctaatgc agagaggccg tgtttttcag ggtctcctct ctagcccctg ggcctctttg 1081 cagagagggg cttgaaggag aatagtggag tcgcccgccc ctagccactt cctctccagc 1141 agaggggccg ccccctccat tcagtgtgac ggtgggccaa gtgtcggccc ctccccagcc 1201 tgatcctctc catctgcgat gggacagagc accccatctc ccaagtcact cttgagtcca 1261 aaattccaag ccaatctgca aaatcttcta gagcctgtct tctagaacct tcacgttaca 1321 gactgaagcc aaccccggtg ggaataggga ctttcccagg accacataga caatcggagg 1381 ctggaaatta cagctcaatc ctttccccag gcttcccctt ggcttggtca gaggatgccg 1441 ggcgggaaca accccactcc cacccccagc cacccccgga cacttcgagc agtggaggcc 1501 ttgtcctcta accggctggg ccggggcttg tctgtgcagg gtctggtcgc cagcaacctg 1561 aatctcaaac ctggagagtg ccttcgagtg cgaggcgagg tggctcctga cgctaagagg 1621 tgagaagtga agtcggggtg gtgggcggca gggacgggct tggtccagca gggagggcgt 1681 ggccggccaa gcccacatct cctccctggc cgggagcggg ttaacggcca gccgccgatg 1741 ctgcgttttc tgggtgactc acttcccccg cagggtctgg gcgcccccac cgttgccgcc 1801 ccctcccccg ctccctccct gctgtagcct cttaaactaa accagctgca gcctcgtcat 1861 ctgtaatacc ttgactcccc cgcccccacc cctttccggt 
ctgggcggga cctgtcgctg 1921 gggagggcgg ggagaggagt ggggcgggcg tcaccgcgcc ttccccctga gtccctcctt 1981 cctgcgttct ggtcattcat tcattcactg gctcagtccg gtccttcttc cttcacttct 2041 cattcactca gacggctgcc ttattttctc ggcagttagg tgaccttgga ccagtcaacc 2101 aacctcacct agcctcagtt tgttagaaaa acaagggggg aggtggttgt gtggaggaag 2161 tgagatgcgc gtggcgcagt gacagtgagg aggatgagga tcagctgata ttgttaagag 2221 ccgcacactt ctcatgcact aatttcattt caaccaaacc ctcagaggca ggtattgcta 2281 ttatcaccac tcaacaaatg tatttattat tttatttatt tatttattat tttattattt 2341 tatttattta ttttttgaga cacagtctca ccctgtcgcc caggctggag tgcaatggcg 2401 tgatctcggc tcaccacaac ctccgcctcc cgggttcaaa cgattctccg gcttcagcct 2461 cctgagtagc cgggattaca ggggcccacc acaacgccca gctaatttat gtacttttag 2521 tagagacggg gtttcaccat gttggccagg ctggtctcga actcctgacc tcatgatctg 2581 cccgctcgcc tcggcctccc aaagtgctgg ggttacaggc atgagccacc gtgcccggga 2641 gtattattat tattatttgt taattcgccc cataattact aagcatcttt ttctggtgtg 2701 ccccacttgt gctgggcacc gggaatacag agatgtacaa gacaggacgg gaggtcacca 2761 tctggaggga gtcactgacc ttgaccaaac gggctaggat gctaatgact tcaagaatca 2821 agcgagccct cctgtgcagc ccccattgta cagatgagca aacaggggaa gaggggcagg 2881 agcaggtggc atggccagag ctagaatcca ggtttcttgt ctctgttagt gagttcttcc 2941 agcaaggtgc gttcatggga tactgagtga cagattagtc ggtcagtggg gctggagctg 3001 gccgaggtgg cctcatgccc acccgttacc ccccagcttc gtgctgaacc tgggcaaaga 3061 cagcaacaac ctgtgcctgc acttcaaccc tcgcttcaac gcccacggcg acgccaacac 3121 catcgtgtgc aacagcaagg acggcggggc ctgggggacc gagcagcggg aggctgtctt 3181 tcccttccag cctggaagtg ttgcagaggt gggctgcaga ccggaaccgg ggaccaggga 3241 caggggctgg gtgggctggg gcggggctgg gttagtgact agagaccttg gccctgcctg 3301 ctctttcccc tccccttccc tcccttcctg tgtgatggcc agtgtctgcc cctctttgaa 3361 cctcagtggt tgattacaat aaaacgaagg ggaaaaaaaa aggctgggct tggtggctca 3421 tgcctgtaat cccagcactt tgggaggccg aggcgggcgg atcacctgag gtcgggagtt 3481 cgagaccagc ctgaccaaca tggagaaacc ctgtctcgac taaaaataca aaaaattagc 3541 taggcgtgat ggcgcatgcc tgtaatccca gctactcagg aggctgaggc 
aggagaattg 3601 gttgaacccg ggaggcggag gttgcagtga gccgagattg caccactgca ctccagcctg 3661 gccaacaaga gcaaaactct gtctcaaaaa aaaaaaaaaa aattagccag cgtgctggct 3721 catgcctgta atcctggcac tttgggagac caaagtgggt ggatcacctg aggtcaggag 3781 ttcgagacca gcctgaccaa catggtgaaa ccccgtctct atataaatac aaaaattagc 3841 tgggcgtggt ggcacacaac tgtagtccta gctactcagg aggctgagac aggagaatca 3901 cttgaacctg ggaggcggag gttgcagtga gccgagatta tgccactgta ctccagcctg 3961 ggcgacagag gagactccat ctcaaaaaaa aaaaaaaatc tatcatagga ttagagtaaa 4021 aagaaagaaa aaatattata aatgtacctc cgcaggctca gccacaaact ggggctgtgt 4081 cagggccaca tgaggaacgg gttctggaag ggcccatggc atgtgggccc ggctcactgc 4141 tctcctctac ccccaggtgt gcatcacctt cgaccaggcc aacctgaccg tcaagctgcc 4201 agatggatac gaattcaagt tccccaaccg cctcaacctg gaggccatca actacatggc 4261 agctgacggt gacttcaaga tcaaatgtgt ggcctttgac tgaaatcagc cagcccatgg 4321 cccccaataa aggcagctgc ctctgctccc tctgaaccag cctcgtgtgt gtgcttgtgc 4381 gtgtgtgtgt atgtgtgtgc atgtgtttgt gtgtgtgaga ggggtccc // LOCUS HUMHMG14A 8882 bp DNA PRI 08-NOV-1994 DEFINITION Human non-histone chromosomal protein HMG-14 gene, complete cds. ACCESSION M21339 NID g184231 KEYWORDS DNA-binding protein; HMG-14; chromosomal protein; high mobility group protein; nonhistone protein. SOURCE Human lymphoblast DNA, clone pH14g. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8882) AUTHORS Landsman,D., McBride,O.W., Soares,N., Crippa,M.P., Srikantha,T. and Bustin,M. TITLE Chromosomal protein HMG-14. Identification, characterization, and chromosome localization of a functional gene from the large human multigene family JOURNAL J. Biol. Chem. 264 (6), 3421-3427 (1989) MEDLINE 89123472 COMMENT Draft entry and computer-readable sequence [1] kindly provided by M.Bustin, 23-NOV-1988. 
FEATURES Location/Qualifiers source 1..8882 /organism="Homo sapiens" /db_xref="taxon:9606" /map="21q22.3" prim_transcript 851..7654 /note="HMG-14 mRNA and introns" exon <1054..1068 /gene="HMG14" /note="high mobility group protein 14, (first expressed exon); G00-118-809" /number=1 gene 1054..1068 /gene="HMG14" CDS join(1054..1068,1391..1423,1517..1546,1629..1676, 4693..4821,6819..6866) /partial /note="high mobility group protein 14" /codon_start=1 /db_xref="PID:g386779" /translation="MPKRKVSSAEGAAKEEPKRRSARLSAKPPAKVEAKPKKAAAKDK SSDKKVQTKGKRGAKGKQAEVANQETKEDLPAENGETKTEESPASDEAGEKEAKSD" intron 1069..1390 /note="HMG-14 intron A" exon 1391..1423 /number=2 intron 1424..1516 /note="HMG-14 intron B" exon 1517..1546 /number=3 intron 1547..1628 /note="HMG-14 intron C" exon 1629..1676 /number=4 intron 1677..4692 /note="HMG-14 intron D" exon 4693..4821 /number=5 intron 4822..6818 /note="HMG-14 intron E" exon 6819..>6866 /note="high mobility group protein 14" /number=6 BASE COUNT 2393 a 1714 c 2094 g 2681 t ORIGIN 1 bp upstream of BamHI site; chromosome 21. 
1 ggatccccag cactttggga ggccgaggcg ggaggatcgc ttgagcccag gagtcggaga 61 ccatgctgtg caacatagtg aaaccccctc tctacaaaaa atacaaaagt tagctgggca 121 aaatggtgcc ctgtggtccc agctactcgg gaggccggag tgggaggttc gctggagccg 181 aaggggtcga ggctgcaatg agccgtgatc gcaccactgc actccagcct ggacgacaca 241 acgagaccct gtctcaaata aaaggaagga aggaaggaag ttacacagaa aggccgcgtc 301 gcgtctccgt cccacgccct cctgcagcgc ctgcgcacca ggcccgcttc acgcaggcct 361 gcgaagctgg agcccctgga tagcctttct tgccgacaga ggcgggagaa atttgctact 421 tcctgtatac cttatccttc tcccttccca gtctaagata cgaactataa atgttcgaac 481 ccaattcacc ccggagaggg gccagatacc agtggcctga aggcgcccag gtatccagaa 541 gaattgtggg tggggacccg cggtcgtgac gtgcgtccgc caatcagcgc gcagaccgca 601 ctttgcgctc ggcttcaaac taccgtgagc cggagcgcac tgggaccccg cccccttcgc 661 ctgggtctgg ggccccgcga gacggcggaa aggggtgggg gcgcccgggg gggcgggagg 721 ggggcggggt gcgggatcga gtgacggccc gcctcaccta ttccgggcgc gggctgagtc 781 ccgtagccaa tgggcggggg tggggggcgg cccggccggc ggggaggggg agccgcggcc 841 gggacgcggg gggaggagga ggcgggctcc caatccggtt ccatccggtt ctcccaccgc 901 ccccgctgtg ggtctcagca gctcgggcgg cgggaggagt ggcagcggca aggcagccca 961 gtttcgcgaa ggctgtcggc gcgccgcggc ccgcaggcac ccggcacgcg ccttccccgc 1021 aggcacccgg cacgcgcctt ccccgccgcc acgatgccca agaggaaggt gagcggcggc 1081 cgcggcccgc acacgccccc tggagccgcc gccggccccc gccggccccg cgaggcccag 1141 gccccgttgc acccacggtg gcgacgggcc cgggaggcgc ttggagaccg gcgggcgggc 1201 aggcgagcgc tcggcggccg cgggggcggc gttctggaac gtttggcggc cgggggagct 1261 gagggggcta ttcgaacggg gcggcgggaa gccgtgacgt cacgcggccg ggcattgttc 1321 tcggggccgg gcgggcccgc gagtcctggg actgcggccc gcctctattc gtgcgtctcc 1381 gtctccgcag gtcagctccg ccgaaggcgc cgccaaggaa gaggtgagtg cgggccttct 1441 gcggggggtg gtgggtttcc cgtgagccgc tggcctgcct tctcttctcg ctgactctcc 1501 tttttctttc tccaagccca agaggagatc ggcgcggttg tcagctgtaa gtaaagcgag 1561 ccccgtaacc gttcgttttc cgcgggtcgt cccgggtgag gacgctcagt gctgcttttg 1621 cctttcagaa acctcctgca aaagtggaag cgaagccgaa aaaggcagca gcgaaggtaa 1681 gcctcgaaac gcgcattggg 
atgcagcggg gccttaggct acactgcttc ttaatgcggg 1741 gcttccattt tgattagcta ttggagcttt atttatactt taataattac ggtaaataat 1801 ttttctagtg gtcgaggcaa aaatgtaatg gatatattca tcctggttta tagaactata 1861 tcacactaat gctgctgcag atgttaggag acttaggacg acaaaaaata ttaaatatta 1921 agtctacaaa ggaaatttat tctttgcgtt gcgtacattg tggctggtgc ttggctttta 1981 attcgtgctt gtacttcctt ttttgacaat aaaagagtca agatagcacc gaggccagga 2041 gaaagggaac gtgtaagttt ttatatatac agtttccaag ccaacttcgg gaagccttaa 2101 cctttttacg gggtgggggt ggggaggtaa aaagttgtga tctctgagaa aataaccgcc 2161 actactctgg aagtgttcat cagcagttat acaaaaccgt gattttggct gctccctaac 2221 aaattcgtga ttgcatgatt cgaattgcag gtctgtagaa tgaagttggc tttgggtgta 2281 cgtgtgtttg ataacttgca gggtggaaaa gcagaacatg tgtaaaacaa gtataagctt 2341 tgtgtttgga tatagcattg aaacctgtaa agctctacat gttcttcgcg gggtgattaa 2401 tttttatcca aaacttaggt gagagtttgt catttgaagt ctcaaaaaaa atttttgtta 2461 atgccttcac ctctttccta aaagtgtagg tagaagttta gcactgcagt catcagaaga 2521 ggcaagatta atttccttga cattgaggag ttgttggata attactaaat ttgtcttgtt 2581 aataaggtat aaaacaagat gttttttctt ttttaattta aattttttac agagctgagt 2641 gtaaaaatat taactatgtt ttctttaaca gaaagttctg tttttgtgat ccttttaaaa 2701 ataaagcttc acggaaggta tgagaatgat atttttcaac tttaaatctc tcattaccag 2761 aagaccatgt ggtaattctc tgtatacagt tagaacagca cggaaacttg aaggcctaaa 2821 aaattagctg accttgttaa aaatgttggc gtgacgagta tactattacc tatctttttt 2881 tattgtgtgt gtgtgtgtgt gtgtgtgtgt tttaaactaa ttggctgaaa tatctgcctg 2941 tttccctctt tacatttttc ttgtttcttt ccttatttat ctttgtccat cttgagatct 3001 actgtaaagt gaatttttta atgaaaacag ttccaagttt tactctcagt gggtttggga 3061 catcagatgt aattgagagg ccaacaggta agtcttcatg tcagtgtttg ttgaggaacg 3121 agcctatgag gtcagttttc ccaaaaggaa aaagggcaga agggatttgt tcatttttac 3181 atctcgtttc tgtaatacac ctttgacttc atggttgatc agactttgaa gtctaaacag 3241 aacgtaagca cttggtgtat cgattatcat actacacagt agacatgttt tcaaggcata 3301 tcttgtcact gtgtgtgctt ctgacttgta tctttcccta gaactaaata ttttgaaggt 3361 ccaaacatta tacttcggga aacttgattg 
attttttttt aatctagctt ttcagctgat 3421 aatggttaca cgtggtatta tttcaagtcc aaggtattat gtattctatt ttaactgttt 3481 cccagttttg tatttttttg tatttggaat gattatgaaa ttaatgagaa atctttagaa 3541 atcagactac aaaagagtag ttcttaaata cagaagtgat tgtgaaactt tgtgatagca 3601 atgtgtgtag tagcagtttt gtctcaaata catttatcat ttgttactca aaggatgttg 3661 aagtattaaa agtcattatg ctgtctgtgg aatcctacta ttagtacaga acaccctgca 3721 gtgatttttc cgcccatgtt agcattgaag ttgtgagctc tacttgcttg tctttatggc 3781 cctttaatca agtaattggt cagtatccgt atgggtcttt ataccaccgc tggggctaaa 3841 ctaatttagc tgctgctgta tacttactaa caaggaataa atgttaagct ttcttctcag 3901 tattgatgga tggtatctaa aagtattttt atgtttcttt aacatggctt aaattttgaa 3961 cttaatgtat caagttagta tggtcatatt aatacctgtg cttttcagat tcttcaaaca 4021 cctaaatgaa agtgataaat tcaaaactga tccttttagt tcctcattat atgatatgaa 4081 gggattaact gtagcaggat agtcaacctg accgtacggc atggtgcttt ttttcaggtg 4141 attggtctta atgtgagggt taaggtcttg tgaggacagt attgttgaag ttcacaacaa 4201 atttggggat gggtgggaat acggtcgacg agactcctcc ggtattcaac ttcatgacat 4261 tgtcctgtca tgtggaactg tagggaatag tggatgatgg cgacagtggt gaaatatagc 4321 tcgggcttta ggatttgtca tatgtctaat gaggcttgac atgcaagtac tattaaaatc 4381 tccttagcat acgttgtttt cctgattact ggggtggacc ttaagttgtc tcatgtaata 4441 ggtgtatcaa ctgatgtgtg atagacatta atatgacagc tgttatggat acagtctata 4501 caaaatgaga tcctttcata agcttgaaga ttggggtttc acgctcatgt gtgagatgtc 4561 cctctctcaa accttactat gaagtcagca ccactgggca cattacttgt ctgacatgaa 4621 ggggaaaaag taaaacaaaa tgtaaataag gagtaaagca tgtacttaaa atatattctt 4681 attctatcat aggataaatc ttcagacaaa aaagtgcaaa caaaagggaa aaggggagca 4741 aagggaaaac aggccgaagt ggctaaccaa gaaactaaag aagacttacc tgcggaaaac 4801 ggggaaacga agactgagga ggtcagaagt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 4861 gtgtgtgatt gctttgactc tgaacagtag tactaaattg tacatttttt gtagcagata 4921 acagtattca ttataatatt gaacaaagtt ttgtacggta tcccaaagtg attggacaaa 4981 tggttttcta tgatttatgt tattggaaag taaatacgta aatggcttat ttaagagcaa 5041 gaaattttaa tatgctcaat ttcagatata attcaatctt 
tgttttgttt ttatatctgg 5101 atgttgcttt taaatatatt tttgaaaatt tattcccctt acttgcattt ttcctggcat 5161 ttttttccca ctttcatttc taatgaaaat gtttttgttt gtatttttct taaaggtcta 5221 aataataaca tagatcaagt aaataaaaag actagtgtca ttaatcttta ttcttagaga 5281 cagtgtctgt tagccaggct gaatctttat tgttttaagc tgtgaggaaa agatgtcttt 5341 ttttttttat taatgaagca gtgggagtag agaaggaaca aagaagtctg taacaggttt 5401 tgattgattt gttgtgagca ctactgcact cctctagcct ggaagctgtt tcttaaaaaa 5461 taactgaact gcaagattca gttaagccag gcctttattt tatgtttgtt atattagcta 5521 agctagaaaa gataagtaat ttagtatatt gatttttaaa ttttgatatc acttaagtat 5581 ttgatgttta tatttgatat ttatcaaatt cttaagttat aaactactta ttcctaagtt 5641 aagaaattta taatcacagt ttattttttt aacttgtact tcttgtactt cctaattcct 5701 tttttttttt tttttttttt tgagacggaa tttcgctctt gttgcccaag ctggagtgca 5761 gtggtgggat cttggctcac tgcatcctcc acctcccggg ttcaagcgat tatcctgcct 5821 cagcctcccg agcggctggg attacaggcg tgcatcccca cacccaacta attttttgta 5881 tttttagtag aaacggggtt tcgccgtgtt agccaggctg gtttcaaact cctgacctca 5941 ggtgatctgt ccgcctcagc ctcacaaagt gctggtatta caggtgtgag ccaccgctcc 6001 tggcctgtac tacttaattc taaccataca tcaaatactg tgtgataata agatcacatt 6061 gctaatctag ttctgatttt aaaggaagaa cttgaccttc tagtacatgt taccaaatta 6121 cagttggttt cccttgttta tagctgtgtt ctattaatat aaggctgcca caagtatgaa 6181 accgttttca cataggggaa gtacaggatt gggttcctac aaacctctga tcacaacatt 6241 ttcatcaact gaaaatacgt gactttgtta catgtgtatt tctgtttaga catgccttaa 6301 tgtatgttaa attcacatcg aactcactac ccagtagcac tgtaatttgt acctgaacaa 6361 agccctccaa cacatacgtt ctctgtgtgg cacatcacag ccttgaattg agcaacatta 6421 gatagcactt gaacaagatg cgtgggggcc attttaaaca acaaaatcat cagtaaaaag 6481 cacaaaggtg taaaatatgt ggcactaagt agaccggaaa aacagttgtt tacggcatta 6541 gagctgaaac aagaagacag agtgacctca gcttggagtg tacatgtgat acaactcaac 6601 attttttcct gctctgtaga tgcatgcaca tgaccatgga agggctgcaa gtattgattt 6661 ggggtgaggg ggtggttaac aaaagttttt taacaagagt aggtaggtga ctttacaaat 6721 aacaaaggct attgactttt ttaaagacct gatagttgca tataaataaa 
atgacaacgt 6781 gttaaatgct aaagatctca atttcctttc ttttgcagag tccagcctct gatgaagcag 6841 gagagaaaga agccaagtct gattaataac catataccat gtcttatcag tggtccctgt 6901 ctcccttctt gtacaatcca gaggaatatt tttatcaact attttgtaat gcaagttttt 6961 tagtagctct agaaacattt ttaagaagga gggaatccca cctcatccca ttttttaagt 7021 gtaaatgctt tttttaagag gtgaaatcat ttgctggttg tttatttttt ggtacaacca 7081 gaaaatagtg tgggatattg aattatggga ggctctgact gtctcgggtg tcagcttaac 7141 attccacaga tggggggtta gtttttatat cctataatac aaagcatatt aaatggcaat 7201 atggagtcag tcctgcattt aatgtcttga acattttaaa ttacttctat taccatgttg 7261 ttttttagta gaattgtttc ctaaagaaaa ccactctttg atcatggctc tctctgccag 7321 aattgtgtgc actctgtaac atctttggtt gtggtagtcc tgttttccta ataactttgt 7381 tactgtgctg tgaaagatta cagatttgaa catgtagtgt acgtgctatt gagttgtgaa 7441 ctggtgggcc gtatgtaaca gctgaccaac gtgaagatac tggtacttga tagcctctta 7501 aggaaaattt gcttccaaat tttaagctgg aaagtcactg gaataacttt aaaaaagaat 7561 tacaatacat ggctttttag aatttcgtta cgtatgttaa gatttgtgta caaattgaaa 7621 tgtctgtact gatcctcaac caataaaatc tcagttatga aaatattttg aagcacatgg 7681 attaaattct ggtttaaatt ttatttacag aatttccaca tttgagcctg gccaacatgg 7741 tgaaaacctg tccctactaa aaatgcaaaa aaaaattagc caggtgtggt ggcaggtgcc 7801 tgtaatccca gctactcggg aggctgagcc aggagaatcg cttgaacctg ggaggtggag 7861 gttgcagcgc acccagatcg cactactgca ctccagcctg ggcaacagag tgagaccctg 7921 tctccaaaaa aaaaaattcc atatttgatc ttccaaagtt gtttttttta agtgaaatgt 7981 ttgaattttt gtttaataca gtcatgggtc accactgact ggatacagtc taagaaacac 8041 attgttaggc agttttcttg cgtgaacatc atacagagtg cttacacaaa cctcagtgca 8101 tagcctatta cctatgtagg ctgtatggtt taacctattg ctcctaggca acagacctgt 8161 acagcatgtt attatgtaaa tactataggg agttgtaata gaatggtaag tacttgtgta 8221 tctaaagaga aggtacagta tataatatgg gaccattatg tggttcattg ttggttgata 8281 cctgagacat aactgcttga gtatctatgt cctttgaagt tgcttgacta tattggttta 8341 agggaaaaac aatccaatat cctttgaatc aaagcaaata cgtaattctt aactcttcgt 8401 atgaggttaa aaaaagtaca gataggccag gtgtggtggc tcacacctgt aatgccagca 
8461 cttagggagg ccgaggtggg taggtcacct gaggtcaggt gtttgagacc aagatggtga 8521 aaccgcatct ctactaaaaa taaaacaatt agctgggttt ggtggtgggc acctgtaatc 8581 ccagctactc aggaggctga ggcagaagaa ttgcttgaac ccaggtggta gaggttgctg 8641 taagctgagt tcacgccatt gcattccagc ctgggtgaca agtgaaactt gtcccaaaaa 8701 aagaagctac aactaaatct agtatggtat gtattttcca gatttgcttt catatacatt 8761 ctgttgtcat tttgccctct taagtatagt cttcagaggt ggtgtggggc aactcttgac 8821 ttggcaacag tgaaaacggc cggagaacag accttgttga ggttatcact aaatagggat 8881 cc // LOCUS HUMHMG2A 4341 bp DNA PRI 31-DEC-1994 DEFINITION Human high mobility group 2 protein (HMG-2) gene, complete cds. ACCESSION M83665 NID g184235 KEYWORDS HMG-2 protein; high mobility group 2 protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4341) AUTHORS Shirakawa,H. and Yoshida,M. TITLE Structure of a gene coding for human HMG2 protein JOURNAL J. Biol. Chem. 
267 (10), 6641-6645 (1992) MEDLINE 92202209 FEATURES Location/Qualifiers source 1..4341 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="blood" mRNA join(952..1048,1649..1818,2104..2249,2323..2497, 3080..3616) /gene="HMG-2" /product="high mobility group 2 protein" exon 952..1048 /gene="HMG-2" /number=1 gene join(952..1048,1649..1818,2104..2249,2323..2497, 3080..3616) /gene="HMG-2" intron 1049..1648 /gene="HMG-2" /note="does not fit consensus" /number=1 exon 1649..1818 /gene="HMG-2" /number=2 CDS join(1669..1818,2104..2249,2323..2497,3080..3238) /gene="HMG-2" /codon_start=1 /product="high mobility group 2 protein" /db_xref="PID:g184236" /translation="MGKGDPNKPRGKMSSYAFFVQTCREEHKKKHPDSSVNFAEFSKK CSERWKTMSAKEKSKFEDMAKSDKARYDREMKNYVPPKGDKKGKKKDPNAPKRPPSAF FLFCSEHRPKIKSEHPGLSIGDTAKKLGEMWSEQSAKDKQPYEQKAAKLKEKYEKDIA AYRAKGKSEAGKKGPGRPTGSKKKNEPEDEEEEEEEEDEDEEEEDEDEE" intron 1819..2103 /gene="HMG-2" /number=2 exon 2104..2249 /gene="HMG-2" /number=3 intron 2250..2322 /gene="HMG-2" /number=3 exon 2323..2497 /gene="HMG-2" /number=4 intron 2498..3079 /gene="HMG-2" /number=4 exon 3080..3616 /gene="HMG-2" /number=5 BASE COUNT 1103 a 940 c 1172 g 1126 t ORIGIN 1 gaattccgga cccggtagga gggggcgggg ccgggggaga aggcgccgcc tccgcccctt 61 cgggacccgg agggtgccct tgtttacctc ccagtatttg tcacccgaga caaagaggga 121 aggaaggagg agccttctga gggaaactcc ctcggcggga agcatcttac tttcagaacg 181 ttttttaaca ccctacggac gcgggaacgg ttccctcccg gccccccgcg ccctttccga 241 ggttccctgc cttgacttcc ccgagttcta cgacaacctc agcgacagcg ggaggagctg 301 ggcaatcggt tttgcagggc aaactccagg ctctcctcat ttattattct ctgggcggcg 361 cacgggagac cctgcagggc aacagagcag agcgactaca gctcccagga gccaacgctg 421 cagggctgag ccgacgcggg ggacagacag gacctaaatg gtggtcggag gaagggcaag 481 gcagacctct tctgttacct tagcgccagc aaaccgagcc gccgccgtcg ccaagccagg 541 agaccaaccc ccagtctccc gactaagtat ttttaaatct ggcggggctt tccttcccga 601 gccaaccact cccaccgttt acagccccaa ccagtcagcg tgaggcccgc catttttcaa 661 accctcttcc cgccgccaat caggatcgag cagtacattc ctctcctgac 
cggttctaac 721 gggtctggga gttaacgacc tgggcgaccc accgaacctg ctaggctggg gcgtggaggg 781 cgggtctggt aagacactga ccaatcgtta gcctccgtgg caagggggcg gggactatct 841 gggtttgaat aatcgtcaga accaatcaga ttagtgggga tgtggcccgt ggcctagctc 901 gtcaagttgc cgtggcgcgg agaactctgc aaaacaagag gctgaggatt gcgttagaga 961 taaaccagtt cacgccggag ccccgtgagg gaagcgtctc cgttgggtcc ggccgctctg 1021 cgggactctg aggaaaagct cgcaccaggc aagaataccc tccaataccc tcgggttcgt 1081 ggacctgcct ttccccattc cctcagagcc tctactcggt ctcggcgcag tggctctcgg 1141 ggtctgaccc ggcgagcggc atttggggtg cgggccggcg agggctgggt ctgtggaggg 1201 ccggcgggca gtcggaggag gcggaaactg cccctgaccg ggcccggttc tgggagtttt 1261 caatgtcggt cacgaggtgc ttagcggtta agtgaggctc gcgacgtggg agctggggga 1321 catttctgtc aggaaaggcg ttttgagggc gcttgcggag tcaccggact cggtgaaggg 1381 tgggcggcgg cgtctcgggc cccgggagcc acttgctggg ctgagccgcg gccgctcggg 1441 ccgggaggag gaggaggagg agggtgtgcg ccggggcgcg ggcgggggcg cggcggccgc 1501 ccttgggagg acgccctcgg gaggaggggg gcgcgggcag ggcgggagcg gatttgggcg 1561 ggaagcggag ccccgccagc gcccgccctg gcagctgcgg cgtccgcgcc gacctccggc 1621 ttcccctctc ccccctcggc cccgtcaggt ggacgcggat ctgtcaacat gggtaaagga 1681 gaccccaaca agccgcgggg caaaatgtcc tcgtacgcct tcttcgtgca gacctgccgg 1741 gaagagcaca agaagaaaca cccggactct tccgtcaatt tcgcggaatt ctccaagaag 1801 tgttcggaga gatggaaggt gaggcaggag agggacggag ctcagggggt gcaggtgtgg 1861 ttttcggcta ggagggcctt aggcaggtag taggcaactc atgagcttca actttgccac 1921 cccagtgact cacctgaata tcttaagagt tccaggcaga cgattttagg tttttactta 1981 tataagactt cgaaagcaaa atgctgttaa ggaaaatgtg gtccttaaag atgatcttgt 2041 cgtctttgga tgtttatagg taacaatttt gctgtatttt tggctcttat tttatgtcca 2101 cagaccatgt ctgcaaagga gaagtcgaag tttgaagata tggcaaaaag tgacaaagct 2161 cgctatgaca gggagatgaa aaattacgtt cctcccaaag gtgataagaa ggggaagaaa 2221 aaggacccca atgctcctaa aaggccaccg taagtttaaa ataacccaaa ttgctccttg 2281 gatttttcct tcagtttatt aaactctgtt gcttcctttc agatctgcct tcttcctgtt 2341 ttgctctgaa catcgcccaa agatcaaaag tgaacaccct ggcctatcca ttggggatac 2401 
tgcaaagaaa ttgggtgaaa tgtggtctga gcagtcagcc aaagataaac aaccatatga 2461 acagaaagca gctaagctaa aggagaaata tgaaaaggta cagtgtcatc ttttttaaag 2521 ccgtggataa gactaggtat aggtaataac tgtagaaaac ctgggaaatt tagttaatct 2581 tgtattaatg gttgtcagct atgttttgaa aaggcctaat gaaaattgta cacttcaaca 2641 caaggtaatt gaaaccttcc ttttgactga aaccagtgtt tgtagcacta gtatattcct 2701 gcagacagac ttgtagttac ttgtagttat tgtatagtct gtatagtctg ttattttttt 2761 ttttaagcag agagtcaaag aaattgtttt atgtagatat atataaaatg tgaaggtaca 2821 ggagggacta tggcactgtg tgtgatgtaa aagggtattg gtagtgaaag tactgatact 2881 gctgtatcgc aacccttgtc attttacgtc attaacttgt taaagcctag tgggataagt 2941 gctctaaaaa cttagactgg ttaccttttt agacagttat tagggttatt ggtcaatcat 3001 cttagattgt ttacaactaa gtggtttttc acagttgagt aatgataccg gatgctttat 3061 ttttttgaca atatttcagg atattgctgc atatcgtgcc aagggcaaaa gtgaagcagg 3121 aaagaagggc cctggcaggc caacaggctc aaagaagaag aacgaaccag aagatgagga 3181 ggaggaggag gaagaagaag atgaagatga ggaggaagag gatgaagatg aagaataaat 3241 ggctatcctt taatgatgcg tgtggaatgt gtgtgtgtgc tcaggcaatt attttgctaa 3301 gaatgtgaat tcaagtgcag ctcaatacta gcttcagtat aaaaactgta cagatttttg 3361 tatagctgat aagattctct gtagagaaaa tacttttaaa aaatgcaggt tgtagctttt 3421 tgatgggcta ctcatacagt tagattttac agcttctgat gttgaatgtt cctaaatatt 3481 taatggtttt tttaatttct tgtgtatggt agcacagcaa acttgtagga attagtatca 3541 atagtaaatt ttgggttttt taggatgttg catttcgttt ttttaaaaaa aattttgtaa 3601 taaaattatg tatattattt ctattgtctt tgtcttaata tgctaagtta attttcactt 3661 taaaaaagcc atttgaagac cagagctatg ttgatttttt tcggtatttc tgcctagtag 3721 tagttcttag acacagttga cctagtaaaa tgtttgagaa ttaaaaccaa acatgctcat 3781 atttgcaaaa tgttctttaa aagttacatg ttgaactcag tgaactttat aagaatttat 3841 gcagttttac agaacgttaa gttttgtact tgacgtttct gtttattagc taaattgttc 3901 ctcaggtgtg tgtatatata tatacatata tatatatata tatatgtata tatatacaca 3961 tatatacgta tatatacata tatatgtata tggagtctca ctctgttgcc caggctggag 4021 tgcagtggca cgatcccagc tcactgcaac ctccgcctcc cgggttcaag cgattcttct 4081 gcctcagcct 
ccctggtagc tggggctaca gccatgtgcc accaagccca gctaatttat 4141 atttttagta gagacagggt ttcaccatgt tggtgaggct ggtctggaac tgacctcaaa 4201 tgatctgccc acctcagcct cccaaagtgt tgggattaac aggtgtgagc caccacgcct 4261 gtcccagtat attgtttaac aagtttattt tgggtgaaaa attctcttta atgggaagaa 4321 gaggggctag aatgtgaatt c // LOCUS HUMHMGIY 10144 bp DNA PRI 17-MAY-1996 DEFINITION Human high mobility group protein (HMG-I(Y)) gene exons 1-8, complete cds. ACCESSION L17131 NID g306868 KEYWORDS high mobility group protein; nonhistone protein; transcriptional regulatory protein. SOURCE Homo sapiens male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10144) AUTHORS Richards,C.A., Austin,E.A. and Huber,B.E. TITLE Transcriptional regulatory sequences of carcinoembryonic antigen: identification and use with cytosine deaminase for tumor-specific gene therapy JOURNAL Hum. Gene Ther. 6 (7), 881-893 (1995) MEDLINE 96097131 FEATURES Location/Qualifiers source 1..10144 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" mRNA join(629..719,911..1074,1373..1547,3037..3136,4481..4659, 6460..6543,7219..7269,8581..9977) /gene="HMG-I(Y)" /note="transcription initiation site for 10A/1A/8A cDNAs" gene 629..9977 /gene="HMG-I(Y)" exon 629..719 /gene="HMG-I(Y)" /number=1 intron 720..910 /gene="HMG-I(Y)" /number=1 mRNA join(911..1074,1373..1547,3037..3136,4481..4659, 6460..6543,7219..7269,8581..9977) /gene="HMG-I(Y)" /note="transcription initiation site for 7C/2B/3B cDNAs" exon 911..1074 /gene="HMG-I(Y)" /number=2 intron 1075..1372 /gene="HMG-I(Y)" /number=2 mRNA join(1373..1547,3037..3136,4481..4659,6460..6543, 7219..7269,8581..9977) /gene="HMG-I(Y)" /note="transcription initiation site for 6A cDNA" exon 1373..1547 /gene="HMG-I(Y)" /number=3 intron 1548..3036 /gene="HMG-I(Y)" /number=3 exon 3037..3136 /gene="HMG-I(Y)" /number=4 intron 3137..4480 
/gene="HMG-I(Y)" /number=4 exon 4481..4659 /gene="HMG-I(Y)" /number=5 mRNA join(4481..4659,6460..6543,7219..7269,8581..9977) /gene="HMG-I(Y)" /note="transcription initiation for site for 11D" CDS join(4525..4659,6460..6543,7219..7269,8581..8634) /gene="HMG-I(Y)" /codon_start=1 /product="high mobility group protein" /db_xref="PID:g306869" /translation="MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPVSPGTALVGS QKEPSEVPTPKRPRGRPKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQES SEEEQ" intron 4660..6459 /gene="HMG-I(Y)" /number=5 exon 6460..6543 /gene="HMG-I(Y)" /number=6 intron 6544..7218 /gene="HMG-I(Y)" /number=6 exon 7219..7269 /gene="HMG-I(Y)" /number=7 intron 7270..8580 /gene="HMG-I(Y)" /number=7 exon 8581..9977 /gene="HMG-I(Y)" /number=8 polyA_signal 9955..9960 /gene="HMG-I(Y)" BASE COUNT 1983 a 2924 c 2968 g 2263 t 6 others ORIGIN 1 ccaccttgat catggctcgc tggtggccat taataaaaca ctttggattt cacaagtttc 61 acgtttgaat ttcacaagac ttgtatctca ccaatcagcc acacgcgggt tgacactgaa 121 aggcacatgc tgacacctgt ccgcccaggg gaaatctacc atctctctat tctagttagg 181 gggcggcgtg taggccggaa ttctctaacg agcgcgtttc tcttcacccc ctgggcccgc 241 gtggaccccc gtccaccccc acacgccctg gaggggggcc gggcccacac gccctggaag 301 cccctagagg tgactctccc tgggacccct gtaccaggga ggaaggatac cgccacccgt 361 caccaccacc ccccgccaga agctccttcg tgactcctct gcgcgtgcct tcccacacct 421 cctcgtccgg gactgcgagg agtgggcggt cgactcgagt tcgcagccca ggcctcccac 481 acgcccctcc ctgccgtcag cacccacccg cgcggcaggc cggcggctgg cgggcggccg 541 cttttaaatc cccgggctca tttgcatggc cccgccccct gagtgacacg gctggcgcgg 601 gcgggcccgt cccccctgcc cctgggtcgc tctttttaag ctcccctgag ccggtgctgc 661 gctcctctaa ttgggactcc gagccggggc tatttctggc gctggcgctc caagaaggcg 721 tgagttcgcg gccgctccgg tggcttcttt tttttatatc tataatttaa ttaaattatt 781 tatttattga ggccgcgcac gggccgtgcc cagcttcctg cccctcgcca tccttcgggn 841 nagggggaat atttttgtcc ccccgcctgg ctgtgacaca taaatacccc gcgggggcct 901 gggcggcgag cacgcggcgg cggcggtctc tgagcgcctc tgctctctct cccggtttca 961 gatccgcatt tgctaccagc ggcggccgcg gcggagcagg 
cgcgtcctca gcgcccagca 1021 ccgccgctcc cggcaacccg gagcgcgcac cgcaggccgg cggccgagct cgcggtgagt 1081 cgtccccggg gcccgcggcg gcggcggcgg aggggggcgc ccgcgccccg cccgacgtcg 1141 cccctcctcn agccgcccgg ggctgcagcg cgcgccgccg gctttattcc caggccgggg 1201 cggcgcgacc ggcggcgggg gagggccgcg cggcgcgggg gcgggcggcg gggcccggcg 1261 gcgcggggag gacgcccgca ccccttcccc gcgccgcggg cgcctgcggg gggcggggac 1321 ccgggaaagg cgcgcggtgg gggagggggc gcgcagctgc ggcgagcgcg agttgtgcac 1381 ccggcggagc ccggaggagc ccggcgcacc tcgcgcggcg gccgctcccg ctggagccgg 1441 agcccgagcc cgagcccgag cccgggcccg ggtgaggggc ggggagagac acgggctgcg 1501 gcgcggcagg agagtggggg gtcgggcgcc ccccgcagct cagggaagtt ctggaagtga 1561 gggagatggg gaggaacccc caaattcggc cctacgccct ctcggccttt ctctcgcgcc 1621 ccctccgcag atgaggccgg agcgcgggcc tgggtttggg gcttcacccc gaccccacct 1681 ctggcctggc ccgcgcctct tcagccccag gggccgggcc tgccctccgc ccccacaaac 1741 tggtttcgct gctttgctgg gctctgttgg aactagtccc tttgactccc cgtttctatt 1801 ttattttctt ctggggcttt attttgccgg ggaggggacg gagggagggg gctttcttcc 1861 tctcccccgc gcccgcgncc aggtttcctg ccccaaggcg ggcttgggtg ccccctctgg 1921 caggaagtgg ggccgggtgc acccgcgtgg cgtgcatgcg gtcggggtgc gctgctttcc 1981 cgcgcctgca ggagcgcctg cgcccctccc cccgcgcgcc ctggccccgg gtcccgccgg 2041 cgcctgcaac ggccgggcgc agacacttgc agagtcaccc tcgggccggg tctggagcct 2101 gatgcctcgg gggtttagcc ctagccgcta ggcgggtagg caaagtgtcg gattgaaagg 2161 gtctccggga acctacttct tcctgctcgc caacttgctg tgtgacctta ggcaactcaa 2221 ggcttctctg ggctactttt ttacggggga cggggaagaa aacaagggaa cccccaagat 2281 aatttcctag ggcctttcct gcagagtcct taagtatttg aggagcacag tttgcatgca 2341 cagtgtatgg aggagacagg cttgaggcca agcggcgtta ttacaaagtg agggattggg 2401 gttaggccct aaatgtagtt cctccaagac aggcctctga tgttcagaaa caggtggaaa 2461 agacgaacgt ttcttggcct cactggaaag ggcctcgggc ttcatttagt tcgtgggggg 2521 ttggctgagg ggattaaagg gtccccagca gcaagaggtg gggggaggca ccagatggta 2581 tgagagcttc cagggagacc cgccaagatc tccaggcagc cgggccagct ctctgggatt 2641 ccggactgct tcgcccaact ggaaccccct cgatgagggg gccccaagca 
gaccttatat 2701 gaggtcagca ctcgtttagt tgattaattt tgatgtttag tttgaagggg acctgtggtg 2761 taatataaga ttctaatcac tgcctgtaat tgcagagcag ccaagccact aatgacggag 2821 caggataaat caccaaccac cttgtttaaa agggtctcca agtctgaaac attataagat 2881 agaataatta ccaaagtcct gctgtctgga gggtctcttg atgcttgaaa atggaggtcc 2941 tgggccctct agaaatcaca agtttgcaaa aatgtttttt tcctctcatt ggggtaaaaa 3001 ctctttttct tctttttttt ttttctttta aattagacat gggagtccca ccgtattgtc 3061 caggctggtc tcgaactcct gacctcaagc agtcctcctg ctttggcatc ccaaagtgct 3121 gggattacag gcatgagtca cctcacctgg cccataaatc attttcaaat ggtctagttc 3181 ttattttggt tagagtgaag atttctttct attattttaa gtttttctaa actattacct 3241 agtttgatcc ttgaagcagt caggaggtag ttcaggcagg taccttataa ttaccttgat 3301 cttacagatt gagaggggtt cagaagtgga gccgcttgtc ccacccctta ccatggcagg 3361 cagggattgg ttactggaaa aatgtgtagc ctaggcctcc ggcgtctcat ccatctgtag 3421 cctggggata aggactagga taagaagatg tggaggactg tcctttttaa gcctgtaata 3481 cagggtattc ccaaatacag gttgcaggcc ctccttccaa tgagaagttt taggaaaact 3541 caaaattgat ggggttaatt aaaccccaac aggcctctaa gaagaaggaa gatgctagaa 3601 aatgaccaag aaggttttag gatgagccct aaacctcatc ctaaaggatg agggccagcc 3661 agggtctgtt aggcgggtgc atttcagttg gagaattccc tacccatctg acaaagtccc 3721 atctgactcc agttgggcag aggagggggt aaggggaaag aggagggtgg cctttggctt 3781 tgggctcact ttcctccccc actcacccac tagcttcaac catctcctcc ctcctcaatt 3841 ataccaatct gaattctcca gggatttacc ctgggtgaga acagagaaga gagctggtgg 3901 tgggcaccac ccctggctgt gtgattggct ctttttggtc ctgagtcttg caccttgcca 3961 gataggcaat attgtcagtg aggtgggcag gggatcaatg tccatagctg ccagatcgca 4021 aggtctcatt ctttccattt tcccactgaa ctgcgtaggc tggattttct cttaaagaga 4081 aagggcagca gttgactaca gaaggagggg atgctggcaa tacgtcagat ctcacctccc 4141 cacacctttg tcccagggtg gcctcaccca tgtagctagg tctgggctgg ctgtacctcc 4201 tggacccccc ccaccacgct gcactggggc ggtgttaagg caggacaaca aagagccctt 4261 ctgaacagga gatagataag aagctgatta gccaacggcc cttccccgcc cttggcagca 4321 tctgggggtg ggcagacccc tccgcatcct gggcagagta ggagcctggg ggcccatggg 
4381 agcacagttt tctgcagtgt gccttgaagg tgtgggtgat gagggggtag aaagtggctg 4441 aagcgagatg tttgtctaaa agcacttttc tgtctcccag catcccagcc atcactcttc 4501 cacctgctcc ttagagaagg gaagatgagt gagtcgagct cgaagtccag ccagcccttg 4561 gcctccaagc aggaaaagga cggcactgag aagcggggcc ggggcaggcc gcgcaagcag 4621 cctccggtga gtcccgggac agcgctggta gggagtcagg tgggtgtcca aacctttgct 4681 tcacttggtt taccctttgg ggctagggag gtgcctggag tttcattcag caggcgagac 4741 catgcatgga gggtgcagaa tgattctgcg cagtaagcgt gtgggtgtgt cctcccacct 4801 gtgtgtactc ctgccccact tgcaaccctg gtcagatctc tatagagacc ccaggagaga 4861 accgggggcc gaggaattca agataaattc tgaagccaat ggcaaacagt ccaaacctaa 4921 agcaccccag aggtcacatg gccagtgctt aaaacactcg cctttttctt aatttccaat 4981 ccactgtgga ccagtaattt taaaatgcag taaagacgaa tgaacaaaaa tgaacataga 5041 agcccagggt ttttaatcat tagattcagt gcacgtaaaa tgatttttgt caatttgctg 5101 taaagttgct gaatacttat ggtcgattct tgtacttcct catccccagg tgagaaacgg 5161 gtgcctgcgt gggccagcac ctgtttgtgc acaggccagt tatgagtagt tctggaatgg 5221 gcattgcccc tggctcttta gatggaaatt gaagcccaag aagtcaaaaa tgttggggag 5281 ttggtggcag tgaccctcag aaccctggtg ctgctctccc cacaaagtcc tcatagaagg 5341 atgaggctgc cagcctggcg ctgtggggca ccctagggag catggccatt tgggcatggg 5401 ggggacccca tattgccgtc ttacagaggc agctgtcacc caagtgaagg gtggctgttg 5461 gtgaagccca gttacttccc ttccctggga gagaggtaag aggagttgag ttcgacctct 5521 gccccttaaa catgatacag gttgttcacc tgcaggtttt ccctgtccgt acggtggaga 5581 gtggtctgtg ctgacctggg cccagggaga cacgagctgt gaggagtctg ctgtgttgga 5641 attcagagcc tggggtgggg tcatttgtcc tcttagccta gctccctcag ccctcccctc 5701 ctaccccacc tgttagccat ggtcagtggg aggtgtcagg aggggtgcag ggttactgga 5761 ccctggttat agacagacca tgggcggccc cctaggaaac agtagctggc cgccgccttc 5821 accacacacc ttatatgcat tatgcagcag ctttttattc tatctacggc ttattaaaat 5881 ttcagtggga atgatggcag ggatctctgg agtccggcta taattcccct tctctggggg 5941 ccagcctgga tgggctccag gtttgcaagt gcaggcaggg agtgtgtgtc cagcataggt 6001 catctgagag ggtgggcagt gatcatggcc cagtgctaga taaccataca ttacccattt 6061 
aatctccatg acaccctgca agggaagtct tcaccattta tagatgagga aactgaggct 6121 ccagcattaa gtgtcccacc caagataagt gtcagaactg agacttggca gagccaccta 6181 actgtctttc tgtccccaga atgcctttcc taggggtctt ctgggctctt gacattttgc 6241 atcttagaaa cctgcaggga agcaagggga gaggtgggag gcactttgca aagatgctca 6301 aactggagct ttggatcctg aagggattcc taggtcaggg atggtttgtg gttcttggtt 6361 cttgctgact taaccttgtt gaggaggcag agggctgtgt cttctgaggg ggtggaaaca 6421 ggtgatgact gtccctgact accccctctg tctttacaga aggagcccag cgaagtgcca 6481 acacctaaga gacctcgggg ccgaccaaag ggaagcaaaa acaagggtgc tgccaagacc 6541 cgggtgagac ttgagatggg actacccctg ggtggatgac tagcagagga tcctcttgtt 6601 ggcaccatgg ccacggctgt ggggaggtct gggaaggggc tcagtcatct cagttgtgta 6661 cccccttccc tggtacaggg atggcagtga gtctttgcta actggggaga gtggggcgat 6721 gactaaggtc ttacttctgg ggggttggtc ctgcccagga cactgcggag atcaggtcag 6781 aagtcccaca gctttcctga tcttagagaa gggcagagtg gggagaatgg agggtccctg 6841 ccagtccctg ggctgagaat gttaccttct cttggggttc cctggagcct gaaggggtgg 6901 ggacaggcag caaaggccca gattcacagt gacattagcc tttcagaaaa gttgttttgt 6961 tttttatttt attttttcta agacacgact catatcctct gagtcatggg cggaggtggg 7021 agtagggggc cgtgtgtgta tgtcgagggg gcaaatgtgg ctggccagtc attgccagct 7081 ggacttgggt gggcctgttg ggtgagagtg ttgggtcacc ctgaacaggc aggtaggtct 7141 gccccccatc actattgggc atcgggtgag cactgatgag cattttggac ttaggagata 7201 ttttctctaa ccctctagaa aaccaccaca actccaggaa ggaaaccaag gggcagaccc 7261 aaaaaactgg taggtgaaga agcagactgc tgcttgcctc ctgggctctt ttgagtgagg 7321 gtgttgagta cacagtacct gatgctatgc accccctatg gaaaggctct ccttgacctg 7381 ctgggacatc agattttaca gaagtccaga gaggggaagg tacctggcct gggctgtgcc 7441 catgagaagt gaggggtccc aggtataaat cagaccacat ccccctgccc tgccctgccc 7501 tagttgtgtg tgggggtccc tccctctcct gctcctagaa tactcagaac ttctagggga 7561 gatcttggaa gtcatctagc ctgtgtcccc ctcaattaga gatgaggaaa ggaagccata 7621 gggggaaggt ttgtccttcc tatgagcctc tgcagaagag aaacagcgaa ggagctgggc 7681 cctgggaggg gtcggtgctg gagttctgat gtgacccacc acactgcact ggagggcacc 7741 atccaattct 
gggnncccaa acagctggtg gaaaggctcg gtgggctgag tcaagaagct 7801 gcctctaggg ggccactgca gttagggtca ccccagcctt ccagctcctg gccctctcct 7861 acccccagcc tgccccctca aatccctgaa gctgtcattc cttgagctga gccactgctg 7921 gggtgggggg gttagggggt gctgctggcc aggccccaag agtgagtaac aggaaacaag 7981 ttgttttgga gtttgtgcct ggctcggggg ctgtagccgt gtggtgtcca cattcccgcc 8041 cagtgagtga gcccggcggc acacacttcc ccttcctccc caccccggcc tagggtcagc 8101 cctcggccac cccggagggc cagggcacca cagcacagca tctagcccct gtgggccaag 8161 gacctggttc ccctgcaccc accagcgggc tcttgcacct tccagccacc ccttcccatt 8221 tcctccccca gccacctctt ccccacctcc tcttctcccc tagggagtca gtcacatcct 8281 gaagctcatt gctgccctga gctctgccct cctgccctcc ctgggcctgg gggccaaggg 8341 ggcttggctc ctggctctgg gtgagagcag catgtgtgtg gggttttttc ctccttttaa 8401 attcttttta tgaatgaagc cgggcgcgtg gaggttgctg agtcacccac gacgactcag 8461 ccctgactca tccctcttca ggagagccag ggagtgcagg gagcgggtgg ggccagcctc 8521 tgggggtgga agagggggcc accgggccag agctcacacc aacaactgcc cacctcacag 8581 gagaaggagg aagaggaggg catctcgcag gagtcctcgg aggaggagca gtgacccatg 8641 cgtgccgcct gctcctcact ggaggagcag cttccttctg ggactggaca gctttgctcc 8701 gctcccaccg cccccgcccc ttccccaggc ccaccatcac caccgcctct ggccgccacc 8761 cccatcttcc acctgtgccc tcaccaccac actacacagc acaccagccg ctgcagggct 8821 cccatgggct gagtggggag cagttttccc ctggcctcag ttcccagctc cccccgccca 8881 cccacgcata cacacatgcc ctcctggaca aggctaacat cccacttagc cgcaccctgc 8941 acctgctgcg tccccactcc cttggtggtg gggacattgc tctctgggct tttggtttgg 9001 gggcgccctc tctgctcctt cactgttccc tctggcttcc catagtgggg cctgggaggg 9061 ttcccctggc cttaaaaggg gcccaagcca tctcatcctg gcacgcccta ctccactgcc 9121 ctggcagcag caggtgtggc caatggaggg gggtgctggc ccccaggatt cccccagcca 9181 aactgtcttt gtcaccacgt ggggctcact tttcatcctt ccccaacttc cctagtcccc 9241 gtactaggtt ggacagcccc cttcggctac aggaaggcag gaggggtgag tcccctactc 9301 cctcttcact gtggccacag cccccttgcc ctccgcctgg gatctgagta catattgtgg 9361 tgatggagat gcagtcactt attgtccagg tgaggcccaa gagccctgtg gccgccacct 9421 gaggtgggct ggggctgctc 
ccctaaccct actttcgttc cgccactcag ccatttcccc 9481 ctcctcagat ggggcaccaa taacaaggag ctcaccctgc ccgctcccaa cccccctcct 9541 gctcctccct gccccccaag gttctggttc catttttcct ctgttcacaa actacctctg 9601 gacagttgtg ttgttttttg ttcaatgttc cattcttcga catccgtcat tgctgctgct 9661 accagcgcca aatgttcatc ctcattgcct cctgttctgc ccacgatccc ctcccccaag 9721 atactctttg tggggaagag gggctggggc atggcaggct gggtgaccga ctaccccagt 9781 cccagggaag gtggggccct gcccctagga tgctgcagca gagtgagcaa gggggcccga 9841 atcgaccata aagggtgtag gggccacctc ctccccctgt tctgttgggg aggggtagcc 9901 atgatttgtc ccagcctggg gctccctctc tggtttccta tttgcagtta cttgaataaa 9961 aaaaatatcc ttttctggac tggagtctcc tgtggtgtgt gcccctaaag ccctggggca 10021 ggtaggatgg gtgacacttg tgttgtggcc cccatccttc tggctgcagg aagggagggg 10081 aaatagcact ggaatcacct ccggggaagg tcattttaag cctcttgatg aaatcaaaac 10141 ccct // LOCUS HUMHOX13G 3079 bp DNA PRI 31-DEC-1994 DEFINITION Homo sapiens homeobox protein (HOX-1.3) gene, complete cds. ACCESSION M26679 NID g341517 KEYWORDS HOX-1.3 gene; homeobox protein. SOURCE Homo sapiens (tissue library: lambda EMBL 3) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3079) AUTHORS Tournier-Lasserve,E., Odenwald,W.F., Garbern,J., Trojanowski,J. and Lazzarini,R.A. TITLE Remarkable intron and exon sequence conservation in human and mouse homeobox Hox 1.3 genes JOURNAL Mol. Cell. Biol. 
9 (5), 2273-2278 (1989) MEDLINE 89313782 FEATURES Location/Qualifiers source 1..3079 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda EMBL 3" gene join(249..810,1771..2021) /gene="HOX 1.3" exon <249..810 /gene="HOX 1.3" /number=1 CDS join(249..810,1771..2021) /gene="HOX 1.3" /codon_start=1 /db_xref="PID:g387668" /translation="MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGR YGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLP CSAVAPSPGSDTHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQA SAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNR YLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP" intron 811..1770 /gene="HOX 1.3" /number=1 exon 1771..>2786 /gene="HOX 1.3" /number=2 polyA_signal 2781..2786 /gene="HOX 1.3" BASE COUNT 740 a 799 c 830 g 709 t 1 others ORIGIN 1 acccccatcg cctccaccca actcccctat tagtgcacga gtttacctct agaggtcatc 61 aggcaggatt tacgactgga caacaaaagc acgtgattcg aagtcgtacc ccatatttgg 121 gtgcctacgt aggagggaac caagtacatg tcccagtcat ttccataatt catcataaat 181 tgtgcaaggg tgctatagac gcacaaacga ccgcgagcca caaatcaagc acacatatca 241 aaaaacaaat gagctcttat tttgtaaact cattttgcgg tcgctatcca aatggcccgg 301 actaccagtt gcataattat ggagatcaca gttccgtgag cgagcaattc agggactcgg 361 cgagcatgca ctccggcagg tacggctacg gctacaatgg catggatctc agcgtcggcc 421 gctcgggctc cggccacttt ggctccggag agcgcgcccg cagctacgct gccagcgcca 481 gcgcggcgcc cgccgagccc aggtacagcc agccggccac gtccacgcac tctcctcagc 541 ccgatccgct gccctgctcc gccgtggccc cctcgcccgg cagcgacacg caccacggcg 601 ggaaaaactc cctaagcaac tccagcggcg cctcggccga cgccggcagc acccacatca 661 gcagcagaga gggggttggc acggcgtccg gagccgagga ggacgcccct gccagcagcg 721 agcaggcgag tgcgcagagc gagccgagcc cggcgccgcc cgcccaaccc cagatctacc 781 cctggatgcg caagctgcac ataagtcatg gtaaagccag cctttttcta aatccacgcg 841 accgcgggga cgcggctctc ggtcccccct ctcttctctg ccgccctctc ccagtctctc 901 ttggtctcat ttctcccagc cctgcggagc tctccttgcc gttctccctc ccctgccttc 961 ctccttcctc ctcttctttg cgagcaccct gcgctcacaa atagtggggg aaatgggcgt 1021 
tctctgggac agtttagacg ttggaagggg gaggaagcaa aaaacccctc tggaacccca 1081 cgccttggga cgcgctcccg ggtcaggcca gccgagcaag gcgcagagag gtagaggatg 1141 gctgtagcag ccgtgaatcg ggcttgtcac ggcggataat ttatgaggag ggctacgctg 1201 gggaaacagc gttactaatt acagccccca aaaaggggct tgggggaaag aatcgaggcg 1261 agagcctgca ggattctgaa ttttgggggc aggagggaga gagaaggaaa gggaagaaaa 1321 agaaaacagg ctccccaacc ctgcaggctg gaaacgggag gcggctctcg gggctggaac 1381 tttgagggag ggtgacccga aggccacttg ggcgctcagg aaagggcctt gcttcctggg 1441 tttctgtgcg gtgggcagcc tgggagggct gtgcctcccg atcggggcgc ccggggcagg 1501 gcggaggggg caggagaggg gccagggaaa gccggagtcc gccgggacac ggccccagcc 1561 tcagatgggc agattgttcc cagggtccaa atcgtattgt tttctttcta gaaaggaaga 1621 gagaaggaaa ttcgggaggg gtgtgcgggc tggtaggcag aacttgttga gcttttcgcc 1681 tgggttccct gctcatgacc caagcttgtc cccctggcgg actttggaag acaggagttg 1741 gtggctaaac cgctgacttt tctattgcag acaacatagg cggcccggaa ggcaaaaggg 1801 cccggacggc ctacacgcgc taccagaccc tggagctgga gaaggagttc cacttcaacc 1861 gttacctgac ccgcagaagg aggattgaaa tagcacatgc tctttgcctc tccgagagac 1921 aaattaaaat ctggttccaa aaccggagaa tgaagtggaa aaaagataat aagctgaaaa 1981 gcatgagcat ggccgcggca ggaggggcct tccgtccctg agtatctgag cgtttaaagt 2041 actgagcagt attagcggat cccgcgtagt gtcagtacta aggtgacttt ctgaaactcc 2101 cttgtgttcc ttctgtgaag aagccctgtt ctcgttgccc taattcatct tttaatcatg 2161 agcctgttta ttgccattat agcgcctgta taagtagatc tgctttctgt tcatctcttt 2221 gtcctgaatg gctttgtctt gaaaaaaaat agatgtttta acttatttat atgaagcaag 2281 ctgtgttact tgaagtaact ataacaaaaa aagaaagaga aaaaaaaaaa cacacaaaaa 2341 gtcccccttc aatctcgttt agtgccaatg ttgtgtgttg cactcaagtt gtttaactgt 2401 gcatgtgcgt ggaagtgttc ctgtctcaat agctccaagc tgttaaagat atttttattc 2461 aaactaccta tattccttgt gtaattaatg ctgttgtaga ggtgacttga tgagacacaa 2521 cttgttcgac gtgtagtgac tagtgactct gtgatgaaaa ctgtgactcc aagcggtgtg 2581 tccctgcgtg cctttatagg accctttgca cgaactctgg aagtggctct tataagcgca 2641 gcttcagtga tgtatgtttt tgtgaacaaa gttacaaata ttgtccaagt ctggctgttt 2701 taagcaaact 
gtgatcagct tttttttttt tttttttttt tgtatttgtt tttaaggaaa 2761 aaatactgac tggaacaaaa aataaacttt ctattgtaag ttctcttggt ctggtttgtg 2821 ccaaatagtg agcggctctg tctgcttttc tgtctgtctg tgcagtcttg gaagctgttg 2881 ggtctgaggc tacctgagca gatgacctgt gcagggagac ctcataccaa cactgtccca 2941 tcgcttccct acctctgacc cattgcaaag ttcagggcat aaggtggaaa aagctgtagg 3001 ctgttccaaa gccccagaag acctgtccat ctctgaggaa accaagttaa cttgctgggt 3061 acaaaaanga gagaagagc // LOCUS HUMHOX4A 5834 bp DNA PRI 26-NOV-1992 DEFINITION Human homeobox HOX 4A gene for homeodomain protein, complete cds. ACCESSION D11117 NID g219879 KEYWORDS HOX 4A homeodomain protein; homeobox. SOURCE Human peripheral lymphocytes, cell line AKIBA, AKIBA genomic cosmid library, DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5834) AUTHORS Taniguchi,Y., Fujii,A. and Moriuchi,T. TITLE Cloning and sequencing of the human homeobox gene HOX4A JOURNAL Biochim. Biophys. Acta 1132 (3), 332-334 (1992) MEDLINE 93041940 REFERENCE 2 (bases 1 to 5834) AUTHORS Taniguchi,Y. TITLE Direct Submission JOURNAL Submitted (14-MAY-1992) to the DDBJ/EMBL/GenBank databases. Yasushi Taniguchi, Tokai University School of Medicine, Dept. of Cell Biology; Bohseidai, Isehara, Kanagawa 259-11, Japan (Tel:0463-93-1121(ex.2579), Fax:0463-91-1370) COMMENT Submitted (14-MAY-1992) to DDBJ by: Yasushi Taniguchi Dept. of Cell Biology Tokai University School of Medicine Bohseidai, Isehara Kanagawa 259-11 Japan Phone: 0463-93-1121 x2579 Fax: 0463-91-1370. 
FEATURES Location/Qualifiers source 1..5834 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AKIBA" /clone_lib="AKIBA genomic cosmid" /tissue_type="peripheral lymphocytes" gene 1408..4518 /gene="HOX 4A" exon <1408..1900 /gene="HOX 4A" /number=1 CDS join(1408..1900,3761..4518) /gene="HOX 4A" /codon_start=1 /product="homeodomain protein" /db_xref="PID:d1002368" /db_xref="PID:g219880" /translation="MQKAAYYENPGLFGGYGYSKTTDTYGYSTPHQPYPPPAAASSLD TDYPGSACSIQSSAPLRAPAHKGAELNGSCMRPGTGNSQGGGGGSQPPGLNSEQQPPQ PPPPPPTLPPCSPTNPGGGVPAKKPKGGPNASSSSATISKQIFPWMKESRQNSKQKNS CATAGESCEDKSPPGPASKRVRTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLT ERQIKIWFQNRRMKYKKDQKAKGILHSPASQSPERSPPLGGAAGHVAYSGQLPPVPGL AYDAPSPPAFAKSQPNMYGLAAYTAPLSSCLPQQKRYAAPEFEPHPMASNGGGFASAN LQGSPVYVGGNFVESMAPASGPVFNLGHLSHPSSASVDYSCAAQIPGNHHHGPCDPHP TYTDLSAHHSSQGRLPEAPKLTHL" exon 3761..>4518 /gene="HOX 4A" /number=2 polyA_signal 5321..5326 polyA_signal 5692..5697 BASE COUNT 1427 a 1535 c 1468 g 1404 t ORIGIN chromosome 2. 1 gtgatttgaa accaggtggg ggggagaccc tttttttttt tttttttttt taagttctgg 61 ctgttctgag catgttatag gactttcatt tcccatcaaa accttgtgct gacccaatga 121 ttgactgatt gatccactta ttaattcacc tattcaacaa gcatttattg cactaactac 181 atgcagggca ctgtgctgga tgttaggaac agtagacaaa tgacacagcc cctgcctgca 241 aggagcttac agtttagtgg gcgaatcagc caacaaaatg tctgaggcta taagtacttt 301 tccacacaga agaaaggtaa atgggcaatc ttgaagaaag taaattgtat ctggaggtag 361 agggaagccc tttccctcag ctacagttga gctaaaaaga aggaaactct tcttacattt 421 aggaaaaatt ccttctgata cttccagagg ttcaaataag ttgaacttca taaaatctgc 481 caggcgcagt ggctcatgcc tgtaattcca gcactttggg aggccaagac gggaggatca 541 caagagccca ggaactcaag accagcctgg gcaacattgt aagaccctgt ctctacaaaa 601 aaaaaaaaaa aaaaaactaa gagctggcgc agtggctcac acctgtaatc ccagcactta 661 gggaggccca gcgggttgat cacctgaggt cagagttcaa gaccaacctg accaacatgg 721 tgaaaccctg tctctactaa aaatataaaa aattagccag gtgtggtggc aggtgcctgt 781 aatcccagct actcaggagg ctgaggcagg agaattgctt gaacccagag gcgtaggttg 841 cagtaagctg agatcatgac actgcactcc 
agccttggca acaggagcga aattccatct 901 aaaacaaaac aaaacaaaac aaaacactaa gacatgtagc caggcatggt gggcacctgt 961 agctactgca gaatagctgg gactttgaaa tactctgaaa tagctattaa taccaggcta 1021 aagtgggagg atcgcttgag cccagtaaat tgaggctgca gtgagccatg ttcatgccac 1081 tgcactccag cctgggcaac aagcaagaca ctgtattaat aaataaatag ataagtaaat 1141 aaactcagat ccagcacctt gcccatctcc ccgccgtgaa gtgggtagga agcagagagc 1201 atgggctagt ccttctatat tgactggtct tgccaatgac actccctctg gggcctcttg 1261 cttttcttct gacaggtcac ctggagcctg ggggccggcc cagctctctc aggattcagc 1321 agacattgga ggtggcagtg aaggatacag tggtagtcaa tgttatttga gcagggtcag 1381 caggccctgg agcttcctga gtgcacaatg cagaaggctg cttactatga aaacccagga 1441 ctgtttggag gctatggcta cagcaaaact acggacactt acggctacag caccccccac 1501 cagccctacc caccccctgc tgctgccagc tccctggaca ctgactatcc aggttctgcc 1561 tgctccatcc agagctctgc ccctctgaga gccccagccc acaaaggagc tgaactcaat 1621 ggcagctgca tgcggccggg cactgggaac agccagggtg ggggtggtgg cagccagcct 1681 cctggtctga actcagagca gcagccacca caaccccctc ctccaccacc gaccctgccc 1741 ccatgttcac ccaccaatcc tggaggtgga gtgcctgcca agaagcccaa aggtgggccc 1801 aatgcttcta gctcctcagc caccatcagc aagcagatct tcccctggat gaaagagtct 1861 cgacagaact ccaagcagaa gaacagctgt gccactgcag gtagctccct gaggtggcct 1921 actgccagac caagccccct ccagattgac ccaaggaagc ctagtcaggg ctggaaatgc 1981 aaccttggag gtcatatgtc taaactccta ctcacgtcaa aatgttcttt tttttagtgt 2041 tcctgattgg ggtcatgacc ttgcagtgac agggtgctcc cttccattcc aggctgctgg 2101 tgctgttgct ggacaggtct tatagctatt aatagagagt gcttccttat atgggcatat 2161 ctgttttcct gggctgctaa ttataactcc attccctctt ccacccaaca gctcttcaag 2221 atttgaagat aggtattaca atccccaagc ctaggtgatt atatagccca tatcacacga 2281 atctcattcc cttaaactca taaaaactaa agtcttagaa agtaccatac taagtacttt 2341 ctaagcatta tctaatttaa tttttcaaag aaccttttga ggtaggtata tgataacatc 2401 cccattttac agataagaaa actgttagag aggataggca acttgcccaa gattctgaaa 2461 ctgcaaagtg gtggatttga atccagtcag tctggcttta ggggctgcta agcataacca 2521 tgagtctcta tttggccacc tcagcccaat tctcccactc 
cagcaaatcc ataatgggga 2581 ggtgcctgtc ctagtaggag aggatattct gggagacagt aacagccttg gatttctcta 2641 actggaaggg gagcccctcc agttgggcct tctctaggtc cacccagggc atgtagaaga 2701 taggcatggc caggaactct gagggctgtt ccttctctct gcttagactg cttggttcct 2761 gaaattttct gaccttgtgg tatctgatgt ggtttatctt caggtagatg aacttgcttc 2821 caggtccagg gcaagtttgg ggcctggggt gtggcttgct atcagggatc tggtttgcct 2881 gatgttttct ggggctgctg ctcctaggga gagggtatta tcctgcctgc aacctccttt 2941 tcctgcccct ccttcctcag ctgcaggctc aggccctccc tcccaggaga aatccatttg 3001 tcttccctgg gagggagtgg acaagcagct gagaggtggc agggtagtaa aagccagtgt 3061 tgaggctgct gctcctagca ctgtgaatac tcaaaatgct tccagcctgg ccttggactc 3121 cctaaaatac ccaggcagtg tttttttttg tttgtttgtt tggttggttt ttttttattt 3181 tttatttttt tgtttgtcac ctccctggct ttgagtgcag ttgggctgca gtggggcgcc 3241 aagatctcca tccccatact ttgctggggg ttggtggggg gctttgccca gaggccagct 3301 cctaagcaag gcaggctgga gctatttcct cttcctttcc ttctccatac cccacccctg 3361 gcagcagagg ctgggaggag tgttcaaagg agtctggccc ttctttgaca gagggaggcc 3421 ttaccagctg ctcctggtct ctcattaaac tctttcatgg cctttgggtg ggtatgggtt 3481 gatggactag gctgcagggg agagggtggg cagagtgaac tggatctcag aaggctgatg 3541 gaggtttcag gtgcgactga taggtaggcc tagtaggggg ttggtaggta ggtgaattcc 3601 cccttggaat catacctctc aaacggccct tcccctcccc agccaggatt ggaggtgggg 3661 ggagggagga ggaaaagaga accagggaag cacccctctc cagtcctgag ggtccccacc 3721 caactcactc agcgccctcc ctctctccct ccctgcccag gagagagctg cgaggacaag 3781 agcccgccag gcccagcatc caagcgggta cgcacggcat acacgagcgc gcagctggtg 3841 gaattggaaa aggaattcca cttcaaccgc tacttgtgcc ggccgcgccg cgtggagatg 3901 gccaacctgc tgaatctcac ggaacgccag atcaagatct ggttccagaa ccggcgcatg 3961 aagtacaaga aggaccagaa ggccaagggc atcctgcact cgccggctag ccagtcccct 4021 gagcgcagcc caccgctcgg cggcgccgct ggccacgtgg cctactccgg ccagctgccg 4081 ccagtgcccg gcctggccta cgacgcgccc tcgccgcctg ctttcgccaa atcacagccc 4141 aatatgtacg gcctggccgc ctacacggcg ccactcagca gctgcctgcc acaacagaag 4201 cgctacgcag cgccggagtt cgagccccat cccatggcga gcaacggcgg 
cggcttcgcc 4261 agcgccaacc tgcagggcag cccggtgtac gtgggcggca acttcgtcga gtccatggcg 4321 cccgcgtccg ggcctgtctt caacctgggc cacctctcgc acccgtcgtc ggccagcgtg 4381 gactacagtt gcgccgcgca gattccaggc aaccaccacc atggaccttg cgaccctcat 4441 cccacctaca cagatctctc ggcccaccac tcgtctcagg gacgactgcc ggaggctccc 4501 aaactgacgc atctgtagcg gccgccgcca gcccgaactc gcggcaaaat tacctctctt 4561 gctgtagtgg tggggtagag ggtggggccc gcggggcagt tcgggaaccc ccttccccgc 4621 tcttgccctg ccgccgcctc ccgggtctca ggcctccagc ggcggaggcg caggcgaccg 4681 ggcctcccct ccatgggcgt cctttgggtg actcgccata aatcagccgc aaggatcctt 4741 ccctgtaaat ttgacagtgc cacatactgc ggaccaaggg actccaatct ggtaatggtg 4801 tccaaaggta agtctgagac ccatcggcgg cgcccctgca gagggaccag agcttggaga 4861 gtcttgggcc tggcccgcgt ctagcttagt ttcagagacc ttaatttata ttctccttcc 4921 tgtgccgtaa ggattcgatc ggactaaact atctgtattt attatttgaa gcgagtcatt 4981 tcgttccctg attatttatc cttgtctgaa tgtatttatg tgtatatttg tagatttatc 5041 cagccgagct taggaattcg cttccaggcc gtgggggcca catttcacct ccttagggcc 5101 cctggtctga actagttgag agagtagttt tgaacagtcg taaccgtggc tggtgtttgt 5161 agttgacata aaggattaag accgcaaatt gtccttcatg ggtagagtca ggaagcccgg 5221 tggcgtggca caacacactt tggtcatttc tcaaaaacca cagtcctcac cacagtttat 5281 tgatttcaaa ttgtctggta ctattggaac aaatatttag aataaaaaaa tttcccagtc 5341 agaagtgtat ctgtgttaat catgcacact tcgaagcaga tcactatgcc tttatctcgc 5401 acaatcccca tggaaggccc cgaaacagac tgacactttg aaaaaaatat tttatttatt 5461 tggagtctag tgcataaatt ggttctcggg atacttgatt actttccagc tattgaaatt 5521 aggggagggg gaaatgagga tgcaaaccta agaaggttct gggcgggatg atttgggcag 5581 ggctttttga aagcagcgcc tgctttgctg tttttactgc cttctttcta gacacaatcg 5641 acttttacac tgggggcccc agagctcact tttctgcatt aagtaacaaa aaataaactt 5701 tgaaagaaaa ctgacaatca aagaaaaaac acaagagaag ttgtaagaag aattgagcta 5761 tgaaaaagct aaggtgtaga aaaagaaacc atcatttgga gacgcgtacc taagcttttt 5821 ctctaggaaa tgct // LOCUS HUMHPARS1 11551 bp DNA PRI 14-FEB-1996 DEFINITION Human haptoglobin gene (alpha-2 allele), complete cds and 
haptoglobin-related gene, exon 1 and three Alu repeats. ACCESSION M10935 NID g184327 KEYWORDS Alu repeat; endogenous retrovirus; haptoglobin; repeat region. SEGMENT 1 of 2 SOURCE Homo sapiens (clone: 5'Hp[1SB,rB] and Hp2E) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11551) AUTHORS Maeda,N. TITLE Nucleotide sequence of the haptoglobin and haptoglobin-related gene pair. The haptoglobin-related gene contains a retrovirus-like element JOURNAL J. Biol. Chem. 260 (11), 6698-6709 (1985) MEDLINE 85207676 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by N.Maeda, 28-JAN-1986. The human haptoglobin locus on chromosome 16q22 exhibits a complex set of duplications. There are three alleles of human haptoglobin: alpha-1F-beta, alpha-1S-beta, and alpha-2-beta. In all three alleles the beta chain is identical. Alpha-1F differs from alpha-1S in the number of charged amino acids, while alpha-2 has a duplication of exons 3 and 4 as exons 5 and 6, creating an alpha-2 chain that is 59 amino acids longer than the alpha-1 (F and S) chains. The whole gene (bases 821-8814) has been duplicated downstream and it is not yet known whether this copy (referred to as Hpr) is expressed in normal cells (see segment 2 of this entry and other haptoglobin entries). 
FEATURES Location/Qualifiers source 1..11551 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="5'Hp[1SB,rB] and Hp2E" /haplotype="genotype Hp-2/Hp-1S" /map="16q22.1" sig_peptide join(1041..1045,2549..2597) /gene="HP" /note="G00-119-314" CDS join(1041..1045,2549..2631,2918..3019,3780..3854, 4642..4743,5502..5576,6500..7278) /gene="HP" /note="alpha-2 allele, precursor" /codon_start=1 /db_xref="GDB:G00-119-314" /product="haptoglobin" /db_xref="PID:g386783" /translation="MSALGAVIALLLWGQLFAVDSGNDVTDIADDGCPKPPEIAHGYV EHSVRYQCKNYYKLRTEGDGVYTLNDKKQWINKAVGDKLPECEADDGCPKPPEIAHGY VEHSVRYQCKNYYKLRTEGDGVYTLNNEKQWINKAVGDKLPECEAVCGKPKNPANPVQ RILGGHLDAKGSFPWQAKMVSHHNLTTGATLINEQWLLTTAKNLFLNHSENATAKDIA PTLTLYVGKKQLVEIEKVVLHPNYSQVDIGLIKLKQKVSVNERVMPICLPSKDYAEVG RVGYVSGWGRNANFKFTDHLKYVMLPVADQDQCIRHYEGSTVPEKKTPKSPVGVQPIL NEHTFCAGMSKYQEDTCYGDAGSAFAVHDLEEDTWYATGILSFDKSCAVAEYGVYVKV TSIQDWVQKTIAEN" gene 1041..7278 /gene="HP" exon <1041..1045 /gene="HP" /note="G00-119-314" /number=1 intron 1046..2548 /gene="HP" /note="Hp intron A" repeat_region 1526..1534 /note="5' terminal repeat of Alu repeat copy A" repeat_region 1526..1833 /note="Alu repeat copy A" repeat_region 1825..1833 /note="3' terminal repeat of Alu repeat copy A" exon 2549..2631 /gene="HP" /note="G00-119-314" /number=2 mat_peptide join(2598..2631,2918..3019,3780..3854,4642..4743, 5502..5576,6500..6537) /gene="HP" /note="alpha chain A2 allele; G00-119-314" /product="haptoglobin" intron 2632..2917 /gene="HP" /note="Hp intron B" exon 2918..3019 /gene="HP" /note="G00-119-314" /number=3 intron 3020..3779 /gene="HP" /note="Hp intron C" exon 3780..3854 /gene="HP" /note="G00-119-314" /number=4 intron 3855..4641 /gene="HP" /note="Hp intron D" exon 4642..4743 /gene="HP" /note="G00-119-314" /number=5 intron 4744..5501 /gene="HP" /note="Hp intron E" exon 5502..5576 /gene="HP" /note="G00-119-314" /number=6 intron 5577..6499 /gene="HP" /note="Hp intron F" exon 6500..>7278 /gene="HP" /note="prehaptoglobin" /number=7 mat_peptide 6541..7275 
/gene="HP" /note="beta-chain; G00-119-314" /product="haptoglobin" repeat_region 9119..9424 /note="Alu repeat copy B" exon <9644..9648 /gene="HPR" /note="G00-119-316; putative; does not fit consensus" /number=1 intron 9649..>11551 /gene="HPR" /note="Hpr intron A" repeat_region 10131..10435 /note="Alu repeat copy C" repeat_region 10131..10139 /note="5' terminal repeat of Alu repeat copy C" repeat_region 10427..10435 /note="3' terminal repeat of Alu repeat copy C" repeat_region 10652..10656 /note="5' insertion target sequence for retrovirus-like element" LTR 10657..11166 /gene="HPR" /note="retrovirus-like element 5' LTR" BASE COUNT 3020 a 2679 c 2652 g 3200 t ORIGIN 857 bp upstream of SstI site; chromosome 16q22.1. 1 aactgccatc atactgaagg taatcttctg aaacttgcgg ttttctttca aaattatgtt 61 tataagatga tccatgttct tgcagagttt attcatttta tgctgtataa tattccatta 121 tatccacata caatgcagta ttgacccttc ctcctgttga tgggcatttg tcttgtttct 181 agttactttg ctattatatc agtgtcacca tgattatcca aaagtaattc ttttgtacac 241 tctaatttaa gaacaactaa ccctttttaa tgaataaatc aaccttgtat tgagttgcta 301 ctaagtttca gttgactagt acctgggata cacacaggtg cagacatttg actgagacat 361 attgattttt ctcatctgcc tatttaggct aatcaccaga ctataaaacc atgagaacca 421 ctgccattga gtatagtctg tgtcagtcta cactatagct ttaactagtt gtgtgatttc 481 ttgcaaagag caatcagaga agacacaata aacacattta ctgatttcag gctggagagc 541 ttttaagcaa tagggagatg gccacacaca aggtggagaa aattactgtg aaaaggatac 601 gtacttttct ttagagcccc acctaagcta ggctgcagaa atgtctacaa tggctttgaa 661 aaaactcaaa atgagccttt ctgcagtgtg aaaatcctcc aagataaaga gacagattga 721 tggttcctgc cgccgccctg tcctgcccag ttgctgattt caggaaatac tttggcaggt 781 ttgtgggcat agagttgcca ggtttcttgg gatttgtaat agaacatcac aagaaaatca 841 agtgtgaagc aagagctcaa ctcttaacag gggtattgtt tgtggttttg ttactggaaa 901 agatagtgac cttaccaggg ccaaagtttg tagacacagg aattacgaaa tggagaaggg 961 ggagaagtga gctagtggca gcataaaaag accagcagat gccccacagc actgctcttc 1021 cagaggcaag accaaccaag atgaggtggg tccacagctt tccctcctgc ctttcctctg 1081 gttctttatt 
tcagtctttt ttgcatacat cggtagagat gcagaaatag aacaaagaaa 1141 cgggcaaatg ggctaaatta tagtgaacca aagggcttag tgtgttaaat cttctccttt 1201 tctgcatcca tagaagacag tgctgctgtc tttcccagga gataagattt actctcagga 1261 gtgtcttttt ccttcaggtt acatttttga ctttataggg tatgtcatca gctcccgtgg 1321 taggcttcct ggcatcctga gtatatttat tagcagatat tttcctcttt aaaaatgtac 1381 aataaggaag actaatagta acacatttga atgacacaat taattgacta gtacctggga 1441 tacacactaa tacctgggat acatctaatt aaggcactta gatcttataa aaataaacac 1501 tttttgaaat gttgaaataa taagactaga aacttttttt ttttttgaga tggagtctcg 1561 ctttgtcacc aggctgcagt gcagtggcat gatctcggct cactgcaacc tccacctccc 1621 aggttcaagt gattctcctg cctcagcctc ccaggtagct gggactacag gcgtgcgcca 1681 ctatgcccac ctaatttttg tatttttagt agagacgggc tttcaccatg ttggccagga 1741 tgacctcgat ctcttgacct cgtgatctgc ctgccttggc ctcccaaagt gctgggatta 1801 caggcatgag ccaccgtgtc tggcctagaa actattttaa tagaagcaag tagtgcccga 1861 atggttggca tttgttagtg agatggtgaa ctggcagacg gcacctgtgg gtcaatgccc 1921 atggccaccg tcctgctttt ggacaccagt tctcttccag gtaaccttct ggcattttgg 1981 ggtttcagat accatttcct aaaggtgaat tattataaaa tactagaaat acccacgtgt 2041 ttaaaattat atatttaagg aaccttttat tacggaaaat atcaagaagt agaggaatag 2101 cacagtaagc ccaaagttcc catctctgag ttatctacat tttattacca ctatctttgc 2161 ggtgtctgag ggaggtttct ctttcctgga gggctcctgt attattgcca atgtactttc 2221 ctgaatgcag ccagaaactg agcccacccc tccacctatg tgcctttcta tccctctctg 2281 aagcttctgc agaattccca gcaggacagg gcttgctgga agcttggtat gctcagaagc 2341 tgctaaagtg tgtatgggca ggtgtggggg caatttcttg gtcctagcac ttccatatat 2401 cgactttctt ttctggctgc taagtgggag gagtgtgtgt gtatgcatgt gtgtgtgtgt 2461 gtgtgtacat gcatgtgtgt gtggatgcat gcatgtgctg tgaagcaggg agactagctt 2521 tccactcctc cttgtcttct ctctgcagtg ccctgggagc tgtcattgcc ctcctgctct 2581 ggggacagct ttttgcagtg gactcaggca atgatgtcac ggatatcgca ggtcagtctt 2641 tggttgggta ggagtgtgca tcccactctg accctctcgg gtctgcactc tctctgagaa 2701 cacccaattc ccccttctta tctcgacctc tgggctttca ggaccataaa gaacattggg 2761 gttcctgcca gaaatgaggg 
gagcttgcct ttccattggc ttctattcgg ggtggaagga 2821 gattgatgtg cagagcagct cccgctcatc tgacttttca cggttcactg ggaacaattt 2881 ccaaatagca aactctctgg cttctctctc tttgcagatg acggctgccc gaagcccccc 2941 gagattgcac atggctatgt ggagcactcg gttcgctacc agtgtaagaa ctactacaaa 3001 ctgcgcacag aaggagatgg taagatgtgg acaactgtct ccatgcccta catacaaccc 3061 ccttctctga catttccatg atgggtggtg ctgaggtgat tcgccagaaa gttcgttgct 3121 ctccttggag ccaggagatt tagattctaa taagcgtttt gtcgccagta gccatggccc 3181 tttgggcaga ctaacttttg tcagcctcaa gttttctgtt ttgttaaggg gaggcgatgc 3241 catgcagcct acctcatgta aatctcagag tcagatttac atctccagca gatgtgggaa 3301 aagaaggaat gctgatgatg atgtcaccct cacctagtga gtcttgctgt cctggcactg 3361 ctctaagggc tttatactta tttgctcact tagtcctcac agtatccctc tgaacagagt 3421 ttattgtttt cactttgctg ataaggaaac tgaggcacag acaggttgag tatcttgccc 3481 aaattcaggc agcctgtaag aggcagagtc aggatttgaa ccctgagccc tccctgtact 3541 gcttggctgt gaccgccatg accacagtgt gttctgctgg gcttaactgg tgtccaggca 3601 cttggcttcc agcacagcac tctttccctt cctccttctc atattctctc tcctttctcc 3661 cttcctgtct gcctcctttc ttcttcttct ttttaattct tctccttaaa tgccttctca 3721 ctctgctctg ggtgcagact tgacttttcc tttggctcat ttcttgcctt ttgtttcagg 3781 agtatacacc ttaaatgata agaagcagtg gataaataag gctgttggag ataaacttcc 3841 tgaatgtgaa gcaggtgggt gctgagcact gagcacttaa gagagcaggc aggcgtccag 3901 cggggaacgt cctagaggca cagccttcca gtgcggcttc ctctgagcac acaagagcca 3961 ggaggaggga tgtgggagaa ccgcagctgg ccagggagag acttaagcag ttaggtgatg 4021 actccctaag ggtcaccaag ggtcttgttc attggggcct gaagggcact ggctgaatcc 4081 actgtcggca ctgcccacag atcaggagag cctgtgcata cagagagcct gctagaaagc 4141 cctgggtcta aggagaagca agctccaggg agaacaagtc aaggaatgac ataaaatctt 4201 aatccatgga agcctagcag gaggctggac atgggctgga actcctgctt ctcgttatta 4261 ggaggagctg ttgctctctc ctttcattct cagaaccaga ggcaaagacc cagcctcttc 4321 tgctcttact ggtgtggaaa tgccaacctg cctcgtatta actgcaccat ctacaaaatc 4381 tgagctccag ccagtgctgc tctagattca tctttcttta gagagaatga attattgtag 4441 cccctagccc tttcaatgaa tttcagggaa 
ttgtggaaat tcctttattg ggataattgt 4501 ttaaatataa tacagttcgc gagcttctat tcggggtgga aggagattga tgtgcagagc 4561 agctcccgct catctgactt ttcacggttc actgggaaca atttccaaat agcaaactct 4621 ctggcttctc tctctttgca gatgacggct gcccgaagcc ccccgagatt gcacatggct 4681 atgtggagca ctcggttcgc taccagtgta agaactacta caaactgcgc acagaaggag 4741 atggtaagat gtggacaact gtctccatgc cctacataca acccccttct ctgacatttc 4801 catgatgggt ggtgctgagg tgattcgcca gaaagttcgt tgctctcctt ggagccagga 4861 gatttagatt ctaataagcg ttttgtcgcc agtagccatg gccctttggg cagactaact 4921 tttgtcagcc tcaagttttc tgttttgtta aggggaggcg atgccatgca gcctacctca 4981 tgtaaatctc agagtcagat ttacatctcc agcagatgtg ggaaaagaag gaatgctgat 5041 gatgatgtca ccctcaccta gtgagtcttg ctgtcctggc actgctctaa gggctttata 5101 cttatttgct cacttagtcc tcacagtatc cctctgaaca gagtttattg ttttcacttt 5161 gctgataagg aaactgaggc acagacaggt tgagtatctt gcccaaattc aggcagcctg 5221 taagaggcag agtcaggatt tgaaccctga gccctccctg tactgcttgg ctgtgaccgc 5281 catgaccaca gtgtgttctg ctgggcttaa ctggcatcca ggcacttggc ttccagcaca 5341 gcactctttc ccttcctcct tctcatatac tctctccttt tccccttcct ttttgtcccc 5401 ttttcctctt ccttttagtt cttctcttta aatgccttct cactctgcac ggggtctaga 5461 cttgacttct cctttggctc acttcttgcc ttttgtttca ggagtgtaca ccttaaacaa 5521 tgagaagcag tggataaata aggctgttgg agataaactt cctgaatgtg aagcaggtgg 5581 gtgctgagca cttaagagag caggcaggcg tccagcgggg aacgtcctag aggcacagcc 5641 ttccagtgcg gcttcctctg agcacacaag agccaggagg agggatgtgg gagaaccgca 5701 gctggccagg gagagactta agcagttagg tgatgactcc ctaagggtca ccaagggtct 5761 tgttcattag ggcctgaagg gcactggctg aatccattgt ctacatcgcc cacagattag 5821 gagagcctgt gcatacagag agcctgctag agagccctgg gtctaaggag aagcaagctc 5881 cagggagaac aagtcaagga atgacataaa atcttaatcc atggaagcct agcaggaggc 5941 tggacatggg ctggaactcc tgcttctcgt tattaggagg agctgttgct ctctcctttc 6001 attctcagaa caagaggcaa aggcccagcc tcttctgctc ttactggtgt ggaaatgcca 6061 acctgcctcg tattaactgc accatctaca aaatctgagc tccagccagt gctgctctag 6121 attcatcttt ctttagagag aatgaattat tgtagcccct 
agccctttca atgaatttca 6181 gggaattgtg gaaattcctt tattgggata attgtttaaa tataatacag ttcaccagcc 6241 agggctcaaa aatctcagta tttcccactt cctttgttag aaaagtggga aatagagctt 6301 tttgtaatgt aaacaattta aaaaacagaa ttattttaaa actgcaacta ttggaaatga 6361 gatcagcagg tggtaagggc aaagcattta aatctttcta ctttacgcag cagtgacagc 6421 cgcccatgct ttcacccctt tctcagatgg aaaggctctt gcacatttcc actcacgagt 6481 gtcttgctct ccttgacagt atgtgggaag cccaagaatc cggcaaaccc agtgcagcgg 6541 atcctgggtg gacacctgga tgccaaaggc agctttccct ggcaggctaa gatggtttcc 6601 caccataatc tcaccacagg tgccacgctg atcaatgaac aatggctgct gaccacggct 6661 aaaaatctct tcctgaacca ttcagaaaat gcaacagcga aagacattgc ccctacttta 6721 acactctatg tggggaaaaa gcagcttgta gagattgaga aggttgttct acaccctaac 6781 tactcccagg tagatattgg gctcatcaaa ctcaaacaga aggtgtctgt taatgagaga 6841 gtgatgccca tctgcctacc ttcaaaggat tatgcagaag tagggcgtgt gggttatgtt 6901 tctggctggg ggcgaaatgc caattttaaa tttactgacc atctgaagta tgtcatgctg 6961 cctgtggctg accaagacca atgcataagg cattatgaag gcagcacagt ccccgaaaag 7021 aagacaccga agagccctgt aggggtgcag cccatactga atgaacacac cttctgtgct 7081 ggcatgtcta agtaccaaga agacacctgc tatggcgatg cgggcagtgc ctttgccgtt 7141 cacgacctgg aggaggacac ctggtatgcg actgggatct taagctttga taagagctgt 7201 gctgtggctg agtatggtgt gtatgtgaag gtgacttcca tccaggactg ggttcagaag 7261 accatagctg agaactaatg caaggctggc cggaagccct tgcctgaaag caagatttca 7321 gcctggaaga gggcaaagtg gacgggagtg gacaggagtg gatgcgataa gatgtggttt 7381 gaagctgatg ggtgccagcc ctgcattgct gagtcaatca ataaagagct ttcttttgac 7441 ccatttctgt gttgtgttca gtcttgagtc ttttttattt gctcctttat ggtccagggt 7501 agtcagaagg tatagagtct actgggagta tggcagaaaa caccctaaac ccactggaaa 7561 tcccgaaggt gatacaaact cttccacctt agggaatcat gctcactgat tgagtgccta 7621 ttgaatgcta ggtcccagaa agttaactgt tgtccttgtt ttacagacaa ggaaacagag 7681 actcagagat ggtaagtgag ttgcttaagg ttacatagct atgaaacagg gaagcagaac 7741 tttgaaccca ggtctgtttg atacaaactc agaggtcctt tcactgcatg ctgttgcctc 7801 ctcaaagtga attaggagaa aaggcatggg cctggttgag gaagaggcta 
gctcaaaatg 7861 ggatggggaa aaagtgtttt aacacagaca gtaattcaga gtttgggatc taacctacca 7921 gcactccaga aaacaaaaga caatgatgta aaggcaggat ctctggactt actgagtcca 7981 aatcacagaa tggcccataa atttgctagg taatttaggt ttcctggcca tcagttcctg 8041 gtctcttcca tgaaggactt gagccaggcg atggctaatg tccctatcag caataacctt 8101 ctggtattcc ataaagagtg gggaggtaga gtccctgcat cagtagaagc tgcttttcct 8161 actgggaaca aacccaccct gcaccaggaa tcaaatgcag gctgtctgag cctcctctcc 8221 tgatggcagg tcatgctatt aagggtctcc cagaagaaag gttctggaac tctgagaaac 8281 agaagctctt ctaacagaac tttttcttgc tataagcttt caataccacc aatcaccagt 8341 gcaccttctt ctcatgccaa aatacggccc ctcctttgta gggtgtaaaa cgacactgat 8401 gtcactctct ctcaccaagg tctggccatc tctgtgaagc tctcctcaca ggcaccctgc 8461 agattcatca ataaacttca acaggttagc gccagtaggc caataagatg cacacttata 8521 ttctagaacc tgtgaaagaa gtaatagcca gaaactggaa aataacagtc aaactcttaa 8581 aaaaatgcct ctcagaacac agtcacttac accttttcct ctggtagttc tcttcagcac 8641 caagtttcct ggaagaacaa gctactcccc agttttctta aagaattgtc tgattgtatt 8701 aagtaagtgg cttctggtaa aataatggta agagaaaagc tctggcattc tcttgatcac 8761 tcatcttctc tgccttcctc cctagcaaca tctcctggcc tcagcctgtg ggcttcagca 8821 acaccaaact ccatgttgct ctctgacaag cccggccgtc tccagttctt gggctttatt 8881 tcctcggggg ccctctgctg aaattccctt ccccttgctg tattagcacc tcctacccgc 8941 cttcaacatg caggcactgc ctcctctaag gagccatctg tacttacgta gctgtgagca 9001 taggatgggg catacagcag gcacttaaca aatacttgtg gaatggaaag aaactaagat 9061 aggatcaaaa atacaaatca agatgtgaga gaaagacgat caaaagccct cacgaggggg 9121 tcgggcgtga tggctcacac ctgtaatctc aacactttgg gaggctaagg cgggtggatc 9181 acttgagggt gggagtttga gatcagccag agcaacatgg agaaacctcg tctctactaa 9241 aaatacaaaa aattagccag gcatggtgat atatgcctgt aatcccagct actcgggagg 9301 ctgaggcagg agaattgctt gaacctcgga ggtagaagtt gtggtgagct gagatcatgc 9361 cattgtactc cagcctgggc aacaggagtg aaactctgtc tcaaaaaaaa caaaaaacaa 9421 aaaagaacac cacgggaaaa tcaagtgtga agcaagagct caactcttaa caggggcaat 9481 gtttggagtt ttgttactgg aaaagataga gaccttacca ggtccaaagt ttgtagacac 
9541 aggaattacg aaatggagaa gggggagaag tgagctagtg gcagcataaa aagaccagca 9601 gatgccccac agcactgctc ttccagaggc aagaccaacc aagatgaggt gggtccacag 9661 ctttccctcc tgcctttcct ctggttcttt atttcagtct tttttgcata cattggtaca 9721 gatgcagaaa tagaacaaag aaacagggca aatgggctaa attatagtga accaaagggc 9781 ttagtgtgtt aaatcttctc cttttctgca tccatagaag acagtgctgc tgtctttccc 9841 aggagataag atttactctc aggagtgtct ttttccttga ggttacgttt ttgtctttgt 9901 agggtatgtc atcagctccc gtggtaggct tcctggcatt cggaatatat ttactagcag 9961 atattttcct ctttaaaaat gtataataag aaagactaat agtaacacat ttgaatgaca 10021 caattaattg actagtactt gggatacaca ctagtacctg ggatacatct aattaagaca 10081 cttagatctt aaaaaataaa aagacttttt gaaatgttga aataataaga ctagaaactt 10141 ttttttttgt gagacagagt ctcactctgt caccaggatg gagtgagtgg cgcgatctca 10201 gctcatcgca acctccccct ccccggttca agcaattctc ctgcctcagc ctcccaagta 10261 gctaggacta caggcgtgtg ccaccacgcc cagctaattt ttgtattttt agtagcgatg 10321 tggttcacca tgttggcccg gatgatctcg atctcttgac ctcgtgatcc acctgtctcg 10381 gcctcccaaa gtgttgggat tacaggtgtg agccaccgca ccctgcctag aaactatttt 10441 aatagaagca aatagtgcct gaatggtggg cgtctgttag cgagatggtg aactggcaga 10501 tggcgcctgt gtgtcaatgc ccatggccac cgtcctgctt tcagacacca gttctcttcc 10561 aggtaacctt ctggcatttt ggggtttgag ataccatttc ctaaaggtga attatcacaa 10621 aatactagaa ataaccacat aagtgtttaa aattattgtt aaatacagta agaaattctt 10681 cttcaaaagt ttagcctgct taagtttcct tgtcctttgt ttcctgcttt caaggccaga 10741 cttccttact ctctgtgttt cccctaccct ggtaaacaac cttcctgcca gtccttaccc 10801 atacagccca cattccacat ctgctaccca ctctgtgatt tacctctccc gtcgcaatag 10861 cctctcccac caaaactgat cttcccgcct tcccaccagt gcaaccacat tcctgcactt 10921 tgcaagttag ccaaccgggt tcggattgtg cagtccaact ccagccaatg gagtcaggac 10981 acagtagcag ggacaagctg cgttagacat aaaaacctct gctttccttt gtttagggtg 11041 ctctcgtggc aaccagactt accaggagct ctattctgca aaagtaaatt tgccttgctg 11101 agagaccctt tgtcctttgg ctcagtgttg gttcttcttt gcagcaccga gcatttgttt 11161 ccaacaaatt tggtggccca tacagggaaa acattgtcct ccgggaaagg 
gttctttgat 11221 catcctcttg agaggagaac acatcccact gtccttgttg cggtggcctc atgagtagga 11281 atcgagaccc acctgtctga tgaataaccc cagactctca acaacgtggg gagaaaaaga 11341 cttgcaacac tatggtggcc aggtaactct gtgcgcagac caaggtaaga aatgtcgcag 11401 gagtgacaaa gtacttcctt ggtggtcact atattctggt ggctgaaagt tcatgaatgg 11461 taacaagtgc tactgctgtg tggagtgaat gagtccaatc tgtgggtcta tggttacctc 11521 atacggctta gccttctctg aaggatccgg g // LOCUS HUMHPRTB 56737 bp DNA PRI 09-APR-1992 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete cds. ACCESSION M26434 J00205 M27558 M27559 M27560 M27561 M29753 M29754 M29755 M29756 M29757 NID g184369 KEYWORDS Alu repeat; LINE repeat; hypoxanthine phosphoribosyltransferase. SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N. TITLE A possible nucleotide-binding domain in the tertiary fold of phosphoribosyltransferase JOURNAL J. Biol. Chem. 258, 6450-6457 (1983) MEDLINE 83213350 REFERENCE 2 (sites) AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular abnormality in a mutant form of the enzyme (HPRT-Toronto) JOURNAL J. Biol. Chem. 258, 6458-6460 (1983) MEDLINE 83213351 REFERENCE 3 (sites) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in a patient with the Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 71, 1331-1335 (1983) MEDLINE 83213940 REFERENCE 4 (sites) AUTHORS Jolly,D.J., Okayama,H., Berg,P., Esty,A.C., Filpula,D., Boehlen,P., Johnson,G.G., Shively,J.E., Hunkapiller,T. and Friedman,T.B. 
TITLE Isolation and characterization of a full-length expressible cDNA for human hypoxanthine phosphoribosyltransferase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 477-481 (1983) MEDLINE 83169681 REFERENCE 5 (sites) AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N. TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino acid substitution in a mutant form of the enzyme isolated from a patient with gout JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983) MEDLINE 83144031 REFERENCE 6 (sites) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated from a patient with gout JOURNAL J. Biol. Chem. 259, 27-30 (1984) MEDLINE 84161915 REFERENCE 7 (sites) AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G., Hildebrand,B.M. and Caskey,C.T. TITLE Molecular evidence for new mutation at the hprt locus in Lesch-Nyhan patients JOURNAL Nature 310, 412-414 (1984) MEDLINE 84270707 REFERENCE 8 (sites) AUTHORS Stout,J.T. and Caskey,C.T. TITLE HPRT: Gene structure, expression, and mutation JOURNAL Annu. Rev. Genet. 19, 127-148 (1985) MEDLINE 86101958 REFERENCE 9 (sites) AUTHORS Stambrook,P.J., Tischfield,J.A., Khan,S.A., Sikela,J.M. and Dush,M.K. TITLE Nucleotide sequence and organization of the mouse adenine phosphoribosyltransferase gene: Presence of a coding region common to animal and bacterial phosphoribosyltransferases that has a variable intron/exon arrangement JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985) MEDLINE 85190571 REFERENCE 10 (sites) AUTHORS Hershey,H.V. and Taylor,M.W. TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia coli adenine phosphoribosyltransferase and comparison with other analogous enzymes JOURNAL Gene 43, 287-293 (1986) MEDLINE 86301884 REFERENCE 11 (sites) AUTHORS Kim,S.H., Moores,J.C., David,D., Respess,J.G., Jolly,D.J. and Friedmann,T. 
TITLE The organization of the human HPRT gene JOURNAL Nucleic Acids Res. 14, 3103-3118 (1986) MEDLINE 86176788 REFERENCE 12 (sites) AUTHORS King,A. and Melton,D.W. TITLE Characterisation of cDNA clones for hypoxanthine-guanine phosphoribosyltransferase from the human malarial parasite, Plasmodium falciparum: Comparisons to the mammalian gene and protein JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987) MEDLINE 88096579 REFERENCE 13 (sites) AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P. TITLE Resolution of a missense mutant in human genomic DNA by denaturing gradient gel electrophoresis and direct sequencing using in vitro DNA amplification: HPRT-Munich JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988) MEDLINE 88191890 REFERENCE 14 (sites) AUTHORS Davidson,B.L., Pashmforoush,M., Kelley,W.N. and Palella,T.D. TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint) JOURNAL Gene 63, 331-336 (1988) MEDLINE 88255878 REFERENCE 15 (sites) AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N. TITLE Identification of a single nucleotide change in a mutant gene for hypoxanthine-guanine phosphoribosyltransferase (HPRT-Ann Arbor) JOURNAL Hum. Genet. 79, 39-43 (1988) MEDLINE 88212418 REFERENCE 16 (sites) AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D. TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence for identical mutations in two partially deficient subjects JOURNAL J. Clin. Invest. 82, 2164-2167 (1988) MEDLINE 89067158 REFERENCE 17 (sites) AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and Caskey,C.T. TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene rearrangement JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988) MEDLINE 88218832 REFERENCE 18 (sites) AUTHORS Davidson,B.L., Pashmforoush,M., Kelley,W.N. and Palella,T.D.
TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency. The molecular defect in a patient with gout (HPRT-Ashville) JOURNAL J. Biol. Chem. 264, 520-525 (1989) MEDLINE 89079703 REFERENCE 19 (sites) AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D. TITLE Identification of a single nucleotide change in the hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale) responsible for Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 83, 11-13 (1989) MEDLINE 89093407 REFERENCE 20 (sites) AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in ten subjects determined by direct sequencing of amplified transcripts JOURNAL J. Clin. Invest. 84, 342-346 (1989) MEDLINE 89292180 REFERENCE 21 (sites) AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and Caskey,C.T. TITLE Molecular analysis of a female Lesch-Nyhan patient JOURNAL J. Clin. Invest. 84, 1024-1027 (1989) MEDLINE 89340900 REFERENCE 22 (sites) AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T. TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989) MEDLINE 89184538 REFERENCE 23 (bases 1 to 56737) AUTHORS Ansorge,W., Caskey,C.T., Erfle,H., Zimmermann,J., Schwager,C., Stegemann,J., Civitello,A., Rice,P., Voss,H. and Edwards,A. TITLE Automated DNA sequencing of the human HPRT locus JOURNAL Genomics 6, 593-608 (1990) MEDLINE 90256168 REFERENCE 24 (sites) AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T. TITLE Multiplex DNA deletion detection and exon sequencing of the hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families JOURNAL Genomics 7, 235-244 (1990) MEDLINE 90269813 REFERENCE 25 (sites) AUTHORS Bouwens-Rombouts,A.G.M., van den Boogaard,M.H., Puig,J.G., Matoes,F.A., Hennekam,M.G.J. 
and Tilanus,M.G.J. TITLE Identification of two new nucleotide mutations (HPRT-Utrecht and HPRT-Madrid) in exon 3 of the human hypoxanthine-guanine phosphoribosyltransferase (HPRT) gene JOURNAL Unpublished (1992)
COMMENT We received a sequence correction from Dr. Edwards June 19, 1991. A 't' was added between bp 22227 and 22228.
[1] sites; mutations causing gout.
[13] sites; mutations causing gout.
[17] sites; mutations causing gout.
[Gene 68, 85-91 (1988)] sites; mutations causing Lesch-Nyhan syndrome.
[14] sites; mutations causing Lesch-Nyhan syndrome.
[19] sites; mutations causing Lesch-Nyhan syndrome and gout.
[21] sites; mutations causing gout.
[9] sites; mutations causing gout.
[20] sites; mutations causing Lesch-Nyhan syndrome.
[16] sites; mutations causing gout.
[25] sites; mutations causing Lesch-Nyhan syndrome.
[23] sites; mutations causing Lesch-Nyhan syndrome and gout.
[10] sites; mutations causing gout.
[12] sites; conserved domain.
[22] sites; mutations causing Lesch-Nyhan syndrome.
[8] sites; mutations causing Lesch-Nyhan syndrome.
[3] sites; Lesch-Nyhan mutation.
[6] sites; mutations causing Lesch-Nyhan syndrome and gout.
[2] sites; mutations causing gout.
[5] sites; mutations causing gout.
[7] sites; mutations causing Lesch-Nyhan syndrome.
[18] sites; mutant reversion.
Draft entry and computer-readable sequence for [24] kindly submitted by A.Edwards, 26-JUL-1989.
Mutant Description
RJK 1780 missing intron 1 (partial) and exon 2 [25]
RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA) [7]
RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA) [8] [7]
GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [25] [7]
RJK 853 complete gene deletion [25] [7] [22]
GM 2227 inversion of exons 6-9 (no mRNA) [18]
GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated mRNA [7] [18]
Connersville missing intron 7 (partial) and exon 8 [19].
FEATURES Location/Qualifiers source 1..56737 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq26.1" repeat_region complement(41..258) /note="Alu repeat copy A" repeat_region 261..321 /note="short interspersed repeat copy A" repeat_region complement(694..1012) /note="Alu repeat copy B" repeat_region 1013..1054 /note="short interspersed repeat copy B" mutation 1664..1678 /note="ggccggctccgttat in wt; gt in gout RJK 951 [23]" gene 1677..1703 /gene="HPRT" exon <1677..1703 /gene="HPRT" /note="hypoxanthine phosphoribosyltransferase" /number=1 CDS join(1677..1703,14780..14886,16603..16786,27892..27957, 31618..31635,34938..35020,39816..39862,40034..40110, 41455..41502) /note="hypoxanthine phosphoribosyltransferase" /codon_start=1 /db_xref="PID:g184370" /translation="MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMD RTERLARDVMKEMGGHHIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRL KSYCNDQSTGDIKVIGGDDLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKV ASLLVKRTPRSVGYKPDFVGFEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA " intron 1704..14779 /note="HPRT intron A" repeat_region complement(2869..2902) /note="short interspersed repeat copy C" repeat_region 3078..3107 /note="short interspersed repeat copy D" repeat_region complement(3342..3532) /note="Alu repeat copy C" repeat_region complement(3343..3982) /note="Alu repeat copy D" repeat_region 4228..4254 /note="short interspersed repeat copy E" repeat_region 4463..4494 /note="short interspersed repeat copy F" repeat_region complement(5937..6232) /note="Alu repeat copy E" repeat_region complement(6288..6524) /note="Alu repeat copy F" repeat_region 7504..7807 /note="Alu repeat copy G" repeat_region complement(8408..8713) /note="Alu repeat copy H" repeat_region complement(9169..9477) /note="Alu repeat copy I" repeat_region 9699..10142 /note="Alu repeat copy J" repeat_region complement(10293..10407) /note="Alu repeat copy K" repeat_region complement(11254..11312) /note="LINE repeat copy A (partial)" repeat_region 12154..12310 /note="Alu repeat copy L" 
repeat_region complement(12998..13302) /note="Alu repeat copy M" repeat_region complement(13314..13638) /note="Alu repeat copy N" repeat_region complement(13923..14238) /note="Alu repeat copy O" repeat_region complement(14239..14610) /note="Alu repeat copy P" mutation 14778 /note="a in wt; t in Lesch-Nyhan RJK 1760, no [25]" exon 14780..14886 /number=2 mutation 14808..14809 /note="tt in wt; ttt in Lesch-Nyhan Chicago [19]" mutation 14874 /note="t in wt; c in Lesch-Nyhan Detroit Leu->Pro [19]" mutation 14877..14879 /note="tta in wt; ta in Lesch-Nyhan RJK 1939 [25]" mutation 14886 /note="g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys [25]" intron 14887..16602 /note="HPRT intron B" repeat_region complement(15843..16090) /note="Alu repeat copy Q" repeat_region complement(16293..16518) /note="Alu repeat copy R" exon 16603..16786 /number=3 mutation 16619 /note="c in wt; g in gout Toronto Arg->Gly [2]" mutation 16623..16664 /note="atgtgatgaaggagatgggaggccatcacattgtagccctct in wt; at in Lesch-Nyhan RJK 2108 [25]" mutation 16675..16676 /note="ag in wt; agg in Lesch-Nyhan RJK 866 [23]" mutation 16676 /note="g in wt; a in Lesch-Nyhan, HPRT-Utrecht: Gly->Arg" /citation=[25] mutation 16677 /note="g in wt; a in Lesch-Nyhan New Haven Gly->Glu [19]" mutation 16679 /note="g in wt; c in Lesch-Nyhan Yale Gly->Arg [20]" mutation 16680 /note="g in wt; t in partial HPRT deficiency, HPRT-Madrid: Gly->Val" /citation=[25] mutation 16690 /note="c in wt; a in Lesch-Nyhan Flint RJK 892 Phe->Leu [Gene 68, 85-91 (1988)] [23]" mutation 16707 /note="a in wt; t in gout Arlingen Asp->Val [19]" mutation 16756..16759 /note="tgta in wt; ta in Lesch-Nyhan RJK 1332 [23]" mutation 16780 /note="c in wt; a in gout Munich Ser->Arg [13] [6]" mutation 16784..16787 /note="tgtg in wt; tg in Lesch-Nyhan RJK 1747 [25]" intron 16787..27890 /note="HPRT intron C" unsure 17479 /note="polymorphism or cloning error" repeat_region complement(18242..18284) /note="LINE repeat copy B (partial)" repeat_region 
complement(18439..18783) /note="Alu repeat copy S" repeat_region 18962..19084 /note="Alu repeat copy T" repeat_region complement(19104..19397) /note="Alu repeat copy U" repeat_region complement(19778..20064) /note="Alu repeat copy V" repeat_region 20486..20636 /note="Alu repeat copy W" repeat_region complement(20696..20990) /note="Alu repeat copy X" repeat_region complement(20996..21293) /note="Alu repeat copy Y" repeat_region 21582..21635 /note="short interspersed repeat copy G" repeat_region complement(21878..22171) /note="Alu repeat copy Z" repeat_region complement(23461..23773) /note="Alu repeat copy AA" unsure 24904 /note="polymorphism or cloning error" repeat_region 25311..25427 /note="Alu repeat copy AB" repeat_region complement(26280..26389) /note="Alu repeat copy AC" repeat_region complement(26844..27092) /note="LINE repeat copy C (partial)" exon 27892..27957 /number=4 mutation 27898 /note="c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG [25]" mutation 27912 /note="c in wt; t in gout London Ser->Leu [1] [17] [25] [10]" intron 27958..31617 /note="HPRT intron D" repeat_region complement(28395..28431) /note="short interspersed repeat copy H" repeat_region 28924..28982 /note="short interspersed repeat copy I" repeat_region complement(29561..29864) /note="Alu repeat copy AD" exon 31618..31635 /number=5 mutation 31622 /note="t in wt; a in Lesch-Nyhan Midland RJK 896 Val->Asp [14] [23]" mutation 31623..31625 /note="ctt in wt; ct in Lesch-Nyhan RJK 2019 [25]" mutation 31625 /note="t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser [23]" mutation 31629 /note="t in wt; g in gout Ann Arbor Ile->Met [16]" intron 31636..34937 /note="HPRT intron E" repeat_region complement(33149..33619) /note="Alu repeat copy AE" exon 34938..35020 /number=6 mutation 34963..34971 /note="tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK 1210 Met->Lys [23]" mutation 35016 /note="g in wt; t in gout Milwaukee RJK 949 Ala->Ser [19] [23]" intron 35021..39815 /note="HPRT intron F" repeat_region 35563..35605 
/note="short interspersed repeat copy J" repeat_region 36576..36646 /note="Alu repeat copy AF" repeat_region complement(37301..37559) /note="LINE repeat copy D (partial)" repeat_region 38614..39040 /note="Alu repeat copy AG" repeat_region 39092..39389 /note="Alu repeat copy AH" repeat_region complement(39428..39452) /note="LINE repeat copy E (partial)" exon 39816..39862 /number=7 mutation 39838 /note="c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA [23]" mutation 39843..39847 /note="tgttg in wt; tg in Lesch-Nyhan RJK 1894 [25]" mutation 39859 /note="g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr [25]" intron 39863..40033 /note="HPRT intron G" mutation 39867 /note="g in wt; a in Lesch-Nyhan RJK 1934. no [25]" exon 40034..40110 /number=8 mutation 40035..40039 /note="tgttg in wt; tg in Lesch-Nyhan Michigan RJK 855 [19] [23]" mutation 40081 /note="g in wt; a in Lesch-Nyhan Kingston RJK 2188 Asp->Asn [25] [6]" mutation 40096 /note="t in wt; g in Lesch-Nyhan New Briton RJK 950 Phe->Val [19] [23]" mutation 40103 /note="a in wt; g in gout Ashville Asp->Gly [21]" intron 40111..41454 /note="HPRT intron H" mutation 40115 /note="g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no [25]" repeat_region 40623..40926 /note="Alu repeat copy AI" repeat_region 40953..41263 /note="Alu repeat copy AJ" mutation 41451..41453 /note="ata in wt; ttt in Lesch-Nyhan RJK 906, GM 1899, downstream cryptic splice site used" exon 41455..>41502 /note="hypoxanthine phosphoribosyltransferase" /number=9 mutation 41455 /note="c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079, His->Asp [25] [23]" mutation 41462 /note="g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr [23]" mutation 41487..41510 /note="aaaatacaaagcctaagatgagag in wt; ag in Lesch-Nyhan Evansville RJK 894 [19] [25]" repeat_region 42641..42937 /note="Alu repeat copy AK" repeat_region 44043..44109 /note="short interspersed repeat copy K" repeat_region complement(44487..44777) /note="Alu repeat copy AL" repeat_region complement(46306..46628) /note="Alu repeat copy 
AM" repeat_region 46629..46689 /note="short interspersed repeat copy L" repeat_region 47521..47764 /note="Alu repeat copy AN" repeat_region 49258..49552 /note="Alu repeat copy AO" repeat_region 49706..49994 /note="Alu repeat copy AP" repeat_region 50003..50313 /note="Alu repeat copy AQ" repeat_region complement(50775..51067) /note="Alu repeat copy AR" repeat_region complement(51101..51318) /note="LINE repeat copy F (partial)" repeat_region complement(51688..51716) /note="short interspersed repeat copy M" repeat_region complement(51979..52277) /note="Alu repeat copy AS" repeat_region 52408..52675 /note="Alu repeat copy AT" repeat_region complement(54552..54654) /note="Alu repeat copy AU" repeat_region complement(55165..55466) /note="Alu repeat copy AV" repeat_region 56524..56690 /note="Alu repeat copy AW" BASE COUNT 15689 a 11281 c 11599 g 18168 t ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26. 1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt 
ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg 
tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg 
tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 
6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 
gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg 
ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 
agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat 
aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg 
ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag 
ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 
17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa 
ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt 
tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtatttg tgtgtgtgtg tgtgtgtgtg tgtctcaaaa 22261 ggtatcaaaa gagaatagct ataactttag tgttgatctt gatagtgact tgattaggct 22321 ctgtttaaca tcaaagatgc aaattaatac tttctttgaa catattaaaa atgcagaaaa 22381 tattggagta ttttatttta aataaattgt attctgtata tttaaggtat acaacatgat 22441 gttatgggat acatataggt ggttaaaaga ttactgcagt gaagcaaatt aacgtatccc 22501 tcaactcaca tagttaccca tttttttttt gttttggtgg caagaggagc ttaaaatctc 22561 atttagtgtg aatcccaaat acagcacaat tttattacct atatacttca tgttgtacat 22621 tatatttcta gacttgttca tcctacatat ctgctacttt gtatcctctg agctacatct 22681 
ccccattttc tcacttgccc cccaagtagt ttcttaaagt gtctcatgta agagggcagt 22741 agctttcagc ttaaactttt tctctgtatg tagtcgattt ctttgaggta tacttttctc 22801 tccagaatag ttagatgtag gtataccact ttgatgttga cactagttta cctagaactt 22861 atcttctgta aatctgtctc tatttccatc tctgtctcca tctttgtctc tatctctatc 22921 tgtctatctc tatctatcta tctatctatc tatctatcta tctatctatc tatctatcta 22981 aagcaaattc atgcccttct cctatttatt gaatcgagac catagacagg ggtgagagaa 23041 agaatttggc aggaatgggg atgtgtatta tctgtggcat aaggaaactt tacagaacta 23101 ggttcaaaag tatactttct agttctttcc catggctttt cactttgatg tagtccttat 23161 caggtaactg aggttttata taagtcccct gattcttaga acatgaaggt gtagtagtca 23221 aggttggtcc cttgaaacca caaattttgt gaaaaaaaat taagaaaatt tgaataattt 23281 cctcagcaaa tacatattga tcatctgtta tacagccatg agaagtggtt ctgttgcaca 23341 cgtttatttt atcagatcct aatcccaaac caggcataaa atggaaacca tgaagatagg 23401 atgaaataac ttctgaatgt ttgaatgttt gaaaatagtg tacttaaaaa taccaggtgg 23461 tttttgtttg ttttttgttt ttttcttttt ttgagacagg gtctcactct gtcacccagg 23521 ctggagtgta gtggtgcaat ctcatctcat tgcagtcttg acctcccagg ctcaggttat 23581 ctcccacctc agcctcccaa gtagctggga ctacaggcac atgccaccac gcccagctaa 23641 ttttttgtat tttttgtaga gacggggttt caccctgttg cccaggctgg tctagaactc 23701 ctgggcttaa gcgatcctcc cacctcagcc tcccaaagtg ctaggattac aggcatgagc 23761 caccatgcct ggcagaaaat accaggtttt taagtatcag cacttactct tcaatctttt 23821 ctattactat gttgtgctaa atggtatttt ttatttaatt agagcaatgc tgttcaatag 23881 aactttcttt gaggatggaa atcttttatg tttctgctat gtggtacaga gccactagtg 23941 acatgtggct tttgagcgct tgacacatct tgtgcaacac aggaactgaa tttttaagta 24001 atttatattg ccacatgtgg ctaccgtatg ggacagtgta gtactagatg atctgtaagg 24061 gctgtgcttc atcagtgtcg ttttttaact gacaaaaacc tttagttttt tttttagtaa 24121 tgtgtttatt taaaagaatt cataaaatac aagtaaacaa attaacttgt tacctgagca 24181 tatgtccttt catacttatt ttttctgcat acatattttg gaaaatggaa tatctgcccc 24241 ttttttttta tctgagatac agtctacctc taaaaataca tgattctaac attctcactt 24301 tttgttggca tttgatcagg gtatagaaaa acagttaaaa ggacagagaa 
tggttgagag 24361 attatgatat gaagagaaaa tgtgattgag tgtggtagac ttggggcctg cttgaatgtt 24421 gagagaatga ctgttttccg ataaaaaaaa aaagtccatt ctaggatcct aaaagaaggg 24481 tctgaagttc actgcagaaa gcaagctaca tagtactaag ccactaaggg gacatggagc 24541 ccttagtaat tcctacctta gtaatagtct catcatgccc tcttgggaac ccagccttgt 24601 tgattagcct ctctgctttc tctccttata gttcaacctc cctgtttgtt ccaagcagtt 24661 cttttcctgc ccatttatta tgcatttcta tacagctttc ctcctctttt tctataccat 24721 gctgcagttc ttattgctac ctagaggttt tcaaaattcc taggggcgga taagtaggca 24781 taaacaaagt tcttccctat tatccttcct attttttcac ctagactgaa gaggtagaca 24841 aaatagaaat aaagacatta agggtatgtg tttgtagtcc caaagagctt ctctggcaat 24901 tttgatgtag ttgacagtga cgctctgagt tcaggacaga ttggactcct tggctgagag 24961 gagtgaggag ataggacggt agaggagagg gtagagcaac tctggaggaa gctttcccct 25021 cacctttgcc agtcctgtta tcctagactt aaccataatt aaagatgagg gaggcactca 25081 gtaaagggat ctagtgggaa gcttgttcca gacagccaag gagggaggtt cgcgcagttc 25141 ctttggccac ccaggtgggg taattgatcc atgtatgcca ttcatgtaca atgtaggcac 25201 ttatacctgt attccaatgt agtgaactat accattactc ttaaattaat attctttatt 25261 agcttccatg gtggctatag gccaggcaag agagttaaga aaaaataaat agccaggtat 25321 ggtgactcaa gcctgtaatc tcggcacttt aggaggccga ggcaggagga tagcttgagt 25381 ccaggagttc aagaccagcc tgagcaaaat agtgagatcc tgtctctatt ttttaaaaaa 25441 gccttggggc aaacaggagt atggaggttt ggatgctaat agaacagcag tgtcttactg 25501 cttggagttc tcttgtttct tgtcctatca ccgtagcctt tggatcacag caatttttcc 25561 atgactccat acttttcagt tcttgaatat tttttccttt attcctcttg tctctgtaaa 25621 gacatcaact ggagttggac tgtaatacca ggtatctcca gaagatggca ctatttaaca 25681 gattttataa ataatttgat gtgagtcact gtcatctgaa gcttgttgcc ttttctttct 25741 ttcttctttc ttttttttcc ccatcaattc tgtatgtttg aaatgctggg atttaagtta 25801 gttagaataa gggatgtctg taatttccct aaattgagaa gtaatatgca aaggttgata 25861 tcagaagtca tatgctcacc ttgcaacacc aaataatact ggcccatttg tgatttttga 25921 aagtaacact ccataataaa tggatgtata tatagaagca taacaaaaat agaagcacat 25981 aaaagtgaaa agtctcataa acgccattgt 
cactactcat gtaattgctg ttacaaattt 26041 gtttaaatgt tgaataaaaa tggtgtcata ggcaacacag tgttccacta cttggtgttt 26101 ttaatagcat tattctgtct cagtgtgctt tggattatca ggtgcttttt aatagttgca 26161 tggtattaca ttgtgtagat gaacttgatt aatttaaatg gttccctgtt aatggacatg 26221 ttggtttgtt tttgtgaaca actgatacag tgaacattta ttttttaaat aaaaaaaaga 26281 gagacagggt cttgctgtgt ttctcgggct ggccttgaac tcctggggtc aagcgatcgt 26341 cttgcctctg cctccctggg attacaggca tgaagccacc gcacccggcc cagtgaacac 26401 tcttgaatgt atctttgtat acttgtcaag tgtttttgta gcaattgatt cccagaagtg 26461 ggaattacat ggaattaagt gacatgcatg tttgcaattt taacaggtat tgctatgtca 26521 ttttcaaaag aagctatgcc aattaatact ctcaccaaca agagtgctta tttcccctca 26581 gcatattatc aggcttaagt tttgccagta tgggtgggag aacagtagaa tcacattgtt 26641 ttagtgtttg tttctcagat agatataatt ttacacctta taaccttctc ttctataaat 26701 tgtctatttg tgttcattct ccattttcct atgggttctt attgttggag cccaatatat 26761 aaaagggggt atttgttaca gaacctcttc agttttggtt catgtcatgc ctgggttttt 26821 accctttcta cggatgttaa aaaaaattct ctattttctt ccagtccact tatggcttta 26881 ttttttacat ttagatttta atccgtctgg aatttatttt tgtgtatgct gtgaggtagg 26941 gaccatactt ttattttttc ccaaatgggt tactagttgg ccaaacatca tttattgaat 27001 aattcatctt ttccctactg actcgaaata ccatctttat tgtatactaa atcctcatat 27061 agttctgggt ctgtttctgg gctctacttt gttcatttac tgtgctggta ctgcaccgtt 27121 gtaattgctg tggctttgtg gtatggtatg gcttgctctc tgctagggca agtcgaagct 27181 cttttgttca cctgctcttt cacccaaatt ttctgtcctg aatccagcac agccaaatta 27241 tggtcattgt caccaccaac tacagtgggt gttgagcatt tcccattgaa tctcctgtaa 27301 gggttttatt ggattctgtg atagcagtaa aatgggagcc taagaggtat tccttaaagg 27361 actactaatc agacctggtt tcccagatga tgctgaagat gacggggcct gggctagact 27421 tttgagggac atatccttgg ggttgggtgt gatatagacc agcccttaca atttgcttga 27481 ctcatgggaa tcgtacaggg ccagaaccag acacctgtca tgctaataac ttccctcaca 27541 attcagaaat cactgtgatt gaagatgggt ggctgttata atactaccca cttaaaaatg 27601 gatgtaaccc attttttagg actcttaaaa acatcaaatc agtaatggcc gattaggact 27661 ttttaatttt 
tactaatctc tacttgaaag ttttctagtc attcatttca ggaaacctaa 27721 ttcttataat tcatatcatt tagaatatca taatgctatg gatattagct agctaacttc 27781 tcaaatcttc tagttctcat ttaatttgaa gtttgtgtgt gtacataagg atatacatat 27841 acatatgtgt gtgtagatat atatatatat agtttttttt tttttaacta gaatgaccag 27901 tcaacagggg acataaaagt aattggtgga gatgatctct caactttaac tggaaaggta 27961 tgtatcttga aagggaagaa aaaaaagcac ttcataccga gtcaattagt aacagtgtgc 28021 tttcaatcaa tcactaagag ataatttaca tagtataact aaatgggtta tttaaccctt 28081 ggaagcagtc taggttaatt atcgttccct aggtcatgta gtaaaaagac agtagaatcc 28141 aacattaacc ttaaatgtcc atattgtcaa gtactgctgt ctgcctctgt gggactctaa 28201 tttgggatcc ttcaaaaaac attgatgggg gaaaagatag cctttaaaaa aaaaaaaaaa 28261 acaaacctat gtgagtctat gtgaggtaga ctcacatagt ttcctaaaag atagcaaagc 28321 agtattatgt agtggctgaa agtgtgagtt ccggagcctg acaactgatt caaagcatgg 28381 cttagtactt cctaactctg accttgggca agttacttaa cctctctgtg tcccatatgt 28441 gattagggtg aggttgataa tagcagccat agagttaaga ggattaagtg ctataatgca 28501 agtagagctc ttacaacagt ttctggtaaa tcactcaata aattcagaca tactattatt 28561 ttaagaaatc tcaaagagtt ttcttgtacc ttaaaattct cctagtgtga accattggtt 28621 ttggtatatt gtgcttccat gtagtttaat atcaagatgt ttttagattt cccttttaat 28681 ttatttgttg acccattggt tgttcaggag catgctgttt acctgaaaat aatggagata 28741 ttaaggtatt tgaatattta tcttctagta cattgaaaaa ctttttgaga gtaaccaata 28801 ataaatgatg gaatgctact gctttttttt tttgaagctg ccagttattg tttacttaca 28861 ctatgccaaa tataaaggca ttaatctcat aaaagtttca caacaatcct gtgagggaga 28921 cgatatcccc attttacaaa tcaggaaatt aagacttaat aaggttaaaa gacttgcccc 28981 aaagtcacag aaccagtaag tggtagagct tgaatttgaa tacagacctg actctaaagc 29041 tcttttcttt ctttagattt tagtgttcat tgcttacttg aatgagtatc tataagaaaa 29101 ctttaacatg taaaacttct gtgaaattat cttgtcccat atcagggtca tgtcaaacta 29161 atgtcctcct cagcatcttt ggaaaacttc agaggagaaa tgagctttgc ccctcctgtt 29221 catttcatat accactgtta gacctgtcct tccctttcag catgctttgt ccatatttag 29281 aagctgttga agccattact tgtctggtca gtttttagtg ctggaatgga cctagccttt 
29341 taggccttct gagatttagt ttgatctcgt ctttcccacc taatggctct gttctactac 29401 atagatttga tctgaaacag ttctctgttt ctaaaataac tttcttttca tgatagtcac 29461 agtaaagtac atttattatg gaaaaatcaa taagtataac gagtgaaagt tatttcttgg 29521 tggtaagatt atgggattat ttgaactttc tgtttcattg tattttattt atttatttat 29581 ttttgtgatg gagtctcact ctgctgccca ggctggagtg cagtagtacg atcttggctc 29641 actgcaacct ccccttccca gttcaagtga ttctcctgcc tcagactccc aagtagctgg 29701 gattacaggc gcacgccacc atgcctggct aattttttta tctttagtag agacagggtt 29761 tcaccatgtt gaccaggctg atctccaact cctgatctca ggtatccacc tgcctcagcc 29821 tcccaaagta ccgggattac gggtgtgagc caccctgcct ggcctcattt tgtcttttgg 29881 gggtattttt gtgtgcagat atatatgtat ataaatattt ttccctcttt tccccagtta 29941 gtatttgagc agatgaactt tggacccgaa tacctgtatt caagtctcta ataccacttc 30001 ttggctattt tcattttatc aaatggcctc ttatcctcgt ttttctcatt tattaagtag 30061 agatgtaact acttgatata attcaaaaac tcaataatgg cattcttttg ttttttagac 30121 tctagtgtct gtactccttg taccatgctg ggattcattt gaacaattgc atggcttttt 30181 tagtgtatta ttaaatttgc agtttactta gaatttactg ggacctcata caaatgggaa 30241 aaaaacataa ctgtgttact catttgctgt gtgcctttgg attgacccta ttttttgtat 30301 tcattttctc cccatgtcct gagttccact ttgaataaaa aagtaatttt tttcctgcct 30361 gtaaaatagg ctaccaatag gctgcagttg tctatagtag ctgcttcact gaggagagct 30421 cagcatgaga gaaatagtat gaattgcttg ccacaagtta tgggctagcc ttacttcatt 30481 ctgtacttgg acctgtttag gcttctaaga gatcttacct ccaacaataa actgctttga 30541 gacatgaaaa ggtggaagct ttacttggtt ataactttac ttttaatacc tagaacagtg 30601 agtcttcaaa cttgtatttg catgcccaat ttataaaaag tttcctgagc atttacccct 30661 aatatatgca ttttaaatta tatatgattt atggtaataa taatatatat gttacaaaat 30721 acatacaaaa atatagatta aacaaggtga ggttaaaaaa tttaaaagtt ctaatctttc 30781 ttgcaaacca gtggatcttt tgtgccttac tctggtaaac actgtcttag aagaatatat 30841 agaacattaa aatcttaatg ctatagttat atgacagagt atgatgagag ctacagataa 30901 acaacacatc atgaatcttc ttgtggcagt gtttataacc attatgtgaa atgctgcctc 30961 attcttataa ctagcataag aacagatagg actttctcga 
ttttgagggg taattattag 31021 atggtatttt ctgttaagga ctcttccagc tataaaattc ttaaatgtag aaagcgaagt 31081 gagggtttat ggtgagagga agcattggta tcatgtttta gtgtagtcca agaatatgga 31141 cacatccaga aaatgcagat caagtttagc ctaatgagaa aatatatttt ggagtccata 31201 tggtaaatta aattatgtga tttttgagtt attgtacaaa tataattctt agaatgttag 31261 agtcaggaga ctataagaga ccaactgctt caagtttcat ttaacacatg ggaaactaag 31321 gcgagagaaa tttcaagact tgcccaagat tagacctctt gttaagtaat gaaagtgttt 31381 taaaaacagg tgggtcaaat tctgttttta aaatttccat tatgatgaaa atttcagtat 31441 tacaggcttc caaatcccag cagatgggcc acttgtttaa aggagagttt gatataataa 31501 agcatctaaa aacaagagtt tggataattc cttagggttg ttatgatgtg atttgactta 31561 taattggaaa taccgtttta ttcattgtac tgattttcat ttctcttttt cttctagaat 31621 gtcttgattg tggaagtaag ttcacattta cttttaatat aacatttatg acttttctaa 31681 cttagtatgc accatcctaa aggtaagcca gggagagaaa ttcctctgca tcagttttaa 31741 tggtgggctt gtgttctaaa ggagtgagat tggttttttg taaagactac ttagtaattt 31801 gtttttacca ataatggaat ggtatacttc ctacctctct ttttttagtt tgaagtattt 31861 tctttctaaa cataactctc tctctctatt tatctatata taatatatac atatatatct 31921 tatattttat gtatatatat atatatcttg cttagatttt gtcttatgta atatttggta 31981 cataaaaaat aatatttata atttatagac tattttccat gtgttattat gtgctaaagt 32041 attttgtatc ttagcaccga gaggctaagc agtttcctag ggttaccagc tagtaaacta 32101 agggaaacct ttacttcctt tagctcagtg gttctcaaaa tgtggttccc tagaccaaaa 32161 gtattaatat cagacaagaa cctaccgaat caaaatatct gtgatgaggc ccagcaagct 32221 atgctttaac aagtttccga gtgattctga tgcatgctaa ggtttaggat cccttgtttt 32281 tactcataag tcactttctc attaaggcct tccctggcca tcctatataa aatctcatgt 32341 tttcacaccg tcaacttcgt attcctcctc aatactttta ttttcctgat cacttatcac 32401 taacagcctc tctctctctc tctctctctc tctatgtata tatatatata tatcacttat 32461 cactgtctaa cagcctctct ttatatatat ataatctata gattatatat atatgcagca 32521 ttgtgcaatc attatcacgc tcaattttaa aacattttca tttccccaca aagaaaccca 32581 atccccttag ccatcactcc caattttccc ttcccccagc acctagcaaa ctgatcatct 32641 acctacttgc tgtctataag 
atttgcctat tctggacatt ttgtataaat agaatcatac 32701 aatatgtggc cttttgtatc tggcttctct cacttaatgt tttcaaggtt cattcatgtt 32761 gtggagtata tctgcactca tttccttttt attgccaaat tgtatggata gacaggtgtt 32821 cctcaactgt gtcctgataa acccatctga agttgaaaat atcataagtt gaaaatggat 32881 ttactacttt gataaatcta tcctaaagtc agaaaaatct catgttggaa ccatcgtaag 32941 ttggatacca tctgaattac atttttgtta tccattcact ggttgacaga cgttaggttg 33001 tttccactga tgctccttat ttctcgtacc tgaaatgtcc ttattccctc ccttcttatc 33061 ccatgtttaa gtcatttaag acccagctca aacgtcacct ccacaaaacc ttccttgata 33121 cccctttcct cttcaattca cttggacctt ttgcatttaa ttttaatttt tatttttttt 33181 aagacagagt ctcactctgt caccaggctg gagtgcagtg gtatgatctc agctcactaa 33241 ctactctgcc tcccaggttc aagcaattct catgtctcag cctcccaagt agctgggact 33301 acaggtgtgc gccaccatgc ctggctaatt gtgtgtgtgt gtgtgtgtat gtatgtatgt 33361 atatatgtgt gtgtgtgtat atatatatat acacaaacat atataaatat atatacatat 33421 atatatatac acacatatat aaatatatat acatatatat atatacacac acacacacat 33481 atatatatat atagtttttt tttttttaag tagagatggg gttttgccat gttggccagg 33541 ctggtctggc ctcaagccat cctcccacct cggcctcgca aagtgctggt attataggca 33601 tgagccactg tgcctggcct gcatttcatt ttaattataa aatattttga actcagaaaa 33661 aagggtatgc tgaataccta cgtacccaca aaagtattaa cattttgcca tatttgcttc 33721 tgatcttatt ttttttgaga aattaaagat cataatacaa ctaaagcccc atttctttcc 33781 cttcattccc agaagtatga caattatcct taaagttgat atatatcatt cccatgcatg 33841 ttttttatac ttccctagta caagttagct gtatcctctg ctcaggggct catcaagctg 33901 aatcaaggga ctcatgatcc tcttcaaagt tccttcaggt tgttggcaga atttagttcc 33961 ttgtgattgt aggactgagg gcccgttttc tcactggctg ctggccaggg gttgctccca 34021 gatatttaaa ggctcatgcc ctagcccatg acagtctcac aacatggcag ctgacttctt 34081 caaaaccagc aggagaatct tgctctagtc taccacataa cctaatcaca ggagcggcta 34141 tcccgttatt ttcacagatc ctggtcacat tcaaggggag ggaacccttc tgtgtgtgta 34201 caccaggagg caggaatttt ttttttcttt ttcttttttg ttaaaaagtc ttaaagtctt 34261 ttatccctaa aggaggcagg aattttgaga gccatcagaa ttctgcctac cacagcccag 34321 
aaatctgcat ttttcacaag tctccagcca tgatgtttct gatggctcac actgctttat 34381 tccattttta aagagtattt ttattgaaaa gcattagggt tatggtttaa aaaatatttt 34441 ccctaacaaa gatgggtttg tttagagtcc tacttttgac taaatagctg agattcactt 34501 ttatgtaaag ttcattttat agcgttatta atttgggtgc ctttaaaaat agtataaagc 34561 atgtttctcg agtgtagtct gttagccacc tatattggag agttgggagg agagagtctc 34621 tatcttgaat ttatgggaaa aattctaaaa tactttttat aatgaaggac aacatcataa 34681 ctccctaata aaatgtgcat gtatatattc aaatttgctg tcattgatcc tgcacctaca 34741 aaatccagtc ctgggggctg gcattcttac tgcttgctga gggccagatg atatagattc 34801 cagaatatct ccatgtagat tttggtgaga attactgtgc tgaaaagaat gacagtattg 34861 cagttataca tgggggtttt ggtactttat attgtgactc tgaatttaaa gctatgcaat 34921 gtcttctttt ttgaaaggat ataattgaca ctggcaaaac aatgcagact ttgctttcct 34981 tggtcaggca gtataatcca aagatggtca aggtcgcaag gtatgtatga cattttgaca 35041 cagaatattt tcctcatttg aagggggatt aagtgattgc ttctttttaa ggataaatgt 35101 tttcaactgt cattttatct tcgaaaagta atgtaatctc atataagact taagatataa 35161 tccttttaaa taattttgtc atgtgttaat aaagctcata attacagtca cttccttgct 35221 aatattaaca tttggttttc agcatgctaa ttatatcagt ttgtcctgaa tagcatggca 35281 gaggattttg ggcccccttg caaaattaag aataaggatt ccaaagcggg tgaggaagtg 35341 ataggaaggg gtgggccctg aagatctgga cctcctggaa ttgagtgatg aatgctgcat 35401 cttctttgtg tctgtagtga aattttataa tgcctgcttc cttttttatt aagtcggcct 35461 cacctcctca ccttacctat gctgttttac ttttgctttt atagttctac ctgtgtttat 35521 ttctcatttt cgtttcatct ctcaacaact ctggggtggc attattattc ccacttttca 35581 gataaggtta ctgaggcata gggaattgtc caaaggtaca gagctagtcc gctatagaga 35641 tgagatttga acccagggaa cctggctcac agtttatgct tttgcctacc ttaagttttt 35701 aatagagtga catcaaacaa acatttaaga atatgttttt cttttccttt tataatttca 35761 ttaaaaacat taagtctctg atcagtctgc agtttttatg taggggtcag gtaatgttct 35821 aacttctgct ttttcctaag tgattaacag gtttttataa gcccttttga aaaaatcacg 35881 gtatctgtcg agcatctttg aatcagagta agccttctag tgagtcatat gtcagcagtt 35941 tgactgtatg ggcttttcta atatccagtt caagtgttta tcagtgagtt 
tttcttttaa 36001 atagatttgg gacaggtact atgagagtat ataagtgata cgttatagga cactaactag 36061 tatcctatga aatggcaaaa actgcaatca cttttgcacc aaccaaatag aaactaatca 36121 gtgcacttgc ttatttttct acatgctctt tagggtttta aatgtcaacc tactgtggca 36181 tagactttaa tcctctgggt attcttttgt tgttctttcc tggtatatgc tgtggaattg 36241 agatagactg gttcgtgagc gagagatttt gtgttgccac aggtaggaca tgctcaaaca 36301 atacttgggt catttcttga cccaagtcat ctattcacca tagttttgta gcaccgatct 36361 tgcatacatt tcatgtatct tctttgaacc ccacgtcagt gctgcttata tgatactcag 36421 aaattaaaca ctaaggaata agattttcag gtaggattga gttttggagg gtcacaaatc 36481 ttgtaatgtc taatatttcc actctccctg ctgagaatta gttttggctt ccttggaggt 36541 gatatcgcct ctgttgagta taagtggcct actgtgatca caccactgca ctccagcctg 36601 ggtgacagag tgagaccctg tctcagaaaa aaaaaaaaaa aaaaagaatg catggcctag 36661 atgacttcta aggtttttcc cacccagttc cagttttcat gttctaggca gagcagtaaa 36721 gtgagaaaca catggacttg ggagtttagt ctcgcatttc actgccactt aatctgagcg 36781 actattccat atttaatctc tctgaatgta tttactcatc tttaaagggg aatgattatt 36841 aacatctttt tctcagggaa actatatgag tcaaggagat aatatatttg aaaatctttt 36901 taactgcaaa gcgctgtttc actgttggtt ataatgtgat tgatctcatt gtagtgagca 36961 gctgcttaat tgcgttttag aatgtaggga agatagtaat atttttcaca ttatatatgt 37021 agctggttct ggaactgtaa acatactcct tttttatgga gatctgagtc acgtaccata 37081 aaattcactc ttttaaagtt gtacaatcca gtggtttttg atatattcag agttgtgcat 37141 ctgctaccac tatttcattt tggaacccaa agaaaccttg tacccattag cagtcattct 37201 cccttctccc agcccctggc aactactaat ctactttcta cagaaagtcc gtacagattt 37261 gtgtattatg gacattccat ataaatggac tcatgcaata tcctgtcttc tttcacttag 37321 catagtgttt tcaaggttca tctaggttgg ggcatgtatc agtacttcat cccttgtttt 37381 ggctgaataa tatttcattg tacaaatata tcacattttg cttatccatc tgttggtgaa 37441 catttgagtt tctacctgtt ggcttttatg aataatgttg atttgaatgt ttgtgtacaa 37501 gtatgaatac ctgttttcag gtctcttgag tatatagttg ctaggtcata tagtaactct 37561 gtgtttaaca ttttgaggaa ttgcccgact atttaacaag gtatatgtac tgttttacac 37621 cagtaacata tgagggttcc aatatctcca 
catccttgac aacacttgtt actgtccttt 37681 ttattgtagc catcctagtg gctatgatgt ggtatctcat tgtggttttg atttgtgttt 37741 ctctgatgct gatgatgttg aacatgtttt catctgctta ttggccattt acatatatct 37801 tcttaagaac ggttacccat ttacagtatg gaaaatgctt cagatgcaac tctagtcatg 37861 ccttagagat ggagctttat taaacattca gatctctagg catatgaagt gctgagttct 37921 cttgaactcc taatacagat tgcactgagt ttagtgatac cttttctgga gcattcctga 37981 gttcaggtag ggagaagggt ttttgctgtg attggcttgt tatgttcttt ctaaatggaa 38041 atagaattga agtgtctcct ctctccattt attggaagag tcatgaggga cataattaga 38101 tgatcccttg gagtctccgg cttaggtcag tggttatcta cttaggctgc acattggaat 38161 cacctgagag ttaaaaaacc aggataacct ctgcctgtgt ctcatctcca gcaattctga 38221 tgtaattggt caggctgtgg cccgagtagg tgagttctgg ttttttaaag ctcccaggtg 38281 attctgatgt gcaatccagg ttgagatcac tttgggccct ttccagctct ttaaacatat 38341 atatttatct aggaaggtat gaaagcataa gttttcttga gactgccttt aacatctgta 38401 aaggctttca aagcagcttc tgtagttttt tttaaatggc tgaatatttt tcaacaggca 38461 gcatttgggt tataaaatta gcttttggta gagttgactt ataccacctc cagcttttgt 38521 tccaaaaata aatactggtt cttttggcac actagttgtt ttaccctaaa gttcctcttt 38581 gtaagccagt tattaaaagt tgtgatgcag ccagggcgaa gtggtacaca tctgtagtcc 38641 cagctactcg gaaggctgag gggggaggat cgctagagcc caagaagtca aggctgcagt 38701 gaactgtgat tacaccactg cactgcagcc tgggccacag agcgagactc atctctttaa 38761 aaaaagaatg ttgtgaggcc gggcgcagtg ctcacgcctg tgatcccagc actttgggag 38821 gccgaggtgg acggatcacc tgaggttggg agttcgagac cagcctgacc aacatggaga 38881 aaccctgtct ctactaaaaa aaatacaaaa ttagccgggc gtggtggcac atgcctgtag 38941 tcccagctac tcggcaggct gaggcaggag aatcgcttga acctgggagg cagaggttgt 39001 ggtgagttgg gcgagccatt gcactccagc ctgggcaaca agagcaaaac tccatctcaa 39061 aaaaaagaaa agaaaagaaa agaatgttgt ggccaggcgc ggtggcttac gcctgtaatt 39121 tcagcacttt gggagaccga ggtgggcgga tcacgaggtc aggagatcaa gaccatcctg 39181 gctaacacag taaaacccca tctctactaa atacaaaaaa aaattagccg ggagtgctgg 39241 cgggtgcctg tagtcccagc tactcaggag gctgaggcgg gagaatggcg tgaacccagg 39301 aggcagagct 
tgcagtgagc ggagatcgcg ccactgcact ccagcctggg caacagagcg 39361 agattccgtc taaaaaaaaa aaaaaagaat gttgtgataa aaggtgatgc tcacctctcc 39421 cacacccttt tatagtttag ggattgtatt tccaaggttt ctagactgag agcccttttc 39481 atctttgctc attgacactc tgtacccatt aatcctcctt attagctccc cttcaatgga 39541 cacatgggta gtcagggtgc aggtctcaga actgtccttc aggttccagg tgatcaacca 39601 agtgccttgt ctgtagtgtc aactcattgc tgccccttcc tagtaatccc cataatttag 39661 ctctccattt catagtcttt ccttgggtgt gttaaaagtg accatggtac actcagcacg 39721 gatgaaatga aacagtgttt agaaacgtca gtcttctctt ttgtaatgcc ctgtagtctc 39781 tctgtatgtt atatgtcaca ttttgtaatt aacagcttgc tggtgaaaag gaccccacga 39841 agtgttggat ataagccaga ctgtaagtga attacttttt ttgtcaatca tttaaccatc 39901 tttaacctaa aagagtttta tgtgaaatgg cttataattg cttagagaat atttgtagag 39961 aggcacattt gccagtatta gatttaaaag tgatgttttc tttatctaaa tgatgaatta 40021 tgattctttt tagttgttgg atttgaaatt ccagacaagt ttgttgtagg atatgccctt 40081 gactataatg aatacttcag ggatttgaat gtaagtaatt gcttcttttt ctcactcatt 40141 tttcaaaaca cgcataaaaa tttaggaaag agaattgttt tctccttcca gcacctcata 40201 atttgaacag actgatggtt cccattagtc acataaagct gtagtctagt acagacgtcc 40261 ttagaactgg aacctggcca ggctagggtg acacttcttg ttggctgaaa tagttgaaca 40321 gctttaatat acaataattg ttgcattatt atttcagatg ataaatgtgg tcataagtaa 40381 gaaataaatg atcgagttta gtcttttaat tcactgtcct ttgaatacct gcctcttact 40441 ctggaggcag aagtcccatg gatgtgttta tgaacatggt tgaggaagat ttaggaagac 40501 tgcaacagta cactacctaa agcaggtttt ttactccatc tttttttgcc acgtacactg 40561 gcctcccact ttgatatgct tgaaattatc tccttgattt gtctttcaaa actacatatt 40621 gaggctggtt gcggtggctc acacctgtaa tcctagcact ttgggaggcc aagccggaca 40681 gatcacttga ggtcaggagt tcgagaccag cctggcaaac atgatgaaac cccaccttta 40741 ctaaaaatac aaaaattagc caggcgtagt ggtgtgtgcc tgtaacccag ctacctggga 40801 ggctgaggca ggagaatcac tggaacccgg gaggcagagg ctacagtgag ccaacatcac 40861 gccactgcac tccagcctgg gtgacagagc aagactctgt ctcaaaacaa aacaaaaaac 40921 aaaaaactac gtattaagac aagaaacaga ctgggcgcgg tggctcacgc ctgtaatccc 
40981 agcactttgg gaggctgagg cgggcggatc acaaggtcag gagatcgaga ccatcctggc 41041 taacacggtg aaaccccgtc tctactaaaa aatagaaaaa attagctggg gtggtggcgg 41101 gcgcctatag tctcagctac tcgggaggct gaggcaggag aatggcgtga acccgggagg 41161 cagagcttgc agtgagcaga gatcgtgcca ctgcactcca gtctgggtga cagagcaaga 41221 ctccgtctca aaaaaaaaaa caaaaacaag aaacaaatta aactaatgtg atagactact 41281 gctttgtttt caaaagatac actccccaaa agttactgat ctaaatacag tagtactatc 41341 tctgtttagt aagaaccctg acaactaata gtgttcttat atgtaaaatg ctattcttgc 41401 ctttcatttc agaatatact ttttaaatgt gaatttctgg attttttttt atagcatgtt 41461 tgtgtcatta gtgaaactgg aaaagcaaaa tacaaagcct aagatgagag ttcaagttga 41521 gtttggaaac atctggagtc ctattgacat cgccagtaaa attatcaatg ttctagttct 41581 gtggccatct gcttagtaga gctttttgca tgtatcttct aagaatttta tctgttttgt 41641 actttagaaa tgtcagttgc tgcattccta aactgtttat ttgcactatg agcctataga 41701 ctatcagttc cctttgggcg gattgttgtt taacttgtaa atgaaaaaat tctcttaaac 41761 cacagcacta ttgagtgaaa cattgaactc atatctgtaa gaaataaaga gaagatatat 41821 tagtttttta attggtattt taatttttat atatgcagga aagaatagaa gtgattgaat 41881 attgttaatt ataccaccgt gtgttagaaa agtaagaagc agtcaatttt cacatcaaag 41941 acagcatcta agaagttttg ttctgtcctg gaattatttt agtagtgttt cagtaatgtt 42001 gactgtattt tccaacttgt tcaaattatt accagtgaat ctttgtcagc agttcccttt 42061 taaatgcaaa tcaataaatt cccaaaaatt taactgcttt atgaattcaa tttaaaaatc 42121 cttaaaataa gtcctgtctc tttaaaaaaa cctatgcata gttatcattt ctctacaaat 42181 taacctagtt tagttttctg ttggttccat tttccttgtt tgttaagttt tagtagctag 42241 tttaattgta atctcaatga ttatgtggta gaatgggttg gcggacgtac aaaaattcct 42301 agctacttca gagacattaa atttcagaca catggtacac tttatattac attttactat 42361 gctaaaataa cacggctttc ttttggaatt ctgttcagtt tttcagattg taatctcagc 42421 tacatctcaa cagattgttc tcagatatgt cctattacct tctttgtgta gatagtgctt 42481 tattgactaa gaacaatgac aacaacacct tttgttttct gggaatagga gaaaagtttt 42541 aagccaaaac tcttaattgc ttatctgctc cacgtgaggt atgaactatc aaacttagga 42601 gccatctagc ttacacgtgt tccttaaaaa gtttgctgta 
ggccgggcac agtggctcgt 42661 acctgtagtc ccagcacttt tgggagccca gggtggggga tcacttgagc tcaggagttc 42721 aagaccagcc tgggcaacat ggcaaaacgc catctctaca aaaatacaaa aaaaaaaaaa 42781 aacgctgggt gtggtggcgc acaactgtag tcccagctac ttgggaggct gaggtgggag 42841 gattgcttga gcttgggagg tgaaggctgc agtgagcctt gacagtgcca ctacactcca 42901 gcctggatga cagagtgaga ccctgtctca aaaaaaagag tttgctgtaa ttcccagcaa 42961 caaagtagga gactcaaact aaataatttt ctatagtcct agaacttctt agtttacaaa 43021 acatttttac ttctgttatc tcatttgatc ttcataccca tgtaagggtt gaggtagatg 43081 ttaccacatg tgagtgcaat atccagaact ctgaatccct tcttccccta aaatgtcagc 43141 ccgctgaggt ccacttggct accctcttga atactgcatc cagcttccca ctgctgaacc 43201 tctttactct ttttttttca gttgcactta ccgccttcta gtaagttgaa ccatatgaaa 43261 ttaccatttt tgcaggtaaa aaatggccgg tgataggcag tttggcgtcg tataacccaa 43321 taacatgtta tataatttac ccacaagtgg tgggttgcta tgtcctggag gagtcagctt 43381 cagactctag ctaaatgatt gtataacctt gcagctctcc cctaagtgag gaggcaatgt 43441 tgaaagtccc atgtcttatc agaaccaggg aggcagatga gaaactgcct tatggcagct 43501 cccacaacat agggaggtgg gtgacaaatg gccttgggac agcttcttcc caagactggt 43561 tatgttacag tgttcctggg aggatcacat ggcattcctc caagatgggt cagactgctg 43621 ttggccttgt ctgtgtggcg tatgtgaaga cattcatggc cagagctgtt cccttagaag 43681 catctactaa attgatcttt tcctttctta cttactgtct gtctccctta gtaggctgtc 43741 agctccgtga gtgcaggacc ttgccagtcc tggtcactgc tatatcccca gcacctacaa 43801 gagtgcctgg aaaattgtag tgctcaataa atatttgttg gataaatgat agaatgatag 43861 gaagttaaaa agcaattaaa atacttgaaa agaagcaaaa catttttcat gttaagcaaa 43921 aaaaaaaaaa aaacttatta aggatagcta acatgtattg aattctatat gcaatggaat 43981 gatacttagc gcctttgaat ccttatgata accctataag gtaggttgtt tgggtttttt 44041 taattgtccc aactttacag atgaagaagt gcaggtccag agaggtcaca taatttgccc 44101 aggatcacac agctagtaag tagcagatga ggaatttgaa cccaggcagt tgtattccac 44161 catctgccct cttagttcat tgccacttaa cctataatgc ccagctcttg tgtagaaatt 44221 aatacactga taacatagag gaaaacatta agctcattga atgtaataag tccagatgac 44281 ttgtacatta aacacagctt 
tttgaggtca cagctgatct ctaagaatgt aaactgattt 44341 cctctggcac taaaaagcat tttcaaagac tgttaagaga gtttctccaa cattctcttc 44401 agatttttct gctggcttat tttatgattc tgtggacagc ttcagacaaa ataactttct 44461 ggtatgaagg attgtgttta ctctgctttt ttttttgttg ttttttgggt tttttgtttt 44521 gttttgtttt gtttttgaga cagtgccttg ctctgctgct gcccaggctg agtgcaatgg 44581 catgatctcg gctcactgca acctctgcct ccctgggctc aggccaggtg tatgctacca 44641 ctctcagcta atttttaatt tattttttta gagatacggt cccactctgt ttcccaggct 44701 ggtctcagaa ctcctgggct cagacagtcc cccgccatgg cctcccacag tgctgggatt 44761 acaggcatca gccaccatgc tcagcttgtt ctgccatttt caaatgtgaa ttttatagac 44821 actttaaacc acttgaaaga gtgatgatgt tttaatgatt ttcattatta tttgcaactt 44881 caagcattaa acactgccaa attaagtttc aagttttctc tttacacaat atggatgtac 44941 ttcataatgg acttcctcat catgattaat gagtgaagtg acattcaaac ttggtagctt 45001 ttcagtagaa cttcctttcc caacattttt tctgttcctt taattatggc aatatctgag 45061 agctctgaac ataagtcaaa ggtttgatta tttttcatgt ggcttcctct gcttggaact 45121 ttctgccccg catcttcccg ttgccccctg tgtcctcttg tcatgcccct accctttttt 45181 gagtgtgtct attttctggc actacaagac ataacaggct catcttgtgt tttccctacc 45241 ctgacccaga atcagccatt acttcaagga gccctggttc cattattgga gaatactatt 45301 agaaaccagg atctggtgct aggcatgctc atttctattg gagtgtcata caaacaattt 45361 gtaaattgtt tgtaggtcct cccagtggat aggattagga aataaaacat gcatactaac 45421 catgcataca cacacatcta cgtctatttc tgtatctgtc tgtatacata ttaaaataaa 45481 catgggttga taactaatgt ttctgctgta atccacagcc ttcatcctag cctgccactc 45541 ttcttctttt tagctttttc aacagtggga aatgtggctc ttgttatgta cactttattc 45601 acttatttgt ttgaccctag tatcataaag tagttccgta tgcctgtaac agatcgacta 45661 actagagtcc attatttgcg gaaagatctt tttgtccgaa cgttaccgca ggggtgtcca 45721 atcttttggc ttccctgggc cacactagaa aaagaagaat tgtcttgggc cacacgtaaa 45781 atacactaac actaacgata gctgataagc taaaaaaaaa aaatcaaaaa aatttcatga 45841 tgttttaaga aagtttacta atttgtgttg ggccacgttt aaagccatcc cgggtcgcag 45901 gttggacaag cttgccttac agtatccagt caaaataatg ttttccaaaa ttacttcttt 45961 
tctttttcat ccctttcagt gtggccgtta tttataatgc agtttggttc attagtgttt 46021 ttattacaaa tacaccctca gccttcatat cctagtttta atgaattatt acggtgaaac 46081 ataataagag tcagagctat acagaaaggt ctactcagag gtgctttgtt ccctcctatt 46141 ctgttcccac tactcctact ttccactgac cctgtaagca tcatatttat ttttaatggc 46201 agttacattt ttaccaagtg cttactatct gtaggcactt ggtgtgtatt gcttcttttg 46261 gtgttcacag caacctcttg aggtaagcac tattattatc cccccttttt cttttttctt 46321 tctttctttc tttttttttt tttttttttt tgacagtctt actctgttgc ccaggcagga 46381 gtgcagtggc gcaatctcgg ctcactgcaa cctctgcctc ccaggttcaa gtgattctcc 46441 tgcctcagcc tcctgagtag ctgcgagtac aggcacaagc caccacgccc ggctaatttt 46501 tgtattttta gtagggatga ggttttgcca tgttggccag ggtggtctcg aactcctgac 46561 ctcaggtgat ctgcccgcct cgacctccca aagtgctggg attacaggca cgaaccactg 46621 cacccggcta ttatccccat tttttagatg agaaagctga atcccagaga gcataagaag 46681 cttgtccaga gtgacatctc tgatgcataa ccagtactca aacctatttt tctgacacca 46741 aggcctgtgt gtaaactgta aaggggctgc ttggcaccta ctttcctaaa gttgtcctat 46801 cccttctctg tctgggtctt cctgaagctt ggcacttctg aagtcacctc tctgaaaaca 46861 ttctggtaac tgttaaatcc cttgttctag ctattcatgt gttctgtgtg gttaaacaag 46921 gttcacaatg gccacctggc ctttggaact tgggtgaaga ggctgccttc agttgatcct 46981 ccccactccc attttcaaaa catgggttta catgagttat ttgtgaatta ggaaacataa 47041 ccatgttttg agccttcata gaaaacaaac gtctggggtc atacaggtta aaaggagtaa 47101 ccaaattcgg cactatcatt gttctattca gtagacaatt ctggggcctt tctgtgtctc 47161 aggttctgta ctagttgttt caggactttg ggataaatac aaactatccc tgccctcagg 47221 gggcttaagg tcaggtgtac aagtgactct aatgtgaggc aaggctggat tcagtgctgc 47281 atatctaatg ctatgggaat tcaaagagga agtgatcaga atgagaaggg agggatggat 47341 cattccagga gaagcttcag ggaaaagcaa catttaaaat gagacttttg agagtgaggg 47401 aaatttggac aggtggatat agaggatgca aggctagagg aaaggtttta gccagaaagt 47461 ctgcttgggc aaatgcctgg gtaaaaaaag aaaatccact ttgggaggac aaggcgggca 47521 atcgcctgag gtcaggagtt cgagaccagc ctggccaaca tggtgaaacc ccgtctctac 47581 taaaaataca aaaattagct gggcgtggtg gtgggtgcct gtaatcccag 
ctacttggaa 47641 ggctgaggca ggagaatcac ttgaacccag gaggcagagg tttcagtgag ccgagattgc 47701 gccactgcac tccagcctgg gcaacaagag tgaaacatct aaaaaaaaaa aaaagaaaat 47761 cacagggcag tgtggggaat ggtgagtatt ctaatttggt tgtggcagag aggatgtaga 47821 aggaagtgat aagagagaaa gccggatagg agggcctttg tgccagttag gatgttctag 47881 acttccagcc aggttgccca gctcaaactg gcttaaacaa tgagggggtt tattggctat 47941 gtaattggga agtgcagagg tagctcaggc cagatcagtt tgatccactg ctccattatg 48001 atgtcaaaga cccatgcgat ttccacctca ttattctgct gtccatagag ccaacttcat 48061 cctaaggcca gtccttgtgg tcagacaagg gctgccaata gtaatctggg tgcaagtttc 48121 tttgagaaaa tctttctgtg tcaactctct taaaaggggt gaaaaatctc tccttaagtc 48181 ccactggcca gaatgggccc atgcacccat ttcttaacca gtcactggca actgggggtg 48241 ggattgccgt ttgcccaatc aggtccattt ctggagctaa gattaaactc catttccctt 48301 gggacacatt gaacagaatc agaattcgat gaagaaggaa gaagcggaga attggtttgg 48361 tgttgggtag gcaaccaaaa ataacctctg ttgcctcaag tgccaagaaa gtggtgtttt 48421 gtgcttgtta gggtaaaaat ggggatcatg gaaaatattt taagtttcat agaccaaaaa 48481 atattccagt gtttcatcaa atctaagagg ctatcaatta taagatatac cattatttta 48541 tgtaccacca aggaagaaaa aatgctgcca gtgaagttag gatgtattgc aggttgggtt 48601 ctctgggaag caggctgaaa aggaggtgag aatgcaggac atttatggga gaacaccctt 48661 gggattaata ctggaggagg agaaccaagc agggttggtg gggcacaggg agaagttggg 48721 atgccatgca gtcacaacaa aggcctcagc caaccccacg gggagctcga gaagctgaga 48781 tggcccttca gtgttgccct gccttgtggt gagtgaattg ggtcttcata tccccatgtt 48841 gactggtcat tggatgtggg ctcccttagg aatgggcatc tcttcagcag aggtagcttt 48901 cttcaaaaga ggtgattcca aagagtcacc cactcactga gggctgtctg ctggcagcat 48961 tctcagccac tactcaaaga tgacctgtcc aggaagggga acctaggtgg catgacacat 49021 tgtctattac aacatgctac tgattataag agccgggagg tggggggcaa cacaatgtct 49081 gagatattaa aatggaagtc tcttagaaga aatggataat tctataatta tagttaatca 49141 gaaaggggaa gaagtgggga aatggaccaa gggcctgaga gagaaaacag acgcaacagg 49201 ccactagaaa gataggacac tggagggtgg gaagccctag cagtttcttc cagggtgggc 49261 tgggcacggt ggctcattcc tgtaatccca 
gaactttggg aggccgaggc gggcagatca 49321 tttgaagtca ggagttggag accagcctgg ccaactcctg tttcaccctg tctctgccaa 49381 aaatataaaa aattagccgg gtgtggttgc atgcgcctgt aatcccagct acttgggaag 49441 ctgaggcagg agaatcgctt gaacccagga ggcagaggtt gcagcgagga aaaatcgtgc 49501 cactgcactt gagcctgggt gacagagtga gactgtctca aaaaaaaaaa aagtttcttc 49561 cagggtggct tctgtgccag agtcaggtgc cccagctacc tctaatttat ggtcctcctg 49621 cactgggaaa cagattttct acttttggtt tcatgataaa taacatttcc ccctgatttt 49681 aaaagttatg gatttggctg ggcatggtgg ctcatgcctg taatcctagc actttgggag 49741 gtcaaggcag gcagatcact taaggtcagg agttccagac cagactgggc aacatggtga 49801 aaacccgtgt ctaccaaaaa aaaaaaaaaa aaaaaaatta gccaagtgtg gtggtacatg 49861 ccagtagccc tagctactca ggagactgag gtgggaggat tacctgagcc caggagatca 49921 ggcctgcagt gagctgtgat tgtgccattt tactccagcc tgggtgacag agtaagaccc 49981 tgtctcaaaa ataatagtaa taggctgggc gcggtggctc aagcctgtaa tcccaacact 50041 ttgggaggcc aaggcgggcg atcaattgag gtcaggaact caagaacagc cttgccaaaa 50101 tggtgaaact ccgtctctac taaaaataca aaaatgagcc gggtgtggtg gcgcatgctg 50161 cattcccagc tactcaggag gctgaggcag gagaatcgct tgaactcggg aggcagaggt 50221 tgcagtgagc cgagattgca ccactgcact ccagcctggg tgacagagtg agactccatc 50281 ttaataataa taaaataata aaaattttaa aaagttatgg atctggatgg agggaaatgg 50341 aatgtataaa agaagtaaac atacacaaga agatacaaat acagaataaa agtaaaatgc 50401 aaccatcatc ccactacccc gataccaggg tatccgtttt tacatctttt ctttcattct 50461 ttctgtcttt atataattgt ataaatgctg cataaacctc ctcttgcctg ctgcctcctc 50521 aaagacctcc ctccctcctt cactgccctt ctgctcctgg agagccaccc tctctccatt 50581 tatccttcct atcagcttca ggttcttacc atgttaacaa aaagaaaatc ttataagcct 50641 gtcactctct acatacgccg cacctccttt cattcatagc ctttaaaaca tatatatagc 50701 agttattgtg gttatttttc tgttcacaaa ataaaaaaac actctttcta gaaaactgga 50761 atatagaggc aagctttttt tttttttcag acggagtttc gttctgtcgc cccaggctgg 50821 agtgcagtaa cgaaattaca gcttactgta acctctgcct cctgggttca agatattctc 50881 ttgcctcagc ctcctgagta gctgggatta taggtgcctg ccaccacacc cggctaattt 50941 ttgtattttt 
agtggaaatg gggtttcgcc atgttggtca ggctggtctc gaactcctga 51001 ccttgtgatc tgcccatctc ggccttccaa agtgctggga ttacaggtgt gagccactgc 51061 accctgccga ggcaagattt tttttttttt ttttaagaaa acccagttat tccattaccc 51121 aatgaaactc taaacatgtt gatgtacatc cttccaaaat ttctttttat gacaacatgc 51181 tttttatttt taattatttt tattttattt taaggtccgg ggtacatgtg aaggatgtgc 51241 aggtttgtta cataggtaaa cgtgtgcctt ggtggtttgc tgcaccctgt caacccatca 51301 cctacgtatt aagccccaca tgcattagct attgatcctg atgctctctc tccctgctgg 51361 ctccccagca ggccccggtg tgtgttgttc ccctccctgt ttatgagaac actttcttga 51421 cataaagatt tcatttattc ccatggaatt ctaaaggctt ttcatacttg tgaaggaata 51481 atagtttaga aataaactga actttaaaag ataccatttt gaaaaataat atacagccat 51541 caaaaattat atttatggga actatgcaat aatattaaac tctatcatct gttgactgcc 51601 tcctatattc cagaaacttt acatacacca attctaatcc ttacaagaac gctgtgtagg 51661 ctttagcatt agatggacca ggtttcacca actgtatggt cttggataag tacccaacct 51721 cctgtcccta agtttcctca cctgtgaaaa cacggtttct accagctttc aaataagatg 51781 atcaatataa ggcacttgga acagaacctg acacatcata agcactctat aaatgtctat 51841 tatcaccaaa taattccagg tgccttgaaa atttaaatga aaaacaaaat caaaccatga 51901 caatactaga agcaaattta ggtgaacact tttctaatcc gggggtgggc gggggctggg 51961 gggaggcagg gagaagacct tttttttttc tttttgagat ggagtcttgc tctgtcccca 52021 agctggagtg cagaggcgtg atctcagctc actgcaacct ctgcctcctg gattcaagtg 52081 attctcctgc ctcagcctcc cgagtagctg ggactataca ggtgcacacc accacggcca 52141 gctaattttt gtatttttag tagagatggg gtttacaccc tgttagccag gatggtctca 52201 atttcttgac ctcgtgatcc catccgcgtt ggccttccaa agtgctggga ttaccagcat 52261 gagccaccgt gcccggctgg gagaagacct ttctaagcat gataccaaag gcagagacaa 52321 taaaggcaaa gaattgacag aattcactat ccgataaaaa tcacttctgt ggccgggcgc 52381 ggtggctcac acctgtaatc ccagcactgg gaagccgagg tgggcggatt gcttgaggcc 52441 aggagttcaa gaccagcctg gccaacatgg caaacctcct gtctctacta aaaatacaaa 52501 aaattagcta ggcatggtgg catgcctgta gtcccagcta ctcaggaagc tgaggcatga 52561 gaatcacttg aacctgggag gtagaggttg cagtgagcca agatcatgcc actgcactcc 
52621 aacctgggtg acaaagtgag actctgtctc aaaaaaaata acaattaaaa taaaatcact 52681 tctgaatggt ggaaagcacc acaaagttag aggtcaagca ataatttgga gaaaagaatt 52741 agtaatttgt tggacagaca aaagactttt ttaatataac aaaaacttta aaaattaaaa 52801 aaatacacat tcgaggacat tttcctaaaa acacaggcaa aggacataaa cagcaaagca 52861 agaagacagc ttgatgtggc cattttatcc agggggacat tttggtgagc cctatggaca 52921 cagctgccat gatgccaaca atgtgacagc tgtccccttc aaaatgcgtt agccccagct 52981 cttcctctcc cccaacctcc agtccaaagg acttgcactt tctactttac tcctttctgc 53041 attgtttaat tttcttttac aaatatgtta cttgtcatca gaaaaaataa agaaataaat 53101 aaactgttag agtgttagcc ccttaaaggg gagcaagaat cacctttcta aaagaaagtt 53161 tatgttaaat ataatattag catatgtgaa tcctgagaga aaagttaaca gtttagttga 53221 gttatttcct ctgtagtctg gagctaaaaa tagggaatct tattctgtcc taaatctttt 53281 ccttcctcca cccagtgtct gtctggatcg aattcattca ttcactcagt aggcactcac 53341 tcagccaggc atggtgctag gcctcaggac ctcgctgtga accagaaact gtccctaccc 53401 ccatggtgca ggcattctgc ttgggagttg gaggaggaac aggtaaaaaa taattaaata 53461 ttcaggttaa cgatatattg tcaggtttga ggattgagga aagggcgcag agagtggcaa 53521 gggctgctgt ttagatacag tggccaggag gctccgatga ggtgaccttt gaggagagac 53581 atgcaggaga tgaggggaca gtgaagagga tttctaagaa cactccaggc agacagaaca 53641 gcgacagcca aggccctgaa gtgggtaggg gcctggtgtg tgtgaggaac ctcaggattg 53701 ccatcatggc tggagcagag acatgaagca agaaggccat ggagatgagg gcagggagat 53761 cccggagtgg ggagatcaga tggggctctg tgtatcatgc aaaggacttt gcattctgtt 53821 ccaagagctg ggaaggttga cataattagg aaaaaagccc agaaaagcag aggtatccat 53881 ttttcatggt aaagatgata atttcaatta aaacacgatt cctggatata tgtaatttgt 53941 aggccaaatg gtgcccaatc cctacctccc tcaccccctc acttccctat ccctaaaacc 54001 tgtacctcaa ctcccgttcg taagtgatgg gagttaggaa tagagaaatc tcccggttgg 54061 gttttctgag caaagaggta acatagcagc tctgttattt ctttcacgtc tccaagggaa 54121 ccatgactca cccttagcta tcccccggga atgtggccct cagagtgttc ttttactgat 54181 tcgtgatttt gttatgtaca cctggagtga tggaacatac cataccagct tgtcagggtt 54241 gctttgtgca aagatcgatg acgtgtgtga acccggatcc 
atgcttgggg tcctgagttt 54301 caggtgccat ggccagttgc tagcaggttg tatgtgtgtg accagcccct atgtgagtct 54361 ctcagaccct gaaactccaa acaggcttcc ctgggcagag acattctgtc catgctctgt 54421 ggcttgctgc tcgagaggga tagatcacat cctgtgtggc ttcttcttaa atgaagaagg 54481 acattggaag cctgtgctgg gcttctctgg accccccgat gtatatgtat gtatattaaa 54541 gagagaccag ggtctcactc tgttggccag gctggtcttg aactgctagc ctcaagaaat 54601 cctcccgctt tggcctccca aagtgctggg attacaggca tgagtcacca tgcctgatgt 54661 atatattttt ccagctccct tcttttctgt atcatttgct attactacct cttagctatt 54721 agtataaact gatcttgagt tgtgtaaatc tttctggtga ttcactgtga tgggatgatt 54781 gtgtcctctc aaaattccta tgttggagtc ctgacccatg gtacctcaga aagtgactgt 54841 atttgaagat aggtctttaa agaggtcatt gtaaattaat taataaggtc attagggtgg 54901 actctaatcc gatatgactg gtatccttat aagaaaagga aattagcaca cagacacaca 54961 atcagaggga gaagacagcc agtcatctac aagccgagga gacagacctc agaagaaacc 55021 aaccctgcct gcaccttgat cttggacttc tagtcgccag aactgtgaga aaacaaatct 55081 catgtttaag ccagaaccta gcacgtggta cttgttaagg catccctaga aaactaatac 55141 actcactgaa tgaggcaggt agctgtttct tttatttttt gagacagagt ctcactttgt 55201 ctccaaggcc agagtgcagt ggagcgatca cagctcactg cagcccctgc cttccaggct 55261 caagccatcc tcccacctca gcttctcaag tagctgggac tacaggcatg caccaccacg 55321 cccagctaat ttttgtattt tttttttttt tttgtagaga cggggttcac cgtgttgcct 55381 aggctggtct caaacccctg agctcaagca atctgccctc cttggcctcc caaagtgttg 55441 gatttacagg cgtgagccac tgtgcctgga tatggtaact ttttcatatg ctatttgctt 55501 gatgattatt tttctgtttc tgatataatg ctttttatta gagagttatc tgtttgtttt 55561 tattttttaa tgtttgaatt taaaaaatta gtataatttg cataattgaa aaattatatt 55621 tgaataattg aaatatattt gtataacctt aaatttaaaa actatgatag cgtatacagt 55681 gaaattttcc tctcatccct tttttccatt taaccagtgc acttcccaac agccaacaga 55741 taattttagt ttcctcactc cctgagctat tttatgtata tgcaagtaga tatgtacata 55801 catatttctg ccttgtaaca caaatagtag catactatac aactgctctg cttcttcctt 55861 tttttagcta agaatattaa aagagtgaaa aagatgtacg ctaacaaaaa tcaaaagaaa 55921 actagagtga cattataaga 
actgatgatg tagatttcag agcaatgatt actgctagga 55981 aaaaagggtc attttacatt gatcaaagag gtcaactcat caggaagaca taataatcct 56041 aaacacttat gtacttaaca gagcatcaaa atacatgaag cataaatgaa agaaccgtgg 56101 gagaaagtag acaaattaat gactgtagtt gaagatttca gtatccctct atgaaaatca 56161 gggtagtaca agtacacaga aaattggtaa agatatatga cttgaacaac attatcaacc 56221 aaattgacct catttacatt tgtggaatgt tccaactaag aacgtcagaa aacatactct 56281 tttcaagtgc acatggaaca tttaccaaga tagacaatat tttgggtcac cgcaagtctc 56341 aacacattga aaggattcag atcatataaa gtatgctcca tgaccatgat ggaattgaat 56401 tagaaaccaa taatgtatct ctggaaaata cacaaatatt tggaaattaa tatgcccttc 56461 taaaaaattt atgcatcaag aagaaatcaa aaagggatat ttgaaaagta ctatgaaact 56521 gatggccagg catggtgctc atcgcctgta atcccagcac tttgggaggc cgagaaagat 56581 ggatgaagtc aggagttcaa gaccagcctg ggcaacatgg cagaaccccg tctctactaa 56641 aaatacaaaa aattagccgg gcgtggtggt gggcgcctgt aatcccagca gtccacgtgt 56701 cgccgcccct ggtgatggac cagcggggct tcgacga // LOCUS HUMHSKPQZ7 3188 bp DNA PRI 17-NOV-1992 DEFINITION Human housekeeping (Q1Z 7F5) gene, exons 2 through 7, complete cds. ACCESSION M81806 NID g184406 KEYWORDS housekeeping protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3188) AUTHORS Van den Ouweland,A.M.W., Kioschis,P., Verdijk,M., Tamanini,F., Toniolo,D., Poustka,A. and van Oost,B.A. TITLE Identification and characterization of a new gene in the human Xq28 region JOURNAL Hum. Mol. Genet. 
1, 269-269 (1992) MEDLINE 93265036 FEATURES Location/Qualifiers source 1..3188 /organism="Homo sapiens" /db_xref="taxon:9606" exon 262..303 /number=2 CDS join(281..303,1099..1157,1248..1355,1564..1702,2225..2387, 2462..2614) /standard_name="Q1Z 7F5" /codon_start=1 /db_xref="PID:g184407" /translation="MGRRPARCYRYCKNKPYPKSRFCRGVPDAKIRIFDLGRKKAKVD EFPLCGHMVSDEYEQLSSEALEAARICANKYMVKSCGKDGFHIRVRLHPFHVIRINKM LSCAGADRLQTGMRGAFGKPQGTVARVHIGQVIMSIRTKLQNKEHVIEALRRAKFKFP GRQKIHISKKWGFTKFNADEFEDMVAEKRLIPDGCGVKYIPSRGPLDKWRALHS" intron 304..1098 /number=2 exon 1099..1157 /number=3 intron 1158..1247 /number=3 exon 1248..1355 /number=4 intron 1356..1563 /number=4 exon 1564..1702 /number=5 intron 1703..2224 /number=5 exon 2225..2387 /number=6 intron 2388..2461 /number=6 exon 2462..2677 /number=7 polyA_signal 2652..2657 BASE COUNT 637 a 853 c 837 g 861 t ORIGIN 1 cccgggcgca acgcaagagc ggctgcgtct atggtcatga cgtctgacag agcgtccacc 61 cgtcttcgac aggactctat ggttcttacg cgcgcagaca gaccgcctat ataagccatg 121 cgcaggcgga ggagcgcctc tttcccttcg gtgtggtgag taagcgcagt tgtcgtctct 181 tgcggtgccg ttgctggttc tcacaccttt taggtctgtt ctcgtcttcc gttccgactc 241 tctctttttc gttgcagcca ctgaagatcc tggtgtcgcc atgggccgcc gccccgcccg 301 ttggtgagtc ttgaatccgt gtactttcac tgctgggaaa cgggcgggga aagaagtgcc 361 tatggccgct gaaaacaatt gtggggtgga gcctcccccg tgcggcggcc ctgtcttggg 421 aactgaccct atgttttaca cctcccggct attttttagt ctgcaatatt actgtctgtt 481 ccttgcttcc cgtgtcggtg tggaagcgac ggttccctcg tatctctgcc tgtgtcctgc 541 aagctcaccg cattttcggg cgctagatac gctcttgggg cctttgtgtg cgttctctgt 601 cttatttccg ggcgacgtgc cgcgtgcgtt gtcggacgtg aagggcagtc cgggaaaaac 661 gggtgcggcc gcctcctgtg tcctacaggg ggcgccagac gcatttccac tcggcttgga 721 ggtggattta gtgccacgtg cccgaaagtc ttaaattggg tgacctgagc tgtgcaatga 781 taatgcggcg attttgtagt acgcagtgtc ttgaagagag aatttttaac taggaaagtt 841 tgttgcaaag tgtttatagg aagcgtaaga caaagtaacg gaagatggtg tctgttgttt 901 ctagtcttgg gtgttgtctg tgttgcagca gccaactgtt gctttgtagt ttatttcccc 961 gaatggaaac ggtttagaag 
tggacgtgca ttccccaccc ttttcccgtc cctcgtttgg 1021 gttgttcctt gaggggcaaa gtgcctgttg ggctttctgt gaacctcacc taacctgtgt 1081 tttttcactc ccctgcagtt accggtattg taagaacaag ccgtacccaa agtctcgctt 1141 ctgccgaggt gtccctggta agtagtggaa gagcccctgc actggttggc tctgcggact 1201 ccgcgtccgt ctgtgacacc ccctgcacac ttacccaatc cttttagatg ccaagattcg 1261 catttttgac ctggggcgga aaaaggcaaa agtggatgag tttccgcttt gtggccacat 1321 ggtgtcagat gaatatgagc agctgtcctc tgaaggtaag gcaggattct ttgttcgtca 1381 ccccccagtc cttcctccgt gctccctcaa ccccacccac atacactgca ctggaattgg 1441 gagttgatga aacatgagcc ttacaaaact cagccaacac agttcccctg agctggagat 1501 agtcgtggtg aatgtttctc tattatcttc tctcactttg ctgcttttct tctccctacc 1561 tagccctgga ggctgcccga atttgtgcca ataagtacat ggtaaaaagt tgtggcaaag 1621 atggcttcca tatccgggtg cggctccacc ccttccacgt catccgcatc aacaagatgt 1681 tgtcctgtgc tggggctgac aggtgagctt ggtctgggcc ttttaaggca gttggagtct 1741 ctacattatt aggcttgcat tattttatca gagcatagag gtggccccag tgactcagcc 1801 actatggcta ctagaaaagc caggctggca agtgactttc agtggtcact caggacccct 1861 tcctgcagat acaaacaaag catgagtaag tcttagcaag ctcttcccca caggttaggg 1921 gaaacactgg tgatgggagt agctcttctt gcttttagct gaatgacaac catcttgcca 1981 gcaggtagct gaagctggca gaggagccca gtggcgcctt ttcagtggtt cttgggatgc 2041 tccgcagcca attaagccga ctgagttcct ttcctcatgg ggacccagtg tgcgatggct 2101 gcacacagca gcttccttgg tagtgtacgc agcctgttgg ttgtatgggt tgctctaagg 2161 gaccttggag acaggccttt caggtggatg ttcatgtttc tgaccttgca ctaccccaat 2221 gtaggctcca aacaggcatg cgaggtgcct ttggaaagcc ccagggcact gtggccaggg 2281 ttcacattgg ccaagttatc atgtccatcc gcaccaagct gcagaacaag gagcatgtga 2341 ttgaggccct gcgcagggcc aagttcaagt ttcctggccg ccagaaggta tgtagtgctg 2401 cagccccctt ctcccacctt tgccccagcc tcctgactca gttctttcca ttgctcctta 2461 gatccacatc tcaaagaagt ggggcttcac caagttcaat gctgatgaat ttgaagacat 2521 ggtggctgaa aagcggctca tcccagatgg ctgtggggtc aagtacatcc ccagtcgtgg 2581 ccctctggac aagtggcggg ccctgcactc atgagggctt ccaatgtgct gcccccctct 2641 taatactcac caataaattc tacttcctgt 
ccacctatgt ctttgtatct acattcttga 2701 cggggaagga acttcctctg ggaacctttg ggtcattgcc ctttcacttc agaaacaggt 2761 tgacaactca gccctgctca tgaggcagca aaccctgcaa agggctggga ctggtggcct 2821 tatgtcagtt gtctactctg gagcttgact tgacctcccc aggtcctagg cagtaggttg 2881 aaaaacactg aagtgctttt catgaagcac agctgcagca aagccttgca atcccaggct 2941 ggggtcagcc tacagttgtg ttgcttatta caacacatgc ggaccaagag gggcttgtgg 3001 gctagaggct gaccagcagc gtttatttag caagggtagg tgtgcatcac attgggcttg 3061 ttctcaccca tctggtttgg ccattcctcc ttggtgggaa tcatccaggt actgctgagg 3121 tcacctgcga tttgccccat ttcctatctc tagcaacctc ctggccccat gcccccaccc 3181 cttctaga // LOCUS HUMHSP27X 2496 bp DNA PRI 10-FEB-1995 DEFINITION Human heat shock protein 27 (HSPB1) gene exons 1-3, complete cds. ACCESSION L39370 X03900 NID g662840 KEYWORDS heat shock protein 27. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hickey,E., Brandon,S.E., Potter,R., Stein,G., Stein,J. and Weber,L.A. TITLE Sequence and organization of genes encoding the human 27 kDa heat shock protein [published erratum appears in Nucleic Acids Res 1986 Oct 24;14(20):8230] JOURNAL Nucleic Acids Res. 
14 (10), 4127-4145 (1986) MEDLINE 86232547 FEATURES Location/Qualifiers source 1..2496 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" misc_feature 18..38 /gene="HSPB1" /note="heat control element; G00-127-602; putative" repeat_region 46..58 /gene="HSPB1" /note="abbreviated repeat of Pelham sequence; G00-127-602" CAAT_signal 130..137 /gene="HSPB1" /note="G00-127-602; putative" misc_RNA 159 /gene="HSPB1" /note="CAP site; G00-127-602; putative" mRNA join(159..613,1337..1400,1519..1829) /gene="HSPB1" /note="major mRNA; G00-127-602" exon 159..613 /gene="HSPB1" /note="G00-127-602" /number=1 gene join(159..613,1337..1400,1519..1829) /gene="HSPB1" prim_transcript 159..1829 /gene="HSPB1" /note="major transcript; G00-127-602" CAAT_signal 181..187 /gene="HSPB1" /note="G00-127-602; putative" misc_RNA 209 /gene="HSPB1" /note="CAP site; G00-127-602; putative" prim_transcript 209..1829 /gene="HSPB1" /note="minor transcript; G00-127-602" mRNA join(209..613,1337..1400,1519..1829) /gene="HSPB1" /note="minor mRNA; G00-127-602" exon 209..613 /gene="HSPB1" /note="alternative exon 1; G00-127-602" /number=1 CDS join(250..613,1337..1400,1519..1690) /gene="HSPB1" /codon_start=1 /db_xref="GDB:G00-127-602" /product="heat shock protein 27" /db_xref="PID:g662841" /translation="MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQ WLGGSSWPGYVRPLPPAAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVN HFAPDELTVKTKDGVVEITGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPE GTLTVEAPMPKLATQSNEITIPVTFESRAQLGGRSCKIR" intron 614..1336 /gene="HSPB1" /note="G00-127-602" /number=1 repeat_region 780..952 /gene="HSPB1" /note="homologous; G00-127-602" /rpt_family="Alu" exon 1337..1400 /gene="HSPB1" /note="G00-127-602" /number=2 intron 1401..1518 /gene="HSPB1" /note="G00-127-602" /number=2 exon 1519..1829 /gene="HSPB1" /note="G00-127-602" /number=3 polyA_signal 1804..1809 /gene="HSPB1" /note="G00-127-602" polyA_site 1829 /gene="HSPB1" /note="G00-127-602; putative" repeat_region 1922..2280 /gene="HSPB1" /note="homologous; 
G00-127-602" /rpt_family="Alu" BASE COUNT 546 a 757 c 734 g 459 t ORIGIN 1 gaattcattt gcttttcctt aacgagagaa ggttccagat gagggctgaa ccctcttcgc 61 cccgcccacg gcccctgaac gctgggggag gagtgcatgg ggaggggcgg ccctcaaacg 121 ggtcattgcc attaatagag acctcaaaca ccgcctgcta aaaatacccg actggaggag 181 cataaaagcg cagccgagcc cagcgccccg cacttttctg agcagacgtc cagagcagag 241 tcagccagca tgaccgagcg ccgcgtcccc ttctcgctcc tgcggggccc cagctgggac 301 cccttccgcg actggtaccc gcatagccgc ctcttcgacc aggccttcgg gctgccccgg 361 ctgccggagg agtggtcgca gtggttaggc ggcagcagct ggccaggcta cgtgcgcccc 421 ctgccccccg ccgccatcga gagccccgca gtggccgcgc ccgcctacag ccgcgcgctc 481 agccggcaac tcagcagcgg ggtctcggag atccggcaca ctgcggaccg ctggcgcgtg 541 tccctggatg tcaaccactt cgccccggac gagctgacgg tcaagaccaa ggatggcgtg 601 gtggagatca ccggtgagcc cccctgctcc tgcaggggag aggaggaggc tagcagggcg 661 ggcagggccg ggggcgtgcg gttgaaacgg gggtcccggg ggcctgggga gttaaacgtt 721 ggcccagcac cgggaaaaac aggactcctg attcccttgc tcaggaattg ggagtgcggg 781 tcgcttctaa gggcgctttc tgctctgtaa tcccagcgct ttgggaggcc gagacgggag 841 gatcgcttga ggccaggagt tcaagactag cctgggcaac atagcgagac gcgccccccc 901 gccccgaccc cgcgccatta caaaaaaaaa gcaaacaaaa atttttttaa agatcatcga 961 tgaagagaga aaatgcgctt ttctacagag tccccttccc acccacagcc ccatccccag 1021 ataagcgggg agttccctgg cgcggtgcca gtttctagcc gctgagtggg cgtgtgcgcg 1081 gctccaagtg cgcctgcgta ctgctcactc cccagctccg cgccctgctc cgttcctccc 1141 aaaactctga atcgaagaac tttccggaag tttctgagag cccagaccgg cgggcacgcc 1201 cccatcccca accccctctg ttaatcccta ccagcctgca gtcctggctg cttccaagca 1261 ggaggtgggg cctctggcta gcggggccga aaaagtcccc tcccccgcat gtctgatttc 1321 cctcttcccc ccaaaggcaa gcacgaggag cggcaggacg agcatggcta catctcccgg 1381 tgcttcacgc ggaaatacac gtgagtcctg gcgccaggtc ggggtgggtg ggtggcgtgg 1441 gggtggggtc agggaagagg gcacagggac ccacccggtg tgtaatgtaa cgcttgcctt 1501 tcctctctgc acgtccaggc tgccccccgg tgtggacccc acccaagttt cctcctccct 1561 gtcccctgag ggcacactga ccgtggaggc ccccatgccc aagctagcca cgcagtccaa 1621 cgagatcacc atcccagtca 
ccttcgagtc gcgggcccag cttgggggca gaagctgcaa 1681 aatccgatga gactgccgcc aagtaaagcc ttagcccgga tgcccacccc tgctgccgcc 1741 actggctgtg cctcccccgc cacctgtgtg ttcttttgat acatttatct tctgtttttc 1801 tcaaataaag ttcaaagcaa ccacctgtca ctggcccagg ccctggtgtt tgtggaagga 1861 agcctcaggc acctgccatt tgctggcttt caggagtcat ctttgctcag gcccgtgctg 1921 ggccatgtgg gtacactggt gtaggttgct ggacacaggc tgactcacat ccataaagac 1981 agaggtctta gggccgggcg cagtggctca tacctacaat cccagcactt tggggggttg 2041 aagcaggagg agtgcttgaa gccaagagtt ctagaccagc ctggacaaca tagtaagact 2101 gtctctaaaa aataaaaatt aggcagggtg gtactgcacg cctgtagtcc cagctactca 2161 ggaggctgag gcaggaggat cgcttgagcc cagagttgtg aaggtacagt gagctaacat 2221 cgtgccattg cactccagcc tgggcaacag aacaagatcc tgtctcaaaa caaccaaaag 2281 cccagagaga aagagtgaga ccccatcttt aaaagaaaaa aaaaaaaggt catgattgca 2341 aggtcacgat tgcaattaaa actgtaaggt ggggaaggag gaggaaataa gagaagcacc 2401 tgaggcttga gttctcagga gcacctaggt tgggtcccag gtgaaggggc acagaggtaa 2461 ttgcacctca gagctgatgg gaggattact atgtca // LOCUS HUMHSP89KD 7393 bp DNA PRI 07-MAR-1995 DEFINITION Homo sapiens heat shock protein (HSP89-alpha) gene, complete cds. ACCESSION M27024 NID g341598 KEYWORDS heat shock protein HSP89-alpha. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7393) AUTHORS Hickey,E., Brandon,S.E., Smale,G., Lloyd,D. and Weber,L.A. TITLE Sequence and regulation of a gene encoding a human 89-kilodalton heat shock protein JOURNAL Mol. Cell. Biol. 
9 (6), 2615-2626 (1989) MEDLINE 89343979 FEATURES Location/Qualifiers source 1..7393 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" TATA_signal 253..259 /gene="HSP89-alpha" exon 282..341 /gene="HSP89-alpha" /number=1 mRNA join(282..341,946..1107,1200..1566,1896..2029,2329..2646, 2763..2928,3344..3534,3633..3780,4024..4292,4882..5215, 5505..5614) /gene="HSP89-alpha" gene join(282..341,946..1107,1200..1566,1896..2029,2329..2646, 2763..2928,3344..3534,3633..3780,4024..4292,4882..5215, 5505..5614) /gene="HSP89-alpha" intron 342..945 /gene="HSP89-alpha" /number=1 CDS join(946..1107,1200..1566,1896..2029,2329..2646, 2763..2928,3344..3534,3633..3780,4024..4292,4882..5215, 5505..5614) /gene="HSP89-alpha" /codon_start=1 /product="heat shock protein" /db_xref="PID:g703087" /translation="MPEETQTQDQPMEEEEVETFAFQAEIAQLMSLIINTFYSNKEIF LRELISNSSDALDKIRYESLTDPSKLDSGKELHINLIPNKQDRTLTIVDTGIGMTKAD LINNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKVTVITKHNDDEQY AWESSAGGSFTVRTDTGEPMGRGTKVILHLKEDQTEYLEERRIKEIVKKHSQFIGYPI TLFVEKERDKEVSDDEAEEKEDKEEEKEKEEKESEDKPEIEDVGSDEEEEKKDGDKKK KKKIKEKYIDQEELNKTKPIWTRNPDDITNEEYGEFYKSLTNDWEDHLAVKHFSVEGQ LEFRALLFVPRRAPFDLFENRKKKNNIKLYVRRVFIMDNCEELIPEYLNFIRGVVDSE DLPLNISREMLQQSKILKVIRKNLVKKCLELFTELAEDKENYKKFYEQFSKNIKLGIH EDSQNRKKLSELLRYYTSASGDEMVSLKDYCTRMKENQKHIYYITGETKDQVANSAFV ERLRKHGLEVIYMIEPIDEYCVQQLKEFEGKTLVSVTKEGLELPEDEEEKKKQEEKKT KFENLCKIMKDILEKKVEKVVVSNRLVTSPCCIVTSTYGWTANMERIMKAQALRDNST MGYMAAKKHLEINPDHSIIETLRQKAEADKNDKSVKDLVILLYETALLSSGFSLEDPQ THANRIYRMIKLGLGIDEDDPTADDTSAAVTEEMPPLEGDDDTSRMEEVD" exon 946..1107 /gene="HSP89-alpha" /number=2 intron 1108..1199 /gene="HSP89-alpha" /number=2 exon 1200..1566 /gene="HSP89-alpha" /number=3 intron 1567..1895 /gene="HSP89-alpha" /number=3 exon 1896..2029 /gene="HSP89-alpha" /number=4 intron 2030..2328 /gene="HSP89-alpha" /number=4 exon 2329..2646 /gene="HSP89-alpha" /number=5 intron 2647..2762 /gene="HSP89-alpha" /number=5 exon 2763..2928 /gene="HSP89-alpha" /number=6 intron 2929..3343 
/gene="HSP89-alpha" /number=6 exon 3344..3534 /gene="HSP89-alpha" /number=7 intron 3535..3632 /gene="HSP89-alpha" /number=7 exon 3633..3780 /gene="HSP89-alpha" /number=8 intron 3781..4023 /gene="HSP89-alpha" /number=8 exon 4024..4292 /gene="HSP89-alpha" /number=9 intron 4293..4881 /gene="HSP89-alpha" /number=9 exon 4882..5215 /gene="HSP89-alpha" /number=10 intron 5216..5504 /gene="HSP89-alpha" /number=10 exon 5505..6277 /gene="HSP89-alpha" /number=11 polyA_signal 6255..6260 /gene="HSP89-alpha" polyA_site 6277 /gene="HSP89-alpha" BASE COUNT 2091 a 1436 c 1902 g 1964 t ORIGIN 1 gaattccggg ccgcgcaggc gcattgaggc cggcgcgggg gcgtgcgaga ggcgcgcggc 61 ggcgattgag ggaaggttgc ccgggccggg ccacgagcag ggcctgcgcg cgcccgcagg 121 aaggcgcggg ggcggggtgc cgcggcaacg gcgcatgcgt aggcgcgcgg ccgcggcggc 181 ggctggggag ggttcttccg gaagttgggg aggcttctgg aaaaagcgcc gcgcgctggg 241 cgggcccgtg gctatataag gcaggcgcgg gggtggcgcg tcagttgctt cagcgtcccg 301 gtgtggctgt gccgttggtc ctgtgcggtc acttagccaa ggtgacgggt ggttgtgggg 361 gcctgtggag gtggactggg gaccgggact gggggcggct ggcgcggggc ggaggctgtg 421 ggggccggac cagatccctg aagcagccag ggccgcgctc ccgggcggcc gcgctttcta 481 ttccggaggc ctcgggaccg ctgcggtttc cgcaccccgc ttcaggaatg ggccgggcgg 541 ccgcgggctc cccgaggctc tgcagcagcg ccagagctgg cgccgcacgc gaacagagcg 601 gccccgccgg gggtcccccg cccggccccg ctgggggcgc aggcggagga ggccgccgtg 661 gcggagccgc agcccgcgcg ggagggcacc cggggtgttc gttggggacc gcggcggggg 721 actgggcccg ggctgcggcg gcctcccgga agcgcgcaca cgctcgtggt agttgccgcg 781 ctccgaaatg aggtcatcct ttgtcagccg cccttctatt ttcggtttac ttaaacttcg 841 caaatctcca attcggcttt aaaaatattt ggagaacgcg atggaatctc gtgttgcctc 901 tgtagacgtc ctgcaaggtt ttaaaccggc gcggtgtcgt tccagatgcc tgaggaaacc 961 cagacccaag accaaccgat ggaggaggag gaggttgaga cgttcgcctt tcaggcagaa 1021 attgcccagt tgatgtcatt gatcatcaat actttctact cgaacaaaga gatctttctg 1081 agagagctca tttcaaattc atcagatgta agtcacttat taacccagaa tcggattttg 1141 gtttcagtgt gaacttcttg ggggtgctgt atgcttaaat taatattttt tgttaacagg 1201 cattggacaa 
aatccggtat gaaagcttga cagatcccag taaattagac tctgggaaag 1261 agctgcatat taaccttata ccgaacaaac aagatcgaac tctcactatt gtggatactg 1321 gaattggaat gaccaaggct gacttgatca ataaccttgg tactatcgcc aagtctggga 1381 ccaaagcgtt catggaagct ttgcaggctg gtgcagatat ctctatgatt ggccagttcg 1441 gtgttggttt ttattctgct tatttggttg ctgagaaagt aactgtgatc accaaacata 1501 acgatgatga gcagtacgct tgggagtcct cagcaggggg atcattcaca gtgaggacag 1561 acacaggtag gcacctgaac attcactggt taagtgagcg gtggaagggt ggggtgctgc 1621 aacccttgga tccctgggga tgcggtgaag cttacagaat ttgtgtttgc ccaacgatca 1681 ggaacaagct agagcactta attagtgtaa tggcatgact tggtgggcgg tccttttggg 1741 cgaggcacct tgcccaaaag gttctgctgt ggtggccttt gcatctgtag taacggcaaa 1801 ccttgcttta tagacacgtt ataagggtgt taggcatggc gttgagcact aaattgaatg 1861 tattattaag tcaatctttc tttcttctct tgcaggtgaa cctatgggtc gtggaacaaa 1921 agttatccta cacctgaaag aagaccaaac tgagtacttg gaggaacgaa gaataaagga 1981 gattgtgaag aaacattctc agtttattgg atatcccatt actctttttg taagttttta 2041 tgtaattgca gagtgaattt ctgtctgtag gtgattgggg tgactgtact acatccttag 2101 tccctagatc tgttcaacta gtctgaagcc tggggaacct aagctctact acctaactct 2161 gtaaaagttc ctcgggctat taagtggatg tatcaaatat tgatgaaaag ctgggtgctt 2221 ctcaagttgg ctaatttaga cttgggcctt aggttgaaaa tagtggtgca gcgctgtcgc 2281 ttagttctgg agtgcagcag tgatgtgatt tcgtgttttc ttttgaaggt ggagaaggaa 2341 cgtgataaag aagtaagcga tgatgaggct gaagaaaagg aagacaaaga agaagaaaaa 2401 gaaaaagaag agaaagagtc ggaagacaaa cctgaaattg aagatgttgg ttctgatgag 2461 gaagaagaaa agaaggatgg tgacaagaag aagaagaaga agattaagga aaagtacatc 2521 gatcaagaag agctcaacaa aacaaagccc atctggacca gaaatcccga cgatattact 2581 aatgaggagt acggagaatt ctataagagc ttgaccaatg actgggaaga tcacttggca 2641 gtgaaggtga gtgactgatg ggtgcttcaa gcttgtcctt agattatcat ctttctccac 2701 caccccaaat atcttctata atgcattgtt cattggtact tagtgtatct gttttattac 2761 agcatttttc agttgaagga cagttggaat tcagagccct tctatttgtc ccacgacgtg 2821 ctccttttga tctgtttgaa aacagaaaga aaaagaacaa catcaaattg tatgtacgca 2881 gagttttcat catggataac 
tgtgaggagc taatccctga atatctgagt aagtatagat 2941 aggaaaaata atcactgtca ctgattaaag aagtactttc tgggtgggca tggtggctca 3001 cacctataat cctagcactt tgggaggccg gggtgggcag atcacttgag gtcaggagtt 3061 caagaccagc ctgggcaaca tggtgaaacc ccatctctac taaattacaa aaattagccc 3121 catgtagtgg caggctgagg catgaaaaca gcttgaaccc ccccgggagg cagaggttgc 3181 agtgagtgga gattgcgcca ctgccctcca gcctgggcaa cagagtgaga ctgtctcaaa 3241 aatacgtccc atcatcttca ggttatcttt gattttggga gtttacatag taccagtttt 3301 gtccttggaa tgactcagtg catttggttt atattttttt cagacttcat tagaggggtg 3361 gtagactcgg aggatctccc tctaaacata tcccgtgaga tgttgcaaca aagcaaaatt 3421 ttgaaagtta tcaggaagaa tttggtcaaa aaatgcttag aactctttac tgaactggcg 3481 gaagataaag agaactacaa gaaattctat gagcagttct ctaaaaacat aaaggttggt 3541 gtaaataacc attagttttc caattggcct ctttagtttt tttttttttt ttttaattca 3601 gaaagtcttt taaagaacat actttgtttc agcttggaat acacgaagac tctcaaaatc 3661 ggaagaagct ttcagagctg ttaaggtact acacatctgc ctctggtgat gagatggttt 3721 ctctcaagga ctactgcacc agaatgaagg agaaccagaa acatatctat tatatcacag 3781 gtaagagaac actatgttac agtcatacac gtcgttctta caaccttgta ggctctgtgg 3841 gtgtgttttc tactcaggta gcactgttac aactggtatt gatctaggca agataattaa 3901 catgaactag gtcattttct gtcttaggtt ctgcctaggt atctggctag caagaaaagt 3961 cagagctaga tgaaaccatt cttaactgtt aaaaggtcta aaagtaactt tgtaatacct 4021 caggtgagac caaggaccag gtagctaact cagcctttgt ggaacgtctt cggaaacatg 4081 gcttagaagt gatctatatg attgagccca ttgatgagta ctgtgtccaa cagctgaagg 4141 aatttgaggg gaagacttta gtgtcagtca ccaaagaagg cctggaactt ccagaggatg 4201 aagaagagaa aaagaagcag gaagagaaaa aaacaaagtt tgagaacctc tgcaaaatca 4261 tgaaagacat attggagaaa aaagttgaaa aggtatgtga atacagcatt tcctgatcat 4321 tgatacttct aaggtgcttt caagcttagt catacatagc ccattttcgc atgttttcaa 4381 cttaaaacag aaaactatgt cgtgtgtggc tgggcgcggt ggctcacgcc tgcaatccca 4441 gcactttggg aggctgaggc agcggatcac aaggtcagga gatcgagacc atcctggcta 4501 acacggtgaa actcagtctc tactaaaaat agaaaaaaat aaaccaggcg tggtggcacg 4561 gcctgtaatc ctagccactt gggaggctga 
ggcaggagaa tcgcctgaac ccaggaggcg 4621 gaggttgcag tgagccaaga tcgcaccact gcactccagc ctgggtgatg gagcgagact 4681 ctatctcaaa aaaaaaattg tgcatgtaaa acatgaaatt ataacctgtg ctctttggat 4741 acctaatgcg acatttaagt tgtatttgac agtagatagt attttggatc tattgaaatt 4801 tgggttctac agatttcatt tcacaatgaa agtttaggat taatctttct aggttcctag 4861 tcatcacttt ttggattaca ggtggttgtg tcaaaccgat tggtgacatc tccatgctgt 4921 attgtcacaa gcacatatgg ctggacagca aacatggaga gaatcatgaa agctcaagcc 4981 ctaagagaca actcaacaat gggttacatg gcagcaaaga aacacctgga gataaaccct 5041 gaccattcca ttattgagac cttaaggcaa aaggcagagg ctgataagaa cgacaagtct 5101 gtgaaggatc tggtcatctt gctttatgaa actgcgctcc tgtcttctgg cttcagtctg 5161 gaagatcccc agacacatgc taacaggatc tacaggatga tcaaacttgg tctgggtaag 5221 ccttatacta tgtaatgtta aaaagaaaat aaacacacgt gacattgaag aaaatgggta 5281 aactttcagt tatccaaact tggagcacct tgtctgcttg ctgcttggag gtattaaagt 5341 atgttttttt tagggataag taaggtctta caagagcaaa gaaatgaaat tgagactcat 5401 atgtcctgta atactgtctt gaaagcagat agaaaccaag agtattaccc taatagctgg 5461 ctttaagaaa tctttgtaat atgaggattt tattttggaa acaggtattg atgaagatga 5521 ccctactgct gatgatacca gtgctgctgt aactgaagaa atgccacccc ttgaaggaga 5581 tgacgacaca tcacgcatgg aagaagtaga ctaatctctg gctgagggat gacttacctg 5641 ttcagtactc tacaattcct ctgataatat attttcaagg atgtttttct ttatttttgt 5701 taatattaaa aagtctgtat ggcatgacaa ctactttaag gggaagataa gatttctgtc 5761 tactaagtga tgctgtgata ccttaggcac taaagcagag ctagtaatgc tttttgagtt 5821 tcatgttggt ttattttcac agattggggt aacgtgcact gtaagacgta tgtaacatga 5881 tgttaacttt gtggtctaaa gtgtttagct gtcaagccgg atgcctaagt agaccaaatc 5941 ttgttattga agtgttctga gctgtatctt gatgtttaga aaagtattcg ttacatcttg 6001 taggatctac ttttcgaact tttcattccc tgtagttgac aattctgcat gtactagtcc 6061 tctagaaata ggttaaactg aagcaacttg atggaaggat ctctccacag ggcttgtttt 6121 ccaaagaaaa gtattgtttg gaggagcaaa gttaaaagcc tacctaagca tatcgtaaag 6181 ctgttcaaaa ataactcaga cccagtcttg tggatggaaa tgtagtgctc gagtcacatt 6241 ctgcttaaag ttgtaacaaa tacagatgag ttaaaagata 
ttgtgtgaca gtgtcttatt 6301 tagggggaaa ggggagtatc tggatgacag ttagtccaaa atgtaaaaca ctaggcgctc 6361 agacggagat ggttaaacac tagctgctcc aagggttgac atggtcttcc cagcatgtac 6421 tcagcaggtg tggggtggag cacatgtagg cacagaaaac aggaatgcag acaacatgca 6481 tcccctgcgt ccatgagtta catgtgttct cttagtgtcc acgttgtttt gatgttattc 6541 atggaatacc ttctgtgcta aatacagtca cttaattcct tggccttaca gtgtctcaaa 6601 gttctttaca aatctactta aagccatcct ggctgaggca ggagaatcgc ttgaacctgg 6661 gaggcggagg ttgcggtggg ctgggattgc accattgcac tgcagcctga gcagcaagag 6721 cgaaactcca tctcaaaaaa aagatgaaaa aatcagccca ataggtaaga gtccctctgg 6781 gactgggggt ggtaatggct aaataaaagt cttcacttct gaaatttggt ggttctagag 6841 acaaaacttg gttagatagc atataggatt tgcattctga gcctaacgcc atatataaaa 6901 atgataggaa atgcctggga cttggatatg tgacagtgtc taagggggca gtgcacaggc 6961 agggagagac cttgtctcag tctcgtgcta agcaggtatt gaatgcttgg taaatggcaa 7021 tcacgtttag tcctttaaat tctctgaaaa gttcttaatc cccttcgata atatacagga 7081 ggaattacgt aaattgaggt cttagtagtg gctgaattta aatttaactc caaagcccca 7141 ttttattctg ctaagttttg ttgcgtactg aaaggtgtta gaagaaaggt ttacggtgtt 7201 aaactaggaa gtgaatggca aaaagggtaa aaagtcaatt ttgttttttc ccaggtctag 7261 caagagaata ttaaatctta agatttcttt caattaaaaa cataaggcac ttgagggaat 7321 caagtcaata tagaacctct ataaatgatc tcccaaagta actcatcctc ctggagttac 7381 tgggaaagca ttt // LOCUS HUMHSP90B 8210 bp DNA PRI 15-DEC-1989 DEFINITION Human 90 kD heat shock protein gene, complete cds. ACCESSION J04988 NID g184422 KEYWORDS heat shock protein. SOURCE Human fetal liver DNA, clone lambda-g24A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8210) AUTHORS Rebbe,N.F., Hickman,W.S., Ley,T.J., Stafford,D.W. and Hickman,S. TITLE Nucleotide sequence and regulation of a human 90-kDa heat shock protein gene JOURNAL J. Biol. Chem. 
264, 15006-15011 (1989) MEDLINE 89359311 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Hickman, 23-JUN-1989. FEATURES Location/Qualifiers source 1..8210 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 453..466 /note="heat shock element 1" misc_feature 463..476 /note="heat shock element 2" misc_feature 567..580 /note="heat shock element 3" CAAT_signal 1016..1019 TATA_signal 1076..1082 prim_transcript 1103..7886 /note="HSP mRNA and introns" intron 1202..2634 /note="HSP intron A" misc_feature 1785..1798 /note="heat shock element 4" misc_feature 1838..1851 /note="heat shock element 5" misc_feature 2437..2450 /note="heat shock element 6" CDS join(2635..2781,3381..3587,3679..3838,4025..4158, 4295..4603,5052..5217,5421..5611,5740..5887,6002..6270, 7048..7381,7492..7601) /partial /note="90 kD heat shock protein" /codon_start=1 /db_xref="PID:g386786" /translation="MPEEVHHGEEEVETFAFQAEIAQLMSLIINTFYSNKEIFLRELI SNASDALDKIRYESLTDPSKLDSGKELKIDIIPNPQERTLTLVDTGIGMTKADLINNL GTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYLVAEKVVVITKHNDDEQYAWESS AGGSFTVRADHGEPIGRGTKVILHLKEDQTEYLEERRVKEVVKKHSQFIGYPITLYLE KEREKEISDDEAEEEKGEKEEEDKDDEEKPKIEDVGSDEEDDSGKDKKKKTKKIKEKY IDQEELNKTKPIWTRNPDDITQEEYGEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLF IPRRAPFDLFENKKKKNNIKLYVRRVFIMDSCDELIPEYLNFIRGVVDSEDLPLNISR EMLQQSKILKVIRKNIVKKCLELFSELAEDKENYKKFYEAFSKNLKLGIHEDSTNRRR LSELLRYHTSQSGDEMTSLSEYVSRMKETQKSIYYITGESKEQVANSAFVERVRKRGF EVVYMTEPIDEYCVQQLKEFDGKSLVSVTKEGLELPEDEEEKKKMEESKAKFENLCKL MKEILDKKVEKVTISNRLVSSPCCIVTSTYGWTANMERIMKAQALRDNSTMGYMMAKK HLEINPDHPIVETLRQKAEADKNDKAVKDLVVLLFETALLSSGFSLEDPQTHSNRIYR MIKLGLGIDEDEVAAEEPNAAVPDEIPPLEGDEDASRMEEVD" exon <2635..2781 /note="90 kD heat shock protein, (first expressed exon)" /number=2 intron 2782..3380 /note="HSP intron B" exon 3381..3587 /number=3 intron 3588..3678 /note="HSP intron C" exon 3679..3838 /number=4 intron 3839..4024 /note="HSP intron D" exon 4025..4158 /number=5 intron 4159..4294 /note="HSP intron E" exon 4295..4603 /number=6 
intron 4604..5051 /note="HSP intron F" exon 5052..5217 /number=7 intron 5218..5420 /note="HSP intron G" exon 5421..5611 /number=8 intron 5612..5739 /note="HSP intron H" exon 5740..5887 /number=9 intron 5888..6001 /note="HSP intron I" exon 6002..6270 /number=10 intron 6271..7047 /note="HSP intron J" exon 7048..7381 /number=11 intron 7382..7491 /note="HSP intron K" exon 7492..>7601 /note="90 kD heat shock protein" /number=12 BASE COUNT 1871 a 1839 c 2186 g 2314 t ORIGIN 5 bp upstream of SstI site. 1 gagctccggc tgccctgcac tggttcccag agactccctc cttcccaggt ccaaatggct 61 gcaggagcga agtgggcgga aaaaaagcga accagcttga gaaagggctt gacgtgcctg 121 cgtagggagg gcgcatgtcc ccgtgctccg tgtacgtggc ggccgcaggg gctagagggg 181 ggtccccccc gcaggtactc cactctcagt ctgcaaaagt gtacgcccgc agagccgccc 241 caggtgcctg ggtgttgtgt gattgacgcg gggaaggagg ggtcagccga tccctcccca 301 accctccatc ccatccctga ggattgggct ggtacccgcg tctctcggac aggtcagagc 361 gggtcgccgg gtggggtcgc tgcaaaaacc ctgccccggc cgcagccgag aggcggacgt 421 cgcggggagg gggcgggacc gccgagacag gcctggaaac tgctggaaat gccgcagtgc 481 cgccgccgcc ccttccgccg catgtcggca aagagtcccc gccagccccg gccggcgccc 541 tccccctacg ctgagctgcc cctcagcgcg aaccctccgc ccttcctcta ctcctgcgag 601 agtcgggatc tggggctacc caaggttggg tcccgaatgc cagtccctct gtcgggacgc 661 gagatgtgta gggcagatgc taggaagaag attgggtctg ggacgggtgg tccgcgtggt 721 tagctgcctc cgctcttttt cggtgtcccc cccagtcccg cccttgggtg tggggacgcc 781 tgccccacaa gtgtttaggg aggtcagtgg gttcctcgcc cgtagagaca ccgtttatgc 841 caaatgagca ctcctcatcc ccgctcttga tggagtcatg tcctagacgt gaaactatgg 901 ggctgtgatc acaagcaaat gtgtgggcgg atccgttgct tgggttcttc cccgccccct 961 ctttttttcg gaccatgacg tcaaggtggg ctggtggcgg caggtgcggg gttgacaatc 1021 atactccttt aaggcggagg gatctacagg agggcggctg tactgtgctt cgccttatat 1081 agggcgactt ggggcacgca gtagctctct cgagtcactc cggcgcagtg ttgggactgt 1141 ctgggtatcg gaaagcaagc ctacgttgct cactattacg tataatcctt ttcttttcaa 1201 ggtaaggctg agatctccgc taggcttctt tccctttagt gctgtattcg tgttgttttt 1261 gtttttttct gtcctttagg 
gagccttagt ctagatgtcg gggtggcttg tggataacgc 1321 tctggatttt tatagggtga gggtagtggt gggtgaggtt ttttgagtcc tcctcggttt 1381 tctctagtgt gtttgggggg tggggctttc tctcggcgcc tgctggccgt agcgaggtgg 1441 gctgtggggt tggggcagtg ggcggctggc agctgcacgt ggtggccgcg cggcccggga 1501 cgctgccatt tttgcccctc cacttccgga cgcggctacg gggcgtcgga gggggaccgc 1561 aggtggcggg ggtgcccgct cgggtgactc agcacggcct tgtgggactg gctttgtcac 1621 ctctcttatc ggacgcgttg ttaaagcctt cttgggtgct ttgtttctgt gagggagggt 1681 tgacggtgtg ggaagagagc tttcggtctc cagcacccga tactccctcc ttccagatct 1741 ttcttgcagt cccggtggag gaggggcggg gaggggagca ggttctggaa gattcatggg 1801 ctccttcctc cgcccttcct cgagagctga gattgttctg gaagcttctg gattctggcg 1861 ccccgcccca gtgcccggat gctgggggcg agggagggtg cactgcggcg ccccctcctc 1921 gcgtggtcct ggccgacgca tgtccggcag tgacgagtgt cggcctggtg gctacggcca 1981 ccatctttct tgggtttggt cctgttctgt aattttgtgc tgtgaaaggg tcgtggtgga 2041 gcttttggct taagaattct ttgtccggat ttaattgctc ctccggtggg tatcgtatgg 2101 atcccaggtt attcctccct gccctatggg caggagtgtc ccgcccttgg actggtctta 2161 ggaactgaca cctcaggggg agcagtttaa agttagtgcc atttttatct taaactagtc 2221 actttgacct cccccaaata aagaactgta ggtagtgatt ttcacattta aatttgtgta 2281 aggattactt gggatctcta gatacctggg ttggaccaac attatgattt ttctgccata 2341 ctaccagatg atgctgaggc tgctggtcac cattctttaa gtaggtgggt tctgtgacat 2401 ttggttgaag aatatttagc ttattttctt tttccttctg aattttcagg cctcccactt 2461 agtgtgtagt ctgagatctt taagagaatg catttttagt cttgggaagg gatagtactc 2521 cggttaaacc agtctgaact cactgtctaa ggtcctaaca aatgatatga cctttaggat 2581 ttttaaacat ggggccttag tgttcttttg taattaatga gatttttatt ttagatgcct 2641 gaggaagtgc accatggaga ggaggaggtg gagacttttg cctttcaggc agaaattgcc 2701 caactcatgt ccctcatcat caataccttc tattccaaca aggagatttt ccttcgggag 2761 ttgatctcta atgcttctga tgtaggtgct ctggtttcca catttggcat ggtttttttt 2821 tttgatactc tagaaggagg ggaaaggagt ggtttggcct ttgttgggga ctactattga 2881 agggggtaaa cttgcagcta ttccaaaaag atgggtttta ctctggccat cttgaacttg 2941 gaagggacta tgtccagaat aagtgggctc 
atggaactaa ctggttctaa agcctcaaga 3001 tagggggcaa tcagatttga ggctgagaga ggtaaaccaa gattttcttt gaagatacgg 3061 gctttaagaa agcaaaagtg gctgagcgtg ttggctcaca cctgtaatgc caaggcagga 3121 ggatcacttg agcctaggat ttcgagggca gcctgggcaa ccgcgagacc ttgtctctgc 3181 aaaaaattaa atattagttg ggcacggtgg catgtgctgt agtcccagct acttgggaag 3241 ctggggttgg gaggatggct cgaacctggg aggtcaaggc tgcagtgagc tgtcatcctt 3301 gccactgcac tgtagcttgg gcaacagagc aagagtctgt cttggaaaga gcaaaagtaa 3361 gttgctgttt gtatttccag gccttggaca agattcgcta tgagagcctg acagaccctt 3421 cgaagttgga cagtggtaaa gagctgaaaa ttgacatcat ccccaaccct caggaacgta 3481 ccctgacttt ggtagacaca ggcattggca tgaccaaagc tgatctcata aataatttgg 3541 gaaccattgc caagtctggt actaaagcat tcatggaggc tcttcaggta ttgcagttct 3601 gtaggcattc atacttatct gtgttctttg gttttttgct tctttaaaac ttgtgattga 3661 ctttaaactt gttggcaggc tggtgcagac atctccatga ttgggcagtt tggtgttggc 3721 ttttattctg cctacttggt ggcagagaaa gtggttgtga tcacaaagca caacgatgat 3781 gaacagtatg cttgggagtc ttctgctgga ggttccttca ctgtgcgtgc tgaccatggt 3841 aagttagctt ttctgttaca aggtagttgg gttaggattt tctgggctca caccagtagc 3901 agaaattttg ggcatcctgt ctgtaaagca gttcttcaca gcagttctgc tgatacttac 3961 taattgctgg tctcaactgc atatactttt taccctgtta cacgcttgta attgactctt 4021 ctaggtgagc ccattggcag gggtaccaaa gtgatcctcc atcttaaaga agatcagaca 4081 gagtacctag aagagaggcg ggtcaaagaa gtagtgaaga agcattctca gttcataggc 4141 tatcccatca ccctttatgt gagtatggac ttttaaatct tttacactta acgtgcagga 4201 tgtttcctgt tctggagaat ctcattgtcc ctggcttttg ctttccctgg tagtgttttg 4261 tactccaagg ctaacttctg tttttgttac ttagttggag aaggaacgag agaaggaaat 4321 tagtgatgat gaggcagagg aagagaaagg tgagaaagaa gaggaagata aagatgatga 4381 agaaaaaccc aagatcgaag atgtgggttc agatgaggag gatgacagcg gtaaggataa 4441 gaagaagaaa actaagaaga tcaaagagaa atacattgat caggaagaac taaacaagac 4501 caagcctatt tggaccagaa accctgatga catcacccaa gaggagtatg gagaattcta 4561 caagagcctc actaatgact gggaagacca cttggcagtc aaggtgtgag aagcctttgc 4621 atgttggctc aacatgcaca tatggagagg aatgagttag 
gtggaagagt gttgggtaat 4681 agacacacgg aacttgtcca actgataaca gaaatgtgat agccatgtga tttcacttac 4741 tgattaccct gtcatagtga agtgccatca tttctaatga cctcactttc tcttcttatg 4801 gaaatctggg taatgtctat tggcagcctt acacatccag ggttctgatc agaggggact 4861 gttttctaca tacagctagt acccatctag atcgtggagg gcattaaggc tcagttttct 4921 caggagctgc ttgttgtgtg tgctctatcc cttaggctta gggaggatca ttgttccact 4981 tttagataat ttggtgttgg ggctaaaagg tcctcttttg aaatgtacca cttatttttg 5041 gtttctttca gcacttttct gtagaaggtc agttggaatt cagggcattg ctatttattc 5101 ctcgtcgggc tccctttgac ctttttgaga acaagaagaa aaagaacaac atcaaactct 5161 atgtccgccg tgtgttcatc atggacagct gtgatgagtt gataccagag tatctcagtg 5221 agtatctcct tggcctaatt tagttgggtg aagtcttggg aggttttagg cattctgcta 5281 ggatattcta aggtaacagt tttctgcaat acatagtagg tgtaagggtt caggaggcta 5341 ttagagcctt ctgtttgaat ctggggacca ggtctggtct agctgttttt actgagcttt 5401 ctcaccctgg ttgatggcag attttatccg tggtgtggtt gactctgagg atctgcccct 5461 gaacatctcc cgagaaatgc tccagcagag caaaatcttg aaagtcattc gcaaaaacat 5521 tgttaagaag tgccttgagc tcttctctga gctggcagaa gacaaggaga attacaagaa 5581 attctatgag gcattctcta aaaatctcaa ggtaaaaagg caaataatgc ttattccctt 5641 taccactttc ttagtaataa caataaatta ttccattcac attgaaagtg aagttattgt 5701 agttaagctg gattgttttt cctcttccca cccttcaagc ttggaatcca cgaagactcc 5761 actaaccgcc gccgcctgtc tgagctgctg cgctatcata cctcccagtc tggagatgag 5821 atgacatctc tgtcagagta tgtttctcgc atgaaggaga cacagaagtc catctattac 5881 atcactggtg cgttgactct gattgaagcc tttttggagg agtggggagc acaattaggg 5941 cttcctggga actggcagta tgaggcattt tagtcactga gttcatttaa ttaccctaca 6001 ggtgagagca aagagcaggt ggccaactca gcttttgtgg agcgagtgcg gaaacggggc 6061 ttcgaggtgg tatatatgac cgagcccatt gacgagtact gtgtgcagca gctcaaggaa 6121 tttgatggga agagcctggt ctcagttacc aaggagggtc tggagctgcc tgaggatgag 6181 gaggagaaga agaagatgga agagagcaag gcaaagtttg agaacctctg caagctcatg 6241 aaagaaatct tagataagaa ggttgagaag gtaagccatt ctggggctag gatatatttt 6301 gtaacatctt cgaggtgggc tccctcacaa gcatgtttct atacaattag 
tggtttgagg 6361 cagcctattt actgtttcat gccttcttgc ctcttgttct cttctctagt caggtttaag 6421 gctattttaa taaaatttgg cacagattag gcattgcttc agttaacttc tgagagtaga 6481 taaaatacca tcattttctt ttttttttct ttttttgaga tggggtctcg ctctgtcacc 6541 caggctggag tgcagtggca cgatctctgc tcattgcaag ctccgcctcc tgggttcacg 6601 ccattctcct gccttagcct cctgagtagc tgccactaca ggcgcccgcc accacacccc 6661 ggctaatttt ttgtattttt agtagagatg gggtttcatc gcgttagcca ggatggtctc 6721 catctcctga ccttgtgatt cgcccacctc ggcctcccaa agtgctggga ttaacaggcg 6781 caagccacca tgcctggccg attttttttt tttttggact ggatctcgct cactgcaaac 6841 tcaagtctcc tgagtggctg ggattacaga tgtgtgctac cacacccggt taattttttg 6901 tagacagggt tttgccatgt tggccagcat ggtctcaaac tcaagtggtc tgtccacctc 6961 ctccccctgc tggaattagg cttgacaatg cctgttttct ctttcaaagt ggtaatgtca 7021 atctaaggct tttgtgatcg tccacaggtg acaatctcca atagacttgt gtcttcacct 7081 tgctgcattg tgaccagcac ctacggctgg acagccaata tggagcggat catgaaagcc 7141 caggcacttc gggacaactc caccatgggc tatatgatgg ccaaaaagca cctggagatc 7201 aaccctgacc accccattgt ggagacgctg cggcagaagg ctgaggccga caagaatgat 7261 aaggcagtta aggacctggt ggtgctgctg tttgaaaccg ccctgctatc ttctggcttt 7321 tcccttgagg atccccagac ccactccaac cgcatctatc gcatgatcaa gctaggtcta 7381 ggtaagtagc tttggtactt ggtgtggcaa ggagtttgtg caactcgtct cctctatgga 7441 tttgacttaa tgctatttgg tcaagtctca catggcttaa ttttacttca ggtattgatg 7501 aagatgaagt ggcagcagag gaacccaatg ctgcagttcc tgatgagatc ccccctctcg 7561 agggcgatga ggatgcgtct cgcatggaag aagtcgatta ggttaggagt tcatagttgg 7621 aaaacttgtg cccttgtata gtgtccccat gggctcccac tgcagcctcg agtgcccctg 7681 tcccacctgg ctccccctgc tggtgtctag tgtttttttc cctctcctgt ccttgtgttg 7741 aaggcagtaa actaagggtg tcaagcccca ttccctctct actcttgaca gcaggattgg 7801 atgttgtgta ttgtggttta ttttattttc ttcattttgt tctgaaatta aagtatgcaa 7861 aataaagaat atgccgtttt tatacagttc tgctttccct tgtgaagtgg atgttatcct 7921 tccctagctt cttcatccct ccagctcttg ctgttttcat gagcacagca agttgagctg 7981 gttttgtagt gaaaataaca gaataccagt gagtcttaag agttcacaca ctgaagctaa 
8041 aggcagtttg gaaaaactac catataataa tgccctttca gtcaaccaaa acacaggacc 8101 aagtccactg cagtaattta atttaataaa ataaaattat aagagcaaaa agttacattt 8161 ctaaagtacc aaaacctgca acaggctcat ggaacagagc ctagggatcc // LOCUS HUMI309 3709 bp DNA PRI 08-NOV-1994 DEFINITION Human secreted protein (I-309) gene, complete cds. ACCESSION M57506 NID g184505 KEYWORDS secreted protein. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3709) AUTHORS Miller,M.D., Wilson,S.D., Dorf,M.E., Seuanez,H.N., O'Brien,S.J. and Krangel,M.S. TITLE Sequence and chromosomal location of the I-309 gene. Relationship to genes encoding a family of inflammatory cytokines JOURNAL J. Immunol. 145 (8), 2737-2744 (1990) MEDLINE 91010756 FEATURES Location/Qualifiers source 1..3709 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood leukocyte" /map="17" exon 813..960 /gene="SCYA1" /note="G00-118-872" /number=1 /evidence=experimental gene join(813..960,2149..2260,3383..3664) /gene="SCYA1" mRNA join(813..960,2149..2260,3383..3664) /partial /gene="SCYA1" /note="G00-118-872" CDS join(885..960,2149..2260,3383..3485) /gene="SCYA1" /codon_start=1 /db_xref="GDB:G00-118-872" /product="secreted protein I-309" /db_xref="PID:g184506" /translation="MQIITTALVCLLLAGMWPEDVDSKSMQVPFSRCCFSFAEQEIPL RAILCYRNTSSICSNEGLIFKLKRGKEACALDTVGWVQRHRKMLRHCPSKRK" exon 2149..2260 /gene="SCYA1" /note="G00-118-872; putative" /number=2 exon 3383..3664 /gene="SCYA1" /note="G00-118-872; putative" /number=3 BASE COUNT 984 a 857 c 948 g 920 t ORIGIN Chromosome 17. 
1 tctagaaaaa aaaaaacaaa aagggaaaat tcccctggca ggactctcag atgctgctga 61 gtagctctca gtcctctctg taacccaaac ataacacatc tatctccgtg cttacactgg 121 gtggctttca cttgtttatc tgtgaattga agagaagttg cttgaggtca ggcagtgctc 181 ctcattggta actgccttct ctggggctaa ccaaggacct agaacagaat aagctattga 241 aaattgttga ggattgaaaa aaatagaaaa aatagaaatg gcaaatatct aggccagtca 301 ctggacatag agaatgttat ttaattctta tcgcacgtcc ttgagacatg tattgctatt 361 tgcattttgt gtgaatatgc atttgggtaa gtttatgtaa tccctcctct gcagaactgg 421 gattcaaatg caggtgtatc tctgttcagg tccagactct tctgccctga agcagtagta 481 cttggatgga atgacgtagg gttggacaag ccacacagag gccacttcct ctcacttact 541 tttctttgct tcccactcaa ccaggacagt tcccacgcac tttttcaaga ttcttatctg 601 ctcccacact tggggaagtt cccaatgcaa cctatcaatc catcaccacc acgaatacca 661 gccaggagag gtggggaaag gagtttacca catggtcgct gggtgtgagc aactgttccc 721 tgtccctatg gcttcccact tgtggctccc accatggcct ggagttttgg gtggagtttt 781 tcaaataaaa gccctcagca ttgcaggacg gcacagtggt gagctcttag cttcaccagg 841 ctcatcaaag ctgctccagg aaggcccaag ccagaccaga agacatgcag atcatcacca 901 cagccctggt gtgcttgctg ctagctggga tgtggccgga agatgtggac agcaagagca 961 gtgagtgtgg caggcatcat tttgcttctc tctggggagg gcagaaacgt ggtcagccac 1021 tctggggttg gagcaggctt ctccttgaac tcaccaactc tatctcccct cttcctacct 1081 aaagaggagg aatggtgaac ttggacaggc tggggtgagg gctagtagga gaaccatgag 1141 ttggggcaaa cacagagaac tgaactgaca gcttcagtac aaggagctct gcttcatcca 1201 gacccaagga agggaacctg tgaggttact cgggtaaagc tgggaggccc aaggtccagg 1261 ggacagcctg ggtgtagctt ctacagtgtg acagacacca agtagagtca gaaggcaaga 1321 ccgggctcta acaattggtc actcttgggc aagtcacttt agctctcaaa ctctactttc 1381 tctatcagtg aaatggagtt gatgatgtct gccctccaag actgtttgga gaataccaac 1441 ctagtaagag gcatgaaagg gggtgcaaac agaaaaacta ggaggaagaa gctgggattg 1501 gaatgcaggt ctcttgcggg atgtggtgtg ggaggagaat gcacaaatgg acagagtggg 1561 ggttgggggc tgggaaagag ctaaggacca gggcaggagg ggattcaaga gactgagtag 1621 ggcagctagc tagttcctgg gagctcttcc cttgtcatct catcagtttg gactcctcga 1681 acaattccta atcttcccca 
gatcaggtct gtgaactgtg gaccactgtg tcctgcatca 1741 gactaaccag gtccccaggg tgtggggtcc agagcccttg gacatgaata ctggggcaga 1801 accatgcaca tgtggtgaaa taccaaaact ggatgagcct ttagaaacca ggctccaaaa 1861 agttttattt tacagatggg aagtctgggg ccagggtgaa ggcacatctt cctcagggcc 1921 actcagctgg ggtgcgggga gctcagatct gaaccccaat cttctgactc tttacctagc 1981 cccagaacaa ggtggctgat gaggcagagc tatgccggca ccgtctggat gtggtcccca 2041 agccagggct tgtcctggga ggcgtttttt tgtttgtttt ttaaaaattg tgctacaggt 2101 gagaggttga gaaatggatg caaaccatcg tctgtgttcc tcttctagtg caggtaccct 2161 tctccagatg ttgcttctca tttgcggagc aagagattcc cctgagggca atcctgtgtt 2221 acagaaatac cagctccatc tgctccaatg agggcttaat gtaagtgatc acctgctcaa 2281 tctctcccta gagaacagaa ccccgccagc ctggaattac aagagtagac actagatgac 2341 agtattttac tggaataagg tttctaaacc cagagctgcc agcacctggg tgcaagccac 2401 acttgggcgc tagagggagc gctgagcttc ctagcaggtg tgaggaagga tgcatctgtg 2461 ctcctgcagt ggcttgtgtt tcctgaaact ccaaggtgcc aagtattgta tcccagcatt 2521 atgagctcag aggtttaaca aaagcatgag gggttattgt gcactggaaa gagcaaggga 2581 accaggatga gttcctgccc ctggatttgg aacccatagt cttgggtgac cgtggacagg 2641 taactccttt gtactgaatt gtctgtgtat ccttctgtat tccttatctg tgaagggtca 2701 taaacatagc tgcatcacag ggtctttaca aacttaattg gagtagcttt cacataccag 2761 tcagtatttt aaggcttttt catgtatatt ctctctgtcg atcctcttgg ggcacatatt 2821 tttgttatca tgaaaagtga ggttcaggaa ggtagagata tttgtctaag atcaaccaga 2881 tagtaagaga tagagttggt ctatagattg gacaatagtc cagtttagga agtagacaga 2941 tcagaagaga aaaatacaca cccacacaca cacacaaggc gcgcgcacac acacacacac 3001 aaacacatga gtccaacgca acaagtaagg ccccagatgg acacagaaac attaaagttg 3061 gtgcaaagag ctattccagg atgggaattt ctcatctcac ctaatcgtca gaatgttttc 3121 agctgttcag ccccaccctg atacaccaaa ttgaaaccag gagaggggtc caggaaattc 3181 aattcataag ctcctggtgc tttggctgtt ccccagcgtg caaaccacac atcctgtgca 3241 gcaacttcat ttacagaggg gagcccaagg cctagcaaga gcagtttagg ggacctggca 3301 gccggaggag gcggggcttg tgcgttgccc acccagtggt ggcttgggtg gctcagcctt 3361 ctctcttatt ctctgttcac agattcaagc 
tgaagagagg caaagaggcc tgcgccttgg 3421 acacagttgg atgggttcag aggcacagaa aaatgctgag gcactgcccg tcaaaaagaa 3481 aatgagcaga tttctttcca ttgtgggctc tggaaaccac atggcttcac ctgtccccga 3541 aactaccagc cctacaccat tccttctgcc ctgcttttgc taggtcacag aggatctgct 3601 tggtcttgat aagctatgtt gttgcacttt aaacatttaa attatacaat catcaacccc 3661 caaccctctg ggctcttgga tttcagagtg aaaacttgat ggcattgag // LOCUS HUMIBP3 10884 bp DNA PRI 08-NOV-1994 DEFINITION Human insulin-like growth factor-binding protein-3 gene, complete cds, clone HL1006d. ACCESSION M35878 J05537 J05538 M35879 M35880 M35881 M35882 M35883 M35884 M35885 M35886 M36121 M36122 NID g184522 KEYWORDS insulin-like growth factor binding protein 3. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10884) AUTHORS Cubbage,M.L., Suwanichkul,A. and Powell,D.R. TITLE Insulin-like growth factor binding protein-3. Organization of the human chromosomal gene and demonstration of promoter activity JOURNAL J. Biol. Chem. 265 (21), 12642-12649 (1990) MEDLINE 90324259 COMMENT Draft entry and computer-readable sequence for [J. Biol. Chem. (1990) In press] kindly submitted by D.R.Powell, 03-JUL-1990. The sequence presented here appears in Figures 2 and 3 of ref. [J. Biol. Chem. (1990) In press]. 
FEATURES Location/Qualifiers source 1..10884 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /map="Unassigned" promoter 102..1905 /gene="IGFBP3" /note="G00-126-724" protein_bind 1808..1821 /gene="IGFBP3" /note="G00-126-724" /bound_moiety="Sp1 and AP-2" TATA_signal 1876..1881 /gene="IGFBP3" /note="G00-126-724" exon 1906..2440 /partial /gene="IGFBP3" /note="G00-126-724" /number=1 sig_peptide 2038..2118 /gene="IGFBP3" /note="G00-126-724" gene join(2038..2440,5726..5952,6497..6616,8212..8337) /gene="IGFBP3" CDS join(2038..2440,5726..5952,6497..6616,8212..8337) /gene="IGFBP3" /note="The AAs encoded by bases 5728-5736 and 5917-5926 may be ASN-linked glycosylation sites.; insulin-like precursor" /codon_start=1 /db_xref="GDB:G00-126-724" /product="growth factor-binding protein-3" /db_xref="PID:g386791" /translation="MQRARPTLWAAALTLLVLLRGPPVARAGASSGGLGPVVRCEPCD ARALAQCAPPPAVCAELVREPGCGCCLTCALSEGQPCGIYTERCGSGLRCQPSPDEAR PLQALLDGRGLCVNASAVSRLRAYLLPAPPAPGNASESEEDRSAGSVESPSVSSTHRV SDPKFHPLHSKIIIIKKGHAKDSQRYKVDYESQSTDTQNFSSESKRETEYGPCRREME DTLNHLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCWCVDKYGQPLPGYT TKGKEDVHCYSMQSK" mat_peptide join(2119..2440,5726..5952,6497..6616,8212..8334) /gene="IGFBP3" /note="G00-126-724" /product="growth factor-binding protein-3" intron 2441..5725 /gene="IGFBP3" /note="G00-126-724" /number=1 exon 5726..5952 /gene="IGFBP3" /note="G00-126-724" /number=2 intron 5953..6496 /gene="IGFBP3" /note="IGFBP-3 intron B; G00-126-724" exon 6497..6616 /gene="IGFBP3" /note="insulin-like growth factor-binding protein-3 precursor; G00-126-724" /number=3 intron 6617..8211 /gene="IGFBP3" /note="IGFBP-3 intron C; G00-126-724" exon 8212..10775 /gene="IGFBP3" /note="insulin-like growth factor-binding protein-3 precursor; G00-126-724" /number=4 polyA_signal 10751..10756 /gene="IGFBP3" /note="G00-126-724" BASE COUNT 2796 a 2578 c 2737 g 2773 t ORIGIN 1 ctgcagacct gggacctcaa gaattgcatt tgatgccgaa cccagctcta atttcagagt 61 caaggtctct gcgagtattt aaggaacgga 
tgtaaacctg ggggattcgt tttgtttcct 121 tcaattttcc aatgaaatca gagatcctgt tcttgggtgt caacgcagat actagaagga 181 ggtgatacaa gagaaaggaa acagcaagcg acgattatgg cacggtttcc tgtaaacaag 241 gttgagtgta gccacagcct gagcactgtg ggagaagagc tcataagaaa atgacggtgc 301 tgggccttcg tcaccccggg gccctccatt gttcttgtct ttggtctctt tttatttgta 361 gaggtccaat tatttattta tttagtacaa gagggaacga aattgatctt tccattctaa 421 aaggagagta tatatgtata aaaggaagct gtatagatat gggggaagag gtggacaggg 481 ggaaaagggg agaggacgag agagagaaag ggagggagag ggacaaggag agacactggg 541 cgagagatcg attaggagag acagaaatga tgaatgaaga ttaacttcac ccaaggcttc 601 gtcgctggag gggaatggag gagctcctga tttgctatta ctactccaaa ctgcaaaggg 661 ctccttcaag tcacctatcc acctcctaag gcaagcgtcc aatttcaaca gcgttcagga 721 aagtctcctc ccgcggaggt ctcaccgctt cccactccac ccccacaaac tctttggaaa 781 agtgccttga aaaatttaat cctcaatcca atcctggacc accagcgtcc tctgttggtc 841 accgaaggag ggggtgcgca gacaaaactg aagaaactcg agtgccagag aaggccgaca 901 ggagttacag cgacctcagc gcgcaattgc gccccgaact ttactgaaaa gtgtttagat 961 tgcagagata agctagaatc ccaacgcatc gagaatacag taatacgaag tcgccttcaa 1021 aaaatgacaa tgaaaattgc ctattaaagg actatttggt taattacgtt tcagcagtgc 1081 ccagtttatt gtctttatta ttcttttgtc gtgggtgtaa actccatttg aaaacataat 1141 cagggagaat acccaagaca agaagaacag ttgtcattta aaatatttga aaagccctgc 1201 cttaaggagc attcgcttgc cggtccactc ttaattgggg acttgcggtg tagcaacacg 1261 tgagagtctt cttgcgttga gaagtaagcc tggaaaggcg aaggccccgg ggcatcttca 1321 gatgcgtatt tgtgggcccc tggggatata aacagcccag cgggtgtaaa ttaaaccccg 1381 cagtgccttg gctccctgag acccaaatgt aagtcagaaa tgtcccaaga cttcgcctgc 1441 caacggaatt aaattttaga aagctccacg aggtacacac gaatgcggag cgctgtatgc 1501 cagtttcccc gacaccggct cgccgcaggg agacctcacc ccgagagcgg aaggggtaag 1561 ggcggcgggg tcaaggagat cgggggtgct gagttggcca ggagtgactg gggtgaccgg 1621 gggtgctgag gtggcctgga gtgccggggt ggccgggcac accttggttc ttgtagacga 1681 caaggtgacg ggctccgggc gtgagcacga ggagcaggtg cccgggcgag tctcgagctg 1741 cacgcccccg agctcggccc cggctgctca gggcgaagca cgggccccgc 
agccgtgcct 1801 gcgccgaccc gcccccctcc caacccccac tcctgggcgc gcgttccggg gcgtgtcctg 1861 ggccaccccg gcttctatat acgggccggc gcgcccgggc cgcccagatg cgagcactgc 1921 ggctgggcgc tgaggatcag ccgcttcctg cctggattcc acagcttcgc gccgtgtact 1981 gtcgccccat ccctgcgcgc ccagcctgcc aagcagcgtg ccccggttgc aggcgtcatg 2041 cagcgggcgc gacccacgct ctgggccgct gcgctgactc tgctggtgct gctccgcggg 2101 ccgccggtgg cgcgggctgg cgcgagctcg gggggcttgg gtcccgtggt gcgctgcgag 2161 ccgtgcgacg cgcgtgcact ggcccagtgc gcgcctccgc ccgccgtgtg cgcggagctg 2221 gtgcgcgagc cgggctgcgg ctgctgcctg acgtgcgcac tgagcgaggg ccagccgtgc 2281 ggcatctaca ccgagcgctg tggctccggc cttcgctgcc agccgtcgcc cgacgaggcg 2341 cgaccgctgc aggcgctgct ggacggccgc gggctctgcg tcaacgctag tgccgtcagc 2401 cgcctgcgcg cctacctgct gccagcgccg ccagctccag gtgagccgcc cgccaggtgc 2461 gctgcgtgca gcaccgccac tggcgccgaa gggcctgggg gttgctgggt gccgctgcgg 2521 gagactccgc ttttcttctc actggagata atatgtgggg aaactgaagg cgctccggga 2581 aaggtgaagg cggtcgccga gggaccctcc ccagccggcc ctctacttgc tcgattctct 2641 aagtgcagag tacttgtaaa ttgcaaagcg ctttcagtga aaatgggtaa aggtttccgg 2701 agctgagggg agcggtaccg atgtttagct gttggaaaga tcctggacac aggagattct 2761 cctcgccccg cacgggtgca cacggactgc aatcccaggg atgcttgggg atggggggat 2821 ataggcggat ttggaccaag gaaggtgggt aggcacgttg taggaaatag tacctctctt 2881 ttaaaatact gactttgcac agccttttgg tttgcaaagc aatgtctagt cccggtatgt 2941 ccaaaaacaa gtaaagtgga ttcgggtttt gatatcttct gcggttggaa aacctgaagc 3001 tgaaaaagaa gtaacttctt aaggttaccc agcggccaca acagagtgta ggtttgaact 3061 ccgcgtgcca ctttcagtac cataccattc ttacaactcg ggccacccct gcacctgcgc 3121 cgacctcaaa caaacttcca ggtgcgtggt gggtgcgggc aatgtggact aagtcaattt 3181 caatgacacg gcaagggaat tggaatcagt cctaggctgt ctcccttctt aatctgaaat 3241 gggggggggg aatgagatgt tgttaagggg agccccagaa gaggaaaaat gcaaacattt 3301 ggcagagtta ccctcttgct tagccactat cagtatcagg cagacagcga ctctggtaag 3361 ggcatcacat tgttccctta aaaaaaggag cgggggttgt ttaaatggat ttggcagctg 3421 ttctttcaag cattcttagc cagcctcacc tagttatatg agaaataaag ttcctgcctt 
3481 gcacagctga aggctgggag aattctcccc atcctaattc ccccaactcc ccaacgatca 3541 cgttggacag atgtcactgg gcaggccccc atctagggct agcaggatga acagtccctt 3601 tataatttat gtagctgtag agttccacgc ccgggtgaag ttattttctg gctcggcaag 3661 gctggctctg ttcacccctg agaaatgctg gattcatgga aaggcaagat gcctgaaaca 3721 tacactggct ctggtcagct gttaaagctg ctggaggcat ttgtctctcg gggcaaagtt 3781 atgtcatttg ccaagtgtcg tacattattg tgcattttgg ggtattcaaa aagtgatctt 3841 agaaatactg atacacatcg tcattcttgg gctttagcaa tcatcatgat taccacctta 3901 gtagcactgt agtataggtt gatgtgagtt ataagattat aaaaagatct aagtgacttc 3961 tagaatctat ttgacaaaaa aaggtaaatt ttcgacagtc aaaagtcaca attatctgtt 4021 gcttaaatag aactgttttg tcttcatgcc ctagtctgca gcccaggcat taagaagaaa 4081 ccaaggaaat ttaagaaatt actcaaggtt cttagaaaag aagtataaat acgtttattt 4141 acatgttctt agagtattta cattcttagt atctctttta tctcagtatt tccttgaaaa 4201 agaaagcaag ctaagattaa aagaaattga aaccaaatcc tcgcaggtag ggacctcctc 4261 tgtgaggctc tgtgctggac cctgggaatg tgtgcttccc aaggtatgaa accccttggg 4321 gaactttaca gcaggacctc agtgagctgt ttggcaggtg aggaaactaa gacccagaga 4381 ggagagggac tttcctaagg ccctggtgag tgacctgcca gtagccactt ccaggggaga 4441 gcagagcatc tgcagccaaa tcattgcagc cccaggtagc tttctagata gactgtggac 4501 cagatgggcc acctgagctc cctgctaggg ttacacatta tagccctgtt tgtgtagtag 4561 agaaatttca tgactctcaa ttgtggactt aagccgatgc ctccagacct tggcatggtc 4621 cacaggccct gggagcatgg gctctgaatg tagcctttga tccccatagc ggtcttacag 4681 cccctccaag ttcattctga agaaggaatg gagtgagaat cctggctgca gatccagtct 4741 tgaatttagt catatactta aaattccaat tcaactgtta acattccagc atccatttta 4801 agcatcagac tttcttcatt tagcactttt tattataaaa gggagatctg ctggaggggg 4861 atttctccta ccccaccccc acccagggaa ggaaaagctc tttggcactt agaagtctga 4921 gccgtgagtg ggactttggc attgtctgca tccatgtgct gctgtgttca cccggggtga 4981 aaaggactca cttaggcagg caccagcaag atgcacaggg tctgtgtaga ccttgagttt 5041 tagagatgta acggggacct agaaaacaag ccaccaacat gcttgcatga ttctgagccc 5101 ctgaggcaaa acgctttgca ggtaataatt cagttttccc atctgagctg gacaccaagc 5161 
tcttataagc gtgtttacct ggtagcattg aggacggtac tggtcaacct tggaattccc 5221 ataagggctt gttacaactc agactcgtgc cgccactcca gcgtttccgg agtggagaat 5281 gtgcatttct tccaagtccc cgggctgccg ctgctcccgc gggtgggagg accacacttg 5341 gagttgactg caaaatttct gagccggcgc tgcagcagcc tcccgtggct caggtctgcc 5401 ccctgccggt ggaagatgaa gcatactgcc ttcacctact gaggggcact gaagcgtttg 5461 tctgccttct ttagttgcag ctacttagga agagcacctg tcagattgac tttcaaacag 5521 ataacttctt gaggtagagc aaccaccatg tagtgagtag tatgatggaa taatacttca 5581 tcgaggtatt taaaaaaaaa acctcacttg gattgccaac taatattgtc atttacatgt 5641 gacctggttg caacgttaag atttttacaa gactgtgata gatattgatg actctcatgt 5701 gtttgtctct cttgggcgtt ttaaggaaat gctagtgagt cggaggaaga ccgcagcgcc 5761 ggcagtgtgg agagcccgtc cgtctccagc acgcaccggg tgtctgatcc caagttccac 5821 cccctccatt caaagataat catcatcaag aaagggcatg ctaaagacag ccagcgctac 5881 aaagttgact acgagtctca gagcacagat acccagaact tctcctccga gtccaagcgg 5941 gagacagaat atgtgagagc ttttcctctt gttaaaggag gagggcaaga cctgccaagc 6001 ctgggtactc agagcctctt gagggcaatt cttactcaac aaaccccagc gcctggctga 6061 tgggtgggca acccctagcc cctctgtgcc ctacctctct cctctcctta cataaagaat 6121 attgaccctt ttggagaatc ttatgaggat caagctgaaa taacactctt aaaagcatat 6181 gggatgtcat aaagacctct gcagataatg aaaatattct cataaagata gttttattta 6241 cttcatcctc tatgcttgtt gacctgctat tggttccatg ccagcttctg tgccttactc 6301 tgggaagagc aaaaaggaga cagggagtga tggttagctt attcggggga ctttcgtgct 6361 acatcagaca taaggtatct gaggagcaaa ttacaggtcc cacttttggt agttgtgcag 6421 catcgtaaga tttttaaagc acacattcta gagtaaaaac tgtgactctg ttgctctggt 6481 ccttcctgat ccccagggtc cctgccgtag agaaatggaa gacacactga atcacctgaa 6541 gttcctcaat gtgctgagtc ccaggggtgt acacattccc aactgtgaca agaagggatt 6601 ttataagaaa aagcaggtga gtgaggtcct cagtgtgttt tcttcctctt ctgttgacac 6661 agaggagaaa cccatgtcac cagcgcccag gctcttgtgg ccatagctct aactctgagc 6721 ctgtgcagca ccagtgccca ggacttggtg ccagtctcag gaggtcagac caagggctgc 6781 tttgacttgt tgctctgagt gctgctatat tggccataat cctcaaccct agtgcctttc 6841 caccacccgc 
ttcccactcc tgtcctttca atggttcacc cacaggcgga caagatgctg 6901 cccagtggca ccctttataa actgcaagtg gacatgttaa cacatttgtt aatgctgcgt 6961 cagggagtga catttcaaac aactattata gtcagtttcc aagaagtgtg acatgaggtc 7021 ataccacaaa aaagcttacc ctgaaatccc acaatcgtcc cctttcctac tgatgccttc 7081 ccgatagtga gcaggttgca atattaagat tttgaaaagg ctgttgctag atgttggtga 7141 ctcgtgtgtc tctgtctccc ttgggctttt caaggaaatg ctagtgagtg gggggatgac 7201 tgcagcatgg ccagcttgga gagcccagcc atccccagca cataccaggt gtctgtcttg 7261 gcgtggaggg gatggaactt gaaatcagac actcggtcca tgctggggat ggccagtctc 7321 tccaaactgg catgtggtct tcctccgagt cactggcatt tccctagaaa gtccaagtga 7381 gaagaaggca tgagagtcat caacatcaaa caacagtctt ttcaaaatct ttatattgca 7441 acatagtccc attcctggaa aaggaatgga gtgagaatcc tggctacaca tcagccccaa 7501 atgtagtcat tgcctaaaat cccaattaac ctgaaaatga tcaaacaaat ttaagatata 7561 gtaatattaa gctgtaataa atatgcttct ataggctttg tgttatgtga tggcactatt 7621 tcaattggct ttctaattgg acaattgata ctatgctatc tacagaattg gcctttggag 7681 acctaagtga gccacagtgg cctcagggtg accatatact aggattcata gcagtggcca 7741 cagtcagaag cctaagcttt cctccattgc cattgctcgt ttataccacg tttctgtcaa 7801 agtcatattc attcaacaaa gtcatactga gaaggtgtca tgtgaggctg gatgtgggct 7861 ccaaagtcat agctgtgaca ttcgcaggca gcgggatgtt ctcagttcca catttggcag 7921 agaagtcagt caagaggttc tacaagggct ggtgtccacc ttatactcct agaaacacaa 7981 aactgccccc acccccgctt tcttggagca ggaagttaca cccacacgca tgcacaggcg 8041 cacactcagc gggcctaggc agcgtggctc ttgtgttgcc ttagctgaaa tttctgttgt 8101 gctttctcag catagcagag tcacgctggc aaaccatcat gcgccctggc caccgacctg 8161 acaccagacc caggagcatt cacttctctg tcttctgttt ctctcccaca gtgtcgccct 8221 tccaaaggca ggaagcgggg cttctgctgg tgtgtggata agtatgggca gcctctccca 8281 ggctacacca ccaaggggaa ggaggacgtg cactgctaca gcatgcagag caagtagacg 8341 cctgccgcaa gggtgagtac tcaggagggg cagcctgggc tccagggcct cactgtcctt 8401 ggaccagcct caggggctgg gcgtggccac tggccttccc caggcttaca gacccaggag 8461 ctgcagctca gggccagaaa gagcaaagca aataggacag agccctcaga agggtgcagg 8521 gagagggaga ccccatcaac 
ccaaccaaac aagtgtgggg aaggaggccg gccagtgcac 8581 ctcagggaca ctctgcttta tctcagatac ctcacagcac ctaagctatc attcatccac 8641 acacaaagtg aagattttca aagttaggct ttacccgtga gtctggaggt catttatctt 8701 cacagagaac gtttatcgca gactgctaag atacatgttc taattaagat gtgatgtgag 8761 aacgctgaat gctcgttgga gactcagttg aagtgcagct ttttttctgt caaatatata 8821 atgaatattc tgttagtctg tggctaatat aattttaata aagttaattt aaatctgata 8881 gaaaaatgaa attttaaacg ataattttag agaatgctat tatatccagt cttctttttt 8941 cttttaataa atgagggaac tattggggga aaggaataaa tacattttct ttcattttat 9001 taagacaaat ttagtaagca gaagaaattt gcatgtttag ttataagggt ttcttttttc 9061 cttacaagtt ggaaaaaata attctaattt aagggtaact ctttgacaat gaacactgtg 9121 agcagcatct ggtactcgtt gctttgtttg aaaacatgag ttgagacccc agccgcactt 9181 gcagcctagt gccattagcc tgcaggctgt gctggatatc tcagggcaag agtcgagccc 9241 ttttgatttt ggggggatta tttcaatata tttgcttttt ctttttgttt tagttaatgt 9301 ggagctcaaa tatgccttat tttgcacaaa agactgccaa ggacatgacc agcagctggc 9361 tacagcctcg atttatattt ctgtttgtgg tgaactgatt ttttttaaac caaagtttag 9421 aaagaggttt ttgaaatgcc tatggtttct ttgaatggta aacttgagca tcttttcact 9481 ttccagtagt cagcaaagag cagtttgaat tttcttgtcg cttcctatca aaatattcag 9541 agactcgagc acagcaccca gacttcatgc gcccgtggaa tgctcaccac atgttggtcg 9601 aagcggccga ccactgactt tgtgacttag gcggctgtgt tgcctatgta gagaacacgc 9661 ttcaccccca ctccccgtac agtgcgcaca ggctttatcg agaataggaa aacctttaaa 9721 ccccggtcat ccggacatcc caacgcatgc tcctggagct cacagccttc tgtggtgtca 9781 tttctgaaac aagggcgtgg atccctcaac caagaagaat gtttatgtct tcaagtgacc 9841 tgtactgctt ggggactatt ggagaaaata aggtggagtc ctacttgttt aaaaaatatg 9901 tatctaagaa tgttctaggg cactctggga acctataaag gcaggtattt cgggccctcc 9961 tcttcaggaa tcttcctgaa gacatggccc agtcgaaggc ccaggatggc ttttgctgcg 10021 gccccgtggg gtaggaggga cagagagacg ggagagtcag cctccacatt cagaggcatc 10081 acaagtaatg gcacaattct tcggatgact gcagaaaata gtgttttgta gttcaacaac 10141 tcaagacgaa gcttatttct gaggataagc tctttaaagg caaagcttta ttttcatctc 10201 tcatcttttg tcctccttag 
cacaatgtaa aaaagaatag taatatcaga acaggaagga 10261 ggaatggctt gctggggagc ccatccagga cactgggagc acatagagat tcacccatgt 10321 ttgttgaact tagagtcatt ctcatgcttt tctttataat tcacacatat atgcagagaa 10381 gatatgttct tgttaacatt gtatacaaca tagccccaaa tatagtaaga tctatactag 10441 ataatcctag atgaaatgtt agagatgcta tatgatacaa ctgtggccat gactgaggaa 10501 aggagctcac gcccagagac tgggctgctc tcccggaggc caaacccaag aaggtctggc 10561 aaagtcaggc tcagggagac tctgccctgc tgcagacctc ggtgtggaca cacgctgcat 10621 agagctctcc ttgaaaacag aggggtctca agacattctg cctacctatt agcttttctt 10681 tattttttta actttttggg gggaaaagta tttttgagaa gtttgtcttg caatgtattt 10741 ataaatagta aataaagttt ttaccattaa aaaaatatct ttccctttgt tattgaccat 10801 ctctgggctt tgtatcacta attattttat tttattatat aataattatt ttattaaaat 10861 gttccctgct ttccctttta gcaa // LOCUS HUMIDS 36845 bp DNA PRI 15-AUG-1994 DEFINITION Homo sapiens iduronate sulphate sulphatase (IDS) gene, complete cds. ACCESSION L35485 NID g530140 KEYWORDS iduronate sulphate sulphatase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36845) AUTHORS Lu,F., Lu,J., Clingan,R.L., Wentland,M.A., Muzny,D.M., Gu,Y., Nelson,D.L. and Gibbs,R.A. 
TITLE Complete DNA sequence of the human iduronate sulphate sulphatase (ids) locus JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..36845 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq27.3-q28" gene join(1057..1159,1901..2037,2704..2881,5155..5243, 7885..8085,9676..9846,15752..15878,19094..19267, 22974..23446) /gene="IDS" exon <1057..1159 /gene="IDS" /number=1 CDS join(1057..1159,1901..2037,2704..2881,5155..5243, 7885..8085,9676..9846,15752..15878,19094..19267, 22974..23446) /gene="IDS" /codon_start=1 /product="iduronate sulphate sulphatase" /db_xref="PID:g530141" /translation="MPPPRTGRGLLWLGLVLSSVCVALGSETQANSTTDALNVLLIIV DDLRPSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRL YDFNSYWRVHAGNFSTIPQYFKENGYVTMSVGKVFHPGISSNHTDDSPYSWSFPPYHP SSEKYENTKTCRGPDGELHANLLCPVDVLDVPEGTLPDKQSTEQAIQLLEKMKTSASP FFLAVGYHKPHIPFRYPKEFQKLYPLENITLAPDPEVPDGLPPVAYNPWMDIRQREDV QALNISVPYGPIPVDFQRKIRQSYFASVSYLDTQVGRLLSALDDLQLANSTIIAFTSD HGWALGEHGEWAKYSNFDVATHVPLIFYVPGRTASLPEAGEKLFPYLDPFDSASQLME PGRQSMDLVELVSLFPTLAGLAGLQVPPRCPVPSFHVELCREGKNLLKHFRFRDLEED PYLPGNPRELIAYSQYPRPSDIPQWNSDKPSLKDIKIMGYSIRTIDYRYTVWVGFNPD EFLANFSDIHAGELYFVDSDPLQDHNMYNDSQGGDLFQLLMP" intron 1160..1900 /gene="IDS" /number=1 exon 1901..2037 /gene="IDS" /number=2 intron 2038..2703 /gene="IDS" /number=2 exon 2704..2881 /gene="IDS" /number=3 intron 2882..5154 /gene="IDS" /number=3 exon 5155..5243 /gene="IDS" /number=4 intron 5244..7884 /gene="IDS" /number=4 repeat_region 6827..7114 /rpt_family="'Alu'" repeat_region 7117..7647 /rpt_family="'Line1'" exon 7885..8085 /gene="IDS" /number=5 intron 8086..9675 /gene="IDS" /number=5 exon 9676..9846 /gene="IDS" /number=6 intron 9847..15751 /gene="IDS" /number=6 exon 15752..15878 /gene="IDS" /number=7 intron 15879..19093 /gene="IDS" /number=7 exon 19094..19267 /gene="IDS" /number=8 intron 19268..22972 /gene="IDS" /number=8 repeat_region 21573..21864 /rpt_family="'Alu'" repeat_region 21867..22051 /rpt_family="'Line1'" repeat_region 22098..22267 
/rpt_family="'Line1'" exon 22974..>23446 /gene="IDS" /number=9 repeat_region 25413..25686 /rpt_family="'Alu'" repeat_region 26878..27162 /rpt_family="'Alu'" repeat_region 29858..30098 /rpt_family="'Alu'" repeat_region 30767..31068 /rpt_family="'Alu'" repeat_region 33970..34090 /rpt_family="'Alu'" BASE COUNT 9117 a 8696 c 8303 g 10729 t ORIGIN 1 tgagcagtgc accacagcac catcacagct gtatttgtac cttcccattg tccattgttg 61 tttagtcata attaagtcat ctgggctttg ttatagattg attctcaaag ttgaaaatca 121 aactgtgttt gcagtattag gacggtataa tagtattcag tacagtacag tgtagggcta 181 ggattccatc tccccacaat tgctgctctc tctgtttggc gcacccttct tgcagacctc 241 aatgcggctt gctccctcac ttgcttcaaa attttgttca aatgccactt cactagtgag 301 gccctgcctg gccaccccat agaaaataac aagcatgccc cgcacccacg ctttattttt 361 ctccagaccg ggtatgccca ccgacactct ctatattttt ttattgtttt aagctccacc 421 aaagcaggga catagagccc cagtgccacc aacagcgcct ggtatacata tttgcgatac 481 ttaataagaa ttagttgaag aaatgaatag gaaagggagt ccctttagat ggaggaaggg 541 actttctcaa aaggcctcat ggaaggagaa tggcattgag gctggcttac aagaagtgtg 601 tcttttataa aaagaaggaa gggcagggct tccaggtgaa aggcatagca caagcaaagg 661 caaaaagacg ggtaactgct agttgttaag ggcagtggct gaatcgctat acccctagca 721 gttctacacc taggggcctg cgggccgctg ggtctgcgaa gcttgcaagg caactggccc 781 gcccccgctc tgcgcctgtc tctcggccac gcctattgct gcaggatgac gcgcacctct 841 agaacccgcc ccggagggag ggacgcaggg aagagtcgca cggacgcact cgcgctgcgg 901 ccagcgcccg ggcctgcggg cccgggcggc ggctgtgttg cgcagtcttc atgggttccc 961 gacgaggagg tctctgtggc tgcggcggcg gctgctaact gcgccacctg ctgcagcctg 1021 tccccgccgc tctgaagcgg ccgcgtcgaa gccgaaatgc cgccaccccg gaccggccga 1081 ggccttctct ggctgggtct ggttctgagc tccgtctgcg tcgccctcgg atccgaaacg 1141 caggccaact cgaccacagg tgccgcccac gccctccctg ccatctcttc tcccttcctc 1201 cctcccttcc ttcctccttc cttctttcct tccttctttg tttatatcca ttctttttac 1261 ccttcctctc tcctaccatt ccttctttca tccatcattc gttccctccc tccatttttc 1321 actccttcct accgtccctt catccctccc ttccctcttt ccatctattc atccatccat 1381 ctcctatccc tccgtgcttc ccttcctctt 
tccccccatc tttcccatct ctatccatcc 1441 atctttcccc ctcctttcct ccctccattg gtccatccgt cccttttacc ttctatccat 1501 tcatatttct ctctcttctc ccctccttct ctccatcctt tcttcccttc agctattcat 1561 cttttcctcc ttatcttcct ccatccaacc atctgtcctt tcctgcatac atcattctct 1621 ttttttccta aattcatctt tcctttccac catctttcac tcactatctc gcttcctcac 1681 ccaggttgga ggccatgacc aaagcctaac cctgccaccc aggactcagg cttcctcctc 1741 gagccccact cccacccttg ctgaggcaca gcgccctccc tggctaggct gttaaggtgc 1801 agggtccagc cttgggcctc ttagtaacct agcacctacc atgagggagg gttcagtgtc 1861 agtgcaggtt acctcaccaa agcccctccc tcctgtgtag atgctctgaa cgttcttctc 1921 atcatcgtgg atgacctgcg cccctccctg ggctgttatg gggataagct ggtgaggtcc 1981 ccaaatattg accaactggc atcccacagc ctcctcttcc agaatgcctt tgcgcaggta 2041 tgtctgggaa cctctagctg tgggtgtgtg ctgcttcgtg cactgagggt tgggggcggg 2101 gagcttcagc tattgtcaga tggcacagat tgtgcgggac atcttgttag agggaagcat 2161 agtctggaaa atggtagtgg agaaaatctg gttttccact catgggaaag cttgaccttc 2221 aagggtggcc ttctccttgg gtgcaaggtg cgccctggcc ctgatgtgtt catggcagcc 2281 ctggccagct ttctcccaga acgggcgctt gcctggtgct cagaaggaag gggtgctgca 2341 gaagggcacc cacaggtctg caggctgggc ctcaggtgag agctagccag ggtggccctg 2401 ctccctgaac cccctcatgg ctctgccccc agcaacactg gccgctctgt gtgcctgggg 2461 aggccattgc gtaggaggag ataagcatct gctggtttca ggggctgatt gcttcccttt 2521 tctctcccgg aagcagtctg ttgcctaggt gacacccaaa tcccagtgtc tggtttgagc 2581 tctgcatgac tcagctacca agctgtttgc taggagcctc gggaggcggt tgcttggtta 2641 cctaagagat ggcagacatg ttttgctgtg gcgatgctta cctctgcttc tgctccctaa 2701 cagcaagcag tgtgcgcccc gagccgcgtt tctttcctca ctggcaggag acctgacacc 2761 acccgcctgt acgacttcaa ctcctactgg agggtgcacg ctggaaactt ctccaccatc 2821 ccccagtact tcaaggagaa tggctatgtg accatgtcgg tgggaaaagt ctttcaccct 2881 ggtactgctc catgtccaga gtctgggttc tcttggtttg tggtgtctga atccagcatt 2941 cccatcctgg ggatggggct gtctttgcag agccctcttc tggctgggcg agtccctcgc 3001 tagtcagtgc ttcctttcta aaaaactgac ttgtcaaccc agccacgttt tcacccaaag 3061 tgaaaaaggg tagaaagagc tttgcttcct ttcagaaacc 
actgagggtg tgctgttggg 3121 tctttcagct cctcgggtgg tagggaggac acaggctggg gaggtggcag tgttgggtgg 3181 agtccagctc agggccccac cctctcccct gcagggtacc tgtcagtaaa cctaggggtg 3241 gggtgaggac acctgagggc ctcccgtgtg gccagcattg ctgttgctga cttattgccc 3301 aatgaggggg ctgtgcttag gtaggggctg ctatccactc tggaaattaa gtcagaaaga 3361 tgagtgtaat ctggtaccca ggatctatag ggtcccagag acggacattc cttacctcaa 3421 atgctgcctg ataaactggc tctctttaat gccaatatgg ggatagtgaa aagcaaacaa 3481 actttaaaac tttttattta tagaagtaat gcatgagtat ttgagaaaaa aaatggagaa 3541 aaaagtattg gaagacaaga gcaccatagg gacaataatg cttatcattt gggaatgacg 3601 aatgttttca gtctctttcc ctttgctcag tggttacacc ggcaggcttg gcagccttct 3661 gcaggagctc agctggccag cccaggttga gagtgacatt cagctctact aagtcaggcc 3721 agtttcagag gtgaacctgt caggaatcat attgcacagc caaaagtcca gtgtcaggag 3781 aactttctgc agtttggtgt tctcattacc ataccccttg cccaaatgca gctggaggga 3841 agtgatgatt ttggagagag gaggaaacac agcagaggga ggagattgtg ctttgggagg 3901 ggaaaccagg caaggactga gtgatgttgc ttgacctcct ccctcctgat gattggattt 3961 tggcaccagg ctctaagcac catgtccttc tcagagaata aacctttccc tgcagccctt 4021 gtgggaacac atttttcagc tgaatctttg ttctcaagac ctttttctgt tagcattggg 4081 cctgcttcaa gatgacagta gccaccactg tgtgccaggt atgtaccgtt taccatgctg 4141 gccactcctt taatcctccc aacagcaagc aaggtgggtt tttatcatca tcatcatttc 4201 ccaaggaaga catgaaggct catgggagtc aagtagcttg ccagaggttg caaggatgct 4261 aagaggcagc atcaggattt ggacctgtgg ttgttggctt cctgtgtgct ctttccacag 4321 tgcttgctgt cttcataggc cctccacccg ctgccctctt tctctccatt cctgtctctt 4381 ctggtactcc ctatgggagt cccactgtgt gactgctgct gatgctttct cacgaggttc 4441 tgcctgtgat cccctccact cccagctaac cactataggc agcacttccc ttcagatccc 4501 tcacaggcct cactggctcc tagaagtcct ccctgccagt cctcacctga ctcacaataa 4561 cctctgtgct tggaggatcc cccacttggc acttcgatgc ttttgaaagc tggaagtcct 4621 tccttggttt ccagtgctgt tatactgtct tcacaaataa actataaact actaagctta 4681 agggtgggtt tctgcttcat gaatgagcac ttgaacttgg aattctaaaa caaggttttg 4741 agtcctggct cccttatcta atatctggtt agcataggta agtcacatac 
cttccctgag 4801 cctccctctc ctgcccccat cggctccctg atggtgctga tgcctctgct gcctccttca 4861 tagggtgaga gataggtttc agcatgggct ctaaggagct gactgatctt gtccttgtgg 4921 gtcccttcca cagagcctag cacaaatcat cagggcttag ggaccaggaa gtcagatgct 4981 tagttctcaa atagaaacta gagagtttgg ctttagaggg gacttttgta gatgaggaaa 5041 ctgagcccca aagaagggag gttccacttg cccatttgtt tacagagttt taattatggg 5101 gagtggggtg ttgaaagact catcatgttt taacaacctt tttttttttc caagggatat 5161 cttctaacca taccgatgat tctccgtata gctggtcttt tccaccttat catccttcct 5221 ctgagaagta tgaaaacact aaggtaaggc tgtgaaaggg acatttctga agaggaacca 5281 ctttttcctt tgtcacataa actactgggt atactgcatg ttctgtgaag ctggttatat 5341 accacgaagt tgtgggtttc atttgtgata atgttttgac agaagtaagt gttgggatct 5401 tcagcattag gcccgacagg agcagtggct tcattctaga agctggtggg tgctacttgt 5461 tctaaaacct tgggctctaa cgtgtcagac tcattgaata gctccccagt gctcacgctg 5521 gatgagactg agagttgtag gagatgctga gacatgggag ggagaaaaga ggctgagctc 5581 caggagcctc agcccagaat gggagaaagg catgccgtgt atctgtctct gacctaatgt 5641 gtatgttcag cagctgagac ctctgatagg ggttttgaat ttttagagca ttatgaaaaa 5701 aagtaaccta caaagaaaat cttggtttag aggaatccca aatgtataaa tgaaactaaa 5761 acccacattt aagaaactca gaaaaccagc tggggaactg agacatgaaa tgtatctctg 5821 aggttttagt aaacatcgtg ttgggaaggg tgaggatttg gaagtgaggt ggaaggtatt 5881 gttctaagtt cttgacagct ttccatttca gaatacagtg aaaactgtga gctctggcct 5941 ttggcttgtg gcttgaatag ggaagatccc atcttaattc cctgaagcca tcttgcttct 6001 ttttagtggc ctcacttacg aatgccaggt ccctgcccta acttccccct acccatttct 6061 tctcaaagtg gtgggtctga caatactaaa tctccctcga gtgctgcaaa tcgtgtatat 6121 atagatgatg aacaagtggt ctgaggcggg tgctgggctg tcctgcccaa gggtggggtg 6181 agcaaggcag ggctgtgaaa aggcaacact ctttgagatg gaacagcagc tgacacagcc 6241 cctcggtctt tgtggtatac aaatcatggt cagaactaac tttggtctca cggccagcat 6301 gtctacttta ggaagcaaag caggaggttt cattctgtgc ctagtgtagc tgaagttctg 6361 gaaattccat ggactgtcac cttatcaagt tgattgggac cctgtcactt acaagcctta 6421 gcctgtcttt gaaatttgaa atgggttttt tttttctttt taaattttta attaaaatag 
6481 agttcatata ccataaaact ctcccttttg aagtatataa tttaatggtt tttagcatac 6541 tcacggagct atgcagccat cacccaaatc gatcttagat catttttatc gctctcaaaa 6601 gaaaccctgt acccattacc agtcgttcct cattttgtct cagcacccag cgtgggacaa 6661 caactgaact atttttggtc tccatggatt tgcctatttt gtccatttgg tataaataga 6721 gtcatacact atgtagccat ttgtttctgg cttctttcac ttagaataat gtttttgagg 6781 ttcagccaca taacagtata aattggtact tcattccctt tttttttttt ttttgagatg 6841 gagtttcact cttgtcaccc aggctggagt gcaatggcgc aatctcagct cactgtagcc 6901 tccacctccc aggttcaagc gatgattctc ctgcctcagc cttccgagta gctgggatta 6961 caggtgccca ccaccacacc cagctaatgt ttgtattttt agtagagacg gggtttcacc 7021 atgttggtca ggctggtctc gaactcctta cctcaggcaa tccactcacc tcccagagtg 7081 ctgggattag aggtgtgagc caccgcaccc agcctttcat tcctttttat ggctgcataa 7141 ttgtctattg taccacattt tgtttatcag ttcatcactt gatggatatt tgggttgttt 7201 ctacttttga ctattaggaa taatgctgcc gtcgacattt ttgtacaagt gtttttatgg 7261 gcatatgttt ttaatttttt tgggtataat aactatgttt aactgtttga ggaattgcca 7321 gactggcttt caaagtggct gcactatttt acattcccac cagcaatgtg agagggttct 7381 aatttttctg catcctcgtc aacacttgtt attttccatc ttaaaaaatt ataaccatcc 7441 ttttgggtgt gaagtggttt tgatttgcat ttccctaatg actaatgatg ttagccacat 7501 ttccatgttc aattggccat ttgtatatca tctttggaga aatgtctatc caaatccttt 7561 gcccattttt aattggcatt ttttaattgt tgaattagag atggattttt ttttataatt 7621 agtctatcgg ggtttctttt aattagttat gatctggaga aagagtatta gtagcaagaa 7681 accaagtgct agtggatttc tggcccctgc ctggaaaaca agaaacacct tttctgtctt 7741 agttctactt ctgatgtctc tgttcagtct gagtgactaa cacgtgaagg gctgattatg 7801 tgaacattaa atctgtgtgt gtagccttca tggcttcatt tcttgcactt aaaaagctga 7861 tgttatatta ttttgttttg aaagacatgt cgagggccag atggagaact ccatgccaac 7921 ctgctttgcc ctgtggatgt gctggatgtt cccgagggca ccttgcctga caaacagagc 7981 actgagcaag ccatacagtt gttggaaaag atgaaaacgt cagccagtcc tttcttcctg 8041 gccgttgggt atcataagcc acacatcccc ttcagatacc ccaaggtgaa gagctggttg 8101 agggctgatc cagcacagct gtgacagctg tgttgtttgt tgagggaggg atttgcacag 8161 
ggaaggtggc tacatcctgc catcgccagg caccatggtt gcctgatggg cactagtgtc 8221 ctcagtggag taaagatggg atttagaggt caaggccaag aacatgtaag aatcttgtaa 8281 gaaatgcttg gctttccgct tcactccact ggagggtttg atttcctttc cttgaactta 8341 ttgagcagat gttgggttgg atggtgagat caccacagag tagtgaatca gagctggctg 8401 ccaaagcctg taataaggga tggcccttcc aaaacagccc cagggaatgt gaaactctct 8461 tcaaaaactg cctgttcccc ctgagcctgt caggttagat catctaaaca caaagcactg 8521 cattgcttct tggaacctca aatcctgtac ttgcttgtat gtcatttaga gcatgtagtc 8581 ttttctgttt tagaatcctt ttccattttt ccctatcatg ttttatgagg gcctgagcat 8641 cccatccttt tgactttgca gagaatgccc ctcacatgta atgagagtag agaccagcag 8701 tatgctcttt gatgttgcag gtatcttgct ttgattggcc caggggatct tgctttccta 8761 gtaggagctg tcagccccct tgatagaaga aggctgtgag gttcacctct cctgcctctt 8821 tgcagaaaac agttaacaaa gctggcctgg ctgtgattct ttgaaaggcc tgcttataag 8881 tctagccctt ggctggcatc tgggaacttt gatttctgaa gtgttctcac tattcccaga 8941 attggcttac tatgcttaaa ccgtttagac aaacattatg gttgatgata aacccttggg 9001 tttgtttttt tttttctggg agcctggaat tttgatatgt tccagacaga tagtacctac 9061 ctgaccagcc ccaaataaaa actggataat gaggctttaa tgagcttccc tagttagcag 9121 catttcacat gtattgtcac acaactcgtt gctgggagaa gtaagtatcc tatgtgactt 9181 ccctggaaga ggacttgaag cttgtgccta gtttcctcta gactttgcct catgtgcctt 9241 ttttctttgc tgagtgtgtt tcactgtaat aagtcatagc tgtgagtgag actaaatgct 9301 gaatcctatg agttccccta gtgaattgct gaacctgagg gtggttttgg gaacccaaca 9361 cctcaccaca cttttctagc caggcaggag gtggggacag gaacagaagg gcgtctgttt 9421 gtatgaagga agagagttgt tgctgctcag tttgtaggaa acagaagtcg tatgatttca 9481 ttgccattgg catctcatga gagtaactta aggctggact tccaggtcag ggccgagcac 9541 gtggggaatg ctagtgagcc accactcaaa tgcatcccag gcttagagga aaggcagtat 9601 agacagtgat agagccacaa gcttgtgctt ttgctaaaag agtgacaact ttgtggcttt 9661 gtgtttttcc ccaaggaatt tcagaagttg tatcccttgg agaacatcac cctggccccc 9721 gatcccgagg tccctgatgg cctaccccct gtggcctaca acccctggat ggacatcagg 9781 caacgggaag acgtccaagc cttaaacatc agtgtgccgt atggtccaat tcctgtggac 9841 tttcaggtat 
caaggacata gtttggggat gtattggaca ctgatgacat agtgtcgtag 9901 gtgaaaccac tcttctcagt agacacaact ccacctataa tgtcttatta agagctttct 9961 ttgtgtgagt tatcaggcaa agtgctgggg tggaggtgtc ctataggtat ttaggacaca 10021 ggcagtgttg cctggatcag gactctccca accagtgtct tactttctaa agaagggaat 10081 gtccagagag ttggtgactt gttgtgtgtg gacacagagg tggggaacca gcctgggtga 10141 ccgttcctaa cccaggggat actgttcagg tgacagctca ttaattgtag agtgatgggc 10201 aactcccgag cagcccagaa tgcttggttt gctggtactt gcaggttccc tctgtgggaa 10261 agcccatcct ggctagctgt cactctcagg gggtggcttg gtgaacaggc ccttggtgca 10321 gagggaactg ggacaccatc atggccagtg ccacccagct gtgatgtccg gcattccttt 10381 gccatgattt aggggcggag caaccacaca attagaatcc tcacactgac tcatcagtca 10441 ccctttctcc ctggagagca gaatacacat tgactaatgc tctttctgga atattccaca 10501 cagaaccctg cccttctctt accacatgaa tgctgtggca ggcaagctat agcggatgga 10561 agatggcagt gttttcaaag tgtgcatcct ggactacctg ccttagaatt tctgggaggg 10621 cttgttggaa atgcaaagtc ttggcccact cagatttaca caattggaag ctggaggtgg 10681 aacttgacct gacaaccgac atttcttaaa aaaaaaaaaa aaaagcacac caggtgattc 10741 tgattcatca gaagtttgag agcctctgct gtagagtatt gtttaaagca aacacagaac 10801 aaaaccactg catacagata tcccttgcta tccaaactta cttggcccct gagtttgcac 10861 aacaagagtt tggttggtga gaggtgtaca ttcctcaaat gtatttgatt cctgtttcac 10921 tgagagtttg gaaagcaatg catccatctg tttgttttct aattttgtgt ttctgggctc 10981 ctgatggtga agtggcttga actggctctg aggctgcacc tggatgcttt attagtgatg 11041 gtcacttaaa atgacccttc aaaatggact gtgtgccagg aagggcgctg attcatgtgt 11101 ggtatcgcct ttagtcctca tggggactta gtgaggtacc tgctctctgt tgtctaggtg 11161 aggaaactca aggcatggga ggtgtgctga cttgtctgac agcacaaaat tagtgctggg 11221 tggagccaat tcatccccag ggactgtaac tccaaagccc catggttgga tctgttggtt 11281 cccagcagtg tgggttcgaa ttcatgttct acctcatacc agctgcctaa cctctgtcaa 11341 gtcatgttac ctccctgagc ctccctgagt tttcttctct ataaaactga aacaaggtag 11401 gtaaagtttt tccatgataa cacctacttt agagattttc agcaagaacc aaagagagca 11461 tgtagaagta tctaggaccg tgtcggctgc acggagctat ctccttcaag ttctcttagt 
11521 cccctcctta cacacagtgg acagcatgac tcagagggcc tccctgatgc ctgcatgtgc 11581 tggtctcagc cctgtgttcc ccttgaactc tggcctgctc actgtcctct tgtagctttg 11641 gatgtccgca tccaagccca tctcactgca cccttgagac tcagctctgg gatctcttca 11701 gctccagccc tctaactcag ctctacatga gccagtgacc tcaggggcca cactcaggag 11761 tttgttgcct gctccttccg agtcaccata ctgaccacaa cctcttctcc tagcacttcc 11821 cttgctctac ctcacgggga ttgctagagt atggatttct atctccccct tgtgtccagg 11881 tcctgttggc tttgttgctt ccctaaatag ctaaggcctg gtggcaggtg acttactgta 11941 cttcccagcc gcccagctca ccatacttgt ccatttatcc ttgcccctcc tgcagtacct 12001 ttatcgtcag acactccagc cacaccctct gctctgtagt ccaagtcctg aaagctgctg 12061 gagaaaatcc atgcccccac gaagcaggga ggccctcaga tgccccgcct ccagccccag 12121 cagcccaggc aagctctctg ctcagctcac ctcacttgtc ctctgcagct gctccgtttg 12181 catgacctct tctctactcc ctttcactgc tcccctccag gcctttgcca cccctgtgcc 12241 ccatttctca gtggatagca cctttcctct tgtagaaatg gaagctggtg gacagccaga 12301 ggtaggggat cggacagggt ggatgtgggt acgcatcctg gctgtgcccc ttatcagctg 12361 tgtgactctg ggcaagttaa cctttctgag tcttggagtc cttaatgtgc cgttcacagg 12421 atcagatgcc acctagtgtg tacttgctga atgggctcta ttaataatct ctgtcactga 12481 gacctggaat tttgtgtatg catgttctgg gctcgaggcc tcttccctca actccccacc 12541 actgtccctg cttcccacgt ctgcaagcag aatgagcaag atcttccttc tcacctacct 12601 accctgcagc ctctccatcc actctcatcc ctcccagctg caaaagtgcc agcccccttg 12661 cccctgcctg tttctgcttt ccactccttc acctgacctc caaggcagag cgagtgcaca 12721 ctgggctgtt ctggctgtgt ctgtggcctc attgtccatt tgttcttcac catcctctac 12781 acaggcttcc acctccccaa tccactaaac tctcacctca gcactgacct tggtggccag 12841 taaaatctat ctgtgtaggt cctgaccttc cttgagctct gtgtcacttc ctcactgttg 12901 gccagctcca ccccttctgg gactgttggc ctcccaatcc tttctgtgtc tgccacttag 12961 gtgccacttc ccaggccccc cactatatcc tttccacctc ttcttcccct gctgggctca 13021 gttgagcctt ggcagatgac tccagaacta ccccttcagc ccaggtctct cctgattttc 13081 agatatgtgt gttgacctgc ctgtgagagc cattcctagg cagatgaccc tggagacctc 13141 agctccatag gcttccgcgt ggtcaggcca ccacccctgc 
cctgcctttt ctccttcctc 13201 ctcctcctct tgcgccccct gcccccaaat gccagcgaat tatgccacca gctgaatgtg 13261 aaacctggag tcacccttgg gttcctcaac acctcctcct cctctgctcc ctgcccccat 13321 gtctgcttga tgcccctgaa ttcctctcac atccacacag tcctctggat ccttcagcac 13381 attgttctgg accacctcag tgggccttgc tttgttttgt tcattcagca agtgtagtga 13441 gcacctctta tgtacccagc agtggcctag taccattttg tctaaagcct ctggaatctg 13501 tgtctgtgtc ctgttgcaac aggactctta ataacttgaa gcccagatca tgtcaatttc 13561 tcgccttgaa agacctcagt gactccattg gcctagaact tggagtctcc actccctgga 13621 tagacccacc aggtccatat tatctggccc ggacctaact ttctagtcac ccctctgccc 13681 tcccatttga ttctccagtc acatgggcct ttcattggcc ccaaatgcaa cctgctctgt 13741 cacaactcca cactgttccg tgccctgcag cccctgcttg gagcatctga gccccatttg 13801 tctgactgca ggtgcttctt gtcaaattct aactcctctg tgaagccttc cctgaatctc 13861 ccaggcagat ttgagggctt attcccctgt gtcacccctg ggcctctgtg ggggatcggt 13921 cagagtggtg ggaaaaacta tagggaaagg atgcaaacct tctgaaaggt cagaaggttc 13981 tgcagagccc caggggagaa tagctgttct ataaccctga ggcagagggc aaggagtagg 14041 tacaagggag tgtgggagaa tttatcttaa acaggcttgt ttacttatgt tgaccaggaa 14101 ctgacctttg atcgtctgtg cttgtgaggt tccctgaaag gggaacaata aatgttaatt 14161 acctgcaggt tggctctagg tttttggcat tatgcctgca ctgaataaaa gccagcagct 14221 ccagcttctc ggggctgctc tctggccact agagccaggc agtaacctag ctgctcttat 14281 gctgcatacc tgtgtctgag tactcatttc atccataggc cagggtctgc aggacagacc 14341 cagcaggcct cactgataat taagcatttt cttgtctcca tgattgtgtc ctcactggat 14401 ggggtggctc atcaagggtg aggaccttgt ctgcttattg ccttacgtcc aggggctagc 14461 ataggaagga gggaatacct ttgtatagga catagccatc ctgacaggtg tctttcatcc 14521 ccaggaggac caaagttcca caggtttcag actgaagact tcatctacca gaaagtataa 14581 gtaggccagg gctcagcata atcctgctgg aaggctagat gtatatcttt tctcttgact 14641 gcaagtgaga acgggtgagt ctcatgattg tcctcaccct ggcagtgatg agaagaagct 14701 ggtgggtcca gctgataagt caggggctgg tctgcgaaga cacggctctc ttctcacccc 14761 tctttggggt ggaagattaa tttttgtcct tagcatttgt gaaccaggtt gggaatgaga 14821 gtcagcccag gaggggccgg 
tggctcattt acttcagggc atgatctggc tgttcccaaa 14881 gtgcctcttg ggttcaggga ctgtgaagga cgtgctgctc tctggtatct cctttgctct 14941 tctcctgcct gctgacagtt tgttagaaat gagctccacg taacggccat agttcatgaa 15001 gatatgcagt tgtaatctgc tgttgggtta gcaagtgatc agagggaaag gctcagtcag 15061 ggttgctgtt atgtgttaaa ttatgaattt ttttttccct gaaaagggct gttgacatcc 15121 taacccccca gtactgctga atgtaaccat atttggaaat aggtttattg cagatgtaat 15181 tagttaagat caggtcatat tggagtaagg tgggctccta atccaatatg atcggtgtcc 15241 tttaagaaga ggggagagag acacgggaag aacacatgaa gatggagaca cagtgataca 15301 gctgcaagct gaggaatgcc agggactgac agccaccacc accagctagg gagaggcaag 15361 attcttctac ttagagcctt cagggagaga atggccctcc cagcaccttg atcttggact 15421 tctagcctcc agagctagga gacaatacat ttcagcttaa aacaacagag cagtaggtga 15481 cacattcatc cactgtgatc tgtgccctaa agtgaaatct gatttgctta gaaatgtatc 15541 ttatttgtag acctacagat tttgctgtga ctctgtgggt gaagtgatgc tgatggtagg 15601 gaaaggagtg ttacattttg gctgagaaaa tcattaaggg catcaactaa ggggtaggga 15661 ttgggagaga tgcacaggca agcattatct ctgtatgcct tggcaattta aattgcagtc 15721 actctcattt ttattttttt tcaatttgca gcggaaaatc cgccagagct actttgcctc 15781 tgtgtcatat ttggatacac aggtcggccg cctcttgagt gctttggacg atcttcagct 15841 ggccaacagc accatcattg catttacctc ggatcatggt aagcattttg aaattccctg 15901 gtgagtcaaa acatctgaac tttcctgtga aacatgcttt gcaaaattgc cattgacata 15961 aacatgggtg tgttctcttt tgtgaaccag tggttcacaa acaaagtggg atcctggggc 16021 atttttatga tcagcttgtt agtcctgaga cctctgtctt agatgcttga cggataggtg 16081 tgggtgggtg ggggagggtc ttgtagcaac tttttttttt ttcttgatga atgcagcagg 16141 aaccctccat ttgtgtggca gagctcttgt aacccccatt tttaacctgg agggttggag 16201 gacttttagt ttgggtggag aggatccaga acaatcctgg cagagcccag caagctcttc 16261 acgcctggct ccagccctcc cacccctatc cccgctgtct tctctgccga gagcctgggc 16321 ttttcaagtc tttatctccc cctaaggctg tttcctactt ttccaaaaat gaaactatct 16381 ttttaaaagc attttttaaa ttcttcaaca ttccaagagc agggaataaa acagcagctc 16441 cccccgtttc ccactcacat agtcttgttc ctctcatcct cacccactcc cccatctctg 16501 
aattgtttca aagcaaatcc cagatggttc cattttccga tcactgtctc tgaaaaatat 16561 tgcctctttt gaagtataac cataatactg ttatacttag agaaaggaaa ataattttca 16621 tatcatcaaa tatgaaatag gataatagta aatcataaca tgtgcttatc atgaaaagaa 16681 aaaaggatac agagcatgta attcagggcc agaacatcct gcccttttct cccacctgct 16741 cagggctaac agatgacaat ggattaggga tccatctgcc agccatcagt tgtgcacata 16801 tgcacacagg gatatatgtt atacatgtgt atgtttgtaa catgtatgtc acaagatgtg 16861 attgcttgtt ctgtgtgtac tgttaacgtt ccctggtttt gtacacttaa tgtcataggc 16921 aatctctgtg tcatttctca gaagcctacc ttttccctta gaaatgtctg taatatttta 16981 ttatatatgg aggtgccata atttttcaca tattcctgat tatctgccat ctgcttctga 17041 gccctcggtg cccagtttcc tcttctccac gaacacactc tgttgtgagg cagttgccgt 17101 agattacaaa tagcccatct cagagtcccc tcctgggaca tcctccatca gaaccgcctg 17161 ggagccagta aaacatgctg gtttctgcaa ctggtctcag gagctcgcag ttctgggcat 17221 gggaaagtgc actttaacag gtgtctcagg tggtctggcc acctctccct gagctcatca 17281 ttccctgggc ctccctatgt tgccacgtcc tcatcttggg cctctggggc aggagccaga 17341 aacctcctga gctgcctgtc aaggtcatcg ggcttgttcc tcctgatctg gagatggatc 17401 cttggggccc acacaggcac gcgttctcct tagccagact tccccgtatt tgctcctggc 17461 tgcagcagca caggctgagg cccggcacca gatgttcaat acgatcttgc agtcaggcgg 17521 tccacagcta tttctgtagc atttgccatg tgtcagaccc tgtgccaggc ctgggggccc 17581 cctctgcctg gtgcagggca gctcacaaaa gctggcagag gccgaaggct gcgtgccagt 17641 attcagaatg ccacagagcg cctggctgct gtaccttcag agcctccaca ggcaccccag 17701 tcagagttcc gggagtggat tcctgaggtc gcactgccag gttccctcac gttgccatcc 17761 tgcaggcttc tcgatccttg accttttaga ttcccgccac actgaatcta aaagggacaa 17821 tgtgacctct aagctggtgg cctaggagcc tgccgataac ctacccattc atcagccctc 17881 gtgggtcaag gctgcctcgt cctccccgag agagtggaga ggtcgagcag tggggatgcc 17941 cctcaggccc cggggcttac tgactggggc gggtgtcagg gggaactccc tctctttctg 18001 agctccactg agctatctga aagtccccac catactcccc agtactgaca gccagaggga 18061 ggaacggctc accagaatca ttgggcatct ctgatgggca tcaagtctgc tttgattatt 18121 gatgagtggg taagagggtc ctctcaactt ggggtctcct tgggaaagca 
ctgtcaactc 18181 agaacaggtt tcagccctgt tctggagacc acaggccctg gaaggctggg acatcatttg 18241 cagccccagc gtcatctatt agacggagcc agcaaggtct catgctgtgc ggtacatttc 18301 aacactttct caaatttacc cgtggcagct tttggtgtgt ttggttattt ttataacact 18361 actcactgta caatttctgt catgcacatt ttttgtatca aaaagggtgc tgtaacttta 18421 aaagactggg tatttccaac ttggaatatt caaaccatct tttgcaaggg atgttttaaa 18481 taggcatgaa gggttgtttt taattgaggt taaggatctg aaatgagagg ttttggttta 18541 ccctatctat ggtatgtctt aaaaatcaac gaagatgtcc ttgtcttttt tgaatttgcc 18601 gagtgtgttg cagttccaca gctcactgtt aggtggcaca taccccaaac tgaaaacctg 18661 acctcgtagg gcatgagtca aaagacaggt aggcacagga cagggcagtg gtgacactac 18721 agctttcagg gttcccagcc tgtcaagaat gagcatgtct tagcagggga ggaacgcatg 18781 tgtgggaacc gccacagagt cctacgttag gtatatgttg ctagtagttt gttaagatat 18841 ttgagtttgg gaatttaatt attttttctt ttttaaaggt ttcctcatga ggacaaatac 18901 ctgattttga ataaagcagc attcagttga aataaccctt tctgtggtaa ttccaagtga 18961 atatttttct tctaggtgat gagtttctac ttcctctggt ttttacaaca ggaaatgaaa 19021 tggtatctaa aataaacaag ctgtggtatg atgattattc attttctgtc attctgtgct 19081 ttttatgaac tagggtgggc tctaggtgaa catggagaat gggccaaata cagcaatttt 19141 gatgttgcta cccatgttcc cctgatattc tatgttcctg gaaggacggc ttcacttccg 19201 gaggcaggcg agaagctttt cccttacctc gacccttttg attccgcctc acagttgatg 19261 gagccaggta taaaatatgc tgaaatgata ttgcttgaca gtaagatcac ctttagttta 19321 tatgtgaacc actttattga atcataggct ttgggggtta cacagacccc cccttcctgc 19381 cttgtgttgg aatttcttct caacattttg tggtcagcac acgctttctg agcatctccc 19441 atactctggg cggggaaaga gtgacagaac cagaattgac aagtatggcc cctgtccaca 19501 aggagccttg tgggatgatg atggacaaag gaacaggtgc atccccaaga tggggagggc 19561 tgtggtacag tctacaagag cctgctccag agcttagagt aaggggctgc ccccgtaaca 19621 gacgaggaga atcctttccc tcgatgcgca cttttggtag ggactcagcc ttctttgtgg 19681 ggggcacatt ttccattttg acagctcatc aatctagtga accactagat ctgtaaaatc 19741 taccaggtga ttctactctg ccctccccag cagtgaaggg caagtttact ttgacttcca 19801 tgtgccagtc cttcaaagct ttgaaaacag 
cactgatgcc cccgaccctg tccatccacc 19861 aatatcttct cttctctagg ctgaaatccc cagcccttca accattcgtc acaaggtagt 19921 gtttagacgc gtggttgtcc tggtctctgt cttccaaaaa gtttcagttt gagaaagcac 19981 ttccagaata ccagttggac cagatacccc tcctggatgt gtgggcaacc tgtatgggtt 20041 cttgtgagtt ctatcctgac ttgatctgcc cagtgcattt cagggacagg ttgtacactg 20101 tctctgaaat gacgtatgac gtttgctctt tttggagtgc ctcttctatg aactatgcta 20161 aaccagattg ccccctttct ggacttaagc atttaatcca tccttaacat ctaaatcttg 20221 tactgtatat tgattcctgt ttaaatttgt cttggttctg gaccatcatt caggcctttc 20281 tggatctttc tgagcttgct ttatactacc caattttatt ttttgcactt tgatttttgt 20341 gcactgcatg tcctgtcttc tttcatgtca ttgatagaaa ggttaaacag tttcaaggca 20401 tgattttagc cctttgtttg ctgggaaaga tcttctgatt ggttgagaat cttttctgaa 20461 acttttgttg ggaagtgttc agcaatttag aaattgatgt aactgcacgg cctttcagtt 20521 tatttccaaa ttttatctct agctgtgtct tttccctgaa cttcagactt ttgtttccaa 20581 tagtctttcc cacttagcca gcctaacata tccaaagctg gacttcacat ctttcatctt 20641 cagcttttcc ttcaacactc ttcttgtcct caagaaatga gaatagtcac tcatgccaaa 20701 aaatcatgga ataatctcaa cccctgttct tctcttactc catatgcaat attcagtcta 20761 tgagaaagcc tgtcacactc cattcagact ctgtgtacca cttcccacca cttcgcctgc 20821 cactgtcttg gaccagactg ccactgtctt gtgctgaagt actgcaagag cttgctaaat 20881 ggtccccttg cttgtgccct ggatcccttg gaggttttcc tcctcagagc agccacagtg 20941 attccattaa aacccaagtc aagtcatgtc acagcccggc acaggagcct cttatgtacc 21001 cttcttgatc tgagtaaaag tcgtcacagt ggccttacat gttctggccc cattatctcc 21061 ctgacctcat ctttttataa gtatccaggc cagttgtcct ataacagtgt cccacagcct 21121 ggatttatct gattgcttcc acatgactac attcagggta acatttttga cacgtgtgct 21181 acatgggctg ttgtattctc ccattgtgtc acatggtggg ggcacttcat ggccagcgtt 21241 actagtattg taagtttgaa cactcggttg aagagctagc tagcatcagc cagatcttgc 21301 cattgtaaag gtaccttttt caactttctt tacttgtttt ctttttattt tacttaaaaa 21361 ttaagtgtct agaaaatgaa atcaagcatg ataaagcact gtcttaaaga tccagaaggg 21421 ccagaatggt agtgcaaatc caatttcaat tttaacagga atcaatataa agtacagtat 21481 ttagacttta 
aatatggatg aaccacaatg aaatactact tcatacccac taggatggct 21541 attaataaaa aacaacaaca aaagctgtga aaggctgggc gtggtggctc acgccagtaa 21601 tcccagcact ttgggaggcc aaggtgggcg gatcacgagg tcaggagatc aagaccatcc 21661 tggccaacaa ggagaaaccc catctctact aaaaatacaa aaattagctg ggtgtggcgg 21721 tgcatgcctg taatcccagc tactcaggag gctgaggcag gagaatcact tgaacctggg 21781 agacggaggt tgcagtgagc tgagccgaga ttgcgccact gaactccatc ctggagacag 21841 ggctagaccc cgtctaaaaa aaaagaatga aacaagtgtt aagagtgtta gtgattatat 21901 ggaaaaattg gaacacttgt gcattgctgg taagaatgta aaatggtgca gccactgtgg 21961 aaaagaattt ggtggatcct cagagttaaa catagaatta ctctatggcc cagaagttcc 22021 actcctaggt atatatccac agagctgaaa acaggtattc aatcaaaggt tgtacattac 22081 tgttcatagc agcactattc acagtaacca gaaagtggaa gcaattcaga tgtctattga 22141 cagaagaaca gacaaaatgt ggtccgtcca tgcaatggaa tattattcag tcttaaaaag 22201 gaaggaaact gacacatgct acatcatgga tgagccttga ggacattatg ctaagtgaaa 22261 aaagtcagtc acaaaaggac aaatactgta taatcccact cctatgaggt atctagagta 22321 gtccagttca tagacataga aagtagaatg gtggtttcca gttgctgggt gaggatggag 22381 aaaggggagt tgttacttaa tggggacaga gtttcagttg tgtaaattaa gaggagttct 22441 ggagatagat ggtggtaatg atggcacaac agtataagtg tacttaattc cactgaactg 22501 tatactaaaa agtggttacg atggtaaatg tcatggtgtg tgtatttgat cgtaataaaa 22561 tgcaaagata actctaatgc agacatccag aagggagtaa tgtgtggggt gatcctgtgg 22621 gagtcagaat atcctgttcc tcaacagact cttttaccta gtggttgtag ccttaattga 22681 tgatccttgt ctgagtcagt tattaaatcg gtggttccaa aattcaggtt gttttaaaaa 22741 gcgccaacag cctcgtgggg ccctaatttt gcatcctgct atttgattgg atgagtaatt 22801 aatgcagggt gaggtgccga ggtggtgttt ctaaacgtct gttgctaaag ataaatgttg 22861 taaattaaaa aaagaaaaca tatggagccc agacaggttc ctttactgct cctgcctggc 22921 catggcaggc ttttataatg taacccattc tgctctgtcg cttcctgttt caggcaggca 22981 atccatggac cttgtggaac ttgtgtctct ttttcccacg ctggctggac ttgcaggact 23041 gcaggttcca cctcgctgcc ccgttccttc atttcacgtt gagctgtgca gagaaggcaa 23101 gaaccttctg aagcattttc gattccgtga cttggaagag gatccgtacc tccctggtaa 
23161 tccccgtgaa ctgattgcct atagccagta tccccggcct tcagacatcc ctcagtggaa 23221 ttctgacaag ccgagtttaa aagatataaa gatcatgggc tattccatac gcaccataga 23281 ctataggtat actgtgtggg ttggcttcaa tcctgatgaa tttctagcta acttttctga 23341 catccatgca ggggaactgt attttgtgga ttctgaccca ttgcaggatc acaatatgta 23401 taatgattcc caaggtggag atcttttcca gttgttgatg ccttgagttt tgccaaccat 23461 ggatggcaaa tgtgatgtgc tcccttccag ctggtgagag gaggagttag agctggtcgt 23521 tttgtgatta cccataatat tggaagcagc ctgagggcta gttaatccaa acatgcatca 23581 acaatttggc ctgagaatat gtaacagcca aaccttttcg tttagtcttt attaaaattt 23641 ataattggta attggaccag tttttttttt aatttccctc tttttaaaac agttacggct 23701 tatttactga ataaatacaa agcaaacaaa ctcaagttat gtcatacctt tggatacgaa 23761 gaccatacat aataaccaaa cataacatta tacacaaaga atactttcat tatttgtgga 23821 atttagtgca tttcaaaaag taatcatata tcaaactagg caccacacta agttcctgat 23881 tattttgttt ataatttaat aatatatctt atgagcccta tatattcaaa atattatgtt 23941 aacatgtaat ccatgtttct ttttcaaatc taaagttaaa aaaaaatagc agaagccagt 24001 gtcttaaagt ctatcttttg tttctaagac catgggattt cataatctca agataaaata 24061 tgtatgaagt aattaatgta gaatttttac accaaataat aaataatgct taataaacta 24121 gagatatgag atgtgtagga aatttggtta aacttttttc agatactttc tggcccaaat 24181 aataatttgt tagcaaataa tatgaccctt gaactcaatg gccatctatt aaaagactgt 24241 tgttcacact ggaaaacatt taaagatgtg actatatcca tgggtggatt gaatcactca 24301 aaatatatta gtatccttct ttagggatgg ttggttacag acatgtattt attcaggagg 24361 cagaaaatat tccattttaa ttgcttatta aagaaaacat taaattctaa attattttga 24421 ggactgtgaa gacttttcat tagtgtaata ttaggtcatt gtcaatctcc cagaatgtag 24481 ttctatattc tctaaatatg aaagtatcca gaaaggccag tggtagtaaa aagcttagtg 24541 tatataatct caaaagggat ggaatattta caactcatat ttataacatg ttgaatcttc 24601 tcagttatca gtagtcatca gaagtgtcaa tagctttcta aataaatatt aaatatctac 24661 tgtcctgtag tgaaggagta atttttagta attttctctt tacaaagtct ccagtgtttc 24721 caggtaaata tttgtgaaac aaaatacagc aaactacatt gttacttcag tgtattgttg 24781 ccaaaaatga caagatatta tattaaaatc agtaaatttt 
agacagattt taaaaattaa 24841 ttagcctaca atagaggtta tatggtaaca cggtgatctt ctaagcagtt aagtgactga 24901 ctgttctggc aacaacgact tctccgtgac tgaagggccc tgttcatttc ctgatcctga 24961 agctcgtctc tcttttgagc ctccgcttgc tttggtcgat ggtttccctc agctttttct 25021 ttgctgttct tcatcctcgt tgttgctgtc atcatgttca ctgtggcttt tacaatacag 25081 cctgtaaatt ccttatgaca tagttcagtg catttggctt tattgcctgc tccacagttc 25141 tttaccttta cttggcttag agaaactgta tctttgttgc ttcatataac ctttccccaa 25201 ccccactaag ctggacataa cttattagtg gtcctcccgt cactttattt gtagaaatct 25261 ctctttcaca tgagcagggg ttctttcatg tggtttagct gacagcagaa ctagtgattc 25321 tagacatttt gcatggccct cattcagtgg ctcacaaaca tgagggagca tcagaactac 25381 ttgaggggct tgttaaaacc cagtgcgtta gaagtcggat gcggtggctc acacctgtaa 25441 tcccagcact ttgggaggcc caggcaggcg gatcacttga ggttaggagt tcaagaccag 25501 cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaaagttagc cgggtgtggt 25561 ggtgcatgcc tgtaatccca gcttcttggg aggccaaggc acaagaatcg cttgaaccag 25621 gagacggagg tttcagtgaa tgaagatcgt gccattgtat tccagcctcg gcaacacagc 25681 aggactgtga ttttctttgg agactcctag attttctgtg gttttgaact gaatttgttg 25741 gatgttggca agtgcctctt atgagctgtt tctttatcct gcatttgccc cacaaagact 25801 tatctggagg tgagcaaagt atgtttggta gtgaggtcac aaaggcaatc agccccttcc 25861 tccccactcc cattgccatc ttctcagtcc ttctcccttt ctttccaagt agtttaccca 25921 cccctcctct ttcctcccct gtccctaaaa taatccacgt gtcttcctaa aatctctctt 25981 tgatcctgtc ctttgataac accgtcagtg cctactactg ggtctagaca gacctctgtt 26041 gagcagtcag agtcttccct gactccacaa tgcccctttc cttggctgac cagtatgact 26101 actggtcccc acctttccct tgcctatccc tacctccctc ctactaggtt gtcccatccc 26161 tctcttcacc cattcattca tgaccatttt tcactaccaa gctccccccc tcccgaagga 26221 ggctgaggtt tttgtgactc tctagactct attgtgggat ggaatgaaca ttgctaaaga 26281 atcttgtgtt cgctttactt taaaaaggta tttttttcct aattataaaa ctgatgtgtc 26341 agttacggaa aaattagaaa tgcagcacaa atacatgaat attttaccac aaaattgcca 26401 tataatatct tgtctttttt gggggtgtga attttttgca ttgttctggt catattcttt 26461 atcatgtaat ttatgttctt 
ttttactaag tattatgtgt ggttattata gattttcaca 26521 aagatatatt gctggtaata tattttattg tgtagtctta taatttactt aaccttcttt 26581 caattgttag aaatttaggc tatttccaga ttttcagtat tgtaaataat gctgtgatga 26641 ccaattttgt gaataaaatg tttttatgta tttcagatta ttcccttagg atagtctctc 26701 agtgccaagt tgtcaaaaac atctctattt tgcttatctt cctgctctct tgctgcctta 26761 gggggtagta aactgaaaca taaagtaaac atgcatacaa ataaaaaaca taaaacaaaa 26821 ataagcaacc tgatggtaat aggtgaaagt ggtaacctgt tttaactttg aattcttgcc 26881 gggcgcggtg ctcacgcctg taatcccagc actttgggag gctgaggcgg gtggatcacg 26941 aggtcaggag ttcaagacca gcctggccaa gatggtgaaa tcccgtctct actaaaaata 27001 caaaaattag ccgggcgtgg tggtgggcgc ctgtaatccc agctacttgg gaggctgagg 27061 cagagaattg cttgaaccca ggaggcggag gttgcagtga gccaagatcg tgccactgca 27121 ctccagcctg ggtgacagag cgagactccg tctcaaataa aaaacaacaa aaaacaaaaa 27181 aaacttaaaa ttctttgctt gttagtgacc ttgatcatgg ttctctttgt acgatagttg 27241 ggcatctgta tttccacttg tgtgaatttg cctttaaatt ttggttatgg gtttcacctt 27301 ttaaaataat caaacatatt tatcttttcc tgtgtgatag gtttttttct gtatcttttc 27361 ctgttaaaca cacagacccc tccccaatct ggacattgaa taaatattca ttttcctttg 27421 cattgtttgg ggtttggttt tatgcttact gtgtctctcc accccattat cagagtaata 27481 tatgctcatt gcataaaatc tggaacatgt gaaaactgga gatgaaaaga agcccagcat 27541 ttgagacata cccactgctg acatttgtat atttcctttc attcttttac ctgcctaaag 27601 acatatatgc agatatggac atatttagct ttacatacat gcagactacc acatataatc 27661 tatatcttgt ttttttgatc taacacaatg ttagcttggt tttttatgtc attgacagtt 27721 cttgattaac agctgtaatg gctgcacagc cctgcttcat taattttttt ttttggactg 27781 aaatgtatgt accacgttcc gcttatccat tcatccacca gtgaccactt gggtcatttc 27841 taccttttgg ccattgcaat aatgttgctg tcaacaagga tgtacaaata tctgttcgag 27901 tccctgctct taattgtttt gggtatacac ccaacctaca ttaatcttta atcacttaag 27961 atgttggggt ttgtgctggg aggtaaccaa gtgactctgg agcagcctca ccctagctac 28021 agagggaaac tgactcttcc ctctctgccc aatcttttgc cgcctgtcag gttcccaagc 28081 tgtccatacc ttactgtgtg aagcgctttg ggtataaact cccaaataga ggctcagccc 28141 
cttttccttt cccaagtcca tcctgcaggt tatagctctt cagaggaatc tcttgaattc 28201 tggcctggga tctgggatta aagtagaacc cagggtgtcc cttggggcct taggcatagc 28261 ccgttgtggt ggaacctgcc atcagcccca ctgcactcag tggtcccctg gtacttcagg 28321 acagtttctc tcttctgcac atgagaactt caagcagaga ggactgtcag ttgttgcctt 28381 cacatcacca atcgtttggc cataatggcc aagagccctc actgctcttg ggtcttgcct 28441 ttccacatgt tactgaaagc tcccgttggt gtccctgtcc actaactcac ctgatgggac 28501 agatgccact ttcctctgag tgtctgataa aacacagtct gttgtgggga gcccacactc 28561 aagggttttc tttgggatca gagtttcccg gctattactc catgccctgc ctgacatcac 28621 tgtcatgtat ggtgctaatc accaaaggat gcccttgttt gattctggat tctaggtgat 28681 agtccttatc ctctgatggg ggaagagttt gggtgggaga gaacaaaccg acaagttggt 28741 tccagatgag gtggctggga gccaaccctc aggtacttgg ctctgagtaa tgtgacggcc 28801 caggcaagag ggctcccact gccaatgagg aacaaaagct caaggtctag tagtgtattc 28861 agcacatgct aggaaacaag agttgggacg tggacaagga gaggagcaaa ggccaggggt 28921 taactgcttt tggtttttcc tctataaaag tcagggacta tgatttatat gggtgagaag 28981 gtttggagta ggactgaagc tgccatttac aaaccccttg ccctaattta ttgtagtttt 29041 gggtgtgttt ctgaaagaac cttttgcagg tgatttttat tcagtactca ttaacttaca 29101 gcttttatga attcacatct tctacactat tgaaaaacat acttttgtat tgtgttaagt 29161 gcagcatgag ttattatatg aataaatcac attaaatttg ccattatcgt gtgtgcaatt 29221 cttgctgctg gtttgggacc gcttggtgct gccttaaaat aaaaggagcc atttaacaga 29281 gcttgatacc gtggtacagg tgtgccccct taaagtatcc aacctgttcc tccccatgtg 29341 agggccaagg cagctgtggt gtgggttttg ttagggagtt gaattcttga gggtagatgg 29401 tgagttgcac ttgaccatag acacactttt ctgggcagtg gtatgtctct cagcttgccc 29461 tgcagttaca gtacagagac tccccagaaa gaaagaggaa gtgggctgac caattaggta 29521 ggcttaggga aaccatatat gcaatagcaa caaggtaaag gaagcaaatt agtgggttct 29581 tggggttcca actggaaatt tgcctgttta gtggtcttgg cttagtgtct tcatgagcct 29641 tagtttatct ttttctacct atcctactgg tttcgactaa aaattcggaa ggcagtgctt 29701 cttatattgg tgaggacatt ggaagacata catttccaaa tagataaagg caggctggaa 29761 ggattgagta aatctcagca gctgaagtaa cttcaatgta ttcatttact 
ggcttgttta 29821 ttatgtatgg tgcgaagttt aaaaggcaca aacatgaggg aggattgctt gaggccagaa 29881 gctcaagact agccagggca acataacaag acacccatct ttacacaaat ttttaaagaa 29941 attagctggg tgtggtggca tggcacctct agtcccagct actcgggagg cacaggcagg 30001 aggatcactt aagcctggga gttcaagact gcagtgagct ctgatcatgt cactgcgctc 30061 tggcctgagt gacagagtgg gaccttgtct caaacaaagg cacaaacatg tatacaatga 30121 agagtcattt cctcttccat gctgttccca gctaccctat tatcctcaga ggcaagcatg 30181 cctaccagtt cttggaaact gctgaagttt gaaaaggata aatggagacc taacaggaag 30241 gccccagaac caaggtgcta gtgaggggtg acccaaggga gcagtattgc acatgcaacc 30301 cttttacatt gtggccaaca gctcagaatg gggatgcggt gtgatgtggc tgccaggaag 30361 tgaaaggcat ctcaggctac ataaatggat atccaggagg aaggaggagg taagctgctc 30421 tgcctacgtg tcacggtcag ccagcacctg gagaattgtg tttagttgcg aacatggagc 30481 tgtcatggaa atgtggacaa actggagcct gttcagagga cagccctaag atgggtgaga 30541 cccagcatat catgtgagga acagttgaag aagtgttcaa atggggagag aaaagactcg 30601 ggtgggtaaa agtagggggt tggggggaga gcatggcact atcttgaaat ttctgaagaa 30661 ccatcagagg caagagagat taaacacatg atatctagag ccccaaggga agatccagca 30721 ctggaccctg caaggagaca gatggccatc cagaggttga actttcggcc gggtgtggtg 30781 gctcatgcct gtaatcctag cactttggga ggccaaggcg ggtggattgc ctgagctcag 30841 gagttcgaga ccagcctggg cagcttggtg aaaccctgtc tctactaaaa tacaaaaaaa 30901 aaccaaaaaa aattagccag gcttggtggc gtgcacctgt agtcccagct acttgggtgg 30961 ctgaggcagg agaattgctt gaacctgggg ggtggagggt gcggtgagct gagatcgtgc 31021 cactgcactc cagcctggtg acagagtgag actccgccaa aaaaaaaaaa aaaaaaaaat 31081 taaaaaacag aggctgaact ttctaacata gcagcttacc attgaaggag cctgctccta 31141 agtgtggtga gctgtcatct gagctatgta agcatcacgt aggcaagagg ttctcaatag 31201 agagcagttc tcctcttggg acatttgtca gtgtctaggg tcatatttgg ttatcacaac 31261 tggggaaagg ctgctactag catctagtgt gtagaggcca gggatgatag gaatcacaca 31321 cagacagcac cccctccccc agaacaataa agaactatct ggcccaaatg gcaatagttc 31381 caaggctgag aaactctaac ataggcaaac atttgctaca aaaagaaaaa ctactaaaat 31441 cccctctttc tctgagactt ctatgattgg 
agaatgcatg ctcaattatt aaaatagcat 31501 gctgactata tcaggtgatt ggccctttgc atatagcctt gaatacattt ttaaagattg 31561 caaggagctt tcaaatcaag atgatttaat aacaaaccat ttcatgtagt agtagtagat 31621 tggatttctc tttcttcctg ttcagtggag agaccctccc atctgagttt cctcacccat 31681 agaaaagatg ttaaataggt gatttctttt aaaaaataat attgcatttt ttctcagttg 31741 aaaaattatt acatattcat cacagaaaaa ttagaaaaat atacaaaagc aaagaaaata 31801 atctaacacc actgtttact tggcatagtc cttcccctga ataatacata tataaaaaaa 31861 tggttcatac tctatctcct attggttttt ctacatcctg ttttacaacc tgggtgtgtg 31921 gtggcatggc acctctagtc tgcttttcac actgtgtgat ttctaagggc ttccccaatt 31981 ccaaatttct gtgacacaca aattggaaga atttccatat aaacatgtga ggccatttcc 32041 tggaggacaa aactagaaac ctctgagttt ccttcctctc ccatgacgtc taataaatgt 32101 gagtgtttcc acaaaagata atactgagga tgctgctttt cctagccttg agtttcatga 32161 ctctgtttgc tcacagagtg ccctggatag cccagaaact ctatggaata gagacatcca 32221 tcaccaggcc ttgtggattt tacctctttg tgtatccatg tatccatctg atacatgcat 32281 ccactgatac tttgtgtatc agtgtatcca tccaggtttc tacctccacc gggtaccatt 32341 ctgaagccat ctcctatcac ttggacatgg aaccacttcc taactgatct cctactcttg 32401 ctacactgca aacaagttca cacagttctg aatatgatct ttgcaaaatg caaacctgat 32461 tgtgtctctg cctaccgctt gtacctccct gctctccctc cctcccaatg gcttcccatt 32521 actacgtgga caagcagaaa acacctgaac ctgccctttg tgtagaccat tcatggtttg 32581 ccccctgctg acctgccaca cctccttgtt ttaactccta ctccatcttc atggccaact 32641 taacccttcc tttccaaggg cattcttccc tgatgccttt gactaggttg catagcagct 32701 tgcatcacac acatactact tcttcatagt cctcagcttg gttgtacttt ttcactggtt 32761 tgtgtaatta tttcatttgt atctgtttcc ccagaaagct ccatgggggt agagagtgtc 32821 tatttttgca tcattgcctc ctagtgccca gtacacagta acacccacaa caaatgtttg 32881 tggaatgaat gaatacagtg ttcataatgg ccgcattgtt ttctgtcatt tccctaagct 32941 cctattgctg actctttagg ttgtttccag tgtcatttct attacaaact atgatgcaga 33001 gaacatcacc atgcagccat ctttgtgcac ttgtgtatta atgtatctct aaaggataaa 33061 attgtagaaa tagaattctt gggtcaaaag catgtgtaga ggacatttga catgatgttg 33121 gacaccagca 
tcctaagtgc ctttctctac ttgggaaatt ctgcactcaa ttattttgag 33181 tggcaggcag ggagagccct cttgccactg cagaggtgca aagtgccagg agctcatttt 33241 cccatgactt ccttgtggct agagagcagg catatgactt tgactcaact cagcaagatg 33301 ctttcctcct tgagttggta aagcaggaac cagtgtgaat tcatcttccc ccagaggaag 33361 gctctgtggt gatgttaacc accagtgtcc agcactggta gtgtcactat ggttaccaaa 33421 ctgtttttgt gtggttatgg cagcttgcct ggtcaccttt gtccttacct gttttttagg 33481 tttgagactt tggccttctc tgggattcta tgagctactc aagcctttca aaaacaaaca 33541 gacaaaaaag aaaaaaacct ttttttctgc ttaaattaac tagtcagtaa aacaaccagg 33601 gctcctgatt gatgaagtgt ctgcatttga aaatctggaa gttagtgcca aatttcctcc 33661 agagaggtag cacaaattca cactcccatc agcagagtgc acagaacctg agttcctatc 33721 ttccctctag tacttcttac gttgctttgc aatgacctat ttctaaggaa ggggacattt 33781 cttattcatt tttctagact aagcctctaa cctagggcct ggcaaattta ggcacttaca 33841 tgtttgctaa atgaatgaat gaaaataatt gtctttaaag agtttgagaa agaccaaagg 33901 taccgtagct tgaatcagag actgggagtt ggaaggacac tcatccacct ccttatttgt 33961 ttattttcct tttgcttttt tagagacagg atctccctat gttgcccagg ctagcctcga 34021 tttcctgggc tcaagcgatt ctcctacctc agcctcccaa gtcgctggga ctacaggcat 34081 gcaccactgc agctccttat tctgaggtgg gagcttaaag cccagaggag gcaagcagct 34141 ccaccaagga gacacagcta gccaagtgga agcagcctca gatccaagtc caccaagtct 34201 attttcagcc cacctccaag tccagtcttc tgcaaggact gctgtttcca caaggtttct 34261 cttcttgggt ctcccgatcc agggagtaga tttcagctga aatgatgtgc aaaggagcca 34321 cctcaaagct atgaaaccgc cagaaaatca ctttgtgaaa ggaagaaggc tggcgaccac 34381 ccgtttggaa aattgccttt gtagtaactg gggggcacac atgtgtgtgg aaggatcaag 34441 cgtcagcctt gctatggcca cagttttgat tttcctttat tctttgtatt gctttcaaat 34501 caaagggaag ccgtagcaga gaacctcttg ccctgagcta cacatatgaa aatgaacaag 34561 gttatgggag tcgcgtgttt tcaataatgg atcgctggtg ccgccctttc taggaaatga 34621 agcgaggctg ttgctaggtt ggcggtaaga ggcactgggc aggggtaggg ggtcctaatt 34681 ccagtggttt tacctttccc actattcatt ccagtccaac caattgatag aataagaatc 34741 aattctgagc agaacagtgt ctcattctat ttcaagcccc ccccaacctg ctttttgaca 
34801 tttctatcac cacagcttga tagagaaaaa tattcacaaa gccgagtccg ttttgaagag 34861 aacatctgta aaattgggtc agcagatacg ttacacgttg gcgtggaaga tgaaatagaa 34921 ccctgtctgg tctgagcttc acagcaacag ggaagggctt tgtttccacg gcggcgcgca 34981 cttggatttg tgccttacag ctctgatccg gccacacagc ccaggcccct gggcagccac 35041 agggcgccgc cgccttccgc cctggcttta gggacaagct ggagcctggc ggtggagaca 35101 gcgtgtggtt cctcagagaa ccactggagg ccgctgtggc accaagaacg cacgaagggc 35161 gcccggctgg cggagtggga gtctccaagg agtaatgact gtcgcacaca ctgactgttc 35221 tcaaattcta gtccaggatt ttaaagacac tgttatgaat taacgagaat aaaatactcg 35281 gcagatgaag ctcgggtctc agatcccctg gggttctttt ctaactgctt tctcattccc 35341 accccccccg gaactgggtc tcatttgcaa tccacgaggc acaggcttta tacatgatgg 35401 aaacgtcttc caaaaacgac acaaatgcac aaatgttacc cttgcagcct atagtattca 35461 tccccaaata ctattgcaaa atattatgct attttctcta tcaagcagta ttggcggcag 35521 gtgccaacca ccacgtgggc cgcaggaatg cagctgctgg ataaaatgca ggatgcccag 35581 ttaaatatga atttcagatg aacaataaat aattgcttaa tataagtatg tcccacgtag 35641 tacatggatc atatttatac gacataatta ttcattgttt atctcaaatt caaatttaat 35701 tgggagtcct gtgtttttat ccgtaaaatc tggcctccct tcacagaaag ggtgactccg 35761 gtgaggtagc aaggaggtac cctggtgccc tgagggcact cagagctggg aagtggtcag 35821 aactgaagct gagcacagag ttctattcca atgtgggcgg taatatgtgt cacatgttct 35881 cacagagaac aaataggctc ttagacctgc ctgttgggaa ggggtctcag aggaagagat 35941 ctatacagta actggaggta atgttccatt tttcacgttt tggttattgc cttcacccta 36001 atacctacat aggcacacga gcttcgagta gcattagtcc aagcttcact gaaaggcaga 36061 gttcagttct attgagttca accaagcgcc atcatgccct ggttctggaa ggtcgagcac 36121 acacacactg cgctgatgta ggccttgccc ctggtgtgat caccccaggt cccagagtga 36181 acctgggtct gcaggtagac ttcatagact actgggccag cagtagactt cattctgggc 36241 agccagagaa ggaagtggag ggcagcagag ggggattttc attgtatcat aggtggcaca 36301 acaagcattt cggacctggg gctccttaag ttcccgaggt cccaacatat taatgttcac 36361 ctcccccaac tcctcacaag gctctccact ctgggtggag ctctccatga gtcggtggtg 36421 acctgcatcc ctggaatttg tattttatgg cttggtggat 
aaacaagagc aactgaagag 36481 cctccaatct ctggctccct tttgggggtc tgcttggcaa ttcaggaagc ccagaaaaaa 36541 atgccggggg atctcaagag caacagcaag ggaagagccc tcgtggcccc agattgatcc 36601 tgttctgcca cttgagagct ggacaacatt gtgcaagcca cttcccttct ctgggtctgt 36661 ttgttcagtg gattggggaa agtaggtgct cctcttgagt gggtttttga gccctcagtg 36721 agcgggtatt gagaacatac gatgcctgag cagtgtctgg catctgttcc ataagtgtgc 36781 cttctcctct ttttccttgt tcatgagccg tggagaaggg tgtgatccac ctgaaagtga 36841 ccctg // LOCUS HUMIFNRF1A 7721 bp DNA PRI 10-NOV-1992 DEFINITION Homo sapiens interferon regulatory factor 1 gene, complete cds. ACCESSION L05072 NID g184648 KEYWORDS interferon regulatory factor 1. SOURCE Homo sapiens Placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7721) AUTHORS Cha,Y., Sims,S.H., Romine,M.F., Kaufmann,M. and Deisseroth,A.B. TITLE Human interferon regulatory factor 1: intron/exon organization JOURNAL DNA Cell Biol. 
11, 605-611 (1992) MEDLINE 93000481 FEATURES Location/Qualifiers source 1..7721 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Placenta" /map="5q23-q31" exon 1..219 /gene="IRF1" /note="putative" /number=1 5'UTR join(1..219,1279..1287) /gene="IRF1" gene join(1..219,1279..1287) /gene="IRF1" intron 220..1278 /gene="IRF1" /number=1 exon 1279..1374 /gene="IRF1" /number=2 CDS join(1288..1374,2738..2837,3630..3806,3916..3965, 4073..4202,4386..4508,5040..5089,6248..6383,6670..6794) /gene="IRF1" /codon_start=1 /product="interferon regulatory factor 1" /db_xref="PID:g184649" /translation="MPITRMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKH GWDINKDACLFRSWAIHTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKG SSAVRVYRMLPPLTKNQRKERKSKSSRDAKSKAKRKSCGDSSPDTFSDGLSSSTLPDD HSSYTVPGYMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVVPDSTSDLYNFQVSPMP STSEATTDEDEEGKLPEDIMKLLEQSEWQPTNVDGKGYLLNEPGVQPTSVYGDFSCKE EPEIDSPGGDIGLSLQRVFTDLKNMDATWLDSLLTPVRLPSIQAIPCAP" intron 1375..2737 /gene="IRF1" /number=2 exon 2738..2837 /gene="IRF1" /number=3 intron 2838..3629 /gene="IRF1" /number=3 exon 3630..3806 /gene="IRF1" /number=4 intron 3807..3915 /gene="IRF1" /number=4 exon 3916..3965 /gene="IRF1" /number=5 intron 3966..4072 /gene="IRF1" /number=5 exon 4073..4202 /gene="IRF1" /number=6 intron 4203..4385 /gene="IRF1" /number=6 exon 4386..4508 /gene="IRF1" /number=7 intron 4509..5039 /gene="IRF1" /number=7 exon 5040..5089 /gene="IRF1" /number=8 intron 5090..6247 /gene="IRF1" /number=8 exon 6248..6383 /gene="IRF1" /number=9 intron 6384..6669 /gene="IRF1" /number=9 exon 6670..7656 /gene="IRF1" /number=10 3'UTR 6795..7656 BASE COUNT 1750 a 1946 c 2253 g 1772 t ORIGIN 1 agagctcgcc actccttagt cgaggcaaga cgtgcgcccg agccccgccg aaccgaggcc 61 acccggagcc gtgcccagtc cacgccggcc gtgcccgcgc cttaagaacc cggcaacctc 121 tgccttcttc cctcttccac tcggagtcgc gctccgccct cactgcagcc cctgcgtcgc 181 cgggaccctc gcgcgcgacc gccgaatcgc tcctgcagca gaggtgagta cgcctttgag 241 gccggggcac cggcggcgtc gaataaaagg cgcgcggggc accaggaagt ggggggtcga 301 aagctccagg 
ctggagactc gccggcgcgc ggcgtgtcgc ccgggcctcc gcgcgggctc 361 cggggggcgc cggaggagct gcgagccgcg ggccgcggcg cggggagggc gggaccgggc 421 gtggaccgcc cacccggacg aggctgccgg cgcccggcag ctttcgcaga tctgcgtgcg 481 cgcagccgcc aggggcctgt aggtggcccg ctatgttcgt cccgcgcatc cacacgccgt 541 gccggggacc gagtgtcagc ccacgcgtgg gcgcccagtg ctcccggctt tcggcggtcc 601 cacgtccggc cccaggcgac aggttttggg ctccctgtgc tggtggcaag ggctgcttac 661 tgcccaggtg gctggaggga atcgtgacta cggagactgc gggaagaggc gccacaggtg 721 ttccttgggc cacttctcca gaggagggga aaccgggccg gaagggttag cgtcctggtc 781 ttagcgttgt gggcgctgtg gctgtcagga aggcgtagaa tggattcagg ggcgcgggag 841 ggggctgttc agggtgacgg ctagcccttt gctagctagt ggttacaact caagtcaagg 901 gaatttcttc ttggcatcaa gcaaaagaag tccctccctt cccaaaggat ttgaattttg 961 agcgaaaagt tctgaaatta gggtatctgt gcattttgtc tcttttcctg catatgaatc 1021 ctgaagccat cacttgcatg cctgtctcct ccagagactg gctgggaggg gctgaaggaa 1081 ggggcaaaag catttttgcc taagatgctg aaaaaatttg gagagcagtt ttattccagc 1141 gcagctcccc tccgcactga gtgtagtacc tagcagctgg ctgaggtgag gggagggtaa 1201 ctaagtgacc tcgggtgggc aggtcactgc ccaggtactg ttcaacagat tccagactgg 1261 agcctctgtg ttctctttac agccaacatg cccatcactc ggatgcgcat gagaccctgg 1321 ctagagatgc agattaattc caaccaaatc ccggggctca tctggattaa taaagtgagt 1381 gtaactcttt gggttttcct gccactgttt taacccatgt acttctggag ggaccaaagc 1441 ttcagatgca gctcaaaaag ggaagtgata acgggacaag caggtgtttc tcccagtggg 1501 tcctgcatgc agggagtgtg cacggcccag cctgggcctc acttgcatga ctcctgcctt 1561 cttcccttct tgaggtaggg cacccacctg aaggcacttc cagtttccag cagcaagact 1621 ttccagcatc tgcagagctg gagttctgct ctcctctaag cgagaccctt acaaacatac 1681 acagcactct gcagggctcc aatcgaacaa atagaagact gagaagtgga tgctgctggg 1741 cagaaacgtg ctggcttagc agaggacaaa cgagttaatc ttgcaccagt cactctggcc 1801 caagaagcct atagctggtg cacttggggc aacatagacc ctatagactt agtagcaatg 1861 atagtatcat aataatagct aatgcttact gaacactccc tgtgtgcctg gcacctgcta 1921 agtatgttat ttacattgtg tcatttaatc ctcgcagtag tcctgtgggt tagatcttac 1981 taatgtcatc attttcagat aagtaaacag 
aggcactgag aggtagatca taagatcaca 2041 caagaagtga tgaagccaag atttgaactt gaacggctcg actcagaaat ctttactgtt 2101 aaccataagt gatataataa cagtaagacc ttagacttca tatttgtcac tgtgtcccta 2161 cacatcctct ggtttttaat cctcaaaatt ttgttggata tgttttctca tttccgagaa 2221 gagaaaactg aggggcaaag agatacagtg acaatgccag ggttacacag tgttcaccat 2281 ccaagtctag cccagagctc cctcagtggt atgaccagga ccccctgtgt aagagcccat 2341 gctcccaggt gtcctgagga gtcctttcta atggaagaag ttcttacttc catgtgggtg 2401 cttacaagcc agagagaaac atcccagagc ttcaaaacca gggctttggg ggagggtgcc 2461 ctgtgtgggt cctagcacat gtgtaacagg cagagggagg tctttgtgag ctaataatgc 2521 tgcagctcat ccaaactagg tgtcctcctg agagatccag agtggtctgt ttaagccagc 2581 ctcaagatgg gtgtccaagc cagatgtcag gggaaaaaag gggaagtcag ccttttctca 2641 gacctgtctg gctgggcagg cctgggtctc agactcagcc ccaaagtctg tggtctctga 2701 cctgacacag ccttatgtgt atgtgtgtat tgttcaggag gagatgatct tccagatccc 2761 atggaagcat gctgccaagc atggctggga catcaacaag gatgcctgtt tgttccggag 2821 ctgggccatt cacacaggtg tgtgcctggg actcaggcct aggaagccca gggtagagac 2881 aagaggaggc actcacgtta acacagaggc tcttcactgg ggtccctgag ctccctgaga 2941 caacatgcag aattactggg aagaggggct ggtggcagac ttgtgtttct ggagaagaga 3001 gtcgatcatc tcagcaaatt ctcaaaggga aaagccaaga tcttagaaag tgtgtggctt 3061 cagggggttt gtggctagat gaaagttctc cctggcaaaa gcatctgtga aaagcagctg 3121 taagccaggg cactgaaaga gacccaggtc tgcctttttc ttcgtgttga ccaaggccct 3181 tggtccaagc ctcatgtggt tggtggcctc cttatccttg agagatggag ctctaggccc 3241 atctcagaac agtcagccca cccatttagt aactgttctc tgctgcccag tctgtgccca 3301 ctctaccctc tggctgctga tagcccaagg aggaagactg ggcatagtct gagacacaga 3361 tagtacactt tggggatatg gggactctag tgcttctggc tgggcccttc actgaggccg 3421 ctagatgtgt ttaagccaag cctgggcatt tgagaaggcc cagggcctag gacctgcaga 3481 gtgtcaccgg gagtacctgc tggtttgacc actgtggctc tctggtagca taagaggtca 3541 ggggtacctt gccttcctcc ttcagccagg ggcagctgag gatccctacc catggccctg 3601 acgatcctct tttttctcct gccctctagg ccgatacaaa gcaggggaaa aggagccaga 3661 tcccaagacg tggaaggcca actttcgctg tgccatgaac 
tccctgccag atatcgagga 3721 ggtgaaagac cagagcagga acaagggcag ctcagctgtg cgagtgtacc ggatgcttcc 3781 acctctcacc aagaaccaga gaaaaggtat ccaaggactc tgggtccttg ggaagccctc 3841 agggagggag ggtagaagga ggtcagctgg ggctggagag cctgcaccaa ggctgacagc 3901 ccgtctgccc cacagaaaga aagtcgaagt ccagccgaga tgctaagagc aaggccaaga 3961 ggaaggtgag tgtggtccta agcagccagg cctttggtca cctgtgggcc agggtgagca 4021 gtggaagaaa tgctaaggtg gcctgggcct aagctgcttt ctccctcgac agtcatgtgg 4081 ggattccagc cctgatacct tctctgatgg actcagcagc tccactctgc ctgatgacca 4141 cagcagctac acagttccag gctacatgca ggacttggag gtggagcagg ccctgactcc 4201 aggtgagctg gtccaggtct ggcaggagac cccacaggtc agtgggatga ctctttctct 4261 tggaggcatg gtgctggcac atggtggccc attagtgcag gctgcagggt tggtcggagg 4321 gcgctcgatg tcttgcaaac taagaaagca cacaaccttg acctgtggct tctgctgttc 4381 cccagcactg tcgccatgtg ctgtcagcag cactctcccc gactggcaca tcccagtgga 4441 agttgtgccg gacagcacca gtgatctgta caacttccag gtgtcaccca tgccctccac 4501 ctctgaaggt tggtgctcct ggggcctggc ctgctgcttg actgtctggg tctgtgaagg 4561 gcttcctgag agagaaaaga tgatcagaac tccacctggc actgaattga ttgagttggg 4621 cattgcagtc ttagccacca tagggggagg caagcgacgg ggacactagg aaggcagttc 4681 agagtgggct gcagtacagt gggggctggt gagaggaggg aagggggcca ggggtgcatt 4741 ttgggtgtgc tggttctcct tcctcctctg tagcccagca tcgtggaggg tgaggaagga 4801 agtagggtag gggtgggaag cggcgtggct tcagggtttg agaggctgag tcaccaggcc 4861 agggtcctgt tctggaatct ctatggcaga taggtccacc gggagggtgt gtgtgtgtgt 4921 gtgtgtgtca gagagacaga gagacagaga aagggcaggg ggatctggtg ggctggaact 4981 ggaactgcag ggtgagtgtg ctgactgcca gccaacctct ctgctttccc catccacagc 5041 tacaacagat gaggatgagg aagggaaatt acctgaggac atcatgaagg taaagcccct 5101 tcctacctgg gcactcttga agtgaccgtt tctcagtgag gagagagaac cagtgaagcg 5161 ttccaaatca gaggatgggt agctgctgtt gtcacctggc tgcttgcatt gtcccacaag 5221 tgccacattc acgtggcttg actggtggga aacccaccat gggaaggcag gtgggaggcc 5281 tggcctctga cagcgtcctg aagcaagcct tggggcatca gacagctctg tgagtcaggc 5341 actatcagcg atgggtccct ggcctgcatc ctctgcccca acatgcccca 
gccctgctag 5401 ttcgggaaat gcacatcagg cttcaataat cagcctttag gatccgttaa tatgatgatg 5461 gctttataga aaaagttagc aaattatcct ccaggttttt ttttctgctt cagttttgaa 5521 agtgaatata gtttttgcag ccgggggcag tgctcatgcc tgtaatccca gcactttgga 5581 aggcgaaggt gggtggatca cctgaggtca ggagtttgag accagcctga ctaacatggt 5641 gaaacccatc tctaccaaaa atataaaaat tagctgggcc tggtgcgcat gcctgtaatc 5701 ccagctactc tgaaggctga ggcaggagaa tcgcttgaac ctgagaggcg gaggttgcag 5761 tgagctgaga ttgtgtcatt gcactccagc ctgggcaaca agagcaaaac tccatttcaa 5821 aaaaaagttt ttgcagtagt tgtacgccag ctgttccatt agcccaaaaa attgagacat 5881 ggatgtcgtt ccttatctct agcttttcta gtcatctttt cttgatttat tatgctaacc 5941 tttgttttaa gccacattcc ctcttactat gtccttacac agttgagagg gaagtcgtgg 6001 agatgctata ccagagagtg ggtgtgagag gggtgggaaa atgaattgag gaccagtgcc 6061 aacatgcatt tctgcctcct ctcccgggcc cttgtcctga ctgcagtgca cttctgcatc 6121 ctatctgaga ttgtgaaaat ggccaagggt gtgatactgg ctgagaggag ctggctcatt 6181 gaggcagggc cacagggtga gtctgcactg gaagggagtt gatagcctct tgctcttctg 6241 tccccagctc ttggagcagt cggagtggca gccaacaaac gtggatggga aggggtacct 6301 actcaatgaa cctggagtcc agcccacctc tgtctatgga gactttagct gtaaggagga 6361 gccagaaatt gacagcccag ggggtaagaa ggccctggat ccttatggct tcttagatga 6421 gggagaacca cgtagggatg gagaaagctt gggggcaggg ccagggagca gggcggtaaa 6481 gcatctgggg tactgacaca ttgtgaatta gctacggctg ccatgcctta aggtttgcct 6541 gaagctgagt ggatgtttac tgctgtgctg ggaagagcag aggccatgtc tatggccttc 6601 aggggtaggg ggaagcacac ctgatgccac cgtcccctac cctcatacaa ccttcttcac 6661 atcttctagg ggatattggg ctgagtctac agcgtgtctt cacagatctg aagaacatgg 6721 atgccacctg gctggacagc ctgctgaccc cagtccggtt gccctccatc caggccattc 6781 cctgtgcacc gtagcagggc ccctgggccc ctcttattcc tctaggcaag caggacctgg 6841 catcatggtg gatatggtgc agagaagctg gacttctgtg ggcccctcaa cagccaagtg 6901 tgaccccact gccaagtggg gatggggcct ccctccttgg gtcattgacc tctcagggcc 6961 tggcaggcca gtgtctgggt ttttcttgtg gtgtaaagct ggccctgcct cctgggaaga 7021 tgaggttctg agaccagtgt atcaggtcag ggacttggac aggagtcagt gtctggcttt 
7081 ttcctctgag cccagctgcc tggagagggt ctcgctgtca ctggctggct cctaggggaa 7141 cagaccagtg accccagaaa agcataacac caatcccagg gctggctctg cactaagaga 7201 aaattgcact aaatgaatct cgttcccaaa gaactacccc cttttcagct gagccctggg 7261 gactgttcca aagccagtga aatgtgaagg aaagtggggt ccttcggggc gatgctccct 7321 cagcctcaga ggagctctac cctgctccct gctttggctg aggggcttgg gaaaaaaact 7381 tggcactttt tcgtgtggat cttgccacat ttctgatcag aggtgtacac taacatttcc 7441 cccgagctct tggcctttgc atttatttat acagtgcctt gctcggcgcc caccaccccc 7501 tcaagcccca gcagccctca acaggcccag ggagggaagt gtgagcgcct tggtatgact 7561 taaaattgga aatgtcatct aaccattaag tcatgtgtga acacatagga cgtgtgtaaa 7621 tatgtacatt tgtcttttta taaaaagtaa attgtttata aggggtgtgg cctttttagg 7681 aagagaaatt taacttgtag gaatgatttt actttttatg g // LOCUS HUMIGERA 7659 bp DNA PRI 26-JAN-1994 DEFINITION Homo sapiens immunoglobulin receptor alpha chain gene, complete cds. ACCESSION L14075 NID g410211 KEYWORDS Ig receptor alpha chain; immunoglobulin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7659) AUTHORS Pang,J., Taylor,G.R., Munroe,D.G., Ishaque,A., Fung-Leung,W.P., Lau,C.Y., Liu,F.T. and Zhou,L. TITLE Characterization of the gene for the human high affinity IgE receptor (Fc epsilon RI) alpha-chain JOURNAL J. Immunol. 151 (11), 6166-6174 (1993) MEDLINE 94065170 FEATURES Location/Qualifiers source 1..7659 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="Clontech Cat. 
#Li067j" precursor_RNA 1258..7146 exon 1258..1341 /number=1 CDS join(1287..1341,1776..1796,2850..3104,4910..5167, 6670..6854) /codon_start=1 /product="immunoglobulin receptor alpha chain" /db_xref="PID:g410212" /translation="MAPAMESPTLLCVALLFFAPDGVLAVPQKPKVSLNPPWNRIFKG ENVTLTCNGNNFFEVSSTKWFHNGSLSEETNSSLNIVNAKFEDSGEYKCQHQQVNESE PVYLEVFSDWLLLQASAEVVMEGQPLFLRCHGWRNWDVYKVIYYKDGEALKYWYENHN ISITNATVEDSGTYYCTGKVWQLDYESEPLNITVIKAPREKYWLQFFIPLLVVILFAV DTGLFISTQQQVTFLLKIKRTRKGFRLLNPHPKPNPKNN" intron 1342..1775 /number=1 exon 1776..1796 /number=2 intron 1797..2849 /number=2 exon 2850..3104 /number=3 intron 3105..4909 /number=3 exon 4910..5167 /number=4 intron 5168..6669 /number=4 exon 6670..7146 /number=5 polyA_signal 7044..7049 polyA_signal 7104..7109 polyA_signal 7109..7114 BASE COUNT 2270 a 1451 c 1538 g 2400 t ORIGIN 1 gatcttcatg tggaatgact ggtttcattc aatagactta attcagcagt ctgtggggaa 61 gagcaaggta tgatagaatg gttcctcaag tgcttcagat gtgaagtggg tttaaatata 121 ctgtccctgt cttcttcaga gttttggtaa agataaaata ggacactcat ttaaaagcaa 181 tctttgcaaa tgacaagcca ctatagacat taatagagtt ttcatttcca gtattatcat 241 taatatcaga tcctggaaga aggttgagcc ttgacctaga gcaaaaaaac agaagaatta 301 gtaaaggaat cctggagaaa gcccctgctg tgtatttaaa ggagaaaggg agatcatgtt 361 gggaaattat aatattaaaa gtaaacaaaa gctaggaagt aaaataaaat aaattatatg 421 gcctagatcc ccataagtaa tggtttaact tctgccttcc tgtgttctga gccagattag 481 ggcacagtag agaaagagga gtctctgaaa atgtttccaa tttcgctggt cagacagcgg 541 atcatcagtg aatcagatga aaatttgtgg atttatgcac taactgatca gcaggaaatt 601 aaacaagaaa agcgttggta gctctggtga atcccaaaag aatttggcag ttgctagcca 661 tgctcctgaa tatgtataaa cagtacatca tatgactaag agtttgactt aggggttaga 721 ttttatgtgt ttgaacccca aattagttat ttaatagttg gcaccccaaa acaagttact 781 taacctcact aagattcagt tttcctgttt ataaaatgta gatagtgata gtatgtactt 841 tataggatta ttgtgaaaaa taaatgaaat atcagattta tttaggataa cacctggcat 901 atgtttggta ttcagtaatt agttgctgct gttttattct gctctccctt gcatcccact 961 tttctaagtt gtaaactaaa tagttgtaca cagattgaca gattaagaaa 
ggcttgtgat 1021 tgtgctagac ctatgcctct ctctcaccag attccaggtg tatatgtgga ggtgggatag 1081 ggagtggagt aagtgggtaa atattaaatt gcccagttgg gcaccatcct gaatattatc 1141 tctaaagaaa gaagcaaaac caggcacagc tgatgggtta accagatatg atacagaaaa 1201 catttccttc tgctttttgg ttttaagcct atatttgaag ccttagatct ctccagcaca 1261 gtaagcacca ggagtccatg aagaagatgg ctcctgccat ggaatcccct actctactgt 1321 gtgtagcctt actgttcttc ggtaagtaga gattcaatta cccctcccag ggaggcccaa 1381 atgaatttgg ggagcagctg gggtaggaac ctttactgtg ggtggtgact ttttctagga 1441 catgtgcaaa ctattgggca tttcccaggg actctgtagt ggagccaagc tagaaagcag 1501 aggcaagtgg gctgagcaac acctaaggag gaagccagac tgaaagcttg gttccttgca 1561 tttgctctgg catcttccag agtgcaaatt tcctaccaag gtaatgaggg tagaggagag 1621 aaagaagctc tttcttcccc tgattctcat tcctgaaaag acggttggtc cttaaaattc 1681 catggatgta gatcttatcc ccacacccag attctagtcc tctggagata aagaagactg 1741 ctggacacta atgtatcctc tctggacttt tgcagctcca gatggcgtgt tagcaggtga 1801 gtcctctgtt cttgttccct tggtgtatca acatgtctgg gcattgcttt cctctcacta 1861 ttttcttcgt cccatcactt ctgctttcta atgagcatga atctgttcct tggccagact 1921 actttccctc tccaccttgc cttgtctttc tttttttccc tgattcattg cattctctca 1981 agtcattctc tcctctgttt tagtcaataa ccatgtctgt tgcacatata catgtctcat 2041 tctctctcct agacactttg gcatgatctc gctcaataat tacattatta ttattattgc 2101 cattttataa ttgaggatgc tgaaactcag tgattttctg gtggttacat ggctaaggaa 2161 ctggatttca acgtaagttc cttggatcta agtccagttc tcttctgact atatcaccct 2221 tttgttatca ccatgtatct acttctttgg tctctgttca aatttgcact acatcccctt 2281 gttccaggaa gccattcaag actgactttc ttagtgcctc tcactacttt ctggaactga 2341 catatgtttt tcactctgta tatacttaca attaaatagt cataaatatt cagagcttgg 2401 agaaacctta tatttcatcc agtccagtaa atttatccat ccataattca ctcattcatt 2461 cacataataa atatttaatg taacaatggt tgaacatggc agacagtgtt tctacctcaa 2521 aagagattgc agtcctcatt tacagatact gaattgaaat taacagaagt agagtgagtc 2581 agctcaaatc acatagtgaa ttggtttctt tgtttttaaa tctcctgcat atgtgtcctg 2641 tctttctccc tgtgttgggc gttccctggg gcaccaatac taatttctcc ttcccctaga 
2701 aatcaaaaca gggtcttatc accaacagaa taaggacagg ttgaccactg attgtcagaa 2761 tattgcttcg tttgtacttt taagcctaga cagttttcaa tgactttttt tctctctaca 2821 tgtcttttca tatttttatc ttcttgaagt ccctcagaaa cctaaggtct ccttgaaccc 2881 tccatggaat agaatattta aaggagagaa tgtgactctt acatgtaatg ggaacaattt 2941 ctttgaagtc agttccacca aatggttcca caatggcagc ctttcagaag agacaaattc 3001 aagtttgaat attgtgaatg ccaaatttga agacagtgga gaatacaaat gtcagcacca 3061 acaagttaat gagagtgaac ctgtgtacct ggaagtcttc agtggtaagt tccagggata 3121 tggaaataca gatctctcat gtgagggatg gctcatctga agatgggaaa aaacaggtta 3181 ttccaagggt taggacacca gagtgggatt caaggcctct catttttaag acccctgcat 3241 tggctgggca cagtggctca cgcctgtaat cccagcactt tgggaggctg aggcaggtgg 3301 atcacgaggt caggagatcg agaccatccg gctaacatgg tgaaacccca tctctgctaa 3361 aaaatatata tatataaaat tagccgggcg tagtggtggg cacctgtagt cccaggtact 3421 cgggaggctg aggcaggaga atggtgtgaa cccaggaggt ggaggttgca gtgagctgag 3481 atcacgccac tgccctccag cctgggctac agagcaagac tccgtctcaa aaaataaata 3541 aataaataaa aaagacccct gcatctcttt tcttctaccc ccttcccttt tgattacttg 3601 tatgccttct ttcaatattc tagtcatctc tcaatattat tcctccaccc tattttcctc 3661 tatcttttct gcctagattc aggtatatat tatgtggtca aacagcatga catatatgtg 3721 aacatttcaa agagctgtgt atctggaata ggatcaaaag gtttgactta aagttttgct 3781 ctgcataatc catatggcag gacctgaata ttaggttgta ctcttcgtta tgaaacatat 3841 ctgggtacat ttccttatgt cctctgttgt tacttaagaa cacatatttc atgcttgttt 3901 catttttatc actcctactg ccaacaaata gcatagcatg cttaggcaca tgtggcttaa 3961 ttagcaaatg ttgaataaac aaattaatga ttttgaatag tgaccaatag gtctctttta 4021 tactctatat ttttctcttg agtgaaaaaa aatgtttcaa cctccatatg taaattccaa 4081 acacaaacta aagcaatgta gaatagcttc tttattccct ggagtaggtt ctagagaagt 4141 cctaaaggat tggtcctaaa ttaattatgc ttattatgct agcgatattt cctttcaaaa 4201 ttctccttta atgaatgctt tttaattttt acaaaagcat taaccataga atgtgattct 4261 tgtctttcac tgactcatta gtgacaaata tttgttgagt acctaccaac tcctaagtat 4321 tgctaccaac tcctaaatac tgtgttgggc attcagaata gaatgtagaa ctagacaggg 4381 
tccctgactt cttggagcac agagcagtat gggaagagga cattaaataa agaattacat 4441 aagtaattaa tttaaattat acatgttttg aagaagtttt tttttgacaa ctataattaa 4501 cactagaact gggaagtttc tataaggtaa gagaggacaa aatagacact ctcctaagct 4561 aaaattccca agaaagactg tttattttcc cctaactaac tagaactagc aacagaagat 4621 ctgaaaggaa ttctggcttt caagtgttcc atgtatggac tcatcaggga ggtccgagag 4681 gctttgtggc cccagactga cttttcagga ggggaaagga tttatcaata cacaagacag 4741 gctctaagca ttattttgtg ccctttaaaa atccacttta tgagccaaaa agtgagttaa 4801 tgataattca tagtttctga cacatgctct atgcgtggct ctcttttctc tattcattct 4861 ctctctcttc atttattgtt aaataaataa tgtaatgaat gttcttcaga ctggctgctc 4921 cttcaggcct ctgctgaggt ggtgatggag ggccagcccc tcttcctcag gtgccatggt 4981 tggaggaact gggatgtgta caaggtgatc tattataagg atggtgaagc tctcaagtac 5041 tggtatgaga accacaacat ctccattaca aatgccacag ttgaagacag tggaacctac 5101 tactgtacgg gcaaagtgtg gcagctggac tatgagtctg agcccctcaa cattactgta 5161 ataaaaggtg agttggtaaa ggaaaggaaa agcatccata gcaggggaag gaagagagaa 5221 cttctgagcc tgagcagttg cagcttgtag aaggggggca cctgtgatac actggaaagc 5281 ctaccagact tgcaatgagg agacctgggt gatagtatat atctcaatct ctgtttcaaa 5341 gccttgactt gttaaatggt gatagtaata cctgcttgca ctatgaaatt tttatgaaga 5401 ttaatgtggt aatatttgtg aaatgacttt gtaaactgtt aagcactacc caagcataac 5461 agattgtgat tactattttg atctcaaagt catctgttgc tcctggggga acacttatat 5521 ttatcaaatt gaaaaaaagt ttcaaagttg aatgaagaaa ggatataaag agcttgagga 5581 gcccattcca gcttaggagg gctgggaaag gaaaccagca agtcagtaag ctgtgtgcct 5641 gtgtattgag ggaggaggga atggacttga tatggagagg gtagggaggt ggactgcctc 5701 tatggcctgt aagaaaaact gctctctcca aactctttat aagagaggga gcctgtgaag 5761 tattcacttt tgaaggagaa agttagactt ttccttcaca cactttgtac ataataatgt 5821 ttaaaaaagc atgaggtcaa aatacataat taagtcctag cagttctctg ttaactaatt 5881 tgagactgaa gtgctatgta cttgtctcta ggcttccagt atcttcatct gtaaaacaga 5941 atatttggtc tagattccat tagaatcatt tgataactta aaaaatatat tgatgctcat 6001 gtctcatttc ttgagattct gatttaattg gtttggggtg cagcctgggt atacgtattt 6061 ttcataggtc 
tttcacataa tggtaatggg tagccaatat tgagaatcac ttgtctaggt 6121 gatctttaaa tgatttctgg atgtaatatt ctgaggctct ataatttgag actaatcaca 6181 aaaatcggta cagtttataa acagactaac agaaccacaa aataatagaa ttggaaggca 6241 atttaactag tgcaatttct tcattttgcc taacaggcat gtaagaaatg atgattgatt 6301 gagtaatagg cattgatgac ccctgtcctc actttgtccc ctttccaccc cttaattata 6361 tgtgaattct ggtcttgtca tttcgaataa ggggtttatc tttcctattg tcttcccctc 6421 tgggcacggc acactggcta ctggagttaa gaggaaatgc ttaggactcc ctgtggctcc 6481 agggagcacc aacagagcaa ctcaacctag tgttaatctg agtgttttct ctgtgcttct 6541 ggatgccaca tcacgctaaa aatgaaggac aaagcttggt ctttctctta gggaggatga 6601 aactctgaac ctcatttttc agttcccaag atgaattatg tttctcattg catctgtgtt 6661 ccactacagc tccgcgtgag aagtactggc tacaattttt tatcccattg ttggtggtga 6721 ttctgtttgc tgtggacaca ggattattta tctcaactca gcagcaggtc acatttctct 6781 tgaagattaa gagaaccagg aaaggcttca gacttctgaa cccacatcct aagccaaacc 6841 ccaaaaacaa ctgatataat tactcaagaa atatttgcaa cattagtttt tttccagcat 6901 cagcaattgc tactcaattg tcaaacacag cttgcaatat acatagaaac gtctgtgctc 6961 aaggatttat agaaatgctt cattaaactg agtgaaactg gttaagtggc atgtaatagt 7021 aagtgctcaa ttaacattgg ttgaataaat gagagaatga atagattcat ttattagcat 7081 ttgtaaaaga gatgttcaat ttcaataaaa taaatataaa accatgtaac agaatgcttc 7141 tgagtattca aggcttgcta gtttgtttgt ttgttttcta ctaaaggcaa ggaccatgaa 7201 gttctagatt ggaaatgtcc tctcttgact attgcaagtg cgatctagga atgaaaagac 7261 ataggaggat gccagtgagg tggatcattt ttatgcttct tcttcagctt actaaatatg 7321 aactttcagt tcttggcaga atcagggaca gtctcaagac ataggactct caggatgaag 7381 tagagtccag gattcctctg tgattgtttt gcccctccca aatttatatc ttgaacttat 7441 gtcttgtatc tttatacagc acctgaacca agcattttgg agaaattcca gctaataata 7501 ataaccaaaa ccttcggctc tgaaaacagt ccaggactga ataagatctt gggcaaaaga 7561 actagacagt tttggtttat tttccctttc attttatgtc ttcatcatag tcattggagg 7621 ctcattcttc ttgtcatgga gtaaatggga ttaaagttc // LOCUS HUMIL1B 7824 bp DNA PRI 09-AUG-1995 DEFINITION Human interleukin 1-beta (IL1B) gene, complete cds. 
ACCESSION M15840 NID g186281 KEYWORDS Alu repeat; interleukin 1-beta. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7824) AUTHORS Bensi,G., Raugei,G., Palla,E., Carinci,V., Tornese Buonamassa,D. and Melli,M. TITLE Human interleukin-1 beta gene JOURNAL Gene 52 (1), 95-101 (1987) MEDLINE 87248099 REFERENCE 2 (bases 1 to 7824) AUTHORS Bensi,G. TITLE Direct Submission JOURNAL Submitted (26-MAY-1987) G. Bensi, Sclavo Research Center, Siena, Italy FEATURES Location/Qualifiers source 1..7824 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Lambda-hilb[4,8]" /map="2q13-q21" prim_transcript 374..7380 /gene="IL1B" /note="IL1b mRNA and introns; G00-120-094" gene 374..7380 /gene="IL1B" intron 446..908 /gene="IL1B" /note="IL1b intron A; G00-120-094" CDS join(924..970,1536..1587,3576..3777,4323..4487,5723..5853, 6570..6782) /gene="IL1B" /codon_start=1 /db_xref="GDB:G00-120-094" /product="interleukin 1-beta" /db_xref="PID:g386816" /translation="MAEVPELASEMMAYYSGNEDDLFFEADGPKQMKCSFQDLDLCPL DGGIQLRISDHHYSKGFRQAASVVVAMDKLRKMLVPCPQTFQENDLSTFFPFIFEEEP IFFDTWDNEAYVHDAPVRSLNCTLRDSQQKSLVMSGPYELKALHLQGQDMEQQVVFSM SFVQGEESNDKIPVALGLKEKNLYLSCVLKDDKPTLQLESVDPKNYPKKKMEKRFVFN KIEINNKLEFESAQFPNWYISTSQAENMPVFLGGTKGGQDITDFTMQFVSS" exon <924..970 /gene="IL1B" /note="interleukin-1 beta, (first expressed exon); G00-120-094" /number=2 intron 971..1535 /gene="IL1B" /note="IL1b intron B; G00-120-094" exon 1536..1587 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=3 intron 1588..3575 /gene="IL1B" /note="IL1b intron C; G00-120-094" repeat_region 3110..3429 /note="Alu repeat copy A; G00-120-094" exon 3576..3777 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=4 intron 3778..4322 /gene="IL1B" /note="IL1b intron D; G00-120-094" exon 4323..4487 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=5 intron 4488..5722 /gene="IL1B" 
/note="IL1b intron E; G00-120-094" exon 5723..5853 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=6 intron 5854..6569 /gene="IL1B" /note="IL1b intron F; G00-120-094" exon 6570..>6782 /gene="IL1B" /note="interleukin-1 beta; G00-120-094" /number=7 repeat_region 7280..7379 /note="Alu repeat copy B; G00-120-094" BASE COUNT 2099 a 1905 c 1624 g 2195 t 1 others ORIGIN 242 bp upstream of HindIII site; chromosome 2q13-q21. 1 aaagtatgtg catgtataaa tctgtgtgtc ttccactttg tcccacatat actaaattta 61 aacattcttc taacgtggga aaatccagta ttttaatgtg gacatcaact gcacaacgat 121 tgtcaggaaa acaatgcata tttgcactgg tgatacattt gcaaaatctg tcatagtttg 181 ctactccttg cccttccatg aaccagagaa ttatctcagt ttattagtcc cctcccctaa 241 gaagcttcca ccaatactct tttccccttt cctttaactt gattgtgaaa tcaggtattc 301 aacagagaaa tttctcagcc tcctacttct gcttttgaaa gccataaaaa cagcgaggga 361 gaaactggca gataccaaac ctcttcgagg cacaaggcac aacaggctgc tctgggattc 421 tcttcagcca atcttcattg ctcaagtatg actttaatct tccttacaac taggtgctaa 481 gggagtctct ctgtctctct gcctctttgt gtgtatgcat attctctctc tctctctctt 541 tctttctctg tctctccctc tccttccctc tctgcctccc tctctcagct ttttgcaaaa 601 agccaggtgt aatataatgc ttatgactcg ggaaatattc tgggaatgga tactgcttat 661 ctaacagctg acaccctaaa ggttagtgtc aaagcctctg ctccagctct cctagcctaa 721 tacattgcta gttggggttt ggtttagcaa atgcttttct ctagacccaa aggacttctc 781 tttcacacat tcattcattt actcagagat catttctttg catgactgcc atgcactgga 841 tgctgagaga aatcacacat gaacgtagcc gtcatgggga agtcactcat tttctccttt 901 ttacacaggt gtctgaagca gccatggcag aagtacctga gctcgccagt gaaatgatgg 961 cttattacag gtcagtggag acgctgagac cagtaacatg agcaggtctc ctctttcaag 1021 agtagagtgt tatctgtgct tggagaccag atttttcccc taaattgcct ctttcagtgg 1081 caaacagggt gccaagtaaa tctgatttaa agactacttt cccattacaa gtccctccag 1141 ccttgggacc tggaggctat ccagatgtgt tgttgcaagg gcttcctgca gaggcaaatg 1201 gggagaaaag actccaagcc cacaatacaa ggaatccctt tgcaaagtgt ggcttggagg 1261 gagagggaga gctcagattt tagctgactc tgctgggcta gaggttaggc ctcaagatcc 1321 aacagggagc 
ncccagggtg cccacctgcc aggcctagaa tctgccttct ggactgttct 1381 gcgcatatca ctgtgaaact tgccaggtgt ttcaggcagc tttgagaggc aggctgtttg 1441 cagtttctta tgaacagtca agtcttgtac acagggaagg aaaaataaac ctgtttagaa 1501 gacataattg agacatgtcc ctgtttttat tacagtggca atgaggatga cttgttcttt 1561 gaagctgatg gccctaaaca gatgaaggta agactatggg tttaactccc aacccaagga 1621 agggctctaa cacagggaaa gctcaaagaa gggagttctg ggccactttg atgccatggt 1681 attttgtttt agaaagactt taacctcttc cagtgagaca caggctgcac cacttgctga 1741 cctggccact tggtcatcat atcaccacag tcactcacta acgttggtgg tggtggccac 1801 acttggtggt gacaggggag gagtagtgat aatgtttccc atttcatagt aggaagacaa 1861 ccaagtcttc aacataaatt tgattatcct tttaagagat ggattcagcc tatgccaatc 1921 acttgagtta aactctgaaa ccaagagatg atcttgagaa ctaacatatg tctacccctt 1981 ttgagtagaa tagttttttg ctacctgggg tgaagcttat aacaacaaga catagatgat 2041 ataaacaaaa agatcaattg agacttgaaa gaaaaccatt cacttgctgt ttgaccttga 2101 caagtcattt tacccgcttt ggacctcatc tgaaaaataa agggctgagc tggatgatct 2161 ctgagattcc agcatcctgc aacctccagt tctgaaatat tttcagttgt agctaagggc 2221 atttgggcag caaatggtca tttttcagac tcatccttac aaagagccat gttatattcc 2281 tgctgtccct tctgttttat atgatggctc agtagccttc ctagtggccc agccatcagc 2341 ctagctaggt cagttgtgca ggttggaggc agccactttt ctctggcttt attttattcc 2401 agtttgtgat agcctcccct agcctcataa tccagtcctc aatcttgtta aaaacatatt 2461 tctttagaag ttttaagact ggcataactt gttggctgca gctgtgggag gagcccattg 2521 gcttgtctgc ctggcctttg cccccattgc ctcttccagc agcttggcgc tgctccaggc 2581 aggaaattct ctcctgctca actttctttt gtgcacttac aggtctcttt aactgtcttt 2641 caagcctttg aaccattatc actgccttaa ggcaacctca gtgaagcctt aatacggagc 2701 ttctctgaat aagaggaaag tggtaacatt tcacaaaaag tactctcaca ggatttgcag 2761 aatgcctatg agacagtgtt atgaaaaagg aaaaaaaaga acagtgtaga aaaattgaat 2821 acttgctgag tgagcatagg tgaatggaaa atgttatggt catctgcatg aaaaagcaaa 2881 tcatagtgtg acagcattag ggatacaaaa agatatagag aaggtataca tgtatggtgt 2941 aggtggggca tgtacaaaaa agatgaacaa agtagaaatg ggatttattc taaagaatag 3001 cctgtaaggt gtcagaaagc 
ccacattcta gtcttgagtc tgtttctaac ctgctgtgtg 3061 cccttgagta cacacttaac ctcttgagct tcagagaggg ataatctttt tattttattt 3121 tattttattt tgttttgttt tgttttgttt tgttttatga gacagagtct cactctgttg 3181 cccaggctgg agtgcagtgg tacaatcttg gcttactgca tcctccacct cctgagttca 3241 agcgattctc cttcctcagt ctcctgaata gctaggatta caggtgcacc ccaccacacc 3301 cagctaattt ttgtattttt agtagagaag gggtttcgcc atgttggcca ggctggtttt 3361 gaagtcctga cctaaatgat tcatccacct cggcttccca aagtgctggg attacaggca 3421 tgagccacca cgcctggccc agagagggat gatctttaga agctcgggat tctttcaagc 3481 cctttcctcc tctctgagct ttctactctc tgatgtcaaa gcatggttcc tggcaggacc 3541 acctcaccag gctccctccc tcgctctctc cgcagtgctc cttccaggac ctggacctct 3601 gccctctgga tggcggcatc cagctacgaa tctccgacca ccactacagc aagggcttca 3661 ggcaggccgc gtcagttgtt gtggccatgg acaagctgag gaagatgctg gttccctgcc 3721 cacagacctt ccaggagaat gacctgagca ccttctttcc cttcatcttt gaagaaggta 3781 gttagccaag agcaggcagt agatctccac ttgtgtcctc ttggaagtca tcaagcccca 3841 gccaactcaa ttcccccaga gccaaagccc tttaaaggta gaaggcccag cggggagaca 3901 aaacaaagaa ggctggaaac caaagcaatc atctctttag tggaaactat tcttaaagaa 3961 gatcttgatg gctactgaca tttgcaactc cctcactctt tctcaggggc ctttcactta 4021 cattgtcacc agaggttcgt aacctccctg tgggctagtg ttatgaccat caccatttta 4081 cctaagtagc tctgttgctc ggccacagtg agcagtaata gacctgaagc tggaacccat 4141 gtctaatagt gtcaggtcca tgttcttagc caccccactc ccagcttcat ccctactggt 4201 gttgtcatca gactttgacc gtatatgctc agtgtcctcc aagaaatcaa attttgccgc 4261 ctcgcctcac gaggcctgcc cttctgattt tatacctaaa caacatgtgc tccacatttc 4321 agaacctatc ttcttcgaca catgggataa cgaggcttat gtgcacgatg cacctgtacg 4381 atcactgaac tgcacgctcc gggactcaca gcaaaaaagc ttggtgatgt ctggtccata 4441 tgaactgaaa gctctccacc tccagggaca ggatatggag caacaaggta aatggaaaca 4501 tcctggtttc cctgcctggc ctcctggcag cttgctaatt ctccatgttt taaacaaagt 4561 agaaagttaa tttaaggcaa atgatcaaca caagtgaaaa aaaatattaa aaaggaatat 4621 acaaactttg gtcctagaaa tggcacattt gattgcactg gccagtgcat ttgttaacag 4681 gagtgtgacc ctgagaaatt agacggtcaa 
gcactcccag gaccatgtcc acccaagtct 4741 cttgggcata gtgcaatgtc aattcttcca caatatcccc tcatttgatg gacatggcct 4801 aactgcctgt gggttctctc ttcctgttgt tgaggctgaa acaagagtgc tggagcgata 4861 atgtgtccat cccctcccca gtcttccccc cttgccccaa cagtccgtcc cacccaatgc 4921 aggtggttct tgtagggaaa ttttaccgcc cagcaggaac ttatatctct ccgctgtaac 4981 gggcaaaagt ttcaagtgcg gtgaacccat cattagctgt ggtgatctgc ctggcatcgt 5041 gccacagtag ccaaagcctc tgcacaggag tgtgggcaac taaggctgct gactttgaag 5101 gacagcctca ctcaggggga agctatttgc tctcagccag gccaagaaaa tcctgtttct 5161 ttggaatcgg gtagtaagag tgatcccagg gcctccaatt gacactgctg tgactgagga 5221 agatcaaaat gagtgtctct ctttggagcc actttcccag ctcagcctct cctctcccag 5281 tttcttccca tgggctactc tctgttcctg aaacagttct ggtgcctgat ttctggcaga 5341 agtacagctt cacctctttc ctttccttcc acattgatca agttgttccg ctcctgtgga 5401 tgggcacatt gccagccagt gacacaatgg cttccttcct tccttccttc agcatttaaa 5461 atgtagaccc tctttcattc tccgttccta ctgctatgag gctctgagaa acctcaggcc 5521 tttgagggga aaccctaaat caacaaaatg accctgctat tgtctgtgag aagtcaagtt 5581 atcctgtgtc ttaggccaag gaacctcact gtgggttccc acagaggcta ccaaattaca 5641 tgtatcctac tcatggggcc taggggttgg ggtgaccctg cactgctgtg tccctaacca 5701 caagaccccc ttctttcttc agtggtgttc tccatgtcct ttgtacaagg agaagaaagt 5761 aatgacaaaa tacctgtggc cttgggcctc aaggaaaaga atctgtacct gtcctgcgtg 5821 ttgaaagatg ataagcccac tctacagctg gaggtaagtg aatgctatgg aatgaagccc 5881 ttctcagcct cctgctacca cttattccca gacaaccacc ttctccccgc ccccatccct 5941 aggaaaagct gggaacaggt ctatttgaca attttgcatt aatgtaaata aatttaacat 6001 aatttttaac tgcgtgcaac cttcaatcct gctgcagaaa attaaatcat tttgccgatg 6061 ttattatgtc ctaccatagt tacaacccca acagattata tattgttagg gctgctctca 6121 tttgatagac accttgggaa atagatgact taaagggtcc cattatcacg tccactccac 6181 tcccaaaatc accaccacta tcacctccag ctttctcagc aaaagcttca tttccaagtt 6241 gatgtcattc taggaccata aggaaaaata caataaaaag cccctggaaa ctaggtactt 6301 caagaagctc tagcttaatt ttcacccccc aaaaaaaaaa aattctcacc tacattatgc 6361 tcctcagcat ttggcactaa gttttagaaa agaagagggc 
tcttttaaat aaattcacac 6421 agaaagttgg gcccagttac aactcaggag tctggctcct gatcatgtga cctgctcgtc 6481 agtttccttt ctggccaacc caaagaacat ctttcccata gcatctttgt cccttgcccc 6541 acaaaaattc ttctttctct ttcgtgcaga gtgtagatcc caaaaattac ccaaagaaga 6601 agatggaaaa gcgatttgtc ttcaacaaga tagaaatcaa taacaagctg gaatttgagt 6661 ctgcccagtt ccccaactgg tacatcagca cctctcaagc agaaaacatg cccgtcttcc 6721 tgggagggac caaaggcggc caggatataa ctgacttcac catgcaattt gtgtcttcct 6781 aaagagagct gtacccagag agtcctgtgc tgaatgtgga ctcaatccct agggctggca 6841 gaaagggaac agaaaggttt ttgagtacgg ctatagcctg gactttcctg ttgtctacac 6901 caatgcccaa ctgcctgcct tagggtagtg ctaagaggat ctcctgtcca tcagccagga 6961 cagtcagctc tctcctttca gggccaatcc ccagcccttt tgttgagcca ggcctctctc 7021 acctctccta ctcacttaaa gcccgcctga cagaaaccac ggccacattt ggttctaaga 7081 aaccctctgt cattcgctcc cacattctga tgagcaaccg cttccctatt tatttattta 7141 tttgtttgtt tgttttattc attggtctaa tttattcaaa gggggcaaga agtagcagtg 7201 tctgtaaaag agcctagttt ttaatagcta tggaatcaat tcaatttgga ctggtgtgct 7261 ctctttaaat caagtccttt aattaagact gaaaatatat aagctcagat tatttaaatg 7321 ggaatattta taaatgagca aatatcatac tgttcaatgg ttctgaaata aacttcactg 7381 aagaaaaaaa aagggtcttt cctgatcatt gacttgtctt ggatttgaca ctgaacagta 7441 aagacaaaca gggctgtgag agttcttggg ggactaaagc ccacctcctc attgctgagt 7501 gctgcaaagt cacctagaaa tatcccttgg ccaccgaaga ctatcctcct cacccatccc 7561 ctttatttct gttgttcaac agaaggatat tcagtgcaca tctggaacag gatcagctga 7621 agcactgcag ggagtcagga ctggtagtaa cagctaccag tgatttatct atcaatgcac 7681 caaacatctg ttgagcaagc gctatgtacg aggagctggg agtacagaga tgagaacagt 7741 cacaagtccc tcctcagata ggagaggcag ctagttataa gcagaaacaa ggtaacatga 7801 caagtagagt aagataaaga acaa // LOCUS HUMIL2RGA 4038 bp DNA PRI 18-OCT-1993 DEFINITION Human (IL2RG) gene, complete cds with repeats. ACCESSION L19546 NID g349631 KEYWORDS Alu repeat; Alu-like repeat; interleukin 2 receptor gamma chain. SOURCE Homo sapiens DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4038) AUTHORS Puck,J.M., Deschenes,S.M., Porter,J.C., Dutra,A.S., Brown,C.J., Willard,H.F. and Henthorn,P.S. TITLE The interleukin-2 receptor gamma chain maps to Xq13.1 and is mutated in X-linked severe combined immunodeficiency, SCIDX1 JOURNAL Hum. Mol. Genet. 2 (8), 1099-1104 (1993) MEDLINE 94004847 FEATURES Location/Qualifiers source 1..4038 /organism="Homo sapiens" /db_xref="taxon:9606" exon 74..202 /number=1 CDS join(88..202,581..734,943..1127,1336..1475,2239..2401, 2934..3030,3283..3352,3708..3893) /codon_start=1 /product="interleukin-2 receptor gamma subunit" /db_xref="PID:g349632" /translation="MLKPSLPFTSLLFLQLPLLGVGLNTTILTPNGNEDTTADFFLTT MPTDSLSVSTLPLPEVQCFVFNVEYMNCTWNSSSEPQPTNLTLHYWYKNSDNDKVQKC SHYLFSEEITSGCQLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENL TLHKLSESQLELNWNNRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKR YTFRVRSRFNPLCGSAQHWSEWSHPIHWGSNTSKENPFLFALEAVVISVGSMGLIISL LCVYFWLERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSE IPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET" intron 203..580 /number=1 mutation 366..369 /note="deletion" repeat_region 394..511 /rpt_family="Alu-like" exon 581..734 /number=2 intron 735..942 /number=2 exon 943..1127 /number=3 intron 1128..1335 /number=3 exon 1336..1475 /number=4 intron 1476..2238 /number=4 repeat_unit 1643..1655 /note="flanking" repeat_region 1656..1976 /rpt_family="Alu" repeat_region 1977..1989 /note="flanking" exon 2239..2401 /number=5 intron 2402..2933 /number=5 exon 2934..3030 /number=6 intron 3031..3282 /number=6 exon 3283..3352 /number=7 intron 3353..3707 /number=7 exon 3708..4038 /number=8 BASE COUNT 982 a 995 c 1030 g 1031 t ORIGIN 1 aaacgtgtgg gtggggaggg gtagtgggtg agggacccag gttcctgaca cagacagact 61 acacccaggg aatgaagagc aagcgccatg ttgaagccat cattaccatt cacatccctc 121 ttattcctgc agctgcccct gctgggagtg gggctgaaca cgacaattct gacgcccaat 181 gggaatgaag 
acaccacagc tggtgggaaa tctgggactg gagggggctg gtgagaaggg 241 tggctgtggg aaggggccgt acagagatct ggtgcctgcc actggccatt acaatcatgt 301 gggcagaatt gaaaagtgga gtgggaaggg caagggggag ggttccctgc ctcacgctac 361 ttcttctttc tttctttctt gtttgtttgt ttctttcttt cttttgaggc agggtctcac 421 tatgttgcct aggctggtct caaactcctg gcctctagtg atcctcctgc ctcagccttt 481 caaagcacca ggattacaga catgagccac cgtgcttggc ctcctccttc tgaccatcat 541 ttctctttcc ctccctgcct tcattttctc cccaatctag atttcttcct gaccactatg 601 cccactgact ccctcagtgt ttccactctg cccctcccag aggttcagtg ttttgtgttc 661 aatgtcgagt acatgaattg cacttggaac agcagctctg agccccagcc taccaacctc 721 actctgcatt attggtatga gaagggacga gggggagggg atgaagaaga ggtgggttgg 781 atcagagacc aagagagagg gtagcaagtc tcccaggtac cccactgttt tctcctgggg 841 taagtcataa gtcggttgag gggagatgag gctaggctct ggatatctgc agtacccaga 901 ttggccccac tgttcctctt ccttccaacc tttctcctct aggtacaaga actcggataa 961 tgataaagtc cagaagtgca gccactatct attctctgaa gaaatcactt ctggctgtca 1021 gttgcaaaaa aaggagatcc acctctacca aacatttgtt gttcagctcc aggacccacg 1081 ggaacccagg agacaggcca cacagatgct aaaactgcag aatctgggta atttggaaag 1141 aaagggtcaa gagaccaggg atactgtggg acattggagt ctacagagta gtgttctttt 1201 atcataaggg tacatgggca gaaaagagga ggtaggggat catgatggga agggaggagg 1261 tattaggggc actaccttca ggatcctgac ttgtctaggc caggggaatg accacatatg 1321 cacacatatc tccagtgatc ccctgggctc cagagaacct aacacttcac aaactgagtg 1381 aatcccagct agaactgaac tggaacaaca gattcttgaa ccactgtttg gagcacttgg 1441 tgcagtaccg gactgactgg gaccacagct ggactgtgag tgactaggga cgtgaatgta 1501 gcagctaagg ccaagaaagt agggctaaag gattcaacca gacagataga aggacctaat 1561 atcaagctcc tgttctctgc ctcccagctt ctctgctcac cccctaccct ccctcctcca 1621 actcctttcc cccctatttt ctccagtgag ttttcttttt ttcttttctt ttctttcttt 1681 ctttcttttt tttttttttg agacagagcc tcactctgtt gcccaggctt gagtgcagtg 1741 gggcgatctt gggctcactg cgacctctgt ctccctggtt caagtgattc tcctgcttca 1801 gcctcccaag tagctgggag catgcaccaa ccatgcctgg ctaatttttg tatttttagt 1861 aaagacaggg ttttgccatg ttggtcaggc 
tggtcttgaa ctcctgacct caggtgatct 1921 gcccacctcg gcctcccaaa gtgctggatt acaggcgtga gccaccattc ctgacaccag 1981 tgagttttca ttagggattc cctacccata ctcttcctga taccagatag acaagtaaac 2041 aaaaggaagc cattaagggg atccagaggg gaggcattag attcaagtca gtgaagggag 2101 cagtgtggct tgagtagtca agagatgaga gagaaactgg gcagtagcac agatgacact 2161 ggtgggtgtt caggagtatg ttttaattct cccttctctc atagacaccc actttccctc 2221 atcctctttc tcctcaagga acaatcagtg gattatagac ataagttctc cttgcctagt 2281 gtggatgggc agaaacgcta cacgtttcgt gttcggagcc gctttaaccc actctgtgga 2341 agtgctcagc attggagtga atggagccac ccaatccact gggggagcaa tacttcaaaa 2401 ggtaaaatgg gcccacatga cccaatccat gagcccaaca ccccagcctt tctaacacca 2461 ctgtcttttg ctccacttcc ctgtcactaa agcccctaaa cttggtgccc catctctcca 2521 cactgtctaa ccccaacctc tagaaatcaa ggtttttctg tgtagggttg ggttagcgtg 2581 ttgttagagt aggggagtgg attgagaagg aggctgaggg gtactcaagg gggctataga 2641 atgtatagga tttccctgaa gcattcctag agagcctgca aggtgaagat ggctttggaa 2701 ccagctggat ctaggctgtg ccacatacta cctctttggc cttggccaca tccctaaact 2761 cttggattct gtttcctaag atgtaagatg gaggtaattg ttcctgcctc acaggagctg 2821 ttgtgaggat taaacagaga gtatgtcttt agcgcggtgc ctggcaacag tgcctggcat 2881 gtagtagggg cacaacaaat ataaggtcca ctttgctttt cttttttcta tagagaatcc 2941 tttcctgttt gcattggaag ccgtggttat ctctgttggc tccatgggat tgattatcag 3001 ccttctctgt gtgtatttct ggctggaacg gtgagatttg gagaagccca gaaaaatgag 3061 gggaacggta gctgacaata gcagaggagg gttttgcagg gtctttagga gtaaaggatg 3121 agacagtaag taatgagaga ttacccaaga gggtttggtg atggaaggaa gccacaggca 3181 cagagaacac agaatcactt tatttcatat gggacaactg ggagaagggt gataaaaaag 3241 ctttaaccta tgtgctcctg ctccctcttt ctcccctgtc aggacgatgc cccgaattcc 3301 caccctgaag aacctagagg atcttgttac tgaataccac gggaactttt cggtgagaac 3361 gctgtcataa gcatgctgca gtctatcaac tgccaactgc ctgccagcaa gacagacaga 3421 gtgtgggggt gggggcagag aggagaggga aggaggccct gcactaactg tcaggatgtg 3481 gccgaccaaa tggggcatgg actatacaga gagagacaca cacagaagtg cagattatag 3541 attgaatgag gcagatggca actggtattg ggggcccagg 
agcctgtgta tcccttctgt 3601 aatcaattac agtggttgca gacatcatga gtactccttt ggcacagagc tcggtctttt 3661 acttcctgcc cctaattgac ccctgacctg gacatatctg tctttaggcc tggagtggtg 3721 tgtctaaggg actggctgag agtctgcagc cagactacag tgaacgactc tgcctcgtca 3781 gtgagattcc cccaaaagga ggggcccttg gggaggggcc tggggcctcc ccatgcaacc 3841 agcatagccc ctactgggcc cccccatgtt acaccctaaa gcctgaaacc tgaaccccaa 3901 tcctctgaca gaagaacccc agggtcctgt agccctaagt ggtactaact ttccttcatt 3961 caacccacct gcgtctcata ctcacctcac cccactgtgg ctgatttgga attttgtgcc 4021 cccatgtaag cacccctt // LOCUS HUMIL4A 9900 bp DNA PRI 06-JAN-1995 DEFINITION Human interleukin 4 (IL-4) gene, complete cds. ACCESSION M23442 NID g186336 KEYWORDS IL-4; interleukin 4. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9900) AUTHORS Arai,N., Nomura,D., Villaret,D., Dewaal Malefijt,R., Seiki,M., Yoshida,M., Minoshima,S., Fukuyama,R., Maekawa,M., Kudoh,J., Shimizu,N., Yokota,K., Abe,E., Yokota,T., Takebe,Y. and Arai,K. TITLE Complete nucleotide sequence of the chromosomal gene for human IL-4 and its expression JOURNAL J. Immunol. 
142 (1), 274-282 (1989) MEDLINE 89080260 FEATURES Location/Qualifiers source 1..9900 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="5q23-q31" TATA_signal 1079..1086 /gene="IL4" /note="G00-120-096" gene join(1106..1305,1578..1625,6826..7002,9591..9783) /gene="IL4" prim_transcript 1106..9783 /gene="IL4" /note="exons and introns; G00-120-096" exon 1106..1305 /gene="IL4" /note="G00-120-096" /number=1 /product="interleukin 4" mRNA join(1106..1305,1578..1625,6826..7002,9591..9783) /gene="IL4" /note="G00-120-096" /product="interleukin 4" CDS join(1171..1305,1578..1625,6826..7002,9591..9692) /gene="IL4" /codon_start=1 /db_xref="GDB:G00-120-096" /product="interleukin 4" /db_xref="PID:g186337" /translation="MGLTSQLLPPLFFLLACAGNFVHGHKCDITLQEIIKTLNSLTEQ KTLCTELTVTDIFAASKNTTEKETFCRAATVLRQFYSHHEKDTRCLGATAQQFHRHKQ LIRFLKRLDRNLWGLAGLNSCPVKEANQSTLENFLERLKTIMREKYSKCSS" intron 1306..1577 /gene="IL4" /note="G00-120-096" /number=1 exon 1578..1625 /gene="IL4" /note="G00-120-096" /number=2 /product="interleukin 4" intron 1626..6825 /gene="IL4" /note="G00-120-096" /number=2 exon 6826..7002 /gene="IL4" /note="G00-120-096" /number=3 /product="interleukin 4" intron 7003..9590 /gene="IL4" /note="G00-120-096" /number=3 exon 9591..9783 /gene="IL4" /note="G00-120-096" /number=4 /product="interleukin 4" BASE COUNT 2687 a 2204 c 2478 g 2531 t ORIGIN 1 gaattcaata aaaaacaagc agggcgcgtg gtggggcact gactaggagg gctgatttgt 61 aagttggtaa gactgtagct ctttttccta attagctgag gatgtgttta ggttccattc 121 aaaaagtggg cattcctggc caggcatggt ggctcacacc tgtaatctca gagctttggg 181 agactgaggt aggaggatca cttgagccca ggaatttgag atgagcctag gcaacatagt 241 gagactctta tctctatcaa aaaataaaaa taaaaatgag ccaggcatgg tgcggtggac 301 cacgcaccta ctgctagggg ggctgaggtg ggaggatcat tgagcctggg aggttgaggc 361 tgcagtgatc cctgatcaaa cattgcattt cagcctgggt gacagagtga gaccctgtct 421 cagaaaaaaa aaaaaaaagt cattcctgaa acctcagaat agacctacct tgccaagggc 481 ttccttatgg gtaaggacct tatggacctg ctgggaccca aactaggcct cacctgatac 541 
gacctgtcct tctcaaaaca ctaaacttgg gagaacattg tcccccagtg ctggggtagg 601 agagtctgcc tgttattctg cctctatgca gagaaggagc cccagatcat cttttccatg 661 acaggacagt ttccaagatg ccacctgtac ttggaagaag ccaggttaaa atacttttca 721 agtaaaactt tcttgatatt actctatctt tccccaggag gactgcatta caacaaattc 781 ggacacctgt ggcctctccc ttctatgcaa agcaaaaagc cagcagcagc cccaagctga 841 taagattaat ctaaagagca aattatggtg taatttccta tgctgaaact ttgtagttaa 901 ttttttaaaa aggtttcatt ttcctattgg tctgatttca caggaacatt ttacctgttt 961 gtgaggcatt ttttctcctg gaagagaggt gctgattggc cccaagtgac tgacaatctg 1021 gtgtaacgaa aatttccaat gtaaactcat tttccctcgg tttcagcaat tttaaatcta 1081 tatatagaga tatctttgtc agcattgcat cgttagcttc tcctgataaa ctaattgcct 1141 cacattgtca ctgcaaatcg acacctatta atgggtctca cctcccaact gcttccccct 1201 ctgttcttcc tgctagcatg tgccggcaac tttgtccacg gacacaagtg cgatatcacc 1261 ttacaggaga tcatcaaaac tttgaacagc ctcacagagc agaaggtgag tacctatctg 1321 gcaccatctc tccagatgtt ctggtgatgc tctcagtatt tctaggcatg aaaacgttaa 1381 cagctgctag agaagttgga actggtggtt ggtggcagtc cagggcacac agcgaggctt 1441 ctccctgcca ctcttttttc tgagggtttg taggaagttt cctcagttgg agggagtgag 1501 agctgctcat caaggacttc tctgtccggt tggaggttaa ctctgtctct tgctctctca 1561 tttctgcctg gaccaagact ctgtgcaccg agttgaccgt aacagacatc tttgctgcct 1621 ccaaggtaag aagccgtccc acggtctgtt ttagcaaatg gggagatcca tccccaaatg 1681 tctgaacaag aaacttgtct aatggaaaac gagcgggccc aaattaactc taaggtgtta 1741 gatgttttca aagaacgaga agtctgatct ttactcttaa gcatgttttg gtctttctgg 1801 tttcacttga tttagaagac atgtaataga aagcttacat gctgtagtcc tgactcagat 1861 cctggtcaaa gaaaagccct cttgggtttt acttagcttt ggcatagtgc ctggaacgta 1921 ggaggcactc aataaatgcc tgttgaatga gagaattttt ctggcccata catttctgaa 1981 aaaccaaata ctctcacaga aacagatatt gagatgacag gttgagggag ctttcatttt 2041 gtctaagaga cttcctatgg caacagaaaa ggtatcgcca gagcccctcc tcttccacag 2101 cctggccacc taacagccct ctggttccgg ggctgccgtc cagagctctc agcttgctct 2161 ggccggccga actcccctcc agctcggtct ggaaccatcc tgctgggcag cgtccagcac 2221 atccctgctt 
cgggctgcct gggcacctcg cctctctgcc tcctgtgctg cctcaccccc 2281 acccctctat ctgtagtggg agggagatag atttgacagc tgatagtgca ttttctctga 2341 caaacacatg actacagccg tatcaatagt tttgtgcatt tcagttcctg ttttcatgga 2401 aacacacggc tgagaatgaa agccccaaag cctcaatttc acagtggtct cctaactacc 2461 tgctttccat gcaaactagg gagatgatat ggccaggagt gaagccctgt gtgttgggca 2521 gggtcacact ccagcaccca gaccatagaa cagggcccat cctgcttcat gagggaaact 2581 gctcttcggg cctttagctg gactatctca tttcattagt tatcccggga gtccgataca 2641 ggatgagatt ctgaagggca aatacacact tttttttttt ttttgagata gggtcttgtt 2701 ctgtcaccca ggctggagtg cagtggtgcg atttcagctc atagcagcct ccacctccca 2761 ggctcaagct atcttcctac ctcagcctcc caagtagccg ggacgacagg tgtgcaccac 2821 cacgcctggc taatttttgt atttttttgt agagatggag tcttgccatt ttgcccaggc 2881 ttgtctcgaa cttctgggct caagcaatcc gtccacctcg gcctctcaaa gtgctgggat 2941 tagccactgc acctgggcaa cagtttatgt gtgtgtgtgt gtgtgtgtgt atatatgtgt 3001 gtgtgtgtat atatatgtgt gtatgtatat atgtgtatgt atatgtgtgt gtgtgtgtgt 3061 gtgtgtgtgt gtgtataaaa tctccaagtc catccaaccg agatggctcc tactagaagc 3121 caagagtcca ccgggttgag cactgggtct ctggaggcct gtcggactgc tgagaaggct 3181 ctaacaaagc caagggaagg gccacctcac tagaagccag gcctggagga agggtgaggg 3241 ctgagggctg gaggtaagac tgcctgtggt tttagaccca ggctctgcca ctgactagct 3301 gtgtggctgg ccttcagaca tcttcacagc tctctgcacc tcagtttcca catgtgaaga 3361 tatgaaagtg attctgaagg tgattgcaag gttgattgga atccagctct tgagttagtg 3421 caaagtgtta ttgtgagatg atataaccac gattaaaagc aagaacaggt gcagagaagc 3481 gatgattcta agaaggaggg gaccgggttg gaaaggatca aaccatccag gatgccgagt 3541 ctggggcaat ccatctgggc tgtttctgga agacccccgg gtgcaggcca ggacactgct 3601 gccctcccgt ccttaactcc cctcttcact cagtcctcac tcacctccct ctcacacaca 3661 caaacatctc ctagaataat ccccactgcc tgccttcact cttacccgtc tcatttgcct 3721 cccctgaact tcatcctcct ggagttcacg atctcactct tcactctttt cttcccctcg 3781 aagattcagc actgcttact tacatgttaa gatatttcag aacagtgaaa tgttgctatt 3841 ttcaaaaacc tacaaaggtg gtatgcagag gaaaaggtac ttctttgtgt tcccaaagaa 3901 aacatctttc caaaatccag 
cctattgatt ttatttcttc gggggaacaa gaattttagt 3961 atctctaagt tgggtagcat tctactcttg gcagttgctg gaaagaaggc actggtctag 4021 gtcctgggct tcacaggtaa cacctgtcag ggtgtctatg aagtcaaggc tgtctgagga 4081 acagcaaagt gggaagaagc aagctggctg gctgatgaag ggtttcttgg gtggacaagt 4141 agttggagcg atttcctatt taccaaagag agctaaagtt cataattcta cagagagttc 4201 cataatgaac ctcaaatacc tctgtttttt gaaggagttt ctcatataca gcactagctg 4261 actatcctgg gcaggatggg agataatgaa tgcagtgcca atcgggctgg atttatatgg 4321 tcctagtgag gctggtcaag aaccgagtta gaactctcac agagtcactg cccacagaag 4381 aaatctccca agtggctgtt tcctgacatt cccgggaggc aggcctcctt ctgagtcact 4441 ccctaagcag ttctgaactg tgaggtcagc caggctgtcc aagtgcactc cctgagccac 4501 tggcagacac actcagcagc cagagctaga caggcaggtg gtaggagtcc agggccacgg 4561 cagggatgga gtgtcgcccc ctcgctgcga taccagagca agtaaaacgt taaggccttg 4621 cactaaagct gcccttagga tgcattcttt taaagttttt ccatttaatg cagactcttt 4681 tcaattctta ttttatcctt gtttccttta gaaagtcctt tgaaaaatat ctttagaggg 4741 ttttttccta tactatgtgg ccatatacgg gtcaaaatta agtttaattt ccaggctcca 4801 agccagcgtt tcagaaaaat ctcaccaagg tttgtggtaa aagaagcaaa gggctgactt 4861 tttggttttc ttgaatctca ctgttccctc tgcagcagca tgcatgtctg cccacctcca 4921 gacacacagg caccatctgc cgccccccat cagcccgtgt cccttccacc tcgactcgcc 4981 tacaaagccc agagaggtct gtttcttggc ccccagagcc caaagatact gacacactct 5041 tacatttcca actagaatca ggaacgagga gtgactctca gtcagttcat taagtaaatg 5101 tctttctaac cgctctgccc atgggacatc acgccccaca ggggaaaggg gaagcttctg 5161 tagcctggga ttctggtgcc tcagtctggg tctagacttt cctgaaaaaa cgttaaaata 5221 tgaactgcat tcctagaatt tagcctacat aaataagaga tgaacacaaa gatttctata 5281 gtttactcac tgccgcttat ttacagaagc aaaaatctgc cacgataggg gcctgacaaa 5341 tgacagtacc actgtgcaat gcgtttctac gcagctctca atcccatgtt ctctaatacc 5401 accgaaggct taggaaatgc ttatggtata tgtaaagagt aaagaagtta caaacagtat 5461 caacagttga cccctatttt aaaaagtatt tttgaaaagt gtgacgatat ttaccaaaat 5521 attaacgagc aatagttacc tctggctggt gggatgagtg aatgtatttt tgttgaatat 5581 atgttacctt tatagtaaat atatgttatc 
ttgatcatca gaaaaaaaaa tatgtaagaa 5641 cttgaaagct gcttggacag cgctgctgat agaaacccct gagcatcttg tcactgttct 5701 tctgattcag agggtctggg tggggcaggg gtggtctgag attctgtatt tctaagaagc 5761 tcccagtgat gtccatgctg ctgtccatgg accacacttt gagtatcaag ggaccagagc 5821 atgtcggggg agaggctggg gatagctttc tttatctgaa ctggataaag gaactgggct 5881 caagctaaga accctctcca ggttctgcat ctttgttctt cagtgaaaaa tgagaggaca 5941 caccaggcca ggttcagact gagacacaat ccctctcctg ggttcccaat gacttgtctc 6001 ttgtccattc ccttctctaa ggctaagggc cccccaggaa gagccatgtg gccagaccct 6061 cacagttgct ggcattccaa ggagattctc actccgcatc attggtccaa aaggcccctt 6121 acagaagctc tgcccaaggc tcagatcaat ggcacctgct cccagagctc ctctgatctc 6181 ccaggacacc tttccctgat ctgtgcactt atctcttgct gcctggcaaa atgtcttagc 6241 tcctcacttg ggccatgtgc tgctctcctc tcccatgggg agagccacac ggagagtgct 6301 ggccaaagca gcagagttca ggccaaagga tgtgcactca tttattcaac aggcatgcag 6361 gatttccagg gaaagctgga ttttaaaacc tctgggaaca agagcagaac ctgactgaga 6421 gctcatgtgg gcacttttca tagcagaata gctcatgagg tatagagaca cggacgcaga 6481 acgtgggctg tagcgacaga tggtcctgca ttctagtccc cactgtgcct tttcctcatg 6541 ggatgacttt attcaggtac cctttcggca aaatcctcca agagaaagga aactgggagg 6601 ttctggggag aaggctgctg cgtttgcaat tgggagaggt tgttgacaga ggtttatgtc 6661 tgtggcaagc agccttcctt cagtggaata cttgaagaca ggtctgtagt tgagcaaact 6721 cacctccatt tgtcctcctg gaaagaagaa atcaagagga aaaatctctc tcccatcctc 6781 caaatggagc tggcacattg ctatctgtgg catttgtctt tccagaacac aactgagaag 6841 gaaaccttct gcagggctgc gactgtgctc cggcagttct acagccacca tgagaaggac 6901 actcgctgcc tgggtgcgac tgcacagcag ttccacaggc acaagcagct gatccgattc 6961 ctgaaacggc tcgacaggaa cctctggggc ctggcgggct tggtaagctg cactgtattc 7021 ctggcaagcc ggccgcgtgg ctcctggtgg acagcagcct cacttctaaa cactccttag 7081 gagctgcagc acccttggtc aacccattca ttcattcact cattcaataa gtatttgctg 7141 aagttccaca agtgctgggt gtggttctag gtgctgagga cgtgtcacta aagacagcag 7201 gccgagtccc tgttctcatg gaatgttcta atgggagagt tagaaaaaca aacatgtaaa 7261 atgatggcca gcagtgatac gtgctacaaa gaaaaacata 
gaaataaaga acataagagt 7321 catgggggag ggggctgact taggagctgg tgacattatc tgagcagata tttgaattga 7381 gggagcaggc cacatgacta actagggaga ccattccagg gagaaggagg aggtatgcaa 7441 aggccttagg atggaaatga actaacttcc tgtatttaaa gaccagtagg aaggccagtg 7501 tggctggatc agagtgagtg aggggtagtt tccaggacag cagatcacac aaggccttta 7561 gattccacca cgagtatgga gggaacacct gcagagcttt gggcaggaca aagactgtac 7621 aatctgattt acgtgattta aaagggtcag tctggctact gtgtggtaaa taggctgaaa 7681 gggggaaagc atagaagcaa gatggcctgt tgggaggcta ccacagtaaa ccaggctaga 7741 gatgatggtg gcgtggacag aatgaagcaa gatggcctgt tgggaggcta ccacagtaaa 7801 ccaggctaga gatgatggtg gcgtggacag aatgaagcaa gatggcctgt tgggaggcta 7861 ccacagtaaa ccaggctaga gatgatggtg gcgtggacaa atggagcagt tgaggtgaac 7921 agatttggga tatgactaaa aataaaacca gaagatttgc tgacagatcg gttgtagggg 7981 gtaagataca ggggaggaaa agatgacctc tttgttcctg cccaaacccc tctggcgatg 8041 gtcagtactg tttacagaga gatgaaagac tggcggcaag gcagggctgg aggttcagca 8101 gaagatcaag agttcaattt tgtacatcgt acatgtaagg tggctcttgg atagccaagt 8161 gaaggtgttg agaagatggt tagaaaagtc tggaacttag gggagaggtc agaacttgca 8221 atacaaaaag gagagtcctt agatagatac tgctgaaaat ctgaatgaca gaaagggaga 8281 gatcaaagga ctgagcctga gatcaacaca tggaggtcag gagaggagga tccagccaag 8341 gggcctgagg aggagtgacc agtgaggcag gagaacatgg agagtgggcg gtaccccagg 8401 aagccggtga ggacactcaa ggagggaggg ttgactgtgt caaatgtact gaaaggacag 8461 gtcaggtgag gaccaagaaa ggcccctggg tttggctgat ggaggccatg ggtgaggctg 8521 atgtaaatgg agaggcagga aggaaagccc agctggagtg ggctcaccga ggatagggtg 8581 gcgagaggag acaaagaagg aacagtgagg gcagacaact ctttgaagat gtttagctat 8641 aaggctgcag agaaactgag cccacagctg cagggtggtt atggagtgag ggaagctctt 8701 ttaaggttgg gggtataccc agcatgttaa tgcacctggg ggaatggtcc agtggagcag 8761 gaagaactga agagagcaga aagaggaaga atcattaggg ggcagaagtc cttgtagccc 8821 agagtggatg ttatctaata tcgagtggag gaattaattg gctttagagg agaacaagga 8881 catgtatccc ctctctgggc ctatcacctt gtagacaatg ggataggtca tgggatagga 8941 acttggcaca acacatgttc tctcttttaa ttctctccat tatcttatga 
agcaggcaag 9001 taggcaaaca attgtcccaa ctttacaaaa gaaactgaag cttttataaa ttaagtagta 9061 catcctaagc aatacaatta ataaatggta gagctgagat tcaaactgaa gcagtggcct 9121 gggggtagca tctggaatcc ttcccacctt tagggctgct gtgctgcggt gctgctgttt 9181 aatggcacag agggccagat gactgaatct ctctcagcag tccaggcagt catgcagaag 9241 gcccagtaga gcaccgggca ggtctgagcc agcatcttca agttccaccc tgtgagcaag 9301 cacttagctg tgacacactt ctcgagagac tggactcccc cccgcgcaac ccacccaaaa 9361 gcagataggt aatggtatac agtaaccatt tctagaagtg taagtagtat gcacccaaaa 9421 taggcaaaac ctgctggcct agtgatagag acaactccca gtcaggctag actggaggcc 9481 ttggttttat aagtgttcag gtgacaagtg ccacagtagg cttgatcaag tagacaggca 9541 ggcaagacaa atgcttacca atgcaagcta atgaaatgtt tcttttgcag aattcctgtc 9601 ctgtgaagga agccaaccag agtacgttgg aaaacttctt ggaaaggcta aagacgatca 9661 tgagagagaa atattcaaag tgttcgagct gaatatttta atttatgagt ttttgatagc 9721 tttatttttt aagtatttat atatttataa ctcatcataa aataaagtat atatagaatc 9781 taacagcaat ggcatttaat gtattggcta tgtttacttg acaaatgaaa ttatggtttg 9841 caacttttag ggaaatcaat ttagtttacc aagagactat aaatgctatg gagccaaaac // LOCUS HUMIL5 3230 bp DNA PRI 21-AUG-1995 DEFINITION Human interleukin 5 (IL-5) gene, complete cds. ACCESSION J03478 NID g186338 KEYWORDS colony stimulating factor; interleukin 5. SOURCE Human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3230) AUTHORS Tanabe,T., Konishi,M., Mizuta,T., Noma,T. and Honjo,T. TITLE Molecular cloning and structure of the human interleukin-5 gene JOURNAL J. Biol. Chem. 262 (34), 16580-16584 (1987) MEDLINE 88059042 REFERENCE 2 (bases 1 to 3230) AUTHORS Tanabe,T. TITLE Direct Submission JOURNAL Submitted (09-SEP-1987) T. 
Tanabe, Department of Medical Chemistry, Kyoto University Faculty of Medicine, Japan FEATURES Location/Qualifiers source 1..3230 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal liver, placenta, and myeloma cells" /cell_line="myeloma cell line 266B1" /clone="Lambda-[12,22,38]" /map="1589 bp upstream of HindIII site; 5q23.3-q32" /chromosome="5" prim_transcript 509..2583 /note="IL5 mRNA and introns" CDS join(553..696,905..937,1883..2011,2118..2216) /gene="IL5" /codon_start=1 /product="interleukin 5" /db_xref="PID:g386822" /translation="MRMLLHLSLLALGAAYVYAIPTEIPTSALVKETLALLSTHRTLL IANETLRIPVPVHKNHQLCTEEIFQGIGTLESQTVQGGTVERLFKNLSLIKKYIDGQK KKCGEERRRVNQFLDYLQEFLGVMNTEWIIES" exon <553..696 /gene="IL5" /number=1 /product="interleukin 5; G00-120-097" gene 553..2216 /gene="IL5" intron 697..904 /gene="IL5" /note="IL5 intron A" exon 905..937 /gene="IL5" /number=2 intron 938..1882 /gene="IL5" /note="IL5 intron B" exon 1883..2011 /gene="IL5" /number=3 intron 2012..2117 /gene="IL5" /note="IL5 intron C" exon 2118..>2216 /gene="IL5" /number=4 BASE COUNT 1027 a 545 c 622 g 1036 t ORIGIN 1 atcctaatca agaccccagt gaacagaact cgaccctgcc aaggcttggc atttccattt 61 caatcactgt cttcccacca gtattttcaa tttcttttaa gacagattaa tctagccaca 121 gtcatagtag aacatagccg atcttgaaaa aaaacattcc caatatttat gtattttagc 181 ataaaattct gtttagtggt ctaccttata ctttgttttg cacacatctt ttaagaggaa 241 gttaattttc tgattttaag aaatgcaaat gtggggcaat gatgtattaa cccaaagatt 301 ccttccgtaa tagaaaatgt ttttaaaggg gggaaacagg gatttttatt attaaaagat 361 aaaagtaaat ttatttttta agatataagg cattggaaac atttagtttc acgatatgcc 421 attattaggc attctctatc tgattgttag aaattattca tttcctcaaa gacagacaat 481 aaattgactg gggacgcagt cttgtactat gcactttctt tgccaaaggc aaacgcagaa 541 cgtttcagag ccatgaggat gcttctgcat ttgagtttgc tagctcttgg agctgcctac 601 gtgtatgcca tccccacaga aattcccaca agtgcattgg tgaaagagac cttggcactg 661 ctttctactc atcgaactct gctgatagcc aatgaggtaa ttttctttat gattcctaca 721 gtctgtaaag tgcataggta atcatttgtg atggttcctt 
tactatatat agagatctgt 781 tataaataat aagattctga gcacattagt acatgggtga taactacatc accagcaaac 841 attctgttaa aagttatgaa tgctggtgtg ctgtaaaaat gattgtattt cctttcctct 901 ccagactctg aggattcctg ttcctgtaca taaaaatgta agttaaatta tgattcagta 961 aaatgatggc atgaataagt aaatttcctg ttttaagctg taaatcatta gttatcattg 1021 gaactattta attttctata ttttgttttc atatgggtgg ctgtgaatgt ctgtacttat 1081 aaatatgagg aatgactttt tatcaagtag aatcctttaa acaagtggat taggctcttt 1141 ggtgatgttg ttagtttgcc ttcccaaaga gcatcgtgtc aggattcttt ccagaaggat 1201 tccacactga gtgagaggtg cgtgctagtc tccgtgcagt tctgactctt tctcactcta 1261 acgtgtttct gaaagtatta gcaactcaga attatatttt tagaaccatg atcagtagac 1321 attaaaatat ataacaaatg ccctatatta ataattctgc atacttaaat aattatgact 1381 atatgatggt gtgtatgcat tgaatatgcc tggtcatatt aaaatgtaaa atatatagtt 1441 tattagtcta aatagaataa aactaccagc tagaactgta gaaacacatt gatatgagtt 1501 taatgtataa tgcattacac ttccaaaaca tttttttcca gttacataat taagttatat 1561 cctttataaa actcctcagt aatcatataa gcttcatcta ctttttgaaa attttatctt 1621 aatatgtggt ggtttgttgc ctagaaaaca aacaaaaaac tctttggaga agggaactca 1681 tgtaaatacc acaaaacaaa gcctaacttt gtggaccaaa attgttttaa taattatttt 1741 ttaattgatg aattaaaaag tatatatatt tattgtgtac aatatgatgt tttgaagtat 1801 gtatacattg cagaatggac aatggaccaa atttttatac cttgtcttga ttatttgcat 1861 tttaaaaatt ttcctcattt agcaccaact gtgcactgaa gaaatctttc agggaatagg 1921 cacactggag agtcaaactg tgcaaggggg tactgtggaa agactattca aaaacttgtc 1981 cttaataaag aaatacattg acggccaaaa agtaagttac acacattcaa tggaagctat 2041 atttgtcctg gctgtgccta tttctatgga attgacagtt tcctgtaata cctattgtca 2101 tttttctttt ttcacagaaa aagtgtggag aagaaagacg gagagtaaac caattcctag 2161 actacctgca agagtttctt ggtgtaatga acaccgagtg gataatagaa agttgagact 2221 aaactggttt gttgcagcca aagattttgg aggagaagga cattttactg cagtgagaat 2281 gagggccaag aaagagtcag gccttaattt tcaatataat ttaacttcag agggaaagta 2341 aatatttcag gcatactgac actttgccag aaagcataaa attcttaaaa tatatttcag 2401 atatcagaat cattgaagta ttttcctcca ggcaaaattg atatactttt 
ttcttattta 2461 acttaacatt ctgtaaaatg tctgttaact taatagtatt tatgaaatgg ttaagaattt 2521 ggtaaattag tatttattta atgttatgtt gtgttctaat aaaacaaaaa tagacaactg 2581 ttcaatttgc tgctggcctc tgtccttagc aatttgaagt tagcacagtc cattgagtac 2641 atgcccagtt tggaggaagg gtctgagcac atgtggctga gcatccccat ttctctggag 2701 aagtctcaag gttgcaaggc acaccagagg tggaagtgat ctagcaggac ttagtgggga 2761 tgtggggagc agggacacag gcaggaggtg aacctggttt tctctctaca gtatatccag 2821 aacctgggat ggtcgaaggg taaatggtag ggaataaatg aatgaatgtc gtttccaaga 2881 tgattgtaga actaaaatga gttgtaagct cccctggaag aagggatgtg gaacctgtaa 2941 ctaggttcct gcccagcctg tgagaagaat ttggcagatc atctcattgc cagtatagag 3001 aggaagccag aaaccctctc tgccaaggcc tgcaggggtt cttaccacct gaccctgcac 3061 cataacaaaa ggacagagag acatggtagg gcagtcccat tagaaagact gagttccgta 3121 ttcccggggc agggcagcac caggccgcac aacatccatt ctgcctgctt atggctatca 3181 gtagcatcac tagagattct tctgtttgag aaaacttctc tcaaggatcc // LOCUS HUMIL8A 5191 bp DNA PRI 06-JAN-1995 DEFINITION Human interleukin 8 (IL8) gene, complete cds. ACCESSION M28130 NID g186367 KEYWORDS interleukin 8. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5191) AUTHORS Mukaida,N., Shiroo,M. and Matsushima,K. TITLE Genomic structure of the human monocyte-derived neutrophil chemotactic factor IL-8 JOURNAL J. Immunol. 
143 (4), 1366-1371 (1989) MEDLINE 89309826 FEATURES Location/Qualifiers source 1..5191 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="4q13-q21" CAAT_signal 1382..1388 /gene="IL8" /note="G00-120-099" TATA_signal 1462..1469 /gene="IL8" /note="G00-120-099" exon 1482..1647 /gene="IL8" /note="G00-120-099" /number=1 mRNA join(1482..1647,2464..2599,2871..2954,3370..(4216.4236)) /gene="IL8" /note="G00-120-099" gene join(1482..1647,2464..2599,2871..2954,3370..(4216.4236)) /gene="IL8" CDS join(1584..1647,2464..2599,2871..2954,3370..3385) /gene="IL8" /codon_start=1 /db_xref="GDB:G00-120-099" /product="interleukin 8" /db_xref="PID:g186368" /translation="MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPF HPKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS" exon 2464..2599 /gene="IL8" /note="G00-120-099" /number=2 exon 2871..2954 /gene="IL8" /note="G00-120-099" /number=3 exon 3370..(4216.4236) /gene="IL8" /note="G00-120-099" /number=4 polyA_signal 4199..4204 /gene="IL8" /note="G00-120-099; putative" BASE COUNT 1728 a 870 c 858 g 1735 t ORIGIN 1 gaattcagta acccaggcat tattttatcc tcaagtctta ggttggttgg agaaagataa 61 caaaaagaaa catgattgtg cagaaacaga caaacctttt tggaaagcat ttgaaaatgg 121 cattccccct ccacagtgtg ttcacagtgt gggcaaattc actgctctgt cgtactttct 181 gaaaatgaag aactgttaca ccaaggtgaa ttatttataa attatgtact tgcccagaag 241 cgaacagact tttactatca taagaaccct tccttggtgt gctctttatc tacagaatcc 301 aagacctttc aagaaaggtc ttggattctt ttcttcagga cactaggaca taaagccacc 361 tttttatgat ttgttgaaat ttctcactcc atcccttttg ctgatgatca tgggtcctca 421 gaggtcagac ttggtgtcct tggataaaga gcatgaagca acagtggctg aaccagagtt 481 ggaacccaga tgctctttcc actaagcata caactttcca ttagataaca cctccctccc 541 accccaacca agcagctcca gtgcaccact ttctggagca taaacatacc ttaactttac 601 aacttgagtg gccttgaata ctgttcctat ctggaatgtg ctgttctctt tcatcttcct 661 ctattgaagc cctcctattc ctcaatgcct tgctccaact gcctttggaa gattctgctc 721 ttatgcctcc actggaatta atgtcttagt accacttgtc tattctgcta tatagtcagt 781 ccttacattg 
ctttcttctt ctgatagacc aaactcttta aggacaagta cctagtctta 841 tctatttcta gatcccccac attactcaga aagttactcc ataaatgttt gtggaactga 901 tttctatgtg aagacatgtg ccccttcact ctgttaacta gcattagaaa aacaaatctt 961 ttgaaaagtt gtagtatgcc cctaagagca gtaacagttc ctagaaactc tctaaaatgc 1021 ttagaaaaag atttatttta aattacctcc ccaataaaat gattggctgg cttatcttca 1081 ccatcatgat agcatctgta attaactgaa aaaaaataat tatgccatta aaagaaaatc 1141 atccatgatc ttgttctaac acctgccact ctagtactat atctgtcaca tggtctatga 1201 taaagttatc tagaaataaa aaagcataca attgataatt caccaaattg tggagcttca 1261 gtattttaaa tgtatattaa aattaaatta ttttaaagat caaagaaaac tttcgtcata 1321 ctccgtattt gataaggaac aaataggaag tgtgatgact caggtttgcc ctgaggggat 1381 gggccatcag ttgcaaatcg tggaatttcc tctgacataa tgaaaagatg agggtgcata 1441 agttctctag tagggtgatg atataaaaag ccaccggagc actccataag gcacaaactt 1501 tcagagacag cagagcacac aagcttctag gacaagagcc aggaagaaac caccggaagg 1561 aaccattctc actgtgtgta aacatgactt ccaagctggc cgtggctctc ttggcagcct 1621 tcctgatttc tgcagctctg tgtgaaggta agcacatctt tctgacctac agcgttttcc 1681 tatgtctaaa tgtgatcctt agatagcaaa gctattcttg atgctttggt aacaaacatc 1741 ctttttattc agaaacagaa tataatctta gcagtcaatt aatgttaaat tgaagattta 1801 gaaaaaacta tatataacac ttaggaaata taaaggtttg atcaatatag atattctgct 1861 tttataattt ataccaggta gcatgcatat atttaacgta aataagtaat ttatagtatg 1921 tcctattgag aaccacggtt acctatatta tgtattaata ttgagttgag caaggtaact 1981 cagacaattc cactccttgt agtatttcat tgacaagcct cagatttgtc attaattcct 2041 gtctggttta aagataccct gattatagac caggcatgta taacttattt atatatttct 2101 gttaattctt tctgaaggca atttctatgc tggagagtct tagcttgcct actataaata 2161 acactgtggt atcacagagg attatgcaat attgaccaga taaaaatacc atgaagatgt 2221 tgatattgta caaaaagaac tctaactctt atataggaag ttgttcaatg ttgtcagtta 2281 tgactgtttt ttaaaacaaa gaactaactg aggtcaaggg ctaggagata ttcaggaatg 2341 agttcactag aaacatgatg ccttccatag tctccaaata atcatattgg aattagaagg 2401 aagtagctgg cagagctgtg cctgttgata aaatcaatcc ttaatcactt tttcccccaa 2461 caggtgcagt tttgccaagg 
agtgctaaag aacttagatg tcagtgcata aagacatact 2521 ccaaaccttt ccaccccaaa tttatcaaag aactgagagt gattgagagt ggaccacact 2581 gcgccaacac agaaattatg taagtacttt aaaaaagatt agatattttg ttttagcaaa 2641 cttaaaatta aggaaggtgg aaatatttag gaaagttcca ggtgttagga ttacagtagt 2701 aaatgaaaca aaacaaaata aaaatatttg tctacatgac atttaaatat ggtagcttcc 2761 acaactacta taaatgttat tttggactta gactttatgc ctgacttaag gaatcatgat 2821 ttgaatgcaa aaactaaata ttaatctgaa ccatttcttt cttatttcag tgtaaagctt 2881 tctgatggaa gagagctctg tctggacccc aaggaaaact gggtgcagag ggttgtggag 2941 aagtttttga agaggtaagt tatatatttt ttaatttaaa tttttcattt atcctgagac 3001 atataatcca aagtcagcct ataaatttct ttctgttgct aaaaatcgtc attaggtatc 3061 tgcctttttg gttaaaaaaa aaggaatagc atcaatagtg agtttgttgt acttatgacc 3121 agaaagacca tacatagttt gcccaggaaa ttctgggttt aagcttgtgt cctatactct 3181 tagtaaagtt ctttgtcact cccagtagtg tcctatttta gatgataatt tctttgatct 3241 ccctatttat agttgagaat atagagcatt tctaacacat gaatgtcaaa gactatattg 3301 acttttcaag aaccctactt tccttcttat taaacatagc tcatctttat atttttaatt 3361 ttattttagg gctgagaatt cataaaaaaa ttcattctct gtggtatcca agaatcagtg 3421 aagatgccag tgaaacttca agcaaatcta cttcaacact tcatgtattg tgtgggtctg 3481 ttgtagggtt gccagatgca atacaagatt cctggttaaa tttgaatttc agtaaacaat 3541 gaatagtttt tcattgtacc atgaaatatc cagaacatac ttatatgtaa agtattattt 3601 atttgaatct acaaaaaaca acaaataatt tttaaatata aggattttcc tagatattgc 3661 acgggagaat atacaaatag caaaattggg ccaagggcca agagaatatc cgaactttaa 3721 tttcaggaat tgaatgggtt tgctagaatg tgatatttga agcatcacat aaaaatgatg 3781 ggacaataaa ttttgccata aagtcaaatt tagctggaaa tcctggattt ttttctgtta 3841 aatctggcaa ccctagtctg ctagccagga tccacaagtc cttgttccac tgtgccttgg 3901 tttctccttt atttctaagt ggaaaaagta ttagccacca tcttacctca cagtgatgtt 3961 gtgaggacat gtggaagcac tttaagtttt ttcatcataa cataaattat tttcaagtgt 4021 aacttattaa cctatttatt atttatgtat ttatttaagc atcaaatatt tgtgcaagaa 4081 tttggaaaaa tagaagatga atcattgatt gaatagttat aaagatgtta tagtaaattt 4141 attttatttt agatattaaa tgatgtttta 
ttagataaat ttcaatcagg gtttttagat 4201 taaacaaaca aacaattggg tacccagtta aattttcatt tcagatatac aacaaataat 4261 tttttagtat aagtacatta ttgtttatct gaaattttaa ttgaactaac aatcctagtt 4321 tgatactccc agtcttgtca ttgccagctg tgttggtagt gctgtgttga attacggaat 4381 aatgagttag aactattaaa acagccaaaa ctccacagtc aatattagta atttcttgct 4441 ggttgaaact tgtttattat gtacaaatag attcttataa tattatttaa atgactgcat 4501 ttttaaatac aaggctttat atttttaact ttagtgtttt tatgtgctct ccaaattttt 4561 tttactgttt ctgattgtat ggaaatataa aagtaaatat gaaacattta aaatataatt 4621 tgttgtcaaa gtaatcaagt gtttgtcttt tttttagttt tagcttattg ggattctctt 4681 tgtttatatt taaaattata ctttgattta gaaaacataa atgcttcccc ttagcatttt 4741 gttatggaaa attacaaact tttattttta gaaaacagaa ctcctttcca gaaataggtt 4801 acaaacagta gtgtcctcca cagaatgttg gaaatgtttt caactcccca ctgtatacta 4861 tcttgctaat aagtctgtct tcagatttcg attaaccggt ttgtatgtct gtgcacttta 4921 gcatagctgg acattaaaga ggaaagagag tacatattat aagttgctta tcagtaactg 4981 aggagtaaaa ctgataaatg tgaggcaaag aagtttaaaa tatggttaaa gcctaagcat 5041 atttgcaaac aaatcaaaca atactctgag aagtaaaaac ataattattt aattaacaaa 5101 tttcagtgga taaattttat aacaaattag acacagttga aaataaaatt agaaaactag 5161 aaaatagaac aaaagaaact tctggaattc a // LOCUS HUMIL9RA 17073 bp DNA PRI 03-APR-1997 DEFINITION Homo sapiens interleukin 9 receptor (IL9R) gene, complete cds. ACCESSION L39064 NID g632992 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17073) AUTHORS Kermouni,A., Van Roost,E., Arden,K.C., Vermeesch,J.R., Weiss,S., Godelaine,D., Flint,J., Lurquin,C., Szikora,J.P., Higgs,D.R., Marynen,P. and Renauld,J.-C. 
TITLE The IL-9 receptor gene (IL9R): genomic structure, chromosomal localization in the pseudoautosomal region of the long arm of the sex chromosomes, and identification of IL9R pseudogenes at 9qter, 10pter, 16pter, and 18pter JOURNAL Genomics 29 (2), 371-382 (1995) MEDLINE 96115587 FEATURES Location/Qualifiers source 1..17073 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="LB33.DAUV" /cell_type="melanoma cell" /dev_stage="Adult" /sex="female" /tissue_type="melanoma" /chromosome="X" repeat_unit 1..17073 /gene="IL9R" /rpt_family="Alu repeat" promoter 1..1572 /gene="IL9R" /note="G00-134-444" gene 1..17073 /gene="IL9R" mRNA join(1573..1814,6934..7047,7477..7588,7705..7883, 8448..8593,9306..9507,10111..10216,11551..11635, 13833..17073) /gene="IL9R" /note="G00-134-444" exon 1573..1814 /gene="IL9R" /note="1a; G00-134-444" /number=1 CDS join(1787..1814,6934..7047,7477..7588,7705..7883, 8448..8593,9306..9507,10111..10216,11551..11635, 13833..14426) /gene="IL9R" /codon_start=1 /product="interleukin 9 receptor" /db_xref="PID:g632993" /db_xref="GDB:G00-134-444" /translation="MGLGRCIWEGWTLESEALRRDMGTWLLACICICTCVCLGVSVTG EGQGPRSRTFTCLTNNILRIDCHWSAPELGQGSSPWLLFTSNQAPGGTHKCILRGSEC TVVLPPEAVLVPSDNFTITFHHCMSGREQVSLVDPEYLPRRHVKLDPPSDLQSNISSG HCILTWSISPALEPMTTLLSYELAFKKQEEAWEQAQHRDHIVGVTWLILEAFELDPGF IHEARLRVQMATLEDDVVEEERYTGQWSEWSQPVCFQAPQRQGPLIPPWGWPGNTLVA VSIFLLLTGPTYLLFKLSPRVKRIFYQNVPSPAMFFQPLYSVHNGNFQTWMGAHGAGV LLSQDCAGTPQGALEPCVQEATALLTCGPARPWKSVALEEEQEGPGTRLPGNLSSEDV LPAGCTEWRVQTLAYLPQEDWAPTSLTRPAPPDSEGSRSSSSSSSSNNNNYCALGCYG GWHLSALPGNTQSSGPIPALACGLSCDHQGLETQQGVAWVLAGHCQRPGLHEDLQGML LPSVLSKARSWTF" sig_peptide join(1787..1814,6934..7025) /gene="IL9R" intron 1815..2621 /gene="IL9R" /note="1a" /number=1 exon 2622..2741 /gene="IL9R" /note="1b; G00-134-444" /number=1 intron 2742..5356 /gene="IL9R" /note="1b" /number=1 exon 5357..5535 /gene="IL9R" /note="1c; G00-134-444" /number=1 intron 5536..6933 /gene="IL9R" /note="1c" /number=1 exon 6934..7047 /gene="IL9R" 
/note="G00-134-444" /number=2 mat_peptide join(7026..7047,7477..7588,7705..7883,8448..8593, 9306..9507,10111..10216,11551..11635,13833..14423) /gene="IL9R" /note="G00-134-444" /product="interleukin 9 receptor" intron 7048..7476 /gene="IL9R" /number=2 exon 7477..7588 /gene="IL9R" /note="G00-134-444" /number=3 intron 7589..7704 /gene="IL9R" /number=3 exon 7705..7883 /gene="IL9R" /note="G00-134-444" /number=4 intron 7884..8447 /gene="IL9R" /number=4 exon 8448..8593 /gene="IL9R" /note="G00-134-444" /number=5 intron 8594..9305 /gene="IL9R" /number=5 exon 9306..9507 /gene="IL9R" /note="G00-134-444" /number=6 intron 9508..10110 /gene="IL9R" /number=6 exon 10111..10216 /gene="IL9R" /note="G00-134-444" /number=7 intron 10217..11550 /gene="IL9R" /number=7 exon 11551..11635 /gene="IL9R" /note="G00-134-444" /number=8 intron 11636..13832 /gene="IL9R" /number=8 repeat_unit 12885..13467 /gene="IL9R" /rpt_family="GT repeat" exon 13833..17073 /gene="IL9R" /note="G00-134-444" /number=9 polyA_signal one-of(15339..15345,17016..17022) /gene="IL9R" /note="putative" BASE COUNT 3514 a 4234 c 5045 g 4280 t ORIGIN 1 ggatccatgc ccagaattat tcacgaaatt cacttaaaat agcatatatg cttgaaccca 61 ggagtctgag gctgcagcga gctatgatca tgtcactgca ctccagccca gcctgggcga 121 cagagcaaga ctctgtgtct aaagaaatga aaataaaaat aaataaaata gcatatacag 181 gcaacaatcc aaatgttcat tgcagaaatt atgattatta ttgtttttct tttagaggca 241 gggtgtcact ctgttgccca gactaatctt gacctcctgt gatcctcaag agatcctccg 301 gtctctgctt cccaaagttc ttggattaca ggtgtgaggc actgggccta gacattgcag 361 ggattattaa accatggtgc atactaccat taaaggtatg aggtaggttt tatatgctga 421 cctagaaaga tggtgcaaca ccatagggga acaagaaggc tgcaaagtgg cagcacagca 481 ggagtcttta agaaataggt atttgtttca gcttaagaag tgtctggctg ggcgtggtgg 541 ctcacacctg taatcccagc acttagggag gcttaggtgg gcggatcact tgaggttagg 601 agttcaagac cagcccagtc aacaccggtg aaaccccatc tctactaaaa atacaaagaa 661 attagctggg cttggtggca ggtggctgta atcccagcta ctcaggagct gaggcaggag 721 aatcatctga acccgggagg ccgaggttgc agtgagccta 
gatcgcacca ctgcactcca 781 gcctgggtga cagagtaaga ctcggtcaaa aaaaaaaaaa aaaaagaccg ggtgcgaggt 841 ctctcgcctg taatcccagc actttgggag gcctaggcag gaggatcaca aggtcaggag 901 atcgagacca tcctggccaa catggtgaaa ccccatctct actaaaatac aaaaattagc 961 cgggtgtgat ggcacacgcc tgtagtccca gctactcagg aggctgaggc aggggaatcg 1021 cttgaaacca ggaggtggag gctgcagtga gccgagatca cgccactgca ctccagcctg 1081 gcgacagagc aagactccat ctcaaaaaag aaaagaaaac aaacaaacaa acaaaatccc 1141 caaaaagcaa aagtgtctgc atggacttgt gcctaactac tcacagtggc tacttcttgg 1201 gggaggtgaa aggggaccct gttaaagaat ttttcttctg tattgttttt cttcaactga 1261 gtctgcatta attttatatt taatactctc cccacccctg atgctcacag gttgctttag 1321 ctggtggaag gagaacctgc acctctggtt ttggcaaagt gtagaagggg acaagggcac 1381 tgctctgcac ctgcacaggt tcttgcctcc tggggtcagt ttaatgaatc tcagtggtgt 1441 ttcttgggaa gaacacagtg gacactcact tgccagtcag tagacaaatc actgaagtcc 1501 atgtctggca gttctcaatg tcatggggac tgtaaggttg tcatgtctca cagttccctg 1561 acttagaaga ttagtggtga cagacacttc tcagctctgc tcgggagagc agctctgtaa 1621 tgcgcttgtg gtttcagatg tgggcggcct gtgtgaacct gtcgtgcaaa gctcacgtca 1681 ccaactgctg cagttatctc ctgaatcagg ctgagggtct ttgctgtgca cccagagata 1741 gttgggtgac aaatcacctc caggttgggg atgcctcaga cttgtgatgg gactgggcag 1801 atgcatctgg gaaggtgagt ctgtgctttg ggcttcccaa cctctcaagt cagcatgaaa 1861 ttcagaaggc agagagggac atgtggccct ccaactcggg gcccagggag ccactgtggc 1921 atttgagagc gccaccctag ctttacctcc tctggtgtcc tggccttgtg tatctcttag 1981 agatggcaaa atttgcagcc ctgacctcaa ggattacaaa ctaatttcca gtcctgtagg 2041 ggtcatgggt tctgctgtgg ccagtgtggc actgagtttg tcaggcagaa tttctaatta 2101 tggaaggaac tctgcttcct cagtgattga actgaccttt gtgggggtgc ctttgtgggg 2161 tgagaatggg catacttggg cctcagttgt gatggccatg gggtggagct gaggtctagg 2221 cccagggctg ggaaagcttc taccaacccc gaggcatttg ggtgttttag ggcagaagag 2281 gagccaggag gatgggtatg ccccactgga gctgtgtgtg gggcagcagg tgagggtggg 2341 attccagagg gagggtcaac ccagccaagc agaggaaggg gaggagaggg tttgttttgg 2401 aaagaacatc accctctcag tttcctgggg tctggatagc ctgttcttgt 
gatgagctgg 2461 aggatgtggg ccctgcttgg atcctcctct cccctccctg cccccatttt ctcttcctgt 2521 gatttatgct gttctgggct caccctcttc ccagagctct cacgctacag ctgacaggca 2581 gggcaggcgt tttaattatt attaattttt ttttgagaca gagtttgaga cagtatcaag 2641 agtgggtatc ctctcagcgt gtcttcaagt agcatcccag agccccgctc ctgggtaccc 2701 acaacatgaa caccgtccag aagcaagggc agcctctgca ggtggggcgg gggttggaaa 2761 acatttatca aacggttgag ttgggtgcag gggatgcaac atgatcaaac agggtcttgc 2821 ctccaggagc ctcatgtagg gacaacgcac agtgatgacc ttcaactgca gtggggcagc 2881 aaggctgtgg gaggggtgtt ttgggcagag caggcgacgt gggtacctct tcaccaagac 2941 agcaggaaga gcacggacat cattttcccg tctcacctcc agactcccag gggactgtgc 3001 caatatcctc actggagccc tgcccatggc cactgctcaa cccttgggcc ctgtcagttc 3061 agggctgtgc agaaggagag ttgcctgtgc tcaggctcag gggttccgtc cagctaagga 3121 ggcctactag gggactgggg aaaggcctca tgaaggacca ggcctgtgat ggggagggaa 3181 gatggcagag gggacagtgg gaagcagagc agcaggggtc tcctccacgc ctctctgtac 3241 ttctcccacc cacccttgcc tgctcccctc tgccctgggt atgtgccctt gtccacccaa 3301 cacctctgca gtgccaagtc cagccctgac ttcttcctga gctgtggccc agcgttccca 3361 caggcattta cctcacggat gcagcactgc ccctcatcct cttctctgaa agtgtgtggg 3421 aaatgcttct tggtgccatc tctctcccac cctgccttcc ctgcaccttc tctatgaggt 3481 gtctgttgtc tctgtacttg cctccagctg gcctttcaga ccccatccct ccctgcctcc 3541 tgcccatccc tacccctgtc agcaccttac atactctgtg cccaacagca ttggggattc 3601 ccactgtgct gaaggaagtc ctcatgtggt ccagagaggg agctggcccc tgctcccatt 3661 tccagcacca ttttctggca caggctccat gtccccaggc tccagctgct tctggggtac 3721 caaggtctgc atttcttcct cttcccccag cagacagaag accttgctgg acaggtgtcc 3781 acttcaatag taacttctga ggctgccagc ctcctgaatg gtccagaaga accagaccct 3841 tccccgcccc cattaaaaaa gacaaaatat ggccaggcat gttggctcat gcctgacatc 3901 ccagcacttt gggaggccaa agcaggtgaa tcacttgagg ccaggagtac gagaccagcc 3961 tgacaaacat ggtgaaaccc tgtgtctact aaaaatacaa aaattagcca ggcgtggtga 4021 ctcatgcttg taaatgcttg taatcccagc tacttgggag gctgaagcag gaggatcact 4081 tgaactcggg aggcagaggt tgcagtgagc cgagatcacg ccactgtgtt ccagcctgag 
4141 cgacagagcg acactccgtc taaaaaaaaa aaatatatat atatatacac acatatatat 4201 atgcataata gtctctgtgt aacagtaagt ggatgcagta actggtgtga gggcaacttg 4261 gagagtgtgc ttggaggcac agagatgctc agggctgcct ggactgcctc catgatggtg 4321 gcctgctctg tattaggtga gtgttcagga aaggtgaggg cagggcagcc cacagcttca 4381 cagtggccca gggaagcagg gcaggcaggc tatgaggcct agtaggcatc tgggccagac 4441 tttgacactg aggccatgga atgggtgggg ctctgagaac agaccaaagt gatcatgggc 4501 tgaaggctat gtccacagat ccaaggcggg ataggctgtg ctgggcagtg atgtcagcca 4561 ggctccccag cgggactggg ggtgtcaggg gcagctctgt cccaggtggc agacactggt 4621 ttcccctcct gctctcacaa ccggcctgtt accaggtgtt gtctgagctg tggtgaggct 4681 tccctggtga cattcaggag cagggagcct gtgagtaagg gtgtatgcat ctgccctgac 4741 tgcctggccc tgtggtcaag gatgggggaa ggcagctctg cctgcagctc caccccattt 4801 ataaagcact gtggtgcctt ctgctggggc atgtgctgag tggtgcctcg caggcactgc 4861 cctcgggaag ttcacaggct tatgtggaag ctggtgggaa tgggccaaga agagaggtgt 4921 caggagccag gtattgggca ggtcccaggt ctctgagcct cagtttcttc atctgtagga 4981 gggtggtgac cctgccctgc tcagcttacc aggtacagat gtaagttttc agtgcagagg 5041 aaaagcagag aagttcccct cagatggcca tacccccttg tcgctgtacc caactctccg 5101 gtcctacttt gtagctctca ggtgtcacat gtggatcctg ccctaccatc ccccttccct 5161 gtctagaaac gaggcctgct gagcttggag ccatcccact ccctgctctc aagccgtctg 5221 ctcctgggtt agcctgtggc tggcctggcc tgattctaca tagatgtggg tgtttctcca 5281 ctgctggggc agcagttgtc cattctgggg cctgggtcag ctctcagctg tggccgttgt 5341 gcctgtgctt ccccaggtcc tggtggtgac tccaaccctg ccctcacata tcccaagagc 5401 aggctgactg ccttccccat tcccaccttt ccagtaactg ctgcaagaac ggacagacac 5461 tgctgcagag aacttgccac ggtgtttcat gctgtggctg gtggttccag gctgcacgct 5521 ccattctagg aaagggtgag gcttcctgat catcagtctt aacaggggac tgtcctatgg 5581 gtacctgtat acgctgccgg gagtggggca gagtggggtt agagtagtgc ctgctgccca 5641 ttggggttgt gcgggttcct taagggctgt agtctgtgtg cgtgtctggt tttttcctcc 5701 tccttatcag taatcagtct tgtataacca ggctggccct gcttcctgcc taggggctat 5761 cggtgtacca tctggagttg caaatggggt gatagggcgt cagcggcctt cccacaccca 5821 
agcacttcct gacacccagc ccctcatctc tagccaacct ctggctcccc tagggatcct 5881 ggggctcagg cctcacgctc cagcatccgg gggctttgcc ttctggtgtt gctcttcttt 5941 gcaggtgcct tggaacaggg gtgcaacaga gttcagaaag tgccatctgt tgcatggata 6001 ccctgctgcc accctcgtcc acctttctgg ggcagaacag ttggatgggt ctttatttaa 6061 gaggacaaaa gggagagaga atgcatagag gctgggtctg gtgtcacgcc tgtaatccca 6121 acactttggg aggccaaggc aggtggatcc cttgaggtca ggaggtcaag actagcctgg 6181 ccaacatggt gaaaccccgt ctctactaaa aatacaaaaa aaaaaaaatt agctgggcat 6241 ggcgggagat gcctgtaatc ccagctactc gggaggctga ggcaggataa tcacttgaac 6301 ccaggaggcg gaggttgcag tgagccaaga tcacaccact gcactctagc ctgggtgaca 6361 gagggagact gtctcaaaaa aaaaaatttg cttagaattt gtctgtgtga ccctgggcaa 6421 gtcatttccc ctccttggga ctcagttcct tgcctgtgaa cggggacagt gcttctccct 6481 tacagagctg ttgtgagaat taaagtagaa aatgtaccta tggtggttgt tggtagtgac 6541 cagttccccc aaccctgact cccctgcagg atggggcctg ggcccgggaa tgggggatgg 6601 gctggcagag gatgactgtc ccagagagga gtcttctcgg cagatgtgga gacctagctg 6661 agtcagaggc caggatctaa gtttgagggt gttccttaca ccctgcagcc atgagtcttt 6721 ggctgagtca aatggccttt ctgagctcag ttccttatca gtaaagcctg gacagtggtc 6781 caggacgctg gacactgtgt gagtgttagg acacaggaca ctgtgtgagt gcaggtgggg 6841 acccatggag cactctgctg gggagcaatt catggggagc acccctccag agagggatga 6901 tttgcacagg gccctcagcc cagtcccttg caggctggac cttggagagt gaggccctga 6961 ggcgagacat gggcacctgg ctcctggcct gcatctgcat ctgcacctgt gtctgcttgg 7021 gagtctctgt cacaggggaa ggacaaggtg agggctgggc actaatgtct gtatgaggtg 7081 ggtggagaac tagggcatgt ttgggggact gggttgtccg atgtcaagcc tctagggaaa 7141 ggtttggccc aaactgtgct ggggcatgtc ctctaggggt cagcctggac ctcagtctct 7201 agtctcccta cttttacctc cctaccttca ttccctggac cgactgtagt ctcccttcct 7261 tcactctctt gacgcctctc cagatctgac ttgcccgtgt accacaggtc agagcccatc 7321 acttcccagg cctcccagtg cttccctgga cagattctgg gatcatttac tggtgactgc 7381 cctgctaggg tgtcagctgt cagatcctcc ccaacccccg agctcagctc tggcctgaag 7441 tacttaccgt gggctcctga tggtcactgt ctccagggcc aaggtctaga accttcacct 7501 gcctcaccaa 
caacattctc aggatcgatt gccactggtc tgccccagag ctgggacagg 7561 gctccagccc ctggctcctc ttcaccaggt gagcatggag ggccatgccc acctggacag 7621 ggatgagggt gagttcccca ggattgaagc agctatgcca ggacagtgta gcagccccgt 7681 ggtgctgaca aatgcccttt ccagcaacca ggctcctggc ggcacacata agtgcatctt 7741 gcggggcagt gagtgcaccg tcgtgctgcc acctgaggca gtgctcgtgc catctgacaa 7801 tttcaccatc actttccacc actgcatgtc tgggagggag caggtcagcc tggtggaccc 7861 ggagtacctg ccccggagac acggtgagca gcagctatag gtctggggcg gggccgcttg 7921 gcaagaacat cctggctgct tgggggtttg gagcagggcc ttgcagcctg tgagtggccc 7981 agtgagtgtt ctcagtccca gccgagtgag atccagggct gggggcaggc ttggcccttg 8041 ggaggggagg gcccatatgg ttactgcagg ggcagggttt tggcaggaaa taaacatgca 8101 cggctgctag ttggggcagg ggctggcact tgagtcatgt gaaatgcact tcagtcatac 8161 caggaaggac tccaataaga tgctgggaaa agcttccagc agcagactgt gaaggaaagg 8221 gaaagcaaga tttagaaacc acctagtcta ggtgcagagg ccagaggaag tcattgctgt 8281 cctgtcccgc ctggggcttt tgtggaccag tctcccagtg aggtgcctgg tctgagaggg 8341 ccttgaccat tccccttggg agtctttcag accccagtct tgtgtgttct gactgacaca 8401 cccagaccca tggggcttca gcctcacatg gattcactct gttccagtta agctggaccc 8461 gccctctgac ttgcagagca acatcagttc tggccactgc atcctgacct ggagcatcag 8521 tcctgccttg gagccaatga ccacacttct cagctatgag ctggccttca agaagcagga 8581 agaggcctgg gaggtaacac tttggctggc tttccctggg ggcctctctc ctgggaacag 8641 cagtccaggg tagactcccc actctacata gggagatgtc aacttgtagt gatgagaagg 8701 gaggaactag agcggggtgt gtgtgcacac acacatgctg gcatgcagat gtgtatgctt 8761 tatgtgtgtg tatgggagta gggtgagtgc gcctgtgtct ctgtgtgtgc acataagtgt 8821 ggtgagtgtg catgtatgtg tttatgtgta cacatatgta actgtgcaca ctcatgtttg 8881 tgtgcccatg tttgtgtgtt tatatgtaag tgtacatatg tgtgtgcctg tgccttgcat 8941 ttgtgtgagt gtgcacatgg gcatgcctat gtgtatgagt gtgtatgtga gtgtggtgag 9001 ttgcttctgt gcacacactt ttgtttatga gtgtgcatgc aagtgtgatg agtgtgaaag 9061 tgttcctgta gacatgtttg cctgtgtgtg catatgtgta tttgtgggca aacgcagctg 9121 tgtctgtgag tgtgagtgtg ccttctgtgt gtgtgtgtgc acgtgaatgt ggtgagtgtg 9181 tctgtgtgtt aacacaagtg 
tgttcaagag tgtgttatat gagcatataa tgcatgtgtg 9241 tattctcgag ggctgaggga cccagcccca ccttcaccac ctgctaactg tccccacccc 9301 cacagcaggc ccagcacagg gatcacattg tcggggtgac ctggcttata cttgaagcct 9361 ttgagctgga ccctggcttt atccatgagg ccaggctgcg tgtccagatg gccacactgg 9421 aggatgatgt ggtagaggag gagcgttata caggccagtg gagtgagtgg agccagcctg 9481 tgtgcttcca ggctccccag agacaaggtg ggcactgctg tggctgctgc acttccagcg 9541 gagtctgggc tgggcgtctt ctcccctgtt cacctcagcc ctgcaccctt tcaccctcct 9601 gtaagcccct ccccgaggca gccatgcctc agttgacccc cttcctctga aggtctgagg 9661 tctgtaggga ggacagaaac acctgccaac tctggggctt cctgggaacc tgtagttagt 9721 ggctgctgtt aggagtgagg gtggcagggc tgcacaccag ggctgggctc ctgcctggag 9781 gctggacatg acctcagtgt ccttaatggg ggctggactg acccttgcgc actgcagtgc 9841 tgagatggcc cagggacttt atgacccacc ttgtggcaga tgggaagagt gaggcccagg 9901 agtgtggttc acacaaggtc cttcagcagg tgacacaaac ctccaaggcc catcacaagg 9961 tccttcagca ggtgacacaa aggtggaagg cccatcacaa accttccact ttggcccagg 10021 gcactaaagg gcgcaccttt gccaggtggg tttgggggga gcctcctggc actgaggctg 10081 ctcacagccc tgggcccttc ctgtccacag gccctctgat cccaccctgg gggtggccag 10141 gcaacaccct tgttgctgtg tccatctttc tcctgctgac tggcccgacc tacctcctgt 10201 tcaagctgtc gcccaggtag gtggctgatg tgtgcgtgtg tgtacatgtg tgagcgggca 10261 agagtgtgca tgttagtgta tgtgtgcaga tgtgtgactg tgtgcatgtg tgagtgtgtg 10321 gtgggtgggc catcaagggc cgcccttgtc tggttcctcc cctcccctct ccactgcctg 10381 gtcctggacg gggtgggctt ttcgagtctc caccctggtc cagaagaggg ttctcaagtt 10441 gccaggggag agcagggaag gggggtctga ggcagaggct gaagataagg gcagcttggt 10501 cccgaccaga cccagagtca ccgagatcaa gagccggagg tcctagtctc ctgcctctgg 10561 aggagcttgg tttgttcatt tgttcacatg ttctacgaaa ggacagttgg ggcatctggt 10621 gtgctcagga ccctgttctc agaacaagat ggacaagggc cctccttaaa taaaccgtca 10681 cgcctcatgc accacatgtc aggttacaag acagggtgca ggggccatgt caggccgcag 10741 atgtgaggaa gagggtgtgg ggagaggggt gttgggtggg gctaagtggg agggtgcttc 10801 gcttctcctt ggagcaaagg aaagtgtaaa actgtgtgac acaacctagt ttatgttttg 10861 aggaatagat 
tgctgggaca aggggacatg ggggacctgc tggaggccaa tgggggccag 10921 tgaggtgccc acagggagac gatttgcatg agccagtgaa agaagtggag atagagaaaa 10981 gtggagagaa ttgcagggtc tcatgagggg aaatagaagg gacatgtggg tggatgagat 11041 atgagtggtg acccctgagg tggggaaatg ggggacagcc ttgcagggag gttggtgaag 11101 tctatttgga cctgttgggt tggagcccca ggtgacatcc ttgtgggagg gtcagtggat 11161 ccttggcagc aagtctggtg ctcagggaga cactgggtgg ggtgggagcc acagaccttc 11221 tgctgatggc aaagacaggg ttcctggagg tgctggctcc ctctgtgatc tgaggaccca 11281 gctagtaact ccccgttctg aacccgccat cgcagctcac gctgtaaagg acgcgcgcct 11341 cagtataaat cagttctatg cggccgttag gcaaggaggc ccagttgggt cctgccctga 11401 gagtgggttg gaatgtgatg agatgggaga gaggcagtgg cagggacgag gtgggcggac 11461 ctcctgctga tggaaggaag ctcagcctct gcagtgacct caggccacct gggtcgccat 11521 aggcctctga ctggcctctc ttggcctcag ggtgaagaga atcttctacc agaacgtgcc 11581 ctctccagcg atgttcttcc agcccctcta cagtgtacac aatgggaact tccaggtgtg 11641 tgcagagacc acagaaggac atggggggca ggggttgccc agagctctgg cctgcccaag 11701 atgttggctt tcatgagggt tggcggccag tatgggaggc ttgtcagtgc ttggagcctt 11761 ttttgtatat tcagtgaatt tcaatttata cgcgtatctc aaatggggaa aaattagctt 11821 tattttccat tgctgatttc tttttgttct caggtctcta gttccattgt tgattgaaaa 11881 aatgtaaatt tctcataact tatctctgtc ccctctggtt tgcagctttc tgcacccacc 11941 atgtgcctca cctcctcctt ctgcgaaggt gtctgtcctg tggccatggg gaagggtctg 12001 tggtgtgtgt gctgcccttg gggctctcac tgcctctggg ctcctgctct gcctggtccc 12061 ctggtctccc cggatcacat gatggcacca cagctgagga gtgggctctg cacttccccc 12121 ccttccaccc atgttgggct cctacagccc aggcaccagt gagcaacttg ggggttgcat 12181 cagcccctcc cctccctgct gggctgttgg ttcatgcccc ctgggtggga ggagggggag 12241 agggagagct ccagtgagtg gtctctggtt tttcccctca gactcctcac tttgggcaaa 12301 ggacaagagg cagtgagggc ccctccctgg ggtctgggcc aagctgacca ctcttctcca 12361 gaatcttccc tccctgtccc cttcacactg tggctccagc ttactatgca gaaaaatcct 12421 tttctctctc aatgaggagc gtagttttca agatttttgt ccaaaaatat aatttgaacc 12481 atgaaccggg catctggctc ttggcagagt ctccctcttt ccccaaggtg gtagatgtga 
12541 ctgtcaggag cctgggcagc tgacgacaag gctgagcagg tcagattgtg actgtcccct 12601 ggactgtcat cctgttgcgg gcaccagctg ttccctagag aactaggaca cctgccacgg 12661 gttatttaga ctgcgggtga ggatctggtg ccataggttg gtctccaggg agcactgcag 12721 tgatggaggg tgttgtgtgt gtgatgcatg ggatggaggc tcctggtccc accaagggaa 12781 cagcttcctt ttggaggcgg gggcctcctg tggccccaca gaaggatcca ggtctgctgg 12841 ccatagccga gtgctttgaa agtcaccagt cctgacagcg attcgtgtgt gtgtctgtgt 12901 gtgtgtgtgt gtgtttatgt gtctgtgtgt gttcgtgtgt gtgtctgtgt gtgtgtttgt 12961 gtgtgtatgt ctgtgtgtgt gtgtttatgt gtgtgtgtct gtatgtgttt aagtctgtgt 13021 gtgtttgtgt gtgtgtgtct ctgtgtgtgt gtctgtatct gtgtgtgttt gtgtctgtgt 13081 gtgtttgtgt gtgtgtctgt gtgtgtctgt gtgtgtgtct gtgtgtgtgt ttatgtgtct 13141 gtgtatgttt gtgtgtgtgt ttgtgtgtgt gtgtttgtgt ttatgtgtgt atgtctgtgt 13201 gtgtctgtgt gtctgtgtgt gtatgtgtct gtgtgtgttt atgtgtctgt gtgtgttcgt 13261 gtgtgtgtct gtgtgtgtgt gtttgtgtgt gtatgtctgt gtgtgtgtgt gtttatgtgt 13321 gtgtgtctgt gtgtgtttat gtctgtgtgt gtttgtgtgt gtgtgtctcg tgtgtgtgtg 13381 tctgtgtgta tctgtgtgtg tttgtgtgtg tgtgtgtctc tgtgtgtgtg tgtgtttgtg 13441 tatgtttgtg tgtgtgtgtg tgtgtgttgg gaatgcccag tctctgcagc tgctgaaagg 13501 ccctgaggca catgctgtca ggagctggct ctgtcctggg cagatatcac catctgtacc 13561 tcggttcagg ctgccgtggg caccaggccc tgtgctgggg gagtgctgag gagcctgaag 13621 ggactcaggg tcccgtgatg aggctgggct ggcacatgga ggaaagacag aatgtccaag 13681 acacaggcgc tgcttggcct ctgggtgtgg acctcaggag ggcttcctgg aggaggaggg 13741 atgctgggct tgccagaaag gaggcagctg ctcccaggat gagttctgaa catgctacct 13801 gagcccttcc ctcctcccgt gctctgttcc agacttggat gggggcccac ggggccggtg 13861 tgctgttgag ccaggactgt gctggcaccc cacagggagc cttggagccc tgcgtccagg 13921 aggccactgc actgctcact tgtggcccag cgcgtccttg gaaatctgtg gccctggagg 13981 aggaacagga gggccctggg accaggctcc cggggaacct gagctcagag gatgtgctgc 14041 cagcagggtg tacggagtgg agggtacaga cgcttgccta tctgccacag gaggactggg 14101 cccccacgtc cctgactagg ccggctcccc cagactcaga gggcagcagg agcagcagca 14161 gcagcagcag cagcaacaac aacaactact gtgccttggg 
ctgctatggg ggatggcacc 14221 tctcagccct cccaggaaac acacagagct ctgggcccat cccagccctg gcctgtggcc 14281 tttcttgtga ccatcagggc ctggagaccc agcaaggagt tgcctgggtg ctggctggtc 14341 actgccagag gcctgggctg catgaggacc tccagggcat gttgctccct tctgtcctca 14401 gcaaggctcg gtcctggaca ttctaggtcc ctgactcgcc agatgcatca tgtccatttt 14461 gggaaaatgg actgaagttt ctggagccct tgtctgagac tgaacctcct gagaaggggc 14521 ccctagcagc ggtcagaggt cctgtctgga tggaggctgg aggctccccc ctcaacccct 14581 ctgctcagtg cctgtgggga gcagcctcta ccctcagcat cctggccaca agttcttcct 14641 tccattgtcc cttttcttta tccctgacct ctctgagaag tggggtgtgg tctctcagct 14701 gttctgccct cataccctta aagggccagc ctgggcccag tggacacagg taaggcacca 14761 tgaccacctg gtgtgacctc tctgtgcctt actgaggcac ctttctagag attaaaaggg 14821 gcttgatggc tgttcccaaa gtgttgatgg ctgggagaag gggccagagg aggagtgagg 14881 ggtggggttt gtccagccct gggctttccg ggctctagag atagcatggt gtaggctcaa 14941 tgacagttct ggggacagca agttggaggt tcaggggcag cttcaggaca gcaggatgga 15001 ggctcaggga caattcctgg gaggccagtg ccctcgttcc tccttgtcct catcctcccc 15061 cttgctccag gaaactgaga gctgagcctg gagcttccag acagtcagtg ctgggggtga 15121 ccatccagca gtgatggtgg cctgtgaagg gtcctgcttc tgtcctcagc ctctcatggg 15181 gtgggcttgt ggaggagctg tggtctggag agagtggcag ttggagcaga acgtgcctgc 15241 gtttgtttcc tagggctgtc gtaacaaagt gccacaaaat ggttagctta gaaccacaga 15301 gatttgttgt ctcacaattc tgaagtccag aagttggaaa taaagatgtt ggcagtgttc 15361 ccaaccacat gttcttgggc tcccatgaaa cagaagttga tattaggcca aggaagcttc 15421 ccagacaaga ctttattaag tcttatgccc cgaaagtttg ggcagaagag agacggtgca 15481 ggaggaagaa ttcttggctg actccccaag gggaatgcat tgtggtgtct taaggagggt 15541 gacatacata atttatgagc tacatgagtg tcattgcaca tatggggtgg agcgaagggt 15601 gctcagacgc atgctaacac atacgttgca tgatcagaaa atggcagata agcccctccc 15661 tgggtgagga ctttagtatt atcataagac cagggtcatt ctcctggcct tgtgcacaag 15721 caggtgatgg agtcaactcc cgtcagtaag acttatggcg ggatgctgct tatcttagtt 15781 tatttcagac agttggcaag gtctggccag cgagtatggc acctggaggg tggtgctgca 15841 aggtctagtg gtcagcgggc 
acgtatggaa caatacgtta gtgggggtgg gccgagtccc 15901 atttatactc tctcagcagg gccatgctcc ctctgaaggc actagggaag gattagtttc 15961 aggcctctct tcagcttctg ttagtttctt ggcttgtgac accaaagctg taatctttct 16021 tttgttttgt ttttgtgacg gagttttgct cttgttgccc aggctggagt gcaatggcac 16081 aatctcggct cactgcaacc tcctcctccc aggtttaagc aattcttctg cctcagcctt 16141 tggagtagct gtgattacaa ggcacctgct accacgcctg gctagttttt gtatttttag 16201 tagagacggt gtttcgctat gttggccagg ctggtctcaa actcctgata tcaagtgatc 16261 cgtctgtctc tgcctcccaa agtgttggga ttacaggcat gagccaccgt gcctggccaa 16321 agttccagtc tttacaggga tttttccttg tgtgcatttc tgtgtccaaa tttccccttt 16381 ttaaaatcac aataatagtg aattaaggct ggccctaacg atttaatctt aacttgatca 16441 tctgcaaaga cactatttcc atataaggtc acattcacag ctactggggt taggacttca 16501 acctagaggg cctgacttct ggcccaacac atcatggccc atcccagcat gccccatccc 16561 cttcctgggt gccccaggca gatcacagga gggcctgact gctgggcttt gggctgacat 16621 tgggatcatc tgcctagtta gggctgtgac cagactgaga taggaggtgg gacctgactc 16681 ctgaggcagg gcttgaactc tggaccagat tacagactag ctgaaacagg caaaagcacc 16741 cctccataag acacacccac tggtgccaag tgagtttgcc gttctcatgg taacagctgg 16801 aaattactgc ccctttccat ggcaatgacc tgaaagttac cacccctttt ctagaaattt 16861 ctaaataacc tcctccttaa tttgtatata gttacaagtg ggtataaata tgtgtgcaga 16921 actgcctctg agctgctact ctgggctgac tgcctatggg gcatccctgc tccataagga 16981 gcagtacctc tgctgccact gtgcacagct gcttaaataa aagttgctct ctaataccac 17041 cgactcgccc ttgaattctt tcctgggtga agc // LOCUS HUMIMPDH 6193 bp DNA PRI 22-MAY-1995 DEFINITION Homo sapiens (clone FFE-7) type II inosine monophosphate dehydrogenase (IMPDH2) gene, exons 1-13, complete cds. ACCESSION L33842 NID g602457 KEYWORDS NAD-dependent; differentiation; inosine monophosphate dehydrogenase; inosine-5'-monophosphate dehydrogenase; nucleotide biosynthesis; proliferation associated gene. SOURCE Homo sapiens (tissue library: lambda GEM-11 (Promega)) blood DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6193) AUTHORS Glesne,D.A. and Huberman,E. TITLE Cloning and sequence of the human type II IMP dehydrogenase gene JOURNAL Biochem. Biophys. Res. Commun. 205 (1), 537-544 (1994) MEDLINE 95091778 COMMENT Related sequences J04208 and L08114. FEATURES Location/Qualifiers source 1..6193 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /tissue_lib="lambda GEM-11 (Promega)" /map="karyotype band P21.2-P24.2" promoter 879..1000 /gene="IMPDH2" /note="minimal promoter" gene join(983..1182,1628..1676,1900..2001,2110..2184, 2516..2722,3385..3472,3546..3746,3825..3914,4014..4109, 5163..5451,5545..5688,5769..5852,5943..>6005) /gene="IMPDH2" misc_feature 983 /gene="IMPDH2" /note="major cap site" exon 983..1182 /gene="IMPDH2" /number=1 mRNA join(983..1182,1628..1676,1900..2001,2110..2184, 2516..2722,3385..3472,3546..3746,3825..3914,4014..4109, 5163..5451,5545..5688,5769..5852,5943..>6005) /gene="IMPDH2" misc_feature 1000 /gene="IMPDH2" /note="minor cap site" CDS join(1085..1182,1628..1676,1900..2001,2110..2184, 2516..2722,3385..3472,3546..3746,3825..3914,4014..4109, 5163..5451,5545..5688,5769..5852,5943..5964) /gene="IMPDH2" /EC_number="1.1.1.205" /codon_start=1 /product="inosine monophosphate dehydrogenase type II" /db_xref="PID:g602458" /translation="MADYLISGGTSYVPDDGLTAQQLFNCGDGLTYNDFLILPGYIDF TADQVDLTSALTKKITLKTPLVSSPMDTVTEAGMAIAMALTGGIGFIHHNCTPEFQAN EVRKVKKYEQGFITDPVVLSPKDRVRDVFEAKARHGFCGIPITDTGRMGSRLVGIISS RDIDFLKEEEHDCFLEEIMTKREDLVVAPAGITLKEANEILQRSKKGKLPIVNEDDEL VAIIARTDLKKNRDYPLASKDAKKQLLCGAAIGTHEDDKYRLDLLAQAGVDVVVLDSS QGNSIFQINMIKYIKDKYPNLQVIGGNVVTAAQAKNLIDAGVDALRVGMGSGSICITQ EVLACGRPQATAVYKVSEYARRFGVPVIADGGIQNVGHIAKALALGASTVMMGSLLAA TTEAPGEYFFSDGIRLKKYRGMGSLDAMDKHLSSQNRYFSEADKIKVAQGVSGAVQDK GSIHKFVPYLIAGIQHSCQDIGAKSLTQVRAMMYSGELKFEKRTSSAQVEGGVHSLHS YEKRLF" intron 1183..1627 /gene="IMPDH2" 
/number=1 exon 1628..1676 /gene="IMPDH2" /number=2 intron 1677..1899 /gene="IMPDH2" /number=2 exon 1900..2001 /gene="IMPDH2" /number=3 intron 2002..2109 /gene="IMPDH2" /number=3 exon 2110..2184 /gene="IMPDH2" /number=4 intron 2185..2515 /gene="IMPDH2" /number=4 exon 2516..2722 /gene="IMPDH2" /number=5 intron 2723..3384 /gene="IMPDH2" /number=5 exon 3385..3472 /gene="IMPDH2" /number=6 intron 3473..3545 /gene="IMPDH2" /number=6 exon 3546..3746 /gene="IMPDH2" /number=7 intron 3747..3824 /gene="IMPDH2" /number=7 exon 3825..3914 /gene="IMPDH2" /number=8 intron 3915..4013 /gene="IMPDH2" /number=8 exon 4014..4109 /gene="IMPDH2" /number=9 intron 4110..5162 /gene="IMPDH2" /number=9 exon 5163..5451 /gene="IMPDH2" /number=10 intron 5452..5544 /gene="IMPDH2" /number=10 exon 5545..5688 /gene="IMPDH2" /number=11 intron 5689..5768 /gene="IMPDH2" /number=11 exon 5769..5852 /gene="IMPDH2" /number=12 intron 5853..5942 /gene="IMPDH2" /number=12 exon 5943..>6005 /gene="IMPDH2" /number=13 polyA_signal 6000..6005 /gene="IMPDH2" BASE COUNT 1404 a 1597 c 1628 g 1564 t ORIGIN 1 ggccaagaga aaccagccag gaaaaaccag acagactttc acactaaaga agaggcctcc 61 attttttttt ttcttttttt tattggtgta gttacgaagc ctttcaggct gcttctgttt 121 aaaatataaa agaaaacttt gccccctttg catcttcata aacctgctgc ggcagactcc 181 tcagccgatg gtggctctgg gtttccttga gtgtcatatg tcctagaaag ttgctggctg 241 actctttttt gtctggggcc tggggaaagg gcttggactg tgaaaagaaa tgtggcccct 301 ttccatcttc aagagagatg gaattaatga tggatggacc ctggagggaa tctccccagc 361 cgacttccac tgggctgaca gactttgctg accacagggg aacgatgttc ttttctttct 421 tcatgatcag acataaactt agcatcttaa tggaagaaaa atgaggggaa cttcaattat 481 gatttattaa agacaatttc tattacaccc tcctttatga caagtgacat tttagatgta 541 aaagtaaaaa ctttaccatg cctttttttt ttttgttggc ctaacattga ggccttaaaa 601 cctgaggctc ctgtgcctga tggaattctt gtaacataca cttgtgtatc atataaagat 661 accactctgt ttctcttatg tattcttact ctagttgttt attaagaatg acaagcacgt 721 cttttcaaca tgttagtgaa caacgtctct ttattttggg gaggagcccg gtgggacagt 781 agaagtaaac 
ccttgcctgt taattgaact ggccgaggtc ctgcggggag tgtggggtgg 841 gacgtgaagt ggcagctcct cagtgacaag gccactatgt gctatacgca tgcgctgttt 901 cttcagcgcc agctccgccc ccgccgcagc gaggcgggta ggttccgccc gcgcgcacta 961 cgccctgacg tcagcgtcgc gcgcagcagt gacgaaatcg gctggtttat attggcgcgg 1021 cccagacggc agaggtctct gcggcgcggt cctcggagac acgcggcggt gtcctgtgtt 1081 ggccatggcc gactacctga ttagtggggg cacgtcctac gtgccagacg acggactcac 1141 agcacagcag ctcttcaact gcggagacgg cctcacctac aagtgcgggc ctatggggct 1201 cacctgggcg aagagagggg cggcacctgg ggaaaaggcc agatgggggc ttccgagagc 1261 cgccttgggc gtgggcgtgg gggacaggtt tcgggaattg gccacccacc tccggggctc 1321 actttgagac atctggagcg gtgcttgggg tacggagaag catctcagta gtgcttggaa 1381 tctcaggccg gagattccat gctccagaca catggaattg tcagggtgcc aggagcactg 1441 atgacagctc ggtggcagaa aaggtcccct gtacccattt ccaccttgtg ctgtgcctgt 1501 gacacacaga ctctgtcagt gccactctct gctgacatgc gttacgggtt gattggtaca 1561 tggggatgga gtctcatgct gccctaagct gagagcttta ctcccaattc actcccttcc 1621 ctcgcagtga ctttctcatt ctccctgggt acatcgactt cactgcagac caggtggtga 1681 gtatgatcag gaattgggtc ctgaacataa ggtgagctaa ggtgctacta tcacctggtc 1741 tctgggctaa ctcaccagat tgggcttggg ggtggatttt ataggatcat gttgcttgtt 1801 agcatgcagg tcatacaaaa gtgccttttg tgggggactc agtaacacct tgaggtgtaa 1861 gggatgcttt cccacactgg ttgacgtttg tctcctcagg acctgacttc tgctctgacc 1921 aagaaaatca ctcttaagac cccactggtt tcctctccca tggacacagt cacagaggct 1981 gggatggcca tagcaatggc ggtgagcccc atggggtgtg ggggaaggag caaggactcc 2041 atcgcctttc cccaaaggca ttcagtcctg cttcttgtca cttagtagta gtctcttttt 2101 atcctgtagc ttacaggcgg tattggcttc atccaccaca actgtacacc tgaattccag 2161 gccaatgaag ttcggaaagt gaaggtcaga agggcaacga tcattagcaa gcgctcctgg 2221 gaattgcact gaggtggggt ggggtgggag taggggttta ttctaattta gtattctttc 2281 ttcccaccat ggggttcagt tactgagaag accctgagat tctgtttctt aaagcagcag 2341 caatagacca ggtgtacagt gcctccagcc tacccatgtc tctaagatgt gttggtgtga 2401 tttggtcttg tggcactgcc aaagggatcg ataagcagag accccatgct tcagatcaag 2461 agcctgatga aagtagttca 
aagatgcgat gccctttctc accatccctt tccagaaata 2521 tgaacaggga ttcatcacag accctgtggt cctcagcccc aaggatcgcg tgcgggatgt 2581 ttttgaggcc aaggcccggc atggtttctg cggtatccca atcacagaca caggccggat 2641 ggggagccgc ttggtgggca tcatctcctc cagggacatt gattttctca aagaggagga 2701 acatgactgt ttcttggaag aggtgggtgc cactggcaga gtagatccaa gctctagcag 2761 cccaggttga agacaagggg cttctgttgt ctaagacatc tctgatagcc tcttctctct 2821 gggaaaggag tgtcaaacca agtttctgac tcctttattg ggttcctgca tccttcccct 2881 ccagaggctt tcaaactgcc caaatgcttg ctagtattca tgctttgttc ttttaaaccc 2941 ctaaggtagg ggcagtgctt gaggtctgcc ctgattttct tgccattgtt cctgtgttgt 3001 cctccttatt ccatagtgta agagggtgct ccctgtgcca tgttgtcctt tctactcatc 3061 cctcactcaa taccatactg ctaagaatta ttgggatccc ttgaggaagg gtgttttggg 3121 tgtgaatgta tgaacgtggt ggtccataga acttcacccc tgtaatctca gtactttggg 3181 agacttgagg caggcaaatc atgaggttag gagtttgaga ccagcctggc caacatggtg 3241 aaaccccgtc tctactaaaa atacaaaaaa ttagttgggt gtggtggcgc gtgcctctga 3301 gctcagctac ttgggaagga aagaaaagag ttaactggtc atagtcgatg actggccctt 3361 cttcacatct cacctcccac gtagataatg acaaagaggg aagacttggt ggtagcccct 3421 gcaggcatca cactgaagga ggcaaatgaa attctgcagc gcagcaagaa gggtaagtcc 3481 tagagctggg aaagggcctg gaactaatgc ctagggtcct gattcatgtc ctgccctgac 3541 cacaggaaag ttgcccattg taaatgaaga tgatgagctt gtggccatca ttgcccggac 3601 agacctgaag aagaatcggg actacccact agcctccaaa gatgccaaga aacagctgct 3661 gtgtggggca gccattggca ctcatgagga tgacaagtat aggctggact tgctcgccca 3721 ggctggtgtg gatgtagtgg ttttgggtga gcctgctaca caggtgggtt ggggcaatag 3781 gggcagctgc cacatgacag tctgagactt acttcttgtc ctagactctt cccagggaaa 3841 ttccatcttc cagatcaata tgatcaagta catcaaagac aaatacccta atctccaagt 3901 cattggaggc aatggtaagg caagattgtg ccctgaagga tgtgtggtgg gagtgggtaa 3961 gctgggctcc caccttaacc ttcacatgag cattttcctt tccttcacca tagtggtcac 4021 tgctgcccag gccaagaacc tcattgatgc aggtgtggat gccctgcggg tgggcatggg 4081 aagtggctcc atctgcatta cgcaggaagg taagaatata ctttgatagg gcccacagag 4141 accccagttt caggccccac ataccttgga 
accaagcact cttatgggga caatagagcc 4201 ctccaatagg ggcaaccctg gggccaaatt ggcctcttgg atgggtgaca tggctggaag 4261 caagacaagg tcttgtcatc catcacctgc cactcccttg cctagcctgg ccttcttgct 4321 cctgctctgc accttctcta aactctggtc tgtttgtttt atttagtggc tcccaagatc 4381 cctccagaca taaagtccca cagccccaaa tgcccttcta ctgtcacggg ttgctatagt 4441 aagtccagcc cggcccactg gcccggccca gctgcccact gcccggctgc ttggttgacg 4501 tagctgtagc tcccggggtt tgtggtgctg atgggtgggc aaggccaact gatacacaga 4561 ccatggtttg ggggtggtgc tcaagaacat agctccctct tccttctggg gtgctgatag 4621 ggcaggacct tcttccctag gcaggacctt ttctgcccac atcttctgca gcaggcatag 4681 ttcagggacc aactcctttg tgtggtcctt gctctcccac acaaccactg cctgctccac 4741 acaaccacag cctgctctcc tacacaacca ctgcccttat cttgcctgag ctgcccacca 4801 ttcagggagg aagggctggg cctgacctca gtgcctctgg ggactaactg cacggtgact 4861 cctgccccat ctgataccag ctggacgggg ctttcccggc tgctaagact gcatgtgcct 4921 gaggctgccc tgggcactgc acacatgcac gtgcacatgt gggggtgagt agaggtcagc 4981 acagctgttt gtactattta aggcagatgt ggcagcagcc atcccagaca cgtctgttcc 5041 tcagccaagc tgcagtcctg atgcagatgt agccaaaagc tcctgggtcc taaatatggc 5101 cacagggtcc actgcccgtc cccattaacc ctatccaccc atgtgttcct ccatctcaac 5161 agtgctggcc tgtgggcggc cccaagcaac agcagtgtac aaggtgtcag agtatgcacg 5221 gcgctttggt gttccggtca ttgctgatgg aggaatccaa aatgtgggtc atattgcgaa 5281 agccttggcc cttggggcct ccacagtcat gatgggctct ctcctggctg ccaccactga 5341 ggcccctggt gaatacttct tttccgatgg gatccggcta aagaaatatc gcggtatggg 5401 ttctctcgat gccatggaca agcacctcag cagccagaac agatatttca ggtgggacag 5461 gcaagccagt taccccaccc taatcctgca ctgacagtct catctctgct gtgaccttgc 5521 ccgtgtctct gcctctccct gcagtgaagc tgacaaaatc aaagtggccc agggagtgtc 5581 tggtgctgtg caggacaaag ggtcaatcca caaatttgtc ccttacctga ttgctggcat 5641 ccaacactca tgccaggaca ttggtgccaa gagcttgacc caagtccggt gagcttgggg 5701 agctgggaca tgggtagagg ggctggtcca gggccagcta cccacatttg acctctgccc 5761 ttcttcagag ccatgatgta ctctggggag cttaagtttg agaagagaac gtcctcagcc 5821 caggtggaag gtggcgtcca tagcctccat tcgtaagtca 
ccctgtcctt ggtggggcct 5881 ctgccatacc tcatgcctcc tttctcctgc cctcacctca caggtggtcc ttctgcctgc 5941 aggtatgaga agcggctttt ctgaaaaggg atccagcaca cctcctcggt ttttttttca 6001 ataaaagttt agaaagaaaa aagtgatgcc tgatctttca acagacaggt ggggctctgt 6061 agctgccact accacagatg cacacaaaaa cagcaccctc atttccaggg ggagcctcag 6121 gccccgagat aaatgtgctc catggacctg gaagcgggta gttcaggtcc aaaaagttcc 6181 tctagctctc aca // LOCUS HUMINCP 3716 bp DNA PRI 15-SEP-1992 DEFINITION Human cysteine-proteinase inhibitor (CST1) gene, complete cds. ACCESSION M19169 NID g186399 KEYWORDS cysteine proteinase inhibitor; salivary cystatine. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3716) AUTHORS Saitoh,E., Kim,H.-S., Smithies,O. and Maeda,N. TITLE Human cysteine-proteinase inhibitors: Nucleotide sequence analysis of three members of the cystatin gene family JOURNAL Gene 61, 329-338 (1987) MEDLINE 88185836 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by N.Maeda, 23-MAY-1988. FEATURES Location/Qualifiers source 1..3716 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 12..>3628 /note="CPI mRNA and introns" CDS join(341..568,2079..2192,3311..3394) /partial /note="cysteine-proteinase inhibitor" /codon_start=1 /db_xref="PID:g386825" /translation="MAQHLSTLLLLLATLAVALAWSPKEEDRIIPGGIYNADLNDEWV QRALHFAISEYNKATKDDYYRRPLRVLRARQQTVGGVNYFFDVEVGRTICTKSQPNLD TCAFHEQPELQKKQLCSFEIYEVPWENRRSLVKSRCQES" exon <341..568 /note="cysteine-proteinase inhibitor" /number=1 intron 569..2078 /note="CPI intron A" exon 2079..2192 /number=1 intron 2193..3310 /note="CPI intron B" exon 3311..>3394 /note="cysteine-proteinase inhibitor" /number=1 BASE COUNT 775 a 1005 c 1093 g 843 t ORIGIN 1 bp upstream of HindIII site. 
1 aagcttggga gcactgggga agagaggcat ggctcgggga ggtcgcagtg aggactggag 61 tggggaggag ggggagatgg aggaggaggc ttgggagggg cagggggaac ttaggcagga 121 aaggagcttg tagtagcggg ggagtgaaaa gagagatgga gaaagagggg atgggaagaa 181 agagggagaa agggagtcag gggtggggca tggaggtggg tggggctggg ctgccaaagc 241 aggataaatg cacagctgcc tgctggtctg ggctccctgc ctcaggctct caccctcctc 301 tcctgcagct ccagctttgt gctctgcctc tgaggagacc atggcccagc atctgagtac 361 cctgctgctc ctgctggcca ccctagctgt ggccctggcc tggagcccca aggaggagga 421 taggataatc ccgggtggca tctataacgc agacctcaat gatgagtggg tacagcgtgc 481 ccttcacttc gccatcagcg agtataacaa ggccaccaaa gatgactact acagacgtcc 541 gctgcgggta ctaagagcca ggcaacaggt aggtgctccc tccaccccag gggtcctggg 601 tcccagcctg gtttgttccc caacccccaa gagcattccc agcaaatcaa cactgataca 661 ttcatgatct aatgctcaga ttcattcagc tttccctggc tctccgctga tgcccttcat 721 gcctaagcac gctccccggc cgtgcacaaa ctcagcttcc tttaacctgc agcagccact 781 gtgtctgtac catgactgtg gcatttccca gggtccagca ggtgtggatg gagactgtgc 841 ttactctggg tgggcttgat gctgctcagg atgagatcca ggccatgagg ttcatactcc 901 tccctgagtc ctctctgcag gggccacaca ggaacctggc tcactgttct gcagagccct 961 gcttccccaa gtcacgcccc tgggcacagc cccttatggc tagcggcctt caccctcagg 1021 cccggctgac aaactcccac agcctagggc gctgagtccc tgctggggtg gagcatgcct 1081 gaccctgcct ctaccagctg atgcagttag acctcagcca gatgaggaca gtggtcaccc 1141 agcagagcag aggaggggtc aggtcgggag ggagcttcag cagggcaact gggcccagct 1201 tgacctgcat cccatggcac agcagcaaat agtgacacag tctttagagc tcctccacct 1261 tctcctgaaa ttcaaaggaa tccccaccag ccccgtttct cctcttgcag ctgtcagctg 1321 gggctctctc cctgcatacg agatacactc cctggtgccg tggtccccgc tggcctgcat 1381 ctccctttca agcatgacag taacttggag tgaagcacag ggcattgcag accatcaggc 1441 ccagaagcct attttagaca tgggtaaact gacactcgag ggatctcagc agttcctcct 1501 ggttccaaag agtccctcat cccaggtttc tccacagctc tgccacattg tgtctgggaa 1561 aggccctatg cagggaaagg gttcaattct aatctgcaac tgtaagacac gcaggtgtgc 1621 tgctgacttg agaaatgtat cttgaatctc acacttgaaa tggtggcatc cggacggccc 1681 cattgatcca aaatatctgt 
gtgtgtgaag catctcattt cctactctga gtgaagtaat 1741 aaatctatgt taaatggagg gaataagatt ttcagaagtt aggtgaaatt ttgtcatcag 1801 acagaacttc ctagaaaaga gtcagtgttc cctcgcccct gagccacaga cagcagaatt 1861 caatgaatcc ttttacccag cacagagaaa gcaatgttta agagcgggta tgaggctcag 1921 caccctgcca gttgacagga agagggggct tgtgtgcctt gtgttgacat gtgggcagct 1981 cacgaagccc ccaagcaagt ccagtgactc agccacagtg aagtgcctgt gagtgcatga 2041 actgatgggg gcgctgtcct gttttctcct gtgtgcagac cgttgggggg gtgaattact 2101 tcttcgacgt agaggtgggc cgaaccatat gtaccaagtc ccagcccaac ttggacacct 2161 gtgccttcca tgaacagcca gaactgcaga aggtacgttc ctgatgcagg tcccgggcca 2221 gtcatgcact gcagaggggt gcgtatgtgt cagcctctgc cctacacatg tttggagggt 2281 gtgtgtgtgt gcaggtgggt atgtggggag tcatgtatgc atggatgtgt acatgttcat 2341 gtacttgtgg aggggtgtgc ctgtaggtgt gcatgtggaa aggtacacgt gtgtacacac 2401 ctgtgccagt gtgtgcaggg aggtggatgg gagcatgtgt gcctgtgcat ggatgtgtgg 2461 ggggtgtatg gggctttgta catagatcca tggggatgag gggtccaagt gagtttacgt 2521 agttgtccat gtatgtgcag atggggtggt gagggaggag ggtgatgtgt ttgttttgct 2581 aggaaggctt taggttggga atggttacta taaggtcaat tctgcctgct ttggagtgtt 2641 gcctgttgga caggaagaag cagctgtgcg gctgtgtgct gggcagggag aaggggctct 2701 gtctaatccc aggctcaggc acctgcatgc agccacagcc acagtgatca gattagtggg 2761 acctagaggc ctgttagctg ggaagccctg gacctgcccg gctcacccaa caccagcctc 2821 tccaaggacc tgctggttct tgtgaggtct ccactcgggg aagagcctga gcactcccct 2881 tgttgccctt gccccatacc ccagctcttt gagggggagt tgccctgccc tggttcttcc 2941 ctctggcccc tcttagtgct ggcctggtgc tggaagtgga aggagctggg ggaactgagc 3001 cgcctcccca tgccctgcac ccttggggct cccgaggcct gcccaggcta ctcctcacag 3061 ggctgtgctg ggacaggaca ctgcaggctg gggtggggtc ccaatgccac ctggtgactt 3121 ggagccttgg gaggggcaat ggaacagtca ctattcattc tagttcagca ctctgggact 3181 cagtaggggt gggtgagggc ccagtgtctc acctccatcc tcctcaccca ggctctgaca 3241 tctcatgcct gggcatcttc ccctttaact gtaacccaca ctgattggcc ctctctcttc 3301 cctttcacag aaacagttgt gctctttcga gatctacgaa gttccctggg agaacagaag 3361 gtccctggtg aaatccaggt gtcaagaatc 
ctagggatct gtgccaggcc attcgcacca 3421 gccaccaccc actcccaccc cctgtagtgc tcccacccct ggactggtgg cccccaccct 3481 gcgggaggcc tccccatgtg cctgtgccaa gagacagaca gagaaggctg caggagtcct 3541 ttgttgctca gcagggcgct ccgccctccc tccttccttc tcgcttctaa tagcctaggt 3601 acacacaccc ccacctcccg caattaaaca gtagcatcgc ctccctctga gttcttgagt 3661 tcttggctgt ctggggatgt gcacgcaggc agggtttctg cagttccttt atgaag // LOCUS HUMIRBPG 9711 bp DNA PRI 06-JAN-1995 DEFINITION Human interstitial retinol-binding protein (IRBP) gene, complete cds. ACCESSION J05253 NID g186534 KEYWORDS interstitial retinol-binding protein. SOURCE Human DNA, clone HGL.3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9711) AUTHORS Fong,S.L., Fong,W.B., Morris,T.A., Kedzie,K.M. and Bridges,C.D. TITLE Characterization and comparative structural features of the gene for human interstitial retinol-binding protein JOURNAL J. Biol. Chem. 265 (7), 3648-3653 (1990) MEDLINE 90154038 FEATURES Location/Qualifiers source 1..9711 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /clone="HGL3" /map="chromosome 10" prim_transcript 1..>9505 /note="IRBP mRNA and introns (alt.)" prim_transcript 7..>9505 /note="IRBP mRNA and introns" prim_transcript 15..>9505 /note="IRBP mRNA and introns (alt.)" CDS join(130..3180,4966..5156,7017..7159,8766..9121) /gene="IRBP" /note="precursor" /codon_start=1 /product="interstitial retinol-binding protein" /db_xref="PID:g386835" /translation="MREWVLLMSVLLCGLAGPTHLFQPSLVLDMAKVLLDNYCFPENL LGMQEAIQQAIKSHEILSISDPQTLASVLTAGVQSSLNDPRLVISYEPSTPEPPPQVP ALTSLSEEELLAWLQRGLRHEVLEGNVGYLRVDSVPGQEVLSMMGEFLVAHVWGNLMG TSALVLDLRHCTGGQVSGIPYIISYLHPGNTILHVDTIYNRPSNTTTEIWTLPQVLGE RYGADKDVVVLTSSQTRGVAEDIAHILKQMRRAIVVGERTGGGALDLRKLRIGESDFF FTVPVSRSLGPLGGGSQTWEGSGVLPCVGTPAEQALEKALAILTLRSALPGVVHCLQE VLKDYYTLVDRVPTLLQHLASMDFSTVVSEEDLVTKLNAGLQAASEDPRLLVRAIGPT 
ETPSWPAPDAAAEDSPGVAPELPEDEAIRQALVDSVFQVSVLPGNVGYLRFDSFADAS VLGVLAPYVLRQVWEPLQDTEHLIMDLRHNPGGPSSAVPLLLSYFQGPEAGPVHLFTT YDRRTNITQEHFSHMELPGPRYSTQRGVYLLTSHRTATAAEEFAFLMQSLGWATLVGE ITAGNLLHTRTVPLLDTPEGSLALTVPVLTFIDNHGEAWLGGGVVPDAIVLAEEALDK AQEVLEFHQSLGALVEGTGHLLEAHYARPEVVGQTSALLRAKLAQGAYRTAVDLESLA SQLTADLQEVSGDHRLLVFHSPGELVVEEAPPPPPAVPSPEELTYLIEALFKTEVLPG QLGYLRFDAMAELETVKAVGPQLVRLVWQQLVDTAALVIDLRYNPGSYSTAIPLLCSY FFEAEPRQHLYSVFDRATSKVTEVWTLPQVAGQRYGSHKDLYILMSHTSGSAAEAFAH TMQDLQRATVIGEPTAGGALSVGIYQVGSSPLYASMPTQMAMSATTGKAWDLAGVEPD ITVPMSEALSIAQDIVALRAKVPTVLQTAGKLVADNYASAELGAKMATKLSGLQSRYS RVTSEVALAEILGADLQMLSGDPHLKAAHIPENAKDRIPGIVPMQIPSPEVFEELIKF SFHTNVLEDNIGYLRFDMFGDGELLTQVSRLLVEHIWKKIMHTDAMIIDMRFNIGGPT SSIPILCSYFFDEGPPVLLDKIYSRPDDSVSELWTHAQVVGERYGSKKSMVILTSSVT AGTAEEFTYIMKRLGRALVIGEVTSGGCQPPQTYHVDDTNLYLTIPTARSVGASDGSS WEGVGVTPHVVVPAEEALARAKEMLQHNQLRVKRSPGLQDHL" sig_peptide 130..177 /gene="IRBP" gene join(130..3180,4966..5156,7017..7159,8766..9121) /gene="IRBP" exon <130..3180 /gene="IRBP" /number=1 mat_peptide join(178..3180,4966..5156,7017..7159,8766..9118) /gene="IRBP" /product="interstitial retinol-binding protein" intron 3181..4965 /note="IRBP intron A" exon 4966..5156 /gene="IRBP" /number=2 intron 5157..7016 /note="IRBP intron B" exon 7017..7159 /gene="IRBP" /number=3 intron 7160..8765 /note="IRBP intron C" exon 8766..>9121 /gene="IRBP" /number=4 BASE COUNT 2246 a 2672 c 2641 g 2152 t ORIGIN Chromosome 10q11.2. 
1 agccccagac cttctgtcca ccagctgaga aggacaaggg cggaaggcag ctgcacagag 61 cagggccacg gcttgcacac agtccaggga gcttttgtgc aggagccagg cctccccctg 121 gtccccatga tgagagaatg ggttctgctc atgtccgtgc tgctctgtgg cctggctggc 181 cccacacacc tgttccagcc aagcctggtg ctggacatgg ccaaggtcct cttggataac 241 tactgcttcc cggagaacct gctgggcatg caggaagcca tccagcaggc catcaagagc 301 catgagattc tgagcatctc agacccgcag acgctggcca gtgtgctgac agccggggtg 361 cagagctccc tgaacgatcc tcgcctggtc atctcctatg agcccagcac ccccgagcct 421 cccccacaag tcccagcact caccagcctc tcagaagagg aactgcttgc ctggctgcaa 481 aggggcctcc gccatgaggt tctggagggt aatgtgggct acctgcgggt ggacagcgtc 541 ccgggccagg aggtgctgag catgatgggg gagttcctgg tggcccacgt gtgggggaat 601 ctcatgggca cctccgcctt agtgctggat ctccggcact gcacaggagg ccaggtctct 661 ggcattccct acatcatctc ctacctgcac ccagggaaca ccatcctgca cgtggacact 721 atctacaacc gcccctccaa caccaccacg gagatctgga ccttgcccca ggtcctggga 781 gaaaggtacg gtgccgacaa ggatgtggtg gtcctcacca gcagccagac caggggcgtg 841 gccgaggaca tcgcgcacat ccttaagcag atgcgcaggg ccatcgtggt gggcgagcgg 901 actgggggag gggccctgga cctccggaag ctgaggatag gcgagtctga cttcttcttc 961 acggtgcccg tgtccaggtc cctggggccc cttggtggag gcagccagac gtgggagggc 1021 agcggggtgc tgccctgtgt ggggactccg gccgagcagg ccctggagaa agccctggcc 1081 atcctcactc tgcgcagcgc ccttccaggg gtagtccact gcctccagga ggtcctgaag 1141 gactactaca cgctggtgga ccgtgtgccc accctgctgc agcacttggc cagcatggac 1201 ttctccacgg tggtctccga ggaagatctg gtcaccaagc tcaatgccgg cctgcaggct 1261 gcgtctgagg atcccaggct cctggtgcga gccatcgggc ccacagaaac tccttcttgg 1321 cccgcgcccg acgctgcagc cgaagactca ccaggggtgg ccccagagtt gcctgaggac 1381 gaggctatcc ggcaagcact ggtggactct gtgttccagg tgtcggtgct gccaggcaat 1441 gtgggctacc tgcgcttcga tagttttgct gacgcctccg tcctgggtgt gttggcccca 1501 tatgtcctgc gccaggtgtg ggagccgcta caggacacgg agcacctcat catggacctg 1561 cgccacaacc ctggagggcc atcctctgct gtgcccctgc tcctgtccta cttccagggc 1621 cctgaggccg gccccgtgca cctcttcacc acctatgatc gccgcaccaa catcacgcag 1681 gagcacttca gccacatgga 
gctcccgggc ccacgctaca gcacccaacg tggggtgtat 1741 ctgctcacca gccaccgcac cgccacggcc gcggaggagt tcgccttcct tatgcagtcg 1801 ctgggctggg ccacactggt aggtgagatc accgcgggca acctgctgca cacccgcacg 1861 gtgccgctgc tggacacacc cgaaggcagc ctcgcgctca ccgtgccggt cctcaccttc 1921 atcgacaatc acggcgaggc ctggctgggt ggtggagtgg tgcccgatgc catcgtgctg 1981 gccgaggagg ccctggacaa agcccaggaa gtgctggagt tccaccaaag cctgggggcc 2041 ttggtggagg gcacagggca cctgctggag gcccactatg ctcggccaga ggtcgtgggg 2101 cagaccagtg ccctcctgcg ggccaagctg gcccagggcg cctaccgcac agctgtggac 2161 ttggagtctc tggcctctca gctcacagca gacctccagg aggtgtctgg ggaccaccgc 2221 ttgctagtgt tccacagccc tggcgagctg gtggtagagg aagcaccccc accaccccct 2281 gctgtcccct ctccagagga gctcacctac cttattgagg ccctgttcaa gacagaggtg 2341 ctgcccggcc agctgggcta cctgcgtttt gacgccatgg ctgaactgga gacagtgaag 2401 gccgtggggc cacagctggt gcggctggta tggcaacagc tggtggacac ggctgcgctg 2461 gtgatcgacc tgcgctacaa ccctggcagc tactccacgg ccatcccgct gctctgctcc 2521 tacttctttg aggcagagcc ccgccagcac ctgtattctg tctttgacag ggccacctca 2581 aaagtcacgg aggtgtggac cttgccccag gtcgccggcc agcgctacgg ctcacacaag 2641 gacctctaca tcctgatgag ccacaccagt ggctctgcgg ccgaggcctt tgcacacacc 2701 atgcaggacc tgcagcgggc cacggtcatt ggggagccca cggccggagg cgcactctct 2761 gtgggcatct accaggtggg cagcagcccc ttatatgcat ccatgcccac ccagatggcc 2821 atgagtgcca ccacaggcaa ggcctgggac ctggctggtg tggagcccga catcactgtg 2881 cccatgagcg aagccctttc catagcccag gacatagtgg ctctgcgtgc caaggtgccc 2941 acggtgctgc agacggccgg gaagctggtg gctgataact atgcctctgc cgagctgggg 3001 gccaagatgg ccaccaaact gagcggtctg cagagccgct actccagggt gacctcagaa 3061 gtggccctag ccgagatcct gggggctgac ctgcagatgc tctccggaga cccacacctg 3121 aaggcagccc atatccctga gaatgccaag gaccgcattc ctggaattgt gcccatgcag 3181 gtgagaccca agagagacct ggctgaaccc agtcccggga gtgagttgac ccattgtccg 3241 cacatgcagg gctctgtgca cagtgcgtga caatggcttt tagatttgtt ctcacgttta 3301 agttttgacc ggtcaagtcc tttcctcttt ctcaacctgt tccatccact ctctgtgacc 3361 ctggggttgc tgaacacctc tgtagaacat 
tcatattagg ttggtgcaaa agtactttca 3421 atggcaaaac ccgcaattac ttttgcaccc acctcacagg aagccagttt gaaagccaac 3481 caatactcac aggaagccag ttcggaagct cctgggatag aaggcatttc agccttggct 3541 gggtggaagg tgagtgttgg cagggcttct cattgtcagt gctagggaag aggccaacac 3601 ctgtcagagg tggccaatgg acttcaccaa gtgccccacg ctggccgaga gctccaccta 3661 ggcagcactc acacctccac actgttctac ctgtggtctg ctgcatcgtc acaattgggc 3721 agggcagcat ttgccatggg atcccttgca aggagggtct gagaccaggg cttgggtgca 3781 ggcgctttgt ctgggaggcg gttactgaag caggcatgaa ggagggagca gggagagtgg 3841 gttgggaagc tgacaggcag gtgcctcaaa gctgttctgc tgaagccagg accctgacga 3901 gtgcagggat gctcccaggc atagccctct gcaggcaggg cccagtggtt cctgtcccgc 3961 attggccaag agttgccctg agacatgcct cagggtgggg caggctccct cttcttggag 4021 aaggcctgag ctacgggtgg aaaggcaggg ctgtgctata ggagagcctg tcagtggggt 4081 cgggtgcagc tgaaagcaga gggggctgag agtgccaaaa gcatctacta cagaattgtt 4141 atccccattt tccgcaactg aggcccggag aggagcagag gggagtggcc tgaggccaga 4201 gagctgtgac tgagggcagg gcagggcctg gagggcagtg tctctgtcaa tgaagtctcc 4261 ttgcctgtca atctcaccaa gacctgcctc cctccagcag ccttagagag ggaggaggag 4321 gtgcatccac gtgcgagtag cctgtgctag gcttgcagaa tccccagttt ccaaatcaac 4381 atctccttcc tttccagtat agccaaggtt cacgatttgg agtcagatgt ggattcagat 4441 tctggctcca ccacttactg actgtgtgac ctgagactag ttacttaatc tctctgtgct 4501 tcagtttttc catggaaaag atggggatcg tgttatctcc tgtacaggtg gctgtgagga 4561 tgatgataag ctctacaaag tgcttagtac agggccaggt gcctggtaaa ggtaactaac 4621 atcttccaat cctgccccag tggagcagct tagagacata ggaagtatct ggtaaggttg 4681 gagaggtcag aggggagacc attcctgggc cttctagctc aggacaccag ggcatgtggg 4741 tggcagacag gagcatcctc tgcaaggagg ctgcccatag atcacacatg tccagtggca 4801 tgtcacatcc agacatgcca ctgggaaagt ccctggtgtc tactaattcc ttcagaaatg 4861 ttagttcctg tcccatgccc ttaatatttc ccgtggcgcc tgctttcctg ggctctaaaa 4921 ctggctgctc ctcctgacac tgagtaggac ctccaactct tacagatccc ttcccctgaa 4981 gtatttgaag agctgatcaa gttttccttc cacactaacg tgcttgagga caacattggc 5041 tacttgaggt ttgacatgtt tggggacggt gagctgctca 
cccaggtctc caggctgctg 5101 gtggagcaca tctggaagaa gatcatgcac acggatgcca tgatcatcga catgaggtca 5161 gtggccaggg gtcagtgctt cctagccagg acgcagggct gccaggggac agtcaaagct 5221 atgggccaca gcagggaaga aaaggaaccc tgtgacacag cagaggacct gaggggggcc 5281 aggcctcctg cctatcaggg cttcggccag ctagcccagc ggatctgctg tcattgcagg 5341 tggggcccag cactggtagt ttttgcaagc ttctccagtg attccaatgt gcattccaga 5401 ttgagaatta ccccagattt gcctggtcat aggattcagt ggggaggaga ggaaggagat 5461 gttaaaaaca ctgattcctg ggccccactc ccctcttact gactcagaag ctccagggga 5521 caggcccgat aatctctatt tgacaggcat cccaggtgat tctgatggtc agtggtgtgt 5581 gagacagcca ggccccccac ctgggagtgt gcctcacatc atacaaaatc aacacatgac 5641 aaccctggtc atatgaatcg tacagggcag aatttggaaa tcacacaggg aaaatttaga 5701 agttaatcac tattgttaac aaaagaattc tttgcatcac caaatgattg tcagaaggat 5761 gtcttcagct agccacctcc cctctgtttc tgtaaaaatg tgcctcatat gtcatgagtt 5821 tacctcatgg accaggggta ggaggtggat gagtaccctc aggcagtggc agacctggca 5881 tcccatgact ctggggtcca ttggtggaca tgccccagca ccctccctgg gcaggttgcc 5941 ttcctcagac gggacccttc catgctcccg tgttgtctga aagaagtggc agggggtggc 6001 agaccagaag cattcaccca gctccagttc tctagccctc agcgctcact cacatggttc 6061 ttagtggtca ctaggtggct ctcggtcctg cacccactgc ttctcctccc tgggagataa 6121 aacattagcc ctggctccag ttctgtgtgt gttcacagcc tgagcttgag gctcctgggg 6181 tctctctgat cattttattc tttagaagtt ctgttgagca cccaaatgac actcatgcta 6241 agtctctttc ttctgaatat gccatttaca ctttttaaag aggagatggc cccagcctgc 6301 aagtgcactt tgcttaaaga ccctcccacc cctgaggttt gtttggaggt tcagacactg 6361 acatcacagg ccccgaaccc agccaggcca tcccaggcca tcagttggac gaggctcctt 6421 aacacatcac tagccccagt ggggagacag gggcccaacg aggtcacaca gtgggacaaa 6481 tctaaatggc ctgggagaaa tacaggctcc cctgctccca atgcagctcc acaagcctga 6541 atatttcaat acactaagtg ccctgttaag acacctgaac taagacagct ggtaggtgtc 6601 ataagattgt cacagaatgc aacgggaaaa acaatgtttc ccaaaatgtg ttccactcat 6661 aggaagatat gggtgcaggg gagcagtcca tggtcaataa atgggggaaa tttatcaaca 6721 agtttctgac tgcaggactt gtcagggcct ttaatatatt aacatgtatt 
aattacatag 6781 cagaaggaaa ggataacata tttgatcact aagtttcttg ttctaaagag actatgactc 6841 ttggaaatac tctggaaaat gccaacccat ggaattctaa aggaggaaga ggcactagac 6901 tgaggctgcc aagaattaga accctccatg ggacaaagat cctggcctcc ccgaggggca 6961 cacagggcct cactgtgagc tcagcccctg aacaggctct gcttcccatc cttcaggttc 7021 aacatcggtg gccccacatc ctccattccc atcttgtgct cctacttctt tgatgaaggc 7081 cctccagttc tgctggacaa gatctacagc cggcctgatg actctgtcag tgaactctgg 7141 acacacgccc aggttgtagg tacgtggaga agctttctcc tttctgctgt cattctagaa 7201 gcttctggga aaccagggaa agacagcttg ggtgtaggta aaagtcatag taaccagaat 7261 gtaaggccct gtcctagtcc aggcactaga tgctgcatga tctcgaatcc ctgaaaagca 7321 tgtatcacta tgcccgtttt ctagaggaaa aaaaaggggg tgttgactag gtcaaatatc 7381 ttttctaggg tcatgcagct aatacagcta ataagtgaaa cccaaaatgt gaatattagt 7441 ttgtctaata gccaaaattt cccccaggct tagcaccata atcaccagcc tgcaggacaa 7501 atgcagaatt tgaaggattc tccccgcccc tcaccaaccc cagtggcatt gccagtgctc 7561 acagtacttg agctctcaag ccacagtgcc acaaacttgc ctggaaaggt catgatgctg 7621 atcgggcttc tgttccccgc cctgggccca gagcagtggc tataggagca aaggcaggac 7681 cctgagcttt acagaaccat aactctcaaa gccccattcc agggtggatg gcctccatct 7741 atcttccctg cccagtgtgt aagaccaagg aacttgtgct cctacccaac aagaaaggac 7801 tggagagacc gatgggggtg tgcctgtgac cagagaaaca ccccaccatt ttcatgtcca 7861 tgcttctgta atacacaggc attttgctgt agctcctagg acatacgttc aataatatta 7921 caagtgtaaa tgcaaatcag aaccatggtg aattctaaat tcaaatagat gtttcaagac 7981 tggaagttaa agatagactc aattctgcag ataatgaggc cccaaaaccg caatgtccca 8041 aattaattgg gagccaacaa gccaaccttg cggccaccaa gcattttgaa aatgtggctc 8101 tcccaagttg agatgtgctg gaagtgtgat gtgctggaag tgtaatgtgc ataccagatt 8161 ttgaagacat tatgaaaaga aaacctcatt aataatttta tttgattaca ttttgaaaag 8221 aatgtatttt ggatacattg gcttaaacaa aacatgatta aaattaacac catgggtttc 8281 tttttgcttg cttaacatgg ctactaggga acctaaaatg acatctatgg ccagcattct 8341 ggctcacgtt gcattcctgt tggacagtgc tgtcctggag agttgttctt atacatcaca 8401 aattgggctt ttctgagtcc aaagtgaaaa ggaagtcagg tcggtttagg tgttctcatt 
8461 attcatgaag agagaggatg ctcctttctc aggagagtca agcctttcct atcacttggc 8521 ataaaaaggc ctcgttggga gttgtcaagc actgaaatcc tcgtgattgg gcatgcattc 8581 ttgtccccat tttacggatg cagaaatcaa ggctcagaga agtcaagtgg ctcacaagtg 8641 gacacaagga cacagaccta aacccccggg cctcctccat cactgcaggc ccaggcagga 8701 tagagaagac aggtgctcca gggtcctgac atgaccccca tcctgaaggg ccttatgtct 8761 tccaggtgaa cgctatggct ccaagaagag catggtcatt ctgaccagca gtgtgacggc 8821 cggcaccgcg gaggagttca cctatatcat gaagaggctg ggccgggccc tggtcattgg 8881 ggaggtgacc agtgggggct gccagccacc acagacctac cacgtggatg acaccaacct 8941 ctacctcact atccccacgg cccgttctgt gggggcctcg gatggcagct cctgggaagg 9001 ggtgggggtg acaccccatg tggttgtccc tgcagaagag gctctcgcca gggccaagga 9061 gatgctccag cacaaccagc tgagggtgaa gcggagccca ggcctgcagg accacctgta 9121 gggaagggcc ccataggcag agccccaggg cagacagaac ctctgggaca cacaccaagg 9181 gcactcctgc aggtggcccg gcctgaggtt cccaggagca gcaaaggggc ctgctgagct 9241 ctggttaggt tacagctgga ggtgtgtata tatacacaca cacacatgta tatacacata 9301 tatatgtgta tgtatatata tgtatatata tatggctttc caataaccac ctaaatttta 9361 acaaaggttc cttctaagtg gtagaacttg gggtggtatt tttaccttcc ttcttcatac 9421 tttgctcttt ttcttaaata ctcattaatg tgcatatatc attattttca gatgcagcta 9481 tcattattcc aaaatacaaa ataaagaaga taaaataaat tatatacccg agccattaac 9541 cacaatttag tgatcactag tggttgttaa atctcttctt ttcatgagtt ctccattcac 9601 attatttaga tcttctggat ctgccctgct gacctctatg gacagctgat tttatttaag 9661 atgtggtgca agatgctgtg gggtcagcag gtggcaaaga tcttgtggat c // LOCUS HUMKALLIST 9618 bp DNA PRI 06-JAN-1995 DEFINITION Homo sapiens kallistatin (PI4) gene, exons 1-4, complete cds. ACCESSION L28101 NID g609489 KEYWORDS PI4 gene; kallistatin; protease inhibitor; protease inhibitor 4. SOURCE Homo sapiens (tissue library: Stratagene lambda Fix II) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9618) AUTHORS Chai,K.X., Ward,D.C., Chao,J. and Chao,L. 
TITLE Molecular cloning, sequence analysis, and chromosomal localization of the human protease inhibitor 4 (kallistatin) gene (PI4) JOURNAL Genomics 23 (2), 370-378 (1994) MEDLINE 95137583 FEATURES Location/Qualifiers source 1..9618 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="Stratagene lambda Fix II" /map="14q31" mRNA join(<3201..3849,6685..6958,7844..8003,9110..>9310) /gene="PI4" /note="G00-266-537" exon <3201..3849 /gene="PI4" /note="G00-266-537" /number=1 gene join(<3201..3849,6685..6958,7844..8003,9110..>9310) /gene="PI4" CDS join(3201..3849,6685..6958,7844..8003,9110..9310) /gene="PI4" /standard_name="protease inhibitor 4" /codon_start=1 /db_xref="GDB:G00-266-537" /product="kallistatin" /db_xref="PID:g619783" /translation="MHLIDYLLLLLVGLLALSHGQLHVEHDGESCSNSSHQQILETGE GSPSLKIAPANADFAFRFYYLIASETPGKNIFFSPLSISAAYAMLSLGACSHSRSQIL EGLGFNLTELSESDVHRGFQHLLHTLNLPGHGLETRVGSALFLSHNLKFLAKFLNDTM AVYEAKLFHTNFYDTVGTIQLINDHVKKETRGKIVDLVSELKKDVLMVLVNYIYFKAL WEKPFISSRTTPKDFYVDENTTVRVPMMLQDQEHHWYLHDRYLPCSVLRMDYKGDATV FFILPNQGKMREIEEVLTPEMLMRWNNLLRKRNFYKKLELHLPKFSISGSYVLDQILP RLGFTDLFSKWADLSGITKQQKLEASKSFHKATLDVDEAGTEAAAATTFAIKFFSAQT NRHILRFNRPFLVVIFSTSTQSVLFLGKVVDPTKP" intron 3850..6684 /gene="PI4" /note="G00-266-537" /number=1 exon 6685..6958 /gene="PI4" /note="G00-266-537" /number=2 intron 6959..7843 /gene="PI4" /note="G00-266-537" /number=2 exon 7844..8003 /gene="PI4" /note="G00-266-537" /number=3 intron 8004..9109 /gene="PI4" /note="G00-266-537" /number=3 exon 9110..>9310 /gene="PI4" /note="G00-266-537" /number=4 BASE COUNT 2606 a 2301 c 2350 g 2361 t ORIGIN 1 tgggccaagc ttagtatacc tagagtgtat aggcaaggga aagagagaga tgactgaggg 61 aaagcgagaa tccactgaga cagtagctct cactttgcct gacctgctag cttcctctct 121 tgtcctggtg ttgacccatt ttggacctca ggccagctgg ctccagggct gtgcttgtgc 181 ctgctctgcc atccccatgt ctacatcgct cagggctccc agcccactgt gctgctcagt 241 ggtgtatatt tcacttacct taaggcccaa cgacttggaa acacatccag aaagtctgct 301 agaggatgcc tgaatggtca aggtcatctc agtcagggca cccagggcca 
cattgctccc 361 tgcagtcctt ccccacatag cttgagaaat ggctgcccca ggagggtgca agggtgcaag 421 ggtgcagggg ttcactgatt actggggaga gaacaagttc aattccatgg ttaaagaaga 481 aaaatgcaaa ttgaatacct ggcccaagga ggttcctggt cccaccccat gccacccttg 541 ccctgggatg tactctcctt atggtctgac ctatgagatc tcagtggact ctatcctcag 601 aaatcacaga gtccaggatt gcaaactgga ggccctgctg tgaggccctg cccacaggtg 661 ggtcttgcct ggcctgtaga gagctggctc tctcacttcc acacatcccc accattcccc 721 accgtggcac catggagtct tccttacctg atctaggaag gcggcgctgg agcttgctgc 781 cccctgtctg atccagcccc tcatgttgta ggagggaggt gggcgactga agctcagaga 841 gggagtaaat gagctcaact tcacacagct agtgaggggc tggttggaga ccagattttc 901 caagtggttc cccgggccct cggggagcag caagggccag gcgaacctcc atgcactgca 961 caaacatgaa gaagtgaacc ctcttggggt tccccaggtg ggaagtgggg tgtcttcttg 1021 ggagggggtg aaccaacttc tggtgagttt tccttgttga agcaatgctg cacccgtagc 1081 ctgtttgtta ataattacct ggtcgggaga gagtgggtgg tactgagaag gcctggtttg 1141 accagtcctc ctgggcttga taagacagag ctgagacagc cacccagggg gtccactcca 1201 ggacagactg tgccgtgagt aaccccttga gacaatggag acgagagact ctagagggag 1261 ggtgaccaac tgttccactt tgcccaagac tgataattct tctgggatat attttcagtg 1321 ctaaaaccag gaaagtccca ggcaagctgg gacactttgt cactcttgct agaggtgaga 1381 aatactcagg ctagaaaggc cctgacttag ggtccatggg cactggtggg tgactatgcc 1441 cggagaccca gaaaagcctc ccccagggac acacagctgg tacagggcgg cactgactga 1501 gggccaaggt cttctggctg ccaatccagg tgatcttggg gagctcatgc cggaggggag 1561 gtgtgtctgt gagtgatgcg gtcctatgta tgtttctaaa tgatccctcc aactgctgag 1621 tggagaacag agagcagaaa gactaggaag ctttctgcaa gggtctaggt caggggtcaa 1681 caaaccacag cttgtaggtc aaatccggcc caccgcccat gagctaagaa tgcaagatgt 1741 aaatataaat ataaaatata agtgataagt ataaaagata aatatataaa gctatgaaaa 1801 atcaatagaa gaacaacatt ttgtgacaca ggaaaataat atgaaattca catttcagtg 1861 tccgtaagtg aagtttcact gaaacacagc cacgcctctt catgcatgtc cggtctgtgc 1921 tgtttccagc tgccacagca gagttgggta gtggcagcag agactgtgtg gtctgcaaaa 1981 ccacagatat tactatctgg cgctttgcag caaaagtttt ctgagccctg gttgaggatc 2041 
atagcaaaag agggggcgag aggctgaaga acctacagga gagtctagga gttggagacc 2101 agcttgggca gcatggtgag accccgactc tataaaaaat tagctgggtg tggctgtgca 2161 tgcctgtagt tccagctact tggaagggtg catgcctgta gttccagcta cttggaaggg 2221 tgaagtggga ggatcacctg agctcaggaa gtcaaggctg cagtgagcca tgattacacc 2281 actgcactcc agcttgggtg acagagtgag accctgtctc aaacaaacaa acaaacaacc 2341 aaacaaacaa aaaaaaccca accctgcagg tcatggaagt ggattaaatg gattttgggt 2401 ggatagaagc ctagaagctt tttggcctga caaccggata ctggtgggac catttgatgg 2461 ggcaggagca gatttggggg agaaactgga gtatcctttc ttggacagat gttcattatg 2521 aaacgggtac caattctatc cccacaactt tgggcaggtc acttattaaa catccatgtc 2581 cttgtctgta cggggtaaca atacaaagac cagttctact gacctctaca ggggtgggag 2641 ggtcaaacta aatggctgaa agtggtttgt aaaccatata aagcactcca cagatgtttg 2701 tggaacagac cctaaaataa actctgagga tagccatcct ccatgcaaaa tcactgcact 2761 aatgaaatca actcatctga aagacaagtg tgggagaagc agcacctgag tctggggatg 2821 tgggttctgg ctgggatatg tgctaggtag gtgacttggg tgagtcactt tgggagtctt 2881 catttcctgc tctgttgggt gagaatggag tctagtgctg tgcagagctc acacaggtgg 2941 gaaacacaat gacatgatgg acacaaggag tgtgccttgt aagtctaaaa cattgccatc 3001 caatagtgtt gctgtcacca tacatgtgga aggtggtggg tgttacattg cctttcacaa 3061 gcccatctgc aggagttgag ggtgtcactc acgatctccg ggtgctggcc cactgccagc 3121 cttctcagta gggccaggtg ttcttctggg ggaatttcct gggcattctc ttccctccca 3181 taggcctgag agtgcagagg atgcatctta tcgactacct gctcctcctg ctggttggac 3241 tactggccct ttctcatggc cagctgcacg ttgagcatga tggtgagagt tgcagtaaca 3301 gctcccacca gcagattctg gagacaggtg agggctcccc cagcctcaag atagcccctg 3361 ccaatgctga ctttgccttc cgcttctact acctgatcgc ttcggagacc ccggggaaga 3421 acatcttttt ctccccgctg agcatctcgg cggcctacgc catgctttcc ctgggggcct 3481 gctcacacag ccgcagccag atccttgagg gcctgggctt caacctcacc gagctgtctg 3541 agtccgatgt ccataggggc ttccagcacc tcctgcacac tctcaacctc cccggccatg 3601 ggctggaaac acgcgtgggc agtgctctgt tcctgagcca caacctgaag ttccttgcaa 3661 aattcctgaa tgacaccatg gccgtctatg aggctaaact cttccacacc aacttctacg 3721 acactgtggg 
cacaatccag cttatcaacg accacgtcaa gaaggaaact cgagggaaga 3781 ttgtggattt ggtcagtgag ctcaagaagg acgtcttgat ggtgctggtg aattacattt 3841 acttcaaagg tgagagtcag atcattggta tatgctaaat ccacaccacc gcctccaacc 3901 cactaatttg ttgactggtt aaatatattt ttacaaaaat gtaagtaaaa acgttgtgtc 3961 tgatagggtt gagcaaccct cagggaatgc aatgtgagaa tgttgtatgt tgaagactct 4021 gtagtattga ggtttccatt ccttcctatc tttagacata ttttcttcat aaaaataatg 4081 tataaaatgc attatagcca gtgatcatta gttgaattgc tggattcata tctgagctca 4141 gcctctacca acagtgcaaa caggtcagct tgtttgctca aagcagggat aagacaagtc 4201 tctaattcaa aggttggctt tgaggatcaa atgattaacc catgtatata ctccttagaa 4261 aagggtctaa agagatgggc aaactacagc ccatgggcca aatgtgttca gccactggtt 4321 ttgtaaataa agttttattg gcacacagcc aggcccattc atttatgtat tgtctgtggc 4381 tgacttcatg ctacaatggc aggttgattc attactgcag agattgaatg atctgcaatg 4441 cctaaaatag ttaatatctc tataaacctt ttggtttgtt gacccctgac cattttatgt 4501 actcaataaa tagcaactat aattcatcta atgataacaa tgctcatttt gaagctttaa 4561 aaatatgtaa aagcataaag gagaaagtag acataatctc tagtcctacc actcagaggc 4621 aacaaatgtt aatgttttag catcttttct tcaccttcca ttcacacaca tatatgtcca 4681 cacagttgat tttgcagtgt gcaaacaact ttgtacggtg tgtattagcc agggtttcat 4741 cagagaagga gagccacctc aagtcaatgg taaggagatc attataggaa ttaggcccca 4801 cacaattgcg gctgggaaag tgaaggtcta gaggaaagaa gagttggaag gacagagaga 4861 aagttgccaa tgtaagcgga caagtcagag cttagagggg aatctatatg ccaggctcaa 4921 ccagccatca gagtgctgcc atggagggga gcatcactgc atagtttctg tggtagcctg 4981 gggctgctgt tgtttagtgg ggttagactt gggaaatgag cttggagaca agcagggcaa 5041 actggagcac acagagcact gctgagtcta ccatcatcat atctgactac cagggccttc 5101 agagggcaat ctacttcgaa atctcacagt tctaaccagc ttgaacccaa aatctacagg 5161 gaacaaaatt ctagggagca actcccagcc taatgattca gccaaattag tcatcccatg 5221 taatttgcat gtggaatttt catgaaccat tgaaaagttt ttttaaaaaa gctttgtttt 5281 gaggctgggc atagtggctc acacctgtaa ttccagcact ttgggaggcc aaggcaggtg 5341 gatattgaaa tcagaagttc aagaccagcc cggccaacat ggcaaaaccc tgtctctact 5401 aaaaatacaa aaaaaaaaaa 
aaattagcca ggcttcatgg cacatgcctg taatcccagc 5461 cacttgggag gctgaggcag gagaatcgct tgaacccagg aagtggaggt tgctgagatt 5521 ttgccaccac actctggtct gggtgacaga atgagactct atctccaaaa aaaagttttg 5581 ctttgaaata acaaaacttt taaactttat aaaaaaaatt aactaaggaa ggaagagaaa 5641 cagtagtatg ggtttcactg tcaggaagac attggttaga atcccagatc ccttgctatt 5701 ccctggacta aaattcctgg catggaggcc ttagtccatt caccctacct gccaccacct 5761 cactgcacat cgcctcctgt atttgcttcc ctcctcacac tgcagcgatt ccatggccat 5821 cattgtggtc acttccttgc aaacactctg aactcttaac ttccttggtt ttccctccct 5881 ctatcatact taccctccaa agctgcaaac ctgattaaac ctaaaggtct aactagtcta 5941 catctgcacc cagcagctga atgtggttga agaaaaacat ccaacggggc tgatggtctc 6001 acttgatatt tgtgaatgca aattttcatt gatatctcta cactgtccag caatcctacc 6061 tacatatctc cactctccaa aacaactata ataacagatt cgtccttctc aaatgtcctc 6121 cccactcatg actttcactt aacaatcttg tctcactttt cattgagaaa acagaatcag 6181 ttaaacagga aatctctcat cttctcacct ccaaatctac tcacctccaa atcttctcac 6241 ctccaaatct actcgcctcc ctgcatttga acccacaagc tcagatgccc tcctctcatc 6301 atggcagtag tgacccacct cttatggaag gctgacccct ctgctcacac tctgccatct 6361 cctctctcct tctcaaggat gctcacctaa caatgaaact ctctttccag caccccttgt 6421 ttcttctcta tcaatcctac tcacatatta gaaaacgaac atcctctgct atctctcatt 6481 ttaaaattta aatgaggcct aaaggatcca ttccccaggc atgtcaggtt ttgaatggtc 6541 aaaatgggac atcttgatgg gctcataggg caggatctgg agcgactgtt tctgtagctc 6601 agaacaccag atggcattcc ctggctggag gactagctct gtggcctgca gatgtcctgt 6661 accttctttt catcttccct tcagccctgt gggagaaacc attcatttcc tcaaggacca 6721 ctcccaaaga cttctatgtt gatgagaaca caacagtccg ggtgcccatg atgctgcagg 6781 accaggagca tcactggtat cttcatgaca gatacttgcc ctgctcggtg ctacggatgg 6841 attacaaagg agacgcaacc gtgtttttca ttctccctaa ccaaggcaaa atgagggaga 6901 ttgaagaggt tctgactcca gagatgctaa tgaggtggaa caacttgttg cggaagaggt 6961 aatcagtgtg ctatgggggc tgaatctaca gtactatcca tcaaatgtaa ctttctaatg 7021 aaagacagtg ccccaaaata gagagtgaac accaaattat gagagaattc tcacttgctc 7081 tgagaacttc catgggcttc ctttaaatga 
tgaactctcc tgttatattt aaaatatgac 7141 ttccccctct ttaggcaaaa aagaaccaca gtaggagttc agagtcctgg gttctagaca 7201 tagctcttcc actgcacaag acttgggtca agtgactctt cctctctggg gggttacctc 7261 tgtgcaatga agaggtagcc cttctggccc agaaggttga ctaagctgcc atagtgggac 7321 ttaaaaaaac tccatccacc tacccaagga attaatagga aggcttccca tcactctggt 7381 tatgattgga aaaagtcttc aaccatatat aagaaaagga tcctagccca agggcacatg 7441 tggatagaaa attcacacag ctttcatgat ggagagaatc caatccaaag aaattagctg 7501 gttcaaaaag aataaaaccc acagggctat aacagaggca aaggcaaaac tgccacctgg 7561 cgaggcttac atctgtctcc ctgggcacct gaacacccac aggaagcaac ctctgaagag 7621 gggttcccaa ggacgggaga tcccacagac agcagggcct gggtacaaag gaacctggcc 7681 tttcaacatc catttgtggg aggtggtctg agccgaagct gtttgtgggg acagaactca 7741 gatggaatag agccagctag agagggccac caaggagggc tctgcccagc agggatcctg 7801 gcttgttcat taatctaatg ttctaactca atgccccttt caggaatttt tacaagaagc 7861 tagagttgca tcttcccaag ttctccattt ctggctccta tgtattagat cagattttgc 7921 ccaggctggg cttcacggat ctgttctcca agtgggctga cttatccggc atcaccaaac 7981 agcaaaaact ggaggcatcc aaagtaagtc gtcaacagtc agcaatccct agagaactcg 8041 gtagggcagt gcccatggga gctgccaggc gatggggctc ccaagctgcc acatggagtc 8101 tcagagcctg agtgtgttca gtcatctaca ggcatcagca tccaccaatc aaccagccct 8161 ggacgtggtg ctaggtgctg gagatatagt gttgaaccag cccctgccct cagagagctc 8221 tcccagtcta gtcagggaca cagcgtcttc atagagagat aaacgcaatg atagaaaatg 8281 gaggaggggc cgggcatggt aactcatgcc tgtaatctca gcactttggg aggccaaggc 8341 aggcagatca cttgaggtca ggagtttgag accagcttgg ccaaaatggt gaaactccaa 8401 ctctactaaa aatacaaaaa ttagaatggt gtggtggcgg gtggtgggca cctgtaatcc 8461 cagctattca ggaggctgag gtaggagaat tgcttgagcc tgggaggcag aggttgcaat 8521 gtgaccagag atcaggccac tgcactccag cctgggtgac aaagtgagtg agactccatc 8581 tcaaaaaaaa aaaaaaagaa aaaatagagg aaacagagag gagagacaca ggtgaggctg 8641 cattgggtgc gtctatatta gggagacttt gcagataaga gaaagagact caagacttct 8701 gataagaaaa gtgcacgagg gggtagagcc ttgggttcaa gcttgaattc cagttcttcc 8761 tcttatggtt attgaaggag tactagtttc tgaatttatt 
ttcctcactt ttaaaatgga 8821 catcatactg ctgacctctt aggatggcca tgaagatcaa ataagatgca tgcaaagccc 8881 cgggcacaga gcaagtgctc atttaatgag agctaccatg attgtgtgtg gcaggaggtc 8941 tctagctcaa aaatccgccc aaatgaaatt cttgctgtgt tatccccaag aaatggcctt 9001 gcaggctgtt atgtgcaata ttattcccta gcaatttctg gctctcgcag tcttgtccag 9061 gccccctctc ttgctggctt ggagataatg cttgtgattt tcctcccaga gtttccacaa 9121 ggccaccttg gacgtggatg aggctggcac cgaggctgca gcagccacca cgttcgcgat 9181 caaattcttc tctgcccaga ccaatcgcca catcctgcga ttcaaccggc ccttccttgt 9241 ggtgatcttt tccaccagca cccagagtgt cctctttctg ggcaaggtcg tcgaccccac 9301 gaaaccatag ccctcccagg gctgctcatc tgttccaagc aggaggatgt ggcaggggag 9361 ggctgggagt gagtagctct gtgtttagag ttggggacaa ggatgacacc gaagtccagg 9421 agtccaggac agcaggtgct ggccggtggg gagcggggag gggcactgag atgggcaggg 9481 cctggacatt ccacaccctg gtgctgtgca gcctctggca gagcatccga cctcttggag 9541 caagtttctg cctctggaaa gggggcgggc ccttttcaca acaggctggt tgtaccgagt 9601 aaacaacacg atgccatg // LOCUS HUMLHDC 32351 bp DNA PRI 10-JUN-1994 DEFINITION Human gene for L-histidine decarboxylase, complete cds. ACCESSION D16583 NID g516770 KEYWORDS L-histidine decarboxylase. SOURCE Homo sapiens DNA, clones lambda HDC[1, 2, 3 and 4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yatsunami,K., Ohtsu,H., Tsuchikawa,M., Higuchi,T., Ishibashi,K., Shida,A., Shima,Y., Nakagawa,S., Yamauchi,K., Yamamoto,M., Hayashi,N., Watanabe,T. and Ichikawa,A. TITLE Structure of the L-histidine decarboxylase gene JOURNAL J. Biol. Chem. 269 (2), 1554-1559 (1994) MEDLINE 94117478 REFERENCE 2 (bases 1 to 32351) AUTHORS Yatsunami,K. JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 32351) AUTHORS Yatsunami,K. TITLE Direct Submission JOURNAL Submitted (02-JUL-1993) to the DDBJ/EMBL/GenBank databases. 
Kimio Yatsunami, Japan Tobacco INC., Pharmaceutical Basic Research Lab.; 1-13-2 Fukuura Kanazawa-ku, Yokohama, Kanagawa 236, Japan (Tel:045-786-7690(ex.3390), Fax:045-786-7692) COMMENT Submitted (02-JUl-1993) to DDBJ by: Kimio Yatsunami Dept. of Pharmaceutical Basic Research Lab Japan Tobacco INC. 1-13-2 Fukuura, Kanazawa-ku Yokohama, Kanagawa 236 Japan Phone: 045-786-7690 x3390 Fax: 045-786-7692. FEATURES Location/Qualifiers source 1..32351 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="EMBL3" protein_bind 5447..5452 /bound_moiety="GATA binding protein" protein_bind 5460..5464 /bound_moiety="LBP-1 (leader-binding protein-1)" misc_signal 5592..5597 /note="CACC box" misc_signal 5733..5738 /note="CACC box" misc_signal 6124..6129 /note="CACC box" protein_bind 6551..6555 /bound_moiety="LBP-1" protein_bind 6788..6793 /bound_moiety="GATA binding protein" protein_bind 6826..6831 /bound_moiety="GATA binding protein" protein_bind 6895..6900 /bound_moiety="GATA binding protein" protein_bind 6965..6969 /bound_moiety="LBP-1" protein_bind 7169..7173 /bound_moiety="LBP-1" misc_signal 7183..7188 /note="CACC box" GC_signal 7221..7226 TATA_signal 7252..7259 /note="TATA-like sequence" protein_bind 7268..7272 /bound_moiety="LBP-1" exon 7278..7414 /number=1 protein_bind 7292..7296 /bound_moiety="LBP-1" CDS join(7384..7414,9600..9772,14491..14604,15461..15583, 18247..18381,18638..18781,19245..19311,20137..20299, 20391..20481,24567..24665,29671..29772,29907..30653) /EC_number="4.1.1.22" /codon_start=1 /product="L-histidine decarboxylase" /db_xref="PID:d1004531" /db_xref="PID:g516771" /translation="MMEPEEYRERGREMVDYICQYLSTVRERRVTPDVQPGYLRAQLP ESAPEDPDSWDSIFGDIERIIMPGVVHWQSPHMHAYYPALTSWPSLLGDMLADAINCL GFTWASSPACTELEMNVMDWLAKMLGLPEHFLHHHPSSQGGGVLQSTVSESTLIALLA ARKNKILEMKTSEPDADESCLNARLVAYASDQAHSSVEKAGLISLVKMKFLPVDDNFS LRGEALQKAIEEDKQRGLVPVFVCATLGTTGVCAFDCLSELGPICAREGLWLHIDAAY AGTAFLCPEFRGFLKGIEYADSFTFNPSKWMMVHFDCTGFWVKDKYKLQQTFSVNPIY 
LRHANSGVATDFMHWQIPLSRRFRSVKLWFVIRSFGVKNLQAHVRHGTEMAKYFESLV RNDPSFEIPAKRHLGLVVFRLKGPNCLTENVLKEIAKAGRLFLIPATIQDKLIIRFTV TSQFTTRDDILRDWNLIRDAATLILSQHCTSQPSPRVGNLISQIRGARAWACGTSLQS VSGAGDDPVQARKIIKQPQRVGAGPMKRENGLHLETLLDPVDDCFSEEAPDATKHKLS SFLFSYLSVQTKKKTVRSLSCNSVPVSAQKPLPTEASVKNGGSSRVRIFSRFPEDMMM LKKSAFKKLIKFYSVPSFPECSSQCGLQLPCCPLQAMV" intron 7415..9599 /number=1 exon 9600..9772 /number=2 intron 9773..14490 /number=2 exon 14491..14604 /number=3 intron 14605..15460 /number=3 exon 15461..15583 /number=4 intron 15584..18246 /number=4 exon 18247..18381 /number=5 intron 18382..18637 /number=5 exon 18638..18781 /number=6 intron 18782..19244 /number=6 exon 19245..19311 /number=7 intron 19312..20136 /number=7 exon 20137..20299 /number=8 intron 20300..20390 /number=8 exon 20391..20481 /number=9 intron 20482..24566 /number=9 exon 24567..24665 /number=10 intron 24666..29670 /number=10 exon 29671..29772 /number=11 intron 29773..29906 /number=11 exon 29907..30962 /number=12 BASE COUNT 8594 a 7026 c 7405 g 9326 t ORIGIN 1 gatcacctga agtcaggagt tcgagaccag cctgaccaac atggagaaac cccgtctcta 61 ctaaaaatac aaaattagtt gggcatggtc gtgcatgcct gtaatcccag ctactcagga 121 ggctgaggca ggagtatcac ttgaacccag gagacggagg ttgcggtgag ccaagaacat 181 gccattgcac tccagccttg gcaacaagag caaaactccg tctcaaagaa aaaaaaaatt 241 tttttttgtc aacccatgtg ggtatgaaat tgtttctcat tatttgcatt tcactaatta 301 ctagtgaggt caaacatttt ttccctcatt tcttatccat taaagtttat tctgtgaatt 361 gccagttcat ggccattttg ggggtgtagt ttttcttact gatttattga agtttaatgt 421 ttttctggaa actaatattt tgctagttgt gtgaattgta aatatcttct cccactgtat 481 ttttttaaaa actttttatt gtctgaatta gcacatattt agtgaatgcc tctatgtgca 541 attctttttt tttttttttt tttttttttg agatggagtt tcgctcttgt ttcccagact 601 ggagtgcagt ggcatgatct aagctcactg caaccttcac ctcccgggtt caagtgattc 661 tcctgcctca ggcctcctga gtagctggga ttacaggtgc ccgctaccat gaccagctaa 721 ttttttgtat ttttagtaga gacggggttt catcatgttg gccaggttgg tctcgaactc 781 ctgacctcag gtgatccacc cgcctcggcc ttccaaagtg tagggattac aggcatgagc 841 
cagtgagctc agcctctatg tgcaattctt aatgtcaagc tgagggagaa gtctaagaag 901 ttttatataa gtctgacacc cagggtgcct gtaattaaag gtggaatgaa attttaattt 961 gcgagttgat ccttctgttt tttagttcag tacttcctcc tttttagaat gtatgctttc 1021 cagataatta gtgagataaa gatttgattt tcacttttag catttctaaa agctgtaata 1081 tgctacctaa cttgttttta attcttgctc taacttctga tattttatag ttgtgtattt 1141 tgaattatat tttaaaggat taagtggatt attgcacatg atatttagta aagtgtttgg 1201 gagcatttag cacagcctcc aaaaattaga attccactag ccccctgatc ctacatatct 1261 ggcatcttaa gagtcaataa tagcaaagtt gagagaggca gcttggtgtg gtggaaagaa 1321 catgggtatt ggatccgata gccttaggtt caaatcctgg cttcttcagt aattatcttc 1381 tgtaatttta gccaagtgtt taatttcctt gagactcagt ttctcgaaga tgaagtagaa 1441 aatggctttc accaggttgt tgtaaagact gaatcaaata gtgtgcatga aaattgcaag 1501 gcatagtgct caggacatag ttaacattct taaatatatg ttgaggaatg gtgtgcacct 1561 gaaaattcct ataaaagctg caaatataca tgccttttga tccagctgtt ccacttcagg 1621 atatccttac tcatgtgtgc aaagcaattt tttgtagaaa tgtttgtaat agcaaaagac 1681 tggaaacaac tgaaatgtac actgctaggg aactgattac tctctggtat gtccacgcaa 1741 tgcagctcta tgaaaaaaag gagtacgtgt caagatatat taacaaataa agtacagaat 1801 attgtactgt acttatccct tttgtgttta acaaaagaaa gaaaaggaga aaaattagaa 1861 tatagtatat atttgccttt atgtgcaaaa aaaggaaatc actttttgtt ttttttgaga 1921 cagggtctca ttctgtcacc caggctgggg tgcagtggcg ctatcacggc tcactgcaac 1981 ctcaacctcc caggctcaag cgatcctccc gccacagcct cccaagtagc tgggactaca 2041 ggcacacgcg ccatgcccag ctaatttatt tatttatttt tggtagagat ggggtttcgc 2101 gatgttaccc aggctggtct caaactcctg agttcaggtg atccacctgt cttggcctcc 2161 caaagtgctc tgggattaca ggcatgagcc actgtgcctg acctcttttc ttcttgatgt 2221 gaaattcttc atagattaga caaatttgcc cattgtgcaa atatgttgca aatatatttt 2281 ctcagtttgt catttatctt gatttttctt atggttttaa aaaaatgtga ttggatttat 2341 ggtttctggc attgaaccat agttataaat gtattcttta ctctgaggtt gtccagggaa 2401 tcttctaatt ttcttaattt aattttcatc ttttaaaaac agctctattg atgcataatt 2461 gatatacaaa aatagcatat atttactact atttgatgag tttggacatg tgtacacacc 2521 catgaagcca 
tcagttaacc tgatggtggt gatggtttcc tgtaatttta tgcttccatt 2581 tttcacattt agatctgact tttctggaat ttatctaagt gcacgatgtg aaagatagat 2641 acagcttatt tttctttcca gatggctata ccttgttcta ggacaattta ttaaatagtc 2701 catctttttc tttctgatct gagatgctat ctttattata tgctaaattt ctgtacatgt 2761 ttaaatttat ttgctccttt tgtttgtctc tttgtacacc agtatctacc tcattgttct 2821 aattattcga gcttcaggat acattttaat acttctgggt aggaacagaa ccttctcgtt 2881 actcttacaa aattttcttc tttattctgg tttatttttc ctaagaactt taaaagttgg 2941 cactgaattc caaaatagaa cacccaaaaa cctgttgatt tttccactgg gattatgtta 3001 aattttcggg ttaacttagg ggaaaaccat tatttcatga tgttaaatct tactccatga 3061 gagagcatgg tacgtctttt catctgtaca ggtctttttt tatattgttc agtagcaatt 3121 taaaaaaatc atccactatg ttcaatgaaa taaaaggcca gatttaaaaa tttggcaagg 3181 agttgaaaac tataaaaagg aaccaaataa aaattctttt tttgccccca taaggcagaa 3241 gaaatacaaa ttctacaact gaaaaaatat aatatctgat attaagaaat caataaatgg 3301 tttattatat tagctacatt tgaagagaca actaaaaaag aaaaatattt tcaaaccaac 3361 tcagaaacaa gaagatggaa aataaagaat acaatgctat ttatatttat atataatagt 3421 atatataata cacaagttta acataggact gcatttgcag aatagaagag agataatggg 3481 gagatagcaa cacttggaga aaaaataacg aacacttttt tatttaaagt gcctaacttg 3541 gctgggtgtg gtggctgaca cctgtaatcc caacactttg ggaggctgag atgggtggat 3601 catttgagcc caggagttca agaccaactt gggtaacatg gcgaaacccc atctctacag 3661 aaaaactcca agaaataaaa aacaataatc atctgagtgt ggtagcacat gcctgcgtcc 3721 ttgctactca ggaggtgggg gtggcagaat taccagggcc tggaagtcaa ggctgcaatg 3781 agctgtgatc gcatcactat acttcagcct ggttgacgag tgagaccctg tctcaacaaa 3841 taaaaataaa aataaaaata aaaatgccta acttggtgct tggcactttc taaaattgac 3901 acataatatt tgtacctgtt tatggagtat atgtgatatt ttgatacatg catacaatgt 3961 gtgtaatgat caaatcagga tatttaagat atccatcatc tcaaacattt ttctttgtgt 4021 tcagaacatt ccaaatctag ctattttgaa atatacaata aattattaac tatagtcatc 4081 ctaatgtgct attgaatatt attgaattcc atctaattat atgtttgtac caattaacca 4141 atctctactt attcctcacc cccagctccc accaaccctt tccagtctct ggtaactatc 4201 attttattct ctacctccat 
gagatcaaat tttttacctc ccatatcgtt gtcaatatta 4261 ttatttttcc caccctggct cctaatgaag gatttattga caatatcact ggaatctaaa 4321 tcctctaggc cattttttca ccaatgccct tattttctta gggctttttt ttttcttaaa 4381 tcaacagttt taatgagata taattcccat tctataaaat ttcctcgttt gaagtgtaca 4441 gtttaatagc ttttagtata tccacagaag tgtgcaacca tcaccataat aaattttaaa 4501 acattttcag cagggcgcag tggctcacgc ctgtaatccc agctactcag gaggctgagg 4561 caggagaatt gcttgaaccc aggaggcaga ggtttcagtg agccgagatt gcaccactgc 4621 aatccagcct gggccacaga gtgagactcc atctcaaaac aaaacaaaac aaagcaaaat 4681 aaaacaaagc aaaataaaac aaaaccaaaa cccaacaatt ttgaacattt tcatcttacc 4741 accctgccaa aaatccctgt tctccttagc catcaatctt cagctcccta cccacttcta 4801 accatgaatc tattttgtag atttgtctat tctcgatatt tcatatgaat gaaataatat 4861 aatatatggt tccttgtgac tagtttcttt cacttagtat gtttccaagt tcatccatgt 4921 tttggtatgt atcagtactt cattcctttt tatcatggaa taatattctg ttgtattgat 4981 atactacgtt tcattcattc atctgaacat ttggattatt tccatttttg tctattataa 5041 taattctgct atgaacaccc atgtatacgt ttttgtgtgg acatgatttc attttccttg 5101 gatatacacc tagcagtaga attgttgagt cataaagaac tctaggttta gctttctggg 5161 gagcttccag attattttcc aaaatggctg tgccatttta cattcccacc agtaaagtat 5221 gagggttcta atttttccct gtcttcacca atacttgttg ttatctatct tttttattat 5281 agtcatcata gtctgtatga agtgatttct cactgtagtt ttttcagtag taattaaaaa 5341 atttcctacc tattttagcc taaaattact ttcctgactc tcttcaagga gcctgccaga 5401 gaattctctc tctcgtgctg tctcccttat gcctgggcat aagctctgat aatgccctgt 5461 ctgggaaaac tcttttggcc ttctcaattt ctattacatt gagagcccaa gaacccatgg 5521 tcagtaacag gccaatgaga agcctcctgc attagtctag gtgaaggata atggtggctt 5581 ggaccggaat agggtggtag tcatggaaat ggggagaatt tgacagacta gatttatttt 5641 ggaaatagaa ctgaatctga acagattgaa ttgggcaggg agcagggaaa gtgaagggga 5701 aggaagacac caggtttatg ccttcagcat ttgggtggat gaatggatgg atgaaaggat 5761 agatggatgc atggatggtg gttggatgga tgagtggttg gatgaatgga cggtcaaggg 5821 gtgaatagat gggtggatgg aaaatggaga gatgatgaac tgagatgggg aagactagaa 5881 gtggggcata tttttgggga aagacagttt 
tgcaatacgt taagtttgag gtgcctcaga 5941 gtcataagag aggatttgtc agaaaggcag ttggatataa gattttgaag ctcaaaagac 6001 agggagggcc tgatggtatg tttaaaccta tacggatatc acataggtaa catttcaagc 6061 ctaagaaatg ggccgggcgc agtggctcac acctgtaatc ccagcacttt ggaaggctga 6121 ggcgggtgga tcacttgagg tcaggagttc gagacaaacc cagccaacat ggtgaaaccc 6181 catctctgct aaaaacacaa aaattagtgg ggcatggtgg cacatacctg taggcccagc 6241 tactcaggag gttgaggtgg gagaattgct tgaacccggg aggcagaggt tgcagtgagc 6301 caagattgcg ccactgcact ccagcctggg cgacagagca agactccgtc taaacaaaac 6361 aaaacaaaaa aagccttaga aatgggcaat gccaccaaag cagaaagatg aaaggagggc 6421 aatgccctga ggtgtgccag cattggagga aacaaacagc cagagatgta ggaggaaagc 6481 ccagagaaac tgtgagctgc aaggatgaag tagtaacaaa taaaatgcca ccaagagaac 6541 cagaaggacg actggaaaca aataaacaag aaagactatt gcatttagca acgtgaaaac 6601 gttggattac tttttactca gaaatcaaac cccaaggata gtgtttcccc tccctacggg 6661 gtttctgcag gtccctttac acgccccctt ctacttcttt aagtgctaaa agtgattaat 6721 ttggggtgat gttttaattt ctattctttt tggcttataa ctgtgttttt tcattctttg 6781 caacaacaga tagtatgttg taatagaaaa atccattgtg gtcaatgata acttctgcct 6841 gttcctgggc acagggggca tcactattag tgcctgttcc tccctcccta ctgctgataa 6901 ggaaacaggg gcaggaataa tactaataat ttgtgctact ttttctgcat acccttggcc 6961 ttattctggt ctactgagct gggagcttgt ctgaggctgg agctctattg tctgaatcca 7021 ggggcagaaa gaactgaggg ctcttttacg gcatccacct ggaggctgcc tgcaggcttg 7081 ggttgctaag tgcactgctt aatttagaag cgccttttat cctggctctc ttgaccagtc 7141 aaagttgtgg tcgctgtttt ctacaaggac tggggaaaga aagggtggaa ttaattaaac 7201 ctggaggaag ggactttgaa gggcggagct aaggtcaaag aaagaaccct ttaaataaag 7261 ggcccacact ggctgccagg gagtgcgcag gactggcaag agggaagccg ggctgctcca 7321 cgcctttcac gccttccacc tcctgcgtgt ccatctgtga gaaggagcca gagcccaagg 7381 gagatgatgg agcctgagga gtacagagag agaggtgagc aacgggcatc cctggccctc 7441 ttccctgctg ctactcctgc aggaacggct gctggtgggt ggcccttaga ttgtggggtg 7501 acatttgatg ggagcattag aattgccttc tgagagttgg caccagcaac attttttttt 7561 ctgcgaaagg aatgtgggat cagagtgtaa aacttttctc 
aaccactctt tcattctccc 7621 tctcatatac ccactctcac atatgtccac actaccgtgc cttgcttttc ctccccactt 7681 tcccacaaaa ttgaggattt ggatccctga gccactagat gggaaaaggg gattgttcat 7741 attctgaaat agaaaaaaat tttctcaata ttctgtaatc actgcaaatc ctttctttct 7801 taacattcat tccttcatgt gtgaatctat cagcagtgtt ctttcattta gtcattcatt 7861 tcataaatat ttagtaggtg cctactctgt atcaggcaaa accaagtggt tatccagttc 7921 tcaggcatgg atctccctta gcaggaaagt aggtagttgc ttaaaaaaga atcttgaatg 7981 tctatttcta taatgtttta tatcctccta ctcactaatt agtgtgcgtt agcttattcc 8041 attgctttcc taaggcactg tcagctgaga gcaagttcct gtggatattt atcttctgag 8101 tggaacacaa atacatttag cactgttcct ggatcagtgg agggttaggt ctagagctct 8161 cagatctctg gtggggccct gctggagaag aaccaggatg tgagagggtt gaaggggagc 8221 agaaggaaga tagaatggtg tggaaatcta cctttgtctc ataagttttg tctggggttg 8281 gttctgttct taagaacttt gcataagctg atgcagaggg gagaaactga gtcccctctc 8341 ctaggtaaaa gcagcccagg gagtactaag gaatggatgt tgagacagac aaagagatat 8401 gtacacacac acacatgcac acacagcaac agaggcactg aagactgcaa tctgaaagaa 8461 taaataattt gcaatagttt cagagtgttc aacttgactt ttttttcctt ccttactcat 8521 atgttcattc acttgataca ttttttttta gcatctaaaa aatatcctaa gcactgaggt 8581 gttgatggag gcagcaggag ggagagaaag tcattaagat ttggtgccac atttcccagg 8641 ccagggcagg gggcttggtc tggatccatg gttgtgcttt agggattctg tgagcctcag 8701 aaattataca caaagttctg tgtatatgta agcctacagt tttaatcaga ttcttgaatt 8761 aagtcctcat atgagtccaa aaaagatgaa taataactga ttcaatgggg atggtgaaaa 8821 aatagatttt tttgtccttt atcggaactg ggcatggaat ctttctatct gtgctaacag 8881 atataactgg tgggtgtgtg tgtaacaatt tttggataca aagagctata gagtttctat 8941 tgaacctata aaatagttaa tagatgttct gtttctgaga aggaaagaaa ggtaggagag 9001 acaatcaaaa ttgaagtgga taaaggaagg ggaaagaaaa tgaaacgttt cactttggcg 9061 ctagagtttt gtgctttgga gattttcctt tttgttcatt tgctcactta gaaactccct 9121 ccatcattca ttcactccct cactcttttt ttaaattctg aagataaggt cgtgcagggg 9181 atcagatcct accctcggcc catgaatctg tctctctctt ttttttgagt ccaggctgga 9241 gtgcagtggc acgatctcgg ctcactgcaa cctccgtctc ccaggttcaa 
gcgattctcc 9301 tgcctcagcc tcccaagtag ctgggattac atgcgcctgc caccacgccc ggctaatttt 9361 ttgtattttt agtagagagg ggtttgacca tgttagtcag gatggtcttg atctcctgac 9421 ctcgtgatcc gcctgcctca gcctcccaaa gtgctgggat tacaggcatg agccactgtg 9481 cccggccaaa tccatctcct tctttaccaa gtctcttggc ccggacctga gagggatggg 9541 cacaaatcgg tcaggcccag tcaggggatt tcagcctgat ttctctatgt cacttctagg 9601 gagagagatg gtggattaca tctgccagta cctgagcact gtgcgggaga gacgtgtgac 9661 gccagacgtg cagcctggct acctgcgagc ccagctgcct gagagtgctc ctgaggaccc 9721 cgacagctgg gacagcatct ttggggacat tgaacgaatc atcatgcctg gggtgagaca 9781 cagtgaccac aagggcagtt ctggatttca gggcacagaa gggtgggtgg tggcctggag 9841 aaagacctgc tttgggtcag cttgggatga gaatgcatcc ttcagccact tggccaccaa 9901 caccccagcc catctgctag gcatgtggag ggcaggactt tgactggtat taggtttctt 9961 tttctttttt tttgcccagt tgtgccatgg catgaggttt ctgatgccag ggcagggctg 10021 gccagaacac ctgcattcca gagcacagcc tggcagaagt gtggaagtct aatgggatgg 10081 gtggagaagt gaacattttg tctgagggga aaggacactg tgtatagtaa gacaggggag 10141 aggaagctgg tggtagtgcc tggccctgac gacggagcca gagactcttc tgtacagtgt 10201 ctgagaccac accaaagaga aggaaggaag tggggaacac atcctggcaa ggacacgctg 10261 gtgaccaggt tgcgttggcc caggaaactg agtgggccct ggcaacccct cctgtcccct 10321 ctgaggtctg gtgattgagc tgcttttcaa gagcaagaag ggactttccc tgaatcattg 10381 atactttcca tgatggaggg tgcagcctgg gatagtgctg gtggggccag gaagctgatg 10441 gaagctggct gatgcatggg cttgagctgg agcctgcctt gatgggtggt ggacacccct 10501 gaccactggc ggtgaggagc tctgccatgg tgctcatgct aaagtggcat cgactccctt 10561 cctgtggcgg ggctccctgc catgtgccag ctctgtgctg gccacatgac atgctatcta 10621 atttaatctt cccagtggcc ctgttttaca gatgaggaaa ctgctgatgc tcagagaagt 10681 taactgtctt gcccaaagtc acagaattag tacctggtag atccaggatt ttaccctaga 10741 ccagtcttac ttcacaggct gtgccgtttc tcctaccccc tgtcacctct actggtgggt 10801 gcctgaatgt gcctggctaa gcctgggccc ctagcattgc ctgtctggct ccacaccccc 10861 acacaagctg tgtcttccct ctccctgctt ggggcatttg tgttctgaac cacacatgtt 10921 tagaggctct tcctgaggcc agcttgaggt ttgccttcat 
cattcccaaa gaacttttga 10981 cttcaatgtg acaaaaaggc caggcgcggg ggctcacacc tgtaatccca gcactttggg 11041 aggccaaggc aggtggatca cctgaggtcg ggagttcaag accagcctga ccaacatgga 11101 gaaaccccgt ctctactaaa aatacaaaat tagctgggca tggtggcaca tgcctgtaat 11161 cccagctact tgggaggctg aggtaggaga atcgcttgaa cccgggaggc ggaggttgcg 11221 gtgagctgag atagcgccgt tgcactccag cctgggcaac aagagcgaaa ctccatctca 11281 aaaaaaaaaa aaagtgacaa aaatatgcct agaggcaggg gtgggaaaat attcaaactc 11341 agaaagtgat ggcaattttg actttttttt tttttttttt ttgaagcaat agagacaagg 11401 tcttgctatg ttgcccaagc tggtctccaa ctcctgggct caagtgatcc tcctgcctcg 11461 gccttccaaa gtgttgggat gagaggccta agccactacg cctggccatg agggcaattt 11521 catacaaagc cactcatctc tttgaccatt cagcctattt cacaagcatt ttttgaaaat 11581 gtcaatatca gtaaggcttt acggaggcta gatcctgata agattatctc ctggatttta 11641 ttctgtaaaa tgtaatctgc tgaggggcta actgtaatct gggagtgaga ctgaaagcaa 11701 ctgtcagaag aggtggaaat gagctgattc atttgatcaa gaggaattcc agcagttgct 11761 gtgggtatgg tcggtgaggc tcatttctgg aaagaggtct gtttgttttt gataagggct 11821 cttctggcta ttttatctta tttatcttac aagaacagac tattgtctct tgatagagct 11881 aggaggtaga agcccttctc tggggtttcc aaaatactgt cactaaggac agtgataatg 11941 atggtcaaat ctctcaactc atctctcagg ccacaccagc ctctgagagc tggcctgaaa 12001 gcagttggcc ctttgcttac tcattttgct cagtgcctca ttgaaaactg aattagtttt 12061 tctcttcctg ccaccaccta gaaaataaga gaagaagaaa aactctcttt cccctttaat 12121 tccaaccctg gttagacaat cctcatttac ttttcatgag agagagagag catccccacg 12181 ggaaaagtca ggacaccacc ctgctcgggt gtctgggctt ctgcctcatt gcggtcacct 12241 gtccctatgg ggtatgtgtg accacaggga tgtggcctct ctctgtcagc cccagttctc 12301 gcctgcaaaa cagagataat gtgtaggtcc taccccatag ggtcctgtga gaagtaagta 12361 agagaaggct tggagagcag gagatgtggt tattaccgat catgactctt acatggtgaa 12421 aattggagga agaggtttga ttctctgagt caagtcagga tatgtgatgt gtctgtcacc 12481 cacatgatta gtttaccgtt aatgttagag cttaacattt cacacatttc ttgcccacgc 12541 ctccacccac acaacaaccc taggagacag gatttatttt acgatgagaa actgaggttg 12601 agggaggtgc aggagaggct 
cctggtctta tggggtctgt actgtaaaca ctttttgaga 12661 cctccttttg aaaaacaaaa aagaaggagg ccggacacag tggtgcacgc ctgtaatccc 12721 agcactttgg gaggctgagg cgggtggatc atgaggtcag gagttcgaga ctagcctggc 12781 caacatagtg aaacactttc tctattaaaa atacaaaaaa ttagccaggc atggtggtgg 12841 gcacctgtaa tcccagctac ttgggaggct gaggcaggag aatctcttga acctgggagc 12901 caggggttgc actgagccaa gatggcgctg ctgcactcca gccctggtga cagtgcgaga 12961 atctgtctca aaaaaaaaaa aaaaagaatg aatctcaaaa gagacccaca caaatggtat 13021 gttggagctt aagttcatta gcttcatggt aaatctacac ggtcacacag ctggttaggg 13081 actgaggtgg gatgggaact tgtgtccttg gacttaattc ttaggctcct tatactatac 13141 acactgtcca ggacaaggtc cttgctcttg aaggactcta agtatgtgtg gacagtaaag 13201 aacaggatga tgccctaaga tgttatcaaa tgagtcctca gtagagtgaa acatctagaa 13261 ttctagagtt ctatcgctca gggaagggat cgaagggagc tgtgggagct gggggagagt 13321 tcgaggagga aatgaatatt tagtagcttt ctaagaactg gcaggttttg gaaggtcatg 13381 tctttataga gagtgccttt ctatcagggg acacagcatt gctcaaagac ataaatgcag 13441 aaaggaacag ggcatgcttc agagcagtga gatgaggaga gatgagataa tccatgaggc 13501 cgatgaacca ccagggctgc ttctgagttc tgatgggggg cccttcctgc ccactcagct 13561 cctccttaca tccggttttg attgtgggct cagggcttag cctggaggtc ggcctcccat 13621 acgcttggtc tgggctctcc tctctaccgt ctcctttcct cactacattc ccaatgtgat 13681 gcttaaggac agctttgaac ccattcaccc agaccctctg agttgcatgc tcagggagct 13741 cagtgctcca taaaacaaaa acaaaaggtg ccttttgcaa gaaactggat gaaacaagta 13801 gtctgctttt ctggagctca cagtctgccc ttcctttttt ttttttgaga cggagtcttg 13861 ctctgtcgcc caggctggag tgcagtggag cgatcttggc tcactgcaag ctcagcctcc 13921 cgggttcaca tcattctcct gcctcagcct cctgagtagc tgggactaca ggcttctgcc 13981 accccgcctg gctaattttt ttggtatttt tagtagagat ggggtttcac catgttagcc 14041 aggatggtct caatctcttg atctcgtgat ccacctgcct cggcctccca aagtgctggg 14101 attataggca tgagccaccg cacccggcct gcccttcctt ttattgctgg gcaggggcgg 14161 catgaggggt aagtactggg gactaatgtg aataccaccc agaatcccac aactgggtta 14221 ctgtagtgat tttccttttg catatttcct tctggacttt gcatatatga aaaggctcca 14281 
ctgatgatct agcataggac atgggccaca ttagtgacaa gaacaccagt gaacaagtct 14341 gattttcatg gagttatgtc acttatttct gcttaaaagt gtcagcccat ctctagagtg 14401 acacttcacc ttgccagaga tggtctctgg agaaagaaag cctgctggaa atgcctcctg 14461 gtgccaactc tgtgcatgct ctgcctccag gtggtacatt ggcagagccc ccatatgcac 14521 gcctactacc cagccctcac ctcttggccc tccctgctag gagacatgct ggctgatgcc 14581 atcaactgct tgggattcac ctgggtgagt agcaacggct gtaactactc gcaatgggga 14641 acgtagcagg gagtggtatc ctagggcaga catattttgt cttggtttct gaatatggct 14701 ccacaataaa tattaatgag ggctatattc tttaagctta cgctttattt taattttctc 14761 atcacatcct ttaaccctat ggaatcacga gtattgatcc cattttgtag gtgaagaaat 14821 cgaggcttaa gcaacttatc tcccccagat cacacagctc atagagacac agataagact 14881 tggatacaga tctggttatt gccacagttt gtgaccttac ctaagcaatt tgatgcctat 14941 tgagtcacca cttcctctgt gctcttgtat ttgagtagct ggagatatgt cctaagactt 15001 cccttaataa ccccttaagc accactcagc tttaaatggt gccaaaagcc ttccagaaca 15061 tctttttatt ttaagcctga actcctatgg gtgtgtgggt gggtgggcgg gggtattcat 15121 atcttttcta ggaaaacatc cccctgggca tttgaggact tctcagctat gtattttagc 15181 gacttggaga accaaatcca agccaaccat ttctagggct gtttctccat agagccttgg 15241 agaattcgcc tcattgaggg cagcatgaat ccaaaacccc acattatttt ctcttttggt 15301 tgacagagag tggaccagga gagaggcacg tggcctaaca tgaccccttc tccagaaatg 15361 ttcccattga caccatcagg ggcagacatg tttctggagc tcgtgggggc acctcagtgc 15421 cctggggctg tgcagtttga agttcctttt cctttctcag gcatccagcc ctgcgtgtac 15481 agagctggag atgaacgtca tggactggtt ggcaaaaatg ctgggacttc cagagcactt 15541 cttgcaccac caccccagca gccagggcgg aggcgtcctg caggtacctc ccttggcaaa 15601 gcttcaccta gcttgggggc taagtagcga aagacaagcc aaaccctaac ctggcaattc 15661 tcccgttctc catggagttt cccccacatt acccacaccg tcatcaccaa cagcaccatc 15721 atcacagctt cacctctact cccacctcag cttctgcttc attcacatgt actgctactt 15781 cctttgaaag aaattaaaga gccaacacat gatcacattt ccagtatgac ttgattcagg 15841 tttcttagtc tataatctga caattccttt ttggtgtccc attttgtatc tcacggtaaa 15901 aggggaaagg agaaggcaag gagaggactt ctctttctct ccctgctagg 
ttcaggagat 15961 ccccttgcag tgagagcaat taacagcata aactgattaa cggtatcctc tgaccctatc 16021 ttctggatac ttgtcaaatt atggttcttc aaatatttct ttttttcaag agtagtgcca 16081 tggagtggag ctttctgggt gcctcccatg gaccacacac attgttaggt gcttttgcat 16141 ctataaatcc ttgacgttca tgagggcttg gttctaagac ctttgagatt cctcagattc 16201 ttgaattctt tttttaaata aaaatcgtgg ctggacatgg tgactcaccc tgtaatctca 16261 gcacattgag aggctgagac agatggatca cttagcccag gagttcaaac cagcctgggc 16321 aacatggtga aactgatctc tccaaacaaa caaaattagc tgggtgtggt ggcatgtgcc 16381 tgtagaccca gctacttggg atgctgaggt aggaggattg cttgagccca gaagattgag 16441 gctgcagtgg gctatgattg tgccactgcc ctccagcctg ggcgacagtg taagaccttg 16501 tctcaaaaaa taaaaaaatt gccatgttgc tctgagatga gaaccaggaa gttcactagg 16561 tcctcttgct agctaagtga ctccctctct cactgcagcc ttgcagcaag attaaaatct 16621 gaacctcgga agaaagacaa attgcagcat gttgtggaca ttttggattg tcagcaaata 16681 gttacaactt cagatatcat atttgggaga acatgctaaa tgtctgcggt acatgatctt 16741 gcttttcatc tgattatctc aggaaccctg caggtgaata gtaccattcc ccaatgtttt 16801 gagataagag cactgaagtt cagagagatt aaataatttg ccctagctta cacagctgat 16861 aagcagaaga acgaggattc aaactgatgc ctgtctaaat ccaaaccatg ctttctcctg 16921 tatcgtcagc catgtgaact tgagttagtc ccatcacctc tctgagcatc actctcctca 16981 tcttagtgtg acaagaatca taattatctc aaagaggtgt tgtgaggagc aagtagaatg 17041 ccagaaagtg ttttgtaaac tatgaattgc aatacagatg gaaggtgttg ttaccattat 17101 tattattttt aaaaacaggg cctggttctg tctcccaggc tggagtgcag tggagccatc 17161 atggctcact gcagcctcca tctcctgggc ccaagcaatc ctccaacctc agcctcccac 17221 gtagctggga ctacaggtgt gcaccactac acctggcttt ttgttttttg ttttttttgt 17281 agagacaggg tctcactatg ttgtccaggc tggtctcaaa ctcctggact caagggatcc 17341 tcccaaagtg ctgggattac aggtgcgagc cactgcaccc agctgattat tattattatt 17401 atgctacata cttctttaca taaatagctt tctttttatg gagtaactcc caggcactta 17461 atatctgtgg gatctggtcc acaccactct ccatgacagt gtctgccagg catttttagc 17521 aggattctat cagctgtagg tcacttgaag atttcaagaa gcaaagacat caaagactgg 17581 gtgaatgaat gataaggaac cataaagact 
cctgtgaagt tcctgaggtg gccccgaaga 17641 tttccacaat catggccaga gcaaaatcct acccaaccct tgtttggata atctatactc 17701 ccatcctggg aagcagttaa agggatgctc tccatttcat agacagaaaa gagtcttctg 17761 gaggggaaat gttagcccaa ggtcacaaga gctattgaca actaaatggg cccaagtcac 17821 cagcctctca cacaggccag gatcctctct tcctggggac tcaaactcca tatccatttg 17881 tgtaataagc tgagagttct tcttttaaat atgttttaca tggggagcaa acacatccag 17941 gagggacaga acattctctt tttggtttct gtggcctctg taatagagag gctggtatta 18001 ggtgcaaact tattttagaa ggcagaatag gaatgaaagt tggggaggga catgggcacc 18061 ctgggcagga ggagctgtgt atttccttct gtgaatagct atgacacaca catacacaaa 18121 cacatacaca cacagagaga gagagagaga gagagagaga gagagagaga gaaaaaacta 18181 gaaagcctgg ggaaatttgg caagagaata cagaaaggct gccatgataa tccttttccc 18241 ctgcagagca cggtcagtga atccactttg attgccctgc tggcagcaag gaagaacaaa 18301 atcctggaaa tgaaaacgtc tgagcccgat gctgatgagt cctgcctaaa tgcccgactc 18361 gtggcctatg cctctgacca ggtgagttgc ccgacgcagg acagcccttc gggttggtgc 18421 ttggcttctg gaatcttggt ccttgagcaa tttttttcta gacgtgagta cgctggggtt 18481 ctgagcagct cagacatggg agcttgtctc cagcaactgt agggaagaca agcttccaat 18541 gatcatattt ggttcccgag tcttaggctt cagcagtttc tggggggtga tcattccagg 18601 atgactcttt gctgacaggt ttctgtggcc cttctaggct cactcctctg tggaaaaggc 18661 tggtttgatt tcccttgtga agatgaaatt tctgcctgtg gatgacaact tctcactccg 18721 aggggaagct cttcagaagg ccatcgagga agacaagcag cggggcttgg tgcccgtctt 18781 tgtaagtcag gatatatgcc taagcttaga atgatagaac ttggatgtga ggaaggaatt 18841 ttggcagaat tcaggtaagt aacatgctgc acacaaaagc agaaaggtcc tcccatacag 18901 tattttacac atcatgaggt gctatccata ggtgtgttat ggaataaatt ttgtgggtca 18961 taaccagcat taaaaagaaa tatagtataa aatattagag tgcacacata tagccagagt 19021 atgtattgtt ttatgaaact tttttttcaa atacacacgt atacatgtac tagattgtta 19081 tataaaatgt gtttcttacg agccatggta aaacacagtc tgaaatatac actgttctgg 19141 tgggatggat gggagtagat cacgtctgtg gtgaaatctt tctcaggcca aatgtgtcac 19201 acgagcccac cctcctgtgt taacttgcct taattttccc ctaggtctgt gcaacactag 19261 ggaccactgg 
ggtctgtgca tttgactgcc tgtcagagct gggccccatc tgtaagtatc 19321 ttccctctgt ggctaggaag acaaggaata catttttaaa tgtctcctaa agcaaggacc 19381 tgaaaccagc tctatgggat tattcttgtt cctctttggt caatgcagag acatgggaag 19441 aaccaccaaa ggtcatgagg gtctcttcca gggatccctg gacattgcct ttcccagtgg 19501 tgtgacaaga gttagaggtg gcctaccttg cctcctgtcc taggaggcga cagtaggaga 19561 gccttcggtt ttctcatcct cttacttgta tgttgaactt tacttaatgc aggctatatc 19621 caaaatacag tgtgaattag gctagaatag aaatgctttc tattctacct tcagaaaagg 19681 aaaagggtag ggtaggggaa atcatgggag agattaaagg cagagaggaa acccagatgg 19741 gactgtttaa tccccaagca agaaaccact aggaaagaga gataccttcc ctctaacttt 19801 tgtattcggt gcactgacat tgctctctgt gctggtggga aattgatttc tgtgccagga 19861 tgttctaggc tgaggccaag agggtgctgg ccctttaccc agtggaagaa aaaggcaagg 19921 gtgaggcttt gtagactttc ccattcggag gtaatgatgc ctccttcagc cccacttctt 19981 caaactgact ccacctgctc ccatctccac acccctccgt ttctcattta gccaaagcac 20041 atcatccttg cagcccacca ctcagcttgg ccaggtgcga gacatctttc cccacttcat 20101 acctcccctc cggccaggtg atgtttgcct cctcaggtgc ccgtgagggg ctgtggctcc 20161 acatcgatgc tgcttatgca ggcactgcct tcctgtgccc cgagttccgg gggtttctga 20221 aggggattga gtatgccgac tccttcacct ttaatccttc caagtggatg atggtgcatt 20281 ttgactgtac tgggttctgg tgagtgtagc agcccagctc cgagcacgca ggaacgcctt 20341 gcctcctctg gagagacctc agccactaat gcccatttgg aaacccacag ggtcaaggac 20401 aagtacaagc tgcagcagac cttcagtgtg aatcccatct acctcaggca tgccaactca 20461 ggcgtggcca ccgacttcat ggtgagtggc cagggacggg cagcctggtg ggctcgggtg 20521 gccagtggga actgtaggcg tctgaccccc caaaatctgt acggtggctc ccttcagaag 20581 gcagcacact atgagctcat ctcataatac tgccaatgca taccttggtt tctaattgtg 20641 cctaagtata tagaatgaaa cccagaaaat ggttaaaaac cctagaaaat gagaccagcc 20701 tttgaggcag tgtgcacatt tcctgaatat ttgtggcatt ttttaattcc cagaaatgct 20761 cttcctacca caggtggacc agttccttct gcctccatct cgggtcatct tttttgtttc 20821 ttacccagca gcagggcagg gtggagaatt tccctggtgg ggaaagagct ttcttgctga 20881 aatcgctttt gtttttttgt ccctgacaga ggcgctgtgg ggcagctctg tggtagtgat 
20941 gggctgcagg acttgtgtga ggttctggaa gcctccaagc attcctgact ttcccaactg 21001 actgcgcctc aactctctaa ctcactctct ctctcttttc ttttgttttc tttctttctt 21061 tctttctttt tttttttctg agacagagtc tcgctctgtc aaccaggctg gagagcagtg 21121 gcgcatcttg gctcactgca acctccgcct tccaggttca agcgattctc ctgcctcagc 21181 tttccaagta tctgggacta caggcgtgag ccaccgcgcc cagcaaacct caactctctt 21241 gattccacat cctcagccag tattttgcct gagggaaacg agttcttaac aaattgactc 21301 cgcagtttga ggatgagccc ttcctttact tgtaggggag acagggatgt aaacgaatgt 21361 attacacata acctggccaa cggaggcagc agagcagagt gggggcaagc ttggtttgga 21421 attccattcc agcagtggcc ctttgggaaa ctaactggac ctctctgagc cttgctttcc 21481 ttatctgtaa aatgagaata gtggcattga ttcatagggt attgtgaagc ttaaatggga 21541 acacgtgtgg atcacctagc aaaatagccg tgtggcaagt gctcaatata tggtaggcat 21601 agtatttctg aaagctcttc tttgaagctg cagagatttg ttaggacagt ggttaggcat 21661 gtaggtgcta aagccagatg gcatgtgttg gagtcctggc tttctgttta ctaacttgtg 21721 tgacctcggg caagtgactt aaactctgtg tccagatttc ctcttgtgtc aaatgaggct 21781 gatgaccaca gttctggctt cttaggcttg ctgtgaggat tatacaaatt aatacactga 21841 agtgctgaga gcagtgtctg acccctgtaa gcatccttcc ttgtgaagag atggttttat 21901 cttatcttct tggacaactg aagtgtcttg cccttttagt cttaaaataa tagcttcgtc 21961 atcacttgat tatactggct attctccctt ggaccagtcc acaccctctt tagatatttg 22021 aagcttttgg attagaattg cagcggtaag cctcacggct tactggtggt gacagaagac 22081 cctctagagg taaagaaaga aagaaaaata atggttcttt atctccactg ctctttctgg 22141 taggttttca gagattttta atgaaaaatt aaaaaaattc cagaacagcc caagggtcag 22201 atatctttag ggataactgt aggtttgctt cctgagtgtg aattgatgcc cattctccta 22261 tgagtgtatt tgggtcatca taactgagga gcattatatt cacccttgcc cttttgaatt 22321 tcatgttacc gtttaaattc atttaccaca gtgcctcagg atcttcccgg ggttcattcc 22381 taggagtttg gagattttac agtgctctct gtcttccaga taactcataa aaatgttaga 22441 gtaaactctt cccagccctg ctctggggaa ccttattatt tttacctctc cacccagaag 22501 gcactgcctt acacttaccc ttcccccacc tttaatccta tggatttgat tttcaagtcc 22561 atgaagaaac tcctagaaaa gtgaagttgc tatctttttc 
atgatcctga gggtcctcag 22621 agctactctc acctgggccc aatgtctttc ccacccaggt ggctgcagct ttgaacagaa 22681 gctcagtcct gctctgagat gagccagaaa cagggcagca tcccaggaga gaagacttag 22741 gtgcttaacc cctggggtcc tggaatcacc taaaatcagg gctagattct ctgacagcgg 22801 agaagctccc tgttttctgc ctctcactgg acagagagca ccagtggagg gagcgttacc 22861 ttagccttgg caagggagcc ctgcaaggac cttacaagga cacagaggag gctggagatg 22921 cctcaaggta aagaagagcc ctgtcaggct ttatacgtcc agattttttg atccaacaga 22981 aaatgttggc tagaaccact ctgctgagtc agcatggatc agttttggaa tagtgacaag 23041 aaaaggagca actgggaata aggaagattg tgaagaaatc aaaataaaat cacctctgcc 23101 tatcctggtt tatttgctgg taaaataaag tagctctctc tatatatgaa ataaattggc 23161 tataactatc tggaaaccag aaccttttga cagatgatcc ctatcatccg ttcattattc 23221 tcttggagtg tccattggac aagaggtgga gttgacactt tccttgttac agcaattttg 23281 ttccattttc aggttttgat gccttctgct ggcacaaaag cccccaagct gctccccagc 23341 acaacagagt tttgcttgga aatcaaaggc ttagtagttt tctgctgggg aaacattgtc 23401 catcaacaaa ggggtaacat cccaactgtg cctggctcag gaaactgggg tgaagggtga 23461 aggtgaggac atttgtgagg atttgataat cttgtgtatg ggcagataaa tacgcagtgg 23521 gacaaacatt tatttttagt caacagaaat tgtggtagaa actttctgac gtttgaggga 23581 gatgtcatta ggaaaccacg tgactcatta gaataactgt aaatgttttc caaatcctct 23641 gctctatttt cagatctgag aaatcagtct gaaattgatt tggaaatttt cagatgtatg 23701 gtgactgcaa ccttgaactt taagctctgt tgaatttggc caccaggagc aaagagtcaa 23761 ttttcatcta acagcggcag aaatttgggt gctcatttga aatctgacag ctggagtgat 23821 ggctctcaga tcacaggaat ctcacaggaa aatgaacaag gcagtgtaag gttaaatagg 23881 ggtcatcttg gctctagaca gaaatgctac tgtgatttgt agctcacatt ttaaaataat 23941 ctaaatagcc aacacctact gacaacttct tctgaggctc ttggcgttct ttgtcctcac 24001 accacttgcc cgggaggtgc aattggttct ttatgtcccg gactgatgag ccagtgaggt 24061 ggcactaccc aaggtcacac agctagtaag tggcacagct ggaggcgaaa cctaggtctg 24121 cagggctcta gtgtctgtgc tctaactacc atactttaca ttttgggagg aataaaaata 24181 gaactttatt ttctgctatt caaggaatat agattcattt agagtaaaaa tgtatttccc 24241 ttatgttgag tttctgggcc 
gggatgaaat gtttaaagca gcacttgaaa gcaaagtgcc 24301 cttctcattg gttcagggta atggacttga cttccacttg tctggctctc agacaagtga 24361 ctattggcca ccttcatcag tgatttggtc acacgatggg gcaacccttg gggagcttgg 24421 cttactcctt agcaacagct cccagagtca tggtgacatc gtccatccct ggagggctgg 24481 gcaggcaaca gataatgccc agaaatgccc gagggagtgg ggcacccttg tctctaacct 24541 tgggcactgt gttcctttgt ttctagcact ggcagatccc cctgagccga cggtttcgct 24601 ctgttaaact ctggttcgtg attcggtcct tcggggtgaa gaatcttcaa gcacatgtca 24661 gacatgtatg tagcatgctg ctgtgggggc tgagcctgtg ttcctctgtt ttctcatggc 24721 gaggaaggct ggtttctact ttgcagatct gcctaggaga cttgcggtgt gtggttcagg 24781 tgtctgggcc tgttcctcag ctcctgggac ttgattctct ttgttaacca gaggagcgct 24841 gagcacaaca tgccctgagg gctgcctgat gcctgtcctc tgctgagtgt ctcaccgttc 24901 cccttgctgt cactgagcac tggggagggt tggccccaga gtctggtttg caggaaagag 24961 aaagagaaag taaatcaagg ggaagtgtgt gtggccaggg taggatccta tctgctttgg 25021 aattacctca atggtcacag ctggctgaat acttttagca ctattatgat tatgcctgaa 25081 ataaagctcc cctggccttg gacaggtaga cctaccaagt ctggaacaaa cctccctgca 25141 tcaacaggca tttattgaag gcctattttg cttaatttat ttctattcaa ttcaacaatt 25201 acaaggtatt gtgctaagaa agggacacag aggtaaatga gacaggtgca gcctctctcc 25261 catgttcttc atggaagaag cgggacttaa ctaagaactc tggccactgg cattgttctc 25321 ccacttctac aagagtccac atgcaagaca aagccattga taaaagtagt gggggaagct 25381 gaagacactg tcctttctgc ctatgtgtgt ctttgtcctt ccacctttcc tctagggaat 25441 tccttcttgg ttaaacaaac acaaaccaaa atgaagtggt ggaagcggta actatgattt 25501 tttttaacat ttaaaaaaaa taattattat gcttacataa taattacgta tatatttatg 25561 gggtatgtgt gaggttttga tacaggcata caatgtgcaa tgatcaaatt agggtaactg 25621 ggtatccacc actcaaccat ttatcatttc tttgtgttag aaacattcta gttccattct 25681 ttaagttagt ttgaaatgta cagcaaatta tagttaacta tagttgcctt attgtgccgc 25741 ctaacattag atctatttaa ttgtattttt gcgcccgtta actatcacct ctttatcccc 25801 cattccctgc tacccttccc agcctctgcc aaccatcatt ctactgtgcg tctccacgag 25861 ctcagtttgc ttacttttta actcccacgt ttgagtgaga acatgtgaaa tttgtccttc 25921 
tgtgcctggc ttattttgct taacataatg ttctccagtt tcctatgact ggttttaatg 25981 ttggtaattt tacctttgta atcaccactg gttgtttacc aatgagcagc tgccaccaac 26041 aatattgagg aactttaggc actttgtgct gtttctagaa atgcagactc aagactggag 26101 ggttgtgggc ggggaatggt tggtggaggt tacaagtaat taggaacatt tttttgctta 26161 cagtttggga ctggatattc aatgtgcaga tcctgactaa cctcccttag cacttctgtt 26221 tgcaaatcaa acccctgaat tcctcctatt agcaacccca ttgcctacca tatgaggccc 26281 acactcctgg gcctggcttt caaggacttg acgattagct cctaccttag cttgggcatt 26341 tcatcgaccc atccagacct catgctcagt ccatctctgc accttcaccc gtgtccttca 26401 cctgaactac cctctcctct ccaaatgttg tgaatggatt aaattctgct tccactagaa 26461 gcttttctgc ctattccatc tgttctggtt tctgcttcat tagcattcct gtgatcacct 26521 actcaaacac ctttacaaaa ctgtgctcaa agttcttcca atgtcccaca atagaagaaa 26581 tggaattaaa ataaagttgt ttgatatgat ttgattttat ctcttcaact tgacggtgat 26641 gataattatt gacacttgcg taatgccttc aatacaaaaa taaattctag ttcttcatta 26701 gatccttagc acagtgcctg gggtggtcac tctaacctgg caaggctcag agggagtaat 26761 tgccctgctt gagggaagaa ggcccgggac tttccctgaa gcctcccaat tccaaatcct 26821 ttcttcatag ttctgggctg ctttctacta cactgagctc ttggaaagtt atataaaatt 26881 aaatttcata tttctgtgtt acagagaagt gctttaaaga atggattttt agttctaccc 26941 tttacctgtt tgatgacctt agacaaatta cttaaccttt ctatatttgt ctgccttatt 27001 tatttattgt tatttatttt tgagatgcag gtctctgctt gctcaggctg aagtgcggta 27061 gtgccatcag agctcactgc agccttgacc tccccaggct caggtgatcc tcccacctca 27121 gcctcctgag tagctgggac tacaggcgca taccatcatg cctggctact ttttgtattt 27181 tttgtagaga tggggttttg tcatgttgcc caggttgttc tcacgctcct gggctcaagc 27241 aatctgccta ccttggcctc ccaaagtgcg aggattacag gcatgagcca ccacccctgg 27301 ccatgccttc ctaatttata aaaatgagaa aaggatacta cctctctcaa attaaatgag 27361 ataatatata tgcagccctt agcacagggt ctacctacta ctcagttaag tactcaatga 27421 gtacctcttc ctcttctctt cttccccttc ttcatcatct ccttcttcat ctaattataa 27481 taaattgtca tcatcatcac catcatcatc atcatcatcc tcatctccta tcacctacaa 27541 tagggattgg taaacatttc ctgtaaaaag ccagatacta aatacttttg 
gctttgtggg 27601 ccatactcta ttgtaattac ctctgtacta caagagctac cgtagacaat atgtaaacaa 27661 atgggaatgc tgtgttccaa aaaactttac aaacacaggt gggggctata tttggccatg 27721 gctgcactgt gcctgccctt gacctagagc ggtgcagggc actggcagca gctctgcttt 27781 catgacctgt aaagtggggc tttggaggtg gcagtgctgc ctgtgctctg tattctcctc 27841 tcccctgacc tctgcttctg ttattgcaaa cagcatgtgt tgtaggaggg ataaaagcac 27901 agtgggagga gatctaccta ctgtcacttg taccttggga aagtcacttt cccttggttc 27961 cttctaaaaa atggaaagat tgagctgggc acaggggcac gtgtctgtag tcccagctac 28021 ttgggaggct gaggtgggaa gctcaattgc tcccaccata gcccaggtgg gagttctggg 28081 ctatggtgtg ctatgccaat tgagtgtccg cactaagttc ggcatcaata tggtgacctc 28141 ccaggagcaa gggatcacca ggttgcctaa ggaggggtga aggggcccag gttagaaatg 28201 gagcaggtca aaactcctgt gctgtttgaa actgaccacg caggggagaa aggggctctg 28261 agaaagattt ggagaaactc atttcattat ggtcaaaaga caggctcagg gccagacatg 28321 gtggctcatg ccagtaatcc cagtactttg ggaggcctag gaaggaggat cgcttaagcc 28381 caggagtttg ataccagggt gaacaaccta gtgagaccac tgtctttaca aaaaaattaa 28441 aaaatgtagc tgggcatggt ggtgcatgtc tgtagtccta gctactcagg aggtgaaggt 28501 gggaggatta cttgagctca ggagttcaag gctgcagtga gctttgattg tgcaacatgc 28561 ctctgtactc tagcctgtgt gacaaagcaa gaccctatct ctcaaaaaaa aaaaaaaaaa 28621 aaaaaaaaag aggttcagct aactcccttg ccctctcttg aaatagagca gtaagtatga 28681 gaactaactg atagaattac ttagcaaata tttttattga gatcttctgg atgcttggag 28741 aaatttttaa aaagaaaaga gaaggcaagt cctgtccttg tttagttagt aatcctgggt 28801 taaaaacaaa aagcaaaaaa ccccaggaca tttaaaaaga tggtatgaac ttcagtggaa 28861 gattaatata atacaaagcc tttaacattt taaaatgtaa caaaatattc ataccataga 28921 atgtaccatt ttactcattt ttaagtgtac aattcagtgg cattaagaat actcacaatg 28981 tggtgtgacc atcaccatta tttccagaat tttttcatca tcccgagcag aaactctgta 29041 tccattaacc aggtccaaaa cctttccatg atgggtggag acaagatgtg aaaatatgtt 29101 ctctatggaa tgtcagaaac aggggatgaa aagggcattt ggggtacagg gtgagaagag 29161 gtttgagagg caagtgaagc atgtaggctt aggatggggc agggccgcca ctagcccata 29221 tggcaacttg atgtgcacta caaacaggca 
tctcttcctt taggcacacg cagctcgcat 29281 gaggcacctg gctgggaacg cagagcatgg actgggtttt ggccctcact tgcccacttg 29341 gccagtcaag gagctgtgaa atcaggagtt gaggccgggt tgcctggact gcaaagcaca 29401 tgccttctgg gttctggggc tgagttgaaa gagggccctg agatgagttc tcgaggtgga 29461 tcccatgtga caggaaacag ggcgccattg gaaattttcc agcttggaat atgagcaaaa 29521 tcaaatggag atgacaatga ggcagaagcc gcttacaacg gctgtgacgg gaagatggcc 29581 ctggtttaaa accatttagc agtctgaaag tttatgcata gtctcttgtc tgatgattaa 29641 tctcagttct ctttatactt gcaattccag ggtactgaaa tggctaaata ttttgaatct 29701 ctggtcagaa acgacccttc ctttgaaatt cctgccaaga ggcaccttgg cctggtggtt 29761 tttcgtctaa aggtaatacc atcttccaag cccctctgtg aatgatgtct tgtggtgctc 29821 cagagcctct cggaaacaca aactgggctc tggggatgct gacagggggt gcgatggaga 29881 cttcacttcc tttatttttc aaacagggtc ctaattgtct cacagaaaat gtgttaaagg 29941 aaatagctaa agctggccgt ctcttcctca tcccggccac tatccaggac aagttaatca 30001 tccgtttcac tgtgacatcc cagtttacca ctagggatga catcctgaga gactggaatc 30061 tcattcgaga tgctgccact ctcatcctga gtcagcactg tacttcccaa cccagccctc 30121 gggttgggaa cctcatctcc caaatcaggg gtgccagagc ctgggcctgt ggaacgtccc 30181 ttcagtctgt cagtggggca ggagatgatc cagtccaggc caggaagatc atcaagcagc 30241 ctcagcgtgt gggagccggt cccatgaaaa gggaaaatgg cctccatctt gaaaccctgc 30301 tggacccagt tgatgactgc ttttcagaag aggccccaga tgccaccaag cacaagctgt 30361 cctccttcct gttcagttac ttgtctgtgc agactaagaa gaagacggtg cgctccctca 30421 gttgcaacag tgtgccagtg agtgctcaga agccactgcc cacagaggcc tctgtgaaga 30481 atgggggctc ctccagggtc agaatctttt ccaggtttcc agaagacatg atgatgctga 30541 agaaaagtgc cttcaaaaaa ctcatcaaat tctacagcgt ccccagcttt cctgaatgca 30601 gctctcaatg tggactccag ctgccctgtt gccctctgca ggccatggtt tagacacagg 30661 gccttcagcc agagtctgag gatatacttc agggactctg tgaacccctc acaattgtat 30721 gccaactttg tgtgcttatg tgtacatgca tttttcttgg ggcgagttca taattttaat 30781 caaattctca taggggcttc atgacccaca ataggataca aacgaagagt ttaagccagc 30841 atgatccaga tgggttcagc agtctggtca gtgagaaagg gccgagggta gacaggcagc 30901 ttctgtggtt 
cagcttgtga catgatatat aacacagaaa taaattatgc ttgtccctga 30961 aacaaaacat accctgtgtc acttaattgg ctgctgaaac attgattaac cagtctggga 31021 gcttaaacat atgtactttt tttgaagcat caattatgag tcaggcactg tggctcatgg 31081 ttcataaatg aggaaaccaa cgtttaggtc acacagcttt aaataggcaa acccaggtct 31141 cctgcttcca gtgaagccca ggctgtttcc accatgcagt actgctcaag gttggacctg 31201 aacaggagct cacagcccag caggctgctg gtcctccagt acatttaaat gtttcctttc 31261 taggtttgga acttgtgcat tttcccctta ttttcctgga cccggtagtc aaataaaagc 31321 tatgctcaca agtggcttgc ccataattag ttcagaggcc aaacacataa ttttatttcc 31381 atttcagatg gtactttgat aggttgtgac tctgaaatgg gttatgtaaa gagtattaaa 31441 gacaaacaaa gctgggcatt agttagaact ttaaggatag atgttaaaca gtaatatact 31501 atcacaatag ggaaaagggt cccatgtgaa ctgaactcaa cttggatttg tgcaaaggtg 31561 actaggcatt tcacagcgtg aatgggagaa tagggagaag gccagcctag acttagaaga 31621 gtcaggaatg tgaaaaatta caaaaagcgg aaagtggttg atgtgaaacc cctctgggtt 31681 tgctaactgg tgctttttga agtaagactc ttaccctccc acagagactg ggagtcaggg 31741 ccctgtgttc aggggttcgc tggaacaaac agttaattct ttcggcagct ttgagttttc 31801 ttaagcaggc actttaaggt gagctaggat caacttaggg atgtggcctt gagctgtgag 31861 aaactgtgtt agtgtttgtt caagtcttta taggcaaagg ttgaggccta gttgagaagg 31921 gggctcggag gagcttagct agagtttggt cgagaaaaga atctttgtta agagtttaaa 31981 gctttttttt tttttttttt tttttttttt ttgagactag tgctttctct ctttcccagg 32041 ctggagtgca gaggcacaat catgtcttac tgcagcctca acttcctggg cttgagcaat 32101 cctcctgctt cagcctccca agtagctatt tatttttgag atggggtctc attctgtgag 32161 tgcagtggtt ggaacacagt tcactgcagc ctcagcctct gaggctcaag ctgtctttcc 32221 acctcagtcc cacgagtagc tgggactatg agtgcacacc actatgcctg gctaattttt 32281 ataacttttt acagagacag ggtctccctg tgttgcccag gctggtctgg aactcctggg 32341 ctcaggtgat c // LOCUS HUMLYL1B 4569 bp DNA PRI 18-MAR-1996 DEFINITION Human LYL-1 protein gene, complete cds. ACCESSION M22638 NID g187266 KEYWORDS LYL-1. SOURCE Homo sapiens placenta DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 2213 to 2572; 2660 to 2751; 3555 to 4256) AUTHORS Mellentin,J.D., Smith,S.D. and Cleary,M.L. TITLE lyl-1, a novel gene altered by chromosomal translocation in T cell leukemia, codes for a protein with a helix-loop-helix DNA binding motif JOURNAL Cell 58 (1), 77-83 (1989) MEDLINE 89324062 REFERENCE 2 (bases 1 to 4569) AUTHORS Mellentin,J.D. TITLE Direct Submission JOURNAL Submitted (28-SEP-1989) Stanford Univ.Med.Cntr, Dept.of Pathology, Palo Alto, CA 94305, USA COMMENT . FEATURES Location/Qualifiers source 1..4569 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="19p13.2" exon 567..911 /gene="LYL1" /note="G00-120-158" /number=1 gene 567..4261 /gene="LYL1" intron 912..2218 /gene="LYL1" /note="G00-120-158" /number=1 exon 2219..2572 /gene="LYL1" /note="first expressed exon; G00-120-158" /number=2 CDS join(2277..2572,2660..2751,3555..3970) /gene="LYL1" /codon_start=1 /db_xref="PID:g386861" /db_xref="GDB:G00-120-158" /translation="MTEKAEMVCAPSPAPAPPPKPASPGPPQVEEVGHRGGSSPPRLP PGVPVISLGHSRPPGVAMPTTELGTLRPPLLQLSTLGTAPPTLALHYHPHPFLNSVYI GPAGPFSIFPSSRLKRRPSHCELDLAEGHQPQKVARRVFTNSRERWRQQNVNGAFAEL RKLLPTHPPDRKLSKNEVLRLAMKYIGFLVRLLRDQAAALAAGPTPPGPRKRPVHRVP DDGPRRGSGRRAEAAARSQPAPPADPDGSPGGAARPIKMEQTALSPEVR" intron 2573..2659 /gene="LYL1" /note="G00-120-158" /number=2 exon 2660..2751 /gene="LYL1" /note="G00-120-158" /number=3 intron 2752..3554 /gene="LYL1" /note="G00-120-158" /number=3 exon 3555..4261 /gene="LYL1" /note="G00-120-158" /number=4 BASE COUNT 936 a 1408 c 1430 g 795 t ORIGIN 234 bp upstream of ApaI site; chromosome 19p13.1-13.2. 
1 aaatccaact tcccacccaa ccctacagcc atggctcttc ctctcacctc ctgggcttgg 61 aaatgtgtgg atactgtgct gagggcttgg gacacctgcc caggctggac acccaaggcc 121 agactgttag aactgttccc tttgggtgcc ccactcccgg gggggtggtg gagaggaggg 181 aagggcctct gtaaaaagag gaaccaggct gggtgaggag aggaggtggg ggcccagggt 241 cacggcccca acgggccgga agggctctcc ctcaccccca tcagcacaga ttcgggcagg 301 gagtgtgcaa agccaaaggg cagatcggac ataagaggag cagcttccct gcctcagttt 361 accccctgcg tgcatccagg cacctccctc cccggctgcc ctggggcccg ggtccccaga 421 ggcccggggt ccctacagtg gctgggggag cggaagcgaa gccctgaggg gtggcggacc 481 gagtcctgcc agcgccggct gggggcggcc gggaggccgc gctgctaggt ccccgcccgc 541 tggtttcctc cggggtcagc caggctcctt atcaggcgcg ccaggcagcc tggcccttat 601 ctgcactggg ccagcatcct ccgcccgtgc gccgccaggg gtgagaggga ggaaaccggg 661 gccgggggcg gggagaaggc gggccggccc gggagccgct cactttccct gggggggacc 721 tacgcggaga cctcggctat cctggccttc cgaggcccac gaggaggcgc ggcccaacgc 781 cggggcctgg agcattgagg ccggaccctc gcgagacagc agagcctggc ctgacgctgg 841 aaaccacacc ctggcccaga ctgccagccc tgacgggaca gagccagggc actcaccagg 901 ctgcaagaac agtgctgggg taagagggga gcgggggatc ccgggcctgg gacccagcct 961 gcattccttt gttcattcct tcattcattc attcaccagc agggacccac tagtgagggg 1021 ccaggcctgc ttccccaggg cctagctgag gaagacaggg cagaggggcc aacagtctca 1081 caccttgctg ggacatcctg gactctggaa ccaagagcaa acagggatgt caaaacagta 1141 tgcaaaactg tggatgatcg cggggcattg tggtgcatgc ctgtaatccc agcactttgg 1201 gaggctgagg caggaggatc acttgagccc aggagttcca gaccaccctg gacaacagag 1261 tgagaacctg cctctacaaa aaaaattttt ttttttaaat tagccgggca tggtggcaca 1321 tgcctctact ctcagctact caggaggctg agacaggagg atggctcgag cccaggaatt 1381 tgaggctgca gtgcactatg attgcaccac tgcactccag cctggacaac agagggagac 1441 cctgtctcta aaaaaataaa caaattttaa aaagcgtgga tgctgatgag gatgggggct 1501 tccaagccag agggagggtt ggggcatgac tggactggac tgggcagtgg ggtcagtgtt 1561 tgggggtcta gggtcagcat ttgaggtcat agtgtcagta gtggggtcac agggttgcta 1621 tgggggatgt ttgcagtggg agtgggggtc actcaagcaa ttatggagca ctttcaggga 1681 cagcaatggg aagaaccgca 
gggtcattct tgaggtgcag agggtcagcc tgggtggggg 1741 ctgggggcct tccgtgaagg gtcagagtcc atcttggagg gattccaagc tcacatggca 1801 tcaatgagtt aatgaggagt ctctgggttg gatctttaga tctgaggata acagggttaa 1861 tcttggggtt tctggggatc ccaggatcag ttaggggctc cctgtcttgt tccctctcca 1921 ggctgcagaa ttcctcagag gctgggcaac agcgcccctc ctgggtcaca aagagctcag 1981 ggacagtgcg ctgcctagca cctggggggc gcctcctact ctaacgtagg acccccctcc 2041 cgtgtcctgg aagtttctgg gcctccccgc catctgcctt tgcctactga aacttctccc 2101 ctcctccttt ccccctcccc ctccctcctt cccatttgag gggtttcttg agctaagcag 2161 gtgggagcgg ggcacctagt cctcctcccc actggctgcc ttctttccca caggtgagta 2221 cccccacgtc ggggtccatg tgcccgcctc aggcacaggc agaggtgggc cccaccatga 2281 ctgagaaggc agagatggtg tgtgccccca gcccagcgcc tgccccaccc cctaagcctg 2341 cctcgcctgg gcccccgcag gtggaggagg tgggccaccg aggaggctcc tcgcccccca 2401 ggctgccacc tggtgtacca gtgatcagcc tgggccacag caggccccca ggggtagcca 2461 tgcccaccac agagctgggc actctgcggc ccccgctgct gcaactctcc accctgggaa 2521 ctgccccgcc cactttggcc ctgcactacc accctcaccc cttcctcaac aggtagtggg 2581 gatctggggt ggggggcagt ggggattggg ggccagggtc cttgcccaca aggacttagt 2641 gacccacgac cccttacagt gtctacattg ggccagcagg accttttagc atcttcccta 2701 gcagccggtt gaagcggaga ccaagccact gtgagctgga cctggctgag ggtgagtgtg 2761 ggtttgtgtg ccttgtgggt ttgtatgcct gtatgtgcac ttgtgggtgc acagaaggcc 2821 tggccgtctc tgtgtggggc tcatgtctgt cctctactcc acctcgggga gacgtgcttc 2881 cagccaacac agagaacagc ccacataggt tctacccatg tgcaaaatcg cagtcccata 2941 gccagcccca cacaaccacc gccacccaac aggccaagca tgtagccgca gtcacggcaa 3001 aacagagcat gccacaccag ggcatgggag ccacaaaatg acagctccac tggaatgtgc 3061 agccacacaa ccacagccac tcgtgaaaca gccacatggc aacatcacac ataaccacag 3121 ccatgagaca taacaagagt ctcacacagt catacaagac acaggacaca gacagtcata 3181 atgagaggac ctctcagacc catgagtaca cccagccagc cgtacttggg cacaatcaga 3241 atgagggccc cacggacagc ttcccagacc aaacaaacac aaggaaatct ttctttaggg 3301 aatctcagtc attgacataa aggtgcccat agtcacagat acagcaggcc cttgtccgta 3361 ggccgggcct gtgatgattc ttgtctggat 
ggctcttggg aggggtgggg agagctgccc 3421 gtggacccct tggtggagaa agccccagcc catggcctgg gtttaagttc ccacgagggt 3481 gctgcgtgtc tagcgggagg gcaggagggg ccgccctttg cgcatggcac caacccggcc 3541 ctccttgtcc acagggcacc agccccagaa ggtggcccgg cgcgtgttca ccaacagccg 3601 ggagcgctgg cggcagcaga acgttaacgg cgccttcgcc gagctgagga agctgctgcc 3661 gacgcacccg cccgaccgga agctgagcaa gaacgaggtg ctccgcctag ccatgaagta 3721 catcggcttc ctggtgcggc tgctgcgcga ccaagccgca gctctggccg caggccccac 3781 ccctcccggg cctcgcaaac ggccggtgca ccgggtccca gacgacggcc cccgccgggg 3841 atccggacgc agggccgagg cggcagcgcg ctcgcagccc gcgcccccgg ccgaccccga 3901 cggcagcccc ggtggagcgg cccggcccat caagatggag caaaccgctt tgagcccaga 3961 ggtgcggtga ccgcacgcgg cagcacctct gagccggagg gcaccaggga ctcggcccag 4021 ggccgtcaag gaaagggcag tggacgtgct gcgcatgttc gggagcgaac tcccccgaag 4081 aaggaccagt gaagacgtca ggggcaaggt ctcgggggtc cggaagggtg atcatcgacc 4141 cccaagggac ccgcagaccc ttaaaaaaat cacccacaac cctctggaag tggccttgcc 4201 cggtcccctt cccaggggcg aggtcggcaa agcaacatgg cagagcagtc ataggaccca 4261 agtggtgcct cattttttcc cgggctgggg tcgcgggggg aggccaggag gggctgggga 4321 ggcttcgctt ttctcaccgc ccctccggag acccaggggc agacgctcgt cgacggctct 4381 gctgccctcc gggccttgga cagaggccca gaatccaagt cggggccgca ccctaccgac 4441 cccgacccag tcccgcacgg tgcgttaaag ggtcaggcgc tctcgctttg tttttcttta 4501 tttctttatt tcacatacac attagccttc aatggagaag ccgagagagt caggcaaaga 4561 tggataaca // LOCUS HUMLYTOXBB 6305 bp DNA PRI 14-MAY-1996 DEFINITION Homo sapiens lymphotoxin-beta gene, complete cds. ACCESSION L11016 NID g292278 KEYWORDS lymphotoxin-beta. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6305) AUTHORS Browning,J.L., Ngam-ek,A., Lawton,P., DeMarinis,J., Tizard,R., Chow,E.P., Hession,C., O'Brine-Greco,B., Foley,S.F. and Ware,C.F. 
TITLE Lymphotoxin beta, a novel member of the TNF family that forms a heteromeric complex with lymphotoxin on the cell surface JOURNAL Cell 72 (6), 847-856 (1993) MEDLINE 93208881 FEATURES Location/Qualifiers source 1..6305 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MANN DR7 homozygous cell typing line" mRNA join(2764..2933,3330..3375,3559..3630,4026..4631) /product="lymphotoxin-beta" exon 2764..2933 /number=1 /product="lymphotoxin-beta" CDS join(2772..2933,3330..3375,3559..3630,4026..4480) /note="The 3' end of this gene lies 2.23 kb from the 3' end of the tumor necrosis factor gene" /codon_start=1 /product="lymphotoxin-beta" /db_xref="PID:g292279" /translation="MGALGLEGRGGRLQGRGSLLLAVAGATSLVTLLLAVPITVLAVL ALVPQDQGGLVTETADPGAQAQQGLGFQKLPEEEPETDLSPGLPAAHLIGAPLKGQGL GWETTKEQAFLTSGTQFSDAEGLALPQDGLYYLYCLVGYRGRAPPGGGDPQGRSVTLR SSLYRAGGAYGPGTPELLLEGAETVTPVLDPARRQGYGPLWYTSVGFGGLVQLRRGER VYVNISHPDMVDFARGKTFFGAVMVG" intron 2934..3329 /number=1 exon 3330..3375 /number=2 /product="lymphotoxin-beta" intron 3376..3558 /number=2 exon 3559..3630 /number=3 /product="lymphotoxin-beta" intron 3631..4025 /number=3 exon 4026..4631 /number=4 /product="lymphotoxin-beta" BASE COUNT 1302 a 1730 c 1618 g 1655 t ORIGIN 1 gaattcctgg gctcagaggt cctcccacct tagccttctg agtagctagg actacagaca 61 ccagctacca catgaggctt tgtagaaatg gggtcttact atgttgccca ggctgatttt 121 gaactcctgg tctcaagcaa tctttccacc ttagccttcc aaagtgctgg aattacagga 181 gtgggccact gcacctggct ctattaacat tttttatttg ctttatcaca tatttatcaa 241 tccatctcac ttttaaatat cttttaaaat tacaaatatc agtacatttt acatctaaac 301 ccttcagaag cttaacattg actggagttc agtatttatt tccccatttc ttttctggcc 361 tgaggaaggc aaattttaca tacaaatctc aagtcagtac tctttttttt ttttgagacg 421 gagtcttgct ctgttgccca ggctggagtc cagtggtgtg atcttggctc actgcaacct 481 ctgccttctg ggtacaagcg attctcctgt ctcagcctcc caagtagctg ggactacagg 541 tttgtgccac catatccagc taatttttgt atttttaatg gagaaggggt ttcaccatgt 601 tggccaggct ggtctcaaac tcttgacctc aagtgatcca cctgccttgg tctcctaaag 661 
tgctgggatt ataggtgtga gccatctcgc ctggcctaat actgttttgt ttgtttgttt 721 ttgtttttaa gacagagtct tgttcttgtc acccaggctg gagtgcaatg gcatgatttc 781 ggctcactgc aacttccgcc tcctgggttc aagtgattct cctgcctcag cctcccaagt 841 agctggaatt aaaggtgcct accaccacgc cccgctaatt tttatatttt tagtagagat 901 ggggtttcac catgttgatc aggctgctct cgagctcctt acctcagatg atccaccttc 961 cttggcctcc caaagtgctg gtattatagg caagagccac tgcgcccagc cccagtattc 1021 agtttttaaa ctgtcttgtt atcaaggctc tggagccaga tgcctgggtt caaattctgg 1081 ttctgccact gactctgtga gctccataag tttcttaacc tctctgtacc tcagtttcct 1141 cttagggttt ttgtcaggat tataattatt ggctgggcat gatggctcat gcttgtaatc 1201 ccagcacttt aggaggccaa cacgggcaga tcacgtgagt ccaggagttt gagcccagcc 1261 tgggcaatgt ggcaaaaatc catctctaca aaaaatgcaa aaattagctg ggcatggtgg 1321 catgtgccta tagtcccagc tattcaggag gctgaggtag gtgaatccat agatcctggg 1381 aggtcaaggc tgcagtgagc catgatcctg ccattgcatt ccagtctggg tgacatagcg 1441 agaccctgtc tcaaaaaaaa aaattattaa agtgtgtaaa tcagtggcat aaacatgtta 1501 agtgcatttt gtgggtcagc tatattatta ttagtattac ggaaacacat agagatgtta 1561 ccaagaaggg gagatgattg gagccacttc cagcttcctt ggacctggtc tttcttccct 1621 tgactctttt tttttttttt tttttttttt tgagagagag agtctcagcc tgttgcccag 1681 gctggagtgc aatggtgcaa tcttggctca ctgcaacctc tgcctcccag gtttaagtga 1741 ttctcctgcc tcagcctcct gtgtagctgg aattacaggc gcgtgccacc acgcccggct 1801 aactttttgt atctttagta gagacagggt ttcaccatgt tggccaggct ggtctcgaac 1861 tcctgacctc aagtgatcca cctgcctcag cctcccaaag tgttgggatt acaggtgtaa 1921 gccactaccc cggctactcc cttgactctt aaccactcat gctgcctaca tctaccattc 1981 atgtggtcct tgctgctttg ttttggttat tcctgcattt atttgtcctt ttattcattt 2041 atgtataaac atttagtaag cacctactaa tggatagggc tcattgtaga cttggaagct 2101 ctctgagggt gggagtatgc ctcgtccatc tgtctttact ttttgtagca agggaggtaa 2161 agctccattt ccatccctcc ttagtgagtc agtagtcagt ggtgaggcta aggcttacct 2221 ctccctttct cactcagcac agggggctgg agatgagcaa gggaacggga ggaggtcagc 2281 ccagtatggg aatcagttct tctcagggaa cccagacatc catccctcaa gattccagtc 2341 cttgtcctag 
tccggccctt gacctcagag acgggatcag ctcttcctcc agcacctacc 2401 ttgagggtat agaagaatgc aaaccacatt ggaaacctgg agatctgtgt tctcatttca 2461 gctctgctga ctggcttcct gcaagctacc ttccctccct gggcctcagt ttctctctct 2521 gctgagccag aagatgtcta aagacccctt tggttccacc ctgagagcct gtctccctaa 2581 cctcaacttc ttccccagtt cagagaaccc aggcatccag ctgccccacc ccagctctgg 2641 gtaaacagga agctgggtga ggggagcagg ggtgtgcgga aagtcccagc caggtgtgca 2701 ggtctacagg gagggggtgg gcccgtccct gaggtatgaa agccccctgc tctggctctg 2761 gttcagtctc aatgggggca ctggggctgg agggcagggg tgggaggctc caggggaggg 2821 gttccctcct gctagctgtg gcaggagcca cttctctggt gaccttgttg ctggcggtgc 2881 ctatcactgt cctggctgtg ctggccttag tgccccagga tcagggagga ctggtgagtg 2941 gctgcaacag gccctggtgg agagttgtat cttgcggatg cttggctccc tctggttgtg 3001 cctgtggtct tttgccccct ctggctcagc tggctcggct gtccctggtg gggatgtctt 3061 gtctctttgc tgactctctt tccatgttcc tgtgatgttg tgcttgtgtc ccgacataag 3121 ccccttgtgt ctcctctcct cttcccgagg tacatctgtt tctccgccca agtacctatg 3181 ccttgcttgt tctcccttct aaggaggtgt gtgttgggga tggtgctggt aggagaaacc 3241 ccaggcctgc agcttgggtc cactttcaga ggggtagggg tgacatgagc tgaatctgaa 3301 ctctgggcac tgtgacccca cccaaccagg taacggagac ggccgacccc ggggcacagg 3361 cccagcaagg actgggtaag agcagactgt ctctccttcc ccgcttcaga ccctcagggg 3421 ctcccagctc cctgctgcgt ccccagatac ctcttcctct aggaatccag gctccccatc 3481 cctgcgccct gttctctcaa gggtagcctg catgggtggc tgccctgccc ccaatcgtgg 3541 actctttgcc ccttccaggg tttcagaagc tgccagagga ggagccagaa acagatctca 3601 gccccgggct cccagctgcc cacctcatag gtaaggacct ccaagacctg aataagagtg 3661 taaataatcc gaaggttcca gttctgctcg cccagagtcc ttcggctcca tgattccagt 3721 gctcggtttc ccacccgctt cacgaccttt tgtcgctcgt gcccactctt acgctcgtcc 3781 ccgcagtgta gtttcttctt ccctccggtg caagcaaaag ccggcctgga ggtccccact 3841 acagcgttct gcaccccaca tccgtgttcc ctcggccccc aactcgcact catcccagaa 3901 acagcaccat ccctcctccc ccggcccggc tcggctcccg caggggctaa aagccgccac 3961 ttccccagaa gtcccaagcc tttaggatcg cattcccaag agcgcgtcgg cccgtgtctc 4021 cgcaggcgct ccgctgaagg 
ggcaggggct aggctgggag acgacgaagg aacaggcgtt 4081 tctgacgagc gggacgcagt tctcggacgc cgaggggctg gcgctcccgc aggacggcct 4141 ctattacctc tactgtctcg tcggctaccg gggccgggcg ccccctggcg gcggggaccc 4201 ccagggccgc tcggtcacgc tgcgcagctc tctgtaccgg gcggggggcg cctacgggcc 4261 gggcactccc gagctgctgc tcgagggcgc cgagacggtg actccagtgc tggacccggc 4321 caggagacaa gggtacgggc ctctctggta cacgagcgtg gggttcggcg gcctggtgca 4381 gctccggagg ggcgagaggg tgtacgtcaa catcagtcac cccgatatgg tggacttcgc 4441 gagagggaag accttctttg gggccgtgat ggtggggtga gggaatatga gtgcgtggtg 4501 cgagtgcgtg aatattgggg gcccggacgc ccaggacccc atggcagtgg gaaaaatgta 4561 ggagactgtt tggaaattga ttttgaacct gatgaaaata aagaatggaa agcttcagtg 4621 ctgccgataa agatgctgag ttgcgacaca cgtcttaatt cagggtgggt gcacgggtgc 4681 gggttaaata ttctcagtac tcttctggtt gcttgaaaca attcatcaca acacagtgta 4741 tggcctttgc tcctagggat gatggtctgc ctgtcccacc ccctccctgc ctctgaatgg 4801 ccaggcccca ccattagccc agttggaggg tgggaggaag ggggacttct caaactccga 4861 agcttctcta ggcatcctga ttttcagggc cacatggtcc caaccagact ctgcaccata 4921 ctcttttctc ttgggtaccc cccaacagtg agaggggtca ttacagagcc cagcaagcac 4981 cactcagaaa ggcccagcag cagagtaagc ccctatcatg acagaggaat gaagcctgga 5041 ggggccccgc acttctcccc ctagagctgc ctgaaggcct ctctgtctcc tacccgacag 5101 tcaactcttc tcctccaagg agcttaattc aaggctcatg gggtctgaag ggaggaggct 5161 gaaggagaaa gaaggggaga atattagaga gagatgggga tggcaggaag gagcctgtgg 5221 tgcctgaaaa caccaggaag ttctggggag gaggaaaaac cgatgcccca cttagggtgt 5281 cccatttagg gtgagacgga aaatcctcac ctttttttca cactttaggt cccccttccc 5341 aaaagtgagt aagtgtgggt gcttctggga tgagtaacag tgtcccccat tacttcatgg 5401 ctgactttca gccacaggct ggaggaggca gagggtgacc caaggcccta tctaggtcac 5461 cccaatgggt caccctaccc cctcagccta ccacatggtt ttctcctgcc tggcacccca 5521 gggctggagg taaagcctaa tttccgaact cagtgggggc tcccagtcta ggggggctca 5581 atttccgtct ccatatttgt ttttggaatt attatttttt tgagacaggg tctcgttctg 5641 tcacccagac gggggtacag tggcatgatc atagcttact gtaacctcaa actcctgggc 5701 ttgagtgatc ctcctgcctc agcctcctga 
ggagctagga ttacaggcat gcaccactac 5761 acctgactaa tctttaattt tttttctaga aacaaggtct tgctatgttg cacaggctgg 5821 tcttgaacta gtgggctcaa gtggtcctcc cacctcagcc tcccaaagtg ttgggataac 5881 aggcatgagc cactgcgccc cacccttatt tgtctttgac tctctccaga agagccttca 5941 tccagggagg gggtgctttt ctctttccgg attacccacc tctcacctct cccctccttc 6001 accacaaaga ccagtgggac caagccggca tgtgagtcct tcacccacat cttattccta 6061 tgtttcattc ttttttaaaa aatagagaca ggatctcact atgttgccca ggttgctctg 6121 gaactcctgg gttcaagcga tcctctcacc ttggccttgc aaagtggtag gattacaggt 6181 gcatgccacc acgtccggca gttcggttcc ttgttcttta ttgtcctcag tctcttcgat 6241 ttcacccact gagagaatgg aaggggatag aacagctgga aactggttga aggaagccag 6301 aattc // LOCUS HUMMCHEMP 2776 bp DNA PRI 13-MAY-1994 DEFINITION Human monocyte chemotactic protein gene, complete cds. ACCESSION M37719 NID g187447 KEYWORDS monocyte chemotactic protein. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2776) AUTHORS Shyy,Y.-J., Li,Y.-S. and Kolattukudy,P.E. TITLE Structure of human monocyte chemotactic protein gene and its regulation by TPA JOURNAL Biochem. Biophys. Res. Commun. 
169, 346-351 (1990) MEDLINE 90290466 FEATURES Location/Qualifiers source 1..2776 /organism="Homo sapiens" /db_xref="taxon:9606" exon <598..673 /gene="SCYA2" /note="monocyte chemotactic protein" /number=1 gene 598..2080 /gene="SCYA2" CDS join(598..673,1472..1589,1975..2080) /gene="SCYA2" /note="monocyte chemotactic protein" /codon_start=1 /db_xref="PID:g487124" /translation="MKVSAALLCLLLIAATFIPQGLAQPDAINAPVTCCYNFTNRKIS VQRLASYRRITSSKCPKEAVIFKTIVAKEICADPKQKWVQDSMDHLDKQTQTPKT" exon 598..673 /gene="SCYA2" /note="monocyte chemotactic protein" /number=1 intron 674..1471 /gene="SCYA2" /note="monocyte chemotactic protein intron A" exon 1472..1589 /gene="SCYA2" /note="monocyte chemotactic protein" /number=2 intron 1590..1974 /gene="SCYA2" /note="monocyte chemotactic protein intron B" exon 1975..>2080 /gene="SCYA2" /note="monocyte chemotactic protein" /number=3 exon 1975..2080 /gene="SCYA2" /note="monocyte chemotactic protein" /number=3 BASE COUNT 700 a 727 c 565 g 781 t 3 others ORIGIN 1 cagttcaatg tttacacaat cctacagttc tgctaggctt ctatgatgct actattctgc 61 atttgaatga gcaaatggat ttaatgcatt gtcagggagc cggccaaagc ttgagagctc 121 cttcctggct gggaggcccc ttggaatgtg gcctgaaggt aagctggcag cgagcctgac 181 atgctttcat ctagtttcct cgcttccttc cttttcctgc agttttcgct tcagagaaag 241 cagaatcctt aaaaataacc ctcttagttc acatctgtgg tcagtctggg cttaatggca 301 ccccatcctc cccatttgcg tcatttggtc tcagcagtga atggaaaaaa gtgctcgtcc 361 tcacccccct gcttcccttt cctacttcct ggaaatccac aggatgctgc atttgctcag 421 cagatttaac agcccactta tcactcatgg aagatccctc ctcctgcttg actccgccct 481 ctctccctct gcccgctttc aataagaggc agagacagca gccagaggaa ccgagaggct 541 gagactaacc cagaaacatc caattctcaa actgaagctc gcactctcgc ctccagcatg 601 aaagtctctg ccgcccttct gtgcctgctg ctcatagcag ccaccttcat tccccaaggg 661 ctcgctcagc caggtaaggc cccctcttct tctccttgaa ccacattgtc ttctctctga 721 gttatcatgg accatccaag cagacgtggt acccacagtc ttgctttaac gctacttttc 781 caagataagg tgactcagaa aaggacaagg ggtgagcccc aaccacacag ctgctgctcg 841 gcagagcctg aactagaatt 
ccagctgtga acccaaatcc agctccttcc aggattcagg 901 atccagctct gggaacacac tcagcagtta ctcccccagc tgcttccagc agagtttggg 961 gatcagggta atcaaagaga agggtgggtg tgtaggctgt ttccagacac gctggagacc 1021 cagaatctgg tctgtgcttc attcacctta gcttccagag accggtgact ctgcaggtaa 1081 tgagtatcag ggaaactcat gaccaggcat agctattcag agtctaaaag gaggctcata 1141 gtggggctcc cagctgatct tccctggtgc tgatcatctg gattattggt ccgtcttaat 1201 gacacttgta ggcattatct agctttaaca gctcctcctt ctctctgtcc attatcaatg 1261 ttatataccc cattttacag cataggaaac tgagtcattg ggtcaaagat cacattctag 1321 ctctgaggta taggcagaag cactgggatt taatgagctc tttctcttct cctgcctgcc 1381 ttttgttttt tcctcatgac tcttttctgc tcttaagatc agaataatcc agttcatcct 1441 aaaatgcttt tctttgtggt ttattttcca gatgcaatca atgccccagt cacctgctgc 1501 tataacttca ccaataggaa gatctcagtg cagaggctcg cgagctatag aagaatcacc 1561 agcagcaagt gtcccaaaga agctgtgatg tgagttcagc acaccaacct tccctggcct 1621 gaagttcttc cttgtggagc aagggacaag cctcataaac ctagagtcag agagtgcact 1681 atttaactta atgtacaaag gttcccaatg ggaaaactga ggcaccaagg gaaaaagtga 1741 accccaacat cactctccac ctgggtgcct attcagaaca ccccaatttc tttagcttga 1801 agtcaggatg gctccacctg gacacctata ggagcagttt gccctgggtt ccctccttcc 1861 acctgcgtcc tcctagtctc catggcagct cgcttttggt gcagaatggg ctgcacttct 1921 agaccaaaac tgcaaaggaa cttcatctaa ctctgtctcc tcccttcccc acagcttcaa 1981 gaccattgtg gccaaggaga tctgtgctga ccccaagcag aagtgggttc aggattccat 2041 ggaccacctg gacaagcaaa cccaaactcc gaagacttga acactcactc cacaacccaa 2101 gaatctgcag ctaacttatt ttcccctagc tttccccaga caccctgttt tattttatta 2161 taatgaattt tgtttgttga tgtgaaacat tatgccttaa gtaatgttaa ttcttattta 2221 agttattgat gttttaagtt tatctttcat ggtactagtg ttttttagat acagagactt 2281 ggggaaattg cttttcctct tgaaccacag ttctacccct gggatgtttt gagggtcttt 2341 gcaagaatca ttaatacaaa gaattttttt taacattcca atgcattgct aaaatattat 2401 tgtggaaatg aatattttgt aactattaca ccaaataaat atatttttgt acaaaacctg 2461 acttccagtg ttttcttgaa ggaaattaca aagctgagag tatgagcttg gtggtgacaa 2521 aggaacatga tttcagaggg tggggcttac 
attttgaagg aatgggaaag tggattggcc 2581 cnntntcttc ctccactggg tggtctcctc tgagtctccg gtagaagaat ctttatggca 2641 ggccagttag gcattaaagc accacccttc cagtcttcaa cataagcagc ccagagtcca 2701 atgaccctgg tcacccattt gcaagagccc acccccattt cttttgctct cacgaccctg 2761 accctgcatg caattt // LOCUS HUMMGPA 7734 bp DNA PRI 06-MAY-1997 DEFINITION Human matrix Gla protein (MGP) gene, complete cds. ACCESSION M55270 J05572 NID g187590 KEYWORDS matrix Gla protein; vitamin K-dependent protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7734) AUTHORS Cancela,L., Hsieh,C.L., Francke,U. and Price,P.A. TITLE Molecular structure, chromosome assignment, and promoter organization of the human matrix Gla protein gene JOURNAL J. Biol. Chem. 265 (25), 15040-15048 (1990) MEDLINE 90368682 REFERENCE 2 (bases 1 to 7734) AUTHORS Price,P.A. TITLE Direct Submission JOURNAL Submitted (20-NOV-1990) Department of Biology, University of California-San Diego, La Jolla, CA 92093 FEATURES Location/Qualifiers source 1..7734 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /cell_type="leukocyte" CAAT_signal complement(2868..2872) /gene="MGP" gene complement(2868..7678) /gene="MGP" CAAT_signal 3276..3280 /gene="MGP" TATA_signal 3358..3366 /gene="MGP" exon 3399..3515 /gene="MGP" /number=1 sig_peptide 3455..3511 /gene="MGP" /product="matrix Gla protein" CDS join(3455..3515,5060..5092,6258..6333,7025..7166) /gene="MGP" /note="The penultimate residue in human MGP is threonine in the gene sequence (ACC, this entry) and alanine in the previously reported cDNA sequence (GCC, GenBank Accession Number X07362). 
This difference is due to polymorphism in the MGP gene, since it is now clear that EST sequences from some cDNA libraries predict a penultimate alanine while EST sequences from other libraries predict threonine" /codon_start=1 /product="matrix Gla protein" /db_xref="PID:g187591" /translation="MKSLILLAILAALAVVTLCYESHESMESYELNPFINRRNANTFI SPQQRWRAKVQERIRERSKPVHELNREACDDYRLCERYAMVYGYNAAYNRYFRKRRGT K" mat_peptide join(3512..3515,5060..5092,6258..6333,7025..7163) /gene="MGP" /product="matrix Gla protein" intron 3516..5059 /gene="MGP" /number=1 exon 5060..5092 /gene="MGP" /number=2 intron 5093..6257 /gene="MGP" /number=2 repeat_unit complement(5470..5788) /gene="MGP" /rpt_family="Alu" exon 6258..6333 /gene="MGP" /number=3 intron 6334..7024 /gene="MGP" /number=3 repeat_unit complement(6679..6888) /gene="MGP" exon 7025..>7166 /gene="MGP" /number=4 polyA_signal 7389..7394 /gene="MGP" repeat_unit 7489..7678 /gene="MGP" BASE COUNT 2303 a 1580 c 1470 g 2381 t ORIGIN 1 ccagtgagaa agctcatcac ttggtctcct ttaaggccag ttggctgcct aacaattttt 61 taaataagag gagccagtat taaatttttg ttcaaagagc acacttgatg catgagacag 121 ggcccatatc tgtatttttc tctactgtat ttccagccta gagttgacaa acagtagatg 181 ctcagtacat ttgttggcta gatagataac ttgatggatg gctggctggc tggctggctg 241 gctggatgga tggatggatg ggagaattat gaaatcatga agctccttct ggccctgaca 301 ggcatggtca ttcttctctt ttctgcctga gagtaggtgg aataggagat ctgtattact 361 ccatggcttc tcttgcttca gttcctacgt tgccaacctc acatgaggag aatcctacac 421 atgtttaaaa actggcaatc atatcactgt ctcatatttc tgttatcact tctgggagtt 481 tcttcaaata ttctctcctc tgaataacac ttcttttttg ttaagggaaa atgtctatat 541 aagtgtcttt cataattatc taaaatctaa ttagaattta gagtttcatg tggtctcgtc 601 ttgacaagat atcccaatta agaaaatgca aactagctgg caaaattaat ttgttcaaat 661 ttcaatattt tctgaaaatt ttcagacagt attctgcaat ctcaaacaat gctattccta 721 accaaagcaa cttttatttc tctgttccca tgtctcgctt ttaatatgtc tcaccttcta 781 caactgcctc cgtttttctc tgtcactcag tctctaccta aaactcaccc agcaaaccaa 841 attggtaagg ctcttctcat ttccccttct ccgttttttt 
ttttttccta cttccattct 901 tttcttctgt cttctctcag atgagtcaat cttggtcctt tctaatgcaa agctcccatc 961 cctgcttcat gcgttagtcc aagtcctcat cataaaaaca tatgactgga gttggcattc 1021 acaaagttgt ctttgaaatg gggagtaagg tgacagagga gaaaaagaag agctctggat 1081 tctcagacat gttaataatt tttacatatc atatataaat gggattttgc agagaagaac 1141 cagaaataga tgggagagca atggacagga aaggcagatg agggaccgaa gagacacagc 1201 tcccaaaaga aagttagcct tacaaaaacc aagacgataa agagaaatgc ttaagtttag 1261 ggaatccagt ggaagcagtg atttaaggtg aacaaaaggt gaaccttaag ttgaaatgag 1321 aagtgtagga ttttcaagtt tagtttctgg gagtgtaaaa ataaaaaaac aattgtgatg 1381 tcagaggctg aaagattata gttgtcattt gaacttgggg ataaaggaga catctatgac 1441 ttggctggaa aagacagagc taatgtacat tgcaaagcac atatttatag caggaaaatg 1501 ggaagatttc tctttaattc tggagatgga gtggggatgg ggagagtaga ctactcattt 1561 taagggtgaa acattggaat tcaacttgtt tgatgttata ttaattggtg gttaattact 1621 aagctaagta cgtataaaac ttttatctat ggctagcttg tccccccaaa gtcatgcaat 1681 atagtgaact ggctttcgca ctttaaatta ttcattgatc atgtaatgat tcagatgatt 1741 catcttccaa gatggacact gaaactaaca ctcatagtag gttgtggttt aaagagtgga 1801 acaaccgcca gtctcattag tggaaattgt gatggttgaa tttatcaagg atgaacatac 1861 acggtcttct ttctgagatt ttctttaaga ttttcgcaca gataatctat ttcttaggtt 1921 ttggagagaa aacttgaatt ttattgatcc ctcagaactc aatctttcag atttcaaagg 1981 agctatttct tttaatgggg actctgttaa tatttataaa agctcttcac aggatggagg 2041 gtgggaggga aactccatcc caacaagaca aaaagaatga agcatgaggc tccacctagt 2101 tcatcactgc tccttgaaat acatcagtat tgaaagacac atccacccca cccccaaccc 2161 agccctattg ctgttccagc tcaagagtca gaggtcccga agctgtagct cttctacaat 2221 agtctccaaa aaatatggtt tatgatttga ttaaagaata ctgcctcgcc agaagctccc 2281 gagaggcaca tctggtagga cagattttgt gattgcaaaa gaagggggaa aaaaagaaag 2341 aaagaaaaga cctctctata caagataacc agaggcatca aactgaaatc ctcctgtgga 2401 aaataagcta gtacttctgg gcctgatggt gtagtgaaaa cctgtgcttg aggatacatt 2461 acagtgaaag agcaaagtga atagtaagta gctattactt acctccttag ggaggtgtgt 2521 tgtttgtctg tacatccccc acagcaccta gcacagtacc ttgcatctca 
cctgccactc 2581 actaaaaagt ctatcaagtt agttaattat cgagacaacg ccctcagaaa tgagagaaca 2641 gtaccctctt atccttgctg cactttccag cactgatacg ctgcctaaaa gaggactagg 2701 gcacaggttt gaattaatgt cacaaaactg gatgggcaag ttacaacggt gttgattaag 2761 gaaacagaac tcatggggca ccggatatct ccatcctgat gaacccttgg aaaaatgcca 2821 aagatgcata tccccaggca aatgcctgat tagtctggga ttgatagatt ggtctaggat 2881 tcagccctac tgggaagatg tctaaattat aatcagtgta gaaagcgaag ttctcctaga 2941 agaagaggca aaggttaaaa agaagaaaag aaaagaaagt gaagtccttt ctcccccaaa 3001 acctctcatc aatcaatcag ggtaacaaac agaacactag ggctctgtct gtggaccaaa 3061 cccaaaagcc ctgcggtcag ggccaggagg gtagatcatg tgtttgtggc aacttcctct 3121 gtgggctttt gcccaggtct gtccccaagc atacgatggc caaaacttct gcaccagagc 3181 agcatcctgt gtaacacagt caggtccagc agttagggaa aactgcccac tcagagtaga 3241 taatatctgg aaggaatgac tgtttgggaa aagttccaat gctagttcag tgccaaccct 3301 tccccacctt ctccagctct ctcccactgg ttcctcccct ctcaactgct ctggttctta 3361 taaaaacctc acagccttcc actaacatcc cataggagcc tctctcccta ctgctgctac 3421 acaagaccct gagactgacc tgcaggacga aaccatgaag agcctgatcc ttcttgccat 3481 cctggccgcc ttagcggtag taactttgtg ttatggtgag aaacttttct cccatttctc 3541 tgtgtttact tttctgcctc tgactttggc ttacttctat ttttcctctc cctcctcctc 3601 ttcttccccc tttctctgtt ataatcttaa agtaccatta ctttcacatt tcccagtctc 3661 cgcagaaact gatctgttct attaagtctt ttttatatcc taaatatcca gagtcttatg 3721 caacttaaca ggcaaacccg ttcagtggta agtctctgta tatctagaaa ctcatatttc 3781 agaaagaaga taccaaattc ccagccccct gcatcctcat ttttaaggat atttatttag 3841 actttggtat caatgggtta agggtattgt ttaaaccact tgcctttgag aaaatccatt 3901 tttatgtgaa gtattaagta tagccctttc tagggactgg acaatctcat gaacttacta 3961 tgtttgttca gttaattaat tttaaaataa agttttacat caaaagaatt ttagaaaaga 4021 atcattttca taactcctgt tgtcagaaaa taaattttgc ctgttttcta tatgtcatta 4081 aatatacctg catttgttca aagcttataa aaggaaatct gaagcaaagt tatttactta 4141 tttcagtctt ttgtttcaat tacctagata ttttcattgt tttaaaattt aaattacatt 4201 aacaaccata aagattatgc ttctcactct tgtattcaca aattttctgt attagaggat 
4261 ttgatttctt cacctccttt ttaagttttg aagaaaattc acttgctggc aaatattaat 4321 agaagcttct tattccaaaa tttatctgct gtgctcagga gagtggcaga aagaagaaaa 4381 gaagcttctt attccaaaat ttatctgctg tgctcaggag agtggcagaa agaagaaaac 4441 ttcggctttg atatcgtttc agttctctct ctgaactggc atcgtgccca gggtgagctg 4501 tcagctggag ctagtggttt ctgtggctgc caatttaaca caggttctta agaggctttc 4561 ggaaccctct tagaaacctg ccctagtaag cccagcagag caactgccct gtagttctct 4621 tgcctggaga aacctggctg tcttctggat ccttcttaat cctctttgac cctgttctca 4681 aacaggctct gaataaatca gagaagaagg ttctctggag acttctgtac agcacttaaa 4741 gtgtcttatt ttgcttgtct gaagacgtca tagcccttgg gaaattttag ctgaaaatgg 4801 ccactccctc cttcaacatc agagaaacta aaatatagag atatccacag caaggccaga 4861 gctagagaaa aacctcataa atcctaaatt cctgaaattt ctaataacca cactgctaaa 4921 tatattcttc atgtttttag actctttcct cttcttccat ccctgtattt aaactatcac 4981 agtgtctaaa ttgataaata ataacataat gaatcatgga taaatattga tataatgaat 5041 cttttttttt taatttcaga atcacatgaa agcatggaat cttatgaact tagtaagtga 5101 atatttaact tctttattca aatcccttgc attaaagaac ctcttcttat ttttaaataa 5161 acaagatgga aagatatata acagggaggg aaaagggggc ctcttttgga aaactaaagt 5221 aaatttttaa atctaatgac tataaaaatt gccaaaggag caatttttta agtttgaagt 5281 agtgcaatat gggatttaag ctacaggcga catatttaga agccataaaa tctcatttgg 5341 aaattttaaa ttggcaccac gtcaactgca cagatggaaa acgaggagta atgacaaatg 5401 gtaaagcaca gagctggacg ccaagtcagc tgggagacca caggcgccac gttaagctga 5461 gtgctgtttt ggtttttttg tgtttttctt tcttgttttt ttttttgaga cagtgtctca 5521 ctctgtcgcc caggctagag tgcagtggtg tgatctcggc tcgccgcaac ctccacctcc 5581 caggttcagg caattctcat gcctcagcct cctgagtagc tgggattaca ggcccatgac 5641 atcatgcctg gctaattttt gtatttttag tagagatggg gtttcaccat gttgtccagg 5701 ctggtctcga actcctggcc tcaagtgatc cacccaccac agcctcccaa agtgccggga 5761 ttacaggcat gagccaccac acccagccag ctgattgctg ttgaatagct ggatttataa 5821 agactgagca taggaggaaa tggcacatca ctctcatttt taatttattc attattttta 5881 tagtgtttaa actgttcatg tatcggcaat ctagttatgc ttcataaatc ctcaggacag 5941 
agaatttctc ctcaaaagga atttaaaatc taccaagtag aaatacagaa attaagaaag 6001 gcaaagtgat cgtccaaact caaaaccaac aaagcctata tgacaagtct ctaagacaca 6061 tggattgatt actgatttca tttgatcagg aagttaatga aatctacttt atactctcct 6121 ttaatttttg ccaatctccg tttatatgag ttgcataagt taaggcactt tcaaatatat 6181 ttgtgtcaag gaatattcac ggaaatattt ccagctatgt gtcgctaaaa ctgcatttat 6241 ttattttctg ttctaagatc ccttcattaa caggagaaat gcaaatacct tcatatcccc 6301 tcagcagaga tggagagcta aagtccaaga gaggtcagta acaaaacttc atgaggagtg 6361 gtcatttttc ccagtgtaga tcacagatct gaattggagt gggaaacagc tttttcatca 6421 tatacattat ttctaattgt atctttaaaa tcaaaaaact taaaagcaat attcagaaaa 6481 caactgaatt attagaaaat tatttgggga aagatccgga aaggagaagg aaggaggaga 6541 gaaaggagga cagaaagaaa acttctattt tcattaaaaa aaaaaaaaaa atctcctgtt 6601 ctgccttccc tccctggttt tttttttggt tggttggttg gtttttctga gacagagtct 6661 cactctgttg cccagactgg attatagtgg cactatctcg tgcctcagcc tcccaagtag 6721 ctgggattat aggcacgtgc taccatgtcc agctattttt gcattttttg tagagacggg 6781 gttttgtcat gttggccagg ctagtcttga actcctgacc tcaagtgatc cacccacctc 6841 agcctcccaa agtgctggga ttacaggcct gagccaccgc acccagcctc tccctgttct 6901 ttaaatatct cttaatatag gggggcatgg agagaaagtc tctccaatat tttcttcttc 6961 ttttccattt ttgtattttt ccactttatc cttctcaatt ttggcctctt cttccacttt 7021 ctaggatccg agaacgctct aagcctgtcc acgagctcaa tagggaagcc tgtgatgact 7081 acagactttg cgaacgctac gccatggttt atggatacaa tgctgcctat aatcgctact 7141 tcaggaagcg ccgagggacc aaatgagact gagggaagaa aaaaaatctc tttttttctg 7201 gaggctggca cctgattttg tatccccctg tagcagcatt actgaaatac ataggcttat 7261 atacaatgct tctttcctgt atattctctt gtctggctgc accccttttt cccgccccca 7321 gattgataag taatgaaagt gcactgcagt gagggtcaaa ggagagtcaa catatgtgat 7381 tgttccataa taaacttctg gtgtgatact ttcatcttgt aaatctgctt tcttttggga 7441 agatattgag atatttaaat catggcccac cttacccaaa ataggagatt ctgttcatct 7501 catatctagt attaattaga aaaataacta cataaaaaga aggaagctaa gaaggcactc 7561 actcagccat aaattctcta aaccctctct accttggaat ccgtgaatgg aatctggtat 7621 gttttttgca 
ggattttcct attgtaaatt gtggcaaata cagggctccc ttcatttgct 7681 tttcatctct tatgcatcaa agtcaaaaac atttctgaat caagataatc taga // LOCUS HUMMHCD8A 7319 bp DNA PRI 07-JAN-1995 DEFINITION Human MHC class I CD8 alpha-chain (Leu-2/T8) gene, complete cds. ACCESSION M27161 NID g187844 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human DNA, clone pLE2B13.5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7319) AUTHORS Nakayama,K., Tokito,S., Okumura,K. and Nakauchi,H. TITLE Structure and expression of the gene encoding CD8 alpha chain (Leu-2/T8) JOURNAL Immunogenetics 30 (5), 393-397 (1989) MEDLINE 90035142 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.-i.Nakayama, 31-AUG-1989. FEATURES Location/Qualifiers source 1..7319 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2p12" prim_transcript <1..>7319 /note="MHC CD8 alpha-chain mRNA and introns" misc_feature 238..256 /note="initiator-like sequence" misc_feature 476..485 /note="Tcr v-beta decamer-like sequence" gene 619..667 /gene="CD8A" CDS join(619..667,763..1116,1700..1810,2017..2127,2890..2920, 5504..5555) /partial /note="MHC CD8 alpha-chain (Leu-2/T8)" /codon_start=1 /db_xref="PID:g386908" /translation="MALPVTALLLPLALLLHAARPSQFRVSPLDRTWNLGETVELKCQ VLLSNPTSGCSWLFQPRGAAASPTFLLYLSQNKPKAAEGLDTQRFSGKRLGDTFVLTL SDFRRENEGYYFCSALSNSIMYFSHFVPVFLPAKPTTTPAPRPPTPAPTIASQPLSLR PEACRPAAGGAVHTRGLDFACDIYIWAPLAGTCGVLLLSLVITLYCNHRNRRRVCKCP RPVVKSGDKPSLSARYV" exon <619..667 /gene="CD8A" /note="MHC CD8 alpha-chain (Leu-2/T8); G00-120-581" /number=1 intron 668..762 /note="CD8 alpha-chain intron A" exon 763..1116 /number=2 intron 1117..1699 /note="CD8 alpha-chain intron B" exon 1700..1810 /number=3 intron 1811..2016 /note="CD8 intron C" exon 2017..2127 /number=4 intron 2128..2889 /note="CD8 intron D" 
exon 2890..2920 /number=5 intron 2921..5503 /note="CD8 intron E" exon 5504..>5555 /note="MHC CD8 alpha-chain (Leu-2/T8)" /number=6 BASE COUNT 1690 a 1933 c 1837 g 1856 t 3 others ORIGIN 1 bp upstream of HindIII site; chromosome 2p12. 1 aagcttcggc tgctgccgcc ctcatggaaa tctcctgggg gaagggagag ggtccttcct 61 cggtgaaaac tggggctgct ctagcgagtt cctcagaagc gggcaggtcg ctagctcctc 121 ttccttttca gccctcagtg cccattttgc caataaaaag tcccaaggtg acagtacaag 181 agacgccttt agtgaaggca aaggaaggga cactcccctc ctttgctgcc tactctcgcc 241 ctcacttctt gaaatctttg gtctcccttc acccactctg tcactctcac aagacaacca 301 tttccaagga ctatttccaa gcccttttcc tcatccccaa acccgcagtt ttcagctgcc 361 cccagttgcc tggccaggct gcctcgacgg ccctattcac gggccccagc ctcctcgccg 421 ggctggaagg cgacaaccgc gaaaaggagg gtgactctcc tcggcggggg cttcgggtga 481 catcacatcc tccaaatgcg aaatcaggct ccgggccggc cgaagggcgc aactttcccc 541 cctcggcgcc ccaccggctc ccgcgcgcct cccctcgcgc ccgagcttcg agccaagcag 601 cgtcctgggg agcgcgtcat ggccttacca gtgaccgcct tgctcctgcc gctggccttg 661 ctgctccgtg agtttgagac gcccgggaag gcaggggatg gggcgggcgg ggggcggggg 721 gctcgggact gcttgctcag cctcgctgct tgcctcctgc agacgccgcc aggccgagcc 781 agttccgggt gtcgccgctg gatcggacct ggaacctggg cgagacagtg gagctgaagt 841 gccaggtgct gctgtccaac ccgacgtcgg gctgctcgtg gctcttccag ccgcgcggcg 901 ccgccgccag tcccaccttc ctcctatacc tctcccaaaa caagcccaag gcggccgagg 961 ggctggacac ccagcggttc tcgggcaaga ggttggggga caccttcgtc ctcaccctga 1021 gcgacttccg ccgagagaac gagggctact atttctgctc ggccctgagc aactccatca 1081 tgtacttcag ccacttcgtg ccggtcttcc tgccaggtcc gcgcgccggg ttgcnncacc 1141 tctccgcgtg gggcttgggg ttcacctcaa cctgttttcc acacctgggc cctgctttca 1201 agcctcttag cgcacctgga agcgacttag atactgttcc catttcacag acacagcaaa 1261 ctgaggctaa cgcgaggctg agtagggact ggggatggct ttttacgtcc ggccgatctg 1321 gactcgagtt cggattcttg tgtgattctg ggcaaatgac ttaaattatc tgcgcctccg 1381 tttgctcgtg ggtaaaatgc ggacagcaat caacccgcgc tccagggttg ctggttcagc 1441 ccgggagaga ggccaggtgc aggggagctg gctcctgcgc tccggctctg ggcgccgcgt 1501 
gcgctgggct gagccgaggc ccgagcacag ggctccgcgc gcatcccggg gtccccatcc 1561 tgtctcctct gctatccctt ctcccaggaa aaccgggaaa aagcatcccc gtagggaaag 1621 gtgcgcgagg ccagggctag ggcaagaaca cctccataac cccagcctgg cccccaacca 1681 cgctctgttg ctcttgcagc gaagcccacc acgacgccag cgccgcgacc accaacaccg 1741 gcgcccacca tcgcgtcgca gcccctgtcc ctgcgcccag aggcgtgccg gccagcggcg 1801 gggggcgcag gtgaggcgtg cgggggcccg gggcgggggc ggcacgcacc gcggggagag 1861 cccatgccag ggacagccct gatactgtag gtagagtcaa gggctgtcca agtggtggga 1921 aaccgagatc tgaggcggag agacgttccc gggacggtgg ggcggttggg aggtcccggg 1981 ccctctccta aatgtcggcg tctttggtgt cgtcagtgca cacgaggggg ctggacttcg 2041 cctgtgatat ctacatctgg gcgcccctgg ccgggacttg tggggtcctt ctcctgtcac 2101 tggttatcac cctttactgc aaccacagta agtcccgagg aatcgcggcg ggaaggagtc 2161 ggccccgctc ctgcccttgg ctctgctagc tccaggcgga ccttgcgagg ggggagcttg 2221 gggttgagtt ttctgggact cccgagctgc cccgcaggta caggcacttc tttctgggag 2281 atagaatgcg agaaagccct tcccgctcct tgcgctctcg gagaggtttt gggagacaaa 2341 gccgcgttcc cagagccgct gcagcgggtg tgttccgtcg acgaagctcg cagcgcgcgc 2401 gcacatgtct ggacacatga caaacgcccg ccgggagggg gccgctgtgg ccgagtcaag 2461 tgcccaggag ggttagtcaa aaccaccatc agtttgaggt tcgtccttgg ccctcttcgg 2521 cgaaccgccc accctgcttt gggtcccatg caagagaggt gccggatcag gtctgcaggc 2581 tccccgcttt cgcccgcgtc tgtagttgca atttttttta attgatgctt gagacgaagg 2641 atggataaga agcccttacc cggctctcaa gagttataat cctcaaagaa ataggggaaa 2701 ggaaggagaa ttcgtagagg caattcccta tttgcctcat ccaatttcag cccatgatcg 2761 ttctgaaatg attttttgta aaaccaaaaa acagcagcaa caacaacagc aacaacaaaa 2821 aatacactgt ctttggggac tattgatgga taggggcact cactttttcc cttctttttc 2881 cttatccagg gaaccgaaga cgtgtttgca aatgtccccg gtgagtctct tttgaactct 2941 cagcctgctc accttggcca gccctggggt ggggtcctgg tttccccagc agcaaatagt 3001 gggaactttt cccagagaga ggtcatggcc gtgaaatcct ggggcaggga gtcagccagt 3061 tcctgagcaa agagaggcca aggctggcat ctggcctgtg tctttgagaa aaaggaggcc 3121 cgggttaggc gggtgccaac agcagtgtga ctccagtaaa tctccatctt atttgacctg 3181 gcacttccaa 
aaaaaaaaaa aaaaaacctg agggattccc ccaatttatc ctagctcctc 3241 gtaggacctg acctcctctt tattctgatt attccatctg ggttttgttg ttttcttaag 3301 aaaacaattt tttttcctac ttggctggtc tagttttttg agggagagcc aatcttttat 3361 cagctgaacc aaaataataa tggctttggt tgctaacttc tctgtgtcat gtaggacctt 3421 ggtttgctgc caaggactgg agtagaaaaa aggggaacga gatgcaggga caattggcta 3481 tctggcagct caagtgactg cctcatccag tttcagccca tgatcacaca cacacacaca 3541 cacacacaca cactctctct ctctctctcc ctctctgtta aaaacctctc tccacccgtt 3601 ggctgaggca tccctgtggt agcatctaac aagtgcctgc tctagtcagg cactgtgata 3661 ggggtaatac tccccatctc tcttaatcct cacagcaact ttgtgccatc tcatggctca 3721 gggattagta aagtggcaaa ggcacattaa actcagatta tcaatagtaa taaccatttg 3781 atgaaagcct acaacgcacg ttacttgctt tatagatttt ttttaatttt aagttttatg 3841 acaactttta aagtagtcat gattacctgc atttacagat aaggaaactg aggcttagaa 3901 catttagtga tttgtccaag gtcatactgc ccgggagctg caacttccat ccaaagctca 3961 gacaccttcc ccatggctgc aggctgaagt gttaaggata accgactgtg tacaccagtt 4021 cttttcagta gatgaaatat tttacatttt atacctggag tttgcatttt atactccttt 4081 acatttttct gagggaatcc aagttcagga ctttacacat tgttgttttt agcaatggaa 4141 atgcaatgga tttccaatgc atatgttaca ggtgatatac caaggggata aagactttaa 4201 aaatgagccg ggtattgtgg tgcatgccta taatccgggc tactcaggag gctgaggcag 4261 gagaatcgct tgaacgtggg aggcggaggt tgcagtgagc tgagatcgta ccactgcact 4321 ccagcctggg taacagagca agattccatc tcacaaaaaa aaaaaaaagt ggggggggaa 4381 gtttttagct tttcttttct tttcttttct tttttttttt tttttttttt tgagacggag 4441 tctcgctatc tcgcccaggc tggagtgcag tggtgcgatc tcggctcact gcaagctccg 4501 cctcccgggt tcacgccatt ctcctgcctc aacctcccaa gtagctggga ctacaggcac 4561 ccgcccagct gggactacag gcacccgcca caacacccgg ctaatttttt gtatttttag 4621 tagagacggg gtttcaccgt gttagccagg atggtctcga tctcctgacc tcgtgatccg 4681 cccgccttgg cctcccaaag tgctgggatt ataggcgtga gctaccgcac ccggccagtt 4741 tttagcattt ttattattca gggtgcactt tggaactttt gcagaaggac catccctttt 4801 ctcgatcttt tctgaaaact ctcattgtga gatgaataac tgagaataaa gttctgcctt 4861 aaaatacaaa gctatacaga 
ttaaacacac ctgggaaatt ccaacacccg agttaatata 4921 ctcagtgatg atggtgggcc gctcagcttc aacagagagg ttttagaggg ctgagagtgc 4981 agacatgacg ctcatgatga attagcatgt ttacaaggtg ctagcctggt gctgaattaa 5041 cacgttttta gcgtaacatc atttggctga tggtggtgat ttgctcacat cctcctctct 5101 ccctgtcatc ctccggggct gagaatcagg tcagctaatg actttggtta tcagtctcac 5161 tctcacacct ctgtggagac cttgacaggg gctccaggaa gctagactaa ctcggtttag 5221 atcctggctc agccacctcc tagatgtgtg accttgggca agtttaactt ctcaatctca 5281 gtgcccaccc ttaaggctgc tgtgaggatt acatgagatg gtgcaagtaa ggcatttagc 5341 acgggtgctg tgctgacgtc attcacccag tcccatgctg gtgccccagg aacaggttct 5401 cagacctttc aaaggctggg ggattgggag gctggacgtg gctggatgga tgctgcactg 5461 gcggggttct ggctacaatg atggtcgctt tcctctcttt caggcctgtg gtcaaatcgg 5521 gagacaagcc cagcctttcg gcgagatacg tctaaccctg tgcaacagcc actacattac 5581 ttcaaactga gatccttcct tttgagggag caagtccttc cctttcattt tttccagtct 5641 tcctccctgt gtattcattc tcatgattat tattttagtg ggggcggggt gggaaagatt 5701 actttttctt tatgtgtttg acgggaaaca aaactaggta aaatctacag tacaccacaa 5761 gggtcacaat actgttgtgc gcacatcgcg gtagggcgtg gaaaggggca ggccagagct 5821 acccgcagag ttctcagaat catgctgaga gagctggagg cacccatgcc atctcaacct 5881 cttccccgcc cgttttacaa agggggaggc taaagcccag agacagcttg atcaaaggca 5941 cacagcaagt cagggttgga gcagtagctg gagggacctt gtctcccagc tcagggctct 6001 ttcctccaca ccattcaggt ctttctttcc gaggcccctg tctcagggtg aggtgcttga 6061 gtctccaacg gcaagggaac aagtacttct tgatacctgg gatactgtgc ccagagcctc 6121 gaggaggtaa tgaattaaag aagagaactg cctttggcag agttctataa tgtaaacaat 6181 atcagacttt ttttttttta taatcaagcc taaaattgta tagacctaaa ataaaatgaa 6241 gtggtgagct taaccctgga aaatgaancc ctctatctct aaagaaaatc tctgtgaaac 6301 ccctacgtgg aggcggaatt gctctcccag cccttgcatt gcagaggggc ccatgaaaga 6361 ggacaggcta cccctttaca aatagaattt gagcatcagt gaggttaaac taaggccctc 6421 ttgaatctct gaatttgaga tacaaacatg ttcctgggat cactgatgac tttttatact 6481 ttgtaaagac aattgttgga gagaaaatca cacagccctg gcctccgctc aactagcaga 6541 tacagggatg aggcagacct gactctctta 
aggaggctga gagcccaaac tgctgtccca 6601 aacatgcact tccttgctta aggtatggta caagcaatgc ctgcccattg gagagaaaaa 6661 acttaagtag ataaggaaat aagaaccact cataattctt caccttagga ataatctcct 6721 gttaatatgg tgtacattct tcctgattat tttctacaca tacatgtaaa atatgtcttt 6781 cttttttaaa tagggttgta ctatgctgtt atgagtggct ttaatgaata aacatttgta 6841 gcatcctctt taatgggtaa acagcatccg attttggttg tgccatcatt tatttaacta 6901 ttacctgata tttagttgtt tccaaccttt tcctactata acactcttaa aatgttgcat 6961 atatttttgt ttctttagga taaagaccta aaagtaaaat tattgaatca gtccccacct 7021 tcacctgagc ctgccatttt cacatcaatc aaaacagatt agcttcatcc tgcattaccc 7081 ttcttccttt tgttctttct tggtaacaga ggagacgatg ctcctttcat gataaatcac 7141 taagtcatct tgttgttgag gtaaaatatt cgtactgaaa attgcttaag tcgtaaatgt 7201 acagctcgat gacatcataa agcaaacaca cccatgtaat cagccctcca gaagtctgtc 7261 gttagtttta aaaccaagat acttttagag aatggaaagc aagagtggct ttctgttct // LOCUS HUMMHDC3B 8090 bp DNA PRI 19-SEP-1995 DEFINITION Human MHC class II HLA-DC-3-beta gene (DR3,3). ACCESSION K02405 NID g187979 KEYWORDS antigen; cell surface glycoprotein; class II gene; glycoprotein; histocompatibility antigen; integral membrane protein; major histocompatibility complex. SOURCE Homo sapiens (clone: lambda-42) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8090) AUTHORS Boss,J.M. and Strominger,J.L. TITLE Cloning and sequence analysis of the human major histocompatibility complex gene DC-3 beta JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (16), 5199-5203 (1984) MEDLINE 84298108 COMMENT A pseudo-exon is noted by [1] at 6789-6809. 
FEATURES Location/Qualifiers source 1..8090 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-42" /cell_line="WT49 (DR3,3)" /cell_type="lymphoblastoid" prim_transcript 646..7793 /note="HLA-DC3-beta mRNA (alt.)" prim_transcript 649..7793 /note="HLA-DC3-beta mRNA (alt.)" prim_transcript 652..7793 /note="HLA-DC3-beta mRNA (alt.)" CDS join(705..813,2235..2504,5395..5676,6193..6303,7421..7434) /note="prepeptide" /codon_start=1 /product="MHC HLA-DC3-beta" /db_xref="PID:g386916" /translation="MSWKKALRIPGGLRAATVTLMLSMLSTPVAEGRDSPEDFVYQFK GMCYFTNGTERVRLVSRSIYNREEIVRFDSDVGEFRAVTLLGLPAAEYWNSQKDILER KRAAVDRVCRHNYQLELRTTLQRRVEPTVTISPSRTEALNHHNLLVCSVTDFYPAQIK VRWFRNDQEETAGVVSTPLIRNGDWTFQILVMLEMTPQRGDVYTCHVEHPSLQSPITV EWRAQSESAQSKMLSGIGGFVLGLIFLGLGLIIHHRSQKGLLH" sig_peptide 705..800 /note="MHC HLA-DC3-beta signal peptide" exon <705..813 /note="MHC HLA-DC3-beta prepeptide" /number=1 intron 814..2234 /note="HLA-DC3-beta intron A" exon 2235..2504 /number=2 intron 2505..5394 /note="HLA-DC3-beta intron B" exon 5395..5676 /number=3 intron 5677..6192 /note="HLA-DC3-beta intron C" exon 6193..6303 /number=4 intron 6304..7420 /note="HLA-DC3-beta intron D" exon 7421..>7434 /note="MHC HLA-DC3-beta prepeptide" /number=5 BASE COUNT 2083 a 1865 c 1912 g 2230 t ORIGIN 1 bp upstream of BamHI site on chromosome 6. 
1 ggatcctttt gtaacattaa acaagtcata ttaatcttaa atttgtatat gtgaagatct 61 agatgtaaaa tgcatgaaac atgatccaca ttttacaaag agaagcctgg ggcaaaataa 121 attcagtaat ttgttgactc tcataaagca cattagtggt ggaactgcaa ctcaccatta 181 tttccttcta agaactttgc tctttcacca aaacttaagg ctcctcaggg tgtgtctaag 241 acaacagcag taaaaatgtc tatgacagca attttctctc ccctgaaata tgatccccac 301 ttaatttgcc ctattgaaag atcccaagta taagaacaac tggtttttaa tcaatattac 361 aaagatgttt actgttgaat gcatttttct ttggcttctt aaaatccctt aggcattcaa 421 tcttcagctc ttccataatt gagaggaatt ttcacctcaa atgttcatcc agtgcaattg 481 aaagacgtcc agtgcaggca ctggattcag aaccttcaca aaaaaaaaat ctgcccagag 541 acagatgagg tccttcagct ccagtgctga ttggttcctt tccaagggac catccaatcc 601 taccacgcat ggaaacatcc acagattttt attctttctg ccaggtacat cagatccatc 661 aggtccgagc tgtgttgact accacttttc ccttcgtctc aattatgtct tggaaaaagg 721 ctttgcggat ccccggaggc cttcgggcag caactgtgac cttgatgctg tcgatgctga 781 gcaccccagt ggctgagggc agagactctc ccggtaagtg cagggcagct gctctccaga 841 gccgctactc tgggaacagg ctctccttgg gctggggtac ggggatggtg atctccataa 901 tctcggacac aatcttttat caacatttcc tctgttttgg gaaagagagc tatgttgcat 961 ttccatttat cttttaatga tgaagtgagg acaatccaat cccatcctac ggacttaagc 1021 cgaagaggag gagagaggag agaaaagagg agacaaagtg ttcatttact accagtgata 1081 ggacaaagtg agcatggggt tatttttgaa agatatgaat ttctccaaag acacagcagg 1141 atttgccatt taggcgtgtc ccaagacttg cctgactaaa tattatgatt tcctgcattg 1201 ggaaatgcaa ggcagcaatg gtgtctgtag tctccgtatt tggggaaaag ttgtctgcat 1261 tcctgaccca gtggagcgtt tgtggaggca aaatcttggt actgagggaa gctgactggc 1321 actccacaga aagagagcct tcaggtttcg gcaaatggtg acctgagtgg gattccagat 1381 acccgagttg atgatggact aaatttagta gaaaggagga tgtaaagaag ggaaataaca 1441 catactgtga aaccaactca tttcagacac ggaacaatac tttacataaa ttctctctca 1501 ctccttctaa catcctgtgt gtagatatca tgattttctt ttacacaatt atacttgtag 1561 tatggatatt ctgttacata aactgcccgg gctggtgact gccacagttc aatgggaatc 1621 tagtttatca aattcaaaag cttgtgctct ttcggtgaat aaatgtttct ttctaggact 1681 cagagatcta ggactccctt 
ctttctaaca cagcgtgagt gaacctcaca gggcacttgg 1741 gagggtaaat ccaggcatgg gaaggaaggt attttaccca gggaccaaga gaataggcgt 1801 atcggaagag gacaggttta attctgaacc tgtctcgtca ttcccttgaa ctgtcaggtt 1861 tatgtggata actttatctc tgaggtaccc aggagctcca tggaaaatga gatttcatgc 1921 gagaacgccc tgatccctct aagtgcagag gtccatgtaa aatcagcccg actgcctctt 1981 cacttggttc acaggccgag acagggacag ggctttcctc cctttcctgc ctgtaggaag 2041 gcggattccc gaagaccccg agaggggcgg gcagggctgg cagagttcgg gaggatccca 2101 ggtctgcagc gcgaggcacg ggccggcggg aacttgtggt cgcgcgggct gttccacacg 2161 tccgggccgg gtcagggtgg cggctgcggg ggcggacggg ctgggcgcac tgactggccg 2221 gtgattcctc gcagaggatt tcgtgtacca gtttaagggc atgtgctact tcaccaacgg 2281 gacagagcgc gtgcgtcttg tgagcagaag catctataac cgagaagaga tcgtgcgctt 2341 cgacagcgac gtgggggagt tccgggcggt gacgctgctg gggctgcctg ccgccgagta 2401 ctggaacagc cagaaggaca tcctggagag gaaacgggcg gcggtggaca gggtgtgcag 2461 acacaactac cagttggagc tccgcacgac cttgcagcgg cgaggtgagc ggcgtcgccc 2521 ctctgcgagg cccacccttg gccccaagtc tctgcgccag gaggggcgaa gggtcgtggc 2581 ctctggaacc tgagccccgt ttgttccacc ccagaggaca ggaggcagcg gcgagagtgg 2641 tgggggcagg gtcatcggag gtgcggggac ctaggcagag cagggggaca ggcagagttg 2701 gccaggctgc ctagtgtcgc cccagcctac ccgttcgtcg gccttgtcct ctgctctgca 2761 tgttcttgcc tcgtgcctta tgcatttgcc tccttttgcc ttacctttgc taagcagctc 2821 tctctgctca gaatgcccgc cctcttcccc tgcccgcccg cccgccccac tagcactgcc 2881 ccacccagca aggcccatcg tgcacagctc ttgcagcagg aagcttcagg cttagcctgg 2941 tggagttagg gctgttccac aactgcgcgc aggacattca gcaattacag ttgtgaaata 3001 agatatttta acttttggct tcaaatcatt attcatcgta attctgtttt cttaaatggc 3061 tctcattcat ggcagagatc tttgaggtga gggtgtttta atcattgcat gcctagtacc 3121 tgacacattg actggtatgt ggtgtgagct caatgatctt ctgttaaatt aatgaataaa 3181 tgtactcagc tgcccatcca cttaggctca agaaaaaaaa agaggtaaac agagccttaa 3241 aaatggactt tattaattat tttctataat tttgcttaat gctttaaagt aaactcttat 3301 tgacttggat cttaatagag tttgtgaata caaaatctga ggaaaaaagt ttttgctaaa 3361 aataaaaaca acgcttgaaa gatattgtaa 
ggcagttcaa atttcttttc ttttcttttt 3421 tttttttttt tgagacggat tctcactctg tcgcccaggc cggagtgcag tggcgcatct 3481 cggctcactg caagctccgc ctcccgggtt cacgccattc tcctgcctca gcctcctgag 3541 taggtgggat tacaggcgcg tgccaccacg cccggctaat ttttttgtat ttttagtaga 3601 ggcggggttt caccgtgtta gccaggatgg tctggatctc ctgacctcat gatccgcccg 3661 cctcggcctc ccaaagtgct gggattacag gtgtgagcca cagtgcccgg ccggcacttt 3721 aatttcttag aaaaagctga acaaatggca caatgcaaag agcaaaagtt ttggaataaa 3781 tagattgaag ccattaaatt attggataaa aatagtttcg ggttgctttt ggcctaggtt 3841 ctcccctccc cccatgacta tccacttcag gaataaacat tctgaaagtc aattttaccc 3901 atttgatgag acatttattt ctagacagtt gccttatcaa ataccatcta tgttacgtca 3961 tttaatctca cagttacttg tgcatcagag attagcatca ccactttata tattggtaca 4021 tgataaacac tttattggtc atggatgggg agatggtcac tgtaggctaa tattggtaca 4081 tgataaacac tttaagtaat cagcccataa ttgctcacca agaccttaag cctcccaaag 4141 tacacaacat tctttgtgtt cttcactaca catccataga gtctaaggga cgtaagcctc 4201 gttaaagcca gttttagacc gagaagcagc aatgagtcta tcctgtgtga tttccatgtt 4261 aatgggacaa aatgatactt tcaaggcatt gaaaattcat gattaatcaa tccctagtct 4321 gaccccagtg ttatctatgc aggtttgcaa aacctttagt tcactaatac tccccttgcc 4381 ttcttttgat tcacatccta atgccagcaa atacttattt tttgctattt cagttccatt 4441 tccataaaat ttattttatc atcttttctc ataaatttat gccctctatt tttactccca 4501 atctgttcaa gatgaacaaa tcttataagg ccacatagct gactgttatt tctgttggat 4561 ctcaggaagg agaacctaaa gaaaagttca agtccaagca gaaaccgtga tttcttccag 4621 atgatggctc atgagtgcca tttaattggg gtgccacctg gtgacctcag caaatcccag 4681 ctatatttat gtgttcacat tacaggatca ttaacccaga ccgaccactg cacaagatct 4741 cagaatattt tctatgggag aacatacata ataatgcctg atttcagaag aagaaagtaa 4801 ttctcaatag caaggggatg gagtagggta gacagctgta attaaactca cttgtgtgat 4861 aaaaagaaat taagggaaaa agaaaatgag gagaacatat tactaaataa agaaagcata 4921 cattaaatat ttactatagt ttcacactaa gagaataaag gaaatgcaat aaagtggcct 4981 gaaaggtaaa ggatgagatg tgtaaagggg tgtagtattt ttcatatgac gacgaactga 5041 gaagataaag gaatcgagtt acgggcaaac atgatgtttg 
atcagtgtta tttgttttca 5101 aggcctgcct aaattttttt caaatattac aaacttttga aataacattt ctttttgttt 5161 tttgctgtct gttactaggt tgcacatttc ataaaggcag ggaccatggt atgttgtttg 5221 tctttggatt ctcagtgatt gttatattta tatttgttga aggaacctta atccaagact 5281 tggactccaa gtatctttcc actctggttc caaggaggga cctccctcca gcaggcatgc 5341 tgtgtggtct cacatctcac tcctatatct ttccctgtct gttactgccc tcagtggagc 5401 ccacagtgac catctcccca tccaggacag aggccctcaa ccaccacaac ctgctggtct 5461 gctcggtgac agatttctat ccagcccaga tcaaagtccg gtggtttcgg aatgaccagg 5521 aggagacagc tggcgttgtg tccacccccc ttattaggaa tggtgactgg accttccaga 5581 tcctggtgat gctggaaatg actccccagc gtggagacgt ctacacctgc cacgtggagc 5641 accccagcct ccagagcccc atcaccgtgg agtggcgtaa ggggatattg agtttctgtt 5701 actgtgggcc ccacaagaca aaggacagag ctccttctga cccatccctt cccatctctt 5761 atccctgatg tcactgctga gctgggaatc acaggagact agagcacctc tagttccatg 5821 gcgagtgcat cagaagaatc ctgatctcat cacctttcca gatgctaggg aaattactct 5881 acatactgtt gctctggatc cagtcctgat tgctctgagg aactgattat tagggctggt 5941 gactgggatc ttagggtcta agtttatgga tgagttcctg aggagtggag atctgcttcc 6001 ccactctgtc acctactcac tgtatccaag gacctattgg ctggcctttc cctcccttag 6061 gggtggtctg aatggagaac taggttcctt tgatgccttc acctcctgca tctcagactg 6121 gacttcaagc tcctcatcag ggaaactatg gggtatgggg acaaacactg acactcaggc 6181 tctgcttctc aggggctcaa tctgaatctg cccagagcaa gatgctgagt ggcattggag 6241 gcttcgtgct ggggctgatc ttcctcgggc tgggccttat catccatcac aggagtcaga 6301 aaggtgagga accccagggg aaaaggggaa gatggcctgt gacccagacc ctctgttcag 6361 agaggtcctg tctctagatg tagctctttc ctcctgaccc tgagaggaag aaagctgagc 6421 tggaagtgga aggagacagg acaaggttgg aggaggcatt ggaatctgat tttactagct 6481 gaagggtagc cctgtcacag agctgactga tagagcttat tccagggcat ccttaccatt 6541 catcattgtc tcactggctc ctttccaaaa acttcctcca ttaagagggt cagagcctcg 6601 gcctccttac cttctagtga caatttcctt cattttaggg gatttcaaat tagggtgctc 6661 aaggactcga agaacatgaa tgggaagaga atataactct aattaagtca catgtgtcat 6721 tttcctttgg ggtgagagag tgactgttca tgtaatgaga cctttctctg 
cataacttcc 6781 ttttgtaaga cctcaagggc caccagcagg taatatttcg agccggcatc cagtgtgggg 6841 agggcacagg tgtaagaggg aagagcatga gctgagtgta cctgacagta gtggtctctg 6901 ttcatggtat atttgctgct ataggatcaa gacttagggg tgaagtttgc cagtttctag 6961 gaatctccag aggttgttcc ccagaaccaa gccttaactt tggtggtatc ttcctgttaa 7021 atgtggagcc agaaccacgg cttaaatgtt agacactagg atgatgccca ctttgtgcca 7081 catgatggtg gctactgcct gtaggcattt tccagtgact gaaagaggct gctagtggta 7141 gggatgaggt atcatccaat ttcctaaaaa gattgaaccc ttcatattca ccagaagagt 7201 aacagctgtt cccccacctc ccacacatct gcatcaagct gaagttctgt gtcttcatga 7261 gctgatttct cctttgcaca gatcttgggg gaggtgatga caatacactc tggacctcag 7321 ctttgtctgt ctgaagctgc aggaggcccc tgaggggtgg ggaagatggc aggcccacca 7381 gcgtaccctg tgctgatcat ccctcttctc tcttcttcag ggctcctgca ctgactcctg 7441 agactatttt aactgggatt ggttatcact tttctgtaac gcctgcttgt ccctgcccag 7501 aattcccagc tgtctgtgtc agcctgtccc cctgagatca gagtcctaca gtggctgtca 7561 cgcagccacc aggtcatctc ctttcatccc caccttgagg cggatggctg tgaccctact 7621 tcctgcactg acccacagcc tctgcctgtg cacggccagc tgcatctact caggccccaa 7681 ggggtttctg tttcctattc tctcctcaga ctgctcaaga gaagcacatg aaaaccatta 7741 cctgacttta gagctttttt acataattaa acatgatcct gagttatctg tattctgaac 7801 ttccttaatt gagcagaggc aggaaatcac tgcagaatga aggaacatac cttgaggtga 7861 cccagccaac ctgtgcccag aaggagggtt gtaccttgaa agacactgaa agaatttggg 7921 gtgcaaagtc atggtgggca gaggaggtag aaaatcaact cagttgttgc atcattcatg 7981 gttctttcat attgatgttc agtgcagtgg cctgagaata tcccagcctc tcttctggtt 8041 tggtgagtgc tatataagta aacatggtgg aattgtttgg gggcagatag // LOCUS HUMMIF 2167 bp DNA PRI 29-SEP-1994 DEFINITION Homo sapiens macrophage migration inhibitory factor (MIF) gene, complete cds. ACCESSION L19686 NID g307284 KEYWORDS macrophage migration inhibitory factor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2167) AUTHORS Paralkar,V. 
and Wistow,G. TITLE Cloning the human gene for macrophage migration inhibitory factor (MIF) JOURNAL Genomics 19 (1), 48-51 (1994) MEDLINE 94245178 REFERENCE 2 (bases 1 to 2167) AUTHORS Wistow,G.J. TITLE Direct Submission JOURNAL Submitted (19-JUN-1993) Wistow G.J., Molecular Sturcture and Function, NEI, NIH, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2167 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(1076..1280,1470..1642,1738..1920) /gene="MIF" gene join(1076..1280,1470..1642,1738..1920) /gene="MIF" exon 1076..1280 /gene="MIF" /number=1 CDS join(1173..1280,1470..1642,1738..1804) /gene="MIF" /codon_start=1 /product="macrophage migration inhibitory factor" /db_xref="PID:g307285" /translation="MPMFIVNTNVPRASVPDGFLSELTQQLAQATGKPPQYIAVHVVP DQLMAFGGSSEPCALCSLHSIGKIGGAQNRSYSKLLCGLLAERLRISPDRVYINYYDM NAANVGWNNSTFA" intron 1281..1469 /gene="MIF" /number=1 exon 1470..1642 /gene="MIF" /number=2 intron 1643..1737 /gene="MIF" /number=2 exon 1738..1920 /gene="MIF" /number=3 BASE COUNT 392 a 657 c 717 g 401 t ORIGIN 1 ctgcaggaac caatacccat aggctatttg tataaatggg ccatggggcc tcccagctgg 61 aggctggctg gtgccacgag ggtcccacag gcatgggtgt ccttcctata tcacatggcc 121 ttcactgaga ctggtatatg gattgcacct atcagagacc aaggacagga cctccctgga 181 aatctctgag gacctggcct gtgatccagt tgctgccttg tcctcttcct gctatgtcat 241 ggcttatctt ctttcaccca ttcattcatt cattcattca ttcagcagta ttagtcaatg 301 tctcttgata tgcctggcac ctgctagatg gtccccgagt ttaccattag tggaaaagac 361 atttaagaaa ttcaccaagg gctctatgag aggccataca cggtggacct gactagggtg 421 tggcttccct gaggagctga agttgcccag aggcccagag aaggggagct gagcacgttt 481 gaaccactga acctgctctg gacctcgcct ccttccttcg gtgcctccca gcatcctatc 541 ctctttaaag agcaggggtt cagggaagtt ccctggatgg tgattcgcag gggcagctcc 601 cctctcacct gccgcatgac taccccgccc catctcaaac acacaagctc acgcatgcgg 661 gactggagcc cttgaggaca tgtggcccaa agacaggagg tacaggggct cagtgcgtgc 721 agtggaatga actgggcttc atctctggaa gggtaagggg ccatcttccg ggttcaccgc 781 cgcatcccca cccccggcac agcgcctcct 
ggcgactaac atcggtgact tagtgaaagg 841 actaagaaag acccgaggcg aggccggaac aggccgattt ctagccgcca agtggagaac 901 aggttggagc ggtgcgccgg gcttagcggc ggttgctgga ggaacgggcg gagtcgccca 961 gggtcctgcc ctgcgggggt cgagccgagg caggcggtga cttccccact cggggcggag 1021 ccgcagcctc gcgggggcgg ggcctggcgc cggcggtggc gtcacaaaag gcgggaccac 1081 agtggtgtcc gagaagtcag gcacgtagct cagcggcggc cgcggcgcgt gcgtctgtgc 1141 ctctgcgcgg gtctcctggt ccttctgcca tcatgccgat gttcatcgta aacaccaacg 1201 tgccccgcgc ctccgtgccg gacgggttcc tctccgagct cacccagcag ctggcgcagg 1261 ccaccggcaa gcccccccag gtttgccggg aggggacagg aagagggggg tgcccaccgg 1321 acgaggggtt ccgcgctggg agctggggag gcgactcctg aacggagctg gggggcgggg 1381 cggggggagg acggtggctc gggcccgaag tggacgttcg gggcccgacg aggtcgctgg 1441 ggcgggctga ccgcgccctt tcctcgcagt acatcgcggt gcacgtggtc ccggaccagc 1501 tcatggcctt cggcggctcc agcgagccgt gcgcgctctg cagcctgcac agcatcggca 1561 agatcggcgg cgcgcagaac cgctcctaca gcaagctgct gtgcggcctg ctggccgagc 1621 gcctgcgcat cagcccggac aggtacgcgg agtcgcggag gggcggggga ggggcggcgg 1681 cgcgcggcca ggcccgggac tgagccaccc gctgagtccg gcctcctccc cccgcagggt 1741 ctacatcaac tattacgaca tgaacgcggc caatgtgggc tggaacaact ccaccttcgc 1801 ctaagagccg cagggaccca cgctgtctgc gctggctcca cccgggaacc cgccgcacgc 1861 tgtgttctag gcccgcccac cccaaccttc tggtggggag aaataaacgg tttagagact 1921 aggagtgcct cggggttcct tggcttgcgg gaggaattgg tgcagagccg ggacattggg 1981 gagcgaggtc gggaaacggt gttgggggcg ggggtcaggg ccgggttgct ctcctcgaac 2041 ctgctgttcg ggagcccttt tgtccagcct gtccctccta cgctcctaac agaggagccc 2101 cagtgtcttt ccattctatg gcgtacgaag ggatgaggag aagttggcac tctgccctgg 2161 gctgcag // LOCUS HUMMIS 3100 bp DNA PRI 03-MAY-1996 DEFINITION Human Mullerian inhibiting substance gene, complete cds. ACCESSION K03474 NID g188560 KEYWORDS Mullerian inhibiting substance; antigrowth protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3100) AUTHORS Cate,R.L., Mattaliano,R.J., Hession,C., Tizard,R., Farber,N.M., Cheung,A., Ninfa,E.G., Frey,A.Z., Gash,D.J., Chow,E.P., Fisher,R.A., Bertonis,J.M., Torres,G., Wallner,B.P., Ramachandran,K.L., Ragin,R.C., Manganaro,T.F., MacLaughlin,D.T. and Donahoe,P.K. TITLE Isolation of the bovine and human genes for Mullerian inhibiting substance and expression of the human gene in animal cells JOURNAL Cell 45 (5), 685-698 (1986) MEDLINE 86218082 REFERENCE 2 (bases 1 to 3100) AUTHORS Cate,R.L. TITLE Direct Submission JOURNAL Submitted (16-FEB-1987) Richard L. Cate, Molecular Genetics, Biogen, Inc., 14 Cambridge Center, Cambridge, MA 02142, USA COMMENT The precise 3' boundary of the signal peptide has not been established and could be at position 265. FEATURES Location/Qualifiers source 1..3100 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="chmis33" /map="174 bp upstream of AflII site" prim_transcript 201..2944 /note="MIS mRNA" CDS join(211..622,1215..1357,1530..1638,1727..1886,1977..2835) /codon_start=1 /product="prepro-Mullerian inhibiting substance" /db_xref="PID:g386953" /translation="MRDLPLTSLALVLSALGALLGTEALRAEEPAVGTSGLIFREDLD WPPGIPQEPLCLVALGGDSNGSSSPLRVVGALSAYEQAFLGAVQRARWGPRDLATFGV CNTGDRQAALPSLRRLGAWLRDPGGQRLVVLHLEEVTWEPTPSLRFQEPPPGGAGPPE LALLVLYPGPGPEVTVTRAGLPGAQSLCPSRDTRYLVLAVDRPAGAWRGSGLALTLQP RGEDSRLSTARLQALLFGDDHRCFTRMTPALLLLPRSEPAPLPAHGQLDTVPFPPPRP SAELEESPPSADPFLETLTRLVRALRVPPARASAPRLALDPDALAGFPQGLVNLSDPA ALERLLDGEEPLLLLLRPTAATTGDPAPLHDPTSAPWATALARRVAAELQAAAAELRS LPGLPPATAPLLARLLALCPGGPGGLGDPLRALLLLKALQGLRVEWRGRDPRGPGRAQ RSAGATAADGPCALRELSVDLRAERSVLIPETYQANNCQGVCGWPQSDRNPRYGNHVV LLLKMQARGAALARPPCCVPTAYAGKLLISLSEERISAHHVPNMVATECGCR" exon <211..622 /note="prepro-Mullerian inhibiting substance" /number=1 sig_peptide 211..261 /note="Mullerian inhibiting substance signal peptide (see comment)" mat_peptide join(286..622,1215..1357,1530..1638,1727..1886,1977..2832) /note="Mullerian inhibiting substance" intron 623..1214 
/note="MIS cds intron A" exon 1215..1357 /number=2 intron 1358..1529 /note="MIS cds intron B" exon 1530..1638 /number=3 intron 1639..1726 /note="MIS cds intron C" exon 1727..1886 /number=4 intron 1887..1976 /note="MIS cds intron D" exon 1977..>2835 /note="prepro-Mullerian inhibiting substance" /number=5 BASE COUNT 433 a 1140 c 1063 g 464 t ORIGIN 1 cacatcaggc ccagctctat cactggggag ggagataggc tgccagggac agaaagggct 61 ctttgagaag gccactctgc ctggagtggg ggcgccgggc actgtccccc aaggtcgcgg 121 cagaggagat aggggtctgt cctgcacaaa caccccacct tccactcggc tcacttaagg 181 caggcagccc agcccctggc agcacccacg atgcgggacc tgcctctcac cagcctggcc 241 ctagtgctgt ctgccctggg ggctctgctg gggactgagg ccctcagagc agaggagcca 301 gctgtgggca ccagtggcct catcttccga gaagacttgg actggcctcc aggcatccca 361 caagagcctc tgtgcctggt ggcactgggc ggggacagca atggcagcag ctcccccctg 421 cgggtggtgg gggctctaag cgcctatgag caggccttcc tgggggccgt gcagagggcc 481 cgctggggcc cccgagacct ggccaccttc ggggtctgca acaccggtga caggcaggct 541 gccttgccct ctctacggcg gctgggggcc tggctgcggg accctggggg gcagcgcctg 601 gtggtcctac acctggagga aggtatgtgg ggcccagccc caagcttggc accgccgtct 661 tccttcaggt gggccgggtc ctcctaggga agatcagggg ctggcagagc ccccaccctg 721 ggcagggagg ctgtggtctt gttcctagga ctgggttgcg ggtccgtggc ctggaaggtg 781 ggcaccacac tctgtcctgt ccccgaagcc cagctcttag acttgcccct gcctcggtgc 841 cagggagaga gctgctgcct tctccccacc cctgaagacg acgcagggct cggggccagt 901 ggaacccttc ttcccacagc cccagcctgt tctcagggcc gctggcctaa gatactccct 961 gcggggaagg ggcttcatcg ggcaccccaa cccagagacc ccagggcggc agccccaccc 1021 acagcctcag acgcagcccc tgcctgcccc tgccgtcacc gctccctggc tgcaggaagg 1081 cagctaagag gggcaccctt gtcccccgct tgaggtcccc tgcacagtgg ccagagcggc 1141 agggacagat cccaaagatt cccggggggt gtggccttca atggctcagg cgtcccctgc 1201 tgtcccggct gcagtgacct gggagccaac accctcgctg aggttccagg agcccccgcc 1261 tggaggagct ggccccccag agctggcgct gctggtgctg taccctgggc ctggccctga 1321 ggtcactgtg acgagggctg ggctgccggg tgcccaggta ccagggagtt gcatggggca 1381 gtgcccgggc cgtggcgggg ggcatgaatt 
tgttgcaggg tctgcagtac tgagaacagc 1441 gtagaaccag tggcgatggg aggaagggga ccggtagagc ggggctgggt aagcctccat 1501 ccagccgggc tgagccctgg tctccgcaga gcctctgccc ctcccgagac acccgctacc 1561 tggtgttagc ggtggaccgc cctgcggggg cctggcgcgg ctccgggctg gccttgaccc 1621 tgcagccccg cggagagggt aggtccgcgt ggagagggac ggggagccgg gtcgactgcc 1681 cccgggcccc cagcccctga gccagccgcg tgcccaccca ccgcagactc ccggctgagt 1741 accgcccggc tgcaggcact gctgttcggc gacgaccacc gctgcttcac acggatgacc 1801 ccggccctgc tcctgctgcc gcggtccgag cccgcgccgc tgcctgcgca cggccagctg 1861 gacaccgtgc ccttcccgcc gcccaggtgc gcgcaggcac cgggacacgg ggcaggagcg 1921 ggcgggggcg gcgtggcctc gtggccgctc tcaactcctc caattgcggg ttccaggcca 1981 tccgcggaac tcgaggagtc gccacccagc gcagacccct tcctggagac gctcacgcgc 2041 ctggtgcggg cgctgcgggt ccccccggcc cgggcctccg cgccgcgcct ggccctggat 2101 ccggacgcgc tggccggctt cccgcagggc ctagtcaacc tgtcggaccc cgcggcgctg 2161 gagcgcctac tcgacggcga ggagccgctg ctgctgctgc tgaggcccac tgcggccacc 2221 accggggatc ctgcgcccct gcacgacccc acgtcggcgc cgtgggccac ggccctggcg 2281 cgccgcgtgg ctgctgaact gcaagcggcg gctgccgagc tgcgaagcct cccgggtctg 2341 cctccggcca cagccccgct gctggcgcgc ctgctcgcgc tctgcccagg aggccccggc 2401 ggcctcggcg atcccctgcg agcgctgctg ctcctgaagg cgctgcaggg cctgcgcgtg 2461 gagtggcgcg ggcgggatcc gcgcgggccg ggtcgggcac agcgcagcgc gggggccacc 2521 gccgccgacg ggccgtgcgc gctgcgcgag ctcagcgtag acctccgcgc cgagcgctcc 2581 gtactcatcc ccgagaccta ccaggccaac aattgccagg gcgtgtgcgg ctggcctcag 2641 tccgaccgca acccgcgcta cggcaaccac gtggtgctgc tgctgaagat gcaggcccgt 2701 ggggccgccc tggcgcgccc accctgctgc gtgcccaccg cctacgcggg caagctgctc 2761 atcagcctgt cggaggaacg catcagcgcg caccacgtgc ccaacatggt ggccaccgag 2821 tgtggctgcc ggtgacccct gcgccgcgcg gactcctgcc ccgagggtcc ggacgcgccc 2881 cagctcgcgc cccttcccat atttattcgg accccaagca tcgccccaat aaagaccagc 2941 aagcaaccgg ctggggtgtc cgtgcgtgtt agggggcccg tgggacctcc cttgccgtct 3001 ctcctcgcgc acggcccggg tccgccctgt agcgctcgct gtctctcccc tgcctgaagc 3061 gccccaccac cgtctttcag gccccggact tggtgccggg 
// LOCUS HUMMKXX 3308 bp DNA PRI 07-JAN-1995 DEFINITION Human retinoic acid inducible factor (MK) gene exons 1-5, complete cds. ACCESSION M94250 NID g188570 KEYWORDS retinoic acid inducible factor. SOURCE Homo sapiens (tissue library: FIXII, Stratagene catalog #946203) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3308) AUTHORS Fairhurst,J.L., Kretschmer,P.J., Kovacs,E., Bohlen,P. and Kovesdi,I. TITLE Structure of the gene coding for the human retinoic acid-inducible factor, MK JOURNAL DNA Cell Biol. 12 (2), 139-147 (1993) MEDLINE 93228828 FEATURES Location/Qualifiers source 1..3308 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="FIXII, Stratagene catalog #946203" prim_transcript 994..3091 /gene="MK" exon 994..1039 /gene="MK" /number=1 intron 1040..1337 /gene="MK" /number=1 exon 1338..1414 /gene="MK" /number=2 gene join(1339..1414,1569..1736,1857..2018,2736..2761) /gene="MK" sig_peptide 1339..1404 /gene="MK" CDS join(1339..1414,1569..1736,1857..2018,2736..2761) /gene="MK" /codon_start=1 /product="retinoic acid inducible factor" /db_xref="PID:g188571" /translation="MQHRGFLLLTLLALLALTSAVAKKKDKVKKGGPGSECAEWAWGP CTPSSKDCGVGFREGTCGAQTQRIRCRVPCNWKKEFGADCKYKFENWGACDGGTGTKV RQGTLKKARYNAQCQETIRVTKPCTPKTKAKAKAKKGKGKD" mat_peptide join(1405..1414,1569..1736,1857..2018,2736..2758) /gene="MK" /product="retinoic acid inducible factor" intron 1415..1568 /gene="MK" /number=2 exon 1569..1736 /gene="MK" /number=3 intron 1737..1856 /gene="MK" /number=3 exon 1857..2018 /gene="MK" /number=4 intron 2019..2735 /gene="MK" /number=4 exon 2736..3091 /gene="MK" /number=5 polyA_signal 3044..3049 /gene="MK" polyA_signal 3066..3071 /gene="MK" polyA_site 3091 BASE COUNT 602 a 997 c 1169 g 540 t ORIGIN 1 ggcggccgcg ctcgtcgggg ccgggggcgg ggccgatccc tccggcttcc cgcttcccgc 61 ggagaacaac aatgaaagtg aaagaggggt ggggcggggg cgagcccggg 
ttctgtggcc 121 catttgccct gtggccttga gcaagcccct cccccaggcc tcgggggctc tcccggtttg 181 ggggaaccgg gcgaggcaat gccacaggcc cagggttaga gggggtgggc acttgcagct 241 gccgatgtgg ctggatctgg aacttctcgg agacggctcc tgtcagcgcc aagtttcacc 301 aaatccaggc ctgcccctcc tcccccagga cccccactcg cagtccctca agcctgtgct 361 cccggaaagg cactgggcga ccgcacccgt ggctttctct gggcgaccgg gtcccagact 421 ccccccagca cagcagagcg cttccctgcc caccccggaa accgccccag gtggccgcgc 481 cccctcccca gcagccagca gggcgccagg gctgagccgg ccgtggaggg gagcgggtcc 541 cgcggttata caggcgccgg ggctccgcgg caggcaagag aagctgaggc ctgagaacgg 601 cccaaacctt ggcgtacggc aggggacgac ctgggatggg ggcagcgggc ggcggcgcag 661 ggagtgggcc gggggccggt gtgcgcgggc gggacggggc ccggggtcgg gagacaccgc 721 ttggaagatg gggccgggag aggcgccgtc gcagcgcaga gggcaccggc ggggagacgc 781 gaggacgcgg ggcccgggaa cacggacgcc ggagtagaag cgcggggggc cgggctggag 841 cgggggcggg gacgcggggg tcgggggcgg tgcgggtttg aggggagggg gcgggcgggt 901 ccttccctgg gggggtgggg agagggggcg ggggcccatg tgaccggctc agaccgttct 961 ggagacaaaa ggggccgcgg cggccggagc gggacgggcc cggcgcggga gggagcgaag 1021 cagcgcgggc agcgagcgag tgagcgcgcg gcgggcccct ggtccgccgg cccgcggccg 1081 atctaggggc tgggggctgg aggcggggtg ggggtctgag ctgcgtcctg ggctcgaggc 1141 gtcccccggg ggagtcgcct cttagcggtg cgtccgggct agcggcgagg ggccgcccca 1201 agtcttccca ccgccgccac cttagcagcc cgacttgggg cctggaaagt ggagcacgcg 1261 gaggtgggag ggccctgcac gcggccccgg tgggaaaggg gacgggccag ggattcagac 1321 tcgggctctc ccctcaggat gcagcaccga ggcttcctcc tcctcaccct cctcgcccta 1381 ctggcgctca cctccgcggt cgccaaaaag aaaggtgatg ggggatgatc gaaggagggc 1441 tggggacggg caggcaggcc cctccacttc tggctggccg cctggttcct agcctggaac 1501 ccaggaaggc ggctcccgag ggagtctccc cgtgccccag tcctgaactc tgttcctcgc 1561 gcgtgtagat aaggtgaaga agggcggccc ggggagcgag tgcgctgagt gggcctgggg 1621 gccctgcacc cccagcagca aggattgcgg cgtgggtttc cgcgagggca cctgcggggc 1681 ccagacccag cgcatccggt gcagggtgcc ctgcaactgg aagaaggagt ttggaggtag 1741 gcgggcgcag tcagagggca gagacggggg cacagcctcg ccgaagcctg ggcggaccct 1801 tggcggaggg 
cggggccgcg ggcgcgcagc gctgacctgg gccgctctct cgccagccga 1861 ctgcaagtac aagtttgaga actggggtgc gtgtgatggg ggcacaggca ccaaagtccg 1921 ccaaggcacc ctgaagaagg cgcgctacaa tgctcagtgc caggagacca tccgcgtcac 1981 caagccctgc acccccaaga ccaaagcaaa ggccaaaggt cagcgaaagg agaagggggt 2041 ggggctgtcg cggggggctg cccccccccc cccgcctgtg aggggacaat tccaagttaa 2101 accttaagtt ttgagtcctg gccagtggct tcctgacatc gcctcacttg gcttccctgc 2161 ctggaaaagt ctgaagatgg gcactacaag agaggccgca ggtgatgctg gggacataaa 2221 tcctccctgg cccaaatagg gaccaactca aactactcca ttggagcatc tggcttagga 2281 cccagggaga gagtcctgga acggcttgcc tttggtcagc tctccagcca cgggcagcat 2341 ttggtcagct ctgccctttc tagtgttggg aggaggtcaa ggcccaccct gggcctctca 2401 gctcactcgt gactcagccc agcgaggcca gcagggcagg ggtgaatctg cccgcttctc 2461 aggtgaggag gctgaggatg cccagggctg ctgtgaccag gactaggact ggaaacttga 2521 aggttttctg atcccaagtg gaaataggaa gctggggatg tcccatgtcc acatcacaat 2581 ggctgcccca tcccctgctt ccgagtcagc tgattggaaa ccactagggg cagatcttct 2641 ccttccctga tgcccgggtg tttgtggagc cggcggtctg caatgggtca gcctaactgc 2701 tgatatggta ttaatatttc tttcttgttt tacagccaag aaagggaagg gaaaggacta 2761 gacgccaagc ctggatgcca aggagcccct ggtgtcacat ggggcctggc cacgccctcc 2821 ctctcccagg cccgagatgt gacccaccag tgccttctgt ctgctcgtta gctttaatca 2881 atcatgccct gccttgtccc tctcactccc cagccccacc cctaagtgcc caaagtgggg 2941 agggacaagg gattctggga agcttgagcc tcccccaaag caatgtgagt cccagagccc 3001 gcttttgttc ttccccacaa ttccattact aagaaacaca tcaaataaac tgactttttc 3061 cccccaataa aagctcttct tttttaatat aaagcccctt cccaaggagt ttgctgtgga 3121 aatgtgtttg ggagtgggaa ggtggggaga aagaccaggc tgtagggact ggtgggtttc 3181 agggggcttg gtggtgggtg ctctccagag ctcatggaaa aagcagaaca attacaacat 3241 ttcttccagg gcccctgaaa ggtgctcccc atcaagtcac ctaagccttt cggtcctcat 3301 ctccctca // LOCUS HUMMRP14A 4439 bp DNA PRI 15-JUN-1989 DEFINITION Human migration inhibitory factor-related protein 14 (MRP14) gene, complete cds. ACCESSION M21064 NID g188689 KEYWORDS migration inhibitory factor-related protein. 
SOURCE Human blood monocyte DNA, clone pUCMRP14. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4439) AUTHORS Lagasse,E. and Clerc,R.G. TITLE Cloning and expression of two human genes encoding calcium-binding proteins that are regulated during myeloid differentiation JOURNAL Mol. Cell. Biol. 8, 2402-2410 (1988) MEDLINE 88302148 FEATURES Location/Qualifiers source 1..4439 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 1001..3825 /note="MRP14 mRNA and introns" intron 1029..1415 /note="MRP14, intron A" CDS join(1431..1580,3448..3642) /partial /note="migration inhibitory factor-related protein 14" /codon_start=1 /db_xref="PID:g386958" /translation="MTCKMSQLERNIETIINTFHQYSVKLGHPDTLNQGEFKELVRKD LQNFLKKENKNEKVIEHIMEDLDTNADKQLSFEEFIMLMARLTWASHEKMHEGDEGPG HHHKPGLGEGTP" exon <1431..1580 /note="migration inhibitory factor-related protein 14" /number=2 intron 1581..3447 /note="MRP14, intron B" exon 3448..>3642 /note="migration inhibitory factor-related protein 14" /number=3 BASE COUNT 1099 a 1203 c 1159 g 978 t ORIGIN 1 atcactgtgg agtaggggaa gggcactcct ggggtggcaa ggtgggaggt gggccctgtg 61 ttcccacagt gggcagggag gtagtgaaag ggaagctggc cggacaggaa gggccattcc 121 aagagggctt tgtgcgcagg gctaagccaa gctttctcca taggcaatgg ggagcaactg 181 gaggttcgta gcaggagaag gacacatcaa gcccaccagg aggctaagta aaaacagttg 241 tctcccaagt tataagttcc tggaaccctt gctgggagca ggatttagaa aaatgatgct 301 gagagatgct agaaacatat tcgccctgag gctctctcac tcagactgca agaggaaggt 361 atcatcagaa ttgcccttaa ccaggaacca gaatagctgg gtccccttcc tgccaagtca 421 gcaaccagct atgtgacctt gctcaggtcc atctccgggt gtcagtttct tcatctacaa 481 tgcaagaggg ttgcccacct ctgagaaccc ttctaacccc aaatctcacc ctatgaatct 541 aagaacacaa cccctcgcca tcctaagtat cacagagcca ggcaagcatg ggtgagagct 601 cagaccatcc ttgttggact aaaaggaagg ggcagactgc catggggggc agccgagagg 661 gtcaggcccc cataggtcct cagcctgctt caacctcaaa ggggatgggg ggctgagtgg 721 
tgccagagga gcagcaggct cgctcgggga gagtagggcc ttaggataga agggaaatga 781 actaaacaac cagcttcctg caaaccagtt tcaggccagg gctgggaatt tcacaaaaaa 841 gcagaaggcg ctctgtgaac atttcctgcc ccgccccagc ccccttcctg gcagcattag 901 cacactgctc acctgtgaag caatcttccg gagacagggc caaagggcaa gtgccccagt 961 caggagctgc ctataaatgc cgagcctgca cagctctggc aaacactctg tgtggctcct 1021 cggctttggt aagtgagctg ccagcttccc caggcagaag cctgcctgcc gattccttct 1081 ttccttccct gacccaactt ccttccaaat cctcctccta gaagccctcc ttggttggcc 1141 ctgcctactt taaagcttct ttcacatttt cttaggtcat gttcccctgg ggcctcctgc 1201 cctcaaatgc tttgcttttt ggcactctgt agatattcta aaaaatcatt ttgtacatgt 1261 gtgtgacagg ccatctccca gttaagttgc agcctgtgct ttctttttat tttgcacttc 1321 ccccactatt tctgtgagtg cttagtagga agtgtcaaag aagcttgaca gcattttctt 1381 ctaagtgtcc caactcttgg ttttccatta cacagacaga gtgcaagacg atgacttgca 1441 aaatgtcgca gctggaacgc aacatagaga ccatcatcaa caccttccac caatactctg 1501 tgaagctggg gcacccagac accctgaacc agggggaatt caaagagctg gtgcgaaaag 1561 atctgcaaaa ttttctcaag gtagggctgg actctggcag gtctgaccca gcctcaccgc 1621 agtttgggtt gacaagggag gatgggagta tgggctacag caatcaaggg gaagatttga 1681 gctcctggag cccagcccca agacgcagcg agtgtcctgt tatacagggc aggtgctcac 1741 agttacacag gacgacaggg tcaagaaatt gctcaattga acacctgcta tttgtcgggc 1801 cctgttctgg gcagagggat gtagtggtaa atgggagccc actattccat gaggagacac 1861 acagtaaagt tgttggccaa taaagagcac agataaagcc aaatgccaat aagtgcctgg 1921 aagaaaatga gatagagtgc gctgtgggca atggggctgg gtggggtgga ggtgaccagt 1981 tagggtacat gagaagggcc tctttgagga ggtaacattt gagctgagcc ccgaatgttg 2041 gggagggaag cccctgagga tgacacttgg cacaaagctg aggagaccct aagcctcagg 2101 gcgaacttgg ggtggaagac ttgggggctt ttctaatcct aagggtctgc ggtggaaaat 2161 gaatgcataa agagcacatg gagagcacct gcacagcact cagggaactg ggaggttttt 2221 cccccgctcc aaaaatgatt aggcagttct aagaaaaagg ctgagcactt ccaacagcct 2281 ttttgttttc ttttcaaatt tggggaaagt cgggaaacag aggcctgcat taagaagggt 2341 ggaacacatg ggtctcagtc tcagttccag tcccggagcc agacatcctg gggtaggtcc 2401 ccagccctcc 
cagtgcccct ccctccgcct tggtaaggtg gagaattgca gccttcagag 2461 ttaggggccc tgacagctct ccataggtgg aggcctcagg caggcaggat gctgggtggg 2521 gtaggcaaga aagggcccag cagagaggcc gcatcggaaa actatcctcc atgtgacccc 2581 ctatgcccgc ttcacccccc acctgacatc ccccaccaga agcaaagcga tgctgtggga 2641 aaggaagcag agcctcatgg atgggctgca caggagagtg ctcgcattgg ctgggtaccc 2701 cacaggttct gggaggggac ttagcgaggt gactcagtgc ctcggcctcc caaagtgctg 2761 ggattacaag catgagccac cctgtccgac catctcccct tttatacttt atcacaccct 2821 tgaggtcagc ggagcacata ctctgctctc tgaccctcca tctcccctgc ccacacctag 2881 gtttttctag tgtttccccg ttgtattggt tgaaataagt ttcactaatt ggtaacctcc 2941 agagggaagg gaagggaggg caggggaagg agtgaagtgc agaggggtag cagagtggaa 3001 ctggcctcta agtcagatct gaatttgcat gccctcaata gtcaagcctg tgaaaactaa 3061 tgaccctctc taggactggt ttcaagtctt cctccaggaa gataccattc ctagctgtta 3121 aagttgttat aaggaccaaa tgaggtgaca tttccaggct tactcatgcc atgaccaggg 3181 caagaccctg gaactcagct tcctcttcta taaatagaga atcagcaccc aagtcacagg 3241 gtcatggagg gaataaactg gagagcgttt ggtatgtgct cagtgtctgc tccattgtgc 3301 gcactcagcc tatggtcatt tttaattttt aaatccagcc ccagggtcga ggcttccttg 3361 tacatttgcc agctggtcat ttactgtgct cccagtcccc acctctggcc acacccagct 3421 ctcacagcct tctctcccca cccgcagaag gagaataaga atgaaaaggt catagaacac 3481 atcatggagg acctggacac aaatgcagac aagcagctga gcttcgagga gttcatcatg 3541 ctgatggcga ggctaacctg ggcctcccac gagaagatgc acgagggtga cgagggccct 3601 ggccaccacc ataagccagg cctcggggag ggcaccccct aagaccacag tggccaagat 3661 cacagtggcc acggccacgg ccacagtcat ggtggccacg gccacaggcc actaatcagg 3721 aggccaggcc accctgcctc tacccaacca gggccccggg gctgttatgt caaactgtct 3781 tggctgtggg gctaggggct ggggcaaata agtctcttcc tccaagtcag tgctctgtgt 3841 gcttcttcca cctcttctcc aaccctgcct tcccagggct ctggcattta gacagccctg 3901 tccttatctg tgactcagcc ccctcattca gtattaacaa aatgagaagc agcaaaacat 3961 gggtctgtgc tgggcccctt ggctcacctc cctgaccatg tcctcacctc tgacttcagg 4021 ccccactgtt cagatcccag gctccctgcc ccatctcaga caccctgtcc agcctgtcca 4081 gcctgacaaa tggcccttgt 
cactgtacac tgtagaaagc aaaaaggcat atctctaccc 4141 cttgatatgc ctgctacctc accaaccagc cccaagcctg tcttcaccca tcactgtcta 4201 cacagccctc tctctctcct aacagaattc tattcctctg aaagtcttca gaaactggac 4261 ctagatagtg ccatgtctgg ggaggaatat ggcaccaggc agtggaaaca aggacagatc 4321 ggtgtgttat ctcacatttg atcagagagc atgatctctc ttaacagacc tgccacccta 4381 atcaacggga gtgctcacac aagtgggagt ctgagagctt agccctatgc ccaccctgg // LOCUS HUMMRP8A 4195 bp DNA PRI 15-JUN-1989 DEFINITION Human migration inhibitory factor-related protein 8 (MRP8) gene, complete cds. ACCESSION M21005 NID g188691 KEYWORDS migration inhibitory factor-related protein. SOURCE Human blood monocyte DNA, clone pUCMRP8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4195) AUTHORS Lagasse,E. and Clerc,R.G. TITLE Cloning and expression of two human genes encoding calcium-binding proteins that are regulated during myeloid differentiation JOURNAL Mol. Cell. Biol. 
8, 2402-2410 (1988) MEDLINE 88302148 FEATURES Location/Qualifiers source 1..4195 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 1501..2542 /note="MRP8 mRNA and introns" intron 1534..2017 /note="MRP8, intron A" CDS join(2041..2181,2332..2472) /partial /note="migration inhibitory factor-related protein 8" /codon_start=1 /db_xref="PID:g386959" /translation="MLTELEKALNSIIDVYHKYSLIKGNFHAVYRDDLKKLLETECPQ YIRKKGADVWFKELDINTDGAVNFQEFLILVIKMGVAAHKKSHEESHKE" exon <2041..2181 /note="migration inhibitory factor-related protein 8" intron 2182..2331 /note="MRP8, intron B" exon 2332..>2472 /note="migration inhibitory factor-related protein 8" /number=3 BASE COUNT 1037 a 950 c 1024 g 1184 t ORIGIN 1 ttccaccttt tggctcttgt aaataatgct gctatgaaca tgaatgtaca aacatctgtt 61 tgaatccctg cattcaattc ttttgcatat atacccagga gcagaatgat ggatcatatg 121 gtaattctgt gtttatttat ttgaggaaca aacttgccgt tttccataac agctgcacta 181 ttttacattc ccactaacag tgcattaggc ttccaattct ctatgccctc accaacactt 241 gttttctggg ttttaaaaga agtagtagtc atccttgtag gtgtcaggtg gtatctcatt 301 gtcgttttgc ttcatgtttt cctaaagatt agtaattttc atatgcttat tgaccatttg 361 tatatcttct tcggagaagt gtctatttga gtctttcccc aattttgatt ggtttgtttg 421 ttttttgttg ttgagttgta gggattcttt tatattctgg atattaatcc cttatcagat 481 atttgtttta caaatatttt ctttgtaaca acagaaacac accacagtct tcaaggttgg 541 aagccagtta atctgagtag cattttgtta gtggtgggga gaggatttgt tcctcctgaa 601 atcctgggga attggccacc tcctcttctc ctcttaggca tgaagcgcgt ctggcttctc 661 caaagaactc ttcccctcca ctacctcaga gttagcttcc tctcttcagc cagtgatcct 721 ggggtcccag acacaataat taaccaagag agggtgaaag gctccctgct gtgtttatgc 781 aatggctcag gcccttgtga agtgccgagg gaccccaagc agcctccatc tcccagggca 841 tggtccatcc ccagctttca cagaacagga aagctgtgga ggagtgtggg cagcagggta 901 ggaatggata tagcccttgg caacaacaca tttccccaca aagcacccac ccaaaagaac 961 aacaacgata gttttagttt ttagtaatga gaacaatagt tctcatgact aaaagccatc 1021 agccaggaca ctgttctcaa cccttttgcg gtctttggac cctttgaaac tctgacagaa 1081 gccatggagg aatgttctca 
ctgagtgcat gcactcaaaa tgatgcattc aacttcaatt 1141 cagtttcagg gatgtatggc ctgaccacca atgcagggga ttagcaatcg caatagtgga 1201 gagggcatgg gagtgggaat ctggctggat caagcaagtg gatgccagca gcccagaaaa 1261 agagcccccc tacctgcttt ttccttcctg ggcactattg cccagcaaat gccttcctct 1321 ttccgcttct cctacctccc cacccaaaat tttcattctg cacagtgatt gccacattca 1381 ctggttgaga aacagagact gtagcaactc tggcagggag aagctgtctc tgatggcctg 1441 aagctgtggg cagctggcca agcctaaccg ctataaaaag gagctgcctc tcagccctgc 1501 atgtctcttg tcagctgtct ttcagaagac ctggtaagtg ggactgtctg ggttggcccc 1561 gcactttggg cttctcttgg ggagggtcag ggaagtggag cagccttcct gagagaggag 1621 agagaaagct cagggaggtc tggagcaaag atactcctgg aggtggggag tgaggcaggg 1681 ataaggaagg agagtatcct ccagcacctt ccagtgggta agggcacatt gtctcctagg 1741 ctggactttt cttgagcaga gggtggggtg gtaaggaaag tctacgggcc cccgtgtgtg 1801 tgcacatgtc tctgtgtgaa tggacccttc cccttcccac acgtgtatcc ctatcatccc 1861 acccttccca ccagaggcca tagccatctg ctggtttggt tatttgagag tgcaggccag 1921 gacaaggcca tcgcttgggg catgaatcct ctgcgtactg ccctggccag atgcaaattc 1981 cctgccatgg gattccccag aaggttctgt ttttcaggtg gggcaagttc cgtgggcatc 2041 atgttgaccg agctggagaa agccttgaac tctatcatcg acgtctacca caagtactcc 2101 ctgataaagg ggaatttcca tgccgtctac agggatgacc tgaagaaatt gctagagacc 2161 gagtgtcctc agtatatcag ggtgaggagg ggctgggtgt ggcgggggct ctctgcctgg 2221 tcctggggct gccctgggcc agcggtcctc cctgccaccc ttcatagatg ctatgcctcg 2281 gctctctctg agatctttaa actctggctt cttcctcctc aatcttgaca gaaaaagggt 2341 gcagacgtct ggttcaaaga gttggatatc aacactgatg gtgcagttaa cttccaggag 2401 ttcctcattc tggtgataaa gatgggcgtg gcagcccaca aaaaaagcca tgaagaaagc 2461 cacaaagagt agctgagtta ctgggcccag aggctgggcc cctggacatg tacctgcaga 2521 ataataaagt catcaatacc tcatgcctct ctcttatgct tttgtggaat gaggttcctc 2581 ggtgtggagg gagggttgga aaacccaaag gaagaaaaag aaatctatgt tatcccaccc 2641 tacctctcac aagcctttcc tgctttaccc ctcacctggc ctctgcccca cattccttca 2701 gcccctcatt tcgagcattg gatttgaggc ttaaggattc aaaaagtcgt catgaatata 2761 gctgatgatt ttatagtggt tctgaaatgg 
gtcggggatt tgggaacagg gtggtagtat 2821 aagaacaact gatactgttc tctaagctaa atcttagctt ccagctacct gtcttagatg 2881 tggctcttgg gaaccttaga gtgatagcta catagaagtg tgtgggtgtg tgtgtgtgtg 2941 tctgtgtgtg tgtgtgtgag agagagacag acagaaagag agcaagagag ggaagggggg 3001 agaggctgat tgtgtgtgtg gtgtgatgta ggtggacaat gttcagagtc ctccattaac 3061 aggataatcc tcacacctgt ccacatacct gtagtttgtc cttggggatt ttgaaaattt 3121 ttcctccctc tccactccca aactcccaac tcaattaaat gataaaggaa taggcaaata 3181 ggaaaataaa ttagtaaaac ttaagtcaaa gaataggtta ttcatacgct gcctatggga 3241 ttctatgctt tgtgatcaga aaattatcta aaaaatactt cccaagggct ggtacaaggg 3301 aggccagaag acgagtggtt cttctctgag gtggacatta aaaaaagaag aaaatgaagg 3361 ggaacctttt gacaagaatg tcaccccaaa ctggattttc atgctgtggt gtggggaatt 3421 ttctgttgtc ctcacttagg tgctggggca gtggtgttag tgatgggtaa aaaggtagga 3481 agctgtcaca gaatcactaa accagggttc ttaacttgtc tgtctataca tctctgaaat 3541 tgggttgaag ttgtgtgcat cattttgagt gacgcactga gaacattcct ccacggcttc 3601 catcgagagt ctcgaaaagg cccaacacct caaaaaggtt aagaacactt gtcctgctta 3661 ctggttttta gtaacaaatg gcagagtatt tctctctgtc tctctctctt tttttttttt 3721 tttttttgag acacagggtc ttgtctgtca cgtggactag agtacaatgg gcatgatcat 3781 gggctcactg tagcctcgaa cacctgggct caagtaatcc tcccacctca gcctctttag 3841 tagctgggac tacagcatga gccactgccc ttggctaatt tttaaattat ttttttgtag 3901 agatggaaac ttgctatgtt gcccaggcta gtctcaaact cctggactca agcgatcctc 3961 ctaccttggc ctcccaaagt gctgagatta cagtgtgatc cacaccacac ctggccaaag 4021 attggagtat ttttattgct attgttgtgc tgggtgggtg ggtgggtgta tgctttgtgg 4081 ggacgtgtgt tgttgccaag ggctaaatca gttcctaccc tgctgcccac agtcctccac 4141 agctttcctg ctctgtgaag ctaaggatac accccgatga taagctgtca acata // LOCUS HUMN79E2 39683 bp DNA PRI 12-FEB-1997 DEFINITION Human cosmid N79E2, complete sequence. ACCESSION U51561 NID g1236711 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 39683) AUTHORS Hawkins,J. TITLE The sequence of H. sapiens cosmid N79E2 JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 39683) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (15-MAR-1996) Robert Waterston REFERENCE 3 (bases 1 to 39683) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (04-OCT-1996) COMMENT Submitted by: Genome Sequencing Center Department of Genetics, Washington University, St. Louis, MO 63110, USA, and e-mail: sapiens@watson.wustl.edu NOTICE: This sequence may not be the entire insert of this clone. It may be shorter because we only sequence overlapping sections once, or longer because we provide a small overlap between neighboring submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest SOURCE INFORMATION: This clone is from the human chromosome 22-specific cosmid library LL22NC03, constructed at the Biomedical Sciences Division, Lawrence Livermore National Laboratory, Livermore, CA 94550 under the auspices of the National Laboratory Gene Library Project sponsored by the US Department of Energy. The source of the flow sorted chromosomes was a human/hamster hybrid containing chromosomes Y, 22 and 9. VECTOR: Lawrist 16. This clone is part of a cosmid contig isolated using YACs from the Sanger Centre chromosome 22 YAC contig described in Collins, J.E. et al Nature 377 Suppl., 367-379. NEIGHBORING SEQUENCE INFORMATION: The entire insert of H_N79E2 is contained in this sequence. The orientation of this clone is unknown. 
FEATURES Location/Qualifiers source 1..39683 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone="N79E2" /clone_lib="LL22NC03" /map="22q12" repeat_region complement(1722..2004) /rpt_family="ALU" repeat_region complement(4523..4809) /rpt_family="ALU" repeat_region 5871..6153 /rpt_family="ALU" gene 10611..28017 /gene="H_N79E2.1" CDS join(10611..10652,27880..28017) /gene="H_N79E2.1" /note="coded for by human cDNA R97618" /codon_start=1 /db_xref="PID:g1592828" /translation="MSPEALILQPKVTEIFERILFIWAIRHPASGYVQGINDLVTPFF VVFICEYIGKISCKH" repeat_region complement(11187..11476) /rpt_family="ALU" repeat_region complement(11541..11619) /rpt_family="L1" repeat_region complement(11970..12162) /rpt_family="L1" repeat_region complement(13390..13451) /rpt_family="L1" repeat_region 14971..15262 /rpt_family="ALU" repeat_region complement(15963..16254) /rpt_family="ALU" repeat_region complement(19850..19896) /rpt_family="L1" repeat_region complement(20283..20878) /rpt_family="ALU" repeat_region 27442..27734 /rpt_family="ALU" repeat_region complement(28371..28660) /rpt_family="ALU" repeat_region 29960..30250 /rpt_family="ALU" repeat_region complement(34792..35444) /rpt_family="L1" repeat_region complement(35854..35886) /rpt_family="L1" repeat_region complement(35905..36183) /rpt_family="ALU" repeat_region complement(36589..36875) /rpt_family="ALU" repeat_region complement(36886..38907) /rpt_family="L1" BASE COUNT 9073 a 8707 c 10372 g 11531 t ORIGIN 1 gatcagcttg gtgggatggt gagggccgag gaggggaagg ggctgttgaa gtaggtgggt 61 agcggtcagg gtgggccgca ctgaggaggt gactttgagc agggacttag aggagcaagg 121 gaggcagcta agcaggtgcc tgggggtgca ttcctgtctg gttcattctg tggagagggc 181 ttacccgttc cctcactgtg tttcccatca gggaatagtg cccaccagac acaggtgatg 241 aagagggaga acagtaagta acaggaagca gtgccgtggg ctcctcccac agccggggag 301 gtggagaccg agagagggtg ctcttgtgga aaggcggctg ctggccctgc acagactgag 361 agcagcggcc agcggctgtc ctcgaaaggg cagccagtca acaccctggg cttggtggcc 421 cctgtggtct gtagcagcta atctctgcca 
ttgtggccca gagcagccac agggaatctg 481 tataaaggaa aacacatagt tgtgttccca taacacttta tttacaaaaa tgggtgggca 541 ggattggctg atggtcagtt tgctgacccc tgatctagat gaaatgggtc ctagcagagg 601 aagagtaagt tcagagaggg gcactgaggc aggagaggtg tggcatttaa ggaacattta 661 agcagcgaag taaaagtggg aggggccatg gtgagtgacg agtctcggcc tcttggtcag 721 agaggcagga agggccaggc ccgcaggagc cgtcagccta ggcagggcct gggcttttat 781 gctgggcatg attagaagca attggaagtg ttttcgggct ggtggtgtta aaatgatgtg 841 attgtcacag agtctctaga gccagcgtgt ggagaaagga ccgtggggtc tttgtccacc 901 cactgatgga cagtcacctg tgagcccggc cagtcttcag cgtctctgta tccagggctg 961 agcagcgggt gggagagtgg ctggggctgg gtgaatgaat gttgttgttg tgtgtatctg 1021 tgggtggctc ttagcatgcg gatttgtccc caggtcttcg gtgacttggg atcacatggc 1081 tctcctgagc agtgtgggac ctcaggttag gagctggggc ttggcttgag tttgccttcg 1141 tgtccatgtg actctgggca cattaaccac gtttcctgag cctgtctcct cgtctgcaga 1201 tagggacagt gacggccttc tgtgattagt gggattaaag cagcactgtg tgaagcacgg 1261 agcgtggtga ggacccagcc cttccttagt tcacctgcgg gtatggcaat tcgggggcaa 1321 agaaaaagaa gtaatcttat gtaattgtta ttgccaagag acggacaggc ttgaaataac 1381 aacaaggtgc agacgggtct ggatgaagcc tgtggctctg agacggttat ctgtgaagct 1441 ggcttgaact tgaagtagaa gcttgtttca ctcaggtctg gctgtcactg cccccccttg 1501 taattttatg ccttcccatc actgtgggca gggctaggta cccaacagaa ggagctctgc 1561 gcctggcaat ccaccagcac atcccagtga aagaggcctg attctggaga tgctcaggtg 1621 ggtcattaaa taatcacccg gaaggcaccg cagaaccacg cctggcactt ctgtgcacac 1681 tcggggctgg ttattctttt tgttactgtt agaatttttt tttttttttt tgaggcggag 1741 tctcactctg tcgcccaggc tggagtgcag tggtgcgatc ttggctcact gcacgctccg 1801 cctcccaggt tcacgccatt ctcctaccac agcttcccga gtagctggga ctacaggcgc 1861 ctgccaccac gcctggctaa ttttttgtgt ttttagtatt ttcaccgtgt tagccaggat 1921 ggtctcaatc tcctgacctc gtgatctgcc cgcctctgcc tcccaaagtg ctgggattac 1981 aggtgtgagc cgccgcgccc ggccagtcac tgttagaatt atgtgaatca atacgctctt 2041 tggaatactg ctatttcctc attttatggg aaaacagagg ccaggtggga ataatggcca 2101 cacctttcct ccatcacgag gactctgagt gtccctctcg 
agtgcatctt ggcagctagg 2161 ctgttctgtg cccctctcca gaatactgta attcaccgag cgattgtttt ctggagagag 2221 tgttctcctc tggggcacct cctcccattg ggttggggat gcgtctgaca cgcgtgtaac 2281 caggcggttg tgacccgatg ccctttctgc cagggctact tggaaatcac gcatcgtgaa 2341 ccttgaaggt atccggtatt cctttgagct aaaagttcac tgtaggaagt ttacttcagg 2401 aagtaattga cattgttagc tgatgtgtgt ttgagaatac tgggcaatta gaaataatca 2461 aaatactcac aaattgggga aaggtgacat agaaatgcgt atgatggaag ccagtgtcac 2521 cttcaaaatg gatgttttag agcacggttg gcaaatgttt tctgtaaagg cctagatagt 2581 aaatatttga gcctgtaggc catgtggtct cctgtagcca ctccactgtg ctgcgatagg 2641 gcaaaagcag ccacacacag tacataaaca gatgggccgt gccctgtcca gcaaaatttt 2701 atttacacaa ataggtggtg ggccgtagtt tgctgattcc tgttgtagaa caggacttaa 2761 caacagtggg aatatggcca tgattttaaa gggaaaggac caatcataaa ctgggctaca 2821 ttaaagttag gaacttctct tcatcaaaga cgtcattaat agagtgaaaa ggcaagtcac 2881 agaaggaggc tgttggcagc agggctcacg tgtggactat ctggatatga acacctgtga 2941 atcagaaagg aggaggcaga tggcctcatg ggaagggact caaacatctg tactcatact 3001 tcacagatgc agacacgcaa tggcatccct ttgtctcctg ggaaattcag atgaaaatga 3061 aatgtagctt aatagatacc actgcaaagc catcagaatg gctaatgtga aaaggttcca 3121 cagtatcaag tgttggcaag gatgtaggac attgggaatt tgcatacact tctgacagga 3181 gtagagtcac tttgaaaatt ctttagcaaa gtctttcgga ggtatgcaaa cctagcaggc 3241 ccactcccag gtatacactt aacagaaatg catctcatgt actgtaaaag ggagcttcaa 3301 gaatctttaa agcaaaaact ggaaacagca caaatgtttg ttagcagaat ggatttcata 3361 cacagcagtg ttatgcgaca gtggaggaag gaactactgt tacgcgttat aacatgaatt 3421 aattttcaag tgtaacactg aatgaaagca gccagacaga aaaaatgcca cttaggaggg 3481 gaggaggtaa ggggcggtga ccgagaggcc tgctaagacg gtgatggtct ggctttgacc 3541 tgggtggtat tgacggatgt gttcagtttg tgataatgca cacgattttg gggtacacct 3601 ttatctaagt atgctatact tcaataagca tttattaaaa agaagtttca gtggtgtgat 3661 cctaatgtgt taacatgagt gtgcctgagt gtaatttata atatggaata gatgacagtg 3721 gaagcttaag gaggatggaa ggaatgccag tgctcacaca gctgaaaaga tggggctaga 3781 gtctggcttc atgctcgagg cctggcggct ctctgagtcc cgctctcttt 
caggtcagct 3841 tttctcagtg gacctgtagc ttgtgattct cccctcccat cagagggtgg ccgccgcaac 3901 tccacactgt gcgcgttcac tttcattcac agcagcagcc aagaaaagcc ctggcttggc 3961 tcctattcct gccttgggcc acctctccac ctccaaactg tcagtcatgg tggctggaag 4021 atggcttagc cgtgtagctt agcccagtta agacctgtcc tgaaagaaga ggcttcccat 4081 ccggaccaca agggttgggc agagggtgac gtggctccca gagcagcttc agggctctgt 4141 tcgagcaagg atgagtagat gtttggtggc taacacagtg atgtctacta cagatgggga 4201 aacactttca aaggctaaaa gtgccatccg ggatgtattt gttcattcag tcactcgttc 4261 atatgtgatt cacccagtga aagcttgttg tgcccgccat ctggttggta ccatgagggc 4321 ctggtcctga gtgaacttcc tgctagatgc tgctggagtg ccaggtgacc agccccacct 4381 cgtgggagaa gtgactactg accaggagcc cagtcagact tcatgggaga agtgaccact 4441 ggagacactt gagggatgag aaagacatgg ccaggcacag cagtaagact gctaggaaaa 4501 aaacaaggag aaaagccttt tctgtttttt tgagacagag ccttgctctg ttgcccaggc 4561 tggagtgcag tggcatgatc ttggctcact gcaccttccc tcccgggttc aagtgattct 4621 cctgcctcag cctcctgagt agttgggatt acaggcatgt gccgccacat ctggctaatt 4681 tttgtatttt tagtagagat ggggtttcac catgttggcc aggctggtct tgaactcctg 4741 acgtcgtgat ctgcctgcct cggcctccca aagtgctggg attacaggtg tgagctaccg 4801 cgtctgtcca aaaatctttt taaaataaga tacaggaaca ctaattataa aagaaaaact 4861 atcaactgga cttcatgaaa ctttaagacc ttatctgttt gagacataaa agcaagccat 4921 agactgggga gaggaatgac agtaaataca aaggaaaaag agatttgtat ctagagtgtg 4981 tacagagctc gtcaaactca gtgagaagac gacacacaac ctgagtaaaa gctgggccaa 5041 agatttgaac agatacttca ccaaagaaga gatgcgatgg ccccacgaag agatgtttgg 5101 tatcattaat cattatggaa acgaaattaa aaccacaccc aactgcatgg ctgatagtta 5161 gattcacagt actaagtgtt ggaaaggctg tgtagcatct ggaattctca tacattgctt 5221 gtcggaatgt aaagtggagt aatttcttta gaaaaagagg tgacatctta gaaaactaag 5281 catatcccaa tgttatggcc cagtaactcg actcctaggt atttaaccat gaaaacccaa 5341 agcgtatgcc cacgaaaatc ctgtgcagga atataacaaa ccccaaaccg gaaacaaccg 5401 aaatgctcat cgacaggcga atctttaaca aactggtgga tccgttcaac ggacacagct 5461 agaaaaagga aatcacagct gattcacgta acttggatcc gtctgggaag gtgctgtgct 
5521 gctgagtgaa ggaaaagctt gcacgatgtg taggtcctga gtgactccat ttctggggag 5581 tcctagggca gcctcagcca tgcacagtga ggaggtgagc acctccccgg ccagggctgg 5641 gatggactgc agaggatact gggaagtata caggggcggg gggatagaag tgtgtctgta 5701 tcttccttgg agcaggagtt acatagctgg atgcccttgt caaaagccac caaactgatt 5761 gcttaaaatg ggtgcatctt attttataaa ttagaattgg ccatttatat ctgggggttt 5821 tgcatctgca gattcaacca actgtgcttc gaaaatattt gggaaaaaaa ggccggcacg 5881 gtggcccacg cctgtaatcc cagcactttg ggaggctgag gcgggtggat cacgaggtca 5941 agagattgag accatcctgg ccaacatagt gaaacccgta ctaaaaatac aaaatttagc 6001 tgggcgtggt ggcgggtgcc tgtaatccca gctactcggg aggctgaggc aggagatcac 6061 ttgaacccgt gaggtggagt ttgcagtgac ccgagatcat gccactgcac tccagcctgg 6121 gcgacagagt gagactctgt ctcaaaataa atgaataaat aaataaatgg taacagtgca 6181 acaaaagata acataaataa aaaacaacac agtataacaa ttacttacat agcatttacc 6241 ttgtatcagg tgtcataagt catctagaga ttatttaaag tattcaagca gatgtgcata 6301 agttacatgc aaacactatg gcattttata taagggtctt gagtagctaa gaattttggt 6361 atcccccata gatacttagg gacaactata tattgtcaat aatgtcaagt taaaaagtag 6421 aaacaaggca acagaaacca tgaaatcctg ggcagtctgg cccaggaatc aggctcttcg 6481 tgacctcctt gtgctgcctg gccctttctg acttcaagtc ccctttatgt actgagactg 6541 cagctgccat gggcccatga gtctttgaag atagcacatt tgcataagtg agagtacgta 6601 cacacgtatg tgtgtatatg catgcgtgta tatgtgtgtg tgtgcgtggg gtgtgtgtgc 6661 gtggttgctg ttagaagaca gttcagactt tggaatcctg tctggtatat gaggcctcta 6721 gagaagcgct tgtttcatct cgcagccctg ttgtagaaga gccttgggcc cacctgctgc 6781 agggcctgtg ggactgttgt ccaggtgagt ggggagggcc tgcaccggga ggctgcggac 6841 gcaagcccag gccccttcct gagtgctgtc gggagtggtg cttgcattgt gttcagctgg 6901 ctgacctcct ctgtgtgtgt ttcttttttc ccacaatttg tgctcctctc tgtgtcagat 6961 ttcagaatgc caagcttagg tatgatgcaa gtctttttat ttcacgctta ataataggga 7021 ctccatgaat tagctataac catgtatatt ttttgtttat ttctcaccca gggttacctt 7081 cccgccaatg tagaccggag accagccact ctccagagaa aacaaaaaga atattttgca 7141 tttattgagc actattacga ttctaggaac gacgaagttc accaggacac atacaggcag 7201 
gtgggaatcc tttctttttt tcgtatgttg cctgatgtac tttgctttgc tgtgatttat 7261 aggaggaaat gttttatcaa aaccatgtgg agttggtcag agttaaagat gcaggtccct 7321 gattagctca cagattttta ttttctaata aataaatttt aaaacaaata atggcaaatt 7381 tcctctgcta gtactatctt gcttgtcttc agttccctct acaaaagtaa aaagagtatt 7441 ttaaattgag gttacaaaat aaatgccttg gattcctgag accggccttg gggtgatggt 7501 gggagtttgt atcacagtga gctgctgtct ggtaataaaa ggccattcct aaaggcccag 7561 gcccaaccac ttagctgatg gcatcaggac agagaggttg cctggggttc tctggcgctg 7621 ggctcttcgg tctaggtagg gatgtgatac cgacccagga aaggtgtgtt catcagtttg 7681 gttttgcagg ggacacactt cagatcatag cagaaggtgt agccaagtta ccattttgtg 7741 atttagttta atataaagtg tgtctaatta tggtttctta aatagtcagt tgacagatga 7801 tgtacataat gggaaaatac ttaaaattga tttatgattc acagtacatc cagaatttgc 7861 tgttagaata gctctttgag gactttataa ttctcactgt ctttgtaata tgtgaaaggt 7921 caaaatggtg gttcttaatt ggtattattg tcttagctat ctgaaaccag cttctgggtt 7981 tcctgaggca cggcaggaat gtttccctct gcaaaccctg cttctaggtg atggagcttt 8041 gccacatttt ctctttcagt tgtgggatag ccacatctct ttgttagttg gttttatctg 8101 taaaaaaaaa aaaaagtcca aatatatgcc ttacctctgt tacataacaa agattcttta 8161 agttaaatgc cttcctggac agcagcttca aagacggatg ggacacaggg ctgcaggagg 8221 aaaggtgact tgcttaggag tgaggctggg gcaaagcagc catctccatt tccttccagg 8281 ccggagctgc ccctggcccg tgttgccttc tgtctctcat cagggcccct ggaccgcact 8341 ggaccgccac ccagttactc tgggaaactg atttgataca cagattactt tggatttgct 8401 taaggaaata ttagcttacc tggcctcttt tgccactcca ctggttttag gaaattgata 8461 aatttggcgt tgttttgaac ctatttttta atgcatttta atttcagaaa agatttccaa 8521 aggatgacac tagcatctat ctgcgagttt taccaagtgg aattctgtta tgagtgttgg 8581 aagagaaagt taattacgaa caggttgcat acatctttat ttagttactg tgataggtaa 8641 taaaaaggtt ccagattatt gtgaattccc agactatttc tacattaata tagatgagac 8701 acgcctgtgg caacgcggta gtttttcctg tctgcctgat gctttgaggc ctcttagaac 8761 tgcgccttct gcgttctcgc catcgccatt tcaggcagag ctgggcccgg tgggggtttg 8821 gagaccttgg tttctccttt acatccactt gggcgtttga cctgtggact gagcccgacc 8881 agccgtctga 
cccgcagtgg gcaatgccac ccctgcacac tggtggaggc agcaggagtg 8941 gggtggcctc ggcaggtttc ccgcttgctt tccatccttt cttcatttga tttttatgtg 9001 ctcactcttc tgctgaagat gctgtgtcaa agccacatgg agagccacat ggagtcagag 9061 ttaaacgttc atctggtctg tcttttcctc cctcagtcag ctctcttgtg gcctttttct 9121 gatttgccca gttctgtgta tgatgggagg caggcgggat ggatggaggg gtgcacagag 9181 ccccacctgc ccagccttgt ggaggaccgc ctctctaatg ttggagcctc tcctgacaat 9241 ggggatgatg gcaggggttg ggtctggtgg atgctgcctc gtctgctcaa taaatgtgtg 9301 acaggagctc ctatgtaagc gggctttgct ggccctggtc atgcagaggt ggctcagacc 9361 tggttcctgc cctgcaggag cttatgctgc tcatgctggg gacagatgtg tgtccacacg 9421 agtgtaactg tatgggagag caattaggcc gccaggacac cctgatgaga gacctgggcc 9481 cttgtccctg ctggcacctg ggctcctgca tggggccagc ttcagtcctt tggccctgcc 9541 tgtctggata gagtagtgct gcttctgcct cctcctcttc ctcctcctcc tcggcagcac 9601 tgctgctggc gggcctcctt tggcacgcat cacagatgtg tgtgtgtgcc tcagcttact 9661 ccatttggct gttgactcac aatcttgatg ttcaccagta acgttttgag atacttggag 9721 caggcatttt gtagacaagg aaactgaggc ctggccacag tgtgataagt ggcagagtga 9781 tgatgtccca gcctcctggg tgttagagtc ctctgtggag atgtgcaagg atggggcgaa 9841 atgaaatgca attgcagcat ctcccttggc attaaaatgt gcctgtcttc cctcagtcca 9901 ttggtgccgt tgtcctggat aaacgggagc gtgggctagt gtgagatgtt cacgtgggat 9961 gtcgggagcc tgcctgtgat gccaggccaa gcccacaaca ctcttgtttg gttgcttctt 10021 gaaccacaag tgtcctcctc tgcagtgttg gggactttat tatgactctg atgtcagacg 10081 gtggccaagc aggaaccagc tgacactaac agaggctttc aaaggcttcc tttgagaaat 10141 gttctagatt cccccaggac ttcataaagt ggagaaaaat gtggcctctg tccaggaaga 10201 agctgtcaca gctgaaaact tgtaaactat tgatgtactt gaattgctgt ctgtgtcaga 10261 ggctgaaaat cgatgtttac taattgctat ggctgatgtt actaagcagc cgaggcctgc 10321 tctgtttcta atcatgcagc aacggctgct ggatagggac agtgtcaagc aagcctgtcg 10381 cctaggcctg ggcgagctgt tcagagagaa caggctttgg atacctcctt caggggagaa 10441 ggtggtaaca tggagaacga gaatcctcaa ggagagagcg gggtagaggc cggggaagga 10501 cttacctcag tattgctgtt ttaatcatat cgctcgtaat gatgtttgtt gtgacgtaac 10561 ataaatgtcc 
gtttctcctg cttctccaga tccacataga catccctcgc atgagccctg 10621 aagcgttgat cctgcagccc aaggtgacgg aggtaagaag ctcttgccgt ggggagttcc 10681 cctcgttggg agtggctgat gcccactgtg ctaaccagac agtgggcgca gccggaggtg 10741 cttcacccag aagcaaggtt gctgtggtga gctgccggtg catctagccc ttgtggacac 10801 agttgctgac aggacttcta tttgttcttt aaatattttt actgtataat attgacacac 10861 agcgaagaat gtatgtaata catatgtgtg gatgtatgca ctcatgttcc accaccagat 10921 tgaggaaaca gaacccccca gtgctgccca gctacaaaca tggccttgac ctcaggcctg 10981 acccagagca ccaaggccgc ttgtgtatcc cccacccctg tgcaaccccc ttcctggatg 11041 acgctttctt gccttttatg tttacgattc tttgtgactt tcattcatgg gacagtaggg 11101 acacaatgta ttcatttaat ttgggtcttt tggagctgca taactctttg tgtagtctgg 11161 tctctgcctc ttgttttttt ttcctttatt tatttagaga caagagtctt gctctgtcgc 11221 ccaggctgga gtacagtggt atgatctcgg ctcactgcca cctccacctc ctgggttcgg 11281 gtgattcttt tgcctcagcc tccctgggct gggattacag gcacccacca ccacgcccgg 11341 ctaatttttg tattttctgt agagataggg atttgccttg ttggccaggc tggtcttgaa 11401 ctcctgacct caagtgatct gcccgccttg gcctcccaaa atgctgggat tacaggtgtg 11461 agccacggcg cccggcgtct ctcattgttt tctaagattc agctaatctg ttgtatctag 11521 ttacataata gtttgttatt taaatattcc tcactctcct tcgtcttctt cctgttggac 11581 atttgggttg tttctgatgt tttgctgtta ggaagggtgt tatctggagc cttcacgtac 11641 atttatgcag gtgctcacag gcaagatttc tctaaggctt ttacaaagga gtacaattat 11701 tgcatcattg ggtgtggtgt gtgcctgttg taccttcaag ataatgtcca caagcttccc 11761 agacggccgc taccaaataa actgtcctct gcagcgtgtg agtgttccct ccgcacgccg 11821 cacatccttg ccagttcttc atgctgtggg acttcctgtt tttgttgctc tggtggatat 11881 tcctgttgca gtcctcacgt ttgtttctct ggttactcac ggagttaagc atctctttgt 11941 atatttacgg atgtcttctg tctttccccc ttctgcgaaa tgttcctcct gccttttgcc 12001 catttttctt ttggtttgtc gattcgctcc tttattagtt gttgatcttt acatggtaag 12061 aattctgatt ctttgtcagt ttgtatgttg taaagatctt ctgccctctg cagcttttct 12121 cttccttttt tcttggtgga tcttgatgat cagaagttgt tagctctagt gtggttgaac 12181 ttaacagtct tttaaggtga gcactgatgc atgttatcta aagaactttt ttctacacat 
12241 gaaggtgttc acccagattt tcatccagaa gttttgcttt tcacgcctct gtgttcagtt 12301 catctgcagt tgatggccat gtgtgatggg aggcaggagt ctgtttaatt tcctcctggg 12361 acagtggcag tccccatact gtttctgggt ccagctcatc tcccatattt gttatgtcac 12421 tcctgttttc tgtcactgcc acattcatat agatctgtgt tgtgttgtgt tgtttttttg 12481 tcgcagtatt tacgtagtgt tcccttcata tcgcaccccg caagtaattt agcttcatga 12541 cgtcctccta cctggtgggg gtgcacaccg cccactggtt tcctcttcac cgtggactgg 12601 ccagtcccac cccttgctct tccacgtacg tctcagagtc agtttgccaa ggtctttcac 12661 ttgttttttg ttttttgggt ctgtttggtt tatgactgga attgcatgga atctttctat 12721 tccatggaaa gttaggggtc agaaaagaca gtttttagtt tgggagaatg gatagcttta 12781 tgaaattaaa tattcatttt tgtgaatgtg ctagttcctc ttttaaaagt catttttaat 12841 gcttttcagt gaaggttttg aatttcccct atgaagcgtc tcgtacatgt gttaccggtg 12901 gagggtgtcc aggttcttgg tgctttgaac aaagaattgg acaaaatgca caaacaaagc 12961 taggaaagaa tgaagcagca aaagcagaca tttattgaaa atgaaagcac actccacagg 13021 tcgggagtgg acctgagcaa gcagctcaag ggcctggtaa ctggattttc tggggtttaa 13081 ataccctcta gaggtttccc atcggttact tagtgtacac cctatgtaaa tgaagtagtg 13141 gcccgctatc agtctgattg gttgcagaaa gcgaccaatg agaggctgaa gtgaagcacc 13201 ctatgcaaat gtctgattgg ttgcagaaag tgaccaatca gaggctgaag tcacaaaatt 13261 atattcctat gcagatgaag acttggcctg ggaccaggct gctaggttgc tggaggggac 13321 caatcagagg tactttcagt ttttcatctg ccacgcagaa gaggggtgga ttgtgaaggc 13381 agtagcctct ggtccttttg ttacctgggc atggaaagtt gttttttttt gttttgtttc 13441 gttttgtttt tttttgtgtt tttttttttt tttttagttc taggaagcta gtgtgaattg 13501 gccttaggtt acctgcctcc agaccctatt ctcctgcctg acgtgtttta ttagatttat 13561 tcttagttat cacatgtttt tcttgttata gtaaatgtta tctctttaaa gctgaaattt 13621 ttattatttg agatattttg ttttgcatgt tgattgcttc agtaaaggca cgtttaaact 13681 ttgacatttc ttgtctgtgc agtaaatctg ttaaaaaaaa actttaggat tgttaaagca 13741 aaagtattta cagattgtag aatacaaaaa gagaaaatta gtatccaaca gaagtagtca 13801 ttttgagaca cttctctttg attgtccttt ttagtagttg tgcctatcat acctggaaaa 13861 tgagataatt tgttataacc tcctttttac ttacgatttt 
aaaaatggga aatagagcct 13921 gccacaggac ctttggttta ttcttattta ttttttccta ccacttgctg cattctttgt 13981 acagctgacc tctcaggcct cttggtctca gagtgtggac cctgagtcac ggctagtggt 14041 agcaattggc aattggtcct cttgtctctg accaccatgg gccttgagct gttggtttgg 14101 aaagcctccc ctgtcaattg agaaaaatga tgagacaacg ctcaatcatt tttaggaggt 14161 ttatttgcca aagttaagga cacacgcctg ggagacaggt ctatgtcttt ctcccgggat 14221 gattttcagg gctccaaatt taaaggggac agggctggat attgagaagt gcacagtttt 14281 catatatgag ggagacagag aaaaacattc atgcctttgt ctggctcagt gaatctgcat 14341 ttttttttta acctaaaacg acatagacaa atggagcaga gggaaaatgc agggagtctg 14401 catgttacat aagataacct agacacaatg gggcagggaa caatcagata tgcatttgtg 14461 tctggtgggc tggggtgcac ctgtgaacat aagttgtcca tttacactgc catggtgaaa 14521 tttttacaga aacaccttaa aagatcttgc agctcactag gaatttcctt gtgggcaaaa 14581 tatgggggag gcatgtagct tcatcgtgta gccatcttat tttggaacca aaaggggaag 14641 gcaagtttgc atggcccagg tcccagcttg acttttccct ttggctcaat gagtttggga 14701 ttttgaaatt caatttcctt tcactccctc tgcagtgggt cttgctcagg tcctatgtgc 14761 tccccctggg ggtgctgatc ctctccaggt cccagttcag caaggcttgg cagagtggat 14821 caacttggag ggtagatgag cttctgggag agccaggctc ctgcaggctg gcagagtgtg 14881 aagagtgggt aacctgagca cagccttgtg tgtgtgggga aactgaatga tgagcaaggg 14941 gctttaggaa agtattgaaa tgtttgtctt ggccgggcgt ggtggctcac acctgtaatc 15001 ccagcacttc aggaggccaa agcaagcaga tcagctgagg tcaggagttc aagaccagcc 15061 tggccaacat ggtgaaacct tgtctctatg aaaaataaaa atattagcca ggcgtggtgg 15121 ggcgtacctg tagtcccagc tacttgggag gctgagacag gagaattgct tgaactcggg 15181 aggtggaggc tgcagtgagc cgaggtggcg ccactgcact ccagcctggg tgacagagca 15241 atacttcgtc tcaaaaaaaa aaaaaaattt gtcttgtgtt ggccagtaga aaaaattgag 15301 ttatggtttc tggcattgta gaggtgcaga ggttagattt caatacatct gcaagtgatg 15361 gcaaatcccg aatgtgcctt cacaggccag aactcgatta ctatctccag tacgaggtca 15421 ttgtcagcac attgtttgtt cctctatatg cttggctttg acctcatggt ccaaggtggc 15481 tggcctagca ctctttatca catttgcatt ccactagcag ggataataag acgtgggtag 15541 agaatgccat acccccactt 
taaagatgtg tcccagcagc tgctcatagc acttctgctt 15601 gtttcctact ggctgtaact tagcctgtca tccaaagtga catggttgta tacttagctg 15661 tatggatcag ggaaaatgta actttgattt ttttagttgt gtgctcccct aaagtttggg 15721 ctgctaatag gaagaaggat ttctgccgta agcatccctc caaccccctg catctatcga 15781 aacatttgct tatccattcc acaggtgatg gctgggtcca ctgtgatcag tttctacttt 15841 ccttcttcct ccttttctcc ttccctttct ctttctctcc ctctggtcag ctttatcgag 15901 gtctaataaa attcacgtat tttaggttta taattggatg ttttgacagc tgtggttttt 15961 tttttttttt ttgagatgga gcctcgctct gttgcctagg ctgctggagt gcagtggtgc 16021 gatctcggct cactgcaacc tccgcctcct gggttcaagc gattctcttg cctcagactc 16081 ccgagtagct gggattacag gcacccacca ccatgtccag ctaattttta tatttttaat 16141 agagatgggg tttcaccatc ttggtcagac tggtcttgaa ctcctgacct ggagatccac 16201 cctccttggc ctcccaaggt gctgggatta caggcgtgag taactgtgcc cggcttgaca 16261 gctgtgttta taggtgtgta ccagctccct aatatgtgat acagaacatt tccgtcactc 16321 tgcgaggttc cccatgtccc tttgtgtcag tctcctctcc ccacccttgc acttgacaac 16381 tgctgatctt ctttctgtct ctagagttct gcccttttgt agaatttcat ataaatggag 16441 tcctagaaac tgtagacttt agtctgtgtc ttcttttact tagtgtagtg gttttgaggt 16501 ttgtgcatgc tgttgcaagg atcaggagtt cattgccttt tcctgctgag tagtgttccc 16561 gtatggccgt accacagtgt gtctgtctgt gcacccgttg acggacactg gagttctttc 16621 taatttttgc ccattacaaa ataaattttc tctaaacctt catgtactaa tcttattggg 16681 gaaagcaaac atcctgttct cactgctgat tttaaaggga agtcttttaa cgttgcacca 16741 ttgacttgtg attttaatct ttactatgtt aaggaaggtg ccttgctctt agttttaaaa 16801 tcataaatgg ttgtggaggt ttatcaaatc ctttttcctc ccatctgaga tatgtttttg 16861 cttctttgat ctgttaatgt agtgaatttt attagatttt ctagtattga gccaactgta 16921 tattcctgaa ataaatccag attggtcatg atggattttt aaaagtgtat ttcagaagtt 16981 tgttggcatc taacgtgcca ggccacactg caggattaag tttgctaaca tgtgatgtgc 17041 cacgctgcca gacgtggaag ccaggtttct ctttaatttg gaaagtgctt tctgaccatt 17101 gtactgattc ttcagttatg cccggatgtg gaatcctctc agctgcgttg ttttctctgg 17161 taatagtgat gtgatccttg gtgtgaaagt attgtttatt ttcagtactt agttctttgg 17221 
acataaggaa acctctgtgt tttgctaacc tgttaagaca cctgacgtgg gcacaaaacg 17281 tgggctgtaa attaacgtat ttgaatgagg tgaatgttga gtaattcaca caccaaatgt 17341 ggcatttctg ctcatcgcga gagagttcta cccattaatt aggttgtaaa catatagtca 17401 gtagttttgg tatactcagt gtggatttta tgtgatatat ggttgccttt taacccaatt 17461 ctagatttct acagttcgat tttggtttca ttcatcaagt gttataactc acatgggggg 17521 gggatgttcc ttgaatcatg ttgcagaaac ttgagttgta aataatttct tctttctgac 17581 gctacgatgc ttctggattt tgagggctgt acatcccaca ggtcttctca gtagaattct 17641 ctattttgct cctctcccag aaattatctc catggcgctt tgtataccgt gatgtcacat 17701 tgctgatgca tatagaatta tgatgtttta aaatttctca aatcagattg tttataaacg 17761 tgagtacagc aacattttca atttcaatat agttcctttt tagttatgca aaatctctga 17821 gctccctcgt gtgcactgaa ctgctggtgt tctgggcctg ccttggaggc atgcggggtg 17881 tttgttaggt ggacacggtg aagaccgggg tccttagtag gggacctcaa gcccctcctg 17941 ggagccttct gtgggagagt gctgttgagg tttttaggaa gaaccttttc cttccgaatt 18001 agcatgcatg ttctggcgtt cagatttctg aagtttggca agcaagctgg gaatacctct 18061 ggtttttaaa tttcttcagc cctggtggta cccagtagta gcaaccattt ttattattgt 18121 tgagtattaa ctgatttgca gaaaagatac tttgggaaga gtttaaagtt tgtactactt 18181 ttaatgaatt cccactctct aacaaagacg tgaacatcaa gaatgttact agaactttag 18241 tagctaatga tatccagtga gtacttcttg tttgaaccat ttcagattat tctgtgttgt 18301 ccttaaacac agccgcatcc ttagggcgtc cttcctttat tgccctttgc tagataatat 18361 caccacttaa gcgacagctc acttgctgga atgcgagttg ctgtcctggg acagagcccc 18421 gctgtgtccg ctaaaccagc ccttccactg agctgactct tcgcctccac tcagagtccc 18481 gttcaagtgg gcacgttgtt ttcacctttc ctgcgataac tgcaagcaac acacctgcat 18541 ttttgacaca caaagttgta atggttgtga ttcccgaaga acaccagcag acagacaccc 18601 ggtgggcacg tggggacacg aagactcaga tgtctctgtc ctcagcagcg cacagttgaa 18661 gtgagggatg aagacagatg tgcgtgaaac ccagagagat catatacagt tgtcaatgaa 18721 accacagttt gtatcctgcc gactgcattc cagggtggct gaaagaattg gagaagacag 18781 cccgaggagg ggcagcgtat ggggcagtcc actctggagg gtcatgccca attgctgggg 18841 ccgcctcacg tctctgctct gtgatttctt ctatttcctg gcccggttgg 
gccgtggatc 18901 tctccctggt gcacctggag tctctgtgga tctctttcct caggggactc tgcaacccag 18961 cctcacgccc tgcagggggt cacagacctc agcctggtgg gcgttcggga ccgagggctg 19021 agggccgagg gttgaggtga agtgcagtgg cacagccatc ccaggctgca ggcgaagggg 19081 ctccccttcg agaacccaca ggatggtagc agagcgaagt tcagcctgcg ggggcgtctg 19141 tactcgggga agctgtggac caggcatgga gccagcctgt ttgccctccc cgccctgatg 19201 gctgctccgg ctggctaggc ctcttctctg agattcttgg cttagtacct ctggcttttg 19261 ccgcttgaat ttggggacct gtcggctctt cctgtttcgc ttcagtggac ttccacccta 19321 gctctcgcct ctggttgctg ctctgagcgt gagtctagag ggaccagcaa ggtcctgagg 19381 gagttgcgag gaatcgggcc ctgtccaggg gcatggggag tctgagaggc cccctcaatg 19441 ttttgcagca gccaggtctt ccccctgcct gccccagcag ggtaagaagc agcacacagg 19501 agtgtatttc ctctcggctg ggccgctgct gtgagacagc cccagtggac gggggtggaa 19561 aggaatctcc ttgaagagat tcttacctga cttcttttct ggctgagact gtctaaggca 19621 gcagtggagg cactgggtag tggttgggac tgaggggtgg taggaactcc cagcaaaccg 19681 ggaaaagcat cagttgctca gagcttgctg ggctgctgcc gagtgccccg ctgcccgtgg 19741 acgcaccttc aagggaggtg gcatgcttaa tttttaccca gaccccttta agtgtcaccc 19801 tgtagacttt gaaaatgctt atttggaaaa aaatgatatg aaatctgtca tatatatgta 19861 tatatatatg tgtatgtata catatataca tatctaaata aaatctatct atctatctat 19921 ctatctatct atctatctat ctatctatct acctacctac ctacctacct acctacctac 19981 cagtctgaga gcaaaagctt agaactggct caatgcggtt ttttgaaaaa agaataaaaa 20041 aaaaggtacg attatttgat gcagggcttc agagtcccac agcatgtgtc ctttggtgtt 20101 ggggatggga cctctttttc ttataacagg ctttcagtga tgtctatgtg gcgagctcat 20161 tgtggtgaaa atcacgcggc aagataattt cagggtaacc ctcggcaata ccatccgagt 20221 gtgctctcag ccgcgagtgc tttcctggct tgcctgctgg tgaccaggat gaaaaacgaa 20281 tttttttttt ttgagacgga gtcttgcttt gttgtccagg ctggagtgca gtggtgtgat 20341 cttggctctc agcaacctct gcctcccggg ttcaagccat tctcctgcct caggctcccg 20401 agtagctggg attacaggcg cccgccacca cgcccggcta attttgtatt tttagtagag 20461 atggggtttc tccatgttgg tcaggctggt cttgaactcc tgacctcagg tgatctgcct 20521 gcctcggcct cccaaagtgc tgggattaca 
ggcgtgagcc actgcacccg gccgagaaaa 20581 tttttttttt ttttttctga gacagagtct tagtctgtca cccaggctgg agtgcggtgg 20641 cgcaatctca gctcactgcc agctccacct cccgggttcg cgccattctt ctgcctcagc 20701 ctcctgtgta gctgggacta caggcgcctg ccaccatgcc cggctaattt tttgtatttt 20761 tagtagagac ggggtttcac cgtgttagcc gggatggtct cgatctcctg acctcgtgat 20821 ccacctgcct cggcctcccg aagtgctggg attacaggcg tgagccaccg cgcccggctg 20881 agaaaatgtt ttaaagcctc tgtggctttc agagggggac ggtggggact cttaggcacc 20941 acagtacaaa tcaggtgatt tcggaaggag tttttaaaat gatactaaat tgctttggga 21001 ttagaactag tttttgccat ttcaaaacac cagcataact catatgaagt gattcaagac 21061 aaaagataaa aaggaatgag gcagtgaaaa caacagcaag cagcaaggaa agggaagcct 21121 gggcctggag gaggcctctg tcctttctga tggagacagt catcctgtga gttggcttca 21181 tagcttgcca gcatgaagag gagctcaggg tctgtgcctg tcgcctctgc tcagagcctt 21241 tcatgagcaa gttggtaaag cagcagattg atttgggtcc gtgcagggtg tcctgagtag 21301 ccatcccctg ggctagcatc cttagcttcc agccctgcag ccggccccgg ggaggagggc 21361 acggggtggc cgctggaaga gaagcctgcg gtcggttggc gggtggggga caggcggggc 21421 ctctggggac tgagctgagt gggtctctca aggcggacac gcgagggcca gcggtcccag 21481 aagttggggc caagggagcc tctggggcat ggccagggat gtcctgaacc taggggccaa 21541 gcttccccct gccttgctgg tttcccacag gctgttcctc aacctgtggg cgggcaggaa 21601 tcagtggtcc ttctccatcc ttctgggggc atgaccttcc cgtgtcccag acgttctctg 21661 tttatggaac cttgtttcca ttattgtagg tgttggggat tttcctcctg agtctagaca 21721 aggctgatct gtggtagcat tttgaggata ggtttctgtc tgctgtgttg gtgggatggc 21781 tgcccctttt gcagtttcct ggtgggtggt gggaggtcct ggcaggtggg gctcacatta 21841 gcctctcagc cacgtgcggg gcgcctggga cctgcgacct gcgtttggct tagtccaggg 21901 tggagccatc ttggttcttg gcttgccctc agcacagcag tgatgggaga aagggcagct 21961 gcaggaaatt caagcagtta tttacctttt cttccctact gctacccaaa aaaatgtaaa 22021 tcgtatatgt tttcttactt tctaaattaa tagctatcgt tggagggaga taagcagcca 22081 gaaggtgagt ttagatccat aattacatgg agagcaagga gagcctgccc agaataaatg 22141 ttggggaggg gctgcacagc tctgtgtggt ttgtggcagt gatttaattg gcgaggctgg 22201 tgtgcattta 
ggataggtta gctgcctggg cgccgctgtc tacagtggga atggtgggcc 22261 tgaacacagc ttgcaggcag cacatccctc agcaggaaag acgctcaggc ctcctcttag 22321 tagcagagac tctgcgctga tggcaggcgc gggagcactc accatgggtt tcatctggca 22381 gggcctctcc atcccagatc cttggaatag gaaggcttgg aaataggaag gcttttcagg 22441 gagaaaagaa acagctctcc acctacgagc aagtcactaa ggaaaggctg ctcgctccgt 22501 ttctcagcca catgtacttc ccagatagac cctgagctgt ttcactggcc cgtggctgtg 22561 tgtgtgtccc agatggaccc tgaactgtgt gtgtgtgtgt gtgtgtatgt gtgtgtgtgc 22621 tctaggtgca tgtgtgtgtg ttcttttttg tgtatgtgca cttgtgcctc tgtgtgtgta 22681 catgggctct ttcctgcgtg tgtatatgtg cgcatgtgct tttcatgtgt gcgcgtgtgc 22741 tctctgtgca catgagcttt tctctgtgtg tgtagattgg tgcacttaca ctcttctctg 22801 tgtacacggg cttttctctg tgtgcacgct tcgtatgtgt gtactcttct gtgtgcatgt 22861 gctcttctcc acatgtgtgc tcttctgtgt gtgtgtgctc ttcacgtgtg tgcgctcttc 22921 tccgtgcgtg tgctcttctt tgtgtgtgtg tgcgtgtgtt cttctccgtg tgtgcacaca 22981 tgctcttttc tctgcataga aggctcctct cctcctccct cctcagcttt cactaccctc 23041 tcttcccgtc acctctctgc agtctccctt caccaagtgc ttctatcttc cctgtccctt 23101 tcaatggagt tacttctgcc ccttcctctc tgtgcatttt tccactcaga caggtggggg 23161 ccctgaggtc ctgactggtt gacgtttgaa gtgccaggct gcagcattgc ctggtagagg 23221 aaacagatgg gagaggctga tcacaaaccc aggcaggcac ccatggcatt cattatctgt 23281 ccacacagac ccccagaatg agcccaggtc ctgccaatgg cacaaaacct ggagaaggga 23341 ggaaggggag ggctcacatg cagggcatca gggagcagct gcgccatcct cgcctccacc 23401 gtcctcgcct ccaccatcct cgcctccacc atctgcgcct ccaccatcca cacctccatt 23461 tagccacaaa gagcttccca gtcctcctct gtgccggccg tgcaggactg agggcactgc 23521 tgtaaagtgg gtggaggaaa cccttcccag ggacccacct tgaactgggt gagagccaca 23581 ggccagagcc ggggcagtgc ggtggaggag ctgctgcttt gggtagggag gtggcagagg 23641 agcttgtctc tgaggagatg catgtgaact aagacttggg tggaacatcg tgggagccgt 23701 gggaagtttg ggagacgggg attccaggtg gaaagagcag cagtgcaaag gccctgagac 23761 agccgcagag tggctgggcc tggtgagaga gcacccattg ccggcgagga ggccagagag 23821 cagggagcgg ccaggcacac acctgaagcc atgctcagga gtgcattgtc tcgtcagggt 
23881 cttggaaagc cacgggggcc ggatggtgtg acctgacatt tttagcagca acctctgctg 23941 ctcagcggga tgtggagcat gggggtgtta gcaggaagac cagccaggag actgtcactg 24001 cggaccaggc acaaggtgat ggtggtcagg cttgtggcag tggagtcggg ggaaggggct 24061 ggatgccaca cgcatctggg gtgtgctata gtggtgatgg cgtgtggggt ccgcaggtct 24121 ccctgggatt ctctgagggg cggacctcag gagcagtgcc ctaaaagcct gtgcctttag 24181 cggcagagcc aggctttgtg tgcagtgtca cacgcctcca cgggacaccc acttttcacc 24241 gttgcagcct gcttgccacg ttaaaggtca tttttgttca atgcttgagc tggctgtcct 24301 ttatttacat gtcagttgga tgacagcgga catcaagccg agttaaatct gaagaatttg 24361 gactgactga tagctttctt cagtcagtaa gaaattggaa ttctccattt ttcctcaaag 24421 taatacaagc gtgctctttt aaagtacatt actacccagg ctggtgcgga gcgtcccgga 24481 gtctgggtgg gcagaggccc tctgtcctga cttggatgag gggaagaagg taattgctgg 24541 gtctttgggt tgcctgtgaa agggcactca cttggcaggt gcttcagcac tgtttgtttg 24601 ctaactgaag gcttagccac ctcctgccgt gcctgtttgg tggtcctgat ttgccccgta 24661 gtgcatagtg cattccatgt ctctcccgag gctgtgctca ggcagaactc ggtctatctc 24721 aggtacagtt cagagatgcg ggcctcatgg aggctcctgc tgtggaaccg agtcgaggcc 24781 ccggcctgcg cagatctctc ccctgcagca gcctgtttca tggtccacag agcccgggag 24841 cttgttgtgc cgagaggact tgggctcccc tcggcatcct ccctgggtga agttgggtat 24901 gtcctatttg ccaaattggt tttatccagc acttaggctg ctgaggaaag tactgcattc 24961 atgcttatga gtagatatag ccttgagaaa gtgtgtgttt gtgtgtgtgt gtgcgtgcat 25021 gtgtgagaca ggccctctga aagtgagggg ccttgttaaa tggtgggggt gctccactgc 25081 ctaaattctt cctatacaca tacacacaca cacacacact cacacactca ctcactcaca 25141 cccagcactg cagtgcgact gccccctgct ctgggcggtg cctgccttga ctccctctgt 25201 ccagggtgcg tgctcctggg gcgctgcact gcttgttccc tgggctgtag aggatgccgt 25261 gtggcgacag agctgggtcc agtggctgct ggctgccgcg gtcctctgga cacacgtgga 25321 gggatgtcag tggcctcagg caccccgggc cttcttggtt aggaggtgct gggcagagtt 25381 tccccatttc ttcctaactc tgtgtggcca gcttccgtgg agtcaccgcc tctgtgtcat 25441 tccttcttcc ttctgttgct gctgctcctg ttcttgtgtg gcccctgtcc cctctgcccc 25501 ctgctcctga cccttttggg ggctcagagg cccctcagag 
gcttgctggt gagtggacct 25561 aagggctgtt gagctggacc cgtgccccgt gcctgccttc agggacccag cctgcctgtc 25621 ttctagtcac acgccgtccc ctgggcttgc ttcacgccct aggggctcgg cagggctggg 25681 gctgcgcgtt ggtgactcgt ggtgcagcac tgttggctga ggggttgccg ccctgcttct 25741 tcatctccct ctggccccac acggctcctc tggccgctac ttgcagctcg gctaaatagc 25801 attcgcatca tttgctaagc tttggtacat gagaaattct taaatggcac tgtgttgttg 25861 agtgggtttg gtctttgatg ttaaatattc agttctctct agaaagttgt caactgaaaa 25921 tggaaagcgt gtatgaatat atgtgcgcga acatattcct cattccaccc gcgtctctca 25981 ggtgcctgct gtgttaggtg gtggctctgt tactcacctg aacagccgtg tgcactgcat 26041 gctagcgtgg ggtggcgggg ggcctggact ttagggctgg tcatgccact ctgggtagtc 26101 tccttgggtt gagagacttc acctgtcggt cccaggtccc tcctttgtag gtcccgggtt 26161 gatggtgatt ttggggatgg agcctaccaa gcccagtaag atttaggatt gtaggcaggg 26221 taggaaggga tcgactcccc agggtgcaag tgtagttggc tctgcggggc tgagtttgta 26281 tggcagcgat gtaattcatc catctgtcgg agcgtgtgtt cctgaagtct acctcctgtg 26341 ccttgtttgt atggacaggc ctggacagag cagttgggcc tggccttagc tggcaggcag 26401 cccgaaggtc aggctggagc catcagccag tgacccagtg gagaggagtg ggaactagag 26461 ggccacacag tagggagtgg ttggctcttc caaaaagaga gtggggaggg aacaggtggc 26521 agcagagtct tcacggaggg agtggtgctt gaagtgaatc ctgaaagcca ggggagtcga 26581 gacccagggt gcgggagtgg caggctttcc aggcagaggt aggagtacgg gctgatcact 26641 cagggggatc gagacccagg gtgcgggcgt ggcaggcttt ccaggcagcg gtaggagtac 26701 gggctgatga ctggcaggct cagttgggga agaatgctgt agctcagtgc agccacaatg 26761 tgggtgctgg ctggggagac cctggggcag ggaccagact gaggggccag gggtactgca 26821 ctgggggtgg tgggggctct gtccgtggga ggtgaggagc atgcagaggt cttcagccag 26881 gtgtgcacat gtgcacgtgt atggtgtggg tgataggatt gtgttgcacc ttggtataat 26941 gaaaccctcc ttatgagtga agatgggagc gagtaggtgg gaataagagg tggtggagag 27001 agggtccact gaatatattg agagtattgg agggaggagg ccggaatgac tcctgcatgc 27061 ctggcctgcg tgcctgaatg cctgctgaga tttcgtaagt gatggaagat gtttaagttt 27121 gtcttcttga atggtgatac cttttcagag gtttaaagaa atctgtttac tttatgaggg 27181 aaatttagag catccaccca 
cccacctgtc catccactca tctgctcgcc tgctcactca 27241 tccatgcctc cctccctgag gaagtctcca ctgataggtc aggtgctgct gcacggtggc 27301 ccgcggggtg tggaggagca gagcgcttct gctgggagag ccagctcagg gacggactga 27361 gccaggggct gctgaagtgc tgcacattgc ttgtgggaca ataaataatt ttttttaaaa 27421 tcaagtttca attgtcttaa gggccgtgcg tggtggctca ttcctgtaat cccagcactt 27481 tgggaggctg aggtgggtgg atcacttgag gtcaggagtt cacaagcagc ctggccaacg 27541 tggcgaaacc ctgtgtgtac caaaaaatac aaaaattagc caggcatggt agcgcatgcc 27601 tgtagtccca gctacttgga aggctgaggt gggaggattg cttgaacctg gaaggcagag 27661 gctgcagtga gctgagatgg caccactgca ctctagcctg ggcaacagag taagaccctg 27721 tctcaaaaaa aaaaaaaaat tgatatttgt ttatatttgt atcttaatat taggagcaat 27781 gagaaaatac agtgtacatt ttaagtaata gaatgctttc attttaaagt ttctgccatg 27841 tttacttttt gctttaccac ctgttccatt ttctttcaga tttttgaaag gatcttgttc 27901 atatgggcga tccgccaccc agccagtgga tacgttcagg gtataaatga tctcgtcact 27961 cctttctttg tggtcttcat ttgtgaatac ataggtaaga tttcttgcaa acattaaacg 28021 tgaactttag tggacttgct gtgtgttact atgtataatt ataacaatat tgaaaattgc 28081 tactaagtgt aatgttatta atgataggta atattagaga ataatagttt tctcatcaac 28141 aacagctttc caaaatgtct ttaatctaca tgctcaacta ctgaagacat taaattgcag 28201 ttataatgat gaatatcttc tttttaacaa tatttgaaga tagataatat atttggggag 28261 tgtagtttat tctaaaggcc atcacagtaa tggctttaat ggtttcagtc attcaatcga 28321 tcagtcattc agtcaatcag tctgtctgtc tgtctatctg tttatctatc tgtttatttt 28381 gagacggagt gttgctctgc tgcccaggct ggagtatagt ggcacgatat cggctaactg 28441 caacctctgc ctcccgggtt caagtgattc tcctgcctca gcctcctgag tacttgggac 28501 tacgggcatg cgccaccatg cccagctaat ttttgtattt ttagtagaga cgggctttca 28561 ccatgttggc cagggtggtc tcgaactcct gacctcgtga tctgcccgcc ttcgcctccc 28621 aaagtgctgg gattatagac gtgagctacc acgcccagtc agtggcttga atttaaatgg 28681 ataagattaa ggtgatagag tgtcagtcac atattttgct tacaagctga aaatcaaatc 28741 ttctttaatt gcgtttctgt ggcctttcat cacttttcta cttttgctat actagtaaca 28801 tttggtttca tgagagtatt aattacttgt gaaataactg gttgaacagt gttttaggtt 28861 
gggctgtgtt tttatgctgt ttctttttaa gtactgcata tttttccttt atcacagtca 28921 actgtaatgt aacttaatct gttttgaata tgtttttaag ttaaaacctc tgataataaa 28981 agtgaggatc tgctttttaa tgcgtctatg gataaaacat aaataggaaa gaggggactt 29041 tactgtttag tcacagcatt ttaatttaaa cctggttttg tcttcctgca agccttctgg 29101 acttggtgct cgcctgttgt tattacatgg tgtttttagc ctttccaggt tgtggtcggc 29161 ctcagattcc catccttgct gtgatctgga gagatgagcc ttacccgagg acagatgaac 29221 agattatcct cagaagatga ggacatatcc aagtagtgaa ttttactgca gccctctgac 29281 actttgtttc catttgtgat ttcataggag gttgtctcat tgctaagcaa ctaatgagag 29341 ctgcctcggc tcactcccta atcatctctt aagtagggga gaggctactc ctgagggtgc 29401 ttgcttttct gcctcttacc cctcatttga cagataatga aaatgctctt aatgatccta 29461 gaacagtagg acctctgtca gtagacttca gagttgttaa ttgaaaggag aagttcaact 29521 gagatgtgat tagagaattg atattccatg ttaaatagac catttaatta atgtggtaaa 29581 taagtttgtt ttaactactt taccgccaac ttctgttgta agacctaagc gattcttaat 29641 gcatttgcct gattatctgc aagacaggat gaaggggcat gaccacgatc ttaattgtcc 29701 agaagaaaga aagcctggag aaaatggaat ttgtgtccta taagtggtgt cttcagatga 29761 gctcagagac actctccttg ccttcctgcg tccccatcct tcctgcatcg ccgtcacaca 29821 cgcaggcaca tgcacacaca catgcacaca tcagaacaca actacacatt gggtatacgg 29881 tgtcactttt gaacatctgt gtaaaacata ccatttattt ctgtcagtca actattaagt 29941 ggacatataa aaatgtttgg gccaggcgct gtggctcacg cctgtaatcc gagcacttaa 30001 ggaggttgat gcagatgaat cacgaggtca ggagtttgag accatcctgg ctaacatggt 30061 gaaaccccgt ctctactaaa aatacaaaaa attagctggg cgtgctggca tgcacctgta 30121 gttccagcta ctcgggaggc tgaggcagga gaatcgcttg agactgggag gcggaggttg 30181 cagtgagccg agatcgcgcc actgcactcc agcctgggtg acagaatgag actccatctc 30241 aaaaaaaaaa aaaaaaaaga agttcggtcc tggatttcac atgtgattat tgcatttgta 30301 gtccttttta gatctagctt cctaggtttg gggaatgttc attagtacag tccaagtccg 30361 cctttcatta tgttgtaaag aagcgtttaa cttgcgtggt gtgctgtcca gattccttat 30421 ggggctgaag ttgagtcatt cacgcggccc ttcatggagc acctgccctg gtgctgggtg 30481 tggtgcagag cgctgcggtg gggagatgac ttcggttcaa cgcagccagt 
gttgacagag 30541 catctcctgg ccctgtgctg tgggagtggg gcaggagaac aaggaaagcc tgatccctgt 30601 cctcttggag tttgtggttg tagcacggaa gacagatttt agaggagtaa tttaaagtct 30661 taattaaagt aattaacatt cacattatca agaggaagcc tgggctgctg tagggacctg 30721 tccccttata ggggtcaggg gaggcctcca gggtgaggtg atgttttagc catgtctcgc 30781 gggatgagga ggaggctaga ttgaggccgg gcattcagag agggcagcag aagggccggc 30841 ctggaatcca aggtgcagtg acaagtagtg gcattgggag gaagggcagg gcggcttggg 30901 ctgcagtgct gagaaggaga agcttggggt cgagtcccag gggtgtgcga aacccccagc 30961 cactgtgctc aggttcttat ctttgaacat tcattctggc agctgcgtgg atgagattgg 31021 agggtttctg gggaagctca tagaccaggg acccgcaggg tccagcaccg gcagaggcac 31081 gggtgccctc cagagaggct attctggttt gctgaactct tagtttgctg ggtcttcaag 31141 ccggtaaacc tcagatgctc tgcttagtga agatcttcaa gattgactgt tccttcagag 31201 aacagagcct ggtaatggga ttctgtggtt tcgacacctg agcacctgcc agtccttggg 31261 cctcagcgtt caggttatca tgtgaccgct catacacctt tggtgcagct gccagtggca 31321 ctagcccggg tgtggctacc tcctgagagc tgcagctggc ctggttctgg ggatgagctt 31381 gtgttgttac ttggatattc ctcttttctt tgaagtacag gatagcatcc cagttaatca 31441 aaggtttaag aaatccccgg ggtgtgggat ttctccaggg gctttcaggc aacagcagac 31501 agatctgtga tgcctgagtg ggagctgggc tctttctggg gatgtgtgtt cgccatgtgt 31561 ccagagcctg taaagatcca cagatcctgg cctgccatat gctcccactg gtttcacggg 31621 aggtggggaa tcctggagtg tggcattggt cctgagtagg agcagctctt cgggagtggc 31681 tcatcgttct ggctgtcagt gggaggttcc aactctgcac atgctggcac tcagagccct 31741 ctcagtgtta cgttctttga cttggaaaga aggatgtggt gggcttcaga ttcccatcct 31801 tgctgtaatc tggagagatg agctttaccc gaggacagat catcccccga ggatgaggac 31861 atatgcatgt agtgaattgt attgcaaccc ctctgacact ttgtttccag ttatgatttc 31921 ataggaggtt gtctcattgc taagcaacta atgagagctg ctgtggttga ggttgacaca 31981 caggacctgg tctgaaggca gaccagtcga ctaggtgaac tgcttcccag cacagaggag 32041 aatgttcatc ttcagcagga ggctggccag gatgtcagag tcagagaaac agtttgtagt 32101 gggggagcaa tgctcaagct gtgtggaggt gtgctcagag gtgggttagg gttcgatggg 32161 tctccatggg tgccaggaca ctgccggcct 
gagtcaggaa tacatggtat gcatcccagc 32221 agtcctcgtg gagctcacca ctatttgaga agggcagttt ggtcctgctg ttgatggatg 32281 cagagtctgt aaatgcatgt ttagggcatg ctcttgcaca cacacatgct gtgcacacag 32341 gcgtgcaggc acgtgcacgc ggacatacat aaccatacag agacacaaac ccacttgcac 32401 actcctcact agtgctagcc tcaacctctc caagcaaatg gggaacaaat ctgtaatcaa 32461 cctttgtttg agtggcaccc accagtttcg ctcaggccct ggaggtgcct gtcattggag 32521 aaagaggtca agcgatcttg cgtgacctca gaggcagggg cagccgtgtg caaagcatgt 32581 gaagacagtg aagtccagta aatacagcca gttcgacatc accgaagctt ggggagaaaa 32641 ctctacaagt gaaaacgaag tgagagggta taatgtgatg ggagagatcg atcatggaga 32701 agttctctat gccaggaagg tttcgcagct cataaatttc agaggttgaa gccttaggga 32761 gtggggaaga aactagaggc acatagtgat acatttgctg aagcttgaat gtgcccattc 32821 cagatcagag ccaagcgtgc gtgcttgttc ttgcagattg cgtccttgac ccatggggcg 32881 ggcagagtgt ggaggagact ccagttagca cagccattgg agacgacact cagctaggaa 32941 caggcggaag acacactttt tagtctaatc ttttagaatt gagaggaaac agcaaggcac 33001 gtgctacttt ggacttagtt caggccatgg aagaaggcag gtctaggatg agaataacat 33061 agggtccagg gcatcttaga gtttgtaaca gcagaggagt gacacctggt ctagagacac 33121 tgaggggtac aggagcaggc ccagaccaga ggccggtgga ggcaggagac tcgctgacgg 33181 aggactgggc ctgagaggag ggcctggtcc tggatgtcaa ataggctgct gtgagggcac 33241 caaggtggct tggtgtgagt agaggggagc gtgaacgaga aggaaaaatc cagtcacctc 33301 ctagattcct ggccctgtga ggggtgtggt gccgtttacc aggatgagac gtggagtaga 33361 aagagcgggt tgaggaggcg ggatggggtt ctgatttagg atgtgctgag tgtgctctgt 33421 cggcagtgat tgagtagaag agtctgttgg gatccgggcc ttggcaggag atctgcagtg 33481 gaggttagtc tggaaattgt cggcctggaa atggctgctg acagtcgcgg ctgtggatgt 33541 gtgtgtctgg ggagagtgga gtgcagcggg tcacaggatc cagtggctct ctgatgtgtc 33601 aggggtctcc gagctgtgac tgtctgtgac tcgaggagct ggcaatgggg ctgtcccaca 33661 gggaggttta tggcgctctc ccatgccgga gtctgggcgt tgagagtcca ggtggctcct 33721 gctgctgagg gccgcctttg ggatctgtga aggatgcaat gcagctctgg ggcctgaccg 33781 ctggggtcca gatccagacg ctgcccctga agagctgcag gcggccttgg tgcctcagtg 33841 atctctgcag 
tggacacgaa atggagatcg taatggcacc agtcccttgg ttggtgtgaa 33901 cactgagcga ggcaaaaaca gcttagaacc aggtatggaa cgcagtaagt gtccagtatg 33961 cgttgctctt gtgatcatcg ttactgtcac tttctttgtt gctgcctgtg gctttcctct 34021 gcttaaaatg gtggccgcat ctctgctctg tccctgtcct aggtttggag agagggagat 34081 ggtgcagggt gcagggccga ggccaaagca cgcctcagct gagcctatgt gttttcggaa 34141 gctttcctgg aacccacaca gcgatgtcct ccatcagcca cacccttgcc acccaccagc 34201 cacaggggac ctggagaaga tgtgggtgga gacaggaggc aaggaagagg gtctgtgtgt 34261 gtgtgcatgc aggctgccag accagtgtgt gccccgggga gaggtatggt ggtgtggtaa 34321 ggaaatggag aagaagtggt ttcagaggta agacaaaggc caggcctgag gtatcacaga 34381 aatcaaggga ggggacaact tcaggaagga gggagtggtc agcagagctc atagggatgg 34441 agactgaaaa ttggtcaaag catttgatag tggggatctt gctagcgacc tgccaaatct 34501 ggctgaagaa tagtggcaag caaactgtca catcagctcc ctgccctccc tcagaaatgg 34561 gggcctgaat gtttctgtgc ttggggaaga ttgcactcag tggaccacag cctcttgcaa 34621 ctttatttta aataattttt tatgatggaa agtttcaaac acaaagtcaa caaaattgtc 34681 cgaagaaccc gcgtgcccgt catctgactc cagcagtcac cgttctcctt ccttctgtcc 34741 cccacccact cccaacctcc atgcaacttt acttactact gctgtttaaa aacagcttta 34801 ttgagattca gtttagaaac cataacgttt acccacttag agtgtaaagt tgaaagattt 34861 ttagaatatt tagagtagtg taaccatcag cacctttgtt tttagaatat tttcatcatc 34921 cccaaaagaa atcctatact cattggcagt cactccccat ccttcccttt tctcttgtcc 34981 actagcccca ggcagcctct catttgcttt ctgtgtctgt ggatttgctt cttctggaca 35041 tttcacgtga gtggaattac atggtgcgtg gccctttgtg tctggcttct tccacttggc 35101 atcatgcttc caaggttcat ccacgcagcg tgaatcagta cttccttcat tccgtagcca 35161 ggtaacgtct ccgggtacgg acgcagcacg ttgtgtttct ccactcatct gttggtgaac 35221 atttggattg ttttcatttt tcgactaata ttgtttccat ttttgactaa tattttgact 35281 aacaggaata acgttgctat gagtattcat gtacaagtct tcgtgtagat atgtgtttcc 35341 atttctgttg gtatataact gggagaatta ctgggtcaca tggttactct gtttaacctt 35401 ttgaggacct gccggactgc tttcccacgt ggcggcacca ttttccctgc ccatagcatt 35461 gtgggaggag cgtatgtttc ctcttcctgc aggagtgtgt ccacatcctg gtcaacactt 
35521 gtcatcatct ttcactcttt tttttttttt tgtagacttt actgaatctt atattaattt 35581 aaagagattg agttctgcct cttactctca aactgtgata aaaactcaga taagaaggga 35641 tgttgactgt agaaatcatt tgtaaagagg aataaattcc ccggtgctgc tgttttcccc 35701 atagcccagc aacttcagtg ttgacagcct ttgtgcccag tgccaaattc cctgtcataa 35761 gagggaacct ttatggcaac tgtttaaccc ccttaaaatg gatgcacatg ctgatgtttt 35821 cttctaggct taaagctgag tgttaggtat atatgtgttt gtttgtatgt atgtatgtat 35881 gtatgtatgt atgaatgaag gagagagaca gagtgttgct ctattgccta ggctaagtgc 35941 actaacgtta tcatagttca tggcagctgc aacctcccag gcttaagtga tgctcctacc 36001 tcagcctccc aagtagctgg gactacaggc atgcaccacc acgcctgact aattttttat 36061 tttttgtaga ggcagagtct tgctatattg ccctggttgg tctcaaactc ctggactcaa 36121 ggaattcttc ttccttggcc tctcaaagtg ctgggattac aggcacgagc cactgtgcct 36181 agctcaaagc tgggtattct cccattctaa taatcattta gtcatgtcct atgacaatta 36241 taccttaact ttttagaagg gatatatgga aaggtaaaat ttccatatat tccctaaaac 36301 tcatggaggc tgtgattctg tttcattcct gatgtgtgct ctgttttctt ttttcctgtt 36361 gtgcctctcc caggtagatt cgctgctcct cgaatctggg gactgtgtcg catgcgtgtg 36421 tcatgctttg cacagagaag cttcgtggat tagagaaact ctgttgaatt ggatccctac 36481 agaaagtttt tgtgtttata gagcacttta ttagtatttg tttaaacact gaaacaatat 36541 ataattttta tttatttttt gaagttaaca ttatgttatt tattcatttt ttttttttga 36601 gatggtgtct ctgttgctca ggctggagtg cagtcgtgcg atatcggctc actgcaacct 36661 ctgcctcctg ggttcaagcg attctcctgt cttagcctcc tgagtagctg ggattacagg 36721 tgtgtgccac catgcccggc taatttttgt atttttagta gagacagggt tttaccatgt 36781 tggccaggct ggtctcgaac tcctgacctc aagtgatcca cccacctcgg cctcccaaag 36841 tgctgggatt acaggcatga gccactgtgc ccggctacta tgttgttatt attttttaac 36901 ttttatttta ggttcaaggg tacatgtaca ggttttttat ataggtaaat tgtgagtcac 36961 aggggtttgg tgtacagatt atttagttac ccaggtaata agcgtagtac ccgagaggta 37021 gtttctcgat tctgaccctc ctcccgctct ccaccctcag gcaggccctg gtgtctattg 37081 tttgcttctt tgggtccatg tgtactcaat atttagctcc cacttataag tgagaacatg 37141 cagtatttgg ttttctgtgc ctgtattggt tcttttagga 
taatgacctc catctccgtc 37201 catgttgcta caaaggatgt gatcttgttc ttttttatag ctgaatagta ttccatggtg 37261 tatatgtacc acattttctt tatccagtcc accattgatg gacatttagg ttgtttccat 37321 attttttgct attgtaaata gtgctatgat gaacatacac gtgtatgtgt ctttatggta 37381 cagtgattga tattcctttg ggtgtatacc caataatggg attgctgggt tgaacagtag 37441 ttctgcttta agttattaga gaaatcaccg aactgctttc catagtggct gaactattta 37501 cattctcact agcagtgtaa aagcattttc ttttgtctac aacctccaca gcatctgttt 37561 ttttttttga ctttttaata atagtctttc tgactggtat gtgatggtat gcaaatatgg 37621 tttgcaaata ttttctcctg taggttgtat gtttactctg ttgatagttt cttttgctgt 37681 gcagaagctc tttagtgtaa tgaagtccca tttgttaatt tttgtttttg ttgcaattgc 37741 ttttgacatc ttggtcatga aatctttgcc agggtctgtg tctggagtgg tatttcctta 37801 cattatctta caggatgttt acagttttag gttttacatg taagtcttta atctatcttg 37861 agttgatttt ggtataacaa aggggtctag tttcaatctt ctgtgtatgg ctaaccagtt 37921 atcccagcac catttattga atagggagtc ctttctccat tgcttgtttt tgtccacttt 37981 gtcaaagatc agatggctgt aaatgtgtgg ctttatttct gggctctctc ttctgttcca 38041 ttggtctgtg tgtctgtttt tgttattgtg ccatgttacc agtgccatgc tgtttttgtt 38101 agtgtagcct tgaagtatag tttcaagtgt actcacaagt atattcataa gcatgaatat 38161 tgtaatgcct ccagctttgt tcttttttgc tttggattgc tttggctatt caggttcttt 38221 tttgattcct tgtgaatttt aaaatagttt tttttcctaa ttctgcaaaa aatgtcattt 38281 attggtagtt taataggaat agcactgagt ttgtaaattg ctttgggcag tgtggccatt 38341 ttaacaatag tgattcttcc tatccatgag catagaatgg ttttccattt gtatattcat 38401 ctctgatttt tttgagcagt gttttgtaat tctcattgta gagatcttat atgttacctg 38461 gttagctgta tttctaggta ttttattctt tttgtggctg ttgtaaatgg gattgcattc 38521 ttgatttggc tctcagcttg gacattgtta gtgtacagaa atactactgc tttttgtaca 38581 ttgatttttg tattctaaaa ttttactgaa gctatcagat ctaggagctt ttgggcagag 38641 actatggggt tttctaggta tagaatcgta ttgattgcaa acacttttat ttcctctctt 38701 cctattgaat gccttttatt tctttctgtt gcctgattgc tttggcaagg acttccagta 38761 ctctattgaa taggagtttt gagagtgggc atccttgtct tgttctggtt ttcaagggga 38821 gtgcttccag cttttgtcca 
ttcagtgtgc tgttgattgt gggtttgtca taattggctc 38881 ttattatttt gaagtatgtt ccttcagcgt tacagagtaa tgtaaaaaca tagctttaga 38941 aacagtatag agatactggt aaatatagat aagttatgaa ggaagcaatt aaaatacttg 39001 gccatcccac tgttgagaag acttattgaa gcctatctcc tcactgtgca gcctcccctt 39061 tgattaagat cctttcttcc cacacagcca cactgggagg gctggagaag gatctctcta 39121 ctttctctgc tcccaagaga ccagtggcta tctgcatggg gagagtcctg ggctctactt 39181 cagttccttg ggggtggcac catggtatca cctctgtgtg gcaagcagtg tgaatctggg 39241 cccttcccca tctcttgccc cgcccagggc tgggcctcga aggtagcctg ggtctcttct 39301 ctaggttgct agggtcgcag tgtctaggtc tgggctggga ggaagaagga ctcatccagc 39361 ttggagcctg agctgtggtt ccagcccgtc ctttgagcca gggtgcgggg cacttgcccg 39421 ctcagcgctc actcaggggt tactgtcctg tgtatggcca ccaggcagcc gccgtaccac 39481 tgtgcataag cagctgtttt ctgtatttcc tctaagtctt gttttctatg cattgtattt 39541 acatagtcga gtgcatatta tctataacat tttgtttcat gtttttttcc cacatttaac 39601 cttacaatgt aggcttttcc ttgtattaag aaatctttat aaaaatcatt tctaccttct 39661 gtacataatg ctctattgag atc // LOCUS HUMNKG5PRO 6746 bp DNA PRI 07-JAN-1995 DEFINITION Homo sapiens NKG5 gene, complete cds. ACCESSION M85276 NID g189229 KEYWORDS . SOURCE Homo sapiens (tissue library: Human placental in lambda-FIXII) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6746) AUTHORS Houchins,J.P., Kricek,F., Chujor,C.S., Heise,C.P., Yabe,T., McSherry,C. and Bach,F.H. 
TITLE Genomic structure of NKG5, a human NK and T cell-specific activation gene JOURNAL Immunogenetics 37 (2), 102-107 (1993) MEDLINE 93138716 FEATURES Location/Qualifiers source 1..6746 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="Human placental in lambda-FIXII" exon 2082..2261 /gene="NKG5" /number=1 gene join(2210..2261,3111..3214,3749..3847,5296..5467, 6353..6363) /gene="NKG5" CDS join(2210..2261,3111..3214,3749..3847,5296..5467, 6353..6363) /gene="NKG5" /codon_start=1 /product="NKG5 protein" /db_xref="PID:g189230" /translation="MATWALLLLAAMLLGNPGLVFSRLSPEYYDLARAHLRDEEKSCP CLAQEGPQGDLLTKTQELGRDYRTCLTIVQKLKKMVDKPTQRSVSNAATRVCRTGRSR WRDVCRNFMRRYQSRVIQGLVAGETAQQICEDLRLCIPSTGPL" intron 2262..3110 /gene="NKG5" /note="intron 1" exon 3111..3214 /gene="NKG5" /number=2 intron 3215..3748 /gene="NKG5" /note="intron 2" exon 3749..3847 /gene="NKG5" /number=3 intron 3848..5295 /gene="NKG5" /note="intron 3" exon 5296..5467 /gene="NKG5" /number=4 intron 5468..6352 /gene="NKG5" /note="intron 4" exon 6353..6535 /gene="NKG5" /number=5 BASE COUNT 1521 a 1817 c 1806 g 1602 t ORIGIN 1 ctgcagtgtt tggtctcacc aagtttccca caataaagag acatgagtca cctttcaaga 61 ccctttaccc ccaagaatgt ggtcttcaca catgagacca aggtctacaa gtggtcagga 121 gagagggggt ctgctcagat gggggagtag tgcctgagct ggcctcaaga gggttaagtg 181 gccctgcact gaaaacctgg acactgagtt agggtagggc tgggggaaaa cttgggcttt 241 ggagtcgtag ggtctgggtt caaatccaca gaccattccc ttcctagctg tgtgttggtg 301 ggtaattcac tggatctttc tgagtcctgg tttcctcatc tgaggtaaaa cgagtttgcc 361 ggttggtctg agagctgttc taggcatggt ggggagaccc tgacaggcag aggcagccct 421 gctctcaagc agttgattta cagctgggga aacaagacag ccacaaatgc aatacctcaa 481 actcaacttc tcaccagaaa gctccttttc ctaattttca cagccagtcc ctcagcctcc 541 tgggccccaa atactagtaa aacctttgcc tcctctctct tctttctttc ttgtaatcat 601 ataggtacaa agtcctacca attcttcctg aaatatgttt ccttatcaaa aagtcctgca 661 aagccgtgcg tggttgctca tgcctataat cccagcactt tggaggctgg gaggatcgct 721 tgagtccagg agttcgagac cagcctggac 
aacatatgga gacccatctc taccaaaaat 781 tttaaaatca gcaggggtgg tagtggcaag cacctgtggt ctcatctact tgggaggctg 841 aggtgggggg attgttggag cctgggcggt tgaggctgca gtgatctgtg attgcaccac 901 tgcactctag cctgagggac agagcaagaa cttgtatcag aaaaaaaaaa aaaaagtcct 961 gcggtactgg acactgccat tgcctatacg attcccactc cctcatcctc cctagcagga 1021 tatcaatttt gttcgaagtg tcaatgaagg ccaggtgcgg tggctgatgc ctgtaatcct 1081 aacactttgg gaggccgagg caggcggatc acctgaggtc aggagttcaa gaccagcctg 1141 gccaacatgg tgaaaccctg tctctactaa aaacacacaa attagcaggg catggtggcg 1201 tgcacctgta atcccagcta ctcaggaggc tgagacagga gaatcacttg aacccggagt 1261 ggaggttgca atcagccaag atcacaccac tgcacttcag cttgggtgac aagagtgaaa 1321 ctctgtctca aaaaagaaaa acaaaacaaa aacaaacaac aacaacaaaa agcaaagtgt 1381 cagtgaaggt ccagcaaaag actcccttcc tattgccctt tgcagccagg gtcatcatgt 1441 gacacagttc agatcaatga gatggaggct gagggtccct gggaaagatg tttttcctat 1501 acaggtacca cctctttcag cttcactctt tccattttcc acgtgaacag gccttgtagc 1561 ctggaggagc tacagctgcc tttttgagat gctgaggcac cctgtctgaa gaaggccctc 1621 acatcactca acttgactac tgggtgagcc cttggagagg cttcccagcc tctgctcttc 1681 aagccgaagt accacagggg acacgagtcc cagagttaca ggaccccagc tatggttcat 1741 gtgtaaaggg aaccattagg caaccagggg aaatgatgaa gaagatctac atttacaaat 1801 gtggaaagat gttcgtggta tattgttaaa ttaaaaagct gtttaaaaat agtttttggg 1861 tcaagtgaga tgactcactt atacttttag tataagtatg tcccatgcaa tatctggaac 1921 gtacttgtac taaggggttt ctccctccat cggcacatcc caggcatcct ggcagctgct 1981 ggcctccagc aaccccacat tctagttgtg tgggagtggg ttgtggcatg gaccctgtgg 2041 gctaccactg ccctgacctg cttcttcaca cactggtatt tgtatctgtg gtaaacccag 2101 tgacacgggg gagatgacat acaaaaaggg caggacctga gaaagattaa gctgcaggct 2161 ccctgcccat aaaacagggt gtgaaaggca tctcagcggc tgccccacca tggctacctg 2221 ggccctcctg ctccttgcag ccatgctcct gggcaaccca ggtaaggcct tcccctcggg 2281 atcgatcctg atggcccacc cagcctcgca ctctcaggct ggctgaacct ggagcttgga 2341 ctctgtgggc acccaggtgc ccctgcctcc ccccggcctt ctcccccgtc atggaggcct 2401 ggccctcccc tcagagccag gcttagtccg gtgtgctgcc 
cagcctgtca ctggcctggc 2461 caaggaggag agacaggcca gggattctgg tcctaactct actggccaca ctgtgtggcc 2521 tgagaccccc ctttccctcc caagcccctg cctccgcatc tgcgtggtga aggccattgg 2581 cctcatcggt ggatctgcgt ttcctcgggc ctacactgtc taggattgtg cggggctggt 2641 gagagaacaa gatctcttcc gtgttcaagg cagacttcct gccccctgca ccctgctctc 2701 tcccaggcct tgaggtcagt gtgagcccca agggcaagaa cacttctgga agggagagtg 2761 gatttggctg ggccatctgg atggaaggta aaaaaagaaa atcccttgaa aggagattga 2821 gggaagtttc tagacaaacc gacccccaaa tctgtgttgc tgggggaaca gaggagaaga 2881 gagagtctcg ccctcctggc tttctagaag gaacgtgaga acacgtgttt gtgctgagag 2941 tgggtcagag cggctccagg gcaaagcatg tggacaggta tcctggcccc ctgcaaggcc 3001 cagctcctgt cctaggccct ggtcacctcc tggactccca ccagccagga gaacgggctt 3061 tccctctcct tccgcctgcg gaggggaagc tgaagtctgg tcttcctcag gtctggtctt 3121 ctctcgtctg agccctgagt actacgacct ggcaagagcc cacctgcgtg atgaggagaa 3181 atcctgcccg tgcctggccc aggagggccc ccaggtacgt gttggctctc tgctcacctg 3241 ccacagtccc tctcctttcc ctcctccctg gtggctcctg gggtgaggtc tggagctctc 3301 taatggtcag gaggtgggag tggaggctgg gctgtttctg acgatgctgg ttttgttgaa 3361 ttcatgtctg gccaggaggg ctacaggtat ctggcagact cctccaggag gatcctctgg 3421 ggtctcaccc tccaaggagc ctggggctgc agaacccaaa taggcagact cccctgggag 3481 ttcctcaata ggagaggggc aagtgcaggg ctgggaaagt actggggttg tgggaggctg 3541 tttctggggt gtctcagagc ctctaagaca agcaaaaggg tgggtagggg ccaggcagcc 3601 agttcaggcc ttcagtgtat ccacgctctg ggaagagatc acggacattc ctgccggcct 3661 cagaaacaca aagggcccct ttcctgggca ctttcacgcg ctcccagagt gtctgagaga 3721 ccatcataag ggctttcttt cctgacaggg tgacctgttg accaaaacac aggagctggg 3781 ccgtgactac aggacctgtc tgacgatagt ccaaaaactg aagaagatgg tggataagcc 3841 cacccaggtg aggccaaggg gctacagagc ctcctgtctg ctgctcaatg gaggggccag 3901 cctgtgacca ggtcggggat cggggagccc gggggcacct tgcacagtga tcctggggga 3961 gggcttccta gaagggaatc tgtgagtccc cgtgtgtctg tggatgaatt tcagagaact 4021 tgtgaaattg tgactctctg gaactgtgta agtcagacgg cagagtatac atggttttca 4081 tcatgtatcc tcaaagaggg cttgtcccag agaagttagg aatcttcccc 
taaagcccta 4141 acatttgtgt ccaaggcaga gtttgagaag ctagttcccc aagaggcctg ggtcaggact 4201 gataaatccc agatctgcta cttccaagct gcatggcctt gggcaagtca cttccacttt 4261 ctgagcccct gttatcttat ctttgaaatg tgatggataa tagtccctat cttgcaagtt 4321 gtcaaaccct tttttttttt tttccttgag ataggatctt actctgagac ccaggctgga 4381 gtgcactggt gtgatcttgg ctcactgcaa cctctgcctc cctggcccaa gcaattctcc 4441 tgtctaagcc tcctgagtac ctggggctcc aggtgtgcgc caccatgccc agctaatttt 4501 tgtacttttg tagaaacagg gtctcactgt gttgcccagg ctggtctcca acttctgagc 4561 tgaagcaatc cacctgcctt ggcctcccaa agtgtgggat tataggcatg agccactgca 4621 cctggctgct gaagcttttt aaaagagctg agggctggga tgtgcttagc tccacgtcca 4681 gcactgagta aatgcttaac gaatgactgt gttactacca agaattattg tttcactctc 4741 cctccttccc tctcctctgc tgccccaaac tactcagcat cctggcactg caggctcgca 4801 cttagccctg gatacccaga ttcatcctcc tcccctggga tggcatagaa gagactttaa 4861 aaccaaatga gccaagactc caagctctga ccacacctcc cacccccacc agtcttctct 4921 atgcaccccc tctatatctg gagcccccag ccaggttctg gaccaaggta gctacatggc 4981 agagcattta atgtgtgcct ggcagccatg ggcaccattc tccacacaga aggcagggac 5041 aggtgcacaa ggggctgaga ccccagcagg gctaactgtc cttgtctcag gagccctacc 5101 tggccagtct tgggccaggc cttggggact gggagtaggg gctgacccgt ctgtacagtc 5161 tctggcccca tggcaccagg tgccagctcc tcgcacccag tactcccatt gctagggctg 5221 ctggaacctg cagggttggc agagctgggc aggactcacc ctataaccat gtccactgtg 5281 gtgctgctgc tgcagagaag tgtttccaat gctgcgaccc gggtgtgtag gacggggagg 5341 tcacgatggc gcgacgtctg cagaaatttc atgaggaggt atcagtctag agttatccag 5401 ggcctcgtgg ccggagaaac tgcccagcag atctgtgagg acctcaggtt gtgtatacct 5461 tctacaggtg agtgcagagg tgacagcagg gatacctcct gagggttgga gacagcttcc 5521 cccaggatat atcaaagctg cctccttact cccccatctc ccagcatggg aaagtgtgga 5581 gaattgagca gatggacttt agctagaaat gtttgagaaa tactgattag agcttgggct 5641 tcagacacag gtggttgtgg agtaaaatct ggtctccatc tctccctggc tgtgtgacct 5701 taagcaaata acttgacctc tctgagcttc agtttcttca tctgtgaagg agagatagca 5761 atcctgattt ttgagattgg aatgagaatt gaaggaggtc accgtgtgtg tggacctgac 
5821 cctggggaaa tgtcctcaga ctgaggctat tcaaggtcat cagaccctca gtcaaactcc 5881 aacccagccc agcacatggc ccctggggtc gggagctggg gccatatcct cccccacaat 5941 cctgggccct gagatctggg ctagggaacc cttcaggcag gggagcatga ggcctttccc 6001 tccatggctg cccaggctgt gctgagagaa cagatctcgg ctgtaggaaa cggggccaga 6061 aaggggcctc ggtgattggc tctggcagct cagctggcac ttgccaatag ctctgggatt 6121 ttatgctggc agatcggggg tccccaccat ttcctgtcat tggagcttgt ggcttttcta 6181 ttcaaggccc cacagcctgc tcaggctgcc gactggcttc caggatgtgc ctctgggtgt 6241 gttcagtagg gtcaggtggc tctgggacct taagcaagta acattctgag tgcctgcttc 6301 tccttgagga cccaccacat ctgcccacag ctagctgttc tctccgctcc aggtcccctc 6361 tgagccctct caccttgtcc tgtggaagaa gcacaggctc ctgtcctcag atcccgggaa 6421 cgtcagcaac ctctgccggc tcctcgcttc ctcgatccag aatccactct ccagtctccc 6481 tcccctgact ccctctgctg tcctcccctc tcaggagaat aaagtgtcaa gcaagatttt 6541 agccgcagct gcttcttctt tggtggattt gaggggtggg tgtcagtggc atgctggggt 6601 gagctgtgta gtccttcaat aaatgtctgt cgtgtgtccc atacactgtt gtagatgtta 6661 tggatttagt ggtgaacgag acaaccttaa cagcattcac acagttagtc gtgaaatgct 6721 tactgagcac tcaccacagc catgca // LOCUS HUMNUCLEO 10942 bp DNA PRI 07-JAN-1995 DEFINITION Human nucleolin gene, complete cds. ACCESSION M60858 J05584 NID g189305 KEYWORDS nucleolin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10942) AUTHORS Srivastava,M., McBride,O.W., Fleming,P.J., Pollard,H.B. and Burns,A.L. TITLE Genomic organization and chromosomal localization of the human nucleolin gene JOURNAL J. Biol. Chem. 
265 (25), 14922-14931 (1990) MEDLINE 90368666 FEATURES Location/Qualifiers source 1..10942 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q12-qter" exon 1070..1198 /gene="NCL" /note="G00-125-908" /number=1 /product="nucleolin" gene join(1070..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10499) /gene="NCL" mRNA join(1070..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10499) /gene="NCL" /note="G00-125-908" /product="nucleolin" CDS join(1181..1198,2159..2275,3439..3916,4587..4784, 4889..4975,5160..5301,6307..6431,7037..7160,7620..7777, 8292..8415,8652..8785,9279..9405,9792..10006,10140..10216) /gene="NCL" /codon_start=1 /db_xref="GDB:G00-125-908" /product="nucleolin" /db_xref="PID:g189306" /translation="MVKLAKAGKNQGDPKKMAPPPKEVEEDSEDEEMSEDEEDDSSGE EVVIPQKKGKKAAATSAKKVVVSPTKKVAVATPAKKAAVTPGKKAAATPAKKTVTPAK AVTTPGKKGATPGKALVATPGKKGAAIPAKGAKNGKNAKKEDSDEEEDDDSEEDEEDD EDEDEDEDEIEPAAMKAAAAAPASEDEDDEDDEDDEDDDDDEEDDSEEEAMETTPAKG KKAAKVVPVKAKNVAEDEDEEEDDEDEDDDDDEDDEDDDDEDDEEEEEEEEEEPVKEA PGKRKKEMAKQKAAPEAKKQKVEGTEPTTAFNLFVGNLNFNKSAPELKTGISDVFAKN DLAVVDVRIGMTRKFGYVDFESAEDLEKALELTGLKVFGNEIKLEKPKGKDSKKERDA RTLLAKNLPYKVTQDELKEVFEDAAEIRLVSKDGKSKGIAYIEFKTEADAEKTFEEKQ GTEIDGRSISLYYTGEKGQNQDYRGGKNSTWSGESKTLVLSNLSYSATEETLQEVFEK ATFIKVPQNQNGKSKGYAFIEFASFEDAKEALNSCNKREIEGRAIRLELQGPRGSPNA RSQPSKTLFVKGLSEDTTEETLKESFDGSVRARIVTDRETGSSKGFGFVDFNSEEDAK EAMEDGEIDGNKVTLDWAKPKGEGGFGGRGGGRGGFGGRGGGRGGRGGFGGRGRGGFG GRGGFRGGRGGGGDHKPQGKKTKFE" exon 2159..2275 /gene="NCL" /note="G00-125-908" /number=2 /product="nucleolin" exon 3439..3916 /gene="NCL" /note="G00-125-908" /number=3 /product="nucleolin" exon 4587..4784 /gene="NCL" /note="G00-125-908" /number=4 /product="nucleolin" exon 4889..4975 /gene="NCL" /note="G00-125-908" /number=5 /product="nucleolin" exon 5160..5301 /gene="NCL" 
/note="G00-125-908" /number=6 /product="nucleolin" exon 6307..6431 /gene="NCL" /note="G00-125-908" /number=7 /product="nucleolin" exon 7037..7160 /gene="NCL" /note="G00-125-908" /number=8 /product="nucleolin" exon 7620..7777 /gene="NCL" /note="G00-125-908" /number=9 /product="nucleolin" exon 8292..8415 /gene="NCL" /note="G00-125-908" /number=10 /product="nucleolin" exon 8652..8785 /gene="NCL" /note="G00-125-908" /number=11 /product="nucleolin" exon 9279..9405 /gene="NCL" /note="G00-125-908" /number=12 /product="nucleolin" exon 9792..10006 /gene="NCL" /note="G00-125-908" /number=13 /product="nucleolin" exon 10140..10499 /gene="NCL" /note="G00-125-908" /number=14 /product="nucleolin" BASE COUNT 2873 a 2245 c 2723 g 3101 t ORIGIN 1 attctgctgt agacatagag atgatgatca tagctgacta tgatgatgat cccccgcgag 61 cctgaaagag gaaatgctct ggtttgctaa gcccgcgaat cgagtgagac ccacccacaa 121 agctaaccgt ggaagtcact ggcggcctcc ttcgccctgc cagccgggga acccatccgg 181 tggctctcga cctgctcccg ggccatctgg tgacactgac ttcgcagcca ccaccttaat 241 tggcgcattc gacccaaata ataacctggg aacctgtggg cggtctaagg cccggctctg 301 cggtcgccct cccaggcccc tctccctggc cctgtgaggc cagaaagtta cttctccgag 361 gccagttccc catgtctgag aaatatctcc caacttgagg ttctgtgggg taggggaggg 421 ttcgtgactt tctcacagaa aacctcgtac agaccccgcc actgccttta ttaacagctc 481 tcaggagact gcctgcagga ggggggtcgc tccggcccca tgctcgcggg caagcaggga 541 taagctgtgc ctccaaaagg gccaacggga actccgcggt ccctgaactt ccggtgctgg 601 aggactcctc gctccagggc caccaggagc cgcggcgtga gtgcgtgccg gaaccgaggg 661 cggggtctct gaggaactcc aaggctgccc aagcctacgg acccagccac attggcgaac 721 cggagaccgc ccgattccac cacccccgcg ctcccctcac agccggcgcc aaaaacgcca 781 gtcccacgac gcaggccggg acccgcgcgc ccacggccca atcagcgcga ccttgcacaa 841 agcgagcccc gcccccacgg cgccgttgcc agcccctccc cctcccgtgc cgcctcggcc 901 cgcctactcc ccgccccgcg ccgttcacgg ttagaggctc gcgattggct catggggacg 961 gccgcgagct ttggttggtc ggcgcggagt cacgaggcgc cgtcgtcgcc tttccacagg 1021 cgttactggg caggctcagt ctttcgcctc agtctcgagc tctcgctggc ttcgggtgta 1081 
cgtgctccgg gatcttcagc acccgcggcc gccatcgccg tcgcttggct tcttctggac 1141 tcatctgcgc cacttgtccg cttcacactc cgccgccatc atggtgaagc tcgcgaaggt 1201 aaacggcctt gagcgcgacg cagacgtgta ggcctgcttc cgaggggcga gcgcggcgcc 1261 gcggggagga gggcctgcgc gcagtcccgg gcgcgttcta gggcgccatg ctgcgggaag 1321 tctcgcgcga ttagtgggga ggtctcgcgc ttctggctac ttggtggcga ggtgaagagc 1381 ttctgcaggt gctgggggag ggggcgctgg gcctcggggt ggagagatga gaccaaactt 1441 ttgcgacgcg tacgagctgg gactgactct gacgcacgtg cccgggagcg tgcctgccac 1501 gtgggccggc gtaggtctgg aatctccaga gggaccgggt gccttgggcc gggaaatggc 1561 ggtatcggcc ctagtcggag tcccggctgc gctcggatgt ctccgccccg gcctggcaag 1621 ccgatacgtg gtgggccccg gaaggtggct ctgccgcgtg ccttttgcgc tgtgtttcgg 1681 gcaagaggtg gtcctgccag gtacccccac gtggccgcac ccgcctcttt aaggggcggg 1741 gtagtgctgg ggaaaggcat aagcttcatg agaaaataag gtagtatttt taagtgcctt 1801 aatgatcttc accgttaatt tgattcaaat aagggtggta gataaagtac cgggatttgt 1861 agtataaaaa cacggttgtg cttaactaag gtaacgggag gagaaatcat ttcctcaggt 1921 tgacttttta ccttagggca ggttttctgt tggtaaagcc tgggaggaaa aatgtgggcg 1981 gttgagaagt agtccctctt gcattgccat caggagtagt ttctatgtta gttgtggtgt 2041 ttggcactat gagaaatgat ctgagacgga gatgatggcg tatgaacact aatggcaaaa 2101 tatgaatggc ctgaaatgtc gaggtggagg tgtaatgatc tatttgtgtc cattttaggc 2161 aggtaaaaat caaggtgacc ccaagaaaat ggctcctcct ccaaaggagg tagaagaaga 2221 tagtgaagat gaggaaatgt cagaagatga agaagatgat agcagtggag aagaggtaat 2281 tttatccaac ttaatgcaga attatgttaa aactacaaaa tggagagtta agacatgaaa 2341 ttggatatct gtggcaaaaa taagatttta tcaggtatgt cttattgtag tggttgagtg 2401 tttcacaagc tcttcattga catgtcaaga tgtcatttgg ctagtatttg aatgtgagtg 2461 ctaagacgag actgggaatt tcttttacat gttcctctgc agggcttgga gtgtgatttg 2521 ttgtgttaaa tcattacatt tttccagttt caacatgtta gctcaccccc acatgtagag 2581 ctgggcattg tattcagagc tgagaataac cttaccagat tcctttccta tcctccgaat 2641 taaaattaat tggtctccat tccatatata tataactgta tcactactgg ttaagtactc 2701 gggtgtagac tgagggctgc cacctctctt tggtaccatt gaccctcttt agccacctcc 2761 tggcctttta 
tttgcctcca ctataaagac agctgagcac tgaattgtgc tcaggttttc 2821 gttgagaacc tgaatgaaag ttttactctc cacacattgc cttgataaaa ctacgggatt 2881 ttaatgtagc taaatgatga cttttatcaa actaccatgc acactctttg atgtgtgata 2941 gttttgtaag gaatatttat atttagccta ttcatttttt gtctcaggtc ctaagaattg 3001 agcttcactg ggcttggtgg accgcaacca cgagggcccc aatgatttaa taagttaatg 3061 cttggagcct cctatgtgta acgttctgaa taatttacac atagcaattc atgaccttaa 3121 acatgtaagg atgatactat taccattttc agatgagaaa gttggggctt gggaaagtat 3181 gaggtgtaag aattcagagg gtctggttca gaggtatttt cagtgttcaa aagagttcct 3241 tatgtctggg tattcacctt attatagggg ctctgactta agacaacata acagaagcct 3301 ggagttttaa catgtcatat gtgtcatgcg tatgtcttga accagaggca ttgccagagt 3361 ctaacaactc attgggacca tggttatctt tttgggtgtg gggctggact tactggtttg 3421 gttttcattt atctcaaggt cgtcatacct cagaagaaag gcaagaaggc tgctgcaacc 3481 tcagcaaaga aggtggtcgt ttccccaaca aaaaaggttg cagttgccac accagccaag 3541 aaagcagctg tcactccagg caaaaaggca gcagcaacac ctgccaagaa gacagttaca 3601 ccagccaaag cagttaccac acctggcaag aagggagcca caccaggcaa agcattggta 3661 gcaactcctg gtaagaaggg tgctgccatc ccagccaagg gggcaaagaa tggcaagaat 3721 gccaagaagg aagacagtga tgaagaggag gatgatgaca gtgaggagga tgaggaggat 3781 gacgaggacg aggatgagga tgaagatgaa attgaaccag cagcgatgaa agcagcagct 3841 gctgcccctg cctcagagga tgaggacgat gaggatgacg aagatgatga ggatgacgat 3901 gacgatgagg aagatggtaa ggagttgtct tggtagttac tgggcttctg attacaaggt 3961 atcttgagat tctgggatca catattcctt catcgtacaa cctggagatg agattagaat 4021 cttgtgggaa ttctcttggg ttgttgtggt gtgctagact taattaccca tgaatgattt 4081 tgtcctcttg agaaaatttc aatagcacat ctattagtgt tttttataat gtaggatttt 4141 cgtttctaag tgattttttt ttttttttaa atttttttga gatggagctt ttgctgtttc 4201 ccaggcggga gtgcaatggc gcgctatctc ggcgcactgc agcctccatc tcctgggttc 4261 aagcagttct gcctcagcct cccgagtagc gggattacag gtgcccacca ccacacccta 4321 ctaattttgt attttagtag agacgacatt tcaccatgtt ggccaggctg gctctgaact 4381 ttgacctcag gtgatccacc caccttaggc tctcccaaag tgctaggatt acaggtgaga 4441 tatgctgcgc ccggccccaa 
gtgatctatt cttgccatga ctgttaacta aacatggtga 4501 caggattcga ttttctttac attagatttg aaaaccgatg ttggttttgg gagattgctg 4561 caatttttag gtgacttctc tttcagactc tgaagaagaa gctatggaga ctacaccagc 4621 caaaggaaag aaagctgcaa aagttgttcc tgtgaaagcc aagaacgtgg ctgaggatga 4681 agatgaagaa gaggatgatg aggacgagga tgacgacgac gacgaagatg atgaagatga 4741 tgatgatgaa gatgatgagg aggaggaaga agaggaggag gaaggtactt aaattagatt 4801 ctgacatacg acatgagtta tgtttaaagg aggcacttaa gtgtttgtgg ctactgatgt 4861 gtgatacatt gtttgacatc ttgtccagag cctgtcaaag aagcacctgg aaaacgaaag 4921 aaggaaatgg ccaaacagaa agcagctcct gaagccaaga aacagaaagt ggaaggtaac 4981 ttgcagaatt aggggatatg ggggagataa acagcacaaa tgatgaataa caaagggact 5041 taatactgaa accagatgtt acattgtagt gtgctgatgt gctgtgtata gaaattttgc 5101 tttggaaact aactttttac cacactacaa gtagactgag ttgagctttt tttgtgcagg 5161 cacagaaccg actacggctt tcaatctctt tgttggaaac ctaaacttta acaaatctgc 5221 tcctgaatta aaaactggta tcagcgatgt ttttgctaaa aatgatcttg ctgttgtgga 5281 tgtcagaatt ggtatgacta ggtagctgct tcactgcacg ttacataccg tgggtctgtt 5341 aatttttcct tcccctgtta gcacagttac tttagcctgc cactgttaaa catgaatact 5401 gtaaacactt caaggttagc attagtgaac taagttagaa ttaaactgta gatcccctaa 5461 gttgcaattt ccataatcag tcgtaacttg gtatagcaca gaataatttt tagtaatttt 5521 tttgttgttt ttgttatgta ttgagacgga cgctggcttt tgttcaggct ggagtacagt 5581 ggcgcaatct tggctcactg caacctctgc ctcccgggtt caagcgattc tcctgcctaa 5641 cctcccaagt gactgggata cgggtgccac tcaccatgca tggctaattt ttgttttgta 5701 tttagtatcg atttcaccat gttggtcggc tggttttgaa ctcctgacct caagtgatcc 5761 acccacctcg gcctctcgaa gtgctggtac agcgtcacca ccctgccagt aagttttaat 5821 aatttggtgt taggtgggag aatgcttgaa cctgggaggc agaggttgca gtgagccaag 5881 ttcgcgccac tgtactccag cctgggcaac agattgagac accgtctcaa tttaaaataa 5941 tgtttatttt cttggaagta ccttgaaact attagacctg tctagtcatc atagtgaata 6001 cttttatcca gacaggattc tcctgtatta gtgcttatag gtgttctttt gtcagctgct 6061 actgtgaatt cttataagca atttagctcc atgatgaaga cctcaaacgt gaatgtgcat 6121 gtcatatctt catgctgagc cgtgttctgt 
agctgcagtt tgcagagcct tgactttgtt 6181 ttgctatact aggggtgctt tttaaaatgt gatctttgtt tgcaccatca catttgtcta 6241 gatacagatt gtgattttga tttgtgtttt cacctgttgt aattttgccc tcctctccac 6301 ctgaaggaaa tttggttatg tggattttga atctgctgaa gacctggaga aagcgttgga 6361 actcactggt ttgaaagtct ttggcaatga aattaaacta gagaaaccaa aaggaaaaga 6421 cagtaagaaa ggtatgtaag gctttatgag ttatgcaatg aactcaggag ctagactgct 6481 agggaaaatg ctttgtaacc catttccctt tggtttcctc ttattttttt taaatcattt 6541 ttttcctttg gtttcctctt aatgtgggaa ttaaatgagc tacagtgttt acaaggtact 6601 tggcactgct tgtcagtgta taggtaaatt cctgagttag gcaagcaaga gcactcttat 6661 acagaacaag aaccattaca tgcacctaaa ttaagctaag gatctttctt cactgaaact 6721 agttaggtcc ctaattactc cctatataca gtgtaatgtt ttgaattggt acattcactt 6781 tttttgttat gcgcgtctac tctaggttga actccagtgt acctaacaga gagtttgaca 6841 tcaaggctgt gacaacatgg agggaccact tgtgtgttga cactgctata tctccatatt 6901 tagcaccgag ccttgtacat ataggatctc aaattatttg ttgatagagc tatgtgtgtt 6961 tttcccctct ttttgttgtt gccccccacc tttggttttt caggccacag agctcatttt 7021 tgttttttta atctagagcg agatgcgaga acacttttgg ctaaaaatct cccttacaaa 7081 gtcactcagg atgaattgaa agaagtgttt gaagatgctg cggagatcag attagtcagc 7141 aaggatggga aaagtaaagg gtatgttctt ctattgaaat gtaagggttt tattaacatt 7201 aatgcacttc ctgctttata aaagaaatat tggtttgatt tccttaggcg tgtaacttgg 7261 acagtttaac ctgtaagttt gtgcctcagt aacccatctg taccatgggg ataatgtact 7321 catagggtga ttttaaaaga caaagctaat acttacaaag aagcaagttt aatgcctatc 7381 ttacataaat actttgtaag tagtagcagt tctttcagtg aggtgaggtt acatgaaaaa 7441 attccaagta tttgtaaaac tagtgggaag taagagggaa gctcgagttt tgattgaaaa 7501 gtggactaaa caagggcatt ttatgtactc agatctgaag caagttctgt gttgctgagg 7561 taaaagcatt tgtgttaata tggttttaaa aaccatgagt tcttctccct ccattgcagg 7621 attgcttata ttgaatttaa gacagaagct gatgcagaga aaacctttga agaaaagcag 7681 ggaacagaga tcgatgggcg atctatttcc ctgtactata ctggagagaa aggtcaaaat 7741 caagactata gaggtggaaa gaatagcact tggagtggta agaaattagg cttgttccaa 7801 ggttttcaga attggttgag ggaactcttc tagtctttgt 
atttcataag tttataaata 7861 ctttttaatc aaagttactc aaatgtaggt gaagatcaag gacatgatac cccaagtcat 7921 actcttattt ggaatagtaa tttccaatct tgaaatgaga gctctaaatc attttgcatt 7981 ggaatacagt aggcaaatca agcttccttt gtaggcatgt tttatacttt aaatgacttg 8041 accatgtgcg ttttgaactc agatgattct aggaaaacag accagtcatc agcctatgta 8101 agaacaacca gcaggacatt gcaacacgta ctaggtactt aatatgttga gtaacagaaa 8161 tggatttagc ttacgtcatg agtatttgta tataactcaa gcactgaaat tcttagggaa 8221 tagatattac tgttgtgacc gaagctggga cactgtttca gagtcttagg aatgtggctc 8281 tctatttcga ggtgaatcaa aaactctggt tttaagcaac ctctcctaca gtgcaacaga 8341 agaaactctt caggaagtat ttgagaaagc aacttttatc aaagtacccc agaaccaaaa 8401 tggcaaatct aaagggtaag ataatacctt tgtatcatca gttataggcc tatatatgtc 8461 ttagaggtct aaggacgtaa ggtcatgtgt cctgtagaaa aaagctaaat aattttagcc 8521 tagtaaatga gtgtaaaata agtatattta ggtccaacct tgagagaagg gccttggcca 8581 gatcatgtga ccagtggtat agagagcatg tgcctggtaa attactctaa gcattaactg 8641 ttcatcctca ggtatgcatt tatagagttt gcttcattcg aagacgctaa agaagcttta 8701 aattcctgta ataaaaggga aattgagggc agagcaatca ggctggagtt gcaaggaccc 8761 aggggatcac ctaatgccag aagccgtaag ttcacctggt tagggtgctg tggttggggg 8821 tagcactctc ggtgctttgt ttatttttgc acaaattctg tgtttcctgt tcgctactga 8881 gtgaacaata actggatatc gatgactgat tacctgagaa ataattgatg aaatctcaag 8941 aaaattcctc tagatagtca agttctgatc cagctgtcgt caactcagag tagcaagttt 9001 gcccatgatt tcctgcccca tccactgggc cccacctgct tgggttgctt tcccactttc 9061 catagaagac tggggcagga tatcaactat gcaatggcaa ttaaaaaatg taaacccaga 9121 atagccttta ctttaattaa ggactagttg gcttagttgc ttttaactgc tttttcacta 9181 taacaagtat cttggctagt agtcatacta ggcattgtgc aaattcagtg tacgaactgt 9241 gaattcacat aaatcgcaaa tttttttttc cttcccagag ccatccaaaa ctctgtttgt 9301 caaaggcctg tctgaggata ccactgaaga gacattaaag gagtcatttg acggctccgt 9361 tcgggcaagg atagttactg accgggaaac tgggtcctcc aaagggtaag ggaaggaagc 9421 gtgagtgctg cttccacttg aaggggtttt tgttctgtgc agaccttgag tctaatgtgt 9481 cttctcattg agctccttct gtctatcagt ggcagtttat ggattcgcac 
gagaagaaga 9541 gagaattcac agaactagca ttattttacc ttctgtcttt acagaggtat atttagctgt 9601 attgtgagac attctggggt tcaagctgtc acaccagtta gttttccata gagagctact 9661 ctgctgcact ggtatctttt tcccaaataa acaaggctac ttctgtggga tggctcccca 9721 gcatgtacag ttaacttggg acatgtgtag taggtgcttt ttataatggg caatttcatt 9781 tggtgttcta ggtttggttt tgtagacttc aacagtgagg aggatgccaa ggaggccatg 9841 gaagacggtg aaattgatgg aaataaagtt accttggact gggccaaacc taagggtgaa 9901 ggtggcttcg ggggtcgtgg tggaggcaga ggcggctttg gaggacgagg tggtggtaga 9961 ggaggccgag gaggatttgg tggcagaggc cggggaggct ttggaggtaa ggcacgcaga 10021 gataatgaca ccacatagca tgtgctcttc agaccctgtg ccctgtcacg gttcctaatc 10081 actggggagg aggagctttg tacccattct tttaacagtg tcttgccttc ctcctgtagg 10141 gcgaggaggc ttccgaggag gcagaggagg aggaggtgac cacaagccac aaggaaagaa 10201 gacgaagttt gaatagcttc tgtccctctg ctttcccttt tccatttgaa agaaaggact 10261 ctggggtttt tactgttacc tgatcaatga cagagccttc tgaggacatt ccaagacagt 10321 atacagtcct gtggtctcct tggaaatccg tctagttaac atttcaaggg caataccgtg 10381 ttggttttga ctggatattc atataaactt tttaaagagt tgagtgatag agctaaccct 10441 tatctgtaag ttttgaattt atattgtttc atcccatgta caaaaccatt ttttcctaca 10501 aatagtttgg gttttgttgt tgttactttt ttttttgttt ttgttttttt tttttttgcg 10561 ttcgtggggt tgtaaaagaa aagaaagcag aatgttttat catggttttt gcttcaccgc 10621 tttaggacaa attaaaagtc aactctggtg ccagacgtgt tacttcctaa agagtgtttc 10681 ccctggaatc tcactggaga gcatggcaaa gccagctctg ccacttgctt cacccatccc 10741 aatggaaatg gcttagtgcg tgtttccagt atcccagccc taactaactt ggttgaaatg 10801 ctggtgaggg gacctgctcc tgcagccctg gtgctgactt gaaggctgct gcagcttctc 10861 ctacttttag caggtctcga ggattatgtc tgaagaccac tctggaaaga ggtcgaggaa 10921 cagattagtc aggtttccta gg // LOCUS HUMOP18A 8058 bp DNA PRI 07-JAN-1995 DEFINITION Human oncoprotein 18 (Op18) gene, complete cds. ACCESSION M31303 NID g189387 KEYWORDS oncoprotein 18. SOURCE Homo sapiens DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8058) AUTHORS Melhem,R.F., Zhu,X.X., Hailat,N., Strahler,J.R. and Hanash,S.M. TITLE Characterization of the gene for a proliferation-related phosphoprotein (oncoprotein 18) expressed in high amounts in acute leukemia JOURNAL J. Biol. Chem. 266 (27), 17747-17753 (1991) MEDLINE 92011487 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990) University of Michigan, 4451 Kresge I, Box 0510,Ann Arbor, MI] kindly submitted by R.F.Melhem 13-JAN-1990. FEATURES Location/Qualifiers source 1..8058 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukemia" exon 1131..1176 /gene="Op18" /number=1 gene join(1131..1176,2830..2904,3754..3926,5886..6077, 6483..7452) /gene="Op18" mRNA join(1131..1176,2830..2904,3754..3926,5886..6077, 6483..7452) /gene="Op18" exon 2830..2904 /gene="Op18" /number=2 CDS join(2892..2904,3754..3926,5886..6077,6483..6554) /gene="Op18" /codon_start=1 /product="oncoprotein 18" /db_xref="PID:g189388" /translation="MASSDIQVKELEKRASGQAFELILSPRSKESVPEFPLSPPKKKD LSLEEIQKKLEAAEERRKSHEAEVLKQLAEKREHEKEVLQKAIEENNNFSKMAEEKLT HKMEANKENREAQMAAKLERLREKDKHIEEVRKNKESKDPADETEAD" exon 3754..3926 /gene="Op18" /number=3 exon 5886..6077 /gene="Op18" /number=4 exon 6483..7452 /gene="Op18" /number=5 BASE COUNT 1990 a 1724 c 2006 g 2338 t ORIGIN 1 tcaaagcagg tgtcttggtg tattgaccac caaccgagga cacggtcttg gtcagagaat 61 gggaggaggc ggggaccatg ttcatttcat ttattttttt cccctttttg tgagagaaag 121 ggccagttat tctaaggcac ggtcagacca atttcctttg tgcctttgct gaggttctga 181 gctatccagg tcagggagaa cagtgacctt gtggctcctt tttcgatttt taaaaagaac 241 ccactgctgc tctggggagc acgtgctctg agaacaaaag caacacttaa ctgcaatctg 301 tgtagtctag aacggctatt gctcttcctc ccctatcatc cgcagaaggc ttggaagaac 361 agatggagtc tgcatttaag ctgagtggct ttgttagacc aaaggcgatc aaattccaga 421 gccccacgct ttcccgcaca ggtccctggt caccggctcg
ggaacctgag gatgctggca 481 agaacgcaca cagtgaggga agggccacgc ccgcgacctg tgggctgagt ggattagaaa 541 ttacgatgat gtcacaatat gggtgacacg ccggtgtcgg tgtagggtgc cgggggcagg 601 gggccctgca ggggcgtgga gtggcattgt gctgtcacag gggctcgggc gtcttaggca 661 cccatgtggg tggcgcacta gaaggggcac tgctctgtcc gagtgctgcc cttggggcga 721 ggcgggcatg tggctctaca aggtggagtc caggcggcca aagtttggaa aggtagggaa 781 ggaccccccc gccctccgcc tcgtccgccc tgcccttgtt ctcgagaatg gggagctggt 841 tcggacctag tccggggtcc actgccacgc cctcttccac ggcgagacca cccccctagt 901 cccaggccca cacctgggga tgctccccag gcgccctgca acccccggca ttgtcctcct 961 gccctccggg acaggactac acttcccgag gtgcttcggg gtcccggggg gcggcgctcc 1021 acgcgggttg tggggggcgg gggcggcacg tgccgccgct ctcggccaat gcggagcccc 1081 gcgcggaggt cacgtgcctc tgtttggcgc ttttgtgcgc gcccgggtct gttggtgctc 1141 agagtgtggt caggcggctc ggactgagca ggtgggtgcg gggctcggag gaggcggcgg 1201 ctggctgagg ccagcaagag ggacgcggtc ggcgggaggg gctgggccgt ggcagcgacc 1261 ccctgctgca gggcggcggg cggggctgcg ggcctcggag gggttggtgg gcgggggtcg 1321 ctccgctttg tgtggctcgg gcggacgtcg cctttgtccc cgctctccgg gggcgcggct 1381 gttcgtgggc agggggctgg gcgatcaccg ggcggtacgc tccggggtgc cgtccgcaag 1441 gagacaatag ggggccgtgg gcccatcgtt tacctccctc cctccctccc ttccctgcgg 1501 gccccgccgg gttccccatt gtctgaaggg tcggggcggt gccagggacc agcggcttta 1561 ggaccaaact ggcgggtcag cccagggccg cgaccctccc tgcgaccgtg cccactggcg 1621 accgcagctg gtgattgagg gagggcggcg ctcccgggcc ccacgagggt tcttctgtct 1681 tcgcggccgg acgcgcggac agcttgggtg gcggcaggtt gggcatgggg acggcgggag 1741 gcggtggcga gctcaccgcg ggaccacccg ggggcctgtt cccggggcct gccccacccg 1801 ctgaactgtg aagggggtgg tggcggcggc ctggaggtgt ttttggcggg agttgggggg 1861 ggcgtccgcg cagggggagt caggcagggg cggagttacc cggattggac cgttagcccc 1921 gcccacccct ccccttccca cgcgcgcggg ctccggggtg ttgagttcgg ggagattcga 1981 aaaggcgcgg ggaggaaggg ggcggggcca ggggccggag cgcaaggcgt gctctgattg 2041 gccgggggcg accggtcctc ttttcctcgc ccggaccagg ccacgccatc ctgggtccgg 2101 tgctgcgtct aattctctgc ttttcttaaa tcttgtcgct gcctctgatt 
ttaattccta 2161 gcttttggga acctgtcatc ctacgttttt ggtactagct ggcgtctaca aaagtcataa 2221 tgttaaaaag atcaacaaga gatacagcat ttttcatgac ataacggcag caaatataag 2281 tcaaaatcta gaggttcata aacattttgc ttgctgttgg gcaaggaagc ttaaacctga 2341 gggacaatag gagttcaaca ttattggtta ctattagctt gggcgttttc ttatccacca 2401 cgtcagacac agacaaagca ggggtgggta tttcatttgc acaatgagtt gtaggcagta 2461 ttaagatggc tccgggggca ctgttgagtt gaatctggaa tatcttctta cagtttcggt 2521 gaaatgttaa agagtttatg ggggaaaaat tcttcaccct tgtgactttg tctgatttta 2581 aaaatccaag agttttgtga ccgagaaagc tcagttaact tgattttctg gaaccaatat 2641 catattcagg tcatatttcc caatgtttat ttagtagatt ttgataattt ttttcgtggt 2701 taatttagac gtctttattc cacgtatttt tctgacgaat gtatgtagat gtgatgtgag 2761 atttttttgg gttgatgaca tatagaaagg caaagaaagt gattgcatgt ttttgaaaat 2821 cattttcagg actttcctta tcccagttga ttgtgcagaa tacactgcct gtcgcttgtc 2881 ttctattcac catggcttct tctggtaggt aatctatttg gaaaatctga aattgtaatg 2941 ggcttatgat tttagattga gatggctcag gtcttcgcct ttgatttggc acttatgttt 3001 tggtcttacc aaaacctatt ttatgaatag gagaagaatt taaaaatgat tatcacttga 3061 atgtgccgag agctcgtaat tgtttattgg acagtttggc ttagtctgaa gcaaattgtg 3121 gagtttgcac aagtcttttg tttatgaaag cgattgtcag atactgatgt ctcaaaacag 3181 tatttattaa tccaaaaatg ttgagctttg tttttctggg agatggtttt tatttttttt 3241 gagacagggt ctcaccttgt tgcccaggct ggagtgcagt ggcttaatta tagctcattg 3301 cagccttgac ttctggagct caagtgggct caagcgattc tcccacctca gcctccatag 3361 catctgggac tatcgacatg ggccaccaca cccacctaat caaaaaaaat tttttttgta 3421 gagatgggct ttccctttgt tgcccaggct ggtctcaaac ttcagggctc aagggatctt 3481 cccatgttgg cctcccacgg tgctgggatt ataggcatga gccatggtac ctggccttgg 3541 gaaatggtat ttagataata atatcttgcc tgcaaataca tcttccccta gtgtcagtag 3601 actgatagga ataaaaaggg gaaaaaaaac acaactttcc tcatcagccc tagtttaata 3661 cattaaattg atttgggttt tagaaaatta tagtacagtt tattagaaca ggagaatcct 3721 ggttttctga attataaata taatcaattc tagatatcca ggtgaaagaa ctggagaagc 3781 gtgcctcagg ccaggctttt gagctgattc tcagccctcg gtcaaaagaa tctgttccag 
3841 aattccccct ttcccctcca aagaagaagg atctttccct ggaggaaatt cagaagaaat 3901 tagaagctgc agaagaaaga cgcaaggtaa acgaagcaat tcacagaaag caggatatta 3961 atttatgtaa tgggcagatc aattttattt ctataacagg aagaaaacag aattgtagct 4021 acactgtgat tattacatat gccagtgact ggaaggaaat accagtcctc atttattgaa 4081 ctcctgttac atgcccgctc ctttgtttat atttttctcc tttaatacat ggtgttgcct 4141 caaataatgg aaattagaaa cagtttcagg aatgttaagt cgtttttctg cagtcataca 4201 actagtaagt gttggggtca gaattcaagc cctggtctat cttaaaccaa agctcatgct 4261 tctctcatgc ttcctctttg aaaagatttg ttgccagatg attctttggc actttggttt 4321 tgttttttga gagctgtaca ataacatttt aaattgctag tgtgattgtg ttgctcagct 4381 ggtatcatgg tagcttttct cttatctaaa caattctatt ataaagtaac tatctttaaa 4441 agctaatcag aagaatcaat aaatattaat atgctagttg tagaaaattt gggaaataca 4501 gaaatttata aatggaaatt aaaagtattc attatcccgc ttccgagaag caaccagtgt 4561 taacattttg gtgtgtttct ttccattcaa tgttactcat taacaactgt acataacttt 4621 tcatttaact ccctttctca acaaccctga taggattgat tttaaagctg gggcaagtga 4681 ggcacaaaag gtaaggtaat aaccttcccc agaccagcat agtgatttgt agtatacaca 4741 cacaccccgc ccgaagagcc tcagtgctta cactaaggag tcgtcttcta tcgaatagtg 4801 tgagttcatg gagtaggagg aagcaataca acccaaggtt ggacagtgga aagcttttta 4861 gacatcaacc ctggccctgt agtcattagc ctgtgcttta catagtaact ggctaaatat 4921 aatgaaactc ccatcatgac taggatttgg cagaagagaa tcaatagaac cagtgtcaga 4981 tgctctgtgg ttatcctgca agtcagtggt tccaatgtgt tttgagaaca agctgttctg 5041 ttgaaggggt cataccaagg tatggtctgg taattaatgc agtttcctga gacaaaagct 5101 aataagcctt ttccttgaaa caaatttttc tgtcttaaat agtaatctac agacttagtc 5161 ttgaatttcc tatcattgtt ttatcagtta tggttaaaat ttttacaatg agctagtttt 5221 ctttgggtag cttttgaagt taaatagtga aattctttac aataaaagtg ccactcgcaa 5281 gtacatattc ctcaagtcat ccagatacca ttaagcagta aatcttacaa ggatttcctt 5341 aaggactaat tgggtaagat ttctgaacag ataagcactt ttccaaagtt aaatacaaat 5401 actagaaaaa gaatatcatt ttcacagtat tttatgatag ggataattca ggtcctaatt 5461 ttggtgttat ttaaagggac cttattttct gccccttttc aatcccctta gtaaattatt 5521 
tttattgtaa ttttaactta tgctgaatac attttatttt ttgaggcaga gtctctgtca 5581 cccagggtgg agtgcagtgg cacgatctca gttcactgca acctccacct cccgggttca 5641 agcaattctc ctgcctcagc ctcccaagta acttagacta caggcacccg ccaccacgcc 5701 cggctaattt ttgtattttt agtataggta ggatttcacc atgttggcca ggctggtctt 5761 gaactcctga cctcaaatga tccacccacc ttggcctccc aaagtgctgg gattacaggc 5821 gtgagccacc acacccagcc tgaatacatt ttagagtgcc tagcctatta aacttttttt 5881 tccagtccca tgaagctgag gtcttgaagc agctggctga gaaacgagag cacgagaaag 5941 aagtgcttca gaaggcaata gaagagaaca acaacttcag taaaatggca gaagagaaac 6001 tgacccacaa aatggaagct aataaagaga accgagaggc acaaatggct gccaaactgg 6061 aacgtttgcg agagaaggtt ggtttcttac tttgtaaaag ggttgagctt ggagtttgat 6121 gcaccaatga gttggcttga actaagtgct ttgataaaag gtgtttggtg tctttttgtc 6181 atccattttg gggcttaaca tattaaatga aaggtatatt ttaagatgga atattcagta 6241 attcccagca taattgcaca gtccttggaa gtccagtagg cagcttgtta ggttctacaa 6301 gggacccagg agatttgatg atgatgtctc agaacttaaa ttgtgtggtt cccacaggct 6361 gtaatatatg cactgaggtt gtgttgggcc tctttgaggt gggggctggg ggtcgtgact 6421 tgacagggct tttttttttt tttttttttt ttgactgatg acaccttacc cttcctttac 6481 aggataagca cattgaagaa gtgcggaaga acaaagaatc caaagaccct gctgacgaga 6541 ctgaagctga ctaatttgtt ctgagaactg actttctccc catccccttc ctaaatatcc 6601 aaagactgta ctggccagtg tcattttatt ttttccctcc tgacaaatat tttagaagct 6661 aatgtaggac tgtataggta gatccagatc cagactgtaa gatgttgttt taggggctaa 6721 aggggagaaa ctgaaagtgt tttactcttt ttctaaagtg ttggtctttc taatgtagct 6781 atttttcttg ttgcatcttt tctacttcag tacacttggt gtactgggtt aatggctagt 6841 actgtattgg ctctgtgaaa acatatttgt gaaaagagta tgtagtggct tcttttgaac 6901 tgttagatgc tgaatatctg ttcacttttc aatcccaatt ctgtcccaat cttaccagat 6961 gctactggac ttgaatggtt aataaaactg cacagtgctg ttggtggcag tgacttcttt 7021 tgagttaggt taataaatca agccatagag cccctcctgg ttgatacttg ttccagatgg 7081 ggcctttggg gctggtagaa atacccaacg cacaaatgac cgcacgttct ctgccccgtt 7141 tcttgcccca gtgtggtttg cattgtctcc ttccacaatg actgctttgt ttggatgcct 7201 cagcccaggt 
cagctgttac tttctttcag atgtttattt gcaaacaacc attttttgtt 7261 ctgtgtccct tttaaaaggc agattaaaag cacaagcgtg tttctagaga acagttgaga 7321 gagaatctca agattctact tggtggtttg cttgctctac gttacaggtg gggcatgtcc 7381 tcatcctttc ctgccataaa agctatgaca cgagaatcag aatattaata aaactttatg 7441 tactgctgta gcaactcctg tgaaatgact aaagggaacc ttaattattt ctaaagtagc 7501 atttgactcg ggtggttaag gttggcagat acgtcatctt gtatccagag gctgtaatag 7561 tccccattgt cagtgctttg cctctcaatt cagggaacgc ctttggcagc tttcctgtgc 7621 tatttgagaa atagcctaat tatacatatt ttgtgcctct cagacatctg tataaaagct 7681 cttggaagcc aagcacttat agcaaaagat gattgctaat actgcggtgg ggtccatgtg 7741 ttcgtaacgc atgtgcctct gctgcttggg ctgtatcttg agattggggt tttaccaaac 7801 ttattcatga gaacaataac agcctacctc aacttacctc ctaagattgt taggattaaa 7861 tgggtttgtg gcaagagagc acacttagaa catagatatt tagcattccg tataaatcac 7921 gattactgtt ttcaaccaga ggctaatgag tgatctaagt ttatactctt gagttggaaa 7981 ttatgcagtt tttatattcc ttaaattgac ttttaaattt acttaaatga tcaatcagag 8041 gaattctgaa caaagctt // LOCUS HUMP45C17 8549 bp DNA PRI 15-JUN-1989 DEFINITION Human P450XVIIA-1 (steroid 17-alpha-hydroxylase/17,20 lyase) gene, complete cds. ACCESSION M19489 NID g189442 KEYWORDS cytochrome; cytochrome P450; cytochrome p. SOURCE Human DNA, Maniatis library (Lawn, et al.), clones lambda-hM171 and lambda-hM17-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8549) AUTHORS Picado-Leonard,J. and Miller,W.L. TITLE Cloning and sequence of the human gene for P450c17 (steroid 17-alpha-hydroxylase/17,20 lyase): Similarity with the gene for P450c21 JOURNAL DNA 6, 439-448 (1987) MEDLINE 88054468 COMMENT Clean copy of sequence [1] kindly provided by W.L.Miller, 08-DEC-1988. 
FEATURES Location/Qualifiers source 1..8549 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10" repeat_region 1446..1517 /note="Alu repeat" prim_transcript 1779..8437 /note="P450c17 mRNA and introns" exon <1839..2135 /gene="CYP1A1" /note="cytochrome P450c17, (EC 1.14.99.9)" /number=1 gene 1839..2135 /gene="CYP1A1" gene join(1839..2135,3804..3942,4176..4405,5068..5154, 5849..6064,6297..6466,7364..7467,7986..8268) /gene="CYP17" CDS join(1839..2135,3804..3942,4176..4405,5068..5154, 5849..6064,6297..6466,7364..7467,7986..8269) /partial /gene="CYP17" /note="cytochrome P450c17" /codon_start=1 /db_xref="PID:g386992" /translation="MWELVALLLLTLAYLFWPKRRWPGAKYPKSLLSLPLVGSLPFLP RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST " intron 2136..3803 /note="cytochrome P450c17, intron A" repeat_region 2722..2927 /note="Alu repeat" exon 3804..3942 /gene="CYP17" /number=2 intron 3943..4175 /note="cytochrome P450c17, intron B" exon 4176..4405 /gene="CYP17" /number=3 intron 4406..5067 /note="cytochrome P450c17, intron C" exon 5068..5154 /gene="CYP17" /number=4 intron 5155..5848 /note="cytochrome P450c17, intron D" repeat_region 5406..5598 /note="Alu repeat" exon 5849..6064 /gene="CYP17" /number=5 intron 6065..6296 /note="cytochrome P450c17, intron E" exon 6297..6466 /gene="CYP17" /number=6 intron 6467..7363 /note="cytochrome P450c17, intron F" repeat_region 6696..6907 /note="Alu repeat" exon 7364..7467 /gene="CYP17" /number=7 intron 7468..7985 /note="cytochrome P450c17, intron G" exon 7986..>8269 /note="cytochrome P450c17" /number=8 BASE COUNT 2057 a 2306 c 2027 g 
2159 t ORIGIN 265 bp upstream of HincII site; chromosome 15q22-q24. 1 gagcaagcct tcatcccgac ggcacactta tataaaaaag aaagggagag atgtatgcgg 61 gaagtcaggg acctgaatga agggactgct ggagccatgg cagaggaaca taaattgtga 121 agatttcatt taatatggac atttatcagt tcccaaataa tacttttata atttcttatg 181 cctgtcttta ctttaatctc ttaatcctgt tatctttgta agctgaggat gtttgtcact 241 tcaggaccac tgtgataatt gtgttaactg tacaaattga ttgtaagaca tgtgtttgaa 301 caatatgaaa ttagtgcacc ttgaaaaaga acacaataag agcaattttt agggaacaag 361 ggaaaacaac cataaggtct gactgcctgc agggtcgggc agaaagagcc atattttcct 421 tcttgagaga ggctataaat ggacatgcaa gtagggaaga tatcactaaa ttcttttcct 481 agcaaggagt attattatta ataccctggg aaaggaatgc attcctgggg ggaggtctat 541 aaacagccgc tctgggaatg tctatcttgt gcagttgaga taaggactga gatacgccct 601 ggtctcctgc agtaccctca ggcttactag ggtggggaaa aactccgccc tggtaaattt 661 gtggtcagac cggttctctg ctgtcgaagg gtgtttgctg ttgtttaagg tgtttatcaa 721 gacagtatgt gcaccgctga acatagaccc tcatctgtag ttctgctttt gccctttgcc 781 ttgtgatctt tgttggaccc ttatcagtgg ttctgctttt gccctttgcc ttgtgatctt 841 tgttggaccc ttatcggtag ttctgctttt tgccctttgt cctttccctc agaagcatgt 901 gatctttgtt agacacttat tagtagttct gcttttcgcc ctttgaagca tgtgatcttt 961 gtaccctact ctccctgttc ttacaccccc tccccttttt gaaaccctta ataaaaactt 1021 gctggtttga ggctcaggtg ggtatcacag tcctaccgat atgtgatgcc acccccggcg 1081 gcccagctgt aaaattcctc tctttatact gtctctcttt atttctcagc cggctgacac 1141 ttatagaaag aacctacgtt gaaatattgg gggtgggttc ccccagtacg ctagtcctgc 1201 tgatgagcaa agaaggtgtt gatggcattt tgatcaacaa gaaatgttat gaaacggcct 1261 cccacctctg gcattcctag tactgaccta tctctccctt cccttccacc gctccccaca 1321 gcccttccac cgctccccac agcttagcac ccctttcgtt cccatacaca tgtacatttt 1381 tattttgggg accattaacc cgacagccct tatcgctgcc aaaaccacat gggctggagg 1441 gccagggctg catggacagt cacaccactg cacaccagcc tggtgatgga gcaagactct 1501 gaaaaaaaaa aaaagaagca taaaagacct ttaacagtcc ctgctacttg tgaccctcct 1561 gaatctgtca tctgtccagt gattttgatt ttgcagcatg gaaagttcca agccttgact 1621 cctgagccca gataccattc 
gcactctgga gtcattcaag catggggagc tcaggcctgg 1681 ctgggctcca ggagaatctt tcttccacaa ggcaagagat aacacaaaag tcaaggtgaa 1741 gatcagggta gccctttaaa aggcctcctt gtgccctaga gttgccacag ctcttctact 1801 ccactgctgt ctatcttgcc tgccggcacc cagccaccat gtgggagctc gtggctctct 1861 tgctgcttac cctagcttat ttgttttggc ccaagagaag gtggcctggt gccaagtacc 1921 ccaagagcct cctgtccctg cccctggtgg gcagcctgcc attcctcccc agacatggcc 1981 atatgcataa caacttcttc aagctgcaga aaaaatatgg ccccatctat tctgttcgta 2041 tgggcaccaa gactacagtg attgtcggcc accaccagct ggccaaggag gtgcttatta 2101 agaagggcaa ggacttctct gggcggcctc aaatggtaag tggtgcccat ctcctccctg 2161 cccccttcac caccccctgg gattggttca ggtcttcaga ccatgcctag aatggggctt 2221 ccagctccaa caacctgtat ttcttcccaa aagtagacta gtgggatgat gtgaagggag 2281 tgaaccttca gacccccacc cagcctctaa attaggcagg ttgaattaaa ggaatgtctc 2341 ctctaccagg ctgcccagga ggtgggatgg gaggaccaca gtggtctgag gggtcaagga 2401 atcctcactc cactcctctc tctgagccta gacttctcta atggactgag aaggtgcatg 2461 tacagcactt agcctagcac ccagcacagt aagtgccctt atacagccag gattcatgtt 2521 acttttcatg gaaaatgggg gcagtgacta ctgtcctcca tgaaagctgc tggggagaat 2581 tagcctagct attgcaggct gggattgctg ctttcctggt gctatttcca gctactcagg 2641 ctcacagggg cagttttcta caatgacatt tcagggttgc tgatgagcct cccactcagc 2701 agggccccca gcctctcagc attttttttt tttttttttt tttttgagac agagtctctc 2761 tctgtcgccc agcctggagt gcagtggcca atctcagctc actgcaacct ctgcctcccg 2821 ggtcaagtga ttctcctgcc tcagactcct gagtacctgg gactacagga gcatgccacc 2881 atgcccgcta attttttgta tttttagtag agatggtgtt tcaccatgtt agccaggatg 2941 gtctcgatct cctgacctcg tgatcctccc accttagcct cccaaagtgc taggattata 3001 ggcgtaagcc actgtgccct gccagcctct cagctttgat caagccaagg gttggtttat 3061 tttttcttgg accaatcagc caggtctgct gaccaactac ctagctccca cctctgctgg 3121 cttcctcccg ggggcagaga agatggagaa ggctagtcat gtggatcttc agggtcagga 3181 aatggaaaag ggaggctttg gacccttttg ctttgggggg cacctctaga agaagcgctc 3241 gcccaagtcc agactgggta gacaaacatc tgcactctcc aaatgtgggc ttgtggctgg 3301 gtatgcaggc ttgcaatgga agggtaaacc 
tgagtgaggt gagctgtgcc tttagctcag 3361 ctaagggctc agggaaaagc agagatctgt gcagtcctca gcctctacaa gctcacctgc 3421 tccctaccct ctggacaggc atagtttaga gagtttatcc catccagagt tgccttcctg 3481 tggtcagaaa ctgatgagca aaaagaagcc agagggcacc ctgtcagcga aaagaacccc 3541 aatgctgctg cattctaatt aagggttctt tctttctcct tgatctactg tatttctgaa 3601 ggaattggga gtaggaggcc ttagggtctg tcctaccaag tccttgcagt catggtggag 3661 tgcagtgggg ctgtgcccac atgggagtca gcatgccagg tacctccctt ctcctccagg 3721 aaggaaagca gggaccagag gtgtaagggc aagagtgggg tggatgggtg tgagattcct 3781 acagccttgc ctgctctcta aaggcaactc tagacatcgc gtccaacaac cgtaagggta 3841 tcgccttcgc tgactctggc gcacactggc agctgcatcg aaggctggcg atggccacct 3901 ttgccctgtt caaggatggc gatcagaagc tggagaagat cagtgagtgc caggctggcc 3961 cctggggctg gggctggatc ccacaggagc tgctggaggg agaggagggt tgggcagggg 4021 taaggggtta ggactagagc gcaatgcagc cttttcttgg ctactgctgc catctagtgg 4081 catctgctat ctgtcccccg ctcctgaggc aactggtaca gagagggggt aagggtgctg 4141 attcatttcc accctcatgc cccctctccc ttcagtttgt caggaaatca gtacattgtg 4201 tgatatgctg gccacccaca acggacagtc catagacatc tcctttcctg tcttcgtggc 4261 ggtaaccaat gtcatctcct tgatctgctt caatacctcc tacaagaatg gggaccctga 4321 gttgaatgtc atacagaatt acaatgaagg catcatagac aacctgagca aagacagcct 4381 ggtggaccta gtcccctggt tgaaggtgag atgctgccag ccctgccttc aggttctagt 4441 agaccctgac attgtcccca atcttccttc ctttttactt ccctgctcca gccgcaatga 4501 cccatctttt tcctgattac ctccgccacc tctacctcct ctgccactta aaacctttgc 4561 catttctctg cagagataag atttagcctt ttaattatgc accttagtac tccagataat 4621 gaccttcatt tcttttccaa ttaccatgtg ccagtactaa gcattctata cgcattcatc 4681 gctgaattcc tttggaagta ggttttatta tccccattgt gcaggtgaga agcaggctta 4741 gcggggttaa ggagcttgtc tgagccttca ggtcatcgtc tctctcactc ctaagggctg 4801 gacacatagc agagtcagcg cttgatgttt gattgaatgg ggaaggagag gtggagacca 4861 cgccctcctc ccttgtttag aattgtcttc gtcgtcatga taaacccgtt ctgtgtcccc 4921 atcttgcctt ccattctggc tgaaggtcag gggtggagta ggaacttcca gagacagaaa 4981 agctaagatc cgcctccagg agagactctg gcagctggag 
aagcaaaatg gaagaagggt 5041 ggatttaaca tttcttttta tttccagatt ttccccaaca aaaccctgga aaaattaaag 5101 agccatgtta aaatacgaaa tgatctgctg aataaaatac ttgaaaatta caaggtaggt 5161 gatagagcag aagagaatat gagttaggct aaaaggaatc acaagagcag ggtggagtcc 5221 attctacaca ctgtagaagc ttcaaaacca agcagagaac ctggcacata gtaggtgtac 5281 aataaaaact gacttaaggg ctgggcgcgg tggctcatgc tgtaatccca acactttgga 5341 ggccgaggtg ggcagatcac ctgaggttgg gaggttcgag accagcctga ccaacatgga 5401 gaaactctgt ctctactaaa aatacaaaat tagccgggca cggtggcgca tgcctataat 5461 cccagctaat ttcgggaggc tgaggcagga gactcacttg aacccgagaa gcagaggttg 5521 cagtgagctg agatcatgcc attgcactcc agcctgggca ttgcaccaag attctgtctc 5581 aaaaaaaaaa aaaaaaaact gactcaagga gttgggatcg aaaagtgagg aactgaagag 5641 gatcctagaa gagacctaac ctctccacca atttaaaaag ggcccggggc tgcctcctac 5701 ctccacacgt tcgtaggtcc tgcccagact tgctctactt ccaagtggaa ggagccttgt 5761 tatctctagt cagggacaga agtatggcag gagtgtcaca gatggggctc cttccttatt 5821 aatgtctccc aaacctcacc caacccagga gaaattccgg agtgactcta tcaccaacat 5881 gctggacaca ctgatgcaag ccaagatgaa ctcagataat ggcaatgctg gcccagatca 5941 agattcagag ctgctttcag ataaccacat tctcaccacc ataggggaca tctttggggc 6001 tggcgtggag accaccacct ctgtggttaa atggaccctg gccttcctgc tgcacaatcc 6061 tcaggtgtgc ttccccctca ttgatcctag accccagcca gcccaatctc tgggctccag 6121 agaaagggag agccaattct ctcaggcttt ctgtgcagga agactaggcc ccagagccac 6181 tactgggaag ggactggaca ggctcttctc gatcgtcaca gttggattat tctctaagcc 6241 cttgcctctc ctgggcttac acacactagt cacctccaac ctactctggt cttcaggtga 6301 agaagaagct ctacgaggag attgaccaga atgtgggttt cagccgcaca ccaactatca 6361 gtgaccgtaa ccgtctcctc ctgctggagg ccaccatccg agaggtgctt cgcctcaggc 6421 ccgtggcccc tatgctcatc ccccacaagg ccaacgttga ctccaggtgt gcctgccctc 6481 ccagtgacat ctagccccat gatgcattca ctgcttgcca gcccacctgg ctccccctac 6541 ccccgcccct gctggccaac ctaaagtcag tcaaccatca actactaaaa atcatcctgc 6601 cggctgggca ctggctcaca cctgtcatcc caacactttg ggaggtgagg cgggtggatc 6661 atgaggtcag gggttcaaga ccagcctgac caattatggt tgaaaccccc 
gtctcttact 6721 taaaatacaa aaaattagcc atgcatcgtg gcgctgtcct gtagtcccag ctactcagga 6781 ggctgaggca ggagaatcac ttgaaccaag gcggcggagg ttgcagtgag ccaagattgc 6841 accactgcac tccagcctgg gtgacagagc gggactctgt ctcaaaataa ataattaatt 6901 aattaaatat aaaaatcatc ctgcccccag ccccgtggct ccatatctct accacctaca 6961 gacacgcatt gactcatcca cagatctgcc tgacttccca gaggagcttc ctgctgccct 7021 cagagacatg tggtctggga tgaaaggctg ggagctccat gttccaacca gctgcagcac 7081 gcacataaca tgcgctgcag ctccaaacac acacccacat acactgccag acaccaaagt 7141 ccacagacac aggtgttcag acagaagcgc ctgttaggag ggaagggatg gagaagggct 7201 ggatttaggt ttgatctggc agaagctgag gaaaacatga gtgagtggga atgagggagt 7261 aaagggcatt ttcctcacgg cggaagaatg agggggcatg aggctgagca aggaagggag 7321 tacgaagtcc cagacccact tttcctcttc cactctggag cagcatcggt gagtttgctg 7381 tggacaaggg cacagaagtt atcatcaatc tgtgggcgct gcatcacaat gagaaggagt 7441 ggcaccagcc ggatcagttc atgcctggtg agtctgtcct gtcctgcgcc ctgggccaca 7501 cagcgagcct ggactctgct ccaaccaccc cagtacccct tcacctctgc aaagcttccg 7561 ctagaaagct cttggctcca actataccga cctgttgacg ccctcatcct gccatagact 7621 tacccaaact cttcacagct gggtttccca cgctcttctt ccaaccaact gcaaattcac 7681 cctccaagaa gccctctact cctctgtctg ccattaagtc tgtcccttct cccctcggat 7741 ggtgctattt tcataggtta attccacctc ttttccatcc ttcctgaata tttcatttcc 7801 tctgtgtcgt taagggctac ctgaaagcag ggctgtatct ctccccaggc gggttcccca 7861 taataaggct acatcctcag atcagggttc cctgggaggg ccatgtctcc ccctcaacca 7921 gggcagaacc atgcctctcc tccctcctgc cctaacccct ggctgatgcc actccttgcc 7981 agcagagcgt ttcttgaatc cagcggggac ccagctcatc tcaccgtcag taagctattt 8041 gcccttcgga gcaggacctc gctcctgtat aggtgagatc ctggcccgcc aggagctctt 8101 cctcatcatg gcctggctgc tgcagaggtt cgacctggag gtgccagatg atgggcagct 8161 gccctccctg gaaggcatcc ccaaggtggt ctttctgatc gactctttca aagtgaagat 8221 caaggtgcgc caggcctgga gggaagccca ggctgagggt agcacctaaa ggctgtaact 8281 cacagcccct gtccacccta tgtggcccca caacacagat ttagagatac aaccccccac 8341 ccttctccgc cattcttccc tactcccaac ccactctgcc ttctttttca gcttgtggca 
8401 atgccagtga tgtgcataaa cagttttttt tttttccata aggtccctga gtagttcatt 8461 tatgtattca tttgctcact cattctttca acaccgattt tattgagcac ctactatgtg 8521 ccattctctt tctcaggaca ctcgagctc // LOCUS HUMPAIA 17509 bp DNA PRI 07-JAN-1995 DEFINITION Human, plasminogen activator inhibitor-1 gene, exons 2 to 9. ACCESSION J03764 M55991 NID g189564 KEYWORDS plasminogen activator; serine protease. SOURCE Human DNA, clones PAI-Cos 1 and PAI-Cos 2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17509) AUTHORS Bosma,P.J., van den Berg,E.A., Kooistra,T., Siemieniak,D.R. and Slightom,J.L. TITLE Human plasminogen activator inhibitor-1 gene. Promoter and structural gene nucleotide sequences JOURNAL J. Biol. Chem. 263 (19), 9129-9141 (1988) MEDLINE 88243790 REFERENCE 2 (bases 1 to 17509) AUTHORS Dawson,S.J., Wiman,B., Hamsten,A., Green,A., Humphries,S.E. and Henney,A.M. TITLE Common polymorphism in the plasminogen activator inhibitor-1 (PAI-1) promoter which shows allele specific differential binding to HEPG2 nuclear extracts and is associated with altered plasma PAI-1 levels JOURNAL Unpublished (1991) REFERENCE 3 (bases 1 to 17509) AUTHORS Bosma,P.J., Kooistra,T., Siemieniak,D.R. and Slightom,J.L. TITLE Further characterization of the 5'-flanking DNA of the gene encoding human plasminogen activator inhibitor-1 JOURNAL Gene 100, 261-266 (1991) MEDLINE 91276254 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.L.Slightom 12-APR-1988. 
FEATURES Location/Qualifiers source 1..17509 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q21.3-q22.1" allele 2488..2490 /note="ggggg is gggg in allele" /citation=[2] prim_transcript 3040..15199 /note="plasminogen activator-1 mRNA and introns" intron 3307..4453 /note="PAI-1, intron A" gene 4455..4725 /gene="PAI1" CDS join(4455..4725,6490..6723,7947..8141,9747..9945, 11538..11638,11758..11844,13053..13136,13457..13494) /partial /note="plasminogen activator-1" /codon_start=1 /db_xref="PID:g386996" /translation="MQMSPALTCLVLGLALVFGEGSAVHHPPSYVAHLASDFGVRVFQ QVAQASKDRNVVFSPYGVASVLAMLQLTTGGETQQQIQAAMGFKIDDKGMAPALRHLY KELMGPWNKDEISTTDAIFVQRDLKLVQGFMPHFFRLFRSTVKQVDFSEVERARFIIN DWVKTHTKGMISNLLGKGAVDQLTRLVLVNALYFNGQWKTPFPDSSTHRRLFHKSDGS TVSVPMMAQTNKFNYTEFTTPDGHYYDILELPYHGDTLSMFIAAPYEKEVPLSALTNI LSAQLISHWKGNMTRLPRLLVLPKFSLETEVDLRKPLENLGMTDMFRQFQADFTSLSD QEPLHVAQALQKVKIEVNESGTVASSSTAVIVSARMAPEEIIMDRPFLFVVRHNPTGT VLFMGQVMEP" exon <4455..4725 /gene="PAI1" /note="plasminogen activator-1, (first coding exon)" /number=2 intron 4726..6489 /note="PAI-1, intron B" exon 6490..6723 /number=3 intron 6724..7946 /note="PAI-1, intron C" exon 7947..8141 /number=4 intron 8142..9746 /note="PAI-1, intron D" exon 9747..9945 /number=5 intron 9946..11537 /note="PAI-1, intron E" exon 11538..11638 /number=6 intron 11639..11757 /note="PAI-1, intron F" exon 11758..11844 /number=7 intron 11845..13052 /note="PAI-1, intron G" exon 13053..13136 /number=8 intron 13137..13456 /note="PAI-1, intron H" exon 13457..>13494 /note="plasminogen activator-1" /number=9 BASE COUNT 4974 a 4406 c 4386 g 3743 t ORIGIN 131 bp upstream of SacI site; chromosome 7q21.3-q22. 
1 gggacatcta gctatgtcta gacccattga actttcaagt cttcgaggct tggtactgcc 61 tgatccctgt tcccctctga tccttaaggt cagagagcaa agccttgggg ggtacttttt 121 tttttttttt ttgagacaga ttcttgctct gtcacccagg ctggagtgca atagcatgat 181 ctcggctcac tgcaacctcc acctcccagg ttcaagcgat tctcctgcct cagcctccca 241 agcagctagg attacaggca cgcaccacca tgcctagctg atttttatat ttttagtaga 301 gatggggttt catcatattg gccgggctgg tcttgaactc ctggcctcaa gtgatccacc 361 tgccttggcc tcccgaagtg ctaggattac aggtgtgagc caccatgccc tgtcttgggg 421 tacttttgag gacaagagtt aagttcggga ctgggcatgt tggcttatac actttgggag 481 gcaaaggtga gaggattgtt ccagcccagg aatttgagac cagcctggtc aacatagcaa 541 gactctgtct ctataagaaa aataataatt agctgggcgt ggtggtgctt gcctgtagtc 601 cccgatactt gcttgagccc aggagttcaa gtctgcagtg tgctatgatc acatgactgc 661 actccagcct tggtgacaga gtgagaccct gtctccaaaa aaaaaaaaaa aaagagatgg 721 gtttggggct ggacttgtct tcttccaaga ggtcagggca gttgcttctt cctcaccgct 781 aaccacactc tcagggactc caaaccacaa ctaagaaaag gaaaaagaca taaacaagag 841 gcagtttcac aggaagagtt ctcctaatgt gtctgtgagc ctaactttcc gactgtggtt 901 ctcctctctt tccctgccag ctgtgagaaa gcaaagccta aggcagatta aaaagcagga 961 gggctgctca gtgtggttgc ccttggtatc tgtttactga aaaaattcaa aacaaacatc 1021 cgacgcttcc acccactgaa acttcctgtg actctgctat cggctgttgc catggattct 1081 cccaactgaa ccccaccaca tacactaaac ccaacagtcc gtggacccca agtatgccat 1141 ctctccaata caggaccccc tgggcagcac cctggccacc cttccatcca caacatccag 1201 accacacggc caagggcacc tgaccctgtc aaaaccccaa atccagctgg gcgcggtggc 1261 tcatgcctgt aatcccagca tttgggaggc cgaggcagcc ggattcacga agtcaggagt 1321 tcgagaccag cctgaccaaa catggtgaac cccgtctcta ctaaaataca aaaattagcc 1381 gggcgtggtg gtgcacacct gtaatcccag ctactcggga ggctgaggca ggagaaccac 1441 ttgaacctgg gaggcggaga ttgcagtgag ccaagatagt gctactgcac tcagcctggg 1501 caacaaaata agactccgtc tcaaaacaga aaaacaaaaa aaacaaaaaa caacaacaac 1561 aacaaaatcc actagtctag ccctacatgc tcacccactg ctagccagtt tccaccctct 1621 acagcagagg tacccaacct ctgggccagg ggcccgtact ggtccatagc ctgttaggaa 1681 cccaggctgc ataaccagga 
ggtgagtggc aggtgagtga aatttcatct gtagttacag 1741 ccactcctca tcactcgcat taccaccaga gctccactcc ctgtcagatc agcggcggca 1801 ttagattctc ataggagctc gaaccctatt ctaaactgtt catgtgaggg atctaggttg 1861 caagctccct atgagaatct aatgcctgat gatctgtcac ggtctcccat cacccctaga 1921 tgggaccatc tagttgcagg aaaacaagct caggctccca ctgattctac acgatggtga 1981 attgtggaat tatttcatta tatatattac aatgtaataa taatagaaat aaagcacaca 2041 ataaatgtaa tgtgcttgaa tcatcccgaa accatcccac cctggtctgt gaaaaaattg 2101 tcttccatga aaccagtccc tggtgccaaa aacgttgagg accactgctc cacagaatct 2161 atcggtcact cttcctcccc tcaccccctt gccctaaaag cacaccctgc aaacctgcca 2221 tgaattgaca ctctgtttct atcccttttc cccttgtgtc tgtgtctgga ggaagaggat 2281 aaaggacaag ctgccccaag tcctagcggg cagctcgagg aagtgaaact tacacgttgg 2341 tctcctgttt ccttaccaag cttttaccat ggtaacccct ggtcccgttc agccaccacc 2401 accccaccca gcacacctcc aacctcagcc agacaaggtt gttgacacaa gagagccctc 2461 aggggcacag agagagtctg gacacgtggg ggagtcagcc gtgtatcatc ggaggcggcc 2521 gggcacatgg cagggatgag ggaaagacca agagtcctct gttgggccca agtcctagac 2581 agacaaaacc tagacaatca cgtggctggc tgcatgccct gtggctgttg ggctgggccc 2641 aggaggaggg aggggcgctc tttcctggag gtggtccaga gcaccgggtg gacagccctg 2701 ggggaaaact tccacgtttt gatggaggtt atctttgata actccacagt gacctggttc 2761 gccaaaggaa aagcaggcaa cgtgagctgt tttttttttc tccaagctga acactagggg 2821 tcctaggctt ttgggtcacc cggcatggca gacagtcaac ctggcaggac atccgggaga 2881 gacagacaca ggcagagggc agaaaggtca agggaggttc tcaggccaag gctattgggg 2941 tttgctcaat tgttcctgaa tgctcttaca cacgtacaca cacagagcag cacacacaca 3001 cacacacaca tgcctcagca agtcccagag agggaggtgt cgagggggac ccgctggctg 3061 ttcagacgga ctcccagagc cagtgagtgg gtggggctgg aacatgagtt catctatttc 3121 ctgcccacat ctggtataaa aggaggcagt ggcccacaga ggagcacagc tgtgtttggc 3181 tgcagggcca agagcgctgt caagaagacc cacacgcccc cctccagcag ctgaattcct 3241 gcagctcagc agccgccgcc agagcaggac gaaccgccaa tcgcaaggca cctctgagaa 3301 cttcaggtag gagaaaagca aactccctcc aacctcttac ttcgggctta aggcagagaa 3361 ctcgcctccc cagaatctcc tccctccatg 
atcccccgct attcctctat tttcttttcc 3421 tcggacctgc agccttgggt cgaccctgcc ctaggggtga ctgcaggaga gcagggagga 3481 tggtcaggcg tcaccaacaa ccccatcacc cagtaacaag aaccttgact ctctcagtcc 3541 ctctgcatca agacacttac ccatttccca cctcatgcct gctaacttga atgaaacaat 3601 cgctgggaaa gcattaagag attaaggctg ggcactgtgg ctcatgcctg taatcccagc 3661 actttgtgag gctgaggcag gcagataact tgagcccagg agtttgagac cagcctgggc 3721 aacatggcaa aaccctgctc tcccaaaaaa atacaaaaat tagctgggcg tgctggtgtg 3781 cctgtattcc cagctacttg ggaggctgag gtgggaggat tgcttcagct ggggaggacg 3841 gaggctgcag ggagccaaga ctgagccatt gcacccagcc tgggtgacag agcaagaccc 3901 tgtctctaaa aatgaatgaa aggaaggaag aaagagagag aaagagagag aggaaagaag 3961 gaaggaagta aagaagaaag aaagaaagaa agaggaaaga ggaagaaaga aagaaaagaa 4021 agaaaagaaa agaaagcaaa tttaaagctt atgcaaatca aagatgttgt gataattgat 4081 aattgagtct gggctaaatt ccccctgggc tgcaaaggca gagagtggta atgactctca 4141 cctgcttttc ttctaaggct tttttacggg acacagaggg aagggagatg gactggattc 4201 caagattccc acagggcaag atgggcgaag actccctgcc actgcccggg gataagtcag 4261 tctgagtgag acggagtggg atgggcttag aacctgaaca tgtcatggtc tcttcctgca 4321 ccttgcccta gtgttcactt accacctgct tgcaggaaac aagaagagca gggcccacag 4381 ctggccagct cccctcccct cccgcctgtc ttccagaacg attccttcac cagccctctt 4441 tccattgctc taggatgcag atgtctccag ccctcacctg cctagtcctg ggcctggccc 4501 ttgtctttgg tgaagggtct gctgtgcacc atcccccatc ctacgtggcc cacctggcct 4561 cagacttcgg ggtgagggtg tttcagcagg tggcgcaggc ctccaaggac cgcaacgtgg 4621 ttttctcacc ctatggggtg gcctcggtgt tggccatgct ccagctgaca acaggaggag 4681 aaacccagca gcagattcaa gcagctatgg gattcaagat tgatggtgag ccacgggaca 4741 ccaggggagg tgggtggcat gcagaacaga cctaccagaa gccaaggaaa ggctggctct 4801 ggcttagccg agccaagccc catacagctg tgctgcaggg gccaccccat cttcttccca 4861 ctacactcca agtcactgga cccttgaatc tccaagggtg tctgaccagt agatttaccg 4921 cttattcacc accgtgtgat cttaacctcg ttaagtttgc ccatctacaa aatgaggatt 4981 atttgctgtc ctaaagaatt catgagccgg gcgcggtggc tcaaacgcct gtaatcccag 5041 cactttggga ggccaaggcg ggcggatcat gaggtcagga 
gatcaagacc atcctggcta 5101 acacagtgaa actccatctc tactaaaaat acaaaaaaaa ttagccaggc gtggtggcag 5161 gcgcctgtag tcccagctac tcgggaggct gaggcaggag aatggcatga acccaagagg 5221 cagagcttgc agtgagctga gatcgtgcca ctgcactcca gcctgggcga cagagagaga 5281 ctccgtctca aaaaaaaaaa ataaagaatt catggaatta cacttgtgaa atacttagca 5341 tagccatcac tataggaaaa aaatctaagg ccaggcacag tgcctcatgc ctgtaatctc 5401 agcacttttg gagtttgagg caggaggatc acccaaggct aggagttcaa ggccagcctg 5461 ggcaatacgg tgaaaccccg tctctaataa aaatataaaa attagtctga tgaggtggtg 5521 cacctgtaat cccagctact caggaagctg agacacaaga atcactttaa cccgggaggt 5581 ggaggtggca gtgagctgag atcacaccat tgcactccag cctaggtgac agagtgagac 5641 ctgtcaaaaa aaaaaaaaga aagagagaga gagagagaag agagagaaag aaagaagaag 5701 aaagaaagaa agagagagag agaaagaaag aaagaaagaa agaaagaaac aaagaaacaa 5761 agaaagaaag gaaaagaaaa aaaaaactaa ggccaggcaa ggtggcttat gactgtaatt 5821 tcagcacttt ggaagattga ggcaggagga tcacttgagg ccagaagttc gagacaagac 5881 tgagcaacag ggagacccct gcctctacaa aaaaatttac aaattagcca gatgtggtga 5941 cacatacctg tagtcccaac tactcaggag gctgaggtgg gaggatggct tgagcccagg 6001 agctggaggc tgcagtgagc tatgattgta ccactgcact tcagcctggg caacaaaggg 6061 aagccctgtc tgaaaaaaaa aaaaaaagaa aaagaagaag aaagaaaata tttgggttca 6121 accaggaggc gaggttgcag taagctgaca tcgcgccatt gcactccagc ctgggagaca 6181 agagcaaaac tccaactcaa aaaaaaaaaa aaaaaaaaac aggaagaaaa tatttagggt 6241 tcataactta agaacagaga aaaatattct agcccaaaga aagggttggg atctgagact 6301 tttgaagaaa ggaaggagat acagaaaaga gatttcatcc tggaatgaaa tctccctcca 6361 gagagccctg ggaaagcacg gtagccccca tccatcagag tggagcccct tgtgggggaa 6421 gtgggctcgg ctgggaaccc tcaattcagc ataagcctca catgtcctct cctctctgtc 6481 ccggtgcaga caagggcatg gcccccgccc tccggcatct gtacaaggag ctcatggggc 6541 catggaacaa ggatgagatc agcaccacag acgcgatctt cgtccagcgg gatctgaagc 6601 tggtccaggg cttcatgccc cacttcttca ggctgttccg gagcacggtc aagcaagtgg 6661 acttttcaga ggtggagaga gccagattca tcatcaatga ctgggtgaag acacacacaa 6721 aaggtgagca ggcagggaaa ggaaacccat ttcctgggcc tcaagagaaa 
gggaatttgg 6781 aaataaatcc acatatccca gttgggtgca gtagttcaca cctgtaatcc cagcccaaca 6841 ctttgggagg tctaggcgag aggaaggctt gaggcctgga gtttgagacc agcctggcca 6901 acataacaag acctcatctc ttcaaaaaat ttaaaaacca gccgggcatg gtggtgcaca 6961 cctgtagtcc cagctacttg ggaggctgag gtgggaggat cacttgagtc cagcagttca 7021 aggctgcagt gagctatgtt tgcaccacca cactccagcc tgagtcacag aacaagacct 7081 catctctaaa aaacaaacaa aaaccaaatc cacatatcct aaaaaatgct ccttttcagc 7141 attctcttct ctatggacaa agggctggat gctttaagaa ccaaatctta ggctgggcac 7201 ggtggctcac gcctctaatc ctagcacttt gagaggccaa ggcgggcaga ttgcctgagc 7261 acaggagttc gagaccagcc tggccaacat ggtgaaaccc tgtttctgtc aaaaatacaa 7321 aaaattagcc aggtgtgttg gcgcgtgcct ataatcccag ctgctcggga ggatgaggtt 7381 caaagaatca cttgaacccg ggaggcagag gctgcagtga gctgagatca tgccactgca 7441 ctccagcctg ggtgacagag caagactttg tctccaaaaa aaggaactag acgggttcat 7501 ttaaacccct gactgcagcc ctttgacata catccaattg aggactgggg actccgggaa 7561 acatctaaaa ggcttaaaaa ctttgtctaa cttcagccgg gcatggtggc tcacacctgt 7621 aatcccagca ctttgggagg ctgaggcagg tggatcacaa ggtcaggagt ttgagacgag 7681 cctgaccaac atggtgaaac cccgtctcta ctaaaaatac aaaaattagc caggcatggt 7741 ggcaggcgcc tgtaatccca gctattcggg aggctgaggc aggagaattg cttgaacccc 7801 ggagacagag gttgcagcga gccgagatcg cgccactgca ctccagcctg gcaatagagt 7861 gagactccat ctcaaaacaa caacaacaac aacaacaaca aaatcgtcta acttcctgat 7921 cttcctgatc attgattttc ccataggtat gatcagcaac ttgcttggga aaggagccgt 7981 ggaccagctg acacggctgg tgctggtgaa tgccctctac ttcaacggcc agtggaagac 8041 tcccttcccc gactccagca cccaccgccg cctcttccac aaatcagacg gcagcactgt 8101 ctctgtgccc atgatggctc agaccaacaa gttcaactat agtaagtcca agagcccctt 8161 ccccacagcc cacagcaact gcatctcatt cctggggtct cccaaggaat acccaaaatg 8221 tcaccctctg agggaggaag accacaggga atgctcccct ttaagggagg agagacccta 8281 gaatatactc cagctttgac aaagatttcc caagcaggag acatcaggat aatgggaaca 8341 gaagacagga ggtttatccc atgaaggatg aagaagctga aatccagaga ttccctcagg 8401 gccacatttg tccacctgac tccagggtct catcttcgtg tgttgctagt gtgattacct 
8461 ggggatgaga aatcctgctg ggggagttga ggttaagagg atgaggactc caggtgctgt 8521 ggctcacgcc tgtaatccca gcactttggg aggccaaggc aggtggatca ggagtttgag 8581 gtcaggagtt tgagaccagc ctggccaaca tggtgaaacc ctgtctctac taaaaatgca 8641 aaaattagcc aggtgtggtg gcaggcgcct gtaatcccag ctactcggga ggctgaggca 8701 ggagaatcac ttgagcccgg gaggtggagg ttgcagtgag ccgaacgaaa ttgagccact 8761 tcaccccagc ctgggcgaaa gagtgaaatt ccattcaaaa aaaaaaggat gaggactggg 8821 atgaactggt ggctgggtgt ggggaaaatg gaagtgaagg aaggccaaaa gagacagaga 8881 aggcctggcg cggcgactca cgcctataat cccagcactt tgggaggctg agaaggggga 8941 ttgcttgagg ccagaagttg aataccagtc tgggcagcat agcaagaccc tgcctctaca 9001 aaaaaaaaat tttttttaat tagccaggct tggtgacatg catctgtagt ctactcaaga 9061 agctgaggtg aggccaggca tggtggctca cgcctgtatt cccagcactt tgggaggtca 9121 aggcgggtgg atgacctgag gtcaggagtt caagaccagc ctggccaaca tggtgaaacc 9181 ccatctgtat aaaaatacaa aaattagctg ggcatgatag caggtgcctg taattccagc 9241 tactcaggtg gctgaggtgg gagaatctat tgaacccggg agggggaggt tgcagtgagc 9301 cgagatcatg ccattgcact ccagcctggg cgacagagtg agactccttc tcaaaacaaa 9361 caaacaaaca aacaaaatac agaagctgag gcgggaggaa catttgaacc ggattcggag 9421 gctgcagtga gctatgattg caccactgcg ctccagtctg tgtgacagtg agaccctgtc 9481 tcttacacac acacacacac acacacacac gcacacacac agagagaaat tagaagatac 9541 tgaattggca gaagagaagg gaaatagaaa ttaaaatact gaatagggga gcagtgaaca 9601 ggggataccc aaaagccaag agcgagagag agcctggctt ccagaaatag tggagaagcc 9661 aggagaacta ggtgaaaacc cagtgctggg ttgccatcag caagagctgg agccatttcc 9721 aacgaaccat cttgtcgtct tcacagctga gttcaccacg cccgatggcc attactacga 9781 catcctggaa ctgccctacc acggggacac cctcagcatg ttcattgctg ccccttatga 9841 aaaagaggtg cctctctctg ccctcaccaa cattctgagt gcccagctca tcagccactg 9901 gaaaggcaac atgaccaggc tgccccgcct cctggttctg cccaagtaag ccaccccgct 9961 atctccccga cctaccaacc cctctctcct ggctccctaa agtcaccgcc cccaggttga 10021 atttcccaga tctgtgacgc ttgcaggaca tgcatgtgtg ggaggctgat gggaaactgt 10081 ggcctgggtt tgattatgag tcttgcaatc atccctcccc ctgtttctgc tggagggcag 10141 
gggacagctc ttcctgacca cacccccaca ttgactatcc ccagaatacc cagcaaaagc 10201 ccccaaaagg agagtcagag aaatgaggga ggtgggggcc caatcagtcc acatctactt 10261 agggtcgccc catcagcact tccatcccca accctttcaa gtcaacatcc aaacaaaaga 10321 aatcacttcc aaggacggag cagctcaaag cgcagctcta gctggggttc caagaaagca 10381 gatttttcga aatccttctg cagaaggaag caaagagatt ttttgaaatc tttctgcaga 10441 aggagaaggc tggagctggg gaactccaga attataggga agcctcccac cacgctcatc 10501 ccaaatttcc ggatgctata atgccaggct tggggaaaga ggagaattta gttggttagc 10561 tggtgcgtgc tctcacttgc atcctctctc ttcctctttt ttttttttct cctctctctc 10621 tggctcataa aaatggaggt aattagttgt gccctggtga gaagcagaga gtgcacaaag 10681 gccccctgct tgagtcctct tcagggttag ctctcagaaa cacaatctgc agaacagatt 10741 tttgttccaa catccttgca ggagaatttg cccttagctt cccccacccc agccaggctg 10801 aataaaatta tgctgaaact actgtcttat ttgaggaaag taattagtca taggtgggag 10861 ggggtgggga gattgcagaa gaatgttcat gaatattagg attttcagct ctaagggggg 10921 actttgtaaa cagctttaga agaagaacca ggccggctgg gtgtggtggc tcatgcctgt 10981 aatctcagca tttggggagg ccaaggcggg cggatcactt gaggtcagga gtttgagacc 11041 agcctggcca acatggtgaa accctgtctc tattaaaaat acaaaaatta gccggctgtg 11101 gtagcgagcg cctatgatcc cagctactcc ggaggctgag gccagagaat cacatgaacc 11161 tgggaggtgg aggctgcagt gagccgagat cacgccactg cactccagcc tgggggacag 11221 agcaagaatc tgtttcaaaa aaaaaaaaag aaaaatagga aggaaggaag gaaaggaaag 11281 gaaagaagag agagagaaag aaagagagag aaagaaagaa agaaagaaag aaagaaagaa 11341 agaaagaaag aaagaaagaa agaaagaaaa agaaaggaaa gaaagaacga acgaaccagg 11401 cctccctctc caaccttcac ctccgtccct attctggcca cttgattcgg gggacacctg 11461 gtaggggatg gggaaaggtg ggagctgcca gccagagggg gaccccggct tgagcagcct 11521 cttgctgcta tctgcaggtt ctccctggag actgaagtcg acctcaggaa gcccctagag 11581 aacctgggaa tgaccgacat gttcagacag tttcaggctg acttcacgag tctttcaggt 11641 aagaagactt tcctttgcat tttctcaccc cagtggactg cgggggcccc taagaggaaa 11701 aaggaacctc tccttgagag cggcagcgat ctaatcctgt atccacatct gtttcagacc 11761 aagagcctct ccacgtcgcg caggcgctgc agaaagtgaa gatcgaggtg 
aacgagagtg 11821 gcacggtggc ctcctcatcc acaggtgagt ctggctcagg tgaggctcca cgggtgtcgc 11881 ctccatcgcc cttcaggata actggtcccc agacccggaa aggaccccgc agccctctcg 11941 gcacagagca gctctgtctg tgctcagcca tcacccactc cccacctgtt tctcagcctg 12001 gaaaacgggc ttgggaccat ggaaccctgt ttcctcgcct gatggctcct aagttccctg 12061 actgtgaaaa ggcctcctaa agaaaaaccc aagttgttcc cacagtggga agtaaactta 12121 agaaacatgc ttatcaggct gggcatggtg gctcccacct gtaatcccag cgctttgggg 12181 gaccaaggca ggtgaatcac ttgaggtcag gaattcgaga ccagcctggg caacatggca 12241 aaaccctatc tctactaaaa atacaaaaat taggcaggcg tggtggcatg tgcctgtagt 12301 cccagctact tgggaggctg aggcaggaga atcacttgaa tccaggaggc agaggttgca 12361 gtgagccgag atcacgctgc tgcactccag cctgggcaat agagcatgac tctgaagaaa 12421 agaaagaaag aaagagagag agagagaaaa gaaagaaaga aagaaagaaa gaaagaaaga 12481 aagaaagaaa gaaagaaaga aagaaagaca aagaaagaga aagaaagaga agaaaagaaa 12541 agaaagagct tatcaataag cccttaaagg atttagataa atgtgtgtaa gggaagagct 12601 gatccattgc taccaagctc ctggaggaaa ccaggtctca gaggatgtcc ctaaactttt 12661 aaggttcata ttcaggaaaa caaacaactt ccagctgggc ttagtggctc actcctgtaa 12721 tcccagcact ttgggaggcc gaggcaggag gatcgcttga gcccaggaat ttgagaccag 12781 cctgggcaat ataatgagac tgtgtctcta caaaaattag aaaaaaatta gccaggcatg 12841 gtggcatgca cctgtagccc cagttacttg ggagactgag gtgggaggat cacttgagcc 12901 catgagttca aggctgcagt gagccatgaa ggtgccactg cactcccgcc tggcgacaga 12961 gggagaccct gtctctaaga aaaacggcgg gggtgggggt ggtgccagtg ccagcatccc 13021 tctgttctaa gacattgtcc cttctcttgc agctgtcata gtctcagccc gcatggcccc 13081 cgaggagatc atcatggaca gacccttcct ctttgtggtc cggcacaacc ccacaggtga 13141 gcctggaacc catcacgttc cacatcctcc cacccattct ttctctcagg aactagtccc 13201 gacagatgca gacatccctc tatccctgag agggctctgg gcagggaacc cataacccta 13261 ccctgcttcc tgtcccaaga ggaggctacc ttctatcacc cacagacagt gccgggtccc 13321 cgctctgtga ctcaggcagc tgcgactcca gacagctcac tcatctgcct agatctcagt 13381 ccttccaccc acatccagcc tgatgagctg tcccactcct tctgcttctc aacccccatg 13441 gttcttccac cctcaggaac agtccttttc 
atgggccaag tgatggaacc ctgaccctgg 13501 ggaaagacgc cttcatctgg gacaaaactg gagatgcatc gggaaagaag aaactccgaa 13561 gaaaagaatt ttagtgttaa tgactctttc tgaaggaaga gaagacattt gccttttgtt 13621 aaaagatggt aaaccagatc tgtctccaag accttggcct ctccttggag gacctttagg 13681 tcaaactccc tagtctccac ctgagaccct gggagagaag tttgaagcac aactccctta 13741 aggtctccaa accagacggt gacgcctgcg ggaccatctg gggcacctgc ttccacccgt 13801 ctctctgccc actcgggtct gcagacctgg ttcccactga ggccctttgc aggacggaac 13861 tacggggctt acaggagctt ttgtgtgcct ggtagaaact atttctgttc cagtcacatt 13921 gccatcactc ttgtactgcc tgccaccgcg gaggaggctg gtgacaggcc aaaggccagt 13981 ggaagaaaca ccctttcatc tcagagtcca ctgtggcact ggccacccct ccccagtaca 14041 ggggtgctgc aggtggcaga gtgaatgtcc cccatcatgt ggcccaactc tcctggcctg 14101 gccatctccc tccccagaaa cagtgtgcat gggttatttt ggagtgtagg tgacttgttt 14161 actcattgaa gcagatttct gcttcctttt atttttatag gaatagagga agaaaggtca 14221 gatgcgtgcc cagctcttca ccccccaatc tcttggtggg gaggggtgta cctaaatatt 14281 tatcatatcc ttgcccttga gtgcttgtta gagagaaaga gaaccaccaa ggaaaataat 14341 attatttaaa ctcgctccta gtgtttcttt gtggtctgtg tcaccgtatc tcaggaagtc 14401 cagccacttg actggcacac acccctccgg acatccagcg tgacggagcc cacactgcca 14461 ccttgtggcc gcctgagacc ctcgcgcccc ccgcgccccc cgcgcccctc tttttcccct 14521 tgatggaaat tgaccataca atttcatcct ccttcagggg atcaaaagga cggagtgggg 14581 ggacagagac tcagatgagg acagagtggt ttccaatgtg ttcaatagat ttaggagcag 14641 aaatgcaagg ggctgcatga cctaccagga cagaactttc cccaattaca gggtgactca 14701 cagccgcatt ggtgactcac ttcaatgtgt catttccggc tgctgtgtgt gagcagtgga 14761 cacgtgaggg gggggtgggt gagagagaca ggcagctcgg attcaactac cttagataat 14821 atttctgaaa acctaccagc cagagggtag ggcacaaaga tggatgtaat gcactttggg 14881 aggccaaggc gggaggattg cttgagccca ggagttcaag accagcctgg gcaacatacc 14941 aagacccccg tctctttaaa aatatatata ttttaaatat acttaaatat atatttctaa 15001 tatctttaaa tatatatata tattttaaag accaatttat gggagaattg cacacagatg 15061 tgaaatgaat gtaatctaat agaagcctaa tcagcccacc atgttctcca ctgaaaaatc 15121 ctctttcttt 
ggggtttttc tttctttctt ttttgatttt gcactggacg gtgacgtcag 15181 ccatgtacag gatccacagg ggtggtgtca aatgctattg aaattgtgtt gaattgtatg 15241 ctttttcact tttgataaat aaacatgtaa aaatgtttca aaaaaataat aaaataaata 15301 aatacgaaga atatgtcagg acagtcactg ccttcacctt ctccatttca caccggtggt 15361 acaagaaatc agaagcctag gccaggtgtg gtggttcatg cctgtaatcc cagcactttg 15421 ggaagccgag gtgggtggat cacctaaggt caggagtttg agaccagcct ggacaacatg 15481 gtgaaacccc gtctctacta aaaatacaaa aattagccgg gcgcggtggc tggcgcctgt 15541 aatcccagct actcgggagg ctgaggcagg agaatcactt gaagccagga ggcagaggtt 15601 gcagtgagct gagattgcac cactgaactc caggctgggt ggcagagcga gactccctct 15661 caaaaaacaa caactacaaa gacaacaaca aacccagaat caaaatcctg ttggtccata 15721 gacctcatgg gtggaagaga ccttcctaca tccaggttgg cccaacatgg gggagtccat 15781 gaaatggtca cctcagctct gccacaagcc ccaaggataa gttgattctg cccctgggaa 15841 tcatcctcaa aaaggaaaaa aatgttcccc tgccataaac tttccactta tgcagatggg 15901 cctgctcgta agtcactgtc actgtgggtt cccaactctg ttcatgacac ttccttccag 15961 caccaaatgc ttcccacccc tctactccca ctccccattc ttcaaaccca gctcaagttc 16021 cagttcctcc acctaggact tcccatggat ccggccaata tcactctcag gtccagcgca 16081 gtggcccacg cctgtaatct cagcactttg ggaggccggg gcaggaagat tgcttgaggc 16141 caggagtttc agaccagcct ggacaacata gtgagactct tcctctaaga aaagaaagag 16201 agaaagagag agagaaagac agaggaaaga gagagaaaaa taaagaaaga gaaagagaga 16261 aagaaagaaa gaaaaagaaa aagaacgaaa gaaagaaaga aagagaaaga aagaaagaaa 16321 gaaaaagaaa ggaaggaagg aaggaaagaa aggagggagg gagggaggga aggaaggaag 16381 gaaggaagga aggggatcaa aacatcatca ctctcaactc gacactgact gagttttctt 16441 ctccctggtc tgtaacagtg cttggattct cgagtgttct ttagctttgt gtgtgtgtgt 16501 cttaccctcc ccaagcccat caaggtatca ggtttcttga aaacaaggct ctcttgttat 16561 atatactcta gcatcttctt gaaaatggct tcctgaaaag tcttttatta tctttctcca 16621 gcatctcgaa ggcttttggc agagggcaca cttccctcca ttagtttctg ttcaaatatt 16681 caaaaataat tttttaaact gaaaaaagac caataaccat ctgttgaact tgtatactgt 16741 ctaactccag aaccctgaag ataggattca gagatgcagc cccctgaaca cccgactcat 
16801 ttctattgct ttttaaacaa ttttttgtag aacggggttg caccatgttt tgaacttttg 16861 ggctcaaaca atcctcctgc ctcggcctcc caaattgctg ggattacagt tgtgagccac 16921 catgcccagc ttggctcagg tgtacctttc aacctcctta taccctgaaa gtatgactaa 16981 ttcaggccca ggctagtggc tgaaaagtta cctgtctcta acagatttgc ttggaaagct 17041 aaacagcctc tttttgcaca aaatgcttca aaaagcagaa caaattagcc ctaactctgc 17101 agtatacagt tcaaaaactg ctcatccatg ctgagatggc tctagaggaa aaagagtcct 17161 tcaaaaagcc cccttaccac agaggaggct tgttccagga aatccccaat ttacaagctc 17221 tcaagctatt gtgaggcaaa aaatgatgta gactcagcaa gcacgagacc tgcccagccc 17281 tgtttagctt tgcctagccc tgtcccttgc ctgttctaag tttgtcttcc cttctctatt 17341 tttttttttc ttttaagaca gggtctcgct ttgttgctga ggcaaaagtg cagtagcaca 17401 atcttagctt attacagcct ccatctccta ggcccaaagc catcctcgca cctcagcctc 17461 ccgagtagct gggaccacag gtgtgcgccg ccaagccctg ctaatattt // LOCUS HUMPALC 7619 bp DNA PRI 02-MAY-1996 DEFINITION Human serum prealbumin gene, complete cds. ACCESSION M11518 NID g189585 KEYWORDS albumin; serum prealbumin. SOURCE Homo sapiens (clone: lambda-HPA1.) (tissue library: T.Maniatas) foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7619; 1 to 7619) AUTHORS Sasaki,H., Yoshioka,N., Takagi,Y. and Sakaki,Y. TITLE Structure of the chromosomal gene for human serum prealbumin JOURNAL Gene 37 (1-3), 191-197 (1985) MEDLINE 86031352 COMMENT [Unpublished (1986) Kyushu U., Maidashi, Fukuoka 812, Japan.] revision of [1]. Draft entry and printed copy of corrected sequence in [1] kindly provided by H.Sasaki and Y.Sakaki, 05-MAR-1986. Sequence in [1] has been revised according to corrections noted in [Unpublished (1986) Kyushu U., Maidashi, Fukuoka 812, Japan.]. FEATURES Location/Qualifiers source 1..7619 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-HPA1." 
/dev_stage="foetus" /tissue_type="liver" /tissue_lib="T.Maniatas" /map="18q11.2-q12.1" exon 582..676 /gene="TTR" /note="G00-119-471" /number=1 gene 582..7518 /gene="TTR" CDS join(608..676,1601..1731,3824..3959,7272..7379) /gene="TTR" /codon_start=1 /db_xref="GDB:G00-119-471" /product="prealbumin" /db_xref="PID:g386998" /translation="MASHRLLLLCLAGLVFVSEAGPTGTGESKCPLMVKVLDAVPGSP AINVAVHVFRKAADDTWEPFASGKTSESGELHGLTTEEEFVEGIYKVEIDTKSYWKAL GISPFHEHAEVVFTANDSGPRRYTIAALLSPYSYSTTAVVTNPKE" sig_peptide 608..667 /gene="TTR" /note="G00-119-471" mat_peptide join(668..676,1601..1731,3824..3959,7272..7376) /gene="TTR" /note="G00-119-471" /product="prealbumin" intron 677..1600 /gene="TTR" /note="G00-119-471" /number=1 exon 1601..1731 /gene="TTR" /note="G00-119-471" /number=2 intron 1732..3823 /gene="TTR" /note="G00-119-471" /number=2 repeat_region 3279..3286 /note="5' insertion target sequence" /rpt_type=direct repeat_region 3287..3580 /rpt_family="Alu" old_sequence 3297..3299 /gene="TTR" /citation=[1] repeat_region 3581..3588 /note="3' insertion target sequence" /rpt_type=direct old_sequence 3636..3638 /gene="TTR" /citation=[1] exon 3824..3959 /gene="TTR" /note="G00-119-471" /number=3 intron 3960..7271 /gene="TTR" /note="G00-119-471" /number=3 old_sequence 5164..5166 /gene="TTR" /citation=[1] old_sequence 5390..5392 /gene="TTR" /citation=[1] repeat_region complement(5737..5746) /note="3' insertion target sequence" /rpt_type=direct repeat_region complement(5747..6047) /rpt_family="Alu" old_sequence 5780 /gene="TTR" /citation=[1] old_sequence 5907..5909 /gene="TTR" /citation=[1] repeat_region complement(6048..6057) /note="5' insertion target sequence" /rpt_type=direct old_sequence 6206..6208 /gene="TTR" /citation=[1] old_sequence 7185 /gene="TTR" /citation=[1] exon 7272..7518 /gene="TTR" /note="G00-119-471" /number=4 old_sequence 7527..7530 /citation=[1] BASE COUNT 2234 a 1609 c 1544 g 2232 t ORIGIN 1 bp upstream of HindIII site; chromosome 18q11.2-q12.1. 
1 aagcttccaa atgacttagt ttggctaaaa tgtaggcttt taaaaatgtg agcactgcca 61 agggtttttc cttgttgacc catggatcca tcaagtgcaa acattttcta atgcactata 121 tttaagcctg tgcagctaga tgtcattcaa catgaaatac attattacaa cttgcatctg 181 tctaaaatct tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata 241 gaaatatgtg agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg 301 gagaagacta tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg 361 tgacctaact ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc 421 taatctctct agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa 481 gtcaataatc agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca 541 ggagaagtga gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct 601 tggcaggatg gcttctcatc gtctgctcct cctctgcctt gctggactgg tatttgtgtc 661 tgaggctggc cctacggtga gtgtttctgt gacatcccat tcctacattt aagattcacg 721 ctaaatgaag tagaagtgac tccttccagc tttgccaacc agcttttatt actagggcaa 781 gggtacccag catctatttt taatataatt aattcaaact tcaaaaagaa tgaagttcca 841 ctgagcttac tgagctggga cttgaactct gagcattcta cctcattgct ttggtgcatt 901 aggtttgtaa tatctggtac ctctgtttcc tcagatagat gatagaaata aagatatgat 961 attaaggaag ctgttaatac tgaattttca gaaaagtatc cctccataaa atgtatttgg 1021 gggacaaact gcaggagatt atattctggc cctatagtta ttcaaaacgt atttattgat 1081 taatctttaa aaggcttagt gaacaatatt ctagtcagat atctaattct taaatcctct 1141 agaagaatta actaatacta taaaatgggt ctggatgtag ttctgacatt attttataac 1201 aactggtaag agggagttac tatagcaaca actaaaatga tctcaggaaa acctgtttgg 1261 ccctatgtat ggtacattac atcttttcag taattccact caaatggaga cttttaacaa 1321 agcaactgtt ctcaggggac ctattttctc ccttaaaatt cattatacac atccctggtt 1381 gatagcagtg tgtctggagg cagaaaccat tcttgctttg gaaacaatta cgtctgtgtt 1441 atactgagta gggaagctca ttaattgtcg acacttacgt tcctgataat gggatcagtg 1501 tgtaattctt gtttcgctcc agatttctaa taccacaaag aataaatcct ttcactctga 1561 tcaattttgt taacttctca cgtgtcttct ctacacccag ggcaccggtg aatccaagtg 1621 tcctctgatg gtcaaagttc tagatgctgt cccaggcagt cctgccatca atgtggccgt 1681 gcatgtgttc agaaaggctg 
ctgatgacac ctgggagcca tttgcctctg ggtaagttgc 1741 caaagaaccc tcccacagga cttggtttta tcttcccgtt tgcccctcac ttggtagaga 1801 gaggctcaca tcatctgcta aagaatttac aagtagattg aaaaacgtag gcagaggtca 1861 agtatgccct ctgaaggatg ccctcttttt gttttgctta gctaggaagt gaccaggaac 1921 ctgagcatca tttaggggca gacagtagag aaaagaagga atcagaactc ctctcctcta 1981 gctgtggttt gcaacccttt tgggtcacag aacactttat gtaggtgatg aaaagtaaac 2041 attctatgcc cagaaaaaat gcacagatac acacacatac aaaatcatat atgtgatttt 2101 aggagtttca cagattccct ggtgtccctg ggtaacacca aagctaagtg tccttgtctt 2161 agaattttag gaaaaggtat aatgtgtatt aacccattaa caaaaggaaa ggaattcaga 2221 aatattatta accaggcatc tgtctgtagt taatatggat cacccaaaac ccaaggcttt 2281 tgcctaatga acactttggg gcacctactg tgtgcaaggc tgggggctgt caagctcagt 2341 taaaaaaaaa aagatagaag agatggatcc atgaggcaaa gtacagcccc aggctaatcc 2401 cacgatcacc cgacttcatg tgcaagagtg gcttctcacc ttcattagcc agttcacaat 2461 tttcatggag tttttctacc tgcactagca aaaacttcaa ggaaaataca tattaataaa 2521 tctaagcaaa gtgaccggaa gacagagcaa tcaggagacc ctttgcatcc agcagaagag 2581 gaactgctaa gtatttacat ctccacagag aagaatttct gttgggtttt aattgaaccc 2641 caagaaccac atgattcttc aaccattatt gggaagatca ttttcttagg tctggtttta 2701 actggctttt tatttgggaa ttcatttatg tttatataaa atgccaagca taacatgaaa 2761 agtggttaca ggactattct aagggagaga cagaatggac accaaaaata ttccaatgtt 2821 cttgtgaatc ttttccttgc accaggacaa aaaaaaaaag aagtgaaaag aagaaaggag 2881 gaggggcata atcagagtca gtaaagacaa ctgctatttt tatctatcgt agctgttgca 2941 gtcaaatggg aagcaatttc caacattcaa ctatggagct ggtacttaca tggaaataga 3001 agttgcctag tgtttgttgc tggcaaagag ttatcagaga ggttaaatat ataaaaggga 3061 aaagagtcag atacaggttc ttcttcctac tttaggtttt ccactgtgtg tgcaaatgat 3121 actccctggt ggtgtgcaga tgcctcaaag ctatcctcac accacaaggg agaggagcga 3181 gatcctgctg tcctggagaa gtgcagagtt agaacagctg tggccacttg catccaatca 3241 tcaatcttga atcacaggga ctctttctta agtaaacatt atacctggcc gggcacggtg 3301 gctcacgcct gtaatcccag cactttggga tgccaaagtg ggcatatcat ctgaggtcag 3361 gagttcaaga ccagcctggc caacatggca 
aaactccgtc tttatgaaaa atacaaaaat 3421 tagccaggca tggtggcagg cgcctgtaat cccagctaat tgggaggctg aggctggaga 3481 atcccttgaa tctaggaggc agaggttgca gtgagctgag atcgtgccat tgcatccagc 3541 ctgggtgaca agagtaaaac tctgtctcaa aaaaaaaaaa ttatacctac attctcttct 3601 tatcagagaa aaaaatctac agtgagcttt tcaaaaagtt tttacaaact ttttgccatt 3661 taatttcagt taggagtttt ccctacttct gacttagttg aggggaaatg ttcataacat 3721 gtttataaca tgtttatgtg tgttagttgg tgggggtgta ttactttgcc atgccatttg 3781 tttcctccat gcgtaactta atccagactt tcacacctta taggaaaacc agtgagtctg 3841 gagagctgca tgggctcaca actgaggagg aatttgtaga agggatatac aaagtggaaa 3901 tagacaccaa atcttactgg aaggcacttg gcatctcccc attccatgag catgcagagg 3961 tgagtataca gaccttcgag ggttgttttg gttttggttt ttgcttttgg cattccagga 4021 aatgcacagt tttactcagt gtaccacaga aatgtcctaa ggaaggtgat gaatgaccaa 4081 aggttccctt tcctattata caagaaaaaa ttcacaacac tctgagaagc aaatttcttt 4141 ttgactttga tgaaaatcca cttagtaaca tgacttgaac ttacatgaaa ctactcatag 4201 tctattcatt ccactttata tgaatattga tgtatctgct gttgaaataa tagtttatga 4261 ggcagccctc cagaccccac gtagagtgta tgtaacaaga gatgcaccat tttatttctc 4321 gaaaacccgt aacattcttc attccaaaac acatctggct tctcggaggt ctggacaagt 4381 gattcttggc aacacatacc tatagagaca ataaaatcaa agtaataatg gcaacacaat 4441 agataacatt taccaagcat acaccatgtg gcagacacaa ttataagtgt tttccatatt 4501 taacctactt aatcctcagg aataagccac tgaggtcagt cctattatta tccccatctt 4561 atagatgaag aaaatgaggc accaggaagt caaataactt gtcaaaggtc acaagactag 4621 gaaatacaca agtagaaatg tttacaatta aggcccaggc tgggtttgcc ctcagttctg 4681 ctatgcctcg cattatgccc caggaaactt tttcccttgt gaaagccaag cttaaaaaaa 4741 gaaaagccac atttgtaacg tgctctgttc ccctgcctat ggtgaggatc ttcaaacagt 4801 tatacatgga cccagtcccc ctgccttctc cttaatttct taagtcattt gaaacagatg 4861 gctgtcatgg aaatagaatc cagacatgtt ggtcagagtt aaagatcaac taattccatc 4921 aaaaatagct cggcatgaaa gggaactatt ctctggctta gtcatggatg agactttcaa 4981 ttgctataaa gtggttcctt tattagacaa tgttaccagg gaaacaacag gggtttgttt 5041 gacttctggg gcccacaagt caacaagaga gccccatcta 
ccaaggagca tgtccctgac 5101 tacccctcag ccagcagcaa gacatggacc ccagtcaggg caggagcagg gtttcggcgg 5161 cgcccagcac aagacattgc ccctagagtc tcagccccta acctcgagta atagatctgc 5221 ctacctgaga ctgttgtttg cccaagagct gggtctcagc ctgatgggaa ccatataaaa 5281 aggttcactg acatactgcc cacatgttgt tctctttcat tagatcttag cttccttgtc 5341 tgctcttcat tcttgcagta ttcattcaac aaacattaaa aaaaaaaaaa agcattctat 5401 gtgtggaaca ctctgctaga tgctgtggat ttagaaatga aaatacatcc cgacccttgg 5461 aatggaaggg aaaggactga agtaagacag attaagcagg accgtcagcc cagcttgaag 5521 cccagataaa tacggagaac aagagagagc gagtagtgag agatgagtcc caatgcctca 5581 ctttggtgac gggtgcgtgg tgggcttcat gcaccttctt ctgataaatg cctccttcag 5641 aactggtcaa ctctaccttg gccagtgacc caggtggtca tagtagattt accaagggaa 5701 aatggaaact tgtattagga gctcttaggc ctcttcactt catggatttt tttttccttt 5761 ttttttgaga tggagttttg ccctgtcacc caggctggaa tgcagtggtg caatctcagc 5821 tcactgcaac ctccgcctcc caggttcaag caattctcct gcctcagcct cccgagtagc 5881 tgggactaca ggtgtgcgcc accacaccag gctaattttt gtattttttg taaagacagg 5941 ttttcaccac gttggccagg ctggtctgaa ctccagacct caggtgattc acctgtctca 6001 gcctcccaaa gtgctgggat tacaggtgtg agccaccgtg cccggctact tcatggattt 6061 ttgattacag attatgcctc ttacaatttt taagaagaat caagtgggct gaaggtcaat 6121 gtcaccataa gacaaaagac atttttatta gttgattcta gggaattggc cttaagggga 6181 gccctttctt cctaagagat tcttaggtga ttctcacttc ctcttgcccc agtattattt 6241 ttgtttttgg tatggctcac tcagatcctt ttttcctcct atccctaagt aatccgggtt 6301 tctttttccc atatttagaa caaaatgtat ttatgcagag tgtgtccaaa cctcaaccca 6361 aggcctgtat acaaaataaa tcaaattaaa cacatcttta ctgtcttcta cctctttcct 6421 gacctcaata tatcccaact tgcctcactc tgagaaccaa ggctgtccca gcacctgagt 6481 cgcagatatt ctactgattt gacagaactg tgtgactatc tggaacagca ttttgatcca 6541 caatttgccc agttacaaag cttaaatgag ctctagtgca tgcatatata tttcaaaatt 6601 ccaccatgat cttccacact ctgtattgta aatagagccc tgtaatgctt ttacttcgta 6661 tttcattgct tgttatacat aaaaatatac ttttcttctt catgttagaa aatgcaaaga 6721 ataggagggt gggggaatct ctgggcttgg agacaggaga cttgccttcc 
tactatggtt 6781 ccatcagaat gtagactggg acaatacaat aattcaagtc tggtttgctc atctgtaaat 6841 tgggaagaat gtttccagct ccagaatgct aaatctctaa gtctgtggtt ggcagccact 6901 attgcagcag ctcttcaatg actcaatgca gttttgcatt ctccctacct tttttttcta 6961 aaaccaataa aatagataca gcctttaggc tttctgggat ttcccttagt caagctaggg 7021 tcatcctgac tttcggcgtg aatttgcaaa acaagacctg actctgtact cctgctctaa 7081 ggactgtgca tggttccaaa ggcttagctt gccagcatat ttgagctttt tccttctgtt 7141 caaactgttc caaaatataa aagaataaaa ttaattaagt tggcactgga cttccggtgg 7201 tcagtcatgt gtgtcatctg tcacgttttt cgggctctgg tggaaatgga tctgtctgtc 7261 ttctctcata ggtggtattc acagccaacg actccggccc ccgccgctac accattgccg 7321 ccctgctgag cccctactcc tattccacca cggctgtcgt caccaatccc aaggaatgag 7381 ggacttctcc tccagtggac ctgaaggacg agggatggga tttcatgtaa ccaagagtat 7441 tccattttta ctaaagcagt gttttcacct catatgctat gttagaagtc caggcagaga 7501 caataaaaca ttcctgtgaa aggcactttt cattccactt taacttgatt ttttaaattc 7561 ccttattgtc ccttccaaaa aaaagagaat caaaatttta caaagaatca aaggaattc // LOCUS HUMPAP 4497 bp DNA PRI 07-JAN-1995 DEFINITION Homo sapiens pancreatits-associated protein (PAP) gene, complete cds. ACCESSION L15533 NID g482908 KEYWORDS pancreatitis-associated protein. SOURCE Homo sapiens adult blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4497) AUTHORS Dusetti,N.J., Frigerio,J.M., Fox,M.F., Swallow,D.M., Dagorn,J.C. and Iovanna,J.L. 
TITLE Molecular cloning, genomic organization, and chromosomal localization of the human pancreatitis-associated protein (PAP) gene JOURNAL Genomics 19 (1), 108-114 (1994) MEDLINE 94245143 FEATURES Location/Qualifiers source 1..4497 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /dev_stage="adult" /tissue_type="blood" /map="2" promoter 1..986 /gene="PAP" mRNA join(986..1011,1300..1409,1970..2088,2276..2413, 3040..3166,3445..3731) /gene="PAP" exon 986..1011 /gene="PAP" /number=1 gene join(986..1011,1300..1409,1970..2088,2276..2413, 3040..3166,3445..3731) /gene="PAP" intron 1012..1299 /gene="PAP" /number=1 exon 1300..1409 /gene="PAP" /number=2 CDS join(1334..1409,1970..2088,2276..2413,3040..3166, 3445..3512) /gene="PAP" /codon_start=1 /product="pancreatitis-associated protein" /db_xref="PID:g482909" /translation="MLPPMALPSVSWMLLSCLMLLSQVQGEEPQRELPSARIRCPKGS KAYGSHCYALFLSPKSWTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSIGNSYSYV WIGLHDPTQGTEPNGEGWEWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLRWKD YNCNVRLPYVCKFTD" intron 1410..1969 /gene="PAP" /number=2 exon 1970..2088 /gene="PAP" /number=3 intron 2089..2275 /gene="PAP" /number=3 exon 2276..2413 /gene="PAP" /number=4 intron 2414..3039 /gene="PAP" /number=4 exon 3040..3166 /gene="PAP" /number=5 intron 3167..3444 /gene="PAP" /number=5 exon 3445..3731 /gene="PAP" /number=6 BASE COUNT 1125 a 1121 c 1020 g 1231 t ORIGIN 1 ctgcagcctt gaactcctgg gttcaactga aggtcctcct acctcagcct gctgagtagc 61 taggaccaca agcacacacc accgcaactg gcttaaatta aaatataaat tgtagagata 121 gggtcttaat gtgttgccca ggctgctctt gaactccttg cttcaggtga tcctcccacc 181 tcagcctctc aaagtgctgg gattatagac ctgagccaca gcacctggcc aactgaccta 241 tgattttaca caatggctgc tcttcccttc tttaactatt attcattctt ctttgatcct 301 cattatttga ctgtagtcct tcttatgtct tgttttcctt cattacctct tattctatca 361 cattgccatt gtcattctcc actggggaag ctctttcttg ctgaagactg gaaagacaag 421 tccattcacc tgattttctg taagattgtg gctcatgtat tgacttgtca gacaattctg 481 aagtttcatc aaaattagct atcatgcttg cataatggcc 
ctgaaccctc actcctacac 541 ttagcttcag taccatctat gtcctcaact gtccatgata cttataattc ccgtaaatct 601 tcacttaaca cctaacattt atttaatctt actaggcaag gtaataagaa atacataggt 661 ttgcctccag aagtgggttc ttaagaaacc caccagagga actcctcttt cagatgtcca 721 cattagaaga tttcatatca catttggtgc cacaggcctt tgacaaggag gatgcagagg 781 aaaaagcaaa cttcacctct tcctagggaa agtgttggcc tgccaacagg aaagaggcaa 841 catctgggaa aatccccagt ctttgccagg aagagtccat gccaacccca ccccatgacc 901 cctgtcctgc ctactcattg tcactcttca ctccaatgtc cctcccccag atcctctata 961 aaatcccact ctttcctgac cagacaaacc ataccatatc ccaccagaga ggtaagtggg 1021 agctgagaga agatgagacc cagggaggag ctactgcaca tgacacagga gaatacatgg 1081 gagggtccct tcctcaggga gcacaggaac tctgagactc agcaagggtg tcctgggagg 1141 gctcggggat gggagagtac acagattcac aactcattca gaactgtaga agatgatgga 1201 tgtgaccaag atcactttag tcctagggga ctagagaagg aaaatgacat gaggcagtgg 1261 ggtatctgtg tgttctccca ctgaccacgc tttctttagt gactcctgat tgcctcctca 1321 agtcgcagac actatgctgc ctcccatggc cctgcccagt gtatcttgga tgctgctttc 1381 ctgcctcatg ctgctgtctc aggttcaagg tgagattgct ttgcctctag cactgggttc 1441 cctatgaatc ctcagagcta acaagaggag gaaggctcct gtgtgtcatg tgaggtaatg 1501 acgtggtgtc taatgaacct gcctgcagtt cttgcatcat ctctccttcc ttcaggttaa 1561 cttgcagtgg gaggctccat ggtggtccac taacagtgga atgagatggc ttccatttag 1621 tcagtggact ctaatataca ctggtgggaa agtggactct aatatacact ggagggtcag 1681 taatgagatg tggggaggga caatgattgg aggacccaat gtagagacag cccagagtga 1741 ggagagtatt gaatggttga ataaggggaa agggtaataa gagactggat ggtgctccat 1801 ttactatggc tattttgaga taaagaattt ctgaaaacat aagggaagat gaaggggtgt 1861 caggaatgtg gtcttcctcc ccaaggacat tcctaggtat tccccaaggt catctcccac 1921 cccaagcccc actcttcatt ttaccctccc ctctcttctt ccacctcagg tgaagaaccc 1981 cagagggaac tgccctctgc acggatccgc tgtcccaaag gctccaaggc ctatggctcc 2041 cactgctatg ccttgttttt gtcaccaaaa tcctggacag atgcagatgt gagtggttag 2101 atgtggtgtt ggaggtgacc ggtctcaggg ggaggagggt ctccattcag gagagttcct 2161 tgggaatgag gatgaacacg tttatctttc acacagtcct cctcccacct 
acctttgccc 2221 tgccctccct cagcaggtct caggctccct ctcattctct ttgttgccct caaagctggc 2281 ctgccagaag cggccctctg gaaacctggt gtctgtgctc agtggggctg agggatcctt 2341 cgtgtcctcc ctggtgaaga gcattggtaa cagctactca tacgtctgga ttgggctcca 2401 tgaccccaca caggtgccag tatatcctcc ctctctgtta cctctcaagg tgctattgtt 2461 gcccaggccc actccctgtc ccctgtgcct gcccaggaag tacttcaggg agcactggag 2521 ctcagattct ggggaatatt tggggggaaa gggaaggcca tgaagcatct gaagatctga 2581 gttctgtgga ggtctctatc tttcagataa aatcaatctg ccttcctcag gcgtattaca 2641 taattctcat atgaggctgg gttaacaatt ctctgagctt catggagtct ttgcctacta 2701 ttctgaagga actcttaatg aagataggat caatttttgt ccccatacag aactgacatt 2761 acttttgagg ttcacaagct aatcacaaat gctacatcaa ttattgttct gcaaataata 2821 tattaccttg agttgttcca aaggtcttat gtttattggc tggaattttc caatagcaat 2881 gaggagtcaa ggaagagttt cctactcacc ggcagcatct ggaatagcag accaactttc 2941 ctcatgctgg ggagcaaatc aggtgttgca gctaaggggc catgcaagaa gagctgcaat 3001 ggccattccc ttcacctggc tacctcctct actctacagg gcaccgagcc caatggagaa 3061 ggttgggagt ggagtagcag tgatgtgatg aattactttg catgggagag aaatccctcc 3121 accatctcaa gccccggcca ctgtgcgagc ctgtcgagaa gcacaggtaa gaaacagagg 3181 agctgcctct tcccagtgtt ttccatctca tcccccattc ctgggtctga ccttcaggaa 3241 atcttcctga gctagaaaat acaatgttag tgtgtcttct cttatctcct ctcttctcca 3301 ctttctttga atctctctcc tggattggga cactggtgaa ggtgagggag aggctttaac 3361 ttctaggcta aaacctggga tgccccttca ttggattcac aagcttcctc agccccattc 3421 catttatgtc ttctgtctct ccagcatttc tgaggtggaa agattataac tgtaatgtga 3481 ggttacccta tgtctgcaag ttcactgact agtgcaggag ggaagtcagc agcctgtgtt 3541 tggtgtgcaa ctcatcatgg gcatgagacc agtgtgagga ctcaccctgg aagagaatat 3601 tcgcttaatt cccccaacct gaccacctca ttcttatctt tcttctgttt cttcctcccc 3661 gctgtcattt cagtctcttc attttgtcat acggcctaag gctttaaaga gcaataaaat 3721 ttttagtctg cacttgtttg tcttgtatat gccagtgtca tagccatact ctgagaagga 3781 caaagtgttt gagtggagga aactttatgg gtcttgcttc ttccctattc acccaggcct 3841 ctagggaaaa tgatgaagtg tgcatcccta ccagtgtgtt atgatgaggg tgtgggtcct 
3901 gctcatgtag gatttgtgtt gtggagagat gaggacattt ctctcccgcg tacttactgc 3961 cctcccattc ccgtagccca aacctgacag tgtgacatga acagattagg aggctctgat 4021 ggtgcttaga atagtacttc tcagagaatg gcatcagcag gatggtagat aggactttcc 4081 agctcttgaa ccttcacaga aacattcatt tgaactacta cccattaaaa tggaaatacc 4141 ttcacaagag ctaacaatcc caagtgagtg attaaagcat ctgaatgttg caaaaaataa 4201 gaagggatgc atcgaagagg gtagaaagaa gacttttaca ttatttatat cacccctcca 4261 tcaatctcag taagcacagc atggagagac attccctaaa cttggggaaa gagagtgaaa 4321 taagcacttg agttttccat ggaccctaac actaggtttg cctcagtaag acccagtggc 4381 ctctgactcc aggcagacac ccttggactt agactccagg ctgccttgat gccaggccag 4441 gctctgtggc cccaggctct gtgaccccag gctccaggtc agcccccatg actgcag // LOCUS HUMPCBD 12404 bp DNA PRI 07-JUL-1995 DEFINITION Homo sapiens (clones HGPCD2 and HGPCD15) pterin-4a-carbinolamine dehydratase (PCBD) gene, complete cds. ACCESSION L41560 NID g848984 KEYWORDS dimerization cofactor of hepatocyte nuclear factor 1 alpha; pterin-4a-carbinolamine dehydratase. SOURCE Homo sapiens (clone: HGPCD2) DNA; and Homo sapiens (clone: HGPCD15) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12404) AUTHORS Thony,B., Neuheiser,F., Blau,N. and Heizmann,C.W. TITLE Characterization of the human PCBD gene encoding the bifunctional protein pterin-4 alpha-carbinolamine dehydratase/dimerization cofactor for the transcription factor HNF-1 alpha JOURNAL Biochem. Biophys. Res. Commun. 
210 (36), 966-973 (1995) MEDLINE 95283563 FEATURES Location/Qualifiers source 1..12404 /organism="Homo sapiens" /note="(vector lambda EMBL3)" /db_xref="taxon:9606" /clone="HGPCD2" /clone="HGPCD15" /map="10q22" mRNA join(4669..4691,7292..7423,7989..8069,9172..9559) /gene="PCBD" /note="G00-138-478" gene join(4669..4691,7292..7423,7989..8069,9172..9559) /gene="PCBD" exon 4669..4691 /gene="PCBD" /note="G00-138-478" /number=1 CDS join(4689..4691,7292..7423,7989..8069,9172..9270) /gene="PCBD" /EC_number="4.2.1.96" /codon_start=1 /function="dimerization cofactor of hepatocyte nuclear factor 1 alpha" /db_xref="GDB:G00-138-478" /evidence=experimental /product="pterin-4a-carbinolamine dehydratase" /db_xref="PID:g848985" /translation="MAGKAHRLSAEERDQLLPNLRAVGWNELEGRDAIFKQFHFKDFN RAFGFMTRVALQAEKLDHHPEWFNVYNKVHITLSTHECAGLSERDINLASFIEQVAVS MT" intron 4692..7291 /gene="PCBD" /note="G00-138-478" /number=1 exon 7292..7423 /gene="PCBD" /note="G00-138-478" /number=2 intron 7424..7988 /gene="PCBD" /note="G00-138-478" exon 7989..8069 /gene="PCBD" /note="G00-138-478" /number=3 intron 8070..9171 /gene="PCBD" /note="G00-138-478" /number=3 exon 9172..9559 /gene="PCBD" /note="G00-138-478" /number=4 BASE COUNT 3508 a 2858 c 2999 g 3039 t ORIGIN 1 gaattcagga tacttggttg ccctgtgagc tcacctccct ggtgagctca agaaaagtga 61 tgactttgta gtgcatgcag tcttttctca ttgttaagta tgatagcaag gctttctgcc 121 tcctggatgg aaggtgaata cttcttgctg tccttaagtc tttccttcca ccgtctctca 181 ccacccattc tcaacctaac gctacacatt ttaggttgct tttatagcaa cattccactt 241 tcaggcacca aattttgttc ctgccatttg ttcctattgc tgcagaacaa atcatcacgc 301 tgagtggctt aaaacaaaaa ttatgatttc tcatgcttcc acgggttgac tggtctcact 361 tggggcctca cgtgggattg aagtcagatg tctcctgggg ctgaagtcat ctgaaggttt 421 ggcaggtctg agcatccaag agtctcactc acgtggctgg agaatgatga acagttccca 481 gagtttcttt ggcttagtct tagaagttat agctggatgt ggtggctcac acctggaatc 541 ccacacttgg gaggccaagg cccaacattt gagcccaaga atttgagacc agcctggcaa 601 catagaaata tcctgtctct gtaaaaaatt taaaacatta tccaggcatg 
gtgactacct 661 gggggagctg aggcaggagg atcacttgag cccaggaggt caaggctgca atgagctatg 721 attgcaccac tgcactccag cctgggcgac agagcaagac actgtatcaa aacaaagtca 781 catggtgaca tttctgccat atttgactgg tcatactggg ccatcccaga ttcaacatgg 841 gtgagaacta tgcagggtgt gaatattggg agtcatggtt cattgggggc tatctttgga 901 gactaacatc ctgcctaaca ctacatacca aaagggacct ttcacattta ggtcttaaat 961 ctatctaact ataaagttaa caaaagaaaa gggagtatat ttttgtagca gggtggggag 1021 aggacttatc aaataaattt tcaaatgcac aaaattataa ggcagaggga aaaaagataa 1081 atcttattac accaaaatat aagatttctg cccacagaca tcatggatct catgcaagaa 1141 gtcagtatac agatggcaga atgggaggag atatttgcaa tatcaaaact gacaagcaat 1201 taatgtctag attttatata acacccacaa accaactaaa aaaaagagag cactaaaatt 1261 ctcaaaattg gtcaaaggat attaaaagca gttctcaaat agtaaccccc aaatggtaac 1321 aagcatatga agaggttctc aaaatcatta gtcatcagaa aacaatgtag agtaaaacaa 1381 ttgaaagcgt caggaggcag acaatttcct aggcagatgg gggtgggtcc ccagtgtaaa 1441 ccccaccttc aagccaaaaa cagcctggac tgctggtttg gatgaaaccc ctcaacccag 1501 ggtgagaaat ctgttcctgt tgcccaccct tcctgattgg ttcttctgaa taatgctttt 1561 taaccaatca aatgttgcct tttctaatac tacctacagc ccgcccctcc ccaatcctgt 1621 gcctataaaa ccccctagac tcagccacac tgagatagac aaccccacct tctcaatccc 1681 tctccactga gagctgtttc atcgtcaata aaattctcca tcctcatcac ccttcaactg 1741 tcagcatgac ctcattcttc ttggatgcag aacaagagct caggacccac caaatgtggg 1801 cacccacaaa ggctgtaacg ctggccctct accctcactg gtggagggca gctgccccac 1861 agaacagaag ccgcagcagg gccaagctgg tccaggagcc acgggccaga gtggagaaag 1921 gggccgcctg agctgccaac atgccgctgt ccatcaggct gcagatggtg gagataaaag 1981 agctatttag cacactgtaa cacccctctc tggggcttca ggcacccctg cctgggtgct 2041 gccacagtct ccttggggtg acacacctga tctggctgtg ggccccacac agagcctgct 2101 cctgtgtcat ggctcagtgc agctggctgg accctgcatt cactcggtca catgctccct 2161 cccgccaggg gctgagcata cagtcgcagt ggccttgtga gccatgggct gtaacacaag 2221 ccagatgggg cctggcaagc tgagtagata aggcactccc tgctgcaacc tcagcaaagg 2281 ggtcaagaaa aatcctgcat cacaacaagc tctcatttta tagctattag cctgacaaaa 2341 
aaataaaaag ctggctaatg ggaagtgtga gtatgtatgt gtaagatgca tatggaaata 2401 gtaccctctc acaccatgca ggagagaatg aggactggtg tccccatcct ggacatcatc 2461 tagaactatt ttgagataca cacccatggg gggaattgct ggatttatag ggtatggata 2521 tacttaagtt gcctaagaaa ctgcccacag cccagggagc aattagagat gttcactgca 2581 gcagcattgt tgtttaactc tgtgtacgta tgtgtttgtg tgcccacatg ctcacgggct 2641 tgggaggtgg tggtagcagc agaaagttga agggaacctg agtgtccatc actgatagaa 2701 caagaaaccc atggcagact ctcattacaa aatactaggc tgcagttaaa agctacagat 2761 tgaatataca tatagcaaca tgggtagatc ttaattttcc actaaatact ctgttttgta 2821 agaaacagac tgagaaatgt aatatattta cataatacca tttacataaa tttaaaaata 2881 cacatagacg aaacaacacc acacattttg caagaacctg tgcaaacaaa aatccacatt 2941 aacacattag agaggctgct atgggcggtg gtggggggcg gggaataaaa gtggagcaca 3001 agaataaaag ttaatagaaa aatgtgaaat gtgagaggac ttttctaatc tgactctaag 3061 acttattata aagctataat aatcaagaca aggtagtgtt ctgtcatgac agacaaacag 3121 atcaatggaa cagaagaaag ggctcagaaa taaacccaca tgtatacagt caattgatat 3181 tttacaaagg tgcaaagaca actcagtgga gaaaggaata actgagtcgc catgtgcaaa 3241 agtgcatctg gatctataac ttgaagcata taaaaatatt tactcaaagt ggatcacaga 3301 ctaaatgtaa aaccaaaagc tataaaacat aggagaaaac ttttgatctt gggttaggcc 3361 aagatttcta tgtgtacaac atcaaaatag aaatccataa aagaaacatt tgatatattg 3421 gatttcatca aaattaagaa cgtttgctct ttgaaaggca ccataagata atgcaaagaa 3481 aatatttgca aatcactgcc tgatcatgtc cttatatcca gaataaaaca ccctcaaaac 3541 tcgataagaa gacaacccaa tttctttttt aatgggcagg aaattcaaac atgctattca 3601 ataaaaaaaa cttatcctta gcaaataagt acatgaaaga cattcaatac aattactaat 3661 caaggaaact caaaaccaca atgagatact gctacacacc aattaaaatt taaaaggctg 3721 actatatcaa gtgttggtga ggacagagaa ctgcaacctt catacactgc tgctaggaat 3781 ataaaatgat acaaccattt tgcaaaacag tatggcagtt tcctaataag ttataaaagt 3841 ccacctacca catgacccag gcattccatt cctaagtgtt tactcaagga aaattaaaac 3901 atacgtactt tcgaagattt gtgcaccata ttcgcagcag ctttatttgt aatagccaaa 3961 aggtaaacaa cgcaaatgtc caatagtgaa tcgctaaaca aattgtagtc tatctgtaca 4021 atgcaatact 
ataaataaat tttacttagc cataaaagga agaaaccatt gatataggca 4081 acaacatggc tgaaccccaa aacagttatc ctgcctagta tatactctat ggtccattta 4141 cataaaattc tagaaaatgc aaactaattt atagtgacct aaagtacatc agtgcttgcc 4201 tggccaggtg gtttcgggag gaaggggtta ccaaagggca agggaaaact tgcagggatg 4261 atgatatgtt cataatgttg atctcggtga tggtctcacg agggaaacat taaaatttat 4321 caaattgtac acgttgaata tgtgcaattt attgcatgtc aattatatcc aaataagaga 4381 tttttataaa gaaaaaaata ggtattctgg cgcccagggc cacccagctg taagcggcgg 4441 ggtgggattc caacccagaa gcgggcattt ggagccctca tgcttcaccc tagctccctg 4501 aaacaagcac ggaggatggc tctgagcagc gaggaaagtg ctggaagcaa gcaggcagag 4561 cgcgtccctg gctggggacg ttaatcatta ccggagggcg gcccgagcgc ggccccgccc 4621 cgggacggca gcctgcgcgc ccggccgccg cctgccctct ccgctggcca cctgctgccg 4681 cccgcgccat ggtgagtcca ggggtgcgcg gccgcgatcg gggcagcggg gccggggcag 4741 ccgcgggcga ggtgggagcg ggaaggggcg ggggaaagtc ttttcgagtc ccctgcggag 4801 ccgccggggt ccgaaagtgg ggtctctgcc cggctctctg tcccggcggc ggggagggga 4861 agctgaagcc ctgagagggg cggtggcggg tccacatccc acccagttgg gaggccaggc 4921 ctgccctgac cccggccagt gcttgttccg ggaccttccg ggggcggggc ctacgacctt 4981 cctgggccgg ggaagacagt cgtggaggag tccacggatc tcgtcccctt ggtggcccga 5041 ttctgcttta tactggggtg ggagcgagag ggaggctgag tttgcgcgct cgtgtgcgtg 5101 cctgtccgct gccccttccc gctggcactc tgggtacggg gtctgcctgg gactcacgct 5161 ctgcgccggt atgctggccc cagagccagt ccccgcgccg gcttgaggcc cgtgccaggc 5221 cacgggggct tggtggcgct ggagcagccc tcctcccgtg agcttccccc acccaccccc 5281 ggggcggggg cgggggcggg gcccttgctc tgctgtaagc tcgcgagctt catcagacca 5341 tttctggtcc acaggcttcg gagcaggcaa gactgggagg gccttttgat aggagaggga 5401 cttgggggag ggaggatgag tggggctgtt tgtgaggggc gctgaggtcc cagagatgca 5461 ggcttgggaa atcggtaacc actccccaca cctcgccaga ggaggcaaac ggggcagcgt 5521 ccccctctct gtctccagag ccagcacctg gaaggtttgt cagcttcagc cgcaggctgt 5581 gcgaccctgg ggggttccag aaggctctac cgagaaaggg aagcctccta agagctcacc 5641 tggactcacc caccccttta agagccaagg aagtaaacag tgggctctga tgctttggct 5701 gacttctggt tgtctagata 
cacggaccag gcggtcagcc acggggagaa ggccagctct 5761 ggagtagaga cctacctact gggcagatga tgcaaatcgc tgctgccatt caatccctat 5821 agtcatccag agccgagggg aagagagggg ccaagctaca tcagtccaga gagagctcac 5881 gttttcagga gggaaaacag cagggaaaaa agacggactt aggaaagcac tgcgatgctt 5941 cagctacccc aaatgctacc aggcccatca caagcctgct tgccttagga agtttttttt 6001 ttgtaatgca gatgacattt gctttatagc agtggagttt cctaaagtgt cccattggca 6061 cctcatgatg atctgtggag gtggaggttc tgccaagaga ggtacagcca cctggctggt 6121 aaatggtgaa agcgggtctc aaaactcctg ccaatcaaca catggtgtcc agtctgcagg 6181 ctgcctcata aaagctaaat caacctccct cacattccct ccccacccct aagggtatct 6241 tttcaattca gcactctatt tctgccctgg ggtggtgcta ctctagaagc ctaggttctg 6301 ggatctgtta agctcctaga cctggatcct cctaggatga aactgaccct tgcttccttg 6361 ctttctctga ggttccactc atggcctgaa ttgtgccaat ttgggctgcc ttgagggcac 6421 ttcgcacatt aggtaaatag gatggggcac taggtttgcc agctgccttt ggccaagttc 6481 cttccacttt ggcctgagct tcccaaccag ccagaaaagc agacggaaaa ccagagaaac 6541 ctagccctgg cggctcagga ccagaggctg cctgggagca tgactgagga cctactagag 6601 gtagtgccaa cggcggtcag ctgcagtgtg gctaagaatg ggatggacac ttattttaaa 6661 tgtagctcct tcgtccagcc ccagcaatgc tgacacacag acaacactgc gccatcactg 6721 gctaggggtg ggcgggaaag aacagcagac aagtcgtttc ccctctctag gcctgtttcc 6781 tcttcaatac acgtgaaagt tacggtaggg acgtggggtg catatgatga agtacataat 6841 aggaaaagga aggggggaga gtcatgggta ctcttcctga gaggttgtgg agtaacatat 6901 ttgcatctgg gatttatttt gcttttacgc cctcactgat gtaataatct catggaggga 6961 aggtccgtaa ggctttgagg gcagggtggg ggaagagtgc ttctctgtcc ccgtcatatg 7021 aattacattg cttttgtgat ttcccttgtg tgtgtgtgga gtgttaaaac ccaagacctg 7081 agggcctatc ggcagacgag cctggacccc acctcctggg cagggtgcca cccttgaggc 7141 ctcccagcct attgctcaaa gacctgctgc cctttggtca cccacgtcca aagcagccag 7201 tggaagctaa gttcccctaa aggttggagc ccctctgttt tgacctataa agagaaatgc 7261 ccttgggcct ctgttttttt ccctcttcta ggctggcaaa gcacacaggc tgagcgctga 7321 ggagagggac cagctgctgc caaacctgag ggctgtgggg tggaatgagc tggaaggccg 7381 tgatgccatc ttcaagcagt ttcatttcaa 
agacttcaac agggtatggc accagggtgg 7441 gacagagttt tctgagacgg gctgatgggg tcaccttaca aagcatgttc tctcccttca 7501 cgtcttgtta gccaagactc tcagacacca cactcatcca gttatttggg cactgtgcca 7561 gtggggaaat taattttcct gtctaactca gaaagtatca tctttgggga ttggagagcc 7621 ataggaggat gtgtcttggg ctctcccttc ctggaacttt tagcagcagt cttctgagga 7681 gatagttcaa gtggaaacca tgcacatctg aagacatttg aacagacggc caagactggc 7741 caggccaaaa gcctgttgtg gtgtttccac cttgttggca tcctaaatag caagactttc 7801 ttcctactgc aaggacaatc tctaactcaa gcctgagaac cagtgacagg atgtcaaggg 7861 ggaaatgatt atagaggagt ctcctggggg gaagctaatt catcctcctg cgtctaggcc 7921 ttgaattggt gatgtccaga agtcctttct cttagaacac taacctggct ctcactccac 7981 ttaaataggc ctttgggttc atgacaagag tggccctgca ggctgagaaa ctggaccacc 8041 atcctgaatg gtttaacgtg tacaacaagg tgagtgatgc tgtgtgcctc tgttgtgctc 8101 tgcccccatc caaatactgc gaactctgac acattctgaa ggcttttgga cttaacaatg 8161 agccttcctg acccataggt catccactca tctcatttcc aatctcactg agattgcaca 8221 tgcctagaat caacataaat acccctggga aaaaaaaatc agtcttacta taaagaaatg 8281 caaattaaaa gatgcatgag atagttgtgg ttaaagtaga catttaagta gacattaaga 8341 aggcaaaacc caagaaagga aaggctgtgg gaatcaggca tacttattca gtgttcatag 8401 ctctaaatgg ctttaatctt gctggaggta tggaagaatg aaaaagtaat aaacttgaac 8461 taagaataca cacatatttg ggaggccgag gtgggcagat cacgaggtca ggagatcaag 8521 accctcctgg ctaacatggt gaaaccctgt ctctactaaa aacacaaaaa aattaggcgg 8581 gcgcggtggt ggacgcctgt agtcccagct actggggagg ctgaggcagg agaatggtgg 8641 gaaccgggag cggacgttgc agtgagcaga gattgagcca ctgcactcca gcctgggaga 8701 cagagcaaga ctccgtctca aaaaaaaaaa aaaaaaaaaa atacagacac atagccaacc 8761 tatcaaatta tttgcttaat gaatagaaac agtcattaaa tgtctaccat gtgccaaaag 8821 ctctgctggt ctctgtttat aaaaacagta agggtggaag aggtcgtcta tgaatggata 8881 gtgtttagga gttggcattg gtttttggtt ttgtctttag cttttaggtt gctcttgaca 8941 ttcagatatt aagcttttat tctatagctg aagtcatgat tatttttgag agacagcaaa 9001 ggggagatgt agaagatgtt gttacagaca tgtcaggaag gagtccccag gagagatcgt 9061 actggccagc tgctattctg gagctcctag cagggctgag 
cccagcagct agtgactccc 9121 tcctgttctt aagtgaagtc agtggaagca gaatttctgg tttcatttca ggtccacatc 9181 acgctgagca cccatgagtg tgccggcctt tcagaacggg acataaacct ggccagcttc 9241 atcgaacaag tagcagtgtc catgacatag accctgccct tcctctttga attcttccgg 9301 gggaaggggt gactgaactg ggagtccagg gagggagctg aggagccctt accctcccac 9361 cactcccctc ccaagaccca gccgccgccg ttgagggctg agtccttgct gtgggatgtg 9421 ccagtgtccc caccaacacc aggaatttag accttttccc tgcaccactc tcttcatcct 9481 gggggctctg ttacactaat ttgaataaac tctccccttt ctttgcaact tcccagcaac 9541 aataatgatt ttcttgccag gccgtctctt gctccctaat tcatttccca ggaagctgtg 9601 atacagggtg aaataaagtc ttgtcttaga aaccaggacc ctaaacccca cactatgtaa 9661 tagaaacaca tgtgttttta tgtctcaaat aaaactatta tatcacttgg tctcaggctc 9721 tagaaacttc ttttccacag gtcaatccaa attggagtca ataggcattt aaatgaaagc 9781 acttcctctg tgctaggcac ttcctctgtt caaggtactg gaaaatgaca gggaccagac 9841 caccttgcct tcagattcat tatcatctag gggataccat ttgattatca tcaaacagga 9901 acacaattaa ctattaccaa ttaccaatta actaggtact aattaactat tactagagtg 9961 aattaatgct aggtggcagg ataagaaatg agctaaaggg aacgcaagtg attaatggtg 10021 aaaagggcaa atctggaggc tttgagttat gccttgaaga cctgggtagg attcttggga 10081 ggcagagatg gctgagagaa caccatgaac acagataagg gtgtgaaggg cacaggtatt 10141 atatgcattg tggaacctat gtctagaaag gcagtctggg gccatattac aggttgaaca 10201 tccctaattt gaaaattgga aatccaaaat ctgaaacttt ttgaatgcca acatgatgcc 10261 caaaggaaat actcattgga tcattttaga tttctgattt tcagattaga gatgctcaac 10321 ttgtcagtat aatgcaaata ttccaaaatc caaaatctga aacacttctg gtctcacacc 10381 tttcagataa gggatactca acctatattt tcttcctggc tgtgatggct aatgttaggt 10441 gtcaacttga ttgggccatg gggtgcccag gcatttggtc aaacattccg ggtgtgtctg 10501 tgagggtgtt tctgaatgag gataacattt gaatcagtag actgagtaaa gccgactgtc 10561 ttccctaaca tgggtggcct cattcaacca ggtgaaggcc tgtacagaaa aaaagggcta 10621 aaaagggctg accctcttcc aaataaaagg gaagtcctcc tgcctgactg ccttgaggtg 10681 ggacattggt cttttcctgc tttcggactt aaactgaaac atccactctc cttaggtctc 10741 aaggctacaa gcctccggtg tgcaacaata 
ccatctgctt tgctggttct ccagcttgcc 10801 aactacagat cttgggactt gtcagcctcc ataatcacat gagccaattc cttataagaa 10861 atctctttgt gtgtattggt tctgtttctc tggagaatca taatacacag gcctatcata 10921 tcagatatgg cagatcccaa atcctgatgg tctcttcagt ttcttaaaat cttagcaaag 10981 ttcttagggc ttgcctggtt tactccacaa gagagaagac acaggcagct taggacctgg 11041 caggctattt attactggag cagttaagag atggtttgca cagtcagttc cccagccagg 11101 aactctctag gacagaatat tcccacgctg tcatgaagaa gggggcacta agagcagcct 11161 catctctacc cccacgagaa tgccacacaa attttacctc acatatgtca ctatgcagag 11221 actgggaagc cttcttccta atagttctca cctgtgtctc ctaactcaag tactaagctg 11281 acttgtaaat gccggggtca cttggaacag gaatctagcc atgtcacaac tcagtctgaa 11341 tgagttggtc ccccagcaaa ggccaggaaa ggggcagagg ctggaggttg agaaataagc 11401 gtattgttca caagagggat tagcaggacc ccgttcctag catatatgct agtaaaggat 11461 tatcaatacc tatattgatg gcaaacgaga ttcagacagt ggtctcccat ctgctccact 11521 tctgacttaa aattttccaa gtgtgctttg atctgatgac tggaggactg aaggacagaa 11581 gacgccagat gcggaggtat ggactatgct gttatgttac attcctgcct ctgcttattt 11641 tcatgggtca agggttgaaa aatgttcacc cgatgcaggg acagctcatg cctgtatttc 11701 caacactttg ggaggccaag gcgggaggat catttgagct caggagttag agactaggct 11761 gggcaatgta gagaaactct gtctctacag aatatacaga aaaattagcc aggcatggtg 11821 gcacacacct gtggtcccag ctactcggga ggctgaggtg ggaggatcac ttgagcccag 11881 gaagtcaagg ctacagtgag tgatgatcat gccattgccc tctagcctgg gtgacagtga 11941 aaccttgtct ccaaaaaata aagccaacct tggcagcact acagttgtat gaagcacatt 12001 catgtgagac aaagacacag gtaacacttt cttttctttt tttttttaga aataaatagg 12061 tttattctaa aattaatcca aataattgaa attataggga aggatattaa gaagagaaaa 12121 gattacaaga cagaggaaag cataaaagaa aataaggtaa ttaaaacttc aaatcaggaa 12181 aaaggttctc aaaaatctga ctttgcctac ccaggtacag ttggtggcta tcaaattagg 12241 actcagtaat tacttgtgtc actgagtgaa agctgctcat tgctgccaag aacctacaga 12301 gaaggccaag gggctgacgt acagctgtat ggagcagcaa taagcacccc agacacgact 12361 tcctagataa gccaggcctc ctcatccaag tacatcctgg atcc // LOCUS HUMPCI 15571 bp DNA PRI 
21-SEP-1993 DEFINITION Human protein C inhibitor gene, complete cds. ACCESSION M68516 M64880 M64881 M64882 M64883 M64884 NID g189677 KEYWORDS plasminogen activator inhibitor 3; protein C inhibitor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Meijers,J.C. and Chung,D.W. TITLE Evidence for a glycine residue at position 316 in human protein C inhibitor JOURNAL Thromb. Res. 59, 389-393 (1990) MEDLINE 91048502 REFERENCE 2 (bases 1 to 15571) AUTHORS Meijers,J.C. and Chung,D.W. TITLE Nucleotide sequence of the gene coding for human protein C inhibitor (plasminogen activator inhibitor-3) JOURNAL Unpublished (1991) REFERENCE 3 (sites) AUTHORS Meijers,J.C. and Chung,D.W. TITLE Organization of the gene coding for human protein C inhibitor (Plasminogen activator inhibitor-3): Assignment of the gene to chromosome 14 JOURNAL J. Biol. Chem. 266, 15028-15034 (1991) MEDLINE 91332018 FEATURES Location/Qualifiers source 1..15571 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(2200..2297,7932..8567,10627..10897,11335..11482, 12608..13669) /gene="PCI" /citation=[2] exon 2200..2297 /gene="PCI" /citation=[3] /number=1 /evidence=experimental gene 2200..13669 /gene="PCI" conflict replace(2281,"") /gene="PCI" /citation=[3] exon 7932..8567 /gene="PCI" /citation=[3] /number=2 /evidence=experimental conflict replace(7943,"t") /gene="PCI" /citation=[3] CDS join(7949..8567,10627..10897,11335..11482,12608..12790) /gene="PCI" /citation=[2] /codon_start=1 /product="plasminogen activator inhibitor 3" /db_xref="PID:g189678" /translation="MQLFLLLCLVLLSPQGASLHRHHPREMKKRVEDLHVGATVAPSS RRDFTFDLYRALASAAPSQNIFFSPVSISMSLAMLSLGAGSSTKMQILEGLGLNLQKS SEKELHRGFQQLLQELNQPRDGFQLSLGNALFTDLVVDLQDTFVSAMKTLYLADTFPT NFRDSAGAMKQINDYVAKQTKGKIVDLLKNLDSNAVVIMVNYIFFKAKWETSFNHKGT QEQDFYVTSETVVRVPMMSREDQYHYLLDRNLSCRVVGVPYQGNATALFILPSEGKMQ
QVENGLSEKTLRKWLKMFKKRQLELYLPKFSIEGSYQLEKVLPSLGISNVFTSHADLS GISNHSNIQVSEMVHKAVVEVDESGTRAAAATGTIFTFRSARLNSQRLVFNRPFLMFI VDNNILFLGKVNRP" sig_peptide 7949..8005 /gene="PCI" mat_peptide join(8006..8567,10627..10897,11335..11482,12608..12787) /gene="PCI" /citation=[2] /product="plasminogen activator inhibitor 3" exon 10627..10897 /gene="PCI" /citation=[3] /number=3 /evidence=experimental exon 11335..11482 /gene="PCI" /citation=[3] /number=4 /evidence=experimental conflict replace(11447,"c") /gene="PCI" /citation=[1] /citation=[3] exon 12608..13669 /gene="PCI" /citation=[3] /number=5 /evidence=experimental conflict replace(12884,"ga") /gene="PCI" /citation=[3] conflict replace(12901,"ga") /gene="PCI" /citation=[3] conflict replace(12930,"gg") /gene="PCI" /citation=[3] conflict replace(12999,"") /gene="PCI" /citation=[3] conflict replace(13053,"ga") /gene="PCI" /citation=[3] conflict replace(13117,"g") /gene="PCI" /citation=[3] conflict replace(13124,"") /gene="PCI" /citation=[3] conflict replace(13170,"gc") /gene="PCI" /citation=[3] polyA_site 13669 /gene="PCI" BASE COUNT 3933 a 3941 c 3888 g 3809 t ORIGIN 1 aagcttcctc actccttggc acctggctcc gacatcacat tgacttttcc cttcctgctt 61 ctaccatcac atcaccttct tctgactcca atctcctgcc tctttcttgc aaggatcctt 121 gtgattgtaa ttaggaccca gctggataat ccatgacaat ctcttcaaga tccttaactt 181 aatcacatct gcaaagtccc ttttgtcata gaacgataac attcacaggt tctgggtatt 241 aggacacgga taacttcggg gttccattac tcacccataa ctggtatgca gtgctgattt 301 ccatcctgta ggtacggttt agggatctct aggtcaatga gataatggac tcttgctcat 361 gttacatggc ataatgggaa gaaagccaaa cctagaaaaa gagggactca ggttcccttg 421 ttagagcctc ttcttactaa ctgtgggatt aggggctgat tccctgacct gctgtgttct 481 gtcttctctc caattcaatg ggaatgaact gtgagggcac tgagcaaaaa ctaaggtctc 541 aatacctagt agtagtggga cttgcctctg gatacccagt agtgatcctg ccgcctgttt 601 ctggatatcc tacagtagca atcccacctg tttctggata tcctacagta gtgatcctgc 661 ctgtttctgg atatcctaca gtagcaatcc cgcctgtttc tggatatcct acagtagtga 721 tcctgcctgc ttctggatac ccagaagtga 
tcatgcctgg ttctagatac ccagtagtga 781 ttgtgtttcc tctagatacc cagtaggtat tgtgcttgct tctagagatg tagtagtagc 841 tggacttagc tctagatacc cagtgctcat ctctagactg ggctgagatc agtgtctccc 901 ttgaagggtt attgtaagga tgaaaaaaga taatgcgttt aaagcacttg gtgtagtagg 961 tggtcttttt aaaagtgtga ataaatacta gttcttatta tttctgtgga tatccaacag 1021 ccacataatt gggccccaaa gccatgaaga aggaagagga aatgtcttaa aggttgtcga 1081 tggacagtgt ttgctgaaca tcaaaatcac tttccaggta ttacctctga tttgctctac 1141 caactccaca ccccacctgc agccacataa ccttccatga tcacggccat gcacaacaca 1201 ccatgtcccc caggcaaggg gaccttagaa acataaccag gcttgagaca gcactctgca 1261 ccggtgtctt ggaaatgctc ttaagagtgt atggctgagt tagggaacca ggatttcaaa 1321 gtagaaaggg agaatctacc caagcccata gaaatcctga atccactcct ttctcagcaa 1381 caagcactgg cctgggagtc agccacttat gcaccaaccc cactctgccc ctaattaaat 1441 gcatgacttt gaaaattccc ctcattcttc tgagccccaa ttcagtgatt ggtgcaatca 1501 caggcttggc tacagtgacc cattcattgc aggcatggtg agactctcaa tccctctcat 1561 ttccactaga atctaactgt tgggatctat gacccagtca gcatagcagg cctgtgggga 1621 gctctcaggt tcaagcatat gcccccccta atctacaaga aattagctgc agaaaaccaa 1681 ggaatagaac ctggaaaaag agagggtttg ctagagctgt ccctttccct gtctctggaa 1741 tgccaacaat agggaggctc tttggtcttg tctctcagga gtgcccatgc cattccagga 1801 aaatgatggc ccagctggtg gtgtaaggct tggggggcag cgagtgggca tcgtggtgaa 1861 agcctcggga tcagggagct gcgtctgcag gcaggcctgc tggccggaaa cctgccagga 1921 aaggaagggg ctgtctcggg gcggggccag ggaggggtgg agacagggcc ggctgtggtc 1981 agtgacaaat gctggctgca atccagccag ccctctgccc tttctgagcc cgagggactg 2041 ccacctccac tgtgtgcaca ctcagctacg ggacacagta agtaccgatg ccgcaaaggg 2101 aggtccccag ggcttgaggg catgtgaggc gaggagagga tggactctag agttttgggg 2161 tttggggtct gcaaagctct gaaggagtct catctctgca gtttcaggta tccaaggcag 2221 cagaggtgag tgggtccccc gagctctgtg accttatgct ccacactaac tctggcagag 2281 cctccgtttc ctcataggta agatggaaat aattacaccc tctggatggt gtgactgaag 2341 attaaataca gcgggtgctc tcactcagca catctggcca tgtctgcaga cacatttggt 2401 tgccacaact ggaagggggg tgggggttag tgacatctag 
aggccagcga tgctgctgat 2461 gatcccacaa tgcccaggac aagatcacaa agcatcatcc tgttcaaaag gtcaacagga 2521 tcaaggttga gagaccctga aataaggcca tggggacaaa atgtcggctg gataggaggt 2581 gctcagtaag tggcagcttc tgttgttttc tgtgcctgga gtcttggggc tttagaaatc 2641 aggaacaatg atccaatatt atcggcttcc gtgagataag ggcatcttgc ctggaggctg 2701 ccacccaggc cggtcatggc agctgctcat gaaggacagt aacaatttgg cagtttgtta 2761 aatgaacaaa atgtagaaat aaagtaagca gaatttttag tttttctgaa ggtagggctt 2821 ttggccagat atgcagcaat aaaagagcaa actgcttcct tgggccagtg tccttgctca 2881 tagatcagga aaccgaagca tgaagaatac aggcggcaga tgcctgaagg taacggacgt 2941 gttcatggtg ctgacggtga tgataagtga cagatgtaga ctcatctcca aacttgtcag 3001 gttatagaca ttaaatatgt gcaactttat gaatagcagt catgtctcaa tcaagtggtt 3061 ttaataaaga aataatagga agccagagct gagagacagg gagggagttg ttcaaggtca 3121 cctggcaagt gagctccggg gcggggagag ctcagctctg ggtggccagc ctggcttttt 3181 ccactgctca gtgtccagct tgcagtctaa tgtctcgaat tacagagaag gagactggtc 3241 agttcattca ttcattcatt ctacaaaggt ttatggagca tctctcctga ctgcaagctc 3301 ttgaaggtga gagcagcaca aatgagggtc ccatggagag agaggccgga atgaaaaatg 3361 tcaatgacaa atgcatatat aaaggcacat gtgtaattga aagagctttg agagaaagag 3421 tcaagggact gttccagaga atagccatgg aaggggaaaa ggtccagtgt gataaggtat 3481 tgcaaagaag tgacatttaa gcaaaagcct gcagcctatg cagaagttgg cctcagtgag 3541 aaaggttggg ggagggttcc agtagagagg gaaggtatgc aaaggcccag agttaggaca 3601 gaacttgctg tgtttgagaa actgggaaaa gaagagtgag cctgggggta tcacgtgatc 3661 cagggcagag caggtccagg ccaggtgcag ccaggtcaca gcagccctag tgggttagag 3721 cacaaatcaa agtttagcat ttatctgaaa cacaggagtt ggccatgagt ttcttaggcg 3781 aggaagcgct gtgaccatat ttatgattga aggagattct tttatatgct gtatatagaa 3841 agcctttcag ggcaaagaaa ggaagctact ggggtagccc tgggggagat gaagggagct 3901 tccactgggg gcagtaagaa agccagggaa aggcggcagc tttaagacct gttttggaga 3961 tagaacggac aagctttgct gatgggctgg agtggaacag gaagtcaaga ttacttcttc 4021 tgggaagttc tgttcctggg tctttaggat ctagaggaag ctgtgacttt gtctctcatc 4081 tctgcctggg ctccaagcct cacatccctt tttgtaatta gaagatattg 
gacagaccgt 4141 cctcactaac acaattccca cagctgagtc cagggtagaa ctgggcagga cttcactgcc 4201 caacacggga aatatcagtc agcagatttg ggtttcgggg atggtggtgg gccagcggga 4261 agactgacca gggcctaccc atcacatccc caccacctcc cacctcaatt caccttggcc 4321 tgagatgaca ggtgaacatg actgatcctc tctcttccct ctgcagaaac actaaagcca 4381 gggaccagga gaggggcagc ccaaccaagc tttcaaagca ctcagtagag gctggtctgg 4441 gggatgggag gctcccaggg cttcacctgt ctctgtcaaa gccatgtatt tccaccagag 4501 gcccaagagt gcgatggcaa accctggatt tgaaactaag aaacgtaaaa caagcactga 4561 ggactccact gcctcttgag tgacctctct gaccctctgt ttcttctgca ctgttaggat 4621 aatgatacta actccatgtt gttgtagaga agtataaatg agctaataca ggtgaaccgc 4681 ctggggatac caggaggtga ggtcgaggag gaacgaggta tcactcctca gagccactca 4741 gagagaggct gtgcacgagt cagaggaacc tggattttaa ttccggttcc atcactcagt 4801 agctgaaaca agctattcca cttcacttag cctcagtcta ttcaatctgt aaaatagagt 4861 gagtttactt ttggaaaact ctgtaaaata gagagcttac ttttggtgaa ggttaaacat 4921 agtaatattt atggagtgtc tagtatgtct ttaataatta gtggttttac tgaaaagtag 4981 agagagttgg cccagaggga gcaagatttc tgggtctcaa acatgtagcc caggagagcc 5041 taagtgaacc tggggccctc tccaaacaga tcctggggga gactcagtgc acacccggag 5101 aagcagctcc tccccatcgg atctctagtg cttggcaggg ggcggggtct tgagggggtg 5161 tccacaacac atggcagact gcagatgaag aaactgaggc ccagaggggg tgaggcttgc 5221 ccagggtgac ctagtagctg aatagatggg agaatggagc cagggcctca ctgagactct 5281 ctggtcagct gcccctgggc tgtatccaat aaggaaactc ccctgcttct gaagctgttc 5341 tcgaaattat cagctcagtg tgaccctgtg gggggttgag ccacattgtt tctttagaag 5401 catctccata catggctggt tccaaccctt ggcaggaggg accatattgt gctgtaaaat 5461 agactcattt agagaagccg gagattaaag cacccaccta tgtccttcaa agctctccag 5521 gcaagtgcca tggtgggaac aggtagggag tgtcagtggg gggaagccca gactctgctc 5581 actcattatc tgcagattag ggctattgtt ggtggctact aagtcaggga tttcaaaatc 5641 aggaagatgc agccaggaaa agaggaggca ggactctgca gaggaggcag gactctgcag 5701 agtcagagtg ataaccgagt ctgagtccaa gctttgccag tgttagcaag cgactccatc 5761 tctctgaacc tcggattacc catctgtaaa atagagctag cagcaagatg tacctttttg 
5821 ggtggtgcag ggctgaagga gttggcacag tgcctgaaag agggtgcggg caatgcgccc 5881 aactgctgtg gctgctgggt ttggtgccag gttcgattct gcaggcagaa acttctacat 5941 gaggctcctt ctcggaagga gctcaggaca caatttggag gctgggctgg caagggtgac 6001 ctgctggagc tattcaactt cacttaaaga caggcctgca gtccaagcct gcccaattcc 6061 tgagaccatt ctctctccac tgctgagccc cacggccact ctgcaaggga tttcccaccc 6121 acctgtttgg ggccctttgg agtttggttt taattgggtc acgggatgct gtgacaggct 6181 gcccctgcct ggtggggatc tggggtcact gatgacattg tgcccatgga gagagcccag 6241 cagaaaggga ttccctccaa ggcgacacac agggcaaagc tcacatcaga agccaggcag 6301 gccctctgca cctggtaatt agccggcccg ggtgctgtca ggctcacacg tgtgtgtgtg 6361 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg taaagcatgt accctatggt acagttgaga 6421 atatggaggc ctcagatggg gcttttgcag aaactgccat gcctactgct cacacttcca 6481 tagcacgtgc ccccaagcac cccatggtgt aggtgctgtt attatcacta tcttacagtt 6541 atggagcagt ggctcaaggt gtaactgatt tgcccaaaat cacactacaa ggacacagca 6601 gggctgagat ttgaacccag gcagtggctt cagagcctga gctgtttcct actgcagagg 6661 gaggaggcaa gacttctacc cgtagccaga tggggaggca tgggcacagg aacggctctt 6721 gggtgaagtg gagggaggaa gaggaggact gaaggcgaag gccacgtcag gagtgatggg 6781 ataccccaca aaggcctccc tgagaagcgc tagagacaaa gatgagtgcc tcctcatctg 6841 gaagatgaaa agatgtcttt gcctgcatgg gctgccgtca caaagtccca ggggctaggg 6901 ggcttcaaca acagaaattt ctttctttac aactctggaa gctggaagtc tgagattaag 6961 gcaccagcag gatttgttcc ttccaaggcc cctctccttg gctcacaggt ggctgccttc 7021 tccctgtctt cacctggtct tccctctgtg catgtctcta tcctgatctc ctctttttaa 7081 tttttgtgta aggacgtagt catattgggt tggggcccac tctagtgacc tcattctaac 7141 tcagtcccct ctttaaaagc cctatctcca gatatagtca cattctgggg tattgaaggt 7201 aaggacttca gtatatgcat tttgggggca caattcagcc agaacaggag gacggtgggg 7261 atgtccacat gaagaggttc aggcagaatt cctttaggag gggaagatgt ctctctgtgg 7321 gacaagggtg gcatggagca gcccctgggg gaaggagaag gggacagttt gcatactggt 7381 attctgccta ccccagggtg gacactcact cagcgtttgc tgaatgaaca gggcaaggcc 7441 agcagtgctg atggtcccag gcatgtagct ggtctgagtt catagaagga ccacagcgcc 7501 
ctgccatgtg ccaaaccagg acaccagagt gaaggccaga agctcacatg gaagcagctt 7561 agttccctgg taacctcgag atgctgatga gacagagcag agcagaggga accctctccc 7621 tccatatccc atcctccaaa atgtgtccct tgatgtggat gggtagacag gattcctgcc 7681 ctggcagcca gacccctgcc ttgggtctgc acctcctctc cctccttcct ctccccgtca 7741 tccctaaatc ttgtcctcga gccactgcca ccctgtgtaa accctcatgt ccagtcttgg 7801 gggtgccatc ccttctcttt aaagctgaat ggaccaaaca tacccattga gtgttgggtg 7861 gggacatctc tggaaagtca gcacctggac cagctccacc cctctctgag gacaccttct 7921 ttccctttca gaacaaagaa cagccaccat gcagctcttc ctcctcttgt gcctggtgct 7981 tctcagccct cagggggcct cccttcaccg ccaccacccc cgggagatga agaagagagt 8041 cgaggacctc catgtaggtg ccacggtggc ccccagcagc agaagggact ttacctttga 8101 cctctacagg gccttggctt ccgctgcccc cagccagaac atcttcttct cccctgtgag 8161 catctccatg agcctggcca tgctctccct gggggctggg tccagcacaa agatgcagat 8221 cctggagggc ctgggcctca acctccagaa aagctcagag aaggagctgc acagaggctt 8281 tcagcagctc cttcaggaac tcaaccagcc cagagatggc ttccagctga gcctcggcaa 8341 tgcccttttc accgacctgg tggtagacct gcaggacacc ttcgtaagtg ccatgaagac 8401 gctgtacctg gcagacactt tccccaccaa ctttagggac tctgcagggg ccatgaagca 8461 gatcaatgat tatgtggcaa agcaaacgaa gggcaagatt gtggacttgc ttaagaacct 8521 cgatagcaat gcggtcgtga tcatggtgaa ttacatcttc tttaaaggta aggcccttgg 8581 gcccaaacct gcactttctt tggcttttct gctgctttta tctaaagaat acccaattcc 8641 ctcacataca taaaagacgg ggagtacgtt aagttctttt gggtgcctgt tgagaaaaat 8701 taagtaaaca agcagccaga gaaggtaaga tgaatgcctt cttgctgtgg atgggattag 8761 tgaggctgag atgctgtttc ctccacggag gaagagctgg ttgctgtctt cgggcccctg 8821 gggacatctg aagccccagc tttctacagg ctctgaagta tgaacccatt gtggccacca 8881 tggcaaagac accaacacct tagccactca gggcaggaca cagaccccag aagggcttaa 8941 agggcatttc ccagtccccc gtatccctca gatcttggcc cctctgccct catagaggcc 9001 aagactccct cagacaaatg cttgttcctc tgaaatgcct cctcctgact cctcagcaag 9061 agctgacctc tgcttatctc cccgacactc cttgtaagca ttcctgctcg cctctgcagc 9121 tcctgccagt tgctgaccct ggggaaagca agagtggata gagaggagaa gagaggagag 9181 gagagggtgg 
gaagggttgc gaaggaaggt aaattgttaa cacctcccct tcctatggtc 9241 acagatcatg agtatctttg gccatttggg tggctataac aaaataccat aaactgggtg 9301 gcttagcaac aacaaacata tatttctcat agttctggag gctgagaagt ccaggatcaa 9361 ggcactggca gatgcagtgc ccattccttg gttcatagag agtgccttct tagtatatcc 9421 ttgctggaag gaggaaggca gctctctgtg gtctcttttg taaggacacc gatcctgttc 9481 atgacagctc cacccccatg acctaatcaa ctcccaaagg ccccctgtcc taataccacc 9541 accttggggg ttaggtttca acatatgaac aatgtgggga cacaaacatt gagaccacag 9601 cagtgagtgt cgaacttgga ctctgagatt tcctatcccc tggtgcaggg cagtccccat 9661 tacaccagat tgctgagggc agctgggaaa taagctaagg acggtattga ctggggtctt 9721 ccttcgataa cgattaagaa gttggaaaca ggccaggcat ggtggctcac gcctataatc 9781 ccaacatttt aggaggccga gatgggcaga tcacctgagg tcaggagttc gagatcagcc 9841 tggccaacat agtgaaaccc cgtctctact aaaaaataca gaaattagcc aggcatggtg 9901 gtgggcgcct gtaattccag ctacttggga ggctgaggca ggagaatcac ttgaacctgg 9961 gaggtggggg ctgtagtgag ccaaaattgc gccactgcac tccagcatgg gtcacagagc 10021 gagactccat ctcaaaaaga agaaaaaaag aaaaaaaaga aaaaaagaaa taaaataaaa 10081 taaaaagaag ttggaaacaa tcacttgtag cgttttgttc agaagttccc ataggaaggt 10141 cagagaaggg tcattgaaga cttcccaatg ggaaaaacca ttcatttcca ggatccatac 10201 taacttcttt ctaaaattta aatcaaaata ttggaatgaa agtgcaaaca gagaagttca 10261 cccagatatc aggtagcatt cacagccagc cacatttttc accctcttca cttggagatt 10321 tggtcttgag taaaacgtta gagaatcaga gaacatcagg gatccagggc ctctgaagat 10381 gtgaaaacca acctccttgt tttgcaaatg tggaaggaaa agtcccacga aaagtccaag 10441 aatgtgccca atgttataaa gagacttgcc ttcatattca agaggttcaa cagtcactgc 10501 tctggggctg ccataaagat ggtctccgct ggctatcttt actgtcttca ctccttttat 10561 ttgcagctga gaatttctaa ttctgacaca aaattctttt tcatttttcc cttttttcat 10621 ctttagctaa gtgggagaca agcttcaacc acaaaggcac ccaagagcaa gacttctacg 10681 tgacctcgga gactgtggtg cgggtaccca tgatgagccg cgaggatcag tatcactacc 10741 tcctggaccg gaacctctcc tgcagggtgg tgggggtccc ctaccaaggc aatgccacgg 10801 ctttgttcat tctccccagt gagggaaaga tgcagcaggt ggagaatgga ctgagtgaga 10861 
aaacgctgag gaagtggctt aagatgttca aaaagaggta ctttcagact accccagggc 10921 cagcctaaac ccacacagcc ccagggagac acacacgccc taccagggcc acacagcact 10981 ggtgggaagg actcacccag ccaaggagct gcctccaggc ccagaggcat cctgtgacat 11041 ccaagtcctg ggggcctagc ccagttggag ggacaagagc tggaaactgg gttccttagg 11101 gtggtgccag agtgggcaga gacctctggg cagcccacgt ccaagtccag agcaagggga 11161 ggctcatcct agaaaagagg ccagaggagc cataaccacc attgttcctt gggttaagga 11221 gtcctttttt aaaaccatca aaactaagaa tccagtgcat tatgaatcca aggggtgagg 11281 ctcagtgtgc caatgcccca gaacagtcta agaaagctcc ttttcccttt ccaggcagct 11341 cgagctttac cttcccaaat tctccattga gggctcctat cagctggaga aagtcctccc 11401 cagtctgggg atcagtaacg tcttcacctc ccatgctgat ctgtccggca tcagcaacca 11461 ctcaaatatc caggtgtctg aggtgggttc agaagctcct atgcatctgc ttcccaagat 11521 ctattctgtt ctattctttc tattctactc taccccattt cattccattc cattccactc 11581 aactccactc cactccactc cactccagtt cactctattc aattccactc cactccagtt 11641 cactctattc aattccactc cactccactc cagttcactc tattcagttc cactccactc 11701 cactccactc cactccagtt cactctattc cattccactc cattccactc ctccactcct 11761 ctcatccact ccactctact cctccactcc acatctccac tccactcctc cactccactc 11821 ctccactcca ctcatccact ccactcctcc actccactcc tccactccac tcctccactc 11881 cactccactc atccactcca ctcttccatt ccactccatt ccactcctcc actccactct 11941 tccactccac tccattccac tcctccactc cactccactc tattctattc tattccattc 12001 cattctactc tattctattc cattccattg cagtcaactc cactccactc tctactattc 12061 tattccactc ctctcccctc cactccattc cattgcagtc cactccactg cactccactc 12121 ctttattctg ttctgttcta ttctattcta ttctattcta ttctctccct ctccctctct 12181 tttcccacaa gtagtgaaag tttcactttg tgtcttatcc ttcatgtaat gggaagccat 12241 atccaccact gttccttgag ttaaggagtc ctgttttaaa caatcaaaac taagaaggca 12301 cttcctagct atgtgatctc caaaaaatac ttgactctct gagcttcctt tctctcttct 12361 ataaaattga agaattacac cttgctcaaa gatgccatga gaattcaatg acagacacat 12421 gcgaagtcac cccccagcac agtgcctggg gcagagtagc tgctccattg ttccatttcc 12481 tacttgctcc atggctcagt tgaacagata cttagaggtt gatgcccata 
ggcagaagct 12541 ttgccatttg ctatgatgac ttcacctgcc cctggtggcc tggtgatgcc tggtgtctcc 12601 cctgcagatg gtgcacaaag ctgtggtgga ggtggacgag tcgggaacca gagcagcggc 12661 agccacgggg acaatcttca ctttcaggtc ggcccgcctg aactctcaga ggctagtgtt 12721 caacaggccc tttctgatgt tcattgtgga taacaacatc ctcttccttg gcaaagtgaa 12781 ccgcccctga ggtggggctt ctcctgaaat ctacaggcct cagggtggga gatgaagggg 12841 gctatgctat ggcccatctg tatgctggta gctagtgatt tacacaggtt tagttgacta 12901 atgaggcatt acaaataata ttactctatg atgattgctt ccacccacac gactgcaaca 12961 tacaggtgcc ttggggaaat gtggagaaca ttcaatcttg ccgtcactat tcatcaatga 13021 agattagcac tgagatccag agaggctgga tgacttgctc aagttcacca gcatggtagt 13081 ggcaaagaga ggtccagagt cctggccctt gatgcccagc tcagtgccac aaagctcagt 13141 aggagggatg ttccagtgga tgagggccac caggaagcac aggtccaagg ctggtcccac 13201 acttatcagc agcaacaact gtcagttcat cctgcatggg aaaaatgttg gaatgggagt 13261 ctgaaatggg gctactgttt cagtcctaac gtgctgtgtg acattgggac aacactttcc 13321 ctctctggac ctcagtttcc ctctgtatac aaggatcaga ttcttgctgt gacccaagaa 13381 ctcctgaaat catatagaaa ggctggggtg ggccctgtca ttcgtggttg atttcaatac 13441 actcaagtgc cattcatcct ttaagaaaaa catctggata tcaaggtgga aatggcccat 13501 ttaatgattg attatatcat tttgtggata tagttataat ctgatgggcc tggctgggag 13561 tggaagaagg gaagcctttt gcaaatagta gagtgtcagt tgcaggtgcc aatgactaac 13621 tttttgaatt ctatgttggc attaacaata aagcattttg caaacactgg ttataactgt 13681 ctttatggag gcagctctgg gaatggtgac attgatagct taccatgctc caggccgggt 13741 gcctggccct tcacctggat ggtcgcattt gcccctcata agactcccat gaagaaaggc 13801 accactatta tcccatctgt tattcacaga tgggaaaggc aaggcttgaa gtggttaggt 13861 ggcttaccca gtcacatatc ttctaagtgg tgcagccaga atttggcggg gggagtgcga 13921 ccaagaaccc tacactcagt cctgtgctct gtgctgtgga ggagagatga ccaggagcag 13981 aaacttcatt caggggcatc tcaggcacca gctcccccat gagccagcta agttccctcc 14041 ctcccttcac caagcaccat gtgtttcctc atgtgccaaa tgaagaggat tagatactca 14101 agaatggaat gagtgggtga gtgagtcctt cgctgcaccc aagtctgatt ttctgtgcgc 14161 ctgctcaccc caccctgcat gttctaagca 
tgcttccata aggctgtgcc ccaccctctg 14221 attctagagt ctggactgta tcagaggtga gtgcctacta gaggtaacaa ggtcaggacc 14281 ccaaaccttg tccatccccc aaagtactga gcccccacca tgcaccagcc catgccagat 14341 gctttgcact tgtgatatca cccatccctt gacaacccag caagttctat tattgttccc 14401 attttacagg caataacata agtgctttcc cagggtccca cgctggtgac agtgagggcc 14461 cagggtctga gagcccagat cgcacatgtg cgggctggtg gcaggggaga tggcagcaac 14521 cagactcaga catttctctg cagttgtgct gtgggctcag ggtggctctt tacgaagggg 14581 ccccttcgtg gggtcatgca ctcctgtgtg ctttcccttg catcatgcct tgcctgtctt 14641 ggcaaatatt tctctggagt ttacccagcc agtccaaggt cacagggaag ccctgtctgt 14701 gtctcacaca gaaggtcaac gtccagcact gtccaaactt tactcagcaa acagtcacaa 14761 agcagctcct gtgtgggggt cggggtggct cactgtggtc tctgctgcat gtcacacatt 14821 gaagcactgt gctggggtca tcgcaggctg tttaactcaa ttgtcacatg agcctgggtg 14881 cacaaaatgg tagagcagct cagagagaga tggacagaca gcatgaacct ctgaggagtc 14941 aggttttctt ggatgaaggg acactaagat ggctttggag cgtgagaagg acctcaccta 15001 gcaaatgtgg gaaaggagtg agacctccag gcagagggac tggctggaga cgagcgtgat 15061 gtggtgagcc atggagtgta tgggtcccca cagaacttca gtctgggcct gcacagggca 15121 tgtggaggag acaaggagga gggaggtcgg tgccggcggt tcagtgacag agatcctaaa 15181 tgggaggcca gtgttttgtc tgatctcttt catcccaatt tcagggtagt ttggtcatcc 15241 acgccacatt ccaagtgtcc cctgggccct ttctctccct cacccccctg tctgcacatg 15301 agtagatgcc tccacgcagc cctcccagga cgctcacctc tatccacaga tgcttctcca 15361 aaacccacca ggccctccca tggaacgagc tcacctacag ggtaaaatca ggtcacggtc 15421 acatataggc ctgactactc ccctcaggac cctcattcac agccactgta ttaatttgct 15481 ggggctgcca aaacaaagtg tcctcatctg ggaggctgca gtagatttgc tgaaattgat 15541 ttgctagcgt tgctgaaatt gattcaagct t // LOCUS HUMPCNA 6340 bp DNA PRI 07-JAN-1995 DEFINITION Human proliferating cell nuclear antigen (PCNA) gene, complete cds. ACCESSION J04718 NID g189681 KEYWORDS proliferating cell nuclear antigen. SOURCE Human leukocyte DNA, (library of C.Croce), and cDNA to mRNA, clone EMBL3-S2. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6340) AUTHORS Travali,S., Ku,D.H., Rizzo,M.G., Ottavio,L., Baserga,R. and Calabretta,B. TITLE Structure of the human gene for the proliferating cell nuclear antigen JOURNAL J. Biol. Chem. 264 (13), 7466-7472 (1989) MEDLINE 89214190 COMMENT Draft entry and printed copy of sequence [1] kindly provided by D.-H.Ku, 15-MAR-1989. FEATURES Location/Qualifiers source 1..6340 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20pter-p12" misc_signal 748..753 /note="GC box" misc_signal 1090..1095 /note="GC box" misc_signal 1110..1115 /note="GC box" CAAT_signal 1124..1129 misc_signal 1145..1150 /note="GC box" prim_transcript 1268..6191 /note="PCNA mRNA and introns" exon <1427..1647 /gene="PCNA" /note="proliferating cell nuclear antigen (PCNA)" /number=1 gene join(1427..1647,2356..2453,2550..2617,3556..3750, 5636..5759,5846..5924) /gene="PCNA" CDS join(1427..1647,2356..2453,2550..2617,3556..3750, 5636..5759,5846..5925) /partial /gene="PCNA" /note="proliferating cell nuclear antigen (PCNA)" /codon_start=1 /db_xref="GDB:G00-120-261" /db_xref="PID:g387005" /translation="MFEARLVQGSILKKVLEALKDLINEACWDISSSGVNLQSMDSSH VSLVQLTLRSEGFDTYRCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALV FEAPNQEKVSDYEMKLMDLDVEQLGIPEQEYSCVVKMPSGEFARICRDLSHIGDAVVI SCAKDGVKFSASGELGNGNIKLSQTSNVDKEEEAVTIEMNEPVQLTFALRYLNFFTKA TPLSSTVTLSMSADVPLVVEYKIADMGHLKYYLAPKIEDEEGS" intron 1648..2355 /note="PCNA intron A" exon 2356..2453 /gene="PCNA" /note="proliferating cell nuclear antigen" intron 2454..2549 /note="PCNA intron B" exon 2550..2617 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=3 intron 2618..3555 /note="PCNA intron C" exon 3556..3750 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=4 intron 3751..5635 /note="PCNA intron D" exon 5636..5759 /gene="PCNA" /note="proliferating cell nuclear antigen" /number=5 intron 
5760..5845 /note="PCNA intron E" exon 5846..>5925 /note="proliferating cell nuclear antigen" /number=6 BASE COUNT 1756 a 1280 c 1457 g 1847 t ORIGIN 1 bp upstream of EcoRI site; chromosome 20pter-20q13. 1 gaattctgct gaccaaggta ttaaaagtaa ctaaagagaa gtggtgtgaa gaaagcaaga 61 gagaaacaac aaatcctgtc catcctgtaa caattgaaaa tttctggctg ggcgtggtgg 121 ctcaggcctg taatcccagc actttgagag gccgaggcag gtggatcacc tgaggtcagg 181 tgttcaagac cagcctggcc aacatggtga aaccccgtct ctactaaaaa aaaataataa 241 taataataca aaaattagcc gggtgtggtg gtaggcacct gtaatcccag atactcggga 301 ggctgaggca ggagactcac ttgaacctgg gaggcggagg ttgcaatgag ctgagatcgc 361 gcgactgtac tccagcctgg atgacagagc aggactccat ctcaaaaagg aaggcgggga 421 aaaggggaaa tattaaatgt gtacgctctt tgactcagct gtattacttc aaggagttga 481 tatcaccaaa attgcctaag tgctcaaagg tgtttgtagt taaacaacag gagattgata 541 aattatgtta tatacatgtg atgctatgtt ttaaagaggt actgatatga taaaaagatg 601 tacgtggcat aaaattaaat gtacttatta agtacttttc caagtgttta cggaatgagt 661 gcatttttga aaaaaaaaaa gtgtattcga acttttaaaa aagctttaaa agctttatac 721 aataacgatt gagtgattat aagagctggc gggggaatgt taagaggatg atagggagct 781 aagtttaaca gaacaattca cctctttatc ttgtgacacc tacgagcgca tcaattctgt 841 aattgaaaaa taaagtgcat atttgcagca gctgtactct cttcaggctg caaggaggct 901 tttcctcccg gtaggcttga tttgcatttc actttcactt tcgtggctgg aaactttcta 961 cccacgtagt gaggctagag gagccaccta aagctggggc ttgacgaagc cgggaccggg 1021 acccgatctc cacatatgcc cggacttctt ctgcggccgg gttcaggagt caaagaggcg 1081 gggagacctg cgcgacgctg ccccgccctg cgcccgcttc ctccaatgta tgctctaggg 1141 ggcgggcctc gcggggagca tggacacgat tggccctaaa gtcttccccg caaggccgtg 1201 ggctggacag cgtggtgacg tcgcaacgcg gcgcagggtg agagcgcgcg cttgcggacg 1261 cggcggcatt aaacggttgc aggcgtagag agtggtcgtt gtctttctag gtctcagccg 1321 gtcgtcgcga cgttcgcccg ctcgctctga ggctcctgaa gccgaaacta gctagacttt 1381 cctccttccc gcctgcctgt agcggcgttg ttgccactcc gccaccatgt tcgaggcgcg 1441 cctggtccag ggctccatcc tcaagaaggt gttggaggca ctcaaggacc tcatcaacga 1501 ggcctgctgg gatattagct ccagcggtgt 
aaacctgcag agcatggact cgtcccacgt 1561 ctctttggtg cagctcaccc tgcggtctga gggcttcgac acctaccgct gcgaccgcaa 1621 cctggccatg ggcgtgaacc tcaccaggtg agcctcgcgc cccgggaagc cgccccggcc 1681 cgcctgcacc tccggctgtg gcgagcgctt cgagcctagc cctcattggc tggcgtgggc 1741 atccagagct tctcattggc ctgcacgcag tggtggggcc caagctgaga tgagcggtta 1801 cggaaaagcc cgcgctggct gctgcgcgaa cctgcttttt cgcgccaaag tcacaaagcg 1861 ggtggtggcg ggaaaatcaa gggtttttcc gcagtgccag gaacactgtt ccagggactc 1921 tttgctcact aaacctgttg gccttgaatg gacgctttag ctgtggcttt cttgtttctg 1981 agacggtctc ggtctcggtg tgttgcccgg gctggtctcc aacttctggg ctcaagcgat 2041 cctcccggct cagtcgcgtc gactttaaat gctttataat gcccttgcga gaaatgtggc 2101 agcctgtcat cctacttagt ggtaggagat tgtttctatc cagaagggac actgctggtg 2161 gtattttagt ataaatactg ccagatgcgt ccaaaacgtc tgcattaata atggcatcct 2221 ccagcagtcc gtttaccctc caccagttct gagacggcct gacgggtgag agtggtaacc 2281 ccttctaacc gcgttcgaaa tacagccctt cagcagacgg cgttgatttt aaagcatgtg 2341 tctcctgtct tctagtatgt ccaaaatact aaaatgcgcc ggcaatgaag atatcattac 2401 actaagggcc gaagataacg cggatacctt ggcgctagta tttgaagcac caagtaagtc 2461 gtaccttttt accgagtcac gaagctacag gaaaatcaaa actctgtgtg agtagaaact 2521 caaaagctat ctgcgtttct tttggtaaga ccaggagaaa gtttcagact atgaaatgaa 2581 gttgatggat ttagatgttg aacaacttgg aattccagtg agtatcagtt tctcattgta 2641 gagagtgctg tacacaggca cgatagttat gtcatagaat gtttgtttat ttttacagac 2701 agggtcttgg ctctgttgcc caggctggag tgcagtagtg ccatatagct ctctctaacc 2761 tgggattcct gggctcaagc agtcctcttg ccttagtctc ctaagtggct aggaaggact 2821 acgggcctgt cccaccacac ctggctaatt tttttcattt ttgtgtgtgg gacgtggggg 2881 cagtctagcc aggctggctg gaactcctgg cctcaagtga tcctcctccg tcaagatatg 2941 ttaatataat ttaaagccta cttcataaca acttttctag aaatatatct actggtgcat 3001 gtttcaaaga gatgatttta gtatttggat agttgttcac cacaagtcta ataatctcca 3061 caggttaaat ttattgttta tgccagttgt ctatttgcat taacttccat gaactcttta 3121 aattgttctc tagaatgctt gctttttatt aatgaggttt taaagctagc ttgagagaaa 3181 tttatccagg ttaggttata aacaccaaag gagagaagaa 
atgtttgaat gttgaaaatg 3241 cctaatatat tctcttgctt tcttttagaa agtgattagg cctgcttgcg ccatcatgat 3301 ttctgtgcca tactctaatg ttctcttact ttatccctgg aggatgagga ggaggaggct 3361 cttgttccct ggatggtgca tttaatagcc atttattttt ttgagtggag tttgttaaga 3421 aattacgcaa gtcatatttt aaagtaatca gaaaatatga ttctgagttg tttaggtgtt 3481 gccttttaag aaagtgaggg tgccaaatca ttaaatttct aacaattaac ttttggaaaa 3541 ttttgttctt aataggaaca ggagtacagc tgtgtagtaa agatgccttc tggtgaattt 3601 gcacgtatat gccgagatct cagccatatt ggagatgctg ttgtaatttc ctgtgcaaaa 3661 gacggagtga aattttctgc aagtggagaa cttggaaatg gaaacattaa attgtcacag 3721 acaagtaatg tcgataaaga ggaggaagct gtaagtagtt tttaagtaaa aagaaaatag 3781 tttgaagaga attataatac tgcttattag gttaattgct aaaattaaaa gtagacagaa 3841 ttggatccca agtaatttct gaaaattgag atactgttga aatctgtgaa tgatttataa 3901 gtgtcatcca atttagaatt atatttgcaa gaagggaata caaattcagc acgtgtacat 3961 accacagcaa cagtggttta tggatcaagt ccacaccggc tcttaagggt aggattggga 4021 agttaggcgt ataacttagc ttctggagat acttactctc ttaccaaata attgagcata 4081 ggacagcagc tcaatagaag gatatgttag gagtaaaagt ctacctgttt ggagcactta 4141 tgtaatccta atatagctta catgttgtgg gtccaattgg tagcccattt taaaggtgga 4201 gaagcaggct gagcaacctt aagtgacaat ttagccaaag tcacaggctg taggaatcaa 4261 aggttaaaca ggaaggagac tctcactaag gctagaaagc agactccatg caactttgag 4321 agtacctaga gagaccctta tttaaccaaa atagaaagaa catagcaaaa ccccatctct 4381 cataaaaata taaaaattga ccgggtgatg agtggcgaca cacctgtaaa cccgactact 4441 ggaatgcatg agatgggaga atgacttgaa ccgaggaggc ggaggttgca gtgagcccag 4501 atcatgccac tcccctccag cctgggtgac agagcaagat tccatcttaa acaaacaaaa 4561 aaaactcgct aacctgggca taaattaaaa ctttgtaaat caaggacaaa ggtcctaaac 4621 ctcataactt gcattaggat taaatacggt agcattaaag agcttagcat atctgtgtgt 4681 ggcatattat aagcttacaa taaatactat atattgctct cttgtccctt gaatgggtag 4741 tcaacattta gtttaaataa aggtaaaatt cagttgaaag gttttttttt aaattaataa 4801 agtctaggag ctgattcttt atctgtttcc tgaatcacat ttccactcct gccaacctcg 4861 tttttttctt ttgctgtttt tctttgtttt tgagacaggg cttgctctgt 
gccacccagg 4921 ctggagtgcg gtggtgcagt cggttcacta cagcctcaaa ctccagggct taagtgatcc 4981 tcctgcctca gtttcccaag agccgggaca caggtgtgtg ccaacacact agcctggttt 5041 ccctaatttc attttcccct tgaccattac aactatttgt tgaagaaatt agatcattta 5101 ttagtttcag agtttggatt ttacctgatt gcattcctgt gtatctaata acctctacct 5161 gtgtgtccta cagactggta gctatagcct ggagccttga tatcagggtg ttttgtttcg 5221 ggggtgagag agcaagaata tggtggtggt gtgtgtgcct ctagtaggag gcacagggtg 5281 tctggatgtg tttgcaatgt tagcagctat aagtcattgt ctagatccat taagtcatta 5341 attagagttt gcagagctga aattaatacg ttttatcact tattggctgc ttattagaaa 5401 acttccataa gaaaagcttc ccattatata atttggttat ctaaattata gctataccaa 5461 aagacaaagg ctagataatc gagtcttttt gcatttatgt atcagtcttc aaaattttca 5521 tagcgtccct ccaaagtgac caatacaagt gtttgtgggt ttttataaat atataatgag 5581 ctaatagatt gcaactttct tgatgttttt caatgatgaa tcttttgttt tgtaggttac 5641 catagagatg aatgaaccag ttcaactaac ttttgcactg aggtacctga acttctttac 5701 aaaagccact ccactctctt caacggtgac actcagtatg tctgcagatg taccccttgg 5761 taagataata aatttgaacc ttgttttgat ggtagtcata tgtgatacat actcctcagt 5821 aattaaccat cttcctgtct ttcagttgta gagtataaaa ttgcggatat gggacactta 5881 aaatactact tggctcccaa gatcgaggat gaagaaggat cttaggcatt cttaaaattc 5941 aagaaaataa aactaagctc tttgagaact gcttctaaga tgccagcata tactgaagtc 6001 ttttctgtca ccaaatttgt acctctaagt acatatgtag atattgtttt ctgtaaataa 6061 cctatttttt ttctctattc tctccaattt gtttaaagaa taaagtccaa agtctgatct 6121 ggtctagtta acctagaagt atttttgtct cttagaaata cttgtgattt ttataataca 6181 aaagggtctt gactctaaat gcagttttaa gaattgtttt tgaatttaaa taaagttact 6241 tgaatttcaa agatcacagg gcagtgtctt catttgacca ggactgttga aagtatccta 6301 ctgaattccc agctacagtc accctttgtt caaactgttc // LOCUS HUMPDHAL 17072 bp DNA PRI 20-JAN-1992 DEFINITION Human pyruvate dehydrogenase (EC 1.2.4.1) alpha subunit gene, exons 1-11. ACCESSION D90084 M58568 NID g219981 KEYWORDS pyruvate dehydrogenase; pyruvate dehydrogenase alpha subunit. SOURCE Human leukocyte DNA, clones pGPDHA[13 and 37]. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 17072) AUTHORS Koike,K., Urata,Y., Matsuo,S. and Koike,M. TITLE Characterization and nucleotide sequence of the gene encoding the human pyruvate dehydrogenase alpha-subunit JOURNAL Gene 93 (2), 307-311 (1990) MEDLINE 91033044 COMMENT These data kindly submitted in computer readable form by: Kichiko Koike Department of Pathological Biochemistry Atomic Disease Institute Nagasaki University School of Medicine 12-4, Sakamoto-machi Nagasaki-shi 852 Japan Phone: 0958-47-2111 Fax: 0958-47-8514 First 10 bp reported in [1] is linker sequence. FEATURES Location/Qualifiers source 1..17072 /organism="Homo sapiens" /db_xref="taxon:9606" protein_bind 286..291 /bound_moiety="SpI" CAAT_signal 342..346 protein_bind 370..375 /bound_moiety="SpI" TATA_signal 429..434 prim_transcript 439..16253 /note="alpha subunit mRNA and introns" exon 439..619 /note="alpha subunit exon 1" CDS join(563..619,5841..5900,6372..6545,7709..7835,9500..9591, 10908..11000,11767..11922,12107..12178,14064..14131, 15318..15426,15888..16052) /note="alpha subunit" /codon_start=1 /db_xref="PID:d1014824" /db_xref="PID:g219982" /translation="MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLH RLEEGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCV GLEAGINPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNF YGGNGIVGAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPC IFICENNRYGMGTSVERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSG KGPILMELQTYRYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEE LKEIDVEVRKEIEDAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS" intron 620..5840 /note="alpha subunit intron 1" repeat_unit 1818..2129 /note="Alu repeat 1" repeat_unit 5192..5479 /note="Alu repeat 2" exon 5841..5900 /note="alpha subunit exon 2" intron 5901..6371 /note="alpha subunit intron 2" exon 6372..6545 /note="alpha subunit exon 3" intron 6546..7708 /note="alpha subunit 
intron 3" repeat_unit 6677..6999 /note="Alu repeat 3" exon 7709..7835 /note="alpha subunit exon 4" intron 7836..9499 /note="alpha subunit intron 4" repeat_unit 8223..8517 /note="Alu repeat 4" repeat_unit 8642..8956 /note="Alu repeat 5" exon 9500..9591 /note="alpha subunit exon 5" intron 9592..10907 /note="alpha subunit intron 5" repeat_unit 10387..10648 /note="Alu repeat 6" exon 10908..11000 /note="alpha subunit exon 6" intron 11001..11766 /note="alpha subunit intron 6" exon 11767..11922 /note="alpha subunit exon 7" intron 11923..12106 /note="alpha subunit intron 7" exon 12107..12178 /note="alpha subunit exon 8" intron 12179..14063 /note="alpha subunit intron 8" repeat_unit 13559..13809 /note="Alu repeat 7" exon 14064..14131 /note="alpha subunit exon 9" intron 14132..15317 /note="alpha subunit intron 9" exon 15318..15426 /note="alpha subunit exon 10" intron 15427..15887 /note="alpha subunit intron 10" exon 15888..16253 /note="alpha subunit exon 11" polyA_signal 16221..16226 BASE COUNT 4377 a 3680 c 3957 g 5058 t ORIGIN 1 gatcttttaa agacaaagtt agtggcagac catttacaga aaccagatgt tctgtctttt 61 ggctctgagc atgctgctaa tcttcatcat ctagtgtact gaacgagatg tactgaacga 121 gggctgcaga gctgcagcac cggcagagta ggcgctcggt aggacggggc ctgcacaacc 181 tccccggtag tcagcagagc aatctaggaa ggctcctttc ccgcggcgcc ctggaggcgg 241 ggccccacct tcccacgcag gcgctatcaa gccccgcctc ctcacccgcc cgcgctggcg 301 tcggaaagag ccctcagccc ctccctctct ggcgctgata cccaatgggc agcctcaggc 361 ctttagcggg ggcggggcac cccctggacg ccgttctggt tgcccgcggc ccggcgcagc 421 gcatgacgtt attacgactc tgtcacgccg cggtgcgact gaggcgtggc gtctgctggg 481 gcacctgaag gagacttggg ggcacccgcg tcgtgcctcc tgggttgtga ggagtcgccg 541 ctgccgccac tgcctgtgct tcatgaggaa gatgctcgcc gccgtctccc gcgtgctgtc 601 tggcgcttct cagaagccgg tgagacctcc cgggctggcc ggatggggcg cgagtggggc 661 tgaggcgggg cgaggcaggg cgggccaggc cgggcaccca gagcggggtg gaaggcgcca 721 ggggagcggg agcctttact tcgcctccgc gccctgcatt ccgttcctgg cctcgggaga 781 agcggcacga ccgatcacgc caaggtccgt gtgaacttcc 
cccttctcga cacccacctc 841 ccgcccccgg gcccagctgt gcgccaggcg aagtcggtgt gctcaagagg tgcctgttgg 901 gttacaggac acggaaaggg tggcctcggc tccttcgagt ctccaattga ccccactcat 961 ttcggatctt ctaacttaat ttctcttgac cgagagcttt gtaatagcgt agaatctgga 1021 gacagggtgg cttcgttcaa acagcaccct caccattgac tagccctgtg accttgagca 1081 agtttttaaa cgtcccgggg acccgtttcc taaaatgttt gctcgaagtg gagttaatct 1141 ctaaatggag ataagagtta tctctgaaat gttatcggtt attaaaatgt tatcagttaa 1201 ctctaaaatg gagataataa gagtccccac ctcttggggt tgtcttgagg attcaacgag 1261 tgacacgtgt ggaaacgatt ccaaatagca cctggcacat aatcgataac atgtgtgttg 1321 aatagtgtta tttattgagt ctccagttcg gtatacattt cttgaaccac ctgtgctcag 1381 ttctgaggcg ggttcacaga aggtcagcct cttcagaaac aaacttcctc ctcttccctc 1441 tccctcaaca tctgagcttt tcttggcagt gagttcagga gcgccgaagc agaactcaga 1501 ggacgctgcc ctccctccct tcctacacat cttagggtac agtagctaaa gcaaagagca 1561 acgatgcttg agggtggggg gtagagttta gcactatttc atggcctcta gcatttagag 1621 gtcctaacac ctgagctagc aattctgacc cccctaggca cagtgaggtc gtgttaattg 1681 gtgtaactgc aggcctcggg attctggtat ttcccccagg acttgatacc gctctactta 1741 gtacaggcaa gagattgtca aaaggtaaag aggtatgccc ctctaggaat cctgttgcct 1801 aaaataatga caaaactgcc gggtgcggtg ctcaggcctg taatcccagc attttgggag 1861 gctgaggcag gtggatcacc tgaaggtcag aagttcgaga tcagcctggc caacatggtg 1921 aaaccccgtc tctactaaaa atacaaaatt agccggtcgt ggtggcgggc tcctgtaatc 1981 ccagctactc gggaggctga ggcgggagaa tagcctgaac ccgggagcgg agtttgcagt 2041 gagcggagat cgtgccattg ccatacggcc tgggcgaaca agagcaagac tccgtatttt 2101 aaaaaaaaaa aaaaaaaaaa aaaaaaaaag cgttcccttt aggatatctg tgggtagagg 2161 gctgtaccgg tagttacggg ctcagaaaca tccttccttt aggcacctga tgtaggtttt 2221 cttcttcttc tgcaagtcag gttcattgtt tcctgtatca gtttgcaggg tccccccccc 2281 cccgccacct tacagtagga agaaaattga gttccagata tgaagtcacc tttgaaagtg 2341 cccaggtatc tttccacttg gtgtgtaaac tcttcagata attagaagtt ttctgtgtca 2401 ctcaacttgt catggactaa ttaggaacca ttcctgaagc ttttaaggat agaactaaaa 2461 gtttcacttt tattttttta aagggtggaa taataaacta acgtgttgac 
tctttgtatt 2521 ttgtaattct tcatacttat ggatgtcttt ttacttaact ataagtaaca aaatagatca 2581 acgttttagt tttttatatt atacatgtaa aaagacattt tgcatataag cctttcacaa 2641 aaatcttgac agtaaacaat aagcagtggc tcacccaaat taggcagact tactgcacta 2701 gactcctacc atctgtgtga tactccatga agggagggag aaggggaggg agaagggtag 2761 gcagctggtc tgatggctgt gacacaagat aatcccctta acctcccaag acgctgtgtg 2821 ttttttcctt ttttattctc cctggtttac tttcgttttg tttgagacag ggtctctgtg 2881 tcacccaggc tggagtgcag tagcaggaca gctcactgca gccttagcct gctgggctca 2941 agcgatcctc ctgccttagc ctcctgagta gctgggaaca caggcatgtg ccaccaccac 3001 acccagccaa ttaaaaaaat ttttttttta ctagagacat ggtcttgcta cgttgcccag 3061 tctggtctcc atctccaggc tcaagcagtc ctcccacctc ggcctccaaa gctgggatta 3121 ctctcactct cttaaaacca ggcaggtagg gagatttatc tcaggcttaa agattgccat 3181 tgtctcatca aagagtgttt ggtgtgaaac tttgaaatga atatcaagat tgtgttttta 3241 tttttgaata aggtttatag ttttcatagt tcttatttca tggaagaaga ttgaatgcat 3301 ttaaaatgtt attttattgt ttgcatttct gtatggctcc ttttgtgaga tctttactag 3361 caatgttttg gctttataag tggtaggtaa gagttttaat ttacactgtt agaatctgga 3421 atttttgaaa cgtttttcct ctttcacatg aatggttcct atgtatttag gaagttaaag 3481 ttttactttt ttttaattaa tttttttttt taggctggaa tgcagtggca cagtcatagc 3541 tcactgtagc ctcaggtgtg tgccaccata cctgactaat tttttaatat ttatttttgt 3601 agagatgaga gtctcatgtt gcccaggctg gctttgaact cctggcttca agtggtcctc 3661 ccaccctggc ctcccaaagt gctggggatt ataggtgtga gccatcatgc ccggcctagt 3721 ttttattttt taaaatttga gtgggttgtt cgtggtctct gtcagagagg aatcccattt 3781 aacagagaat ctttttatgg ctctccagag aaaatgaatg gtaaacttat cttttcacaa 3841 gctctcactc agaaatgata cacacacact tctgatagga cttttagctt ctttaacttt 3901 gttcctttca ctcatatcag tggttcttat ttttgagata cacagtaatg aagccatggg 3961 agaaagtatc taagtagctt tctggcagtc ctaatctttg caggcgcaag ttacaggcgc 4021 atgccacagc tactgggccc cttcttgctc ttatgtatag cattatcctg cctcattgtt 4081 tcaactctag gattgagaaa gaagttacct tttctctgtt actgtcgcct ggctggtttg 4141 gactcctgcc ttccaaaaac tgcagtttct gtagttgtat ttggaaattt atttcacaat 
4201 acaataaatt tctggcccca caaaatattt attaactgcc aagaataaca catctgtttg 4261 attgctaaat ataaccattg atttgctgtt tcaccttctc tcagctttac ttcttcccaa 4321 attcctaaat ttccttcact ttttctgaga tacattagtg gactgtctct gcctgtaagt 4381 taactgaaac actgattcct agtatttcag ttgttttcct ccagcactgt cattgtctgt 4441 gtttgttggc tttgtccaat aatggtctat tgaggggtga agatatacgt aattagcttt 4501 ctcctattgg cttgtacact ccagggatac ttggcagatc agtcttaact cttctcacca 4561 agatcagtcc agtgctggat taggtaaggt atgaacacat cagatgtgct ttttatggag 4621 aaatcatgtt ggtttacacg tcagtgtgtg agaatgtggc agaagggagc taaaatagta 4681 tgataatact actggataaa ttttgtggtc taacctaaac cttagccatt acatagaata 4741 cttttgctgt gagcaggttt gctcagttgt aaaactggaa aggaatcatt tctcaccccc 4801 cgcctccaag cttactcaaa catgacagcc accaaacatc aagagaacag tgtttcagag 4861 aacatttcta ctggggcttc aggaggagcc tgtccaagat ttaggctgtt caaattataa 4921 attataaaac aactggctca agtcccagtt agtgtttaag tctagagagt gctaagtaat 4981 cttttcattt atgtcattag tctccctaaa gtatttatca tcagtaacgt tacaatctaa 5041 tttaaaatat tctcattcgt atacagatac caatttgata gaagagtcaa gtttgcctag 5101 agtggagatt aaatcatagt tttatttgaa gtataatttt ggcttgctca aaatgaccag 5161 tatctggtta tgactaagaa tggcatgaaa aggccagacg cagtggctca tgcctgcaat 5221 cccagtactt tgggaggcca aggcaggtgg atcacctgag gtcaggagtt ggagaccagc 5281 ctggccaaca tggtgaaacc ccatctctac taaaaatata aaaattagcc gggcgtggtg 5341 gtgggcacct gtaatcccag ctactcggga gactgagaca ggagaaatca cttgaacccg 5401 ggaagcggag gttgcagtga gccgagatcg caccactgca ctccagcctg ggtgataaaa 5461 gcaaaactcc gtctcaaaac aaacaaacaa aagaatggca taaacagaca cagctcacag 5521 atgatctagt ctctttagcc actaatttca ttatattctc actataattt ctttgaaaac 5581 aaaggatggg tttgtttttt gcccctcttt cgctgcttgc cttcagatgc gggataatcc 5641 tgtttcattg gccaaagcat ggattcattt tggaggccaa ggaagatgca aacacagtgc 5701 acagggtgga agagaagcct atgaatatgt tggggcttat taaatttcca taacttcatt 5761 ctgataactg attattatac tttccaaaat agctgacaat taaaaagtac tgatttgttt 5821 gtatattttt gtcttttaag gcaagcagag tgctggtagc atcccgtaat tttgcaaatg 5881 
atgctacatt tgaaattaag gtaagagtgt tttactttgt taataatttt ttcacaggta 5941 cactctgata tacagtttta cctttagaat agaacatctt gatgttcatg attagtcatc 6001 attttcttct aaatgtccag gatcagaagt tcagagaagc ttattcaaaa gtttggaatg 6061 taattcagtg aaatatttga ataagaagag tcttagttgt ttctttgaag gttctttcaa 6121 cctataactc agttggcttc taggggcttt cagtgaaaat catcttagaa agatttcctt 6181 cccccaagcc ccatctcatt gcacagtgag gtttatggat ttaaggaaca gaggcgatat 6241 gaagcattac tgatgtgctc ctttgcagtt tttcaagttc aatattattt gcaatggagt 6301 tagatcttag agtggtcaac agtgtttgca atgtagtatg tggaggataa taactacctt 6361 attcatttca gaaatgtgac cttcaccggc tggaagaagg ccctcctgtc acaacagtgc 6421 tcaccaggga ggatgggctc aaatactaca ggatgatgca gactgtacgc cgaatggagt 6481 tgaaagcaga tcagctgtat aaacagaaaa ttattcgtgg tttctgtcac ttgtgtgatg 6541 gtcaggtgag tggtaggttt gtggtggaac tgtgttattt aggtactgaa gtatggcttg 6601 tacttattgg gctttaccct gccatatgta tcagaagagt ttgaggctgg taatgtaatt 6661 ttcttttatt tatttatttt tttgagacag tctctctctg tcgcccaggt tagagtacag 6721 tggtgatctt ggctcactgc agcctctggt tagagtacag tgtgatcttg gctcactgca 6781 gcctctgtcc actgggctca agcaatcctc ccacctcagc ctcccgagta tgtgggacca 6841 caggtgcaca ccaacacacc cagctaattt ttgtattttt tggagatacg gggtttcact 6901 atgttgccca ggctagtctc aaacttctgg gctcaagtgg tccgcccacc ttggcctccc 6961 aaggtgctag gattacaggc gtgagccact gtgcctcgct gaagccagta ttttagaatt 7021 aaaaagtaga atgccaaaac ctgctatgaa gcttaggcta aagaattcac ataacattgc 7081 cagttttctg tacctgttct tagagtttta ctattttaaa actttctggc actatgatcg 7141 cctgtactgt atataatttg gagagaaagg attagtttgt tttttgtttt gtgggcttag 7201 gtcaagggtt agagtcaaat acctacaagg gccagccagg tagaataaat gagtgaagaa 7261 ggctaggtat acaaaacaga aaatggtgac agggactcat gctgaactgg caccagcatg 7321 ccctacccag aggaatgcca tgacttggtt ccagccagtt ggtgccatgt ggaaatcagg 7381 ggtaatgttt cctgttttcc atgtctaaga gaaggcggaa gtctggattt tcatgtgaaa 7441 ttcccagtgt tttaatgttg acatctgatg taggctttta ttttaggtca tcatacagga 7501 gaaaggaagg aagtggcaca tgtgtgggtt gccagtttat tgcttctggt ttgggccttc 7561 cactctgtat 
tttggtggaa aatagctact ttctctggtt attaatgaca ggttctacta 7621 gcccacatat ttcactgtgg tctaggaaac gtttttattt agaaacatgt atcatattgc 7681 ctcatagttt ctccttcctc taacacagga agcttgctgt gtgggcctgg aggccggcat 7741 caaccccaca gaccatctca tcacagccta ccgggctcac ggctttactt tcacccgggg 7801 cctttccgtc cgagaaattc tcgcagagct tacaggtttg ctgttgattt acagaaaggg 7861 gaaatgagtg gattaagttt ttaaatatct gtcattaaga gctattatga gttaatattt 7921 gttaaaaatt ttaagtttct ttttttaacc ctctctcctt tggtgctctg gtacttctgt 7981 tgtgctcttg agttaactga ccatttgtga agttctctgg cccctcaggt aaaagtttaa 8041 aacaggttgg tgctataaaa tcacagtagg tttggttatc attcaagcat gccagaagaa 8101 gtctagcagt catagaaagt aagttcggtt gaagcactcc atggtatgca atgtaaattc 8161 tagaaatctt cttaatattc cccttttctt tgtcccccgt gactatttgt ttgttttggt 8221 ggtttttttt tttttttttg agactgtgtc tcactccgtt gtccaggtgg tgtgcagtgg 8281 tgtgatcagg gctcactgca acctcactcc cgggtcaagt gattctcatg cctccacctc 8341 ctgagtagct gggactacag gcatgcacca ccacacctgg ctaatttttg tatttttagt 8401 agagatgggg tttcaacatg ttggccaggc tggtctccaa ctcctgacct caggtgatcc 8461 acctgccttg gcctcccaaa gtgtgctggg gttacaggcg tgcagccgca cctggcctgt 8521 tttgtttttt tgagacagag tctcgctttg ttgcccaggc tggagtgcag tggcctgcct 8581 cagcctccca aaatgctagg attacaggcg tgacgcctgt gcccggtcct cctcctcctc 8641 cttttttttt tttttttttg agacagagtt tcactctttc acccaggctg gagtggctgg 8701 agtgaagtgg tatgattttg gctcactgca gcctccgccc ccggggtcaa gcaattctcc 8761 tgcctcagcc ctcctgagta gcctaggatt ataggtttgc ccaaccacca cacctggcta 8821 atttctgtat ttttagtaga gaccaggttt tcaccatgtt ggccaggctg gtcttgaact 8881 cttgacctca ggtgatccta ccctcttcgg cctcccaaaa tgttaggatt acaggcgtgc 8941 agcggttgcc cggccctcct tgactcttga actatggttg tcctctatat atcaggggat 9001 tggttctagg accctcgagt atacaaaaat cctcaaatac tcaagtccca aagtcagcct 9061 tccatatctt cgggtttgca tcctgagaat attctatttt caatacatgt gtggctgaaa 9121 aaaaatctgt gtataagtgt acctgtgcag ttcaaaccct gttcaaggat tgaatatatt 9181 tagtgtacta gtataggaga ggtcctaaga tgtttgtaac tggccagaaa acccagaaaa 9241 gtccagggta tcatctggat 
ggaacatctg aaggaaacta agtgactaga gagtaggaaa 9301 agctggaaag gttgaagcac atggaactag tgaaaggaca aggagaaaca tgtgtttgcc 9361 tggagggaca ggtacttaga cgactgaact ggcctctgtg ttctaatggt tgagcctcag 9421 agtacatatt tggggtgcgg tttggtttgc tttgtagagt tggtttgttc tgcacatgtg 9481 tatgttctgc catttccagg acgaaaagga ggttgtgcta aagggaaagg aggatcgatg 9541 cacatgtatg ccaagaactt ctacgggggc aatggcatcg tgggagcgca ggtagtcaag 9601 gacgaggatt gtgtgctgct ttagatttgg ccctggactt tgtcttgaaa aacctttcac 9661 agccccagac aacttttcct gaagctagta cagccatgtg ctgcacagtg acgctttggt 9721 caatgtcgca tatatgatgt tggacccata agattataat ggagctgaaa aattcctcgt 9781 cgcctagtga tgttgtagtg gcacaacaca ttaccttttc tacgtttagg tacacaaata 9841 ttttgcctac aggattcagt agagtcacat gctgtgcagg gttgtagcct aggagcagta 9901 ggtctactat acagcctagg tgtgcagtgg gctgtaccat ctaggttcgt gcattacagt 9961 atggtgttca catgacaaaa tcgcctagtg atgcaattct gagaatatat ccctgttgtt 10021 aagtgacgcg tgactatttt gggggcttgg tttgctttta aagacctagt gcttcatatc 10081 ctaccgtttg agagatgagt agatttggat ggtgatttat aatgtttcct tttaggtgtc 10141 tgctgtttta taagtaagca ggaacctcta gcagtggagc cataaccttc cccttcctat 10201 ttatatttca gtacattaat tgctttatct tgtcaacttc attttggggt ccttgttctc 10261 atcatcagtt agtgaatgat gaagaattaa cagcacaaaa ttatatccgg actgtttctt 10321 ttcctttcta atatattaag attctattat gtgttgtttt tttttaaacc taggttttat 10381 ttttcctttt gaaatggagt cttgctcagc cgcccaggtg agcagtggtg taatctcagc 10441 tcactgcaac ctccaccccc gggttcaagc aattctcctg cctcagcctc ccgagtagct 10501 gggaatatag ttacgtgcca ccatgcccaa ccattttttg tatttttagt agagacgggg 10561 tttcacatct tgtccaggat ggtctcgatc tgtggacctc gtgatctgcc caaagtgctg 10621 ggattacagg cgtgcaccac gcccggccag gttttatttt ttaactcttg aatgcagaaa 10681 tgttagtgtt actggttaaa atagaacata gtatttatat attactttag tgctttattg 10741 aaaatatcgg aggtgggata aacagagaga tagggttgga aggagagttt gtagcagcag 10801 tgtaatttct gtgtcagatt ctggccagga gtgaaaatgc agggcattaa ttagtatctc 10861 ccctcatgga tttctgtggt tcctttctcg gttgtcctta atgttaggtg cccctgggcg 10921 ctgggattgc 
tctagcctgt aagtataatg gaaaagatga ggtctgcctg actttatatg 10981 gcgatggtgc tgctaaccag gtaattatgt ctcttaactt cccaaaaaca gtcttatttt 11041 caaagtcttt aatatttaca gttgaatttc taaagaagta gcatattgct tattaggtga 11101 aatagcaagt cctatggcta gctcaaattt ggttgactta tggccagatt agagattgac 11161 ctcttagcgt tgtttcacaa gagacttacg ggggcacatt cctgtgaagg agctcacctt 11221 tgctctacat cagtgcttgg caaaggccct gtggtaaagg acctccccac aacctattgc 11281 aaaacaatac agacccattc tcttggatgt ccgggctggc agtgtcaaat tcggataata 11341 gcgtctgagt cctaactcag tttctatgct tctcttgtta ccgagtaatc cccagtctgt 11401 ggccagcact ctgtgaagcc ctgttctaga ggctgattct taggtgctgg ttcactctgg 11461 ctatccagtg ggcctgatag atttcatatt gatctttttt ccagtgtgtt ccttactgct 11521 agcatggccc caaagaaaca agtagtagtt ggtttgtcac cttccttagt tgcaagagta 11581 tgatgcctgc tacttctcct ccaccaccca ccccgctttc cctcaccacc caaagctcgg 11641 ttttagaaga ggaggccttt ctgtgcttta tgaaagcttt ctgtgccagg cagagcagca 11701 gctgttagag atgatgaagc ctggagaaag aagccaaatg aaaccccttt tcgtaactac 11761 ttccagggcc agatattcga agcttacaac atggcagctt tgtggaaatt accttgtatt 11821 ttcatctgtg agaataatcg ctatggaatg ggaacgtctg ttgagagagc ggcagccagc 11881 actgattact acaagagagg cgatttcatt cctgggctga gagtaaggaa ccctgtggtg 11941 gggccgggcc aacgcaaggc caaggccaag ggtatgtcct tgtgcagacc cttgacgatc 12001 ttagaaacat tggagagttt cattctcata caggagcagg tcatgtgaaa gtaaaatggt 12061 ttggggcagt tggattcact gtcgcccctc ccctgtttat taccaggtgg atggaatgga 12121 tatcctgtgc gtccgagagg caacaaggtt tgctgctgcc tattgtagat ctgggaaggt 12181 aaggctctaa cacgtctccc gtagtgacat ttatctctgg aagttcaaag actgcctccc 12241 atgtgcctgc tgaagctgtt agtgggtacc tgctaattga ggtgcatgag atggaagcag 12301 agtgaaggag cagggctcct ttgggtagct tggtcttggt agctcacctg ctgggaagcc 12361 tacgtttctc tcatttgggg gaagtccgtt ctggtgcttc ctctgctttg gcctgtcttc 12421 atgacaactg atttgccttt tccttaggtt aattctgtcc ctcctcccca ccccccatta 12481 atcatgagtc ccttgaagga acagattggg gatccccaca gtgtccagca tagaatgtca 12541 tgtatacaat aggcatttaa tatgtgtgtc ttatacaaat taaacagtat ggtagaagct 
12601 cgtaatctta gtcttgctca gtcctaagga tctttccctt ttcagtgtat ggcagtgagg 12661 gagatggtaa gagggagact ggcctaaaga ggtttcactc ctttggcact agtataggct 12721 taggaggttt gggtgttctc cctaaatcag tgttctacag ccccagaaac aaggtttctg 12781 aagttggcac ttcagcgcca taccatgtac gtttaggcag tgatgtgaat cagatgtggc 12841 cagccaatga tggaaatgtc acaaagtgga gagacaggga agtgagggtg ggaatgtagt 12901 aacattaaat aagagctaat gagtagaatg aactagagca ggaccaagac tacagcttgg 12961 gtacaatggg ctgagcagtc tatgtcagga ccctaaggct gggagtgcca ggccctgagt 13021 gatgggtatt gacgccacct ctcccaattc ctgaaaagaa tacatggggg cagccctggc 13081 tggagacaag gggagttgta gaatgtagtt gtagaatgtg tgactttccc aaacagataa 13141 cccttcacaa ttctgagtta ggtgttattg ttttgcccat tttgcagaca aggaaactga 13201 gccatagagc agttaaaggt acacagctac tctgtcttga gccagctccc cactcgccca 13261 ttttaactgc tgcaccaaac caaccaagga tcctcaatgt ttctccaccc cagtgttcag 13321 agccgtctcc tggaagtggg ctagtgtaga atgagcgagc ccaactgtcc agactcccag 13381 cagagccttg gggtttgggt agcagaggtt gttgtgcctg ccaactttgt tacacactag 13441 caaggtctgt gaagtaggag tgctgccagc cccacaacac accatgagaa aggagcatga 13501 gatggaaatc tgtctagcca atagcaggag gctctagaac atgctcagag cctttttctt 13561 ttttcacagg gtctcgcgca gtggcacact cacagctcac tgcacccttg accttccaga 13621 atcaggtgat cctcccaagt agctggacta caggcgcacg ccaccatgcc tggctaattt 13681 ttgtattttt tgatagagat ggggttttgc catgttgtcc aggctagtct cgaactcctg 13741 ggcgcaagtg ctctgcccac ctcagcttcc caaagtgctg ggattacagc atgagccacc 13801 atcctggcct tcaaagcctt tttgattcaa aacaagctgt tgcagattgc cttattaaaa 13861 atgagaaaga tgaaatatgc aatcaatact tgctagaaat gagaacagat cagtcaacca 13921 ataagaaatt cgtgacaact cagaagatct gataagacta cactggacgc ttaataaagg 13981 gcctgcgttt gaggccgtgg attgccggcc tgttcttcca gtcatcgttc ctaactaact 14041 aactgctacc ggttctgttt taggggccca tcctgatgga gctgcagact taccgttacc 14101 acggacacag tatgagtgac cctggagtca ggtacgctca tgggcagtgt ggtttccata 14161 ggggtgggct ttgaatggtg ttacatggca aaagcaacac atttcagtat ttgcttttgg 14221 agctagatac cagttcactt catgtacgca gtgtgttggg 
catcaagtta tctgaaagca 14281 gtgcctccta ataagaaagc tttctgaaaa tgcctacagt gtatagtgtg tgtgagcaca 14341 agcagcaatt tttaaagaaa gagtgttata taccttaatt gtagagggaa attgttacat 14401 aaaacagggc tgatggtagt gtagtatctt gggggcattt agcttttgta ggcattaccc 14461 caagtctaca cctggcatac tctacctgaa gaggatcaga atttggagat gttcctctgt 14521 ctggtataga gctaagggta actgggtgtt caatccgttg aaatacatca atcaaaaaag 14581 ctaggttcca ctaggaaggc atgaatttta cttttttgct tagtagctct ggcctgtgat 14641 tctggaatcc cagttttgac acagttgata aataagcctt tgtagagtgg acttctaagg 14701 aaaaatcatg tagagagcac gatatggaaa aatgcacttt gtgtaaatct attgttgaaa 14761 atggtagaat ccttttagtg ttacttcaga tgatataggc ataagataca ttggttttgc 14821 tggctgtgct tctttagggg gacttaaggg agaaaggcaa ggcacatgga tttcctgctt 14881 ggcgctctga tgtctcaaag tctaattatc accacacaca ccatctctgc tgtccccacc 14941 catgtagtat acaggagccc aaatgggtgg gacaagtgac acttctttag aaccttacat 15001 ctaaatcaaa gcagcaagca aaaacttggc ccctgttgtc ggaatgccag ggaagccatg 15061 tgactcacca gtgtacggtt ttctagaaaa gacagaagca gttattacag aatgttaggc 15121 tgcgttctgg tattttgaaa gtataacaca ctctgccagc tatagtgaca taagcatggt 15181 atgccccttt gtttcagaaa cacacttctg tatttcactc attgggacaa tccaacccct 15241 atactagttt ctacacgcgt ccttggctct actggaactg ctcttactga tcgattacta 15301 cttttccctc cccatagtta ccgtacacga gaagaaattc aggaagtaag aagtaagagt 15361 gaccctatta tgcttctcaa ggacaggatg gtgaacagca atcttgccag tgtggaagaa 15421 ctaaaggtac agtcacttgt tcatggtggt ttgaaggttg gctttaaaag ttgccacccc 15481 tgcgcgtgac agagtttgtg tgggttcctc caagcccaga aagtgatgtc ctgggacata 15541 aatagttcca tagttccaaa gtcccttggg tgggggcttt tcctttagtt tcctctattc 15601 aaaattgtat tactcttcag atttcagatt ttggtggact gtgaaccacc atcacagtgg 15661 caaagcccca cagtagtatg gttctttttt cctaaaagta tactgtggat ttttaattca 15721 taaaatagat acaccctaga aatctgtatt ccaaaatctt ctgaattagc aactgttcgt 15781 acttgtagtt aaagagttac accagcaaca ggtcctcagc agaactctag ttggtaccta 15841 agctgctgtt cattctaaaa ccttttacac tgttacctaa tttttaggaa attgatgtgg 15901 aagtgaggaa ggagattgag 
gatgctgccc agtttgccac ggccgatcct gagccacctt 15961 tggaagagct gggctaccac atctactcca gcgacccacc ttttgaagtt cgtggtgcca 16021 atcagtggat caagtttaag tcagtcagtt aaggggagga gaaggagagg ttataccttc 16081 agggggctac cagacagtgt tctcaacttg gttaaggagg aagaaaaccc agtcaatgaa 16141 attcaatgaa attcttggaa acttccatta agtgtgtaga ttgagcaggt agtaattgca 16201 tgcagtttgt acattagtgc attaaaagat gaattattga gtgcttaaag attattttga 16261 cttaaatagt atactttgaa catactctaa ttatgaaagg aagaacaatt cctgtatgcc 16321 tgtttcccct gcccccaagc cccctttaat tgggaggaag accattatgg aaggggaccc 16381 atcacagcaa ttctaccaac catagcaccc accccgagca gcgctggtgc tgcagcctgt 16441 tcgcgctgac catttctcta caagatacaa tatttattat caggcaagag gacagttcca 16501 ttttaaaata agacttttgt aatcattcca attttgtaat catttcaaag gccacataac 16561 ttagttttct ctacttacac attcagtata aatatgaagc tattttctgt tcatatcaaa 16621 cattaactac aaggcacatt cgtatcagtt ttgtgtttct caaattgaag taccatacca 16681 gttctgaggc agtgtcccag cttccatgtt tgttaaatac cccttgtttg tttcaccatt 16741 ccagcaagtg ctgaagggtg tacttttttt gagacagggt cgggctctgt tgcccaggct 16801 ggagtgcagt ggtgtgatca tggctcactg cagcctccac acctcctggg ctcaagcaat 16861 cctcccacct cagcctcctg atagctggga ctacaagtga atttcctaat attccgggag 16921 gtcaaaacca aggctcactg ttttcacaat acacacagtt ctatgtttat aaataacagg 16981 tttcaaaaga aactcaggac agtatttaaa acaagttctt aaactattaa ttgaacaatg 17041 gcatttttaa atatgtaaac acagcggaat tc // LOCUS HUMPDHBET 8869 bp DNA PRI 20-JAN-1992 DEFINITION Human pyruvate dehydrogenase (EC 1.2.4.1) beta subunit gene, exons 1-10. ACCESSION D90086 NID g219983 KEYWORDS pyruvate dehydrogenase; pyruvate dehydrogenase beta subunit. SOURCE Human leukocyte DNA, clones pGPDHB[8-3,8-4, and 8-5]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8869) AUTHORS Koike,K., Urata,Y. and Koike,M. 
TITLE Molecular cloning and characterization of human pyruvate dehydrogenase beta subunit gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (15), 5594-5597 (1990) MEDLINE 90332628 COMMENT These data kindly submitted in computer readable form by: Kichiko Koike Department of Pathological Biochemistry Atomic Disease Institute Nagasaki University School of Medicine 12-4, Sakamoto-machi Nagasaki 852 Japan Phone: 0958-47-2111 x2347 Fax: 0958-47-8514 Author corrected the sequence length from 8872 to 8869 bp on 18-JAN-1991. FEATURES Location/Qualifiers source 1..8869 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_unit 475..753 /note="Alu repeat 1" CAAT_signal 2233..2237 prim_transcript 2315..8623 /note="beta subunit mRNA and introns" exon 2315..2485 /note="beta subunit exon 1" CDS join(2444..2485,2568..2621,4267..4374,4458..4520, 4622..4657,5311..5596,6016..6126,6453..6544,7639..7780, 8074..8219) /note="beta subunit" /codon_start=1 /db_xref="PID:d1014826" /db_xref="PID:g219984" /translation="MAAVSGLVRRPLREVSGLLKRRFHWTAPAALQVTVRDAINQGMD EELERDEKVFLLGEEVAQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMA GLRPICEFMTFNFSMQAIDQVINSAAKTYYMSGGLQPVPIVFRGPNGASAGVAAQHSQ CFAAWYGHCPGLKVVSPWNSEDAKGLIKSAIRDNNPVVVLENELMYGVPFEFLPEAQS KDFLIPIGKAKIERQGTHITVVSHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDM ETIEASVMKTNHLVTVEGGWPQFGVGAEICARIMEGPAFNFLDAPAVRVTGADVPMPY AKILEDNSIPQVKDIIFAIKKTLNI" intron 2486..2567 /note="beta subunit intron 1" exon 2568..2621 /note="beta subunit exon 2" intron 2622..4266 /note="beta subunit intron 2" repeat_unit 3543..3836 /note="Alu repeat 2" exon 4267..4374 /note="beta subunit exon 3" intron 4375..4457 /note="beta subunit intron 3" exon 4458..4520 /note="beta subunit exon 4" intron 4521..4621 /note="beta subunit intron 4" exon 4622..4657 /note="beta subunit exon 5" intron 4658..5310 /note="beta subunit intron 5" exon 5311..5596 /note="beta subunit exon 6" intron 5597..6015 /note="beta subunit intron 6" exon 6016..6126 /note="beta subunit exon 7" intron 6127..6452 /note="beta 
subunit intron 7" exon 6453..6544 /note="beta subunit exon 8" intron 6545..7638 /note="beta subunit intron 8" repeat_unit 7097..7385 /note="Alu repeat 3" exon 7639..7780 /note="beta subunit exon 9" intron 7781..8073 /note="beta subunit intron 9" exon 8074..8623 /note="beta subunit exon 10" polyA_signal 8602..8607 BASE COUNT 2291 a 2041 c 1997 g 2540 t ORIGIN 1 gaattctttc ttgtgtgaga tccaagaact ctctcttggg atctcgatcg ggactgcttt 61 cctgtaacat ctccttttgc ttggcaaatc tgatcctgaa atcttttggt gctaggttgt 121 gggaaaggtg ggaaggactg attcactttt tcattctgag caccacccac aaacttttgg 181 gggctgaaat ctcaagttca ggagtttctc agggatgctg tggcagtcta ggaagtgctg 241 gccattcctg aggccctggc agggacagtt gggactagac gatggtggca ggtgagggag 301 actcagaagg tgttgattcc atatcttgaa gatcttctat ccattttaca gatgaaacag 361 tcccagagag taaatgtcat ctctatgcag ctgtccccaa atttctacct ttagccctga 421 ctaggcccta ggctccaggc aacctgggta tgtcccagtc atctaaagaa taaaggctgg 481 ctgcagtgct catgcctgta atcccagcac tttgggaggc tgaggcgggt ggatcacttg 541 aggtcaagag ttcaacatca gcctggtcaa catagtgaaa actcatctct actaaaaata 601 aaaaaaatta gccgggcatg gtggcaggtg cctgtaatcc cagctactcg ggaggctgaa 661 gcaggggaat agcttgaacc tgggaggcag aggttgcagt gaggcccact gcacttcagc 721 ctgggtgaca gagtgaaact ccatctcaaa aaataaacaa aaatacactc ttacttcccc 781 agctcctggc acttacactg ccccaggtcc tcccttcccc cactatccat ccagcaatga 841 caaaaatcca aaattttcct ctgctcttcc ctagcactta cagcattcct attgattcta 901 gcccaaatag atctaaaact gttcccctcc ctctccatct ggaagttgcc tcctagtctg 961 cctctgttgc tgccacagac ttcccagcat ccgcctgttc tgcctttgcc tccaccccct 1021 cccattccca gtcagcaacc tactggaagt tgcaaattcc taatttggca gcctgcagcg 1081 ggtgatccat agaagggaat gtttggaatg tcttggggat gggtggagac tgggtgctca 1141 attgtcccat ctgaccccca aggccaaatt aagactaatg agcttccaga ggttgtccct 1201 atggccaagt gtttggcacc tccccataag ggaggagacc acccctcata ttgtcttatg 1261 cccaatttct gcctccaaag aaagaaaaag taaaaactaa aaggcaggaa atgaaatcca 1321 caaacagaga tcctggtgcc acaccctggg ccttggtaaa gatcgccccg acctaatcgg 1381 ttatgttaac 
tatagattac agacattgta tagaaaagca ctgtgaaaat ccctatcctg 1441 ttccggtcta attaccggtg cttgcagccc ctagtcacat accccctgct tgctcaatca 1501 atcatgaccc tctcacgcac acctccttag agctgtgagc ccttaaaagg gacaggaatt 1561 gccattcggg gagctcggtc ttgagacaag agtcttgccc atcccctggc cgaataaacc 1621 ccttccttct ttaactcggt gtctgaggag tttcgtctgc ggcttgtcct gctacatcca 1681 tagggccgcc cttctctaaa gttgttgtaa ccttccccca agcaggaatc cacttctgct 1741 ttaggaagca gtgggaaaaa gaccatgctt tagagtcaag acagtcctaa attaaaatat 1801 cggctcattt aatcaatttc tgcttcagtt taaccgtctt caaaatggag ataataataa 1861 cactaattac ccaagtgggg tactgtgaga atgagctata cagcaatatc tcaatgcttg 1921 gctcacagta ggtgcttaat acaatcaatg aaagctatga tgtgcattat aaggaatcaa 1981 cagacagcac tgtccaggga agactagact cactcatgaa gtgttaagtt ccccatcaat 2041 agagcgtgtg caagcacttc ttggtgccca aaggattcca gcacccacaa caggttggcc 2101 taactcttaa ggtctctcca gcttccagag atcatgctcc tcagggtctg acttctagtc 2161 cccacgcttc aagtaacacc gaccaggcac gtgtccgcag cgtgcacccc gcctgtcggt 2221 caccgcctac tgtcaataaa gttgatttga ggcggctcca ggcagagggg cacggccacc 2281 cacacgccac ctcagcccgg ccccgcggcc cctcaccggt gggcgggtgg gcgggtcacg 2341 cagccggcgc gccttgccat tgcccgcgcc ggtggaggcg ggcgtgctca cgtctgccgg 2401 cccctctggg gtcgtttggc agcggataga ggacacgacc aagatggcgg cggtgtctgg 2461 cttggtgcgg agaccccttc gggaggtagg cagcagcgcg cggcctgtat tcccacgcga 2521 cccctgccca cccctgtccc gccgtcatct tcctctctcc ctgccaggtc tccgggctgc 2581 tgaagaggcg ctttcactgg accgcgccgg ctgcgctgca ggtaacaggc gcgggtgctc 2641 gtcggacgag ggcggggagg tccccccttc tctcgggaag gaggccggag tccggcctgc 2701 gctgtcggtt ccggccgcgt gggccttggg tttgcggagc cagcatcctc cgtggagctc 2761 ctgctgcctc tgtcgccgcc tggcaattac acaccggcga gtcaccacct cacagctggc 2821 tgggcaatgg gggaaaacta agactaggcg tttcctccct cactccatgt ttttttcttt 2881 aaattgcagc aaaatagacc taacttaaaa tgtacttttt agaagtgtat aattcagtgg 2941 cattaagtac attcaccatg ttgtacaacc attaaccacc atccatttcc agaacttttt 3001 aatcatccca agcagaaact gtccattact cggtaactcc tcatttccct tccccaaacc 3061 cctggtaacc cctgttgtac 
ttttctgtct ttatgaggtt gcctattcta gaacttctta 3121 taagtggcat catttgtcct tttgtgtctg gcttatctca gcataatgtc ctcaaggttc 3181 atccatgttg tagcatgcgt cagagtttga ttctttccta aggctgaata atatatatat 3241 gtatattttt ttgagatagt ctccctctgt cacccaggtt ggagtgcagt ggcgcgatct 3301 cagctcactg caacctccac ctcctgggct caagggattc tcgcgcctca cactcctgag 3361 tagctgtgat tagatgccgg tgccaccaca ccgggctaat ttttgtattt ttagtacaga 3421 cagggtctca ccatattgcc caggctggtc ttgaactcct ggcctcaagt gatctgcctg 3481 ccttggcctc ccaaagtgct aggattacag gcatgagcca cctcgctcag cccgacattt 3541 tctttttttt ttttgagacg gagttttgct cttgctgccc aggctagagt gcagtggcac 3601 tatctctcag gctcactgca atctctgcct cctgggttca agcaattctg cctcagcctc 3661 ccgagtagct gggattacag cacccgccac cacacccggc caatttttgt atttttagta 3721 gagaatgggt ttcaccatgt tggccagggt ggtctcgaac tcctgacctc aagtgatcca 3781 cctgccttgg cctcccagag tgctggaatt acaggtgtga gccaccgtgc ctggccttgt 3841 tttgtctatt gtgagtaatg ctgctgtgaa cattgatgtg caagtatatg cttgaatccc 3901 tgctttcaat tatttggggc atatacatag aagtggaatt gatggattgt atgttaattc 3961 tcctctcctt ccttttttta actagatttt accattcatt atattaaaag agcaactcct 4021 acattctgga tgcaaagtat ttggcatggc gtggcaccta gtgagcactc agtacattaa 4081 cacatgcaaa gaccaactat gtgtaataga aatgttactt ttagaattct ttgagttttt 4141 agagagaagc ttgttctagt gtttttatat tgtatgctgt gggcagccta ggtaaatata 4201 aaggaagaaa atcttagttg aatcccaaac aattctgtat tgactgtttt tgcatacctg 4261 tgacaggtga cagttcgtga tgctataaat cagggtatgg atgaggagct ggaaagagat 4321 gagaaggtat ttctgcttgg agaagaagtt gcccagtatg atggggcata caaggtaact 4381 aaaatatatg aaagttatac aaaattgagg tcactgttac ccccagatac acttaaggga 4441 tcaactctta attgtaggtt agtcgagggc tgtggaagaa atatggagac aagaggatta 4501 ttgacactcc catatcagag gtaagccttc cagtgggaag cagaccctgg aggcatgtct 4561 attagtgtcc attatcctca tttttatgga ttttcactta tgtttatatt ttactttata 4621 gatgggcttt gctggaattg ctgtaggtgc agctatggta tgtaatactc ctttatgaaa 4681 cttatccaaa gacattcttg ttttcttttt tttttctttt gagacagagt ctcactctgt 4741 tgtccagcct ggagtgcact ggcaccatct 
ccccttccag gttcaagcaa ttctcctgcc 4801 tccagcctcc atagtggctg gggttacagg tggccactgt gcctagccaa ttttttgtgt 4861 ttttagtaga gatggggttt accatgttgg ccaggctggt cttgaactcc tggcctcaag 4921 tgatccgacc gccttggcct cccaaagtgc tgggattaca ggcgtgagcc actgtgccca 4981 tccccaaaga gatttaacat caccttttaa aaacatcatt tgggaaaaaa aaaaaatacc 5041 aagtaggtag atgtggtggt gcttgcctgc agtcccagct actcgggatg ctgaggaagg 5101 attacttgag cctaggagtt agaggctgca gtgagctatg atgttggcca ctgcactcca 5161 gcatgggtga cagtgagacc ctgtccataa aaaataaaaa taaaaataaa taaaaacatt 5221 atttgaatgt tggaagactt ggagtgattt gtttgctact gtttgttgct gtttctagtc 5281 tttttaacat ccacgttttt gattttacag gctgggttgc ggcccatttg tgaatttatg 5341 accttcaatt tctccatgca agccattgac caggttataa actcagctgc caagacctac 5401 tacatgtctg gtggccttca gcctgtgcct atagtcttca ggggacccaa tggtgcctca 5461 gcaggtgtag ctgcccagca ctcacagtgc tttgctgcct ggtatgggca ctgcccaggc 5521 ttaaaggtgg tcagtccctg gaattcagag gatgctaaag gacttattaa atcagccatt 5581 cgggataaca atccaggtca gcacagggac cctttggttt tttttgaaga ttgaaaaact 5641 aaaatgatat atttgcacag aggaaaccat taaagtcttt tagttaggct tagatatatc 5701 gtatagaatg ttaatacttt atatagtaaa gagtagtaaa gaatgtgctt gcttttccta 5761 tttccgaata taagggggcc agctaggtaa aggggacatt tactgcacac ccacaatact 5821 ggtttgatag ataaaattat attgaccact acaaatttaa gtacctaaac tgctctcaag 5881 gctgctggtc atggtagaga taaggctaat cagcacaatg aaacttctca caagatgaat 5941 gatattgctt tctatgcagt tttagtcatt ctggatgtga ttaattactt ttgtttaaat 6001 atttatgttt ttcagtggtg gtgctagaga atgaattgat gtatggggtt ccttttgaat 6061 ttctcccgga agctcagtca aaagattttc tgattcctat tggaaaagcc aaaatagaaa 6121 ggcaaggtaa gaaaactaag atatataaac agtattttta atccagcccc tcaattttgg 6181 taaaatttaa tctagagtta ctgcatttac tttgactgaa aaattatagg atgagctcaa 6241 gatgccccaa tttctcattc agttgtgctt ttctagatct cagtctggta ggtttgtgtt 6301 tcatttctag gaaggctgcc acattcctca gctcactgga atgaagttga attgaagact 6361 ttatgaatga acaagagcca cagctggctg cacagtggtg tagcatcttc agtttcaacc 6421 tgatctctga tctgtgacca ctctgttttc aggaacacat 
ataactgtgg tttcccattc 6481 aagacctgtg ggccactgct tagaagctgc agcagtgcta tctaaagaag gagttgaatg 6541 tgaggtgagt ggacatttac ttgttcttaa agaattagaa gggctccttt tagatttaag 6601 acccagaaga gttctgtaac aggttaatta gatgtaaaac cttgctttgt cactcatttc 6661 tgtcttcttg cttaaagctg ggaacctgcc tgcagttcac atacttagct tttacctctt 6721 caggtcactt aggtctggac ttttttgaga gacagtggga gctctctgaa cctcttctgg 6781 ttctctcagt gatacccccc gagagagaaa aaaacaggct tttgttctgt tgcccaggct 6841 gaagtgcagt ggctggatca tggctcactg taaccgtgaa ctcctgggcc caagtgatgc 6901 tcccacctca cccttctgag gttctggatc ttttagaaac tcaccatgtg ctagttatac 6961 ttgggcaagt tatatactct aaatctgttt catcagctat aaaacaggga taatataaat 7021 actgtcccgt catagggctg ttgtgagaat tacacttagt gtagtgcttg gtacatgatg 7081 ggcactcagc tgttaactgt taacttattt tgatgaggga atttaaagtg aagggaggat 7141 tggctgggcg tggtggatca cctgaggtca ggagttcaag accagtctgg ccaacatgtt 7201 gaaaccccat ctctactaaa atacaaaatt agctaggcgc ggtggtctat agcctgtaat 7261 ccagctactc cagagctgag gcaggtacga atgacttgaa ctggaaggcc gggggtggca 7321 gtgagccgag atcacgccac tgcactccag cctggacaac caagagcaaa actccatccc 7381 aaaaagaaat aaaaaagtga aggaagataa atgaatccta taactgcaca gagctgcttg 7441 tggtagacat acagatcttt gttttaatta aaatttttcc ttttacagtt tgtacattca 7501 tgaataggta ttcataggca gattagattt tcctagatac aaagattaac aggaaacaaa 7561 aaaggaaatg gtattatggg gataaggctc tatttgtgct gcaatagctc tgtaaacttg 7621 attatattac ctgtttaggt gataaatatg cgtaccatta gaccaatgga catggaaacc 7681 atagaagcca gtgtcatgaa gacaaatcat cttgtaactg tggaaggagg ctggccacag 7741 tttggagtag gagctgaaat ctgtgccagg atcatggaag gtatttcaaa gaaactggtt 7801 ctagaataca gtcaagctat agtaaacatc atgtacctga atacgtacag gataaagaga 7861 ttgaaagcta ggagtggcca aacgacttca actttaaact tgtaagtttg gcatttctct 7921 tcaggcagat atttttaaaa gcaaatccag aaagcggttc cccataatta gcattcctta 7981 cattttctct tcctggtcag ccacacctct gtgccaaggt ggtagtgccc agctggcatt 8041 tgcactctct cagttgagcc ttccttctcc taggtcctgc gttcaatttc ctggatgctc 8101 ctgctgttcg tgtcactggt gctgatgtcc ctatgcctta tgcaaagatt 
ctagaggaca 8161 actctatacc tcaggtcaaa gacatcatat ttgcaataaa gaaaacatta aatatttagt 8221 ttggacttga atatcaagtc gttgaaattt atttgaaata cttgctggca ctgcacctgg 8281 atttgtactg caagacctga ctattcataa aggaaaacga tttctaaagc aacagcaggt 8341 atttttgtac agggaagttt aaatgtgttt gtgtatggaa aactctccac tctcctcccc 8401 tagatgccat gcttcctttt gtctgttacg gttgccatgt tctttgaata acaaattata 8461 tcacatttta tcctctctca ccacaaggac aaagtatgga tgtggcagag tcctgatgaa 8521 agatgtatcc aaacaagata acttatatgt ataaaattaa agcatataat acacatttac 8581 tgttagtttg ttttgataag gaataaagga atttctaaca tgactaattt tttttttttt 8641 tgagacaatg tttggctctg ttccccaagc tggagtgtga tggcacaatc ttggcctcac 8701 tgcagcctcc acctcccagg ttcaagtgat tctcctgcct cagcctcagt aggaaattag 8761 ccaccacatc cggctaattt tggtgttttt agtagaggtg gggtttcacc atgttggcca 8821 ggctggtctt gaacacctga cctcaggtga tccgttgacc gaggtcgac // LOCUS HUMPEPYYA 2378 bp DNA PRI 28-OCT-1993 DEFINITION Human peptide YY gene, complete cds. ACCESSION L25648 NID g409744 KEYWORDS peptide YY. SOURCE Homo sapiens male adult lymphocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2378) AUTHORS Herzog,H. 
TITLE Genomic organisation and localisation of the gene for the human peptide YY JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..2378 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="lymphocyte" TATA_signal 528..532 exon 556..646 /number=1 intron 647..1012 /number=1 exon 1013..1200 /number=2 CDS join(1013..1200,1307..1387,1516..1540) /codon_start=1 /product="peptide YY" /db_xref="PID:g409745" /translation="MVFVRRPWPALTTVLLALLVCLGALVDAYPIKPEAPGEDASPEE LNRYYASLRHYLNLVTRQRYGKRDGPDTLLSKTFFPDGEDRPVRSRSEGPDLW" intron 1201..1306 /number=2 exon 1307..1387 /number=3 intron 1388..1515 /number=3 exon 1516..1756 /number=4 polyA_signal 1733..1738 BASE COUNT 465 a 733 c 762 g 418 t ORIGIN 1 ctgcaggtca ggagccaagg tcaggtgctc ccaccctgcc tccaggagaa agacttttcc 61 caaaaggtcc tgtcattcct cagccttctc caaggtatta ctgaacagag gctgaaaagg 121 gttgtctttt ggaaagagtg gtcacacatc ccacggactg gagggaatgt gtccaccctc 181 ccacccccaa ctcccatcca cggacaggct agggaagaca gaatctcaac cctggtccta 241 ggggctgccc caggagtcta gatccaaccg cttcacttcc tcaggaggct agacctgcca 301 ctggggggct gtggggtgaa catggggttc aggctgggta ccccatccac tgagaccacg 361 aatctgagca tgcaactgcc ccctcagctc ctgcgaggtg gagacatcac gcactggctg 421 ggtataagct cagcgatctg gcgagggtgg aaggagggag gaagaagggc ccaccccctg 481 ctggctcccc ctccccctgg gactccccct ccctttccag gcggggatat aagccccaca 541 aggaaagcgc tgagcagagg aggcctcagc ttgacctgcg gcagtgcagc ccttgggact 601 tccctcgcct tccacctcct gctcgtctgc ttcacaagct atcgctgtga gtggctgggg 661 gctcaagcct ggggaaaccc aggctgtttt gtggggaaac tgaggctggg atggtggaag 721 ctgggggtga gtggtaggaa actggattgg agggggccat cggaggtggg aaggaggcca 781 gcagcggggg tgggggtggg gaggcaggca gggcttatgc tcctctgaca tgaggaaact 841 gagtcaggag acggtgcagt gcccaagaac acacagcaag cccggtggct gagccttggt 901 ccggtaccag ccccagggag agcgtcggag cgcggccccg acggcagccg cagcctcccg 961 ccgtgtaggg tcgaggcttg gaatgaccca cgtcccaatg tcgccttccc agatggtgtt 1021 cgtgcgcagg ccgtggcccg ccttgaccac agtgcttctg gccctgctcg 
tctgcctagg 1081 ggccctggtc gacgcctacc ccatcaaacc cgaggctccc ggcgaagacg cctcgccgga 1141 ggagctgaac cgctactacg cctccctgcg ccactacctc aacctggtca cccggcagcg 1201 gtgagcgcag gcgctgagcg ggacccctgg tgctctcacc ctgcggcccc gctccacctg 1261 ggggcgtggc cagatctgac cacgctcttc ccttccccgc gcccaggtat gggaaaagag 1321 acggcccgga cacgcttctt tccaaaacgt tcttccccga cggcgaggac cgccccgtca 1381 ggtcgcggta aaagcgcccg ttaccacaca tcctgcatcc gagagagcgg cctggcccta 1441 ccctggcaac atcatttaac gacgtctccc aggctcgcct cccaagatcc aattccttcc 1501 acttcgcttc cgcaggtcgg agggcccaga cctgtggtga ggacccctga ggcctcctgg 1561 gagatctgcc aaccacgccc acgtcatttg catacgcact cccgacccca gaaacccgga 1621 ttctgcctcc cgacggcggc gtctgggcag ggttcgggtg cggccctccg cccgcgtctc 1681 ggtggcccag ccccctgggt gagggctgtg tgtggtcctt ccctggtccc aaaataaaga 1741 gcaaattcca cagaaacgga atgtatgcct ttgtatttcc tttcctttct ccttcgtgga 1801 ggtggcatcc gtagaagtgg agggtgaggc gtggggcagg gaagagaaaa ccgacttgga 1861 cagaggaccc aggacaaggc tggggcgggc tgcgagttcc ctctgctcga cccaggaaga 1921 ctcgaccacc cagcctggag ctgacacgcc caggccagga actgctgtgg aaggaactct 1981 ggtcgagtgg gggcaggagt ggatggaact ctggaagaag gggcggggag cggcgacagg 2041 agcctgctag gcagcctagg agagcagatg gggtcagggc tgtccctcgg ggaccccgtg 2101 aagccgaaga cggctggggg acgagtcggg caccaacacc cggggcgttt tcccaggagc 2161 tgtagtacta gcgcccaggg ctggagtggc tggaggtgaa gtgtaatcca aaatggggct 2221 gtggaggaag agccgtgtgg gggtgagatg gctgtacgtg tgctggccgg agaggcaggc 2281 catttctacc gtccccctta ccctgagtag aggagcaatg ggagtgtcga ggctacccta 2341 tccctaagaa acctccctcc ccaatctcac ctgaattc // LOCUS HUMPF4V1A 1468 bp DNA PRI 07-JAN-1995 DEFINITION Human platelet factor 4 varation 1 (PF4var1) gene, complete cds. ACCESSION M26167 NID g292389 KEYWORDS platelet factor 4. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1468) AUTHORS Green,C.J., Charles,R.S., Edwards,B.F. and Johnson,P.H. 
TITLE Identification and characterization of PF4varl, a human gene variant of platelet factor 4 JOURNAL Mol. Cell. Biol. 9 (4), 1445-1451 (1989) MEDLINE 89261772 FEATURES Location/Qualifiers source 1..1468 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 58..66 /gene="PF4var1" /note="T-rich region" promoter 184..190 /gene="PF4var1" exon 214..380 /gene="PF4var1" /number=1 /product="platelet factor 4" prim_transcript 214..1411 /gene="PF4var1" /note="exons and introns" mRNA join(214..380,699..825,951..1411) /gene="PF4var1" /product="platelet factor 4" gene join(214..380,699..825,951..1411) /gene="PF4var1" sig_peptide join(281..380,699..700) /gene="PF4var1" /note="leader sequence" CDS join(281..380,699..825,951..1038) /gene="PF4var1" /note="precursor protein" /codon_start=1 /product="platelet factor 4" /db_xref="PID:g292390" /translation="MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQC LCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHL ES" intron 381..698 /gene="PF4var1" /number=1 exon 699..825 /gene="PF4var1" /number=2 /product="platelet factor 4" mat_peptide join(701..825,951..1035) /gene="PF4var1" /product="platelet factor 4" intron 826..950 /gene="PF4var1" /number=2 exon 951..1411 /gene="PF4var1" /number=3 /product="platelet factor 4" polyA_signal 1126..1131 /gene="PF4var1" polyA_signal 1258..1263 /gene="PF4var1" misc_feature 1303..1308 /gene="PF4var1" /note="GT cluster" polyA_signal 1371..1377 /gene="PF4var1" BASE COUNT 363 a 380 c 333 g 392 t ORIGIN 1 gaggttggag gtatgcatct ttgtaccacc tcctagccaa ggcaggtgcc cccagccttt 61 tgttttgtaa tcttggctgg ccagagtctg agtcttcata gcagtgtctt agctcctgca 121 ccacagttcc tcgctgtcca caccaggctt ccggactgga aggacagtgg gacagtgacg 181 ggggataaaa gaagcctggt gaggccagga gtcactgcct gcagaacccc agcccgactt 241 tccctgcgca ctgggatcct gctggaacct cagctgcaac atgagctccg cagccaggtc 301 ccgcctcacc cgcgccaccc gccaggagat gctgttcttg gcgttgctgc tcctgccagt 361 tgtggtcgcc ttcgccagag gtgagagcag aaaccaggct gggagggcca gcagcggcga 421 gggggagtcc 
gggaagccct ggggctggga ggaatcctct aggatcatga tcgcagctgc 481 tcttattgcg cgcgtgctga gtctgcgggt acagtgccag gcactgcacg tgcacctcgc 541 cacctgctca gcacaacctc tgtctgagag aggtctgatt tacggctaag gaaaagaaag 601 ctgaaggtag tggaaaaggt ccctaaagta tctctggcta ctcaggagtc acaactccca 661 cccctcctcc tctcttactc cctccctttc cccctcagct gaagctgaag aagatgggga 721 cctgcagtgc ctgtgtgtga agaccacctc ccaggtccgt cccaggcaca tcaccagcct 781 ggaggtgatc aaggccggac cccactgccc cactgcccaa ctcatgtgag tcctcgcact 841 gcatcagtta gtgctcccgc tccgtgcctc ctctgcccat ccctccccct tctaatgcca 901 tttgcaaacc caaggactga aagtcacgtc tcttctcttt tccctgccag agccacgctg 961 aagaatggga ggaaaatttg cttggatctg caagccctgc tgtacaagaa aatcattaag 1021 gaacatttgg agagttagct actagctgcc taagtgtgca ctttcaatct aactgtgaaa 1081 gaatcttctg atgtttgtat tatccttctt atattatatt aacaaaataa atcaagttgt 1141 ggtatagtca atctatttct taataatact gcaaaaataa tgctgacaca tcacaatttc 1201 atattttaaa atttccagaa ttttaagcaa aaagcattat gaaggaaggc ttggtttaat 1261 aaagactgat tttgttcagt gttatatgtt agctgataca tatttgttca tttatgtgat 1321 tgcagtactt tatagctaca tatttacctt gaatgttaca attagcttgc caataaatat 1381 tagtagctct taagcattac atctgcaaag atatatatat atatatatat atatatattt 1441 tttttttttg ccatgttaag caacacat // LOCUS HUMPGAMMG 3771 bp DNA PRI 07-JAN-1995 DEFINITION Human phosphoglycerate mutase (PGAM-M) gene, complete cds. ACCESSION J05073 NID g189873 KEYWORDS phosphoglycerate mutase. SOURCE Human DNA, clones PGAM4.11 and PGAM5.11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3771) AUTHORS Tsujino,S., Sakoda,S., Mizuno,R., Kobayashi,T., Suzuki,T., Kishimoto,S., Shanske,S., DiMauro,S. and Schon,E.A. TITLE Structure of the gene encoding the muscle-specific subunit of human phosphoglycerate mutase JOURNAL J. Biol. Chem. 
264 (26), 15334-15337 (1989) MEDLINE 89359363 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Schon, 07-AUG-1989. FEATURES Location/Qualifiers source 1..3771 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7p13-p12" prim_transcript 921..3751 /note="PGAM-M mRNA and introns" gene join(963..1376,1480..1660,3549..3714) /gene="PGAM2" exon <963..1376 /gene="PGAM2" /note="phosphoglycerate mutase, (EC 2.7.5.3)" /number=1 CDS join(963..1376,1480..1660,3549..3715) /partial /gene="PGAM2" /note="phosphoglycerate mutase" /codon_start=1 /db_xref="GDB:G00-120-280" /db_xref="PID:g387016" /translation="MATHRLVMVRHGESTWNQENRFCGWFDAELSEKGTEEAKRGAKA IKDAKMEFDICYTSVLKRAIPTLWAILDGTDQMWLPVVRTCRFNERHYGGLTGLNKAE TAAKHGEEQVKIWRRSFDIPPPPMDEKHPYYNSISKERRYAGLKPGELPTCESLKDTI ARALPFWNEEIVPQIKAGKRVLIAAHGNSLRGIVKHLEGMSDQAIMELNLPTGIPIVY ELNKELKPTKPMQFLGDEETVRKAMEAVAAQGKAK" intron 1377..1479 /note="PGAM-M intron A" exon 1480..1660 /gene="PGAM2" /number=2 intron 1661..3548 /note="PGAM-M intron B" exon 3549..>3715 /note="phosphoglycerate mutase" /number=3 BASE COUNT 806 a 1111 c 1089 g 765 t ORIGIN Chromosome 7. 
1 ccatggcccc ttcattaggg ccccaattgt gactttattg ctcatagtct cttccctgcc 61 ttggtggctc tcatccccca aacctgaatg cagaagtctt ggtcctagac tcaactccgt 121 gccacccttc agcctacgtt gtgggttcct gctaagctga gcatttacct aacaatcaag 181 acttctgaca gtcctcagtc ctgcccccaa acccccttgg atttctcttt ttcaaggtgg 241 tttcggctag gagagtgagc gtggcttggg tgagggcaga tagggtggga gcatggggca 301 tgtatggatg agaccttgac aaagggaccc cggaggaaag acaggggccc tttccccctt 361 tgtcctggaa acccggctca gccccagccc ttgcccattc tgctgctgct gcctggtacc 421 ttccacaagg ccagactcct ctccacaaag ctgtggtctg caccagctcc tctggctctc 481 ctcctctgcc tgctgagggc cgcctcctag cctggctgcc aatcacagga gaaaggggtt 541 gggattttgt ttgtgcctct gtctgagcag agaatggctg ataggcactg agcgttgccc 601 tggagagccc ctctgtccct gctatcccca tctcccctgg cccagacttc tgcccttcac 661 gcccatccct gaccagcagc cccactcagt ctgggctctg ggtgccagct gtatagacat 721 gccacctgaa cccaggccag agctggtgat gcgtggggct attttaagca cagcctcttg 781 gcctgcacac tcccctggcc cccagccccc agcagctcag ctactggtca cctgccaccg 841 cctggaatgc tgattggcag ttggctgggg tgggtggggg ctgggaagac actattataa 901 agctgggagt gttgggaagc agccgtcccc gtccagagtc ctctgtggtc cctgctgcca 961 ccatggccac tcaccgcctc gtgatggtcc ggcacggcga gagcacatgg aaccaggaga 1021 accgtttctg tggctggttc gatgcagagc tgagtgaaaa ggggaccgag gaggccaagc 1081 ggggagccaa ggccatcaag gatgccaaga tggagtttga catctgctac acgtcagtgc 1141 tgaagcgggc catcccgacc ctctgggcca tcctggacgg cacggaccag atgtggctgc 1201 ctgtggtgcg cacttgccgc ttcaatgagc ggcattacgg gggcctcaca ggcctcaaca 1261 aggcagaaac ggccgccaag cacggggagg agcaggtgaa gatctggagg cgctccttcg 1321 acatcccgcc gcccccgatg gacgagaagc acccctacta caactccatt agcaaggtgg 1381 gctgcctttg ctgggaaggc ctctgggaag ctgcagagtg gggagtcggg tgggggccca 1441 ctggcttggg agggaaagca gcgtgcctgt gtcccccagg agcgtcggta cgcaggcctg 1501 aagcccgggg aactccccac ctgcgagagc ctcaaggaca ccattgcccg ggccctgccc 1561 ttctggaacg aggagattgt tccccagatc aaggccggca agcgagtgct cattgcagcc 1621 cacgggaaca gcctgcgggg cattgtcaag cacctggaag gtaggccacc ttcaggagcc 1681 tgggcagggt gggtgggcag 
cagccagctg gcttctcatc tcagcaaagt ctctcgccat 1741 gaccagcttt ctagcgtggc tccacatcat tcactgaaaa gaggctgaga agccattttt 1801 tagttttgtg aaattttccc catttctgtg taactggaca cactccacag gggctgactg 1861 cactcgaagc tcgctgtgtc ccgaggtggg gcaggctcca aaggtggcat ctgccaaggg 1921 acacccagct aggaaacgga agggctgggc ttagagcatc tggctccaaa tcccaactta 1981 ctgtggggcc ctggacaagc cacctccatc tctgggcctc tcccttttcc ggggtggtgg 2041 ggagctcccc ctggtactga attcctcttg atgtaggctt ggacccctcg cagggccctc 2101 ccccatcagg tcctcagaat ccctgcatga gcttcaccac ctatctccct ctggagcccc 2161 tctctgggca aaggaaagac caatcaaaag aggggtgcag gactatggag tggccagact 2221 ctgggcttgc agctgggctc ccactgaaga gcaagggctg acaaatgggc ccgggatgca 2281 tgggcgcagt aaggcctcgc ccagagtgac tggcacctcc gtccgcctcc caccttagta 2341 ttctgacaca agggcagtct aaattagcat ctgaatgacc ttaaagcttg ttgagtcctg 2401 gaaaggctag aagggtgtgc cccagacctc ctgctcctag ggccgttggg cagttggcca 2461 gagcacccag accggcaggc cccggagacc cagccagccc caagcctgcc cgctccaaac 2521 acggacacct ggcacctggc actggggcca ggcagaggga aggaccacct gcctcctctc 2581 ccttccggag acttcatgca gccccatgac cctcccacag cctggtttgg ggaaagggga 2641 cgcacttttg gtggtgaata tgagggattt cactctgact ccccagagaa cattttctta 2701 aacccctccc tgcacggagc aggggtggag tggcgcgaac atcaaaggtc gagctgctat 2761 tcccagctca ggggctgcag gaggcaggca gggtcaggtt tcgaccaggc tcggcctccc 2821 tgtccctcct ccagctccat tccgcacttg ctcctctgtt caggatgtct agaatttaga 2881 gcactttaga aacaaagggt gctgggcacg gtggctcact cctgtaatcc cagcactttg 2941 ggaggctgag gcaggcagat cacctgaggt caggagtttg agaccagcct gaccaacatg 3001 gtaaaacccg tttctactaa aaatacaaaa ttagccgggt gtggtggcgc tcacctgtaa 3061 tcccagctac ttgggaggct gaggcagaat cacttcaacc caggagatgg aggttgcagt 3121 gagccaagat cgtgccactg cactccagcc tgggcaagag gagtaaaact ccatctcaaa 3181 aaaaagaaaa agaaaaagaa aagaaaaaaa aaaaccaaag ggtgagtgtc ccttcctgac 3241 cctcaacttc agtctggctg gagtcacact gggctgaggg aactatggac agcaccacca 3301 cagatcacag ccacttgggt ggggctgaag tccccatttt tttcaccact gggctatttc 3361 tgtaggctgc ttggtctaac tcagttactc 
cttgaccttt ggcaacattt ctgtggcctc 3421 gttctcaggg ctgggaagga attggtgcca ggggaactgg ctctgtggac cataaaggtc 3481 acatagtgtc tgctgtgtaa acaggctggg gacagagggg ctaaggacac ctattccttc 3541 cggcataggg atgtcagacc aggcgatcat ggagctgaac ctgcccacgg ggatccccat 3601 tgtgtatgag ctgaacaagg agctgaagcc caccaagccc atgcagttcc tgggtgatga 3661 ggaaacggtg cggaaggcca tggaggctgt ggctgcccag ggcaaggcca agtgaggggt 3721 gggcttgggc aataaaggca cctcccccaa cagcctggag tctccagcgc a // LOCUS HUMPHOSA 4892 bp DNA PRI 16-AUG-1993 DEFINITION Human phosphoenolpyruvate carboxykinase (PCK1) gene, complete cds with repeats. ACCESSION L12760 NID g307332 KEYWORDS phosphoenolpyruvate carboxykinase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4892) AUTHORS Beale,E.G., Chrapkiewicz,N.B., Scoble,H.A., Metz,R.J., Quick,D.P., Noble,R.L., Donelson,J.E., Biemann,K. and Granner,D.K. TITLE Rat Hepatic Cytosolic Phosphoenolpyruvate Carboxykinase (GTP): Structures of the Protein, Messenger RNA, and Gene JOURNAL J. Biol. Chem. 260, 10748-10760 (1985) MEDLINE 85289263 REFERENCE 2 (bases 1 to 4892) AUTHORS Ting,C.-N., Burgess,D.L., Chamberlain,J.S., Keith,T.P., Falls,K. and Meisler,M.H. TITLE Phosphoenolpyruvate carboxykinase (GTP): characterization of the human PCK1 gene and localization distal to MODY on chromosome 20 JOURNAL Genomics 16, 698-706 (1993) MEDLINE 93315163 REFERENCE 3 (bases 1 to 4892) AUTHORS Yu,H., Thun,R., Chandrasekharappa,S., Trent,J.M., Zhang,J. and Meisler,M.H. TITLE Human PCK1 Encoding Phosphoenolpyruvate Carboxykinase Is Located on Chromosome 20q13.2 JOURNAL Genomics 15, 219-221 (1993) MEDLINE 93162661 FEATURES Location/Qualifiers source 1..4892 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..4892 /gene="PCK1" exon <1..232 /gene="PCK1" /note="This partially untranslated exon corresponds to exon 2 of the reported rat gene. 
Rat exon 1 is untranslated" /citation=[1] /number=1 CDS join(9..232,668..849,1290..1493,1622..1809,2160..2322, 2763..2987,3077..3208,3633..3728,3943..4397) /gene="PCK1" /codon_start=1 /function="gluconeogenesis" /product="phosphoenolpyruvate carboxykinase" /db_xref="PID:g307333" /translation="MPPQLQNGLNLSAKVVQGSLDSLPQAVREFLENNAELCQPDHIH ICDGSEEENGRLLGQMEEEGILRRLKKYDNCWLALTDPRDVARIESKTVIVTQEQRDT VPIPKTGLSQLGRWMSEEDFEKAFNARFPGCMKGRTMYVIPFSMGPLGSPLSKIGIEL TDSPYVVASMRIMTRMGTPVLEALGDGEFVKCLHSVGCPLPLQKPLVNNWPCNPELTL IAHLPDRREIISFGSGYGGNSLLGKKCFALRNASRLAKEEGWLAEHMLILGITNPEGE KKYLAAAFPSACGKSNLAMMNPSLPGWKVECVGDDIAWMKFDAQGHLRAINPENGFFG VAPGTSVKTNPNAIKTIQKNTIFTNVAETSDGGVYWEGIDEPLASGVTITSWKNKEWS SEDGEPCAHPNSRFCTPASQCPIIDAAWESPEGVPIEGIIFGGRRPAGVPLVYEALSW QHGVFVGAAMRSEATAAAEHKGKIIMHDPFAMRPFFGYNFGKYLAHWLSMAQHPAAKL PKIFHVNWFRKDKEGKFLWPGFGENSRVLEWMFNRIDGKASTKLTPIGYIPKEDALNL KGLGHINMMELFSISKEFWEKEVEDIEKYLEDQVNADLPCEIEREILALKQRISQM" intron 233..667 /gene="PCK1" /number=1 exon 668..849 /gene="PCK1" /number=2 intron 850..1289 /gene="PCK1" /number=2 exon 1290..1493 /gene="PCK1" /number=3 intron 1494..1621 /gene="PCK1" /number=3 exon 1622..1809 /gene="PCK1" /number=4 intron 1810..2159 /gene="PCK1" /number=4 exon 2160..2322 /gene="PCK1" /number=5 intron 2323..2762 /gene="PCK1" /number=5 exon 2763..2987 /gene="PCK1" /number=6 intron 2988..3076 /gene="PCK1" /number=6 exon 3077..3208 /gene="PCK1" /number=7 intron 3209..3632 /gene="PCK1" /number=7 exon 3633..3728 /gene="PCK1" /number=8 intron 3729..3942 /gene="PCK1" /number=8 exon 3943..>4892 /gene="PCK1" /citation=[1] /number=9 repeat_region 4742..4785 /gene="PCK1" /citation=[2] /rpt_type=tandem /label=polymorphism BASE COUNT 1257 a 1145 c 1257 g 1233 t ORIGIN 1 ctgcagaaat gcctcctcag ctgcaaaacg gcctgaacct ctcggccaaa gttgtccagg 61 gaagcctgga cagcctgccc caggcagtga gggagtttct cgagaataac gctgagctgt 121 gtcagcctga tcacatccac atctgtgacg gctctgagga ggagaatggg cggcttctgg 181 gccagatgga ggaagagggc atcctcaggc ggctgaagaa gtatgacaac tggtaagctc 241 
ggcccccgct gcctgtccca gcaccctgca ggcagggctc ccctgcgtct cctgggagtt 301 ggtggagaaa ggtgaatgaa ggccttcggg tagtttcaga ctcttgagaa gatgaatgca 361 atggtcagaa ccatacagac ttgaattttg tgacattagt gggccagccc aagctttaaa 421 tgaggtgtgt gcacaaaagc tctgccaact agattcctga ttaaaaaaaa ggcagcccct 481 ctcctacaga ccagctccta gtggagtaaa tgtccacctg gccatgtctt agacggtctg 541 tgtgttcaca gtgatggact gttgttagcg tgctcagcac tctgctaggc atggaaagcc 601 acggtactga aggagatggt tcgctgcccg tggtgcttgg ctgaaaggaa gcctgtgatt 661 tttgcagctg gttggctctc actgacccca gggatgtggc caggatcgaa agcaagacgg 721 ttatcgtcac ccaagagcaa agagacacag tgcccatccc caaaacaggc ctcagccagc 781 tcggtcgctg gatgtcagag gaggattttg agaaagcgtt caatgccagg ttcccagggt 841 gcatgaaagg tgagcggaac attgatttga ttgggtaaaa cagcagagag ccttttctta 901 tttacatcta tcctaatggt aattcaaaca ataatggaag ctccaccacc tcagatgtct 961 ttcagttcca tacatgaagt tggcaatgta ttgaaaatgc acatccctct tctgctttta 1021 cagactgtct tatacaaacg tgaaaactag ttccatgcat atgggtttta aatagcttgg 1081 tggacccaat tgcacattta tgaaaactct ttaatttcag caggctgttc actttccagt 1141 ggcctcttct aacccagggc ctggtgacca gcagggtgtg gtggtgttag tggaacaccg 1201 tgaaaatggg tctacctggg aaaaaaatgc tggtcgtgtt cgaagtccaa ggtcatttct 1261 caccagtgcc cacccatcgc accctgtagg tcgcaccatg tacgtcatcc cattcagcat 1321 ggggccgctg ggctcacctc tgtcgaagat cggcatcgag ctgacggatt cgccctacgt 1381 ggtggccagc atgcggatca tgacgcggat gggcacgccc gtcctggaag cactgggcga 1441 tggggagttt gtcaaatgcc tccattctgt ggggtgccct ctgcctttac aaagtaagtg 1501 tattatttca gaatcaaaag tcaaaataaa aaagaaagct gaacgcaaac cccagtgagt 1561 gcctcgggga cccccaacca gggccctggc gcactgactt gggaggggtc cttgttcaca 1621 gagcctttgg tcaacaactg gccctgcaac ccggagctga cgctcatcgc ccacctgcct 1681 gaccgcagag agatcatctc ctttggcagt gggtacggcg ggaactcgct gctcgggaag 1741 aagtgctttg ctctcaggaa tgccagccgg ctggccaagg aggaagggtg gctggcagag 1801 cacatgctgg tgagctgcag gaagccctga tgtgcagatg agaggcctgg ggggtggcag 1861 aaacaaacag cattacagtt cccacccgtc agcacacctc tctgagcgtg caggttcccg 1921 gacagatcgg gaaaccccac 
cagtaatgat tagtttacac atatacatcg cttttgaagg 1981 gcccccaaaa caccagggga ccatagagat ccttttggac ttcatgattc ttgggagtgt 2041 tgttgcactg atacctgaag gaatagatct tgagggccta cattccaacc tctgggctga 2101 agtaccaacc tcggggagaa ggaaacaaag atcacaataa agaatcttgt ccccaacaga 2161 ttctgggtat aaccaaccct gagggtgaga agaagtacct ggcggccgca tttcccagcg 2221 cctgcgggaa gtccaacctg gccatgatga accccagcct ccccgggtgg aaggttgagt 2281 gcgtcgggga tgacattgcc tggatgaagt ttgacgcaca aggtgactct tttagaccca 2341 actcttggta acgattggac tcaagcgaat cgttggcctt cgaaacatgt cacattctcc 2401 tcagtccagt gtttggattt ttaaactctg ttagtccaga gttggccaag ccttagaata 2461 tggatcctgt aagaattctt caacttaata ttcaatctgg attgaaactg ggccatatgt 2521 tgctgtttgt ttacatacat acatttgttt aaatggtatt ggtggaaaat tgtggaggaa 2581 gcaagagtcg taaacgtatc aaagttgcat atgatgcttg gatgaaaaga gataaatgca 2641 tattctaggg agggaaaaaa gatttgagaa gttggcatag aaattagtcc ggcaatatat 2701 aagagtatat gttctgcttt gcctggcact cactactgct tctctggttt aaaactctcc 2761 aggtcattta agggccatca acccagaaaa tggctttttc ggtgtcgctc ctgggacttc 2821 agtgaagacc aaccccaatg ccatcaagac catccagaag aacacaatct ttaccaatgt 2881 ggccgagacc agcgacgggg gcgtttactg ggaaggcatt gatgagccgc tagcttcagg 2941 cgtcaccatc acgtcctgga agaataagga gtggagctca gaggatggtg tgtccctgcc 3001 agaggcctgt gtgtgccggg ctgcagggac tgcctgtttt gagccaggca ctcacgagcc 3061 tttctctgtc ttatagggga accttgtgcc caccccaact cgaggttctg cacccctgcc 3121 agccagtgcc ccatcattga tgctgcctgg gagtctccgg aaggtgttcc cattgaaggc 3181 attatctttg gaggccgtag acctgctggt gaggctctcc ttcatttagg ctgggaacat 3241 gggtgtgctg ggtaccgaag gcatctgtga aatctctcct tttccatgac cttgtcagag 3301 ggtgcccagg ggcttccttt cttgagcttc cttcccaaag atccagaata attggcaagt 3361 tcaaatgtag aaccaaccct tctggtcacc tggaaccttt ctgaatcctt gatctattgt 3421 agcttgatca aattttactt tttactttgt ggcctcagtc atgtaacttt gagttagcag 3481 ttttctgcaa tttagcttgg tgaatgcaaa actagctcga ttacaagtta ttgtcttgcc 3541 gtgtctttcc gtgttgtgaa taacaccact ggttgtggag tctgaatttc aaagcctctg 3601 atgaacattt ctcttttttt ttcctgctaa 
aggtgtccct ctagtctatg aagctctcag 3661 ctggcaacat ggagtctttg tgggggcggc catgagatca gaggccacag cggctgcaga 3721 acataaaggt aaatcaaagt cctgatctga aaccacagag aagtgggatt agagcactct 3781 tcgtcactct tatgtctctc tccttttctg tgtctgtgtg tggggagaga gagagagaga 3841 aagagagaga ggagaacaaa gcatgctaat gtcaacaatc aatggcgtca gtcttgccta 3901 ggagagcctc atttactaat gaactccctc tctgtttaac aggcaaaatc atcatgcatg 3961 acccctttgc catgcggccc ttctttggct acaacttcgg caaatacctg gcccactggc 4021 ttagcatggc ccagcaccca gcagccaaac tgcccaagat cttccatgtc aactggttcc 4081 ggaaggacaa ggaaggcaaa ttcctctggc caggctttgg agagaactcc agggtgctgg 4141 agtggatgtt caaccggatc gatggaaaag ccagcaccaa gctcacgccc ataggctaca 4201 tccccaagga ggatgccctg aacctgaaag gcctggggca catcaacatg atggagcttt 4261 tcagcatctc caaggaattc tgggagaagg aggtggaaga catcgagaag tatctggagg 4321 atcaagtcaa tgccgacctc ccctgtgaaa tcgagagaga gatccttgcc ttgaagcaaa 4381 gaataagcca gatgtaatca gggcctgagt gctttacctt taaaatcatt ccctttccca 4441 tccataaggt gcagtaggag caagagaggg caagtgttcc caaattgacg ccaccataat 4501 aatcatcacc acaccgtgag cagatctgaa aggcacactt tgattttttt aaggataaga 4561 accacagaac actgggtagt agctaatgaa attgagaagg aaatcttagc atgcctccaa 4621 aaattcacat ccaatgcata gtttgttcaa atttaaggtt actcaggcat tgatcttttc 4681 agtgtttttt cactttagct atgtggatta gctagaatgc acaccaaaaa aatacttgag 4741 ctgtatatat atatgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgcatgt atgtgcacat 4801 gtgtctgtgt ggatatttgt gtatgtgtat ttgtatgtac tgttattgaa aatatattta 4861 atacctttgg aaaaatcttg ggcaagatga cc // LOCUS HUMPIM1A 6113 bp DNA PRI 07-JAN-1995 DEFINITION Human pim-1 proto-oncogene gene, complete cds. ACCESSION M27903 NID g189958 KEYWORDS pim-1 protein; proto-oncogene; tyrosine kinase. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6113) AUTHORS Reeves,R., Spies,G.A., Kiefer,M., Barr,P.J. and Power,M. 
TITLE Primary structure of the putative human oncogene, pim-1 JOURNAL Gene 90 (2), 303-307 (1990) MEDLINE 90382681 COMMENT Draft entry and computer readable copy of sequence [Unpublished (1989) WA State Univ, Pullman, WA 99164] kindly submitted by R.Reeves, 13-SEP-1989. FEATURES Location/Qualifiers source 1..6113 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p21" prim_transcript 780..6018 /note="pim-1 mRNA and introns" exon <1170..1251 /gene="PIM1" /note="pim-1 protein; G00-119-495" /number=1 gene 1170..1251 /gene="PIM1" CDS join(1170..1251,1365..1471,1573..1623,1717..2083, 3588..3764,4526..4683) /partial /note="pim-1 protein" /codon_start=1 /db_xref="PID:g387022" /translation="MLLSKINSLAHLRAAPCNDLHATKLAPGKEKEPLESQYQVGPLL GSGGFGSVYSGIRVSDNLPVAIKHVEKDRISDWGELPNGTRVPMEVVLLKKVSSGFSG VIRLLDWFERPDSFVLILERPEPVQDLFDFITERGALQEELARSFFWQVLEAVRHCHN CGVLHRDIKDENILIDLNRGELKLIDFGSGALLKDTVYTDFDGTRVYSPPEWIRYHRY HGRSAAVWSLGILLYDMVCGDIPFEHDEEIIRGQVFFRQRVSSECQHLIRWCLALRPS DRPTFEEIQNHPWMQDVLLPQETAEIHLHSLSPGPSK" intron 1252..1364 /note="pim-1 intron A" exon 1365..1471 /number=2 intron 1472..1572 /note="pim-1 intron B" exon 1573..1623 /number=3 intron 1624..1716 /note="pim-1 intron C" exon 1717..2083 /number=4 intron 2084..3587 /note="pim-1 intron D" exon 3588..3764 /number=5 intron 3765..4525 /note="pim-1 intron E" exon 4526..>4683 /note="pim-1 protein" /number=6 BASE COUNT 1198 a 1696 c 1716 g 1503 t ORIGIN 6 bp upstream of PstI site. 
1 ggatccttcg cccccgacgc gccccccaac acacaaaccc ccagaatccg cccccagcct 61 acagcgcgac gtcagcccgc cccagccgac ttggaggtct cgggtctgag tcacacagaa 121 agaccaccct cgtcggcatc cccacacaca gtccgacacc cggcgcgccg gcctccccgc 181 ctgacacact aacgcccgtc gtctccgcgc aacttgttat gctccggctc gagcccttga 241 cccaaaaacc tcagcgaaac ggagagccgc agagccggcc tcgggcggcc tttgatggct 301 ttgttattgt ttgggtttga atcgatacgc ccctccccat ccttcctccc tcgcggccct 361 acacccagct cccgcctccc ctcacgcccc gcgcccctcc ccctccattt tggcgccttt 421 tccttcccgc cacgtcgtgg cggcgtagag accattctga ccgcgagagc tgggcggggc 481 gggggcgggg cgcgccgagt tatgcagatc aatcggcctc tggttggctg gagtagcgct 541 ggcaggggcg gggccggggc gcggccacag agcgcgcggg gcgggggccg aggggagtcg 601 cccagtcccg ccgcttcccc accccctctc ctccctcggc cggcccggca gccctgctcc 661 ccgccttggc ctcccggaga ggccccgccc cgtccccgcc cgccgcgccc tccccgcgcg 721 ccctccccgc cggcgcgctc ctccccttta ctcctggctg cggggcgagc cgggcgtctg 781 ctgcagcggc cgcggtggct gaggaggccc gagaggagtc ggtggcagcg gcggcggcgg 841 gaccggcagc agcagcagca gcagcagcag caaccactag cctcctgccc cgcggcgctg 901 ccgacgagcc ccacgagccg ctcaccccgc cgttctcagc gctgcccgac cccgctggcg 961 cgccctcccg ccgccagtcc cggcagcgcc ctcagttgtc ctccgactcg ccctcggcct 1021 tccgcgccag ccgcagccac agccgcaacg ccacccgcag ccacagccac agccacagcc 1081 ccaggcatag ccttcggcac agccccggct ccggctcctg cggcagctcc tctgggcacc 1141 gtccctgcgc cgacatcctg gaggttggga tgctcttgtc caaaatcaac tcgcttgccc 1201 acctgcgcgc cgcgccctgc aacgacctgc acgccaccaa gctggcgccc ggtgagagca 1261 ccccccgctc cggccgggga tgcggggcgg cggcgggatc tcctgggtgg ggagctggcg 1321 gctcgcgggc cggcactgag tccccgtgct tccccctttc ctaggcaagg agaaggagcc 1381 cctggagtcg cagtaccagg tgggcccgct actgggcagc ggcggcttcg gctcggtcta 1441 ctcaggcatc cgcgtctccg acaacttgcc ggtgagtggg cgccccgcgg tggggagggc 1501 gcgccgggcg gggggcgcac gggcgtgctt tagcccggac gagggaacct gacggagacc 1561 ctgggcttcc aggtggccat caaacacgtg gagaaggacc ggatttccga ctggggagag 1621 ctggtgagtg ccctgcagga gcgaccccca ggatgagtgg gtggggtgag gggagccccc 1681 gactcccgcc ctaacgcggc 
cccctcgccc ctgcagccta atggcactcg agtgcccatg 1741 gaagtggtcc tgctgaagaa ggtgagctcg ggtttctccg gcgtcattag gctcctggac 1801 tggttcgaga ggcccgacag tttcgtcctg atcctggaga ggcccgagcc ggtgcaagat 1861 ctcttcgact tcatcacgga aaggggagcc ctgcaagagg agctggcccg cagcttcttc 1921 tggcaggtgc tggaggccgt gcggcactgc cacaactgcg gggtgctcca ccgcgacatc 1981 aaggacgaaa acatccttat cgacctcaat cgcggcgagc tcaagctcat cgacttcggg 2041 tcgggggcgc tgctcaagga caccgtctac acggacttcg atggtgagcc aggcccggga 2101 gggagctgcc caggtgactc ggcccggccc ggcccagtcc ggaggcctcg gccagtctcc 2161 cgcgccagcc ttttgtaaag gtcattgggc cgcctggctc gatgctagcc ggggtgggac 2221 gcaggagagc ctcccagcgt agtaaagccg gggattttca gccagctgaa cctgtaatgt 2281 ttctggcatg attttattct tcaagtggaa ttcagttagt tccaggcttt cccgatgaat 2341 aagaggttgt gggcaaccgg cggtagccca gatttttcta aagtctgacc cagtttcccc 2401 gccagtaaaa ggatgggggc gggggaagag gtggaaattg atgccggttt tgtaattttt 2461 gttttatttt ataagggagt tagttttctg tatggtagtt ttagagctgc agttcttaac 2521 tcctttctga tctagggaag gttaaggaat aggaactata ttattactgg tggctttttt 2581 ttttctttag tgttaagggg agagagagtc aggaatgaat gttgtgaaat aagatctgtc 2641 gctggtttga aaattagttg ggtgtctccg cagagaggat gaaaacctat cctagggagg 2701 ggcttggagc gggttctttc agaaaagaag gaatggagag cctgagatca aagctgcgga 2761 gggtgggtca tcatctgagc ggcttaacct aacaaacgac agcctttcaa aacttgtgac 2821 tcgggcttgt ggtttatgtt tatttgccct tggaggacgc tgggttgggc tgcatttttt 2881 gtatttaaca gttaatggct ggccgggtcc tcccatgttt tttctttcaa gtctttgctg 2941 ccccctggtg agcaccagcg gcatggcccc tcctttttct tttactaccc aaagtttgta 3001 aacaggaatc acgtggtctg aaaccaaccc tgcagctctg tctacctttc cagtctaggg 3061 aggaaagggt gtgggtgtct gctcctgctt gcatcgggtg gggaggaagg cctcccaaag 3121 ggcaccctga cttaggatgt tgtgcaagca tccttgcttg gatcctgtcg gccatgagaa 3181 tctcacccgg gctcctgggc agtggtgaat gcaatttaag gatagtctat gagatacctt 3241 tcttggttgt gcagacatgc atcccttcat ccttcgcagg cggtcctgcc tcacagggcc 3301 tcaagttttg ggtctgcggc cagctgtgtt tgtttcttgg agcagttcat aaagaatttc 3361 agtttatggt ttgggctagc agagaggtgg 
gtaatgcttt gggttggaga gatgccgtaa 3421 ggtgcgcctc cactctcctt agcccagagg gaaaaatgga gttcacctag ctcctgagag 3481 aagggatttt tttttttttt aaaagaaaga gttatatata ccacccagct tctttgtgct 3541 tgtttttgct aaaagtgtgt tttctcttct attcccttgg ctcacaggga cccgagtgta 3601 tagccctcca gagtggatcc gctaccatcg ctaccatggc aggtcggcgg cagtctggtc 3661 cctggggatc ctgctgtatg atatggtgtg tggagatatt cctttcgagc atgacgaaga 3721 gatcatcagg ggccaggttt tcttcaggca gagggtctct tcaggtaact gatggaaacc 3781 cctggccatg gggttattgg tcttaatggg gctattagtc ttcatgggac agtctttgaa 3841 attctggaga gcttcactct ccagtagatt ctgtcaccct tggcttagaa ttgtaggtga 3901 gtgatttaca cttgagctgg cctcataaat cacatggttt gcacttgagc tttccttggg 3961 aggtcagagg aaggcatgtg tgagcatatt aagaagaaaa gacaatctgg cttctccaaa 4021 aactttttta aaggtaccaa cagaaacctg ataattcctg gctgttttgc cagggagtaa 4081 aaagttaaaa gctcttttag catcttcttt aaggcagcag ctccaaatat tttggtacca 4141 gtgacctcac tgtgggtggt gttcgtgttt gtaagttggt aggtgaattg aatcatttca 4201 tcatgctcag tggtgtctca tcaaaatctc ttgtcatcat ccttcctatt tctggtgagt 4261 gggtgttgtg ggaaaggccc ccactattga agtttgaaaa ccacaggttt aagaggggag 4321 tcagttttta gctgaaagca ggctggagga cccagatatt agtcaatacc ttcctattga 4381 agggtaccca gcacagtgtt ctagaaaatg cttggcctcc ctgggacccc agacttgtgg 4441 gcctctgaga agcaaatggg gaagaccttt gcagtgtaaa aacaagttga gtcattcata 4501 acctcgtcta tcctcctttc tgcagaatgt cagcatctca ttagatggtg cttggccctg 4561 agaccatcag ataggccaac cttcgaagaa atccagaacc atccatggat gcaagatgtt 4621 ctcctgcccc aggaaactgc tgagatccac ctccacagcc tgtcgccggg gcccagcaaa 4681 tagcagcctt tctggcaggt cctcccctct cttgtcagat gcccgaggga ggggaagctt 4741 ctgtctccag cttcccgagt accagtgaca cgtctcgcca agcaggacag tgcttgatac 4801 aggaacaaca tttacaactc attccagatc ccaggcccct ggaggctgcc tcccaacagt 4861 ggggaagagt gactctccag gggtcctagg cctcaactcc tcccatagat actctcttct 4921 tctcataggt gtccagcatt gctggactct gaaatatccc gggggtgggg ggtgggggtg 4981 ggtcagaacc ctgccatgga actgtttcct tcatcatgag ttctgctgaa tgccgcgatg 5041 ggtcaggtag gggggaaaca ggttgggatg ggataggact 
agcaccattt taagtccctg 5101 tcacctcttc cgactctttc tgagtgcctt ctgtggggac tccggctgtg ctgggagaaa 5161 tacttgaact tgcctctttt acctgctgct tctccaaaaa tctgcctggg ttttgttccc 5221 tatttttctc tcctgtcctc cctcaccccc tccttcatat gaaaggtgcc atggaagagg 5281 ctacagggcc aaacgctgag ccacctgccc ttttttctgc ctcctttagt aaaactccga 5341 gtgaactggt cttccttttt ggtttttact taactgtttc aaagccaaga cctcacacac 5401 acaaaaaatg cacaaacaat gcaatcaaca gaaaagctgt aaatgtgtgt acagttggca 5461 tggtagtata caaaaagatt gtagtggatc taatttttaa gaaattttgc ctttaagtta 5521 ttttacctgt ttttgtttct tgttttgaaa gatgcgcatt ctaacctgga ggtcaatgtt 5581 atgtatttat ttatttattt atttggttcc cttcctattc caagcttcca tagctgctgc 5641 cctagttttc tttcctcctt tcctcctctg acttggggac cttttggggg agggctgcga 5701 cgcttgctct gtttgtgggg tgacgggact caggcgggac agtgctgcag ctccctggct 5761 tctgtggggc ccctcaccta cttacccagg tgggtcccgg ctctgtgggt gatggggagg 5821 ggcattgctg actgtgtata taggataatt atgaaaagca gttctggatg gtgtgccttc 5881 cagatcctct ctggggctgt gttttgagca gcaggtagcc tgctggtttt atctgagtga 5941 aatactgtac aggggaataa aagagatctt attttttttt ttatacttgg cgttttttga 6001 ataaaaacct tttgtcttaa ctcgtggctt ctaatcgtct gtgcggaggc attgctaacc 6061 tgcatttatt gagcatttgg taagtgccaa agaattgtag gagaaaggaa ttc // LOCUS HUMPP14B 8076 bp DNA PRI 08-JAN-1995 DEFINITION Human placental protein 14 (PP14) gene, complete cds. ACCESSION M34046 NID g190216 KEYWORDS placental protein 14. SOURCE Human (cell line GM1416) DNA, clone PP14G. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8076) AUTHORS Vaisse,C., Atger,M., Potier,B. and Milgrom,E. TITLE Human placental protein 14 gene: Sequence and characterization of a short duplication JOURNAL DNA 9, 401-413 (1990) COMMENT Draft entry and computer-readable sequence for [DNA (1990) In press] kindly submitted by C.Vaisse, 04-MAY-1990. 
FEATURES Location/Qualifiers source 1..8076 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q34" CAAT_signal 2891..2899 /gene="PAEP" /note="G00-127-516" TATA_signal 2942..2948 /gene="PAEP" /note="G00-127-516" prim_transcript 2971..8010 /gene="PAEP" /note="PP14 mRNA and introns; G00-127-516" sig_peptide 3019..3072 /gene="PAEP" /note="placental protein 14 signal peptide; G00-127-516" gene join(3019..3114,3510..3649,4036..4109,5482..5592, 6644..6748,7019..7035) /gene="PAEP" CDS join(3019..3114,3510..3649,4036..4109,5482..5592, 6644..6748,7019..7035) /gene="PAEP" /note="placental protein 14 precursor (PP14)" /codon_start=1 /db_xref="GDB:G00-127-516" /db_xref="PID:g190217" /translation="MLCLLLTLGVALVCGVPAMDIPQTKQDLELPKLAGTWHSMAMAT NNISLMATLKAPLRVHITSLLPTPEDNLEIVLHRWENNSCVEKKVLGEKTENPKKFKI NYTVANEATLLDTDYDNFLFLCLQDTTTPIQSMMCQYLARVLVEDDEIMQGFIRAFRP LPRHLWYLLDLKQMEEPCRF" exon <3019..3114 /gene="PAEP" /note="placental protein 14 precursor (PP14); G00-127-516" /number=1 mat_peptide 3073..3114 /gene="PAEP" /note="placental protein 14; G00-127-516" intron 3115..3509 /gene="PAEP" /note="PP14 intron A; G00-127-516" mat_peptide 3510..3649 /gene="PAEP" /note="placental protein 14; G00-127-516" exon 3510..3649 /gene="PAEP" /note="G00-127-516" /number=2 intron 3650..4035 /gene="PAEP" /note="PP14 intron B; G00-127-516" mat_peptide 4036..4109 /gene="PAEP" /note="placental protein 14; G00-127-516" exon 4036..4109 /gene="PAEP" /note="G00-127-516" /number=3 intron 4110..5481 /gene="PAEP" /note="PP14 intron C; G00-127-516" exon 5482..5592 /gene="PAEP" /note="G00-127-516" /number=4 mat_peptide 5482..5592 /gene="PAEP" /note="placental protein 14; G00-127-516" intron 5593..6643 /gene="PAEP" /note="PP14 intron D; G00-127-516" mat_peptide 6644..6748 /gene="PAEP" /note="placental protein 14; G00-127-516" exon 6644..6748 /gene="PAEP" /note="G00-127-516" intron 6749..7018 /gene="PAEP" /note="PP14 intron E; G00-127-516" exon 7019..>7035 /gene="PAEP" /note="placental protein 14 precursor 
(PP14); G00-127-516" mat_peptide 7019..7032 /gene="PAEP" /note="placental protein 14; G00-127-516" intron 7036..7786 /gene="PAEP" /note="PP14 intron F; G00-127-516" polyA_signal 7992..7997 /gene="PAEP" /note="G00-127-516" BASE COUNT 1636 a 2226 c 2203 g 2011 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcttta cctagcccta cgtcctgaag attttctctc acgctttctt ctaaaagttg 61 tatagtttta tgttttactt ttaaactatg agttaatgca tatgtcaggt gtgagtttta 121 gatggaggtt cttcgtttgc atgggatcga ttgcacgtga tgaattgctc cagcaccatt 181 tgttgcaaag actgtccttc ttttagaggg actcccgctt gccaggcctc tggtttaatg 241 aaacatgacc agagtgactc catcttaacg tgaataacta gacactcaca aggcacctat 301 aaggttatat aacgaggcta tgctgctcga tactgactac gacaatttcc tgtttctctg 361 cctacaggac accaccaccc ccatccagag catgatgtgc cagtacctgg gtgggtctca 421 cagcacatga gctcaacgtg ggtgagaggc agcagctact tccatggctg ggaaccctgg 481 ggagctgaca actggcttcc tgtccacctc agtgcctgtg ggctggtttt ttctttcttg 541 gttttttttt tatttgtttg tttgctttgt tttttttgag acagggtctc attctgtcac 601 tcaggctgga gtgcagtggc gtgatctcgg ctcactacaa cctccacctc ccagattcaa 661 gcgattctcg tgcctcagct tcccgagtag ctgggattac aggcgcgagc ttatggtctg 721 aaaatacccg catactaagc tgaccaccaa ttataactgc agaacattta tgcccatacg 781 aggcatctcc caccaagcct ggagaatgta ccgatgacct gggagtgcag ggggttatct 841 ttgctcacaa ataacgtcaa cgagtaggct gaggctgaag ggcaaatggt cattgatcac 901 actaggagcc cctatcttta gcgagtgcat ctgcatgatc caggtttcac tgtagctcat 961 tacagcttct tacaaacaga ggcactcaca gaggacgggc gttcctcctg ctcgctgagg 1021 ttgcccggct ctggcacaga gtcatttccc ataaacttgc tttcactgtg ctctgtgagt 1081 caccttgaat tctttcccgt gtgagatcta agaacccatt cttggggtct ggactgggac 1141 cctcttttcc gacaacactt cccccacgca ctgttcttgc agctttgtta aaggtctgtt 1201 gttgttgtgt gtgagtccat gtttggactc tctatttccc attgatctat gtgtccattt 1261 ctctgccaat accacaatct cttggtgact attgataata gtgcttgtaa acttgggtag 1321 actaactctt tatactttat tcttcttcaa aatggcttta gcaattgtag ctcctttgcc 1381 ttgacatgta aattttagaa aaaaaaattt ctctatatat ctacatatgg tggggtgtgg 1441 tggctcacac 
ctataattcc agcactttgg gaggccgagg tgagcagatc acttgaggtc 1501 agcagttcga gaccagcttg gccaaaatag tgaaaacccc tctctactaa aaagacaaac 1561 attagccagg tgtggtggtg ggcgcctata atcccagcta ctctggaggc tgaagcagca 1621 ggattacttg aacctgggat gcagaggttg cagtgagtca agatcgtacc actgcactcc 1681 accctgggca acagagcaag acttggcctc aaaaaaatat atatatgtat atatgtgtgt 1741 gtgtgtgtgt atacacacac acacacacac atgtctatag ctacctacaa aatatcttgc 1801 tgagaatttg ataggaattg cattaaatct ctaggttgag ttgaggagag ctaacacctt 1861 tgctgtgttg aagcttccag cccatgaaca cattgtgcct gtcctttacc ttaggtcttt 1921 gatttccctt ccttgggttc agagtttaag tcctgtccat atttggtgag atttttgcct 1981 acatgctgtg tgtgtgatta caaatcctga atttccagtt tcgggttctg tctactcctt 2041 gacactgtgt gggaacacgg tggatttttg tgcgattgac cttgctgacc tcactcggtt 2101 ctaggagggg ttttgtagag ttcttgcaat tttgtacctg ggcaatgatg ttatatgcga 2161 atagagacag gtatacctcc ccctctccaa tctgcctgcc tttagttacc atttcttgcc 2221 tagctgctct ggccggaatg acctgctctg atggatgggg gagtgggagt ggacacccgt 2281 gtcttgcctc aggactcaag tgctctccag ggagtgagat gcgggctcga ttgtaagtta 2341 taaagtgatg cttccctgtg ggaaaaagta catccaatat agactgtagg acaaagtctg 2401 aaagtccact gcctcccatt tccacccagt cttgcctgtg agtcagagag aaacagtgct 2461 aacaggcagt gtgcacctgt ccagaggctg gctgtttgag ggacacaggt gtgcacacct 2521 ggggatgtct gagtggaagg tacaggtcag gattatgacc gtgcagtcag tcacccactg 2581 gcatgatgtc tgtggcatcc tggggcagcc atggggctct caggccctct gcctgcccca 2641 caggccatac ccctgccctg gacacagctg tcctcagtgc tggcctctga cccaacattg 2701 tccaggagcc ccaacccaga aggtgctccc gccgctgcca gcctggaccc gacccaggcc 2761 cctcccgcct gaggccctgc caagaactgc ccagcccgga cacagaggag gttccgcgtg 2821 gacgcaggga agagcctccc attgccccag tggaggaagc tgcccagggg ccaaggatga 2881 gtcacaggtt cgaggaatca catggcgagg ctgtgggcgg ggatcttgtc tgccctcctc 2941 ctacataagg ccccctgagc ccacactgcc tcagcatccc tctggctcca gagctcagag 3001 ccacccacag ccgcagccat gctgtgcctc ctgctcaccc tgggcgtggc cctggtctgt 3061 ggtgtcccgg ccatggacat cccccagacc aagcaggacc tggagctccc aaaggtttga 3121 ggctggggga gcgggcactt 
tactgtggga ggcctggggc gggtgggagc tgcgggcagg 3181 cgggaagcca ggatctcaga aacctacagg aagcacagaa tggacgccat gacgtcagga 3241 agccctcagc cctgctctcc atctttaggg tggcctctct ggtttcccag catcctaggt 3301 gactcattat ttggactttg gaacactcct gagttagcac acactggtca ttttaagtac 3361 aggaaatttc atagcccagg atctggtaga tagcagacaa ccatccaatg ctcactgtac 3421 ccatcccagt tagactcagc cccgtctgca ccgggtgcaa cgagagccat ggtggggtgg 3481 gaccgccgtg cagcccaagg ccccctcagt tggcagggac ctggcactcc atggccatgg 3541 cgaccaacaa catctccctc atggcgacac tgaaggcccc tctgagggtc cacatcacct 3601 cactgttgcc cacccccgag gacaacctgg agatcgttct gcacagatgg tgggtttctc 3661 atcattgaga cgggctgggc gggggctcag tctcccccct caggggtcca ggactgggtg 3721 ggttgggcgg agctggactt agccccaggc attttctgac agccaggggc ttcactgtgg 3781 cccttccatg agggtggggt ggaaaaccag ggctccagac gttccctgtc cccttggatc 3841 ccctgcccca ggctctgggc caacagccaa ccacacagtg cagccccagg tcagactgag 3901 gagaaggtct gggcggctgc gggctgcggt gctccttgga cccggggaag ttcccgtggt 3961 gacctgattt taggagtgac agtgaaggca actccaattc aagtggccac tcatcctatt 4021 gtcaccacct ttcagggaga acaacagctg tgttgagaag aaggtccttg gagagaagac 4081 tgagaatcca aagaagttca agatcaactg tgagtgtccc caggccccaa gggctggctc 4141 agtgctggca tgctagccac gctctcccag aggcggctct gctggggcat gagggagtgg 4201 ggcctggcct gtccccactc tctctgcttc agggagtcag agtgtttact ccggtcaacc 4261 tgatgctgac cccagaggca tcttttacct ggagggcagg ggaagcacta attcttggca 4321 tgacatgact ggatgtgggt ctgcactgtg cccaggccaa ggggacaggt gctttgttgc 4381 actgttcact ctggcctcac aaaaggccag ggaggctgca ggcgagcagg tgggcaggtg 4441 ggcaggtggg taggtgggta ggtggatatg tatacaggtg gcaggagggt aggtgaacag 4501 gtgggtaggt gggcaggtgg ctaggtgagt aagtggttag gtgaacaggt gggcaggtga 4561 gcaggtggtt aggtgaacag gtgggcaggt ggataggtga acaggtgggc aggtgggtag 4621 gtgggtaggt atacaggtgg acaggtgggt aggtggacag gtgggcaggt gagtaggcga 4681 acaggtgggt gggtgaacag gtggccaggt gaacaagttg gtaggtgggc aggtgggtag 4741 gtgggtaggt gaacaggtgg gcaggtgggc aggtgggcag gtgggcaagt ggctgctgtt 4801 cccgtgggcc tggctgcctc ctgcgcactc 
tggggctgca gctctggtct taggctgagc 4861 tcccaggcct ctctggggga agagagaggg gcttacagca tgtccttggt ccactgaatt 4921 cttcctaaca atttgcaaca ttttgttcta ttttgttaat tattattttt ttaaaaagac 4981 agaggtggtc agggtctggg gcctcttatc ccctcatggg cacattttcc cagcaaatac 5041 agtttgcttc tcatgcttgg gacttgcctc aggcctttct gaccctgctt gccctcccca 5101 gaatcgagcc actctccaag gtccatttct tctccctcct cccgcccctg tgccctgttc 5161 ctgtgccatc tcccgccatc ctcacccgta cgtgacttct cagttggagt ctctccaggt 5221 cacagcctcc ctgcctgccg tgtctgcctc tccacggcac acctggcctc tcgccctcag 5281 ccggggctcc atggccctcc acattgcctc tcctcccctt tcctccctgg cttccctgat 5341 catggtccac agcaggggcc acgtcccatg gtgtcagtga tgaggaagcc acttagtgtg 5401 gtgggatgtc cacacacctg cacaggactc tgctgagacg gaggcttcat cttccttttg 5461 gttcttctct tctttcccca gatacggtgg cgaacgaggc cacgctgctc gatactgact 5521 acgacaattt cctgtttctc tgcctacagg acaccaccac ccccatccag agcatgatgt 5581 gccagtacct gggtgggtct cacagcacat gagctcaacg tgggtgagag gcagcagcta 5641 cgtccatggc tgggaaccct ggggagctga caactggctt cctgtccacc tcggtgcctg 5701 tgggctgact ttttctttct tggttttttt tatttgtttg ttgtttgctt tgtttttttg 5761 agaaagggtc tcattctgtc actcaggctg gagtgtagtg acgtgatctc ggctcactgc 5821 aacttctgcc tcccagattc aagcgattct cgtgcctcag cctccagagt agctgggatt 5881 acaggcgcac gccaccatgc ccagctaatt tttgtatttt ttggtagaga cggggtttca 5941 ccatgttggc caggctggtc tccaactcct ggatcaagtg atccacccgc cttggcctcc 6001 caaaggctgg gattacaggt catccaccac gcccggccag gctgagtttt tctccagcgg 6061 ttcatcgagt cctctgacaa agcaaggagc tgatataggg ccagtgggac ggtcgccagt 6121 caaggggctg ggcttggtgg atagattaat actcactggg cgtccagtca aaacgccctg 6181 aaacctatga tgctgtcaac caaacgaagg ccaggaatac caaaatagcc acataggcac 6241 agcccttccc catgtttctg agcacagtgt ttcctctggg gtcacacagg tgtcttcttg 6301 atcagcctca gccatgcttg gtgagagccg ggcactggga gagccaggca ctgtgctctc 6361 ctgtgacgct gtagacacca tcctaagctg tgcagacccc agcgctgccc agagcggagc 6421 agagggggcc gggcaaggag tgggagctgg ggtcagggaa cctggaggtg cagtggacag 6481 agccccggag accgccctag ggacctactc cagaccaaac 
tctgccagac ctcggagcac 6541 tggggcctcc ttctctgccc tccctcctca ggcaaggcct ctggagctcc ccagctctca 6601 tggaagcccc aggggcccag gactgaccca gcctcttcca cagccagagt cctggtggag 6661 gacgatgaga tcatgcaggg attcatcagg gctttcaggc ccctgcccag gcacctatgg 6721 tacttgctgg acttgaaaca gatggaaggt gagctctgcc taggacacgc ccagcctcag 6781 ctggaggaga agctgcctct ttcttagccc gagccccctg ctggctctgc aggactcagg 6841 tcactccttt ttggcccctc ccctgttctc ccctggcctt ctggggtgca gagccaccct 6901 gaggtggggt cctgccctct cccaccatcc tttcatccct tctctagccc tggggctgct 6961 gtgtccccag ctgtctcttc tctcgctgac acctccactg tcccatctcc tcccacagag 7021 ccgtgccgtt tctaggtgag ctcctgcctg gtcctgcctc ctgggtaatg tatcagcctc 7081 gcccactgtc tgcggctgcc tctctgggcc cctgggacag accctactgt gtccagttca 7141 gggctgaccc tacaggaatg aactggggtc tggtcttgtg attccagaaa gccaggctgc 7201 tgacgtcccc attcacgagc ccagcctgtg tcttgcagcc attgtattag tcacgggctt 7261 gtgccctata gtcagacctc atgctttctt ttggggttag gggtgttggt tggaaatggt 7321 gggggctata ggaggaggaa ggaggatggt tacatggaag ggcatgagaa gctggggacc 7381 tgcaggtctc ggtcccacgt tctttttttt ttttcttttt ttaagatgga gtctcgctct 7441 gtcaccaggc tggagtgcag tggcacaatc tcagctcact gcaacctcga cctcctgggt 7501 tcaagcgatt ctcctgcctc aaccccccga gttgctggaa ccacaggcgt gtgccaccat 7561 gcccagctaa tttttgtatt tttaatagaa acggggtttc accatgttgg ccacgatggt 7621 ctcaatctct tgacctcatg atccccccgc tttagcctcc caaagtgctg ggatttcagt 7681 gccacattct taagggggtg tgctcaagcc caccacatcc ttccagggct cccccgaaac 7741 accctgctct tcctccctct acttaagtga cctgtaaacc caacagctca cctccgcctc 7801 caggaagacc agactcccac ccttccacac ctccagagca gtgggacttc ctcctgccct 7861 ttcaaagaat aaccacagct cagaagacga tgacgtggtc atctgtgtcg ccatcccctt 7921 cctgctgcac acctgcacca cggccatggg gaggctgctc cctgggggca gagtctctgg 7981 cagaggttat taataaaccc ttggagcatg tcctgtctgg atgcgcagcc actgctgggt 8041 gtgggattca gggacgaggg cctggggtcg gggcag // LOCUS HUMPPPA 2775 bp DNA PRI 08-JAN-1995 DEFINITION Human pancreatic polypeptide gene, complete cds. ACCESSION M11726 NID g190269 KEYWORDS hormone. 
SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2775) AUTHORS Leiter,A.B., Montminy,M.R., Jamieson,E. and Goodman,R.H. TITLE Exons of the human pancreatic polypeptide gene define functional domains of the precursor JOURNAL J. Biol. Chem. 260 (24), 13013-13017 (1985) MEDLINE 86033734 FEATURES Location/Qualifiers source 1..2775 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17p11.1-qter" prim_transcript 937..2601 /note="mRNA and introns" intron 995..1750 /note="intron A" gene 1751..1941 /gene="PPY" CDS join(1751..1941,2192..2263,2455..2479) /note="pancreatic polypeptide precursor" /codon_start=1 /db_xref="PID:g190270" /translation="MAAARLCLSLLLLSTCVALLLQPLLGAQGAPLEPVYPGDNATPE QMAQYAADLRRYINMLTRPRYGKRHKEDTLAFSEWGSPHAAVPRELSPLDL" exon <1751..1941 /gene="PPY" /note="pancreatic polypeptide precursor, (first translated exon); G00-120-311" /number=2 intron 1942..2191 /note="intron B" exon 2192..2263 intron 2264..2454 /note="intron C" exon 2455..>2479 /note="pancreatic polypeptide precursor" /number=4 BASE COUNT 532 a 794 c 826 g 623 t ORIGIN Chromosome 17p11.1-qter. 
1 tagccaaaaa gtgggaaatt ataacaatac ttgacggccc cacttaggaa aaaaaacagt 61 ccaggtttgg agttaaattt tggggtctga ctggctgtgg ggtttggggc cagtgatttg 121 acatctctga gcagatttca gaccctgaat ctgtaaaatg ggttgttgtg gggatttagt 181 gaggtgactg agctggcagc actttggaaa atgtgaggct ttacaaatga agctggtttt 241 tagcacttcc ttggaatggg ggctctgaga tggggaggga aatggatact ggaaaccggc 301 acccccctca acatttcccc tgaacctagg cacatgagtc cattctgcat cctatgcacc 361 caccaaatgg gtttctccaa gcctggcacc ccactccttt ccaccttgcc tgaggaccct 421 gaggcccctt ccctcttccc accttgttct agttgcaagc tgcagaggat ggcctgagtt 481 tccattcttc agcctggagg ctctggctgt agcctcagtc ttccagctgc agcctcaatc 541 ttctgcccca agagaatccg ttggatgatt tgggttgatc tttgggaggc aggagagcta 601 gttccaaggc ctgtgaccaa aacctttcag tcactctcga gtcccagaac cccatctttc 661 ttgaaccaag cctgctctgg tgcccaaagg gaaggggtgt cagtgatggg ggcccacagg 721 agacttttac tgtggtccag ttttatctgt tatgctgctg gtctggcacc cacagaactg 781 ccctgtcccc ctcccgtctc tggcctcagt actggcgttg ccagtatgag gagacagtga 841 gaggaggggt gagggcttcg gagtgagaga ggctgacagg aggcggggac tctggggggc 901 tgaggactat aaagcggccc agccgagggc aggggcccat ccggcctgag gcaggtgctc 961 gcttggtcta gtgcccattt actctggact ccgggtaagt gggctcacgc ctgcctgggg 1021 ctctgggggc ctcccccagg ctggggcctg ggatggagag gaggtggtgc tttgcagggc 1081 caggggcctt gggaggccta aggtcctgct tgggtcttgc tttctgcatc tggacagacc 1141 tgtcacgcta gaagagccgt ctgctgactg cacgtgtgtg tgcacactcg tgtgcatggc 1201 ctgtgaactg gaatgtgtga ctgtgacctt gtgagtgaac tggggtccct gtgtgagtgc 1261 ctttgctggt ctgtgcagga cgacatggac aatagcgtct tctccaggac cgatatctgt 1321 gtctctatcg cagggtggag ggccagcacc cacaaggcca tggtttctgt ccttggcttc 1381 ccgagttcat tctgtgtccc ccaccctgtg ggtgacctgc atgcccttat tcccactagc 1441 tgctgcctcc tgggagccat gaggccctag atgtcatcag tcccgtgtct ccaggactca 1501 catccccatt ttaactaact tgcgaggccc tggcttgctg ggctgctcac aggacaggct 1561 gtccgtgcct tcagagcagt gtctaggagg tggagtggcc agcttggagt ggccctttgc 1621 tctgccccct tgtccccaag cttggagaaa tggatgatgg gctaaggggc tggatagttg 1681 ggccctgctt cctaggcacg 
ggaaaatctg caggcccggg cactccacct cccctctgct 1741 tgctcctcag atggctgccg cacgcctctg cctctccctg ctgctcctgt ccacctgcgt 1801 ggctctgtta ctacagccac tgctgggtgc ccagggagcc ccactggagc cagtgtaccc 1861 aggggacaat gccacaccag agcagatggc ccagtatgca gctgatctcc gtagatacat 1921 caacatgctg accaggccta ggtgtgtgcc acagttgggg agagagatcc cagcccctgg 1981 gaccctgggc ccactccaca ttcctggcca caccctatcc ccagccccag cccccagccc 2041 cttctaggcc tgctcttggg aaacagggca tctgtcgctc aacaggccag accaatgtgc 2101 ctgggcaaga tggtgtccta caggtcagat atgaaacagg tgggctggca cctgggcaca 2161 gtgcttgccc ctgctgcctc ttccctccca ggtatgggaa aagacacaaa gaggacacgc 2221 tggccttctc ggagtggggg tccccgcatg ctgctgtccc caggtgagtt tgactccctg 2281 ccctgtctgt ccaggctccc tggggctgaa atgggggtgg tgggactgaa tcagggcttg 2341 gaaaggtgta gtggggggtg gaagagggag aacaggagcc cagggccagc gtgaggcctc 2401 ctgagggcac gaggcctacc ccctacactg ccatgttctg ccctgtcctc acagggagct 2461 cagcccgctg gacttataat gccaccttct gtctcctacg actccatgag cagcgccagc 2521 ccagctctcc cctctgcacc cttggctctg gccaagcttg ctccctgctc ccacacaggc 2581 tcaataaagc aagtcaaagc cacagcctgt cctgtgtctc tgtggacccc tggtgggggc 2641 agcaggaagt gccaaagtgc caaatgcatg ggagagaaag gggagggcag ggcagggcga 2701 ggaaaagggg atcatccaga acgttggcgg gggtcatcag agtcactcaa ctcatcctct 2761 ccccttcgca gaggg // LOCUS HUMPRCA 11725 bp DNA PRI 08-JAN-1995 DEFINITION Human protein C gene, complete cds. ACCESSION M11228 NID g190333 KEYWORDS glycoprotein; protease; protein C; serine protease. SOURCE Human DNA, clones PC-lambda-8 and PC-lamda-6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11725) AUTHORS Foster,D.C., Yoshitake,S. and Davie,E.W. TITLE The nucleotide sequence of the gene for human protein C JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
82 (14), 4673-4677 (1985) MEDLINE 85270390 FEATURES Location/Qualifiers source 1..11725 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q13-q21" gene 2131..2200 /gene="PROC" exon <2131..2200 /gene="PROC" /note="Protein C; G00-120-317" /number=1 sig_peptide join(2131..2200,3464..3519) /note="Protein C signal peptide" CDS join(2131..2200,3464..3630,5093..5117,5210..5347, 5450..5584,8253..8395,9269..9386,10516..11105) /note="Protein C" /codon_start=1 /db_xref="PID:g190334" /translation="MWQLTSLLLFVATWGISGTPAPLDSVFSSSERAHQVLRIRKRAN SFLEELRHSSLERECIEEICDFEEAKEIFQNVDDTLAFWSKHVDGDQCLVLPLEHPCA SLCCGHGTCIDGIGSFSCDCRSGWEGRFCQREVSFLNCSLDNGGCTHYCLEEVGWRRC SCAPGYKLGDDLLQCHPAVKFPCGRPWKRMEKKRSHLKRDTEDQEDQVDPRLIDGKMT RRGDSPWQVVLLDSKKKLACGAVLIHPSWVLTAAHCMDESKKLLVRLGEYDLRRWEKW ELDLDIKEVFVHPNYSKSTTDNDIALLHLAQPATLSQTIVPICLPDSGLAERELNQAG QETLVTGWGYHSSREKEAKRNRTFVLNFIKIPVVPHNECSEVMSNMVSENMLCAGILG DRQDACEGDSGGPMVASFHGTWFLVGLVSWGEGCGLLHNYGVYTKVSRYLDWIHGHIR DKEAPQKSWAP" intron 2201..3463 /note="ProC cds intron A" exon 3464..3630 /number=2 mat_peptide join(3520..3630,5093..5117,5210..5347,5450..5584, 8253..8395,9269..9386,10516..11102) intron 3631..5092 /note="ProC cds intron B" exon 5093..5117 /number=3 intron 5118..5209 /note="ProC cds intron C" exon 5210..5347 /number=4 intron 5348..5449 /note="ProC cds intron D" exon 5450..5584 /number=5 intron 5585..8252 /note="ProC cds intron E" exon 8253..8395 /number=6 intron 8396..9268 /note="ProC cds intron F" exon 9269..9386 /number=7 intron 9387..10515 /note="ProC cds intron G" exon 10516..>11105 /note="Protein C" /number=8 BASE COUNT 2444 a 3298 c 3375 g 2608 t ORIGIN 575 bp upstream of StuI site; chromosome 2q14-q21. 
1 agtgaatctg ggcgagtaac acaaaacttg agtgtcctta cctgaaaaat agaggttaga 61 gggatgctat gtgccattgt gtgtgtgtgt tgggggtggg gattgggggt gatttgtgag 121 caattggagg tgagggtgga gcccagtgcc cagcacctat gcactgggga cccaaaaagg 181 agcatcttct catgatttta tgtatcagaa attgggatgg catgtcattg ggacagcgtc 241 ttttttcttg tatggtggca cataaataca tgtgtcttat aattaatggt attttagatt 301 tgacgaaata tggaatatta cctgttgtgc tgatcttggg caaactataa tatctctggg 361 caaaaatgtc cccatctgaa aaacagggac aacgttcctc cctcagccag ccactatggg 421 gctaaaatga gaccacatct gtcaagggtt ttgccctcac ctccctccct gctggatggc 481 atccttggta ggcagaggtg ggcttcgggc agaacaagcc gtgctgagct aggaccagga 541 gtgctagtgc cactgtttgt ctatggagag ggaggcctca gtgctgaggg ccaagcaaat 601 atttgtggtt atggattaac tcgaactcca ggctgtcatg gcggcaggac ggcgaacttg 661 cagtatctcc acgacccgcc cctgtgagtc cccctccagg caggtctatg aggggtgtgg 721 agggagggct gcccccggga gaagagagct aggtggtgat gagggctgaa tcctccagcc 781 agggtgctca acaagcctga gcttggggta aaaggacaca aggccctcca caggccaggc 841 ctggcagcca cagtctcagg tccctttgcc atgcgcctcc ctctttccag gccaagggtc 901 cccaggccca gggccattcc aacagacagt ttggagccca ggaccctcca ttctccccac 961 cccacttcca cctttggggg tgtcggattt gaacaaatct cagaagcggc ctcagaggga 1021 gtcggcaaga atggagagca gggtccggta gggtgtgcag aggccacgtg gcctatccac 1081 tggggagggt tccttgatct ctggccacca gggctatctc tgtggccttt tggagcaacc 1141 tggtggtttg gggcaggggt tgaatttcca ggcctaaaac cacacaggcc tggccttgag 1201 tcctggctct gcgagtaatg catggatgta aacatggaga cccaggacct tgcctcagtc 1261 ttccgagtct ggtgcctgca gtgtactgat ggtgtgagac cctactcctg gaggatgggg 1321 gacagaatct gatcgatccc ctgggttggt gacttccctg tgcaatcaac ggagaccagc 1381 aagggttgga tttttaataa accacttaac tcctccgagt ctcagtttcc ccctctatga 1441 aatggggttg acagcattaa taactacctc ttgggtggtt gtgagcctta actgaagtca 1501 taatatctca tgtttactga gcatgagcta tgtgcaaagc ctgttttgag agctttatgt 1561 ggactaactc ctttaattct cacaacaccc tttaaggcac agatacacca cgttattcca 1621 tccattttac aaatgaggaa actgaggcat ggagcagtta agcatcttgc ccaacattgc 1681 cctccagtaa gtgctggagc 
tggaatttgc accgtgcagt ctggcttcat ggcctgccct 1741 gtgaatcctg taaaaattgt ttgaaagaca ccatgagtgt ccaatcaacg ttagctaata 1801 ttctcagccc agtcatcaga ccggcagagg cagccacccc actgtcccca gggaggacac 1861 aaacatcctg gcaccctctc cactgcattc tggagctgct ttctaggcag gcagtgtgag 1921 ctcagcccca cgtagagcgg gcagccgagg ccttctgagg ctatgtctct agcgaacaag 1981 gaccctcaat tccagcttcc gcctgacggc cagcacacag ggacagccct ttcattccgc 2041 ttccacctgg gggtgcaggc agagcagcag cgggggtagc actgcccgga gctcagaagt 2101 cctcctcaga caggtgccag tgcctccaga atgtggcagc tcacaagcct cctgctgttc 2161 gtggccacct ggggaatttc cggcacacca gctcctcttg gtaaggccac cccaccccta 2221 ccccgggacc cttgtggcct ctacaaggcc ctggtggcat ctgcccaggc cttcacagct 2281 tccaccatct ctctgagccc tgggtgaggt gaggggcaga tgggaatggc aggaatcaac 2341 tgacaagtcc caggtaggcc agctgccaga gtgccacaca ggggctgcca gggcaggcat 2401 gcgtgatggc agggagcccc gcgatgacct cctaaagctc cctcctccac acggggatgg 2461 tcacagagtc ccctgggcct tccctctcca cccactcact ccctcaactg tgaagacccc 2521 aggcccaggc taccgtccac actatccagc acagcctccc ctactcaaat gcacactggc 2581 ctcatggctg ccctgcccca acccctttcc tggtctccac agccaacggg aggaggccat 2641 gattcttggg gaggtccgca ggcacatggg cccctaaagc cacaccaggc tgttggtttc 2701 atttgtgcct ttatagagct gtttatctgc ttgggacctg cacctccacc ctttcccaag 2761 gtgccctcag ctcaggcata ccctcctcta ggatgccttt tcccccatcc cttcttgctc 2821 acacccccaa cttgatctct ccctcctaac tgtgccctgc accaagacag acacttcaca 2881 gagcccagga cacacctggg gacccttcct gggtgatagg tctgtctatc ctccaggtgt 2941 ccctgcccaa ggggagaagc atggggaata cttggttggg ggaggaaagg aagactgggg 3001 ggatgtgtca agatggggct gcatgtggtg tactggcaga agagtgagag gatttaactt 3061 ggcagccttt acagcagcag ccagggcttg agtacttatc tctgggccag gctgtattgg 3121 atgttttaca tgacggtctc atccccatgt ttttggatga gtaaattgaa ccttagaaag 3181 gtaaagacac tggctcaagg tcacacagag atcggggtgg ggttcacagg gaggcctgtc 3241 catctcagag caaggcttcg tcctccaact gccatctgct tcctggggag gaaaagagca 3301 gaggacccct gcgccaagcc atgacctaga attagaatga gtcttgaggg ggcggagaca 3361 agaccttccc aggctctccc agctctgctt 
cctcagaccc cctcatggcc ccagcccctc 3421 ttaggcccct caccaaggtg agctcccctc cctccaaaac cagactcagt gttctccagc 3481 agcgagcgtg cccaccaggt gctgcggatc cgcaaacgtg ccaactcctt cctggaggag 3541 ctccgtcaca gcagcctgga gcgggagtgc atagaggaga tctgtgactt cgaggaggcc 3601 aaggaaattt tccaaaatgt ggatgacaca gtaaggccac catgggtcca gaggatgagg 3661 ctcaggggcg agctggtaac cagcaggggc ctcgaggagc aggtggggac tcaatgctga 3721 ggccctctta ggagttgtgg gggtggctga gtggagcgat taggatgctg gccctatgat 3781 gtcggccagg cacatgtgac tgcaagaaac agaattcagg aagaagctcc aggaaagagt 3841 gtggggtgac cctaggtggg gactcccaca gccacagtgt aggtggttca gtccaccctc 3901 cagccactgc tgagcaccac tgcctccccg tcccacctca caaagagggg acctaaagac 3961 caccctgctt ccacccatgc ctctgctgat cagggtgtgt gtgtgaccga aactcacttc 4021 tgtccacata aaatcgctca ctctgtgcct cacatcaaag ggagaaaatc tgattgttca 4081 gggggtcgga agacagggtc tgtgtcctat ttgtctaagg gtcagagtcc tttggagccc 4141 ccagagtcct gtggacgtgg ccctaggtag tagggtgagc ttggtaacgg ggctggcttc 4201 ctgagacaag gctcagaccc gctctgtccc tggggatcgc ttcagccacc aggacctgaa 4261 aattgtgcac gcctgggccc ccttccaagg catccaggga tgctttccag tggaggcttt 4321 cagggcagga gaccctctgg cctgcaccct ctcttgccct cagcctccac ctccttgact 4381 ggacccccat ctggacctcc atccccacca cctctttccc cagtggcctc cctggcagac 4441 accacagtga ctttctgcag gcacatatct gatcacatca agtccccacc gtgctcccac 4501 ctcacccatg gtctctcagc cccagcagcc ttggctggcc tctctgatgg agcaggcatc 4561 aggcacaggc cgtgggtctc aacgtgggct gggtggtcct ggaccagcag cagccgccgc 4621 agcagcaacc ctggtacctg gttaggaacg cagaccctct gcccccatcc tcccaactct 4681 gaaaaacact ggcttaggga aaggcgcgat gctcaggggt cccccaaagc ccgcaggcag 4741 agggagtgat gggactggaa ggaggccgag tgacttggtg agggattcgg gtcccttgca 4801 tgcagaggct gctgtgggag cggacagtcg cgagagcagc actgcagctg catggggaga 4861 gggtgttgct ccagggacgt gggatggagg ctgggcgcgg gcgggtggcg ctggagggcg 4921 ggggaggggc agggagcacc agctcctagc agccaacgac catcgggcgt cgatccctgt 4981 ttgtctggaa gccctcccct cccctgcccg ctcacccgct gccctgcccc acccgggcgc 5041 gcccctccgc acaccggctg caggagcctg acgctgcccg 
ctctctccgc agctggcctt 5101 ctggtccaag cacgtcggtg agtgcgttct agatccccgg ctggactacc ggcgcccgcg 5161 cccctcggga tctctggccg ctgaccccct accccgcctt gtgtcgcaga cggtgaccag 5221 tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca cggcacgtgc 5281 atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg ccgcttctgc 5341 cagcgcggtg agggggagag gtggatgctg gcgggcggcg gggcggggct ggggccgggt 5401 tgggggcgcg gcaccagcac cagctgcccg cgccctcccc tgcccgcaga ggtgagcttc 5461 ctcaattgct ctctggacaa cggcggctgc acgcattact gcctagagga ggtgggctgg 5521 cggcgctgta gctgtgcgcc tggctacaag ctgggggacg acctcctgca gtgtcacccc 5581 gcaggtgaga agcccccaat acatcgccca ggaatcacgc tgggtgcggg gtgggcaggc 5641 ccctgacggg cgcggcgcgg ggggctcagg agggtttcta gggagggagc gaggaacaga 5701 gttgagcctt ggggcagcgg cagacgcgcc caacaccggg gccactgtta gcgcaatcag 5761 cccgggagct gggcgcgccc tccgctttcc ctgcttcctt tcttcctggc gtccccgctt 5821 cctccgggcg cccctgcgac ctggggccac ctcctggagc gcaagcccag tggtggctcc 5881 gctccccagt ctgagcgtat ctggggcgag gcgtgcagcg tcctcctcca tgtagcctgg 5941 ctgcgttttt ctctgacgtt gtccggcgtg catcgcattt ccctctttac ccccttgctt 6001 ccttgaggag agaacagaat cccgattctg ccttcttcta tattttcctt tttatgcatt 6061 ttaatcaaat ttatatatgt atgaaacttt aaaaatcaga gttttacaac tcttacactt 6121 tcagcatgct gttccttggc atgggtcctt ttttcattca ttttcataaa aggtggaccc 6181 ttttaatgtg gaaattccta tcttctgcct ctagggcatt tatcacttat ttcttctaca 6241 atctcccctt tacttcctct attttctctt tctggacctc ccattattca gacctctttc 6301 ctctagtttt attgtctctt ctatttccca tctctttgac tttgtgtttt ctttcaggga 6361 actttctttt ttttcttttt ttttgagatg gagtttcact cttgttgtcc caggctggag 6421 tgcaatgacg tgatctcagc tcaccacaac ctccgcctcc tggattcaag cgattctcct 6481 gccgcagcct cccgagtagc tgggattaca ggcatgcgcc accacgccca gctaattttg 6541 tgtttttagt agagaagggg tttctccgtg ttggtcaagc tggtcttgaa ctcctgacct 6601 caggtgatcc acctgccttg gcctcctaaa gtgctgggat tacaggcgtg agccaccgcg 6661 cccagcctct ttcagggaac tttctacaac tttataattc aattcttctg cagaaaaaaa 6721 tttttggcca ggctcagtag ctcagaccaa taattccagc actttgagag 
gctgaggtgg 6781 gaggattgct tgagcttggg agtttgagac tagcctgggc aacacagtga gaccctgtct 6841 ctatttttaa aaaaagtaaa aaaagatcta aaaatttaac tttttatttt gaaataatta 6901 gatatttcca ggaagctgca aagaaatgcc tggtgggcct gttggctgtg ggtttcctgc 6961 aaggccgtgg gaaggccctg tcattggcag aaccccagat cgtgagggct ttccttttag 7021 gctgctttct aagaggactc ctccaagctc ttggaggatg gaagacgctc acccatggtg 7081 ttcggcccct cagagcaggg tggggcaggg gagctggtgc ctgtgcaggc tgtggacatt 7141 tgcatgactc cctgtggtca gctaagagca ccactccttc ctgaagcggg gcctgaagtc 7201 cctagtcaga gcctctggtt caccttctgc aggcagggag aggggagtca agtcagtgag 7261 gagggctttc gcagtttctc ttacaaactc tcaacatgcc ctcccacctg cactgccttc 7321 ctggaagccc cacagcctcc tatggttccg tggtccagtc cttcagcttc tgggcgcccc 7381 catcacgggc tgagattttt gctttccagt ctgccaagtc agttactgtg tccatccatc 7441 tgctgtcagc ttctggaatt gttgctgttg tgccctttcc attcttttgt tatgatgcag 7501 ctcccctgct gacgacgtcc cattgctctt ttaagtctag atatctggac tgggcattca 7561 aggcccattt tgagcagagt cgggctgacc tttcagccct cagttctcca tggagtatgc 7621 gctctcttct tggcagggag gcctcacaaa catgccatgc ctattgtagc agctctccaa 7681 gaatgctcac ctccttctcc ctgtaattcc tttcctctgt gaggagctca gcagcatccc 7741 attatgagac cttactaatc ccagggatca cccccaacag ccctggggta caatgagctt 7801 ttaagaagtt taaccaccta tgtaaggaga cacaggcagt gggcgatgct gcctggcctg 7861 actcttgcca ttgggtggta ctgtttgttg actgactgac tgactgactg gagggggttt 7921 gtaatttgta tctcagggat tacccccaac agccctgggg tacaatgagc cttcaagaag 7981 tttaacaacc tatgtaagga cacacagcca gtgggtgatg ctgcctggtc tgactcttgc 8041 cattcagtgg cactgtttgt tgactgactg actgactgac tggctgactg gagggggttc 8101 atagctaata ttaatggagt ggtctaagta tcattggttc cttgaaccct gcactgtggc 8161 aaagtggccc acaggctgga ggaggaccaa gacaggaggg cagtctcggg aggagtgcct 8221 ggcaggcccc tcaccacctc tgcctacctc agtgaagttc ccttgtggga ggccctggaa 8281 gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt 8341 agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggg 8401 aggcgaggca gcaccggctc gtcacgtgct gggtccggga tcactgagtc catcctggca 
8461 gctatgctca gggtgcagaa accgagaggg aagcgctgcc attgcgtttg ggggatgatg 8521 aaggtggggg atgcttcagg gaaagatgga cgcaacctga ggggagagga gcagccaggg 8581 tgggtgaggg gaggggcatg ggggcatgga ggggtctgca ggagggaggg ttacagtttc 8641 taaaaagagc tggaaagaca ctgctctgct ggcgggattt taggcagaag ccctgctgat 8701 gggagagggc taggagggag ggccgggcct gagtacccct ccagcctcca catgggaact 8761 gacacttact gggttcccct ctctgccagg catgggggag ataggaacca acaagtggga 8821 gtatttgccc tggggactca gactctgcaa gggtcaggac cccaaagacc cggcagccca 8881 gtgggaccac agccaggacg gcccttcaag ataggggctg agggaggcca aggggaacat 8941 ccaggcagcc tgggggccac aaagtcttcc tggaagacac aaggcctgcc aagcctctaa 9001 ggatgagagg agctcgctgg gcgatgttgg tgtggctgag ggtgactgaa acagtatgaa 9061 cagtgcagga acagcatggg caaaggcagg aagacaccct gggacaggct gacactgtaa 9121 aatgggcaaa aatagaaaac gccagaaagg cctaagccta tgcccatatg accagggaac 9181 ccaggaaagt gcatatgaaa cccaggtgcc ctggactgga ggctgtcagg aggcagccct 9241 gtgatgtcat catcccaccc cattccaggt ggtcctgctg gactcaaaga agaagctggc 9301 ctgcggggca gtgctcatcc acccctcctg ggtgctgaca gcggcccact gcatggatga 9361 gtccaagaag ctccttgtca ggcttggtat gggctggagc caggcagaag ggggctgcca 9421 gaggcctggg tagggggacc aggcaggctg ttcaggtttg ggggaccccg ctccccaggt 9481 gcttaagcaa gaggcttctt gagctccaca gaaggtgttt ggggggaaga ggcctatgtg 9541 cccccaccct gcccacccat gtacacccag tattttgcag tagggggttc tctggtgccc 9601 tcttcgaatc tgggcacagg tacctgcaca cacatgtttg tgaggggcta cacagacctt 9661 cacctctcca ctcccactca tgaggagcag gctgtgtggg cctcagcacc cttgggtgca 9721 gagaccagca aggcctggcc tcagggctgt gcctcccaca gactgacagg gatggagctg 9781 tacagaggga gccctagcat ctgccaaagc cacaagctgc ttccctagca ggctgggggc 9841 tcctatgcat tggccccgat ctatggcaat ttctggaggg ggggtctggc tcaactcttt 9901 atgccaaaaa gaaggcaaag catattgaga aaggccaaat tcacatttcc tacagcataa 9961 tctatgccag tggccccgtg gggcttggct tagaattccc aggtgctctt cccagggaac 10021 catcagtctg gactgagagg accttctctc tcaggtggga cccggccctg tcctccctgg 10081 cagtgccgtg ttctgggggt cctcctctct gggtctcact gcccctgggg tctctccagc 10141 
tacctttgct ccatgttcct ttgtggctct ggtctgtgtc tggggtttcc aggggtctcg 10201 ggcttccctg ctgcccattc cttctctggt ctcacggctc cgtgactcct gaaaaccaac 10261 cagcatccta cccctttgga ttgacacctg ttggccactc cttctggcag gaaaagtcac 10321 cgttgatagg gttccacggc atagacaggt ggctccgcgc cagtgcctgg gacgtgtggg 10381 tgcacagtct ccgggtgaac cttcttcagg ccctctccca ggcctgcagg ggcacagcag 10441 tgggtgggcc tcaggaaagt gccactgggg agaggctccc cgcagcccac tctgactgtg 10501 ccctctgccc tgcaggagag tatgacctgc ggcgctggga gaagtgggag ctggacctgg 10561 acatcaagga ggtcttcgtc caccccaact acagcaagag caccaccgac aatgacatcg 10621 cactgctgca cctggcccag cccgccaccc tctcgcagac catagtgccc atctgcctcc 10681 cggacagcgg ccttgcagag cgcgagctca atcaggccgg ccaggagacc ctcgtgacgg 10741 gctggggcta ccacagcagc cgagagaagg aggccaagag aaaccgcacc ttcgtcctca 10801 acttcatcaa gattcccgtg gtcccgcaca atgagtgcag cgaggtcatg agcaacatgg 10861 tgtctgagaa catgctgtgt gcgggcatcc tcggggaccg gcaggatgcc tgcgagggcg 10921 acagtggggg gcccatggtc gcctccttcc acggcacctg gttcctggtg ggcctggtga 10981 gctggggtga gggctgtggg ctccttcaca actacggcgt ttacaccaaa gtcagccgct 11041 acctcgactg gatccatggg cacatcagag acaaggaagc cccccagaag agctgggcac 11101 cttagcgacc ctccctgcag ggctgggctt ttgcatggca atggatggga cattaaaggg 11161 acatgtaaca agcacaccgg cctgctgttc tgtccttcca tccctctttt gggctcttct 11221 ggagggaagt aacatttact gagcacctgt tgtatgtcac atgccttatg aatagaatct 11281 taactcctag agcaactctg tggggtgggg aggagcagat ccaagttttg cggggtctaa 11341 agctgtgtgt gttgaggggg atactctgtt tatgaaaaag aataaaaaac acaaccacga 11401 agccactaga gccttttcca gggctttggg aagagcctgt gcaagccggg gatgctgaag 11461 gtgaggcttg accagctttc cagctagccc agctatgagg tagacatgtt tagctcatat 11521 cacagaggag gaaactgagg ggtctgaaag gtttacatgg tggagccagg attcaaatct 11581 aggtctgact ccaaaaccca ggtgcttttt tctgttctcc actgtcctgg aggacagctg 11641 tttcgacggt gctcagtgtg gaggccacta ttagctctgt agggaagcag ccagagaccc 11701 agaaagtgtt ggttcagccc agaat // LOCUS HUMPREELAS 2309 bp DNA PRI 05-FEB-1993 DEFINITION Human elafin gene, complete cds. 
ACCESSION L10343 NID g190337 KEYWORDS elafin; elastase; elastase inhibitor; precursor. SOURCE Homo sapiens (library: EMBL-3 (from Clontech)) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sallenave,J.-M. and Silva,A. TITLE The characterization and gene sequence of the precursor of elafin, an elastase-specific inhibitor in bronchial secretions JOURNAL Am. J. Respir. Cell Mol. Biol. (1993) In press FEATURES Location/Qualifiers source 1..2309 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="EMBL-3 (from Clontech)" exon 1..595 /number=1 CAAT_signal 398..401 CDS join(517..595,1453..1727) /note="elafin has been sequenced at the protein level; pre-elafin has not; its existence is assumed from its molecular weight (PAGE analysis); precursor; putative; precursor" /codon_start=1 /function="elastase-specific proteinase inhibitor" /product="elafin" /db_xref="PID:g190338" /translation="MRASSFLIVVVFLIAGTLVLEAAVTGVPVKGQDTVKGRVPFNGQ DPVKGQVSVKGQDKVKAQEPVKGPVSTKPGSCPIILIRCAMLNPPNRCLKDTDCPGIK KCCEGSCGMACFVPQ" intron 596..1452 /number=1 exon 1453..1728 /number=2 mat_peptide 1554..1724 /product="elafin" intron 1729..1961 /number=2 exon 1962..2119 /number=3 polyA_signal 2114..2119 BASE COUNT 595 a 537 c 603 g 572 t 2 others ORIGIN 1 tttgtcttca agagtttttc gagaccaggg aagaaggaag gaaatgccca gtttgatcgt 61 gggagtggta aaatgataaa gtagatctgg gtggggtttg tagcaccaga gcataatgga 121 gaaacacctt ggttttgtaa tcaagactgg atctaccagt gacttgctga ataacttcgg 181 tgattccttt ctcttcttgg gtctcactgt atttcaaaac atgaagaatt tcattgtaat 241 gttacctaat aagtgagcca gcacttctac tctgtgagaa agtaggaaaa ctcttgggac 301 aatcagagat gatgtgatgt aatgtccatt agttcttcct gtgaataatc ctgagggaaa 361 gcccccaggt ccctcccaga atggggtgga tatttcccaa tacagctaag gaattatccc 421 ttgtaaatac cacagacccg ccctggagcc aggccaagct ggactgcata aagattggta 481 tggccttagc tcttagccaa acaccttcct 
gacaccatga gggccagcag cttcttgatc 541 gtggtggtgt tcctcatcgc tgggacgctg gttctagagg cagctgtcac gggaggtgag 601 tgaacaggtg acctgctggg ctgggttgga ctaaggggag accctctgga caccctgggc 661 caggacaggg agcactactg aagcagtagg cagcactgga gcccagattt cagctttctg 721 ttctttgcca tcatattcag aaaaaatagg actttggctg gtggactcca cgtgctttcc 781 acctcagtga ctgagatatc aggactgttt gtggaagtaa tgttggtatg tggccttggc 841 ctcagatgtc aatacctgtg cagaatgtgc aataaaataa tgaactccag gattttaaac 901 cttgggtgtg gacacagtcc ccgtttctct gccccataaa agcactggag taatcagtac 961 tctaaaagga ggttaagaaa caacaagcct tcaggaatca tgttgtttga ggacccccat 1021 tttataagga gggaaccaaa aatgtagaaa tgagtgagca attgccaagg taattcccag 1081 agccaggatg gggctcaagt ctcctagtat gtggctcagg gttctttcct actccaatgc 1141 acttcctaac aaatgacaat gtgtcctctt cactgctggg tgtcacccca gtctgaccac 1201 tgctcctgag agacttggag tggaggaagg gggaagaaac aaatactcaa gggaactctg 1261 gtcctgtaga ccaccccaaa aaaggaagag ccttccaaga gtgtagctcc cagaggtgta 1321 ccttccctac tcaggccatg gtttgaggat gctgcagtaa gcagtggatg gacccagacc 1381 cagaggaaag acatggcagc tgaagcagag gcttactggg tataaatgtg ggctcgtttc 1441 ttcttttaac agttcctgtt aaaggtcaag acactgtcaa aggccgtgtt ccattcaatg 1501 gacaagatcc cgttaaagga caagtttcag ttaaaggtca agataaagtc aaagcgcaag 1561 agccagtcaa aggtccagtc tccactaagc ctggctcctg ccccattatc ttgatccggt 1621 gcgccatgtt gaatccccct aaccgctgct tgaaagatac tgactgccca ggaatcaaga 1681 agtgctgtga aggctcttgc gggatggcct gtttcgttcc ccagtgaggt gagcactagc 1741 tggagaacga ggagacccct gaagacacaa aagaaggctg agcggtgggg aagcatccca 1801 ggttggtggg agggaggttg tgggaggtga cagaaagact gggagactga ggggtctgag 1861 aggctataac cagagtgcct agaaggatga tctgtcttcc tcactgcctc tgagtgcttt 1921 gatgtgctga ctctcacctc tgatactctt ctcttccaca gagggagccg gtccttgctg 1981 cacctgtgcc gtccccagag ctacaggccc catctggtcc taagtccctg ctgcccttcc 2041 ccttcccaca ctgtccattc ttcctcccat tcaggatgcc cacggctgga gctgcctctc 2101 tcatccactt tccaataaag acttccttct gctccacttg tttctggttc ctatgacttc 2161 tgggctcctg gatgctttgg ggaaatggat gtagaattgg 
gacttcttct ctccagtgaa 2221 gaggggaaac ggtcccatgg tgaaagagag caggnnggag gaaacaagga ggcacatgct 2281 agggcttcat attacaatcc aataatcag // LOCUS HUMPRF1A 6218 bp DNA PRI 08-JAN-1995 DEFINITION Human perforin (PRF1) gene, complete cds. ACCESSION M31951 NID g190339 KEYWORDS perforin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6218) AUTHORS Lichtenheld,M.G. and Podack,E.R. TITLE Structure of the human perforin gene. A simple gene organization with interesting potential regulatory sequences JOURNAL J. Immunol. 143 (12), 4267-4274 (1989) MEDLINE 90079042 FEATURES Location/Qualifiers source 1..6218 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q11.2-q21.3" prim_transcript 1410..>6123 /note="perforin mRNA and introns" intron 1507..3269 /note="perforin intron A" exon <3274..3812 /gene="PRF1" /note="perforin; G00-118-853" /number=1 gene 3274..3812 /gene="PRF1" CDS join(3274..3812,4995..6123) /note="perforin" /codon_start=1 /db_xref="PID:g190340" /translation="MAARLLLLGILLLLLPLPVPAPCHTAARSECKRSHKFVPGAWLA GEGVDVTSLRRSGSFPVDTQRFLRPDGTCTLCENALQEGTLQRLPLALTNWRAQGSGC QRHVTRAKVSSTEAVARDAARSIRNDWKVGLDVTPKPTSNVHVSVAGSHSQAANFAAQ KTHQDQYSFSTDTVECRFYSFHVVHTPPLHPDFKRALGDLPHHFNASTQPAYLRLISN YGTHFIRAVELGGRISALTALRTCELALEGLTDNEVEDCLTVEAQVNIGIHGSISAEA KACEEKKKKHKMTASFHQTYRERHSEVVGGHHTSINDLLFGIQAGPEQYSAWVNSLPG SPGLVDYTLEPLHVLLDSQDPRREALRRALSQYLTDRARWRDCSRPCPPGRQKSPRDP CQCVCHGSAVTTQDCCPRQRGLAQLEVTFIQAWGLWGDWFTATDAYVKLFFGGQELRT STVWDNNNPIWSVRLDFGDVLLATGGPLRLQVWDQDSGRDDDLLGTCDQAPKSGSHEV RCNLNHGHLKFRYHARCLPHLGGGTCLDYVPQMLLGEPPGNRSGAVW" intron 3813..4994 /note="perforin intron B" exon 4995..>6123 /note="perforin" /number=2 BASE COUNT 1385 a 1781 c 1769 g 1283 t ORIGIN 1 gaattccaaa gtcctctctt tgattttata ggtgaggaaa ctaacgctca gaaagggggt 61 tgatatctat cgccgtgagg catacggtaa gtttctggtg aagctgggat cagaacctgt 121 ttagactttg cctctctttt cccgcagata ctttgcagga 
cttctatgtc cctcaaaccg 181 gccttcctgt catggtcagg aaagaaactc ctcacagcct cagcatccaa gtcaggccat 241 gggtgacagc tggaaagtga tcaggaggct gcagtttcta gaagagggtg gggacactgc 301 ggagagaaga tggggccaga ttccgagaag acagcataag cccctgttcc tgtaagagca 361 gggacggaag cagggacata aacgcaaggg atgagcccca aagtgtgacc catgagacat 421 gatgtcacat gtggtctgga gcctgcccca cttcttccca tcatatacac agttatgaga 481 acaagttgtg agaaccacct cctcccttac ccagctgccc ccaccccaga agccgtgtga 541 ttttgccccc cagtgccctg tgagtcactc cacccatgga aacctcaccc caccctgacc 601 tcaagcaagg cagagtgcag aagacatgtc ctccggtgct accagaccac tctcaccagc 661 acccacgacc tcagcagggc tggagccagc gtggaggcca ctggctgtcc tcacaaagcg 721 aggagcagga gcccctgttc gaggaacatg cttggagttc ggagcctggg ctagggtggg 781 atgtaggttg agcaggaagt ggatggcaag attagagcaa catctctctt ctcccactca 841 gggaggaggg aatggccaca ggctctgaca ctcaagaagg gccaggcaca gttccaagca 901 cttcacaaca acccctaggg tctacatgac ctacaatccc aattgttcag tgaagaaact 961 gaggcacagt gaggctgaag aaccctacca gtccacactg ctggtgcata accgagctgc 1021 ccaagccccg gcggtctggc gtgtaggccc atgctctgag ccgccgcctc tgcttgcctc 1081 ttacatccca cacatgcgat gctgtgcatc agaagcaagg agatggccct gctggcctgt 1141 tcatcaacac cagggccgag tctcaaagtc ctcagcgccc cgccctcctc cgcctgtgtg 1201 ccctgagtcc ccgagcccca gcagctctac tcggcagatg agcctctggc cctgctgctc 1261 gcttcctgag ggctgtcagt ggggagccgg atgagggctg aggacagggt gggtgctcgt 1321 gggaggggag agcacaaagg acctgtgacc acagctgggg gcggggcagg aagtagaagt 1381 gatgtgagtg gtggctggtg caaggagcca cagtgggctg cctggggggc tgatgccacc 1441 attccaggag cctcggtgaa gagaggatat ccatctgtgt agccgcttct ctatacggga 1501 ttccaggtaa ggagagagca gggattgggg gcctggggcc ctgggtggag gggaagaggc 1561 tgatggagca ggaagtgctg tgacctataa gacaagacac ctgggtcaca gacggtgccc 1621 atcactaact cgctgggcag ccctgtgtcc accctgggcc tcagtttcct catgtatgaa 1681 atgaagaggt agcgtgcagt ttctaaggcc ctgcaagtgc tgacattaaa gcttctaaga 1741 aagctggaaa gaggctccct gggacagaat accatgagag tcaggatgag ggctgagttc 1801 actggattag ggttatgact gtgcccgcct cctagctggg atgagcccag ggcctttgaa 
1861 gattcccctg gcctctttgc ctccctgggg cgagccacgc tgcctgaaat tccacccttc 1921 aagtcacacc tttgggcagg gcaggaggct ccagctataa tgggggctct ccatagccct 1981 cttgtctcag aacctgtggt acccaagggc aaaggccttt gaagagctca gcttggataa 2041 ggttaggctt gggtaaggtt agagaaagga ggttccagat cataaacagg cccaggcaag 2101 gccccagaaa caccttaaaa ttaggaagaa ccttccagac aatcattgtc catagtaggc 2161 aatagggggg cccctgtcac tgccgaaggc cacagagctt ggcagggagt ggcaaagagg 2221 ggaggggcag caagaaagga tctgaagagc aaccatcgag gcctctcccc acacaaaatc 2281 caagggtgtg gtcaggtctg cccctttttc ttttttttct ttctcttttt tttttttttt 2341 tttttttgag gcagagtctt actctatcac ccaggcggga gtgcagtggt acgatcttgg 2401 ctcactgcaa cctccgcctc ctgggttcaa gcgattctcc tgtctcagcc tcccaagtag 2461 ctgggattac aggcacccgc caccatgccc agctaatttt tgtattttta gtaaagaggg 2521 ggtttcacca cgttggccag gctggtctcg aacttctgac ctccagtgat ccacctgcct 2581 ctgcctccca cagtgttcgg tttacaggtg tgagccacca cacctggcca ggtcggcaga 2641 ttttattcta gaattcgggc tgatgttatt ccataaagct cagagggaga gcgccaaagg 2701 gaaaatgaga agaataggtg caatatagga cctcatttct cttccaattc tgttaagggc 2761 tcagtgggag ggagagaggt tagagagagt gacagagaca aagaaacaga gtgagagtca 2821 gggtgggcaa agggtggctt gttctggggc caaaaatgac acttcttcag aatgaaggct 2881 tcctcaggtc cccagacccc tccctaaacc tgctacagct actctgtccc ctcttctgtg 2941 gccacagccc cctccccacc acccacagtt gtgtcctggg gacagagcca tccacctaga 3001 tccccattag gccttaggga aattctaaaa aggggctccc cttggctggg cttttcccct 3061 ctctgggccc atctgtacaa tgtgggggct gaaccaggcc tataagggac ataccagctc 3121 tgacattcat tgaataatga cttaagagat atctcagccc ctccccttcc atgtgccctg 3181 ataatctgtg gctgtggggg aagggagcag tcatcctcca tccctccacc catggcttcc 3241 cagagcccaa gtgccccctg tctctgcagc tccatggcag cccgtctgct cctcctgggc 3301 atccttctcc tgctgctgcc cctgcccgtc cctgccccgt gccacacagc cgcacgctca 3361 gagtgcaagc gcagccacaa gttcgtgcct ggtgcatggc tggccgggga gggtgtggac 3421 gtgaccagcc tccgccgctc gggctccttc ccagtggaca cacaaaggtt cctgcggccc 3481 gacggcacct gcaccctctg tgaaaatgcc ctacaggagg gcaccctcca gcgcctgcct 3541 
ctggcgctca ccaactggcg ggcccagggc tctggctgcc agcgccatgt aaccagggcc 3601 aaagtcagct ccactgaagc tgtggcccgg gatgcggctc gtagcatccg caacgactgg 3661 aaggtcgggc tggacgtgac tcctaagccc accagcaatg tgcatgtgtc tgtggccggc 3721 tcacactcac aggcagccaa ctttgcagcc cagaagaccc accaggacca gtacagcttc 3781 agcactgaca cggtggagtg ccgcttctac aggtgagagc tggggctagg ggtggggggc 3841 tggaaaaggc gcgggaaact ctgggtggtc taagagccct ggaaaggcag agttctccaa 3901 tcaaacttgg aggctgcttc caccaggaag aaactgcaat cctggctctc agacttcagg 3961 tgattcagcc ctgtggcctc ctctcatcca ggaggtccca aatctgggag tgcccagtga 4021 gacatcaagt agagagaaag tgagtgagac catgcctgag agtcccagca gggtgccatt 4081 taggtctagt gagtactgaa ccctgccttt cctgggcaca gccagccttc cacagagtcc 4141 ttcctggata gggacagagg caggagacct ggccgggtca tccttcctca tcgctgtgtg 4201 accttgagca tctcacttct ctctgagctt caatcatcca ctcaaccaat actcacacct 4261 gcctgtggca gggccacact gctgagcgac agcagtcact accctagccc ctgggagtag 4321 gaagtatggg acccatattc cacgtctgag aaaggcacag tgtttgtagg ccagacccaa 4381 gtcacctggt ctgattcatc agacctgggg gccacgtttc ctccacagag aggccgagca 4441 tggcccaagc tgtcaggatc ccgggcatgt gggacccatc cagggggacg gatggatgaa 4501 ggccacatga ctagttccaa agttcgacag ataccatcag tgcaggatca ttgcttttat 4561 gttctttttt actttttctt aaaaaaaaaa tagagatggg gtctcactat gttgcccagg 4621 ctggtctcaa actcctgggc tcaagtgatc ctcccgcctc ggcctcccaa agtgctgggg 4681 ttacaggcat gaaccactgc tcccggccag gatcattgct tttataataa gaaaattaaa 4741 aggaaagaaa aaaatgttat tttgaaaagg aaaagagaaa atacctatac agagcactga 4801 ggtccctgag gggtgagagc ggaggcattc ctgccagccc cgtgccactg tgcttgtgct 4861 ctggagccgg gcccctgggt tccagtccta gttctgccca cttacatgtg accttgagca 4921 gtcctgaagg agttatttga ttgaatgggg gaaatactcc cctgggccca gctgaggtct 4981 ctctcttctc gcagtttcca tgtggtacac actcccccgc tgcaccctga cttcaagagg 5041 gccctcgggg acctgcccca ccacttcaac gcctccaccc agcccgccta cctcaggctt 5101 atctccaact acggcaccca cttcatccgg gctgtggagc tgggtggccg catatcggcc 5161 ctcactgccc tgcgcacctg cgagctggcc ctggaagggc tcacggacaa cgaggtggag 5221 gactgcctga 
ctgtcgaggc ccaggtcaac ataggcatcc acggcagcat ctctgccgaa 5281 gccaaggcct gtgaggagaa gaagaagaag cacaagatga cggcctcctt ccaccaaacc 5341 taccgggagc gccactcgga agtggttggc ggccatcaca cctccattaa cgacctgctg 5401 ttcgggatcc aggccgggcc cgagcagtac tcagcctggg taaactcgct gcccggcagc 5461 cctggcctgg tggactacac cctggaaccc ctgcacgtgc tgctggacag ccaggacccg 5521 cggcgggagg cactgaggag ggccctgagt cagtacctga cggacagggc tcgctggagg 5581 gactgcagcc ggccgtgccc accagggcgg cagaagagcc cccgagaccc atgccagtgt 5641 gtgtgccatg gctcagcggt caccacccag gactgctgcc ctcggcagag gggcctggcc 5701 cagctggagg tgaccttcat ccaagcatgg ggcctgtggg gggactggtt cactgccacg 5761 gatgcctatg tgaagctctt ctttggtggc caggagctga ggacgagcac cgtgtgggac 5821 aataacaacc ccatctggtc agtgcggctg gattttgggg atgtgctcct ggccacaggg 5881 gggcccctga ggttgcaggt ctgggatcag gactctggca gggacgatga cctccttggc 5941 acctgtgatc aggctcccaa gtctggttcc catgaggtga gatgcaacct gaatcatggc 6001 cacctaaaat tccgctatca tgccaggtgc ttgccccacc tgggaggagg cacctgcctg 6061 gactatgtcc cccaaatgct tctgggggag cctccaggaa accggagtgg ggccgtgtgg 6121 tgagaacagt gagcttggaa aggaccagta tgcttggact gaaggggttc tcacagtggg 6181 agccagggct gtcttcgtat tcccattaga ccaagctt // LOCUS HUMPROT1B 1306 bp DNA PRI 07-MAR-1995 DEFINITION Human protamine 1 gene, complete cds. ACCESSION M60331 NID g190453 KEYWORDS protamine 1. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1306) AUTHORS Domenjoud,L., Nussbaum,G., Adham,I.M., Greeske,G. and Engel,W. 
TITLE Genomic sequences of human protamines whose genes, PRM1 and PRM2, are clustered JOURNAL Genomics 8 (1), 127-133 (1990) MEDLINE 91184796 FEATURES Location/Qualifiers source 1..1306 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PRM-1" /tissue_lib="of A.M.Frischauf" /map="16p13.13" CAAT_signal 519..522 /gene="PRM1" /note="G00-120-316" TATA_signal 544..548 /gene="PRM1" /note="G00-120-316" exon 575..777 /gene="PRM1" /note="G00-120-316" /number=1 gene join(666..777,869..912) /gene="PRM1" CDS join(666..777,869..912) /gene="PRM1" /codon_start=1 /db_xref="GDB:G00-120-316" /product="protamine 1" /db_xref="PID:g190454" /translation="MARYRCCRSQSRSRYYRQRQRSRRRRRRSCQTRRRAMRCCRPRY RPRCRRH" intron 778..868 /gene="PRM1" /note="G00-120-316" polyA_signal 1045..1050 /gene="PRM1" /note="G00-120-316" BASE COUNT 347 a 337 c 342 g 280 t ORIGIN 1 gagaccaagc ctggccaaca tggcgaaagg ccatctctac taaaaataca aaaattagtt 61 gagtgtggtg gtgcatgtct gtagtccaac tactgggagg ctgctggcca gaggtttgct 121 tgaacccagg aggcagaggt tgcagtgagc caagattgcg ctactgcact ccagcctggg 181 tgacagaatg agactctgtc tcaaaaaaat aaacaaatac ataagtaatt aaaaaaataa 241 aataataata ataatacttc ttaacaaggc tgtctgtctt gtttatcact atataccagg 301 cctaggccaa aatccggaac aggccaaaat attaacagta atattttgtt aattgaataa 361 ctagatgatt gctcccacgg aggagtcatc ttgtatcgcc ccagctgtga cataggcagc 421 ccctacactc gggggcctgc ccgcctctca aatgcccata tatggacatg atgcaggcca 481 cctggcatgg tttgtgaggt ccagcccctt tgccctcaca atgaccaacg gccccctggc 541 atctataaca ggccgcagag ctggcccctg actcacagcc cacagagttc cacctgctca 601 caggttggct ggctcagcca aggtggtgcc ctgctctgag cattcagcca agcccatcct 661 gcaccatggc caggtacaga tgctgtcgca gccagagccg gagcagatat taccgccaga 721 gacaaagaag tcgcagacga aggaggcgga gctgccagac acggaggaga gccatgagta 781 agtgggccca gctgagggtg ggctggggct gaggctggga gctctcaggg cccagccttc 841 ctctcaccac ttttcttggt ctcaccaggg tgctgccgcc ccaggtacag accgagatgt 901 agaagacact aattgcacaa aatagcacat ccaccaaact cctgcctgag aatgttacca 961 gacttcaaga tcctcttgcc 
acatcttgaa aatgccacca tccaataaaa atcaggagcc 1021 tgctaaggaa caatgccgcc tgtcaataaa tgttgaaaag tcatcccact cttctctcct 1081 tgttcttgag aggggaggct cagtggggag gagggctcgg ggatgagcag agggggagag 1141 ggcctgggca tgcaacggga gaaagatgtc ggtggggggg acatgaaaga cagttgtcac 1201 gctgggtttt gttccaaact tttttttttt tttgagacag agtctcgctc cgtcgccagg 1261 atggagtcag tggccatctg gctcactgca agctccgcct cccagg // LOCUS HUMPROT2 1861 bp DNA PRI 07-MAR-1995 DEFINITION Human protamine 2 gene, complete cds. ACCESSION M60332 NID g190455 KEYWORDS protamine 2. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1861) AUTHORS Domenjoud,L., Nussbaum,G., Adham,I.M., Greeske,G. and Engel,W. TITLE Genomic sequences of human protamines whose genes, PRM1 and PRM2, are clustered JOURNAL Genomics 8 (1), 127-133 (1990) MEDLINE 91184796 FEATURES Location/Qualifiers source 1..1861 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PRM-2" /tissue_lib="of Frischauf" /map="16p13.13" CAAT_signal 553..556 /gene="PRM2" /note="G00-125-271" TATA_signal 767..772 /gene="PRM2" /note="G00-125-271" exon 804..1174 /gene="PRM2" /note="G00-125-271" /number=1 gene join(904..1174,1338..1375) /gene="PRM2" CDS join(904..1174,1338..1375) /gene="PRM2" /codon_start=1 /db_xref="GDB:G00-125-271" /product="protamine 2" /db_xref="PID:g190456" /translation="MVRYRVRSLSERSHEVYRQQLHGQEQGHHGQEEQGLSPEHVEVY ERTHGQSHYRRRHCSRRRLHRIHRRQHRSCRRRKRRSCRHRRRHRRGCRTRKRTCRRH " intron 1175..1337 /gene="PRM2" /note="G00-125-271" polyA_signal 1618..1623 /gene="PRM2" /note="G00-125-271" BASE COUNT 479 a 537 c 527 g 318 t ORIGIN 1 cgtggtggct cacacctgta atcccagcac tctgggaggc cgacatggga ggctcactga 61 ggccaggagt gtgagaccag ccagggcaac atggtgaaac cctatctatc tccacaaaaa 121 atacaaaaaa ttagctggga gtggcagcac acgcctatag tcccagctac tgggaggctg 181 aggtggcagg atcatctgag ctggaggggt agaggctgct atgatccatg attgcacaac 241 tgcactccag cctgggcgac
agacagtgcc tcaaaacaaa aagaggagag agagagagag 301 agagagagag agagagagag agagagaatg agtgaatgta tgggaaataa atgaatcaat 361 gcttggcaca ctaggagcta ttagggaact gtttgctgct gtttgccatc ttaacagtaa 421 gctgtgtgca cagagatcca agcacggagt ccttccacat ggtggctgtg gcacagcagt 481 gagttggcaa ggaagcttcc aaatgacaat gtgcgcaccc accggcttgg ggagatgagg 541 gcttggatta ggcaataaaa cacctgccat ctgtatggtt cacctactgc aagagacaga 601 ggacatgctt tacaggcaag gcaagggggt ctctcgccct tccagccctg cagcttgggt 661 gccagggccc tgctagttgt gacccctggc ctgtgctgcc tcacagagga ggctgggcag 721 ggtggggact gggcggggcc gcagagtcat agtgggcgtc cccctttata tacaagctcc 781 cggggagcct tgcagaccag accaacagta acaccaaggg caggtgggca ggcctccgcc 841 ctcctcccct actccagggc ccactgcagc ctcagcccag gagccaccag atctcccaac 901 accatggtcc gataccgcgt gaggagcctg agcgaacgct cgcacgaggt gtacaggcag 961 cagttgcatg ggcaagagca aggacaccac ggccaagagg agcaagggct gagcccggag 1021 cacgtcgagg tctacgagag gacccatggc cagtctcact ataggcgcag acactgctct 1081 cgaaggaggc tgcaccggat ccacaggcgg cagcatcgct cctgcagaag gcgcaaaaga 1141 cgctcctgca ggcaccggag gaggcatcgc agaggtctgc ctgcgccccc gccttgccct 1201 gcatgtccct gaccacccca ggcacaggag ggaggcgggg acccacccca cctgacaaaa 1261 gctccagccc cctaaacccc gtccccaccc agagttccct aggtgacccc ctcaaccaga 1321 actttctttc ccaaaaggct gcagaaccag gaagagaaca tgcagaaggc actaagcttc 1381 ctgggcccct cacccccagc tggaaattaa gaaaaagtcg cccgaaacac caagtgaggc 1441 catagcaatt cccctacatc aaatgctcaa gcccccagct ggaagttaag agaaagtcac 1501 ctgcccaaga aacaccgagt gaggccatag caactcccct acatcaaatg ctcaagccct 1561 gagttgccgc cgagaagccc acaagatctg agtgaaattg agcaaagtca cctgcccaat 1621 aaagcttgac aagacactcg ctggcctgtc gggttttcct ttgcacgaat cacacaaacc 1681 tggcttccct cacgcccagg gcgggtagag gcttctctgc agcccttcct cagaggcagg 1741 tagggcggga gttctgatgc tctcaggaca cctaggagag ggttactgtg cctgggaggc 1801 caccctatga tgctgtcctg tctattgtgc atccttccat ggtgtcaaag ccacgattag 1861 t // LOCUS HUMPRPH1 4946 bp DNA PRI 03-MAY-1996 DEFINITION Human acidic proline-rich protein (PRH1) gene, complete 
cds. ACCESSION M13057 NID g190511 KEYWORDS proline-rich protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4946) AUTHORS Kim,H.S. and Maeda,N. TITLE Structures of two HaeIII-type genes in the human salivary proline-rich protein multigene family JOURNAL J. Biol. Chem. 261 (15), 6712-6718 (1986) MEDLINE 86196106 COMMENT Draft entry and clean copy sequence for [1] kindly provided by H.-S.Kim, 28-JUL-1986. Two direct repeats are located in the first intron flanked at the 5' and 3' ends by a 6 bp repeat (positions 1752-1757 and 2523-2528). This whole sequence (positions 1752-2528) looks like a transposon. The sequences 'tggaaa' (252-257 and 280-285) and 'caaacca' (253-260 and 326-332) within the 72 base pair repeats are identical to virus enhancer core sequences. FEATURES Location/Qualifiers source 1..4946 /organism="Homo sapiens" /isolate="R.D." /db_xref="taxon:9606" /clone="1-4" /cell_type="white blood cell" /map="12p13.2" /chromosome="12" repeat_region 224..295 /note="72 bp repeat" /rpt_type=tandem repeat_region 296..367 /note="72 bp repeat" /rpt_type=tandem exon 1561..1716 /gene="PRH1" /number=1 gene 1561..4833 /gene="PRH1" sig_peptide 1653..1700 /gene="PRH1" CDS join(1653..1716,2768..2803,3166..3566) /gene="PRH1" /note="precursor" /codon_start=1 /product="proline-rich protein 1" /db_xref="PID:g190512" /translation="MLLILLSVALLAFSSAQDLNEDVSQEDVPLVISDGGDSEQFIDE ERQGPPLGGQQSQPSAGDGNQDDGPQQGPPQQGGQQQQGPPPPQGKPQGPPQQGGHPP PPQGRPQGPPQQGGHPRPPRGRPQGPPQQGGHQQGPPPPPPGKPQGPPPQGGRPQGPP QGQSPQ" mat_peptide join(1701..1716,2768..2803,3166..3563) /gene="PRH1" /product="proline-rich protein 1" intron 1717..2767 /gene="PRH1" /number=1 repeat_region 1752..1883 /rpt_type=direct repeat_region 2381..2528 /rpt_type=direct exon 2768..2803 /gene="PRH1" /number=2 intron 2804..3165 /gene="PRH1" /number=2 exon 3166..3584 /gene="PRH1" /number=3 repeat_region 
3273..3320 /rpt_type=tandem repeat_region 3321..3371 /rpt_type=tandem repeat_region 3372..3421 /rpt_type=tandem repeat_region 3422..3485 /rpt_type=tandem repeat_region 3486..3536 /rpt_type=tandem intron 3585..4743 /gene="PRH1" /number=3 exon 4744..4833 /gene="PRH1" /number=4 BASE COUNT 1516 a 1030 c 1067 g 1333 t ORIGIN Chromosome 12p13.2; 1 bp upstream of BamHI site. 1 ggatccactc atgcacacct gtctagggcc ttgtctttat gattttctgt caccaggctt 61 ctcagccatg cctccaacca agctatcctc aaataaagac taaccatgca caaaaagtga 121 aaaattacac aagacctact caaggtctga aatggaaagt gcagctggtg attctgagta 181 gtatatttta tgaattgatg aacagggaat ttctaggtat cacagagcaa tcagcagctg 241 aggagagagc gacaaaacca ggactctgac agggcttttt ggaaacttaa tttttggagc 301 aatcagcagc tgaggagaga gctaccaaac caggatcccg acagggcttt ttggaaactt 361 aattttttat tgtcaaatta tacagacaaa aatgaaagta aaggtagtca ggagtaccac 421 acacctttac ttagaaggat aaataaagga cacctttgta gggagaaaaa aaaagatata 481 ggacgaaaag taaacatggc aaggtagact cagaatacaa aatttcaaca aataagtgcg 541 agaaaaagaa tgagaaagaa tgagtattgg aatattacag ataaaaatcc ttggtgagaa 601 agaacactct gatgaaacag gtgacataat tactgagtaa aagcagaaga ccaatcaagg 661 caattgaggc ttggacaggt attacaaatt atcttagatg aatgtataat tgtggccatt 721 taattatcag tgatgacaat tacaatattt attagaagaa ataaagtaga ggatgaattc 781 tatggaatat tctggagaaa gtggaggtct tagctatatg acatcgtgtg gctcacttag 841 ggaaccatat attcagtttg ttcaggtcta gaaattcctt ggtggtttct gagttcagtt 901 ttgctttaat aaaccctaaa gatattttta aaatacaaat attttagctt gcctgtgttg 961 tatagcacca aggcttgcct gagtgtagtg agattcacag aagtttggac aattcttgta 1021 gcctgagcag gctgccagcc ttttcttttc atatatgtgt gcctgggagt gtgctgtcct 1081 cacacacttc acagataagc aagaccatct ctgttacatg aaaaatttag ctaaatgtga 1141 aaatacctac gcacacacat catgagagat ggaaagaggc tatcaaaaca gtcagagata 1201 ttgaccatct tcactgttag ctgtaggatc agtgtccacc ccccgacaca cccacacaaa 1261 ctcacataca cagaaatgaa aagcacagtg aaaagcagaa tagcagtctt tccaaacaga 1321 ttgaaaacca aaagatgtct aaagttcaat cctgacactg tgccgactgc ttggacacag 1381 ttccgtcaaa 
acatttgtcc aagtcatagt tgtgcctttt aggaattaga cacaaacaat 1441 atcctaccca acccttcctg ctgagctaga gtcccaaaag aaatgaggga tacacttgac 1501 ctgaagtaag caaagcagaa gccagtctct gaggcgagga ggcccacctg gtagggagct 1561 caaatgcacc attgtcctgc ttgtctttat aaagggagct gacacgtttc tcccagcaca 1621 aagttgggag tgacaccaga gcctcctgca agatgcttct gattctgctg tcagtggccc 1681 tgctggcctt cagctcagct caggatttaa atgaaggtaa gatgaattgg gggaagatat 1741 tgtgactctg attggggttt acaggcaaat gctatagagg aggaaagtgg agggaagaga 1801 ggaggatgag aaaacagatg ggactgcaga gttctcatgc tgaggatcag aagatctatt 1861 gtacctccat tcctcatcaa ggcctcatag tttatttgtt gcacaaatag aatccaataa 1921 agaattcgta ctaggggtgt gagagagtga gatttgcatt atatagagac atgggactgc 1981 tgtgaaggat gtggagaatg caagacagat tcagggaagt acagctgtga agatcctgta 2041 ctgatcccag tagacaggga tgatggtggc cttgctgtac agtggatgag cattaatgaa 2101 ggagataaac acatgtcaga gctactgccg aggcagagaa ttgggtaaac acttgtctct 2161 gtctacatag agttagagaa taaccagagt gaaacattgt catttttctg tctcctgaat 2221 gtagtatttc aatgtgctgg gaatggcatg cgtaagatta tatccaagtg gctatgtctg 2281 gtggctcctg ttgagaaggc ttgcaaacat aaacaacata tttacagatg aaagagggca 2341 gaagaatccc caaatatgtc attgaaatac tcagaagcag tttaactaaa taagcgctaa 2401 aggttacggc aatgctatga ggaggaaagt gaggaagaga ggaggtgaga aaacagatgg 2461 gactgcagaa ttttcatgct gaggatcaga agacctattg tatcttcatt cctcatcaag 2521 gcttggggaa tcaaaagagg acaaacaggg gcccttctat gttgagttcc tggttgacgc 2581 tcagtgtagt aacaatactg ctttccctta catcttcttc cacttccagt agcatcagag 2641 agtggctgat gagatcacaa aggggatgca cagggtgtga tcagaggtcc tttatcctcg 2701 tagaacacta tgagccttga atgattcagg aagtaacttt tcccatcatc ctgtacttct 2761 tttctagatg tcagccagga agatgttccc ctcgtaatat caggtaaatc ccaataaatt 2821 ctcagtaaac tctgtctcca tttttccctg aaaaattgat gagttctcca gtgtcttctt 2881 atcaccattt tcttgtcagg aattggctaa taccaatgcc ccaaagatat aaacagtttt 2941 ctcccaacct tgattctggg gaccatgagt aaagaaattc gatttttcat cacccttatg 3001 tggattaaga ggagttctaa ttaggaagcc ttgggaaggg gggaggttgg gagttgagag 3061 gcaggtcagg cagagagggg 
ccagccgtgt ggtgcagaca gagaggtatg aagacaggag 3121 ggttttccag catgagctca gctcttcttg tttcaactca cacagatgga ggagactctg 3181 agcagttcat agatgaggag cgtcagggac cacctttggg aggacagcaa tctcaaccct 3241 ctgctggtga tgggaaccag gatgatggcc ctcagcaggg accaccccaa caaggaggcc 3301 agcagcaaca aggtccacca cctcctcagg gaaagccaca aggaccaccc caacagggag 3361 gccatccccc tcctcctcaa ggaaggccac aaggaccacc ccaacaggga ggccatcccc 3421 gtcctcctcg aggaaggcca caaggaccac cccaacaggg aggccatcag caaggtcctc 3481 ccccacctcc tcctggaaag ccccagggac cacctcccca agggggccgc ccacaaggac 3541 ctccacaggg gcagtctcct cagtaatcta ggattcaatg acaggtacga ttccagttta 3601 ttatccatca aaggctccaa ctgctacagt tctccaactt cattgtgcca atgaatcctc 3661 tgaaaacctg ttaatattgc cctgtcctgg aacacatttc taaaaattgt tattcagata 3721 ttcttgtata gagtatcaag accctgtgac cctgtgtttt acagaagctc ttgaaggcaa 3781 ttctgatttt gagaatcact atcttcaaat tacatgtctt aagtagggtt gacaatgagg 3841 acatagaacc atgttccccc tttggctctc tctattttct ttcctcaaac tcagactccc 3901 atttaaagtt ttcacctgaa catgctttgc tcagtcctgc ctcacatcag gctttcaggt 3961 ccagtattcc tgctaagtgg tccttgaact ttcagttgta aaatggtatc tcatttttag 4021 tatacttaca tttaaagtca tacatgctta agctaacaaa aactaatctc actgaaccga 4081 aatgtacaga gctaaatagt aagagcctaa ctcatctttc ctcccttcta tccttctcaa 4141 aacccccacc ttttactctt tggaatctac tttttgaaat atatattgct acataaatat 4201 atataatgct tttattacaa gcacttatac catactgtat attcagattt gtgtcttact 4261 cactggaaaa atttattgtt tagaaaactt tctcctgagt tcatttagat ctcttcagtg 4321 tttactggct agcttgcttt tacaattgta tactattcta ttgtttggtt gttccgtaat 4381 ctatttaacc aatccctgtc actggatatt aaggcgggat tcacattctc actattatat 4441 gatatgttgc agtgcccatc tttgtaaaaa tagcccttac ataaccaaca gcagcaaagc 4501 atgaacacat aacaataatt gtttctcttt ctcatctaag caacacttta aatcacatta 4561 tgtgcaatgg cttaaagagt gaacaaagaa atattactaa ggagaagggg gccgtagaag 4621 ggatagaggg caagagtagg ggcttgcatc tccatcactg cagtaacccc cagtgaggga 4681 ttagacattt cctgccatgt caagtgttgt ctgtaatctt ccttgtttgc ttgtcttttt 4741 caggaagtga ataagaagat aacagtgttt 
caaatgccgt gaaacacggc atcatgctct 4801 aacttcagta taccaataaa acaatcagct tgcaatttct gctggtggtg tctctttctg 4861 agtgtttggg actctagaat ctgagaccca tgttcacatt gtaagaatat ccaggacccc 4921 ttctccttga tgtttccagc aagctt // LOCUS HUMPSAP 4778 bp DNA PRI 15-MAR-1990 DEFINITION Human pulmonary surfactant apoprotein (PSAP) gene, complete cds. ACCESSION M30838 NID g190564 KEYWORDS pulmonary surfactant protein. SOURCE Human adult lung DNA, adult lung, cDNA to mRNA, clones HS [2 and 5] and fetal lung, cDNA to mRNA, clone pHS-6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4778) AUTHORS White,R.T., Damm,D.L., Miller,J., Spratt,K., Schilling,J., Hawgood,A., Benson,B.J. and Cordell,B. TITLE Isolation and characterization of the human pulmonary surfactant apoprotein gene JOURNAL Nature 317, 361-363 (1985) MEDLINE 86014366 FEATURES Location/Qualifiers source 1..4778 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 128..133 prim_transcript 171..4637 /note="PSAP mRNA and introns" intron 200..1022 /note="PSAP intron A" sig_peptide 1046..1105 /note="pulmonary surfactant apoprotein signal peptide" CDS join(1046..1217,1531..1650,2404..2481,2957..3333) /note="pulmonary surfactant apoprotein precursor" /codon_start=1 /db_xref="PID:g190565" /translation="MWLCPLALNLILMAASGAVCEVKDVCVGSPGIPGTPGSHGLPGR HGRDGLKGDLGPPGPMGPPGEMPCPPGNDGLPGAPGIPGECGEKGEPGERGPPGLRAH LDEELQATLHDFRHQILQTRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGG RIAVPRNPEENEAIASFVKKYNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAG RGKEQCVEMYTDGQWNDRNCLYSRLTICEF" exon <1046..1217 /note="pulmonary surfactant apoprotein precursor, (first expressed exon)" /number=2 mat_peptide 1106..1217 /note="pulmonary surfactant apoprotein" intron 1218..1530 /note="PSAP intron B" mat_peptide 1531..1650 /note="pulmonary surfactant apoprotein" exon 1531..1650 /number=3 intron 1651..2403 /note="PSAP intron C" mat_peptide 2404..2481 /note="pulmonary 
surfactant apoprotein" exon 2404..2481 /number=4 intron 2482..2956 /note="PSAP intron D" exon 2957..>3333 /note="pulmonary surfactant apoprotein precursor" /number=5 mat_peptide 2957..3330 /note="pulmonary surfactant apoprotein" polyA_signal 4619..4624 polyA_signal 4639..4644 BASE COUNT 1152 a 1223 c 1350 g 1053 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctcca gcctgagtgc tcttggggaa acatgctgtg taaacactat gcccatttcc 61 tgcctggagc acaggttttg tggtagggct ctcaggggtg aggaggaagc ctggcagccc 121 ccacatctat aaatgctgcg tctaccttac cctctgactt ggaggcagag acccaagcag 181 ctggaggctc tgtgtgtggg tgagtttagc cccatcccct aggtgttctc cagcttgagg 241 atcgcaggca gagaggacca gcccagcagc cacaggcctg accaaagccc aggctgggaa 301 ggagggcaac tccccatttt ccactgggag gtgtttcaca gcacagtcaa cataggtgac 361 ctgcaaagat cctcatgttt gttattttct ttggccagat ccatcctaca gggttcagca 421 gggcctacag gaggggcagt gagagaacag accccaaaaa gaaaggggac tccatgactg 481 accaccttga ggggggccag gctgccgggc cccgttcatc ttttttcatt ctcaggtcgc 541 tgatttcttg gagcctgaaa agaaagtaac acagcaggga tgaggacaga tggtgtgagt 601 cagtgagtga gtgacctgac taatagcctg ggagggacag ggcaggtttt ctgcagaggc 661 acggaagatt cagctgaagt cagagaggtg aagccagttt cccagggtaa catagtgagg 721 cactgaaaga aaggagactg cactggagcc caggtccccg ggctccccag agctccttac 781 tcttcctcct cctcagcagc ctggagaccc cacaacctcc agccggaggc ctgaagcatg 841 aggccatgcc aggtgccagg tgatgctggg aattttcccg ggagcttcgg gtcttcccag 901 cactctggtc gtcgcccgcc ctgcctcgtc gggctctgcc cagcttcctg agtcctgaca 961 gagcacagtg ggggagatgt tggcagaggt ggcagatggg ctcacggcca tccctcctgc 1021 aggagcagcg actggaccca gagccatgtg gctgtgccct ctggccctca acctcatctt 1081 gatggcagcc tctggtgctg tgtgcgaagt gaaggacgtt tgtgttggaa gccctggtat 1141 ccccggcact cctggatccc acggcctgcc aggcaggcac gggagagatg gtctcaaagg 1201 agacctgggc cctccaggta ctgtgctgca gaccccaccc tcagctgagg acacagaccc 1261 cttttcagga ggcccatctg tccaggcccc taggctgtgg gccatagtga gctgggggct 1321 atagtaagct gggtgggact tcagtctgca gggctggtgg gttcctgggg cccttatgat 1381 ggcgcatcct ggagagtctg 
tcctcatagt gcccacggag tgatagagtg atagctgagc 1441 cagccctggt gataatgggc atcgagtctc actagctcca accagttgtg ggtgacagat 1501 cctacacatc catgtctctt ttctctgcag gccccatggg tccacctgga gaaatgccat 1561 gtcctcctgg aaatgatggg ctgcctggag cccctggtat ccctggagag tgtggagaga 1621 agggggagcc tggcgagagg ggccctccag gtgagcaggg tggggcaggt gggcagtgga 1681 aacatgggca cagcgaccct gaagtcagtt acacggggat gatggggatc agacaaaccc 1741 tacaggttcc ccaagggcat ttggctcaac ctaagtaaga gaggataagc ttgagggaga 1801 tagctgaggt gtctggggag tgtggtcaca attcagggaa aggcaggtgt gggaagtcct 1861 ccgtgcctca tgaccaccga tggggacaca ctgagtcagg tgtgggatga gggacagcac 1921 tgggaggcag gggaggcatg tcctgggatg gaggccctgg ggctgtctga agggtgaatg 1981 cggacgaggc atccagacag acggtgtgat caggagcccc acagacagag gggaactttg 2041 aagctcagag cggtaagcaa gtccatcagg gcagtgcaga gagcatcatg cttgcccttc 2101 ggtcggaggg tgcgggagag ggacttgccc cacagaggcg ggcagacaga acccctcgag 2161 gacaagagca ggaaagagga caaggggtgg gggtctcagc aggggcaagg cttcactaaa 2221 gaatagggga ccacgggtct gagacacact ggaatcttgt ggaccctctg agcctaggtc 2281 tggtggcgcc taacagcaat gaaagggcag agttccagga ttgcagatgg caaaacacct 2341 cgtggcagca agtgggagtc ttcactggcc tgcccctcct tctgtgtggg gcactctcca 2401 cagggcttcg agctcatcta gatgaggagc tccaagccac actccacgac tttagacatc 2461 aaatcctgca gacaagggga ggtaagggga ccccctgggc tcacggggta ggagtttccc 2521 acaaattccc ctcattctca gcaccagctt ctagaacata gagattacaa ataggcatgc 2581 acatgcaggt cttggggaaa ggaattgacg cttgcttttc ttgatgtctt ttgaatggcc 2641 cagaggagac agaagcagac acaattcact tccccgattt cataggaaag caagttctct 2701 atctgccttg ctttccactg aattcacagg aaattgcacc atttctggca ataagtaatt 2761 gttacttagg tgaatgaata aatggaggag agtctaaaag tgaatttaga aaactgcaat 2821 tggaagagga agagaagaca cagagagagg cagagatgga gagactgggg agaatctggt 2881 agcagagacc ccaggtgagg gaggtggctt agagacaaag tggtcagtgg cctgacccgg 2941 actcctctgc tctcagccct cagtctgcag ggctccataa tgacagtagg agagaaggtc 3001 ttctccagca atgggcagtc catcactttt gatgccattc aggaggcatg tgccagagca 3061 ggcggccgca ttgctgtccc aaggaatcca 
gaggaaaatg aggccattgc aagcttcgtg 3121 aagaagtaca acacatatgc ctatgtaggc ctgactgagg gtcccagccc tggagacttc 3181 cgctactcag acggcacccc tgtaaactac accaactggt accgagggga gcccgcaggt 3241 cggggaaaag agcagtgtgt ggagatgtac acagatgggc agtggaatga caggaactgc 3301 ctgtactccc gactgaccat ctgtgagttc tgagaggcat ttaggccatg ggacagggag 3361 gacgctctcc ttgtcggcct ccatcctgag gctccacttg gtctgtgaga tgctagaact 3421 ccctttcaac agaattccac ttgtggctat tgggactgga ggcaccctta gccacttcat 3481 tcctctgatg ggccctgact cttccccata atcactcgac cagccttgac actccccttg 3541 caaactctcc cagcactgca ccccaggcag ccactcttag ccttggcctt cgacatgaga 3601 tggagccctc cttattcccc atctggtcca gttccttcac ttacagatgg cagcagtgag 3661 gtcttggggt agaaggaccc tccaaagtca cacaaagtgc ctgcctcctg gtcccctcag 3721 ctctctctct gcaacccagt gccatcagga tgagcaatcc tggccaagca taatgacaga 3781 gagaggcaga cttcggggaa gccctgactg tgcagagcta aggacacagt ggagattctc 3841 tggcactctg aggtctctgt ggcaggcctg gtcaggctct ccatgaggtt agaaggccag 3901 gtagttgttc cagcagggtg gtggccaagc caaccccatg attgatgtgt acgattcact 3961 cctttgagtc tttgaatggc aactcagccc cctgacctga agacagccag cctaggcctc 4021 taggtgacct agagccgcct tcagatgtga cccgagtaac tttcaactga tgaacaaatc 4081 tgcaccctac ttcagatttc agtgggcatt cacatcaccc ccacaccact ggctctgctt 4141 tctcctttca ttaatccatt cacccagata tttcattaaa attatcacgt gccaggtctt 4201 aggatatgtc gtggggtggg caaggtaatc agtgacagtt gaagattttt ttttcccaga 4261 gcttatgtct tcatctgtga aatgggaata agatacttgt tgctgtcaca gttattacca 4321 tccccccagc taccaaaatt actaccagaa ctgttactat acacagaggc tattgactga 4381 gcacctatca tttgccaaga accttgacaa gcacttctaa tacagcatat tatgtactat 4441 tcaatcttca cacaatgtca cgggaccagt attgtttcct cattttttat aaggacactg 4501 aagcttggag gagttaaatg ttttgagtat tattccagag agcaagtggc agaggctgga 4561 tccaaaccca tcttcctgga cctgaagctt atgcttccag cctcccactc ctgagctgaa 4621 taaagatgat ttaagcttaa taaatcgtga atgtgttcac atgagtttcc atagctttgg 4681 ttccaagaaa tatcacattt ctgtattttt gtaaatcaaa tgaactctga ctctgagccc 4741 cacttgcctg aagattggaa attcaatctc aggatgtg // 
LOCUS HUMPTH2 1156 bp DNA PRI 08-JAN-1995 DEFINITION Human parathyroid (pth) gene: coding region and 3'flank. ACCESSION J00301 NID g190702 KEYWORDS Z DNA; hormone; parathyroid hormone. SEGMENT 2 of 2 SOURCE human cdna of parathyroid mrna([1]) & fetal liver dna([2]). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 108 to 198; 302 to 913) AUTHORS Hendy,G.N., Kronenberg,H.M., Potts,J.T. Jr. and Rich,A. TITLE Nucleotide sequence of cloned cDNAs encoding human preproparathyroid hormone JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (12), 7365-7369 (1981) MEDLINE 82150870 REFERENCE 2 (bases 1 to 1156) AUTHORS Vasicek,T.J., McDevitt,B.E., Freeman,M.W., Fennick,B.J., Hendy,G.N., Potts,J.T. Jr., Rich,A. and Kronenberg,H.M. TITLE Nucleotide sequence of the human parathyroid hormone gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (8), 2127-2131 (1983) MEDLINE 83169834 COMMENT see comment for humpth1. the 3' noncoding region is 120 bases longer than in bovine pth mrna (see bovpth) and contains two aataaa signal sequences for polyadenylation. 
FEATURES Location/Qualifiers source 1..1156 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p15.2-p15.1" gene join(J00300:282..526,1..563) /gene="PTH" intron <1..107 /gene="PTH" /note="pth intron 1" prim_transcript <1..913 /note="pth mRNA" exon <113..198 /gene="PTH" /note="preproparathyroid hormone; G00-119-522" CDS join(113..198,302..563) /gene="PTH" /note="preproparathyroid hormone" /codon_start=1 /db_xref="GDB:G00-119-522" /db_xref="PID:g190704" /translation="MIPAKDMAKVMIVMLAICFLTKSDGKSVKKRSVSEIQLMHNLGK HLNSMERVEWLRKKLQDVHNFVALGAPLAPRDAGSQRPRKKEDNVLVESHEKSLGEAD KADVNVLTKAKSQ" sig_peptide 116..187 /gene="PTH" /note="prepeptide" intron 199..301 /gene="PTH" /note="pth intron 2" exon 302..>563 /gene="PTH" /note="preproparathyroid hormone" mat_peptide 309..560 /gene="PTH" /note="parathyroid hormone" BASE COUNT 371 a 188 c 210 g 387 t ORIGIN HindIII site, about 3400 bp downstream from humpth1;chr. 1 aagcttctcg tgaaaaccaa cccaattagt tagtattgca ttctgtgtac tatagtttgg 61 aatattaaaa atattttaaa atacctccat tttgcttatc cttttagtga agatgatacc 121 tgcaaaagac atggctaaag ttatgattgt catgttggca atttgttttc ttacaaaatc 181 ggatgggaaa tctgttaagt aagtactgtt ttgccttgga attggatttt taatgttgac 241 tttatcattt cgaagtgggg agctaatggg aagtggccct ctctgtttct cttcttccca 301 ggaagagatc tgtgagtgaa atacagctta tgcataacct gggaaaacat ctgaactcga 361 tggagagagt agaatggctg cgtaagaagc tgcaggatgt gcacaatttt gttgcccttg 421 gagctcctct agctcccaga gatgctggtt cccagaggcc ccgaaaaaag gaagacaatg 481 tcttggttga gagccatgaa aaaagtcttg gagaggcaga caaagctgat gtgaatgtat 541 taactaaagc taaatcccag tgaaaatgaa aacagatatt gtcagagttc tgctctagac 601 agtgtagggc aacaatacat gctgctaatt caaagctcta ttaagatttc caagtgccaa 661 tatttctgat ataacaaact acatgtaatc catcactagc catgataact gcaattttaa 721 ttgattattc tgattccact tttattcatt tgagttattt taattatctt ttctattgtt 781 tattcttttt aaagtatgtt attgcataat ttataaaaga ataaaattgc acttttaaac 841 ctctcttcta ccttaaaatg taaaacaaaa atgtaatgat cataagtcta aataaatgaa 901 gtatttctca ctcattgcaa gtatatcttt 
ttggttatca ctgataccca catgtttaca 961 ttgatcatga ctaggtagaa caatacaaag tattttttta gtcatgtgtt tcacatttgg 1021 atattttgaa catcaacgtt ttagtattac caaagtatta ggtttccaaa tcttcactag 1081 ctcaatactg ttgtcctttt ggtttcagga aaggaaataa aatgctcagc aaaaaaaggg 1141 ggcataaaag tggacc // LOCUS HUMRBPA 13646 bp DNA PRI 19-APR-1995 DEFINITION Homo sapiens retinaldehyde-binding protein (CRALBP) gene, complete cds. ACCESSION L34219 NID g598228 KEYWORDS retinaldehyde-binding protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13646) AUTHORS Intres,R., Goldflam,S., Cook,J.R. and Crabb,J.W. TITLE Molecular cloning and structural analysis of the human gene encoding cellular retinaldehyde-binding protein JOURNAL J. Biol. Chem. 269 (41), 25411-25418 (1994) MEDLINE 95014337 FEATURES Location/Qualifiers source 1..13646 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region 1688..1972 /rpt_family="Alu" exon 3953..4064 /number=1 gene join(4053..4064,4335..4463,5702..5906,7788..7966, 11118..11276,12210..12320,12576..12734) /gene="CRALBP" CDS join(4053..4064,4335..4463,5702..5906,7788..7966, 11118..11276,12210..12320,12576..12734) /gene="CRALBP" /codon_start=1 /product="retinaldehyde-binding protein" /db_xref="PID:g598229" /translation="MSEGVGTFRMVPEEEQELRAQLEQLTTKDHGPVFGPCSQLPRHT LQKAKDELNEREETREEAVRELQEMVQAQAASGEELAVAVAERVQEKDSGFFLRFIRA RKFNVGRAYELLRGYVNFRLQYPELFDSLSPEAVRCTIEAGYPGVLSSRDKYGRVVML FNIENWQSQEITFDEILQAYCFILEKLLENEETQINGFCIIENFKGFTMQQAASLRTS DLRKMVDMLQDSFPARFKAIHFIHQPWYFTTTYNVVKPFLKSKLLERVFVHGDDLSGF YQEIDENILPSDFGGTLPKYDGKAVAEQLFGPQAQAENTAF" intron 4065..4334 /number=2 exon 4335..4463 /gene="CRALBP" /number=3 intron 4464..5701 /number=3 exon 5702..5906 /gene="CRALBP" /number=4 intron 5907..7787 /number=4 exon 7788..7966 /gene="CRALBP" /number=5 intron 7967..11117 /number=5 repeat_region 8534..8875 /rpt_family="Alu" repeat_region 
9991..10277 /rpt_family="Alu" exon 11118..11276 /gene="CRALBP" /number=6 intron 11277..12209 /number=6 exon 12210..12320 /gene="CRALBP" /number=7 intron 12321..12575 /number=7 exon 12576..13130 /number=8 repeat_region 13611..>13646 /rpt_family="Alu" BASE COUNT 3374 a 3260 c 3479 g 3533 t ORIGIN 1 agctccacag aggacttggg acagaaacag cccaagaccc agacagcagc cagagtagga 61 gcccagggaa agggcttttg gaaacaaaga gctcaaggtg actcaagtgc accagaagat 121 ctaagcacca ggcattgcag cctgctctgg aggaaggggt cggcaggagg tggggtggtg 181 ggaagaaatc ccaaacctgc caaagtagga atgggtggaa aagcaagaac tatgtccagg 241 gaaaataaaa aagcacttct ggggttggat tctggagagc gacctgtgca ggggagcttg 301 accttctcct gggcacctct aggaggcaaa accaggacca agggtggtca ttataggaag 361 ctgcttgggg ctcaaaccaa caatgaggtt ttaaacaagg agagctcccc agtaatggaa 421 acagggagct tcccgccctt ggaatcctag acagaaaatg ttctacagga attacctcct 481 cctcatccag tggaatagca aaacagaggc tgggaagtta cctgttgaga ggctgcagag 541 gggtttcttc ctgtacaaag gattgaataa taagttccct ccagttaggc acctattgat 601 actgtatttg gagtaactga cctttccaga ggacttattg tcaggcagct aaacccctga 661 actaagcact ttactgccat ttcatccgca ctaaagtcat gagcctggtc ctctagtttt 721 ctacacttct tagaggagga aactgagccg atgagttttg aagtcatgtt tcccggccag 781 aagatgctcg agcgaggctt tacaggcggg cagtctggtt ccagggcatg ccactgcacc 841 acactgccca gaaggaggct cagcctgtct tgtcctctcc ctgcttggcc ttaaccagcc 901 acatttctca actgacccca ctcactgcag aggtgaaaac taccatgcca ggtcctgctg 961 gctgggggag gggtgggcaa taggcctgga tttgccagag ctgccactgt agatgtagtc 1021 atatttacga tttcccttca cctcttatta ccctggtggt ggtggtggtg gggggggtgc 1081 tctctcagca accccacccc gggatcttga ggagaaagag ggcagagaaa agagggaatg 1141 ggactggccc agatcccagc cccacagccg ggcttccaca tggccgagca ggaactccag 1201 agcaggagca cacaaaggag ggctttgatg cgcctccagc cagcccaggc ctctcccctc 1261 tcccctttct ctctgggtct tcctttgccc cactgagggc ctcctgtgag cccgatttaa 1321 cggaaactgt gggcggtgag aagttcctta tgacacacta atcccaacct gctgaccgga 1381 ccacgcctcc agcggaggga acctctagag ctccaggaca ttcaggtacc aggtagcccc 1441 aaggaggagc 
tgccgacctg gcaggtaagt caatacctgg ggcttgcctg ggccagggag 1501 cccaggactg gggtgaggac tcaggggagc agggagacca cgtcccaaga tgcctgtaaa 1561 actgaaacca cctggccatt ctccaggttg agccagacca atttgatggt agatttagca 1621 aataaaaata caggacaccc agttaaatgt gaatttccga tgaacagcaa atactttttt 1681 agtattaaaa aagttcacat ttaggctcac gcctgtaatc ccagcacttt gggaggccga 1741 ggcaggcaga tcacctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc 1801 catctctact aaaaatacca aaaattagcc aggcgtgctg gtgggcacct gtagttccag 1861 ctactcagga ggctaaggca ggagaattgc ttgaacctgg gaggcagagg ttgcagtgag 1921 ctgagatcgc accattgcac tctagcctgg gcgacaagaa caaaactcca tctcaaaaaa 1981 aaaaaaaaaa aaaaagttca catttaactg ggcattctgt atttaattgg taatctgaga 2041 tggcagggaa cagcatcagc atggtgtgag ggataggcat tttttcattg tgtacagctt 2101 gtaaatcagt atttttaaaa ctcaaagtta atggcttggg catatttaga aaagagttgc 2161 cgcacggact tgaaccctgt attcctaaaa tctaggatct tgttctgatg gtctgcacaa 2221 ctggctgggg gtgtccagcc actgtccctc ttgcctgggc tccccagggc agttctgtca 2281 gcctctccat ttccattcct gttccagcaa aacccaactg atagcacagc agcatttcag 2341 cctgtctacc tctgtgccca catacctgga tgtctaccag ccagaaaggt ggcttagatt 2401 tggttcctgt gggtggatta tggcccccag aacttccctg tgcttgctgg gggtgtggag 2461 tggaaagagc aggaaatggg ggaccctccg atactctatg ggggtcctcc aagtctcttt 2521 gtgcaagtta gggtaataat caatatggag ctaagaaaga gaaggggaac tatgctttag 2581 aacaggacac tgtgccagga gcattgcaga aattatatgg ttttcatgac agttcttttt 2641 ggtaggtact gttattatcc tcagtttgca gatgaggaaa ctgagaccca gaaaagttaa 2701 ataacttgct agggtcacac aagtcataac tgacaaagcc tgattcaaac ccaggtctcc 2761 ctaaccttta aggtttctat gacgccagct ctcctaggga gtttgtcttc agatgtcttg 2821 gctctaggtg tcaaaaaaag acttggtgtc aggcaggcat aggttcaagt cccaactctg 2881 tcacttacca actgtgacta ggtgattgaa ctgaccatgg aacctggtca catgcaggag 2941 caggatggtg aagggttctt gaaggcactt aggcaggaca tttaggcagg agagaaaacc 3001 tggaaacaga agagctgtct ccagaaatac ccactgggga agcaggttgt catgtgggcc 3061 atgaatggga cctgttctgg taaccaagca ttgcttatgt gtccattaca tttcataaca 3121 cttccatcct actttacagg 
gaacaaccaa gactggggtt aaatctcaca gcctgcaagt 3181 ggaagagaag aacttgaacc caggtccaac ttttgcgcca cagcaggctg cctcttggtc 3241 ctgacaggaa gtcacaactt gggtctgagt actgatccct ggctattttt tggctgtgtt 3301 accttggaca agtcacttat tcctcctccc gtttcctcct atgtaaaatg gaaataataa 3361 tgttgaccct gggtctgaga gagtggattt gaaagtactt agtgcatcac aaagcacaga 3421 acacacttcc agtctcgtga ttatgtactt atgtaactgg tcatcaccca tcttgagaat 3481 gaatgcattg gggaaagggc catccactag gctgcgaagt ttctgaggga ctccttcggg 3541 ctggagaagg atgccacaga ggaggagaga ttgccttatc ctgcgatgat catgtcattg 3601 agaaccaggc cagatctttt tcctgcaggc aacttgtttt aacatctaag gactgagcta 3661 ttggcttggc ccctttgctc aagcaggttt ccaagctatt tgtgtctgtg ccctttgtcc 3721 aagcagtgtt tcccaaagtg tagcccaaga accatctccc tcagagccac caggaagtgc 3781 tttaaattgc aggttcctag gccacagcct gcacctgcag agtcagaatc atggaggttg 3841 ggacccaggc acctgcgttt ctaacaaatg cctcgggtga ttctgatgca attgaaagtt 3901 tgagatccac agttctgaga caataacaga atggtttttc taacccctgc agccctgact 3961 tcctatccta gggaaggggc cggctggaga ggccaggaca gagaaagcag atcccttctt 4021 tttccaagga ctctgtgtct tccataggca acatgtcaga aggggtaagt accctgtcct 4081 ccagggctct cctccaccca ccttccctct cttctctctc ttccctctcc ctccctcctt 4141 acttctctat cttatgctct gcccgggaat ctcggcacac acttctataa ccccctgtgt 4201 gggtcctgac cccacaaaag gaggaggctt aacaatgacc ttgggcaaaa gaactgaagg 4261 tctgagcagg cccatttccc tggctggctc aggctgatgc ggttggctgt tcctcttttt 4321 ccttctctgc ccaggtgggc acgttccgca tggtacctga agaggaacag gagctccgtg 4381 cccaactgga gcagctcaca accaaggacc atggacctgt ctttggcccg tgcagccagc 4441 tgccccgcca caccttgcag aaggtgaggt ggctgacagt gcggagagga gcagcctagg 4501 gaggcaacat gaggggaaca tgactgcatt ctctctcctg tgaaaagggt ccagcctcgg 4561 cactccagat catcctggac ctgatagact ggcctgttct aaggtccttc ctggtggtca 4621 cggctatggc agcaactggg ttgaccagac acccagccag gccagcccaa cccaagacta 4681 cagcccagag cctcggaact agggccctct aaactagact caatatgaga agcattggtt 4741 tggaaatggc ccggggaaca aggatccaat tgctgcccct gctagacagg caagagactt 4801 gagttccaga accagaccta ctttcagctt 
gctgtgtgac cttaggcaga ttgctttcca 4861 tctctgggtc ttgccttctg tatctatgaa agacggttgg tggactagct cagaggtgat 4921 cagattgtgc tctgtggagg ctgacagagc tgtcttaggg ctgctgggga ggatgagaag 4981 tggctgagct gagcctgggc acccagcttc catctctgct tccttgggca tctcaaggga 5041 aaggtgctat tagctccttc acactttact ccccctcctt tcccttccct cctcatgctg 5101 tccaaggaga gagctgcctg agggcacgga ggagttctcc tgacccagca gaggggagga 5161 atgtgaggct ctgggcagga ggcaggctgc catctgccag agagccgtgt ttttgaaaaa 5221 atcagaatca agcctgggag gcagcagagt gaaatgatga aaagaggagt cccagctcca 5281 ctcctttctc tctgggtgac cttgagcaag aatatcttca cctttatgag cctcagtctg 5341 ctcatctgag agctggggat aagaacacct accatgatgg tctgtggcaa agattaaaaa 5401 acacagtgca tttaaaggac ttagcaggtt gctaggcaaa ggctcagtga gtgtaggcag 5461 tcatttttag aaacttttag actgttggcc atattctgca aatatgcgaa ttctctccac 5521 acacaacagt ctcaggtgtt gccagtgcag atgctgcccc tggggcagaa aagcaaagct 5581 gcggctgcag acccctgggc ccccttggct ggtctagctc tggcaggaga ctcatcacct 5641 gtgtgtcctg cccctcaccc gcacctaagt ttaacttcat gcacacccca ccctgttcta 5701 ggccaaggat gagctgaacg agagagagga gacccgggag gaggcagtgc gagagctgca 5761 ggagatggtg caggcgcagg cggcctcggg ggaggagctg gcggtggccg tggcggagag 5821 ggtgcaagag aaggacagcg gcttcttcct gcgcttcatc cgcgcacgga agttcaacgt 5881 gggccgtgcc tatgagctgc tcagaggtga ggccagtgct tggagggacg gccctgggaa 5941 gacgggaggc catgaggatg ctatccgctc tcatcctggc actgaacagg cctgcttcca 6001 tactgtgggc aagtcagttt ctctgagcct ccacatcctt acctgtaaaa tgggtcagct 6061 ctgataatga gcgaacgaat gaatgaatga agtgtgtcag gagcagattt ctggaggtct 6121 gagtcctctc aactctctcc cagtcttggg attcaatgag taaaacttgg gtaccccatt 6181 tgggtgacga ggggaggaat ttcccctcca gacccccacc atgagaacct aagctttggc 6241 tctggaatgc tgcaggaagg tggatagtgc ctctctcagg gcagttacct aagcctcatg 6301 tgcagtggaa ctgaagatgg tgtcagacct gggaatgaac cccggggaca gaaccgctct 6361 ggcactgctg gctggatgaa gacaggatgc ctcaagaaat catcggccgg ttgcggtcgg 6421 tcacgccgta atcccagcac tttgcgaggc caaggcgggc agatcatctg aagtcggggg 6481 tttgagacca gcctgaccaa cgtggagaaa ccccatctct 
actaaaaata caaaactagc 6541 caggcgtggt ggcgcatgcc tgtaatctca gctactgggg aggctgaagc aggagaatcg 6601 cttgaacgca ggaggcagag gttgcggtga gctgagatcg tgccattgca ctccagcctg 6661 ggcaacaaga gcaaaactct atctcaaaaa agaaagaaag aaaaaaaaaa agggggtggt 6721 tccctcaaga aatcgaatct tgggcactgg tggacggaaa tggggtctga gtgggccctc 6781 tgcaggctaa tgattttcct tttcctactt cttccttctc caaatggtaa ttcagggcca 6841 tgctgtgcca agaactgttt taggccaagg gatgcagtgg agacaaaata gacaagttcc 6901 ctgccctggt ggaactcata ttctaatggt ggaaacaata catccaataa ataaagtaaa 6961 attatatagg gttaaataat ggtgaggaat aaggaaagaa attaaggagg tggatataaa 7021 gtcttgaggg gctgaagttt tagataaagt ggccagggaa gtcccagtgc aaagggaatt 7081 ttgaagaaaa acccgaagga aatgagatat ctgtggggaa gagcagccag ggcagaggca 7141 acagcaaaga caaaggccag gggttgggag gtgcccagag tgttccagag cagtgagcaa 7201 gttgatgtgg cagtctcttg gcagaggtgc ccaggtcctc taggggctca aatttggtgg 7261 tgcaggccac agaggtgggg aactgctcag gatgggtaag gatggtgttt gtggaccgta 7321 gaggtgctgc tggaatcagg agctctgtgt cctgcttggg gtttggattt tacttgggag 7381 cttctcctca gccctgtgca cacctcacca ctcagcttat cctgtgccca gctgaaaaat 7441 ctgtgtcttg tccccatggg gaagaaaata acaagtaatg attatcctca tatttaagac 7501 tatgcctcat gtttgcccaa gcttatgatg atgtttacac cttgaatctc atttgattgc 7561 cacaacaatt tggtgagata ggccactatc tagactatct agaatagata ggtcaatatc 7621 gagatcccat tttttggaga ggaaacatgg ggctcggata ggttgagtag ccagctcggt 7681 cacacatagg acctggcagg aggccaggtc ttctgagtcc cactaggagg gatggggtag 7741 ggatgggatg gctccctgag ggggttcctc tgctctcctt cctccaggct atgtgaattt 7801 ccggctgcag taccctgagc tctttgacag cctgtcccca gaggctgtcc gctgcaccat 7861 tgaagctggc taccctggtg tcctctctag tcgggacaag tatggccgag tggtcatgct 7921 cttcaacatt gagaactggc aaagtcaaga aatcaccttt gatgaggtta gtgggggctc 7981 gcctggctcc caccctgtgg gctgggttcc ctccctcaac cctggcctct actgggccct 8041 cattcctggt tctccaggag gcaaggtctg tagaggggga aaggtgggca gccacctgcc 8101 acctagccct ttccaggcag ggagagaatt cctaggatgt gccagggcag ctggaaaacc 8161 cagtagcttt gcaggtggag cttgatggtc aataatacat ggctcttcct 
gggtatccct 8221 gccaagtgcc aagtgggggt aacagctgtg tgccatgagc tgggggctgc cctgccacat 8281 gctaaatgct gagagcaaca gagagaacag ggattagtgg actataggct ggaagagatt 8341 ctggaaagca caggttccaa tctatttagg agcccgagtt ctttgtgctc atatgtaccg 8401 caattttgct tgtctgagcc gttaaggggg ctgccccaag ttggggaatt agggtttgtc 8461 ttgggagaac tttggcatgc gcactgagcc tgaaaaagat tcctttaatt taaaaaaaaa 8521 aactgctaca aacttatttt attttattta tttatttttg agatggagtc tcgctctggt 8581 cacccaggct ggagtgcagt ggcatgatct tagctcactg caacctccgc ctcctgggtt 8641 cgagcgatcc tcaagcctca gccttccaag tagctgggat tacagacgca cgccaccatg 8701 cccagctaat gtttgtattt ttagtagaga cggggtttca ccatgttggc caggctagtc 8761 tcaaactcct gagctcaagt gatccgcccc acctcggcct cccaaagtgc tgggattaca 8821 ggcatgagac accgcacctg gcccaaacta attttagata atgctatcta gttgtgccca 8881 acacaattta ttaaaagtcc atcttttttc aatgacttga aatgccatct ttatcatgtt 8941 ttatattttt cgtatgtgct tggtctattt ctggactttc tattctgttc cattggtctc 9001 tttatctatt catactccaa tatcatactg tttcgatgat agagattttg tggcatgttt 9061 aatctgttag ggctagtttt gtccttagtg atctgctttt tggggggctt ttctagctac 9121 ttttgcatgt ttatttcccc ctatgatctt cagaattaat ttgtctagct ttaggagaaa 9181 tattaggagt atgttttggg gataatatta aacataaata acaagcttgg ggagagctgg 9241 cttctttcag aatgtcacat tcctggccag ttgtcctatc cacaaatgtg agatgccttc 9301 tgttttcttc aagcctactt ttgagttttc aggagtgttt taaagatttc ttcataagct 9361 atgactgtgc tactgcactt catctggggg gacagagaga gactctgctt ctgaaaaata 9421 aaaaggtttt aaaaattttt tcctcacaac gttgtgcaca tttttcatta aatttgtctt 9481 aggtattaat tttgttgtca ttgctaaaaa tggggctctt tcttccatta tatcttctag 9541 ttggttattg tttgtataaa tgaaggttat tgtttttgtt ttttaattgt atatccccct 9601 attttactga actcttttaa aatttatgtt agttttataa tggattctct gggggttttc 9661 taagtatact ataatactat ctgtaagtag aggagatagt ttttcttctc cctttccact 9721 tcttatgccc ctgatttttt tttttttaac aaattgattg cattggttag tactttcaat 9781 atagaattaa atggtggcag agatggtgaa gtctttatct aattcctgat cttagtaagg 9841 atgcctctag ggtttcccat caaataagac actggctgta ggactgaaat gcacacctgc 
9901 ctatctgtgt gtattaagga agagacttaa atatatgttt tttcttatta cgtaagtgat 9961 atctactcat tttagaaaat acaaaaaatt ggctgggcgc agtggctcat gcctgtaatc 10021 ccagcacttt gggaggccaa ggtgggtgga ttgcctgaag tcaggagttc gagtccagcc 10081 tggccaacat ggtgaaaccc tgtctgtctc tactaaaaat acaaaaatta gccaggcatg 10141 gtggcaggca cctgtaatcc cagctacttg ggagactgag caggggaatc gcttgaacct 10201 gtgaggcaga ggttgcagtg agccaagatc gcaccactgc actccagcct gggtgacaga 10261 gcaagactcc atctcaaaaa aaaaattaca aacaaattta aaatcatttg caattctact 10321 acctagagag ctctactatt gacatccttc caatcttttc atacatgtat gcacgcacaa 10381 tacacacaca cacacacgcg cacacacaca cacatttttc tttttttttt ttttttgaga 10441 cacagttttg ctcttgtcgc ctaggttgga gagcgatggc gtgatctcgg ctcactgcaa 10501 cctctgcctc tcaagttcaa gtgattctcc tgcctcagcc tcccaaagta gctgggatta 10561 caggtgcctg ccaccatgct cagctgattt ttttgtattt ttagttgaga tggggtttca 10621 ccatgttggc caggctggtc ttgaattctt gacctcaggt gatccacctg cctcggcctc 10681 ccaaagtgct gggattatag gcgtgacgac cgcacccagc ccacacattt ttcttacaat 10741 aatgggatta tattctacaa taatatatca tgcattttcc tgtattatta gtcttctgta 10801 ttgtttttaa ttgttccaga gtattccatg gcataaatgt accataattt caccagttcc 10861 cattgttgga catttaggtt gtctccaatg ttcccactat tgcaagagta ctgggataaa 10921 ccttctggcc tctaagcatg gatgtgacat ttgaacttgt tttcttggta tacattcaca 10981 gaagtggaat gtccgagatc aaagcctgca gcaaattacc ccagtgaaga atgtaccaat 11041 tagctctccc ctcaggacct caagccttaa ttattgttgc gggttagatc ttacagcctg 11101 ttctctctgt gcccaagatc ttgcaggcat attgcttcat cctggagaag ctgctggaga 11161 atgaggaaac tcaaatcaat ggcttctgca tcattgagaa cttcaagggc tttaccatgc 11221 agcaggctgc tagtctccgg acttcagatc tcaggaagat ggtggacatg ctccaggtga 11281 ggcttagagt tctgtcgcac ggcagcctcc cacagctggg gaggctgaga ggctgggcct 11341 cctttcatgg tgcttgcagg gaactgagaa gttggaccac ctcgcgccat aaagagcagt 11401 gacagccaca gtcccaacca taagcacctg catagtgatt gacagtttgc caagtattcc 11461 ctttcattct tccatcgact ccttgagagc attctctccc tccttccctg cttcattttt 11521 caaacctgga aacgaaggct cagagattta agagacatct 
cgaggtcaca cagcaataaa 11581 gcggcagagg ccagacttga acctgcatct tctgacctgg aatcccatgt tgtctgtcca 11641 catatgtcgc actgcctctc tttgaatgtg gattcacccc gagaaaggga ttatcactaa 11701 ccctctgata actcactcct atatgtgctg acatgtcaga gtcctttctt agagccatct 11761 tagatctgct ctgtgtattg ggaaaggagg ggtgggaggt ggaggttgtg ggaggggctg 11821 atggacaatg agggatgata atactgttcc tgtcgtttgt aatagtttgc agtcccttcg 11881 gcaggcaagt gggtttgaaa aaactcacca aaaggtcaag caaggctact ttcatttctt 11941 ttcctaattc ccagtttaat ttgcatcagc agggagaggg ttgggtctta tgaatcttgg 12001 ggcttcctcc gtgcagtctg ggccacagtg agccagggat ccactagttg ctggcctgga 12061 aataggacct tcagtgggcc cttctccagg gcagccctgt gatgctggac aagtctgttc 12121 tatgccatga ggagaggcag cagggaatga gtgggagcct ctgaggtcac tggggccatg 12181 gggaacgggg tatctctctg tcaccgcagg attccttccc agcccggttc aaagccatcc 12241 acttcatcca ccagccatgg tacttcacca cgacctacaa tgtggtcaag cccttcttga 12301 agagcaagct gcttgagagg gtgagtacgt gggccgccca gacaatgaga caagagggct 12361 gagggtgggg cctcacctga gccctcctca cactctacca tggtcctgag ctatgtggat 12421 cccactgaca cccaacatgg agacctgcgg agagaggtgt cgccaggcaa agactgcaca 12481 tgatggggct ttgtcctcct gcccctttcc tccctcaacc ctcatcctta gggcacctcc 12541 tcctgctcag ttctgtctcc cttgcttccc tgcaggtctt tgtccacggg gatgaccttt 12601 ctggtttcta ccaggagatc gatgagaaca tcctgccctc tgacttcggg ggcacgctgc 12661 ccaagtatga tggcaaggcc gttgctgagc agctctttgg cccccaggcc caagctgaga 12721 acacagcctt ctgaaaacat ctcctgccag ctgaactgta gttagaatct ctgggcctct 12781 cctcaactgt cctggaccca aggctaggaa agggctgctt gagatgactg tggtcccccc 12841 ttagactccc taagcccgag tgagctcagg tgtcaccctg ttctcaagtt gggggatggg 12901 gaataaagga gggggaattc ccttgaacaa gaagaactgg ggatagttat atttccacct 12961 gcccttgaag ctttaagaca gtgatttttg tgtaaggttg tatttcaaag actcgaattc 13021 attttctcag tcatttcctt tgtaacagag ttttacgact tagagtctgt gaaaacaggc 13081 aaggagcccg ggttaaaata tccccctatt cgcccccaaa atgcaataaa agaagataaa 13141 agagagagga tatatgttca aatgatttgg caaacgcagc agtcctggtg gcctctaggc 13201 aggctgggag ttttccacac 
agaattccag ggtggaagag gcctggaagt ggcttcattc 13261 aggggtggcc tcctcccctt gctgaggtgc agtctgacgt gtgtggggtg gtcaggaact 13321 tggacctctt tccagggtgg cagtcatgtt ccctccatct gtctgggaga aggagtcttt 13381 ccaccaaaag ctgtctcctc cagtcccctt gcctggatgg acagttctac ttcgaattat 13441 aagtggcatc ttctctgagc actcactcag tcctcaagtt atagagttaa agctcttttt 13501 ttgtcttcac tctcaggcat tttacggtga tgaacttaca ctttccaaat ctgcattttt 13561 tttttttttt tttttttttg agacagggtc tcacttgtca cccaggctgg agtgcagtgg 13621 tgcaatcaca gttcactgca acctca // LOCUS HUMRCP 23822 bp DNA PRI 30-MAY-1997 DEFINITION Human gene for ryudocan core protein, exon1-5, complete cds. ACCESSION D79206 NID g1359444 KEYWORDS ryudocan core protein; heparan sulfate proteoglycan; syndecan-4. SOURCE Homo sapiens placenta DNA, clone_lib:lambda DASHII. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 23822) AUTHORS Kojima,T. TITLE Direct Submission JOURNAL Submitted (09-DEC-1995) to the DDBJ/EMBL/GenBank databases. Tetsuhito Kojima, Nagoya University School of Medicine, First Dept. of Internal Medicine; 65 Tsurumai, Showa-ku, Nagoya, Aichi 466, Japan (Tel:052-741-2111(ex.2203), Fax:052-744-2157) REFERENCE 2 (bases 1 to 23822) AUTHORS Kojima,T. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Takagi,A., Kojima,T., Tsuzuki,S., Katsumi,A., Yamazaki,T., Sugiura,I., Hamaguchi,M. and Saito,H. TITLE Structural organization and promoter activity of the human ryudocan gene JOURNAL J. Biochem. 
119 (5), 979-984 (1996) MEDLINE 96390006 FEATURES Location/Qualifiers source 1..23822 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /clone_lib="lambda DASHII" /map="20q12" /tissue_type="placenta" protein_bind 5..9 /bound_moiety="LBP-1" misc_feature 10..15 /note="E-alpha H box" protein_bind 68..72 /bound_moiety="LBP-1" protein_bind 103..108 /bound_moiety="SP1" protein_bind 108..113 /bound_moiety="H4TF-2" protein_bind 116..124 /bound_moiety="NF-kB" protein_bind 129..136 /bound_moiety="AP-2" protein_bind 133..138 /bound_moiety="SP1" protein_bind 148..155 /bound_moiety="AP-2" protein_bind 165..169 /bound_moiety="LBP-1" protein_bind 169..177 /bound_moiety="NF-kB" protein_bind 178..183 /bound_moiety="SP1" protein_bind 189..194 /bound_moiety="SP1" protein_bind 197..202 /bound_moiety="SP1" protein_bind 203..208 /bound_moiety="SP1" protein_bind 212..221 /note="GC-BOX" /bound_moiety="SP1" TATA_signal 234..240 /note="TATA box like (TATAGAA)" protein_bind 253..262 /bound_moiety="AP-2" misc_feature 261 /note="Cap site" exon 261..363 /number=1 CDS join(304..363,12753..12891,15607..15653,18112..18310, 21260..21411) /codon_start=1 /product="ryudocan core protein" /db_xref="PID:d1020390" /db_xref="PID:g1944492" /translation="MAPARLFALLLFFVGGVAESIRETEVIDPQDLLEGRYFSGALPD DEDVVGPGQESDDFELSGSGDLDDLEDSMIGPEVVHPLVPLDNHIPERAGSGSQVPTE PKKLEENEVIPKRISPVEESEDVSNKVSMSSTVQGSNIFERTEVLAALIVGGIVGILF AVFLILLLMYRMKKKDEGSYDLGKKPIYKKAPTNEFYA" exon 12753..12891 /number=2 exon 15607..15653 /number=3 exon 18112..18310 /number=4 exon 21260..23384 /number=5 polyA_signal 22555..22560 polyA_signal 22563..22568 polyA_signal 23365..23370 BASE COUNT 5548 a 5624 c 6307 g 6343 t ORIGIN 1 gatctctggg gacctgcctg gcagtgggtc aaataaataa agggagttgg agctcccgga 61 gggtaggact aggggttgag taggagccgg cgggctcggg cagggcgggt cccttggggt 121 ttccaactcc gcgggcggcg cagtgccccg caggcctcgc ttccactggg gaattccggg 181 cggggtgcgg gcggcggggc gggggcgggc cggggcgggg ccggtaggcc gcctataaga 241 tgggtggcgc gcccgcccgg gccactcgcc 
gcagcctgcg cgccttctcc agtccgcggt 301 gccatggccc ccgcccgtct gttcgcgctg ctgctgttct tcgtaggcgg agtcgccgag 361 tcggtgggtg cttggaggtt cccgggctgg gggcgaagcg ggggcgcagg ccggtgcctc 421 ctttgttcgt cggagcgtgg gatggggggg tcagatcggg ggtacgctac ccccaaccgt 481 acaccgaggc ccgggaaact ttgttggaaa ctttgctccg gggtcacggg ccagctccgg 541 gatggcttca cgcgccgtgc gcccctcgcc tgttgctctt cccgcctccc cgggcctcag 601 ccccgccgcg ggctacgggc tcgttagtga ctaagccggt gtcaactctt caactcccac 661 accctcgtcc cttccctggt gaccctgggg caggcttgga gcgctgaatc ccctcctcgc 721 tctcggggcg cccagagcag acagctttag gatccgagat ggccctgggg gtcggggggc 781 tgcgtgtact cggaaggggg agggttttag ggttgtgcga ggccctcttt cacacaccaa 841 ggagaactga gccctaacct cagttctggc cccagctctg tcattgactt gtgacttagg 901 gcaaaagtcc tgcccttctg aatctcttcc caatactgca ccaagggtct gagggaatgg 961 ggcaagaggg gacactgcgt tagggtttct agaaagttgg ggactctgct cttttcgagg 1021 acagaggaga ggaatggttt agactcaaca cttagccagg agctgagcct ctgctttctg 1081 caagaagtgt gttcattttt tctcaattgc agataagaaa attgaagcat ccaccttgag 1141 tgaggtgaag ggggtagggg ggagagaagg cctcaatcag cccagggaaa cctttccttc 1201 tcactgtcca ctggcctccg tcatagctgt ccctgggcca gcagaagctc tatccatgcc 1261 cgcagccggc ttaggaggag gggggcaatc tcatctggga agttgggggg catgggaatt 1321 actggtgaag gcaatctgtc ccccacagcc tgagctttgt gccccctttg tgccctttag 1381 ccccagtttt cagagcgagt gagtccttgc agtttaacca ttaatgttaa tttctttgaa 1441 agccttgggg ctcctgttcc tctgaattta cttagcggaa ggttgattct gcctgcaggc 1501 tcttcttgag gaatgaatga gaccctaggc aatacttcca gcacaattcc aggcatgcca 1561 tgatgattgc aaacgtggag cgcctttgtc ggggggccag acattgctct aataactttc 1621 taatgggtat atcaaggagc ttaattccaa caacaatctg actgtgtact gttcttaaac 1681 tggtcctgag gctagagagg ttaagtaact tgcccagggt cacacagtta atacacaata 1741 aatgggtgag tcagattgaa atttaggcag ccaggctttc aagtttctgc tttagcttaa 1801 cttctactct ttgtgctact ccaggtgtcc catcgttggt aactaaagac gggtttagaa 1861 taggttgaga ttttatgctg gaaggcaaag gaattctgag gtggaaggaa acaaggccag 1921 agtgaggtga tgacttaacc taaaccaaag gctaccttgc ctaaaatgtt 
agtggctgag 1981 gacccaagcc ttctgcctct agcacagtgc tctaaactag gccctgaagg atgtgtcggg 2041 tcaagcaact ggggaagcat ccgaaggata ccacctaggc agtacaggga aaaagaggaa 2101 aggacccagg aggttgctga ggtcaccgtg tgcccagtca catgccagtt tcctccaggg 2161 ctgctgagcc ttcaggtgct tcagggtgct gagctgtcag ctgtgtcctg ggggcattct 2221 gaaggatgta gtttggggga aggggactgt gtcagtcctg cctgggtgac ccatcagctg 2281 caggagacat cagccctggg cagctgcttc ctgagatagg tgtcaagtct catcctgacc 2341 tcagctctcc ccttcctggc taatgtcaca gacctcctgc ctgtaactgg ggcacagggc 2401 ttcccctttg gcctgtcccc tccctctttt ctagattgtg gttggaaaaa tcagacatag 2461 tcacggttgg ctcggactga agagatgatc cagcgtgtcc ttttcttttt gcaggtagag 2521 aaaagtgagg cccagggaga aggactttgc taatagcagt taggagtgat agagtacttt 2581 ttatatgaca gatctggtgc attttgtcct cacaaaaaga cctgtcacat ggggattcta 2641 ttatgcccac tttccaaatg tgagaggtaa aatggtacta ctttgggtta gtagagggca 2701 tccaggaccc caggatctct gactagtagc cctcccattg tgggtggtgt tcgcccgact 2761 gttccatcat tccccttacc acccccatat tttggaaggg aacccaggct cagtacccag 2821 ctgtcctctc ctctgtttgg ctgggcttgc tatactaaac cagttcttcc tgtccagctg 2881 ggagcattcc ctgatctgcc ttcctgccac tccctctcag gccaattaaa ggcagccttg 2941 ttttgggagt cccctccacc caaaggtgtt cctacccagg ggcacagcct actgacttgg 3001 ccccaggcca ggcggttgtg gggaagtgtc ccccacctat cacctatcaa gtgtacttta 3061 gcttaaggac atttctggtc ttctacagcg tcctcttctt gattacatgg gagtaggggt 3121 gggggcggaa cgtaggggct tctaggaccc ttgagtgaac agtgagagct cttgggactt 3181 cttgagccca gggagttatc aaacacccca gaaaatattt gggccatgat ttggagggtt 3241 ccgtgagttg gggggaggcc tctttccccg ctgggctgac atcccccacc ttaaaatgaa 3301 aggtttgaac agggtagcct ccagagtcct ttccatctct caatttgatt aataacttaa 3361 gtacctacta ttcaaaagag gtctctctct tgaaggaatt aacttgaggg aattaacata 3421 ctccaccaaa tgctgaatcc ctccctctct ccccccgcac accgagggca ggaactctgc 3481 tctatttgtt tttgtgaaat acctgtcccc tagtttgtac tcaggaaatg cttgtatgaa 3541 tgaataaatt cgtgcatgta actttattct aaatggttca ttaatgttat ttattgctag 3601 tatgagtatc tcccagtact gcgaggtacc attttctcta tttttacagg aaattgatgc 
3661 tcggaacaat gcagtggctt cctaaggtca gaaccaggtc cttctgatag ggcaaggtgt 3721 ctggtttgag tgtcctcaga atattccaga tgaggaaatt tcgctgggtt tgaaggtaga 3781 taccttaggt cctacttctg cgttgctggg tgaccttgag caaacatgcc ctgtctctgg 3841 gtctcagtgt ccccaactct aaaataagga ggctggacca ttgccttcca agggtccttc 3901 ctgcccagag agcccattga tgaggggagg ggccctttgc tggcctcctt ggtgaagagt 3961 ctaaacaaat cccagtctca gaagagaagt tggggtggcg gggggacatt cagctcctgc 4021 catccccagc tcctagaaac agagggcttt tccaaggact tggagtgctg agcctgcctg 4081 aatgaggagc tggggaagcc aggctgggct cccagcccag ctccctgttg ggagaaattg 4141 gctcctagct gtccttcaac ctcccggact ggacaggcga gtgtgatttc caaatgaatg 4201 cttaaaattg gggtaagggg ctggaccgag cgctgtgagt cactgcatgc tagcgtagcc 4261 tgcctgagtc acccatttcc tttcaaactc ttggctaata ggacagctct gtggtggggg 4321 gtgttggaat gagctcagag ttttaccttg tcctttggga gtcactgttt cagtgtccgg 4381 ggcctcgagg ggacatacag gacatgtttg tactaggtcc cgccactttc acagcccctt 4441 gcctgcatgt agactttgac attgtacatt gtgcagccag tcctcaaaat tgggctttag 4501 acctctgcag agcaggtagt acttttttcc tctttaaggc aaaactgagg ctgcaactgg 4561 cctgcatttt ttcagagagc aaaagctggt actgttcagg tttggtgtga ccccaggatt 4621 ttctgatgtt tgtgaggact cgtctttgct tcctggggct ggccagaggg cattgaaaca 4681 ttggcttggt gttacacaga cttaactcca gacgtgcgaa gtccacctct tactggctac 4741 atgaattcag tcatgctact ccacctctga gccccagcct cctggtctgt taagaagatc 4801 atgataccgg tgtggcgaag cttaaaggag acgacagggc tgtaaataaa ggcacctagt 4861 accatgcctg gtagggagga ggtgttactt agtgacagtt cccttccttg cccaggccac 4921 cttcatgcca gggggtccta tctctgaaga ttctgagccc aggtctcctg gaaagctttc 4981 tccatccccc ttatccccct tatctacccc cacagctggg aggtgggaag ggagaaatct 5041 agggtggggc ttttggagtc caaatctcct atttgtttat cttagaagtg ggctgtttgc 5101 taattatcga atgggtttat gtttaaacaa gaaccagttc tgggcagccc cacctctcct 5161 gctgggattt gctggagcct catgctgaac agtttgcagc ctggagggag agggggcagg 5221 gggtttgcca agggtatcag accactctgg acactgtcca ggacctgggg tcaccctcct 5281 gtgctggagg ggcagagttt ctacccttaa ggaggctgag tgattgcaaa tagcactttg 5341 
aggggtgggg tgttggtgga cagaaaaggt acagtgttct gaaaagccag tttctcgtat 5401 gttttcactg catggtgccc tagagaggga ggagagagaa cacatatgtc aacagttggg 5461 gtctcattta accttagaag aataagcctg acttcttggg cttgtttgtc attaactaac 5521 acagtggtga ccttgggcac attcttgcat ctcactgggg cctctctggt cccatctgct 5581 gaaggctggg tgactgaaaa agagggtaca gaaaactcca gcccccgtcc tagctctgct 5641 gctcacccag ggacacacac agttaatacg tcactttgtt gatgtgaact ccagtgtcct 5701 ctataaaaca cctgtggcac tcaaaggtca tcatcgctgt ttggcaaact tgtaaagttc 5761 tggctttatt agcacctaga caagggttct tcacccggcc agagtttggc tttggggagg 5821 tggtgtctgt gcatatgttg aaaatgtaaa ctaagagtta cagttattgg ggtttagacc 5881 tttttatcct tttcaggggg ctgcagtact ccccaaaagg tcactctgat ctcagcagtt 5941 ctttctggct ttgacctttc tacagctatc cttcctccct cccccacttc ccagccttgt 6001 tcttgcctcc tgcttccccc aacccccacc ttcagcccag accttcctat tcagcggccc 6061 ccaccccttc aggctgcatc tcacccctcc ccctgtcctc caggcccggg agctcggctg 6121 ctccagtttt ctctggcaca gtagaagagg ctgctggtca ggtgacacct ggggtaatgg 6181 aaaggggagg cagggagagg ctggtatgtg tggaaacagt gacttggtga agcccagcag 6241 tcagtggcca ggcctgcggg gactggcggt gtcactctag cctctgggcg tgggggcaga 6301 tgtggcacat ggctggcccg gctacccaga gtggggatac tccttgcctt ggagaagccc 6361 tgccggagcc gtctgtggga cagactgacc tggtctggag gatggcttcc ttgggggtcg 6421 gtgagggagg ctgggaagag gcaggaagcc agcacccagg gctgatctaa tcagctgaga 6481 taaggctgca gcgtgggctc tctactctgc tctgagaaca caggaggttt gtttacatcc 6541 cgagagcctc cctagccctc ggatccagca gggatttcgg atctgctgcc tagattacaa 6601 gctccaactt caatgcacct ctgtctctga ggccctgagg gagccagccc cctcctggct 6661 gtctccaccg gtaatcggag caatgcccag cttggttact gggctgggac agagggaggc 6721 ttgtctcttt gagacctgtc ttttacagat tggaaaactg aggctcagag aagggaattg 6781 tccacgatca tccagggagt tagtaacaag ggtgctgggt cagctcctgg cagggagaca 6841 tccagaggct cctgaaccct tcccccattt ctagctggca ccctaggatc ctggagttct 6901 tgctgtggga atgggctgcc ctgaggcttg gtgaaaagct ggttgcaggc agtgcaggcc 6961 tggctctctc ctgagtgatt gtgttcagag taacccgacc ttgaaggcga catttgaacc 7021 ctcactccac 
ccccaccccc agacctggtt taaccattca ggcaccagag caccagacca 7081 tggattggtg tgtagtttct ttttaccttc tagattttta tttatttatt ttgtccctgg 7141 ggacccaggt ccccaagtag aatttcaggt gtttctggtc actgtcattt gcaccttcgg 7201 ggaaaataaa aatggtcttt acctctgtct gcttaggaca ggtggtcaaa gctgtgtgac 7261 cttgggcagg tctctgacta tctctgtatc tttttttcac agtctgaagg gacctgattg 7321 gttgttgaaa gtctctgggc tcagaagcaa aatgataacc tattatagat tatattcctt 7381 tacagtttgc aaagcaccat ctccctgtcc ccaggctagc ttccttccag caacagaact 7441 gcctctgcaa gttttcccag gcctctgatc ctttgagcac tgatcccact ggccaggagg 7501 aaggcaggta ggggttaatc acagccacta ttcattgatc acgtgctggg tccttgcaca 7561 cacaaatgca ttcctcttaa tcctcatcac cctgcaaggt gctaccagcc ctagtcacaa 7621 aagaggaaac tgaggatttc agagatgaaa taaactccca agctcatata gttaggaagt 7681 ggcagaactc acacttgtga atctgccttg atgcacaacc actctgggtg gtagagtcac 7741 agttgtgggc cccaggtttt agccaggctg gggaatgtct ggcccttaag aagtgggtgg 7801 ggtggggaag aacagttacg agtagtgtac gctgctgggg gtctcctgct agaaatcatt 7861 ctggtgggtc caggtgttgg agccccaggt actcaccatc ccctctcccc actaaatttg 7921 gcttgccagt tattaccctt ctggtcttgc ctcctgaaag aagggtcaag tgtgtccccg 7981 accctacctc ccctgggaga gccaggtcgg gagaggctct cattagttca cagttatcca 8041 agccctgacc ctgaactcct ctctggtgcc ccagccaagt ttctgttcct ttgtttaagt 8101 gatatcactt tcacctttgt ttactcctag gcagggacag ggttgccctg gagccctggc 8161 ccagccagtg tgttgtggac tggcgggtta ggctggagag aagtgaagag tgggtggcag 8221 tgagaagcct agttgtggtt gggacgtgtt cttgaggaag atctggattt gaatcccagc 8281 tctagctttc tagttgcatg acgttggata agtgactcag ctgaacctca gtcttctcat 8341 ctgcaaaatg ggtagagcac cttgcaaggc tgttttgcca tttaaatgaa cttgtataaa 8401 caaagtaccc agcatggtgc ttggcatgta gtggatactc cttttagtca ctcatgcttt 8461 tcctggggtg atagaagcca taggatttgg ggatagggtt gggataggac cttttcgtag 8521 cttcatgcct atagccaaaa gactagatgg ggagtataac tgtaatgaca gctgctgcct 8581 gtggatttgc tgagaccctt aggggcagcc aacaccctgg aaggcgagag aagataattc 8641 cagtctggag ccaggatacc taggttctaa gtccatctcc gctgccagct gcttggatga 8701 ccttggcaaa atcccttgtc 
ttgtctgttt gctaggttat aaaatcagat accttctgtt 8761 ggcaggtgtt agtttctgta gaacaaaaga gcacttcccc tcccttcttt ctccccaaca 8821 gtctggggaa gaatgtagta tctctaaacc cccaggcact aatcccagat ccccaccagc 8881 cacagggcca gcagagtctg tgggacctag gcccattgcc ctatttttta ttttttggag 8941 acagggtctt cctctgtcac ccaggctgga gtgcagtggc acgatcgtag ctcactgcaa 9001 cctcgacctc ctgggctcaa gtgatcctcc cacttcagcc tcccgagtag ctgggaccac 9061 aggcgtgcac aaccacattt ggctaatttt tgtagagatg gggtttcacc atgttgccca 9121 ggctgatctc aaactcttgg gctcaagtga gcctcccacc ttggcctccc aaaatgttgg 9181 gattaagcca ctgtgcctag ccaccactgt cttacttagt tggtaatttc tgttgtgtgt 9241 tcatgaaagg gacaaagata caaggagact tgagagccca gagagggtgc ctgtgcatgt 9301 atacacacta acacacatgc cttgggcaaa ggtgggtgag ctgaggagaa cagaccacat 9361 tcttagccag gagcagggcg ggtccatctc tggtcagggc tgggcctggc tgctgggtgg 9421 cctggttctt caaagtcacc ccagactcaa tgggctttat ctgaaaagag ggcggaggag 9481 aggaggaccg ttggtgcctt cccaaccttt acacaaaaaa gagtgattgc ccacaatccc 9541 acggggcttg gtcccgtctt gctggcctag tcctaaatgg ctcttatcca ctttggagtt 9601 gccttccctc ttgtcagagg tcatgggtgg agaagggacc aaaacagggc agagaggggg 9661 cttccagagc tcaaggagag atttaattcc ctgtgtcctc ctatcaccac tgggagctgg 9721 aagaagtttc tttccagccc cttgacttgc tgtaggaggg aaatcctggg ctcatctaaa 9781 tgcagccttt gaagactcca tcttttcaga gctttgaaat aggatcgaat ccaggccgtg 9841 ccgcggagcc ccggggtgac ttcagactag actagtttct tttttggaaa ctgagtataa 9901 aaatgaaggg ttaaggatga acaggtgccc acaaagaggg ctgaactggg aataaatctt 9961 ggtttcagcc ttggttttgc tgctgacttg gctgcaagat cttcacgccc cactttcgct 10021 catagccttc atttctctaa tgtaaaacgg aggtaattcc taacagccag tgggcatgct 10081 aatcccatgg gttgttttga aatacctctt agcactttca catactgaaa gagaggctgg 10141 atgcataaac aaccttccat ggctcctggg ggcagtgagg ggtgggaaaa ggtctctcag 10201 cctgagacaa gtctcctgat ggaactacag cccctgttga ggactttgac ctggtcaaca 10261 gctggccaaa gtgtaccatt ctttctttct cccggctaga ttgacccccc tacttaacag 10321 ggctcccttg gagctggggc aggctggtga ccccgtgtac atatgtgttc atgcgtgtgt 10381 ttatgtgttt gtggttaaat 
gtccaggtca gtgaagcctg ggttctggcc cagtgtggct 10441 acttcctgct tgtgtggcct tggacaagtg actttacttt tctgagccct tgtttccatc 10501 tctgcaaaaa gggactatta aaaggaccta gacaggctgt gtgcttggtt aaggcctgtc 10561 acttgggttc ttgggggatt tgccacagga gatggaggta ggagcacagg gaccctgccc 10621 ttaggtatag gcacttgggc agccatgagg agccttcctc ctgctctgcc aaaccaaagc 10681 cacaggcacg ggctatgtgc gggggcttga attccagcac cagcagcccg gcagctcctg 10741 attcccgagt catgaagtca tctctgagca gcacttaacc tctctggctt tccaccccca 10801 cgggtgccaa gcgttcagca ttctccccac tccccgggag agagtgattc ctggccactg 10861 ccttccttgt ggcctgaccc cgctcccttc cgggaatcca gcattctccc tctgtggggg 10921 tggaagaggg tgcatgaggg tcaggttcca cctgcctctc cccagaagcc cagtggggag 10981 agtacaggag tggctctgaa gcagctttcc tgggcctctc ctgcaatgat aataacctta 11041 tcttagggac agatgttcct tctcagacac cctcctttgt caatggcagt ctcagctgag 11101 tgaaggactg cctggggtgt ccgaaacaga gacctgacct ctttctatcc tgagttatgt 11161 agcgaacgct ctgtgtgacc ttgggcaagt ccctcccctg ttccgggctc agattcaagt 11221 tgtgtgaaac gggaggacag gagctccttg ggtcctggca ttctgtgatt ctaagcagac 11281 ccccagctcc tgcagttatg gcgtctggag aagatgggaa tgtctttcag cgggaggggc 11341 atggtgtatt gaacttaatg aaaaacccca actctcctgg caaatactag gcactttagt 11401 gtttgaatta attagtagaa taatgaactt tgctcagagc tgctgttctc tgggcaaaca 11461 gaagcctgag cccagaagct ggaggaaggg tgatgggcat ccaaatgttt cctgtgctct 11521 tgagggtaca ttgttcccac tcggtggagc tacaggatgg gagcagggta actgatgtac 11581 tgtagggctg cccgggacct ttgacacttt cttttggcaa gcggtttggt gggagtggac 11641 ctgagactct gtcctgatca gctgtgtctc cacagggtag tggctgagtg atgattatgg 11701 gtactggagt ggatggtctg tgagggtagg gattgtgcct ctcggtgtct gcatggtgct 11761 ggcagcagag tagatctgtg ggagatgttt ggaaggcaag actgaatcca ggagtacact 11821 cctgagtcat caggtctggg cagcgccctg acctgaggct gtcttagggt gtgcgtgagg 11881 cagccctgtc tgtcccggcc cagactgact cagctgggaa aagtatcctg gactgggcaa 11941 gaccagaacc aggagcccac tccctgtcct gtgtgaatca gctgccactg catcacagag 12001 ccctggagtg tagcatccca gggccctgtg catggagact cctggctctg aagtcaggca 12061 
gccctgcgta tgcaatcctc gctcttccat ctgccagctg tgtcaccaaa agaaaatgac 12121 tccctcggct gtaaaaagaa gtgaataaca tgcctccaga gttattaaaa cagggcccag 12181 cacatagcaa gtgctcggta aaggatatct agccatatta ataatttgat tattacctca 12241 tttactgttt ttattttttt tgagacgggg gtcccactct gtagctcagg ctagagtgca 12301 acggcgtgat cctggcttat tgcaacctcc gcctcccggg ttcaagcaat tctcctgtct 12361 cagcctcccg agtagctggg actacaggcg taagccacca cgcccagctg atttttgtat 12421 ttttagtaga gacggggttt caccatgttg gcctggcagg tcttgaactc ctgacctcaa 12481 gtgatctgcc tgcctccgcc tcccaaagtg ttgggattac aggtgtgagc cactgtgccc 12541 agcctcatgt actattttta tttgcccaga atggaaagag acttgcctaa ggacacgcgg 12601 tgagttagag gtagagtggg atccaggacg caggtctcca ggccctggct gtctctttct 12661 agtttctgaa tgcccacttc actagctttt gggcatcagc tgtcatggag cactggggat 12721 gttggctgat gtgtctcctt tctttatctt agatccgaga gactgaggtc atcgaccccc 12781 aggacctcct agaaggccga tacttctccg gagccctacc agacgatgag gatgtagtgg 12841 ggcccgggca ggaatctgat gactttgagc tgtctggctc tggagatctg ggtacggaag 12901 gtgtgctggg caggcgtagg cacaaagctg gagggagtgg tggcttcacc agccaggagg 12961 gtgaccatgc cttgagactt ggatttttgt gggacttttc ctagagtgcc cttcttcttc 13021 cttctcaaaa aaaggggaaa caaaagtaat ggattaacct attccatccc ctgagagccc 13081 ctggggacaa gctgtttgct gctttgaagt cattggtagc tctgggtttt ctgagctcca 13141 gcctgaacgt gtcctcataa gctcttctct tttctgcagg gcatggtggg ggtggggtga 13201 gggtaggatg ggtggcagga cagggtggga gtggggaagg aggacccata gagtgttttc 13261 ctttttttga aaggaaaagt tccaccctgg gccacatggt gagaacttgt ctctacaaaa 13321 acacaaaaat tagctggatg tggtggcatg cacctgtagg agtcccagct acttgggagg 13381 ctgaggtggg acgatccctt gagcctagga ggttggggct gcagtgagcc aagatcatgc 13441 tactgcactc cagcctgggt gacagagtga gaccctgtct caaaacaaaa aaggaaaagt 13501 agcagcttag aagtggggat ggggtgggag ggggcatgag tgggcagaga tgtagttggg 13561 aaaccaagaa caagtccctg cttcagtggg ggtgggggcg ggtgaagggc ccaaggctct 13621 aggccagaca gctaataagt gtccctccta tgtgcagaga ggtgttaatg attgcaagtt 13681 ttagctttgc aagttttagc tttggagtca catggtcctg agttcaagcc 
tccatcctgt 13741 gtgaactgag cttcagtttt ctaatctgta aaatgggaat aataaagata gtacatcagt 13801 gttgtgggga ctgaactgac ttaaagcttt tggcacctac caagcactca gtacgtgtgt 13861 gtttggttta aaaaaaaaat aaattttatg gccgggcacg gtgctcatgc cgtgaatccc 13921 agcactttgg gaggccaagg caggaggatc acgaggtcag gagtttgaga ccagcctggc 13981 caacatggtg aaaccccgtc tctactaaaa atacaaaaat tagccaggtg tggtgtcgag 14041 tgcctgtaat cccagctact tgggaggctg aggcaggaga attgcttgaa cccgggaggc 14101 agaggttgca gtgagctgag atcacgccat tgcactccag cctggtgaca gagcaagact 14161 ctgtcttgaa aaaaaataaa aataaaaaaa taaatttcat tatgtgcata caacatgata 14221 ttatgggata catatagata gtaaaaatgt tactacagtg gagttaagta atatatccat 14281 catctcacat agtcgcccag gaaatgtttt aatattgcag ttagagtttt ctttctcaaa 14341 agttaattcc ctggggatct tgttaaaatg tagattttgg ccgggcgcgg tggcttacac 14401 ctgtaattga agcactgtgg gaggccaagg caggcggatc acaaggtcaa gagatcgaga 14461 ccatcctggc caaccaacat ggtgaaaccc cgtctctact aaaaatacaa aaatcagctg 14521 ggtgtcatgg tgccaccctg tagtcccagc tactcggggg gctgaggcag gagaatcgct 14581 tgaacccagg aggcagaggt tgcagtgagc cgagatggca ccacggtact ccagcccagg 14641 cgacagagag agactctgtc tcaaaaaaaa aaaagtagat tttgattcag tcagccctga 14701 aattctacat ttcttcttct ttttttttta accaatgaat tatttttact ctttttaaat 14761 aagtgaaata ttagctttaa tgttttctga tcatgacaat atttttagat aagaacattt 14821 taaacattca acagtaagag actattgaaa ataaatgaaa ttcattgaat agaagtaatt 14881 aaaataataa tgtaactctt taagcattgt aatggaaaga tgttaatgat atattgttac 14941 gagcccatta ttgggaaaaa tgtatttagg aatacgtatg gagggaattt atttatttat 15001 ttttttgaga cggagtcttg ttctgtcgcc caggctggag tgcggtggta ccatcttggc 15061 ccactgcaac ctctgccaac cgggttcaaa gtgattctcc tgcctcagcc tcccaagtag 15121 ctgggattac aggcgcgtgc catcacccgt ggataatttt tgtattttca gcagagacgg 15181 ggtttcacta tgttggccag gctggtctcg atctcctgac ctcaagtgat ctgcccgcct 15241 tggcctccca aaatgctggg attacaggcg tgagccaccg cgcctggcct tgaaattcta 15301 catttctaac cagctctcag gtgttgctat tggtttttgg atccacactt tgcagagcaa 15361 gggtttagag cagatgaagc ctctgcccag 
ctgccagctc acacattcct gtgaaagagc 15421 cagggggtgg gtctgaggag ccccatttta cagatgagat gactgaagta ggggtgggga 15481 agctcgcttg ctggacattg agcatttgga agctggttgt aaggtggagc tcccaccagt 15541 cctggctgaa ggggtcattt tcctggggta atggacctca ctcacacagc tattctgacc 15601 ttacagatga cttggaagac tccatgatcg gccctgaagt tgtccatccc ttggtaagta 15661 gctacatgct tctgcctctt ccactttgct cctctatagc agacctattg ggagaggcag 15721 aaaatacagc ccccataggc agaataagtg aggggtctta ccccactatg cgggaaggct 15781 ttttaaaaat ctggccctgg ggtgggcatg gtggctcagg cctgtaatcc cagcactttg 15841 ggaggcttga ggtcaggagt tcaagaccag cctgggcaac acgatgaaac ctgtctctac 15901 ataaaataca aaaattagcc aggtgtggtg gcatgtgcct gtagtcccag ctacttgaga 15961 ggctgaggtg ggagaatggc ttaagtccag gaggcagagg ttgcagtgag ccaagattgt 16021 gccagtgcac tccagcctgg gtgacagagc cagactgtgt taaaacaaac aaacaaacaa 16081 acaaatctgg ccccaggctc attttgtagg ttgctggtag gccatcctcc ctgcagggat 16141 agtcaccgtc aacaccaact ccttttctct acatttatag ctatttccta gcattgatag 16201 aaaagtatat atataggccg ggcacagtgg ctaatgcctg taatcccagc actttgggag 16261 gctaagacgg gcagatcacc tgaggtcagg agttcgagac cagcctggcc aacatggtga 16321 aaccctatct ctactaaaaa tacaaaaaat tagcctggca cggtggcgtg cgcctgtagt 16381 cccagctact tgattgggag gctgaggtag gaggatcgct tgaacctgag aggcagagat 16441 tgcagtgggc agagattgca ccattgcacc ccagcctggg cgacagagac tccctctcaa 16501 aaaaaaaaaa aaaaagtata tatatataat tctatgaact gcgtttttca cttagactgg 16561 tcatgagtat ttccctgcat aatttaatgc tcttgtcatt tttataggct gcgtaatagt 16621 ttacctgatt ccctttattg acggaaaaat ggcttataat ttgttaacat tttaaaatta 16681 taacactgca gcaaacatct tttttatttt tgcaaagcaa taacaagttt attaagaaag 16741 taaaggaata aaagaatggc tactccatag gtagagcagt ggcattggct gctggttgcc 16801 catttttatg gttatttctt gattatatgt taagcaaggg gtagattatt catgagtttt 16861 ccaacaaagg ggtgggcaat tcccagaact aggggctcct ccccttttta gaccatatag 16921 agtaacttcc tgactttgcc agggcatttg taaattgcca tggcactgat gggagtgtct 16981 cttagcatgc taatgtagta taattagcat ataatgagca gtgagaccaa cagtttcatt 17041 gccatcctgt 
ttttggtggt ttttggcaag cttctttatt gcaacctgtt ttatcagcaa 17101 ggtctttatg acctgtatct tgtgcagacc tcctatctca ttctgttacg taggatgctt 17161 aacttactgg gaatgcggcc cagcaggtct cagccttatt ttacccagcc cctattcaag 17221 atgtaggcac tctggttcaa acacctgaca ttttccccct cccttttgta agaaaaccct 17281 taatcctaag ggttgcagag ggacaaagat ccatcttcta taacttcttc atgctgaata 17341 gggtgatgat attcctgctt aactattagg gcctcttgta tccatggtag agaggggttc 17401 agtcagaaag ggccagtatg gtgagggcca ttcataactc ttagttctga caaaaggtga 17461 tatccaaagt cctccaatca gtgctgcagt ccatttcctt tgattcggga gtctcctccg 17521 tctcatccct tctgtggttc tccagaaaga tgttaccaga aaggggtccc gatccagacc 17581 ccaagggaga gggttcttgg atcttgcaca aggtagaatt cagggtgagt ccatagagta 17641 aagtgaaagc aagtttatta agacagtaaa ggaataaaag aatggctact tcataggcag 17701 aggagctgca gcaagcatct tttacacgta gtctctgaag agctccttac aatagagttt 17761 ccagggcaaa actgccacct taaagggcaa gcgatgtcta aggttttgcc aaattgcttc 17821 cagagtggtt gctctagaat aaccagtggc cagcagtgca ggagagcacc tgcttccctg 17881 ttcccttggg tgcattcatt tttcatttgg gacagatata ctaaaaaagt tggggataag 17941 gattttggca gcataattgt ggagacagtg ttgccaattc ctgctccagg accatatggt 18001 tcagctgaat atggcagaac cagattctct gcctggctga atgtccctgt cccctgccct 18061 gagtctcttc caaaatacgc tgagtgtctc ttctcctttc cgcccatcca ggtgcctcta 18121 gataaccata tccctgagag ggcagggtct gggagccaag tccccaccga acccaagaaa 18181 ctagaggaga atgaggttat ccccaagaga atctcacccg ttgaagagag tgaggatgtg 18241 tccaacaagg tgtcaatgtc cagcactgtg cagggcagca acatctttga gagaacggag 18301 gtcctggcag gtaagtccca tgctgcttat aagatgcctt gaaggtggaa tggggctcag 18361 cgggggagag cacctgcagg cagggatgcc tccagccatg aggctccttg gtgccccttc 18421 cttttgccta ttcaggttgc cctagaacat tgaaagacta caccttcctt atggggtggc 18481 tctgactgtg cagcctggtg gagggagagg aaaaagcacc tatcaaagtc ttctggaaaa 18541 taggcaattg agtcattctt ctgccttaag tctttctcat ttattttgca aaggactttc 18601 actgtataag tttggcatct gggagttaat cattaaaagt taatttccct tgtaagtctg 18661 gaggctcctt cgaattgggt tagcttcccc tccccctact ctatcacttg gcagccttgt 
18721 gaccttggct gagaagcttt cgaacttgat gagcctcagt ttccttatct gtaaaatggg 18781 tacagtgata ccttctgggg ttgatataat gagtccatga aaaataaaat atgaaataac 18841 tttgcacact ataaagggct attccgattt ggcctcagtt cagagttctt tactggaatg 18901 tgcggtgagg aatgctttgt cccaggtgtt gacaaaaggg atggagggaa ctccccaagg 18961 tcatggccga gggcagcctg gatgaaccgg cctggcaagt gggcaccctg ggcccatgct 19021 gggtaactcc tgtctcctgg gaatcaacag agccagcagc tccaaggagg cttgagctat 19081 agggacagag cctggcttca tccaggacag atggaaggtc tcacctgcct cttgtaaaga 19141 gggttcctgg gagcacagcc cctgatgact gggcccacct cagccctgac cctggcttcc 19201 tggtatctga gccaaagttc tttttacttt tctttcagaa gtaaaaagat ttgcataaga 19261 ctttggattt gcataaggtt ttgctctaat taactaaagg tgctattgct tctaaagaaa 19321 aatttgaaaa ccactgatta atctaagcac ctgcttctta tacatgggga gactgaggcc 19381 caggctttag gccacatagt aagaaaagaa ctgaagccag gttatctctt taatcttcca 19441 tttgagaatt atacaagcct aagagcctca tgtgaaaagt tatattgtta gctggtgtgg 19501 tggaatcccc cattccagaa gctttaatca gcacccagga gccttattaa atgcttgctg 19561 tatgctgtat gattcctgtg cccctgattg agtccgtaca acacaaaact cagtctaaag 19621 aacttatccg aagtcacaaa gctggaagtg gcagacctgg catttggact gaggaccaca 19681 gtcagcttct gagaatgtgc ttgaaacttg accctgtggg gcatcccagc gcagacccag 19741 ggcctcgtgg aggaactggg gtcatcagag ggaaaggtga tagagacaag aatggggttg 19801 atgcctgata ttccatgtgc ttgctctggc acctcctggg ggtacttttt tgttgctttt 19861 tcataggatt ttacccaaga aagaaccttg cttgactcct ctgtgccact ctgtccccat 19921 tgtgtacata gatttgtagt gtgtgcaggg atggaaaatt aatcttctta gcccgagtaa 19981 gaccgaatta gggaactcaa tctgccacag aagggattct atgaagcatc cctgccccta 20041 gcaaacagga atgagtcatt caggccacct ggcagagtgg acaggccaga cccactcact 20101 gttagaagcc catctctgcc caacactagg caggttctcc tctcggagcc tgaaagtatc 20161 atttattaag cacctcctgt tgtgcacacc tgattcaggg ggttcgggac acagatataa 20221 accttaaacc cttacagtta atgaatcttg agaatatgct atgcactagg cattgttcta 20281 agcactttga gtggattaat ttatttaatc cttaggacaa atgtatgaga aaggtatggc 20341 tcttcccatt ttgcggtagg gagatgaagg aaacttgccc 
caaatcacac agccaggaag 20401 taggagaggt aggagtggaa accaggcctt agctactgag ttctgtatgt aattgtaaca 20461 taagagtttg gaattagtat gttctgcatg tgtgcacttt gaatgtacat acctgtctat 20521 gaagtgtagg ctatataggt aaatatgcac acagggagag ctagagagtg ccctgtgcta 20581 aggactgcag gataaatatg tctacaggga tttccatagc ctacggtttt ctcctgttcc 20641 tggttcagtt agtgctagac tgttgcaggg gagtccgcgt ggtgtttgga aagagcctag 20701 gctttagatt caggcagatg tgggttaaaa tagtggcctt ggccgagtgc ggtggctcac 20761 gcctgtaatc ccagcacttt gggaggccga gatgggcaag gtcaggagtt caagaccagc 20821 ctggccaaca tagtgaaacc ctatctctac taaaaataca aaaattagcc gggcatggtg 20881 gcacgtgcct ataatcccag ctactcagga ggctgaggca ggagaattgc ttgaacctgg 20941 gaggtggagg ttgcagtaag ccgagatcac gccactgcac tcagctcggg caacagagtg 21001 agacttcgtc tcaaaaagaa aaaggagtgg ccttaccact agccctgtgg tcttcagtga 21061 cttaaaatgc caacgaccca cttcttataa ctggggtcat gaggtcaact taaataaggc 21121 atcagcttgc ctggcacagg cagtggtgat ggtgaggatg tctggttgta agagaactga 21181 cagtggggga aagaggggtt catccttagg tcctgatgag gagctctgac ccccgcctct 21241 tctctctcct cctctccagc tctgattgtg ggtggcatcg tgggcatcct ctttgccgtc 21301 ttcctgatcc tactgctcat gtaccgtatg aagaagaagg atgaaggcag ctatgacctg 21361 ggcaagaaac ccatctacaa gaaagccccc accaatgagt tctacgcgtg aagcttgctt 21421 gtgggcactg gcttggactt tagcggggag ggaagccagg ggattttgaa gggtggacat 21481 tagggtaggg tgaggtcaac ctaatactga cttgtcagta tctccagctc tgattacctt 21541 tgaagtgttc agaagagaca ttgtcttcta ctgttctgcc aggttcttct tgagctttgg 21601 gcctcagttg ccctggcaga aaaatggatt caacttggcc tttctgaagg caagactggg 21661 attggatcac ttcttaaact tccagttaag aatctaggtc cgccctcaag cccatactga 21721 ccatgcctca tccagagctc ctctgaagcc agggggctaa cggatgttgt gtggagtcct 21781 ggctggaggt cctcccccag tggccttcct cccttccttt cacagccggt ctctctgcca 21841 ggaaatgggg gaaggaacta gaaccacctg caccttgaga tgtttctgta aatgggtact 21901 tgtgatcaca ctacgggaat ctctgtggta tatacctggg gccattctag gctctttcaa 21961 gtgacttttg gaaatcaacc ttttttattt gggggggagg atggggaaaa gagctgagag 22021 tttatgctga aatggattta 
tagaatattt gtaaatctat ttttagtgtt tgttcgtttt 22081 tttaactgtt cattcctttg tgcagagtgt atatctctgc ctgggcaaga gtgtggaggt 22141 gccgaggtgt cttcattctc tcgcacattt ccacagcacc tgctaagttt gtatttaatg 22201 gtttttgttt ttgtttttgt ttgtttcttg aaaatgagag aagagccgga gagatgattt 22261 ttattaattt tttttttttt tttttttttt tactatttat agctttagat agggcctccc 22321 ttcccctctt ctttctttgt tctctttcat taaacccctt ccccagtttt tttttatact 22381 ttaaaccccg ctcctcatgg ccttggccct ttctgaagct gcttcctctt ataaaatagc 22441 ttttgccgaa acatagtttt tttttagcag atcccaaaat ataatgaagg ggatggtggg 22501 atatttgtgt ctgtgttctt ataatatatt attattcttc cttggttcta gaaaaataga 22561 taaatatatt tttttcagga aatagtgtgg tgtttccagt ttgatgttgc tgggtggttg 22621 agtgagtgaa ttttcatgtg gctgggtggg tttttgcctt tttctcttgc cctgttcctg 22681 gtgccttctg atggggctgg aatagttgag gtggatggtt ctaccctttc tgccttctgt 22741 ttgggaccca gctggtgttc tttggtttgc tttcttcagg ctctagggct gtgctatcca 22801 atacagtaac cacatgcggc tgtttaaagt taagccaatt aaaatcacat aagattaaaa 22861 attccttcct cagttgcact aaccacgttt ctagaggcgt cactgtatgt agttcatggc 22921 tactgtactg acagcgagag catgtccatc tgttggacag cactattcta gagaactaaa 22981 ctggcttaac gagtcacagc ctcagctgtg ctgggacgac ccttgtctcc ctgggtagga 23041 ggggggggaa tgggggaggg ctgatgaggc cccagctggg gcctgttgtc tgggaccctc 23101 cctctcctga gaggggaggc ctggtggctt agcctgggca ggtcgtgtct cctcctgacc 23161 ccagtggctg cggtgagggg aaccaccctc ccttgctgca ccagtggcca ttagctcccg 23221 tcaccactgc aacccagggt cccagctggc tgggtcctct tctgccccca gtgcccttcc 23281 ccttgggctg tgttggagtg agcacctcct ctgtaggcac ctctcacact gttgtctgtt 23341 actgattttt tttgataaaa agataataaa acctggtact ttctaaactg cttgcctctg 23401 tcattttcgt tcataacaag tcatcctttt tgggctctgt atccccttga tctcagtgga 23461 gcatgaagaa actccccgga ccaaatcccc tacgggtgcc agacatgccg ggggtgggca 23521 gagggtgggg gcagagaggt aagaaggcag gaaggggcct agagaagagg gaagacttca 23581 gaacatgcac cctgatggcc tatgcagcat atcaccccta cttcaaggtt ttgtttaggt 23641 ggcactgtgt ttaaatagca aacacaaaaa tctttgcgtc agttgccatc catagaaatc 23701 
aggaggtttc acataaaaat ccagatttct cacttttctt gggaaaaaga aataaaaaaa 23761 attggcaact gtcagcctgc atggcaacaa gagagctgct gagtggcagg cacccatcta 23821 ga // LOCUS HUMREGB 4251 bp DNA PRI 15-SEP-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 NID g190980 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) MEDLINE 90237042 COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. 
FEATURES Location/Qualifiers source 1..4251 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 1169..1174 prim_transcript 1196..4116 /note="reg mRNA and introns" intron 1224..1524 /note="reg intron A" CDS join(1571..1634,2270..2388,2696..2833,3549..3660, 3856..3923) /note="regenerating protein (reg)" /codon_start=1 /db_xref="PID:g190981" /translation="MAQTSSYFMLISCLMFLSQSQGQEAQTELPQARISCPEGTNAYR SYCYYFNEDRETWVDADLYCQNMNSGNLVSVLTQAEGAFVASLIKESGTDDFNVWIAL HDPKKNRRWHWSSGSLVSYKSWGIGAPSSVNPGYCVSLTSSTGFQKWKDVPCEDKFSF VCKFKN" exon <1571..1634 /note="regenerating protein (reg), (first expressed exon)" /number=2 intron 1635..2269 /note="reg intron B" exon 2270..2388 /note="regenerating protein" /number=3 intron 2389..2695 /note="reg intron C" exon 2696..2833 /note="regenerating protein" /number=4 intron 2834..3548 /note="reg intron D" exon 3549..3660 /note="regenerating protein" /number=5 intron 3661..3855 /note="reg intron E" exon 3856..>3923 /note="regenerating protein" BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt 
aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 
2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 
ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMRETBLAS 180388 bp DNA PRI 23-NOV-1994 DEFINITION Human retinoblastoma susceptibility gene exons 1-27, complete cds. ACCESSION L11910 NID g292420 KEYWORDS nuclear protein; recessive oncogene; retinoblastoma gene; retinoblastoma protein; retinoblastoma susceptibility; tumor supressor gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Friend,S.H., Bernards,R., Rogelj,S., Weinberg,R.A., Rapaport,J.M., Albert,D.M. and Dryja,T.P. TITLE A human DNA segment with properties of the gene that predisposes to retinoblastoma and osteosarcoma JOURNAL Nature 323 (6089), 643-646 (1986) MEDLINE 87039336 REFERENCE 2 (sites) AUTHORS Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.F., Bogenmann,E., Li,F.P. and Weinberg,R.A. TITLE Deletions of a DNA sequence in retinoblastomas and mesenchymal tumors: organization of the sequence and its encoded protein [published erratum appears in Proc Natl Acad Sci U S A 1988 Apr;85(7):2234] JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (24), 9059-9063 (1987) MEDLINE 88097427 REFERENCE 3 (sites) AUTHORS Lee,W.H., Bookstein,R., Hong,F., Young,L.J., Shew,J.Y. and Lee,E.Y. TITLE Human retinoblastoma susceptibility gene: cloning, identification, and sequence JOURNAL Science 235 (4794), 1394-1399 (1987) MEDLINE 87149066 REFERENCE 4 (sites) AUTHORS McGee,T.L., Yandell,D.W. and Dryja,T.P. TITLE Structure and partial genomic sequence of the human retinoblastoma susceptibility gene JOURNAL Gene 80 (1), 119-128 (1989) MEDLINE 90006771 REFERENCE 5 (sites) AUTHORS Sakai,T., Ohtani,N., McGee,T.L., Robbins,P.D. and Dryja,T.P. 
TITLE Oncogenic germ-line mutations in Sp1 and ATF sites in the human retinoblastoma gene JOURNAL Nature 353 (6339), 83-86 (1991) MEDLINE 91351319 REFERENCE 6 (bases 1 to 180388) AUTHORS Toguchida,J., McGee,T.L., Paterson,J.C., Eagle,J.R., Tucker,S., Yandell,D.W. and Dryja,T.P. TITLE Complete genomic sequence of the human retinoblastoma susceptibility gene JOURNAL Genomics 17 (3), 535-543 (1993) MEDLINE 94063891 FEATURES Location/Qualifiers source 1..180388 /organism="Homo sapiens" /db_xref="taxon:9606" protein_bind 1858..1863 /bound_moiety="SP1" /evidence=experimental protein_bind 1866..1872 /bound_moiety="ATF" /evidence=experimental exon <2060..2196 /number=1 CDS join(2060..2196,5424..5550,39446..39561,41926..42045, 44668..44706,45799..45866,56853..56963,59651..59793, 61730..61807,64330..64439,65364..65441,70242..70329, 73753..73869,76430..76486,76889..76920,77001..77077, 78083..78279,149998..150116,153208..153353,156693..156838, 160730..160834,161997..162110,162204..162367, 170372..170402,173707..173849,174361..174410, 177005..177078) /codon_start=1 /product="retinoblastoma susceptibility protein" /db_xref="PID:g292421" /translation="MPPKTPRKTAATAAAAAAEPPAPPPPPPPEEDPEQDSGPEDLPL VRLEFEETEEPDFTALCQKLKIPDHVRERAWLTWEKVSSVDGVLGGYIQKKKELWGIC IFIAAVDLDEMSFTFTELQKNIEISVHKFFNLLKEIDTSTKVDNAMSRLLKKYDVLFA LFSKLERTCELIYLTQPSSSISTEINSALVLKVSWITFLLAKGEVLQMEDDLVISFQL MLCVLDYFIKLSPPMLLKEPYKTAVIPINGSPRTPRRGQNRSARIAKQLENDTRIIEV LCKEHECNIDEVKNVYFKNFIPFMNSLGLVTSNGLPEVENLSKRYEEIYLKNKDLDAR LFLDHDKTLQTDSIDSFETQRTPRKSNLDEEVNVIPPHTPVRTVMNTIQQLMMILNSA SDQPSENLISYFNNCTVNPKESILKRVKDIGYIFKEKFAKAVGQGCVEIGSQRYKLGV RLYYRVMESMLKSEEERLSIQNFSKLLNDNIFHMSLLACALEVVMATYSRSTSQNLDS GTDLSFPWILNVLNLKAFDFYKVIESFIKAEGNLTREMIKHLERCEHRIMESLAWLSD SPLFDLIKQSKDREGPTDHLESACPLNLPLQNNHTAADMYLSPVRSPKKKGSTTRVNS TANAETQATSAFQTQKPLKSTSLSLFYKKVYRLAYLRLNTLCERLLSEHPELEHIIWT LFQHTLQNEYELMRDRHLDQIMMCSMYGICKVKNIDLKFKIIVTAYKDLPHAVQETFK RVLIKEEEYDSIIVFYNSVFMQRLKTNILQYASTRPPTLSPIPHIPRSPYKFPSSPLR 
IPGGNIYISPLKSPYKISEGLPTPTKMTPRSRILVSIGESFGTSEKFQKINQMVCNSD RVLKRSAEGSNPPKPLKKLRFDIEGSDEADGSKHLPGESKFQQKLAEMTSTRTRMQKQ KMNDSMDTSNKEEK" intron 2197..5423 /number=1 variation 2302 /standard_name="Restriction Fragment Length Polymorphism" /note="BamHI RFLP" exon 5424..5550 /number=2 intron 5551..39445 /number=2 satellite 14868..14914 /standard_name="microsatellite RBi2 (AFM058xd6)" exon 39446..39561 /number=3 intron 39562..41925 /number=3 exon 41926..42045 /number=4 intron 42046..44667 /number=4 satellite 43218..43257 /standard_name="microsatellite RBi4" exon 44668..44706 /number=5 intron 44707..45798 /number=5 exon 45799..45866 /number=6 intron 45867..56852 /number=6 exon 56853..56963 /number=7 intron 56964..59650 /number=7 exon 59651..59793 /number=8 intron 59794..61729 /number=8 exon 61730..61807 /number=9 intron 61808..64329 /number=9 exon 64330..64439 /number=10 intron 64440..65363 /number=10 exon 65364..65441 /number=11 intron 65442..70241 /number=11 exon 70242..70329 /number=12 intron 70330..73752 /number=12 exon 73753..73869 /number=13 intron 73870..76429 /number=13 exon 76430..76486 /number=14 intron 76487..76888 /number=14 exon 76889..76920 /number=15 intron 76921..77000 /number=15 exon 77001..77077 /number=16 intron 77078..78082 /number=16 exon 78083..78279 /number=17 intron 78280..149997 /number=17 variation 99426 /standard_name="Restriction Fragment Length Polymorphism" /note="XbaI RFLP" satellite 102921..102959 /standard_name="microsatellite RBi17" repeat_region 123912..125501 /standard_name="Variable Number Tandem Repeats (VNTR)" /rpt_type=tandem exon 149998..150116 /number=18 intron 150117..153207 /number=18 exon 153208..153353 /number=19 intron 153354..156692 /number=19 exon 156693..156838 /number=20 intron 156839..160729 /number=20 satellite 156895..156976 /standard_name="microsatellite RB1.20" exon 160730..160834 /number=21 intron 160835..161996 /number=21 exon 161997..162110 /number=22 intron 162111..162203 /number=22 exon 162204..162367 /number=23 
intron 162368..170371 /number=23 exon 170372..170402 /number=24 intron 170403..173706 /number=24 variation 171935 /standard_name="Restriction Fragment Length Polymorphism" /note="Tth111I RFLP" exon 173707..173849 /number=25 intron 173850..174360 /number=25 exon 174361..174410 /number=26 intron 174411..177004 /number=26 exon 177005..>177078 /number=27 polyA_signal 178872..178877 polyA_site 178993 BASE COUNT 55354 a 32559 c 34187 g 58288 t ORIGIN 1 ggatccagat tcttttgaaa ttcctcctgc accaatatca gcatttctac cttctctgta 61 ggttggcttg cctcacgtta caatggctgc agcaatcaga agtgtcacat cctcagtaat 121 acttaatatt attactatta tttaataata tttagaactg tgccatccct gtttcaattt 181 atcaggctcc cagcagacta ctccttatct ttcaaatgtc aaaactgcat cctgagctct 241 tgcctaaact aatctggggt gaggtgaatg gaagtagcac tttaattgta ttcattcttt 301 gtagctggac ctgggcctgg gctatctcct gacatttgcc cacaagaaag atttctgaag 361 ttaggtagga atggctgttg agtaggccag tgcttgccaa acctttacac atcctcacat 421 atgtcataat atgcagataa aaagataatc ccttatacaa cttgctggga taaactcagg 481 aggcttacag catgacctgc ttgaaggttc ttcctgcctt agaccttgct cagctgctcc 541 aggatgaggg gatttacatc acagcacaac tgtattttat tcacagcata aaccatctct 601 ttccttctca gttgacgagt tcagatgggc aataacagtg tctgccaaag agaaaaaaaa 661 atgtattcaa actagataat ctattggtac aaataccgag acacagaagt gataacagct 721 ttaagccaat gtttgatggt ggtagtccca gcaagctctt ttctgatgtc tttgtgcctt 781 tgcacatgct ccttctctgt cactgttttc ttcatcaaac ataatataat ggacaagtgg 841 aatcaaatag aattgagttc aaattctctg ctacccatcg gccctggtat tggacaaatt 901 aactcctctg agcctgtttc ctcatctgca acgtagacta gctaatacta cccattggaa 961 agcgttgttt cttagctaat gcatgcaagg cttaaaacct agatgacggg ttgataggtg 1021 cagcaaacct ccatggcata cgtatgccta tgtaacaaac ctacacgttc tgcacttgta 1081 tcccggaact taaagtaaaa aaaaaaaaaa aaaaaaaaag aaagaaagaa aaagaaaaaa 1141 aaggctgttt ctggggatta aataagacaa ttatgtaagg tggccagcac agttcctggt 1201 acatagtaaa tgtcaggcct gcctgacaga cttctattca gcagctactg ctcccctgaa 1261 aatcttcctc agacgtttcc acggtgcttc ccgttcttac accactacaa tcctttatta 1321 
cactactatc cgttcattcc ccacagctcc ctcccttcct ttccctaacc agtgatccca 1381 aaaggccagc aagtgtctaa cattttctat cttctaagtg actggtaaag ttccgcacct 1441 atcagcgctc caagtttgtt tttgttttgg ccgactttgc aaaacggatt gggcgggatg 1501 agaggtgggg ggcgccgcca aggagggaga gtggcgctcc cgccgagggt gcactagcca 1561 gatattccct gcggggcccg agagtcttcc ctatcagacc ccgggatagg gatgaggccc 1621 acagtcaccc accagactct ttgtatagcc ccgttaagtg caccccggcc tggagggggt 1681 ggttctgggt agaagcacgt ccgggccgcg ccggatgcct cctggaaggc gcctggaccc 1741 acgccaggtt tcccagttta attcctcatg acttagcgtc ccagcccgcg caccgaccag 1801 cgccccagtt ccccacagac gccggcgggc ccgggagcct cgcggacgtg acgccgcggg 1861 cggaagtgac gttttcccgc ggttggacgc ggcgctcagt tgccgggcgg gggagggcgc 1921 gtccggtttt tctcagggga cgttgaaatt atttttgtaa cgggagtcgg gagaggacgg 1981 ggcgtgcccc gacgtgcgcg cgcgtcgtcc tccccggcgc tcctccacag ctcgctggct 2041 cccgccgcgg aaaggcgtca tgccgcccaa aaccccccga aaaacggccg ccaccgccgc 2101 cgctgccgcc gcggaacccc cggcaccgcc gccgccgccc cctcctgagg aggacccaga 2161 gcaggacagc ggcccggagg acctgcctct cgtcaggtga gcgagcagag ccgccgtcgc 2221 ctcacgcggg aagggcgccc cgggtgtgcg tagggcgggc gcaaggcggc tcggcgggga 2281 cccgtcctcg ccaggggccg ggtcccggcg ggaggaggcg ccctccctgc cccccgccac 2341 ggcggagcgt ctgcagaatg gtgacaggat tctgggttct tgggcgaggg gtctcggctt 2401 caacttgaca ggtgtcgggc gggtggggct agggtcctga gcgaagtgac aggtgcagtt 2461 ccctcttgtg aggctcggag gcagagggtc gttgcgagcg tccatcagac gcaaaaaatg 2521 aaaaataaaa atacaaaaat ggtgtctgtg ggagagtttt tcaccggaga attggagtac 2581 tccggtggtc gtctgacttt ctgttttggt tcacgcgatg caacagttgg gaagtatttt 2641 cttccgggcg tgcactgcat ctgaagtcca tttgtgggag aggccgacca gaaagccttg 2701 gacaagaagc gcagggtcct gagtgtccat tgcccacagg atactcggct caggagcttt 2761 gcggcgtttc cttagaacaa taatgcatcg aggccttggg gactcaaagc catctgtagt 2821 gattgatgga gcgtaactct ttagaggaac tgaaacatgg gcaaaacttt catgagacat 2881 ttaccagaag tgcttgaaag tttctaaact tttttttttc ctgtttgatg aactcttctt 2941 gcgtgttagt cggcttcggc ttgtctcatt atttcttcca ttttgccttt tgactttgaa 3001 ccagcaagga 
tcttggtgtc ccctcttttt gcctttgttt ttggcacaaa attagtggtt 3061 ctgtgcgcaa atggaaattt tcgttttccc ttattaagtg gaatctaaat ttaagcaagt 3121 ccatacgaat gcactagatc ttgaagggaa gtatttattg tattacaaca tcttactttt 3181 cttgattttt ctactttatg gttaaatagc tatgattgaa agagtgtaat tgtcattatt 3241 gtcagcactg gttctacttt gagacaagtt cattgcagag ggaatgggac ttgtttctgt 3301 ttttcactat tttctctccc attcttgtct atcaccaaat cctttcaccc tcacccattt 3361 ctttccactc ggtatacact aacaattcac ggcagaaaag attgaagtgg gatttaggaa 3421 atggcccctg gaaggctatt aaaaatttat atatttaaat ggactgtctt ataggtcagt 3481 taaaaaccat attcgttaaa aaaccaaaat aacaacaaca aaaaattaaa aaccacgtct 3541 ggggcatgtt ctgggaaaag acatggcttt agtttctgat taaattctga tgtatccaat 3601 tcttgcaaat ttcccttggg aaaatgcagt aatggctact ctaaagaatt ccatgttatg 3661 cacacagctt tggaagcata ctctaatgta gggtaaccag gaatattgac gttgtggcca 3721 ctgcttggaa aaaagaggac tgtttctttc attttttaac tcattttata tattttaagt 3781 aatggaacta taaaaaattc tcttataagt caaaaccata caagatacag tctgttcttg 3841 attatccata cccacaaagg gtttcattaa catagacagt tgaactctat aaattaacca 3901 gcaagggtag aaccactgcc gtgcagtctg atactgagca tcttgcctgg agatggaatc 3961 aaggctcagt ccaactgcac ctccagtgac agattccatt cctgatttga gagtttagat 4021 cttatttctg ttgccttatc tctggcctaa gtaatgtagc tggaagggaa actgtcacta 4081 ggaacagcat accacattca ttgttgaaat aatcaaaggt taatcatgtt tcgagctggg 4141 tgtgatggcg cgtacctgta gtccatgcta ctcaggaggc tgaggcagga ggatcacatg 4201 agcttgggag ttggaggcca gcctgggcaa caacgcaaga ccccagtctt taaaaaaaca 4261 aaaaagctga ccaggcatgt tggctcacgc tgtaatgcca gcactttggg aggccaagat 4321 gggaagattg cttgaggcca ggagtttgag aggagcctgg gcaacaaaac aagaccctgt 4381 ctctaaaaca aattttaaaa aattagccag tggctgtggc acacctgtag gctcatctac 4441 tcaggaggct gaggtgggag gatccttgag cccaggagtt tgagactgca gtgagccatg 4501 atttcaccac tgcaccccag cctggatgac agagtgaaac cctatttcaa aacaagaaaa 4561 aaaaacaaaa actaattatg ttttgaagga ggaattggca gtctagaata gtggtcaagg 4621 gcatggatta tagagttaga ctgttgggct cacatagtgg tttacctctt accggctgag 4681 acctcaggta agttctttaa 
cttctccaat ttgacagaca tctgtaaatg tctactttta 4741 cagatgagaa tacttatcta aaagggctga tgtcgggatt cagtgaaacc atatgtttaa 4801 ggtactttct acaggatcat agtaaatact caaaaattat taattattct tcatactatt 4861 cttattagta gataacacat agaaattaga tgaatgaaaa tgaattagcc aacaagaatt 4921 tattaaatgc ttgtctttgt taaggggaaa ggacaaaaat aagcagtcgc atcatgtaag 4981 atgtgctagt aggtagagag atatatgaaa cattgataag agaagactac agtttaacaa 5041 agtcactggc gttgaattgt ataatattat ctatgggttt ttatcttttt gttgttatct 5101 ttatcctatt ttcccaaaca gctttagcta ttacatttac tttccttcac agaagtgttt 5161 tgctgctttg aagtatattt gacttaccat gcaagcaaat atttttcact gtgtggtatc 5221 cttattttgg aatgaccatg aaaaagataa tcatatgttt aaatttgaag tgtaatgttt 5281 ttctaagata aaataagatc ttaaagtatt taataatgtt ctttttcaca gtagtgttat 5341 gtgcaaacta ttgaaacaag tatgtactga atcaatttga tttataagta tatgccaatt 5401 atatgattat tttcatttgg taggcttgag tttgaagaaa cagaagaacc tgattttact 5461 gcattatgtc agaaattaaa gataccagat catgtcagag agagagcttg gttaacttgg 5521 gagaaagttt catctgtgga tggagtattg gtaaggattt tcttaaaacg ttttgaaatt 5581 tttttttctc attttaaaaa caacttcaaa tcactataca aaaattgaaa gatagaaaaa 5641 tataaagaca ataaaagcta ataataattc cattacccag aggaaattta cctctgctaa 5701 cattaaaaat gtttgaggcc gggcacggtg gttcatgcct gtaatcctac cactttggga 5761 ggctgaggca ggtggattgc ctgagctcag gagttcgaga ccagcctggg caacatggtg 5821 aaaccctgtc tctactaaaa tacaaaaaaa aaaaaaaaaa attagcaggg tgtggcgtgc 5881 gctgtagtcc cagctatttg ggaagctgag gcaggagaat tgcttgaacc caggaggtgg 5941 aggttacagt gagccgagat cgtgccactg cactccagcc tggcaacaga gcgagagtct 6001 gtctcaaaaa aaaaaaaagt ttgaaaaaaa tccctccata aatgcctccc accaaactat 6061 tttaaagcag gtcttagtta tattttattc actatgatac aaggactttt aaaaaaacac 6121 cgccatacct tataaaataa tttttatacc ttctaatatg caatcagtgt tcagatttcc 6181 tgactgactc agaattagtt ttgttagaat tcaggatctt gaaaatatat tgatatattc 6241 ctttcatttt ttctttgcgt aagtatagat gtatatgttt aataaaagga gcaaataatt 6301 actagagttt tatatgtact ggggcaatag tataagaatt ttacatagcc gggcatggtg 6361 gtgtgtgcct gtagtcccag ctacttggga 
ggctgaggtg ggacgatcac ttgaacctgg 6421 gaggtggagg ttacagtaag ccaagatcgc accactgcac tccagcctgg gcaacacagg 6481 aagaccctgt tgcaaaaaaa aataaataaa aataaggctg ggcgtggtgg ctcaaacctg 6541 taatcccagc actttgggag gcccaggcag gtgatcattg aggtcaggag ttctagacca 6601 gcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattag ccaggcatgg 6661 tagcagatgc ctgtaatctc agctacttgg gagggtgagg caggagaatt gcttgaacct 6721 gggaggcaga gtttgcagtg agccgtgatc atgccactgc actccagact aggcaacaga 6781 gaagactcca tctcaaaaaa aaaacaaaaa aaaccccaga actttacaaa atctacgaat 6841 aactcattta aaaatcattg agatatgaaa tataaaatct ttttttatgt gtaaaatgtg 6901 aaatgttcat ttaacagatg aagacactga gtagagtgag gttgagtaac ttgaccaagg 6961 tccctaggaa ataagatctc aaatataaat ttgagattat acatgtagtt atgtatagtt 7021 acatgtagtt acacatagtt ttgtattttc ctgttttttt ctcatatgag cattttctta 7081 ttaaagtttt tttgtaatat agtttctagt gatagtataa tattttagct agtggatgta 7141 ttttaatttg ttaactctta ctggatattt gggttttttc caactttttg atgctataaa 7201 ataatgtaat aaacatttgt attcgtaaga ataatgcctg gcacaaagtg agtccttgat 7261 aacttgttaa ataaattaat gaatgaatct ctgattattt tattagaata aattcttata 7321 agttgaatta ctaaatcaaa agctatgaac atttttatga ctcttgatga ttattgctag 7381 attacacttc agacagatac ttttaatgaa tttactttct caacatagta gtataagagt 7441 atatatctgt ttttactgat ctgattaaca cagcggaatg tgtttatgta tgtcccataa 7501 gaaaattgtt taaaaccatt aaataaaatc tctggcagta aaaaggagaa aacaagtgtc 7561 atgatttcca aagaattgaa catagtctca catttcttat gtaaaaagag aggaatactg 7621 aagtatcttg aatgtatgac ttgggtatta taatccccag gcatctaatg tatattttaa 7681 ctgacttggg tacctcatgt acaaacgtgt ttttgtgcat atacaattaa atgggatcag 7741 agctccttca cttttctctt cctaaaatac tagattttta gtgttcagaa attttggaag 7801 ggatagctga ttataaagag cattagtgaa tatacatact gtagttttca atttataagt 7861 gcattgtgtt ccaaatctgt ttgcaagttt gatatagtag ttaggtttcc agcctagttc 7921 acataactta tttttttttg tactaaacta taacaataat agttttaaca gtagaggctc 7981 ttttgagctt cagaattgaa tacatatata agcaaatgtt actgctttat tacccttttt 8041 catcctggct ttgtctctac cttccctggg aactggccca 
gtttgaatct tatttcctaa 8101 tggagctcat aatggggact aggatgtgtg aaattattta tggctaggta ctggaggaag 8161 gtaagaaagg aatttcatgt tatttttggg atttgaggtg tgcttttgaa gggtatgatg 8221 gaagatgatg tcctgttgtc ctgttgtaaa tcaacagact tttccttgaa atatgatttt 8281 aagttagagt tacgtttttg aggtttggcc tgagaattta accttcattt atataataat 8341 atctaaggga gaatgtgata cagtgtgttc taagaagcaa atgtaatgac agaataaatg 8401 tgtttccagt gcagctattc tagtggcagt tgtctctctt ctttgaaaca agaggaggaa 8461 tattgtcaaa aaggaaaccc tttttaacaa tagaaagtag gtgatgttta ttgggataga 8521 ggatgggtta agagcaagga ttaagagtca aacagatata ccatgtccga actccattat 8581 cttgaacatg ttatctacaa ttttcaagcc ttagttttct tacctgtaaa agggactgac 8641 agtaatttct ggatcttgga gttatgtgca gattaaataa attaatgcat ataaagctct 8701 tattagagtc aatgagtatg tatattaggt cttgcaaaca gaataatttt acttaagaaa 8761 taccccagaa aattttatca ttgatgtatt tttttttagg agaacttagg atatcagtta 8821 aaatggttac aattttaaat gcattgtagt gattttgatg tagtaatgtt tgttaaaatt 8881 cttattttta agtactttag catcagaaaa gttgtttatg catacatgaa agcatatatt 8941 aatcccttta aaggaaaagt ttgtggtatt ttgtctagaa aagtgttttc tttttttaaa 9001 ataaatattt agtaaagtgt aataataaag attagctttg ttcataaggt tagattgtat 9061 ttatgctgat tatagcatat gaataggact ttccattaag aaattggcta aggtgatgtt 9121 catattgtga cttaaggtga atcagctgtt agaattttgg tttactttct gcatcttgat 9181 tggtagtatg gtagaatgga aaatgccttg gacctcaagg caggaaactt tggttgtaat 9241 tatattgtag ctatatgatt ttaggcaaat tggttaattc tgtgagtctc agtttgctca 9301 cctgtaaggc agaaataatg cctgaatgtg aacctcatag gctttttgtt atacatacat 9361 atatatgtat atgaaatgta tgatatacat atctcaaaga gctttgggat atattgctct 9421 tgatagaacc acttacttat ttttgtgata tggtggttat attttaaata aatacagtca 9481 tgtgttgctt aacaatggga acacatctga gaaatgcatc attaggtgat ttcatcgttg 9541 tgtgaacatc atagcatgta cttacacaaa cctggatggt atagcctatg acacacctat 9601 gctgtgtggt atatagcatg ttgctcctag tctgcaaatg tgtacagcat gttactgtac 9661 tgaacattgt aggcaattgt aacacaatgg taagtatttg tatatctata catagaaaag 9721 gtacagtaga tcatatcctt actgattttc tttctatgtg ttttactaat 
tattgaggga 9781 gaaatatgaa gcctccaact ataattagaa tggccctttt aaattctgtt ttttattaat 9841 gtatttttta ttttatttta ttttgagatg gagtcttgct ctgtcgccca ggctggagtg 9901 cgtggcacga tctcggctca ctgcaacctc tgcctcccgg gtccaagcga ttctcctgcc 9961 tcagcctccc gagtagctgg gattataggc atgcgccacc acacccggct aatttttgta 10021 tttttagtag agacggggtt tcaccgtgtt agtcaggctg gtctcgaact cctgacctca 10081 tgatctgccc tcggcctccc aaagtgctgg cattacaggc gtgagccact gtgcctggcc 10141 tttattagtg tattttaaag ctttatcctt gggtacatac aagtttagaa tttttatgcc 10201 ttttaggtaa attggtacct ttatcattat gtaatgtgct tttttatcca aatttatatc 10261 ttttgttctg cagtgtccac atatcaattc cacttttctt gtgatcactc tttacaatgt 10321 gtatcgttta ccttcatttt gattttaacc tatttctttt ttatatttaa atggattttt 10381 gttgttttgt aatcagcatt cattaaaaat ttttagctga atacttttta cttttgttag 10441 ggttccctgt gtctcttcct ttcatgttgt gccacagatg agccattatt tatatctttt 10501 ctctcattgg aggttccttt ttttagggga aattaaataa tctgaaatat aaggaaagga 10561 tatgttgatt tcaagaaagg ttgtttttgt tccttatctg gcttgttctt gttgtgaggg 10621 tgggagcaat actctttgca gctttctgca ttctaggaag aagtggaatt ttacactacc 10681 atttgctttc tttacaggtt taccttggta ttcagtgtct gtccatagga ctaattaaaa 10741 ctataccaga ctccttctat tttaaaccta tccgttcagt ctcaagttat ctttttgcat 10801 tgcctctcac gtctgtccct tcatttcttt taccagcagt gcttggtttt tactgtttgc 10861 tttggtttta atttcctttt tattttttta atttcagaag gtggaagctt atgtcattga 10921 cttgagcatt ttttatgcat gttttagtgc ctctttctga ctagtctctt cttttctcca 10981 tccaatccat cctgtgcctg atttttagaa aagactggtg tacagtttgc cccttttgct 11041 ttaaaattta tttttcattg ttacaaccca ctctcagtag cctccattgt aacaccctcc 11101 catgctccca ccaccattat gaagaagtaa agtccaaact ttctcagtat ttgtagttct 11161 tcatggtctg tgctttaacc ttccttccag tcttattttt tatttccatt gttggccccc 11221 aatgcctgtg ttctcttccc ttccatatga atcttccata tgaatatctt ccatatgaat 11281 atgatagtat tcatattcat gcgatctccc ctttttttgg aatacccttc ttttttcttt 11341 tagcctaaat tctgttttta agatgcagtt aaagcttctc ctttatgcat atctcttata 11401 agatatatta tttgcagact tttcccccca 
ttctgtgagt tgccttttca ctttcttgat 11461 gatgtcattt gcagcgtagg attttaattt tgatgaagtc cagttttcta tttttttttt 11521 tggtggtggt tttttttttg gttgcttggg cttttgatgt tgtatctaag aaggccttga 11581 ctaacccaag gtcatgaaga cttattctgt gctttcttct aaaagtcata tggtttttag 11641 ttcttgtcag tctgggtttt tttttgtttt tttttttttt tttgagttaa tttttgtgta 11701 tgatgtgagg aagaggtcta actccagtct tttgaatgtg gatatccagt tgtctctgca 11761 ccacttatgg aaaagactat tctttcttca ttgaattgtt ttggtgccct tgatgaaaat 11821 cagttgacta taaaaaccag ggtttatttc tggagtctca gttctattcc attgctctgt 11881 atgtttattc tttttttttt tttttttttt tttgagacgg agtctcgctc tgtcgcccag 11941 gctggagtgc ggtggcacaa tctcggcttc ctgcaagctc cgcctcccgg gttcatgcca 12001 ttctcctgcc tcagcctccc aaagagctgg gactacaggt gcctgccacc acgcccggct 12061 aatttttctg tatttttagt agagacgggg tttcactgtg ttagccagga tggtctcgat 12121 ctcctgacct cgtgatcgcc cacctcggcc tcccaaagtt gctgggatta caggcgtgag 12181 ccaccgcgcc tggccgtatg tttattctta tgatagtacc atactgtttt gtagtatgtt 12241 ttatagcttt gtagtatgtt ttgaaatagg gaagtgtgag ttcgacaact ttggttttct 12301 ttttcaagat tacttttgtt actcttggta cttgaatttc catgtgaatt tttaggatca 12361 atttgtcaat ttctgtttta aaaaaaggca gctgggattt tgatagggat tatgttggat 12421 ctagatcaat ttgcggaata ttgcattctt aacaacatga agtcttcaga tgcatgcata 12481 tggaatgtct tttcgtttaa gtctttcttt caccagtgtt ttcagttttc agagtataaa 12541 tcttactaaa tatttattct tggtgatatt attataaatg gaattgtttt ctttcctttt 12601 tggcttattc attgtaggtg gatagaaata aaatctattt tttttggata agagcagcta 12661 aaatcagttg ggcacggagg ctcacgcctg taatcttagc actttgggag gccgaggtgg 12721 gtggatcact tgaggtcagg aatttgagac cagcctggcc aacatggtga aaccctgtct 12781 ctactaaaaa tacaaaaaat tagctgggtg tggtggcatg cgcctgtagt cccagctact 12841 ctggaggctg aggcaggaga atcgcttgaa cccgggaggc ggaggttgca gtgagccgaa 12901 gtcgtgccac tgcactccag cctgggcaaa aaaaaaaaag aaaaagcagc taaaatctat 12961 gtatttaaca aaaatctcta atacagtacc attttattta ttatagtcct catgttgtat 13021 attagatctc tggacttgtt catcctatgt atttgctact tcgtatcctt tgaactacat 13081 cttttcaaag 
gatgaaattg ggtaatgtga tgcctctagc tttggtcttt ttgtgtaggg 13141 ttgccttgac tattcgggct ctttttcagt tccatatgaa ttttaaaata gatttttttt 13201 ttctttttcc taattctgtg aataatgtca ttggtagtct gatagaaaca gcattgaacc 13261 tgtaaattgc tttgggcagt gtggccattt taatgatatt gattcttctt atccatgagc 13321 atggaatgtt tttccatttg ttcgtgtcat ctctgattta tttgagctgt gtttgtaatt 13381 cttgtagaga gctttcacct ccctagttag ctgtattcct agatatttta ttctttattg 13441 tggctattgt gagtgggttg cattcttgat ttggctctca gcttgggtat tggtacacag 13501 gaatgctagt gatttttgta cattgatttt atatcctgaa actttgctta tcagctgaag 13561 gaacttttga tcagagacta tgggattttc taggtgtaaa atcatcatct gcaaacagga 13621 atagtttgac ttcctttttt cttattccga tgctctttat ttccttctct tgtctgattg 13681 ctgtggctag gacttccaat accatgttga ataggagtgg tgagagagag catccttatt 13741 ttgttctgcc tttcaagggg aatgctttca acttttgcct cttaagtatg atgttggctg 13801 tggatttgtc atagatggct cttattattt tgaagtatgt tccttcaatg cctagtttgt 13861 tgagctatat accacacttt ctttatccac taatcggctg atgggcactt cagttgattc 13921 catacttttg cagttgtgaa ttgtgctgca ataaacaagt gcaggtttct ttttgatatt 13981 atgacttctt ttattttggg tagataccca gtagtgggat tgctggatcg aatggttgat 14041 ctgcttctag ttctttgaga aatcctcaca ctgttttcca tagaggttgt gctaatttac 14101 atttccacca acagtgtata agcattccct tttcactgca tcctcgccaa catctattgt 14161 ttttttactt tattttccat ataaaaaggg gtttatttag atgttagaat catctagtga 14221 ttcagaaaga ttggacatga atccacccag gcccagggct gagctgcaga gcagccgctg 14281 acatccgcgg ggatttcaca tgcaacaggg agctctcacg tgcttgctgg ccattgtgct 14341 ttaaatcgtt ttttccagcg gacagctgtg actgcagttt tcaaatgtgc tctggcctcc 14401 aaggaccagg ggggcggtgg ggggcggggg tgtcctctgt ataaaaagtg ctgtcctgcg 14461 agcttcccga cggacacttt ggggcatctg agcgattccc ggggagggca gctctgcctt 14521 cctcaggcta ccactgcagc caccgacgga cattctcacc agcaatcaac caaaaaacat 14581 cagagtctct gtgaacttgg ctgcgatgat aagaaccaaa tgttttctaa tctaagactt 14641 gtatgcagaa cacaggaaat tgcaattagg aacaccctac aaaattgaac aacagtattt 14701 tctaaaaata ttttttacat cactttaaat aatttacaga aagctacaga aatcatattt 
14761 accaggacac atctgttaaa taaaagcatt gtttcatgtt ggtgtacgtc tatacagggc 14821 tatgtataac cgactcctgt ttctcctccc tgcaaccaca gaaccatcac acacacacac 14881 acacacacac acacacacac acacacacac acacggatac acgcacagat acgctccttt 14941 ccacaaatgc acgcaaaccg ggacgcaaac ccacaactcg agggcttaga ccttcactgc 15001 tgagctgccg ccactgtcct ccgtgcgcgg agtgtcgctg aagtcactgg cgtcactaac 15061 caaggcgtgc ccgcccaagg tgccctggtc ctggtcgtcc ctatcgatga cccttcgtca 15121 atacctcagg tctaaaatgc tttcctccga gcccgagttc ctcctgtcgg gcaaactccg 15181 ccctgtctgt gccttgctgc ttggccaggg cccgccgtcc ttcttcgccg gcaagtgtgg 15241 gcttccaaag acagggctgg gcccggcctg ggcctccctg aaggcagaga aggacgggcc 15301 ctgtggaggc agccccatgt cgggctgaag aggctggcct gcctgccccc tgggagacac 15361 cctttccaaa atggaagagc tgggtggaca gagaggggag aagcccacaa ggggtaggct 15421 gggcaggccc aggcgggccc tcggacccct tgtctttccc gctcccgacg ccccgcctcc 15481 actccgcatc tctacttttc gagcgcagcg cgcacgggtt ccggtgggtg aagtcgctcg 15541 agggcagggt gtggtcagcc tgggctccag agtccctttc agcttcctgg ctaggggtca 15601 ggaagccccg gctcccgcag cccatgtgaa aggtcccccc agcacctccg gcctcggtgc 15661 ccgctgtggc actgctgggc agctgaataa aagcacttct ggcagcatgc tcggcccctc 15721 gtgagcgctg gtctttctga atgtaaacat ttctcctgct cgctgagagc cccttggcgg 15781 agacactgcc actggtgctc ctgggcggtg ccggatcccc ttgggcggga gcagcgggga 15841 gccccttcct gccctggctg aacaggtcgg ctgtgccctg tgctccagcg ctctctgcac 15901 ctcatggccc ttctgccggc tgtgggaccc ccctggccca ctgactccgc gtgcacacgc 15961 caggcggtgg ggcccgctcc ttcctgcaca gcaactctgg cctgaatgag ccccatttcc 16021 cgagagcagc agggccctct gcctgaactt ctgaactgct cagactcctt ggccttttcc 16081 gcgaaaaact tactaatcta cagttcgatg ctctcgtcac tgtccactga actcctgtcg 16141 tcagacaggg agccggggct gggagctggg ccctggcatt ccctggggaa atggattccc 16201 ttcagaggcg gaggcagagg cgcatcccct cactcggaag gccggggcgt gtcctggctc 16261 ctgacgccct cctgcttcgt cttctctgcc atactgccac agacactggg aggtttgttc 16321 cctggaccct cgcagctgtc tctcttggac ttggagagtc agctattaag caccggcggg 16381 cttctgtctt tccagtctct cgggagcccg cccagctgct 
ccaggaagtg catctgggca 16441 gtgctgaacc tgaccttcct gcacgcagcc ctgggttccc tgcacctctt cttgagcttt 16501 cgcttggacc ttaagtcctt gatggccgtg tccaggtcct cgtcactgtc cagggagctg 16561 ctcttgtctt cggagctctc cttctcgtcc aggtgcctca cctcgtctgt cttaccctgg 16621 ccctgcgtca tgcgagtgtc gccgggcccc ttggctgcgc cctcccctcc cgggacctcg 16681 ctggctttgc cctggacggg caggtcccgc ccctcatggc cgggcacggc tcacgcctgg 16741 ctgtggtcgg catcctggcc accatctttc accacctccc gcgttttctt gggcgtggac 16801 ggcctcatgg catggcgccg cctctacgtt tccttttgca gctcagcagc gggtccggtg 16861 ttttagagag gggggccttg gggccactgg tctggctgtt ggggccaggc ggtgaaagtg 16921 ggccctgggc agcctgcgag cagctctcac ctctggccag caaactcccc gactatgcct 16981 ggagggtcaa aacgtctgga tttcctgatc gatgctgtcg tcgctgtcca cggagctact 17041 gtcgccgtca gagcgggaag gcacgttcag ggagtagaag cgtgggcttg cagaaaggga 17101 cctgttgctg ccttacatgg gggccggcag ggtagtcttg gaaatgccca agattgcttc 17161 cgcgcgcgtc agttcagcgg acgtgtctgc ctggcacgag gaccgttcta caaactcgtt 17221 cctggaagcc gggctcgctg gaggcggagc tttggtttcc ttcgggagct tgtggggaat 17281 ggtcagcgtc taggcacccc gggcaagggt ctgtggcctt ggtggcactg gcttcctcta 17341 gctgggtgtt ttcctgtggg tctcgcgcaa ggcacttttt tgtggcgctg cttgtgctgt 17401 gtgcggggtc aggcgtcctc tctcctcccg gcgctgggcc ctctggggca ggtccccgtt 17461 ggcctccttg cgtgtttgcc gcagctagta cacctggatg gcctcctcag tgccgtcgtt 17521 gctgctggag tctgacgcct cgggcgcctg cgccgcactt gtgacttgct ttccccttct 17581 cagggcgcca cgctcctctt gaccccgctt ttattctgtg gtgcttctga agggctgcag 17641 ggggctgctg gcttcgggct gcccttgttc tcctgcttgg tcgtgaccct ggacttgagg 17701 cttctgggct gcacgttcgt ctttgctaac cggggaggtt tgcgaaaggc gaactcttta 17761 tgggcgccct tcagaccctg ccgatgcgcc acctttgcgg ctagctcttc gtgggatttc 17821 aagagtgact tggtcgaaat tttcgtttgg gtctggtttt ttatctattg accccatcac 17881 atttttgggt cgcatgctat ctcttctcat tcagaagctg ttctatttcc gccttaatgc 17941 tctgctcgaa ggagtcgttg ctgctcgctc tgaccgggaa ggcagaaccc tagtcctcac 18001 tggatctcac ctggctgcca gggccaccac ctgagccagg tacaagtttt ggggggacac 18061 acggaagtcg gggcactgcc 
gtgagatagt tctggcttac atctactgcc actgcctgca 18121 gccctggaag gctgggctgc gcctggctgg ccccagcccc gggctgtgcg gctccactct 18181 ttgcctttag gtactcccgg gatggcttcc gcaatgtccc ggtccacgga atcatcattg 18241 aatctagcac caacggccaa agtctgcagt ttcctcctcc tcatggggtc aaagtcagca 18301 acaagaccac aggcaggcaa tgcgggctgc tccttgtgca cggtgggctt ggcagctggc 18361 ttggcgtctg ccctgtggcc cctctgtgca gcgcgctcat cggtggtgcc ccgagcaacc 18421 ccgctgcagc gtgctgatga gcatctgcac ccgggcgctc accgacacgc tctccacccc 18481 ctcgtcagcc tccgagaagc acccggggaa cctaaagctc cctggtgggc caaaggcctc 18541 ccatttggac tggggagcaa cccctggagt agcgttcctg agaaacattc tagcagaagg 18601 ggagggacgc gcagagggac gacgctttat ttctctgcca gcttcatata ctcaaaaggt 18661 tggcagcggg gtgcagtggc tcaagcccgt aatcacagca ctttgagagg ccgaggcggg 18721 cggatcgcga ggtcaggagt ttgagacccg cctggccaac atactgaaac cccgtctcta 18781 ctaataatac aaaaattagc cgggcgtggt ggtgtgagcc tgtagtccca gctactcgtg 18841 aggcagaggc cagggaatcg cttgaagccg gaaggtggag gttgcagtga gctgagatcg 18901 cgccactgct ctccagcctg ggtgacagag cagactccgt ctcaaaaaag aaaaaaaaaa 18961 aaaagagtgg caagtggtgc ccatgagata aagtctgcaa atgcttcttt gcagtttaaa 19021 aaggacgacg actataaacc caggatcagg acgacgttac gcatagattt attcttgacc 19081 tgaccaacga gctcttgaaa ttctctttgc aactaggtcg actgtgaaac tgcagccagt 19141 gtccccaaac gccccaagga taaggccttt atcactccga cgcatcctct ctctgctttt 19201 taaatagact tttgactcgg ccaggccccc tgccacccac cgcgctgacc ctgcctgcgc 19261 ttgcgtcccg cgtcccgcat ccggggggag gcggcgggcc cgggtctctg gggctggccc 19321 atttacttag ttttgttttt gttgcatttg cttttggtga cactcccttt ttgatattcc 19381 acatataaat gaggcaataa ttttctttct atgtctgctt atttcactta gcataatggt 19441 ctccctgtcc acctcggctc ccgcgcgggg aggaggggca agggccctcc gcccgggccc 19501 cgcctccccg cgcgccgctg ccaccgcccc gcaccttttt ttgacttttt aataatgctc 19561 attctgactg tgataaggtg gtatctcatt atggttttaa ttggtatttc ccttgtgatt 19621 ggtgatgttg agtatttttc atgtttgttg gccatttgtg tatcttcttt tgaaaaatgt 19681 ccgtttggct gggcgcggtg gctcacgtct gtaatcctag cactttcgga agccaaggcg 19741 
ggtggatcac ctgaggtcag gagttcgaga ccagcctggc aaacatggtg aaactccgtg 19801 tctactaaaa atacaaaaat tagctgggcg tggtggcatg cgcctgtaat cccagctact 19861 ctcgagactg aggcaggaga atgacatgaa cccaggaggc ggaggttgca gtgagccaag 19921 gttgcgccat tgcactccag cctggacgac aggagtgaaa ctgcattgca aaaacacaac 19981 aaaaaaaatc cattcatcta gtttgtccac tttttaacag gattatttgt tttcttcttg 20041 ctgattttct tgagtttctt gtagattctg gatcctttgt tggatgtata gtttgtaaat 20101 attttctccc attctgaggg ttgtctcttt tctctgttga ttatttattt tgctatgcag 20161 aaacttttta gtttaagtcc catttactta gttttgtttt tgttgcattt gtttttggtg 20221 acacccccta ttttttagat tccacatata aatgagatca tgcaataatt ttctttctgt 20281 gtctgcttat ttcacttagc ataatggcct ccatgtccac ctgtgttgtg gcaagtgaca 20341 ggatcacctt tctttttatg gctgaataat attccattgt ttatatatac tacattttgt 20401 ttatccactt ctctgttgac agacacatag gttgtttcca tattacttag ctgtatatag 20461 aaatacaggt ggtttttgta tgttgatttt gtactctgaa aacttggtga actaatttat 20521 tagcttcaat agtttcttag tgaattcctt aggattttct agatgtaaaa taatgtcatt 20581 tgcaaataga gatagtttta cttcttcctt aaaatctgga taccttttat ttctttttct 20641 tgcctaattg ctttgtctaa aatctccaat agaatgttga atagaatgat gagaaggaca 20701 ttgttgtctc ttcctggatc ttagtgggag accatccagt ttttcaccat taagtagatg 20761 ttagctgtgg ggtttttgta gatgcctttc atcagttgaa gaatttttta tttctagggt 20821 gttgagtgtt ttgttatgaa agggcgttgg attttgtcaa atgatttttc tgcttctatt 20881 gatataataa tgtagttctt gttctttatt gaaatggtgt agtatattag ttgattttca 20941 ggtgttaaac caacattgca tctctggaat aagtcccact tggtcacaat gtataatcct 21001 tttgatatgt tgctagattt ggtttgctaa tttttttttt tgaggatttc tatacgtgta 21061 tttataagag attttggtct ttagtttttt ccttgtgcgt cttgtctgat ttagtatcaa 21121 gataatactg gccttgcaca gaatggactg ggaagtgttc cctcctcttc tgttttgatg 21181 gggcagtttg tgaagaatag atattaattc ttctttaaat atttggtaga attcatcagt 21241 gatgccatct gggcctggac ttttctctgt ggaaagtttc aaaattacta attcagtcat 21301 tttacttgtt ataggtttat tcagtttgtc agtttctttc tgaatcagtt tttatacttt 21361 atatttttct aggaatttgt ctacctcatc taagttacct agttcattgg 
caaataattg 21421 ttcatgcatt gccattataa tcctttttat ttctgtaagg ttagtagtaa ttttttttta 21481 tttcattccc gattttggtt aatttgagcc ttctctcttt tatcttggtc agtctaacta 21541 gaagtttctc aatgttggtg atttaaagtt tttttttctg gtttcattga tttttcttta 21601 tttctctatt ctctaaaaaa ttttgatcaa acgatttttc cctgtttttt cactacccaa 21661 tgtttatact taatgtgtgt attggataac tctattaaag ttcatcatca tgtactttca 21721 tcatgtaata gtcagatata tcttatatat aattgtactg cctactttta gaattgcttg 21781 ttctttaagg ctttgacttc ataaaagttt tcactcttgg ccaatattat ttactacagc 21841 gtgttttctt cttcatgagg aaccatggaa gaaatctttg gtccaagacc cattaagctt 21901 tcctggagaa gctaactaat taatttaatt taaccttata gatactatga atcttataat 21961 tgtttgttac tgttttaact gccagctttc atgatagcat gcattttggg ggatactaac 22021 atttttcttg tcattttttt cacctttcta tattttactt tcacaattat aagtaaataa 22081 agacttgttt taacataaat cacactttat agtaataaag agcggatctc tgttgctctc 22141 cttttttact ttaaaatttt ttatttttaa ttattatgga gacataatag gtgtacatat 22201 ttatgggata catgtgatgt tttgatacaa gcatataatg tgtaatgacc aaatcatggt 22261 agttggagta tccatcatct taagctttta tcatttattt gtgttaggag cattccaatt 22321 ctaatctttt agttattttg aaatatacaa taaattattg ttaactatag tcactgtatt 22381 gtgactatat agtgtttcca tagaacacta gatcttgttc cttctatcta actgtctttt 22441 tgtacccatt aaccaactcc tcttttatcc tcccatcccc actacccttc acagcttctg 22501 gtaatcatca ttctactctg tctccatgag ttcaattttt tttttttttt ttaagctctt 22561 gcttgtaagt gagaacatgt acttgtcctt ctgtgcctgg cttatttcac ttaacatgat 22621 gttctctggt tccaaatgac aggatttcat tctattttat ggctgaataa cattccattg 22681 tttatatgta ccacattttc tttatccatg catccattaa tgggcattca ggttaattcc 22741 atgtcttggc tgttgggaaa aatgcagcat tcaacatggg aatgcagata tctctttgat 22801 atactgtttt tatttctttt ggatatatac ccagtggtgg gattgctgta tcatatggta 22861 gttttatttt tatttttttg aggaacctct aaactattct tttatagtgg ctctactaat 22921 ttacattgcc acaagcagtt gtgcttttgt tttttttttg tttgtttttg tttttgtttt 22981 tttttactgt tttaacataa ttatttggag cttcattcat gatggcatat gcatcaacaa 23041 tttatttatt ctcattactg agtagtatct 
cactgcatgg gatttgtttt tttttttaag 23101 ttcaggggta catgtgcagt tttgttacat aggtaaactt atgtcatggg ggtttgttgt 23161 acagattatt tcattgccca ggaatgaagt ccaataccta ctagttgttt gtcctaatcc 23221 gcttcctctc ccaccctcca ccctctgata ggtcccagtg tgtcttgttc ccctgtgtgt 23281 gtccatgtgt tctcattatc tagctcctac ttataagtga gaacatgcag catttggttt 23341 tctgttcctg cgttagtttg ctaaggataa tggcctccag ctctatccat gtccctgaaa 23401 aacattgctc tttttctatt cctagttata tccatagaaa tgaccattga tagtttttgc 23461 ttactctgtc cctccagttt ttccttcctc cagcatatac acaaggattt taattttacc 23521 caattgaagt cttacttatc tacacctttt aacaagtcat catcagtaat tttggaatag 23581 gcattagatt cacatggttc aagattcaaa cgaaagaaaa gtataaacag taaaaagtct 23641 cctcattcct ttccctaagc cacccacttc tcttcagagg caactaataa aagtttctag 23701 ttttgcatta ttctacacct tgcttttttc atttagcaat atatcttgaa gagcacatct 23761 ccctctgtat ttagcttaat ccttttagtg gatttttttt ttattttttg ggtatatcat 23821 aatttattta tgaatttcat tgttgagaga catttagttt gtttttgaat gtgtaatact 23881 acaaaccgtt gcggtgaatt atccttatat atacattttc atgcacaaat gagagtactt 23941 ttatagtgta gattcctaga agtagaactg ctaggtcaga aggtattcac atttaaaaat 24001 tttaaatttg ctcatagaat gaagtctttg ctcatagaat agtctttgct caaatctcac 24061 tttaccaaag aggcctatcg tgaccactct gcttaatact gtaacctgcc cctacttctc 24121 cctgaacttc tctatccatt gcactctatt ccttttctag catagtatat ggtttaccta 24181 tttatactgc tttttggtta ctgtcttttt ccctctgctc caagaaacag gaatgtttgt 24241 tttgttcact aatatatatt tctagcagtt agaatagtgt ttgatgctta gtaggcattt 24301 agtaaatatt tattgaataa atcgaataca taagcaaaca gtatccaaaa ttatcctccg 24361 taaaaatgcc taatttcttt tttttcctct gtatctcaat gaataaaaaa aaatactgac 24421 tttctttacc cacttatttt ggattttttg gtcttgtttt ttgtttgttt gtttgttttt 24481 gagtttgaca aatttattgg tattttatta aagacattca tctcagttgt ttctctctcc 24541 cagcttgacc ttaggttaat atttcatttg ggtcaagaaa agaatatcag aagaggtatg 24601 ttattttaac aaacaggaaa atggacaaaa atggatagtt tgcctacatt aaagtgagtt 24661 aaattcaagt atcttgatat aattaaatca tataagaact aagagttcta tatacatttc 24721 cattgtttta 
cctggggctt attctaaact taaatactag ttaacaaaga gttaggaata 24781 tacacaggtt gcttctctga gttattaccg actaaaagag ctcagcgagc agttaccacc 24841 aataaaacgg tctgaagctg cctccaaata caatattgta gtaggagttt tcaaggaaat 24901 ttttatactg tatttttttt tgtctgtgac tgtgctttta aagatgtgtt taactgtcac 24961 attaaaaaaa aagttcgtat acaatacaaa aatacaaaaa gaaaacaccc tacaatacac 25021 aaattcacaa agtatgtgtg gtgagattcc aaaaaatgtt tgaagatgca ttttctttcc 25081 ttctatttca gtatctaaaa tgtgcttttt gagaggccat tggtcaatat gtatacattt 25141 aaaataaacc atataatttt actagtaaga aagccagtag ttaagttcaa ttcaaattgg 25201 atttcacaag ttagtaattt aaatccttgg acaaagttac agaaagtgca tcttcttgtt 25261 ttccatcttc atacaatgtt aaattttttt tttttggtgt ttataccttt taaaaaataa 25321 aagcagccaa atacttaagc aatatgtaca tttaaatttt tggtggtggt ttgtatttta 25381 aaagaaacag ttttcttatg attcgcttca agttctggtg gaaatgctta caggaactag 25441 ctaataaaca aaaaacaaga gaagcacatt caaaatactg atttactttg gtagcaaatg 25501 gtttttcttt gaagattaat gaagatagac aagacccatt aaggtgaagt gggctatttc 25561 aaatattcaa cagtttacat ataaaaaagt tattttaaga gctaagcgtc tgtatccact 25621 gatagcaatg caacacctag tttatgatga cttgaaagaa aacaaatgcc catgaagaaa 25681 aaagctgtat tttaatttta tctaaatgta ctttcagtca atagttcagt aacattttcc 25741 tcccaataca atactccctc tctctataag gctattcctg ggagccagac aagtttaggt 25801 aataagggag ttaagagagt aactgcttgc agttttaaat agacaattaa ctttttgctt 25861 ccctcttaat gtgctaatag ttgtgtctaa aaaaataggc aattcttaaa aaggtaccat 25921 ttctgatttc tttattatct gtaaactttg gaaactaatc acacaaaact accaaattag 25981 caaatgtctt gaaatctgta tacaaaacat aaattacctc taatttcaaa ctctcattta 26041 ttggtgtacc actcaatctt ttaaaaagat gaaaaagaaa aaaaaagtgg ctccaaaggt 26101 agtcttatac cattcttaaa aaaaggaaac tgttcttttt aactttatac ccacccacac 26161 cccaatttca aaacatcatt taattgtctt ggtcatagac atttcgaaga tgagatttta 26221 tattccactc ccatagcttc tggttatcag aaaacccatg ctttccttta ttgaaggagt 26281 ttggtccagc tgatgttggt gtatcccttg caatattctt catcctcatc tttgcttctg 26341 gaggcctttc ctctgattca ctatcttcat cttcattaaa agctgctgct actgaaagag 
26401 tttttggagc aagagttgga acagtttctt taggcttact tgatccaagt ttgatggata 26461 tgggtgatgc tttctttgtc gtctgactac ctatggcaaa tccaaagttg gagatctttg 26521 caggctttgt tggaggtctg cagcttcttc ttcagctgat cgcttctcag cgctgcggct 26581 ggaactttcc tctccattac tccagcagct ccagctcatt gcgacttttc cggcttctcc 26641 tctcccgcct tcccatcagc catttccccc gccgcattgt ttttttaaga tttcagaaat 26701 ctttctaata caagttggga tataaataat tagatataga aaatagctac aaggtatttc 26761 ttttaacttt ctccaccccc agctcccagc agtcaagagt caagccctaa gtcttgtcct 26821 ttctgtcaga aatagataca agaaggatac agcaaacact gctataacac ctctctcata 26881 ggattcaaat cagtgtggag attattatag gctccagatg agttaggata tctttaagtc 26941 tcaatttaat aaaactaatt gttttcctgt gctttaaata tctctccatt ctatataaat 27001 ataatataga tctaacagat gtagtcttga aatagatggc tataatgtca gtgaagtatt 27061 ttttccttat ttatatacat tataactttc cttatttatg tacattataa ttttaaaggc 27121 ctgatagtca taattattga tgttcagatt ggtagttatt tctgtctctt atgtattcaa 27181 ctataaacat ttaatcaagt ttatgagacc tagatattta gcagagtttt ggagtaaatt 27241 attttattaa acaaagaata taaaataaac atttctactt atgttcctgt tttaatttac 27301 ctcatttata tgcatcttga tattgcattt tcatattcta aaaactattt ttaaagacag 27361 gttgattcag aattacagtg tacttgaaaa tcatcaggta cctattatta gttttcaccc 27421 acattgaatc tgtgtttcag aaactgtcta ccaaactttg tttattaagt aattattttc 27481 cctagtttta ttcaagatat acaaagctgc caactagtgc ttttcatcag catcaggtaa 27541 attgtgttac tatactgcgt tgaattttag caaaggtaaa gtatttgcta ttttagttat 27601 tgtagcttta ttgaaagcaa acatataggt tttttaaagt attggaaaac tcgccagatt 27661 atcattatat gacttggaac cctagaaact cccagaccac gagacttaat ggttaatgat 27721 ctctttcagt tctctttcaa cttggaattc tcttattcca atgccaactc attgctcttt 27781 tggaaagacc cattctttcc tctattgtct tggtacctcc tcaggacata ttccccaaag 27841 tggaaatgtc agtttaagat aattttcagc attgtgtaga ataagaatgg ttagacagct 27901 gatgtcagca agatggtgga atagtagcgc tgtccttcag aaacaccaat tttgacaact 27961 accatagaaa gaatgccttt atatttgagc atacatggaa cattctccag gatcaatcat 28021 atgtcaggcc ataaaacaag tattaatata tttaagaaga 
ttgagataat atcaagcatc 28081 ttttctcacc ataatggcat aaaactagca agcaaaaata acaggaaaac tgggaaatac 28141 tcaaatatgt gaaaattaaa catgttcctg aacaaccaat agtcaaataa gaaatcaaaa 28201 tggaaattaa aaaaaaaaat gtgatacaaa tgaacatgga aacagcacat caagatttgt 28261 ggaatgcagc aaaacagttc caataggaga tttcatagtg ataaacacca acatgaagga 28321 aacagtagga ccccagataa gtaacctaac tttacatctc tcagaactag gtagaaaaag 28381 aacaaataag cccaaagtaa tagaaggagg gggataagaa agatcacagc agaaataaat 28441 gaaagactca aaaaacaata gaaaagagca acaaaatgaa gagttttttt gaaaagataa 28501 acaaaattga taaactttta gctagaataa gaaaaaaaga agattcaaat aaataaaatc 28561 aggaatgaaa gaggagacat tatgactgat atcacagaaa tacaaagaat cctgagaata 28621 ctttgaacaa attatatact aacaaattgg atgacctgaa agagacagat tgtttgccac 28681 atactacctg ctaagactga atagcagaag aaatagaaaa tctgaatgac caataacaaa 28741 taagggactg aatcagtaat caaaaatcta ccaatagaga aaagcctcag gacctgatgg 28801 cttcactggt gaattctacc agacatttga agaataccaa tccttctcaa acgttttcaa 28861 aaatttgaat aggaggaaac acttccaaac tcatttaaca aggctagcat taccttgata 28921 ccaaagtcag gcaagggtat cacaagaaaa gaagattata ggtcaatatc cctgatgaac 28981 atatgtgcaa aaattgtcaa caaaatatta gcaacagcac attaaaggga tcatacatca 29041 tgatgaaatt ggatttatct ctgggattca aggatggttc aacatatgca aatctataaa 29101 tataccactt caacagaatg aaggacaaat tatgtgatca tcttaataga tccagaaaaa 29161 gtgtttgaca aaacttaaca ttctttcatg atgaaaactc tcaacaagtt aggtataaaa 29221 ggaatgtacc tcaacacaac aaaggttata cgtgataagc ccacagctag cattatactc 29281 agtggggaaa cctaaaatct tttcctttaa gatcaggatc aagacaagga tcatttgtca 29341 cttgcattca acatagtact ggaaatctag ccagagatat taggcaagaa aaagaaataa 29401 aagacatttg ctacagtttg aatgttttcc ccctctaaaa ctcatgttga aacttaattc 29461 ccaatatggt agtattaaag atggggcctt taagaggtga ttgagtcctg aggcctctgc 29521 ctttatgaat ggatcaatct attcatggat taattggttt tcctgagagt ggcatcagtg 29581 gctttatccc aagctaacat cttagcatgc tcagttctct caccatgcga taccctgcac 29641 cacctcagga ctctgaggac tcacagctat tattcaaaaa gtcagaagat aacagatgtt 29701 agagaggatg tggagaaaag 
ggaactattg ttcaatgtta gtgggaatgt aaattggtat 29761 gtcttttatg gaaaacagta tggaagctcc tcaaaaaatt aaaactaaaa ctaccacatg 29821 attcagcaat cctacctttt ggtagatatc cacaggaaat gaagtcagta tgttgaagag 29881 atacatacac tcccatgttc attgcagcat tattcacagt cgcaaagata tggaaacaac 29941 ctaagtgtct gtcattagat gtatagataa aatgtggtac agattgtttt aaactaatga 30001 ttggttatat cctgataaac ccattgtaag tcaaaaatat cttaagttaa aattgcatta 30061 agatcatgac aaacccagtt gaaaaatcgc aaattgaacc attgtaagtc aaggactatc 30121 tgttatatat attggaattt tattcagcct taaaaaaggt aggaaatcct gccatttgca 30181 agaacacgga tgaacccgaa ggacaatatg ttgtattagt cagacacaga aagacaaata 30241 ctgcatgatc tcatttatat gtaaaatctt taaaaagtca aatacataca aacagtaaaa 30301 aggtggttag caggggtggg gagcctgggg agaatgtgga gatgtttgtc aaagggtaca 30361 aactttcagt tataaaatga ataagggtta gatgtggcag ctcacgccta taatcccagc 30421 acgtttggag accaagatag gaggattgct ttaggccaga gttagagacc aacttgggca 30481 acatggggag acccccatct ctacgaaaaa ttaaaaaaaa ttttacaggc ggtgacatgt 30541 acttgtagtc ccagctactt gggactctga ggtaggagga gtgcttgtgc acccaggagg 30601 ttgaggctgc agtgagctat tattgtgcca cagcaatcca gtctgggtga cagagtgaga 30661 ccctgtcatg agacctgtct caaaaacaaa aacaaaaaaa gtagtagttc tggacactta 30721 atttacagca tggttgatta tagttaataa taatgttttg tattcttgaa attcactaag 30781 agaaatctta agtatcctta cgacatacac acagacaaaa tggtgactat gtgaagtgat 30841 agatatgtta attagcttaa ttgtagtgca gtagtgcccc cttatcctca gaggatatgt 30901 tccaagaccc tcagtagatg cttgaatcca cagatagtac ttaaccctat atatactatg 30961 ttttttccta tcctgtacat atatacctgc attaaagttt aatttgtaaa ttagccacag 31021 taagagatta gtaacaatga ctaagaaaat agaacaatta tgacaagatg ctgtaataaa 31081 aagtatgtga atgtagtctg tttctctcaa aataggtact ccgctcacct gttttcaaac 31141 catggttgac catgggaaac tgaaactgtg gaaagtgaaa ccgtggatat ggaatgactg 31201 atgtaatcac ttcttattgt atagatgtat caaaatatca cattgaacac cttaaatata 31261 aactttttcg tttgtcaatt gtacctccat aaagctggaa aaaaatgtga cctcatttgt 31321 ttgttggtga tttccttttt tgaaacagca catgtcatat ttggtaaaag gagttgatat 31381 
tgttaatcca tataattgga tctgagttgt aggaattaat ctccttactc aatggttttc 31441 aaccttgttt ccaagagtag tagatatcag ttttgtagct atttttcaga aaagctaatt 31501 agtagccctt tgagtatggt ggcagttttc tttcctatat ggttatacct ttttggatgg 31561 atcatttttg ggctttgagc tcttttataa gtaagtgcat tgtttaaaaa agcgagtata 31621 agttgtcact aattaggggt tatcattaga ttggttcaaa tgaactcaac taaatgatgg 31681 tataattggt aatctcttca agtactgagt gtagattttg ttttaaactg taatgtatgt 31741 gattaaaatt ttgataaaga tttttctcga gtatttctaa tttttctctc tcctgcttta 31801 tgtaaatgta gcacttattg acttttctcc tattttacaa ttgtagataa tttaaaaaat 31861 gtatgcagga ttaataagga aaatgaaaac tgttcgtaat cctaccatgt agagataatc 31921 gtgaacattt taatatatat tttcattttc tcttcttttt tgttttttca caaaattgga 31981 atcaaaatat gtgcagaact ttttattctg tttttttctt aacgttatat gttgggtatt 32041 tttacatatc cctttaatat tccttgaaac tgttttgggt ttttatatta ttctatgata 32101 tagctctatc acaatatgtt taaccattct gttttaagat ttttaggctc ttcctactat 32161 ttttgctatt ataaataact tttcaaacat ctattacact aaatcttttt ttaattttta 32221 agttttgtgg gtacatatca agtgtacaca tttatgggac atatgaaatg ccctgatgca 32281 ggcatgcagt tctgttactc taaatcttga ctcaactcta tttttacttg ttaaattctt 32341 agctataaaa ctgaggtatt ttaaggattt ttatgtacct taacaaaatg tccttcaatt 32401 gacattattc tcaaatatgt ttgttgaaaa tatgtttgtt gaaagtacct tttttttttt 32461 gcaaagtgct tgttctttgc ccattttccc cccatttttt gttctttttt tggtgtgtgt 32521 gggggaattg ttttggctag gtaagaatta cttttcttct gtttcaacaa tttttaaagt 32581 gctatcctga gtccttttca taattttcaa gaaaacacat cagttacaga ttttaggttg 32641 caaattgttg ttggaaaagg agacctggac cttaaaagag gaatcaggct ggaaagcaga 32701 atgagatatg agaatcatct gtatatgaga tgatgagggc tgtaggagtg gatgagagtg 32761 ctgaggaaaa aagagaatgg tgggaaaaaa gcccatggag ttgagaaacc ccatgaaaaa 32821 gttgtgagag gaaaagagaa gagaatgagg atgtgagggc atttaaaaaa gctactgaaa 32881 aaagttgcat ggaggtatac tgaaggagaa aaattgcaac agtgtcagat gctgcataga 32941 ggtctttctg gattttgcat ccatgttcat cagggacatc agtctaaaat tctctttttt 33001 tgttgtgtct ctgccaggct ttggtatcag gatgattttg gcctcataaa 
atgaattagg 33061 gaggattccc tctttttcta ttgattggaa tagtttcaga aggaatggta ccagctcctc 33121 tttgtacctc tggtagaatt cggctgtgaa ttcatctgct cctggacttc ttttggttgg 33181 taggatatta attactccct caatttcaga acgtgttatt ggtctattca gggattcaac 33241 ttcttcctga tttagtcttg ggagggtgta tgtgtccagg aatttatcca tttcttctag 33301 attttctagt ttatttgcgt agaggtgttt atagtattct ctgatggtag tttgtatttc 33361 tgtgggatct gtggtgatat cccctttatc attttttatt gcatctattt gattcttctc 33421 tcttttcttc tttattagtc ttgctagtgg tctatcaatt ttgttgatct tttcaaaaag 33481 ccagctcctg gattcattaa ttttttgaag ggttttttgt gtctctattt ccttcagttc 33541 tgctctgatc ttagttattt cttgccttct gctagctttt gaatgtgttt gctcttgctt 33601 ctctagttct tttaattgtg atgttagggt gtcaattttg gatgtttcct gctttctctt 33661 gtgggcattt agtgctataa atttccctct acacactgct ttacatgtgt cccagagatt 33721 ctggtatgtt gtgtctttgt tctgattggt ttcaaagaac atctttattt ctgccttcat 33781 ttcgttatgt agccagtagt cattcaggag caggttgttc agtttccatg tagttatgcg 33841 gttttgagtg agtttcttaa tcctgagttc tagtttgatt gcactgtgat ctgagagaca 33901 gtttgttata atttcttttc ttttacattt gctgaggaga gctttacttc caactatgtg 33961 gtcaattttg gaataagtgc gatatgctga gaagaatgta tattctgttg attttggatg 34021 gagagttctg tagatgtcta ttaggtccgc ttggtgcaga gctgagttca attcctggat 34081 atccttgtta acgttctgac tcattgaact gtctaatgtt gacagtgggg tgttaaagtc 34141 tcccattatt attgtgtgag agtctaagtc tctttgtagg tctctaagga cttgctttat 34201 gaatttgggt gctcctgtat tgggtgcata tatatttagg atagttagtt cttcttgttg 34261 aattgatccc tttaccatta tgtaatggcc ttctttgtct cttttgatct ttgttggttt 34321 aacgtctgtt ttatcagaga ctaggattgc aacccctgcc ttttttagtt ttccatttgc 34381 ttggtagatc ttcctccatt cctttatttt gagcctatgt gtgtctctgc acttgagatg 34441 gatctcctga atacagcaca ctgatgggtc ttgactcttt atccaatttg ccagtctgtg 34501 tcttttaatt ggagcattta gcccatttac atttaaggtt aatattgtta tgtgtgaatt 34561 tgatcctgtc attatgatgt tagctggtga ttttgctcgt tagttgatgc agtttcttcc 34621 tagcctcgat ggtctttaca atttggcatg tttttgcagt ggctggtacc agctgttcct 34681 ttccatgttt agtgcttcct tcaggagctc 
ttgtaaggca ggcctggtgg tgacaaaatg 34741 tctcagcatt tgtttgcctg tgaaggattt tatttctcct tcacttatga agcctagttt 34801 ggctggatat gaaattctgg gttgaaaatt cttttcttta agaatgttga atattggccc 34861 ccactctctt ctggcttgta gagtttctgc tgagagatcc actgttagtc tgatgggctt 34921 ccctttgtgg gtaaccctac ctttctctct ggctgcctta acattttttc cttcatttca 34981 actttggtga atctgacaat tatgtgtcct ggagttgctc ttctcgagga gtatctttgt 35041 ggcattctct gtatttcctg aatttgagtg ttggcctgcc ttgctaggtt ggggaagttc 35101 tcctgtataa tatcctgcag agtgttttcc aacttggttc cattctccct gtcactttca 35161 ggtacaccaa tcagatgtag atttggtctt ttcacagagt ccggtatttc ttgtaggctt 35221 tatttgtttc tttttactct tttttctcta aacttctctt cttgcttcat ttcattcatt 35281 tgatcttcca tcactgatac cctttcttcc agttgatcga atcagctgct gaagcttgtg 35341 cattcatcac gtagttctcg tgccatggtt ttcagctcca tcaggttatt taaggacttc 35401 tctacactgg ttattctagt tggccagttg tctaatcttt tttcaaggtt tttagcttct 35461 ttatgttggg ttagaacttc cttctttagc tcggagaagt ttgatcatct gaagccttct 35521 tctttcaaat cgtcaaagtc attctccatc cagctttgtt ccgttgctag cgaggagctg 35581 cgttcctttg gagggggaga ggcactctga tttttagaat tttcagcttt tctgctgttt 35641 tttccccatc tttgtggttt tatctacctt tggtctttga tgatggtgac gtacagatgg 35701 gggtatggtg tggatgtcgt ttctgtttgt tagttttcct tctaacagga ccctcagctg 35761 caggtctgtt ggagtttgct ggaggttgac tccagaccct gtttgcctgg gtatcagcag 35821 tggaggctgc agaacagtga atattgctga acagcagatg ttgctacctg atcgttcctc 35881 tggaagcttc gtctcagagg ggtactcagc tgcgtgtggt gttagtctgc ccctagtggg 35941 gtgtgcctcc cagttaggct actcaggggt cagggaccca cttgaggagg cagtctgtcc 36001 attctcagat ctcaaacttc atgctgggag aaccactacg ctcttcaaag ctgtcagaca 36061 gggacattta agtctgcaga ggtttctgct gccttttgtt cgctatgccc tgcccccaga 36121 ggtggagtct acagaggtag gcaggcctcc ttgactgcgg tggctccacc cagttcaagc 36181 ttcctggccg ctttgtttac ctactcaagc ttcagcaatg gcgcgcgccc tctcctagcc 36241 tcgctactgc cttgcagttt gatctcagac tgctgtgcta gcaatgagcg aggctccgtg 36301 ggtgtgggac cctccgagcc aggcacagga tataatctcc tggtgtgctg tttgctgaga 36361 ccattggaaa 
agcgcagtat tagggtggga gtgacccgat tttccaggtg ccatctgtca 36421 cagcttccct tggctaggaa agggaattcc gtgaccccct gtgcttccca ggtgaggtga 36481 tgcctcgccc tgctttggct cacacttggt tggctgcact cactttcctg cacccactgt 36541 ctgacaagcc ccagtgagat gaacctggta cctcagttgg aaatgcaaaa atcacctgtc 36601 ttctgcatcg ctcacactgg gagctgtaga ctggagcttt tcctattcgg ccatcttcta 36661 gtctttaatt tttcatctgt atgatgagaa gtgaatccaa tgactttttt agattgaaaa 36721 attttgaatc aaactcaact aaaatgaaca aaatatatga aactacaata atgaagacgt 36781 tatgatattg gtgaaaaaag acaagtaaat caatggaaca gaacagataa tccagaagta 36841 gacccacata tatatattat tataagttca ttttctacag aggtgcgaag gtaatggagg 36901 aaggatattc ttctcaacaa ataatgctgg agcaattggg tatccaaatg cagaaaatga 36961 acttcaatcc ttacctcaca ccatttataa aaactaacca gaaatgaagc atagacctaa 37021 acttctagaa gatgacatag gagaaagtct taatgagctt gggttaggca aagatttcat 37081 agatacctac cactgaaaac ataacccata aaagaaaaaa aatgaataaa ttgagcttaa 37141 tcaaaattaa aaacttctat tccttgaaat atacttctta tggaagtcaa aacacaagct 37201 gcatctgaaa taaaggttac attttgctca taggaaagtt cttcaggtcc ttaaaaatca 37261 gaacctactt gtagaacatt tcacttatgc attttatcat atcagatgat cctgtgtttc 37321 attgttttga ctactaaatg ttctaataaa attcgctgtt ttaaaaaaat gaataaacaa 37381 catactaaat tttagcaact ttcgcttcat tttttaatct ttaaaagaga gagagagaaa 37441 aaaaaaggct ggaattgatg aaaggacata tattgagagc agttattgcc tgagtgcaaa 37501 gaagcaaacc aaaatggtaa gaaataagca taaaaattta aagactatgg tagttggggt 37561 tatttattaa agagacaaac aacctttgac atcttattag ttcccaagaa attgaccata 37621 ttaacataaa cacaaataca gaagttgatt tttgttcatt ttattaaaat atatattttt 37681 aattagtctt gaaaacagtg aagcagtgta tactttgtat cttcaatttc agaaacagta 37741 agtatgtaac atagtttcaa aatcagattt attgaggtat aatttacata cacaaaatat 37801 gccaatttta agtgcacagt tcaatacgtt ttaacagaca tattcactca tataatcacc 37861 accattaaga tatagaactt tttgatcact ccagaaagtt ctttcattcc cctttgcagt 37921 catttcccaa cacctcccct ctgttccgca gcaactacag atctactttt tgactttaaa 37981 gattggtttt gccttttcta gaatttcata tgaaaggaat tattcaatgc atccttttga 
38041 gtctggcttt ttctcagtgt aataattttg agatttatct atattgttat atatattagg 38101 acatttcctt ttgattttca aataacattc cattgtaggg atatacccca atttattttt 38161 ccatgcactt cttttttttt tatatttggg ttgtttgtag tttttggcca ttatgactga 38221 agctgctatg aacattcaca tataatgttt tcatttctct tgggtatgaa tatctgggag 38281 tagaattgct atgtgttatg aaaagtgcat atttaacttt ataagaactt ccaaacttgt 38341 tttccaaaat gattgtgcca ttttacattt tcaccagtga tgtttaagtg ttccattgca 38401 ctacatcctc actgaccctt gatattgtca gtcttttaca ttttagctat tctagtgggt 38461 gtataattgt aatttaagtt gtaaaatatt tacatcccaa gtagctaaaa ttaagaaaaa 38521 gtttagagtt ggatggtggt aaacctttaa ttacctgtaa aatttggtaa tctggaacaa 38581 ctcacttatc ctaaagcttt tatataattg atatttcaaa ataatgtaag atggtattac 38641 atgcttttta gctctattgt aatctccagt taagagtaag taaatagttc caactattta 38701 atgaacatta atttgacatt tcttttgttt ccaacttaat gtaatccaaa attacccata 38761 aaatgtaata tttaggggtg gtatatattg tggggaaaaa tgtgttctaa agataatgaa 38821 aaagtactct agtgataatg atagccatca atcattaaga actgactgta tactatatac 38881 tatgccaaag gctttataaa tataatttca tttaattgtt acaaaatggg gtacatattc 38941 ttattttgat tttacaaaga cagttaagga actgaaacat gaaatacaag gttaaaaata 39001 attgaataaa taaattgata aaaggaatga atgaatagag caaggaaaga ataagcgaat 39061 aagaacgtgg gcttgttttt ttcaaacaaa ttatatgtta tggactctag atctcttaac 39121 ttagcttcag tgacattgga ttattgaaaa atgaaactat tttgaggaat tgactgaccc 39181 ctaaagtttc cacaataact tatttaattt tttatctttc taatactttt ttgccttata 39241 atataaaatt tgaatgtttg ttattagtgt gaaatgaaat cctttcaaat atatgccatc 39301 agaaggatgt gttacaaata tacagtatta caaacattta ttttgtatgc tgaataagaa 39361 aaaatcagtt ataatacagt tttaacatag tatccagtgt gtgaattatt taatgaaata 39421 tttgatcttt attttttgtt cccagggagg ttatattcaa aagaaaaagg aactgtgggg 39481 aatctgtatc tttattgcag cagttgacct agatgagatg tcgttcactt ttactgagct 39541 acagaaaaac atagaaatca ggtaaagttt cttgtataaa tataagcctc tgccataaaa 39601 ggaaatgaat tctggatttt cctctcaata gacttttgtg aattagtgag aaatgctaaa 39661 ataaagtaaa acaaaaagaa cttggaccaa atagtgaact 
gccattctct catggagccg 39721 ttatgaaagt gtatttatgc tgtatttctt taagaggtag cagtttgtgt cctggaaaaa 39781 ttttcattgt gtctctcact attcatgtgt aagctttctt aattgtggct gttaataaag 39841 tgaacagaaa agcaaggatt tgtaaatagg cttgttttat ataaaacctt tatataagta 39901 gtttgagttt atcttaaagg tatttaggta agatttatta aagcctaaga ttgttttaac 39961 tagaggatat tataactaaa tgcttttttt cagattatat attgatttga acatgctaca 40021 atacttgaac cagttaatta cttaataggt gtttgttgtt gagtttataa gcctcataag 40081 aagagtttat taatgtgttt tcaataatgt atgaacttta tcttcagaaa gtatccagga 40141 cttccagtca ttttctttgc aacatattta caaaaatgca acagtatgta tttttaccaa 40201 ataggtaatt atgtttctcc ttgaacaaag acatctagaa ttaaacataa ttgatatttt 40261 atttaaacga gatagattaa attgttgcat ttttgaaata ctttttttgc tatatctaaa 40321 taaaatgtgg aaacaaaggc cttgtatatt tgtaaatata caaaataata gaaatgacat 40381 aaatagttca tagagtcaat acctgcaaca ccacaggcac tcactgcttc tctcccttcc 40441 ttgttttctt cccttaagtt tcccctagaa aaaaggcttt gttgtaacaa ttcagttttc 40501 atggtaatct caggacaggt attgccatat acctcaaaaa aactggaact gagagtagtt 40561 ctttaataaa tgctattctg aggctgattt ctttttcttt ggatatatct gcacttttgt 40621 tcctttctct tttctcttct aatcaacatt ctgatcttct attctttgtc atttctactc 40681 tttctttatt tgctttttca catagcttgt catggctccc ccattccttt ctgtatcata 40741 ggaactttaa taaccttttg gttcccctgc tgattagctg aagcttttat ttccccagtg 40801 tttgattggt ccagcccatt tttttgagtg agggctataa ctcattggtt gctggacagc 40861 ctatggatgg gtcaggtgac tatcttttgt ttcagttagc tgctttggta taattcctta 40921 tacaaaattc atgattagaa attagcatgt gagatgtagt tgatgattgc tatctttaat 40981 atcctattgg aaaatgtata tattgatgtt tcaaatacgt taatttttat ggaaaagagt 41041 aacgaagagt gatttaaagt tattatttaa actgcttaaa ttatttaaat caactttcct 41101 tttccaaaaa taattcaatt ctaaaatcat tggtggggca aataaagtat gcattttggt 41161 gtgtatctct gacaggaggg tgccacaggg cccagagcag gatgttagag tcaaagtagg 41221 gtgaggaggg catcctttgt gtgggggcaa cctgatgggg ctgctttgtg tgcagtgtca 41281 gagactaagc agggtgtgca ggcagccggt gcacaagtga ggtgaggagg gcagcctgct 41341 gtggggagtc agaacccttc 
cacagtgagg aagtgtctat gctaaggggt ggggtaagtc 41401 cagaactggg agtcagttct gagaagaaca gaaaagtgtc cacacggggg agggggtggc 41461 tttggagata gaaaatggtt acttacaggg ggattgatta aatgagtaag tgaactgagg 41521 atgaatgggt tccaggtttc ccactgtcaa tgaagagtta caaatataga aagggaaaca 41581 ctaaaatgaa gcatatggtt ttatattgga attggatgaa tggatgtgaa ctcatgtttt 41641 ttaaaacaca tagacagata cagaataaat atacgcatat ctcttcttga cccttcgttt 41701 tcttatattc tcaatgatga atagatataa gcataggtat atagataata gaggtgtaag 41761 ttgaaggcta attatttttg caaaaagtaa ttccttccaa aggatatagt agtgatttga 41821 tgtagagctg ataatctttt gaattgaaat atctatgatt tgaaaacgaa ataacacaaa 41881 tttttaaggt tactgattta cttttttcta ttctttcctt tgtagtgtcc ataaattctt 41941 taacttacta aaagaaattg ataccagtac caaagttgat aatgctatgt caagactgtt 42001 gaagaagtat gatgtattgt ttgcactctt cagcaaattg gaaaggtaaa gtaaacattt 42061 tattagggtt acactctgat tttttatgtc attgttcaca attagattct gggaattatt 42121 taacacattt agtaaagtta gtaagtatta attcttagac ttgtcccttt taatgttagc 42181 tcattaattc ttagctttct tatttatcca gtaatatgca ttctgaatgc ttcctggaaa 42241 attaaccgtt ttattatcct ttcatgtctt ccatttgttt tcaaaaactt taagcttatc 42301 tcccatttta gccttctttg ccagaaaact tctagttctg aatagaatgg tgacataaat 42361 ttaaacttct tcctaatagt gttatttagt ttggaacgag aaaacacttg caatggatgg 42421 tgtatgtgac ctatactata aagtaatgat ttgtactccc aatggtggct tagaggactt 42481 tttccattac agttcgagta ggagcataat taatatgctt aggttaagtt ttttctatac 42541 ttttcatcaa aactgtactt ccctacccca aatgtttcag cccttaattt ggcatatgtc 42601 tgttatcatg tttgttttga ggctcatttc atttgcggtt aagagaatat gaaagttaca 42661 gtgttagcta aataagcaat tgtgtaatag catggtaaag ggagaagggt aaaagaggca 42721 tggccctatt ctggagagaa tgaggatgga gatagaggtg gaaagaagca tctccttagg 42781 aggatggctc tattttctcc ccctttttct tttttgacca ctaaaacttc acaactgtgg 42841 tggaggatga gaaaaattag agaaaaaact caaaggccta gggcttttga aggactgtct 42901 tttcttgccc ctggttttaa ttgtaatatg gtagcattgg ccatcttttt tttttttttt 42961 tttttggtct tttcttgatg agaatagaag ctttttgcac aagaaaaagg gaaattttta 43021 
attattaatg gagataaatt atctaaataa tttgcataac aaacatgaac taattataga 43081 atacttggaa aacacagaaa ggcataaata aaatatgaat cgaccaaaat gtcaacagtg 43141 gagagaatca cttttcatat tttagaatgt ttctttcctt tgggtattgt ttcacattga 43201 gtagaatatt atatatatgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtggct 43261 tttaaaattt acctaattgt gactttctaa atttttttct attctttaaa cttgtttaca 43321 acctttttaa ttcatatctg gtagcttatt ttaatatgat cctaaataca atgttgcttt 43381 agagtttttt ttggtacaat attatttagg cccctgaaaa actggaaaca actgatattc 43441 acagctgaag ttaaacaata catattaaac atatctaaac ataaaatgtt ataaatataa 43501 aataaaatgt gataaagaat aatggcaata ttggcaagtg cttctaatgt taaacaaatg 43561 aagcaggata tactctgatt atgactgtgt gtaaggaaga aacctagacc actaacgcat 43621 agaatgaaat gactggagag aagtttacca caatgttaac ccaagtttta tatgagtaac 43681 ttacatttgt aaggaggatg tattaccttt ctagtaaaca aagcaagcaa aaacttcttt 43741 aagccaactt taatgagtat tctatacata taagtacttt aagatctata atcatcaggg 43801 tttttttttt tctagatata gaatttgggg actcattggt aggagtgatt tggggtacag 43861 ttctgatttt ttttcctgct atagtaattt cgactggatg cttccaagtt gtttgctctg 43921 ctgtctgaga taaggaatct gcttggatat ttgtttctgt tgtaagaagc aggaattaca 43981 tagtggacaa agcattacat tgtatatgtt taagggtgaa gggtaactat agtggtgatt 44041 ttggtcagag aaaaaaccat agaaggcaat gtccgaatag ggtttacagg aaaaatagaa 44101 attcaccaag caaaatgagt gggaactggg agcgagcatc ttagagatta gcatgtgtac 44161 aaagccaaag cagtgatatc aggatgtact ttgatattgc taatggctga aaggtaagaa 44221 gggatcagat tggtagtaat cttggctttg ccctgcaggc agtggaaact ccttgaatga 44281 ttttaaacta gggagtaaca gttggatttg gcctcaaaag accactcagg actttgtaaa 44341 agggatttaa aatttttatt gacaaatcat atgagataat ttaaagagta aactttacta 44401 accttaggtg gatcagctgg gtgttttcta tcttatttat accttttttt tgaagactaa 44461 ttgagaggat taactgtaat tatatattaa agtgatgtga gatgtcataa attgggaaaa 44521 tctacttgaa ctttgtttta taatgctata tattttttgt ttttaaaata tatacttctt 44581 aaaagaagat gaataaagca tgagaaaact actatgactt ctaaattacg aaaaaatgtt 44641 aaaaagtcat aatgtttttc ttttcaggac atgtgaactt atatatttga 
cacaacccag 44701 cagttcgtaa gtagttcaca gaatgttatt tttcacttaa aaaaaaagat ttttatggaa 44761 taatctcaaa catcttgata gttagggtta gtttgatcga ttatagcagg ctacttcata 44821 aattaagccc atagatttaa gtcctgtgta gattatttat cttctcacaa agaaaatagt 44881 ataaaataca tgccttgtac tacaaagaag aactaataag gtggaattga ttcaggacag 44941 catatcacca actctgagaa aaatgcaaca aatgcaaatt cattgactaa atctttattg 45001 agggtctgtt acaggcactt tattaactaa taatcagcat aatttctgtg tgagaataaa 45061 tgtaaaaatc tgtattaaaa tttccaaatg attattttaa atgtataatg catgctctaa 45121 cagtatgccc atgtagagct ccagagtttt ttcttggaaa cagaatgagt agtacatgag 45181 attttctgcc tcattggagt agtattgaag ataattaata taaagggaaa ttgtatattt 45241 actgattaat tgatatcaat ctattaattc caacaagtga atgtctctgg aaagattatc 45301 aaggcaaagt gttaaattgg caaactaaag tcatccaaac cttcattttt ctgctcacag 45361 tgttgataat taatcagaaa aaagagcaaa aaatattaag gtaatttgaa acaaagtatg 45421 ttataacata ctatgttttt tatatatttt tatattagaa ttgaaatatt cagtatttct 45481 tttacaaaat ttttctttca aaatgtatac ttttttttct taattttttt ttttgcagct 45541 tctcatggtc aagaatgtat actattctgt gggctaaata tcatatctta gaattataag 45601 acatagaaac attaaatgaa tagagataaa ctcaggtgta aattatgcaa ttaaaatgga 45661 ctgcattcta ttatgcattt aactaaggtc attttttttt taatgcacaa aaagaaacac 45721 ccaaaagata tatctggaaa actttctttc agtgatacat ttttcctgtt ttttttctgc 45781 tttctatttg tttaatagga tatctactga aataaattct gcattggtgc taaaagtttc 45841 ttggatcaca tttttattag ctaaaggtaa gttcattata tttattaaat gctaatattt 45901 caaatgtaat aattaaattg gcattccttt ggactaaatt ccccaatttt tattgagtaa 45961 tgtactcctc cctcattctc tgcttggctt attaactgtt agcaagttcc tataattctg 46021 gtactagaaa caaccttgga aatgctttat ttaatttttg tttctaatat tccatcttcc 46081 ctccctttct tttttttgaa gaataatgat gttactgttt tcataaaaat gatatttgct 46141 catgataaat aattcaaata aaataatttt ggtcacaaaa gcacaaagaa aaattatcta 46201 aaatccaccc agggataatc ggcattaaaa ttacataatt gggatgcagt taaggtatag 46261 aatttttatt atttttgttt gtttatgcaa acagtgttaa cttgttatca gttgaaataa 46321 tgggttataa gatagtattt gcaagcctca 
tagtaacctc aaaccaaaaa acatacaatg 46381 tagacatgaa aaataaaaag caagaaacta aagcatatca tcagagaaaa ctactttcac 46441 taaaggaaga cagaaaggaa agaaagaagg aagagaagac cacaaaacaa ccagaaaaca 46501 aataacaaaa tgacatgagt aagtccttac ttaccaataa taacgttgaa cataaatgga 46561 ctaaagtctc taattaaaag acacagactg gttgaatgat ggaaaaacaa gactcatttg 46621 atctgtttcc tacaagaaac acactttgcc tataaagaca cataccgact gaaaataaag 46681 ggatggaaaa atatattcca tgcccacaga aaccaaaata agagcagggg tcactatact 46741 tatatcagac aaaatagatt tcaagaacaa aactgtaagg acaaagaagg tcattatata 46801 atgataaagg ggtcaattca gcaagaggat ataacagtgt taaatattta tgcaactaat 46861 actggagcac ccagatatat aaagcaaata ttattagagc taaagagaga cataggcccc 46921 aatacaatag aaatagctgg agacttcaac accccacttt cagcattgga cagatcttcc 46981 agacagaaaa tcaacaaaga aacatcagac ttaatctgca ctatatagac catatgaatc 47041 taatagatat ttacagaaca tttcatccaa gaatatacat tcttcttgtc tgcatattga 47101 tcattctcaa ggatagacca tgtgttaggc ttaaaacatt aaaaaagcat gaaataatat 47161 caagcatatt ctctgatcac aatagaataa aactagaaat taataacaag gtattttgga 47221 aactgtacag atacatggaa attaaacaat atgctcctga atggccatgg gtcaatgaag 47281 aaattaagag ggaaattaaa aaaatttttg aaacaaataa tggaaacata acaaaccaaa 47341 acctatggga tacagtaaaa gcagtacaga gggaagttta tagctgtaag tgcctacatc 47401 ccaaaagagg aaaacatctt tttttaaaaa aacttttagg ttcaggggta catgtgatgg 47461 tttgttacat aggtaaactt gtgtgcacag gggtttgttg tacagcttac ttcatcaccc 47521 aggaattaag cccagtatcc aatagctatt ttttctgttc ctctccctct tcccactttc 47581 caccttcaag tagaccccag tgcctgttgt ttccatcttt gtgttcttca gttttcatca 47641 tttagctccc acttataagt gaggccatgc agtatttagt tttttattcc tgcattagtt 47701 tgctaaggat aatagcctcc agctccatcc atgttcctgt gaaggacatg atctcattct 47761 tttttatggg ctgcatagaa ttccatggta tatatgtacc acattttctt tatccattct 47821 gtcattgatg agcatttagg ttgatgccat gtctttgctt ttgtggatag tgctgcagtg 47881 aacattcacg tgcttttgtc tttatggtag aatgatttct attcctttga gtatgtactc 47941 agtaatggga ttgctgggtc caagggtagt tctgctttta gctctttgcg gaattgtcag 48001 actgctttct 
acaatgtttg aactaattta cactcccaac caagagtgta taagtgttct 48061 cttttctttt gcaacctcac cagtatctgt tattttttga ctttatagta acagctattc 48121 tgactggtgt gagatgatat ctcattgtgg ttttgatttg catttctcta atgatcagtg 48181 atatggagct tttttttcat atgcttattg gctgcatgta tgtcttcttt tgaaaagtgt 48241 ctgttcatgt gccttgccca ctttttagtg gggttggttt tttttctctt gcaaatttgt 48301 ctaaattcct tataaatgct ggatattaaa cctttgtcag atgcatagtt tgcaaaattt 48361 ttccccattc tgtaggttgt tcatttactc tgttgattgt ttcttttgca gagcagaagc 48421 tcttgagttt aattcaatcc cacttgtcaa ttttttgatt ttgttgtgat tggtgtcttt 48481 atcatgaaga ctttgcctgt tcctatgtct aggatctatt tttttttttt gagacagagt 48541 ctcactctgt cacccaggct ggagtgcagt ggcacaatct cggctcactg caacctctgc 48601 tcccgggttt aagtgattat cttgcctcag cctcctgagt aactgggact acaggtgtgc 48661 accaccatgc ccggctaatt tttgtatttt tactagagat ggggttttac tatgttggcc 48721 aggctggtct tgaactccct acctcaggtg atccatgcac ctcagcctcc caaagtgctg 48781 ggattacagg aatgagccac cacacctggc ctggccatct tcaatccatc ttgagttgat 48841 ttttgtatat ggtgtaagga atgggtccag cttcaatctt ctgcatatgg ctagccactt 48901 atcctagcac cagttaatga aatgggagtc ttttccccat tgcttgcttt tgtcagcttt 48961 gtcaaaaaga tcagctggtc ctaggtgtgc aaccttattt ctgggtgctc tattctgttc 49021 cattggttta cattattctg aaaaaaatgt gcctgttttt gtaccagtaa catgctgttt 49081 tggttactgt agccctgtag tatagtctga agttggataa tgtgatgcct ccagctttgt 49141 tctttttgct taggattgcc ttggctattc aggctctttt ttggttccag ttgaatttta 49201 aaatagtttt ttttttttta tttctgtgaa gaatgtcatt ggtagtttga taggaatagc 49261 cttgaatctg taattttttt ggggcagtat ggccatttta atgatattta ttcttcctat 49321 ccatgagcat gggatgtttt tccatgtgtt tgtgtcatct ctgattgctt tgagcagtgt 49381 tttgaaattc ttgttgtaga gagctttcac ctccctggtt agttgtattc ctatgtattc 49441 tttttgtggc aatttttaat gggattgcct ttctgatttg gctctcagct tggctcttgt 49501 tggtgtgtag aactactagt gatttttata cattgatttt gtatcctgaa actttgctga 49561 agttgttgat cagctgaagg agctttttgg ctctaagact acagggtttt ctagatatag 49621 catcatgtca tctgccacag agatagtttg actccctctc ttcctatttg gatgcccttc 
49681 atttctttct ctcacctgat tcctctgatt aggacttcca atactttgtt gaataggagt 49741 ggtgaaaaag ggcatccttg tcttgtgcca gttttcaaag ggaatgcttc aagcttctgc 49801 acattcagta tgatgttggc tgtgggttag ttataggtga aaaaaagagg aaaaactcaa 49861 ataaacaatc taacattgta tcttaaagaa gtacaaaaat aagagcaaat caaacccaaa 49921 attagtagaa gaaaagaaag atcgagcaga aataaatgaa attgcaatga aaaaattgca 49981 aaagatcaat gaaacaaagt gtttttttaa aaagttaaac aaaattgaca agcctttagc 50041 tagattaact aaggaaaaaa gagagaagat acaaataaga tcagaaatga aaaaggagac 50101 attacagctg atactgcaga aactaaaagt atcattagtg gcttctataa gcaactatat 50161 gccaataaac tggaaaatat agaagaaatg ggcaaattcc tagacatgta caacttacca 50221 agattgaacc aggaagaaat ccaaaacctg aacagaccaa taacaagtaa tgagattgaa 50281 gccataataa agtctcctag taaagaaaag cctgggacct gatggcttca ctgctgaatt 50341 gtaccaaaca tttaaagaag aactaatacc aatcctactt acagtattct ggaaaataga 50401 ctaggaagaa atacttctaa actcattcta caaggcctgt attaccctga taccaaaacc 50461 agataaagac ccattaaaaa aagaaaacta taggccagta tttctgagga atattgatac 50521 agaaatcttc aacaaaacac tagcaaactg aattcaataa tacattagat agatcattca 50581 ttatgtcgca gtgggattta tccctgcaat gcaaggatgg ttcaacataa acaaatcaat 50641 gtggtatgtc atatcaacag aatgaaggac aaaaatcata tgatcattta aattgatgct 50701 gaaaaagcat ttgataaaat tcagcatcct tcatgcccaa aaccctaaaa aacctgagga 50761 tagaaggaac ataccttaac gtaataaaag ccatatataa cagacccaca gctagtataa 50821 tacttaatgg ggaaaaattg aaagcctttc ctctaagatc tagaacatga tgaggatgtc 50881 cattgtcatc actgttattc agcatagtac tggaagtcct agatagagta gtcagacaaa 50941 agatataaag ggaatccaga ttggaaagaa agaagtcaag ttatccttgt ttgcagctga 51001 tacaatctta tatttggaag aacctaaaga ctccacaaga aaactattag aactaataaa 51061 catgttcagt aaagttgaag gatagaaaat caacatacaa aaatcagtaa tgtttctgta 51121 tgccaacagt gaataatgtg aaaaagaaat ttaaaaagta gtcctattta caatagccac 51181 acataaaata aaatgcctag gaattaacca cagaagtgaa agatctctat gatgaaaacc 51241 atagaatact gatgaaagaa attgaagagc acaccaaaaa aatggaaaaa aattttatgt 51301 tgttcatgga ttggaagaat taatattgtt aaaatgtcca 
tactacccaa agcaatctat 51361 agattcaatg caattcctat caaaatacca atgacattct tcacagaaat agaaaaaaca 51421 gtcctcaaat ttatatggaa ctacaaaaga cccagaatag cgaaaactat cctaagcaga 51481 aagacgacgt tgcttgactt caaattatac tacagagcta tagtaaccaa aacagcatgg 51541 tactggcata aaagcagata catagaccag tggaacagaa cagagagccc agaaacaaat 51601 ccacatgcct acagtgaact catttttgac agtggccaag aacatatgct ggggaaaaga 51661 cagtctcttc aataaatggt gctgggaaaa ctggatatcc atatgcagaa gaatgaaact 51721 agacccctat ctgtcaccat atacaaaaat caaatcaaaa tgattaagga cttaaatcta 51781 agaccttaaa ctatgaaact gctataagaa aacattgggg aaaatctcca ggacatcagt 51841 ctgggcaaaa atttcttgag caatacctca caaacacagt caaccaaagc aaaaatggat 51901 aaacgagatc atatcaagtt aaaaagcttc tacacagcaa aggatacaat gaatgaagtg 51961 aagagacaac ccacagaatg gaaaaaatat ttgcaaacta cccctctgac aagcaattaa 52021 taaccagaat atataaggaa cccaaacaac tctatgggaa aaaatctaat aatctgacca 52081 aaaaattggc aaaagagttg aatacacatt tctcaaaaga agacatacaa atgacaaaca 52141 ggcatatgaa aaggtgctca acatcagtga tcatcagaga aatgcaaatc aaaactacaa 52201 tgaggtattt atctcactgc agttaaaatg gctgatattc aaaagacagg caataacaaa 52261 tgctggctag gatgtggaga aaagagaacc cccgtacact gttggtggga atgtaaatta 52321 gtacaactac tatggagaac agtttggagg tttctcaaaa aaattaaaaa ttgagctacc 52381 atattatcca acaatcccac tgccaggtat atacccaacg aaaggaagtc agtatatcaa 52441 agaggtatct gcactcctat gtttgttgca gcactgttta caatagctaa gatttgaaag 52501 caacctaagt gtccatcaac agatgaatgg ataaagaaaa tatggtacat atacacaatg 52561 gagtactatt cagccataaa aaagaacaag gtctagtcat ttgcgacaac atagatggac 52621 ctggagatca ttatgttaaa tgaaataagc caggtacaga aagacaaaca tcacatactc 52681 tcatttattt gtgagatcta aaaatcaaat ctgaactcat ggacctagag agtagaagga 52741 tggttaccag aggctgggaa gggtggtgga ggtttgggag aggtgaggat ggttaatggg 52801 tccacaaaca tagaaaaaat gaataagact ttatagcaca acagggtgac tttagtcaat 52861 aataccttaa ttgtacattt taaaataact taaagagtgt aattggattg tttgtaactc 52921 aaaggataaa tgcttgagga gatggatacc ccattctgca taatgtgctt atttcacagt 52981 gcatgcctgt atcaaaacat 
ctcacatacc ccgtaaatat atgtactatg tacccacaaa 53041 aattaaaatt taaaaaatta tgtaattgtc attccagata tcttcatata tacattttga 53101 ttaaaaagat gggtagttga gtgcattcac gaatataatt ttctaaaact aggttcatat 53161 tacagtgcta cctttaaaat ataaaaatta atttcactga aattttatag aagagagaaa 53221 tggaaggaat tgcttaaatt tagataagta ttttgtcact ttaagtgata ttatttctaa 53281 aattatttct ctaagattat gaaaaaatgt gaaaaaatta agaggttaca atctttattt 53341 aagtttgtgt ttttatttaa agcaatatac tttaatctgg tttgaagaag gttactatct 53401 tcactcttcc tggtgcctcc ccctctccat tttcagatta gtccatttta ttgttatgac 53461 attatcaagt attataacat ttaaattcta ttttgtcact ataattattg catttggtta 53521 gtctcagttc tgtatttaaa tggattctaa ggttatcata actcctatat catagtttct 53581 ccatttctca gttctttatt tttattcatc tcttagctgg ctgtatttta caatcaacaa 53641 gtttttaaaa atatgctttt tggttccatg ttctttatgt ttgacatgtt taggaatcat 53701 gtctatagct tttatacttg aaggacagct ttgtcatgta ttctattctt gagtcatatt 53761 ttgtttccct caggacaata ttttctggca ttgaacattg ctgttgagaa agctgagact 53821 agcgtaattc cgctcctcat ttacttcctg aagaattttt tttttaattt gaggtttaat 53881 accttaaaca gcataacatt tcttgatttt gttattctgt attagctttt tttttttgaa 53941 caccatatcc ttttttaaac tgaaggttct gctgttcatt ttagaaaatc ttaacatact 54001 ctatttaaaa atattctatt cagctaggtt gttctatacc attgtatttt gggaacattt 54061 tcttatgttg gattgctttt attctcccta tttattgtct tctctcatat tattctgaat 54121 ttttaaatct ttttcttaca tattggctgt ttttatctca agcatttttt ccattttgag 54181 ctaaactatt tttgtaatgg tgttatttgg tcctcatctt ttttcttagc ttgtctattt 54241 ccttcctttt atataatttg ttcattaatt tcctttttat gatacatttt tattgaattg 54301 gatttataat agacgaatta ttcaataata tgtatttgga tcactaaaca aatatattga 54361 gaatagcaaa tggattttaa ccctcctgct aacccagatc aattgctagt ggcttcccgg 54421 cccattgtgt taagaaggac tctagattag catcaggtct ctacagggag gggtacactt 54481 agtgatgtga tggctgccaa gtaaaagcta tgaggtggtg atggtgatgc taccctctta 54541 tctcagaggc acgtttacat ttcatatagt acattacagg tgactaaagg tcatggttgt 54601 gcatcagaga gggtaagatg atgatcagga gaagcttcaa tattaccatt tatttattga 54661 
ttcaacagtt agttatttat tagaatttac aatgtctcaa gcatgatatt acgtgctagg 54721 ttaaagtggt ggaagatgta tatgttccct gctcagctag aacttacagt caagtaaggg 54781 agatagacat taaatgaaca atcacagaaa tacttaacta catttgcgat aaggaatgta 54841 aagggaaagt gtaatccccc aatgggttct tcctgctggc tgcacagaca aaatcaatcc 54901 accaagattg tggcattgca gtagagatag tttaattgac cactaggctg acccatgtgg 54961 gagaactgga gttatcactc aaatcagcct cagctggtta agttcttagt gcaagcactc 55021 atagggcatt ttcttctttt agattatctt tcagattggg ttctctgtct gatttttttc 55081 cccttcttat gttgtgattc ttttttgatt ttctatgttt acataatcac tgtgcttttc 55141 cccctcgtat tgctgggact gtatatgtgt aaccctgatt tgatgttctg ttttctctga 55201 tttaatgtga gtgaatatcc cttcactctg cacttaatct gtgacttgtt tatgttcccc 55261 aaagagctat ggtatgggag caggcatttt tattatatct gcttttagtc aggaggtaag 55321 ggagaaagta gataattcaa ctgtaatgat acaagccgtt gttctgagtt tttgttagaa 55381 tgaactttat aacagaacac ttctagtggc agttttatct tttttctata gcaggattgt 55441 tttgggattc aaattgtccc cagttcaaca cagtatgcag agtctctgtg ttgtcagaaa 55501 aataagctat agttgtttac ttcctccatt ttactcttct cttatcagca gtttctccct 55561 ccaagtcaga gaggcaaaag atgaggatac cgtttgagtt agtctcgctc agtcagtctc 55621 tcatttgata ttggtaccag cactttaata cctaataaaa tgaaagcaaa agaatgtact 55681 tgagagagat tctttcatct gacttcaccg tttttaccta gaaatccttg atatcgttct 55741 ttaccactat ggtattttaa aataacttat ttgtgaatat attgcagaca tattttttgc 55801 accataccat tgtatctaga aaacctataa agatgcaacg ttatctgttt tctcacaaag 55861 tttctgcaca tccagtgttt gctgagatta ttactggttt agcagtcaga tactcagaaa 55921 atagaaagtg aatcaaattg aacttctctt ttaaaaataa tgtatgacat ggagccataa 55981 gcaaatttgt aatagctgtg ttttatcatg atttgtatat tttatgtagc tacactattt 56041 gttttcttag gtatctgtta tatgcattta ttttctcaca gccatctagt tgtatatata 56101 ttctaataac tttaaagatt tatggtgaac aattatttat agatgacttt aaatttatat 56161 actaattttg ttatctctta atggttttgg ttattgtaag attttagagg ggttgaaatt 56221 aaaaatatat aaggtttaga tataaataat tggaataaaa tcataatttt ataaataaaa 56281 taattatttt aaatttttag ataatgagaa ttaattgaat cattggaatt 
agaataaatt 56341 ctaataaact tagtaatttt atataataaa tatacttata aaagtggaat taaaaatttg 56401 aattctaatt attataataa aatattatac aaatataaat aaaatattta taaaattaga 56461 attaaagatt agcatttatc atacttcaaa atctttctct ggagtaatac attctatagt 56521 tcttaaaatt agaaataaat ttaaaaagca aaatttatag tgattttaga cataaagaat 56581 taattataac agaaatactt aaatgtaaaa ttctcagagt agacttaaca cttgatttat 56641 aattccataa ctttacatat ttctatttta catattttat accttttaaa acagattttt 56701 ttttttttta caaaaaaaag aaagaaaatc tttaccatgc tgatagtgat tgttgaatga 56761 ataaatttat ggatatactc taccctgcga ttttctctca tacaaagatc tgaatctcta 56821 actttcttta aaaatgtaca tttttttttc aggggaagta ttacaaatgg aagatgatct 56881 ggtgatttca tttcagttaa tgctatgtgt ccttgactat tttattaaac tctcacctcc 56941 catgttgctc aaagaaccat atagtaagta tttaatttat gcccctttta ctttctcatt 57001 cagcagttgc ttattgaatg tctagtgggt accaaacatg gttctaaggc tgacaggatg 57061 ataaaaaata aatcagacat ggactttgcc cataagtagt gtaagttata gaaggaaaga 57121 taagacatgg aaacaaatga ttagagtata tggtagaaag tggtttgggt caaaatacaa 57181 caaatggagg tttgggagac aagaaagttt ttccagctgt ggaagaccaa agaataccat 57241 ataaaagaag tagtatttga aattaacatg gaagtgtggt taggattggg gaaaggcatt 57301 ctcagtaaag tgttggggga gggcattccc agtgaagaaa agcaagaaac aagaaaaaga 57361 tatgaatgaa attgcatggg gaagagtggc ctattctaag ggaacagctt aactaagttt 57421 ttattactat tggatgttgt aataactatg tatttttttg tatataagct ataactcata 57481 ctatttcagt gggtgatgag tgatgagatg ggaagtatat ttaaataaaa ataacttgct 57541 aatataatta gatagcatta aggaagtatg agtaaggtat attcctttta gcctagaatt 57601 gcttcctttc atggcttcaa gagtaggatt tcaaagtcac ttattatatt attatataca 57661 gaaaaatgta aacttactta aggtcagact aattccttgc ttctgaaatg atgaaagtat 57721 taatgctaaa aataatagat aacatggaag tttcaaagtg ttgttgaaat ggatggaagg 57781 atgaaggctc attaaacttt gtgaacaaga atttagtgga aaaattaagt atggagcaga 57841 attttgaaga tgtgttgaac atggtttggc agagaaggca gaaaccttca tttagcagaa 57901 accacatgca aagaaagaaa taaaaaaaat tctatataga aaacaaatat acttcaaatt 57961 ttctcccaaa ttttataaat ttcctttata 
taccctttcc ccactttttt caatgttata 58021 actttaataa gttgatgttg caagttactt taggttgaca atctttgttc ttttggataa 58081 ttcaaatgta catgtgaatt ctctaagtgc tctgttaaag agaaatcagc atggggtatt 58141 attgaactga ctgaatatgg taactcttag ttaatttttg ttgtagagaa tttcaaatga 58201 atacaaaaat agtagtataa taagcctcca tgtacttatc atgtaacttc aacagataat 58261 ttacagttaa tctgtagccc cattgcctgc tcagttcccc cagctgtatt attttgaagc 58321 acatctttgc ataccatttc atccttatac atctcagtat atatctctaa aaaatacttt 58381 ttaaaaagca taattacagt gctctttgtt atcacaccta caaatcaata aatagtgact 58441 ccttaatatt actgaatatc tagaaagtgc tcagattttc ttgtaaaaat aattaaaaaa 58501 tttttttaca gtttgaatga gaatctaaat aagatttata ctttgtgttt ggttgacatg 58561 ttttaaaaac tcttttaaaa gtttctcttc catcctttct ttttttcctt gtaatttatt 58621 ttttaaataa tcagattgtc ttgagttttc cacagtgtgg tttttgctga tcatattctg 58681 tggtatagtt taacatgttc ttctgtcctc tcaatttctt gtaagttgtt gattggatct 58741 agaggcaata catacttttt atacttaatc atctgtaatt tttttttttt tttttttgag 58801 gcagagtctc actcactctg tcgcccaggc ctggagttca gtggcgcgat ctcagctcac 58861 tgcaaactct gcctcccggg ttcaagcaat tctcctgctt cagcctcccg agtagctggg 58921 actatgggca tgcaccacca cacctggcta atttttgtat ttttagtaga gacggggttt 58981 caccgtgttg gccaggctgg tcttgaactc ctgacctcat gatccacctg ccttggcctc 59041 ccaaagtgct aggattacag gcgtgagcca ccacacccag ccatctgtaa tatttttaat 59101 agctgaacct gtatttagtt tggattggga atgctgaaaa taatccagga gatttttttt 59161 taaatcacag tttattatca tattattata tttactataa tatattttat acctatcttt 59221 taaagacatc ataatttact aaaacttaaa atattgtttt tatgctatgt ctagaaattg 59281 aattcagaaa tatttgctaa ttatttttaa gactctagga cagaaaactt ttttttcctc 59341 ttgcagacta aaatacgctt agatttagtt tgaaagttgg ctatttccat gccttctctt 59401 tgtatttgtt tatgagactg tagtttacag ttctttttgg gagcagagta gaagagggat 59461 ggcaaaaact aatattagta cataatttgt agtagatatg gatgaaattg ttatccttct 59521 aatgaaacct aataagtaaa agtagtagaa tgttaccaag attatttttg acctaagtta 59581 tagttagaat acttcattat tttatatgat ggatgtacaa ttgttcttat ctaatttacc 59641 acttttacag 
aaacagctgt tatacccatt aatggttcac ctcgaacacc caggcgaggt 59701 cagaacagga gtgcacggat agcaaaacaa ctagaaaatg atacaagaat tattgaagtt 59761 ctctgtaaag aacatgaatg taatatagat gaggtaattt aacttcatga tttctttaaa 59821 acagttaaag tagatttaga tgtaagttct ccctaacaat atttacttct tttgttatga 59881 gcatgttttt tttgtaatta gtgctaactc ttttgcagta gcaaaatatt tagaaaaaat 59941 taattcgtta tatttagtta ctttgatttt aaagagagta gctccctcac tctggaatca 60001 ctgaaaagta tataatcaag acaatagttt tcaaatgtag tttttttttt tttttattcc 60061 tctgaggtta ctgatacccc taatgtttta aaataccctg ccaaggctgg gcatggtggc 60121 tcacacctgt aatcccagct ctttgggagg ccgaagtggg aggatttctt gaggccagga 60181 ggtctagacc agcctgaaca acatagcgag actctgtctc tacaaaatac atatttaaaa 60241 aaatgaactg ggcatggtgg catgtgcctg tagtcccagc tacttggaag actgaggcaa 60301 aaggattgct tgagcccaga agtttgaggt tacagtgaga catgattgca ccactgcact 60361 ccagcccggg cgccagagtg agactatgtc tccaaaaacg aacaaacaaa ccaaaaaagc 60421 cctgcttaga aaactctagt gctctagtgt tttagtttag tgataagtga taccaacatt 60481 aaatttctgt ttggccttct ataattactc tctttctaaa tattacagca cataatgact 60541 gctggagaat acatatgaat tcccaggaat tacttgtctt gttcatagaa tgtggatgtt 60601 agaaggaatc ttacaaataa tgtattcaat ctccttttta taatgagtaa acaagcttcc 60661 agggactgat agcttattat ttacagcgtc tacattagac tttgggatat ctggtactat 60721 tgcttctttc ctttttacca tacagttttt ctcatagttt gttttattgt ccacaatcac 60781 taacaccaga tagcagtaaa aaatgttaca tctttgaaat ttctctgtta actttgggaa 60841 ggagtgagag tgtggtccta aaatctagtt taagaagtat tagaatgatt tatcaagata 60901 tatcatagtt tttgcactat ggcttactaa tttcaacaaa tatacagttg actcttgaac 60961 aacaggtttg aattgcatgg gtccgtttat acacagatat atttaattac atatattaga 61021 aaaaattttg gagatttgca ataatttgaa aaaatttgta gcaaaccgtg tagcctacaa 61081 atacagaaaa aattaagaaa aaggcatgtc ttgaatgtac aaaatatatg tagatactag 61141 tctattttat catttactat accataaaat acatacaaaa ctgtagctgc agatgcctgc 61201 tgtttcaggc ggacgatttt tgttaacaga tgacataaac ttactgtaaa tgtattttgt 61261 cttcttacaa ttttcctaat attttctttt ctctatcttt atcataagaa gatgatatat 
61321 ttagtatgta tatatagtac atatacaaaa tatatgtgaa ttcaatcttt atgttattgg 61381 ttggcttcca gtcaacagta ggctgttagt taaattttgg gtactcaaaa gtaatcgata 61441 ggtctgtagt ctcagctact tgggaggctg aggtggaagg atggcttgag cccaagcatt 61501 tgaagctgta atgcatgtga ttgcacctgt gaatagccac tacacttcag cctaggcaat 61561 atagagagac cccttctcta agaaaataat aaaaaataaa aaagttatac acagattttt 61621 tactgcatgg gggattgaca cctctaactt accctgcatt gttcaagagt caagagatta 61681 gattttgttt taaattttaa tgatcatgtt gtaacttcat ctttttcagg tgaaaaatgt 61741 ttatttcaaa aattttatac cttttatgaa ttctcttgga cttgtaacat ctaatggact 61801 tccagaggta atctgaaagg aaatttaata aaatattaat gttttgagac tgtggaggga 61861 ggataattgt ctaactttct tagatcaatt tactgtgtat cacatttttt ttttgcccaa 61921 gaagaatcta gccaagtaga attgtggtga aactaacttt tgtatagtaa caaaaagctt 61981 ttgagtatat aatcaaatgt attagcatca ttttgtgatt atactagaaa tcaatcttta 62041 ttgaattttt tccaaaatca aagtatagac ttttaaactt ctaaatacca ttaaatagct 62101 aaattttaat agctaaatat aaaccattaa atttgctttg gtaagcctca ttaagcgtag 62161 gaaccatgat taatgccatc aacattattg gcattactac attgtggaac atctcttttg 62221 aggattttta agaactgctt aattgaaaat acaaatgtat ttgttgaaat gactgtgact 62281 attgcagaag tgttttacct tagtgatatt ccatccatct acctgttttg tttcgcattt 62341 tgctacaaat gtctttccaa tgagaacaga ataactgaga gctttgcatg gtaagacgaa 62401 atatgtggag aaaaggatgc tgtgatagat gatgaagaac aaaaccagtg ttttagaaga 62461 catagtatta gtcactatga ccaaaaggca tgaaataaaa tagctagact gctagttctg 62521 taacctgaat tgatagctag tttttgaatt aatttattaa ttcaacaaat attttttgaa 62581 tgcttaatat atgtcaggca cgtgctagat actatggata cagagatttc actctcagtc 62641 actatcctta aagtgctagt gggagaggct gacaaagaaa tagagcatta caggagtcta 62701 caagagtgtg aaaaaagtat tcacagggtt ctgtgatatc tcaaaggaga gctacctaat 62761 ccaaactagg gctgataggt tttgtggaaa atagtttaga ggtgatgtga tttcaaatct 62821 gagtaagcat tacagatata gcatttccaa gggttggaca ttgtagattc aaaggtgcag 62881 tgatgtatga tagtatggtc tgatcagtga actctaagta attcattaag gttgggatac 62941 agggcagaac aattaaaata tggtactgaa gaggcaagaa 
gggaccaaat gttatattaa 63001 tgagtcttgt taccactttt ggtaccaaaa actaggctgt tcataacttt aaaatctgtt 63061 gtagcaatct ttttttctcc tttaaaagga ttatagtttt aatataaatc ttaagagtaa 63121 ttttactttt tctccaatat ttcacataat atagccaatc aatagatgac tggtaattag 63181 tgcctcaact aaggtaggac aggaacaatg attaaacata agcatggatt actgttctat 63241 ctaatacatt tatgaagatt aatgaaaagt ttgcatttaa ttggatttct tcctatcact 63301 ccttttaaaa tgcaatttta cttctctgag gttggaatca ctttggcgtt aaaagtcaca 63361 gtagaaggca tgccaatagt tttctcttgg gcattggttt ttaaagtata attaaaaaat 63421 tatggatcag aactgatgaa agatttacaa attctagcaa ttgaaaaagt atctaagatt 63481 gggaataaat tgtgcaagtg tgttaattga actgataaat tgagctaatt gtcttgccca 63541 agatacttaa ctgaagggta ctggagtgaa ccaaagggtt tagaaaacat tggtagaata 63601 tgtaatcaag aggatttgca acaatgtttg aaaaaatatt ttatattgat aagcataaaa 63661 gattaaaagc aggctgggcg cggtggctca cgcctgtaat tctagtactt tgggaggccg 63721 aggcaggtgg atcacctgag gtcaggagtt caagaccagc ttggccaaca tggcgaaacc 63781 ccgtctctac taaaaataca aaaattagcc gggcatggta gcaggtacct ataatcccag 63841 ctactcggga ggctgaggca ggaaagttgc ttgaaccccg ggggcagagg tggcagtgag 63901 ccgagattgt gccactgcac tccagcctgg gcgaaagagc gaaactccat ctcaaaaaaa 63961 aaaaaaaaaa aaaagattaa aagtaaagca tatatgggat gtttggaaaa tcttggcagt 64021 taaacagtta ttataatacc tatttttatc ttaggtactg ttttgtagca ttggctatct 64081 ttgtctacat aaaattctaa taaatatttt ctatgcacga aatagaccta aaatcaaagt 64141 tgaacaaatg ttgcaatttt ctgtacctca cttttagata gaccttattt atattgcatg 64201 cgaactcagt gtatattaca aaattaaatg tatattatac aaaaattctt taatgaaatc 64261 tgtgcctctg tgtgctgaga gatgtaatga catgtaaagg ataattgtca gtgacttttt 64321 tctttcaagg ttgaaaatct ttctaaacga tacgaagaaa tttatcttaa aaataaagat 64381 ctagatgcaa gattattttt ggatcatgat aaaactcttc agactgattc tatagacagg 64441 tattgcacat ggtatatttg attgatttgc tttagatata ggttgatact gatataggta 64501 gattatatag tctttagctt agtgaccttt agatatcatt tataacaaat tactttcaaa 64561 tgtctttata caaagaaaag tttaacagta ttttaagcat ataacttatc tacaaatata 64621 gatttaatgt gaattgtgtg 
tcctataaca gttacctttt tgcagttaac tgaatataat 64681 tttttaaaat gtgcaccaaa agataatggc taaagtaatt agtttcttca aatagtggtc 64741 tctaccttat ttctatcctt tcatatacta ttgcctgcct ggtttattta aaaaagttta 64801 tacaaaagtg tagtctctgt tattatcatg tagaaaccta aaattggaat ggttgagagt 64861 gattgttaaa atttttgact gatcataagt aaagttaatt ttgtgtcttt agcatgaact 64921 cttttaatat gaaactgaaa atatgcgtaa tttagaagtt aaacacagta aaatgtttta 64981 aattttattt aagtggaaat aaagttttta gaaggaataa tgcattgaaa aacttgagtt 65041 ttaatgcttt ttaatactga acaacttggt tatcaatacc accagggaga aagcatctga 65101 ctttcacttt taaaaaaaga cttaatgatt ggtatacctc tttgtcataa acataatgga 65161 aagagaccca caattaaaaa gagtagtgaa aaatatttta tttaagcagc agctgggtca 65221 atctattttc tatcctatct attattgagt tatcatttta tatgatttta tgagacaaca 65281 gaagcattat actgcttttt tgatgcataa agcacaaatt gtaaattttc agtatgtgaa 65341 tgacttcact tattgttatt tagttttgaa acacagagaa caccacgaaa aagtaacctt 65401 gatgaagagg tgaatgtaat tcctccacac actccagtta ggtatgaatt ttcctacttt 65461 taattatatt ataattttgt tattcatggc tttatagtgt ttcagatttg ttcacgtttc 65521 tttatgtatt catacataca tgtaagaaat atatattgaa ggccaggtgt ggtggatcac 65581 acctgtaatc ccagcacttt gggaggccaa ggcgggcaga tcacctgagg ttaggagttt 65641 gagaccggcc tggccaacat ggtgaaaccc cgtctctact agaaatacaa aaattagctg 65701 ggggtggtgg tgtgtgcctg taatcccagc tgctcgggag gctgaggcag gagaattgct 65761 tgaacccggg aggcggaggt tgcagagagc tgagatcgtg ccattgcact tcagcctggg 65821 tgacaagagc gaaactccgt ctcaaaaaaa aaatgtgtat atataatctt tgtgtgctgg 65881 gcattttact ggttgctgaa gatatggtta ttggacctag ttttgtctgt tcttaaagaa 65941 cataacctaa aataggaatg tgttccattt cagtaaataa aaaagaaaaa attaaagtat 66001 cactatttga tctgcatgca accttaaagg attagaacta ttttttatgt ctttttcctc 66061 actcatttcc cacatcaaaa tctgttatca aggcttgttg aataatctac ctccaaaata 66121 tttcctgagt gcatctactt ttctccattt ctgctctcac aactcttgtc caagacattg 66181 ttctctttct cttggatttt ttgcagtagc ctactacctg atcttgctcc ttctggtctc 66241 atcattgttc ctgtctcctc accatcaaag atattctcca tagagccccc agaatttctc 66301 
ctagaaatgt aaatcagtta cattcacctt gctgttaaac ccactgactg cttcccattg 66361 caattggtat acagtgcaaa ttccttaaca atgcctaaaa gatcttatat ggtctggttt 66421 ctgttgcctc tccaacctca tcttgttttc tttctcactc tgtgttagcc acactttggc 66481 cttctctgtt tcttgggctt tccaagtttt ttgaaaaccc tcgtacttga tattccctct 66541 tcttgaaatt tcctcctttt ctttccatgg attaccaact gctactcttc attcagcaca 66601 aatattgctt ttgaaaagat atttctctct gaagtatttt attatttctc atatcatcct 66661 gttccattcc tttatagcat gtatcccaat ttatattgat agattcattt gtgtttgctt 66721 gcttttacta atacccaaaa tagccttaac attccatatc acagagacca taaccttttt 66781 gttcactgtc taataattag ggcctgttat aaaacttggt acatagtagg caatcagtaa 66841 tatatttcaa atgactgtat agatatgtaa atggatacat tttcatataa gtatagtata 66901 tactttggac aagataatga gatgacacag aagggactat aaagcaatgt ctgaactgaa 66961 ttttgaaaca tgcaaaggcc tgtgtcacag gataaagatt ttagggagtg ggaacagcat 67021 gttcaaaggg ctggaattgt gaaacatgta agtggagaaa accctgctgt cacaccaccg 67081 acaacaatca acacagaaga cttttgtgac caaatgtggg aaagttttcc tcaccaagaa 67141 acaagcaatc agttctgcag cagtcaccag ctgggtgtcc tctaatttac ttctgacact 67201 gcttacctgg agatagtgtc agatcccaca ggttgagggc tcagtcccca aaactggccc 67261 caacttcaga ttgccagttg caagtctggg tttttggatc ttctgaccaa agggcttcaa 67321 gttggggttc ccatgacacc tcttaggttc gattaatttg ctagagtagt tcacaaaact 67381 caaggaaaca cttctgttta ctggtttatt atagatgata ttacaaagga tacagatgta 67441 aagatgcata ggatgaggta tgggggatgg gatgggaagc ttctatgccc tccctggcca 67501 gacagctgtc caggagcctc cacatgttca gctatccgga agttctctga accctgccct 67561 tttgggtttt tatggaggcc tcattgcata ggcatgacag attagatcat tggctattga 67621 ttatcaacct aaccttcagc ttccctcccc ttcctagagg ttgggggctg gggctgaagg 67681 ttccaaccct ctaactctgc ctcggctttc tagtgaccag cccccatcca gatgcttcct 67741 aggggctgcc agccatgtca attcattagc acacaaaagg acatcactgg agtttctaag 67801 tattttacga gttgtatgtc aggatatggg gtcaaagacc aaacatctat ttcacaatat 67861 cacacaacac aatgtgtctg gggaactatt agcagctctg tattgattga ctataaaatg 67921 agagtaggaa gtcacaggag aggagagtgg gtactagata tgaagggcct 
tatatgccat 67981 actcacaaat ttggagtgta gcatgtgcat gatatagcct gtgctataaa tggatgttaa 68041 gcaggggtaa ggataaaatc agacataatt tttgcaagtt cacatggtaa tagtcagtgg 68101 ggggatagac tgaggagaat aagaaaaagt gagaaataga cctgtcagtc tgtctcagtg 68161 gttcttaacc atgtgtgcaa attagaataa cctgaggaat tttgttttgt tttttctttt 68221 taaaaatttt gtgtcccatt ccactgagga atatgcctag gccttactac cacagacata 68281 gtgtataaaa atctctgggc ttgtctgttg tagcagttta cttgagagtt tttaaagatc 68341 tgaaccaacg atagtggtgg gaatgaagga acaataacag atttgataaa tatttgatga 68401 attcatactt agaatgttgt tcttgaggca tttctgaaga agatatctag taagcagtag 68461 gatagccttc tatagatttg ggaatcatca ggggagttaa acccataaaa aaaaatgaga 68521 ttgcttgaag ggagggtgca taatggcaag agtgggtggc taagggctga agcctaaagg 68581 atgctagcat taaggactag acagaaattg ttaggttcta gagcaggggt ccccaattcc 68641 cgggccagag actggtctgt gtcccattac gaactgggtg atacagcagg aggtgagtgg 68701 tagacaggca agtgagcatt actgcctgag atctacctcc tgtcagatca gcagtggaat 68761 tagattctca taggtgcatg aaccttattg tgaactgcgc atgtgaggga tctagattgt 68821 atgctcctta tgagaattta atggctgatg atttgaggta gaacagttca aaaccattcc 68881 ccctactctc catggtccat ggaaaaattg tcttccatga aaccagtccc tgttgccaaa 68941 aaggttgggg actgctgttc tagagaatgg gaaagtatga ttggggagtg cagagattga 69001 agcgaagaaa gctgagggag ggagggagtt tcattaagag caagtatttc tctaacagtg 69061 tcaaatgctt caaggaaggt caagtattag aacaaatgag aaaagtcttt tgtttttatc 69121 agttggtcat ttgtggcttt aatcactgta gcttaaatat gattagtgga cacagaagca 69181 tatagaagct gtgttgcaat atatttgaaa gacgaggctg ggtgtggtgg ctcacgcctg 69241 taatcccact ttgggaggcc aaggtgggcg gatcacctga ggacaggagt tcgaaaccag 69301 cctgggcaac atggcaaaac cccgtctcta ctaaaaatac aaaaattagc ccagctgggc 69361 acctgtaatc ccagataatc aggaggctga ggcaggagaa tcacttgaac ccgggaagtg 69421 gaggttgcag tgagccaaga ttgcgccact gcactccagc ctgggcgaca gagcaaggct 69481 ccttctcaaa aaaaaaaaag aaaaagaaaa gaaagatgag gaagtgtaaa tactggttta 69541 gacatgctta gctgagaaca ctgagcatat tttactaaat aatagttgag gatcttttca 69601 gccctaaaca ttctacagtt ctttgcataa 
ttaaagtgtc ccaaatgttc agctactcca 69661 aaattgtaac attgatattt accatatact gcccccttca taatacacac acataaaatt 69721 atagaattaa caagatctta agagattgac ctaattttat gctgtgaaag tggtgatttt 69781 taaatgtatc caggatctta gtatcctttc ttcttgaagg tctctgtaaa gaaaaatcat 69841 ctgattcttt acttgagttg ctaagtatct tattgtcttt aatttgaaaa tgtttaatat 69901 ttggttgtaa taatgttttc ttaaaacaat accattttgt tgccagttta tatagtttct 69961 cctaaaaata atgccactat tttattgata tgtagtttta ttagtaaata agtatatctg 70021 ttctataact ataaacttat tgattgtgaa tacatatttt cttaaagatt taagtaaaat 70081 gtaatttctt ataaaccaca gtcttatttg agggaatgta gagacaagtg ggaggcagtg 70141 tatttgaaga tacatttaac ttgggagatt gaaaacattt cattttttct ttttttctcc 70201 cttcattgct taacacattt tcctattttt atcccctcta ggactgttat gaacactatc 70261 caacaattaa tgatgatttt aaattcagca agtgatcaac cttcagaaaa tctgatttcc 70321 tattttaacg taagccatat atgaaacatt atttattgta atatcttggc aaagaaactt 70381 gaaattaaaa gttaaagtac tgagttcttt ttaaaatact aatctcctat ctaacatgta 70441 gttatccata atcttttctt gcttttttaa tcttacaaat tatatattat tagtagtatt 70501 gttttattta tacagtgtta tttaaaacat ttttatgttt acctatttgc cttgctcacc 70561 attcttcctt cgaacttatg cctcacttct gagataattt tttcttcttc agatatatcc 70621 tttgataatt actttagtgg tattaatttg ggatgaactt tttgaggttt tattttgttt 70681 tttgatctga aattttcttt acccttattt ttttgtgaga tagctttgct cgatcttcaa 70741 ttccaggttg acatttactt tttagactct tataagtttg aagataatac attacctgga 70801 tcccattttt gctaattaag aagtcttatg ccactcaatc taactgttgt tcctttgtag 70861 ttaatctgtc tttttctctc tggctacttt taagatcttt tctttgaatg tagtattaga 70921 taggacttaa cattccccat acccctacaa tctttttcat ccaatattta acccattttt 70981 atttacaatt cttacctggc cctgttagca ttatagtttg caactcctga tagacaataa 71041 gtttaatatc tgaaggcatc tgataccctc tctcccctaa agacaaggag aaagaaagag 71101 agagaatgaa actatcatct atattgcttg cttgctcctt cactccttac ttaaactcag 71161 atgaggattg ggacccactg aaatgctcat ccactaaaat tggacatagg gtcaggagac 71221 ttattaaaat ttggcttatt tcagcaactg tgctttttgt acttgttagc aattcattca 71281 ttgccataac 
ctgcctgtga ttttgtcata atctgttgtc atctcttgct atgttccccc 71341 aggaattagc aaagtaattt ctgtcatcat gttcgtaatt gttttggctc agaattgatt 71401 ttgtttgtcc aatatgcata aagtcgcttt tgtttatgag ttatctatag cacaagaata 71461 tatgccaaaa catatgtacc ccaattacct aaacctttta gcaagggcta atgaaagaat 71521 ttaaaagaat gataagccct ttcttctcat atcattttga atacttcctc caatatttgg 71581 agtatgtagg agttcttttt tatttaaaaa attctgcctc aaagactttt ttttaaactt 71641 ggaaactata tgatcttttt ttatgttttt atacagtttt tcaaccctcg ctctcttcct 71701 tccctcctcc ctctagtagt ctccagtgtc tattgctgcc atctttatgt ccatgagtac 71761 ccgatattta gctcccactc attagtgaga acatgcagta tttggttttc tttttctgta 71821 ttaatttgtt taggataatg acctccagct acatccatgt ggctgcaaag ggcataattt 71881 cattcttttt ttatgactgt atagtactcc atagtgtata agtaccacat tttctttatc 71941 caatccacca ctgaagggca cctaggttga ttccatgtct ttgctattgt gaatagcata 72001 ccagtgaaca tatgagtgca tgtgaaagac attttctaga gctcaattta aagcttaact 72061 tttaatgaag cttaaaagct taattatgta gatgataagt tgcatatgtc aaagtgaaca 72121 tgtcttgaag gtcattatgt ttttccttca ttttacttcc ttattagttc aaatttcttc 72181 atttttaaaa ttaatgccac cttccctcca tgtaaaatct tagaagctat gaacatttta 72241 atggacattt tggcaacaat tcttttatga caggaaatgc ttgtttgtta tatgtttcta 72301 agtttgtgtg ctgttttttt tttttaggaa tatagaattg ctgatattct agaatgtgaa 72361 aatgttgcca gacaatgcac attacaatga aaattataat cttattttta tttatatata 72421 tatatatttt tgagacagag tttcactctg ttgcctaggc tggagtgaaa tggcatgata 72481 attggctcac tacaacctcc atctcctggg ttcaagcgat tcttgtgact cagcctccca 72541 agtagctggg attgcagaag cctgctacca tgtccagcta attattgtaa tttttgtgga 72601 gatggtgttt cacaatgttt cccaggctgg tctcaaactc cagacctcag gtgatctgct 72661 cacttcagtt ttccaaagtg ccgggattac agtcatgagc cactgtgcct ggccataatc 72721 ttaattttag aaactttatt tttgaatgta tacattgaat gtgttttatg tataaaggat 72781 aaaaaataga ataaatcttt gtagctgatg agaatcatgc tgtgattact ttttatatat 72841 cataatatga attatataat attatataat atgcatttta atacatcatc ttgtgatgtg 72901 ttttaattat taaagttgaa tagccccact ccaaatgata catgtattgg aacttctttt 
72961 gcaacaattt ccctatcttg tatttctgaa aatgtaatca aatgatttag ctaattcata 73021 actctttcaa agcacgtttt aaaattagga aatggtccgg gcatggtggc tcacacctgt 73081 aatcccagca ctttgggagg ccaaggtggt tggatcacct gaaatcagga gttcgagacc 73141 aggctggtca acatggcaaa ccccatctct actaaagaga caaaaattag tgaggcgtgg 73201 tgcgcttgcc tataatccca gctacttggg atgctgaggc aggagaatca cttgaacttg 73261 ggaggcggag gttgcagtaa gccgagattg cgccactgca ctccagcctg ggcgacagag 73321 tgacacttca tctcaaaaaa aaaaaaaaaa aaaaattagt aaatgatacc aagctcaagc 73381 cttggtttta tttcatttgc tttttcggaa gcaacataca tcccaatata actcttaatt 73441 gaaggtattt tatgagaatg tatgaggctc tttctgatgg atggaaagct taacaaattt 73501 gctcttctct agcctagtgg cagaaaattt agataatagg gttttttagt tgtactgtag 73561 tattttttgc tcattaacat ccagtgaaat gatattgtct gcttatgttc agtagttgtg 73621 gttacctagt tattatggaa gtgtttccac atttttatga acaatttaaa aagtcatata 73681 ttatggagca gaaaatatta attctgatta cacagtatcc tcgacattga tttctgtttt 73741 tacctcctaa agaactgcac agtgaatcca aaagaaagta tactgaaaag agtgaaggat 73801 ataggataca tctttaaaga gaaatttgct aaagctgtgg gacagggttg tgtcgaaatt 73861 ggatcacagg taacttgaat tcattgtaat tcgtggtact atagagtaat aatattaaaa 73921 gcagcatctt tccagttcgt ataaatactc taacagtatt tgtctagtag tataaaatac 73981 tgtcagatac tatatccctg ctgcctgtgt atgctgctat ttatgggtaa ctttatggaa 74041 aactacctcc caccccatta taaaaactat gtaataaagg aacacatagc cattgtagaa 74101 attttggaaa atacaactaa gaaaaaaatt gaaatctttg atattaaatt tttgattttg 74161 ctttgacttc tacatgtatg actgttcatg tatgcatatc ctaatgtgaa tgggactata 74221 catgccattt gtcagtgata ttattgagtg atattatttc aggaaatcta aagctaatat 74281 aataggtaag agtatgggcc atggagtgac agccctgggt ttatatttgg ttctaccatt 74341 gttgatcttg gacatgtcat ttaacttctc tgagattcag gttctatatt tgctctgctt 74401 tgttgtgtgg cattaatgag gataatatac aaaaatatat tggaaaactg gagtccactg 74461 tacacattta tttcagtgta gtgaggttat attttgatgc catcagtgtt ttaaacattg 74521 atatttaatg gttctttttg attctacagc catgcaccac ttaacaggga tatgttctga 74581 caaatgcatc attaggcaat tttgttgtgt gaacctggta 
gagtgtgtct gtacaaacct 74641 agatggtata gccgattgta catctaagct gtatgatata gcctattcat tcctaggcta 74701 caggcctcta cagcatgtgt actgaatact gtaggtaatg gtaacataat gtgttaccat 74761 ttgtgtatct aaatatatcc aaacatagaa aaggtaaaga tatagtacaa aagataaaaa 74821 atggaacatt tgcacagagc acttactatg aaaggagctt acaggattga aagttgctct 74881 gaatgagtca gtgagtcgag tggcaagtga atgtgaaggc ctacaatatt acctatgaca 74941 tgactgtaca ctatcgtaga ctttataaat actacactta ggctaaacta atttcataaa 75001 cattttttct ttcttcagta gtaaattaac catagctcac tgtaactttt ttactttata 75061 aatcttcagt ttgtaagaaa cttttgactc ttataacact tagcttaaaa cacaaacacc 75121 ttatacagct gtacaaaaat attttctttc tttataatcc ttattctgta cacttttttc 75181 tatttaaaaa ttttttcaat ttcataaaga tttttgccaa aaacaaaggc aaacacacac 75241 attagcctag gcctacacag gatcaggatt aacaatacca ctgtcttctg cctccaaatt 75301 ctgtcccaag tcttcaggga caataaaatg catagagctg tcaactccta tgatgatgat 75361 gctatcttct ggaatatctc ctgaaggagc tgcctgaggg tgttttacat ttaacttttt 75421 ttttttttta caagtagaag gaggatgctc taaactaata aaaagtatag tataataaat 75481 acataaacca gtaacatgtt cattattaag tattatgtac tgtacataat tgtatgtact 75541 atacttttat atgactggca gtgcaatagg tttatttaca tcagcatcat cacaaagatg 75601 tgagtaatgt aatgcattgc actacattac actaaacaat aggaattttt cagctccatt 75661 ataatcttat gggactactg ttagatatgc agtcagttat tgaccaaaaa cgttgttatg 75721 cggtgcatga ttatatatga gatttttgct cttagtttgg aaggaaataa tatggtgtgt 75781 gtttatattt ctaaaatttg gttattcttt caaataatgg gaactagtta tttggctaaa 75841 acttcttgct cataaggtca gtttgatctg atttttgcct gtgtcagttt tactaaattt 75901 atggacaaag actgtcctct taatcctgga cagctatctt gaaaatacat gtcctaggtc 75961 atgagaaatg ttatataagg tacataaagt gcttaaaata gtactgggca cataacctat 76021 ctttagtatg aatgatataa actgaaatgg agttaaggaa atccaggtac tggacctacc 76081 ctcttgttaa tttacttggg aatgttaatc accacttaat acttaagttg tgagttttag 76141 acaagctagc ttttgtgttg tcttggcggc catatttgta agaagggtga gaagtatgtt 76201 ttaagaaaag gctttttaaa aaattttagt aattgtcagc tgggtatagt ggtacatgcc 76261 tataatccca gcctcttggg 
aggccaaagc aggaggatct cttgagccca ggagtgtgaa 76321 ggccagcctg ggcaaaacag tgagactcca tctcaaaaaa aaaaaaaaat ttcataattg 76381 tgattttcta aaatagcagg ctcttatttt tctttttgtt tgtttgtagc gatacaaact 76441 tggagttcgc ttgtattacc gagtaatgga atccatgctt aaatcagtaa gttaaaaaca 76501 atataaaaaa atttcagccg ggcgcggtgg ctcacgcctg caatcccagc actttgggag 76561 gccgaggtgg gcagatcagg aggtcaaggc atcaagatca tcctggccaa aatggtgaaa 76621 ccctgtctct actaaaagta caaaaattag ctgggcgtgg tggtgtagac ctgtagtccc 76681 agctacttgg caggctgagg caggagaatc ccttgaacca cggaggtgga ggttgcagtg 76741 agccaagatt gtgccatttc accccagcct ggcaacagag caagacacca tctaaaaaaa 76801 aaaaaaaaaa aaaaaaaaat tcaatgctga cacaaataag gtttcaatta aacaacttct 76861 tttttttttt ttaaattatc tgtttcagga agaagaacga ttatccattc aaaattttag 76921 gtaaattttt tacttttagt aaaaaatttt tttcttttta tagaagtaag tattttataa 76981 tctttttttt tttcctttag caaacttctg aatgacaaca tttttcatat gtctttattg 77041 gcgtgcgctc ttgaggttgt aatggccaca tatagcagta agttaaattt tcataaataa 77101 acacttttgt tcaatttaaa gttaaaatgt ggtgtgtttc tttggtcggg ggagagggat 77161 agtgtgaggt taaggagaag gaatgcttat tttagatcac tatatactga agaatgtaat 77221 tggtcattat aagccattta agaggcttat ttgagttatt tgaggccatc ttggggataa 77281 tatttcacta ggcttctctt ctgagtatac tggtatactg aatccaaaaa aggtactttt 77341 tgaaatccct ccgaagacct ttgagattgt agagtgccaa aggagtattc aagagtggcc 77401 tctactgata tggaagcact gctgtaacct ccatggttat gaaactaaag attgaagcta 77461 gaatccactg gataaacaga acaaaggcca attacaatgt ttttaaaaac ataaatgtat 77521 atcactttgc aaattatgtt gcaaatgtta aaagaaagtt ttgcttgcag aaacaaaagg 77581 aacatttgta taaggatgca aaacacagaa aaagaggaac ttttagtttc cttattagta 77641 aacagataag gaaaactgtg aatcctcatt tataaccaag aaatgcatat tttaatgaga 77701 taccattttg taactaatga aattagcaga atcttgaaga atctgtctat cctcttacca 77761 aggattaatg agaattaaaa tagatatgcc aatggctgat aagatactaa acctgttcct 77821 gatataagtt tagggtattt aaatctttga aaatttgaga tcagctataa gtcctttctc 77881 taggaaaaac acagatttgc atacactcaa aattggaagg ctatttccta tgagtccgta 77941 
gactccaaaa taaaaaattc tgctctaaat aaaaatggtt taacctttct actgttttct 78001 ttgtctgata ataacttcca aaaaaatacc tagctcaagg gttaatattt cataaatagt 78061 tacttttttt tttcattttt aggaagtaca tctcagaatc ttgattctgg aacagatttg 78121 tctttcccat ggattctgaa tgtgcttaat ttaaaagcct ttgattttta caaagtgatc 78181 gaaagtttta tcaaagcaga aggcaacttg acaagagaaa tgataaaaca tttagaacga 78241 tgtgaacatc gaatcatgga atcccttgca tggctctcag taagtagcta aataattgaa 78301 gaaattcatt catgtgcata tggctaacaa attattgtta gtgagaggtg tttcttaaca 78361 aatctacctc aagaacaaat agggaattta atgaataatg ttatttcagt ctatagccca 78421 aggatcaagt ggaatattag aatggagctt taatcgagca ccctaaacca tctaatacag 78481 cacagtgatt tatttaagaa tagcttttct taaaacatgc cactttaaaa caaaacggat 78541 ttttttttta tactttaagt tctagggtac atgtgcacaa cgtgcaggtt tgttacatat 78601 gtatacatgt gccatattgg tgtgctgcac ccattaactt gtcatttaca ttagatatat 78661 ctcctaatgc tatccctccc tcctctccct accccatgac aggccccagt gtgtgatgtt 78721 ccccaccctg tgtccacgtg ttattgttca tttcccacct atgagtgaga acatgcagtg 78781 tttggttttc tgtcttggca atagtttgct cagaatgatg gtttccagct tcatccatgt 78841 ccctacaaag gacatgaact catccttctt tatggctgca tagtatatat gtgccacata 78901 ttccatggta tatatgtgcc acattttctt actccagtct attattgttg gacatttggg 78961 ttgcttccaa gtctttgcta ttgtgaatag tgccacaata aacatacgtg tgcatgtgtc 79021 tttatagcag catgatttat aatcctttgg gtatataccc agtaatgaga tggctgggtc 79081 aaatgctatt tttggtttta gatccttgag gaatcgccac actgttgaac tagtttacag 79141 tcccaccaac agtgtaaaag tgttcctatt tcttcacatc ctctccagca tctgttgttt 79201 cctgactttt taatgatcgc cattctaact ggtgtgagat ggtatttcat tgtggttttg 79261 atttgcattt ctctgatggc cagtgatgat gagcattttt tcatgtgtct tttggctgca 79321 taaatgtctt cttttgagaa gtgtctgttc atatctttca cccacttttt gatggggttg 79381 attttttctt gtaaatttgt ttgagttctt tgtagattct ggatattagc cctttgtcag 79441 atgggtagat tgtaaaagtt ttctcccatt ctgtaggttg cctgttcact ctgatcgtag 79501 tttcttttgc tgtgcagaag ctctttagtt taattagatc ccatttgtca attttggctt 79561 ttgttgctat tgcttttggt gttttagtca tgaagtcctt gcccatgccg 
atgtcctgaa 79621 tggtattgcc taggttttct tctagggttt ttatggtttt aggtctaaca tttaagtctt 79681 taatccacct tgaattaatt tttgtataag atgtaaggaa gggatccagt ttcagctttc 79741 tacatatggc tggccagttt tcccagcacc atttattaaa tagggaatcc ttactccatt 79801 tcttgttttt gtcaggtttg tcaaagatca gatggttgta gatgtgtggt attatttctg 79861 agggccctgt tctaaaccaa aacagaattc tttaaagtat ttagataatt tatttaaagt 79921 acagttgtta tatatttgaa aatgcaaatg tctgctctta aattgtgaga ttatgctatt 79981 tagattatta aaaagtgcat ttttttcttg ttaaatatag taggattcat tttttcccat 80041 tattaaagta taaaaatgta tactattgtt caattatagg tattggttca atcaatatag 80101 gggcacaaat tatattgtct aggattctct ttatagcatt ataactattg ataaggcttt 80161 tgttttaata aaagtggcaa cttgaggtta tgctcttcaa atcaacatag ttggataata 80221 ctttgtgaat catcagtttt gtaaattaat ggtttctttt gaaactatgt tgcatgctaa 80281 gtatatatct tttaatttgt atcagtatat agaagtggta attaaaaaat taagacttta 80341 ggtatgccat aatcctaatt tgcacatgta accttagttt taatttcctt acgttttgaa 80401 tttttaatga ctaagcgata atttatttct tagtgttttc ttgtattgcc atatagacag 80461 agtgaataga ggagagaaac atgggaaatt gatagaagaa tgttacactc ctattattga 80521 atttcataac tacttatgga atgaatttac acttgataag gagtacttag aagaaaactt 80581 tctgaatctg ggtaattaag tgaaagtaaa aataaataga taagtaaata aaataagcac 80641 atagagtact ttttaaaggc ttatgtgtcc tctcctccca gaaattgata ttgagatatt 80701 cctgtatcat ctgaaagtga ctattgttat ttcttttaga gaattaattt tatttctatc 80761 aacacatatt tattgaacat acgccatatg ccttgaattg agagagatac tgtgaagata 80821 caaaaaagat aaaggcttgg aaaattgtgg aggatgtaat agaaatttac tgtttaaaaa 80881 ttgtttgtac tcaagcctgt ttagctcagt tatttagggc ttctaaacaa tattccttta 80941 tttttgagaa tttttggaga aaaagtgaca agccccaaat ctggaaaaga gtatgattga 81001 tttacaatag gagatgtgga agtcataagc ctctgaattt aaatattaaa tgtagtatct 81061 ctggcagatg aaactataga ggacttttat ttagtcatat atttttagtg tttgagtctt 81121 atattacatt tttaatcata aaaaccttga gtggtagtgg ataactttgt gaagataagt 81181 tatgccacac aaacaggata ctgtgtattt ttcctgtgag aataattgac attatagaga 81241 agggatatgg catagataca tttgcgtttt 
cacagacttt tttttaaaat ttgaaatata 81301 acacatatta agaaaagtat atatctcaag tgacagtttg gtgaatttgt cacaaattga 81361 acacatccgt gtaaacaaca cccagataaa ggcccagagc aagctttcat agtcttttga 81421 ttctgtgtca taataaaaaa ttctcgttaa aactgtagat tgataggaac atattcacac 81481 agagttgttg actgagggaa atttcaaatg aggtgctgct tacttctctc ctgccaataa 81541 tttaatgaaa gaaagaacat catcttattt ttaatggttg tgtggggaac tgggttcagc 81601 atttttgctt gacagccttg agataaaata aatccataag gaaaaagccc acttctgttt 81661 cttttcatcc cagagagatt aatgggcaat atatgagtgt aagatactgg aaaagtctgt 81721 tgagaaacct agtgaaatct cattctctga aaagttgggg gattacaggt gctatcgaat 81781 gcatcatacc taagggttag atgaaatgac ttccaagagt ttggccctat taaatctgtt 81841 aagaggttta taatgccagc aggtgatgaa atgaaatttg gatgccttag tttcagtatt 81901 ttcagaattt atcatatttt ttcaataaat gtttataaat caaagtatgt cagatgcttt 81961 attccctata tgatctgtct ttcctggact attctggact cttcctattg atctctcatt 82021 tgctaccttg agaccatttc tctcacagca gccagagaaa tctgctcata atgtaaatct 82081 catcatgcta attcctcagg taaaaaacct cctgtggagg acatattcat ctggtaatgt 82141 tggaagaaga ccctaccata tggctttctg caaaaaagac cactatacta cctgtttgga 82201 atacaaaaag taactgactg aaattactgg acaggaagaa cagatgctag ttgagaaaga 82261 gaagagggga gctgggagcc tgaggagaag gcttttagtc tagggctaag tgcagtaggt 82321 atagggaagg acagtaaggt agggcaggga tgtcagtggg aaaactatgg gtctcttcac 82381 caggaaaact ggaaaattga agctagggaa accttggcca ctgaagagag tggaagaatc 82441 ctgcaaggga aagagccaag aaagggtacc cagattctgt ctgaaatctg aataattcct 82501 gtgtgtcaca gactcagagc agccaaatga agggcaaaag atctgaactg aaactggagt 82561 tgctgcccaa taaacagaac agttcaagtt aagcaaagat aattgcttct tacaacaaaa 82621 ccattctcat tttagaaaaa taacagaatt taggatcttc acaatgtaac atttatgatg 82681 ttcaggatac agttctaaaa ttatccagca tagaaagaag gtcaaagaaa ccaaatgctg 82741 gtttatagaa taggagcttt aggccaggtg tgtggtgcac acctaatccc agcattttgg 82801 gaggctgagg tgggaggatc atttgagccc agaagttcaa gaccagcctg ggcaacaaag 82861 caagacccca tctcttaaaa aaaaaaaaaa aaaagctagg tgtggtggca tggtggcatg 82921 ttcctgtggt 
cctaggtact tgggaggctg aggcaacaca gcgagactct gtctcaaaaa 82981 aataaaaaat aaaaacaaaa aaactagaat gggagcttta agttaacttt ggttaggcat 83041 tgctaaatga tggggataca ttctaggaaa tacttcattt aggcaacttt gtcattgtgt 83101 gaacatcata gagtgtgctt acacaaacct aggtggtata gcctgctaca catcagggta 83161 ggaatagcct gttgcttcta gactacagac ctgtatggta tttttttttt taatctccac 83221 tcttgagtat gtctttttaa tattcaatgt gatgaaactg aagtatgtag aaagttgcat 83281 accaaagtat gatagctgtc ttgaggaaaa atacttattt tattggttga gttgtgaact 83341 gaacccgtca ctcttttcgt gaaacacatg tgttacttgt aagactgata agcaattgtg 83401 tttatttatg cttgggtatg tggtagattc ttgaaagtga acaaagtgag cctgtcattt 83461 caaagaaaac aactgacagt aaaggttgcc aatgataaat tttgtgtttt caagtaaaaa 83521 ttagaatttt agaaacttgt atccaccact gaaagcttga cagagtccta atatataaat 83581 actttctgat gaaatcggtg gtgatatcaa tgaatgtgag agatttgtaa catttacaga 83641 agtacataat tcagtgaacc agtatttcca aatgaccaaa tacatcgtta taaaatcatt 83701 cacgggtaag atattcatta aaagttcaag ataaaccaat ggattttaaa gtcttaaaat 83761 acgaaaaatt tatttcggtt cattgagatt gtttccagtt ccgccctgca actaaccttt 83821 aagaaactat gacttgtcga gttttggcgt agtgtcagaa aacagtaacc ataattatct 83881 gaaaaggtta ttaaagtact cctcctcttt ccatatgagg ccagattttt ctttgtatac 83941 ttcaacctaa acagcatatt gcaacaaagt aaatgtagaa gcggatagga aaattcagct 84001 ctttgctgtt atgccagata gtgaagacat tcagaagatc ataaaacagt gctattgttc 84061 tatctttttt gttttgtaga atatggttat ttttcattaa aaattattta tgttaacata 84121 atgggtttat taagataaat atttttaaat tttctcaagt tcttttgagc cctcaacatc 84181 ctttaatgta aaaaagctat tagctacttt atgcttccta atggaagaat atataaccac 84241 cttgaaaaca tgatgttgaa agaagccagt cacagaaaac cacatattgt ataaattcat 84301 ttatatgaaa tgtccacaag aggcaattca gtagagatag gaagtagatt attggttttc 84361 taaggctggt ggggaggact aggtgttttg aggatgatgg ctagagagta tggtttttct 84421 ttttgagata atgaaaatgt tgtaaaattg actcattatt gtagtcaatt ttataacact 84481 tttatcatct ctgtgaatat actaaaaact attgaattgt acacattaaa tgggtgaatt 84541 gtataggatg taatttttta aatttctgtt ttaattaaat tttaactaaa ttaagtttta 
84601 attactaaca aagtaaatat taatagattt aacccacaca aacaaaagct tttggggtcc 84661 ttagtaattt taagaatata aaagggtttt gagtccaaaa agtttgagaa cccttgcctg 84721 tattctcttc cccactcaga cttgtctttc actcacacaa ccatgtttgt ttttgcctgt 84781 ccctgagtct tgaaatgctt tcctctgcct ttttcttgac cagattttaa ggtccatttt 84841 gcctttcctg atcatcctag tctatagtca ttcttccctc atttagtctt aaattgttta 84901 ttatctatat acttaactag tcatttataa actttgtttt actatatgtg catatctcat 84961 gttaaattca tgtaaagtac ttaaaacaat atctgacatt gtattataaa atgtagcagt 85021 taattacata agggtaaaga acagagagga gtcaacttat gatgacatgc aggtttctgg 85081 cttgtataac ttaagtggat agtagtgtca ttcattaaaa taagatacct aggattgaat 85141 gaggtttgct tgggaagatg aatttcattt gatgagtttt aagtaattat aggtcatttc 85201 aatagaggtg tcaagtagca gttggatatg tgtgtctgga acttaggaaa gaaatctagg 85261 ctaagattat ggaccataaa aaatgccacc cttttttgac agtttatctt gtgccaggca 85321 ctttggtgag cacttcttaa atttaatttt catcataatc catgaggtag tttaccctag 85381 atcaggaaat gaatgcttag aaaagtttaa taacttgccc tagaccacac aagcaaaatg 85441 gcaagatctg tgatttgaac ccaggtcttt ctggctccaa agctattaat ccttatgctt 85501 tgctgagtaa ctgaagtttg gagaagggat aagatcactt aaggggagaa tatagtatga 85561 gaagagaagg aggctaaaga tagaattcta aggagctcta acatttaaag gatggacaga 85621 aaagagaagc ctgtgcctac aaaagagatt gaaatagaac agccaggaag ataagaagaa 85681 aatcaggaga agtggtatca tagagagacc gaccaggtgc aatggctccc gcctgtaatc 85741 ccagcacttt gggaggctga ggcaggtgga tcacttgagg ccaggagttc gagactagcc 85801 tgaccaacat gacaaaaccc tgtctctact aaaaatacaa aaattagctg ggcctggtgg 85861 tgaatgcctg taattctagc tactcgggag gctgaggcat gagaatcgct tgagcccggg 85921 aggcggaggt tgagtgagcc aagattgtgc cactgcactc cagcctcagt gacagagcaa 85981 gactctgtct caaaaataaa ataaaataaa attaaattaa attaaaaact gaaaaaaagg 86041 aaaaggagag agagaacatt acaaagtctg agtgagtgct tgctcactca gacttttcag 86101 ataggagctg aaaagtttta gcaacaatga agtcactggt gacagtgaca aggactgtta 86161 aatgaagggg tggggatagg atgagatctg aatgagagat aaagtggaca cagtcaaata 86221 gatggctttt atcaaaagtt ttgcctgtga agagaaataa 
agatataaca gaagtaaaga 86281 tacagcttga tgtatgtgaa gtagagggaa tataaacata ttttttaaat cagaaaggta 86341 gggtttaaat attgatggaa aagagccatt agaagtgttc tctattgctg gataacaaat 86401 tagtggcttt aaacaaatat tttattttgc ttatgatttt ttgggtcagg aatttgggtt 86461 gaacttgact gggtggtttg cctccgacta tgtgtcagca ggtgcctgga gctatagggt 86521 ccacttccaa gatgcttctt cactcacagg tctggcatct taccgctcct tggtgcacac 86581 tcagctctct tcctcccatc ctccctccct agagtggtta gaatgttaag ctttggagtt 86641 ttcacctgca gagagaagat aggagattca tagtgaaagt agaggtttta agaaacagga 86701 tatttggcag ggtgcagtgt ctcacacctg taatcccagc actttgggaa gccaaggtga 86761 gaggattgct tgagcccgag ttcaagacca gccttggcaa catagggaga ccctgtctct 86821 acagataatt tttaaaatta gccaggtgtg gtggcacaca cctgtggttc ttgggaggct 86881 gaactgggag aattgcctga gcctgggatg ttaaggctgc agtgagctgt gattgcacca 86941 cagcacacca gcctgggtgg tagaatgaga tcctgcctca aaaacaaaca aaacaagcaa 87001 acaaccccaa aacaaacaaa aagtaacagg atatttcatt aaataattca ttcattaaat 87061 aatgaagttg acgtaatgaa tttaatgaag tttaataggt gtagtgggag taacaatgag 87121 aatgaaagtg taggaggttg tagtcaaaga gttgtttcac cttagaattt ctggagacag 87181 aacatttctg ggtagtttac tgtattttaa gaaatgatct tgggagtgca tgaggatcct 87241 ggccaaggag ccagaaagta cagtgagcaa gagtccagga catgtagcaa ggaggaatgt 87301 gactgaaata aagactaatg tcttcagtgc ttgaggggag taaccaggag acaagtggga 87361 ctaaatgggg cagagggcta gagaggtcct gaaagcaaaa ctcctggctc acttaaaaaa 87421 aaactaatct ctctctcaat acctaatcta atgtctaaca tatagaattt aggtgcataa 87481 agcattagat ttgtaatgtc ctttgatgaa tttacccctt taatcattat aaaatgaccc 87541 tctctttatc cctcgtaata ttgtttgctc tgaaaactac ttgtgcctga tactagtgaa 87601 gccattccag ctttcttccc attaatatca acatggcata ttttccccat tctttattta 87661 acttatacgt gtctttattc ttaaagaagt gattttctta taggcagcat attgttgggt 87721 gttgctttct taaatcttgc ctaacagtat ctgcctttta attgggatgt ttagatttaa 87781 atttaatgtg actattgata tggttgggtt aaattctcca tcttattatt cattttctat 87841 ttgtctcatc tcccttttcc tccctttcct ctttttctgc cttcttttgg attgaatttt 87901 taaaattctg ttttatcttc 
tttgtaggct tactgtctat acttctttat tttgtaattg 87961 ttttttgatt tataatatga atctttaatc tttaatagtc tttctttaaa tagtattaca 88021 ctacttcaca tacagtataa gaatcttaga atagtatatt tccattttcc cttttccaac 88081 ctttgggctg ttgttgtcat acattttgct tcgacagaag ttataagctc tacaatacac 88141 tcataacttt tgttttaaac caattatctc ttaaagagaa ttcaaaacaa gagaaaaaag 88201 atgtttttat ttgattacat atttaccttt ccagtactct tcattccttg tgtagatcca 88261 gattcctttt gttttccttc tgactgtaaa actttttttg gccaggggag tggctcatac 88321 ccgtaatccc aacacttagt ggggctgagg cgggaggatc tcttgaagcc aggagttcaa 88381 aaccagccta ggcaacaaag caaaactctg tctctattct tataaaacaa acaaaaagca 88441 tttctatttt ttttttcttt ttgagatgga gtctctctct gtcacccagg ctggagtgca 88501 gtggcgcgat cttggctcac tgcaacctcc acctcctggg ttcaagcgat tctcctgcct 88561 cagcctcctg agtagctggg attacaggca cacgccacca tgcctagcta atttttatat 88621 ttttagtaga gaaagggttt caccatgttg atcaggctgg tctcgaactc ctgacctcgt 88681 gatttgcctg cctcggcctc ctccaaaggg ctgggattac tggcatgagc caccgcaccc 88741 agccaacaaa aaacatttct tgtgtgcatg tctactagtg atgaattctt ccagatttta 88801 tgtctgaaaa aaaacatttt acctttattt ttaaaaggtc ttttgctggg aattgaattc 88861 taggttgaca gtctttttgt tttttctttg agtgctttaa aaatattgct ccactgtctt 88921 ctggtttgcc ttgtttcttt ttttcttttt tttgagatga agcctcgctc ttttgcccag 88981 gctggagtgc agtggcatga tcccagctca ctgcaacctc tgcctcccag gttcaagcga 89041 ttcttctgcc tcagcctcct gagtgactgg gactaaaggc acatgccacc atgcccggct 89101 aatttttgta tttttagtaa agatggggtt tcaccatgct ggccaggctg gtctcgaact 89161 cctgatctca ggtgacccac ccaccttggc ttcccaaagt gctgggatta caagcatgag 89221 ccaccacgcc tggccttgtt ctttttttgc tgtatctttt tcctgcaggt ttttctttct 89281 gtatttcatg ttgaatacct tctaaagcta tgtcttcaag tttattaatt tttttcttct 89341 gaggtatcta atctgctatt aatctcattt attgtatttt tagtcttact atagttttca 89401 tttctagaaa tttgatttgg gtccttttta tatcttccat atctattttt attatgttta 89461 tgcttttctt taccttcttg aacatatggg atataggtat tataactttt taaatgtccc 89521 tgttactaag tctagtacct atgtcttttt aggtctgttt ctcttgactg atagttctct 89581 
ctctctcttt tttttaagag tcgtatgttt ctgctttttt gactgcttgg tgattttttt 89641 gttaggtgcc agatatcatg aattttgctt tagtgggtcc tggatatttt tgtattcctt 89701 tcactatttt gacctgtgtt ttggaacaca attaaattat ctggaaataa tttgactttt 89761 ttaaggcttc atttatttat ttatctggtt tttgtttttg gacacagagc agtctttaat 89821 caatagccga ttttccttca ctatccaggc attgctcttc tgagtactct actgaatatc 89881 ccatatatga caaggttttt ccactctggc tagtggaaag cagaactctt cctgaccctg 89941 tgtcagctcc tggaaggatt ctactccttc ccatggttct tccctggttt tgagtagtat 90001 catcacatcc atgcactgat cagtactcag ctgaagactc aagcttaacc ctctgcagac 90061 ctctggagtt ctctttttgt agctttcttc tctctagtat tctgtcctgc taattttagt 90121 ttgccttggc atctcccatt tcttaactct gtttcttcaa ctgggggacc actgggctgc 90181 attccagttt ctcatctctg gactgtggac cataaactgt ccccaggcag taagctgtgg 90241 caataatgag gctcaccatt tgttttcctt ctcccaggga ttgctgttct gtactgcctg 90301 ttgtttattg tgtaaaaaga attcttttat ttatgttgtt caattttgtg gttgtttaaa 90361 cgaatctggt ccctgtttct tcatcagagc tggcagggtt actaaacgta aggcttagtg 90421 attgatgaag tatttacaaa tatgaaatat gtacacttac agaatattta gggtatcatt 90481 tccgcttcct tgtgttaaat gcatcaggtc ttctgactga tgcattatac tgtttcagag 90541 gggtgcttac tatatggaga acatagaagt actagttatt aactattttt ttgtattaaa 90601 ttttggtacg tcacataaac attgagtctc ctttaaatgt gtcaaatgta gctgacttac 90661 taattggatt ctgtcatact ctttttctgt ggtcaccaaa tattagcaaa tttcttcatt 90721 cagttacaca agtacttggc caacatactt ttacataaga aaatacttag gatgggctgg 90781 caagatagcc aaataggaac agctctggtc tgcagctgcc agcgagatca acccagaagg 90841 caggtgattt ctccatttcc aactgaggta cctggctcat ctcattggga ctggttagac 90901 agtggatgca gcccacggag ggcaagctga agcaggatgg ggcatcacct cacccgggaa 90961 gcacaagggg ttggggaact ccctccccta gccaagggaa gctgtgacag actgtgtcat 91021 gaggaacatt gcattccagc ccagatacta tgcttttccc acagtcttcg caacccacag 91081 accaggagat tccctcgggt gcctacacca ccagggccct gggtttcaag cacaaaactg 91141 ggcagctgtt tgggcagata ccgagctagc tgtaggagtt ttctttcgta ctccagtggt 91201 gcctggaatg acagcgagtt agaaccattc actcccccgg aaagggggct 
gaagccaggg 91261 agccaagtgg tctacctcag gatcccaccc ccatggagcc aagcaagcta agatccactg 91321 gcttgaaatt ctcgctgcca tcacagcagt ctgaggtcga cctgggatgg tagagctagg 91381 tagggggagg ggcatccccc attactgagg cttgagtttt cccctcacag tgtaaacaaa 91441 gccaccagga agttcagact gggcagagcc taccacagct ccctaaatcc gctgtagcca 91501 gactgccttt ctagattcct cctctctggg cagggcatct ctgaaagaaa ggcagcagcc 91561 cagtcagggg cctatagata aaactcccat ctacctggga cagagcacgt gggggaaggc 91621 gcagctgtgg acacatcttc agcagactta aacattcctg cctgctggct ctgaagagag 91681 cagcagatct cccagcacag tgctcaagct ctgctaaggg acagactgtc tcctcaagtg 91741 ggtccctgac ccctgtgcct cctgactggg aaacacctcc tggcaggaat caacagacac 91801 ctcatacagg agagttccag ctgccatctg gtgggtaccc ctctgggacg aagcttccag 91861 aggaaggaac aggcagcaat ctttgctctt ctgcaacctc cactggtgat atccaggcaa 91921 acagggtctg gagtggacct ccagcaaact ccagcacacc tgcagcagag gggcctgact 91981 gttagaagga gaactaacaa acagaaatga atagcatcaa catcaacaaa aaggacgtcc 92041 acacagaaac cccatttgaa ggtcaccaac atcaaagacc aaaggtagat aaatccatga 92101 agatgaggaa aaaccagcac aaaaaggctg aaaattccaa aaaccagagc gccttttctc 92161 ttccaaagaa tgacaactcc tcccagcaag gggagtttga cgaattgaca gaagtaggct 92221 tcagaaggtg ggtaataaca aactcctccg agctaaagga ccatgttcta acccaatgca 92281 aggaagctaa gaactttgaa aaaaggttag aggaattgct aactagaata accagtttag 92341 agaagaacat aaatgacctg acggaactga aaaaacacaa catgaggact ttgtgaagca 92401 tacacaagta ccaatagctg aatcgatcaa gcagaaaaaa aggatatcag agattgaaga 92461 tcagcttaat gaaataaagc gtgatgacaa gattagagaa aaaagaatga aaaggaatga 92521 acaaagcctc caagaaatag gggactatgt gaaaagacca aacctatgtt tgattggtgt 92581 acctgaaagt gacagggaga gtggaaccaa gttagaaaac actctacagg atattatcca 92641 agagaacatc cccaacccaa caaaacaggc caacattcaa attcaggaaa tacagggaac 92701 accacaaaga tactcctcaa gaagaggaac cccaggacat ataatcgtca gattcaccaa 92761 ggttgaaatg aaggaaaaaa tattaagagc agccactgct caaggaaata agagagggca 92821 caaacaaatg gcaaaacatt ccatgctcat ggatgggaaa aattatagcg tgaaaatggc 92881 catactgccc acagtaattt atagattcaa 
tgctatcccc atcaagctac ctttggcttt 92941 cttcacagaa ttagaaaaaa ctactttaaa ttttatatag aacctaaaaa gagcccatat 93001 agccgagaca atcttaagca aaaagaacaa agctggaggc atcacgctac ttgacttcaa 93061 actatactac aaggccacaa taatcaaaac agcatggtac tggtaccaaa aaagataaat 93121 aaaccaacgg aacagaacag agacctcaga aataatgcca cacatctaca accatctgat 93181 ctttgacaaa cttgacaaaa acaaccaatg gggaaaggat tccctattta ataaatgata 93241 ttgggaaaac tgactagcca tatgcagaaa actgaaactg gaccccttcc ttacacctta 93301 tacaaaaatt aactcaaaat ggattaaaga cttaaacata acacctaaaa ccataaaaac 93361 cctagaagaa aacctaggca ataccattca ggacataggc atgggcaaag acttcatgac 93421 caaaacacca aaagcaatgg caacaaaagc caaaattgac gaatgggatc taattaaact 93481 aaagagcttc tgcacagcaa aagaaactat cttcagagtg aacaggcaac ctacagaatg 93541 ggagaaaatt tttgcaatgt atccatctga caaagggcta aagtccagaa tctacaagga 93601 acttaaacaa atttacagga aaacaaacaa ccctatcaaa aagtgggtga aggatatgaa 93661 cagacacttc tcaaaagaag atatttatgt ggccaacaaa catatgaaaa aaagctcatc 93721 accactggtc attagagaaa tgcaaatcaa aaccacagtg agataccatc tcatgccagt 93781 tagaatggtg atcattaaaa agtcaggaaa caacagatgc tggagaggct gtggagtaat 93841 aggaatggtt ttatactgtt ggtggcagtg taaattagtt caaccattgt ggaagacagt 93901 gtggcgattc cttaaggatc tagagccaga aataatattt gacccagtaa tcccattact 93961 gagcatatac ccaaaggatt ataaatcatt ctactaaaaa gaacatgcac acatatgttt 94021 attgcagcac tattcacaat agcaaagact tgaaaccaac ccaaatgccc atcagtggta 94081 gactggataa agaaaatgtg gcacatatac accatggaac actatgcagc cataaaaaag 94141 gatgagttcg tgtcctttgc agggacatgg atgaagctgg aaaccatcat tctcagcaaa 94201 ctaaccagga agagaaaacc aaacaacaca tgttttcact tataagttaa aactgaacag 94261 tgagaacaca tggacacatg gaggggaaca tcacacaccg gggcctgtca gggggtgcgg 94321 gtctaaggga gggatagcat taggaaaaat acctaatgta gatgatgggt tgataggtgc 94381 agcaaaccac catggcacgc gtatacctat gtaacaaacc tgcacattct gcacatgtat 94441 cctagaactt aaagtataat aaaaaaacag aaaatattta aaatatatct ttcaaattat 94501 cttttcaaat ttaaatcgag ggtcatttta tgtatgagta taatttgtaa tgaatagccc 94561 atagttttag 
agtcctgatt cattgacaaa cttataaaca aaattttaag cttttttggt 94621 attcttttga aatgtcaaaa ttgatgggaa gcaaacacaa aagggagaac agtctgtggc 94681 tctgaaaaaa ccaaaaagga tatctgttgt atgtttgtat ggtgggtcat taacctttgg 94741 tgtactttgc agaagtattt taagtattta ccattggaca ttatttgttt taagttatat 94801 tttgctgaga gctacaactc aggaaggcaa gtgaattgta agataacgat taagccacat 94861 ttaacacttt ttaaaagcat gggccaggac aatgaagcag tatgatgcca tatatacttt 94921 tacacaacag ccatgctaat ataatgataa agcaattatg ctatattttt aaaagcttct 94981 catgaattac ctaaaataac catgtattta cttttctttg ccagaaagag gaaacatttt 95041 attttaattt taaaaaattc aattctgaaa acatagctaa agctttaacc tttctcactc 95101 tgacccaatg tattcctaat atagctttgc ccttgacttc cacagctaaa tactagtttc 95161 tcaagtgatt taggagactt gacttcatta ttcaaaactg aaattattta gcttgttcta 95221 aatgcaactt tgcttcccac tatgtctatt tttccattac ttttcaaatc atctgggctt 95281 aaatactgag cattttctcc ttttctttct caatagtttg atagagtcct gtaatttttt 95341 tttaaagctg tctctcatat ttattatatt attattttaa tatacttcat tacacttccc 95401 tagaacagat catcagctct ctttaagtaa tttgtctgag gtcacccagt tggcatttga 95461 actcatattg gtttgattcc aaattccata ctggtttcta gtaaaggtct ctgtcactgt 95521 actacattat ttctgtgaaa tcattcatgc tgtttttact acctctataa tatatagtca 95581 ggcatatggg acagtataaa gggagcaata caaatagtaa gttaatttta ttctaagtga 95641 aagcataata tgtatgtgta ttttttatcc tgagtaacaa gggaagtgtt ttgcctagga 95701 gctaataagt aacaagaaag ggaaaagtaa catttgttga ctagctcctt tgtgcagtgc 95761 actgtgacta ttggtctgac ttattcctca ggaagaagga ttaaaagaag actttacaaa 95821 aatgagcaaa ctatattata aaagtaaatg ctagtatgca tatatgtttc attcatgcaa 95881 ctcctagaaa tgagaattta ctataagtgt tttcagaatc cacaataatt taggttgaaa 95941 ttaggtattt ggatcaactt tcaagggaag gggtgagaat gaggtgaaat agacatctag 96001 attgttttcg tagactgaaa cattggacct gaggtttttg ttgttgtttt actctttctc 96061 tgtttttagt aaaagattcc aggtatgcct aaagtcatct tctatacgaa tgcaaaaatg 96121 taacgacaaa gaagggaaaa taaaagaagg caatatctta ggaataaata ttttatagtg 96181 acattatttt tctgctagtg aactaataat ctacaaaaat tttaagatag ataagcagaa 
96241 agaatacaaa caaaaaaaga tagtaattta actcatatga ataatcatcc actgaaatgt 96301 agcaaggata tgctacatat gaaatgtaat tagatgattg cttaaagatc taggaatata 96361 agctcaatgt aataaaataa ctttcttcat cattgttaag gattaataag gcattaaatg 96421 ggttttcatg tgggtcctca tatgttaaaa aggacaacaa taagaaaatc taaagaaact 96481 gagataactt ataaatcaat gcaggtattc tgtgttaaat gcttatgtaa gcctgagttc 96541 ttcatccaga tctgtttatg attttaaaca taaaatcatt ttcatttcta aacctttgct 96601 caagtaactt gtagtacctg agtcctatgt ctgatcccta agtctattct tttctctatg 96661 tatctatacc cttctttaat tttaaaacac tctgacatat gaaaaaatgc cagagtttac 96721 tgtagagaaa tattcataac ttgtctattc atgtcactcc tccccaccac accatccata 96781 ttatacacat tacccattct ctttgttcat taattggcaa aagggactaa gaaattcaag 96841 gatgtccaaa aataaaaaac aaacaaaaat ctaacaactc aggaatgtcc tgtaggtagc 96901 tcatagaaac ttagtttagg aagtagtgct catcacttgc ttacttcttt ggaagtttgt 96961 ctgataatta tacagaccag catcaaaata actctcattt atgtgtgtga catttcagtg 97021 tagaagtaat aataggtaat gtttttgcat atgtcctatt ataggtactg ttaaaagtgc 97081 ttcagtgtat taactcataa cttctcacaa ctctatgaaa tagattctgt catcactgtt 97141 tcacagatgg agaaactttc ctgtgtccac aaaactggtt ggtgacagag ccagaatttt 97201 aactcaggct atctggttcc agagactata cttttagcct ctatgttgcc ttttagtgta 97261 tattagttta ctgtttggta tgtgcattgt catgattcgt ttatttgctc atgttttatc 97321 tgtgtttgta tttttgtgtt tgtgagtgtt actatgctat ggcaaaaagg gcctagactt 97381 tgaattagcg gatcttactt aatattcaca gaagcctgga agtgtaaaca ctgctatccc 97441 tattttatag ataaagagac ctgaggccag agatgtcatg taacttgctt tggtgtggtt 97501 ctgaacaaat tatgtctctg tatctcaggt ttccttgtat gtaaaatagg gataactttc 97561 agttgttgtg aggaataaat gagcatttat gagaatagta attagaaaca taacaattaa 97621 ttgttagtac ccttcattcc ttttcctctg ctatacagtg atttcataag aacagagaac 97681 cgaggattca gttacacagt attaacctgt ttatcttggc ttactataag tacagagaat 97741 ttttctcttc tcttgaggca cccctcctcc cccatcacct tctcaaaatg gaaaaagttc 97801 agggggaaaa ggcatcatgg gactccagca ttgttaagta tgggcagtgt tcactgttat 97861 attacagttg gaatttgtta tgttgcttta gttatgaatc 
atgtcagtaa ttaccctgag 97921 ggtggtgtta gttcactatt caagtacaac aagaagggaa gtagcaatgg tagaagaata 97981 gctaggacat actctctgta cctaacctgt gggatggaag cacatatact cttattcaga 98041 gaattttata aagagtcatg atgttagata attacttgaa tttagtttta gaattttcag 98101 tttgtaaact tacagtcaaa tcttaactgg aattgtgagg aagggctcta gaattctgga 98161 tatgcaagaa acaaagaaga gaaataggag aaatcgatcc taattagtgt ggcttctgaa 98221 tttctgcagc aaaataaaag gctgttatta attattgagt ttactaaaat tttagaacta 98281 ctctaaggca attatgcttc aatcatctaa gcttacttac ctttaattat aaaatagaac 98341 aactttttgg catttcattt ttccaactat ttatagcatg ttttaaaaac tctgaactgt 98401 cacgacccag tctgtttcaa catacacatt tgtggtcctt ggtgttgtta ctaatttact 98461 taattggttc ttcatatttt cagcttttct cagaaatggc caaatatcaa aggtattttt 98521 ggtgtcttcc tgtaagtgct agatttagaa catgataagg aaacgtagaa taggaaacag 98581 ctggatttgt ttcttagcct tttcaaaaaa gccttttctt actttgctaa gtttcagtgg 98641 ttcttatctc cactaaatat aacaacaaat ttatctttaa gattttgatt atattcctta 98701 agaattcttc aaagactaac ataaatctgt cttgctatct tttaagtctg gtcattttat 98761 cattatgccc ttttcttatt ttaagataat ttaattgtct aaaagttttt aaatttacat 98821 attaagtaat gatagttttt atattaggtt ttatcattta gtttttttct tttaaacatt 98881 aagatgaaaa aggtggagaa gtgaaattac ttgtctttta catgagtcat aaacaaaaat 98941 gaacatagga tacactaatc caatactgaa cttactatct gttcttgaaa atataatttt 99001 tttgcattat ctttttcata ggaatatcag acacctttaa tagtgtacac ttttagattg 99061 gcatctgatt tagttgaacc cacatatttt aacggggtct tctgaaaata aacgtttgca 99121 catttcagtt ccaatgaaga acaaatggcc tggggaaggg atggagagtg gtaaagaaat 99181 cccttgtaga aaaccttttt tttttttttt tttttttaaa cagatgagtc tcactctgtt 99241 ttcccaggct ggagtgcaat gacacaatca tagctcactg aagccttgaa ctcttgggct 99301 caagtgatcc tcccacctca gcctccttag tagatgtgac tacaggtgtg agccaacaca 99361 cccagctaat tttttttatt ctttgtagag acggggtctt gctgttgcca ggctggtctc 99421 aaactcctag actcaagtga tcctattgct ttggcttccc aagtgctggg attagtcatg 99481 agctgccact cttggcctac agaaactatt tatatgtgtc ctttgcttta tttttcttta 99541 agtgcaggtc cattttttcc 
cctcgtttta ggtagctaag attatattga atcatgtgtg 99601 tttcagtctt tgtcatctca ggaagtctta tgagaccaca acataaagta tagtaatttt 99661 tgtccaataa ttttggccat tagcaaaaat tacctatatg acatataaat caaatggtat 99721 aattttttag tgctaagtct taagtagtat tcatttttat ttaaataaat tgcttctgtt 99781 cgtaaagtac tatatataaa acctttttac cttttttata tataaaacct atgattcata 99841 ggttttatgt cctattaaaa tatgagctaa caataattgg taaaattagt tggaaaatta 99901 tcagttttaa aagaaaatta aattgaaggt tatgttgtta gtaatttaaa gatctcctgt 99961 gttttgatgt aaaaacacta ttttttagca gcagtcaaag ggcttaagta gtatgacttt 100021 ttgtgtaact tggattgtgc aattgctttt aaaaatgaaa ctatttctca tttataatga 100081 gataatggaa ctcagtgtat cggaagagca gactgaacac acacttctac ctcctttctc 100141 cccgcaaacc ctacagaagt gacaaaaaag acacatatca caggggagtg aaacccagca 100201 gtactcagac tgtgaagcag aaaaactgtg gagggaagtg gaggagttga ttcttttttt 100261 tagtataaaa aataaatgat gagtgtattg tggatacaat aaagtagaga aaaccacagc 100321 acaccacacc cccaggaaag gggactggtg gaagtgggag tcttagagag cccccaagct 100381 caaagtagca gatacagaga gtggggcaat aatcaggaag gttttctaaa gtgctgtttg 100441 gcaatggtcc tcaaacttga gcatgcatca aaattaccag gagggcttgt taaaacacat 100501 attacagatt gctgggccca gtcacaaagt ttccagtgta gggtccaaga atttgtattt 100561 ctgatatgtt tccaggtgat actgatgctg ctggtctggg tactacactt tgtgagcaac 100621 attttaagaa catgaaacac ctggaccagt cccccatcta aggccatgcc agcctatgag 100681 aataaaaaaa atgcacctct gtagtcccag ctacttggga ggctgaggag ggaggattgc 100741 ttgagcctgg gaagttgagg ctgcagggag ccgtgactgc actactgcat tccagtctgg 100801 gtgacagaat gagacactgt ctcaaagaaa agcgaaaaaa aagcaccttc ctctttacta 100861 atggattata aaaagatgcc gtccggacac agcaacccta agcttggtat gcagagttta 100921 ggctgaagcc ccatgctgtt gtgcagtgca taaactgtac agatacaagc ccgcgcccac 100981 cttcaaattt tgtgtacact tacagacacg gggctagggt ttcctgatgg tgccaaagag 101041 agaagcagaa cagaaattga agtaaaccga gacttctgta acagacctaa agaaaaagaa 101101 ataggaaagg cagttcttcc taggaggaaa atacaacaaa gtattgtact ggtaggttat 101161 gtagttcctt actttggtat tttcaggtac ctcagttcaa tatttattca 
ttaccataag 101221 aaagaccaga ggaaatggag gagagaaaat aataaaaaag aacagaaaaa aaaattccaa 101281 aactaatgaa aaccaggggg cttcagatgg aaagggccca tagtgccaag catgatgaat 101341 gaatgtggat actttttgtt tttttttaat gacagtctcg ctctgtcgcc caggctggag 101401 agcaatggtg tgatctccgc tcactgcaac cgctgcctcc taggttcaaa cgattctcct 101461 gccttagcct cctgagtagc tgggattaca ggcacgagac aacgcgccca gataattttt 101521 gtatttttag tagagacggg ttttcaccat gttggttagc ctgatcttaa actcctgaca 101581 tcgtgatccg cctgcctcgg cctcccaaag tgctggaatt ataggtgtga gccaccccac 101641 ccagcaaaaa attcagaatt ctaaagataa tgagaaaatt ataaaagctt ccagagagaa 101701 aaagtaagct attttagggg aatgagatcc agatcagcac agaagttctc attaggaata 101761 ctggatattt tataacctaa aatttgtcaa atatgacagc aaaatgaaat ttggaaatgc 101821 aaggactcag aatgtttatc ttttgactgc ccctttgaaa tgaaattact agggaatatg 101881 ctttaaaaaa atgaaaaaag aatccacaaa agggaaatca taagacctaa taaaaaaact 101941 gtcacaccag agcagagaga aaaatgcaaa aaatcctacg atgcaacaag cccagaaaat 102001 gtttgttcca catgagaaat cagatagttt cataaagaat gccttcaaca acaaaatggt 102061 ttcttttaac aattgtataa cctcatctaa atggtgggaa tttaacaaat acagtttaaa 102121 gttgataaat caagtaataa gagtaataat gaatttgtgt attaacaaaa gaaaatttgc 102181 cttttttcct cgggcttttg aagaatttta agattcattc taattattga gaccaattat 102241 gtgccagaca ctaaaataaa tgaagttcta gggtgttttt ctcccatcct tgttggccca 102301 tagttttgac tgcatataac aggagatggc atgcacagcc cacagaccat aaattttgtt 102361 ttaacacagc cgctttattc atttatatat tgtctgtggc tgcttttgca ctgcagtaca 102421 gagctgagaa gttgcagcca gaaagtagag cctgcaaacc gaaaatattt actatctact 102481 ccttgagaag aaagtttgtc aatacccaat ataaagcatt aacatgtttc ccaaagtcgt 102541 aactatacaa aaaggtataa tctgaaaaat gtcatattct ttgatattcc ttctatccta 102601 tttccaaccc cacccttgtt ttttggttta tctttcctgt gtttcttttt ataaaaataa 102661 gcaggtatgc acgttttatt ttgccttttt cttatacaaa aggtggcata ctatgtgtgc 102721 tcttttatgc tttttttcat ttaacactac cttcttgaag tcattccata tcaatttata 102781 gagatatttc tcattattac agctgcatag tactgtattt tgtatatata ctacagtctg 102841 ttcaactagt 
tttctatgat tggacatcta ggtggtttcc agtattttgc aatttcaact 102901 attaccacaa taaataactc tgtgtgtgtg tgtgtgtatg tatgtatgta tgtgtgtgtt 102961 atgtaattga aattgtgtct tgaggatcaa tacctagaag taagaatgct gggtcaaagg 103021 gttaatgtgt gtgtgtatgt gtgtgttaga tattgccaaa tcccatccat aaggtttgta 103081 ctatttttca ttcccaccag aaatgtatga aggtagcaat ttctccatga cggccactgt 103141 ccaaattttg ccaagtaatg acattaaaat attcttctca tctattatca ccatcattta 103201 aaaaagaatt tctcccattt taataaagga aaaaggacaa atcagaaaaa aaggataaga 103261 agagcccttt aaattgtata aatatagttc ttgttgtgat tatttttctt ttcttgctga 103321 agacaaatgc atctctaccc tgtatcagac gaatctttat ttgatcatgt ctcccctacc 103381 ttgaaattct tgaggacttt ccacagcttc tcagcactta acgtggtctt aacatggcat 103441 ataagtacgt gtaagatgtt agccctattt ctccagccac tttgaatgtt ttaattcttt 103501 cagtatccaa attctctccc acagggcctt tgcctgtggt gcttgccttg ccaccattaa 103561 gtaactagac aagtaaaaac aaagaatctg ggttcagcca ggtgcagtgg cctgcactgt 103621 agtcccagct gcttgagagg ctgaggtggg aagattgctt gagcccaggt attcaagtcc 103681 agcctgggca atatagcaag acctcatctt tctattaaaa caaacaaaca aacaaacagg 103741 gttcttaccc tggcgctaca ttaatttgtt atgtgacatt gtgtctgttg tttagtctta 103801 cagttcctca gtttccccag cctaaggtgt agttacatca gatgaccatc aggccctttc 103861 tgttctaagt tgtgaatgtt ccagaagcca ttttgatagg acgataatgg aacctagcac 103921 atttacccca tggctattaa ctgtaacgct gctgtaagta tacaacacat aattgtttct 103981 tacaattatt tatttttaaa aatcacttga aatctattat gagtaaacca agttttgaga 104041 aaactgatta ttaaattttt gaattatacc gcttctgggg ctctatcttt tgaattctgt 104101 ttttaacact gcttttccaa tgaaagaggt aattttctac ccttttaaga ggttattgta 104161 gtaaatcttt tgggagcagg aagacaggaa ggtaggagag gcttaacact atcattttaa 104221 ggtaataaaa aagagagaga agtttgcctg gtgttaatct cctgttcatt tattatctga 104281 tctgttgcct tgtttaattg acattctttg tacttttata aaactgtctg acatcaagta 104341 aaaatgataa agagaaaata gaaatttgag aattagagat tttaggagtt atgtacatgc 104401 tgatctctgg aaaatattgc taaataagta atgctgttga tcattaagta ctctggtcgg 104461 tttagatatt aatgtgtgtt ctacattctt 
aagtttgttc attagtcctt tttaatttgt 104521 aatttatttc ttagaagcca tgttataaat gggtaattca gttttcagga aagccccaaa 104581 aagcttttta aaaagtttgt ttttggagtc tttaagattt catccctgcc taagcactgt 104641 ttataaaatt gtttctacac aaaaatttat tcacaactaa gaacatttgt gacatccaga 104701 gttggactgg catttaggca cgaaaaaagt ttaggtgctg gaaggtagtg atgcttcttc 104761 ttggaaatag taatttaata tattgacaaa ggacaaaata cttggttcat caggtgtcca 104821 aaattgtagg accaaaaagc aattgcttca gattcttgat aactatgtac cattcatgaa 104881 ccaaatgttt ctcggatggg aatatgaagt ttgctccatt ttctatttta tatttctcaa 104941 tttagtgact aatttcttta aataccaata tctataatta agatttctat ttccaattta 105001 cttttcagca ttttgaatgt agaatattaa ggttcattga tagctcctta agtagtgtct 105061 tcagtttaat tttctgtgaa tataatatat ataatacata tatttttatg ctttttaaaa 105121 taaactaggt acttgatatt ttctcagtaa agatgatttc taagttcaga aattattagg 105181 atgctggaaa gccatttttc cacacctgaa aaattttctc agcttactgt gttcctttga 105241 agggagttaa tgtgggaagg aagattagtc aaatcatcta ccgttttatg aaaatgttaa 105301 taacagtggg cacagttttt ataggggggc ttagtgctat cattttcagg taatcaaaag 105361 taaaatcatc agagttctaa tattaatata ggtttttctg aataaagggg gaagggaaga 105421 gaaaaccctc agaatatatg acttaccaca aagtacaaca ctagcaaagg aaaataaatt 105481 ttgttctggt acactttttt tccatagttc tagatgagaa aaccatgttt gttatctgtt 105541 atgtattgct ttatgaagaa tcttacaaaa cttttgattc atttatcttc ctgttgacaa 105601 gagacaccaa aaagtttctt tatgaacata gtaataacaa ctatttaatg actgttttac 105661 ttgcatttta aagttatatt catgttttat gatcttgcct taagttgaga tctttaagct 105721 gaggagagtt ctgagaaatc tcaaagttat acaattctcc acccctttcc cccagatcag 105781 tagattggta gttgcttctc aactcagtat gtttaagaaa gagtgtattt ttatttactg 105841 aagtgtaata tcaaataaag gaatttcagg tcaatggaag aaaacacatt tatattatgt 105901 atatatttct cttgggatat aaaggaccat ggttagtatc ttcactagtt ttttgggttt 105961 tttttctttt ctcttttttt tttttttttg agataggatc ttgctctgtt gtccatgctg 106021 gagtacagtg gtgtgatcat atctcactct aacttggaac tcctgggctg aaggcattaa 106081 agatgtattt tgctttaggt ttatctaaag gaaaccatgg atccctaaag 
aaaacccccc 106141 actgttattt tattgaaaat gggtttacct gtctagcaaa tgattgtgat ttttctgaaa 106201 tcattacacc tcaaacccag agaacagcaa aaatcaaatt tatgtcttaa aagagttttt 106261 tgtccaagtt gtattgagga cttgaaagat taagacagta gaaatcagat aggttggatg 106321 ctgaatttaa ttacatgtcc ttttccttaa atagaaggca ttctttttta gttaactttg 106381 gagaactgct tgattttttt tttttttttt ttttttttga gatggagtct cgctctgtcg 106441 cccggctgga gtgcagtggc gagatcttgg ctcactgcaa cctctgactc ctgggttcaa 106501 gcgattctcc tgcctctgcc tcccgagtag ctgggactac aggcatgtgc caccacaccc 106561 agctaatttt tgtattttta gtagagacgg ggtttcacca tgttggccag gagggtcttg 106621 atctcttgac cttatgatcc acccgcctca gccacccaaa gtgctaggat tacaggtgtg 106681 aaccaccgtg cccggcctga atttttttaa aaaaggaata tgttggataa taaaaaagtt 106741 ttagtatact tggtactttt tttctaaagt atacctcaga gccagaccta ttttatgttt 106801 agtgaaatga aaatatagtt ttgtttagtt atttggcctt gtagttttgt tactttataa 106861 tgtaaggaac tgtgtcctta attaatcatt ttgacgtaca ataacatctt gcctacttga 106921 aataatgttt tgaacttccc aatatttggc ttaaaatggc cagaagatga aggaaaagaa 106981 aatatttaca gtcatgcatc acgtaatagc attttggtca atgaccgcat aaacgatggt 107041 ggtctcataa gattatcata ctatattttt actgtacctt ttctgtgttg agatgtattt 107101 agatacacaa gtaccattat tttacagttg cctacagtat tctttgcagt agcatgctgt 107161 ataggtttgt agcctagaag cagtagggtg taccatataa gcctaggtgt gtaatagtgt 107221 ataccttcta gagttgtgta agtacacact gtgatgtttg cacgacataa ttacctaagc 107281 aagcatcata agcaacacgt gacagtatat tactttctaa atgccaagta ctatatttca 107341 ctatctctga tgtttctccg acctaacata cttaaaacat aagttcttga ctgtgatgta 107401 agtgcacctt taaaatgtac ctctggtagt attttatttt tttgctaagt aaaaacttag 107461 caaaatgaca ttttattagt tttctccctc tcaatttttg tggcatatat cattttgaaa 107521 ttgggtgcat tatatacaga aaatcaaaat agtaacttca aagcatacca tatagatagc 107581 atctgctttc tacaaattgc ctacacagac caaactccca aaatgttatc aatgtgttac 107641 tgctcaagat cttccaacaa tgttaatatg tcatcatttc tacagaagac tacgttttta 107701 gttatacata tattttaaaa tactcccaaa acatttattt caaattactc ttctttctca 107761 tttgttaatc 
atagtttaag aaatatgatt agtagtttta attcaaaata aaaatggtat 107821 gttgaataaa atattacata aacataaatt agtataaaat aatttagtaa attaaatcat 107881 ttaatataaa aaggaattca aagacattac agattggcac cacataattt aagtcagtca 107941 cttttagaca ctaaataact ttaagagatt ttttttaatg aaggaacaaa tcaaaatggc 108001 tcagaaaaat cagatggagt ggatacacaa ataaaataca tgttaatgct taacacattg 108061 aatacaaatt ttctttatac taaagacttt aaaatgtcca tgtgttaatt tcttttggag 108121 gtggaaaaat agtttgtcca aaaagacact tttcacagtt gaaggaactt gaaagttctg 108181 tcccagtgag tcctaatggt tttatttcag gcagcagatt cattgtcaaa tatcttactt 108241 tttaaggtct gtaggttatg ctgaataaaa ttctctgcac catgaacttc agagaatctg 108301 aagtcacttc tcctgacaga ccagtttttc atttttattg aattctgaat tgtgtccgat 108361 gtaaagtagt aaactatagg gtcaaaacaa cagttggaaa cagcaataca gagagtgatt 108421 gggtacattg tccttactgc tgccactact gagcaattaa caaatgtttg tgttctcaca 108481 agagaatata aaataagatt gatattgtaa ggaacaaaac agaaacagaa tatgatcaaa 108541 tgtacaaaaa tcatttttaa aaccttagtt ttgtttattt tgcttctact taatgtaaca 108601 ggtttggtta aagtttttag caccatacta gaacaagtta catttaaaat tagaggaata 108661 aaaaatccca ctatttcgat gaaaattaca atccttgaga gatatgtttt ccatgtggct 108721 tctggaaaat tttcaaagca ggcttctgag gcattgttac cctgagagtg ggtagactga 108781 acaaaaacgg cgggtgcact tcctccgatc acagttaacc acacgccagt gcaaacaatc 108841 tttgcatttc ttttggttct tagagtcttt gacttaaatg ggtagacaat tgccagaaat 108901 cgatctacac taatacaggt taagaacaga atgcttccgt acatgttggt ataaaacagc 108961 atcacagaaa tcttacaaag taaatctcca aatggccaat tccgtgttgt gaagtaaaaa 109021 atcctgaagg gtaaagtaaa aacaaaaagc aagtctgaca ttgccaagtt aatcatgtaa 109081 gttgtagttt catttcggac tttgaggacg cagatgaaaa tgtatatggc aacacaattg 109141 gatataaccc aagcacaaac accatgctga acatgcaccc atacaaagtg tacttaaagg 109201 agtcattata gaagcagtgg gagctgttaa cgcttaccat cgtaaaggca cgtccaattt 109261 tcagtttgga agcactttca tcagctgcag tctcctttgg gattcagatt ataacctcta 109321 taacctccaa tttatttgca aattatctgg atctttggat ggttttataa atatttcctt 109381 tttctcagaa atacccaaaa gaaacatgaa 
atttgttgct gtaaaatttc cgctgggttc 109441 ttcaacagga aaatattttc atttgttagt ctttagaaaa tccatgaatt ccaaggctca 109501 gttaactgac gagcaacctt taaaaaatac agtcaacgag tccaacccat aggattataa 109561 atttaaagaa aagctgtgat ctgtgaccag aatgaaacca cttgtcttct aaacttgtat 109621 ttcagtagtc aagatgaata ctccaaagtt aatgtttctt tatgcctcag aatgttaagc 109681 ttaagttatc aatcttattt tcttttcagt gcagagtttc agagacttaa acattcccaa 109741 tttgtcccat tttacttctt gcttctttta aatgaagttt tccttttttg ttttcctgca 109801 gtggatccca gattgtgggt aagcagaaga gtcttttaaa gattccgaaa taaactccca 109861 agcttgttag ctcttgttga attgaggcct tttcctcagt tgccagttgt atcttgaggt 109921 cgctttctct gaagtgaaaa agaaaggctt cctagatttg taatatgaca tcacaggaag 109981 aagaggctgg gtttggacaa agaaactttc agcccagttt gtttgcactt cctttaatct 110041 gataatgtgt ctgctgggaa tatacctagg aggcttgcaa atggtcactg gctgttgaac 110101 tgcgtggtct gctcattaac tctttttatg tggaaatgcc ttctgcggtg ctggcaacct 110161 ctcagagttg ttttgatttg tttcagtagt aataattgaa agaaaaaaat cactattcag 110221 cctgttctct tattcgattg ttttcgtgtt tgggatatga caatttagaa aagaaaatgt 110281 gtatgggtgt gtatttataa attgaaattg ttagtgatat cttaacatag catttcaaag 110341 aactgtttaa tgactatgaa acaggggcct ttaaccttta aggtataaca agcatttgac 110401 tcttggtttc tgaaaaatac tcaatattta attgttatca ttatgagtag ctcttatcct 110461 gggtctcatt agtttttgat aagcaatttt ctaacattct ttcctctttc ctatgcagtt 110521 gtattaatat tttagagctg tttaaatttt taaattactc atttctccaa aagaacaaac 110581 tcaagaatgt ttatctttgg tgtggtgtta attttacttt ctaagcacaa aatgaaattg 110641 catcttttta agcaaaattt ttttaacctt aaattgattg aaataaggtt tagagaataa 110701 aaatcctact acttggcaaa attttaggtt catagtctta tcttccttct tcctcccagc 110761 tcttccctta accccttaat atgacattaa ctatatattg aaatatattt cagattttgt 110821 ctccatctaa aagtttctat gaatggttta tatgtttata tttaaagtat actcatattt 110881 tgggccaggt gcagtggctc acgcctataa tcccagcact ttggaaggcc gaggcaggtg 110941 gatcacctga gatcaggagt ttgagagcag cctggccaaa atggcgaaac cccatctcta 111001 ccaaaaatac aaaactagcc tagggcaata gtgcgtgcct gtagtcccag 
ctacccagga 111061 gactgaggca ggagaatcgc ttgaacctgg gaggcagatg ttgcagtgaa ccaagatcaa 111121 gccactgcac tccagcctgg gcgacagcga gactatctca aaaaaaaaaa aaaaaagtat 111181 acttatattt taaagactcc aaaatcagaa gaagaaagtg tgtgtaaggt gagggtactt 111241 aaatgtaggt gttatttcct caaagattga ggatatcaca acctctatgg cacatgtttt 111301 tcttgtttta aatatcattt tcttcagaaa aggctagatc tagttatgcc ttactgttat 111361 tattttttta atttggtcta gtttaagctg aaactcttcc tcttgggtat taagtaattt 111421 aagaacaatt tatttttttc tcataaatta cttagctagc ttaaaaataa cattacagta 111481 aaatgttttg tgttgttttg caccattctt tcacactcag caatttttgt catttgttga 111541 tgaactataa tctatttcta ctgtctggac tagaaaattc atctattatc aaatgacaaa 111601 aaaagatgag tattacttat aaccaatact taaaattttg aagggttgaa tatcttttat 111661 atcacatttg gtatagaaaa gatgtatgtg tgtgttctta ttagaaggct ggtagaagct 111721 gagtgagcat taatgaaaaa aggagtggtt aataagcttg tattctcatt ggaaaaaatt 111781 ttctgttagt agttatgtct tactgttatt tttttttatt tggttcagta tgttttaaaa 111841 aaacaaaatc ttagtaatct tctttcaaat tgatagtctc ctctttagtc atctttaatt 111901 aatgaaatat atgcagttgt atatgctgat ctcatcctac ttttactttc attttgaata 111961 tggcggcatt ttctcattca aataattaaa atttattaat tctacttgga atatttttct 112021 acatattgtc attactcact tgacttgtta ttcacttaag tcattcatct atagtttttt 112081 cttttctttt ctttttcttt ttctttcttt cttttttttt ctcactctgc cgcccaggct 112141 ggagtgcagt ggtgcgatct tgtcttactg caacctctgc ctcccaggtt ccagcaattt 112201 tcccacctca gcctcccaga gtagctggga ttatgggcac ccgccaccac acccagctaa 112261 tttttgtatt tttagtagag atagggtttt accatgtggc caggctggtc ttgaactcct 112321 gacctcaagt catccaccca cttcagcctc cccaagtgct gggattccat gtgtgagcca 112381 ccgtgcccgg cctgttcatt tatagttttt taatgtttga gttactgggg cttaaaaatg 112441 gatagatcac tatttgttga attaggcttt tcattattat cagtatcatg gtaaatgttt 112501 gtataattct taccacaatt cttggttctt tgtagatgat tctcacatgt ttgctgatga 112561 atgaatcata aacttgaatt ttgttgccag ttgtttctac tgaggaagag ccatagcaga 112621 gggactgttt gaacaggaag caagataatt gaatactgca ttttttccag atattaaatt 112681 tggcctctaa 
ttaaggaacc aaaggggaat ttagaaactt tgaaaaattc ttaggttaga 112741 attggaaaaa tacagagtaa ctacttgcaa agggaggtaa tgaatccata taatcaatta 112801 tatttaatag atacttttgt atttgataaa ctcaggcaaa ctgttatttt aatgtgagat 112861 ctctacactt ctggaactta actctgagaa tcaacttatt tcttatatgg attttttggg 112921 gggagcgaaa attaaacaga gccagataga tgttaaaatg gtgaacacag gaacagctgg 112981 agtctgcagc tcccagcgaa atcaacgcag aaggtgggtg atttctctat ttccaactga 113041 ggtacccagc tcatctcatt gagactggtt agacaggggg tgcagcccat ggaaggcgag 113101 ccaaagcagg gtgggcatcg ccttacacgg gaagtgcaag gggccaggga acttcctccc 113161 ctagctaagg gaagccatga gggaatgtgc cacgaggaac gatgcattcc ggcccagact 113221 ctatgccttt tccgtgatct tcacaaccct tagaccaggc aattccctcg ggtgcctaca 113281 ccacccaggc cctgggtttc aagcacaaaa ctgggcagcc gtttgggcag acaccgagct 113341 agctgcagga ttttttcttt ttcgtactcc agtggcacct gaaacaccag tgagacagaa 113401 ctgttaactc ccctggaacg gggctgaagc cagggagcca agtggtctag ctcagtggat 113461 cccaccccca cagagcccag caagctaaga tccactggct tgaaactctt actgccagca 113521 caggagtctg aagtcgacct gggatgctgg agcttggtgg gaggaggggc ttctgccatt 113581 actgaggctt gagtaggtgg ttttcacctc acagtgtaag caaagccacc gggaagtttg 113641 aactgggtgg agcccactgt agcccaactg cttctctaga gtcctcctct ctgggcaggg 113701 catctctgaa aaaaaggcag cagcccagtc aggggcttat agataaaact gccatctccc 113761 tgggacagag cacctggggg aagggtgcta tgggtggcta ttggtgcagc ttcagcagac 113821 ttaaacgttc ctgcctgctg gctctgaaga gagcagcaga tctcccagga cattgctcga 113881 gctctgctaa gggacagact gcctcctcaa gtgggtctct gacccccatg tctcctgact 113941 gggaaacacc tcccagcagg aatcgacaga caactcacac aggagagctc cagcctgccg 114001 tctggtgggt gccctctgga acaaagcttc cagaggaagg aacaggcagc aatctttgtt 114061 gttctgcaac ctctgctggt gatatccaga caaacagggt ctggagtgga cctccagcaa 114121 attccagcag acctgcagca gaggggcctg actgttagaa ggagaactaa caaacagaaa 114181 tgaatagcat caatatcaac aaaaaggact tccacacaga aaccccattt gaaggtcacc 114241 aacatcgaag accaaaggta gataaatcca tgaagatgag gaaaaaccag cgcaaaaagg 114301 ctgacaattc caaaaaccag aatgcctttt 
ctcttccaaa gaatcacaac tcctcgccag 114361 caagggaaca aaactggaca gagaataata aactcctcca agccaaagga ccatgttcta 114421 acccaatgca aggaagctaa aaaccttgaa aaaaggttag aggaattgct acctaaaata 114481 agcagtttag agaagaacat aaatgactga tggagctgaa aaacacagca caagaacttc 114541 gtgaagcata cacaagtatc aacagccaaa gcagaagaaa ggatatcaga gattgaagat 114601 cacctttatg aaataaagtg tgaagacaag attagaggaa aaagaatgaa aagaaacgaa 114661 caaagcctcc aagaaatagg ggactatgtg aaaagaccaa acctatgttt gattgtccct 114721 gtacctgaaa gtgacaggga gaattgaacc aagttggaaa acactcttca ggatattatc 114781 caggagaact tccccaacct agcacgacag gccaacattc caattcagga aatacagaga 114841 acaccacaaa gatactcctc aagaagagca accccaagac acataatcgt cagattcacc 114901 aaggttgaaa tgaaggaaaa aatgttaaaa gcagccagag agaaaggtcg ggttacctac 114961 aaagggaagc ccatcagact aatagtggat ctctctgcag aaaccctaca ggccagaaaa 115021 gagtgggggc caatattcaa cattcttaaa gaaaggaatt ttcaacccag aattttgtat 115081 ccagccaaac taagcttcat aagtgaatga gaaataaaat ccttcacaga taagcaaatg 115141 ttgagagatt ttgtcaccac caggcctgca ttacaagagc tcctgaagga agcactaaat 115201 atggaaagga aaaattggtc ccagccactg caaaaacata ccaaattgta aagaccattg 115261 acactatgaa gaaactaatg ggcatcaact aatgggcaaa ctaactggct agcatcataa 115321 tgacaggatc agattcacat ataacaatat taaccttaaa agtaaacggg ctaagtgcca 115381 caattaaaag acacagactg gcaagttgga taaagagtca agacccatca gtgtgctgta 115441 ttcaggagac tcatctcaca tgcaaagaca cgtgctctaa ataaaggaat ggaggaagat 115501 ttaccaagca aatggaaagc aaaaaaaaaa aaaagcaggg gttgctaatc ctagtctctg 115561 ataaaacaga ctttaaacca acgaagatca aaaaagacaa gaagggcatt acataatggt 115621 aaagggatca atgcaacaag aagagctaac tatcctaagt atgtatgcac ccaatacagg 115681 agcacccaga ttcataaagc aagtccttgg agacctacaa agagacttag actcccacac 115741 aataatagtg ggagacttta acaccccact gtcaatattt gacagatcaa tgagacagaa 115801 aattaacaag gatattcagg acttgaactc agctctggac caagcagacc taatagacat 115861 ctacagaact ctccacccca aatcaacaga atagacattc ttcttagcac cacatcacac 115921 ttattctaaa actaaccaca tgattgggag taaaacactc ctcaacaaat 
gcaaaacaat 115981 ggaaatcata acacagtctc tgagacctca gtgcaatcaa attagaactc aggattaaga 116041 aactcactca aacctgcaca actgcatgga aactgaacaa cctgttcctg aatgaatact 116101 gggaaaataa tgaaattaag gcagaaacaa agaagttctt tgaaaccaat gagaaccaag 116161 acacaacata cccaaatctc tgggacacag ctaaagcaat gtttagaggg aagtttatag 116221 cactaaatgc ccacaggaga aagcagaaaa gatgtaaaat tgacacccta acatcacaat 116281 taaaagaact agagaagcaa gagcaaacta attcaaaagc tagcagaaga taagaaataa 116341 ccaaggtcag agaagaactg aaggagagag aaacacgaaa aacccttcaa aaaatcaatg 116401 aatccaggag ctggtttttt gaaaagatta acaaaataga tagaccacta gccaggctaa 116461 taaagaagaa aagagagaag aatcaaatag acacaataaa aaaatgctaa aggggatatc 116521 accactgatc ccacagaaat acaaactacc atcagagaat actgtaaaca tctctatgca 116581 aataatctag aagaaatgga taaattcctg gacacataca ccctcccaag agtaaaccag 116641 gaagaagtcg aatccctgaa tagaccaata agaagttctg aaattgaggc agtaattaat 116701 agcgtaccaa ccaaaaaaat cccaggagca gatggattca cagctgaatt ctgccagagg 116761 tacaaggagg agctggtact attccttctg aaactattcc aagcaataga agaaaagaga 116821 gtcctcctta actcatttta taagccaaga atcatcctga taccataacc tgccagagac 116881 acaacaaaaa aagaaaattt cggccaatat ccttgatgaa tatcgacgtg aaaaccctca 116941 ataaaatact ggcaaactga atccatagca catcaaaaag cttatccacc acaaacaagt 117001 cagcttcatc cctgggatgc aaggctggtt caacatatgc aaatcaataa acgtaatcca 117061 tcacataaac agaaccaaag ataaaaacca catgattatg tcaatagatg cagaaaaggc 117121 cattgatgaa attcaacact gcttcatgct aaaaactctc aataaacttc atattgatgg 117181 aatgtatctc aaaataataa gagctattta tggcagaccc acagccaata tcatactgaa 117241 tcagcaaaag ctggaagcat tccctttgaa aactggcaca agacaaggat gccctctctc 117301 accactccta ttcaacatag tattggaagt tctgttcagg gcagttagaa aagaggaaga 117361 aataaagcgt attcaagtag gaagagagga aattaaattg tctctgtttg cagagacaat 117421 gattgtctat ttagaaaacc ccattatctc agcccaaaat ctccttaagc tgataagcaa 117481 cttcagcaaa gtctcaggat acaaaattaa tgtgcagaaa tcacaagtat tcctatacaa 117541 caataataga caaacagaga gccaaatcat gagtgaactc ccattcacaa atgctacaaa 117601 gagaataaaa 
tacctaggca tacaacacac aagggatgtg aaggacatct tcaagaacta 117661 caaacttcac tgctcaagga aataaagagg acacaaacaa atggaagaac attccatgtt 117721 catggatagg aagaatcaat cttgtgaaaa tggccatacc actcaaagta atttatagat 117781 tcaatgctat ccctatcgag ctaccattga cttcctttat aaaattagaa aaaactactt 117841 taaatttcat atggaaccaa aaaagcccat atagccaaga caatcctaag caaaaagaag 117901 aaagctggag gcatcatgct acctgacttc aaactatact acaaggctac aataaccaaa 117961 acagcatggt attggtacca aaacagatat attgaccaat ggaacagaag agaggcctca 118021 gaaataatgc cacacatcta caaccatctc atctttgaca aagatgagac aaaggattcc 118081 gtttaataaa tggtgttggg aaaactggct agccatgtgc agaaaactga aactggaccc 118141 ctttcttaca ccttatacaa aaattaactc aagatggatt aaagagttaa atgtaagacc 118201 taaagccatg aaaaccctag aagaaaacct aggcaatacc attcaggaca taggcatggg 118261 caaagacttc ctgaccaaaa caccaaaagc aatggtaaca aaagccaaaa ttgacaaatg 118321 ggatctaatt aaactaaaga gcttctgcac agcaaaagaa actatcatca gagtgaacag 118381 gcaacctaca gaatgggaga aaatttttgc gatctatcca tctgacaaag ggctaaagtc 118441 cagaatctac aaggaactta aacaaattta caggaaaaaa gcaaacaact ccatcaaaaa 118501 gtgggtgaag gatatgaact gacacttctc aaaagaagac atttatgtgg ccaacaagca 118561 tatgaacaaa agctcatcat cagcggtcat taatgaaatg caaatcaaaa ccacaatgag 118621 ataccatctc atggccgtta gaatggcgat tattaaaacg tcaggaaaca acagatgctg 118681 gagaggatgt ggagaatagg aatgctttta cactgttggt gggagtgtaa attagttcaa 118741 ccattgtgga agacagtgtg gcgattcctc aaggatctag aaccagaaat accatttgac 118801 ccagcaagcc cattactggg tatataccca aaggattata aataattcta atataaagac 118861 acatgcacac atatgcttat tgcagcagta tttacaatag caaagacttg gaacccaccc 118921 aaatgcgcat caatgataga aagataaaga aaatgtggca catatatacc atcgaatact 118981 atgcagccat gaacaaggat gagttcatgt cctttgcagg gacatggatg aagctggaaa 119041 ccatcacttt cagcaaacta acacaggaac agaaaaccaa acaccgcatg ttctcactca 119101 taagtgggaa ctgaacaatg agaacacatg gacacaggga ggggaacatc acacatcggg 119161 gcctgtcggg gttgggggtt agaggaggga tagcattaag agaaatactt aacatagatg 119221 atgggttgat gggtgcagca aaccaccatg 
gcacgtgtat acctatgtaa caaacctgca 119281 cgttctgcac atgtatccca atacttaaag tataatactt ttaaaaaatg gtgaagacaa 119341 gccttttttt aggactgttg ccataatgac cttgtagact gatttgaaat atagtgtatc 119401 agctttggca acatagggaa accctgtctc cataaaaatt taaaattacc tgggtgtggt 119461 gatgtcaacc tgtggtccca gctggttggg aggctgaagt gggaggattg tttgagccta 119521 ggaggttgag actgcagtga gctttgattg taccactgca cttcagcctg gtgacagatc 119581 aagaccttgt ctaaaaaaaa acaaaaagga aagaatatag cctacatata caaccaattc 119641 aaaatatagt gaatcttggc caggatctgt ggttcacacc tataatccca acactttggg 119701 aggctgagat gagaggatca cttgagacca ggatttcaag accagcctag gcaacatagt 119761 gacaccctgt ttccaaaaat tagccaagtg tggtggtggg cacctgtagt cctggttact 119821 agggaggctg aggtgggtgg atcacttggg cccgggagat tgaggctgca ttgaggtatg 119881 attgtaccat tttactccat cctgagtgac agagagaggc ctgttctcaa ataaataagt 119941 aaataaataa aaagaatgaa atatagtgaa tttcattatg taatgcatcg gtttttaaaa 120001 tttaatgaca tctgattctg attctttgaa tactttcact gcttaagctg ctgggttggg 120061 tttttgtttt tgtttttgtt tttgtgtttt tgagacggag tcttgctctg tcatccaggc 120121 tggagtgcag tggcgcgatc tcggctcact gcaacctccg caggttgggt tttttcactc 120181 tcaggtataa tagatcaagg agtcacctta cactctttat gtttaaggat ttctgatttt 120241 tgaaacaggt tgcacaagaa acatgtataa gattaaatat tctgtaaaag gcttcccttt 120301 atgtttcaat agcattcttc tttaaaagac ctattttttt ccacaataag acaccatgtt 120361 cttttactat aaaattgatc aaagaggaat cgattaaaac tttaaagaat tctctcatta 120421 aaagtttcga aggccataat tgaaattgta cataacattg agaacagggg atatgcaaaa 120481 agaaaaaaga aaaaaatcaa atgagagagt atgtgagact taaaacagaa ataaattgat 120541 ttaagttcaa aagaaagtgt gagccaggca tagtggtaca ctcctgtagt cgaagctact 120601 ctagaggctg agacaagagg atcacttaag gccataggag tttggggctt cattgctctt 120661 tgatgacacc tatgaatagc cacagcactc cagcctgggc aacataacga gacactgtct 120721 cttgaagaaa gaaagaaaaa agtatgggac attgatattt taaaatttaa aaagaattat 120781 tctaaactta cttcagtcat tttgcatgca gcaaatgttt gccatggtga aatgtgactt 120841 ccatctctta atttacactt atgcaaaaag cagttttctc atttgggatt 
cttttgatgc 120901 tcattatgat aaaattttct atttaggtat agaaacaaat aaaccttttt caatttgaga 120961 aatgaacatt tgtaatttat gttgacagaa tatatttatt ttctataaaa atatctttta 121021 gcctaaggag aaaaaatctt aaattctaga ctagtattac cataagtcta tcggtatcat 121081 ttcgaagtga tttaccttcc tttcattatt gtaaacactt aatactttat gttacttttt 121141 accggaatgt gagtcatcta cccaagttca aagagttgaa ccaactttta tagagttgga 121201 tgggagtaga agaggaaaga tttaaagggt gatgaatgat gattagagca aagtaaaagt 121261 tgatgggaat gatggttgaa gcaggttcaa acagtaggga aaggaaatca agaggagtga 121321 aaatgttatt gtggtaagat ttaaaacttg aataagcata tttatctgtg catacacttg 121381 agttgtccct tcttccttat cttttttttc cgaactctct tgttgcttct tcatctccaa 121441 ctcttagaat catatcacgt cttttctatg gagtctcctt aatctcatca gttccttttc 121501 tgtacttcca cagtacttag tttgcctttt aagacatata cgtgcaattc aattaagaat 121561 gaaaaacaag gccatgtgtg gtggctcatg catgtaatcc tagcactttg ggaggccaag 121621 gcgggcagac ctcttgaggt caggagtttg aaaccagcct ggccaacatg gtgaaactct 121681 gtctctacta aaaatacaaa atgttatcca ggtgtggtgg ctcacacctg taatcccagc 121741 tactcagaag gctaaggcag gagaatcact tgaacctggg aagcagaggt tgcagtgacg 121801 gagatcatgc cactgcactc cagcctgggt gacagagcaa gactctgtct caaaaaaaaa 121861 gaaaaataaa agaatacaaa acaaagaaat aattgcaata caagatataa caacagctat 121921 tagtaacaca tagacatgag aatctacgaa tttagtaaag agcttcttga ggagatgatg 121981 tgtaatgaag gctagccttt acagttgaat acaaatttta aatttcaggt cttgaaggct 122041 agagcattcc tggtaataat gtcccattaa tatagtcatg ctgcctttct catccaaaaa 122101 cacagaaaac tatttgagtg actattttct ccattctctc aaggatgcta tgtaacctag 122161 tcctactctg aattcctgga agatgttaat tcttcacata caatatatgc cttgtggcat 122221 aaaggctgaa tttggtcata aaaccttagt tgggtaaggc taggtgggtt gtagctttca 122281 aaaatttttc acctagttta gacgctgcaa agtgtaatga aaggaccaat ttcactaggt 122341 gttaagtgac ttggattcta atccaagtta ctaaaaatta atgtgcgtct cctgagcccc 122401 acctactcag gaggcagagg taggaggatc tcttgagccc aggaattcca gtttagcctg 122461 ggcaatatag agagacctca actcttaaag gaaaaaaaaa atgacatgac cttgagcaaa 122521 taatttagct 
tctctattgt atatttgtaa aaggagagta ttaaattaga cctatagaat 122581 tcttttcagc tcttagcttt ctatctaatt cagtctcccc actaattttc aatgacaaga 122641 aagaacatat attattgttt caattttaca gaccaagaaa ctaaaattca gagaagttgt 122701 gacttgctta ggattacaaa gtgattattg gtggagaatg acgaagtcaa aactaaaaac 122761 tgggtcctca caagaagtac ccaacccagt tccttataca tgtttgtata atgaaagaat 122821 acaacttagt gagtgcatgc ataaaggata atagaaagca ctgtgaccta gagatccaga 122881 atccttgatt tctaaatttt gaccttacct tgggcaaact ctgccctgaa ctttaagatg 122941 gatatgacat gcccttttga ccttatgaag agttttcaga ataaaatgaa atgaaaacat 123001 attctgaatg taaacctcta aaaagatact attatagatg acatcaccta taatatctta 123061 actttataga aataaaggtt atgttgtggg aaatcatgga acaataagat ttgttttgtt 123121 ttcttgacca gataggatgc ctgtgctctg tcagataaag aagtgatcat gtgatgcatt 123181 tggtatgcct tttttagcac tgctaatgta cagccaattt aaatcaaata gacataggct 123241 ttgcttataa aaaccacata tatttctaag ttagtttgct tggtggttaa cacattgcat 123301 attgcattag tcttaatatc ctctaatgtt ggagctagca ctggatccta gaaattcaga 123361 ataaaagtcc accttacccc tgttacaact agccattcat cttctttggt atatagttta 123421 gctctccttc cagtcactca agtacaactt aggaaactaa aaccatagct gattattgct 123481 ggagcaggaa ctcgaatcct agtttcttga cttctagtca aatgttctcg tagtaacagg 123541 tagctttcac cctttttttc ttactgatgt tgtgctagtt cgttcttgca ttgctgtaaa 123601 gaaatacctg agactgggta atttgtaaag aaaagagtag ggtttagttg gctcacggtt 123661 cttcaggctg tacaggaagc atggcatcag cttctcctca gcttctggtg agtcctgagg 123721 aagtttatat taatggcaga agtcggaggg gaaacaggca tgtcacgtgg tgacaatagg 123781 agcaagagga aggaggaggt cccatgccct ttttaaacaa gcagatctct tgtgaactaa 123841 ctgagcgaga gctcggttat caccaagggg gtggtgctaa accattcatg agggatccac 123901 ccccctgatg tagtcatctt ctaccaaacc tcacctccag cattggggag gtgttttcaa 123961 cacgagtcat cttctaccaa acctcacctc cagcattggg gaggtgtttt caacacgagt 124021 catcttctac caaacctcac ctccagcatt ggggaggtgt tttcaacacg agtcatcttc 124081 taccaaacct cacctccagc attggggagg tgttttcaac acgagtcatc ttctaccaaa 124141 cctcacctcc agcattgggg aggtgttttc 
aacacgagtc atcttctacc aaacctcacc 124201 tccagcattg gggaggtgtt ttcaacacga gtcatcttct accaaacctc acctccagca 124261 ttggggaggt gttttcaaca cgagtcatct tctaccaaac ctcacctcca gcattgggga 124321 ggtgttttca acacgagtca tcttctacca aacctcacct ccagcattgg ggaggtgttt 124381 tcaacacgag tcatcttcta ccaaacctca cctccagcat tggggaggtg ttttcaacac 124441 gagtcatctt ctaccaaacc tcacctccag cattggggag gtgttttcaa cacgagtcat 124501 cttctaccaa acctcacctc cagcattggg gaggtgtttt caacacgagt catcttctac 124561 caaacctcac ctccagcatt ggggaggtgt tttcaacacg agtcatcttc taccaaacct 124621 cacctccagc attggggagg tgttttcaac acgagtcatc ttctaccaaa cctcacctcc 124681 agcattgggg aggtgttttc aacacgagtc atcttctacc aaacctcacc tccagcattg 124741 gggaggtgtt ttcaacacga gtcatcttct accaaacctc acctccagca ttggggaggt 124801 gttttcaaca cgagtcatct tctaccaaac ctcacctcca gcattgggga ggtgttttca 124861 acacgagtca tcttctacca aacctcacct ccagcattgg ggaggtgttt tcaacacgag 124921 tcatcttcta ccaaacctca cctccagcat tggggaggtg ttttcaacac gagtcatctt 124981 ctaccaaacc tcacctccag cattggggag gtgttttcaa cacgagtcat cttctaccaa 125041 acctcacctc cagcattggg gaggtgtttt caacacgagt catcttctac caaacctcac 125101 ctccagcatt ggggaggtgt tttcaacacg agtcatcttc taccaaacct cacctccagc 125161 attggggagg tgttttcaac acgagtcatc ttctaccaaa cctcacctcc agcattgggg 125221 aggtgttttc aacacgagtc atcttctacc aaacctcacc tccagcattg gggaggtgtt 125281 ttcaacacga gtcatcttct accaaacctc acctccagca ttggggaggt gttttcaaca 125341 cgagtcatct tctaccaaac ctcacctcca gcattgggga ggtgttttca acacgagtca 125401 tcttctacca aacctcacct ccagcattgg ggaggtgttt tcaacacgag tcatcttcta 125461 ccaaacctca cctccagcat tggggaggtg ttttcaacac gagatttgga agggacaaac 125521 attgaaacca tatcagacat aaaataggaa acttggtcca tagcttaggt aaatattaag 125581 caaagaactt tgtctttaaa tgagtaagct ctgcagactt tttaaaacac taaattctaa 125641 cattatgata gaaatttaag attaagagat tttagacatt ttcatccttc tgtagattaa 125701 aatcgtatga tgaccatcct tcatttttag cttcatcttt atttatttaa aaagtttgca 125761 tttgtacata gtaatgtgcc aagttttata caaaagcagg aaatatctct 
gctgctttca 125821 aataattaga acaagggttt tctgtgaata gtaaatttaa caccttccag catagattat 125881 tatcagaaat ccaagtactc ttcagttaaa gtaaacactt attgacttgt atcttttacc 125941 tattactgag gggtagaagt taaaaaggat tggtctctca ggatgctcct ggaagctgca 126001 tgcttgtggc tataaagcta ttctctgata tacctgaatt cttttgctcc cagtagtgaa 126061 attgtgtgtg tattaaaagt ttaagcattg tttactccct gatttatatt tgctgatttt 126121 tcatattatt gtaatgttat aaaatggaag tgcaaaaaag ggttgtttta gtcaaaccaa 126181 gttcggtgct ttgaaatgac tcagaggcaa gttgcagaaa atattgtaga cttagatgta 126241 gaaaagatat gagggggcaa cgtaagaaac tagaaaggat tgctaggtgc agtgacacat 126301 gcctgtagtc ccagctactc tgtaggctga ggtgggaaga tcactggagc ccaggagttc 126361 tgggtgtgcg ctatgctgat taggtgtcca cactaagttc tgcatcaata tggtgacctc 126421 ctgggagcag gggactacca ggttgcccaa ggaggggcga acccacccag gtcggaaagt 126481 agaaggattg tgtctctagg cctcagactc ctttgcattt ttctgtcctc accctgcaca 126541 tagtgaaaat cataaaccca cattattggt gtggtttata gaagaaaaac aacttagaac 126601 taaaaaaaaa acaaaaaacc ttgaccttac gtaaagagat tttcaaatga atacactttt 126661 tgtgttttta aaattaaaat gttatatgga attatatata ttattttaaa atgatttcct 126721 gcttagcttt ttaaattaac tggtgccaat ctctcaaata agacaattta tgtaccagca 126781 gtaatcaact ggaaaatgta atgaaaaata ttcaatttac agcaataaca aatagtacaa 126841 aataccttag gaataagctt aataggcaaa gtgcaggatg tagatgaaaa caaagcaaaa 126901 gactacaaaa ctttaataaa ggccataaga gaatacttga acatgtgcca ggcagttttt 126961 ctgggacaaa tctatacatt taatgccaat ccagtaaaaa tctcagtatg ttttcagtgc 127021 atgtttttac aagtttgttc acctgtagat actgtgggtt atatcaccta gcagtttcat 127081 ttctaagaaa aaaatcccct cctaatttgc attctccaca ccttcaccca aaccagatta 127141 ttattagact tttaacaggt caaagtagta tcttgtttag tttgcatttc tttagttata 127201 aggagtccag aaataatccc atacccaggg gaatttaatg tatgataatt tatccacttc 127261 aaatcagtgg agaaaattca ggataattca gtaaatgctc atgagttatc aggatagcaa 127321 tttgaaaaaa agaggtagac tttgatttat tctttatacc aaaataaatt ctagctagat 127381 caataatctc aataaaaaat gactccattt aaaaaatgac tccatataag ttctggaaaa 127441 aaatgagcac 
gttagaaaaa taacagtaga ggccaggcac tgtggctcat gcctataatc 127501 ccagcacttt gggaggcgag acaggtgggt catctgaggt caggagtttg acaccagcct 127561 agctaacgtg gtgaaacccc gtctctacta aaaatacaaa aattagctgg gcatggtggc 127621 acgtgcctgt agtcccagat actcggaagg ctgaggcagg agaatggctt gaacccaggg 127681 ggcggaggtt gcagtgagcc gagattgtgc cactgcactc tagcctgggt gacagagtga 127741 gtctccatct caaaaacaaa caaacaaaca aacaaacaaa caaaaaaaca gtagaggcct 127801 gctgtggctc acacctgtaa tcccagcaaa gaggctgaag cagaaggatt gatttctggg 127861 gcccaggagt ttggggctgc agtaggccat gccactgtac ttcagcctgg gtgacagccc 127921 aactctaaca attaaaataa aataatatat ttaaaaatag cagtagaggc tttctaagta 127981 ttatataaaa tctagtgttt aaaaaaagat agggtttgac taaataattt tgttgcaaga 128041 acaacataaa gtggaatgaa aacatcagtc catcatatga atcatatgag ttataacatt 128101 tacagaatag ttaaatgaca aattatggaa acaagtctgt tagaacaaac taacaaaaat 128161 aacaacccat tcaaaaaggt gggagatgca tctaaacaag catatcattt tgtccttaac 128221 ttaaaagtgg ttctcagttt taggtaccca aatttaatat ttgtattttt tttaagtttt 128281 ggccttcaaa ataaaaatga ctatacattt gattagactt ctttcttttc tttttattat 128341 agatgattga tttttaagga ctagtgctat gaacattaaa gcaagtatgc ttaaaaccaa 128401 aaacatattt tgactttagg catttatgca gataaatgtt tcgctcttaa cccacttgga 128461 attcaaagag actagctctt taacagatac tgatactcaa aggtaaggaa gaattattgg 128521 tatgattcta agccactagt catctattcc cagagcaaaa tttctgtggt tacttttttt 128581 tattggtttt ctagtgtgct tgctttaatc aactgttttc acatgtttgg aaaatttgag 128641 ggtttcctgt ctatactaga agatacgagc actactccag aaaggttaga aacaaaagac 128701 caaaggacgg cttctacaaa aggccccctg ttaaagccca gccgtttgtt gttgttttag 128761 ttaaacagaa gtggactatt tcaacctaac cgaaaaagat tgctggattt ttgccctggc 128821 catataaata ggaaattaag acaggcatat taaaattttt tcctgcttat gttgtcttca 128881 tctctgggtc aactgatact tatttttttt tccagtagca aaatatttat tgagtgtaac 128941 tgtgtaccag gtgcttttct aggtgctggg aatatagcag tgaataaaac aaaatccctg 129001 agcttatatt ttatttggag aagacaatca ataaataaac aattatacaa ctggtcatat 129061 ggtgataagt tctctgaaga aaaataattt 
aggaggaaaa agggggtggg cttttactgt 129121 tctacatagg gaggtcaaat aaggccttgc taataagggg acatttgaac agagacttga 129181 aagaagtgaa ggaatgaatc atgtggttgt ctcagggaat taacctttaa ctgatcaaca 129241 tctttaaaag gccttctcca atctgtattt ccagctcacc tcccagaaat tcactcacca 129301 tgttcatgtc cccaagatct ttggggcaaa tagaaatttc taaacagtat ttcaacttgg 129361 taagattcta catttctgct tctgattctt atttttcatg gcacctcata aaataggaat 129421 gcatgttttg tttttttgtt tttttttttg ggctacttgc cagtgttctg aaactaaata 129481 cttctcggag agtagctgtc tctcattaaa gagccccaga tcaacagttc catttcagtc 129541 acattccacc ctgacttgtc tcttctcacc attcggtgat taggttcatg tcttttactc 129601 tttaatgctt tggttagcat cctgtttacc tcaagagact gaactttctt ctcctgaatt 129661 taatttccct gatagcaggg acatctgaca gcctatgtca actgtcttcc ttgacttctg 129721 tcaattattt gtcctaacta tgcttgagta agcgtacagg aatataaaat tttatttcct 129781 tggaataaaa ttttaattta taatatagga gagatattac tattaaggct cttaaaaagc 129841 aattataaga gacagaataa aattaaaagg aaaaaagata gcataaccct acaactctaa 129901 taaatctgct gtttatgatt ttccagattt aatttcagtc tttgctatta tgcttatata 129961 gttgaaatca gagaataata tctttaagat ttggaatggg atactgctct ttctttttat 130021 gtgctcttag ataagaactc tttctactgt gtcttataaa ctctgtcata ccataagttt 130081 ctcaaagtga ttttaaccac tgtacatttt aaacaatatg agaaataaat taatttctga 130141 tatgaacata catgtaagct actaaacctc agtattatga aatatctaaa acgtgactta 130201 ttagaaatag attcttttag acgttaattt ttttttcaaa aagatacata tgtacttttc 130261 tatttatgag tatacctggg taaatataga acataggaaa atgtactttg atgttttaaa 130321 acatttaaaa ttatttaaat ataccactaa ctctaaattt taaaattatc tttgattaga 130381 agcattaaaa ttcttattta agatgtatac tatttgaaga agagaattgg gattctgttg 130441 ccttgctttg taacttaaaa gagctttttc agggactccc tcagagtacc tgctggagat 130501 gaatctgtaa aaaatttttg ttcagagaat ttctgccact aaataagtct taaataagta 130561 tgccacatag gatagactgt atttaaatga aaccattttt tataacttac accaaatatt 130621 atgacaaaag cataattgtt tcttgataaa actgttccac tgttttattt gttttactgt 130681 taacaaaaat gcattttttt ttttttttac ctttagtagg tatgcattgt 
attacaaaac 130741 aatgatatct tttggaaaac tttcagttaa actagacaaa taaggacttt ggcttattca 130801 gatttattcc attcatcatc tttgaattag tttcttcttg agaagaagca tatcttacaa 130861 cttttgtctt tattacctca atatgtttta tttatatgta tatacatata tacttttttt 130921 cagatagggt cttgctatgt tgcccaggct ggtcttgaac tcctggactc aagttattct 130981 cctgccttag cctcccaaag tgctgggatt ataggcatga gccactgtgc ctggcctcaa 131041 tatttattta aaggttacaa gaatatatgt tcttaaaatg ttgagaatta aaataaataa 131101 tttctccaat cataatagct ttgtatttta ttatattata aattgaaaca ttgaagtttt 131161 aaaattcttc ttcaatgagc caggcctggt ggctcatgcc tgtaatcctt acactttggg 131221 aggctgaggc aggaggatca gttgaggcca gtagtttgag aacggcctgg acaacatagt 131281 gagaatctct ctccatttaa taaataaata aataaaatta ttcttcagtg aatctgtttt 131341 tggattcagt cttgataata aaaacggaga aaaaaaattt taacagatag attatttcaa 131401 tattgctctg tcagcaatat ctggaaatag cattagtcat ttttcttcag gtgcttttat 131461 ccaacaagcc taagactcct aagattccct tccccattag gaagctccag gcaagcaagt 131521 gaaaccagca gaaggcaccc agccacaaaa aacaccctgg cccaggaagt ctctttaacc 131581 ctacagccga gactctttct cagcccagga acaccaggcc tgtagagaga cactattcag 131641 gggtttccac tatacaccat tccaccaggc agcttgacct ggtgaaaatg tgtaggtcca 131701 tgtatccacc accacaagca attgcaccca tatccctcca gcctctctac ccctgtccct 131761 aacacctggc cactgctaat ctgcccttca attctcaaat tttgtcattt caaaaatgtt 131821 atgtaagtgg aatcaaacag gattagtttt ttttctcatt cagtataatt ttctggagat 131881 ccattcacga tattgcatgt atcaatagtt catacctttt tattgctgac tagtatttgt 131941 agcatgaatg tactacagtt tctttggcca gacacatgtt gaagaatatc aggacaggtt 132001 ctagtttttg gctatcacaa taagctgcta tgaacattta tgtttagtat cagtatgggt 132061 tttcttttct ctgggataaa tgcctaatga gtaaaattga agagttgtat ggttattttg 132121 cattttcttt gataaaagac tgcaaaagta ttttccagag tggttgaatc attttaaatt 132181 tccgccagca atatacgagt gagtgatcca gtttctctgc attctttcca tcatttggtg 132241 atgtctttat tttttatttt agtcattctg atagtgtaag gatatctcag tgaggttata 132301 atttgcattt cccatcagga tgatagctaa tgatattgat taacctttcc ctgtttattg 132361 tgacctgtat 
ttcctctctg gtgaaatttc tcttcatgtc tttgctcatt ttctgattgg 132421 attgtttgtt ttttaccatt gagttttgag ggttctttat atgttctcaa tactagtctt 132481 ttttgttaga tatgtggtta gcaaatattt tctcttggcc tgtagcttct ttcttcatct 132541 tcttcacatc atgtttctca tccaaaagtt tttaattttc ataaagtcca gtttatcact 132601 ttttcctttt atggattatg cttttggtgt caaatctaag aactccagat cctgaagatt 132661 ttcccttata cttaaaaaat tatagtttca cattttaagt ccatgatcca ttttaagttt 132721 atttttgtac aaagtttgag atttgggttg aggttttttt agggggtgct gagttgggca 132781 ccatggatgt tcagttgttt cagcaccatt tattttaaag gctatccttt ttctattgat 132841 ttgcttttgt atctttgtcc aaaagggaac aaagaaaaga tgagacaaat agatgtagaa 132901 tttaaactaa ttatattaat aattgcatta aatgtcagtg ctctaagcag cccaattaaa 132961 aggcagagat tatcagagtg gatgaaaaac aaaacccaac tctatgctat ctaaaagaaa 133021 tctattgtaa atataaagac acgaatatgt taaaaggatg aatgaaaaga ttaggtcctg 133081 gatggaactg aggttaactg aggataattt taagatttgt actatgggtg attaggtggt 133141 tgataacaca attaaccaaa gtaggtattg aaagaccatc tcatagctgg ttggtgatga 133201 aggaaatgag ttcatgttaa tgtttgaacc tattatattt gagatgtttg cacatattca 133261 tgtggatatc tttggtgtcc agttggaaat atggatctag aacttgaaag catagtttta 133321 gtcagaaatg tagacttagg aataatatag atatagataa aatcatgagg ccagttgctg 133381 tggctcatgc ttgtaatctc agcactttga gaggccaagg caggcggatc acctgcggtc 133441 agaagttcga gaccagcctg gccaacatgg tgaaacctca tctctactaa aaatacaaaa 133501 attagctgag tatggtggca catgcctgta atcccagcta ctcgggaggc tgaggcagga 133561 gaatcgcttg aacccgggag acggaggttg cagtgagcca agattgcacc actgcactcc 133621 agcctgggtg acaagagcaa gactctctct taaaaaaaaa atcatgatag tggctcagcc 133681 agcatatgta ttggcaagca gatctgcaga aatattagta ttaaaggacc agcaaggtaa 133741 aatggaaaag atcttcagaa aggccaaggt ctgccaaaat tcttctattt tctattctag 133801 tttctcttgc gaggcaaaac agtcttagaa tttagcttag ggtggcacaa aaacatttcc 133861 tgtaatgtag ttctaataat attccttaaa atacccagaa aacaatagca atttctgcct 133921 tccacttcct ctattgttaa tttttctact agcactactg acttgtactt actttgataa 133981 tcagaatctc ttttaggagc accttgactt 
aatagccttg gagtccttta ctttctagac 134041 aagttcatgc ttgccttaga aatgtatgtt agttatgaaa agtcatgtag aacagactta 134101 tagcttctac tttttctatc aatattattt acccagtgaa atttatacta atcttgctga 134161 cagtgagagc ctggtttcct tcacaagact aagtagcaaa acagagtgaa gcgttgttca 134221 gagagcaatt aggctgacag ttaatttgag tagataagta ataacatgag ttgagtacaa 134281 gagaaaacta ctgcccaagt cctggataga aacaatgctt tagaaagaaa tatacatatt 134341 ttactttcaa aagttaagta ttccatgaaa tattttcttt atagttaaaa aagatcaatt 134401 gctatattaa ttatctgtta gaacaactaa catttattat ctcacacaat ttctgaaagt 134461 cgggaaactg ggagtggctt tactgtgtga ttttgactca tgtttcctca tgagagattc 134521 ttaaattgtc aaccaaggct gcagctagtt taaggcttca ctgggattgg agagtctact 134581 tccaagttca ctcgtgtggt tgttggcagg cctcagttct ttgctggctg atggccaggg 134641 gcttcatttt ttttgccatg tgagcctctc cgtagggcta ctccatagca gctccctgcc 134701 tcaaccccag tggaagagat cctagagagg gagagaacct aagatggaaa ctgcagattt 134761 ttataaattc atctcagaag tgacatgcca tcacttctgc cctatgtcat tggcccacag 134821 actaaccctg gtacagttta ggaactgtct gcacaaggaa tataaatacc cagtgtcaga 134881 gattattggg gtgatcttgg aggctggctg ccatagttat gacttagtgt caattagaaa 134941 acaaatccta ttatcagttt tcttgtttgt gctttgctgt agttaatttg actcatgata 135001 tggctgcatc atatgtcaat attctagaaa tatttaccaa gtgtccatta tatgcttggc 135061 actggaattt tattttaaga actccctgaa agtcccatct gaaccctggt agtcacctgt 135121 ctaaaatcct tcagcgagtt tatattttgc tatttccttg ctatgaaatc tgtattttag 135181 cattatggaa agaaaaattt ctgtctcaaa ggcttgtgtg ccaggtgccc ttctaagcac 135241 tttacattta ttatctaact tagttgtcaa aacagcccta tagtaagcat tatttttctt 135301 atcttataga ttagctgcat gaagtttccc cagggccata ttattttttc ttctcttgct 135361 tctaagcaca ctattcttct gctttttact taaaataatt tgttctttct ttaggattca 135421 actcaagcat aatctcttcc aagaagtttt ctctaatgct cctaatctga gttagttgtc 135481 tcttctgatt ttcatggcac cctgtgtatt cttttattct ggtacttagc actttgtatt 135541 gctatttttt atttacttat ctgttttttt tttcattaga ctgtaaacta tagcataaaa 135601 gttcaaaaca aaggctctga aatcaaacag tttggaactg tgtcctggat 
ccaccactta 135661 ttcatgtgac tggacaattt actccttgtc tacctttctt cacttataaa atggcattta 135721 aaaaataatt accttctttc tagagtttgt tgtgagaatt aaataatata agtaatgtgg 135781 ttggtgctat gtctggcatt caataaatgt tatctgttaa gctctttgag ggtagaatta 135841 ttgattgctt atcatgtgcc atgtatcata ccaaactttt tatataggtt attttatttt 135901 cacaacaact ctatgtagtg ggtactacta tgaatgtgct atttctctta gcatctctag 135961 atgaatggat cagtgaatga atgatagatg gaattgtggc cagtggaact aattgtaggg 136021 gtaaacagca acctagaaac atggttaaaa gcatgacaat gggcaaagaa agagtgaaag 136081 aggcagtaag caaaggaacg tgactcccat actagtatag cagctaacgg cacaagtttt 136141 agaaacaggt agacttgggt ttgaatcact gctccatcac ttatcagcca tataatatgg 136201 ggcaaaattt ataatttcta taaaatacac atatctactt tatggacgta ttttgaagat 136261 taaatgagat aatatatgta aaatatttag catagaacct agcatatagt agatattcaa 136321 taaatattaa ctagtactat tattaaatta agataatcat gaagacagaa ttttggtagt 136381 gaggtagtga gagaaataaa atagaatcat gaagtataag gtaatgactt tgaatcatta 136441 atcatagtta tcgaaatcag gcatgataga aagattacgg ttgttaatct tccttatagt 136501 gcaatatcct caccagtcca cacaacattt ttgcccactg agtctgggat aaagttaaaa 136561 attaatcctc ataaactaaa gttaatgaaa tgtttagaat attctgcctt cttaccgtat 136621 catcacttta gcccaaatgc tgggaaaaag agaaacaaaa gtcttctgtc agctattgaa 136681 tgtgttatga tttagagaat ttctgagatt cacagtagag tctgtacttt ttacatttga 136741 tagacctcaa atttgcctga gtttttagtg ttccagaaaa taggctttgg ggaaggtttg 136801 aaagaaggcc tttttgaacc taagaagaca agaaattgaa ctgttatcac aatattgcta 136861 tttaagaaag aatattaagt cacttttcag aaatttatat cagaattaga aaaatgcaga 136921 aatatagttt atacttctta aggaggtggt ggaggagatt agatagatac atgcatacat 136981 atatacataa gtctaaatat ttttatatgt atatgagaga agtatcagga atagggtagg 137041 aaggagatag aatagtagga ttacattaga atctgcttag gagcttttct tacatgtaga 137101 tattctctct ttttgggtat atcaaaatct tgggagtaaa agggaaacct ggcatgcatg 137161 tttaataaag cttccttggg gactctgata tatgctgcct cccaaataaa ttacaagcca 137221 tatcaggaaa taggagttta cagtataaaa tataaccgaa aactaaagtc agatgttcta 137281 caaaatatta 
aactattttt cctgagatag aagtggttga aatttgggag ataacttttg 137341 ctatgttctg tgaggcaaag aggagggagg ctagagttga tgaagacatc tttcttttat 137401 gtatgatatt gtctatgttg ggccctgtta attttcattt acttatgtac aaatagctaa 137461 tgcatatttt ttaactttaa tgtttgttat ttacatgcat agcctgatgg gcacatgatc 137521 caaagtttaa ttttttggtt gaaatttagt atttggcttc atatatataa agattggggt 137581 aaggcgatct gccttgcctt tcttctactg tgtcaaagaa tttatgatca agggctagag 137641 tcagtattaa ttatatgatt ttgttttaac caagaagaac ttagttaaat ttaaaattgt 137701 agttgcactc atagataaag ggttttcttt gttaaaaggt aaaaaagaga catctcttat 137761 acttgcttaa cttttcattt aaaaaattaa tgtattcata ataaatagaa aatgaaagga 137821 gcttaaaaag caaattgaac acatagttat cagagaaaac aattttaaaa ttatgatttt 137881 aaactctcat ctgtattgct aagaatgccc tatattgtaa tattcaaggt ttgcttataa 137941 ttaaaatgta gggaagaaaa aaagaatcct gcagacattc taaggatgac ctaggccatt 138001 gatagtctaa gcttgaaatt aatcattaga ttttctgaac tcttattaca ctggatctgt 138061 tcagtctgct ttaacaaaat accatagatt tggtggctta taaacaacag aaatgtattt 138121 ctcacagttc tgaaagctgg gaagtccaag atcaagtttc cagcagatct gatatctggc 138181 ggaggcctgt ttcctggttc atagatggca ccttcttacc atgtcctcag atggtggaag 138241 ggacaaggca gctctctggg gtccctttta taagggtact aatcatcctt ttcatgaggg 138301 atgtgttgaa actaaattgt gcctgaagac ccttatgtta cctttacata tcaaatttct 138361 attgtatctg gccaaccttt gattttaagg cactagggca agaatgacca acgggtttcc 138421 aaacatatag aagtagtaac atgtgcatat caacaaaaat cacttggtag gaacccaata 138481 ctagctaata agcctactcc tactgtctta gatctagttc tgtttcagaa ttggaagctt 138541 taaggattat gaaattctgg gaacttggag taggtccacc tagaagtaga agcagcaaaa 138601 cccaaaaaac caatacaatt tttaatagtg ttgtttaatt tttaagtgtt atttactaga 138661 aaataaaact atactgtgtt taatcatcaa gcaattttta acaaaaaact ttagcactag 138721 gaataggtta aacttgacac aatttgctat catagaatta tatatatggg taaagggtac 138781 cagttagatt cccattggca aaaacctggt gattttcatt ctgtggtgta gagatcctag 138841 ttaagaggca ggaattattt aaaaatgact ttaattttga tcatttaatt ataggaccta 138901 tttattcact tgtttttaac ttattcacaa 
atcttagtca tgagcattat taatagataa 138961 tttgatgcct tgtcacttag cagtacaagc atgtttattt ctgactttta aactcggaat 139021 tcacccttct tgtgtttttc tattcagtcg taacccactc cttctacatc tacaaaattt 139081 ggtgaggcat ctttgacttt catcatattc agacatgcta gcttcaagaa ctttcctatt 139141 cctcttttaa tgtctgtttc tcacattgta ttagttcctt ttatagtatt aattcttcct 139201 acagtatgaa ttttattttt tttttttgaa atggagtctt gctctgttgc ccaggctgga 139261 gtgcagtggc accatctcca ctcactgcaa gctctgcctc ctgggttcac gccattctcc 139321 ttcctcagcc tcccgagtag ctgggactac aggcgcatgc aaccatgccc agctaatttt 139381 ttttattttt tgtagagatg gggtttcacc gtgttagcca ggatggatat agtatgagtt 139441 tttattcatt tgcaaatatt catcatagat atttttgtct tatttcagtt agaacaggat 139501 ttttttcttt ggattgagaa ggtaactttc ctactgtact taggataaat tccacaaaaa 139561 ctctttaaca aggttttttt cctggctatt agaaaggagg aagaccctca ctgaattgaa 139621 gatagtaatg atgggtgagg actttgactt ctgttaacac ctcattcaga taagtgacta 139681 ttttagttta attacatgaa ttttgtcatc tcccttggag ctgcctccca gagctggtat 139741 atcagacatc ctcaagtaaa cctggaccca attttagaaa tcttggctaa ttttgagtta 139801 ggaagtcctt tctttaaaca atattttaaa attaacagaa caatcacaaa atatatttta 139861 catattgcac ttttcaaaat actttaaata tttaaaaaaa tacccaacac cacaaagtaa 139921 cttagtttaa aataaaataa aaatttttga aatcagacct aatcaaattg atatatctct 139981 cattttcttt gaaaattttg aaccaagata tcaattaata aatccacttt gaaagacatt 140041 ttaagacatt taattaataa atccatttga aacacatttt aaaatgtgtt tgttcatgtg 140101 tataaaatta tgtcattgat ataacattac tattccttta aacattgtta tgttcttaaa 140161 aatatctcct cttagtctat aatttagcga aaggattact gttttgaaag acttaaaaaa 140221 aatgcttagc tattttttag ctaagaaaaa agattccttt ttcagtaaat ggataaagct 140281 gaaaaataag aaattaataa aatgttttaa agaaagagtc ataaatatct taatatttaa 140341 ttctaggaag cttctctatt catttgcttt tttattttta aaaagaagtt taaaagagtt 140401 ttagttttaa tatctataaa cttcaaatgt tattccacac tcaaaaaata tttctaatgt 140461 attaagtcac agtttactaa aatagctaag ttttctgatt cataaaatga ataataacac 140521 ttaattttaa gacttatttt gaagattaaa tgaggtaata cattcaaata 
aaacctttta 140581 aactgaaagt actttcattc ttgtgtaagg aaaccatgaa agtaaaagac acatcattct 140641 attttgccag tgtatcttaa attaactagc ttttattgtc agtttagtgt tcagggcacc 140701 aagggtacat ttcatactaa gattgtgcaa gtgagagtaa atgcctcctt aaattttcca 140761 tcttaggcac ctccgttgct tttagacctg gctatgtttt atattgctta tatacatttt 140821 tttttctttt ggaaaaaaga taagctgaaa aacatttttg ttgatgacat gctattgctg 140881 aaataaaaac ccagaacata gtgatgtgta ggaaaaaact ctctcaaacc atgtttttct 140941 tctgctctca taccaccaca acagttgtca acacaaaaga cttctgtgac aaaatgtgtg 141001 ggtttttttc cccacacact aagcagcaga taccagctgg gcatcctcca actcagttct 141061 gatactgtct acttggagat agtgtcagat cccactggtt ggggtctcag tgcccatacc 141121 tcataccccc agacaccatc ctaggcctcc agaacttctg acctactggc ttcaaattgg 141181 ggttctcaca ccacactctt caggttcaat taatttgctg gagctgctca caaaactcaa 141241 ggatactggg cacggtggct caagctgtaa tcccagcact ttgggaggtc gaggtgggca 141301 gatcacttga ggccaggagt tcaagaacag cctggccaac atggcaaaac cccgtctcta 141361 ctaaaaatac aaaaattatc tgggtgtggt ggcatgcccc tgtagtccca gctactccag 141421 aggctgaggc acgagaatca cctaaaccca ggaggcagag gttgcagtga actgagattg 141481 tgccactgca ctcctgcctg ggccgcagag caagactgtc tgaaaaaaca aaaagaaaaa 141541 atcaaaggaa acacacttac cggtttatta caaaggatat tgcaaaggat acagatgagg 141601 agatgcgtag ggtgaggtat gggagcaggg gtgcagagct tccatgcctc tggggccaca 141661 ccctccagaa acctccatgt gttcagctat ccagaagctc tcctaatcca gtttttatga 141721 tgtcagcatt gggattttta tgatgtcagc attcctcccc tacactatag ggcaaaaacc 141781 atctctgggg agggtcttaa agacacacaa tttacaatga tgggggaaaa attagagtcc 141841 tgccttggaa caggttaaag gagggcaaga gaaggtccga ggcctgcccc tgaggcctac 141901 cacaccaaac gttataacaa cacaacagtt tttatatatt ttcagttttg cataatatga 141961 agttaaagac ttaagtaaat tgagtaaata gtagtttttt ttttttaatt tcacagatca 142021 atagagttaa ctcctgagtt atatgcatac aaccccaaag ttgtttgctg tctggagctg 142081 tgcctacacc cgcctggcat attcttgaga cattggagct aggaagttgg aaattgatag 142141 aaaattttgt cacttgtaac agagctaaag caatatggat tatggtggac cctatcactt 142201 ggaatacttt 
cctgaaagca acacaggtca tcattgccaa atatttttag gaccccaatt 142261 ctgtactaag tgctagtgat gtgataggga agccagacac aattgaacta gctatctcta 142321 tattagtgac ttatcctaaa acagagtttc tgttgtaatt ggtttactgt ttctcttaga 142381 atatttaaac gttttaccat ggtctttcag attctgcgtg atcatacagc aacctcttct 142441 ttagcaacct ctccactatc ccacccgggt actttagttc tggtggcctc tttcattttc 142501 tgaaagaaca tcagtttctt agcatatgat ctttcatctt ttggaagatc tctgcccctt 142561 ctcctttcaa agcacagcct ccttctcaca tttcaagact cagcatccat gttaccttcc 142621 cagatcagcc ttccttgaac ccaccccatc cacctggatc tccctgttta ttcttcatga 142681 gagaaccctg ttctttcata ggatttttct gaatttgtca tcacatactt atttccttgt 142741 ttgtgtgcct cttctcacca gaccctgagc accgtgtaag taggaacaaa gtattatttc 142801 aaagacccaa gtctgaaacg gtttatctgt ttatacttaa tggctaaggt atacatgtac 142861 acgtattcta tttgactaac atttgaggta aaaaacatgg gggaaaggag cagataacac 142921 atatttttag aggatttaga gtatatataa ctaagaaggt tccctttatt tgttattttt 142981 tgggagggaa tcctgctctg tcacccagac tagagtacag tggcacgatc tcagctcact 143041 gcaacctctg cctctgggtt caagcgattc tcctgcctca gcctcctgag tagctgggat 143101 tataggtgcg tgccaccatg cccggctaat ttttatattt ttagtggaga tggggtttca 143161 ccatgttggc caggctggtc ttgaactcct gatctcaggt gatccacctg cctcagcctt 143221 ccaaggtgct gggattacag gcctgagcca ccatgcctgg ccccatttta aatcttattc 143281 tcactttgag gtggagttaa tctcagaaaa ctttgagatt aagttatctt aatatcaacc 143341 ttctcacttt attcatttat tcaacaaatg tttattgagc ccttactgtg tgccaggtag 143401 ttttaaacat tggggatact attgaataag atagacaaag cccaagcctt cctgcatctt 143461 gcttgctagt agcgagataa taaataagaa aataaatgaa tttataacat aacctcaagt 143521 ggtgatcaat actatgaagg aaatagagca gaggtaggag tagcaaataa atgggaatta 143581 tggctgtctt aggcaagatg tttagacaag gctttgctta ggaggtgcta tttgaggaga 143641 tatgtgaata atggagtggg gtgagtcata ggaagcatgg tcaaagagta ttttgggcag 143701 aggaaacagc aagtacaaaa acccagagaa aggaagaatt atgccacagt catggaatag 143761 aaagatgacc aggatggctg gagtaagaag agtgttagag catgaggtct gagaggtatc 143821 aaggggccag aactcatgta gtgttttgta 
aaccaagata aggcatttga atttattcta 143881 cgtgtgacag gaagctcttg cagggttttg agtatggagg tgggttaggg agtaacatat 143941 tagctctatg aggacatcta ggaggcagca agagtagaag tatggaaacc agtttgtagg 144001 atgcttttgc agtgggacag aaaagagata agaattgctt tgactctagc aatggtacag 144061 gtaatgaaaa gaagtcaaac ttggaatata ttttgaattt agagccaata tgacttgcct 144121 ggtgtcttgg ataagtatga gagaaagaat aataatgatg actccttgtc ttttttgctt 144181 tgggcaactg gatacaagga gtgatttaat gatatggtaa agactaggga gaagcagtcg 144241 tttttggggg gtggtagagg atgattaaag tttattttat acatagacat caagtgaaga 144301 tgttgagtaa ggtcaaatag ggcagatgct tctgaaatac tccttcaagg ctatttagta 144361 tctgacaaat ggagcattgt tacatttatg gctctacttc ctagcataga catacaacat 144421 atactgggtg gatggatgga taactaggca tacagtcata cgtttgagga aatagagtgt 144481 tctttgtata atggcactca tagatgtaaa taatcacaac cataataata tcaaatatta 144541 tatattactc tgtgtcaagc actgttccca attcttaaca ttattaattc acttctcata 144601 tactatgatt taagttacta ctattatcat tagtatctct gctctacagt tgagaagctt 144661 attgagggtc atgtagctac aggtggccag gatttgaacc catgtactgt ggctccagag 144721 tttgtgctcc tacctactgc actatagttt tcccatgatc atgacatact catcatcata 144781 aaccaatatc tatatatctg tgtgaccatt tggtttgggg tgatggagtg ataaaagtga 144841 tatttgaatg aaatatgaag actattggtg aaaaatgcca ccttaatcct tggaagagaa 144901 atttactttt agtataatcc tattgggtga atctgaagca caactagatt ttatctcata 144961 tctgaaaggt atctaaaata tctaattagt gtacttaagt cctattaatc acacatcttt 145021 gcaaggtctc aagagtctcc taaaatttca ttgtttgaaa gaaaaattgg tagtaggcca 145081 ctcttgaata tatgtgggtt tatactcata ccattttagt taaagatttt caattcgatc 145141 catggctagg aagtataaaa tatttagtat aacagtatgt gtattagctg gaatgaattt 145201 cctaagtgtt aattgttagc attactttgc tcagatgaaa gccattattt acactgcagt 145261 gcccctaatg aatataggaa aagatgactt gtcaacaaaa attatatgat tggagttgta 145321 gagtctgctt ctttttccaa agcatacatg ccaccttctc ttttttctcc ccgccataaa 145381 atccagctcc tgtaggacac cttttcttct gatgcctcca taggagacaa ctcatgctca 145441 ccatctccac ctttaagaat tatttctgta gtgagtcact attacagata 
ataacccatc 145501 tcctgttggt ctgctgaaac tgaaaaaggg ctgaaaattg ttaaactttt gcagtgatat 145561 gtgtaatgac tatatgatta tcatgcctct ttttattcac agcttatcaa gtacctactt 145621 gacatgtaga aaggaaaaga gtaaagttaa gaaaagacag tttgtttctc ctgaaagttt 145681 agagttgaaa gcgcatcttt aacacattca tacatacatc caagaaattc aaataaaata 145741 atcaagccaa atcactgaaa gctgagaaaa gagttgaatg ttcaagtggc tacttgaagt 145801 cacttcatgt gatgtatcat gtaatcctgc aaaatgagtg ctgtattcct gtttttcaag 145861 taatgaaact gaggttcaga gagcaaaacg tttcattgct agtaaacttc aaactagtaa 145921 acaagaattt gagcttatat atctctaatt ccaggacctg tgcttttatt gctaatggct 145981 tttttaaaac ttgttcttca tgcttttttt tattgagtga attaagagca ggataaataa 146041 gaaaaatagt atttagtaat gtattgctta ggcatgagta acttcctatt tgaaagaact 146101 gttgcgtaat cgccgttttt ttttctttct ttttaaattt cttctccctg catagtttct 146161 gcatctccca gatcgccttt ttgttatttt gttttggcat atttatatta gggactttct 146221 ggaaatattt gattgccact catataaaag agtaaatcac taaaaattag ttagaaactc 146281 tgagcacctg gatgaggttt gggaacttca ttgtatagtt atttggtaga gcaatttctt 146341 aagaaacccc cattgcatat ctttaggttt tttatcctgt tactcagtac ctcactgccc 146401 aaagtacctg gtatcgccac tgccacatcc ttttggtggg ttaaaagttt atctggtttg 146461 cctgttggct ttccttaatg ctgagttcca attcagtttt ttctcaccta ctgagtaggt 146521 taccatttgt gtatctgcct tccaagcttt caaaattttg tatctccagc ctggacaaca 146581 tagggagacc ctgtctctac aaaaaaaatt tcagaaaatt agctgggcac catggcgcat 146641 tcctatagtc ccagctactc aggaggctgc agtgagccat gattgtgcca ctgtgctcca 146701 gcacgggtga cagagcaaga ccctgtctct taaaaaaaaa gtgttgtatc tgaaaggttt 146761 atgtctttta aaataagtca gtttattgcc gttttagtgg ggttttatga gagagagtta 146821 atgtctgcat tcatttggca tctttaacca atgttgttta tcttctgcag attccattta 146881 atttagtttg ttgtagtttt agtcatccag aagtttccaa tttatatgaa ttcagatctg 146941 ttcatatcct ttgcccactt ttgaatggag ttgtttgttt ttttcttgta aatttaagtt 147001 ccttgtagac tctggatgtt agaactttgt cagatggata gattgcaaaa attttctccc 147061 attctgtagg ttgtctgtgc actctgatga tatatatatt tttttttttt tttgctgtgc 147121 agaagctgtt 
gaatttaatt agatcccact gtcaatttct gcttttgttg tgattgcttt 147181 tggtgttttc atcatgaaat ctttccccat gcctatgtcc cgaatggtat tgcctagatt 147241 ttcttctagg atttttagag ttttcggttt tacatttaag tctttaattc atcttgagtt 147301 aatttttgta taaggtttac agaaggggtc tagtttcaat tttctgcata tgactagcca 147361 attctccgag gaccatttat ttatttaata gggaatttat ttaggaccat ttatttaata 147421 gggaatgctt tctccattgc ttgtttttgt caggtttgtc aaagatcaga tggttgtagg 147481 tgtgtggtct tatttctgag tcctctactc tgttccattc gtctgtgtgt ctgcttttgt 147541 actaatacca tgctgttttg gttactgtag ccttgtagta tagtttgaag tcaggtagtg 147601 tgatgtctcc agctttgttc tttttgctta ggattgtctt ggatatttgc gctctttttt 147661 gattccatat gaattttatg aattttggtt ccatgtgaat tagttttttc taattctgtg 147721 aagaaagtca atggtagttt aatgggaata gcattgaatc tataaataac tttgggctgt 147781 atggtcagtc ttacaatatt gattctttct atccataagc atggaatgtt tttccatttg 147841 tttgtgtcct ctttgatttc tttgagcagt ggtttgtggt tgttgaagag gcccgtcact 147901 tcccttgtta gctgtattcc taggtatttt attcattttg tagcaattgt gaattggagt 147961 tcattcatga tttggctctt tgcttgctta ttgttggtgt ataggaatgc tagcgattat 148021 tgcacattga ttttgtatct gagagtttgc tgaagttgct tatcagctta agaatgtttt 148081 gggttaagat gataatttga cctcctctct tcctatatga atatctttgt ttctttctct 148141 tgcctgattg ccctggccag atcttccaat actatgttga ataggattgg tgagagagag 148201 tatccttgtc ttgtgctggt tttcaagggg aatgcttcca gcttttcccc attcagtatt 148261 atgttggctg tggatttgtc atagatggct ctttgagata cattccttga atacctagtt 148321 tgttcagagt ttttaacatt atgttgaatt ttatcaaagg cctcttctgc aaccacagat 148381 aatcatgtgg tttttgtctt taattctgtt tatgtgatga atcacattta ttggtttgca 148441 tatgttgaac cagccttgca ttctggggat gaagccagct tgatcatggt agataagctt 148501 tttggtgtgt gctggatttg gtttgccagt attttattgt ggatttttgc attgctgttc 148561 atcagagata ttggcctgaa gttttctttt tttgttgtat ctctgttagg ttttgctatc 148621 aggatgatgc tggcctcata aaataagtta gggaggagtc ccttcctttc aattttttag 148681 aatagtttca gtagaaattt gtacctctgg tagaattcag ttgtgaatgc atctgatcct 148741 gggatttttt tttttttttg gttggtaggc 
tatttattac tgcctcaatt tcagaacatg 148801 ttattggtct acttagggat ttgatttctt cctagttcag tcttgggagg gtgtatgtgt 148861 tcaggaatgt attcatttct tctagatttt ttagtttatg tgcatagagg tgtttatagt 148921 attttctgat ggttgtttgt atttctttgg ggtaagggtg gtatctccct tgtcatttct 148981 gattgcgttt attccattct tctttctttc cttctatatt agtctagcta gtggtctatt 149041 ttattaattt tttcaaaaaa taagctcctg gattcattga ttttttaagg gttttttgtg 149101 tctctgtctc cttcagttct gctctgatct tggttatttt ttgtcttctg ctagctttgg 149161 ggtttgtttg ctctttgaac tggttttagt atcaatattg tactagcttc ataaaatgag 149221 atgaggaatg tttctagttt ttccattcca ggaagaaaat ctgagtaaaa gtggaattat 149281 ctgtcccttg aatgtttggt agaatttact ggtaaaacca aatgagcttg atgttttatt 149341 tatgggaaga ttttaaatta ttgattcaaa ttatttaata gttgcaggac tatgtaagtt 149401 attttatttc ttttttattg agttttggta atttgtattt ttgtagtaat tttccacttg 149461 gtctacgttt tccaatttat tgatataaaa ttattcataa taattcccta ttatcttttt 149521 aatctttgtg attttgtccc atttttcata cctaatactg cctatctatc tcttttcttt 149581 atcttgatca atctgactgg agatttgtta gttatttcaa agaatccatt tttggctttg 149641 ttgaccaaac tgtgttttat ctttgttttc agtttcatta agttctgctc atgtcttcat 149701 ttctttctct ccactttctt tgtttttttt ctgtctttcc tcctaacttc taacttattt 149761 tctaatataa gcgttgaagg ttatacattt ttctactttt ttgtgtgtgg gaagtacaaa 149821 aattgtcaat tgggaatttc gaagtagaga aaaatatttc attctgactt ttaaattgcc 149881 actgtcaatt gtgcctaaaa ttcatagtac ttaccatgtc aaacaatatg attttgatat 149941 gtacctggga aaattatgct tactaatgtg gttttaattt catcatgttt catataggat 150001 tcacctttat ttgatcttat taaacaatca aaggaccgag aaggaccaac tgatcacctt 150061 gaatctgctt gtcctcttaa tcttcctctc cagaataatc acactgcagc agatatgtaa 150121 gcaaaatata tgttatgttg accattcaaa ctgcaaatag attttaagca taagtgcaat 150181 gtaacattct ataaagaaac tgtagggaat agaattttga ataagaatag tttctgtttt 150241 taagaaatta gtaataaaag gtacatgacc caaataaagt catataaaag agtacagagt 150301 gctactgaat cacctaggat ttgcataatg agagcagttt tcatggcaga gttggcagtt 150361 taggtggaat aggatttcaa tgtggaaact ggagagaata gaggagacag 
aggagtgtgt 150421 tccaagagtc agaatggcat taactcaaaa atgtcaaagg aagcagaaaa catttttcag 150481 agtgtgcagg tcagtccatt tggagtgaag cataggataa gtgtagggag aagtatggga 150541 tgtggactaa cacacagttg attagtgtgt gctggtgatg acgatgggaa aagaaaatgt 150601 aaacccaaat gtatctgaaa caggtctcaa tcaatgtagt agtttatttt gccaaagtta 150661 aggacgtgcc caggagagag atctgtgcct ttctccagtg atgattttga gggctccaat 150721 atttaaatgg gaaagtcagt ctggagggga aagaaaaagg gtatggtaat ccacatgttg 150781 caagagaaaa agagcaggta gaggaatcgt caattatcta tccatctcat gctgagtgaa 150841 tcgacatttt acataagata aggtgaacat agagtcactg tctgtggagg tatttaacct 150901 tttatctgta tttatttgtg tgggaacaaa aggaaaggca gtttcttgca tgactcagat 150961 ttcagcttaa ttttttcctt ttggcatagt gaattggggt cccaagtgtt tattttcctt 151021 tcacaaaagc aatcactaac tgtgtgctgg caatggccta agtacttaag atttattttt 151081 tcatttaatc ttcataacag cccataaggt aacagatact gctgtcagcc tgttaactat 151141 aaaataggaa ttagagaaat aagataactt gcctaaggtc aatagaattc aaacccaagc 151201 tatctaatcc caaagtctag gctcttaatc actctaacta cattgccttt cacagaaatg 151261 tgcctaatat tttgtttctg tttaattctg tgtaggaact gacattacag tattctgtat 151321 tagtgcaggg gccattttcc agagatgtca ttttcatgct aacttgtacc caaagtgttt 151381 ctggcatatg gagaccctgc cttcatgtat acatttgttt cctaagtaat gcattgtata 151441 aataagttaa gccttgtttt tttaaaattc aagaagcagt catctgtaaa ctgccaggaa 151501 gtggtgtagg cactgggaat acagaagtaa gatggcctgg gtccttcccc tcatagagct 151561 tacattctgg aggaggaaca caggcaatga acaagtaaac caagatgtaa gatgatttta 151621 gagtataata agtgctacaa aggaaataaa cagagtggtg tgaaagagag aaacggagag 151681 gctgatagaa tgcttagtgg cactttcctt tctctgataa agtgatattt tgacgtcaca 151741 cctgaagggc aagctgatgc tagctaggca aaaacctgca tgatgtgcat tccaggcaga 151801 acaaacagca agtgcagtgg ctgtagggca gatcaaactt agctggattt aggaacagaa 151861 agaaagaaat cctagtgtgg ctggggtatt ggaagaggag aatggtgcaa gatgagactg 151921 aagagctaag ctggggtaga tcttgtagag atctccactc tctagctgct atctgaagta 151981 caggtgggtt tggaggaaag ggacagggca aggcaaaaag cgaggaaacc aatttggaga 152041 ctatttctgt 
agttcagatg agggatgtag gtgacttggt ctaaggtatg gtattagaaa 152101 tagaaatgga gagatctggg gtgtatttag agataaatta ataggtttta cattggtttt 152161 atgttggatt tcaggaattt ggaaaatgga gaaattaaaa ataaccctag gtttttgact 152221 taagacacta gatgtaatgt ggagctattc tttgagatgg gaaaataaat ccagagctct 152281 gctttgttct tattaagtct gagattcgag tagttatatc cggtaggtaa ttggagattt 152341 acacctagaa cttcagagac taactttgcc caagttaggc cagtttacga tactgtcagt 152401 gaactgaatg aaaatccaaa gtggatacta gaccagacac tggtccttct aaaatcattg 152461 tgataatggg tcgcttttct tagtgtactg gttaatatac tacgtgttgt agaaacatcc 152521 atagtaaggt acaaattcta cttcctgttc tctttgttct taaagtttgt ctttaaaaat 152581 aagttcattt gccagctacc aaaagaaaat cagaaagtag atctagatta ttctgttaaa 152641 taggggcagc acagaatgat aatatttgta atatctctgc cttgagttat gcctgacata 152701 tttagtatgg tgtaaatctg tctgaataat gttatttgaa gtattaattc atagaattct 152761 gtactgtgta atttaggtgt gtgtaaatct gtgtgtataa ctgaaatctt agatatgaac 152821 catagttatc gctaaaaaaa tttttagaag gagaatcaca aaaacaagat aacagtattt 152881 aatatcaata aattgttttt taagctggaa tcaccttatg gtctcaatac cactataatt 152941 attaaaattg tacattatac atatatagct atttttttct aataaggcag taatccccag 153001 gaaaagccat ttattaaaat agaattagat atgatgatga caagcagttt tcctattaat 153061 atatctttcc cagcttgcat ttaaatagtc tgctataata ccaattaaat agacaagatg 153121 tatctgggtg tacaaccttg aagtgtatgt ataatctgtg attcttagcc aacttgaaat 153181 gaagactttt cctttaaata tatctaggta tctttctcct gtaagatctc caaagaaaaa 153241 aggttcaact acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt 153301 ccagacccag aagccattga aatctacctc tctttcactg ttttataaaa aaggttagta 153361 gatgattatt ttcaagagca tggactctga aactaggctg actgggttca aatcatgttt 153421 cttctacttt ctaggtacat tactgggcaa gtcacttaat atctctgtgt ctcagtttcc 153481 tcatctataa aatggaaatg ataatgttgc gagatctttc ttgactattc agagtcgttt 153541 ttctggccgg aaacctctgt gaccggtggc gcccttgatc aatttttgct caggcccacc 153601 gggctcattt cagccccttg gtctgacagg ctgcactcgg ctcacactac cggcctgggt 153661 ccatgccagc caggagcaag tcagacatgg 
aacggagagg ggtgtgtggg caagcatgag 153721 atctggccat tgcccacagt cagaaatgct ggctgctgca gcggggcggg cagctccagg 153781 tgctgcacag gtgccaactc tctgtgaggc tgcacctgga ccagacacac cacaagcggt 153841 ttccacagct ggcactgggg aacaataaaa tggctcccag aagcttggag acgccagaaa 153901 ccacagggcc ccaaagaggg agtcacagcc ctggctcaag gagctcccaa ttctgggctc 153961 cccgaagggc cgcagctctt ctctcttctt tacccacaac atggcgagca aggggcatgt 154021 ttcagccctg tttgtgttac agccctttca gcctcgccat ttggtgggtc ctgagttctt 154081 gtcctgcagc caggaagaat gaggtacaca gacaagtgga gggtgagcaa gatgaagagg 154141 agatttattg agcaatagaa cagcttagag aagacccata gtgggcagct cctctctgta 154201 accacaaagt cccgatgagt gttcagctcc tagcagagaa ggtagctcct ctctgcaggc 154261 aggtcgtccc aacaagtgtt cagttctcag cagaaagggt agctcctctc tgcagctgct 154321 cgtcctgaca agtgttcagc tatcagtaga gagggtagtt cctctctgca gctagttgtc 154381 tctccatcct ctgctctgct ctgcctgaga ctcgggcttc tgtgggcctc agcgggaaga 154441 aagtgtgcac cgattggtac atgggcggcc actagtgggc ccagaaaagg taccacaagt 154501 cctcactcct gtccgtggga ctggcagagc ccacccccag ccttcaggcg ctccctggcc 154561 tgaaggtggg gcctcaacag ggactcatcc tctactgctc aggagcccat ctgcctcctg 154621 ccgccatcca tgcgtcaagg ggcacctgta ggccagtgcc aagctgccct cagcccccgc 154681 ctcggcttcc cccccgtgct cgtcagcgcc caaagtctgg agagggctga ggcggcaggg 154741 ggctggcgtg tcagcgctgc cttgagcgtg cgcacatctg gctgggctgc agcagcaccc 154801 aggctcagcc ctgactttgc tccgagattg gagtgggcgc tggcagcagg gagaaaccag 154861 gcagtgggag caggcacttc tgagcctgcg agggcggtgg cagttggggg gtgccttccc 154921 cggcccccaa gtatacagag atgcctgggt ccgcagcggt ggcttgggcg gctgcagctg 154981 cactggggag ggtgggactc ctacctgctc cccagcccag agagcacagg gatgcctggg 155041 tccgcagcgg tggcttgggc ggctgcagaa gcactgggga gctcctgtcc caactcggaa 155101 ggggcggggt tcctgcttgt ccccggctcc tccgggcttc ctggagcgtg ccgccccatc 155161 cacgcctctc tgctgcagca ggtgtgatgt acatacaaaa gtagtgtgtt gttgcctact 155221 gcagacacat ttcctggata ttttcaggac atttaagaag gccagtaacc aggtcatgta 155281 gcattctctg agtgtgtata tatatggtgg ggtggggagg ttgatttgag 
tgtctgtgca 155341 tgcgtatata cttcctagtg catatccact tggagctctt tgaaagacag taaaaggtca 155401 tgttcctcca tgataatctt tttcctgatt ttattcaagt aagaattatg ctgagcttta 155461 caggcatatt gtcagatatt atccttaggt tagagcagtg gttttcaaag tgtcgttccc 155521 aaccagcaga atccacatca cctggaaatt gtagaaatga gaattctggg gctccagcct 155581 agacctgctc aatcagaaac tcttaagata gggccctcaa aaccttgttt taacaaccct 155641 ccagctaagt ctgatgtatg ctaaagttta agaccactga gataaattgt tagctttaaa 155701 aatttcaaat tacctcatgc aagtcgttga aagagatata taaaaggtct ttatatgata 155761 tttaaagtag atgaaggaaa attcaatttt atggattcct ttttgccaaa tttgcatttt 155821 cagtagtatt ttttataaac taaataaatt acatattgca gtcattttac atataggtaa 155881 tctacaagtt taccattttt caacagagta tgttccattg gtattttata gagaaatgtg 155941 tggtttaatt tttttaggat tggaaaaatt catagacact ggcaattata taatttaaaa 156001 aaatgattat tcctctttaa tattgatagt gggaatgacc agtggtgagt gattattaat 156061 agaattgaaa caaaccctat tgggtttgtc taattttaca gtgtcttatg acctttcctc 156121 cactacttcc cacctctgcc ccaatttcat gccataatag ttaattcagc aagtatcatt 156181 ctgccttata tgtacctaga aatgtgctaa cacaaggaat caattatttt taaggaaaat 156241 agtggtagaa aaatcttaaa catcaccact aaagaactta ttcatgtaac caaacaccac 156301 ctgttccccc aaacacctat ttttttaaaa aactcatgcc tacataatga agcctccata 156361 aaaactcaaa aggacctggt tctgggagct tctggatatt tgcccaactg gaggttcctg 156421 gagaatagca tgcctgggga gggcaaggaa aatccatgcc ccttctcaca tgccttgccc 156481 tctgcatttc ttcatctgta tcccttgtaa tatgcctcat aataaaccag taaacatgtt 156541 tctctggggg aaagaaaaga gtggtagaaa agaggtttct gttaaaatgc tacttaacag 156601 cattataatt agagcaattt catgatttga aaaaaatcta cttgtaattc aaaatgaaca 156661 gtaaaaatga ctaatttttc ttattcccac agtgtatcgg ctagcctatc tccggctaaa 156721 tacactttgt gaacgccttc tgtctgagca cccagaatta gaacatatca tctggaccct 156781 tttccagcac accctgcaga atgagtatga actcatgaga gacaggcatt tggaccaagt 156841 aagaaaatca agcacttcac cttctctcct ccctacttac ttgttaactg atttctttct 156901 ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt tctttctttt 156961 ctttctttct 
ttctttcctt tttttttttt tgagatagag tctcactctg ttacccaggc 157021 tggagtgcag tggcgcaatc tcggctcact gcaacctccg cctcccaggt caagtgattc 157081 tcctgcctca gcctcccagg agctaggata caggcgtgta ccaccacacc ttgttaattt 157141 ttgtatttta gtagagacag gttttcacca tgttggccag gctggtctca aactcctgac 157201 ctcaggtgat ccgtctacct aaggcctccc aaagtgttgg gattacaggc atgagccacc 157261 gcacccagcc ttgttaagtg atttctaaag aaaatattag attcaaatga gaaactagat 157321 caatttatta gacttaatat atgttcactt tttggctacc atttacattt ttgttgtaat 157381 tcttacaaat tagagcttta acactcaaga gtatatttta gtaactgctt atttaatgac 157441 acaaaagatc ttacctgtaa aaaaaatagc tgaagaacct tcttatcatt ggctttttct 157501 atgtagtagt gagctagata tcattttagt gtgaaaaaag catgagactt aagaatcagt 157561 tgatttaaaa caatacctat tgaatgaatg aatgaagtca attagtttac aatgagagaa 157621 tcatttttca tctcctcctt tttctctata aaataggtta ataagaacaa tatacttttg 157681 gctgggtgca gctcacgcct gtaatcctag cactttggga ggctgaggtg ggcagatcgc 157741 ttgagctcag gagtttgaga ccagcctggg caacctggca aaactctgtc tctacaaaaa 157801 atttaaaaat tagccaggtg tggtgacttg cacctgtggt cccagctact tcggagactg 157861 aactgggagg atttcttgag cccaggaggt ggaggttgca agtgagctga gatcacacca 157921 ctgtgctcca acctgggtaa cagagcaaga ccttgtctca aaaaaatttt gttaattaaa 157981 aaaatatata catatatagg ttaatagctt tactgaaata attcacacac cataaaaacc 158041 acccatttaa aatgtacaat tcagtggttt ttagtatatt cacagttaca caaacatcac 158101 cacaatcaat tttagaattt gtatcacacc agaaagaaac tccataccct caagcagtca 158161 ccccctattc ccctaccccc tcagcccaag gcaatactaa tctattttct ctctatggat 158221 ttgcctcttc tagacatttc atatacatgg aattatataa tatgtggtct tttgtgtatg 158281 gcttctttca tttagcataa tgttttcaag gttcatctgt attgtagctc gtatcagtac 158341 ttcattcctt ttcattgcca gacaatattc cattgtatgg atatacacca cattatgctt 158401 atcattcata aagtgataaa catttgggtt gtttccactt tttggctatt atgaataatg 158461 ttactgtgaa cattcatgaa catgtttcat ttctcctagg tacataccta ggagtagaat 158521 tgctaggtca tatgataatt ctatgttaac tttttgagga atgcccagat cattttccaa 158581 agtggctata ccattttaca tgcctaccaa 
taatgtatga ggttctaatt tctctatatc 158641 ctcaccaact tttttatctt ttttttttaa taatagccat cctagtgggt ataaaatggc 158701 atctcagtgt ggtttgattt gcatttctct gatggctagt gatggtgagc accttttcat 158761 gtgtttactg gccatgtgta tatcttcttt agagaaatgt ctatcagacc gtttgcccat 158821 tttatttttt atttatgttt tttatttttg acacggagtc ttgctctgtc gcccaggcta 158881 gagtgcagtg gcacgatctc ggctcacggc aatcttcgcc ttctgggttc aagcaattct 158941 tctgcctcag cctcctgagt agctgggatt acaggtgtgt gccaccacac ctggctaatt 159001 tctgtatttt tagtagagac ggggtttcat tatgttggtc aggctggtct cgaactcctg 159061 accttgtgtt ccacccgcct tggcctccca aagtgctggg attacaggca tgagccacca 159121 tgcccagccc attttgtttt gagacatggt cttactctgt cacccaggct ggagtgcagt 159181 gacatgatca cagctcactg cagcctcgaa ttcccgggct caagtgttcc tcccacttct 159241 gcctcctgag tagctgggac tataggcaca caccaccacg ccgagctaat ttttgtattt 159301 ttttgtacag atagggtttt gccgtgttgc tcaggctggt cttgaatggt tgagctcaag 159361 taagccaccc accttggcct cccaaagtgc taggattaca ggcatgagcc accatgccca 159421 gactgctcat tttttaatag agttatttgt cttgttatta ttgaactgta aggggtcatt 159481 atgtagtcta catacaagtc acttatcaca tatataattt gaaaatattt tctcccattc 159541 tgtgagttgt ttcactttct tgttagtgtc ctttgaaaca caagtttttt attttgatga 159601 tgtccaattt atgtagttgt tttcttgtta ttgatgcttt agtgtcatat ctaagaaacc 159661 agtgccaaat ctgaagttgt taagatttat ccctgttttc ctccaaaatg tttatagttg 159721 tagttcttac atttaggttt tgctccattt tgagctaatt ttttaatata gttgaggtag 159781 gggtccagct tcattctttt gtttgtggat aactagttgt tccagcacca ttttttgaag 159841 agtttttttc tctcttcatt gacagttgtc aaaaaatcag ttgactgtaa atatataagt 159901 atttatttct ggactctcaa ttctgtttca tatgagatct acataagcta ttcttatgtc 159961 aggactacac tgtacacttg gctgtaggat gccaagcaaa aatgattatt tggcattctt 160021 ggcaatagta ggtaagagca agaatgttgc agaagcaatg agtacttaaa ttaggctata 160081 aagttcatac aaataaagaa aatctgacat tttaaagtca cagggaatca ttttcagatc 160141 ttatttttgc atttaaaata tcttttattg aaatgtaata cacatatgta aaaacctgtg 160201 tatcaaaagt atacattcta acaaaaattc atcaactgag cttagttata 
taaccagcat 160261 ccagatcaag aaacaaaaca ttgtcagcaa cacataagac tctcttacat gctcttccag 160321 tcaccatagc ctccttacca ccaccaccaa gtgtgttcaa tattatttta tctaatagtt 160381 tgaaacattt tcacacctgg ctttttaaag tgctagttat acaaaattaa ctgattctac 160441 tttcaacttt tctgggaaac atttaaggga caagagccaa agttagggta atttacaaac 160501 caggtgatca gtcctggata attgagcctt ggtgatttgc attttgttct ttaaacacac 160561 tttgggttaa acacttcatg tagactttca aactgagctc agtatggaaa gaaataactc 160621 tgtagattaa acctttcttt tttgaggcta aaagaaagaa aatggtattt tttaagaaca 160681 aaaccatgta ataaaattct gactactttt acatcaattt atttactaga ttatgatgtg 160741 ttccatgtat ggcatatgca aagtgaagaa tatagacctt aaattcaaaa tcattgtaac 160801 agcatacaag gatcttcctc atgctgttca ggaggtaggt aattttccat agtaagtttt 160861 tttgataaat ccatatccat aacataacat aggtaattca tttgatctca tttattcatt 160921 aatgagatca tatattctgt ctgaccttat tatgtaaatt cacaaataaa aacttttata 160981 ttatttattt gtaacttaaa tagaattgga aagataaggg taattatgaa attacccata 161041 ttcatagttt tttataaagt taataaataa tattttatcc ctgtaataag caggtatttg 161101 taataaactt gacatgagtc atagaacatt agatattctt tgaagttatt tttattactt 161161 tataggaaaa gccagtataa atgcagcctt gcaaatcaat aatacaaatg ttcagtataa 161221 acacattttg tgtttccggt ttaccttccc tttagagaaa tagtagtagt aaaatatact 161281 tggtatgaca catttcatat tgttctgtgg aaagtgagtt gtggggtatt tttcccattt 161341 attattggag ttttcaaaag taatacaaac caagtcattt ttgtttgtat gtttgaagtt 161401 attttctaaa ttcaatcatt ttaattgtat atatcagagg catctgatat aacctctgtg 161461 tttttcccta aaatttgagt gaaatctaac tatcactaaa atatagtgct tttactaaat 161521 gtctacttct tgcaattgca taatgtatac tgaaaccctt atagattggg ggggataccg 161581 ggaggtacag gccagagaat gtagtccaaa tattggtgga taatattatt ggctttcgct 161641 cagctctggc cctttgattc ccatcatgct ttccattcta ccagtctatc tactcctctc 161701 ttcccagtct atatactctt ctcttcttgg atggccagct cttccatctg ctgctgcctg 161761 gctatttctc tcaatcattc tgtgacattt cacttctaga agagcagcta taatccaagc 161821 ctaagaagta attttattta tttattattt tttcctttat aatatgtgct tcttaccagt 161881 caaaaagtat 
tataaactat tagaaaagaa aatctaaagg tagaaatttt aaaattcatt 161941 taacaagtaa attttacttt tttttttttt tttttttttt tactgttctt cctcagacat 162001 tcaaacgtgt tttgatcaaa gaagaggagt atgattctat tatagtattc tataactcgg 162061 tcttcatgca gagactgaaa acaaatattt tgcagtatgc ttccaccagg gtaggtcaaa 162121 agtatccttt gattggaaaa atctaatgta atgggtccac caaaacatta aataaataat 162181 ctactttttt gtttttgctc tagcccccta ccttgtcacc aatacctcac attcctcgaa 162241 gcccttacaa gtttcctagt tcacccttac ggattcctgg agggaacatc tatatttcac 162301 ccctgaagag tccatataaa atttcagaag gtctgccaac accaacaaaa atgactccaa 162361 gatcaaggtg tgtgttttct ctttagggaa gtagtaaaga atgagagggg gattattttg 162421 atccaagaat aaaaaatata aagcattctt catttcaaat aagctagact cttgaaactc 162481 tatttgctta tttaagtaac ataataagaa tatgggggcg gggtgaagaa aatctattta 162541 cgacttaagc aacgcaagat ggccgaatag gaacagctcc ggtctacagc tcccagcgtg 162601 aggcgacgca gaagacgggt gatttctgca tttccatctg aggtaccggg ttcatctcac 162661 tagggagtgc cagacagtgg gcgcaggcca gtgtgtgtgc gcaccgtgtg cgagccgaag 162721 cagggcgagc gattgcctca cctgggaagc gcaaggggtc agggagttcc ctttccgagt 162781 caaagaaagg ggtgacggac gcacctggaa aatcgggtca ctcccacccg aatattgcac 162841 ttttcagacc ggcttaagaa acggcgcacc acgagactat atcccacacc tggctcagag 162901 ggtcctacgc ccacggaatc tcgctgattg ctagcacagc agtctgtgat caaactgcaa 162961 ggcggcagtg aggctggggg aggggcgccc gccattgccc aggcttgctt aggtaaacaa 163021 agcagccggg aagctcgaac tgggtggagc ccaccacagg tcaaggaggc ctgcctgcct 163081 ttgtaggctc cacctctggg ggcagggcac agacaaacaa aaagacagca gtaacctctg 163141 cagacttaag tgtccctgtc tgacagcttt gaagagagca gtggttctcc cagcacgcag 163201 ctggagatct gagaacgggc agactgcctc ctcaagtggg tccctgaccc ctgacccccg 163261 agcagcctaa ctgggaggca ccccccagca ggggcacact gacacctcac acggcaaggg 163321 tattccaaca gacctgcagc tgagggtcct gtctgttaga aggaaaacta acaaccagaa 163381 aggacatcta caccaaaaac ccatctgtac atcaccatca tcaaagacca aaagtagata 163441 aaaccacaaa gatggggaaa aaacagaaca gaaaaactgg aaactctaaa acgcagagcg 163501 cctctcctcc tccaaagaac acagttcctc 
accagcaacg gaacaaagtt ggacggagaa 163561 tgactttgac gagctgagag aagaaggttt cagacgatca aattactctg agctacggga 163621 ggacattcaa accaaaggca aagaagttga aaactttgaa aaaaatttag aagaatatat 163681 aactagaata accaatacag agaagtgctt aaaggagctg atggagctga aaaccaaggc 163741 tcgagaacta ctgtgaagaa tgcagaagcc tcaggagcca atgcgatcaa ctggaagaaa 163801 gggtatcagc aatggaagat gaaatgaatg aaatgaagca agaagggaag tttagagaaa 163861 aaagaataaa aagaaatgag caaagcctcc aagaaatatg ggactatgtg aaaagaccaa 163921 atctacgtct gattggtgta cctgaaagtg acagggagaa tggaaccaag ttggaaaaca 163981 ctctgcagga tattatccag gagaacttcc ccaatctagc aaggcaggcc aacgttcaga 164041 ttcaggaaat acagagaacg ccacaaagat actcctcgag aagagcaact ccaagacaca 164101 taattgtcag attcaccaaa gttgaaatga aggaaaaaat gttaagggca gccagagaga 164161 aaggtcgggt taccctcaaa ggaaagccca tcagactaac agcggatctc tcggcagaaa 164221 ccttacaagc cagaagagag tgggggccaa tattcaacat tcttaaagaa aagaattttc 164281 aacccagaat ttcatatcca gccaaactaa gcttcataag tgaaggagaa ataaaatcct 164341 ttacagacaa gcaaatgctg agagattttg tcaccaccag gcctgcccta aaagagctcc 164401 tgaaggaagc gctaaacatg gaaaggaaca accggtacca gccgctgcaa aatcatgcca 164461 aaatgtaaag accatcgaga ctaggaagaa actgcatcaa ctaatgagca aaatcaccag 164521 ctaacatcat aatgacagga tcaaattcac acataacaat attaacttta aatataaatg 164581 gactaaattc tgcaattaaa agacacagac tggcaagttg gataaagagt caagacccat 164641 cagtgtgctg tattcaggaa acccatctca cgtgcagaga cacacatagg ctcaaaataa 164701 aaggatggag gaagatctac caagccaatg gaaaacaaaa aaaggcaagg ggttgcaatc 164761 ctagtctcag gataaaacag actttaaacc aacaaagatc aaaagagaca aagaaggcca 164821 ttacataatg gtaaagggat caattcaaca agaggagcta actatcctaa atatttatgc 164881 acccaataca ggagcaccca gattcataaa gcaagtcctg agtgacctac aaagagactt 164941 agactcccac acattaataa tgggagactt taacacccca ctgtcaacat tagacagatc 165001 aacgagacag aaagtcaaca aggataccca ggaattgaac tcagctctgc accaagcaga 165061 cctaatagac atctacagaa ctctccaccc caaatcaaca gaatatacct ttttttcagc 165121 accacaccac acctattcca aaattgacct acatagttgg aagtaaagct 
ctcctcagca 165181 aatgtaaaag aacagaaatt ataacaaact atctctcaga ccacagtgca atcaaactag 165241 aactcaggat taagaatctc actcaaagcc gctcaactac atggaaactg aacaacctgc 165301 tcctgaatga ctactgggta cataacgaaa tgaaggcaga aataaagatg ttctttgaaa 165361 ccaacgagaa caaagacacc acataccaga atctctggga cacattcaaa gcagtgtgta 165421 gagggaaatt tatagcacta aatgcctaca agagaaagca ggaaagatcc aaaattgaca 165481 ccctaacatc acaattaaaa gaactagaaa agcaagagca aacacattca aaagctagca 165541 gaaggcaaga aataactaaa atcagagcag aactgaagga aatagagaca caaaaaaccc 165601 ttcaaaaaat caatgaatcc aggagctggt tttttgaaag gatcaacaaa attgatagac 165661 cgctagcaag actaataaga aaaaaagaga gaagaatcaa atagacacaa taaaaaatga 165721 taaaggggat atcaccaccg atcccacaga aatacaaact accatcagag aatactacaa 165781 acacctctac gcaaataaac tagaaaatct agaggaaatg gatacattcc tcgacacata 165841 cactctccca agactaaacc aggaagaagt tgaatctctg aatagaccaa taacaggctc 165901 tgaaattgtg gcaataatca atagtttacc aaccaaaaag agtccaggac cagatggatt 165961 cacagccgaa ttctaccaga ggtacaagga ggaactggta ccattccttc tgaaactatt 166021 ccaatcaata gaaaaagagg gaatcctccc taactcattt tatgaggcca gcatcattct 166081 gataccaaag ccgggcagag acacaaccaa aaaagagaat tttagaccaa tatccttgat 166141 gaacattgat gcaaaaatcc tcaataaaat actggcaaac cgaatccagc agcacatcaa 166201 aaagcttatc caccatgatc aagtgggctt catccctggg atgcaaggct ggttcaatat 166261 acgcaaatca ataaatgtaa tccagcatat aaacagagcc aaagacaaaa accacatgat 166321 tatctcaata gatgcagaaa aagcctttga caaaattcaa caacccttca tgctaaaaac 166381 tctcaataaa ttaggtattg atgggacata tttcaaaata ataagagcta tctatgacaa 166441 acccacagcc aatatcatac tgaatgggca aaaactggaa gcattccctt tgaaaactgg 166501 cacaagacag ggatgccctc tctcaccgct cctattcaac atagtgttgg aagttctggc 166561 cagggcaatc aggcaggaga aggaaataaa gggtattcaa ttaggaaaag aggaagtcaa 166621 attgtccctg tttgcagatg acatgattgt ttatctagaa aaccccattg tctcagccca 166681 aaatctcctt aagctgataa acaacttcag caaagtctca ggatacaaaa tcaatgtaca 166741 aaaatcacaa gcattcttat acaccaacaa cagacaaaca gagagccaaa tcatgagtga 166801 actcccattc 
acaattgctt caaagagaat aaaataccta ggaatccaac ttacaagaga 166861 tgtgaaggac ctcttcaagg agaactacaa accactgctc aaggaaataa aagaggacac 166921 aaacaaatgg aagaacattc catgctcatg ggtaggaaga atcaatatcg tgaaaatggc 166981 catactgccc aaggtaattt acagattcaa tgctatcccc atcaagctac caatgacttt 167041 cttcacagga ttggaaaaaa ctactttaaa gttcatatgg aaccaaaaaa gagcccgcat 167101 cgccaagtca atcctaagcc aaaagaacaa agctggaggc atcacactac ctgacttcaa 167161 actatactac aaggctacag taaccaaaac agcatggtac tggtaccaaa acagagatat 167221 agatcaatgg aacagaacag agccctcaga aataatgccg catatctaca actatctgat 167281 ctttgacaaa cctgagaaaa acaagcaatg gggaaaggat tccctattta ataaatggtg 167341 ctgggaaaac tggctagcca tatgtagaaa gctgaaactg gatcccttcc ttacacctta 167401 tacaaaaatc aattcaagat ggattaaaga tttaaacgtt agacctaaaa ccataaaaac 167461 cctagaagaa aacctaggca ttaccattca ggacataggc gtgggcaagg acttcatgtc 167521 caaaacacca aaagcaatgg caacaaaagc caaaattgac aaatgggatc taattaaact 167581 caagagcttc tgcgcagcaa aagaaactac catcagagtg aacaggcaac ctacaacatg 167641 ggagaaaatt ttcgcaacct actcatctga caaagggcta atatgcagaa tctacaatga 167701 actcaaacaa atttacaaga aaaaaacaaa caaccccatc aaaaagtggg cgaaggacat 167761 gaacagacac ttctcaaaag aagacattta tgcagccaaa aaacacatga agaaatgctc 167821 atcatcactg gccatcagag aaatgcaaat caaaaccact atgagatatc atctcacacc 167881 agttagaatg gcaatcatta aaaagtcagg aaacaacagg tgctggagag gatgtggaga 167941 aataggaaca cttttacact gttggtggga ctgtaaacta gttcaaccat tgtggaagtc 168001 agtgtagcga ttcctcaggg atctagaact agaaatacca tttgacccag ccatcccatt 168061 actgggtata tacccaaagg actataaatc atgctgctat aaagacacat gcacacgtat 168121 gtttattgtg gcactattca caatagcaaa gacttggaac caacccaaat gtccaacaat 168181 gatagactgg attaagaaaa tgtggcacat atacaccatg gaatactatg cagccataaa 168241 aaatgatgag ttcatgtcct ttgtagggac atggatgaaa ttggaaacca tcattctcag 168301 taaactatcg caagaacaaa aaaccaaaca ccgcatattc tcactcatag gtgggaattg 168361 aacaatgaga tcacatggac accaggaagg ggaatatcac actctgggga ctgcggtggg 168421 gtcgggggag gggggaggga tagcattggg 
agatatacct aatgctagat gacacgttag 168481 tgggtgcagc acaccagcat ggcacatgta tacatatgta actaacctca caatgtgcac 168541 atgtacccta aaacttagag tataataaaa aatataaatt aaaaaaaaaa aaaaaaaaag 168601 aaaatctatt tacttggatg ggtttacaga tttagttatc agctttcctg actgttaggt 168661 atcttctttt gagaacaatt tgagaaccag tttggttata tgtttcaaaa cacttttata 168721 ttttttaaat agccaatctg ctaaacaaag caggttactt taggttgagt acttttagtt 168781 tgcagtttat tggatgtcct gtaagttttg cttcctgtgg atttttttcc ttttgcttgt 168841 tatattaaat gtagattact gtcaattaag tctttagagg tccatcccta atcctgctgg 168901 cggcctcttt accacctcac cttgggcagg tctctatctg tacttcacaa gggtgctgtg 168961 gatcagggaa atgatgagta tgaagctgtt ttaaattctc agatgaaagg ttgtatgcaa 169021 ctacaaatca ttatattatc ttccacatcc aaccacaagt gctctctagc tttgaagtgc 169081 ttcagttgac taattatatg ttatcatggg ctatttgaaa ctgactttat ttgtgtgaag 169141 taggaggcag attagctagt atagttaatg tagtctcatc tcagaaatta tcagcccata 169201 tggttgtacc taatgggcaa gaaaaggggg catatgttgg cctttcagaa aatatttgca 169261 tggtatattt aattatttaa gtagtacgta ctcattataa aaatttcaaa ctctacagaa 169321 aaatatgaag taagaaataa ttggcaatat gatacaaatg ctctcatgtg tctctgtatc 169381 atcctttatt tatttggtgt tcttgtagtt ggatatctgt gtactgatat ctactacttg 169441 ttctaagctg ctaagatgcc acgtcatgtc tatttaagaa aatttacatt gttccatgcc 169501 actatatgaa aatgtatact taggtagttt tttttaatag tgataattag tccatattat 169561 ggtgataatg atggctactt gctgatcctt agtgaaataa attctgtgtt ggtattcttc 169621 agcaaaaacg tcacattctg aacatcctaa ctaataaatc attgaccagg cattaggaga 169681 gcatcttaac actctaccca gtacatttag gtactctggt taatcaaata ttatatgtag 169741 agtgatggcc acttatcaag gaattggagg gcaaaaaaat ctctatattc actagttcca 169801 tgttcatgtt ttcaaatctt tgtcgagtca tgtcaggcta atgatgttct tcagctttat 169861 tataaagaac ataaattata tggttcaaat aaagacagac taataaagat ttctgactct 169921 atgatatatg ataaatagcc tttactatat caatatagat gctaattaga attctgatta 169981 ttacttaata ttctaagttt tttccaaata taacttgaat tttaaatgag aaaatatgaa 170041 agaaatttga taacttaccc attgatttat gaagaactaa gtaggggtaa 
ccttgaaact 170101 tgcctttgcc ctccctaaat atgggcaatg gcagaatatg ttcttgcaga cctataactt 170161 ttgctttaaa actaagagac taggtgagta tatgattaga cgggcactgt tagaataatt 170221 cccaaatgaa tatagtttgt cagtggttct agggtagagg taacctttaa tttggtattc 170281 ctaatagttc agaatgatgt atttatgctc atctctgcaa aattgtatat ggttttttat 170341 tactaattgg tatttcatct taacttgaca gaatcttagt atcaattggt gaatcattcg 170401 gggtgagtat tttctttcta tgaaatataa tagtatgcat tgtaagtata aaagaaatta 170461 aagctttcta taatttgaat ttccaaatgc agttattcaa acacctcatc caggcatatt 170521 gcatagaatt ttatgagata tatatatctc agatttactt tcaaatcaag tttaatctca 170581 aatcatactc ctaattggtg aacttcaaaa cttttctaaa tatccacttg agattatata 170641 atacatatat acatttgtgt atatacatac atatatacgt gagctgtttt tgctcacaac 170701 atttctatca ccaaatgtgt gagatttttt tctcacccaa atctattctt caactctctg 170761 gtgttctaca attcaattca attctgacac taattaccca gagtcagcat cagactccac 170821 aggttcaagg gctcagtccc acaaaaatgg tctcactgca gacaccagtc acaagtgtca 170881 ggtccccagg ctacaccaca cttccgtctg acttgaatac gaagttgggg ggttccgata 170941 gtgcctcttc cttacagttt gatccactgc cagaactact cacaaaactc tggaaaatat 171001 tctacttact attatcagtt catcataaaa gatacaaatg aacagccaga tgaagaaata 171061 ttatataggg tgaggtccag aagagtccct agcacagggg cttctgtccc tggggagttg 171121 gggtgcacca ccttcctagc acttagacat gtttaccaac tccaaagatc tcccaacctt 171181 attgttgagg ggtttttatg ggggtttcat tatataggca taattgatta actcaatttc 171241 caaccccctc ccctccctgg atagagggtg gggctgaaag ttccaagctt ctactcaaga 171301 cttggtcttt ctggcaacca gcttccatcc taaattagct aggtacccac caagtatcac 171361 ctcattagaa caaaagatgg tcccatcacc cttatcacac atgaaattcg aagggtttta 171421 ggagctctgt cccaggaacc agggacaaag accaaatatc tttcaatgat accatgtatg 171481 tatgtacata acctcacagg aatgtttata aaacaatttt gaaattcact cattatgagt 171541 gtgatttgaa atgagatact ccaaaatgta agcccgatat ccaaatgtca ccagcctgtc 171601 cctgcctact ggtctccttc catacatatg cagtttttgc ttgtccttcc tctcagactt 171661 ctaggatatt ctttttctgg tacactgatt aggaattgtt tgcatgagat cctgcctcag 171721 tgaaagtggc 
agagcttcat tctaggagat ccaagggaaa gctttgcttt gaaacattta 171781 ttctaggctg caaatccaca accctagttg gccttccatt aaagtcacta attcagcagt 171841 cccatattca atatgcatta ctgttaatat gttgcaccat ctccattccc ctgagagctt 171901 atatttttaa tttttaaatt tttattttta gagacagtgt ctcactctgt cacctactta 171961 ttataacctc aaactcctcg gcccaagcag tcctctcacc ttagcctccc aagttgccag 172021 gactacaggc atgcaccacc atgtccagct aatttttaaa ttttttgtag agacagggtt 172081 ttctatgttg gccagattgg tattgaattc ctggcttcca cgataccccg tctcagcctc 172141 ccaaagaact gggattacag atgtgagcca ctgcacctgg ccagagagct tatattctta 172201 taggaatggg aagactgcct atgttatgtg ttgctacata atacattacc cccaaactta 172261 gtgacttaaa acaaacgctt attatctcca tttctgtggg tcaataatct aggcatgact 172321 tagctgggcc agagtttctc caaagtctgt gatcaaggtg tcagttgggc ctgcagtcat 172381 ctcaaggctc cactagagga gcattcactg gcagacttat tcaaatggct gttggctgat 172441 cctcgatggc tattggcccc tctattggtt tcttgccctt gggccctcca tagtactgct 172501 tgctattcac aacatggcag cttgctttgc ccagagcagg gactctgagg gaggcaggga 172561 aataaagagc aagagagagg tcacagtctt attgtaatct aattctggaa atgacagccc 172621 attacttttg gcatattatt ttggttagaa gcaagacaac agtagatcta gcccacacac 172681 gaggggagga ggatcacaca aggaggtgaa taccaggagg tggggtcatt gggagccatc 172741 tgagaggctg cccaccacac tgcctcaagt aactagggag aggtaaaagt ttatatgcca 172801 gatgaccaaa tattaaaatg tgtgttacaa atagttcacg atgggctcag ctgtcagact 172861 ttacaaagga gctatgggac cttataagga cagttggaac tggctaggta tcacatagtg 172921 gtcttcaaac atttttgctt gccataacct ctaaaataat tgggaaaaag ttgaatgtac 172981 ttccatatct taaagctgat aatttaaaat attatacatt taatagcagc acgggattta 173041 gtttttgtta aattgtatat gtgctccaaa tagatttacc atcaaaacct gttttgaatt 173101 taatattggg agaattcgct agtttaattt ttggaaaata aagtataatt ggcaaagcta 173161 atcctcactg ttgaatctat ccgtcaaatc agatataatt tctatcagaa agtctatatg 173221 acttgtcaac ataataccca taaagtgaat caaaaattat tattcattga acacatcatc 173281 tcttatcaaa ttcttgtgac cttccttctg gttgtataat agcctaaaaa acaaaaaaag 173341 gacaaaagca agtttccaga aagctgttct 
gacttgccta cttctgaaaa gtagtcctgt 173401 atggtgggtt ctgaaaatga ggaaccagga cttgcagagt aggcagttgc tggaggaaga 173461 atgtgagctg catgggaaaa gacaggagga tttacaaaga gtgggtgttt aattggggat 173521 ggaattaggt agttattctg atttttagat ttttcatatc ttttatttgg tccaatgaag 173581 cagaaaattt aaatgaagtt attacctttg cctgattttt gacacacctc aaactataac 173641 ttgaggttgc taactatgaa acactggcat ttaatgattt aaagtaaaga attctgtaat 173701 ttgtagactt ctgagaagtt ccagaaaata aatcagatgg tatgtaacag cgaccgtgtg 173761 ctcaaaagaa gtgctgaagg aagcaaccct cctaaaccac tgaaaaaact acgctttgat 173821 attgaaggat cagatgaagc agatggaagg taggaaccag ttttgaatgt tttccagtag 173881 ctgagatggt catctgggga atccagagtc tcagcactgc tcctggctta taccaatttc 173941 tttcatgcca agtttatttg gaagttgtga gaatggctca aaataataga tatgagtgta 174001 gtgcaaagtt aaaaacatct tacaaattgc ataccaacat tcagtgaaga tatctaataa 174061 accctgatct tttttacaaa gctattgata aaattttgtt attcttaaca ttaaatttaa 174121 aaatgtttac tcttgaaaaa tattaaccac tgtattttgt gagaaccact gaaaaaatac 174181 atagcatcat aaatttgtga catttatgtt ttagatggtt agtttttaaa ttttaaaatt 174241 aaaagctact cactaaaata atagcataaa gtaagtcatc gaaagcatca tagttactgg 174301 aaatttgagt tttccattta taaatacaca tgaaatgttt tgcatttttt taatctgcag 174361 taaacatctc ccaggagagt ccaaatttca gcagaaactg gcagaaatga gtaagtactt 174421 ttttcacctt gtgtaaacga aataaacaat tgtttacact gcaagaagtc ttttcgttat 174481 ataaaagaat gtataatttc ttcagttggc aggtttgttt atgcatttaa aatataattc 174541 aatcaaggtt atttatctac aaacatttgt ggattaaatg tatgatgtaa aatgaaggtc 174601 atttttaccc tttctatgat ctttcatgca ggaagactaa gaagtgaaac attgcttgac 174661 cacattcaac acaaatggct acagttagaa aatactttag cagaactaca aagaggaact 174721 atttgggagt gttagatata gggaaaagtt ttataaacct agcatatgta aacatcatca 174781 cccttattta aggaataacc tttgattcta ccatttttaa actttcctta tttgcaaatc 174841 ttaaaacaat ttcaaacaaa agtttccatt actggaaatc agaattttaa taaagaacac 174901 ttatgtcatt ttccaagtta tacagagtga tatcatgtta gggtaagtgg aattctattc 174961 aaaactactc tgagactaga caaaaaaaat aatccaagga tagtagatgg 
aaagcacatt 175021 aggaaagaga taaatgttca ctaactacta ttgggttgtc ttaggtagtt atggtagtga 175081 gtgtaacatt tcccagtatc tgctgccaga tccattgata ggaaaaggaa ctgggatttt 175141 gaaagaagac aaatattcag gagataggtg ttaaaacaat cattcctagg tcacaggaca 175201 cactaatgag aacaatacca attaaaatga gaggctttct tttaatacat ctttgtagat 175261 actaactttg aaaataagat atttaaaagg atgctttttt tctatgtatt tttttaaagt 175321 gtaattaaaa ctaaataact aaaacaaaat tttctagtta taaatgaatc tctttttgga 175381 ttttatgtca acttacatat tcctatttta actcttaaat agttgcatct tttcccctca 175441 gtacatgaaa atagacaaga ggctcataac caaaatcagg ccttgaagac atatggaaac 175501 attttggaac ccacaaatat gcacttaaaa aggagctgaa agctgtaaaa acatgtatat 175561 tttcttagag aagtgttcag ttgctatagt tagtacattt ccctagtgac acacttagct 175621 atgatggcga gcttcctctg ccaggccaaa gttcaggacc tttgtgcatc tacaaaattg 175681 tgacaaagca ggcagaattt taacccctga ggctaaggca tttctttagg atgctcttca 175741 gcttttctta aagaattaca tttagaaaag atcatcatac aatgctggat tgcccagaat 175801 ggaagggcat tccatttttc caaatgtgct tctttataaa gaactcatgc ctacctctaa 175861 aatcaagtta agaattttta agaaatttaa taagtatccc ggctttgtct atattgtaat 175921 tcataatatg cttttaacat ggccatgccc cgggtaggca tgatggctta tgtctgtaat 175981 accagcacta tgggagggca aggtgagagg attgcttgag gccaggagtt caagaccagc 176041 ctgggcaaca tagcaagacc ccatctttac aaaaaaatcc aaaaattagc caggcctggt 176101 ggcatgtgcc tgtagtcttg gctactggag aggctgaggt gagagaattg cttgagccca 176161 gaaggtcaaa gctgtagtga gccatgatca tgccactgca ctccagcttg ggcaacagag 176221 caagaccttg tctcaaaaaa aaaaaatcca tgcccagttt aaatgagcta tgatttgatg 176281 tatttgcagt acctgcaagg catctgcaaa cataagggaa cctatagctt tgtaaagaat 176341 accaccactc actctggcag attcagtctg ggccagccga gtattctagt ctgctgaagt 176401 atatgccaag catcaaccac ctcatctcta gaatagtggc ctacaaaatg gggcatacct 176461 gaagtatgca aagtcatctt ttggggtaca taaagtagaa gttctattta tctcatttta 176521 atgtttgctt ggggtatgtg ttttataatt tgtacataca tacgtatttt tgtggtacag 176581 gcttggaaaa aaatttactc acagggatat atgattttta aagatttgga gaccactgtt 176641 tttttggggg 
gtggggaggg aatttttttt ttaattgata tatcatagtt atacatattt 176701 taggggtaca tgtgatgttt tgatacgtgt acacaatgtg taatgatcaa atcagggtta 176761 ttgggatata tatcacctca aacatttttc tttgcattgg gaacattaca attctagcta 176821 ttttgaaata tgcagtaaat taactgtaac tccctacggt actgtcaaat actagaatga 176881 agaccactgc ttttgcaagg tcctgagcgc catcagtttg acatgagcat aatatatatg 176941 gcagccactt gccaacttac ccagtaccat caatgctgtt aacagttctt catccttttt 177001 ccagcttcta ctcgaacacg aatgcaaaag cagaaaatga atgatagcat ggatacctca 177061 aacaaggaag agaaatgagg atctcaggac cttggtggac actgtgtaca cctctggatt 177121 cattgtctct cacagatgtg actgtataac tttcccaggt tctgtttatg gccacattta 177181 atatcttcag ctctttttgt ggatataaaa tgtgcagatg caattgtttg ggtgattcct 177241 aagccacttg aaatgttagt cattgttatt tatacaagat tgaaaatctt gtgtaaatcc 177301 tgccatttaa aaagttgtag cagattgttt cctcttccaa agtaaaattg ctgtgcttta 177361 tggatagtaa gaatggccct agagtgggag tcctgataac ccaggcctgt ctgactactt 177421 tgccttcttt tgtagcatat aggtgatgtt tgctcttgtt tttattaatt tatatgtata 177481 tttttttaat ttaacatgaa cacccttaga aaatgtgtcc tatctatctt ccaaatgcaa 177541 tttgattgac tgcccattca ccaaaattat cctgaactct tctgcaaaaa tggatattat 177601 tagaaattag aaaaaaatta ctaattttac acattagatt ttattttact attggaatct 177661 gatatactgt gtgcttgttt tataaaattt tgcttttaat taaataaaag ctggaagcaa 177721 agtataacca tatgatacta tcatactact gaaacagatt tcatacctca gaatgtaaaa 177781 gaacttactg attattttct tcatccaact tatgttttta aatgaggatt attgatagta 177841 ctcttggttt ttataccatt cagatcactg aatttataaa gtacccatct agtacttgaa 177901 aaagtaaagt gttctgccag atcttaggta tagaggaccc taacacagta tatcccaagt 177961 gcactttcta atgtttctgg gtcctgaaga attaagatac aaattaattt tactccataa 178021 acagactgtt aattatagga gccttaattt ttttttcata gagatttgtc taattgcatc 178081 tcaaaattat tctgccctcc ttaatttggg aaggtttgtg ttttctctgg aatggtacat 178141 gtcttccatg tatcttttga actggcaatt gtctatttat cttttatttt tttaagtcag 178201 tatggtctaa cactggcatg ttcaaagcca cattatttct agtccaaaat tacaagtaat 178261 caagggtcat tatgggttag gcattaatgt 
ttctatctga ttttgtgcaa aagcttcaaa 178321 ttaaaacagc tgcattagaa aaagaggcgc ttctcccctc ccctacacct aaaggtgtat 178381 ttaaactatc ttgtgtgatt aacttattta gagatgctgt aacttaaaat aggggatatt 178441 taaggtagct tcagctagct tttaggaaaa tcactttgtc taactcagaa ttatttttaa 178501 aaagaaatct ggtcttgtta gaaaacaaaa ttttattttg tgctcattta agtttcaaac 178561 ttactatttt gacagttatt ttgataacaa tgacactaga aaacttgact ccatttcatc 178621 attgtttctg catgaatatc atacaaatca gttagttttt aggtcaaggg cttactattt 178681 ctgggtcttt tgctactaag ttcacattag aattagtgcc agaattttag gaacttcaga 178741 gatcgtgtat tgagatttct taaataatgc ttcagatatt attgctttat tgcttttttg 178801 tattggttaa aactgtacat ttaaaattgc tatgttacta ttttctacaa ttaatagttt 178861 gtctatttta aaataaatta gttgttaaga gtcttaatgg tctgatgttg tgttctttgt 178921 attaagtaca ctaatgttct cttttctgtc taggagaaga tagatagaag ataactctcc 178981 tagtatctca tccattccta gcctttaagg ggctctatat gctagagatt tccaaattta 179041 tttcttcagc cctgatcttt tcacagaggt caaggctttt atagccaaca gaactcttga 179101 ttcctactcc cctctaccca atgtctccaa atataaacta aaatcaaata aataaaaatc 179161 ttttttttct accctggtct ttctccattt tattaaacat ttatcaagtc aaaaacttag 179221 gagctatcta ttaattcttg tctggaccac tgctaaagac ttctaaccag tctccctgtt 179281 ccctttcttt ctttactgtc aaactccaca aagaacaaaa atgattgttt aaaaatataa 179341 attgattcac attgctctct acttgaagta cttaacagga attataaagc ccctactgac 179401 ttcccatcct catcttgtag aattttctcc ttgtcactgt gctttagcac actgaacctt 179461 tttctttaaa atgccaagcc tattcttacc ttaggctttt atatttcttc ttccttctgc 179521 ctggaatact cttttcccac agttttgcat agccagtgcc tttctgtctt tcagaactca 179581 gctcagaggg cttccttgac caccaagcta aaatcatcac tgtgtattgt atcattctcc 179641 ttcaaaagtg taacagcatc catctatgac tatttcccac catctgaaat tgcctttctt 179701 gttacctgtc tccccatccc tagccactcc ccaccccaga ttgtaaccat cagcctatct 179761 attcttattt accactttag gtctagcacc tagaaagtcc atctgtagag taagaaacag 179821 aatatatgtt ttccatattt ttccaagtat gtgacaacaa acaacactct cttcctcttt 179881 taaaatgcca tgtactagta tctaccatat tccaggcaca gttctaaatg 
ctagagctat 179941 aacaatgaat aaataaaacc aaaatcccca tccttgcaga actgactttc tagtttcagg 180001 caaagagagc aaatgaatga ataaatgagg aatacatagt atgtgagatt gtattaaaga 180061 ctgcagagaa aaacaaagca aagaggaggg caaggaaggc tggtagggaa aggagtttta 180121 aaatttaaaa gagagtgatc agggaaagac ttgagtgata tttgggcaaa gatctgaagg 180181 agacaagaga gtgagccaga cacaaacagg agtcaaacat tccacagaga gaggtaggag 180241 tatgtctggt gtgtttgaag aacgattagg aggtcatagc aaaatgggca gcaagtatct 180301 ctcttcgtct aatgcaaagt ataaaaggcc tttatgtata ttacaagtct ataatccttt 180361 atctgtagtt ccaaaattga gaaagctt // LOCUS HUMRIGBCHA 11298 bp DNA PRI 09-JAN-1995 DEFINITION Human high affinity IgE receptor beta chain gene, complete cds. ACCESSION M89796 NID g337417 KEYWORDS IgE receptor beta chain. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11298) AUTHORS Kuster,H., Zhang,L., Brini,A.T., MacGlashan,D.W. and Kinet,J.P. TITLE The gene and cDNA for the human high affinity immunoglobulin E receptor beta chain and expression of the complete human receptor JOURNAL J. Biol. Chem. 
267 (18), 12782-12787 (1992) MEDLINE 92316966 FEATURES Location/Qualifiers source 1..11298 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" mRNA join(354..511,1381..1510,2026..2160,4475..4531,5079..5237, 5640..5738,7224..10214) /gene="high affinity IgE receptor beta chain" /product="high affinity IgE receptor beta chain" exon 354..511 /gene="high affinity IgE receptor beta chain" /number=1 gene 354..10214 /gene="high affinity IgE receptor beta chain" CDS join(456..511,1381..1510,2026..2160,4475..4531,5079..5237, 5640..5738,7224..7322) /gene="high affinity IgE receptor beta chain" /codon_start=1 /product="high affinity IgE receptor beta chain" /db_xref="PID:g337418" /translation="MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASS PPLHTWLTVLKKEQEFLGVTQILTAMICLCFGTVVCSVLDISHIEGDIFSSFKAGYPF WGAIFFSISGMLSIISERRNATYLVRGSLGANTASSIAGGTGITILIINLKKSLAYIH IHSCQKFFETKCFMASFSTEIVVMMLFLTILGLGSAVSLTICGAGEELKGNKVPEDRV YEELNIYSATYSELEDPGEMSPPIDL" intron 512..1380 /gene="high affinity IgE receptor beta chain" /number=1 exon 1381..1510 /gene="high affinity IgE receptor beta chain" /number=2 intron 1511..2025 /gene="high affinity IgE receptor beta chain" /number=2 exon 2026..2160 /gene="high affinity IgE receptor beta chain" /number=3 intron 2161..4474 /gene="high affinity IgE receptor beta chain" /number=3 exon 4475..4531 /gene="high affinity IgE receptor beta chain" /number=4 intron 4532..5078 /gene="high affinity IgE receptor beta chain" /number=4 exon 5079..5237 /gene="high affinity IgE receptor beta chain" /number=5 intron 5238..5639 /gene="high affinity IgE receptor beta chain" /number=5 exon 5640..5738 /gene="high affinity IgE receptor beta chain" /number=6 intron 5739..7223 /gene="high affinity IgE receptor beta chain" /number=6 exon 7224..10214 /gene="high affinity IgE receptor beta chain" /number=7 BASE COUNT 3470 a 2113 c 2283 g 3415 t 17 others ORIGIN 1 aagcttttca aaggtgcaat tggataactt ctgccatgag aaatggctga attgggacac 61 aagtggggac aattccagaa gaagggcaca 
tctctttctt ttctgcagtt ctttctcacc 121 ttctcaactc ctactaaaat gtctcatttt caggttctgt aaatcctgct agtctcaggc 181 aaaattatgc tccaggagtc tcaaattttc ttatttcata ttagtcttta tttagtagac 241 ttctcaattt ttctattcat cacaagtaaa agcctgttga tcttaatcag ccaagaaact 301 tatctgtctg gcaaatgact tatgtataaa gagaatcatc aatgtcatga ggtaacccat 361 ttcaactgcc tattcagagc atgcagtaag aggaaatcca ccaagtctca atataataat 421 attctttatt cctggacagc tcggttaatg aaaaaatgga cacagaaagt aataggagag 481 caaatcttgc tctcccacag gagccttcca ggtaggtaca aggtattatt tttttctacc 541 ctcagtcact tgtggcaggg gaagtcatag tcacggtgct taggagatga aactttattg 601 atttaggcat ggatccatct agtttaatta atatattggg tatgaggaag ctacttgctg 661 tactttccat gtggttctct ctccctggag aggaacattt ttactcagct tgcaaactgg 721 aaatagattt tctcacatta gaagctcatt ttctgggtat gagacaggag agttcatact 781 gtgtatgtag atctctggct tctgggtctg acatgtgctg agggacacat atccttcaca 841 catgctttta taaatacttg ataaagtaac ctgcttcttg attggtcttt ataatccata 901 agctgtggga tgcttctctg aagatgaaaa tagtaataga gtcccatcta gctattcaaa 961 gccattcctt cattgtattc tgtgcacatg aagttggggt ttgttactga caaaatatat 1021 tcagatacat ttctatgtta aaaggattgt gagatgcata ggtaaatgtg tttattttca 1081 gttttacttg tcaacataga tgaatgagaa agaacttgaa agtaacactg gattaagaat 1141 aggaaaattt ggcatggatt ttgctccatt ttgtcccatc taatcacttg gatagtgttc 1201 aggtgttctt ggtcagttac ttggatgctc tgagctttag tttcttggtg attacaatga 1261 agatttgaat tacaggatgg ctttgaaaaa ataaacaaaa ctcccctttc tgtctgtcga 1321 gaatgttgca cagggagtta cagaatgttc tcatgactga attgctttta aatttcacag 1381 tgtgcctgca tttgaagtct tggaaatatc tccccaggaa gtatcttcag gcagactatt 1441 gaagtcggcc tcatccccac cactgcatac atggctgaca gttttgaaaa aagagcagga 1501 gttcctgggg gtgagtgagc ctcctccaac tttgactaga gtaagggttg ggtctagaaa 1561 agaatattga gttgcatcaa ctgttttccc acttggattc atgagaggtg ttaggtcctt 1621 taaaaaacat ggtagataaa gagttgacac taactgggtc cttttgggaa gagccagaag 1681 catttcctca taaagacttt aaattgctag gacgagaatg gccaacagga gtgaaggatt 1741 cataacttta tctttactta gatgtaaaga acaattactg atgttcaaca 
tgactacata 1801 cataaaggcg catggagaaa agtattggcc ttccatgcat taggtagtgc ttgtatcaat 1861 tcttatagtg gctagggtat cctggaaaat cttacgtgtg gatcatttct caggacagtc 1921 taggacacta acgcagtttc tcatgtttgg cttctattat taaaaaatga tacaatctcg 1981 ggaaaatttt tttgattttc atgaaattca tgtgtttttc tataggtaac acaaattctg 2041 actgctatga tatgcctttg ttttggaaca gttgtctgct ctgtacttga tatttcacac 2101 attgagggag acattttttc atcatttaaa gcaggttatc cattctgggg agccatattt 2161 gtgagtatat atctataatt gtttctgaaa taacactgaa cataggtttt tctctttctc 2221 agatctaacc agttgtttat tcccagtatt aagatgatat ttataattct taattataaa 2281 tatatgtgag catatataac atagatatgc tcattaacaa caacaaaaga ttctttttac 2341 aattaacggt gggttaaaca tttagcccac agttttatcc catgagaaac ctgaatctaa 2401 tacaagttaa atgacttgcc taagggccac ttgactaata gtaattgaac ctaaactttc 2461 agaatccaac tccaggaaca tacttctagc actattcatc aataaagtta tatgataaat 2521 acatacaact ttatctgtca actaaaaata acaacagagg ctgggcatgg tggctcacac 2581 ccgtaatccc agcactttgg gaggctgagg caggtggatc acctgaggtc aggagtttga 2641 gaccagcctg accaacatgg tgaaacctca tctctactaa atataaaaaa ttagctgagt 2701 gtgatagtgc atacctgtaa tccagctact taagaggctg aggcaggagg cttgtttgaa 2761 cctggaaggc agaggttgca gtgagctgag attgtgccat tgcactccag cctgggcaat 2821 aagtgcgaac tctgtctcaa aataataata ataataatag aaaataaagt tgtcttcatg 2881 aaaaatgagg aaagagattg ctggggtgag aaacattaag atcaatgggc atatggtgac 2941 cttctatgcc ctagaaactc ttttanggta ttttctcctg gtatctcttt tacncatcgt 3001 tctatctgga aaaataggtg gatgagtgag ataataacgg tatatacttt ttaaaggtct 3061 aattgacata tataaattgc aagtatttca gatgtcaatt tgctaacctt gacacacata 3121 gacacacatg aaaacatcac cacattaata caatgtatgt atccatcatt ccaaaagctt 3181 ccctgtgtat ctttgtaact ctttcttcct ccctccactc cttgtcctct cgttcccaag 3241 aaaacattga tctgcttcct gtgaatataa attaacttac attttttaga gctttatata 3301 agtatgttct ctttactgtt tgtcttcctt cgctgcacag ttattttgag attcttcaag 3361 ttttttcttt atatcgatac ttcattcaca agaatatatt ttaattctag actatgtcac 3421 attgactttg tcgtctgcta aatccttagt gctcagatga cttgttcagg actctccttg 
3481 aacctgtacc tctgttanat tgaaacttgt ctctactgtc tttttatttc aaacacagct 3541 tattaggtgt ctctcaaccc atcaaacnca caatctgagt ctttaggaga ttgctttgaa 3601 tttgtgctat tgacttatat ntatatnaaa tntgtaaatg tttggtaaaa atatcatcat 3661 gtacnttttc ataattacgc tatntncaca tgatatatgt cagactctgg aaatatgcat 3721 gccacagaca cgtgtttctt gcctaaaggg gctgatggaa gacncacata cnaatagacg 3781 attgcagtag aatgagagtg gtggtctaan cagtacatgt cctgatgttg ctcggacagt 3841 tactacncca agagtacccc ctgcattgtc agggttagca tctcctggaa gcctcatgta 3901 aatgaagaat ttcatgctcc atccaggacc taatgaataa gaatctgcat tttagcaaga 3961 ccctcatatg attcatatac actttttttt ttttttttta gatggagtct cactcttgtc 4021 gcccaggctg gagtgcaatg gcatgatctt ggctcactgc aacctctgcc tcccgggttc 4081 aagtgattct cctgtctcag cctccctagt agctgggact acaggtgcat gccacagtgg 4141 ctggctaatt tttgtatttt tagtagagac agggtttcac cattttggtc aggctggtct 4201 tgaactcatg acctccggtg attcccccgc ctcggcttcc caaagtgctg ggattacaga 4261 catgagccac cacacccgcc ttattcgtat acncatttaa ttctgagaag cactctatag 4321 aaaataagaa taagaaaata ttgggctcac aggtgacatt aataagtaac tttatcgagt 4381 accccaaatt ttacctatgt ttggaagatg gggttaaaag gacacattga aaacaagaac 4441 tcattgtggc ttttttttcc tcctttttga acagttttct atttctggaa tgttgtcaat 4501 tatatctgaa aggagaaatg caacatatct ggtgagttgc ccgtttctgt ctttgtccat 4561 ccttgaaaag ataagaagaa cagagtttta agagtcttaa gggaaacaca tctttgtctc 4621 ctatattact tgtgaatgtg gatatatgat tttgtttcaa tctattttgt gtcctaaggc 4681 tttttgcaac agaagttgga tatatcatta gaaacataaa ttgtaccatt taacatacat 4741 gaagtttatg tttaccttga cgttcttcta aaaagtgtcc tacaccggca ttgtccttgt 4801 aggcatattc acatgatcaa ataaaataat tagttttcaa ttaaggagaa tatttgagga 4861 aagaccgtac gtgttcatgt ggttcctgaa ggcagtccag tgagaaagta atatatgctt 4921 cattaaacaa tgcggacatt ttcagggttt ccctttttaa ccaaaatttg gaagcaatgt 4981 ggaatttact ggatgcatcc agccctgaaa tgaagatagg tttattgaat gtgccagcaa 5041 gtgcaggccc aggtctgagt gttcttcatt attatcaggt gagaggaagc ctgggagcaa 5101 acactgccag cagcatagct gggggaacgg gaattaccat cctgatcatc aacctgaaga 5161 
agagcttggc ctatatccac atccacagtt gccagaaatt ttttgagacc aagtgcttta 5221 tggcttcctt ttccactgta tgtatttttt tttgtgtggg aagactaaga ttctgggtcc 5281 taatgtaagt aagaagccct cttctcctgt tccatgaaca ccatcctttt ctgtaacttc 5341 tattacacag tatagtggtt ctgtaagttc acacagccca gggagatgct ggctgcccac 5401 tcccctcaac ccaggcaaat tcctcggggt taaagttatc tactgcaagt gacgatctct 5461 gggtttttct gtgcctgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtatgtgtca 5521 ctttaaaagg actggtcaga tggtagggag atgaaaacag gagatgctat aagaaaataa 5581 acttttgggg cgaataccaa tgtgactctt tttgtttgtc atttgttgct gttcaatagg 5641 aaattgtagt gatgatgctg tttctcacca ttctgggact tggtagtgct gtgtcactca 5701 caatctgtgg agctggggaa gaactcaaag gaaacaaggt agatagaagc ccgatataaa 5761 atcttgaatg acaggttaac gaattggagc tttattcctt aaaatatggc ctgggttttc 5821 tgaaacattt cttccagaaa atagtttctc caagttttat tactttggtt tacaaatctc 5881 acatttaaat cacattttat accataagta gcacacattt cataatattc ctctgaatga 5941 gggttgggat aataggactg atatgttaga aatgccttaa agtgtgtgga gcatgagaga 6001 tggatgtaca gaaggcttgt gaggaaacca cccaggtatc tggccttgtt ttctgcccca 6061 gaactagccg cctattcctg tttctgtttt attcctttgt ttcttgactt ttcctttcca 6121 acttgctcta aaacctcagt tttctttcct ttctgattca tgactaccaa atgttttcac 6181 ttgcctcacc cgtccattac acctttgata agaaccacca gaccttgtgc tcatgtactt 6241 gcccatgtct gatggaagaa acatactctc tccatctgtc cactttcctg aggcattcaa 6301 gtctagccac cttttaaaat cactctcctc caggctgggc acggtgtcac gcctgtaatc 6361 tcagcacttt gtgaggctga ggagggcgga tcacttgaag tcaggagttc aaaaccagcc 6421 tggccaaatg gcaaaaccaa atcttcttca attataacca aatcttaaac caaatctcta 6481 ctaaaaaata caacaaaaca aaacaacaac aacaaaaaca gaaaaggaaa cattagccca 6541 gcgtggtggc aggtacctga ggttccagat acttgggagg ctgaagcagg agaatcgctt 6601 gagcccaaga gatggaggtt gcagtgagcc gagatcatgc cactgcacca cagccagggt 6661 gacagagcca tacttcccag cacattggga ggccaaagct gaagaataat ttgaggtgag 6721 gatttggaga ccagcctggc caacatggtg aaactccgtc tgtactaaaa atataaaact 6781 tagtggggca tgggggcaca cacctgtaat ttcagctact taggaggctg aggcaggaga 6841 attgcttgaa 
cccgggaggc ggaagttgca gtgagccaag atcgtggcca ctgcactcca 6901 gcctgggtga catagtgaga ttctgtctca aaaaaaataa aagaaattta aaaaatcact 6961 ctcttccaaa gatagataaa taagacagca gatatactaa ggaataacct caccaacttg 7021 tcattgactg acatgatttc ttttggccca cttggccagc tagtctggtt tggttttctg 7081 gaaatgaaag aaataatcag agtttaatga cagagagcgt gagacccaga aagacaaaag 7141 tagatgaggt aagtctcttg agcgagactt ctagggatgg gaaatttgtg gtgattgata 7201 tgaaatgatt tttcccttat caggttccag aggatcgtgt ttatgaagaa ttaaacatat 7261 attcagctac ttacagtgag ttggaagacc caggggaaat gtctcctccc attgatttat 7321 aagaatcacg tgtccagaac actctgattc acagccaagg atccagaagg ccaaggtttt 7381 gttaaggggc tactggaaaa atttctattc tctccacagc ctgctggttt tacattagat 7441 ttattcgcct gataagaata ttttgtttct gctgcttctg tccaccttaa tatgctcctt 7501 ctatttgtag atatgataga ctcctatttt tcttgtttta tattatgacc acacacatct 7561 ctgctggaaa gtcaacatgt agtaagcaag atttaactgt ttgattataa ctgtgcaaat 7621 acagaaaaaa agaaggctgg ctgaaagttg agttaaactt tgacagtttg ataatatttg 7681 gttcttaggg tttttttttt ttttagcatt cttaatagtt acagttgggc atgatttgta 7741 ccatccaccc atacccacac agtcacagtc acacacacat atgtattact tacactatat 7801 ataacttcct atgcaaatat tttaccacca gtcaataata catttttgcc aagacatgaa 7861 gttttataaa gatctgtata attgcctgaa tcaccagcac attcactgac atgatattat 7921 ttgcagattg acaagtagga agtggggaac ttttattaag ttactcgttg tctggggagg 7981 taaataggtt aaaaacaggg aaattataag tgcagagatt aacatttcac aaatgtttag 8041 tgaaacattt gtgaaaaaag aagactaaat taagacctga gctgaaataa agtgacgtgg 8101 aaatggaaat aatggttata tctaaaacat gtagaaaaag agtaactggt agattttgtt 8161 aacaaattaa agaataaagt tagacaagca actggttgac taatacatta agcgtttgag 8221 tctaagatga aaggagaaca ctggttatgt tgatagaatg ataaaaaggg tcgggcgcgg 8281 aggctcacgc ctgtaatccc agccctttgg gaggccgagg tgggcagatc acgaagtcag 8341 tagtttgaga ccagcctggc caacatagtg aaaccccgtc tctactaaaa atacaaaaaa 8401 aaaattagct gggtgtggtg gcagtcacct gtagtcccag ctacttggga ggatgaggca 8461 ggagaatcgc ttgaacctgg gaggcggagg ttgcagtgag ccgagatcgc accagtgcac 8521 tccagccttg gtgacaatgg 
gagactccat ctcaaaaaaa aaaaaaaaaa aaaaaagata 8581 aaaagtcaga aatctgaaaa gtggaggaag agtacaaata gacctaaatt aagtctcatt 8641 ttttggcttt gattttgggg agacaaaggg aaatgcagcc atagagggcc tgatgacatc 8701 caatacatga gttctggtaa agataaaatt tgatacacgg tttggtgtca ttataagaga 8761 aatcattatt aaatgaagca agttaacact ctaagagaat tattttgaga tagaagtgaa 8821 gctaagctaa acttcacatg cctataattg gagggaaaaa ctaaggataa aatctagcct 8881 agaagataca ataattagtc ataaacatgc attgtgaaac tgtagagagc aggtagccca 8941 aaatagagaa agattagata aagagaaaat aagtatccat cagagacagt atctctaggc 9001 ttgggcaaga gaaaagtcca cagtgataag caactccacc taaggcatga atatgcggca 9061 gagaaaacag caatagtgaa tgaatgcaaa aggtgctgag caaattccac acatgagtat 9121 tgtgcatgag taaatgaata aaacatttgc aaagaccttt agagaaagag aatgggagca 9181 tatgtgcgaa ataagatagt tgattatgaa tagaaggtag tgaagaaaag caagctaaga 9241 aaaaattctg tttataaaag aaggaaaaga tagtttatgt ttttagccta agtataagag 9301 tcctacagat ggactgaaaa aaatcagtct gagagtatta gtcacaatta atgaaataat 9361 tacattttat gtattgagga tgccaagatt aaaaggtgac aggtagatgt taatttccct 9421 agattgtgaa agtgatcacg acaatcacac aacaaataat taagtgactt ggtatgcttt 9481 atttaattgt agggcctgag gttttccatt ctcatttttc taaaatacaa ttttgtttct 9541 ccaaatttga cagcagaata aaaaccctac cctttcactg tgtatcatgc taagctgcat 9601 ctctactctt gatcatctgt aggtattaat cacatcactt ccatggcatg gatgttcaca 9661 tacagactct taaccctggt ttaccaggac ctctaggagt ggatccaatc tatatcttta 9721 cagttgtata gtatatgata tctcttttat ttcactcaat ttatattttc atcattgact 9781 acatatttct tatacacaac acacaattta tgaatttttt ctcaagatca ttctgagagt 9841 tgccccaccc tacctgcctt ttatagtacg cccacctcag gcagacacag agcacaatgc 9901 tggggttctc ttcacactat cactgcccca aattgtcttt ctaaatttca acttcaatgt 9961 catcttctcc atgaagacca ctgaatgaac accttttcat ccagccttaa tttcttgctc 10021 cataactact ctatcccacg atgcagtatt gtatcattaa ttattagtgt gcttgtgacc 10081 tccttatgta ttctcaatta cctgtatttg tgcaataaat tggaataatg taacttgatt 10141 tcttatctgt gtttgtgttg gcatgcaaga tttaggtact tatcaagata atggggaatt 10201 aaggcatcaa taaaatgatg 
ccaaagacca agagcagttt ctgaagtcct ccttttcatc 10261 agctctttat caaacagaac actctataaa caacccatag ccagaaaaca ggatgtagga 10321 acaatcacca gcacactcta taaacaaccc atagccagaa aacagaatgt aaggacaatc 10381 accagccatc ttttgtcaat aattgatgga atagagttga aaggaactgg agcatgagtc 10441 atatttgacc agtcagtcct cactcttatt tacttgctat gtaaacttga gaaagctttt 10501 ttctctttgt gaacctcagg ttttacatct gaaaatgaga aatttggaac aaaagattcc 10561 taactggtct ttctgttccc atattctgtg atttttcaat atttaggatt tttggtaatc 10621 acaattactt agtttgtggt tgagatagca acacgaatca gaactatttg gtggacatat 10681 tttcaaagga gtagctctcc actttgggta aagaagtgat gcnggtcgtg gtggctcacg 10741 cctgtaatcc cagcacttta gggaggccaa ggcgggtgga tcacgaggtc aggagatcga 10801 gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaaatacaaa aaattagcca 10861 ggcgtggtgg cgggcgcctg tagtcccacg tactcgggag gctgaggcag gagaatggca 10921 tgaaccaggg aggcggagct tgccgtgagc cgagatagcg ccactgcagt ccctcctggg 10981 caaaagagca agactgcgtc tcaaaaaaaa aaaaaaaaaa aaaaaaagaa gtgtgtggag 11041 tagcaggaca cctgcaacaa taatattttt ctaaatccct ctgaaaaatg ctaatcaaag 11101 ggtttttttc ctaaaaattg tcttagaaat aaaatttccc ctttgggaga ccgaggctgg 11161 cagatcacga ggtcaggaga tagagaccac ggtgaaaccc cgtctctact aaaaatacta 11221 aaaattagcc ggggngtggt ggtgggtaca cctgtagtcc cagctacttg gaggctgagg 11281 ctggagaatc acgtgaac // LOCUS HUMROD1X 2841 bp DNA PRI 09-JAN-1995 DEFINITION Human rod outer segment membrane protein 1 (ROM1) gene exons 1-3, complete cds. ACCESSION M96759 NID g292430 KEYWORDS disk morphogenesis; disk rim; peripherin-related protein; rod outer segment membrane protein 1; rod photoreceptor; transmembrane protein. SOURCE Homo sapiens (tissue library: lambda DASH) adult DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2841) AUTHORS Bascom,R.A., Schappert,K. and McInnes,R.R. TITLE Cloning of the human and murine ROM1 genes: genomic organization and sequence conservation JOURNAL Hum. 
Mol. Genet. 2 (4), 385-391 (1993) MEDLINE 93278386 FEATURES Location/Qualifiers source 1..2841 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_lib="lambda DASH" /map="Unassigned" mRNA join(637..1241,1629..1875,1992..2487) /gene="ROM1" /note="G00-120-350" exon 637..1241 /gene="ROM1" /note="G00-120-350" /number=1 gene join(637..1241,1629..1875,1992..2487) /gene="ROM1" CDS join(652..1241,1629..1875,1992..2210) /gene="ROM1" /codon_start=1 /db_xref="GDB:G00-120-350" /product="rod outer segment membrane protein 1" /db_xref="PID:g292431" /translation="MAPVLPLVLPLQPRIRLAQGLWLLSWLLALAGGVILLCSGHLLV QLRHLGTFLAPSCQFPVLPQAALAAGAVALGTGLVGVGASRASLNAALYPPWRGVLGP LLVAGTAGGGGLLVVALGLALALPGSLDEALEEGLVTALAHYKDTEVPGHCQAKRLVD ELQLRYHCCGRHGYKDWFGVQWVSSRYLDPGDRDVADRIQSNVEGLYLTDGVPFSCCN PHSPRPCLQNRLSDSYAHPLFDPRQPNQNLWAQGCHEVLLEHLQDLAGTLGSMLAVTF LLQALVLLGLRYLQTALEGLGGVIDAGGETQGYLFPSGLKDMLKTAWLQGGVACRPAP EEAPPGEAPPKEDLSEA" intron 1242..1628 /gene="ROM1" /note="G00-120-350" /number=1 exon 1629..1875 /gene="ROM1" /note="G00-120-350" /number=2 intron 1876..1991 /gene="ROM1" /note="G00-120-350" /number=2 exon 1992..2487 /gene="ROM1" /note="G00-120-350" /number=3 BASE COUNT 538 a 871 c 867 g 565 t ORIGIN 1 gcccggggcc gcagtctcca gacccccccg ggccctcgga ctctcccggg gccgctctcg 61 gctcccgggg gtggggtggc agggccgtcc ggtgccacag cgccgcagca caaacaggcg 121 ccggacgcgg agccgccagg aagcgcggga gggggggcgg gcccgagggg gggccgggcc 181 gcttggtaac ccctccctgt ccgggcctcg ccgctcagta cgggggcggg gctagccggc 241 tgaccccctg gcctactccc gccgtccggc tccaggccct tcccggatcc ccgcccccgg 301 attcccaggg gacggggaag gtagcgcccg ccccgatatc tccgcccccc agccccctaa 361 cccctcaggg ttgagcggac caaccccacc acttccgcga ggggcagggg cggggtcaca 421 aaacgggccc tcggcctagg ggcggagttt ctcgtaaggg gcaaggccaa ggcatcttgt 481 attggggctg acaggggggc gggttattag ggctgaggat gggaggtagc tcagggtatt 541 ggggtcaggg tggcattagc ccagctcaag ccgggccggg ctgactcagc atcctgcccc 601 agccagcttc catccctgac acctctgcac tcccttgggc agagatggga gatggcgccg 661 gtgttgcccc 
tggtgctgcc cctgcagccc cgcatccgcc tggcacaagg gctctggctc 721 ctctcctggc tgctggcgct ggctggtggc gtcatcctcc tctgtagtgg gcacctcctg 781 gtccagctaa ggcaccttgg caccttcctg gctccctcct gtcagttccc tgtcctgccc 841 caggctgccc tggcagcggg cgcggtggct ctgggcacag gactagtggg tgtaggagcc 901 agccgggcaa gtctgaatgc agctctatac cctccctggc gaggggtcct gggcccgctg 961 ctggtggctg gcacggctgg tggggggggg ctcctggtcg tcgccctcgg gctagccctg 1021 gctttgcctg ggagtctgga tgaggcgctg gaggagggcc tggtgactgc cttggctcac 1081 tacaaggaca cagaggtgcc tgggcactgt caggccaaaa ggctggtgga tgagctgcaa 1141 ctgaggtacc actgctgcgg gcgccacggg tacaaggatt ggtttggggt ccagtgggtc 1201 agcagccgtt acctggatcc cggtgaccgg gatgtggctg agtgagtgat ttgcgtctcc 1261 cttcctcctc ctcctcctcc ctggacaggc tccctcctgc tgccttgaat ccccacctcg 1321 ctcagagggg caataagtag aacatagtgg ctgagagact ggttacagct ctgccattta 1381 ctagctgtga aacccagggc atgttaccaa accaccctgg gcccattcct tcacctgtaa 1441 aatggaaata atagtactta tctgatagag ttgttgtgaa gatgtgaatt atgcttggct 1501 ggcacatagt acagcagtca gtaaatgttt tactattctt tttcccttct gaacacctgt 1561 gcccttcagt ccctccccca ggcctctatc tccagacatc cttaacccct ctgtccctcc 1621 ctttgcagcc ggatccagag caatgtagaa ggcctatacc tgactgatgg ggtccctttc 1681 tcctgttgca acccccactc accccggcct tgcctgcaaa accgtctttc agactcctac 1741 gcccaccccc tgttcgatcc ccgacaaccc aaccaaaacc tctgggccca agggtgccat 1801 gaggtgctgc tggagcactt gcaggacttg gcaggcacac tgggtagcat gctggctgtc 1861 accttcctac tgcaggtgag tcagcaaagc atctgacacc tcctcccacc cgggactcct 1921 ccctgcctcc aaccctgggc ctcttggaac cgctgactct ccctgactct ttccccttgc 1981 ttcccccaca ggctctggtg ctccttggcc tgcggtacct gcaaacagca ctggaggggc 2041 ttggaggggt cattgatgcg ggaggagaga cccagggcta tctctttccc agtgggctga 2101 aagatatgct gaaaacagca tggctacagg gaggggttgc ctgcaggcca gcacctgagg 2161 aggccccacc aggagaagca cctcccaagg aggatctatc tgaggcctag aggcctggag 2221 cttggggtga ggaagaggga gggatggaca agtctgaaaa cctcacaact ccttaccaag 2281 gctccaggtt ggggggatcg taggattaga ggggctaagg atagtcagcg agctggactg 2341 gggtaagaaa gaaaaccaga 
tgtcctaggg cctagccctt gtagtcagaa ccaccaggga 2401 acagcaaaga acagagtgat gggaaagtga catgagaagg cctggaggct gattctgata 2461 tagactcaat aaagtttttg gatggaagca attgcttttt cttgtcaagg ggatgggggc 2521 ctgggagaac tgatttctgt ctgatggagc agctaggact ccaaagtttg gaccctggct 2581 cgacctgtgc agcaacagga gcccacatct gtaaggatca gaaagcaaga acccaatgta 2641 agaagcaaag ggaaaacaag aggcccctcc aggttgagat tctttattct ggaggtagga 2701 agggggtcag catgctcagg tgggaagggt ccagcccagc tcctccagcc cccagtgcat 2761 gcccagcccc aataagttac ccagttactc agctgccctc cctcctgggt ccatctgtcc 2821 ttctgttcca ccctagacag g // LOCUS HUMRPS17A 4029 bp DNA PRI 09-JAN-1995 DEFINITION Human ribosomal protein S17 gene, complete cds. ACCESSION M18000 NID g337502 KEYWORDS 40S ribosomal subunit; Alu repeat; ribosomal protein. SOURCE Human leukocyte DNA, clone HGS17-7. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4029) AUTHORS Chen,I.T. and Roufa,D.J. TITLE The transcriptionally active human ribosomal protein S17 gene JOURNAL Gene 70 (1), 107-116 (1988) MEDLINE 89196902 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly submitted by D.Roufa 22-DEC-1987. 
FEATURES Location/Qualifiers source 1..4029 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" prim_transcript 173..3844 /note="S17 mRNA and introns" exon <198..200 /gene="RPS17" /note="S17 ribosomal protein" /number=1 gene join(198..200,493..644,1641..1746,2255..2320,3718..3798) /gene="RPS17" CDS join(198..200,493..644,1641..1746,2255..2320,3718..3798) /gene="RPS17" /note="S17 ribosomal protein" /codon_start=1 /db_xref="GDB:G00-128-851" /db_xref="PID:g337503" /translation="MGRVRTKTVKKAARVIIEKYYTRLGNDFHTNKRVCEEIAIIPSK KLRNKIAGYVTHLMKRIQRGPVRGISIKLQEEERERRDNYVPEVSALDQEIIEVDPDT KEMLKLLDFGSLSNLQVTQPTVGMNFKTPRGPV" intron 201..492 /note="intron A" exon 493..644 /gene="RPS17" /number=2 intron 645..1640 /note="intron B" repeat_region 929..1197 /note="Alu repetitive sequence element" exon 1641..1746 /gene="RPS17" /number=3 intron 1747..2254 /note="intron C" exon 2255..2320 /gene="RPS17" /number=4 intron 2321..3717 /note="intron D" exon 3718..>3798 /gene="RPS17" /note="S17 ribosomal protein" /number=5 BASE COUNT 885 a 960 c 1079 g 1105 t ORIGIN 1 gtcgggggcg ggacaggaca ccctgcccac cggaggggcg cgccaccgcc gcccccgctc 61 cactccgcca gggggcgctg tgtggctccc gagtaggctc agagcgaccg tggccgggcg 121 gaagcggctt tctggcctaa gctttaacag gcttcgcctg tgcttcctgt ttcctctttt 181 accaaggacc cgccaacatg gtaggtgttt tggctcgagg ccgccatcct ccacaatcgt 241 cccccatctg cagcttcccg cggtcctcag cccacagaga ggccctgggc cggaaggcgc 301 agagccctgt caggctggag cgcaacgggc cggcggcttc cgcattaagc ccggcgcccg 361 gcactgttta gccgggccct ctccggctca gcgcagccgg cggtccccaa ccacaccggc 421 cggggctcgg cgccgcactc ttcccagaag gttgcgctgg ctcgtgagtg atcctgtcct 481 ctccacccgc agggccgcgt tcgcaccaaa accgtgaaga aggcggcccg ggtcatcata 541 gaaaagtact acacgcgcct gggcaacgac ttccacacga acaagcgcgt gtgcgaggag 601 atcgccatta tccccagcaa aaagctccgc aacaagatag cagggtgagt cgggccttcc 661 tcggcctccg ggcgtccgcg atctgtttgt gtgctctgcc gccctccgac tcgggaagag 721 acggtgcctt ggtggtttcg ttagcctgag actccctggt gtcccggaga agggctagag 781 ctctgcgtgt gcaggatttc 
tgttgtgaac ttaggcctgt gtcatgactg gccgcggtga 841 ccctgggtcg aaggtagcca ccgtcagacc tcttgctttg aagctacgtt gcttttagtt 901 gtcaactttt ttctttcttt tttttctttt ttttttgaga cggaatcttg ccctggcgcc 961 caggctagag tgcagtggcg ccatctcggc tcactgcaac ctccgtctcc cgggttcaac 1021 gcgattgtca tgcctcagcc ttcccagtag ctgggattag aggcacgtgc taccacactc 1081 ggctaatttt tgtattttta gtagacaaca gggtttcaca atgttgacca ggctggtctc 1141 gaacacctga cctcaagcga tccgcccgcc tcggcctccc aaattgctgg gattacaggc 1201 gtgagcacct cgcccggctg gatgtcaact tctaatgctt acttcacagt tgcagttact 1261 gttcgagaga tttataggca tatggcctgg aagaatagtg ccaaagtcaa cgtattaatt 1321 gccaagttat gtgaatttga aattggaaga aaatgaaggc tgtgaagggt cgtgaaactg 1381 cggggcgggg cggggcgagt ggggggtggg gcgcttgatc atcatgaagg agagtttgaa 1441 tagagagtga gaaagccaag gggaaggaat gctcaggcca gtggaggttc tatgtgtagg 1501 aggtcccagg atagtgagtc ctcttccttc tcactgtgga aaagaactaa agcgggtatt 1561 tatatatcca catgtgctca gtaccaggtg ggagttgatt tgtcttctca ctgttctctt 1621 tggctgtgtg tgctttgtag ttatgtcacg catctgatga agcgaattca gagaggccca 1681 gtaagaggta tctccatcaa gctgcaggag gaggagagag aaaggagaga caattatgtt 1741 cctgaggtaa actttctgga tatttgggct tctggctaat cctcaaatga taaagctatg 1801 ggttcttggc ttccaaaggg acctcagctg aatctgagac ctttaatcca tacattccat 1861 aaccagctgt gggtgcccat aggcccccta ccagtttctt agcctccatt ctgctcttct 1921 ggaaccagtt aagtctattc catatttcat gagagccctt aatgttgatt tgatgactgc 1981 tgtcgtggct cgattgtttg cccctaatat agtattcttt ggtgtaaaca cagacttcaa 2041 gtgttctgaa tactgaatga gggcatggga aggtaaggcc agaagaacag gtcatcgggg 2101 tctgtaatac tgtaagagac tgctcacctt gtgagaggaa tgctatcctt gtgcagctcc 2161 tcttccccat atttaagaac ctggggagag gaggaggaga ggtgagtgat aatctcattg 2221 attggtattt tgaccctacc tcgtttcctt gtaggtctca gccttggatc aggagattat 2281 tgaagtagat cctgacacta aggaaatgct gaagcttttg gtaagtgttt gctggattcc 2341 taaagtggta ttttcctggt caaaaaccat cagtaggtct tattatccaa ggtcacccag 2401 ctagtaagtg gcagagttgg gaatcccatc taaacttgta gccactaaac tggactgctt 2461 tcataagggg tgtagcggaa gttgtgtgca 
ttgagttagt tcgccacaag atcatgattc 2521 ccaaatctag tgcttccagg acacttttct ctcttggctg cttctgttct tggtttatag 2581 ctggcttacg ttgccccctt tgtgtcagcc ctggctgaat ctgcctgctt ttcttctgtg 2641 aaggcattcc tttggtgtct ccatgctaac cactcagaaa tgagctttgg tccatttcca 2701 ggtacataca aggcactact taggtagttc acgagtatct tcattgcact ttgatgtttt 2761 ttctatggct tatcttagtc cttttgcata cttgggtgaa agcatgggct tcatgggcgg 2821 ttggtgattt tgaacaactt catttttcct atttttatgg tcattaaatg tttaagacac 2881 tcaaaaaggt agtagcaaaa tgaactcaca tgtactttcc cagctacagc cccgctgtgg 2941 gtcttttctt cctgaaataa ccactgcgct gaatgcagca tttttgttga aactaaaatg 3001 ggtttctgag gaccaaatag gacatggagt tccaggtata agttgggccc ttggcttctt 3061 gccctgccta ccagcctgaa cttgtttgag ggcagggtct gctcttgcct ccctcccttc 3121 tgcttgatgc cttgtgtact tgggcccttc acaaatgttg attagattag taatagctct 3181 tacccagggc tggtcctgga aagcaagtgg cactgtgtat tctcaaaggg acagagtttt 3241 gctttagagg acatttggca atgtccagac acatttttcg ttctcatagc caggggatac 3301 ctgctaactt ctaatgggta gaggccaagg atgttgctaa actttctgca atacacagta 3361 cagctgccta taacaaagaa tgtccagccc aggatgtcta cgttgccaag attgagaaac 3421 ggtgttttag agtgattgat aggaaggggt tgtgaaaaca gatacaaatg gcatggtaca 3481 ttgggtaaaa gtcatagagt tgtaaatggg gataccacat ggtcaggtgt gtgttgaagg 3541 aagatcactc tggaaaacgt gggaggcatt gagaaccctg ttggctctta aagggtcctt 3601 caggtcgcct ctggaccccc ttgagagtgc tgtggctcta ggggcctgtg ttttcttctg 3661 gggtccactc acagatctca ccagtaactg ctcactgaat ccattttctc gttccaggac 3721 ttcggcagtc tgtccaacct tcaggtcact cagcctacag ttgggatgaa tttcaaaacg 3781 cctcggggac ctgtttgaat tttttctgta gtgctgtatt attttcaata aatctgggac 3841 aacagccttg cctgtgtcat ctttgcagtt gtgtgtgggt agaggaaagg caggagctct 3901 cgaggcaaca gatcagctgg cacttgtctg ggattcaggt gctgctgcat ctggtttatg 3961 gggagtgggt tcaccagagt ttagtaaagg agcaagggca agcctgaacg tgttgagagt 4021 ggtctgcag // LOCUS HUMRPS6B 4990 bp DNA PRI 09-JAN-1995 DEFINITION Human ribosomal protein S6 gene, complete cds and flanking regions. 
ACCESSION   M77232
NID         g307392
KEYWORDS    phosphoprotein; ribosomal protein; ribosomal protein small subunit.
SOURCE      Homo sapiens (tissue library: Stratagene cat.# 943202; Lambda DASH)
            DNA.
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (sites)
  AUTHORS   Lott,J.B. and Mackie,G.A.
  TITLE     Isolation and characterization of cloned cDNAs that code for human
            ribosomal protein S6
  JOURNAL   Gene 65 (1), 31-39 (1988)
  MEDLINE   88284378
REFERENCE   2 (sites)
  AUTHORS   Heinze,H., Arnold,H.H., Fischer,D. and Kruppa,J.
  TITLE     The primary structure of the human ribosomal protein S6 derived
            from a cloned cDNA
  JOURNAL   J. Biol. Chem. 263 (9), 4139-4144 (1988)
  MEDLINE   88153727
REFERENCE   3 (bases 1 to 4990)
  AUTHORS   Pata,I., Hoth,S., Kruppa,J. and Metspalu,A.
  TITLE     The human ribosomal protein S6 gene: isolation, primary structure
            and location in chromosome 9 [published erratum appears in Gene
            1993 May 30;127(3):275]
  JOURNAL   Gene 121 (2), 387-392 (1992)
  MEDLINE   93077059
REFERENCE   4 (sites)
  AUTHORS   Pata,I., Hoth,S., Kruppa,J. and Metspalu,A.
TITLE The human ribosomal protein S6 gene: isolation, primary structure and location in chromosome 9 JOURNAL Gene 127 (2), 275-276 (1993) MEDLINE 93273247 FEATURES Location/Qualifiers source 1..4990 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_lib="Stratagene cat.# 943202; Lambda DASH" /map="9p21" mRNA join(312..358,927..1058,1627..1837,2031..2177,3893..4050, 4156..4288) /gene="RPS6" /note="G00-118-869" gene join(312..358,927..1058,1627..1837,2031..2177,3893..4050, 4156..4288) /gene="RPS6" exon 312..358 /gene="RPS6" /note="G00-118-869" /number=1 5'UTR 312..352 /gene="RPS6" CDS join(353..358,927..1058,1627..1837,2031..2177,3893..4050, 4156..4251) /gene="RPS6" /codon_start=1 /db_xref="GDB:G00-118-869" /product="ribosomal protein S6" /db_xref="PID:g307393" /translation="MKLNISFPATGCQKLIEVDDERKLRTFYEKRMATEVAADALGEE WKGYVVRISGGNDKQGFPMKQGVLTHGRVRLLLSKGHSCYRPRRTGERKRKSVRGCIV DANLSVLNLVIVKKGEKDIPGLTDTTVPRRLGPKRASRIRKLFNLSKEDDVRQYVVRK PLNKEGKKPRTKAPKIQRLVTPRVLQHKRRRIALKKQRTKKNKEEAAEYAKLLAKRMK EAKEKRQEQIAKRRRLSSLRASTSKSESSQK" intron 359..926 /gene="RPS6" /note="G00-118-869" /number=1 exon 927..1058 /gene="RPS6" /note="G00-118-869" /number=2 intron 1059..1626 /gene="RPS6" /note="G00-118-869" /number=2 exon 1627..1837 /gene="RPS6" /number=3 intron 1838..2030 /gene="RPS6" /note="G00-118-869" /number=3 exon 2031..2177 /gene="RPS6" /note="G00-118-869" /number=4 intron 2178..3892 /gene="RPS6" /note="G00-118-869" /number=4 exon 3893..4050 /gene="RPS6" /note="G00-118-869" /number=5 intron 4051..4155 /gene="RPS6" /note="G00-118-869" /number=5 exon 4156..4288 /gene="RPS6" /note="G00-118-869" /number=6 3'UTR 4252..4288 /gene="RPS6" BASE COUNT 1423 a 919 c 1154 g 1492 t 2 others ORIGIN 1 ggtaccgtca gatgcaaagt gcctgggaca gaagtggggc tccgctggcg cccagctcca 61 aaacccaggc agcgtggaaa agactagaca gggaaggggt tagccctcag aattacacgc 121 gggtttgcct taccagacta ccaccactgg cacaacctca gacccacacc caaccgactt 181 tacctcccag gccctgatta atctcgcccg gaagtaccgc ccacccatgc 
tcacttccgc 241 tatcccgtac ttctgctcat ctcgcgagaa ctgaaagcgc ctatgtgacc tgcgctaagc 301 ggaagttggc cctcttttcc gtggcgcctc ggaggcgttc agctgcttca agatgaaggt 361 aggtgatggt ggcgagtgtt agactgggtt tggggaacgt gaatcgagtc ccagaacgcg 421 gcattgcctc agttccagca ctccaggatc ctggctttag gtggagaagg gtctcaagta 481 ggagaaggct cgcctttctg gggcatggag ctttttggcc gaacggatgg caggcgattg 541 cggctggagc ggcgggccgg gagcgccatg gtggcgtccc cgcgcccagc cgggacggat 601 gcggcgtgtt gcccagtttg cggcaggcct gtggtgcggc tcttgacccc ggctttcttg 661 cttcgggagg gtgaacggct gcggagtgcc ctcgccccca gagtcattcc gcggggcttg 721 aggggaaaac gtcctgctga gtgcggcgct tcttgactgc tactctgctt tcacgtgctt 781 ttagtgagta cagtcggcat cttatatttc ctgcttgtgt ggaggcaaca tgaaaggctt 841 tttgcagtgg aattaacttt gtagatggct ctacaattac ctgtatagat agtttcgtaa 901 actatttccc cccttttaat ccttagctga acatctcctt cccagccact ggctgccaga 961 aactcattga agtggacgat gaacgcaaac ttcgtacttt ctatgagaag cgtatggcca 1021 cagaagttgc tgctgacgct ctgggtgaag aatggaaggt aaaagttgac aaattgttgc 1081 aggtatttaa gtcagagacg gtaaacgcca ttggtaactg gtatttggaa tggggttcag 1141 actccgggtt ctggcttctg acctttggta agttgcttcc gaatgccact ttataaagtt 1201 agaggtatta ccttggaggg ggggacgtag agtaagccat aaaatatacg taaagtttac 1261 atcaacataa ttcttgccct gcatcatgca tttggcaata tgtcacatag ctgtcctcat 1321 aatccccaaa gtgccaaaaa gggttgtatc tgatttgttt gttgctgttt gatattttat 1381 cttcttagtg ctgttatctc taataaaaca tcggttagaa atgcgacttg aaagacattt 1441 gataattgaa cttgacaagt tgggaatata gacaaaactc actgaacaga gaaaacgtgc 1501 ttaagttcaa attggttttc ataattatac tttctcatat aggtgtgttc acagaggtca 1561 caatcctgta ctgtataatt ttggaataga aatagtaaat gtgtcattca ttgtatgctc 1621 ttgcagggtt atgtggtccg aatcagtggt gggaacgaca aacaaggttt ccccatgaag 1681 cagggtgtct tgacccatgg ccgtgtccgc ctgctactga gtaaggggca ttcctgttac 1741 agaccaagga gaactggaga aagaaagaga aaatcagttc gtggttgcat tgtggatgca 1801 aatctgagcg ttctcaactt ggttattgta aaaaaaggtg agggttactt gtgttcaatt 1861 ttcgttgaaa ttgatttaaa gccagttgtc caaatgctta ttcatttgca gtatccagac 1921 
ttacagacca atggatttgt ttctctacca tcaattcaac attaaacatt ctgatttagg 1981 attcttaagg aagttgttga attaaaatct cttcaactgt ttcttctaag gagagaagga 2041 tattcctgga ctgactgata ctacagtgcc tcgccgcctg ggccccaaaa gagctagcag 2101 aatccgcaaa cttttcaatc tctctaaaga agatgatgtc cgccagtatg ttgtaagaaa 2161 gcccttaaat aaagaaggta ggagggatta tgcaattagg gcttgcttaa ttttggtaat 2221 ttgtctatca ttttgtgtgc atatcagaag ataaatatgt gcctttggaa ggccttagca 2281 ctattcactt ggagttttta cagaaaatat ttgggggtgg ccagcatacc cagttggtgt 2341 taatccttgc ttattatatc cagtgccttg tagagtgcag taatttcgtt gaaaatctgc 2401 ttgtgaatct ttgatgtata cattatccaa aaataacctg actttaaggt cattgctatc 2461 tggtttagga gtcacatgtt tgagaattct gggactatta gcaccacaca ctttgtaatt 2521 ttatatcaga tgttctaaat tattaatatt ttttaagaga tgaggtcaca ctgttgccca 2581 ggctggtctt gaacccttga gctcaagtga tgctcccact ttggcttccc aaagtgctgg 2641 gattatgggc gtgagcacca cacccagccc aaaattattt tttggcttac aaaagaagtt 2701 gctttaaact tctgtttcat ctgtaggata tggataacaa tagttaagag ttgtgagaat 2761 tagaagttag agaccttact cagaatacta gggagcaggt gctagggatt ttggagcttt 2821 accttttaag aacttttgtc catatagtag acaacatact caccagtgag gacctcaaat 2881 agcaatattt ctgtggtgaa atttggggat atttatgtta aataaacagc atgtcagtca 2941 aggttgggtt ttcctgacaa aatttgaaag aatttctctg gattttggtt ttcaggactg 3001 gtgtgactta acatacatgg catagatagt atacgtctgt taagtcagta attacactta 3061 cttgagcaca ggtactgttc aaggtgctga gaatacagta tagacaaatt cctaccctta 3121 tgaagctgat gttgcaaaaa aaaaaaaaaa gcaattgaca aataggtggc agttgcccat 3181 aaccatagcc ctggaaataa tccgtataat acaagcgtgg tctggttgat gggtattttt 3241 agatggttct tatgtctata gtaggaggct taacatttta gcagaatctg aacatctgag 3301 catttagcct aaggcagtta actccctaca aagtgtccat gaaacaaata catctgaaaa 3361 ggcctatnct ccaatgcaga tctgtaatcc atgagcagtt ttggagtaca gatcagatta 3421 atgaagatac tttaacaaca tcccacctaa tctgatgtta atagttaacg ttacccctga 3481 attggcagga attgtcaaag gaaatagaga tgcaagctta catcaaattt aaaagtatgg 3541 caggatgatt atgtagataa tgtgttaagt ggttttcaca gacaaattaa ctatgcacag 3601 ttatatatta 
acataatgcc actgtatttt agtaaaactg gaaggtagta aaagttagcc 3661 tggttatttt tcaggacttg gtttccctta tatttttata atatttaacc aatcttctag 3721 atttctgtgg agggggaaaa agcattttag aatagttctg atttgatttt aaaggcctca 3781 ttataggttt tcatagatgc tgttaagctt acgtttattt ctgatgtttt taaagtagat 3841 gtggattcta acaaaacaaa tattgaaatg attgagtcct ttgaattttt aggtaagaaa 3901 cctaggacca aagcacccaa gattcagcgt cttgttactc cacgtgtcct gcagcacaaa 3961 cggcggcgta ttgctctgaa gaagcagcgt accaagaaaa ataaagaaga ggctgcagaa 4021 tatgctaaac ttttggccaa gagaatgaag gttagtctaa gatgatttga gggggtggga 4081 ggagttcgac ctggctttgg atttgcgtct ttaacccaga aaggccttaa ctgtttgctg 4141 ttttgttttg tttaggaggc taaggagaag cgccaggaac aaattgcgaa gagacgcaga 4201 ctttcctctc tgcgagcttc tacttctaag tctgaatcca gtcagaaata agattttttg 4261 agtaacaaat aaataagatc agactctgat ctctgttgac tttycattgc tgatagaaaa 4321 ctacataaat gaagagaggg aaagttaggt catagaaaat ggggatatgt atatatggta 4381 ttccaatggt ggataaagtc agacctattg tgtgtggtag tggaagacaa ggtcagtgta 4441 caggggttcc tttcagggga tgggtggcac aggaagggat tctgagcata agacacctgc 4501 ctcttcagaa tggcatggaa cttaggggaa atgatgagga ctattaatct tttttatata 4561 gtttattttt gaggaatata gctctttaaa gcactataca caatttaata catgaaaatt 4621 gaccttgtga agatgaacat gtctttttta aaaggtaact tttaaggcat agttttagtg 4681 gatacctagt tttagtggat accatccact aaatggtttt taagccattt caaatttgtt 4741 ttaacaaagc ttttcaactc cagtgaattg ttaggctgtt cacacactga aaaggaaatg 4801 actaaaaaca ttcttggatc atgtaaaaat gcacaaagta tgtacttttt tctaacagtt 4861 ctatgctgac tagcctattg tgacaatatc aggatcttta gaaggtggca cattaggctt 4921 taaggttgaa gtaatgcaag ctctttgttg gcattactgt aataatagct ttacaaagca 4981 gagttctaaa // LOCUS HUMSAP01 1394 bp DNA PRI 20-JAN-1992 DEFINITION Human serum amyloid P component (SAP) gene with upstream promoter. ACCESSION D00097 NID g220067 KEYWORDS SAP; pentraxins; serum amyloid P component. SOURCE Human placenta DNA, clone Lm hSAP-8; Human liver, cDNA to mRNA, clone phSAP-1. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Mantzouranis,E.C., Dowton,S.B., Whitehead,A.S., Edge,M.D., Bruns,G.A. and Colten,H.R. TITLE Human serum amyloid P component. cDNA isolation, complete sequence of pre-serum amyloid P component, and localization of the gene to chromosome 1 JOURNAL J. Biol. Chem. 260 (12), 7752-7756 (1985) MEDLINE 85207828 REFERENCE 2 (bases 1 to 1394) AUTHORS Ohnishi,S., Maeda,S., Shimada,K. and Arao,T. TITLE Isolation and characterization of the complete complementary and genomic DNA sequences of human serum amyloid P component JOURNAL J. Biochem. 100 (4), 849-858 (1986) MEDLINE 87137351 COMMENT In [J. Biochem. 100, 849-858 (1986)], they isolated the human SAP cDNA and genomic DNA clones, elucidated the nucleotide sequences, and assigned the cap site of the SAP gene which was not done in [1]. In addition, they compared the genomic DNA sequence of human SAP with that of the reported human CRP. FEATURES Location/Qualifiers source 1..1394 /organism="Homo sapiens" /db_xref="taxon:9606" TATA_signal 143..149 exon 172..331 /note="serum amyloid P component (SAP), exon 1" prim_transcript 172..1210 CDS join(268..331,447..1054) /partial /note="serum amyloid P component" /codon_start=1 /db_xref="PID:d1000504" /db_xref="PID:g220068" /translation="MNKPLLWISVLTSLLEAFAHTDLSGKVFVFPRESVTDHVNLITP LEKPLQNFTLCFRAYSDLSRAYSLFSYNTQGRDNELLVYKERVGEYSLYIGRHKVTSK VIEKFPAPVHICVSWESSSGIAEFWINGTPLVKKGLRQGYFVEAQPKIVLGQEQDSYG GKFDRSQSFVGEIGDLYMWDSVLPPENILSAYQGTPLPANILDWQALNYEIRGYVIIK PLVWV" sig_peptide 271..324 /partial /note="SAP signal peptide" mat_peptide join(325..331,447..1051) /note="SAP mature peptide" intron 332..446 /note="SAP cds intron" exon 447..1210 /note="SAP, exon 2" misc_feature 533..541 /note="N-linked glycosylation site (putative)" conflict 565 /note="t in [J. Biochem. 
100, 849-858 (1986)]; c in [1]" /citation=[1] conflict 682 /note="a in [J. Biochem. 100, 849-858 (1986)]; t in [1]" /citation=[1] conflict 683 /note="t in [J. Biochem. 100, 849-858 (1986)]; c in [1]" /citation=[1] conflict 685 /note="c in [J. Biochem. 100, 849-858 (1986)]; a in [1]" /citation=[1] misc_feature 767..775 /note="N-linked glycosylation site (putative)" conflict 814 /note="g in [J. Biochem. 100, 849-858 (1986)]; a in [1]" /citation=[1] conflict 943 /note="t in [J. Biochem. 100, 849-858 (1986)]; c in [1]" /citation=[2] polyA_signal 1182..1187 /note="polyadenylation signal of SAP gene (putative)" polyA_site 1210 BASE COUNT 375 a 312 c 302 g 405 t ORIGIN 171 bp upstream of 5' end of SAP mRNA. 1 agccatcact tgtctctaat aaataactcc cattgatttt ccagctcagg gctcaccact 61 ccttaccgta agcgcaggag gagactggaa aatcactcac atattattgg tgctcttcct 121 cccccatcct cacccaaggt gcatataaac cctgaataac ctgaagtcta agggcatgaa 181 tatcagacgc tagggggaca gccactgtgt tgtctgctac cctcatcctg gtcactgctt 241 ctgctataac agccctaggc caggaatatg aacaagccgc tgctttggat ctctgtcctc 301 accagcctcc tggaagcctt tgctcacaca ggtaaggagg tgaaggaatg gtcaagaatc 361 ataaagtgag aaaataggtt gaagctgaga tatcttttcc ctgcatttat actgaaggtc 421 attatctttc tttctttatc ccgcagacct cagtgggaag gtgtttgtat ttcctagaga 481 atctgttact gatcatgtaa acttgatcac accgctggag aagcctctac agaactttac 541 cttgtgtttt cgagcctata gtgatctctc tcgtgcctac agcctcttct cctacaatac 601 ccaaggcagg gataatgagc tactagttta taaagaaaga gttggagagt atagtctata 661 cattggaaga cacaaagtta catccaaagt tatcgaaaag ttcccggctc cagtgcacat 721 ctgtgtgagc tgggagtcct catcaggtat tgctgaattt tggatcaatg ggacaccttt 781 ggtgaaaaag ggtctgcgac agggttactt tgtggaagct cagcccaaga ttgtcctggg 841 gcaggaacag gattcctatg ggggcaagtt tgataggagc cagtcctttg tgggagagat 901 tggggatttg tacatgtggg actctgtgct gcccccagaa aatatcctgt ctgcctatca 961 gggtacccct ctccctgcca atatcctgga ctggcaggct ctgaactatg aaatcagagg 1021 atatgtcatc atcaaaccct tggtgtgggt ctgaggtctt gactcaacga gagcacttga 1081 aaatgaaatg actgtctaag 
agatctggtc aaagcaactg gatactagat cttacatctg 1141 cagtctttct tctttgaatt tcctatctgt atgtctgcct aattaaaaaa atatatattg 1201 tattatgcta cctgcatttg tttagtgctt gtcatagtcc catatcttta tcttatgtct 1261 actacttatc tatctactaa ttggtgtttc attggtaatt ggtgtttcat tatcctgaaa 1321 actccaattg ccaagtacgg ggaggaaaac ctgtaagtaa ctagaaagat atatcgcaaa 1381 gccagagcac tcaa // LOCUS HUMSEMI 4164 bp DNA PRI 16-MAY-1994 DEFINITION Human semenogelin I (SEMGI) gene, complete cds. ACCESSION M81650 NID g307416 KEYWORDS gel-forming protein; semenogelin I; seminal plasma protein. SOURCE Homo sapiens (library: lambda EMBL3) blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4164) AUTHORS Lilja,H., Abrahamsson,P.-A. and Lundwall,A. TITLE Semenogelin, the predominant protein in human semen: Primary structure JOURNAL J. Biol. Chem. 264, 1894-1900 (1989) MEDLINE 89109215 REFERENCE 2 (bases 1 to 4164) AUTHORS Ulvsbaeck,M., Lazure,C., Lilja,H., Spurr,N., Rao,G., Loeffler,C., Hansmann,I. and Lundwall,A. TITLE Gene structure of semenogelin I and II. The predominant proteins in human semen are encoded by two homologous genes on chromosome 20 JOURNAL J. Biol. Chem. 
267, 18080-18084 (1992) MEDLINE 92388176 FEATURES Location/Qualifiers source 1..4164 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /tissue_type="blood" /tissue_lib="lambda EMBL3" TATA_signal 668..673 /note="putative" exon 696..792 /gene="SEMGI" /number=1 mRNA join(696..792,1037..2393,3264..3430) /gene="SEMGI" /note="mRNA start site experimentally determined" gene 696..3430 /gene="SEMGI" sig_peptide 717..785 /gene="SEMGI" /note="putative" CDS join(717..792,1037..2349) /gene="SEMGI" /codon_start=1 /db_xref="PID:g487420" /translation="MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK GQHYSGQKGKQQTESKGSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYS NTEERLWVHGLSKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKG HYQNVVEVREEHSSKVQTSLCPAHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQD QQHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPS QEQEHSQKANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPN PKQEPWHGENAKGESGQSTNREQDLLSHEQNGRHQHGSHGGLDIVIIEQEDDSDRHLA QHLNNDRNPLFT" intron 793..1036 /gene="SEMGI" /number=1 exon 1037..2393 /gene="SEMGI" /number=2 intron 2394..3263 /gene="SEMGI" /number=2 exon 3264..3430 /gene="SEMGI" /number=3 polyA_signal 3407..3412 /gene="SEMGI" polyA_site 3430 /gene="SEMGI" /note="putative" BASE COUNT 1377 a 833 c 853 g 1101 t ORIGIN 1 gaattcagac ttcctgagaa ggttgcagct ggtctccatc cttcaactga caacccaatg 61 caattgtatt catctctcat taaagttact agcaacaact aagaaaaact ttggatacat 121 gtcagtcttt attgtgttga acttgacaga catatttgac attggagtgc cccatttact 181 tgtttaaaat ttctttcttt ggattccata acgtttcatt atcttctata tcattggact 241 tctatgattg ctcattctta ggtaaaatag aacgtgtctc aagatcattt ttaaaacaaa 301 aggaagtatg gagttctatc taaaatagag tatttaaatt attgcatcat gaagggaaac 361 tcacatttag aatataaagg cattctcaga aaccaattgc ttttgtagcc tgaagccagt 421 ctctttccta cttttgtcta taattctgta aaattaaagt gatctggcat gatgatctaa 481 aaggactgat aaaaatttgc tggggctgcc agaagaaaga aaatgccttt gacattatgt 541 tgctggggac attgactttg tcccacactc 
agcaggggtg aggaagttgg catttactga 601 taagctaaga aaagtcagtg ccttttgtaa tttaagctcc acccatggca cactcactca 661 aggaagatat aaatgacaag gtcggctcag ctctcagaca aggttttcca agcaagatga 721 agcccaacat catctttgta ctttccctgc tcctcatctt ggagaagcaa gcagctgtga 781 tgggacaaaa aggtgagtgg agagggtaag ccttggggaa agctgctcag acagctaata 841 atctaagatt atttggggtg caaacagtaa cctgtttagg cacagattct tctctttgaa 901 gagaattgat tttctccacc caaagcttca gcttttctag aaatatcaat aatttgttgg 961 gggaaaagtg gggggtaaga gttgtaaggg agctttggag ataatgaatg cataccttct 1021 tattatcaat taccaggtgg atcaaaaggc cgattaccaa gtgaattttc ccaatttcca 1081 cacggacaaa agggccagca ctattctgga caaaaaggca agcaacaaac tgaatccaaa 1141 ggcagttttt ctattcaata cacatatcat gtagatgcca atgatcatga ccagtcccga 1201 aaaagtcagc aatatgattt gaatgcccta cataagacga caaaatcaca acgacatcta 1261 ggtggaagtc aacaactgct ccataataaa caagaaggca gagaccatga taaatcaaaa 1321 ggtcattttc acagggtagt tatacaccat aaaggaggca aagctcatcg tgggacacaa 1381 aatccttctc aagatcaggg gaatagccca tctggaaagg gaatatccag tcaatattca 1441 aacacagaag aaaggctgtg ggttcatgga ctaagtaaag aacaaacttc cgtctctggt 1501 gcacaaaaag gtagaaaaca aggcggatcc caaagcagtt atgttctcca aactgaagag 1561 ctagtagcta acaaacaaca acgtgagact aaaaattctc atcaaaataa agggcattac 1621 caaaatgtgg ttgaagtgag agaggaacat tcaagtaaag tacaaacctc actctgtcct 1681 gcgcaccaag acaaactcca acatggatcc aaagacattt tttctaccca agatgagctc 1741 ctagtatata acaagaatca acaccagaca aaaaatctca atcaagatca acagcatggc 1801 cgaaaggcaa ataaaatatc ataccaatct tcaagtacag aagaaagacg actccactat 1861 ggagaaaatg gtgtgcagaa agatgtatcc caaagcagta tttatagcca aactgaagag 1921 aaagcacagg gcaagtctca aaaacagata acaattccca gtcaagagca agagcatagc 1981 caaaaggcaa ataaaatatc ataccaatct tcaagtacgg aagaaagacg actccactat 2041 ggagaaaatg gtgtgcagaa agatgtatcc caacgcagta tttatagcca aactgaaaag 2101 ctagtagcag gcaagtctca aatccaggca ccaaatccta agcaagagcc atggcatggt 2161 gaaaatgcaa aaggagagtc tggccaatct acaaatagag aacaagacct actcagtcat 2221 gaacaaaacg gcagacacca acatggatct catgggggat 
tggatattgt aattatagag 2281 caggaagatg acagtgatcg tcatttggca caacatctta acaacgaccg aaacccatta 2341 tttacataaa cctaccattc ggtaaccatg tgaaaggatg gaccaatatc aaggtaattt 2401 ttttttagca aataggggag atatctctct cattgtttag aattgttggg gactctccag 2461 gacttttgtg ggattgataa ccattgttcg caccaataga agtgctgtat aacaagtggt 2521 aggaagatga gcctccccat tccctggtga ggagagggtc tggtagtggc acagaaggat 2581 gtttggccta tcatgaggtc ctagaattcc tatccttaat tgaatattct tcaataatat 2641 ttttttgtat atgcctacct gctaaaggtt tttttgaaca ggtactgact acatatgcat 2701 atttataagt ttatggtata ctcttgtcag ttcttatact ttagattagt aaacctcagt 2761 ttctttccta tatagtataa aggattacag ccattaatgt tttctttctg cacaccagtg 2821 aatgttcttg catccctgtt agggttcatc tatgctcctt cagagaccac aagcccaaag 2881 actatcagtc ctccctctca gaatacagga aagagatatg tagaaagatc cttgttcatg 2941 ttgtgaaaga gaagatggaa agggagacac aagagatggt gagggatctc atgggccaat 3001 tgggtgttta caacatcagg agaagaaaac tgaagccctg agaggggtaa tggtgcttcc 3061 ctgcctcttt gcagtaagta acactgcact tgaagagagg gcaagcagta gaaacagtgg 3121 ccccagctgg caactagtgg cctggtatcc tagtagtaga gggctggtac agttggagct 3181 gaaggggaca gggggcccag ctaccatccc cacccttcac tttttactat actcacatca 3241 ctgattctct tctttctctc taggtgtcag ttgacctcag tgaattctgt gatgtttctg 3301 agatgcagac tcccgtgtag tttcagattc ttggtccatg gatgacacca cctgcccatg 3361 cttccttgaa ttaggctttc ctaacctgaa gcgccttcaa acttccaata aagagatcat 3421 tttctgcttc atctgctttt gactcctgtt ttctctggat tcctagtggc tcaaggggtt 3481 gagggcattt ctaatgtaag tggttttgaa agaaaataaa gggatatttt tgaagcacta 3541 gggagaatag taatcaaagc cacatttcaa gatatattga tagaaaccaa cactaaaaat 3601 agggtgctgt tagtctacca gtcacatgtc taccacctga ggccaacact aaaggctttc 3661 atgaccctgg tactaataag accccaggaa ctaaaaatat cccagaaaga caagagaaaa 3721 tgtattgcaa caggccctgg gaaagtccct gatctgctgg tgatcaattc cacagtagac 3781 acatcccttc aattagctta agaatgtctt acgcctacag tatccattga gatgcaatta 3841 caccaaaggt gatatgttaa atctgcacaa cttcagtaac agcaagtggg ttaaaatgtc 3901 acctgtggat ggaaagacca gatgaccagt caaaggaatc ttgaggcagg 
tggtgtcttc 3961 ataatggaaa tactgctgcc tatactgcct tttccccctc tattttaaaa aagtttccag 4021 agaaatctaa accttcagga gaagaggcca acatagggga gtgagaaaat cacaagccta 4081 agaataagag atttggattg tagttccaat tctgccataa taacctgtat atctttacct 4141 aaatatgttc ttccctgcaa gctt // LOCUS HUMSOMI 2667 bp DNA PRI 13-JAN-1995 DEFINITION Human somatostatin I gene and flanks. ACCESSION J00306 NID g338287 KEYWORDS neuropeptide Y; somatostatin; somatostatin I; somatostatin-14; somatostatin-28. SOURCE Human fetal liver DNA, Charon 4A library, clone pHSI-1-2.7 [2], and pancreatic somatostatinoma tissue, cDNA to mRNA [1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1126 to 1368; 2246 to 2605) AUTHORS Shen,L.P., Pictet,R.L. and Rutter,W.J. TITLE Human somatostatin I: sequence of the cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79 (15), 4575-4579 (1982) MEDLINE 83014931 REFERENCE 2 (bases 1 to 2667) AUTHORS Shen,L.P. and Rutter,W.J. TITLE Sequence of the human somatostatin I gene JOURNAL Science 224 (4645), 168-171 (1984) MEDLINE 84146798 COMMENT [1] mRNA. [1] reports the mRNA, [2] reports the gene. The somatostatin gene appears to belong to a family of related genes with individual moieties having distinct biological activities. Only a single somatostatin peptide has been isolated from mammals. Somatostatins modulate the secretion of several hormones and may have a role in neurotransmission. The coding region of this sequence predicts a 116-amino acid precursor protein of somatostatin-I that contains regions coding for both somatostatin-14 and somatostatin-28 at its COOH-terminus. By comparison with somatostatin-28 from pig and sheep it was found that the predicted amino acid sequence of human somatostatin-28 is identical to that isolated from pig and sheep, suggesting that these molecules are derived from a common precursor. 
A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. FEATURES Location/Qualifiers source 1..2667 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q28" prim_transcript 1126..2605 /note="som I mRNA" gene 1231..1368 /gene="SST" sig_peptide 1231..1302 /gene="SST" /note="prosomatostatin I signal peptide" CDS join(1231..1368,2246..2458) /note="preprosomatostatin I" /codon_start=1 /db_xref="PID:g338288" /translation="MLSCRLQCALAALSIVLALGCVTGAPSDPRLRQFLQKSLAAAAG KQELAKYFLAELLSEPNQTENDALEPEDLSQAAEQDEMRLELQRSANSNPAMAPRERK AGCKNFFWKTFTSC" exon <1231..1368 /gene="SST" /note="preprosomatostatin I; G00-119-604" /number=1 intron 1369..2245 /note="som I cds intron A" exon 2246..>2458 /note="preprosomatostatin I" /number=2 mat_peptide 2372..2455 /note="somatostatin-28 peptide" mat_peptide 2414..2455 /note="somatostatin-14 peptide" BASE COUNT 659 a 589 c 613 g 806 t ORIGIN Chromosome 3q28; 1 bp upstream of EcoRI site. 
1 gaattcaagg acaggttttc ttaaactttc tttgtttcta ggagatcagg cagagctgaa 61 tttaaccaag aatcttttga tcctttccac atatagatat acaatagtgg tcacatatgt 121 tctgggagtt cctagacctt atatgtctaa actggggctt cctgacataa aactatgctt 181 accggcagga atctgttaga aaactcagag ctcagtagaa ggaacactgg ctttggaatg 241 tggaggtctg gttttgctca aagtgtgcag tatgtgaagg agaacaattt actgaccatt 301 actctgcctt actgattcaa attctgaggt ttattgaata atttcttaga ttgccttcca 361 gctctaaatt tctcagcacc aaaatgaagt ccatttcaat ctctctctct ctctttccct 421 cccgtacata tacacacact catacatata tatggtcaca atagaaaggc aggtagatca 481 gaagtctcag ttgctgagaa agagggaggg agggtgagcc agagtacttc tcccccattg 541 tagagaaaag tgaagttctt ttagagcccc gttacatctt caaggccttt tatgagataa 601 tggaggaaat aaagagggct cagtccttct accgtccata tttcattctc aaatctgtta 661 ttagaggaat gattctgatc tccacctacc atacacatgc cctgttgctt gttgggcctt 721 acactaaaat gttagagtat gatgacagat ggagttgtct gggtacattt gtgtgcattt 781 aagggtgata gtgtatttgc tctttaagag ctgagtgttt gagcctctgt ttgtgtgtaa 841 ttgagtgtgc atgtgtggga gtgaaattgt ggaatgtgta tgctcatagc actgagtgaa 901 aataaaagat tgtataaatc gtggggcatg tggaattgtg tgtgcctgtg cgtgtgcagt 961 attttttttt ttttaagtaa gccactttag atcttgtcac ctcccctgtc ttctgtgatt 1021 gattttgcga ggctaatggt gcgtaaaagg gctggtgaga tctgggggcg cctcctagcc 1081 tgacgtcaga gagagagttt aaaacagagg gagacggttg agagcacaca agccgcttta 1141 ggagcgaggt tcggagccat cgctgctgcc tgctgatccg cgcctagagt ttgaccagcc 1201 actctccagc tcggctttcg cggcgccgag atgctgtcct gccgcctcca gtgcgcgctg 1261 gctgcgctgt ccatcgtcct ggccctgggc tgtgtcaccg gcgctccctc ggaccccaga 1321 ctccgtcagt ttctgcagaa gtccctggct gctgccgcgg ggaagcaggt aaggagactc 1381 cctcgacgtc tcccggattc tccagccctc cctaagcctt gctcctgccc cattggtttg 1441 gacgtaaggg atgctcagtc cttctaaaga gttttggtgc ttttctgggt ccctcagctc 1501 ccgaagctct tgagaaaact atcaaaggct agaatcccct tctaactctt tttttccccc 1561 atgataagcg cagtcggtca cagttcaggt gagttcttac ttggcattca agaaaattac 1621 aaaatctggg tagttgtctg ggcacgaagc gacaatggcg tctatccctg gtgctgaccc 1681 tgggaagcgc tgacccaggt 
gctgaaacgc agacctctga agctgctacc tcttagcgta 1741 cctcacttcc aaacgtcggg actagggcaa aggggcaatc taaagaccga acgccgtatg 1801 tttgagattg tgagaagcct cgttccccta cagttttact tggtaaaaat ggtaaaacaa 1861 ttctactttg tagctcgtga tgtgaaaatt gaattaaact gttggcacac actttatctt 1921 accagaacgg tctttatgtg tgtgtgtgtg tgtgtgtgtg tgtttgtgcg tgtgtgtgtg 1981 tgtgtgtgtg tgttaagtct acagggacag aaaggttgca gaaacatttg agctcttaaa 2041 gcctttttgt gtaacttggt aattatagca actatcctta tttttatatc cttgattgat 2101 tttaaatgtg acaaaaaatg cgcagctgta aaaactggat tttgtgtgtg accaaatctg 2161 ttctttaatt taggcttttc aaattttttc cattgtcctc cccacttctc tttctctctt 2221 tttctatccc ttctgcccta tacaggaact ggccaagtac ttcttggcag agctgctgtc 2281 tgaacccaac cagacggaga atgatgccct ggaacctgaa gatctgtccc aggctgctga 2341 gcaggatgaa atgaggcttg agctgcagag atctgctaac tcaaacccgg ctatggcacc 2401 ccgagaacgc aaagctggct gcaagaattt cttctggaag actttcacat cctgttagct 2461 ttcttaacta gtattgtcca tatcagacct ctgatccctc gcccccacac cccatctctc 2521 ttccctaatc ctccaagtct tcagcgagac ccttgcatta gaaactgaaa actgtaaata 2581 caaaataaaa ttatggtgaa attatgaaaa atgtgaattt ggtttctatt gagtaaatct 2641 ttttgttcaa taatacataa taagctt // LOCUS HUMSPBAA 10476 bp DNA PRI 08-AUG-1995 DEFINITION Human pulmonary surfactant-associated protein SP-B (SFTP3) mRNA, complete cds. ACCESSION M24461 NID g338326 KEYWORDS pulmonary surfactant-associated protein SP-B. SOURCE Human DNA and lung cDNA to mRNA, clones 4-3 and 7-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10476) AUTHORS Pilot-Matias,T.J., Kister,S.E., Fox,J.L., Kropp,K., Glasser,S.W. and Whitsett,J.A. TITLE Structure and organization of the gene encoding human pulmonary surfactant proteolipid SP-B JOURNAL DNA 8 (2), 75-86 (1989) MEDLINE 89170128 COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by T.J.Pilot-Matias 28-APR-1989. 
FEATURES Location/Qualifiers source 1..10476 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /map="2" gene 1040..10375 /gene="SFTP3" exon 1040..1120 /gene="SFTP3" /note="G00-120-374" /number=1 CDS join(1054..1120,1431..1558,2060..2131,2495..2620, 3447..3635,3863..3952,5386..5569,5742..5887,7122..7202, 7697..7759) /gene="SFTP3" /codon_start=1 /db_xref="GDB:G00-120-374" /product="pulmonary surfactant-associated protein SP-B" /db_xref="PID:g338327" /translation="MAESHLLQWLLLLLPTLCGPGTAAWTTSSLACAQGPEFWCQSLE QALQCRALGHCLQEVWGHVGADDLCQECEDIVHILNKMAKEAIFQDTMRKFLEQECNV LPLKLLMPQCNQVLDDYFPLVIDYFQNQIDSNGICMHLGLCKSRQPEPEQEPGMSDPL PKPLRDPLPDPLLDKLVLPVLPGALQARPGPHTQDLSEQQFPIPLPYCWLCRALIKRI QAMIPKGALRVAVAQVCRVVPLVAGGICQCLAERYSVILLDTLLGRMLPQLVCRLVLR CSMDDSAGPRSPTGEWLPRDSECHLCMSVTTQAGNSSEQAIPQAMLQACVGSWLDREK CKQFVEQHTPQLLTLVPRGWDAHTTCQALGVCGTMSSPLQCIHSPDL" intron 1121..1430 /gene="SFTP3" /note="G00-120-374" /number=1 exon 1431..1558 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" intron 1559..2059 /gene="SFTP3" /note="Intron B" exon 2060..2131 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" intron 2132..2494 /gene="SFTP3" /note="Intron C" exon 2495..2620 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" intron 2621..3446 /gene="SFTP3" /note="Intron D" exon 3447..3635 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" intron 3636..3862 /gene="SFTP3" /note="Intron E" exon 3863..3952 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" intron 3953..5385 /gene="SFTP3" /note="Intron F" exon 5386..5569 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" /number=7 intron 5570..5741 /gene="SFTP3" /note="Intron G" exon 5742..5887 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" /number=8 variation 5838 /gene="SFTP3" /note="c in DNA; t in cDNA" /replace="t" intron 5888..7121 /gene="SFTP3" /note="Intron H" exon 7122..7202 /gene="SFTP3" /note="pulmonary surfactant protein SP-B" /number=9 intron 7203..7696 /gene="SFTP3" /note="Intron 
I" exon 7697..7779 /gene="SFTP3" /note="G00-120-374" /number=10 intron 7779..9552 /gene="SFTP3" /note="G00-120-374" exon 9553..10375 /gene="SFTP3" /note="G00-120-374" /number=11 variation 9691 /gene="SFTP3" /note="g in DNA; c in cDNA" /replace="c" variation 10345 /gene="SFTP3" /note="g in DNA; a in cDNA" /replace="a" BASE COUNT 2416 a 3059 c 2944 g 2057 t ORIGIN 1 ggatcctccc tcctcggcct cccaaagtgc caggattaca ggagtgagcc accacaccca 61 gccccatctc ttttcatcat ggtactaatt cctgcccgtc cacccacaaa agcactgtag 121 tcgttcccga gtatagaggc ctgtgagcct ccactaggga gagggctcct gcagagatca 181 gataaattga tcacaatggc tggggtggtg gcaatgtgct aatgctctct ttcttccact 241 caagatatcc tctgtctccc tcagcctgtg agctttttct ccagtgtgct ctgccagtgg 301 gggccctgcc tgagagcccc tgcagctgca gaggacagtt tctttctgct gaaccatcgc 361 agctatgccc cagcccctac cctggagggg tccccagggg ccatgggcag cacctcctgt 421 atagggctgt ctgggagcca ctccagggcc acagaaatct tgtctctgac tcagggtatt 481 ttgttttctg ttttgtgtaa atgctcttct gactaatgca aaccatgtgt ccatagaacc 541 agaagatttt tccaggggaa aaggtaagga ggtggtgaga gtgtcctggg tctgcccttc 601 cagggcttgc cctgggttaa gagccaggca ggaagctctc aagagcattg ctcaagagta 661 gagggggcct gggaggccca gggaggggat gggaggggaa cacccaggct gcccccaacc 721 agatgccctc caccctcctc aacctccctc ccacggcctg gagaggtggg accaggtatg 781 gaggcttgag agcccctggt tggaggaagc cacaagtcca ggaacatggg agtctgggca 841 gggggcaaag gaggcaggaa caggccatca gccaggacag gtggtaaggc aggcaggagt 901 gttcctgctg ggaaaaggtg ggatcaagca cctggagggc tcttcagagc aaagacaaac 961 actgaggtcg ctgccactcc tacagagccc ccacgccccg cccagctata aggggccatg 1021 ccccaagcag ggtacccagg ctgcagaggt gccatggctg agtcacacct gctgcagtgg 1081 ctgctgctgc tgctgcccac gctctgtggc ccaggcactg gtgagtctcc cccagcctcc 1141 cctctcctag gcagctccac cactcactga gcactgcttt gtgctaggca ttaacccaag 1201 tctgtcctca ttttaaagac aaggcagctg gggttcagag agggttcaga gcttatccaa 1261 ggtcacacag ctggcgggtc caggagcagg tggaacccag agctgtctga cgtccacatg 1321 tttaatggcc tcacactccc agcaaaactg ggtctagagg gtgggtgaaa tcatgatgcc 1381 aggtgtgtag 
cctggatcct gattaaggtt gctctggccc caaaccacag ctgcctggac 1441 cacctcatcc ttggcctgtg cccagggccc tgagttctgg tgccaaagcc tggagcaagc 1501 attgcagtgc agagccctag ggcattgcct acaggaagtc tggggacatg tgggagccgt 1561 gagtaccacc aaggatgcat ggcaactggg ggtctgaaat gaagggtgct gggtgggctc 1621 tggatgggca ggaggagagt ggagccccca taggggatgg atgagatgaa atgggatgag 1681 atgaaatgag ataggataaa atggaatggg atggatgcga tgggatacga tgacatagaa 1741 tagatggagt cggatgaatg ggatgggatg ggatggatgg gaggggaagg gataggatag 1801 gatgacatag aataaagatg gatgggatgg gatgggatgg gatgggatga cacagaataa 1861 agatggatgg attgggatgg atgaatagaa gagatggatg ggataaattg atatggatga 1921 gatgggacaa gttgggctgg tgggcagctg catgtgcctt ggagtgctct gttggcctct 1981 tcctaagaga acctccccat tggagctggg agcctccccc actcatgtgt cctccacctt 2041 ggggcccctc cctccccagg atgacctatg ccaagagtgt gaggacatcg tccacatcct 2101 taacaagatg gccaaggagg ccattttcca ggtaatgatg cccagatcct ggatgaaggt 2161 tggggcccaa gagatgaggg acagagcagg gaagagctga gccccctaaa ggggccattt 2221 ccaggctgag gaggaggcct gggtgcctgg gaagtcccag ctcctcctgg ctgggagcag 2281 gtcatggccc tgagctcaat agcacagcca gagatggtct tccctgaggg gaagggcccc 2341 tacatgtgcc caactactta actccttggc actcgtgaac tccagcaccc tgggggatta 2401 ggggtcagtc tgccctggtg gggccttgtg tccagggact tgggcggggt agacctcaga 2461 gaggcccagc tgacggcccc ctctggcctc ccaggacacg atgaggaagt tcctggagca 2521 ggagtgcaac gtcctcccct tgaagctgct catgccccag tgcaaccaag tgcttgacga 2581 ctacttcccc ctggtcatcg actacttcca gaaccagatt gtgagggctg caagctcacc 2641 tcctgcctgc ctccccacgc aggcccctgt gcccacccat gggggagcca cacacacagc 2701 accccagcca gccagacaca cacacacaca cacacacaca cagcacccaa gccggccaga 2761 cacaaacaca cagcacccca gccagccgga cacacacaca cacacacaca cacaacaccc 2821 cagctggccg gacacacaca cacacagtac cccagctggc cggacacaca cacacacagc 2881 accctatcca gacacataca cacacacagt accccagcca gctggaaaca cacacacaca 2941 cagcactcca tccagacaca tacccacaca gtaccccagc cagccagaca cacacacaca 3001 cacacacaca cacacacaca cagagcacac acacagcacc ccagctggcc acacacacac 3061 acacacacac cctgtccaca 
aagggcctag gaaactacgt gcccttcagc catgcacccg 3121 accatgggcc cccaggttca ggtgcacacg gtgggcctgt acgctcacac acccttacac 3181 cctcactctc acacacatgc ttacacactt attcattctc acatatatgc tcatgctcat 3241 tcacacacaa tcccgggcca cctgccctaa agtccccaca cagccctatc tttgcctttt 3301 gtccccccac atagagttct aaaccacagc acccccacta ggcctgcttc ctcccattcc 3361 agtggtccct gagcccttgg gccggcctga ataggggtgg gcttccctcc cagaccctaa 3421 cactcccacc ctgtgctgtg ccccaggact caaacggcat ctgtatgcac ctgggcctgt 3481 gcaaatcccg gcagccagag ccagagcagg agccagggat gtcagacccc ctgcccaaac 3541 ctctgcggga ccctctgcca gaccctctgc tggacaagct cgtcctccct gtgctgcccg 3601 gggccctcca ggcgaggcct gggcctcaca cacaggtgag ggaggccccc acagccagta 3661 aagtggagat ccagagggct agagccacct ccgaagccca tgggcactgg gccctgggag 3721 aggcagagcc gggaaggtga taggaagctc caggcagggc ctaagggagg agggagagaa 3781 agggaggaag agagagggga ggagagcctg gaggactctt ctcccagcac ccagcctggc 3841 ctccacctga ttctttcccc aggatctctc cgagcagcaa ttccccattc ctctccccta 3901 ttgctggctc tgcagggctc tgatcaagcg gatccaagcc atgattccca aggtgaggca 3961 tccagggcct caagagccca ggagcacacg catacctgta gctccctgca gctcccacct 4021 ctctcccaac tcacaccccc gtcagaccca gctggctgcc agaagttagg aggggagaga 4081 gccgcttgtg cattgccccc acccagggac cctgggctca ggctcaggcc tggtaggtgc 4141 caggtacagt tcatgcaaca aacattaagc ccccactgta tggaggtgcc agccaggagc 4201 caaagtacaa aaacggacaa gacgcagctt tgtcctccag cagctcacca tctgatggag 4261 aaagatcccc agaggtctct gtagaaaggt tgctttgatc tttcaagagg ggaatttcca 4321 cagatagatt ccccatcctt gcctgagtcc aacttggagt cttccagacc tgcagtggct 4381 attgtccaat ggccccgcca gcccagggct accttgccca aattggggcc caaatgagga 4441 aaggccctgc cccctcagcc tttcccagat tgggttgcgt gggccaccag gggcacaagg 4501 cagcaggtga ggttcctgct gaggcaggtg gttcacttga gcccaggagt tcaagaccag 4561 cttgggcaac atggcgaaac cccgtctcta ctaagaatac aaaaattagc cagatgtgac 4621 aggtgcctgt agtcccagct actcgggagg ctgaggcagg agaatcactt gaacccagga 4681 ggcggaggtt gcagtgagcc gacatcacgc cactgtactc tagcctgggt gacagagcaa 4741 gactctgtct caaaaaaaaa gaaagaagga 
aagatcactg cagagattgc agtgagaggt 4801 gatgggacag ggacggagct gagggctggc ctggggatgc atttgggagg tgggcccact 4861 gctatgggca tggatgggcc tggagcgtga ggaccaggga ggactccaaa gtgactttta 4921 cacactggcc agagcaacca gccctctgta atgccagcag ctgagatggg gagactaaag 4981 aagaaaacag gtttgagcaa aaaaacagag agctccctcc tggccatgtt gagttcaaga 5041 tgcctgtgtg aagtgcagga gaggagagtc aggcaagcag ctgaatccca agcattgggg 5101 gaaggtcagg tccaccatgt cagtctgaga gtcactagct gtgggccaga gcctttgggg 5161 ccagacgtag gtctgaagct ggctcctaca ctcagtgacc ctgtgtgagt cccctgcatc 5221 ccctggactc tctgatcccc agtgtcctta tttgtgaata gccttgccct cccttctaga 5281 agagaatgag ggaatgcgta ggaagtgccc agctgggtgc tgggcagaga gtggaggctt 5341 gccaagtgaa ggtcccatgc tggcctctct ccgcccccgc cccagggtgc gctacgtgtg 5401 gcagtggccc aggtgtgccg cgtggtacct ctggtggcgg gcggcatctg ccagtgcctg 5461 gctgagcgct actccgtcat cctgctcgac acgctgctgg gccgcatgct gccccagctg 5521 gtctgccgcc tcgtcctccg gtgctccatg gatgacagcg ctggcccaag tgagcccact 5581 gccccctcct tagcccaatg cccgctctcc tcctccccct accctgccac tgcatgaccc 5641 tctccctctg tggtcccact gcaatgcacc aaggaggaca gaaaccaaac acctctgtag 5701 ggtggccttg cctgctttcc ccctaatgct cacatctcca gggtcgccga caggagaatg 5761 gctgccgcga gactctgagt gccacctctg catgtccgtg accacccagg ccgggaacag 5821 cagcgagcag gccataccac aggcaatgct ccaggcctgt gttggctcct ggctggacag 5881 ggaaaaggta tgggctgggc acatggggac tcatggtcag ggcccgttca aggcagaagg 5941 ctgagcccag gaaaggcttt gcagccagag acacctagga tgggccagaa tggagcacag 6001 acaggcagac aggatgtggg gcagacaatg gtgggactgt aagttagggc agagcctgct 6061 aagggttagg agtcgcctct ggacaaaggg ctgtgggctc cagaggacca gcaggccctc 6121 ttcacgggct gagtgagcac caggcaagcc ttcagaggcc tggttatcta ccaggagatg 6181 agtaatgcta gggccagttc aagccaggaa agggactagc cttctctcca gggtcctgat 6241 ccctttactg cccccacact cctcaaggtg tgactcactc aggacaaacc cattggcaaa 6301 aggagagggc tggacttgaa ggtcctaggg cccttgccaa tactcagtca atgacaggaa 6361 attccctttt tttttttttt tttttttttt ttgagatgga gttttgctct tgttgcccag 6421 gctggagtgc aatggcacaa tcttggctca ctgcaacctc 
tgcctccggg ttcaggcgat 6481 tctcctgcct cagcctcttg agtagctggg attacaggca tgtgctacca ggcccggcta 6541 atttttgtat ttttagtaga gacaaggttt caccatattg gtcaggctgg tctcgaaccc 6601 ctgacctgaa gtgatctgcc cgccttggcc tcccaaagtg ctgggattac aggcataagc 6661 cactgcaccc ggacaggaaa ttcccttctt aaagcgagat cctgtcctga ggaaagccag 6721 ctgatgctct tcccaggagg cagctgtcca cactgtgctc cctgctcagc aactcccaag 6781 cctcccgact gcccatcaca tctggtctca aggaccagat gaacgttaag gttccttcta 6841 gaactgaaat ggaggtggag ggaggggagg gtggtggctg agattccacc cctctgcctg 6901 agtcctccgt ctccagtgtc gcctgctttt ctgatggaag tcctccattt cagcctggct 6961 ccagtttgtt aagggtttca actgcagcca gaggtgttcc gtgagggctg atggaggagt 7021 cgggagggag ccctagagtg atccagagat gtggagaggc ccaggaccac acgacaggag 7081 agtcctgcaa agggacctcc acagctgtgt gtctccctca gtgcaagcaa tttgtggagc 7141 agcacacgcc ccagctgctg accctggtgc ccaggggctg ggatgcccac accacctgcc 7201 aggtacaccc aacccctccc aagttggtcc taggacttcc cttggctccc agagccccca 7261 ccctttgggc ccgtgatcct cagaggcctc actcccctgg gtccaaggtg gtcccaggtg 7321 cacgggccag ggactgggag gcacccctct ctgtttcagt gtaaaaaatc atgagagcat 7381 ggaaaagggg gatgggaagg gagggatggc ctgaggagtg cggctggatg tccattatag 7441 gatggggctg tgttccctgg ccagtgtgtg ctggtggggt gggggtacaa agtgggtgtt 7501 ctggagtgaa catctcacct cctcaggctc taaaccctaa ggcctgtggc tcagggagtg 7561 gccgaggggt ctacagagtc acactggtag cacccactag gcgggaggtg gagtgagtgc 7621 tgttctttcc cggaagagct gggtgtgggg agctgagggg gcccaggcct cagccctggt 7681 gctgtccctg tgacaggccc tcggggtgtg tgggaccatg tccagccctc tccagtgtat 7741 ccacagcccc gacctttgat gagaactcag ctgtccaggt gagtccaggc ccccagttgc 7801 ggggaggtaa gggggcaggt cctgaccatc agggcatggg aggcccttct gctccccaag 7861 caggaagagg cggccactcc tgccggctgc tccatcctcc ctctcaccgc acagctggag 7921 gctcctgagg gcttctggct ggccatcagg aaaacaccct ttccggaccc cgagcactgc 7981 cccgcccaga accccagtca ctgagtgccc aacccccagc ttccccccca accccccgcc 8041 ctgccctgtc ccaggcctcc ctctcagagc ttgccccagg gactctctgg ccctcagggt 8101 tcaatgtatt ctgaccaagg ccaagctttc ctggggctca gggaaaatca 
cactttgcta 8161 cccgaagctg tatcccctca gatgccagga aggccgtgat catctgactc caccctcctg 8221 agacacattc tctccctgac tgtcctgttc taagtcagcg gagcacctta ggatggaggg 8281 gtggaggcga ggccagatgc agcctctgtg aacaggtgcc tggaggctgg gaaatgaccc 8341 tgagagggca ggacacagca accgtgggct taaggtgacc ttgagagcaa gcttggccca 8401 ctttacaatt ctgttcagag ccagccccta acatggtggt catttattca tttgttccct 8461 cattttaaaa aatgtaaggc caggcatggt ggctcacgcc ggtaatccca gcactttggg 8521 aggccgaggc aggcagatca cctgaggtca ggagttcgag actagcctgg ccaacatggc 8581 gaaaccctgt ctctactaaa aatatttttt aaaaattagc tgagcatggt ggcaggtgcc 8641 tgtaatccca gctactcagg acgcttaggc aggagaatca cttgaacctg ggaggcgaag 8701 gttgcggtgt gctgagatcg tgccactgca ctctagccta ggcaacagag cacaactctg 8761 tctcaggaaa aaaaaaaaaa aaaaaaaagg tatttctttg ctgggcgcag tggctcacac 8821 ctgtaatccc agcactttgg gagaccgagg cgagtggatc acttgaggtc aggagttcaa 8881 gaccagcctt accaacatga tgaaaccccg tatctactaa aaaaaaaaaa aaaaaaaaaa 8941 aaaaaattag ccagatgtgg tggcacacac ctgtaatccc agctacttgg gaggctgagg 9001 aggagaattg cttgaacctg ggaggcggag attgcagcga gccaagattg cgcctctgca 9061 ctccagcctg ggtgacagag tgagactccg tctcaaaaaa aaaaaaaaaa aagtagtggg 9121 tgcctgtggc caggccacat cctagggtag gggctatggc tgagccctgc cctcctggag 9181 ctcacagcca agtccacttc ttccatctga ggcggggaag ccagccctgt tcctgaaacc 9241 ctgcatcaca agcccctgtg ggaggcagtg gggaggggag gtcctccccc actcagacct 9301 gacccacagg gaccagttta atgtgtcctt gccccagtga tgacagctgg ggatctgggg 9361 gtggggagtc acccaggacc cgggcagtcg cctttcccca gctcctaggg ctcccggcct 9421 tccctgctga aacagcaaga ccagtgggtt ggcgtgggag gcctgggctt caaaccacct 9481 ctgctatcac ctggctgtgg gtccccaggc aggacataca cacagtccct ctctggccct 9541 catcctcctc agctgcaaag gaaaagccaa gtgagacggg ctctgggacc atggtgacca 9601 ggctcttccc ctgctccctg gccctcgcca gctgccaggc tgaaaagaag cctcagctcc 9661 cacaccgccc tcctcaccgc ccttcctcgg gagtcacttc cactggtgga ccacgggccc 9721 ccagccctgt gtcggccttg tctgtctcag ctcaaccaca gtctgacacc agagcccact 9781 tccatcctct ctggtgtgag gcacagcgag ggcagcatct ggaggagctc tgcagcctcc 
9841 acacctacca cgacctccca gggctgggct caggaaaaac cagccactgc tttacaggac 9901 agggggttga agctgagccc cgcctcacac ccacccccat gcactcaaag attggatttt 9961 acagctactt gcaattcaaa attcagaaga ataaaaaatg ggaacataca gaactctaaa 10021 agatagacat cagaaattgt taagttaagc tttttcaaaa aatcagcaat tccccagcgt 10081 agtcaagggt ggacactgca cgctctggca tgatgggatg gcgaccgggc aagctttctt 10141 cctcgagatg ctctgctgct tgagagctat tgctttgtta agatataaaa aggggtttct 10201 ttttgtcttt ctgtaaggtg gacttccagc ttttgattga aagtcctagg gtgattctat 10261 ttctgctgtg atttatctgc tgaaagctca gctggggttg tgcaagctag ggacccattc 10321 ctgtgtaata caatgtctgc accagtgcta ataaagtcct attctctttt atgagaaaga 10381 aaaagacacc agtcctttaa agtgctgcag tatggccaga cgtggtggct cacacctgca 10441 atcccagcac cttaggaggc cgaggcagga ggatcc // LOCUS HUMSPERSYN 7623 bp DNA PRI 13-JAN-1995 DEFINITION Human spermidine synthase gene, complete cds. ACCESSION M64231 NID g338393 KEYWORDS spermidine synthase. SOURCE Homo sapiens blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Myohanen,S., Kauppinen,L., Wahlfors,J., Alhonen,L. and Janne,J. TITLE Human spermidine synthase gene: structure and chromosomal localization JOURNAL DNA Cell Biol. 
10 (6), 467-474 (1991) MEDLINE 91299162 FEATURES Location/Qualifiers source 1..7623 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Sultan 20D" /cell_type="IgG myeloma" /tissue_type="blood" /map="1p36-p22" CAAT_signal 1212..1216 /gene="SRM" /note="box 1; G00-127-983" CAAT_signal 1242..1246 /gene="SRM" /note="box 2; G00-127-983" exon 1315..1563 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=1 /product="spermidine synthase" mRNA join(1315..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..7132) /gene="SRM" /note="G00-127-983" /product="spermidine synthase" gene join(1315..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..7132) /gene="SRM" CDS join(1397..1563,1995..2115,2448..2540,4584..4737, 5247..5330,5415..5560,6255..6377,6454..6474) /gene="SRM" /EC_number="2.5.1.16" /codon_start=1 /db_xref="GDB:G00-127-983" /product="spermidine synthase" /db_xref="PID:g338394" /translation="MEPGPDGPAASGPAAIREGWFRETCSLWPGQALSLQVEQLLHHR RSRYQDILVFRSKTYGNVLVLDGVIQCTERDEFSYQEMIANLPLCSHPNPRKVLIIGG GDGGVLREVVKHPSVESVVQCEIDEDVIQVSKKFLPGMAIGYSSSKLTLHVGDGFEFM KQNQDAFDVIITDSSDPMGPAESLFKESYYQLMKTALKEDGVLCCQGECQWLHLDLIK EMRQFCQSLFPVVAYAYCTIPTYPSGQIGFMLCSKNPSTNFQEPVQPLTQQQVAQMQL KYYNSDVHRAAFVLPEFARKALNDVS" exon 1995..2115 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=2 /product="spermidine synthase" exon 2448..2540 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=3 /product="spermidine synthase" exon 4584..4737 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=4 /product="spermidine synthase" exon 5247..5330 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=5 /product="spermidine synthase" exon 5415..5560 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=6 /product="spermidine synthase" exon 6255..6377 /gene="SRM" /EC_number="2.5.1.16" /note="G00-127-983" /number=7 /product="spermidine synthase" exon 6454..7132 /gene="SRM" /EC_number="2.5.1.16" 
/note="G00-127-983" /number=8 /product="spermidine synthase" polyA_signal 7126..7132 /gene="SRM" /note="putative" BASE COUNT 1609 a 2287 c 2259 g 1468 t ORIGIN 1 ctgcaggcgc gcactaccat gcccagctaa tttttgtatt tttagtacag acggggtttc 61 accatgttgg ccaggatcgt cttgatctct tgacctcgtg atccgcccgc ctccgcctcc 121 gcctcccaca gtgctgggat taccggggtg aaccatcacg cccaggcacc tagcaattct 181 ttagcggtct tggtttacct cccctttaga aggagcttaa aagcaagcaa ggcacattgt 241 tgcctaggct tgaggcttgc tctcacccat aaacaacgct gttactctgc tctggggacg 301 acacaggaaa cgttccccac ctccaggtgg aggctgcaaa acgtgtcaaa accatccctg 361 acataatgtc aagagtagct tatactagtt tcatcttcct tccttggcat tcgaactgcg 421 tgtggaacaa ttagcattta atatttggta attagcatgc ttaatgtgat tctgagaagt 481 tctttgacat tctcataaaa acagcacatt cccacccacc cttcaaagag caagacccag 541 tttgtcaaga aaaattgcgt gccagtcttt tctggtgctg aatatgtatg ttctgggcct 601 cttcctggac actgctggta aatttagaaa ctcgtttaga aaagcacttc tctcgtattc 661 aacagcctat aggctcatgg cgcagaatct aagggaaaat ggctaaatcc agcttgttaa 721 ttcgcgggct gtgatgattc tttccaagta aataaaaacc ctcggttcgc cccgacgagc 781 cacataatct gttcaaatcc aacaaggaac cagattttgg acgcaaagaa ggatacgttc 841 tactcgcccc gtgcaacaac gtaaaccact gtagccgccc gctcccgtgt ctccagccca 901 aaggctgact ctccagtccg cacgtcgcag cgctcttgcc ctccacacca agcccgagtc 961 ccgcagcccc tcgaggccct cggtgcctcc caaccccgag aaggaagcgg gggccggtgg 1021 tgcaccgccc cggctgcttg gggcggagga aggacccgga ccccttccgc cggcccagcc 1081 cgccccggaa cccgacccgg ccgcccggcc ccggccgggc cccacgtggc ccctggagcg 1141 ggccgcacta ccctgctgcc gccgacggac ggcgcgccac agccactctg cgccgctctg 1201 cgagccggtg gccaatgagc gccaggcgag gccgctttgc cattggcgag cgcgggctcc 1261 gcccccgccg gcaggccccg ccccgcgccc gggttaggtt gcggcgcggg cggcgggcgg 1321 agctggtccc gttgtgctgc ggcgccgcgc ggcctgcagt cccgggcccg cgccccgcgc 1381 cgcccgcccg cccgccatgg agcccggccc cgacggcccc gccgcctccg gccccgccgc 1441 catccgcgag ggctggttcc gcgagacctg cagcctgtgg cccggccagg ccctgtcgct 1501 gcaggtggag cagctgctcc accaccggcg ctcgcgctac caggacatcc tcgtcttccg 1561 
caggtaccgc cgctgcccgc aggcgcctgc cccctaggct cagcccgggc cgcctgctgc 1621 ccgcctcacg ggcctctcca cgccgggacc caagcgggct ggacctcgtc ctgccctggc 1681 cccctcgcca cccctcacac cgcctccctg ggctggggct gggactgcgg gctggcctct 1741 tgggtggggg agtcggagtc tgcgccccgc tccacgtgtc agccctcagg acacgtcaga 1801 gcccgaggag accccgggtc ccaccccggc ctccacccgg cggcccgcct gccgttcctc 1861 gccacgtgtc accatcgctc ctcatccctg ggacccctag gcgggatggg gagaccctcc 1921 tcacccaggg cggcttgggg tacgttttcc ccaccccaga gaacccaggt ccccgactgt 1981 cactccgccc gcagtaagac ctatggcaac gtgctggtgt tggacggtgt catccagtgc 2041 acggagagag acgagttctc ctaccaggag atgatcgcca acctgcctct ctgcagccac 2101 cccaacccgc gaaaggtacc ccagtgtccc ctggaacagt gccggacgag gggcggcccc 2161 aggtgtgctc cgggctcttc ccagatgctg cctgcatggt tgtcagagaa agtgctagca 2221 aggccagggg cgtcccgcgg aggggtgggg gccgacactg acgcggcctc ggaatcctag 2281 ggcagccctg gaaggaactt ccaggaaagg ggacaccggc acgaaagcgt ttccgagggt 2341 agaaaaagat gaggcccgtg ggtccgaggg gtcagggggt ctgcttcagg ggcctggggg 2401 ctcccagtcc tgccagggcc cctgccttga ctgccccctc ctcccaggtg ctgatcatcg 2461 ggggcggaga tggaggtgtc ctgcgggagg tggtgaagca cccctccgtg gagtccgtgg 2521 tccagtgtga gatcgacgag gtgagtgccg gcgtagagcc aggtttgagt cctggttctc 2581 ccagcggcca gctgtgccct gaaatggctg cacacccccg agcaaggcag gtagggcctg 2641 tttctccatc tggaaaacac ctggtcgggg agggttcagt aggaaaacca gatggcagag 2701 ggcctggcag gtggtgaggg cacctgcgtg gcgagctctt actaaaactg agctgatttt 2761 tttttttttt tttttttgag acagagtttc gctcttgttg cccaggctgg agtgtgatgg 2821 tgcgatctcg gctcactgca acctccacct cctgggctca agtagttctt ctgcctcagc 2881 ctccggagta gctgggatta cagacatgcg ctaccatgcc cggctaattt tgtattttta 2941 gtagagacag ggtttctcca tgttggtcag gctggtctcg gacctggcga ccacaggtga 3001 tccgccagcc tcgtcctccc aaagtactgg gattacaggc gtgagccacc acgcccagcc 3061 gactaagctg atttttaatc tgagccccag gcagggcccc aagacagctc aactatttgt 3121 acgttacccc ttacactcag tagctgctca ctaaaatcat gctacgtgcc aggtgttgcc 3181 cgggtatggg gacagtggta gacgacagat cagtccctgc cctctaggag ctgatgtcgt 3241 agttaaagga 
gacatcagat ggccagacgt ggtggctcac acctgtaatc ccagcacttt 3301 gggacgccaa ggcgggcaga tcacctgagg tcaggagttc aagagcagcc tggccaacct 3361 ggtgaaaccc catttctact aaaaatacaa aaattagccg ggcatggtag tgcatgtctg 3421 taaacccagc tactagggag gctgaggtgg aagaattact tgagcccggg aggcggaagt 3481 tgcaatgaac cgagatctcg ccactgcact ccagcctggg tgacagagga agaatctgtc 3541 tcaaaaaaaa caaacaacaa aaatagagac atcaaaggat ggtctgatga aggcaagaca 3601 ggggctgggg gacaggagaa ggcagggttc ctgtgaatgc atggggggtg gtcagggcag 3661 gcctccagga ggtggcgttt gagctgagac ctcagtgaaa agcaggtggc cgtgtgcagg 3721 gagggggagg ttctcctggc cagaggttgg aattgcatcc ttctaaaata ggaaacaggc 3781 caagcgctgg tggctcacac ctgtaatctc agcactctgg gaggctgagg cgggcagatc 3841 acaaggtctc tactaaaaat acaaaaaaaa aaaaaaatgg cccagcttgg tggcgtgtgc 3901 ctgtaatccc aactactcgg gaggctgagg caggagggtg cagtgagctg agatcgtgcc 3961 actgcactcc agcctgggca gcagagcaag actgtctcga aaaataaata aataaaatag 4021 gaagcgacaa gaaagccact cagatggggc gatttggtct ggaaggaggg ggtagggatg 4081 ggagagagga gcccaagccg ccggcaggag ccagctctca aggagaaatg gaggtaccag 4141 agttctcccc gctttacaca ttataaactg aggttcccaa aaggggccag gtgtggtggc 4201 tcacagctgt aatcctagca cttttggagg ccgaggtggg aggatcgagg agttcgaggc 4261 aacataggga gactctatct ctaccaaaaa tttaaaaagt agccaagtat gattgaacac 4321 acttgtccca gctactcagg aggctgatgg gggaggatca cctgagcccc ggaggccgag 4381 ggtgtagtga gccatgatcg atgccactgc actccagcct gggccacaaa gtgagaccct 4441 gtctcaaaaa aaaataataa aaaaaaggga aggggttggc caaggcggct tgcctgtgag 4501 gcacttggag agtcccacgt ggctgtgctg gctccaggtc ccccagcccc ttggcccaga 4561 ctggtccctc ccatcccctc caggatgtca tccaagtctc caagaagttc ctgccaggca 4621 tggccattgg ctactctagc tcgaagctga ccctacatgt gggtgacggt tttgagttca 4681 tgaaacagaa tcaggatgcc ttcgacgtga tcatcactga ctcctcagac cccatgggta 4741 agcagtggat gggccccagg gttttctggc agctgcaggt ctggaggtca gcctccccca 4801 ggccttcaga gtaaaggata gagcggcctc ccaccccccg aactagagct gtacttttcc 4861 cttctcattt gttacctgcc ctctgaaaca tggctcagga cagtaggcag gagccaggcg 4921 actgcccaga ttcacaagct 
ggtgaccaag gagagtgggg atctggcatt gggacactga 4981 ggaccctgtg tcctcttcag cctcccctct gctctgaagt ggtcagcact ggagtggggg 5041 caggttctag tcttgaacga aggcctaggt tagaggttcc tctgctgtgg tgccaatgag 5101 actcccccaa gaatgggatt caggtgtgga tcccccacag acctgggttc agatcctggc 5161 tctggccacc tggtagctgt gtgggtgaca gtggccctgg aggtcacagc cagactgttc 5221 agtgtgtctc cctctgtctt ccccaggccc cgccgaaagt ctcttcaagg agtcctatta 5281 ccagctcatg aagacagccc tcaaggaaga tggtgtcctc tgctgccagg gtgagccaca 5341 ggcctggagc actggggcgg ggcggggtgg ggcagggcag gccctgccgg atgctgatgc 5401 ttaggggccc ccaggcgagt gccagtggct gcacctggac ctcatcaagg agatgcggca 5461 gttctgccag tccctgttcc ccgtggtggc ctatgcctac tgcaccatcc ccacctaccc 5521 cagcggccag atcggcttca tgctgtgcag caagaacccg gtgagatggg ggtgtctggg 5581 ggtgggggtt ggggggaagg tgggcataaa tagagatccc tgcccctgcc gggcgcggtg 5641 gctcacacct gtaacccagc actttgggag gctgaggcgg gcagatcaca aggtcaggag 5701 atcgagacca tcttggctaa cacggtgaaa ccccgtctct actaaaaata caaaaaaatt 5761 agccaggcat ggcagcgcgc gcctgtagtc ccagctgctg gggaggctga ggcaggagaa 5821 tggcgtgaac ccgggaggcg gagcttgcag tgagccgaga ttgcgccact acattccagc 5881 ctgggtaaca gaggaagatt ccgactcaaa aaaaaaaaaa aaaccctccc caggccaggt 5941 gcggtgtctc atgcctgtaa ttccagcatt ttgggagacc aaggtgggcg gatcacttga 6001 ggtcaggagt acaagaccag cctgaccaac atggagaaac cccatcacta ctcaaaatac 6061 aaaaaaaaaa aaaattagcc gggcgtggtg gcgcgtacct gaggctgagg caggagaatc 6121 acttgaacct gggagacaga ggttgcagtg agctgagatg acgccactgc actccagcgt 6181 ggcaacagtg agactccgtc tcaaaaaaaa aaaaaaaagt gccccccctg atgtgcccct 6241 ggcccggtcc ccagagcacg aacttccagg agccggtgca gccgctgaca cagcagcagg 6301 tggcgcagat gcagctgaag tactacaact ccgacgtgca ccgcgccgcc tttgtgctgc 6361 ccgagtttgc ccgcaaggtg ggtggcctgc ggggctgggt ggtgggaccc agggacccag 6421 agcgccctcc tgactggcct catgtccctc caggcactga atgatgtgag ctgagcccag 6481 gcgccaccac tgatgccacc caggacctcg gaccttggag cctgcggggt gcctcggccc 6541 ctccagcccc gggccggacc tcctgctggc tctcgcccac caaccaagtg ttacaagccc 6601 cagaatgctg cccggcctgc cctgctgggc 
ggactgtctg tgtgtctgtc tctctggcgt 6661 tccacctcca agcctatacc agctgtgtac agcgccatct ctctgccttc tgttgcccct 6721 cactcaccaa acacgtgtat ttatagcaaa gattggagtc ctgtgtctcc tgaccttggc 6781 tgggcccagg cagggccaca ttcaccattg ggtgcctctg gggtgagggt ctgcagaggc 6841 cttgctggct gacccccaag tgtctgctgc agggctgagg ctgcaggcgg gccatcgtgg 6901 atagcctggg gcacagaggg tcaccgcagt cgtcacgtgg gacccagagc tgtcctggga 6961 agctgactta gctgtccttt taccaagccc ttcacaaggc cactggtgac agccccccag 7021 ggcagtgggg tgggtgagat cagggtgggg ctgcccggga gcattctcag aaaaattggg 7081 gacactcaca ggtgtaagtc aggtcccatc caggtactcc agggcaaata caggaagggg 7141 tggcggggct ggttaccttc ggccttttta agcacatcag gagcttaaca ctggcccagt 7201 gactgtgccc tgactccacc cggcattcag acttgggttc aaattcccac catgccccgc 7261 cccctatgtg gacaaattga gaaagcaagt gtgggcaccc caccagggac tgcgaggacc 7321 agggctgtcc cctctccaag gtgctgaact cccgccttcc aggacccaac ggtggtggga 7381 ggacaggaaa ggaaccctct ttgcatgggc ctgagttgcc aacccctttc cccaccctgg 7441 gcaggggctg ggctagcgga cgcatcaggg agggaggccc cactcccagc cgaggcagcc 7501 accttggagc cctaactcac ccgggtatgt tttctgggac accagtgtaa gggggattca 7561 gtttcgccat caactctggc ttcaggccag tcatagccct ccagtctcca cctgcccccc 7621 act // LOCUS HUMSTATH2 4723 bp DNA PRI 13-JAN-1995 DEFINITION Human salivary statherin gene, exons 2-6. ACCESSION M32639 NID g338504 KEYWORDS statherin. SEGMENT 2 of 2 SOURCE Human (individuals #563, #8136,and J.F.) fibroblast, cell line #563, DNA, clones 1-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4723) AUTHORS Sabatini,L.M., He,Y.Z. and Azen,E.A. TITLE Structure and sequence determination of the gene encoding human salivary statherin JOURNAL Gene 89 (2), 245-251 (1990) MEDLINE 90323623 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.M.Sabatini, 03-JAN-1990. 
FEATURES Location/Qualifiers source 1..4723 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q11-q13" gene join(M31077:1601..1931,1..352) /gene="STATH" prim_transcript <1..4309 /note="STATH mRNA and introns" intron <1..286 /gene="STATH" /note="STATH intron A" CDS join(302..352,1532..1552,1645..1674,2728..2814) /note="statherin precursor" /codon_start=1 /db_xref="PID:g338506" /translation="MKFLVFAFILALMVSMIGADSSEEKFLRRIGRFGYGYGPYQPVP EQPLYPQPYQPQYQQYTF" exon <302..352 /gene="STATH" /note="statherin precursor, (first translated exon); G00-120-391" /number=2 sig_peptide join(302..352,1532..1537) /note="statherin signal peptide" intron 353..1531 /note="STATH intron B" exon 1532..1552 /number=3 mat_peptide join(1538..1552,1645..1674,2728..2811) /note="statherin" intron 1553..1644 /note="STATH intron C" exon 1645..1674 /number=4 intron 1675..2727 /note="STATH intron D" exon 2728..>2814 /note="statherin precursor" /number=5 intron 2848..4051 /note="STATH intron E" polyA_signal 4294..4299 BASE COUNT 1570 a 797 c 722 g 1634 t ORIGIN About 1.8 kb after segment 1. 
1 aagctttact ttcttcttta ccttcatacc cctattctct tggtctgtgt aaggtgttga 61 tttgtgtgtt actctaatgc cttagcattt ccatatcaat acactcactt ttacttatca 121 gtgtctccaa caagtctcca agtttcatga tgtcaggcaa tgggttttaa ttgtctttct 181 atctggagtg tgtctcccag tgctttggag cgtagtataa tcaataaact tttgatgggc 241 gaatcgaaga aaaaatgtga tactgaattt ccacatatgt tttcagagaa cccagccaac 301 tatgaagttc cttgtctttg ccttcatctt ggctctcatg gtttccatga ttgtaagtat 361 atcaggacat ttgaagaata tgttctcagt acctatccca agagtctctc tattccttgt 421 gttctggcat atattggtgt ttacttgaat acagctttat atgtttaaat gtttccatat 481 gaactttttt tttgtactaa cctacataag ctatataagg atttagaata aatagtcatt 541 attcatctac agacactaaa gtatatttag cttagtgtga aataaaagaa agttttactg 601 aacttcttaa atatgtggaa ctgagagctt ctaaaagtat ctttaaacat gtaatgctct 661 agacaatctg aagttagaaa tctcaggaat gaactaggtg cacttgagtg gaattcaagt 721 attagaagat gtcaaagtag aaattttcag atcattcctt gcaaaaacta acttaaaatt 781 agtatacaat tatttagcac acactttcta aaataaagct gcttattttt aactataaaa 841 caaaacttca gatctatgac atgttatgag ttcccagctt ttattgtcat ttactactat 901 gtatcactag gtaaagcact tgacttctct ggagctcttt cttctactct acttccttct 961 atttctcctg tcccttttct atgacattag taggaaactg gccatgaaag aataaaaata 1021 gttcaactta cctggtaaat tctaacagat atttcaaaaa ctaatttgat tataaaatca 1081 cagtatacca cattatttgt tataaaagga taggtgtcta aaaaataaaa aataaaaatg 1141 taaggaaaac agagaaaaac ataatgcttg tctctgtcag taaataaaag ttatgtgtaa 1201 gtagcctttg agctgctgaa aacatattct tatgaaaaaa tacaaagttt cacttttcat 1261 tcaccaatga taaccattta ctccctttta cacacaaaac aagtataaat ccaaataaat 1321 ttatatttga atatagcaca ctaattacag aaaagcatac tcttaactca ttttaaagat 1381 gtcataagca aaaatgaact aatttagtgt ctatcgatga tttgcctaca tcctgtttac 1441 tcacaactgt agctgctgag ttcatttttg tactcaatct ggaattataa cattaagata 1501 ctaactattg tctaattttc tgttttctaa gggagctgat tcatctgaag aggtgagtca 1561 ttttcattca ctggaaaact tgttttcagt tttgtcctcc ctacatttgt attgaattat 1621 taaacttttc ttcatttttc acagaaattt ttgcgtagaa ttggaagatt cggtgtaagt 1681 gttctctgat aatgctgtgt 
agtccaaata aattgtgttc tctgattact tattcttcta 1741 gaagaaccaa tacttatttc tcaaatatgt gtagtagttg caaaagtgtg ctttgttttc 1801 ctctatttat gatacacagg taacagtcac aaatgtattg tgtcctaaca acatttaagt 1861 tgaaactcat agtgccaacc tgaagaaacc agttactccc tgagccctca agcatcatct 1921 cctaagtaat cactttgacc taaaaaatag ttgaaattct aattctgtta attataatat 1981 atgcctcatg gagcataaag gctgataatt acacataaaa tatggtaagt ttgcaccttc 2041 tgattttttt tgacggtgat aaactcatta aattaaacag gtcatgataa atgagatagc 2101 aaatttcatt tatttaacat ttttcttaat gtatgtatct tgatgtccca gtatgttgac 2161 atataatgtc aatgtgatag ggttgcaaaa gatcccctaa ctaatatagt gaatacatgg 2221 attaaaataa caccaaataa aaagatactc taaaagggat gatggttatt tgtgttaaca 2281 gaaggaagaa gtattaagga atgtaggaat cgtaagagac agtaagctca ttaaaagagt 2341 gacttgaaga aacaccacat aagctgaatt tagtaaatgt ctccttccca taatactcaa 2401 aggtgggcaa aatggttgct gggaggtagt attttgaaga aaatttaaaa cctctttgga 2461 gtgatagcct ataatcggtg ttcctgcatg tgaaataaaa ccacaaaact gtgaacatgc 2521 agtacaaagg taaacactca atcattttta cctgctttta aaatgtgttt ataaccaaaa 2581 agaaaaatct aattgaaagt ttggaaaaaa atcttataag taatatgtaa gttcctaaaa 2641 ctttaacaat ctaagcttct aataaaaacg cttgcactgt cattatctca attttctgca 2701 attttctttc tccttgtgtg tatacagtat gggtatggcc cttatcagcc agttccagaa 2761 caaccactat acccacaacc ataccaacca caataccaac aatatacctt ttaatatcat 2821 cagtaactgc aggacatgat tattgaggta agatgggttt agtgacattt tttttacttt 2881 ctgtatcagt gctgacagtt aaaagaagaa aaacaaaaac aagacattaa accaacagca 2941 gttccagtat aagaaagcag ctgtatagct ccttgaaatg tattaattta tataaatgct 3001 tctgagataa agtctctgtc ctgctatggt tagaaggcag tgttatagag tgggggacac 3061 cagggtttgt atgctggccc tgacaactac tagtttcatg actttgaact actcctaatg 3121 ctcctgaacc tcagttttct cgtcaaaaaa ttgtgtatta tgataacact taaatgtgct 3181 ttttagaatt ggtgacactt ttgaaagaca tatcacataa aatctcttga tcagaaaggt 3241 ttgtagttat aactgcaggg cttccctgcc actaaatctg tacctaggtg ttgcttaaag 3301 ttttcactcc caaaatgcaa tctttggaga agcttccatc tgggaaccat ctgggaactt 3361 gtgagaaatt tgcattttgt accctaagta 
actctactga gcaattacta tagaaattat 3421 taacataatg ctaacaagaa aatatggaaa tgttatcaat ttctttttag aaagtcagtt 3481 ttacctatct ctacacaaca ttaaatgttc taaattctat gaaaaaataa tatttcaaaa 3541 caaaatagag caaactcttg gagataaggg aggattgttc tacaaagtta aaatagtcca 3601 aaggatttca attatttttc agtgcatttg ggagattata ttccaattca attctctatt 3661 gccattagtg atacagggaa gagcctgaac ttgatcccct aggttttagg taatagctga 3721 tgacatttaa tcaagaaaag atgtattgaa atattattta gaggaaatgc atataaatga 3781 aatgtgtagg ctatatttaa gtagaagaat aaaatagtat gatgattgaa aaataattta 3841 aattgtagat gctgaatgat aatttttatt tattcaggaa ccaatttaat aattgtttaa 3901 ttgtctactc tgtgtcaaat attattttgg gccctgcatt ttataaacta taattgtata 3961 taatatctac tctgtaccag gccagaatat aaatgtgttt acaggggtaa aaattttaca 4021 gataatctgt tatttttttt tttctgtgca ggcttgattg gcaaatacga cttctacatc 4081 catattctca tctttcatac catatcacac tactaccact ttttgaagaa tcatcaaaga 4141 gcaatgcaaa tgaaaaacac tataatttac tgtatactct ttgtttcagg atacttgcct 4201 tttcaattgt cacttgatga tataattgca atttaaactg ttaagctgtg ttcagtactg 4261 tttctgaata atagaaatca cttctctaaa agcaataaat ttcaagcaca tttttacatg 4321 tatgctcctt atttttctct tttctattta taggaataat gatttctttc atcatcttct 4381 ttcccagttt gactagcaca gattgaagcc cataatgtca gaggtcctta tttataattt 4441 agtctgtcat attttatatg ttacaaacat ggtactcaaa ctcacttatt attagagcta 4501 tttatgccaa gatttgattt gcccatggtt tcctttgcca caccacagtt gctaagcctc 4561 caaagcactc acatttttta tctcagtccc tgtaggtcag aagataacca ccatgtcact 4621 ctacagcagt atgtaggcac tctgcatgta aaccatctaa tgccctccat tagtttttca 4681 gcaatgactt ttattcaaga aatatttatt gaatgacccc ggc // LOCUS HUMTBGA 8769 bp DNA PRI 24-JAN-1994 DEFINITION Human thyroxine-binding globulin gene, complete cds. ACCESSION L13470 NID g405513 KEYWORDS thyroxine-binding globulin. SOURCE Homo sapiens DNA; and cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 8769) AUTHORS Hayashi,Y., Mori,Y., Janssen,O.E., Sunthornthepvarakul,T., Weiss,R.E., Takeda,K., Weinberg,M., Seo,H., Bell,G.I. and Refetoff,S. TITLE Human thyroxine-binding globulin gene: complete sequence and transcriptional regulation JOURNAL Mol. Endocrinol. 7 (8), 1049-1060 (1993) MEDLINE 94049804 COMMENT All the exons were obtained from liver-derived mRNA. All the introns were obtained from white blood cells derived DNA. FEATURES Location/Qualifiers source 1..8769 /organism="Homo sapiens" /note="All the intron sequences were obtained from white blood cell derived DNA" /db_xref="taxon:9606" /cell_type="white blood cell" CAAT_signal 2474..2479 protein_bind 2575..2587 /bound_moiety="HNF-I/LFBI/HP-I" CAAT_signal 2607..2610 TATA_signal 2622..2626 exon 2652..2680 /number=1 source 2652..8174 /organism="Homo sapiens" /note="All the exons were obtained from liver-derived mRNA" /tissue_lib="liver" intron 2681..4301 /number=1 exon 4302..4940 /number=2 sig_peptide 4319..4378 CDS join(4319..4940,5991..6264,6996..7143,7676..7879) /codon_start=1 /product="thyroxine-binding globulin" /db_xref="PID:g405514" /translation="MSPFLYLVLLVLGLHATIHCASPEGKVTACHSSQPNATLYKMSS INADFAFNLYRRFTVETPDKNIFFSPVSISAALVMLSFGACCSTQTEIVETLGFNLTD TPMVEIQHGFQHLICSLNFPKKELELQIGNALFIGKHLKPLAKFLNDVKTLYETEVFS TDFSNISAAKQEINSHVEMQTKGKVVGLIQDLKPNTIMVLVNYIHFKAQWANPFDPSK TEDSSSFLIDKTTTVQVPMMHQMEQYYHLVDMELNCTVLQMDYSKNALALFVLPKEGQ MESVEAAMSSKTLKKWNRLLQKGWVDLFVPKFSISATYDLGATLLKMGIQHAYSENAD FSGLTEDNGLKLSNAAHKAVLHIGEKGTEAAAVPEVELSDQPENTFLHPIIQIDRSFM LLILERSTRSILFLGKVVNPTEA" mat_peptide join(4379..4940,5991..6264,6996..7143,7676..7876) /product="thyroxine-binding globulin" intron 4941..5990 /number=2 exon 5991..6264 /number=3 intron 6265..6995 /number=3 exon 6996..7143 /number=4 intron 7144..7675 /number=4 exon 7676..(8174.8178) /number=5 polyA_signal 7927..7932 polyA_signal 8154..8159 BASE COUNT 2521 a 1822 c 1827 g 2599 t ORIGIN 1 aagctttaaa aaagaagcct aaattcttgc tcagaagcct aatttcttgg 
ctcaagtgct 61 ccaggaatat atggaataca gggaacatgt aaatataatt tctggagagc taagttagcc 121 aaatcctatg cttgccattt gatcctgtgt attaccaaat tacagaattt gttgatttct 181 caacattgat gtcggttttg tcgaaattag tcactgaaat gcaaagagta aaaattcaag 241 aataggacag ttgagagagt taaaagaacc tcaactccaa atccccactg agcctctctt 301 gccagcaaaa gcagtcttct ctttgtccaa tgcagctgtt tctcctttgc ttacacaccc 361 ggtaatttcc ttgcccgtga gacttaaatt gcaaaaagaa gccaattttc ctaatgccac 421 catctcacaa ctattgttgt ctccagacct gtaagcagag taagatccca tcatttctct 481 ccagactagg gtggaggatc aaatatcaaa acaatgagaa aagtgtttac aaactgaaag 541 aactgtggaa cgttgaacat ttctaaaata aaatttgtgt gggaataatt atgagggtgt 601 tcattcaaaa aaggagatat ataattttag atctgattga atttattgat atggtaaaac 661 ttcctagaga gtctttattc aatgtgctac tctagcagct ttacacaaga ctgaaattat 721 tgtgataaat ataagtttgc aagatcttac accttctcaa tcaattgcat ctgaaggaga 781 atgtattagc agaaagcaga aatgaacagc agtagggggt agagataaag cccagggaat 841 atggagactt gaggtgaggg aatatatgat aacagaagaa agggaagaga attatctgtt 901 tctggatccc cactttgtac ggggcactat tataggagct tctggacacc acctctaaat 961 ttcattttat actctcaata accacatgac atgggaatca ttattatttc cactttagag 1021 gtgagaaaac tgaggttcaa ggatactagg taacttgtcc caaattacac agttacaaca 1081 tagagacatg gcaggggcac aggtctaatg cctgtgatgc cagagcacag ggcaagggag 1141 tctgcagagg gaagggtacc tttctcttat gtcggagata agccctgcta gcaaagctca 1201 gctgaatcag aagcagcaga aaacaacctg tgccattgtc tgactatgcc accagagggc 1261 acaggtttgg aagttttctt tttacacagt cttttgctgt tttctctgca gggacttgct 1321 gtagctgaga gagcaatgtg ttatattaca tggactaacc tcgcagtcta gagacttgag 1381 cttgtcactc cctttcttta ggacacatgt actagtgacc atcacatttg tgaggtgtgc 1441 cttcttcaat atgggcagcg ggtgtcaggt agttagaaag acctcttggg ttcaggactg 1501 atatttacca accagctctg aaacctcagt gctgactgtg gttctttcac acagtgggag 1561 cttagtctat gttttgttaa gaagaaataa tggtacacag agtcaccctc tgtctatccc 1621 ccagtggctt cccagtgccc tcaaggtaaa gtcagtacta tttgtcctgg attataagtc 1681 ctaccataga ctgcatctag gcttagttcg aactagctcc tcattccctg ccctcccctc 1741 ccactacaaa 
ttaacacata gattcttaat gtagtactga agctggcagg gtctggaatc 1801 tgatgcttca caaaatgtct gtagtaaaat gcttcacatg cctgtagtaa atactcagaa 1861 gcaggagttg ctgttgctgc tactgctact gctactactg ctactactac tactactaga 1921 gctggaaact ttcagaagca ggttgaactc aagtctcctg actgagtcca gttctttttg 1981 ttcccattgc ttctttgcat ataatcagta cacaattaat actggacagt ggaatcttta 2041 atttcagtct ccttgactcc ttctgcatcc tcttttggag tcctgtggcc aggggacggt 2101 acagggaatt ttaatgacga gaagacaggg ctggtttccc ccttagcaga catatatgtt 2161 tgcagcatca gaagtctagg gctggaagct acctttgaca tcatttcctc tgcgaatgca 2221 tgtataattt ctacagaacc tattagaaag gatcacccag cctctgcttt tgtacaactt 2281 tcccttaaaa aactgccaat tccactgctg tttggcccaa tagtgagaac tttttcctgc 2341 tgcctcttgg tgcttttgcc tatggcccct attctgcctg ctgaagacac tcttgccagc 2401 atggacttaa acccctccag ctctgacaat cctctttctc ttttgtttta catgaagggt 2461 ctggcagcca aagcaatcac tcaaagttca aaccttatca ttttttgctt tgttcctctt 2521 ggccttggtt ttgtacatca gctttgaaaa taccatccca gggttaatgc tggggttaat 2581 ttataactaa gagtgctcta gttttgcaat acaggacatg ctataaaaat ggaaagatgt 2641 tgctttctga gagatgcagg tggattcttg ggcatttgct gtaagtactt tttttctctt 2701 tggaaacttt gaacatctct atcagcaagt cagagtccca aataaccaag ttggcaagtc 2761 agagtcccaa accaaattcc tggctctagg ggtgaaggtc taagtgtggt catctaataa 2821 ttgactttct accagcctca gtgactttta aagaaagagg actctgatga tgatatggag 2881 attggataga gtggtgaaca aagggggttg gtatggagga tataggagat ggtgtgatct 2941 gttaggagct gatatagtag ctcaggccat gaactgagag gcagaggcag caggaatgaa 3001 gagaagggag aagttttgag agttatttaa gagttagtat tcataagaca tcatgcttca 3061 ttagatatgg ggaatgagaa taaaaggagg ctatgattat tcccaggtat ctggcttagg 3121 tgtctctata gacagtgatg tgataaactg agatgtggaa tgaagatgag ggagcaagct 3181 taaggtgtag ggagatgata tgtttacatc tggcaacttt gagattttct tattgagact 3241 gaggtctgtg tggaccatcc tggtggttat atccaaaaga aagctggata aatggctagg 3301 agctcaggag agcaatctga gagagataca gtttctagca attgagtgcg gagggtaaga 3361 tcactcagga agaagatgga gggtggagga aggatccttt ttctgagttc atagacctct 3421 tccttctaag cacacatgtc 
cactttcatt ggatttccac acctgacttc tttactccat 3481 ttctttgcac tctttcagga tcagagaacc ttacacctta tctttctggc tttctttata 3541 ataattaaaa gtaacatgca taggtgtaag aataatagtg ttaagatgtc ttacaactta 3601 gaaatgactg tttagatgtt actaagatta atggtgaaag aaatgtaaaa tttagaaaat 3661 tattctttca ttttctggtt gttcgaaaat acctctatgg cttccatgca ttgcaaaatg 3721 catgaatttc agctcatgac cttgtacgtg gctccataaa ccaccaccca tccattccaa 3781 ctaatcatat attgtacaaa ttactttcat tttgtctttt agccaaataa gatgcgagtg 3841 ttctctcatc tcatagattt tatcttaaga aaatgccaga tgaaaaaaaa aatctaaaat 3901 tctaatctac aatgaataat gtatacagag ggagtactag tttaacctaa gaatatccct 3961 ctctccaaaa ggctgcctct agttcagaaa aaaagtgatt caatttccat aaatttaatt 4021 tactcacagc gttttcataa tgttgctata acatctgaat gacagtccat ggcattattt 4081 ccgtggttgc agcaagccac ttctcctact ccctgtccat attagatgca tctcctgttt 4141 ttcaagggag agaaacccct gctcagcccc tgatgagcac atcatcaaat atttactatt 4201 gttttattaa tcaatgtggg tggttattat gtgtctctgg ctcagggagt gtttcatcca 4261 actaagctcc ctcagtggtt gtctcacttc atctcttata gattaacttc cttccaaaat 4321 gtcaccattc ctgtatctgg ttctcttggt acttgggctt catgctacaa tccactgtgc 4381 atcacctgaa ggcaaagtaa cagcctgcca ttcatcccaa ccaaatgcca ctctctacaa 4441 gatgtcatcc attaatgctg actttgcatt caatctgtac cggaggttca ctgtggagac 4501 cccagataag aacatcttct tttcccctgt gagcatttct gcagctttgg ttatgctttc 4561 ctttggggcc tgctgcagca cccaaactga gattgtggag accttggggt tcaacctcac 4621 agacactcca atggtagaga tccagcatgg cttccagcat ctgatctgtt cactgaattt 4681 tccaaagaag gaactggaat tgcagatagg aaatgccctc ttcattggca agcatctgaa 4741 accactggca aagttcttga atgatgtcaa gaccctctat gagactgaag tcttttctac 4801 cgacttctcc aacatttctg cagccaagca ggagattaac agtcatgtgg agatgcaaac 4861 caaagggaaa gttgtgggtc taattcaaga cctcaagcca aacaccatca tggtcttagt 4921 gaactatatt cactttaaag gtaagacttt cagatgtcaa ttttcagtct agggctcaga 4981 tacttgggtg gagtgctgag gggagctcat ggtctcttat tactggcatc agtaggtgtc 5041 cctcatacca cagtgatctg ctccactgga catagctgtg aatagtattg gatcttccaa 5101 gatggacaca cagctctgtg aatgagtcca 
ggaaatttct agggagagcg ccctttcaga 5161 gaggtgggtg acaatggtat gaacatcttg ctcataattg gctaaagatt catggtttcc 5221 aaaagatttg gtgaggtggg acaagcttga cttgtcaagt agacatattt gtatgtaaag 5281 cctactgtga ttctcacttg ctgtacatac ttaggcatgt tacatacata attttaccat 5341 gtattatgca ctcactgtgt actaaacacc atctgacatg tgtggtatac acacactgtc 5401 tcaatcaata ctcataagtg tccaagttta cacagttaat aagttttaga actgatatta 5461 ccaacaaggt ctgtgtgagt ctaaagctgg tgcttgtaga aacaagaatc atgtgactat 5521 ttcctaaagg tactgtgctc aaatgatttc tcctggttct atagtatatg tcagccacat 5581 ttcttggagg agtaatcaat tgtcggccac aaaccacaca tagatagatt ggctaaaatt 5641 gcagaacaga tctgttttct cacttctccc tagtgtgttg gatgtcgtta taccttttta 5701 tggaagagga aactatggct ctgatgggct tagtgtggta tagtagtttt gaatacagac 5761 tgtggagccc aacagccctg ggtatgattc ctggctctgc cacctgttaa ctgggagacc 5821 atgacaaatg actttctctg cctgatcctc agtcacttgt aaaactgatt gactgtaggt 5881 acctaactct gtggtgatct taatttctaa atgaaatcac taatggaaaa ggtcttgttg 5941 gtactgatac cggtttaaaa taaacacctc tcttttcttt gtattttaag cccagtgggc 6001 aaatcctttt gatccatcca agacagaaga cagttccagc ttcttaatag acaagaccac 6061 cactgttcaa gtgcccatga tgcaccagat ggaacaatac tatcacctag tggatatgga 6121 attgaactgc acagttctgc aaatggacta cagcaagaat gctctggcac tctttgttct 6181 tcccaaggag ggacagatgg agtcagtgga agctgccatg tcatctaaaa cactgaagaa 6241 gtggaaccgc ttactacaga aggggtaaat gccttagagg atgtgtgggt ggaaggtggg 6301 cataaaagtg caggtgagac agagttcagc agaaacccag aggcagaaga atcaccctga 6361 actacccaac agctatgtat gatcactaga atatgccaag aaaatactgt ctaccaggcc 6421 cttggctggg cattctacat acttcatatc atccaagcac atcaacaaac ctacaaagga 6481 ggcagtataa ttgggcattt ttcaaattag gaagtctggt cttgaaagat aagatggagc 6541 tccccaaagt cagcctatta gtaaattgat ggcttcctgt agaccaacta gagctcaaag 6601 aacaggttgg attttgatgt tgaaaggtga ttgggcagcc actctgtact ggaagtgcaa 6661 ggcagaaaac tgtgaagctc ctgggtgcta cccattaacc ttcatgccac tagctcccca 6721 accttggtcg tcacctcagg gatattttct cttgattata aaggagttac atctccctac 6781 tcccagtgct tcagtccaac attggctgta cctgagtgca 
ttcctttcag aataacttct 6841 gtgagtttgg tgccatgtat tgacttacat agtaatggtt atcaatactc agggaagaag 6901 cagagtccaa tataatacca tcatgagaga gagaaggaga gaatcataag cttgatatgg 6961 tgattgccat gtgttccctt cctcttttcc cacagatggg ttgacttgtt tgttccaaag 7021 ttttccattt ctgccacata tgaccttgga gccacacttt tgaagatggg cattcagcat 7081 gcctattctg aaaatgctga tttttctgga ctcacagagg acaatggtct gaaactttcc 7141 aatgtaagtt gataaactag agttcttaaa tgtatgaact ggaagagcaa ttaggagtca 7201 attgattcaa ctttcttatt ttaccaatta gcaatctgat gcctagagag gcggtgtgac 7261 tcctcctaag ctagaatcac tccagagtca ttcccattac tccaaataag taattgtatc 7321 agtttcttcc atccttgacc tttatgtccc agaattactc tgatagaaca gcttaatgat 7381 cagaagtgac tctggtgaga tgcagttctg aaggaggtgt gaggagggaa tgcttaagag 7441 gtgactcctg gcggcacgag ggccatagtg aatgaggtgt ttcccaatgg ctagccgtag 7501 aatcgccaga gaatgattga ctacatttag cagaggaaac ttcatattct agcaccatca 7561 acatagtata gcaaaggggc ggccccacct ttcctttccc tttcttactt ctagaccttc 7621 ccaaagactg ttcctgtttc ttcagagacc cgcaggattt gttctcccca cacaggctgc 7681 ccataaggct gtgctgcaca ttggtgaaaa gggaactgaa gctgcagctg tccctgaagt 7741 tgaactttcg gatcagcctg aaaacacttt cctacaccct attatccaaa ttgatagatc 7801 tttcatgttg ttgattttgg agagaagcac aaggagtatt ctctttctag ggaaagttgt 7861 gaacccaacg gaagcgtagt tgggaaaaag gccattggct aattgcacgt gtgtattgca 7921 atgggaaata aataaataat atagcctggt gtgattgatg tgagcttgga cttgcattcc 7981 cttatgatgg gatgaagatt gaaccctggc tgaactttgt tggctgtgga agaggccaat 8041 cctatggcag agcattcaga atgtcaatga gtaattcatt attatccaaa gcataggaag 8101 gctctatgtt tgtatatttc tctttgtcag aatacccctc aactcatttg ctctaataaa 8161 tttgactggg ttgaaaaatt ttcccttttc tgtcttctgg ggaatagcta aaaagcaggt 8221 caggtcttcg tatcccagta ctccaataag atcaatggac ataaaacaat gcagatgcat 8281 tgtcttatat gtgaacttct caaccccctt cccactgtcc cattagtagg attctctgag 8341 ttcttgccag atctgacact ggtaggttga ataaaggaac tcccactatg gggatgagtt 8401 acagactgag acaactgaac aagttactga acaagcatgg gacaatacag cagagataaa 8461 cttgcctgtc caaccccacc ttccccccaa gaaaacatac ttccaccttg 
ccaatgtata 8521 agctgggagt ggagaaagaa agtaattcat tcatgtcata tctcatagac cctctttttc 8581 tgtatcccaa tcagactcct ttccatatat agcacccctg tcccacttgt ttaagtagga 8641 atctcctttt tggctctaat ttcgctgaat attggtactt ttgtatcccc agatccttca 8701 acaaccttcc ctctacctgt acatgccaac tcattaccac ctaccaaact agattttatt 8761 tcagacact // LOCUS HUMTDGF1A 7355 bp DNA PRI 14-JAN-1995 DEFINITION Human (clone CR) teratocarcinoma-derived growth factor 1 (TDGF1) gene, complete cds. ACCESSION M96955 M37099 NID g339430 KEYWORDS epidermal growth factor family; teratocarcinoma-derived growth factor 1. SOURCE Homo sapiens blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ciccodicola,A., Dono,R., Obici,S., Simeone,A., Zollo,M. and Persico,M.G. TITLE Molecular characterization of a gene of the 'EGF family' expressed in undifferentiated human NTERA2 teratocarcinoma cells JOURNAL EMBO J. 8 (7), 1987-1991 (1989) MEDLINE 90005403 REFERENCE 2 (bases 1 to 7355) AUTHORS Dono,R., Montuori,N., Rocchi,M., De Ponti-Zilli,L., Ciccodicola,A. and Persico,M.G. TITLE Isolation and characterization of the CRIPTO autosomal gene and its X-linked related sequence JOURNAL Am. J. Hum. Genet. 
49 (3), 555-565 (1991) MEDLINE 91353571 FEATURES Location/Qualifiers source 1..7355 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /map="Unassigned" exon 2244..2514 /gene="TDGF1" /note="G00-125-188" /number=1 gene join(2480..2514,3605..3657,3741..3875,4246..4360, 4459..4568,5643..5761) /gene="TDGF1" CDS join(2480..2514,3605..3657,3741..3875,4246..4360, 4459..4568,5643..5761) /gene="TDGF1" /standard_name="CRIPTO-1" /codon_start=1 /db_xref="GDB:G00-125-188" /product="teratocarcinoma-derived growth factor 1" /db_xref="PID:g339431" /translation="MDCRKMARFSYSVIWIMAISKVFELGLVAGLGHQEFARPSRGYL AFRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFY GRNCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVA SRTPELPPSARTTTFMLVGICLSIQSYY" intron 2515..3604 /gene="TDGF1" /note="G00-125-188" /number=1 exon 3605..3657 /gene="TDGF1" /note="G00-125-188" /number=2 intron 3658..3740 /gene="TDGF1" /note="G00-125-188" /number=2 exon 3741..3875 /gene="TDGF1" /note="G00-125-188" /number=3 intron 3876..4245 /gene="TDGF1" /note="G00-125-188" /number=3 exon 4246..4360 /gene="TDGF1" /note="G00-125-188" /number=4 intron 4361..4458 /gene="TDGF1" /note="G00-125-188" /number=4 exon 4459..4568 /gene="TDGF1" /note="G00-125-188" /number=5 intron 4569..5642 /gene="TDGF1" /note="G00-125-188" /number=5 exon 5643..6972 /gene="TDGF1" /note="G00-125-188" /number=6 BASE COUNT 1955 a 1686 c 1638 g 2075 t 1 others ORIGIN chromosome 3. 
1 ctgcagtcag tacctatggg gaatgggaat ctcttcaggg ggttctggtc cttagagaag 61 ggattttgtg tggctgaaaa gctggcatcc tggggttctg gtcccgtcaa tgacaacatc 121 gccagactgg agacctcagt tacttcctct gaaaatgcag tgatttccag gggtcctatt 181 taagcctcta aaaattccac aagagcttta gataaggaaa tagcaacgca ggggtgtgtt 241 cttttgccag ttctgccaca ggctgcccag tctctaaaga caagacatcc aaatccccca 301 atagaactag ttgtcttgtc cataaagtga gactaatatt gtgggcgcta cttatctact 361 tagcaccttg gactggtgag gactgtggtg cacaagctac cttacaaatg taccacactg 421 agtaaccatc tttaaacctt cctttgcagc tccagggcta gccttctcct ttgcgagccc 481 tccccacctc ggcctcctag agcttcaggc catgttcccc tgtccctgtg aatctcagca 541 tgctacctga agcatttcac ctgaaaaggc cacacaggga ggaggcgaag cgcagcagga 601 atgaaatagt caactgctgt ggagttggaa atgttgctgc atcccaccat tgactggatg 661 gggccctcac tcccccaata caaattattt tacgtttgct tctcccaaga tcatgtgtaa 721 ggccgggcgc ggtagctcat gcctgtaatc ccagcacttt gggaggccga ggcgggtgga 781 tcatgaggtc aggaatttga gaccagactg accaacatgg tgaaaccctg tctctactga 841 aaatacaaaa attagccggg cgttgtgcgg gcgcctgtaa tcccagctac tcaggaggtt 901 gaggcaggag aatgaggcag gagaatcact tgaacccagg aggtggaggt tgcagtgagc 961 caagatcgtg ccattgcact ccagcccggg taacagaggg agactctgtc gcaaaaaaaa 1021 aaaaaaaaaa aaaaaaaaaa tcatgtgtag aagtaaataa cacctcttcc cagcacctgt 1081 ggatccatgg cattcacaca ggtaggcact gacgctgaaa tggcctcaga tgctaccaat 1141 tctttctgcc gaggcctaaa tccaataaac agagacaact atctaataaa gtttcgcatt 1201 gtgtgcctgg caacattaag taaccgtcag gctttccttt agaaattgga attaaatgcg 1261 atttagaaac tattaacgac tttgggcgta tttaataaca catgaaataa cccggaggat 1321 tgaaatgtta ggtgaggcga ctccggctca tagacgcgcc tctccatctg gggcgtctgg 1381 cacttagtgg aaccactcaa taaacacgtt tacccctgca agcggcacat cagagtccgg 1441 gggtaattct cggtgtcgtg gggccaggac ggcgaggggc tggaagaggc cgccctgtgg 1501 gagctgggag gctgagataa attcccgtga ttgggtgctg aaatggcctc ccatgccgga 1561 ctgccgtggt tctagaactt tttcctggaa caggccggca ctcccactgg agagtcccag 1621 ctgcctctgg ccgcccctcc cctctcccgg gcacctggcg ccgctcccgc gtcctttcag 1681 gaattcacgt ccgcctggaa 
tttgcacttc aagtctggag cccccaagga acccctcctg 1741 accctgaact tctatctcag tttcaagctt cctagtcttc cccacacaca cacacctagc 1801 tcctcaggcg gagagcaccc ctttcttggc cacccgggta tcccccaggg agtacggggc 1861 tcaaaacacc cttctggaaa aaacaaaggt ggaagcaaat ttcaggaagt aaaacttctg 1921 aaataaaata aaatatcgaa tgccttgaga cccatacatt ttcaggtttt cctaattaaa 1981 gcaattactt tccaccaccc ctccaacctg gaatcaccaa cttgattaga gaaactgatt 2041 tttctttttt cttttttttt cccgaaaaga gtacctctga tcattttagc ctgcaactaa 2101 tgatagagat attagggcta gttaaccaca gttttacaag actcctcttc ccgcgtgtgg 2161 gccattgtca tgctgtcggt cccgcccacc tgaaaggtct ccccgccccg actggggttt 2221 gttgttgaag aaggagaatc cccggaaagg ctgagtctcc agctcaaggt caaaacgtcc 2281 aaggccgaaa gccctccagt ttcccctgga cgccttgctc ctgcttctgc tacgaccttc 2341 tggggaaaac gaatttctca ttttcttctt aaattgccat tttcgcttta ggagatgaat 2401 gttttccttt ggctgttttg gcaatgactc tgaattaaag cgatgctaac gcctcttttc 2461 cccctaattg ttaaaagcta tggactgcag gaagatggcc cgcttctctt acaggtatga 2521 gctaatctta gaatagtgaa ctttttttga ttgctagaga ttgccagctt aggaagtaat 2581 gttctacact gtcatttgat ttttctcctt gctcaagcct taaaagagct gccaaccgac 2641 tgctgttttt cctgaaagac ctggaatttc acatggttac ttctaacttt gccattggct 2701 tttaacattt tcgtgttaat gttaattttc attttatgtt aatgactctg cctatgaaat 2761 agtgtttctt tacttcttgt acaaataaag gtcagtacta caaccaaatt taaatcttcc 2821 gaaaagatta aaggtataag cagattcaat acttggcaaa actattaaga taatagcaaa 2881 aaaaaaaaaa aaacccacat tttttaccta aaaacctttt aagtgattgg ttaaaatagt 2941 ttggccgggt gcggtggctc acgcctgtaa tcctagcact ttgggaggca gaggcgggtg 3001 gatcactgag gtcaggagac cagcctggcc aacatggcaa aaccccgtct ctattaaaaa 3061 tacaaaaatt agccaagcat ggtggcgggc acctgtaatc ccagctactc tggaggctga 3121 ggcaggagaa ttgcttgaac tggggagggg aggncagtga gccgagatcg caccattgca 3181 ctccagcctg ggtgaaaaac cgaaactccc tctcaaaaat aaataaataa atacagtagt 3241 ttgtaaaatg attcatcggt aacatgggat gcagctattt tttaatcctt atatgaaaat 3301 tgtatgcagg ggaaaacatg tgaaatagaa gataaaagac atatacctac ttaaaattag 3361 gtacttatgt gaggacaggg cctaagaaat 
aataatatat attaaaaaga cttggatatt 3421 ggtgactttt tttcaacatt tttctttgtt acatgaatta gccattaaaa aaagaaagat 3481 ggtgctctac aatttctttt cagtgatctg tggtcttgtc cttgtgatga gaggacctgg 3541 gtgttaactt gtaaggtttt atttcctttg tttggctaac tcatgtttga cttcctcttc 3601 ctagtgtgat ttggatcatg gccatttcta aagtctttga actgggatta gttgccggtg 3661 agagaccttt tgtttctttt gatcactctc aattttatgt ggcctaaaat acagactcca 3721 tgaattgatt tgtcgttaag ggctgggcca tcaggaattt gctcgtccat ctcggggata 3781 cctggccttc agagatgaca gcatttggcc ccaggaggag cctgcaattc ggcctcggtc 3841 ttcccagcgt gtgccgccca tggggataca gcacagtaag aactgcctga cttcgatgct 3901 tctgccctgg cccttcatgt gtctcctgac tatctttcca acactctttc acctaaaagg 3961 gcacctggtt ctggaactgt gcaggtgctg gactgctttg gttttggaag tgagacaagg 4021 attgtgtatt ttacttccct agagtgcagt ttcctcccct gagtccactt cacactggga 4081 acccagaacc accactggcc tatgcatgaa aatgacttct ctgctcaaag gcacagagtc 4141 ttactctgat acaacacatt ggtgttgtat taaccttcgc ttacaggaat tgcccttgca 4201 cttttccatc cctacacctc agtcattctg ttcttacctt tcaaggtaag gagctaaaca 4261 gaacctgctg cctgaatggg ggaacctgca tgctggggtc cttttgtgcc tgccctccct 4321 ccttctacgg acggaactgt gagcacgatg tgcgcaaaga gtaagcaatt cagaggggcg 4381 gggagccgtg gagaggagag agaaagggaa gtggaaattt cagacccaag ctatcgcagc 4441 ttacctgttc attctcagga actgtgggtc tgtgccccat gacacctggc tgcccaagaa 4501 gtgttccctg tgtaaatgct ggcacggtca gctccgctgc tttcctcagg catttctacc 4561 cggctgtggt aagcggaggt tctcctcttt cttttgccct ttgaagttac gtagttgcct 4621 tggggggtgc ttagttagca ggctctcctt gtacctcttg tcttgctaga gcctggcagc 4681 caaagttctg cttataaaag catcgcagac tcctgatgag atagttgcct tggcctcttt 4741 gatatttatt tcctcgggaa cctggctagt cctgctgcct ttcagataga gatgtatttc 4801 aagtctattt gacattttat ggtctgaact tctattgagg aaaataaaca agtctcggtc 4861 tcttgttaaa ccaagagatg ttctctggtg ttcctttcct ttgggtaggg gggacccaaa 4921 ccaggatggg cagctcattt agagcccacc ctgacgacaa attctatcag aggcttggcc 4981 ccttgctagt cctttagaaa cttccagagt cctaaaagtc cctggtaacc ccctccccat 5041 accttaccat gactggtcac agaaccctta ccatgactgg 
tcacagaacc ctttcacctt 5101 cttgattttt tactgatttg aggaatacaa tgaaaagaag ggcagcacct ggagaggaaa 5161 agaggcgaca gtcctctctc caccctagcc tgagccaggt ttctagggcc ccccaaattc 5221 agagacctat tatagttctg ggccttggag atgtagaaat ggaaaatatt caagcccagg 5281 aagtaaatga aagcaaacat ttcactgaga acaggaagga attccccaat ccagacaggg 5341 attgtgtctt tgccatttgc atcctgggtg tcaggctcag gataggtgtt tgataagtgt 5401 gggttgggtg attggatgtg tagggaacat ttgctcttcc tggaacatgg ggcccaagtc 5461 agaatctaac ccaggttgtg ctcattcctg caagtgaagg catcaccact gggctaggtt 5521 ccaggtgtga gtgtcctgag aagagcaggt tcacagtagc gtatagatat gccacatttg 5581 tgggcagcag gatgaactgc cagagaggtt tgctttaatg accaagcatc cctaccttcc 5641 agatggcctt gtgatggatg agcacctcgt ggcttccagg actccagaac taccaccgtc 5701 tgcacgtact accactttta tgctagttgg catctgcctt tctatacaaa gctactatta 5761 atcgacattg acctatttcc agaaatacaa ttttagatat catgcaaatt tcatgaccag 5821 taaaggctgc tgctacaatg tcctaactga aagatgatca tttgtagttg ccttaaaata 5881 atgaatacaa tttccaaaat ggtctctaac atttccttac agaactactt cttacttctt 5941 tgccctgccc tctcccaaaa aactacttct tttttcaaaa gaaagtcagc catatctcca 6001 ttgtgcctaa gtccagtgtt tctttttttt tttttttttg agacggagtc tcactctgtc 6061 acccaggctg gactgcaatg acgcgatctt ggttcactgc aacctccgca tccggggttc 6121 aagccattct cctgcctaag cctcccaagt aactgggatt acaggcatgt gtcaccatgc 6181 ccagctaatt tttttgtatt tttagtagag atgggggttt caccatattg gccagtctgg 6241 tctcgaactc ctgaccttgt gatccactcg cctcagcctc tcgaagtgct gagattacac 6301 acgtgagcaa ctgtgcaagg cctggtgttt cttgatacat gtaattctac caaggtcttc 6361 ttaatatgtt cttttaaatg attgaattat atgttcagat tattggagac taattctaat 6421 gtggacctta gaatacagtt ttgagtagag ttgatcaaaa tcaattaaaa tagtctcttt 6481 aaaaggaaag aaaacatctt taaggggagg aaccagagtg ctgaaggaat ggaagtccat 6541 ctgcgtgtgt gcagggagac tgggtaggaa agaggaagca aatagaagag agaggttgaa 6601 aaacaaaatg ggttacttga ttggtgatta ggtggtggta gagaagcaag taaaaaggct 6661 aaatggaagg gcaagtttcc atcatctata gaaagctata taagacaaga actccccttt 6721 ttttcccaaa ggcattataa aaagaatgaa gcctccttag aaaaaaaatt 
atacctcaat 6781 gtccccaaca agattgctta ataaattgtg tttcctccaa gctattcaat tcttttaact 6841 gttgtagaag acaaaatgtt cacaatatat ttagttgtaa accaagtgat caaactacat 6901 attgtaaagc ccatttttaa aatacattgt atatatgtgt atgcacagta aaaatggaaa 6961 ctatattgac ctaaatgtga actggttatt tctaggtggt gaggtgcttt atggtggtgg 7021 gtttttgctc ttgatgccct ttttgcattt tccaaagtac catggtgagg atgtgttata 7081 tcttttccag ggtcctaaaa gtccctggca actccctccc cataccctac catgactggt 7141 cacagaaccc tttcacctta ttgatttgta ctgatttcat atggaatatg gcaactacat 7201 ctggctcaaa acaaaggaaa ccagaagagc caagtcccag gtgagtgctc agttctgttt 7261 ctagctttga cgtgtgtgtt cttctgtgaa ggacaaaatt tgcttctatt atttaggtac 7321 cataatttgt gtttttccaa attaattccc tgcag // LOCUS HUMTFPB 13865 bp DNA PRI 14-JAN-1995 DEFINITION Human tissue factor gene, complete cds. ACCESSION J02846 NID g339505 KEYWORDS Alu repeat; cell surface integral membrane protein; cell surface receptor; tissue factor. SOURCE Human DNA, clones lambda-TF[559,679,753,885,1377]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13865) AUTHORS Mackman,N., Morrissey,J.H., Fowler,B. and Edgington,T.S. TITLE Complete sequence of the human tissue factor gene, a highly regulated cellular receptor that initiates the coagulation protease cascade JOURNAL Biochemistry 28 (4), 1755-1762 (1989) MEDLINE 89247359 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.H.Morrissey, 25-OCT-1988. 
FEATURES Location/Qualifiers source 1..13865 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p22-p21" prim_transcript 799..13232 /note="TF mRNA and introns" CDS join(922..1021,2190..2301,6392..6591,9289..9467, 10075..10234,11955..12091) /gene="F3" /note="tissue factor" /codon_start=1 /db_xref="GDB:G00-119-895" /db_xref="PID:g339506" /translation="METPAWPRVPRPETAVARTLLLGWVFAQVAGASGTTNTVAAYNL TWKSTNFKTILEWEPKPVNQVYTVQISTKSGDWKSKCFYTTDTECDLTDEIVKDVKQT YLARVFSYPAGNVESTGSAGEPLYENSPEFTPYLETNLGQPTIQSFEQVGTKVNVTVE DERTLVRRNNTFLSLRDVFGKDLIYTLYYWKSSSSGKKTAKTNTNEFLIDVDKGENYC FSVQAVIPSRTVNRKSTDSPVECMGQEKGEFREIFYIIGAVVFVVIILVIILAISLHK CRKAGVGQSWKENSPLNVS" exon <922..1021 /gene="F3" /note="tissue factor" /number=1 gene join(922..1021,2190..2301,6392..6591,9289..9467, 10075..10234,11955..12091) /gene="F3" intron 1022..2189 /note="TF intron A" exon 2190..2301 /gene="F3" /number=2 intron 2302..6391 /note="TF intron B" repeat_region 6127..6241 /note="Alu repeat partial copy A" exon 6392..6591 /gene="F3" /number=3 intron 6592..9288 /note="TF intron C" repeat_region 8391..8677 /note="Alu repeat copy B" exon 9289..9467 /gene="F3" /number=4 intron 9468..10074 /note="TF intron D" exon 10075..10234 /gene="F3" /number=5 intron 10235..11954 /note="TF intron E" repeat_region 10954..11249 /note="Alu repeat copy C" exon 11955..>12091 /gene="F3" /note="tissue factor" /number=6 repeat_region 12458..12757 /note="Alu repeat copy D" BASE COUNT 3711 a 2955 c 3240 g 3959 t ORIGIN 1 bp upstream of EcoRI site; chromosome 1. 
1 gaattctccc agaggcaaac tgccagatgt gaggctgctc ttcctcagtc actatctctg 61 gtcgtaccgg gcgatgcctg agccaactga ccctcagacc tgtgagccga gccggtcaca 121 ccgtggctga caccggcatt cccaccgcct ttctcctgtg cgacccgcta agggccccgc 181 gaggtgggca ggccaagtat tcttgacctt cgtggggtag aagaagccac cgtggctggg 241 agagggccct gctcacagcc acacgtttac ttcgctgcag gtcccgagct tctgccccag 301 gtgggcaaag catccgggaa atgccctccg ctgcccgagg ggagcccaga gcccgtgctt 361 tctattaaat gttgtaaatg ccgcctctcc cactttatca ccaaatggaa gggaagaatt 421 cttccaaggc gccctccctt tcctgccata gacctgcaac ccacctaagc tgcacgtcgg 481 agtcgcgggc ctgggtgaat ccgggggcct tgggggaccc gggcaactag acccgcctgc 541 gtcctccagg gcagctccgc gctcggtggc gcggttgaat cactggggtg agtcatccct 601 tgcagggtcc cggagtttcc taccgggagg aggcggggca ggggtgtgga ctcgccgggg 661 gccgcccacc gcgacggcaa gtgacccggg ccgggggcgg ggagtcggga ggagcggcgg 721 gggcgggcgc cgggggcggg cagaggcgcg ggagagcgcg ccgccggccc tttatagcgc 781 gcggggcacc ggctccccaa gactgcgagc tccccgcacc ccctcgcact ccctctggcc 841 ggcccagggc gccttcagcc caacctcccc agccccacgg gcgccacgga acccgctcga 901 tctcgccgcc aactggtaga catggagacc cctgcctggc cccgggtccc gcgccccgag 961 accgccgtcg ctcggacgct cctgctcggc tgggtcttcg cccaggtggc cggcgcttca 1021 ggtgagtggc accagcccct ggaagcccgg ggcgcgccac acgcaggagg gaggcgacag 1081 tcctggctgg cagcgggctc gccctggttc cccggggcgc ccatgttgtc ccccgcgcct 1141 acgggactcg gctgcgctca cccagcccgg cttgaatgaa ccgagtccgt cgggcgccgg 1201 cgggagttgc agggagggag ttggcgcccc agaccccgct gccccttccg ctggagagtt 1261 ttgctcgggg tgtccgagta attggactgt tgttgcataa gcggactttt agctcccgct 1321 ttaactctgg ggaaagggct tcccagtgag ttgcgacctt caatatgata ggacttgtgc 1381 ctgcgtctgc acgtgttggc gtgcagaggt ttggatatta tctttcatta tatgtgcatc 1441 ttcccttaat aaagagcgtc cctggtcttt tcctggccat ctttgttcta ggtttgggta 1501 gaggcaatcc aaaagggctg gattgctgct tagattggag caggtacaac gttgtgcatg 1561 ccccgtattt ctacgaggtg ttcgggacgg cgtagagact gggacctgct gcgtactggc 1621 aaagcagacc ttcataagaa ataatcctga tccaatacag ccgacggtgt gacaggccac 1681 acgtccccgt gggtctctgt 
ggaagtttca gtgtagcgac atttcagata aaagtggaaa 1741 aagtgaagtt tggctttttt catttgtatg cagtcctaac tcttgtcaca cgtgtgggat 1801 ttatcttttt ccataactta ctgaaaaccc ttcctggcgg gctgaacctg actcttcctg 1861 agctgagtcc tggactggca cactgatggc tctgggctct tcccggtcaa gttataacaa 1921 ggctttgccc atgaataatt tcaaacgaaa atgtcaagat ccttgccggt gtcctgggat 1981 tacaaggtga atcttgtcat gaagaaattc taggtctaga aaaaatttga agattctttt 2041 tctcttgata attcactaat gaagcttttg tggttgaaaa ataaaaagtg aggtttatgg 2101 tgatgtcagg tgggaaggtg ttttatacat caatacattc gagtgctctg aagtgcatgt 2161 aataatagct gtttctctgt tgtttaaagg cactacaaat actgtggcag catataattt 2221 aacttggaaa tcaactaatt tcaagacaat tttggagtgg gaacccaaac ccgtcaatca 2281 agtctacact gttcaaataa ggtaagctgg gtacagaaaa agaaaattaa ggtctttgat 2341 gtttctactg tcctatgctg aacaagaatg tctttaaagc tgattactgg atgaaattat 2401 ttaacagatg acgaagaaga agggattctt ggcaattcgc tggccggtgt catactctat 2461 taggcctgca acatttccag accttaaact gatagaacat tttaattgtt ttaattgttt 2521 ttggaaatga tgggagagtt cctaagtgga gtataaactg tggagagatg aaccatcttg 2581 agtaggcact gaagtgtgct ttgggtcatg atagattaat taatctcatc taaacattga 2641 tgtctttttc cgttgctgtc tagactgtga acaatgtcta acaccttagg gaagaggtgg 2701 ggaggaatcc caatgtatac attgccctta agcagtgttt gattcattca tctttggact 2761 ccatgaatcg aaatctggta gaatacatga tcttagtgga ggaggccaaa tgcgtgactc 2821 actgagcctg gcagagcaga aatactctgc tgtctgcacc ctctgggtct ggtgtggctc 2881 tgcttcttgg tgcttcaact ctgactggca gctgtcccca ggaggcgata attcagcatg 2941 ttcaatctaa aggttatgac ttccttgatg gttttcacca tattcttggc aagtttttgg 3001 tttttgaaat gttctaggag gcttggtaga gatcttatga aatagagaat agctgctgtg 3061 gaaattattt taatgctaat tacataaaag tacaaaagta gcactagcta aaacaaaagg 3121 tattttgctg ttctgttttg ttttagcttg tgccaggcct tttacagcat taggaatgca 3181 acttctagat aacgatgcat cttttaagtg aatgttcttg tttttcaaaa tgaacttcat 3241 gacagtagtt gccaaaccag caaggagaac ttgcatgcat acgtgcatgc atgtgtggat 3301 atgtatgggg gtggggggag agaaagatga aggaatttca taacatgaaa taatgattac 3361 agttctggtc aaacttgtca attcagattt 
caccaattga gaattagtaa gtaatttctc 3421 tgatacaggc ctgaagttta ccttagtaaa cactttactt ccatatggta aaaattagat 3481 tttgggagga atgcttacct cctaaatata ttcaatctaa tatttgagga cacatgggaa 3541 tatatttatg attcatctgc tttttaaaca taagcctttg ttaactgtaa gttcttgaac 3601 tttataaggc tgctgttatt taaatgagca cagctcctga tctgcaaaca gcagagcgca 3661 gggctacagc ttgggggatg ccagccgact cagggtggtc ctgtggactg aacaatctct 3721 tgctgctgta ctggagggcc tgggagcttt tccatcagcc tcggcctgag gtgtgcactc 3781 ttctcctgcc caccccagga ataaatgaga ttcctggtta aaaaggacca gagcagtcat 3841 tttacagttg aggaaactgt tgctctgaga agtgagggat ttattcatga ctacactgat 3901 ggtgagtgcc catgtcaggt ctggaaccaa agtctaccca gtatccacac accaccatcc 3961 ctcaggtggc tctgccacag tctgatggga ggctccaaag cgggaggaag aaggaaagtc 4021 ttgcccactg catctcctca gttggccttc ctctctgcct gttttccctc cctacagtta 4081 gcatcttaag cagctgcctc tcttccctcc cgactgctct cactactgca gcctggctcc 4141 agccgcagga cactactgct gtgcagaagc ccctacttgg aactccaact gcatttttca 4201 cctttgctaa cagttttcag tggtggttgg gaaatgttat tggcttaagc cttagcacaa 4261 accgtcaccg gtgatattca ttccatggaa atgttctgaa ttctaaagct gaatttacaa 4321 agcttctgga aaacaacctg caaccaaatt agtgactgaa ttttttagtt aactcaaaat 4381 tccaaatcag agggttttgc aatgcctgga ggaaccttgg aggcttttaa agtgttaatg 4441 ctattaatgg cattcagagg gattttctac agaattgtcc cttcattacc tgtttataca 4501 gttttactac ttaccagggt actgtataaa tccttgtgct aaattttgct atagagtatg 4561 tggtccctgc tgtgagctgg gaggaaccaa atactgtatc tctatgttac atagaaagcc 4621 ctaggagact ttctcctgtt atctgaacaa ctatttgctg tactgataaa aaggaaacag 4681 catagtctca ttcacttttt gaaatggaaa tgataaaata aaacacattt tggtcattcg 4741 ggaacaaaat accctctcta cttttatcac ataaaattaa ataaatagaa accaaaatat 4801 ttcagtatca atcttagttt gtgcacttta ggataaagaa tgtgtttacc caaatccttt 4861 tggcctggtt acttagttca gattttgaaa gaaaatatat ttgtggcttt tatgtgtgaa 4921 tttagacaat ggaatccatg tggtgcctcg ttttccctga gattatgtat taattcaacc 4981 tgtaaatgca aaccatctaa tagtcagcga gaccctatag ccctgctgct taatgggggc 5041 acacaagggc atgcagccct cgtaccaggc agactgtgtt 
catattaaca gcatcgtgga 5101 gaaactcatg ctgggggaca ggggagggag atgtaaatgc tcagcaggga gatctggaga 5161 ttcctggagc aggtggagtt gggacctggc cttgaacgat gggtctggct ctggcagtca 5221 gtaatgccaa agggaagagc agcataactg tcactttcca tgggacagaa gtgtgtgaat 5281 caagttgcag tgacgcttca cctatttatt attttggtca tttagaagaa tttcattgtc 5341 agtagaagtc ctttaaatca tttccccttc agtgacgtct cacaaaaaaa agatctgtct 5401 ttagcttttt agtctcagac tttattagac agatactacc tgtactctta ttctgtaatc 5461 tttgttggga tggattcaca tcttgcaaag gaagggaggc atgtagtata atggggcaaa 5521 cagacccagc tctgccactc gttagatatg tgaccttctg caagttgctt agtgcctgtg 5581 agcttcagtg tcctcatgga taagaaagat ccaacacctt cttggaagga ttatatcaaa 5641 tgaagtaaca tgagtaaagg gtccagcaga atacctggca tatagtggag tcaatgaatg 5701 attaataata ttattaatag tggtcatgag agatatatgt ataacatgtt attatgtaga 5761 ctcactatat agactctatt ctacatagaa tatagaacat tatataacaa acaactataa 5821 taagtagact atagtaaaca acctcacttt gtctcagttg cctcatcttg atggaaaact 5881 gctctttctc tcctgttacc ctgacagaga gcgtctacat tctaaaagaa agatatttaa 5941 caaaatggtt gagtacagat ccaagagtca aatagctgtc tggttcaaag tccagctgtg 6001 tgattttgag ctagtcaccc aatctcactt tgtctcagta gccttatttg taaaaacaag 6061 gcaaattaca gagccatccc ctgggttgct atgaggactc aaacatgcat cccaagtgct 6121 cggtgttgct aggtatgatg gctcacacct gtacattcag cactttggga ggccgaagca 6181 gaaggatcag cctgggcaac atagcaggac cccatctcta caaaacaatg tttaaaaaaa 6241 agcaaagtgc tcagcacagt gactgcatca ttaggattga ttgtagggct cctgatgtta 6301 gcacagaaca ccacagccag gaagcagtct atcttgttgg gtgcaaattg taacattcca 6361 tttatgtttc ttccttcttt tctttcttta gcactaagtc aggagattgg aaaagcaaat 6421 gcttttacac aacagacaca gagtgtgacc tcaccgacga gattgtgaag gatgtgaagc 6481 agacgtactt ggcacgggtc ttctcctacc cggcagggaa tgtggagagc accggttctg 6541 ctggggagcc tctgtatgag aactccccag agttcacacc ttacctggag agtaagtggc 6601 ttgggctgta ataccgttca ttcttgttag aaacgtctga acattctcgt gatcttgtgc 6661 ctttaggggc tacaaaatta aaaatattta ttcttttttt ctcagaaact ggtatgtatc 6721 acagccctct tcacacattc cagatgtggt aggaggttca cagaatgtga 
acttttggag 6781 ctgatgacag tgtcatcaag taactttctc ccccagtctg tccccagacc ctgttactgt 6841 cctcagtaag cggctgaatg tgtgttggga gagggcgggc cagggaagcg ggtagggata 6901 ggaaatccac caaggccggg gttttagctt ttccctatat atatatcatg tatcctgatt 6961 tttctgtccc gttatcacac taaaaatccc agttgaggat ttttcccaaa cggtcataaa 7021 tcaatgagga aagtccatgg tttccctctg agcccataat tagcctaatt atgctgacct 7081 tttctaatca gttggccatg atttgagttc cgtgatgtgc cagcacctgc ccagccatct 7141 gcctgtcacc ctcgttctgg ttttggaaag gtggaatact ttcctcctca gcctttgccc 7201 ctgtaagctg gccctaggag ccagtaaaag aatgaagaga attcctgtca agtaggagat 7261 ttattctttt gccgcaactg tggctctgag ctaggcaatt tagataaatg catgtagcac 7321 attgagtaga gtgaaattag cttctcttgt aaggccagct ggttagaatg aaggtgttgt 7381 gtgagtgtta ggcccagcga gagagaacag tttctcaagg taggaatggt gaaaagaagg 7441 ggtggacgga caaccaacca accatcctcc tctggtatct actttgaggg ttgaaatagg 7501 gggcctgacc ccaggtgaat gtggctgcct tcccagagcc cccatttgca agaccctcca 7561 gacccccagg tgcttctgct tgtgtctttt gtggcaccag gcaagaatgt agcagcgtca 7621 gcagcccctc tggtgactgt ggcatggttg acattcattt cccccctaat taatggcatc 7681 ctcatgattc tcttttatat taatagttct tgagtttttt tgtaagctac ttcaaatcct 7741 ttgttggtgc aagatagaag atattttatg tgtttgtttt gcatgtgcac acacatattt 7801 ggcctgtgaa ttgatgtttg ttttcctgtc atttaaccaa agcacatgag ataattgagc 7861 cattgcagag accccgtggt taaatccggc ttctcgaggt accaaggaca tttcctgggc 7921 tttctcacag ccctacatat ttttgaacct aaaatatcgt agtttatgct accaccctgt 7981 tcagtatagt agccactagc cacatgtggc tgttgaccac ttgaaatatg gctaatgctc 8041 taagtataaa gtacacactg gaatttaaga agtgtagaat atctcaaaac ttttttatat 8101 tgattacaca ttaaaatgat tatattccag atatatgcag ttgactcaag caatgcatgg 8161 ctgagaggca ccgactccct gtgcagttga aaatccgagt ataacttgac tccccaaaaa 8221 cttaactact aatagcctac ctatcggttg actgttgact gcagccttac caataagata 8281 aacagtcaat taacacacat ttttcatgtt gcgtgtatta tatactgtat tcttacaata 8341 aagtaagcta gaggaaagaa aatgttatta agaaaattat aaggaaaaga ggctgggcat 8401 ggtggctcgt gcctgtaatc tcagaacttt gggatgctaa ggcgggtgga tcacttgagg 
8461 tcaggagttc aagaccagcc tggccaacat ggtgaaaccc catctctact aaaaatacaa 8521 aaattagcca ggcgtggttg tgggtgcctg taatcccagc tacttgggag gctgaggcag 8581 gagaatcact tcgacccagg tggaggaggt tgcagtgaac tgagattgcg ccactgcact 8641 ccggcctggg tgacagagcg agactctgtc taaaaaagaa agggaaagaa agaaaaaaaa 8701 gaaaagaaaa gaaaagaaag aaggaaggaa gagaaagaat tataaggaag agaaaatata 8761 tttactattg ataaagtgga agtggatcat cataaaggtg ttcatcctcg tcatcttcat 8821 gttgagtagg ctgaggagga ggaggaggag gaagagcagg ggccacggca ggagaaaaga 8881 tggaggaagt aggaggcggc acacttggtg taacttttat ttaaaaaaat ttgcatacaa 8941 gtggatccac agagttcaaa cccatgttgt tcaggggtca actgtctttg gttaaataaa 9001 atatattatt aaaattaatt tcacctgttc ctttttactt tttctaatgt gactactaga 9061 aaacttaaaa tgacatctga ggctccattg tcttcccctt gggccagcac taccacagaa 9121 tgtcttagga ttcagctcca ggccgccacg cctgcttctt tcagggagct ggttctatgc 9181 acatgtttta tatgagagat aattaagttg tcaattgtga taacaaaaca ggatttgact 9241 ttgtacagaa ttctttggtt ccaaccaagc tcatttcctt tgtttcagca aacctcggac 9301 agccaacaat tcagagtttt gaacaggtgg gaacaaaagt gaatgtgacc gtagaagatg 9361 aacggacttt agtcagaagg aacaacactt tcctaagcct ccgggatgtt tttggcaagg 9421 acttaattta tacactttat tattggaaat cttcaagttc aggaaaggtg agcatttttt 9481 aatttgtttt tatgacctgt tttaaattgt gaatacttgg ttttacaacc catttcttcc 9541 ccaattcaaa aatagcagaa cagagttgtt gagaaggtga tggagtagaa gggggagcgc 9601 gcactgtggg gaggggtgga caacaggcct ggtcctacct gtgactctgc actaccctgt 9661 gactctggca gggccccctc ggagacccag gttcctcagc caaccggctg gatcaggtca 9721 tctctaaagg tcccgccacg ctcacatttc tccctctatt gaggatccca ggcacaaaat 9781 ttgtttttgg ttcaatgcat aatactccct tcctttttct tttactgcag atatcttcta 9841 aaggggctca atagggttca atatgcctaa attggatctt ctcagtcttg gaaaaggcat 9901 ttttagcagt gatcaaggga aactgattag cgaagtcact tctaatcctt cacgtgtcag 9961 ctgtgttctt gtaggctttg cttagaacct aggtttttac ttccacagtg acttaataaa 10021 ggggaaagaa ttgactcaga gcccagatga attaagaact ctatcttttt acagaaaaca 10081 gccaaaacaa acactaatga gtttttgatt gatgtggata aaggagaaaa ctactgtttc 10141 
agtgttcaag cagtgattcc ctcccgaaca gttaaccgga agagtacaga cagcccggta 10201 gagtgtatgg gccaggagaa aggggaattc agaggtgagt ggctctgcca gccatttgcc 10261 tgggggtatg ggtgctgtgg gtgacttctg gaggagtagc tccaccctca gggctgggat 10321 atacttcctt ggttaaatat tcaggaaaac aaactgcctg gaggtttttt gttgttattt 10381 gtttgttttg gttttgattt tgctttggta caaaaaagat tttggacatt tagaaatgtt 10441 tctgtgttga ttgtgccctt gtattagcag gtgttttctt gagcacctgt catgtgctaa 10501 gccctctgct gagcactgga tacacaaact gtgtttagga tttagcaaca agtcacagat 10561 ttccctgggc attttttcat gcttaaattc taattctggg ggtggcttct ggaccagctg 10621 caacaggaca cagtagacat tcgtgagtac ccactgtggg ctgttgccac agaggctgta 10681 gagtctaacc catcaaggga agggattgag tatatcaaat atacccacat gcatgcatgt 10741 gtgtatatgg cggacacgtg tgtgtacatg catgtgcata tgttgggagc tcaggcccat 10801 tgtgcgagga acagtcccta accggaagtg ctgtgggcct tcagactctt gcaggaagct 10861 gcaagcctgt gtgtctcgat ccatgcctta cagggaaagt attctgagta ctttcagtga 10921 agaaaagagt caggggatat aaacgatggc ttacgctggg tgtggtggct cacgcctgta 10981 gtccctgcac tttgggaggc ccagacaggc aaatcacttg aggtcaggag tttgggacca 11041 gcctggccaa catggtaaaa gcccatctct actcaaaata caaaaagtag ctgggtgtgg 11101 ttgcacgtgt ctgtagtccc agctactcag gaggttgagg caggagaatt gcttgaacct 11161 gggaggcgga ggctgaagtg agctgagatt ggaccactgt actccagcct gggtgacaga 11221 gcgagattcc atctcaaaaa aaaaaaaaag aaacaacgaa aaaagaaatg atggcttagc 11281 tccatgtgaa gatgatattt gaacatttta aaacacttta aataaactgt tctctcctgt 11341 ttattgccac tgacaggaga ggtttctctt tacctctggt cctgcacccc tctgagccat 11401 cctacccaca gccttcagtc attgtcctaa agcctagctc taattccact gcctctcctt 11461 ttgtgcacac acacttctct gcttccctgg ccgttctcta tcttggagag gcatttcaaa 11521 cgccacttcc accagaaggc cttgctactg caccaactag ttactatctc ttcttcaccc 11581 aaatcctggt agcactttgg atctcccact tgcacttagg gttcaccttc cgttataatc 11641 attgccatca atctcagcat cgttttaggc acttctttcc agccattgtt cttacctcca 11701 actacatatc ttttctggac tgtgcattat tcagtttatt aaatgcccat taaatgtgtt 11761 tagccattgt caattactct gaaacgttca ggttttgaca aattctttcc 
taatgtaagt 11821 gtggtggaaa gagtgaaaga aagtcaaatt gcacaaaaat aggatggtgt aatttggggt 11881 tatgccgtca attttgtcca ctgataaatg ggatttgagc tctccaagtt gactagatgc 11941 cctttatttt tcagaaatat tctacatcat tggagctgtg gtatttgtgg tcatcatcct 12001 tgtcatcatc ctggctatat ctctacacaa gtgtagaaag gcaggagtgg ggcagagctg 12061 gaaggagaac tccccactga atgtttcata aaggaagcac tgttggagct actgcaaatg 12121 ctatattgca ctgtgaccga gaacttttaa gaggatagaa tacatggaaa cgcaaatgag 12181 tatttcggag catgaagacc ctggagttca aaaaactctt gatatgacct gttattacca 12241 ttagcattct ggttttgaca tcagcattag tcactttgaa atgtaacgaa tggtactaca 12301 accaattcca agttttaatt tttaacacca tggcaccttt tgcacataac atgctttaga 12361 ttatatattc cgcactcaag gagtaaccag gtcgtccaag caaaaacaaa tgggaaaatg 12421 tcttaaaaaa tcctgggtgg acttttgaaa agcttttttt tttttttttt tttttgagac 12481 ggagtcttgc tctgttgccc aggctggagt gcagtagcac gatctcggct cactgcaccc 12541 tccgtctctc gggttcaagc aattgtctgc ctcagcctcc cgagtagctg ggattacagg 12601 tgcgcactac cacgccaagc taatttttgt attttttagt agagatgggg tttcaccatc 12661 ttggccaggc tggtcttgaa ttcctgacct caggtgatcc acccaccttg gcctcccaaa 12721 gtgctagtat tatgggcgtg aaccaccatg cccagccgaa aagcttttga ggggctgact 12781 tcaatccatg taggaaagta aaatggaagg aaattgggtg catttctagg acttttctaa 12841 catatgtcta taatatagtg tttaggttct tttttttttc aggaatacat ttggaaattc 12901 aaaacaattg gcaaactttg tattaatgtg ttaagtgcag gagacattgg tattctgggc 12961 accttcctaa tatgctttac aatctgcact ttaactgact taagtggcat taaacatttg 13021 agagctaact atatttttat aagactacta tacaaactac agagtttatg atttaaggta 13081 cttaaagctt ctatggttga cattgtatat ataatttttt aaaaaggttt tctatatggg 13141 gattttctat ttatgtaggt aatattgttc tatttgtata tattgagata atttatttaa 13201 tatactttaa ataaaggtga ctgggaattg ttactgttgt acttattcta tcttccattt 13261 attatttatg tacaatttgg tgtttgtatt agctctacta cagtaaatga ctgtaaaatt 13321 gtcagtggct tacaacaacg tatctttttc gcttataata cattttggtg actgtaggct 13381 gactgcactt cttctcaatg ttttctcatt ctaggatgca aaccaatgga gaagccccta 13441 attagatcag ggcagaggga aaaacaaaaa 
actggtagaa accggcaacc acagcttcaa 13501 gctttaagcc catctcctac acttctgctc tgtacgtgcc cattgtcact tctgttcaca 13561 tgctactgtc ccaagcaagt gaccaagcct gacaatactt tgtctactgg agtcactgca 13621 aggcacatga cggggcaggg atgtcgtctt acagggaaga gaaaagataa tgctctctac 13681 tgcagacttg gagagatttc ttcccattgg cagtagtttg actaattgga gatgagaaaa 13741 aaagaaacat tcttgggatg attgtattga aacaaaatta ggtaaaagga caatatagga 13801 tagggagaga tataagtgga atgagatctc tagagtccat taaaagcaag ctagattgag 13861 agctc // LOCUS HUMTHROMA 6163 bp DNA PRI 29-JAN-1995 DEFINITION Human thrombopoietin gene, complete cds. ACCESSION L36051 NID g533214 KEYWORDS thrombopoietin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6163) AUTHORS Foster,D.C., Sprecher,C.A., Grant,F.J., Kramer,J.M., Kuijper,J.L., Holly,R.D., Whitmore,T.E., Heipel,M.D., Bell,L.A.N., Ching,A.F., McGrane,V., Hart,C., O'Hara,P.J. and Lok,S. TITLE Human thrombopoietin: gene structure, cDNA sequence, expression, and chromosomal localization JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
91 (26), 13023-13027 (1994) MEDLINE 95108091 FEATURES Location/Qualifiers source 1..6163 /organism="Homo sapiens" /db_xref="taxon:9606" exon <1..158 /number=1 intron 159..1826 /number=1 exon 1827..1984 /number=2 CDS join(1972..1984,2216..2343,2630..2716,4649..4816, 5053..5718) /codon_start=1 /product="thrombopoietin" /db_xref="PID:g533215" /translation="MELTELLLVVMLLLTARLTLSSPAPPACDLRVLSKLLRDSHVLH SRLSQCPEVHPLPTPVLLPAVDFSLGEWKTQMEETKAQDILGAVTLLLEGVMAARGQL GPTCLSSLLGQLSGQVRLLLGALQSLLGTQLPPQGRTTAHKDPNAIFLSFQHLLRGKV RFLMLVGGSTLCVRRAPPTTAVPSRTSLVLTLNELPNRTSGLLETNFTASARTTGSGL LKWQQGFRAKIPGLLNQTSRSLDQIPGYLNRIHELLNGTRGLFPGPSRRTLGAPDISS GTSDTGSLPPNLQPGYSPSPTHPPTGQYTLFPLPPTLPTPVVQLHPLLPDPSAPTPTP TSPLLNTSYTHSQNLSQEG" intron 1985..2215 /number=2 exon 2216..2343 /number=3 intron 2344..2629 /number=3 exon 2630..2716 /number=4 intron 2717..4648 /number=4 exon 4649..4816 /number=5 intron 4817..5052 /number=5 exon 5053..>6163 /number=6 BASE COUNT 1524 a 1706 c 1435 g 1498 t ORIGIN 1 tggggtctcc cccctctgtg tggggagaag tgtgccagag agacgcatgt cctcctcctg 61 tggaggggct gttctccacc accacatgtc ttcctaccaa tctgctcccc agagggctgc 121 ctgctgtgca cttgggtcct ggagcccttc tccacccggt gagtggccag cagggtgtgg 181 ggttatgtga gggtagaaag gacagcaaag agaaatggcc tcccagctgg gggaggggca 241 ggcaaactgg aacctacagg cactgacctt tgtcgagaag agtgtagcct tcccagaatg 301 ggaggagcag ggcagagcag gggtaggggg tggggtgctg ttttctgagg gactgatcac 361 ttacttggtg gaatacagca cagccctggc tggccctaag gaaaggggac atgagcccag 421 ggagaaaata agagagggag ctgcacttag ggcttagcaa acacagtagt aagatggaca 481 cagccccaat ccccattctt agctggtcat tcctcgttag cttaaggttc tgaatctggt 541 gctggggaag ctgggccagg caagccaggg cgcaaggaga gggtaatggg aggaggccca 601 ctcatgttga cagacctaca ggaaatccca atattgaatc aggtgcaagc ctctttgcac 661 aacttgtgaa aggaggagga agccatgtgg ggggtcctgt gaaggaaccg gaaggggttc 721 tgccaagggg gcagggaggc aggtgtgatc tatgagacag atatgttagt gggcgcctaa 781 gacaaggtaa gcccctaagg tgggcatcac ccagcaggtg cccgttcctg ggcagctggt 841 ctcaggaagg aagtcccaga actgttagcc 
catctcttgg cctcagataa tggagtattt 901 caggacttgg agtccagaga aaagctccag tggctttatg tgtgggggta gatagggaaa 961 gatagaggtt aatttctccc ataccgcctt ttaatcctga cctctagtgg tcccagttac 1021 agctttgtgc agttcccctc cccagcccca ctccccaccg cagaagttac ccctcaacat 1081 attgcgcccg tttgccagtt cctcacccag gccctgcatc ccattttcca ctctcttctc 1141 caggctgaag ccacaatact ttccttctct atccccatcc cagattttct ctgacctaac 1201 aaccaaggtt gctcagaatt taaggctaat taagatatgt gtgtatacat atcatgtcct 1261 gctgctctca gcaggggtag gtggcaccaa atccatgtcc gattcactga ggagtcctga 1321 caaaaaggag acaccatatg ctttcttgct ttctttcttt ctttctttct ttcttttttt 1381 tttttgagac ggagtttcac tcttattgcc caggctggag tgcaatggtg cgatctcggc 1441 tcaccacaac ctccgcctcc caggtacaag cgattctcct gtctcagcct cccaagtagc 1501 ttggattaca ggcatgaacc accacaccct gctagttttt ttgtatttcg tagagccggg 1561 gtttcaccat gttagtgagg ctggtggcga actcctgacc tcaggtgatc cacccgcctt 1621 ggactcccaa agtgctggga ttacaggcat gagccactgc acccggcaca ccatatgctt 1681 tcatcacaag aaaatgtgag agaattcagg gctttggcag ttccaggctg gtcagcatct 1741 caagccctcc ccagcatctg ttcaccctgc caggcagtct cttcctagaa acttggttaa 1801 atgttcactc ttcttgctac tttcaggata gattcttcac ccttggtccg cctttgcccc 1861 accctactct gcccagaagt gcaagagcct aagccgcctc catggcccca ggaaggattc 1921 aggggagagg ccccaaacag ggagccacgc cagccagaca ccccggccag aatggagctg 1981 actggtgaga acacacctga ggggctaggg ccatatggaa acatgacaga aggggagaga 2041 gaaaggagac acgctgcagg gggcaggaag ctgggggaac ccattctccc aaaaataagg 2101 ggtctgaggg gtggattccc tgggtttcag gtctgggtcc tgaatgggaa ttcctggaat 2161 accagctgac aatgatttcc tcctcatctt tcaacctcac ctctcctcat ctaagaattg 2221 ctcctcgtgg tcatgcttct cctaactgca aggctaacgc tgtccagccc ggctcctcct 2281 gcttgtgacc tccgagtcct cagtaaactg cttcgtgact cccatgtcct tcacagcaga 2341 ctggtgagaa ctcccaacat tatccccttt atccgcgtaa ctggtaagac acccatactc 2401 ccaggaagac accatcactt cctctaactc cttgacccaa tgactattct tcccatattg 2461 tccccaccta ctgatcacac tctctgacaa ggattattct tcacaataca gcccgcattt 2521 aaaagctctc gtctagagat agtactcatg gaggactagc 
ctgcttatta ggctaccata 2581 gctctctcta tttcagctcc cttctccccc caccaatctt tttcaacaga gccagtgccc 2641 agaggttcac cctttgccta cacctgtcct gctgcctgct gtggacttta gcttgggaga 2701 atggaaaacc cagatggtaa gaaagccatc cctaaccttg gcttccctaa gtcctgtctt 2761 cagtttccca ctgcttccca tggattctcc aacattcttg agctttttaa aaatatctca 2821 ccttcagctt ggccacccta acccaatcta cattcaccta tgatgatagc ctgtggataa 2881 gatgatggct tgcaggtcca atatgtgaat agatttgaag ctgaacacca tgaaaagctg 2941 gagagaaatc gctcatggcc atgcctttga cctattcccg ttcagtcttc ttaaattggc 3001 atgaagaagc aagactcata tgtcatccac agatgacaca aagctgggaa gtaccactaa 3061 aataacaaaa gactgaatca agattcaaat cactgaaaga ctaggtcaaa aacaaggtga 3121 aacaacagag atataaactt ctacatgtgg gccgggggct cacgcctgta atcccagcac 3181 tttgggaggc cgaggcaggc agatcacctg agggcaggag tttgagagca gcctggccaa 3241 catggcgaaa ccccgtctct actaagaata cagaattagc cgggcatggt agtgcatgcc 3301 tgtaatccca gctacttgga aggctgaagc aggagaatcc cttgaaccca ggaggtggag 3361 gttgtagtga gctgagatca tgccaatgca ctccagcctg ggtgacaaga gcaaaactcc 3421 gtctcaaaaa gaaaaaaaaa ttctacatgt gtaaattaat gagtaaagtc ctattccagc 3481 tttcaggcca caatgccctg cttccatcat ttaagcctct ggccctagca cttcctacga 3541 aaaggatctg agagaattaa attgccccca aacttaccat gtaacattac tgaagctgct 3601 attcttaaag ctagtaattc ttgtctgttt gatgtttagc atccccattg tggaaatgct 3661 cgtacagaac tctattccga gtggactaca cttaaatata ctggcctgaa caccggacat 3721 ccccctgaag acatatgcta atttattaag agggaccata ttaaactaac atgtgtctag 3781 aaagcagcag cctgaacaga aagagactag aagcatgttt tatgggcaat agtttaaaaa 3841 actaaaatct atcctcaaga accctagcgt cccttcttcc ttcaggactg agtcagggaa 3901 gaagggcagt tcctatgggt cccttctagt cctttctttt catccttatg atcattatgg 3961 tagagtctca tacctacatt tagtttattt attattatta tttgagacgg agtctcactc 4021 tatcccccag gctggagtgc agtggcatga tctcaactca ctgcaacctc agcctcccgg 4081 attcaagcga ttctcctgtc tcagtctccc aagtagctgg gattacaggt gcccaccacc 4141 atgcccagct aatttgtgta tttgtggtag agatggggtt tcaccatgtt gggcaggctg 4201 atcttgaact cctgacctca ggtgatccac ctgcctcagc ctcccaaagt 
gctgggatta 4261 caggcgtgag ccactgcacc cagccttcat tcagtttaaa aatcaaatga tcctaaggtt 4321 ttgcagcaga aagagtaaat ttgcagcact agaaccaaga ggtaaaagct gtaacagggc 4381 agatttcagc aacgtaagaa aaaaggagct cttctcactg aaaccaagtg taagaccagg 4441 ctggactaga ggacacggga gtttttgaag cagaggctga tgaccagctg tcgggagact 4501 gtgaaggaat tcctgccctg ggtgggacct tggtcctgtc cagttctcag cctgtatgat 4561 tcactctgct ggctactcct aaggctcccc acccgctttt agtgtgccct ttgaggcagt 4621 gcgcttctct cttccatctc tttctcagga ggagaccaag gcacaggaca ttctgggagc 4681 agtgaccctt ctgctggagg gagtgatggc agcacgggga caactgggac ccacttgcct 4741 ctcatccctc ctggggcagc tttctggaca ggtccgtctc ctccttgggg ccctgcagag 4801 cctccttgga acccaggtaa gtccccagtc aagggatctg tagaaactgt tcttttctga 4861 ctcagtcccc ctagaagacc tgagggaaga agggctcttc cagggagctc aagggcagaa 4921 gagctgatct actaagagtg ctccctgcca gccacaatgc ctgggtactg gcatcctgtc 4981 tttcctactt agacaaggga ggcctgagat ctggccctgg tgtttggcct caggaccatc 5041 ctctgccctc agcttcctcc acagggcagg accacagctc acaaggatcc caatgccatc 5101 ttcctgagct tccaacacct gctccgagga aaggtgcgtt tcctgatgct tgtaggaggg 5161 tccaccctct gcgtcaggcg ggccccaccc accacagctg tccccagcag aacctctcta 5221 gtcctcacac tgaacgagct cccaaacagg acttctggat tgttggagac aaacttcact 5281 gcctcagcca gaactactgg ctctgggctt ctgaagtggc agcagggatt cagagccaag 5341 attcctggtc tgctgaacca aacctccagg tccctggacc aaatccccgg atacctgaac 5401 aggatacacg aactcttgaa tggaactcgt ggactctttc ctggaccctc acgcaggacc 5461 ctaggagccc cggacatttc ctcaggaaca tcagacacag gctccctgcc acccaacctc 5521 cagcctggat attctccttc cccaacccat cctcctactg gacagtatac gctcttccct 5581 cttccaccca ccttgcccac ccctgtggtc cagctccacc ccctgcttcc tgacccttct 5641 gctccaacgc ccacccctac cagccctctt ctaaacacat cctacaccca ctcccagaat 5701 ctgtctcagg aagggtaagg ttctcagaca ctgccgacat cagcattgtc tcgtgtacag 5761 ctcccttccc tgcagggcgc ccctgggaga caactggaca agatttccta ctttctcctg 5821 aaacccaaag ccctggtaaa agggatacac aggactgaaa agggaatcat ttttcactgt 5881 acattataaa ccttcagaag ctattttttt aagctatcag caatactcat cagagcagct 
5941 agctctttgg tctattttct gcagaaattt gcaactcact gattctcaac atgctctttt 6001 tctgtgataa ctctgcaaag acctgggctg gcctggcagt tgaacagagg gagagactaa 6061 ccttgagtca gaaaacagag gaagggtaat ttcctttgct tcaaattcaa ggccttccaa 6121 cgcccccatc ccctttacta tcattctcag tgggactctg atc // LOCUS HUMTHY1A 2806 bp DNA PRI 14-JAN-1995 DEFINITION Human Thy-1 glycoprotein gene, complete cds. ACCESSION M11749 NID g339682 KEYWORDS Thy-1 glycoprotein. SOURCE Human B-lymphoblastoid cell line LG2 DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2806) AUTHORS Seki,T., Spurr,N., Obata,F., Goyert,S., Goodfellow,P. and Silver,J. TITLE The human Thy-1 gene: structure and chromosomal location JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (19), 6657-6661 (1985) MEDLINE 86016759 FEATURES Location/Qualifiers source 1..2806 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q22.3-q23" gene 27..63 /gene="THY1" exon <27..63 /gene="THY1" /note="Thy-1; G00-119-614" CDS join(27..63,547..882,1410..1522) /note="Thy-1" /codon_start=1 /db_xref="PID:g339683" /translation="MNLAISIALLLTVLQVSRGQKVTSLTACLVDQSLRLDCRHENTS SSPIQYEFSLTRETKKHVLFGTVGVPEHTYRSRTNFTSKYHMKVLYLSAFTSKDEGTY TCALHHSGHSPPISSQNVTVLRDKLVKCEGISLLAQNTSWLLLLLLSLSLLQATDFMS L" intron 64..546 /note="intron A" exon 547..882 /number=2 intron 883..1409 /note="intron B" exon 1410..>1522 /note="Thy-1" /number=3 BASE COUNT 612 a 851 c 754 g 589 t ORIGIN Chromosome 11q22.3. 
1 ggatccagga ctgagatccc agaaccatga acctggccat cagcatcgct ctcctgctaa 61 caggtacccg gcatggggca ggactggggc tccaggcgcc ctggcttcct tccctccaga 121 gaagcagctt ctccctcaca gtctcagaaa agcgcaggtg acaaagagag ggctcttttt 181 catcctgaag tcagccgatc caccgcgctg atattctgac ggcctgaggt ggtttttgga 241 aacacagttt gctgagccct ccttcacact attgaactag aatccccaac tgagaaccca 301 ggaaccagca tcaactccct aagatctcct gtccttgaaa cacattgata ggatccaagg 361 ctcaagcaga gtggggaggg aggctggggt ctgcaaagga gaagtgggat ccctggggtg 421 gggaaaggca ctcagagagc agaccccggt cccctcccta gccaggccca tctctccact 481 tcaggtgggt gggaggcccc tgtgccgcag gcccctccag tttgaaggag gcactgctgg 541 tgccagtctt gcaggtctcc cgagggcaga aggtgaccag cctaacggcc tgcctagtgg 601 accagagcct tcgtctggac tgccgccatg agaataccag cagttcaccc atccagtacg 661 agttcagcct gacccgtgag acaaagaagc acgtgctctt tggcactgtg ggggtgcctg 721 agcacacata ccgctcccga accaacttca ccagcaaata ccacatgaag gtcctctact 781 tatccgcctt cactagcaag gacgagggca cctacacgtg tgcactccac cactctggcc 841 attccccacc catctcctcc cagaacgtca cagtgctcag aggtgagaca agcccctaac 901 aaggtcaagt gagctgggag agccaggctc ggggacagca ggcagttccc ttggctggac 961 tagagaggag aatagcccca taacgctctc accctctccc aactgctgcc tggtcaactg 1021 gggaaccatt gccttcggtg tgaatggggt gaagagctca gggccagaca ggcagagcag 1081 tgtggttcca ccagaactgt gggcaaggcc tttggcccct aatcttcctt ctcccagcgg 1141 gaaacaggga tgacaccacc tccctcagcc agttttcttg tcatgatgtt tagtaaggtt 1201 ttcataagat gatatgtgtg caagagatca gtaatctgca aatgggaaag atggctggtt 1261 ctgtgagacc aggctgttcc tggtcccagc taagacattg cagtacccac ctcccaaagg 1321 gagtacaccc ttgctttggg cctgtgcctg cctgagtcct gatccgtctt ccttcctacc 1381 ctgcccccgg cccccttctc tttctgcaga caaactggtc aagtgtgagg gcatcagcct 1441 gctggctcag aacacctcgt ggctgctgct gctcctgctg tccctctccc tcctccaggc 1501 cacggatttc atgtccctgt gactggtggg gcccatggag gagacaggaa gcctcaagtt 1561 ccagtgcaga gatcctactt ctctgagtca gctgaccccc tccccccaat ccctcaaacc 1621 ttgaggagaa gtggggaccc cacccctcat caggagttcc agtgctgcat gcgattatct 1681 acccacgtcc acgcggccac 
ctcaccctct ccgcacacct ctggctgtct ttttgtactt 1741 tttgttccag agctgcttct gtctggttta tttaggtttt atccttcctt ttctttgaga 1801 gttcgtgaag agggaagcca ggattgggga cctgatggag agtgagagca tgtgaggggt 1861 agtgggatgg tggggtacca gccactggag gggtcatcct tgcccatcgg gaccagaaac 1921 ctgggagaga cttggatgag gagtggttgg gctgtgctgg gcctagcacg gacatggtct 1981 gtcctgacag cactcctcgg caggcatggc tggtgcctga agaccccaga tgtgagggca 2041 ccaccaagaa tttgtggcct accttgtgag ggagagaact gaggatctcc agcattctca 2101 gccacaacca aaaaaaaata aaaagggcag ccctccttac cactgtggaa gtccctcaga 2161 ggccttgggg catgacccag tgaagatgca ggtttgacca ggaaagcagc gctagtggag 2221 ggttggagaa ggaggtaaag gatgagggtt catcatccct ccctgcctaa ggaagctaaa 2281 agcatggccc tgctgcccct ccctgcctcc acccacagtg gagagggcta caaaggagga 2341 caagaccctc tcaggctgtc ccaagctccc aagagcttcc agagctctga cccacagcct 2401 ccaagtcagg tggggtggag tcccagagct gcacagggtt tggcccaagt ttctaaggga 2461 ggcacttcct cccctcgccc atcagtgcca gcccctgctg gctggtgcct gagcccctca 2521 gacagccccc tgccccgcag gcctgccttc tcagggactt ctgcggggcc tgaggcaagc 2581 catggagtga gacccaggag ccggacactt ctcaggaaat ggcttttccc aacccccagc 2641 ccccacccgg tggttcttcc tgttctgtga ctgtgtatag tgccaccaca gcttatggca 2701 tctcattgag gacaaagaaa actgcacaat aaaaccaagc ctctggaatc tgtcctcgtg 2761 tccacctggc cttcgctcct ccagcagtgc ctgcctgccc ccgctt // LOCUS HUMTKRA 13500 bp DNA PRI 14-JAN-1995 DEFINITION Human thymidine kinase gene, complete cds, with clustered Alu repeats in the introns. ACCESSION M15205 M15206 NID g339718 KEYWORDS Alu repeat; repeat region; thymidine kinase. SOURCE Human DNA (library of Y.-F.Lau), clone lambda-tk46 [1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13500) AUTHORS Flemington,E., Bradshaw,H.D. Jr., Traina-Dorge,V., Slagel,V. and Deininger,P.L. 
TITLE Sequence, structure and promoter characterization of the human thymidine kinase gene JOURNAL Gene 52 (2-3), 267-277 (1987) MEDLINE 87277399 REFERENCE 2 (sites) AUTHORS Slagel,V., Flemington,E., Traina-Dorge,V., Bradshaw,H. and Deininger,P. TITLE Clustering and subfamily relationships of the Alu family in the human genome JOURNAL Mol. Biol. Evol. 4 (1), 19-29 (1987) MEDLINE 88188974 REFERENCE 3 (sites) AUTHORS Barik,S. and Banerjee,A.K. TITLE Cloning and expression of the vesicular stomatitis virus phosphoprotein gene in Escherichia coli: analysis of phosphorylation status versus transcriptional activity JOURNAL J. Virol. 65 (4), 1719-1726 (1991) MEDLINE 91162717 REFERENCE 4 (sites) AUTHORS Kim,Y.K. and Lee,A.S. TITLE Identification of a protein-binding site in the promoter of the human thymidine kinase gene required for the G1-S-regulated transcription JOURNAL J. Biol. Chem. 267 (4), 2723-2727 (1992) MEDLINE 92129365 COMMENT [2] sites; Alu repeats only. [1] exons, intron/exon boundaries, 5', 3' flanks. Draft entry and computer-readable sequence for [2],[1] kindly provided by P.L.Deininger, 07-APR-1987. 
FEATURES Location/Qualifiers source 1..13500 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q23.2-q25.3" prim_transcript 451..13417 /note="TK mRNA (alt.)" prim_transcript 458..13417 /note="TK mRNA (alt.)" exon <520..585 /gene="TK1" /note="thymidine kinase; G00-120-439" /number=1 gene 520..585 /gene="TK1" CDS join(520..585,696..727,2360..2470,4845..4938,11901..11990, 12348..12467,12567..12758) /note="thymidine kinase" /codon_start=1 /db_xref="PID:g339719" /translation="MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQ IAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFF PDIMEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREA AYTKRLGTEKEVEVIGGADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAA RKLFAPQQILQCSPAN" intron 586..695 /note="TK cds intron A" exon 696..727 /number=2 intron 728..2359 /note="TK cds intron B" repeat_region complement(1268..1546) /note="Alu-tkA repeat" repeat_region 1697..1994 /note="Alu-tkB repeat" exon 2360..2470 /number=3 intron 2471..4844 /note="TK cds intron C" repeat_region complement(3813..4109) /note="Alu-tkC repeat" repeat_region 4393..4661 /note="Alu-tkD repeat" exon 4845..4938 /number=4 intron 4939..11900 /note="TK cds intron D" repeat_region complement(5141..5434) /note="Alu-tkE repeat" repeat_region 5471..5765 /note="Alu-tkF repeat" repeat_region complement(5962..6259) /note="Alu-tkG repeat" repeat_region complement(6263..6552) /note="Alu-tkH repeat" repeat_region complement(7434..7731) /note="Alu-tkI repeat" repeat_region 8658..8879 /note="Alu-tkJ repeat" repeat_region 9350..9646 /note="Alu-tkK repeat" repeat_region 9646..9939 /note="Alu-tkL repeat" repeat_region 10327..10397 /note="Alu-tkM repeat" exon 11901..11990 /number=5 intron 11991..12347 /note="TK cds intron E" exon 12348..12467 /number=6 intron 12468..12566 /note="TK cds intron F" exon 12567..>12758 /note="thymidine kinase" /number=7 BASE COUNT 2842 a 3614 c 3578 g 3463 t 3 others ORIGIN Chromosome 17q23.2-q25.3; 1 bp upstream of HindIII site. 
1 aagcttcctt cttggaattc caaactaata aatgagctaa ctccgcccca gccccttagt 61 ccctccctgc aatccaccta cctctgcaga catcttcttc caaggaacct tgcttgggaa 121 acccacacca gacacatcca tcatggcgtc tacagccgca tgggcgtgcg tccctctgtt 181 tatatggcca gagccccgcc tcgctccgcc cctttaaact tggtgggcgg accgaggcgg 241 ggctcagacc aggccccacc ccgatcagcc acgtccatcg ccctgatttc caggccctcc 301 cagtccctgg gcgcacgtcc cggattcctc ccacgagggg gcgggctgcg gccaaatctc 361 ccgccaggtc agcggccggg cgctgattgg ccccatggcg gcggggccgg ctcgtgattg 421 gccagcacgc cgtggtttaa agcggtcggc gcgggaccag gggcttactg cgggacggcc 481 ttggagagta ctcgggttcg tgaacttccc ggaggcgcaa tgagctgcat taacctgccc 541 actgtgctgc ccggctcccc cagcaagacc cgggggcaga tccaggtgcg ggggccagcc 601 ctgcgcgtgg ctggggatga ggtggtcgtg gtgatagcct gtgtccaggc atccgcgcag 661 ggcgggccct caaatgacct caccttctct cctaggtgat tctcgggccg atgttctcag 721 gaaaaaggta atggcttcgc ggggctgggg tggagctcct tcctcttctc cggggacccc 781 ttgtccctcc cctcccctcc cctcccctcc cctcccctcc cctccccttc cctccccttc 841 ccttccctcc ccttcccttc ccctagaagg accagcacag cctcctacag ctcccgcccg 901 gggtgctcct cccttgaatt cagtccagga ggaagtctct gccctcttct gcccaggcca 961 agcccctcgt cctgtgtgga cgccactccc tcctggagct ggtgacagct gcttacagct 1021 tagctgtctt ccccaccaag tcctctgaga aggtggcaac cagttgtgtc ccctgtaggc 1081 caggcctttt tgtacacccc tattcaatgt ggctgtttcc ttctaaggcc aaggaaacgt 1141 agtcgctttc taaaccaagg agtctgaagc cgtggagcct ctgctctcct gaggtgatag 1201 aaccattccc tgacccgggt ggggctagtg agtttcttga gtaaactacc cacgcaccat 1261 tctttttgtt ttgtttttgt tcttctagag gtaggatctt gctatgttgc ccaggctggt 1321 ctcaaactcc tgggctcaag caattctctc acctcagcct cccaagtagc tgggactaca 1381 ggcgtgcacc ccccccgcct ccacccagct aattttattt tatttttata gagctggggt 1441 cttgctatgt tgcccaagct ggtcttgaac tcctggtctc aagcaatcct cctacttcag 1501 catcccaaag tgctgggatt acagatgtta gccaccatgc cctgccccaa cattctttta 1561 tggccctggg gatcacttca gctcaaaccc cttgctcagg aagatgtggc tcagagttgg 1621 acttcttgga cccagaagca agtgcttttg acgctgcaca caaagacttt ctgaaattaa 1681 tttagaaaag ctgtatgcca 
ggtgtggtgg cccacgcctt taatcccagc gctttggaag 1741 gctgaggtgc gttgatcact tgaggttagg agtttgagac caccctggtc aacgtggtga 1801 aaccccatct ctactgaaaa aaaaaaccaa aaattatctg ggcatggtgg cagcctcctg 1861 taatcccagc tactcgggag gttgaggcag gagaatctct tgaacccgga aggcaggggt 1921 tgcagtgagc tgagatcgct ccactgcact ctaacctagg caacagagcg agactccacc 1981 ccaaaaagaa agaaagaaaa actctgaact ctgggaacaa ctctgggatg aggttacttt 2041 ggaatgcagt cgcaggttcc ctctacatgt agcctttgct tctgccttcc ccactacatc 2101 ttggagaagg ttactcctcc cacacttcct gggaccacct gagtaccatt cctggacctc 2161 ttccccatag agaattctga cttccaaccc tctttgtagg gatattatac cctgcctgct 2221 ctgccctgct cttttctggc tgtggtgggc tcagtctgca taccactagg gacaatgagg 2281 agccaggctt gttggggagg ggtctccttc tcccactcct cccgccgtgg acctcacctg 2341 accctctctc ctcttgcagc acagagttga tgagacgcgt ccgtcgcttc cagattgctc 2401 agtacaagtg cctggtgatc aagtatgcca aagacactcg ctacagcagc agcttctgca 2461 cacatgaccg gtcagtccct gccccctgca gtcctgtcca gtggaaaatc acaaggcaca 2521 ggacacactg ttaggactct ctttaatggg gatggttaat catttgaaca ttgaatgatt 2581 caaatcagca cactttccaa ggtgcttggc aaggtagcgc acactctcca ctccctgggc 2641 tggagccagt ggttctccac tgagggtgat tttgccgcca gggtccattt gacaatgttt 2701 gaagacattt ctagttgttg caactggagg ggggagggga tgcttttggg ctttaatgtg 2761 tagaaatcag ggacactgct gctaagggtc ctatggtgca gaggacggcc cccatgcaag 2821 aacgagctgg ccccaaatgt caggagcctg ccagtgttca gaaactctgc cgtagggttt 2881 cagcttcaca caggctgcag actggtttgg tttggcctgc acgttgattt ttgtttaatt 2941 ttttagttgt ccgttgttgg ctggctcccc cgtcacctgg cagccttcac gcttccctgt 3001 tttatgtgta gctgtttgag ctcgctggac atttccgcct gcaacctcag tttgggagtt 3061 aaattcactt ccttggcagc agatgtgggc ccgatgtttc tgagcctgag acgctttgct 3121 tggtcctctg gacttgtcca cctgggcacc cagtggcaaa gccatgctgt gccacacatt 3181 atagggcttc agcctcagag ccctggctgg gagctgtatc cgagagttgc tatggctgtg 3241 cagagaacag atccacccgg cgtgtggcct tcggtgggag ctgaggggct cctgaagcca 3301 gatgctggtg gagtggaggg tgcttggggc ttggagttgc atgtgggaat ttaaccgcac 3361 cttcgtgacc atgctgtctg atgtaggtca 
tttacttttc caaatttgct tcctcattcc 3421 taagatgcga tgtccacggc acagggtggt gttacacctg gtggggacag ggaaagcaga 3481 ggaggtcact tcgttccagc tgttggaagt acaacttctg gagtcagtca gatccgggat 3541 taaatatgag ttctgcccgt gtgtcacaag tcatctctaa cacgggccac agaggccaag 3601 gctgggccag cagcattgat ggctcgagag gctgcccttg caggggccac agctggcctc 3661 ccacctgccc tcactttgtc tttctctgtt tagggaggga agagggaatt taaaatgccc 3721 aaaatactgt ttcacacatt ctttccagaa ctcgaagtag gattatagca aggtaataac 3781 gaaacaatag ttgtaaagta tgtttttttg tttgtttgtt gtttgttttt gggacagggt 3841 ctctctctgt cacccaggct ggagtgcagt ggctcaatca tagcttactg ttacgtgacc 3901 ccaaaccctt gggctcaagt gatcgtccca cctcagcccc ctgagcaggt gggactacag 3961 gcgcacacca ccacacccag ttaattttta catttttttc acacagtgtc tcgctgtgtt 4021 acccaggctg gtctcgaact cctgagttca agtgatcctc ccgtcttggc ctccccaaag 4081 attacgggca tgagctgctg tgtctggcca gaatacagga ttttaaaaat ttatgttttg 4141 caacataatt aatataaaga caaatataac ccaggcccag ttctagttat tcattcttct 4201 gaattttaaa aggaaacatt tggctggccc ctaatggtat catgggccct ggtacctgat 4261 gaagttggcc tagtctgccc ccagctcctg aacagtggaa gagtttttag tctcattgag 4321 ctttgtactg gacattacta atttctaatc caaagcatca agtgaagtgg cttgtataaa 4381 taactggttt tcctctggga ggctaaggcg ggtggatcac ttaaaagtta ggagtctgag 4441 accagcctgg ccaacatggt gaaaccccat gtctgctaaa aatacaaaaa ttagctgggt 4501 gtgatggtgt gtggccagta gtcccagcta ctcttgtggc tgaggtggga gaatcgcttg 4561 agacccttga gaattgggag gtagagattg cagggagccg agatggcgcc actgcactcc 4621 agcctgggtg acagagcaag actctgtttc ataaaaaata aataaataac tggttttctg 4681 gacgagggcc tttcccatag gtgctaactt ctcaaagccc ggctgggtga acactgagcc 4741 tgctttgcag gtagcaggtg gtcacgacag tgccattccc tggcccctgc attgtggctt 4801 ctggcctccc tggccctgct cacgctctgg ctttctcttc ccaggaacac catggaggcg 4861 ctgcccgcct gcctgctccg agacgtggcc caggaggccc tgggcgtggc tgtcataggc 4921 atcgacgagg ggcagtttgt aagttggctt gtcttggcat cactcttcct gccttccgct 4981 gtgtcctccc gttttccctc gctgacttgg aagttatctg anncttttag taaaataaca 5041 aggttaaata gctacaacta gtgttggaat accctctgaa 
ggcccctttc tagtttccct 5101 gtcatagtgt catagtcttg taggattcgt tttacttttt tttttttttt ttttgagacg 5161 gagttttgct cttgttgccc aggccggagt acgatggcac aatctcaccg caaactttgc 5221 ttcctgggtt caagcaattc tctcctgtct cagcctcccg agtagctggg attacaggca 5281 tgcgccacca cgcccagcta attttatatt tttagtagag atggggtttc tccatgttgg 5341 tcaagctggt ctcaaactcc caacctcagg tgatccgccc cgccttgaac tcccaaagcg 5401 ctgggattac aggcatgagc taccacacct ggccattgta cctttttaaa aatacatata 5461 tctatttact ggcaagatgc agtgactcac acctgtaatc tcagcctgtg ggaggccaag 5521 gtggacagat cacttgagcc caggagttgg agactcacct gggcaacata gtaaaacccc 5581 atctctacca aaaaaaaaaa gaaattagcc agtcatagca gcgcacacct gtggtccctg 5641 ctactcagga ggctgaggca gaaggatgga gcctgggagg tcgaggctgc agtgagtggt 5701 gatagcacca ctgcactcca gcccgggcga caaggccaga ccctgtctca aaaaaaaaag 5761 ggggaggtgg ggagtaatgt ttggtttgcc tcatggttcc ttttgcttgt ttcttatacg 5821 tttattttct tgttgttgaa gtaccttttt tagtagtttt tgcagccagg aggtatagat 5881 gggaagctgc cagtctttgt atggaaatct ttcttttgtc atctagttta agctgggcag 5941 caagaggtag gttgatcttg tgtgggtttg ggtttttttt tttttttgag acggagtctt 6001 actctgtcgc ccaggctgga gtgcaatggt gtgatctcgg ctcactgcaa cctctgccac 6061 ccggattcaa gcgattttcc cacctcgcct cccaagtagg tgggattaca ggcacccacc 6121 atcatgcctg gctaattttt gtagagacaa gggttcacca tgttggctag gctggtcttg 6181 aactcctgac ctcaggtgat ccacccgcct tggcttccca aagtgttgga attacaggca 6241 tgagccgccg tgcccggcct tttttatttt tatttttttt gagatggagt cttgctctgt 6301 tgccctggct ggagtggagt gacgtgatct tagctcacag caacctccgc cttttgggtt 6361 caagcagttc tgcctcatcc ttccgggtag ctgggatcac aggtgcgtgc cacatgcgta 6421 mtcatttatg tatttttaat agagatgggg tttcaccatg ttggccagct ggtctggaac 6481 tcctgacctc aggtgatccg catgcctcag ctcccaaagt gctgggatta caggcgtgaa 6541 ccacgcctgg tcttgatctt gttgctttga aaagtagcag cgctggtcat tgtgtttttg 6601 ctcagaggaa ggccgccatc tctctaatgt tacctctggt caggtattct atctgttctc 6661 tctcagcaca atgtgtgtag gggaagcttt gtttcattta tcctgcttta tagctggtgt 6721 gccttttcat ttctggggaa ggaatgaagc cattatcact tcaggtattt 
ctctcctcat 6781 ccatctctga ggtgttctgg gttccatctt ccagagtgtg ttttgtttca gtgactattt 6841 ttacatctgc tgctctaatt catcatgctc cgttttgttt gacaagttac tgttgggtta 6901 tttttaaatt tatgctgttc cttccattat gttcctgaaa atcttttctt agacttttcc 6961 agatttttct atttcctcag gaacatattc tgtggttgag tttctgggtt attttctgtt 7021 atcttagttt tctttcctct gctttggaga ttttattttt gttagtttat cacaaagaat 7081 gaaactgaaa ctctctccaa ggggtttagc agacttgacc tcttaggtac ttttagggtt 7141 gcctcgaagt acacaatgtg gtggtttgat ataaacataa caggaattta tttctcgctc 7201 acagaccccc tacgtggttc caggccggtt gatggggagg ccgcccacga ggcggcttag 7261 gtcgccctgg ctggctgtat acagacacgg aggggaagag acgtggcgga gcccctgggt 7321 gtgaggtttt catgggcctg accagaagct gcaaacgtca cttctgctga tctttcaaag 7381 actagaacct gggcacaggg ccacctatac gtttagtata cttagtccag ttcgtttttt 7441 gtttgttttt aaaaacagtc ttgctctgtg gcccaggctg gagtgcagtg gcgcagtctc 7501 ggctcactat aacctccatg tcccaggttc aagtgattct cccgcctcag cctcctgagt 7561 agctgggatt acaggcttct gccaccatgc ccagctaacc ttttgtattt ttagtagaga 7621 cggggtttca tcatgttgac cgggctggtc tggaactcct aacctcaggt gatctgcctg 7681 cctcagcctc ccaaagtgct gggattacag cgtgagccac cacgcctggc cacacttagt 7741 ctagttctat accctggagg aagaataaat gagtttgttt ggtgagtgct tcaaggtctc 7801 tacccgccct gcctcccagc acagagccag gccgctctgg cctgaatacc ctgcccggac 7861 gtcacagggc ctgtcccctc aaaaggccag tcctgccttc ctggttctgt tcttgcccaa 7921 cattctgtat gagtcacagc tgcaaattcc attcccgtgg ggaggctgac gggtcccttc 7981 ccctgtgcgg ggcatctgcc ctgtggagtt gaggctgcca gtgtccgctc tgggttcccg 8041 accacccggc agctggcatc tcctccccgc ttgggtatgg ccattccgtt tctgaccttc 8101 agaggtgcgc ccctgagcac ccccatgcct ctgcgtacgt ggagacgtcg ttgttgctgc 8161 cccgtgcttg agggactcct ggcgagaaag tgagcccagg ctgggaatag ggctgcagct 8221 gttctctttt gctcccaaac tgtggcctca gaatgcatcc agggattttg catcagcttt 8281 ggggacatgg ccctctcaga acaaggaagc ttcagctttg gcaaggctct ccctccttca 8341 gacctgccgc tgtgagttgt tcaatagctc tgttctcctg gctctgcgta aaccttgttg 8401 acagaggctg acccagaccc ccgaggcaga aacctttccc ttctccttcc tcgacatcca 
8461 aatgccctga gtcaggagcc agcgtatgaa gtcctgtccc ctgttcagcc tgtaggaggg 8521 atttctcggt ctacttcctc cctggccagc aagtaaaact tgagttcatt cagtgagtat 8581 ttattacacc ctacccagac atcagcattc tgccctggcc tctgtgtgcc cttgttctct 8641 tcaagaagtt ccgggtcacc agcctgacca acatggagaa actccgtctc tactaaaaat 8701 acaaaaatta gccgggcgtg gtggcgcact gcctgtaatc ccagctactt gggaggctga 8761 ggcaggagaa tcgcttgaac ccggtaggcg aaggttgcag tgagccaaga tcgccccatt 8821 gcactccaag cctgggcaac aacaagagca aaactcagtc tcaaaacaaa acaaaacaaa 8881 agaagttcag ggtcttccca ttgcaagcag ttctagatcg aggagagggg ttcctagcat 8941 gggacccagc agaaggactg tccttcgctc cttcattgtc tacgtggaca gtggatgaag 9001 ctcagccgaa cctgccttgt tcccgttttc tgggtcagca gggaaagcct ttcacagagt 9061 agccaccgtg ccatcctgag gaaggccctg ggtcagaagc ttctgtgctt ctttgtaccc 9121 cgggcaagac acacaggtgc tcacactgct ctgtagaaac tgttggcatc caagagagac 9181 tcacctggaa atctctggaa aacctgaagc tcctagctgg gggtgctgtg cttcagatgc 9241 tggtggtggg tgggcaccct tgcatcaaca gctgcacagt gtgtggtggg cttgcagggt 9301 cgcttggcaa tagtaggagc tctgatttat ttttttaaac tttttttctg gctgggcagg 9361 tggctcacac ctgtaatccc agcactttgg aaggcctagg cgggcggatc acttgaggtc 9421 aggagtttga gaccagccag gccaacatgg tgaaacccca tctctactaa aaatacaaaa 9481 attagccaag cgtggtggca cacacctgta attccagcta cttgggaggc agaggcacaa 9541 gaattgcttg aacctgggag gcagaggttg cagtgagcca agattatgcc actgcactcc 9601 agcctggatg acagagcgag actctgtctc aaaaaaaata gacaaagcca ggcgcagtgg 9661 ctcatgcctg taatcccaac actttgggag gccgaggtgg gtgaatcacg aggtcaggag 9721 atcgagacca tcctggctaa cacggtgaaa ccccgtctct actgaaaata caaaaaaatt 9781 agccaggcgt ggtggtgggc acctgtagtc tcagctactc gggaggctga ggcaggagag 9841 tggcgtgaac ccaggaggcg gagcttgcag tgagctgaga tcacgccact gcactccagc 9901 ctgggcgaca gagcgagact ccgtctcaaa aaaaaaaaaa aaatagacct ttttgtgttt 9961 tctgttctac tacacaagta atacaggttg agtattcctt aacctaaatg cctgggacca 10021 gaagtgtttc ggatttcagg ttttcgaata tttgcatgtt cataatataa tgagaccttg 10081 ggaatgagcc ccaagtgtaa acacaaaatc catttatgtt ttatagacat cttaggcaca 10141 
tagcctgaga gtaattttat gtatttagta atttgggcgt gagccacagt ttttgactgt 10201 gacctgtccc atgaggtcag gtgtggaatt ttccacttgt ggtgggcgct caaaaagttt 10261 cagattttgg agcctttcag gttagagaca tgcaatctat aataagttta atctaggaaa 10321 agttagggtc tggcacagag gctcacgtct gtgatcccag cactttggga ggctgaggca 10381 ggcagatcac tggaagtgct ggacgggtgg ggaagtgccg ggtgcaagaa ccaagctctt 10441 tgactatgga cctcagcctg aggttggtca agaggtggag tgagtggggg ctgaggacct 10501 tcatcctgaa accctgatgc aggagagtct ggggtctgcc ttctaccctc atgtggcggg 10561 tgaaggagca aggttctcaa ctcaggaggg ttcttcccct ctccattccc acccagggga 10621 catctcacaa caactagaaa caattttgtc gcagctgggg ggtgggaggt gtgttcctgg 10681 catctatcta atgggtgggg gcgagggacg cagcccaaca ccctacagtg cacaggacac 10741 agcgagatcc ggcctcaaac tggcagccat ggcagcgtca gccctccagg gggcgcgccc 10801 tggcgcaggt ggtgtgccgg cccacagctc cttgcaggct gggagctgca ttttcgtgac 10861 atgtcatgag tcctcagaga aaaagaggga acgagtgcat ggtggggagg ggccctggcg 10921 tgctggagtc tctgggtttc cttctccaga gacccctgca gtcagctgag cgcaatcagt 10981 cacgttgggc tttgcttgga tctcactgga atttttcgag ccacccctta gtcctcacct 11041 tgctaagccc tcacgtctca ataacctcaa acctcagtac ctgggctgag aaagcctgag 11101 tggccctggg agagagaccc tgcacccaag gacaaggaca tccctgcttc acccaaccca 11161 aaggccagtc tggacatatg aactcaacca gctaagagtg atatgattga ttgatgagaa 11221 tcaccagagc acttgccaga gtttcagctt ctccctgggc caaagtgaag tttgctttac 11281 acagtaaatg tgctctgtgc aggtcctgaa tttagaaggc tgtgctgtgt catcctgctc 11341 tgtaaatggc cagtaggacc cccgcccctt ctcaaggcac attacccgtt taaaacgggg 11401 gaggcaagag cacaaagcgc ccacctattc accgaagagc atgtatataa cttagggcct 11461 tccatcctta aacaacagga ccttccttgc tcttacggaa aaggaaacag gttcagagac 11521 gttaattcat tgccaaggtc acacagataa tgggtccagc gaagagtggt gtccgagccc 11581 aaggcagcag gcctttggcc actgcagtgt taaacagcac agctggtgtg gaagtccggt 11641 gctgagtcct gggtacctgg actcggaggg aagctggctg cagggggaag gggctgcgca 11701 gttgtggatg tacctgtcgt ctgctggggg gcgtgcgggt ggacacagtc ccccggcctg 11761 gggagcctcg tgggagaatt aagagttact ccgggccaaa tggccggagt 
tgtcagatct 11821 ggcagcgtct tcgctggggc tccagggagc tgctgctggg gtggaagctc tcacactctt 11881 tctccacgtg ccctttccag ttccctgaca tcatggagtt ctgcgaggcc atggccaacg 11941 ccgggaagac cgtaattgtg gctgcactgg atgggacctt ccagaggaag gtaaggcgtc 12001 tgatccaggt ctggagctgg gattgaggag ggcaagaggc ttctggatgg gcacagagac 12061 accagctctg ggtgaccagg gctcagccac cacagggtta cggccgagct gctcaggctt 12121 ggctgagcca agggactcca tggtctgtgc agactgcgtg ccatctgttg tggcaggtgc 12181 tttgaattgg caaagggaca gagccgggca tggtgctctg ggggttgggg gaaggactaa 12241 ggtcagagca aactctcctg gcttcagtac ttgtgaatca gagggtttaa aagaaaaacc 12301 cacctggtaa ggtgctgagc gccctctgtc tttccatggg agcacagcca tttggggcca 12361 tcctgaacct ggtgccgctg gccgagagcg tggtgaagct gacggcggtg tgcatggagt 12421 gcttccggga agccgcctat accaagaggc tcggcacaga gaaggaggta gctccacctg 12481 ccttccctgc aggccggcgg ggtgggggta tggctctgcc tccttcctgt cctggccctt 12541 cacccatccc ctgtccctgc ggccaggtcg aggtgattgg gggagcagac aagtaccact 12601 ccgtgtgtcg gctctgctac ttcaagaagg cctcaggcca gcctgccggg ccggacaaca 12661 aagagaactg cccagtgcca ggaaagccag gggaagccgt ggctgccagg aagctctttg 12721 ccccacagca gattctgcaa tgcagccctg ccaactgagg gacctgcaag ggccgcccgc 12781 tcccttcctg ccactgccgc ctactggacg ctgccctgca tgctgcccag ccactccagg 12841 aggaagtcgg gaggcgtgga gggtgaccac accttggcct tctgggaact ctcctttgtg 12901 tggctgcccc acctgccgca tgctccctcc tctcctaccc actggtctgc ttaaagcttc 12961 cctctcagct gctgggacga tcgcccaggc tggagctggc cccgcttggt ggcctgggat 13021 ctggcacact ccctctcctt ggggtgaggg acagagcccc acgctgttga catcagcctg 13081 cttcttcccc tctgcggctt tcactgctga gtttctgttc tccctgggaa gcctgtgcca 13141 gcacctttga gccttggccc acactgaggc ttaggcctct ctgcctggga tgggctccca 13201 ccctcccctg aggatggcct ggattcacgc cctcttgttt ccttttgggc tcaaagccct 13261 tcctacctct ggtgatggtt tccacaggaa caacagcatc tttcaccaag atgggtggca 13321 ccaaccttgc tgggacttgg atcccagggg cttatctctt caagtgtgga gagggcaggg 13381 tccacgcctc tgctgtagct tatgaaatta actaattgaa aattcactgg ttggtggacg 13441 cacatttctc tttcacctgg gtttccctgg 
gtctcatgga cagctccaac ttgatttggg // LOCUS HUMTNFX 3103 bp DNA PRI 15-JUN-1990 DEFINITION Human tumor necrosis factor gene, complete cds. ACCESSION M26331 NID g339763 KEYWORDS tumor necrosis factor. SOURCE Human histiocytic lymphoma cell line U-937 DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3103) AUTHORS Marmenout,A., Fransen,L., Tavernier,J., Van Der Heyden,J., Tizard,R., Kawashima,E., Shaw,A., Johnson,M.-J., Semon,D., Mueller,R., Ruysschaert,M.-R., Van Vliet,A. and Fiers,W. TITLE Molecular cloning and expression of human tumor necrosis factor and comparison with mouse tumor necrosis factor JOURNAL Eur. J. Biochem. 152, 515-522 (1985) MEDLINE 86030296 FEATURES Location/Qualifiers source 1..3103 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 362..1403 /note="tumor necrosis factor signal peptide" CDS join(570..755,1362..1407,1595..1642,1944..2365) /note="tumor necrosis factor precursor" /codon_start=1 /db_xref="PID:g339764" /translation="MSTESMIRDVELAEEALPKKTGGPQGSRRCLFLSLFSFLIVAGA TTLFCLLHFGVIGPQREEFPRDLSLISPLAQAVRSSSRTPSDKPVAHVVANPQAEGQL QWLNRRANALLANGVELRDNQLVVPSEGLYLIYSQVLFKGQGCPSTHVLLTHTISRIA VSYQTKVNLLSAIKSPCQRETPEGAEAKPWYEPIYLGGVFQLEKGDRLSAEINRPDYL DFAESGQVYFGIIAL" exon <570..755 /note="tumor necrosis factor precursor" /number=1 sig_peptide 570..755 /note="tumor necrosis factor signal peptide" prim_transcript 570..2365 /note="tumor necrosis factor mRNA and introns" intron 756..1361 /note="TNF intron A" exon 1362..1407 /number=2 mat_peptide 1404..1407 /note="tumor necrosis factor protein" intron 1408..1594 /note="TNF intron B" mat_peptide 1595..1642 /note="tumor necrosis factor protein" exon 1595..1642 /number=3 intron 1643..1943 /note="TNF intron C" mat_peptide 1944..2362 /note="tumor necrosis factor protein" exon 1944..>2365 /note="tumor necrosis factor precursor" /number=4 BASE COUNT 784 a 781 c 880 g 658 t ORIGIN 263 bp 
upstream of ApaI site; chromosome 6p21.3. 1 atcctgtctg gaagttagaa ggaaacagac cacagacctg gtccccaaaa gaaatggagg 61 caataggttt tgaggggcat ggggacgggg ttcagcctcc agggtcctac acacaaatca 121 gtcagtggcc cagaagaccc ccctcggaat cggagcaggg aggatgggga gtgtgagggg 181 tatccttgat gcttgtgtgt ccccaacttt ccaaatcccc gcccccgcga tggagaagaa 241 accgagacag aaggtgcagg gcccactacc gcttcctcca gatgagctca tgggtttctc 301 caccaaggaa gttttccgct ggttgaatga ttctttcccc gccctcctct cgccccaggg 361 acatataaag gcagttgttg gcacacccag ccagcagacg ctccctcagc aaggacagca 421 gaggaccagc taagagggag agaagcaact acagaccccc cctgaaaaca accctcagac 481 gccacatccc ctgacaagct gccaggcagg ttctcttcct ctcacatact gacccacggc 541 tccaccctct ctcccctgga aaggacacca tgagcactga aagcatgatc cgggacgtgg 601 agctggccga ggaggcgctc cccaagaaga caggggggcc ccagggctcc aggcggtgct 661 tgttcctcag cctcttctcc ttcctgatcg tggcaggcgc caccacgctc ttctgcctgc 721 tgcactttgg agtgatcggc ccccagaggg aagaggtgag tgcctggcca gccttcatcc 781 actctcccac ccaaggggaa atggagacgc aagagaggga gagagatggg atgggtgaaa 841 gatgtgcgct gatagggagg gatggagaga aaaaaacgtg gagaaagacg gggatgcaga 901 aagagatgtg gcaagagatg gggaagagag agagagaaag atggagagac aggatgtctg 961 gcacatggaa ggtgctcact aagtgtgtat ggagtgaatg aatgaatgaa tgaatgaaca 1021 agcagatata taaataagat atggagacag atgtggggtg tgagaagaga gatgggggaa 1081 gaaacaagtg atatgaataa agatggtgag acagaaagag cgggaaatat gacagctaag 1141 gagagagatg ggggagataa ggagagaaga agatagggtg tctggcacac agaagacact 1201 cagggaaaga gctgttgaat gcctggaagg tgaatacaca gatgaatgga gagagaaaac 1261 cagacacctc agggctaaga gcgcaggcca gacaggcagc cagctgttcc tcctttaagg 1321 gtgactccct cgatgttaac cattctcctt ctccccaaca gttccccagg gacctctctc 1381 taatcagccc tctggcccag gcagtcagta agtgtctcca aacctctttc ctaattctgg 1441 gtttgggttt gggggtaggg ttagtaccgg tatggaagca gtgggggaaa tttaaagttt 1501 tggtcttggg ggaggatgga tggaggtgaa agtagggggg tattttctag gaagtttaag 1561 ggtctcagct ttttcttttc tctctcctct tcaggatcat cttctcgaac cccgagtgac 1621 aagcctgtag cccatgttgt aggtaagagc tctgaggatg tgtcttggaa 
cttggagggc 1681 taggatttgg ggattgaagc ccggctgatg gtaggcagaa cttggagaca atgtgagaag 1741 gactcgctga gctcaaggga agggtggagg aacagcacag gccttagtgg gatactcaga 1801 acgtcatggc caggtgggat gtgggatgac agacagagag gacaggaacc ggatgtgggg 1861 tgggcagagc tcgagggcca ggatgtggag agtgaaccga catggccaca ctgactctcc 1921 tctccctctc tccctccctc cagcaaaccc tcaagctgag gggcagctcc agtggctgaa 1981 ccgccgggcc aatgccctcc tggccaatgg cgtggagctg agagataacc agctggtggt 2041 gccatcagag ggcctgtacc tcatctactc ccaggtcctc ttcaagggcc aaggctgccc 2101 ctccacccat gtgctcctca cccacaccat cagccgcatc gccgtctcct accagaccaa 2161 ggtcaacctc ctctctgcca tcaagagccc ctgccagagg gagaccccag agggggctga 2221 ggccaagccc tggtatgagc ccatctatct gggaggggtc ttccagctgg agaagggtga 2281 ccgactcagc gctgagatca atcggcccga ctatctcgac tttgccgagt ctgggcaggt 2341 ctactttggg atcattgccc tgtgaggagg acgaacatcc aaccttccca aacgcctccc 2401 ctgccccaat ccctttatta ccccctcctt cagacaccct caacctcttc tggctcaaaa 2461 agagaattgg gggcttaggg tcggaaccca agcttagaac tttaagcaac aagaccacca 2521 cttcgaaacc tgggattcag gaatgtgtgg cctgcacagt gaagtgctgg caaccactaa 2581 gaattcaaac tggggcctcc agaactcact ggggcctaca gctttgatcc ctgacatctg 2641 gaatctggag accagggagc ctttggttct ggccagaatg ctgcaggact tgagaagacc 2701 tcacctagaa attgacacaa gtggacctta ggccttcctc tctccagatg tttccagact 2761 tccttgagac acggagccca gccctcccca tggagccagc tccctctatt tatgtttgca 2821 cttgtgatta tttattattt atttattatt tatttattta cagatgaatg tatttatttg 2881 ggagaccggg gtatcctggg ggacccaatg taggagctgc cttggctcag acatgttttc 2941 cgtgaaaacg gagctgaaca ataggctgtt cccatgtagc cccctggcct ctgtgccttc 3001 ttttgattat gttttttaaa atatttatct gattaagttg tctaaacaat gctgatttgg 3061 tgaccaactg tcactcattg ctgagcctct gctccccagg gga // LOCUS HUMTNP1 1748 bp DNA PRI 14-JAN-1995 DEFINITION Human transition protein 1 gene, complete cds. ACCESSION M59924 NID g339778 KEYWORDS transition protein 1. SOURCE Human leukocyte DNA. 
ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1748) AUTHORS Luerssen,H., Mattei,M.G., Schroter,M., Grzeschik,K.H., Adham,I.M. and Engel,W. TITLE Nucleotide sequence of the gene for human transition protein 1 and its chromosomal localization on chromosome 2 JOURNAL Genomics 8 (2), 324-330 (1990) MEDLINE 91065651 FEATURES Location/Qualifiers source 1..1748 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /map="2q35-q36" exon (1074.1076)..1243 /gene="TNP1" /note="G00-125-311" /number=1 gene join(1105..1243,1444..1472) /gene="TNP1" CDS join(1105..1243,1444..1472) /gene="TNP1" /codon_start=1 /db_xref="GDB:G00-125-311" /product="transition protein 1" /db_xref="PID:g339779" /translation="MSTSRKLKSHGMRRSKSRSPHKGVKRGGSKRKYRKGNLKSRKRG DDANRNYRSHL" polyA_signal 1659..1664 /gene="TNP1" /note="G00-125-311" BASE COUNT 487 a 429 c 386 g 446 t ORIGIN 1 tgctgaacca ttgatgttga cagtagcatc tccctttaca cagctcctcc agtcaataca 61 tcacccactt ggcaccctct aatctcacat gccatcctct taacattttt aaaggcttcc 121 caatgtactt gatcatggtt tgatctctgt ctctccctcc agcctcatcc tgtaccatgt 181 agggagtttc ctctgccctt ctccagccaa aatgactccg taagtactta ctcactgatt 241 atgtagacat ctgatgaatg gcttctccag tgggctgaga gttccatgag cacagtatct 301 acgtgtgttt atcctccact aattttctac cacctaaatg tatggcatat gctatgtgat 361 caatatctgt gactgaattt taaaaaatca ttaggttaat tgttctgaac cagagccatt 421 ttttccctca cttccccagc ctcccaaaga aacaggtggc aatgtttggg aatattttgg 481 gttgtcatag ctgccaggct tctctggtgt ccagtgggta atggccaggg atgttgccaa 541 acatcctagg gtgcacaaga cagcccactg aaagttaccc tgcccattgt gtcaacagtg 601 ccaagactga gaaatcctga cctagatgaa aaagaaggaa tatttctatt ttgatcatat 661 tgaattgggt tctgtacagt caagtcctta cagaaataaa tttattatct ccggatgctg 721 tgagagactt aaacagtcat gcatacacac acactcaatg actgttacat ttcctacagg 781 caggagaagc atatctgaat ggtctcttgt actcatccga tgcctggcac agggctggat 841 tcagtttctc aataacacct atatgggaaa 
gcctcactta acataaaaag gacctcaaaa 901 ctgagggcag cccttgagct gcacggcaaa aggaccacct tatttcacaa atgtgaccaa 961 agtgcagcaa aggtgagtga cagcacagta gagccccgcc cacctgctga gccatgcccc 1021 ttccctgtca caatgccaag gccttaaata cccagactcc tgcccccggg ccttgcaaag 1081 cccctcattt tggcagaact taccatgtcg accagccgca aattaaagag tcatggcatg 1141 aggaggagca agagccgatc tcctcacaag ggagtcaaga gaggtggcag caaaagaaaa 1201 taccgtaagg gcaacctgaa aagtaggaaa cggggcgatg acggtgagtg agggatgggg 1261 aggagattgc caaacttagg cacacattgc tgccaggccc tccctcttgg aggcagctct 1321 cacaggaccc tgaaatcttg gcctgagatt atttccaata actaaatgca gatttgagca 1381 acaagagtct tgatgggaat gttctaattc gtgtctctgt tgcaaacctg tcctttccca 1441 cagccaatcg caattaccgc tcccacttgt gagcccccag cggctctgcc ctggtgcgct 1501 tcacacagca ccaagcagca acaagaacag cagaagggga actgccaagg agacctgatg 1561 ttagatcaaa gccagagagg agcctatgga atgtggatca aatgccagtt gtgacgaaat 1621 gaggaatgta tatgttggct gtttttcccc aacatctcaa taaaactttg aaagcagaaa 1681 tgtttacttt gatcttttgg ttggaaagac ttgagttcct gagctaggtg ggtttgggga 1741 agaattgg // LOCUS HUMTNP2SS 1782 bp DNA PRI 14-JAN-1995 DEFINITION Homo sapiens transition protein 2 (TNP2) gene, complete cds. ACCESSION L03378 NID g292827 KEYWORDS sperm-specific basic nuclear protein; transition protein 2. SOURCE Homo sapiens (tissue library: pWE15) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1782) AUTHORS Nelson,J.E. and Krawetz,S.A. TITLE Linkage of human spermatid-specific basic nuclear protein genes. Definition and evolution of the P1-->P2-->TP2 locus JOURNAL J. Biol. Chem. 
268 (4), 2932-2936 (1993) MEDLINE 93155116 FEATURES Location/Qualifiers source 1..1782 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_lib="pWE15" /map="chromosome 16" CAAT_signal 150..155 /gene="TNP2" /note="G00-125-312" TATA_signal 166..172 /gene="TNP2" /note="G00-125-312" gene join(258..657,1506..1522) /gene="TNP2" CDS join(258..657,1506..1522) /gene="TNP2" /note="serine tripeptide motif = aa. 48-50 and aa. 64-66" /codon_start=1 /db_xref="GDB:G00-125-312" /product="transition protein 2" /db_xref="PID:g292828" /translation="MDTQTHSLPITHTQLHSNSQPQSRTCTRHCQTFSQSCRQSHRGS RSQSSSQSPASHRNPTGAHSSSGHQSQSPNTSPPPKRHKKTMNSHHSPMRPTILHCRC PKNRKNLEGKLKKKKMAKRIQQVYKTKTRSSGWKSN" exon <258..657 /gene="TNP2" /note="G00-125-312" intron 658..1505 /gene="TNP2" /note="G00-125-312" exon 1506..>1700 /gene="TNP2" /note="G00-125-312" polyA_signal 1636..1642 /gene="TNP2" /note="G00-125-312; putative" polyA_signal 1695..1700 /gene="TNP2" /note="G00-125-312; putative" BASE COUNT 534 a 378 c 483 g 387 t ORIGIN 1 ttggacccaa gggctcctac cctgaaccag tagctgggac tatccccagg gtacccctga 61 gagctgcccc agcctggggg tgagggtaag gggtaggggg ctttgtcttg gctgagccac 121 atctctcaca cccctgtggc ctgggccatc ataatcagcc ccaactatat aaccaggtgg 181 gctgccaggg cctctgtaaa gctaggcctg ctgggagagg atgaggagga gccctgcccc 241 tccaaacgtg gcctcctatg gacacccaga ctcacagcct tcctatcacc cacactcagc 301 tccatagcaa ctctcagccc caaagccgca cctgcacccg ccattgccaa accttcagcc 361 agagttgcag acagagccat cgtggcagcc ggagccagag ctccagccag agcccggcca 421 gccaccgcaa cccaactgga gcccacagct catccggcca ccagagccag agtcccaaca 481 ctagtccacc accaaagcgc cacaaaaaga ctatgaactc ccaccactct cccatgcggc 541 ccaccatcct gcactgccgc tgccccaaga acagaaagaa cttggaaggc aagctgaaaa 601 agaaaaaaat ggccaagagg atccagcagg tgtacaaaac caagacgcgg agctcaggta 661 ccctttaagg aggtggggaa gggccaccga gcccacagat gatggagagc agaccttggg 721 ggcagtgaga ggaaggctgc agccaggtca caaaggaacc acaggcaaga aggaagaggg 781 agaagagaaa caatggcagt tggctagctg aatgtatgat acgttgacgg 
aaagtctctt 841 tgaaattgga tgggttgatt aggaggatgg aaagatggac agatagcaga taagctagat 901 gaaagcatga atggagttga gaggttgggt ggatgactgg gtgggtaaac aataaatagg 961 ttatagaaag gatagttgga agaatgcatt ggctgaatga taggaagttt ggatacgatt 1021 agctggatgg atggataaat ggatgaatgc actggctggc tagttatttg gttggttagg 1081 tagatgatca gtttgaagat tgtggttggt ggatgaattg gttagaaata gagttaaata 1141 gttgtagaag ttttgtaggc ttggtttgat tggttaaata ttatcttaat agagtaatat 1201 agagtaattg aataaacaga gagaagaata gatatctaga ctaatggata gaatggaaag 1261 aaatgttgaa taaatgaatg gaatgagtga actaatgaat gggtggatga caaatggaag 1321 ggataaatgg atggatacct ggattcacat aggtcaaaag gacactgacg gtagtctaaa 1381 ctctatctat gtcccatatt caatcacaaa tgagtagttg taagacctta caggaggtca 1441 aggaggtcac tgacttcatg aagtgctcag ctattaaagg ttcctttccc actcttatcc 1501 cttaggatgg aaatccaact aatgagaccg cactccttgg cttgttcctg cgtgtttcac 1561 ccaaaggaga aaatgctagg atgaagtcaa tcttcttgca ggaacatgtt actatggtga 1621 tttctacgca acactaatta aagcttgtac ctggaagact atccctgagt agtcatagtc 1681 attttgattt cactaataaa ggtgttatgt gttttggggg cctgcacagg ggcagaaatg 1741 aatgggggta ggatgccaag aagcctgcag agtccactct gc // LOCUS HUMTPA 36594 bp DNA PRI 03-MAY-1996 DEFINITION Human tissue plasminogen activator (PLAT) gene, complete cds. ACCESSION K03021 NID g339817 KEYWORDS Alu repeat; KpnI repetitive sequence; repeat region; tissue plasminogen activator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36594) AUTHORS Degen,S.J., Rajput,B. and Reich,E. TITLE The human tissue plasminogen activator gene JOURNAL J. Biol. Chem. 261 (15), 6972-6985 (1986) MEDLINE 86196143 REFERENCE 2 (sites) AUTHORS Thomas,W. and Drayna,D. TITLE A polymorphic dinucleotide repeat in intron 1 of the human tissue plasminogen activator gene JOURNAL Hum. Mol. Genet. 1 (2), 138 (1992) MEDLINE 93244777 COMMENT Sequence and draft entry of [1] kindly submitted by S.J. 
Friezner Degen, 08-APR-1986. There are two genes for plasminogen activators in the haploid human genome: the tissue plasminogen activator gene (t-PA) on chromosome 8, and the urokinase-type plasminogen activator gene (u-PA) on chromosome 10. The t-PA gene spans 14 exons. [1] notes a partial KpnI repeat and 28 complete or partial Alu repeats (see FEATURES table), a long (ry)*n run (7170-7225), and a 'tgataga' tandem repeat region (23888-24458). [1] also notes a number of potential regulatory signals. FEATURES Location/Qualifiers source 1..36594 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_lib="T.Maniatis" /map="8p12-q11.2" /chromosome="8" repeat_region 740..1033 /rpt_family="Alu" repeat_region 1684..2113 /rpt_family="KpnI" gene 3531..36250 /gene="PLAT" exon 3531..3712 /gene="PLAT" /number=1 intron 3713..17969 /gene="PLAT" /number=1 repeat_region complement(5669..5958) /rpt_family="Alu" repeat_region complement(6319..6462) /note="partial" /rpt_family="Alu" repeat_region complement(6471..6744) /rpt_family="Alu" repeat_region complement(7212..7510) /rpt_family="Alu" repeat_region 8864..9176 /rpt_family="Alu" repeat_region 10067..10365 /rpt_family="Alu" repeat_region complement(10502..10801) /rpt_family="Alu" repeat_region complement(10802..10937) /note="partial" /rpt_family="Alu" repeat_region complement(11745..11868) /note="partial" /rpt_family="Alu" repeat_region complement(12704..12985) /rpt_family="Alu" repeat_region complement(13027..13142) /note="partial" /rpt_family="Alu" repeat_region 16794..17125 /rpt_family="Alu" repeat_region 17170..17466 /rpt_family="Alu" exon 17970..18067 /gene="PLAT" /number=2 CDS join(17996..18067,19767..19809,22105..22242,23160..23270, 23603..23777,26008..26099,28291..28462,29470..29555, 30806..31001,31113..31249,31425..31565,32428..32594, 35337..35495) /gene="PLAT" /note="precursor" /codon_start=1 /product="plasminogen activator" /db_xref="PID:g339818" 
/translation="MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARSYQVICRDE KTQMIYQQHQSWLRPVLRSNRVEYCWCNSGRAQCHSVPVKSCSEPRCFNGGTCQQALY FSDFVCQCPEGFAGKCCEIDTRATCYEDQGISYRGTWSTAESGAECTNWNSSALAQKP YSGRRPDAIRLGLGNHNYCRNPDRDSKPWCYVFKAGKYSSEFCSTPACSEGNSDCYFG NGSAYRGTHSLTESGASCLPWNSMILIGKVYTAQNPSAQALGLGKHNYCRNPDGDAKP WCHVLKNRRLTWEYCDVPSCSTCGLRQYSQPQFRIKGGLFADIASHPWQAAIFAKHRR SPGERFLCGGILISSCWILSAAHCFQERFPPHHLTVILGRTYRVVPGEEEQKFEVEKY IVHKEFDDDTYDNDIALLQLKSDSSRCAQESSVVRTVCLPPADLQLPDWTECELSGYG KHEALSPFYSERLKEAHVRLYPSSRCTSQHLLNRTVTDNMLCAGDTRSGGPQANLHDA CQGDSGGPLVCLNDGRMTLVGIISWGLGCGQKDVPGVYTKVTNYLDWIRDNMRP" sig_peptide join(17996..18067,19767..19790) /gene="PLAT" intron 18068..19766 /gene="PLAT" /number=2 repeat_region complement(18628..18796) /note="partial" /rpt_family="Alu" repeat_region 18879..19178 /rpt_family="Alu" exon 19767..19809 /gene="PLAT" /number=3 mat_peptide join(19791..19809,22105..22242,23160..23270,23603..23777, 26008..26099,28291..28462,29470..29555,30806..31001, 31113..31249,31425..31565,32428..32594,35337..35492) /gene="PLAT" /product="plasminogen activator" intron 19810..22104 /gene="PLAT" /number=3 repeat_region 20946..21259 /rpt_family="Alu" repeat_region 21280..21578 /rpt_family="Alu" repeat_region complement(21640..21938) /rpt_family="Alu" exon 22105..22242 /gene="PLAT" /number=4 intron 22243..23159 /gene="PLAT" /number=4 repeat_region 22253..22545 /rpt_family="Alu" exon 23160..23270 /gene="PLAT" /number=5 intron 23271..23602 /gene="PLAT" /number=5 exon 23603..23777 /gene="PLAT" /number=6 intron 23778..26007 /gene="PLAT" /number=6 repeat_region 25620..25911 /rpt_family="Alu" exon 26008..26099 /gene="PLAT" /number=7 intron 26100..28290 /gene="PLAT" /number=7 repeat_region 26524..26821 /rpt_family="Alu" repeat_region 26941..27239 /rpt_family="Alu" repeat_region 27880..28145 /rpt_family="Alu" exon 28291..28462 /gene="PLAT" /number=8 intron 28463..29469 /gene="PLAT" /number=8 repeat_region 28804..29100 /rpt_family="Alu" repeat_region 29297..29431 /note="partial" 
/rpt_family="Alu" exon 29470..29555 /gene="PLAT" /number=9 intron 29556..30805 /gene="PLAT" /number=9 exon 30806..31001 /gene="PLAT" /number=10 intron 31002..31112 /gene="PLAT" /number=10 exon 31113..31249 /gene="PLAT" /number=11 intron 31250..31424 /gene="PLAT" /number=11 exon 31425..31565 /gene="PLAT" /number=12 intron 31566..32427 /gene="PLAT" /number=12 exon 32428..32594 /gene="PLAT" /number=13 intron 32595..35336 /gene="PLAT" /number=13 repeat_region 32921..33220 /rpt_family="Alu" repeat_region 34234..34525 /rpt_family="Alu" exon 35337..36250 /gene="PLAT" /number=14 BASE COUNT 9621 a 8696 c 9145 g 9132 t ORIGIN 25 bp upstream of NcoI site; chromosome 8p12. 1 ttcacacaac tggtgctgtt accaccatgg gcgtctagtc tggatcagtg gtcctcagtc 61 ttttttgcac cagggaccag ttttgtaaag atagcttttc cacggacaga gggaggggag 121 atagtttcgg gatgattcaa gaggattaca tttattgtgc actttattta tattactatt 181 acattgtatt atataatgaa ataatggtat gactcagcat aatgtagaat cagtgggaac 241 cctgagcttg ttttcttata actagatggt cccatctggg ggttatggga gacagcgaca 301 gatcatcagg cattagattc ttataaggag tgcacaacct caatccctcg catgtacagt 361 tcacaatggg gtttgcactc ctatgagaat ctagtgccac tgctgatcca accggaggtg 421 cagctcaggc agtaatgcga gtgatgggga gcggctgtaa acacagatga agcttcactg 481 gctctccagc cactcacctc ctgctgcgca gcctggttcc taacaggcca cggacagata 541 ccagcccatg gccccagggc cggggattcc cagtctagat ggagacctag acaaggcgtg 601 cgacaataac accgatttta gatccatcat gacatttacc ccatcccctg caaagccaga 661 tggctaccaa aattaaatct tagtttagac acagaatgtc cgtcttctgg tccaaaacat 721 ccttgtcata agtttcttgg ctgggcgcgg tggctcacac ctataatccc agcactttgg 781 gaggctgagg caggtggatc acgaggtcgg gggtttgaga ccagcctgac caacatggtg 841 aaaccccgtc tctactaaaa tacaaaaaat tagctgggcg tggtggcggg cacctgtaat 901 ctcagctact caggaggctg aggcaggaga attgcttgaa cctggtggag gttgcagtga 961 gccgagatca caccactgca ctctagcctg ggcgacagag caagactctg tctcaaaaaa 1021 aaaaaaaaga aaataaaaaa aaaaaaacaa gtttcttgcc cactcttcct ttctctgagt 1081 ttccagagac atcacatcat ttcttaccca gctgagcaga gtcccagcat ggctctcgtt 
1141 cgaataccca tcctgccacc tgccccagtg agaagggttg gagcagcccc tgttctctgc 1201 ccccgccacc tccatgactc atgcattctg tgggggaggc gcccactgca gaaacagcca 1261 ggctggctgg gaaaagccct gcagcaattc cccgtccagt ttctctgtgc ccactttgtc 1321 tccgtgttat ctaggcctgg atttatttct cttttttgaa aatgaagggc tttgatgaaa 1381 tgttcacagg atgtgaggtc accaagattt ttttcctttt tttctgttct tttcttttcc 1441 tcgaagtgat cccttgacta aaatcaaggc tcccattgtc accttatcag cctgcccatg 1501 cctaattctg cactctcagg cttccccaga atctgtccga gggaaaccaa tctcaattca 1561 gaaggaacac agaggcccca gtttctaggg ctgcaggata ttgctgggtc ttaatcattc 1621 gggtatattt cagcaaagct cagggcccct gctaatgatc tgcaaaccct ctcctctcag 1681 tctcccgttg tttagcccct atttataagt gagtgcatgt ggttgttcac tttctgtttc 1741 tgagttattt cagttaggat aatggtctcc agtttcatcc atgttgctcc aaaagacaga 1801 cttcattgtt ttttacggct ttgtcgtatt cctcggtgta tatgtgcatt ttctttatcc 1861 agtcctccat tgatggacac ttaggttaat tctgtgcctt tgctattgtg aattgtgctg 1921 ttgtgaattg tgtgtgataa acgtgtgagt gcaggtgtct ttttgatgta atgatttctt 1981 tttctttgaa tagatactca gtagtgggat tgccgggtca aatggatgtt ctatttttag 2041 tttgggaaat ctccatattg ttttccatag aggttcagga tccaggattt aagtgcaacc 2101 attcagtgga aaaccatcct ctcaagagtt caaatgagag tcctcatgac cagcactccc 2161 tgctgaacgg caatgcccca ccccccaaaa aagcctccat cctcaccatc ttagaccaca 2221 ttgcaggaaa gcagccccat gcccaccaga caggcccaca gctgctgctg ctcatgggag 2281 gtctacctcc cgtgctcagc aggcaaccca ccccagtgcc ccttttctac aatgaccatc 2341 agcagggctg cagcttcata aaaatgtgaa aaacagtttg aaagcaatgt gaaaagcagt 2401 taggtgcctt tcaacttcaa actcagcact catggtttga caaatatgag tatatttagg 2461 atcagaatgt atgaaatctg gcctctcctg aatagtcggt catcccacaa tttcctctgc 2521 caagcttctc ttccctccct gcgttctttc tttcacctct ctctcatttt tacaaggtct 2581 ggtctcagcc agacatgaac caatgatgat agatgctcct gctgagccct gtgatgtgcc 2641 agggccctga catgcgtggc ttctctctga tcctcacgtg acctgcagat gcaaaagcca 2701 gagctcagca tagttgaaaa tcttgattga ggtcatatcc cagttactgg cacagccgga 2761 tttaaaccta agactttccc ctacacgaca gggcttttat ttctcagtca tctgaaaagg 2821 
tgtcagcaag ggaaatggct tgtctatttc caggggcatt ttacaagcaa atactgaaag 2881 gcttcggtgg gcttaagggc tgatggcttt gatcgaattt caggcatgtt ggccccaagg 2941 ccctgtgtat attccctggg cccactcaag gggatgctgg agccggaaag ttcccggagg 3001 ccacctactg cagccctgca ctttacaaag aagagaaaga ttctccctaa aattacagaa 3061 cagggccaaa gatgcctacc ggagcaaacc cccatggggg cacctcctac cgcaggtgag 3121 cccaaggctg gtcctgcctt ctcagtggct acccccctga gctcccgcac cacacaaagt 3181 gttccaatcc ttgtgcatcc tccagtccct ttaacctctc atgtcctgag aggccagagc 3241 tacagccaca gattccagaa gacaccccac tcccagcccc aacctgctgc ctttagaatt 3301 ataaacactt cttgtcatca cagggtcctg aaagtccctt ttaagcctgg gacactagga 3361 ctctaaagga agatgattct taaggtccca tcccacttcc aaattcctgc gattcaatga 3421 catcacggct gtgaataatc agcctggccc gaagccagga tgggctgtgc tgcttccacc 3481 gtgaacttcc tccccctgct ttataaaaac aggcctgcct cagctccctc atggccctgt 3541 ccactgagca tcctcccgcc acacagaaac ccgcccagcc ggggccaccg accccacccc 3601 ctgcctggaa acttaaggag gccggagctg tggggagctc agagctgaga tcctacagga 3661 gtccagggct ggagagaaaa cctctgcgag gaaagggaag gagcaagccg tggtaggtcg 3721 ggtttctgta ccttggggtc tgtctcctct tctttctctt aaaagtcttt ccagcaagct 3781 gagccagtga aggaatgctt taaccaaggg accaactttg taattcaggc aactagagga 3841 tccgctagaa tccactccag aggggagatg tctgtttgca tttgcatgtg attgggatag 3901 gctggagtgt tttggagggg agaggcagtc ggggaaaggg agactaggga ggttgcattc 3961 attggagaag cctctgctga cagagtaaag gggaaagtga aaagaacgat acagaaaagc 4021 cccaaatagc ctatgggagt acctcaagac gagcaagcaa taaaagcaaa atcggctgcc 4081 ttgggacacg tgcctgtgac agctgggagg gcccgctggg tcccctagtt tcctccctcc 4141 cccgatgaac ccccagaaag gaactgaggc tgtggatggg gcagaggggg acttacttcc 4201 agagcactat ttgctaggtg ctgggagaga gaagcaggtg gctttcacac cccacattac 4261 cccgtgggat agagaaatgc ctctgatctc cctaggtgcc acccctacaa gtctccacag 4321 gtcagtttgg ccctctggct ttttgcccag ggtgcatcag tcaagtgtgc agtcctcctt 4381 aaaggacatt ttaattacag gggaaaaaac tgacattaat gtcttttatt cctaaagcac 4441 tctgcaggaa gcatgtgggg ttctccagat tttcactgtt attatcttta ttttcttaac 4501 ctcacaggga 
gggagagagg aagggaggga gggaagacga aaacacgccg aggtcccagg 4561 taactaatga caactcaagc tcaaagaagt gtggtaacaa ggatgtgttt ggtggccaat 4621 tgggaataat atggtggtct cctgtagctg cttgatgaca gaaaaaatgt cgtgtgtgtg 4681 tgtatgcgtg tgcattgcag tcccgggaag gggtcttgca aataccattg tcaggtggag 4741 tcccctcaaa ggtgacttcc tagcagtact tgctgtggca agaacgctca ttctgggatc 4801 tagactcaat ccatgacggt cagcatgagc tgctgggtgc gtgcttcctg tgcaggcaga 4861 gtcagactcc agcactggca gggacactgt gagtattgag atatgaatag ctccatcctg 4921 cccacccatt tattccacct aaaatgttct caaagatgca agcctccaag ctctggatca 4981 aatgataaag ccccagatca aaatctcagg agcaaatgaa agctgccctt gtctccaaaa 5041 tgctccttct ctgcccgact ccccaagacc agcttggttt atttcacaat gactcgcgtc 5101 ggattctccc agtgtctgcg gctcccctgc ttcatcacag gcctgggaca gtccaggcag 5161 ccagggaata gctgggaata gcagttcccc tttgttggtt tttgagttca ctgaggcttt 5221 gatttccccc agggcctgac gttgctgagc agagccagcg gccagtagat ttcaaaggat 5281 tccaaaaaac agtcaggcac tcctttgtga acccaaaagt atctgagaca ggtctgtcaa 5341 tttagaaagt ttattttgcc aatgttaaga acacacatgt gacacggcct ctggagatcc 5401 tgaggacata tgtccaaggt ggccaaggta tggcttggtt tcatacattt cagggagaca 5461 tgagacatta atcaatgtat gtaagatgtg caccggttcg gtctggaaag gcaggacaac 5521 tcgaagtagg ggcttccagg tcataggtag ataagagaag gcttcattct tctgagtctt 5581 tgtcagcctt tcactgaata cacaatttac atatgagagg ggggcagagg aagagtcact 5641 tatgccttag ttggctcagt gaatctactt ttttttttga gacggagttt cactcttttt 5701 gcccaggcta gagtgcaatg gcatgatccc ggctactgca acctccacct cccaggttca 5761 agtgattctc ctgcctcagc cttctgagta actgggatta caggcgcctg ccaccacacc 5821 tggttaattt ttgtattttt agtagagaca gggtttcacc atgttggcca gactggtctt 5881 gaactcgtga cctcaggtga tccacccacc tcggcccccc gaagtgctgg gattacaggc 5941 atgagccaac atgcctggcc tgaatcttca tttttacaaa aacaataggg cagaggaagc 6001 aatcagatat gcgtttgtct caggtgagca gagggatgac tttgagttct gtcctttgtc 6061 ctgtacccat ttgcattgcc aaggtgaagt tcagcagaac tgttttaggg taaagatctt 6121 gaagtccaca aggagtttcc ttgtgggtaa attgtgagag aggtatgtag ccttttatct 6181 ttgtagccgt cttatgtagg 
aataaaatgg gaggcaggtt tgcctcatgc agttcccagc 6241 ttggcttttc cctttggctt agtgatttgg gggtcttgag atttattttc ctttcatacc 6301 ttggtagtgt cacttgaagc caccacgccc ggctaatttt tatattttta gtagagacag 6361 ggtttcacca tgttgatcag gctggtctcg aactcctgac ctcgtgatct gccctcctcc 6421 tccaaagtgc tgggattaca ggcatgagcc accatgcccg gccaattttt cttttctttt 6481 tttttttttt aagatggagt ctcgcttttt cacaggctgg ggtgcagtgg cacaatcttg 6541 gctcactgca acctccacct cccaggttca agcgattttc ctgcctcagc ctcccaaata 6601 gcttggatta caggcgcccg ccaccacacc cagctaagtt ttcaccatgt tggccaggct 6661 ggtcttgaac acctgacctc aagtgatctg cccgccttgg cctcgcaaag tattgggatt 6721 acaggcgtga gccaccatgc ccagacaggt ttttctttga ttatttggca ttatgcacct 6781 gcagtctcag aataaatttt ttactttctc tgagtgattt tcgcttctgc tttgatgaga 6841 ttgttgggac ggacaataag ggaccagtgc attctgctca ttccttcttg gggccatttc 6901 ttttgtatat acgggagaaa atgtctgtct atagtgtcag gctataaacc tcgtcctagt 6961 caagcagctc acaaaaccat atggtgggtc tcacatcaaa ccaggtaaaa gagtgaggat 7021 gatttagact caaagaaaat tagagctaga aataaattta aagctcatcc accctgctcc 7081 aaccctaagt caagccacat ggtcttccga tagtggctca gttttctact tacataaaaa 7141 gacagcacat tctcttagca atatgtgttt gtatgtgtgt gtgtgtgtgt gtgtgtgtgt 7201 gtatatatat atatatatat atataattta gagacaaggt ctgactccat cacccaggct 7261 ggagtgcagc agtgtgatcg taggtcactg aaccttcaaa ctcctgggct caagtgatcc 7321 tccatctcag cctcccaagt agctaggact acaggggcat gctgccatat ccagctaatt 7381 ataaaatttt tttgtagaag cagagttttg ctttgttgcc caggctagtc tcaaactcct 7441 ggcctcaagc aaccctcttg ccatggcctc acaaaatgct gggattacag gcaagagcca 7501 ctgtgcccag tccctcttag tgatattttg catattatat attcaaaaaa tattttctat 7561 gtatgccact tgggaaactt ctgtagaatg ggatagaggg acagataaac ccaattcttg 7621 atccactcaa aaataacacc tgtgacagag tgaagatgaa acaggtgctg ggcacacagc 7681 aagtgtttta gaaacatttt tgaactcaac tttgcctgag ttgtggtgct gatcatggct 7741 gccaggaaat gctagccagc tgttcttcct atacaagtat atgactgggt gtctgcagca 7801 gctgcccggc tgtgaattgc acacaccaat ggccaagacc cagcagcgac ttggctaaca 7861 ggcccagccg atgtgcatgg gtgaccacgc 
gatgctcctg acacacgcac tggtggctta 7921 gggaagtttt tttgaggatt gatcagaaga tctgattcca cctggagcct ctgaagtgat 7981 cacttccagg ttaggctgaa cccagatacc aggagacctg tgtattgaga gccttttgtt 8041 ttttttatgt agtcacaact attccattaa catacagcta ggcattagag aatccatgct 8101 ggactcctac aaaggcccct agatccttag aacactcaga gattccaaag cacacgtcct 8161 cccaggacac cctggcagaa gagcccgggg gcgactgtca cctttgcctg ctgtcgggag 8221 ctgtgctgat gggtaggggg ctctgtgctt gctgtggccc agagccaagc cgtggggatg 8281 tgctctgtct ggagcagacc ccttcactgt ctgcctaact ccttcgtgtg ttcccggaac 8341 agcttctgtg ttatgcattc cagaaagcct gcctgaccac cccaacctca acaagcctgg 8401 gatggacaac ctcctccaca ttccactcca agcccacccc cgtccccacc ctgctgcgtg 8461 gtatggcgct gctctgctgt accctgagtg tctagttatt catcaggccc cttcctaact 8521 atgagctctt tcaggataag agctgtgtct ttgcatcacc agagcctgtg ccagggcacg 8581 gctggtgctc catacacgtt gatggtgaat atatagcatg ggtagatcat tgtctgtgaa 8641 tgcttaacag agaagcagct catgtgggtg cttgagggct gggggggcag ggtgggggtc 8701 tggtaggcgc tgggaggagg acatccctgc agcttctcca gccccataga tgggactctt 8761 gatgtgggca gtttttattt cagctgagtc tccagaaggc aggggacagg gactgggagg 8821 ggatgctaaa cccgataacc cagtatggat gagctaacag aggccgggca cacagctcct 8881 gcctgtaatc ccagcacttt gggagcccga ggtgggcggg ttgcttgagc caaggagttt 8941 gaaaccagcc cgggtcttga acatagcgaa gactctgtct ctacaaaaaa atgaaaaaaa 9001 aaaaaaaatt agccagacat ggtggcacgc acctgtagtc ccagctactt gagaagctga 9061 ggtgagagga tcacttgagc cagggaggtt gaaactgcag tgagctgtga tcacgccact 9121 gcactccagt ctgggtgact gggcgagacc ctgtctcaaa acaaataaat aagaaagaga 9181 gactactggt caacttgatt tctgctggtt attctctttc tgtcatgttc accaaaaaga 9241 gaaacccgtt gtattttatc ttgttgtagc aagagccggt gtctaacacc tgggctactg 9301 agagaaagcg aagggttttt ttctgggaag ctccaatttc cttatttctc tcacatgatg 9361 cctctgactt tcatagtgtg gcttggtcag atgggcctta tcaatagaga tggcctgatg 9421 tctcagctgt gtcatctcag ctgctagttc tggcttctgc tgggagtcca cagtggcctc 9481 tgcatgcatg gtgtgggaaa ggcaaataaa aaagcagatt aaaaaccaga ttaaagacat 9541 caccctgtat tcaggaaata atggctcctg ttgatactgt 
tctacgaaca cagatgaagg 9601 cacaaaattc tacatcaatt gcagagaatt ggttataggc taacacaggt ggtcggatga 9661 tctaggaagc aggattcagt attaacgggg atccatgagg accccatgaa gcgtggcatc 9721 tcttctgcat cagcagcttc aagttgttcc agggagaagc gggaagcctc agtgcttttg 9781 ttgaggaagg atgttcatga cagtaggtgc aggtcgctcc tgccgtgcac ctaggtgtgg 9841 gggaagaact ttcctgcttc ttttccagtt accttacaat tcagcaagac ggaaatgatt 9901 agcccatgac aaaaactagg aaacaggctc agagagagtg agttgcccac acagaatgtg 9961 tgctgtaaat gcagttagat ttccaggttg agaatccagt gaccacgccc cttctttgtg 10021 gcattctgct gtcgtatacc atgtggaaca cattaagaac gttatggcca ggcgtgttgg 10081 ctcacgcctg taatcctagc actttgggag gccaaggtgg acagatcacc tgaggttggg 10141 agttcgagac cagcctggcc agcatgccga aaccctgtct ctactaaaaa tacaaaaatt 10201 agccaggcat ggtggcacac acttgtaatc cgagctactc gggaggctga agaaggagaa 10261 tcgcttaaac ccaggaggcg gaggttgcag tgagctgaga ttgcaccgtt gcaatccagc 10321 ctgggcaaca gagtgaaact ccatctcaga aaaaaaaaaa aagttacaat tgggtgtcac 10381 atagacagtg aggagtagtg gaaagagtgt tagatttggg gtaagagaac tgtgtctcct 10441 gggcttgagt cctgatgcca actctcacaa gatgtgtcac cgtacagcac gacgcttata 10501 cttttttttt tttttttttt gagacaggat cttgctctgt ctcgagctgg agtgcagtgg 10561 tatgatcaca gctcactgca gcctcaaact cccaggctca ggccatcctc ccacctcagc 10621 ctcccaagta gctgggacta caggcatgta ccaccatgcc caaataattt tttgtatttt 10681 tttgttgaga tggggttttg ccatgttacc cagggaggtc tcaaactcca aggatcaagc 10741 cacccaccct ccttggcctc ccaaagtgtt gggattacag atgtgagcca ccacaccttg 10801 caaatttttg tattttttgt acagccgagg gtctccctat gttgccgagg gtgatcttga 10861 actcctgggc tcaaatgatt caccctcctc agcctcccaa agtgttggga ttacagacgt 10921 gagccaccgc tctcagcccc ttacactttt tgaagttcag ttctcccatc tgtaaaaacg 10981 aggtaataat attgcctcaa atgacttaat ggatgtgaaa gggtttagta aactctgtgc 11041 ctggaaaggt agaagctact caaatatctg ttgagtgagt ggcattcatc catgtgtaca 11101 tttctcaggc aacaataatg tcactttcac ctaaggccag taaaatgtgg gtaaagtcat 11161 ataatgatac aatgaaagtt tctgatcagt atttcctaat tcaagtgtgt tgaaatgtgg 11221 agttgggctg gagaattgtc aaatcattgc 
gtatttaatt accttgtggt gatgtttgct 11281 ttagggagtg gaaggaacat gaggaacgaa cgaccttcat cacggaagat gacattcatt 11341 tcgattcact ttagatttca gggcttgttt agatgcagac tttgcagtta acattttatt 11401 gcactgactg atttagtttt caccaggtgg tccttgtgtt tgcttcagaa ggatctaagt 11461 gtattaggac agggtttctt ctgttttgat tgtgggagga ggcgggatag acttctgttt 11521 tcctagaccc acaaccaaaa ttccccactc ctccaaataa ccgggattac caggcagagg 11581 gtggagcatt gaggagatga cagctggaca tgcctgctgt agctttttct gagtttaatt 11641 ccatatagct tagacacagg ccatatgcct acatttggaa gatgttttgg agaaaattct 11701 gctctgatgg gtggggggca gggtctgcca ggaaaatatt ctattctttt gaagacatgg 11761 ggtcttgcta tgttgcctag gctggtcttg aactcctcgc ctcaagtgat gctcctgcct 11821 cagcctcctg agtagctagg actacaggtg caaaccacca cacccagctc tgagaggatg 11881 ctatactctt cacttcattg gaattggtga aaattagata agaaacagac tcttcctctc 11941 ttctccccag ttataaacag ttctgttaag agagggaaaa taaccattta gcctatagaa 12001 aatcaccttt tcgaaaagtt aagaattact tatgtgatgg tagtttttga aagtggtttt 12061 tattctgaga tcatggcctc tgtactatgc agggcctaga gaagtgggat gagagctacc 12121 aggctcaaag ctcctgttcc gaacaaaggg ttgctttcgt gggagcacgt caccctttcc 12181 aaggctagtt cctctcccag catctcactt cagctctcgt cctgctctag aaacgtgcct 12241 ttgctggtcc aaggaatcct ggccgtttct tttcttaatt gttggcattc atctgggcta 12301 tggtggttgt ggcgggggtg gcagtggaga gtgtgttgtt ttaaatccat gagtcatttt 12361 agggtattgg tactcttgaa gactgacttt ctcttgctca ggttttcaga gagacctttt 12421 gctaagcaac aataacctaa gaatctcctt gaccaaagat aaatgtagca aactttgcac 12481 atggtgcaac atttgataat agtatcaatt tacgtttgtc tttcccttgc aatttccctc 12541 ctcctctagc ttgctcacgt ttctgccttc ttcttgctgg caagagtaca gactataatt 12601 ctccagttca ctaacatctg gagcgactgc actgtttgca atttcattga ttgactacat 12661 gtgtacagtt cctttattgc aagtcacttt tctgtataat cttttttgag acgaagtcct 12721 gctctgtcgc ccaggctgca gtagcgcaat cttggctcac tgcaacttcc gcctcccggg 12781 ttcaagcgat tctcctgcct cagcctcctg agtagctggg attataggcg cccgccacca 12841 tgcccagcta atttttttgt atttttagta gagatggggt ttcactatgt tggccaggct 12901 tgttttgaac 
tcctgacctc aagagatctg ccagccttgg cctcccaaag tgctgggatt 12961 acaggtgtga actactgtgc ctggccatat aatctatttt taaaatgttt ttatatgaaa 13021 ataaacagag acagagtctc actgtgttgc ccaggctgat ctcgaactct tgggttcaag 13081 tgatcctccc accctggcct cccaaagtgc tggggttaca ggcgtgaacc acctcatgtg 13141 gcctcgcttt cctatataat ttaaacaata gaactatagt gccatttgta gagtttccca 13201 gtttatactg catgtagcac tccccatagt cgtatgttct cacagcagaa cggaaaccgc 13261 aggatgtcag caccggaggc gaccttggag catccagccc acttggtttt gcattgctgg 13321 caggagattt tatgcctcaa aatatagatg ctttttggaa ggacagcaca ttaattattt 13381 ttcagtcatt tttaccttat gcgaacaagc tgattagttc ctttcctagg agctggtttt 13441 tatggaggga agatggggcc ggtagaggca gatagagggg gatgtaatac acaggcactg 13501 agacagcacc ccactgactt tcccctgtcc ctcctccagc tggcctcctc ctttccaggg 13561 gctgtaccca atccagaatc agttactcac tggtgcccat cgaccatcat ttccccatgt 13621 ggccaccaaa gaaatgaaca ccacctttct ggttggcatg atcatcaatt tctcagccaa 13681 aacggcatgc atccttggct ctgggaaaca gtacatctcc caggaaagtg agcttacagc 13741 acggtacccc acttcctggt taccagacta ttgttcccgt tttccccaag aacagtcatc 13801 attcacaggt tgtttggcag ggcccaggaa cagagtcact ctggttgcta cagccaacag 13861 accaaacacg gcttccaggc aggaaggtca gagctgctct gcaggatgca catgcaggtt 13921 ggaagcagac agggctggcc gagcacagat atgctggctg tgtcctacac acaggtgcct 13981 acacacacat ccttcagatc cgggctgcac tctatcagcc aacccacgga ctctgttttt 14041 tctaatttgc acagattctt tgtcaacctg ccatgccctg agactgctga acttcatcca 14101 acagagaggg taccttttgc aaatctgcaa aaatcaactt tgtgtgctag ggtggcctgc 14161 agggctaggc cttggggtct tggggaccag cattccacct ctcctctcat acagtgattc 14221 tggattccag cccagcaaaa gtcagccaac cttggggcct tcctttcttt catcctttct 14281 tttttttcct tttctatcct tctgtgcctc ccttcctacc cccttccttc cttcttctca 14341 tttgcctttt tttttttcag ttctttgatg atcctgagaa gctactaaat atagcaaaag 14401 ggagataaca cagaacaaaa ttagaagtct atttataaaa atgagcctct tccatccctt 14461 ccgccaatgt catctctcac ccgggctcgc cagagcactc atggacctcc ctttggtgag 14521 gcccaaagtc catcccagcc acgccctgcg gtcccgagtc tgagtccatt gttctggctt 
14581 tcactctgct ctgcatcccg tcactggccc cagacctcga cgcccacctc ttcagcgaag 14641 gcgaaacaag gactcatttt tcctccctct cttcgctctt tcttgttttc agactgcact 14701 gtttttctgc tggccggatg actcatttct ttgtcatctc ttaccccact tggccttttc 14761 agctgctcga atcgcctctg tgtgatgcgg acagagctat cgccctgctg gggtgagatt 14821 cacttgcatt gaatcagctc aaggcgcagg aaggtccagg gaagccaagg aagagaagag 14881 ccaagtctcc aactctctca ggagcaaaga caaaaatacg ctcccgctct cctatctctt 14941 attcagaccg ttcagttgca aaagggctcg aacatcgatg cagcgggtgg tatgtgtgga 15001 ccagccctag gtcgtgcttc tcctccatct ggtcccccac ccctagttat ccagaaaggg 15061 ggatgttccc aggcaatgcc aacatgtcca tactgcactc ttcacattgc agtttcccgg 15121 catctgggat tgcattcctt ttggaaggac tttgagttct tgaaataaag aagtaacttg 15181 gtgggcatct ggaaaaaaca cccaaactgt atcccaggct ggggtgcagg agcctatcga 15241 gagggctggg tggtgtggac agcgtgatga agaaggtcaa gttccttgta ctcactcgtt 15301 tgctgtttca ttcacaattc ctgtgaggtg ggagctctca ccatccttgg gtcttgctta 15361 agagtaagga gtggggcaac ctagagatac atgaacatca aggagaaatg gggagggagt 15421 ttcacatgag ctatttttac aagattaagt aatgggataa aatttaaaag gggaatttaa 15481 gaggaatttt gggcagtttt gagaaacagg aaatttgagg aaagctgaga aagagaagat 15541 ctggagtaag tctctgtgat gagaagattc caccccgcac tcctgttctg ggtggccttg 15601 gctcctggct ccctttcttc tgaaactgcc tggagccagg gagtagcaca gctgcttcct 15661 cccggggaga aagggacacc caaagagcgc actccaggga gaccttgggc actgcgggct 15721 caaaggctgg agatggcact tctgttccct cctgatggga caggtgcagg gaggctgaga 15781 ggatgccacg ctggcgagcc tccctctgag ctgtgttttc tcattggaat tgctgtaccc 15841 ggagcctcct gcaggtttcc ctcgccctct ccctaagctc ctctgcctga cacaggagcc 15901 cgggagaatc tgtcaggcat gaaatgtagc gaggcaagga gggtgatcag cacctcccac 15961 gggggagttc gtttcgaagt aacaatcaaa gatccagagc cggacccagg cagggaggca 16021 aggtctggaa gcccctgtgc ctccaagccc ctaggacgct cgggggaggg ggagggggag 16081 ggtgggacct gaagttgttg tgaccgaagg gaaatgagtc agggaaagaa atgccatgct 16141 ttaatttttg tggatgaagg agggaacagg ccgccagacc acaaaataac acccggccga 16201 ctaacctact tttagctgcc gagggcagga tcacagagga 
gaggcagatg gaaactgggg 16261 ctctggggtc caggaagatc tgagtcttcc gaccaaacat catttgcctc ccctccttcc 16321 tgcaaatatg cacattgccg gatctgctgc agtctatgaa aaaggagaca gatgttctgg 16381 cagagcggtg ggggggtggg cgtggaggtt gggttggggg gcctctgctt tgtggggaga 16441 acctcaggca ggggagtagg ctccatcctg cgtggaggaa cctgagcagt ctaaggcagg 16501 ctggctctat gatgtgctgg caataaaatc ccaatgcttc cgctgcagaa gtccaagagg 16561 acatgactgc ggctccatct agtcaagccc agggcaggaa ttcccttcca ggaaaccaag 16621 ccagagcgct gtggtctctg ggctgccaag atgtctcaga caatggtcta gcccttcagc 16681 ccacagaaat ctctgggcaa aattatctcc cagcattgac agacgaatgg ataaacaaaa 16741 tgtgttctat cccacagtgg agtattattc agctttaaaa aggaaggaaa aatgctgggc 16801 gcggtggctc acaccttgat cccagcactt ttgggaggct gaggaaggag gatcatttgt 16861 gcccaggagt tcgagactag cctggacaac atagagaaac cttgtctcta cacacacaca 16921 cacacacaca cacacacaca ctctctctct ctctctctct cagccaggca cagtggcaca 16981 tgcctgaagt cccagctctg ggaagctgag gcaggaggat ctcttgagcc tggtgggtca 17041 aggctgcagt gaaccatgtt catgccactg cactccagtc tggatgacag agcgagacct 17101 agtctcaaaa aaaaaaaaaa aaaaaaaaaa aagaaagaag aaagaagaaa ggaaaaagaa 17161 aaaaaggcag cggggcgtgg tggctcacgc ctgtaatccc accactttgg gaggccaagg 17221 cgagcggatc acgaggtcag gagatcgaga ccatcctggc taacacggtg aaaccccgtc 17281 tctactaaaa atacaaaaat tagccgggcg tggtggcagg cgcctgtagt cccagctact 17341 caggaggctg agacatgaga atggcatgaa cccgggaggc ggagctgtag tgagccgaga 17401 tcacaccact gcactccagc ctgggcgaca gagcgagact ccatcttaaa agaaaaaaaa 17461 aaaaggacat gctacagcat ggatgcacct tgaagacatt tgcaaagtga aacaaaccag 17521 tcgcaaaaga agaaacactg tctgattcca cttctaggat gtatgtagag cagtcaggtt 17581 catacagaca gaaagtagca agcgtggttg ctgagggaaa aggagaatgg gagttattct 17641 ttaatagtta cagagtttca gttaggggtg atggagaagt tctggagatg ggcggtggtg 17701 acggtcggtc gcacagcgat atgacgtgct taatgccact gaactgtgca ctgaacatcg 17761 ttaaggtggt aaattttgtg ttatgtatat tttgcaacaa taaaaaatat gtagtttcta 17821 gaatcctctt ctagatgcat gtctggcgtt tctgccgcgg ggctggcact ggaggaagcc 17881 agacaggaca acattgcggc 
ttatcctcct ctgttgccat catggacctc tcgcaccccg 17941 tgatcaagct gttttttctc tccttccaga atttaaggga cgctgtgaag caatcatgga 18001 tgcaatgaag agagggctct gctgtgtgct gctgctgtgt ggagcagtct tcgtttcgcc 18061 cagccaggtt ggtgtgcagg atccctgtgt cccgcccctc aaggctgtga tgcttccgga 18121 ggctccaggg gtctgtgtcc atctgggcca cggaggttgg aatgctgtca gggagcagag 18181 aggcacaggg acgaggggct ggcattgctg tggagcacac agcacacact cccttctgta 18241 cagcaaactc tgtatttggg ggcccagtgt gtagctgggc cctaggcact tattttgacc 18301 tgagtaagcg gaattaacat gccactccct gcctgtgact gccttttttt gcagctgttt 18361 ttaaagaatg tattcaaatt aatccataaa gtacagttgt ccctcatatc tatggggtat 18421 ttgttccagg acccccaagg ataccaaaat ccagggatac tcaagttcct catatagaat 18481 ggtgtagagt ttgcatatcc tatgcacacc ctcctgtata ctttaaatca tctctaaatt 18541 acttataata cctagtacaa tgttaatgct atgtaaatgg ttgttataag gtatttttta 18601 atttgtgtta tttttattgt tgtattattt atttatatat tttttggaaa cagggtctca 18661 ctctgtcacc caggctggag tgcaatggca caatcatagc tcactgcagc accaaactcc 18721 taggcttaag ccttggcctc ccaaaactct ggggttacag gtgtgagcca caagcccagc 18781 ttatattatt attttttatt tttctgtttt ggttgaatcc acagatatgg aacctgcaga 18841 tatacaaagt ggactgtatt tataaaagac aatgcaggac agggcacagt gactcacacc 18901 tgtaatctca gcactttggg aggccgaggt gggaggatcg cttgagccca ggagttggag 18961 accagtctgg gcaatatagt gagatgctgt ctctacaaaa aatttaaaaa ttagccgggt 19021 gtactagtat gcacctgtgg tcccagctac tcaggaggct gaggcgggag gatcgcttaa 19081 gttcaggagg ttgggacttc agtgagatat gattacgcca atgcactcca gcctgggtga 19141 caaactgaga tcctgtctca aaaaaaaaaa aaaaaaaaga aaaaggcaat gcaagtgcca 19201 tgagccacag cagaggcaaa ctgccctcat ggtgcctact ctctagctga gatggaagcg 19261 ctacacacac tctctgaaaa tgcctagcaa taaagggcct cacggaagtc caaactaacc 19321 acacctctcc aacaggaccc tacagcgggg agagccaggt gactgcgtgg agtagacagc 19381 cagctgcaac tccattgccc aatatggtgg tcacttacct tgtgcaactg tttaaattta 19441 aaataattaa agttaaaaat tgtttctcag tcccactggc cacattcaaa gcactcggtg 19501 gccctgtgtg gcttctgtcg tggacagcac agacagaaaa catttccacc aagcagaaag 19561 
ctcttctgga cagagccagt caggaaggag aggaagaatt ggactagaag gcaggagatg 19621 gactttcact tgggccagtt acccgagggg gtctctgagc agaggtatct ggccggcagg 19681 ttcacatggg cttctgggtt ggccgccttt actcttagta cacctggcat gtgctgatcc 19741 ttcctccacc tctggtttct tcgcaggaaa tccatgcccg attcagaaga ggagccagat 19801 cttaccaagg tcgggtgaag ctgaggggtg catggggtca tcagggaggg aggcagggcg 19861 gggtgcaggg tgcatcaccc acccaccacc tgccttcttc cacccaccac ctgcctacgg 19921 tgtgacagtg ccattgcagc atggttctac agagatgcct gtgtcggcac cattctccca 19981 tcagctgcgg ccactgggcg ggttctgatt ctgaggggag cagaaacagg tgtttccccc 20041 ccaaggggcc aggctgtcag ggatcacaac ctagcggggc tgggggggcg tcactggtcc 20101 aaggttctag ttatacagat caaaggcaga gcccaggaga gagcaagtgg cttgcccagg 20161 gtcacacagc cgccagcagc acggccaggg cagtcacagc cgaggccagg cccatgctct 20221 gggtctcatc ctaggctgtg atcccaccac agtgcaccgg cagctttggt aacaacatcc 20281 ctgtgtatcc atttccctct gttgatccaa ccgggatgtc ccaactggga tgtcccaccc 20341 aggcaacccc tcgacatggt ggggtctcta agtccacatc tagagttgac tctagaaggc 20401 tgaggaatgc gaggcccagc ttcaggtcca aaactaccaa cttaggccct cgatttccct 20461 ccttcacgac ccagttcctt cgagtctgga agactctgca ataccctttg tgaaggtgtt 20521 aaaaagggtg actcactccc cagagcgagt cacacctgga ctcttaatgc cttcgttgtt 20581 ggatggcatc ctgcttccta ccaggttatc ctggcctcag tgacctcaga agcagcaagg 20641 gtctgactta cagtgggcac gtgctacttc tttaattaat tgaaagtaaa gttgttgttt 20701 ttaagttggc attctggctt gaaaacagaa agatggcatg aaaatattaa tgggagtact 20761 tcaactattt tcattatgcc aaaaatactt ggagaagtag cacattttct gcaaagtcta 20821 aaatgtcaat tttctttggg gccaaaaatg catcatttct gtgtggtaat acttatgtgg 20881 ataaaaaact ctggtagaca gttatgcata tatatatttt ttggtgaatt acaatgagaa 20941 aaagtccagg tgcaatggct cacatctgta actccagcac tttaggaggc ggatccagga 21001 ggatggcttt agcctaggag ttcaagacca gcatgggcaa catagtgaga cacctctgcc 21061 tcccccaccc caccccgcct ctacaaaaaa atttaaaaat tagctgggtg tggtggtgtg 21121 cacctgtagt cccagctact tgggaggttg acgcaggagg actgcttgag cctgggaggc 21181 agaggttgca gtgagccaag acagcaccat tggactccag cctgggtaac 
agagaccctc 21241 gaaaaaaaaa aagaaaaaga aaagaaaaga aaaaattagg ctgggcacgg tggctcatgc 21301 ctgtaacccc agcactttgg gagaccgagg tgggtggatc acttgagtca ggagttcgag 21361 accagcctgt tcaatatggt gaaaccccgt ctctaccaaa aatacaaaaa ttagccgggc 21421 atggtatcgg gcacctgtaa tcccagctac tcgggaggct gaggcaggag aatcacttga 21481 acccaggagg tggaggttgc agtgagccga gattgtgcca cttcactcca gcctgggcga 21541 cagagtgaga ctttgtttca aaaataataa taataaataa ataataaata aataaatcat 21601 atgagaagat actcaatggc actcagaaaa gaaatgcaaa tttattttat tttattttga 21661 gatggagtct tgctctgtcg cccaggctgg agtgcagtgg tacaatctcg gctcactgca 21721 acccccactt cctgggttca agcaatactc ctgcctcagc ctcctgagta gctgggatta 21781 caggcacctg ccaccatgcc tggctcattt ttgtattttt agtagagata gggtttcacc 21841 atgttggcca ggctggtctc gaactcctga gctcaagtga ttcacccacc tcggccaccc 21901 aaagtgctgg gattaccagc atcagtcacc gtgtttggcc agtattcatt atcttgttgt 21961 cttccagttg tgtctcttca ataagggagt atgcaatgga gtctatagaa agcaagtgag 22021 ggagttttct cagtaggtgc aagggtagtg aggcatttca ttttgagtca tactctgatg 22081 aggctggcct gtcttttctc atagtgatct gcagagatga aaaaacgcag atgatatacc 22141 agcaacatca gtcatggctg cgccctgtgc tcagaagcaa ccgggtggaa tattgctggt 22201 gcaacagtgg cagggcacag tgccactcag tgcctgtcaa aagtatgtgc tgaggctgga 22261 aggtggtgca tgcctgtgat cccagcactt taggaggcca aggtgggagg gtcgctggag 22321 cccgggagtt caagaccaat ctgggcaaac atagcaagtc ccctgtctct acaaaaaata 22381 aaaaaattag ccagacctgg tatgtagtcc caactacttg ggaggttgag gcagaaggat 22441 cacttgagcc caggagttgg aggctgcagt aatctacgat tatgccactg catttcaacc 22501 tcagtgacag ggcaagccct cacctctaaa acaaaacaaa acaacacaaa caaaaacaaa 22561 aacacagaaa agcccagtac aaatcgactt caaactgaga cttcgagttc aggggtccag 22621 tcatgaaccc tgtctttcat ggtccccgga aagttcaaat cttcaaaagg agcctccagg 22681 cctgttcggc cttaggcgga ggtgaaggta caagcagaaa agggagcaag accaagtgac 22741 cttgggcaaa tgcccgcttg cctcagtttc ctcatctgta aagaggtggg aataacagta 22801 cacacttcat ggtttgctgt gaggattaaa ttagctaatt tttgtagact gtacaacagc 22861 gcctggcaca tagagcacat tcttaatatt 
ttaactcaca tcatccttgg agctgctttg 22921 aaaatacagg ggacccttga acaacacttg tctgacctgc agggttcact tatgtatgga 22981 tttttttcaa ccaaaggcag gatgaaaaca gtatttgtag gatgcagaac tggtatacac 23041 gtgttccacg ctgcaggact tgagtgtgtg tggattgtgg tagacagggc tcctggaacc 23101 atggcctgag cacacagaag gaggctgtca ttaggggctt ccttttttat acttgacagg 23161 ttgcagcgag ccaaggtgtt tcaacggggg cacctgccag caggccctgt acttctcaga 23221 tttcgtgtgc cagtgccccg aaggatttgc tgggaagtgc tgtgaaatag gtgagtaggt 23281 gagagcacgt gaaacgagct gaaatcccac ctctccagtc tcccccgccc ccggaaggaa 23341 aggcagggtg cgatggcagg gccaggggtg ggggccaggg acagggaagg cgcttctgcc 23401 atgggtcagg aagcacagtg ccacacagca agcagagccc aggtgtgacc ctgaatccag 23461 gctggcaccc ttgccaccca gcctgtggct ccgagccagc ctgcatcagg cccgggggag 23521 gacggggcag agaggggagc ctcctgaggg ctgaatgaaa ggggactccc atgtgaggca 23581 tccatgccct gctctctgtc agataccagg gccacgtgct acgaggacca gggcatcagc 23641 tacaggggca cgtggagcac agcggagagt ggcgccgagt gcaccaactg gaacagcagc 23701 gcgttggccc agaagcccta cagcgggcgg aggccagatg ccatcaggct gggcctgggg 23761 aaccacaact actgcaggtg agggcacatc cccaaggagg gattccccct gtgagaagct 23821 gctacagctg gggaaagatt ctgtcagatc cgtcagggtt ctcccaagaa acataaccaa 23881 tacaggatgg atagatggat agatgataga cagataatag atgataggtg atagatgata 23941 gattgataga tgatagatga taggtgatag attagataaa tagatgatac atacatgata 24001 gatagatgat aaatagacgg tagatggatg acagatagac agatgatagg tgatagatag 24061 atgatagatt gatagatgat tgatagataa atagatgata gatacataga tgatagatgg 24121 tagatagatg atagatggat agatgataga tagatgatag gtggtagatg atagacagat 24181 gattagaaag atagatgata gatacataga tgatagacac atagatgata gacagattga 24241 cagatgatag atgattgata gatgattaaa tagatgatac atagatgata gataatgata 24301 aatagatgat agatgataga tgataggtga tagatagatt gatagatgat agaagattga 24361 tagatgatag atacataggt gatagtagat gtaagatgat agatgataga tagatagatg 24421 atagacagat tgatagatga tagagagata gagatagaga aagaggagag agaaatagag 24481 agagagagag aatgtgtgtg aacccaaagt atctgagaca ggtctcaatc gatttagaaa 24541 gttgattttg 
ccaaggttaa ggacaccccc gtgacacagc ctcaggaggt cctgaggaca 24601 tgtgcccaag gttgtcaggg cacagcttgc ctttagacgt tttagggagt catgagacat 24661 caatcaacat gtgtgagatg tacatcggtt tggtcgggaa agttgggata actcgaagca 24721 agggcttcca ggccataggt agataagaga caaaaggctg tattctgagt ccttgatcag 24781 cttttcactg aacacacaat tgagtctggc tcagttcatc tgcattttta cataaaaaat 24841 agggcagagg aagcaatcag atacgcattt gtctcagatg agcagaggga tgattttctg 24901 tcctgcacct gggaagataa gctatccatt tacaatgcca aggtgaaagt caacagaact 24961 gttttagggt aaagatcttt aggcctgcaa ggaatgtcct tgtagagaat taaaaaaaaa 25021 aaaaaaaact ttgcagctat cttattgagg aataaatgag aggcaggttt gcctgacgta 25081 gttcccggct tgacatttcc cttggctcat gacgttgggg tcctgagatt tattttcctt 25141 tcacactaga tagatagaca tttatgttaa ggaactggct gacaggatcg tgggggctgg 25201 caagtctgaa acatcttggt aagccaggct ggaagtgcag gcagaagctg aggctgctct 25261 cttaaggcag aatttcttat tcaggaaacc tcatatttgc tcttaaggcc tttaagtgat 25321 tggacgaggc ccaactacat tatctaggac aatttccttt tctaaagtca actggctgta 25381 gatattaacc acatctataa aacactttca caactcctag atgaatgttt ggttaactga 25441 ctgggtgcca cagcctggcc aagttgacat ataagactat tccagtccac accttgtcaa 25501 tttggcaccc atgtgcatct ccttaaacca tccttcacct ccaagtaaac acaggaacaa 25561 aatcatactc ctgcctaaca tgatagaact accagtgtac aaccaaaaac gcactcccgg 25621 ctgggcgtgg tggctcacgc gtaatcccag cactttggga agccaaggca ggtggatcac 25681 ctgaggtcag gagttcaaga ccagcctggc caacatggtg aaaccctgtc tctactaaaa 25741 atacaaaaat tagccaggca tggtggcagg cacctgtaat cccagctact cgggaagctg 25801 aggctgaaga attgcccaaa tccaggaaac ggaggttgca gtgagaggag atgcgccgct 25861 gcactccagc ctgggataaa cagcgagact ctgtcaaaaa aaaaagcacc ccccattcct 25921 ccagaaaagg ataccaaagt ccttgggtcc taaaggagag gcttcacctt tgacctaccc 25981 tggcccttcc cttggctgat ttttcagaaa cccagatcga gactcaaagc cctggtgcta 26041 cgtctttaag gcggggaagt acagctcaga gttctgcagc acccctgcct gctctgaggg 26101 taagtggcag ccaccccgcc tttccctgaa tgtccctggg ggaggaaagt ccaggaggac 26161 aggccgatag ccctcctgca cctgagggga gaaaggggga ggccaggagt gagtcatttt 
26221 ggttgcagag ggaagaaaga caggacttag gtctgcgcct gggctcggcc tctgactggc 26281 tatttgtctg acctggagct gatctcctca cactctgacc ctcaatttcc tcctctatgg 26341 aggagaggat ggggctgaat cagtggtttt cagtcctttt gagaatgaat ggagattctc 26401 ccttttttga ttaatgggtg aatggggctt tgtctctctt tcgctccctc tcaaactttt 26461 taatgcaaaa aaaaaaaaaa aaaaaacttc caagattgaa aaaatatata ttaagttaca 26521 agaccaggca cagtggctta tgcctataaa cccagcactc tgggaggctg aggcgggcgg 26581 ctcacctgag gtcaggagtt tgagaccagc ctggccaaca tggcgaaacc ctgtctctac 26641 taaaaataca aaaattagcc ggatacggtg gcaggcacct gtaatcccag ctactcagga 26701 ggctgaggca ggagaattgc ttgaacctgg gagatggagg ttgcagtgag ccgagatcat 26761 gccactgcac tccagcctgg gtgatagaac aaaaaactgt ttcaaaaaaa agaaaaaaat 26821 gaaatgctaa gttacagaaa ggtactatga attcagtggt gcttatacaa atcattacgt 26881 tacttatcac atttatcaca gcactatgaa agattttttt taagttctaa atatatatag 26941 ggccaggtgc agtggctcgt gtctataatc ccagcacttt gggaggctga ggcgggagaa 27001 ctgcttgagc ccaggagttt gagaccagcc tgggcaacaa agtgagactc catctctaca 27061 aaaaatacaa aaattagtca ggtgtggtgg cttgtgccta tggtcccagc tgcttgggag 27121 gctcaggtgg gaggatcgct tgagcccagg aggttgaggc cacagtgagc aatgattgtg 27181 ccactgcact ccagcctggg tgacagtgag atcctgcctc aaaataaata cataaatatg 27241 tgtgtgggtg tgtgtatata tatatgtaat acacatatat taaatttata tatgatatat 27301 ataaacacgc accagctata tgcattcagc acccaggtcc acgggtgatg cctgtgactc 27361 tgaagacttt acagtcactc tacaatatcc tccccatccc ccatggtctg gtctaaatga 27421 ggactcgcag atctagataa tcttgaaaat cccttccatt tatggtattt atgtatagag 27481 gtatgctttg tcagtttcag gtcgggagaa aaaaatggaa aaatggggga aataaatgaa 27541 aacaggcagc actccacggc tcataccctc ctccttccca agcccatcat cagatcctca 27601 ggacactggc agacacagtc cgagaaataa cccccacccc agccccactg aagcacgttg 27661 gttcactgca gcctgttaag taacacaagg aaacccctac tggctttcaa agcagcttca 27721 gaaacagccc ccactcatgg aagtggttct tcctgctgaa cctgaagatg tccccagact 27781 ctctctcaac atagcaatct agggagccac ttccggtgga gatgaaaccc ccctgccatc 27841 tctaagtaaa aggactcaaa gagggggaaa aaggctcagg 
ccgggcgtgt tggctcacac 27901 ctgtactccc agcactttgg gaggctgagg cgggcagatc acctgatgtt aggagttcaa 27961 gaccagcctg gccaacatgg tgaaaccctg tctcaactaa aaatacaaaa ttagcgtagc 28021 atggtggcgc atgcctgtaa ccccacctac tcgggaggct gaggtagaag aatcgcttga 28081 acccaggagg tggagattgc agtgagccaa gatcgcgcat tgcactccag cctgggcaac 28141 aaaaggggaa gtgtgaggtg ttacactctt ctcagtgccg ggctagcggc ctccagcttc 28201 ctcgaccggc agctccgctt caagccccag ccctgacttg cgtgcttcag aatttgtttt 28261 tgtttcagtt tgagaattct tttcttctag gaaacagtga ctgctacttt gggaatgggt 28321 cagcctaccg tggcacgcac agcctcaccg agtcgggtgc ctcctgcctc ccgtggaatt 28381 ccatgatcct gataggcaag gtttacacag cacagaaccc cagtgcccag gcactgggcc 28441 tgggcaaaca taattactgc cggtaggtag cacaggggtg gggggttcag gtcttggcag 28501 aacgtgggat tagggtgtga gacgggggaa gatccaatgt ctcaagttgc atgacagacc 28561 cagtgcgtgg gaagcaccca tggatattat ctaatccaac ctcttcactt gctagaataa 28621 cacatattgt gaaaagcaag gtctaccagt tttccaacct aaatcccaag ttaagggtcc 28681 tggcctgtaa ccatttagtc ctcagctgtt ctcctgacat ctttattgca atgatttgta 28741 agagttccgt aacaggacag ctcacagttc tgtctgacaa ccctatgaga ttagaacact 28801 acggccgggc gcggtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcgggcg 28861 gatcacgagg tcaggagatc gagaccatcc cggctaaaac ggtgaaaccc cgtctctact 28921 aaaactacaa aaaatagccg ggcgtagtgg cgggcgcctg tagtcctggc tacttgggag 28981 gctgaggcag gagaatggca tgaacccggg aggcggagct tgcagtgagc cgagatcccg 29041 ccactgcact ccagcctggg caacagagcg agactccgtc tcaaaaaaaa aaaaaaaaaa 29101 aaaagaacac tacattactg actgggtaac aaagttaaag agaagttctc ctagggtggg 29161 ggtgtgctgc aaggtcaaga tgaactcggt gtcctccctc ccagctcagt ggttttcatt 29221 ggttgactga gtctccttct actcttacat ggcctgtgat gtggctgaaa atgggattga 29281 aaatcttaaa ctcctggcct ggtgtggtgg tgcatgcctg taatcccagc actttgggag 29341 gctgaggcag gaggattgct tgagcccagg agttcaagac cagcctgggc aacatggcaa 29401 gaccccatct ctacaaaaat taaaaaataa gaagataaaa aggaaacctt aaactctctt 29461 atcatttagg aatcctgatg gggatgccaa gccctggtgc cacgtgctga agaaccgcag 29521 gctgacgtgg gagtactgtg 
atgtgccctc ctgctgtaag ggctgggccc cggctgcctc 29581 cctgcacctg tctcccttct tcaccatctc ctcgccactt tctaggcctt catggctgtc 29641 tggtgcagac tgtgtgccta ccagacttcc aggctgggtg gggagggggc ttccatgact 29701 gaagccacgg tgggtgggcg ggtgtccatg acccatgcat gggggtggtg gggaggggca 29761 agaagaaaga aaagaacaca gcaacctctc tgagaagtca gtaggctcta cccggctctc 29821 tctagacgcc tctattggtg ttgctgagag taatggggaa tgaggaggag gcctcgcatg 29881 catggctcct tagataataa agtgccttca ccacatttgt ccaacagcgt ggtggaggtg 29941 gtcttcctac ctccagttta ctgagtagga agcagtctcc cagacgccag gtggcctgcc 30001 caagctcgct gggcaccggg aggtaaagct agggcctgcc ctcatgccct ctgaccccaa 30061 tggcatgtac ttccactgtg agatcttcca cgtttcctgt tgacctattt cacacagggg 30121 agctggctca tgatgtgggt gagagatagg agattcaagt tggtgcatat tgctaaaaaa 30181 tgaaacttta gtttggaaat ggccagagag taggttatta atcacctcct gtgaacacag 30241 atcagccaga atgttcccca aggaacacaa gttcaactgg gacgatgaga cacagaccac 30301 cagctaactc acacccaagg gcattagtgt gagtcctcag tggtccgtgt ccctaagctc 30361 ctcagagctg agcaggtggg gacgtcagga aagctttgcg aggaggcagg attgcacctg 30421 ggccttgaag catgggtaag gttggacaga tggaaaagaa atgagcagac attccacaga 30481 aggaagctgc aattcttcat ttcttaggcc aggaaatggc tgtaggaaaa cattagaagg 30541 aatgaaggat ggcagatccc cagttttgga gtcaaacata cttgggaact aatccaggtg 30601 ccattgcaac cttgggcaaa aacgtctccg cctcctgaaa cttatcgctg tgggaactaa 30661 agagaatgta tggacaacat tcggcatgct gatcccacca agtgaaagtt gtggttttac 30721 agccatgctg atggcctcct cactctacct acacacgtgc aggaggcctg tgggactgac 30781 tgcctcctcg tcctttcctc cccagccacc tgcggcctga gacagtacag ccagcctcag 30841 tttcgcatca aaggagggct cttcgccgac atcgcctccc acccctggca ggctgccatc 30901 tttgccaagc acaggaggtc gcccggagag cggttcctgt gcgggggcat actcatcagc 30961 tcctgctgga ttctctctgc cgcccactgc ttccaggaga ggtaggggct cggaaaccca 31021 gttggttttt ccaccctcta acgcctagaa accaagccct ttaggggaaa tcccacagca 31081 aaaaaagcat tctaaggctg tttctccacc aggtttccgc cccaccacct gacggtgatc 31141 ttgggcagaa cataccgggt ggtccctggc gaggaggagc agaaatttga agtcgaaaaa 31201 
tacattgtcc ataaggaatt cgatgatgac acttacgaca atgacattgg taagagctcg 31261 tcattcccgg ctccagttca tacagatcac acaacacacc cacatgcact cacctatgtt 31321 tacacacacg ggcctcctcc tccagcccct gcccctcctg cgtgttcctc ccctcccggg 31381 gacgcggccc tcacgccagc tcactcctcc tttctccctc ccagcgctgc tgcagctgaa 31441 atcggattcg tcccgctgtg cccaggagag cagcgtggtc cgcactgtgt gccttccccc 31501 ggcggacctg cagctgccgg actggacgga gtgtgagctc tccggctacg gcaagcatga 31561 ggcctgtaag tggaaggaag cctcggcccc atcctgtctg cgggacagca ggggaggctg 31621 cggtccacca gaggaaaatg gggtcaggag accgaagttc cagccccggg tcaggcactg 31681 acgccagtcc cgtctccctg tgtcgtgggg gcccgtctct gctgtctctg cctcacaggg 31741 acactgcagg ccaggaaagg acacaacagt atgagggcct cgaagtgctg caggtgttgg 31801 gaaggttctg agtaggaagt ggagtacagt cgctaagcgt gtgggttctg ggaagctgga 31861 gttacaaaga cccgatttca aatcccactt gctgagctgt gtgaccgtgg acatggattt 31921 ttattttgaa ctgctcccag cctcagtttc ctcgttggta aagtggagat agaactatct 31981 ccctaattag attaggaggg ttaaatgaga taatattgaa cgaatggatg tcagctgggg 32041 tgattttgtc ccccaggaaa cgtttggcga ggtctggaga cgtttttggt tgttacagtt 32101 tagggtgagg gttaacgccg gccttcagtg agtgcagcca gggatgctgc acactgcagg 32161 gctcgggact gtccccatcc cccacaacag ggacctccct ggtccgaagc atcagcagtg 32221 ccactacgga gaggctgtgg gttcaatccc ggaatcctgc tgtttttgtg ggtcaaaacg 32281 ataaaatacc agcagtgtca taggattcga cctaatacat gttgaatctt gatcaatgtt 32341 agctattatt ggaaaatttg gctttctctt tcaaaaacgt ttataaataa ctttccctaa 32401 aaccatatga tggttctccc ctttcagtgt ctcctttcta ttcggagcgg ctgaaggagg 32461 ctcatgtcag actgtaccca tccagccgct gcacatcaca acatttactt aacagaacag 32521 tcaccgacaa catgctgtgt gctggagaca ctcggagcgg cgggccccag gcaaacttgc 32581 acgacgcctg ccaggtaacc aggagtggcg gcccatgctg ggcacaaccc aggggagaat 32641 gcccctggga caacaggttc ttcaccaaag aaaaaagtta tcaccagagt ggagtgattt 32701 cctcttagtg atacccctag ggattctaga tgagggcctc acagaataaa gcaactggag 32761 gaagatcaaa gtcacctaaa cctaagaatt ctaaactgca ggaaatttcc ccaggatctg 32821 cacaggcaac tccacctgta caggatttcc tttcagccca ggcaattgat 
tgactattga 32881 agtgatggag agagccacac aattaagaaa cgtttgcctc agcctggtgt ggtggttcac 32941 gcctgtaatc ccagcacttt gggaggctga ggtgggaggc tcacctgagg tcaggagttc 33001 gagaccagcc tgaccaacat ggagaaaccc catctctact aaaaatacaa aattagcctg 33061 gcatggtggc acatgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt 33121 gaacccggga ggcagaggtt gcagtgagcc gagatcgcgc catcgcgctt cagcctgggc 33181 gacaagagcg gaactcgatc tcaagaaaaa aaaagaagag gaggggaaga aatgtttgcc 33241 tctgttggca ctagagaatt ggagtctgct ttgtagaagg agaaggaaga aaggcaactt 33301 actgggaagc aggatctaga tgatgtgaga aagaagtttt caaaactcac attcagccct 33361 aataccctgg gaaagcacac tatttgaggt cctttgtgtg ggcaccttat cagaattagt 33421 acttatattt catgattagc cagatactgg ggtatgggag gaccaggcca gcctgctggt 33481 attaagacct gcacagagcc aaggaattga tcttcctcag acatttcagt gcctgcagcg 33541 tgcgaagctc tcctttaggg gctgggggta gaagagatac aaagagaagt gaggccaggc 33601 tctggtgctg aaggatgtgt gggctttgtt ttcgcaaggc agtcatatcg gcggtttccc 33661 ctctgattcc ctgacaggat ttagtttctt tttgaatggt ttcttcagca acctcagctg 33721 taccagctta cagtccagaa acaaaaaggt tgtaccaaag tgaaaccatt atctatctgc 33781 aagcttagtt aaaagcaaaa ccaaatcatt ttaattcaat ataagaacag ctaacaatta 33841 ccaagagttt tactagatgt cttccagtgc ttttcatgca ttcttctact gaatgccata 33901 accactctat gggtagtatg aatattatct ctatggtaca gataagaaaa ccaaagcatg 33961 gttagtctaa ggccatcagc tagtaagtga tggagccaag ccagggtttt tgaggcctga 34021 aacgtttgag taattaactc tactttacca gaatagatac agcctaatcc tccatgggac 34081 caagataact gagatacaat ctacccactc ttcagtaagt ctccacttaa cagcatgcct 34141 ggaaaaaata ggttcttaga aactgcaact gtaagtgaaa tgacgtataa caaaaccaat 34201 tttaccacca gctaattgat ataaacaaga gttggtcagg tgcggggctc atgcctgtaa 34261 tcccagcact ttgagaggct gaggaaggca gatcacctga ggtcaggagt tcgagaccag 34321 cctggccacc atggtgaaac cctgcctcta ctaaaaacac aaaaattagc caggtgtggt 34381 ggtgcgtgcc cataagcacg agaattgctt gaaccccgga ggtggagggt gcagtgagct 34441 gagattgcgc cactgcactc cagcctgggt gggtgacaga gtgagactct gtctcaaaac 34501 aaaaaaacaa aaaaaaacaa aaaaagaaga 
gtgaagttct tgtggcaata tttctagtca 34561 taaaaacatc agcaaacttc taaataaaga cccaaaacac ttctaacatt caatattgaa 34621 ataaatgtga gctatacaac atttaagaaa ggttaataaa aacaaacatg gtatttattt 34681 acccaatttt tggagaataa gtggatgatg gcggccataa tagtggtggg ttaattcagg 34741 gaataaatgt ttgcaaggca ggaatggtaa ggaacacctc ttactaccat gaagttcaaa 34801 aacaatcaca aatagggcag gctccctgac tgcctccaca cctcacagtt tatcatgcat 34861 tcgtgtattt agcacagact agacagattt ttactttata acactttgta ttcattcatt 34921 ttccaacgtg cttattccgg ttcagggtct tgggtggggt ggccagagcc acctggcagc 34981 tcagggcgcc aggcgggacc agccctggac agacgccacc atcctattgc agagccacta 35041 acacccacac tcactcagac tgggcccacg tagatacatc ctcagttggc ttaatgtgca 35101 agctttggga tgtggaagga aaccggggta cctggacaaa acccaggcag acagtgtccc 35161 cagacaggaa ttgatttttg ttttcctcat caaagttata atgaaatgac actgaatgga 35221 atattattga agggcctgct gtattttgaa gattacgatg ttcatgctac acatactgac 35281 tttctccaga tagtctgctt tttttgcaaa cgccaaacct gctatctcct ttgcagggcg 35341 attcgggagg ccccctggtg tgtctgaacg atggccgcat gactttggtg ggcatcatca 35401 gctggggcct gggctgtgga cagaaggatg tcccgggtgt gtacaccaag gttaccaact 35461 acctagactg gattcgtgac aacatgcgac cgtgaccagg aacacccgac tcctcaaaag 35521 caaatgagat cccgcctctt cttcttcaga agacactgca aaggcgcagt gcttctctac 35581 agacttctcc agacccacca caccgcagaa gcgggacgag accctacagg agagggaaga 35641 gtgcattttc ccagatactt cccattttgg aagttttcag gacttggtct gatttcagga 35701 tactctgtca gatgggaaga catgaatgca cactagcctc tccaggaatg cctcctccct 35761 gggcagaaag tggccatgcc accctgtttt cagctaaagc ccaacctcct gacctgtcac 35821 cgtgagcagc tttggaaaca ggaccacaaa aatgaaagca tgtctcaata gtaaaagata 35881 acaagatctt tcaggaaaga cggattgcat tagaaataga cagtatattt atagtcacaa 35941 gagcccagca gggctcaaag ttggggcagg ctggctggcc cgtcatgttc ctcaaaagca 36001 cccttgacgt caagtctcct tcccctttcc ccactccctg gctctcagaa ggtattcctt 36061 ttgtgtacag tgtgtaaagt gtaaatcctt tttctttata aactttagag tagcatgaga 36121 gaattgtatc atttgaacaa ctaggcttca gcatatttat agcaatccat gttagttttt 36181 actttctgtt 
gccacaaccc tgttttatac tgtacttaat aaattcagat atatttttca 36241 cagtttttcc aaaatcagag tggaatggtt ttgttataga tgctgtatcc cactctttat 36301 tcatgttcac attttaaaat catttggaat tcgccccgct aactttccca cacctgtaac 36361 atacaggtca tggctggtgc tccggaccgg aaaaggaggg acagaatgct tggtctgatg 36421 ggctaatatg gcatttagag aagtaccaag gtacagtgga gccggtcaca aaagggcaga 36481 cttgtagtag aattcagttg caagagggat tggggaatct taaggaaaaa atagaatctt 36541 aaggaaaaaa taactgggtg agacgtggac tgtggacagg cgcggaaaag gcac // LOCUS HUMTPALBU 6172 bp DNA PRI 03-JUN-1994 DEFINITION Human tear prealbumin (TP) gene, complete cds and promoter region. ACCESSION L14927 NID g307517 KEYWORDS tear prealbumin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6172) AUTHORS Holzfeind,P. and Redl,B. TITLE Structural organization of the gene encoding the human lipocalin tear prealbumin and synthesis of the recombinant protein in Escherichia coli JOURNAL Gene 139 (2), 177-183 (1994) MEDLINE 94156196 REFERENCE 2 (bases 1 to 6172) AUTHORS Redl,B. 
TITLE Direct Submission JOURNAL Submitted (06-JUL-1993) Bernhard Redl, Institut fur Mikrobiologie (Medizinische Fakultat), Universitat Innsbruck, Austria FEATURES Location/Qualifiers source 1..6172 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" /tissue_lib="EMBL3" CAAT_signal 868..872 /gene="TP" gene 868..5955 /gene="TP" TATA_signal 916..921 /gene="TP" exon 944..1093 /gene="TP" /number=1 precursor_RNA 944..5955 /gene="TP" CDS join(1004..1093,1490..1620,2670..2740,3314..3424, 4259..4360,4560..4585) /gene="TP" /note="precursor" /codon_start=1 /product="tear prealbumin" /db_xref="PID:g307518" /translation="MKPLLLAVSLGLIAALQAHHLLASDEEIQDVSGTWYLKAMTVDR EFPEMNLESVTPMTLTTLEGGNLEAKVTMLISGRCQEVKAVLEKTDEPGKYTADGGKH VAYIIRSHVKDHYIFYCEGELHGKPVRGVKLVGRDPKNNLEALEDFEKAAGARGLSTE SILIPRQSETCSPGSD" sig_peptide 1004..1057 /gene="TP" mat_peptide join(1058..1093,1490..1620,2670..2740,3314..3424, 4259..4360,4560..4582) /gene="TP" /product="tear prealbumin" intron 1094..1489 /gene="TP" /note="TP intron A" exon 1490..1620 /gene="TP" /number=2 intron 1621..2669 /gene="TP" /note="TP intron B" exon 2670..2740 /gene="TP" /number=3 intron 2741..3313 /gene="TP" /note="TP intron C" exon 3314..3424 /gene="TP" /number=4 intron 3425..4258 /gene="TP" /note="TP intron D" exon 4259..4360 /gene="TP" /number=5 intron 4361..4559 /gene="TP" /note="TP intron E" exon 4560..4586 /gene="TP" /number=6 intron 4587..5771 /gene="TP" /note="TP intron F" exon 5772..5955 /gene="TP" /number=7 polyA_signal 5936..5941 /gene="TP" BASE COUNT 1115 a 1958 c 1855 g 1244 t ORIGIN 1 gtgatgcaga gcatgtcaca gataaagtca tcctttccaa ggacggcttg tctcagcttc 61 tgcccttgag gctgtgttga tgtcctcttg ccttcctatg cccacagtgc tccactaagt 121 tatacgaagt tacagccctg ggatgtgtca aggagggttt ctcagacctt ctttctggag 181 tcagggaagc ccctcccatc actcccagga gccggaccta ggccctctct gcctgccttg 241 ggacttggag ctgacaggcc cctgcctctg ccccatcacc agcctgtgct tctcctccat 301 tccttgtttt atgtgagtgt gtgtgcgttt gtgcatgtat atatgtacac acatgcatac 
361 acacacacac acctgtcttg gtcttctgtc tgtctgacct attgctgctc tgtgtgtgga 421 gcagagaggg tcgttactgc acaccttgct caggacagtg gacaccgtct ttaaatgcac 481 acgggcatcg gttcacacag gagggtccct ttttttccat gatgaatctg aggccaagta 541 acctgcccag agatgcagga agctcagctg agccccaagc ccacggccct cggactccaa 601 ccccaaaccc ccttgacagt gtgcacgtca gcaagtgtgg gatggctggg gggcatctgt 661 tacggctgca cacgtcccca ggaggcctgc acacccttca ctcccttcag tttcccttca 721 ggtgaaatct gcaaatctgc tcctctggac ctggtcctgt ccaagtgatc ctctggatta 781 acaagctgtt attcaccagc ctccaacaat aatggcaggt tatctgctcc attgccagaa 841 ctggcagtgg tttacatcca ggaagggatt ggtcagtctc ctgtgccagt ccctggggga 901 ccctgagccc agcggtataa agggcggctg tgggaggact ggtacagcct ctcccagccc 961 cagcaagcga cctgtcaggc ggccgtggac tcagactccg gagatgaagc ccctgctcct 1021 ggccgtcagc cttggcctca ttgctgccct gcaggcccac cacctcctgg cctcagacga 1081 ggagattcag gatgtgaggc ccggatggaa ggctgggctg gagggggcaa ggggcgaggc 1141 tgagactgga tggagacccc atgcctcctc catccaagga gaccctcgtt tttgggttgg 1201 gcattcaagc cctgccctga gggaatggtg ggatggggca gcagggggtc tggctgactt 1261 cacttcttcc ctgggatgag gctcctgtgg tcctggctgg cttgtggggc tggagccacc 1321 ttcaagtggg cctggcgagg gtgctgggtg ttttctgggt gggttagatt ggggaaacgt 1381 tccccgtttc cagccctcgg ggtgcggtag agtctagggg ctgcaggcca gggaaggggg 1441 aggctctgga gcagtcggcc tgagcctgat agagaggggc cttctccagg tgtcagggac 1501 gtggtatctg aaggccatga cggtggacag ggagttccct gagatgaatc tggaatcggt 1561 gacacccatg accctcacga ccctggaagg gggcaacctg gaagccaagg tcaccatgct 1621 gtgagtgtct gccagccggc cgggcagcct gcaacctggt ctagggcctt ccctttcccc 1681 acccaggaga gctctggtgc tggggaggtg ggcagacctc ctgggaggcc tctctcggcc 1741 cctcctgcaa tgcttgaggg caaactgtgc cactgaggtg cctcggctgg aggggccctg 1801 agggggcaga ggccggatca gaccctgtgg ctaccagtgg cagccctggg taccgcccac 1861 ttcctcctcc gactagggaa tgtctggtga cgtgcaggcc cctttggaaa aggtgtttac 1921 ccccgaccag agtcagagcc agtcctggca gtaaatttct gctttggaga agtgcaggga 1981 tggggcttgc agaatgacca agagggggca taggagaaga caccatggag aagaccccct 2041 tggaatccct 
gtgtagcgct cacctgtgcg gctctcacct gggctgcttc cacccgcgtg 2101 gctcacatct gtgacttccc acctgtgtga tttcacccgc atggctccct cctgagtggc 2161 atgcagtgcc tgtgctccgg tcctgggggc tgaatctcag gaagtcactg accctggcaa 2221 tgaccccgat ttttttttcc cccaaagccc ctcactgaac ctttccctct ctgggccaga 2281 tgatgatgga tatggggctt gtgcctgaat gtctctaaaa ggaagaatgc agctggcggg 2341 atgaggctca ggcccacgcc acagacgctg cctgcaaacc aggcagcctg ggtgcctctg 2401 gcaggaaatg tgttgggggc gggggggttg ggaaggcagc caccaggagg agcggtgcag 2461 agctcctggc catctcggga gaccccggga actgccgcat gggacccgga ggggccgctg 2521 gtgcctgggt ctgagatgca gagaagcatc ctcctccatg ggcctgtggc agactcgggg 2581 ccacatttgc agggcattgc agcatggttg ctccagggcc gggggaggca cagccagggc 2641 cgctttgcca gggggcttct gttttccagg ataagtggcc ggtgccagga ggtgaaggcc 2701 gtcctggaga aaactgacga gccgggaaaa tacacggccg gtgagtcccg gggcctgagc 2761 cagagcctga gcttgaacac acgctggatg tgggggggca gccagtgcct ttttggggtg 2821 gccttttggc ctgggaagct tttcctgcct tctgactcca ctaaaacccc actctctcct 2881 cccactggtc aatttgcatt agagacttga acacctgacc ccaaacacca ggtgtggcag 2941 gtttgtcccg ggcgaacctc tgggcttgcc ctgtagccct ggctgtggct ctgcagggcc 3001 aggagcagag tcgtatttgg ctgcgggggg ctgtacccca actcccacct ccgcagacct 3061 catacccgga ggaacccccc gtgactcact ggtttgggat gtggccgcct cttgggtaca 3121 tcaggttcct gggaagtggg ggtcccatgg cctggggctc aggcctggtg gggaacctgg 3181 gtcagggagc tctgggaggc tgggagggtg caggagctgc ggggcttgct gggcctggca 3241 acgcacgctg tcccgggatc ctctctgaag tccctggtgg ctaattcagg aatgtgctgc 3301 tgtctttctg cagacggggg caagcacgtg gcatacatca tcaggtcgca cgtgaaggac 3361 cactacatct tttactgtga gggcgagctg cacgggaagc cggtccgagg ggtgaagctc 3421 gtgggtgggt cccgcaccct caccctgcaa cccatgcctc cacctgccct ccctcctccc 3481 tcgccctcct gcaccctctt ccccatggga gaagccactg ggacccagga acccactgca 3541 gttttcttag gatgagtttt cctgataaag gcctcaattc ccacccctgg agttcaggat 3601 ttgggatgcc ccggcctcgg tatctgcagc agaggcagga aggagccacg gctgcaggca 3661 caggggttca aaggtcacac atgtcccaga gtggccaggc cctcccccgg agcagctccc 3721 tgcagtgtca tccttgagcg 
cgttcctgtg gggtgtccag caggagagag gaacagggat 3781 gtgagcagaa gtctgcatgg aaagaacgtt ccagctcccg gggttgaatt ccggacgcgg 3841 tggctcccac agatggtgac tcatcagtgc tgtccctgtc agaggacgag gcccatgaag 3901 gctcctgagt tctccctgga gccaggggtg tgcgaatgac agaggattgg gatcctggtg 3961 cccccaaggg agaggcgctt cctccagggt gggcctgtcg tgccgtccat cctcactccg 4021 ggagatgctc agggatggat ggagagccct tcccgttctc ccgttctgct gccacaggag 4081 ccttctcagc ctgcacggca ccctggtccc actttgccag cctgagggcc tctgggttca 4141 attcccccca cggtgggcga ggtccccttc cccagcccag ctcaggcctc cagactggga 4201 tggggtgccg tcggccagtg ctgcctcgag cacccacatc tcgtcctggc acccacaggc 4261 agagacccca agaacaacct ggaagccttg gaggactttg agaaagccgc aggagcccgc 4321 ggactcagca cggagagcat cctcatcccc aggcagagcg gtaggaggca tggccctgca 4381 gagcccccca tgtccccgcg tggggacatc agcagagctg cattgcacgg gcgcataact 4441 gtgctgcgtc taattctggc tttgtccttc ctggggtggt ggagctggtg gctgatgagg 4501 ggcccggtcc tgactgcatt cctggggtgt cctaacgcct gtgcttcctt ttcctgcaga 4561 aacctgctct ccagggagcg attagggtga gtgaacagct ttagaggaca tttgagaaaa 4621 tccagttctg gggctcagtg gtgcttccag aggccatggg gtctctccca cccatcccac 4681 tcctacctgg gtgggagatg ccccacgtag gtcaggcagg tgggagctct gctctgagct 4741 gcccacgctc gggcgtcaga ctccgttctt gccctgggtg tgactcctgg gcagctggga 4801 agcccacagc gctgagcccg cctttgctgg ctgctggtgg agacggtgtc taccccccag 4861 tcacatgggc aggagcaggc aggagattcg ggttcctcct cagcacccct caccctcgcc 4921 atgcctgtga ctccacttac ccgataagtt gctcaaagat tcagcaaagt ggcaggcgtg 4981 cctgtgctcg ctgtgtcaga tgggcccctt gctggctgca caaggggacc agagtttgga 5041 gactcgccca aggtcacctg cccaggtgtg gcagagccag aatcctgaca gcatcgctgt 5101 ccttcccgca ccctcccctt ggcctgcagt caccctgatg ccacttcact cagcagggag 5161 ctctgtcagc agcctgcggg ggtggtccct gctccatccg ctccaatcta gaggctggag 5221 cccccacctg cagcccccaa gagtccgtgt gtgcccccca caccatcgcc cccaagagct 5281 cgcatgtgcc caccacacca tagcccccta gagcccggtg cacgtccccc acaccattgc 5341 ctctgagatc cagcacgtgc ccccgcaacc atcgcccccg agagcccgca tgtgccttcc 5401 accgtaaccc tcaagagccc agtgtgtacc 
cccaccatag cccccgagag cccacatgtg 5461 cccacacacc atagcccccg agagcccaca tgtgccccca caccatgccc ctgagagccc 5521 acatgtgccc cccacaccat cacccctgag agcctgctgt gccccccaca ccatagcccc 5581 cgagagcctg catgtgccct ccacatcaca gcccccaaga gtgcagcgtg tgccccccac 5641 accatgcccc tgagagctca catgggcccc cacactgcag cacccaagag cccacatgcg 5701 cccccacatt gcatccccct cccacccttg ctgtcctggc ctcactcacc ctccccccct 5761 ttccagggca ggggacacct tggctcctca gcagcccaag gacggcacca tccagcacct 5821 ccgtcattca cagggacatg gaaaaagctc cccacccctg cagaacgcgg ctggctgcac 5881 cccttcctac caccccccgc cttccccctg ccctgcgccc cctctcctgg ttctccataa 5941 agagcttcag cagttcccag tgactccgcc ctgtctgtgg ggtccctcgg ggtgagaggt 6001 gctggctggg tgctcttttg ggtgcaggga ctgggtggga agcaggagga aggacaggga 6061 accagcctgg acccagttct ccctggtggg tccacaggcc aacagtggga ggggcctagc 6121 aggggggtgt gtgctggagg ctgggcctga aggtcccggg gtgcttcctg ca // LOCUS HUMTRHYAL 9551 bp DNA PRI 28-APR-1995 DEFINITION Human trichohyalin (TRHY) gene, complete cds. ACCESSION L09190 NID g292835 KEYWORDS trichohyalin. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9551) AUTHORS Lee,S.C., Kim,I.G., Marekov,L.N., O'Keefe,E.J., Parry,D.A. and Steinert,P.M. TITLE The structure of human trichohyalin. Potential multiple roles as a functional EF-hand-like calcium-binding protein, a cornified cell envelope precursor, and an intermediate filament-associated (cross-linking) protein JOURNAL J. Biol. Chem. 
268 (16), 12164-12176 (1993) MEDLINE 93280194 FEATURES Location/Qualifiers source 1..9551 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="1q21.1" CAAT_signal 58..61 /gene="TRHY" TATA_signal 117..123 /gene="TRHY" exon 138..200 /number=1 mRNA join(138..200,1476..1644,1507..1644,2512..9120) /partial intron 201..1475 /gene="TRHY" /number=1 exon 1476..1644 /partial /gene="TRHY" /number=2 gene join(1507..1644,2512..8070) /gene="TRHY" CDS join(1507..1644,2512..8070) /gene="TRHY" /codon_start=1 /product="trichohyalin" /db_xref="PID:g292836" /translation="MSPLLRSICDITEIFNQYVSHDCDGAALTKKDLKNLLEREFGAV LRRPHDPKTVDLILELLDRDSNGRVDFNEFLLFIFKVAQACYYALGQATGLDEEKRAR CDGKESLLQDRRTEEDQRRFEPRDRQLEEEPGQRRRQKRQEQERELAEGEEQSEKQER LEQRDRQRRDEELWRQRQEWQEREERRAEEEQLQSCKGHETEEFPDEEQLRRRELLEL RRKGREEKQQQRRERQDRVFQEEEEKEWRKRETVLRKEEEKLQEEEPQRQRELQEEEE QLRKLERQELRRERQEEEQQQQRLRREQQLRRKQEEERREQQEERREQQERREQQEER REQQLRREQEERREQQLRREQEEERREQQLRREQEEERREQQLRREQEEERREQQLRR EQQLRREQQLRREQQLRREQQLRREQQLRREQQLRREQQLRREQQLRREQQLRREQEE ERHEQKHEQERREQRLKREQEERRDWLKREEETERHEQERRKQQLKRDQEEERRERWL KLEEEERREQQERREQQLRREQEERREQRLKRQEEEERLQQRLRSEQQLRREQEERLE QLLKREEEKRLEQERREQRLKREQEERRDQLLKREEERRQQRLKREQEERLEQRLKRE EVERLEQEERRDERLKREEPEEERRHELLKSEEQEERRHEQLRREQQERREQRLKREE EEERLEQRLKREHEEERREQELAEEEQEQARERIKSRIPKWQWQLESEADARQSKVLL EAPQAGRAEAPQEQEEKRRRESELQWQEEERAHRQQQEEEQRRDFTWQWQAEEKSERG RQRLSARPPLREQRERQLRAEERQQREQRFLPEEEEKEQRGRQRREREKELQFLEEEE QLQRRERAQQLQEEEDGLQEDQERRRQEQRRDQKWRWQLEEERKRRRHTLYAKPALQE QLRKEQQLLQEEEEELQREEREKRRRQEQERQYREEEQLQQEEEQLLREEREKRRRQE RERQYRKDKKLQQKEEQLLGEEPEKRRRQEREKKYREEEELQQEEEQLLREEREKRRR QEWERQYRKKDELQQEEEQLLREEREKRRLQERERQYREEEELQQEEEQLLGEERETR RRQELERQYRKEEELQQEEEQLLREEPEKRRRQERERQCREEEELQQEEEQLLREERE KRRRQELERQYREEEELQRQKRKQRYRDEDQRSDLKWQWEPEKENAVRDNKVYCKGRE NEQFRQLEDSQVRDRQSQQDLQHLLGEQQERDREQERRRWQQANRHFPEEEQLEREEQ KEAKRRDRKSQEEKQLLREEREEKRRRQETDRKFREEEQLLQEREEQPLLRQERDRKF REEELLHQEQGRKFLEEEQRLREERERKFLKEEQQLRLEEREQLRQDRDRKFREEEQQ 
LSRQERDRKFREEEQQVRRQERERKFLEEEQQLRQERHRKFREEEQLLQEREEQQLHR QERDRKFLEEEQQLRRQERDRKFREQELRSQEPERKFLEEEQQLHRQQRQRKFLQEEQ QLRRQERGQQRRQDRDRKFREEEQLRQEREEQQLSRQERDRKFRLEEQKVRRQEQERK FMEDEQQLRRQEGQQQLRQEDRKFREDEQLLQEREEQQLHRQERDRKFLEEEPQLRRQ EREQQLRHDRDRKFREEEQLLQEGEEQQLRRQERDRKFREEEQQLRRQERERKFLQEE QQLRRQELERKFREEEQLRQETEQEQLRRQERYRKILEEEQLRPEREEQQLRRQERDR KFREEEQLRQGREEQQLRSQESDRKFREEEQLRQEREEQQLRPQQRDGKYRWEEEQLQ LEEQEQRLRQERDRQYRAEEQFATQEKSRREEQELWQEEEQKRRQERERKLREEHIRR QQKEEQRHRQVGEIQSQEGKGHGRLLEPGTHQFASVPVRSSPLYEYIQEQRSQYRP" intron 1645..2511 /gene="TRHY" /number=2 exon 2512..9120 /gene="TRHY" polyA_site 9120 /gene="TRHY" polyA_signal 9121 /gene="TRHY" BASE COUNT 2907 a 1975 c 2869 g 1800 t ORIGIN 1 aacaagccat ttgtggagac agaggtggag ctgggcttgg ttaggaatga atcaggccat 61 tccacagagt gggtgtctcc ttcccaagtt gctttccagg gcacaattaa aacccctata 121 aaaggcccag ctcccagtta cccagtacac ttgcctgtgg tgtcagcaag cactgtcgac 181 ttcttcctct ggtgaagtgg gtaagtccca ttctgtggga tcgtggtctt ctttatgatt 241 ctccattttt atagctattt cagatgttgg gatatggggg gaggttccat gtgccagaag 301 gtatcagtat tgcagggata aataaactat cactaactct atcccatctt cttatggttg 361 gagccatcac ttgaactgaa gcatgaccct tctccttggg ctctgaactc tatacttctg 421 cacatcaagg atgatcatgt gtggctctga tagggttcat cttcctaaaa actgctatct 481 caaaagtttg ccagccttct gttctctttt acattggttc tacctaatat gggccatatt 541 catacagtca cagcatttaa ggtactggag ttgagaagta cataaagaag tcagctagat 601 gaacgactac cttatcccac cagcaaagcc attccatgta ttcttataac attgatctac 661 tgctggctaa tgttttataa aaagccaaga ttccaatgat gcatttgggt ttaacaaaac 721 caatatcatt cacagttttc tggattccgt ttggtttaga aaggacctct cagaagcttt 781 caatacttca atatccgaat atcttactat atctgagttg aggcaggtaa tatacagtct 841 ctgttttctg catttgtgta tctgaatgtt acatgccatc ttttgactag gaagaagtac 901 tatttaatct tagaattgct gacttacaaa ttatattcta taagagttcc taaatccctt 961 tatggactgt aatgttgagg aatcattcat attccctttt cattgttcta ttttctacca 1021 atcgttttgc tgacatggcc tctatccact ttaagatact ctcaagtctt ccttcacctt 1081 ttggctttac ctgtcctctt 
cgctcaactt taaaggaagg tgtgtagcca tatataaaat 1141 tttaatttct gcactcttct cttaattttc tactctgaaa tacggtggag agctggaaga 1201 aagacagaag aaaagggcat agataatcca cattgggtgg acaatcaaaa gctgacaaca 1261 ggatagtctg aagatgattc cctggcttgg aatttctcag gatcgctctt tctctttctg 1321 atacaatatt caaatattaa agtgctctga aagtccaggt tgaaattacc gctataaatt 1381 caaattattt agggatctgc ctgaaatagt gtgaatgaag ccttcccaaa agcagaaacg 1441 ggattttgat tctggatctt attttattgt tctaggttta cttgaacttg aaggaaagaa 1501 aaaaaaatgt ctccacttct gagaagcatc tgtgacatca ctgaaatttt caatcagtat 1561 gtctctcatg attgtgatgg agcagcatta actaagaaag acctgaagaa cctccttgaa 1621 agggaatttg gagctgtgct tcgggtaaga actaacaaga aaatgagatc tattgacttg 1681 aggctatgag atttattctc agaggagacc agagcaagga atggtggttt tatattcatt 1741 ttacaccaca acaggtctac actacatccc ccattcattt ctgagtcaaa aggtacttac 1801 ttgacattgt agtctgaata ataaagtatt tcatgtactt gatggcatgg catgtgaatg 1861 agctcttcat gggacattac tacaaaagat gtcaaatcac actagacttg gaggaacttg 1921 gaggaactta aatttgtttc caaatttcaa aactgagatc agcctgactc tattaaatgg 1981 tgctacccgt aaatgttttg ttctgttttc taatatggaa tagaaaccaa atcaagaata 2041 ctggctgctt cagacagaaa tggctactgc aaatcctcat aaatttctat tgtatctctc 2101 tcaaggatga gttcattctt tctcaattaa agcgaacttg tgttattctt tcttgatgtt 2161 gagtagcttt gttaatttac acacaagttc acgatgctgt tttgaatctt cacctcaggc 2221 tctgctctaa ggtgcgtagg cttacctgct atctacttgt gtctctcttc ctgcttcctt 2281 aggtttgatc agcactaaat tacgagatgt aaaaatttca aacgaatata tgctttaaag 2341 tgagggttca cattttacat ggggacaaaa cttgatacac actggacatt tttctaattg 2401 ctctgaatgt ctcttgaatg tcagcatagc ataaaatata tcatgtgtga atataatttt 2461 accacctgta aatagtgcat tgtaaaattt ttgtttttca ccattttata gagaccacat 2521 gaccctaaga cggtagatct gatcctggaa cttctggatc gtgacagtaa tgggcgtgtc 2581 gatttcaacg aattcctcct atttattttc aaagtggctc aagcttgtta ctatgctctc 2641 ggccaggcca cgggactgga tgaggagaag cgagcccggt gtgacggaaa ggagagcctg 2701 ttacaagatc gacggacaga agaagaccaa aggagattcg agccccggga cagacaactg 2761 gaagaagaac ctgggcaacg acgcaggcag 
aagaggcagg aacaggagag ggagctagct 2821 gagggagagg agcaaagtga gaaacaagag cgacttgaac agcgcgacag gcagcgccgc 2881 gacgaggagc tgtggcggca aaggcaagaa tggcaagaac gggaagagcg ccgtgcagag 2941 gaagagcagc tgcagagttg caaaggtcac gaaactgagg agtttccaga cgaagagcaa 3001 ctgcgaaggc gggagctgct ggagctgagg aggaagggcc gcgaggagaa acagcagcaa 3061 aggcgagagc gccaagacag agtgttccag gaggaagaag agaaagagtg gaggaagcgc 3121 gagacagtgc tccggaagga agaagagaag ttgcaggaag aggagccgca gcggcaaaga 3181 gagctccagg aggaagaaga gcagctacgg aagctggagc ggcaagagct gaggagggag 3241 cgccaggagg aagagcagca gcagcaaagg ctgaggcgcg agcagcaact aaggcgcaag 3301 caggaggagg agaggcgcga gcagcaggag gagaggcgcg agcagcagga gaggcgcgag 3361 cagcaggagg agaggcgcga gcagcagctg aggcgcgagc aggaggagag gcgcgagcag 3421 cagctgaggc gcgagcagga ggaggagagg cgcgagcagc agctgaggcg cgagcaggag 3481 gaggagaggc gcgagcagca gctgaggcgc gagcaggagg aggagaggcg cgagcagcag 3541 ctgaggcgcg agcagcagct gaggcgcgag cagcagctga ggcgcgagca gcagctgagg 3601 cgcgagcagc agctgaggcg cgagcagcag ctgaggcgcg agcagcagct gaggcgcgag 3661 cagcagctga ggcgcgagca gcagctgagg cgcgagcagc agctgaggcg cgagcaggag 3721 gaggagaggc acgagcagaa gcacgagcag gagaggcgcg agcagcggct gaagcgcgag 3781 caggaggaga ggcgcgattg gctgaagcgc gaggaggaga cggagaggca cgagcaggag 3841 aggcgcaagc agcagctgaa gcgcgaccag gaggaggaga ggcgcgaacg ttggctgaag 3901 ctcgaggagg aggagaggcg cgagcagcag gagaggcgcg agcagcaact aaggcgggag 3961 caagaggaga ggcgcgagca gcggctgaag cgccaggagg aggaagagag gctccagcag 4021 cggttgagga gcgagcaaca actaagacgc gagcaggagg agaggctcga gcagctgctg 4081 aagcgcgagg aggagaagag gctcgagcag gagaggcgag agcagcggct gaagcgcgag 4141 caggaggaga ggcgcgatca gctgctgaag cgcgaggagg agaggcgcca gcagcggctg 4201 aagcgcgagc aggaagagag gctcgagcag cgactgaagc gcgaggaggt ggagagactc 4261 gagcaggagg agaggcgcga cgagcggctg aagcgcgagg agccggagga agagaggcgc 4321 cacgagctgc tgaagagcga ggagcaggag gagaggcgcc acgagcaact gaggcgcgag 4381 cagcaggaaa ggcgcgagca gcggctgaag cgcgaggagg aggaagagag gctcgagcag 4441 cggctgaagc gcgagcatga ggaagagagg cgcgagcagg 
agctagctga ggaggagcag 4501 gaacaggccc gggagcggat taagagccgc atcccgaagt ggcagtggca gctagaaagc 4561 gaggccgacg cacggcaaag caaagtctta ctcgaggccc cgcaagcagg aagggcagag 4621 gcgccgcaag agcaggagga aaagaggcgg cgcgagagtg agctgcaatg gcaggaggag 4681 gaacgggctc accggcagca gcaggaagag gagcagcgcc gggacttcac atggcagtgg 4741 caggcggagg aaaagagcga gaggggccgt cagaggctgt cggccaggcc cccattgcgg 4801 gagcagcggg agaggcagct gagggccgag gagcgccagc agcgggaaca acggtttctc 4861 ccggaggagg aggagaagga gcagcgcggc cgccagcgac gcgagaggga gaaagagctg 4921 cagttcctgg aggaagagga gcagctccag cggcgggagc gtgcccaaca gctccaggag 4981 gaggaggacg gcctccagga ggatcaggag aggaggcgac aggagcagcg ccgcgaccaa 5041 aaatggaggt ggcaactaga agaagaaagg aagagacgcc gccacacgct gtacgccaag 5101 ccagccctac aagagcagct gaggaaggaa cagcagctgc tgcaggagga ggaggaggag 5161 ctacagagag aggagcgcga gaagagaagg cgccaagaac aggagagaca ataccgcgag 5221 gaagagcagc tgcagcagga ggaagagcag ctgctgagag aggaacggga gaaaagaaga 5281 cgccaggagc gggaaaggca atatcggaag gataagaagc tgcagcagaa ggaagagcag 5341 ctgctgggag aggaaccgga gaagagaagg cgccaggagc gggagaaaaa ataccgcgag 5401 gaagaggagt tgcagcagga ggaagagcag ctgctgagag aggaacggga gaagagaagg 5461 cgccaggagt gggagaggca gtaccgcaaa aaagacgagc tgcagcagga agaagagcag 5521 ctgctgagag aggaacggga gaaaagaaga ctccaggagc gggagaggca atatcgggag 5581 gaagaggagc tgcagcagga ggaagagcag ctgctgggag aggaacggga gacgagaagg 5641 cgccaggagc tggagaggca atatcggaag gaagaggagc tgcagcagga ggaagagcag 5701 ctgctgagag aggaaccgga gaagagaagg cgccaggagc gggagaggca atgtcgggag 5761 gaagaggagc tgcagcagga ggaagagcag ctgctgagag aggaacggga gaagagaagg 5821 cgccaggagc tggagaggca atatcgggag gaggaagagc ttcagcgcca gaaaaggaaa 5881 cagcgatacc gggatgagga tcagcgcagt gatctgaaat ggcagtggga accagaaaaa 5941 gaaaatgcag ttcgtgataa caaggtttac tgcaaaggca gagagaatga acagttccgg 6001 cagttggaag attcccaggt gcgcgacaga caatcccagc aagatctgca gcacctgctg 6061 ggtgaacagc aagagagaga tcgtgagcaa gagaggaggc gctggcagca ggccaacagg 6121 catttcccag aggaagaaca gctggagcga gaagagcaaa aggaagccaa 
aaggcgcgac 6181 aggaagtccc aagaggaaaa gcagttgctg agagaggaaa gagaagagaa gagacgccgt 6241 caagagacag acagaaaatt ccgcgaggag gaacagctgc tccaggaaag ggaggaacag 6301 ccgctgctcc gccaagagcg tgacagaaaa ttccgcgaag aggaactgct ccatcaggaa 6361 caagggagaa aattcctcga ggaggaacag cggctgcgcg aggaacggga gagaaaattc 6421 cttaaggagg aacagcagct gcgcctcgag gagcgcgagc aactgcgtca ggaccgcgac 6481 agaaaattcc gcgaggagga acagcagctg agccgccaag agcgtgacag aaaattccgt 6541 gaagaggaac agcaggtgcg ccgccaggaa cgagagagaa aattcctgga ggaggaacag 6601 cagctgcgcc aggagcgtca cagaaaattc cgcgaagagg aacagctgct ccaggaaagg 6661 gaagaacagc agctgcaccg ccaagagcgt gacagaaaat tcctggagga ggaacaacag 6721 ctgcgccgcc aagagcgtga cagaaaattc cgcgaacagg aactgcgcag tcaggaacca 6781 gagagaaaat tcctcgagga ggaacagcag ctgcaccgcc agcaacggca gagaaaattc 6841 ctccaggagg aacagcagct gcgccgccag gagcgcgggc aacagcggcg tcaggaccgt 6901 gacagaaaat tccgcgagga ggaacagctg cgccaggaga gggaggaaca gcagctgagc 6961 cgccaagagc gtgacagaaa attccgttta gaggaacaga aagtgcgccg ccaggaacaa 7021 gagagaaaat tcatggagga cgaacagcag ctgcgccgcc aggagggcca acaacagctg 7081 cgccaggagg acagaaaatt ccgcgaagac gaacagctgc tccaggaaag ggaagaacag 7141 cagctgcacc gccaagagcg tgacagaaaa ttcctcgagg aggaaccgca gctgcgccgc 7201 caggagcgcg aacaacagct gcgtcacgac cgcgacagaa aattccgtga agaggaacag 7261 ctgctccagg aaggggagga acagcagctg cgccgccaag agcgtgacag aaaattccgc 7321 gaagaggaac agcagctccg ccgtcaggaa cgagagagaa aattcctcca ggaggaacag 7381 cagctgcgcc gccaggaact ggagagaaaa ttccgtgagg aggaacagct gcgccaagaa 7441 acggagcaag agcagctgcg ccgccaagaa cgctacagaa aaatcctaga ggaagagcag 7501 ctccgtccgg aaagggaaga acagcagctg cgccgccagg agcgcgacag aaaattccgc 7561 gaggaggaac agctccgcca gggaagggag gaacagcagc tgcgcagcca agagtctgac 7621 agaaaattcc gcgaggagga acagctacgc caggagaggg aagaacagca gctccgcccc 7681 caacagcgtg acggaaagta tcgctgggaa gaagagcagc tccaacttga ggaacaagag 7741 cagaggctgc ggcaggagcg agaccggcag taccgggcgg aggagcagtt tgccacgcag 7801 gagaagagtc gtcgtgagga acaagaacta tggcaagaag aggagcagaa acgtcgccag 
7861 gaacgggaaa ggaaattacg ggaagaacac atccgccgcc agcagaagga ggaacagagg 7921 caccgccaag tcggggagat acaatcccaa gaagggaagg gccatgggcg gcttctggag 7981 cccggcactc atcagtttgc cagtgtccca gtgcgctcca gccctctcta tgagtacatc 8041 caagagcaga gatctcaata ccgcccttaa gtgatgttgc caatatcttg acacctgcca 8101 aagcttccag cacgggaaaa tgagaaacac tgggtaccaa gtgataactc agatgtttct 8161 ggttgtggga aaactctctg atattagaat gtcttttctt ccaaaatctt aaactacgct 8221 cattttacgc actttgtact tctgcttttt attcttcctc aagtagttct ttactgcaag 8281 atgtctttct tttgctcttt gatgcagatg tggtgtgcat ttaaaaaaaa tataaatcat 8341 ttaatttgtt taagaaattt tgtttgagga acatgttcat ttattgcttt cagaagtaac 8401 aagagtaata ggatgatttg agattctaaa caatgggtcg gtttgtttaa tgactgaccc 8461 atcttgtgga aagtgcagat acttttaatg ttcaagttgc tatttcttct tgaacctaaa 8521 ttgatcattg cctccaaaca gcatttcatc cttttgtggc atagttagca caaattccag 8581 gtaactaaat ttttataacc cttgaatagt gcagggggag tgacctctgc ataaaaactt 8641 cctgtaaaat cagcccatta ctggaagaaa tatctgttaa gaataggttt agctttgaag 8701 atttagaatt taaattagat tttttttaaa ctcaactcca cttaaacaca taatctcatg 8761 aagaaataat gaggtattta gaatttaaat gagttcaaat tttaaaactg tgtctgttgt 8821 agtctatagt gttcattcta cttccccaag ttttgatgag tttcagaata ttatgaacct 8881 ttgttaattt tagcttgtta gaaggaagct gctcagaatc ccataaacat ctgtcttact 8941 ctagggccaa taagagatca catagagcat gttgggggtg taaaagggaa aaatgtgtga 9001 acataggggc aaatttctag aggccctttg acaagaccca tttgcccaca atcatttgag 9061 gcctattgat aataccttag atatattctt gttgaaataa ttggactgtg aaaaattaat 9121 aataaatgtt tggcaagtaa ctacttttgt ctgttttaac tctgcgtcaa tcataacaag 9181 atctcattgt ctggaaacta acacaagttc ccaatcacat aagggcattt tgttacttat 9241 ctatgtccaa atacgaaaaa agaggggaga gaattctttg tttttcccca accttttttt 9301 tttttttttt tttttttttt tgcagttagg ctgaactcta tttccatccc cacactgaga 9361 ttgccttcca gagtgttttt gttcttgacc cacagctttc tatgccattc ttgcagcgac 9421 tcactggtca tgacaaatac tggtgctccc aatatttgtt aatatttcct ttagagaatg 9481 cagcagcttc ttcgtctctg atgtctgatg agccaatgat agaaaatggc ctgaaacttc 9541 
agatcctcga g // LOCUS HUMTRPY1B 2609 bp DNA PRI 15-SEP-1990 DEFINITION Human tryptase-I gene, complete cds. ACCESSION M33494 NID g339976 KEYWORDS serine protease; tryptase-I. SOURCE Human adult skin DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2609) AUTHORS Vanderslice,P., Ballinger,S.M., Tam,E.K., Goldstein,S.M., Craik,C.S. and Caughey,G.H. TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a multigene serine protease family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3811-3815 (1990) MEDLINE 90251647 COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by P.Vanderslice 02-APR-1990. FEATURES Location/Qualifiers source 1..2609 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 131..135 /note="putative" TATA_signal 197..201 /note="putative" prim_transcript 225..2072 /note="tryptase-I mRNA" intron 248..457 /note="tryptase-I, intron A" CDS join(458..518,673..844,954..1219,1346..1509,1595..1759) /note="tryptase-I" /codon_start=1 /db_xref="PID:g339977" /translation="MLNLLLLALPVLASRAYAAPAPGQALQRVGIVGGQEAPRSKWPW QVSLRVHGPYWMHFCGGSLIHPQWVLTAAHCVGPDVKDLAALRVQLREQHLYYQDQLL PVSRIIVHPQFYTAQIGADIALLELEEPVNVSSHVHTVTLPPASETFPPGMPCWVTGW GDVDNDERLPPPFPLKQVKVPIMENHICDAKYHLGAYTGDDVRIVRDDMLCAGNTRRD SCQGDSGGPLVCKVNGTWLQAGVVSWGEGCAQPNRPGIYTRVTYYLDWIHHYVPKKP" exon <458..518 /note="tryptase-I" /number=1 intron 519..672 /note="tryptase-I, intron B" exon 673..844 /number=2 intron 845..953 /note="tryptase-I, intron C" exon 954..1219 /number=3 intron 1220..1345 /note="tryptase-I, intron D" exon 1346..1509 /number=4 intron 1510..1594 /note="tryptase-I, intron E" exon 1595..>1759 /note="tryptase-I" /number=5 BASE COUNT 422 a 941 c 776 g 470 t ORIGIN 1 accagctgac aggtggagct gccagtctcc agtgctcagc cctcagcggg gcctgcctgg 61 cagccccaca cacagagggc atcggggtgg cgggggcacg tgttacacgg gggccctggg 121 
tctgagtcat ccacttcctc cgagtctgga tgggaggacc cagcgcccct cctccgcccc 181 ctcctgatct ggaaggataa atggggaggg gagagccact gggtagaagg aacagggagt 241 ggccagggta agtccccact ctcagagacc ctgacatcag cgtcacctgg agcagagtgg 301 cccagcctca gactcagagc accaagaccc aggcccgcag gcctggaccc accccggtcc 361 ccccgtccca gctccattct tcaccccaca atctgtagcc cccagccctg ccctgtgagg 421 cccggccagg cccacgatgc tcctccttgc tccccagatg ctgaatctgc tgctgctggc 481 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctggt gagtcccagc cggggtccac 541 cctgcccctc accacattcc acaggtcagg gcctgggtgg gttctgggga ggtcgggctg 601 gccccccaca cagggaaggg ctgggcccag gcctggggct gcttcctggt cctgacctgg 661 cacctgcccc agccccaggc caggccctgc agcgagtggg catcgtcggg ggtcaggagg 721 cccccaggag caagtggccc tggcaggtga gcctgagagt ccacggccca tactggatgc 781 acttctgcgg gggctccctc atccaccccc agtgggtgct gaccgcagcg cactgcgtgg 841 gaccgtgagt ctcccggggc ctggaggggt ggggaagggc tggatgtgag ccctggctcc 901 cgggtgctcc tgggggctgc ccagggccct gagtgggatc ctccgctgcc cagggacgtc 961 aaggatctgg ccgccctcag ggtgcaactg cgggagcagc acctctacta ccaggaccag 1021 ctgctgccgg tcagcaggat catcgtgcac ccacagttct acaccgccca gatcggagcg 1081 gacatcgccc tgctggagct ggaggagccg gtgaacgtct ccagccacgt ccacacggtc 1141 accctgcccc ctgcctcaga gaccttcccc ccggggatgc cgtgctgggt cactggctgg 1201 ggcgatgtgg acaatgatgg tgggtctggg gacagtggag gtggggccag ggtcttagcc 1261 acagcccagc ccctgggtcc ctctgggctc caggtggggg ttgcccggcc ccctcctgag 1321 gctgcaccct cttccccacc tgcagagcgc ctcccaccgc catttcctct gaagcaggtg 1381 aaggtcccca taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg 1441 ggagacgacg tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac 1501 tcatgccagg tgggccccgc ctgtcccccg ccccccgccc cccaaccccc actcccaggc 1561 ctgttcggcg agcgctgacc tctgaccttc ccagggcgac tccggagggc ccctggtgtg 1621 caaggtgaat ggcacctggc tgcaggcggg cgtggtcagc tggggcgagg gctgtgccca 1681 gcccaaccgg cctggcatct acacccgtgt cacctactac ttggactgga tccaccacta 1741 tgtccccaaa aagccgtgag tcaggcctgg gttggccacc tgggtcactg gaggaccaac 1801 ccctgctgtc caaaacacca 
ctgcttccta cccaggtggc gactgccccc cacaccttcc 1861 ctgccccgtc ctgagtgccc cttcctgtcc taagccccct gctctcttct gagccccttc 1921 ccctgtcctg aggacccttc cctatcctga gcccccttcc ctgtcctaag cctgacgcct 1981 gcaccgggcc ctccagccct cccctgccca gatagctggt ggtgggcgct aatcctcctg 2041 agtgctggac ctcattaaag tgcatggaaa tcactggtgt gcatcgctgt gtttctggtt 2101 gtggatgtca ctgggagaga aggggtccag gtgtgctgag gacacctgcc acagtgtgag 2161 gtcctagccc tcaaggcaca gccagtcacc gtgggaccac tggaggacca acccctgctg 2221 tccaaaacac cactgcttcc tacccaggtg gcgactgccc cccacacctt ccctgccccg 2281 tcctgagtgc cccttcctgt cctaagcccc ctgctctctt ctgagcccct tcccctgtcc 2341 tgaggaccct tccctatcct gagccccctt ccctgtccta agcctgacgc ctgcaccggg 2401 ccctccagcc ctcccctgcc cagatagctg gtggtgggcg ctaatcctcc tgagtgctgg 2461 acctcattaa agtgcatgga aatcactggt gtgcatcgct gtgtttctgg ttgtggatgt 2521 cactgggaga gaaggggtcc aggtgtgctg aggacacctg ccacagtgtg aggtcctagc 2581 cctcaaggca cagccagtca ccgtgggac // LOCUS HUMTS1 18596 bp DNA PRI 21-JAN-1992 DEFINITION Human thymidylate syntase (EC 2.1.1.45) gene, complete cds. ACCESSION D00596 NID g220135 KEYWORDS thymidylate syntase. SOURCE Human DNA (genomic), clones lambdaHTS-1 and lambdaHTS-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 18596) AUTHORS Kaneda,S., Nalbantoglu,J., Takeishi,K., Shimizu,K., Gotoh,O., Seno,T. and Ayusawa,D. TITLE Structural and functional analysis of the human thymidylate synthase gene JOURNAL J. Biol. Chem. 265 (33), 20277-20284 (1990) MEDLINE 91056070 COMMENT These data kindly submitted in computer readable form by: Sumiko Kaneda National Institute of Genetics 1111 Yata Mishima 411 Japan Phone: +81-559-72-2732 Fax: +81-559-71-3651. 
FEATURES Location/Qualifiers source 1..18596 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_unit 1..148 /note="Alu sequence" repeat_unit 202..477 /note="Alu sequence" prim_transcript 822..16246 /note="thymidylate synthase mRNA and introns" prim_transcript 824..16246 /note="thymidylate synthase mRNA and introns" prim_transcript 828..16246 /note="thymidylate synthase mRNA and introns" prim_transcript 841..16246 /note="thymidylate synthase mRNA and introns" repeat_unit 862..889 /note="inverted repeat" repeat_unit 904..993 /note="triple tandem repeats" exon <1001..1205 /note="thymidylate synthase (part of exon 1)" CDS join(1001..1205,2895..2968,5396..5570,11843..11944, 13449..13624,14133..14204,15613..15750) /note="thymidylate synthase" /codon_start=1 /db_xref="PID:d1000927" /db_xref="PID:g220136" /translation="MPVAGSELPRRPLPPAAQERDAEPRPPHGELQYLGQIQHILRCG VRKDDRTGTGTLSVFGMQARYSLRDEFPLLTTKRVFWKGVLEELLWFIKGSTNAKELS SKGVKIWDANGSRDFLDSLGFSTREEGDLGPVYGFQWRHFGAEYRDMESDYSGQGVDQ LQRVIDTIKTNPDDRRIIMCAWNPRDLPLMALPPCHALCQFYVVNSELSCQLYQRSGD MGLGVPFNIASYALLTYMIAHITGLKPGDFIHTLGDAHIYLNHIEPLKIQLQREPRPF PKLRILRKVEKIDDFKAEDFQIEGYNPHPTIKMEMAV" intron 1206..2894 /note="thymidylate synthase intron 1" protein_bind 1660..1665 /bound_moiety="Sp1" protein_bind 1890..1895 /bound_moiety="Sp1" protein_bind 1895..1900 /bound_moiety="Sp1" exon 2895..2968 /note="thymidylate synthase (exon 2)" intron 2969..5395 /note="thymidylate synthase intron 2" repeat_unit 3049..3347 /note="Alu sequence" repeat_unit 4953..5245 /note="Alu sequence" exon 5396..5570 /note="thymidylate synthase (exon 3)" intron 5571..11842 /note="thymidylate synthase intron 3" repeat_unit 5768..9405 /note="L1 sequence" repeat_unit 10125..10389 /note="reiterated motif of GGA/TGAT" exon 11843..11944 /note="thymidylate synthase (exon 4)" intron 11945..13448 /note="thymidylate synthase intron 4" repeat_unit 12135..12397 /note="Alu sequence" repeat_unit 12567..12743 /note="Alu sequence" repeat_unit 12807..13084 /note="Alu sequence" 
exon 13449..13624 /note="thymidylate synthase (exon 5)" intron 13625..14132 /note="thymidylate synthase intron 5" repeat_unit 13875..14010 /note="Alu sequence" exon 14133..14204 /note="thymidylate synthase (exon 6)" intron 14205..15612 /note="thymidylate synthase intron 6" repeat_unit 14531..14832 /note="Alu sequence" exon 15613..>15750 /note="thymidylate synthase (part of exon 7)" polyA_signal 15934..15939 polyA_signal 16228..16233 repeat_unit 17262..17553 /note="Alu sequence" BASE COUNT 4521 a 3991 c 4479 g 5605 t ORIGIN 18p11.32 on chromosome 18. 1 cctgtagtcc cagctacgcg agaggctgag gcagcagaat tacttgaacc caggaggcgg 61 aggttgcagt gagccgagat cgcgccactg cactccagcc tgggtgagag agcgagactc 121 tgtctcaaaa aaaaaaaaaa aagaccgcca gggctcaaac aaaaaacctc ggaaaagccc 181 tggcggtctt tttttttttt tttttttttt ttttttggga cagtcttgct ctgtcgccca 241 ggctggagta caatggtcgg atcttggctc actgcaacct ctgcctccca ggttcaagca 301 attcttctgc ctcagcctcc caagtagcca ccacgcccag ctaatttttg tacttttagt 361 agagacgggg gtttcaccat gttgtccagg ctggtcttga actcctgacc tcaggtgatc 421 cacccgcctc ggccccccaa agtactagga ttacaggcgt gagccaccgc gtccagcgcc 481 ctggcggttt ttaatcaagt agaaaagctg cattatacca cttgcttcgg ttgcttcagt 541 gagaacgaag aaatggaaat gcaaatccct tattagttgt aggaaacaga tctcaaacag 601 cagttttgtt gacaagaccg caggaaaacg tgggaactgt gctgctggct tagagaaggc 661 gcggtcgacc agacggttcc caaagggcgc agtccttccc agccaccgca cctgcatcca 721 ggttcccggg tttcctaaga ctctcagctg tggccctggg ctccgttctg tgccacaccc 781 gtggctcctg cgtttccccc tggcgcacgc tctctagagc gggggccgcc gcgaccccgc 841 cgagcaggaa gaggcggagc gcgggacggc cgcgggaaaa ggcgcgcgga aggggtcctg 901 ccaccgcgcc acttggcctg cctccgtccc gccgcgccac ttggcctgcc tccgtcccgc 961 cgcgccactt cgcctgcctc cgtcccccgc ccgccgcgcc atgcctgtgg ccggctcgga 1021 gctgccgcgc cggcccttgc cccccgccgc acaggagcgg gacgccgagc cgcgtccgcc 1081 gcacggggag ctgcagtacc tggggcagat ccaacacatc ctccgctgcg gcgtcaggaa 1141 ggacgaccgc acgggcaccg gcaccctgtc ggtattcggc atgcaggcgc gctacagcct 1201 gagaggtgac gccgcgggcc cctgcgggac 
gggtggcggg aaggagggag gcgcggctgg 1261 ggagagcgct cgggagctgc cgggcgctgc ggaccccgtt tagtcctaac ctcaatcctg 1321 ccagggaggg gacgcatcgt cctcctcgcc ttacagacgc cgaaacggag ggtcccatta 1381 gggacgtgac tggcgcgggc aacacacaca gcagcgacag ccgggaggta agccgcgtcc 1441 cagcggctcc gcggccgggc tcgcagtcgc cccagtgatg ccgtggcccc cgaggcgggc 1501 gtcatcgggc agcgtttgcc cagtgctgga gggttaggga gagctgcctg ggcttgaccg 1561 cgcgccggtc tcaaagtcct ggctttggcc cctcctccgt tttcccctgt ggaccattcc 1621 gcttcgcagc gttttcaaaa actggagcga aagtgatgtg ggcggggcaa aggcggcggg 1681 aagaggacag cactgaagct ggcgcgggaa cttggtttcc tggtggcctc ccatccaatc 1741 cccacgaacc agctttcctc ttaaaccttg aaaagagaaa ttcgggagtt cgagttctta 1801 gtcgtccttt cctctttcct ttccgacagg agcaccccag gcaaaaaatg tctcgcgggt 1861 cattggcgcc aggctttcag gggacagtgg ggcggggcgg ggtgggcaca ggacgttagg 1921 cagccgttgg ccctccctaa ggccacaccg tcctgccgtc ctggatcctg cgccagctgc 1981 gcgggggagg ggactcgaag gtgtgtgagc caggggctga ccttgaccgc tcagataaat 2041 ggagcgcagc cttgacacag gggtggaggt ggttttgaat ggggaaaccc attcgtggtg 2101 aagcagattc actgtagcta gcggaaaagc cctccggccc acggacccat ctagagacga 2161 atacatagca gctgctgtgg ctgattggcg tgggacagcg tggggagttt tgtctgagga 2221 gagggatcca cttttctgca gctccaagcc caggggcctt tgatgagcca tagacctcat 2281 ttttaaccca cctttctgct tagacattga gcaagttact tctcatatag cttccctata 2341 tgttaaaaat ggagaaaata atgcttagta ggcaattctg ataaaagcag gtgcttgcaa 2401 aaatctctct gttgtctgaa tataaactgt accacaagcg agtgcggatg aacgaggact 2461 gcatttaaag ataagttttt acactttcat ttctctgtgg ctcgacactt ctgatgcctc 2521 cctttttgtt cctgggacac atgcttggtg ttgtcttcac acctttgtga caggattagc 2581 actagtgggc agtggatgat agctcctcct cccttttgcc acatgttcat ccctgccctc 2641 gccaccatct cactgtgtgg aattcctgtg tccactggtc accggggcac agaagtgctg 2701 tctcagcctg aatcgggcca ctgatgggac ttgcagcctg ggagctccac cgtgatctct 2761 ggcccacttt gcgggagtct aggctttctg gatgctccag gcctcacgtc ccagggcagt 2821 tttcttccct gaagaaagtt ggatggcatg atctgtcttc ccatcttgaa accgtatggc 2881 aaattgtttt tcagatgaat tccctctgct gacaaccaaa 
cgtgtgttct ggaagggtgt 2941 tttggaggag ttgctgtggt ttatcaaggt aaagaagtcg ctgctattag aagtcagtag 3001 tctgttctca acacagcagc cagtgagatc ctttcaaaac tcaaagcagc caggtgtggt 3061 ggctcacgcc tgtaatccca ccgctttggg aggctgagtc agatcacctg aggttaggaa 3121 tttgggacca gcctggccaa catggcgaca ccccagtctc tactaataac acaaaaaatt 3181 agccaggtgt gctggtgcat gtctgtaatc ccagctactc aggaggctga ggcatgagaa 3241 ttgctcacga ggcggaggtt gtagtgagct gagatcgtgg cactgtactc cagcctggcg 3301 acagagggag aacccatgtc aaaaacaaaa aaagacacca ccaaaggtca aagcatatca 3361 ttcctcaccc tcaagccctt agtggctcca tttcactcag taagagccac ggtccttatg 3421 gtgtccgttt ttcagctctg accttagctg ctgctctctg caccaccctg ctgttcttgt 3481 gagtttttga gcacaccggg acatccccac tccctggaac cttcttcccc cacacttggc 3541 ttcttccttt gagtctctac tccactcggg caagccttcc tagacctcct gatttaaaac 3601 tgtgactctc ccccaacctc cttggtgttt ctccgtagac gaacatcacc atctgatgta 3661 tgtcagcctt tcccttcccc tgttagaagg gggacagcag gtagtaaaag tgaaatgtgc 3721 tgtaagcttt atgagggcag aggatttgtt tctcgtgttc actgttgtat cgccagggcc 3781 tcaaacacag cctgccacat agtaggagtc aacatatatt gatcactaaa tgtagatacc 3841 acctgtgttc ccatgttcat ataaattcta gaagagtctc ttcagtaaca aggtgaaccc 3901 cttccagagg gctgagtagg tacctcaggc cggggccaga gtgctgtgaa gacagcagca 3961 gcccagacca agcttctctg tgttccgtgt cctggtctag aaccagcgat gttctttctg 4021 accagtgctt tttggaaggt ggctgaggtc tgggctcagg tctgggccat actagaagct 4081 gggatccctt ctatagagca cttggtatgg cttgtatggt cttggggcaa gccagaccca 4141 agccctctta tcccatttta gaaagggctt caatttggat ccagccccag gtctgcctta 4201 gctctgtatt cttggggtat tttgttctgt attggcctat cttgactaac aatgagcctt 4261 ggatttgaaa catatcatca gaaacctcag aagacaacat tcttaaactg gctagagcct 4321 ggtctgaatg gatgaaaagg agagactttt gaagcaatat gtaaaagatt gagaaatgat 4381 ttgttggaaa tttctcaatt ggagaaattt ctttgatttg ttggaaattt ctttgattct 4441 ttctcaatca aagaaaatcg ggacaaactc aacaatagaa agggaggaag caagatactc 4501 agaaataaaa tgcattcccc tgtttcaact taatgcttca attcaggatt ctaaggaatc 4561 cttgccagga atgtcagact caccttgata gttggagtta ctccattggt 
gactcgatca 4621 aatacaggag ttgaggcacc tgcactgtaa aatactgatt agtctgatca ttaggaatat 4681 cctgtatgcc aggtagaaga tacattgaac agattgcatg taggcattaa attcattttg 4741 gggtattaca tatagacaac acatttcatt aagaaacata aaactgtcag atcggtggaa 4801 tacttaaaag cacttggagg tgtttagcct aaaaagctta gttgagggga atggaagaaa 4861 agatctggga gggtggttcc aaagaaggga tcagactatc ctaaagccct caggaatctg 4921 ggctgggacc acctacttaa agataggatg ggcagctggg tgtggtggct cacgcctgta 4981 atcccagcac ttcgggaggc cgaagcgggc ggatcacctg aggtcaggag ttcgaggcca 5041 gcctgaccaa catggagaaa cgctgtctct actaaaaata caaaattagc tgggtgtagt 5101 ggcgcatgcc tgtaatccca gctactcggg aggctgaggc aggggaatcg cttgaacctg 5161 ggaggtggag ggtgccgtga gccacgatcg cgccattgca ctccagcctg ggcaacaaga 5221 gcgaaactct caaaaaacaa aaaaaaggat gggttccata tgggtggtgt caagtgccca 5281 cctcctagca agtcagcagg ggccagaggc ccttgtaagt ggtgtctcgg ggggatcaac 5341 tgagatggct taagatttac ctggatgcct gctctgctct ccccatctct tccagggatc 5401 cacaaatgct aaagagctgt cttccaaggg agtgaaaatc tgggatgcca atggatcccg 5461 agactttttg gacagcctgg gattctccac cagagaagaa ggggacttgg gcccagttta 5521 tggcttccag tggaggcatt ttggggcaga atacagagat atggaatcag gtgaggagat 5581 agaacaatgc cttccatttc cgggtgccct tcctagcacg tgtttgctcc gttgttttag 5641 ataaggtctg ggggatgagt caatgtcaca ggagctgatg tatagctttg accttgtgag 5701 gggtggtgcc aggttgaagc cacaattaac gcctactgaa ggccgtttca catctttttt 5761 tttttttttt ttttaattat tatactttaa gttttagggt acatgtgcac aatgtgcagg 5821 ttagttacat atgtatacat gtgccatgct ggtgcgctgc accactaact caccatctag 5881 catcaggtat atctcccaat gctatccctc ccccctcctc ccaccccaca acatccccag 5941 agtgtgatgt tccccttcct gtgtccatat gttctcgttg ttcgattccc actatgagtg 6001 agaatatgcg gtgtttggtt ttttgttctt gcgatagttt actgagaatg atgatttcca 6061 tttcaccacg tccctacaga ggacatgaac tcatcatttt ttatggctgc atagtattcc 6121 atggtgtata tgtgccacat tttcttaatc cagtctatca tgttggacat ttgggttggt 6181 tccaagtctt tgcctattgt gaatagtgcc acaataaaca tacgtgtgca tgtgtcttta 6241 tagcagcatg atttaatagt cctttgggta tatacccagt aatgggatgg ctgggtcaaa 
6301 tggtatttct agttctagat ccccgaggaa tcgccacact gacttccaca atggttgaac 6361 tagtttacag tcccaccaac agtgtcaaag tgtcctattt ctccacatcc tctccagcac 6421 ctgttgtttc ctgacttttt aatgattgcc attctaactg gtgtgagatg gtatctcatt 6481 gtggttttga tttgcgtttc tctgatggcc agtgatggtg agcatttttt catgtgtttt 6541 ttggctgcat aaatgtcttc ttttgagaag tgtctgttca tgtccttcgc ccactttttg 6601 atggggttgt ttttttctta taaatttgtt tgagttcatt gtagattctg gatattagcc 6661 ctttgtcaga tgagtaggtt gcaaaaatgt tctcccattt tgtgggttgc ctgttcactc 6721 tgatggtagt ttcttttgct gtgcagaagc tctttagttt aattagatcc catttgtcaa 6781 ttttggcttt tgttgccatt gcttttggca taggcatgaa gtccttgccc atgcctatgt 6841 cctgaatggt aatgcctagg ttttcttcta gggtttttat ggttttaggt ctaacgttta 6901 agtctttaat ccatcttgaa ttgatttttg tataaggtgt aaggaaggga tccagtttca 6961 gctttttaca tatggctagc cagttttccc agcaccattt attacatagg gaatcctttc 7021 cccattgctt gtttttctca ggtttgtcaa agatcagata gttgtagata tgcggcgtta 7081 tttctgaggg ctctgttctg ttccattgat ctatgtgtct gttttggtac cagtaccata 7141 ctgttttggt tactgtagcc ttgtagtata gtttgaagtc aggtagcgtg atgcctccag 7201 ctttgttctt ttggcttagg attgacttgg cgatgcgggc tcttttttgg ttccatatga 7261 actttaaagt agttttttcc aattctgtga agaaagtcat tggtagcttg atggggatgg 7321 cattgaatct ataaattacc ttgggcagta tggccatttt cacgatattg attcttccta 7381 cccatgagca tggaatggtc ttccatttct ttgtatcctc ttttatttca ttgagcagtg 7441 gtttgtagtt ctccttgaag aggtccttca catccctttt aaggtggatt cctaggtatt 7501 ttattctctt tgaagcaatt gtgagtggaa gttcactcat gatttggctc tctgtttgtc 7561 tgttattggt gtataagaat gcttgtgatt tttgcagatt gattttatat cctgagactt 7621 tgctgaagct gcttatcagc ttaaggagat tttgggctga gacaatgggg ttttctagat 7681 atacaatcat gtcgtctgca aacagggaca atttgacttc ctcttttcct aattgaatac 7741 cctttatttc cttctcctgc ctaattgccc tggccagaac ttccaacact atgttgaata 7801 ggagtggtga gagagggcat ccctgtcttg tgccagtttt caaagggaat gcttccagtt 7861 tttgcccatt cactatgata ttggctgtgg ctttgtcata gatagctctt attattttga 7921 aatatgttcc atcaatacct aatttattga gagtttttag catgatgtgt tgttgaattt 7981 
tgtcaaaggc tttttctgca tctattgaga taatcatgtg gtttttgtct ttggatctgt 8041 ttatatgctg gattacattt attgatttgc gtatattgaa ccagccttgc atcctaggga 8101 tgaagcccac atgatcatgg tggataagct ttttgatgtg ctgctggatt cggtttgcca 8161 gtattttatt gaggattttt gcatcaatgt tcatcaagga tattggtcta aaattctctt 8221 ttttggtgtg tctctgccca gctttggtat caggatgatg ttggcttcat aaaatgagtt 8281 agggaggatt ccctcttttt ctattgattg gaatagtttc agaaggaatg gtaccagttc 8341 ctctttgtac ctctggagaa ttcggctgtg aatccatctg gtcctggact ctctttggtt 8401 ggtaagctat tgattattgc cacaatttca gctcctgtta ttggtctatt cagagattca 8461 acttcttcct ggtttagtct tgggagagtg tatgtgtcaa ggaatttatc catttcttct 8521 agattttcta gtttatttgc gtagaggtgt ttgtagtaat ctctgatggt agtttgtatt 8581 tctgtgggat cggtggtgat atccccttta tcatttttta ttgcgtctat ttgattcttc 8641 tctttttctt tattagtctt gctagcggtc tataaatttt gttgatcctt tcaaaaaacc 8701 agctcctgga ttcattaatt ttttgaaggg ttttttgtgt ctctatttcc ttcagttctg 8761 ctctgatttt agttatttct tgccttctgc tagcttttga atatgtttgc tcttgctttt 8821 ctagttcttt taattgtgat gttagggtgt caattttgga tctttcctgc tttctcttgt 8881 gggcatttag tgctataaat ttccctctac acactgcttt gaatgtgtcc cagaggttct 8941 ggtatgttgt gtctttgttc ttgttggttt caaagaacat ctttatttct gccttcattt 9001 cgttatgtac ccagtagtca ttcaggagca ggttgttcag tttccatgta gttgagcagt 9061 tttgagtgag attcttaatc ctgagttcta gtttgattgc actgtggtct gagagatagt 9121 ttgttataat ttctgttctt ttacatttgc tgaggagagc tttacttcca actatgtggt 9181 cggttttgga ataggtgtgg tgtggtgctg aaaaaaatgt atattctgtt gatttgggat 9241 ggagttctgt agatgtctat taggtctgct tggtgcagag ctgagttcaa ttcctgggta 9301 tccttgttga ctttctgtct cgttgatctg tgtactgttg acagtgggtg ttaaagtctc 9361 ccattattaa tgtgtggagt ctaagtctct ttgtaggtca ctcagatgat tggcacttac 9421 tgggcgcttg gcactttcca tactgtgtca tcggcagata gctgcatggt tggtgttcgt 9481 gctggggaat gggaagttca tcggtgggac aaggacaaaa tgcccccatt gctttgttgt 9541 ggctttaatc tccctttcga ggctgagcca cagcgtgctg taggtggcgc tgctgtgaag 9601 cgcagtacca gggtcacact ccactcccag ctctgcagag gtggagaaag aatgaaacat 9661 ctcactcctg 
gacttccact ttcctgtcac tgttggtgtc acctcttact ggatgtcaca 9721 gagcccagcc cctcccacct gtgcctagga aaagcagatg ccaccttgga atgtggggtt 9781 tgtgtgtgca atttactagc tgggcagaga ccagcaacct ggagagcagg tgtctcgtct 9841 aaggggacag tcacatttca cctccagcca cctggaggaa tttgggcctg gtgatgtcag 9901 aattcttcaa taaaagccta aaatctatat tttatgtgcg gtcatgagat ctgttaaatg 9961 ttagcaactt caggaagttt aaaaatgctg tgtggaccta gaataggcaa gttcttaaag 10021 gcagaaagtg gaatgctagt ttccagggac tggggaacag ggaggaatgg ggagttcatg 10081 tttaatgggc acagaggttt tgttagggat gacgaaaaag ttcgggagat ggtgatggtg 10141 atggagatgg tgatggtgat ggagatggtg atggtgatgg tgatggtgat gggtgatggt 10201 gatggtgatg gtgatggtga tggagatggt gatggtgatg gtgatggaga tggtgatggt 10261 gatggtgatg gtgatggaga tggtgatggt gatggagatg gtgatggtga tggtgatgga 10321 gatggtgatg gtgatggtga tggtgatggt gatggtgatg gtgatggaga tggagatggt 10381 gatggtgatg gttgcctaac atcaggaacg tgcttaatgc ttctgaattg cacacaaaaa 10441 tggcaagttt aatattatgt gtactttatc acaatgaaaa aagctgctgc gtgggccaag 10501 ttacttgtgc aggtaatgtt ctgcaggtgg ttgcctgcac ctcagttgta gggtgtccgt 10561 aggatgtgag gccagtcccc gggcttaatg atgctttaaa tcctgcctag tattcaatta 10621 tttcttgtcg cttaaaaggc ctaataaaat tatggtctta gtttacagtg gtatgaatgc 10681 ttagctgttg gattttagta ggaaagttcg tccctttttg tttttaattt tgttttacag 10741 attcacagga attttttttt tttttttttt tttttttttt taatgcacag aaagtttccc 10801 tggactctct acccagtttc cccagtgata atatcttggg taacatcctg tatacattca 10861 cattggtgca ttcctcagag ttgtcagatt ttgctagttt tacgtgcact tgtgtatgtg 10921 tgtatttgca attttagcac gtgtagactc ttgtaaccac tacaatcaag ttacagaact 10981 acactaccaa ggttcatctt tttaaaatct ttgatgttac cttttttgga acagtgacca 11041 tgagaggact ttcctcccaa aattttgaaa actactgaac cagaatatag tctgacacta 11101 ataggtagaa atttaaccaa aggagattat gaagctctgc acttgagtta acaaaatcac 11161 ttctcagctt ccagttccat ctcagaagga aggaaaaggg attaaaaatc cagagaccag 11221 aaaatgggag caaagtacaa ggtggtgtaa tcattacaga ggtttcctga tgtttccaag 11281 tcagtcgtgt gttgagctgc taaactctaa agtaatttta ggtggaatgt tggaaacatg 11341 
ctgctgaggt gatagaaagg aatccatggt cctctgttag ttggaaagta tatggaatac 11401 tatattctac ataagataca atactctctg tgagacaagg ataaagtaga ttttgtcagt 11461 gaaattgtga caagaatcgc tgatgggttt agagcctaag tttgcgagga gcactggaag 11521 aaattaagat tgttgagatt ggaaagggtt agctatgggg gaacaggagg aggtgactcc 11581 atgacagacc aaatattcaa aggactgtgt agaagaggaa aaagactttg ttagggctcc 11641 agaggacaga gccaggagtc agacagggcc ttgaactcaa cccaccgaga tctgcaaact 11701 ttgcaggatg caccagatgt cttgtagcca tgggtcaagg ggggaccctg ggtaagagac 11761 tgtaatagat gacctctaag gccatctcat gacatgtgtg attaatgtat gtacctgtcc 11821 tctctttttg acaattctac agattattca ggacagggag ttgaccaact gcaaagagtg 11881 attgacacca tcaaaaccaa ccctgacgac agaagaatca tcatgtgcgc ttggaatcca 11941 agaggttgaa agaaccccgt cgtcttcatt tatactaacc atactcttag agggaagcaa 12001 tctggttttg tgcagaggca ctgagggagg caggaccctg ggcaacttcc cccagccaca 12061 tggttgtgtg acgttgggca agtcacattt tgctgcactt tcaccttcag atcatgaggt 12121 tgggcccaga ggattttttt tttttttttt ttttttgaga cagagttttg ctctgttgcc 12181 caggctggaa tgcaacggcg tgatcttggc tcactgtaac ctctgcctcc tgggttcgag 12241 tgattctcct gcctcagcct ccaagtagct gggattacag catgtgccac catgcctggc 12301 taattttgta tttttagtag agacgggttc acatgttggt caggctggtc ttgactcctg 12361 accctcagat gatctgcctt gcctcagcct cccaaccgag tgatcttaag ttgtgtatta 12421 tactcattct tacacaaaaa gggctttaaa tgcctagaaa ctacatgaag atgttaacat 12481 tttaaatgga agcagatgaa gttccagctc gctgccacct cactaacatt tttaacaatt 12541 atattgtaaa attcaactct accagggtgt agagccaggt gtggtggctc acacctgtaa 12601 ttccaacaac tccagaggcc aaggcgagag gatcatttga acccacggaa tttgaggctg 12661 tagtgagtca tgatcacgcc attgcactcc atcctgggca acagagtgag accctgaata 12721 tttaaaaaca acaacaacaa caaaactcta tcaggatatc ataagtactt agagtgaaat 12781 acttgcatct gtaatagaga cttatttttt ttttttttga gacacagtct caccctgttg 12841 cccaggctgg agtgcagtgg tttgatctcc gctcacggca acctccatct cccaggttca 12901 agtgagttcc cattcctcag ccccagagct gggaccacag gcgcgcgaat ttttgtattt 12961 ttagcagaga cggggtttca ctatgttggc caggctagtc tcaaactcaa 
gttggcctca 13021 agtgatctgc ccaccctggc gtcccagtgt tgggatttca ggcatgagcc actgtgcctg 13081 gccatgtaat agagactttt aatataggag ggtgtaccag aagcaccagt ttcctgtggc 13141 aaacagaatt attcctgctg tatttgtaat ttggtgccac gaggtagccc agatcccttc 13201 agctctgatg gaagagcatt gcttcagccg taaatggaca cctgcagaaa ccttgcaccg 13261 atggatagtc tccctcagct ccgtgccatc gctgcagggg ctgttatgga catcactgca 13321 gcccagtggc tctctctcct ggtctccacc atatgagttg gcttctgttt ctctcctgtt 13381 ttactttgcc tttagctgtg gtctttcaaa ccaccatccc tccttatctt cctctgctgg 13441 ttcctcagat cttcctctga tggcgctgcc tccatgccat gccctctgcc agttctatgt 13501 ggtgaacagt gagctgtcct gccagctgta ccagagatcg ggagacatgg gcctcggtgt 13561 gcctttcaac atcgccagct acgccctgct cacgtacatg attgcgcaca tcacgggcct 13621 gaaggtgggc tgtctcggga agggtgactt gccagcctac cacatgagct cttcagttct 13681 ttaatatggg aaaacaaatt gcagagttta gtctctgatt agcttttaaa tttgatatgt 13741 gtaagtaaga catgaaccag cttttacttt gaaaccttcc ttttctggaa ggttttctgg 13801 ccctgtggta tatgcactaa cagatctata caggttgttt gtgatacagc ttctatggat 13861 cttctcaaaa gctatgctga ggttgggtat ggtggctcat gcctgtaatc ccagcacttt 13921 ggaagactga gacaggagca attgcttgag gtctggagtt caataccagc ctgggcaaca 13981 taacaagatg ctgttgctac aaaaaaatgg aaaagctaca ctaaattatt tttttaaaaa 14041 aagccttgcg gtgtctgcat attctaatgt ttttaaatga tgttttaaag aattgaaact 14101 aacatactgt tctgctttct cccggtttat agccaggtga ctttatacac actttgggag 14161 atgcacatat ttacctgaat cacatcgagc cactgaaaat tcaggtaaga attagatgtt 14221 atacttttgg gtttggtacc ttctcttgat aaaaggttga ctgtggaaca ggtatctgct 14281 caatgctgtg tccaagataa agatgactgc tccaaatgtg gggcttcagt ttagggagaa 14341 gtggtgggca ggtgggcagg acaaggcagg catctgcctc agcaaccatg gcacttaact 14401 tgtcaggtgc tgtgaggtac taagcaccag taccagagag ggaagagcca cattcaagcc 14461 aggggattgt ccaaaaggag gcattttaac tcattttaac ttgaaggaga attgaagtgc 14521 aaatgttttt ccttttcttt ttttttgaga tggagtcttt ctctgtcggc caggctggag 14581 tgtgccgtgg tgcgatctca gctcactgca acctccacct cccgggttca agcaattctt 14641 ctgcctcagc ctcccaggta gctgggatta 
caggcacatg ccaccacacc cagctaattt 14701 tttgtattat tagtagagat ggggtttcgt catgttggcc aggctgatct caaactcctg 14761 acttcaagtg taccacctgc ctcagcctcc gaaagttctg gaattacagg cataagccac 14821 caccctggcc ataaatattt tttgttaatt ttacattaag tacaatattt aggtccaaac 14881 ttcaaaagtc tgttgaaatc cctgaagtta tagcagccaa caattgatat gaaatggcaa 14941 taaaaatgta agttcatctg cttcatgagc cttaaggaaa aaaactcaga accagacact 15001 ttttagcccc ttccaggtta gatccaggtt ttaaaagtta ttcctttgag ggagtttggc 15061 tgcttttgag tggaggtgac ttcaggctta ttctctctgg ctctctgctc tggtcatttt 15121 tagacatagt aataggttgt gacctgtctt cacatcctaa ttgccactgt ctgttcatcc 15181 caggaatcct ggctttcatc cctttctgtt cactgtccat gcatgtcatc tttccttctt 15241 tctgccaggg accagatggg ttagggattg tgaattcaag taaacgtaga gctactatga 15301 gttacagatt gactgtgttc ctgtctttaa taaatttgcc aagagtggtt ataagaactt 15361 acacctgatg aggcaccagg ctcctgatgc tgtgtaatgt cacaaaatac ccctcactct 15421 cgatctgtgc aagagaacag ctggttgcgc tccaatcatg ttacataacc tacgcgaagg 15481 tatcgacagg atcatactcc tgtaaaatag aactttgttg atcacatcct gtgtacttgt 15541 ttcacggaca tgaggagcaa ttacaacagg tcgtacaatt atggcaaaat aatggcctta 15601 ttttgttttt agcttcagcg agaacccaga cctttcccaa agctcaggat tcttcgaaaa 15661 gttgagaaaa ttgatgactt caaagctgaa gactttcaga ttgaagggta caatccgcat 15721 ccaactatta aaatggaaat ggctgtttag ggtgctttca aaggagctcg aaggatattg 15781 tcagtcttta ggggttgggc tggatgccga ggtaaaagtt ctttttgctc taaaagaaaa 15841 aggaactagg tcaaaaatct gtccgtgacc tatcagttat taatttttaa ggatgttgcc 15901 actggcaaat gtaactgtgc cagttctttc cataataaaa ggctttgagt taactcactg 15961 agggtatctg acaatgctga ggttatgaac aaagtgagga gaatgaaatg tatgtgctct 16021 tagcaaaaac atgtatgtgc atttcaatcc cacgtactta taaagaaggt tggtgaattt 16081 cacaagctat ttttggaata tttttagaat attttaagaa tttcacaagc tattccctca 16141 aatctgaggg agctgagtaa caccatcgat catgatgtag agtgtggtta tgaactttaa 16201 agttatagtt gttttatatg ttgctataat aaagaagtgt tctgcattcg tccacgcttt 16261 gttcattctg tactgccact tatctgctca gttccttcct aaaatagatt aaagaactct 16321 ccttaagtaa 
acatgtgctg tattctggtt tggatgctac ttaaaagagt atattttaga 16381 aataatagtg aatatatttt gccctatttt tctcatttta actgcatctt atcctcaaaa 16441 tataatgacc atttaggata gagttttttt tttttttttt taaactttta taaccttaaa 16501 gggttatttt aaaataatct atggactacc attttgccct cattagcttc agcatggtgt 16561 gacttctcta ataatatgct tagattaagc aaggaaaaga tgcaaaacca cttcggggtt 16621 aatcagtgaa atatttttcc cttcgttgca taccagatac ccccggtgtt gcacgactat 16681 ttttattctg ctaatttatg acaagtgtta aacagaacaa ggaattattc caacaagtta 16741 tgcaacatgt tgcttatttt caaattacag tttaatgtct aggtgccagc ccttgatata 16801 gctatttttg taagaacatc ctcctggact ttgggttagt taaatctaaa cttatttaag 16861 gattaagtag gataacgtgc attgatttgc taaaagaatc aagtaataat tacttagctg 16921 attcctgagg gtggtatgac ttctagctga actcatcttg atcggtagga ttttttaaat 16981 ccatttttgt aaaactattt ccaagaaatt ttaagccctt tcacttcaga aagaaaaaag 17041 ttgttggggc tgagcactta attttcttga gcaggaagga gtttcttcca aacttcacca 17101 tctggagact ggtgtttctt tacagattcc tccttcattt ctgttgagta gccgggatcc 17161 tatcaaagac caaaaaaatg agtcctgtta acaaccacct ggaacaaaaa cagattttat 17221 gcatttatgc tgctccaaga aatgctttta cgtctaagcc agaggcaatt aattaatttt 17281 tttttttttg acatggagtc actgtccgtt gcccaggctg cagtgcagtg gcgcaatctt 17341 ggctcactgc aacctccacc tcccaggttc aagtgattct cctgcctcag cctcccatgt 17401 agctgggatc acaggcacct gccaccatgc ccggctaatt ttttgtattt tttgtagaga 17461 cagggtttca ccatgttggc caggctggtc tcaaacacct gacctcaaat gatccacctg 17521 cctcagcctc ccaaagtgtt gggattacag gcgtaagcca ccatgcccag ccctgaatta 17581 atatttttaa aataagtttg gagactgttg gaaataatag ggcagaggaa catattttac 17641 tggctacttg ccagagttag ttaactcatc aaactctttg ataatagttt gacctctgtt 17701 ggtgaaaatg agccatgatc tcttgaacat gatcagaata aatgccccag ccacacaatt 17761 gtagtccaaa ctttttaggt cactaacttg ctagatggtg ccaggttttt ttgcacaagg 17821 agtgcaaatg ttaagatctc cactagtgag gaaaggctag tattacagaa gccttgtcag 17881 aggcaattga acctccaagc cctggccctc aggcctgagg attttgatac agacaaactg 17941 aagaaccgtt tgttagtgga tattgcaaac aaacaggagt caaagcttgg tgctccacag 
18001 tctagttcac gagacaggcg tggcagtggc tggcagcatc tcttctcaca ggggccctca 18061 ggcacagctt accttgggag gcatgtagga agcccgctgg atcatcacgg gatacttgaa 18121 atgctcatgc aggtggtcaa catactcaca caccctagga ggagggaatc agatcggggc 18181 aatgatgcct gaagtcagat tattcacgtg gtgctaactt aaagcagaag gagcgagtac 18241 cactcaattg acagtgttgg ccaaggctta gctgtgttac catgcgtttc taggcaagtc 18301 cctaaacctc tgtgcctcag gtccttttct tctaaaatat agcaatgtga ggtggggact 18361 ttgatgacat gaacacacga agtccctctg agaggttttg tggtgccctt taaaagggat 18421 caattcagac tctgtaaata tccagaatta tttgggttcc tctggtcaaa agtcagatga 18481 atagattaaa atcaccacat tttgtgatct atttttcaag aagcgtttgt attttttcat 18541 atggctgcag cagctgccag gggcttgggg tttttttggc aggtagggtt gggagg // LOCUS HUMTSHB2 986 bp DNA PRI 09-MAY-1994 DEFINITION Human thyrotropin beta (TSH-beta) subunit gene, exons 2 and 3. ACCESSION M21024 J03937 NID g339996 KEYWORDS thyrotropin. SEGMENT 2 of 2 SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 986; 14 to 213; 619 to 986) AUTHORS Wondisford,F.E., Radovick,S., Moates,J.M., Usala,S.J. and Weintraub,B.D. TITLE Isolation and characterization of the human thyrotropin beta subunit gene: Differences in gene structure and promoter function from murine species JOURNAL J. Biol. Chem. 263, 12538-12542 (1988) MEDLINE 88315051 COMMENT Draft entry and printed copy of sequence for [1],[Unpublished (1988) NIH NIDDK/MCNEB Bethesda MD] kindly provided by F.E.Wondisford, 08-JUN-1988. 
FEATURES Location/Qualifiers source 1..986 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" intron <1..23 /note="thyrotropin beta subunit" /number=1 exon 24..186 /partial /note="thyrotropin beta subunit first expressed exon" /number=2 CDS join(25..186,635..889) /note="precursor" /codon_start=1 /product="thyrotropin beta subunit" /db_xref="PID:g339998" /translation="MTALFLMSMLFGLACGQAMSFCIPTEYTMHIERRECAYCLTINT TICAGYCMTRDINGKLFLPKYALSQDVCTYRDFIYRTVEIPGCPLHVAPYFSYPVALS CKCGKCNTDYSDCIHEAIKTNYCTKPQKSYLVGFSV" sig_peptide 25..84 /note="thyrotropin beta subunit signal peptide" mat_peptide join(85..186,635..886) /product="thyrotropin beta subunit" intron 187..634 /note="thyrotropin beta subunit" /number=2 exon 635..>889 /note="thyrotropin beta subunit" /number=3 BASE COUNT 274 a 188 c 177 g 347 t ORIGIN About 3.5 kb after segmetn 1; chromosome 1. 1 ctttaatttt atctttgatt tagcatgact gctctctttc tgatgtccat gctttttggc 61 cttgcatgtg ggcaagcgat gtctttttgt attccaactg agtatacaat gcacatcgaa 121 aggagagagt gtgcttattg cctaaccatc aacaccacca tctgtgctgg atattgtatg 181 acacgggtat gtagttcatg tcacttcttt tggctgtaaa ttatataagc cctgaagaag 241 tccattccta tatagaaagg aaatgaaata aatcacaacc tcatttccca aatctaatgg 301 ttattggctc cttagaagca gagtacacag gttacaatat tatgtgaatc tactcagcac 361 aatggatacg cataatttta taacagtttt gtgtcccagc tttacttaaa ccttatcttg 421 ttcccatgat caacgatgaa agagaggagg gtctcacttt tgtctctgta gaattcaacg 481 tggttaagtt ggtattggag aatggggcta agcaattctt tcccagttgt atttgtgatg 541 aaggaatata agtgaattta tttttatgtt tctattatct atatgtttcc taaagtcctg 601 tcacattatg ctctcttttc tgttctttcc ccaggatatc aatggcaaac tgtttcttcc 661 caaatatgct ctgtcccagg atgtttgcac atatagagac ttcatctaca ggactgtaga 721 aataccagga tgcccactcc atgttgctcc ctatttttcc tatcctgttg ctttaagctg 781 taagtgtggc aagtgcaata ctgactatag tgactgcata catgaagcca tcaagacaaa 841 ctactgtacc aaacctcaga agtcttatct ggtaggattt tctgtctaat agtgatataa 901 tttgcaattt ggttaaatgt gcttgcctga aataaagcta ataaaaatat tatgtttcac 961 
attatcttct gttcattttg agtact // LOCUS HUMUBILP 3583 bp DNA PRI 15-MAR-1989 DEFINITION Human ubiquitin-like protein (GdX) gene, complete cds. ACCESSION J03589 NID g340069 KEYWORDS ubiquitin-like protein. SOURCE Human SV40-transformed fibroblast DNA, (library of Okayama and Berg) clones lambda-Gd[T7,11,5A,3C], and fibroblast, cDNA to mRNA, clone pGd6405. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3583) AUTHORS Toniolo,D., Persico,M.G. and Alcalay,M. TITLE A 'housekeeping' gene on the X chromosome encodes a protein similar to ubiquitin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 851-855 (1988) MEDLINE 88124938 FEATURES Location/Qualifiers source 1..3583 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript 550..3181 /note="UB-lp mRNA and introns (alt.)" prim_transcript 550..3443 /note="UB-lp mRNA and introns (alt.)" CDS join(585..632,836..941,1188..1396,1518..1628) /note="ubiquitin-like protein" /codon_start=1 /db_xref="PID:g340070" /translation="MQLTVKALQGRECSLQVPEDELVSTLKQLVSEKLNVPVRQQRLL FKGKALADGKRLSDYSIGPNSKLNLVVKPLEKVLLEEGEAQRLADSPPPQVWQLISKV LARHFSAADASRVLEQLQRDYERSLSRLTLDDIERLASRFLHPEVTETMEKGFSK" exon <585..632 /note="ubiquitin-like protein" intron 633..835 /note="UB-lp intron A" exon 836..941 /number=2 intron 942..1187 /note="UB-lp intron B" exon 1188..1396 /number=3 intron 1397..1517 /note="UB-lp intron C" exon 1518..>1628 /note="ubiquitin-like protein" /number=4 BASE COUNT 668 a 1080 c 1174 g 661 t ORIGIN X chromosome. 
1 cccgggtgtc ccactctggg agaggtggga ggatgagggc agaggcgtag cccccctgcc 61 ccctctcaag ctcagtggag cgtttcgcag ccagaccaga aaggaggagg taggaagcca 121 ccagacaggg agcgggggcg ggccgtctgt gggtatgcag ggccaggcgc cgccagcagt 181 cctagcgcgt ggggtgggcc gagcagtccg cgggggcgag cgtaccttct ggccctccct 241 tctccatgtc ccagctccca gatgacgcca acttggcccg actggttcct gctcccaact 301 cccccgcgtc gttgctttgg gaggggtccg ccggccaggg cggtcgatgc gcgcgtgcta 361 ggcgggaccc cggggggctc ctcccgggcg caattgggcc ggggggcggg gccggggcgc 421 ggacggggca ggagggcggg gccgggaagc gcgggcggcg ggcgcgcccc tctcgctgct 481 tccggcggcg gcgggcggtt ccagcgcgcg cgcccggggc ggcggcgcgc ggcggggggt 541 ggttggggtg cgcgccggcc cgagtggacg ccgtcgcgac cgccatgcag ctgacggtga 601 aggcgctgca gggccgcgag tgcagcctgc aggtagggtc cccgggccgg gccgccgggt 661 ccggcgcgct ctctcgcccg ctcctggcag ggggaccggc agggccgagg gcgggcacgg 721 ggaggccggg ccgagggccg ggagccggga gagctgccct cgggggcgga cgcgtgggct 781 tcgccgtctg ggggccgcgg gctcaaccct tgcctcccac cctccgcgcg gccaggtgcc 841 agaggacgag ctggtgtcca cgctgaagca gctggtctcc gagaagctga acgtcccagt 901 gcgccagcag cggctgctgt tcaagggcaa ggccctggca ggtacccagg gagaggagac 961 gcccggggag ccccgcagga agcgggaccg gggtgcgcgg gccgaggcca gcagccgtgt 1021 gtcgggtcgg ggtgcgggcg ccaagcgcca catgacccca agggaggcgg ccgcatgcct 1081 cacagccgga tgccgcagcg ctccgctccc cgcccgaggc ggcctcgcgc cccgggccac 1141 ccccacggcg gtggaggagg aggaacccat cgatctgtct ttggcagatg ggaaacgact 1201 ctcggattat agcatcgggc ccaactccaa gctcaaccta gtggtcaaac ccctggagaa 1261 ggtgctacta gaagaaggcg aggcccagag gctggccgac tccccacccc cgcaggtctg 1321 gcagctgatc tccaaagtct tggcccgcca cttcagtgcg gcagatgcca gcagggtcct 1381 ggaacagcta cagagggtga gaagggtaac cctgggcatc ctcttcaggc ccagttgctc 1441 ccagccccta cctctgttgc agtgtatgcc ccccccccac tggcagcccc tgaatcttcc 1501 tgcccttctc tccccaggat tacgagaggt ccctgagtcg cctgacgctg gacgacatcg 1561 aacggttggc cagccgcttc ctgcaccctg aagtgactga gacaatggag aagggcttct 1621 ccaaatagaa ttctcggagc atggggaggt gcccaacgcc aggctaccgc tgcatgtcgc 1681 actaagtgtg ttctcctgtt 
gcagttgggc tcatcatcgt catagctggc atgtacctgg 1741 ctctggccag gtgctaggca ctcctcacag cttgacctgg gtttgcttcc acaccctcag 1801 aagaggagag ccggcacagc agaggcaccg gcgatgaagt ggcacagccc cagtcggaat 1861 ccagcggctt ctgaaagtgc cttggtgtga gaaggaggaa ggggccgctt ggagagctgg 1921 gctctcgcat gtcatgttta gcccactgag aatataccct caggtgactc ctcccgatcc 1981 tgaaagggaa agcagctgtc acctgacttc tggcgggtcc aacatagccc tcagcttacc 2041 tctgcaggag gccgagtgac agccagccct ggaacccccc gacccccgca gctgctgcag 2101 ccagtgtgcc agtgtgcgtt cgacaaatgg aaaagcagat tggggccagg gagtcagcaa 2161 ggcaccccag ctctgtgatg tgactttggg cgagtctcag tgccacttgg tccccagcca 2221 tggactctat acagtgaggg ccactcaccg acctgagtca gtgtcctctg tctcacaggg 2281 cccctctccc ttctgttcag taaacaaact gaagccaaaa atgaagccgt ggccaagcgt 2341 gacccagaga tggggtgtcc tggcccttag tgacacagct cctctctggg gcaccctatc 2401 tgcttgtctc gtccaggaag cgttagggcg atggggactc ggcaaggcag tgtcagagct 2461 gggactgggc gtaggccttt gtcctcatcc ccagcactgg cctccctgtg ggaagatgaa 2521 catatcccag ccacctgtgt acaggggctc actttgtgtg ctccttgttg cctggagaag 2581 aaccttgggg tgccagggtg ggggcagaag catgggctgg gttccggttc atcctcctcc 2641 accctgccgt gtgtgtgggc acaagaggac atctaaccac ctgctccttg gaggaggccc 2701 ccaggggtgg tagaggctgg aaggaagcca catcaggagg acgccactcc ggcccttcac 2761 ccttgccaag tgagctgctc acagtgtggt cagggctgcg cgtgctggag gccctcctgc 2821 ctgggccttg tggggcaaat attgggtccc caggctggaa agatggacag aggcccaatg 2881 ggtgaaggct ttgaagagca cacagaagcc cctggccccc cacgagagct ggagagccat 2941 gtatatggct tcaaagccac ctacggcagg gacacactcg tgagcatgtg tggcctgcag 3001 ttcaggtgat acatttacca gtgttcttgt ttgtgtggtg ccaggaaatt gattttggaa 3061 aaagtgaaat aacattaaag gtgaatgtga ggcttctact tttatccaaa aggagctata 3121 ttagctaggc tgtttctgat atccaatcat tggtttaaca ataaaggcaa tttgtttaat 3181 cagttaacgg aaatttcttg gcttatgaaa tgaaaagtcc agtggtattg gcattggcag 3241 caggtgagca atttcaccca gtgtcttctg cctccctctg cgttggtatc tgctacatcc 3301 caggccacca cctccgagga tgaaaagatg gctgcctgca gcttccacgg aatccctccc 3361 tcactgccag agcagcatct tctctgtgta 
ccacctctgc tgtctcagat gcccccagca 3421 aataaacact cttctcgttg gtcagaactg gattgtgcgt ccaatcattt tggctggggt 3481 agggggtaat tctccgtcag ggctggctcc acttggagct aggcatgggg accacttttc 3541 actggacaca catcggctat gcaatgggac agcaaggact act // LOCUS HUMV2R 2282 bp DNA PRI 20-DEC-1993 DEFINITION Human vasopressin receptor V2 gene, complete cds. ACCESSION L22206 NID g347522 KEYWORDS vasopressin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2282) AUTHORS Seibold,A., Brabet,P., Rosenthal,W. and Birnbaumer,M. TITLE Structure and chromosomal localization of the human antidiuretic hormone receptor gene JOURNAL Am. J. Hum. Genet. 51 (5), 1078-1083 (1992) MEDLINE 93035372 FEATURES Location/Qualifiers source 1..2282 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1..260 /gene="V2R" /number=1 precursor_RNA 1..2276 /gene="V2R" gene 1..2276 /gene="V2R" CDS join(236..260,622..1506,1613..1818) /gene="V2R" /note="A second possible start codon is located at position 242-244" /codon_start=1 /product="vasopressin V2 receptor" /db_xref="PID:g398043" /translation="MLMASTTSAVPGHPSLPSLPSNSSQERPLDTRDPLLARAELALL SIVFVAVALSNGLVLAALARRGRRGHWAPIHVFIGHLCLADLAVALFQVLPQLAWKAT DRFRGPDALCRAVKYLQMVGMYASSYMILAMTLDRHRAICRPMLAYRHGSGAHWNRPV LVAWAFSLLLSLPQLFIFAQRNVEGGSGVTDCWACFAEPWGRRTYVTWIALMVFVAPT LGIAACQVLIFREIHASLVPGPSERPGGRRRGRRTGSPGEGAHVSAAVAKTVRMTLVI VVVYVLCWAPFFLVQLWAAWDPEAPLEGAPFVLLMLLASLNSCTNPWIYASFSSSVSS ELRSLLCCARGRTPPSLGPQDESCTTASSSLAKDTSS" intron 261..621 /gene="V2R" /number=1 exon 622..1506 /gene="V2R" /number=2 intron 1507..1612 /gene="V2R" /number=2 exon 1613..2276 /gene="V2R" /number=3 BASE COUNT 364 a 755 c 703 g 460 t ORIGIN 1 agaagatcct gggttctgtg catccgtctg tctgaccatc cctctcaatc ttccctgccc 61 aggactggcc atactgccac cgcacacgtg cacacacgcc aacaggcatc tgccatgctg 121 gcatctctat aagggctcca gtccagagac cctgggccat tgaacttgct cctcaggcag 181 aggctgagtc 
cgcacatcac ctccaggccc tcagaacacc tgccccagcc ccaccatgct 241 catggcgtcc accacttccg gtaaggcttg cccctccatg agtccggtgg gcagagtggg 301 tttgacgatt cagggaagcc cctctttcta aagacctcct tcaccctcac ctctgggtgt 361 gtctctccag gctgccaatg agtggggagg ggagcacagc cccacttccc cgccagggct 421 ggggctgggg ctggggctgg ggctgccctt ccttctggac tgcatgagcc tggggtgtgt 481 atccctcata acatggcttt cctggagtcc cctctgctag gagccaggaa gtgggtgtcc 541 ggatgggggc acgggaggca ggcctgagtc cccctgcaca gcaccctctc taaccaggcc 601 ctcttcccga ctcctgccca gctgtgcctg ggcatccctc tctgcccagc ctgcccagca 661 acagcagcca ggagaggcca ctggacaccc gggacccgct gctagcccgg gcggagctgg 721 cgctgctctc catagtcttt gtggctgtgg ccctgagcaa tggcctggtg ctggcggccc 781 tagctcggcg gggccggcgg ggccactggg cacccataca cgtcttcatt ggccacttgt 841 gcctggccga cctggccgtg gctctgttcc aagtgctgcc ccagctggcc tggaaggcca 901 ccgaccgctt ccgtgggcca gatgccctgt gtcgggccgt gaagtatctg cagatggtgg 961 gcatgtatgc ctcctcctac atgatcctgg ccatgacgct ggaccgccac cgtgccatct 1021 gccgtcccat gctggcgtac cgccatggaa gtggggctca ctggaaccgg ccggtgctag 1081 tggcttgggc cttctcgctc cttctcagcc tgccccagct cttcatcttc gcccagcgca 1141 acgtggaagg tggcagcggg gtcactgact gctgggcctg ctttgcggag ccctggggcc 1201 gtcgcaccta tgtcacctgg attgccctga tggtgttcgt ggcacctacc ctgggtatcg 1261 ccgcctgcca ggtgctcatc ttccgggaga ttcatgccag tctggtgcca gggccatcag 1321 agaggcctgg ggggcgccgc aggggacgcc ggacaggcag ccccggtgag ggagcccacg 1381 tgtcagcagc tgtggccaag actgtgagga tgacgctagt gattgtggtc gtctatgtgc 1441 tgtgctgggc acccttcttc ctggtgcagc tgtgggccgc gtgggacccg gaggcacctc 1501 tggaaggtgg gtgtagccgt ggctagggct gacggggcca cttgggcttg gccgcatgcc 1561 cctgtgcccc accagccatc ctgaacccaa cctagatcct ccacctccac aggggcgccc 1621 tttgtgctac tcatgttgct ggccagcctc aacagctgca ccaacccctg gatctatgca 1681 tctttcagca gcagcgtgtc ctcagagctg cgaagcttgc tctgctgtgc ccggggacgc 1741 accccaccca gcctgggtcc ccaagatgag tcctgcacca ccgccagctc ctccctggcc 1801 aaggacactt catcgtgagg agctgttggg tgtcttgcct ctagaggctt tgagaagctc 1861 agctgccttc ctggggctgg tcctgggagc 
cactgggagg gggacccgtg gagaattggc 1921 cagagcctgt ggccccgagg ctgggacact gtgtggccct ggacaagcca cagcccctgc 1981 ctgggtctcc acatccccag ctgtatgagg agagcttcag gccccaggac tgtgggggcc 2041 cctcaggtca gctcactgag ctgggtgtag gaggggctgc agcagaggcc tgaggagtgg 2101 caggaaagag ggagcaggtg cccccaggtg agacagcggt cccaggggcc tgaaaaggaa 2161 ggaccaggct ggggccaggg gaccttcctg tctccgcctt tctaatccct ccctcctcat 2221 tctctcccta ataaaaattg gagctcattt tccacatggc aaggggtctc cttggatcct 2281 ct // LOCUS HUMVIPAA 10961 bp DNA PRI 30-JUN-1995 DEFINITION Human vasoactive intestinal peptide/PHM-27 gene, exons 1-6. ACCESSION M33027 M37460 NID g340253 KEYWORDS vasoactive intestinal peptide. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10961) AUTHORS Yamagami,T., Ohsawa,K., Nishizawa,M., Inoue,C., Gotoh,E., Yanaihara,N., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of human vasoactive intestinal peptide/PHM-27 gene and its inducible promoter JOURNAL Ann. N. Y. Acad. Sci. 
527, 87-102 (1988) MEDLINE 88267775 FEATURES Location/Qualifiers source 1..10961 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6q24-q27" gene 3312..3418 /gene="VIP" exon <3312..3418 /gene="VIP" /note="vasoactive intestinal protein/PHM-27, (first translated exon); G00-120-490" /number=2 CDS join(3312..3418,5295..5417,6398..6502,7263..7394, 8225..8270) /note="vasoactive intestinal protein/PHM-27" /codon_start=1 /db_xref="PID:g340254" /translation="MDTRNKAQLLVLLTLLSVLFSQTSAWPLYRAPSALRLGDRIPFE GANEPDQVSLKEDIDMLQNALAENDTPYYDVSRNARHADGVFTSDFSKLLGQLSAKKY LESLMGKRVSSNISEDPVPVKRHSDAVFTDNYTRLRKQMAVKKYLNSILNGKRSSEGE SPDFPEELEK" intron 3419..5294 /note="vasoactive intestinal protein/PHM-27 intron A" exon 5295..5417 /number=3 intron 5418..6397 /note="vasoactive intestinal protein/PHM-27 intron B" exon 6398..6502 /number=4 intron 6503..7262 /note="vasoactive intestinal protein/PHM-27 intron C" exon 7263..7394 /number=5 intron 7395..8224 /note="vasoactive intestinal protein/PHM-27 intron D" exon 8225..>8270 /note="vasoactive intestinal protein/PHM-27" /number=6 BASE COUNT 3616 a 1792 c 1854 g 3699 t ORIGIN 1 gaattcagga taaatttgac tgcagtactt tttcttatgt ttattttttc tggaagcatc 61 caaaagttaa cgctggctac aggacctctg gtttacaatt catttccttg aaagaaaaga 121 agaaataaac acaattctat aaagtctgtg aattaaaact ttgaacctgc agatggtgct 181 atgattccat tttcaaattg ttttgctttc tagtttaaat tcataagaga aaatgcagtt 241 gtttggttga gtttactttt attcagaggt aataaaattg aattatttct ttttatttta 301 tcttattatt attataatag aagcaattgt cttttgtatt gggaaaagct actatagtta 361 ctctttgtaa atgctaagtg cttgcctagg ccaaataatg atcagcttta catcttttat 421 gtttaaaatt tttcttctta tcattcagca gtcatttgtc tgtgaccctc aattaagttg 481 ctttgctgtt tctgctctta aataaaattg gttcactaag aattgggtga tgaagagcta 541 aggataaaaa aattccatat gagaactgtt tcaacctcta gtttgtaata atcaggtaaa 601 aaagatttcc tggaattaag ccacaggaac tctggcttaa gtcagagctg tcaactggga 661 aacaaatttc cagacatttt gaaacttaat tcatttaatt tttctggtaa ctggattaga 721 aaatatgatt aagcatagca ggatattctt 
ttactggatc agtctgactt tgaacgagga 781 aatagatact acaatcttaa agtagaagaa agaggactgt aaagccattc tgtcttatga 841 aacttagaaa caaatgtaaa ataatacaca tggcagaagc ctttaaataa aaggaaaaat 901 ggcattttcc tctgttgttt aggaaatttt catgtggctg cattttacct taaactctga 961 ttgaatgtgt gctagactac ctttagttac aaatccacaa atagaagtga agtatttatc 1021 aaagtactat ataaatgctt ttcaactatt tgaagccacc tgtcaagtga ttgaatcaga 1081 aatgttcaaa actggtttga gtatgtgtgt gtgttgggga tgagggtaga aaactaattc 1141 tacattatgc aaagacttat aaaagtgatg aggaaaatct ttaaaaagaa aagacaatta 1201 aattccttga ggtatatagc ctgacaacaa tttatttttt ggatcatttg acacagtgta 1261 tgtaaaagtg cattgggaat tgtgccattt tatacaaata tgaaatagca ctacttgtac 1321 tgctttgtac aacagcgtgc atatcttact atcttagaga ctttctgaaa cacttatctt 1381 ggaattagtt tcaacaaaga cgaaatgtaa taataggaag gtgaaaatta tctctagagc 1441 ttttaaagtt aggatagatt tcaagaaaga ttaaatctgg aactatatga atagcataaa 1501 aattaaaaat attaagagaa ttgatacttt gtaataaagt ataattatac ttttataagc 1561 gtaatgaaca taagtgtttg ttgctttatg catttgtttc tacacatgaa gtctttattt 1621 tctgaaggaa tcccaatggc cctggctata acactacaat cattttctta ttttacaatt 1681 aattaaaggt taaaacattt cacattgagc ttgctacttt gtttgtttaa agaaaagtaa 1741 atttcctctt atcttacatt ttgaagttgc tcactataaa cttatataaa ttaggcataa 1801 tgacaattaa gtaagaactt caagccctat tcatcccatg gccgtcatac tgtgacgtct 1861 ttcagagcac tttgtgattg ctcagtccta agtataagcc ctataaaatg atgggctttg 1921 aaatgctggt cagggtagag tgagaagcac cagcaggcag taacagccaa cccttagcca 1981 ttgctaaggg cagagaactg gtggagcctt tctcttactc ccaggacttc agcacctaag 2041 acagctccaa aacaaaccag aacagtcagc tccgggggag cacgactggg cgaggtaagt 2101 gaaaacttta cctttcctct tctttttccc tcctgcctct ggaacttttc tcacaaaata 2161 atgaattttt ctctcttgaa aagaatttga tagtgctttg cttttgcaac agcaaaaaac 2221 aagtttctca agtcacctgg taaggataga aatacttcaa ttctctcata tggatagatt 2281 ttctgaattc ttttgtttgt aagatgcttt accctgcttg ctcttttact gaatactgat 2341 ttttacaaat aattcattca agtaaagttt acttgtatgt agatttttga ctgaaaattt 2401 cagtgggaac atttgctttt gaaattttca acttattttt 
aaagtgcttt ttttttccta 2461 tttcgtatgt tggttaagtg aatctaagat gcatttattt tttccttaag tttaaattcc 2521 ataattttta taggcagcaa gagaaacaga ccttactcct aggcttttaa tctagaaact 2581 agaaatgaaa tgtgtaagtt gattaaatgc aagaaatgca agctttgctt ttaatatata 2641 ttttcctctt atggaaaaat ttatttcaat gacaacatat gattcagtaa tattttgagt 2701 attaatattc actactgagt taagaagttc aaatccaggt gtaaggaaat aagtttccgt 2761 aatatctaca attgaaagag gaaaaagtaa tgtatataca aaatcttttt taaaattata 2821 tgtatatata cacattatat ttatacattt gcacacacac gagcagaata attattaaat 2881 agatgatact ctagtcttat attagtcaga tactgaaatt ttaaagcaga agaaaatcaa 2941 tgtcttgaag ctgtttcctt cgtcaaaaaa tacactaagc ttgagtggta catccctgtt 3001 atgaaaagat tgtccccagg gttatttcca gcactataga agtatggcat tcaatagata 3061 ctggggccaa ccatgtgccc agacagtagg tgaaaaggat tctaattttt attcaagcac 3121 ctgtcactga gagaaatact gatttttgaa tgtatcatct tgcaatttaa ctctttctac 3181 aagatgaaaa ttactcttta caaactctgc aagagccatt gaaaattact gctagacaat 3241 cagaattacc ttgctggtga catctttggc tggaagaact gaggtgattc tctctcttta 3301 gaggcacaga aatggacacc agaaataagg cccagctcct tgtgctcctg actcttctca 3361 gtgtgctctt ctcacagact tcggcatggc ctctttacag ggcaccttct gctctcaggt 3421 aagttccctt tcaattcaaa catctgaaca ttccttcctc atctaaatgg gaatttccta 3481 tacattttgt tggacaacta aatagtataa attatgtatt atgtatttaa atttttcctt 3541 gatgtgttga gtgagaggtg tttgtcaaca tcaggtctga gtcctggaat tcactttcag 3601 ggcatgagag acactgtcag ttattagttt tatgggctga gtggatcaca ttcccaaaca 3661 tggtgaaagg aggctttcac tttcttttta ttcaaaacca tctttagata aataataagc 3721 ttttatccat tgttatctca agtcagttca atttctggtg gaaaaaagct gtttagttgc 3781 caaccacatt tgagtttatt ttcaagatta atcagtaaat aatcataaga ctagtagttg 3841 attagtgaaa catgtataaa atcatgttgc acagaaatct atctaggcag agtcaatttg 3901 ccaatattaa atatggggta attgatctat tgagatacac tcccccaata ggaaaatttt 3961 gatatacttt gtttggacca taaatcctta gttttattgt tctatatgtt atattatata 4021 ttccaaatcc ataaagtata acacaactct tatttaaaac tttattaatg tatgctaaag 4081 aagtaaacaa taaatgaaaa ctatattagc ttgaattttc ctttcagttt 
acgggcacca 4141 ctgtatctcc aaaggaaaaa taataattgg gttggtttcc ttctttgctt attaggtcat 4201 tcttgatttt ttgcttaagg tagacttatg ctcaacagag agaataacct cttagctatt 4261 agtgttctct gtgaaatctg tttaaataca gctcctgcgt tgctttgata actaaacaaa 4321 atggttctat ctgaatagtc taccacagca gatgaacaat cacaagaatg tggacatttg 4381 tgttgattgc taagcttcag acccacagag gactagttgc tagtgactcc caactcccaa 4441 atcactgaaa gatacacaag tgtcccagag ctgacacgct ccttggatgt ttgccctaac 4501 agtgaaaatc tgcacacaca aaattataag agttgccggt gattgaaata tgcatgcatc 4561 cataaatata gtcatattcc atatatgcat ttgactattc ttagtctttc agtgtaacat 4621 tgtacatccc ttattaaagt tgtgttaata cataagatta gaaaaaagaa ggccccataa 4681 aacttattga attcaaattc atcttcatag atgagctatc catctatgga gacattcttt 4741 gaactctgtc aatacagaga aatatataat gccaaagtaa atatttgaag caaagcttgg 4801 gggtcgagag agagagaagc agttaatact ctgttcttca gagatgtttg gaacaggtaa 4861 tactgaacat caccaaatcc ataaattgtt atcagttctc attcaatatt agttttttgc 4921 aagtttctat tttgcataaa atatttctac atacgaaatt tgactttgca aagtgaagat 4981 attcttggca tagtaaatta caccaaagaa gtatagtttg cttaatgctc tgataagatt 5041 gaattaagta gctcttgtgt tttgtatatt cagcttataa aagtcatatg gcatctactt 5101 tataggataa taggaatttc ctatttatac tctcattaac tatatgagcc attaagtcaa 5161 attatgctat tctcatatta tgactgggta atacttccca agagtctaag tatcttattt 5221 taaatggttg cggaaccata cacactgtct tctgaaaaaa gaatgtatta ttctcgtgta 5281 actttcccca tcaggttggg tgacagaata ccctttgagg gagcaaatga acctgatcaa 5341 gtttcattaa aagaagacat tgacatgttg caaaatgcat tagctgaaaa tgacacaccc 5401 tattatgatg tatccaggtg agtttatttt tataaaacta tccaatgagt tttattttag 5461 aaaatgtgtt taagaactat aaacgtgctt attttatctc accatgaagc tattccgata 5521 gcaaaacact agaggtttct atgtgtcatg gcttcattca tcctgattta ctgcagatct 5581 tacagctgta cataattttc atactgaaag gttgtaacaa gccaggatga acacaccatt 5641 cacttccatt ctgttgcaca taattccaac atcacagatt gtaccaattc tgataaaagt 5701 acagcattta gcattccaat gatggaagag ttgccaagga gttttgtctc tatataaaga 5761 aaagatacct gatagagctg ttcattcata gagctgtttg tctagagcat catttcaagt 
5821 acagaccaag tagctctata gtacttggta cagacaaagt ataaaaggtg cttcatttga 5881 aagagaaaag gtgggatgat aggccttaat gccttctcta tcttaagaga ttttgtttcc 5941 aaatattttg gaaaaagaaa aaaaaaggtg tactggggtt tatgaacttt ccaaaatcag 6001 aatattttcc cattgtgaat gtctttattc agtggggcag acacaaagca ccatgtacaa 6061 atagcagcta atttaaatag atcatatttt tacaccaata tttcttagaa agaaatagtt 6121 ataaagattc aaccatggag gaatattgaa atgtgctcct agaaatgcct gtttctttaa 6181 tttttcagtg tcttgggcta gaaatatagg gtatatcttg atataagtga ctagaagatc 6241 tcttttgtta gggtgttaag aaatctcctt ttgaacttta gttttaatga gaagctttca 6301 taataagccc ataatttttt tgttatttaa taagccataa taatattcta gaaagccatt 6361 tacaaaataa tagctatttt tttcttcctt gttttagaaa tgccaggcat gctgatggag 6421 ttttcaccag tgacttcagt aaactcttgg gtcaactttc tgccaaaaag taccttgagt 6481 ctcttatggg aaaacgtgtt aggtaaagag aatttattat ttttataaaa tatgttatca 6541 ttattcaatc tgaatattgt attttcactc ttaacaagtt atttactttg tttgaaattg 6601 aaaactcgtt gataatgttt aatttagtta aatgaaataa cttgtattct tcaataacat 6661 ctatcaagcc catggactaa gactttttca taatatcata ttcattgact ccagacacac 6721 atttttctca agtagcaatc atttagcaat ctgcaatcaa attgaacaag actcagtaaa 6781 acaatcaaaa agagatagta cagactgagg taaattttaa ggtcaaacag aatattcttt 6841 ttacttcctg ctcaaattaa atggagaatt ccaaatggcc aacagtcaaa tatttagatg 6901 acgttgctca gggtagcata gcttctaaag aagtaattat gtttttttta aaaaaaaaaa 6961 aaatttctcc caatggcaga gcacaacttc agctattccc aaatttttat ttttggatga 7021 attatttgct cagctgtaca gaatgtgtca taaaacagtc ccaggccatg ccaaaatatg 7081 gcagatgatt ccttgctttc cttcatcctt aggtttagtt gtatttttct tttagtcttt 7141 ttatgataga aaattagttg ggtgggaatt gctttttatg tggctccaag aaacctggaa 7201 catgtgtgta tttatcattt cttgtgaaaa ctctttgatt tccttttcct catgttcctt 7261 agcagtaaca tctcagaaga ccctgtacca gtcaaacgtc actcagatgc agtcttcact 7321 gacaactata cccgccttag aaaacaaatg gctgtaaaga aatatttgaa ctcaattctg 7381 aatggaaaga ggaggtaaag aaaaagagaa cttgctaaaa tgaggaatca tgactgactt 7441 tcaaaataga agctgcacat ggagacctct ctatctcact tatctagcta tactacatga 7501 
agcagtgatt ttcaaccttg gctgacatta gactaccccg cagaacttta aaaatacaga 7561 tgtctgggcc ttctcccaaa ccaattgcca ggaaatcggt gtcttcggtg gtgtggccct 7621 agacatacat ttcttttaag agttccccag ttgactttac tatgctgcca gtgttaaaaa 7681 tgatgactat acagacatct agtctgaatc taagagcaag tttcatcaaa agtgatgatt 7741 tttaaaacat gtaaagcata gtcaattgca atttggtaac acatggaact cacttcgtgt 7801 gcatattcac tactgttatt taatcaaatt aaaccattag gtattttgta tatttagatt 7861 ctgtaaaatc tatatctctc tatggtgtgt aaatattgga cacaatacat aaacagtgta 7921 gtgttctgtc tttttattcc aattgttttt gtttaagtga tgagtattaa ggataattct 7981 gtttgggtgt gatcagacaa gatgacctct ttgcccacta gatgataatt ctgtgagaat 8041 aattttagaa actctaagat tgatgaataa ttcataggta ctaacaaacc tcaaagataa 8101 ttgtttagcc tccccataca ctttcagagc acagagaaca gcacattcat tatgtcataa 8161 tagtttcttt agaccctttc tcatctgaga gccttaatat gtacaatgtt tttctggtct 8221 gcagcagtga gggagaatct cccgactttc cagaagagtt agaaaaatga tgaaaaagac 8281 ctttggagca aagctgatga caacttccca gtggtgggta tattcgtgca ttccttctgt 8341 attcttatgg ctgtctctga ttatataagt agctaagagt cacttagtaa gaaacatctt 8401 agggttaaat agtttctcat catgagacct catccaagac agacacttct acaacagtgg 8461 accccaaccc agtaaacagc ttagaatgat tcatgaaatt tttttgatct tgtaaaagag 8521 agcactcaca ataagcaaac gttttgagat ctgaatagtg ttcctggctt acttggggtt 8581 atttctacct ttgtatcaag ataaaaaaac agaggaggct gactcacact gacaagaaaa 8641 atgggctttc cttttgagga tcgacccatg ctgcttttat taattgaagg tttagtctca 8701 caattgactg ttagctcctc aattaagtat agggaaatag caccaaatta gagcatggcc 8761 cattactatg aaaatatcaa ggcattctgg gtctaactat attccttgaa acatggcaaa 8821 cttcatagag caaaagttaa atttagcttg ttttacctgt aagtttgttt tacctatatg 8881 actctaatag cattctctgc caacattttt tttgagcaga aaaaacatgg atagactttt 8941 tttctttgtg aagtaataag cttgctatca aaattgattt tatataaatg tgggttcaag 9001 acatatttct ttcctgagtt acattacatt aatattcaca ttcgctcata aaaaatctat 9061 aatactgggc aagtgcttaa agaaatttca tgctgtcacc atatcaaagg ttagcagatt 9121 ttaaagggct atggagtaaa atatgtttgg ctttgtgagc catctatttt ctgttagagt 9181 gacttaactc 
tgctatggta gcacaaaagc agccacagtt agatgtaaaa cgaataggag 9241 tgactgtgtt ccaataaaac tttattcata aaacctgaag gtgagccaga ttccaatccc 9301 taactaggtt aaggagatta aattcaccca tgctcaaact aaaggcaatt catggcctta 9361 cctcagttcc tgatggggtt atctaaggtt ttcaacagtc atcaagagtg gacatggtgt 9421 ctgggaagat acctgcttaa ttgactgctt ttgtccacta attaccctta gacctctaag 9481 agaaagcaga ggcaggagtg tggcccacat caactcttta tcttttatag acatctcaga 9541 gaaacaagtc cctttgctca aagaagaaag ttaattacat atagcttttc taagagtgtt 9601 tataagaagg cttctccatc aaattaagaa acacggtagg cactaaaata taaaggagaa 9661 ttgattaatc acactgtgct ttgtaaatgg taagatcact tatgagattc catcctccat 9721 tcttttcttt tttaaatggg tggctttgga caaattcttt gaagaggaaa tcatgtgggt 9781 catccattag tgaaaattac aagtgaatca ttaataatgg ttagttagac gtcttttcac 9841 attccagttc ctctctgata atcagtcctc tagaagctgc tctcttggta tgtcccagga 9901 catgatcccc cgacaggctg ttaacaaggg acaaagagaa ataagtgttt atcagataag 9961 taaatcagtg aatgaaacta agatgctaga aataaaatat attttatgcc taaagcaaca 10021 attttttctt tttcagaatt cttgaaggaa aatgatacgc aacataatta aattttgagt 10081 tctacataag taattcaaga aaacaacttc aatatccaaa ccaaataaaa atattcgtgt 10141 tgtgaatgtt gtgatgtatt ctagctaatg taataactgt gaagtttaca ttgtaaatag 10201 tatttgagag ttctaaattt tgtctttaac tcataaaaag cctgcaattt catatgctgt 10261 atatcctttc taacaaaaaa atatatgtaa tgataagtaa atgctaggtt aattccaatt 10321 atatgagacg tttttggaag agtagtaata gagcaaaatt gatgtgttta tttatagagt 10381 gtacttaact attcaggaga gtagaacaga taatcagtgt gtctaaattt gaatgttaag 10441 cagatggaat gctgtgttaa ataaacctca aaatgtctaa gatagtaaca atgaagataa 10501 aaagacattc ttccaaaaag attttcagaa aatattatgt gtttccatat tttataggca 10561 acctttattt ttaatggtgt tttaaaaaat ctcaaatttg gattgctaat caccaaaggc 10621 tctctcctga tagtctttca gttaaggaga acgacccctg cttctgacac tgaaacttcc 10681 ctttctgctt gtgttaagta tgtgtaaaat gtgaagtgaa tgaaacactc agttgttcaa 10741 taataaatat ttttgccata atgactcaga atattgcttt ggtcatatga gcttccttct 10801 gtgaaagtac atttggagac acaactattt ttccaaaata attttaagaa atcaaagaga 10861 
gagaaaataa agacctttgc ttatgattgc agataatttt tttgtttgat tattttatat 10921 gattcagcag taactcattt tagctctaaa gttctaggca c // LOCUS HUNPIV 5124 bp DNA PRI 29-MAR-1995 DEFINITION Human corticostatin/defensin HP-4 precursor gene, complete cds. ACCESSION U18745 NID g665926 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5124) AUTHORS Tremblay,A. and Solomon,S. TITLE none JOURNAL Unpublished REFERENCE 2 (bases 1 to 5124) AUTHORS Palfree,R.G., Sadro,L.C. and Solomon,S. TITLE The gene encoding the human corticostatin HP-4 precursor contains a recent 86-base duplication and is located on chromosome 8 JOURNAL Mol. Endocrinol. 7 (2), 199-205 (1993) MEDLINE 93225961 REFERENCE 3 (bases 1 to 5124) AUTHORS Tremblay,A. TITLE Direct Submission JOURNAL Submitted (15-DEC-1994) Andre Tremblay, Dept. of Medicine, McGill University/Royal Victoria Hospital, 687 Pine Avenue West, Montreal, Quebec, H3A 1A1, Canada FEATURES Location/Qualifiers source 1..5124 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="M92" /clone_lib="human EMBL-3 Sp6/T7 (Clontech)" /tissue_type="placenta" /dev_stage="adult" /chromosome="8" CAAT_signal 1702..1706 TATA_signal 1752..1759 exon 1824..1862 TATA_signal 3116..3122 /note="putative" exon 3175..3358 sig_peptide 3187..3243 CDS join(3187..3358,3945..4066) /citation=[2] /codon_start=1 /product="corticostatin/defensin HP-4 precursor" /db_xref="PID:g665927" /translation="MRIIALLAAILLVALQVRAGPLQARGDEAPGQEQRGPEDQDISI SFAWDKSSALQVSGSTRGMVCSCRLVFCRRTELRVGNCLIGGVSFTYCCTRVD" exon 3945..4263 mat_peptide 3962..4063 /product="corticostatin/defensin HP-4" BASE COUNT 1334 a 1318 c 1085 g 1387 t ORIGIN 1 ggatccccat ttgtcttcag tgtaacccat tagttaaacc gcctactgca aggaaaccac 61 aaggcttgga tcagatcatg aggctgccct acaagttatg ccaaaaaata tggacttgga 121 agacctgtct gttataatat cacacccaaa tctaaccagc tctgccaata acagctctct 181 cctatgttac taggaaaatg 
cctatggatt ggagtgtgtt ctgtgtgcag gaggctggtc 241 caggtttcac ttctgcagga cactggacat ccccacaacc accagacctt ccccacgtgc 301 acacacaccc cttctcattt tgcctctaca tccatatcca ctgggccctt caggcaccta 361 ctaatgccct agaacctaaa accatcatct ggggcccagt tccccaaata gccctaattt 421 cttcctctgc tggaatgagt ccagtgccca cttcctccaa aggtgaaatt gctgggcctg 481 caacagatca ggaactcact gcttctcata ggggcagccg acttcactgc tctggaacag 541 cgaccacccc tagcgaggct tgagatgcct cttccctcct taagactgag agcgccgctg 601 cccccagtcc tccatagccc agtgcctggc tgccttcagc cagagctgca ggggaggccc 661 tgagcaccca agtcctgctg gaccagcgct gtgcacggcc ctcccatggc ggcaggggct 721 gcctggactg catactgggt tcagcaacct cactataggt attcattccc tcaggaacaa 781 ctgcattctt ttctcatttc cagaaacctc atcccgttta cctcactaca aggaggagga 841 tggtggagag tggtacattt taaaatgtgc actagtctcc ctgggactcc ccttcaaata 901 acccaggagg gaccacacaa gggaaagctt atgcatcccc cccacccagt gaccatcttc 961 ctaactctgg gtgtagggag actcgtaagc ctacgggatt ggtttgggaa cagggtattt 1021 gagctcacaa cacaaggtga tgcaagctaa caccaatctc gctgcagctt tggccaccat 1081 cctaagggac ttctgacaga cattaggtgt cacgcaatca tttgatgagt ccttggcctg 1141 gatgacctag acagtcattt aggcttgaac tatctaaggc caagcaaaaa ggtgactgtc 1201 ccctctagga accacatgct atatgcacat cctttactcg ggagcctgca acctgcccta 1261 tccagcaaca caagcccagg cgtattcagt ctcatccagg tattctccaa ccttacttgt 1321 ctgaatggct tggatttgtt tttatggtta gaccccaggg cctgggaggt cagttcagac 1381 cacattccaa atcctcatct gtgtgtgggt ggcattttga tcctagtctc ctcgcaaggt 1441 gtatacaaca atatgcaggc caggctctcc tggtggcttt aaatattccc tcggtccagg 1501 tagttcagcc tcagccacca gcataggtat catggggtca attgtcttag gagtcatgag 1561 gaatccacag ttgattgctg cctgggcctg gccagggctg accaaagtag acgaggggtc 1621 ggtacctccg tggactcctg cttgaactcc agctttctgc caaatttctc aactgccctt 1681 gttaacagtt atttaaagta cccaatagaa agtaacgctg aaaaattagg acacctgata 1741 ccaaaagacc cttaaataag gaagtcctct cctctgtgtg catggctgct cttgctacat 1801 aagacctgga acacaggact gctgtctgcc ctctctgctc gccctgccta gcttgaggat 1861 ctgtaagtaa cacaaaactt aaactttcac attgaggttt 
caatattgaa gctgtgtccc 1921 cagtctgacc tctcactgtg gggccacccc agaggaccca gcgtgaagcc cctgctgtga 1981 acttctatct gggtgtctgg cggctgctgg gggtaatggc tactagctaa gtcaatagag 2041 aaactcaaaa agtttccttc caaacacacg tgtcctactt gacatgtcca ataaagacga 2101 tcacagcttc ttaaaacatt attttattgt gagagaagcc tctgcaggtc ctaggtctgt 2161 ttttcaatca ggttgtttgt tttttgctat tgagttgttt gacttcctta tgtattcaga 2221 tatttacccc ttctaccacg taggctttgc aaacattttc tctcattttc tgggttgccg 2281 tttccctcag ttgattgttt cctttgctat gaagatgctt tagcgttcaa tgcagccccg 2341 cttgtctatt ttcccatttg tttattgcct gtgcctttgg tgtcatagcc aagaaatcat 2401 tactcacgtc aatgtccaaa gctttatctt tgtatgtgct tctcgtagtt gtatggtttc 2461 aggtcttttc aagtctatgt tgagtcttca atccatgttg agctgatttt ttacatgttg 2521 tgagagaaag gaccacgtgt atgcacctag caactcatga accttacaca actctttatc 2581 tctctcactg agctcatttc acctgtaccc tgataaggtc attgtcctct tcactctggc 2641 ccctacagga gactactcac cccattacct cagtcgcccc ttcatgaggg tataatgacc 2701 tagaagcctg caatgagtta ctctctactc caccggaatt caggtctggc accagtgttt 2761 agacctgaag agaatagtag ggcccattat caggaaataa gaggcatttg ctctcttaaa 2821 ttattgaatg aaagcactgt ttccattctt tttagaatat taaagattta accaggaaat 2881 attaggtatt tcctgaaaac aggaaaaaat gccagggtcc tcatcatcac catcaacttc 2941 aacctaggca cagacactaa acatagagct tcctgtgaag aaagctggga gagcagagga 3001 ggcattccag ggatgtcaag gccaatagga gtcggcatcc tctctaacaa aatgcacacc 3061 tcctctcact cagaaggcca aaggtttctt atctctgtgc cttctcccag aaagctataa 3121 atccaagctg gcttctccct ccccacacag ctgctcctgc tctccctcct ccaggtcacc 3181 ccagccatga ggattatcgc cctcctcgct gctattctct tggtagccct ccaggtccgg 3241 gcaggcccac tccaggcaag aggtgatgag gctccaggcc aggagcagcg tgggccagaa 3301 gaccaggaca tatctatttc ctttgcatgg gataaaagct ctgctcttca ggtttcaggt 3361 gagagaggcc agcataaaaa agctaccgag tctagagaga cggatgggag atgggctctg 3421 gaatcacatc tcaatggtgg atgtcactta ggtggcttta cttaccatct ctgggcctcg 3481 attttcttat ctcgaaactg aatagagaga caaacaaatg taagtagtct tctttctcca 3541 aagacttgat tccaaggtat gtctataaaa ttcgctaggg ttaagatatg 
gagagacaga 3601 ttgaccagtt ctttctggat ctaaacaagt agatattata gggaaaatat ttcattctgc 3661 caacaaagga aattttaaaa actggagatg ggcttaagag tatgttcagg tgtgtgtctg 3721 atggggcaaa agcacacaaa tcagagcaaa agagaatgag tctcaaatcc tgtatgagca 3781 gcattgctct gtgtatttat tcctattgac taaggttgtt tgtgctaccg gcactaatgc 3841 agccagcatc accggtcagc cagcatgtgc attctccaag attcccttta ccacccaccg 3901 ctgaccttgg tgcttaattt ctcagtcttc ctctgtgttc ccaggctcaa caaggggcat 3961 ggtctgctct tgcagattag tattctgccg gcgaacagaa cttcgtgttg ggaactgcct 4021 cattggtggt gtgagtttca catactgctg cacgcgtgtc gattaacatt ctgctgtcca 4081 agagaatgtc atgctgggaa cgccatcatc ggtggtgtta gcttcacatg cttctgcagc 4141 tgagcttgca gaatagagaa aaatgagctc ataatttgct ttgagagcta caggaaatgg 4201 ttgtttctcc tatactttgt ccttaacatc tttcttgatc ctaaatatat atctcgtaac 4261 aagatgtctt tgtttacacc tctttgaaat ttgatatgtg tctgtgtcaa gacactagaa 4321 agcaactgcc agaatcctac atgaatattt tggaacaaat taaagtttca aaatctagaa 4381 tttttaagga ggagccatgg cttcagacac aaactgaaac gaaggcacaa atgatgccac 4441 tgatctcacc atccactgct ctgtcttttc attgcctcat tttgaatgga ttcactctca 4501 aagacaactc tcatatacta ggcaaaagaa tcaatgtcag cttccagttt agctactcac 4561 aacctcacta cttctcggga aaaaccacca tcctctgcgt caaaatattg gagttttatt 4621 gaacatgtcc aggtcacaca tttgcctttt aagacatagg gttataggag atccactgaa 4681 acaactttaa ttagagggaa atgggttgtt agcaaagcgg gtgcatggaa cacaaacaca 4741 gcagccacaa tgggatgtcc tcattaagag cacccttcag ccacgtctat gaagacctgg 4801 atttcacacc cataaagtcg tcttcacacc cagaaaacgc acacacacaa atgctccact 4861 acactttgca acgcgcactc tcttgggttg ccttaaagct gctagagaac ctgattctgc 4921 tgggaaaata cagcttcacc agaaacctta catcctgttg ttgctgcagg ctgcggctct 4981 gataaccaca acatcctgca catcagtgag tgagcaagaa accaggtcta gagaaaggga 5041 atgaaccagg agcgctctct gagatgctca gagcatttcc tggaaaggcc acagcgtccc 5101 ctgctgtcct accttcaact gcag // LOCUS L29472 5447 bp DNA PRI 07-JAN-1995 DEFINITION Human MHC class II HLA DO-beta gene (DR4/Dw4,DPw4 homozygote), complete cds. 
ACCESSION L29472 J02736 N00052 NID g459824 KEYWORDS cell surface glycoprotein; class II gene; glycoprotein; integral membrane protein. SOURCE Human (individual B.O.; homozygote DR4/Dw4,DPw4), Epstein-Barr virus-transformed cell line DNA, clones A11-107 and p107-[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5447) AUTHORS Servenius,B., Rask,L. and Peterson,P.A. TITLE Class II genes of the human major histocompatibility complex. The DO beta gene is a divergent member of the class II beta gene family JOURNAL J. Biol. Chem. 262 (18), 8759-8766 (1987) MEDLINE 87250500 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.Servenius, 27-MAY-1987. FEATURES Location/Qualifiers source 1..5447 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Epstein-Barr virus-transformed" /haplotype="DR4/Dw4,DPw4 homozygote" /map="6p21.3" gene 988..1134 /gene="HLA-DOB" exon 988..1134 /partial /gene="HLA-DOB" /note="MHC HLA-DO beta-chain; G00-120-049" /number=1 sig_peptide 1044..1121 /gene="HLA-DOB" /note="MHC HLA-DO beta-chain signal peptide" CDS join(1044..1134,2679..2948,3387..3668,4154..4264, 4516..4547,4736..4771) /note="MHC HLA-DO beta-chain" /codon_start=1 /db_xref="PID:g459825" /translation="MGSGWVPWVVALLVNLTQLDSSMTQGTDSPEDFVIQAKADCYFT NGTEKVQFVVRFIFNLEEYVRFDSDVGMFVALTKLGQPDAEQWNSRLDLLERSRQAVD GVCRHNYRLGAPFTVGRKVQPEVTVYPERTPLLHQHNLLHCSVTGFYPGDIKIKWFLN GQEERAGVMSTGPIRNGDWTFQTVVMLEMTPELGHVYTCLVDHSSLLSPVSVEWRAQS EYSWRKMLSGIAAFLLGLIFLLVGIVIQLRAQKGYVRTQMSGNEVSRAVLLPQSC" mat_peptide join(1122..1134,2679..2948,3387..3668,4154..4264, 4516..4547,4736..4768) /note="MHC HLA-DO beta-chain" intron 1135..2678 /note="DOb cds intron A" exon 2679..2948 /number=2 intron 2949..3386 /note="DOb cds intron B" exon 3387..3668 /number=3 intron 3669..4153 /note="DOb cds intron C" exon 4154..4264 /number=4 intron 4265..4515 /note="DOb cds intron D" exon 
4516..4547 /number=5 intron 4548..4735 /note="DOb cds intron E" exon 4736..5214 /partial /note="MHC HLA-DO beta-chain" /number=6 BASE COUNT 1371 a 1159 c 1202 g 1713 t 2 others ORIGIN 434 bp upstream of EcoRI site. 1 gtacaaatat gaatgcagtc agctttatgc ggatgtatcc aatcatgata tatatatcac 61 atactaatta ccctgatttg atcaatacac aatgtatagg tgtgttggaa catcatgctg 121 taccccataa atatgtacaa ttattacgtg tcaaatttaa aaaacccaag aaaagtattt 181 tttcttagtg ttgttttttt ctaaagtgta tcccattttt tcctttgttt tcgactttca 241 aaattcttga aagactgttt taatctgcgg cctttgtttt cttgtttcca attcattctt 301 ttgaaatatg gcttctacct ttaacactct gctgtcctct tgaaggttat tggtgcccta 361 ctaaccataa aacacaataa tgctctctta gattgtatcc tngtgagcct ttttgcaaaa 421 ttttattctt tgagaattct ctttccatct tttcttttct tttctttttt tgtggtcatt 481 ttggcttata ggaaatcgta ctctccaaat gttctctatg tcttctattt cttctctgat 541 aatcccttta ctaaatcttc ttgaggcata tgcttcagaa tgcttaattt acatactctt 601 aattccttct taatttacac actgcctgtt ggcaatgtca acacccaata agaagggaga 661 cctattggtc agaacaggcc agggaataga agaatacata ataaacagtc tgcctagttc 721 ttctagggcc ccataatatg tcaaacatat atttttactt cttctcccag cccattttta 781 gtatacctaa attactgtca gtgattctgc aggcaaacaa tggttgagtt gtatgacaca 841 actttgtgaa agtatcctac ccagtacctg atgatgcaaa actcttctat cttgattggt 901 tgtcaatctg aggagtttcc aatcctgggg aagccagaaa aacagcgatt tatactctta 961 atgggtactt tctgactgaa ttttatgagc tcattctgaa gaggctgacg attttactat 1021 ctcatttttt tcctttctcc agaatgggtt ctgggtgggt cccctgggtg gtggctctgc 1081 tagtgaatct gacccaactg gattcctcca tgactcaagg cacagactct ccaggtaaga 1141 acagagcaat tgtttttttc cagtgtgtat gcaagaattg gcatggggga gtgatgcctt 1201 tctttgtaag tccaggccac agaccagact ggaagtggct tttggtttca aagaacagtg 1261 ttcttccctt tggcagaaag gtacgccttg cctctttaca tgggatggac ttcatatacc 1321 agagccacct attcaagggg tagggaggca ggaagaggga aacattgtgt cttgtttagg 1381 atccttattg tgtatcaacc tcagtcagta cctaggcgtg ttgaaggcct ggcttgggtt 1441 cgagcctgct gggagaaaca acctgcagta ggctgggtca cagaggcaat ctgtgatttt 1501 ttggtcagga cacggaaaca 
aatctcagtt ggggtatatg tggacaaatg aaactggaaa 1561 caaaggttgc tccttctgtc atttattaag ccactattat attgtcagaa ttgtactaaa 1621 cagttttgag aagtaagaga agttgaatag aatacattgt ccttgtcctc cgcacnaggt 1681 acaagttact tgtcactgtt atttttcagc acaggtgaca gaatatgcag ccatgagcaa 1741 tgtgagatga agcacatatt aatgagcaga aacaggatgt aatgtgctaa gaacagaatc 1801 ccctttgcat gttagtttca ttaaatacaa aagaggaatc aaacctggcc aggagagatc 1861 attattctta gagaatagaa accgccctga gtttataatg tccattaaac aatacaactg 1921 aaaaaaaaat cagcacagat gttaaatgat gatgaaaaat tcagatttcc cccctagttt 1981 agactactag aggaaataga gaagagtata catgctgaga aattacaggc tggaacttca 2041 tctgaaatta gctactgagt gagggataag tggggttcac ctaggaaggt cattcttatg 2101 gctcagttgg aggcttcgaa cttagaaagg aaaggtaaat tacaacccaa cattaatagc 2161 aattatcttt caagtcttga cttagatgca atgtcttcag gacgtccttc ctgacttacc 2221 tacattatta actccatttg aatttccatt tttattgtag ttggtgtggc cgcctaacca 2281 gtcttggggg tctggctggt ttggtttaat ttacccaagt gttcacagca cattgtaatt 2341 attggcagta gtgcaagact ctctcatctc ttctcttgcc tccgttctca ttctctcccc 2401 tccctggaga atccattcta aacgtgtctg gtatgtctct aagtatgaga gtggctttta 2461 gaaatatgta gcattattgc tctttatgtg tttttaaaat ttaataaatg tcatttctct 2521 ggttgaatcc cattctgttt cttttctcac ttgacactgt gctttttgag catactgagg 2581 tcaaggcctc ttagatccat gtggtctgac gtaatttacc aggcatgggt tttcccagag 2641 gagggggctg gttcatggtt ttggttttgg ttttccagaa gattttgtga ttcaggcaaa 2701 ggctgactgt tacttcacca acgggacaga aaaggtgcag tttgtggtca gattcatctt 2761 taacttggag gagtatgtac gtttcgacag tgatgtgggg atgtttgtgg cattgaccaa 2821 gctggggcag ccagatgctg agcagtggaa cagccggctg gatctcttgg agaggagcag 2881 acaggccgtg gatggggtct gtagacacaa ctacaggctg ggcgcaccct tcactgtggg 2941 gagaaaaggt gagctggaag ctgaggtctg gcggggctca ggaatgtccc ccatgtgaac 3001 catggctctt ctttcttaca agcaattttc tgcttttagg ataaatggtt gtctgtgtag 3061 atgttctggc cccagctgtg atatattatc ctcacaagtc agccactgtg atcttggtct 3121 cagaccccca aggttctcag ggacttcgag ggctattgta ccctcaaaga gaagcagtaa 3181 ttgtgggagt acctcagaaa gtctaaatcc 
tcctgacagg cattgacata ccctgttact 3241 gatcttgggg gctgagactt gcctatactt tgtgttcact tgggtgacct gggaaagaga 3301 ttagacatag tgatagtccc taaagaatct cctgtcccag cttggtggtt ttctttcacg 3361 gtgtctcatt tttcctccct tcctagtgca accagaggtg acagtgtacc cagagaggac 3421 cccactcctg caccagcata atctgctgca ctgctctgtg acaggcttct atccagggga 3481 tatcaagatc aagtggttcc tgaatgggca ggaggagaga gctggggtca tgtccactgg 3541 ccctatcagg aatggagact ggacctttca gactgtggtg atgctagaaa tgactcctga 3601 acttggacat gtctacacct gccttgtcga tcactccagc ctgctgagcc ctgtttctgt 3661 ggagtggagt gagaattagt ttctagtact ctctgggcct gactcaggac tatactgact 3721 caatacagag cctgtgtcac ttctgcgttt atcttggtca caacatgaat tattctttcc 3781 cttgatctgg gaacagtcac gaaaccagag tccttgggtt agggtgggag aaaacatggc 3841 agatatctat cctcatatct tccaagaaat gaggagatct aatcacctca ttatgtgctt 3901 ccaaccctat gaactggtgt cctctaattc tttggtctta gtatttagga ggcattctta 3961 tgggctgtga gaatctgtaa ccgatgggtg gtaactccat gggtgccact ttggtttcga 4021 agaacctttt ctaaatttat ttatttttct ctagctagca ttggatttgg tgtctagtac 4081 agattctggg attccaagaa agtgctttaa atattgggat atttttacta atttaaagac 4141 ctgtttccca taggagctca gtctgaatat tcttggagaa agatgctgag tggcattgca 4201 gccttcctac ttgggctaat cttccttctg gtgggaatcg tcatccagct aagggctcag 4261 aaaggtaatg agcctgtgag gagtgccctg ccacctgtcc cagaccttcc ccactcccac 4321 cttccctaac gtcaatgatc tgaggcaagg aaagctgatt gtgcctctca gggatcaccg 4381 ggataatttt tttctgaagc tagaaatggg ataagcagag agagtgctga ccttgccagc 4441 catttgttct tccctcggga taatcatatt gggtcctaat tggggcaatc cattcttttc 4501 tcgatttctt tccaggatat gtgaggacgc agatgtctgg taatgaggta atgtctcttt 4561 ttccttgtct ttgagtggca gatcattctc ccggttcttt ggccagaggg agatgacatg 4621 ggggtaggga ggagtaaggt tgctgctgtc tagatgggac tgtcccctga gtctctggaa 4681 cggctgtggg gggtggtgag gctgcctcct gagaccttca tcactgtgcc tccaggtctc 4741 aagagctgtt ctgctccctc agtcatgcta aggtcctcac tgaagcttct ctctctggag 4801 cctgaagtag tgatgagtag tctgggccct gggtgaggta aaggacattc atgaggtcaa 4861 tgttctggga ataactctct tccctgatcc ttggaggagc 
ccgaactgat tctggagctc 4921 tgtgttctga gatcatgcat ctcccaccca tctgcccttc tcccttctac gtgtacatca 4981 ttaatcccca ttgccaaggg cattgtccag aaactcccct gagaccttac tccttccagt 5041 cccaaatcat ttacttttct gtggtccagc cctactccta taagtcatga tctccaaagc 5101 tttctgtctt ccaactgcag tctccacagt cttcagaaga caaatgctca ggtagtcact 5161 gtttcctttt cactgttttt aaaaaccttt tattgtcaaa taaaatggag atacaaaaaa 5221 tgtacatttt agtgaattat ttaagaaaaa cccctgtaat caagtcaagg aacaggactt 5281 tgccagctcc agcagaagtc tctgtacgtc aggccaatca aagcctctcc ttcccctcga 5341 aagtgaccat atcctgattt tattgtaacc tctttcatgt ctttgtagtc tagtccccca 5401 ggtatgtgtt ctggacgcca cagcttagtt gctttacgcc taactca // LOCUS S45332 8647 bp DNA PRI 23-DEC-1992 DEFINITION erythropoietin receptor [human, placental, Genomic, 8647 nt]. ACCESSION S45332 NID g255496 KEYWORDS . SOURCE human placental. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8647) AUTHORS Noguchi,C.T., Bae,K.S., Chin,K., Wada,Y., Schechter,A.N. and Hankins,W.D. TITLE Cloning of the human erythropoietin receptor gene JOURNAL Blood 78 (10), 2548-2556 (1991) MEDLINE 92399733 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 113293] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..8647 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1916..8128 /gene="erythropoietin receptor, Epo receptor" CDS join(1916..2030,2887..3022,4010..4185,4266..4423, 4907..5060,5145..5232,7334..7421,7517..8128) /gene="erythropoietin receptor, Epo receptor" /note="This sequence comes from Fig. 
2; Epo receptor" /codon_start=1 /product="erythropoietin receptor" /db_xref="PID:g255497" /translation="MDHLGASLWPQVGSLCLLLAGAAWAPPPNLPDPKFESKAALLAA RGPEELLCFTERLEDLVCFWEEAASAGVGPGNYSFSYQLEDEPWKLCRLHQAPTARGA VRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGLVARLADESG HVVLRWLPPPETPMTSHIRYEVDVSAGNGAGSVQRVEILEGRTECVLSNLRGRTRYTF AVRARMAEPSFGGFWSAWSEPVSLLTPSDLDPLILTLSLILVVILVLLTVLALLSHRR ALKQKIWPGIPSPESEFEGLFTTHKGNFQLWLYQNDGCLWWSPCTPFTEDPPASLEVL SERCWGTMQAVEPGTDDEGPLLEPVGSEHAQDTYLVLDKWLLPRNPPSEDLPGPGGSV DIVAMDEGSEASSCSSALASKPSPEGASAASFEYTILDPSSQLLRPWTLCPELPPTPP HLKYLYLVVSDSGISTDYSSGDSQGAQGGLSDGPYSNPYENSLIPAAEPLPPSYVACS " BASE COUNT 1855 a 2411 c 2432 g 1949 t ORIGIN 1 ggatccaccc acctcggcct cccaaagtgc tgggattaca ggcatgagca ctgtgcatgg 61 actatttatt tatttttttg aaacagagtt tcaatcttgt tgcacagcct ggagtgcaat 121 ggtgtgatct cagctcactg caacctctgc cttctggttt caagcaattc tcctgcctca 181 gcctcctgag tagctgggat tacaggcacc caccaccacg ctcgaatata tatatatatt 241 ttttgagacg gagtccgctc tgtcaccagg ctggagtgca gtggccaaat atcggctcac 301 tgaaacctcc ggctcctggg ttcaagcgat tctcctgcag cctcccaagt agctgggatt 361 acaggcatgc agcaccacgc ccatctaatt tttgtatttt tggtagagat ggggttttac 421 catgttggcc aggatggtct tgatctcttg acctcgtgat ctgcccacct cggcctccca 481 aagtgctggg attacaggcg tgacgaccgc gcccggccta cgcctggcta atttttgtat 541 ttttagtaga gacgtggttt cgccatgttg cccaggctgg tctcgaactc ctgacctcat 601 gatccgcctg tctcggcctc ccaaagtgtt gggattacaa gtatgagcca ccgcgccact 661 agccaatttt ttttattttt tgagatgcag tctcactctg ttgcccaggc tggagttgca 721 gtggcatgat cttggctcac tgcaatcttc atctcccaga ctgaagcagt tctcatgcct 781 cagcctcctg agtagctggg attacagcac acgccaccac acctggctaa tttttgtatt 841 tttagtagag atgggatttc accatgttgg ccaggctggt ctcaaactcc tgacctcaag 901 tgatttgccc acgtcggcct cccaaagtgc tgggattata ggcgtgagcc accgcccagc 961 ccaagagaat aaaaatgtgg gtggtaaaaa tttttttccc aaaaattcgt aaatgaaaat 1021 ctcacatatt atgcatactg cccaggagca tggcctagca ctgtgcaaac actcaactgc 1081 tggtcgttgc aaggattatt attggccggc ttcagtggct tgctggtatt cccagcacat 
1141 tgggagatgg aggctggagg attgcttaag tccgggattt caagaccagc ctggacaaca 1201 tagtgggatc ccatctctac aaagaatttt aaaaattagc caggtgcagt gggaagattg 1261 cttcagtcca gaggctgcag tgagctatga ttgtgccact gcactccagc ctgggtgaca 1321 gagcaacacc ctgagacaga gagagagagg gggaaggagg gaaggaggga aggaaggaag 1381 gaaggaagga aggaaggaag gaaggaagga aggaaggaaa ggagagagag agagagagag 1441 agagagagag agagagaaaa taatttttat ttatttccag gctgggaaga gatgctgatt 1501 tctgcgataa aatcagtagg tacatttttt ggaatgttcg ctatgtgcca ggctagattt 1561 tacagatgag aagtctgaag ctcaggtaag gtaagtcacc tgtccagggc cacaaagaaa 1621 aaaaaaacgt gtgtctgaag ccagaacggg agctgttgcg cccaactccc tcccctgccc 1681 ccaagcggcc tctgggctcg ggaagggccc ctgcctcctc ccgccaggca cttatctcta 1741 cccaggctga gtgctggccc cgcccctcgg ggatctgcca cttagaggcg cctggtcggg 1801 aagggcctgg tcagctgcgt ccggcggagg cagctgctga cccagctgtg gactgtgccg 1861 ggggtggggg acggaggggc aggagccctg ggctccccgt ggcgggggct gtatcatgga 1921 ccacctcggg gcgtccctct ggccccaggt cggctccctt tgtctcctgc tcgctggggc 1981 cgcctgggcg cccccgccta acctcccgga ccccaagttc gagagcaaag gtaaggatga 2041 gctgcgtgtg gacccctacg ctggagcctg caggaccatg ctggggcctg aactcccagc 2101 ctaggtcctg ggggccatgc tgtttctgga cttcctgacc gggtcctggg ggccaagctg 2161 gcatctgaac ccttagactg ggtcctggat gggtgggggg cggggtgggg tatgttagga 2221 tccaagactc ctgatcgcgt cccgggcaag agctagagtg ggcttaacat tcccgtttta 2281 ccttttcagg gagtctggga catgctaaat cctaaggggg ctgacttggt gctaaggtcc 2341 ctggggggtg gggaccaagc cgatccctag gggagggagg gtaaagcccg ggtccgagtt 2401 agagggccaa gccacaggct actgtaaaca cggtttgtgt gagggcgcca gatcacttgc 2461 ccggcccggt ggagggaggg aggcgggggg cacggttggc gctatcggtt ggcggggagc 2521 ctgccggggc cgataggggg cccgcctctc cgcacacacc cccagccgcg cgcgtgtcct 2581 aggctggggc ggggctggca gtcccgagct cgaggtcttg aacgccgcgc ccagctcagc 2641 tggccgctgg gtgggcaggt gtgcgccagt ggtgcacggc gggggacagt aaggcgagaa 2701 acttgcccct gggaattagg ggggcaccac ctctgcggac ccctccaagg gacccgcttg 2761 ggaagatggc agggcggggc ttttttctta tcgggtccgc ccaggctgcg ggagggaaga 2821 
ggagggggct gtctcccgag gatagagctc agacccccat gcccttcctt tgtcgcccct 2881 ccccagcggc cttgctggcg gcccgggggc ccgaagagct tctgtgcttc accgagcggt 2941 tggaggactt ggtgtgtttc tgggaggaag cggcgagcgc tggggtgggc ccgggcaact 3001 acagcttctc ctaccagctc gagtgagtcc gatccggcgg gtgcctccaa gggcggaggg 3061 agggggtggg gcagagctcc ctggaggtcg tagcctcgta tgtcccctgc tgtttgaggc 3121 ccgacggcgc ctccagtcgt ggtcactgga gggaaacctg cgggtccagg gctggcacgc 3181 ctctatgggc cggggcgcga acactcccgc gatcaccgct ggaacgcgac cccaaacatc 3241 aggctgggat aacaacgcct ccaaatcgag ggtaaggcgt tactacgtcg gggctgggac 3301 gccttctcga ggtagtatcc aaaaggaggc cagcagtgct catgcctgta atcccaactc 3361 tttggaaggt cgagcggaag aaccgcttga gcccaggtgt tcaagaccag cctgggcaac 3421 acagcgagat ccccgtctct taaaaaaaaa ttagactggg cgcggctgca cgcctgtaat 3481 cccagcactt tgggaggctg aggcgggcgg atcacctgag gtcgggagtt tgagagccag 3541 cctggccaac atggagaaac tctatctcta ctaaaaatac aaaattagcc gggcgtggtg 3601 gcgcatgcct gtgatcccag ctactcggga ggctgaggca ggagaatcgc ttgaacccgg 3661 gaggcggagg ttgcggtgag ccgaggtagc gccattgcac tccagcctgg gcaacaagag 3721 cgaaactccg tctcaaaaaa aaaaaaaata aaagccaggc gtggcgcgtg cctgtggtct 3781 caactacttg ggaagctgag gtgggaggat cccttaagcc ccagaatttg aggctgcagt 3841 gagccatgat cgcgccactg cactccagcc tgggcgacga aggaacacct tgtcacacac 3901 acacacaagg ctagaccttg tgtcacacat acacactgcc ccccacaggc cgggcaatgc 3961 caactccccg gtcccccctc ccaacctgct cccttccctg ggcgcatagg gatgagccat 4021 ggaagctgtg tcgcctgcac caggctccca cggctcgtgg tgcggtgcgc ttctggtgtt 4081 cgctgcctac agccgacacg tcgagcttcg tgcccctaga gttgcgcgtc acagcagcct 4141 ccggcgctcc gcgatatcac cgtgtcatcc acatcaatga agtaggtaag tgctctggga 4201 atggaggagt ggtcggagga gagggtctca gtcctcgccc acctgaccaa cccccatgcc 4261 tgcagtgctc ctagacgccc ccgtggggct ggtggcgcgg ttggctgacg agagcggcca 4321 cgtagtgttg cgctggctcc cgccgcctga gacacccatg acgtctcaca tccgctacga 4381 ggtggacgtc tcggccggca acggcgcagg gagcgtacag agggtgaggc cagcccctac 4441 ggcccagccc ccaaagctcc actgactacg gcccagccac gcctctcgag gtcgcgcccg 4501 gtgccgcttt 
cagggccggt ccgtaacatc ccacatccca ttaccctggt gctgaagacc 4561 gttccacgcc cacagacaca gccccctttc ctaatgtcct cgcaagcctg ttgaacccca 4621 acttcttctc cctccggccc gtaaccctag acccctttag cgcccgggtc cctctacgag 4681 tgctagccca gatattaaat tgcccgggtc ccgccctttc gtaccagaga ctctctctct 4741 gattggccct gagctttctt gggctcctcc ccctactctt attggtccca ttgcaattct 4801 agggcaccgt tttcctttcc cctgattggc tcagttccac cagggcccgc ccccacgtca 4861 tctatttttg tctgctacgc gtccctcgcc ctgattccgc ccccaggtgg agatcctgga 4921 gggccgcacc gagtgtgtgc tgagcaacct gcggggccgg acgcgctaca ccttcgccgt 4981 ccgcgcgcgt atggctgagc cgagcttcgg cggcttctgg agcgcctggt cggagcctgt 5041 gtcgctgctg acgcctagcg gtgaggcccc aggcgggggt gtaggaggag ccagggcgaa 5101 tcacggggca agcccaccgc cctgacctcc tccccgcctc ttagacctgg accccctcat 5161 cctgacgctc tccctcatcc tcgtggtcat cctggtgctg ctgaccgtgc tcgcgctgct 5221 ctcccaccgc cggtgagctc cccatttggg cgctgggccc agactcctcc ccgccaacgg 5281 tcctctttca ctatggaaac ctaggctcag agagagacac gcacttgccc aaggtcacgc 5341 agtaaggatt cacatcagtg gcagggctgg gatgcatgcc agactagacc cagactcttc 5401 gttaacattt tctgctcttg gggactttca cctgattttc cttctacatc aggggctgcc 5461 atttcttggg tccctttgtt agttcctttc cccagtgtca tcacctttgt aaaatcaact 5521 agatggattt agtgaaagaa tttaagaccc tgaatgcctc cgcacccctg cggtcaagct 5581 tctcagacac tatgatcaga ctagccgttc tgaggtattt gtaattccaa gcacacacta 5641 ggtggtttca cacccccaag cttttgccca tgctgttccc tctgcctgga atgcccttcc 5701 tgccttgtct gctaagcaat cttctagtcg tctttcatgg ccctgttcat ttacttggtt 5761 ggaaaataca aacagagtgc caaacatgtg ccaggcactg gagagagaat ggagaacaag 5821 ctagaccctg accacaagtc cctgaccttg tggatctcaa gtcaacaaac aagggaccca 5881 agaaatattt gatgacaaat tgtaatgagt gatatcacag aaacaaacag aatgtggtga 5941 catgacagga tggtcaggga aggctccagg aggaggtgac atcagagtgg aaacctgaag 6001 attggaagga agcagccgct tgaaaagtgg ggagaagaaa cagcaagtgc aaaggccctg 6061 aggtgggaat gagattggaa cgttcagcca gcttcaagaa ttgccacatg catggcctgg 6121 catggtggct cacgcctgta atcccagcac tttgggatgc cgaggcaggc agatcacctg 6181 aggttgggag ttcgcgacca 
gcctgaccaa catggagaaa ccccacctct actaaaaata 6241 caaaactagc caagcgtggt ggcacatgcc tgtaatcccc gctactcggg aggctgaggc 6301 aggagaatca cttgaacctg ggaggtggag gttgcgggtg agccgagatc gtgccatcgc 6361 attccagcct gggcaataag agtgaaactc cgtctcaaaa aaaaaaaaaa aattgccaca 6421 tggctagagt ggtatgtaag ggggtgtggc agatattgag atgagggagg tgacaggggt 6481 catataacgc agggccttct gcagggtggt ggggaggagt ttggaatttt tttttttttt 6541 gagacagagt cactcttgtc gcccaagctg tagtgcagtg cagcagtctt ggctcactgc 6601 aactctgcct cccaggttca agtgattctc ctgcctcaac cgcctgagta gctgagatta 6661 caggcgtgca tgcccggcta attttgtagt tttagtagag acggggttcc accatgttgg 6721 ccaggctggt ctcaaactcc tgacctcagg tgatctgctc acatcagcct ctcaaagtgc 6781 tgggattata ggcatgagcc accgtgcctg gcttggattt tatcctaaat gcctctctca 6841 ttaccccaga aggtaacata atatttatct atgaagtgac atcatggacc tcctggaaaa 6901 atctgggcca gggttttggg ttttttaatt tattttattt tatttttttt agagatgggg 6961 gtctcactat gtttcctagg ctggtcttga actcctgggt tcaaatgatc ctcccacctc 7021 agcctcccaa agtactggga ttatagtgct ggtgtaaacc actgcacctg gccatggcca 7081 ggattaaagg gagaatgacc aaggtatatt gaactcctat gcacccttca ataccctgtt 7141 ccatttaccc ttttgtaggg ccttgctgat gcttcagcca aaacccctgt cccctggccc 7201 tgatgtactc ctctgcctcc attgtgatca cagggaccaa gtgtatctgt gcctctatga 7261 ctgggagtgg agggggaatt ggtgagtatt caatgagtca tatctatgta actatttata 7321 ttggcttcaa cagggctctg aagcagaaga tctggcctgg catcccgagc ccagagagcg 7381 agtttgaagg cctcttcacc acccacaagg gtaacttcca ggtaggtggc ctggttgtcc 7441 cctcagtgcc tgggcttccc tgcttcttgc agccaaactg caggcctctc tgagcaggtt 7501 ggtgctattt cttcagctgt ggctgtacca gaatgatggc tgcctgtggt ggagcccctg 7561 cacccccttc acggaggacc cacctgcttc cctggaagtc ctctcagagc gctgctgggg 7621 gacgatgcag gcagtggagc cggggacaga tgatgagggc cccctgctgg agccagtggg 7681 cagtgagcat gcccaggata cctatctggt gctggacaaa tggttgctgc cccggaaccc 7741 gcccagtgag gacctcccag ggcctggtgg cagtgtggac atagtggcca tggatgaagg 7801 ctcagaagca tcctcctgct catctgcttt ggcctcgaag cccagcccag agggagcctc 7861 tgctgccagc tttgagtaca ctatcctgga 
ccccagctcc cagctcttgc gtccatggac 7921 actgtgccct gagctgcccc ctaccccacc ccacctaaag tacctgtacc ttgtggtatc 7981 tgactctggc atctcaactg actacagctc aggggactcc cagggagccc aagggggctt 8041 atccgatggc ccctactcca acccttatga gaacagcctt atcccagccg ctgagcctct 8101 gccccccagc tatgtggctt gctcttagga caccaggctg cagatgatca gggatccaat 8161 atgactcaga gaaccagtgc agactcaaga cttatggaac agggatggcg aggcctctct 8221 caggagcagg ggcattgctg attttgtctg cccaatccat cctgctcagg aaaccacaac 8281 cttgcagtat ttttaaatat gtatagtttt tttttgtatc tatatatata tatacacata 8341 tgtatgtaag tttttctacc atgatttcta caaacaccct ttaagtccca tcttcccctg 8401 ggcataggcc atagggatag aagttaaagt tcttgagctt attcagaagc tggatctgca 8461 atctgaatgc tactcataac ataacaaaat agtatgttaa acagctctta aatcttactg 8521 gcttaccaca ttaaatgatt tctctctcct aactcagctc aaatgggcag ccatccatgg 8581 atgagtcaga ggttcagact cttccagtct gtagctctac cttctcttag ggtacttaga 8641 tggatcc // LOCUS S63697 1339 bp DNA PRI 22-SEP-1993 DEFINITION prepro-melanin-concentrating hormone [human, HeLa cell, brain, Genomic, 1339 nt]. ACCESSION S63697 NID g402795 KEYWORDS . SOURCE human brain HeLa cell. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1339) AUTHORS Breton,C., Schorpp,M. and Nahon,J.L. TITLE Isolation and characterization of the human melanin-concentrating hormone gene and a variant gene JOURNAL Brain Res. Mol. Brain Res. 18 (4), 297-310 (1993) MEDLINE 93316802 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 135044] from the original journal article. This sequence comes from Fig. 5. FEATURES Location/Qualifiers source 1..1339 /organism="Homo sapiens" /db_xref="taxon:9606" gene 31..1149 /gene="prepro-melanin-concentrating hormone, prepro-MCH" CDS join(31..279,630..828,1100..1149) /gene="prepro-melanin-concentrating hormone, prepro-MCH" /note="This sequence comes from Fig. 
5; prepro-MCH" /codon_start=1 /product="prepro-melanin-concentrating hormone" /db_xref="PID:g402796" /translation="MAKMNLSSYILILTFSLFSQGILLSASKSIRNLDDDMVFNTSRL GKGFQKEDTAEKSVIAPSLEQYKNDESSFMNEEENKVSKNTGSKHNFLNHGLPLNLAI KPYLALKGSVAFPAENGVQNTESTQEKREIGDEENSAKFPIGRRDFDMLRCMLGRVYR PCWQV" BASE COUNT 469 a 227 c 218 g 425 t ORIGIN 1 gtttgactgt atgcaaacat caaactaagg atggcaaaaa tgaatctctc ttcctatata 61 ttaatactaa ctttttcttt gttttctcaa ggtattttac tttcagcatc caagtccata 121 agaaatttag atgatgacat ggtatttaat acatccaggt tggggaaagg ctttcagaag 181 gaagacactg cagaaaaatc agttattgct ccttccctgg aacaatataa aaatgatgag 241 agcagtttca tgaacgaaga ggaaaataaa gtttcaaagg taagtggcaa tgtgacttgt 301 cgtttatttc aatgaaaatt taaatgattc ttacaaatcc tctgaaaagt aaaactgata 361 cttttataaa cagaagtata tgcaaacagt cacaatatgc attaggacga ctgacgatat 421 ttcttacatg ccaggtagtt cttccatccc agcaaacacc tcttatctga aagtgttttc 481 tctcctataa atttgcatct aaggtatttt taaaaagtca aaaacagtgc agagtaactt 541 tgaattgatc aacaagaata ttatacaatt gtattatagt tccactccga atacgatagc 601 taaaaaccaa atcaaccttc tctttacaga acacaggctc caaacataac ttcttaaatc 661 atggtctgcc actgaatctg gctataaaac cttatcttgc actaaaagga tctgtagctt 721 tcccagctga gaatggagtt cagaatactg aatcaacaca agaaaagaga gaaattgggg 781 atgaagaaaa ctcagctaaa tttcctatag gaaggagaga ttttgacagt gagtagtctt 841 ttcaaaattc aattcttaca tcctaccacc attaaacaga actctgagtt aaagtgaatt 901 tggatgcaat cataacaaaa tcaaataaga ccatggctca attacacctg ccaaaaatgt 961 gggattaaat agcaacaatt aattattatc attttggttt actcagaatt agttatacta 1021 gatccattct ttttttctta ataaattttg tgtgataatt atagtccttt aaacaattta 1081 aactttcttc ttccttcagt gctcagatgt atgctgggaa gagtctaccg accttgttgg 1141 caagtctgat acctgttggt ccacatcatc ttttcagaag aaaataaaag catttaattg 1201 ccaatgggag gagaagccca tactgctact ataacttgtg tatgttaaat gtctgtttta 1261 aaagaaagta gtgttaagat gtatcagtaa ctgaaatgat atgctttctt tgtgcattaa 1321 atttgtgaaa attctgcat // LOCUS S73906 4606 bp DNA PRI 12-APR-1995 DEFINITION adrenomedullin=potent hypotensive peptide [human, 
liver, Genomic, 4606 nt]. ACCESSION S73906 NID g765329 KEYWORDS . SOURCE human liver. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4606) AUTHORS Ishimitsu,T., Kojima,M., Kangawa,K., Hino,J., Matsuoka,H., Kitamura,K., Eto,T. and Matsuo,H. TITLE Genomic structure of human adrenomedullin gene JOURNAL Biochem. Biophys. Res. Commun. 203 (1), 631-639 (1994) MEDLINE 94354869 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 153703] from the original journal article. FEATURES Location/Qualifiers source 1..4606 /organism="Homo sapiens" /db_xref="taxon:9606" gene 2357..3297 /gene="adrenomedullin, AM" CDS join(2357..2454,2605..2754,2988..3297) /gene="adrenomedullin, AM" /note="potent hypotensive peptide; This sequence comes from Fig. 2A and 2B; AM" /codon_start=1 /product="adrenomedullin" /db_xref="PID:g765330" /translation="MKLVSVALMYLGSLAFLGADTARLDVASEFRKKWNKWALSRGKR ELRMSSSYPTGLADVKAGPAQTLIRPQDMKGASRSPEDSSPDAARIRVKRYRQSMNNF QGLRSFGCRFGTCTVQKLAHQIYQFTDKDKDNVAPRSKISPQGYGRRRRRSLPEAGPG RTLVSSKPQAHGAPAPPSGSAPHFL" BASE COUNT 916 a 1345 c 1386 g 959 t ORIGIN 1 gaattcaggt ccggctcagg tgactccttc caggaagagt ccttaaataa cttccgcgcg 61 gccatttctt ccccaggtgc aatgtccact gttgagtccc gagagtgaaa aggcagctaa 121 gccagcagtg cgcggcccag acagaccact ctgggcaagg tccgcgcgac ccactgccca 181 gccgcctttg caccggagct cagagccata tggcagagta ggaaggaagc ttaaaggtct 241 gctggggacc ggctctaaga tggggactcg agagatggga catgactcgc ccaccttcac 301 acagcgaatt gccgggaggt gaggactaga agcctgtttg gctagtggtc tcgcaccttt 361 ggcctctccg tcgagtcgct gggcttcagg acattcgatt agaggagtgg ttagtaggac 421 cccagaaacc cacaagcacg aagcaggaag ccccgtggct tagcctttcc ttcccagttt 481 ggcccccagg agagaagtaa gaaagagaag aagctgtgat gaaagagcac aaacgggtta 541 caaacgtgtc tagcgtgatt catcatgaac aggcacaaat tctttgggcg ggggctagga 601 ctctcctttg ccccttgaga agttggtgac cccagcctag aggaatccac gccgcgcccc 661 agtcggccgt 
gctagcgtgt cgggccgagt agggactctg ctgcttccta cctgcagggt 721 gacttcccct ctctgcagat acctcccctc tctgagcttg aattttctac ctgctgaaat 781 gggaaaagga atgttacctt ccttgcctga ctcaagggtg gctgtgaagc tcaaagtaga 841 ctgggatgtg gccgtgcttg gtaaactgta aaatgattag catacgtgaa gcgttagtgt 901 gctccctggc agtcagtcct cacgtttacg atggattaat gaaggcagcc aggcacaatc 961 tcaggttatg accttataag gcataatagg attacaagga ggcaaaatga aacagtatat 1021 ttcctcctaa ttttggttat tcctatgttt ccaaatagaa atgaaaattc tgaaatttca 1081 gataattccc ccctccaaat cccaattctc aatttagctt ttctgagcct tgtccctacc 1141 gtgaaatctt actagctcac cccgaaacgc cggggtcttg gtcctctggg ggactagtga 1201 gtcctgctcc tctcggcttt gctccgtccc tgccccggcc cctccgcggg agtctggggt 1261 cgcggcccgg tcaggcgcca agccggcagg gggcgcggtc cacttgaggc cacagctccc 1321 caggtccagg gctcgggcca tccgcgctgt cccttccgcg ggctcttgct gttcttcgcc 1381 aggaggcttt gcactccggg ccgcatgggt cctggataag gacctcaaga ggttgtagtg 1441 gcgccccttc gctccggcgt actgtctgaa ccctgtgccc aaagagggct gtgcagttcc 1501 tgcaccaacc gcctggagcc catacctaag cctctgggca cgagggacct cctctccccc 1561 gctcccccct ccccccatcc caactccagc cccaaaggaa gcaatgcgcg cgtccgagag 1621 caggagcgcg cgtggctgag gaaagaaagg gaaggcaacc gggcagccca ggccccgccc 1681 cgccgctccc ccacccgtgc gcttataaag cacaggaacc agagctggcc actcagtggt 1741 ttcttggtga cactggatag aacagctcaa gccttgccac ttcgggcttc tcactgcagc 1801 tgggcttgga cttcggagtt ttgccattgc cagtgggacg tctgagactt tctccttcaa 1861 gtacttggca gatcactctc ttagcaggta ggtgccgcag accctgcggg ttaagaggtg 1921 gggtgggggg cagtgcttgc caaggcccta aactgggagc gctgggtgag gggaacaacc 1981 cactttggag ggttctctga gagatagata caccccatat cctgggccca gctcgtgcac 2041 acagctggag gtccagagac ccagtcccct ctgctccgtc agccaagttc caagaagttg 2101 agcagagacc ctctgggagc ctggcggggt gcagcggcct cccctgcggg gcctgtcacc 2161 cggccggcgc gtgcaaacgc tctgggcctc tctgcgcgga gggagataag cgtctgagcc 2221 agggaaagcg cgggctaaac ccgcctcgcc ggggcccctg cccgccctcc gtgccccgcc 2281 cgggcggtgc agctggcccg ggtgctcacg ctcgactctc tttcttcttt tccagggtct 2341 gcgcttcgca gccgggatga 
agctggtttc cgtcgccctg atgtacctgg gttcgctcgc 2401 cttcctaggc gctgacaccg ctcggttgga tgtcgcgtcg gagtttcgaa agaagtgagt 2461 ccgggcagcg ccttccccct tgctggtacc tggcaggcaa ggggaactga ccgttggtcc 2521 cgaaggtcta gaagtgaatg ggagcaggga caggcctggg cgtcacctga acgcacgcga 2581 atcgggtctg cttgtgtttt ccaggtggaa taagtgggct ctgagtcgtg ggaagaggga 2641 actgcggatg tccagcagct accccaccgg gctcgctgac gtgaaggccg ggcctgccca 2701 gacccttatt cggccccagg acatgaaggg tgcctctcga agccccgaag acaggtaact 2761 acgccctgtg ctgtccaggg acgggaggga aggaaggtgt gcgggaggag ttctctgtct 2821 ccactcccct ggcccggggg atcgtcgggg ctggaccgca gctcagatgg cgcgagcagt 2881 ttccagctcc ctctggctct agaatggctc ccgttcccgg tgttggggcc aaagctctgc 2941 ttgatggggt ctcaagttgc ctttcttccc cctccccccg cccgcagcag tccggatgcc 3001 gcccgcatcc gagtcaagcg ctaccgccag agcatgaaca acttccaggg cctccggagc 3061 tttggctgcc gcttcgggac gtgcacggtg cagaagctgg cacaccagat ctaccagttc 3121 acagataagg acaaggacaa cgtcgccccc aggagcaaga tcagccccca gggctacggc 3181 cgccggcgcc ggcgctccct gcccgaggcc ggcccgggtc ggactctggt gtcttctaag 3241 ccacaagcac acggggctcc agcccccccg agtggaagtg ctccccactt tctttaggat 3301 ttaggcgccc atggtacaag gaatagtcgc gcaagcatcc cgctggtgcc tcccgggacg 3361 aaggacttcc cgagcggtgt ggggaccggg ctctgacagc cctgcggaga ccctgagtcc 3421 gggaggcacc gtccggcggc gagctctggc tttgcaaggg cccctccttc tgggggcttc 3481 gcttccttag ccttgctcag gtgcaagtgc cccagggggc ggggtgcaga agaatccgag 3541 tgtttgccag gcttaaggag aggagaaact gagaaatgaa tgctgagacc cccggagcag 3601 gggtctgagc cacagccgtg ctcgcccaca aactgatttc tcacggcgtg tcaccccacc 3661 agggcgcaag cctcactatt acttgaactt tccaaaacct aaagaggaaa agtgcaatgc 3721 gtgttgtaca tacagaggta actatcaata tttaagtttg ttgctgtcaa gatttttttt 3781 gtaacttcaa atatagagat atttttgtac gttatatatt gtattaaggg cattttaaaa 3841 gcaattatat tgtcctcccc tattttaaga cgtgaatgtc tcagcgaggt gtaaagttgt 3901 tcgccgcgtg gaatgtgagt gtgtttgtgt gcatgaaaga gaaagactga ttacctcctg 3961 tgtggaagaa ggaaacaccg agtctctgta taatctattt acataaaatg ggtgatatgc 4021 gaacagcaaa ccaataaact gtctcaatgc 
tgattcatct ctcggctcgg ctctcacgtc 4081 taggagggag gtgcctggtc cggccgccca ggagccttgg ggtcccgccg ccgggcttgg 4141 gaaggtggga cagcggcgcg tgcaggtgct ggccctactc tgagcgggct gagctgtcac 4201 gagctctgtt ctctgtgtgg ggcgcgcaga acacgtgtcc tgggtgcgaa tcagggcttc 4261 gcggagacgc ccgaggtcac cgcctggcgt ggcggccggc ggggggtggg tagggggagc 4321 cctagtgggt tggcgagcct ggactctcgg gttgcgcaac ggagtcctag atttataggg 4381 ttgctcacag aacccggcgc gaaagcgagc ttcgaagagt tggggtggtc tctgaggcct 4441 tctaagagag gcgttccccc tgcctgccgc ggttgttaaa gttccaagcc gccctagcaa 4501 tgcgtgggga cacctccgga aaacccagta acccctagta accccgggtc tgcccgggcc 4561 tgagcgcccg cgggccgctg gctcatgggc gaggctgtag cgcccc //