LOCUS FJ006723 7963 bp DNA circular VRL 20-AUG-2008 DEFINITION Human papillomavirus type 16, complete genome. ACCESSION FJ006723 VERSION FJ006723.1 GI:196170262 KEYWORDS . SOURCE Human papillomavirus type 16 ORGANISM Human papillomavirus type 16 Viruses; dsDNA viruses, no RNA stage; Papillomaviridae; Alphapapillomavirus. REFERENCE 1 (bases 1 to 7963) AUTHORS Hong,L.L., Hai,M.Z. and Chun,Z.F. TITLE Direct Submission JOURNAL Submitted (11-AUG-2008) Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, 14 Shengli Road, Urumqi, Xinjiang 830046, China FEATURES Location/Qualifiers source 1..7963 /organism="Human papillomavirus type 16" /mol_type="genomic DNA" /isolation_source="Uygur cervical cancer patient" /host="Homo sapiens" /db_xref="taxon:333760" /country="China: Xinjiang province" /collection_date="16-Dec-2006" /PCR_primers="fwd_seq: atagggcccttctgatccttctatagtttctttagtg, rev_seq: atagggcccacaggatctactgttaaagg" gene 83..559 /gene="E6" CDS 83..559 /gene="E6" /codon_start=1 /product="E6 protein" /protein_id="ACG75887.1" /db_xref="GI:196170263" /translation="MHQKRTAMFQDPQERPRKLPQLCTELQTTIHDIILECVYCKQQL LRREVYDFAFRDLCIVYRDGNPYAVCDKCLKFYSKISEYRHYCYSVYGTTLEQQYNKP LCDLLIRCINCQKPLCPEEKQRHLDKKQRFHNIRGRWTGRCMSCCRSSRTRRETQL" gene 562..858 /gene="E7" CDS 562..858 /gene="E7" /codon_start=1 /product="E7 protein" /protein_id="ACG75888.1" /db_xref="GI:196170264" /translation="MHGDTPTLHEYMLDLQPETTDLYCYEQLNDSSEEEDEIDGPAGQ AEPDRAHYNIVTFCCKCDSTLRLCVQSTHVDIRTLEDLLMGTLGIVCPICSQKP" gene 865..2877 /gene="E1" CDS 865..2877 /gene="E1" /codon_start=1 /product="E1 protein" /protein_id="ACG75889.1" /db_xref="GI:196170265" /translation="MADPAGTNGEEGTGCNGWFYVEAVVEKKTGDAISDDENENDSDT GEDLVDFIVNDNDYLTQADTETAHALFTAQEAKQHRDAVQVLKRKYLGSPLSDISGCV DNNISPRLKAICIEKQSRAAKRRLFENEDSGYGNTEVETQQMLQVEGRHETETPCSQY SGGSGGGCSQRHETETPCSQYSGGSGGGCSQYSSGSGGEGVSERHTICQTPLTNILNV LKTSNAKAAMLAKFKELYGVSFSELVRPFKSNKSTCCDWCIAAFGLTPSIADSIKTLL QQYCLYLHIQSLACSWGMVVLLLVRYKCGKNRETIEKLLSKLLCVSPMCMMIEPPKLR STAAALYWYKTGISNISEVYGDTPEWIQRQTVLQHSFNDCTFELSQMVQWAYDNDIVD DSEIAYKCAQLADTNSNASAFLKSNSQAKIVKDCATMCRHYKRAEKKQMSMSQWIKYR CDRVDDGGDWKQIVMFLRYQGVEFMSFLTALKRFLQGIPKKNCILLYGAANTGKSLFG MSLMKFLQGSVICFVNSKSHFWLQPLADAKIGMLDDATVPCWNYIDDNLRNALDGNLV SMDVKHRPLVQLKCPPLLITSSINAGTDSRWPYLHNRLVVFTFPNEFPFDENGNPVYE LNDKNWKSFFSRTWSRLSLHEDEDKENDGDSLPTFKCVSGQNTNTL" gene 2819..3916 /gene="E2" CDS 2819..3916 /gene="E2" /codon_start=1 /product="E2 protein" /protein_id="ACG75890.1" /db_xref="GI:196170266" /translation="METLCQRLNVCQDKILTHYENDSTDLRDHIDYWKHMRLECAIYY KAREMGFKHINHQVVPTLAVSKNKALQAIELQLTLETIYNSQYSNEKWTLQDVSLEVY LTAPTGCIKKHGYTVEVQFDGDICNTMHYTNWTHIYICEEASVTVVEGQVDYYGLYYV HEGIRTYFVQFKDDAEKYSKNKVWEVHAGGQVILCPTSVFSSNEVSSPEIIRQHLANH SAATHTKAVALGTEETQTTIQRPRSEPDTGNPCHTTKLLHRDSVDSAPILTAFNSSHK GRINCNSNTTPIVHLKGDANTLKCLRYRFKKHCTLYTAVSSTWHWTGHNVKHKSAIVT LTYDSEWQRDQFLSQVKIPKTITVSTGFMSI" gene <3396..3683 /gene="E4" CDS <3396..3683 /gene="E4" /note="start codon not determined" /codon_start=1 /product="E4 protein" /protein_id="ACG75891.1" /db_xref="GI:196170267" /translation="YYVLHLCLAATKYPLLKLLGSTWPTTPPRPIPKPSPWAPKKHRR LSSDQDQSQTPETPATPLSCCTETQWTVLQSSLHLTAHTKDGLTVIVTLHP" gene 4294..5715 /gene="L2" CDS 4294..5715 /gene="L2" /codon_start=1 /product="L2 protein" /protein_id="ACG75892.1" /db_xref="GI:196170268" /translation="MRHKRSAKRTKRASATQLYKTCKQAGTCPPDIIPKVEGKTIADQ ILQYGSMGVFFGGLGIGTGSGTGGRTGYIPLGTRPPTATDTLAPVRPPLTVDPVGPSD PSIVSLVEETSFIDAGAPTSVPSIPPDVSGFSITTSTDTTPAILDINNTVTTVTTHNN PTFTDPSVLQPPTPAETGGHFTLSSSTISTHNYEEIPMDTFIVSTNPNTVTSSTPIPG SRPVARLGLYSRTTQQVKVVDPAFVTTPTKLITYDNPAYEGIDVDNTLYFSSNDNSIN IAPDPDFLDIVALHRPALTSRRTGIRYSRIGNKQTLRTRSGKSIGAKVHYYYDFSTID PAEEIELQTITPSTYTTTSHAASPTSINNGLYDIYADDFITDTSTTPVPSVPSTSLSG YIPANTTIPFGGAYNIPLVSGPDIPINITDQAPSLIPIVPGSPQYTIIADAGDFYLHP SYYMLRKRRKRLPYFFSDVSLAA" gene 5618..7213 /gene="L1" CDS 5618..7213 /gene="L1" /codon_start=1 /product="L1 protein" /protein_id="ACG75893.1" /db_xref="GI:196170269" /translation="MQVTFIYILVITCYENDVNVYHIFFQMSLWLPSEATVYLPPVPV SKVVSTDGYVARTNIYYHAGTSRLLAVGHPYFPIKKPNNNKILVPKVSGLQYRVFRIH LPDPNKFGFPDTSFYNPDTQRLVWACVGVEVGRGQPLGVGISGHPLLNKLDDTENASA YAANAGVDNRECISMDYKQTQLCLIGCKPPIGEHWGKGSPCTNVAVNPGDCPPLELIN TVIQDGDMVHTGFGAMDFTTLQANKSEVPLDICTSICKYPDYIKMVSEPYGDSLFFYL RREQMFVRHLFNRAGAVGENVPDDLYIKGSGSTANLASSNYFPTPSGSMVTSDAQIFN KPYWLQRAQGHNNGICWGNQLFVTVVDTTRSTNMSLCAAISTSETTYKNTNFKEYLRH GEEYDLQFIFQLCKITLTADVMTYIHSMNSTILEDWNFGLQPPPGGTLEDTYRFVTSQ AIACQKHTPPAPKEDPLKKYTFWEVNLKEKFSADLDQFPLGRKFLLQAGLKAKPKFTL GKRKATPTTSSTSTTAKRKKRKL" ORIGIN 1 actacaataa ttcatgtata aaactaaggg cgtaaccgaa atcggttgaa ccgaaaccgg 61 ttagtataaa agcggacatt ttatgcacca aaagagaact gcaatgtttc aggacccaca 121 ggagcgaccc agaaagttac cacagttatg cacagagctg caaacaacta tacatgatat 181 aatattagaa tgtgtgtact gcaagcaaca gttactgcga cgtgaggtat atgactttgc 241 ttttcgggat ttatgcatag tatatagaga tgggaatcca tatgctgtat gtgataaatg 301 tttaaagttt tattctaaaa ttagtgagta tagacattat tgttatagtg tgtatggaac 361 aacattagaa cagcaataca acaaaccgtt gtgtgatttg ttaattaggt gtattaactg 421 tcaaaagcca ctgtgtcctg aagaaaagca aagacatctg gacaaaaagc aaagattcca 481 taatataagg ggtcggtgga ccggtcgatg tatgtcttgt tgcagatcat caagaacacg 541 tagagaaacc cagctgtaat catgcatgga gatacaccta cattgcatga atatatgtta 601 gatttgcaac cagagacaac tgatctctac tgttatgagc aattaaatga cagctcagag 661 gaggaggatg aaatagatgg tccagctgga caagcagaac cggacagagc ccattacaat 721 attgtaacct tttgttgcaa gtgtgactct acgcttcggt tgtgcgtaca aagcacacac 781 gtagacattc gtactttgga agacctgtta atgggcacac taggaattgt gtgccccatc 841 tgttctcaga aaccataatc taccatggct gatcctgcag gtaccaatgg ggaagagggt 901 acgggatgta atggatggtt ttatgtagag gctgtagtgg aaaaaaaaac aggggatgct 961 atatcagatg acgagaacga aaatgacagt gatacaggtg aagatttggt agattttata 1021 gtaaatgata atgattattt aacacaggca gacacagaga cagcacatgc gttgtttact 1081 gcacaggaag caaaacaaca tagagatgca gtacaggttc taaaacgaaa gtatttgggt 1141 agtccactta gtgatattag tggatgtgta gacaataata ttagtcctag attaaaagct 1201 atatgtatag aaaaacaaag tagagctgca aaaaggagat tatttgaaaa cgaagacagc 1261 gggtatggca atactgaagt ggaaactcag cagatgttac aggtagaagg gcgccatgag 1321 actgaaacac catgtagtca gtatagtggt ggaagtgggg gtggttgcag tcagcgccat 1381 gagactgaaa caccatgtag tcagtatagt ggtggaagtg ggggtggttg cagtcagtac 1441 agtagtggaa gtgggggaga gggtgttagt gaaagacaca ctatatgcca aacaccactt 1501 acaaatattt taaatgtact aaaaactagt aatgcaaagg cagcaatgtt agcaaaattt 1561 aaagagttat acggggtgag tttttcagaa ttagtaagac catttaaaag taataaatca 1621 acgtgttgcg attggtgtat tgctgcattt ggacttacac ccagtatagc tgacagtata 1681 aaaacactat tacaacaata ttgtttatat ttacacattc aaagtttagc atgttcatgg 1741 ggaatggttg tgttactatt agtaagatat aaatgtggaa aaaatagaga aacaattgaa 1801 aaattgctgt ctaaactatt atgtgtgtct ccaatgtgta tgatgataga gcctccaaaa 1861 ttgcgtagta cagcagcagc gttatattgg tataaaacag gtatatcaaa tattagtgaa 1921 gtgtatggag acacgccaga atggatacaa agacaaacag tattacaaca tagttttaat 1981 gattgtacat ttgaattatc acagatggta caatgggcct acgataatga catagtagac 2041 gatagtgaaa ttgcatataa atgtgcacaa ttggcagaca ctaatagtaa tgcaagtgcc 2101 tttctaaaaa gtaattcaca ggcaaaaatt gtaaaggatt gtgcaacaat gtgtagacat 2161 tataaacgag cagaaaaaaa acaaatgagt atgagtcaat ggataaaata tagatgtgat 2221 agggtagatg atggaggtga ttggaagcaa attgttatgt ttttaaggta tcaaggtgta 2281 gagtttatgt catttttaac tgcattaaaa agatttttgc aaggcatacc taaaaaaaat 2341 tgcatattac tatatggtgc agctaacaca ggtaaatcat tatttggtat gagtttaatg 2401 aaatttctgc aagggtctgt aatatgtttt gtaaattcta aaagccattt ttggttacaa 2461 ccattagcag atgccaaaat aggtatgtta gatgatgcta cagtgccctg ttggaactac 2521 atagatgaca atttaagaaa tgcattggat ggaaatttag tttctatgga tgtaaagcat 2581 agaccattgg tacaactaaa atgccctcca ttattaatta catctagcat taatgctggt 2641 acagattcta ggtggcctta tttacataat agattggtgg tgtttacatt tcctaatgag 2701 tttccatttg acgaaaacgg aaatccagtg tatgagctta atgataagaa ctggaaatcc 2761 tttttctcaa ggacgtggtc cagattaagt ttgcacgagg acgaggacaa ggaaaacgat 2821 ggagactctt tgccaacgtt taaatgtgtg tcaggacaaa atactaacac attatgaaaa 2881 tgatagtaca gacctacgtg accatataga ctattggaaa cacatgcgcc tagaatgtgc 2941 tatttattac aaggccagag aaatgggatt taaacatatt aaccaccagg tggtgccaac 3001 actggctgta tcaaagaata aagcattaca agcaattgaa ctgcaactaa cgttagaaac 3061 aatatataac tcacaatata gtaatgaaaa gtggacatta caagacgtta gccttgaagt 3121 gtatttaact gcaccaacag gatgtataaa aaaacatgga tatacagtgg aagtgcagtt 3181 tgatggagac atatgcaata caatgcatta tacaaactgg acacatatat atatttgtga 3241 agaagcatca gtaactgtgg tagagggtca agttgactat tatggtttat attatgttca 3301 tgaaggaata cgaacatatt ttgtgcagtt taaagatgat gcagaaaaat atagtaaaaa 3361 taaagtatgg gaagttcatg cgggtggtca ggtaatatta tgtcctacat ctgtgtttag 3421 cagcaacgaa gtatcctctc ctgaaattat taggcagcac ttggccaacc actccgccgc 3481 gacccatacc aaagccgtcg ccttgggcac cgaagaaaca cagacgacta tccagcgacc 3541 aagatcagag ccagacaccg gaaacccctg ccacaccact aagttgttgc acagagactc 3601 agtggacagt gctccaatcc tcactgcatt taacagctca cacaaaggac ggattaactg 3661 taatagtaac actacaccca tagtacattt aaaaggtgat gctaatactt taaaatgttt 3721 aagatataga tttaaaaagc attgtacatt gtatactgca gtgtcgtcta catggcattg 3781 gacaggacat aatgtaaaac ataaaagtgc aattgttaca cttacatatg atagtgaatg 3841 gcaacgtgac caatttttgt ctcaagttaa aataccaaaa actattacag tgtctactgg 3901 atttatgtct atatgacaaa tcttgatact gcatccacaa cattactggc gtgctttttg 3961 ctttgctttt gtgtgctttt gtgtgtctgc ctattaatac gtccgctgct tttgtctgtg 4021 tctacataca catcattaat actattggta ttactattgt ggataacagc agcctctgcg 4081 tttaggtgtt ttattgtata tattgtattt gtttatatac cattattttt aatacataca 4141 catgcacgct tttaattaca taatgtatat gtacataatg taattgttac atataattgt 4201 tgtataccat aacttactat tttttctttt ttattttcat atataatttt ttttttgttt 4261 gtttgttttt taataaactg ttatcactta acaatgcgac acaaacgttc tgcaaaacgc 4321 acaaaacgtg catcggctac ccaactttat aaaacatgca aacaggcagg tacatgtcca 4381 cctgacatta tacctaaggt tgaaggcaaa actattgctg atcaaatatt acaatatgga 4441 agtatgggtg tattttttgg tgggttagga attggaacag ggtcgggtac aggcggacgc 4501 actgggtata ttccattggg aacaaggcct cccacagcta cagatacact tgctcctgta 4561 agaccccctt taacagtaga tcctgtgggc ccttctgatc cttctatagt ttctttagtt 4621 gaagaaacta gttttattga tgctggtgca ccaacatctg taccttccat tcccccagat 4681 gtatcaggat ttagtattac tacttcaact gataccacac ctgctatatt agatattaat 4741 aatactgtta ctactgttac tacacataat aatcccactt tcactgaccc atctgtattg 4801 cagcctccaa cacctgcaga aactggaggg cattttacac tttcatcatc cactattagt 4861 acacataatt atgaagaaat tcctatggat acatttattg ttagcacaaa ccctaacaca 4921 gtaactagta gcacacccat accagggtct cgcccagtgg cacgcctagg attatatagt 4981 cgcacaacac aacaagttaa agttgtagac cctgcttttg taaccactcc cactaaactt 5041 attacatatg ataatcctgc atatgaaggt atagatgtgg ataatacatt atatttttct 5101 agtaatgata atagtattaa tatagctcca gatcctgact ttttggatat agttgcttta 5161 cataggccag cattaacctc taggcgtact ggcattaggt acagtagaat tggtaataaa 5221 caaacactac gtactcgtag tggaaaatct ataggtgcta aggtacatta ttattatgat 5281 tttagtacta ttgatcctgc agaagaaata gaattacaaa ctataacacc ttctacatat 5341 actaccactt cacatgcagc ctcacctact tctattaata atggattata tgatatttat 5401 gcagatgact ttattacaga tacttctaca accccggtac catctgtacc ctctacatct 5461 ttatcaggtt atattcctgc aaatacaaca attccttttg gtggtgcata caatattcct 5521 ttagtatcag gtcctgatat acccattaat ataactgacc aagctccttc attaattcct 5581 atagttccag ggtctccaca atatacaatt attgctgatg caggtgactt ttatttacat 5641 cctagttatt acatgttacg aaaacgacgt aaacgtttac catatttttt ttcagatgtc 5701 tctttggctg cctagtgagg ccactgtcta cttgcctcct gtcccagtat ctaaggttgt 5761 aagcacggat ggatatgttg cacgcacaaa catatattat catgcaggaa catccagact 5821 acttgcagtt ggacatccct attttcctat taaaaaacct aacaataaca aaatattagt 5881 tcctaaagta tcaggattac aatacagggt atttagaata catttacctg accccaataa 5941 gtttggtttt cctgacacct cattttataa tccagataca cagcggctgg tttgggcctg 6001 tgtaggtgtt gaggtaggtc gtggtcagcc attaggtgtg ggcattagtg gccatccttt 6061 attaaataaa ttggatgaca cagaaaatgc tagtgcttat gcagcaaatg caggtgtgga 6121 taatagagaa tgtatatcta tggattacaa acaaacacaa ttgtgtttaa ttggttgcaa 6181 accacctata ggggaacact ggggcaaagg atccccatgt accaatgttg cagtaaatcc 6241 aggtgattgt ccaccattag agttaataaa cacagttatt caggatggtg atatggttca 6301 tactggcttt ggtgctatgg actttactac attacaggct aacaaaagtg aagttccact 6361 ggatatttgt acatctattt gcaaatatcc agattatatt aaaatggtgt cagaaccata 6421 tggcgacagc ttattttttt atttacgaag ggaacaaatg tttgttagac atttatttaa 6481 tagggctggt gctgttggtg aaaatgtacc agacgattta tacattaaag gctctgggtc 6541 tactgcaaat ttagccagtt caaattattt tcctacacct agtggttcta tggttacctc 6601 tgatgcccaa atattcaata aaccttattg gttacaacga gcacagggcc acaataatgg 6661 catttgttgg ggtaaccaac tatttgttac tgttgttgat actacacgca gtacaaatat 6721 gtcattatgt gctgccatat ctacttcaga aactacatat aaaaatacta actttaagga 6781 gtacctacga catggggagg aatatgattt acagtttatt tttcaactgt gcaaaataac 6841 cttaactgca gacgttatga catacataca ttctatgaat tccactattt tggaggactg 6901 gaattttggt ctacaacctc ccccaggagg cacactagaa gatacttata ggtttgtaac 6961 atcccaggca attgcttgtc aaaaacatac acctccagca cctaaagaag atccccttaa 7021 aaaatacact ttttgggaag taaatttaaa ggaaaagttt tctgcagacc tagatcagtt 7081 tcctttagga cgcaaatttt tactacaagc aggattgaag gccaaaccaa aatttacatt 7141 aggaaaacga aaagctacac ccaccacctc atctacctct acaactgcta aacgcaaaaa 7201 acgtaagctg taagtattgt atgtatgttg aattagtgtt gtttgttgtt tatgtgtttg 7261 tatgtgcttg tatgtgcttg taaatattaa gttgtatgtg tgtttgtatg tatggtataa 7321 taaacacgtg tgtatgtgtt tttaaatgct tgtgtaacta ttgtgtcatg caacataaat 7381 aaacttattg tttcaacacc tactaattgt gttgtggtta ttcattgtat ataaactata 7441 tttgctacat cctgtttttg ttttatatat actatatttt gtagcgccag cggccatttt 7501 gtagctccaa ccgaattcgg ttgcatgctt tttggcacaa aatgtgtttt tttaaatagt 7561 tctatgtcag caactatagt ttaaacttgt acgtttcctg cttgccatgc gtgccaaatc 7621 cctgttttcc tgacctgcac tgcttgccaa ccattccatt gttttttaca ctgcactatg 7681 tgcaactact gaatcaccat gtacattgtg tcatataaaa taaatcacta tgcgccaacg 7741 ccttacatac cgctgttagg cacatatttt tggcttgttt taactaacct aattgcatat 7801 ttggcataag gtttaaactt ctaaggccaa ctaaatgtca ccctagttca tacatgaact 7861 gtgtaaaggt tagtcataca ttgttcattt gtaaaactgc acatgggtgt gtgcaaaccg 7921 ttttgggtta cacatttaca agcaacttat ataataatac taa //