LOCUS NC_001457 7353 bp 24-FEB-2011 DEFINITION Human papillomavirus type 4, complete genome. ACCESSION NC_001457 VERSION NC_001457.1 GI:9626597 KEYWORDS complete genome; ORF E1; ORF E2; ORF E4; ORF E6; ORF E7; ORF L1; ORF L2. COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to X70827. COMPLETENESS: full length. DBLINK Project: 15492 UNIMARK NC_001457 ORIGIN 1 GTCTGTAATG ATAGTTGGCA ACAATCATTA CTTATAGCTA TATATAACCG GAAGAGATAC 61 ATATAAAAAG GGACAGTGCA TTTCTACTAA ATCCTGTCCA GATGGCAGAT GGCAGACCTG 121 CAACCTTGGA CGACTTCTGC AGACGATTCG ACATTTCCTT TTTTGATTTG CGCCTTACTT 181 GTATTTTTTG TTCTCATACT GTCGATCTTG CGGATCTTGC TTTATTCTAT CTTAAGAAAC 241 TTAGTTTAGT ATTTAGAGGA AATTGTTATT ATGCATGTTG TTCTGAATGC TTAAGATTAA 301 GTGCACTGTT TGAACAAGAG AATTATTTTC AATGTTCTAT TAAAGCTGTA CATTTGGAGG 361 AAATTGCTCA GAAAAAGATT AAGGAAATTT GCATTAGATG CATTTGCTGC CTTAGATTAC 421 TTGATATTGT TGAGAAATTA GATTTATTAT ACTCTGACGA GACTTGCTAT TTAATAAGGG 481 GTTTGTGGAG GGGCTATTGC AGAAATTGTA TTAGGAAACA ATGAGAGGAG CAGCGCCCAC 541 GGTTGCAGAT CTTAATTTAG AACTAAATGA CTTAGTGTTA CCAGCAAACC TGCTGAGTGA 601 GGAGGTCTTG CAATCTTCAG ATGATGAGTA TGAGATTACA GAGGAGGAGT CGGTGGTTCC 661 ATTTAGAATA GACACCTGTT GCTATAGATG TGAAGTTGCT GTAAGAATTA CATTGTATGC 721 TGCTGAGCTC GGACTACGGA CCTTGGAACA ACTTCTTGTA GAAGGAAAGC TGACGTTTTG 781 CTGCACCGCT TGTGCAAGAA GTCTTAACAG AAATGGCAGA TAAAGGTACA GACAATTTTG 841 ACTTAGAAGG GAATAATTGG TATATTGTCC ATGAAGCAGA ATGCACTGAC AGTATAGATA 901 CGTTGGATGA TTTATGCGAC GAAAGTAATG ACGATTCAAA CATTTCTAAC TTAATTGATG 961 ACGATGTCGT TGATCAGGGG AATTCCCTTG CGCTGTACAA TGCACAAATA AATGAGGATT 1021 GTGACAATGC ACTAGCACAC CTAAAACGAA AGTATAACAA AAGTCCAGAG CAGGCAGTCG 1081 CTGAATTGAG TCCGCAGTTG CAGGCTGTGA AAATAACTCC TGAAAGACAC AGCAAAAGGA 1141 GATTATTTCA GGACAGTGGG ATTTTCGAAG ATGAAGCTGA AAATTCTCTT ACACAGGTAG 1201 AATCCGAGAG CCAGGCTGGA CCTTCTAGCC AAGATGGCGG CGGAGATATT AATTTGTTGT 1261 TGTTACAAAG TAGTAACAGG AGGGCAACAA TGCTAGCAAA GTTTAAAGAA TGGTATGGGG 1321 TCTCATACAA TGAAATAACA AGAATTTATA AAAGTGATAA ATCTTGTAGT GATAATTGGG 1381 TAATAGTTAT TTTTAGAGCT GCTGTTGAAG TATTAGAAAG TTCAAAGATT GTTTTAAAGC 1441 AGCATTGTAC ATATATTCAA GTTAAGATCT TTGGATTTTC AGCTTTATAT TTAGTACAGT 1501 TTAAAAGTGC GAAAAGTAGA GAAACTGTAC AAAAGTTGAT GTGTTCTATA TTAAATATCC 1561 AAGAATATCA AATGTTATGT GATCCTCCAA AATTACGAAG TGTACCCACA GCATTATACT 1621 TTTATAAGCA TGCTATGTTA ACAGAGAGTT CTGTTTTTGG ACAAACACCG GATTGGATCG 1681 CAAAACAAAC TCTCGTAAGT CATCAAGCAG CAACTACTGC AGAGACTTTT GAGTTATCTA 1741 GAATGGTTCA GTGGGCATAC GATAATAATT ATGTGGATGA ATGTGACATT GCTTATCACT 1801 ATGCAATGTA CGCAGAGGAG GATGCAAATG CTGCTGCTTA TTTAAAAAGT AATAATCAAG 1861 TAAAGCATGT ACGAGATTGT AGTACAATGG TCAGGATGTA TAAAAGATAT GAAATGAGAG 1921 ATATGTCAAT GTCAGAATGG ATTTATAAAT GTTGTGATGA ATGTTCTGAA GAAGGAGATT 1981 GGAAGCCAAT CTCACAGTTT TTAAAATATC AAGGTGTTAA TATATTATCC TTTCTTATAG 2041 TGCTTAAATC ATTTTTAAAA GGTATTCCAA AAAAAAACTG TATAGTTATT CATGGTCCAC 2101 CAGATACAGG AAAATCATTA TTTTGTTATT CTTTTATAAA ATTTTTAAAA GGAAAAGTAG 2161 TTTCATATGT AAATAGAAGT AGCCATTTTT GGTTGCAGCC TCTGATGGAT TGCAAGGTAG 2221 GATTTATGGA TGATGCTACC TATGTGTGCT GGACATATAT AGATCAAAAT TTAAGGAATG 2281 CATTAGATGG TAATCCAATG TGTATTGACG CTAAACACAG AGCACCACAA CAATTAAAAT 2341 TACCACCAAT GCTAATAACG TCAAATATTG ATATTAAACA GGAACAATCT TTAATGTATT 2401 TACACAGTAG AATACAGTGT TTTAATTTTC CTAACAAAAT GCCTATTTTA GATGATGGTA 2461 GTCCTATGTA TACATTTACT GACGGTACTT GGAAATCTTT TTTCCAAAAG CTTGGCAGAC 2521 AATTAGAATT AACAGATCCT GAAGAGGAAA ACAATGGAGT CCCTAGTCGC ACGTTTCGAT 2581 GCACTTCAAG AAGCAATTCT GACTCATATT GAGTCACAGG AGAGCACTTT GGAATCCCAA 2641 ATCCAATATT GGGAAAATAT CAGAAAAGAA AATGCTATAA TGCATTATGC TCGAAAACAA 2701 GGCCTAACCA AATTAGGTCT ACAACCACTT CCTACACTAG CAGTAACTGA ATACAATGCA 2761 AAGCAAGCTA TTCAGATACA TTTAACTTTG CAATCATTGT TAAAATCTCC CTTTGCATCT 2821 GAACGGTGGA CATTGACAGA TGTTAGTGCA GAACTGATAA ATACCTCTCC ACAAAACTGT 2881 TTAAAAAAGG GAGGTTATGA TGTTGCTGTG TGGTTTGATA ATGATAGACA GAATGCAATG 2941 CTGTACACAA ATTGGGACTT TTTATATTAT CAAGATATGA ATGAACAGTG GCACAAAGTT 3001 AAAGGTGAAG TGGATTATGA TGGCTTATAC TTTACAGACC ATACGGGAGA AAGAGCTTAT 3061 TTTACATTAT TTAGCTCTGA TGCTCAAAGA TTTAGCAGAA CTGGACTGTG GACTGTGCAT 3121 TTTAAAACCC AAGTTATTTC CTCCCCTATT GTTAGCTCTA CATACTCCTC CTCCTTCGAC 3181 ACTGAGGAAC AACAGTTACC CGGGCCCTCC ACCAGCTACT CCGAAGTTAC CGAGCAGGCG 3241 AGCCCTACTC GAAGGAGGAA ACCGAGGAAA TCCGACGCGA CCTCCACCAC GTCCCCTGAA 3301 ACCGAGGGAG TACGACTACG ACGAAGACGA CGAGAAGGAA AATCAGGGCC CGGGTCAGGA 3361 GAAACCCCCC GCAAAAGAAG AAGAGGAGGA GGAAGAGGAG GAGGAGAGAC CGAATTGGGA 3421 TCTGCACCAT CTCCTGCAGA AGTGGGGAGC AGACATCGAC AAGTTGAAAG ACAAGGTCTG 3481 TCGCGACTTG GACTCTTACA AGCAGAAGCT AGGGATCCGC CTATGATATT GTTAAAGGGC 3541 ACAGCAAATT CTTTGAAATG TTGGAGATAT AGAAAAGTTA ACTCAAATTG CTGCAACTTC 3601 TTATTCATGA GTACTGTTTG GAACTGGGTT GGAGATTGCT CACATAATCA TAGTCGCATG 3661 CTTATTGCAT TTGATAGCAC TGACCAAAGA GACGCTTTTG TAAAACACAA CCTTTTTCCT 3721 AAACTGTGTA CATATACCTA CGGCTCATTG AATAGTTTAT AAAATGCAAA GCTTGAGTAG 3781 AAGGAAAAGA GATTCAGTTC CAAATCTTTA TGCAAAATGT CAACTGTCTG GCAATTGCCT 3841 ACCTGATGTA AAAAATAAAG TAGAAGCTGA TACTCTTGCT GATCGTTTGC TGAGATGGTT 3901 GGGAAGTGTA ATATACCTAG GAGGCTTGGG TATTGGTACT GGGAGAGGTA GTGGGGGGTC 3961 AACTGGGTAT AATCCAATTG GAGCTCCAAG TAGAGTCACA CCTAGTGGTA CTTTAGTAAG 4021 GCCTACAGTG CCTGTGGAAA GTTTGGGACC CTCAGAAATA ATCCCAATAG ATGCAATAGA 4081 CCCAACAACA TCTTCTGTTG TGCCATTAGA GGATCTGACC ATCCCAGATG TCACAGTAGA 4141 TAGTGGAGAT ACAAGAGGAA TAGGGGAGAC TACTCTTCAG CCTGCACAAG TAGATATTTC 4201 AACATCACAT GACCCTATAT CAGATGTCAC TGGTGCTAGC AGCCACCCTA CAATCATATC 4261 TGGCGAGGAT AACGCCATTG CAGTGTTAGA TGTGTCCCCT ATAGAACCTC CCACAAAACG 4321 GATAGCATTG GCAACTAGGG GAGCCTCAGC AACTCCACAT GTAAGTGTCA TATCTGGCAC 4381 AACCGAATTC GGTCAGTCAT CTGATCTGAA TGTATTTGTG AATGCCACAT TTTCAGGCGA 4441 TTCCATTGGT TATACAGAAG AAATTCCATT AGAACCGTTG AACCCCTTTC AAGAATTCGA 4501 AATAGAAAGC CCTCCAAAAA CTAGTACACC ACGTGACGTT TTAAATCGTG CAATAGGAAG 4561 AGCACGGGAT TTATATAATA GAAGGGTTCA GCAAATACCT ACTAGGAACC CAGCTTTACT 4621 GACACAGCCT TCCCGCGCAA TAGTATTTGG ATTTGAAAAT CCCGCCTTTG ATGCTGACAT 4681 CACTCAAACA TTTGAGCGGG ATTTAGAACA GGTTGCAGCA GCTCCAGATG CTGACTTTGC 4741 AGACATAGTC ACTATAGGGC GTCCAAGGTT TTCAGAGACA GATGCTGGTC AAATTAGAGT 4801 TAGCAGGCTT GGACGCCGAG GCACAATAAA AACTAGAAGT GGTGTGCAAA TTGGGCAGGC 4861 GGTTCATTTT TATTACGACC TAAGTACAAT AGATACTGCT GATGCTATTG AATTATCTAC 4921 TTTAGGTCAA CATTCAGGAG AACAAAGCAT TGTTGATGCT ATGATAGAAA GCAGCTTAAT 4981 AGATCCTTTT GAAATGCCCG ATCCTACTTT TACAGAAGAA CAACAGCTTT TAGATCCACT 5041 TACAGAAGAT TTTAGTCAGT CACACTTGGT GCTTACTAGT AGCAGACGTG GGACATCATT 5101 TACTATACCT ACAATACCAC CTGGATTAGG TCTTAGAATT TATGTAGATG ATGTAGGTTC 5161 TGATTTATTT GTTTCCTATC CAGAATCTAG AGTAATACCT GCTGGAGGTT TACCAACTGA 5221 GCCATTTGTT CCTCTAGAAC CAGCTTTGTT ATCTGATATA TTTAGTACGG ATTTTGTATA 5281 TCGTCCTAGT TTATATCGCA AGAAACGGAA ACGATTAGAA ATGTTTTAAT TGTTTTGCAG 5341 GAACATGTCG AGTTGGTTAT CTACAACGGG TAAAGTCTAC TTACCTCCAG CTCAACCTGT 5401 GGCAAGAGTT TTGGAAACTG ACGAATATAT CACTGGAACA TCTCTGTATT TCCACGCTGG 5461 TACAGAAAGG CTTTTAACTG TAGGCCATCC TTATTTTCCA GTGAAAGATG TACAGGAACC 5521 TCACAAAGTA TTAGTTCCTA AGGTTTCAGG AAGTCAATTT AGAGTGTTTA GATTCAATTT 5581 GCCAGACCCA AACAGATTTG CTTTAATTGA TAATGGCTTT TATGATTCTG ATCATGAACG 5641 CCTAGTATGG AAACTGAGGG GAATAGAAAT AGGAAGAGGA GGACCGCTTG GTATAGGTAC 5701 TACAGGTCAT CCTTTATATA ATAAGTTTGG AGACACAGAA AATCCTAATG GCTACAAAAA 5761 GCAATCAGAT GATAATAGAC AGGATGTCTC TTTAGACCCA AAACAAACAC AGATGTTTAT 5821 TATAGGTTGC ACTCCTGCAA TAGGTGAACA TTGGGATAAA GCTGAACCTT GTCCCAGCCC 5881 TGCTCCGCAA CAGGGAGATT GCCCACCAAT AGAGCTTGTA AATTCATACA TTCAAGATGG 5941 AGATATGTGT GACATTGGAT TTGGGGCTTT CAATTTTAAA GCTTTGCAGG CTGATAAATC 6001 TAGTGCTCCT TTGGATGTCA TTGCCACAGT TTGTAAATGG CCAGATTTTT TAAAAATGGG 6061 GAAAGATATC TATGGAGATA GCTTGTTTTT CTTTGGAAGA AGAGAACAAC TATATGCCAG 6121 ACATTTCTTT GTCAGAGCAG GCACCATGGG AGATGCTCTA CCAGAACCTT TTGAAGCTAC 6181 CTCAGATTAT TTTATTGGTG CTCAAAACCA ACAAGATCAG TACACTTTAG GACCTCATAT 6241 TTATGTAGGG ACCCCTAGTG GCTCTTTAGT ATCCAGTGAA TCCCAGTTGT TTAATCGACC 6301 GTATTGGTTA AACAGAGCTC AGGGTACAAA TAATGGAATT TGTTGGGATA ATCAGTTGTT 6361 TGTTACTCTT GTAGATAACA CTCATAATAC AAACTTTACA ATTTCTGTGA AGTCAGATGG 6421 TGCTAATGAC AATTATCAGT ATAAAGCTAG TGATTTTAAA CAGTACCTCA GACATATAGA 6481 GGAGTTTGAA ATGGAATTTA TATTTCAACT TTGTAAAGTT CCTCTAACTG CAGATGTTAT 6541 GGCTCATTTA AATGTAATGA ATCCTAATAT TTTGGATAAT TGGCAGTTAA ATTTTGTTCC 6601 ACCACCTCCC TCTGGAATTG AGGATCAATA TAGATTTTTG CAATCTAGAG CTACAAGATG 6661 CCCTACACAG ACCCCTGCAA CTGAAAAAGA AGATCCATAT AAAGATTTGT CTTTTTGGGT 6721 TGTTGATTTA AGTGAAAGAT TTTCCAGTGA ATTGAGCCAA TTTTCCTTAG GCAGGCGGTT 6781 TTTATATCAA AGTGGTTTAA TTAATGGTTC TCTAAAACGT AAAAGAATAA TAAGTTCTTC 6841 TCATGCACAA ACTAATACCA AACGTTCTGC CAAACGAAAA CGGTCTCTGA AATAACAATG 6901 TGAACTCTTC TGGAATGTTT TATTCTGCCA GGAAAACCTT CAACTGAGCC AAATTATTAT 6961 ATAATCGTTC TTAATCTCAA AATTGAGCTA ATTATATAAG ATTTGCAAAC GTGTATGTAT 7021 CTGTTTTTGT GAACTATAGT GAAATAAACT GCCACATACT TGCCAGTGTC CAGTCTCTCT 7081 GAGTCATTTG GTCAACATGC GTCCGCACCC CAATAATTAT TTGCATACAC AGATCAGTAG 7141 GAGAGGCGCC AAGACGGACA TATCCTCTTC AAATTTCCTT AAAATTATTG AATTTAACAA 7201 CTGTAAGCTA CAAAAGACCG TTATCGTTTC CTCTAACCTT GGGAAAAAGG TGAGTGAAAG 7261 TTTTATTGCA CCTTTTGTGA GTCAATTTGT CTGGCGGCGC TGAACGAATT TGGCTGTCAG 7321 CCTTTGCACC GGGAGTGGTG GAAAATAGTT TCT // LOCUS NC_001457 features 24-FEB-2011 UNIMARK NC_001457 features FEATURES Location/Qualifiers source 1..7353 /organism="Human papillomavirus type 4" /mol_type="genomic DNA" /db_xref="taxon:10617" gene 102..524 /locus_tag="HpV4gp1" /db_xref="GeneID:1489450" CDS 102..524 /locus_tag="HpV4gp1" /note="putative E6 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040889.1" /db_xref="GI:9626598" /db_xref="GOA:Q07854" /db_xref="InterPro:IPR001334" /db_xref="UniProtKB/Swiss-Prot:Q07854" /db_xref="GeneID:1489450" /translation="MADGRPATLDDFCRRFDISFFDLRLTCIFCSHTVDLADLALFYL KKLSLVFRGNCYYACCSECLRLSALFEQENYFQCSIKAVHLEEIAQKKIKEICIRCIC CLRLLDIVEKLDLLYSDETCYLIRGLWRGYCRNCIRKQ" gene 521..823 /locus_tag="HpV4gp2" /db_xref="GeneID:1489452" CDS 521..823 /locus_tag="HpV4gp2" /note="putative E7 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040890.1" /db_xref="GI:9626599" /db_xref="GOA:Q07857" /db_xref="InterPro:IPR000148" /db_xref="UniProtKB/Swiss-Prot:Q07857" /db_xref="GeneID:1489452" /translation="MRGAAPTVADLNLELNDLVLPANLLSEEVLQSSDDEYEITEEES VVPFRIDTCCYRCEVAVRITLYAAELGLRTLEQLLVEGKLTFCCTACARSLNRNGR" gene 813..2612 /locus_tag="HpV4gp3" /db_xref="GeneID:1489453" CDS 813..2612 /locus_tag="HpV4gp3" /note="putative E1 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040891.1" /db_xref="GI:9626600" /db_xref="GOA:Q07846" /db_xref="InterPro:IPR001177" /db_xref="InterPro:IPR014000" /db_xref="InterPro:IPR014015" /db_xref="InterPro:IPR016393" /db_xref="UniProtKB/Swiss-Prot:Q07846" /db_xref="GeneID:1489453" /translation="MADKGTDNFDLEGNNWYIVHEAECTDSIDTLDDLCDESNDDSNI SNLIDDDVVDQGNSLALYNAQINEDCDNALAHLKRKYNKSPEQAVAELSPQLQAVKIT PERHSKRRLFQDSGIFEDEAENSLTQVESESQAGPSSQDGGGDINLLLLQSSNRRATM LAKFKEWYGVSYNEITRIYKSDKSCSDNWVIVIFRAAVEVLESSKIVLKQHCTYIQVK IFGFSALYLVQFKSAKSRETVQKLMCSILNIQEYQMLCDPPKLRSVPTALYFYKHAML TESSVFGQTPDWIAKQTLVSHQAATTAETFELSRMVQWAYDNNYVDECDIAYHYAMYA EEDANAAAYLKSNNQVKHVRDCSTMVRMYKRYEMRDMSMSEWIYKCCDECSEEGDWKP ISQFLKYQGVNILSFLIVLKSFLKGIPKKNCIVIHGPPDTGKSLFCYSFIKFLKGKVV SYVNRSSHFWLQPLMDCKVGFMDDATYVCWTYIDQNLRNALDGNPMCIDAKHRAPQQL KLPPMLITSNIDIKQEQSLMYLHSRIQCFNFPNKMPILDDGSPMYTFTDGTWKSFFQK LGRQLELTDPEEENNGVPSRTFRCTSRSNSDSY" gene 2554..3762 /locus_tag="HpV4gp4" /db_xref="GeneID:1489449" CDS 2554..3762 /locus_tag="HpV4gp4" /note="putative E2 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040892.1" /db_xref="GI:9626601" /db_xref="GOA:Q07849" /db_xref="InterPro:IPR000427" /db_xref="InterPro:IPR001866" /db_xref="InterPro:IPR009021" /db_xref="InterPro:IPR012677" /db_xref="UniProtKB/Swiss-Prot:Q07849" /db_xref="GeneID:1489449" /translation="MESLVARFDALQEAILTHIESQESTLESQIQYWENIRKENAIMH YARKQGLTKLGLQPLPTLAVTEYNAKQAIQIHLTLQSLLKSPFASERWTLTDVSAELI NTSPQNCLKKGGYDVAVWFDNDRQNAMLYTNWDFLYYQDMNEQWHKVKGEVDYDGLYF TDHTGERAYFTLFSSDAQRFSRTGLWTVHFKTQVISSPIVSSTYSSSFDTEEQQLPGP STSYSEVTEQASPTRRRKPRKSDATSTTSPETEGVRLRRRRREGKSGPGSGETPRKRR RGGGRGGGETELGSAPSPAEVGSRHRQVERQGLSRLGLLQAEARDPPMILLKGTANSL KCWRYRKVNSNCCNFLFMSTVWNWVGDCSHNHSRMLIAFDSTDQRDAFVKHNLFPKLC TYTYGSLNSL" gene 2981..3526 /locus_tag="HpV4gp5" /db_xref="GeneID:1489454" CDS 2981..3526 /locus_tag="HpV4gp5" /note="putative E4 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040893.1" /db_xref="GI:9626602" /db_xref="UniProtKB/Swiss-Prot:Q07852" /db_xref="GeneID:1489454" /translation="MNSGTKLKVKWIMMAYTLQTIREKELILHYLALMLKDLAELDCG LCILKPKLFPPLLLALHTPPPSTLRNNSYPGPPPATPKLPSRRALLEGGNRGNPTRPP PRPLKPREYDYDEDDEKENQGPGQEKPPAKEEEEEEEEEERPNWDLHHLLQKWGADID KLKDKVCRDLDSYKQKLGIRL" gene 3764..5329 /locus_tag="HpV4gp6" /db_xref="GeneID:1489451" CDS 3764..5329 /locus_tag="HpV4gp6" /note="putative L2 ORF" /codon_start=1 /product="hypothetical protein" /protein_id="NP_040894.1" /db_xref="GI:9626603" /db_xref="GOA:Q07862" /db_xref="InterPro:IPR000784" /db_xref="UniProtKB/Swiss-Prot:Q07862" /db_xref="GeneID:1489451" /translation="MQSLSRRKRDSVPNLYAKCQLSGNCLPDVKNKVEADTLADRLLR WLGSVIYLGGLGIGTGRGSGGSTGYNPIGAPSRVTPSGTLVRPTVPVESLGPSEIIPI DAIDPTTSSVVPLEDLTIPDVTVDSGDTRGIGETTLQPAQVDISTSHDPISDVTGASS HPTIISGEDNAIAVLDVSPIEPPTKRIALATRGASATPHVSVISGTTEFGQSSDLNVF VNATFSGDSIGYTEEIPLEPLNPFQEFEIESPPKTSTPRDVLNRAIGRARDLYNRRVQ QIPTRNPALLTQPSRAIVFGFENPAFDADITQTFERDLEQVAAAPDADFADIVTIGRP RFSETDAGQIRVSRLGRRGTIKTRSGVQIGQAVHFYYDLSTIDTADAIELSTLGQHSG EQSIVDAMIESSLIDPFEMPDPTFTEEQQLLDPLTEDFSQSHLVLTSSRRGTSFTIPT IPPGLGLRIYVDDVGSDLFVSYPESRVIPAGGLPTEPFVPLEPALLSDIFSTDFVYRP SLYRKKRKRLEMF" gene 5345..6895 /locus_tag="HpV4gp7" /db_xref="GeneID:1489455" CDS 5345..6895 /locus_tag="HpV4gp7" /note="putative L1 ORF" /codon_start=1 /product="L1" /protein_id="NP_040895.1" /db_xref="GI:9626604" /db_xref="GOA:Q07860" /db_xref="InterPro:IPR002210" /db_xref="InterPro:IPR011222" /db_xref="UniProtKB/Swiss-Prot:Q07860" /db_xref="GeneID:1489455" /translation="MSSWLSTTGKVYLPPAQPVARVLETDEYITGTSLYFHAGTERLL TVGHPYFPVKDVQEPHKVLVPKVSGSQFRVFRFNLPDPNRFALIDNGFYDSDHERLVW KLRGIEIGRGGPLGIGTTGHPLYNKFGDTENPNGYKKQSDDNRQDVSLDPKQTQMFII GCTPAIGEHWDKAEPCPSPAPQQGDCPPIELVNSYIQDGDMCDIGFGAFNFKALQADK SSAPLDVIATVCKWPDFLKMGKDIYGDSLFFFGRREQLYARHFFVRAGTMGDALPEPF EATSDYFIGAQNQQDQYTLGPHIYVGTPSGSLVSSESQLFNRPYWLNRAQGTNNGICW DNQLFVTLVDNTHNTNFTISVKSDGANDNYQYKASDFKQYLRHIEEFEMEFIFQLCKV PLTADVMAHLNVMNPNILDNWQLNFVPPPPSGIEDQYRFLQSRATRCPTQTPATEKED PYKDLSFWVVDLSERFSSELSQFSLGRRFLYQSGLINGSLKRKRIISSSHAQTNTKRS AKRKRSLK" misc_feature 5475..5495 /score=75 /ugene_group="Result 1" /ugene_name="FAP59" misc_feature complement(5936..5958) /score=70 /ugene_group="Result 1" /ugene_name="FAP64" //