1LOCUS NC_001363 5833 bp ss-RNA linear VRL 27-AUG-2002 2DEFINITION Murine sarcoma virus, complete genome. 3ACCESSION NC_001363 4VERSION NC_001363.1 GI:9626100 5KEYWORDS oncogene; polyprotein; unidentified reading frame. 6SOURCE Murine sarcoma virus. 7 ORGANISM Murine sarcoma virus 8 Viruses; Retroid viruses; Retroviridae; Mammalian type C 9 retroviruses; 1-Mammalian type C virus group. 10REFERENCE 1 (bases 1 to 5833) 11 AUTHORS Van Beveren,C., van Straaten,F., Galleshaw,J.A. and Verma,I.M. 12 TITLE Nucleotide sequence of the genome of a murine sarcoma virus 13 JOURNAL Cell 27 (1 Pt 2), 97-108 (1981) 14 MEDLINE 82115347 15 PUBMED 6173134 16COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The 17 reference sequence was derived from V01185. 18FEATURES Location/Qualifiers 19 source 1..5833 20 /organism="Murine sarcoma virus" 21 /db_xref="taxon:11802" 22 misc_feature 2..590 23 /note="5' terminal repeat" 24 CDS 1042..2658 25 /codon_start=1 26 /product="gag polyprotein" 27 /protein_id="NP_040335.1" 28 /db_xref="GI:9626101" 29 /db_xref="SWISS-PROT:P03334" 30 /translation="MGQTVTTPLSLTLDHWKDVERLAHNQSVDVKKRRWVTFCSAEWP 31 TFNVGWPRDGTFNRDLITQVKIKVFSPGPHGHPDQVPYIVTWEALAFDPPPWVKPFVH 32 PKPPPPLLPSAPSLPLEPPLSTPPQSSLYPALTPSLGAKPKPQVLSDSGGPLIDLLTE 33 DPPPYRDPRPPPSDRDGDSGEATPAGEAPDPSPMASRLRGRREPPVADSTTSQAFPLR 34 TGGNGQLQYWPFSSSDLYNWKNNNPSFSEDPGKLTALIESVLITHQPTWDDCQQLLGT 35 LLTGEEKQRVLLEARKAVRGDDGRPTQLPNEVDAAFPLERPDWEYTTQAGRNHLVHYR 36 QLLIAGLQNAGRSPTNLAKVKGITQGPNESPSAFLERLKEAYRRYTPYDPEDPGQETN 37 VSMSFIWQSAPDIGRKLERLEDLRNKTLGDLVREAERIFNKRETPEEREERIRREREE 38 KEERRRTEDEQKEKERDRRRHREMSRLLATVVSGQRQDRQEGERRRSQLDCDQCTYCE 39 EQGHWAKDCPKRPRGPRGPRPQTSLLTLDD" 40 CDS join(2970..3413,3412..3873) 41 /note="Predicted by GeneMark; artificial frameshift" 42 /codon_start=1 43 /product="pol polyprotein fragment" 44 /protein_id="NP_597742.2" 45 /db_xref="GI:22507278" 46 /translation="MVAAIAVLTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNA 47 RMTHYQALLLDTDRVQFRPVVALNPATLLPLPEKGLQHNCLDILAEAHGTRPDLTDQP 48 LPDADHTWYTDGSSLLQEGQRKAGAAVTTETEKPSQPRKKTAKVVNIFPRFGMLQVLG 49 TDNGPAFVSKVSQTVADLLGIDWKLHCAYRPQSSGQVERINRTIKETLTKLTLATGSR 50 DWVLLLPLALYRARNTPGPHGLTPYEILCGAPPPLVNFPDPDMTRVTNSPSLQAHIQA 51 LYLVQHEVWRPLAAAYQEQLDHPLD" 52 CDS 3875..4999 53 /codon_start=1 54 /product="unknown reading frame (gene x)" 55 /protein_id="NP_040336.1" 56 /db_xref="GI:9626102" 57 /db_xref="SPTREMBL:Q85651" 58 /translation="MAHSTPCSQTSLAVPNHFSLVSHVTVPSEGVMPSPLSLCRYLPR 59 ELSPSVDSRSCSIPLVAPRKAGKLFLGTTPPRAPGLPRRLAWFSIDWEQVCLMHRLGS 60 GGFGSVYKATYHGVPVAIKQVNKCTEDLRASQRSFWAELNIAGLRHDNIVRVVAASTR 61 TPEDSNSLGTIIMEFGGNVTLHQVIYDATRSPEPLSCRKQLSLGKCLKYSLDVVNGLL 62 FLHSQSILHLDLKPANILISEQDVCKISDFGCSQKLQVLRGRQASPPHIGGTYTHQAP 63 EILKGEIATPKADIYSFGITLWQMTTREVPYSGEPQYVQYAVVAYNLRPSLAGAVFTA 64 SLTGKALQNIIQSCWEARGLQRPSAELLQRDLKAFRGTLG" 65 CDS 5048..5203 66 /note="Predicted by GeneMark" 67 /codon_start=1 68 /product="spike protein fragment" 69 /protein_id="NP_597744.1" 70 /db_xref="GI:19263358" 71 /translation="MGPLIVLLMILLFGPCILNRLVQFVKDRISVVQALALTQQYHQL 72 KPIEYEP" 73 misc_feature 5245..5833 74 /note="3' terminal repeat" 75BASE COUNT 1427 a 1673 c 1484 g 1249 t 76ORIGIN 77 1 aaatgaaaga ccccacccgt aggtggcaag ctagcttaag taacgccact ttgcaaggca 78 61 tggaaaaata cataactgag aatagaaaag ttcagatcaa ggtcaggaac aaagaaacag 79 121 ctgaatacca aacaggatat ctgtggtaag cggttcctgc cccggctcag ggccaagaac 80 181 agatgagaca gctgagtgat gggccaaaca ggatatctgt ggtaagcagt tcctgccccg 81 241 gctcggggcc aagaacagat ggtccccaga tgcggtccag ccctcagcag tttctagtga 82 301 atcatcagat gtttccaggg tgccccaagg acctgaaaat gaccctgtac cttatttgaa 83 361 ctaaccaatc agttcgcttc tcgcttctgt tcgcgcgctt ccgctctccg agctcaataa 84 421 aagagcccac aacccctcac tcggcgcgcc agtcttccga tagactgcgt cgcccgggta 85 481 cccgtattcc caataaagcc tcttgctgtt tgcatccgaa tcgtggtctc gctgttcctt 86 541 gggagggtct cctctgagtg attgactacc cacgacgggg gtctttcatt tgggggctcg 87 601 tccgggattt ggagacccct gcccagggac caccgaccca ccaccgggag gtaagctggc 88 661 cagcaactta tctgtgtctg tccgattgtc tagtgtctat gtttgatgtt atgcgcctgc 89 721 gtctgtacta gttagctaac atgctctgta tctggcggac ccgtggtgga actgacgagt 90 781 tctgaacacc cggccgcaac cctgggagac gtcccaggga ctttgggggc cgtttttgtg 91 841 gcccgacctg aggaagggag tcgatgtgga atccgacccc gtcaggatat gtggttctgg 92 901 taggagacga gaacctaaaa cagttcccgc ctccgtctga atttttgctt tcggtttgga 93 961 accgaagccg cgcgtcttgt ctgctgcagc atcgttctgt gttgtctctg tctgactgtg 94 1021 tttctgtatt tgtctgaaaa tatgggccag actgttacca ctcccttaag tttgacctta 95 1081 gatcactgga aagatgtcga gcggctcgct cacaaccagt cggtagatgt caagaagaga 96 1141 cgttgggtta ccttctgctc tgcagaatgg ccaaccttta acgtcggatg gccgcgagac 97 1201 ggcaccttta accgagacct catcacccag gttaagatca aggtcttttc acctggcccg 98 1261 catggacacc cagaccaggt cccctacatc gtgacctggg aagccttggc ttttgacccc 99 1321 cctccctggg tcaagccctt tgtacaccct aagcctccgc ctcctcttct tccatccgcg 100 1381 ccgtctctcc cccttgaacc tcctctttcg accccgcctc aatcctccct ttatccagcc 101 1441 ctcactcctt ctttgggcgc caaacctaaa cctcaagttc tttctgacag tggggggccg 102 1501 ctcatcgacc tacttacaga agaccccccg ccttataggg acccaagacc acccccttcc 103 1561 gacagggacg gagatagtgg agaagcgacc cctgcgggag aggcaccgga cccctcccca 104 1621 atggcatctc gcctgcgtgg gagacgggag ccccctgtgg ccgactccac tacctcgcag 105 1681 gcattccccc tccgcacagg aggaaacgga cagcttcaat actggccgtt ctcctcttct 106 1741 gacctttaca actggaaaaa taataaccct tctttttctg aagatccagg taaactgaca 107 1801 gctctgatcg agtctgtcct catcacccat cagcccacct gggacgactg tcagcagctg 108 1861 ttggggactc tgctgaccgg ggaagaaaaa caacgggtgc tcttagaggc tagaaaggcg 109 1921 gtgcggggcg atgatgggcg ccccactcaa ctgcccaatg aagtcgatgc cgcttttccc 110 1981 ctcgagcgcc cagactggga gtacaccacc caggcaggta ggaaccacct agtccactat 111 2041 cgccagttgc tcatagcggg tctccaaaac gcgggcagaa gccccaccaa tttggccaag 112 2101 gtaaaaggaa taacacaagg gcccaatgag tctccctcgg ccttcctaga gagacttaag 113 2161 gaagcctatc gcaggtacac tccttatgac cctgaggacc cagggcaaga aactaatgtg 114 2221 tctatgtctt tcatttggca gtctgcccca gacattggga gaaagttaga gaggttagaa 115 2281 gatttgagaa acaagacgct tggagatttg gttagagagg cagaaaggat ctttaataaa 116 2341 cgagaaaccc cggaagaaag agaggaacgt atcaggagag aaagagagga aaaggaagaa 117 2401 cgccgtagga cagaggatga gcagaaagag aaagaaagag atcgtaggag acatagagag 118 2461 atgagcaggc tattggccac tgtcgttagt ggacagagac aggatagaca ggaaggagaa 119 2521 cgaaggaggt cccaactcga ctgcgaccag tgtacctact gcgaagaaca agggcactgg 120 2581 gctaaagatt gtcccaagag accacgagga cctcggggac caagacccca gacctccctc 121 2641 ctgaccctag atgactaggg aggtcagggt caggagcccc cccctgaacc caggataacc 122 2701 ctcaaagtcg gggggcaacc cgtcaccttc ctggtagata ctggggccca gaccaacaaa 123 2761 aggcctatca agaaatcaag caagttcttc taactgcccc agccctgggg ttgccagatt 124 2821 tgactaagcc ctttgaactc tttgtcgacg agaagcaggg ctacgccaaa ggtgtcctaa 125 2881 cgcaaaaact gggaccttgg cgtcggccgg tggcctacct gtccaaacag ctagacccag 126 2941 tagcagctgg gtgaccccct tgcctacgga tggtagcagc cattgccgta ctgacaaagg 127 3001 atgcaggcaa gctaaccatg ggacagccac tagtcattct ggccccccat gcagtagagg 128 3061 cactagtcaa acaacccccc gaccgctggc tttccaacgc ccggatgact cactatcagg 129 3121 ccttgctttt ggacacggac cgggtccagt tcagaccggt ggtagccctg aacccggcta 130 3181 cgctgctccc actgcctgag aaagggctgc aacacaactg ccttgatatc ctggccgaag 131 3241 ctcatggaac ccgacccgac ctaacggacc agccgctccc agacgccgac cacacctggt 132 3301 acacggatgg aagcagtctt ttacaagagg gacagcgtaa ggcgggagct gcggtgacca 133 3361 ccgagaccga gaagccttcc caaccaagaa aaaaaaccgc caaggtcgta aatcttcccc 134 3421 aggttcggca tgcttcaggt attgggaact gacaatgggc ctgccttcgt ctccaaggtg 135 3481 agtcagacag tggccgatct gttggggatt gattggaaat tacattgtgc atacagaccc 136 3541 caaagctcag gccaggtaga aagaataaat agaaccatca aggagacttt aactaaatta 137 3601 acgcttgcaa ctggctctag ggactgggtg ctcctactcc ccttagccct gtatcgagcc 138 3661 cgcaacacgc cgggccccca tggcctcacc ccatatgaga tcttatgtgg ggcacccccg 139 3721 ccccttgtaa acttccctga ccctgacatg acaagagtta ctaacagccc ctctctccaa 140 3781 gctcacatac aggctctcta cttagtccag cacgaagtct ggagacctct ggcggcagcc 141 3841 taccaagaac aactggacca tcctctagac tgacatggcg cattcaacgc catgctccca 142 3901 aacttccctg gctgttccta atcatttctc cctagtgtct catgtgactg tcccatctga 143 3961 gggtgtaatg ccttcgcctc taagcctgtg tcgctacctc cctcgtgagc tgtcgccatc 144 4021 ggtagactcg cggtcctgca gcattccttt ggtggccccg aggaaggcag ggaagctctt 145 4081 cctggggacc actcctcctc gggctcccgg actgccacgc cggctggcct ggttctccat 146 4141 agactgggaa caggtatgtc tgatgcatag gctgggctct ggagggtttg gctcggtgta 147 4201 caaagccact taccacggtg ttcctgtggc catcaagcaa gtaaacaagt gcaccgagga 148 4261 cctacgtgca tcccagcgga gtttctgggc tgaactgaac attgcaggac tacgccacga 149 4321 caacatagtt cgggttgtgg ctgccagcac gcgcacgccc gaagactcca acagcctagg 150 4381 taccataatc atggagtttg ggggcaacgt gactctacac caagtcatct acgatgccac 151 4441 ccgctcaccg gagcctctca gctgcagaaa acaactaagt ttggggaagt gcctcaagta 152 4501 ttccctagat gttgttaacg gcctgctttt tctccactca caaagcattt tgcacttgga 153 4561 cctgaagcca gcgaacattt tgattagtga gcaggacgtt tgtaagatca gtgacttcgg 154 4621 ctgctcccag aagctgcagg ttctgcgggg ccggcaggcg tcccctcccc acataggggg 155 4681 cacgtacacg caccaagctc cggagatcct aaaaggagag attgccacgc ccaaagctga 156 4741 catctactct tttggaatca ccctgtggca gatgactacc agagaggtgc cttactccgg 157 4801 cgaacctcag tacgtgcagt atgcagtggt agcctacaat ctgcgtccct cactggcagg 158 4861 agcggtgttc accgcctccc tgactggaaa ggcactgcag aacatcatcc agagctgctg 159 4921 ggaggcccgc ggcctgcaga ggccgagtgc agaactgctc caaagggacc tcaaggcttt 160 4981 ccgagggaca ctaggctgac tccatcgagc cagtgtagag ataagctttt gtttctgttt 161 5041 attttttatg ggacccctta ttgtactcct aatgattttg ctcttcggac cctgcattct 162 5101 taatcgatta gtccaatttg ttaaagacag gatatcagtg gtccaggctc tagctttgac 163 5161 tcaacaatat caccagctga agcctataga gtacgagcca tagttaaaat aaaagatttt 164 5221 atttagtctc cagaaaaagg ggggaatgaa agaccccacc cgtaggtggc aagctagctt 165 5281 aagtaacgcc actttgcaag gcatggaaaa atacataact gagaatagaa aagttcagat 166 5341 caaggtcagg aacaaagaaa cagctgaata ccaaacagga tatctgtggt aagcggttcc 167 5401 tgccccggct cagggccaag aacagatgag acagctgagt gatgggccaa acaggatatc 168 5461 tgtggtaagc agttcctgcc ccggctcggg gccaagaaca gatggtcccc agatgcggtc 169 5521 cagccctcag cagtttctag tgaatcatca gatgtttcca gggtgcccca aggacctgaa 170 5581 aatgaccctg taccttattt gaactaacca atcagttcgc ttctcgcttc tgttcgcgcg 171 5641 cttccgctct ccgagctcaa taaaagagcc cacaacccct cactcggcgc gccagtcttc 172 5701 cgatagactg cgtcgcccgg gtacccgtat tcccaataaa gcctcttgct gtttgcatcc 173 5761 gaatcgtggt ctcgctgttc cttgggaggg tctcctctga gtgattgact acccacgacg 174 5821 ggggtctttc att 175// 176