发布时间:2019-08-02 22:34 原文链接: pEGFPC2质粒图谱及信息

多克隆位点区(MCS)

Restriction Map and Multiple Cloning Site of pEGFP-C2. (Unique restriction sites are in color or bold.) Note that the Eag I site is not unique.) The Xba I site cannot be used for fusions since it contains an in-frame stop codon. The Xba I and Bcl I sites (*) are methylated in the DNA provided by CLONTECH. If you wish to digest the vector with these enzymes, you will need to transform the vector into a dam– host and make fresh DNA.

Note: The vector sequence file has been compiled from information in the sequence database, published literature, and other sources, together with partial sequences obtained by CLONTECH. This vector has not been completely sequenced.

Description

pEGFP-C2 encodes a red-shifted variant of wild-type GFP (1-3) which has been optimized for brighter fluorescence and higher expression in mammalian cells. (Excitation maximum = 488 nm; emission maximum = 507 nm.) pEGFP-C2 encodes the GFPmut1 variant (4) which contains the double-amino-acid substitution of Phe-64 to Leu and Ser-65 to Thr. The coding sequence of the EGFP gene contains more than 190 silent base changes which correspond to human codon-usage preferences (5). Sequences flanking EGFP have been converted to a Kozak consensus translation initiation site (6) to further increase the translation efficiency in eukaryotic cells. The MCS in pEGFP-C2 is between the EGFP coding sequences and the SV40 poly A. Genes cloned into the MCS will be expressed as fusions to the C-terminus of EGFP if they are in the same reading frame as EGFP and there are no intervening stop codons. SV40 polyadenylation signals downstream of the EGFP gene direct proper processing of the 3' end of the EGFP mRNA. The vector backbone also contains an SV40 origin for replication in mammalian cells expressing the SV40 T-antigen. A neomycin-resistance cassette (neor), consisting of the SV40 early promoter, the neomycin/kanamycin resistance gene of Tn5, and polyadenylation signals from the Herpes simplex thymidine kinase gene, allows stably transfected eukaryotic cells to be selected using G418. A bacterial promoter upstream of this cassette (Pamp) expresses kanamycin resistance in E. coli. The pGFP-C1 backbone also provides a pUC19 origin of replication for propagation in E. coli and an f1 origin for single-stranded DNA production.

Fusions to the C-terminus of EGFP retain the fluorescent properties of the native protein allowing the localization of the fusion protein in vivo. The target gene should be cloned into pEGFP-C2 so that it is in frame with the EGFP coding sequences, with no intervening in-frame stop codons. The recombinant EGFP vector can be transfected into mammalian cells using any standard transfection method. If required, stable transformants can be selected using G418 (7). pEGFP-C2 can also be used simply to express EGFP in a cell line of interest (e.g., as a transfection marker).

Location of Features

  • Human cytomegalovirus (CMV) immediate early promoter: 1-589

    • Enhancer region: 59-465 TATA box: 554-560


    • Transcription start point: 583


    • C

    • ->
    • G mutation to remove

    •  Sac 
    • I site: 569

  • Enhanced green fluorescent protein gene
    Kozak consensus translation initiation site: 606-616
    Start codon (ATG): 613-615; Stop codon: 1408-1410
    Insertion of Val at position 2: 616-618
    GFPmut1 chromophore mutations (Phe-64 to Leu; Ser-65 to Thr): 805-810
    His-231 to Leu mutation (AÆT): 1307
    Last amino acid in wild-type GFP: 1327-1329

  • MCS: 1343-1421

  • SV40 early mRNA polyadenylation signal

    • Polyadenylation signals: 1554-1559 & 1583-1588 mRNA 3' ends: 1592 & 1604

  • f1 single-strand DNA origin: 1651-2106

    • (Packages the noncoding strand of EGFP.)

  • Ampicillin resistance (beta-lactamase) promoter

    • -35 region: 2168-2173; -10 region: 2191-2196 Transcription start point: 2203

  • SV40 origin of replication: 2447-2582

  • SV40 early promoter

    • Enhancer (72-bp tandem repeats): 2280-2351 & 2352-2423


    • 21-bp repeats: 2427-2447, 2448-2468, & 2470-2490


    • Early promoter element: 2503-2509


    • Major transcription start points: 2499, 2537, 2543 & 2548

  • Kanamycin/neomycin resistance gene

    • Neomycin phosphotransferase coding sequences:


    • Start codon (ATG): 2631-2633; stop codon: 3423-3425


    • G

    • ->
    • A mutation to remove

    •  Pst 
    • I site: 2813


    • C

    • ->
    • A (Arg to Ser) mutation to remove

    •  Bss
    • H II site: 3159

  • Herpes simplex virus (HSV) thymidine kinase (TK) polyadenylation signal

    • Polyadenylation signals: 3661-3666 & 3674-3679

  • pUC plasmid replication origin: 4010-4653

Primer Locations

  • EGFP-N Sequencing Primer (#6479-1): 679-658

  • EGFP-C Sequencing Primer (#6478-1): 1266-1287

  • 全序列

    TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG
    CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC CCCGCCCATT
    GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC ATTGACGTCA
    ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC
    AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA
    CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC
    CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG ACTCACGGGG
    ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAAATCAACG
    GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT
    ACGGTGGGAG GTCTATATAA GCAGAGCTGG TTTAGTGAAC CGTCAGATCC GCTAGCGCTA
    CCGGTCGCCA CCATGGTGAG CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG
    GTCGAGCTGG ACGGCGACGT AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC
    GATGCCACCT ACGGCAAGCT GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG
    CCCTGGCCCA CCCTCGTGAC CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC
    GACCACATGA AGCAGCACGA CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG
    CGCACCATCT TCTTCAAGGA CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG
    GGCGACACCC TGGTGAACCG CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC
    ATCCTGGGGC ACAAGCTGGA GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC
    AAGCAGAAGA ACGGCATCAA GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC
    GTGCAGCTCG CCGACCACTA CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG
    CCCGACAACC ACTACCTGAG CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC
    GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG
    CTGTACAAGT CCGGCCGGAC TCAGATCTCG AGCTCAAGCT TCGAATTCTG CAGTCGACGG
    TACCGCGGGC CCGGGATCCA CCGGATCTAG ATAACTGATC ATAATCAGCC ATACCACATT
    TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA
    AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG
    CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT
    GTCCAAACTC ATCAATGTAT CTTAACGCGT AAATTGTAAG CGTTAATATT TTGTTAAAAT
    TCGCGTTAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA ATAGGCCGAA ATCGGCAAAA
    TCCCTTATAA ATCAAAAGAA TAGACCGAGA TAGGGTTGAG TGTTGTTCCA GTTTGGAACA
    AGAGTCCACT ATTAAAGAAC GTGGACTCCA ACGTCAAAGG GCGAAAAACC GTCTATCAGG
    GCGATGGCCC ACTACGTGAA CCATCACCCT AATCAAGTTT TTTGGGGTCG AGGTGCCGTA
    AAGCACTAAA TCGGAACCCT AAAGGGAGCC CCCGATTTAG AGCTTGACGG GGAAAGCCGG
    CGAACGTGGC GAGAAAGGAA GGGAAGAAAG CGAAAGGAGC GGGCGCTAGG GCGCTGGCAA
    GTGTAGCGGT CACGCTGCGC GTAACCACCA CACCCGCCGC GCTTAATGCG CCGCTACAGG
    GCGCGTCAGG TGGCACTTTT CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT
    AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT
    ATTGAAAAAG GAAGAGTCCT GAGGCGGAAA GAACCAGCTG TGGAATGTGT GTCAGTTAGG
    GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA
    GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT
    GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT CCGCCCATCC CGCCCCTAAC
    TCCGCCCAGT TCCGCCCATT CTCCGCCCCA TGGCTGACTA ATTTTTTTTA TTTATGCAGA
    GGCCGAGGCC GCCTCGGCCT CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG
    CCTAGGCTTT TGCAAAGATC GATCAAGAGA CAGGATGAGG ATCGTTTCGC ATGATTGAAC
    AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA GAGGCTATTC GGCTATGACT
    GGGCACAACA GACAATCGGC TGCTCTGATG CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC
    GCCCGGTTCT TTTTGTCAAG ACCGACCTGT CCGGTGCCCT GAATGAACTG CAAGACGAGG
    CAGCGCGGCT ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG
    TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG GATCTCCTGT
    CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC TGATGCAATG CGGCGGCTGC
    ATACGCTTGA TCCGGCTACC TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG
    CACGTACTCG GATGGAAGCC GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG
    GGCTCGCGCC AGCCGAACTG TTCGCCAGGC TCAAGGCGAG CATGCCCGAC GGCGAGGATC
    TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT GGCCGCTTTT
    CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG CTATCAGGAC ATAGCGTTGG
    CTACCCGTGA TATTGCTGAA GAGCTTGGCG GCGAATGGGC TGACCGCTTC CTCGTGCTTT
    ACGGTATCGC CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT
    TCTGAGCGGG ACTCTGGGGT TCGAAATGAC CGACCAAGCG ACGCCCAACC TGCCATCACG
    AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC TTCGGAATCG TTTTCCGGGA
    CGCCGGCTGG ATGATCCTCC AGCGCGGGGA TCTCATGCTG GAGTTCTTCG CCCACCCTAG
    GGGGAGGCTA ACTGAAACAC GGAAGGAGAC AATACCGGAA GGAACCCGCG CTATGACGGC
    AATAAAAAGA CAGAATAAAA CGCACGGTGT TGGGTCGTTT GTTCATAAAC GCGGGGTTCG
    GTCCCAGGGC TGGCACTCTG TCGATACCCC ACCGAGACCC CATTGGGGCC AATACGCCCG
    CGTTTCTTCC TTTTCCCCAC CCCACCCCCC AAGTTCGGGT GAAGGCCCAG GGCTCGCAGC
    CAACGTCGGG GCGGCAGGCC CTGCCATAGC CTCAGGTTAC TCATATATAC TTTAGATTGA
    TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT
    GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT
    CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA
    ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA
    GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT
    AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT
    ACCAGTGGCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA
    GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT
    GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC
    GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA
    GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG
    CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA
    AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT
    GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCA TGCAT