Folding and maturation, or how to evolve your own GFP (part 1)

GFP is one of the most widely used proteins in research. Its usefulness has advanced our understanding of biology in huge leaps forward. One of the greatest advantages of GFP is that the chromophore is formed in an autocatalytyic manner, no need for an enzyme or cofactor assistance. All that is required is atmospheric oxygen.

GFP has sort of a barrel shape, with the chromophore inside. The chromophore is formed from residues serine 65 (S65), tyrosine 66 (Y66) and glycine 67 (G67). The process of creating the structure of the protein is called folding, whereas the process of creating the chromophore is termed maturation. Maturation occurs after the protein folds to near native conformation.

However, the natural, wild type, GFP is not the best tool. It is rather dim and not very stable at 37°C  (Since the jellyfish lives at low temperatures). Also, it matures rather slowly. Early on, a mutation was introduced, S65T, that increased the brightness of the GFP and also shifted its excitation peak (from a major peak at 395 and a minor at 475, to a single major peak at 488).  In vivo, it was shown that maturation of GFP-S65T takes 27 minutes (this was measured by expressing GFP in E. coli bacteria under anaerobic conditions, then supplementing the bacteria with O2. Therefore, they only measured the last few steps of the maturation process). EGFP, the most common variant used in research labs, contains a second mutation, F64L. This mutation stabilizes the protein at 37°C and was suggested to increase the maturation rate.

Over the years, GFP has been a subject for further “directed evolution” to achieve required traits like different colors, brightness, pH stability and photostability, oligomerization state, as well as folding and maturation.

Why do we care about folding and maturation?

Well, GFP is often fused to other proteins. In some cases, that protein is misfolding, or folding slowly. This can affect the folding of GFP. This is particularly true when expressing fused proteins in E. coli, where in many cases the ectopically expressed protein is aggregating in inclusion bodies. Since GFP itself has a slight tendency to dimerize, any aggregation of the fusion protein may amplify the dimerization of the GFP. Thus, it would be advantageous to have a better folding GFP, with a lower tendency to dimerize.

As to maturation, having a fast-maturing GFP would be very beneficial for studies of translation, or translation-coupled localization.  Because, if the GFP takes 20-30 minutes to mature after it is being translated, we cannot really say what happened and where was the GFP or any fused protein, in that time.

Our story begins in 1996, when the lab of Willem Stemmer tried to improve GFP by a technique called DNA shuffling. Their goal was to improve whole cell fluorescent in E. coli cells expressing GFP. After three cycles of DNA shuffling, and selection based on fluorescence intensity with UV excitation, they isolated a clone (named simply ‘GFP cycle 3’) which showed 45-fold increased fluorescence compared to the then commercial Clontech pGFP. However, the spectral characteristics were unchanged, and the maturation rate (T1/2 = 95min at 37°C in whole cells) also seemed unchanged).  Mutations found in GFP cycle 3 compared to their starting GFP showed that three hydrophobic amino acids were replaced by hydrophilic amino acids. Thus, the improved fluorescence is probably due to reduction in protein aggregation by the hydrophobic surfaces, and increased solubility of the GFP.

CHO cells expressing wt GFP (A) or GFP cycle 3 (B). Source: Crameri et al. (1996) Nat. Biotech. 14:315.

Ten years later, the superfolder GFP (sfGFP) was engineered by the lab of Geoffrey Waldo for the particular purpose of creating a protein with less tendency to misfold when fused to poorly folded proteins in E. coli.

How was this protein engineered?

They started with a folding reporter GFP  (frGFP) which contained the cycle 3 & EGFP mutations.

They fused frGFP to a poorly folding protein, H subunit ferritin, which is insoluble when expressed in E. coli. After several rounds of mutagenesis and screening for fluorescence, they obtained the highly fluorescent sfGFP which bears six new mutations.  Ferritin-sfGFP was found to be 50-fold more fluorescent than the starting fusion protein. When expressed alone, cells expressing sfGFP were twice as bright as with frGFP. Importantly, the excitation & emission spectra , QY and EC were similar in both proteins.

Measuring the folding rate was done by first denaturing the proteins with urea. Then washing the urea and measuring the time it takes the GFP variant to regain fluorescence.  This method actually measures folding and maturation and this should be taken into account. In any case, though both proteins showed 95% yield within 4 minutes, sfGFP showed a 3.5-fold faster initial rate. sfGFP also tolerated higher urea levels, and showed better fluorescence with a bunch of different fused proteins with different characteristics (expression level, solubility etc…).

fusion of frGFP or sfGFP to 18 differnt proteins shows sfGFP much brighter in all cases. 0.125s, 1s: exposure time. below the images of the bacteria are Western blot images of the fusion proteins. Source: Pedelacq et al. (2006). Nat. Biotech. 24(1):79

They found that the most contributing mutation to the folding  kinetics and stability is the S30R mutation. The change from an oxygen group (that forms a hydrogen bond with E17) to a positively charged side chain mediates an electrostatically charged network of the β-barrel which seems to create a global stability to the structure. A second mutation, Y39N, slightly changes the angle of the backbone, and allows a hydrogen bond with D36, further stabilizing the structure. The other four mutations slightly contribute to the stability.

Although sfGFP seems to fold fast (though no comparison to EGFP was made in this paper), the assays here did not really differentiate folding from maturation.

A year later, the group of David Liu used sfGFP  in an effort to create a new GFP protein that is highly soluble. In order to do that, they replaced multiple neutral amino acids on the outer side of the protein with either positively or negatively charged amino acids to produce super-charged GFP variants GFP(+36) and GFP(-30) [compared to the net charge of sfGFP which is (-7) or GFP which is (-8)]. This scGFP was highly soluble, and also highly resistant to denaturing, even by boiling to protein at 100°C. The spectral characteristics of the scGFPs remained similar to those of GFP and sfGFP indicating that the structure of the outside “barrel” does not affect the chromophore features inside the barrel.

In 2008, Fisher & DeLisa produced yet another variant of GFP called superfast GFP. By exploiting the cell’s protein secretion quality control mechanisms, they screened for new super folding variants. The secretion mechanism exports unfolded proteins to the periplasm of E. coli. In the periplasm, GFP is apparently non-fluorescent. If the protein folds fast enough, the quality control mechanism prevents its export and the protein remains in the cytoplasm.  Their starting GFP was GFPmut2, a variant that was previously optimized for FACS analysis, harboring the mutations S65A/V68L/S72A. Following several round of selection, they isolated a clone which they named superfast GFP. Based on their analysis (using guanidinium chloride (GdnCl) to unfold the proteins and then diluting the GdnCl to refold the proteins, the T1/2 for superfast GFP refolding (resumed fluorescence) was 11 minutes, compared to 33 min (GFPmut2), 20min (sfGFP) and 73min (frGFP).

Graph showing folding of GFP proteins over time. black: square – frGFP; circle – sfGFP; triangle – GFPmut2. open circle – superfast GFP; triangle, square – two other mutants. Source: Fisher AC & DeLisa MP. (2008). PLoS One. 3(6):e2351.

So, far we discussed fast folding GFP proteins. However, the methods to measure folding of these proteins do not discriminate folding from maturation.  Furthermore, the in vitro results do not necessarily represent the in vivo folding rates (i.e. 11 min in vitro does not mean 11 min in vivo).

In the next post I will discuss the issue of maturation.

ResearchBlogging.orgCrameri A, Whitehorn EA, Tate E, & Stemmer WP (1996). Improved green fluorescent protein by molecular evolution using DNA shuffling. Nature biotechnology, 14 (3), 315-9 PMID: 9630892
Pédelacq JD, Cabantous S, Tran T, Terwilliger TC, & Waldo GS (2006). Engineering and characterization of a superfolder green fluorescent protein. Nature biotechnology, 24 (1), 79-88 PMID: 16369541
Lawrence MS, Phillips KJ, & Liu DR (2007). Supercharging proteins can impart unusual resilience. Journal of the American Chemical Society, 129 (33), 10110-2 PMID: 17665911
Fisher AC, & DeLisa MP (2008). Laboratory evolution of fast-folding green fluorescent protein using secretory pathway quality control. PloS one, 3 (6) PMID: 18545653

11 responses to “Folding and maturation, or how to evolve your own GFP (part 1)

  1. Hi Gal,

    This is excellent article! I am waiting for the part 2 🙂

    I am interested in following gene expression dynamics in live mice (muscle cells) by using fluorescent proteins and intravital microscopy. Any ide on the best probes to use and the approaches (FRAP, etc)? P.S. I am very new to this..

    Thanks,
    Andrius

    Like

    • Hi Andrius,
      Thank you.

      Unfortunately, work & life did not allow me the time to write part 2 yet.

      As to your research – to choose a protein and a method, you first need to define what it is you want to image.
      Would you like to image RNA or protein? single or multi color?
      Are you looking only on yes or no signal or also localization?
      Is your baseline (time zero) ON (i.e. the protein is already expressed – in this case you can use FRAP or a photoactivatable protein) or OFF (you induce the expression – in this case you may want to try a fast folder – depending on your time resolution)

      Like

  2. Hello Gal, congrats for your blog. I´d like to ask something. I was trying to determine the subcellular localization of a bacterial protein using a fusion to gfp. If i see the cells 1 hour after induction, the localization has a punctate pattern, and localizes principally into the septum and into the poles. However, when i take pictures 2 hs after induction the fusion is localized only in the membrane. Do you have any explication for this behaviour? Do you think that it could be because of the maturation or amount of the protein at different times?
    Thanks!

    Like

    • Hi Diego,
      Its hard to answer without knowing the particular biology of the fusion protein.
      The problem is not due to maturation – since you see the GFP after 1hr, it means that it matured already.
      I also do not think that it is a problem of oligomerization, since in that case I would expect to see aggregation of the signal, not a shift from poles to membrane.
      so the answer is probably biological – if I understand you correctly, you start with “Zero” protein, then follow the localization after induction.
      it looks like a regulatory mechanism that causes a shift in localization due to concentration or time after induction.
      I would suggest to try a milder induction, or to arrest induction after 1hr. and look at the localization after 2hr.
      Do the cells behave different at 1 and 2hr?
      I would also look for any possible RNA or protein sequence that may have a membrane localization signal.
      I would also test by Northern and by Western to see that the mRNA and protein at 1 and 2hr are the same.

      Like

  3. A rookie question: does the maturation of the N part of a split GFP also happen if the C- part is not yet “linked” to the N- part? Since the maturation/folding happens at the 65, 66 and 67the amino acid I figure this maturation/folding might even take place without the C part already there or are there other (more difficult parameters) that also play a role and thus meaning the C part needs to be there to have the maturation take place?

    I would say: the maturation does not happen because if it did one might except that the N part (where the maturation happens) is enough for the fluorescence to take place! Which is not the case (would destroy the entire use of BiFC).
    Is anything known about this? (why the protein has to be complete (C and N part) for the folding of the N part (65, 66 and 67the amino acids)

    Like

  4. I forgot to add the link to the paper discussing this: http://www.bioinfo.rpi.edu/bystrc/pub/crone_etal.pdf

    Like

  5. This is an interesting question – did you get a satisfactory answer yet? From what I understand, water inside the beta-barrel leads to quenching of fluorescence. In theory one could have a mature fluorophore in the uncomplimented protein, which is quenched due to water in the barrel.

    Like

Leave a comment