08 August 2022

Solving structures with selective labeling and NMR2

Protein-detected NMR first enabled fragment-based lead discovery way back in 1996, but improvements in crystallography have now allowed synchrotrons to surpass big magnets as preeminent tools to determine how fragments bind to proteins. One of the major challenges in NMR is assigning the chemical shift values of atoms in all the individual amino acid residues. A technique called NMR Molecular Replacement (NMR2) sidesteps the need for this tedious, time-consuming process. A refinement to this technique, making it more broadly applicable, has just been published (open-access) in Sci. Reports by Julien Orts (University of Vienna), Martin Scanlon (Monash University) and collaborators.
 
As we discussed previously, NMR2 relies on intensive calculations using experimental intermolecular NOEs between a protein and a ligand to generate a model. Although the method does not require assignment of backbone or side chain chemical shifts, it does require high-quality spectra. For example, if the spectra of several amino acid residues overlap it is impossible to distinguish them (this applies to conventional NMR methods too). The researchers realized that one way to simplify the spectra is through selective labeling, in which the methyl groups of the amino acid residues alanine, isoleucine, leucine, valine, and threonine are isotopically labeled with 13C. Going one step further, the entire protein can be deuterated (rendering most of the protein invisible to NMR), while these methyl groups retain ordinary hydrogen atoms.
 
For the present study, the researchers focused on the protein EcDsbA, an antibacterial target we’ve written about previously. They selectively labeled methyl groups so that, in isoleucine, leucine, and valine, only one of the two methyl groups was labeled. That reduced the total number of protons to just 6% of the unlabeled protein.
 
The researchers then solved the structure of EcDsbA with a previously identified ligand. At 23 heavy atoms the ligand is on the large side, though with an affinity of just 0.9 mM it presents a difficult test case. A total of twelve intermolecular NOEs were used in NMR2 to build a model of the complex. One challenge with NMR2 is that there may not be a single solution. For example, if two methionine methyl groups are both near a ligand, it may be impossible to determine a unique binding mode. This turned out to be the case, and the top two structures had different positions for a carboxylic acid group and a phenyl in the ligand.
 
To benchmark NMR2, the protein-ligand complex was also determined using conventional two-dimensional techniques (HADDOCK and CYANA, which made use of assigned chemical shifts) as well as X-ray crystallography. These all agreed with the NMR2 model in placing a phenylpropyl moiety from the ligand in a hydrophobic groove, but they differed in the placement of the carboxylic acid and the other phenyl moiety: the top scoring NMR2 model agreed with the crystal structure and the CYANA NMR structure but differed from the HADDOCK structure, which was similar to the second-best NMR2 model. Before assuming that the crystallographic structure is correct, though, it is worth noting that the ligand makes crystal contacts with a neighboring protein, and the electron density around the ambiguous phenyl is weak.
 
This is a nice demonstration of the utility of NMR2. It seems to provide similar information as classic NMR methods, but the time taken is “orders of magnitude” less. And selective labeling should make NMR2 applicable to even larger proteins. I look forward to seeing more people use this strategy.

No comments: