24 February 2025

Fragments beat lead-like compounds in a screen against OGG1

The twin rise of make-on-demand libraries and speedy in silico docking has supercharged fragment screening and optimization: we’ve written previously about V-SYNTHES, Crystal Structure First and a related method. Another advance is described by Jens Carlsson (Uppsala University) and a large group of multinational collaborators in an (open access) Nat. Commun. paper.
 
The researchers were interested in 8-oxoguanine DNA glycosylase (OGG1), a DNA-repair enzyme and potential anti-inflammatory and anticancer target. They started with a crystal structure into which they docked 14 million fragments (MW < 250 Da) or 235 million lead-like molecules (250-350 Da) from ZINC15. Multiple conformations and thousands of orientations were sampled for each molecule. In all, 13 trillion fragment complexes and 149 trillion lead-like complexes were evaluated using DOCK3.7, a process that took just 2 hours and 11 hours on a 3500 core cluster.
 
After removing PAINS and molecules similar to previously reported OGG1 inhibitors, the top-scoring 0.05-0.07% molecules from each screen were clustered and, after manual evaluation, 29 fragments and 36 lead-like compounds were purchased from make-on-demand catalogs. These were tested at 495 µM (for fragments) or 99 µM (for larger molecules) in a DSF screen. None of the lead-like compounds significantly stabilized the protein, while several fragments did. Four of the fragments were successfully crystallized with OGG1, and in all cases the key interactions predicted in the computational screens were confirmed in the actual crystal structures.
 
Compound 1 showed the greatest stabilization of OGG1 (2.8 ºC) and some inhibition in an enzymatic assay, but not enough to calculate an IC50. Searching for analogs that contained compound 1 as a substructure in the Enamine REAL database of 11 billion compounds produced few hits, but, as before, thinking in fragments proved fruitful. Searching for molecules containing just the core heterocycle and amide (colored blue below) yielded nearly 43,000 possibilities. Docking these and making and testing a few dozen led to compound 5, with mid-micromolar inhibition. Further iterations led to low micromolar compound 7.


At this point the researchers turned from make-on-demand libraries to synthetically accessible virtual libraries to fine-tune the molecule. After docking 6720 virtual molecules, they synthesized and tested 16, of which 12 were more potent than compound 7, with five of them being submicromolar. Compound 23 showed low micromolar activity in two different cell assays and was selective against four other DNA repair enzymes.
 
The same high-throughput docking approach was applied to three other protein targets: SMYD3, NUDT5, and PHIP. In each case crystal structures of bound fragments were available to use as starting points. Multiple compounds with improved docking scores compared to the initial fragments were identified, though no compounds were actually synthesized and tested.
 
The success in finding compound 1 demonstrates experimentally the advantage fragments have in efficiently searching chemical space. The researchers note that 97% of the >30 billion currently available make-on-demand compounds have molecular weights >350 Da, while only 50 million are < 250 Da. Screening all of these fragments in silico is possible; screening everything, less so. Although the fragment hits for OGG1 were weak, this isn’t always the case, as noted here. The fact that fragment 1 could be advanced to a sub-micromolar inhibitor after synthesizing just a few dozen molecules also testifies to the efficiency of in silico approaches.
 
The paper contains lots of useful details and suggestions for streamlining the process and is well worth perusing if you are trying to find hits against a structurally-enabled protein.

18 February 2025

A fragment prodrug discovered in a phenotypic screen

Glioblastoma multiforme (GBM) is a particularly nasty type of brain tumor with few drug options aside from the DNA alkylating agent temozolomide (TMZ), which is toxic and not particularly effective. Drugs fail for multiple reasons, among them the difficulty many small molecules have crossing the blood-brain barrier. A recent Nature paper by Luis Parada and a large group of collaborators at Memorial Sloan Kettering Cancer Center and elsewhere describes a promising new approach.
 
The researchers screened >200,000 molecules (not necessarily fragments) against primary GBM cells to look for compounds that reduced viability. Generically toxic molecules are so common that they (literally) grow on trees, so hits were counter-screened against mouse embryonic fibroblasts to looks for molecules that selectively killed GBM cells. This led to a rule-of-three compliant compound the researchers dubbed gliocidin.
 
Figuring out how gliocidin works turned out to be a complicated quest, starting with a genome-wide CRISPR-Cas9 screen to look for genes that either protected or sensitized cells to gliocidin. Subsequent work, including knocking out specific genes of interest and LC-MS/MS studies of metabolites, revealed that gliocidin leads to inhibition of a protein called inosine monophosphate dehydrogenase 2 (IMPDH2), which is necessary for guanine synthesis.
 
However, gliocidin is not a direct inhibitor of IMPDH2. Rather, it is is essentially a "pro-prodrug". Gliocidin is first converted into gliocidin-monocucleotide by the enzyme NAMPT (a target we wrote about back in 2014), and subsequently converted to gliocidin-adenine dinucleotide (GAD) by the enzyme NMNAT1. Cryo-EM showed that GAD binds at the NAD+ cofactor binding site of IMPDH2, blocking enzyme activity.


In addition to being a DNA-alkylating agent, TMZ induces NMNAT1 expression, thereby increasing conversion of gliocidin to GAD. Consistent with this, the combination of gliocidin and TMZ was more effective than either agent alone in mouse xenograft models. This is a lovely paper that reads like a detective story, and I’m only able to scratch the surface in a brief blog post. It also has multiple lessons for FBDD.
 
First, as expected given its molecular properties, gliocidin has excellent brain penetration. Vicki Nienaber argued in 2009 that FBDD may be ideally suited for finding molecules that can cross the blood-brain barrier, and gliocidin is a case in point.
 
Second, this paper answers emphatically in the affirmative the question we posed in 2022: “Is phenotypic fragment screening worthwhile?”
 
Third, this is another example of in situ inhibitor assembly to generate an analog of NAD+; we wrote about a small fragment targeting a different protein here. Given that fragments are the size of many metabolites, fragments as prodrugs could be a productive area of research.
 
But such a prodrug approach is not without risks. In that 2014 post about NAMPT inhibitors, I noted that some molecules had poorly characterized off-target activities, which could perhaps now be explained through this type of in situ activation. The new paper found that GAD does not inhibit two different NAD+ or NADPH-dependent enzymes, but hitting off-target enzymes will be something to watch for during optimization. I look forward to following this story.

10 February 2025

Flatland: still a nice place to be

In 2009 we highlighted a paper reporting that approved drugs have a higher fraction of sp3-hybridized carbon atoms than discovery-phase compounds. Perhaps focusing on molecules with a high Fsp3, the ratio of sp3-hybridzed carbons to total carbons, would lead to greater success. Or not: a new analysis in Nat. Rev. Chem. by Ian Churcher (Janus Drug Discovery), Stuart Newbold, and Christopher Murray (both at Astex) finds that the relationship has not held up.
 
The 2009 “Escape from Flatland” paper was widely discussed at conferences and has been cited more than 3000 times. But according to the authors of the new study, most of these citations are from papers describing new synthetic methodologies rather than from papers discussing medicinal chemistry.
 
And not all the attention has been positive. As we noted in 2013, Pete Kenny and Carlos Montanari reanalyzed the data and found that an apparent correlation between Fsp3 and solubility disappeared when plotting all discrete data points instead of binned data.
 
More recently, we highlighted a paper that found no significant difference between the shapeliness of drugs, as assessed by their principal moment of inertia (PMI), and the shapeliness of small molecules in the ZINC database.
 
The new paper looks at Fsp3 values for drugs approved during various time periods. Among 980 drugs approved up to 2009, the average Fsp3 was 0.458. However, of the 431 drugs approved after 2009, the average Fsp3 has dropped to 0.392. The researchers speculate that this (statistically significant) decrease may be due to an increase in the number of kinase inhibitors, which are usually highly aromatic, as well as an increase in the use of metal-catalyzed cross coupling reactions.
 
In my analysis of the 2009 paper, I asked whether higher Fsp3 ratios would lead to lower hit rates, and indeed this seems to be the case, as shown in a paper we discussed in 2020. Thus, if you pursue difficult targets, you may increase your chances of finding hits by screening molecules with lower Fsp3 ratios. Also, multiple studies, including one published just last month, have found no correlation between the shapeliness of a fragment (as defined by deviation from planarity, or DFP) and the shapeliness of the resulting lead, so there appears to be no penalty to starting with a flattish fragment. 
 
The researchers conclude that their “analysis of drug development trends over the last 15 years suggests that Fsp3 may not have been a useful metric to optimize.” Importantly, the supplementary information includes a list of >1400 approved drugs and >1500 investigational drugs along with associated properties, so you can do your own analyses.
 
In the end, generalizations will only get you so far, and may even lead you astray. At least for now, there are few shortcuts in the long slog of experimental studies necessary to discover a drug.

03 February 2025

Stitching together fragments with Fragmenstein

As we noted just last week, crystallography has unleashed a torrent of protein-ligand complexes, especially fragments. Historically a single structure might be used for fragment growing, but so many structures present an embarrassment of riches, with sometimes dozens of fragments that bind in the same region. Merging or linking these fragments can be done manually, as seen here and here, but how to do so when the binding modes are partially overlapping is not always intuitive. In a new open-access J. Cheminform. paper, Matteo Ferla and colleagues at University of Oxford and elsewhere describe an open-source solution called Fragmenstein.
 
We briefly described Fragmenstein in 2023, where it was used to combine pairs of low-affinity fragments bound to the Nsp3 macrodomain of SARS-CoV-2 to generate sub-micromolar inhibitors. The current paper describes the platform in detail.
 
Fragmenstein starts by taking two (or more) structures of fragments bound to a protein and virtually combining them. This is done by collapsing rings to their individual centroids, stitching these together along with their substituents, and then re-expanding the ring(s) so the substituents will be close to where they were in the initial fragments. This process produces a surprising array of molecules beyond the obvious. For example, if one fragment contains a phenyl ring and the other fragment contains a furan ring, the stitched molecule might just contain the phenyl (if the two rings bind in nearly the same position), a benzofuran (merging the rings), a phenyl ring linked to a furan by one or more atoms, or even a spiro compound if the rings are perpendicular to one another.
 
In silico approaches sometimes suggest molecules that are synthetically challenging to make, but Fragmenstein can also be used to find purchasable analogs.
 
Next, the new molecules are energetically minimized, first by themselves and then while docked into the protein. In contrast to other docking programs, which might allow molecules to sample thousands of different conformations and sites in a protein, Fragmenstein maintains the new molecule in a similar position and orientation to the initial fragments, with the assumption that these have already identified energetically favorable interactions.
 
The researchers successfully applied Fragmenstein retrospectively to several targets. The COVID Moonshot (which we discussed here) crowd-sourced molecule ideas for the SARS-CoV-2 main protease based on structures of bound fragments. Of 87 ligands that had been crystallographically characterized and were designed based on two fragments, Fragmenstein successfully (RMSD < 2 Å) predicted the binding mode for 69%.
 
Fragmenstein can even be used for covalent ligands, as shown for the target NUDT7, which we wrote about here. Merging two fragments led to compound NUDT7-COV-1, and the RMSD between the Fragmenstein model and the crystal structure was an impressive 0.28 Å.
 
Of course, as the researchers acknowledge, the number of possible analogs might be daunting, and deciding which to make or buy is not necessarily straightforward. Also, Fragmenstein assumes that the fragments themselves are making productive interactions with the protein, which may not be the case, as we suggested here. Still, the tool is open-source and worth trying, especially if you are swimming in crystal structures.