As we noted just last week, crystallography
has unleashed a torrent of protein-ligand complexes, especially fragments. Historically
a single structure might be used for fragment growing, but so many structures present
an embarrassment of riches, with sometimes dozens of fragments that bind in the
same region. Merging or linking these fragments can be done manually, as seen
here and here, but how to do so when the binding modes are partially
overlapping is not always intuitive. In a new open-access J. Cheminform.
paper, Matteo Ferla and colleagues at University of Oxford and elsewhere describe
an open-source solution called Fragmenstein.
We briefly described Fragmenstein
in 2023, where it was used to combine pairs of low-affinity fragments bound to
the Nsp3 macrodomain of SARS-CoV-2 to generate sub-micromolar inhibitors. The
current paper describes the platform in detail.
Fragmenstein starts by taking two
(or more) structures of fragments bound to a protein and virtually combining
them. This is done by collapsing rings to their individual centroids, stitching
these together along with their substituents, and then re-expanding the ring(s)
so the substituents will be close to where they were in the initial fragments. This
process produces a surprising array of molecules beyond the obvious. For example,
if one fragment contains a phenyl ring and the other fragment contains a furan
ring, the stitched molecule might just contain the phenyl (if the two rings bind
in nearly the same position), a benzofuran (merging the rings), a phenyl ring
linked to a furan by one or more atoms, or even a spiro compound if the rings
are perpendicular to one another.
In silico approaches sometimes
suggest molecules that are synthetically challenging to make, but Fragmenstein
can also be used to find purchasable analogs.
Next, the new molecules are
energetically minimized, first by themselves and then while docked into the
protein. In contrast to other docking programs, which might allow molecules to
sample thousands of different conformations and sites in a protein, Fragmenstein
maintains the new molecule in a similar position and orientation to the initial
fragments, with the assumption that these have already identified energetically
favorable interactions.
The researchers successfully applied
Fragmenstein retrospectively to several targets. The COVID Moonshot (which we
discussed here) crowd-sourced molecule ideas for the SARS-CoV-2 main protease
based on structures of bound fragments. Of 87 ligands that had been crystallographically
characterized and were designed based on two fragments, Fragmenstein successfully
(RMSD < 2 Å) predicted the binding mode for 69%.
Fragmenstein can even be used for
covalent ligands, as shown for the target NUDT7, which we wrote about here.
Merging two fragments led to compound NUDT7-COV-1, and the RMSD between the
Fragmenstein model and the crystal structure was an impressive 0.28 Å.
Of course, as the researchers
acknowledge, the number of possible analogs might be daunting, and deciding
which to make or buy is not necessarily straightforward. Also, Fragmenstein
assumes that the fragments themselves are making productive interactions with
the protein, which may not be the case, as we suggested here. Still, the tool
is open-source and worth trying, especially if you are swimming in crystal
structures.
No comments:
Post a Comment