Last week we described a fragment library built with the aid of machine learning and designed to contain privileged fragments that should produce high hit rates. Unfortunately, only about a tenth of the library members are commercially available, so it will be some time before we know whether the design was successful. We continue the theme of fragment libraries with a just published Nat. Commun. paper by György Keserű (Hungarian Research Centre for Natural Sciences) and a large group of multinational collaborators (see also here for a nice summary by György).
The researchers started by analyzing more than 3300 crystal structures of protein-fragment complexes in the protein data bank. Fragments were defined as having 10-16 non-hydrogen atoms, and the computational approach FTMap was used to ensure that fragments were binding at hotspots as opposed to spurious, less ligandable sites. This exercise yielded 3584 fragments, but many of them were identical or very similar to one another. The researchers used a series of computational tools to cluster similar fragments (or pharmacophores) and choose a set that would maximize diversity. This ultimately led them to assemble a library of just 96 fragments, purchased from five vendors.
This SpotXplorer0 library mostly follows the rule of three, with 7 to 17 non-hydrogen atoms, MW 100-250 (or 280 for bromine-containing molecules), ≤ 3 hydrogen bond donors, ≤ 8 hydrogen bond acceptors, and ≤ 3 rotatable bonds. In addition, all members have 1-3 rings, no more than a single halogen or sulfur atom, and no PAINS. Despite the small size, this library covers most of the pharmacophores identified in the larger set, and considerably more than the F2X-Entry fragment library we highlighted last year or the top five commercial library vendors we noted here.
The researchers then screened this library against eight targets. Three GPCRs (the serotonin receptors 5-HT1A, 5-HT6, and 5-HT7) were assessed in a cell-based radioligand displacement assay with fragments at just 10 µM. Despite the low concentration, 4-11 hits were found. Biochemical screens conducted at 800 µM against the proteases thrombin and Factor Xa yielded 7 and 8 hits respectively. Further analysis revealed that the SpotXplorer0 ligands sampled a majority of the pharmacophores found in published fragment hits against theses five targets.
Next the researchers screened their library against the histone methyltransferase SETD2, an oncology target with few known attractive ligands. An enzymatic assay yielded two hits, with IC50 values between 300 and 500 µM.
Finally, the SpotXplorer0 library was part of the XChem crystallographic screens against the SARS-CoV-2 main protease (Mpro) and Nsp3 macrodomain, which we discussed here and here. For Mpro, just a single hit was found. This is only half the overall hit rate for noncovalent fragments in the crystallographic screen against this target, but the hit is functionally active and has a high ligand efficiency.
The screen against NSP3 yielded five hits binding at two different sites, for a hit rate of 5.2%. The overall hit rate against this target was 8%, but that encompasses screens against two crystal forms of the protein. The crystal form used for SpotXplorer0 had a hit rate of 21%.
In summary, SpotXplorer0 is new fragment library that gives high coverage of experimental pharmacophore space. Laudably, structures of all 96 fragments are provided in the Supplementary Information. But the jury remains out on how hit-rich the library will be. Interestingly, the F2X-Entry library we highlighted last year gave considerably higher hit rates of 21% and 30%, albeit against two different targets. SpotXplorer0 is being screened crystallographically against multiple targets at XChem, and it will be interesting to see how it performs in the long run.