02 February 2026

xSAR: Crystallographic SAR from crude reactions

Last year we highlighted an example of crystallographic screening of crude reaction mixtures to find inhibitors against the oncology target PHIP(2). Of 957 molecules tested, 22 showed crystallographic binding in two different orientations: 19 in a “lateral” pose and 3 in a “diving” pose. In a new open-access Chem. Sci. article, Philip Biggin and collaborators at Diamond Light Source and University of Oxford try to extract information from both the binders and the non-binders using crystallographic structure-activity relationships, or xSAR.
 
Chemists often think about SAR in qualitative terms: a methyl group here improves affinity, a chlorine atom there reduces it. In xSAR, the researchers sought to take a more quantitative approach. They converted each molecule into “Morgan fingerprints,” a set of more than 2048 binary bits describing structural features such as atom type, hybridization, and connectivity to other atoms within a certain distance. Some bits were found in all binding compounds, and these were referred to as conserved binding bits (CBB), while conserved nonbinding bits (CNB) were found only in non-binding compounds. These bits were then used to calculate Positive and Negative Binding Scores (PBS and NBS); a compound with a PBS of 1 contains all the CBB. Since there were two separate binding modes, the researchers calculated PBS and NBS values for both lateral and diving poses individually as well as for all binders.
 
As the researchers note, false negatives are a likely issue in crude reaction screening for a variety of reasons. To hunt for these, the PBS and NBS values were calculated for all 957 molecules previoulsy tested. A set of 97 pure compounds having mostly high scores were acquired and tested crystallographically, yielding an additional 23 lateral binders and 3 diving binders, more than doubling the initial yield. PBS was particularly informative in this retrospective exercise to recover false negatives, outperforming both NBS as well as other methods such as Tanimoto similarity scores.
 
The researchers also used PBS and NBS scores to search prospectively for new binders in a virtual set of more than 1.7 billion compounds in the Enamine REAL database. After filtering for high PBS/NBS scoring compounds followed by docking, 93 compounds were acquired and tested crystallographically. Interestingly, this yielded a relatively low hit rate of 9 binders, 6 in the lateral pose and three in somewhat different poses. None of the new compounds bound in the diving pose, which the researchers suggest may be due to the small sample size used to calculate PBS and NBS for this binding mode.
 
The 93 new compounds were also tested for binding using grated-coupled interferometry (GCI), and 13 showed measurable affinity, with most better than 50 µM. Two even showed single-digit micromolar affinity, more than an order of magnitude better than the best compound from the screen we discussed last year, and with better ligand efficiencies too. Surprisingly, these two compounds were not hits in the crystallographic screen.
 
This is an interesting paper with a couple important lessons. First, despite the fact that affinity was not used in calculating PBS and NBS, these metrics were nonetheless useful for identifying molecules with better affinity than those in the original training set, arguing for their utility. But perhaps just as importantly, the molecules with the best affinity were missed by crystallographic screening. If anything, this observation only strengthens my conclusion last year that while “there is a strong case for using crystallography first for finding fragments, I am not yet convinced the same applies for optimizing fragments.”

No comments: