13 May 2024

Fragments in cells, writ large

Earlier this year we highlighted work in which a dozen fragments were screened against cells to look for noncovalent binders across the proteome. A new paper in Science by Georg Winter and collaborators at the Austrian Academy of Sciences, Pfizer, and several other organizations ups the game by more than an order of magnitude, and uses machine learning to make predictions about fragments’ cellular destinations and binding partners. (See also Derek Lowe’s post here.)
 
The researchers started with 407 diverse fully functionalized fragments (FFFs), which as we previously discussed consist of a variable fragment coupled to a photoreactive group and an alkyne moiety that can be used to pull down any bound proteins using click chemistry. These were selected from a larger set of ~6000 FFFs available from Enamine. The FFFs were incubated at 50 µM with intact HEK293T cells, followed by ultraviolet crosslinking.
 
Next, cells were lysed and treated with a biotin-azide probe that reacts with the alkyne on the FFFs. Covalently modified proteins were captured on streptavidin resin and proteolytically digested. Tandem mass tag (TMT) proteomics, which we wrote about here, was used to identify captured proteins. Unlike earlier methods, the researchers did not pinpoint the specific fragment binding sites on proteins.
 
In total the researchers found 2667 proteins bound to one or more fragments, of which ~86% had no reported ligands. Both proteins and ligands varied considerably in promiscuity: some proteins bound to more than half of the FFFs, and some fragments bound to hundreds of proteins, while others bound only a few, or none. To look for specific interactions, the researchers focused on proteins bound by fewer than 10 different ligands.
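
The filtering step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function names, the toy fragment and protein identifiers, and the representation of the data as (fragment, protein) pairs are all hypothetical. Only the cutoff of fewer than 10 ligands comes from the text.

```python
from collections import defaultdict


def ligands_per_protein(interactions):
    """Group (fragment, protein) pairs into {protein: set of fragments}."""
    by_protein = defaultdict(set)
    for fragment, protein in interactions:
        by_protein[protein].add(fragment)
    return by_protein


def selective_proteins(interactions, max_ligands=10):
    """Keep proteins bound by fewer than `max_ligands` distinct fragments.

    The default of 10 mirrors the cutoff mentioned in the post; the input
    format is an assumption made for this sketch.
    """
    by_protein = ligands_per_protein(interactions)
    return {
        protein: sorted(fragments)
        for protein, fragments in by_protein.items()
        if len(fragments) < max_ligands
    }


# Toy, made-up data: PROT_A is bound by three fragments, the others by one.
toy = [
    ("F1", "PROT_A"),
    ("F2", "PROT_A"),
    ("F3", "PROT_A"),
    ("F1", "PROT_B"),
    ("F2", "PROT_C"),
]

# With a stricter toy cutoff of 3, PROT_A is dropped as too promiscuous.
result = selective_proteins(toy, max_ligands=3)
```

The same grouping, keyed by fragment instead of protein, would give a per-fragment promiscuity count.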
 
Three protein-ligand interactions were analyzed in some detail: the kinase CDK2 (and other CDK family members), the adapter protein DDB1, and the solute carrier protein SLC29A1. In each case the researchers confirmed the results from their chemoproteomic screens. Follow-up studies with related molecules led to more potent derivatives, with a CDK2 inhibitor showing low micromolar activity in a biochemical assay and an SLC29A1 inhibitor showing micromolar activity in a cell-based assay.
 
The researchers also found patterns in their larger data set. Armed with 47,658 protein-ligand interactions, the researchers were able to use machine learning to start to predict which molecular features were associated with binding. They classified fragments as promiscuous or nonpromiscuous and built a promiscuity model. Molecules with higher lipophilicity and a greater fraction of aromatic carbon atoms tended to be more promiscuous, but the model could correctly categorize compounds as promiscuous even if they had lower ClogP values, or nonpromiscuous even if they had higher ClogP values.
 
Beyond promiscuity, the researchers used machine learning to predict other behavior, such as subcellular localization. A relatively easy case was to predict which molecules would accumulate in lysosomes; these tended to be hydrophobic basic amines. More impressively, the researchers could predict fragments likely to bind to transmembrane transporters, RNA binding proteins, and even intrinsically disordered proteins. And this is just the start: they hope one day to predict “target proteins from an input chemical structure alone.”
 
Perhaps most exciting, all of the data and models are available for free at Ligand Discovery. You can explore the proteins bound across all 407 fragments, input one or more proteins and find ligands, predict whether any given FFF is likely to be promiscuous or not, and even “build a machine learning model on the fly to predict potential interactions.” 
 
Check it out and let us know your experience.

06 May 2024

Covalent fragments vs WRN

Last week Practical Fragments highlighted a covalent clinical compound from Vividion and Roche against the oncology target WRN. Another series of inhibitors against this protein is described in a recent Cancer Discov. paper by Gabriele Picco, Mathew Garnett, and collaborators at the Wellcome Sanger Institute, GSK, IDEAYA, and several academic institutes.
 
As we described in more detail last week, WRN is a synthetic lethal target for microsatellite instability (MSI) cancers. In contrast to the Vividion paper, which started by screening covalent fragments against cell lysates, here the researchers incubated purified WRN protein with each member of their covalent library (at 20 µM for 24 hours at 21 °C) and analyzed the reactions by intact protein mass spectrometry. The fragment library was based around the methyl acrylate warhead, which, as we discussed a decade ago, has a narrower range of reactivities than more common acrylamides.
 
GSK_WRN1 was one of the prominent hits, with 81% modification. Tryptic digestion revealed that it modified C727, the same cysteine found by the Vividion researchers. Medicinal chemistry led to GSK_WRN3, with sub-micromolar activity in MSI SW48 cells. (Unfortunately no other details on the chemistry are provided; the paper states that these will be written up separately.)
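
For readers unfamiliar with how a figure such as "81% modification" is typically derived from intact protein mass spectrometry, here is a minimal sketch. It assumes the common convention of dividing the adduct peak intensity by the summed intensities of the adduct and unmodified protein peaks; the paper's exact quantification method is not described here, and the function name and example intensities are hypothetical.

```python
def percent_modified(unmodified_intensity, adduct_intensity):
    """Percent of protein carrying the covalent adduct.

    Assumes a simple two-species picture: one peak for unmodified protein
    and one for the singly modified adduct, with comparable ionization.
    """
    if unmodified_intensity < 0 or adduct_intensity < 0:
        raise ValueError("peak intensities must be non-negative")
    total = unmodified_intensity + adduct_intensity
    if total == 0:
        raise ValueError("at least one peak must have nonzero intensity")
    return 100.0 * adduct_intensity / total


# Made-up deconvoluted peak intensities chosen only to illustrate the math.
example = percent_modified(unmodified_intensity=19.0, adduct_intensity=81.0)
```

In practice, multiply modified species and differences in ionization efficiency complicate this picture, so the numbers are best read as a relative ranking of hits.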
 
GSK_WRN3 or a closely related compound was tested in a battery of assays and found to be inactive against three other helicases, which is not surprising given that C727 is unique to WRN. Chemoproteomic studies in cells also revealed the compound to be quite selective for WRN over other proteins. The compounds selectively inhibited MSI cancer cell lines and patient-derived organoids while sparing microsatellite stable (MSS) cell lines and organoids. One of the compounds showed activity in a mouse xenograft model.
 
In a useful public service, the researchers tested two previously reported WRN inhibitors, MIRA-1 and NSC617145, in the same set of several dozen cell lines and found that not only were they ineffective, they also lacked selectivity for MSI cells over MSS cells. Although Dr. Saysno might object, I nominate these molecules to be added to the “Unsuitables” bestiary at the Chemical Probes Portal.
 
I do wish more details about the molecules were provided, especially the kinact/Ki values. It is interesting that GSK_WRN3 bears remarkable structural similarities to VVD-109063. IDEAYA recently announced that their collaboration with GSK has resulted in a development candidate targeting WRN, and it will be fun to see the full story emerge.