29 September 2025

Twentieth-Third Annual Discovery on Target Meeting

The CHI Discovery on Target (DoT) meeting was held last week in Boston. More than 850 people from 24 countries attended, 75% from industry. As usual I’ll just touch on some broad themes.
 
Covalent approaches
Covalent approaches were prominent throughout the conference. One of the very first talks was by Stefan Harry (Harvard/MGH), who described screening 416 cancer cell lines with three reactive “scout probes,” identifying some 6000 cysteine residues that could be covalently liganded. There are some interesting cell and context-dependent differences, and all the data are publicly available and easily searchable through a free online portal called DrugMap. He is now profiling a library of dual-electrophile-containing compounds to identify molecular glues.
 
Knowing which cysteines can be targeted is the first step for covalent drug discovery, and Sherry Ke Li described how she and colleagues at Genentech go about finding ligands. They’ve experimentally determined the reactivity of more than 6400 compounds against free cysteine and used this to train a machine-learning model to predict chemical (as opposed to specific) reactivity. Mass spectrometry (MS) using isolated proteins is the workhorse screening approach, but Sherry also described using variable temperature surface plasmon resonance (SPR) to dissect the individual components of kinact/KI.
 
AstraZeneca has also been doing considerable covalent screening, and Hua Xu briefly described the BFL1 story we wrote about here. In addition to pure proteins, they are now also starting to screen their covalent library in cells. Hua also presented earlier work from Pfizer on the discovery of the covalent kinase inhibitor ritlecitinib, which started with a noncovalent binder. Proteomic studies revealed that in addition to the intended target JAK3, it hits other TEC-family kinases too.
 
Adding a covalent warhead to a reversible binder is also the approach taken by MOMA Therapeutics in the discovery of their clinical WRN inhibitor MOMA-341, as presented by Momar Toure. They ended up targeting the same cysteine as Vividion (see here), though the binding mode is somewhat different. MOMA is also pursuing covalent fragment screening using intact protein MS, and Brian Sosa-Alvarado described how they were able to identify nanomolar inhibitors of RAD54L within six months of starting the program, aided by DNA-encoded libraries (DEL).
 
Not everyone is pursuing cysteine: Ken Hsu described how he and his team at University of Texas Austin are using sulfur-triazole exchange chemistry (SuTEx) to target tyrosine residues across the proteome. He noted that although cysteine could react with this warhead, the resulting thiosulfonate would be unstable. This is true in general, but I wonder if, just like the reversible cyanoacrylamide warheads we wrote about more than a decade ago, they could be stabilized within folded proteins.
 
Noncovalent approaches
Covalency is not the only game in town, as exemplified in a talk by Emma Rivers on “integrated hit discovery” at AstraZeneca. I was tickled that she grouped FBLD with HTS as “traditional” approaches, onto which they’ve added DEL and peptide libraries. Importantly, they’re focused on generating and capturing as much high-quality data as possible to enable machine learning – a topic we’ll touch on more below.
 
Nor are proteins the only target; there was a whole track on RNA- and DNA-targeting small molecules, where Benjamin Brigham described the plate-based equilibrium dialysis-based approach taken at Atavistik to screen metabolites and metabolite-like molecules. This led to two fragment-sized hits against RNA encoding SERPINA1, and although the affinities are modest, they do inhibit translation in a cell-free system.
 
At FBLD 2018 Astex presented the first cryo-EM structure of a fragment-protein complex, noting that throughput was an issue. The company has leaned into that challenge and now has three microscopes, including a top-of-the-line Krios, with another on the way. Miguel Zamora-Porras described how they have now solved hundreds of structures. Their Krios can collect data on two compounds per day, and the full cycle time from protein-ligand preparation to structure is about a week. Miguel described how structures of ligands bound to the ion channel protein TRPML1 helped reveal why some were agonists and others antagonists.
 
Data and its discontents
On the subject of structures, Steve Burley (Rutgers) gave an eloquent history and defense of the “RCSB Protein Data Bank: an open access research resource that benefits all humanity.” From its humble beginnings with just seven structures in 1971, the PDB now contains more than 240,000. And these are not just of scientific interest: all 88 of the new molecular entities the FDA approved for oncology between 2010 and 2023 had PDB structures that informed the biology or druggability, and 75% of the efforts involved structure-based design. Steve also mentioned that the question of where and how to store large-scale crystallographic data will be discussed in a meeting sometime in the spring of next year. Finally, Steve is hoping to retire from his position as Director of the RCSB PDB, so if you’re looking to make an impact, please apply.
 
The dramatic advances in protein structure prediction exemplified by AlphaFold would not have been possible without the PDB, but unfortunately the same high-quality information on protein-ligand binding modes and affinities is not available, as noted by Woody Sherman of Psivant. To illustrate the importance of training data, Woody asked ChatGPT to produce a picture of an analog clock set to 6:32. The result? A clock with three hands, one at 10, one at 2, and one at 6, because most images of clocks are set to 10:10.
 
Woody asked whether machine-learning-based docking can extrapolate or just interpolate. Although impressive results have been reported for some protein-ligand complexes, it turns out that there are often similar ligands in the training data. For truly novel ligands, the predictions tend to fall flat. Similarly, allosteric ligands are often (mis)placed into an orthosteric site – just because the model has been trained that that’s where ligands should go. Indeed, although Psivant is heavily invested in computational approaches, Woody mentioned that they often use “wet” approaches for finding initial chemical matter.
 
On the subject of dubious data, Al Edwards (University of Toronto) noted that a third of all immunofluorescence images in the literature use antibodies that give signals in knockout cells. And as we wrote just last month, many reported small molecule “probes” are just as bad. Al is CEO of the Structural Genomics Consortium, whose ambitious Target 2035 aims to find a pharmacological probe for every target in the human genome. As a starter, they’re aiming for 2000 probes in the next five years. They’re using affinity-selected mass spectrometry (ASMS), screening pools of 500 compounds and 8 proteins at a time, and are getting micromolar hits against about 30% of targets. They’re accepting protein submissions, so if you’re looking for starting points against your favorite protein contact them.
 
I’ll end here, but please leave comments. And mark your calendar for Sep. 28 to Oct. 1 next year, when DoT returns to Boston.

1 comment:

Anonymous said...

On the covalent bonding/binding, I believe that there is still a struggle in figuring out the chicken-egg problem. The need for the non-covalent part to sit properly in the binding site to present the warhead or warhead reactivity dominates and the rest of the ligands placed subsequently. Is this still target dependent or binding site exposure dependent. Has a universal model been developed or identified. What are the contributing factors. a) warhead reactivity. b) transition state energy for the reaction. c) binding site accommodation for reaction. d) thiol pKa/exposure. If it is a mix of these attributes, how do we use these for building a proper model (either ML or other).