The CHI Discovery on Target (DoT)
meeting was held last week in Boston. More than 850 people from 24 countries
attended, 75% from industry. As usual I’ll just touch on some broad themes.
Covalent approaches
Covalent approaches were prominent
throughout the conference. One of the very first talks was by Stefan Harry (Harvard/MGH),
who described screening 416 cancer cell lines with three reactive “scout probes,”
identifying some 6000 cysteine residues that could be covalently liganded. There
are some interesting cell and context-dependent differences, and all the data
are publicly available and easily searchable through a free online portal
called DrugMap. He is now profiling a library of dual-electrophile-containing
compounds to identify molecular glues.
Knowing which cysteines can be
targeted is the first step for covalent drug discovery, and Sherry Ke Li described
how she and colleagues at Genentech go about finding ligands. They’ve
experimentally determined the reactivity of more than 6400 compounds against free
cysteine and used this to train a machine-learning model to predict chemical
(as opposed to specific) reactivity. Mass spectrometry (MS) using isolated
proteins is the workhorse screening approach, but Sherry also described using
variable temperature surface plasmon resonance (SPR) to dissect the individual
components of kinact/KI.
AstraZeneca has also been doing considerable
covalent screening, and Hua Xu briefly described the BFL1 story we wrote about
here. In addition to pure proteins, they are now also starting to screen their
covalent library in cells. Hua also presented earlier work from Pfizer on the
discovery of the covalent kinase inhibitor ritlecitinib, which started with a
noncovalent binder. Proteomic studies revealed that in addition to the intended
target JAK3, it hits other TEC-family kinases too.
Adding a covalent warhead to a
reversible binder is also the approach taken by MOMA Therapeutics in the
discovery of their clinical WRN inhibitor MOMA-341, as presented by Momar
Toure. They ended up targeting the same cysteine as Vividion (see here), though
the binding mode is somewhat different. MOMA is also pursuing covalent fragment
screening using intact protein MS, and Brian Sosa-Alvarado described how they
were able to identify nanomolar inhibitors of RAD54L within six months of
starting the program, aided by DNA-encoded libraries (DEL).
Not everyone is pursuing
cysteine: Ken Hsu described how he and his team at University of Texas Austin
are using sulfur-triazole exchange chemistry (SuTEx) to target tyrosine residues
across the proteome. He noted that although cysteine could react with this
warhead, the resulting thiosulfonate would be unstable. This is true in general,
but I wonder if, just like the reversible cyanoacrylamide warheads we wrote
about more than a decade ago, they could be stabilized within folded proteins.
Noncovalent approaches
Covalency is not the only game in
town, as exemplified in a talk by Emma Rivers on “integrated hit discovery” at
AstraZeneca. I was tickled that she grouped FBLD with HTS as “traditional”
approaches, onto which they’ve added DEL and peptide libraries. Importantly, they’re
focused on generating and capturing as much high-quality data as possible to
enable machine learning – a topic we’ll touch on more below.
Nor are proteins the only target;
there was a whole track on RNA- and DNA-targeting small molecules, where
Benjamin Brigham described the plate-based equilibrium dialysis-based approach
taken at Atavistik to screen metabolites and metabolite-like molecules. This
led to two fragment-sized hits against RNA encoding SERPINA1, and although the
affinities are modest, they do inhibit translation in a cell-free system.
At FBLD 2018 Astex presented the
first cryo-EM structure of a fragment-protein complex, noting that throughput
was an issue. The company has leaned into that challenge and now has three microscopes,
including a top-of-the-line Krios, with
another on the way. Miguel Zamora-Porras described how they have now solved
hundreds of structures. Their Krios can collect data on two compounds per day,
and the full cycle time from protein-ligand preparation to structure is about a
week. Miguel described how structures of ligands bound to the ion channel
protein TRPML1 helped reveal why some were agonists and others antagonists.
Data and its discontents
On the subject of structures,
Steve Burley (Rutgers) gave an eloquent history and defense of the “RCSB
Protein Data Bank: an open access research resource that benefits all humanity.”
From its humble beginnings with just seven structures in 1971, the PDB now contains
more than 240,000. And these are not just of scientific interest: all 88 of the
new molecular entities the FDA approved for oncology between 2010 and 2023 had
PDB structures that informed the biology or druggability, and 75% of the
efforts involved structure-based design. Steve also mentioned that the question
of where and how to store large-scale crystallographic data will be discussed
in a meeting sometime in the spring of next year. Finally, Steve is hoping to retire
from his position as Director of the RCSB PDB, so if you’re looking to make an
impact, please apply.
The dramatic advances in protein
structure prediction exemplified by AlphaFold would not have been possible without
the PDB, but unfortunately the same high-quality information on protein-ligand
binding modes and affinities is not available, as noted by Woody Sherman of Psivant.
To illustrate the importance of training data, Woody asked ChatGPT to produce a
picture of an analog clock set to 6:32. The result? A clock with three hands,
one at 10, one at 2, and one at 6, because most images of clocks are set to 10:10.
Woody asked whether machine-learning-based
docking can extrapolate or just interpolate. Although impressive results have
been reported for some protein-ligand complexes, it turns out that there are often
similar ligands in the training data. For truly novel ligands, the predictions
tend to fall flat. Similarly, allosteric ligands are often (mis)placed into an
orthosteric site – just because the model has been trained that that’s where
ligands should go. Indeed, although Psivant is heavily invested in computational
approaches, Woody mentioned that they often use “wet” approaches for finding
initial chemical matter.
On the subject of dubious data, Al
Edwards (University of Toronto) noted that a third of all immunofluorescence
images in the literature use antibodies that give signals in knockout cells. And
as we wrote just last month, many reported small molecule “probes” are just as
bad. Al is CEO of the Structural Genomics Consortium, whose ambitious Target 2035
aims to find a pharmacological probe for every target in the human genome. As a
starter, they’re aiming for 2000 probes in the next five years. They’re using
affinity-selected mass spectrometry (ASMS), screening pools of 500 compounds and
8 proteins at a time, and are getting micromolar hits against about 30% of
targets. They’re accepting protein submissions, so if you’re looking for
starting points against your favorite protein contact them.
I’ll end here, but please leave
comments. And mark your calendar for Sep. 28 to Oct. 1 next year, when DoT
returns to Boston.
1 comment:
On the covalent bonding/binding, I believe that there is still a struggle in figuring out the chicken-egg problem. The need for the non-covalent part to sit properly in the binding site to present the warhead or warhead reactivity dominates and the rest of the ligands placed subsequently. Is this still target dependent or binding site exposure dependent. Has a universal model been developed or identified. What are the contributing factors. a) warhead reactivity. b) transition state energy for the reaction. c) binding site accommodation for reaction. d) thiol pKa/exposure. If it is a mix of these attributes, how do we use these for building a proper model (either ML or other).
Post a Comment