Practical Fragments

15 April 2024

Detailing hot spots with atomic consensus sites

Practical Fragments has written frequently about hot spots, regions on proteins that are predisposed to bind ligands such as drugs. Determining whether a protein has a hot spot can help prioritize a target for screening, and one of the more established computational approaches to do so is FTMap, which we wrote about most recently just a couple months ago.

While FTMap can tell you whether a protein has one or more hot spots, it provides few further details, such as which regions might prefer a hydrogen bond donor or acceptor. This has now been addressed in a new J. Chem. Inf. Mod. paper by Sandor Vajda and collaborators at Boston University, Stony Brook University, and Acpharis. (Diane Joseph-McCarthy presented some of this work at the CHI DDC conference earlier this month.)

The original version of FTMap started with a collection of 16 very small molecule probes: these were docked all over a protein, with hot spots being identified as consensus sites where many probes bound. To get more information about each hot spot, the researchers have extended the method – now called E-FTMap – by increasing the number of probes to 119 covering key functional groups. For example, whereas FTMap included dimethyl ether as a probe, E-FTMap also includes 2-methoxypropane, 2-methoxy-2-methylpropane, and tetrahydropyran. If all these probes bind with the oxygen in the same part of the hot spot, this suggests a predilection for a hydrogen bond acceptor, and also provides information about nearby hydrophobic contacts.

By using a sufficiently diverse group of virtual probes, E-FTMap is able to more finely detail hot spots, tallying the “atomic consensus sites” within them. This is reminiscent of an approach we wrote about several years ago, though that method used just three different probes.

To benchmark E-FTMap, the researchers took 109 fragment-to-lead pairs with published crystallographic information and assessed whether the program could identify interactions that had been experimentally observed. The results were encouraging and far superior to the original version of FTMap. The highest ranked atomic consensus sites generally overlapped with appropriate atoms in fragments and leads. Interestingly, the results for fragments were better than those for leads, and the researchers suggest this is because the fragment “core is responsible for the bulk of the binding free energy in a ligand and that larger ligands bind by forming additional interactions at weaker hot spots that surround the fragment binding site.”

Next, E-FTMap was tested against five proteins for which between 31 and 353 fragment-bound crystal structures were available. Here too the program was broadly successful, though some fragments bound regions of the protein that E-FTMap overlooked, particularly in cases where there were conformational changes. This is not surprising given that the program assumes the protein remains rigid. (Other computational approaches such as SWISH, which we wrote about here, are starting to account for protein flexibility.)

E-FTMap looks qualitatively at specific atomic interactions, and one question I had was how well the atomic consensus sites matched up with binding affinities of known fragments; perhaps some crystallographically identified fragments bind so weakly one would not expect to find them computationally, as we discussed here and here. This hypothesis might be tested by focusing on comparisons with experimentally characterized fragments with the highest ligand efficiencies.

Also, I was struck by the fact that the virtual probes in E-FTMap are roughly the size of MiniFrags or MicroFrags, and I couldn’t help but wonder how well the atomic consensus sites from the virtual screens would correlate with the binding modes of these tiniest of fragments.

One nice feature of E-FTMap is that it can be accessed through a simple web server, so if you’re interested in these and other questions you can test it for yourself. If you do, please share your experiences.

08 April 2024

Nineteenth Annual Fragment-Based Drug Discovery Meeting

Last week the CHI Drug Discovery Chemistry (DDC) meeting was held in San Diego. This was the largest ever, with more than 900 participants, 95% of whom attended in person, up from 87% last year. I won’t attempt to cover all fourteen tracks but will just touch on some of the main themes.

Computational approaches

All four days of the conference featured dedicated sessions on machine learning and artificial intelligence, but since I was in other sessions I don’t know how relevant they were to FBLD. If you attended an interesting talk please let me know so I can watch it on-demand.

Among computational talks I did see, Antonina Nazarova (University of Southern California) provided an update on V-SYNTHES, which we first wrote about here. This synthon-based screening approach now covers 36 billion molecules and has been tested against eight different proteins, four of which yielded nanomolar hits when tested experimentally.

Computational methods have historically treated proteins as rigid, though many targets are anything but. Diane Joseph-McCarthy (Boston University) described an improvement to the pocket finding approach FTMap, called FTMove, to incorporate molecular dynamics by starting with an ensemble of different crystal structures. A further advance is E-FTMap, which expands the number of virtual probes from 16 to 119 to more finely assess ligandable sites.

Benjamin Walters (Genentech) described using protein dynamics to find cryptic pockets using ESP, or Experimental Structure Prediction. In this approach, experimental data from hydrogen-deuterium exchange (HDX) or chemical shift perturbations (CSPs) are used to constrain multiple parallel computational simulations, leading to better models than flexible docking, even for weak fragments.

Experimental approaches

Protein-detected NMR was the first practical fragment-finding method, and Steve Fesik (Vanderbilt) described using SAR by NMR to find fragments binding to the papain-like protease of SARS-CoV-2. These have been advanced to molecules with nanomolar affinity and activity in cell-based assays.

Andreas Lingel described the new fluorine-containing fragment library built at Novartis and how ¹⁹F NMR was used to generate inhibitors of IL-1β. We wrote about that success last year, noting that the initial fragment hit was “super-sized,” and Andreas confirmed that for trifluoromethyl-containing fragments the upper molecular weight limit was relaxed to 350 Da.

Sriram Tyagarajan (Merck) presented a crystallographic screen against the neurodegeneration target TTBK1 which yielded hits at 15 sites. Several potential allosteric sites were identified, but fragment growing and linking were not successful, leading them to a quick (3 month) no-go decision on the protein.

Virgil Woods (City University of New York) described using crystallographic screening to find hits against the challenging phosphatase PTP1B both under conventional cryogenic temperatures as well as at room temperature. As we noted about related work, there was a surprisingly poor overlap between the two sets of hits, and some fragments bound in a different manner at different temperatures.

Integrating FBDD and DNA-encoded libraries (DEL) for lead generation was the topic of Chaohong Sun’s talk. She noted that of some two dozen targets at AbbVie screened by both methods, 60% found hits from both, 10% found only fragment hits, and 5% found only DEL hits, with a quarter of the targets producing no hits. Hits from both approaches can be combined, as we noted here. Chaohong also noted that for both FBDD and DEL, high quality protein is essential for successful screens.

Covalent approaches

Covalent approaches to drug discovery are becoming ever more acceptable as more covalent drugs are approved. Understanding these in depth was the focus of Micah Niphakis (Lundbeck), who characterized 22 approved drugs containing 18 different warheads. The stability in buffer, liver microsomes, and hepatocytes varied dramatically, though more recently approved drugs tended to be more stable. Chemoproteomic studies revealed many off-targets in cells; for example, all the kinase inhibitors tested hit BTK to some extent even when this was not the primary target. The fact that the drugs are (mostly) safe and well-tolerated is a useful reminder that just because we can detect something doesn’t mean it is relevant.

Henry Blackwell described building a 12,000-member covalent fragment library at AstraZeneca. Due to the presence of a warhead, they relaxed rule of three parameters, with MW ranging from 250-400 Da and ClogP from 0-4. Henry also discussed the successful use of this library to identify covalent hits against the anticancer target BFL1 that were optimized to k_inact/K_I ~ 7000 M^-1s^-1. This accomplishment is all the more impressive given that screens using ASMS, DSF, ¹⁹F NMR, and SPR had all failed to yield validated hits.

We recently wrote about electrophilic MiniFrags, and György Keserű (Research Center for Natural Sciences, Hungary) described screening these against HDAC8 and the main protease from SARS-CoV-2. He also mentioned that the set is available for purchase from Enamine, so you can try it yourself against your favorite target.

As covalent modifiers become more common we will see new metrics for characterizing them, as illustrated by Benjamin Horning’s (Vividion) presentation, “Ligand Efficiency Metrics in Covalent Drug Discovery.” He described Ligand Reactivity Efficiency (LRE), defined as pTE₅₀(target, 1 hr) – pTE₅₀(glutathione, 1 hr), where TE is the target (or glutathione) engagement. LRE is analogous to LLE but focused on reactivity rather than lipophilicity. Despite my post last week, the metric could be useful, and I look forward to seeing what Dr. Saysno and friends will make of it.

Most covalent modifiers bind to a target and remain intact, but Nir London (Weizmann Institute) has developed Covalent Ligand Directed Release (CoLDR), in which a portion of the small molecule leaves; applications include release of fluorescent or chemiluminescent probes. Useable warheads include α-substituted methacrylamides and sulfamate acetamides.

Although more recent covalent drugs have targeted cysteine residues, there is growing interest in other amino acid side chains. Nir mentioned that thio-methacrylate esters can react with lysine residues, thought the kinetics are slow. And Carlo Ballatore (University of California San Diego) described hydroxy-naphthaldehyde fragments that bound reversibly to a lysine on the vascular target KRIT1.

Both plenary keynote speakers focused heavily on covalent chemistry. Dan Nomura (UC Berkeley) described using chemoproteomics approaches to find covalent molecules that could inhibit, degrade, or change the cellular localization of myriad proteins.

Finally, K. Barry Sharpless (Scripps), one of only five people to have been awarded the Nobel Prize twice, gave a rich description of sulfur (VI) fluoride exchange chemistry (SuFEx), which included drawing chemical structures on a flip chart. He presented the discovery of a fluorosulfate that is bactericidal against multiple resistant forms of Mycobacterium tuberculosis. Interestingly, the molecule works by modifying a catalytic serine residue which then cyclizes to form a β-lactam. His passion for chemistry is obvious, but he also has personal reasons for pursuing the second most deadly infectious disease: his brother died of tuberculosis before effective drugs were developed. And with the rise of extensively drug resistant TB, we’ll need new ones.

I’ll end on that note, but please leave comments. And mark your calendar for April 14-17 next year, when DDC returns to San Diego.

01 April 2024

Personality tests for molecules

Long-time readers of Practical Fragments will be familiar with various metrics for measuring molecules, such as LE, LLE, and WTF. But these are all hard-edged, numerical constructs. Some folks argue that we should take a softer, more nuanced approach. This call has been heeded by Katharine Bigg and Isabel Myerrors in the form of a “Myerrors-Bigg” Type Indicator, or MBTI.

The MBTI consists of a series of questions which rank a molecule into four dimensions: Extroversion/Introversion, Sociable/Nonsociable, Flat/Three-dimensional, and Pretty/Janky. Defining molecules as extroverts may sound strange, but it really just comes down to a question of molecular recognition: we’ve noted that 4-bromopyrazole seems to bind to just about every protein and is thus an Extrovert while other compounds, being Introverts, fall into the category of “dark chemical matter” and never come up in screens.

As for the other dimensions, Practical Fragments has written previously about (Non)Sociable fragments as well as Flat fragments. This leads to the last dimension. Claims that beauty is in the eye of the beholder are undermined by the rigorous process of the MBTI, which places molecules such as curcumin squarely in the Janky category while approved drugs are self-evidently Pretty. Thus, toxoflavin is an ESFJ, while sotorasib is an INTP.

The utility of the MBTI remains to be established, but this has not stopped companies everywhere from applying it in their acquisition and evaluation processes. And other tests, such as the Decagram of Personality and the Big Six Personality Traits, are also becoming popular. Which do you prefer?

25 March 2024

Fragments vs DHODH

Rapidly proliferating cancer cells require a steady supply of nucleic acids, and cutting that off is a potential therapy. The enzyme dihydroorotate dehydrogenase (DHODH), which is important for pyrimidine synthesis, is thus an interesting drug target. In a recent ACS Med. Chem. Lett. paper, Lindsey DeRatt, Scott Kuduk, and colleagues at Janssen describe their approach.

The researchers had previously used virtual screening and structure-based drug design to develop compound 1, which is potent in both biochemical and cell-based assays. However, the molecule is highly effluxed by P-glycoprotein, which can limit both oral bioavailability and brain penetration. Thus, they turned to fragments.

An SPR screen (about which sadly no details are provided) yielded compound 2, and crystallography revealed that the amide carbonyl makes a similar contact to tyrosine 356 (Y356) as does the carbonyl in the triazolone moiety of compound 1. Merging these led to compound 4, which was considerably more potent than compound 2 but much less so than compound 1. However, further optimization led eventually to compound 25. Although less potent in an enzymatic assay than compound 1, compound 25 is equally effective in cells. It also has excellent pharmacokinetics in mice and – importantly – a considerably lower efflux ratio.

Interestingly, when the researchers solved the crystal structure of a related molecule bound to DHODH, they found that the carbonyl no longer interacts with Y356 but is instead flipped 180º and interacts with a different residue. The researchers conclude by stating that they are designing new molecules to reengage Y356, which could further improve potency.

Several lessons emerge from this brief paper. First, the flipped urea moiety is another reminder that fragments do not always maintain their orientations, as also seen here, here, and here. Second, information from the fragment was used not to improve potency but rather to address other aspects of an existing lead series, as seen here and here. And finally, one could argue that the only critical feature of the fragment remaining in the final molecule is the NH of the urea. But the fragment did cause the researchers to examine their molecules from a different perspective, resulting in a better series. Perhaps you could call this an example of fragment-assisted drug discovery. As is so often the case, fragments can inspire new ideas that may otherwise be overlooked.

18 March 2024

Fragments vs SHP2

One of the success stories we highlighted in last week’s summary of Fragments 2024 was the discovery of a potent inhibitor of SH2 domain-containing protein tyrosine phosphatase 2 (SHP2). James Day and colleagues at Astex and Taiho have just published the full account in J. Med. Chem.

Previous studies had shown that blocking SHP2 might be effective in certain cancers, particularly those dependent on mutant KRAS. As its name suggests, however, SHP2 is a phosphatase. This class of enzymes has highly charged active sites, which makes drug discovery notoriously difficult (see here for example). Indeed, a crystallographic fragment screen of the isolated phosphatase domain produced just one hit.

Simultaneously, the researchers performed NMR and crystallographic screens of the full-length protein, which contains two SH2 domains. This campaign was much more successful, with 88 crystallographically validated fragment hits. (Interestingly, a thermal shift assay of the same construct came up empty.) As Astex has previously reported, secondary binding sites on proteins are common, and SHP2 is no exception, with fragments binding to five sites. However, the vast majority – 83 of 88 – bound to what is called the tunnel region between the phosphatase domain and one of the SH2 domains.

The researchers note that “following completion of our Pyramid fragment screen, Novartis independently reported several SHP2 inhibitors” binding to the same site, which must have been both validating and irritating. Indeed, the Astex researchers did work on fragments binding to other sites, advancing one to a low micromolar inhibitor. But it’s hard to ignore a hot spot with dozens of bound fragments, and the tunnel region became their primary focus. One fragment was optimized to a low micromolar inhibitor. Another, fragment 3, had measurable affinity by ITC and respectable ligand-efficiency, and this was taken the furthest.

We’ve written previously about the importance of water in molecular interactions, and here the researchers performed solvent mapping molecular dynamics to identify water molecules that could be advantageously engaged. Scaffold hopping led to compound 15, and crystallography confirmed that the pyridine nitrogen forms a hydrogen bond to a water molecule. Increasing the lipophilicity around the phenyl ring and adding a basic amine to engage an electronegative region of the protein led to compound 18, with nanomolar biochemical activity and low micromolar activity in cells. Further structure-based design ultimately led to compound 28, with sub-micromolar cell activity. This compound has low efflux, low clearance and excellent oral bioavailability. When dosed orally in mouse xenograft models the molecule significantly inhibited tumor growth.

The exo-diastereomer of compound 28, in which the primary amine is facing down instead of up, shows interesting differences. It has a similar pKa as well as similar biochemical and cell-based activity but is plagued by high efflux and poor oral bioavailability. The researchers suggest that “steric shielding of the tropane bridge or pharmacophoric differences in efflux transporter recognition” may be responsible. There was considerable discussion at Fragments 2024 as to the precise source of the differences, but whatever the cause, this pair serves as a useful reminder that pharmacokinetics may vary dramatically even between nearly identical molecules.

Clinical development of SHP2 inhibitors has slowed due to a variety of reasons, including apparent on-target toxicity, but this is still a nice fragment-to-lead success story. Perhaps, as with capivasertib, it will just take time to find the right clinical strategy and patients who can benefit from these molecules.

11 March 2024

Fragments 2024

Last week saw the first of four dedicated fragment meetings this year: Fragments 2024, the 9^th RSC-BMCS Fragment-based Drug Discovery Meeting, was held in historic Hinxton Hall, Cambridge, UK. I won’t attempt to cover the 17 talks, 40+ posters, and 20 exhibitors in detail but just try to hit on some broad themes.

One highlight was a talk by Chris Swain, whose Cambridge MedChem Consulting has come up several times at Practical Fragments. Chris has been systematically cataloging fragment hits reported in the literature, and his database now includes >2500 fragments from >300 papers that hit 265 targets. This has not been easy: as we’ve noted in our annual F2L reviews, papers don’t always mention fragments in the title or abstract; sometimes you need to dig deep into the experimental methods to find out the origin of the initial hits, and even then there are questions of interpretation. Chris noted that the the drug aprepitat originated from a fragment-like pharamacophore extracted from a more complex literature compound. That story was published in 1998, predating the term “fragment-based drug discovery,” but perhaps it would be considered FBDD today.

The fragments themselves are a diverse bunch, with an average Tanimoto similarity of just 0.09, but there are small clusters. Looking at them in more detail, the ten most common scaffolds are aromatic (benzene, indole), which is a departure from approved drugs. There is also a significant fraction of charged molecules, including 298 acids and 348 basic groups. About 10% of the fragments hit more than one target, exactly what you would expect from the theory of molecular complexity.

Chris’s talk was followed by a wide-ranging panel discussion that expanded on some of these themes. Solubility was recognized as important, though with different techniques being more persnickety: Justin Dietrich (AbbVie) noted that pre-screening is critical for SPR, but for protein-detected NMR the protein is present at high enough concentrations to act as a “phase transfer reagent.”

The topic of thermodynamics also came up, with Chris Murray noting that Astex collects lots of ITC data but uses it for assessing free energy (ΔG) values rather than enthalpic energy (ΔH) values. Helena Danielson (Uppsala University) noted that the early correlation between compound quality and enthalpy found with HIV protease inhibitors did not seem to apply to other targets despite significant investment in collecting data at multiple companies, as also noted by Chris Smith (Mirati) and Mike Hann (GSK). Rod Hubbard (Vernalis) puckishly suggested that the study of ΔH had produced “more heat than light.”

The topic of MiniFrags also came up during the panel discussion. Chris Murray noted that they had been tried on quite a few targets but, as Rod Hubbard confirmed, were more helpful in identifying binding sites than providing starting points. But Chris Smith pronounced himself a “complete convert” after a MiniFrag identified an induced pocket on a previously intractable target where fragments (and other techniques) had failed.

Covalent fragments also made several appearances, with Jonathan Pettinger describing a phenotypic screen at GSK looking for compounds that block the pro-inflammatory M1 polarization of macrophages. After screening some 2000 covalent fragments they used chemoproteomics to determine that one of the best compounds acted by modifying cysteine 817 of the kinase JAK1. Interestingly, this is the same cysteine identified independently by researchers from Vividion, which could speak to the centrality of this target, the reactivity of this particular cysteine, or both.

Pursuing residues other than cysteine is seen as difficult, with Mike Hann noting in the panel discussion that these may require more extensive non-covalent interactions and Chris Murray noting that the warheads themselves were less attractive. But these challenges have not dissuaded Peter Cossar (Eindoven University of Technology), who has introduced cysteine-reactive disulfide and lysine-reactive aldehyde moieties into the same fragment to crosslink a 14-3-3 protein to substrate ERRγ.

Another theme was screening crude reaction mixtures in a “direct to biology” approach. Vernalis was an early adopter with their off-rate screening, and a talk by Lucie Guetzoyan confirmed that they are continuing to invest here not just with SPR but also with affinity-selection mass spectrometry and X-ray crystallography. Lucie also described using flow chemistry to enable sensitive organometallic chemistries such as Grignard and Negishi couplings. John Spencer (University of Sussex) is also using crude reaction screening by crystallography and thought the approach can compress ten years worth of work into a few months.

As with most conferences these days there were plenty of success stories. Martina Schaefer (Nuvisan) described the discovery of the Bayer SOS1 inhibitor BAY-293, which we wrote about here. Anna Vulpetti (Novartis) described the discovery of IL-1β inhibitors, which we wrote about here. Nicola Wilsher (Astex) described the discovery of potent SHP2 inhibitors; I’ll write more about these later. And Matthew Calabrese described the discovery of allosteric activators selective for the γ3 subunit of AMPK, which could avoid the cardiotoxicity seen with less selective molecules. Three HTS screens had failed but fragments ultimately led to a potent tool molecule. Interestingly, some of the HTS compounds were later found to be hits but had been overlooked because they were so weak that they did not rise above the noise of the assay.

Finally, Justin Dietrich described several success stories, including against TNFα (which we wrote about here) as well as CD40 ligand. Justin noted that FBLD is used alongside HTS and DEL at AbbVie, and that the techniques can be complementary – a theme noted by several others.

Despite being so intimately integrated with other discovery approaches, FBLD continues to innovate and evolve and remain sufficiently quirky that stand-alone meetings are still valuable and rewarding. I’m looking forward to seeing what the next several meetings reveal.

03 March 2024

The EU-OPENSCREEN fragment library

A well-curated fragment library is usually the starting point for fragment-based lead discovery, and not an insignificant investment. If you are just starting out you may want to use an existing library. One such option is described in an (open-access) paper in RSC Med. Chem. by Jordi Mestres and collaborators at IMIM Hospital del Mar Medical Research Institute and across Europe.

The EU-OPENSCREEN European Research Infrastructure Consortium (ERIC) allows researchers to access early lead discovery and chemistry resources. Among other components, it includes a set of more than 96,000 compounds for high-throughput screening, the European Chemical Biology Library, or ECBL. To complement this, the researchers have developed what they call the European Fragment Screening Library, or EFSL.

Recognizing that rapid follow-up is a critical next step in fragment-based lead discovery, the researchers designed EFSL based on ECBL. They did this by choosing fragments commercially available from Enamine that were sub-structures of ECBL members. Fragments were chosen to represent as much of the ECBL as possible, as well as for rule-of-three compliance. Fragments with multiple vectors for growing were also prioritized, similar to the “sociable fragments” concept we wrote about here. Finally, a set of 88 very small “minifrags” were also included.

Fragments were dissolved in deuterated DMSO at 100 mM (or 1000 mM for minifrags). Solubility and integrity were assessed at 1 mM (or 10 mM for minifrags) in PBS using ¹H-NMR using an internal standard; those with solubility < 0.2 mM were rejected, as were those with missing or extra peaks in the NMR spectra. Of 1056 compounds tested, 913 passed these QC criteria.

The EFSL is available for screening (via grant applications), and the paper summarizes the results of eight screens performed over two years using a range of detection technologies including crystallography, ligand-detected NMR, small-angle X-ray scattering, thermal shift, and BLI. Hit rates ranged form just 0.1% to 31.3%, though in the last case only a small subset of the library was tested.

After fragment screening and confirmation, four of the projects tested larger compounds from the ECBL in follow-up studies, and two were able to identify hits. One project targeting a bacterial beta-ketoacyl-ACP synthase 2 (FabF) used BLI to identify a fragment with a dissociation constant of 35 µM. Of the 147 compounds related compounds from the ECBL, two had slightly higher affinity, albeit at the expense of lower ligand efficiency. Perhaps exploring Enamine REAL Space as in this example would be more effective at finding significantly more potent molecules.

In summary, the EFSL seems to be a useful resource, particularly for academic labs. If you’ve got a target and no internal fragment-screening capabilities, it might be worth putting in an application.

26 February 2024

Fragments in the clinic: 2024 edition

It has been more than a year since our last list of fragment-derived clinical compounds. Since then capivasertib has been approved, bringing the number of marketed drugs to seven. There have also been a few other changes.

As always, this table includes compounds whether or not they are still in development (indeed, some of the companies no longer even exist). Because of this, the Phase 1 section contains a higher proportion of compounds that are no longer progressing. The full list contains 59 molecules, up slightly from 2022, with just under 40% approved or in active trials.

Drugs reported as still active in clinicaltrials.gov, company websites, or other sources are in bold, and the 37 that have been discussed on Practical Fragments are hyperlinked to the most relevant post. The list is almost certainly incomplete, particularly for Phase 1 compounds. If you know of others please leave a comment.

Drug	Company	Target
Approved!
Asciminib	Novartis	BCR-ABL1
Capivasertib	AstraZeneca/Astex/CR-UK	AKT
Erdafitinib	Astex/J&J	FGFR1-4
Pexidartinib	Plexxikon	CSF1R, KIT
Sotorasib	Amgen	KRAS^G12C
Vemurafenib	Plexxikon	B-RAF^V600E
Venetoclax	AbbVie/Genentech	Selective BCL-2
Phase 3
Lanabecestat	Astex/AstraZeneca/Lilly	BACE1
Navitoclax (ABT-263)	Abbott	BCL-2/BCL_xL
Pelabresib (CP-0610)	Constellation	BET
Verubecestat	Merck	BACE1
Phase 2
ASTX029	Astex	ERK1,2
ASTX660	Astex	XIAP/cIAP1
AT7519	Astex	CDK1,2,4,5,9
AT9283	Astex	Aurora, JAK2
AUY-922	Vernalis/Novartis	HSP90
AZD5991	AstraZeneca	MCL1
DG-051	deCODE	LTA4H
eFT508	eFFECTOR	MNK1/2
Indeglitazar	Plexxikon	pan-PPAR agonist
LY2886721	Lilly	BACE1
LY3202626	Lilly	BACE1
LY3372689	Lilly	OGA
LY517717	Lilly/Protherics	FXa
LYS006	Novartis	LTA4H
MAK683	Novartis	PRC2 EED
MK-8189	Merck	PDE10A
Onalespib	Astex	HSP90
PF-06650833	Pfizer	IRAK4
PF-06835919	Pfizer	KHK
PLX51107	Plexxikon	BET
S64315	Vernalis/Servier/Novartis	MCL1
VK-2019	Cullinan Oncology / Wistar	EBNA1
Phase 1
AG-270	Agios/Servier	MAT2A
ABBV-744	Abbott	BD2-selective BET
ABT-518	Abbott	MMP-2 & 9
ABT-737	Abbott	BCL-2/BCL_xL
AT13148	Astex	AKT, p70S6K, ROCK
AZD3839	AstraZeneca	BACE1
AZD5099	AstraZeneca	Bacterial topoisomerase II
BI 1823911	Boehringer Ingelheim	KRAS^G12C
BI 691751	Boehringer Ingelheim	LTA4H
CFTX-1554	Confo Therapeutics	AT₂ receptor
ETC-206	D3	MNK1/2
GDC-0994	Genentech/Array	ERK2
HTL0014242	Sosei Heptares	mGlu5 NAM
HTL0018318	Sosei Heptares	M1-receptor partial agonist
HTL9936	Sosei Heptares	M1-receptor partial agonist
IC-776	Lilly/ICOS	LFA-1
LP-261	Locus	Tubulin
LY2811376	Lilly	BACE1
Mivebresib	AbbVie	BRD2-4
MRTX1719	Mirati	PRMT5•MTA
Navoximod	New Link/Genentech	IDO1
PLX5568	Plexxikon	RAF
SGX-393	SGX	BCR-ABL
SGX-523	SGX	MET
SNS-314	Sunesis	Aurora
TAK-020	Takeda	BTK

19 February 2024

Hot spots real and imagined

Practical Fragments has written several times about “hot spots”: regions on proteins where small molecules and fragments readily bind. Knowing whether your target protein has a hot spot can help you decide whether to pursue the target in the first place. A variety of computational approaches have been developed for finding hot spots, most of which start with a crystallographically determined structure. In a new J. Chem. Inf. Mod. paper, Sandor Vajda and collaborators at Boston University and Stony Brook University ask whether computational models of proteins can also be used for one of the more popular methods, FTMap.

The researchers started with a set of 62 proteins, each of which had a published crystal structure bound to a fragment (MW < 200 Da) as well as to a larger molecule. The predicted structures of these proteins were then downloaded from the AlphaFold2 (AF2) site, and these models were truncated to correspond to the residues seen in the crystal structures to facilitate comparisons. The computational models were quite similar to the experimental models, particularly when comparing the positions of the peptide backbone atoms which define the overall shape of the proteins.

Next, the researchers applied the program FTMap, which computationally explores the surface of proteins using a set of 16 very small probes such as ethanol. Hot spots are regions where lots of probes bind, and the “hotness” of these spots correlates with the number of bound probes. FTMap assessed hotness on the AF2 structures and the crystallographicaly determined structures. (Before running FTMap, the bound ligands in the crystal structures were computationally removed.) Additionally, the researchers ran FTMap on unliganded crystal structures for the 47 proteins where these had been reported.

FTMap was broadly successful at finding the hotspots defined by bound fragments, succeeding 77% of the time starting with either the fragment-bound or unliganded structures and 71% starting with the AF2 models. Implementing stricter criteria (demanding the experimental fragment binding site be the top hot spot, for example) reduced the success to 56% for the crystallographic starting points and 47% for the AF2 models.

The paper discusses several examples in detail, in particular the two where the AF2 models were most different from the experimental models. Both of these were large, multidomain proteins. When AF2 models of just the ligand-binding domains were used, the models were significantly improved. This seems to be a generally useful hack: generating truncated AF2 models for other proteins also improved the performance of FTMap.

The utility of AF2 models for docking has been the subject of some debate, with some arguing that even though the overall protein folds may be accurate, local side chain conformations may be wrong, and a single side chain rotation may make the difference between ligand binding or not. This paper suggests that hot spots are not too sensitive to these subtleties, and that AF2 models can be used for finding hot spots.

12 February 2024

Fragment screening across the proteome, noncovalently

Last week we discussed methodological improvements to industrialize covalent fragment screening across the proteome. While I’m a huge fan of covalent binders, their noncovalent counterparts are the vanilla ice cream of FBLD: also tasty and much more common. Back in 2017 we described how “fully functionalized fragments,” or FFFs, could be used to screen noncovalent fragments in cells. A new paper in Nat. Chem. Biol. by Christopher Parker and collaborators at Scripps and BMS further optimizes the approach.

FFFs contain, in addition to the variable fragment, a photoreactive group (often a diazirine) and an alkyne tag. When exposed to light the photoreactive group can react with nearby proteins and the alkyne tag can be used to fish out the proteins. In the new paper the researchers started with a dozen FFFs.

One challenge, which we discussed in 2021, is that the FFFs may react with many sites on a given protein. During analysis, a protein is typically digested into peptides for mass spectrometry. If a FFF reacts at several sites on a peptide the resulting spectra will be “chimeric” and more difficult to characterize.

The researchers developed methods to take these chimeric spectra into account when searching for sites of modification. The approach, called Dizco (for diazirine probe-labeled peptide discoverer) can identify three times as many peptides as standard approaches, as well as more detailed information on sites of modifications.

Two pairs of FFF probes consisted of enantiomers, and these showed differential labeling across the proteome, consistent with specific molecular recognition. The researchers also confirmed binding of a few FFF probes to several proteins using a cellular thermal shift assay (CETSA).

In all, the probes modified 3603 peptides on 1669 proteins. The sites of modification were then mapped onto predicted or modeled three dimensional structures of the proteins. Importantly, and consistent with the 2017 work, most of the labeled sites were near predicted pockets. The researchers confirmed this for four proteins by showing that FFF probe binding could be competed by adding ligands known to bind to the pockets.

Next, the researchers docked (using AutoDock) their FFF probes onto 175 proteins (108 from structures in the Protein Data Bank and 67 from AlphaFold structures). They found that the docking experiments recapitulated the experimental data, and in fact often placed the diazirine tag near the protein residues found to react. Strikingly, and in another step forward for in silico approaches, docking against structures from AlphaFold was nearly as effective as those from the protein data bank.

As the researchers conclude, “we identified many binding pockets that have no reported ligands… these probes may serve as leads for further optimization.” It will be fun to see how far they go.

05 February 2024

Fragment screening across the proteome, industrialized

Last week we discussed covalent fragment screens against isolated enzymes, which can be very effective. But screening in cells or cell lysates preserves proteins in a more physiological environment and allows many proteins across the proteome to be screened simultaneously. In 2016 we wrote about covalent screens in human cell lysates which identified fragment hits for 758 cysteine residues in 637 proteins. Mass spectrometry techniques have improved since then in terms of both speed and sensitivity, as illustrated in a new Cell Chem. Biol. paper from Steve Gygi, Qing Yu, and collaborators at Harvard Medical School and Biogen. (Disclosure: Steve Gygi is on the Scientific Advisory Board of my current company, Frontier Medicines.)

The approach is called TMT-ABPP, or tandem mass tag activity-based protein profiling, and it involves multiple improvements to previous methods, some of which Steve discussed at the Discovery on Target meeting last year. Covalent fragments are added separately to cell lysate aliquots, after which a desthiobiotin iodacetamide (DBIA) probe is introduced. If a given site on a protein has reacted with a fragment, it will not be available to react with the DBIA probe.

Next, proteins are digested to peptides and labeled with TMT (tandem mass tag) reagents, which allow multiple samples (18 in this case, either individual fragments or DMSO-only controls) to be combined for simultaneous analysis. Peptides functionalized with the DBIA probe are captured on streptavidin resin while those that had previously reacted with a covalent fragment will not stick to the resin and be lost. Peptides eluted from the resin are then analyzed by mass spectrometry. The “competition ratio” between treated and untreated lysate gives a measure of how strongly a given site on a given protein is labeled by a fragment.

Multiple other tweaks, such as capturing proteins using magnetic beads and using a special type of mass-spectrometry (high-field asymmetric waveform ion mobility spectrometry, or FAIMS), further streamline the process to a 96-well plate format, with each well containing a mere 10-20 µg of cell lysate, as much as 100-fold less than earlier approaches.

The researchers benchmarked TMT-ABPP using three reactive “scout fragments,” including compound 1 from last week’s post. Collectively they identified 6813 cysteine residues hit by one or more of the scouts.

To demonstrate throughput, the researchers next screened 192 fragments, a third of which were acrylamides while the rest were chloroacetamides. Even with two controls for every 16 samples, this only required 12 injections on a mass spectrometer and resulted in hits against 38,450 cysteines, about 50-fold more than the 2016 paper. Proteins that were more highly expressed were better represented, as were proteins with known reactive cysteine residues, such as thioredoxins. Surprisingly though, surface-exposed cysteine residues were only slightly enriched over more buried cysteines.

The researchers also applied TMT-ABPP to five well-characterized covalent molecules, including the mutant KRAS^G12C inhibitor ARS-1620, which we wrote about here. In addition to the G12C site of KRAS, several other proteins were also liganded, including adenosine kinase (ADK). The researchers confirmed that ARS-1620 inhibited ADK in an enzymatic assay.

As the researchers note, “proteome-wide profiling of thousands of compounds remains a formidable challenge, both technically and financially.” This paper reveals how to significantly reduce the costs. By using such approaches, it is possible to build a catalog of fragment ligands for thousands of proteins. Doing so with a well-curated library could enable rapid fragment-to-lead campaigns.

29 January 2024

Covalent fragments vs a SARS-CoV-2 helicase

Last week we wrote about the difficulties of trying to understand even well-characterized covalent inhibitors of well-characterized targets. Most projects have far less information, as illustrated in a recent paper in J. Am. Chem. Soc. by Ekaterina Vinogradova, Tarun Kapoor, and collaborators at Rockefeller University and Sanders Tri-Institutional Therapeutics Discovery Institute, who report the first inhibitors of a particular SARS-CoV-2 enzyme.

The researchers were interested in helicases, enzymes that unwind DNA, RNA, or both. To do so, helicases cycle between “open” and “closed” forms, with conformational changes of as much as 15 Å. That dynamism complicates structure-based drug design, and many screens have yielded false positives. An irreversible covalent inhibitor that remained bound to the enzyme through its gyrations would potentially be easier to optimize.

The protein nsp13 from SARS-CoV-2 is essential for viral replication and thus an attractive drug target. The researchers started by testing previously reported and reactive “scout fragments” in a functional assay. Compound 1 inhibited the enzyme, and mass-spectrometry (MS) assays revealed that it modified three sites on the protein. Although multiple modifications are not desirable, the enzyme does contain 26 cysteine residues, so it could be worse. Peptide mapping and mutagenesis experiments revealed that modification of cysteine 556 (C556) is responsible for the inhibitory activity of compound 1.

A series of analogs culminated in compound 3b, which had low micromolar activity after a four hour incubation and also seemed more selective than compound 1, with less modification of other cysteine residues. The enantiomer of compound 3b was at least 6-fold less potent, suggesting molecular recognition rather than simple reactivity. In addition to nsp13, the researchers examined two mammalian helicases with disease relevance, WRN and BLM, and found that compound 3b was modestly selective for nsp13. (The researchers find different inhibitors for these two enzymes, though these are weaker and not as extensively characterized as those for nsp13.)

Cysteine 556 is not in the ATP-binding site and does not seem to be involved with RNA binding, and the researchers suggest that compound 3b may act allosterically. It seems to be highly conserved too, which might mean mutational resistance is less likely to evolve.

As the researchers acknowledge, compound 3b contains a chloroacetamide warhead, which is likely too reactive and unstable to move forward into in vivo studies, let alone the clinic. Also, had I reviewed the manuscript I would have requested the researchers to provide k_inact/K_I values rather than merely IC₅₀ values; a rough calculation using the methodology in this paper suggests a modest 10 M^-1s^-1 for compound 3b. That said, the discovery that liganding C556 inhibits nsp13 means that working to develop more potent and selective molecules may be worth the effort.

22 January 2024

Covalent complexities for kinase inhibitors

Covalent drugs are becoming increasingly popular. But as more researchers search for them, they may encounter pitfalls. A new paper in J. Med. Chem. by David Heppner and collaborators at the State University of New York Buffalo, AssayQuant Technologies, and Eberhard Karls Universität Tübingen provides a nice roadmap for avoiding them.

The researchers focus on covalent inhibitors of epidermal growth factor receptor (EGFR), a kinase that is frequently mutated in cancer. The first drugs against this target, such as erlotinib, were non-covalent, and these have been largely displaced by more effective covalent molecules such as afatinib. Unfortunately, these earlier drugs are not effective against a common mutant (T790M), spurring the development of third generation molecules such as osimertinib, which was approved by the FDA in 2015. Osimertinib has been extensively studied, with more than 2800 references in PubMed. Yet it is not as well understood as you might expect.

The team uses this system to demonstrate how characterizing irreversible inhibitors is not simple. For reversible enzyme inhibitors, researchers frequently discuss IC₅₀ values or, when they are being more precise, inhibition constants (K_i). The latter are in theory absolute values that do not depend on concentrations of cofactors such as ATP. But for irreversible inhibitors, the IC₅₀ values change depending on how long (and at what concentration) incubation occurs. The proper assessment of an irreversible inhibitor is k_inact/K_I, which takes into account both the irreversible inactivation step (k_inact) as well as the inhibition constant (K_I). Note that K_i is not the same as K_I; the former describes only the initial reversible association between protein and inhibitor, while K_I incorporates the irreversible step. Told you it was complicated!

And it gets worse. The researchers examined three irreversible covalent inhibitors under various conditions. In one condition, the inhibitors were pre-dissolved in 10% DMSO before being added to the assay mixture to give a final DMSO concentration of 1%. In another condition, the inhibitors were dissolved in pure DMSO before being added to the assay. Despite the final concentration of DMSO being the same (1%), the second condition gave k_inact/K_I values up to 11-times greater (more potent).

If subtle experimental variations in one lab can change values by more than an order of magnitude, you might expect the literature to vary even more, and you’d be right. In the case of osimertinib, the reported values of k_inact/K_I vary by nearly 500-fold. Some of the experimental parameters the researchers consider are concentrations of reducing agents such as DTT, which can react with covalent inhibitors, and serum albumin, which also contains a free cysteine residue. Although these did not seem to be problematic for osimertinib itself, they could affect other molecules.

Another consideration for kinases in particular is the concentration of the cofactor ATP. The value of k_inact/K_I itself will vary depending on [ATP], and the researchers describe how to calculate a “true” k_inact/K_I which could be used to compare the potency of a given inhibitor against the wild-type vs mutant forms of the enzyme. But while this is more theoretically rigorous, it may be less biologically relevant, since physiological ATP concentrations are less variable than differences in the Michaelis constant (K_M) for ATP for different kinases and mutants.

There is lots more to digest in this paper, including analyses of structure-kinetic relationships (SKR, akin to structure-activity relationships, or SAR) for different inhibitors and thorough experimental descriptions. The take-home message is that, due in part to different and often incomplete details, “potency measurements are generally difficult to compare among literature studies,” and “any potency assessments should include appropriate controls under the same conditions as the experimental inhibitors.”

15 January 2024

What makes molecules aggregate?

The propensity for some small molecules to form aggregates in water has bedeviled fragment-finding efforts for decades. Indeed, the phenomenon was not fully recognized until early this century. Although plenty of tools are available for detecting aggregates, I still see too many papers that omit these crucial quality controls. As annoying as aggregation can be in activity assays, in certain cases it could actually be useful for formulating drugs. There has been speculation that the good oral bioavailability of venetoclax is due to aggregation. But despite computational methods to predict aggregation, the structural features of molecules that cause them to aggregate are still not well understood. In a new open-access Nature Comm. paper, Daniel Heller and collaborators at Memorial Sloan Kettering Cancer Center and elsewhere provide some answers.

The researchers had previously published an article describing how indocyanine green (ICG) could be used to stabilize and visualize aggregates, and they applied the same technique to examine the aggregation potential of a small set of fragments. Benzoic acid and 2-napthoic acid did not aggregate, while 4-phenylbenzoic acid did. Intrigued, the researchers tested a set of 14 4-substituted biphenyl fragments and found that those containing both a hydrogen bond donor and acceptor, such as acids, sulfonamides, amides, and ureas, could aggregate, while those containing only donors (aniline) or acceptors (nitrile) did not.

Fourier transform infrared spectroscopy was used to examine the stretching region of the carbonyl of 4-phenylbenzoic acid in various states: in an aqueous aggregate, in solution in either t-butanol or DMSO, or in the solid state. Interestingly, the aggregate most resembled the solid state, consistent with close-packed self-assembly as opposed to free in solution.

From all this, the researchers hypothesized that a combination of aromatic groups and hydrogen bond donors and acceptors was necessary for aggregation. However, having these features does not mean aggregation is inevitable. Neither 3-phenylbenzoic acid nor 2-phenylbenzoic acid formed aggregates, with the former precipitating while the latter remained completely soluble. These three phenylbenzoic acid isomers behave very differently despite the fact that they have the same calculated logP values, and the suggestion is that the latter two molecules are less able to form pi-pi stacking interactions that lead to stable aggregation.

Next the researchers examined the approved drug sorafenib, which had previously been shown to aggregate. This was confirmed, and the aggregates were characterized with a battery of biophysical methods including dynamic light scattering, transmission electron microscopy, and X-ray scattering, along with molecular dynamics simulations. The conclusion is that sorafenib forms amorphous aggregates whose assembly is driven by a combination of pi-pi stacking and hydrogen-bonding. A series of sorafenib analogs was synthesized, and those that could not form strong intermolecular hydrogen bonds were less prone to aggregation.

All of this is fascinating from a molecular assembly viewpoint and will help to explain and predict which compounds are likely to aggregate, for better or for worse. But as of now, experimental assessment is still best practice for any new compound.

08 January 2024

Electrophilic MiniFrags vs HDAC8

In fragment-based lead discovery, small is good – at least down to a certain point. While most fragments consist of between 7 and 20 non-hydrogen atoms, some investigators have built libraries of much smaller fragments with at most 7 or 8 heavy atoms. We’ve written about MiniFrags and MicroFrags, which are typically screened crystallographically at high concentrations to find hot spots. In a new open-access J. Med. Chem. paper, Franz-Josef Meyer-Almes, György Keserű, and collaborators at the Budapest University of Technology and Economics, the University of Applied Sciences Darmstadt, and the University of Veterinary Medicine Vienna have applied the concept to covalent fragments.

The researchers started with a set of 84 fragments, all heterocycles functionalized with one of six warheads, which we wrote about here. They systematically methylated nitrogen atoms on some of these to generate 58 more fragments containing obligate positive charges, such as compound B6+ below. The intrinsic reactivity of the fragments was assessed by reacting them with the biologically relevant thiol glutathione (GSH).

Methylating the heterocycles made them more electrophilic and thus more reactive. For example, only 16 of the 84 non-methylated fragments had a half-life (t_1/2) < 48 hours against GSH, in contrast with 30 of the 58 methylated fragments. In fact, 17 of the methylated fragments had t_1/2 < 10 minutes.

Next, all 142 fragments were screened at 250 µM for 2 hours at 30 ºC in a biochemical assay against histone deacetylase 8 (HDAC8), an enzyme important for cell cycle progression. Hits were confirmed in dose-response experiments after 1 hour pre-incubation. Consistent with the glutathione data, only 12 of the non-methylated compounds showed IC₅₀ < 50 µM, while 54 of the 58 methylated compounds were active. One of the fragments, B6+, had a k_inact/K_I value of 4006 M^-1s^-1, not far from that found in approved covalent drugs.

HDAC8 contains ten cysteine residues, and sites of modification were determined using both site-directed mutagenesis as well as tryptic digestion followed by mass spectrometry. In total, seven residues could be labeled by one or more fragments. The most reactive cysteine, C153, is close to the binding site of a previously reported inhibitor (compound 1), and the researchers tried merging reactive fragments such as B6+ onto this molecule. The best molecule, compound 3, had a k_inact/K_I value of 1566 M^-1s^-1. However, the drop from B6+ alone suggests that the non-covalent affinity component of compound 1 may have been lost.

This is an interesting approach, and as the researchers note, activity assays available for covalent fragments are higher-throughput than the crystallographic screens required for MiniFrags and MicroFrags. On the other hand, there are limitations. For one thing, the obligate positive charge on the methylated fragments could overwhelm other properties, and could even lead to denaturation of proteins at high concentrations, rendering screens uninformative. These fragments are also less likely to be cell permeable.

Finally, as we wrote ten years ago, characterizing irreversible covalent fragments presents a challenge in deconvoluting intrinsic reactivity from specific binding. Computational mapping of hot spots on HDAC8 using FTMap revealed that some correlate with modified cysteine residues. But other modified cysteine residues are in surface-exposed flexible loops with no nearby pockets, and hits against these are likely not advanceable. The fact that some of the fragments modify as many as five cysteine residues on HDAC8 suggests they may be too reactive.

Still, the systematic characterization of this library is useful experimentally and for training models. It will be interesting to see it deployed against additional protein targets.

02 January 2024

Fragment events in 2024

We don't know for sure what 2024 has in store for us, but barring pandemics or other disasters, the year is shaping up to be an annus mirabilis for fragments. For the first time ever, all four of the recurring fragment meetings are scheduled for the same year, and other conferences also look exciting. I hope to see you at one.

March 3-5: RSC-BMCS Ninth Fragment-based Drug Discovery Meeting will be held in Cambridge, UK. This venerable biannual event will be particularly focused on case studies "that have delivered compounds to late stage medicinal chemistry, preclinical, or clinical programmes." You can read my impressions of the 2013 meeting here and the 2009 event here.

April 1-4: CHI’s Nineteenth Annual Fragment-Based Drug Discovery, the longest-running fragment event, returns as always to San Diego. This is part of the larger Drug Discovery Chemistry meeting. You can read impressions of the 2023 meeting here, the 2022 event here, the 2021 virtual meeting here, the 2020 virtual meeting here, the 2019 meeting here, the 2018 meeting here, the 2017 meeting here, the 2016 meeting here; the 2015 meeting here, here, and here; the 2014 meeting here and here; the 2013 meeting here and here; the 2012 meeting here; the 2011 meeting here; and 2010 here.

June 2-4: The theme of the Tenth NovAliX Conference, to be held in the Swiss resort town of Brunnen, is "reinventing drug discovery." You can read my impressions of the 2018 Boston event here, the 2017 Strasbourg event here, and Teddy's impressions of the 2013 event here, here, and here.

June 25-27: FBDD Down Under 2024 will take place in beautiful Brisbane. I believe this is the fifth FBDD DU event and the first to be held outside Melbourne. You can read my impressions of FBDD DU 2019 and FBDD DU 2012.

September 22-25: After a six year hiatus, FBLD 2024 will be held in Boston. This will mark the eighth in an illustrious series of conferences organized by scientists for scientists. You can read impressions of FBLD 2018, FBLD 2016, FBLD 2014, FBLD 2012, FBLD 2010, and FBLD 2009.

September 30 to Oct 3: Autumn is usually a nice time of year in Boston, so why not stick around to attend CHI’s Twenty-Second Annual Discovery on Target. As the name implies this event is more target-focused than chemistry-focused, but there are always plenty of FBDD-related talks. You can read my impressions of the 2023 meeting here, the 2022 meeting here, the 2021 event here, the 2020 virtual event here, the 2019 event here, and the 2018 event here.

Know of anything else? Please leave a comment or drop me a note.