Introduction

Small molecule covalent inhibitors have a rich history spanning over a century, with their mechanism initially elucidated through the study of aspirin, a classic drug. Today, this approach is emerging as a therapeutic modality, prized for its potency and long-lasting irreversible binding nature. A significant milestone was reached in 2013 with the FDA approval of ibrutinib, the first covalent inhibitor targeting Cys481 on Bruton’s Tyrosine Kinase (BTK), for mantle cell lymphoma (MCL)1. Subsequently, covalent BTK inhibitors have evolved through several generations, each improving upon both potency and selectivity2.

This progress has catalyzed the discovery of covalent inhibitors targeting other proteins. Notable examples include osimertinib, which targets EGFR3,4, and sotorasib, which targets KRasG12C5,6, a protein previously considered undruggable. Recently, advancements in chemoproteomics platform7,8,9,10,11 together with novel covalent warhead chemistry12,13,14,15,16,17 is rapidly accelerating the discovery of covalent binders and their targets, efficiently expanding the druggable protein universe18,19,20,21. These advancements underscore the growing importance and potential of covalent inhibitors in modern drug discovery and development.

The current structural-based design strategy for covalent irreversible binders encompasses two key components: a core pharmacophore that facilitates basal reversible binding within the target protein pocket, and a covalent warhead capable of accepting nucleophilic attacks from reactive side chain groups, such as thiols on cysteine residues or phenols on tyrosine residues, to form a covalent bond. An effective covalent small molecule should possess substantial non-covalent binding affinity, ensuring sufficient residence time on the target to enhance the probability of the subsequent covalent binding event. While pursuing warheads with higher intrinsic reactivity, often referred to as ‘hot warheads’, can significantly increase the rate of covalent bond formation, this approach also elevates the risk of promiscuous off-target labeling7,22. Consequently, this may lead to a higher covalent binding burden at the proteome level. Optimizing the non-covalent interactions to achieve target specificity, while maintaining a warhead with appropriate reactivity, is crucial for developing covalent inhibitors with favorable pharmacological profiles and minimal off-target effects.

Unlike reversible inhibitors whose potency can be described by dissociation constant \({K}_{d}\), irreversible covalent inhibitors follow a two-step irreversible binding kinetics (Eq. (1)) - the first step is the affinity-driven non-covalent binding (rate constant \({k}_{{{\rm{on}}}}\) and \({k}_{{{\rm{off}}}}\)), and the second irreversible step is the covalent bond formation (rate constant \({k}_{{{\rm{inact}}}}\)). Ranking the potency of different inhibitors is based on the second-order rate constant for covalent adduct formation, or in short, inactivation efficiency \({k}_{{{\rm{eff}}}}\), which equals the maximum rate of covalent adduct formation over inactivation constant \({k}_{{{\rm{inact}}}}/{K}_{I}\) (M−1·s−1), where \({K}_{I}\) is the equilibrium constant of the first step (Eqs. (2) and (3)). A potent covalent inhibitor must exhibit both significant intrinsic reactivity (reflected by \({k}_{{{\rm{inact}}}}\)) and strong non-covalent binding affinity (reflected by KI). Due to the variability of the warhead intrinsic reactivity, another layer of complexity must be considered when trying to explain covalent structure-activity relationship (covSAR). Higher intrinsic warhead reactivity means a faster \({k}_{{{\rm{inact}}}}\), but meanwhile also increases the risk of off-target labeling, as well as a faster clearance rate in vivo through GSH conjugation23. Optimization efforts should prioritize decreasing \({K}_{I}\) to achieve tighter binding rather than switching to a more reactive warhead to push for a higher \({k}_{{{\rm{inact}}}}\).

$$E+I\begin{array}{c}{k}_{{on}}\\ \rightleftharpoons \\ {k}_{{{\rm{off}}}}\end{array}{EI}\begin{array}{c}{k}_{{{\rm{inact}}}}\\ \longrightarrow \\ \,\end{array}{{EI}}^{*}$$
(1)
$${K}_{I}=\frac{{k}_{{{\rm{off}}}}+{k}_{{{\rm{inact}}}}}{{k}_{{{\rm{on}}}}}$$
(2)
$${k}_{{{\rm{eff}}}}=\frac{{k}_{{{\rm{inact}}}}}{{K}_{I}}=\frac{{k}_{{{\rm{on}}}}\cdot {k}_{{{\rm{inact}}}}}{{k}_{{{\rm{off}}}}+{k}_{{{\rm{inact}}}}}$$
(3)

Generally, two types of methods for calculating \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) are proposed based on whether the protein has been pre-incubated with the covalent inhibitor before adding the substrate. For the co-incubation methods, the reaction is initiated by addition of enzyme to a mixture of inhibitor and substrate, and the signal indicating product formation is recorded continuously (progression curve of time-dependent product formation for enzyme inhibition)24 or obtained through a development/quenching step before readouts25,26,27. Alternatively, pre-incubation methods take advantage of a simplified system where only the drug can interact with the protein in the pre-incubation stage, eliminating any competition from the substrate. After the pre-incubation, the substrate is added, and the fraction of remaining uninhibited proteins can be quantified by comparing the product formation rate with the uninhibited control sample28,29. The choice of which experimental method to use mainly depends on the binding nature of the inhibitor (potency, or the time scale of the reaction) and the assay format (continuous reading or need post-processing). Nevertheless, all these methods require an activity-based assay for readouts of product formation.

For non-enzyme protein covalent inhibitors, \({K}_{I}\) and \({k}_{{{\rm{inact}}}}\) can be measured using a competition-based assay with a specific chemical probe that binds to the same pocket. This method allows for the calculation of protein occupancy by the covalent inhibitor through continuous monitoring. Fluorescent polarization probes binding to KRas switch II pocket30 and TR-FRET probes binding to kinase ATP pockets31 have been utilized to calculate covalent inhibitor binding kinetics. Alternatively, Craven et al. developed a fluorescence-based platform using thiol-reactive probe for high-throughput kinetic analysis through quantifying the level of covalent adduct32,33. However, this approach is only suitable for proteins with a single reactive cysteine, which significantly limits its potential applications. Also, one should always be cautious of any potential fluorescence artifacts, such as compound inner filtering effect especially at high concentrations, which can lead to questionable quantitative results. Recently, mass spectrometry (MS)-based quantification methods were reported either on the intact protein-adduct level30,34,35 or on the peptide-adduct level36. While these MS-based approaches eliminate the need for a chemical probe, prior knowledge of the off-target landscape and considerable efforts to express the target proteins are still required. As a result, measuring the kinetic parameters for all potential off-target proteins would be an extensive and labor-intensive task.

Here, we report COOKIE-Pro (COvalent Occupancy KInetic Enrichment via Proteomics), an unbiased method for irreversible covalent inhibitor binding kinetics profiling on the proteome scale, which is suitable for both enzyme and non-enzyme targets, and eliminating the need for individual protein purification by enabling direct treatment of permeabilized cells. To ensure accurate kinetic measurements, permeabilized cells are preferred instead of cell lysates. This approach is ideal because it preserves the natural environment of protein complexes while removing the experimental variability that arises when small molecules permeate the cell at inconsistent rates. As proof-of-concept, we use BTK covalent inhibitors, ibrutinib and spebrutinib, and their desthiobiotin derivatives to calculate the \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) of the BTK inhibitors at the whole proteome scale. We also demonstrate that the data analysis method presented here is readily compatible with streamlined cysteine activity-based protein profiling (SLC-ABPP) datasets7. One can easily convert the experimental condition-dependent competition ratio (CR) values of a fragment from the SLC-ABPP dataset into thermodynamic binding constants \({K}_{I}\) against the quantified cysteinome. This compatibility streamlines the data processing, enabling a more efficient translation of competition data into meaningful kinetic parameters and prioritization for downstream medicinal chemistry optimization. In addition, we propose a decision tree for the estimation of \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values using as few as two data points, allowing for profiling 8 compounds in each TMT-18plex run.

Results

Design of COOKIE-Pro workflow

To measure the \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) for a specific compound on the proteome scale, we need to solve two problems: 1) since MS can only identify the covalent adduct \(E{I}^{*}\) but not non-covalent intermediate \({EI}\), how can \(\left[E{I}^{*}\right]\) be translated into \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\), and 2) how to determine \(\left[E{I}^{*}\right]\) for the whole proteome in a single experiment.

For the first problem, although \(E{I}^{*}\) can be monitored by MS, achieving absolute quantification requires standard peptides with known concentrations to calibrate \(\left[E{I}^{*}\right]\). This approach is not feasible on a proteome-wide scale due to the impracticality of obtaining standard peptides for every protein in the proteome. The covalent compound occupancy of a certain protein (defined as \(\frac{\left[E{I}^{*}\right]}{\left[{E}^{o}\right]}\)) can, however, be calculated through carefully designed proteome experiments, as we will discuss below. We derived the expression for protein covalent occupancy concerning treatment time t and compound concentration [I] (Fig. 1A).

Fig. 1: Solving kinact and kl using two-step model fitting and in-silico protein-ligand adduct EI* formation kinetics simulation.
Fig. 1: Solving kinact and kl using two-step model fitting and in-silico protein-ligand adduct EI* formation kinetics simulation.
Full size image

A The math expression for protein covalent occupancy \(\frac{\left[{{E}}{{{I}}}^{{{*}}}\right]}{\left[{{{E}}}^{{{o}}}\right]}\) concerning incubation time \({{\boldsymbol{t}}}\) and compound concentration \({{[}}{{I}}{{]}}\). B Two-step fitting scheme to obtain \({{{k}}}_{{{\rm{inact}}}}\) and \({{{K}}}_{{{I}}}\). Protein occupancy \(\frac{\left[{{E}}{{{I}}}^{{{*}}}\right]}{\left[{{{E}}}^{{{o}}}\right]}\) at different timepoints for a single drug concentration \({{[}}{{I}}{{]}}\) will be used to obtain the apparent \({{{k}}}_{{{\rm{obs}}}}\) value at this concentration through the first one-phase association fitting. Then each (\({{[}}{{I}}{{]}}\), \({{{k}}}_{{{\rm{obs}}}}\)) was plotted, and the second Michaelis–Menten fitting will be performed to obtain \({{{k}}}_{{{\rm{inact}}}}\) and \({{{K}}}_{{{I}}}\) value simultaneously. C MATLAB simulation conditions and fitting results confirmed the proposed method is applicable to a wide range of compound affinities (fragments or inhibitors) and reactivity in real world settings.

This approach allows for the determination of occupancy levels without requiring absolute quantification of the \(E{I}^{*}\) complex. Of note, this expression is based on pseudo first-order reaction kinetics without inhibitor depletion \(\left[I\right]\gg [{E}^{o}]\), as well as the steady state of \({EI}\) non-covalent complex (see method mathematics part for detailed explanation).

With the expression in Fig. 1A, one can obtain the \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values through a two-step fitting process involving multiple experimental data points with varying incubation times (t) and covalent inhibitor concentrations ([I]) (Fig. 1B). To confirm whether the steady state approximation holds true for compounds within the realistic range of the \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values, we performed simulations for both low non-covalent affinity fragments (\({K}_{I}\) = 10 µM) and high non-covalent affinity inhibitors (\({K}_{I}\) = 10 nM) with varying \({k}_{{{\rm{inact}}}}\) (Fig. 1C). The simulation results demonstrate good coherence between the theoretical and calculated \({k}_{{{\rm{inact}}}}\) and KI, indicating the approximation we used here is valid for the typical range of current covalent inhibitor kinetic parameters.

With the data analysis method solved, we set out to design an experimental protocol for determining the occupancy \(\frac{\left[E{I}^{*}\right]}{\left[{E}_{{{\rm{total}}}}\right]}\) on the global proteome. The sample preparation workflow of COOKIE-Pro (COvalent Occupancy KInetic Enrichment via Proteomics) is shown in Fig. 2. Generally, this is a preincubation-based method. In the first preincubation step, proteins in permeabilized cells react with the compound-of-interest with varying concentration and incubation time, occupying those pockets irreversibly. However, these protein-compound adducts are not easy to quantify by untargeted MS given the substantial background proteins in the whole cells. Therefore, instead of quantifying protein-compound adducts, we introduced a second incubation step using saturating concentration and incubation time of a pulldown probe followed by streptavidin bead pulldown to quantify any unreacted target proteins from the first step. This step simultaneously lyses the cell to minimize any secondary cellular response due to high concentration probe-induced oxidative stress. The desthiobiotin (DTB) probe is derived from the compound-of-interest by modifying the solvent-exposed region to mimic the covalent warhead reactivity and inhibitor affinity and selectivity. This step significantly reduces the sample complexity and improves both the depth and quantification accuracy of the protein identifications. The pulldown proteins undergo standard on-bead proteomics sample preparation workflows, including reduction, alkylation and trypsin digestion. Quantitation of the pulldown protein level among all the samples is achieved through tandem mass tag (TMT) multiplex labeling to eliminate injection-to-injection variation.

Fig. 2: Covalent Occupancy Kinetics Enrichment via Proteomics (COOKIE-Pro) workflow using spebrutinib and its desthiobiotin probe SB-2 as an example.
Fig. 2: Covalent Occupancy Kinetics Enrichment via Proteomics (COOKIE-Pro) workflow using spebrutinib and its desthiobiotin probe SB-2 as an example.
Full size image

Created in BioRender. Lin, H. (2025) https://BioRender.com/3zmk1a2.

Each experiment has two control samples – DMSO control and saturating control (blue and red samples in Fig. 2). DMSO control quantifies the total probe-labeled proteome in the absence of any inhibitor competition, essentially indicates the total amount of any on-target and off-target proteins in the sample (regarded as 0% occupancy). The reported abundance ratios for a certain protein between any drug-treated sample and DMSO control sample \(\frac{{I}_{c,{t}}}{{I}_{{{\rm{DMSO}}}}}\) can be directly regarded as the unoccupied fraction of this protein (\(\frac{\left[{E}_{{{\rm{total}}}}\right]-\left[E{I}^{*}\right]}{\left[{E}_{{{\rm{total}}}}\right]}\)) at this specific incubation time \(t\) and concentration \(c\). It can be easily converted to occupancy \(\frac{\left[E{I}^{*}\right]}{\left[{E}_{{{\rm{total}}}}\right]}\) for the downstream fitting process to obtain \({k}_{{{\rm{inact}}}}\) and KI. The saturating control, however, uses high concentration and long-enough incubation time of the covalent inhibitor to make sure that all the pockets in target proteins have been covalently pre-occupied, and the probe is no longer able to bind and pulldown these proteins. The purpose of setting up this control is to define the MS signal baseline – any ratio reported in the drug-treated groups \(\frac{{I}_{c,{t}}}{{I}_{{{\rm{DMSO}}}}}\) should subtract the baseline ratio \(\frac{{I}_{{{\rm{saturation}}}}}{{I}_{{{\rm{DMSO}}}}}\), and then convert to \(E{I}^{*}\) occupancy.

Benchmarking COOKIE-Pro with high selectivity BTK covalent inhibitor spebrutinib

To assess the entire COOKIE-Pro workflow, we chose the second generation BTK irreversible inhibitor spebrutinib (CC-292) as a benchmark compound to identify both the known target BTK and potentially undiscovered off-targets. Since substitutions on the α- and β-carbon of the acrylamide will significantly impact the warhead reactivity37,38,39, we avoided any modification on these positions and designed the paired desthiobiotin probe SB-2 along the methoxy exit vector of spebrutinib (Fig. 3A). Ramos cell line (B cell lymphoma) was chosen due to its high expression of the target protein BTK. To minimize any concentration interference due to cell membrane permeability but keep the proteome concentrated in the cellular environment, the cells were permeabilized with digitonin40 before spebrutinib preincubation. We confirmed through microscope using trypan blue exclusion method that permeabilization happens within 30 s without any apparent disruption of Ramos cell morphology (data not shown). The permeabilized cells underwent two rounds of incubation: 1) For spebrutinib, we set up 5 different concentrations ranging from 25 nM to 400 nM and collected the sample from the incubation tubes every 2 min for 3 time points, generating a total of 15 dose-time-dependent samples plus 1 for each control sample. 2) Each sample containing ~1.8 M cells were then lysed in the NP-40 lysis buffer containing 5 µM SB-2 probe and incubated for 1 h. The BTK amount in the pulldown samples showed good dependency on spebrutinib concentration and incubation time, while no BTK was detected in the saturation control pulldown sample (Fig. 3B). With the sample quality confirmed, we then investigated the enrichment on the proteome level of these samples by performing TMT-based quantification. Among the 412 protein groups identified, 363 protein groups are quantifiable. The top-ranked protein groups not only showed high enrichment ratio between the DMSO and 5 µM saturation control groups but also across the entire concentration panel. These include TEC kinase and BTK (Fig. 3C). Some other kinases like BLK, JAK3, NEK9 and LIMK1, only shows altered enrichment ratio between the DMSO and 5 µM saturation control groups, indicating they are insignificant off-targets of spebrutinib (Supplementary Fig. 1B, Supplementary Data 1). To minimize sample-to-sample variation, we took advantage of an endogenous biotinylated protein acetyl-CoA carboxylase 1 (ACC1) which is highly and equally enriched among all the pulldown samples (Supplementary Data 1). We normalized all the protein levels according to the ACC1 levels as an internal reference. Out of the top-ranked proteins, only BTK and TEC kinases showed dose- and time-dependent enrichment within our dosing scheme. Notably, TEC kinase demonstrated stronger binding to spebrutinib than BTK, with even the lowest concentration and shortest incubation time achieving ~80% occupancy (Fig. 3D). Although only 1 peptide spectrum match (PSM) was quantified for TEC, we have verified this strong binding of spebrutinib to TEC through TEC immunoblotting of the pulldown sample (Supplementary Fig. 1A). With the occupancy data, we proceeded to COOKIE-Pro analysis using the one-phase association fitting to obtain \({k}_{{obs}}\) (Fig. 3E) and Michaelis–Menten fitting to obtain \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values (Fig. 3F). The previously reported spebrutinib \({k}_{{{\rm{inact}}}}/{K}_{I}\) value against BTK (8.28 ×104 M−1s−1)31 is very close to our fitting result (9.15 ± 0.62 ×104 M−1s−1), demonstrating the accuracy of the COOKIE-Pro workflow. We also found that spebrutinib is a very potent TEC binder which shows near 10-fold \({k}_{{{\rm{inact}}}}/{K}_{I}\) against TEC (7.06 ± 2.48 ×105 M−1s−1) compared with BTK. With similar \({k}_{{{\rm{inact}}}}\) values, the difference of inactivation potency on BTK and TEC mainly comes from the difference of the non-covalent binding affinity as indicated in \({K}_{I}\).

Fig. 3: COOKIE-Pro profiling for spebrutinib in Ramos cell using SB-2 desthiobiotin probe.
Fig. 3: COOKIE-Pro profiling for spebrutinib in Ramos cell using SB-2 desthiobiotin probe.
Full size image

A Chemical structures of Spebrutinib (Speb) and the SB-2 probe. B Spebrutinib COOKIE experiment layout and the immunoblot for BTK in the pulldown samples. Blots are representative for three independent experiments. C Ranked proteomics -log2 transformed ACC1-normalized intensity ratio between 5 µM spebrutinib saturation sample and DMSO sample as an off-target protein list. D ACC1-normalized intensity ratio heatmaps for spebrutinib kinase off-targets across the COOKIE panel. E Representative first step progressive curve fitting for \({k}_{{{\rm{obs}}}}\) at different spebrutinib concentrations against BTK and TEC. F Second step [I]-kobs plots fitting into Michaelis–Menten model (MM-fitting) to obtain \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values for BTK and TEC. Data are presented as mean ± SEM from three independent experiments. Source data are provided as a Source Data file.

Optimizing COOKIE-Pro workflow with low selectivity BTK covalent inhibitor ibrutinib

With the accuracy of COOKIE-Pro confirmed, we proceeded to the next case study compound. Ibrutinib, the first generation BTK covalent inhibitor, is thought to be a more challenging case not only due to its relatively poor selectivity, but also the reported affinity value against BTK is in the tight binding range. The progression curve-based BTK kinase activity assay failed to detect any kinase activity even when ibrutinib concentration was reduced to 9 nM (Supplementary Fig. 2A). Any concentration lower than this will violate pseudo first-order reaction assumption (\(\left[I\right]\) should \(\gg \left[E\right]\)), making it impossible to fit for kinetic parameters. The case study of ibrutinib provided valuable insights into the optimization process for COOKIE-Pro and general rules of probe design, as will be described below.

We initially designed two sets of desthiobiotin probes for ibrutinib: one has the linker attached to the β-carbon position of the acrylamide (IB-ac-DTB), and in the other set the linker was attached to the ortho-position on the piperidine ring (IB-C5-DTB and IB-C8-DTB) according to the available structure (PDB ID: 5P9I and 5P9J)41. As expected, IB-ac-DTB showed weaker inhibition potency in the BTK kinase activity assay while IB-C5-DTB and IB-C8-DTB maintained a similar potency with ibrutinib (Supplementary Fig. 2A). With LC-MS, we confirmed the loss of activity of IB-ac-DTB was due to the reduced warhead reactivity (Supplementary Fig. 2B). This leads to a rule-of-thumb for pulldown probe design: avoid any modifications to the warhead moiety, even if structural information suggests that the warhead is solvent accessible. Maintaining the intrinsic reactivity of the warhead is crucial for ensuring the effectiveness of the probe.

We then switched to IB-C5-DTB and IB-C8-DTB probes. An interesting problem we encountered for IB-DTB probes was that the BTK pulldown amount no longer shows dependency on ibrutinib or spebrutinib pre-incubation, but spebrutinib-based SB-2 probe showed robust dependency on pre-incubation as expected (Supplementary Fig. 3A). We then tried lowering the concentration of the IB-DTB probes and found that only at 5 µM of either IB-DTB probe, but not at 1 µM or 0.2 µM, has significant non-specific labeling and decoupling of BTK pulldown from IB treatment was observed (Supplementary Fig. 3B). This experiment demonstrates that probes at high concentration may lose specificity and randomly react with other cysteines on the protein surface which were not pre-occupied in the first-round pre-incubation. Such non-specific labeling undermines the effectiveness of the pulldown assay and highlights the importance of optimizing probe concentration to ensure specificity.

Our probe design investigations yielded valuable insights regarding linker length. We evaluated the pulldown efficiency of IB-C5-DTB and IB-C8-DTB at varying concentrations under native pulldown conditions (Supplementary Fig. 3C). IB-C5-DTB demonstrated the ability to pulldown BTK only at 5 μM; however, the presence of β-actin in the pulldown suggested non-specific labeling. In contrast, IB-C8-DTB successfully pulled down BTK at all tested concentrations, although at 5 μM, a small fraction of BTK could not be competed off by ibrutinib pre-incubation, presumably due to non-specific labeling. These observations underscore the critical importance of incorporating a sufficiently long linker for effective pulldown under native conditions. Recognizing the crucial role of linker length, we hypothesized that steric hindrance between the pulldown target and streptavidin might be the primary limiting factor. To address this, we postulated that employing denaturing pulldown conditions could potentially enhance pulldown efficiency by mitigating steric hindrance and facilitating improved protein engagement. We systematically evaluated various denaturing conditions and discovered that a 0.1% SDS solution not only maintains compatibility with streptavidin beads but also significantly outperforms both native and other denaturing conditions (Supplementary Fig. 4). This finding represents a significant advancement in our probe design strategy, potentially expanding the utility and efficacy of our pulldown assays. These results highlight the importance of optimizing both linker length and pulldown conditions in the development of effective covalent probes, providing a foundation for improved target engagement and specificity in future studies.

Leveraging the insights gained during optimization, we benchmarked COOKIE-Pro for ibrutinib using the IB-C8-DTB probe (Fig. 4A). The optimized protocol involved two sequential incubation steps with permeabilized Ramos cells: firstly, exposure to ibrutinib at five concentrations (12.5–200 nM) with samples collected at three time points over 6 min; then cell lysis and incubation with 1 μM IB-C8-DTB probe for 1 h. The BTK pulldown efficiency demonstrated a clear correlation with ibrutinib concentration and incubation time (Fig. 4B). Notably, the enrichment was less pronounced compared to the spebrutinib experiment (Fig. 3B), suggesting a higher overall covalent binding efficiency for ibrutinib. Subsequent multiplexing-based quantitative proteomics using TMTpro-18plex labeling identified 450 quantifiable protein groups. After normalization to ACC1, 346 protein groups exhibited reduced enrichment in the 5 μM ibrutinib saturation control relative to the DMSO control (Supplementary Data 2). BTK showed the highest enrichment ratio, followed by HMGN2, BLK, H2AC20 and TEC (Fig. 4C). Notably, BTK, BLK, and TEC are known ibrutinib off-targets in B cells42, sharing cysteine homology at BTK C481. Additionally, LYN kinase was identified as a lower-ranking hit, consistent with recent SLC-ABPP findings10.

Fig. 4: COOKIE-Pro profiling for ibrutinib in Ramos cell using IB-C8-DTB probe.
Fig. 4: COOKIE-Pro profiling for ibrutinib in Ramos cell using IB-C8-DTB probe.
Full size image

A Ibrutinib (IB) and IB-C8-DTB chemical structure. B Ibrutinib COOKIE experiment layout and the immunoblot for BTK in the pulldown samples. Blots are representative for three independent experiments. C Ranked proteomics ACC1-normalized intensity ratio between 5 µM ibrutinib and DMSO sample as an off-target protein list. D ACC1-normalized intensity ratio heatmaps for ibrutinib kinase off-targets across the COOKIE panel. E Representative first step progressive curve fitting for \({k}_{{{\rm{obs}}}}\) at different ibrutinib concentrations against BTK, TEC and BLK. F Second step [I]-kobs plots fitting into Michaelis–Menten model (MM-fitting) to obtain \({k}_{{{\rm{inact}}}}\) (s−1) and \({K}_{I}\) (M) values for BTK, TEC and BLK. Data are presented as individual datapoints from two independent experiments. Source data are provided as a Source Data file.

The three high-ranking kinases exhibited dose- and time-dependent responses to ibrutinib (Fig. 4D). Two-step fitting of these data yielded \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values for each protein (Fig. 4E, F). The derived parameters for BTK (\({k}_{{{\rm{inact}}}}\,\)= 0.04 s−1, \({K}_{I}\,\)= 108 nM, \({k}_{{{\rm{inact}}}}/{K}_{I}\) = 3.65 × 105 M−1s−1) align closely with previously reported values (\({k}_{{{\rm{inact}}}}\,\)= 0.027 s−1, \({K}_{I}\) = 54.2 nM, \({k}_{{{\rm{inact}}}}/{K}_{I}\) = 4.77 × 105 M−1s−1), validating the accuracy of our method. Our analysis revealed that despite similar kinact values among the three kinases, BLK demonstrates higher susceptibility to ibrutinib (\({k}_{{{\rm{inact}}}}/{K}_{I}\) = 7.87 ×105 M−1s−1) due to its stronger non-covalent affinity (\({K}_{I}\) = 28 nM). Conversely, TEC exhibits weaker potency (\({k}_{{{\rm{inact}}}}/{K}_{I}\) = 2.75 × 105 M−1s−1), attributable to its lower non-covalent affinity (\({K}_{I}\) = 233 nM). These findings not only validate the COOKIE-Pro methodology but also provide valuable insights into the differential binding kinetics of ibrutinib across multiple targets, contributing to a more comprehensive understanding of its pharmacological profile.

COOKIE-Pro strategy is compatible with SLC-ABPP dataset

High-throughput covalent fragment screening and target identification have gained considerable traction in recent years, particularly with the advancement of Affinity-Based Protein Profiling (ABPP) techniques. These methodologies enable efficient and systematic exploration of covalent interactions, facilitating the discovery of targets and the optimization of covalent inhibitors7,9,43,44. Despite significant progress in uncovering the cysteine-ligandable proteome and identifying novel electrophilic ligands targeting these cysteines, a critical gap remains in the field: the ability to disentangle and deconvolute the contributions from non-covalent binding affinity and reactivity. This gap presents challenges in making informed decisions and managing risks associated with pursuing specific ligand-target pairs for further development.

The recently developed SLC-ABPP platform employs a thiol-reactive probe, desthiobiotin-iodoacetamide (DBIA), to enrich cysteine-containing peptides that are not covalently bonded following pre-treatment with an electrophile library7. Our COOKIE-Pro workflow shares a similar underlying principle with SLC-ABPP, focusing on the enrichment and quantification of the unreacted portion of candidate proteins. Given this conceptual alignment, we posit that the datasets generated from SLC-ABPP are inherently compatible with the COOKIE-Pro data analysis workflow. This compatibility offers the potential for seamless integration, enabling a more comprehensive and nuanced analysis of covalent interactions.

COOKIE-Pro relies on multiple treatment dose and time points to fit accurate kinetic parameters, but SLC-ABPP, as a screening platform, for each test compound, only single dose and single time-point data was deposited. This failed to meet the minimum requirement for obtaining an estimation of the kinetic parameters. Therefore, we assumed that fragments within the same warhead class share similar electrophilic reactivity. To estimate the reactivity parameter, we queried the CovalentInDB45 for reported \({k}_{{{\rm{inact}}}}\) values for both nucleophilic substitution and Michael addition mechanism, and divided the data into subgroups based on warhead chemistry. The \({k}_{{{\rm{inact}}}}\) of the curated chloroacetamide warhead has a median of 0.0092 s−1, while both N-aryl and N-alkyl acrylamide have similar median \({k}_{{{\rm{inact}}}}\) values around 0.0025 s−1 (Fig. 5A, B). This data agrees with the observation that chloroacetamide warhead fragment can label more ligandable cysteines than acrylamide fragments, providing evidence for their higher reactivity7,39. We then converted the reported experimental condition-dependent competition ratios (CR) from the SLC-ABPP dataset into thermodynamic constant \({K}_{I}\) values to better rank the fragments among different warhead classes. After normalization by class-specific \({k}_{{{\rm{inact}}}}\) values, the \({K}_{I}\) values of fragments from both classes fell in the sub-mM to low mM range, with acrylamide-containing fragments showing overall tighter binding indicated by \({K}_{I}\,\)(Fig. 5C–F, Supplementary Data 3). This is probably a survivorship bias because acrylamide-containing fragments with lower binding affinities may not show up as hits due to the lower reactivity of acrylamides compared with chloroacetamides.

Fig. 5: Re-analyzing SLC-ABPP data and adapting workflow to ensure COOKIE-Pro compatibility.
Fig. 5: Re-analyzing SLC-ABPP data and adapting workflow to ensure COOKIE-Pro compatibility.
Full size image

A, B Reported \({k}_{{{\rm{inact}}}}\) values in CovalentInDB classified by reaction type and warhead subclasses. Red numbers are mean while gray numbers are median. Gray horizontal lines indicate three quartile values. Overall \({K}_{I}\) landscapes calculated from SLC-ABPP dataset (C, E) and distribution (D, F) using COOKIE approach for chloroacetamide (C, D) and acrylamide (E, F) warheads. Arrows on panels (D) and (F) indicate the corresponding CR values at given \({K}_{I}\) values. G, H Ligand efficiency (LE) calculated from the \({K}_{I}\) values in panels (C) and (E). I Ligand lipophilic efficiency (LLE) plot of fragment \(c\log P\) value versus \(-\log{{{K}}}_{{{I}}}\) values from panels (C) and (E). The fragment structures and targeted cysteines of highlighted dots are shown in the table. Source data are provided in Supplementary Data 35.

With \({K}_{I}\) values representing the non-covalent binding contribution, we calculated the ligand efficiency (LE) – a parameter that reflects the affinity contribution per atom by normalizing the affinity of compounds to their molecular size - for all the fragment-cysteine pairs (Fig. 5G, H, Supplementary Data 4). Surprisingly, with an LE cutoff of 0.25, the isoxazole moiety appeared three times among the top five fragments. Considering the two hydrogen bond acceptor atoms present in the isoxazole ring, this represents an attractive molecular pattern with high engagement efficiency that could be useful in cases where molecule weight budgets need to be considered like in the design of PROTACs or molecular glues.

Ligand-lipophilicity efficiency (LLE) is another parameter used in lead optimization that combines the potency of a compound with its lipophilicity to assess the overall quality of a drug candidate, defined as \({{\rm{LLE}}}=p{{\rm{I}}}{{{\rm{C}}}}_{50}-c\log P\). We calculated LLE values for all the fragment-cysteine pairs (Fig. 5I, Supplementary Data 5). Fragments with higher LLE provide more room to increase lipophilicity during lead optimization without reaching an unfavorable physical profile for the drug-candidate. In contrast, low LLE indicates that the molecular activity relies excessively on hydrophobic interactions, which is generally less successful in developing effective and well-balanced drug candidates46. Top-ranked fragments from both classes typically have more than two hydrogen bond donor or acceptor atoms in addition to their warhead moiety, which can serve as potential enhancers for non-hydrophobic binding interactions.

Overall, we have demonstrated here the compatibility of the SLC-ABPP dataset with our COOKIE-Pro analysis workflow. This integration allows for downstream comprehensive analysis by converting experimental condition-dependent CR values into a thermodynamic inactivation constant \({K}_{I}\), facilitating informed decision-making in the selection and optimization of potential drug candidates.

Increase throughput using two data points to estimate \({{{\boldsymbol{k}}}}_{{{\bf{inact}}}}\) and \({{{\boldsymbol{K}}}}_{{{\boldsymbol{I}}}}\) values

In our case study, a total of 17 samples were used, including two controls and 15 dose and time titration samples. This sample size limits the throughput for screening purposes. We ask how many samples are sufficient to achieve high-throughput quantification of both \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) without any presumption of either parameter. We were able to reduce the sample size to 7 (2 controls and 5 dose titration samples) at a single timepoint yet still obtained accurate fitting results for ibrutinib against BTK (Supplementary Fig. 5). However, sample size is still the limiting factor preventing the increase of throughput. Here, we propose a triage scheme that extracts kinetic information from a two-dose single-timepoint COOKIE fitting (Fig. 6). This approach aims to streamline the process by providing an efficient way to obtain \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values with a minimal sample number, thereby enhancing throughput and making it feasible to screen a larger number of compounds or conditions.

Fig. 6: Scenarios and obtainable parameters for two-point COOKIE-Pro screening.
Fig. 6: Scenarios and obtainable parameters for two-point COOKIE-Pro screening.
Full size image

Single timepoint with multiple concentrations are sufficient for COOKIE fitting (yellow, data from single timepoint ibrutinib COOKIE experiment Supplementary Fig. 5). Two were picked out from the five datapoints to simulate the scenarios that will be encountered in two-point COOKIE-Pro screening. Green: “linear phase” only gives accurate estimates of \({k}_{{{\rm{inact}}}}/{K}_{I}\) with no individual values (fitting with linear model \(y=\frac{{k}_{{{\rm{inact}}}}}{{K}_{I}}x\)). Blue: “curvature phase” that allows estimation of individual \({{{\boldsymbol{k}}}}_{{{\bf{inact}}}}\) and \({{{K}}}_{{{I}}}\) value (fitting with hyperbolic model \(y=\frac{{k}_{{{\rm{inact}}}}x}{{K}_{I}+x}\)). Pink: “plateau phase” that only \({{{k}}}_{{{\rm{inact}}}}\) can be accurately estimated (fitting with constant model \(y={k}_{{{\rm{inact}}}}\)). The upper limit of \({K}_{I}\) is \({\left[I\right]}_{1}/2\).

Since there is no pre-knowledge for the binding kinetics between the fragment to screen and its target proteins, the two concentrations would be chosen in an arbitrary manner. Therefore, the resultant datapoints on the [I]-kobs plot may not always be ideal for fitting. For Michaelis–Menten fitting, the curve can be roughly divided into three stages.

  1. 1)

    In low concentrations where \(\left[I\right]\ll {K}_{I}\), \({k}_{{{\rm{obs}}}}=\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\approx \frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}}\), indicating a linear relationship whose slope equals to \({k}_{{{\rm{inact}}}}/{K}_{I}\,\)(Fig. 6 green panel, scenario 1). In this scenario, minor perturbation of either data point may lead to significant changes in the individual fitting values, sometimes even failing to directly perform Michaelis–Menten fitting (as shown in Fig. 6 scenario 2). However, \({k}_{{{\rm{inact}}}}/{K}_{I}\) is rather stable and informative as it indicates the inactivation efficiency \({k}_{{{\rm{eff}}}}\), which can be fitted through Lineweaver–Burk plotting through origin point.

  2. 2)

    In middle concentrations the two points and the origin form a curvature shape (Fig. 6 blue panel, scenario 3). This is the ideal case that both individual values can be estimated through either Michaelis–Menten or Lineweaver–Burk fitting.

  3. 3)

    In high concentrations where \(\left[I\right]\gg {K}_{I}\), \({k}_{{{\rm{obs}}}}=\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\approx \frac{{k}_{{{\rm{inact}}}}\left[I\right]}{\left[I\right]}={k}_{{{\rm{inact}}}}\). When the two datapoints on the [I]-kobs plot are forming a flat line or a line with its slope <0, it potentially means the two concentrations are already in the plateau range, therefore the average value of \({k}_{{obs}}\) can be directly read out as \({k}_{{{\rm{inact}}}}\) (Fig. 6 pink panel, scenario 4). Although this scenario loses information about the concentration at which the half-maximal \({k}_{{obs}}\) is reached (i.e., \({K}_{I}\)), we can still estimate that \({K}_{I}\) is smaller than \(\left[I\right]/2\) due to the fact that \(f\left(\left[I\right]\right)=\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\) is a concave function (\({f}^{{\prime} {\prime} }\left(\left[I\right]\right) < 0\)).

This triage approach using two data points offers a balance between sample economy and the accuracy of kinetic parameter estimation. It is especially advantageous in high-throughput screening settings, where the number of available samples is often a limiting factor. This method could significantly increase the throughput of screening assays, making it possible to evaluate a larger number of compounds or conditions efficiently.

Applying two-point COOKIE-Pro strategy to profile binding kinetics for covalent fragment screening

To demonstrate the feasibility of the two-point COOKIE-Pro strategy for covalent fragment screening, 16 covalent fragments were purchased, and the screening was performed in a 96-well format using HeLa cell lysate to reduce kinetic interference from membrane permeability. It should be noted that although our preferred method involves using permeabilized cells, the one-hour incubation required for this fragment screening experiment is incompatible with our standard digitonin-based cell permeabilization method. To mitigate the risk of cell stress from prolonged digitonin exposure, we opted to use cell lysate instead. Cell lysates were treated with 20 or 50 μM of the fragments for 1 h, generating 20 μg total protein per TMT channel. For each TMTpro-18plex panel, eight fragments at two concentrations were analyzed alongside two DMSO controls. This setup yielded quantification of over 10,000 desthiobiotin-iodoacetamide (DBIA)-containing peptide groups per MS injection and ~16,000 per panel when combining replicates (Supplementary Fig. 7A), generating ~128,000 potential kinetic curves for all 8 fragments in one experiment for downstream analysis. The quantitative data was highly precise, with coefficients of variation <10% for the majority of competition ratios observed for each fragment, indicating minimal variation between replicates (Fig. 7D, Supplementary Fig. 7E).

Fig. 7: Applying two-point COOKIE-Pro strategy to profile binding kinetics for covalent fragment screening.
Fig. 7: Applying two-point COOKIE-Pro strategy to profile binding kinetics for covalent fragment screening.
Full size image

A Pie chart showing the distribution of best-fit models (Zero, Constant, Linear, Hyperbolic) for all fragment-peptide pairs based on AIC. B Confusion matrix comparing the model selections made by AIC versus BIC. C Stacked bar chart illustrating the distribution of the four kinetic models for each of the 16 fragments screened. D Box plots showing the distribution of CVs of the DBIA ratios for each compound and concentration, indicating low variation among replicates. Boxes show the three quartile values of the distribution. The whiskers extend to points that lie within 1.5 IQRs of the lower and upper quartile. E A case study of two-point fitting for various fragments targeting the C145 residue of the MGMT protein. The plot shows the observed rate constant (\({{{k}}}_{{{\rm{obs}}}}\)) versus fragment concentration, with the table summarizing the calculated kinetic parameters (\({{{k}}}_{{{\rm{inact}}}}\), \({K}_{I}\), \({k}_{{{\rm{inact}}}}/{K}_{I}\)) and literature competition ratio (CR) values7. Individual datapoints are presented for two biological replicates. F Scatter plot displaying the landscape of inactivation rates (\({k}_{{{\rm{inact}}}}\)) for all peptides that fit the hyperbolic or constant models, grouped by fragment. G Scatter plot showing the landscape of inactivation efficiencies (\({k}_{{{\rm{inact}}}}/{K}_{I}\)) for all peptides that fit the hyperbolic or linear models, grouped by fragment. Source data are provided in Supplementary Data 6.

To analyze the two-point data, we used the three models discussed in Fig. 6 and selected the best fit using the Akaike information criterion (AIC) and Bayesian information criterion (BIC). AIC or BIC has been widely used in model selection decisions, with penalty for model complexity to avoid overfitting. In this study, these criteria were employed to select the best-fitting model for the two-point kinetic data. Fragment-peptide pairs with competition ratio (CR) values ≥ 1 at both concentrations were classified under a “zero model”. The AIC and BIC provided highly similar model selections (Fig. 7B). Given the small sample size, we used AIC to guide model selection for downstream analysis to reduce parsimony with the hyperbolic model. Among the quantified 272,005 fragment – DBIA peptide pairs, the constant model was the most common (~48%), followed by the linear model (~33%). Although representing only about 2% of the total, a significant 5664 Michaelis–Menten kinetic curves were successfully fitted using the hyperbolic model across the 16 fragments (Fig. 7A). The distribution of these models varied between compounds, indicating fragment-specific kinetic behaviors (Fig. 7C, Supplementary Fig. 7B).

A case study on the MGMT-C145 peptide illustrates the utility of this approach (Fig. 7E). Different fragments targeting this same cysteine showed distinct kinetic profiles. Fragments CL94, CL112, CL113, and CL128 fit the hyperbolic model, allowing for the determination of both \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\). Fragments AC124 and CL99 fit the constant model, suggesting the concentrations used were in the plateau phase (\([I]\gg {K}_{I}\)). This provides an accurate estimate of \({k}_{{{\rm{inact}}}}\) and a lower limit for the inactivation efficiency (\({k}_{{{\rm{inact}}}}/{K}_{I}\)). These kinetic parameters generally aligned with previously published CR data7, with potent fragments from the original screen demonstrating high inactivation efficiencies, meanwhile provided information about the nuance difference of intrinsic reactivity and affinity among fragments. The fitting results also showed high correlation between replicates (Supplementary Fig. 7C, D).

The analysis provided a broad landscape of kinetic parameters. The calculated \({k}_{{{\rm{inact}}}}\) values, derived from fragment – DBIA peptide pairs fitting hyperbolic and constant models, spanned several orders of magnitude from 10−7 to 10−3 s−1, revealing a wide range of intrinsic reactivities (Fig. 7F, Supplementary Fig. 7F, Supplementary Data 6). Similarly, the inactivation efficiencies (\({k}_{{{\rm{inact}}}}/{K}_{I}\)), from pairs fitting hyperbolic and linear models, also varied widely from 10−3 to 101 M−1s−1, highlighting fragments with a favorable combination of non-covalent binding and chemical reactivity (Fig. 7G, Supplementary Fig. 7G, Supplementary Data 6). The highest inactivation efficiencies (\({k}_{{{\rm{inact}}}}/{K}_{I}\)), on the order of 101 M−1s−1, align well with the observation that fragment-based drug discovery hits typically span a range of 10−2 to 102 M−1s−147.

Analysis of the kinetic profiling data reveals that several proteins in the thioredoxin domain-containing (TXNDC) family, such as TXNDC12 C66 and TXNDC17 C43/C46, appear as frequent, high-ranking hotspots for multiple fragments (Fig. 7F, G). These proteins are involved in cellular redox regulation, a function that often relies on highly reactive, nucleophilic cysteine residues. A comparison of the kinetic parameters for these hits in Fig. 7F, G provides insight into their binding mechanism. Cysteine sites on these TXNDC proteins rank among the most potent hits, with very high inactivation efficiencies (\({k}_{{{\rm{inact}}}}/{K}_{I}\)). Concurrently, these same sites also exhibit some of the highest inactivation rates (\({k}_{{{\rm{inact}}}}\)) in the entire dataset. This strong correlation suggests that the high efficiency of these fragments against redox-prone sites is primarily driven by the high intrinsic reactivity of the cysteines (a high \({k}_{{{\rm{inact}}}}\)), rather than by strong, specific non-covalent binding affinity (a low \({K}_{I}\)).

In conclusion, the application of the two-point COOKIE-Pro strategy to a covalent fragment library successfully demonstrated its utility as a high-throughput kinetic screening platform. The method efficiently generated a large and reproducible dataset, enabling the determination of kinetic parameters for thousands of fragment-cysteine interactions. By classifying interactions into distinct kinetic profiles, this approach provides a more nuanced understanding than traditional single-point screening in SLC-ABPP. The ability to decouple reactivity from affinity at scale offers a significant advantage for medicinal chemistry, allowing for a more informed prioritization of hits. This streamlined workflow enhances the efficiency of covalent drug discovery by providing the detailed mechanistic insights necessary to guide the rapid optimization of fragments into selective and potent drug candidates.

Discussion

The evolution of covalent inhibitor development underscores the significance of this drug class in targeting difficult-to-drug proteins, with notable advancements in designing inhibitors that balance potency, selectivity, and safety34,35,36,45. The kinetics parameters of covalent inhibitors, \({k}_{{{\rm{inact}}}}\), \({K}_{I}\), and \({k}_{{{\rm{eff}}}}\), provide valuable insights into balancing potency with selectivity but are currently limited by the prior knowledge of on-target and off-target proteins and low-throughput assays for determining these parameters against these proteins.

The COOKIE-Pro method, introduced in this study, represents a significant advancement in profiling covalent inhibitor binding kinetics unbiasedly on a proteome-wide scale. This method addresses two major challenges in covalent inhibitor research: the quantification of covalent adducts and the determination of binding kinetics across the proteome. COOKIE-Pro utilizes a preincubation-based approach combined with MS-based proteomics, enabling the calculation of\(\,{k}_{{{\rm{inact}}}}\) and \({K}_{I}\) for covalent inhibitors across a broad range of proteins. The use of a desthiobiotin probe in the second incubation step and subsequent pulldown significantly reduces sample complexity, thereby enhancing the sensitivity of protein detection and addressing challenges posed by low-abundance targets. Although the DBIA probe used in SLC-ABPP can nonspecifically react with solvent-exposed cysteinome, it is not suitable for the COOKIE-Pro protein-level enrichment workflow – proteins with multiple DBIA reactive cysteines will still be enriched even if one inhibitor-specific cysteine is pre-occupied. While the DBIA workflow could function if the proteome is digested into peptides prior to streptavidin enrichment, it may still face challenges in identifying overly short or long tryptic peptides, as seen with the BTK C481-containing peptide (21 residues). Our protein-level enrichment approach better meets the need for deep profiling of potent inhibitor binding targets.

The validation of COOKIE-Pro with high selectivity BTK inhibitor spebrutinib demonstrated the method accuracy and reliability. The subsequent application to ibrutinib provided further insights into the method utility. Despite the lower selectivity of ibrutinib compared to spebrutinib, COOKIE-Pro successfully identified and quantified known off-targets in Ramos cells, such as BLK and TEC42, providing valuable kinetic parameters (\({k}_{{{\rm{inact}}}}/{K}_{I}\)) that are consistent with previous reports for BTK31,48. This consistency underscores the robustness of COOKIE-Pro in accurately profiling covalent inhibitor interactions, even for compounds with complex binding profiles.

The ibrutinib case study highlighted several important considerations for probe design and experimental conditions, including the impact of warhead reactivity, choosing a proper concentration to minimize non-specific labeling, and the effect of linker length on pulldown efficiency. The optimization process revealed that maintaining the intrinsic reactivity of the warhead is crucial for effective probe design, and that switching from native to denaturing pulldown workflow can minimize steric hindrance and improve pulldown efficiency. The concentration of probe used should be high enough to out-compete all the remaining non-covalent binding with the inhibitor as quickly as possible. However, using too high a probe concentration should be avoided, as non-specific labeling could occur. Large step dilution from the first to the second incubation is theoretically recommended over using saturating probe concentration, but in practice, it may reduce the identified protein numbers due to less input material amount.

The combination of SLC-ABPP with the COOKIE approach presents an opportunity to enhance the efficiency and throughput of kinetic profiling in drug discovery and chemical biology. While SLC-ABPP typically provides an experiment-dependent competition ratio (CR) as a measure of inhibitor potency, integrating the COOKIE approach aims to quantify intrinsic kinetic parameters \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\), which are fundamental characteristics of the enzyme-inhibitor interaction and independent of the specific experimental setup. Due to limitations in the published dataset containing only one datapoint, a surrogate method is used to assume an identical \({k}_{{{\rm{inact}}}}\) for fragments within the same warhead class and calculate the \({K}_{I}\) values for each fragment-cysteine pair.

This study further demonstrated the feasibility of a high-throughput screening application by adapting the workflow into a two-point COOKIE-Pro strategy to profile a covalent fragment library. This approach moves beyond the single, experiment-dependent competition ratio (CR) offered by platforms like SLC-ABPP by quantifying the intrinsic kinetic parameters \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) during fragment-based drug screening. The screening of 16 fragments generated a rich dataset of over 100,000 potential kinetic curves. By fitting data to distinct kinetic models, this approach provides a more nuanced understanding than single-point screening. A key insight from this analysis was the ability to decouple chemical reactivity from non-covalent affinity. For instance, frequent hits on thioredoxin domain-containing (TXNDC) proteins were found to have high inactivation efficiencies (\({k}_{{{\rm{inact}}}}/{K}_{I}\)). Our analysis revealed this was primarily driven by the high intrinsic reactivity of these redox-prone cysteines (a high \({k}_{{{\rm{inact}}}}\)) rather than specific binding affinity. Furthermore, the observed inactivation efficiencies, with top hits on the order of 101 M−1s−1, are consistent with the potency range typically expected for fragment-based hits. This ability to generate detailed mechanistic information at scale is a critical advantage for guiding medicinal chemistry efforts.

It is important to note that while the two-point COOKIE method offers significant advantages in terms of sample efficiency and throughput, there are important statistical limitations and considerations. With only two data points used to estimate both \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\), the fitting process is prone to overfitting, potentially resulting in parameter estimates that are overly sensitive to small variations in the data. This can lead to inaccurate or non-reproducible results. The limited data points also make it difficult to accurately estimate the uncertainty or confidence intervals around the fitted parameters. In real screening situations, it is important to use \({k}_{{{\rm{obs}}}}\) values fitted from multiple biological replicates to minimize the \({k}_{{{\rm{inact}}}}/{K}_{I}\) fitting error. Misinterpreting the scenario (whether the data points fall into the low, middle, or high concentration range) can lead to incorrect conclusions about the kinetic parameters, particularly if the chosen concentrations are close to \({K}_{I}\). Also, when selecting the two concentrations, a low-dose and high-dose combination increases the chance of fitting into Scenario 3, from which more information could be inferred.

In summary, the introduction of COOKIE-Pro represents a significant advancement in covalent inhibitor research, offering a powerful tool for profiling inhibitor binding kinetics across the proteome. The method’s successful application to both highly selective and less selective inhibitors demonstrates its versatility and potential for broad application in drug discovery and development. The successful extension of this strategy to a high-throughput, two-point format for fragment screening further underscores its value in early-stage drug discovery. The ability to quantitatively assess these interactions on a proteome-wide scale opens new avenues for understanding the full spectrum of covalent inhibitor activity and optimizing drug design for better therapeutic outcomes. Future studies could explore the application of COOKIE-Pro to other classes of covalent inhibitors and in different cell lines and tissues, integrating the binding kinetics parameters across the whole proteome into PK/PD modeling and toxicology predictions. We also envision with the development of data-independent acquisition (DIA) high-throughput proteomics, the theoretical framework of COOKIE-Pro can be readily applied to covalent fragment screen to prioritize the target and fragment pairs.

Methods

Mathematics

Consider a 2-step irreversible inhibitor \({I}\) binding to its target protein \(E\), the reaction can be described as:

$$E+I\begin{array}{c}{k}_{{{\rm{on}}}}\\ \rightleftharpoons \\ {k}_{{{\rm{off}}}}\end{array}{EI}\begin{array}{c}{k}_{{{\rm{inact}}}}\\ \longrightarrow \\ \,\end{array}{{EI}}^{*}$$
(4)

The equilibrium constant \({K}_{I}\) (Eq. (2)) for the first step was defined based on the steady state approximation of \({EI}\) (Eq. (5)):

$$\frac{d\left[{EI}\right]}{{dt}}={k}_{{{\rm{on}}}}\left[E\right]\left[I\right]-{k}_{{{\rm{off}}}}\left[{EI}\right]-{k}_{{{\rm{inact}}}}\left[{EI}\right]=0$$
(5)
$$\frac{\left[E\right]\left[I\right]}{\left[{EI}\right]}=\frac{{k}_{{{\rm{off}}}}+{k}_{{{\rm{inact}}}}}{{k}_{{{\rm{on}}}}}\equiv {K}_{I}$$
(6)

The rate constant for \(E{I}^{*}\) adduct formation:

$${k}_{{{\rm{eff}}}}=\frac{{k}_{{{\rm{inact}}}}}{{K}_{I}}=\frac{{k}_{{{\rm{on}}}}\cdot {k}_{{{\rm{inact}}}}}{{k}_{{{\rm{off}}}}+{k}_{{{\rm{inact}}}}}$$
(7)

Mass law for total target protein \({E}^{o}\) and non-covalent protein \({{\rm{\varepsilon }}}\):

$$[{E}^{o}]=\left[E\right]+\left[{EI}\right]+\left[E{I}^{*}\right]$$
(8)
$$[\varepsilon ]=\left[E\right]+\left[{EI}\right]$$
(9)

Assuming the reaction follows pseudo first-order reaction kinetics without inhibitor depletion, the unbound inhibitor concentration \([I]\) is a constant value unaffected by enzyme binding.

Since proteomics experiment can only quantify \(\left[E{I}^{*}\right]\), the goal is to define \(\frac{{{\rm{d}}}\left[{{\rm{E}}}{{{\rm{I}}}}^{*}\right]}{{dt}}\).

The formation rate of \(\left[E{I}^{*}\right]\) equals the consumption rate of \([\varepsilon ]\):

$$\frac{d\left[E{I}^{*}\right]}{{dt}}=-\frac{d\left[\varepsilon \right]}{{dt}}$$
(10)

On the other hand, by rearranging Eq. (2) with (9), we solve for \([{EI}]\):

$$\left[{EI}\right]=\frac{\left[\varepsilon \right]\left[I\right]}{{K}_{I}+\left[I\right]}$$
(11)

Therefore, the formation rate of \(\left[E{I}^{*}\right]\) can be expressed as:

$$\begin{array}{c}\frac{d\left[E{I}^{*}\right]}{{dt}}={k}_{{{\rm{inact}}}}\left[{EI}\right]\\={k}_{{{\rm{inact}}}}\frac{\left[I\right]}{{K}_{I}+\left[I\right]}\cdot \left[\varepsilon \right]\end{array}$$
(12)

Combining Eqs. (10) and (12), we get a differential equation for \([\varepsilon ]\):

$$-\frac{d\left[\varepsilon \right]}{{dt}}=\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\cdot \left[\varepsilon \right]$$
(13)

This equation can be solved by rearranging and integrating on both sides:

$$\int \frac{1}{\left[\varepsilon \right]}d\left[\varepsilon \right]=-\int \frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}{dt}$$
(14)
$${\mathrm{ln}}\left[\varepsilon \right]=-\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\cdot t+C$$
(15)

Here, we took advantage of the integration constant \(C\), let \(C={\mathrm{ln}}[{E}^{o}]\). After rearranging, we got:

$$\frac{\left[\varepsilon \right]}{\left[{E}^{o}\right]}=\exp \left(-\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\cdot t\right)$$
(16)

Substitute \(\left[\varepsilon \right]\) with \(\left[{E}^{o}\right]-\left[E{I}^{*}\right]\) and rearrange, we finally got the equation in Fig. 1A.

$$\frac{\left[E{I}^{*}\right]}{\left[{E}^{o}\right]}=1-\exp \left(-\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}\cdot t\right)$$
(17)
$$\Rightarrow \frac{\left[E{I}^{*}\right]}{{E}^{o}}=1-\exp \left(-{k}_{{{\rm{obs}}}}\cdot t\right),{\mbox{where}} \, {k}_{{{\rm{obs}}}}=\frac{{k}_{{{\rm{inact}}}}\left[I\right]}{{K}_{I}+\left[I\right]}$$
(18, Figure 1A)

MATLAB simulation

MATLAB (version R2022b) and the SimBiology package (v6.4) were used to build and simulate the covalent inhibition model as shown in Eq. (1). The system was built as described in the following 4 ordinary differential equations (ODEs):

$$d[E]/{dt}=-\left({k}_{{{\rm{on}}}}*\left[E\right]*\left[I\right]-{k}_{{{\rm{off}}}}*\left[{EI}\right]\right)$$
(19)
$$d\left[I\right]/{dt}=-\left({k}_{{{\rm{on}}}}*\left[E\right]*\left[I\right]-{k}_{{{\rm{off}}}}*\left[{EI}\right]\right)$$
(20)
$$d[{EI}]/{dt}=\left({k}_{{{\rm{on}}}}*\left[E\right]*\left[I\right]-{k}_{{{\rm{off}}}}*\left[{EI}\right]\right)-\left({k}_{{{\rm{inact}}}}*\left[{EI}\right]\right)$$
(21)
$$d\left[E{I}^{*}\right]/{dt}=\left({k}_{{{\rm{inact}}}}*\left[{EI}\right]\right)$$
(22)

The initial value of \([E]\) was set at 1e−10 M, \([I]\) titrating from 1e−9 to 1e−5 M with 9 steps on the log scale. Both \([{EI}]\) and \(\left[E{I}^{*}\right]\) were initially set as 0. The values of \({k}_{{{\rm{on}}}}\), \({k}_{{{\rm{off}}}}\), and \({k}_{{{\rm{inact}}}}\) were set as each condition in Fig. 1C. The simulation stop time was set at 86,400 s. ODE15s solver was used for all the simulation.

The simulated datapoints of \(\left[E{I}^{*}\right]\) were exported and used to fit for \({k}_{{obs}}\) values using lsqcurvefit function in the MATLAB. The fitting model = @(params, x) params(1) + (params(2) − params(1)) * (1 − exp(-params(3) * x)), where params(1) = 0, params(2) = 1e−10, and params(3) is \({k}_{{obs}}\) to be fitted. The fitted \({k}_{{obs}}\) values and their corresponding \([I]\) were used for the second round Michaelis–Menten fitting in GraphPad 10.2 to generate the individual \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values.

Reagents & cell culture

Spebrutinib and ibrutinib were purchased from MedChemExpress. Unless otherwise stated, all the chemicals were purchased from Sigma Aldrich. Ramos and HeLa cell line was purchased from American Type Culture Collection (ATCC, Cat. No. CRL-1596 and CCL-2). Ramos cells were maintained in RPMI-1640 (Corning, Cat. No. 10-040-CV) supplemented with 10% fetal bovine serum (FBS) (Corning, Cat. No. 35-011-CV). HeLa cells were cultured in DMEM (Corning, Cat. No. 10-013-CV) supplemented with 10% FBS. All the cell cultures were maintained at 37 °C with 5% CO2. Cells were used for COOKIE experiments only when the cell viability is higher than 95%, confirmed by trypan blue exclusion assay. The 5× NP-40 cell lysis buffer contains 250 mM Tris-HCl pH 7.5, 750 mM NaCl, 2.5% NP-40 Alternative (Millipore, Cat. No. 492018), 50% glycerol, 5 mM EDTA. Halt™ Protease inhibitor cocktail 100× (Thermo Scientific, Cat. No. 78440) was freshly added into the 5× NP-40 lysis buffer to 5× final concentration before the COOKIE experiment.

COOKIE-Pro protocol for advanced compound off-target kinetics profiling

COOKIE sample preparation for immunoblotting confirmation

Ramos cells were cultured until they met the number requirement of COOKIE experiment. Each dose and timepoint sample require 2 million Ramos cells (equal to ~120 µg total protein). In our COOKIE-Pro design, a total of 17 samples needs at least 34 million cells. To collect the cells, Ramos cell culture was centrifuged at 500 × g for 5 min followed by aspiration of the culture media. Cell pellet was washed and resuspended in phosphate buffered saline (PBS) pH 7.4 (Corning, Cat. No. 21-040-CV) to make the final cell density 10 million/mL.

In the first “pulse” step, for DMSO control sample, 2 million cells in 200 µL were incubated with 0.005% digitonin followed by 1 min 1% DMSO incubation. For saturating control sample, 2 million cells in 200 µL were incubated with 0.005% digitonin followed by 5 µM spebrutinib or ibrutinib incubation for 60 min. For each drug concentration, 6 million cells (for 3 timepoints) in 600 µL volume were incubated with 0.005% digitonin followed by adding 6 µL 100× concentrated spebrutinib or ibrutinib dissolved in DMSO for 2/4/6 min incubation. To start the second “chase” incubation, at the corresponding “pulse” timepoint, 180 µL permeabilized cell suspension was withdrawn and transferred into 45 µL 5× NP-40 lysis buffer containing 25 µM SB-2 or 5 µM IB-C8-DTB with a short gentle vortexing. The second “chase” incubation lasts for 1 h for all samples. Both incubation steps were under room temperature.

After chase incubation, all the samples were desalted to remove free DTB probes using Zeba™ Spin Desalting Plates (Thermo Scientific, Cat. No. 89807). Although the sample volume has exceeded the recommended Zeba plate loading volume, we confirmed that this condition does not impair the pulldown yield (Supplementary Fig. 6A, B). For denature pulldown protocol, 0.1% SDS was added to the desalted samples. Each desalted sample was then incubated with 50 µL slurry Dynabeads™ M-280 Streptavidin (Invitrogen, Cat. No. 11206D) pre-equilibrated three times with 1× NP-40 lysis buffer with 0.1% SDS. After overnight bead binding at 4 °C with rotation, the beads were separated from the lysate with magnetic rack. Lysate was aspirated and the beads were consecutively washed with 1× NP-40 lysis buffer twice and fresh-made 8 M urea twice. The beads were then transferred into new tubes and further washed with PBS twice. The beads were finally resuspended in 40 µL 200 mM HEPES pH 8.5. To prepare the immunoblot samples, a 5 µL aliquot was taken from each slurry bead sample and mixed with 1.6 µL 4× Laemmli buffer (Bio-Rad, Cat. No. 1610747) containing 10% β-mercaptoethanol. The immunoblot samples were incubated at 95 °C for 5 min and ready to load onto SDS-PAGE gels. The remaining slurry bead samples were stored at −20 °C until immunoblotting confirmed successful pulldown.

TMT-labeled proteomics sample preparation

The 35 µL slurry bead samples were thawed, resuspended, and reduced with 5 mM dithiothreitol (DTT) (Fisher Scientific, BP172) at 37 °C for 15 min with vigorous shaking, followed by alkylation with 20 mM iodoacetamide dissolved in 200 mM HEPES pH 8.5 (Thermo Scientific, Cat. No. A39271) at room temperature in the dark for 30 min. After aspirating all the supernatant, on-bead digestion was initiated by resuspending the beads in 40 µL 12.5 ng/µL trypsin/LysC mix (Promega, V5073) diluted in 200 mM HEPES pH 8.5. The bead-protease suspension was vigorously shaken (1,100 rpm) on a 37 °C tube incubator for 18 h. Acetonitrile (ACN) was then added to the digested samples to reach 30% v/v. For each sample, one TMTpro-18plex channel was assigned, and 10 µL of 25 µg/µL corresponding TMTpro reagent (Thermo Scientific, Cat. No. A52045), dissolved in 100% ACN, was added and incubated for 1 h at room temperature. The labeling reaction was quenched by adding 10 μL of 5% hydroxylamine to each sample and incubating for 15 min. All the samples were combined, acidified with formic acid (FA) until pH <3, and dried in the SpeedVac. The dried sample was reconstituted in 100 µL 200 mM HEPES pH 8.5 and desalted using SP3 peptide clean-up protocol10,49. Briefly, 200 µg 1:1 mixed Sera-Mag Carboxylate SpeedBeads E3 and E7 (Cytiva # 65152105050250 and # 45152105050250) were prewashed and added into the peptide sample. The binding process was initiated by adding 1.8 mL 100% ACN with vigorous vortex followed by 10 min 1000 rpm shaking. The supernatant was removed, and the beads were washed and vortexed twice with 200 µL 100% ACN. The beads were air-dried, and peptides were eluted using 2% DMSO for 10 min with intermittent sonication. Desalted peptides were SpeedVac dried until ready to inject into nanoLC-MS/MS.

Data dependent acquisition (DDA) on nanoLC-MS/MS

The peptides were reconstituted in 5% MeOH/0.1% FA. Each sample was first separated by a 15 cm PepMap™ 100 C18 HPLC Column (Thermo Scientific, Cat. No. 164534) on Thermo EASY-nLC 1200 through a 5–30% gradient of B for 95 min followed by a 30–60% gradient of B for 38 min (Mobile phase B = 90% ACN) at 300 nL/min flow rate. The eluted peptides were ionized with an Easy-Spray source (Thermo Scientific) at +2.4 kV and subjected into an Orbitrap Fusion Lumos mass spectrometer (Thermo Scientific). Data-dependent acquisition mode with a 3 s cycle-time was used. MS1 data were acquired using the Orbitrap analyzer in profile mode at a resolution of 120,000 over a range of 400–1600 m/z at 100% normalized automatic gain control (AGC) with a maximum injection time of 50 ms. Following HCD at 38% normalized collision energy and quadrupole isolation with a window of 1.2 m/z, MS2 data were acquired using the Orbitrap at a resolution of 50,000 in centroid mode with the first mass 110 m/z at 200% normalized AGC with a maximum injection time of 86 ms.

COOKIE-Pro data analysis

Proteome Discoverer 2.4 (Thermo Fisher Scientific) was used for RAW file processing, controlling peptide- and protein-level false discovery rates, assembling proteins from peptides, and protein quantification from peptides. Searches were performed using full tryptic digestion against the SwissProt human database (UP000005640, downloaded on August 12, 2022) with up to two missed cleavage sites. Oxidation (+15.995 Da) on methionine and deamidation on N and Q (+0.984 Da) were set as variable modifications, while carbamidomethylation of cysteine residues (+57.021 Da) and TMTpro labeling of peptide N-termini and lysine residues were set as fixed modifications (+304.207 Da). Data were searched with mass tolerances of ±20 ppm and 0.5 Da on the precursor ions and fragment ions, respectively. The results were filtered to include peptide spectrum matches (PSMs) with a high peptide confidence with PSM and peptide-level FDR < 0.01. PSMs with precursor isolation interference values > 50% and average TMT-reporter ion signal-to-noise values (S/N) < 10 were excluded from quantitation. Isotopic impurity correction and TMT channel normalization, based on the total peptide amount, were applied. Protein quantification uses both unique and razor peptides, with protein-level FDR < 0.01 for strict and <0.05 for relaxed restrictions.

Reported protein abundance for each TMT channel was processed for COOKIE two-step fitting in Prism v10.2 (GraphPad Software). Each individual protein abundance was first normalized against acetyl-CoA carboxylase 1 (ACC1) among all the samples. Then for each individual protein, the ACC1-normalized abundance was converted to control sample-normalized abundance by setting 0% as the protein abundance in the saturating control sample, and 100% as the abundance in the DMSO control sample. This control sample-normalized abundance is equal to the occupancy of DTB probe on each individual protein, and the occupancy of testing inhibitor \(\frac{\left[E{I}^{*}\right]}{{E}^{o}}\) can be calculated from (1 − occupancy of the DTB probe). With \(\frac{\left[E{I}^{*}\right]}{{E}^{o}}\) at different concentrations and timepoints, \({k}_{{{\rm{obs}}}}\) can be fitted for each concentration using the one-phase association model with \(Y=Y0\,+\,({{\rm{Plateau}}}-Y0)*(1-\exp (-({k\_obs})*x))\), where \(x={time}\) in seconds, \(Y=\frac{\left[E{{{\rm{I}}}}^{*}\right]}{{E}^{o}}\), the \({Plateau}\) is constrained to 1 and the \(Y0\) constrained to 0. The \(\left(\left[I\right],\,{k}_{{{\rm{obs}}}}\right)\) pairs were further plotted for the second Michaelis–Menten fitting using user-defined model \(Y={k\_inact}*X/({KI}+X)\), where the initial value of \({k\_inact}\) set as 1*YMAX and \({KI}\) set as 1*(Value of X at YMID).

Two-point COOKIE-Pro protocol for covalent fragment screening

Two-point COOKIE-Pro sample preparation with SP3 protein clean-up, TMT labeling, and peptide-level streptavidin enrichment

A day before the experiment, 5 million HeLa cells were plated in a 10 cm cell culture dish with ~80% confluency. This amount of cell generates ~1.5 mg total protein for 4 replications of TMTpro 18plex experiments with 20 μg total protein per TMT channel. After overnight culture, cells were washed with PBS and lyzed in the PBS lysis buffer pH 7.4 containing 0.1% NP-40, 1x Halt™ Protease inhibitor cocktail (Thermo Scientific, Cat. No. 78440), and 5 μL Benzonase (Sigma-Aldrich, E1014) for 10 min. The lysate was centrifuged at 14,000 × g for 5 min, and the supernatant was normalized to 2 μg/μL protein concentration. A total of 10 μL lysate (20 μg total protein) was dispensed into each well of a 96-well deep plate. Covalent fragments stocks in DMSO were diluted as 3× in the lysis buffer beforehand. A volume of 5 μL fragment were added into each well. For each fragment, two concentrations, 20 μM and 50 μM were used to incubate with the lysate for 1 h under room temperature. Desthiobiotin-iodoacetamide (DBIA) probe (2 mM in lysis buffer, 5 μL) were added into each well with a final concentration of 500 μM to react for another 1 h in dark.

After incubation, the reaction mix was cleaned-up using SP3 protein clean-up protocol49,50. Briefly, 150 μg of SP3 beads (1:1 mixture of E3 and E7 carboxylic magnetic beads from Cytiva Cat. #45152105050250 and Cat. #65152105050250) were added into each well with a brief vortex to resuspend beads. A total of 30 μL of 20 mM DTT dissolved in 100% ethanol was added into each well, followed by 10 min 1000 rpm shaking to trigger protein binding to the beads. Supernatant was aspirated and another 20 μl aliquot of lysis buffer containing 20 mM iodoacetamide was added, and the plate was vigorously shaken for 30 min in dark. Supernatant was aspirated, and beads were washed twice with 60 μL 80% ethanol. The beads were finally resuspended in 30 μL 12.5 ng/μL trypsin/LysC (Promega, V5073) in 200 mM HEPES pH 8.5. The plate was filmed and incubated at 37 °C overnight with 1000 rpm shaking.

After digestion, 11 μL ACN were added into the product peptide-bead mixture. To each sample, 120 μg TMTpro 18-plex reagent freshly dissolved in 4 μL ACN was added and incubated at room temperature for 1 h. The reaction was quenched by adding 7 µl 5% hydroxylamine for 10 min. Samples with different TMTpro channel labeling were combined, with beads removed by magnetic stand, and SpeedVac to dry. Peptides were resuspended in H2O, acidified with formic acid, and further cleaned up with SepPak SPE C18 column using 5% ACN/0.1% FA (Waters, WAT054955). Eluted peptides were dried, resuspended in 300 μL HEPES pH 8.5 buffer, and DBIA-labeled peptides were enriched using 50 μL High Capacity Streptavidin Agarose (Pierce, #20357) with 3 h end-to-end rotation at room temperature. Agarose beads were washed in the filter unit with twice HEPES/0.05% NP-40 pH 8.5 buffer, three times HEPES pH 8.5 buffer, and once H2O. Bound DBIA-peptides were eluted with 80% ACN/0.1% FA/0.2% TFA with three consecutive incubation with shaking 1) RT 20 min, 2) RT 10 min, 3) 72 °C 10 min10. Eluted fractions were combined, SpeedVac dried, and further cleaned up with homepacked C18 StageTips. Dried peptides were kept in −20 °C before LC-MS/MS analysis.

Data dependent acquisition (DDA) of DBIA peptides on nanoLC-FAIMS-MS/MS

Dried peptides were resuspended in 5% ACN/0.1% FA and each sample injected twice as technical replicates. Peptide samples were separated and analyzed using a Vanquish Neo UHPLC system coupled to an Orbitrap Ascend Multiomics Tribrid mass spectrometer (Thermo Scientific). The UHPLC was configured for direct injection with a 50 cm EasySpray HPLC separation column (Thermo Scientific, Cat. No. ES75500). Mobile phase A consisted of H2O/0.1% FA and mobile phase B consisted of 80% ACN/0.1% FA. Peptides were separated using a 150-minute gradient. The flow rate was maintained at 0.250 µL/min while the percentage of mobile phase B was increased from 4% to 25% over 120 min, then to 40% B from 120 to 140 min. The column was subsequently washed by increasing the flow rate to 0.350 µL/min and the concentration of mobile phase B to 99% from 141 to 150 min.

The Orbitrap Ascend Multiomics Tribrid mass spectrometer was operated in positive ion mode and utilized High-Field Asymmetric Waveform Ion Mobility Spectrometry (FAIMS Pro Duo) with standard resolution. The analysis was conducted by cycling through two FAIMS compensation voltages (CVs): −50 V and −70 V. The total cycle time for the data-dependent acquisition was 1.5 s.

Full MS1 scans were acquired in the Orbitrap at a resolution of 120,000 over a scan range of m/z 400–1600 (for CV −50) and m/z 400-1500 (for CV −70). The maximum injection time for MS1 scans was set to 25 ms for the −50 CV and Auto for the −70 CV. The normalized Automatic Gain Control (AGC) target was 50% (absolute target of 2e5) for the −50 CV experiment and 25% (absolute target of 1e5) for the −70 CV experiment.

For MS2 analysis, precursor ions with charge states from 2 to 6 and a minimum intensity of 5e3 were selected for fragmentation. Dynamic exclusion was enabled to exclude precursors after one acquisition for a duration of 45 s with a mass tolerance of 10 ppm. Selected precursors were isolated using the quadrupole with an isolation window of 1.0 m/z and fragmented via Higher-energy Collisional Dissociation (HCD) with a normalized collision energy of 38%. The resulting fragment ions were detected in the Orbitrap at a resolution of 30,000. The MS2 scans had a first mass fixed at m/z 120. The maximum injection time for MS2 scans was set to 96 ms, with a normalized AGC target of 200% (absolute target of 1e5). Data was acquired in centroid mode for MS2 scans.

Two-point COOKIE-Pro data analysis

The raw data files were processed using Proteome Discoverer™ software (version 3.2, Thermo Fisher Scientific). All MS/MS spectra were searched against the UniProt Homo sapiens reviewed proteome database (UP000005640, downloaded on January 7, 2025) using the Sequest HT search algorithm. A target-decoy strategy was employed to filter peptide-spectrum matches (PSMs) to a 1% false discovery rate (FDR) using the Percolator node.The search parameters for the fully tryptic digestion allowed for a maximum of two missed cleavages. A precursor mass tolerance of 20 ppm and a fragment ion mass tolerance of 0.6 Da were applied. Tandem mass tags (TMTpro) labeling on peptide N-termini and lysine (K) residues (+304.207 Da) were set as static modifications. The following dynamic modifications were permitted: Oxidation of methionine (M, +15.995 Da), Deamidation of asparagine (N) and glutamine (Q) (+0.984 Da), Carbamidomethyl (+57.021 Da) and DBIA (+296.185 Da) on cysteine (C). Quantification of TMTpro reporter ions was performed using the Reporter Ions Quantifier node based on the HCD MS2 scans. The reporter ion intensities were quantified using the “Most Confident Centroid” method with an integration tolerance of 20 ppm. PSMs with precursor isolation interference values > 50% and average TMT-reporter ion signal-to-noise values (S/N) < 10 were excluded from quantitation. Isotopic impurity correction and TMT channel normalization, based on the total peptide amount, were applied. Protein quantification uses both unique and razor peptides.

Peptide group search results were exported, and further COOKIE calculation was processed with a custom Python script. For the high-throughput covalent fragment screening performed at a single time point with two concentrations, a specialized analysis was employed to classify the kinetic behavior of each fragment-peptide interaction. After calculating the \({k}_{{obs}}\) value for each of the two concentrations, three distinct kinetic models were fitted to the \({k}_{{obs}}\) data points:

Linear Model: \({k}_{{{\rm{obs}}}}=({k}_{{{\rm{inact}}}}/{K}_{I})\cdot [I]\), applicable when inhibitor concentrations [I] are much lower than KI.

Hyperbolic Model: \({k}_{{{\rm{obs}}}}={k}_{{{\rm{inact}}}}\cdot [I]/({K}_{I}+\left[I\right])\), the full Michaelis–Menten model.

Constant Model: \({k}_{{{\rm{obs}}}}={k}_{{{\rm{inact}}}}\), applicable when inhibitor concentrations [I] are much higher than \({K}_{I}\).

To select the most appropriate model for each pair while avoiding overfitting, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) were calculated for each successful model fit under the assumption of Gaussian independent and identically distributed residuals.

$${AIC}=n{{\mathrm{ln}}}\frac{{{\rm{RSS}}}}{n}+2K$$
(23)
$${BIC}=n{{\mathrm{ln}}}\frac{{{\rm{RSS}}}}{n}+K\log n$$
(24)

where \({{\rm{RSS}}}=\sum {({y}_{i}-{\hat{y}}_{i})}^{2}\), and \(K\) indicates the number of parameters to fit in each model. \(K=1\) in linear and constant model, \(K=2\) in hyperbolic model.

The script selects the model with the lowest AIC score as the best representation of the kinetic behavior. This model selection process allows for the most reliable estimation of kinetic parameters from the limited two-point data, providing either the inactivation efficiency (\({k}_{{{\rm{inact}}}}/{K}_{I}\)) from the linear model, individual \({k}_{{{\rm{inact}}}}\) and \({K}_{I}\) values from the hyperbolic model, or an estimate of \({k}_{{{\rm{inact}}}}\) and upper boundary of \({K}_{I}\) from the constant model.

Immunoblotting

The input sample before streptavidin pulldown and the boiled beads in the Laemmli buffer were separated by SDS-PAGE using 4–20% Criterion TGX precast gels (Bio-Rad, Cat. No. 5671095) and further semi-dry transferred onto 0.45 µm PVDF membrane (Millipore, Cat. No. IPVH85R) pre-soaked with ethanol. After 1 h blocking with 4% bovine serum albumin in 1x TBST, the membranes were probed overnight at 4 °C with the specified primary antibodies at the dilution of 1:1000 (Cell Signaling Technology: BTK Rabbit mAb#8547 (Clone:D3H5, Lot:18); Abclonal: Tec Rabbit mAb A22074, Clone: ARC53949 Lot 3522071376) or 1:2000 (β-actin Rabbit mAb#4970 (Clone:13E5, Lot:19)), followed by 3 times 10 min TBST washes and incubation with the HRP-conjugated secondary antibody (Kindle Biosciences, #R1006, 1:1000) for 1 h at room temperature. For streptavidin blotting, after blocking, the membranes were probed for 1 h with HRP-Conjugated Streptavidin (Thermo Scientific, Cat. No. N100) at the dilution of 1:10,000. After 3 times 10 min TBST washes, chemiluminescence was developed using SuperFemto ECL chemiluminescence kit (Vazyme, Cat. No. E423-01) and captured in the digital imaging system. Densitometry analysis was through KwikQuant Image Analyzer 5.9 software (Kindle Biosciences).

BTK kinase activity assay

PhosphoSens-Kinetic Kinase Assays (AssayQuant Technologies, AQT0101) was used to monitor the inhibition kinetics of BTK by ibrunitib, IB-ac-DTB, IB-C5-DTB, IB-C8-DTB. Each well on the 384-well plate (Corning, 3701) contains 20 µl mixture of 2.5 µL reaction buffer (500 mM HEPES pH 7.5, 0.1% Brij-35, 100 mM MgCl2), 2.5 µL 100 µM PhosphoSens Sensor peptide substrate, 2.5 µL 10 mM adenosine 5′-triphosphate disodium salt hydrate (ATP) solution, 2.5 µL 10 mM tris(2-carboxyethyl)phosphine hydrochloride (TCEP) solution, and 10 µL ddH2O. DMSO or the testing compounds dissolved in DMSO were directly dispensed into the well using Multidrop Pico 8 Digital dispenser (Thermo Scientific) giving a final concentration of 8.7 nM to 500 nM. With all the reagents mixed, 5 µL 6.4 nM His-BTK (SignalChem, B10-10H) in enzyme dilution buffer (20 mM HEPES pH 7.5, 0.01% Brij-35, 5% glycerol, and 1 mg/ml BSA) was added to achieve final 1.28 nM BTK. The plate was immediately filmed and fluorescence intensity (Ex. 360 nm Em. 492 nm) was recorded every 3 min for 3 h on a SYNERGY H1 microplate reader (BioTek).

GSH reactivity measured by LC-UV/MS

Test compounds at 1 mM were reacted with excess of glutathione (100 mM) in 100 mM HEPES/20% ACN pH 7.5 at 37 °C in singlicate. The compound reactivity was qualitatively measured through monitoring the loss of parent compound by integrating the extracted ion chromatogram and UV chromatogram on an Agilent 1260 Infinity II HPLC system tandemed with single quadrupole InfinityLab LC/MSD system.

Zeba plate desalting capacity test

Zeba desalting plate volume capacity was initially tested with a desthiobiotin-BODIPY dye DTB-C6-BDP. Zeba plate was centrifuged at 1000 × g for 2 min to remove the storage buffer and loaded with 120–320 µL 10 µM DTB-C6-BDP in PBS and centrifuged again to collect the desalted samples. The fluorescence intensities of the desalted samples were converted into concentrations through standard curve calibration. The standard COOKIE protocol was then tested with different sample volumes and cell numbers as in Supplementary Fig. S6B.

Reprocessing SLC-ABPP dataset

We extracted the SLC-ABPP electrophile-peptide competition ratio (CR) values in HCT-116 cell screening from Kuljanin et al.7 Supplementary Table 6. In order to convert single concentration CR values into \({K}_{I}\) values, we assumed that the \({k}_{{\rm{inact}}}\) reactivity should be similar for covalent warheads in the same class. The reported \({k}_{{\rm{inact}}}\) information was extracted from the CovalentInDB45 and the unit was unified. After stratifying the entries into warhead subclasses, the \({k}_{{\rm{inact}}}\) median of chloroacetamide and acrylamide compounds were used for downstream calculation. The CR values greater than 2 were converted to \({K}_{I}\) value through these three equations:

$${{\rm{occupancy}}}=1-\frac{1}{{{\rm{CR}}}}$$
(25)
$${{\rm{occupancy}}}=1-\exp \left(-{k}_{{{\rm{obs}}}}\cdot t\right)$$
(26)
$${k}_{{{\rm{obs}}}}=\frac{{k}_{{{\rm{inact}}}}\cdot \left[I\right]}{{K}_{I}+\left[I\right]}$$
(27)

, among which the \(t\) and \([I]\) was 7200 s and 2.5e−5 M from the SLC-ABPP method, \({k}_{{\rm{inact}}}\) for chloroacetamide was 0.0092 s−1 and for acrylamide was 0.0025 s−1. The ligand efficiency (LE) was calculated according to

$${{\rm{LE}}}=-\frac{\Delta G}{N}=\frac{{RT}{\mathrm{ln}}{K}_{I}}{N}$$
(28)

where \(\Delta G\) is the molar Gibbs free energy, N is the atom number in the molecule.

The ligand lipophilic efficiency (LLE) was calculated from:

$${{\rm{LLE}}}=-\log {K}_{I}-c\log P$$
(29)

where the \({c{\rm{log}}P}\) was calculated using Crippen’s approach implemented in RDKit 2023.09 package. All the data analysis and visualization were achieved through Python 3.9.

Statistics

For spebrutinib and ibrutinib COOKIE-Pro experiments, the \({k}_{{obs}}\) obtained from each independent COOKIE-Pro experiment were combined as replicates to perform the Michaelis–Menten fitting. The reported standard error of \({k}_{{\rm{inact}}}\) and \({K}_{I}\) was calculated by GraphPad 10.2 using symmetrical approximate confidence intervals.

Data reproducibility

COOKIE-Pro result for spebrutinib was reproduced in all three independent experiments. COOKIE-Pro result for ibrutinib was reproduced in all two independent experiments. Two-point COOKIE-Pro fragment screening was performed in two biological replicates, and each sample injected twice for LC-MS/MS analysis.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.