Introduction

Traditionally, rational drug discovery focused on molecules that bind reversibly. This was due in part to the pharmaceutical industry’s aversion to covalent (i.e., irreversible) interactions1,2, which tend to raise safety concerns due to off-target binding that can result in allergies and severe idiosyncratic drug reactions, e.g., from haptenization and enzyme inhibition2. Nonetheless, meta-analyses show that 30% of marketed drugs, including blockbusters such as aspirin, omeprazole, and penicillin, employ covalent bonds2,3. Most of these were discovered using phenotypic assays (through serendipity) due to a lack of relevant methods for covalent inhibitors.

There is currently a surge in rational covalent drug development2,3. This began with an affinity-first approach, i.e., append to a high-affinity ligand, an electrophilic warhead positioned to bind a non-conserved cysteine. This approach led to at least ten marketed drugs3, e.g., ibrutinib targeting Bruton’s tyrosine kinase (BTK) and afatinib targeting epidermal growth factor receptor (EGFR). Covalent drug discovery recently adopted an electrophile-first approach, wherein relatively selective electrophiles, discovered by chemoproteomics, serve as a scaffold for further drug development. This approach is co-evolving with libraries of diverse electrophiles that expand the number of potential targets, and whole-proteome chemoproteomics technologies that identify—and assess the selectivity for—new targets4,5,6. The electrophile-first approach led to the discovery of SARS-CoV-2 inhibitor, nirmatrelvir (Paxlovid)7, and has been credited with making undruggable targets, such as KRAS, druggable (KRASG12C inhibitor, sotorasib [Lumakras])8,9.

Bioanalytical methods for traditional drugs (i.e., small molecule, non-covalent) are well-established10 and are predicated upon a basic tenet of pharmacology that states drug concentration at the site of action is related to effect. Non-covalent drugs are in equilibrium with their targets. As a result, if the equilibrium binding constant (Ki) is known and the free drug concentration is determined using a conventional liquid chromatography tandem mass spectrometry (LC-MS/MS) assay, the percent inhibition can be extrapolated using the Cheng Prusoff equation11,12. The time dependence of analyte concentration in blood is used to derive traditional pharmacokinetics (PK) parameters e.g., area under the curve (AUC), maximum concentration (Cmax), volume of distribution (Vd), clearance (CL), elimination half-life (t1/2), and to derive distribution and absorption coefficients. PK, biochemical (Ki), and phenotypic parameters are then input into a PK/PD model to predict, for example, effect and dose regimen.

For all but the highest affinity reversible ligands, PK findings connect to PD through the direct relationship of concentration to enzyme modulation, but with covalent ligands this direct relationship is lost. Drugs that form covalent bonds to their protein targets are pharmacologically unique because they bind irreversibly, which results in non-equilibrium kinetics and the uncoupling of PK and PD13. Therefore, with covalent drugs the free drug concentration at the site of action is not indicative of the effect, which limits the predictive power of traditional bioanalysis with respect to pharmacodynamics2,3. Moreover, the low concentration of free covalent drugs can impede the determination of an accurate AUC, and by extension absorption, distribution, metabolism, and excretion (ADME) parameters13. Additional PK parameters are also required for covalent drugs because in addition to the clearance of the free drug, clearance of the drug-target complex and its relationship to protein synthesis (i.e., turn-over rate) must be considered. In pioneering work to overcome these limitations, empirical relationships between PK and PD AUCs were established for a number of drugs14 including aspirin15, pantoprazole16, penicillin17,18, and anticancer drugs19,20,21.

Clearly, covalent drug discovery would benefit from bioanalytical methods and PK/PD models that complement their traditional counterparts. Intact protein MS is increasingly used for quality control of biotherapeutics22; in early covalent drug development23; and for quantitation, PK, biotransformation of biotherapeutics23,24. Intact protein MS methods combined with immunoprecipitation have been used to determine drug to antibody ratio (DAR) for antibody drug conjugates25,26, in a mass range ( > 100 kDa) that includes common covalent drug targets. Highly pertinent to the current study is the application of intact mass (using purified protein) to the determination of kinact/KI, which can be used to interpret structure-activity relationships (SARs) and measure selectivity. kinact/KI has been described as a critical in vitro parameter27,28 for selecting covalent inhibitors and may supplant IC50 (which is only appropriate for reversible inhibitors28) as the standard for predicting the in vivo occupancy of covalent drugs3,27,29,30,31. To our knowledge, intact protein MS has not been applied to in vivo PK/PD studies for covalent drug development.

Here, we show, by taking advantage of the property that covalent drugs and drug candidates (hereafter referred to as drugs for simplicity) remain conjugated to their targets, an intact protein MS assay that can analyze the drug-target complex, and assess both drug mechanism of action (MoA) and % target engagement (%TE). This bioanalytical method is applicable from screening to in vivo studies, and therefore throughout covalent drug development and clinical studies. We then derive a model to interpret time-dependent % TE data, which we term “intact protein PK/PD” (iPK/PD), that outputs both PK and PD parameters (absorption, distribution, and drug-target complex half-life, exposure, etc.). We present a workflow and create a results-based Decision Tree to guide researchers through drug candidate selection. We apply these to leading preclinical familial ALS (fALS) drug candidates targeting SOD1 and approved drugs for popular targets KRAS and BTK.

Results

Kinetic stabilization is employed by tafamidis to stabilize transthyretin32 and is a leading therapeutic strategy for stabilizing the quaternary structure of fALS-associated SOD1 variants33,34,35,36,37. Covalent approaches to kinetic stabilization of fALS SOD1 variants tend to target a Cys residue (Cys111) within the SOD1 dimer interface, and have employed cyclic disulfide derivatives38,39,40, disulfiram41, cisplatin42 and ebselen43. Our early screening efforts identified cyclic disulfide-based kinetic stabilizers using an infusion-ESI intact protein assay that could assess MoA44. Notably, this assay excluded 91 bond-forming compounds (compared to 9 true positive hits) that would have been false-positive hits in conventional assays (e.g., size exclusion-based and 35S-labeling). This illustrates the power of MS to improve covalent drug high-throughput screening efforts45. Our previous infusion-based MS assay, although capable of rapid sample analysis (<1 min/sample) is not compatible with biological matrices. All the abbreviations used throughout the manuscript are summarized in Supplementary Table 6.

Development of an LC-MS assay for determining target engagement within a biological matrix

We sought an intact protein liquid chromatography mass spectrometry (LC-MS) assay that is compatible with any phase of covalent drug development. Based upon previous intact protein LC-MS methods standardization efforts46 an LC-MS assay was developed to accommodate a variety of soluble proteins (16 proteins with diverse functions and molecular weights, Supplementary Fig. 1, Supplementary Table 1). To accommodate the complexity of biological matrices, an enrichment step is highly recommended prior to LC-MS. For SOD1 we employed a fast (~10 min) chloroform/ethanol partitioning technique that is applicable to multiple biological matrices (e.g., whole blood, red blood cells [RBCs], and brain) and when coupled to LC-MS requires less than a drop (20 µl) of blood for high S/N spectra. The output of this bioanalytical assay, %TE, serves as the organizing principle of this work. Moreover, accurate mass measurement provides an important quality control step on target protein preparations. Figure 1 illustrates the detection and chromatographic separation for unmodified SOD1 (fALS SOD1G93A) and SOD1 modified by a covalent kinetic stabilizer (cross-linker 1,2-dithiane-1-oxide, hereafter “S-XL6” to denote Sulfur mediated-crosslinking [XL] 6-membered cyclic thiosulfinate) and related sample preparation controls.

Fig. 1: Method development for intact protein MS.
figure 1

Whole blood from a transgenic fALS mouse model (SOD1G93A) fractionated using chloroform-ethanol without (ae) and with (bf) spiked S-XL6 (200 µM). S-XL6 is a kinetic stabilizer that cross-links the SOD1 dimer via two cysteines (Cys111subunit A and Cys111subunit B). Cross-linking forms two disulfide bonds that may be susceptible to cleavage by thiol-disulfide exchange with endogenous thiols (e.g., glutathione)79. To address this, we compared target engagement without (a, b, green) and with (c, d, purple; e, f, light blue) thiol alkylating (i.e., endogenous thiol blocking) agents iodoacetamide (IAA) and N-ethyl maleimide (NEM). The inset panels show the deconvoluted spectra. Ubiquitin peak eluted at 23.5 min. Alkylating agents did not improve cross-linking yield, indicating that a thiol blocking step was unnecessary in the final method. Mass spectra were obtained using reversed-phase liquid chromatography and quadrupole time-of-flight MS (Agilent 6560 LC-QToF-MS). % TE is defined as percentage of deconvoluted intensity of the drug-protein complex [drug-protein complex intensity / (drug-protein complex intensity + unbound protein intensity)]. Total ion current is represented as a.u. (arbitrary units) and mass is reported in kDa.

An MS-based decision tree for covalent drug development

For most covalent drugs, e.g., kinetic stabilizers or enzyme inhibitors, %TE is proportional to the on-target effect. Here, %TE from in vitro preparations is used for rapid, initial screening, and the time dependence of %TE in dosed animals is used to assess PK and PD parameters. Specifically, in this Decision Tree (Fig. 2) %TE is applied to preparations ranging from pure protein (Decision [D]1-2) to tissue extracts (D3-4) through dosed animals (D5-7).

Fig. 2: An intact MS-based Decision Tree for covalent drug development.
figure 2

Rectangles signify objectives and diamonds signify decisions. Blue arrows indicate advancement (go decisions), pink arrows indicate an unsuitable drug candidate, and black arrows indicate re-estimation. Once a covalent drug candidate reaches the green line, it can be advanced into safety pharmacology studies in higher animal species and Good Laboratory Practice (GLP) ADME studies. *Failures at these stages require input from traditional ADME assays. D# = Decision number, LD50 = lethal dose 50, EC50 = Half-maximal effective concentration, MoA = Mechanism of action, METE = Minimally effective target engagement.

Does the compound employ the correct MoA (D1) and sufficiently engage the purified target protein (D2)?

D1 depends upon confirmation of the proposed MoA in vitro by validation of the intended drug-protein molecular mass (Fig. 2). D2 depends upon the ability to reach a Minimally Effective Target Engagement (METE), as defined by disease biologists, following titration of the compounds into purified protein. For fALS 37% was defined as the METE to match the magnitude of the effect of the most effective fALS SOD1 modulating drug, tofersen47, an antisense oligonucleotide (ASO). The MoAs (D1 = go) and ability to reach METE (D2 = go) of leading fALS drug candidates were confirmed in purified proteins (Fig. 3) and these results were consistent with crystallographic studies of cisplatin, ebselen and S-XL6 (PDB IDs: 3RE042, 1UXM43, and 8CCX44, respectively).

Fig. 3: Intact protein LC-MS analysis applied to covalent drug screening.
figure 3

Covalent drug candidates—disulfiram (green, column 1), cisplatin (light blue, column 2), ebselen (purple, column 3), and S-XL6 (pink, column 4) were chosen for screening as they target SOD1G93A, a fALS variant of SOD1 protein involved in ALS. (First row) The proposed mechanism of binding and the predicted mass shift for each covalent drug using intact protein LC-MS analysis. (Second row) Control sample for target protein SOD1G93A. Representative raw and deconvoluted spectra (inset) for each covalent drug. (Third row) Assessment of target engagement of covalent drug using purified SOD1G93A. a Disulfiram shows the predicted molecular mass at 16006 Da ( + 148 Da shift). *However, it also forms a SOD1G93A dimer via a non-native disulfide bond with an unknown mechanism, which can result in misfolding and aggregation41,80. b Cisplatin shows the predicted molecular mass at 16311 Da ( + 452 Da i.e., 226 × 2). c Ebselen shows a molecular mass of 16133 Da ( + 274 Da shift), which confirms its proposed MoA. d S-XL6 cross-links SOD1G93A to form a dimer at 31834 Da [15976 Da shift from (15858*2 + 136 – 18 = 118 Da)], that also confirms the proposed MoA45. (Fourth row). Specifically, the mechanism of S-XL6 involves (step 1) reversible thiolate disulfide interchange between a protein cysteine and S-XL6, which due to Brönstead characteristics results in ring-opening to expose a terminal sulfenic acid, which (step 2) forms a second disulfide (and crosslink) with a second cysteine via condensation. Evaluation of in vitro target engagement in target tissue homogenate (i.e., transgenic ALSG93A mouse brain homogenate). Both ebselen and S-XL6 show the predicted mass shifts at 16133 Da and 31834 Da, respectively. (Fifth row) In vivo target engagement (SC RoA for all candidates). Only S-XL6 in vivo dosing was feasible and resulted in successful detection of cross-linked dimer at 31834 Da in brain (1-hr post-dose via SC). A red “no” symbol indicates a “no-go” decision.

Can the target be sufficiently engaged in the biological matrix of interest (D3) at an acceptable dose (D4)?

D3 is intended for rapid screening (Fig. 2) and depends upon the ability to reach METE in a target tissue homogenate with the compound at a concentration that is higher than a reasonable therapeutic dose (e.g., 10 mM, which extrapolates to a 5 mg/ml for a 500 Da compound). If a compound does not succeed at this concentration, it is unlikely to be therapeutically relevant. Treating ALS requires a CNS-permeable compound, and so a mouse brain homogenate was chosen for the present study. A “no-go” for D3 eliminated cisplatin from further consideration as a fALS drug candidate because brain target engagement is unlikely (Fig. 3). D4 requires titration of compound in the homogenate of interest to extrapolate a minimally effective dose, and is “no-go” if the extrapolated dose is greater than the LD50 or above the established safety limit (Fig. 4). Considering the extrapolated doses were all below the LD50s and the traditional preclinical screening results were acceptable, ebselen, disulfiram, and S-XL6 were a “go” for D4.

Fig. 4: Estimating effective concentration and dose to infer iPK/PD parameters for S-XL6.
figure 4

S-XL6’s concentration-%TE relationship was established by titration into target tissue homogenate, a minimum effective dose was estimated and tested in vivo. a Titration of covalent drug in whole blood and target tissue homogenate followed by intact protein LC-MS analysis. S-XL6 was spiked at increasing concentrations in whole blood and brain homogenate (target tissue) prepared from fast-line B6SJL-Tg (SOD1*G93A)1Gur/J transgenic mouse model, incubated at 37°C for 1 h, extracted using chloroform-ethanol precipitation, and %TE was analyzed by LC-MS. Dose estimation of S-XL6 was based upon 37% METE. b In vivo administration of covalent binder S-XL6 to achieve effective target engagement in systemic (whole blood) and target organ (brain). Dosed ALS SOD1G93A mice achieved METE at all predicted doses (10 mg/kg intravenously, 12.5 mg/kg and 30 mg/kg subcutaneously). In vitro and in vivo experiments were performed in singlet and duplicate, respectively. c Intravenous (IV) administration of covalent drug S-XL6 at 10 mg/kg and (d) oral (PO) administration of S-XL6 at 150 mg/kg in transgenic SOD1G93A mice (n = 2 per RoA). Whole blood was collected at 1, 4, 8 24-, 48-, 72-, and 168-h post-dose (for IV), and up to 72-h for PO, and extracted and analyzed as described above. Area under the curves (AUCs) were estimated for IV and PO curves. Additionally, brain (target) tissue was collected 1-hr post IV dose and processed as described above. Kp[brain] was calculated using %TE values (%TE brain divided by %TE blood) was c.a. ~1. For c and d raw %TE data are shown as discrete data points, and the lines illustrate fits obtained to the two term iPK/PD model [\(C\left(t\right)=A{e}^{-\alpha t}+B{e}^{-\beta t}\)] derived in Supplementary Fig. 2 and described in Supplementary File. Data (c, d) are presented as Mean ± SD. RoA = route of administration, METE = Minimally effective target engagement. Source data are provided as a Source Data file.

The experiments proposed here for D1-D4 provide useful information for determining structure activity relationships, which may merit their inclusion in high-medium throughput screening efforts. D4 serves as a gatekeeper for humane animal studies and should therefore also consider traditional preclinical screening methods and their respective exclusion criteria44, as informed by therapeutic indication48, ideally with input from a drug development specialist. For example, preclinical screening for SXL-6 included physicochemical properties (e.g., logD) and in vitro ADMET assays (e.g., plasma protein binding, hERG channel inhibited, cytotoxicity, CYP profiling, etc.,) as potential “no-gos.” We suggest that screening drug candidates for HSA binding, which serves as a proxy for plasma binding, be used as an additional preclinical assay (Fig. 6).

Defining the iPK/PD model

To our knowledge, previous PK/PD studies of covalent drugs were based upon the empirical relationship of free drug concentration and effect. Here, we derive a model that accepts % target engagement as input. Compounds that pass D4 are administered at an acceptable dose (e.g., estimated 50% TE or <½ of the LD50) and %TE is measured as a function of time in the target tissue. Traditional PK/PD models link the dose, time, and drug concentration (PK) to effect (PD) and are not applicable to covalent drugs. That notwithstanding, drug concentration-time curves and %TE-time curves are indistinguishable in their graphical form. It stood to reason that %TE-time curves could be fit using identical mathematical operations as drug concentration-time curves, and that by considering the different input (%TE instead of drug concentration), the different outputs could be derived. The mathematical models used to treat traditional PK curves as well as methods for extrapolating these results to effect (PK/PD modeling) have been reviewed49,50,51. Based upon these studies, application of identical mathematical operations to a traditional PK curve or an iPK/PD curve yields analogous parameters, including Tmax, AUC, and t1/2. In either case (traditional PK or iPK/PD) the ratio of AUCs of a particular tissue to the central compartment is the permeability coefficient (Kp with respect to the drug molecule, and Kp [TE] with respect to the target, respectively) and the ratio of AUCs of different routes of administration (RoA) is the absorption coefficient (%F with respect to the drug molecule, and %F [TE] with respect to the target, respectively). With traditional PK analysis t1/2 is that of drug itself and with iPK/PD it is that of the drug-protein complex, which we note have no a priori relationship. The detailed mathematical derivation as well as the capabilities and limitations of iPK/PD compared to existing models are described in Supplementary Fig. 2. Notably, the %TE vs. time curve for drug-protein complex is analogous to a traditional concentration-time-effect (PK/PD curve) for small molecules. By analogy to well-established protein turnover measurements52, if the amount (e.g., intensity) of the labeled target-time is fit as shown in Fig. 4, the output will be the protein degradation half-life. If the %TE-time is modeled, the output will include protein synthesis, and yield the protein turnover number.

There are a few important differences in the biological implication of these analogous parameters, because iPK/PD analyzes target engagement over time and traditional PK analyzes drug concentration over time. First, iPK/PD has the disadvantage of being blind to any compound that is not conjugated to the target and therefore does not address the absorption, distribution, metabolism, or elimination of the total compound. These must be addressed by traditional PK studies. Second, compared to traditional PK/PD models, iPK/PD has the advantage of avoiding the assumptions used during extrapolation between drug concentration and effect. Third, whereas the half-life derived from traditional PK relates to the clearance of the compound itself, the half-life derived from iPK/PD relates to the turnover of the protein target. Fourth, although the half-lives of traditional PK, iPK/PD, and protein turnover are all derived by fitting curves where the numbers of Xe-kt terms49,52 equals the number of compartments, PK compartments relate to different tissues, and iPK/PD and protein turnover compartments relate to distinct pools of proteins.

Is the target sufficiently engaged in vivo at an acceptable dose (D5)?

At this point a dose has been estimated from the concentration-%TE relationship in target tissue homogenates. The required dose, however, will be subject to the combined effects of ADME. In particular, exposure will be affected by: absorption, which can be accounted for with an IV dose; distribution, which can be estimated based upon compound- and organ-related dependencies; and the disposition of the compound via biotransformation and elimination, whose cumulative effects will be determined for compounds that proceed to D6 and D7. With the objective of estimating the maximum value of %TE (%TEmax), animals are given the estimated dose, sacrificed at early time points (e.g., 15 min, 30 min, 1 h), and the target tissues are harvested, processed, and %TEs are determined. If the METE is reached the compound is “go” for D5; if not, the dose is re-estimated [New dose = (old dose) x (METE/ %TE old dose)] and %TE is reassessed as described above. The METE could not be reached for ebselen (at ~22% of LD50) and disulfiram (at a 20 mg/kg dose imposed by disulfiram’s solubility limit in our IV dose formulation [4 mg/mL]), eliminating these compounds from further consideration (D5 “no-go”). Interestingly, D5-related results indicated that ebselen, which improves survival in a SOD1G93A mouse model43, effected SOD1 in vivo by promoting cysteinylation (in brain) and glutathionylation (in blood) of SOD1G93A, consistent with target engagement followed by compound removal via thiol/selenosulfide exchange53. This finding illustrates that the metabolic products of the target-drug conjugate can be discovered using our methods. Exceptions to a D5 “no-go” can be made, for example, in indications where the target product profile (TPP) allows an alternative RoA that could increase exposure (e.g., intrathecal for neurodegeneration).

Is the predicted dose regimen acceptable (D6)?

The remaining protocols depend upon multiple variables (e.g., animal availability, drug accumulation, a Kp estimate that enables the analysis of blood rather than target tissue etc.) and can be adjusted. Animals are treated with a single dose and %TE-time is assessed in the target tissue at additional time points (e.g., with the dose and duration informed by D5 results). Notably, an iPK/PD study will require a longer duration than traditional PK studies (0-24 h) when the target-drug conjugate’s half-life is greater than 1 day (as illustrated below for the S-XL6-SOD1 complex). iPK/PD parameters are determined and from these parameters, in particular %TEmax and the one (TE = TE0 e−k[TE]*t) or two term [TE(t) = Ce-γt + De-δt] fits of %TE-time curve, the dose and dosing interval can be estimated using established criteria52,54 (detailed description in Supplementary Fig. 2). D6 is “go” if, given the TPP, the predicted dose interval is acceptable.

SOD1G93A is contained within RBCs, which permitted the determination of oral (%FPO) and subcutaneous (%FSC) absorption as well as blood-brain barrier (BBB) permeability estimation (%TEbrain/%TEblood). The ratio of AUCs (oral/IV) indicated %FPO = 7, which is consistent with the %FPO obtained using traditional PK bioanalytical analysis of total drug (15%) and the ratio of toxicity (LD50[IV]/ LD50[PO] x100 = 6%). For additional studies, animal usage was limited by acquiring snapshot (one timepoint at 1-h) measurements, which indicated high brain penetrance (Kp [TE]@1hour = 1.0) and acceptable SC permeability (%FSC, [TE]@1hour = 38). iPK/PD modeling of %TE-time in blood indicated two compartments with 11 and 87-h half-lives (60% and 40% of SOD1, respectively), and that over half of the initially engaged SOD1 remains at the trough in a 24-h dose interval, which indicated a “go” for D6. Guidance for minimizing animal usage is provided in Supplementary.

Can an acceptable dose regimen be achieved (D7)?

The design of optimal dose regimen has been described55 and depends here upon factors that affect the dosing intervals, e.g., dose at %TEmax, drug-target complex half-life, and METE. Using a time course and a dose frequency estimated from D6-related data, and at least two doses, %TE-time is measured. After sufficient administration this provides a steady state exposure-time relationship, which based upon prior biological knowledge, can be extrapolated to an exposure-effect relationship. If exposure is sufficient following multiple doses, D7 is a “go”, i.e., the compound has potential as a therapeutic agent. It can then progress to comparably more resource intensive studies including standard (e.g., 14 day toxicology, etc.) and covalent drug-related (chemo- and toxicoproteomics to identify off-targets) safety assessment, and the precise dose-finding needed for the development stage. These experiments are currently in progress for S-XL6 in the SOD1G93A fALS mouse model.

Broader application: considerations for screening different therapeutic targets

We explored the applicability of the workflow above for the most popular covalent drug targets, KRAS (23 kDa), BTK (77 kDa) (Fig. 5), and EGFR (69 kDa) and the most prevalent off-target, HSA (Fig. 6). KRAS and BTK recombinant purified proteins were incubated with covalent drugs sotorasib and ibrutinib respectively. The observed intact mass measurements provided a precise quality control step for these drug-target preparations and satisfied the MoA (D1 = go) and METE (D2 = go) criteria (Fig. 5). Additionally, for confirming the sotorasib mechanism of action, we used in house purified GNP (active) and GDP (inactive) bound forms of KRAS. As expected, sotorasib fully bound GDP-KRAS and minimially bound GNP-KRAS, effectively locking the protein in the inactive state that inhibits oncogenic signaling56. These results were consistent with crystallographic56 (PDB: 6OIM) as well as previously reported bottom-up mass spectrometry studies57.

Fig. 5: Application of intact protein LC-MS to oncology targets.
figure 5

Covalent inhibitors sotorasib and ibrutinib targeting the undruggable KRASG12C and BTK protein, correspondingly, were chosen to test the scope of the methods presented here. Representative raw and deconvoluted spectra (inset) with drug-related mass shift for each of the protein targets are shown. Together (a) and (b) confirm the expected sotorasib-related mass shift ( + 560.6 Da) using a commercially sourced KRASG12C (green). In addition, the mechanism of action (Decision#1, D1) and METE (D2) were confirmed using an in-house purified (c), (d) GNP (blue) and (e), (f) GDP-bound (purple) versions of KRAS, confirming sotorasib selectively binds the inactive GDP-KRASG12C, thereby trapping the protein in its inactive conformation. (g), (h) Ibrutinib covalently binds to BTK ( + 438.6 Da) (pink) confirming the MoA and METE criteria. The phosphorylated proteoforms of BTK are delineated with a bracket. Notably, this analysis shows that unmodified BTK as well as BTK phospoproteoforms with 1, 2, and 3 phosphorylation sites fully bind ibrutinib. The unbound mass of BTK (average mass: 77512.5 Da) corresponds to an unknown +88 Da modified proteoform of BTK that cannot bind ibrutinib. Total ion current is represented as a.u. (arbitrary units).

Fig. 6: Plasma protein binding of covalent drug candidates using intact protein LC-MS.
figure 6

Covalent drugs were incubated with human serum albumin (HSA), the most abundant plasma protein, to test albumin-binding and to provide a go/no go decisions in the covalent discovery workflow. a Raw and deconvoluted (inset) spectra for control HSA (black). b Disulfiram ( + 147 Da) (purple), (c) cisplatin ( + 225 Da) (pink), and (d) ebselen ( + 274 Da) (blue) showed drug-related mass shifts with HSA, which affects their METE in vivo (D5 = No Go). As expected, rationally designed covalent drugs (e) afatinib (green), and (f) ibrutinib (orange), showed minimal binding to HSA. Additionally, (g) sotorasib (dark red) and (h) S-XL6 (red) showed negligible binding to HSA, which with S-XL6 is enabled by the unique ability to reversibly bind off-target lone cysteine residues, while covalently cross-linking closely spaced pairs of cysteines such as SOD1’s. *denotes cysteinylation (average mass: 66564.28 Da), which blocks the binding of cysteine-targeting drugs disulfiram and ebselen, but does not block the binding of cisplatin, presumably due to reaction with additional nucleophilic amino acids. Total ion current is represented as a.u. (arbitrary units).

The next step in the workflow requires screening for ability to reach METE in target tissue preparations. KRAS presented an analytical challenge at this step, because due to its instrinsically low abundance it could not be detected by IP-MS in the 10 mg human tissue samples available to us. The Kelleher group, however, showed that KRAS can be detected in IP-MS studies from 200 mg tissue samples, allowing studies with adequate tissue to progress through the entire iPK/PD workflow. To facilitate intact protein detection from tissue homogenates, we spiked tissues with KRAS and proceeded as describe above for SOD1 D3. Both immunoprecipitation (IP)-based and solvent extraction-based enrichment were successful in KRAS-spiked mouse lung homogenates and spiked patient lung samples (Supplementary Fig. 3), which then progressed into the iPK/PD workflow at D3. As expected, the above experiment confirmed that METE was reached with KRAS (D3 = Go). A summary of what is currently enabled for various targets, e.g., IP-MS and LC-MS methods, along with related references can be found in Supplemenatry Table 5. Briefly, SOD1 and KRAS can proceed through the entire workflow, BTK can proceed through D2 (go for ibrutinib), and EGFR could not be detected by intact protein LC-MS. At present, BTK has been immunopurified by others, and successful LC-MS studies have been performed by others, and a combined IP-MS method will need to be developed for steps beyond D2. EGFR has been immunopurified by others, but to our knowledge there are no published intact protein studies.

To overcome limitations related to immunopurification or intact protein detection, we derived the formula to convert chemoproteomics output, which has been successfully generated for BTK and EGFR, into %TE values (Supplementary Fig. 2) required for input into the iPK/PD model. To enable studies on new targets, sensitivity benchmarks and exemplar LOD, LOQ, and matrix effect calculations are provided for SOD1 in Supplementary Table 2, Supplementary Table 3, and Supplementary Table 4, respectively. Practically speaking, 1 picomol of the targets examined here (Fig. 5) should be sufficient for microflow (1-10 µL/min) LC-MS analysis. If additional sensitivity is required, more sample can be loaded or more sensitive nanoflow LC-MS analysis can be employed. For abundant the abundant target, SOD1, 1–10 mg of tissue is required, and for less abundant targets like KRAS, 200 mg of tissue is required58. For targets not considered here, the relative sample loading required can be estimated from the above considerations or the previously described relationship of intact MS sensitivity and protein size59.

Intact albumin screening reveals that more recent covalent drugs exhibit decreased off-target plasma binding

Serum albumin is the most abundant plasma protein (0.63 mM, constitutes ~60% total plasma protein) and a common off-target for covalent drugs that directly influences PK/PD60. As such, covalent compound binding to HSA can complement traditional preclinical plasma protein binding assays, by providing the MoA for HSA binding. The LC-MS assay described above was applied to HSA and a variety of covalent drugs (Fig. 6). Interestingly, the first-generation covalent drug (from phenotypic assay and not-rationally designed), disulfiram, converted 100% of HSA (at 20-fold molecular excess and equivalent to 20 mg/kg dose) to an HSA-disulfiram adduct by the mechanism shown in Fig. 3. This result is consistent with the previously described high plasma protein binding via albumin61. Second-generation covalent drugs (rationally designed by appending an electrophile to a high-affinity ligand) like afatinib and ibrutinib demonstrated limited, but detectable covalent binding to HSA. The third-generation covalent drug (electrophile-first with core electrophile discovered by chemoproteomics screening and rationally modified), soratosib, and fourth-generation electrophilic warhead (electrophile-first with electrophile rationally designed to minimize lone-cysteine binding), S-XL6, resulted in no detectable HSA binding.

Discussion

This work demonstrated how an intact protein MS assay, interpreted using our iPK/PD model, can advance covalent drug candidates throughout preclinical drug development. Provided that the tissues are available, the same techniques are applicable to clinical trials and personalized medicine. We tested the leading preclinical fALS SOD1 kinetic stabilizers and identified one compound, S-XL6, that engaged and kinetically stabilized its target and exhibited BBB penetration and bioavailability with different RoAs. We demonstrated the applicability of our workflow to assess target engagement of important targets like KRAS and BTK. For targets where intact mass measurements or protein fractionation haven’t been developed, we provide a means to port chemoproteomics data to the iPK/PD model. We extended our on-target analysis methods to off-target analysis, and demonstrated that the extent of HSA binding, which is a primary off-target for most RoAs, can be estimated. From these studies a fascinating and reassuring trend emerged—covalent drugs are evolving to exhibit less off-target binding to HSA. As we demonstrated previously for SOD1, sufficient enrichment62,63 can obviate the need for LC and prepare samples that are compatible with emerging native MS techniques64,65,66. Notably, iPK/PD analysis is applicable to any preparation where %TE can be measured or inferred, including non-covalent drugs analyzed by native MS techniques, endoproteinase digests where the unmodified and drug-modified peptides are detected67, and chemoproteomics methods that detect reactive cysteine abundance as a function of drug concentration. The iPK/PD model (i.e., % TE as input) applies to the target-bound drug, but does not directly apply to free drug absorption, distribution, metabolism, and excretion. The methods proposed here are therefore complementary to traditional bioanalytical methods for free drug analysis and chemoproteomics methods for off-target ID.

Methods

Ethical statement

All methods and animal manipulations were approved by the Northeastern University Institutional Animal Care and Use Committee (IACUC) under protocol #16-0303-R.

Chemicals

Cisplatin, ebselen, disulfiram, afatinib, sotorasib, and ibrutinib were purchased commercially from Sigma Aldrich (St Louis, MO). Human KRASG12C was purchased from Sino Biological (China) and Human Bruton’s Tyrosine kinase (BTK) was purchased from Abcam (Cambridge, UK). KRAS-GNP and KRAS-GDP were purified as per previously described68. Human Serum Albumin (HSA), Bovie Serum Albumin (BSA), NIST IgG1k mAb were purchased from Sigma-Aldrich (St Louis, MO). SOD1 was purified in-house as previously described44,62 from yeast.

Synthesis of cyclic thiosulfinate 1,2-dithiane-1-oxide (S-XL6)

Synthesis of 1,2-dithiane-1-oxide (S-XL6) was achieved according to the previously published literature procedure45. Briefly, commercially available 1,4-butanedithiol was reacted with bromine and silica gel to afford 1,2-dithiane. After crystallization from hexanes, 1,2-dithiane was then oxidized with sodium periodate and the final product 1,2-dithiane-1-oxide (S-XL6) was purified by flash column chromatography on silica gel.

Reaction monitoring of target engagement in vitro

Blood and brain were collected from a control hemizygous mouse expressing human SOD1G93A for titration experiment to determine the initial minimum concentration of compound to reach the target (e.g., hSOD1 protein). Brain homogenates were prepared using a Potter-Elvehjem homogenizer and 10 mM ammonium acetate, pH 7.8 (2:1, ammonium acetate to brain weight). 40 µL of both brain and blood were dispensed to separate vials and labeled accordingly. The test compounds for titration experiment were dispensed to both blood and brain vials according to the following final concentrations: 0.1, 0.5, 1, 5, 10, 50, 100, 500, 1000, 5000, 10000 µM ( < 2% DMSO). All samples were then fractionated using ethanol and chloroform extraction as previously described69.

Expression and purification of wild-type SOD1 and SOD1G93A

Expression and purification of SOD1 were conducted as previously described44,62. Briefly, EGy118ΔSOD1 yeast were transformed with a wild-type SOD1 and SOD1G93A YEp351 expression vector and grown at 30 °C for 44-hr. Cultures were centrifuged, lysed in a blender using 0.5 mm glass beads, and subjected to a 60% ammonium sulfate precipitation. Then the sample was centrifuged, and the resulting supernatant was diluted to 2.0 M ammonium sulfate. The diluted sample was passed through a phenyl-sepharose 6 fast flow (high sub) hydrophobic interaction chromatography column (Cytiva Life Sciences, Marlborough, MA, USA) using a linearly decreasing salt gradient from high salt buffer (2.0 M ammonium sulfate, 50 mM potassium phosphate dibasic, 150 mM sodium chloride, 0.1 mM EDTA, 0.25 mM DTT, pH 7.0) to low salt buffer (50 mM potassium phosphate dibasic, 150 mM sodium chloride, 0.1 mM EDTA, 0.25 mM DTT, pH 7.0) over 300 mL. Fractions containing SOD1 eluted between 1.6 and 1.1 M ammonium sulfate and were confirmed with SDS-PAGE. These fractions were pooled and exchanged into low salt buffer (10 mM Tris pH 8.0). Pooled fractions were then passed through a Mono Q 10/100 anion exchange column (Cytiva Life Sciences, Marlborough, MA, USA) using a linearly increasing salt gradient from low salt buffer to high salt buffer (10 mM Tris pH 8.0, 1 M sodium chloride) from 0 – 30%. SOD1 fractions were collected between 5 and 12% high salt buffer and were confirmed with SDS-PAGE, western blot, and Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FT-ICR-MS).

Confirmation of covalent drug-protein complex formation using LC-MS

In-house purified 40 µM wild-type SOD1 was diluted using 10 mM ammonium acetate, pH 7.4 and combined with 400 µM (10x, 0.25% final DMSO content) S-XL6 in equal volumes and incubated for 4 h at 37 °C and vortexed at 350 rpm. Cisplatin (1 mM) was incubated in hemizygous B6SJL mice expressing human SOD1G93A blood for 24-h at 37 °C. Complete cross-linking of cyclic thiosulfinates (S-XL6) and other reactions (e.g., drug-protein complex, post-IP (immunoprecipitation) samples, disulfiram, cisplatin, ebselen, afatinib, sotorasib and ibrutinib) were confirmed by either an Agilent 6560 QToF mass spectrometer (Santa Clara, CA) or by Bruker TIMS-ToF Flex mass spectrometer (Billerica, MA). Thiol alkylating agents – iodoacetamide (IAA, Fig. 1c, d) at 100 mM and N-ethylmaleimide (NEM, Fig. 1e, f) at 10 mM were incubated for 1 h with or without S-XL6 during method development. All in vitro assays were performed in duplicates and repeated on different MS instruments with similar results. All the samples (total: 218 and control: 6) were analyzed in duplicates by either an Agilent 1290 Infinity II-series UHPLC (Ultra-High Performance Liquid Chromatography) system coupled to an Agilent 6560 drift-tube ion mobility Q-ToF (quadrupole time of flight) mass spectrometer (IM-MS) (Santa Clara, CA) or a Bruker nanoElute2 (nanoLC) liquid chromatography system coupled to TIMS-ToF (Trapped Ion Mobility quadrupole time of flight) Flex as previously described44,46,68,70,71. The n/LC system was equipped with reversed phase Acquity UPLC Protein BEH C4 (300 Å pore size, 1.7 µm particle size, 100 mm bed length, 2.1 mm ID x 100 mm) column at 60 °C with a flow rate of 0.3 mL/min or Acquity UPLC Protein BEH C4 (300 Å pore size, 1.7 µm particle size, 100 mm bed length, 300 µm ID x 100 mm) column at 60 °C with a flow rate of 3 µL/min. The mobile phase consisted of a mixture of 0.1% formic acid in water (solvent A) and 0.1% formic acid in acetonitrile (solvent B) and combined a gradient: 0-2 min: 95% A; 2-70 min: 30% A; 72-75 min: 5% A; 78-80 min: return to initial conditions. The sample was introduced in 10% formic acid and 5 or 1 µL was injected for analysis. The mass spectrometer was calibrated prior to all the samples acquisition and the instrument was operated in positive ion mode. The source temperature was set to 150 °C and 200 °C, with drying temperatures of 150 °C and 210 °C, drying gas flow rates of 10 L/min and 9 L/min, and nebulizer pressures of 30 psi and 26 psi for the Agilent 6560 QToF and Bruker TIMS-ToF Flex mass spectrometers, respectively. Fragmentor voltages were maintained at 250 V and 300 V, while capillary voltages were set to 5000 V and 4500 V for each instrument, respectively. Agilent MassHunter Workstation Data Acquisition software v9.0 (Santa Clara, CA) was used for controlling the Agilent LC-MS and BioConfirm v10.0 software was used for data processing. Similarly, timsControl software v5.1.8 (Billerica, MA) was used for controlling the TIMS-ToF nanoLC-MS and Bruker Data Analysis v6.1 software was used for data processing.

The MS data were analyzed as previously described44. Briefly, the chromatographic window containing the proteins of interest were assigned using EICs (Extracted Ion Chromatogram), e.g., m/z 1586.25-1587.25 (SOD1G93A monomer) and m/z 1516.50-1517.50 (SOD1G93A dimer) and mass spectra from this region were summed. The summed-up mass spectra from each region were deconvoluted and average masses were calculated using the MaxEnt algorithm in either Agilent BioConfirm v10.0 or Bruker Data Analysis v6.1 software. The %TE was calculated as the ratio of dimer intensity (31836 Da) to the monomer (15858 Da) plus dimer intensity. The fit of the measured dimer values was performed and calculated by incorporating 2-compartment model equation using GraphPad Prism (version 10.1.2, San Diego, California).

In vivo dosing and tissue sample preparation for mass spectrometry analysis

All methods and animal manipulations conformed to the guidelines and were approved by the Northeastern University Institutional Animal Care and Use Committee (IACUC) under protocol #16-0303-R. In vivo dosing or ex vivo tissue sample preparation was performed using hemizygous B6SJL mice expressing human SOD1G93A (fast-line Jackson Laboratory; B6SJL-Tg(SOD1*G93A)1Gur/J, also known as SOD1-G93A stock- 002726)72. Animals in the age range of 15–30 weeks were used for in vivo testing; sex was not considered in this study. Animals were acclimatized with a 12 h dark/light cycle (24°C, 30% humidity). Test articles were prepared either in 1x PBS or water (depending upon the solubility, <2% final DMSO content) and mice/rat were dosed slowly at a specified dose level (mg/kg) via a suitable route of administration (intravenous, subcutaneous, or oral). Mice were kept in a restrainer or sacrificed to collect blood and brain at different time points (1, 4, 8, 24, 48, 72, and 168 h) for pharmacokinetic/pharmacodynamic assay or at a single time point tissue collection. Protein extraction was performed as previously described69. Briefly, homogenates were treated with an 8 equivalents of 10 mM ammonium acetate (pH 7.4), followed by 0.15 equivalents of cold chloroform and 0.25 equivalents of cold ethanol, and vortexed at 1800 rpm at 4 °C for 15 min and centrifuged at 17,700 g for 10 min. The supernatant was collected and stored at -80 °C for LC-MS analysis. The analyses (n = 2 for each RoA) herein are therefore of the supernatant resulting from a mixture of 0.25 equivalents of ethanol, 0.15 equivalents of chloroform, 0.6 equivalents of water. Ebselen was injected via SC to a hemizygous mouse expressing human SOD1G93A at 20 mg/kg followed by sacrificing the mouse and collecting both blood and brain after 1 h for further processing the samples prior mass spectrometry run as mentioned earlier.

Tissue homogenization before Immunoprecipitation (IP)

Mouse lung samples were homogenized using ceramic beads and T-PER Tissue Protein Extraction Reagent (Thermo Scientific) for extracting total proteins from tissues, using the manufacturer’s recommended method. Protease inhibitor (SIGMAFAST protease inhibitor, Sigma-Aldrich) was added to the sample homogenization procedure to avoid any proteolysis.

Immunoprecipitation Mass Spectrometry (IP-MS) sample preparation

The immunoprecipitation method optimization was done using Dynabeads Protein G IP kit (Invitrogen) and mouse lung tissue. KRAS protein was enriched using human KRAS protein-specific monoclonal antibody (Thermo Scientific). 50 µL (1.5 mg) of magnetic beads were used at 30 mg/mL in PBS, pH 7.4, with 0.01% Tween-20 and 0.09% sodium azide (preservative), for each IP experiment. The anti-KRAS antibody (10 µg) was diluted in 200 µL antibody binding and washing buffer before adding it to the prepared magnetic beads. After washing step, 250–500 µL tissue homogenate samples (containing KRAS protein) were added to the antigen binding step to form the immune complex. KRAS protein was eluted using 0.2% TFA, which enabled subsequent downstream intact protein MS analysis. Due to inherently low abundance of KRAS protein in the tissues of interest, the eluted protein was concentrated using SpeedVac (Thermo Scientific), followed by reconstitution in 100% water. With the optimized IP procedure, KRAS protein was detected with the intact protein LC-MS method described above. The experiments were repeated three times with similar results.

Reproducibility, sensitivity, accuracy, matrix effect, and precision experiments

Blood was collected from control B6SJL hemizygous mouse for the calibration curve experiments to perform reproducibility and sensitivity measurements. 40 µL of blood was dispensed in separate tubes. For monomer standard curve, human SOD1 was added to blood vials according to the following final concentrations: 0.001, 0.01, 0.1, 0.5, 1, 2.5, 5, 10, 15, 20, and 25 µM. For the dimer standard curve, hSOD1 was dispensed to blood vials containing 5 mM S-XL6 previously spiked into the preparation to form a dimer. All vials were incubated at 37 °C for 3 h. All samples were then fractionated using ethanol and chloroform extraction as described69. The internal standard 13C, 14N-labeled ubiquitin, purchased from Cambridge Isotope Labs (Tewksbury, MA), was spiked into both monomer and dimer standard curves. Quality control samples i.e., 1, 5 and 15 µM for monomer and 2.5, 5 and 20 µM for dimer, were prepared separately for each calibration run (Supplementary table 2 and 3). Additionally, matrix effect quality control samples i.e., 1, 5 and 10 µM for monomer and dimer were prepared separately and % matrix effect was calculated based on mean peak area ratio of mean peak area of the analytes spiked post-extraction to the mean-peak area of the same analyte standards (Supplementary table 4). Statistical analysis was performed using GraphPad Prism (version 10.1.2, San Diego, California). Data from three independent measurements were analyzed to calculate precision, accuracy and matrix effect.

Derivation and detailed description of the iPK/PD model

Analysis of the target tissue is required due to violation of the free drug hypothesis by covalent drugs, which results in drug concentration asymmetry. Notably, drug concentration asymmetry is pervasive, including for reversible binders, and difficult to model for the CNS, the focus of this study.

Assumptions in the iPK/PD model

  • The ionization efficiency of the protein with and without the drug remains unchanged. Note, this only applies to the measurement of %TE, and is not an assumption (i.e., is factored out) when ratios of %TE are taken [i.e., Kp(TE) and F(TE)].

  • When extrapolating in vitro (target tissue homogenates) results to in vivo, there are certain underlying assumptions such as –

    • If k is unknown, there is no barrier between target and site of measurement.

    • Equal partitioning among the organs.

    • The off-target binding is the same across the organism.

    • Rate of distribution is significantly higher than the rate of metabolism of both the small molecule and the complex.

    • There is a lack of feedback drug-related up or down regulation of the target (e.g., from downstream signaling, activate proteases), and no target manipulation (tolerance).

    • The tissue homogenate extrapolation to whole organ follows the same rules and recommendations as suggested by health authorities73. Target expression studies can be conducted while validating a target for a disease indication. The complexity and dependencies of target expressed within tissue/cell types will then be extrapolated from these studies to consider target tissue for analysis.

Drug-target complex half-lives are obtained by fitting the curve with one or more of the generalized terms Cie-kit (C=Normalization Constants), e.g., TE = TE0 e−kTE*t and TE(t) = Ce-γt + De-δt for one and two compartment models, respectively, by analogy to traditional PK methods.

Corollaries of the iPK/PD model

  • The k[TE] is the turnover constant for the drug-protein complex and the half-life obtained is the drug-protein complex degradation half-life (turnover of the target).

  • %F(TE) is the absorption coefficient with respect to (only) the drug-protein complex. When target is present in blood, absolute %F (TE) can be calculated as ratio of area under the curves (AUCs) for a specific route of administration compared to intravenous route. When target is not present in blood, a relative %F(TE) can be obtained by comparing AUCs in target tissue for two different routes of administration.

  • Kp, (TE) is the tissue partition coefficient with respect to the drug-protein complex. It can be calculated as the ratio of %TE(tissue) to %TE(blood), only when the target is present in the blood.

  • The compartments in traditional PK refer to different tissues whereas, in protein metabolism refer to different pools of proteins within a cell.

Traditional PK/PD model assumptions that do not apply to iPK/PD

  • Drug concentration (free/unbound) is the driving force of the pharmacological effect.

  • Unbound drug concentration in blood is same as that at the site of action (at steady state).

  • Drug and target are in equilibrium.

  • Drug (specifically the Drug-Target complex) is eliminated from the body through the central compartment (metabolism in liver and or excretion via kidney).

Relating the iPK/PD model to traditional PK, PK/PD, and protein metabolism models

The mathematical derivation of iPK/PD model is shown and is related to traditional PK, PK/PD, and protein metabolism models in Supplementary Fig. 2.

Consideration of target turnover in iPK/PD

In iPK/PD model presented, the overall covalent drug PK and PD outcomes depend upon the %TE measurements of the drug-target protein complex. %TE inform the drug-target binding process, that includes target turnover rate. Target half-life and de novo synthesis of target influences the duration of PD response. A potential approach is using %TE from intact covalent-drug protein complex measurements that outputs the half-life of the complex (i.e., target), enabling a direct correlation of PK to PD, which otherwise is not possible with traditional PK74,75.

Avoiding terminal experiments through the analysis of proxy tissues (D6 and D7)

Unfortunately, the analysis of the target tissue often requires a terminal experiment (e.g., neurodegenerative diseases and brain neoplasms). In these cases, certain terminal experiments can be avoided if the target is also present in an accessible tissue (ideally blood), so long as a target-specific Kp(TE) is determined (%TEtarget tissue/ %TEblood). This allows extrapolation of exposure in the target tissue through the following equation [%TEtarget tissue = %TEblood * Kp(TE)]. Likewise, if an alternative RoA (to IV) is to be considered and the target is present in blood, the ratio of the AUCs (%TE vs. time) in blood for oral vs. IV administration provides an absolute F(TE) (F with respect only to Target Engagement, for derivation, see Supplementary Fig. 2). The oral LD50 may then be determined to assure the predicted dose is within the safety margin. If the target is not present in an accessible tissue, a relative F(TE) can only be determined as the ratio of AUCs in the target tissue for two different routes of administration, as described in the main text.

Reproducibility, sensitivity, accuracy, precision, and matrix effect results

The linearity of the intact protein LC-MS assay was established over the concentration range of 0.5 µM – 25 µM for hSOD1 monomer, and with similar concentration range of hSOD1 incubated with a fixed amount of S-XL6 (dimer concentrations), spiked into mouse whole blood. The selected concentration range covers the therapeutically relevant concentrations for SOD1 in healthy and disease-state (fALS) patients. The internal standard used in the quantitation was 13C, 15N-labeled ubiquitin eluting at 22.6 min in the LC-MS method76. Regression analysis was performed using GraphPad Prism software (10.1.2) and correlations of R2 ~ 0.93 (monomer) and R2 ~ 0.95 (dimer) were obtained. The lower limit of quantitation (LLOQ) of 0.5 µM for both monomer and dimer curves was obtained using peak area ratios. The selectivity of the LC-MS method was established with the detection of intact SOD1 monomer peak at 24.2 min and S-XL6-mediated SOD1 dimer peak at 25.5 min in total ion chromatograms, without interferring with any endogenous components (Fig. 1). The data for accuracy (% bias), precision, and matrix effect for the intact protein LC-MS assay are summarized in Supplementary Table 2 (monomer), Supplementary Table 3 (dimer), and Supplementary Table 4, respectively. The intra-run bias was between -5.97 – 1.63% for monomer and -3.25 – 19.2% for dimer, with intra-run precision values ranged from 1.9 – 8.4 % and 0.6 – 0.9% for monomer and dimer, correspondingly.

Consideration of semi-physiological PK parameters

We derived the iPK/PD model from the anachronistic (simplest) PK/PD model. In the anachronistic model the number of compartments is equal to the number of exponential terms. When distribution or elimination occurs in multiple compartments (e.g., tissues), this can be expanded for a given drug and target to more customized physiologically or semi-physiologically based PK models77,78. For example, a simplistic semi-physiological PK model would include blood compartment, assumes similar perfusion across all compartments to reduce model complexity, and produces a quick snapshot of the covalent drug’s fate in vivo. A more complex semi-physiological PK model would include different rate constants for different tissues. Jones H. M. et al., reviews the use of multi-tissue compartments as either perfusion (rate of blood flow) or permeability (drug-specific)-limiting kinetics77. When the iPK/PD requires more than two terms for a fit, it is important to recognize (see Supplementary section derivation of the iPK/PD) that the terms likely involve protein degradation constant, i.e., they are compartments in the sense of protein metabolism/catabolism; not compartments in the physiological sense (e.g., organ). The derivation and meaning of such terms of different organ compartments can be found in the work of Burlingame52. In the three term models of Burlingame’s work, one compartment is the free amino acid pool, the second is the total protein pool, and the third is protein of interest pool.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.