Introduction

Fluorescence-guided surgery (FGS) is being integrated into surgical practice to improve surgical outcomes. There are several intraoperative fluorescent agents approved for surgical imaging of cancer, and more than a dozen in late-stage trials1,2,3,4,5,6. Unfortunately, targeting nerve tissue has proved more challenging, and there are currently no approved fluorescent agents for nerve visualization. Developing targeted imaging agents to enhance nerve structures for FGS has lagged behind cancer-specific contrast agent development since unique antigens are more difficult to target in normal tissues compared to tumors that overexpress many unique cancer cell-surface antigens.

Improving intraoperative imaging of nerves represents a critical unmet need in surgical practice to prevent nerve injury in the 40 million operations conducted each year in the U.S7. Injury to nerves can leave subjects with significant morbidity, including chronic pain, numbness, or paralysis8. The impact of injuring even small branches of the facial nerve can result in significant cosmetic and functional quality of life challenges, including ocular dryness and difficulty eating9,10. Injury to the recurrent laryngeal nerve resulting in vocal fold paralysis occurs during many common procedures, including thyroid surgery, anterior approaches to the cervical spine, and carotid endarterectomy. These significant morbidities associated with iatrogenic nerve injury occur up to 13% in head and neck surgeries7.

Therefore, improving a surgeon’s ability to visually efficiently identify nerves during anatomic dissection is widely recognized to have a huge potential for a paradigm-shift in the field of surgery. In fact, improvements in surgical imaging technology and fluorescence chemistry are bringing several very promising agents to the clinic. Interest in real-time intraoperative fluorescence highlighting of nerves is growing11,12 because of the obvious clinical application. The field is very new, and approaches to the problem vary between groups. Summers et al. developed a promising near-infrared probe which has the advantages of excellent tissue penetration at the higher wavelength13,14. In addition, nonspecific fluorophores, such as indocyanine green alone, have also been reported in clinical studies to improve intraoperative visualization of nerves using near-infrared fluorescence imaging, but this data is has not been widely reported15,16. It is very likely that in the near future, surgeons may have several options of imaging agents and devices to assist in the identification of nerves. However, surgeons today must rely on surgical experience and electrophysical stimulation techniques for identification of nerves.

As nerve preservation is a critical component of multiple surgical procedures, multiple nerve imaging agents are under development including NIR oxazines17,18, GAP-43-Ra-PEG-ICG19, and HNP40120. GAP-43-Ra-PEG-ICG highlights GAP-43 positive neural cells and oxazine-derived dyes are localized to myelin. We developed bevonescein, a small peptide labeled with fluorescein, a fluorophore in the visible range that has been successfully tested in preclinical models20,21,22,23,24. The specific binding site of bevonescein is not yet fully understood and is being evaluated with additional studies. We have shown in animal models binding of bevonescein to the structural extracellular matrix component of the nerves, rather than axons or myelin, which allows visualization of both intact and degenerated nerves in preclinical models25.

We designed this clinical trial for patients at high risk for nerve injury – those undergoing surgical procedures that require identification and preservation of the facial nerve (parotidectomy), recurrent laryngeal nerve (thyroidectomy) and the spinal accessory nerve (neck dissection). Although several promising agents are actively undergoing evaluation, there is currently no agents approved for real-time intraoperative fluorescence highlighting of nerves26,27. Here we report the results of a clinical trial evaluating an agent, bevonescein (ALM-488) designed to improve nerve visualization28.

Results

Clinical trial profile

Of the 27 enrolled patients, 70% were under 65 years of age, 26% were Hispanic or Latino, and 48% were female (Table 1). Parotidectomy was the most common surgical procedure (52%), followed by cervical lymphadenectomy (33%) and thyroidectomy (15%). All patients underwent surgery for neoplastic disease. The first 15 patients were enrolled in the dose-escalation cohort (Cohort A), which was designed as a traditional 3 + 3 study design, where subjects received the study drug in tiers: 100, 200, 400, 500, and 600 mg. Once the optimal dose was identified, 12 additional patients were in enrolled at the optimal dose to determine optimal timing between infusion and surgical incision (Cohort B). In the timing cohort, there were 6 subjects who received the agent 1–3 h prior to surgery start, and 6 subjects who were dosed 3–5 h prior to surgery. A total of 102 unique nerves were identified in the study (Table 2).

Table 1 Demographics and baseline clinical characteristics
Table 2 Number of unique nerves per cohort

Primary Outcome: Safety

The safety profile following intravenous ALM-488 administration was evaluated using multiple metrics including vital signs, physical examination including neurological exam, adverse event monitoring, evaluation for evidence of dose-limiting toxicities (DLT), infusion reactions as well as cardiac monitoring using electrocardiogram (ECG) and clinical laboratory assessments including urinalysis, hematology, chemistry, coagulation markers, biomarkers including histamine, tryptase, C3a, and C5 before, during and after infusion and for 30 (+5) days following infusion.

For all doses tested, no DLTs were reported and there were no infusion reactions. At any dose level including the maximal dose of 600 mg, there were no dose-limiting toxicities (DLTs). Twelve of the 27 (44%) subjects experienced one or more adverse events (AE), with a total of 32 AEs reported (Table 3). Of these AEs, a single AE was assessed as possibly related to the study drug (retching without vomiting), which resolved without intervention. Two serious AEs (SAE; dental infection, neck infection) occurred in the 27 subjects (7.4%). Neither of these was considered to be related to the study drug. No patients were discontinued from the study due to AEs. There were no patients who met study stopping or suspension rules. There were no patients who reported IRR or adverse device effects. No deaths were reported during the study.

Table 3 Adverse events

There were no trends in changes from baseline to any of the timepoints in vital sign parameters in any of the cohorts. The changes from baseline in vital signs were small, and no clinically significant or relevant differences were observed among cohorts. Overall, the majority of patients in all cohorts had normal physical examination findings at all timepoints analyzed, except for the head, eyes, ears, nose, and throat body system. A total of 22 (81.5%) patients reported abnormal results and these were comparable across all treatment cohorts, and none of the abnormalities were reported as an AE. All patients had normal ranges or not clinically significant abnormalities for hematology, chemistry, coagulation, and urinalysis as deemed by the investigators, showed no clinically relevant differences among cohorts, and did not indicate any unexpected safety issues (Table 4). All patients had normal or non-clinically significant abnormal ECG values, and these were comparable across all cohorts. Overall, the changes from baseline in 12-lead ECG were small, and no clinically significant or relevant differences were observed among the cohorts. Out-of-range histamine values were observed after infusion in 14 of the 27 patients, of which 11 had out-of-range values or values not done at pre-infusion (Table 5). None of the histamine out-of-range values were deemed by the investigators to be clinically significant, and no AEs were reported as a result of out-of-range blood allergy levels. Tryptase and complements C3a and C5 levels remained stable during the study (Table 4).

Table 4 Quantitative Laboratory Parameters (Hematology, Chemistry, Coagulation, Urinalysis, Blood Allergy Levels) by Cohort
Table 5 Histamine Values in All Patients (Original Units)

Secondary outcome: pharmacokinetics

Pharmacokinetics of bevonescein showed the mean half-life ranged from 29 to 72 min (Fig. 1). The mean AUClast values of the bevonescein not bound to plasma or tissue proteins (unbound) increased 6.1-fold over the 6-fold increase in dose suggesting dose proportionality. The mean unbound AUClast values were about 3.3- to 3.5-fold lower than the total bevonescein AUClast values, suggesting that the free fraction was approximately 30%.

Fig. 1: Pharmacokinetics of bevonescein (presented in log scale).
Fig. 1: Pharmacokinetics of bevonescein (presented in log scale).
Full size image

This demonstrates the dose-dependent pharmacokinetics of bevonescein, consistent with other systematically administered proteins. Detection at levels below 10 ng/mL was not included in the dataset. Each color represents different dose cohorts and error bars represent standard deviation where n ≥ 3. A sample size of the subjects for each dose group was the following: 100 mg, n = 3 patients; 200 mg, n = 3 patients; 400 mg, n = 3 patients; 500 mg, n = 14 patients; 600 mg, n = 3 patients. The center is defined as the mean. Source data are provided as a Source Data file.

Secondary outcome: optimal dose of bevonescein

In evaluating the secondary endpoint of optimal dose, we found that all doses of bevonescein (100–600 mg) yielded nerve fluorescence intraoperatively with significantly higher SBR in fluorescent (FL) compared to white light reflectance (WLR) images at all doses and across all nerve categories (Fig. 2). At 600 mg there was increased background fluorescence (mean pixel intensity (MPI) 73.2 ± 34, compared to 39.5 ± 17 in the 500 mg cohort; Welch ANOVA p < 0.01) which impaired tissue contrast (SBR 2.2 ± 1.6, compared to 3.0 ± 1.0 in the 500 mg cohort; Kruskal-Wallis p > 0.05) (Fig. 3). No further dose escalation was pursued beyond 600 mg due to imaging considerations. The 500 mg dose was determined to have optimal improvement in nerve visualization relative to other doses and was therefore chosen as the optimal dose for the timing cohorts (see examples of intraoperative images in Fig. 4 and additional examples in Supplementary Fig. 2).

Fig. 2: Comparison of SBR in WLR and FL images (all cohorts).
Fig. 2: Comparison of SBR in WLR and FL images (all cohorts).
Full size image

a The SBR of nerves in FL images was significantly higher (p < 0.001) compared to the paired WLR images using two-sided Wilcoxon rank sum test with the sample size n = 109 for each group. *** indicates p-value < 0.001. b The SBR of FL images were significantly higher compared to the paired WLR images across the nerve categories on a two-sided Wilcoxon rank sum test (p-value < 0.01 for all categories). Adjustments were not made for multiple comparisons. “CN main” includes the facial nerve main trunk, spinal accessory nerve, the hypoglossal nerve, and the vagus nerve (n = 28 unique nerves for each FL and WLR). “CN branch” includes recurrent laryngeal nerve and any branches off the main cranial nerve (n = 17 unique nerves for each FL and WLR). “FN division” includes the upper and lower divisions of the facial nerve (n = 13 unique nerves for each FL and WLR). “FN branch” includes branches from the divisions of the facial nerve, such as the buccal, marginal mandibular branches (n = 27 unique nerves for each FL and WLR). “Sensory” includes the greater auricular nerve (n = 22 unique nerves for each FL and WLR). Standard boxplots are shown with median as the bold center line, mean as a diamond, the first and third interquartile range (IQR), and the highest and lowest values within 1.5*IQR as hinges, and outliers as additional points. Green represents FL data, and gray represents WLR data. Source data is provided as a Source Data file. SBR signal to background ratio, FL fluorescence, WLR white light reflectance, CN cranial nerve, FN facial nerve.

Fig. 3: The optimal dose of bevonescein is 500 mg.
Fig. 3: The optimal dose of bevonescein is 500 mg.
Full size image

A The MPI of nerves and background in FL images were evaluated, and the background was significantly higher at the 600 mg dose compared to the 500 mg dose. The blue line and dots represent the background MPI, and the orange line and dots represent the nerve MPI. The centers of the lines represent the mean value. The gray shaded error regions represent the standard error. B The SBR of FL images at the 600 mg dose decreased compared to the SBR at the 500 mg dose. The green line and dots represent the SBR for FL images and the dark gray line and dots represent the SBR for WLR images. The centers of the lines represent the mean value. The gray shaded error regions represent the standard error. Source data is provided as a Source Data file. MPI – mean pixel intensity; SBR – signal to background ratio; FL – fluorescence; WLR – white light reflectance.

Fig. 4: Intraoperative nerve images at 500 mg bevonescein dose.
Fig. 4: Intraoperative nerve images at 500 mg bevonescein dose.
Full size image

Paired intraoperative nerve images from 4 different patients at 500 mg bevonescein dose. WLR images (AD) and the same field of view seen with fluorescence signal overlayed on WLR (EH). Nerves on the surgical field surface (solid yellow arrows) appear yellowish/green compared to adjacent non-nerve tissue (reddish). There is white surgical gauze visible in the left lower quadrant of A. Nerve not on the surgical field surface (dashed yellow arrows) are more easily discernible in F, G, H compared to B, C, D. WLR – white light reflectance; Paired –white light reflectance with fluorescence overlay. The Visualization Scoring System utilized the WLR and Paired images. The SBR analysis utilized the WLR and FL (grayscale) images.

Among those who received the 500 mg dose, there were 54 evaluable nerves in 14 subjects. Objective measurements showed the SBR was significantly higher under FL compared to WL (2.1 ± 0.8 vs.1.3 ± 0.1; p < 0.05). Subjectively, surgeon evaluations were obtained using the 4-point discrete Likert Visualization Scoring System (VSS) for 14 subjects who received 500 mg bevonescein. All three categories of the VSS (Contrast Enhancement (CE), Length Measurement (LM), and Branching Delineation (BD)), were superior when evaluating the paired FL overlay images compared to the WLR images (Table 6). We found that 78% of individual nerves (n = 42/54) showed improvement in length measurements, and more than a quarter of the nerves had greater than 20% increase in length of the nerves visualized (Table 7). When individual subjects were considered, we found that every subject (100%) had some improvement in visible nerve length, and 57% of subjects showed a >20% improvement in detectable nerve lengths. Surgeon confidence, nerve discrimination, and the value of fluorescent imaging were all improved in about two-thirds of cases on the NERVE Rating Scale (Supplementary Table 3).

Table 6 Improved contrast enhancement, length measurement, and branching delineation of 500 mg bevonescein using VSSa
Table 7 Improvement of length measurements in subjects receiving 500 mg bevonescein from the VSS

Secondary Outcome: optimal time from infusion to surgery

The signal to background ratio was calculated based on images captured during surgery as a prespecified secondary endpoint. The fluorescent SBR of nerves between the early (1–3 h) versus late (3–5 h) timing cohorts were not statistically different (Fig. 5a). The peak SBR intensity was at 350 min from study drug administration to the time the image was captured, although interpretation is confounded by several outliers with high SBRs during this time interval (Fig. 5b). Not surprisingly, larger nerves (≥2 mm in width) appeared to have higher SBR compared to the smaller nerves (<2 mm) during this window of peak intensity (Fig. 5c).

Fig. 5: Effect of time from infusion to intraoperative imaging on SBR in patients who received 500 mg of bevonescein.
Fig. 5: Effect of time from infusion to intraoperative imaging on SBR in patients who received 500 mg of bevonescein.
Full size image

a There was no significant difference in SBR of the key nerves when comparing the two timing cohorts: bevonescein administered 1–3 h (n = 36 unique nerves) vs 3–5 h (n = 58 unique nerves) prior to incision. “ns” = not significant using two-sided Wilcoxon rank sum test. Standard boxplots are shown with median as the bold center line, mean as a diamond, the first and third interquartile range (IQR), and the highest and lowest values within 1.5*IQR as hinges, and outliers as additional points. b The peak SBR intensity was at 350 min from study drug administration to the time the image was captured. The green line and dots represent the SBR for FL images, and the dark gray line and values represent the SBR for WLR images. The centers of the lines represent the mean value. The gray shaded error regions represent standard error. c The effect of time from infusion of 500 mg bevonescein to intraoperative imaging on SBR on nerve width. The presented data included all patients who received the 500 mg dose (including those from the dose-escalation cohort and the dose-timing cohort). The red line and dots represent the SBR for FL images for nerves less than 2 mm, and the blue line and dots represent the SBR for FL images for nerves greater than or equal to 2 mm. The centers of the lines represent the mean value. The gray shaded error regions represent the standard error. Source data is provided as a Source Data file.

Discussion

This clinical trial of a nerve-targeted fluorescent nerve agent demonstrates objective increase in signal to background ration (SBR 2.08 ± 0.83 in FL vs.1.32 ± 0.15 in WLR; p < 0.05) and improvement in subjective surgeon assessment of the nerve contrast (nerve conspicuity/contrast enhancement 3.3 ± 0.6 in FL vs 2.6 ± 0.5 in WLR; p = 0.001 and improvement in nerve length in 78% of subjects with FL images). Although the improvement in length measurement between the two groups does not have obvious clinical implications, we chose this parameter as a proxy for determining the directionality of where the nerve is heading, when it is no longer clearly visible on the surface with white light reflectance. This directionality information may be useful to direct the dissection/tracing of the nerve. There were no adverse events greater than grade 1 attributable to the study drug, and no surgical procedures were canceled or delayed as a result of the infusion. The decision for the 500 mg dose was driven by signal intensity and safety. The economics of drug pricing versus production costs are outside the scope of this manuscript.

The optimal dose and timing were identified based on separate cohorts. Since the study did not identify a maximally tolerated dose and the SBR was relatively consistent across all tested doses, the optimal dose was selected based on the mean pixel intensity (MPI). Use of visible dye requires that the intensity of the signal be strong enough for visualization with the surgical field especially in the presence of exogenous light sources while minimizing the intensity of the adjacent background tissue. Categorical 4-point discrete Likert evaluation (1–3 h versus 3–5 h) of time from the end of infusion to the first skin incision at the optimal dose of 500 mg did not demonstrate any statistical difference in nerve evaluation between cohorts. When the timing was measured from the end of infusion to when the image was captured, the highest SBR was measured around 300 min (Fig. 5). This highlights the distinction between quantitative SBR measurements and surgeon intraoperative experiences, which may not directly follow expectations based solely on single time-point quantitative fluorescence measurements.

In order to optimize integration into the surgical workflow, bevonescein was tested using a wide range of time from infusion to skin incision. The data supported a relatively broad timeframe for clinical or subjective visualization (based on nerve conspicuity Contrast Enhancement Score). However, SBR data when measured from still images showed an optimum time of 300 min from infusion to image. In this setting, while SBR can be more precisely measured and quantified, the clinical timing interval was selected for bevonescein-enabled nerve visualization conspicuity. Our experimental design recognized that the study needed to demonstrate improved nerve visualization agent should add quantifiable value to experienced surgeons, rather than to trainees. To this end, only data from attending surgeons are used in the statistical analyses (although trainee data was also collected). Furthermore, to fully delineate the range of the subjective experience, attending surgeons with a range of familiarity with fluorescence-guided surgery were included in the study.

Although this study is the first clinical trial of a targeted nerve agent, non-specific imaging agents have been introduced that have improved nerve visualization. A clinical study of high dose ICG infused the day before surgery utilizing the second window effect improved nerve visualization29, although these findings are not reported in dozens of published studies using the same dosing protocol to image lung, head and neck, and brain cancers30,31,32,33. Furthermore, case series evaluating ICG in cranial nerve surgeries (vestibular, trigeminal, vagus schwannomas) showed inconsistent results in highlighting the normal versus tumor tissue with decrease in fluorescence intensity intraoperatively, which can be attributed to the likely lack of neurotropic specificity of ICG34,35,36. ICG is a commonly used fluorophore in FGS applications since it is non-toxic and has unique optical properties, including improved tissue penetration and reduced tissue autofluorescence. Because ICG and other near-infrared dyes (e.g., IRDye800) are outside the visible wavelength on the electromagnetic spectrum, fluorescence can only be visualized using a camera and the images displayed on a screen, adding complexity to the surgical setup.

Since multiple infrared agents are being developed for intraoperative tumor visualization, we considered it advantageous for a nerve imaging agent to emit in a different spectral range so that dual-imaging (tumor and nerve) would be possible in the same case. Although infrared agents are within the workflow for endoscopic, laparoscopic, robotic, or other minimally invasive techniques, they remain outside the workflow of open surgical procedures. Those agents within the visible wavelength, such as fluorescein (the active fluorophore in bevonescein), can be visualized by the unaided eye with appropriate illumination and filters within the open surgical field. Of note, visualization with sodium fluorescein alone has also been reported in small case series showing increased fluorescence intensity, in schwannomas, neurofibromas, and spinal cord lesions compared to normal nerve tissue, likely due to accumulation of sodium fluorescein due to abnormal tumor vascular permeability37,38,39,40,41. Thus, although these studies highlight the role of fluorescein in targeting abnormal nerve neoplasms, they not address the unmet need of identifying pathologically uninvolved nerves to avoid iatrogenic nerve injury.

Surgeons rely on multiple sensory inputs that are often used in combination to identify and preserve critical structures, especially when the anatomy is distorted by the pathological process that provided the surgical indication. Enhancement of nerves traversing the operative field provides an incremental advantage to the surgeon when integrated with surgical skill, experience, and other sensory inputs. Given the number of factors that contribute to successful nerve preservation, the incremental value of improving nerve visualization is difficult to quantify. Ultimately, these technologies will not be adopted by the surgical community unless they provide clinical benefit and/or increase surgeon confidence. Similarly, evidence that nerve preservation using intraoperative nerve monitoring (IONM) was not documented until it became widely used, in large part because nerve monitoring improved surgeon confidence – a subjective parameter that is difficult to quantify. However, IONM has widespread use and the value of nerve monitoring has been demonstrated now using retrospective studies for thyroid surgery42,43,44, parotid surgery45,46, and a variety of other surgeries throughout the body, although it continues to be controversial47,48,49,50. Just as nerve monitoring provides additional information during the surgery – improvements in surgical technology can provide complementary data to surgeons to reduce iatrogenic nerve injury. Although many agents are being developed in the NIR range, agents in the visible wavelength support open field surgery and can be used with tumor detection agents that are often in the NIR range. Furthermore, as an imaging agent it is a significant improvement over the current standard of care using electrophysical stimulation, which gradually results in nerve weakness with overstimulation and cannot identify sensory nerves. Perhaps more importantly, the use of a wide field imaging technique, nerve monitoring, and stimulation techniques should provide mutually reinforcing information and could effectively be used to reduce iatrogenic nerve injury, much like the use of MRI, CT, or US, which have their own advantages and limitations. In surgical procedures where microscopic magnification is within the surgical workflow, a surgical microscope with a 560 nm filter can be used to visualize bevonescein. To address wide field procedures, we have ongoing studies to assess the feasibility of surgical loupes that have integrated filters appropriate for bevonescein to be used throughout the surgery.

Intraoperative imaging trials are complicated by the balance of objective and subjective improvements. If the signal-to-noise ratio improves objectively by objective measures, it does not necessarily translate to clinical relevance. On the other hand, subjective improvements in the visualization of nerves is biased and surgeon-dependent. Importantly, we demonstrated a statistically significant difference could be measured in both objective and subjective measurements. A reduction in iatrogenic injury is the ultimate demonstration of benefit, however, the variations in clinical settings and surgical technique will make this challenging to demonstrate. Given the favorable safety profile and encouraging initial results, several multi-institutional Phase 3 studies are currently underway.

Methods

Trial design and oversight

Our prospective open-label Phase 1 trial evaluated the safety and efficacy of bevonescein for intraoperative fluorescence nerve imaging (ClinicalTrials.gov NCT04420689). Bevonescein (ALM-488) is a synthetic peptide of 17 amino acids conjugated to a fluorescein derivative (FAM; 5-carboxyfluorescein) that selectively binds to nerve tissue. The trial was conducted at three academic hospital sites in the United States (Jacobs Medical Center, University of California, San Diego; Stanford Hospital, Stanford University; and Massachusetts Eye and Ear Infirmary, Harvard University). For this study, the three clinical sites were participants of the SMART IRB platform51. University of California, San Diego IRB served as the approving institutio,n and the other institutions (Stanford and Mass Eye and Ear) formally ceded approval according to SMART IRB policies.

Investigators from all three sites were required to attend multiple meetings to ensure consistent intraoperative data collection as specified in the protocol. Written informed consent was obtained from all participants, and the study was conducted in adherence with the Declaration of Helsinki. The Study Protocol and the amended Study Protocol are available in Supplementary Information. The study was amended to designate the trial as Phase 1/2 by the FDA, and further clarification of screening labs and follow-up visits.

Inclusion criteria included adults ≥18 years of age with a neoplasm in the head and neck region undergoing head and neck surgery (thyroidectomy, parotidectomy, cervical lymphadenectomy) as part of standard of care treatment. Included subjects had normal liver and renal function, and if of childbearing potential, had a negative urine or serum pregnancy test and were using a medically acceptable form of contraception (e.g., hormonal birth control, intrauterine devices, double-barrier method) or abstinence. If male, the subjects were included if using a medically acceptable form of contraception (e.g,. condom) or abstinence. Exclusion criteria included prior surgery or radiation to the surgical bed, prior chemotherapy, cardiac arrhythmia not controlled with medication, history of stroke within one year, heart failure within past year, history of drug-induced acute tubular necrosis, other investigational drug exposure within past 6 weeks, pregnant or breastfeeding, fluorescein allergy, or severe or steroid dependent asthma.

Following informed written consent at a preoperative encounter, laboratory tests (serum: hematology, coagulation, chemistry, and inflammatory marker panels, pregnancy test (if applicable); urine: urinalysis), electrocardiogram (ECG), and a chest X-ray were obtained.

The study aimed to enroll 21–48 subjects; 9–36 subjects in the dose escalation/de-escalation cohorts and 12 subjects in the dose timing cohort. The dose-defining portion of the study included dose escalation and dose de-escalation cohorts of 3–6 patients per cohort. The dose-escalation tiers included potential doses of 100–100 mg. The dose de-escalation tiers included potential doses of 12.5–50 mg. We initiated the study with a dose of 100 mg based on FDA guidance and non-clinical toxicology studies. Progression to the next dose tier was contingent on safety in three subjects, without DLT, with the option to enroll three additional subjects at any given tier, if needed. Safety and efficacy review of each cohort was completed before progressing to the next cohort. Five doses (100 mg, 200 mg, 400 mg, 500 mg, 600 mg) with 3 subjects at each dose were evaluated with a total of 15 subjects. No safety-related concerns were identified at any dose tested. The 500 mg dose was identified as optimal for safety and efficacy and was chosen for progression to the dose-timing portion of the study. A total of 15 subjects were accrued to the dose-defining cohorts.

Dose-timing cohorts used the optimal imaging dose to compare administration at 1–3 h versus 3–5 h before surgery. Six subjects were enrolled in each cohort. A total of 12 subjects were accrued to the dose-defining cohorts.

The total study was 27 subjects (15 subjects in the dose-defining cohorts +12 subjects in the dose-timing cohorts). Subjects received a single dose of bevonescein via intravenous infusion over 30 min on Day 1. Lab tests and ECG were repeated on Day 1 prior to administration of the study agent and at the end of infusion. For safety monitoring, subjects remained on-site for at least 23-h after dosing. Safety assessments, including adverse events (AEs), and clinical laboratory tests were performed at Day 1, Day 15 (±8 days), and Day 28 (+5 days). A follow-up visit and lab tests were performed on Day 15 (+/−7 days), and an end-of-trial encounter and laboratory tests on Day 28 (+5 days). Participants were compensated with $50 at each visit, up to four total visits.

The first participant’s first visit date was 6/18/2020, and the last participant’s last visit date was 6/25/2021.

Intraoperative imaging

The operating surgeon conducted the initial phases of the operation, including nerve dissection and exposure consistent with the surgeon’s preference. Per FDA guidelines to maintain the standard of care, once a nerve is within the surgical field, or is expected to be within the surgical field, the microscope with fluorescence capability (Carl Zeiss Tivato surgical microscope with Meditec AG yellow 560 FL Module) was brought in and was used to obtain white light reflectance (WLR) and fluorescent (FL) images during the surgery pertaining to important steps of the surgical dissection and identification of key nerves. Index nerves were defined for each surgery type – parotidectomy: facial nerve branches (frontal, zygomatic, buccal, mandibular); thyroidectomy (recurrent laryngeal nerve); cervical lymphadenectomy (mandibular branch of the facial nerve, vagus nerve, spinal accessory nerve, hypoglossal nerve) (Supplementary Fig. 1. CONSORT Flow Diagram).

To standardize the image capture, the surgeons were instructed to place the camera at a distance of 200 mm, and directly positioning the camera over the surgical field. To minimize the impact of exogenous light sources, the surgeons were instructed to ambient turn off room lights were turned off and the overhead, and ensure that spot lights were turned directed away from the surgical field, and window shades (if applicable) to be in the down position or were closed, whenever possible during image capture. The superimposition of fluorescence was in real time and through the microscope oculars. Thus, the operating surgeon, via the handgrip controller, was able to toggle back and forth between white light alone, vs. white light with fluorescence overlay to see nerves either with white light reflectance alone (WLR) or white light paired with fluorescence overlay (paired).

Outcomes

The primary objective of the Phase 1 study was to evaluate the safety of bevonescein administered by intravenous infusion by assessing the dose-limiting toxicity (DLT) according to the National Cancer Institute (NCI) Common Terminology Criteria for Adverse Events (CTCAE). Subjects had safety assessments (including clinical laboratory tests and assessment for AEs) on day 1 (day of surgery), day 15 (+−8 days), and day 28 (+−5 days). Secondary objectives were to characterize the pharmacokinetics, determine the dose needed to generate optimal nerve fluorescence, and evaluate the timing of administration. Serum pharmacokinetic (PK) samples were taken on Day 1 pre-dose and at 5, 15, and 30 min, and 1, 2, 3, 8, 12, and 22-h.

Exploratory outcomes: quantitative image analysis

Signal-to-background ratios (SBRs) of the identified nerves in the WLR and FL images were used as a standardized measure to compare across images and subjects, with the background as adjacent non-nerve tissue. All images were reviewed by the team performing image analysis, and optimal images with key indicator nerves per procedure were selected for annotation and analysis. All images were imported into ImageJ, and there was no manipulation of the raw image. For WLR images, the overlay image was used as the raw image. For FL images, the image was split into RGB channels, and the green channel image was used for analysis. For each visualized nerve and adjacent tissue, 5–20 ellipsoid regions of interests were drawn within each tissue. Using a known distance in each image (visible ruler within the operative field and image), the images were set to scale, and the length of the nerves were then measured. The pixel intensities were measured for each region of interest, and the mean of each tissue was then calculated, and background subtraction was performed. The SBR was then calculated by dividing the mean pixel intensity of the nerve by the mean pixel intensity of the background within each image.

Exploratory outcomes: qualitative image analysis

Surgeon questionnaires were utilized—Surgical Imaging Worksheet (SIW) (Supplementary Table 1), Visualization Scoring System (VSS) (Supplementary Table 2), NERVE rating scale (Supplementary Table 3). The VSS scale was developed with the goal of standardizing a clinically relevant rubric to objectively and reproducibly assess surgeon experience in WLR and FL conditions (The VSS included assessments of: Contrast Enhancement (CE) as None, Weak, Clear, Clear or Bright; Length Measurement (LM), in millimeters; and Branching Delineation (BD) as None, Poorly visible, Sufficiently visible, or Excellent visibility). All VSS assessments are done by the operating surgeon at the time of surgery by visualizing the field through the microscopic oculars.

Statistical Analysis

For this Phase 1 study, no formal power calculations were performed. The sample size was determined to be consistent with other studies to meet study objectives. The design of this trial was based on a dose-escalation schema with 3–6 subjects at each tier. The dose tiers were based upon pre-clinical safety data and practical considerations and were designed with input from the FDA, not on a formal statistical power calculation. Safety, clinical, pharmacokinetic, and exploratory outcomes were summarized according to dose with the use of descriptive statistics. Standard boxplots with median as the bold center line, mean as diamond, the first and third interquartile range (IQR), and the highest and lowest values within 1.5*IQR as hinges, and outliers as additional points. The Wilcoxon rank sum test was used to compare the means of the SBRs between FL and WLR images. The optimal dose was determined by evaluating the mean pixel intensity (MPI) of the identified nerves and background of adjacent non-nerve tissue on both FL and WLR images as well as evaluating the SBRs. For normally distributed data with homogeneous variances, ANOVA was used to compare the MPIs of the nerve; for normally distributed data with non-homogeneous variance, Welch ANOVA was used to compare the MPI of the background; and for non-normally distributed data, Kruskal-Wallis was used to compare the SBRs. Post-hoc testing was performed using the Tukey method with the caveat of small sample size and multiple testing in this preliminary study. Scatterplots with trend lines applied with locally estimated scatterplot smoothing (LOESS) and standard errors in gray were used for visualization. Data from the VSS and the NERVE questionnaire were analyzed using the Student’s t-test and paired 2-tailed statistical tests as appropriate. All statistical analysis was performed using R (version 4.4.2) and RStudio (version 2024.12.0 + 467).

The parent peptide sequence, from which bevonescein (ALM-488) was derived, was initially identified by phage display using an m13 phage library. Bevonescein and ALM-488 (referenced in the study protocol) are synonymous – ALM-488 is the internal name for the drug prior to the United States Adopted Names (USAN) Council designation. To optimize and determine the core binding domain of the parent peptide, a binding analysis assessment was conducted after systematic deletion of two amino acids from the C or N terminus (Hingorani et al. 2018). Bevonescein is a variant of the parent peptide, with removal of 2 amino acids from the N terminus. An assessment was conducted to compare versions of bevonescein with FAM attached to the C terminus versus the N terminus, which determined that FAM attached to the N terminus demonstrated the highest intensity (Hingorani et al, Theranostics 2018). No difference in toxicity was noted between the different versions of the peptide sequences from which bevonescein was derived. Based on this data, the version of bevonescein used in the clinical trial described here has FAM attached to the N terminus.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.