Introduction

Fine needle aspiration (FNA) represents the preferred diagnostic method for assessing suspicious thyroid nodules, with the initial focus being on the selection of the optimal puncture needle. Within the realm of thyroid FNA, two primary types of puncture needles are commonly employed: the conventional syringe needle, devoid of a stylet1,2,3, and the stylet-equipped puncture needle4,5,6,7,8,9,10,11. Some reports4,5,6,7,8 have advocated the adoption of stylet needles in thyroid FNA, suggesting that the presence of a stylet prevents the needle lumen from potential contamination by extraneous non-lesion materials, including blood, prior to penetration into the targeted lesion. This hypothesis implies an enhancement in specimen quality using stylet needles. However, it is worth noting that in other areas of FNA, such as gastrointestinal and respiratory endoscopic FNA, the value of stylets has come under scrutiny in recent years. An increasing body of literature12,13,14,15,16,17,18 has raised doubts regarding the efficacy of stylets in improving specimen quality and has, in some instances, even associated them with a decline in specimen quality19. Furthermore, stylet needles are comparatively costly when compared to syringe needles, potentially imposing a substantial financial burden on patients undergoing FNA. In the context of the millions of patients worldwide who undergo FNA for thyroid nodules each year20, the issue of cost assumes paramount significance.

Given the ongoing debate concerning the value of stylet needles, we conducted a preliminary study21 to assess the impact of using needles with or without stylets on specimen sampling. In this study, we compared puncture needles (Fig. 1) of identical material, length, and gauge to elucidate whether the presence of a stylet improved specimen quality. Our preliminary findings revealed that the use of a needle stylet did not yield an improvement in specimen quality. However, it should be borne in mind that these results do not directly address the comparative merits and drawbacks of syringe needles versus stylet needles in specimen sampling, as differences in material, needle morphology, length, and other factors exist between the two (Fig. 2). Therefore, the primary objective of this study was to directly compare whether syringe needles are non-inferior to stylet needles in terms of specimen adequacy for thyroid FNA.

Fig. 1
figure 1

Stylet needles with or without stylets used in our preliminary study.

Methods

Patients and ethics approval

This study prospectively enrolled patients with thyroid nodules who underwent FNA between November 2022 and December 2023. Inclusion criteria comprised (1) age 18 years and older, (2) the ability to provide informed consent, and (3) ultrasound indications of thyroid nodules. Exclusion criteria encompassed (1) undergoing anticoagulation therapy, (2) cystic nodules, and (3) inability to provide informed consent. Ethical approval was obtained from the hospital ethics committee (Approval No: 2022-80), and written informed consent was secured from all participating patients. The methods in this study were performed in accordance with the relevant guidelines and regulations. This clinical trial has been registered in the Chinese Clinical Trial Registry (Registration number ChiCTR2200065318, first registration on 02/11/2022).

Randomization

Each nodule underwent four punctures, twice with a conventional syringe needle and twice with a stylet needle. To mitigate the potential impact of bleeding from the previous puncture on subsequent specimens, nodules were randomly allocated into two groups. One group was initially punctured with a stylet needle, resulting in the sequence of punctures being stylet needle, syringe needle, stylet needle, and syringe needle for this group of nodules. The other group was initially punctured with a syringe needle, leading to the sequence of punctures being syringe needle, stylet needle, syringe needle, and stylet needle for this group of nodules. The random sequence of punctures using stylet or syringe needles was generated by a computer program and sealed in an opaque envelope. The envelope was opened by the operator immediately before each puncture, following patient confirmation of enrollment.

FNA procedure

All FNAs were performed by the same operator using an ultrasound scanner (M9, Mindray, Shenzhen, China) equipped with a 7.5–15 MHz linear array transducer. A 25G disposable syringe needle (RWLB, Lelun, Changzhou, China) and a 25G disposable stylet needle (CCZA, Leapmed, Suzhou, China) were used for syringe and stylet needle FNAs, respectively. Both needle types are illustrated in Fig. 2. To prevent tumor needle tract metastasis, needles were not reused. Patient medical records included information about the number, echogenicity, composition, Thyroid Imaging Reporting and Data System (TIRADS) score, long diameter, and calcification of the punctured nodules. In cases of multiple nodules, cytologic examination was performed on suspicious nodules. In the absence of suspicious nodules, the dominant (largest) nodule was sampled.

Fig. 2
figure 2

25-gauge stylet needles and 25-gauge syringe needles used in this study.

FNA was conducted following previously described procedures21. Each nodule was punctured four times, twice with a stylet needle and twice with a syringe, in a predetermined order. When using a stylet needle for puncture, the stylet was removed under ultrasound guidance upon puncture into the nodule, and the needle was withdrawn after rapid back-and-forth movement within the nodule for 15–20 times or when sample material was visible at the center of the needle. When using a syringe needle for a puncture, the steps mirrored those for a stylet needle, except for the removal of the stylet. Negative pressure aspiration was not employed in either needle FNA, and no on-site adequacy assessment was conducted.

After each puncture, a 5-mL air-filled syringe was used to expel the contents of the needle lumen onto a clean slide. Each puncture produced one slide specimen, which was labeled in the order of puncture as slides A1, B1, A2, and B2. This approach ensured that the choice of stylet or syringe needle for the first puncture was randomized, preventing potential bias on the part of the pathologist. Specimens were air-dried, fixed in 95% alcohol, and stained with hematoxylin and eosin.

Cytologic evaluation

Five pathologists evaluated all slides using the Bethesda System for Reporting Thyroid Cytopathology (TBSRTC)22, with each patient’s slides evaluated by the same pathologist. Adequate specimens should include at least six groups of benign follicular cells, with each group containing a minimum of ten cells. Moreover, specimens are considered sufficient for diagnosis if they contain an adequate number of typical cells, atypical cells, or large amounts of colloid, even if the required six groups of follicular cells are not identified. Each nodule’s two stylet needle specimens and two syringe needle specimens were independently assessed as whole specimens, meaning that slides A1 and A2 constituted specimens for “Nodule A”, while slides B1 and B2 were specimens for “Nodule B”. A nodule was considered inadequate if both stylet needle specimens were deemed inadequate. Similarly, the syringe needle specimen of a nodule was classified as inadequate if both syringe needle specimens were found to be inadequate.

Outcome variables

The primary outcome was the adequacy of specimens obtained with both needle FNAs. Secondary outcomes included the concordance of specimen adequacy between both needle FNAs, sensitivity, specificity, and the cost of both needle FNAs. Specimen adequacy was defined as the percentage of non-TBSRTC I diagnoses, i.e., TBSRTC II, III, IV, V, and VI diagnostic categories, as a proportion of the total number of specimens.

Costs were assessed in Chinese Yuan (CNY) and comprised anesthesia fees, ultrasound guidance costs, procedure fees, puncture needle costs, and pathology fees. The total cost of FNA for each needle, the cost of a single FNA, and the cost per adequate specimen were computed. The cost-effectiveness ratio (cost/adequacy) and incremental cost-effectiveness ratio (ICER) were calculated for both needle FNAs, with adequacy serving as the effect indicator. The ICER was calculated as follows:

$$\text{ICER} = \text{{(cA-cB)}}/{\text{{eA-eB}}}$$

Where, cA is the cost of stylet needle FNA, cB is the cost of syringe needle FNA, eA is the adequacy of stylet needle FNA, and eB is the adequacy of syringe needle FNA.

Sample size estimation

Sample size estimation was carried out using PASS software (version 11). We assumed that specimen adequacy for thyroid FNA using either syringe needles or stylet needles was 85.64%, as reported in our preliminary study21. Based on this assumption, we set the non-inferiority margin at 10%, with a type 1 error of 0.025 (one-sided) and a type 2 error of 0.8. It was also assumed that 10% of enrolled nodules would require exclusion. Consequently, the final calculation determined that 214 nodules were necessary.

Statistical analysis

Statistical analysis was conducted using SPSS software (version 23) and SAS software (version 9.4). Each nodule was treated as an independent unit for statistical analysis. Continuous variables were presented as mean and standard deviation, while categorical variables were expressed as frequency and percentage. Primary outcome statistics were analyzed using the Wald test, with the difference in adequacy between the two methods presented as a two-sided 95% confidence interval (CI). Syringe needle FNA was deemed non-inferior to stylet needle FNA if the lower bound of the two-sided 95% CI for the adequacy difference (syringe needle FNA minus stylet needle FNA) exceeded − 10%. Postoperative paraffin pathology was used to calculate the sensitivity and specificity of the two types of FNA, and both category V and VI specimens were treated as positive specimens, as both indicate a high likelihood of malignancy and follow the same clinical management approach (surgery was recommended), and the remaining categories were treated as negative specimens. Agreement between the adequacy of the two needle specimens was assessed using the Kappa test, with Kappa values interpreted as follows: 0.00 ~ 0.20, slight agreement; 0.21 ~ 0.40, basic agreement; 0.41 ~ 0.60, moderate agreement; 0.61 ~ 0.80, substantial agreement; and 0.80 ~ 1.00, almost complete agreement.

Results

Patients and lesions

A total of 193 patients were initially recruited for the study. Eight patients were subsequently excluded: one patient was under anticoagulation therapy, five patients declined to participate, one patient had no detectable nodules, and one patient presented a solitary cystic nodule (see Fig. 3 for the recruitment flowchart). Ultimately, 185 patients (comprising 220 nodules) were enrolled in the study, with female (84.86%) predominance and a mean age of 49.51 ± 11.44 (range 18 to 76 years). Among them, 50 patients had single nodules, while 135 had multiple nodules. Among the 220 sampled nodules, 171 were hypoechoic, 42 were isoechoic, and 7 were hyperechoic. There were 166 solid nodules and 54 solid-cystic nodules. The mean nodule length was 1.40 ± 1.00 mm, ranging from 0.3 to 5.1 cm. According to the TI-RADS classification, nodules were categorized as class 2 (n = 1), class 3 (n = 67), class 4a (n = 107), class 4b (n = 36), class 4c (n = 7), and class 5 (n = 2). Calcification was associated with 111 nodules. Each nodule underwent four punctures with both a stylet needle and a syringe needle. Among the 220 nodules, 107 were randomized to be punctured initially with a stylet needle, while 113 were randomized to be punctured initially with a syringe needle. Cytologic diagnoses were as follows: malignant (TBSRTC VI) (n = 35, 15.91%), suspected malignant (TBSRTC V) (n = 34, 15.45%), follicular tumor or suspected follicular tumor (TBSRTC IV) (n = 5, 2.27%), atypical (TBSRTC III) (n = 53, 24.09%), benign (TBSRTC II) (n = 64, 29.09%), and inadequate (TBSRTC I) (n = 29, 13.18%).

Fig. 3
figure 3

Study flowchart detailing screening and exclusion, as well as total patients and nodules included in the final analysis.

Specimen adequacy, sensitivity and specificity

Out of 220 nodules, 191 (86.82%) yielded overall adequate specimens. Specifically, stylet needles produced 186 out of 220 (84.55%) adequate specimens, while syringe needles yielded 189 out of 220 (85.91%) adequate specimens. The corresponding cytologic diagnostic results for both types of needles are presented in Fig. 4.

Fig. 4
figure 4

The respective fine needle aspiration cytologic results for stylet needles and syringe needles.

The difference in FNA adequacy between the two needles (syringe needle minus stylet needle) was 1.36% (95% CI, -5.27–7.99%). Importantly, the lower bound of the 95% confidence interval for this adequacy difference (-5.27%) exceeded the predefined non-inferiority threshold of -10%. These results confirmed that the FNA specimen adequacy achieved with syringe needles was not inferior to that obtained with stylet needles.

In 184 out of 220 nodules, both needle specimens were deemed adequate, while in 29 nodules, both needle specimens were inadequate. Additionally, two nodules exhibited adequate stylet needle specimens but inadequate syringe specimens, and five nodules displayed adequate syringe needle specimens but inadequate stylet needle specimens. Thus, both needle types consistently demonstrated specimen adequacy in 213 nodules (96.82%) with a Kappa coefficient of 0.87. Furthermore, the two needles exhibited consistent cytological results in 186 nodules (84.55%) with a Kappa coefficient of 0.80. The consistency of cytological results for both needle types is summarized in Table 1.

Table 1 The cytological consistency of fine needle aspiration with stylet needles and syringe needles.

Ninety-four patients underwent subsequent thyroidectomy and paraffin pathology results were obtained for 106 nodules, including 73 papillary carcinoma, 2 medullary carcinomas, and 31 benign. The sensitivity of the stylet needle FNA was 77.33% with a specificity of 90.32%. Syringe needle FNA had a sensitivity of 78.67% and a specificity of 87.09%. The difference in sensitivity and specificity between the two FNAs was not significant (P > 0.05).

Cost analysis

The cost of a single FNA session using a stylet needle was 780 CNY, whereas the cost of using a syringe needle was 500.9 CNY. This represented a 35.78% reduction in the cost of FNA with syringe needles compared to stylet needles, as indicated in Table 2. The total cost of stylet needle FNAs amounted to 171,600 CNY, while syringe needle FNAs totaled 110,198 CNY, resulting in a cost savings of 61,402 CNY with syringe needle FNAs in this group of nodules.

Table 2 Single cost of fine needle aspiration with syringe needles and syringe needles.

The cost of obtaining one adequate specimen with a stylet needle FNA was 922.58 CNY, whereas, with a syringe needle, it was 583.06 CNY. This represented a 36.80% reduction in the cost of obtaining one adequate specimen using syringe needles compared to stylet needles. In other words, the syringe needle demonstrated a lower cost-effectiveness ratio (cost/adequacy) than the stylet needle (583.06 VS 922.58). The ICER was calculated as − 20,522.06, as shown in Table 3. A negative ICER indicated that syringe needle FNA is more cost-effective.

Table 3 Cost-effectiveness analysis comparing stylet needle FNA to syringe needle FNA.

Discussion

The present study demonstrated the non-inferiority of syringe needle FNA in terms of specimen adequacy compared to stylet needle FNA. Both needle types performed similarly in producing adequate specimens, with a high degree of consistency observed when comparing their performance at the individual lesion level. Furthermore, the cost analysis highlighted the advantages of using syringe needles in terms of cost-effectiveness.

The choice of needle for FNA is a critical decision, and in thyroid FNA, two commonly used options are conventional syringe needles without a stylet1,2,3 and puncture needles with a stylet4,5,6,7,8,9,10,11. Over the years, many studies4,5,6,7,8 have assessed the value of stylet needles in thyroid FNA, with claims that they enhance specimen adequacy, attributed to the stylet needle’s ability to prevent contamination of the needle lumen with blood or cystic fluid before puncturing the target lesion, ultimately leading to a more adequate specimen when the stylet is removed7,8. However, it is worth noting that these several reports4,5,6,7,8 promoting stylet needles were authored by the same investigator, and thus, the hypothesis still lacks robust supporting evidence. Furthermore, in other areas of FNA, such as gastroenterology endoscopic ultrasound-guided fine needle aspiration (EUS-FNA) and Endobronchial ultrasonographically guided transbronchial needle aspiration (EBUS-TBNA), recent studies12,13,14,15,16,17,18 have raised doubts about the role of stylet needles. These studies have indicated that the value of stylets is primarily theoretical and, in practice, does not consistently improve specimen adequacy and can even lead to worse specimen quality19. Consequently, there is a growing call for clinicians to reevaluate the value of stylet needles. Given that stylet needles are significantly more expensive than syringe needles, their cost may not be justified if they do not substantially enhance specimen quality.

This lack of consensus regarding the value of stylet needles prompted us to conduct a preliminary study evaluating stylet needles21. Herein, we compared the impact of using or not using a stylet on specimen adequacy, using puncture needles of the same form, material, length, and gauge. The results from that study revealed that FNA without a stylet was not inferior to FNA with a stylet in terms of specimen adequacy. However, these findings did not directly address the superiority or inferiority of stylet needles versus syringe needles for specimen sampling due to differences in needle morphology, material, length, and other factors between the two. Notably, the stylet needle (5 cm) used in our study was longer than the syringe needle (3.8 cm). While a longer needle may increase the likelihood of blood coagulation in the needle lumen and susceptibility to bending, previous research4 indicated that needle length is not a major factor affecting specimen adequacy. Interestingly, needle tip morphology could potentially impact specimen adequacy, with long, pointed, and less steep tips possibly causing the drifting of sampled cells and easier capture of blood and fluids23. However, current evidence suggests that the morphology of the needle tip has limited influence on specimen adequacy, as the diagnostic outcomes of FNA are largely independent of needle morphology24. In essence, both stylet needles and syringe needles are fine needles that obtain cells by cutting the needle tip, with capillary siphonage drawing the cut cells into the needle lumen. The primary distinction between the two is the presence or absence of a stylet.

Indeed, a stylet may prevent puncture blood from entering the needle lumen before stylet removal. However, this benefit may not significantly enhance specimen quality for experienced operators. Experienced operators can often anticipate the puncture route and angle in advance, allowing for precise needle placement into the nodule with minimal adjustments. Bleeding from thyroid tissue puncture tends to occur during prolonged needle adjustments, which can lead to blood entering the needle lumen. Furthermore, the process of stylet withdrawal generates negative pressure25,26, potentially compromising specimen adequacy. When the stylet is removed after the needle enters the target nodule but before it is used to cut and collect focal cells, the negative pressure generated during stylet withdrawal may draw non-focal components such as blood and cystic fluid into the needle lumen, potentially interfering with the entry of focal cells obtained during subsequent cutting. This disadvantage was also observed in a previous EUS-FNA study19, where specimens with a stylet exhibited significantly more blood and lower specimen adequacy compared to those without a stylet.

Additionally, prior studies4,5,6,7 evaluating stylet needles did not compare both needle types on the same nodule but rather used only one needle type per nodule. This approach may introduce selection bias, such as the possibility that easier-to-puncture nodules were assigned to the experimental group. In our study, each lesion served as its own control, ensuring that variables associated with each puncture method (stylet needle FNA or syringe needle FNA) were essentially identical, except for the choice of puncture needle. Gimeno-García et al.27 deemed self-control (simultaneous use of both techniques on the same nodule) to be the best method for controlling all variables potentially influencing the results. However, self-control is not without limitations, particularly regarding the possibility of bleeding from the previous puncture affecting the subsequent puncture specimen28. To mitigate this concern, we randomized the first puncture for each nodule, alternating between stylet needle and syringe needle. Half of the patients were randomized to undergo the first puncture with a stylet needle, while the other half began with a syringe needle (the number of initial punctures with each needle type was randomized and evenly distributed). Therefore, our study design aimed for comparability, objectivity, and accuracy.

Economic evaluation has indeed become a crucial decision-making tool for healthcare systems. Stylet needles are generally more expensive than syringe needles. In our study, the stylet needle cost was 140 CNY, while the syringe needle cost was 0.45 CNY, representing a 311-fold price difference. Other commonly used stylet needles reported in the literature4,5,6,7 cost approximately $1.33–3, whereas corresponding syringe needles cost $0.19, resulting in price differences of 7–15.8 times. The increase in cost should theoretically yield increased benefits; otherwise, the high cost of stylet needles may not be justified.

Although several studies4,5,6,7,8 have reported that stylet needles are cost-effective for thyroid FNA compared to syringe needles, our research yielded contrasting results. Our cost analysis indicated that syringe needle FNA was not only less expensive (500.9 CNY vs. 780 CNY) but also more effective (85.91% adequacy vs. 84.55% adequacy). In previous studies4,5,6,7,8, investigators argued that, despite their higher cost, stylet needles were more cost-effective due to the improved adequacy of stylet needle FNAs, implying a reduced need for repeat FNAs. Conversely, in our study, stylet needles did not demonstrate an advantage in terms of adequacy, and their cost per adequate specimen was higher than that of syringe needles. Given that millions of patients worldwide undergo FNA annually20, adopting cost-effective syringe needles can result in substantial cost savings without compromising efficacy compared to expensive stylet needles.

Our cost analysis does have some limitations, as it only considered direct medical costs and did not account for indirect costs such as lost wages or expenses incurred by patients requiring repeat FNA or diagnostic surgery due to inadequate specimens. However, because the difference in adequacy favored syringe needles over stylet needles, it suggests that stylet needles may result in a higher likelihood of needing repeat FNAs or diagnostic procedures. Thus, including the costs of repeat FNAs, diagnostic procedures, and indirect expenses such as lost labor would likely further widen the cost differential between stylet needles and syringe needles, reinforcing our results. Additionally, our cost analysis was specific to our local context and may not be directly applicable to other settings.

Other limitations were also present in this study. Firstly, the sensitivity and specificity of the two needle FNAs in this study were assessed using postoperative paraffin pathology as the standard reference. However, since benign and indeterminate cases typically do not undergo surgery and thus lack paraffin pathology confirmation, some unoperated nodules may still contain malignancies. Consequently, the true sensitivity is likely somewhat higher, while the specificity may be slightly lower than the values reported. Secondly, the blinding of operators was unfeasible. Thirdly, our study involved five pathologists, and the subjectivity of pathologists could have influenced result determinations. Nevertheless, each patient’s slides were reviewed by the same pathologist. Lastly, needle size for the FNA procedure is a worthy issue29,30, given that 25G needles were exclusively used, our results may not be directly applicable to other needle sizes.

Conclusions

In summary, our findings support the routine use of syringe needles in thyroid FNA. Syringe needle FNA demonstrated non-inferiority to stylet needle FNA in terms of specimen adequacy, and the two needle types exhibited a high degree of concordance in specimen adequacy. Furthermore, syringe needle FNA was both more cost-effective and less expensive. Given the global prevalence of FNA procedures, the economic considerations are paramount, and we recommend the adoption of syringe needles for thyroid FNA.