Introduction

Forensic examination of fabric found at a crime scene is critically important in forensics1,2. Such analysis could be used to reveal the chemical nature of the fabric material, its colorant composition, as well as the presence of biological and non-biological contaminants. All this information can help to establish a connection between a suspect and the crime scene.

Fluorescence, Infrared (IR) and Raman spectroscopy (RS) are broadly used to determine the chemical structure and composition of analyzed forensic samples, including fabric and body fluids3,4,5,6,7,8,9,10,11. These techniques can easily differentiate between synthetic and natural fibers. At the same time, the accuracy of these techniques could be lowered by contaminants present in the fabric. High-performance liquid chromatography and mass-spectroscopy are commonly used to analyze dye composition of fabric12. Although highly accurate, these analytical methods are laborious, invasive and destructive13. They also require an extensive library of colorants for the confirmatory identification of dyes. A large number of techniques ranging from optical microscopy to vibrational spectroscopy could reveal the presence of biological and non-biological contaminants on fabric14,15,16. For instance, RS is capable of differentiating between blood, saliva, sweat, and semen17,18,19,20. Furthermore, RS could be used to determine the age of bloodstains on different substrates21. Hager and co-workers showed that RS was capable of identifying urine on a large variety of fabric ranging from cotton to synthetic polyester6. Although this method is non-invasive and non-destructive, RS is limited to substrates with low fluorescence background22. At the same time, dyes of plant and insect origin that are broadly used for fabric coloration typically have extremely high fluorescence background, which limits the application of traditional RS for their analysis23.

The problem of dye fluorescence can be overcome by the use of plasmonic nanostructures that enhance Raman scattering24,25,26. This analytical approach, known as surface-enhanced Raman spectroscopy (SERS), is broadly used for the confirmatory identification of hair colorants in the artwork12,23,27,28. SERS is also capable of detecting explosives29,30, illicit drugs, such as heroin and cocaine31, and body fluids on fabric.32 Furthermore, SERS could be used to detect and identify dyes present on hair33,34,35,36.

Recently, our group demonstrated that fabric dyes could be identified using near-Infrared excitation Raman spectroscopy (NIeRS)37. In this case, utilization of near-Infrared lasers allows for the minimization of dye fluorescence, while preserving the entirely non-invasive and non-destructive nature of RS. Specifically, we found that 15 different colorants could be identified on cotton fibric with ~ 100% accuracy with only 1 s spectral acquisition using a hand-held NIeRS37.

It should be noted that physical pieces of evidence found at crime scenes are typically exposed to a large number of substances of biological and non-biological origin38,39. Substances of the first category typically include blood, semen, and urine. Substantially large variability is expected for the substances of non-biological origin that can be present on fabric. These substances range from household and food contaminants to industrial products. Previously reported results by Juarez and Kurouski showed that the presence of blood on colored hair limits the use of SERS for the identification of artificial hair dyes38,39. Similar conclusions were made for a large number of household contaminants38,39. However, if removed by washing, the information about the hair colorants could be revealed by SERS.

Expanding upon this, we investigated the extent to which blood, semen and urine could alter the accuracy of NIeRS-based analysis of green, pink, and light red dyes on cotton fabric. We also investigated the extent to which body fluids could be identified using NIeRS on blank and colored cotton fabric. For this, partial least-squared discriminant analysis (PLS-DA) was used. In general, PLS-DA performs the same or better than support vector machine (SVM), soft independent modeling of class analogy (SIMCA), and principal component analysis (PCA)40,41. This method integrates regression and classification by leveraging the correlation between spectral data and sample classes. Unlike binary classifiers such as SVM and SIMCA, PLS-DA is capable of handling multiclass classification, making it more suitable for complex classification tasks. Furthermore, unlike PCA, which only identifies the most significant components in the data, PLS-DA extracts latent variables that optimize the correlation between spectral datapoints and class variables, thus enhancing its performance in datasets with intricate class structures and small sample sizes.

Experimental section

Fabric materials

Cotton canvas was purchased from Fruit of the Loom; green, pink and light red dyes (Kool Krafts, ASIN: B08N5HVXS8) were purchased from the Mosaiz Store (Vanstek Tie Dye Kit, 24 Colors Tie Dye Shirt DIY Fabric Dyes). These non-reactive colorants became highly popular around the world and broadly used to dye fabric. Coloration was performed by soaking the cotton in the corresponding liquid dye for 1 h. Next, cotton samples were rinsed under cold water until the water ran clear and dried under ambient conditions.

Near-infrared excitation Raman spectroscopy (NIeRS)

NIeRS spectra were collected using a hand-held Agilent Resolve spectrometer equipped with an 830 nm laser. Spectra were acquired with 1 s acquisition time and 495 mW laser power. Automatic baselining was performed by the instrument. Spectra were taken at different locations on the fabric sample.

Spectral processing and statistical analysis

Spectral processing and model development was done in Matlab (Mathworks) equipped with PLS_Toolbox (Eigenvector Research, Inc., Manson, WA).

Results and discussion

We first used NIeRS to examine vibrational fingerprints of dyes themselves on cotton fabric. For this, blank pieces of cotton were colored by green, light red and pink dyes. Acquired NIeRS spectra revealed vibrational fingerprints that were unique to all three colorants, Fig. 1. We found that green colorant could be identified by the presence of 750 and 1540 cm−1, whereas light red dye exhibited a unique vibrational band at 1412 cm−1. Pink colorant has two unique vibrational bands at 1285 and 1610 cm−1.

Figure 1
figure 1

NIeRS spectra of undyed (grey), green-, light red-, and pink-colored fabric with corresponding undyed spectra shown next to each. Differences between intensities of 970 and 997 cm−1 are highlighted by a yellow bar.

It should be noted that although some colorants have only one unique band, utilization of partial-least squared discriminant analysis (PLS-DA), allowed for the highly accurate identification of all three colorants on fabric, Table 1 and Figure S1–S2.

Table 1 Confusion matrix of PLS-DA model with cross (internal) validation showing the accuracy of NIeRS-based identification of dyes on uncontaminated fabric.

Next, undyed cotton fabric was contaminated with blood, urine and semen and dried under room temperature. Visual examination of samples revealed intense-brown and light-yellow coloration of blood- and urine-contaminated samples, respectively, Fig. 2A–C. No visual change in the color of semen-contaminated sample was observed, Fig. 2D.

Figure 2
figure 2

Photographs of uncontaminated (A) cotton fabric and the fabric contaminated with blood (B), urine (C) and semen (D). NIeRS spectra (E) of blank (grey), fabric contaminated with blood (maroon), urine (yellow) and semen (pink) with corresponding blank spectra shown by dashed lines. Spectra were normalized on 900 cm−1 indicated by an asterisk (*).

NIeRS-based analysis of the contaminated uncolored fabric revealed a strong signature of heme and proteins in the spectrum acquired from the blood-contaminated fabric. Specifically, we observed an increase in the intensity of 730–750 cm−1 bands (aromatics), 1236 cm−1 (amide III) and 1455 cm−1 (CH2), as well as 1555–1618 cm−1, which could be assigned to amide II and I bands on proteins, Fig. 2E. These results indicate that NIeRS could be used to detect the presence of blood on fabric. We also observed an increase in the intensity of 1000 cm−1 band in the NIeRS spectrum acquired from urine-contaminated fabric compared to NIeRS spectrum of clean fabric. Hager and co-workers previously showed that this band originates from urea6. Finally, only minor changes in the intensities of vibrational bands were observed in NIeRS spectrum of blank fabric with dried semen contamination. These results indicate that semen does not have strong spectroscopic signature if present on fabric that could be used for its unambiguous identification. It should be noted that 1000 cm−1 band was also observed in NIeRS spectra acquired from fabric contaminated by blood. In this case, this band originates from aromatic vibrations of proteins. However, in the case of blood, another vibrational bands that correspond to heme and proteins were observed. Thus, 1000 cm−1 alone cannot be solely used to differentiate between the presence of blood vs urine.

We also investigated the extent to which spectroscopic signatures of blood on undyed cotton could be altered if the body fluid was not dried and if it was removed by washing. As expected, the presence of fresh blood can be noticed by the naked eye on uncontaminated cotton, Fig. 3A. However, extensive washing of the cotton material with detergents did not allow for the complete removal of the blood from the fabric. Clearly visible brown coloration was evident on such fabric, Fig. 3B.

Figure 3
figure 3

Photographs of cotton fabric exposed to fresh blood before (A) and after washing (B). NIeRS spectra (C) acquired from wet and dried blood present on uncontaminated (control) fabric, as well as the spectra acquired from fabric after removal of blood by washing. Spectra were normalized on 900 cm−1 indicated by an asterisk (*).

We found that dried and wet blood have substantially different vibrational fingerprints on uncontaminated cotton. Specifically, we observed strong changes in 730–750, 1236 and 1455, 1555- 1618 cm−1, spectral regions, Fig. 3C. These results indicated that NIeRS could be used to reveal changes in composition of blood that take place upon drying. These results are consistent with experimental findings reported by Doty and co-workers43. The researchers also showed that RS could be used to identify aging of bloodstains43. Therefore, in our study, we analyzed blood dried only for 24 h. We also found very little if any difference in the spectroscopic signature of fabric after the blood was removed by drying. At the same time, PLS-DA allowed for unambiguous identification of all analyzed classes of samples, Fig. 4. Thus, NIeRS could be used to identify whether contaminated fabric was exposed to blood and if yes, whether blood was removed by washing, Table 2 and Figures S3 and S4.

Figure 4
figure 4

3D latent-variable plot of NIeRS spectra acquired form undyed cotton, as well as cotton contaminated by urine, semen, dry and wet blood.

Table 2 Confusion matrix of PLS-DA model with cross (internal) validation showing the accuracy of NIeRS-based identification of contaminants on blank fabric.

Similar effects of body fluids were observed for green-, pink-, and light red-colored fabric. Specifically, the greatest magnitude of the spectroscopic changes was caused by blood in all three cases. Nearly all regions of the NIeRS spectra were altered by the presence of this body fluid. We also found that the presence of urine caused an increase in the intensity of 1000 cm−1 in the NIeRS spectra acquired from green, pink, and light red-colored fabric. Finally, the presence of semen did not cause changes in any particular peak. However, similar to the uncolored fabric, primarily changes in the intensities of some vibrational bands were observed, Fig. 5.

Figure 5
figure 5

NIeRS spectra of green (A), pink (B), and light red (C)-colored cotton fabric contaminated with blood, urine, and semen. Spectra were normalized on 900 cm−1 indicated by an asterisk (*).

Next, we investigated the extent to which artificial intelligence could be used to determine the color of fabric with and without present contaminants. For this, 16 classes of data were used, Scheme 1. We first built a PLS-DA model based on all 16 classes of data and used cross-validation to identify the accuracy of color identification. The second PLS-DA model was built on 11 classes of data (training model) with the remaining 5 classes used for external validation. Our results indicate that cross-validation enabled highly accurate identification of both green and pink dyes (100% accuracy), whereas uncolored fabric and light red fabric were identified with 96.7% and 92.8%, respectively. External validation of this model enabled 100% accurate identification of green and pink dyes, with 88% identification of the light red colorant, Table 3 and Figures S5–S7. These results indicate that NIeRS could be used for highly accurate identification of colorants of the fabric with and without present contaminants. Furthermore, the design of the training model indicates that robust PLS-DA models for color identification are possible after only exposing the learning algorithm to the control, allowing the number of colors recognized to be easily upscaled.

Scheme 1
scheme 1

Classes of data used for training and validation of PLS-DA model used to identify colorants on fabric.

Table 3 Confusion matrix of PLS-DA model with cross (internal and external) validations showing the accuracy of NIeRS-based identification of dyes on contaminated fabric.

The next question to ask is whether NIeRS coupled with PLS-DA could be used to identify the presence of contaminants on fabric. To answer this question, we built a PLS-DA model in which all data were partitioned 80:20, where 80% of the data were used to build the model and 20% of the data were used to perform external validation, Scheme 2.

Scheme 2
scheme 2

Classes of data used for training and validation of PLS-DA model used to identify body fluid contaminants.

Our results show that both urine and semen could be identified with 100% accuracy in the case of cross- and external validation, Table 4 and Figures S8–S10. We also found that blood could be correctly identified with 98.3% and 88.3% accuracy in cross- and external validation, respectively. Finally, the absence of any contamination could be confirmed with ~ 90% accuracy. Finally, we investigated the extent to which wet, dry and washed blood could be identified on fabric. Our results show that external validation enabled 100% accurate identification of wet, dried blood, and washed blood on both uncolored and colored fabric, Table 5 and Figures S11 and S12.

Table 4 Confusion matrix of PLS-DA model with cross (internal and external) validations showing the accuracy of NIeRS-based identification of contaminants on fabric.
Table 5 Confusion matrix of PLS-DA model with cross (external) validation showing the accuracy of NIeRS-based identification between wet, dried and washed blood samples.

Conclusions

Our proof-of principle results demonstrate high sensitivity of NIeRS to body fluid contaminants. Specifically, NIeRS could be used for highly accurate identification of dry blood, urine, and semen present on both dyed and undyed model fabric samples. These results are in good agreement with experimental findings previously reported by Lednev group for Raman-based sensing with 785 nm42,43,44. Furthermore, NIeRS could identify whether fabric was exposed to blood after its washing. We also showed that NIeRS was capable of identifying color of fabric with and without present contaminants. These results indicate that NIeRS coupled with PLS-DA can be used in forensics analysis of dyes and contaminants present on cotton fabric. It is important to note that a hand-held NIeRS instrument requires ~ 2–3 mm2 of fabric for the analysis. However, the extension of this approach to the confocal microscopes could be used to identify both colorants and contaminants on fragments (20–30 µm2) of individual fibers (Supplementary file). Finally, it is very important to extend NIeRS sensing of colorants on synthetic fabric, as well as to test the contribution of environmental factors, such as UV radiation and heat, as well as substances of non-biological origin on the reliability of NIeRS-based identification of colorants on fabric.