Introduction

Due to the high degree of thermal evolution of characteristic light oils and condensates, their composition is dominated by light hydrocarbons, which constitute more than 90%1,2. Alkylbenzenes (monocyclic aromatic hydrocarbons) are an important component of light hydrocarbons, accounting for more than 2–6 wt.%3. Alkylbenzenes are believed to be generated through homolytic cleavage of the carbon-carbon bonds in alkyl side chains attached to macromolecular structure (such as kerogen)4. Alkylbenzene compounds are utilized as indicators for assessing the source5, sedimentary environment6 and maturity7 of ancient organic matter.

There are many previous reports on the geochemical data of C0-C4 alkylbenzenes, mainly including the biological sources of 1,2,3-trimethylbenzene (TMB), 1,3-+1,4-dimethylbenzene (DMB), 1,2,3,4-tetramethylbenzene (TeMB)4,8,9,10,11,12,13. The ratios of isoproyltoluene (iPT) isomers were proposed to indicate the origin of organic matter5. 1,2,3,5-/1,2,3,4-TeMB (TeMBr) and 1,2,3,5-TeMB concentrations were used to indicate the sedimentary environments6. 1,2,4,5-/1,2,3,4-TeMB and 1,2,4,5-TeMB concentration can be used to distinguish the source and depositional environment of ancient organic matter14. A series of alkylbenzene parameters have been employed to assess the maturity of sedimentary organic matter,6,7,15,16 while the Tol/nC7 has been used to distinguish secondary alterations17,18.

Compared with C0-C4 alkylbenzenes, the research of C5 alkylbenzenes have been studied relatively limited, and the current research mainly includes the distribution characteristics of C5 alkylbenzenes and the possible biological sources of individual compounds. The C5-alkylated benzenes are predominantly composed of 1-ethyl-2,3,6-trimethylbenzene4. 1-Ethyl-2,3,6-trimethylbenzene and 1-ethyl-3,4,5-trimethylbenzene are likely derived from the aromatic carotenoids possessing the 2,3,6- and 3,4,5- substitution pattern incorporated into the kerogen4. In addition, 1-ethyl-2,3,6-trimethylbenzene derived from Chlorobiaceae can be significantly enriched in 13C19,20.

Due to the relatively low concentration of C5 alkylbenzenes in crude oils and their tendency to co-elute with other compounds during gas chromatography (GC) analysis, as reported by Jia et al.12, these compounds have received limited research attention. Consequently, a significant amount of geochemical information related to C5 alkylbenzenes remains underexplored due to limitations in current analytical methodologies. Fortunately, comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GC×GC–TOFMS) can separate compounds based on their boiling point differences in the first dimension and their polarity difference in the second dimension, respectively. This method provides excellent resolution in compound separation, thereby offering a powerful tool for the study of C5 alkylbenzenes6,14.

In this paper, we report the results of detailed study of twelve C5 alkylated benzenes distributions in thirty-four light oils and condensates from diverse sedimentary environments and geological ages. It primarily involves the analysis of similarities and differences in C5 alkylbenzene distributions between light oils and condensates, and discusses potential geochemical indicators. The C5 alkylbenzenes, which has a specific biological origin and formation environment, are of great geochemical significance for light oils and condensates where biomarkers are scarce.

Samples and methods

Samples

A total of thirty-four light oil and condensate samples were collected from the Tarim Basin (25 samples) and Beibuwan Basin (9 samples). The detailed characteristics of crude oil samples, including sample naming, source age and sedimentary environment, is shown in Table 1.

Tarim marine and swamp oils

A total of twenty-five light oil and condensate samples were collected from the Tarim Basin, of which fifteen oils were from the Cambrian-Ordovician marine source rocks in the Halahatang Depression, five oils were derived from Jurassic coal-measure source rocks in the Luntai Sub-uplift, and five oils were originated from Jurassic coal-measure source rocks in the Kuqa Depression (Fig. 1a). The light oils and condensates from Cambrian-Ordovician marine source rocks are derived from lower aquatic organisms and formed in a reduced sedimentary environment22,23. The light oils and condensates derived from the Jurassic coal-measure source rocks originated from terrigenous higher plants and were formed in an oxidizing sedimentary environment24,25,26,27. The Tarim marine oils are at a mature stage of thermal evolution25. The swamp oils are at a mature to high-maturity stage of thermal evolution24,26,27,28.

Beibuwan lacustrine oils

A total of nine light oil and condensate samples were collected from the Fushan depression, Beibuwan Basin (Fig. 1b). The crude oils are in the stage of high maturity and their sterane and hopane biomarkers are depleted29. These light oils and condensates are mainly from black mudstone and shale of the Liushagang Formation (E2l), which are typical lacustrine oils30. The source rocks of the Liushagang Formation (E2l) were primarily deposited in an oxidizing sedimentary environment and are predominantly composed of terrigenous organic matter31,32.

Fig. 1
figure 1

The regional tectonic characteristics of (a) the northern Tarim Basin and (b) the Fushan Depression of the Beibuwan Basin.

Table 1 Geological and geochemical information in all studied oils.

Methods

The saturated hydrocarbons and aromatic hydrocarbons were separated from light oil samples by column chromatography. Crude oil samples were analyzed mainly by GC and GC×GC-TOFMS. The saturated hydrocarbons and aromatic hydrocarbons were analyzed by GC-MS.

The GC model is Agilent 6890 A with a fused silica HP-PONA column (50 m×0.2 mm×0.5 μm). The program temperature setting is: the initial oven temperature is 35℃, keep for 5 min, heated at 3 °C /min to 70 °C, and then heated at 4.5 °C to 300 °C, held for 25 min. The GC-MS model is Agilent 6890 A GC-5979i MS, with is an HP-5MS column (60 m×0.25 μm×0.25 μm). The GC program temperature setting is: for saturated hydrocarbons, the initial oven temperature is 50 °C, held for 1 min, heated at 20 °C/min to 100 °C, and then heated at 3 °C/min to 310 °C for 10 min. For aromatic hydrocarbons, the initial oven temperature is 80 °C for 1 min, and the temperature is raised to 300 °C at 3 °C/min for 20 min.

Before crude oils were analyzed, internal standard (1-hexene) was added into the crude oils of known weight. The GC×GC-TOFMS mode is Agilent 7890 GC-Pegasus 4D TOFMS with non-polar capillary HP-PONA column (50 m×0.20 mm×0.50 μm) in the first dimension and polar capillary Rxi-17 column (1.7 m×0.1 mm×0.1 μm) in the second dimensions. The GC program temperature setting is: for the first dimensional column, the initial temperature is 35 °C, held for 5 min, and the temperature is raised at 2 °C/min to 295 °C, held for 30 min. For the second dimensional column, the initial temperature is 40 °C for 5 min, and the temperature was raised to 280 °C at 2 °C/min for 30 min. The modulation period is 1.2 s hot pulse and 4.8 s cold pulse. The detector and ion source voltages are 1350 V and 70 eV, respectively. Helium is used as a carrier. The above experimental analysis was done in the National Key Laboratory of Petroleum Resources and Engineering, China University of Petroleum (Beijing).

Results

Identification of C5 alkylated benzenes in light oils

As shown in Fig. 2, mass chromatogram of m/z 148 was used to present C5 alkylated benzenes, which presents an echelon distribution. The separation of alkylbenzene compounds on the chromatogram is mainly based on the boiling point of the compounds in the first dimension and by their polarity in the second dimension. The twelve C5 alkylated benzenes were detected in the Tarim marine and swamp oils, and Beibuwan lacustrine oils using GC×GC-TOFMS and were identified by comparison with Nist Standard Reference Database version 2.0 and published literature4 (Table 2). 1-Methyl-2-tert-butylbenzene (1-M-2-t-BB), 1-methyl-4-tert-butylbenzene (1-M-4-t-BB), 2-methylbutylbenzene (2-MBB), 3-methylbutylbenzene (3-MBB) were identified by Nist Standard Reference Database version 2.0. The remaining eight C5 alkylated benzenes were identified with the aid of Hartgers et al.4. The molecular structures of twelve C5 alkylated benzenes are presented in Fig. 2.

Fig. 2
figure 2

The m/z 148 mass chromatogram of oil YM4 (a) and DB2 (b) from the Tarim Basin, oil H2 (c) from the Beibuwan Basin show the distribution of C5 alkylated benzenes in GC×GC–TOFMS. The peak numbers correspond to the compound names listed in Table 2.

Table 2 The C5 alkylated benzenes were detected and identified in all of the studied oil samples.

The first dimension and second dimension retention times, retention indices, molecular masses, base peaks, and compound abbreviations are shown in Table 2. All C5 alkylbenzenes exhibit a parent ion at m/z 148 and possess the molecular formula of C11H16. The mass spectrum of both 2-MBB and 3-MBB exhibit a base peak at m/z 92. The mass spectrum of n-pentylbenzene (n-PB) exhibits a base peak at m/z 91. The base peak of the remaining C5 alkylbenzenes is at m/z 133. A higher peak at m/z 105 occurs in the mass spectrum of both 1-M-2-t-BB and 1-M-4-t-BB. The mass spectrum of both 2-MBB and 3-MBB exhibit a more intense peak at m/z 91, whereas the mass spectrum of n-PB shows a more intense peak at m/z 92.

Distribution of C5 alkylated benzenes in light oils

Figure 3 shows the concentration distribution of twelve C5 alkylated benzenes in all analyzed oil samples from the Tarim marine and swamp oils, and Beibuwan lacustrine oils. Based on the distribution characteristics of twelve C5 alkylated benzenes in the Tarim marine and swamp oils, and Beibuwan lacustrine oils, these oils can be classified into four categories. (1) 1-M-2-t-BB, 1-M-4-t-BB and 3-MBB are enriched in the Tarim swamp oils (0.19–0.75 (average 0.46), 0.50–1.99 (average 1.28), 0.05–0.68 (average 0.22) mg/g whole oil) (Table 3), and are below the detection limit or depleted in the Tarim marine oils and Beibuwan lacustrine oils. (2) The concentration of l-ethyl-3,4,5-trimethylbenzene (1-E-3,4,5-TMB), 1-E-2,3,4-TMB, and 1-E-2,4,5-TMB in the Tarim marine oils (0.14–0.40 (average 0.25), 0.07–0.26 (average 0.14), 0-0.31 (average 0.18) mg/g whole oil) and Beibuwan lacustrine oils (0.30–0.75 (average 0.46), 0.13–0.41 (average 0.25), 0.40–1.85 (average 0.89) mg/g whole oil) are higher than those in the Tarim swamp oils (0.01–0.15 (average 0.07), 0.03–0.16 (average 0.09), 0.03–0.34 (average 0.15) mg/g whole oil) (Table 3). (3) The concentration of 1-E-2,4,6-TMB, 1-E-2,3,5-TMB and 1,2,3,4,5-PMB in the Beibuwan lacustrine oils (0.12–0.37 (average 0.22), 0.22–0.84 (average 0.58), 0.39–1.48 (average 0.71) mg/g whole oil) are higher than those in the Tarim marine oils (0.03–0.08 (average 0.05), 0.12–0.46 (average 0.25), 0-0.40 (average 0.12) mg/g whole oil) and Tarim swamp oils (0.03–0.19 (average 0.10), 0.03–0.39 (average 0.16), 0-0.41 (average 0.18) mg/g whole oil) (Table 3). (4) There was no significant difference in the contents of 2-MBB, n-PB and 1-E-2,3,6-TMB in all studied oils (Table 3).

Fig. 3
figure 3

The concentrations of C5 alkylated benzenes in light oils and condensates. (a) 1-M-2-t-BB; (b) 1-M-4-t-BB; (c) 3-MBB; (d) 1-E-3,4,5-TMB; (e) 1-E-2,3,4-TMB; (f) 1-E-2,4,5-TMB; (g)1-E-2,4,6-TMB; (h) 1-E-2,3,5-TMB; (i) 1,2,3,4,5-PMB; (j) 2-MBB; (k) n-PB; (l) 1-E-2,3,6-TMB. The concentration unit is mg/g of whole oil.

Table 3 The C5 alkylated benzenes concentrations.

Discussion

Origins of C5 alkylated benzenes

Alkylbenzenes are mainly generated via α-, β-, γ- and δ-cleavages of the side chain by which the aromatic moiety with specific substitution patterns attached to macromolecular structures (e.g., kerogen)4. For example, 1-E-2,3,6-TMB is likely generated γ-cleavage of macromolecular-bound aromatic carotenoids with a 2,3,6-trimethyl substitution pattern4. 1-E-2,3,6-TMB is indeed derived from isorenieratene biosynthesized by Chlorobiaceae33. Similar to the origin of 2,3,6-trimethylbenzene moieties in pyrolysis products, Hartgers et al.33 proposed a diaromatic carotenoid with an unprecedented substitution pattern to explain the presence of 3,4,5-trimethylbenzene moieties. 1-E-3,4,5-TMB are derived from an organism that lived in the same habitat as the Chlorobiaceae33. Other C5 alkylated benzenes may be also generated via γ-cleavages of the side chain by which the aromatic moieties with specific substitution patterns linked to kerogen.

The high contents of 1-M-2-t-BB, 1-M-4-t-BB and 3-MBB in the Tarim swamp oils indicate these compouds were likely formed in sub-oxidized environment and are derived from terrestrial higher plants. The contents of 1-E-3,4,5-TMB, 1-E-2,3,4-TMB and 1-E-2,4,5-TMB in the Tarim marine oils and Beibuwan lacustrine oils are higher than those in the Tarim swamp oils, indicating that they may not be easily formed in sub-oxidized environment and derived from the terrestrial higher plants. The contents of 1-E-2,4,6-TMB, 1-E-2,3,5-TMB and 1,2,3,4,5-PMB in the Tarim marine and swamp oils are lower than those in the Beibuwan lacustrine oils, indicating they may be formed in oxidized sedimentary environment. Based on the sensitivity of different C5 alkylated benzenes to sedimentary environments and organic matter sources, the ratios of 1-E-2,4,6-TMB to 1-E-2,3,6-TMB (C5-R1) and 1-E-2,4,6-TMB to 1-E-3,4,5-TMB (C5-R2) have been proposed to distinguish the sedimentary environments and organic matter sources of crude oils from their respective source rocks.

C5-R1 as an indicator of depositional environments

Pristane/phytane (Pr/Ph)34, 2-methyl-3-ethylheptane/2,6-dimethyloctane (MTR)28, 2-methyl-3-ethylheptane/3-methylnonane (MT1) and 1,1,2,3-tetramethylcyclohexane/propylcyclohexane (MT2) were proposed to distinguish the redox conditions of sedimentary organic matter35. In Fig. 4, all studied oils can be divided into two groups based on Pr/Ph, MTR, MT1, MT2 linked with C5-R1. The Tarim marine oils having relatively low Pr/Ph, C5-R1 values, and high MTR, MT1, and MT2 values (Table 4), indicate oils derived from the source rocks formed in anoxic sedimentary environments. The Tarim swamp oils and Beibuwan lacustrine oils relatively high Pr/Ph, C5-R1 values, and low MTR, MT1, and MT2 values (Table 4), indicate oils derived from the source rocks formed in oxic/dysoxic depositional environment.

Fig. 4
figure 4

Cross-plots of C5-R1 vs. Pr/Ph (a), MT1 (b), MTR (c) and MT2 (d) in all studied oils.

1,2,4,5-/1,2,3,4-TeMB14 and 1,2,4,5-/1,2,3,4-TeMB (TeMBr)6 were proposed as indicators to infer the sedimentary environments of the crude oils derived from their respective source rocks. In Fig. 5, the Tarim marine oils with relatively low 1,2,4,5-/1,2,3,4-TeMB, TeMBr, and C5-R1 values (Table 4), indicate oils derived from the source rocks deposited under reducing/anoxic conditions. The Beibuwan lacustrine oils and the Tarim swamp oils with higher 1,2,4,5-/1,2,3,4-TeMB, TeMBr, and C5-R1 values (Table 4), suggest oils derived from source rocks formed under oxic/dysoxic sedimentary environments. In addition, the dibenzothiophene/phenanthrene (DBT/P) ratio and the C5-R1 ratio can also well distinguish crude oil from different sedimentary environments (Table 4).

Fig. 5
figure 5

Cross-plots of C5-R1 vs. 1,2,4,5-/1,2,3,4-TeMB (a) and TeMBr (b) in all studied oils.

In general, there is a good correspondence between the C5-R1 values and the sedimentary environment indicators (Pr/Ph, MTR, MT1, MT2, 1,2,4,5-/1,2,3,4-TeMB and TeMBr). The higher C5-R1 value (> 0.5) indicates that the oils originated from source rocks formed in oxidizing sedimentary environments, while a lower C5-R1 value (< 0.5) indicates that the oils were derived from source rocks formed in reducing sedimentary environments.

C5-R2 as an indicator of organic matter sources

1-Methyl-3-isopropylbenzene (1-M-3-iPB)/1-methyl-2-isopropylbenzene (1-M-2-iPB) (iPTr1), 1-methyl-4-isopropylbenzene (1-M-4-iPB)/1-M-2-iPB (iPTr2), and (1-M-3-iPB + 1-M-4-iPB)/1-M-2-iPB (iPTr) were proposed by Meng et al.5 to distinguish the organic matter origin of the crude oils from their respective source rocks. In Fig. 6, all studied oils can be divided into two groups based on iPTr1, iPTr2, iPTr linked to C5-R2 values. The Tarim marine oils and Beibuwan lacustrine oils with relatively high iPTr1, iPTr2, iPTr values, and low C5-R2 values (Table 4), indicate that the organic matter in their respective source rocks was derived from a mixture of lower aquatic organisms and terrestrial higher plants. The Tarim swamp oils with relatively low iPTr1, iPTr2, iPTr values, and high C5-R2 values (Table 4), suggest that the organic matter in their respective source rocks was predominantly derived from terrestrial higher plants.

Fig. 6
figure 6

Cross-plots of C5-R2 vs. iPTr1 (a), iPTr2 (b), and iPTr (c) in all studied oils.

Relatively high cadinane/(cadinane + 8β(H)-drimane), 2,2,4a,7,8-pentamethyl-decalin/(2,2,4a,7,8-pentamethyl-decalin + 8β(H)-drimane), 1,2,2,5,5-pentamethyl-trans-decalin/8β(H)-drimane, 1,1,2,5,5-pentamethyl-trans-decalin/8β(H)-drimane values indicate terrestrial organic matter input36,37. In Fig. 7, the Tarim marine oils with relatively low cadinane/(cadinane + 8β(H)-drimane), bicyclic sesquiterpenes compounds ratios, and C5-R2 values (Table 4), indicate that the organic matter in their respective source rocks was predominantly derived from lower aquatic organisms. The Tarim swamp oils with relatively high cadinane/(cadinane + 8β(H)-drimane), bicyclic sesquiterpenes compounds ratios, and C5-R2 values (Table 4), suggest that the organic matter in their respective source rocks was predominantly derived from the terrestrial higher plants. The Beibuwan lacustrine oils with relatively high cadinane/(cadinane + 8β(H)-drimane), bicyclic sesquiterpenes compounds ratios, and low C5-R2 values (Table 4), suggest that the organic matter in their respective source rocks was derived from a mixture of lower aquatic organisms and terrestrial higher plants.

Fig. 7
figure 7

Cross-plots of C5-R2 vs. cadinane/(cadinane + 8β(H)-drimane) (a), 1,2,2,5,5-pentamethyl-trans-decalin/(1,2,2,5,5-pentamethyl-trans-decalin + 8β(H)-drimane) (b), 2,2,4a,7,8-pentamethyl-decalin/(2,2,4a,7,8-pentamethyl-decalin + 8β(H)-drimane) (c), 1,1,2,5,5-pentamethyl-trans-decalin/(1,1,2,5,5-pentamethyl-trans-decalin + 8β(H)-drimane) (d) in all studied oils.

Relatively high nC7/MCyC6 ratio indicates marine organic matter input, and relatively high CyC5 − 7/nC5 − 7 ratio indicates the terrestrial organic matter input38,39,40. In Fig. 8, the Tarim marine oils with relatively high nC7/MCyC6, low CyC5 − 7/nC5 − 7, and C5-R2 values (Table 4), indicate that the organic matter in their respective source rocks was predominantly derived from lower aquatic organisms. The Tarim swamp oils with relatively low nC7/MCyC6, high CyC5 − 7/nC5 − 7, and C5-R2 values (Table 4), indicate that the organic matter in their respective source rocks was predominantly derived from the terrestrial higher plants. The Beibuwan lacustrine oils with relatively low nC7/MCyC6, high CyC5 − 7/nC5 − 7, and low C5-R2 values (Table 4), suggest that the organic matter in their respective source rocks was derived from a mixture of lower aquatic organisms and terrestrial higher plants.

Fig. 8
figure 8

Cross-plots of C5-R2 vs. nC7/MCyC6 (a) and CyC5 − 7/nC5 − 7 (b) in all studied oils.

In conclusion, the relatively high C5-R2 value (> 1.0) suggests the oils originate from source rocks associated with terrestrial higher plants, while the relatively low C5-R2 value (< 1.0) indicates the oils are derived from source rocks containing a mixture of lower aquatic organisms and terrestrial higher plants.

Secondary alteration influence on C5-R1 and C5-R2 values

Evaporative fractionation

The low to medium molecular weight hydrocarbons in light oils and condensates are easily changed into vapor phase by evaporative fractionation, and then the primary composition of crude oil compounds is changed17,18. Thompson (1988) proposed the intersection chart of nC7/MCyC6 (F) and Tol/nC7 (B) values to evaluate whether evaporative fractionation occurs in crude oils. If the crude oils undergo evaporative fractionation, it will deviate from the normal oils, which is characterized by the decrease of F value (< 0.5) and the increase of B value (> 1.0)17. The Tarim marine oils with relatively high F value (1.64–2.04) and low B value (0.08–0.31) indicate these oils did not undergo the effect of evaporative fractionation. The Tarim swamp oils and Beibuwan lacustrine oils exhibit relatively low F values (0.19–0.69 and 0.18–0.53) and high B values (0.49–3.55 and 0.61–2.66). The F value is controlled by thermal maturity and organic matter type, and the low F values of Beibuwan lacustrine oils and Tarim swamp oils may mainly reflect significant the input of terrestrial organic matter. The toluene content in terrigenous organic matter is usually higher than that in marine organic matter41,42. Therefore, the Tarim swamp oils and Beibuwan lacustrine oils with relatively low F values and high B values may reflect terrestrial organic matter input. In addition, the compound into the vapor phase depends mainly on the vapor pressure of the compound, which depends on the molecular structure and molecular weight of the compound. Thus, the ratios of 1-E-2,4,6-TMB to 1-E-2,3,6-TMB (C5-R1) and 1-E-2,4,6-TMB/1-E-3,4,5-TMB (C5-R2) may not or slight be affected by evaporative fractionation due to 1-E-2,4,6-TMB, 1-E-2,3,6-TMB and 1-E-3,4,5-TMB with similar molecular structure and same molecular weight.

Biodegradation

Biodegradation can change the original composition of crude oil and has a great impact on the use of light hydrocarbon parameters, so it is of great significance to determine the impact of biodegradation on newly established parameters. The 25-norhopanes are usually effective indicator of biodegradation of crude oils43,44,45. There is no obvious 25-norhopanes detected in the Tarim swamp oils and Beibuwan lacustrine oils, indicating these oils did not undergo biodegradation. Although 25-norhopanes were detected in the Tarim marine oils, the light hydrocarbons in the Tarim marine oils were late charging components, and they did not occur biodegradation22,46. For aromatic compounds, the biodegradation rate is inversely proportional to the number of aromatic rings and directly proportional to the number of alkyl substituents. Due to 1-E-2,4,6-TMB, 1-E-2,3,6-TMB and 1-E-3,4,5-TMB with similar molecular structure and same molecular weight, even if the crude oils is biodegraded, the selective degradation of 1-E-2,4,6-TMB, 1-E-2,3,6-TMB and 1-E-3,4,5-TMB is not possible. Therefore, C5-R1 and C5-R2 ratios may not be affected by biodegradation.

Thermal maturity

The ratios of methyldibenzothiophene (MDBT), ethyldibenzothiophene (EDBT), and dimethyldibenzothiophene (DMDBT) compounds are used to indicate the thermal maturity of sedimentary organic matter based on the difference in thermal stability of compounds with different substituents of dibenzothiophene47,48. In Fig. 9, there is a good positive correlation between MDR with MDR’, EDR and EDR’ in all studied oils. There is obvious difference in thermal maturity in the Tarim marine oils, Tarim swamp oils, and Beibuwan lacustrine oils. Based on the different thermal stability of methylphenanthrene (MP) substituents, F1 and F2 have been proposed to indicate the thermal maturity of sedimentary organic matter49. The Tarim marine oils and the Beibuwan lacustrine oils are in the mature evolution stage, while the Tarim swamp oils is in the mature to high mature evolution stage (Fig. 10). The thermal maturity of the Tarim marine oils is similar to that in Beibuwan lacustrine oils, but the values of C5-R1 and C5-R2 in the Tarim marine oils are lower than those in Beibuwan lacustrine oils. Therefore, the C5-R1 and C5-R2 parameters are largely unaffected by thermal maturity.

Fig. 9
figure 9

Cross-plots of MDR vs. EDR (a) and MDR’ vs. EDR’ (b) for all studied oils.

Fig. 10
figure 10

Cross-plots of F1 vs. F2 in all studied oils. F1=[(2-MP + 3-MP)/(2-MP + 3-MP + 1-MP + 9-MP)], F2=[2-MP/(2-MP + 3-MP + 1-MP + 9-MP)]).

Table 4 Geochemical indicators of light oils used in this study.

Conclusions

The contents of twelve C5 alkylated benzenes in thirty-four light oils were quantitatively analyzed by GC×GC-TOFMS. 1-M-2-t-BB, 1-M-4-t-BB and 3-MBB are enriched in the Tarim swamp oils but are below the detection limit or depleted in the Tarim marine oils and Beibuwan lacustrine oils. The concentration of 1-E-3,4,5-TMB, 1-E-2,3,4-TMB, and 1-E-2,4,5-TMB in the Tarim marine oils and Beibuwan lacustrine oils are higher than those in the Tarim swamp oils. The concentrations of 1-E-2,4,6-TMB, 1-E-2,3,5-TMB and 1,2,3,4,5-PMB in the Beibuwan lacustrine oils are higher than those in the Tarim marine oils and Tarim swamp oils. There were no significant differences in the contents of 2-MBB, n-PB and 1-E-2,3,6-TMB among all studied oils. Based on the sensitivity of different C5 alkylated benzenes to sedimentary environment and organic matter sources, the C5-R1 and C5-R2 were proposed to distinguish the sedimentary environments and organic matter sources of crude oils derived from their respective source rocks. A higher C5-R1 value (> 0.5) indicates that the oils derived from their respective source rocks were formed in oxidizing sedimentary environments, while a lower C5-R1 value (< 0.5) indicates that the oils originated from source rocks formed in reducing sedimentary environments. A relatively high C5-R2 value (> 1.0) suggests that the oils derived from their respective source rocks was predominantly originated from terrestrial higher plants, while a relatively low C5-R2 value (< 1.0) indicates that the oils were derived from a mixture of lower aquatic organisms and terrestrial higher plants. The C5-R1 and C5-R2 ratios are unlikely to be significantly affected by evaporative fractionation, biodegradation, and thermal maturity.