Acoustic modulation signal recognition based on endpoint detection

Xiuquan, Li; Zhen, Wang; Yeyin, Jin; Jing, Chen; Zhenfei, Li

doi:10.1038/s41598-024-69934-y

Download PDF

Article
Open access
Published: 19 August 2024

Acoustic modulation signal recognition based on endpoint detection

Li Xiuquan¹,
Wang Zhen²,
Jin Yeyin²,
Chen Jing² &
…
Li Zhenfei²

Scientific Reports volume 14, Article number: 19198 (2024) Cite this article

1340 Accesses
2 Citations
Metrics details

Subjects

Abstract

The modulation recognition technology for acoustic signals holds significant research importance in signal demodulation and communication signal reconnaissance, serving as a crucial component and key aspect. This paper investigates the modulation recognition technology for acoustic signals (< 20 kHz) from the perspectives of signal preprocessing and feature extraction. Firstly, it selects seven modulation signals 2ASK, 4ASK, 2FSK, 4FSK, 2PSK, 4PSK, and OFDM as recognition targets and systematically compares the effectiveness of four different endpoint detection algorithms in modulation signal recognition. To further enhance the performance of the short-time energy entropy ratio algorithm, this study introduces three different noise reduction algorithms for optimization. Finally, to accurately identify and distinguish between 2 and 4FSK signals, this study optimizes the related algorithms of the cyclic spectrum by using the kurtosis coefficient value Kur of the cyclic spectrum parameter matrix when the cyclic frequency α = 0 to differentiate between these two signals. The results show that at SNR of 4 dB, the proposed modulation recognition algorithm can effectively distinguish between these two signals, achieving a recognition accuracy of over 99%.

Modulation recognition method of mixed signals based on cyclic spectrum projection

Article Open access 05 December 2023

Deep learning-based phase demodulation for distributed acoustic sensor

Article Open access 13 August 2025

Deep learning for enhancing automatic classification of M-PSK and M-QAM waveform signals dedicated to single-relay cooperative MIMO 5G systems

Article Open access 18 July 2025

Introduction

Major military powers worldwide attach great importance to the research in the field of signal confrontation, with communication signal reconnaissance being a focal point. Detecting crucial communication information and deciphering and disrupting it provide significant information and battlefield advantages for the side that conducts reconnaissance^1,2. Digital signal modulation recognition technology serves as a critical component in communication signal reconnaissance, acting as the foundation and basis for signal demodulation^3,4. In recent years, modulation methods and communication channels have become increasingly complex, posing new challenges to digital signal modulation recognition technology⁵. Therefore, researching the technology for recognizing digital signal modulation methods holds important research significance.

Before the invention of modern modulation recognition technology, the identification of signal modulation methods mainly relied on manual analysis using instruments such as oscilloscopes and spectrum analyzers. This involved analyzing various time–frequency domain characteristics of signals (such as instantaneous Amplitude, Frequency, and Phase) and determining the modulation method of the signal. This method was not only time-consuming but also lacked high accuracy in identification. In 1969, with the publication of the first literature discussing automatic modulation recognition technology⁶, this situation began to change, and more and more modulation recognition algorithms were proposed and validated. Currently, several mature modulation recognition algorithms can be broadly categorized into three types: those based on Decision Theory⁷, Feature Extraction⁸, and Machine Learning⁹.

In the realm of Decision Theory, Shi et al.¹⁰ explored a likelihood ratio construction algorithm based on the use of the phase probability witness function, achieving recognition of MPSK modulation methods. Zhu¹¹, using likelihood function theory under the conditions of unknown symbol rate and carrier frequency, achieved the recognition of MQAM and MPSK modulation method signals. Chen et al.¹² designed a maximum likelihood function estimator that could operate even in a fading channel environment. Shan et al.¹³ designed a likelihood algorithm based on the average likelihood ratio test method, independent of channel conditions and antenna numbers.

In terms of Feature Extraction, Nandi et al.¹⁴ extracted the instantaneous features of signals and used a Decision Tree Algorithm to achieve modulation method recognition. Fang¹⁵ and colleagues employed a recognition method based on Random Forests to automatically identify four underwater acoustic signal modulation methods: OFDM, 2FSK, 4FSK, and 8FSK. Wang¹⁶ and colleagues used Random Forests as classifiers, achieving a modulation method recognition accuracy of over 95% in underwater acoustic signals when the SNR was above -5dB. Sahidullah et al.¹⁷ proposed a speech recognition algorithm based on MFCC. Zheng et al.¹⁸ discussed in detail the factors that may affect the performance of MFCC. Shikha et al.¹⁹ utilized MFCC for gesture recognition. The results show that the proposed application of using MFCC for gesture recognition has very high accuracy.

Machine Learning-based modulation signal recognition also demonstrates good performance²⁰. Compared to traditional pattern recognition algorithms, Deep Learning Algorithms can automatically extract more effective data features to obtain better classification results. Currently, Networks with structures such as Convolutional Neural Networks ^21,22,23,24, Recurrent Neural Networks^25,26, and others have achieved good results in the field of automatic modulation recognition.

The structure of this study is as follows: Section “Methodology” introduces the endpoint detection methods, denoising methods, and the overall logic of this study regarding modulation signal recognition. Section “Different feature parameters” provides an overview of the modulation signal recognition process. Section “Conclusions” presents the results of traditional modulation signal recognition and compares them with the methods proposed in this study. Section “Discussion” discusses the work completed in this study and outlines future work.

Methodology

Modulation signal endpoint detection

(1)
Short time energy method

Assuming the time-domain waveform of the i frame signal is represented as y_i(n), the calculation method for the short time energy value E(i) of the i frame signal y_i(n) is as follows:

$$ E(i) = \sum\limits_{n = 0}^{L - 1} {y_{i}^{2} } (n) $$

(1)

In the formula, L represents the frame length.

(2)
Spectral entropy method

The concept of entropy was originally applied in thermodynamics to represent the degree of disorder in molecules and later used to reflect the uncertainty of random events ^27,28. With the interdisciplinary development and integration of various fields, entropy has also played a more significant role in the domain of signal detection and recognition. There is a clear positive correlation between the disorder level of a signal and its entropy value; typically, the entropy value is higher for useful signals compared to noise.

Assuming the original signal is denoted as x(n), and performing the Fourier transform on the framed signal, let Y_i(k) represent the spectral value of the k spectral line of the i frame signal after Fourier transform. The normalized spectral density function p_i(k) is then defined as:

$$ p_{i} (k) = Y_{i} (k)/\sum\limits_{l = 0}^{N/2} {Y_{i} } (l) $$

(2)

In the formula, p_i(k) represents the spectral density value corresponding to the k spectral line of the i frame signal after FFT transformation, Y_i(k) denotes the energy spectrum of the kth spectral line, and N is the FFT length. Therefore, the short-time spectral entropy value H_i of the i frame signal is defined as:

$$ H_{i} = - \sum\limits_{k = 0}^{N/2} {p_{i} } (k)\lg p_{i} (k) $$

(3)

(3)
Short time energy entropy ratio method

Through the introduction to the principles of the short time energy method and spectral entropy method, it is evident that the energy envelope of a signal segment and the entropy envelope tend to be opposite in the same interval. Utilizing the energy-entropy ratio can better highlight the respective characteristics and differences between useful signal segments and noise segments.

Assuming the original signal is denoted as x(n), and x(n) is framed, with the waveform of the i frame signal being x_i(m), and the total number of frames being A, then the energy of each frame is given by:

$$ AMP_{i} = \sum\limits_{m = 1}^{A} {x_{i}^{2} } (m) $$

(4)

The short time energy entropy ratio of the signal can be expressed as:

$$ EEF_{i} = \sqrt {1 + \left| {EL_{i} /H_{i} } \right|} $$

(5)

(4)
Mel frequency cepstral coefficients method

Mel Frequency Cepstral Coefficients (MFCC) is a method for analyzing the spectrum of a signal ^29,30, simulating the perceptual results of the human auditory system. The MFCC method has deep applications in signal recognition, as it extracts distinctive features from the signal. The human ear has varying perceptual sensitivity to signals in different frequency ranges. As the signal frequency gradually increases, the relationship between human ear perception and frequency slowly transitions from a linear to a logarithmic one. Additionally, in human perception, lower-frequency signals tend to mask higher frequency signals. To emulate these characteristics, a set of nonlinear filters called Mel filters has been designed. These filters exhibit sparse critical bandwidths in the high frequency region and tight critical bandwidths in the low frequency region, mimicking the masking effect in the human ear. The MFCC method employs these filters to filter the input signal, and the energy values of the resulting output are used as fundamental features of the signal. This feature is independent of prior knowledge, possesses good robustness, and performs well even in low SNR conditions.

Mel frequency reflects the relationship between perception and frequency f, and it can be expressed as:

$$ {\text{Mel}} (f) = 2595*\log (1 + f/700) $$

(6)

In the formula, f represents the frequency of the signal, and Mel(f) represents the corresponding Mel frequency, measured in Mel units.

Noise reduction methods for signals

(1)
Wavelet analysis for denoising

In recent years, wavelet theory has undergone continuous improvement and refinement, reaching a new level in understanding and application of wavelet analysis. It is applied in various fields of signal and image processing³¹. One major application of wavelet analysis in these fields is denoising one-dimensional signals. The following will briefly introduce the principles of wavelet denoising. A one-dimensional signal contaminated with noise can be represented as:

$$ s(i) = f(i) + \varepsilon * e(i) $$

(7)

In the formula, e(i) is the noise signal, s(i) is the signal with noise, and f(i) is the signal without noise, also known as the true signal.

The purpose and method of denoising the signal s(i) is to suppress the noise signal e(i) and highlight the true signal f(i). In practical applications, the audio noise signal is often of higher frequency, while the effective signal is usually composed of several low frequency stationary signals. The wavelet decomposition process generates a series of wavelet coefficients, with the noise signal primarily present in the high frequency coefficients. Therefore, selecting an appropriate threshold can filter and process the wavelet coefficients, eliminating the high frequency components and recombining the remaining parts to obtain the denoised signal.

Typically, the process of one-dimensional wavelet denoising is as follows:

(1)
Determine suitable wavelet basis and decomposition scales based on experience and experimentation, then perform wavelet decomposition.
(2)
Determine the threshold for wavelet coefficients to reduce the weight of high frequency components throughout the signal.
(3)
Reconstruct a new signal using the information from all wavelet coefficients based on the weights of low-frequency and high-frequency components, which becomes the denoised signal.

In the above steps, selecting appropriate thresholds and weights is crucial as it directly affects the quality of the wavelet denoising algorithm. There are several methods for setting thresholds in wavelet analysis:

(1)
Use the threshold provided by the ddencmp function in MATLAB software. This method is time-saving but may not perform well for specific signals.
(2)
Determine thresholds through empirical formulas and continuous experimentation. This method yields better denoising results than default thresholds but requires time and effort for derivation and experimentation, and this is the method employed in this study.
(3)
Set the weight of all high-frequency coefficients to zero. This method can eliminate the impact of noise signals but may distort the obtained denoised signal, resulting in the loss of information contained in the signal, so it is generally used less frequently.

(2)
Empirical mode decomposition denoising

Empirical Mode Decomposition (EMD) is a method for processing the time-domain part of a signal³². It decomposes the initial signal x(t) into a set of symmetric Intrinsic Mode Functions (IMF) c_i, each with local mean, and a residual term r_n:

$$ x(t) = \sum\limits_{i = 1}^{n} {c_{i} } + r_{n} $$

(8)

In this formula, each component of c_i, known as IMF, is arranged in descending order of average frequency. These IMF components reflect some detailed temporal features of the signal.

EMD decomposition can reduce the impact of noise and interference on complex and dynamic signals, revealing the intrinsic characteristics of the signal and facilitating further extraction. The IMF components obtained through EMD decomposition are derived and computed directly from the signal itself, distinguishing it from methods such as FFT and wavelet transforms. This characteristic ensures that the analyzed signal is non-stationary.

Analyzing the envelope characteristics of modulated signals allows the extraction of information from high frequency modulated signals, revealing subtle features that contribute to spectral analysis. Unlike previous envelope analysis methods heavily reliant on prior knowledge, where the accuracy of filter center frequency and bandwidth significantly affects the analysis results, EMD decomposition includes high frequency resonance components in the IMF components. EMD decomposition is adaptive, requiring no prior knowledge of these details, and is capable of envelope analysis without significant distortion of the signal.

(3)
Intrinsic time decomposition denoising

Intrinsic Time Decomposition (ITD) is similar to EMD and is also a signal processing method applied in the time domain.

EMD method, compared to wavelet analysis, exhibits better adaptability to various non-stationary signals but has some issues such as mode mixing. ITD method is a new signal processing approach in the time domain that complements EMD, addressing the mode mixing phenomenon present in EMD. These two methods construct linear signals differently, where ITD employs a linear operator, effectively suppressing the mode mixing phenomenon observed in EMD. EMD uses a two-layer loop structure, while ITD improves upon this by utilizing a single-layer iteration, enabling faster analysis and processing of signals. ITD maintains the characteristics of IMF components seen in EMD, providing orthogonality and completeness. Additionally, ITD adapts dynamically to handle a large volume of signals, effectively reducing the time required for signal analysis.

The purpose of intrinsic time-scale decomposition is to decompose the data into a sum of a series of rotational components that characterize the signal features and a monotonic trend component, thereby obtaining meaningful instantaneous frequency and amplitude information.

Let X_t be the original signal, and define L as the baseline extraction operator. After applying L to the original signal, the remaining residue is defined as the intrinsic rotation. Therefore, if we denote the intrinsic rotation extraction operator as H, then H = 1 − L. his further decomposes X_t as³³:

$$ X_{t} = Lx_{t} + (1 - L)x_{t} = L_{t} + H_{t} $$

(9)

In the formula, L_t and H_t are the baseline signal and the intrinsic rotation, respectively.

Optimization of MFSK signal recognition algorithm

Currently, there are two main methods used to distinguish between 2 and 4FSK signals. The first method involves using instantaneous parameters as feature parameters to differentiate MFSK signals. However, its recognition effectiveness is significantly influenced by the signal-to-noise ratio, making it challenging to serve as a reliable criterion for distinguishing MFSK signals, especially in low signal-to-noise ratio conditions. Another method is to calculate the number of peaks in the signal's cyclic spectrum, denoted as P_k, to differentiate between these two types of signals. However, simulation results in this study indicate that the cyclic spectrum of a 2FSK signal exhibits clear peaks, usually around 2, while the cyclic spectrum of a 4FSK signal lacks obvious peaks, and the number of peaks P_k fluctuates between 1 and 6, with the most common cases having 3 or 4 peaks.

This study optimizes the second method mentioned above. Instead of calculating the number of spectral peaks when the cyclic frequency α = 0, it now calculates the kurtosis coefficient Kur of the cyclic spectrum parameter matrix at α = 0 to distinguish between 2 and 4FSK modulated signals. The specific identification steps are as follows:

(1)
The input signal is subjected to one-dimensional discrete stationary wavelet denoising, and three levels of stationary wavelet coefficients are returned. Since Gaussian noise is mainly concentrated in the first two levels of wavelet coefficients, the third-level wavelet coefficients are taken as the denoised signal for subsequent steps.
(2)
Apply the time-domain smoothing cyclic method to calculate the cyclic spectrum parameter matrix $S_{x}^{\alpha } (f)$ for the denoised signal obtained in step 1. Extract all the values at cyclic frequency $\alpha = 0$ from the cyclic spectrum parameter matrix $S_{x}^{\alpha } (f)$, and save the results as a new row vector matrix $S_{\alpha = 0}$.
(3)
Calculate the Kur for the row vector matrix $S_{\alpha = 0}$, and use the kurtosis coefficient Kur as the feature parameter T_6* for identifying 2FSK and 4FSK modulation signals. The calculation method for kurtosis coefficient Kur is as follows:
$$ T_{6*} = Kur = \frac{{\mu_{4} }}{{\sigma^{2} }} $$
(10)

In the formula, $\mu_{4}$ represents the fourth central moment of matrix $S_{\alpha = 0}$, and $\sigma^{2}$ represents the variance of matrix $S_{\alpha = 0}$. Their calculation formulas are as follows:

$$ \mu_{4} = \frac{1}{n}\sum\limits_{1}^{n} {(S_{\alpha = 0} - \overline{{S_{\alpha = 0} }} )^{4} } $$

(11)

$$ \sigma^{2} = \frac{1}{n}\sum\limits_{1}^{n} {(S_{\alpha = 0} - \overline{{S_{\alpha = 0} }} )^{2} } $$

(12)

In the formula, n represents the number of elements in matrix $S_{\alpha = 0}$, and $\overline{{S_{\alpha = 0} }}$ represents the mean of matrix $S_{\alpha = 0}$.

The identification process described above is illustrated in Fig. 1.

Overall approach of this study

The overall technical approach of the modulation signal recognition method proposed in this study is shown in Fig. 2.

As shown in Fig. 2, firstly, several commonly used endpoint detection methods are compared, and the method with the highest accuracy is selected for further optimization by feature extraction of the signal, so as to achieve the purpose of final signal recognition.

The specific process is as follows:

(1)
Simulate the modulation, communication, and reception process of digital modulation signals using MATLAB.
(2)
Preprocess the received signals to obtain the baseband signals.
(3)
Calculate feature parameter T1. Using the T1 value, differentiate the OFDM signal from the single-carrier modulated signals, dividing the signals into {OFDM} and {2ASK, 4ASK, 2FSK, 4FSK, 2PSK, 4PSK} categories.
(4)
Calculate feature parameter T2. Using T2, separate the signal set {2ASK, 4ASK, 2FSK, 4FSK, 2PSK, 4PSK} into {2ASK, 4ASK} and {2FSK, 4FSK, 2PSK, 4PSK} categories.
(5)
Calculate feature parameter T3. Using T3, divide the signal set {2FSK, 4FSK, 2PSK, 4PSK} into {2FSK, 4FSK} and {2PSK, 4PSK} categories.
(6)
Calculate feature parameter T4. Using T4, separate the signal set {2ASK, 4ASK} into {2ASK} and {4ASK} categories.
(7)
Calculate feature parameter T5. Using T5, divide the signal set {2PSK, 4PSK} into {2PSK} and {4PSK} categories.
(8)
Calculate feature parameter T6. Using T6, divide the signal set {2FSK, 4FSK} into {2FSK} and {4FSK} categories.

Different feature parameters

In this paper, feature extraction was performed using high-order cumulants³⁴, the maximum value of the zero-center normalized instantaneous amplitude spectral density^14,35, and the absolute amplitude standard deviation^36,37.

Identification of feature parameter T ₁ for OFDM signals

The theoretical values of cumulants for each digital modulation signal can be obtained from the method of calculating high-order cumulant theoretical values, as shown in Table 1. It is assumed that the energy value of the input signal is E.

Table 1 Theoretical values of cumulants for digital modulation signals.

Full size table

From Table 1, it can be observed that the cumulative values of OFDM signals are zero for the second order and higher, allowing for the distinction of OFDM from other signals based on this characteristic. Additionally, it is noteworthy that the cumulative values for 2FSK and 4FSK signals are the same, making it impossible to differentiate between these two signals using cumulative values.

In summary, we define the feature parameter T1:

$$ T_{1} = \left| {C_{42} } \right| $$

(13)

The table indicates that, except for OFDM signals, the T₁ values for the remaining modulation signals are all greater than 0. Using the feature parameter T₁ effectively distinguishes OFDM signals from other signals. In a Gaussian channel with a signal-to-noise ratio (SNR) ranging from 0 to 15 dB, the T₁ values for the seven modulation signals were calculated. The simulation results, averaged over 500 repeated experiments, are shown in Fig. 3.

From Fig. 3, it can be observed that with the change in SNR, the T₁ values for each modulation signal remain relatively stable, and they generally match the theoretical values in Table 1. The T₁ value for OFDM signals stays close to 0, providing a clear distinction from the T₁ values of other modulation signals. This indicates that OFDM signals can be effectively identified from the seven modulation signal types based on their T₁ values.

Distinguishing feature parameter T ₂ for MASK signals from MFSK and MPSK signals

From Table 1, it can be observed that the high-order cumulants for 2ASK signals are essentially the same as those for 2PSK signals. Relying solely on high-order cumulants as feature parameters is insufficient to effectively distinguish between these two types of signals. In this study, the maximum value $\gamma_{\max }$ of the zero centered normalized instantaneous amplitude spectrum density is used as the feature parameter T₂, enabling the discrimination of {2ASK, 4ASK} signals from the modulation signal set {2PSK, 4PSK, 2FSK, 4FSK}.

$$ T_{2} = \gamma_{\max } = \max \left| {FFT[a_{cn} (i)]^{2} /N_{s} } \right| $$

(14)

Simulations were conducted for the above modulation signals in a Gaussian channel with a SNR ranging from 0 to 15 dB. The T₂ values were calculated for the received signals, and the results are shown in Fig. 4.

As shown in Fig. 4, by selecting a suitable threshold, {2ASK, 4ASK} can be distinguished from other modulation signals. With the increase in SNR, the distinctiveness of T₂ values between {2ASK, 4ASK} and other signals also increases.

Distinguishing feature parameter T ₃ for MFSK signals from MPSK signals

By analyzing the high-order cumulant values of MFSK and MPSK signals in Table 1, this study defines the feature parameter T₃:

$$ T_{3} = \left| {C_{40} } \right|/\left| {C_{42} } \right| $$

(15)

Through calculations, the theoretical values of T₃ for 2PSK and 4PSK modulation signals are 1, while the theoretical values of T₃ for 2FSK and 4FSK modulation signals are 0. Feature parameter T₃ can effectively distinguish between these two types of signals. Simulation verification is performed in the case of SNR from 0 to 15 dB, and the calculated T₃ values for each received modulation signal are shown in Fig. 5.

As shown in Fig. 5, in the Gaussian channel with SNR ranging from 0 to 15 dB, the T₃ values of various modulation signals remain relatively stable regardless of changes in SNR. Additionally, these values closely match the calculated values. The T₃ value differentiation between the two types of modulation signals is significant, allowing for the effective distinction between PSK and FSK modulation signals using T₃ values.

Identification of feature parameter T ₄ for 2ASK and 4ASK signals

The absolute amplitude standard deviation is employed as the feature parameter T₄ to distinguish the 2ASK signal from the modulation signal set {2ASK, 4ASK}.

$$ T_{4} = \sigma_{aa} = \sqrt {\frac{1}{c}\left( {\sum\limits_{{a_{n} (i){ > }a_{t} }} {a_{cn}^{2} (i)} } \right) - \frac{1}{c}\left( {\sum\limits_{{a_{n} (i){ > }a_{t} }} {\left| {a_{cn} (i)} \right|} } \right)^{2} } $$

(16)

Figure 6 shows the T₄ values for various modulation signals, simulated in a Gaussian channel with a SNR ranging from 0 to 15 dB.

As shown in Fig. 6, under low signal conditions (SNR = 0 dB), it is challenging to effectively distinguish between these two signals. However, at higher SNR (SNR = 4dB), a suitable threshold (in this study, thr = 0.3) can be employed to separate 2ASK and 4ASK. With the increase in SNR, the discrimination between T₄ values for 2ASK and 4ASK signals becomes more pronounced, facilitating a better separation of these two modulation signals.

Identification of feature parameter T ₅ for 2PSK and 4PSK signals

By analyzing the high-order cumulant values of 2PSK and 4PSK signals in Table 1, this study defines the feature parameter T₅:

$$ T_{5} = \left| {C_{30} } \right|^{3} /\left| {C_{42} } \right|^{2} $$

(17)

By calculation, the theoretical value of T₅ for 2PSK modulation signal is 32, while the theoretical value of T₅ for 4PSK modulation signal is 0. Using the feature parameter T₅ can distinguish between these two signals. Experimental simulation in a Gaussian channel with SNR ranging from 0 to 15 dB was conducted, and the calculated T₅ values for received modulation signals are shown in Fig. 7.

From Fig. 7, it can be observed that in the Gaussian channel with SNR ranging from 0 to 15 dB, the T₅ values of various modulation signals basically remain unchanged with the change in SNR, and they are consistent with the theoretical values. The discrimination between the two types of modulation signals based on T₅ values is significant, allowing us to distinguish between 2 and 4PSK signals using T₅ values.

Optimization of MFSK signal recognition algorithm

(1)
MFSK Signal Recognition Based on Instantaneous Parameters

The traditional instantaneous parameter method uses the standard deviation $\sigma_{af}$ of the zero centered normalized instantaneous frequency absolute value to distinguish between 2 and 4FSK modulation signals. This method was simulated in this study, and $\sigma_{af}$ was employed as the feature parameter T₆ to differentiate the 2FSK signal from the {2FSK, 4FSK} modulation signal set.

$$ T_{6} = \sigma_{af} = \sqrt {\frac{1}{c}\left( {\mathop \sum \limits_{{a_{n} (i) > a_{t} }} f_{NL}^{2} (i)} \right) - \frac{1}{c}\left( {\mathop \sum \limits_{{a_{n} (i) > a_{t} }} \left| {f_{NL} (i)} \right|} \right)^{2} } $$

(18)

Simulations were conducted on the above-mentioned modulation signals, with the SNR set in the range of 0 to 15 dB. The T₆ values for the received signals are depicted in Fig. 8.

As shown in Fig. 8, when the SNR is relatively high (SNR = 10dB), it is possible to differentiate between 2 and 4FSK signals by selecting a suitable threshold (in this case, thr = 0.41). With increasing SNR, the distinctiveness of T₆ values between 2 and 4FSK signals also improves, facilitating better differentiation between these two signals. However, at lower SNR values (e.g., SNR = 0 dB), distinguishing between these two modulation signals becomes challenging using the instantaneous parameter $\sigma_{af}$.

As shown in Fig. 9, this recognition method achieves accurate identification of 2FSK and 4FSK signals when the SNR is greater than 10 dB, with recognition accuracy exceeding 95% for both types of signals. However, in an environment with a SNR of 5 dB, the algorithm based on instantaneous parameters has a lower recognition accuracy for 2FSK signals, reaching only 70%, and the recognition accuracy for 4FSK signals also falls short of the requirement at 83%. Comparing these results with those in Fig. 8, it can be inferred that when the SNR is too low, the distinction between the modulation signals of 2FSK and 4FSK using the instantaneous parameter $\sigma_{af}$ becomes less obvious, making it challenging to accurately separate them using $\sigma_{af}$.

(2)
MFSK signal recognition based on the number of peaks in the cyclo-spectrum

Currently, the mainstream method is to use the number of spectral peaks P_k in the cyclic spectral profile at cyclic frequency α = 0 to identify MFSK modulation signals. This study simulates and verifies this method, using the number of spectral peaks P_k as the feature parameter T₆ to distinguish the {2FSK, 4FSK} modulated signal set.

$$ T_{6} = P_{k} $$

(19)

The theoretical values of the spectral peak count P_k for 2FSK and 4FSK modulated signals are known to be 2 and 4, respectively, based on references in the literature.

Simulations were conducted on the {2FSK, 4FSK} modulation signal set in a Gaussian channel. At a SNR of 0 dB, the cyclic spectrum cut of the 2FSK and 4FSK modulation signals is shown in Fig. 10. With SNR ranging from 0 to 15 dB, T₆ values were calculated separately for 2FSK and 4FSK signals through 500 repeated experiments. The final simulation results are presented in Fig. 11.

As shown in Fig. 10, when the frequency f is greater than zero, (a) the spectral peaks of the 2FSK signal in the graph are more prominent and distinct, and the number of spectral peaks P_k is consistent with the theoretical value. (b) The spectral peaks of the 4FSK signal in the graph are not obvious, and the number of spectral peaks P_k is between 3 and 5, deviating from the theoretical value.

As shown in Fig. 11, under a Gaussian channel with a SNR ranging from 0 to 15 dB, the number of spectral peaks for the 2FSK signal remains around 2, which is consistent with the theoretical value. The number of spectral peaks for the 4FSK modulation signal fluctuates within the range of 3.4–3.7, showing a certain disparity from the theoretical value. However, they correspond to the figures in (b) of Fig. 10, confirming the insignificant issue of spectral peak quantity for the 4FSK modulation signal.

To further analyze the discrepancy between the simulated values and theoretical values for the occurrence of spectral peaks in the 4FSK modulation signal shown in Fig. 11, under a Gaussian channel with an SNR of 0 dB, the T₆ values for randomly generated 4FSK modulation signals are calculated. This process is repeated 100 times in simulation experiments, and the occurrence of spectral peaks is recorded. The results are shown in Fig. 12.

As shown in Fig. 12, the T₆ values for the 4FSK modulation signal fluctuate between 1 and 6, with the majority (77%) falling in the range of 3–4. T6 values fluctuating between 1 and 2 occur 13 times, accounting for 13% of the total occurrences. When the T₆ values for the 4FSK signal fluctuate between 1 and 2, identifying the 4FSK modulation signal based on the spectral peak count P_k may lead to a misclassification as 2FSK in the final output.

In a Gaussian channel with SNR ranging from 0 to 15 dB, the T₆ values for 2FSK and 4FSK signals are calculated and compared with a threshold value (set to 2 in this study). Signals with T₆ values greater than 2 are classified as 4FSK, while those less than or equal to 2 are classified as 2FSK. The simulation is independently repeated 1000 times, and the recognition accuracy for 2FSK and 4FSK modulation signals is depicted in Fig. 13.

As shown in Fig. 13, this recognition method achieves perfect identification of 2FSK signals in a Gaussian channel with a SNR greater than 2 dB, with a recognition accuracy of 100%. However, the recognition accuracy for 4FSK signals does not meet the requirements, reaching only around 85%. Comparing these results with Fig. 12, it can be predicted that the cyclic spectrum peaks of 4FSK modulated signals are not distinct, and some 4FSK signals are misclassified as 2FSK signals, resulting in a lower recognition accuracy for 4FSK signals.

(3)
MFSK signal recognition based on the kurtosis coefficient of the cyclo-spectrum

In summary, this study proposes an optimized approach for distinguishing the {2FSK, 4FSK} signal set based on the number of cyclic spectrum peaks P_k. The optimization involves not calculating the number of peaks at the cyclic frequency $\alpha = 0$ but instead computing the kurtosis coefficient Kur of the cyclic spectrum parameter matrix at $\alpha = 0$ to differentiate between 2 and 4FSK modulated signals. The recognition steps are outlined in Section “Optimization of MFSK signal recognition algorithm”.

In a Gaussian channel with a SNR ranging from 0 to 15 dB, the T_6* values for received 2FSK and 4FSK signals were calculated according to the aforementioned procedure. The simulation results are depicted in Fig. 14.

From Fig. 14, it can be observed that in a Gaussian channel with a SNR ranging from 0 to 15 dB, the T_6* values for both modulation signals remain relatively stable and are not sensitive to changes in the signal-to-noise ratio. There is a significant distinction in T_6* values between the two modulation signals, allowing for effective differentiation between 2 and 4FSK signals. By setting a threshold value $\eta = 30$, signals with T_6* values greater than the threshold are considered as 2FSK signals, while those with values below the threshold are considered as 4FSK signals.

In a Gaussian channel with a SNR ranging from 0 to 10 dB, the T_6* values for received 2FSK and 4FSK signals were calculated. These values were then compared with a set threshold value $\eta$. The simulation was repeated 1000 times, and the recognition accuracy for 2FSK and 4FSK modulation signals is depicted in Fig. 15.

That algorithm can accurately distinguish these two signals when the SNR is greater than 1 dB, achieving a recognition accuracy of 92%. When the SNR is greater than 3 dB, this method can completely distinguish between 2 and 4FSK signals, with recognition accuracies exceeding 99% for both signals.

Table 2 shows the recognition accuracy of the instantaneous parameter algorithm, the peak number algorithm and the proposed algorithm under different SNR.

Table 2 Comparison of recognition accuracy of FSK signals by three algorithms.

Full size table

As shown in Table 2, the algorithm based on instantaneous parameter $\sigma_{af}$ maintains a relatively high recognition accuracy at high SNR. However, in situations where the SNR is less than 5 dB, the recognition accuracy drops significantly. The algorithm based on the number of spectral peaks P_k in the cyclic spectrum exhibits extremely high recognition accuracy for 2FSK but falls short in recognizing 4FSK signals. The algorithm proposed in this study, which uses the kurtosis coefficient Kur of the cyclic spectrum as a substitute for the number of spectral peaks P_k, maintains a 99% recognition accuracy for 2FSK and 4FSK signals even at lower SNR. This algorithm demonstrates superiority when compared to the other two methods.

Conclusions

Simulation experiment

In the simulation testing of the overall modulation recognition design scheme, the symbol rate of the modulation signal is set to 5000 Baud, the sample length of the simulated waveform is 4 s, the carrier frequency is set to 20 kHz, and the sampling frequency of the receiving device is set to 120 kHz.

Under Gaussian channels with SNRs ranging from 1 to 10 dB, the six feature parameters are calculated for the received signals to identify the signal types. The recognition experiments for each signal are repeated 500 times at each SNR, and the recognition accuracy data for the seven modulation signals are saved. The results are shown in Fig. 16. It is worth noting that reducing the signal length may lead to a decrease in recognition accuracy.

As shown in Fig. 16, when the SNR is 4 dB, the overall accuracy of the modulation recognition algorithm used in this study can reach over 92%. When the SNR is greater than 6 dB, the overall recognition accuracy of the algorithm can be maintained at over 95%. Specifically, when the SNR is greater than 2 dB, both 2FSK and 4FSK signals can be accurately identified, with recognition accuracy rates exceeding 97%.

Simulation experiment combined with endpoint detection

(1)
Comparison of the effects of several different endpoint detection algorithms

In a Gaussian channel, the endpoint detection effects of different endpoint detection algorithms for the same signal are studied. The SNR of the Gaussian channel is set to 0 dB, and the waveform of the signal before and after passing through the Gaussian channel is shown in Fig. 17.

Endpoint detection is performed using Short Time Energy Entropy Ratio Method, Short Time Energy Zero Ratio Method, Short Time Spectral Entropy Method, and Short Time MFCC Distance Method. The results are illustrated in Fig. 18.

As shown in Fig. 18, the envelope curve formed by the Short Time Energy Entropy Ratio method can best reconstruct the original waveform of the signal, making it suitable for further endpoint detection work. However, this simulation only verifies the detection effect of the Short Time Energy Entropy Ratio algorithm on general communication signals. For further research on the detection effect of the four endpoint detection algorithms on modulation signals, additional experiments are needed. By changing the modulation type of the signals, with the signal set {OFDM, 2ASK, 4ASK, 2FSK, 4FSK, 2PSK, 4PSK}, under a Gaussian channel with a SNR of 0 dB, the endpoint detection accuracy of the four algorithms is calculated for each signal type. Each signal type is experimented with 100 times, and the final endpoint detection accuracy is the average value. The experimental results are shown in Table 3.

$$ {\text{Endpoint}}\,{\text{Detection}}\,{\text{Accuracy}} = \left( {{\text{Total}}\,{\text{Frames}} - \left( {{\text{False}}\,{\text{Alarms}} + {\text{Missed}}\,{\text{Detections}}} \right)} \right)/{\text{Total}}\,{\text{Frames}} $$

Table 3 Average endpoint detection accuracy of various algorithms when the SNR of 0 dB.

Full size table

As shown in Table 3, under a Gaussian channel with a SNR of 0 dB, for these seven modulation signal types, the Short Time Energy Entropy Ratio algorithm performs the best, maintaining an endpoint detection accuracy of over 93%. This is superior to the Short Time Spectral Entropy Method, Short Time Energy Zero Ratio Method, and MFCC Distance Method. Additionally, the Short Time Energy Entropy Ratio algorithm is adaptable to various modulation signals. From Table 3, it can be observed that when the types of input modulation signals vary, the fluctuation range of the endpoint detection accuracy for the Short-Time Energy Entropy Ratio algorithm is the smallest, remaining around 2%, which is better than the other three algorithms.

(2)
Simulation of short time energy entropy ratio endpoint detection combined with denoising algorithms

In order to further improve the performance of the endpoint detection algorithm, an attempt was made to preprocess the signal with denoising before applying the Short Time Energy Entropy Ratio algorithm. Simulations were conducted for denoising effects using Empirical Mode Decomposition (EMD), Iterative Thresholding Decomposition (ITD), and Wavelet decomposition algorithms under a Gaussian channel. The time taken by each denoising algorithm was recorded.

In a Gaussian channel with SNR ranging from -20dB to 10dB, 300 Monte Carlo simulation experiments were conducted based on the Short-Time Energy Entropy Ratio and various denoising algorithms. The endpoint detection accuracy of each algorithm is shown in Fig. 19. The time taken by each denoising algorithm was 1.207 s, 0.023 s, and 0.003 s, respectively.

In Fig. 19, the ORI group serves as the control group without using denoising algorithms. The ITD group uses Intrinsic Time-Scale Decomposition, the EMD group uses Empirical Mode Decomposition, and the Wavelet group uses Wavelet decomposition as denoising algorithms.

From the graph, it can be observed that, in a Gaussian channel with a SNR greater than – 15 dB, the EMD denoising and Wavelet denoising algorithms outperform the ORI curve without denoising. Notably, the Wavelet denoising algorithm is superior to the EMD denoising algorithm. At a SNR of – 20 dB, the use of the Wavelet denoising algorithm is optimal, achieving a correctness rate of 90%, while the EMD denoising effect is similar to the control group without denoising. The ITD algorithm’s performance is inferior to the other three approaches. In the SNR range of – 20 dB to – 10 dB, the ITD denoising algorithm is less effective than the other three methods, possibly because the ITD decomposition disrupts some characteristic parameters of the input signal, degrading the Short Time Energy Entropy Ratio curve. When the SNR is greater than – 20 dB, the combination of the Wavelet denoising algorithm and the Short Time Energy Entropy Ratio-based endpoint detection algorithm achieves a 95% endpoint detection accuracy, meeting the requirements.

In the Gaussian channel, the EMD denoising algorithm takes the longest time, at 1.207 s, while the Wavelet denoising algorithm takes the shortest time, at 0.003 s. Combining the denoising algorithm’s endpoint detection accuracy under the Gaussian channel from Fig. 19, it can be concluded that, in a Gaussian channel, the denoising effect of the Wavelet denoising algorithm, in conjunction with the Short Time Energy Entropy Ratio algorithm, is the best, and this algorithm requires the shortest computation time.

(3)
Simulation of modulation recognition combined with endpoint detection

To evaluate the impact of the designed endpoint detection module on the overall modulation recognition system’s accuracy and verify the stability of the endpoint detection module in conjunction with the modulation recognition system, a simulation test was conducted by adding a preprocessing module (endpoint detection module) before the modulation recognition system. The testing steps are as follows:

(1)
Use MATLAB to generate corresponding test signals selected from the {2ASK, 4ASK, 2FSK, 4FSK, 2PSK, 4PSK, OFDM} signal set, as shown in Fig. 20A.
(2)
Add randomly sized blank signal segments before and after the generated modulation signals, resulting in mixed signal segments, as depicted in Fig. 20B.
(3)
Considering the information from Fig. 16, where overall modulation recognition accuracy is close to 95% in a Gaussian channel with SNR greater than 5 dB, simulate the channel environment with a Gaussian channel and SNR = 5 dB. The signal after adding Gaussian noise is illustrated in Fig. 20C.
(4)
Combine the foregoing, apply wavelet denoising to the noisy signal. Subsequently, use the endpoint detection algorithm based on short-time energy entropy to extract modulation signal segments. The resulting signal segments are shown in Fig. 20D.
(5)
Use the obtained signal segments as input to the entire modulation recognition system. Conduct a simulation to recognize the modulation type.
(6)
Vary the modulation signal types and repeat steps 1–5 for each of the 7 modulation signal types. Test each signal type 500 times.

Present the overall modulation recognition results as shown in Table 4.

Table 4 Modulation recognition test accuracy.

Full size table

As shown in the results in Table 4, under a Gaussian channel with a SNR of 5 dB, the overall recognition accuracy of the modulation recognition system combined with the endpoint detection module is comparable to the ideal signal recognition accuracy shown in Fig. 16. The recognition accuracy fluctuates around 95% in both simulation tests. This indicates that the modulation signal segments, obtained through endpoint detection technology, do not undergo significant distortion and do not interfere with the subsequent recognition and classification of modulation signals. This validates the feasibility and stability of the entire modulation signal recognition scheme.

Result analysis

(1)
In a Gaussian channel with a SNR of 0 dB, the short time energy entropy ratio algorithm performs the best among the seven common modulation types, with an endpoint detection accuracy exceeding 93%. It outperforms the short time spectral entropy method, short time zero crossing rate method, and MFCC method. Additionally, the short time energy entropy ratio algorithm is adaptable to various modulation signals. When there is a change in the types of input modulation signals, the fluctuation range of the endpoint detection accuracy using the short-time energy entropy ratio algorithm is minimal, staying around 2%, which is superior to the other three algorithms.
(2)
In a Gaussian channel with a SNR greater than – 15 dB, the wavelet denoising algorithm is superior to the EMD denoising algorithm. When the SNR is – 20 dB, the wavelet denoising algorithm is optimal, achieving a correctness rate of 90%. In the SNR range of – 20 dB to – 10 dB, the ITD denoising algorithm performs poorly, possibly because ITD decomposition disrupts some characteristic parameters of the input signal, degrading the short time energy entropy ratio curve of the signal. In scenarios where the SNR is greater than – 20 dB, the combination of the wavelet denoising algorithm and the endpoint detection algorithm based on short time energy entropy ratio can achieve an endpoint detection correctness rate of 95%.

In a Gaussian channel, the EMD denoising algorithm takes the longest time, at 1.207 s, while the Wavelet denoising algorithm takes the shortest time, at 0.003 s.

(6)
In a Gaussian channel with a SNR of 5 dB, the overall recognition accuracy of the modulation recognition system combined with the endpoint detection module is basically consistent with the ideal signal recognition accuracy. The recognition accuracy fluctuates around 95% in both simulation tests. This indicates that the modulation signal segments obtained through endpoint detection technology do not undergo distortion and do not affect the subsequent recognition and classification of modulation signals. This validates the feasibility and stability of the entire modulation signal recognition scheme.

Discussion

This study analyzed the performance of four common endpoint detection techniques and found that the Short-Time Energy Entropy Ratio algorithm performs the best for detecting seven common modulation signals. In a Gaussian channel with a SNR of 0 dB, the correctness of endpoint detection is above 93%. Based on this, three different denoising algorithms were introduced to further enhance the performance of the Short Time Energy Entropy Ratio algorithm. The results indicate that the wavelet denoising algorithm achieves the greatest improvement in the performance of the Short Time Energy Entropy Ratio algorithm, with a short processing time. In a Gaussian channel with a SNR greater than -10dB, the endpoint detection correctness of this algorithm can be maintained at over 95%.

Furthermore, for accurate identification and differentiation between 2 and 4FSK signals, this study optimized the relevant algorithms in the cyclo-spectrum, using the kurtosis coefficient value Kur of the cyclo-spectrum parameter matrix at the cyclic frequency $\alpha = 0$ to distinguish these two signals. The results show that under a SNR of 4 dB, the proposed modulation recognition algorithm can effectively differentiate these two signals, with an identification correctness rate of over 99%. Finally, this study compared the endpoint-detected signals with ideal signals, validating the feasibility and stability of the entire proposed modulation signal recognition scheme.

As the signals used for modulation recognition in this study are relatively stable and the noise source is relatively singular (white Gaussian noise added), in real-world applications, signals undergo more complex and varied noise and interference. In future work, the modulation recognition algorithm in complex channel environments needs to be further researched.

Data availability

Part of the data generation process used in this study has been included in this published paper, and other data used in this study can be obtained from corresponding author according to reasonable requirements.

References

Ali, M. F. et al. Recent advances and future directions on underwater wireless communications. Arch. Comput. Methods Eng. 27(5), 1379–1412 (2020).
Article Google Scholar
Kim, K. et al. Editorial for special issue: Underwater acoustics, communications, and information processing. Appl. Sci. Basel 9(22), 4873 (2019).
Article Google Scholar
Ali, A. K. & Erçelebi, E. Algorithm for automatic recognition of PSK and QAM with unique classifier based on features and threshold levels. ISA Trans. 102, 173–192 (2020).
Article PubMed Google Scholar
Ali, A. K. & Erçelebi, E. An M-QAM signal modulation recognition algorithm in AWGN channel. Sci. Program. 1, 6752694 (2019).
Google Scholar
Habets, E. A. P. & Benesty, J. Multi-microphone noise reduction based on orthogonal noise signal decompositions. IEEE Trans. Audio Speech Lang. Process. 21(6), 1123–1133 (2013).
Article Google Scholar
Weaver, C., Cole, C. & Krumland, R. The automatic classifcation of modulation types by pattern recognition. Technical Reaport No.1829–2, Stanford Electronics Laboratories, Stanford University, Stanford, April 1969.
Panagiotou, P., Anastasopoulos, A. & Polydoros, A. Likelihood ratio tests for modulation classification. Milcom Century Military Communications Conference, Los Angeles, CA, USA (2000).
Polydoros, A. & Kim, K. On the detection and classification of quadrature digital modulations in broad-band noise. IEEE Trans. Commun. 38(8), 1199–1211 (1990).
Article Google Scholar
Shi, F. et al. Combining neural networks for modulation recognition. Digit. Signal Process. 120, 103264 (2022).
Article Google Scholar
Shi, Q. & Karasawa, Y. Automatic modulation identification based on the probability density function of signal phase. IEEE Trans. Commun. 60(4), 1033–1044 (2012).
Article Google Scholar
Zhu, D., Mathews, V. J. & Detienne, D. H. A likelihood-based algorithm for blind identification of QAM and PSK signals. IEEE Trans. Wirel. Commun. 17(5), 34173430 (2018).
Article Google Scholar
Chen, W. et al. A faster maximum-likelihood modulation classification in flat fading non-Gaussian channels. IEEE Commun. Lett. 23(3), 454–457 (2019).
Article Google Scholar
Shah, M. H. & Dang, X. An effective approach for low-complexity maximum likelihood based automatic modulation classification of STBC-MIMO systems. Front. Inf. Technol. Electron. Eng. 21(3), 465–475 (2020).
Article Google Scholar
Nandi, A. K. & Azzouz, E. E. Algorithms for automatic modulation recognition of communication signals. IEEE Trans. Commun. 46(4), 431–436 (1998).
Article Google Scholar
Fang, T. et al. Modulation mode recognition method of non-cooperative underwater acoustic communication signal based on spectral peak feature extraction and random forest. Remote Sens. 14(7), 1603 (2022).
Article ADS Google Scholar
Wang, M., Zhu, Z. & Qian, G. Modulation signal recognition of underwater acoustic communication based on archimedes optimization algorithm and random forest. Sensors 23(5), 2764 (2023).
Article ADS PubMed PubMed Central Google Scholar
Sahidullah, M. & Saha, G. A novel windowing technique for efficient computation of MFCC for speaker recognition. IEEE Signal Process. Lett. 20(2), 149–152 (2013).
Article ADS Google Scholar
Zheng, F., Zhang, G. & Song, Z. Comparison of different implementations of MFCC. J. Comput. Sci. Technol. 16, 582–589 (2001).
Article Google Scholar
Gupta, S., Jafreezal, J., Ahmad, W. W. & Bansal, A. Feature extraction using MFCC. Signal Image Process. Int. J. 4(4), 101–108 (2013).
Article Google Scholar
Oshea, T. J., Tamoghna, R. & Clancy, T. C. Over-the-air deep learning based radio signal classification. IEEE J. Sel. Topics Signal Process 12(1), 168–179 (2018).
Article ADS Google Scholar
Liu, Y., Liu, Y. & Yang, C. Modulation recognition with graph convolutional network. IEEE Wirel. Commun. Lett. 9(5), 624–627 (2020).
Article MathSciNet Google Scholar
Athira, S. et al. Automatic modulation classification using convolutional neural network. Int. J. Comput. Technol. Appl. 9(16), 7733–7742 (2016).
Google Scholar
Liao, K. et al. Sequential convolutional recurrent neural networks for fast automatic modulation classification. IEEE Access 9, 27182–27188 (2021).
Article Google Scholar
Wu, H. et al. Convolutional neural network and multi-feature fusion for automatic modulation classification. Electron. Lett. 55(16), 895–897 (2019).
Article ADS Google Scholar
West, N.E. & O’shea, T. Deep architectures for modulation recognition. In 2017 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN), Batltimore, MD, USA, 1–6 (2017)
Zhang, M., Zeng, Y. & Han, Z., et al. Automatic modulation recognition using deep learning architectures. In 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Kalamata, Greece 1–5 (2018)
Zhao, Z., Yang, A. & Guo, P. A modulation format identification method based on information entropy analysis of received optical communication signal. IEEE Access 7, 41492–41497 (2019).
Article Google Scholar
Avci, D. An intelligent system using adaptive wavelet entropy for automatic analog modulation identification. Digit. Signal Process. 20(4), 1196–1206 (2010).
Article ADS MathSciNet Google Scholar
Picone, J. W. Signal modeling techniques in speech recognition. Proc. IEEE 81(9), 1215–1247 (1993).
Article Google Scholar
Schroeder, M.R. Recognition of complex acoustic signals. Life Science Research Report, Bullock T H (ed.), Abakon Verlag, 55: 323–328 (1997).
Sardy, S., Tseng, P. & Bruce, A. Robust wavelet denoising. IEEE Trans. Signal Process. 49(6), 1146–1152 (2001).
Article ADS Google Scholar
Wu, Z. & Huang, N. E. A study of the characteristics of white noise using the empirical mode decomposition method. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 460(2046), 1597–1611 (2004).
Article ADS Google Scholar
Frei, M. G. & Osorio, I. Intrinsic time-scale decomposition: Time-frequency-energy analysis and real-time filtering of non-stationarysignals. Proc. R. Soc. A Math. Phys. Eng. Sci. 463, 321–342 (2006).
ADS Google Scholar
Hipp, J.E. Modulation classification based on statistical moments. In IEEE Military Communications Conference—Communications-Computers. 20(2):1–6 (1986)
Azzouz, E. E. & Nandi, A. K. Automatic Modulation Recognition of Communication Signals (Kluwer Academic Publishers, 1996).
Book Google Scholar
Azzouz, E. E. & Nandi, A. K. Automatic identification of digital modulation types. Signal Process. 47(1), 55–69 (1995).
Article ADS Google Scholar
Nandi, A. K. & Azzouz, E. E. Automatic analogue modulation recognition. Signal Process. 46(2), 211–222 (1995).
Article ADS Google Scholar

Download references

Author information

Authors and Affiliations

Hangzhou Institute of Computer External Equipment, Hangzhou, China
Li Xiuquan
School of Electronic Information, Hangzhou Dianzi University, Hangzhou, China
Wang Zhen, Jin Yeyin, Chen Jing & Li Zhenfei

Authors

Li Xiuquan
View author publications
Search author on:PubMed Google Scholar
Wang Zhen
View author publications
Search author on:PubMed Google Scholar
Jin Yeyin
View author publications
Search author on:PubMed Google Scholar
Chen Jing
View author publications
Search author on:PubMed Google Scholar
Li Zhenfei
View author publications
Search author on:PubMed Google Scholar

Contributions

The article’s general manager, L.X., is in charge of organizing the authors and designing the article. The method presented in this study was developed by J.Y. and W.Z., who also wrote the article and oversaw its later modification. C.J., L.Z. organized and analyzed the experimental data using literature research, and she offered insightful commentary on the article’s composition.

Corresponding author

Correspondence to Jin Yeyin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Xiuquan, L., Zhen, W., Yeyin, J. et al. Acoustic modulation signal recognition based on endpoint detection. Sci Rep 14, 19198 (2024). https://doi.org/10.1038/s41598-024-69934-y

Download citation

Received: 11 April 2024
Accepted: 12 August 2024
Published: 19 August 2024
DOI: https://doi.org/10.1038/s41598-024-69934-y

Subjects

Abstract

Similar content being viewed by others

Modulation recognition method of mixed signals based on cyclic spectrum projection

Deep learning-based phase demodulation for distributed acoustic sensor

Deep learning for enhancing automatic classification of M-PSK and M-QAM waveform signals dedicated to single-relay cooperative MIMO 5G systems

Introduction

Methodology

Modulation signal endpoint detection

Noise reduction methods for signals

Optimization of MFSK signal recognition algorithm

Overall approach of this study

Different feature parameters

Identification of feature parameter T 1 for OFDM signals

Distinguishing feature parameter T 2 for MASK signals from MFSK and MPSK signals

Distinguishing feature parameter T 3 for MFSK signals from MPSK signals

Identification of feature parameter T 4 for 2ASK and 4ASK signals

Identification of feature parameter T 5 for 2PSK and 4PSK signals

Optimization of MFSK signal recognition algorithm

Conclusions

Simulation experiment

Simulation experiment combined with endpoint detection

Result analysis

Discussion

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links

Identification of feature parameter T ₁ for OFDM signals

Distinguishing feature parameter T ₂ for MASK signals from MFSK and MPSK signals

Distinguishing feature parameter T ₃ for MFSK signals from MPSK signals

Identification of feature parameter T ₄ for 2ASK and 4ASK signals

Identification of feature parameter T ₅ for 2PSK and 4PSK signals