Abstract
Since electronic music is simpler to produce and distribute than analog music, the variety of musicals available worldwide has increased rapidly along with the music marketplace’s shift from analog to digital. Due to the abundance of available songs, people are discovering songs in various ways; one of them is by analyzing their emotional content. Not every age group can listen to the same music at all times. Deep learning techniques have yielded excellent results recently, marking a significant advance in NLP. However, there have been few attempts to use a deep learning model to sort out lyrics from improper music. Hence, a deep learning-based lyrics text classification process is presented in this proposal. Firstly, indispensable text data are fetched from the standard online resources and further, it is applied to the text pre-processing stage. After that, the resultant pre-processed text is subjected to the Serial Cascaded Hybrid Adaptive Deep Networks (SCHADNet) for classification purposes. The Transformer-based Bidirectional Long Short-Term Memory (Trans Bi-LSTM) is integrated with a Gated Recurrent Unit (GRU) for developing the model of SCHADNet, where the parameters of SCHADNet are optimally tuned by the Improved Marine Predators Algorithm (IMPA). Lastly, the classified outcome is accomplished from the SCHADNet. In order to enhance the classification performance, the developed model shows significant advancement by increasing the accuracy rate of 93.4%, 93.47% recall and 99.2% NPV, respectively. The numerical analysis is performed for the suggested lyrics text classification model over numerous classical text classification techniques to portray the effectiveness of the presented model.
Similar content being viewed by others
Data availability
Dataset 1: The data underlying this article are available in https://www.kaggle.com/datasets/mateibejan/multilingual-lyrics-for-genre-classification?select=train.csv. Access date: 2024-01-02. Dataset 2: The data underlying this article are available in https://github.com/wojtek11530/song_lyric_classification/tree/master/datasets. Access data: 2024-01-03.
References
Furner, M., Islam, M. Z. & Li, C. T. Knowledge discovery and visualisation framework using machine learning for music information retrieval from broadcast radio data. Expert Syst. Appl. 182, 115236 (2021).
Hizlisoy, S., Yildirim, S. & Tufekci, Z. Music emotion recognition using convolutional long short term memory deep neural networks. Eng. Sci. Technol. Int J. 24(3), 760–767 (2021).
Wang, C. & Ko, Y. C. Emotional representation of music in multi-source data by the internet of things and deep learning. J. Supercomput. 79(1), 349–366 (2023).
Jena, K. K., Bhoi, S. K., Mohapatra, S. & Bakshi, S. A hybrid deep learning approach for classification of music genres using wavelet and spectrogram analysis. Neural Comput. Appl. 35(1), 11223–11248 (2023).
Khattak, A., Asghar, M. Z., Khalid, H. A. & Ahmad, H. Emotion classification in poetry text using deep neural network. Multimed. Tools Appl. 81(18), 26223–26244 (2022).
Yang, L., Shen, Z., Zeng, J., Luo, X. & Lin, H. COSMIC: music emotion recognition combining structure analysis and modal interaction. Multimed. Tools Appl. 83 (5), 1–16 (2023).
Dong, L. Using deep learning and genetic algorithms for melody generation and optimization in music. Soft Comput. 27(1), 17419–17433 (2023).
Sarkar, R., Choudhury, S., Dutta, S., Roy, A. & Saha, S. K. Recognition of emotion in music based on deep convolutional neural network. Multimed. Tools Appl. 79, 765–783 (2020).
Policicchio, V. L., Pietramala, A. & Rullo, P. GAMoN: discovering M-of-N ¬,∨ hypotheses for text classification by a lattice-based genetic algorithm. Artif. Intell. 191, 61–95 (2012).
Dwiyani, L. K. D., Suarjaya, I. M. A. D. & Rusjayanthi, N. K. D. Classification of explicit songs based on lyrics using random forest algorithm. J. Inform. Syst. Inform. 5, 550–567 (2023).
Du, J. Sentiment analysis and lyrics theme recognition of music lyrics based on natural language processing. J. Electr. Syst. 20, 315–321 (2024).
Xie, C. et al. Music genre classification based on res-gated CNN and attention mechanism. Multimed. Tools Appl. 83(5), 13527–13542 (2024).
Jandaghian, M., Setayeshi, S., Razzazi, F. & Sharifi, A. Music emotion recognition based on a modified brain emotional learning model. Multimed. Tools Appl. 82(4), 26037–26061 (2023).
Rajan, R. & Nithin, S. K. Folk music structural segment classification using GRU-based hierarchical attention network. Sādhanā 48(4), 254 (2023).
Hongdan, W., SalmiJamali, S., Zhengping, C., Qiaojuan, S. & Le, R. An intelligent music genre analysis using feature extraction and classification using deep learning techniques. Comput. Electr. Eng. 100, 107978 (2022).
Sujeesha, A. S., Mala, J. B. & Rajan, R. Automatic music mood classification using multi-modal attention framework. Eng. Appl. Artif. Intell. 128, 107355 (2024).
da Silva, A. C. M., Coelho, M. A. N. & Neto, R. F. A music classification model based on metric learning applied to MP3 audio files. Expert Syst. Appl. 144, 113071 (2020).
Andreyan Rizky Baskara; Muti’a Maulida; Muhammad Tri Madya Lestiyanto; Yuslena Sari; Nurul Fathanah Mustamin; Eka Setya Wijaya, Explicit content classification in indonesian song lyrics using the LSTM-CNN method. 2024 Ninth International Conference on Informatics and Computing (ICIC) (2024).
Bonela, Abraham Albert, He, Zhen, Luxford, Dan-Anderson., Riordan, Benjamin & Kuntsche, Emmanuel. Development of the lyrics-based deep learning algorithm for identifying alcohol-related words (LYDIA). Alcohol Alcohol. 59, 2 (2024).
Bolla, B. K., Pattnaik, S. R. & Patra, S. Detection of objectionable song lyrics using weakly supervised learning and natural language processing techniques. Procedia Comput. Sci. 235, 1929–1942 (2024).
Syed Nawaz Pasha; Dadi Ramesh; Sallauddin Mohmmad; Shabana; D. Kothandaraman; T. Sravanthi, Song lyrics genre detection using RNN. AIP Conference Proceedings 2971(1) (2024).
Abdillah, J., Asror, I. & Wibowo, Y. F. A. Emotion classification of song lyrics using bidirectional lstm method with glove word representation weighting. J. RESTI (Rekayasa Sistem Dan Teknologi Informasi) 4(4), 723–729 (2020).
Revathy, V. R., Pillai, A. S. & Daneshfar, F. LyEmoBERT: classification of lyrics’ emotion and recommendation using a pre-trained model. Procedia Comput. Sci. 218, 1196–1208 (2023).
Jia, X. Music emotion classification method based on deep learning and improved attention mechanism. Comput. Intell. Neurosci. 2022, 5181899 (2022).
Chen, X. et al. A novel approach for explicit song lyrics detection using machine and deep ensemble learning models. PeerJ Comput. Sci. 9, e1469 (2023).
Li, Y., Zhang, Z., Ding, H. & Chang, L. Music genre classification based on fusing audio and lyric information. Multimed. Tools Appl. 82(13), 20157–20176 (2023).
Delbouys, R., Hennequin, R., Piccoli, F., Royo-Letelier, J. and Moussallam, M., Music mood detection based on audio and lyrics with deep neural net. arXiv preprint (2018).
F. Almeida do Carmo, J. L. Figueira da Silva Junior, R. Geraldeli Rossi and F. M. França Lobato, Text representations for lyric-based identification of musical subgenres. IEEE Latin America Transactions 21(6):737-744 (2023).
Tsaptsinos, A., Lyrics-based music genre classification using a hierarchical attention network. arXiv (2017).
Faramarzia, A., Heidarinejada, M., Mirjalili, S. & Gandomi, A. H. marine predators algorithm: a nature-inspired metaheuristic. Expert Syst. Appl. 152, 113377 (2020).
Ye, H. et al. Web services classification based on wide & Bi-LSTM model. IEEE Access 7, 43697–43706 (2019).
Naeem, A. et al. A novel combined densenet and gated recurrent unit approach to detect energy thefts in smart grids. IEEE Access 11, 59496–59510 (2023).
Sun, J., Han, P., Cheng, Z., Wu, E. & Wang, W. Transformer based multi-grained attention network for aspect-based sentiment analysis. IEEE Access 8, 211152–211163 (2020).
Alfarizi, M. I., Syafaah, L. & Lestandy, M. Emotional text classification using TF-IDF (Term frequency-inverse document frequency) And LSTM (Long short-term memory). J. Informatika 10, 2 (2022).
Ping Yu and XueBo Fu, Classification and identification of emotion of non-foreign music based on TR-Bi-LSTM emotion analysis. Researchsquare (2023).
Jia, C. et al. State of health prediction of lithium-ion batteries based on bidirectional gated recurrent unit and transformer. Energy 285, 129401 (2023).
Salim, A., Jummar, W. K., Jasim, F. M. & Yousif, M. Eurasian oystercatcher optimiser: new meta-heuristic algorithm. J. Intell. Syst. 31(1), 332–344 (2022).
Azizi, M., Aickelin, U., Khorshidi, H. A. & Baghalzadeh Shishehgarkhaneh, M. Energy valley optimizer: a novel metaheuristic algorithm for global and engineering optimization. Sci. Rep. 13, 226 (2023).
Askari, Q., Younas, I. & Saeed, M. Political optimizer: a novel socio-inspired meta-heuristic for global optimization. Knowl.-based Syst. 195, 105709 (2020).
Pengxu Wang, Electronic archive classification method based on convolutional neural network with fast text embeddings, 2024 4th International Conference on Mobile Networks and Wireless Communications (ICMNWC) (2024).
Najla Al-shathry, Badria Al-onazi, Abdulkhaleq Q A Hassan, Shoayee Alotaibi, Saud Alotaibi, Faiz Alotaibi, Mohammed Elbes, Mrim Alnfiai, Leveraging hybrid adaptive sine cosine algorithm with deep learning for arabic poem meter detection ACM Transactions on Asian and Low-Resource Language Information Processing (2024).
Eswaraiah, P. & Hussain, S. A hybrid deep learning GRU based approach for text classification using Word embedding. EAI Endorsed Trans. Internet Things 10, 1 (2023).
Rahayu, S. P., Afuan, L. & Yunindar, G. A. Implementation of text mining on song lyrics for song classification based on emotion using website-based logistic regression. J. Teknik Informatika (Jutif) 6(1), 359–368 (2025).
Mehra, Ashman, Mehra, Aryan & Narang, Pratik. Classification and study of music genres with multimodal Spectro-Lyrical Embeddings for Music (SLEM). Multimed. Tools Appl. 84(7), 3701–3721 (2025).
Acknowledgements
I would like to express my very great appreciation to the co-authors of this manuscript for their valuable and constructive suggestions during the planning and development of this research work.
Author information
Authors and Affiliations
Contributions
All authors have made substantial contributions to conception and design, revising the manuscript, and the final approval of the version to be published. Also, all authors agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
Corresponding author
Ethics declarations
Competing interest
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Jasmine, R.L., Mukherjee, S., Robin, C.R.R. et al. Serial cascaded hybrid adaptive deep networks-based lyrics text classification using optimization approach. Sci Rep (2026). https://doi.org/10.1038/s41598-026-38813-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-38813-z


