Abstract
Seed quality standards are the essential basis for crop cultivation supervision. With the continuous development of China’s standard system, the number of seed quality standard documents has increased dramatically. However, the rapid growth and unstructured nature of standard documents hinder efficient query and semantic association. To address the lack of structured knowledge representation in the seed domain, this study proposes a Knowledge Graph (KG) construction framework for seed quality standards. First, a domain-specific ontology is constructed, defining 7 core classes and 12 relationship types to standardize semantic structure. Second, a hybrid knowledge extraction strategy is implemented: regular expressions are used for semi-structured tabular data, while a BERT-BiLSTM-CRF model is employed for unstructured text. Experimental results demonstrate that the proposed model achieves an F1-score of 91.61% in Named Entity Recognition (NER), outperforming than other model. Finally, a KG containing 2436 nodes and 3011 relationships is stored in Neo4j, enabling multi-dimensional retrieval and visualization. The proposed framework significantly improves the accuracy of standard information retrieval and provides a digital foundation for intelligent quality management in the plantation industry.S
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author, Qiong He, upon reasonable request.
References
Niu, S. et al. Research on a lightweight method for maize seed quality detection based on improved YOLOv8. IEEE Access 12, 32927–32937 (2024).
Gang, Y., Chen, C. & Weiyue, W. Research on the protection and utilization of agricultural germplasm resources under the action of seed industry revitalization—Taking Yangzhou City, Jiangsu Province as an Example. Jiangsu Agric. Sci. 52, 20–27 (2024).
Maredia, M. K. & Bartle, B. Excess demand amid quality misperceptions: the case for low-cost seed quality signalling strategies. Eur. Rev. Agric. Econ. 50, 360–394 (2023).
Jeon, K. et al. A relational framework for smart information delivery manual (IDM) specifications. Adv. Eng. Inform. 49, 101319 (2021).
Liu, Q., Li, Y., Duan, H., Liu, Y. & Qin, Z. Knowledge graph construction techniques. J. Comput. Res. Dev. (2016).
Peng, C., Xia, F., Naseriparsa, M. & Osborne, F. Knowledge graphs: Opportunities and challenges. Artif. Intell. Rev. 56, 13071–13102 (2023).
Li, X. et al. exploration and practice of standard digitalization application in the aviation industry. In Information Technology and Standardization. 68–72+78 (2022).
Fu, Z. & Qiang, L. Constructing ontologies by mining deep semantics from XML schemas and XML instance documents. Int. J. Intell. Syst. 37, 661–698 (2021).
Hu, D., Weng, C., Wang, R., Song, X. & Qin, L. Construction Method of National Food Safety Standard Ontology. in Green, Pervasive, and Cloud Computing, GPC 2022 (eds Yu, C., Zhou, J., Song, X. & Lu, X.) vol. 13744 50–66 (Springer International Publishing Ag, Cham, 2023).
Fan, Y., Sun, Y., Mi, B. & Fu, X. Aircraft fault knowledge graph construction based on large language model incorporating Chinese airworthiness knowledge. Int. J. Softw. Eng. Knowl. Eng. 1, 2. https://doi.org/10.1142/S0218194025500962 (2025).
da Silva, A. R. On testing for seed sample heterogeneity with the exact probability distribution of the germination count range. Seed Sci. Res. 30, 59–63 (2020).
Cui, M. et al. Semantic rule-based information extraction for meteorological reports. Int. J. Mach. Learn. Cybern. 15, 177–188 (2024).
Nastou, K., Koutrouli, M., Pyysalo, S. & Jensen, L. J. Improving dictionary-based named entity recognition with deep learning. Bioinformatics (Oxford, England) 40, ii45–ii52 (2024).
Keraghel, I., Morbieu, S. & Nadif, M. Recent advances in named entity recognition: A comprehensive survey and comparative study (2024).
Zhu, Y. A knowledge graph and BiLSTM-CRF-enabled intelligent adaptive learning model and its potential application. Alex. Eng. J. 91, 305–320 (2024).
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. In North American Chapter of the Association for Computational Linguistics (2019).
Li, L., Haruna, A., Ying, W., Noman, K. & Li, Y. Knowledge graph-driven fault diagnosis for aviation equipment: Integrating improved joint extraction with large language model. J. Ind. Inf. Integr. 50, 101039 (2026).
Li, J., Cheng, X., Zhao, X., Nie, J.-Y. & Wen, J.-R. HaluEval: A large-scale hallucination evaluation benchmark for large language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (eds Bouamor, H., Pino, J. & Bali, K.) 6449–6464 (Association for Computational Linguistics, Singapore, 2023). https://doi.org/10.18653/v1/2023.emnlp-main.397.
Kelly, Y., O’Rourke, N., Flynn, R., Hegarty, J. & O’Connor, L. Definitions of health and social care standards used internationally: A narrative review. Int. J. Health Plan. Manag. 38, 40–52 (2023).
Gao, S., Ren, G. & Li, H. Knowledge Management in Construction Health and Safety Based on Ontology Modeling. Appl. Sci. Basel 12, 8574 (2022).
Zhou, D. et al. Ontology Reshaping for Knowledge Graph Construction: Applied on Bosch Welding Case. In The Semantic Web—ISWC 2022 770–790 (Springer, Cham, 2022). https://doi.org/10.1007/978-3-031-19433-7_44.
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint at https://doi.org/10.48550/arXiv.1810.04805 (2019).
Xu, S., Zhang, C. & Hong, D. BERT-based NLP techniques for classification and severity modeling in basic warranty data study. Insur. Math. Econ. 107, 57–67 (2022).
Huang, Z., Xu, W. & Yu, K. Bidirectional LSTM-CRF models for sequence tagging. Preprint at https://doi.org/10.48550/arXiv.1508.01991 (2015).
Monteiro, J., Sa, F. & Bernardino, J. Experimental evaluation of graph databases: JanusGraph, Nebula Graph, Neo4j, and TigerGraph. Appl. Sci. Basel 13, 5770 (2023).
Acknowledgements
The authors would like to thank the Beijing Knowledge Management Research Base for their assistance with the study.
Funding
The research is funded by the Beijing Municipal Education Commission Research Plan General Project (Grant Number: KM202411232007).
Author information
Authors and Affiliations
Contributions
Qiong He: Writing—review and editing, supervision, funding acquisition. Zhenwei Yang: Writing—original draft, data collection, data curation, software, visualization, conceptualization. Jian Zhang: Writing—original draft, Software, methodology, validation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Yang, Z., He, Q. & Zhang, J. Construction and application of knowledge graph for seed quality standard documents. Sci Rep (2026). https://doi.org/10.1038/s41598-026-37084-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-37084-y


