Abstract
Seed quality standards are the essential basis for crop cultivation supervision. With the continuous development of China’s standard system, the number of seed quality standard documents has increased dramatically. However, the rapid growth and unstructured nature of standard documents hinder efficient query and semantic association. To address the lack of structured knowledge representation in the seed domain, this study proposes a Knowledge Graph (KG) construction framework for seed quality standards. First, a domain-specific ontology is constructed, defining 7 core classes and 12 relationship types to standardize semantic structure. Second, a hybrid knowledge extraction strategy is implemented: regular expressions are used for tabular and semi-structured data, while a BERT-BiLSTM-CRF model is employed for unstructured text. Experimental results demonstrate that the proposed model achieves an F1-score of 91.61% in Named Entity Recognition (NER), outperforming comparable baseline models. Finally, a KG containing 2436 nodes and 3011 relationships is stored in Neo4j, enabling multi-dimensional retrieval and visualization. The proposed framework significantly improves the accuracy of standard information retrieval and provides a digital foundation for intelligent quality management in the plantation industry.
Introduction
As the foundation of agricultural production1,2, seed quality is the primary guarantee for crop yield and food security. It also significantly affects farmer livelihoods and production motivation, making its standardized management a top priority for modern agriculture3. The supervision of seed quality relies heavily on standard documents, which define technical requirements, inspection rules, and testing methods. With the rapid evolution of agricultural technologies, the volume of seed quality standards has increased dramatically, updating at an accelerated pace. The International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC) have proposed the concept of SMART standards (Standards Machine Applicable, Readable, and Transferable), which aims to convert current standards documents that can only be read and used by humans into actionable knowledge4. However, in China, these critical standards currently exist predominantly as unstructured paper or image files. This format renders conventional keyword-based retrieval ineffective for capturing semantic relationships, thus failing to accommodate rapid information updates or complex queries, and unable to provide users with the required traceability and interconnected insights.
The primary challenge in digitizing these standards lies in the heterogeneity of the data. A typical seed quality standard document consists of three types of information: (1) Structured tables; (2) Semi-structured metadata; and (3) Unstructured textual clauses. Existing digitization approaches often struggle to handle this complexity simultaneously. Rule-based methods are effective for structured metadata but lack generalization for complex text3. Conversely, while emerging Large Language Models (LLMs) show promise in information extraction, they suffer from “hallucinations” and lack the strict interpretability required for normative standard documents4. To address these challenges, Knowledge Graphs (KG) offer a promising solution by structuring multi-source heterogeneous data into a semantic network of entities and relationships5,6. Knowledge graphs have been extensively applied to accelerate digital transformation in domains such as emergency response and aerospace7. However, the seed quality standard domain remains underexplored, lacking a dedicated ontology and a robust, integrated extraction framework capable of handling its heterogeneous data characteristics.
To address the limitations of traditional manual consultation and to advance the digitization and structural transformation of standard documents, this study targets seed quality standard files and proposes a hybrid knowledge extraction framework that integrates rule-based and learning-based techniques. This framework adopts a cooperative, task-specific strategy tailored to the coexisting data types within the documents: structured tables, semi-structured metadata, and unstructured text. Precisely defined regular expressions are employed to ensure precision extraction from tables and semi-structured data. Concurrently, a BERT-BiLSTM-CRF model is leveraged to capture complex semantic relationships and domain-specific terminology from the unstructured textual content. This hybrid approach capitalizes on the complementary strengths of each method: regular expressions guarantee the reliability and consistency of critical metadata; the BERT layer enhances contextual understanding of agricultural terms; the BiLSTM layer effectively models long-range dependencies within regulatory clauses; and the CRF layer ensures the global optimality of entity label sequences. Together, they significantly improve the accuracy, robustness, and completeness of the knowledge extraction process. Based on the extracted knowledge, we construct a queryable and extensible knowledge graph, aiming to enable faster, more accurate information retrieval and more intuitive analysis of content relationships compared to conventional methods.
This study makes three primary contributions. First, it constructs a formal ontology for seed quality standards, comprising seven core classes and twelve relationship types, which provides a standardized schema for unifying heterogeneous standard data. Second, it designs a hybrid knowledge extraction framework. For unstructured text, information is extracted using a BERT-BiLSTM-CRF model, which achieves an F1-score of 91.61% and significantly outperforms conventional models in extracting entities such as inspection rules and drafting units. Finally, a knowledge graph containing 2,436 nodes and 3,011 relationships is built and stored in a Neo4j database. Its practical utility is demonstrated through multi-dimensional retrieval scenarios, showing that it enables more precise positioning of technical specifications compared to traditional document search methods.
Literature review
Digitization of standards and ontology construction
The transition from human-readable to machine-processable standards is a cornerstone of modern industrial and regulatory digital transformation. The concept of SMART Standards, advocated by ISO and IEC, has catalyzed research into converting conventional documents into structured, queryable knowledge bases.
Early research primarily focused on converting PDF documents into XML or structured formats using rule-based parsing. For instance, Fu and Qiang8 proposed an automated ontology construction approach that mines deep semantics from XML Schemas and instance documents through a rule-based intermediate model, enabling both conceptual-level generation and instance-level population. In the domain of food safety, Hu et al.9 proposed a food safety standard ontology to map regulatory constraints. More recent approaches leverage Natural Language Processing (NLP) and KG technologies to extract and link entities and relationships from heterogeneous standard texts. Fan et al.10 addressed the complexity and heterogeneity of airworthiness directive texts by proposing a domain knowledge-integrated large language model fine-tuning approach. Utilizing parameter-efficient adaptation techniques and prompt template enhancement, this method enables precise extraction and structuring of aviation fault knowledge, effectively supporting the construction of fault knowledge graphs and the development of intelligent maintenance management systems. This shift enables not only efficient retrieval but also advanced applications such as compliance checking, automated auditing, and intelligent decision support.
However, constructing an ontology for seed quality standards presents unique challenges. Unlike general industrial standards, seed standards contain highly heterogeneous data: rigorous numerical limits co-exist with descriptive inspection rules11. Previous ontologies often lack the granularity to represent the “Crop-Quality Indicator-Limit” ternary relationship effectively, which is crucial for the agricultural domain.
Named entity recognition in vertical domains
Knowledge extraction, particularly Named Entity Recognition (NER), is the core step in KG construction. Standard documents are inherently multi-modal, comprising structured tables, semi-structured metadata, and unstructured descriptive text.
Early works relied on dictionaries and regular expressions. Based on semantic rules and a multi-layer Conditional Random Fields model, Cui et al.12 effectively addressed the challenge of nested entity recognition in meteorological reports, providing crucial technical support for the automated construction of meteorological knowledge graphs. While these methods achieve high precision for structured metadata, they suffer from low recall when processing unstructured text with complex sentence structures13.
Statistical and deep learning models have become the de facto standard for NER from unstructured text, capturing contextual nuances and domain-specific semantics14. The advent of deep learning has shifted the paradigm towards sequence labeling models. The BiLSTM-CRF architecture15 became a baseline for many years. With the introduction of pre-trained language models, BERT (Bidirectional Encoder Representations from Transformers) has significantly improved performance by capturing context-dependent semantic representations16.
Recently, Large Language Models (LLMs) like GPT-4 have shown impressive zero-shot extraction capabilities. Li et al.17 proposed a method that integrates an enhanced joint extraction model with large language models, which significantly improves the accuracy and intelligence of fault diagnosis by constructing a high-quality knowledge base and enabling intelligent question-answering. However, for the standards domain, LLMs face challenges regarding “hallucination” (generating non-existent regulations) and lack of reproducibility18. In addition, learning-based models require substantial annotated data and may underperform on highly formulaic or tabular content, where rules are more efficient and reliable.
A hybrid framework that strategically combines rules for structured/semi-structured data and deep learning for unstructured text can therefore offer a balanced solution, maximizing both precision and recall while ensuring scalability. To the best of our knowledge, such a hybrid, multi-level extraction pipeline has not been systematically designed and evaluated for the domain of seed quality standards, where the integration of tabular quality indicators, metadata, and technical descriptions is critical.
Comparative analysis
To position our work within the current research landscape, we compare our proposed framework with representative studies from the past 5 years in Table 1.
As shown in the comparison, most existing works rely on a single extraction modality. Few address the hybrid nature of standard documents. Existing research exhibits three main gaps when considering the digitization of seed quality standards.
First, lack of integrated hybrid frameworks. Prior works typically employ a single extraction paradigm, failing to leverage the complementary strengths of rule-based and learning-based methods for the multi-format data inherent in standards.
Second, absence of formal, reusable ontologies for seed quality. While domain ontologies exist in food safety and aviation, seed quality standards lack a dedicated ontology to represent core entities and relationships. Previous ontologies lack granularity for agricultural-specific ternary relationships, hindering consistent knowledge representation.
Third, insufficient quantitative evaluation and reproducibility. Many studies present qualitative demonstrations or limited metrics, and few share code, data, or configurations. This limits independent verification—a critical requirement for scientific progress.
Our work distinguishes itself by integrating a hybrid extraction strategy and constructing a domain-specific ontology for seed quality.
Methodology
Data processing
Currently, knowledge graphs in the standards field are mainly built from existing standard documents, issued regulations, normative documents, and similar sources, and no publicly available dataset exists for the field of seed quality standards. This paper collects seed-oriented national standard documents as a data source from multiple standard websites, such as the National Standard Information Public Service Platform, the National Standardization Administration Committee, and the Industry Standard Information Service Platform. These documents contain the basic information of the standards; stipulate the terminology, quality requirements, test methods, and test rules in the field of seeds; divide seed categories according to crop type; and establish distinct quality standards for each category.
The study stored the seed-related standard document data obtained from the various platforms in PDF format. First, the tabular data in the documents are extracted, and necessary adjustments are made to ensure that the format is compatible with the graph database. The documents are then converted to TXT format, and text processing techniques are applied to segment the contents into clauses. At the same time, the already-processed tabular parts are removed, laying the foundation for the knowledge extraction work.
Since the seed quality standard domain lacks large amounts of labeled data, the raw text data were collected and organized in advance to ensure that the model obtains sufficient training material; the sorted data were then cleaned to remove invalid punctuation and characters, and finally annotated. All text data were labeled with the “BIO” scheme: non-entity characters were uniformly labeled “O”, the first character of each entity was labeled “B-entity name”, and the remaining characters of the entity were labeled “I-entity name”, yielding 1,530 labeled items in total.
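As a minimal illustration of the BIO scheme just described, the sketch below assigns one tag per character given entity spans; the helper `tag_bio` and the example span are ours for demonstration, not the authors' annotation tooling.

```python
# Illustrative BIO tagger: one tag per character, "O" for non-entity
# characters, "B-<label>" for the first character of an entity, and
# "I-<label>" for its remaining characters.

def tag_bio(text, entities):
    """`entities` is a list of (start, end, label) spans, end-exclusive."""
    tags = ["O"] * len(text)
    for start, end, label in entities:
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags

# Example: label "maize" (characters 0-4) as a hypothetical Crop entity.
tags = tag_bio("maize seed", [(0, 5, "Crop")])
```

The same per-character convention applies to the Chinese standard clauses, where each character (rather than each word) receives a tag.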
Seed quality standard document characteristics and ontology construction
A standard is a normative document developed by negotiation and approved by a recognized body for common use to achieve the best order within a specific scope19. By hierarchy, standards can be divided into international standards, regional standards, national standards, industry standards, local standards, and enterprise standards. Standard documents differ from general text in content, form, and scope of application: they follow a specific layout format and drafting rules, with neatly structured content, timeliness, and accuracy. With the rapid development of science and technology, standard documents will continue to be formulated or revised; as normative documents, their content must remain professional, precise, and standardized.
Seed quality standard documents cover seed quality grading, quality requirements, test methods, test rules, and other aspects, involving many seed types across cash crops, grain crops, vegetables, and melons, which are diverse and complex. Therefore, to build the ontology structure of seed quality standard documents, it is necessary to define a complete and universal set of ontology concepts from the common elements of the documents.
Ontology construction20,21 is one of the core tasks of knowledge graph entity-relationship extraction; it defines the types of things in the standard documents and describes their properties. The construction of the seed quality standard document ontology is considered from two aspects: (1) Standardized document structure. Standard documents have a specific layout format and drafting rules, from which the essential information in their core elements can be filtered, including the relationships between the document and other documents, proposing information, attribution information, drafting units, and principal drafters. (2) Content commonality of the documents. According to the content shared between seed quality standard documents, the key contents related to seed quality can be extracted, including the quality requirements of seeds, inspection methods, and inspection rules.
To address the heterogeneity of seed quality standard documents and enable machine-readable knowledge representation, we constructed a domain-specific ontology following the SMART principles and competency questions (CQs) derived from user needs. Based on the SMART standard requirements and the competency questions, we defined 7 core classes that cover all critical elements of seed quality standards. Table 2 details their definitions, descriptions, and examples.
The core classes in Table 2 form the static foundation of the ontology, but their true value lies in the semantic relationships that connect them. These relationships are the dynamic soul of the ontology, enabling it to represent complex semantic information from seed quality standards. To visualize these connections, we constructed an Entity-Relationship (ER) Diagram (Fig. 1), which maps the core classes to their attributes and semantic relationships. This schema ensures that our knowledge extraction and graph construction are semantically consistent and goal-oriented.
Seed quality standards document ontology architecture. Note A square represents an entity type, a key header represents a relationship type, and an oval represents an attribute.
Hybrid knowledge extraction framework
The data of the seed quality standard document contains table-based structured data, text-based semi-structured data, and unstructured data. The knowledge extraction process for seed quality standard documents is shown in Fig. 2. Different extraction methods can be used to improve the extraction efficiency according to the data characteristics.
Seed quality standard file knowledge extraction process.
Structured data extraction
Table-based structured data. Tabular data, as part of structured data, can be obtained directly from standard documents through conversion. To address the lack of formal definitions, we define the seed quality knowledge graph as a directed graph \(G = \left( {E,R,A} \right)\), where:
- \(E = \left\{ {e_{1} ,e_{2} , \ldots ,e_{n} } \right\}\) denotes the set of entities (nodes).
- \(R = \left\{ {r_{1} ,r_{2} , \ldots ,r_{m} } \right\}\) denotes the set of relations (edges).
- \(A\) denotes the set of attribute values associated with entities.
By defining data nodes (entities) and edges (relationships), a knowledge graph can be constructed and visualized.
Seed quality standard documents contain structured data in the form of tables, whose content mainly includes the crop name, seed type, purity, germination rate, moisture, and other specific requirements. After converting the original document format and fine-tuning the layout, this data can be obtained directly from the tables. Once the entities and relationships are defined, the table content can be extracted with the pandas library; the nodes and edges are then written into the Neo4j database through a Python connection to build the knowledge graph. The collated table data is shown in Table 3.
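The table-to-graph step described above can be sketched as follows. The row fields, node labels, and the `HAS_INDICATOR` relationship name are illustrative assumptions rather than the actual schema of Table 3, and no live Neo4j connection is made here.

```python
# Sketch: turn collated table rows into Cypher MERGE statements for Neo4j.
# The fields and the HAS_INDICATOR relationship are hypothetical examples.

rows = [
    {"crop": "Maize", "indicator": "Germination rate", "limit": ">=85%"},
    {"crop": "Maize", "indicator": "Moisture", "limit": "<=13.0%"},
]

def row_to_cypher(row):
    # MERGE keeps nodes unique when the same crop appears in several rows.
    return (
        f"MERGE (c:Crop {{name: '{row['crop']}'}}) "
        f"MERGE (i:Indicator {{name: '{row['indicator']}'}}) "
        f"MERGE (c)-[:HAS_INDICATOR {{limit: '{row['limit']}'}}]->(i)"
    )

statements = [row_to_cypher(r) for r in rows]
# Each statement could then be executed with the official neo4j driver,
# e.g. session.run(stmt), against a running database.
```

In practice the rows would come from a pandas DataFrame loaded from the converted tables, as described in the text.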
Semi-structured data extraction
Text-based semi-structured data. The basic information of the seed quality standard document is written in a rigorous, standardized, and logical manner; it has specific structural characteristics and belongs to semi-structured data. This kind of data mainly describes and supplements the seed quality standard document and provides detailed information about the standard, so it is treated as the attribute information of the document. Owing to its prominent structural characteristics, regular expressions can be used to complete the extraction task. Regular expressions are text patterns used to describe and match specific patterns in strings. They consist of ordinary characters (e.g., letters and numbers) and special characters (called metacharacters), and can perform a variety of complex text-processing tasks, such as searching, replacing, validating, and extracting textual data. They are flexible, logical, and powerful, allowing complex string manipulation to be achieved quickly and straightforwardly.
Semi-structured data mainly comprises attribute classes: the basic information of the standard, including the name of the standard, the date of release, the date of implementation, the scope of application, the scope of provisions, and so on; its categories and definitions are shown in Table 2. Owing to its standardized structure and rigorous writing logic, it has specific structural characteristics and can be regarded as text-based semi-structured data. Therefore, regular expressions are used to extract this part of the data; the extraction rules are shown in Table 4.
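As a hedged illustration of this regex-based extraction, the sketch below pulls a standard code and two dates from a sample sentence; the patterns and the sample text are invented for demonstration and are not the rules of Table 4.

```python
import re

# Illustrative metadata extraction with regular expressions.
# Patterns and sample text are assumptions, not the rules from Table 4.
text = "GB 4404.1-2008, released 2008-01-22, implemented 2008-09-01."

patterns = {
    "standard_code": r"GB\s?[\d.]+-\d{4}",             # national standard code
    "release_date":  r"released\s(\d{4}-\d{2}-\d{2})",
    "impl_date":     r"implemented\s(\d{4}-\d{2}-\d{2})",
}

def extract_metadata(text, patterns):
    out = {}
    for field, pat in patterns.items():
        m = re.search(pat, text)
        if m:
            # use the capture group when present, else the whole match
            out[field] = m.group(1) if m.groups() else m.group(0)
    return out

meta = extract_metadata(text, patterns)
```

For the actual Chinese standard documents, the patterns would match the corresponding Chinese field markers instead of English keywords.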
Unstructured data extraction
The body of the standard document does not follow a predefined data model; its content takes various forms, lacks a fixed format and structure, and is therefore unstructured data that cannot be extracted by the two methods above. Machine learning methods can automatically discover patterns and relationships in the data by learning from annotated samples, thereby achieving effective extraction of unstructured data. Accordingly, a machine learning model is chosen for the knowledge extraction task on seed quality standard documents. The BERT-BiLSTM-CRF model combines the powerful contextual semantic understanding of BERT, which better captures the meanings of Chinese characters; the ability of BiLSTM to handle long-distance sequential dependencies, so that information is not lost even in long passages; and a CRF layer that ensures annotation consistency and a globally optimal label sequence. Together they form a robust information extraction framework that improves the accuracy and consistency of unstructured data extraction tasks.
The content of seed quality standard documents, apart from the data discussed above, is unstructured. Structured and semi-structured data usually have a clear format and semantic rules, while unstructured data has no fixed format or rules and is difficult to utilize directly. To improve the efficiency of information extraction from unstructured data, this paper adopts the BERT-BiLSTM-CRF model for relational extraction. Before fine-tuning the model, part of the data must be manually annotated, and the extracted information is stored and represented in triple format (entity, relationship, entity). The entity labeling rules and the specific triple forms are shown in Tables 5 and 6, respectively.
The model combines the contextual understanding capability of pre-trained language models, the long-range dependency capture capability of sequence modeling, and the globally optimal annotation capability of conditional random fields. It can identify the relationships between entities while extracting them.
BERT-BiLSTM-CRF named entity recognition model
To address the task of entity extraction from unstructured seed quality standard documents, this study adopts the BERT-BiLSTM-CRF model, a widely used architecture for sequence labeling tasks in NLP. The model integrates contextualized semantic understanding, sequential dependency modeling, and global label constraint decoding, enabling accurate recognition of domain-specific entities. This section details its architecture, training configuration, and functionality.
Model architecture
The BERT-BiLSTM-CRF model consists of three sequential modules (Fig. 3), designed to progressively transform raw text into structured entity labels.
Model frame diagram of BERT-BiLSTM-CRF.
The input standard clauses are first tokenized and fed into a pre-trained BERT model, which captures bidirectional contextual information via multi-head self-attention22,23. This pre-trained model (12 layers, 768 hidden units) converts each token into a high-dimensional embedding that encodes semantic nuances. For example, in the sentence “The quality of secondary hybrid seeds meets the first-grade standard”, BERT contextualizes “secondary hybrid seeds” as a Crop entity and “first-grade standard” as a Standard entity, even when these terms appear in complex syntactic structures.
The BERT-generated embeddings are then passed to a BiLSTM network24 with a hidden size of 128. Unlike unidirectional LSTMs, BiLSTM processes the text in both forward (left-to-right) and backward (right-to-left) directions, capturing long-range dependencies between entities. This module enhances the model’s ability to recognize entities that span multiple tokens or appear in non-contiguous positions.
Finally, a CRF layer is applied to the BiLSTM outputs to decode the optimal entity label sequence. Unlike standalone BiLSTM which predicts labels independently for each token, CRF enforces global constraints on label transitions. For instance, in the sentence “Maize seeds must meet GB 4404.1-2021”, CRF ensures “Maize” (Crop) and “GB 4404.1-2021” (Standard) are labeled as separate, sequentially coherent entities, avoiding invalid label sequences.
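The CRF layer's global decoding can be illustrated with a minimal Viterbi sketch that forbids invalid BIO transitions (e.g., "I-Crop" may only follow "B-Crop" or "I-Crop"). The label set and emission scores below are toy values, not outputs of the trained model, and the transition rule stands in for the learned CRF transition matrix.

```python
# Minimal Viterbi decoder showing how a CRF enforces valid BIO sequences.

LABELS = ["O", "B-Crop", "I-Crop"]
NEG_INF = float("-inf")

def allowed(prev, cur):
    # "I-X" is only valid directly after "B-X" or "I-X".
    if cur.startswith("I-"):
        return prev in (f"B-{cur[2:]}", cur)
    return True

def viterbi(emissions):
    """emissions: list of {label: score} dicts, one per token."""
    # Treat the sentence start as preceded by "O", so "I-*" cannot begin.
    best = {
        lab: ((emissions[0][lab], [lab]) if allowed("O", lab)
              else (NEG_INF, [lab]))
        for lab in LABELS
    }
    for scores in emissions[1:]:
        nxt = {}
        for cur in LABELS:
            cands = [
                (s + scores[cur], path + [cur])
                for lab, (s, path) in best.items()
                if allowed(lab, cur)
            ]
            nxt[cur] = max(cands)
        best = nxt
    return max(best.values())[1]

# Although "I-Crop" has the highest raw score at token 0, the transition
# constraint forces a sequence that starts with "O" or "B-Crop".
emissions = [
    {"O": 0.1, "B-Crop": 0.5, "I-Crop": 0.9},
    {"O": 0.2, "B-Crop": 0.1, "I-Crop": 0.8},
]
path = viterbi(emissions)
```

A standalone BiLSTM picking the per-token argmax would output the invalid sequence ["I-Crop", "I-Crop"]; the constrained decode instead yields a well-formed entity.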
Training configuration
To ensure reproducibility, the model was trained with the following configurations:
- Optimizer: AdamW with weight decay (λ = 0.01) to prevent overfitting.
- Learning Rates: 2 × 10−5 for the BERT pre-trained layers and 1 × 10−3 for the BiLSTM-CRF layers.
- Batch Size & Sequence Length: 16 samples per batch, with a maximum sequence length of 256 tokens.
- Training Dynamics: 20 epochs with early stopping (patience = 3) to avoid overfitting; validation loss monitored on a held-out dataset (20% of total annotations).
- Hardware & Framework: Experiments conducted on an NVIDIA RTX 3090 GPU using PyTorch 1.12.0.
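The early-stopping rule above (patience = 3 on validation loss) can be sketched as a small helper; `stopping_epoch` is an illustrative function of ours, not the authors' training code.

```python
# Early stopping with patience = 3: stop once validation loss has failed
# to improve for 3 consecutive epochs.

def stopping_epoch(val_losses, patience=3):
    """Return the 1-based epoch at which training stops, or None."""
    best = float("inf")
    bad_epochs = 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, bad_epochs = loss, 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                return epoch
    return None

# Loss improves for three epochs, then fails to improve for three
# consecutive epochs, so training stops at epoch 6.
losses = [0.9, 0.7, 0.6, 0.65, 0.64, 0.66]
```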
Functionality in seed standard processing
The trained BERT-BiLSTM-CRF model can automatically extract critical information from a large amount of unstructured text, significantly improving the efficiency and accuracy of information extraction. Manual extraction of information is not only time-consuming and labor-intensive but also prone to errors, and the automated processing of the model can significantly reduce the manual workload. For the currently existing unstructured seed quality standard documents that have not yet been processed, the trained model can be immediately put into use to extract essential information from them automatically. As time goes by, new seed quality standard documents will be released, which are also primarily unstructured, and the relevant information can also be extracted and updated by the model to achieve dynamic updating and maintenance of the data and to reduce human errors and delays in operation.
Results and evaluation
Model performance
To objectively evaluate the performance of the proposed knowledge extraction model, we adopted standard evaluation metrics: Precision (P), Recall (R), and F1-score (F1). The calculation formulas are as follows:

\(P = \frac{TP}{TP + FP}\), \(R = \frac{TP}{TP + FN}\), \(F1 = \frac{2 \times P \times R}{P + R}\)

where TP represents true positives, FP false positives, and FN false negatives.
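A worked example of these metrics, using made-up entity-level counts (not results from the paper):

```python
# Precision, recall, and F1 computed from entity-level counts.

def prf1(tp, fp, fn):
    p = tp / (tp + fp)
    r = tp / (tp + fn)
    f1 = 2 * p * r / (p + r)
    return p, r, f1

# 90 correctly extracted entities, 10 spurious, 8 missed.
p, r, f1 = prf1(tp=90, fp=10, fn=8)
# p = 0.900, r ≈ 0.918, f1 ≈ 0.909
```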
The experiments were conducted on the dataset described in “Data processing” section, utilizing the training/validation/test split of 8:1:1.
To validate the effectiveness of the BERT-BiLSTM-CRF model, we compared it against two mainstream models commonly used in the standards domain. The experimental results are presented in Table 7.
BiLSTM-CRF: A classic sequence labeling model without pre-trained embeddings, using random initialization for word vectors.
BERT-CRF: A model that uses BERT for embeddings but lacks the BiLSTM layer for sequence feature capturing.
As shown in Table 7, our proposed model achieves the best performance across all metrics. Compared to BiLSTM-CRF, our model improves the F1-score by approximately 3%. This demonstrates that the pre-trained BERT layer effectively captures the rich semantic information of agricultural terms, resolving the polysemy issues that traditional embeddings fail to handle. Compared to BERT-CRF, the addition of the BiLSTM layer results in a 2.8% improvement in F1-score. This indicates that BiLSTM is crucial for capturing long-distance dependencies in standard clauses, such as complex “Inspection Rules” that span multiple lines.
To further analyze the model’s robustness, we evaluated the extraction performance for different entity types. The F1-scores for key entities are shown in Fig. 4 and Table 8.
Comparison of F1-scores across different models.
The model achieves near-perfect performance on highly structured entities like “Standard Code”. Notably, even for complex entities like “Inspection Rules”, the model maintains an F1-score of nearly 90%, proving the efficacy of the hybrid architecture.
Knowledge graph
Knowledge storage
In this paper, a Neo4j graph database is used for knowledge storage: all the knowledge extracted from seed quality standard documents is integrated and organized into triples, i.e., entity-relationship-entity, and stored in Neo4j, mapping the structured knowledge onto the graph database25. In Neo4j, labels define the types of nodes, which helps to filter and query nodes quickly. Nodes and their attributes correspond to entities and their properties, edges correspond to inter-entity relationships, and the storage scheme is shown in Table 9. The final knowledge graph of seed quality standard documents contains 2436 nodes and 3011 entity relationships.
Knowledge graph visualization
The Neo4j database provides a visualization tool for the knowledge graph of seed quality standard documents. Figure 5 shows some nodes and edges of the graph. Nodes of the same color indicate the same type of knowledge entity, and the connecting lines between nodes indicate the relationships between them. The knowledge entities include the standard code, proposing organization, attributing unit, drafting unit, drafter, test method, and test rule, among others; the entity relationships include revision, citation, and drafting relationships. By adding query conditions, the relevant information of a standard can be displayed visually. Clicking a “standard code” entity node reveals its attributes, including the release date, implementation date, standard name, scope of provisions, and scope of use. Through this information, users can fully understand the formulation process of a seed quality standard, the institutions and personnel involved, and the testing techniques and rules adopted. This visualization makes the knowledge structure of seed quality standard documents clear, helping relevant personnel carry out in-depth analysis and research.
Local visualization of the knowledge graph.
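A multi-dimensional retrieval of the kind described above could be issued as a parameterized Cypher query; the labels, relationship types, and property names below are assumptions about the graph schema (not taken from Table 9), and no live database connection is made in this sketch.

```python
# Example multi-dimensional retrieval: for a given crop, find the quality
# indicators and limits specified by its standard. Schema names here
# (Standard, Crop, APPLIES_TO, HAS_INDICATOR) are illustrative.

CYPHER = """
MATCH (s:Standard)-[:APPLIES_TO]->(c:Crop {name: $crop})
MATCH (s)-[r:HAS_INDICATOR]->(i:Indicator)
RETURN s.code AS standard, i.name AS indicator, r.limit AS limit
"""

def build_query(crop):
    # With the official neo4j driver this would be executed as:
    #   with driver.session() as session:
    #       records = list(session.run(CYPHER, crop=crop))
    # Here we only return the query and its parameters.
    return CYPHER, {"crop": crop}

query, params = build_query("Maize")
```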
Conclusion and discussion
Conclusion
To address the challenges of data heterogeneity and inefficient semantic retrieval in seed quality standards, this study proposed a comprehensive framework for constructing a domain-specific KG. The key contributions are summarized as follows:
First, we designed a seed quality standard ontology that unifies structured parameters and unstructured clauses. This ontology, validated through SMART principles and competency questions, defines 7 core classes and 12 semantic relationships, forming a semantic schema to model the full lifecycle of seed standards.
Second, we developed a hybrid knowledge extraction model combining rule-based methods and deep learning. This model achieved an F1-score of 91.61% in entity and relationship extraction, outperforming baseline models by leveraging domain rules for structured data and contextual learning for unstructured text.
Third, we successfully built a KG with over 2,400 nodes, demonstrating its practical value in facilitating accurate and intelligent standard retrieval.
Overall, this research provides a viable technical path for the “SMART” transformation of agricultural standards. For regulators, it offers a tool to visualize standard relationships; for seed companies and farmers, it simplifies the complex process of consulting quality requirements, thereby contributing to the digital management of the plantation industry.
Interpretation of results
The experimental results demonstrate that our proposed hybrid framework significantly enhances the digitization of seed quality standards.
The BERT-BiLSTM-CRF model achieved an F1-score of 91.61%, which validates our hypothesis that pre-trained language models can effectively handle the semantic complexity of agricultural texts. Specifically, the high recognition rate for “Inspection Rules” indicates that the model successfully captured the boundary features of long, descriptive clauses, which was a major bottleneck in previous rule-based systems.
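A sequence tagger of this kind typically emits BIO tags per token, which must be decoded into entity spans before they enter the graph. The following is a generic decoding sketch; the tag set (`TestMethod`) and the sample token sequence are illustrative, not the paper's label scheme.

```python
# Sketch: decode a BIO tag sequence (the usual output of a
# BERT-BiLSTM-CRF tagger) into (entity_text, entity_type) spans.
# The tag names and sample tokens are illustrative assumptions.

def bio_to_spans(tokens, tags):
    """Collect contiguous B-/I- runs into entity spans."""
    spans, current, current_type = [], [], None
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):            # new entity begins
            if current:
                spans.append(("".join(current), current_type))
            current, current_type = [tok], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            current.append(tok)             # entity continues
        else:                               # O tag or type mismatch
            if current:
                spans.append(("".join(current), current_type))
            current, current_type = [], None
    if current:                             # flush trailing entity
        spans.append(("".join(current), current_type))
    return spans

tokens = ["净", "度", "分", "析"]
tags = ["B-TestMethod", "I-TestMethod", "I-TestMethod", "I-TestMethod"]
entities = bio_to_spans(tokens, tags)
```

The CRF layer's main contribution is exactly at this boundary level: it penalizes invalid transitions such as `O → I-TestMethod`, which is why long descriptive clauses like inspection rules are segmented more reliably than with a softmax head alone.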
Furthermore, the constructed Knowledge Graph (2,436 nodes) successfully transforms static PDF documents into a queryable semantic network, enabling precise answers to competency questions that traditional keyword search engines fail to address.
Unlike general-purpose KG construction methods that rely solely on text extraction, our approach integrates Regular Expressions for tabular data. This hybrid strategy ensures high accuracy for critical numerical limits, which is a strict requirement for standard enforcement. Compared to recent LLM-based extraction attempts, our supervised learning approach offers better reproducibility and explainability. While LLMs are powerful, they are prone to generating non-existent standard clauses. In contrast, our pipeline extracts only what is explicitly present in the text, ensuring the normative authority of the standards.
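The rule-based branch for numerical limits can be illustrated with a single pattern. The regex below is a simplified assumption of what such a rule looks like, not the paper's actual expression; the indicator names and thresholds in the sample row are likewise hypothetical.

```python
# Sketch: a regular expression extracting numerical quality limits
# (indicator, comparator, value, unit) from table-like text. The
# pattern and the sample row are illustrative assumptions.
import re

LIMIT_RE = re.compile(
    r"(?P<indicator>[\u4e00-\u9fff]+)"   # CJK indicator name, e.g. purity
    r"\s*(?P<op>[≥≤><=])"                # comparator
    r"\s*(?P<value>\d+(?:\.\d+)?)"       # numeric threshold
    r"\s*(?P<unit>%)?"                    # optional percent unit
)

row = "净度 ≥ 99.0% 发芽率 ≥ 85%"
limits = [m.groupdict() for m in LIMIT_RE.finditer(row)]
```

Because every extracted value is matched verbatim against the source text, a failed match surfaces as a missing field rather than a plausible-looking fabricated threshold, which is the reproducibility property the paragraph above argues for.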
Limitation and future work
Despite these achievements, three limitations should be noted:
1. Data scope constraints: The current dataset focuses on national and industry-level standards, excluding local and enterprise standards, which often have non-standardized formats. This limits the generalizability of the KG to regional or enterprise-specific scenarios.
2. PDF parsing dependency: Knowledge extraction performance relies heavily on PDF quality. Scanned documents with poor OCR accuracy introduce noise, reducing extraction precision for historical standards.
3. Static knowledge maintenance: The KG is currently static and cannot automatically update with standard revisions. Manual intervention is required to handle dynamic changes, hindering real-time usability.
To address these limitations, future work will focus on three directions:
1. Multimodal knowledge extraction: Integrate computer vision (CV) techniques to process non-text elements in standards, such as diagrams and scanned tables, expanding data coverage to older or unstructured documents.
2. Incremental KG updating: Develop an automated mechanism to monitor standard revisions and update the KG incrementally, including handling relationships like "standard A supersedes standard B" and revising outdated thresholds.
3. Knowledge-driven question answering (KBQA): Build a natural language interface atop the KG to support user-centric queries and decision-making, enhancing practical utility for end-users.
Data availability
The data that support the findings of this study are available from the corresponding author, Qiong He, upon reasonable request.
References
Niu, S. et al. Research on a lightweight method for maize seed quality detection based on improved YOLOv8. IEEE Access 12, 32927–32937 (2024).
Gang, Y., Chen, C. & Weiyue, W. Research on the protection and utilization of agricultural germplasm resources under the action of seed industry revitalization—Taking Yangzhou City, Jiangsu Province as an Example. Jiangsu Agric. Sci. 52, 20–27 (2024).
Maredia, M. K. & Bartle, B. Excess demand amid quality misperceptions: the case for low-cost seed quality signalling strategies. Eur. Rev. Agric. Econ. 50, 360–394 (2023).
Jeon, K. et al. A relational framework for smart information delivery manual (IDM) specifications. Adv. Eng. Inform. 49, 101319 (2021).
Liu, Q., Li, Y., Duan, H., Liu, Y. & Qin, Z. Knowledge graph construction techniques. J. Comput. Res. Dev. (2016).
Peng, C., Xia, F., Naseriparsa, M. & Osborne, F. Knowledge graphs: Opportunities and challenges. Artif. Intell. Rev. 56, 13071–13102 (2023).
Li, X. et al. Exploration and practice of standard digitalization application in the aviation industry. In Information Technology and Standardization 68–72, 78 (2022).
Fu, Z. & Qiang, L. Constructing ontologies by mining deep semantics from XML schemas and XML instance documents. Int. J. Intell. Syst. 37, 661–698 (2021).
Hu, D., Weng, C., Wang, R., Song, X. & Qin, L. Construction method of national food safety standard ontology. In Green, Pervasive, and Cloud Computing, GPC 2022 (eds Yu, C., Zhou, J., Song, X. & Lu, X.) Vol. 13744, 50–66 (Springer International Publishing, Cham, 2023).
Fan, Y., Sun, Y., Mi, B. & Fu, X. Aircraft fault knowledge graph construction based on large language model incorporating Chinese airworthiness knowledge. Int. J. Softw. Eng. Knowl. Eng. 1, 2. https://doi.org/10.1142/S0218194025500962 (2025).
da Silva, A. R. On testing for seed sample heterogeneity with the exact probability distribution of the germination count range. Seed Sci. Res. 30, 59–63 (2020).
Cui, M. et al. Semantic rule-based information extraction for meteorological reports. Int. J. Mach. Learn. Cybern. 15, 177–188 (2024).
Nastou, K., Koutrouli, M., Pyysalo, S. & Jensen, L. J. Improving dictionary-based named entity recognition with deep learning. Bioinformatics (Oxford, England) 40, ii45–ii52 (2024).
Keraghel, I., Morbieu, S. & Nadif, M. Recent advances in named entity recognition: A comprehensive survey and comparative study (2024).
Zhu, Y. A knowledge graph and BiLSTM-CRF-enabled intelligent adaptive learning model and its potential application. Alex. Eng. J. 91, 305–320 (2024).
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. In North American Chapter of the Association for Computational Linguistics (2019).
Li, L., Haruna, A., Ying, W., Noman, K. & Li, Y. Knowledge graph-driven fault diagnosis for aviation equipment: Integrating improved joint extraction with large language model. J. Ind. Inf. Integr. 50, 101039 (2026).
Li, J., Cheng, X., Zhao, X., Nie, J.-Y. & Wen, J.-R. HaluEval: A large-scale hallucination evaluation benchmark for large language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (eds Bouamor, H., Pino, J. & Bali, K.) 6449–6464 (Association for Computational Linguistics, Singapore, 2023). https://doi.org/10.18653/v1/2023.emnlp-main.397.
Kelly, Y., O’Rourke, N., Flynn, R., Hegarty, J. & O’Connor, L. Definitions of health and social care standards used internationally: A narrative review. Int. J. Health Plan. Manag. 38, 40–52 (2023).
Gao, S., Ren, G. & Li, H. Knowledge management in construction health and safety based on ontology modeling. Appl. Sci. Basel 12, 8574 (2022).
Zhou, D. et al. Ontology Reshaping for Knowledge Graph Construction: Applied on Bosch Welding Case. In The Semantic Web—ISWC 2022 770–790 (Springer, Cham, 2022). https://doi.org/10.1007/978-3-031-19433-7_44.
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint at https://doi.org/10.48550/arXiv.1810.04805 (2019).
Xu, S., Zhang, C. & Hong, D. BERT-based NLP techniques for classification and severity modeling in basic warranty data study. Insur. Math. Econ. 107, 57–67 (2022).
Huang, Z., Xu, W. & Yu, K. Bidirectional LSTM-CRF models for sequence tagging. Preprint at https://doi.org/10.48550/arXiv.1508.01991 (2015).
Monteiro, J., Sa, F. & Bernardino, J. Experimental evaluation of graph databases: JanusGraph, Nebula Graph, Neo4j, and TigerGraph. Appl. Sci. Basel 13, 5770 (2023).
Acknowledgements
The authors would like to thank the Beijing Knowledge Management Research Base for their assistance with the study.
Funding
The research is funded by the Beijing Municipal Education Commission Research Plan General Project (Grant Number: KM202411232007).
Author information
Authors and Affiliations
Contributions
Qiong He: Writing—review and editing, supervision, funding acquisition. Zhenwei Yang: Writing—original draft, data collection, data curation, software, visualization, conceptualization. Jian Zhang: Writing—original draft, Software, methodology, validation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Yang, Z., He, Q. & Zhang, J. Construction and application of knowledge graph for seed quality standard documents. Sci Rep 16, 5997 (2026). https://doi.org/10.1038/s41598-026-37084-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-026-37084-y