Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Construction and application of knowledge graph for seed quality standard documents
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 22 January 2026

Construction and application of knowledge graph for seed quality standard documents

  • Zhenwei Yang1,2,
  • Qiong He1,2 &
  • Jian Zhang1,2 

Scientific Reports , Article number:  (2026) Cite this article

  • 303 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

Seed quality standards are the essential basis for crop cultivation supervision. With the continuous development of China’s standard system, the number of seed quality standard documents has increased dramatically. However, the rapid growth and unstructured nature of standard documents hinder efficient query and semantic association. To address the lack of structured knowledge representation in the seed domain, this study proposes a Knowledge Graph (KG) construction framework for seed quality standards. First, a domain-specific ontology is constructed, defining 7 core classes and 12 relationship types to standardize semantic structure. Second, a hybrid knowledge extraction strategy is implemented: regular expressions are used for semi-structured tabular data, while a BERT-BiLSTM-CRF model is employed for unstructured text. Experimental results demonstrate that the proposed model achieves an F1-score of 91.61% in Named Entity Recognition (NER), outperforming than other model. Finally, a KG containing 2436 nodes and 3011 relationships is stored in Neo4j, enabling multi-dimensional retrieval and visualization. The proposed framework significantly improves the accuracy of standard information retrieval and provides a digital foundation for intelligent quality management in the plantation industry.S

Similar content being viewed by others

Robust seed germination prediction using deep learning and RGB image data

Article Open access 11 November 2021

Research on multilateral collaboration strategies in agricultural seed quality assurance

Article Open access 17 May 2024

Genome-wide association study identified candidate genes for seed size and seed composition improvement in M. truncatula

Article Open access 19 February 2021

Data availability

The data that support the findings of this study are available from the corresponding author, Qiong He, upon reasonable request.

References

  1. Niu, S. et al. Research on a lightweight method for maize seed quality detection based on improved YOLOv8. IEEE Access 12, 32927–32937 (2024).

    Google Scholar 

  2. Gang, Y., Chen, C. & Weiyue, W. Research on the protection and utilization of agricultural germplasm resources under the action of seed industry revitalization—Taking Yangzhou City, Jiangsu Province as an Example. Jiangsu Agric. Sci. 52, 20–27 (2024).

    Google Scholar 

  3. Maredia, M. K. & Bartle, B. Excess demand amid quality misperceptions: the case for low-cost seed quality signalling strategies. Eur. Rev. Agric. Econ. 50, 360–394 (2023).

    Google Scholar 

  4. Jeon, K. et al. A relational framework for smart information delivery manual (IDM) specifications. Adv. Eng. Inform. 49, 101319 (2021).

    Google Scholar 

  5. Liu, Q., Li, Y., Duan, H., Liu, Y. & Qin, Z. Knowledge graph construction techniques. J. Comput. Res. Dev. (2016).

  6. Peng, C., Xia, F., Naseriparsa, M. & Osborne, F. Knowledge graphs: Opportunities and challenges. Artif. Intell. Rev. 56, 13071–13102 (2023).

    Google Scholar 

  7. Li, X. et al. exploration and practice of standard digitalization application in the aviation industry. In Information Technology and Standardization. 68–72+78 (2022).

  8. Fu, Z. & Qiang, L. Constructing ontologies by mining deep semantics from XML schemas and XML instance documents. Int. J. Intell. Syst. 37, 661–698 (2021).

    Google Scholar 

  9. Hu, D., Weng, C., Wang, R., Song, X. & Qin, L. Construction Method of National Food Safety Standard Ontology. in Green, Pervasive, and Cloud Computing, GPC 2022 (eds Yu, C., Zhou, J., Song, X. & Lu, X.) vol. 13744 50–66 (Springer International Publishing Ag, Cham, 2023).

  10. Fan, Y., Sun, Y., Mi, B. & Fu, X. Aircraft fault knowledge graph construction based on large language model incorporating Chinese airworthiness knowledge. Int. J. Softw. Eng. Knowl. Eng. 1, 2. https://doi.org/10.1142/S0218194025500962 (2025).

    Google Scholar 

  11. da Silva, A. R. On testing for seed sample heterogeneity with the exact probability distribution of the germination count range. Seed Sci. Res. 30, 59–63 (2020).

    Google Scholar 

  12. Cui, M. et al. Semantic rule-based information extraction for meteorological reports. Int. J. Mach. Learn. Cybern. 15, 177–188 (2024).

    Google Scholar 

  13. Nastou, K., Koutrouli, M., Pyysalo, S. & Jensen, L. J. Improving dictionary-based named entity recognition with deep learning. Bioinformatics (Oxford, England) 40, ii45–ii52 (2024).

    Google Scholar 

  14. Keraghel, I., Morbieu, S. & Nadif, M. Recent advances in named entity recognition: A comprehensive survey and comparative study (2024).

  15. Zhu, Y. A knowledge graph and BiLSTM-CRF-enabled intelligent adaptive learning model and its potential application. Alex. Eng. J. 91, 305–320 (2024).

    Google Scholar 

  16. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. In North American Chapter of the Association for Computational Linguistics (2019).

  17. Li, L., Haruna, A., Ying, W., Noman, K. & Li, Y. Knowledge graph-driven fault diagnosis for aviation equipment: Integrating improved joint extraction with large language model. J. Ind. Inf. Integr. 50, 101039 (2026).

    Google Scholar 

  18. Li, J., Cheng, X., Zhao, X., Nie, J.-Y. & Wen, J.-R. HaluEval: A large-scale hallucination evaluation benchmark for large language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (eds Bouamor, H., Pino, J. & Bali, K.) 6449–6464 (Association for Computational Linguistics, Singapore, 2023). https://doi.org/10.18653/v1/2023.emnlp-main.397.

  19. Kelly, Y., O’Rourke, N., Flynn, R., Hegarty, J. & O’Connor, L. Definitions of health and social care standards used internationally: A narrative review. Int. J. Health Plan. Manag. 38, 40–52 (2023).

    Google Scholar 

  20. Gao, S., Ren, G. & Li, H. Knowledge Management in Construction Health and Safety Based on Ontology Modeling. Appl. Sci. Basel 12, 8574 (2022).

    Google Scholar 

  21. Zhou, D. et al. Ontology Reshaping for Knowledge Graph Construction: Applied on Bosch Welding Case. In The Semantic Web—ISWC 2022 770–790 (Springer, Cham, 2022). https://doi.org/10.1007/978-3-031-19433-7_44.

  22. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint at https://doi.org/10.48550/arXiv.1810.04805 (2019).

  23. Xu, S., Zhang, C. & Hong, D. BERT-based NLP techniques for classification and severity modeling in basic warranty data study. Insur. Math. Econ. 107, 57–67 (2022).

    Google Scholar 

  24. Huang, Z., Xu, W. & Yu, K. Bidirectional LSTM-CRF models for sequence tagging. Preprint at https://doi.org/10.48550/arXiv.1508.01991 (2015).

  25. Monteiro, J., Sa, F. & Bernardino, J. Experimental evaluation of graph databases: JanusGraph, Nebula Graph, Neo4j, and TigerGraph. Appl. Sci. Basel 13, 5770 (2023).

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank the Beijing Knowledge Management Research Base for their assistance with the study.

Funding

The research is funded by the Beijing Municipal Education Commission Research Plan General Project (Grant Number: KM202411232007).

Author information

Authors and Affiliations

  1. College of Management Science and Engineering, Beijing Information Science and Technology University, Beijing, 100192, China

    Zhenwei Yang, Qiong He & Jian Zhang

  2. Beijing Knowledge Management Research Base, Beijing, 100192, China

    Zhenwei Yang, Qiong He & Jian Zhang

Authors
  1. Zhenwei Yang
    View author publications

    Search author on:PubMed Google Scholar

  2. Qiong He
    View author publications

    Search author on:PubMed Google Scholar

  3. Jian Zhang
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Qiong He: Writing—review and editing, supervision, funding acquisition. Zhenwei Yang: Writing—original draft, data collection, data curation, software, visualization, conceptualization. Jian Zhang: Writing—original draft, Software, methodology, validation.

Corresponding author

Correspondence to Qiong He.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, Z., He, Q. & Zhang, J. Construction and application of knowledge graph for seed quality standard documents. Sci Rep (2026). https://doi.org/10.1038/s41598-026-37084-y

Download citation

  • Received: 04 December 2025

  • Accepted: 19 January 2026

  • Published: 22 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-37084-y

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Standard digitization
  • Planting industry
  • Seed quality standards
  • Knowledge graph
  • Knowledge extraction
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics