Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Precedings
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • RSS feed
  1. nature
  2. nature precedings
  3. presentation
  4. article
Integrating Text Mining into the MGI Biocuration Workflow
Download PDF
Download PDF
  • Presentation
  • Open access
  • Published: 20 May 2009

3rd International Biocuration Conference

Integrating Text Mining into the MGI Biocuration Workflow

  • Karen Dowell1,
  • Monica McAndrews-Hill2,
  • David Hill2,
  • Harold Drabkin2 &
  • …
  • Judith Blake2 

Nature Precedings (2009)Cite this article

  • 267 Accesses

  • 2 Citations

  • Metrics details

Abstract

A major challenge for the development of resources for functional and comparative genomics is the extraction of data from the biomedical literature. Although text retrieval and extraction for biological data is an active research field, few applications have been integrated into production literature curation systems such as those of the model organism databases.In September 2008, Mouse Genome Informatics (MGI) at The Jackson Lab initiated a search for dictionary-based text mining tools that we could integrate into our curation workflow. MGI has rigorous document triage and annotation procedures designed to identify articles about mouse genome biology and determine whether those articles should be curated. We currently screens approximately 1000 journal articles a month for Gene Ontology terms, gene mapping, gene expression, phenotype data and other key biological information. Although we don’t foresee that human curation tasks can be fully automated in the near future, we are eager to implement entity name recognition and gene tagging tools that can help streamline our curation workflow and simplify gene indexing tasks in the MGI system. In this presentation, we discuss our search process and the steps we took to identify a short list of potential tools for further evaluation. We present our performance metrics and success criteria, and pilot projects in progress. The primary applications under current review are Fraunhofer SCAI’s ProMiner and NCBO’s Open-Biomedical Annotator.

Similar content being viewed by others

Cross-comparison of gut metagenomic profiling strategies

Article Open access 06 November 2024

Identifying genomic data use with the Data Citation Explorer

Article Open access 06 November 2024

The chronODE framework for modelling multi-omic time series with ordinary differential equations and machine learning

Article Open access 19 August 2025

Article PDF

Author information

Authors and Affiliations

  1. University of Maine Graduate School of Biomedical Sciences https://www.nature.com/nature

    Karen Dowell

  2. Mouse Genome Informatics at The Jackson Laboratory https://www.nature.com/nature

    Monica McAndrews-Hill, David Hill, Harold Drabkin & Judith Blake

Authors
  1. Karen Dowell
    View author publications

    Search author on:PubMed Google Scholar

  2. Monica McAndrews-Hill
    View author publications

    Search author on:PubMed Google Scholar

  3. David Hill
    View author publications

    Search author on:PubMed Google Scholar

  4. Harold Drabkin
    View author publications

    Search author on:PubMed Google Scholar

  5. Judith Blake
    View author publications

    Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Karen Dowell.

Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and permissions

About this article

Cite this article

Dowell, K., McAndrews-Hill, M., Hill, D. et al. Integrating Text Mining into the MGI Biocuration Workflow. Nat Prec (2009). https://doi.org/10.1038/npre.2009.3262.1

Download citation

  • Received: 20 May 2009

  • Accepted: 20 May 2009

  • Published: 20 May 2009

  • DOI: https://doi.org/10.1038/npre.2009.3262.1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • text mining
  • gene tagging
  • GO
  • gene ontology
  • curation
  • open-source
  • MGI
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Sign up for alerts
  • RSS feed

About the journal

  • Journal Information

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Precedings (Nat Preced)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2025 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing