Integrating Text Mining into the MGI Biocuration Workflow

Dowell, Karen; McAndrews-Hill, Monica; Hill, David; Drabkin, Harold; Blake, Judith

doi:10.1038/npre.2009.3262.1

Download PDF

Presentation
Open access
Published: 20 May 2009

3rd International Biocuration Conference

Integrating Text Mining into the MGI Biocuration Workflow

Karen Dowell¹,
Monica McAndrews-Hill²,
David Hill²,
Harold Drabkin² &
…
Judith Blake²

Nature Precedings (2009)Cite this article

352 Accesses
2 Citations
Metrics details

Abstract

A major challenge for the development of resources for functional and comparative genomics is the extraction of data from the biomedical literature. Although text retrieval and extraction for biological data is an active research field, few applications have been integrated into production literature curation systems such as those of the model organism databases.In September 2008, Mouse Genome Informatics (MGI) at The Jackson Lab initiated a search for dictionary-based text mining tools that we could integrate into our curation workflow. MGI has rigorous document triage and annotation procedures designed to identify articles about mouse genome biology and determine whether those articles should be curated. We currently screens approximately 1000 journal articles a month for Gene Ontology terms, gene mapping, gene expression, phenotype data and other key biological information. Although we don’t foresee that human curation tasks can be fully automated in the near future, we are eager to implement entity name recognition and gene tagging tools that can help streamline our curation workflow and simplify gene indexing tasks in the MGI system. In this presentation, we discuss our search process and the steps we took to identify a short list of potential tools for further evaluation. We present our performance metrics and success criteria, and pilot projects in progress. The primary applications under current review are Fraunhofer SCAI’s ProMiner and NCBO’s Open-Biomedical Annotator.

Article PDF

Author information

Authors and Affiliations

University of Maine Graduate School of Biomedical Sciences https://www.nature.com/nature
Karen Dowell
Mouse Genome Informatics at The Jackson Laboratory https://www.nature.com/nature
Monica McAndrews-Hill, David Hill, Harold Drabkin & Judith Blake

Authors

Karen Dowell
View author publications
Search author on:PubMed Google Scholar
Monica McAndrews-Hill
View author publications
Search author on:PubMed Google Scholar
David Hill
View author publications
Search author on:PubMed Google Scholar
Harold Drabkin
View author publications
Search author on:PubMed Google Scholar
Judith Blake
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Karen Dowell.

Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and permissions

About this article

Cite this article

Dowell, K., McAndrews-Hill, M., Hill, D. et al. Integrating Text Mining into the MGI Biocuration Workflow. Nat Prec (2009). https://doi.org/10.1038/npre.2009.3262.1

Download citation

Received: 20 May 2009
Accepted: 20 May 2009
Published: 20 May 2009
DOI: https://doi.org/10.1038/npre.2009.3262.1

Integrating Text Mining into the MGI Biocuration Workflow

Abstract

Similar content being viewed by others

Cross-comparison of gut metagenomic profiling strategies

Approaches for accelerating microbial gene function discovery using artificial intelligence

Identifying genomic data use with the Data Citation Explorer

Article PDF

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Search

Quick links

Abstract

Similar content being viewed by others

Cross-comparison of gut metagenomic profiling strategies

Approaches for accelerating microbial gene function discovery using artificial intelligence

Identifying genomic data use with the Data Citation Explorer

Article PDF

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links