  • User's Guide

Introduction: putting it together

About this article

Cite this article

Introduction: putting it together. Nat Genet 35 (Suppl 1), 5–8 (2003). https://doi.org/10.1038/ng1188

  • DOI: https://doi.org/10.1038/ng1188
