Bioinformatics and Management Science: Some Common Tools and Techniques

Published Online: https://doi.org/10.1287/opre.1030.0095
