Research

Research Areas for Bioinformatics Methods

Genome-wide Association Interaction Networks

Our network hypothesis of genome-wide associations studies (GWAS) is
that most of information about disease susceptibility is not localized
in one gene’s main effect. Rather, this information is distributed
throughout a disease-specific network consisting of numerous small
gene-gene interactions and other small genetic effects. The
network-distributed information about disease risk is the motivation
for our Genetic Association Interaction Network (GAIN) and SNPrank
approaches. The SNPrank algorithm can be conceptualized as a
gene-importance seeker that surfs through the GAIN accumulating bits of
information from each genetic node about association with the
phenotype. SNPrank represents a modified eigenvector centrality
algorithm that ranks the importance of each SNP (single nucleotide
polymorphism) due to the network of complex interactions and main
effects encoded in the GAIN.

N. A. Davis, J. E. Crowe, Jr, N. M. Pajewski, B. A. McKinney, “Surfing
a genetic association interaction network to identify modulators of
antibody response to smallpox vaccine,” Genes and Immunity (Nature
Publishing). doi: 10.1038/gene.2010.3; 2010. (open
access
)

B.A. McKinney, J.E. Crowe, Jr., J. Guo, and D. Tian, “Capturing the
spectrum of interaction effects in genetic association studies by
simulated evaporative cooling network analysis,” PLoS Genetics. 5(3):
e1000432. doi:10.1371/journal.pgen.1000432; 2009. (open
access
)

N. A. Davis, Ahwan Pandey, and B. A. McKinney. “Real-world comparison
of CPU and GPU implementations of SNPrank: a network analysis tool for
genome-wide association studies,” Bioinformatics. 27 (2): 284-285;2011
(pdf)

Feature Selection

Feature selection is the determination of which genes, proteins,
and/or environmental factors are important to include in a classifier
or other model of disease susceptibility or other biological endpoint.
We are developing feature selection and classification methods for gene
and protein expression, structural and function MRI, and GWAS data. A
very popular machine learning method used in all areas of
Bioinformatics is called Random Forests, but it has very limited
ability to identify features that are important due to interaction
effects. Thus, we developed a method called Evaporative Cooling (EC),
which combines Random Forests and Relief-F. EC is able to identify both
main and interaction effects.

B.A. McKinney, J.E. Crowe, Jr., J. Guo, and D. Tian, “Capturing the
spectrum of interaction effects in genetic association studies by
simulated evaporative cooling network analysis,” PLoS Genetics. 5(3):
e1000432. doi:10.1371/journal.pgen.1000432; 2009. (open
access
)

B.A. McKinney, D. M. Reif, B. C. White, J. E. Crowe Jr., J. H. Moore.
“Evaporative cooling feature selection for genotypic data involving
interactions,” Bioinformatics. 23:2113-2120; 2007. (pdf)

B.A. McKinney, D.M. Reif, M.T. Rock, K. M. Edwards, S. F. Kingsmore,
J.H. Moore, and J.E. Crowe, “Cytokine expression patterns associated
with systemic adverse events following smallpox immunization,” Journal
of Infectious Diseases. 194(4): 36092; 2006. (free
pubmed central
)

B.A. McKinney, D.M. Reif, M.D. Ritchie and J.H. Moore, “Machine
learning for detecting gene-gene interactions,” Applied Bioinformatics,
5(2):77-88; 2006. (pdf)

D. M. Reif, A.A. Motsinger, B. A. McKinney, J. E. Crowe, Jr., J.H.
Moore, “Integrated analysis of genetic and proteomic data identifies
biomarkers associated with adverse events following smallpox
vaccination,”Genes and Immunity. 10:112-119; 2009. (abstract)

D. M. Reif, A. A. Motsinger, B.A. McKinney, and J. H. Moore, “Feature
selection using a random forest classifier for the integrated analysis
of multiple data types,” Proceedings of the IEEE Symposium on
Computational Intelligence in Bioinformatics and Computational Biology.
pp. 171-178; 2006. (pdf)

Mathematical Modeling of Time-Series Data

At the functional level, biological systems are complex dynamic
systems that must balance feedback from intrinsic noise in gene
regulatory network and environmental noise from surroundings. Thus, in
order to understand how biological systems function properly and why
they fail, one needs predictive mathematical models of the dynamic
system. One approach we have taken is to pose the problem in terms of
nonlinear system identification. We developed a hybrid approach to
automatically discover the network topology and parameters of coupled
differential equation systems that describes the kinetics of a system
of interacting biomolecules. This method combines a grammar-based
evolutionary algorithm with an Unscented Kalman particle filter. We are
also investigating agent based models.

B.A. McKinney, “New Informatics approaches for identifying biologic
relationships in time series data,” Wiley Interdisciplinary Reviews:
Nanomedicine and Nanobiotechnology. 1:60-68; 2009. (pdf)

B.A. McKinney and D. Tian, “Grammatical Immune System Evolution for
Reverse Engineering Nonlinear Dynamic Bayesian Models,” Cancer
Informatics. 6:433-447; 2008. (open access)

B.A. McKinney, J.E. Crowe, H.U. Voss, P.S. Crooke, N.L. Barney, and
J.H. Moore, “Hybrid Grammar-based Approach to Nonlinear Dynamical
System Identification from Biological Time Series,” Physical Review E,
Statistical, Nonlinear, and Soft Matter Physics 73, 021912; 2006. (pubmed)

Integrating Data for Computational Antibody-Antigen Docking

B.A. McKinney, N. Kallewaard, J.E. Crowe, Jr., and J. Meiler, “Using
the natural evolution of a rotavirus-specific human monoclonal antibody
to predict the complex topography of a viral antigenic site,” Immunome
Research. 3:8; 2007. (open
access
)

N. Kallewaard, B.A. McKinney, Y. Gu, A. Chen, B. V. V. Prasad, and J.E.
Crowe, Jr., “Functional maturation of the human antibody response to
rotavirus,” The Journal of Immunology. 180:3980-3989; 2008. (pdf)

F. Bibollet-Ruche, B.A. McKinney, F.H. Wagner, A. Duverger, A.A.
Ansari, O. Kutsch “Antibody-mediated activation of chimpanzee T cells
via the TCR/CD3 pathway is a function of the anti-CD3 antibody
isotype,” Journal of Virology, 82:10271-10278; 2008. (abstract)

And Much More.