NYAS Conferences
New York Academy of Sciences
left end
Search
divider divider feedback right end
Annals of the New York Academy of Sciences Annals of the New York Academy of Sciences login

Main

Browse Volumes

Forthcoming Volumes

Annals PrePrints

Annals Extra

E-mail Alerts

Subscriptions & Orders

New Proposals

Author Guidelines

About Annals

Help

Get free Annals volume as a NYAS member: http://www.nyas.org/annalsreaderhw
Reverse Engineering Biological Networks: Opportunities and Challenges in Computational Methods for Pathway Inference Volume 1115 published November 2007
Ann. N.Y. Acad. Sci. 1115: 154–167 (2007). doi: 10.1196/annals.1407.005
Copyright © 2007 by the New York Academy of Sciences
description | purchase volume purchase this volume

This Volume
Table of Contents
Description
This Article
Full Text
Full Text (PDF)
All Versions of this Article:
annals.1407.005v1
1115/1/154    most recent
Services
Similar articles in this journal
Similar articles in PubMed
Alert me to new issues of the journal
Download to citation manager
Citing Articles
Citing Articles via Google Scholar
Google Scholar
Articles by CRAIG, R. A.
Articles by LIAO, L.
Search for Related Content
PubMed
PubMed Citation
Articles by CRAIG, R. A.
Articles by LIAO, L.

Part V. Some Reverse Engineering Algorithms

Improving Protein–Protein Interaction Prediction Based on Phylogenetic Information Using a Least-Squares Support Vector Machine

ROGER A. CRAIGa AND LI LIAOa

a Department of Computer and Information Sciences, University of Delaware, Newark, Delaware 19716, USA

Key Words: protein–protein interaction • phylogenetic vectors • least-squares support vector machines

Addressed for correspondence: Li Liao, Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716. Voice: 302-831-3500; fax: 302-831-8458.  lliao{at}cis.udel.edu

Predicting protein–protein interactions has become a key step of reverse-engineering biological networks to better understand cellular functions. The experimental methods in determining protein–protein interactions are time-consuming and costly, which has motivated vigorous development of computational approaches for predicting protein–protein interactions. A set of recently developed bioinformatics methods utilizes coevolutionary information of the interacting partners (e.g., as exhibited in the form of correlations between distance matrices, where, for each protein, a matrix stores the pairwise distances between the protein and its orthologs in a group of reference genomes). We proposed a novel method to account for the intra-matrix correlations in improving predictive accuracy. The distance matrices for a pair of proteins are transformed and concatenated into a phylogenetic vector. A least-squares support vector machine is trained and tested on pairs of proteins, represented as phylogenetic vectors, whose interactions are known. The intra-matrix correlations are accounted for by introducing a weighted linear kernel, which determines the dot product of two phylogenetic vectors. The performance, measured as receiver operator characteristic (ROC) score in cross-validation experiments, shows significant improvement of our method (ROC score 0.928) over that obtained by Pearson correlations (0.659).






footerLeft footerRight