|
Karthik Devarajan, PhD
Assistant Professor |
Recent advances in high-throughput technologies have given rise to large-scale biological data in the form of expression profiles of tens of thousands of genes and proteins, often with only a handful of tissue samples. The focus of my research is in the development of novel statistical methodology for the analysis of large-scale data stemming from high-throughput studies such as microarrays, comparative genomics hybridization, siRNA screening and microscopy. It includes methods for pattern recognition as well as for correlating a certain phenotype (such as tissue type, patient response to a certain treatment or survival time etc.) with large numbers of covariates (genes).
We are currently investigating two methods from statistical learning theory - nonnegative matrix factorization and partial least squares. Specifically, we are developing unsupervised clustering methods for molecular pattern discovery as well as for text mining applications in biomedical informatics. In this setting, there is no prior knowledge of the expected gene expression patterns for a given set of genes or for any phenotype. Our methods are based on nonnegative matrix factorization for the discrimination of competing models and elucidation of clusters and hidden variables within such large-scale data. Another problem of interest is in associating large scale molecular data and clinical data with patient survival time in the presence of censoring. This is an important issue in translational medicine, however very little research has been done in this area. We addressed this problem by developing methods that combine partial least squares with the accelerated failure time model for censored survival data. We are currently extending this approach for other learning theoretic methods and models for censored survival data.
Description of research projectsFox Chase Programs
- Devarajan K, Ebrahimi N. A generalization of the Cox proportional hazards regression model: Inference and Applications, Computational Statistics and Data Analysis, 2010, In press.
- Devarajan K, Zhou Y, Chachra N, Ebrahimi N. A supervised approach for predicting patient survival with gene expression data, Proceedings of the IEEE Tenth International Conference in Bioinformatics and Bioengineering, 26-31, Philadelphia, Pennsylvania, June 2010.
- Devarajan K, Ebrahimi, N. Testing for covariate effect in the Cox proportional hazards regression model. Communications in Statistics – Theory and Methods. 2009;38(14):2333-47.
- Hensley HH, Merkel CE, Chang W-CL, Devarajan K, Cooper HS, Clapper ML. Endoscopic Imaging and Size Estimation of Colorectal Adenomas in the Multiple Intestinal Neoplasia Mouse Gastrointestinal Endoscopy. 2009;69(3):742-749. PubMed
- Devarajan K. Non-negative matrix factorization – An analytical and interpretive tool in computational biology. PLoS Computational Biology. 2008;4(7):e1000029.
- Rennefahrt U, Deacon S, Parker SA, Devarajan K, Besser A, Chernoff J, Knapp S, Turk BE, Peterson JR. Specificity profiling of Pak kinases allows identification of novel phosphorylated sites. Journal of Biological Chemistry. 2007;282(21):15667-78. PubMed
- Altomare DA, Vaslet CA, Skele KL, De Rienzo A, Devarajan K, McClatchey AI, Kane AB, Jhanwar SC, Testa JR. A Mouse Model Recapitulating Molecular Features of Human Mesothelioma. Cancer Research. 2005;65(18):8090-95. PubMed
- Devarajan K, Ebrahimi N. Goodness of fit testing for the Cox proportional hazards model, Goodness-of-Fit Tests and Model Validity. Statistics for Industry and Technology. Birkhauser Publishers, 235-251.


Print this Page
E-mail This Page

For Information about cancer or