Genome wide gene expression levels are used for classification of cancer states (e.g. cancer patient vs. normal patient). Due to the large number of genes in the gene expression data sets (20,000+ genes), the genes are statistically ranked and only a small number of these genes (~100) are selected for developing a classifier. Although, often the classifiers developed with these statistically highly ranked genes provide high accuracy, biological relevance of these genes to cancer is not clear. In this study, we will compare the accuracy of the feature selection methods in selecting the right set of cancer related genes.
Molecular Biology literature search skills. Students who have taken upper level molecular biology courses will be preferred.
Programming skills from COSC 102 (Introduction to Computing II). Having taken Operating Systems (COSC 301) is preferred.
Number of Student Researchers
Applications open on 01/15/2017 and close on 02/07/2017