TY - GEN
T1 - On the classification of cancer cell gene via Expressive Value Distance (EVD) algorithm and its comparison to the optimally trained ANN method
AU - Zhang, Tong
AU - Wang, Chi-Hsu
AU - Tam, Sik Chung
AU - Chen, C. L.Philip
PY - 2011/1/1
Y1 - 2011/1/1
N2 - In recent years, cancer can be detected and recognized by analyzing the sample's expression profile. The cancer gene expression data are high dimensional, high variable dependent, and very noisy. The dimension reduction method is often used for processing the high dimensional data. In this study, a new statistical dimension reduction method called Expressive Value Distance (EVD) is developed and proposed for the practical high-dimensional gene expression cancer data. The feature genes data extracted by EVD are arranged for training the optimally trained Artificial Neural Network (ANN). The trained ANN is then used to classify whether the unseen gene data is cancer or not. In comparison of ANN classification with and without EVD, it is found that both of the ANN can classify the cancer data in good accuracy. With the EVD method, the great amount of data (2000 genes) can be effectively reduced to 16 genes. Therefore, EVD is an effective dimension reduction method. Even the EVD method is not used, the optimally trained ANN is also an advanced method for classifying the high dimensional and complicated cancer data. Briefly, it proves that optimally trained ANN is a very robust classification technique.
AB - In recent years, cancer can be detected and recognized by analyzing the sample's expression profile. The cancer gene expression data are high dimensional, high variable dependent, and very noisy. The dimension reduction method is often used for processing the high dimensional data. In this study, a new statistical dimension reduction method called Expressive Value Distance (EVD) is developed and proposed for the practical high-dimensional gene expression cancer data. The feature genes data extracted by EVD are arranged for training the optimally trained Artificial Neural Network (ANN). The trained ANN is then used to classify whether the unseen gene data is cancer or not. In comparison of ANN classification with and without EVD, it is found that both of the ANN can classify the cancer data in good accuracy. With the EVD method, the great amount of data (2000 genes) can be effectively reduced to 16 genes. Therefore, EVD is an effective dimension reduction method. Even the EVD method is not used, the optimally trained ANN is also an advanced method for classifying the high dimensional and complicated cancer data. Briefly, it proves that optimally trained ANN is a very robust classification technique.
KW - Artificial neural network
KW - Classification of cancer
KW - Dimension reduction
KW - Expressive value distance
KW - Gene expression profile
UR - http://www.scopus.com/inward/record.url?scp=80053083895&partnerID=8YFLogxK
U2 - 10.1109/FUZZY.2011.6007726
DO - 10.1109/FUZZY.2011.6007726
M3 - Conference contribution
AN - SCOPUS:80053083895
SN - 9781424473175
T3 - IEEE International Conference on Fuzzy Systems
SP - 2199
EP - 2204
BT - FUZZ 2011 - 2011 IEEE International Conference on Fuzzy Systems - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
ER -