Biclustering is an important method for analysing gene expression data, used to find subsets of genes that share compatible expression patterns. Although several biclustering algorithms have been proposed, few provide a query-driven approach that lets biologists search for biclusters containing a particular gene of interest. In this paper, we propose a generalised fuzzy-based approach, Weighted Fuzzy-based Maximum Similarity Biclustering (WF-MSB), for extracting a query-driven bicluster based on a user-defined reference gene. A fuzzy-based similarity measure and a condition-weighting approach are used to extract biclusters that are significant in terms of expression levels. WF-MSB discovers both the bicluster most similar to the reference gene and the bicluster most dissimilar to it. The proposed method was evaluated against MSBE on a real yeast microarray data set and on synthetic data sets. The experimental results show that WF-MSB can effectively find biclusters with significant GO-based functional meaning.
|Number of pages||21|
|Journal||International Journal of Data Mining and Bioinformatics|
|State||Published - 1 Feb 2011|
- Data mining
- Fuzzy set
- Gene expression
- Gene similarity measure