Combinatorial motif analysis and hypothesis generation on a genomic scale

Yuh-Jyh Hu, Suzanne Sandmeyer, Calvin McLaughlin, Dennis Kibler

Research output: Contribution to journalArticlepeer-review

21 Scopus citations


Motivation: Computer-assisted methods are essential for the analysis of biosequences. Gene activity is regulated in part by the binding of regulatory molecules (transcription factors) to combinations of short motifs. The goal of our analysis is the development of algorithms to identify regulatory motifs and to predict the activity of combinations of those motifs. Approach: Our research begins with a new motif-finding method, using multiple objective functions and an improved stochastic iterative sampling strategy. Combinatorial motif analysis is accomplished by constructive induction that analyzes potential motif combinations. The hypothesis is generated by applying standard inductive learning algorithms. Results: Tests using 10 previously identified regulons from budding yeast and 14 artificial families of sequences demonstrated the effectiveness of the new motif-finding method. Motif combination and classification approaches were used in the analysis of a sample DNA array data set deprived from genome-wide gene expression analysis. Availability: Programs will be available as executable files upon request.

Original languageEnglish
Pages (from-to)222-232
Number of pages11
Issue number3
StatePublished - 1 Jan 2000

Fingerprint Dive into the research topics of 'Combinatorial motif analysis and hypothesis generation on a genomic scale'. Together they form a unique fingerprint.

Cite this