Abstract
As more and more genomes are sequenced, comparative genomics approaches provide a methodology for identifying conserved regulatory elements that may be involved in gene regulation. In this study, we combined comparative genomics with de novo motif discovery to identify potential human transcription factor binding motifs that are overrepresented and conserved in the upstream regions of a set of co-regulated genes. We validated our approach by analyzing a well-characterized muscle specific gene set. Our approach also performed better than other existing programs, such as Toucan and CompareProspector, based on the motif discovery results for the muscle data set.