Publications
2009 |
Alves, R.; Rodríguez-Baena, D.; Aguilar-Ruiz, J. Gene association analysis: a survey of frequent pattern mining from gene expression data Journal Article In: Briefings in Bioinformatics, vol. 11, no. 2, pp. 210-224, 2009, ISSN: 1467-5463. Abstract | Links | BibTeX | Tags: Gene expression analysis, Pattern recognition @article{Alves2009, Establishing an association between variables is always of interest in genomic studies. Generation of DNA microarray gene expression data introduces a variety of data analysis issues not encountered in traditional molecular biology or medicine. Frequent pattern mining (FPM) has been applied successfully in business and scientific data for discovering interesting association patterns, and is becoming a promising strategy in microarray gene expression analysis. We review the most relevant FPM strategies, as well as surrounding main issues when devising efficient and practical methods for gene association analysis (GAA). We observed that, so far, scalability achieved by efficient methods does not imply biological soundness of the discovered association patterns, and vice versa. Ideally, GAA should employ a balanced mining model taking into account best practices employed by methods reviewed in this survey. Integrative approaches, in which biological knowledge plays an important role within the mining process, are becoming more reliable. |
2007 |
Rodríguez-Baena, D.; Díaz-Díaz, N.; Aguilar-Ruiz, J.; Nepomuceno-Chamorro, I. A. Discovering alpha–Patterns from Gene Expression Data Conference Intelligent Data Engineering and Automated Learning - IDEAL 2007, 2007, ISBN: 978-3-540-77226-2. Abstract | Links | BibTeX | Tags: Gene expression analysis @conference{Rodríguez-Baena2007, The biclustering techniques have the purpose of finding subsets of genes that show similar activity patterns under a subset of conditions. In this paper we characterize a specific type of pattern, that we have called ?–pattern, and present an approach that consists in a new biclustering algorithm specifically designed to find ?–patterns, in which the gene expression values evolve across the experimental conditions showing a similar behavior inside a band that ranges from 0 up to a pre–defined threshold called ?. The ? value guarantees the co–expression among genes. We have tested our method on the Yeast dataset and compared the results to the biclustering algorithms of Cheng & Church (2000) and Aguilar & Divina (2005). Results show that the algorithm finds interesting biclusters, grouping genes with similar behaviors and maintaining a very low mean squared residue. |