Publications
2013 |
Rodríguez-Baena, D. Extracting and validating biclusters from binary datasets Journal Article In: AI Communications, vol. 26, no. 4, pp. 417-418, 2013. Abstract | Links | BibTeX | Tags: Biclustering, Binary dataset, Biological validation @article{Rodríguez-Baena2013, This work proposes a novel algorithm to extract biclusters from binary datasets: the Bit-Pattern Biclustering Algorithm (BiBit). The selective search performed by BiBit, based on a very fast bits words processing technique, provides very satisfactory results in quality and computational cost. Besides, a new software tool, named CarGene (Characterization of Genes), that helps scientists to validate sets of genes using biological knowledge is introduced too. |
2011 |
Rodríguez-Baena, D.; Pérez-Pulido, A. J.; Aguilar-Ruiz, J. A biclustering algorithm for extracting bit-patterns from binary datasets Journal Article In: Bioinformatics, vol. 27, no. 19, pp. 2738-2745, 2011, ISSN: 1367-4803. Abstract | Links | BibTeX | Tags: Biclustering, Binary dataset @article{Rodríguez-Baena2011, Motivation: Binary datasets represent a compact and simple way to store data about the relationships between a group of objects and their possible properties. In the last few years, different biclustering algorithms have been specially developed to be applied to binary datasets. Several approaches based on matrix factorization, suffix trees or divide-and-conquer techniques have been proposed to extract useful biclusters from binary data, and these approaches provide information about the distribution of patterns and intrinsic correlations. Results: A novel approach to extracting biclusters from binary datasets, BiBit, is introduced here. The results obtained from different experiments with synthetic data reveal the excellent performance and the robustness of BiBit to density and size of input data. Also, BiBit is applied to a central nervous system embryonic tumor gene expression dataset to test the quality of the results. A novel gene expression preprocessing methodology, based on expression level layers, and the selective search performed by BiBit, based on a very fast bit-pattern processing technique, provide very satisfactory results in quality and computational cost. The power of biclustering in finding genes involved simultaneously in different cancer processes is also shown. Finally, a comparison with Bimax, one of the most cited binary biclustering algorithms, shows that BiBit is faster while providing essentially the same results. |