Publications
2011 |
Díaz-Díaz, N.; Gómez-Vela, F.; Aguilar-Ruiz, J.; García-Gutiérrez, J. Gene-gene interaction based clustering method for microarray data Conference 2011 11th International Conference on Intelligent Systems Design and Applications, 2011, ISSN: 2164-7151. Abstract | Links | BibTeX | Tags: Clustering @conference{Díaz-Díaz2011b, In this paper, we propose a greedy clustering algorithm to identify groups of related genes and a new measure to improve the results of this algorithm. Clustering algorithms analyze genes in order to group those with similar behavior. Instead, our approach groups pairs of genes that present similar positive and/or negative interactions. In order to avoid noise in clusters, we apply a threshold, the neighbouring minimun index(?), to know if a pair of genes have interaction enough or not. The algorithm allows the researcher to modify all the criteria: discretization mapping function, gene-gene mapping function and filtering function, and even the neighbouring minimun index, and provides much flexibility to obtain clusters based on the level of precision needed. We have carried out a deep experimental study in databases to obtain a good neighbouring minimun index, ?. The performance of our approach is experimentally tested on the yeast, yeast cell-cycle and malaria datasets. The final number of clusters has a very high level of customization and genes within show a significant level of cohesion, as it is shown graphically in the experiments. |
Gómez-Vela, F.; Martínez-Álvarez, F.; Barranco, C. D.; Díaz-Díaz, N.; Rodríguez-Baena, D.; Aguilar-Ruiz, J. Pattern Recognition in Biological Time Series Journal Article In: Advances in Artificial Intelligence, pp. 164-172, 2011, ISBN: 978-3-642-25274-7. Abstract | Links | BibTeX | Tags: Biclustering, Clustering, Gene Network @article{Gómez-Vela2011b, Knowledge extraction from gene expression data has been one of the main challenges in the bioinformatics field during the last few years. In this context, a particular kind of data, data retrieved in a temporal basis (also known as time series), provide information about the way a gene can be expressed during time. This work presents an exhaustive analysis of last proposals in this area, particularly focusing on those proposals using non--supervised machine learning techniques (i.e. clustering, biclustering and regulatory networks) to find relevant patterns in gene expression. |
2003 |
Aguilar-Ruiz, J.; Rodríguez-Baena, D.; Cohen, P. R.; Riquelme, J. C. Clustering Main Concepts from e-Mails Conference Current Topics in Artificial Intelligence, 2003, ISBN: 978-3-540-25945-9. Abstract | Links | BibTeX | Tags: Clustering @conference{Aguilar-Ruiz2003, E–mail is one of the most common ways to communicate, assuming, in some cases, up to 75% of a company’s communication, in which every employee spends about 90 minutes a day in e–mail tasks such as filing and deleting. This paper deals with the generation of clusters of relevant words from E–mail texts. Our approach consists of the application of text mining techniques and, later, data mining techniques, to obtain related concepts extracted from sent and received messages. We have developed a new clustering algorithm based on neighborhood, which takes into account similarity values among words obtained in the text mining phase. The potential of these applications is enormous and only a few companies, mainly large organizations, have invested in this project so far, taking advantage of employees’s knowledge in future decisions. |