The gCluster algorithm is a general clustering method that predicts clusters of any biological word, or of any combination of words, relying only on the DNA sequence and on statistical significance. When CG is used as the word, gCluster works similarly to CpGcluster [1], our method for predicting CpG islands. More broadly, gCluster has much in common with WordCluster [2] but uses an improved distance model.
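
As a rough illustration of the distance-based idea shared by this family of methods, the Python sketch below locates every occurrence of a word in a sequence and groups neighbouring occurrences that lie close together. It is not the gCluster implementation: the function names, the fixed `max_gap` threshold and the `min_words` cutoff are assumptions made for the example, and both the distance model and the statistical-significance step are left out.

```python
from typing import List, Tuple


def word_positions(sequence: str, word: str) -> List[int]:
    """Return the 0-based start of every occurrence of `word` (overlaps allowed)."""
    sequence, word = sequence.upper(), word.upper()
    positions = []
    start = sequence.find(word)
    while start != -1:
        positions.append(start)
        start = sequence.find(word, start + 1)
    return positions


def cluster_by_distance(
    positions: List[int], max_gap: int, min_words: int = 2
) -> List[Tuple[int, int, int]]:
    """Group consecutive positions whose spacing is at most `max_gap`.

    `max_gap` and `min_words` are hypothetical parameters for this sketch.
    Each cluster is returned as (first_start, last_start, word_count).
    """
    clusters: List[Tuple[int, int, int]] = []
    if not positions:
        return clusters
    first = prev = positions[0]
    count = 1
    for pos in positions[1:]:
        if pos - prev <= max_gap:
            count += 1  # still inside the current cluster
        else:
            if count >= min_words:
                clusters.append((first, prev, count))
            first, count = pos, 1  # start a new candidate cluster
        prev = pos
    if count >= min_words:
        clusters.append((first, prev, count))
    return clusters


if __name__ == "__main__":
    # Toy sequence: three CG words close together, a long gap, then two more.
    seq = "CGACGTTCG" + "A" * 15 + "CGCG"
    print(cluster_by_distance(word_positions(seq, "CG"), max_gap=5))
```

A fixed gap threshold is used here only to keep the example short. In practice the threshold would have to be derived from the sequence itself, and each candidate cluster would still need to be tested for statistical significance.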

[1] Hackenberg M, Previti C, Luque-Escamilla PL, Carpena P, Martínez-Aroza J, Oliver JL. CpGcluster: a distance-based algorithm for CpG-island detection. BMC Bioinformatics. 2006; 7:446.

[2] Hackenberg M, Carpena P, Bernaola-Galván P, Barturen G, Alganza AM, Oliver JL. WordCluster: detecting clusters of DNA words and genomic elements. Algorithms Mol. Biol. 2011; 6:2.