Paper The following article is Open access

Analysis of genetic association in Listeria and Diabetes using Hierarchical Clustering and Silhouette Index

, , , and

Published under licence by IOP Publishing Ltd
, , Citation Inti A. Pagnuco et al 2016 J. Phys.: Conf. Ser. 705 012002 DOI 10.1088/1742-6596/705/1/012002

1742-6596/705/1/012002

Abstract

It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, where significative groups of genes are defined based on some criteria. This task is usually performed by clustering algorithms, where the whole family of genes, or a subset of them, are clustered into meaningful groups based on their expression values in a set of experiment.

In this work we used a methodology based on the Silhouette index as a measure of cluster quality for individual gene groups, and a combination of several variants of hierarchical clustering to generate the candidate groups, to obtain sets of co-expressed genes for two real data examples. We analyzed the quality of the best ranked groups, obtained by the algorithm, using an online bioinformatics tool that provides network information for the selected genes.

Moreover, to verify the performance of the algorithm, considering the fact that it doesn't find all possible subsets, we compared its results against a full search, to determine the amount of good co-regulated sets not detected.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/705/1/012002