
RESEARCH PRODUCT

MGFM: a novel tool for detection of tissue and cell specific marker genes from microarray gene expression data

Miguel A. Andrade-Navarro, Fritz Lekschas, Harald Stachelscheid, Andreas Kurtz, Khadija El Amrani

subject

Genetic Markers; Cancer Research; Microarrays; Biology; Marker genes; Web Browser; Proteomics; Marker gene; Bioconductor; Genetics; Gene; Genetic Association Studies; Microarray analysis techniques; Methodology Article; Gene Expression Profiling; Computational Biology; Reproducibility of Results; 3. Good health; Gene expression profiling; Samples; Gene Ontology; Genetic marker; Organ Specificity; DNA microarray; Biotechnology

description

Background: Identification of marker genes associated with a specific tissue or cell type is a fundamental challenge in genetic and cell research. Marker genes are of great importance for determining cell identity and for understanding tissue-specific gene function and the molecular mechanisms underlying complex diseases.

Results: We have developed a new bioinformatics tool called MGFM (Marker Gene Finder in Microarray data) to predict marker genes from microarray gene expression data. Marker genes are identified through the grouping of samples of the same type with similar marker gene expression levels. We verified our approach using two microarray data sets from the NCBI’s Gene Expression Omnibus public repository encompassing samples for similar sets of five human tissues (brain, heart, kidney, liver, and lung). Comparison with another tool for tissue-specific gene identification, together with validation against established tissue markers from the literature, demonstrated the functionality, accuracy and simplicity of our tool. Furthermore, top-ranked marker genes were experimentally validated by reverse transcriptase-polymerase chain reaction (RT-PCR). The sets of predicted marker genes associated with the five selected tissues comprised well-known genes of particular importance in these tissues. The tool is freely available from the Bioconductor web site, and it is also provided as an online application integrated into the CellFinder platform (http://cellfinder.org/analysis/marker).

Conclusions: MGFM is a useful tool to predict tissue/cell type marker genes using microarray gene expression data. The implementation of the tool as an R package as well as an application within CellFinder facilitates its use.

Electronic supplementary material: The online version of this article (doi:10.1186/s12864-015-1785-9) contains supplementary material, which is available to authorized users.
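
The Results state that marker genes are identified by grouping samples of the same type that show similar expression levels. The Python sketch below illustrates one way such a grouping could work; it is not the MGFM R package or its API. It rests on assumptions not stated in the abstract: the function name `find_markers`, the rule that a gene counts as a candidate only when its highest-expressed samples are exactly the replicates of one tissue, the ratio-based score, and the toy gene names and values, which are all hypothetical.

```python
from collections import Counter


def find_markers(expression, sample_types):
    """Return (gene, tissue, score) tuples, best (lowest) score first.

    expression:   dict mapping gene name -> list of expression values,
                  one value per sample.
    sample_types: list giving the tissue label of each sample.

    Assumed rule (not taken from the source): a gene is a candidate
    marker for a tissue if the samples with the highest expression are
    exactly the replicates of that tissue.
    """
    type_counts = Counter(sample_types)
    markers = []
    for gene, values in expression.items():
        # Rank sample indices from highest to lowest expression.
        order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
        top_type = sample_types[order[0]]
        k = type_counts[top_type]
        if k == len(values):
            continue  # only one tissue present, nothing to contrast with
        # Require the k highest samples to all belong to the same tissue.
        if any(sample_types[i] != top_type for i in order[:k]):
            continue
        lowest_in_group = values[order[k - 1]]
        highest_outside = values[order[k]]
        if lowest_in_group <= 0:
            continue  # ratio score is undefined for non-positive values
        # Assumed score: smaller means a wider gap between the groups.
        markers.append((gene, top_type, highest_outside / lowest_in_group))
    markers.sort(key=lambda m: m[2])
    return markers


# Hypothetical toy data: two replicates each of three tissues.
sample_types = ["liver", "liver", "brain", "brain", "heart", "heart"]
expression = {
    "GENE_A": [12.0, 11.5, 3.0, 3.2, 2.9, 3.1],  # high only in liver
    "GENE_B": [4.0, 4.1, 10.0, 9.0, 4.5, 4.2],   # high only in brain
    "GENE_C": [8.0, 3.0, 7.5, 3.1, 7.9, 3.0],    # high in mixed tissues
}
result = find_markers(expression, sample_types)
for gene, tissue, score in result:
    print(gene, tissue, round(score, 3))
```

In this toy input, GENE_C is rejected because its two highest values come from different tissues, while GENE_A and GENE_B are kept and ranked by how cleanly their tissue separates from the remaining samples.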

10.1186/s12864-015-1785-9
http://dx.doi.org/10.1186/s12864-015-1785-9