6533b7d7fe1ef96bd12683ea

RESEARCH PRODUCT

Hub-Centered Gene Network Reconstruction Using Automatic Relevance Determination

Lars KaderaliLars KaderaliStefan KramerStefan KramerMatthias BockSoichi OgishimaHiroshi Tanaka

subject

Dynamic network analysisTranscription GeneticMicroarraysSciencePosterior probabilityGene regulatory networkBiologycomputer.software_genreBioinformaticsNetwork topology03 medical and health sciences0302 clinical medicineYeastsGeneticsComputer SimulationGene Regulatory NetworksGene NetworksBiology030304 developmental biologyRegulatory NetworksHyperparameter0303 health sciencesMultidisciplinaryModels GeneticSystems BiologyQuantitative Biology::Molecular NetworksCell CycleQRComputational BiologyBayesian networkGene Expression RegulationROC CurveMedicineData miningcomputerAlgorithms030217 neurology & neurosurgeryCombinatorial explosionBiological networkResearch Article

description

Network inference deals with the reconstruction of biological networks from experimental data. A variety of different reverse engineering techniques are available; they differ in the underlying assumptions and mathematical models used. One common problem for all approaches stems from the complexity of the task, due to the combinatorial explosion of different network topologies for increasing network size. To handle this problem, constraints are frequently used, for example on the node degree, number of edges, or constraints on regulation functions between network components. We propose to exploit topological considerations in the inference of gene regulatory networks. Such systems are often controlled by a small number of hub genes, while most other genes have only limited influence on the network's dynamic. We model gene regulation using a Bayesian network with discrete, Boolean nodes. A hierarchical prior is employed to identify hub genes. The first layer of the prior is used to regularize weights on edges emanating from one specific node. A second prior on hyperparameters controls the magnitude of the former regularization for different nodes. The net effect is that central nodes tend to form in reconstructed networks. Network reconstruction is then performed by maximization of or sampling from the posterior distribution. We evaluate our approach on simulated and real experimental data, indicating that we can reconstruct main regulatory interactions from the data. We furthermore compare our approach to other state-of-the art methods, showing superior performance in identifying hubs. Using a large publicly available dataset of over 800 cell cycle regulated genes, we are able to identify several main hub genes. Our method may thus provide a valuable tool to identify interesting candidate genes for further study. Furthermore, the approach presented may stimulate further developments in regularization methods for network reconstruction from data.

10.1371/journal.pone.0035077http://dx.doi.org/10.1371/journal.pone.0035077