Search results for "cluster analysis."
showing 10 items of 805 documents
The latent geometry of the human protein interaction network
2017
Abstract Motivation A series of recently introduced algorithms and models advocates for the existence of a hyperbolic geometry underlying the network representation of complex systems. Since the human protein interaction network (hPIN) has a complex architecture, we hypothesized that uncovering its latent geometry could ease challenging problems in systems biology, translating them into measuring distances between proteins. Results We embedded the hPIN to hyperbolic space and found that the inferred coordinates of nodes capture biologically relevant features, like protein age, function and cellular localization. This means that the representation of the hPIN in the two-dimensional hyperboli…
Inhabiting plant roots, nematodes, and truffles—polyphilus, a new helotialean genus with two globally distributed species
2018
Fungal root endophytes, including the common group of dark septate endophytes (DSEs), represent different taxonomic groups and potentially diverse life strategies. In this study, we investigated two unidentified helotialean lineages found previously in a study of DSE fungi of semiarid grasslands, from several other sites, and collected recently from a pezizalean truffle ascoma and eggs of the cereal cyst nematode Heterodera filipjevi. The taxonomic positions and phylogenetic relationships of 21 isolates with different hosts and geographic origins were studied in detail. Four loci, namely, nuc rDNA ITS1-5.8S-ITS2 (internal transcribed spacer [ITS]), partial 28S nuc rDNA (28S), partial 18S nu…
Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs
2017
Detecting higher-order epistatic interactions in Genome-Wide Association Studies (GWAS) remains a challenging task in the fields of genetic epidemiology and computer science. A number of algorithms have recently been proposed for epistasis discovery. However, they suffer from a high computational cost since statistical measures have to be evaluated for each possible combination of markers. Hence, many algorithms use additional filtering stages discarding potentially non-interacting markers in order to reduce the overall number of combinations to be examined. Among others, Mutual Information Clustering (MIC) is a common pre-processing filter for grouping markers into partitions using K-Means…
Co-regulation of paralog genes in the three-dimensional chromatin architecture.
2016
Paralog genes arise from gene duplication events during evolution, which often lead to similar proteins that cooperate in common pathways and in protein complexes. Consequently, paralogs show correlation in gene expression whereby the mechanisms of co-regulation remain unclear. In eukaryotes, genes are regulated in part by distal enhancer elements through looping interactions with gene promoters. These looping interactions can be measured by genome-wide chromatin conformation capture (Hi-C) experiments, which revealed self-interacting regions called topologically associating domains (TADs). We hypothesize that paralogs share common regulatory mechanisms to enable coordinated expression acco…
Unexpected associated microalgal diversity in the lichen Ramalina farinacea is uncovered by pyrosequencing analyses
2017
The current literature reveals that the intrathalline coexistence of multiple microalgal taxa in lichens is more common than previously thought, and additional complexity is supported by the coexistence of bacteria and basidiomycete yeasts in lichen thalli. This replaces the old paradigm that lichen symbiosis occurs between a fungus and a single photobiont. The lichen Ramalina farinacea has proven to be a suitable model to study the multiplicity of microalgae in lichen thalli due to the constant coexistence of Trebouxia sp. TR9 and T. jamesii in long-distance populations. To date, studies involving phycobiont diversity within entire thalli are based on Sanger sequencing, but this method see…
Detection of temporal clusters of health care-associated infections or colonizations with Pseudomonas aeruginosa.
2016
International audience; We investigated temporal clusters of Pseudomonas aeruginosa cases between 2005 and 2014 in 1 French university hospital, overall and by ward, using the Kulldorff method. Clusters of positive water samples were also investigated at the whole hospital level. Our results suggest that water outlets are not closely involved in the occurrence of clusters of P aeruginosa cases.
Low-cost scalable discretization, prediction and feature selection for complex systems
2019
The introduced data-driven tool allows simultaneous feature selection, model inference, and marked cost and quality gains.
Comparison of conventional descriptive analysis and a citation frequency-based descriptive method for odor profiling: An application to Burgundy Pino…
2010
International audience; The limitations of intensity scoring when describing the odor characteristics of a complex product have been documented in the literature. In the present work, the odor properties of 12 Burgundy Pinot noir wines were described by two independent panels performing, respectively, an intensity-based (conventional descriptive analysis) and a citation frequency-based method. Methods were compared according to three criteria: similarity of the sensory maps, control of panel performance and practical aspects. Intensity scoring and citation frequency data were analyzed, respectively, by Principal Components Analysis (PCA) and Correspondence Analysis (CA) followed by Hierarch…
A Clustering approach for profiling LoRaWAN IoT devices
2019
Internet of Things (IoT) devices are starting to play a predominant role in our everyday life. Application systems like Amazon Echo and Google Home allow IoT devices to answer human requests, or trigger some alarms and perform suitable actions. In this scenario, any data information, related device and human interaction are stored in databases and can be used for future analysis and improve the system functionality. Also, IoT information related to the network level (wireless or wired) may be stored in databases and can be processed to improve the technology operation and to detect network anomalies. Acquired data can be also used for profiling operation, in order to group devices according…
Analysis of HVSR Data Using a Modified Centroid-Based Algorithm for Near-Surface Geological Reconstruction
2022
Recently, the use of microtremor techniques for subsoil investigation has increased significantly. The HVSR (Horizontal to Vertical Spectral Ratio) technique allows, in many cases, to obtain a seismo-stratigraphic reconstruction of the subsoil and to identify areas with similar seismic behavior. However, the stratigraphic interpretation of the HVSR peaks still remains a subjective choice and linked to a priori information. A non-hierarchical centroid-based algorithm was modified to group HVSR peaks of different measurements that can be attributed to the same generating seismic discontinuity. Some tests performed have shown that the proposed algorithm produces valid results even in the absen…