Search results for "Ensembl"
showing 10 items of 165 documents
Reconstructing the Phylogeny of the Human Chromosome 4 Synteny using Comparative Karyology and Genomic Data Analysis
2010
This work focuses on the evolution of the architecture of human chromosome 4 (HSA4) through the analysis of chromosomal regions that have been conserved over time, and the comparison of regions that have been involved in different rearrangements in placental lineages. As with most elements of the human genome, HSA4 is considered to be evolutionarily stable. A more detailed analysis indicates that the syntenic association has been reshuffled by a series of rearrangements, yielding different chromosomes in various taxa. In its ancestral eutherian state, HSA4 has a syntenic association with HSA8p. We investigated the complex origin of this human chromosome using three different approa…
Classes of sum-of-cisoids processes and their statistics for the modeling and simulation of mobile fading channels
2013
Published version of an article in the journal EURASIP Journal on Wireless Communications and Networking. Also available from the publisher at: http://dx.doi.org/10.1186/1687-1499-2013-125. Open access. In this paper, we present a fundamental study on the stationarity and ergodicity of eight classes of sum-of-cisoids (SOC) processes for the modeling and simulation of frequency-nonselective mobile Rayleigh fading channels. The purpose of this study is to determine which classes of SOC models enable the design of channel simulators that accurately reproduce the channel’s statistical properties without demanding information on the time origin or the time-consuming computation of an ensemble ave…
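As a rough illustration of what a sum-of-cisoids process looks like, the sketch below evaluates a sum of complex sinusoids, mu(t) = sum over n of c_n * exp(j(2*pi*f_n*t + theta_n)). The gains, Doppler frequencies, and phases are hypothetical values chosen for the example; the abstract does not specify how the eight classes parameterize them.

```python
import cmath
import math


def soc_sample(t, gains, freqs, phases):
    """Evaluate a sum-of-cisoids process at time t (seconds).

    gains, freqs (Hz), and phases (rad) are illustrative parameters,
    not values taken from the paper.
    """
    return sum(
        c * cmath.exp(1j * (2 * math.pi * f * t + theta))
        for c, f, theta in zip(gains, freqs, phases)
    )


# Hypothetical parameter set: three cisoids with equal gains.
gains = [1 / math.sqrt(3)] * 3
freqs = [-50.0, 10.0, 80.0]
phases = [0.0, 0.0, 0.0]

envelope_at_zero = abs(soc_sample(0.0, gains, freqs, phases))
single_cisoid = abs(soc_sample(0.37, [1.0], [25.0], [1.2]))
```

With all phases set to zero, the value at t = 0 is simply the sum of the gains, and a single cisoid always has a constant envelope equal to its gain.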
Improving Lossless Image Compression with Contextual Memory
2019
With the increased use of image acquisition devices, including cameras and medical imaging instruments, the amount of information ready for long term storage is also growing. In this paper we give a detailed description of the state-of-the-art lossless compression software PAQ8PX applied to grayscale image compression. We propose a new online learning algorithm for predicting the probability of bits from a stream. We then proceed to integrate the algorithm into PAQ8PX…
ADME Prediction with KNIME: Development and Validation of a Publicly Available Workflow for the Prediction of Human Oral Bioavailability.
2020
In silico prediction of human oral bioavailability is a relevant tool for the selection of potential drug candidates and for the rejection of those molecules with less probability of success during the early stages of drug discovery and development. However, the high variability and complexity of oral bioavailability and the limited experimental data in the public domain have mainly restricted the development of reliable in silico models to predict this property from the chemical structure. In this study we present a KNIME automated workflow to predict human oral bioavailability of new drug and drug-like molecules based on five machine learning approaches combined into an ensemble model. Th…
CovSel
2018
Ensemble methods combine the predictions of a set of models to reach a better prediction quality compared to a single model's prediction. The ensemble process consists of three steps: 1) the generation phase where the models are created, 2) the selection phase where a set of possible ensembles is composed and one is selected by a selection method, 3) the fusion phase where the individual models' predictions of the selected ensemble are combined to an ensemble's estimate. This paper proposes CovSel, a selection approach for regression problems that ranks ensembles based on the coverage of adequately estimated training points and selects the ensemble with the highest coverage to be used in th…
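The three steps named in this abstract (generation, selection, fusion) can be sketched as follows. This is a minimal illustration, not the CovSel algorithm itself: it assumes that a training point counts as "adequately estimated" when at least one ensemble member predicts it within a tolerance `tol`, and that fusion is a plain average. Both choices, and all names and data below, are hypothetical.

```python
from itertools import combinations
from statistics import mean


def coverage(member_preds, targets, tol):
    """Fraction of training points that at least one member estimates within tol."""
    covered = sum(
        any(abs(preds[i] - y) <= tol for preds in member_preds)
        for i, y in enumerate(targets)
    )
    return covered / len(targets)


def select_ensemble(preds_by_model, targets, size, tol):
    """Selection step: rank all ensembles of the given size by coverage, keep the best."""
    candidates = combinations(sorted(preds_by_model), size)
    return max(
        candidates,
        key=lambda names: coverage([preds_by_model[n] for n in names], targets, tol),
    )


def fuse(member_preds):
    """Fusion step (assumed here to be an unweighted mean per data point)."""
    return [mean(column) for column in zip(*member_preds)]


# Generation step, stubbed with made-up training predictions from three models.
targets = [1.0, 2.0, 3.0, 4.0]
preds_by_model = {
    "m1": [1.0, 2.1, 5.0, 6.0],
    "m2": [3.0, 4.0, 3.1, 4.1],
    "m3": [1.1, 2.5, 5.5, 7.0],
}

best = select_ensemble(preds_by_model, targets, size=2, tol=0.2)
best_coverage = coverage([preds_by_model[n] for n in best], targets, tol=0.2)
fused = fuse([preds_by_model[n] for n in best])
```

In this toy data, "m1" is accurate on the first two points and "m2" on the last two, so the pair covers every training point and is selected.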
Evaluation of Ensemble Machine Learning Methods in Mobile Threat Detection
2017
The rapidly growing use of mobile devices continues to soar, causing a massive increase in cyber security threats. The most pervasive threats include ransomware, banking malware, and premium SMS fraud. Solitary hackers use tailored techniques to avoid detection by traditional antivirus software. The emerging need is to detect these threats with a flow-based network solution. Therefore, we propose and evaluate a network-based model which uses ensemble Machine Learning (ML) methods to identify mobile threats by analyzing the network flows of the malware communication. The ensemble ML methods not only protect against over-fitting of the model but also cope with the issues related to the changing be…
Diversity in random subspacing ensembles
2004
Ensembles of learnt models constitute one of the main current directions in machine learning and data mining. It was shown experimentally and theoretically that in order for an ensemble to be effective, it should consist of classifiers having diversity in their predictions. A number of ways are known to quantify diversity in ensembles, but little research has been done about their appropriateness. In this paper, we compare eight measures of the ensemble diversity with regard to their correlation with the accuracy improvement due to ensembles. We conduct experiments on 21 data sets from the UCI machine learning repository, comparing the correlations for random subspacing ensembles with diffe…
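To make "diversity in predictions" concrete, the sketch below computes the pairwise disagreement measure, one commonly used diversity statistic. Whether it is among the eight measures compared in this paper is an assumption; the correctness vectors are made-up data.

```python
from itertools import combinations


def disagreement(correct_a, correct_b):
    """Fraction of instances on which exactly one of two classifiers is correct."""
    differ = sum(a != b for a, b in zip(correct_a, correct_b))
    return differ / len(correct_a)


def mean_pairwise_disagreement(correct_by_model):
    """Average the disagreement over all unordered pairs of ensemble members."""
    pairs = list(combinations(correct_by_model, 2))
    return sum(disagreement(a, b) for a, b in pairs) / len(pairs)


# Hypothetical per-instance correctness (True = correct) for three classifiers.
oracle_outputs = [
    [True, True, False, True],
    [True, False, True, True],
    [False, True, True, True],
]

diversity = mean_pairwise_disagreement(oracle_outputs)
```

A value of 0 means the members err on exactly the same instances; larger values mean their errors overlap less.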
Computerized Attention Training Program and Vocal Ensemble Classes – means of Adolescent Attention Focusing Ability Development
2015
Nowadays, adolescents encounter difficulties focusing on particular, effective and long-term activities. These difficulties depend on the developmental regularities of their age group. The aim of the research is to evaluate computer attention training software in comparison with vocal ensemble classes with respect to the development of adolescents' ability to focus attention. The participants, 24 adolescents (both sexes, average age 14 ± 0.87 years), were divided into three groups: experimental group A (EGA), experimental group B (EGB) and a control group (KG). Two methods of developing adolescents' attention-focusing skills were tested: the computer software package CogniPlus (Schuhfried, Austria) was ap…
Handling local concept drift with dynamic integration of classifiers : domain of antibiotic resistance in nosocomial infections
2006
In the real world concepts and data distributions are often not stable but change with time. This problem, known as concept drift, complicates the task of learning a model from data and requires special approaches, different from commonly used techniques, which treat arriving instances as equally important contributors to the target concept. Among the most popular and effective approaches to handle concept drift is ensemble learning, where a set of models built over different time periods is maintained and the best model is selected or the predictions of models are combined. In this paper we consider the use of an ensemble integration technique that helps to better handle concept drift at t…
Phase transitions in nonadditive hard disc systems: a Gibbs ensemble Monte Carlo Study
2007
We study the properties of a model fluid in two dimensions with Gibbs ensemble Monte Carlo (GEMC) techniques. In particular, we analyze the entropy-driven phase separation in the case of a nonadditive symmetric hard disc fluid. By combining GEMC with finite size scaling techniques, we locate the critical line of nonadditivities as a function of the system density, which separates the mixing and demixing regions, and compare it with a simple analytical approximation.