Search results for "algorithm."
showing 10 items of 4617 documents
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
2016
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record, Jorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstract] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…
Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data
2016
Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After obtaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…
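For readers unfamiliar with the two data structures named in the abstract above, the following is a minimal sketch of how a suffix array and a BWT relate to each other. It is a naive textbook construction, not the progressive, space-efficient algorithm of ParaBWT; the function names and the "$" end-of-text sentinel are assumptions made for illustration.

```python
def suffix_array(text):
    # Naive construction: sort suffix start positions by the suffix itself.
    # Quadratic memory in the worst case; fine for a toy string only.
    return sorted(range(len(text)), key=lambda i: text[i:])


def bwt_from_suffix_array(text, sa):
    # The i-th BWT character is the one that precedes the i-th smallest
    # suffix. For the suffix starting at 0, index -1 wraps to the sentinel.
    return "".join(text[i - 1] for i in sa)


# "$" is an assumed sentinel that sorts before every other character.
text = "banana$"
sa = suffix_array(text)
bwt = bwt_from_suffix_array(text, sa)
print(sa)   # [6, 5, 3, 1, 0, 4, 2]
print(bwt)  # annb$aa
```

The sketch builds the suffix array first and derives the BWT from it; the abstract describes the opposite order (BWT first, suffix array afterwards), which is part of what keeps its memory footprint small.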
Deep learning models for bacteria taxonomic classification of metagenomic data.
2018
Background: An open challenge in translational bioinformatics is the analysis of sequenced metagenomes from various environmental samples. Several studies have demonstrated that the 16S ribosomal RNA can be considered a barcode for bacteria classification at the genus level, but it remains hard to identify the correct composition of metagenomic data from RNA-seq short-read data. 16S short-read data are generated using two next-generation sequencing technologies, i.e. whole genome shotgun (WGS) and amplicon (AMP); typically, the former is filtered to obtain short-reads belonging to a 16S shotgun (SG), whereas the latter takes into account only some specific 16S hypervariable regions.…
Identification of transcribed protein coding sequence remnants within lincRNAs
2018
Abstract: Long intergenic non-coding RNAs (lincRNAs) are non-coding transcripts >200 nucleotides long that do not overlap protein-coding sequences. Importantly, such elements are known to be tissue-specifically expressed and to play a widespread role in gene regulation across thousands of genomic loci. However, very little is known of the mechanisms for the evolutionary biogenesis of these RNA elements, especially given their poor conservation across species. It has been proposed that lincRNAs might arise from pseudogenes. To test this systematically, we developed a novel method that searches for remnants of protein-coding sequences within lincRNA transcripts; the hypothesis is that we can t…
mD3DOCKxb: An Ultra-Scalable CPU-MIC Coordinated Virtual Screening Framework
2017
Molecular docking is an important method in computational drug discovery. In large-scale virtual screening, millions of small drug-like molecules (chemical compounds) are compared against a designated target protein (receptor). Depending on the docking algorithm used for screening, this can take several weeks on conventional HPC systems. However, for certain applications, including large-scale screening tasks for newly emerging infectious diseases, such long runtimes can be prohibitive. In this paper, we investigate how the massively parallel neo-heterogeneous architecture of the Tianhe-2 supercomputer, consisting of thousands of nodes comprising CPUs and MIC coprocessors that can effic…
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters
2016
Computing alignments between two or more sequences is a common operation in computational molecular biology. The continuing growth of biological sequence databases establishes the need for efficient parallel implementations of these operations on modern accelerators. This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data par…
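As background for the abstract above, here is a minimal, sequential sketch of the Smith-Waterman local alignment score that such database scans compute for each query/subject pair. It uses a linear gap penalty and keeps only two rows of the dynamic-programming matrix; the scoring values and the function name are assumptions chosen for illustration, and none of the paper's cluster-, thread-, or vector-level parallelization is shown.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Return the best local alignment score of sequences a and b.

    Scoring parameters are illustrative defaults, not those of the paper.
    """
    prev = [0] * (len(b) + 1)  # previous row of the DP matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Extend the diagonal with a match or mismatch.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("ACGT", "ACGT"))  # 8
print(smith_waterman_score("AAAA", "TTTT"))  # 0
```

Because each pairwise score is independent of the others, a database scan can be distributed by assigning different subject sequences to different workers, which is the kind of data parallelism the abstract refers to at the cluster level.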
The use of morphokinetic as a predictor of implantation.
2017
In recent years, efforts to improve laboratory outcomes have focused mostly on the search for additional markers of embryo quality to complement present embryo selection criteria. Time-lapse systems offer an alternative tool in assisted reproduction techniques, able to improve embryo selection through a dynamic and interactive approach. Whereas standard embryo assessment relies on a subjective and static morphology evaluation, which reduces the information gained for embryo selection, time-lapse technology adds several morphokinetic parameters, providing additional input for embryo evaluation. This further information represents a challenge f…
Quantitative Assessment of Eye Phenotypes for Functional Genetic Studies Using Drosophila melanogaster
2016
Abstract: About two-thirds of the vital genes in the Drosophila genome are involved in eye development, making the fly eye an excellent genetic system to study cellular function and development, neurodevelopment/degeneration, and complex diseases such as cancer and diabetes. We developed a novel computational method, implemented as Flynotyper software (http://flynotyper.sourceforge.net), to quantitatively assess the morphological defects in the Drosophila eye resulting from genetic alterations affecting basic cellular and developmental processes. Flynotyper utilizes a series of image processing operations to automatically detect the fly eye and the individual ommatidium, and calculates a phen…
Environmental epigenetics in zebrafish
2017
Abstract: It is widely accepted that the epigenome can act as the link between environmental cues, both external and internal to the organism, and phenotype by converting the environmental stimuli to phenotypic responses through changes in gene transcription outcomes. Environmental stress endured by individual organisms can also enforce epigenetic variations in offspring that had never experienced it directly, which is termed transgenerational inheritance. To date, research in the environmental epigenetics discipline has used a wide range of both model and non-model organisms to elucidate the various epigenetic mechanisms underlying the adaptive response to environmental stimuli. In this rev…
Everolimus as first line therapy for pancreatic neuroendocrine tumours: current knowledge and future perspectives
2017
Purpose: Everolimus has been shown to be effective for advanced pancreatic neuroendocrine tumours (pNETs), but its positioning in the therapeutic algorithm for pNETs is a matter of debate. Methods: To shed light on this point, we performed an up-to-date critical review taking into account the results of both retrospective and prospective published studies, and the recommendations of international guidelines. In addition, we performed an extensive search of Clinical Trial Registries databases worldwide to gather information on the ongoing clinical trials related to this specific topic. Results: We identified eight retrospective published studies, two prospective published studies…